## General

Overview pages, getting started guides, and general documentation.

---
title: Alerts and Notifications
source: https://docs.snowflake.com/en/guides-overview-alerts.md
section: General
---

# Alerts and Notifications

You can use Snowflake alerts to send notifications and perform actions automatically. In SQL, you can send a notification to an
email address or queue by calling a built-in stored procedure.

[Snowflake Alerts](user-guide/alerts.md)
:   If you need to send a notification or perform an action when data in Snowflake meets certain
    conditions, you can set up a Snowflake Alert.

    Learn how to create, configure and maintain Snowflake alerts.

[Notifications in Snowflake](user-guide/notifications/about-notifications.md)
:   You can configure Snowflake to send notifications about [Snowpipe](user-guide/data-load-snowpipe-intro.md) and
    [task](user-guide/tasks-intro.md) errors to a cloud provider queue (Amazon SNS, Microsoft Azure Event Grid, or Google
    Cloud Pub/Sub).

    You can also use a SQL statement to send a notification to an email address, a cloud provider queue, or a webhook.

    Learn how to configure Snowflake to send notifications.

---
title: API Reference
source: https://docs.snowflake.com/en/api-reference.md
section: General
---

# API Reference

These topics provide reference information for the APIs available in Snowflake.

APIs for connecting to Snowflake

| Connector / Driver / Client API | Resources |
| --- | --- |
| Go Driver | * [Developer Guide](developer-guide/golang/go-driver.md) * [API Reference](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#pkg-index) |
| JDBC Driver | * [Developer Guide](developer-guide/jdbc/jdbc.md) * [JDBC API Support Reference](developer-guide/jdbc/jdbc-api.md) |
| .NET Driver | * [Developer Guide](developer-guide/dotnet/dotnet-driver.md) * [Source code in GitHub](https://github.com/snowflakedb/snowflake-connector-net/) |
| Node.js Driver | * [Developer Guide](developer-guide/node-js/nodejs-driver.md) * [Source code in GitHub](https://github.com/snowflakedb/snowflake-connector-nodejs/) |
| ODBC Driver | * [Developer Guide](developer-guide/odbc/odbc.md) * [ODBC Driver API Support Reference](developer-guide/odbc/odbc-api.md) |
| PHP PDO Driver | * [Developer Guide](developer-guide/php-pdo/php-pdo-driver.md) * [Source code in GitHub](https://github.com/snowflakedb/pdo_snowflake/) |
| Snowflake Connector for Kafka | * [Developer Guide](user-guide/kafka-connector.md) * [Source code in GitHub](https://github.com/snowflakedb/snowflake-kafka-connector) |
| Snowflake Connector for Python | * [Developer Guide](developer-guide/python-connector/python-connector.md) * [API Reference](developer-guide/python-connector/python-connector-api.md) * [Getting Started With Python](https://quickstarts.snowflake.com/guide/getting_started_with_python/index.html?index=..%2F..index) |
| Snowflake Connector for Spark | * [Developer Guide](user-guide/spark-connector.md) * [Source code in GitHub](https://github.com/snowflakedb/spark-snowflake) |
| Snowflake Python APIs | * [Developer Guide](developer-guide/snowflake-python-api/snowflake-python-overview.md) * [API Reference](developer-guide/snowflake-python-api/reference/latest/index) |
| Snowflake REST APIs | * [Developer Guide](developer-guide/snowflake-rest-api/snowflake-rest-api.md) * [API Reference](developer-guide/snowflake-rest-api/reference) |
| Snowflake SQL API | * [Developer Guide](developer-guide/sql-api/index.md) * [API Reference](developer-guide/sql-api/reference.md) * [SQL API Playground](https://api.developers.snowflake.com/) |

APIs for extending Snowflake:

| Extensibility Feature | Resources |
| --- | --- |
| User-Defined Functions (UDFs) | * [Developer Guide](developer-guide/udf/udf-overview.md) * [Getting Started With User-Defined Functions](https://quickstarts.snowflake.com/guide/getting_started_with_user_defined_functions/index.html?index=..%2F..index) |
| Snowpark for Scala | * [Developer Guide](developer-guide/snowpark/scala/index.md) * [API Reference](developer-guide/snowpark/reference/scala/com/snowflake/snowpark/index.md) * [Getting Started With Snowpark in Scala](https://quickstarts.snowflake.com/guide/getting_started_with_snowpark_scala/index.html) |
| Snowpark for Java | * [Developer Guide](developer-guide/snowpark/java/index.md) * [API Reference](developer-guide/snowpark/reference/java/index.md) |
| Snowpark for Python | * [Developer Guide](developer-guide/snowpark/python/index.md) * [API Reference](/developer-guide/snowpark/reference/python/latest/index.md "(in Snowpark API Reference (Python))") |
| Snowflake ML for Python | * [Developer Guide](developer-guide/snowflake-ml/overview.md) * [API Reference](/developer-guide/snowpark-ml/reference/latest/index.md "(in Snowpark ML API Reference (Python))") |
| External Functions | * [Developer Guide](sql-reference/external-functions.md) |
| Stored Procedures | * [Developer Guide](developer-guide/stored-procedure/stored-procedures-overview.md) * [API Reference](developer-guide/stored-procedure/stored-procedures-api.md) |

---
title: Appendices
source: https://docs.snowflake.com/en/appendices.md
section: General
---

# Appendices

* [Notational conventions](sql-reference/conventions.md)

  > Notational conventions used in the Snowflake documentation.
* [Reserved & limited keywords](sql-reference/reserved-keywords.md)

  > List of words reserved for Snowflake SQL.

---
title: Applications and tools for connecting to Snowflake
source: https://docs.snowflake.com/en/guides-overview-connecting.md
section: General
---

# Applications and tools for connecting to Snowflake

Snowflake provides several different applications and tools that you can use to access databases in Snowflake.

> **Note:**
>
> For information about configuring clients, driver, libraries, and third-party applications to connect to Snowflake, see
> [Configuring a client, driver, library, or third-party application to connect to Snowflake](user-guide/gen-conn-config.md).

## User interface

[Snowsight: The Snowflake web interface](user-guide/ui-snowsight.md)
:   Snowsight distills Snowflake’s powerful SQL support into a unified, easy-to-use experience.
    Use Snowsight to perform your critical Snowflake operations.

## Command-line clients

[Snowflake CLI](developer-guide/snowflake-cli/index.md)
:   Use the command line to create, manage, update, and view apps running on Snowflake across workloads.

[SnowSQL (CLI client)](user-guide/snowsql.md)
:   Detailed instructions for installing, configuring, and using the Snowflake command-line client.

## Extensions for code editors

[Snowflake Extension for Visual Studio Code](user-guide/vscode-ext.md)
:   Use the Snowflake Extension for Visual Studio Code to connect to Snowflake within Visual Studio Code and perform SQL operations.

## Infrastructure as code

> **Note:**
>
> The following content is not supported by Snowflake. All code is provided “AS IS” and without warranty.

[Snowflake Terraform provider](user-guide/terraform.md)
:   Documentation and resources for the Snowflake Terraform provider.

## Drivers and libraries

[API Reference](api-reference.md)
:   Lists the drivers and APIs provided by Snowflake for writing applications that connect to Snowflake.

## Integrating with third-party systems

[Snowflake Connectors](https://other-docs.snowflake.com/connectors.html)
:   Snowflake Connectors allow you to integrate third-party applications and database systems with Snowflake.

## Third-party software

[Snowflake Ecosystem](user-guide/ecosystem.md)
:   Overview of the third-party tools and technologies, as well as the Snowflake-provided clients, in the Snowflake ecosystem.

---
title: Cost & billing
source: https://docs.snowflake.com/en/guides-overview-cost.md
section: General
---

# Cost & billing

Snowflake provides a robust framework to manage costs. You can also obtain monthly usage statements and reconcile those statements with
usage data in views.

## Cost management

[Understanding overall cost](user-guide/cost-understanding-overall.md)
:   The total cost of using Snowflake is the aggregate of the cost of using data transfer, storage, and compute resources.

    Learn about how overall cost is calculated.

[Exploring overall cost](user-guide/cost-exploring-overall.md)
:   Snowsight allows you to quickly and easily obtain information about cost from a visual dashboard.
    Queries against the usage views allow you to drill down into cost data and can help generate custom reports and dashboards.

    Learn about exploring your spend using various queries to return cost information.

[Optimizing cost](user-guide/cost-optimize.md)
:   Learn how to optimize Snowflake in order to reduce costs and maximize your spend.

[Attributing cost](user-guide/cost-attributing.md)
:   Gain insight into Snowflake cost by attributing those costs to logical units within the organization such as departments, environments or
    other entities.

    Learn how to attribute cost to differing entities within your organization.

[Controlling cost](user-guide/cost-controlling.md)
:   Cost controls allow you to limit how much is spent on various services such as virtual warehouses.

    [Budgets](user-guide/budgets.md) allow you to monitor the credit usage of supported objects and serverless features in your account.
    [Resource monitors](user-guide/resource-monitors.md) allow you to monitor credit usage by user-managed virtual warehouses and the
    cloud services layer of the Snowflake architecture.

## Billing

[Access a billing usage statement](user-guide/billing-usage-statement.md)
:   Learn how to use Snowsight to view and download monthly usage statements.

[Reconcile a billing usage statement](user-guide/billing-reconcile.md)
:   Learn how to execute queries to reconcile usage data shown on a usage statement with data in the billing views of the Organization Usage
    schema.

[Update billing contact information](user-guide/billing-contacts.md)
:   Learn how to use Snowsight to update billing contact information.

---
title: Data Governance in Snowflake
source: https://docs.snowflake.com/en/guides-overview-govern.md
section: General
---

# Data Governance in Snowflake

Snowflake provides industry-leading features that ensure the highest levels of governance for your account and users, as well as all the data you store and access in Snowflake.

[Data Quality Monitoring and data metric functions](user-guide/data-quality-intro.md)
:   Allows the monitoring of the state and integrity of your data using system data metric functions and user-defined data metric functions.

[Column-level Security](user-guide/security-column-intro.md)
:   Allows the application of a masking policy to a column within a table or view.

[Row-level Security](user-guide/security-row-intro.md)
:   Allows the application of a row access policy to a table or view to determine which rows are visible in the query result.

[Introduction to object tagging](user-guide/object-tagging/introduction.md)
:   Allows the tracking of sensitive data for compliance, discovery, protection, and resource usage.

[Tag-based masking policies](user-guide/tag-based-masking-policies.md)
:   Allows protecting column data by assigning a masking policy to a tag and then setting the tag on a database object or the Snowflake
    account.

[Sensitive data classification](user-guide/classify-intro.md)
:   Allows categorizing potentially personal and/or sensitive data to support compliance and privacy regulations.

[Access History](user-guide/access-history.md)
:   Allows the auditing of the user access history through the Account Usage [ACCESS_HISTORY view](sql-reference/account-usage/access_history.md).

[Object Dependencies](user-guide/object-dependencies.md)
:   Allows the auditing of how one object references another object by its metadata (e.g. creating a view depends on a table name and column
    names) through the Account Usage [OBJECT_DEPENDENCIES](sql-reference/account-usage/object_dependencies.md) view.

Data Governance area in Snowsight
:   Allows using the Governance & security » Tags & policies area to monitor and report on the usage of policies and tags with
    tables, views, and columns using two different interfaces: Dashboard and Tagged Objects. For details, see:

    * [Use Snowsight to set tags](user-guide/object-tagging/work.md)
    * [Monitor tags with Snowsight](user-guide/object-tagging/monitor.md)
    * [Monitor masking policies with Snowsight](user-guide/security-column-intro.md)
    * [Monitor row access policies with Snowsight](user-guide/security-row-intro.md)

---
title: Data sharing and collaboration in Snowflake
source: https://docs.snowflake.com/en/guides-overview-sharing.md
section: General
---

# Data sharing and collaboration in Snowflake

There are many ways to share data from your Snowflake account with users in other Snowflake accounts, including collaborating with other
parties in a secure environment.

## Why share data with Snowflake

When you use Snowflake to share data as a provider, you can manage who has access to your data, and avoid challenges
keeping your data synchronized across different people and groups.

As a data consumer, you can reduce the data transformations you need to perform because the data stays in Snowflake, making it easy to join
datasets shared with you with your own data.

If you share your data using listings, you can include metadata with your data share, such as a title and description, and usage examples to
help consumers use the data quickly. In addition to the benefits for consumers, as a provider you get access to usage data, automatically
replicate your data to other regions, and can even decide to charge for access to your data or offer some datasets publicly
on the Snowflake Marketplace.

## Options for sharing

Listings let you share data with people in any Snowflake region, across clouds, without performing manual replication tasks.
If you use listings, you can provide additional metadata for the data that you share, view customer data usage, and for listings
offered publicly on the Snowflake Marketplace, gauge consumer interest in your listings.

If you don’t want to share data using a listing, you can use a direct share instead, see [Secure data sharing](user-guide/data-sharing-intro.md) and [Non-secure data sharing](user-guide/data-sharing-views.md). No matter which option you choose, you can share with people
who don’t have Snowflake accounts by using [Reader Accounts](user-guide/data-sharing-reader-create.md).

| Data Sharing Mechanism | Share With Whom? | Auto-fulfill Across Clouds? | Optionally Charge for Data? | Optionally Offer Data Publicly? | Get Consumer Usage Metrics? |
| --- | --- | --- | --- | --- | --- |
| Listing | One or more accounts in any region | Yes | Yes | Yes | Yes |
| Direct share | One or more accounts in your region | No | No | No | No |

If you want to manage a group of accounts, and control who can publish and consume listings in that group, consider using a Data Exchange.

## Listing

You can offer a listing privately to specific accounts, or publicly on the Snowflake Marketplace. For more about the Snowflake Marketplace, see
[About Snowflake Marketplace](collaboration/collaboration-marketplace-about.md).

After you accept the provider and consumer terms, you can start sharing and consuming data shared with you with a listing.
For more information, see [About listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about).

> **Note:**
>
> To learn more about sharing listings to or from [Virtual Private Snowflake (VPS)](user-guide/intro-editions.md),
> see [About collaboration in VPS environments](collaboration/virtual-private-snowflake/about-vps-collaboration.md).

## Direct share

Use a direct share to share data with one or more accounts in the same Snowflake region.
You don’t need to copy or move data shared with a direct share.

If you want to convert a direct share with active consumers to a listing, see [Convert a direct share to a listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing#convert-a-direct-share-to-a-private-listing).

For more information, see [Share secure database objects](user-guide/data-sharing-gs.md).

## Data Exchange

If creating listings that you offer privately to specific accounts isn’t an option, you can use a data exchange to share data with
a selected group of accounts that you invite.

You must request that a data exchange be provisioned and configured for your account, then you can invite members to the exchange
and specify whether they can consume data, provide data, or both.

For more information, see [About Data Exchange](user-guide/data-exchange.md).

## Collaborating with shared data in a secure environment

When you use listings, direct shares, and Data Exchange to share data with another party, they can directly access the data. If you want to
share data with other parties, but want to control how that data is accessed, you can use a Snowflake Data Clean Room to collaborate. The
provider who is sharing their data in a clean room defines what analyses can be run against the shared data, which allows the consumer to
gather insights from the data without having unrestricted access to it.

For more information, see [Overview of Snowflake Data Clean Rooms](user-guide/cleanrooms/overview.md).

---
title: Databases, Tables and Views - Overview
source: https://docs.snowflake.com/en/guides-overview-db.md
section: General
---

# Databases, Tables and Views - Overview

All data in Snowflake is maintained in databases. Each database consists of one or more schemas, which are logical groupings of database objects,
such as tables and views. Snowflake does not place any hard limits on the number of databases, schemas (within a database), or objects (within
a schema) you can create.

Use the following pages to learn about tables and table types, views, design considerations and other related content.

[Understanding Snowflake Table Structures](user-guide/tables-micro-partitions.md)
:   Introduction to *micro-partitions* and *data clustering*, two of the principal concepts utilized in Snowflake physical table structures.

[Temporary and Transient Tables](user-guide/tables-temp-transient.md)
:   Snowflake supports creating temporary tables for storing non-permanent, transitory data such as ETL data, session-specific
    or other short lived data.

[External Tables](user-guide/tables-external-intro.md)
:   Snowflake supports the concept of an external table. External tables are read-only, and their files are stored in an external stage.

[Hybrid Tables](user-guide/tables-hybrid.md)
:   Snowflake supports the concept of a hybrid table. Hybrid tables provide
    optimized performance for read and write operations in transactional and
    hybrid workloads.

[Apache Iceberg™ tables](user-guide/tables-iceberg.md)
:   Snowflake supports the Apache Iceberg™ open table format. Iceberg tables use data in external cloud
    storage and give you the option to use Snowflake as the Iceberg catalog, an external Iceberg catalog, or to create a table
    from files in object storage.

[Views](user-guide/views-introduction.md)
:   A view allows the result of a query to be accessed as if it were a table.
    Views serve a variety of purposes, including combining, segregating, and protecting data.

[Secure Views](user-guide/views-secure.md)
:   Snowflake supports the concept of a secure view. Secure views are specifically designed for data privacy.
    For example to limit access to sensitive data that should not be exposed to all users of the underlying table(s).

[Materialized Views](user-guide/views-materialized.md)
:   Materialized views are views precomputed from data derived from a query specification and stored for later use.
    Querying a materialized view is faster than executing a query against the base table of the view because the data is pre-computed.

[Table Design Best Practices](user-guide/table-considerations.md)
:   Best practices, general guidelines, and important considerations when designing and managing tables.

[Cloning Best Practices](user-guide/object-clone.md)
:   Best practices, general guidelines, and important considerations when cloning objects in Snowflake, particularly databases, schemas,
    and permanent tables.

[Data storage considerations](user-guide/tables-storage-considerations.md)
:   Best practices and guidelines for controlling data storage costs associated with Continuous Data Protection (CDP), particularly for tables.

---
title: Function and stored procedure reference
source: https://docs.snowflake.com/en/sql-reference-functions.md
section: General
---

# Function and stored procedure reference

These topics provide reference information for the system-defined functions and system-defined stored procedures.

* [Summary of functions](sql-reference/intro-summary-operators-functions.md) — combined summary of all system-defined functions. Can be used as a
  quick-reference.
* [All functions (alphabetical)](sql-reference/functions-all.md) — alphabetical list of all system-defined functions (scalar, aggregate, table, etc.).
* [Aggregate functions](sql-reference/functions-aggregation.md) — functions that take multiple rows/values as input and return a single value.
* [Scalar functions](sql-reference/functions.md) — functions that take a single row/value as input and return a single value:

  + [Bitwise expression functions](sql-reference/expressions-byte-bit.md)
  + [Conditional expression functions](sql-reference/expressions-conditional.md)
  + [Context functions](sql-reference/functions-context.md)
  + [Conversion functions](sql-reference/functions-conversion.md)
  + [Data generation functions](sql-reference/functions-data-generation.md)
  + [Date & time functions](sql-reference/functions-date-time.md)
  + [Differential privacy functions](sql-reference/functions-differential-privacy.md)
  + [Encryption functions](sql-reference/functions-encryption.md)
  + [Geospatial functions](sql-reference/functions-geospatial.md)
  + [Hash functions](sql-reference/functions-hash-scalar.md)
  + [Metadata functions](sql-reference/functions-metadata.md)
  + [Notification functions](sql-reference/functions-notification.md)
  + [Numeric functions](sql-reference/functions-numeric.md)
  + [Semi-structured and structured data functions](sql-reference/functions-semistructured.md)
  + [String functions (regular expressions)](sql-reference/functions-regexp.md) — regular expression (search) functions
  + [String & binary functions](sql-reference/functions-string.md)
  + [Vector functions](sql-reference/functions-vector.md)
* [Model monitor functions](sql-reference/functions-model-monitors.md) — functions that retrieve metrics from machine learning model monitors.
* [System functions](sql-reference/functions-system.md) — functions that perform control operations or return system-level information.
* [Table functions](sql-reference/functions-table.md) — functions that return results in tabular format.
* [Window functions](sql-reference/functions-window.md) — functions that run analytic calculations, such as moving aggregations and rankings.
* [Data metric functions](sql-reference/functions-data-metric.md) — functions that enable data quality measurements for tables and views.
* [Stored procedures](sql-reference-stored-procedures.md) — stored procedures to facilitate using certain Snowflake features.

---
title: Get started with Snowflake for users
source: https://docs.snowflake.com/en/getting-started-for-users.md
section: General
---

# Get started with Snowflake for users

These topics get you started with Snowflake:

[Before you begin](user-guide/setup.md)
:   Overview of getting an account and methods for accessing Snowflake.

[Sign in to Snowflake](user-guide/connecting.md)
:   Overview of the different ways to connect to Snowflake.

[Snowflake key concepts and architecture](user-guide/intro-key-concepts.md)
:   Description of Snowflake architecture, key concepts, and features.

[Snowsight quick tour](user-guide/ui-snowsight-quick-tour.md)
:   Overview of Snowsight, Snowflake’s web-based interface.

[Overview of the data lifecycle](user-guide/data-lifecycle.md)
:   Introduces the main operations and corresponding SQL commands for getting your data into Snowflake and
    then using it to perform queries and other SQL operations.

---
title: Key concepts for Snowflake administrators
source: https://docs.snowflake.com/en/concepts-for-administrators.md
section: General
---

# Key concepts for Snowflake administrators

These topics cover key concepts related to administering Snowflake.

## Cloud platforms and regions

These topics describe the cloud infrastructure on which Snowflake runs:

[Supported cloud platforms](user-guide/intro-cloud-platforms.md)
:   Describes the cloud computing platforms on which Snowflake is offered, which include
    Amazon Web Services (AWS), Google Cloud, and Microsoft Azure.

[Supported cloud regions](user-guide/intro-regions.md)
:   Describes the different cloud platform regions in which Snowflake is offered. This topic
    helps you choose where your data is geographically stored and your compute resources
    are provisioned.

## Editions, releases, and features

[Snowflake editions](user-guide/intro-editions.md)
:   Describes the services and features that are included with each edition of Snowflake.
    This topic helps you choose the right edition for your organization.

[Snowflake releases](user-guide/intro-releases.md)
:   Describes the Snowflake release process and provides instructions for requesting 12-hour early
    access for Enterprise Edition and Business Critical Edition accounts, or 24-hour early access
    for Virtual Private Snowflake (VPS) accounts.

[Overview of key features](user-guide/intro-supported-features.md)
:   Lists the key features of Snowflake to help you decide which features you want to use.

## Security and compliance

[Continuous data protection](user-guide/data-cdp.md)
:   Introduces the features that Snowflake provides for ensuring your data is protected, secure, and available.

[Regulatory compliance](user-guide/intro-compliance.md)
:   Describes the major regulatory compliance standards Snowflake meets to ensure the highest levels of data assurance, security, and governance for data in Snowflake.

---
title: Load data into Snowflake
source: https://docs.snowflake.com/en/guides-overview-loading-data.md
section: General
---

# Load data into Snowflake

Data can be loaded into Snowflake in a number of ways.
The following topics provide an overview of data loading concepts, tasks, tools, and techniques to quick and easily load data into your Snowflake database.

[Overview of data loading](user-guide/data-load-overview.md)
:   Options available to load data into Snowflake.

[Summary of data loading features](user-guide/intro-summary-loading.md)
:   Reference of the supported features for using the [COPY INTO <table>](sql-reference/sql/copy-into-table.md) command to load data from files.

[Tutorials: Load and query data](user-guide/data-load-tutorials.md)
:   Learn how to load data by using step-by-step instructions in tutorials.

[Data loading considerations](user-guide/data-load-considerations.md)
:   Best practices, general guidelines, and important considerations for bulk data loading.

[Work with Amazon S3-compatible storage](user-guide/data-load-s3-compatible-storage.md)
:   Instructions for accessing data in other storage.

[Load data using Snowsight](user-guide/data-load-web-ui.md)
:   Instructions for loading limited amounts of data using the web interface.

[Introduction to loading semi-structured data](user-guide/semistructured-intro.md)
:   Considerations for loading semi-structured data.

[Introduction to unstructured data](user-guide/unstructured-intro.md)
:   Considerations for loading unstructured data.

[Bulk loading from a local file system](user-guide/data-load-local-file-system.md)
:   Instructions for loading data in bulk using the COPY command.

[Snowpipe](user-guide/data-load-snowpipe-intro.md)
:   Instructions for loading data continuously using Snowpipe.

[Snowpipe Streaming](user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md)
:   Instructions for loading data streams continuously using Snowpipe Streaming.

[Multi-Location Resilience for Data Pipelines](user-guide/multi-location-resilience-data-pipelines.md)
:   Guidance for resilient Snowpipe and COPY INTO data pipelines across locations.

[Transform data during a load](user-guide/data-load-transform.md)
:   Instructions for transforming data while loading it into a table using the COPY INTO command.

[Query data in staged files](user-guide/querying-stage.md)
:   Instructions on using standard SQL to query internal and external named stages.

[Query metadata for staged files](user-guide/querying-metadata.md)
:   Instructions on querying metadata in internal and external stages.

---
title: Managing Snowflake
source: https://docs.snowflake.com/en/user-guide-manage.md
section: General
---

# Managing Snowflake

These topics describes the tasks associated with using Snowflake.

* [Virtual warehouses](user-guide/warehouses.md) — Key concepts and tasks for creating and using virtual warehouses to execute queries and perform DML operations, such as loading and unloading data:

  > + [Overview of warehouses](user-guide/warehouses-overview.md)
  > + [Multi-cluster warehouses](user-guide/warehouses-multicluster.md)
  > + [Warehouse considerations](user-guide/warehouses-considerations.md)
  > + [Working with warehouses](user-guide/warehouses-tasks.md)
  > + [Using the Query Acceleration Service (QAS)](user-guide/query-acceleration-service.md)
  > + [Monitoring warehouse load](user-guide/warehouses-load-monitoring.md)
* [Databases, Tables & Views](user-guide/databases.md) — Key concepts and tasks related to understanding and working with Snowflake databases and tables:

  > + [Understanding Snowflake Table Structures](user-guide/tables-micro-partitions.md)
  > + [Working with Temporary and Transient Tables](user-guide/tables-temp-transient.md)
  > + [Introduction to external tables](user-guide/tables-external-intro.md)
  > + [Overview of Views](user-guide/views-introduction.md)
  > + [Working with Secure Views](user-guide/views-secure.md)
  > + [Working with Materialized Views](user-guide/views-materialized.md)
  > + [Table Design Considerations](user-guide/table-considerations.md)
  > + [Cloning considerations](user-guide/object-clone.md)
  > + [Data storage considerations](user-guide/tables-storage-considerations.md)
* [Query Data in Snowflake](guides-overview-queries.md) — Key concepts and tasks for executing queries in Snowflake:

  > + [Working with joins](user-guide/querying-joins.md)
  > + [Understanding How Snowflake Can Eliminate Redundant Joins](user-guide/join-elimination.md)
  > + [Working with Subqueries](user-guide/querying-subqueries.md)
  > + [Querying Hierarchical Data](user-guide/queries-hierarchical.md)
  > + [Working with CTEs (Common Table Expressions)](user-guide/queries-cte.md)
  > + [Querying Semi-structured Data](user-guide/querying-semistructured.md)
  > + [Analyzing data with window functions](user-guide/functions-window-using.md)
  > + [Identifying Sequences of Rows That Match a Pattern](user-guide/match-recognize-introduction.md)
  > + [Using Sequences](user-guide/querying-sequences.md)
  > + [Using Persisted Query Results](user-guide/querying-persisted-results.md)
  > + [Computing the Number of Distinct Values](user-guide/querying-distinct-counts.md)
  > + [Estimating Similarity of Two or More Sets](user-guide/querying-approximate-similarity.md)
  > + [Estimating Frequent Values](user-guide/querying-approximate-frequent-values.md)
  > + [Estimating Percentile Values](user-guide/querying-approximate-percentile-values.md)
  > + [Querying data using worksheets](user-guide/ui-snowsight-query.md)
  > + [Canceling Statements](user-guide/querying-cancel-statements.md)
* [Date & time data types](sql-reference/data-types-datetime.md) — Reference information and examples for working with dates, times and timestamps, and time zones in Snowflake:

  > + [Date and time input and output formats](sql-reference/date-time-input-output.md)
  > + [Working with date and time values](sql-reference/date-time-examples.md)
* [Introduction to loading semi-structured data](user-guide/semistructured-intro.md) — Key concepts and tasks for working with JSON and other types of semi-structured data:

  > + [Introduction to loading semi-structured data](user-guide/semistructured-intro.md)
  > + [Supported formats for semi-structured data](user-guide/semistructured-data-formats.md)
  > + [Considerations for semi-structured data stored in VARIANT](user-guide/semistructured-considerations.md)
  > + [Tutorial: JSON basics for Snowflake](user-guide/tutorials/json-basics-tutorial.md)
* [Introduction to unstructured data](user-guide/unstructured-intro.md) — Key concepts and tasks for working with unstructured data:

  > + [Directory tables](user-guide/data-load-dirtables.md)
  > + [REST API for unstructured data support](user-guide/data-load-unstructured-rest-api.md)
  > + [Share unstructured data with a secure view](user-guide/unstructured-data-sharing.md)
  > + [Troubleshooting processing of unstructured data](user-guide/unstructured-ts.md)
* [String & binary data types](sql-reference/data-types-text.md) — Reference information and examples for working with binary data in Snowflake:

  > + [Binary input and output](sql-reference/binary-input-output.md)
  > + [Using binary data](sql-reference/binary-examples.md)
* [Snowflake Time Travel & Fail-safe](user-guide/data-availability.md) — Key concepts and tasks for understanding how Snowflake maintains access to deleted and modified data, and also how Snowflake enables data recovery in the
  event of loss:

  > + [Understanding & using Time Travel](user-guide/data-time-travel.md)
  > + [Understanding and viewing Fail-safe](user-guide/data-failsafe.md)
  > + [Storage costs for Time Travel and Fail-safe](user-guide/data-cdp-storage-costs.md)
* [Introduction to streams and tasks](user-guide/data-pipelines-intro.md) — Key concepts and tasks for transforming and optimizing loaded data for analysis:

  > + [Introduction to streams](user-guide/streams-intro.md)
  > + [Introduction to tasks](user-guide/tasks-intro.md)
  > + [Introduction to dynamic tables](user-guide/dynamic-tables-about.md)
* [Introduction to business continuity & disaster recovery](user-guide/replication-intro.md) — Key concepts and tasks for replicating and failing over objects across multiple Snowflake
  accounts, as well as redirecting client connections, for business continuity and disaster recovery:

  > + [Introduction to replication and failover across multiple accounts](user-guide/account-replication-intro.md)
  > + [Redirecting client connections](user-guide/client-redirect.md)
* [Sample data sets](user-guide/sample-data.md) — Key concepts and tasks for using the sample data sets provided with Snowflake:

  > + [Use the sample database](user-guide/sample-data-using.md)
  > + [Sample data: TPC-H](user-guide/sample-data-tpch.md)
  > + [Sample Data: OpenWeatherMap — Deprecated](user-guide/sample-data-openweathermap.md)

---
title: Managing Your Snowflake Account
source: https://docs.snowflake.com/en/user-guide-admin.md
section: General
---

# Managing Your Snowflake Account

These topics describe the administrative concepts and tasks associated with managing your account in Snowflake. These topics are intended primarily for administrators (i.e. users with the ACCOUNTADMIN,
SYSADMIN, or SECURITYADMIN roles).

* [Account identifiers](user-guide/admin-account-identifier.md)

  > Detailed descriptions of the two unique account identifiers supported for connecting to Snowflake and using features that span multiple accounts.
* [Trial accounts](user-guide/admin-trial-account.md)

  > Instructions for signing up for a trial account, adding a credit card to the account, and canceling the account.
* [Parameter management](user-guide/admin-account-management.md)

  > Instructions for setting account, session, and object parameters for your account.
* [User management](user-guide/admin-user-management.md)

  > Instructions for creating and managing users in your account.
* [Behavior change management](release-notes/bcr-bundles/managing-behavior-change-releases.md)

  > Instructions for enabling and disabling behavior change releases in your account.

---
title: ML Functions
source: https://docs.snowflake.com/en/guides-overview-ml-functions.md
section: General
---

# ML Functions

These powerful analysis functions give you automated predictions and insights into your data using machine learning.
Snowflake provides an appropriate type of model for each feature, so you don’t have to be a machine learning expert
to take advantage of them. All you need is your data.

## Time-Series Functions

Use time-series functions to train a machine learning model on your time-series data to determine how a specified metric (for example,
sales) varies over time and relative to other features of your data. The model then provides insights or predictions
based on the trends detected in the data.

* [Forecasting](user-guide/ml-functions/forecasting.md) predicts future metric values from past trends in time-series data.
* [Anomaly Detection](user-guide/ml-functions/anomaly-detection.md) flags metric values that differ from typical expectations.

## Other Analysis Functions

These features don’t require time series data.

* [Classification](user-guide/ml-functions/classification.md) sort rows into two or more classes based on
  their most predictive features.
* [Top Insights](user-guide/ml-functions/top-insights.md) helps you find dimensions and values that affect the metric in
  surprising ways.

## Cost Considerations

When you use ML functions, you incur storage and compute costs. These costs vary depending on the feature used and the
quantity of data used in training and prediction.

The storage costs you incur reflect storage of the ML model instances created during the training step. To view the
objects associated with your model instance, navigate to your [Account Usage views](sql-reference/account-usage.md)
(ACCOUNT_USAGE.TABLES and ACCOUNT_USAGE.STAGES). These objects appear with null database and schema columns. The
`instance_id` column, however, will be populated, indicating that these objects are contained in a model instance.
These objects are fully managed by the model instance, and you cannot access or delete them separately. To reduce
storage costs associated with your models, delete unused or obsolete models.

See [Understanding compute cost](user-guide/cost-understanding-compute.md) for general information on Snowflake compute costs.

## Limitations

Before you use ML functions, you must ensure [AUTOCOMMIT](sql-reference/transactions.md) is enabled in your session. AUTOCOMMIT is
enabled by default when you start a new Snowflake session.

## Using ML functions in Snowpark

`session.call` is not yet compatible with models created by ML functions. To call such a model in Snowpark, use
`session.sql` instead, as shown here.

```python
session.sql('call my_model!FORECAST(...)').collect()
```

---
title: Optimizing performance in Snowflake
source: https://docs.snowflake.com/en/guides-overview-performance.md
section: General
---

# Optimizing performance in Snowflake

The following topics help guide efforts to improve the performance of Snowflake.

[Exploring execution times](user-guide/performance-query-exploring.md)
:   Gain insights into the historical performance of queries using the web interface or by writing queries against data in the ACCOUNT_USAGE
    schema.

[Optimizing query performance](user-guide/performance-query-options.md)
:   Learn about options for optimizing Snowflake query performance.

[Optimizing warehouses for performance](user-guide/performance-query-warehouse.md)
:   Learn about strategies to fine-tune computing power in order to improve the performance of a query or set of
    queries running on a warehouse, including enabling the Query Acceleration Service.

[Optimizing storage for performance](user-guide/performance-query-storage.md)
:   Learn how storing similar data together, creating optimized data structures, and defining specialized data sets can improve the
    performance of queries.

    Helpful when choosing between Automatic Clustering, Search Optimization Service, and materialized views.

[Analyzing query workloads with Performance Explorer](user-guide/performance-explorer.md)
:   Learn how to use Performance Explorer in Snowsight to monitor interactive metrics for SQL workloads.

[Snowflake Optima](user-guide/snowflake-optima.md)
:   Learn how Snowflake Optima continuously analyzes workload patterns and implements the most effective strategies automatically.

---
title: Privacy in Snowflake
source: https://docs.snowflake.com/en/guides-overview-privacy.md
section: General
---

# Privacy in Snowflake

Snowflake provides industry-leading features that maintain the privacy of individuals and sensitive data.

[Differential privacy](user-guide/diff-privacy/differential-privacy-overview.md)
:   Protect the identity and information of entities against targeted privacy attacks. Data providers assign privacy policies to tables and
    views to protect their data with differential privacy.

[Aggregation policies](user-guide/aggregation-policies.md)
:   Require queries to aggregate data in order to return results.

[Join policies](user-guide/join-policies.md)
:   Require queries to join tables in order to return results.

    [Preview Feature](release-notes/preview-features.md) — Open

    Available to all accounts that are Enterprise Edition (or higher).

    To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

[Projection policies](user-guide/projection-policies.md)
:   Prevent queries from using a SELECT statement to project values from a column.

[Synthetic data](user-guide/synthetic-data.md)
:   Programmatically create realistic datasets that closely mirror your original data. This allows you to safely represent sensitive, confidential, or restricted information across various workloads, such as testing and validation.

---
title: Query Data in Snowflake
source: https://docs.snowflake.com/en/guides-overview-queries.md
section: General
---

# Query Data in Snowflake

Snowflake supports standard SQL, including a subset of ANSI SQL:1999 and the SQL:2003 analytic extensions.
Snowflake also supports common variations for a number of commands where those variations do not conflict with each other.

> **Tip:**
>
> You can use the search optimization service to improve query performance.
> For details, see [Search optimization service](user-guide/search-optimization-service.md).

[Working with joins](user-guide/querying-joins.md)
:   A join combines rows from two tables to create a new combined row that can be used in the query.

    Learn join concepts, types of joins, and how to work with joins.

[Analyzing time-series data](user-guide/querying-time-series-data.md)
:   Analyze time-series data, using SQL functionality designed for this purpose, such as the ASOF JOIN feature, date and time
    helper functions, aggregate functions for downsampling, and functions that support sliding window frames.

    Using ASOF JOIN, learn how to join tables on timestamp columns when their values closely follow each other, precede each other,
    or match exactly.

[Eliminate Redundant Joins](user-guide/join-elimination.md)
:   A join on a key column can refer to tables that are not needed for the join. Such a join is referred to as a *redundant join*.

    Learn about redundant joins, and how to eliminate them to improve query performance.

[Working with Subqueries](user-guide/querying-subqueries.md)
:   A subquery is a query within another query.

    Learn about subqueries and how to use them.

[Querying Hierarchical Data](user-guide/queries-hierarchical.md)
:   Relational databases often store hierarchical data by using different tables.

    Learn about querying hierarchical data using joins, Common Table Expressions(CTEs) and CONNECT BY.

[Working with CTEs (Common Table Expressions)](user-guide/queries-cte.md)
:   A CTE (common table expression) is a named subquery defined in a WITH clause, the result of which is effectively a table.

    Learn how to write and work with CTE expressions.

[Querying Semi-structured Data](user-guide/querying-semistructured.md)
:   Semi-structured data represents arbitrary hierarchical data structures, which can be used to load and operate on
    data in semi-structured formats (e.g. JSON, Avro, ORC, Parquet, or XML).

    Learn how to use special operators and functions to query complex hierarchical data stored in a VARIANT.

[Using full-text search](user-guide/querying-with-search-functions.md)
:   You can use full-text search to find character data (text) in specified columns
    from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns.

    Learn how to run queries that use full-text search.

[Constructing SQL at runtime](user-guide/querying-construct-at-runtime.md)
:   You can create programs that construct SQL statements dynamically at runtime.

    Learn about different options for constructing SQL at runtime.

[Analyzing data with window functions](user-guide/functions-window-using.md)
:   Window functions operate on windows, which are groups of rows that are related in some way.

    Learn about windows, window functions, and how to use window functions to examine data.

[Identifying Sequences of Rows That Match a Pattern](user-guide/match-recognize-introduction.md)
:   In some cases, you might need to identify sequences of table rows that match a pattern.

    Learn about pattern matching, and how to use MATCH_RECOGNIZE to work with table rows matching patterns.

[Using Sequences](user-guide/querying-sequences.md)
:   Sequences are used to generate unique numbers across sessions and statements, including concurrent statements.

    Learn what are sequences, and how to use them.

[Using Persisted Query Results](user-guide/querying-persisted-results.md)
:   When a query is executed, the result is persisted for a period of time.

    Learn how query results are persisted, how long persisted results are available,
    and how to use persisted query results to improve performance.

[Computing the Number of Distinct Values](user-guide/querying-distinct-counts.md)
:   Various methods exist to determine the count of distinct elements within a column.

    Learn methods to identify and report distinct elements in data.

[Estimating Similarity of Two or More Sets](user-guide/querying-approximate-similarity.md)
:   Snowflake provides mechanisms to compare data sets for similarity.

    Learn how Snowflake determines similarity and how to compare multiple data sets for similarity.

[Estimating Frequent Values](user-guide/querying-approximate-frequent-values.md)
:   Snowflake can examine data to determine how frequent values are within the data.

    Learn how frequency is determined and how to query data to determine data frequency using the through the APPROX_TOP_K family of functions.

[Estimating Percentile Values](user-guide/querying-approximate-percentile-values.md)
:   Snowflake can estimate percentages of values using an improved version of the t-Digest algorithm.

    Learn how to estimate percentages using the APPROX_PERCENTILE family of functions

[Monitor query activity with Query History](user-guide/ui-snowsight-activity.md)
:   Monitor the query activity in your account.

    Learn how examine queries, using query profiles, to understand and improve performance.

[Using query insights to improve performance](user-guide/query-insights.md)
:   Review the insights produced for a query.

    Learn how to improve the performance of a query.

[Using the Query Hash to Identify Patterns and Trends in Queries](user-guide/query-hash.md)
:   To identify patterns and trends in queries, you can use the hash of the query text, which is included in the `query_hash` and
    `query_parameterized_hash` columns in selected Account Usage view and in the output of selected Information Schema table
    functions.

    Learn how to use the query hash in these columns to identify repeated queries and detect patterns and trends in queries.

[Top-K pruning for improved query performance](user-guide/querying-top-k-pruning-optimization.md)
:   Instead of scanning all eligible rows in SELECT statements that contain LIMIT and ORDER BY clauses, SELECT statements
    that use top-K pruning scan a subset of rows, which can improve performance.

    Learn how to use top-K pruning to improve the performance of SELECT statements that contain LIMIT and ORDER BY clauses.

[Canceling Statements](user-guide/querying-cancel-statements.md)
:   Executing statements are typically cancelled using the interface used to start the query.

    Learn how to use system functions to cancel a specific query or all currently executing queries.

---
title: Reference
source: https://docs.snowflake.com/en/reference.md
section: General
---

# Reference

Reference information on various areas of Snowflake.

[SQL data types reference](sql-reference-data-types.md)
:   Reference for SQL data types.

[SQL command reference](sql-reference-commands.md)
:   Reference for SQL commands.

[Function and stored procedure reference](sql-reference-functions.md)
:   Reference for SQL functions.

[SQL class reference](sql-reference-classes.md)
:   Reference for SQL classes.

[Snowflake Scripting reference](sql-reference-snowflake-scripting.md)
:   Reference for [Snowflake Scripting](developer-guide/snowflake-scripting/index.md) constructs.

[General reference](sql-reference.md)
:   Reference material on other subjects.

---
title: Securing Snowflake
source: https://docs.snowflake.com/en/guides-overview-secure.md
section: General
---

# Securing Snowflake

Snowflake provides industry-leading features that help ensure you can configure the highest levels of security for your account and users,
as well as all the data you store in Snowflake.

These topics are intended primarily for administrators (i.e. users with the ACCOUNTADMIN, SYSADMIN, or SECURITYADMIN roles).

## Authentication

[Authentication policies](user-guide/authentication-policies.md)
:   Using authentication policies to restrict account and user authentication by client, authentication methods, and more.

[Multi-factor authentication (MFA)](user-guide/security-mfa.md)
:   Using multi-factor authentication with Snowflake.

[Federated Authentication & SSO](user-guide/admin-security-fed-auth-overview.md)
:   Topics related to federated authentication to Snowflake.

[Key-pair authentication and key-pair rotation](user-guide/key-pair-auth.md)
:   Using key-pair authentication to Snowflake.

[Using programmatic access tokens for authentication](user-guide/programmatic-access-tokens.md)
:   Generating and managing programmatic access tokens for authentication.

[OAuth](user-guide/oauth-intro.md)
:   Topics related to using Snowflake OAuth and External OAuth to connect to Snowflake.

[Workload identity federation](user-guide/workload-identity-federation.md)
:   Preferred authentication method for service-to-service workloads.

[External API authentication and secrets](user-guide/api-authentication.md)
:   Configuring Snowflake to authenticate to external services.

## Network security

[Malicious IP Protection](user-guide/malicious-ip-protection.md)
:   Protecting your account from IP addresses that are known to be malicious.

[Controlling network traffic with network policies](user-guide/network-policies.md)
:   Using network policies to restrict access to Snowflake.

[Network rules](user-guide/network-rules.md)
:   Using network rules with other Snowflake features to restrict access to and from Snowflake.

## Private connectivity

[Private connectivity for inbound network traffic](user-guide/private-connectivity-inbound.md)
:   Using private connectivity to access the Snowflake service, Snowsight, Streamlit in Snowflake, internal stages, Snowflake managed
    storage volumes, and Snowpark Container Services.

[Private connectivity for outbound network traffic](user-guide/private-connectivity-outbound.md)
:   Using private connectivity for external network locations, external functions, external stages, external tables, external
    volumes, and Snowpipe automation.

## Administration and authorization

[Trust Center](user-guide/trust-center/overview.md)
:   Using the Trust Center to evaluate and monitor your account for security risks.

[Snowflake sessions and session policies](user-guide/session-policies.md)
:   Using session policies to manage your Snowflake session.

[SCIM](user-guide/scim-intro.md)
:   Topics related to using SCIM to provision users and groups to Snowflake.

[Access Control](user-guide/security-access-control-overview.md)
:   Topics related to role-based access control (RBAC) in Snowflake.

[End to End Encryption](user-guide/security-encryption-end-to-end.md)
:   Using end-to-end encryption in Snowflake.

---
title: Snowflake AI and ML
source: https://docs.snowflake.com/en/guides-overview-ai-features.md
section: General
---

# Snowflake AI and ML

Snowflake offers two broad categories of powerful, intelligent features based on Artificial Intelligence (AI) and
Machine Learning (ML). These features can help you do more with your data in less time than ever before.

* **Snowflake Cortex** is a suite of AI features that use large language models (LLMs) to understand unstructured data,
  answer freeform questions, and provide intelligent assistance. This suite of Snowflake AI Features comprises:

  + [Cortex Agents](user-guide/snowflake-cortex/cortex-agents.md)
  + [Snowflake Cortex AI Functions (including LLM functions)](user-guide/snowflake-cortex/aisql.md)
  + [Cortex Analyst](user-guide/snowflake-cortex/cortex-analyst.md)
  + [Cortex Fine-tuning](user-guide/snowflake-cortex/cortex-finetuning.md)
  + [Cortex Search](user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md)
  + [Snowflake Intelligence](user-guide/snowflake-cortex/snowflake-intelligence.md)
  + [Cortex Code in Snowsight](user-guide/cortex-code/cortex-code-snowsight.md)
  + [Cortex Code CLI](user-guide/cortex-code/cortex-code-cli.md)
* **Snowflake ML** provides functionality for you to build your own models.

  + [ML Functions](guides-overview-ml-functions.md) simplify the process of creating and using traditional machine
    learning models to detect patterns in your structured data. These powerful out-of-the-box analysis tools help
    time-strapped analysts, data engineers, and data scientists understand, predict, and classify data, without any
    programming.
  + For data scientists and developers, [Snowflake ML](developer-guide/snowflake-ml/overview.md) lets you develop
    and operationalize custom models to solve your unique data challenges, while keeping your data inside Snowflake.
    Snowflake ML incorporates model development classes based on popular ML frameworks, along with ML Ops capabilities
    such as a feature store, a model registry, framework connectors, and immutable data snapshots.

## Use of Snowflake AI features

Snowflake AI Features and their underlying models are designed with the following principles in mind:

* **Full security.** Except as you elect, all AI models run inside of Snowflake’s security and governance perimeter. Your data is not
  available to other customers or model developers.
* **Data privacy.** Snowflake never uses your Customer Data to train models made available to our customer base.
* **Control.** You have control over your team’s use of Snowflake AI Features through familiar
  [role-based access control](user-guide/security-access-control-overview.md).

## AI/ML model update process

Snowflake is continually working to improve the quality of its offerings, including the models powering the Snowflake AI Features.
This section describes how updates to those models fit into [Snowflake’s Behavior Change](release-notes/intro-bcr-releases.md) process.

# Model Update and Behavior Change Policy

## Overview

Snowflake continuously updates the models that power Cortex AI features to improve quality,
performance, and availability. These updates may introduce changes to model behavior,
availability, or lifecycle status.

This document describes how model changes are defined, how they are communicated, and
how model lifecycle and deprecation are managed.

## Model lifecycle

Models in Cortex follow a defined lifecycle to communicate readiness and stability:

* Private Preview
* Public Preview
* General Availability (GA)
* Legacy
* End of Life (EOL)

Lifecycle status reflects the maturity and support level of a model. As models progress through
these stages, their status will be reflected across customer-facing surfaces.

Preview models are intended for evaluation and may change more frequently. GA models are
considered stable and suitable for production use.

## Types of model changes

A model update is considered a behavior change if it results in any of the following:

* Changes to required syntax, including specifying a model or model version
* Changes to the structure of model outputs
* Deprecation of a model

These changes may impact how customers interact with models and should be reviewed as part
of normal governance processes.

## How changes are communicated

Snowflake communicates model-related updates through the following mechanisms:

* [Behavior Change Releases (BCRs)](release-notes/intro-bcr-releases.md) —
  Used for changes that may require customer action or impact existing workflows
* [What’s New](release-notes/new-features.md) —
  Used for improvements or additions that do not materially change how customers
  interact with models

Model deprecations are communicated separately from bundled releases to provide clear and
timely notification.

## Deprecation policy

Snowflake periodically deprecates models to ensure customers have access to high-quality,
well-supported options.

For General Availability (GA) models:

* Snowflake will make reasonable efforts to provide at least 60 days advance notice prior
  to deprecation

For Preview models:

* Deprecation timelines are not guaranteed and may occur with shorter notice

During the deprecation period:

* Customers are expected to migrate to alternative models before the deprecation date
* After deprecation, models may no longer be available for use

Lifecycle status will reflect deprecation through the transition to Legacy and ultimately End of
Life.

## Legal notices

* If you choose to use any of the Snowflake AI Features, your use is subject to our
  [Acceptable Use Policy](https://www.snowflake.com/legal/acceptable-use-policy/).
* The outputs of Snowflake AI Features may be inaccurate, inappropriate, inefficient, or biased. Decisions based on such
  outputs, including those built into automatic pipelines, should have human oversight and review processes to ensure they are
  safe, accurate, and suitable for your intended use.
* Your use of any Snowflake AI Feature that is identified as being powered by a third-party, open-source model is subject to any
  applicable license agreement and/or acceptable use policy set forth under the Offering-Specific Terms page available at
  <https://www.snowflake.com/legal/>.
* For further information, see the [Snowflake AI Trust and Safety FAQ](https://www.snowflake.com/en/legal/snowflake-ai-trust-and-safety/).

---
title: Snowflake data types
source: https://docs.snowflake.com/en/data-types.md
section: General
---

# Snowflake data types

Snowflake supports most basic SQL data types (with some restrictions) for use in columns, local variables, expressions, parameters,
and any other appropriate locations.

> **Note:**
>
> You can also load unstructured data into Snowflake. For more information, see [Introduction to unstructured data](user-guide/unstructured-intro.md).

In some cases, data of one type can be converted to another type. For example, INTEGER data can be converted to FLOAT data.

Some conversions are lossless, but others might lose information. The amount of loss depends upon the data types and the specific
values. For example, converting a FLOAT value to an INTEGER value removes the digits after the decimal place. (The value is
rounded to the nearest integer.)

In some cases, the user must specify the desired conversion, such as when passing a VARCHAR value to the
[TIME_SLICE](sql-reference/functions/time_slice.md) function, which expects a TIMESTAMP or DATE argument. We
call this explicit casting.

In other cases, data types are converted automatically, such as when adding a float and an integer. We call this
implicit casting (or coercion). In Snowflake, data types are automatically coerced whenever necessary
and possible.

For more information about explicit and implicit casting, see [Data type conversion](sql-reference/data-type-conversion.md).

For more information about Snowflake data types, see the following topics:

* [Summary of data types](sql-reference/intro-summary-data-types.md)
* [Numeric data types](sql-reference/data-types-numeric.md)
* [String & binary data types](sql-reference/data-types-text.md)
* [Logical data types](sql-reference/data-types-logical.md)
* [Date & time data types](sql-reference/data-types-datetime.md)
* [Semi-structured data types](sql-reference/data-types-semistructured.md)
* [Structured data types](sql-reference/data-types-structured.md)
* [Unstructured data types](sql-reference/data-types-unstructured.md)
* [Geospatial data types](sql-reference/data-types-geospatial.md)
* [UUID data type](sql-reference/data-types-uuid.md)
* [Vector data types](sql-reference/data-types-vector.md)
* [User-defined types](sql-reference/data-types-user-defined.md)
* [Unsupported data types](sql-reference/data-types-unsupported.md)
* [Data type conversion](sql-reference/data-type-conversion.md)

---
title: Snowflake Scripting reference
source: https://docs.snowflake.com/en/sql-reference-snowflake-scripting.md
section: General
---

# Snowflake Scripting reference

These topics provide reference information for the language elements supported in
[Snowflake Scripting](developer-guide/snowflake-scripting/index.md).

```sqlsyntax
-- Variable declaration
[ DECLARE ... ]
  ...
BEGIN
  ...
  -- Branching
  [ IF ... ]
  [ CASE ... ]

  -- Looping
  [ FOR ... ]
  [ WHILE ... ]
  [ REPEAT ... ]
  [ LOOP ... ]

  -- Loop termination (within a looping construct)
  [ BREAK ]
  [ CONTINUE ]

  -- Variable assignment
  [ LET ... ]

  -- Cursor management
  [ OPEN ... ]
  [ FETCH ... ]
  [ CLOSE ... ]

  -- Asynchronous child job management
  [ AWAIT ... ]
  [ CANCEL ... ]

  -- "No-op" (no-operation) statement (usually within a branch or exception)
  [ NULL ]

  -- Raising exceptions
  [ RAISE ... ]

  -- Returning a value
  [ RETURN ... ]

-- Exception handling
[ EXCEPTION ... ]

END;
```

**Next Topics:**

* [AWAIT](sql-reference/snowflake-scripting/await.md)
* [BEGIN … END](sql-reference/snowflake-scripting/begin.md)
* [BREAK](sql-reference/snowflake-scripting/break.md)
* [CANCEL](sql-reference/snowflake-scripting/cancel.md)
* [CASE](sql-reference/snowflake-scripting/case.md)
* [CLOSE](sql-reference/snowflake-scripting/close.md)
* [CONTINUE](sql-reference/snowflake-scripting/continue.md)
* [DECLARE](sql-reference/snowflake-scripting/declare.md)
* [EXCEPTION](sql-reference/snowflake-scripting/exception.md)
* [FETCH](sql-reference/snowflake-scripting/fetch.md)
* [FOR](sql-reference/snowflake-scripting/for.md)
* [IF](sql-reference/snowflake-scripting/if.md)
* [LET](sql-reference/snowflake-scripting/let.md)
* [LOOP](sql-reference/snowflake-scripting/loop.md)
* [NULL](sql-reference/snowflake-scripting/null.md)
* [OPEN](sql-reference/snowflake-scripting/open.md)
* [RAISE](sql-reference/snowflake-scripting/raise.md)
* [REPEAT](sql-reference/snowflake-scripting/repeat.md)
* [RETURN](sql-reference/snowflake-scripting/return.md)
* [WHILE](sql-reference/snowflake-scripting/while.md)

---
title: SQL class reference
source: https://docs.snowflake.com/en/sql-reference-classes.md
section: General
---

# SQL class reference

These topics provide reference information for Snowflake [classes](sql-reference/snowflake-db-classes.md).

Each class supports one or more of the following SQL operations:

* ALTER: Modifies the properties of an instance of a class.
* CREATE: Creates an instance of a class.
* DROP: Deletes an instance of a class.
* SHOW: Lists instances of a class.

An instance of a class can have one or more methods. A method is a stored procedure or function and can be called by
using the instance name and method name, and arguments (if any) required by the method. For example,
`CALL instance_name!method_name(...)`.

## Updating your search path

You can add the schema for classes you use frequently to your search path to save typing and make your SQL statements
more concise. For more information about updating your search path, see [Update your search path](sql-reference/snowflake-db-classes.md).

## Available classes

Snowflake provides the following system-defined (built-in) classes.

[ANOMALY_DETECTION (SNOWFLAKE.ML)](sql-reference/classes/anomaly_detection.md)
:   Allows you to detect outliers in your time series data.

[ANOMALY_INSIGHTS (SNOWFLAKE.LOCAL)](sql-reference/classes/anomaly_insights.md)
:   Allows you to detect outliers in your costs.

[BUDGET (SNOWFLAKE.CORE)](sql-reference/classes/budget.md)
:   Allows you to monitor credit usage of supported objects.

[CLASSIFICATION (SNOWFLAKE.ML)](sql-reference/classes/classification.md)
:   Automatically sorts data into categories based on features in the data.

[CLASSIFICATION_PROFILE (SNOWFLAKE.DATA_PRIVACY)](sql-reference/classes/classification_profile.md)
:   Allows you to automatically classify sensitive data.

[CUSTOM_CLASSIFIER (SNOWFLAKE.DATA_PRIVACY)](sql-reference/classes/custom_classifier.md)
:   Allows you to define custom classifiers to extend your data classification capabilities.

[FORECAST (SNOWFLAKE.ML)](sql-reference/classes/forecast.md)
:   Represents a forecast model that produces a forecast for a single or multiple time series.

[TOP_INSIGHTS (SNOWFLAKE.ML)](sql-reference/classes/top-insights.md)
:   Allows you to determine the segments driving changes in a metric.

---
title: SQL command reference
source: https://docs.snowflake.com/en/sql-reference-commands.md
section: General
---

# SQL command reference

These topics provide reference information for all the Snowflake SQL commands (DDL, DML, and query syntax).

* [Query syntax](sql-reference/constructs.md) — structure of SQL queries in Snowflake.
* [Query operators](sql-reference/operators.md) — arithmetic, logical, and other types of operators.
* [Data Definition Language (DDL) commands](sql-reference/sql-ddl-summary.md) — overview of DDL commands.
* [Data Manipulation Language (DML) commands](sql-reference/sql-dml.md) — commands for performing DML operations, including:

  + Inserting, deleting, updating, and merging data in Snowflake tables.
  + Bulk copying data into and out of Snowflake tables.
  + Staging files for bulk copying.
* [All commands (alphabetical)](sql-reference/sql-all.md) — alphabetical list of all the commands.
* Commands categorized by the type of objects and operations they control, including:

  + General account-level objects (accounts, users, roles, security policies, integrations, etc.) and operations (failover & recovery, etc.).
  + Session-based operations (session context, queries, variables, transactions, etc.).
  + Virtual warehouses (for loading data and performing queries) and resource monitors (for controlling credit usage).
  + Databases, schemas, tables, and other schema-level objects (views, sequences, etc.).
  + Snowflake extensions and application development (user-defined functions, stored procedures, scripting, etc.).
  + Objects for sharing data (shares, listings, etc.).
  + Objects for classifying, protecting, and governing data (masking policies, row-access policies, tags, etc.).

---
title: SQL data types reference
source: https://docs.snowflake.com/en/sql-reference-data-types.md
section: General
---

# SQL data types reference

Snowflake supports most basic SQL data types (with some restrictions) for use in columns, local variables, expressions, parameters,
and any other appropriate locations.

> **Note:**
>
> You can also load unstructured data into Snowflake. For more information, see [Introduction to unstructured data](user-guide/unstructured-intro.md).

In some cases, data of one type can be converted to another type. For example, INTEGER data can be converted to FLOAT data.

Some conversions are lossless, but others might lose information. The amount of loss depends upon the data types and the specific
values. For example, converting a FLOAT value to an INTEGER value removes the digits after the decimal place. (The value is
rounded to the nearest integer.)

In some cases, the user must specify the desired conversion, such as when passing a VARCHAR value to the
[TIME_SLICE](sql-reference/functions/time_slice.md) function, which expects a TIMESTAMP or DATE argument. We
call this explicit casting.

In other cases, data types are converted automatically, such as when adding a float and an integer. We call this
implicit casting (or coercion). In Snowflake, data types are automatically coerced whenever necessary
and possible.

For more information about explicit and implicit casting, see [Data type conversion](sql-reference/data-type-conversion.md).

For more information about Snowflake data types, see the following topics:

* [Summary of data types](sql-reference/intro-summary-data-types.md)
* [Numeric data types](sql-reference/data-types-numeric.md)
* [String & binary data types](sql-reference/data-types-text.md)
* [Logical data types](sql-reference/data-types-logical.md)
* [Date & time data types](sql-reference/data-types-datetime.md)
* [Semi-structured data types](sql-reference/data-types-semistructured.md)
* [Structured data types](sql-reference/data-types-structured.md)
* [Unstructured data types](sql-reference/data-types-unstructured.md)
* [Geospatial data types](sql-reference/data-types-geospatial.md)
* [UUID data type](sql-reference/data-types-uuid.md)
* [Vector data types](sql-reference/data-types-vector.md)
* [User-defined types](sql-reference/data-types-user-defined.md)
* [Unsupported data types](sql-reference/data-types-unsupported.md)
* [Data type conversion](sql-reference/data-type-conversion.md)

---
title: Stored procedures
source: https://docs.snowflake.com/en/sql-reference-stored-procedures.md
section: General
---

# Stored procedures

Snowflake provides stored procedures to facilitate using certain Snowflake features. To find the stored procedures that are associated with a particular Snowflake Class, see [SQL class reference](sql-reference-classes.md).

Use [CALL](sql-reference/sql/call.md) to call a stored procedure. For example:

```sqlexample
CALL SYSTEM$CLASSIFY('hr.tables.empl_info', null);
```

Snowflake supports the following stored procedures, grouped by feature:

| Feature | Stored procedure |
| --- | --- |
| [Cortex Powered Object Descriptions](user-guide/sql-cortex-descriptions.md) | * [AI_GENERATE_TABLE_DESC](sql-reference/stored-procedures/ai_generate_table_desc.md) |
| [Data classification](user-guide/classify-intro.md) | * [ASSOCIATE_SEMANTIC_CATEGORY_TAGS](sql-reference/stored-procedures/associate_semantic_category_tags.md) * [SYSTEM$CLASSIFY](sql-reference/stored-procedures/system_classify.md) * [SYSTEM$CLASSIFY_SCHEMA](sql-reference/stored-procedures/system_classify_schema.md) * [SYSTEM$CANCEL_CLASSIFY_SCHEMA](sql-reference/stored-procedures/system_cancel_classify_schema.md) |
| [Data sharing and collaboration](guides-overview-sharing.md) | * [SYSTEM$REQUEST_LISTING_AND_WAIT](sql-reference/stored-procedures/system_request_listing_and_wait.md) |
| [Default event table](developer-guide/logging-tracing/event-table-setting-up.md) | * [ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](sql-reference/stored-procedures/snowflake_telemetry_add_row_access_policy_on_events_view.md) * [DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](sql-reference/stored-procedures/snowflake_telemetry_drop_row_access_policy_on_events_view.md) |
| [Differential privacy](user-guide/diff-privacy/differential-privacy-overview.md) | * [RESET_PRIVACY_BUDGET](sql-reference/stored-procedures/reset_privacy_budget.md) |
| [Network security](user-guide/network-policy-advisor.md) | * [EVALUATE_CANDIDATE_NETWORK_POLICY](sql-reference/stored-procedures/evaluate_candidate_network_policy.md) * [RECOMMEND_NETWORK_POLICY](sql-reference/stored-procedures/recommend_network_policy.md) |
| [Notifications](user-guide/notifications/about-notifications.md) | * [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](sql-reference/stored-procedures/system_send_snowflake_notification.md) * [SYSTEM$SEND_EMAIL](sql-reference/stored-procedures/system_send_email.md) |
| [Semantic views](user-guide/views-semantic/overview.md) | * [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) |
| [Synthetic data](user-guide/synthetic-data.md) | * [GENERATE_SYNTHETIC_DATA](sql-reference/stored-procedures/generate_synthetic_data.md) |
| [Trust Center](user-guide/trust-center/overview.md) | * [REGISTER_EXTENSION](sql-reference/stored-procedures/register_extension.md) * [DEREGISTER_EXTENSION](sql-reference/stored-procedures/deregister_extension.md) |

---
title: Tutorials and Other Resources
source: https://docs.snowflake.com/en/other-resources.md
section: General
---

# Tutorials and Other Resources

This topic provides links to assorted “how to” tutorials/labs and “best practices” for using Snowflake.

## Tutorials

Snowflake provides several tutorials for getting started.

You will need a Snowflake account to explore these tutorials. If you sign up for a trial account,
the trial account has a user with necessary roles (ACCOUNTADMIN and SYSADMIN) and a virtual warehouse (COMPUTE_WH)
needed to explore this tutorial. If you use any other account to explore this tutorial, then make sure your user is
granted these roles and the account has the virtual warehouse.

For new users, we recommend you start with these tutorials:

* [Snowflake in 20 minutes](user-guide/tutorials/snowflake-in-20minutes.md) — A simple tutorial using SnowSQL, the Snowflake command-line client, to introduce key concepts and tasks.
* [Getting Started with Snowflake - Zero to Snowflake](https://quickstarts.snowflake.com/guide/getting_started_with_snowflake/index.html) — A comprehensive tutorial that uses both SnowSQL and [Snowsight](user-guide/ui-snowsight-gs.md) covers data loading,
  querying, working with semi-structured data, accessing
  historical data using Snowflake’s Time Travel feature, sharing, and so on.
* [Getting Started with Python](https://quickstarts.snowflake.com/guide/getting_started_with_python/index.html) — A tutorial in which you set up the Python Connector and then explore the basic operations you can do with it.

For tutorials on bulk loading, see:

* [Bulk Loading from a Local File System](user-guide/tutorials/data-load-internal-tutorial.md)
* [Bulk Loading from Amazon S3](user-guide/tutorials/data-load-external-tutorial.md)

In addition, you might explore the following pages that introduce important concepts about semi-structured data:

* [JSON Basics](user-guide/tutorials/json-basics-tutorial.md)
* [Loading JSON Data into a Relational Table](user-guide/tutorials/script-data-load-transform-json.md)
* [Loading and Unloading Parquet Data](user-guide/tutorials/script-data-load-transform-parquet.md)

## Best Practices

Snowflake best practices are provided throughout the documentation. The following are
links to important practices related to Snowflake features:

* [Roles and Access Control](user-guide/security-access-control-considerations.md)
* [Virtual Warehouses](user-guide/warehouses-considerations.md)
* [Table Design](user-guide/table-considerations.md)
* [Data Storage](user-guide/tables-storage-considerations.md)
* [Data Loading](user-guide/data-load-considerations.md)
* [Data Unloading](user-guide/data-unload-considerations.md)
* [Semi-structured Data](user-guide/semistructured-considerations.md)

## Sample Data Sets

The following benchmarking datasets are available for all Snowflake accounts:

* [TPC-DS](user-guide/sample-data-tpcds.md)
* [TPC-H](user-guide/sample-data-tpch.md)

In addition, [Snowflake Marketplace](https://app.snowflake.com/marketplace?pricing=free) is where you can
find additional data sets, provided by third-parties, for use with Snowflake. For related documentation, refer to
[Introduction to the Snowflake Marketplace](https://other-docs.snowflake.com/en/marketplace/intro.html).

---
title: Tutorials to get started with Snowflake
source: https://docs.snowflake.com/en/learn-tutorials.md
section: General
---

# Tutorials to get started with Snowflake

The tutorials in this topic provide hands-on examples that get you started with Snowflake. To explore these
tutorials, you must have a Snowflake account and a user with the required roles and access to a virtual
warehouse:

* If you have signed up for a [trial account](user-guide/admin-trial-account.md), the trial account user has
  the required roles and a virtual warehouse that you can use for several of these tutorials.
* If you use another account to explore these tutorials, you must sign in as a user that has the required
  roles and that can use a virtual warehouse.

Each tutorial describes the prerequisites that must be met before completing its tasks, including the roles required for
the user who performs the tasks. Several tutorials require the ACCOUNTADMIN and SYSADMIN roles.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage that you use for any sample data in
> these tutorials. Snowflake requires a [virtual warehouse](user-guide/warehouses.md) to
> load the data and execute queries. A running virtual warehouse consumes Snowflake credits.
> After you finish a tutorial, you can drop objects that are created in the tutorial to minimize
> costs.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

The following sections contain links to tutorials that get you started with Snowflake tasks and features:

## Tutorial that introduces you to Snowflake

Snowflake provides the following tutorial to introduce you to key concepts and tasks:

[Snowflake in 20 minutes](user-guide/tutorials/snowflake-in-20minutes.md)
:   Use SnowSQL, a Snowflake command-line client, to learn about key concepts and tasks.

## Tutorials to get started with data engineering

Snowflake provides the following tutorials to get you started with data engineering:

> **Note:**
>
> These tutorials show you how to load data into a table by using the
> [COPY INTO <table>](sql-reference/sql/copy-into-table.md) command. For information about other options
> for loading data, see [Overview of data loading](user-guide/data-load-overview.md).

### Load data

[Load and query sample data using SQL](user-guide/tutorials/tasty-bytes-sql-load.md)
:   Uses a fictitious food truck brand named Tasty Bytes to show you how to
    [load](user-guide/data-load-overview.md) and query data in Snowflake using
    SQL. You can access a pre-loaded
    [Snowsight template](user-guide/ui-snowsight/snowsight-templates.md) worksheet
    to complete these tasks.

[Load data from cloud storage: Amazon S3](user-guide/tutorials/load-from-cloud-tutorial.md)
:   Shows you how to load data from an Amazon S3 bucket into Snowflake using SQL. You can
    access a pre-loaded Snowsight template worksheet to complete these tasks.

[Load data from cloud storage: Microsoft Azure](user-guide/tutorials/load-from-cloud-tutorial-azure.md)
:   Shows you how to load data from Microsoft Azure cloud storage into Snowflake using SQL.
    You can access a pre-loaded Snowsight template worksheet to complete these tasks.

[Load data from cloud storage: Google Cloud Storage](user-guide/tutorials/load-from-cloud-tutorial-gcs.md)
:   Shows you how to load data from Google Cloud Storage into Snowflake using SQL.
    You can access a pre-loaded Snowsight template worksheet to complete these tasks.

### Bulk load data

[Bulk load from a local file system using COPY](user-guide/tutorials/data-load-internal-tutorial.md)
:   Describes how to [bulk load data](user-guide/data-load-local-file-system.md) from files in your
    local file system into a table.

[Bulk load from Amazon S3 using COPY](user-guide/tutorials/data-load-external-tutorial.md)
:   Describes how to bulk load data from files in an existing Amazon Simple Storage Service (Amazon S3)
    bucket into a table.

### Work with semi-structured data

[Learn the basics of using JSON with Snowflake](user-guide/tutorials/json-basics-tutorial.md)
:   Describes the basics of using [JSON](user-guide/semistructured-data-formats.md) with Snowflake.

[Load JSON data into a relational table](user-guide/tutorials/script-data-load-transform-json.md)
:   Uses a [COPY INTO <table>](sql-reference/sql/copy-into-table.md) command with a SELECT statement to load individual
    elements in a staged JSON file into a table.

[Load and unload Parquet data](user-guide/tutorials/script-data-load-transform-parquet.md)
:   Describes how you can upload [Parquet](user-guide/semistructured-data-formats.md) data by transforming elements of
    a staged Parquet file directly into table columns using the [COPY INTO <table>](sql-reference/sql/copy-into-table.md) command. The
    tutorial also describes how you can use the [COPY INTO <location>](sql-reference/sql/copy-into-location.md) command to unload table data
    into a Parquet file.

## Tutorial to get started with security

Snowflake provides the following tutorial to get you started with security:

[Create users and grant roles](user-guide/tutorials/users-and-roles-tutorial.md)
:   Shows you how to create a [user](user-guide/admin-user-management.md) and grant a role to it
    by using SQL commands. You can access a pre-loaded [Snowsight template](user-guide/ui-snowsight/snowsight-templates.md)
    worksheet to complete these tasks.

## Other learning resources

These other learning sources are available:

[Tutorials](https://docs.snowflake.com/tutorials)
:   Explore a large repository of tutorials with hands-on examples that help you learn about Snowflake’s features.

[Snowflake Education Services](https://learn.snowflake.com/en/)
:   Discover instructor-led classes, on-demand courses, and self-directed learning to get you started with Snowflake.

[Snowflake for Developers](https://www.snowflake.com/en/developers/guides/)
:   Discover product quickstarts, industry-specific use cases, administration best practices, and reference architectures
    from Snowflake experts and partners.

[Snowflake Developers YouTube Channel](https://www.youtube.com/@snowflakedevelopers)
:   Discover Snowflake product tips, demos, and tutorials.

---
title: Unload data from Snowflake
source: https://docs.snowflake.com/en/guides-overview-unloading-data.md
section: General
---

# Unload data from Snowflake

Snowflake supports bulk unloading of data from a database table into flat, delimited text files.
The following topics detail the processes and procedures associated with unloading data.

[Overview of data unloading](user-guide/data-unload-overview.md)
:   Introduction and overview of unloading data.

[Summary of Data Unloading Features](user-guide/intro-summary-unloading.md)
:   Reference of the supported features for using the [COPY INTO <location>](sql-reference/sql/copy-into-location.md) command to unload data from Snowflake tables into flat files.

[Data unloading considerations](user-guide/data-unload-considerations.md)
:   Best practices, general guidelines, and important considerations for unloading data.

[File formats to unload data](user-guide/data-unload-prepare.md)
:   Supported data file formats for unloading data.

[Unload into a Snowflake stage](user-guide/data-unload-snowflake.md)
:   Instructions on using the COPY command to unload data from a table into an internal (i.e. Snowflake) stage.

[Unload into Amazon S3](user-guide/data-unload-s3.md)
:   Instructions on using the COPY command to unload data from a table into an Amazon S3 bucket.

[Unload into Google Cloud Storage](user-guide/data-unload-gcs.md)
:   Instructions on using the COPY command to unload data from a table into an Google Cloud Storage bucket.

[Unload into Microsoft Azure](user-guide/data-unload-azure.md)
:   Instructions on using the COPY command to unload data from a table into an Azure container.

---
title: Welcome to Snowflake Documentation
source: https://docs.snowflake.com/en/index.md
section: General
---

# Welcome to Snowflake Documentation

In these topics, you will find the information you need to access your Snowflake account and perform all the administrative and user tasks associated
with using Snowflake. The documentation also provides conceptual overviews, tutorials, and a detailed reference for all supported SQL commands,
functions, and operators.

You can start by browsing the contents on the left or using the search box at the top to search across the documentation and other Snowflake resources.
If you do not find the information you are looking for, please feel free to reach out to Snowflake Documentation or Snowflake Support using the buttons
at the bottom of each page.

## [Get started with Snowflake for users](getting-started-for-users.md)

[Before you begin](user-guide/setup.md)
:   Overview of getting an account and methods for accessing Snowflake.

[Sign in to Snowflake](user-guide/connecting.md)
:   Overview of the different ways to connect to Snowflake.

[Snowflake key concepts and architecture](user-guide/intro-key-concepts.md)
:   Description of Snowflake architecture, key concepts, and features.

[Snowsight quick tour](user-guide/ui-snowsight-quick-tour.md)
:   Overview of Snowsight, Snowflake’s web-based interface.

[Overview of the data lifecycle](user-guide/data-lifecycle.md)
:   Introduces the main operations and corresponding SQL commands for getting your data into Snowflake and
    then using it to perform queries and other SQL operations.

## [Tutorials and Other Resources](other-resources.md)

This topic provides links to assorted “how to” tutorials/labs and “best practices” for using Snowflake.

## [Using Snowflake](user-guide.md)

* [Snowsight: The Snowflake web interface](user-guide/ui-snowsight.md) — Learn how to use Snowsight for your Snowflake operations:

  > + [Snowsight quick tour](user-guide/ui-snowsight-quick-tour.md)
  > + [Getting started with Snowsight](user-guide/ui-snowsight-gs.md)
  > + [Work with worksheets in Snowsight](user-guide/ui-snowsight-worksheets.md)
  > + [Workspaces](user-guide/ui-snowsight/workspaces.md)
  > + [About Legacy Snowflake Notebooks](user-guide/ui-snowsight/notebooks.md)
  > + [Using Snowflake Copilot](user-guide/snowflake-copilot.md)
  > + [Visualizing data with dashboards](user-guide/ui-snowsight-dashboards.md)
  > + [Explore and manage database objects in Snowsight](user-guide/ui-snowsight-data.md)
  > + [Monitor query activity with Query History](user-guide/ui-snowsight-activity.md)
  > + [Evaluating and monitoring account security in the Trust Center](user-guide/trust-center/overview.md)
  > + [Manage Snowflake Support cases](user-guide/ui-support.md)
  > + [Set up and manage notification contacts for Snowflake](user-guide/ui-snowsight-contacts.md)
* [Virtual warehouses](user-guide/warehouses.md) — Key concepts and tasks for creating and using virtual warehouses to execute queries and perform DML operations, such as loading and unloading data:

  > + [Overview of warehouses](user-guide/warehouses-overview.md)
  > + [Multi-cluster warehouses](user-guide/warehouses-multicluster.md)
  > + [Warehouse considerations](user-guide/warehouses-considerations.md)
  > + [Working with warehouses](user-guide/warehouses-tasks.md)
  > + [Using the Query Acceleration Service (QAS)](user-guide/query-acceleration-service.md)
  > + [Monitoring warehouse load](user-guide/warehouses-load-monitoring.md)
* [Databases, Tables & Views](user-guide/databases.md) — Key concepts and tasks related to understanding and working with Snowflake databases and tables:

  > + [Understanding Snowflake Table Structures](user-guide/tables-micro-partitions.md)
  > + [Working with Temporary and Transient Tables](user-guide/tables-temp-transient.md)
  > + [Introduction to external tables](user-guide/tables-external-intro.md)
  > + [Overview of Views](user-guide/views-introduction.md)
  > + [Working with Secure Views](user-guide/views-secure.md)
  > + [Working with Materialized Views](user-guide/views-materialized.md)
  > + [Table Design Considerations](user-guide/table-considerations.md)
  > + [Cloning considerations](user-guide/object-clone.md)
  > + [Data storage considerations](user-guide/tables-storage-considerations.md)
* [Query Data in Snowflake](guides-overview-queries.md) — Key concepts and tasks for executing queries in Snowflake:

  > + [Working with joins](user-guide/querying-joins.md)
  > + [Understanding How Snowflake Can Eliminate Redundant Joins](user-guide/join-elimination.md)
  > + [Working with Subqueries](user-guide/querying-subqueries.md)
  > + [Querying Hierarchical Data](user-guide/queries-hierarchical.md)
  > + [Working with CTEs (Common Table Expressions)](user-guide/queries-cte.md)
  > + [Querying Semi-structured Data](user-guide/querying-semistructured.md)
  > + [Analyzing data with window functions](user-guide/functions-window-using.md)
  > + [Identifying Sequences of Rows That Match a Pattern](user-guide/match-recognize-introduction.md)
  > + [Using Sequences](user-guide/querying-sequences.md)
  > + [Using Persisted Query Results](user-guide/querying-persisted-results.md)
  > + [Computing the Number of Distinct Values](user-guide/querying-distinct-counts.md)
  > + [Estimating Similarity of Two or More Sets](user-guide/querying-approximate-similarity.md)
  > + [Estimating Frequent Values](user-guide/querying-approximate-frequent-values.md)
  > + [Estimating Percentile Values](user-guide/querying-approximate-percentile-values.md)
  > + [Querying data using worksheets](user-guide/ui-snowsight-query.md)
  > + [Canceling Statements](user-guide/querying-cancel-statements.md)
* [Introduction to loading semi-structured data](user-guide/semistructured-intro.md) — Key concepts and tasks for working with JSON and other types of semi-structured data:

  > + [Supported formats for semi-structured data](user-guide/semistructured-data-formats.md)
  > + [Considerations for semi-structured data stored in VARIANT](user-guide/semistructured-considerations.md)
  > + [Tutorial: JSON basics for Snowflake](user-guide/tutorials/json-basics-tutorial.md)
* [Introduction to unstructured data](user-guide/unstructured-intro.md) — Key concepts and tasks for working with unstructured data:

  > + [Directory tables](user-guide/data-load-dirtables.md)
  > + [REST API for unstructured data support](user-guide/data-load-unstructured-rest-api.md)
  > + [Share unstructured data with a secure view](user-guide/unstructured-data-sharing.md)
  > + [Troubleshooting processing of unstructured data](user-guide/unstructured-ts.md)
* [Snowflake Time Travel & Fail-safe](user-guide/data-availability.md) — Key concepts and tasks for understanding how Snowflake maintains access to deleted and modified data, and also how Snowflake enables data recovery in the
  event of loss:

  > + [Understanding & using Time Travel](user-guide/data-time-travel.md)
  > + [Understanding and viewing Fail-safe](user-guide/data-failsafe.md)
  > + [Storage costs for Time Travel and Fail-safe](user-guide/data-cdp-storage-costs.md)
* [Introduction to streams and tasks](user-guide/data-pipelines-intro.md) — Key concepts and tasks for transforming and optimizing loaded data for analysis:

  > + [Introduction to streams](user-guide/streams-intro.md)
  > + [Introduction to tasks](user-guide/tasks-intro.md)
* [Introduction to business continuity & disaster recovery](user-guide/replication-intro.md) — Key concepts and tasks for replicating and failing over databases across multiple Snowflake accounts, as well as redirecting client connections, for business continuity and disaster recovery:

  > + [Introduction to replication and failover across multiple accounts](user-guide/account-replication-intro.md)
  > + [Redirecting client connections](user-guide/client-redirect.md)
* [Sample data sets](user-guide/sample-data.md) — Key concepts and tasks for using the sample data sets provided with Snowflake:

  > + [Use the sample database](user-guide/sample-data-using.md)
  > + [Sample data: TPC-H](user-guide/sample-data-tpch.md)
  > + [Sample Data: OpenWeatherMap — Deprecated](user-guide/sample-data-openweathermap.md)
* [Alerts and Notifications](guides-overview-alerts.md) — Key concepts and tasks for sending email notifications in SQL (e.g. from a
  stored procedure, task, etc.) and setting up alerts to perform actions or send notifications when data in Snowflake meets
  certain conditions.

  > + [Setting up alerts based on data in Snowflake](user-guide/alerts.md)
  > + [Notifications in Snowflake](user-guide/notifications/about-notifications.md)
* [Snowflake Postgres](user-guide/snowflake-postgres/about.md) — Create, manage, and use Postgres instances directly from Snowflake:

  > + [Creating a Snowflake Postgres Instance](user-guide/snowflake-postgres/postgres-create-instance.md)
  > + [Connecting to Snowflake Postgres](user-guide/snowflake-postgres/connecting-to-snowflakepg.md)
  > + [Snowflake Postgres Roles](user-guide/snowflake-postgres/postgres-roles.md)
  > + [Snowflake Postgres Connection Pooling](user-guide/snowflake-postgres/postgres-connection-pooling.md)
  > + [Snowflake Postgres Maintenance](user-guide/snowflake-postgres/postgres-maintenance.md)
  > + [Snowflake Postgres Read Replicas](user-guide/snowflake-postgres/postgres-create-replica.md)
  > + [Snowflake Postgres High Availability](user-guide/snowflake-postgres/high-availability.md)
  > + [Snowflake Postgres Cost Evaluation](user-guide/snowflake-postgres/postgres-cost.md)
  > + [Snowflake Postgres Insights](user-guide/snowflake-postgres/insights.md)
  > + [Snowflake Postgres logging](user-guide/snowflake-postgres/postgres-logging.md)
  > + [Using Cortex Code CLI with Snowflake Postgres](user-guide/snowflake-postgres/postgres-cortex-code.md)
  > + [Snowflake Postgres networking](user-guide/snowflake-postgres/postgres-network.md)
  > + [Snowflake Postgres Instance Sizes](user-guide/snowflake-postgres/postgres-instance-sizes.md)
  > + [Snowflake Postgres Extensions](user-guide/snowflake-postgres/postgres-extensions.md)
  > + [Snowflake Postgres Server Settings](user-guide/snowflake-postgres/postgres-server-settings.md)

## [Managing Your Snowflake Account](user-guide-admin.md)

* [Account identifiers](user-guide/admin-account-identifier.md)

  > Detailed descriptions of the two unique account identifiers supported for connecting to Snowflake and using features that span multiple accounts.
* [Trial accounts](user-guide/admin-trial-account.md)

  > Instructions for signing up for a trial account, adding a credit card to the account, and canceling the account.
* [Parameter management](user-guide/admin-account-management.md)

  > Instructions for setting account, session, and object parameters for your account.
* [User management](user-guide/admin-user-management.md)

  > Instructions for creating and managing users in your account.
* [Behavior change management](release-notes/bcr-bundles/managing-behavior-change-releases.md)

  > Instructions for enabling and disabling behavior change releases in your account.

## [General reference](sql-reference.md)

* [Parameters](sql-reference/parameters.md) — parameters that can be used to control system behavior at the account, user, session, and object
  level.
* [References](sql-reference/references.md) — use references to authorize access on objects for owner’s rights stored procedures,
  applications, and classes.
* [Ternary logic](sql-reference/ternary-logic.md) — information about the behavior of NULL in Boolean expressions and with comparison operators.
* [Collation support](sql-reference/collation.md) — information about sorting and other character-set-dependent operations on text strings.
* [SQL format models](sql-reference/sql-format-models.md) — formats for specifying conversion of numeric and date/time values to and from text strings.
* [Object identifiers](sql-reference/identifiers.md) — rules for defining and using object identifiers, including resolving object names used in SQL
  statements:

  + [Identifier requirements](sql-reference/identifiers-syntax.md)
  + [Literals and variables as identifiers with IDENTIFIER() syntax](sql-reference/identifier-literal.md)
  + [Object name resolution](sql-reference/name-resolution.md)
* [Constraints](sql-reference/constraints.md) — concepts and reference information for defining and maintaining unique, primary key, and foreign
  key constraints in tables:

  + [Overview of constraints](sql-reference/constraints-overview.md)
  + [Creating constraints](sql-reference/constraints-create.md)
  + [Modifying constraints](sql-reference/constraints-alter.md)
  + [Dropping constraints](sql-reference/constraints-drop.md)
* [SQL variables](sql-reference/session-variables.md) — concepts and reference for defining and using variables in sessions.
* [Transactions](sql-reference/transactions.md) — concepts and reference for using transactions with SQL statements.
* [Table literals](sql-reference/literals-table.md) — concepts and reference for using table literals instead of a single scalar value in queries.
* [SNOWFLAKE database](sql-reference/snowflake-db.md) — reference for the SNOWFLAKE shared database, which is provided by Snowflake for
  querying/reporting on your organization, account, data sharing, and other object usage.
* [Snowflake Information Schema](sql-reference/info-schema.md) — concepts and reference for the Snowflake Information Schema, which consists of a set of metadata
  views and historical table functions for querying/reporting on objects in Snowflake.
* [Metadata fields in Snowflake](sql-reference/metadata.md) — concepts and reference for metadata fields in Snowflake.

## [SQL command reference](sql-reference-commands.md)

* [Query syntax](sql-reference/constructs.md) — structure of SQL queries in Snowflake.
* [Query operators](sql-reference/operators.md) — arithmetic, logical, and other types of operators.
* [Data Definition Language (DDL) commands](sql-reference/sql-ddl-summary.md) — overview of DDL commands.
* [Data Manipulation Language (DML) commands](sql-reference/sql-dml.md) — commands for performing DML operations, including:

  + Inserting, deleting, updating, and merging data in Snowflake tables.
  + Bulk copying data into and out of Snowflake tables.
  + Staging files for bulk copying.
* [All commands (alphabetical)](sql-reference/sql-all.md) — alphabetical list of all the commands.
* Commands categorized by the type of objects and operations they control, including:

  + General account-level objects (accounts, users, roles, security policies, integrations, etc.) and operations (failover & recovery, etc.).
  + Session-based operations (session context, queries, variables, transactions, etc.).
  + Virtual warehouses (for loading data and performing queries) and resource monitors (for controlling credit usage).
  + Databases, schemas, tables, and other schema-level objects (views, sequences, etc.).
  + Snowflake extensions and application development (user-defined functions, stored procedures, scripting, etc.).
  + Objects for sharing data (shares, listings, etc.).
  + Objects for classifying, protecting, and governing data (masking policies, row-access policies, tags, etc.).

## [Function and stored procedure reference](sql-reference-functions.md)

* [Summary of functions](sql-reference/intro-summary-operators-functions.md) — combined summary of all system-defined functions. Can be used as a
  quick-reference.
* [All functions (alphabetical)](sql-reference/functions-all.md) — alphabetical list of all system-defined functions (scalar, aggregate, table, etc.).
* [Aggregate functions](sql-reference/functions-aggregation.md) — functions that take multiple rows/values as input and return a single value.
* [Scalar functions](sql-reference/functions.md) — functions that take a single row/value as input and return a single value:

  + [Bitwise expression functions](sql-reference/expressions-byte-bit.md)
  + [Conditional expression functions](sql-reference/expressions-conditional.md)
  + [Context functions](sql-reference/functions-context.md)
  + [Conversion functions](sql-reference/functions-conversion.md)
  + [Data generation functions](sql-reference/functions-data-generation.md)
  + [Date & time functions](sql-reference/functions-date-time.md)
  + [Differential privacy functions](sql-reference/functions-differential-privacy.md)
  + [Encryption functions](sql-reference/functions-encryption.md)
  + [Geospatial functions](sql-reference/functions-geospatial.md)
  + [Hash functions](sql-reference/functions-hash-scalar.md)
  + [Metadata functions](sql-reference/functions-metadata.md)
  + [Notification functions](sql-reference/functions-notification.md)
  + [Numeric functions](sql-reference/functions-numeric.md)
  + [Semi-structured and structured data functions](sql-reference/functions-semistructured.md)
  + [String functions (regular expressions)](sql-reference/functions-regexp.md) — regular expression (search) functions
  + [String & binary functions](sql-reference/functions-string.md)
  + [Vector functions](sql-reference/functions-vector.md)
* [Model monitor functions](sql-reference/functions-model-monitors.md) — functions that retrieve metrics from machine learning model monitors.
* [System functions](sql-reference/functions-system.md) — functions that perform control operations or return system-level information.
* [Table functions](sql-reference/functions-table.md) — functions that return results in tabular format.
* [Window functions](sql-reference/functions-window.md) — functions that run analytic calculations, such as moving aggregations and rankings.
* [Data metric functions](sql-reference/functions-data-metric.md) — functions that enable data quality measurements for tables and views.
* [Stored procedures](sql-reference-stored-procedures.md) — stored procedures to facilitate using certain Snowflake features.

## [Snowflake Scripting reference](sql-reference-snowflake-scripting.md)

* [AWAIT](sql-reference/snowflake-scripting/await.md)
* [BEGIN … END](sql-reference/snowflake-scripting/begin.md)
* [BREAK](sql-reference/snowflake-scripting/break.md)
* [CANCEL](sql-reference/snowflake-scripting/cancel.md)
* [CASE](sql-reference/snowflake-scripting/case.md)
* [CLOSE](sql-reference/snowflake-scripting/close.md)
* [CONTINUE](sql-reference/snowflake-scripting/continue.md)
* [DECLARE](sql-reference/snowflake-scripting/declare.md)
* [EXCEPTION](sql-reference/snowflake-scripting/exception.md)
* [FETCH](sql-reference/snowflake-scripting/fetch.md)
* [FOR](sql-reference/snowflake-scripting/for.md)
* [IF](sql-reference/snowflake-scripting/if.md)
* [LET](sql-reference/snowflake-scripting/let.md)
* [LOOP](sql-reference/snowflake-scripting/loop.md)
* [NULL](sql-reference/snowflake-scripting/null.md)
* [OPEN](sql-reference/snowflake-scripting/open.md)
* [RAISE](sql-reference/snowflake-scripting/raise.md)
* [REPEAT](sql-reference/snowflake-scripting/repeat.md)
* [RETURN](sql-reference/snowflake-scripting/return.md)
* [WHILE](sql-reference/snowflake-scripting/while.md)

## [Appendices](appendices.md)

* [Notational conventions](sql-reference/conventions.md)

  > Notational conventions used in the Snowflake documentation.
* [Reserved & limited keywords](sql-reference/reserved-keywords.md)

  > List of words reserved for Snowflake SQL.

---
title: Working with organizations and accounts
source: https://docs.snowflake.com/en/guides-overview-manage.md
section: General
---

# Working with organizations and accounts

The following topics describe how to manage Snowflake organizations and accounts.

## Organizations

[Introduction to organizations](user-guide/organizations.md)
:   Learn about organizations, which link the accounts owned by your business entity. You can find the name of your organization, list the
    accounts in your organization, and change the name of your organization.

[Organization administrators](user-guide/organization-administrators.md)
:   Learn about the system roles that administrators use to perform organization-level tasks.

[Organization users](user-guide/organization-users.md)
:   Learn about using organization users for users who need access to multiple accounts within the organization.

[Managing accounts in your organization](user-guide/organizations-manage-accounts.md)
:   Manage the lifecycle of an account such as creating it and deleting it. Also, manage the general characteristics of an account like
    its Snowflake edition.

[Connecting to your accounts](user-guide/organizations-connect.md)
:   Connect to accounts in your organization from SnowSQL, connectors, drivers, and through Snowsight.

## Organization accounts

[Organization accounts](user-guide/organization-accounts.md)
:   Learn how organization administrators of multi-account organizations use an organization account. Also, use premium views in the
    ORGANIZATION_USAGE schema to track usage across the organization.

## Accounts

[Account identifiers](user-guide/admin-account-identifier.md)
:   Learn how to use account identifiers to specify the account that you are using (e.g. to connect to the account, use
    Snowsight, etc.).

[Trial accounts](user-guide/admin-trial-account.md)
:   Sign up for a trial account, convert that account to a paid account, and cancel the trial account.

[Parameter management](user-guide/admin-account-management.md)
:   View and alter parameters for your account.

[User management](user-guide/admin-user-management.md)
:   Create, modify, view, and drop users in your account.

[Behavior change management](release-notes/bcr-bundles/managing-behavior-change-releases.md)
:   Enable, disable, and check the status of behavior changes.

## Loading & Unloading Data

Stages, COPY INTO, Snowpipe, file formats, and connectors for ingesting and exporting data.

---
title: AbortQueryJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/abortqueryjob.md
section: Loading & Unloading Data
---

# AbortQueryJob 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Aborts a Query Job in Salesforce using the Bulk API 2.0.

## Tags

abort, bulk, job, preview, query, salesforce

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Job ID | The ID of the job for which the status is checked. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the Query Job could not be aborted but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the Query Job could not be aborted |
| success | If the Query Job has been successfully aborted, the FlowFile is routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobStatus](getqueryjobstatus.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: About Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/about.md
section: Loading & Unloading Data
---

# About Openflow

Snowflake Openflow is an integration service that connects any data
source and any destination with hundreds of processors supporting
structured and unstructured text, images, audio, video and sensor data.
Built on [Apache NiFi](https://nifi.apache.org/), Openflow lets you run a fully managed service in
your own cloud for complete control.

> **Note:**
>
> The Openflow platform is currently available for deployment in customers’ own VPCs in both AWS and Snowpark Container Services.

This topic describes the key features of Openflow, its benefits,
architecture, and workflow, and use cases.

## Key features and benefits

Open and extensible
:   An extensible managed service that’s powered
    by Apache NiFi, enabling you to build and extend processors from any
    data source to any destination.

Unified data integration platform
:   Openflow enables data engineers to handle complex,
    bi-directional data extraction and loading through a fully managed service that can be deployed inside your
    own VPC or within your Snowflake deployment.

Enterprise-ready
:   Openflow offers out-of-the box security,
    compliance, and observability and maintainability hooks for data
    integration.

High speed ingestion of all types of data
:   One unified platform lets you handle structured and unstructured data, in both batch
    and streaming modes, from your data source to Snowflake at virtually
    any scale.

Continuous ingestion of multimodal data for AI processing
:   Near real-time unstructured data ingestion, so you can immediately chat
    with your data coming from sources such as Sharepoint, Google Drive,
    and so on.

## Openflow deployment types

Openflow is supported in both the Bring Your Own Cloud (BYOC) and Snowpark Container Services (SPCS) versions.

Openflow - Snowflake Deployment
:   Feature — Generally Available

    Snowflake Openflow - Snowflake Deployments are available to all accounts in AWS and Azure [Commercial regions](../../intro-regions.md).

    Openflow - Snowflake Deployment, using [Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) (SPCS),
    provides a streamlined and integrated solution for connectivity.
    Because SPCS is a self-contained service within Snowflake, it’s easy to deploy and manage.
    SPCS offers a convenient and cost-effective environment for running your data flows.
    A key advantage of Openflow - Snowflake Deployment is its native integration with Snowflake’s security model,
    which allows for seamless authentication, authorization, network security and simplified operations.

    When configuring Openflow - Snowflake Deployments, follow the process as outlined in [Setup Openflow - Snowflake Deployment](setup-openflow-spcs.md).

Openflow - Bring Your Own Cloud
:   Feature — Generally Available

    Snowflake Openflow BYOC deployments are available to all accounts in AWS [Commercial regions](../../intro-regions.md).

    Openflow - Bring Your Own Cloud (BYOC) provides a connectivity solution that you can use
    to connect public and private systems securely and handle sensitive data preprocessing
    locally, within the secure bounds of your organization’s cloud environment.
    BYOC refers to a deployment option where the Openflow data
    processing engine, or data plane, runs within your own cloud environment
    while Snowflake manages the overall Openflow service and control plane.

    When configuring BYOC deployments, follow the process as outlined in [Set up Openflow - BYOC](setup-openflow-byoc.md).

## Use cases

Use Openflow if you want to fetch data from any source and put it
in any destination with minimal management, coupled with Snowflake’s built-in data security and governance.

Openflow use cases include:

* Ingest data from unstructured data sources, such as Google Drive and Box, and make
  it ready for chat in your AI assistants with Snowflake Cortex or use the data for your own custom processing.
* Replicate the change data capture (CDC) of database tables into Snowflake for comprehensive, centralized
  reporting.
* Ingest real-time events from streaming services, such as Apache Kafka, into Snowflake for near real-time analytics.
* Ingest data from SaaS platforms, such as LinkedIn Ads, to Snowflake for reporting, analytics, and insights.
* Create an Openflow dataflow using Snowflake and NiFi
  [processors](processors/index.md) and [controller services](controllers/index.md).

## Security

Openflow uses industry-leading security features that help ensure you have
the highest levels of security for your account, and users,
and all the data you store in Snowflake. Some key aspects include:

Authentication
:   * Runtimes use Snowflake Managed Token as the
      default and recommended authentication method.
    * Snowflake Managed Token works consistently across SPCS and BYOC deployment types.
    * BYOC deployments can alternatively use key-pair authentication for explicit credential management.

Authorization
:   * Openflow supports fine-grained roles for RBAC.
    * ACCOUNTADMIN to grant privileges to be able to create deployments and runtimes.

Encryption in-transit
:   * Openflow connectors support TLS protocol, using standard Snowflake clients for data ingestion.
    * All the communications between the Openflow deployments and Openflow control plane are encrypted using TLS protocol.

Secrets management (BYOC)
:   * Integration with AWS Secrets Manager or Hashicorp Vault. For more information,
      see [Encrypted Passwords in Configuration Files](https://nifi.apache.org/docs/nifi-docs/html/administration-guide.html#encrypt-config_tool).

Private link support
:   * Openflow connectors are compatible with reading and writing data to Snowflake using inbound AWS PrivateLink.

Tri-Secret Secure support
:   * Openflow connectors are compatible with [Tri-Secret Secure](../../security-encryption-tss.md) for writing data to Snowflake.

## Snowflake Managed Token authentication

Snowflake Managed Token is the recommended and default authentication method for Openflow
runtimes to connect to Snowflake. This authentication method works consistently across both
[Openflow - Snowflake Deployments](about-spcs.md) and [BYOC deployments](about-byoc.md).
Snowflake Managed Token provides a unified and simplified experience for configuring Snowflake connectivity.

### Key benefits

Simplified configuration
:   Snowflake Managed Token eliminates the need to generate, store, and rotate long-lived credentials
    such as key pairs. The token is automatically managed by Snowflake, reducing operational overhead.

Unified across deployment types
:   Whether you deploy Openflow in Snowpark Container Services (SPCS) or Bring Your Own Cloud (BYOC),
    you configure authentication the same way using the `SNOWFLAKE_MANAGED` authentication strategy.

Enhanced security
:   Tokens are short-lived and automatically refreshed, minimizing the risk associated with credential exposure.

### How it works

When you configure a connector or processor to connect to Snowflake, select `SNOWFLAKE_MANAGED`
as the Snowflake Authentication Strategy. The runtime automatically obtains and manages
the token used to authenticate to Snowflake on your behalf.

The behavior of Snowflake Managed Token varies based on your deployment type:

Openflow - Snowflake Deployments
:   When running in a Snowflake-managed deployment, the runtime uses
    [SPCS session tokens](../../../developer-guide/snowpark-container-services/overview.md)
    provided natively by the SPCS environment.
    These tokens are available at runtime and require no additional configuration.

BYOC deployments
:   When running in a BYOC deployment, the runtime uses
    [workload identity federation](../../workload-identity-federation.md)
    to authenticate to Snowflake.
    The runtime automatically exchanges its cloud provider identity
    (for example, an AWS IAM role) for a Snowflake token.

    > **Note:**
    >
    > To use Snowflake Managed Token in BYOC deployments, you must first configure
    > [runtime roles](setup-openflow-byoc.md) for your deployment.

### When to use Snowflake Managed Token

Use Snowflake Managed Token for:

* All new connector configurations in both SPCS and BYOC deployments.
* Migrations from key-pair authentication to the simplified, managed authentication model.
* Scenarios where you want to avoid managing key pairs or other long-lived credentials.

### Alternative authentication methods

While Snowflake Managed Token is recommended, BYOC deployments also support key-pair authentication
(`KEY_PAIR`) for cases where you require explicit credential management.
For more information about key-pair authentication, see [Key-pair authentication and key-pair rotation](../../key-pair-auth.md).

For information about the underlying authentication mechanisms, see the following:

* [Workload identity federation](../../workload-identity-federation.md): Information about the authentication mechanism used in BYOC deployments.
* [Snowpark Container Services: Working with services](../../../developer-guide/snowpark-container-services/working-with-services.md): Information about how SPCS services authenticate to Snowflake.

## Architecture

The following diagram illustrates the architecture of Openflow:

The deployment agent installs and bootstraps the Openflow deployment infrastructure in your
VPC and regularly sync container images from the Snowflake system image registry.

Openflow components include:

Deployments
:   A deployment is where your data flows execute, within individual runtimes.
    You will often have multiple runtimes to isolate different projects, teams, or for SDLC reasons, all associated with a single deployment.
    Deployments come in two types [Bring Your Own Cloud (BYOC)](about-byoc.md)
    and [Openflow - Snowflake](about-spcs.md).

Control plane
:   The control plane is a layer containing all components used to manage and observe Openflow runtimes.
    This includes the Openflow service and API, which users interact with via the Openflow canvas or through interaction with Openflow APIs.
    On Openflow - Snowflake Deployments, the Control Plane consists of Snowflake-owned
    public cloud infrastructure and services as well as the control plane application itself.

BYOC deployments
:   BYOC deployments are deployments acting as containers for runtimes that are deployed in *your* cloud environment.
    They incur charges based on their compute, infrastructure, and storage use.
    See [Openflow BYOC cost and scaling considerations](cost-byoc.md) for more information.

Openflow - Snowflake Deployments
:   Openflow - Snowflake Deployments are containers for runtimes and are deployed
    using a [compute pool](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md).
    They incur utilization charges based on their uptime and usage of compute.
    See [Openflow Snowflake Deployment cost and scaling considerations](cost-spcs.md) for more information.

Runtime
:   Runtimes host data pipelines, with the framework providing security, simplicity, and scalability.
    You can deploy Openflow runtimes in your VPC using Openflow.
    You can deploy Openflow connectors to your runtimes, and also build completely new pipelines
    using Openflow processors and controller services.

Openflow - Snowflake Deployment Runtime
:   Openflow - Snowflake Deployment Runtimes are deployed as [Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) service
    to an Openflow - Snowflake Deployment deployment, which is represented by an underlying compute pool.
    Customers request a Runtime through the deployment, which executes a request on behalf of the user to service.
    Once created, customers access it via a web browser at the URL generated for that underlying service.

---
title: About Openflow - Snowflake Deployments
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/about-spcs.md
section: Loading & Unloading Data
---

# About Openflow - Snowflake Deployments

Openflow - Snowflake Deployment run on [Snowpark Container Services (SPCS)](../../../developer-guide/snowpark-container-services/overview.md) and
provide a streamlined and integrated solution for data integration and connectivity across interoperable storage like Iceberg and Snowflake native storage.
As a fully self-contained service within Snowflake, it’s easy to deploy and manage, offering a convenient and cost-effective environment for running your data flows.
A key advantage is its native integration with Snowflake’s security model, which allows seamless authentication, authorization, and network security, and simplified operations.

Although customers can have both BYOC and Snowflake Deployments, the following list use cases that are well-suited to Snowflake Deployments:

* Incorporating full-fidelity data in the bronze layer: Landing raw data from various sources directly into Snowflake and using Openflow Snowflake Deployments to extract and load.
* Enriching data: Running pipelines to enrich tables that already exist inside Snowflake.
* From ingest to insight in one place: Building applications where the entire data lifecycle (ingest, process and serve) happens within the Snowflake ecosystem.
* Transforming raw data to insights with AI: Ingesting unstructured data and then, for instance, using Snowflake Intelligence to search and understand it better, all in concert with users’ other structured data.
* Employing reverse ETL: Closing the loop on insight generation by sharing with external operational systems via APIs, messaging infrastructure, and more.

## Understanding Snowflake roles and External Access Integrations

Openflow - Snowflake Deployments must be able to interact with data sources and destinations
that are typically outside Snowflake. In addition these deployments must also be able
to communicate with and access Snowflake itself.
Snowflake roles and external access integrations provide this support.

### What is a Snowflake role?

A Snowflake role is a traditional Snowflake role, associated with a specific Openflow Runtime, and used for the following tasks:

* Grant access to external access integrations (EAIs).
  These EAIs specify rules that allow the runtime
  to access the data sources and destinations from within Snowflake itself.
* Grant access to Snowflake resources.
* Grant access to resources that are connector-specific

Snowflake roles are linked to Openflow session tokens, avoiding the need for customers
to create separate service users and key pairs for authentication to Snowflake.

### What is an External Access Integration(EAI) within Openflow?

An [External access integration](../../../developer-guide/external-network-access/external-network-access-overview.md) (EAI)
is a Snowflake object designed to provide secure access to external resources,
like source systems from which Openflow connectors pull external data.
Openflow Snowflake Deployments use EAIs and network rules together to define the
endpoints an Openflow connector can read from or write to.

Data engineers define and configure EAIs and Snowflake roles specific to a given connector and its underlying runtime.

## Typical Openflow - Snowflake Deployment workflow

The following sections describe Openflow - Snowflake Deployment concepts and workflows.

| User persona | Task |
| --- | --- |
| Snowflake administrator | * Configures core Snowflake and external access integrations.  See [Set up Openflow - Snowflake Deployment - Task overview](setup-openflow-spcs.md). * Creates a set of deployments in Snowflake.  The Openflow UI is used to manage deployments and runtime creation and maintenance. The Openflow UI   allows users to create, upgrade, and delete runtimes in all deployments. |
| Data engineer (pipeline author, responsible for data ingestion) | * Works with a Snowflake administrator to configure required allow listed domains such   that Openflow - Snowflake Deployment can access the external data sources. * Creates Snowflake roles, external integrations, and other objects that can later be used by runtimes. * Uses the runtime canvas to build completely new flows or to configure deployed connectors.   Creates a completely new flow or uses an existing connector as-is or as a starting point to customize.  Connectors are a simple way to solve for a specific integration use case, and less technical users can deploy them without assistance from a data engineer. |
| Data engineer (pipeline operator) | Configures flow parameters and runs the flow. |
| Data engineer (responsible for transformation to silver and gold layers) | Responsible for transforming data from the bronze layer that was populated by the pipeline to silver and gold layers for analytics. |
| Business user | Makes use of gold layer objects for analytics. |

## Limitations

* Openflow - Snowflake Deployment is not supported in trial accounts.
* Only a single Openflow - Snowflake Deployment is supported per account.
  However, an account can have many Openflow - Snowflake Deployment runtimes — each having a separate role and network access — which allows users to separate the workload.
* Users with a default role of ACCOUNTADMIN can’t login to Openflow - Snowflake Deployment runtimes and will get an error message when attempting to do so.
* Customers requiring private connectivity will need to configure [outbound PrivateLink](../../private-connectivity-outbound.md).
  Private Link is available to [Business Critical Edition](../../intro-editions.md) only.

### Next steps

[Set up Openflow - Snowflake Deployment - Task overview](setup-openflow-spcs.md)

---
title: About Openflow Connector for Amazon Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/amazon-ads/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Amazon Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Amazon Ads,
its workflow, and limitations.

The Openflow Connector for Amazon Ads automatically ingests [Amazon Ads](https://advertising.amazon.com/) data into your Snowflake account by using
Amazon Ads [Reporting API V3](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/overview). Reporting API enables you to configure custom reports with selected
[report types](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/overview),
[columns](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/columns), filters and other groupings.

Use this connector if you’re looking to do the following:

* Bring data from Amazon Ads for Ad performance statistics and insights

## Workflow

1. A **Amazon Ads administrator** gets access to Reporting API by following the [onboarding instructions](https://advertising.amazon.com/API/docs/en-us/guides/onboarding/overview),
   [generates a refresh token](https://advertising.amazon.com/API/docs/en-us/guides/get-started/retrieve-access-token)
   and [retrieves the client ID and client secret](https://advertising.amazon.com/API/docs/en-us/guides/onboarding/create-lwa-app#retrieve-your-security-credentials).
2. 1. A **Snowflake account administrator** performs the following:
   2. Installs the connector.
   3. Configures the connector with the required parameters, for example refresh token, report configuration, and database and schema names.
   4. Runs the connector flow. The connector does the following:

      1. Fetches the specified report as specified in the connector configuration.
      2. Creates a temporary table and puts the report chunks in it.
      3. Creates a table in the provided destination schema.
      4. Synchronises data from the temporary table to the destination table.
      5. Removes the temporary table.
3. **Marketing users** with Snowflake access can view and perform operations on the data downloaded from Amazon Ads to destination tables.

## Limitations

* The connector supports incremental ingestion only for the daily value of `Report Time Increment` parameter.
* Modification of the report definition when the processors are running might lead to data inconsistencies.
  To ensure consistency, stop the processors and clear the queues before updating the configuration.
* If the Amazon Ads API [rate limit](https://advertising.amazon.com/API/docs/en-us/reference/concepts/rate-limiting)
  is reached, the data doesn’t get ingested despite the connector attempting to pull data from the source system.

## Next steps

[Set up the Openflow Connector for Amazon Ads](setup.md)

---
title: About Openflow Connector for Box
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/box/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Box

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Box, its workflow, and limitations.

The Openflow Connector for Box connects a Box enterprise with Snowflake.

Use this connector to do the following:

* Ingest Box content for your own custom processing in Snowflake
* Ingest Box content and make it ready for chat in your AI assistants with Snowflake Cortex
* Use Box AI to extract metadata from Box content for enrichment in Snowflake
* Add enriched metadata from Snowflake to content in Box

## Workflow

1. A **Box developer** creates a Box Platform app and submits it for authorization.
2. A **Box administrator** authorizes the app.
3. The **Box developer** then performs the following tasks:

   1. Shares a Box folder with the app service account.
   2. Shares a Platform app configuration JSON file and a folder ID with a Snowflake account administrator.
4. A **Snowflake account administrator** performs the following tasks:

   1. Installs the connector.
   2. Configures the connector with Snowflake connection details and the data provided by the Box developer.
   3. Runs the connector flow. The connector does the following:

      1. Creates the required tables, stages, and a Cortex Search service in the specified Snowflake schema.
      2. Fetches Box file content and permissions from the folder specified in the connector configuration.
      3. Runs parsing and chunking on the fetched documents, and saves them in Snowflake tables. The saved chunks are automatically indexed by the Cortex Search service.
5. A **Chatbot developer** uses the Cortex Search service to build a chatbot application.

## Limitations

* [Cortex Parse Document limitations and requirements](../../../../snowflake-cortex/parse-document.md)
* [Cortex Search limitations](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md)
* Changes caused by moving folders out of the specified root folder aren’t captured during incremental ingestion.
* The connector ingests only the supported file types and ignores others.

> **Note:**
>
> These limitations apply to the predefined connector flow.
> If the flow is customized and doesn’t use some or all of the predefined components, then these limitations may not apply.

## Next steps

[Set up the Openflow Connector for Box](setup.md)

---
title: About Openflow Connector for Google Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-ads/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Google Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for
Google Ads, steps to set it up, and limitations.

Google Ads is an online advertising platform where advertisers can
create and run ads to promote their products or services. Through Google
Ads, you can create online ads to reach people exactly when they’re
interested in offered products and services.

The Openflow Connector for Google Ads:

* Automatically ingests Google Ads data into your Snowflake account.
* Downloads data using the [Google Ads
  API](https://cloud.google.com/endpoints/docs/openapi/enable-api).
* Lets you configure custom reports with chosen attributes,
  [metrics](https://developers.google.com/google-ads/api/fields/v17/metrics),
  and
  [segments](https://support.google.com/google-ads/answer/2454072).

Use this connector if you’re looking to do the following:

* Import metrics from Google Ads for performance tracking and optimization

## Use cases

### Run the connector in different ingestion modes

There are two ways of ingesting data incrementally and as a snapshot.
Snapshot mode is a default one and is on as long as **segments.date**
segment is not selected. It creates a table in the provided destination
schema and appends on each schedule the newest data from Google Ads.

To configure incremental ingestion user has to fill Report Segments
parameter with segment named **segments.date**, other segments can be
still preset. Then data will be overlapped between the one we fetched
previously and the date range of the current run. The overlap is caused
by the conversion window as we need to ask for the historical data for
the number of days that’s equal to the conversion window, for example, if the
conversion window is set to 14 days and the ingestion happens every day,
there is 13 days of overlap.

### Reconfigure currently running connector

The report configuration can be changed when the processor is running.
To do so go to GetGoogleAdsReportContext and change your desired
parameters. Upon changing only Report Attributes, Metrics or Segments
parameters, the current destination table will be removed and a new one
with updated schema will be created, so before updating them please be
aware that already downloaded data will be deleted.

When the Resource Name or Account Client ID will be changed a new table
will be created. The old destination table will not be dropped.

Modifying the Schedule and Conversion window will not affect in any way
the data already fetched in the destination table.

When the Start Date will be changed, the connector will perform a single
ingestion from that date to the current date and then proceed as
normally in incremental mode. If there is data downloaded from the
period between new Start Date and current date it will be replaced after
change. Data before the new Start Date will not be affected.

#### Rate Limiting Restrictions

[Google Ads API limits](https://developers.google.com/google-ads/api/docs/access-levels) govern how many requests can be made within a given time frame. If your flow exceeds the allowed quota, syncs may slow down or fail with an error. This mostly occurs when your access token makes higher number of requests than the source typically allows. In such cases, we recommend applying for higher access quota (wherever appliable) or reducing the sync frequency.

## Limitations

* Filtering is not supported. Instead, data can be filtered after
  ingestion.
* Custom column ingestion is not supported.
* When segmenting reports, if all selected metrics are zero, they are
  always excluded.
* Attributed resource ingestion is not supported. Instead, multiple
  reports can be joined after ingestion.
* There can be only one report for selected resource name and client id
  pair.
* Modification of report definition when processors are running may lead
  to data inconsistencies. To ensure consistency, before updating
  configuration stop processors and clear queues.

## Next steps

[Set up the Openflow Connector for Google Ads](setup.md)

---
title: About Openflow Connector for Google Drive
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-drive/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Google Drive

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for Google Drive connects a Google Workspace Shared Drive and Snowflake to ingest
files and user permissions and keeps them up to date. Openflow Connector for Google Drive also supports the
Cortex Search service and can make ingested files ready for conversational
analysis for use in AI Assistants using SQL, Python or REST APIs.

Use this connector if you’re looking to do the following:

* Ingest Google Drive content for your own custom processing in Snowflake
* Ingest Google Drive content and make it ready for chat in your AI assistants with Snowflake Cortex

## Limitations

1. [Cortex Parse Document limitations and requirements](../../../../snowflake-cortex/parse-document.md).
2. [Cortex Search limitations](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md).
3. Changes caused by moving or renaming folders aren’t captured during
   incremental ingestion.
4. The connector supports only explicit Google Permissions for Users and
   Groups. It does not currently support authentication models for links
   shared with Anyone.
5. The connector ingests only the supported file types and ignores
   others.

Please note, the limitations are listed for the predefined versioned
flow. If the flow was customized, and it doesn’t use some of the
predefined components, the limitations related to these components won’t
apply.

## Next steps

[Set up the Openflow Connector for Google Drive](setup.md)

---
title: About Openflow Connector for Google Sheets
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-sheets/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Google Sheets

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Google Sheets, its workflow, and limitations.

The Openflow Connector for Google Sheets enables the ingestion of Google Sheets data into Snowflake. It uses the Google Sheets API
to fetch data and persist that data in a table dedicated to a given range from a sheet.
The connector creates the destination table in the database and the schema provided in the configuration.

Use this connector if you’re looking to do the following:

* Load data from Google sheets into Snowflake tables for reporting, analytics and insights

## Workflow

1. A **Google Cloud administrator** creates a service account and a key as described in
   [Service account credentials](https://developers.google.com/workspace/guides/create-credentials#service-account).
2. A **Google Sheets user** creates a Google Sheets spreadsheet and shares it with the service account.
   The first row of data represents the column names in the destination table that the connector will create.
   It cannot contain actual data. If a column contains multiple data types, the connector selects the least restrictive type.
3. A **Snowflake account administrator** configures the connector as follows:

   1. Installs the connector.
   2. Creates Snowflake warehouse, destination database, destination schema, and key.
   3. Specifies the required parameters for the connector, such as Snowflake Warehouse, Destination Database, Snowflake Key, and Spreadsheet ID.
   4. Runs the connector flow. The connector performs the following tasks when run in Openflow:

      1. Retrieves the data from a specified spreadsheet.
      2. Creates and updates the destination table to reflect the schema of data from Google Sheets.
         If the destination table is not created, then it is truncated.
      3. Inserts the data into the destination table.

## Limitations

* The connector saves numeric values from a sheet only as INT or DOUBLE types.
  Because of this, small rounding errors may occur in the least significant digits if sheets contain floating point numbers.
  The connector currently doesn’t support higher precision.
* Incremental load is not supported. The connector uses the truncate and load ingestion strategy.

## Next steps

[Set up the Openflow Connector for Google Sheets](setup.md)

---
title: About Openflow Connector for HubSpot
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/hubspot/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for HubSpot

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for HubSpot, its workflow, and limitations.

The Openflow Connector for HubSpot ingests HubSpot data into Snowflake. It uses the HubSpot API to retrieve data, which is then stored in a Snowflake table.
Data ingestion happens in the following two phases:

1. Initial load, where all data is retrieved during the first API call.
2. Incremental load, which merges the updates and new data into the destination table and uses timestamps from
   previous calls to limit the result to the issues that were updated since the last data load.

For more information about HubSpot private apps, see [Private apps](https://developers.hubspot.com/docs/guides/apps/private-apps/overview).

Use this connector if you’re looking to do the following:

* Get HubSpot CRM data into Snowflake for reporting, analytics, and insights

## Workflow

1. A HubSpot administrator performs the following tasks:

   1. Generates an API token within the HubSpot instance with the necessary scopes required for the API requests intended to make.
      This token is used by the connector for authentication.
   2. Defines the criteria to search objects like `Object Types` and `Updated After (optional)` fields.
2. A Snowflake account administrator performs the following tasks:

   1. Installs the connector.
   2. Configures the connector parameters:

      * Provides the HubSpot private app API token.
      * Defines the criteria for the objects being ingested by providing filters.
      * Sets the desired database and schema names within Snowflake.
   3. Runs the connector flow. Upon execution, the connector does the following:

      1. Creates an API call to fetch objects from the configured HubSpot instance.
      2. Extracts the relevant data.
      3. Creates the configured destination table in the Snowflake database if the API call returned at least one result.
      4. Loads raw data into the specified Snowflake table and creates a processed view on top of the raw data.

## Limitations

* When multiple object types are defined, filtering by ‘Updated After’ applies to all object types defined in the parameter context.
* Currently, the connector supports basic authentication using a HubSpot private app and API token.
  This means that the connector is only able to ingest data that is accessible to the owner of the API token.
* The processors are designed to work on the primary node only with one thread.
* The number of calls your private app can make is based on your account subscription. To learn more about HubSpot private app limits,
  see [Private app limits](https://developers.hubspot.com/docs/guides/apps/private-apps/overview#private-app-limits).

## Next steps

[Set up the Openflow Connector for HubSpot](setup.md)

---
title: About Openflow Connector for Jira Cloud
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/jira-cloud/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Jira Cloud

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Jira Cloud, its workflow, and limitations.

The Openflow Connector for Jira Cloud ingests Atlassian Jira issues data to Snowflake. It uses the [Jira Cloud REST API](https://developer.atlassian.com/cloud/jira/platform/rest/v3/intro/#about)
and [Jira Query Language (JQL)](https://support.atlassian.com/jira-service-management-cloud/docs/use-advanced-search-with-jira-query-language-jql/) to retrieve data, which is then stored in a Snowflake table and is accessible via a view.
Data ingestion occurs in two phases:

1. An initial load, where all data is retrieved during the initial API call.
2. Incremental loads, which merge updates and new data into the destination table and use
   timestamps from previous calls to limit the result to the issues that were updated since the last load.

Use this connector if you’re looking to do the following:

* Extract Jira issues and project details for cross‐team visibility and deeper insights

## Workflow

1. A **Jira Cloud administrator** performs the following tasks:

   1. Generates an API token within the Jira instance. This token will be used by the connector for authentication.
      Both tokens with scopes (with `read:jira-work` and `read:jira-user` scopes) and without scopes are supported,
      although tokens with scopes are recommended for better fine-grained access control.
   2. Defines the criteria to search issues, such as project name, created field, and updated field.
2. A **Snowflake account administrator** performs the following tasks:

   1. Installs the connector.
   2. Configures the connector:

      1. Provides the Jira API token.
      2. Specifies the Jira instance URL.
      3. Defines the criteria for the issues being ingested, by providing JQL query or, for simpler cases just the project name.
      4. Sets the database and schema names in the Snowflake account.
   3. Runs the connector flow in the Openflow canvas. Upon execution, the connector performs the following actions:

      1. Creates an API call to fetch issues from the configured Jira instance.
      2. Extracts relevant data, such as issue creation dates, statuses, and assignees.
      3. Creates the configured destination table in the Snowflake database if the API call returned at least one result.
      4. Loads the processed data into the specified Snowflake table.
3. **Snowflake Business users** can then access views, and perform operations on the data downloaded from Jira Cloud to destination tables.

## Limitations

* Each connector instance can be associated with only one JQL search query.
* Timestamps in connector properties reflect the timezone of Jira Cloud, potentially resulting in discrepancies with the user’s local timezone.
  The Jira Cloud timezone is fetched once and kept in the state of the FetchJiraIssues processor. Updating the connector’s timezone requires clearing the state of this processor.
* The connector is unable to reflect deletions in the target Snowflake tables as the Jira Cloud REST API does not return information about data deletion.
* Basic authentication using an email and API token is the only supported authorization method. As a result the connector can only ingest data accessible by the owner of the API token.
* The FetchJiraIssues processor is single threaded, and designed to work on the primary node.

## Next steps

[Set up the Openflow Connector for Jira Cloud](setup.md)

---
title: About Openflow Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for
Kafka and limitations.

Apache Kafka software uses a publish and subscribe model to write and
read streams of records, similar to a message queue or enterprise
messaging system. Kafka allows processes to read and write messages
asynchronously. A subscriber does not need to be connected directly to a
publisher; a publisher can queue a message in Kafka for the subscriber
to receive later.

An application publishes messages to a topic, and an application
subscribes to a topic to receive those messages. Kafka can process, as
well as transmit, messages; however, that is outside the scope of this
document. Topics can be divided into partitions to increase scalability.

The Openflow Connector for Kafka reads data from Kafka topics and writes
it into Snowflake tables using the [Snowpipe Streaming](../../../../snowpipe-streaming/data-load-snowpipe-streaming-overview.md) mechanism.

Use this connector if you’re looking to do the following:

* Ingest real‐time events from Apache Kafka into Snowflake for near real-time analytics

## Limitations

* If the `Topic To Table Map` parameter is not set:

  + Table names must precisely match the topic of the data they hold.
  + Table names must be in uppercase format.
* If the `Topic To Table Map` parameter is set:

  + Table names must match the table names specified in the mapping. The table names must be a valid Snowflake unquoted identifier. For information about valid table names, see [Identifier requirements](../../../../../sql-reference/identifiers-syntax.md).
* Only JSON and AVRO formats are supported.
* Only Confluent Schema Registry is supported.
* *PLAINTEXT*, *SASL_PLAIN*, *SSL*, and *SASL_SSL* security protocols are
  supported.
* *PLAIN*, *SCRAM-SHA-256*, *SCRAM-SHA-512* and *AWS_MSK_IAM* SASL mechanisms are
  supported.
* *mTLS* and *AWS MSK IAM* authentication methods require extra configuration via services. See [Configure other authentication methods for Openflow Connector for Kafka](authentication.md) for more details.
* In case of data insertion failure into a table, the connector will
  keep retrying infinitely.

## Field name mapping and special characters handling

When mapping field names from Kafka messages to Snowflake column names, the connector applies the following transformations to ensure compatibility with Snowflake naming conventions:

1. **First character**: The first character of the field name must be a letter. If it is not a letter, it is changed to an underscore.
2. **Other characters**: All other characters must be letters, numbers, or underscores. Any other characters are changed to underscores.

## Next steps

* [Set up the Openflow Connector for Kafka](setup.md)
* [Performance Tuning of the Openflow Connector for Kafka](performance-tuning.md)

---
title: About Openflow Connector for Kinesis
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kinesis/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Kinesis

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Kinesis, including its workflow and limitations.

You can use [Amazon Kinesis Data Streams](https://docs.aws.amazon.com/streams/latest/dev/introduction.html)
to collect and process large streams of data records in real time. Producers continually push data to
Kinesis Data Streams, and consumers process the data in real time.

A Kinesis data stream is a set of [shards](https://docs.aws.amazon.com/streams/latest/dev/key-concepts.html#shard). Each shard has a sequence of data records.
A data record is the unit of data stored in a Kinesis data stream. Data records are composed of
a sequence number, a partition key, and a data blob, which is an immutable sequence of bytes.

The Openflow Connector for Kinesis reads data from a Kinesis data stream and writes it to a Snowflake table using [Snowpipe Streaming](../../../../snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

## Use cases

Use this connector if you want to ingest real‐time events from Amazon Kinesis Data Streams into Snowflake for near real-time analytics.

## Workflow

### AWS administrator tasks

1. Create credentials for the connector to connect with Kinesis Stream and the associated DynamoDB.
2. Set up IAM policies that have the permissions listed in [IAM permissions required for KCL consumer applications](https://docs.aws.amazon.com/streams/latest/dev/kcl-iam-permissions.html).
3. Record the stream name and application name and provide them to your Snowflake account administrator. These are required when setting up the connector in the runtime.

Snowflake account administrator tasks
————————————————————————————————===

1. Install the connector.
2. Configure the connector:
   :   1. Provide the AWS and Snowflake credentials and settings.
       2. Provide the Kinesis stream name.
       3. Set the database and schema names in the Snowflake account.
       4. Customize other parameters.
3. Run the connector in the Openflow canvas. Upon execution, the connector performs the following actions:
   :   1. Creates DynamoDB tables for storing Kinesis Stream checkpoints.
       2. Extracts stream data.
       3. Creates the configured destination table in the Snowflake database if at least one record was received from the stream.
       4. Loads the processed data into the specified Snowflake table.

Business user tasks
————————————————————————————————===

Perform operations on the data downloaded from Kinesis into the destination table.

## Limitations

* The connector supports only a single stream.
* If you use a manually created table:
  :   + The table name must match the stream of the data it holds precisely.
      + The table name must be uppercase.
* The connector supports only JSON message format.
* The connector supports only Amazon Access Key IAM authentication.
* The connector logs failed messages to the Snowflake logs and does not route them to a DLQ stream.

## Next steps

For information on how to set up the connector, see the following topic:

* [Set up Openflow Connector for Kinesis for JSON data format](setup.md)

---
title: About Openflow Connector for LinkedIn Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/linkedin-ads/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for LinkedIn Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts, workflow, and limitations of Openflow Connector for LinkedIn Ads.

The Openflow Connector for LinkedIn Ads enables you to ingest LinkedIn Ads metrics into Snowflake.
This connector uses the [Reporting API](https://learn.microsoft.com/en-us/linkedin/marketing/integrations/ads-reporting/ads-reporting?view=li-lms-2025-02&tabs=http) to fetch data.
The connector persists data in a table dedicated to a given report. Each report can be configured to contain metrics, pivots, and facets chosen by the user.
The connector creates the destination table in the database and the schema provided in the configuration.

Use this connector if you’re looking to do the following:

* Import campaign performance data from LinkedIn Ads to Snowflake for reporting, analytics and insights

## Workflow

1. A **LinkedIn Ads user** obtains credentials required to connect to LinkedIn Ads API.
2. A **Snowflake account administrator** performs the following tasks:

   1. Installs the connector.
   2. Configures the connector with the required parameters.
   3. Runs the connector. The following happens when the connector is run in Openflow:

      1. Retrieves the data based on the specified configuration.
         :   If the Time Granularity parameter is set to `DAILY`, then the connector downloads only the data for a
             calculated timeframe. In other cases, the connector downloads all the data from the start date to the
             current time.
      2. Creates a temporary table and inserts the downloaded data into it.
      3. Recreates or updates the destination table to reflect the schema of data from LinkedIn Ads. If you change the schema, the connector drops the destination table and recreates it with a new schema.
         If `DAILY` time granularity is chosen in the Time Granularity parameter, then outdated data is deleted from the destination table.
      4. Inserts the data into the destination table with an additional insertion timestamp.
      5. Drops the temporary table.

## Limitations

* All metrics of type BigDecimal are saved as Strings. [Conversion functions](../../../../../sql-reference/functions-conversion.md) allow you to convert values manually to numeric types with chosen scale and precision.
* Some metrics and pivots return values that are IDs. The connector does not use the [URN resolution](https://learn.microsoft.com/en-us/linkedin/marketing/integrations/ads-reporting/ads-reporting?view=li-lms-2025-02&tabs=http#urn-resolution).
* The connector uses the [Authorization Code Flow](https://learn.microsoft.com/en-us/linkedin/shared/authentication/authorization-code-flow?context=linkedin%2Fcontext&tabs=HTTPS1) because the [Client Credentials Flow](https://learn.microsoft.com/en-us/linkedin/shared/authentication/client-credentials-flow?context=linkedin%2Fcontext&tabs=HTTPS1) is not available for Marketing API. This means that the refresh token must be refreshed manually every year.

---
title: About Openflow Connector for Meta Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/meta-ads/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Meta Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Meta Ads,
its workflow, and limitations.

[Meta Ads](https://www.facebook.com/business/ads) is an online advertising platform, which you can use to create and run ads to promote your
products or services on Meta products, such as Facebook and Instagram.
The Openflow Connector for Meta Ads automatically ingests Meta Ads data into your Snowflake account
by using [Meta Ads Insights API](https://developers.facebook.com/docs/marketing-api/insights).
Insights API enables you to configure custom reports with selected fields, [breakdowns](https://developers.facebook.com/docs/marketing-api/insights/breakdowns), and other aggregations.

Use this connector if you’re looking to do the following:

* Bring Meta Ads data to unify and analyze your marketing performance

## Workflow

1. A **Meta Ads administrator** performs the following:

   1. [Creates a Meta Ads app](https://developers.facebook.com/docs/development/create-an-app/).
   2. [Enables Marketing API](https://developers.facebook.com/docs/marketing-api/get-started).
   3. [Acquires a long-lived token](https://developers.facebook.com/docs/facebook-login/guides/access-tokens/get-long-lived/).
2. A **Snowflake account administrator** performs the following:

   1. Installs the connector.
   2. Configures the connector with the required parameters, for example long-lived token, report configuration, and database and schema names.
   3. Runs the connector flow. The connector does the following:

      1. Fetches the specified report as specified in the connector configuration.
      2. Creates a temporary table and puts the report chunks in it.
      3. Creates a table in the provided destination schema.
      4. Synchronises data from the temporary table to the destination table.
      5. Removes the temporary table.
3. **Marketing users** with Snowflake access can view and perform operations on the data downloaded from Meta Ads to destination tables.

## Limitations

* The connector supports incremental ingestion only for the daily value of `Report Time Increment` parameter.
* Modification of the report definition when the processors are running might lead to data inconsistencies.
  To ensure consistency, stop the processors and clear the queues before updating the configuration.
* If the Meta Ads API [rate limit](https://developers.facebook.com/docs/graph-api/overview/rate-limiting/#ads-insights)
  is reached, the data doesn’t get ingested even though the connector continues attempting to pull data from the source system.
  To increase the rate limit, [change the app access type](https://developers.facebook.com/docs/marketing-api/overview/rate-limiting) from `Standard access` to `Advanced access` of the Ads Management Standard Access, and enable the `ads_read` and `ads_management` [permissions](https://developers.facebook.com/docs/permissions/).
* Data can be fetched only from the past 37 months, as defined by Meta Ads.

## Next steps

[Set up the Openflow Connector for Meta Ads](setup.md)

---
title: About Openflow Connector for Microsoft Dataverse
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/dataverse/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Microsoft Dataverse

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for Microsoft Dataverse connects a Microsoft
Dataverse storage and Snowflake to ingest Microsoft Dataverse tables and
keeps them up to date on Snowflake side. The outcome of the connector
are selected tables replicated on Snowflake Account in a database and
schema specified by the user.

Use this connector if you’re looking to do the following:

* Integrate data from Microsoft Power Platform and Dynamics 365 applications with Snowflake for holistic business insights

## Rate limiting restrictions

[Microsoft Dataverse API limits](https://learn.microsoft.com/en-us/power-apps/developer/data-platform/api-limits?tabs=sdk#how-service-protection-api-limits-are-enforced) govern how many requests can be made within a given time frame. If your flow exceeds the allowed quota, syncs may slow down or fail with an error. This mostly occurs when your access token makes higher number of requests than the source typically allows. In such cases, we recommend applying for higher access quota (wherever applicable) or reducing the sync frequency.

### Limitations

* Only tables with enabled change tracking can be replicated
* Schema of destination tables is discovered from the database metadata
  through REST APIs. Whenever new columns are added to the table, they
  appear in the destination table. Changes and removals of columns are
  not reflected in the destination table.
* All [limitations of Microsoft Dataverse Web
  API](https://learn.microsoft.com/en-us/power-apps/maker/data-platform/api-limits-overview)
  apply.
* Supported set of column types is limited by set of types supported by
  [Snowpipe Streaming](../../../../snowpipe-streaming/snowpipe-streaming-table-support.md).
* Each instance of the connector supports a single schedule. If you need
  multiple schedules, then you need to install multiple instances of the
  connector.
* Empty tables are not replicated.
* Removal of a table is not replicated. If a table was replicated previously and is removed, it will remain in destination schema.
* Delta tokens used for change tracking expire after 7 days of inactivity by default. If the connector
  is not run for more than 7 days, the delta token expires and the connector must perform a full
  resync of the affected tables. This duration is controlled by the `ExpireChangeTrackingInDays`
  setting in the Microsoft Dataverse organization configuration.

### Next steps

[Set up the Openflow Connector for Microsoft Dataverse](setup.md)

---
title: About Openflow Connector for MySQL
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/mysql/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for MySQL

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for MySQL,
its workflow, and limitations.

## About the Openflow Connector for MySQL

The Openflow Connector for MySQL connects a MySQL database instance to Snowflake and replicates data from selected tables in near real-time or on a specified schedule.
The connector also creates a log of all data changes, which is available along with the current state of the replicated tables.

## Use cases

Use this connector if you’re looking to do the following:

* CDC replication of MySQL tables into Snowflake for comprehensive, centralized reporting

## Supported MySQL versions

The following table lists the tested and officially supported MySQL versions.

|  | 8.0 | 8.4 |
| --- | --- | --- |
| [Standard](https://www.mysql.com/) | Yes | Yes |
| [AWS RDS](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/CHAP_MySQL.html) | Yes | Yes |
| [Amazon Aurora](https://docs.aws.amazon.com/AmazonRDS/latest/AuroraMySQLReleaseNotes/Welcome.html) | Yes, as Version 3 | Not applicable. Aurora 8.4 is not currently supported. |
| [GCP Cloud SQL](https://cloud.google.com/sql/mysql?hl=en) | Yes | Yes |
| [Azure Database](https://azure.microsoft.com/en-us/products/mysql/) | Yes | Yes |

## Openflow requirements

* The runtime size must be at least Medium. Use a bigger runtime when replicating large data volumes, especially when row sizes are large.
* The connector does not support multi-node Openflow runtimes. Configure the runtime for this connector with Min nodes and Max nodes set to `1`.

## Limitations

* The connector supports MySQL version 8 or later.
* The connector supports only username and password authentication with MySQL.
* Only database tables containing primary keys can be replicated.
* The connector does not replicate individual values larger than 16 MB. By default, processing such a value results in the associated table being marked permanently failed.
  To prevent table failures, modify the **Oversized Value Strategy** destination parameter.
* The connector does not replicate tables with data that exceeds
  [Snowflake’s type limitations](../../../../../sql-reference/intro-summary-data-types.md).
* The connector does not replicate columns of types GEOMETRY, GEOMETRYCOLLECTION, LINESTRING, MULTILINESTRING, MULTIPOINT, MULTIPOLYGON, POINT, and POLYGON.
* The connector has the [Group Replication Limitations of MySQL](https://dev.mysql.com/doc/refman/8.4/en/group-replication-limitations.html#group-replication-limitations-transaction-size).
  This means that a single transaction must fit into a binary log message of size no more than 4 GB.
* The connector does not support replicating tables from a reader instance in Amazon Aurora as Aurora reader instances do not maintain their own binary logs.
* The connector supports source table schema changes with the exception of changing primary key definitions and
  changing the precision or the scale of a numeric column.
* The connector does not support re-adding a column after it is dropped.
* For `DATE` and `DATETIME` types, any values that contain a zero month or day
  are mapped to the Unix epoch (‘1970-01-01’ or ‘1970-01-01T00:00’). Date zero (‘0000-00-00’)
  is also mapped to the Unix epoch. Values with a zero year are converted to year one, for
  example, ‘0000-05-30 7:59:59’ becomes ‘0001-05-30T7:59:59’). The remaining date and time
  components are unchanged.
* For `TIMESTAMP` type, value ‘0000-00-00 00:00:00’ is mapped to the Unix EPOCH (‘1970-01-01T00:00Z’).
* The connector does not capture cascade delete operations (ON DELETE CASCADE).
  Foreign key cascade deletions are executed internally by MySQL’s storage engine and are not recorded in the binary log,
  resulting in incomplete replication of dependent table deletions to Snowflake.

> **Note:**
>
> Limitations affecting certain table columns can be bypassed by excluding these specific columns from replication.

## Workflow

1. A **MySQL database administrator** performs the following tasks:

   > * Configure MySQL replication settings
   > * Create credentials for the connector
   > * (Optionally) Provide the SSL certificate.
2. A **Snowflake account administrator** performs the following tasks:

   1. Creates a service user for the connector, a warehouse for the connector, and a destination database for the replicated data.
   2. Installs the connector.
   3. Specifies the required parameters for the flow template.
   4. Runs the flow. The connector performs the following tasks when run in Openflow:

      1. Creates a schema for journal tables.
      2. Creates the schemas and destination tables matching the source tables configured for replication.
      3. Starts replicating the tables. For details on the replication process, see How tables are replicated.

## How the connector works

The following sections describe how the connector works in various scenarios, including replication, changes in schema, and data retention.

### How tables are replicated

The tables are replicated in the following stages:

1. Schema introspection: The connector discovers the columns in the source table, including the column names and types,
   then validates them against Snowflake’s and the connector’s Limitations. Validation failures cause
   this stage to fail, and the cycle completes. After successful completion of this stage, the connector creates an empty destination table.
2. Snapshot load: The connector copies all data available in the
   source table into the destination table. If this stage fails, then
   no more data is replicated. After successful completion, the data from the source table is available in the destination table.
3. Incremental load: The connector tracks
   changes in the source table and applies those changes to the destination table.
   This process continues until the table is removed from replication. Failure at this stage
   permanently stops replication of the source table, until the issue is resolved.

   > **Note:**
   >
   > This connector can be configured to immediately start replicating incremental changes for newly added tables,
   > bypassing the snapshot load phase. This option is often useful when reinstalling the connector
   > in an account where previously replicated data exists and you want to continue replication without having to re-snapshot tables.

   For details on the bypassing snapshot load and using the incremental load process, see [Incremental replication](incremental-replication.md).

> **Important:**
>
> Interim failures, such as connection errors, do not prevent tables from being replicated.
> Permanent failures, such as unsupported data types, do prevent tables from being replicated.
> If a permanent failure prevents a table from being replicated, remove the table from the list of replicated tables.
> After you address the problem that caused the failure, you can add the table back to the list of replicated tables.

### Table replication status

Interim failures, such as connection errors, do not prevent table replication. However,
permanent failures, such as unsupported data types, prevent table replication.

To troubleshoot replication issues or verify that a table has been successfully removed from the replication flow, check the Table State Store:

1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services. A table listing controller services displays.
2. Locate the row labeled Table State Store, click the More  button on the right side of the row, and then choose View State.

A list of tables and their current states displays. Type in the search box to filter the list by table name. The possible states are:

* NEW: The table is scheduled for replication but replication hasn’t started.
* SNAPSHOT_REPLICATION: The connector is copying existing data. This status displays until all records are stored in the destination table.
* INCREMENTAL_REPLICATION: The connector is actively replicating changes. This status displays after snapshot replication ends and continues to display indefinitely until a table is either removed from replication or replication fails.
* FAILED: Replication has permanently stopped due to an error.

> **Note:**
>
> The Openflow runtime canvas doesn’t display table status changes — only the current table status. However, table status changes are recorded in logs when they occur. Look for the following log message:
>
> ```text
> Replication state for table <database_name>.<schema_name>.<table_name> changed from <old_state> to <new_state>
> ```

If a permanent failure prevents table replication, remove the table from replication. After you address the problem that caused the failure, you can add the table back to replication. For more information, see [Restart table replication](setup.md).

## Understanding data retention

The connector follows a data retention philosophy where customer data is never automatically deleted.
You maintain full ownership and control over your replicated data, and the connector preserves historical
information rather than permanently removing it.

This approach has the following implications:

* Rows deleted from the source table are soft-deleted in the destination table rather than physically removed.
* Columns dropped from the source table are renamed in the destination table rather than dropped.
* Journal tables are retained indefinitely and are not automatically cleaned up.

### Destination table metadata columns

Each destination table includes the following metadata columns that track replication information:

| Column name | Type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | The timestamp when the row was originally inserted into the destination table. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | The timestamp when the row was last updated in the destination table. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Indicates whether the row was deleted from the source table. When `true`, the row has been soft-deleted and no longer exists in the source. |

### Soft-deleted rows

When a row is deleted from the source table, the connector does not physically remove it from the
destination table. Instead, the row is marked as deleted by setting the `_SNOWFLAKE_DELETED` metadata
column to `true`.

This approach allows you to:

* Retain historical data for auditing or compliance purposes.
* Query deleted records when needed.
* Decide when and how to permanently remove data based on your requirements.

To query only active (non-deleted) rows, filter on the `_SNOWFLAKE_DELETED` column:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = FALSE;
```

To query deleted rows:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = TRUE;
```

### Dropped columns

When a column is dropped from the source table, the connector does not drop the corresponding column
from the destination table. Instead, the column is renamed by appending the `__SNOWFLAKE_DELETED` suffix
to preserve historical values.

For example, if a column named `EMAIL` is dropped from the source table, it is renamed to
`EMAIL__SNOWFLAKE_DELETED` in the destination table. Rows that existed before the column was dropped
retain their original values, while rows added after the drop have `NULL` in this column.

You can still query historical values from the renamed column:

```sqlexample
SELECT EMAIL__SNOWFLAKE_DELETED FROM my_table;
```

### Renamed columns

Due to limitations in CDC (Change Data Capture) mechanisms, the connector cannot distinguish between
a column being renamed and a column being dropped followed by a new column being added. As a result,
when you rename a column in the source table, the connector treats this as two separate operations:
dropping the original column and adding a new column with the new name.

For example, if you rename a column from `A` to `B` in the source table, the destination table
will contain:

* `A__SNOWFLAKE_DELETED`: Contains values from before the rename. Rows added after the rename have
  `NULL` in this column.
* `B`: Contains values from after the rename. Rows that existed before the rename have `NULL`
  in this column.

#### Querying renamed columns

To retrieve data from both the original and renamed columns as a single unified column, use a
`COALESCE` or `CASE` expression:

```sqlexample
SELECT
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

Alternatively, using a `CASE` expression:

```sqlexample
SELECT
    CASE
        WHEN B IS NOT NULL THEN B
        ELSE A__SNOWFLAKE_DELETED
    END AS A_RENAMED_TO_B
FROM my_table;
```

#### Creating a view for renamed columns

Rather than manually modifying the destination table, you can create a view that presents the renamed
column as a single unified column. This approach is recommended because it preserves the original data
and avoids potential issues with ongoing replication.

```sqlexample
CREATE VIEW my_table_unified AS
SELECT
    *,
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

> **Important:**
>
> Manually modifying the destination table structure (such as dropping or renaming columns) is not
> recommended, as it may interfere with ongoing replication and cause data inconsistencies.

### Journal tables

During incremental replication, changes from the source database are first written to journal tables
before being merged into the destination tables. The connector does not automatically remove data from
journal tables, as this data may be useful for auditing, debugging, or reprocessing purposes.

Journal tables are created in the same schema as their corresponding destination tables and follow
this naming convention:

`<TABLE_NAME>_JOURNAL_<timestamp>_<number>`

Where:

* `<TABLE_NAME>` is the name of the destination table.
* `<timestamp>` is the creation timestamp in Unix epoch format (seconds since January 1, 1970),
  ensuring uniqueness.
* `<number>` starts at 1 and increments whenever the destination table schema changes, either due to
  schema changes in the source table or modifications to column filters.

For example, if your destination table is `SALES.ORDERS`, the journal table might be named
`SALES.ORDERS_JOURNAL_1705320000_1`.

> **Important:**
>
> Do not drop journal tables while replication is in progress. Removing an active journal table may
> cause data loss or replication failures. Only drop journal tables after the corresponding source
> table has been fully removed from replication.

#### Managing journal table storage

If you need to manage storage costs by removing old journal data, you can create a Snowflake task
that periodically cleans up journal tables for tables that are no longer being replicated.

Before implementing journal cleanup, verify that:

* The corresponding source tables have been fully removed from replication.
* You no longer need the journal data for auditing or processing purposes.

For information on creating and managing tasks for automated cleanup, see
[Introduction to tasks](../../../../tasks-intro.md).

## Next steps

Review [Openflow Connector for MySQL: Data mapping](data-mapping.md) to understand how the connector maps data types to Snowflake data types.

Review [Set up the Openflow Connector for MySQL](setup.md) to set up the connector.

---
title: About Openflow Connector for Oracle
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Oracle

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes the basic concepts of Openflow Connector for Oracle, its workflow, and limitations.

## About the Openflow Connector for Oracle

The Openflow Connector for Oracle connects an Oracle database instance to Snowflake and replicates
data from selected tables in near real-time or on a specified schedule.
The connector also creates a log of all data changes, which is available along
with the current state of the replicated tables.

## Use cases

The connector supports the following use case:

* Replicate Oracle database tables into Snowflake for comprehensive, centralized reporting.

## Licensing models and critical constraints

The Openflow Connector for Oracle supports two distinct licensing models. You must select the correct
model before installation. Failure to select the correct model might result in deployment
failure or unintended financial commitments.

For detailed licensing terms, comparison, and configuration instructions, see
Oracle XStream licensing.

### 1. Embedded License (Snowflake-provided)

Snowflake provides the Oracle XStream license to you directly for a fee. This model
allows you to consume XStream replication without a direct contract with Oracle.
For more information, see
Embedded license details and the
[Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

| Term | Details |
| --- | --- |
| Billing | License and Support & Maintenance (S&M) fees are drawn from your Snowflake Capacity. |
| Commitment | Activation initiates a non-cancelable 36-month term (after the 60-day trial). |
| Lifecycle | * **Post-term (36+ months)**: After the initial 36-month term, the license fee   drops to $0, but the S&M fee continues annually. * **Lock-out risk**: If you opt-out of S&M renewal, the connector will be   permanently locked when S&M coverage ends. Unlocking the connector requires   purchasing a new Embedded License, which triggers a new 36-month commitment   at full price. |
| Management UI | All license actions (Start/Cancel Trial, Monitor Usage, Opt-out) are performed by the ORGADMIN in Snowsight under Admin » Terms » Openflow for Oracle. For step-by-step instructions, see [Openflow Connector for Oracle: Enable and manage commercial terms](manage-commercial-terms.md). |
| Restrictions | The following customers are ineligible:   * Public sector entities. * Customers purchasing Snowflake through the GCP Marketplace. * Customers contracted with Snowflake through a third-party reseller. |

### 2. Independent license (Bring Your Own License - BYOL)

You provide your own Oracle license that includes XStream entitlements (for example, Oracle
GoldenGate license). For more information, see
Independent license (BYOL) details.

| Term | Details |
| --- | --- |
| Billing | No additional licensing fees from Snowflake. Standard storage and compute costs (for example, Openflow Compute) will apply. |
| Compliance | You are solely responsible for compliance with your Oracle license. |
| Usage | Mandatory for public sector, GCP Marketplace, and reseller customers. |

## Choosing an Oracle XStream licensing model

The Openflow Connector for Oracle requires a paid license for Oracle XStream services. Two licensing
models are available:

* Embedded Oracle License
* Independent Oracle License (Bring Your Own License - BYOL)

Use the following table to determine the appropriate model for your organization.

| Consideration | Embedded License | Independent License (BYOL) |
| --- | --- | --- |
| Who is it for? | Customers who need to license Oracle XStream technology and want to purchase it directly through their Snowflake agreement. | Customers who already have an Oracle GoldenGate license or another Oracle agreement that provides entitlement for XStream. |
| Billing | Billed through Snowflake based on the number of processor cores on your source Oracle DB. Involves a non-cancelable 36-month commitment. Also billed for support and maintenance services.  Additionally, standard storage and compute costs (for example, Openflow Compute) will apply. | No additional licensing or support and maintenance fees for Oracle XStream services from Snowflake. You are responsible for all licensing and compliance directly with Oracle.  Standard storage and compute costs (for example, Openflow Compute) will apply. |
| Configuration | Requires you to input your Oracle DB’s CPU core count and a processor multiplier factor in the connector parameters. | Does not require you to provide CPU core information to Snowflake. |
| Trial period | Includes a 60-day free trial for up to 16 licensed cores. Billing commences automatically on the 61st day. | No trial period is offered through Snowflake. Your use is subject to your existing Oracle agreement. |

## Embedded license details

By choosing this option, you are procuring the right to use Oracle XStream technology
with the connector through Snowflake. Be aware of the following key terms:

### Billing

Oracle XStream services are billed monthly and drawn from your Snowflake capacity
balance. The fee has two components - a license fee and a Support & Maintenance
(S&M) fee. The license fee is calculated based on the number of processor cores
in your source Oracle database, multiplied by the Oracle Processor Core Factor.

### Commitment (The “Day 61” Rule)

The first 60 days are free for up to 16 licensed cores. However, activating the
connector beyond the 60-day trial initiates a non-cancelable 36-month billing term
(“Initial Term”).

* **Automatic Conversion**: Billing commences automatically on Day 61. To avoid
  charges, you must cancel the trial in the
  Admin » Terms » Openflow for Oracle dashboard before
  Day 60.
* **Lock-in**: If your Snowflake agreement is terminated during this Initial Term,
  the entire remaining balance for the Initial Term becomes due immediately.

### Post-term renewal and penalties

After the Initial Term, the license fee becomes $0 but the Support & Maintenance
(S&M) fee continues.

* **Opt-out Consequence**: You can opt-out of S&M renewal through the dashboard in
  Admin » Terms » Openflow for Oracle. However, if S&M
  coverage stops, the connector processors are locked. To resume operations, you
  must purchase a NEW Embedded License (resetting the 36-month full-price
  commitment).

### Requirements

You are responsible for accurately reporting the number of processor cores and the
correct core factor in the connector configuration. This information must be kept
current if your source database hardware changes.

### Restrictions

This option is not available for:

* Public sector entities (for example, Government and Education entities).
* Customers purchasing Snowflake through the GCP Marketplace.
* Customers contracted with Snowflake through a third-party reseller (for example, CDW, Optiv).

### Configuration

To configure the Embedded License:

* Review and accept the
  [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/)
  terms presented in the UI.
* Select the Embedded License type.
* Enter the CPU core count details for your source Oracle database:
  **Total Cores** (the total number of physical cores on the source database
  server) and **Core Factor** (the Oracle processor core factor, for example, 0.5
  for Intel processors). Consult the Oracle Processor Core Factor Table for
  the correct value.

## Independent license (BYOL) details

This option is for customers who have already licensed the necessary Oracle technology.

### Requirements

You are solely responsible for ensuring that your use of the connector complies with
the terms of your existing Oracle license agreement. Snowflake does not validate or
audit your Oracle entitlements.

### Configuration

To configure the Independent License (BYOL):

* Review and accept the
  [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/)
  terms presented in the UI.
* Select the Independent License type.

When configuring the connector, proceed without entering any core
count or billing-related information.

## Openflow requirements

The following Openflow runtime requirements apply to the Openflow Connector for Oracle:

* The runtime size must be at least Medium. Use a bigger runtime when replicating
  large data volumes, especially when row sizes are large.
* The connector does not support multi-node Openflow runtimes. Configure the
  runtime for this connector with Min nodes and Max nodes set to `1`.

## Supported Oracle versions and platforms

The following Oracle database versions and platforms are supported:

* Oracle database versions 12cR1 and later
* On-premises servers
* Oracle Exadata
* OCI VM/Bare Metal
* AWS Custom RDS for Oracle
* AWS Standard Single-tenant RDS for Oracle

## Limitations

The following limitations apply to the Openflow Connector for Oracle:

* AWS Standard Multi-tenant RDS for Oracle is not supported.
* Oracle Autonomous Databases (ATP/ADW) are not supported.
* Oracle SaaS offerings such as Oracle Fusion Cloud Applications and NetSuite
  are not supported.
* The connector requires Openflow deployment version 0.55.0 or later for BYOC.
* The Openflow runtime must be created after the required Openflow deployment
  version is installed.
* Only database tables containing primary keys can be replicated.
* The connector works within a single database/container (PDB or CDB). To
  replicate tables from multiple containers, you must configure separate
  connector instances for each container.
* The connector does not support re-adding a column after it is dropped.
* The connector does not replicate individual values larger than 16 MB.
  By default, processing such a value results in the associated table being marked permanently failed.
  To prevent table failures, modify the **Oversized Value Strategy** destination parameter.
* Schema changes (such as ALTER TABLE statements that add or drop columns) are not supported
  while re-reading the redo logs from the earliest position. If any table’s schema was
  altered between the earliest available SCN and the current position, that table should
  be removed from replication and re-added with a fresh snapshot instead.

## How the connector works

The following sections describe how the connector works in different contexts,
including replication, schema changes, and data retention.

### How tables are replicated

The tables are replicated in the following stages:

1. Schema introspection: The connector discovers the columns in the source
   table, including the column names and types, then validates them against
   Snowflake’s and the connector’s Limitations. Validation failures cause
   this stage to fail, and the cycle completes. After successful completion
   of this stage, the connector creates an empty destination table in Snowflake.
2. Snapshot load: The connector copies all data available in the source table
   into the destination table. If this stage fails, then no more data is
   replicated. After successful completion, the data from the source table is
   available in the destination table.
3. Incremental load: The connector tracks
   changes in the source table and applies those changes to the destination table.
   This process continues until the table is removed from replication. Failure at this stage
   permanently stops replication of the source table until the issue is resolved.

### Table replication status

Interim failures, such as connection errors, do not prevent table replication. However,
permanent failures, such as unsupported data types, prevent table replication.

To troubleshoot replication issues or verify that a table has been successfully removed from the replication flow, check the Table State Store:

1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services. A table listing controller services displays.
2. Locate the row labeled Table State Store, click the More  button on the right side of the row, and then choose View State.

A list of tables and their current states displays. Type in the search box to filter the list by table name. The possible states are:

* NEW: The table is scheduled for replication but replication hasn’t started.
* SNAPSHOT_REPLICATION: The connector is copying existing data. This status displays until all records are stored in the destination table.
* INCREMENTAL_REPLICATION: The connector is actively replicating changes. This status displays after snapshot replication ends and continues to display indefinitely until a table is either removed from replication or replication fails.
* FAILED: Replication has permanently stopped due to an error.

> **Note:**
>
> The Openflow runtime canvas doesn’t display table status changes — only the current table status. However, table status changes are recorded in logs when they occur. Look for the following log message:
>
> ```text
> Replication state for table <database_name>.<schema_name>.<table_name> changed from <old_state> to <new_state>
> ```

If a permanent failure prevents table replication, remove the table from
replication. After you address the problem that caused the failure, you can add
the table back to replication. For more information, see
[Restart table replication](setup-connector.md).

## Understanding data retention

The connector follows a data retention philosophy where customer data is never automatically deleted.
You maintain full ownership and control over your replicated data, and the connector preserves historical
information rather than permanently removing it.

This approach has the following implications:

* Rows deleted from the source table are soft-deleted in the destination table rather than physically removed.
* Columns dropped from the source table are renamed in the destination table rather than dropped.
* Journal tables are retained indefinitely and are not automatically cleaned up.

### Destination table metadata columns

Each destination table includes the following metadata columns that track replication information:

| Column name | Type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | The timestamp when the row was originally inserted into the destination table. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | The timestamp when the row was last updated in the destination table. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Indicates whether the row was deleted from the source table. When `true`, the row has been soft-deleted and no longer exists in the source. |

### Soft-deleted rows

When a row is deleted from the source table, the connector does not physically remove it from the
destination table. Instead, the row is marked as deleted by setting the `_SNOWFLAKE_DELETED` metadata
column to `true`.

This approach allows you to:

* Retain historical data for auditing or compliance purposes.
* Query deleted records when needed.
* Decide when and how to permanently remove data based on your requirements.

To query only active (non-deleted) rows, filter on the `_SNOWFLAKE_DELETED` column:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = FALSE;
```

To query deleted rows:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = TRUE;
```

### Dropped columns

When a column is dropped from the source table, the connector does not drop the corresponding column
from the destination table. Instead, the column is renamed by appending the `__SNOWFLAKE_DELETED` suffix
to preserve historical values.

For example, if a column named `EMAIL` is dropped from the source table, it is renamed to
`EMAIL__SNOWFLAKE_DELETED` in the destination table. Rows that existed before the column was dropped
retain their original values, while rows added after the drop have `NULL` in this column.

You can still query historical values from the renamed column:

```sqlexample
SELECT EMAIL__SNOWFLAKE_DELETED FROM my_table;
```

### Renamed columns

Due to limitations in CDC (Change Data Capture) mechanisms, the connector cannot distinguish between
a column being renamed and a column being dropped followed by a new column being added. As a result,
when you rename a column in the source table, the connector treats this as two separate operations:
dropping the original column and adding a new column with the new name.

For example, if you rename a column from `A` to `B` in the source table, the destination table
will contain:

* `A__SNOWFLAKE_DELETED`: Contains values from before the rename. Rows added after the rename have
  `NULL` in this column.
* `B`: Contains values from after the rename. Rows that existed before the rename have `NULL`
  in this column.

#### Querying renamed columns

To retrieve data from both the original and renamed columns as a single unified column, use a
`COALESCE` or `CASE` expression:

```sqlexample
SELECT
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

Alternatively, using a `CASE` expression:

```sqlexample
SELECT
    CASE
        WHEN B IS NOT NULL THEN B
        ELSE A__SNOWFLAKE_DELETED
    END AS A_RENAMED_TO_B
FROM my_table;
```

#### Creating a view for renamed columns

Rather than manually modifying the destination table, you can create a view that presents the renamed
column as a single unified column. This approach is recommended because it preserves the original data
and avoids potential issues with ongoing replication.

```sqlexample
CREATE VIEW my_table_unified AS
SELECT
    *,
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

> **Important:**
>
> Manually modifying the destination table structure (such as dropping or renaming columns) is not
> recommended, as it may interfere with ongoing replication and cause data inconsistencies.

### Journal tables

During incremental replication, changes from the source database are first written to journal tables
before being merged into the destination tables. The connector does not automatically remove data from
journal tables, as this data may be useful for auditing, debugging, or reprocessing purposes.

Journal tables are created in the same schema as their corresponding destination tables and follow
this naming convention:

`<TABLE_NAME>_JOURNAL_<timestamp>_<number>`

Where:

* `<TABLE_NAME>` is the name of the destination table.
* `<timestamp>` is the creation timestamp in Unix epoch format (seconds since January 1, 1970),
  ensuring uniqueness.
* `<number>` starts at 1 and increments whenever the destination table schema changes, either due to
  schema changes in the source table or modifications to column filters.

For example, if your destination table is `SALES.ORDERS`, the journal table might be named
`SALES.ORDERS_JOURNAL_1705320000_1`.

> **Important:**
>
> Do not drop journal tables while replication is in progress. Removing an active journal table may
> cause data loss or replication failures. Only drop journal tables after the corresponding source
> table has been fully removed from replication.

#### Managing journal table storage

If you need to manage storage costs by removing old journal data, you can create a Snowflake task
that periodically cleans up journal tables for tables that are no longer being replicated.

Before implementing journal cleanup, verify that:

* The corresponding source tables have been fully removed from replication.
* You no longer need the journal data for auditing or processing purposes.

For information on creating and managing tasks for automated cleanup, see
[Introduction to tasks](../../../../tasks-intro.md).

## Next steps

After reviewing this topic, consider the following next steps:

* Review [Openflow Connector for Oracle: Enable and manage commercial terms](manage-commercial-terms.md) to enable the connector, accept the Oracle
  XStream terms, and configure your licensing model.
* Review [Openflow Connector for Oracle: Data mapping](data-mapping.md) to understand how the connector maps data types
  to Snowflake data types.
* Review [Set up tasks for the Openflow Connector for Oracle](setup-tasks.md) to set up the connector.

---
title: About Openflow Connector for PostgreSQL
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/postgres/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for PostgreSQL

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for PostgreSQL, its workflow, and limitations.

## About the Openflow Connector for PostgreSQL

The Openflow Connector for PostgreSQL connects a PostgreSQL database instance to Snowflake and replicates data from selected tables in near real-time or on schedule.
The connector also creates a log of all data changes, available along the current state of the replicated tables.

## Use cases

Use this connector if you’re looking to do the following:

* CDC replication of PostgreSQL data with Snowflake for comprehensive, centralized reporting.

## Supported PostgreSQL versions

The following are the supported PostgreSQL versions.

Supported PostgreSQL versions

|  | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| [Standard](https://www.postgresql.org/) | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| [AWS RDS](https://docs.aws.amazon.com/AmazonRDS/latest/PostgreSQLReleaseNotes/Welcome.html) | Yes | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| [Amazon Aurora](https://docs.aws.amazon.com/AmazonRDS/latest/AuroraPostgreSQLReleaseNotes/Welcome.html) | Yes | Yes | Yes | Yes | Yes | Yes | Yes |  |
| [GCP Cloud SQL](https://cloud.google.com/sql/docs/postgres/) | Yes | Yes | Yes | Yes | Yes | Yes | Yes |  |
| [Azure Database](https://learn.microsoft.com/en-us/azure/postgresql/) | Yes | Yes | Yes | Yes | Yes | Yes | Yes |  |

## Openflow requirements

* The runtime size must be at least Medium. Use a bigger runtime when replicating large data volumes, especially when row sizes are large.
* The connector does not support multi-node Openflow runtimes. Configure the runtime for this connector with Min nodes and Max nodes set to `1`.

## Limitations

* The connector supports PostgreSQL version 11 or later.
* The connector supports only username and password authentication with PostgreSQL.
* The connector does not replicate individual values larger than 16 MB. By default, processing such a value results in the associated table being marked permanently failed.
  To prevent table failures, modify the **Oversized Value Strategy** destination parameter.
* The connector does not replicate tables with data that exceeds [Snowflake’s type limitations](../../../../../sql-reference/intro-summary-data-types.md).
  An exception to this rule is date & time data type columns that contain out-of-range values. For more information, see Out of range value support.
* The connector requires every replicated table to have a primary key, and that the replica identity of the table is the same as the primary key.
* The connector supports source table schema changes with the exception of changing primary key definitions, changing the precision, or the scale of a numeric column.
* The connector does not support re-adding a column after it is dropped.

> **Note:**
>
> Limitations affecting certain table columns can be bypassed by excluding these specific columns from replication.

## Workflow

1. A **Database administrator** configures PostgreSQL replication settings, creates a
   publication, and credentials for the connector. Optionally, they deliver the SSL certificate.
2. A **Snowflake account administrator** performs the following tasks:

   1. Creates a service user for the connector, a warehouse for the connector, and a destination database to replicate into.
   2. Installs the connector.
   3. Specifies the required parameters for the flow template.
   4. Runs the flow. The connector performs the following tasks when run in Openflow:

      1. Creates a schema for journal tables.
      2. Creates the schemas and destination tables matching the source tables configured for replication.
      3. Starts replication following the table replication lifecycle.

## How the connector works

The following sections describe how the connector works in various scenarios, including replication, changes in schema, and data retention.

### How tables are replicated

1. Schema introspection: The connector discovers the columns in the source table, their names, types,
   then validates them against Snowflake’s and the connector’s limitations. Validation failures cause
   this stage to fail, and the cycle completes. After successful completion of Schema Introspection, the connector creates an empty destination table.
2. Snapshot load: The connector copies all data available in the
   source table into the destination table. Failure of this stage finishes the cycle, and
   no more data is replicated. After successful completion, the whole set of data from the source table is available in the destination table.
3. Incremental load: The connector keeps tracking
   changes in the source table, and copying them into the destination table.
   This continues until the table is removed from replication. Failure at this stage
   permanently stops replication of the source table, until the issue is removed.

   > **Note:**
   >
   > This connector can be configured to immediately start replicating incremental changes for newly added tables,
   > bypassing the snapshot load phase. This option is often useful when reinstalling the connector
   > in an account where previously replicated data exists and you want to continue replication without having to re-snapshot tables.

   For details on the bypassing snapshot load and using the incremental load process, see [Incremental replication](incremental-replication.md).

> **Important:**
>
> Interim failures, such as connection errors, do not prevent tables from being replicated.
> Permanent failures, such as unsupported data types, do prevent tables from being replicated.
> If a permanent failure prevents a table from being replicated, remove the table from the list of replicated tables.
> After you address the problem that caused the failure, you can add the table back to the list of replicated tables.

### TOASTed value support

The connector supports replicating tables with [TOAST values](https://www.postgresql.org/docs/current/storage-toast.html) for columns of types: `array`, `bytea`, `json`, `jsonb`, `text`, `varchar`, `xml`.

Whenever the connector encounters a TOASTed value in the CDC stream, it substitutes a default placeholder of `__previous_value_unchanged`, formatted for the given column type, and stores it in the journal table. The `MERGE` query then accounts for placeholder values, so that the destination table always contains the last non-TOASTed value.

### Out of range value support

The connector supports replicating tables with columns of types `date`, `timestamp`, and `timestamptz` that contain out-of-range values.
If the connector encounters an out-of-range value in the CDC stream, it substitutes a default placeholder based on the type of the column.

Placeholder values for out-of-range values

| Column type | Placeholder value |
| --- | --- |
| `date` | `-9999-01-01` through `9999-12-31`. |
| `timestamp` | `0001-01-01 00:00:00` through `9999-12-31 23:59:59.999999999`. |
| `timestamptz` | `0001-01-01 00:00:00+00` through `9999-12-31 23:59:59.999999999+00`. |

> **Note:**
>
> `-Infinity` and `Infinity` values are also replaced with the respective placeholders for all three types.

### Table replication status

Interim failures, such as connection errors, do not prevent table replication. However,
permanent failures, such as unsupported data types, prevent table replication.

To troubleshoot replication issues or verify that a table has been successfully removed from the replication flow, check the Table State Store:

1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services. A table listing controller services displays.
2. Locate the row labeled Table State Store, click the More  button on the right side of the row, and then choose View State.

A list of tables and their current states displays. Type in the search box to filter the list by table name. The possible states are:

* NEW: The table is scheduled for replication but replication hasn’t started.
* SNAPSHOT_REPLICATION: The connector is copying existing data. This status displays until all records are stored in the destination table.
* INCREMENTAL_REPLICATION: The connector is actively replicating changes. This status displays after snapshot replication ends and continues to display indefinitely until a table is either removed from replication or replication fails.
* FAILED: Replication has permanently stopped due to an error.

> **Note:**
>
> The Openflow runtime canvas doesn’t display table status changes — only the current table status. However, table status changes are recorded in logs when they occur. Look for the following log message:
>
> ```text
> Replication state for table <database_name>.<schema_name>.<table_name> changed from <old_state> to <new_state>
> ```

If a permanent failure prevents table replication, remove the table from replication. After you address the problem that caused the failure, you can add the table back to replication. For more information, see [Restart table replication](maintenance.md).

## Understanding data retention

The connector follows a data retention philosophy where customer data is never automatically deleted.
You maintain full ownership and control over your replicated data, and the connector preserves historical
information rather than permanently removing it.

This approach has the following implications:

* Rows deleted from the source table are soft-deleted in the destination table rather than physically removed.
* Columns dropped from the source table are renamed in the destination table rather than dropped.
* Journal tables are retained indefinitely and are not automatically cleaned up.

### Destination table metadata columns

Each destination table includes the following metadata columns that track replication information:

| Column name | Type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | The timestamp when the row was originally inserted into the destination table. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | The timestamp when the row was last updated in the destination table. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Indicates whether the row was deleted from the source table. When `true`, the row has been soft-deleted and no longer exists in the source. |

### Soft-deleted rows

When a row is deleted from the source table, the connector does not physically remove it from the
destination table. Instead, the row is marked as deleted by setting the `_SNOWFLAKE_DELETED` metadata
column to `true`.

This approach allows you to:

* Retain historical data for auditing or compliance purposes.
* Query deleted records when needed.
* Decide when and how to permanently remove data based on your requirements.

To query only active (non-deleted) rows, filter on the `_SNOWFLAKE_DELETED` column:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = FALSE;
```

To query deleted rows:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = TRUE;
```

### Dropped columns

When a column is dropped from the source table, the connector does not drop the corresponding column
from the destination table. Instead, the column is renamed by appending the `__SNOWFLAKE_DELETED` suffix
to preserve historical values.

For example, if a column named `EMAIL` is dropped from the source table, it is renamed to
`EMAIL__SNOWFLAKE_DELETED` in the destination table. Rows that existed before the column was dropped
retain their original values, while rows added after the drop have `NULL` in this column.

You can still query historical values from the renamed column:

```sqlexample
SELECT EMAIL__SNOWFLAKE_DELETED FROM my_table;
```

### Renamed columns

Due to limitations in CDC (Change Data Capture) mechanisms, the connector cannot distinguish between
a column being renamed and a column being dropped followed by a new column being added. As a result,
when you rename a column in the source table, the connector treats this as two separate operations:
dropping the original column and adding a new column with the new name.

For example, if you rename a column from `A` to `B` in the source table, the destination table
will contain:

* `A__SNOWFLAKE_DELETED`: Contains values from before the rename. Rows added after the rename have
  `NULL` in this column.
* `B`: Contains values from after the rename. Rows that existed before the rename have `NULL`
  in this column.

#### Querying renamed columns

To retrieve data from both the original and renamed columns as a single unified column, use a
`COALESCE` or `CASE` expression:

```sqlexample
SELECT
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

Alternatively, using a `CASE` expression:

```sqlexample
SELECT
    CASE
        WHEN B IS NOT NULL THEN B
        ELSE A__SNOWFLAKE_DELETED
    END AS A_RENAMED_TO_B
FROM my_table;
```

#### Creating a view for renamed columns

Rather than manually modifying the destination table, you can create a view that presents the renamed
column as a single unified column. This approach is recommended because it preserves the original data
and avoids potential issues with ongoing replication.

```sqlexample
CREATE VIEW my_table_unified AS
SELECT
    *,
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

> **Important:**
>
> Manually modifying the destination table structure (such as dropping or renaming columns) is not
> recommended, as it may interfere with ongoing replication and cause data inconsistencies.

### Journal tables

During incremental replication, changes from the source database are first written to journal tables
before being merged into the destination tables. The connector does not automatically remove data from
journal tables, as this data may be useful for auditing, debugging, or reprocessing purposes.

Journal tables are created in the same schema as their corresponding destination tables and follow
this naming convention:

`<TABLE_NAME>_JOURNAL_<timestamp>_<number>`

Where:

* `<TABLE_NAME>` is the name of the destination table.
* `<timestamp>` is the creation timestamp in Unix epoch format (seconds since January 1, 1970),
  ensuring uniqueness.
* `<number>` starts at 1 and increments whenever the destination table schema changes, either due to
  schema changes in the source table or modifications to column filters.

For example, if your destination table is `SALES.ORDERS`, the journal table might be named
`SALES.ORDERS_JOURNAL_1705320000_1`.

> **Important:**
>
> Do not drop journal tables while replication is in progress. Removing an active journal table may
> cause data loss or replication failures. Only drop journal tables after the corresponding source
> table has been fully removed from replication.

#### Managing journal table storage

If you need to manage storage costs by removing old journal data, you can create a Snowflake task
that periodically cleans up journal tables for tables that are no longer being replicated.

Before implementing journal cleanup, verify that:

* The corresponding source tables have been fully removed from replication.
* You no longer need the journal data for auditing or processing purposes.

For information on creating and managing tasks for automated cleanup, see
[Introduction to tasks](../../../../tasks-intro.md).

## Next steps

Review [Openflow Connector for PostgreSQL: Data mapping](data-mapping.md) to understand how the connector maps data types to Snowflake data types.
Review [Set up the Openflow Connector for PostgreSQL](setup.md) to set up the connector.

---
title: About Openflow Connector for SharePoint
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sharepoint/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for SharePoint

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for
SharePoint, its use cases and limitations.

The Openflow Connector for SharePoint connects a Microsoft 365
SharePoint site and Snowflake to ingest files and user permissions and
keeps them up to date. Openflow Connector for SharePoint also supports
the Cortex Search service and can make ingested files ready for
conversational analysis for use in AI Assistants using SQL, Python or
REST APIs.

## Variants of the Openflow Connector for SharePoint

The Openflow Connector for SharePoint contains four variants which allow you to, optionally, index data
into Snowflake Cortex Search and include document metadata (ACLs).

|  |  |
| --- | --- |
| Variant | Description |
| Microsoft SharePoint (Cortex Search, document ACLs) | Indexes files and their permissions (ACLs) into Snowflake Cortex Search. |
| Microsoft SharePoint (Cortex Search, no document ACLs) | Indexes files without their permissions (ACLs) into Snowflake Cortex Search. |
| Microsoft SharePoint (Simple Ingest, document ACLs) | Ingests files and their permissions (ACLs) into a Snowflake stage. |
| Microsoft SharePoint (Simple Ingest, no document ACLs) | Ingests files without their permissions (ACLs) into a Snowflake stage. |

These variants appear as separate connectors in Marketplace. When installing the
connector, choose the variant that meets your requirements.

## Rate limiting restrictions

[SharePoint API limits](https://learn.microsoft.com/en-us/sharepoint/dev/embedded/development/limits-calling#api-rate-limits) govern how many requests can be made within a given time frame. If your flow exceeds the allowed quota, syncs may slow down or fail with an error. This mostly occurs when your access token makes higher number of requests than the source typically allows. In such cases, Snowflake recommends applying for higher access quota (wherever applicable) or reducing the sync frequency.

### Limitations

* [Input requirements](../../../../snowflake-cortex/parse-document.md).
* [Known limitations](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md).
* Changes caused by moving or renaming folders aren’t captured during
  incremental ingestion.
* The connector ingests only the supported file types and ignores
  others.

### Next steps

[Set up the Openflow Connector for SharePoint](setup.md)

---
title: About Openflow Connector for Slack
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/slack/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Slack

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Slack,
steps to set it up, and limitations.

The Openflow Connector for Slack connects a Slack workspace to Snowflake
in order to ingest Slack messages, reactions, file attachments, and channel
memberships (ACLs). The connector also supports the Cortex Search service and can make
ingested Slack content ready for conversational analysis for use in AI
Assistants using SQL, Python or REST APIs.

Use this connector if you’re looking to do the following:

* Pull Slack messages and metadata into Snowflake for searchable, organization-wide insights
* Ingest Slack content and make it ready for chat in your AI assistants with Snowflake Cortex

## Limitations

* The connector captures historical file attachments, reactions and messages, but only after the Slack App is added to a conversation or channel.
* If a user edits an existing message or deletes a message, the changes are captured in Snowflake at the next refresh interval.

## Workflow

1. **Slack Admin** creates a Slack App as described later, then installs
   the App in the channels or conversations they wish to ingest messages
   from. The Bot token and App token from the Slack App need to be
   provided to the Snowflake Account Admin
2. **Snowflake account admin**:

   1. Installs the connector.
   2. Specifies the required parameters for the flow template, for
      example, Bot token, App token, and database and schema names.
   3. Runs flow. The following happens when the flow is run in Openflow:

      1. The flow automatically creates a database, schema and the
         necessary tables and external access integration in Snowflake
         on behalf of the admin. It also creates a Cortex Search and
         wires up chunks and ACLs and metadata. By default, these are
         only accessible to the Snowflake account admin role
      2. Fetches specified conversations, metadata, ACLs from the Slack
         channel(s). An ACL is defined as the snapshot list of user IDs
         and emails that are members of each channel being ingested.
      3. Chunks ingested conversation messages
      4. Puts chunked conversation messages along with metadata and ACLs
         into Snowflake tables
3. **IT Developer** in customer’s organization creates bespoke Chat App
   and passes user identity which is the user’s email registered on
   Slack, as a filter when invoking Cortex Search REST API with the end
   user’s question
4. **End users** of the Chat App in the customer’s organization see
   responses from Cortex Search restricted to chunks from conversations
   they have access to in the Slack channel based on ACLs, along with a
   link to the source conversation.

### Considerations

* By default, any user with the Snowflake account admin role will be
  able to “see” the raw ingested messages and conversations and tables
  created by the flow template
* The user with the Snowflake account admin role decides who can access
  the internal stage and tables through Snowflake roles.
* The user with the Snowflake account admin role decides who can query
  the Cortex Search service through Snowflake roles.

### Next steps

[Set up the Openflow Connector for Slack](setup.md)

---
title: About Openflow Connector for Snowflake to Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/snowflake-to-kafka/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Snowflake to Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of Openflow Connector for Snowflake to Kafka and limitations.

The connector consumes a Snowflake stream and sends consumed CDC records to a Kafka topic.
A Snowflake Stream object records data manipulation language (DML) changes made to tables,
including inserts, updates, and deletes, as well as metadata about each change, so that actions
can be taken using the changed data. This process is referred to as change data capture (CDC).

Use this connector if you’re looking to do the following:

* Replicate Snowflake tables to Apache Kafka using CDC for real-time insights distribution and event-driven architectures

## Workflow

Depending on the configuration of the Kafka broker, which is going to be receiving the CDC data, the workflow may differ slightly.

1. A Snowflake account administrator performs the following tasks:

   1. Creates or identifies the Snowflake stream that is going to be the source of the CDC data.
   2. Designates a warehouse to be used by the connector.
   3. Configures or identifies the Snowflake user used by the connector and a role for this user.
      The user must have appropriate permissions to the source Snowflake stream. At a minimum,
      the user needs USAGE privilege on the database and schema containing the Snowflake stream,
      and SELECT privilege on the stream and the stream’s underlying table or view object.
2. A Kafka administrator performs the following tasks.

   1. Creates or identifies a Kafka broker and topic that is going to be the destination for the CDC captured from the Snowflake stream.
   2. Sets up the authentication mechanism for the Kafka broker, which is going to be used by the connector.
3. A data engineer performs the following tasks:

   1. Installs and configures the connector.
   2. Provides Snowflake credentials and configuration.
   3. Provides Kafka credentials and configuration.
   4. Provides connector parameters.

## Stream metadata columns

Stream metadata columns `METADATA$ROW_ID`, `METADATA$ISUPDATE`, and `METADATA$ACTION` are sent to the Kafka topic.
The names of these columns are modified before they are sent to Kafka.
In the JSON message payload that is sent, they become `METADATA_ROW_ID`, `METADATA_ISUPDATE`, and `METADATA_ACTION`.

For more information, see [Stream columns](../../../../streams-intro.md).

## Limitations

* A single connector can only capture CDCs from one Snowflake stream.
* Messages are sent without a schema.
* Schema evolution is not supported.

## Next steps

[Set up the Openflow Connector for Snowflake to Kafka](setup.md)

---
title: About Openflow Connector for SQL Server
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sql-server/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for SQL Server

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts, workflow, and limitations of the Openflow Connector for SQL Server.

## About the Openflow Connector for SQL Server

The Openflow Connector for SQL Server connects a SQL Server database instance to Snowflake and replicates data from selected tables in near real-time or on schedule.
The connector uses SQL Server
[Change Tracking](https://learn.microsoft.com/en-us/sql/relational-databases/track-changes/about-change-tracking-sql-server)
to detect and apply changes to replicated tables. Change data is recorded in journal tables alongside the
current state of the replicated tables.

## Use cases

Use this connector if you’re looking to do the following:

* Synchronization of SQL Server data with Snowflake for comprehensive, centralized reporting.

## Supported SQL Server versions

The following SQL Server database versions and platforms are supported:

* [Microsoft SQL Server 2022](https://www.microsoft.com/sql-server)
* Microsoft SQL Server 2019
* Microsoft SQL Server 2017
* Microsoft SQL Server 2016
* [Azure SQL Database](https://learn.microsoft.com/azure/azure-sql/database/?view=azuresql)
* [Azure SQL Managed Instance](https://learn.microsoft.com/azure/azure-sql/managed-instance/?view=azuresql)
* [AWS RDS for SQL Server](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/CHAP_SQLServer.html)
* Google Cloud SQL for SQL Server

> **Note:**
>
> The connector relies on SQL Server Change Tracking, which is available starting with SQL Server 2008.
> Earlier versions do not support this feature and are incompatible with the connector.

## Openflow requirements

* The runtime size must be at least Medium. Use a bigger runtime when replicating large data volumes,
  especially when row sizes are large.
* The connector does not support multi-node Openflow runtimes. Configure the runtime for this connector
  with Min nodes and Max nodes set to `1`.

## Limitations

* You cannot run multiple connectors of the same type in a single runtime instance.
* The connector supports only username and password authentication with SQL Server.
* The connector only replicates tables with data types that are supported by Snowflake. For a list of these
  data types, see [Summary of data types](../../../../../sql-reference/intro-summary-data-types.md).
* The connector only replicates database tables that contain primary keys.
* The connector does not update existing records in the Snowflake database when a new NOT NULL column with
  a default value is added to one of the source databases.
* The connector does not update existing records in the Snowflake database when a new column is added to
  the included list in the Column Filter JSON.
* After you delete a column in one of the source databases and add it back with the same name, additional
  deletes cause errors.
* After you include a column in Column Filter JSON and exclude it, additional include attempts cause
  errors.
* The connector supports source table schema changes, except for changing primary key
  definitions, changing the precision, or the scale of a numeric column.
* The connector does not support the truncate table operation.
* The connector does not support re-adding a column after it is dropped.
* The connector does not replicate individual values larger than 16 MB. By default, processing such a value results in the associated table being marked permanently failed.
  To prevent table failures, modify the **Oversized Value Strategy** destination parameter.

> **Note:**
>
> You can bypass limitations affecting certain table columns by excluding these specific columns from replication.

## Workflow

The following workflow outlines the steps to set up and run the Openflow Connector for SQL Server:

1. A SQL Server database administrator performs the following tasks:

   1. Configures SQL Server replication settings and enables change tracking on the databases and tables
      being replicated.
   2. Creates credentials for the connector.
   3. (Optional) Provides the SSL certificate to connect to the SQL Server instance over SSL.
2. A Snowflake account administrator performs the following tasks:

   1. Creates a service user for the connector, a destination database to store replicated data,
      and a warehouse for the connector.
   2. Installs the connector.
   3. Specifies the required parameters for the connector flow definition.
   4. Runs the flow.

The connector does the following when run in Openflow:

1. Creates the schemas and destination tables matching the source tables configured for replication.
2. Begins replication according to the table replication lifecycle.

   For more information, see How tables are replicated.

## How the connector works

The following sections describe how the connector works in various scenarios, including replication, changes in schema, and data retention.

### Change tracking behavior

The connector uses SQL Server
[Change Tracking](https://learn.microsoft.com/en-us/sql/relational-databases/track-changes/about-change-tracking-sql-server)
(CT) to detect changes in the source tables. Change Tracking reports the net effect of changes between
polling intervals. If a row is updated multiple times between two consecutive polls, the connector sees
only the most recent version of that row. Intermediate states are not preserved.

This makes the connector suitable for **data synchronization** use cases, where the goal is to keep
the destination table in sync with the source. It is not suitable for **audit or history** use cases
where every intermediate change to a row must be captured.

### Data replication

The connector supports replicating tables from multiple SQL Server databases in a single SQL Server instance. The connector creates replicated tables from different databases in separate schemas in the destination Snowflake database.

Reference replicated tables by combining the source database name, the source schema name, and the
table name in the following format:

`<database_name>.<schema_name>.<table_name>`

For each schema in each source database being replicated, the connector creates a separate schema in the destination Snowflake database.
The name of the destination schema is a combination of the source database name and the source schema name, separated by an underscore character (`_`) as shown in the following example:

`<source_database_name>_<source_schema_name>`

The connector creates tables in the destination schema with the same name as the source table name as shown in the following example:

`<destination_database>_<destination_schema_name>.<source_table_name>`

### How tables are replicated

The connector replicates tables in the following stages:

1. Schema introspection: The connector discovers the columns in the source table, including the column
   names and types, then validates them against Snowflake’s and the connector’s limitations. Validation
   failures cause this stage to fail, and the cycle completes. After successful completion of this stage,
   the connector creates an empty destination table.
2. Snapshot load: The connector copies all data available in the source table into the destination table.
   If this stage fails, the connector stops replicating data. After successful completion, the data from the
   source table is available in the destination table.
3. Incremental load: The connector tracks changes in the source table and applies those changes to the
   destination table. This process continues until the table is removed from replication. Failure at this
   stage permanently stops replication of the source table, until the issue is resolved.

For information on bypassing snapshot load and using the incremental load process, see [Incremental replication](incremental-replication.md).

### Table replication status

Interim failures, such as connection errors, do not prevent table replication. However,
permanent failures, such as unsupported data types, prevent table replication.

To troubleshoot replication issues or verify that a table has been successfully removed from the replication flow, check the Table State Store:

1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services. A table listing controller services displays.
2. Locate the row labeled Table State Store, click the More  button on the right side of the row, and then choose View State.

A list of tables and their current states displays. Type in the search box to filter the list by table name. The possible states are:

* NEW: The table is scheduled for replication but replication hasn’t started.
* SNAPSHOT_REPLICATION: The connector is copying existing data. This status displays until all records are stored in the destination table.
* INCREMENTAL_REPLICATION: The connector is actively replicating changes. This status displays after snapshot replication ends and continues to display indefinitely until a table is either removed from replication or replication fails.
* FAILED: Replication has permanently stopped due to an error.

> **Note:**
>
> The Openflow runtime canvas doesn’t display table status changes — only the current table status. However, table status changes are recorded in logs when they occur. Look for the following log message:
>
> ```text
> Replication state for table <database_name>.<schema_name>.<table_name> changed from <old_state> to <new_state>
> ```

If a permanent failure prevents table replication, remove the table from replication. After you address the problem that caused the failure, you can add the table back to replication. For more information, see [Restart table replication](setup.md).

### Source database locking behavior

During snapshot and incremental replication, the connector reads from the source database tables to
retrieve row data and track changes.

Under SQL Server’s default READ COMMITTED isolation level, these read operations acquire shared locks on the
source tables. If other database clients hold conflicting locks on the same tables at the same time, this can
lead to deadlocks, where SQL Server terminates one of the conflicting sessions.

To avoid deadlocks between the connector and other database clients, enable
[Read Committed Snapshot Isolation (RCSI)](https://learn.microsoft.com/en-us/dotnet/framework/data/adonet/sql/snapshot-isolation-in-sql-server)
on the source database:

```sqlexample
ALTER DATABASE <database> SET READ_COMMITTED_SNAPSHOT ON;
```

With RCSI enabled, read operations use row versioning instead of shared locks, which eliminates lock
contention between the connector and concurrent write transactions on the source database.

## Understanding data retention

The connector follows a data retention philosophy where customer data is never automatically deleted.
You maintain full ownership and control over your replicated data, and the connector preserves historical
information rather than permanently removing it.

This approach has the following implications:

* Rows deleted from the source table are soft-deleted in the destination table rather than physically removed.
* Columns dropped from the source table are renamed in the destination table rather than dropped.
* Journal tables are retained indefinitely and are not automatically cleaned up.

### Destination table metadata columns

Each destination table includes the following metadata columns that track replication information:

| Column name | Type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | The timestamp when the row was originally inserted into the destination table. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | The timestamp when the row was last updated in the destination table. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Indicates whether the row was deleted from the source table. When `true`, the row has been soft-deleted and no longer exists in the source. |

### Soft-deleted rows

When a row is deleted from the source table, the connector does not physically remove it from the
destination table. Instead, the row is marked as deleted by setting the `_SNOWFLAKE_DELETED` metadata
column to `true`.

This approach allows you to:

* Retain historical data for auditing or compliance purposes.
* Query deleted records when needed.
* Decide when and how to permanently remove data based on your requirements.

To query only active (non-deleted) rows, filter on the `_SNOWFLAKE_DELETED` column:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = FALSE;
```

To query deleted rows:

```sqlexample
SELECT * FROM my_table WHERE _SNOWFLAKE_DELETED = TRUE;
```

### Dropped columns

When a column is dropped from the source table, the connector does not drop the corresponding column
from the destination table. Instead, the column is renamed by appending the `__SNOWFLAKE_DELETED` suffix
to preserve historical values.

For example, if a column named `EMAIL` is dropped from the source table, it is renamed to
`EMAIL__SNOWFLAKE_DELETED` in the destination table. Rows that existed before the column was dropped
retain their original values, while rows added after the drop have `NULL` in this column.

You can still query historical values from the renamed column:

```sqlexample
SELECT EMAIL__SNOWFLAKE_DELETED FROM my_table;
```

### Renamed columns

Due to limitations in CDC (Change Data Capture) mechanisms, the connector cannot distinguish between
a column being renamed and a column being dropped followed by a new column being added. As a result,
when you rename a column in the source table, the connector treats this as two separate operations:
dropping the original column and adding a new column with the new name.

For example, if you rename a column from `A` to `B` in the source table, the destination table
will contain:

* `A__SNOWFLAKE_DELETED`: Contains values from before the rename. Rows added after the rename have
  `NULL` in this column.
* `B`: Contains values from after the rename. Rows that existed before the rename have `NULL`
  in this column.

#### Querying renamed columns

To retrieve data from both the original and renamed columns as a single unified column, use a
`COALESCE` or `CASE` expression:

```sqlexample
SELECT
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

Alternatively, using a `CASE` expression:

```sqlexample
SELECT
    CASE
        WHEN B IS NOT NULL THEN B
        ELSE A__SNOWFLAKE_DELETED
    END AS A_RENAMED_TO_B
FROM my_table;
```

#### Creating a view for renamed columns

Rather than manually modifying the destination table, you can create a view that presents the renamed
column as a single unified column. This approach is recommended because it preserves the original data
and avoids potential issues with ongoing replication.

```sqlexample
CREATE VIEW my_table_unified AS
SELECT
    *,
    COALESCE(B, A__SNOWFLAKE_DELETED) AS A_RENAMED_TO_B
FROM my_table;
```

> **Important:**
>
> Manually modifying the destination table structure (such as dropping or renaming columns) is not
> recommended, as it may interfere with ongoing replication and cause data inconsistencies.

### Journal tables

During incremental replication, changes from the source database are first written to journal tables
before being merged into the destination tables. The connector does not automatically remove data from
journal tables, as this data may be useful for auditing, debugging, or reprocessing purposes.

Journal tables are created in the same schema as their corresponding destination tables and follow
this naming convention:

`<TABLE_NAME>_JOURNAL_<timestamp>_<number>`

Where:

* `<TABLE_NAME>` is the name of the destination table.
* `<timestamp>` is the creation timestamp in Unix epoch format (seconds since January 1, 1970),
  ensuring uniqueness.
* `<number>` starts at 1 and increments whenever the destination table schema changes, either due to
  schema changes in the source table or modifications to column filters.

For example, if your destination table is `SALES.ORDERS`, the journal table might be named
`SALES.ORDERS_JOURNAL_1705320000_1`.

> **Important:**
>
> Do not drop journal tables while replication is in progress. Removing an active journal table may
> cause data loss or replication failures. Only drop journal tables after the corresponding source
> table has been fully removed from replication.

#### Managing journal table storage

If you need to manage storage costs by removing old journal data, you can create a Snowflake task
that periodically cleans up journal tables for tables that are no longer being replicated.

Before implementing journal cleanup, verify that:

* The corresponding source tables have been fully removed from replication.
* You no longer need the journal data for auditing or processing purposes.

For information on creating and managing tasks for automated cleanup, see
[Introduction to tasks](../../../../tasks-intro.md).

## Next steps

Review [Openflow Connector for SQL Server: Data mapping](data-mapping.md) to understand how the connector maps data types to Snowflake data types.

Review [Set up the Openflow Connector for SQL Server](setup.md) to set up the connector.

---
title: About Openflow Connector for Workday
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/workday/about.md
section: Loading & Unloading Data
---

# About Openflow Connector for Workday

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for Workday allows to ingest Workday reports into
Snowflake. It is built as the Apache NiFi flow and uses the RaaS
(Report-as-a-Service) API to fetch data from Workday. The connector
persists data in a dedicated table in the database and schema provided
in the configuration.

Use this connector if you’re looking to do the following:

* Get Workday data into Snowflake using Report-as-a-Service (RaaS) streams for enterprise-level analytics and planning

## Limitations

* Only advanced Workday reports are supported.
* Only reports in the JSON format are supported.
* All limitations of the RaaS API apply.
* The schema discovery is not supported - schema of a destination table
  is inferred based on data fetched from Workday.
* The incremental load is not supported - the connector uses the
  truncate & load ingestion strategy.

## Next steps

[Set up the Openflow Connector for Workday](setup.md)

---
title: About Openflow: BYOC deployments
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/about-byoc.md
section: Loading & Unloading Data
---

# About Openflow: BYOC deployments

Openflow BYOC *is* Openflow and contains all the benefits of Openflow, but within your existing cloud.

## Typical BYOC workflow

| User persona | Task |
| --- | --- |
| AWS cloud engineer/administrator | Creates a set of deployments in their AWS cloud account.  The Openflow UI is used to manage deployments and runtime creation and maintenance. The Openflow UI allows users to create, upgrade, and delete runtimes in all deployments.  Snowflake sign-ins are used to authenticate to Openflow, and roles and privileges are used to control access to Openflow deployments and runtimes. |
| Data engineer (pipeline author, responsible for data ingestion) | Uses the runtime canvas to build completely new flows or to configure deployed connectors.  Creates a completely new flow or uses an existing connector as-is or as a starting point to customize. Populates data in the bronze layer within your Snowflake account (or other target system).  Connectors are a simple way to solve for a specific integration use case, and less technical users can deploy them without necessarily needing a data engineer. |
| Data engineer (pipeline operator) | Configures the flow parameters and runs the flow. |
| Data engineer (responsible for transformation to silver and gold layers) | Responsible for transforming data from the bronze layer that was populated by the pipeline to silver and gold layers for analytics. |
| Business user | Makes use of gold layer objects for analytics. |

## Limitations

* As described in the [Snowflake Openflow BYOC terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-terms/),
  securing Openflow BYOC is a shared responsibility model.
* Openflow authorization uses roles and their associated privileges that are directly granted to the user.
  Currently, Openflow does not support authorization when the role is attached to another role within the user’s role hierarchy.

## Next steps

[Set up Openflow - BYOC](setup-openflow-byoc.md)

---
title: About Snowflake and SAP® Zero-Copy Integration
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/about-sap-snowflake.md
section: Loading & Unloading Data
---

# About Snowflake and SAP® Zero-Copy Integration

Snowflake and SAP® have partnered to offer customers a seamless zero-copy integration between the two platforms. The integration leverages SAP® Business Data Cloud that enables customers to harmonize SAP® and non-SAP® data at scale in Snowflake, while optimizing total cost of ownership across workloads.

Leveraging zero copy data access, data and AI teams can work with semantically rich SAP® Data Products in real time without added cost and complexity of ETL pipelines, and allows them to build AI and machine learning applications fueled by trusted SAP Data Products and grounded in the context of all their mission-critical data, ensuring accurate, reliable, and trustworthy AI outcomes.

## Two Ways to Integrate Snowflake and SAP®

The integration delivers two distinct offerings, providing customers choice.
Both leverage SAP® Business Data Cloud to enable zero-copy data sharing between SAP® Business Data Cloud and Snowflake.

### SAP® Snowflake

Designed for new Snowflake customers, SAP® Snowflake makes Snowflake available in SAP® Business Data Cloud as a certified SAP® Solution Extension. From advanced analytics and ML to data engineering, applications, and marketplace it puts the Snowflake platform directly in the hands of SAP® users. For more information, see [SAP Snowflake](https://www.sap.com/products/data-cloud/snowflake.html) in the SAP® documentation.

#### SAP® Business Data Cloud Connect for Snowflake

Designed for existing Snowflake customers, SAP® Business Data Cloud (BDC) Connect for Snowflake
enables customers to share Data Products from SAP® BDC with their existing Snowflake accounts.
This gives Snowflake users real-time access to semantically rich SAP® Data Products without duplication of data.

For more information and set up instructions for either of these offerings, see [Setup tasks for SAP® Snowflake and SAP® BDC Connect for Snowflake](sap-sql/setup-tasks.md).

---
title: About the Openflow Connector for Google BigQuery
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-big-query/about.md
section: Loading & Unloading Data
---

# About the Openflow Connector for Google BigQuery

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for Google BigQuery connects a Google BigQuery project to Snowflake and replicates data from selected datasets, tables, and views on a schedule. The connector performs an initial full load for each table, followed by incremental updates using BigQuery’s native change-tracking functionality. Views are replicated using a truncate and load strategy.

## Use cases

The connector supports the following use cases:

* **Replication to Snowflake:** Continuously mirror datasets from BigQuery into Snowflake for downstream analytics and modeling. Incremental changes arrive on a schedule with a 10 minute delay window.
* **Selective replication:** Define which regions, datasets, tables, and views to include using names or regex filters for broad coverage with control.
* **Migration and change capture:** Perform a one-time snapshot load for migrations, then run incremental syncs using BigQuery’s change history to keep tables in sync.
* **View replication:** Replicate standard and materialized BigQuery views to Snowflake using a truncate and load strategy on a configurable schedule.

## The table replication lifecycle

A table’s replication cycle begins with schema discovery and an initial snapshot load of
the data. The cycle transitions to incremental synchronization after data has been ingested into Snowflake.

1. **Schema Introspection:** The connector discovers the source table’s schema, validates its data types, and creates a corresponding destination schema and table in Snowflake.
2. **Snapshot Load:** After creating schema and table, the connector performs a full copy of all existing data from the BigQuery table to Snowflake. This process runs sequentially for each table in the configuration.
3. **Incremental Sync:** Once the initial load is complete, the table enters a scheduled incremental synchronization mode. On each run, the connector uses BigQuery’s CHANGES function to read the journal of row-level changes (inserts, updates, deletes) that occurred since the last synchronization. These changes are then fetched and merged into the destination table in Snowflake.

## Openflow requirements

The minimum runtime size must be `Medium`. Use a larger runtime and multi-node Openflow setup if you
are replicating large data volumes.

## Limitations

* BigQuery guarantees that data streams used to fetch source data remain
  valid for at least 6 hours. As a result, the process of reading the
  source table must be completed in less than 6 hours to prevent the data
  streams from expiring. You must use a larger, multi-node runtime when
  ingesting tables with data volumes that are larger than 100GB.
* BigQuery’s BIGNUMERIC type supports a higher precision (up to 76 digits) than Snowflake’s
  NUMBER type (38 digits). The connector cannot ingest values from
  BIGNUMERIC columns that exceed the Snowflake limit.
* The connector does not support replication of external tables.
* View replication uses a truncate and load strategy only. Incremental
  synchronization (CDC) is not supported for views.
* Incremental syncs require a primary key to correctly handle updates
  and deletes. For tables without a primary key, the connector does not
  support deletes and treats updates as new inserts.

  > **Note:**
  >
  > You must ensure that the primary key constraints are met. If the
  > field marked as the primary key is not unique, data inconsistency
  > can occur during incremental mode.
* The connector uses the [BigQuery’s CHANGES](https://cloud.google.com/bigquery/docs/reference/standard-sql/time-series-functions#changes) function for incremental updates.
  Because this function cannot query the last ten minutes of table
  history, replicated data in incremental mode has a minimum 10-minute
  lag behind the source.
* The incremental sync process is limited to a maximum 24-hour data
  window due to the BigQuery CHANGES function. If the replication lag
  for a table exceeds this period, the connector truncates the change
  window to 24 hours to proceed with the sync. This truncation can
  result in data loss.
* The connector inherits all other limitations of the BigQuery CHANGES
  function. For more information, see the
  [BigQuery CHANGES function documentation](https://cloud.google.com/bigquery/docs/reference/standard-sql/time-series-functions#changes).

## View replication

The connector supports replication of standard views and materialized views from BigQuery to Snowflake. Unlike table replication, views do not support incremental synchronization (CDC). Instead, the connector uses a **truncate and load** strategy: on each synchronization cycle, the connector fully replaces the data in the Snowflake destination table with the current contents of the source view.

The view synchronization frequency is configured separately from table incremental sync frequency using the **View Sync Frequency** parameter. Runs do not overlap. If a cycle takes longer than the configured interval, the next run waits for the previous run to finish.

You can filter which views to replicate using the **Included View Names** and **Included View Names Regex** parameters. These filters apply across all datasets selected for replication.

The connector creates temporary tables in BigQuery during view ingestion. Use the **Temporary Table Dataset** parameter to specify a dedicated dataset for these temporary tables. Snowflake recommends using a separate dataset for temporary tables and not using the ingested dataset for this purpose.

## Data type mapping

The connector maps BigQuery data types to the corresponding Snowflake data types.

| BigQuery Data Type | Snowflake Data Type |
| --- | --- |
| BIGNUMERIC | NUMBER |
| NUMERIC | NUMBER |
| GEOGRAPHY | VARCHAR |
| DATETIME | TIMESTAMP_NTZ |
| JSON | OBJECT |
| STRUCT | OBJECT |
| RANGE | OBJECT |
| INTERVAL | OBJECT |
| TIMESTAMP | TIMESTAMP_NTZ |
| DATE | DATE |
| TIME | TIME |
| INT64 / INTEGER | NUMBER |
| FLOAT64 | FLOAT |
| BOOL / BOOLEAN | BOOLEAN |
| STRING | VARCHAR |
| BYTES | BINARY |
| ARRAY | ARRAY |

## Track data changes in Google BigQuery

The connector’s incremental sync functionality is built on [BigQuery’s native CHANGES function](https://cloud.google.com/bigquery/docs/reference/standard-sql/time-series-functions#changes). When you enable change history on a source table, BigQuery maintains an internal journal of all row-level modifications (inserts, updates, and deletes).

The connector queries this journal on a configured incremental sync frequency schedule to retrieve a feed of changes. The connector materializes these changes into a journal table within the same BigQuery dataset. This journal table follows a consistent naming convention: `<sourceTableName>_<incremental_number>_<hash>_journal`

These journal tables are managed entirely by the connector during the replication process and are used to merge data into the final destination table in Snowflake.

> **Warning:**
>
> Do not modify the journal tables in any way. Modifying journal tables can disrupt the synchronization process and lead to data integrity issues.

The merge operation handles changes differently for tables with a Primary Key (PK) and tables without one.

### Tables with a Primary Key

For tables with a primary key, the connector handles data changes as follows:

Inserts and Updates:
:   Rows identified as `INSERT` or `UPDATE` are “upserted” into the corresponding Snowflake table.

Deletes:
:   To preserve data history, the connector uses a soft-delete strategy. Instead of physically removing a deleted row from Snowflake, the connector performs an `UPDATE` on the target row, setting the `_SNOWFLAKE_DELETED` column to `TRUE`.

### Tables without a Primary Key

For tables without a primary key, the connector handles data changes as follows:

Inserts and Updates:
:   Rows identified as `INSERT` or `UPDATE` are treated the same way and are inserted into the corresponding Snowflake table.

Deletes:
:   Not supported.

> **Note:**
>
> The connector automatically adds the `_SNOWFLAKE_DELETED` (BOOLEAN) column to the destination table schema when it is created.

### Configured synchronization frequency schedule vs actual synchronization frequency

The Incremental Sync Frequency schedule determines the table synchronization frequency. If the schedule
you specified is more frequent than the actual time required to synchronize the table, the system does not follow
the schedule you specified. This occurs because incremental cycles must execute sequentially and cannot overlap.

## Schema Evolution

The connector supports several common schema changes in the source BigQuery table. The following
schema changes are detected and propagated to the Snowflake destination table:

Column Addition:
:   New columns added in BigQuery are automatically added to the corresponding Snowflake table.

Column Deletion (Soft Delete):
:   When a column is dropped in BigQuery, the connector performs a “soft delete” in
    Snowflake. The column is not dropped from the destination table. Instead, it is renamed by adding the
    `_SNOWFLAKE_DELETED` suffix to the end of the column name. For example `my_column` becomes `my_column_SNOWFLAKE_DELETED`. This preserves historical data in Snowflake.

Column Rename:
:   A column rename operation is a two-step process:

    1. The original column is “soft deleted” and renamed with the `_SNOWFLAKE_DELETED` suffix added.
    2. A new column with the new name is added to the Snowflake table.

Primary Key Modification:
:   Adding, removing and changing primary keys is supported.

Data Type Changes:
:   Only changes that widen the existing type are tolerated. Any change that narrows a column’s type or converts it to an incompatible type is not supported and will cause replication for that table to fail.

## Next steps

For information on how to set up the connector, see the following topic:

* [Setting Up the Openflow Connector for Google BigQuery](setup.md)

---
title: About the Openflow Connector for Salesforce Bulk API
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/about.md
section: Loading & Unloading Data
---

# About the Openflow Connector for Salesforce Bulk API

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the basic concepts of the Openflow Connector for Salesforce Bulk API, its workflow, and limitations.

## Zero-copy integration with Salesforce Data Cloud

Snowflake offers zero-copy bidirectional sharing and integration with Salesforce. This integration is recommended if you use Salesforce Data Cloud and require near real-time bidirectional integration.

For more information about zero-copy integration with Salesforce Data Cloud, see the following blog posts:

* [Share Your Data from Salesforce Data Cloud to Snowflake](https://developer.salesforce.com/blogs/2024/08/share-your-data-from-salesforce-data-cloud-to-snowflake)
* [Zero Copy Data Federation with Snowflake and Salesforce Data Cloud](https://developer.salesforce.com/blogs/2024/08/zero-copy-data-federation-with-snowflake-and-salesforce-data-cloud)

## About the Openflow Connector for Salesforce Bulk API

The Openflow Connector for Salesforce Bulk API provides replication-based data integration.
This connector is designed for users who do not use Salesforce Data Cloud and prefer a fully managed Snowflake Openflow connector.
The connector uses public Salesforce REST APIs to replicate data from Salesforce to Snowflake at a user-defined frequency.
The connector supports Change Data Capture (CDC) and keeps data in Snowflake in sync with Salesforce.

You can use one or both types of data integrations depending on your specific use cases. This topic describes how to set up and use the Openflow Connector for Salesforce Bulk API to replicate data from Salesforce to Snowflake.

## Use cases

Use the Openflow Connector for Salesforce Bulk API to replicate standard or custom objects from Salesforce to Snowflake at a user-specified frequency and keep them up to date in Snowflake.

## Workflow

The following workflow describes the steps to set up and use the Openflow Connector for Salesforce Bulk API.

1. A Salesforce administrator creates and configures an external client app in Salesforce and approves it for a specific user.
2. The Openflow administrator performs the following tasks:

   1. Create a service user for the connector, a warehouse for the connector, and a destination database and schema to replicate into.
   2. Install the connector.
   3. Specify the required parameters for the flow template.
3. The data engineer runs the flow to replicate objects from Salesforce to Snowflake.

## Limitations

Consider the following limitations when using the connector:

* Custom Salesforce domains are not supported.
* Traversing object relationships and fetching related objects is not supported.
* The connector does not support hard deletes in Snowflake. You can either run a
  query on the destination table to delete all rows where the `isDeleted` column is `true` or perform a full refresh of the destination table to reflect “hard deletes”.
* Fields of type `location`, `address`, or `base64` are not supported and are
  ignored.
* You cannot consolidate data from multiple Salesforce instances into a single database in Snowflake.
  Data from a single Salesforce instance or org is ingested into a single database in Snowflake. A table is created in this database for each Salesforce object replicated.
* Files attached to Salesforce records are ignored.
* Formula fields are not replicated as data from Salesforce. Instead, the connector
  can translate supported Salesforce formulas into Snowflake SQL views. See
  Salesforce formula fields for details on supported formulas and limitations.

## Authentication

The connector uses the OAuth 2.0 JWT Bearer Flow via an external client app to connect to Salesforce and to retrieve data. This is the only supported OAuth flow for the connector. Using a different OAuth flow type (such as Authorization Code Flow) or misconfiguring the external client app can result in `invalid_grant` errors.

See [Openflow Connector for Salesforce Bulk API: Set up Salesforce](setup-salesforce.md) for documentation on how to configure the external client app in Salesforce, and [Troubleshooting the Openflow Connector for Salesforce Bulk API](troubleshoot.md) for help with common authentication errors.

## Replication lifecycle

The connector replicates data in two stages: initial replication and incremental replication.

### Initial replication

The connector calls the Salesforce Bulk API 2.0 to discover standard and custom objects specified in the connector configuration. The connector respects Bulk API 2.0 API limits.

* The connector creates one table per custom or standard object with one column for each field.
* The connector uses Snowpipe Streaming for the initial load to insert rows in the table based on the values of the fields from the Salesforce object.

### Incremental replication

Incremental updates use a Snowflake warehouse that can be configured in the connector parameters. Depending on your latency and data freshness requirements, you can configure the refresh frequency for updates from 1 minute to 24 hours, which determines how often the tables in Snowflake are refreshed.

Using the refresh frequency you specify, the connector calls the Salesforce Bulk API to detect changes in previously ingested objects. The connector identifies changed records by checking specific timestamp fields in the Salesforce objects.

For most objects, the connector uses the `SystemModstamp` field. If `SystemModstamp` is not available, the connector attempts to use the following fields, in order of preference:

1. `LastModifiedDate`
2. `CreatedDate`
3. `LoginTime`

> **Note:**
>
> For history tables (objects where History Tracking is enabled), the connector always uses the `CreatedDate` field to detect changes.

The connector then uses Snowpipe Streaming to push the incremental data into a staging table and executes a merge query to load the data into the final destination table.

## Schema evolution

The connector supports schema evolution when the source objects change in Salesforce.

When a new field is added to the source object:
:   The connector adds a new column to the destination table in Snowflake.

When an existing field is renamed in the source object:
:   The connector treats the rename as both a field deletion and a field addition. The field
    addition causes a new column to be added to the destination table. The field deletion
    is handled as described next.

When an existing field is deleted in the source object:
:   The connector supports three strategies:

    * Delete: Deletes the corresponding column in the destination table in Snowflake. This is the default behavior.
    * Ignore: Ignores the deleted field in the source and skips it in the future.
    * Rename: Renames the deleted field in the destination table.

For example, if the deletion strategy is set to `Ignore` and a field is renamed for a Salesforce object, the existing column in Snowflake will be unchanged and a new column with the new field name will be added.

## How objects are deleted

When objects are deleted in Salesforce, the connector does not “hard delete” them from Snowflake. The connector performs “soft deletes” for objects deleted in Salesforce and indicates that the source objects were deleted by setting the `isDeleted` column to `true` in the corresponding Snowflake tables.

The connector does not support “hard deletes”. You can either run a query on the destination table to delete all rows where the `isDeleted` column is `true` or perform a full refresh of the destination table to reflect “hard deletes”.

The connector may miss delete operations in situations where objects are deleted in Salesforce and purged from Salesforce’s recycling bin when the connector was not running, for example if the connector was paused or stopped. You must perform a full refresh of the destination table to recover in these situations.

## Automatic retry handling

The connector automatically retries failed operations or API errors using an exponential backoff strategy. The connector waits one second before the first retry, then doubles the wait time for each subsequent retry (two seconds, four seconds, and so on). If the failures persist, the connector stops retrying until the next scheduled run. You can monitor this activity in the [event table](../../monitor.md).

## Use multiple connector instances to handle different sync schedules

If you need to sync different objects at different frequencies, for example some every 30 minutes and others every 24 hours, Snowflake recommends deploying two separate connector instances within the same runtime. You can then configure the sync parameters independently for each instance.

> **Note:**
>
> Deploying multiple connector instances in the same runtime does not incur additional costs.

Similarly, if you need to fully fetch some objects every time the connector runs, Snowflake recommends deploying two separate connector instances within the same runtime and configuring the parameters for each instance.

## Salesforce formula fields

Salesforce formula fields are calculated fields whose values are derived from expressions defined in Salesforce. Because the Salesforce Bulk API does not support incremental retrieval of formula field values, the connector takes a different approach: it translates the Salesforce formula expressions into Snowflake SQL and creates a view for each object that contains formula fields.

To enable this feature, set the Enable Views Creation parameter to `true` in the connector configuration. See [Openflow Connector for Salesforce Bulk API: Configure the connector](configure-connector.md) for details.

For more information, see [Openflow Connector for Salesforce Bulk API: Salesforce formula fields](formula-fields.md).

## Next steps

For information on how to set up the connector, see the following topic:

* [Openflow Connector for Salesforce Bulk API: Set up Salesforce](setup-salesforce.md)

---
title: ADLSCredentialsControllerService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/adlscredentialscontrollerservice.md
section: Loading & Unloading Data
---

# ADLSCredentialsControllerService

## Description

Defines credentials for ADLS processors.

## Tags

adls, azure, cloud, credentials, microsoft, storage

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Account Key \* | Account Key |  |  | The storage account key. This is an admin-like password providing access to every container in this account. It is recommended one uses Shared Access Signature (SAS) token, Managed Identity or Service Principal instead for fine-grained control with policies. There are certain risks in allowing the account key to be stored as a FlowFile attribute. While it does provide for a more flexible flow by allowing the account key to be fetched dynamically from a FlowFile attribute, care must be taken to restrict access to the event provenance data (e.g., by strictly controlling the policies governing provenance for this processor). In addition, the provenance repositories may be put on encrypted disk partitions. |
| Credentials Type \* | Credentials Type | SAS_TOKEN | * Account Key * SAS Token * Managed Identity * Service Principal | Credentials type to be used for authenticating to Azure |
| Endpoint Suffix \* | Endpoint Suffix | dfs.core.windows.net |  | Storage accounts in public Azure always use a common FQDN suffix. Override this endpoint suffix with a different suffix in certain circumstances (like Azure Stack or non-public Azure regions). |
| Managed Identity Client ID | Managed Identity Client ID |  |  | Client ID of the managed identity. The property is required when User Assigned Managed Identity is used for authentication. It must be empty in case of System Assigned Managed Identity. |
| SAS Token \* | SAS Token |  |  | Shared Access Signature token (the leading ‘?’ may be included) There are certain risks in allowing the SAS token to be stored as a FlowFile attribute. While it does provide for a more flexible flow by allowing the SAS token to be fetched dynamically from a FlowFile attribute, care must be taken to restrict access to the event provenance data (e.g., by strictly controlling the policies governing provenance for this processor). In addition, the provenance repositories may be put on encrypted disk partitions. |
| Service Principal Client ID \* | Service Principal Client ID |  |  | Client ID (or Application ID) of the Client/Application having the Service Principal. |
| Service Principal Client Secret \* | Service Principal Client Secret |  |  | Password of the Client/Application. |
| Service Principal Tenant ID \* | Service Principal Tenant ID |  |  | Tenant ID of the Azure Active Directory hosting the Service Principal. |
| Storage Account Name \* | Storage Account Name |  |  | The storage account name. There are certain risks in allowing the account name to be stored as a FlowFile attribute. While it does provide for a more flexible flow by allowing the account name to be fetched dynamically from a FlowFile attribute, care must be taken to restrict access to the event provenance data (e.g., by strictly controlling the policies governing provenance for this processor). In addition, the provenance repositories may be put on encrypted disk partitions. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ADLSCredentialsControllerServiceLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/adlscredentialscontrollerservicelookup.md
section: Loading & Unloading Data
---

# ADLSCredentialsControllerServiceLookup

## Description

Provides an ADLSCredentialsService that can be used to dynamically select another ADLSCredentialsService. This service requires an attribute named ‘adls.credentials.name’ to be passed in, and will throw an exception if the attribute is missing. The value of ‘adls.credentials.name’ will be used to select the ADLSCredentialsService that has been registered with that name. This will allow multiple ADLSCredentialsServices to be defined and registered, and then selected dynamically at runtime by tagging flow files with the appropriate ‘adls.credentials.name’ attribute.

## Tags

adls, azure, cloud, credentials, microsoft, storage

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: All controller services (alphabetical)
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/index.md
section: Loading & Unloading Data
---

# All controller services (alphabetical)

This topic provides a list of all openflow controller services in alphabetical order.
The list includes:

> * Type of controller service (Snowflake or not)
> * The name of each controller service
> * A summary of each controller service

## A

|  | Controller | Description |
| --- | --- | --- |
|  | [ADLSCredentialsControllerService](adlscredentialscontrollerservice.md) | Defines credentials for ADLS processors. |
|  | [ADLSCredentialsControllerServiceLookup](adlscredentialscontrollerservicelookup.md) | Provides an ADLSCredentialsService that can be used to dynamically select another ADLSCredentialsService. |
|  | [AmazonGlueEncodedSchemaReferenceReader](amazonglueencodedschemareferencereader.md) | Reads Schema Identifier according to AWS Glue Schema encoding as a header consisting of a two byte markers and a 16 byte UUID |
|  | [AmazonGlueSchemaRegistry](amazonglueschemaregistry.md) | Provides a Schema Registry that interacts with the AWS Glue Schema Registry so that those Schemas that are stored in the Glue Schema Registry can be used in NiFi. |
|  | [AmazonMSKConnectionService](amazonmskconnectionservice.md) | Provides and manages connections to AWS MSK Kafka Brokers for producer or consumer operations. |
|  | [AmazonMSKConnectionService](amazonmskconnectionservice.md) | Provides and manages connections to AWS MSK Kafka Brokers for producer or consumer operations. |
|  | [ApicurioSchemaRegistry](apicurioschemaregistry.md) | Provides a Schema Registry that interacts with the Apicurio Schema Registry so that those Schemas that are stored in the Apicurio Schema Registry can be used in NiFi. |
|  | [AvroReader](avroreader.md) | Parses Avro data and returns each Avro record as an separate Record object. |
|  | [AvroRecordSetWriter](avrorecordsetwriter.md) | Writes the contents of a RecordSet in Binary Avro format. |
|  | [AvroSchemaRegistry](avroschemaregistry.md) | Provides a service for registering and accessing schemas. |
|  | [AWSCredentialsProviderControllerService](awscredentialsprovidercontrollerservice.md) | Defines credentials for Amazon Web Services processors. |
|  | [AzureBlobStorageFileResourceService](azureblobstoragefileresourceservice.md) | Provides an Azure Blob Storage file resource for other components. |
|  | [AzureCosmosDBClientService](azurecosmosdbclientservice.md) | Provides a controller service that configures a connection to Cosmos DB (Core SQL API) and provides access to that connection to other Cosmos DB-related components. |
|  | [AzureDataLakeStorageFileResourceService](azuredatalakestoragefileresourceservice.md) | Provides an Azure Data Lake Storage (ADLS) file resource for other components. |
|  | [AzureEventHubRecordSink](azureeventhubrecordsink.md) | Format and send Records to Azure Event Hubs |
|  | [AzureStorageCredentialsControllerService_v12](azurestoragecredentialscontrollerservice_v12.md) | Provides credentials for Azure Storage processors using Azure Storage client library v12. |
|  | [AzureStorageCredentialsControllerServiceLookup_v12](azurestoragecredentialscontrollerservicelookup_v12.md) | Provides an AzureStorageCredentialsService_v12 that can be used to dynamically select another AzureStorageCredentialsService_v12. |

## C

|  | Controller | Description |
| --- | --- | --- |
|  | [CEFReader](cefreader.md) | Parses CEF (Common Event Format) events, returning each row as a record. |
|  | [ConfluentEncodedSchemaReferenceReader](confluentencodedschemareferencereader.md) | Reads Schema Identifier according to Confluent encoding as a header consisting of a byte marker and an integer represented as four bytes |
|  | [ConfluentEncodedSchemaReferenceWriter](confluentencodedschemareferencewriter.md) | Writes Schema Identifier according to Confluent encoding as a header consisting of a byte marker and an integer represented as four bytes |
|  | [ConfluentProtobufMessageNameResolver](confluentprotobufmessagenameresolver.md) | Resolves Protobuf message names from Confluent Schema Registry wire format by decoding message indexes and looking up the fully qualified name in the schema definition For Confluent wire format reference see: <https://docs>. |
|  | [ConfluentSchemaRegistry](confluentschemaregistry.md) | Provides a Schema Registry that interacts with the Confluent Schema Registry so that those Schemas that are stored in the Confluent Schema Registry can be used in NiFi. |
|  | [CSVReader](csvreader.md) | Parses CSV-formatted data, returning each row in the CSV file as a separate record. |
|  | [CSVRecordLookupService](csvrecordlookupservice.md) | A reloadable CSV file-based lookup service. |
|  | [CSVRecordSetWriter](csvrecordsetwriter.md) | Writes the contents of a RecordSet as CSV data. |

## D

|  | Controller | Description |
| --- | --- | --- |
|  | [DatabaseLookup](databaselookup.md) | A Lookup Service that allows for enrichment with a database using a user-specified SQL statement. |
|  | [DatabaseRecordLookupService](databaserecordlookupservice.md) | A relational-database-based lookup service. |
|  | [DatabaseRecordSink](databaserecordsink.md) | Provides a service to write records using a configured database connection. |
|  | [DBCPConnectionPool](dbcpconnectionpool.md) | Provides Database Connection Pooling Service. |
|  | [DBCPConnectionPoolLookup](dbcpconnectionpoollookup.md) | Provides a DBCPService that can be used to dynamically select another DBCPService. |
|  | [DeveloperBoxClientService](developerboxclientservice.md) | Provides Box client objects through which Box API calls can be used. |
|  | [DistributedMapCacheLookupService](distributedmapcachelookupservice.md) | Allows to choose a distributed map cache client to retrieve the value associated to a key. |

## E

|  | Controller | Description |
| --- | --- | --- |
|  | [ElasticSearchClientServiceImpl](elasticsearchclientserviceimpl.md) | A controller service for accessing an Elasticsearch client, using the Elasticsearch (low-level) REST Client. |
|  | [ElasticSearchLookupService](elasticsearchlookupservice.md) | Lookup a record from Elasticsearch Server associated with the specified document ID. |
|  | [ElasticSearchStringLookupService](elasticsearchstringlookupservice.md) | Lookup a string value from Elasticsearch Server associated with the specified document ID. |
|  | [EmailRecordSink](emailrecordsink.md) | Provides a RecordSinkService that can be used to send records in email using the specified writer for formatting. |
|  | [EmbeddedHazelcastCacheManager](embeddedhazelcastcachemanager.md) | A service that runs embedded Hazelcast and provides cache instances backed by that. |
|  | [ExcelReader](excelreader.md) | Parses a Microsoft Excel document returning each row in each sheet as a separate record. |
|  | [ExternalHazelcastCacheManager](externalhazelcastcachemanager.md) | A service that provides cache instances backed by Hazelcast running outside of NiFi. |

## F

|  | Controller | Description |
| --- | --- | --- |
|  | [FreeFormTextRecordSetWriter](freeformtextrecordsetwriter.md) | Writes the contents of a RecordSet as free-form text. |

## G

|  | Controller | Description |
| --- | --- | --- |
|  | [GCPCredentialsControllerService](gcpcredentialscontrollerservice.md) | Defines credentials for Google Cloud Platform processors. |
|  | [GCSFileResourceService](gcsfileresourceservice.md) | Provides a Google Compute Storage (GCS) file resource for other components. |
|  | [GrokReader](grokreader.md) | Provides a mechanism for reading unstructured text data, such as log files, and structuring the data so that it can be processed. |

## H

|  | Controller | Description |
| --- | --- | --- |
|  | [HazelcastMapCacheClient](hazelcastmapcacheclient.md) | An implementation of DistributedMapCacheClient that uses Hazelcast as the backing cache. |
|  | [HikariCPConnectionPool](hikaricpconnectionpool.md) | Provides Database Connection Pooling Service based on HikariCP. |
|  | [HttpRecordSink](httprecordsink.md) | Format and send Records to a configured uri using HTTP post. |

## I

|  | Controller | Description |
| --- | --- | --- |
|  | [IPLookupService](iplookupservice.md) | A lookup service that provides several types of enrichment information for IP addresses. |

## J

|  | Controller | Description |
| --- | --- | --- |
|  | [JettyWebSocketClient](jettywebsocketclient.md) | Implementation of WebSocketClientService. |
|  | [JettyWebSocketServer](jettywebsocketserver.md) | Implementation of WebSocketServerService. |
|  | [JMSConnectionFactoryProvider](jmsconnectionfactoryprovider.md) | Provides a generic service to create vendor specific javax. |
|  | [JndiJmsConnectionFactoryProvider](jndijmsconnectionfactoryprovider.md) | Provides a service to lookup an existing JMS ConnectionFactory using the Java Naming and Directory Interface (JNDI). |
|  | [JsonConfigBasedBoxClientService](jsonconfigbasedboxclientservice.md) | Provides Box client objects through which Box API calls can be used. |
|  | [JsonPathReader](jsonpathreader.md) | Parses JSON records and evaluates user-defined JSON Path ‘s against each JSON object. |
|  | [JsonRecordSetWriter](jsonrecordsetwriter.md) | Writes the results of a RecordSet as either a JSON Array or one JSON object per line. |
|  | [JsonTableColumnFilter](jsontablecolumnfilter.md) | Provides a table column filter based on a JSON configuration. |
|  | [JsonTreeReader](jsontreereader.md) | Parses JSON into individual Record objects. |
|  | [JWTBearerOAuth2AccessTokenProvider](jwtbeareroauth2accesstokenprovider.md) | Provides OAuth 2. |

## K

|  | Controller | Description |
| --- | --- | --- |
|  | [Kafka3ConnectionService](kafka3connectionservice.md) | Provides and manages connections to Kafka Brokers for producer or consumer operations. |
|  | [Kafka3ConnectionService](kafka3connectionservice.md) | Provides and manages connections to Kafka Brokers for producer or consumer operations. |

## L

|  | Controller | Description |
| --- | --- | --- |
|  | [LoggingRecordSink](loggingrecordsink.md) | Provides a RecordSinkService that can be used to log records to the application log (nifi-app. |

## M

|  | Controller | Description |
| --- | --- | --- |
|  | [MapCacheClientService](mapcacheclientservice.md) | Provides the ability to communicate with a MapCacheServer. |
|  | [MapCacheServer](mapcacheserver.md) | Provides a map (key/value) cache that can be accessed over a socket. |
|  | [MicrosoftClientCertificateOAuth2TokenProvider](microsoftclientcertificateoauth2tokenprovider.md) | Provides OAuth2 access tokens for the Microsoft Graph API using client_credentials with a client certificate. |
|  | [MicrosoftGraphAuthenticationProvider](microsoftgraphauthenticationprovider.md) | Provides authentication for the Microsoft Graph API, which can be used for interacting with Microsoft 365 services. |
|  | [MongoDBControllerService](mongodbcontrollerservice.md) | Provides a controller service that configures a connection to MongoDB and provides access to that connection to other Mongo-related components. |
|  | [MongoDBLookupService](mongodblookupservice.md) | Provides a lookup service based around MongoDB. |

## P

|  | Controller | Description |
| --- | --- | --- |
|  | [ParquetIcebergWriter](parqueticebergwriter.md) | Provides record serialization for Apache Iceberg using Apache Parquet formatting |
|  | [PEMEncodedSSLContextProvider](pemencodedsslcontextprovider.md) | SSLContext Provider configurable using PEM Private Key and Certificate files. |
|  | [PolarisIcebergCatalog](polarisicebergcatalog.md) | Provides Apache Iceberg integration with Apache Polaris Catalog access over REST HTTP |
|  | [PropertiesFileLookupService](propertiesfilelookupservice.md) | A reloadable properties file-based lookup service |
|  | [ProtobufReader](protobufreader.md) | Parses a Protocol Buffers message from binary format. |

## R

|  | Controller | Description |
| --- | --- | --- |
|  | [ReaderLookup](readerlookup.md) | Provides a RecordReaderFactory that can be used to dynamically select another RecordReaderFactory. |
|  | [RecordSetWriterLookup](recordsetwriterlookup.md) | Provides a RecordSetWriterFactory that can be used to dynamically select another RecordSetWriterFactory. |
|  | [RecordSinkServiceLookup](recordsinkservicelookup.md) | Provides a RecordSinkService that can be used to dynamically select another RecordSinkService. |
|  | [RedisConnectionPoolService](redisconnectionpoolservice.md) | A service that provides connections to Redis. |
|  | [RedisDistributedMapCacheClientService](redisdistributedmapcacheclientservice.md) | An implementation of DistributedMapCacheClient that uses Redis as the backing cache. |
|  | [RemoveFieldRecordReader](removefieldrecordreader.md) | A wrapper for a RecordReaderFactory that supports filtering out specified fields from NiFi Records. |
|  | [RestLookupService](restlookupservice.md) | Use a REST service to look up values. |

## S

|  | Controller | Description |
| --- | --- | --- |
|  | [S3FileResourceService](s3fileresourceservice.md) | Provides an Amazon Web Services (AWS) S3 file resource for other components. |
|  | [SalesforceDataCloudOAuthTokenProvider](salesforcedatacloudoauthtokenprovider.md) | Retrieves an OAuth2 access token from Salesforce using the configured OAuth2 Access Token Provider and exchanges the token for a Data Cloud API token. |
|  | [ScriptedLookupService](scriptedlookupservice.md) | Allows the user to provide a scripted LookupService instance in order to enrich records from an incoming flow file. |
|  | [ScriptedReader](scriptedreader.md) | Allows the user to provide a scripted RecordReaderFactory instance in order to read/parse/generate records from an incoming flow file. |
|  | [ScriptedRecordSetWriter](scriptedrecordsetwriter.md) | Allows the user to provide a scripted RecordSetWriterFactory instance in order to write records to an outgoing flow file. |
|  | [ScriptedRecordSink](scriptedrecordsink.md) | Allows the user to provide a scripted RecordSinkService instance in order to transmit records to the desired target. |
|  | [SetCacheClientService](setcacheclientservice.md) | Provides the ability to communicate with a SetCacheServer. |
|  | [SetCacheServer](setcacheserver.md) | Provides a set (collection of unique values) cache that can be accessed over a socket. |
|  | [SimpleCsvFileLookupService](simplecsvfilelookupservice.md) | A reloadable CSV file-based lookup service. |
|  | [SimpleDatabaseLookupService](simpledatabaselookupservice.md) | A relational-database-based lookup service. |
|  | [SimpleKeyValueLookupService](simplekeyvaluelookupservice.md) | Allows users to add key/value pairs as User-defined Properties. |
|  | [SimpleRedisDistributedMapCacheClientService](simpleredisdistributedmapcacheclientservice.md) | An implementation of DistributedMapCacheClient that uses Redis as the backing cache. |
|  | [SimpleScriptedLookupService](simplescriptedlookupservice.md) | Allows the user to provide a scripted LookupService instance in order to enrich records from an incoming flow file. |
|  | [SlackRecordSink](slackrecordsink.md) | Format and send Records to a configured Channel using the Slack Post Message API. |
|  | [SmbjClientProviderService](smbjclientproviderservice.md) | Provides access to SMB Sessions with shared authentication credentials. |
|  | [SnowflakeConnectionService](snowflakeconnectionservice.md) | Provides pooled database connections to Snowflake services |
|  | [SnowflakeDatabaseDialectService](snowflakedatabasedialectservice.md) | Database Dialect Service supporting Snowflake. |
|  | [SnowflakeSignJWTService](snowflakesignjwtservice.md) | Provides OAuth2 access token using a JWT signed with a secret stored in Snowflake. |
|  | [SnowflakeTableSchemaRegistry](snowflaketableschemaregistry.md) | Uses Snowflake tables as the source of schema — utilises Snowpipe Streaming REST API. |
|  | [StandardAnthropicLLMService](standardanthropicllmservice.md) | A Controller Service that provides integration with Anthropic’s Claude AI models through their Messages API. |
|  | [StandardAtlassianRequestRateManager](standardatlassianrequestratemanager.md) | Provides rate limiting coordination for Atlassian API calls across processors to prevent cascading rate limit issues. |
|  | [StandardAzureCredentialsControllerService](standardazurecredentialscontrollerservice.md) | Provide credentials to use with an Azure client. |
|  | [StandardConfluenceClientService](standardconfluenceclientservice.md) | Provides connection service to Confluence APIs |
|  | [StandardDatabricksWorkspaceClientService](standarddatabricksworkspaceclientservice.md) | Databricks client. |
|  | [StandardDropboxCredentialService](standarddropboxcredentialservice.md) | Defines credentials for Dropbox processors. |
|  | [StandardFileResourceService](standardfileresourceservice.md) | Provides a file resource for other components. |
|  | [StandardHashiCorpVaultClientService](standardhashicorpvaultclientservice.md) | A controller service for interacting with HashiCorp Vault. |
|  | [StandardHttpContextMap](standardhttpcontextmap.md) | Provides the ability to store and retrieve HTTP requests and responses external to a Processor, so that multiple Processors can interact with the same HTTP request. |
|  | [StandardHubSpotClientService](standardhubspotclientservice.md) | HubSpot Controller Service to integrate with HubSpot HTTP api. |
|  | [StandardJsonSchemaRegistry](standardjsonschemaregistry.md) | Provides a service for registering and accessing JSON schemas. |
|  | [StandardKustoIngestService](standardkustoingestservice.md) | Sends batches of flowfile content or stream flowfile content to an Azure ADX cluster. |
|  | [StandardKustoQueryService](standardkustoqueryservice.md) | Standard implementation of Kusto Query Service for Azure Data Explorer |
|  | [StandardMilvusConnectionService](standardmilvusconnectionservice.md) | Provides connection service to a Milvus instance |
|  | [StandardOauth2AccessTokenProvider](standardoauth2accesstokenprovider.md) | Provides OAuth 2. |
|  | [StandardOCRService](standardocrservice.md) | Provides integration to Openflow OCR Service |
|  | [StandardOpenAILLMService](standardopenaillmservice.md) | A Controller Service that provides integration with OpenAI’s Chat Completion API. |
|  | [StandardPGPPrivateKeyService](standardpgpprivatekeyservice.md) | PGP Private Key Service provides Private Keys loaded from files or properties |
|  | [StandardPGPPublicKeyService](standardpgppublickeyservice.md) | PGP Public Key Service providing Public Keys loaded from files |
|  | [StandardPrivateKeyService](standardprivatekeyservice.md) | Private Key Service provides access to a Private Key loaded from configured sources |
|  | [StandardProtobufReader](standardprotobufreader.md) | Parses Protocol Buffers messages from binary format into NiFi Records. |
|  | [StandardProxyConfigurationService](standardproxyconfigurationservice.md) | Provides a set of configurations for different NiFi components to use a proxy server. |
|  | [StandardRestrictedSSLContextService](standardrestrictedsslcontextservice.md) | Restricted implementation of the SSLContextService. |
|  | [StandardS3EncryptionService](standards3encryptionservice.md) | Adds configurable encryption to S3 Put and S3 Fetch operations. |
|  | [StandardSalesforceBulkJobsStateService](standardsalesforcebulkjobsstateservice.md) | Stores Salesforce Bulk Jobs state per object type at cluster scope |
|  | [StandardSalesforceClientService](standardsalesforceclientservice.md) | Provides connection service to Salesforce APIs |
|  | [StandardSalesforceDataCloudClientService](standardsalesforcedatacloudclientservice.md) | Provides connection service to Salesforce Data Cloud APIs |
|  | [StandardSlackRateLimiterService](standardslackratelimiterservice.md) | Provides rate limiting coordination for Slack API calls across processors to prevent cascading rate limit issues |
|  | [StandardSSLContextService](standardsslcontextservice.md) | Standard implementation of the SSLContextService. |
|  | [StandardTableStateService](standardtablestateservice.md) | A controller Service that provides and manages table state. |
|  | [StandardVectaraClientService](standardvectaraclientservice.md) | Vectara Controller Service to integrate with Vectara HTTP Api. |
|  | [StandardWebClientServiceProvider](standardwebclientserviceprovider.md) | Web Client Service Provider with support for configuring standard HTTP connection properties |
|  | [StateManagedCdcSchemaRegistry](statemanagedcdcschemaregistry.md) | Uses the in-built NiFi State Management to store the hashes of table schemas. |
|  | [Syslog5424Reader](syslog5424reader.md) | Provides a mechanism for reading RFC 5424 compliant Syslog data, such as log files, and structuring the data so that it can be processed. |
|  | [SyslogReader](syslogreader.md) | Attempts to parses the contents of a Syslog message in accordance to RFC5424 and RFC3164. |

## U

|  | Controller | Description |
| --- | --- | --- |
|  | [UDPEventRecordSink](udpeventrecordsink.md) | Format and send Records as UDP Datagram Packets to a configurable destination |

## V

|  | Controller | Description |
| --- | --- | --- |
|  | [VolatileSchemaCache](volatileschemacache.md) | Provides a Schema Cache that evicts elements based on a Least-Recently-Used algorithm. |

## W

|  | Controller | Description |
| --- | --- | --- |
|  | [WindowsEventLogReader](windowseventlogreader.md) | Reads Windows Event Log data as XML content having been generated by ConsumeWindowsEventLog, ParseEvtx, etc. |

## X

|  | Controller | Description |
| --- | --- | --- |
|  | [XMLFileLookupService](xmlfilelookupservice.md) | A reloadable XML file-based lookup service. |
|  | [XMLReader](xmlreader.md) | Reads XML content and creates Record objects. |
|  | [XMLRecordSetWriter](xmlrecordsetwriter.md) | Writes a RecordSet to XML. |

## Y

|  | Controller | Description |
| --- | --- | --- |
|  | [YamlTreeReader](yamltreereader.md) | Parses YAML into individual Record objects. |

---
title: All processors (alphabetical)
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/index.md
section: Loading & Unloading Data
---

# All processors (alphabetical)

This topic provides a list of all Snowflake openflow processors in alphabetical order.
The list includes:

> * The name of each processor
> * A summary of each processor

## A

|  | Processor | Description |
| --- | --- | --- |
|  | [AbortQueryJob](abortqueryjob.md) | Aborts a Query Job in Salesforce using the Bulk API 2. |
|  | [AttributesToCSV](attributestocsv.md) | Generates a CSV representation of the input FlowFile Attributes. |
|  | [AttributesToJSON](attributestojson.md) | Generates a JSON representation of the input FlowFile Attributes. |

## C

|  | Processor | Description |
| --- | --- | --- |
|  | [CalculateRecordStats](calculaterecordstats.md) | Counts the number of Records in a record set, optionally counting the number of elements per category, where the categories are defined by user-defined properties. |
|  | [CaptureChangeMySQL](capturechangemysql.md) | Reads CDC events from a MySQL database. |
|  | [CaptureChangePostgreSQL](capturechangepostgresql.md) | Reads CDC events from a PostgreSQL database. |
|  | [CaptureChangeSqlServer](capturechangesqlserver.md) | Reads CDC events from a SQL Server database. |
|  | [CaptureGoogleDriveChanges](capturegoogledrivechanges.md) | Captures changes to a Shared Google Drive and emits a FlowFile for each change that occurs. |
|  | [CaptureMicrosoft365GroupsChanges](capturemicrosoft365groupschanges.md) | Captures Microsoft365 groups changes and emits a FlowFile for each change that occurs. |
|  | [CaptureSharepointChanges](capturesharepointchanges.md) | Captures changes from a Sharepoint Document Library and emits a FlowFile for each change that occurs. |
|  | [CheckMetaAdsReportReadiness](checkmetaadsreportreadiness.md) | Processor checking if the Meta Ads report is ready for download. |
|  | [ChunkRecordText](chunkrecordtext.md) | Chunks text with options for recursively splitting by delimiters and max character length. |
|  | [ChunkText](chunktext.md) | Chunks text with options for recursively splitting by delimiters and max character length. |
|  | [CompressContent](compresscontent.md) | Compresses or decompresses the contents of FlowFiles using a user-specified compression algorithm and updates the mime. |
|  | [ConnectWebSocket](connectwebsocket.md) | Acts as a WebSocket client endpoint to interact with a remote WebSocket server. |
|  | [ConsumeAMQP](consumeamqp.md) | Consumes AMQP Messages from an AMQP Broker using the AMQP 0. |
|  | [ConsumeAzureEventHub](consumeazureeventhub.md) | Receives messages from Microsoft Azure Event Hubs with checkpointing to ensure consistent event processing. |
|  | [ConsumeBoxEnterpriseEvents](consumeboxenterpriseevents.md) | Consumes Enterprise Events from Box admin_logs_streaming Stream Type. |
|  | [ConsumeBoxEvents](consumeboxevents.md) | Consumes all events from Box. |
|  | [ConsumeElasticsearch](consumeelasticsearch.md) | A processor that repeatedly runs a paginated query against a field using a Range query to consume new Documents from an Elasticsearch index/query. |
|  | [ConsumeGCPubSub](consumegcpubsub.md) | Consumes messages from the configured Google Cloud PubSub subscription. |
|  | [ConsumeIMAP](consumeimap.md) | Consumes messages from Email Server using IMAP protocol. |
|  | [ConsumeJMS](consumejms.md) | Consumes JMS Message of type BytesMessage, TextMessage, ObjectMessage, MapMessage or StreamMessage transforming its content to a FlowFile and transitioning it to ‘success’ relationship. |
|  | [ConsumeKafka](consumekafka.md) | Consumes messages from Apache Kafka Consumer API. |
|  | [ConsumeKafka](consumekafka.md) | Consumes messages from Apache Kafka Consumer API. |
|  | [ConsumeKinesisStream](consumekinesisstream.md) | Reads data from the specified AWS Kinesis stream and outputs a FlowFile for every processed Record (raw) or a FlowFile for a batch of processed records if a Record Reader and Record Writer are configured. |
|  | [ConsumeMQTT](consumemqtt.md) | Subscribes to a topic and receives messages from an MQTT broker |
|  | [ConsumePOP3](consumepop3.md) | Consumes messages from Email Server using POP3 protocol. |
|  | [ConsumeSlack](consumeslack.md) | Retrieves messages from one or more configured Slack channels. |
|  | [ConsumeSlackConversation](consumeslackconversation.md) | Retrieves messages from Slack conversations available to the App. |
|  | [ConsumeSlackHistory](consumeslackhistory.md) | Fetches historical messages from all Slack channels available to the App. |
|  | [ConsumeSnowflakeStream](consumesnowflakestream.md) | Fetches data from a Snowflake stream and writes it to a FlowFile. |
|  | [ConsumeTwitter](consumetwitter.md) | Streams tweets from Twitter’s streaming API v2. |
|  | [ControlRate](controlrate.md) | Controls the rate at which data is transferred to follow-on processors. |
|  | [ConvertCharacterSet](convertcharacterset.md) | Converts a FlowFile’s content from one character set to another |
|  | [ConvertRecord](convertrecord.md) | Converts records from one data format to another using configured Record Reader and Record Write Controller Services. |
|  | [ConvertToJournalSchema](converttojournalschema.md) | Converts the incoming database schema into the appropriate schema for a Snowflake CDC Journal table. |
|  | [CopyAzureBlobStorage_v12](copyazureblobstorage_v12.md) | Copies a blob in Azure Blob Storage from one account/container to another. |
|  | [CopyS3Object](copys3object.md) | Copies a file from one bucket and key to another in AWS S3 |
|  | [CountText](counttext.md) | Counts various metrics on incoming text. |
|  | [CreateAmazonAdsReport](createamazonadsreport.md) | Processor which creates report configuration for Amazon Ads connector. |
|  | [CreateAzureOpenAiEmbeddings](createazureopenaiembeddings.md) | Uses Azure OpenAI to create embeddings for text. |
|  | [CreateBoxFileMetadataInstance](createboxfilemetadatainstance.md) | Creates a metadata instance for a Box file using a specified template with values from the flowFile content. |
|  | [CreateBoxMetadataTemplate](createboxmetadatatemplate.md) | Creates a Box metadata template using field specifications from the flowFile content. |
|  | [CreateCohereEmbeddings](createcohereembeddings.md) | Uses Cohere to create embeddings for text. |
|  | [CreateMetaAdsReport](createmetaadsreport.md) | Processor which creates report configuration for Meta Ads connector. |
|  | [CreateOpenAiEmbeddings](createopenaiembeddings.md) | Uses OpenAI to create embeddings for text. |
|  | [CreateSnowflakeEmbeddings](createsnowflakeembeddings.md) | Create vector embeddings using Snowflake Cortex Large Language Model functions |
|  | [CreateVertexAIEmbeddings](createvertexaiembeddings.md) | Uses VertexAI to create embeddings for text. |
|  | [CryptographicHashContent](cryptographichashcontent.md) | Calculates a cryptographic hash value for the flowfile content using the given algorithm and writes it to an output attribute. |

## D

|  | Processor | Description |
| --- | --- | --- |
|  | [DebugFlow](debugflow.md) | The DebugFlow processor aids testing and debugging the FlowFile framework by allowing various responses to be explicitly triggered in response to the receipt of a FlowFile or a timer event without a FlowFile if using timer or cron based scheduling. |
|  | [DecryptContentAge](decryptcontentage.md) | Decrypt content using the age-encryption. |
|  | [DecryptContentPGP](decryptcontentpgp.md) | Decrypt contents of OpenPGP messages. |
|  | [DeduplicateRecord](deduplicaterecord.md) | This processor de-duplicates individual records within a record set. |
|  | [DeleteAzureBlobStorage_v12](deleteazureblobstorage_v12.md) | Deletes the specified blob from Azure Blob Storage. |
|  | [DeleteAzureDataLakeStorage](deleteazuredatalakestorage.md) | Deletes the provided file from Azure Data Lake Storage |
|  | [DeleteBoxFileMetadataInstance](deleteboxfilemetadatainstance.md) | Deletes a metadata instance from a Box file using the specified template key |
|  | [DeleteByQueryElasticsearch](deletebyqueryelasticsearch.md) | Delete from an Elasticsearch index using a query. |
|  | [DeleteDBFSResource](deletedbfsresource.md) | Delete a DBFS files and directories. |
|  | [DeleteDynamoDB](deletedynamodb.md) | Deletes a document from DynamoDB based on hash and range key. |
|  | [DeleteFile](deletefile.md) | Deletes a file from the filesystem. |
|  | [DeleteGCSObject](deletegcsobject.md) | Deletes objects from a Google Cloud Bucket. |
|  | [DeleteGridFS](deletegridfs.md) | Deletes a file from GridFS using a file name or a query. |
|  | [DeleteMilvus](deletemilvus.md) | Deletes vectors from Milvus database from a collection by ID. |
|  | [DeleteMongo](deletemongo.md) | Executes a delete query against a MongoDB collection. |
|  | [DeletePinecone](deletepinecone.md) | Deletes vectors from a Pinecone index. |
|  | [DeleteQueryJob](deletequeryjob.md) | Deletes a Query Job in Salesforce using the Bulk API 2. |
|  | [DeleteS3Object](deletes3object.md) | Deletes a file from an Amazon S3 Bucket. |
|  | [DeleteSFTP](deletesftp.md) | Deletes a file residing on an SFTP server. |
|  | [DeleteSQS](deletesqs.md) | Deletes a message from an Amazon Simple Queuing Service Queue |
|  | [DeleteUnityCatalogResource](deleteunitycatalogresource.md) | Delete a Unity Catalog file or directory. |
|  | [DescribeDataShare](describedatashare.md) | Describe the specified data share metadata in Salesforce Data Cloud. |
|  | [DescribeSFDCObject](describesfdcobject.md) | Describe the specified object metadata in Salesforce. |
|  | [DetectDuplicate](detectduplicate.md) | Caches a value, computed from FlowFile attributes, for each incoming FlowFile and determines if the cached value has already been seen. |
|  | [DistributeLoad](distributeload.md) | Distributes FlowFiles to downstream processors based on a Distribution Strategy. |
|  | [DuplicateFlowFile](duplicateflowfile.md) | Intended for load testing, this processor will create the configured number of copies of each incoming FlowFile. |

## E

|  | Processor | Description |
| --- | --- | --- |
|  | [EncodeContent](encodecontent.md) | Encode or decode the contents of a FlowFile using Base64, Base32, or hex encoding schemes |
|  | [EncryptContentAge](encryptcontentage.md) | Encrypt content using the age-encryption. |
|  | [EncryptContentPGP](encryptcontentpgp.md) | Encrypt contents using OpenPGP. |
|  | [EnforceOrder](enforceorder.md) | Enforces expected ordering of FlowFiles that belong to the same data group within a single node. |
|  | [EnrichAttributes](enrichattributes.md) | Looks up a value using the configured Lookup Service and adds the results to the FlowFile as one or more attributes. |
|  | [EnrichCdcStream](enrichcdcstream.md) | Enriches incoming FlowFiles that come from CaptureChangePostgreSQL, etc. |
|  | [EvaluateJsonPath](evaluatejsonpath.md) | Evaluates one or more JsonPath expressions against the content of a FlowFile. |
|  | [EvaluateRagAnswerCorrectness](evaluateraganswercorrectness.md) | Evaluates the correctness of generated answers in a Retrieval-Augmented Generation (RAG) context by computing metrics such as F1 score, cosine similarity, and answer correctness. |
|  | [EvaluateRagFaithfulness](evaluateragfaithfulness.md) | Evaluates the faithfulness of generated answers in a Retrieval-Augmented Generation (RAG) system by analyzing responses using an LLM (e. |
|  | [EvaluateRagRetrieval](evaluateragretrieval.md) | Calculates retrieval metrics (Precision@N, Recall@N, FScore@N, MAP@N, MRR) for a RAG system using an LLM as a judge. |
|  | [EvaluateXPath](evaluatexpath.md) | Evaluates one or more XPaths against the content of a FlowFile. |
|  | [EvaluateXQuery](evaluatexquery.md) | Evaluates one or more XQueries against the content of a FlowFile. |
|  | [ExecuteGroovyScript](executegroovyscript.md) | Experimental Extended Groovy script processor. |
|  | [ExecuteProcess](executeprocess.md) | Runs an operating system command specified by the user and writes the output of that command to a FlowFile. |
|  | [ExecuteScript](executescript.md) | Experimental - Executes a script given the flow file and a process session. |
|  | [ExecuteSQL](executesql.md) | Executes provided SQL select query. |
|  | [ExecuteSQLRecord](executesqlrecord.md) | Executes provided SQL select query. |
|  | [ExecuteSQLStatement](executesqlstatement.md) | Executes a SQL DDL or DML Statement against a database. |
|  | [ExecuteStreamCommand](executestreamcommand.md) | The ExecuteStreamCommand processor provides a flexible way to integrate external commands and scripts into NiFi data flows. |
|  | [ExtractAvroMetadata](extractavrometadata.md) | Extracts metadata from the header of an Avro datafile. |
|  | [ExtractEmailAttachments](extractemailattachments.md) | Extract attachments from a mime formatted email file, splitting them into individual flowfiles. |
|  | [ExtractEmailHeaders](extractemailheaders.md) | Using the flowfile content as source of data, extract header from an RFC compliant email file adding the relevant attributes to the flowfile. |
|  | [ExtractGrok](extractgrok.md) | Evaluates one or more Grok Expressions against the content of a FlowFile, adding the results as attributes or replacing the content of the FlowFile with a JSON notation of the matched content |
|  | [ExtractRecordSchema](extractrecordschema.md) | Extracts the record schema from the FlowFile using the supplied Record Reader and writes it to the ‘avro. |
|  | [ExtractSchemaColumns](extractschemacolumns.md) | Extracts the record schema columns from the FlowFile using the supplied Record Reader and writes it to the ‘schema. |
|  | [ExtractStructuredBoxFileMetadata](extractstructuredboxfilemetadata.md) | Extracts metadata from a Box file using Box AI. |
|  | [ExtractText](extracttext.md) | Evaluates one or more Regular Expressions against the content of a FlowFile. |

## F

|  | Processor | Description |
| --- | --- | --- |
|  | [FetchAzureBlobStorage_v12](fetchazureblobstorage_v12.md) | Retrieves the specified blob from Azure Blob Storage and writes its content to the content of the FlowFile. |
|  | [FetchAzureDataLakeStorage](fetchazuredatalakestorage.md) | Fetch the specified file from Azure Data Lake Storage |
|  | [FetchBoxFile](fetchboxfile.md) | Fetches files from a Box Folder. |
|  | [FetchBoxFileInfo](fetchboxfileinfo.md) | Fetches metadata for files from Box and adds it to the FlowFile’s attributes. |
|  | [FetchBoxFileMetadataInstance](fetchboxfilemetadatainstance.md) | Retrieves specific metadata instance associated with a Box file using template key and scope. |
|  | [FetchBoxFileRepresentation](fetchboxfilerepresentation.md) | Fetches a Box file representation using a representation hint and writes it to the FlowFile content. |
|  | [FetchDistributedMapCache](fetchdistributedmapcache.md) | Computes cache key(s) from FlowFile attributes, for each incoming FlowFile, and fetches the value(s) from the Distributed Map Cache associated with each key. |
|  | [FetchDropbox](fetchdropbox.md) | Fetches files from Dropbox. |
|  | [FetchFile](fetchfile.md) | Reads the contents of a file from disk and streams it into the contents of an incoming FlowFile. |
|  | [FetchFTP](fetchftp.md) | Fetches the content of a file from a remote FTP server and overwrites the contents of an incoming FlowFile with the content of the remote file. |
|  | [FetchGCSObject](fetchgcsobject.md) | Fetches a file from a Google Cloud Bucket. |
|  | [FetchGoogleDrive](fetchgoogledrive.md) | Fetches files from a Google Drive Folder. |
|  | [FetchGoogleDriveFileComments](fetchgoogledrivefilecomments.md) | Fetches comments and their replies for a Google Drive file. |
|  | [FetchGoogleDriveMetadata](fetchgoogledrivemetadata.md) | Fetches Google Drive file metadata. |
|  | [FetchGridFS](fetchgridfs.md) | Retrieves one or more files from a GridFS bucket by file name or by a user-defined query. |
|  | [FetchJiraFields](fetchjirafields.md) | Retrieves comprehensive metadata for all fields available in the Jira Cloud instance using the REST API v3 /field endpoint. |
|  | [FetchJiraIssues](fetchjiraissues.md) | Fetches issues from Jira Cloud using REST API v3 with configurable search options. |
|  | [FetchMicrosoftDataverseTable](fetchmicrosoftdataversetable.md) | Fetch records from Microsoft Dataverse Tables |
|  | [FetchS3Object](fetchs3object.md) | Retrieves the contents of an S3 Object and writes it to the content of a FlowFile |
|  | [FetchSFTP](fetchsftp.md) | Fetches the content of a file from a remote SFTP server and overwrites the contents of an incoming FlowFile with the content of the remote file. |
|  | [FetchSharepointFile](fetchsharepointfile.md) | Fetches the contents of a file from a Sharepoint Drive, optionally downloading a PDF or HTML version of the file when applicable. |
|  | [FetchSharepointMetadata](fetchsharepointmetadata.md) | For each drive item retrieves its metadata and permissions and writes them as FlowFile attributes. |
|  | [FetchSlackConversationInfo](fetchslackconversationinfo.md) | Fetches Slack conversation info and member emails |
|  | [FetchSlackFile](fetchslackfile.md) | Downloads a file shared on Slack. |
|  | [FetchSlackMessage](fetchslackmessage.md) | Fetches data about a single Slack message |
|  | [FetchSmb](fetchsmb.md) | Fetches files from a SMB Share. |
|  | [FetchSnowflakeTableProperties](fetchsnowflaketableproperties.md) | Reads properties from a table and stores them as flow file attributes. |
|  | [FetchSourceTableSchema](fetchsourcetableschema.md) | Fetches the table schema (i. |
|  | [FetchTableSnapshot](fetchtablesnapshot.md) | Fetches a snapshot of a table from a database. |
|  | [FilterAttribute](filterattribute.md) | Filters the attributes of a FlowFile by retaining specified attributes and removing the rest or by removing specified attributes and retaining the rest. |
|  | [FindConfluencePages](findconfluencepages.md) | Processor for finding Confluence pages using space name and page name. |
|  | [FindSharepointDriveItem](findsharepointdriveitem.md) | Finds a Sharepoint Drive Item by its Drive ID and Item path. |
|  | [FlattenJson](flattenjson.md) | Provides the user with the ability to take a nested JSON document and flatten it into a simple key/value pair document. |
|  | [ForkEnrichment](forkenrichment.md) | Used in conjunction with the JoinEnrichment processor, this processor is responsible for adding the attributes that are necessary for the JoinEnrichment processor to perform its function. |
|  | [ForkRecord](forkrecord.md) | This processor allows the user to fork a record into multiple records. |

## G

|  | Processor | Description |
| --- | --- | --- |
|  | [GenerateAnswersFromContext](generateanswersfromcontext.md) | Generates synthetic answers for each question present in the incoming records using a Large Language Model (LLM). |
|  | [GenerateAnswersFromGroundTruth](generateanswersfromgroundtruth.md) | Generates synthetic answers for each question in the incoming records using an LLM. |
|  | [GenerateFlowFile](generateflowfile.md) | This processor creates FlowFiles with random data or custom content. |
|  | [GenerateJSON](generatejson.md) | Produces a batch of JSON Objects with random field values based on a configurable JSON Schema. |
|  | [GenerateRecord](generaterecord.md) | This processor creates FlowFiles with records having random value for the specified fields. |
|  | [GenerateTableFetch](generatetablefetch.md) | Generates SQL select queries that fetch “pages” of rows from a table. |
|  | [GeoEnrichIP](geoenrichip.md) | Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes. |
|  | [GeoEnrichIPRecord](geoenrichiprecord.md) | Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes. |
|  | [GetAmazonAdsReport](getamazonadsreport.md) | Processor downloading report from Amazon Ads if ready. |
|  | [GetAwsPollyJobStatus](getawspollyjobstatus.md) | Retrieves the current status of an AWS Polly job. |
|  | [GetAwsTextractJobStatus](getawstextractjobstatus.md) | Retrieves the current status of an AWS Textract job. |
|  | [GetAwsTranscribeJobStatus](getawstranscribejobstatus.md) | Retrieves the current status of an AWS Transcribe job. |
|  | [GetAwsTranslateJobStatus](getawstranslatejobstatus.md) | Retrieves the current status of an AWS Translate job. |
|  | [GetAzureEventHub](getazureeventhub.md) | Receives messages from Microsoft Azure Event Hubs without reliable checkpoint tracking. |
|  | [GetAzureQueueStorage_v12](getazurequeuestorage_v12.md) | Retrieves the messages from an Azure Queue Storage. |
|  | [GetBoxFileCollaborators](getboxfilecollaborators.md) | Retrieves all collaborators on a Box file and adds the collaboration information to the FlowFile’s attributes. |
|  | [GetBoxGroupMembers](getboxgroupmembers.md) | Retrieves members for a Box Group and writes their details in FlowFile attributes. |
|  | [GetConfluenceAuditRecords](getconfluenceauditrecords.md) | Processor listing Confluence audit records. |
|  | [GetConfluenceGroupUsers](getconfluencegroupusers.md) | Processor that downloads information about users belonging to a given Confluence group |
|  | [GetConfluencePageContent](getconfluencepagecontent.md) | Processor downloading Confluence pages. |
|  | [GetConfluencePageIds](getconfluencepageids.md) | Downloads changed Confluence pages since the last sync and emits each as a FlowFile with metadata. |
|  | [GetConfluencePagePermissions](getconfluencepagepermissions.md) | Processor downloading Confluence page permissions. |
|  | [GetConfluenceSpaceIds](getconfluencespaceids.md) | Processor for retrieving Confluence space ids. |
|  | [GetConfluenceSpacePermissions](getconfluencespacepermissions.md) | Processor downloading Confluence space permissions. |
|  | [GetDataShareCredentials](getdatasharecredentials.md) | Describe the specified data share metadata in Salesforce Data Cloud. |
|  | [GetDataShareTables](getdatasharetables.md) | Describe the specified data share metadata in Salesforce Data Cloud. |
|  | [GetDBFSFile](getdbfsfile.md) | Read a DBFS file. |
|  | [GetDynamoDB](getdynamodb.md) | Retrieves a document from DynamoDB based on hash and range key. |
|  | [GetElasticsearch](getelasticsearch.md) | Elasticsearch get processor that uses the official Elastic REST client libraries to fetch a single document from Elasticsearch by _id. |
|  | [GetFile](getfile.md) | Creates FlowFiles from files in a directory. |
|  | [GetFileResource](getfileresource.md) | This processor creates FlowFiles with the content of the configured File Resource. |
|  | [GetFTP](getftp.md) | Fetches files from an FTP Server and creates FlowFiles from them |
|  | [GetGcpVisionAnnotateFilesOperationStatus](getgcpvisionannotatefilesoperationstatus.md) | Retrieves the current status of an Google Vision operation. |
|  | [GetGcpVisionAnnotateImagesOperationStatus](getgcpvisionannotateimagesoperationstatus.md) | Retrieves the current status of an Google Vision operation. |
|  | [GetGoogleAdsReport](getgoogleadsreport.md) | A processor which can interact with Google Ads Reporting API. |
|  | [GetGoogleGroupMembers](getgooglegroupmembers.md) | Retrieves the members of one or more Google Groups, specified as a comma-separated list of group IDs that is given as a FlowFile attribute. |
|  | [GetGoogleSheets](getgooglesheets.md) | Processor responsible for fetching data from Google Sheets. |
|  | [GetHubSpot](gethubspot.md) | Retrieves JSON data from a private HubSpot application. |
|  | [GetHubSpotObject](gethubspotobject.md) | Get a HubSpot object and its associations by ID or unique value. |
|  | [GetHubSpotSchema](gethubspotschema.md) | Retrieves schema information for HubSpot object types including field names, types, and labels. |
|  | [GetLinkedInAdsReport](getlinkedinadsreport.md) | Processor downloading metrics from the LinkedIn Reporting APIs. |
|  | [GetMicrosoft365GroupMembers](getmicrosoft365groupmembers.md) | Retrieves Microsoft365 group members and emits a FlowFile for each change that occurs. |
|  | [GetMongo](getmongo.md) | Creates FlowFiles from documents in MongoDB loaded by a user-specified query. |
|  | [GetMongoRecord](getmongorecord.md) | A record-based version of GetMongo that uses the Record writers to write the MongoDB result set. |
|  | [GetQueryJobResult](getqueryjobresult.md) | Gets the results of a Query Job in Salesforce using the Bulk API 2. |
|  | [GetQueryJobStatus](getqueryjobstatus.md) | Gets the status of a Query Job in Salesforce using the Bulk API 2. |
|  | [GetS3ObjectMetadata](gets3objectmetadata.md) | Check for the existence of an Object in S3 and fetch its Metadata without attempting to download it. |
|  | [GetS3ObjectTags](gets3objecttags.md) | Check for the existence of an Object in S3 and fetch its Tags without attempting to download it. |
|  | [GetSFTP](getsftp.md) | Fetches files from an SFTP Server and creates FlowFiles from them |
|  | [GetSharepointSiteGroupMembers](getsharepointsitegroupmembers.md) | Retrieves all members of a SharePoint site group. |
|  | [GetShopify](getshopify.md) | Retrieves objects from a custom Shopify store. |
|  | [GetSmbFile](getsmbfile.md) | Reads file from a samba network location to FlowFiles. |
|  | [GetSplunk](getsplunk.md) | Retrieves data from Splunk Enterprise. |
|  | [GetSQS](getsqs.md) | Fetches messages from an Amazon Simple Queuing Service Queue |
|  | [GetUnityCatalogFile](getunitycatalogfile.md) | Read a Unity Catalog file up to 5 GiB. |
|  | [GetUnityCatalogFileMetadata](getunitycatalogfilemetadata.md) | Checks for Unity Catalog file metadata. |
|  | [GetWorkdayReport](getworkdayreport.md) | A processor which can interact with a configurable Workday Report. |
|  | [GetZendesk](getzendesk.md) | Incrementally fetches data from Zendesk API. |

## H

|  | Processor | Description |
| --- | --- | --- |
|  | [HandleHttpRequest](handlehttprequest.md) | Starts an HTTP Server and listens for HTTP Requests. |
|  | [HandleHttpResponse](handlehttpresponse.md) | Sends an HTTP Response to the Requestor that generated a FlowFile. |

## I

|  | Processor | Description |
| --- | --- | --- |
|  | [IdentifyMimeType](identifymimetype.md) | Attempts to identify the MIME Type used for a FlowFile. |
|  | [InvokeHTTP](invokehttp.md) | An HTTP client processor which can interact with a configurable HTTP Endpoint. |
|  | [InvokeScriptedProcessor](invokescriptedprocessor.md) | Experimental - Invokes a script engine for a Processor defined in the given script. |
|  | [ISPEnrichIP](ispenrichip.md) | Looks up ISP information for an IP address and adds the information to FlowFile attributes. |

## J

|  | Processor | Description |
| --- | --- | --- |
|  | [JoinEnrichment](joinenrichment.md) | Joins together Records from two different FlowFiles where one FlowFile, the ‘original’ contains arbitrary records and the second FlowFile, the ‘enrichment’ contains additional data that should be used to enrich the first. |
|  | [JoltTransformJSON](jolttransformjson.md) | Applies a list of Jolt specifications to either the FlowFile JSON content or a specified FlowFile JSON attribute. |
|  | [JoltTransformRecord](jolttransformrecord.md) | Applies a JOLT specification to each record in the FlowFile payload. |
|  | [JSLTTransformJSON](jslttransformjson.md) | Applies a JSLT transformation to the FlowFile JSON payload. |
|  | [JsonQueryElasticsearch](jsonqueryelasticsearch.md) | A processor that allows the user to run a query (with aggregations) written with the Elasticsearch JSON DSL. |

## L

|  | Processor | Description |
| --- | --- | --- |
|  | [ListArchivedHubSpotData](listarchivedhubspotdata.md) | Lists archived data from HubSpot for the chosen object type and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes. |
|  | [ListAzureBlobStorage_v12](listazureblobstorage_v12.md) | Lists blobs in an Azure Blob Storage container. |
|  | [ListAzureDataLakeStorage](listazuredatalakestorage.md) | Lists directory in an Azure Data Lake Storage Gen 2 filesystem |
|  | [ListBoxFile](listboxfile.md) | Lists files in a Box folder. |
|  | [ListBoxFileInfo](listboxfileinfo.md) | Fetches file metadata for each file in a Box Folder. |
|  | [ListBoxFileMetadataInstances](listboxfilemetadatainstances.md) | Retrieves all metadata instances associated with a Box file. |
|  | [ListBoxFileMetadataTemplates](listboxfilemetadatatemplates.md) | Retrieves all metadata templates associated with a Box file. |
|  | [ListConfluenceGroups](listconfluencegroups.md) | Processor listing Confluence groups. |
|  | [ListDatabaseTables](listdatabasetables.md) | Generates a set of flow files, each containing attributes corresponding to metadata about a table from a database connection. |
|  | [ListDBFSDirectory](listdbfsdirectory.md) | List file names in a DBFS directory and output a new FlowFile with the filename. |
|  | [ListDropbox](listdropbox.md) | Retrieves a listing of files from Dropbox (shortcuts are ignored). |
|  | [ListenFTP](listenftp.md) | Starts an FTP server that listens on the specified port and transforms incoming files into FlowFiles. |
|  | [ListenHTTP](listenhttp.md) | Starts an HTTP Server and listens on a given base path to transform incoming requests into FlowFiles. |
|  | [ListenOTLP](listenotlp.md) | Collect OpenTelemetry messages over HTTP or gRPC. |
|  | [ListenSlack](listenslack.md) | Retrieves real-time messages or Slack commands from one or more Slack conversations. |
|  | [ListenSyslog](listensyslog.md) | Listens for Syslog messages being sent to a given port over TCP or UDP. |
|  | [ListenTCP](listentcp.md) | Listens for incoming TCP connections and reads data from each connection using a line separator as the message demarcator. |
|  | [ListenUDP](listenudp.md) | Listens for Datagram Packets on a given port. |
|  | [ListenUDPRecord](listenudprecord.md) | Listens for Datagram Packets on a given port and reads the content of each datagram using the configured Record Reader. |
|  | [ListenWebSocket](listenwebsocket.md) | Acts as a WebSocket server endpoint to accept client connections. |
|  | [ListFile](listfile.md) | Retrieves a listing of files from the input directory. |
|  | [ListFTP](listftp.md) | Performs a listing of the files residing on an FTP server. |
|  | [ListGCSBucket](listgcsbucket.md) | Retrieves a listing of objects from a GCS bucket. |
|  | [ListGoogleDrive](listgoogledrive.md) | Performs a listing of concrete files (shortcuts are ignored) in a Google Drive folder. |
|  | [ListGoogleDriveFileInfo](listgoogledrivefileinfo.md) | Lists all files and folders in a specified Google Drive. |
|  | [ListGoogleGroups](listgooglegroups.md) | Lists all of the groups for a given domain in Google Workspace. |
|  | [ListHubSpotObjects](listhubspotobjects.md) | Fetches data from HubSpot for specified object types, and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes. |
|  | [ListMicrosoftDataverseTables](listmicrosoftdataversetables.md) | List Tables from Microsoft Dataverse environments |
|  | [ListS3](lists3.md) | Retrieves a listing of objects from an S3 bucket. |
|  | [ListSFDCDataShares](listsfdcdatashares.md) | List the available data shares in the organization that are available to the identified user. |
|  | [ListSFDCObjects](listsfdcobjects.md) | List the available objects in the organization that are available to the identified user. |
|  | [ListSFTP](listsftp.md) | Performs a listing of the files residing on an SFTP server. |
|  | [ListSharepointDrives](listsharepointdrives.md) | Emits a FlowFile for each Drive present in the specified Sharepoint Site. |
|  | [ListSharepointSiteGroups](listsharepointsitegroups.md) | Lists all SharePoint site groups available on a specified SharePoint site. |
|  | [ListSmb](listsmb.md) | Lists concrete files shared via SMB protocol. |
|  | [ListTableNames](listtablenames.md) | Fetches all source table names and matches them with one of the possible configurations: - regexp expression e. |
|  | [ListUnityCatalogDirectory](listunitycatalogdirectory.md) | List file names in a Unity Catalog directory and output a new FlowFile with the filename. |
|  | [LogAttribute](logattribute.md) | Emits attributes of the FlowFile at the specified log level |
|  | [LogMessage](logmessage.md) | Emits a log message at the specified log level |
|  | [LookupAttribute](lookupattribute.md) | Lookup attributes from a lookup service |
|  | [LookupRecord](lookuprecord.md) | Extracts one or more fields from a Record and looks up a value for those fields in a LookupService. |

## M

|  | Processor | Description |
| --- | --- | --- |
|  | [MergeContent](mergecontent.md) | Merges a Group of FlowFiles together based on a user-defined strategy and packages them into a single FlowFile. |
|  | [MergeRecord](mergerecord.md) | This Processor merges together multiple record-oriented FlowFiles into a single FlowFile that contains all of the Records of the input FlowFiles. |
|  | [MergeSnowflakeJournalTable](mergesnowflakejournaltable.md) | Triggers a merge operation on changes from journal table to a destination table in Snowflake. |
|  | [ModifyBytes](modifybytes.md) | Discard byte range at the start and end or all content of a binary file. |
|  | [ModifyCompression](modifycompression.md) | Changes the compression algorithm used to compress the contents of a FlowFile by decompressing the contents of FlowFiles using a user-specified compression algorithm and recompressing the contents using the specified compression format properties. |
|  | [MonitorActivity](monitoractivity.md) | Monitors the flow for activity and sends out an indicator when the flow has not had any data for some specified amount of time and again when the flow’s activity is restored |
|  | [MoveAzureDataLakeStorage](moveazuredatalakestorage.md) | Moves content within an Azure Data Lake Storage Gen 2. |

## N

|  | Processor | Description |
| --- | --- | --- |
|  | [Notify](notify.md) | Caches a release signal identifier in the distributed cache, optionally along with the FlowFile’s attributes. |

## O

|  | Processor | Description |
| --- | --- | --- |
|  | [OpenAiTranscribeAudio](openaitranscribeaudio.md) | Transcribes audio into English text. |

## P

|  | Processor | Description |
| --- | --- | --- |
|  | [PackageFlowFile](packageflowfile.md) | This processor will package FlowFile attributes and content into an output FlowFile that can be exported from NiFi and imported back into NiFi, preserving the original attributes and content. |
|  | [PaginatedJsonQueryElasticsearch](paginatedjsonqueryelasticsearch.md) | A processor that allows the user to run a paginated query (with aggregations) written with the Elasticsearch JSON DSL. |
|  | [ParseEvtx](parseevtx.md) | Parses the contents of a Windows Event Log file (evtx) and writes the resulting XML to the FlowFile |
|  | [ParseExcelCellReference](parseexcelcellreference.md) | Processor responsible for parsing Excel cell reference formula. |
|  | [ParseSyslog](parsesyslog.md) | Attempts to parses the contents of a Syslog message in accordance to RFC5424 and RFC3164 formats and adds attributes to the FlowFile for each of the parts of the Syslog message. |
|  | [ParseSyslog5424](parsesyslog5424.md) | Attempts to parse the contents of a well formed Syslog message in accordance to RFC5424 format and adds attributes to the FlowFile for each of the parts of the Syslog message, including Structured Data. |
|  | [PartitionRecord](partitionrecord.md) | Splits, or partitions, record-oriented data based on the configured fields in the data. |
|  | [PerformSnowflakeCortexOCR](performsnowflakecortexocr.md) | Performs Optical Character Recognition (OCR) on PDF documents using Snowflake Cortex ML functions. |
|  | [PickTablesForReplication](picktablesforreplication.md) | Accepts a list of fully qualified table names and determines if a table: - is new (is not replicated, but was added in the source) - is existing (is replicated and exists in the source) - is stale (is replicated but no longer exists in the source) Configuration is passed as a FlowFile attribute. |
|  | [PromptAnthropicAI](promptanthropicai.md) | Sends a prompt to Anthropic, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. |
|  | [PromptAzureOpenAI](promptazureopenai.md) | Sends a prompt to Azure’s OpenAI service, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. |
|  | [PromptLLM](promptllm.md) | This processor sends a user defined prompt to a Large Language Model (LLM) to respond. |
|  | [PromptOpenAI](promptopenai.md) | Sends a prompt to OpenAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. |
|  | [PromptSnowflakeCortex](promptsnowflakecortex.md) | Sends a prompt to Snowflake Cortex, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. |
|  | [PromptVertexAI](promptvertexai.md) | Sends a prompt to VertexAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. |
|  | [PublishAMQP](publishamqp.md) | Creates an AMQP Message from the contents of a FlowFile and sends the message to an AMQP Exchange. |
|  | [PublishGCPubSub](publishgcpubsub.md) | Publishes the content of the incoming flowfile to the configured Google Cloud PubSub topic. |
|  | [PublishJMS](publishjms.md) | Creates a JMS Message from the contents of a FlowFile and sends it to a JMS Destination (queue or topic) as JMS BytesMessage or TextMessage. |
|  | [PublishKafka](publishkafka.md) | Sends the contents of a FlowFile as either a message or as individual records to Apache Kafka using the Kafka Producer API. |
|  | [PublishKafka](publishkafka.md) | Sends the contents of a FlowFile as either a message or as individual records to Apache Kafka using the Kafka Producer API. |
|  | [PublishMQTT](publishmqtt.md) | Publishes a message to an MQTT topic |
|  | [PublishSlack](publishslack.md) | Posts a message to the specified Slack channel. |
|  | [PutAzureBlobStorage_v12](putazureblobstorage_v12.md) | Puts content into a blob on Azure Blob Storage. |
|  | [PutAzureCosmosDBRecord](putazurecosmosdbrecord.md) | This processor is a record-aware processor for inserting data into Cosmos DB with Core SQL API. |
|  | [PutAzureDataExplorer](putazuredataexplorer.md) | Acts as an Azure Data Explorer sink which sends FlowFiles to the provided endpoint. |
|  | [PutAzureDataLakeStorage](putazuredatalakestorage.md) | Writes the contents of a FlowFile as a file on Azure Data Lake Storage Gen 2 |
|  | [PutAzureEventHub](putazureeventhub.md) | Send FlowFile contents to Azure Event Hubs |
|  | [PutAzureQueueStorage_v12](putazurequeuestorage_v12.md) | Writes the content of the incoming FlowFiles to the configured Azure Queue Storage. |
|  | [PutBigQuery](putbigquery.md) | Writes the contents of a FlowFile to a Google BigQuery table. |
|  | [PutBoxFile](putboxfile.md) | Puts content to a Box folder. |
|  | [PutCloudWatchMetric](putcloudwatchmetric.md) | Publishes metrics to Amazon CloudWatch. |
|  | [PutDatabaseRecord](putdatabaserecord.md) | The PutDatabaseRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file. |
|  | [PutDatabricksSQL](putdatabrickssql.md) | Submit a SQL Execution using Databricks REST API then write the JSON response to FlowFile Content. |
|  | [PutDBFSFile](putdbfsfile.md) | Write FlowFile content to DBFS. |
|  | [PutDistributedMapCache](putdistributedmapcache.md) | Gets the content of a FlowFile and puts it to a distributed map cache, using a cache key computed from FlowFile attributes. |
|  | [PutDropbox](putdropbox.md) | Puts content to a Dropbox folder. |
|  | [PutDynamoDB](putdynamodb.md) | Puts a document from DynamoDB based on hash and range key. |
|  | [PutDynamoDBRecord](putdynamodbrecord.md) | Inserts items into DynamoDB based on record-oriented data. |
|  | [PutElasticsearchJson](putelasticsearchjson.md) | An Elasticsearch put processor that uses the official Elastic REST client libraries. |
|  | [PutElasticsearchRecord](putelasticsearchrecord.md) | A record-aware Elasticsearch put processor that uses the official Elastic REST client libraries. |
|  | [PutEmail](putemail.md) | Sends an e-mail to configured recipients for each incoming FlowFile |
|  | [PutFile](putfile.md) | Writes the contents of a FlowFile to the local file system |
|  | [PutFTP](putftp.md) | Sends FlowFiles to an FTP Server |
|  | [PutGCSObject](putgcsobject.md) | Writes the contents of a FlowFile as an object in a Google Cloud Storage. |
|  | [PutGoogleDrive](putgoogledrive.md) | Writes the contents of a FlowFile as a file in Google Drive. |
|  | [PutGridFS](putgridfs.md) | Writes a file to a GridFS bucket. |
|  | [PutHubSpot](puthubspot.md) | Upsert a HubSpot object. |
|  | [PutIcebergTable](puticebergtable.md) | Store records in Iceberg using configurable Catalog for managing namespaces and tables. |
|  | [PutKinesisFirehose](putkinesisfirehose.md) | Sends the contents to a specified Amazon Kinesis Firehose. |
|  | [PutKinesisStream](putkinesisstream.md) | Sends the contents to a specified Amazon Kinesis. |
|  | [PutLambda](putlambda.md) | Sends the contents to a specified Amazon Lambda Function. |
|  | [PutMongo](putmongo.md) | Writes the contents of a FlowFile to MongoDB |
|  | [PutMongoBulkOperations](putmongobulkoperations.md) | Writes the contents of a FlowFile to MongoDB as bulk-update |
|  | [PutMongoRecord](putmongorecord.md) | This processor is a record-aware processor for inserting/upserting data into MongoDB. |
|  | [PutRecord](putrecord.md) | The PutRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file, and sends them to a destination specified by a Record Destination Service (i. |
|  | [PutRedisHashRecord](putredishashrecord.md) | Puts record field data into Redis using a specified hash value, which is determined by a RecordPath to a field in each record containing the hash value. |
|  | [PutS3Object](puts3object.md) | Writes the contents of a FlowFile as an S3 Object to an Amazon S3 Bucket. |
|  | [PutSalesforceObject](putsalesforceobject.md) | Creates new records for the specified Salesforce sObject. |
|  | [PutSFTP](putsftp.md) | Sends FlowFiles to an SFTP Server |
|  | [PutSmbFile](putsmbfile.md) | Writes the contents of a FlowFile to a samba network location. |
|  | [PutSnowflakeInternalStageFile](putsnowflakeinternalstagefile.md) | Puts files into a Snowflake internal stage. |
|  | [PutSnowpipeStreaming](putsnowpipestreaming.md) | Streams records into a Snowflake table. |
|  | [PutSnowpipeStreaming2](putsnowpipestreaming2.md) | Send Records formatted as Newline Delimited JSON to Snowflake Database Pipes using Snowpipe Streaming Version 2. |
|  | [PutSNS](putsns.md) | Sends the content of a FlowFile as a notification to the Amazon Simple Notification Service |
|  | [PutSplunk](putsplunk.md) | Sends logs to Splunk Enterprise over TCP, TCP + TLS/SSL, or UDP. |
|  | [PutSplunkHTTP](putsplunkhttp.md) | Sends flow file content to the specified Splunk server over HTTP or HTTPS. |
|  | [PutSQL](putsql.md) | Executes a SQL UPDATE or INSERT command. |
|  | [PutSQS](putsqs.md) | Publishes a message to an Amazon Simple Queuing Service Queue |
|  | [PutSyslog](putsyslog.md) | Sends Syslog messages to a given host and port over TCP or UDP. |
|  | [PutTCP](puttcp.md) | Sends serialized FlowFiles or Records over TCP to a configurable destination with optional support for TLS |
|  | [PutUDP](putudp.md) | The PutUDP processor receives a FlowFile and packages the FlowFile content into a single UDP datagram packet which is then transmitted to the configured UDP server. |
|  | [PutUnityCatalogFile](putunitycatalogfile.md) | Write FlowFile content with max size of 5 GiB to Unity Catalog. |
|  | [PutVectaraDocument](putvectaradocument.md) | Generate and upload a JSON document to Vectara’s upload endpoint. |
|  | [PutVectaraFile](putvectarafile.md) | Upload a FlowFile content to Vectara’s index endpoint. |
|  | [PutWebSocket](putwebsocket.md) | Sends messages to a WebSocket remote endpoint using a WebSocket session that is established by either ListenWebSocket or ConnectWebSocket. |
|  | [PutZendeskTicket](putzendeskticket.md) | Create Zendesk tickets using the Zendesk API. |

## Q

|  | Processor | Description |
| --- | --- | --- |
|  | [QueryAzureDataExplorer](queryazuredataexplorer.md) | Query Azure Data Explorer and stream JSON results to output FlowFiles |
|  | [QueryDatabaseTable](querydatabasetable.md) | Generates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima. |
|  | [QueryDatabaseTableRecord](querydatabasetablerecord.md) | Generates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima. |
|  | [QueryMilvus](querymilvus.md) | Queries a given collection in a Milvus database using vectors. |
|  | [QueryPinecone](querypinecone.md) | Queries Pinecone for vectors that are similar to the input vector, or retrieves a vector by ID. |
|  | [QueryRecord](queryrecord.md) | Evaluates one or more SQL queries against the contents of a FlowFile. |
|  | [QuerySalesforceObject](querysalesforceobject.md) | Retrieves records from a Salesforce sObject. |
|  | [QuerySplunkIndexingStatus](querysplunkindexingstatus.md) | Queries Splunk server in order to acquire the status of indexing acknowledgement. |

## R

|  | Processor | Description |
| --- | --- | --- |
|  | [RemoveRecordField](removerecordfield.md) | Modifies the contents of a FlowFile that contains Record-oriented data (i. |
|  | [RenameRecordField](renamerecordfield.md) | Renames one or more fields in each Record of a FlowFile. |
|  | [ReplaceText](replacetext.md) | Updates the content of a FlowFile by searching for some textual value in the FlowFile content (via Regular Expression/regex, or literal value) and replacing the section of the content that matches with some alternate value. |
|  | [ReplaceTextWithMapping](replacetextwithmapping.md) | Updates the content of a FlowFile by evaluating a Regular Expression against it and replacing the section of the content that matches the Regular Expression with some alternate value provided in a mapping file. |
|  | [RetryFlowFile](retryflowfile.md) | FlowFiles passed to this Processor have a ‘Retry Attribute’ value checked against a configured ‘Maximum Retries’ value. |
|  | [RouteOnAttribute](routeonattribute.md) | Routes FlowFiles based on their Attributes using the Attribute Expression Language |
|  | [RouteOnContent](routeoncontent.md) | Applies Regular Expressions to the content of a FlowFile and routes a copy of the FlowFile to each destination whose Regular Expression matches. |
|  | [RouteText](routetext.md) | Routes textual data based on a set of user-defined rules. |
|  | [RunDatabricksJob](rundatabricksjob.md) | Triggers a pre-defined Databricks job to run with custom parameters. |
|  | [RunMongoAggregation](runmongoaggregation.md) | A processor that runs an aggregation query whenever a flowfile is received. |

## S

|  | Processor | Description |
| --- | --- | --- |
|  | [SampleRecord](samplerecord.md) | Samples the records of a FlowFile based on a specified sampling strategy (such as Reservoir Sampling). |
|  | [ScanAttribute](scanattribute.md) | Scans the specified attributes of FlowFiles, checking to see if any of their values are present within the specified dictionary of terms |
|  | [ScanContent](scancontent.md) | Scans the content of FlowFiles for terms that are found in a user-supplied dictionary. |
|  | [ScriptedFilterRecord](scriptedfilterrecord.md) | This processor provides the ability to filter records out from FlowFiles using the user-provided script. |
|  | [ScriptedPartitionRecord](scriptedpartitionrecord.md) | Receives Record-oriented data (i. |
|  | [ScriptedTransformRecord](scriptedtransformrecord.md) | Provides the ability to evaluate a simple script against each record in an incoming FlowFile. |
|  | [ScriptedValidateRecord](scriptedvalidaterecord.md) | This processor provides the ability to validate records in FlowFiles using the user-provided script. |
|  | [SearchElasticsearch](searchelasticsearch.md) | A processor that allows the user to repeatedly run a paginated query (with aggregations) written with the Elasticsearch JSON DSL. |
|  | [SegmentContent](segmentcontent.md) | Segments a FlowFile into multiple smaller segments on byte boundaries. |
|  | [SignContentPGP](signcontentpgp.md) | Sign content using OpenPGP Private Keys |
|  | [SnowflakeDetectDuplicate](snowflakedetectduplicate.md) | Checks if a FlowFile ‘s hash (provided as a FlowFile attribute) is already in a Snowflake table, and routes the FlowFile to’ duplicate ‘if found,’distinct ‘if not found, or’ failure’ on errors. |
|  | [SplitAvro](splitavro.md) | Splits a binary encoded Avro datafile into smaller files based on the configured Output Size. |
|  | [SplitContent](splitcontent.md) | Splits incoming FlowFiles by a specified byte sequence |
|  | [SplitExcel](splitexcel.md) | This processor splits a multi sheet Microsoft Excel spreadsheet into multiple Microsoft Excel spreadsheets where each sheet from the original file is converted to an individual spreadsheet in its own flow file. |
|  | [SplitJson](splitjson.md) | Splits a JSON File into multiple, separate FlowFiles for an array element specified by a JsonPath expression. |
|  | [SplitRecord](splitrecord.md) | Splits up an input FlowFile that is in a record-oriented data format into multiple smaller FlowFiles |
|  | [SplitText](splittext.md) | Splits a text file into multiple smaller text files on line boundaries limited by maximum number of lines or total size of fragment. |
|  | [SplitXml](splitxml.md) | Splits an XML File into multiple separate FlowFiles, each comprising a child or descendant of the original root element |
|  | [StartAwsPollyJob](startawspollyjob.md) | Trigger a AWS Polly job. |
|  | [StartAwsTextractJob](startawstextractjob.md) | Trigger a AWS Textract job. |
|  | [StartAwsTranscribeJob](startawstranscribejob.md) | Trigger a AWS Transcribe job. |
|  | [StartAwsTranslateJob](startawstranslatejob.md) | Trigger a AWS Translate job. |
|  | [StartGcpVisionAnnotateFilesOperation](startgcpvisionannotatefilesoperation.md) | Trigger a Vision operation on file input. |
|  | [StartGcpVisionAnnotateImagesOperation](startgcpvisionannotateimagesoperation.md) | Trigger a Vision operation on image input. |
|  | [SubmitQueryJob](submitqueryjob.md) | Submits a Query Job to Salesforce using the Bulk API 2. |
|  | [SummarizeText](summarizetext.md) | This processor uses a Large Language Model (LLM) to summarize the content of a FlowFile. |

## T

|  | Processor | Description |
| --- | --- | --- |
|  | [TagS3Object](tags3object.md) | Adds or updates a tag on an Amazon S3 Object. |
|  | [TailFile](tailfile.md) | “Tails” a file, or a list of files, ingesting data from the file as it is written to the file. |
|  | [TransformXml](transformxml.md) | Applies the provided XSLT file to the FlowFile XML payload. |

## U

|  | Processor | Description |
| --- | --- | --- |
|  | [UnpackContent](unpackcontent.md) | Unpacks the content of FlowFiles that have been packaged with one of several different Packaging Formats, emitting one to many FlowFiles for each input FlowFile. |
|  | [UpdateAttribute](updateattribute.md) | Updates the Attributes for a FlowFile by using the Attribute Expression Language and/or deletes the attributes based on a regular expression |
|  | [UpdateBoxFileMetadataInstance](updateboxfilemetadatainstance.md) | Updates metadata template values for a Box file using the record in the given flowFile. |
|  | [UpdateBulkJobState](updatebulkjobstate.md) | Updates the status of a Salesforce Bulk Job in the shared state service for a specific object type |
|  | [UpdateByQueryElasticsearch](updatebyqueryelasticsearch.md) | Update documents in an Elasticsearch index using a query. |
|  | [UpdateCounter](updatecounter.md) | This processor allows users to set specific counters and key points in their flow. |
|  | [UpdateDatabaseTable](updatedatabasetable.md) | This processor uses a JDBC connection and incoming records to generate any database table changes needed to support the incoming records. |
|  | [UpdateRecord](updaterecord.md) | Updates the contents of a FlowFile that contains Record-oriented data (i. |
|  | [UpdateSnowflakeDatabase](updatesnowflakedatabase.md) | Updates the definition of a Snowflake table based on the schema provided in the incoming FlowFile. |
|  | [UpdateSnowflakeIcebergDatabase](updatesnowflakeicebergdatabase.md) | Updates the definition of a Snowflake Iceberg table. |
|  | [UpdateSnowflakeSchema](updatesnowflakeschema.md) | Creates Snowflake database schema if it does not exist. |
|  | [UpdateSnowflakeStream](updatesnowflakestream.md) | Manages Snowflake streams by creating, dropping, or replacing them based on the configured operation. |
|  | [UpdateSnowflakeTable](updatesnowflaketable.md) | Updates the definition of a Snowflake table based on the schema provided in the incoming FlowFile. |
|  | [UpdateSnowflakeView](updatesnowflakeview.md) | Creates or replaces Snowflake views based on column mappings provided in the incoming FlowFile. |
|  | [UpdateTableState](updatetablestate.md) | Updates the state of a table in the Table State Service |
|  | [UpsertMilvus](upsertmilvus.md) | Upserts vectors into Milvus database for a given collection |
|  | [UpsertPinecone](upsertpinecone.md) | Publishes vectors, including metadata, and optionally text, to a Pinecone index. |
|  | [UpsertSFDCObjects](upsertsfdcobjects.md) | Upserts the records from the incoming FlowFile into Salesforce |

## V

|  | Processor | Description |
| --- | --- | --- |
|  | [ValidateCsv](validatecsv.md) | Validates the contents of FlowFiles or a FlowFile attribute value against a user-specified CSV schema. |
|  | [ValidateJson](validatejson.md) | Validates the contents of FlowFiles against a configurable JSON Schema. |
|  | [ValidateRecord](validaterecord.md) | Validates the Records of an incoming FlowFile against a given schema. |
|  | [ValidateXml](validatexml.md) | Validates XML contained in a FlowFile. |
|  | [VerifyContentMAC](verifycontentmac.md) | Calculates a Message Authentication Code using the provided Secret Key and compares it with the provided MAC property |
|  | [VerifyContentPGP](verifycontentpgp.md) | Verify signatures using OpenPGP Public Keys |

## W

|  | Processor | Description |
| --- | --- | --- |
|  | [Wait](wait.md) | Routes incoming FlowFiles to the ‘wait’ relationship until a matching release signal is stored in the distributed cache from a corresponding Notify processor. |
|  | [WaitForTableState](waitfortablestate.md) | Blocks incoming FlowFiles until the corresponding table state is not equal to accepted state. |

---
title: AmazonGlueEncodedSchemaReferenceReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/amazonglueencodedschemareferencereader.md
section: Loading & Unloading Data
---

# AmazonGlueEncodedSchemaReferenceReader

## Description

Reads Schema Identifier according to AWS Glue Schema encoding as a header consisting of a two byte markers and a 16 byte UUID

## Tags

avro, aws, glue, registry, schema

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AmazonGlueSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/amazonglueschemaregistry.md
section: Loading & Unloading Data
---

# AmazonGlueSchemaRegistry

## Description

Provides a Schema Registry that interacts with the AWS Glue Schema Registry so that those Schemas that are stored in the Glue Schema Registry can be used in NiFi. When a Schema is looked up by name by this registry, it will find a Schema in the Glue Schema Registry with their names.

## Tags

avro, aws, glue, registry, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| AWS Credentials Provider Service | AWS Credentials Provider Service |  |  | The Controller Service that is used to obtain AWS credentials provider |
| Cache Expiration \* | Cache Expiration | 1 hour |  | Specifies how long a Schema that is cached should remain in the cache. Once this time period elapses, a cached version of a schema will no longer be used, and the service will have to communicate with the Schema Registry again in order to obtain the schema. |
| Cache Size \* | Cache Size | 1000 |  | Specifies how many Schemas should be cached from the Schema Registry |
| Communications Timeout \* | Communications Timeout | 30 secs |  | Specifies how long to wait to receive data from the Schema Registry before considering the communications a failure |
| Region \* | Region | us-west-2 | * AWS GovCloud (US-East) * AWS GovCloud (US-West) * Africa (Cape Town) * Asia Pacific (Hong Kong) * Asia Pacific (Hyderabad) * Asia Pacific (Jakarta) * Asia Pacific (Malaysia) * Asia Pacific (Melbourne) * Asia Pacific (Mumbai) * Asia Pacific (New Zealand) * Asia Pacific (Osaka) * Asia Pacific (Seoul) * Asia Pacific (Singapore) * Asia Pacific (Sydney) * Asia Pacific (Taipei) * Asia Pacific (Thailand) * Asia Pacific (Tokyo) * Canada (Central) * Canada West (Calgary) * China (Beijing) * China (Ningxia) * EU (Germany) * EU ISOE West * Europe (Frankfurt) * Europe (Ireland) * Europe (London) * Europe (Milan) * Europe (Paris) * Europe (Spain) * Europe (Stockholm) * Europe (Zurich) * Israel (Tel Aviv) * Mexico (Central) * Middle East (Bahrain) * Middle East (UAE) * South America (Sao Paulo) * US East (N. Virginia) * US East (Ohio) * US ISO East * US ISO WEST * US ISOB East (Ohio) * US ISOF EAST * US ISOF SOUTH * US West (N. California) * US West (Oregon) * aws global region * aws-cn global region * aws-iso global region * aws-iso-b global region * aws-iso-e global region * aws-iso-f global region * aws-us-gov global region | The region of the cloud resources |
| SSL Context Service | SSL Context Service |  |  | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Schema Registry Name \* | Schema Registry Name |  |  | The name of the Schema Registry |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AmazonMSKConnectionService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/amazonmskconnectionservice.md
section: Loading & Unloading Data
---

# AmazonMSKConnectionService

## Description

Provides and manages connections to AWS MSK Kafka Brokers for producer or consumer operations.

## Tags

aws, kafka, managed, msk, openflow, streaming

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| SSL Context Service | SSL Context Service |  |  | Service supporting SSL communication with Kafka brokers |
| Acknowledgment Wait Time \* | ack.wait.time | 5 sec |  | After sending a message to Kafka, this indicates the amount of time that the service will wait for a response from Kafka.If Kafka does not acknowledge the message within this time period, the service will throw an exception. |
| AWS Profile Name | aws.profile.name |  |  | The Amazon Web Services Profile to select when multiple profiles are available. |
| Bootstrap Servers \* | bootstrap.servers |  |  | Comma-separated list of Kafka Bootstrap Servers in the format host:port. Corresponds to Kafka bootstrap.servers property |
| Client Timeout \* | default.api.timeout.ms | 60 sec |  | Default timeout for Kafka client operations. Mapped to Kafka default.api.timeout.ms. The Kafka request.timeout.ms property is derived from half of the configured timeout |
| Transaction Isolation Level \* | isolation.level | read_committed | * Read Committed * Read Uncommitted | Specifies how the service should handle transaction isolation levels when communicating with Kafka.The uncommited option means that messages will be received as soon as they are written to Kafka but will be pulled, even if the producer cancels the transactions.The committed option configures the service to not receive any messages for which the producer’s transaction was canceled, but this can result in some latency since theconsumer must wait for the producer to finish its entire transaction instead of pulling as the messages become available.Corresponds to Kafka isolation.level property. |
| Max Metadata Wait Time \* | max.block.ms | 5 sec |  | The amount of time publisher will wait to obtain metadata or wait for the buffer to flush during the ‘send’ call before failing theentire ‘send’ call. Corresponds to Kafka max.block.ms property |
| Max Poll Records \* | max.poll.records | 10000 |  | Maximum number of records Kafka should return in a single poll. |
| SASL Mechanism \* | sasl.mechanism | AWS_MSK_IAM | * AWS_MSK_IAM * SCRAM-SHA-512 | SASL mechanism used for authentication. Corresponds to Kafka Client sasl.mechanism property |
| SASL Password \* | sasl.password |  |  | Password provided with configured username when using PLAIN or SCRAM SASL Mechanisms |
| SASL Username \* | sasl.username |  |  | Username provided with configured password when using PLAIN or SCRAM SASL Mechanisms |
| Security Protocol \* | security.protocol | PLAINTEXT | * PLAINTEXT * SSL * SASL_PLAINTEXT * SASL_SSL | Security protocol used to communicate with brokers. Corresponds to Kafka Client security.protocol property |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Apache Kafka for JSON/AVRO data format
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/kafka-json-avro.md
section: Loading & Unloading Data
---

# Apache Kafka for JSON/AVRO data format

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the Apache Kafka connectors for JSON and AVRO data formats.
These are simplified connectors optimized for basic message ingestion with schema evolution
and topic-to-table mapping capabilities.

## Connector variants

### JSON data format connector

The Apache Kafka for JSON data format connector is designed for straightforward JSON message ingestion from Kafka topics to Snowflake tables.

Key features:

* JSON message format support
* Schema evolution
* Topic-to-table mapping
* SASL authentication

### AVRO data format connector

The Apache Kafka for AVRO data format connector is designed for AVRO message ingestion from Kafka topics to Snowflake tables with schema registry support.

Key features:

* AVRO message format support
* Schema registry integration
* Schema evolution
* Topic-to-table mapping
* SASL authentication

## Specific parameters

In addition to the common parameters described in [Set up the Openflow Connector for Kafka](setup.md), these connectors have specific parameter contexts.

### Schema registry parameters (AVRO connector only)

The AVRO connector includes additional parameters for schema registry integration:

| Parameter | Description | Required |
| --- | --- | --- |
| Schema Registry Authentication Type | The method of authenticating to schema registry if used. Otherwise, use *NONE*. One of: *NONE* / *BASIC*. Default: *NONE* | Yes |
| Schema Registry URL | The URL of Schema Registry. Required for *AVRO* message format. | No |
| Schema Registry Username | The username for Schema Registry. Required for *AVRO* message format. | No |
| Schema Registry Password | The password for Schema Registry. Required for *AVRO* message format. | No |
| AVRO Schema Access Strategy | The method of accessing the AVRO schema of a message. Required for *AVRO*. One of: *embedded-avro-schema* / *schema-reference-reader* / *schema-text-property*. Default: *embedded-avro-schema* | No |
| AVRO Schema | Avro schema in case schema-text-property is used in AVRO Schema Access Strategy with the AVRO message format. Note: this should only be used in case all messages consumed from the configured Kafka Topic(s) share the same schema. | No |

## Limitations

These simplified connectors have the following limitations compared to the full-featured DLQ and metadata connector:

* **No RECORD_METADATA column** - Kafka metadata is not stored in the target tables
* **No dead letter queue (DLQ)** - Failed messages are not routed to a DLQ topic
* **No Iceberg table support** - Only regular Snowflake tables are supported
* **Fixed schematization** - Schema detection is always enabled and cannot be disabled

> **Note:**
>
> Schema detection is enabled by default in these connectors and cannot be disabled.
> This means message fields are automatically flattened into individual table columns with automatic schema evolution.

## Use cases

These connectors are ideal for:

Simple data ingestion
:   When you only need the message content without Kafka metadata.

High-throughput scenarios
:   Where the simplified data structure improves performance.

Schema evolution use cases
:   Where automatic table schema updates are required

JSON or AVRO message formats
:   With consistent schemas

If you need Kafka metadata, DLQ support, or Iceberg table ingestion, use the [Apache Kafka with DLQ and metadata](kafka-dlq-metadata.md) connector instead.

## Schema detection and evolution

These connectors support automatic schema detection and evolution. The structure
of tables in Snowflake is defined and evolved automatically to support the structure
of new data loaded by the connector.

With schema detection enabled (which is always the case for these connectors),
Snowflake can detect the schema of the streaming data and load data into tables
that automatically match any user-defined schema. Snowflake also allows adding
new columns or dropping the `NOT NULL` constraint from columns missing in new data files.

Schema detection with the connector is supported with or without a provided schema registry.
If using schema registry (Avro), the column will be created with the data types defined
in the provided schema registry. If there is no schema registry (JSON), the data type will be inferred based on the data provided.

JSON ARRAY is not supported for further schematization.

### Schema evolution behavior

If the connector creates the target table, schema evolution is enabled by default.

If you want to enable or disable schema evolution on an existing table,
use the [ALTER TABLE](../../../../../sql-reference/sql/alter-table.md) command to set the `ENABLE_SCHEMA_EVOLUTION` parameter.
You must also use a role that has the `OWNERSHIP` privilege on the table. For more information, see [Enable automatic table schema evolution](../../../../data-load-schema-evolution.md).

---
title: Apache Kafka with DLQ and metadata
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/kafka-dlq-metadata.md
section: Loading & Unloading Data
---

# Apache Kafka with DLQ and metadata

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the Apache Kafka with DLQ and metadata connector. This is the full-featured connector that provides feature parity with the
legacy Snowflake connector for Kafka and includes advanced capabilities for production use cases.

## Key features

The Apache Kafka with DLQ and metadata connector provides comprehensive functionality:

* **Dead Letter Queue (DLQ)** support for failed message handling
* **RECORD_METADATA** column with Kafka message metadata
* **Configurable schematization** - enable or disable schema detection
* **Iceberg table support** with schema evolution
* **Multiple message formats** - JSON and AVRO support
* **Schema registry integration** for AVRO messages
* **Topic-to-table mapping** with advanced patterns
* **SASL authentication** support

## Specific parameters

In addition to the common parameters described in [Set up the Openflow Connector for Kafka](setup.md), this connector includes additional parameter contexts for advanced features.

### Message format and schema parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Message Format | The format of messages in Kafka. One of: *JSON* / *AVRO*. Default: *JSON* | Yes |
| AVRO Schema | Avro schema in case *schema-text-property* is used in AVRO Schema Access Strategy with the AVRO message format. Note: this should only be used in case all messages consumed from the configured Kafka Topic(s) share the same schema. | No |
| AVRO Schema Access Strategy | The method of accessing the AVRO schema of a message. Required for *AVRO*. One of: *embedded-avro-schema* / *schema-reference-reader* / *schema-text-property*. Default: *embedded-avro-schema* | No |

### Schema registry parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Schema Registry Authentication Type | The method of authenticating to schema registry if used. Otherwise, use *NONE*. One of: *NONE* / *BASIC*. Default: *NONE* | Yes |
| Schema Registry URL | The URL of Schema Registry. Required for *AVRO* message format. | No |
| Schema Registry Username | The username for Schema Registry. Required for *AVRO* message format. | No |
| Schema Registry Password | The password for Schema Registry. Required for *AVRO* message format. | No |

### DLQ and advanced features parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Kafka DLQ Topic | DLQ topic to send messages with parsing errors to | Yes |
| Schematization Enabled | Determines whether data is inserted into individual columns or a single RECORD_CONTENT field. One of: *true* / *false*. Default: *true* | Yes |
| Iceberg Enabled | Specifies whether the processor ingests data into an Iceberg table. The processor fails if this property doesn’t match the actual table type. Default: *false* | Yes |

## Schematization behavior

The connector’s behavior changes based on the **Schematization Enabled** parameter:

### Schematization enabled

When schematization is enabled, the connector:

* Creates individual columns for each field in the message
* Includes a **RECORD_METADATA** column with Kafka metadata
* Automatically evolves the table schema when new fields are detected
* Flattens nested JSON/AVRO structures into separate columns

**Example table structure:**

| Row | RECORD_METADATA | ACCOUNT | SYMBOL | SIDE | QUANTITY |
| --- | --- | --- | --- | --- | --- |
| 1 | {“timestamp”:1669074170090, “headers”: {“current.iter… | ABC123 | ZTEST | BUY | 3572 |
| 2 | {“timestamp”:1669074170400, “headers”: {“current.iter… | XYZ789 | ZABX | SELL | 3024 |

### Schematization disabled

When schematization is disabled, the connector:

* Creates only two columns: **RECORD_CONTENT** and **RECORD_METADATA**
* Stores the entire message content as an OBJECT in **RECORD_CONTENT**
* Does not perform automatic schema evolution
* Provides maximum flexibility for downstream processing

**Example table structure:**

| Row | RECORD_METADATA | RECORD_CONTENT |
| --- | --- | --- |
| 1 | {“timestamp”:1669074170090, “headers”: {“current.iter… | {“account”: “ABC123”, “symbol”: “ZTEST”, “side”:… |
| 2 | {“timestamp”:1669074170400, “headers”: {“current.iter… | {“account”: “XYZ789”, “symbol”: “ZABX”, “side”:… |

Use the `Schematization Enabled` property in the connector configuration properties to enable or disable schema detection.

## Schema detection and evolution

The connector supports schema detection and evolution.
The structure of tables in Snowflake can be defined and evolved automatically to support the structure of new data loaded by the connector.

Without schema detection and evolution, the Snowflake table loaded by the connector
only consists of two `OBJECT` columns: `RECORD_CONTENT` and `RECORD_METADATA`.

With schema detection and evolution enabled, Snowflake can detect the schema of
the streaming data and load data into tables that automatically match any user-defined schema.
Snowflake also allows adding new columns or dropping the `NOT NULL` constraint from columns missing in new data files.

Schema detection with the connector is supported with or without a provided schema registry.
If using schema registry (Avro), the column will be created with the data types defined in the provided schema registry.
If there is no schema registry (JSON), the data type will be inferred based on the data provided.

JSON ARRAY is not supported for further schematization.

### Enabling schema evolution

If the connector creates the target table, schema evolution is enabled by default.

If you want to enable or disable schema evolution on the existing table, use
the [ALTER TABLE](../../../../../sql-reference/sql/alter-table.md) command to set the `ENABLE_SCHEMA_EVOLUTION` parameter.
You must also use a role that has the `OWNERSHIP` privilege on the table.
For more information, see [Enable automatic table schema evolution](../../../../data-load-schema-evolution.md).

However, if schema evolution is disabled for an existing table, then the connector
will try to send the rows with mismatched schemas to the configured dead-letter queues (DLQ).

### RECORD_METADATA structure

The **RECORD_METADATA** column contains important Kafka message metadata:

| Field | Description |
| --- | --- |
| offset | The message offset within the Kafka partition |
| topic | The Kafka topic name |
| partition | The Kafka partition number |
| key | The message key (if present) |
| timestamp | The message timestamp |
| SnowflakeConnectorPushTime | Timestamp when the connector fetched the message from Kafka |
| headers | Map of message headers (if present) |

## Dead Letter Queue (DLQ)

The DLQ functionality handles messages that cannot be processed successfully:

### DLQ behavior

* **Parse failures** - Messages with invalid JSON/AVRO format are sent to the DLQ
* **Schema mismatches** - Messages that don’t match the expected schema when schema evolution is disabled
* **Processing errors** - Other processing failures during ingestion

## Iceberg table support

Openflow Connector for Kafka can ingest data into a Snowflake-managed [Apache Iceberg™ table](../../../../tables-iceberg.md) when **Iceberg Enabled** is set to *true*.

### Requirements and limitations

Before you configure the Openflow Kafka connector for Iceberg table ingestion, note the following requirements and limitations:

* You must create an Iceberg table before running the connector.
* Make sure that the user has access to inserting data into the created tables.

### Configuration and setup

To configure the Openflow Connector for Kafka for Iceberg table ingestion, follow
the steps in [Set up the Openflow Connector for Kafka](setup.md) with a few differences noted in the following sections.

#### Enable ingestion into Iceberg table

To enable ingestion into an Iceberg table, you must set the `Iceberg Enabled` parameter to `true`.

#### Create an Iceberg table for ingestion

Before you run the connector, you must create an Iceberg table.
The initial table schema depends on your connector `Schematization Enabled` property settings.

If you enable schematization, you must create a table with a column named `record_metadata`:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    record_metadata OBJECT()
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table';
```

The connector automatically creates the columns for message fields and alters the `record_metadata` column schema.

If you don’t enable schematization, you must create a table with a column named
`record_content` of a type that matches the actual Kafka message content.
The connector automatically creates the `record_metadata` column.

When you create an Iceberg table, you can use Iceberg data types or
[compatible Snowflake types](../../../../tables-iceberg-data-types.md).
The semi-structured VARIANT type isn’t supported. Instead, use a
[structured OBJECT or MAP](../../../../../sql-reference/data-types-structured.md).

For example, consider the following message:

```sqljson
{
    "id": 1,
    "name": "Steve",
    "body_temperature": 36.6,
    "approved_coffee_types": ["Espresso", "Doppio", "Ristretto", "Lungo"],
    "animals_possessed":
    {
        "dogs": true,
        "cats": false
    },
    "date_added": "2024-10-15"
}
```

### Iceberg table creation examples

**With schematization enabled:**

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    RECORD_METADATA OBJECT(
        offset INTEGER,
        topic STRING,
        partition INTEGER,
        key STRING,
        timestamp TIMESTAMP,
        SnowflakeConnectorPushTime BIGINT,
        headers MAP(VARCHAR, VARCHAR)
    ),
    id INT,
    body_temperature FLOAT,
    name STRING,
    approved_coffee_types ARRAY(STRING),
    animals_possessed OBJECT(dogs BOOLEAN, cats BOOLEAN),
    date_added DATE
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table';
```

**With schematization disabled:**

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    RECORD_METADATA OBJECT(
        offset INTEGER,
        topic STRING,
        partition INTEGER,
        key STRING,
        timestamp TIMESTAMP,
        SnowflakeConnectorPushTime BIGINT,
        headers MAP(VARCHAR, VARCHAR)
    ),
    RECORD_CONTENT OBJECT(
        id INT,
        body_temperature FLOAT,
        name STRING,
        approved_coffee_types ARRAY(STRING),
        animals_possessed OBJECT(dogs BOOLEAN, cats BOOLEAN),
        date_added DATE
    )
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table';
```

> **Note:**
>
> RECORD_METADATA must always be created. Field names inside nested structures such as `dogs` or `cats` are case sensitive.

## Use cases

This connector is ideal for:

* **Production environments** requiring DLQ
* **Data lineage and auditing** where Kafka metadata is important
* **Complex message processing** with schema evolution requirements
* **Iceberg table integration**

If you need simpler ingestion without metadata or DLQ features, consider
the [Apache Kafka for JSON/AVRO data format](kafka-json-avro.md) connectors instead.

---
title: ApicurioSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/apicurioschemaregistry.md
section: Loading & Unloading Data
---

# ApicurioSchemaRegistry

## Description

Provides a Schema Registry that interacts with the Apicurio Schema Registry so that those Schemas that are stored in the Apicurio Schema Registry can be used in NiFi. When a Schema is looked up by name by this registry, it will find a Schema in the Apicurio Schema Registry with their artifact identifiers.

## Tags

apicurio, avro, registry, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Cache Expiration \* | Cache Expiration | 1 hour |  | Specifies how long a Schema that is cached should remain in the cache. Once this time period elapses, a cached version of a schema will no longer be used, and the service will have to communicate with the Schema Registry again in order to obtain the schema. |
| Cache Size \* | Cache Size | 1000 |  | Specifies how many Schemas should be cached from the Schema Registry. The cache size must be a non-negative integer. When it is set to 0, the cache is effectively disabled. |
| Schema Group ID \* | Schema Group ID | default |  | The artifact Group ID for the schemas |
| Schema Registry URL \* | Schema Registry URL |  |  | The URL of the Schema Registry e.g. <http://localhost:8080> |
| Web Client Service Provider \* | Web Client Service Provider |  |  | Controller service for HTTP client operations |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AttributesToCSV 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/attributestocsv.md
section: Loading & Unloading Data
---

# AttributesToCSV 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates a CSV representation of the input FlowFile Attributes. The resulting CSV can be written to either a newly generated attribute named ‘CSVAttributes’ or written to the FlowFile as content. If the attribute value contains a comma, newline or double quote, then the attribute value will be escaped with double quotes. Any double quote characters in the attribute value are escaped with another double quote.

## Tags

attributes, csv, flowfile

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| attribute-list | Comma separated list of attributes to be included in the resulting CSV. If this value is left empty then all existing Attributes will be included. This list of attributes is case sensitive and supports attribute names that contain commas. If an attribute specified in the list is not found it will be emitted to the resulting CSV with an empty string or null depending on the ‘Null Value’ property. If a core attribute is specified in this list and the ‘Include Core Attributes’ property is false, the core attribute will be included. The attribute list ALWAYS wins. |
| attributes-regex | Regular expression that will be evaluated against the flow file attributes to select the matching attributes. This property can be used in combination with the attributes list property. The final output will contain a combination of matches found in the ATTRIBUTE_LIST and ATTRIBUTE_REGEX. |
| destination | Control if CSV value is written as a new flowfile attribute ‘CSVData’ or written in the flowfile content. |
| include-core-attributes | Determines if the FlowFile org.apache.nifi.flowfile.attributes. CoreAttributes, which are contained in every FlowFile, should be included in the final CSV value generated. Core attributes will be added to the end of the CSVData and CSVSchema strings. The Attribute List property overrides this setting. |
| include-schema | If true the schema (attribute names) will also be converted to a CSV string which will either be applied to a new attribute named ‘CSVSchema’ or applied at the first row in the content depending on the DESTINATION property setting. |
| null-value | If true a non existing or empty attribute will be ‘null’ in the resulting CSV. If false an empty string will be placed in the CSV |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to convert attributes to CSV |
| success | Successfully converted attributes to CSV |

## Writes attributes

| Name | Description |
| --- | --- |
| CSVSchema | CSV representation of the Schema |
| CSVData | CSV representation of Attributes |

---
title: AttributesToJSON 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/attributestojson.md
section: Loading & Unloading Data
---

# AttributesToJSON 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates a JSON representation of the input FlowFile Attributes. The resulting JSON can be written to either a new Attribute ‘JSONAttributes’ or written to the FlowFile as content. Attributes which contain nested JSON objects can either be handled as JSON or as escaped JSON depending on the strategy chosen.

## Tags

attributes, flowfile, json

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attributes List | Comma separated list of attributes to be included in the resulting JSON. If this value is left empty then all existing Attributes will be included. This list of attributes is case sensitive. If an attribute specified in the list is not found it will be emitted to the resulting JSON with an empty string or NULL value. |
| Destination | Control if JSON value is written as a new flowfile attribute ‘JSONAttributes’ or written in the flowfile content. Writing to flowfile content will overwrite any existing flowfile content. |
| Include Core Attributes | Determines if the FlowFile org.apache.nifi.flowfile.attributes. CoreAttributes which are contained in every FlowFile should be included in the final JSON value generated. |
| JSON Handling Strategy | Strategy to use for handling attributes which contain nested JSON. |
| Null Value | If true a non existing selected attribute will be NULL in the resulting JSON. If false an empty string will be placed in the JSON |
| Pretty Print | Apply pretty print formatting to the output. |
| attributes-to-json-regex | Regular expression that will be evaluated against the flow file attributes to select the matching attributes. This property can be used in combination with the attributes list property. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to convert attributes to JSON |
| success | Successfully converted attributes to JSON |

## Writes attributes

| Name | Description |
| --- | --- |
| JSONAttributes | JSON representation of Attributes |

---
title: AvroReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/avroreader.md
section: Loading & Unloading Data
---

# AvroReader

## Description

Parses Avro data and returns each Avro record as an separate Record object. The Avro data may contain the schema itself, or the schema can be externalized and accessed by one of the methods offered by the ‘Schema Access Strategy’ property.

## Tags

avro, comma, delimited, parse, reader, record, row, separated, values

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Schema Access Strategy \* | Schema Access Strategy | embedded-avro-schema | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Use Embedded Avro Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Cache Size \* | cache-size | 1000 |  | Specifies how many Schemas should be cached |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AvroRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/avrorecordsetwriter.md
section: Loading & Unloading Data
---

# AvroRecordSetWriter

## Description

Writes the contents of a RecordSet in Binary Avro format.

## Tags

avro, record, recordset, result, row, serializer, set, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Schema Access Strategy \* | Schema Access Strategy | inherit-record-schema | * Inherit Record Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Cache | Schema Cache |  |  | Specifies a Schema Cache to add the Record Schema to so that Record Readers can quickly lookup the schema. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Reference Writer \* | Schema Reference Writer |  |  | Service implementation responsible for writing FlowFile attributes or content header with Schema reference information |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Schema Write Strategy \* | Schema Write Strategy | avro-embedded | * Embed Avro Schema * Do Not Write Schema * Set ‘schema.name’ Attribute * Set ‘avro.schema’ Attribute * Schema Reference Writer | Specifies how the schema for a Record should be added to the data. |
| Cache Size \* | cache-size | 1000 |  | Specifies how many Schemas should be cached |
| Compression Format \* | compression-format | NONE | * BZIP2 * DEFLATE * NONE * SNAPPY * LZO | Compression type to use when writing Avro files. Default is None. |
| Encoder Pool Size \* | encoder-pool-size | 32 |  | Avro Writers require the use of an Encoder. Creation of Encoders is expensive, but once created, they can be reused. This property controls the maximum number of Encoders that can be pooled and reused. Setting this value too small can result in degraded performance, but setting it higher can result in more heap being used. This property is ignored if the Avro Writer is configured with a Schema Write Strategy of ‘Embed Avro Schema’. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AvroSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/avroschemaregistry.md
section: Loading & Unloading Data
---

# AvroSchemaRegistry

## Description

Provides a service for registering and accessing schemas. You can register a schema as a dynamic property where ‘name’ represents the schema name and ‘value’ represents the textual representation of the actual schema following the syntax and semantics of Avro’s Schema format.

## Tags

avro, csv, json, registry, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Validate Field Names \* | avro-reg-validated-field-names | true | * true * false | Whether or not to validate the field names in the Avro schema based on Avro naming rules. If set to true, all field names must be valid Avro names, which must begin with `[A-Za-z_]`, and subsequently contain only `[A-Za-z0-9_]`. If set to false, no validation will be performed on the field names. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AWSCredentialsProviderControllerService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/awscredentialsprovidercontrollerservice.md
section: Loading & Unloading Data
---

# AWSCredentialsProviderControllerService

## Description

Defines credentials for Amazon Web Services processors. Uses default credentials without configuration. Default credentials support EC2 instance profile/role, default user profile, environment variables, etc. Additional options include access key / secret key pairs, credentials file, named profile, and assume role credentials.

## Tags

aws, credentials, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Access Key ID | Access Key ID |  |  |  |
| Assume Role ARN | Assume Role ARN |  |  | The AWS Role ARN for cross account access. This is used in conjunction with Assume Role Session Name and other Assume Role properties. |
| Assume Role External ID | Assume Role External ID |  |  | External ID for cross-account access. This is used in conjunction with Assume Role ARN. |
| Assume Role Proxy Configuration Service | Assume Role Proxy Configuration Service |  |  | Proxy configuration for cross-account access, if needed within your environment. This will configure a proxy to request for temporary access keys into another AWS account. |
| Assume Role SSL Context Service | Assume Role SSL Context Service |  |  | SSL Context Service used when connecting to the STS Endpoint. |
| Assume Role STS Endpoint Override | Assume Role STS Endpoint Override |  |  | The default AWS Security Token Service (STS) endpoint (“sts.amazonaws.com”) works for all accounts that are not for China (Beijing) region or GovCloud. You only need to set this property to “sts.cn-north-1.amazonaws.com.cn” when you are requesting session credentials for services in China(Beijing) region or to “sts.us-gov-west-1.amazonaws.com” for GovCloud. |
| Assume Role STS Region | Assume Role STS Region | us-west-2 | * Middle East (UAE) * US ISOF SOUTH * Asia Pacific (Taipei) * US West (N. California) * US West (Oregon) * Africa (Cape Town) * Asia Pacific (Osaka) * Asia Pacific (Seoul) * Asia Pacific (Tokyo) * Middle East (Bahrain) * South America (Sao Paulo) * China (Beijing) * Asia Pacific (Singapore) * Asia Pacific (Sydney) * Asia Pacific (Jakarta) * Asia Pacific (Melbourne) * Asia Pacific (Malaysia) * US East (N. Virginia) * Asia Pacific (New Zealand) * US East (Ohio) * Asia Pacific (Thailand) * China (Ningxia) * Asia Pacific (Hyderabad) * Asia Pacific (Mumbai) * Europe (Milan) * Europe (Spain) * AWS GovCloud (US-East) * Israel (Tel Aviv) * Canada (Central) * Mexico (Central) * Europe (Frankfurt) * EU (Germany) * US ISO WEST * Europe (Zurich) * EU ISOE West * Europe (Stockholm) * Europe (Paris) * Europe (London) * Europe (Ireland) * Asia Pacific (Hong Kong) * Canada West (Calgary) * AWS GovCloud (US-West) * US ISO East * US ISOB East (Ohio) * US ISOF EAST | The AWS Security Token Service (STS) region |
| Assume Role STS Signer Override | Assume Role STS Signer Override | Default Signature | * Default Signature * Signature Version 4 * Custom Signature | The AWS STS library uses Signature Version 4 by default. This property allows you to plug in your own custom signer implementation. |
| Assume Role Session Name \* | Assume Role Session Name |  |  | The AWS Role Session Name for cross account access. This is used in conjunction with Assume Role ARN. |
| Assume Role Session Time | Assume Role Session Time | 3600 |  | Session time for role based session (between 900 and 3600 seconds). This is used in conjunction with Assume Role ARN. |
| Credentials File | Credentials File |  |  | Path to a file containing AWS access key and secret key in properties file format. |
| Custom Signer Class Name \* | Custom Signer Class Name |  |  | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth.Signer interface. |
| Custom Signer Module Location | Custom Signer Module Location |  |  | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Profile Name | Profile Name |  |  | The AWS profile name for credentials from the profile configuration file. |
| Secret Access Key | Secret Access Key |  |  |  |
| Use Anonymous Credentials | Use Anonymous Credentials | false | * true * false | If true, uses Anonymous credentials |
| Use Default Credentials | Use Default Credentials | false | * true * false | If true, uses the Default Credential chain, including EC2 instance profiles or roles, environment variables, default user credentials, etc. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| access environment credentials | The default configuration can read environment variables and system properties for credentials |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureBlobStorageFileResourceService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azureblobstoragefileresourceservice.md
section: Loading & Unloading Data
---

# AzureBlobStorageFileResourceService

## Description

Provides an Azure Blob Storage file resource for other components.

## Tags

azure, blob, cloud, file, microsoft, resource, storage

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Blob Name \* | Blob Name | ${azure.blobname} |  | The full name of the blob |
| Container Name \* | Container Name | ${azure.container} |  | Name of the Azure storage container. In case of PutAzureBlobStorage processor, container can be created if it does not exist. |
| Storage Credentials \* | Storage Credentials |  |  | Controller Service used to obtain Azure Blob Storage Credentials. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureCosmosDBClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azurecosmosdbclientservice.md
section: Loading & Unloading Data
---

# AzureCosmosDBClientService

## Description

Provides a controller service that configures a connection to Cosmos DB (Core SQL API) and provides access to that connection to other Cosmos DB-related components.

## Tags

azure, cosmos, document, service

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Cosmos DB Access Key | Cosmos DB Access Key |  |  | Cosmos DB Access Key from Azure Portal (Settings->Keys). Choose a read-write key to enable database or container creation at run time |
| Cosmos DB Consistency Level | Cosmos DB Consistency Level | SESSION | * STRONG * BOUNDED_STALENESS * SESSION * CONSISTENT_PREFIX * EVENTUAL | Choose from five consistency levels on the consistency spectrum. Refer to Cosmos DB documentation for their differences |
| Cosmos DB URI | Cosmos DB URI |  |  | Cosmos DB URI, typically in the form of <https:/>/{databaseaccount}.documents.azure.com:443/ Note this host URL is for Cosmos DB with Core SQL API from Azure Portal (Overview->URI) |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureDataLakeStorageFileResourceService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azuredatalakestoragefileresourceservice.md
section: Loading & Unloading Data
---

# AzureDataLakeStorageFileResourceService

## Description

Provides an Azure Data Lake Storage (ADLS) file resource for other components.

## Tags

adlsgen2, azure, cloud, datalake, file, microsoft, resource, storage

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| ADLS Credentials \* | ADLS Credentials |  |  | Controller Service used to obtain Azure Credentials. |
| Directory Name \* | Directory Name | ${azure.directory} |  | Name of the Azure Storage Directory. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. In case of the PutAzureDataLakeStorage processor, the directory will be created if not already existing. |
| File Name \* | File Name | ${azure.filename} |  | The filename |
| Filesystem Name \* | Filesystem Name | ${azure.filesystem} |  | Name of the Azure Storage File System (also called Container). It is assumed to be already existing. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureEventHubRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azureeventhubrecordsink.md
section: Loading & Unloading Data
---

# AzureEventHubRecordSink

## Description

Format and send Records to Azure Event Hubs

## Tags

azure, record, sink

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Authentication Strategy \* | Authentication Strategy | DEFAULT_AZURE_CREDENTIAL | * Shared Access Key * Default Azure Credential | Strategy for authenticating to Azure Event Hubs |
| Event Hub Name \* | Event Hub Name |  |  | Provides the Event Hub Name for connections |
| Event Hub Namespace \* | Event Hub Namespace |  |  | Provides provides the host for connecting to Azure Event Hubs |
| Partition Key | Partition Key |  |  | A hint for Azure Event Hub message broker how to distribute messages across one or more partitions |
| Service Bus Endpoint \* | Service Bus Endpoint | .servicebus.windows.net | * Azure * Azure China * Azure Germany * Azure US Government | Provides the domain for connecting to Azure Event Hubs |
| Shared Access Policy | Shared Access Policy |  |  | The name of the shared access policy. This policy must have Send claims |
| Shared Access Policy Key | Shared Access Policy Key |  |  | The primary or secondary key of the shared access policy |
| Transport Type \* | Transport Type | Amqp | * AMQP * AMQP_WEB_SOCKETS | Advanced Message Queuing Protocol Transport Type for communication with Azure Event Hubs |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureStorageCredentialsControllerService_v12
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azurestoragecredentialscontrollerservice_v12.md
section: Loading & Unloading Data
---

# AzureStorageCredentialsControllerService_v12

## Description

Provides credentials for Azure Storage processors using Azure Storage client library v12.

## Tags

azure, blob, cloud, credentials, microsoft, queue, storage

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Account Key \* | Account Key |  |  | The storage account key. This is an admin-like password providing access to every container in this account. It is recommended one uses Shared Access Signature (SAS) token, Managed Identity or Service Principal instead for fine-grained control with policies. |
| Credentials Type \* | Credentials Type | SAS_TOKEN | * Account Key * SAS Token * Managed Identity * Service Principal | Credentials type to be used for authenticating to Azure |
| Endpoint Suffix \* | Endpoint Suffix | blob.core.windows.net |  | Storage accounts in public Azure always use a common FQDN suffix. Override this endpoint suffix with a different suffix in certain circumstances (like Azure Stack or non-public Azure regions). |
| Managed Identity Client ID | Managed Identity Client ID |  |  | Client ID of the managed identity. The property is required when User Assigned Managed Identity is used for authentication. It must be empty in case of System Assigned Managed Identity. |
| SAS Token \* | SAS Token |  |  | Shared Access Signature token (the leading ‘?’ may be included) |
| Service Principal Client ID \* | Service Principal Client ID |  |  | Client ID (or Application ID) of the Client/Application having the Service Principal. |
| Service Principal Client Secret \* | Service Principal Client Secret |  |  | Password of the Client/Application. |
| Service Principal Tenant ID \* | Service Principal Tenant ID |  |  | Tenant ID of the Azure Active Directory hosting the Service Principal. |
| Storage Account Name \* | Storage Account Name |  |  | The storage account name. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: AzureStorageCredentialsControllerServiceLookup_v12
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/azurestoragecredentialscontrollerservicelookup_v12.md
section: Loading & Unloading Data
---

# AzureStorageCredentialsControllerServiceLookup_v12

## Description

Provides an AzureStorageCredentialsService_v12 that can be used to dynamically select another AzureStorageCredentialsService_v12. This service requires an attribute named ‘azure.storage.credentials.name’ to be passed in, and will throw an exception if the attribute is missing. The value of ‘azure.storage.credentials.name’ will be used to select the AzureStorageCredentialsService_v12 that has been registered with that name. This will allow multiple AzureStorageCredentialsServices_v12 to be defined and registered, and then selected dynamically at runtime by tagging flow files with the appropriate ‘azure.storage.credentials.name’ attribute.

## Tags

azure, blob, cloud, credentials, microsoft, queue, storage

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: CalculateRecordStats 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/calculaterecordstats.md
section: Loading & Unloading Data
---

# CalculateRecordStats 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Counts the number of Records in a record set, optionally counting the number of elements per category, where the categories are defined by user-defined properties.

## Tags

metrics, record, stats

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| record-stats-limit | Limit the number of individual stats that are returned for each record path to the top N results. |
| record-stats-reader | A record reader to use for reading the records. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be processed for any reason, it is routed to this Relationship. |
| success | All FlowFiles that are successfully processed, are routed to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | A count of the records in the record set in the FlowFile. |
| recordStats.<User Defined Property Name>.count | A count of the records that contain a value for the user defined property. |
| recordStats.<User Defined Property Name>.<value>.count | Each value discovered for the user defined property will have its own count attribute. Total number of top N value counts to be added is defined by the limit configuration. |

---
title: CaptureChangeMySQL 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturechangemysql.md
section: Loading & Unloading Data
---

# CaptureChangeMySQL 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Reads CDC events from a MySQL database. The processor continuously reads events from binary log files, filtering those related to the tables provided by the TableStateService, and discarding the rest. The processor outputs two types of FlowFiles: - DDLs containing the schema of a table (the initial schema and a new schema on every schema change). - DMLs with records representing changes to the data in the table. One FlowFile always represents data related to a single table. The DDL with the schema is written to the FlowFile content as a JSON object: { “columns”: [ { “name”: “<columnName>”, “type”: “<snowflakeType>”, “nullable”: <true|false>, “scale”: <scale>, “precision”: <precision> }, … ], “primaryKeys”: [“<primaryKey1>”, “<primaryKey2>”, …] } Structure of the FlowFiles containing the DML records: { “primaryKeys”: { “<column>”: <value>, … }, “payload”: { “<column>”: <value>, … }, “metadata”: { “<column>”: <value>, … }

## Tags

cdc, event, jdbc, mysql, sql

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Filter Store | Service storing per-table column filtering settings. |
| Connection Timeout | Connection to source database timeout |
| JDBC Driver Location | Comma-separated list of files/folders and/or URLs containing the driver JAR and its dependencies (if any). For example ‘/var/tmp/mariadb-java-client-3.4.1.jar’ |
| JDBC URL | JDBC URL of the database connection, ie. <jdbc:mariadb://localhost:3306/mysql> |
| Max Batch Size | The maximum number of records to process in a single iteration. The number of records may exceed the maximum batch size when the last binlog event contains more than one row. |
| Max Batch Wait Time | The maximum time to wait for data to appear in the binlog. |
| Max Queue Size | The maximum number of elements read from binlog until reader thread will wait for onTrigger |
| Password | Password to access the MySQL database |
| Record Writer | The Record Writer is used for serializing DML events |
| SSL Context Service | SSL Context Service supporting encrypted socket communication |
| SSL Mode | SSL Mode used when SSL Context Service configured supporting certificate verification options |
| Server ID | Server ID (in the range from 1 to 2^32 - 1). This value MUST be unique across whole replication group (that is, different from any other Server ID being used by any master or slave). Keep in mind that each binary log client should be treated as a simplified slave and thus MUST also use a different Server ID. |
| Server ID Strategy | Determines how the server ID is selected |
| Table State Store | The shared store holding the state of replicated tables. |
| Username | Username to access the MySQL database |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Information such as a ‘pointer’ to the current CDC event in the database is stored by this processor, such that it can continue from the same location if restarted. |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully created FlowFile from CDC stream events |

## Writes attributes

| Name | Description |
| --- | --- |
| source.schema.name | Name of the schema of the table from which an event originated |
| source.table.name | Name of the table from which an event originated |
| cdc.event.type | Type of event carried by the FlowFile: ddl or dml |
| cdc.most.significant.position | Ddl’s most significant position in cdc stream |
| cdc.least.significant.position | Ddl’s least significant position in cdc stream |
| cdc.event.seen.at | Timestamp from time when ddl event has been read by the processor |

---
title: CaptureChangePostgreSQL 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturechangepostgresql.md
section: Loading & Unloading Data
---

# CaptureChangePostgreSQL 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Reads CDC events from a PostgreSQL database. The processor continuously reads events arriving in the stream, filtering for those related to tables provided by the TableStateService, and discarding the rest. After the current batch of events is processed, the processor confirms the replication slot position back to PostgreSQL, letting it trim the WAL. The processor outputs two types of FlowFiles: DDLs, containing the initial schema of a table, and then every time its schema changes, and DMLs, with records representing changes to data in the table. One FlowFile always represents data related to a single table. The DDL with the schema is written to the FlowFile content as a JSON object, in a form such as: { “columns”: [ { “name”: “<columnName>”, “type”: “<snowflakeType>”, “nullable”: <true|false>, “scale”: <scale>, “precision”: <precision> }, … ], “primaryKeys”: [“<primaryKey1>”, “<primaryKey2>”, …] } The DML records are structured as: { “primaryKeys”: { “<column>”: <value>, … }, “payload”: { “<column>”: <value>, … }, “metadata”: { “<column>”: <value>, … }

## Tags

cdc, event, jdbc, postgresql, sql

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Filter Store | Service storing per-table column filtering settings. |
| JDBC Driver Location | Comma-separated list of files/folders and/or URLs containing the driver JAR and its dependencies (if any). For example ‘/var/tmp/postgresql-java-client-42.7.5.jar’ |
| JDBC URL | JDBC URL of the database connection, ie. <jdbc:postgresql://localhost:5432/postgres> |
| Max Batch Size | The maximum number of records to process in a single iteration |
| Max Batch Wait Time | The maximum time to wait for data to appear in the CDC stream. |
| Password | Password to access the PostgreSQL database |
| Publication Name | The name of the CDC publication to read from. |
| Record Writer | The Record Writer is used for serializing DML events |
| Replication Slot Name | The name of the replication slot to use. 63 characters maximum. If the slot doesn’t exist, the processor will create it. |
| SSL Context Service | SSL Context Service supporting encrypted socket communication |
| SSL Mode | Whether to use and enforce SSL when connecting to PostgreSQL |
| TOASTed Value Placeholder | The value to put into a TOASTed column |
| TOASTed Value Strategy | Determines how to handle TOASTed values. |
| Table State Store | The shared store holding the state of replicated tables. |
| Username | Username to access the PostgreSQL database |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Information such as a ‘pointer’ to the current CDC event in the database is stored by this processor, such that it can continue from the same location if restarted, and the name of the replication slot created in PostgreSQL. |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully created FlowFile from CDC stream events |

## Writes attributes

| Name | Description |
| --- | --- |
| source.schema.name | Name of the schema of the table from which an event originated |
| source.table.name | Name of the table from which an event originated |
| cdc.event.type | Type of event carried by the FlowFile: ddl or dml |
| cdc.most.significant.position | Ddl’s most significant position in cdc stream |
| cdc.least.significant.position | Ddl’s least significant position in cdc stream |
| cdc.event.seen.at | Timestamp from time when ddl event has been read by the processor |

---
title: CaptureChangeSqlServer 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturechangesqlserver.md
section: Loading & Unloading Data
---

# CaptureChangeSqlServer 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Reads CDC events from a SQL Server database. The processor periodically queries Change Tracking tables in the database, but only for the tables provided by the TableStateService. The processor maintains a state of the last processed event for each table. The processor moves the position after each processed table. The processor supports multi-threading. The number of threads and connection limit configured in the pool collectively define the upper bound of open connections to the source database. The processor outputs two types of FlowFiles: DDLs, containing the initial schema of a table, and then every time its schema changes, and DMLs, with records representing changes to data in the table. One FlowFile always represents data related to a single table. The DDL with the schema is written to the FlowFile content as a JSON object, in a form such as: { “columns”: [ { “name”: “<columnName>”, “type”: “<snowflakeType>”, “nullable”: <true|false>, “scale”: <scale>, “precision”: <precision> }, … ], “primaryKeys”: [“<primaryKey1>”, “<primaryKey2>”, …] } The DML records are structured as: { “primaryKeys”: { “<column>”: <value>, … }, “payload”: { “<column>”: <value>, … }, “metadata”: { “<column>”: <value>, … }

## Tags

cdc, event, jdbc, sql, sql server

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Filter Store | Service storing per-table column filtering settings. |
| Connection Pool | The connection pool |
| Fetch Size | The maximum number of rows loaded into memory at once |
| Max Batch Size | The maximum number of rows to fetch in a single batch |
| Record Writer | The Record Writer is used for serializing DML events |
| Table Changes Query Interval | The minimum time interval that must elapse before scheduling the next query for table changes. This controls the frequency of database polling to prevent excessive querying. |
| Table State Store | The shared store holding the state of replicated tables. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Information such as a version of the last processed record for each table is stored by this processor, such that it can continue from the same location if restarted. |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully created FlowFile from CDC stream events |

---
title: CaptureGoogleDriveChanges 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturegoogledrivechanges.md
section: Loading & Unloading Data
---

# CaptureGoogleDriveChanges 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Captures changes to a Shared Google Drive and emits a FlowFile for each change that occurs. This includes addition and deletion of files, as well as changes to file metadata and permissions. The processor is designed to be used in conjunction with the FetchGoogleDrive processor.

## Tags

authorization, cdc, change data capture, cloud, drive, gcp, google, openflow, permissions, storage, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Drive ID | The ID of the Shared Google Drive to monitor. |
| GCP Credentials Service | The Controller Service used to obtain Google Cloud Platform credentials. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores a token/cursor to track which changes have already been processed. |

## Relationships

| Name | Description |
| --- | --- |
| created | This Relationship is used for any files that are created. |
| removed | This Relationship is used for any files that are deleted. |
| updated | This Relationship is used for any files that are updated. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.drive.drive.id | The ID of the Shared Google Drive. |
| google.drive.file.id | The ID of the file that was changed. |
| drive.id | The ID of the file that was changed. This is repeated for compatibility with FetchGoogleDrive’s default configuration. |
| google.drive.file.name | The name of the file that was changed. |
| google.drive.change.type | The type of change that occurred. Possible values are ‘CREATED’, ‘UPDATED’, or ‘DELETED’. |
| google.drive.change.time | The timestamp of the change, in milliseconds since the Unix epoch. |
| google.drive.created.time | The timestamp when the file was created, in milliseconds since the Unix epoch. |
| google.drive.webUrl | A link for opening the file in a relevant Google editor or viewer in a browser. |
| google.drive.size | The size of the file in bytes. |
| google.drive.md5 | The MD5 checksum of the file. |
| google.drive.version | The version of the file. This changes based on user and system based updates to the file. |
| google.drive.mime.type | The MIME type of the file. |
| google.drive.lastModifiedBy.displayName | A display name of the user that modified the file. |
| google.drive.lastModifiedBy.email | An email of the user that modified the file. |
| google.drive.permissions.<role>.users | A comma-separated list of email addresses for users with the specified role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if the owner is [john.doe@gmail.com](mailto:john.doe%40gmail.com) and users [jane.doe@gmail.com](mailto:jane.doe%40gmail.com) and [jake.doe@gmail.com](mailto:jake.doe%40gmail.com) are readers, there would be an attribute named `google.drive.permissions.owner.users` with the value `john.doe@gmail.com`, and an attribute named `google.drive.permissions.reader.users` with the value `jane.doe@gmail.com, jake.doe@gmail.com` |
| google.drive.permissions.<role>.groups | A comma-separated list of email addresses for groups with the specified role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if the owner is `employees@openflow-all-dev.iam.gserviceaccount.com` and the group `contractors@openflow-all-dev.iam.gserviceaccount.com` is a reader, there would be an attribute named `google.drive.permissions.owner.groups` with the value `employees@openflow-all-dev.iam.gserviceaccount.com`, and an attribute named `google.drive.permissions.reader.groups` with the value `contractors@openflow-all-dev.iam.gserviceaccount.com` |
| google.drive.permissions.<role>.domains | A comma-separated list of domain names for which all users have the given role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if all users in the domain `snowflake.com` have the role of reader, there would be an attribute named `google.drive.permissions.reader.domains` with the value `snowflake.com` |
| google.drive.permissions.<role>.public | If a file is shared publicly, this attribute will be added with a value of ‘true’ for any role that applies to the public. |
| google.drive.file.path | The hierarchical path of the file in Google Drive, e.g. ‘parent_folder/child_folder/file.txt’. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.CaptureSharepointChanges](capturesharepointchanges.md)
* [org.apache.nifi.processors.gcp.drive.FetchGoogleDrive](fetchgoogledrive.md)

---
title: CaptureMicrosoft365GroupsChanges 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturemicrosoft365groupschanges.md
section: Loading & Unloading Data
---

# CaptureMicrosoft365GroupsChanges 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Captures Microsoft365 groups changes and emits a FlowFile for each change that occurs. This includes membership changes.

## Tags

cdc, document, graph, library, microsoft, sharepoint, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores a delta token for Microsoft365 groups |

## Relationships

| Name | Description |
| --- | --- |
| deleted | A FlowFile is routed to this relationship for each Microsoft365 group that has been deleted. |
| updated | A FlowFile is routed to this relationship for each Microsoft365 group whose membership has changed. |

## Writes attributes

| Name | Description |
| --- | --- |
| microsoft365.group.id | An id of a changed group |
| microsoft365.group.email | An email of the changed group |

---
title: CaptureSharepointChanges 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/capturesharepointchanges.md
section: Loading & Unloading Data
---

# CaptureSharepointChanges 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Captures changes from a Sharepoint Document Library and emits a FlowFile for each change that occurs. This includes additions and deletions of files and folders, as well as changes to permissions, metadata, and file content.

## Tags

cdc, document, graph, library, microsoft, openflow, sharepoint, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API |
| Change Capture Initial Action | If the Processor is run without having any prior state, this property dictates how the Processor should treat existing Sharepoint items. |
| Document Library Name | The name of the Document Library to list. If not specified, all Document Libraries associated with the Site will be listed. |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |
| Fetch Item Permissions | If true, the Processor will fetch user and group permission information for the captured Sharepoint item. |
| Folder Name | The name of the Folder/Directory to list |
| Item Permissions To Fetch | A comma-separated list of permission types to fetch for the captured Sharepoint item. Available permission types: USER, GROUP, SITE_USER, SITE_GROUP. |
| Site URL | The URL of the Sharepoint Site that data will be retrieved from. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores tokens for each Sharepoint folder to track state about which events have already been captured. |

## Relationships

| Name | Description |
| --- | --- |
| created | A FlowFile is routed to this relationship for each Sharepoint item that is created. |
| deleted | A FlowFile is routed to this relationship for each Sharepoint item that is deleted. |
| updated | A FlowFile is routed to this relationship for each Sharepoint item that is updated. |

## Writes attributes

| Name | Description |
| --- | --- |
| sharepoint.change.type | The type of change that occurred. Possible values are ‘Created’, ‘Updated’, ‘PermissionsUpdated’, ‘Deleted’. |
| sharepoint.item.id | The ID of the Sharepoint item that was changed. |
| sharepoint.item.type | The type of the Sharepoint item that was changed. Possible values are ‘File’ and ‘Folder’. |
| sharepoint.path | The path of the Sharepoint item that was changed. This is the path relative to the root of the Document Library. |
| sharepoint.filename | The name of the Sharepoint item that was changed. This attribute is not available for ‘Deleted’ changes. |
| sharepoint.size | The size of the Sharepoint item that was changed. |
| sharepoint.createdAt | The creation timestamp of the Sharepoint item that was changed. |
| sharepoint.lastModified | The last modified timestamp of the Sharepoint item that was changed. |
| sharepoint.createdBy.<identity>.id | An id of the identity that created the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.createdBy.<identity>.displayName | A display name of the identity that created the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.createdBy.<identity>.email | An email of the identity that created the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.id | An id of the identity that modified the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.displayName | A display name of the identity that modified the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.email | An email of the identity that modified the Sharepoint item that was changed. This attribute is not always available. |
| sharepoint.drive.id | The ID of the Sharepoint Drive that contains the item that was changed. |
| sharepoint.drive.name | The name of the Sharepoint Drive that contains the item that was changed. |
| sharepoint.site.id | The ID of the Sharepoint Site that contains the item that was changed. |
| sharepoint.site.url | The URL of the Sharepoint Site that contains the item that was changed. |
| sharepoint.ctag | The CTag of the Sharepoint item that was changed. |
| sharepoint.etag | The ETag of the Sharepoint item that was changed. |
| sharepoint.webUrl | The browser view url of the Sharepoint item that was changed. |
| sharepoint.permissions.read.groups | A comma-separated list of groups that have read permissions on the Sharepoint item that was changed. For each group, if an e-mail address is available in Sharepoint, it will be included. Additionally, the group principal, such as `mygroup@mytenant.onmicrosoft.com`, is included. |
| sharepoint.permissions.read.groups.ids | A comma-separated list of group IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.users | A comma-separated list of users that have read permissions on the Sharepoint item that was changed. For each user, if an e-mail address is available in Sharepoint, it will be included. Additionally, the user principal, such as `johndoe@mytenant.onmicrosoft.com`, is included. |
| sharepoint.permissions.read.users.ids | A comma-separated list of Microsoft365 user IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.siteusers | A comma-separated list of Sharepoint site user emails that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.siteusers.ids | A comma-separated list of Sharepoint site user IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.sitegroups.ids | A comma-separated list of Sharepoint site group IDs that have read permissions on the Sharepoint item. |
| filename | The name of the Sharepoint item that was changed. This attribute is not available for ‘Deleted’ changes. |
| path | The path of the Sharepoint item that was changed. This is the path relative to the root of the Document Library. |
| mime.type | The MIME type of the Sharepoint item that was changed. This attribute is only available for ‘File’ items. |
| hash.quickxor | The QuickXor hash of the Sharepoint item that was changed. This attribute is not always available. |
| hash.sha256 | The SHA-256 hash of the Sharepoint item that was changed. This attribute is not always available. |
| hash.sha1 | The SHA-1 hash of the Sharepoint item that was changed. This attribute is not always available. |
| hash.crc32 | The CRC32 hash of the Sharepoint item that was changed. This attribute is not always available. |

## Use Cases Involving Other Components

|  |
| --- |
| Perform Change Data Capture on a Sharepoint Document Library, retrieving all data in the Document Library, including permissions, in order to keep a destination system in sync with Sharepoint. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.FetchSharepointFile](fetchsharepointfile.md)

---
title: CEFReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/cefreader.md
section: Loading & Unloading Data
---

# CEFReader

## Description

Parses CEF (Common Event Format) events, returning each row as a record. This reader allows for inferring a schema based on the first event in the FlowFile or providing an explicit schema for interpreting the values.

## Tags

cef, parser, reader, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Infer Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Accept empty extensions \* | accept-empty-extensions | false | * true * false | If set to true, empty extensions will be accepted and will be associated to a null value. |
| DateTime Locale \* | datetime-representation | en-US |  | The IETF BCP 47 representation of the Locale to be used when parsing date fields with long or short month names (e.g. may <en-US> vs. mai. <fr-FR>. The defaultvalue is generally safe. Only change if having issues parsing CEF messages |
| Inference Strategy \* | inference-strategy | custom-extensions-inferred | * Headers only * Headers and extensions * With custom extensions as strings * With custom extensions inferred | Defines the set of fields should be included in the schema and the way the fields are being interpreted. |
| Invalid Field | invalid-message-field |  |  | Used when a line in the FlowFile cannot be parsed by the CEF parser. If set, instead of failing to process the FlowFile, a record is being added with one field. This record contains one field with the name specified by the property and the raw message as value. |
| Raw Message Field | raw-message-field |  |  | If set the raw message will be added to the record using the property value as field name. This is not the same as the “rawEvent” extension field! |
| Schema Inference Cache | schema-inference-cache |  |  | Specifies a Schema Cache to use when inferring the schema. If not populated, the schema will be inferred each time. However, if a cache is specified, the cache will first be consulted and if the applicable schema can be found, it will be used instead of inferring the schema. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: CheckMetaAdsReportReadiness 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/checkmetaadsreportreadiness.md
section: Loading & Unloading Data
---

# CheckMetaAdsReportReadiness 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-meta-ads-processors-nar

## Description

Processor checking if the Meta Ads report is ready for download.

## Tags

Facebook, Meta, Meta Ads, report

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | Token required to request Meta Ads Marketing API. It must match pattern ‘Bearer <Access Token Value>’. |
| Meta Ads API Version | Version of Meta Ads API which is used for report generation. |
| Report ID | ID of the generated report. |
| Web Client Service Provider | Service providing client for REST request execution. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Error FlowFiles transferred when receiving error response from Meta Ads Marketing API or when an error occurred during response processing. |
| ready | Response FlowFiles transferred when receiving Job Completed response from Meta Ads Marketing API. |
| retry | Response FlowFiles transferred when report prepared by Meta Ads Marketing API is not yet ready to be downloaded. |

## Writes attributes

| Name | Description |
| --- | --- |
| meta.ads.report.status | Current state of the processed report. |

---
title: ChunkRecordText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/chunkrecordtext.md
section: Loading & Unloading Data
---

# ChunkRecordText 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-chunking-nar

## Description

Chunks text with options for recursively splitting by delimiters and max character length. The input text is expected to be in a record-oriented FlowFile that matches the configured Record Reader format.

## Tags

chunk, openflow, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Chunk Count Field Name | The field name in the record to write the total number of chunks created from the original record. |
| Chunk Delimiters | Specifies a comma-separated list of character sequences. Meta-characters n, r and t are automatically un-escaped. Delimiters are recursively applied in order to chunk the text. |
| Chunk Index Field Name | The field name in the record to write the chunk index. |
| Chunk Overlap | The max number of characters to include from preceding and subsequent chunks. |
| Chunking Strategy | Strategy to chunk text. ‘Recursive Delimiters’ will chunk text according to the recursive split by character algorithm. In this algorithm input text is split by the first delimiter and merged back into chunks that do not exceed the ‘Max Chunk Length’. Any splits that exceed ‘Max Chunk Length’ are then recursively split using the next delimiter. ‘Max Chunk Length’ will chunk text by creating chunks that are ‘Max Chunk Length’ in size. |
| Language | Language to use for parsing sentences. |
| Max Chunk Length | Maximum number of characters to include in output chunk. Setting this number too high can result in an out of memory error. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |
| Sentence Similarity Threshold | Threshold for determining if two sentences are similar enough to occupy the same chunk. A value of 1.0 indicates the sentences are identical. A value of 0.0 indicates the sentences are completely dissimilar. |
| Text Record Path | The record path to a text field in the record. |
| Trim Whitespace | Trim whitespace surrounding the output text chunk. |

## Relationships

| Name | Description |
| --- | --- |
| original | The input Flow File is routed to the original relationship. |
| success | Text chunks are routed to the success relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| chunk.strategy | Strategy used to chunk text. One of ‘Max Chunk Length’, ‘Recursive Delimiters’, ‘Sentence’, ‘Semantic’. |
| chunk.semantic.threshold | Threshold for determining if two sentences are similar enough to occupy the same chunk. This attribute is added only when the ‘Semantic’ chunking strategy is used. |
| chunk.language | Language used for parsing sentences. This attribute is added only when the ‘Sentence’ or ‘Semantic’ chunking strategy is used. |
| chunk.delimiters | Comma-separated list of delimiters used to chunk text. This attribute is added only when the ‘Recursive Delimiters’ chunking strategy is used. |
| chunk.max.chars | Maximum number of characters to include in each chunk. |

---
title: ChunkText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/chunktext.md
section: Loading & Unloading Data
---

# ChunkText 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-chunking-nar

## Description

Chunks text with options for recursively splitting by delimiters and max character length. Each chunk is given the following attributes: fragment.identifier, fragment.index, fragment.count, segment.original.filename; these attributes can then be used by the MergeContent processor in order to reconstitute the original FlowFile

## Tags

chunk, openflow, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Chunk Delimiters | Specifies a comma-separated list of character sequences. Meta-characters n, r and t are automatically un-escaped. Delimiters are recursively applied in order to chunk the text. |
| Chunk Overlap | The max number of characters to include from preceding and subsequent chunks. |
| Chunking Strategy | Strategy to chunk text. ‘Recursive Delimiters’ will chunk text according to the recursive split by character algorithm. In this algorithm input text is split by the first delimiter and merged back into chunks that do not exceed the ‘Max Chunk Length’. Any splits that exceed ‘Max Chunk Length’ are then recursively split using the next delimiter. ‘Max Chunk Length’ will chunk text by creating chunks that are ‘Max Chunk Length’ in size. |
| Language | Language to use for parsing sentences. |
| Max Chunk Length | Maximum number of characters to include in output chunk. Setting this number too high can result in an out of memory error. |
| Sentence Similarity Threshold | Threshold for determining if two sentences are similar enough to occupy the same chunk. A value of 1.0 indicates the sentences are identical. A value of 0.0 indicates the sentences are completely dissimilar. |
| Trim Whitespace | Trim whitespace surrounding the output text chunk. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If any error during parsing occurs, the input Flow File will be routed to the failure relationship. |
| original | The input Flow File is routed to the original relationship. |
| success | Text chunks are routed to the success relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| segment.original.filename | Original filename of the input Flow File. |
| fragment.identifier | ID of the parent Flow File used to generate each chunk. |
| fragment.index | Index of the current Flow File chunk, starting at 0. |
| fragment.count | The total count of Flow File chunks produced. |
| chunk.start.offsets | The chunk.start.offsets attribute is added only to the original incoming FlowFile. It is a comma-separated list of start offsets for each chunk that gets generated. For example, if the FlowFile is chunked into 3 child FlowFiles, it might have a value of `0,183,365` indicating that the first chunk starts at offset 0, the second chunk starts at offset 183, and the third chunk starts at offset 365. Offsets are based on the number of characters. |
| chunk.end.offsets | The chunk.end.offsets attribute is added only to the original incoming FlowFile. It is a comma-separated list of end offsets for each chunk that gets generated. For example, if the FlowFile is chunked into 3 child FlowFiles, it might have a value of `183,365,548` indicating that the first chunk ends at offset 183, the second chunk ends at offset 365, and the third chunk ends at offset 548. Offsets are based on the number of characters. |
| chunk.strategy | Strategy used to chunk text. One of ‘Max Chunk Length’, ‘Recursive Delimiters’, ‘Sentence’, ‘Semantic’. |
| chunk.semantic.threshold | Threshold for determining if two sentences are similar enough to occupy the same chunk. This attribute is added only when the ‘Semantic’ chunking strategy is used. |
| chunk.language | Language used for parsing sentences. This attribute is added only when the ‘Sentence’ or ‘Semantic’ chunking strategy is used. |
| chunk.delimiters | Comma-separated list of delimiters used to chunk text. This attribute is added only when the ‘Recursive Delimiters’ chunking strategy is used. |
| chunk.max.chars | Maximum number of characters to include in each chunk. |

---
title: CompressContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/compresscontent.md
section: Loading & Unloading Data
---

# CompressContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Compresses or decompresses the contents of FlowFiles using a user-specified compression algorithm and updates the mime.type attribute as appropriate. A common idiom is to precede CompressContent with IdentifyMimeType and configure Mode=’decompress’ AND Compression Format=’use mime.type attribute’. When used in this manner, the MIME type is automatically detected and the data is decompressed, if necessary. If decompression is unnecessary, the data is passed through to the ‘success’ relationship. This processor operates in a very memory efficient way so very large objects well beyond the heap size are generally fine to process.

## Tags

brotli, bzip2, compress, content, decompress, deflate, gzip, lz4-framed, lzma, snappy, snappy framed, snappy-hadoop, xz-lzma2, zstd

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Compression Format | The compression format to use. Valid values are: GZIP, Deflate, ZSTD, BZIP2, XZ-LZMA2, LZMA, Brotli, Snappy, Snappy Hadoop, Snappy Framed, and LZ4-Framed |
| Compression Level | The compression level to use; this is valid only when using gzip, deflate or xz-lzma2 compression. A lower value results in faster processing but less compression; a value of 0 indicates no (that is, simple archiving) for gzip or minimal for xz-lzma2 compression. Higher levels can mean much larger memory usage such as the case with levels 7-9 for xz-lzma/2 so be careful relative to heap size. |
| Mode | Indicates whether the processor should compress content or decompress content. Must be either ‘compress’ or ‘decompress’ |
| Update Filename | If true, will remove the filename extension when decompressing data (only if the extension indicates the appropriate compression format) and add the appropriate extension when compressing data |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles will be transferred to the failure relationship if they fail to compress/decompress |
| success | FlowFiles will be transferred to the success relationship after successfully being compressed or decompressed |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | If the Mode property is set to compress, the appropriate MIME Type is set. If the Mode property is set to decompress and the file is successfully decompressed, this attribute is removed, as the MIME Type is no longer known. |

## Use cases

|  |
| --- |
| Compress the contents of a FlowFile |
| Decompress the contents of a FlowFile |

## Use Cases Involving Other Components

|  |
| --- |
| Check whether or not a FlowFile is compressed and if so, decompress it. |

---
title: Configure other authentication methods for Openflow Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/authentication.md
section: Loading & Unloading Data
---

# Configure other authentication methods for Openflow Connector for Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to configure other authentication methods for the
Openflow Connector for Kafka. The connector supports multiple authentication
mechanisms beyond basic SASL authentication.

> **Note:**
>
> Basic SASL authentication is configured through parameter contexts as described in [Set up the Openflow Connector for Kafka](setup.md).
> This page covers other authentication methods that require additional service configuration.

## Supported Authentication Methods

The Openflow Connector for Kafka supports the following authentication mechanisms:

* SASL with the following SASL mechanisms (configured via parameter contexts):

  + PLAIN
  + SCRAM-SHA-256
  + SCRAM-SHA-512
* SASL with AWS MSK IAM (extra configuration required via services)
* mTLS (extra configuration required via services)

## Configuring mTLS Authentication

mTLS (mutual Transport Layer Security) authentication requires both the client
and server to present certificates for mutual authentication.

### Prerequisites

Before configuring mTLS authentication, ensure you have:

1. Generated and configured the necessary certificates for both the connector and the Kafka broker
2. Created a keystore containing the connector’s private key and certificate
3. (Optional) Created a truststore containing the Kafka broker certificate or a certificate in the certification chain.
   This step is only required if the broker certificate is not signed by a trusted Certificate Authority (CA).
4. The supported keystore/truststore formats are PKCS12, JKS, and BCFKS

### Step 1: Configure SSL Context Service

1. From the NiFi canvas, access the Controller Services configuration:

   * Double click on the connector’s processing group
   * Right-click on the canvas and select Controller Services
2. Add a new StandardSSLContextService.

   * Click the + to add a new controller service.
   * Select StandardSSLContextService from the list.
   * Click Add.
3. Configure the SSL Context Service properties:

   | Property | Value |
   | --- | --- |
   | Keystore Filename | Full path to your keystore file (e.g., `/path/to/client-keystore.p12`), or Asset reference |
   | Keystore Password | Password for the keystore |
   | Keystore Type | Keystore format (`PKCS12`, `JKS`, or `BCFKS`) |
   | Key Password | Password for the private key (if the key is encrypted) |
   | Truststore Filename | Full path to your truststore file (e.g., `/path/to/client-truststore.p12`), or Asset reference |
   | Truststore Password | Password for the truststore |
   | Truststore Type | Truststore format (`PKCS12`, `JKS`, or `BCFKS`) |
4. Enable the SSL Context Service:

   * Click Enable for the service.
   * Confirm that the service status shows as Enabled.

### Step 2: Configure Kafka3Connection Service

1. In the same Controller Services tab, locate the Kafka3Connection service.
2. Configure the following properties:

   | Property | Value |
   | --- | --- |
   | Security Protocol | `SSL` |
   | SSL Context Service | Select the SSL Context Service you created in Step 1 |
3. Keep all other [Kafka3Connection service](../../controllers/kafka3connectionservice.md) settings unchanged
4. Verify the Kafka3Connection service:

   * Click Verify for the service.
   * Confirm that the service status shows as Verified.

## Configuring AWS MSK IAM Authentication

AWS MSK IAM authentication allows you to use AWS Identity and Access Management
(IAM) to authenticate to Amazon Managed Streaming for Apache Kafka (MSK).

### Prerequisites

1. Your Kafka cluster must be Amazon MSK with IAM authentication enabled.
2. You need to provide IAM credentials in Openflow with BYOC (bring your own cloud) configurations, deployed in your cloud.
3. The IAM role or user must have the necessary MSK permissions.

### Step 1: Create AmazonMSKConnectionService

1. From the NiFi canvas, access the Controller Services configuration:

   * Double click on the connector’s processing group
   * Right-click on the canvas and select Controller Services
2. Add a new [AmazonMSKConnectionService](../../controllers/amazonmskconnectionservice.md).

   * Click + to add a new controller service.
   * Select AmazonMSKConnectionService from the list.
   * Click Add
3. Configure the AmazonMSKConnectionService properties:

   | Property | Value |
   | --- | --- |
   | SASL Mechanism | `AWS_MSK_IAM` |
   | Security Protocol | `#{Kafka Security Protocol}` |
   | Bootstrap Servers | `#{Kafka Bootstrap Servers}` |
4. Verify the AmazonMSKConnectionService:

   * Click Verify for the service
   * Confirm that the service status shows as Verified

### Step 2: Configure ConsumeKafka Processor

1. In your Kafka connector flow, locate the ConsumeKafka processor
2. Configure the processor to use the new connection service:

   * Set the Kafka Connection Service property to the AmazonMSKConnectionService you created in Step 1: Create AmazonMSKConnectionService.

### Step 3: Remove Old Kafka Connection Service

1. In the Controller Services tab, locate the old Kafka3Connection service.
2. Disable and remove the old service:

   * Click Disable for the old service.
   * Once disabled, click Delete to remove the old service.

---
title: ConfluentEncodedSchemaReferenceReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/confluentencodedschemareferencereader.md
section: Loading & Unloading Data
---

# ConfluentEncodedSchemaReferenceReader

## Description

Reads Schema Identifier according to Confluent encoding as a header consisting of a byte marker and an integer represented as four bytes

## Tags

avro, confluent, kafka, registry, schema

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ConfluentEncodedSchemaReferenceWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/confluentencodedschemareferencewriter.md
section: Loading & Unloading Data
---

# ConfluentEncodedSchemaReferenceWriter

## Description

Writes Schema Identifier according to Confluent encoding as a header consisting of a byte marker and an integer represented as four bytes

## Tags

avro, confluent, kafka, registry, schema

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ConfluentProtobufMessageNameResolver
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/confluentprotobufmessagenameresolver.md
section: Loading & Unloading Data
---

# ConfluentProtobufMessageNameResolver

## Description

Resolves Protobuf message names from Confluent Schema Registry wire format by decoding message indexes and looking up the fully qualified name in the schema definition For Confluent wire format reference see: <https://docs.confluent.io/platform/current/schema-registry/fundamentals/serdes-develop/index.html#wire-format>

## Tags

confluent, message, name, protobuf, registry, resolver, schema

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ConfluentSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/confluentschemaregistry.md
section: Loading & Unloading Data
---

# ConfluentSchemaRegistry

## Description

Provides a Schema Registry that interacts with the Confluent Schema Registry so that those Schemas that are stored in the Confluent Schema Registry can be used in NiFi. The Confluent Schema Registry has a notion of a “subject” for schemas, which is their terminology for a schema name. When a Schema is looked up by name by this registry, it will find a Schema in the Confluent Schema Registry with that subject.

## Tags

avro, confluent, kafka, registry, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Authentication Type | Authentication Type | NONE | * BASIC * NONE | HTTP Client Authentication Type for Confluent Schema Registry |
| Cache Expiration \* | Cache Expiration | 1 hour |  | Specifies how long a Schema that is cached should remain in the cache. Once this time period elapses, a cached version of a schema will no longer be used, and the service will have to communicate with the Schema Registry again in order to obtain the schema. |
| Cache Size \* | Cache Size | 1000 |  | Specifies how many Schemas should be cached from the Schema Registry |
| Communications Timeout \* | Communications Timeout | 30 secs |  | Specifies how long to wait to receive data from the Schema Registry before considering the communications a failure |
| Password | Password |  |  | Password for authentication to Confluent Schema Registry |
| SSL Context Service | SSL Context Service |  |  | Specifies the SSL Context Service to use for interacting with the Confluent Schema Registry |
| Schema Registry URLs \* | Schema Registry URLs | <http://localhost:8081> |  | A comma-separated list of URLs of the Schema Registry to interact with |
| Username | Username |  |  | Username for authentication to Confluent Schema Registry |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ConnectWebSocket 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/connectwebsocket.md
section: Loading & Unloading Data
---

# ConnectWebSocket 2025.10.9.21

## Bundle

org.apache.nifi | nifi-websocket-processors-nar

## Description

Acts as a WebSocket client endpoint to interact with a remote WebSocket server. FlowFiles are transferred to downstream relationships according to received message types as WebSocket client configured with this processor receives messages from remote WebSocket server. If a new flowfile is passed to the processor, the previous sessions will be closed and any data being sent will be aborted.

## Tags

WebSocket, consume, listen, subscribe

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| websocket-client-controller-service | A WebSocket CLIENT Controller Service which can connect to a WebSocket server. |
| websocket-client-id | The client ID to identify WebSocket session. It should be unique within the WebSocket Client Controller Service. Otherwise, it throws WebSocketConfigurationException when it gets started. |

## Relationships

| Name | Description |
| --- | --- |
| binary message | The WebSocket binary message output |
| connected | The WebSocket session is established |
| disconnected | The WebSocket session is disconnected |
| failure | FlowFile holding connection configuration attributes (like URL or HTTP headers) in case of connection failure |
| success | FlowFile holding connection configuration attributes (like URL or HTTP headers) in case of successful connection |
| text message | The WebSocket text message output |

## Writes attributes

| Name | Description |
| --- | --- |
| websocket.controller.service.id | WebSocket Controller Service id. |
| websocket.session.id | Established WebSocket session id. |
| websocket.endpoint.id | WebSocket endpoint id. |
| websocket.local.address | WebSocket client address. |
| websocket.remote.address | WebSocket server address. |
| websocket.message.type | TEXT or BINARY. |

---
title: ConsumeAMQP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeamqp.md
section: Loading & Unloading Data
---

# ConsumeAMQP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-amqp-nar

## Description

Consumes AMQP Messages from an AMQP Broker using the AMQP 0.9.1 protocol. Each message that is received from the AMQP Broker will be emitted as its own FlowFile to the ‘success’ relationship.

## Tags

amqp, consume, get, message, rabbit, receive

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AMQP Version | AMQP Version. Currently only supports AMQP v0.9.1. |
| Auto-Acknowledge Messages | If false (Non-Auto-Acknowledge), the messages will be acknowledged by the processor after transferring the FlowFiles to success and committing the NiFi session. Non-Auto-Acknowledge mode provides ‘at-least-once’ delivery semantics. If true (Auto-Acknowledge), messages that are delivered to the AMQP Client will be auto-acknowledged by the AMQP Broker just after sending them out. This generally will provide better throughput but will also result in messages being lost upon restart/crash of the AMQP Broker, NiFi or the processor. Auto-Acknowledge mode provides ‘at-most-once’ delivery semantics and it is recommended only if loosing messages is acceptable. |
| Batch Size | The maximum number of messages that should be processed in a single session. Once this many messages have been received (or once no more messages are readily available), the messages received will be transferred to the ‘success’ relationship and the messages will be acknowledged to the AMQP Broker. Setting this value to a larger number could result in better performance, particularly for very small messages, but can also result in more messages being duplicated upon sudden restart of NiFi. |
| Brokers | A comma-separated list of known AMQP Brokers in the format <host>:<port> (e.g., localhost:5672). If this is set, Host Name and Port are ignored. Only include hosts from the same AMQP cluster. |
| Client Certificate Authentication Enabled | Authenticate using the SSL certificate rather than user name/password. |
| Header Key Prefix | Text to be prefixed to header keys as the are added to the FlowFile attributes. Processor will append ‘.’ to the value of this property |
| Header Output Format | Defines how to output headers from the received message |
| Header Separator | The character that is used to separate key-value for header in String. The value must be only one character. |
| Host Name | Network address of AMQP broker (e.g., localhost). If Brokers is set, then this property is ignored. |
| Max Inbound Message Body Size | Maximum body size of inbound (received) messages. |
| Password | Password used for authentication and authorization. |
| Port | Numeric value identifying Port of AMQP broker (e.g., 5671). If Brokers is set, then this property is ignored. |
| Prefetch Count | The maximum number of unacknowledged messages for the consumer. If consumer has this number of unacknowledged messages, AMQP broker will no longer send new messages until consumer acknowledges some of the messages already delivered to it. Allowed values: from 0 to 65535.0 means no limit |
| Queue | The name of the existing AMQP Queue from which messages will be consumed. Usually pre-defined by AMQP administrator. |
| Remove Curly Braces | If true Remove Curly Braces, Curly Braces in the header will be automatically remove. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Username | Username used for authentication and authorization. |
| Virtual Host | Virtual Host name which segregates AMQP system for enhanced security. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received from the AMQP queue are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| amqp$appId | The App ID field from the AMQP Message |
| amqp$contentEncoding | The Content Encoding reported by the AMQP Message |
| amqp$contentType | The Content Type reported by the AMQP Message |
| amqp$headers | The headers present on the AMQP Message. Added only if processor is configured to output this attribute. |
| <Header Key Prefix>.<attribute> | Each message header will be inserted with this attribute name, if processor is configured to output headers as attribute |
| amqp$deliveryMode | The numeric indicator for the Message’s Delivery Mode |
| amqp$priority | The Message priority |
| amqp$correlationId | The Message’s Correlation ID |
| amqp$replyTo | The value of the Message’s Reply-To field |
| amqp$expiration | The Message Expiration |
| amqp$messageId | The unique ID of the Message |
| amqp$timestamp | The timestamp of the Message, as the number of milliseconds since epoch |
| amqp$type | The type of message |
| amqp$userId | The ID of the user |
| amqp$clusterId | The ID of the AMQP Cluster |
| amqp$routingKey | The routingKey of the AMQP Message |
| amqp$exchange | The exchange from which AMQP Message was received |

---
title: ConsumeAzureEventHub 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeazureeventhub.md
section: Loading & Unloading Data
---

# ConsumeAzureEventHub 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Receives messages from Microsoft Azure Event Hubs with checkpointing to ensure consistent event processing. Checkpoint tracking avoids consuming a message multiple times and enables reliable resumption of processing in the event of intermittent network failures. Checkpoint tracking requires external storage and provides the preferred approach to consuming messages from Azure Event Hubs. In clustered environment, ConsumeAzureEventHub processor instances form a consumer group and the messages are distributed among the cluster nodes (each message is processed on one cluster node only).

## Tags

azure, cloud, eventhub, events, microsoft, streaming, streams

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of messages to process within a NiFi session. This parameter affects throughput and consistency. NiFi commits its session and Event Hubs checkpoints after processing this number of messages. If NiFi session is committed, but fails to create an Event Hubs checkpoint, then it is possible that the same messages will be received again. The higher number, the higher throughput, but possibly less consistent. |
| Checkpoint Strategy | Specifies which strategy to use for storing and retrieving partition ownership and checkpoint information for each partition. |
| Consumer Group | The name of the consumer group to use. |
| Event Hub Name | The name of the event hub to pull messages from. |
| Event Hub Namespace | The namespace that the Azure Event Hubs is assigned to. This is generally equal to <Event Hub Names>-ns. |
| Initial Offset | Specify where to start receiving messages if offset is not yet stored in the checkpoint store. |
| Message Receive Timeout | The amount of time this consumer should wait to receive the Batch Size before returning. |
| Prefetch Count |  |
| Record Reader | The Record Reader to use for reading received messages. The event hub name can be referred by Expression Language ‘${eventhub.name}’ to access a schema. |
| Record Writer | The Record Writer to use for serializing Records to an output FlowFile. The event hub name can be referred by Expression Language ‘${eventhub.name}’ to access a schema. If not specified, each message will create a FlowFile. |
| Service Bus Endpoint | To support namespaces not in the default windows.net domain. |
| Shared Access Policy Key | The key of the shared access policy. Either the primary or the secondary key can be used. |
| Shared Access Policy Name | The name of the shared access policy. This policy must have Listen claims. |
| Storage Account Key | The Azure Storage account key to store event hub consumer group state. |
| Storage Account Name | Name of the Azure Storage account to store event hub consumer group state. |
| Storage Container Name | Name of the Azure Storage container to store the event hub consumer group state. If not specified, event hub name is used. |
| Storage SAS Token | The Azure Storage SAS token to store Event Hub consumer group state. Always starts with a ? character. |
| Transport Type | Advanced Message Queuing Protocol Transport Type for communication with Azure Event Hubs |
| Use Azure Managed Identity | Choose whether or not to use the managed identity of Azure VM/VMSS |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Local state is used to store the client id. Cluster state is used to store partition ownership and checkpoint information when component state is configured as the checkpointing strategy. |
| CLUSTER | Local state is used to store the client id. Cluster state is used to store partition ownership and checkpoint information when component state is configured as the checkpointing strategy. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles received from Event Hub. |

## Writes attributes

| Name | Description |
| --- | --- |
| eventhub.enqueued.timestamp | The time (in milliseconds since epoch, UTC) at which the message was enqueued in the event hub |
| eventhub.offset | The offset into the partition at which the message was stored |
| eventhub.sequence | The sequence number associated with the message |
| eventhub.name | The name of the event hub from which the message was pulled |
| eventhub.partition | The name of the partition from which the message was pulled |
| eventhub.property.\* | The application properties of this message. IE: ‘application’ would be ‘eventhub.property.application’ |

---
title: ConsumeBoxEnterpriseEvents 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeboxenterpriseevents.md
section: Loading & Unloading Data
---

# ConsumeBoxEnterpriseEvents 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Consumes Enterprise Events from Box admin_logs_streaming Stream Type. The content of the events is sent to the ‘success’ relationship as a JSON array. The last known position of the Box stream is stored in the processor state and is used to resume the stream from the last known position when the processor is restarted.

## Tags

box, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Event Types | A comma separated list of Enterprise Events to consume. If not set, all Events are consumed. See Additional Details for more information. |
| Start Event Position | What position to consume the Events from. |
| Start Offset | The offset to start consuming the Events from. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The last known position of the Box Event stream is stored in the processor state and is used to resume the stream from the last known position when the processor is restarted. |

## Relationships

| Name | Description |
| --- | --- |
| success | Events received successfully will be sent out this relationship. |

## See also

* [org.apache.nifi.processors.box.ConsumeBoxEvents](consumeboxevents.md)
* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: ConsumeBoxEvents 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeboxevents.md
section: Loading & Unloading Data
---

# ConsumeBoxEvents 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Consumes all events from Box. This processor can be used to capture events such as uploads, modifications, deletions, etc. The content of the events is sent to the ‘success’ relationship as a JSON array. Events can be dropped in case of NiFi restart or if the queue capacity is exceeded. The last known position of the Box stream is stored in the processor state and is used to resume the stream from the last known position when the processor is restarted.

## Tags

box, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Queue Capacity | The maximum size of the internal queue used to buffer events being transferred from the underlying stream to the processor. Setting this value higher allows more messages to be buffered in memory during surges of incoming messages, but increases the total memory used by the processor during these surges. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The last known position of the Box stream is stored in the processor state and is used to resume the stream from the last known position when the processor is restarted. |

## Relationships

| Name | Description |
| --- | --- |
| success | Events received successfully will be sent out this relationship. |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.PutBoxFile](putboxfile.md)

---
title: ConsumeElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeelasticsearch.md
section: Loading & Unloading Data
---

# ConsumeElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

A processor that repeatedly runs a paginated query against a field using a Range query to consume new Documents from an Elasticsearch index/query. The processor will retrieve multiple pages of results until either no more results are available or the Pagination Keep Alive expiration is reached, after which the Range query will automatically update the field constraint based on the last retrieved Document value.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, json, page, query, scroll, search

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Additional Filters | One or more query filters in JSON syntax, not Lucene syntax. Ex: [{“match”:{“somefield”:”somevalue”}}, {“match”:{“anotherfield”:”anothervalue”}}]. These filters wil be used as part of a Bool query’s filter. |
| Aggregation Results Format | Format of Aggregation output. |
| Aggregation Results Split | Output a flowfile containing all aggregations or one flowfile for each individual aggregation. |
| Aggregations | One or more query aggregations (or “aggs”), in JSON syntax. Ex: {“items”: {“terms”: {“field”: “product”, “size”: 10}}} |
| Client Service | An Elasticsearch client service to use for running queries. |
| Fields | Fields of indexed documents to be retrieved, in JSON syntax. Ex: [“user.id”, “http.response.\*”, {“field”: “@timestamp”, “format”: “epoch_millis”}] |
| Index | The name of the index to use. |
| Initial Value | The initial value to use for the query if the processor has not run previously. If the processor has run previously and stored a value in its state, this property will be ignored. If no value is provided, and the processor has not previously run, no Range query bounds will be used, i.e. all documents will be retrieved in the specified “Sort Order”. |
| Initial Value Date Format | If the “Range Query Field” is a Date field, convert the “Initial Value” to a date with this format. If not specified, Elasticsearch will use the date format provided by the “Range Query Field“‘s mapping. For valid syntax, see <https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-date-format.html> |
| Initial Value Date Time Zone | If the “Range Query Field” is a Date field, convert the “Initial Value” to UTC with this time zone. Valid values are ISO 8601 UTC offsets, such as “+01:00” or “-08:00”, and IANA time zone IDs, such as “Europe/London”. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output No Hits | Output a “hits” flowfile even if no hits found for query. If true, an empty “hits” flowfile will be output even if “aggregations” are output. |
| Pagination Keep Alive | Pagination “keep_alive” period. Period Elasticsearch will keep the scroll/pit cursor alive in between requests (this is not the time expected for all pages to be returned, but the maximum allowed time for requests between page retrievals). |
| Pagination Type | Pagination method to use. Not all types are available for all Elasticsearch versions, check the Elasticsearch docs to confirm which are applicable and recommended for your service. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Range Query Field | Field to be tracked as part of an Elasticsearch Range query using a “gt” bound match. This field must exist within the Elasticsearch document for it to be retrieved. |
| Script Fields | Fields to created using script evaluation at query runtime, in JSON syntax. Ex: {“test1”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* 2”}}, “test2”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* params.factor”, “params”: {“factor”: 2.0}}}} |
| Search Results Format | Format of Hits output. |
| Search Results Split | Output a flowfile containing all hits or one flowfile for each individual hit or one flowfile containing all hits from all paged responses. |
| Size | The maximum number of documents to retrieve in the query. If the query is paginated, this “size” applies to each page of the query, not the “size” of the entire result set. |
| Sort | Sort results by one or more fields, in JSON syntax. Ex: [{“price” : {“order” : “asc”, “mode” : “avg”}}, {“post_date” : {“format”: “strict_date_optional_time_nanos”}}] |
| Sort Order | The order in which to sort the “Range Query Field”. A “sort” clause for the “Range Query Field” field will be prepended to any provided “Sort” clauses. If a “sort” clause already exists for the “Range Query Field” field, it will not be updated. |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The pagination state (scrollId, searchAfter, pitId, hitCount, pageCount, pageExpirationTimestamp, trackingRangeValue) is retained in between invocations of this processor until the Scroll/PiT has expired (when the current time is later than the last query execution plus the Pagination Keep Alive interval). |

## Relationships

| Name | Description |
| --- | --- |
| aggregations | Aggregations are routed to this relationship. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| hits | Search hits are routed to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| page.number | The number of the page (request), starting from 1, in which the results were returned that are in the output flowfile |
| hit.count | The number of hits that are in the output flowfile |
| elasticsearch.query.error | The error message provided by Elasticsearch if there is an error querying the index. |

## See also

* [org.apache.nifi.processors.elasticsearch.PaginatedJsonQueryElasticsearch](paginatedjsonqueryelasticsearch.md)
* [org.apache.nifi.processors.elasticsearch.SearchElasticsearch](searchelasticsearch.md)

---
title: ConsumeGCPubSub 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumegcpubsub.md
section: Loading & Unloading Data
---

# ConsumeGCPubSub 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Consumes messages from the configured Google Cloud PubSub subscription. The ‘Batch Size’ property specified the maximum number of messages that will be pulled from the subscription in a single request. The ‘Processing Strategy’ property specifies if each message should be its own FlowFile or if messages should be grouped into a single FlowFile. Using the Demarcator strategy will provide best throughput when the format allows it. Using Record allows to convert data format as well as doing schema enforcement. Using the FlowFile strategy will generate one FlowFile per message and will have the message’s attributes as FlowFile attributes.

## Tags

consume, gcp, google, google-cloud, message, pubsub

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| Message Demarcator | Since the PubSub client receives messages in batches, this Processor has an option to output FlowFiles which contains all the messages in a single batch. This property allows you to provide a string (interpreted as UTF-8) to use for demarcating apart multiple messages. To enter special character such as ‘new line’ use CTRL+Enter or Shift+Enter depending on the OS. |
| Output Strategy | The format used to output the Kafka Record into a FlowFile Record. |
| Processing Strategy | Strategy for processing PubSub Records and writing serialized output to FlowFiles |
| Record Reader | The Record Reader to use for incoming messages |
| Record Writer | The Record Writer to use in order to serialize the outgoing FlowFiles |
| api-endpoint | Override the gRPC endpoint in the form of [host:port] |
| gcp-project-id | Google Cloud Project ID |
| gcp-pubsub-publish-batch-size | Indicates the number of messages the cloud service should bundle together in a batch. If not set and left empty, only one message will be used in a batch |
| gcp-pubsub-subscription | Name of the Google Cloud Pub/Sub Subscription |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Pub/Sub operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| gcp.pubsub.ackId | Acknowledgement Id of the consumed Google Cloud PubSub message |
| gcp.pubsub.messageSize | Serialized size of the consumed Google Cloud PubSub message |
| gcp.pubsub.attributesCount | Number of attributes the consumed PubSub message has, if any |
| gcp.pubsub.publishTime | Timestamp value when the message was published |
| gcp.pubsub.subscription | Name of the PubSub subscription |
| Dynamic Attributes | Other than the listed attributes, this processor may write zero or more attributes, if the original Google Cloud Publisher client added any attributes to the message while sending |

## See also

* [org.apache.nifi.processors.gcp.pubsub.PublishGCPubSub](publishgcpubsub.md)

---
title: ConsumeIMAP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeimap.md
section: Loading & Unloading Data
---

# ConsumeIMAP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-email-nar

## Description

Consumes messages from Email Server using IMAP protocol. The raw-bytes of each received email message are written as contents of the FlowFile

## Tags

Consume, Email, Get, Imap, Ingest, Ingress, Message

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authorization Mode | How to authorize sending email on the user’s behalf. |
| Connection Timeout | The amount of time to wait to connect to Email server |
| Delete Messages | Specify whether mail messages should be deleted after retrieval. |
| Fetch Size | Specify the maximum number of Messages to fetch per call to Email Server. |
| Folder | Email folder to retrieve messages from (e.g., INBOX) |
| Host Name | Network address of Email server (e.g., pop.gmail.com, imap.gmail.com . .) |
| Mark Messages as Read | Specify if messages should be marked as read after retrieval. |
| OAuth2 Access Token Provider | OAuth2 service that can provide access tokens. |
| Password | Password used for authentication and authorization with Email server. |
| Port | Numeric value identifying Port of Email server (e.g., 993) |
| Use SSL | Specifies if IMAP connection must be obtained via SSL encrypted connection (i.e., IMAPS) |
| User Name | User Name used for authentication and authorization with Email server. |

## Relationships

| Name | Description |
| --- | --- |
| success | All messages that are the are successfully received from Email server and converted to FlowFiles are routed to this relationship |

---
title: ConsumeJMS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumejms.md
section: Loading & Unloading Data
---

# ConsumeJMS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-jms-processors-nar

## Description

Consumes JMS Message of type BytesMessage, TextMessage, ObjectMessage, MapMessage or StreamMessage transforming its content to a FlowFile and transitioning it to ‘success’ relationship. JMS attributes such as headers and properties will be copied as FlowFile attributes. MapMessages will be transformed into JSONs and then into byte arrays. The other types will have their raw contents as byte array transferred into the flowfile.

## Tags

consume, get, jms, message, receive

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Acknowledgement Mode | The JMS Acknowledgement Mode. Using Auto Acknowledge can cause messages to be lost on restart of NiFi but may provide better performance than Client Acknowledge. |
| Connection Client ID | The client id to be set on the connection, if set. For durable non shared consumer this is mandatory, for all others it is optional, typically with shared consumers it is undesirable to be set. Please see JMS spec for further details |
| Connection Factory Service | The Controller Service that is used to obtain Connection Factory. Alternatively, the ‘JNDI \*’ or the ‘JMS \*’ properties can also be used to configure the Connection Factory. |
| Destination Name | The name of the JMS Destination. Usually provided by the administrator (e.g., ‘topic://myTopic’ or ‘myTopic’). |
| Destination Type | The type of the JMS Destination. Could be one of ‘QUEUE’ or ‘TOPIC’. Usually provided by the administrator. Defaults to ‘QUEUE’ |
| Durable subscription | If destination is Topic if present then make it the consumer durable. @see <https://jakarta.ee/specifications/platform/9/apidocs/jakarta/jms/session#createDurableConsumer-jakarta.jms>. Topic-java.lang. String- |
| Error Queue Name | The name of a JMS Queue where - if set - unprocessed messages will be routed. Usually provided by the administrator (e.g., ‘queue://myErrorQueue’ or ‘myErrorQueue’).Only applicable if ‘Destination Type’ is set to ‘QUEUE’ |
| Maximum Batch Size | The maximum number of messages to publish or consume in each invocation of the processor. |
| Message Selector | The JMS Message Selector to filter the messages that the processor will receive |
| Password | Password used for authentication and authorization. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Shared subscription | If destination is Topic if present then make it the consumer shared. @see <https://jakarta.ee/specifications/platform/9/apidocs/jakarta/jms/session#createSharedConsumer-jakarta.jms>. Topic-java.lang. String- |
| Subscription Name | The name of the subscription to use if destination is Topic and is shared or durable. |
| Timeout | How long to wait to consume a message from the remote broker before giving up. |
| User Name | User Name used for authentication and authorization. |
| broker | URI pointing to the network location of the JMS Message broker. Example for ActiveMQ: ‘<tcp://myhost:61616>’. Examples for IBM MQ: ‘myhost(1414)’ and ‘myhost01(1414),myhost02(1414)’. |
| cf | The fully qualified name of the JMS ConnectionFactory implementation class (eg. org.apache.activemq. ActiveMQConnectionFactory). |
| cflib | Path to the directory with additional resources (eg. JARs, configuration files etc.) to be added to the classpath (defined as a comma separated list of values). Such resources typically represent target JMS client libraries for the ConnectionFactory implementation. |
| character-set | The name of the character set to use to construct or interpret TextMessages |
| connection.factory.name | The name of the JNDI Object to lookup for the Connection Factory. |
| java.naming.factory.initial | The fully qualified class name of the JNDI Initial Context Factory Class (java.naming.factory.initial). |
| java.naming.provider.url | The URL of the JNDI Provider to use as the value for java.naming.provider.url. See additional details documentation for allowed URL schemes. |
| java.naming.security.credentials | The Credentials to use when authenticating with JNDI (java.naming.security.credentials). |
| java.naming.security.principal | The Principal to use when authenticating with JNDI (java.naming.security.principal). |
| naming.factory.libraries | Specifies jar files and/or directories to add to the ClassPath in order to load the JNDI / JMS client libraries. This should be a comma-separated list of files, directories, and/or URLs. If a directory is given, any files in that directory will be included, but subdirectories will not be included (i.e., it is not recursive). |
| output-strategy | The format used to output the JMS message into a FlowFile record. |
| record-reader | The Record Reader to use for parsing received JMS Messages into Records. |
| record-writer | The Record Writer to use for serializing Records before writing them to a FlowFile. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Client Library Location can reference resources over HTTP |

## Relationships

| Name | Description |
| --- | --- |
| parse.failure | If a message cannot be parsed using the configured Record Reader, the contents of the message will be routed to this Relationship as its own individual FlowFile. |
| success | All FlowFiles that are received from the JMS Destination are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| jms_deliveryMode | The JMSDeliveryMode from the message header. |
| jms_expiration | The JMSExpiration from the message header. |
| jms_priority | The JMSPriority from the message header. |
| jms_redelivered | The JMSRedelivered from the message header. |
| jms_timestamp | The JMSTimestamp from the message header. |
| jms_correlationId | The JMSCorrelationID from the message header. |
| jms_messageId | The JMSMessageID from the message header. |
| jms_type | The JMSType from the message header. |
| jms_replyTo | The JMSReplyTo from the message header. |
| jms_destination | The JMSDestination from the message header. |
| jms.messagetype | The JMS message type, can be TextMessage, BytesMessage, ObjectMessage, MapMessage or StreamMessage). |
| other attributes | Each message property is written to an attribute. |

## See also

* [org.apache.nifi.jms.processors.PublishJMS](publishjms.md)

---
title: ConsumeKafka 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumekafka.md
section: Loading & Unloading Data
---

# ConsumeKafka 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-kafka-nar

## Description

Consumes messages from Apache Kafka Consumer API. The complementary NiFi processor for sending messages is PublishKafka. The Processor supports consumption of Kafka messages, optionally interpreted as NiFi records. Please note that, at this time (in read record mode), the Processor assumes that all records that are retrieved from a given partition have the same schema. For this mode, if any of the Kafka messages are pulled but cannot be parsed or written with the configured Record Reader or Record Writer, the contents of the message will be written to a separate FlowFile, and that FlowFile will be transferred to the ‘parse.failure’ relationship. Otherwise, each FlowFile is sent to the ‘success’ relationship and may contain many individual messages within the single FlowFile. A ‘record.count’ attribute is added to indicate how many messages are contained in the FlowFile. No two Kafka messages will be placed into the same FlowFile if they have different schemas, or if they have different values for a message header that is included by the <Headers to Add as Attributes> property.

## Tags

avro, consume, csv, get, ingest, ingress, json, kafka, openflow, pubsub, record, topic

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Commit Offsets | Specifies whether this Processor should commit the offsets to Kafka after receiving messages. Typically, this value should be set to true so that messages that are received are not duplicated. However, in certain scenarios, we may want to avoid committing the offsets, that the data can be processed and later acknowledged by PublishKafka in order to provide Exactly Once semantics. |
| Content Field | Specifies under what field of the record the content will be added. If not set, the content will be at the root of the record |
| Group ID | Kafka Consumer Group Identifier corresponding to Kafka group.id property |
| Header Encoding | Character encoding applied when reading Kafka Record Header values and writing FlowFile attributes |
| Header Name Pattern | Regular Expression Pattern applied to Kafka Record Header Names for selecting Header Values to be written as FlowFile attributes |
| Headers Field Parent | Specifies under what field of the record the headers field will be added. If not set, the headers field will be at the root of the record |
| Kafka Connection Service | Provides connections to Kafka Broker for publishing Kafka Records |
| Key Attribute Encoding | Encoding for value of configured FlowFile attribute containing Kafka Record Key. |
| Key Field Parent | Specifies under what field of the record the key field will be added. If not set, the key field will be at the root of the record |
| Key Format | Specifies how to represent the Kafka Record Key in the output FlowFile |
| Key Record Reader | The Record Reader to use for parsing the Kafka Record Key into a Record |
| Max Uncommitted Time | Specifies the maximum amount of time that the Processor can consume from Kafka before it must transfer FlowFiles on through the flow and commit the offsets to Kafka (if appropriate). A larger time period can result in longer latency |
| Message Demarcator | Since KafkaConsumer receives messages in batches, this Processor has an option to output FlowFiles which contains all Kafka messages in a single batch for a given topic and partition and this property allows you to provide a string (interpreted as UTF-8) to use for demarcating apart multiple Kafka messages. This is an optional property and if not provided each Kafka message received will result in a single FlowFile which time it is triggered. To enter special character such as ‘new line’ use CTRL+Enter or Shift+Enter depending on the OS |
| Metadata Field | Specifies under what field of the record the metadata will be added. If not set, the metadata will be at the root of the record |
| Metadata Received Timestamp Field | If specified a timestamp will be placed under the specified field in the metadata of record in the output FlowFile |
| Output Strategy | The format used to output the Kafka Record into a FlowFile Record. |
| Processing Strategy | Strategy for processing Kafka Records and writing serialized output to FlowFiles |
| Record Reader | The Record Reader to use for incoming Kafka messages |
| Record Writer | The Record Writer to use in order to serialize the outgoing FlowFiles |
| Separate By Key | When this property is enabled, two messages will only be added to the same FlowFile if both of the Kafka Messages have identical keys. |
| Topic Format | Specifies whether the Topics provided are a comma separated list of names or a single regular expression |
| Topics | The name or pattern of the Kafka Topics from which the Processor consumes Kafka Records. More than one can be supplied if comma separated. |
| auto.offset.reset | Automatic offset configuration applied when no previous consumer offset found corresponding to Kafka auto.offset.reset property |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles containing one or more serialized Kafka Records |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records received |
| mime.type | The MIME Type that is provided by the configured Record Writer |
| kafka.count | The number of messages written if more than one |
| kafka.key | The key of message if present and if single message. How the key is encoded depends on the value of the ‘Key Attribute Encoding’ property. |
| kafka.offset | The offset of the message in the partition of the topic. |
| kafka.timestamp | The timestamp of the message in the partition of the topic. |
| kafka.partition | The partition of the topic the message or message bundle is from |
| kafka.topic | The topic the message or message bundle is from |
| kafka.tombstone | Set to true if the consumed message is a tombstone message |

## See also

* [com.snowflake.openflow.runtime.processors.kafka.PublishKafka](publishkafka.md)

---
title: ConsumeKinesisStream 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumekinesisstream.md
section: Loading & Unloading Data
---

# ConsumeKinesisStream 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Reads data from the specified AWS Kinesis stream and outputs a FlowFile for every processed Record (raw) or a FlowFile for a batch of processed records if a Record Reader and Record Writer are configured. At-least-once delivery of all Kinesis Records within the Stream while the processor is running. AWS Kinesis Client Library can take several seconds to initialise before starting to fetch data. Uses DynamoDB for check pointing and CloudWatch (optional) for metrics. Ensure that the credentials provided have access to DynamoDB and CloudWatch (optional) along with Kinesis.

## Tags

amazon, aws, consume, kinesis, stream

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Amazon Kinesis Stream Name | The name of Kinesis Stream |
| Application Name | The Kinesis stream reader application name. |
| Checkpoint Interval | Interval between Kinesis checkpoints |
| Communications Timeout |  |
| DynamoDB Override | DynamoDB override to use non-AWS deployments |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Failover Timeout | Kinesis Client Library failover timeout |
| FlowFile Handling On Schema Difference | The strategy used when records in a Kinesis Stream change their schema in a single batch. |
| Graceful Shutdown Timeout | Kinesis Client Library graceful shutdown timeout |
| Initial Stream Position | Initial position to read Kinesis streams. |
| Output Strategy | The format used to output the Kinesis Record into a FlowFile Record. |
| Record Reader | The Record Reader to use for reading received messages. The Kinesis Stream name can be referred to by Expression Language ‘${kinesis.name}’ to access a schema. If Record Reader/Writer are not specified, each Kinesis Record will create a FlowFile. |
| Record Writer | The Record Writer to use for serializing Records to an output FlowFile. The Kinesis Stream name can be referred to by Expression Language ‘${kinesis.name}’ to access a schema. If Record Reader/Writer are not specified, each Kinesis Record will create a FlowFile. |
| Region |  |
| Report Metrics to CloudWatch | Whether to report Kinesis usage metrics to CloudWatch. |
| Retry Count | Number of times to retry a Kinesis operation (process record, checkpoint, shutdown) |
| Retry Wait | Interval between Kinesis operation retries (process record, checkpoint, shutdown) |
| Stream Position Timestamp | Timestamp position in stream from which to start reading Kinesis Records. Required if Initial position to read Kinesis streams. is AT_TIMESTAMP. Uses the Timestamp Format to parse value into a Date. |
| Timestamp Format | Format to use for parsing the Stream Position Timestamp into a Date and converting the Kinesis Record’s Approximate Arrival Timestamp into a FlowFile attribute. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| aws.kinesis.partition.key | Partition key of the (last) Kinesis Record read from the Shard |
| aws.kinesis.shard.id | Shard ID from which the Kinesis Record was read |
| aws.kinesis.sequence.number | The unique identifier of the (last) Kinesis Record within its Shard |
| aws.kinesis.approximate.arrival.timestamp | Approximate arrival timestamp of the (last) Kinesis Record read from the stream |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer (if configured) |
| record.count | Number of records written to the FlowFiles by the Record Writer (if configured) |
| record.error.message | This attribute provides on failure the error message encountered by the Record Reader or Record Writer (if configured) |

## See also

* [org.apache.nifi.processors.aws.kinesis.stream.PutKinesisStream](putkinesisstream.md)

---
title: ConsumeMQTT 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumemqtt.md
section: Loading & Unloading Data
---

# ConsumeMQTT 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mqtt-nar

## Description

Subscribes to a topic and receives messages from an MQTT broker

## Tags

IOT, MQTT, consume, listen, subscribe

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Broker URI | The URI(s) to use to connect to the MQTT broker (e.g., <tcp://localhost:1883>). The ‘tcp’, ‘ssl’, ‘ws’ and ‘wss’schemes are supported. In order to use ‘ssl’, the SSL Context Service property must be set. When a comma-separated URI list is set (e.g., <tcp://localhost:1883,tcp://localhost:1884>), the processor will use a round-robin algorithm to connect to the brokers on connection failure. |
| Client ID | MQTT client ID to use. If not set, a UUID will be generated. |
| Connection Timeout (seconds) | Maximum time interval the client will wait for the network connection to the MQTT server to be established. The default timeout is 30 seconds. A value of 0 disables timeout processing meaning the client will wait until the network connection is made successfully or fails. |
| Group ID | MQTT consumer group ID to use. If group ID not set, client will connect as individual consumer. |
| Keep Alive Interval (seconds) | Defines the maximum time interval between messages sent or received. It enables the client to detect if the server is no longer available, without having to wait for the TCP/IP timeout. The client will ensure that at least one message travels across the network within each keep alive period. In the absence of a data-related message during the time period, the client sends a very small “ping” message, which the server will acknowledge. A value of 0 disables keepalive processing in the client. |
| Last Will Message | The message to send as the client’s Last Will. |
| Last Will QoS Level | QoS level to be used when publishing the Last Will Message. |
| Last Will Retain | Whether to retain the client’s Last Will. |
| Last Will Topic | The topic to send the client’s Last Will to. |
| MQTT Specification Version | The MQTT specification version when connecting with the broker. See the allowable value descriptions for more details. |
| Max Queue Size | The MQTT messages are always being sent to subscribers on a topic regardless of how frequently the processor is scheduled to run. If the ‘Run Schedule’ is significantly behind the rate at which the messages are arriving to this processor, then a back up can occur in the internal queue of this processor. This property specifies the maximum number of messages this processor will hold in memory at one time in the internal queue. This data would be lost in case of a NiFi restart. |
| Password | Password to use when connecting to the broker |
| Quality of Service(QoS) | The Quality of Service (QoS) to receive the message with. Accepts values ‘0’, ‘1’ or ‘2’; ‘0’ for ‘at most once’, ‘1’ for ‘at least once’, ‘2’ for ‘exactly once’. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Session Expiry Interval | After this interval the broker will expire the client and clear the session state. |
| Session state | Whether to start a fresh or resume previous flows. See the allowable value descriptions for more details. |
| Topic Filter | The MQTT topic filter to designate the topics to subscribe to. |
| Username | Username to use when connecting to the broker |
| add-attributes-as-fields | If setting this property to true, default fields are going to be added in each record: _topic, _qos, _isDuplicate, _isRetained. |
| message-demarcator | With this property, you have an option to output FlowFiles which contains multiple messages. This property allows you to provide a string (interpreted as UTF-8) to use for demarcating apart multiple messages. This is an optional property ; if not provided, and if not defining a Record Reader/Writer, each message received will result in a single FlowFile. To enter special character such as ‘new line’ use CTRL+Enter or Shift+Enter depending on the OS. |
| record-reader | The Record Reader to use for parsing received MQTT Messages into Records. |
| record-writer | The Record Writer to use for serializing Records before writing them to a FlowFile. |

## Relationships

| Name | Description |
| --- | --- |
| Message | The MQTT message output |
| parse.failure | If a message cannot be parsed using the configured Record Reader, the contents of the message will be routed to this Relationship as its own individual FlowFile. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records received |
| mqtt.broker | MQTT broker that was the message source |
| mqtt.topic | MQTT topic on which message was received |
| mqtt.qos | The quality of service for this message. |
| mqtt.isDuplicate | Whether or not this message might be a duplicate of one which has already been received. |
| mqtt.isRetained | Whether or not this message was from a current publisher, or was “retained” by the server as the last message published on the topic. |

## See also

* [org.apache.nifi.processors.mqtt.PublishMQTT](publishmqtt.md)

---
title: ConsumePOP3 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumepop3.md
section: Loading & Unloading Data
---

# ConsumePOP3 2025.10.9.21

## Bundle

org.apache.nifi | nifi-email-nar

## Description

Consumes messages from Email Server using POP3 protocol. The raw-bytes of each received email message are written as contents of the FlowFile

## Tags

Consume, Email, Get, Ingest, Ingress, Message, POP3

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authorization Mode | How to authorize sending email on the user’s behalf. |
| Connection Timeout | The amount of time to wait to connect to Email server |
| Delete Messages | Specify whether mail messages should be deleted after retrieval. |
| Fetch Size | Specify the maximum number of Messages to fetch per call to Email Server. |
| Folder | Email folder to retrieve messages from (e.g., INBOX) |
| Host Name | Network address of Email server (e.g., pop.gmail.com, imap.gmail.com . .) |
| OAuth2 Access Token Provider | OAuth2 service that can provide access tokens. |
| Password | Password used for authentication and authorization with Email server. |
| Port | Numeric value identifying Port of Email server (e.g., 993) |
| User Name | User Name used for authentication and authorization with Email server. |

## Relationships

| Name | Description |
| --- | --- |
| success | All messages that are the are successfully received from Email server and converted to FlowFiles are routed to this relationship |

---
title: ConsumeSlack 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeslack.md
section: Loading & Unloading Data
---

# ConsumeSlack 2025.10.9.21

## Bundle

org.apache.nifi | nifi-slack-nar

## Description

Retrieves messages from one or more configured Slack channels. The messages are written out in JSON format. See Usage / Additional Details for more information about how to configure this Processor and enable it to retrieve messages from Slack.

## Tags

conversation, conversation.history, slack, social media, team, text, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating/authorizing the Slack request sent by NiFi. This may be either a User Token or a Bot Token. It must be granted the channels:history, groups:history, im:history, or mpim:history scope, depending on the type of conversation being used. |
| Batch Size | The maximum number of messages to retrieve in a single request to Slack. The entire response will be parsed into memory, so it is important that this be kept in mind when setting this value. |
| Channels | A comma-separated list of Slack Channels to Retrieve Messages From. Each element in the list may be either a Channel ID, such as C0L9VCD47, or (for public channels only) the name of a channel, prefixed with a # sign, such as #general. If any channel name is provided instead,instead of an ID, the Access Token provided must be granted the channels:read scope in order to resolve the Channel ID. See the Processor’s Additional Details for information on how to find a Channel ID. |
| Include Message Blocks | Specifies whether or not the output JSON should include the value of the ‘blocks’ field for each Slack Message. This field includes information such as individual parts of a message that are formatted using rich text. This may be useful, for instance, for parsing. However, it often accounts for a significant portion of the data and as such may be set to null when it is not useful to you. |
| Include Null Fields | Specifies whether or not fields that have null values should be included in the output JSON. If true, any field in a Slack Message that has a null value will be included in the JSON with a value of null. If false, the key omitted from the output JSON entirely. Omitting null values results in smaller messages that are generally more efficient to process, but including the values may provide a better understanding of the format, especially for schema inference. |
| Reply Monitor Frequency | After consuming all messages in a given channel, this Processor will periodically poll all “threaded messages”, aka Replies, whose timestamp falls between now and the amount of time specified by the <Reply Monitor Window> property. This property determines how frequently those messages are polled. Setting the value to a shorter duration may result in replies to messages being captured more quickly, providing a lower latency. However, it will also result in additional resource use and could trigger Rate Limiting to occur. |
| Reply Monitor Window | After consuming all messages in a given channel, this Processor will periodically poll all “threaded messages”, aka Replies, whose timestamp is between now and this amount of time in the past in order to check for any new replies. Setting this value to a larger value may result in additional resource use and may result in Rate Limiting. However, if a user replies to an old thread that was started outside of this window, the reply may not be captured. |
| Resolve Usernames | Specifies whether or not User IDs should be resolved to usernames. By default, Slack Messages provide the ID of the user that sends a message, such as U0123456789, but not the username, such as NiFiUser. The username may be resolved, but it may require additional calls to the Slack API and requires that the Token used be granted the users:read scope. If set to true, usernames will be resolved with a best-effort policy: if a username cannot be obtained, it will be skipped over. Also, note that when a username is obtained, the Message’s <username> field is populated, and the <text> field is updated such that any mention will be output such as “Hi @user” instead of “Hi <@U1234567>”. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Maintains a mapping of Slack Channel IDs to the timestamp of the last message that was retrieved for that channel. This allows the processor to only retrieve messages that have been posted since the last time the processor was run. This state is stored in the cluster so that if the Primary Node changes, the new node will pick up where the previous node left off. |

## Relationships

| Name | Description |
| --- | --- |
| success | Slack messages that are successfully received will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| slack.channel.id | The ID of the Slack Channel from which the messages were retrieved |
| slack.message.count | The number of slack messages that are included in the FlowFile |
| mime.type | Set to application/json, as the output will always be in JSON format |

## See also

* [org.apache.nifi.processors.slack.ListenSlack](listenslack.md)

---
title: ConsumeSlackConversation 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeslackconversation.md
section: Loading & Unloading Data
---

# ConsumeSlackConversation 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-slack-processors-nar

## Description

Retrieves messages from Slack conversations available to the App. New conversations are fetched based on the ‘Reply Monitor Frequency’. Ingested messages are written out in JSON format. See Usage / Additional Details for more information about how to configure this Processor and enable it to retrieve messages from Slack.

## Tags

conversation, conversation.history, slack, social media, team, text, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating/authorizing the Slack request sent by NiFi. This may be either a User Token or a Bot Token. It must be granted the channels:history, groups:history, im:history, or mpim:history scope, depending on the type of conversation being used. |
| Batch Size | The maximum number of messages to retrieve in a single request to Slack. The entire response will be parsed into memory, so it is important that this be kept in mind when setting this value. |
| Rate Limiter Service | Slack Rate Limiter Service to coordinate rate limiting across processors |
| Reply Monitor Frequency | After consuming all messages in a given channel, this Processor will periodically poll all “threaded messages”, aka Replies, whose timestamp falls between now and the amount of time specified by the <Reply Monitor Window> property. This property determines how frequently those messages are polled. Setting the value to a shorter duration may result in replies to messages being captured more quickly, providing a lower latency. However, it will also result in additional resource use and could trigger Rate Limiting to occur. This also determines how frequently newly added channels are checked. |
| Reply Monitor Window | After consuming all messages in a given channel, this Processor will periodically poll all “threaded messages”, aka Replies, whose timestamp is between now and this amount of time in the past in order to check for any new replies. Setting this value to a larger value may result in additional resource use and may result in Rate Limiting. However, if a user replies to an old thread that was started outside of this window, the reply may not be captured. |
| Resolve Usernames | Specifies whether or not User IDs should be resolved to usernames. By default, Slack Messages provide the ID of the user that sends a message, such as U0123456789, but not the username, such as NiFiUser. The username may be resolved, but it may require additional calls to the Slack API and requires that the Token used be granted the users:read scope. If set to true, usernames will be resolved with a best-effort policy: if a username cannot be obtained, it will be skipped over. Also, note that when a username is obtained, the Message’s <username> field is populated, and the <text> field is updated such that any mention will be output such as “Hi @user” instead of “Hi <@U1234567>”. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Maintains a mapping of Slack Channel IDs to the timestamp of the last message that was retrieved for that channel. This allows the processor to only retrieve messages that have been posted since the last time the processor was run. This state is stored in the cluster so that if the Primary Node changes, the new node will pick up where the previous node left off. |

## Relationships

| Name | Description |
| --- | --- |
| success | Slack messages that are successfully received will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| slack.channel.id | The ID of the Slack Channel from which the messages were retrieved |
| slack.message.count | The number of slack messages that are included in the FlowFile |
| mime.type | Set to application/json, as the output will always be in JSON format |

---
title: ConsumeSlackHistory 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumeslackhistory.md
section: Loading & Unloading Data
---

# ConsumeSlackHistory 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-slack-processors-nar

## Description

Fetches historical messages from all Slack channels available to the App. This processor queries Slack’s conversations.history and conversations.replies to retrieve older messages and outputs the result as records. The processor tracks the earliest retrieved message timestamp in the cluster state to allow it to continue the historical load on subsequent executions. Channels are discovered automatically, no channel ID or name needs to be configured.

## Tags

consume, conversation, history, slack

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating the Slack request. It must be granted the channels:history (and, if resolving usernames, users:read) scope. |
| Batch Size | The maximum number of messages to retrieve in a single request to Slack. |
| Channel Refresh Frequency | The frequency at which the processor refreshes the list of Slack channels accessible to the App. This helps detect newly available channels or remove channels that are no longer available. |
| Include Message Blocks | Specifies whether the output JSON should include the value of the ‘blocks’ field for each Slack Message. |
| Include Null Fields | Specifies whether fields that have null values should be included in the output JSON. If true, any field with a null value will be output as null; if false, it will be omitted. |
| Rate Limiter Service | Slack Rate Limiter Service to coordinate rate limiting across processors |
| Resolve Usernames | Specifies whether User IDs should be resolved to usernames. If true, usernames will be resolved with a best-effort policy; if a username cannot be obtained, it will be skipped. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Maintains a mapping of Slack Channel IDs to the earliest message timestamp that has been retrieved. When no more messages are available, a flag is set indicating that the historical load is complete for that channel. This state is stored in the cluster so that if the Primary Node changes, the new node will pick up where the previous node left off. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles containing the JSON-encoded Slack conversation history are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| slack.channel.id | The ID of the Slack Channel from which the messages were retrieved |
| slack.channel.name | The name of the Slack Channel from which the messages were retrieved |
| slack.message.count | The number of Slack messages that are included in the FlowFile |
| mime.type | Set to application/json, the output will always be in JSON format |

---
title: ConsumeSnowflakeStream 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumesnowflakestream.md
section: Loading & Unloading Data
---

# ConsumeSnowflakeStream 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Fetches data from a Snowflake stream and writes it to a FlowFile. The stream must be created in the database before using this processor. The processor will consume the stream and write the records to the FlowFile using the specified Record Writer. The processor will also add an attribute to the FlowFile with the name of the stream. The processor will not work if the stream is stale. Instead it will log an error message and stop processing. Stale stream has to be recreated in the database. After the stream is recreated in the database the processor will continue to read and process CDC records. For more information on Snowflake streams, see the <a href=”<https://docs.snowflake.com/en/user-guide/streams-intro>”>snowflake documentation</a>.

## Tags

connection, database, jdbc, openflow, snowflake, stream, table, view

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Max Chunk Size | Number of records to write into a single FlowFile. This value might be slightly exceeded. |
| Record Writer | The Record Writer to use for CDC record serialization |
| Snowflake Connection Service | Database Connection Service for accessing Snowflake |
| Stream Name | The name of the stream in the database |

## Relationships

| Name | Description |
| --- | --- |
| success | For FlowFiles with stream CDC records |

## Writes attributes

| Name | Description |
| --- | --- |
| snowflake.stream.name | Name of the Snowflake Stream |

---
title: ConsumeTwitter 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/consumetwitter.md
section: Loading & Unloading Data
---

# ConsumeTwitter 2025.10.9.21

## Bundle

org.apache.nifi | nifi-social-media-nar

## Description

Streams tweets from Twitter’s streaming API v2. The stream provides a sample stream or a search stream based on previously uploaded rules. This processor also provides a pass through for certain fields of the tweet to be returned as part of the response. See <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/introduction> for more information regarding the Tweet object model.

## Tags

json, social media, status, tweets, twitter

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| backfill-minutes | The number of minutes (up to 5 minutes) of streaming data to be requested after a disconnect. Only available for project with academic research access. See <https://developer.twitter.com/en/docs/twitter-api/tweets/filtered-stream/integrate/recovery-and-redundancy-features> |
| backoff-attempts | The number of reconnection tries the processor will attempt in the event of a disconnection of the stream for any reason, before throwing an exception. To start a stream after this exception occur and the connection is fixed, please stop and restart the processor. If the valueof this property is 0, then backoff will never occur and the processor will always need to be restartedif the stream fails. |
| backoff-time | The duration to backoff before requesting a new stream ifthe current one fails for any reason. Will increase by factor of 2 every time a restart fails |
| base-path | The base path that the processor will use for making HTTP requests. The default value should be sufficient for most use cases. |
| batch-size | The maximum size of the number of Tweets to be written to a single FlowFile. Will write fewer Tweets based on the number available in the queue at the time of processor invocation. |
| bearer-token | The Bearer Token provided by Twitter. |
| connect-timeout | The maximum time in which client should establish a connection with the Twitter API before a time out. Setting the value to 0 disables connection timeouts. |
| expansions | A comma-separated list of expansions for objects in the returned tweet. See <https://developer.twitter.com/en/docs/twitter-api/expansions> for proper usage. Possible field values include: author_id, referenced_tweets.id, referenced_tweets.id.author_id, entities.mentions.username, attachments.poll_ids, attachments.media_keys ,in_reply_to_user_id, geo.place_id |
| maximum-backoff-time | The maximum duration to backoff to start attempting a new stream. It is recommended that this number be much higher than the ‘Backoff Time’ property |
| media-fields | A comma-separated list of media fields to be returned as part of the tweet. Refer to <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/object-model/media> for proper usage. Possible field values include: alt_text, duration_ms, height, media_key, non_public_metrics, organic_metrics, preview_image_url, promoted_metrics, public_metrics, type, url, width |
| place-fields | A comma-separated list of place fields to be returned as part of the tweet. Refer to <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/object-model/place> for proper usage. Possible field values include: contained_within, country, country_code, full_name, geo, id, name, place_type |
| poll-fields | A comma-separated list of poll fields to be returned as part of the tweet. Refer to <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/object-model/poll> for proper usage. Possible field values include: duration_minutes, end_datetime, id, options, voting_status |
| queue-size | Maximum size of internal queue for streamed messages |
| read-timeout | The maximum time of inactivity between receiving tweets from Twitter through the API before a timeout. Setting the value to 0 disables read timeouts. |
| stream-endpoint | The source from which the processor will consume Tweets. |
| tweet-fields | A comma-separated list of tweet fields to be returned as part of the tweet. Refer to <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/object-model/tweet> for proper usage. Possible field values include: attachments, author_id, context_annotations, conversation_id, created_at, entities, geo, id, in_reply_to_user_id, lang, non_public_metrics, organic_metrics, possibly_sensitive, promoted_metrics, public_metrics, referenced_tweets, reply_settings, source, text, withheld |
| user-fields | A comma-separated list of user fields to be returned as part of the tweet. Refer to <https://developer.twitter.com/en/docs/twitter-api/data-dictionary/object-model/user> for proper usage. Possible field values include: created_at, description, entities, id, location, name, pinned_tweet_id, profile_image_url, protected, public_metrics, url, username, verified, withheld |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles containing an array of one or more Tweets |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The MIME Type set to application/json |
| tweets | The number of Tweets in the FlowFile |

---
title: ControlRate 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/controlrate.md
section: Loading & Unloading Data
---

# ControlRate 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Controls the rate at which data is transferred to follow-on processors. If you configure a very small Time Duration, then the accuracy of the throttle gets worse. You can improve this accuracy by decreasing the Yield Duration, at the expense of more Tasks given to the processor.

## Tags

rate, rate control, throttle, throughput

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Grouping Attribute | By default, a single “throttle” is used for all FlowFiles. If this value is specified, a separate throttle is used for each value specified by the attribute with this name. Changing this value resets the rate counters. |
| Maximum Data Rate | The maximum rate at which data should pass through this processor. The format of this property is expected to be a Data Size (such as ‘1 MB’) representing bytes per Time Duration. |
| Maximum FlowFile Rate | The maximum rate at which FlowFiles should pass through this processor. The format of this property is expected to be a positive integer representing FlowFiles count per Time Duration |
| Maximum Rate | The maximum rate at which data should pass through this processor. The format of this property is expected to be a positive integer, or a Data Size (such as ‘1 MB’) if Rate Control Criteria is set to ‘data rate’. |
| Rate Control Criteria | Indicates the criteria that is used to control the throughput rate. Changing this value resets the rate counters. |
| Rate Controlled Attribute | The name of an attribute whose values build toward the rate limit if Rate Control Criteria is set to ‘attribute value’. The value of the attribute referenced by this property must be a positive long, or the FlowFile will be routed to failure. This value is ignored if Rate Control Criteria is not set to ‘attribute value’. Changing this value resets the rate counters. |
| Rate Exceeded Strategy | Specifies how to handle an incoming FlowFile when the maximum data rate has been exceeded. |
| Time Duration | The amount of time to which the Maximum Rate pertains. Changing this value resets the rate counters. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles will be routed to this relationship if they are missing a necessary Rate Controlled Attribute or the attribute is not in the expected format |
| success | FlowFiles are transferred to this relationship under normal conditions |

## Use cases

|  |
| --- |
| Limit the rate at which data is sent to a downstream system with little to no bursts |
| Limit the rate at which FlowFiles are sent to a downstream system with little to no bursts |
| Reject requests that exceed a specific rate with little to no bursts |
| Reject requests that exceed a specific rate, allowing for bursts |

---
title: ConvertCharacterSet 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/convertcharacterset.md
section: Loading & Unloading Data
---

# ConvertCharacterSet 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Converts a FlowFile’s content from one character set to another

## Tags

character set, characterset, convert, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Input Character Set | The name of the CharacterSet to expect for Input |
| Output Character Set | The name of the CharacterSet to convert to |

## Relationships

| Name | Description |
| --- | --- |
| success |  |

---
title: ConvertRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/convertrecord.md
section: Loading & Unloading Data
---

# ConvertRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Converts records from one data format to another using configured Record Reader and Record Write Controller Services. The Reader and Writer must be configured with “matching” schemas. By this, we mean the schemas must have the same field names. The types of the fields do not have to be the same if a field value can be coerced from one type to another. For instance, if the input schema has a field named “balance” of type double, the output schema can have a field named “balance” with a type of string, double, or float. If any field is present in the input that is not present in the output, the field will be left out of the output. If any field is specified in the output schema but is not present in the input data/schema, then the field will not be present in the output or will have a null value, depending on the writer.

## Tags

avro, convert, csv, freeform, generic, json, log, logs, record, schema, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Include Zero Record FlowFiles | When converting an incoming FlowFile, if the conversion results in no data, this property specifies whether or not a FlowFile will be sent to the corresponding relationship |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Record Writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| success | FlowFiles that are successfully transformed will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## Use cases

|  |
| --- |
| Convert data from one record-oriented format to another |

---
title: ConvertToJournalSchema 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/converttojournalschema.md
section: Loading & Unloading Data
---

# ConvertToJournalSchema 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Converts the incoming database schema into the appropriate schema for a Snowflake CDC Journal table.

## Tags

Snowflake, cdc, journal

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the schema cannot be translated. |
| original | The original FlowFile is routed to this relationship when the schema is successfully converted. |
| success | FlowFiles are routed to this relationship after the schema has been converted. |

---
title: CopyAzureBlobStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/copyazureblobstorage_v12.md
section: Loading & Unloading Data
---

# CopyAzureBlobStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Copies a blob in Azure Blob Storage from one account/container to another. The processor uses Azure Blob Storage client library v12.

## Tags

azure, blob, cloud, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Conflict Resolution Strategy | Specifies whether an existing blob will have its contents replaced upon conflict. |
| Create Container | Specifies whether to check if the container exists and to automatically create it if it does not. Permission to list containers is required. If false, this check is not made, but the Put operation will fail if the container does not exist. |
| Destination Blob Name | The full name of the destination blob defaults to the Source Blob Name when not specified |
| Destination Container Name | Name of the Azure storage container destination defaults to the Source Container Name when not specified |
| Destination Storage Credentials | Controller Service used to obtain Azure Blob Storage Credentials. |
| Source Blob Name | The full name of the source blob |
| Source Container Name | Name of the Azure storage container that will be copied |
| Source Storage Credentials | Credentials Service used to obtain Azure Blob Storage Credentials to read Source Blob information |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Unsuccessful operations will be transferred to the failure relationship. |
| success | All successfully processed FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.container | The name of the Azure Blob Storage container |
| azure.blobname | The name of the blob on Azure Blob Storage |
| azure.primaryUri | Primary location of the blob |
| azure.etag | ETag of the blob |
| azure.blobtype | Type of the blob (either BlockBlob, PageBlob or AppendBlob) |
| mime.type | MIME Type of the content |
| lang | Language code for the content |
| azure.timestamp | Timestamp of the blob |
| azure.length | Length of the blob |
| azure.error.code | Error code reported during blob operation |
| azure.ignored | When Conflict Resolution Strategy is ‘ignore’, this property will be true/false depending on whether the blob was ignored. |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureBlobStorage_v12](deleteazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureBlobStorage_v12](fetchazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.ListAzureBlobStorage_v12](listazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.PutAzureBlobStorage_v12](putazureblobstorage_v12.md)

---
title: CopyS3Object 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/copys3object.md
section: Loading & Unloading Data
---

# CopyS3Object 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Copies a file from one bucket and key to another in AWS S3

## Tags

AWS, Amazon, Archive, Copy, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Canned ACL | Amazon Canned ACL for an object, one of: BucketOwnerFullControl, BucketOwnerRead, LogDeliveryWrite, AuthenticatedRead, PublicReadWrite, PublicRead, Private; will be ignored if any other ACL/permission/owner property is specified |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Destination Bucket | The bucket that will receive the copy. |
| Destination Key | The target key in the target bucket |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| FullControl User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Full Control for an object |
| Owner | The Amazon ID to use for the object’s owner |
| Read ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to read the Access Control List for an object |
| Read Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Read Access for an object |
| Region | The AWS Region to connect to. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Source Bucket | The bucket that contains the file to be copied. |
| Source Key | The source key in the source bucket |
| Write ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to change the Access Control List for an object |
| Write Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Write Access for an object |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## See also

* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: CountText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/counttext.md
section: Loading & Unloading Data
---

# CountText 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Counts various metrics on incoming text. The requested results will be recorded as attributes. The resulting flowfile will not have its content modified.

## Tags

character, count, line, text, word

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ajust-immediately | If true, the counter will be updated immediately, without regard to whether the ProcessSession is commit or rolled back;otherwise, the counter will be incremented only if and when the ProcessSession is committed. |
| character-encoding | Specifies a character encoding to use. |
| split-words-on-symbols | If enabled, the word count will identify strings separated by common logical delimiters [ _ - . ] as independent words (ex. split-words-on-symbols = 4 words). |
| text-character-count | If enabled, will count the number of characters (including whitespace and symbols, but not including newlines and carriage returns) present in the incoming text. |
| text-line-count | If enabled, will count the number of lines present in the incoming text. |
| text-line-nonempty-count | If enabled, will count the number of lines that contain a non-whitespace character present in the incoming text. |
| text-word-count | If enabled, will count the number of words (alphanumeric character groups bounded by whitespace) present in the incoming text. Common logical delimiters [_-.] do not bound a word unless ‘Split Words on Symbols’ is true. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the flowfile text cannot be counted for some reason, the original file will be routed to this destination and nothing will be routed elsewhere |
| success | The flowfile contains the original content with one or more attributes added containing the respective counts |

## Writes attributes

| Name | Description |
| --- | --- |
| text.line.count | The number of lines of text present in the FlowFile content |
| text.line.nonempty.count | The number of lines of text (with at least one non-whitespace character) present in the original FlowFile |
| text.word.count | The number of words present in the original FlowFile |
| text.character.count | The number of characters (given the specified character encoding) present in the original FlowFile |

## See also

* [org.apache.nifi.processors.standard.SplitText](splittext.md)

---
title: CreateAmazonAdsReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createamazonadsreport.md
section: Loading & Unloading Data
---

# CreateAmazonAdsReport 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-amazon-ads-processors-nar

## Description

Processor which creates report configuration for Amazon Ads connector. By default it runs once a day.

## Tags

Amazon, Amazon Ads, report

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token Provider | Service providing OAuth access token. |
| Amazon Advertising Client ID | Client ID of the Amazon Advertising user. |
| Region | Environment from which advertising data will be downloaded. |
| Report Ad Product | Type of advertising product being reported. |
| Report Columns | List of columns fetched from Reporting API. |
| Report Filters | Set of filters used to trim returned data. |
| Report Group By | Level of granularity of the report. |
| Report Ingestion Strategy | Configuration of the report ingestion. |
| Report Ingestion Window | How many days from the past should be downloaded during incremental ingestion. |
| Report Name | Unique name of the report. |
| Report Profile ID | The profile ID associated with an advertising account in a specific marketplace. |
| Report Start Date | Start date from which the ingestion should happen. |
| Report Time Unit | Date aggregation. |
| Report Type | Data type contained in the report. |
| Web Client Service Provider | Service providing client for REST request execution. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores information about last report definition in form of hash to detect schema changes. Incrementally loaded reports persist last ingestion date to define ingestion date ranges after initial load. Additionally start date is saved. |

## Relationships

| Name | Description |
| --- | --- |
| success | Response FlowFiles transferred when receiving success response from Amazon Ads Reporting API. |

## Writes attributes

| Name | Description |
| --- | --- |
| amazon.ads.report.id | Unique identifier of the currently prepared job. |
| amazon.ads.report.name | Unique name of the report. |
| amazon.ads.ingestion.strategy | Strategy which defines if the report will be downloaded as a SNAPSHOT or INCREMENTALLY. |
| amazon.ads.run.id | Unique identifier of the current ingestion process. |
| amazon.ads.ingestion.start.date | Date from which data is downloaded from Amazon Ads (including given date). |
| amazon.ads.ingestion.end.date | Date to which data is downloaded from Amazon Ads (including given date). |
| amazon.ads.report.schema.changed | Flag meaning if the report schema has changed between processor executions. |
| avro.schema | Avro schema containing set of all configured fields. |
| fragment.identifier | A unique ID of each ingestion run. Allows to identify all flow files generated during a single run. |
| fragment.index | Number representing unique identifier in batch of flowfiles generated during one ingestion run. |
| fragment.count | Amount of flowfiles generated during processor execution. |

---
title: CreateAzureOpenAiEmbeddings 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createazureopenaiembeddings.md
section: Loading & Unloading Data
---

# CreateAzureOpenAiEmbeddings 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-openai-nar

## Description

Uses Azure OpenAI to create embeddings for text. The input text can be provided as a single FlowFile or as a record-oriented FlowFile.

## Tags

azure, chatbot, embeddings, gen ai, generative ai, llm, nlp, openai, openflow, text

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| API Key | The API Key for authenticating to Azure OpenAI |
| Deployment Name | The name of the OpenAI model deployment to use for creating embeddings |
| Dimensions | The number of dimensions to request the resulting output embeddings have. This is only supported in text-embedding-3 and later models. |
| Embeddings Record Path | The path to the field in the record where the embeddings are to be written. |
| Max Batch Size | The maximum number of records to include in each batch sent to OpenAI |
| OpenAI Service Name | The name of the OpenAI service to use |
| Record Reader | The record reader to use for reading record-oriented data. If the incoming data is to be treated as plaintext, this property should be left unset. |
| Record Writer | The Record Writer to use for writing the output |
| Text Record Path | The path to the field in the record that contains the text to be embedded. If the incoming data is to be treated as plaintext, this property should be left unset. |
| User | An identifier for the remote user on whose behalf the request is being made; OpenAI uses this to detect and prevent abuse. |
| Web Client Service | The Web Client Service to use for communicating with OpenAI |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile will be routed to this relationship if the embeddings could not be created |
| success | The embeddings will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records written to the output |
| mime.type | The MIME type of the output data, based on the chosen Record Writer |

## Use cases

|  |
| --- |
| Create embeddings for text using Azure OpenAI’s Embeddings |

## See also

* [com.snowflake.openflow.runtime.processors.openai.CreateOpenAiEmbeddings](createopenaiembeddings.md)
* [com.snowflake.openflow.runtime.processors.openai.PromptAzureOpenAI](promptazureopenai.md)

---
title: CreateBoxFileMetadataInstance 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createboxfilemetadatainstance.md
section: Loading & Unloading Data
---

# CreateBoxFileMetadataInstance 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Creates a metadata instance for a Box file using a specified template with values from the flowFile content. The Box API requires newly created templates to be created with the scope set as enterprise so no scope is required. The input record should be a flat key-value object where each field name is used as the metadata key.

## Tags

box, create, metadata, storage, templates

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file for which to create metadata. |
| Record Reader | The Record Reader to use for parsing the incoming data |
| Template Key | The key of the metadata template to use for creation. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if an error occurs during metadata creation. |
| file not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile is routed to this relationship after metadata has been successfully created. |
| template not found | FlowFiles for which the specified metadata template was not found will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file for which metadata was created |
| box.template.key | The template key used for metadata creation |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFileMetadataTemplates](listboxfilemetadatatemplates.md)
* [org.apache.nifi.processors.box.UpdateBoxFileMetadataInstance](updateboxfilemetadatainstance.md)

---
title: CreateBoxMetadataTemplate 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createboxmetadatatemplate.md
section: Loading & Unloading Data
---

# CreateBoxMetadataTemplate 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Creates a Box metadata template using field specifications from the flowFile content. Expects a schema with fields: “ ‘type’ (required), ‘key’ (required), ‘displayName’ (optional), ‘description’ (optional), ‘hidden’ (optional, boolean).

## Tags

box, create, metadata, storage, templates

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Hidden | Whether the template should be hidden in the Box UI. |
| Record Reader | The Record Reader to use for parsing the incoming data |
| Template Key | The key of the metadata template to create (used for API calls). |
| Template Name | The display name of the metadata template to create. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if an error occurs during template creation. |
| success | A FlowFile is routed to this relationship after a template has been successfully created. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.template.name | The template name that was created |
| box.template.key | The template key that was created |
| box.template.scope | The template scope. |
| box.template.fields.count | Number of fields created for the template |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.ListBoxFileMetadataTemplates](listboxfilemetadatatemplates.md)
* [org.apache.nifi.processors.box.UpdateBoxFileMetadataInstance](updateboxfilemetadatainstance.md)

---
title: CreateCohereEmbeddings 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createcohereembeddings.md
section: Loading & Unloading Data
---

# CreateCohereEmbeddings 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-cohere-nar

## Description

Uses Cohere to create embeddings for text. The input text can be provided as a single FlowFile or as a record-oriented FlowFile.

## Tags

chatbot, cohere, embeddings, gen ai, generative ai, llm, nlp, openflow, text

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Cohere API Key | The API Key for authenticating to Cohere |
| Embedding Type | Specifies the types of embeddings you want to get back. |
| Embeddings Model | The model to use for embeddings, available models are listed at <https://docs.cohere.com/reference/embed> |
| Embeddings Record Path | The path to the field in the record where the embeddings are to be written. |
| Input Type | Specifies the type of input passed to the model. Required for embedding models v3 and higher. |
| Max Batch Size | The maximum number of records to include in each batch sent to Cohere |
| Record Reader | The record reader to use for reading record-oriented data. If the incoming data is to be treated as plaintext, this property should be left unset. |
| Record Writer | The Record Writer to use for writing the output |
| Text Record Path | The path to the field in the record that contains the text to be embedded. If the incoming data is to be treated as plaintext, this property should be left unset. |
| Truncate Policy | One of NONE|START|END to specify how the API will handle inputs longer than the maximum token length. |
| User | An identifier for the remote user on whose behalf the request is being made. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile will be routed to this relationship if the embeddings could not be created |
| success | The embeddings will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records written to the output |
| mime.type | The MIME type of the output data, based on the chosen Record Writer |

## Use cases

|  |
| --- |
| Create embeddings for text using Cohere’s Embedding model |

---
title: CreateMetaAdsReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createmetaadsreport.md
section: Loading & Unloading Data
---

# CreateMetaAdsReport 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-meta-ads-processors-nar

## Description

Processor which creates report configuration for Meta Ads connector. By default it runs once a day.

## Tags

Facebook, Meta, Meta Ads, report

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | Token required to request Meta Ads Marketing API. It must match pattern ‘Bearer <Access Token Value>’. |
| Action Report Time | Determine the report time of action stats. |
| Click Attribution Window | Attribution window for the click action. |
| Meta Ads API Version | Version of Meta Ads API which is used for report generation. |
| Report Breakdowns | List of values which determine how to break down the result. Multiple breakdowns can be picked, but only some combinations work. |
| Report Fields | List of fields fetched from Marketing API. If non are selected most used fields will be downloaded. |
| Report Ingestion Strategy | Configuration of the report ingestion. |
| Report Level | Granularity of the report. |
| Report Name | Unique name of the report. |
| Report Object ID | ID of the object from which data will be fetched. It can be Account, Campaign, Ad or Ad Set ID. |
| Report Start Date | Start date from which the ingestion should happen. |
| Report Time Increment | Value of aggregation in days. |
| View Attribution Window | Attribution window for the view action. |
| Web Client Service Provider | Service providing client for REST request execution. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores information about last report definition in form of hash to detect schema changes. Incrementally loaded reports persist last ingestion date to define ingestion date ranges after initial load. Additionally start date is saved. |

## Relationships

| Name | Description |
| --- | --- |
| success | Response FlowFiles transferred when receiving success response from Meta Ads Marketing API. |

## Writes attributes

| Name | Description |
| --- | --- |
| meta.ads.report.id | Unique identifier of the currently prepared job. |
| meta.ads.report.name | Unique name of the report. |
| meta.ads.report.ingestion.strategy | Strategy which defines if the report will be downloaded as a SNAPSHOT or INCREMENTALLY. |
| meta.ads.run.id | Unique identifier of the current ingestion process. |
| meta.ads.ingestion.start.date | Date from which data is downloaded from Meta Ads (including given date). |
| meta.ads.ingestion.end.date | Date to which data is downloaded from Meta Ads (including given date). |
| meta.ads.report.schema.changed | Flag meaning if the report schema has changed between processor executions. |
| avro.schema | Avro schema containing set of all configured fields. |

---
title: CreateOpenAiEmbeddings 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createopenaiembeddings.md
section: Loading & Unloading Data
---

# CreateOpenAiEmbeddings 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-openai-nar

## Description

Uses OpenAI to create embeddings for text. The input text can be provided as a single FlowFile or as a record-oriented FlowFile.

## Tags

chatbot, embeddings, gen ai, generative ai, llm, nlp, openai, openflow, text

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Dimensions | The number of dimensions to request the resulting output embeddings have. This is only supported in text-embedding-3 and later models. |
| Embeddings Model | The model to use for embeddings |
| Embeddings Record Path | The path to the field in the record where the embeddings are to be written. |
| Max Batch Size | The maximum number of records to include in each batch sent to OpenAI |
| OpenAI API Key | The API Key for authenticating to OpenAI |
| OpenAI Organization | The organization to use for OpenAI |
| Record Reader | The record reader to use for reading record-oriented data. If the incoming data is to be treated as plaintext, this property should be left unset. |
| Record Writer | The Record Writer to use for writing the output |
| Text Record Path | The path to the field in the record that contains the text to be embedded. If the incoming data is to be treated as plaintext, this property should be left unset. |
| User | An identifier for the remote user on whose behalf the request is being made; OpenAI uses this to detect and prevent abuse. |
| Web Client Service | The Web Client Service to use for communicating with OpenAI |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile will be routed to this relationship if the embeddings could not be created |
| success | The embeddings will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records written to the output |
| mime.type | The MIME type of the output data, based on the chosen Record Writer |

## Use cases

|  |
| --- |
| Create embeddings for text using OpenAI’s Embeddings |

## See also

* [com.snowflake.openflow.runtime.processors.openai.CreateAzureOpenAiEmbeddings](createazureopenaiembeddings.md)
* [com.snowflake.openflow.runtime.processors.openai.PromptOpenAI](promptopenai.md)

---
title: CreateSnowflakeEmbeddings 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createsnowflakeembeddings.md
section: Loading & Unloading Data
---

# CreateSnowflakeEmbeddings 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Create vector embeddings using Snowflake Cortex Large Language Model functions

## Tags

chatbot, embeddings, gen ai, generative ai, llm, nlp, openflow, snowflake, text

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Dimensions | The number of dimensions to request the resulting output embeddings have. |
| Embeddings Model | The model to use for embeddings |
| Record Writer | The Record Writer to use for writing the output |
| Snowflake Connection Service | Database Connection Service for accessing Snowflake |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile will be routed to this relationship if the embeddings could not be created |
| success | The embeddings will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records written to the output |
| mime.type | The MIME type of the output data, based on the chosen Record Writer |

---
title: CreateVertexAIEmbeddings 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/createvertexaiembeddings.md
section: Loading & Unloading Data
---

# CreateVertexAIEmbeddings 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-vertexai-nar

## Description

Uses VertexAI to create embeddings for text. The input text can be provided as a single FlowFile or as a record-oriented FlowFile.

## Tags

chatbot, cloud, embeddings, gcp, gen ai, generative ai, google, llm, nlp, openflow, text, vertex

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Auto Truncate | If set to false, text that exceeds the token limit causes the request to fail. |
| Embeddings Model | The model to use for embeddings, available models are listed at <https://cloud.google.com/vertex-ai/generative-ai/docs/learn/models#models> |
| Embeddings Record Path | The path to the field in the record where the embeddings are to be written. |
| GCP Credentials Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| GCP Location | The location to configure the Vertex client with |
| GCP Project ID | The project ID to configure the Vertex client with |
| Max Batch Size | The maximum number of records to include in each batch sent to VertexAI |
| Model Publisher | The publisher of the model |
| Output Dimensionality | Used to specify output embedding size. If set, output embeddings will be truncated to the size specified. |
| Record Reader | The record reader to use for reading record-oriented data. If the incoming data is to be treated as plaintext, this property should be left unset. |
| Record Writer | The Record Writer to use for writing the output |
| Task Type | Used to convey intended downstream application of embeddings to help the model tune embeddings for a specific purpose. |
| Text Record Path | The path to the field in the record that contains the text to be embedded. If the incoming data is to be treated as plaintext, this property should be left unset. |
| User | An identifier for the remote user on whose behalf the request is being made. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile will be routed to this relationship if the embeddings could not be created |
| success | The embeddings will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records written to the output |
| mime.type | The MIME type of the output data, based on the chosen Record Writer |

## Use cases

|  |
| --- |
| Create embeddings for text using VertexAI’s Embedding model |

---
title: CryptographicHashContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/cryptographichashcontent.md
section: Loading & Unloading Data
---

# CryptographicHashContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Calculates a cryptographic hash value for the flowfile content using the given algorithm and writes it to an output attribute. Please refer to <https://csrc.nist.gov/Projects/Hash-Functions/NIST-Policy-on-Hash-Functions> for help to decide which algorithm to use.

## Tags

blake2, content, cryptography, hash, md5, sha

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| fail_when_empty | Route to failure if the content is empty. While hashing an empty value is valid, some flows may want to detect empty input. |
| hash_algorithm | The hash algorithm to use. Note that not all of the algorithms available are recommended for use (some are provided for legacy compatibility). There are many things to consider when picking an algorithm; it is recommended to use the most secure algorithm possible. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Used for flowfiles that have no content if the ‘fail on empty’ setting is enabled |
| success | Used for flowfiles that have a hash value added |

## Writes attributes

| Name | Description |
| --- | --- |
| content_<algorithm> | This processor adds an attribute whose value is the result of hashing the flowfile content. The name of this attribute is specified by the value of the algorithm, e.g. ‘content_SHA-256’. |

---
title: CSVReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/csvreader.md
section: Loading & Unloading Data
---

# CSVReader

## Description

Parses CSV-formatted data, returning each row in the CSV file as a separate record. This reader allows for inferring a schema based on the first line of the CSV, if a ‘header line’ is present, or providing an explicit schema for interpreting the values. See Controller Service’s Usage for further documentation.

## Tags

comma, csv, delimited, parse, reader, record, row, separated, values

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Allow Duplicate Header Names | Allow Duplicate Header Names | true | * true * false | Whether duplicate header names are allowed. Header names are case-sensitive, for example “name” and “Name” are treated as separate fields.Handling of duplicate header names is CSV Parser specific (where applicable):\* Apache Commons CSV - duplicate headers will result in column data “shifting” right with new fields created for “unknown_field_index_X” where “X” is the CSV column index number\* Jackson CSV - duplicate headers will be de-duplicated with the field value being that of the right-most duplicate CSV column\* FastCSV - duplicate headers will be de-duplicated with the field value being that of the left-most duplicate CSV column |
| CSV Format \* | CSV Format | custom | * Custom Format * RFC 4180 * Microsoft Excel * Tab-Delimited * MySQL Format * Informix Unload * Informix Unload Escape Disabled | Specifies which “format” the CSV data is in, or specifies if custom formatting should be used. |
| Character Set \* | Character Set | UTF-8 |  | The Character Encoding that is used to encode/decode the CSV file |
| Comment Marker | Comment Marker |  |  | The character that is used to denote the start of a comment. Any line that begins with this comment will be ignored. |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Escape Character \* | Escape Character |  |  | The character that is used to escape characters that would otherwise have a specific meaning to the CSV Parser. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Escape Character at runtime, then it will be skipped and the default Escape Character will be used. Setting it to an empty string means no escape character should be used. |
| Ignore CSV Header Column Names | Ignore CSV Header Column Names | false | * true * false | If the first line of a CSV is a header, and the configured schema does not match the fields named in the header line, this controls how the Reader will interpret the fields. If this property is true, then the field names mapped to each column are driven only by the configured schema and any fields not in the schema will be ignored. If this property is false, then the field names found in the CSV Header will be used as the names of the fields. |
| Null String | Null String |  |  | Specifies a String that, if present as a value in the CSV, should be considered a null field instead of using the literal value. |
| Quote Character \* | Quote Character | “ |  | The character that is used to quote values so that escape characters do not have to be used. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Quote Character at runtime, then it will be skipped and the default Quote Character will be used. |
| Record Separator \* | Record Separator | n |  | Specifies the characters to use in order to separate CSV Records |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Use String Fields From Header * Infer Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Treat First Line as Header \* | Treat First Line as Header | false | * true * false | Specifies whether or not the first line of CSV should be considered a Header or should be considered a record. If the Schema Access Strategy indicates that the columns must be defined in the header, then this property will be ignored, since the header must always be present and won’t be processed as a Record. Otherwise, if ‘true’, then the first line of CSV data will not be processed as a record and if ‘false’,then the first line will be interpreted as a record. |
| Trim Fields \* | Trim Fields | true | * true * false | Whether or not white space should be removed from the beginning and end of fields |
| Trim double quote \* | Trim double quote | true | * true * false | Whether or not to trim starting and ending double quotes. For example: with trim string ‘“test”’ would be parsed to ‘test’, without trim would be parsed to ‘“test”’.If set to ‘false’ it means full compliance with RFC-4180. Default value is true, with trim. |
| Value Separator \* | Value Separator | , |  | The character that is used to separate values/fields in a CSV Record. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Value Separator at runtime, then it will be skipped and the default Value Separator will be used. |
| CSV Parser \* | csv-reader-csv-parser | commons-csv | * Apache Commons CSV * Jackson CSV * FastCSV | Specifies which parser to use to read CSV records. NOTE: Different parsers may support different subsets of functionality and may also exhibit different levels of performance. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: CSVRecordLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/csvrecordlookupservice.md
section: Loading & Unloading Data
---

# CSVRecordLookupService

## Description

A reloadable CSV file-based lookup service. When the lookup key is found in the CSV file, the columns are returned as a Record. All returned fields will be strings. The first line of the csv file is considered as header.

## Tags

cache, csv, enrich, join, key, lookup, record, reloadable, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| CSV Format \* | CSV Format | default | * Custom Format * RFC 4180 * Microsoft Excel * Tab-Delimited * MySQL Format * Informix Unload * Informix Unload Escape Disabled * Default Format * RFC4180 | Specifies which “format” the CSV data is in, or specifies if custom formatting should be used. |
| Character Set \* | Character Set | UTF-8 |  | The Character Encoding that is used to decode the CSV file. |
| Comment Marker | Comment Marker |  |  | The character that is used to denote the start of a comment. Any line that begins with this comment will be ignored. |
| Escape Character \* | Escape Character |  |  | The character that is used to escape characters that would otherwise have a specific meaning to the CSV Parser. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Escape Character at runtime, then it will be skipped and the default Escape Character will be used. Setting it to an empty string means no escape character should be used. |
| Quote Character \* | Quote Character | “ |  | The character that is used to quote values so that escape characters do not have to be used. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Quote Character at runtime, then it will be skipped and the default Quote Character will be used. |
| Quote Mode \* | Quote Mode | MINIMAL | * Quote All Values * Quote Minimal * Quote Non-Numeric Values * Do Not Quote Values | Specifies how fields should be quoted when they are written |
| Trim Fields \* | Trim Fields | true | * true * false | Whether or not white space should be removed from the beginning and end of fields |
| Value Separator \* | Value Separator | , |  | The character that is used to separate values/fields in a CSV Record. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Value Separator at runtime, then it will be skipped and the default Value Separator will be used. |
| CSV File \* | csv-file |  |  | Path to a CSV File in which the key value pairs can be looked up. |
| Ignore Duplicates \* | ignore-duplicates | true | * true * false | Ignore duplicate keys for records in the CSV file. |
| Lookup Key Column \* | lookup-key-column |  |  | The field in the CSV file that will serve as the lookup key. This is the field that will be matched against the property specified in the lookup processor. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: CSVRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/csvrecordsetwriter.md
section: Loading & Unloading Data
---

# CSVRecordSetWriter

## Description

Writes the contents of a RecordSet as CSV data. The first line written will be the column names (unless the ‘Include Header Line’ property is false). All subsequent lines will be the values corresponding to the record fields.

## Tags

csv, delimited, record, recordset, result, row, separated, serializer, set, tab, tsv, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| CSV Format \* | CSV Format | custom | * Custom Format * RFC 4180 * Microsoft Excel * Tab-Delimited * MySQL Format * Informix Unload * Informix Unload Escape Disabled | Specifies which “format” the CSV data is in, or specifies if custom formatting should be used. |
| Character Set \* | Character Set | UTF-8 |  | The Character Encoding that is used to encode/decode the CSV file |
| Comment Marker | Comment Marker |  |  | The character that is used to denote the start of a comment. Any line that begins with this comment will be ignored. |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Escape Character \* | Escape Character |  |  | The character that is used to escape characters that would otherwise have a specific meaning to the CSV Parser. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Escape Character at runtime, then it will be skipped and the default Escape Character will be used. Setting it to an empty string means no escape character should be used. |
| Include Header Line \* | Include Header Line | true | * true * false | Specifies whether or not the CSV column names should be written out as the first line. |
| Include Trailing Delimiter \* | Include Trailing Delimiter | false | * true * false | If true, a trailing delimiter will be added to each CSV Record that is written. If false, the trailing delimiter will be omitted. |
| Null String | Null String |  |  | Specifies a String that, if present as a value in the CSV, should be considered a null field instead of using the literal value. |
| Quote Character \* | Quote Character | “ |  | The character that is used to quote values so that escape characters do not have to be used. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Quote Character at runtime, then it will be skipped and the default Quote Character will be used. |
| Quote Mode \* | Quote Mode | MINIMAL | * Quote All Values * Quote Minimal * Quote Non-Numeric Values * Do Not Quote Values | Specifies how fields should be quoted when they are written |
| Record Separator \* | Record Separator | n |  | Specifies the characters to use in order to separate CSV Records |
| Schema Access Strategy \* | Schema Access Strategy | inherit-record-schema | * Inherit Record Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Cache | Schema Cache |  |  | Specifies a Schema Cache to add the Record Schema to so that Record Readers can quickly lookup the schema. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Reference Writer \* | Schema Reference Writer |  |  | Service implementation responsible for writing FlowFile attributes or content header with Schema reference information |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Schema Write Strategy \* | Schema Write Strategy | no-schema | * Do Not Write Schema * Set ‘schema.name’ Attribute * Set ‘avro.schema’ Attribute * Schema Reference Writer | Specifies how the schema for a Record should be added to the data. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Trim Fields \* | Trim Fields | true | * true * false | Whether or not white space should be removed from the beginning and end of fields |
| Value Separator \* | Value Separator | , |  | The character that is used to separate values/fields in a CSV Record. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Value Separator at runtime, then it will be skipped and the default Value Separator will be used. |
| CSV Writer \* | csv-writer | commons-csv | * Apache Commons CSV * FastCSV | Specifies which writer implementation to use to write CSV records. NOTE: Different writers may support different subsets of functionality and may also exhibit different levels of performance. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DatabaseLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/databaselookup.md
section: Loading & Unloading Data
---

# DatabaseLookup

## Description

A Lookup Service that allows for enrichment with a database using a user-specified SQL statement. The SQL statement may reference any value from the FlowFile’s Record that is provided by the calling Processor.

## Tags

database, enrich, join, lookup, openflow, rdbms, record, sql

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Connection Pooling Service \* | Connection Pooling Service |  |  | The Connection Pooling Service that is used to obtain a connection to the database |
| Max Array Size \* | Max Array Size | 1000 |  | The maximum number of records to include in the array. This is a mechanism to ensure that the returned results due not cause memory issues. If the result set contains more records than this value, the lookup will fail. If the desire is instead to limit the number of rows returned, a LIMIT clause should be added to the SQL. |
| Multiple Result Field Name \* | Multiple Result Field Name | results |  | If multiple results are returned, they will be combined into an array. This property dictates the name of the field in the returned record. |
| Multiple Result Strategy \* | Multiple Result Strategy | Fail | * Use Array * Use First Only * Fail | Specifies how to handle the situation where the lookup results in multiple records. |
| SQL \* | SQL |  |  | The SQL statement to execute against the database in order to lookup the value. The statement may reference any attributes or values from the incoming Record that are provided by the calling Processor via Expression Language. The processor is will extract any Expression Language expressions and replace them with parameterized values so that the SQL can be safely executed, avoiding SQL Injection attacks. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DatabaseRecordLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/databaserecordlookupservice.md
section: Loading & Unloading Data
---

# DatabaseRecordLookupService

## Description

A relational-database-based lookup service. When the lookup key is found in the database, the specified columns (or all if Lookup Value Columns are not specified) are returned as a Record. Only one row will be returned for each lookup, duplicate database entries are ignored.

## Tags

cache, database, enrich, join, key, lookup, rdbms, record, reloadable, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Cache Expiration | Cache Expiration |  |  | Time interval to clear all cache entries. If the Cache Size is zero then this property is ignored. |
| Default Decimal Precision \* | Default Decimal Precision | 10 |  | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale \* | Default Decimal Scale | 0 |  | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Cache Size \* | dbrecord-lookup-cache-size | 0 |  | Specifies how many lookup values/records should be cached. The cache is shared for all tables and keeps a map of lookup values to records. Setting this property to zero means no caching will be done and the table will be queried for each lookup value in each record. If the lookup table changes often or the most recent data must be retrieved, do not use the cache. |
| Clear Cache on Enabled \* | dbrecord-lookup-clear-cache-on-enabled | true | * true * false | Whether to clear the cache when this service is enabled. If the Cache Size is zero then this property is ignored. Clearing the cache when the service is enabled ensures that the service will first go to the database to get the most recent data. |
| Database Connection Pooling Service \* | dbrecord-lookup-dbcp-service |  |  | The Controller Service that is used to obtain connection to database |
| Lookup Key Column \* | dbrecord-lookup-key-column |  |  | The column in the table that will serve as the lookup key. This is the column that will be matched against the property specified in the lookup processor. Note that this may be case-sensitive depending on the database. |
| Table Name \* | dbrecord-lookup-table-name |  |  | The name of the database table to be queried. Note that this may be case-sensitive depending on the database. |
| Lookup Value Columns | dbrecord-lookup-value-columns |  |  | A comma-delimited list of columns in the table that will be returned when the lookup key matches. Note that this may be case-sensitive depending on the database. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DatabaseRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/databaserecordsink.md
section: Loading & Unloading Data
---

# DatabaseRecordSink

## Description

Provides a service to write records using a configured database connection.

## Tags

connection, database, db, jdbc, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Catalog Name | db-record-sink-catalog-name |  |  | The name of the catalog that the statement should update. This may not apply for the database that you are updating. In this case, leave the field empty |
| Database Connection Pooling Service \* | db-record-sink-dcbp-service |  |  | The Controller Service that is used to obtain a connection to the database for sending records. |
| Max Wait Time \* | db-record-sink-query-timeout | 0 seconds |  | The maximum amount of time allowed for a running SQL statement , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| Quote Column Identifiers | db-record-sink-quoted-identifiers | false | * true * false | Enabling this option will cause all column names to be quoted, allowing you to use reserved words as column names in your tables. |
| Quote Table Identifiers | db-record-sink-quoted-table-identifiers | false | * true * false | Enabling this option will cause the table name to be quoted to support the use of special characters in the table name. |
| Schema Name | db-record-sink-schema-name |  |  | The name of the schema that the table belongs to. This may not apply for the database that you are updating. In this case, leave the field empty |
| Table Name \* | db-record-sink-table-name |  |  | The name of the table that the statement should affect. |
| Translate Field Names | db-record-sink-translate-field-names | true | * true * false | If true, the Processor will attempt to translate field names into the appropriate column names for the table specified. If false, the field names must match the column names exactly, or the column will not be updated |
| Unmatched Column Behavior | db-record-sink-unmatched-column-behavior | Fail on Unmatched Columns | * Ignore Unmatched Columns * Warn on Unmatched Columns * Fail on Unmatched Columns | If an incoming record does not have a field mapping for all of the database table’s columns, this property specifies how to handle the situation |
| Unmatched Field Behavior | db-record-sink-unmatched-field-behavior | Ignore Unmatched Fields | * Ignore Unmatched Fields * Fail on Unmatched Fields | If an incoming record has a field that does not map to any of the database table’s columns, this property specifies how to handle the situation |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DBCPConnectionPool
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/dbcpconnectionpool.md
section: Loading & Unloading Data
---

# DBCPConnectionPool

## Description

Provides Database Connection Pooling Service. Connections can be asked from pool and returned after usage.

## Tags

connection, database, dbcp, jdbc, pooling, store

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Database Connection URL \* | Database Connection URL |  |  | A database connection URL used to connect to a database. May contain database system name, host, port, database name and some parameters. The exact syntax of a database connection URL is specified by your DBMS. |
| Database Driver Class Name \* | Database Driver Class Name |  |  | Database driver class name |
| Database Driver Location(s) | Database Driver Location(s) |  |  | Comma-separated list of files/folders and/or URLs containing the driver JAR and its dependencies (if any). For example ‘/var/tmp/mariadb-java-client-1.1.7.jar’ |
| Database User | Database User |  |  | Database user name |
| Kerberos User Service | Kerberos User Service |  |  | Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos |
| Max Total Connections \* | Max Total Connections | 8 |  | The maximum number of active connections that can be allocated from this pool at the same time, or negative for no limit. |
| Max Wait Time \* | Max Wait Time | 500 millis |  | The maximum amount of time that the pool will wait (when there are no available connections) for a connection to be returned before failing, or -1 to wait indefinitely. |
| Maximum Connection Lifetime | Maximum Connection Lifetime | -1 |  | The maximum lifetime of a connection. After this time is exceeded the connection will fail the next activation, passivation or validation test. A value of zero or less means the connection has an infinite lifetime. |
| Maximum Idle Connections | Maximum Idle Connections | 8 |  | The maximum number of connections that can remain idle in the pool without extra ones being released. Set to any negative value to allow unlimited idle connections. |
| Minimum Evictable Idle Time | Minimum Evictable Idle Time | 30 mins |  | The minimum amount of time a connection may sit idle in the pool before it is eligible for eviction. |
| Minimum Idle Connections | Minimum Idle Connections | 0 |  | The minimum number of connections that can remain idle in the pool without extra ones being created. Set to or zero to allow no idle connections. |
| Password | Password |  |  | The password for the database user |
| Soft Minimum Evictable Idle Time | Soft Minimum Evictable Idle Time | -1 |  | The minimum amount of time a connection may sit idle in the pool before it is eligible for eviction by the idle connection evictor, with the extra condition that at least a minimum number of idle connections remain in the pool. When the not-soft version of this option is set to a positive value, it is examined first by the idle connection evictor: when idle connections are visited by the evictor, idle time is first compared against it (without considering the number of idle connections in the pool) and then against this soft option, including the minimum idle connections constraint. |
| Time Between Eviction Runs | Time Between Eviction Runs | -1 |  | The time period to sleep between runs of the idle connection evictor thread. When non-positive, no idle connection evictor thread will be run. |
| Validation Query | Validation Query |  |  | Validation query used to validate connections before returning them. When connection is invalid, it gets dropped and new valid connection will be returned. Note!! Using validation might have some performance penalty. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Database Driver Location can reference resources over HTTP |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DBCPConnectionPoolLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/dbcpconnectionpoollookup.md
section: Loading & Unloading Data
---

# DBCPConnectionPoolLookup

## Description

Provides a DBCPService that can be used to dynamically select another DBCPService. This service requires an attribute named ‘database.name’ to be passed in when asking for a connection, and will throw an exception if the attribute is missing. The value of ‘database.name’ will be used to select the DBCPService that has been registered with that name. This will allow multiple DBCPServices to be defined and registered, and then selected dynamically at runtime by tagging flow files with the appropriate ‘database.name’ attribute.

## Tags

connection, database, dbcp, jdbc, pooling, store

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DebugFlow 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/debugflow.md
section: Loading & Unloading Data
---

# DebugFlow 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

The DebugFlow processor aids testing and debugging the FlowFile framework by allowing various responses to be explicitly triggered in response to the receipt of a FlowFile or a timer event without a FlowFile if using timer or cron based scheduling. It can force responses needed to exercise or test various failure modes that can occur when a processor runs.

## Tags

FlowFile, debug, flow, processor, test, utility

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| @OnScheduled Pause Time | Specifies how long the processor should sleep in the @OnScheduled method, so that the processor can be forced to take a long time to start up |
| @OnStopped Pause Time | Specifies how long the processor should sleep in the @OnStopped method, so that the processor can be forced to take a long time to shutdown |
| @OnUnscheduled Pause Time | Specifies how long the processor should sleep in the @OnUnscheduled method, so that the processor can be forced to take a long time to respond when user clicks stop |
| Content Size | The number of bytes to write each time that the FlowFile is written to |
| CustomValidate Pause Time | Specifies how long the processor should sleep in the customValidate() method |
| Fail When @OnScheduled called | Specifies whether or not the Processor should throw an Exception when the methods annotated with @OnScheduled are called |
| Fail When @OnStopped called | Specifies whether or not the Processor should throw an Exception when the methods annotated with @OnStopped are called |
| Fail When @OnUnscheduled called | Specifies whether or not the Processor should throw an Exception when the methods annotated with @OnUnscheduled are called |
| FlowFile Exception Class | Exception class to be thrown (must extend java.lang. RuntimeException). |
| FlowFile Exception Iterations | Number of FlowFiles to throw exception. |
| FlowFile Failure Iterations | Number of FlowFiles to forward to failure relationship. |
| FlowFile Rollback Iterations | Number of FlowFiles to roll back (without penalty). |
| FlowFile Rollback Penalty Iterations | Number of FlowFiles to roll back with penalty. |
| FlowFile Rollback Yield Iterations | Number of FlowFiles to roll back and yield. |
| FlowFile Success Iterations | Number of FlowFiles to forward to success relationship. |
| Ignore Interrupts When Paused | If the Processor’s thread(s) are sleeping (due to one of the “Pause Time” properties above), and the thread is interrupted, this indicates whether the Processor should ignore the interrupt and continue sleeping or if it should allow itself to be interrupted. |
| No FlowFile Exception Class | Exception class to be thrown if no FlowFile (must extend java.lang. RuntimeException). |
| No FlowFile Exception Iterations | Number of times to throw NPE exception if no FlowFile. |
| No FlowFile Skip Iterations | Number of times to skip onTrigger if no FlowFile. |
| No FlowFile Yield Iterations | Number of times to yield if no FlowFile. |
| OnTrigger Pause Time | Specifies how long the processor should sleep in the onTrigger() method, so that the processor can be forced to take a long time to perform its task |
| Write Iterations | Number of times to write to the FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to process. |
| success | FlowFiles processed successfully. |

---
title: DecryptContentAge 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/decryptcontentage.md
section: Loading & Unloading Data
---

# DecryptContentAge 2025.10.9.21

## Bundle

org.apache.nifi | nifi-cipher-nar

## Description

Decrypt content using the age-encryption.org/v1 specification. Detects binary or ASCII armored content encoding using the initial file header bytes. The age standard uses ChaCha20-Poly1305 for authenticated encryption of the payload. The age-keygen command supports generating X25519 key pairs for encryption and decryption operations.

## Tags

ChaCha20-Poly1305, X25519, age, age-encryption.org, encryption

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Private Key Identities | One or more X25519 Private Key Identities, separated with newlines, encoded according to the age specification, starting with AGE-SECRET-KEY-1 |
| Private Key Identity Resources | One or more files or URLs containing X25519 Private Key Identities, separated with newlines, encoded according to the age specification, starting with AGE-SECRET-KEY-1 |
| Private Key Source | Source of information determines the loading strategy for X25519 Private Key Identities |

## Relationships

| Name | Description |
| --- | --- |
| failure | Decryption Failed |
| success | Decryption Completed |

## See also

* [org.apache.nifi.processors.cipher.EncryptContentAge](encryptcontentage.md)

---
title: DecryptContentPGP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/decryptcontentpgp.md
section: Loading & Unloading Data
---

# DecryptContentPGP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-pgp-nar

## Description

Decrypt contents of OpenPGP messages. Using the Packaged Decryption Strategy preserves OpenPGP encoding to support subsequent signature verification.

## Tags

Encryption, GPG, OpenPGP, PGP, RFC 4880

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| decryption-strategy | Strategy for writing files to success after decryption |
| passphrase | Passphrase used for decrypting data encrypted with Password-Based Encryption |
| private-key-service | PGP Private Key Service for decrypting data encrypted with Public Key Encryption |

## Relationships

| Name | Description |
| --- | --- |
| failure | Decryption Failed |
| success | Decryption Succeeded |

## Writes attributes

| Name | Description |
| --- | --- |
| pgp.literal.data.filename | Filename from decrypted Literal Data |
| pgp.literal.data.modified | Modified Date from decrypted Literal Data |
| pgp.symmetric.key.algorithm.block.cipher | Symmetric-Key Algorithm Block Cipher |
| pgp.symmetric.key.algorithm.id | Symmetric-Key Algorithm Identifier |

## See also

* [org.apache.nifi.processors.pgp.EncryptContentPGP](encryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.SignContentPGP](signcontentpgp.md)
* [org.apache.nifi.processors.pgp.VerifyContentPGP](verifycontentpgp.md)

---
title: DeduplicateRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deduplicaterecord.md
section: Loading & Unloading Data
---

# DeduplicateRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor de-duplicates individual records within a record set. It can operate on a per-file basis using an in-memory hashset or bloom filter. When configured with a distributed map cache, it de-duplicates records across multiple files.

## Tags

change, dedupe, distinct, dupe, duplicate, filter, hash, modify, record, replace, text, unique, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| bloom-filter-certainty | The desired false positive probability when using the BloomFilter type. Using a value of .05 for example, guarantees a five-percent probability that the result is a false positive. The closer to 1 this value is set, the more precise the result at the expense of more storage space utilization. |
| cache-identifier | An optional expression language field that overrides the record’s computed cache key. This field has an additional attribute available: ${record.hash.value}, which contains the cache key derived from dynamic properties (if set) or record fields. |
| deduplication-strategy | The strategy to use for detecting and routing duplicate records. The option for detecting duplicates across a single FlowFile operates in-memory, whereas detection spanning multiple FlowFiles utilises a distributed map cache. |
| distributed-map-cache | This property is required when the deduplication strategy is set to ‘multiple files.’ The map cache will for each record, atomically check whether the cache key exists and if not, set it. |
| filter-capacity-hint | An estimation of the total number of unique records to be processed. The more accurate this number is will lead to fewer false negatives on a BloomFilter. |
| filter-type | The filter used to determine whether a record has been seen before based on the matching RecordPath criteria. If hash set is selected, a Java HashSet object will be used to deduplicate all encountered records. If the bloom filter option is selected, a bloom filter will be used. The bloom filter option is less memory intensive, but has a chance of having false positives. |
| include-zero-record-flowfiles | If a FlowFile sent to either the duplicate or non-duplicate relationships contains no records, a value of `false` in this property causes the FlowFile to be dropped. Otherwise, the empty FlowFile is emitted. |
| put-cache-identifier | For each record, check whether the cache identifier exists in the distributed map cache. If it doesn’t exist and this property is true, put the identifier to the cache. |
| record-hashing-algorithm | The algorithm used to hash the cache key. |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| duplicate | Records detected as duplicates are routed to this relationship. |
| failure | If unable to communicate with the cache, the FlowFile will be penalized and routed to this relationship |
| non-duplicate | Records not found in the cache are routed to this relationship. |
| original | The original input FlowFile is sent to this relationship unless a fatal error occurs. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | Number of records written to the destination FlowFile. |

## See also

* [org.apache.nifi.processors.standard.DetectDuplicate](detectduplicate.md)

---
title: DeleteAzureBlobStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deleteazureblobstorage_v12.md
section: Loading & Unloading Data
---

# DeleteAzureBlobStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Deletes the specified blob from Azure Blob Storage. The processor uses Azure Blob Storage client library v12.

## Tags

azure, blob, cloud, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Blob Name | The full name of the blob |
| Container Name | Name of the Azure storage container. In case of PutAzureBlobStorage processor, container can be created if it does not exist. |
| Delete Snapshots Option | Specifies the snapshot deletion options to be used when deleting a blob. |
| Storage Credentials | Controller Service used to obtain Azure Blob Storage Credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Unsuccessful operations will be transferred to the failure relationship. |
| success | All successfully processed FlowFiles are routed to this relationship |

## See also

* [org.apache.nifi.processors.azure.storage.CopyAzureBlobStorage_v12](copyazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureBlobStorage_v12](fetchazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.ListAzureBlobStorage_v12](listazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.PutAzureBlobStorage_v12](putazureblobstorage_v12.md)

---
title: DeleteAzureDataLakeStorage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deleteazuredatalakestorage.md
section: Loading & Unloading Data
---

# DeleteAzureDataLakeStorage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Deletes the provided file from Azure Data Lake Storage

## Tags

adlsgen2, azure, cloud, datalake, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ADLS Credentials | Controller Service used to obtain Azure Credentials. |
| Directory Name | Name of the Azure Storage Directory. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. In case of the PutAzureDataLakeStorage processor, the directory will be created if not already existing. |
| File Name | The filename |
| Filesystem Name | Name of the Azure Storage File System (also called Container). It is assumed to be already existing. |
| Filesystem Object Type | They type of the file system object to be deleted. It can be either folder or file. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Azure storage for some reason are transferred to this relationship |
| success | Files that have been successfully written to Azure storage are transferred to this relationship |

## See also

* [org.apache.nifi.processors.azure.storage.FetchAzureDataLakeStorage](fetchazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.ListAzureDataLakeStorage](listazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.PutAzureDataLakeStorage](putazuredatalakestorage.md)

---
title: DeleteBoxFileMetadataInstance 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deleteboxfilemetadatainstance.md
section: Loading & Unloading Data
---

# DeleteBoxFileMetadataInstance 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Deletes a metadata instance from a Box file using the specified template key

## Tags

box, delete, metadata, storage, templates

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file from which to delete metadata. |
| Template Key | The key of the metadata template instance to delete. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if an error occurs during metadata deletion. |
| file not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile is routed to this relationship after metadata has been successfully deleted. |
| template not found | FlowFiles for which the specified metadata template was not found will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file from which metadata was deleted |
| box.template.key | The template key used for metadata deletion |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.CreateBoxFileMetadataInstance](createboxfilemetadatainstance.md)
* [org.apache.nifi.processors.box.FetchBoxFileMetadataInstance](fetchboxfilemetadatainstance.md)
* [org.apache.nifi.processors.box.UpdateBoxFileMetadataInstance](updateboxfilemetadatainstance.md)

---
title: DeleteByQueryElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletebyqueryelasticsearch.md
section: Loading & Unloading Data
---

# DeleteByQueryElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

Delete from an Elasticsearch index using a query. The query can be loaded from a flowfile body or from the Query parameter.

## Tags

delete, elastic, elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, query

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Client Service | An Elasticsearch client service to use for running queries. |
| Index | The name of the index to use. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Query | A query in JSON syntax, not Lucene syntax. Ex: {“query”:{“match”:{“somefield”:”somevalue”}}}. If this parameter is not set, the query will be read from the flowfile content. If the query (property and flowfile content) is empty, a default empty JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Query Clause | A “query” clause in JSON syntax, not Lucene syntax. Ex: {“match”:{“somefield”:”somevalue”}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Definition Style | How the JSON Query will be defined for use by the processor. |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the “by query” operation fails, and a flowfile was read, it will be sent to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |
| success | If the “by query” operation succeeds, and a flowfile was read, it will be sent to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| elasticsearch.delete.took | The amount of time that it took to complete the delete operation in ms. |
| elasticsearch.delete.error | The error message provided by Elasticsearch if there is an error running the delete. |

---
title: DeleteDBFSResource 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletedbfsresource.md
section: Loading & Unloading Data
---

# DeleteDBFSResource 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Delete a DBFS files and directories.

## Tags

databricks, dbfs, openflow

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| DBFS File Path | DBFS file path e.g. /directory/file.txt |
| Databricks Client | Databricks Client Service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: DeleteDynamoDB 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletedynamodb.md
section: Loading & Unloading Data
---

# DeleteDynamoDB 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Deletes a document from DynamoDB based on hash and range key. The key can be string or number. The request requires all the primary keys for the operation (hash or hash and range key)

## Tags

AWS, Amazon, Delete, DynamoDB, Remove

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Batch items for each request (between 1 and 50) | The items to be retrieved in one batch |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Hash Key Name | The hash key name of the item |
| Hash Key Value | The hash key value of the item |
| Hash Key Value Type | The hash key value type of the item |
| Range Key Name | The range key name of the item |
| Range Key Value |  |
| Range Key Value Type | The range key value type of the item |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Table Name | The DynamoDB table name |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |
| unprocessed | FlowFiles are routed to unprocessed relationship when DynamoDB is not able to process all the items in the request. Typical reasons are insufficient table throughput capacity and exceeding the maximum bytes per request. Unprocessed FlowFiles can be retried with a new request. |

## Writes attributes

| Name | Description |
| --- | --- |
| dynamodb.key.error.unprocessed | DynamoDB unprocessed keys |
| dynmodb.range.key.value.error | DynamoDB range key error |
| dynamodb.key.error.not.found | DynamoDB key not found |
| dynamodb.error.exception.message | DynamoDB exception message |
| dynamodb.error.code | DynamoDB error code |
| dynamodb.error.message | DynamoDB error message |
| dynamodb.error.service | DynamoDB error service |
| dynamodb.error.retryable | DynamoDB error is retryable |
| dynamodb.error.request.id | DynamoDB error request id |
| dynamodb.error.status.code | DynamoDB status code |

## See also

* [org.apache.nifi.processors.aws.dynamodb.GetDynamoDB](getdynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDB](putdynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDBRecord](putdynamodbrecord.md)

---
title: DeleteFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletefile.md
section: Loading & Unloading Data
---

# DeleteFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Deletes a file from the filesystem.

## Tags

delete, file, files, filesystem, local, remove

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Directory Path | The path to the directory the file to delete is located in. |
| Filename | The name of the file to delete. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |
| write filesystem | Provides operator the ability to delete any file that NiFi has access to. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles, for which an existing file could not be deleted, are routed to this relationship |
| not found | All FlowFiles, for which the file to delete did not exist, are routed to this relationship |
| success | All FlowFiles, for which an existing file has been deleted, are routed to this relationship |

## Use cases

|  |
| --- |
| Delete source file only after its processing completed |

---
title: DeleteGCSObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletegcsobject.md
section: Loading & Unloading Data
---

# DeleteGCSObject 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Deletes objects from a Google Cloud Bucket. If attempting to delete a file that does not exist, FlowFile is routed to success.

## Tags

delete, gcs, google, google cloud, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| gcp-project-id | Google Cloud Project ID |
| gcp-retry-count | How many retry attempts should be made before routing to the failure relationship. |
| gcs-bucket | Bucket of the object. |
| gcs-generation | The generation of the object to be deleted. If null, will use latest version of the object. |
| gcs-key | Name of the object. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| storage-api-url | Overrides the default storage URL. Configuring an alternative Storage API URL also overrides the HTTP Host header on requests as described in the Google documentation for Private Service Connections. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the Google Cloud Storage operation fails. |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Storage operation. |

## See also

* [org.apache.nifi.processors.gcp.storage.FetchGCSObject](fetchgcsobject.md)
* [org.apache.nifi.processors.gcp.storage.ListGCSBucket](listgcsbucket.md)
* [org.apache.nifi.processors.gcp.storage.PutGCSObject](putgcsobject.md)

---
title: DeleteGridFS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletegridfs.md
section: Loading & Unloading Data
---

# DeleteGridFS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Deletes a file from GridFS using a file name or a query.

## Tags

delete, gridfs, mongodb

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| delete-gridfs-query | A valid MongoDB query to use to find and delete one or more files from GridFS. |
| gridfs-bucket-name | The GridFS bucket where the files will be stored. If left blank, it will use the default value ‘fs’ that the MongoDB client driver uses. |
| gridfs-client-service | The MongoDB client service to use for database connections. |
| gridfs-database-name | The name of the database to use |
| gridfs-file-name | The name of the file in the bucket that is the target of this processor. GridFS file names do not include path information because GridFS does not sort files into folders within a bucket. |
| mongo-query-attribute | If set, the query will be written to a specified attribute on the output flowfiles. |

## Relationships

| Name | Description |
| --- | --- |
| failure | When there is a failure processing the flowfile, it goes to this relationship. |
| success | When the operation succeeds, the flowfile is sent to this relationship. |

---
title: DeleteMilvus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletemilvus.md
section: Loading & Unloading Data
---

# DeleteMilvus 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-milvus-processors-nar

## Description

Deletes vectors from Milvus database from a collection by ID. Unmatched IDs are ignored by Milvus and not deleted.

## Tags

chatbot, delete, embeddings, gen ai, genai, generative ai, llm, metadata, milvus, openflow, text, vector

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Collection Name | The name of the Milvus collection name to use |
| Delete Filter | The filter to use in the delete request. Example: id like “prefix%” |
| Delete Strategy | The strategy to use for deleting vectors in Milvus |
| ID Record Path | The path to the ID field in the record |
| Milvus Connection Service | Connection Service for accessing Milvus Database |
| Partition | Partition of the vector database that you want to perform operations in. If the database has only one partition leave empty. |
| Record Reader | The Record Reader to use for reading the FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Milvus, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Milvus, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Milvus are routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.milvus.UpsertMilvus](upsertmilvus.md)

---
title: DeleteMongo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletemongo.md
section: Loading & Unloading Data
---

# DeleteMongo 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Executes a delete query against a MongoDB collection. The query is provided in the body of the flowfile and the user can select whether it will delete one or many documents that match it.

## Tags

delete, mongo, mongodb

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| delete-mongo-delete-mode | Choose between deleting one document by query or many documents by query. |
| delete-mongo-fail-on-no-delete | Determines whether to send the flowfile to the success or failure relationship if nothing is successfully deleted. |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be written to MongoDB are routed to this relationship |
| success | All FlowFiles that are written to MongoDB are routed to this relationship |

---
title: DeletePinecone 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletepinecone.md
section: Loading & Unloading Data
---

# DeletePinecone 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-pinecone-nar

## Description

Deletes vectors from a Pinecone index.

## Tags

delete, embeddings, genai, generative ai, openflow, pinecone, rag, retrieval augmented generation, vector store

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ID Prefix | The Pinecone vector ID prefix. If specified, only the vectors whose IDs start with the given value will be deleted. |
| Pinecone API Key | The API key for the Pinecone service |
| Pinecone Index | The name of the Pinecone index to use |
| Pinecone Namespace | The name of the Pinecone namespace to use |
| Web Client Service | The Web Client Service to use for communicating with Pinecone |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Pinecone, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Pinecone, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Pinecone are routed to this relationship |

## Use cases

|  |
| --- |
| Delete all vectors from a Pinecone index. |
| Delete a namespace, along with all of its vectors, from a Pinecone index. |
| Delete all vectors for a particular document from a Pinecone index. |

---
title: DeleteQueryJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletequeryjob.md
section: Loading & Unloading Data
---

# DeleteQueryJob 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Deletes a Query Job in Salesforce using the Bulk API 2.0.

## Tags

bulk, delete, job, preview, query, salesforce

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Job ID | The ID of the job for which the status is checked. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the Query Job status could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the Query Job status could not be retrieved |
| success | If the Query Job has been successfully deleted, the FlowFile is routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.AbortQueryJob](abortqueryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobStatus](getqueryjobstatus.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: DeleteS3Object 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletes3object.md
section: Loading & Unloading Data
---

# DeleteS3Object 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Deletes a file from an Amazon S3 Bucket. If attempting to delete a file that does not exist, FlowFile is routed to success.

## Tags

AWS, Amazon, Archive, Delete, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| FullControl User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Full Control for an object |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Owner | The Amazon ID to use for the object’s owner |
| Read ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to read the Access Control List for an object |
| Read Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Read Access for an object |
| Region | The AWS Region to connect to. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Version | The Version of the Object to delete |
| Write ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to change the Access Control List for an object |
| Write Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Write Access for an object |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| s3.exception | The class name of the exception thrown during processor execution |
| s3.additionalDetails | The S3 supplied detail from the failed operation |
| s3.statusCode | The HTTP error code (if available) from the failed operation |
| s3.errorCode | The S3 moniker of the failed operation |
| s3.errorMessage | The S3 exception message from the failed operation |

## See also

* [org.apache.nifi.processors.aws.s3.CopyS3Object](copys3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: DeleteSFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletesftp.md
section: Loading & Unloading Data
---

# DeleteSFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Deletes a file residing on an SFTP server.

## Tags

delete, remote, remove, sftp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Algorithm Negotiation | Configuration strategy for SSH algorithm negotiation |
| Batch Size | The maximum number of FlowFiles to send in a single connection |
| Ciphers Allowed | A comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, aes128-cbc, aes128-ctr, [aes128-gcm@openssh.com](mailto:aes128-gcm%40openssh.com), aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, [aes256-gcm@openssh.com](mailto:aes256-gcm%40openssh.com), arcfour128, arcfour256, blowfish-cbc, [chacha20-poly1305@openssh.com](mailto:chacha20-poly1305%40openssh.com), none |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Directory Path | The path to the directory the file to delete is located in. |
| Filename | The name of the file to delete. |
| Host Key File | If supplied, the given file will be used as the Host Key; otherwise, if ‘Strict Host Key Checking’ property is applied (set to true) then uses the ‘known_hosts’ and ‘known_hosts2’ files from ~/.ssh directory else no host key file will be used |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Key Algorithms Allowed | A comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, [ecdsa-sha2-nistp256-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp256-cert-v01%40openssh.com), ecdsa-sha2-nistp384, [ecdsa-sha2-nistp384-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp384-cert-v01%40openssh.com), ecdsa-sha2-nistp521, [ecdsa-sha2-nistp521-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp521-cert-v01%40openssh.com), rsa-sha2-256, [rsa-sha2-256-cert-v01@openssh.com](mailto:rsa-sha2-256-cert-v01%40openssh.com), rsa-sha2-512, [rsa-sha2-512-cert-v01@openssh.com](mailto:rsa-sha2-512-cert-v01%40openssh.com), [sk-ecdsa-sha2-nistp256@openssh.com](mailto:sk-ecdsa-sha2-nistp256%40openssh.com), [sk-ssh-ed25519@openssh.com](mailto:sk-ssh-ed25519%40openssh.com), ssh-dss, [ssh-dss-cert-v01@openssh.com](mailto:ssh-dss-cert-v01%40openssh.com), ssh-ed25519, [ssh-ed25519-cert-v01@openssh.com](mailto:ssh-ed25519-cert-v01%40openssh.com), ssh-rsa, [ssh-rsa-cert-v01@openssh.com](mailto:ssh-rsa-cert-v01%40openssh.com) |
| Key Exchange Algorithms Allowed | A comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, [curve25519-sha256@libssh.org](mailto:curve25519-sha256%40libssh.org), curve448-sha512, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group15-sha512, diffie-hellman-group16-sha512, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, mlkem1024nistp384-sha384, mlkem768nistp256-sha256, mlkem768x25519-sha256, sntrup761x25519-sha512, [sntrup761x25519-sha512@openssh.com](mailto:sntrup761x25519-sha512%40openssh.com) |
| Message Authentication Codes Allowed | A comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-sha1, hmac-sha1-96, [hmac-sha1-etm@openssh.com](mailto:hmac-sha1-etm%40openssh.com), hmac-sha2-256, [hmac-sha2-256-etm@openssh.com](mailto:hmac-sha2-256-etm%40openssh.com), hmac-sha2-512, [hmac-sha2-512-etm@openssh.com](mailto:hmac-sha2-512-etm%40openssh.com) |
| Password | Password for the user account |
| Port | The port that the remote system is listening on for file transfers |
| Private Key Passphrase | Password for the private key |
| Private Key Path | The fully qualified path to the Private Key file |
| Send Keep Alive On Timeout | Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds. |
| Strict Host Key Checking | Indicates whether or not strict enforcement of hosts keys should be applied |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles, for which an existing file could not be deleted, are routed to this relationship |
| not found | All FlowFiles, for which the file to delete did not exist, are routed to this relationship |
| success | All FlowFiles, for which an existing file has been deleted, are routed to this relationship |

## Use cases

|  |
| --- |
| Delete source file only after its processing completed |

---
title: DeleteSQS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deletesqs.md
section: Loading & Unloading Data
---

# DeleteSQS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Deletes a message from an Amazon Simple Queuing Service Queue

## Tags

AWS, Amazon, Delete, Queue, SQS

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Queue URL | The URL of the queue delete from |
| Receipt Handle | The identifier that specifies the receipt of the message |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## See also

* [org.apache.nifi.processors.aws.sqs.GetSQS](getsqs.md)
* [org.apache.nifi.processors.aws.sqs.PutSQS](putsqs.md)

---
title: DeleteUnityCatalogResource 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/deleteunitycatalogresource.md
section: Loading & Unloading Data
---

# DeleteUnityCatalogResource 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Delete a Unity Catalog file or directory.

## Tags

databricks, openflow, unity catalog

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Missing Resource Policy | What to action to take if the resource is not found. |
| Unity Catalog Resource Path | Unity Catalog resource path e.g. /Volumes/catalog/schema/volume_name/path |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: DescribeDataShare 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/describedatashare.md
section: Loading & Unloading Data
---

# DescribeDataShare 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Describe the specified data share metadata in Salesforce Data Cloud.

## Tags

daas, data cloud, describe, object, preview, salesforce, sfdc

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Data Share Name | The name of the Data Share to describe. |
| Salesforce Data Cloud Client | Salesforce Data Cloud Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the data share metadata could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the data share metadata could not be retrieved |
| success | FlowFile containing the data share metadata will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| explicitDataLakeObjects | Comma-separated list of the names of the explicit data lake objects. |
| implicitDataLakeObjects | Comma-separated list of the names of the implicit data lake objects. |
| dataModelObjects | Comma-separated list of the names of the data model objects. |
| calculatedInsightObjects | Comma-separated list of the names of the calculated insights objects. |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.ListSFDCDataShares](listsfdcdatashares.md)

---
title: DescribeSFDCObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/describesfdcobject.md
section: Loading & Unloading Data
---

# DescribeSFDCObject 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Describe the specified object metadata in Salesforce.

## Tags

describe, object, preview, salesforce, sfdc

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Object Fields Filter JSON | JSON representation describing which fields to include or exclude for Salesforce objects. |
| Object Name | The name of the object to describe. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the object metadata could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the object metadata could not be retrieved |
| success | FlowFile containing the object metadata will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| sObjectFields | Comma-separated list of the fields of the object (without non-queryable fields). |
| sObjectExcludedFields | Comma-separated list of the non-queryable fields of the object. |
| sObjectSchema | The schema associated to the object based on its fields (without non-queryable fields). |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.AbortQueryJob](abortqueryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.ListSFDCObjects](listsfdcobjects.md)

---
title: DetectDuplicate 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/detectduplicate.md
section: Loading & Unloading Data
---

# DetectDuplicate 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Caches a value, computed from FlowFile attributes, for each incoming FlowFile and determines if the cached value has already been seen. If so, routes the FlowFile to ‘duplicate’ with an attribute named ‘original.identifier’ that specifies the original FlowFile ‘s “description”, which is specified in the <FlowFile Description> property. If the FlowFile is not determined to be a duplicate, the Processor routes the FlowFile to’ non-duplicate’

## Tags

dedupe, dupe, duplicate, hash

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Age Off Duration | Time interval to age off cached FlowFiles |
| Cache Entry Identifier | A FlowFile attribute, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the value used to identify duplicates; it is this value that is cached |
| Cache The Entry Identifier | When true this cause the processor to check for duplicates and cache the Entry Identifier. When false, the processor would only check for duplicates and not cache the Entry Identifier, requiring another processor to add identifiers to the distributed cache. |
| Distributed Cache Service | The Controller Service that is used to cache unique identifiers, used to determine duplicates |
| FlowFile Description | When a FlowFile is added to the cache, this value is stored along with it so that if a duplicate is found, this description of the original FlowFile will be added to the duplicate’s “original.flowfile.description” attribute |

## Relationships

| Name | Description |
| --- | --- |
| duplicate | If a FlowFile has been detected to be a duplicate, it will be routed to this relationship |
| failure | If unable to communicate with the cache, the FlowFile will be penalized and routed to this relationship |
| non-duplicate | If a FlowFile’s Cache Entry Identifier was not found in the cache, it will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| original.flowfile.description | All FlowFiles routed to the duplicate relationship will have an attribute added named original.flowfile.description. The value of this attribute is determined by the attributes of the original copy of the data and by the FlowFile Description property. |

## See also

---
title: DeveloperBoxClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/developerboxclientservice.md
section: Loading & Unloading Data
---

# DeveloperBoxClientService

## Description

Provides Box client objects through which Box API calls can be used. This using a developer token and is for testing only.

## Tags

box, client, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Developer Token \* | Developer Token |  |  | The Developer Token to use to interact with the Box API. This is for testing only and should not be used in production. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DistributedMapCacheLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/distributedmapcachelookupservice.md
section: Loading & Unloading Data
---

# DistributedMapCacheLookupService

## Description

Allows to choose a distributed map cache client to retrieve the value associated to a key. The coordinates that are passed to the lookup must contain the key ‘key’.

## Tags

cache, distributed, enrich, key, lookup, map, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Character Encoding \* | character-encoding | UTF-8 | * ISO-8859-1 * UTF-8 * UTF-16 * UTF-16LE * UTF-16BE * US-ASCII | Specifies a character encoding to use. |
| Distributed Cache Service \* | distributed-map-cache-service |  |  | The Controller Service that is used to get the cached values. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: DistributeLoad 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/distributeload.md
section: Loading & Unloading Data
---

# DistributeLoad 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Distributes FlowFiles to downstream processors based on a Distribution Strategy. If using the Round Robin strategy, the default is to assign each destination a weighting of 1 (evenly distributed). However, optional properties can be added to the change this; adding a property with the name ‘5’ and value ‘10’ means that the relationship with name ‘5’ will be receive 10 FlowFiles in each iteration instead of 1.

## Tags

distribute, load balance, round robin, route, weighted

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Distribution Strategy | Determines how the load will be distributed. Relationship weight is in numeric order where ‘1’ has the greatest weight. |
| Number of Relationships | Determines the number of Relationships to which the load should be distributed |

## Relationships

| Name | Description |
| --- | --- |
| 1 | Where to route flowfiles for this relationship index |

## Writes attributes

| Name | Description |
| --- | --- |
| distribute.load.relationship | The name of the specific relationship the FlowFile has been routed through |

---
title: DuplicateFlowFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/duplicateflowfile.md
section: Loading & Unloading Data
---

# DuplicateFlowFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Intended for load testing, this processor will create the configured number of copies of each incoming FlowFile. The original FlowFile as well as all generated copies are sent to the ‘success’ relationship. In addition, each FlowFile gets an attribute ‘copy.index’set to the copy number, where the original FlowFile gets a value of zero, and all copies receive incremented integer values.

## Tags

duplicate, load, test

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Number of Copies | Specifies how many copies of each incoming FlowFile will be made |

## Relationships

| Name | Description |
| --- | --- |
| success | The original FlowFile and all copies will be sent to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| copy.index | A zero-based incrementing integer value based on which copy the FlowFile is. |

---
title: ElasticSearchClientServiceImpl
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/elasticsearchclientserviceimpl.md
section: Loading & Unloading Data
---

# ElasticSearchClientServiceImpl

## Description

A controller service for accessing an Elasticsearch client, using the Elasticsearch (low-level) REST Client.

## Tags

client, elasticsearch, elasticsearch6, elasticsearch7, elasticsearch8

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API Key \* | API Key |  |  | Encoded API key. |
| API Key ID \* | API Key ID |  |  | Unique identifier of the API key. |
| Authorization Scheme \* | Authorization Scheme | BASIC | * None * PKI * Basic * API Key * JWT | Authorization Scheme used for optional authentication to Elasticsearch. |
| Character Set \* | Character Set | UTF-8 |  | The charset to use for interpreting the response from Elasticsearch. |
| Connect timeout \* | Connect timeout | 5000 |  | Controls the amount of time, in milliseconds, before a timeout occurs when trying to connect. |
| Enable Compression \* | Enable Compression | false | * true * false | Whether the REST client should compress requests using gzip content encoding and add the “Accept-Encoding: gzip” header to receive compressed responses |
| HTTP Hosts \* | HTTP Hosts |  |  | A comma-separated list of HTTP hosts that host Elasticsearch query nodes.The HTTP Hosts should be valid URIs including protocol, domain and port for each entry.For example “<https://elasticsearch1:9200>, <https://elasticsearch2:9200>”.Note that the Host is included in requests as a header (typically including domain and port, e.g. elasticsearch:9200). |
| JWT Shared Secret \* | JWT Shared Secret |  |  | JWT realm Shared Secret. |
| Node Selector \* | Node Selector | ANY | * Any * Skip Dedicated Masters | Selects Elasticsearch nodes that can receive requests. Used to keep requests away from dedicated Elasticsearch master nodes |
| OAuth2 Access Token Provider \* | OAuth2 Access Token Provider |  |  | The OAuth2 Access Token Provider used to provide JWTs for Bearer Token Authorization with Elasticsearch. |
| Password \* | Password |  |  | The password to use with XPack security. |
| Path Prefix | Path Prefix |  |  | Sets the path’s prefix for every request used by the http client. For example, if this is set to “/my/path”, then any client request will become “/my/path/” + endpoint. In essence, every request’s endpoint is prefixed by this pathPrefix. The path prefix is useful for when Elasticsearch is behind a proxy that provides a base path or a proxy that requires all paths to start with ‘/’; it is not intended for other purposes and it should not be supplied in other scenarios |
| Read Timeout \* | Read Timeout | 60000 |  | Controls the amount of time, in milliseconds, before a timeout occurs when waiting for a response. |
| Run As User | Run As User |  |  | The username to impersonate within Elasticsearch. |
| SSL Context Service | SSL Context Service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections. This service only applies if the Elasticsearch endpoint(s) have been secured with TLS/SSL. |
| Send Meta Header \* | Send Meta Header | true | * true * false | Whether to send a “X-Elastic-Client-Meta” header that describes the runtime environment. It contains information that is similar to what could be found in User-Agent. Using a separate header allows applications to use User-Agent for their own needs, e.g. to identify application version or other environment information |
| Sniff Cluster Nodes \* | Sniff Cluster Nodes | false | * true * false | Periodically sniff for nodes within the Elasticsearch cluster via the Elasticsearch Node Info API. If Elasticsearch security features are enabled (default to “true” for 8.x+), the Elasticsearch user must have the “monitor” or “manage” cluster privilege to use this API.Note that all HTTP Hosts (and those that may be discovered within the cluster using the Sniffer) must use the same protocol, e.g. http or https, and be contactable using the same client settings. Finally the Elasticsearch “network.publish_host” must match one of the “network.bind_host” list entries see <https://www.elastic.co/guide/en/elasticsearch/reference/current/modules-network.html> for more information |
| Sniff on Failure \* | Sniff on Failure | false | * true * false | Enable sniffing on failure, meaning that after each failure the Elasticsearch nodes list gets updated straight away rather than at the following ordinary sniffing round |
| Sniffer Failure Delay \* | Sniffer Failure Delay | 1 min |  | Delay between an Elasticsearch request failure and updating available Cluster nodes using the Sniffer |
| Sniffer Interval \* | Sniffer Interval | 5 mins |  | Interval between Cluster sniffer operations |
| Sniffer Request Timeout \* | Sniffer Request Timeout | 1 sec |  | Cluster sniffer timeout for node info requests |
| Strict Deprecation \* | Strict Deprecation | false | * true * false | Whether the REST client should return any response containing at least one warning header as a failure |
| Suppress Null and Empty Values \* | Suppress Null and Empty Values | always-suppress | * Never Suppress * Always Suppress | Specifies how the writer should handle null and empty fields (including objects and arrays) |
| Username \* | Username |  |  | The username to use with XPack security. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ElasticSearchLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/elasticsearchlookupservice.md
section: Loading & Unloading Data
---

# ElasticSearchLookupService

## Description

Lookup a record from Elasticsearch Server associated with the specified document ID. The coordinates that are passed to the lookup must contain the key ‘id’.

## Tags

elasticsearch, enrich, lookup, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Client Service \* | Client Service |  |  | An ElasticSearch client service to use for running queries. |
| Index \* | Index |  |  | The name of the index to read from |
| Schema Access Strategy \* | Schema Access Strategy | infer | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Infer from Result | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Type | Type |  |  | The type of this document (used by Elasticsearch for indexing and searching) |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ElasticSearchStringLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/elasticsearchstringlookupservice.md
section: Loading & Unloading Data
---

# ElasticSearchStringLookupService

## Description

Lookup a string value from Elasticsearch Server associated with the specified document ID. The coordinates that are passed to the lookup must contain the key ‘id’.

## Tags

elasticsearch, enrich, key, lookup, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Client Service \* | Client Service |  |  | An ElasticSearch client service to use for running queries. |
| Index \* | Index |  |  | The name of the index to read from |
| Type | Type |  |  | The type of this document (used by Elasticsearch for indexing and searching) |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: EmailRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/emailrecordsink.md
section: Loading & Unloading Data
---

# EmailRecordSink

## Description

Provides a RecordSinkService that can be used to send records in email using the specified writer for formatting.

## Tags

email, record, send, sink, smtp, write

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| BCC | bcc |  |  | The recipients to include in the BCC-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |
| CC | cc |  |  | The recipients to include in the CC-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |
| From \* | from |  |  | Specifies the Email address to use as the sender. Comma separated sequence of addresses following RFC822 syntax. |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |
| SMTP Auth \* | smtp-auth | true |  | Flag indicating whether authentication should be used |
| SMTP Hostname \* | smtp-hostname |  |  | The hostname of the SMTP Server that is used to send Email Notifications |
| SMTP Password | smtp-password |  |  | Password for the SMTP account |
| SMTP Port \* | smtp-port | 25 |  | The Port used for SMTP communications |
| SMTP SSL \* | smtp-ssl | false |  | Flag indicating whether SSL should be enabled |
| SMTP STARTTLS \* | smtp-starttls | false |  | Flag indicating whether STARTTLS should be enabled. If the server does not support STARTTLS, the connection continues without the use of TLS |
| SMTP Username | smtp-username |  |  | Username for the SMTP account |
| SMTP X-Mailer Header \* | smtp-xmailer-header | NiFi |  | X-Mailer used in the header of the outgoing email |
| Subject \* | subject | Message from NiFi |  | The email subject |
| To | to |  |  | The recipients to include in the To-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: EmbeddedHazelcastCacheManager
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/embeddedhazelcastcachemanager.md
section: Loading & Unloading Data
---

# EmbeddedHazelcastCacheManager

## Description

A service that runs embedded Hazelcast and provides cache instances backed by that. The server does not ask for authentication, it is recommended to run it within secured network.

## Tags

cache, hazelcast

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Hazelcast Cluster Name \* | hazelcast-cluster-name | nifi |  | Name of the Hazelcast cluster. |
| Hazelcast Clustering Strategy \* | hazelcast-clustering-strategy | none | * None * All Nodes * Explicit | Specifies with what strategy the Hazelcast cluster should be created. |
| Hazelcast Instances | hazelcast-instances |  |  | Only used with “Explicit” Clustering Strategy! List of NiFi instance host names which should be part of the Hazelcast cluster. Host names are separated by comma. The port specified in the “Hazelcast Port” property will be used as server port. The list must contain every instance that will be part of the cluster. Other instances will join the Hazelcast cluster as clients. |
| Hazelcast Port \* | hazelcast-port | 5701 |  | Port for the Hazelcast instance to use. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: EncodeContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/encodecontent.md
section: Loading & Unloading Data
---

# EncodeContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Encode or decode the contents of a FlowFile using Base64, Base32, or hex encoding schemes

## Tags

base32, base64, decode, encode, hex

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Encoded Line Length | Each line of encoded data will contain up to the configured number of characters, rounded down to the nearest multiple of 4. |
| Encoding | Specifies the type of encoding used. |
| Line Output Mode | Controls the line formatting for encoded content based on selected property values. |
| Mode | Specifies whether the content should be encoded or decoded. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that cannot be encoded or decoded will be routed to failure |
| success | Any FlowFile that is successfully encoded or decoded will be routed to success |

---
title: EncryptContentAge 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/encryptcontentage.md
section: Loading & Unloading Data
---

# EncryptContentAge 2025.10.9.21

## Bundle

org.apache.nifi | nifi-cipher-nar

## Description

Encrypt content using the age-encryption.org/v1 specification. Supports binary or ASCII armored content encoding using configurable properties. The age standard uses ChaCha20-Poly1305 for authenticated encryption of the payload. The age-keygen command supports generating X25519 key pairs for encryption and decryption operations.

## Tags

ChaCha20-Poly1305, X25519, age, age-encryption.org, encryption

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File Encoding | Output encoding for encrypted files. Binary encoding provides optimal processing performance. |
| Public Key Recipient Resources | One or more files or URLs containing X25519 Public Key Recipients, separated with newlines, encoded according to the age specification, starting with age1 |
| Public Key Recipients | One or more X25519 Public Key Recipients, separated with newlines, encoded according to the age specification, starting with age1 |
| Public Key Source | Source of information determines the loading strategy for X25519 Public Key Recipients |

## Relationships

| Name | Description |
| --- | --- |
| failure | Encryption Failed |
| success | Encryption Completed |

## See also

* [org.apache.nifi.processors.cipher.DecryptContentAge](decryptcontentage.md)

---
title: EncryptContentPGP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/encryptcontentpgp.md
section: Loading & Unloading Data
---

# EncryptContentPGP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-pgp-nar

## Description

Encrypt contents using OpenPGP. The processor reads input and detects OpenPGP messages to avoid unnecessary additional wrapping in Literal Data packets.

## Tags

Encryption, GPG, OpenPGP, PGP, RFC 4880

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| file-encoding | File Encoding for encryption |
| passphrase | Passphrase used for encrypting data with Password-Based Encryption |
| public-key-search | PGP Public Key Search will be used to match against the User ID or Key ID when formatted as uppercase hexadecimal string of 16 characters |
| public-key-service | PGP Public Key Service for encrypting data with Public Key Encryption |
| symmetric-key-algorithm | Symmetric-Key Algorithm for encryption |

## Relationships

| Name | Description |
| --- | --- |
| failure | Encryption Failed |
| success | Encryption Succeeded |

## Writes attributes

| Name | Description |
| --- | --- |
| pgp.symmetric.key.algorithm | Symmetric-Key Algorithm |
| pgp.symmetric.key.algorithm.block.cipher | Symmetric-Key Algorithm Block Cipher |
| pgp.symmetric.key.algorithm.key.size | Symmetric-Key Algorithm Key Size |
| pgp.symmetric.key.algorithm.id | Symmetric-Key Algorithm Identifier |
| pgp.file.encoding | File Encoding |
| pgp.compression.algorithm | Compression Algorithm |
| pgp.compression.algorithm.id | Compression Algorithm Identifier |

## See also

* [org.apache.nifi.processors.pgp.DecryptContentPGP](decryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.SignContentPGP](signcontentpgp.md)
* [org.apache.nifi.processors.pgp.VerifyContentPGP](verifycontentpgp.md)

---
title: EnforceOrder 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/enforceorder.md
section: Loading & Unloading Data
---

# EnforceOrder 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Enforces expected ordering of FlowFiles that belong to the same data group within a single node. Although PriorityAttributePrioritizer can be used on a connection to ensure that flow files going through that connection are in priority order, depending on error-handling, branching, and other flow designs, it is possible for FlowFiles to get out-of-order. EnforceOrder can be used to enforce original ordering for those FlowFiles. [IMPORTANT] In order to take effect of EnforceOrder, FirstInFirstOutPrioritizer should be used at EVERY downstream relationship UNTIL the order of FlowFiles physically get FIXED by operation such as MergeContent or being stored to the final destination.

## Tags

order, sort

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| batch-count | The maximum number of FlowFiles that EnforceOrder can process at an execution. |
| group-id | EnforceOrder is capable of multiple ordering groups. ‘Group Identifier’ is used to determine which group a FlowFile belongs to. This property will be evaluated with each incoming FlowFile. If evaluated result is empty, the FlowFile will be routed to failure. |
| inactive-timeout | Indicates the duration after which state for an inactive group will be cleared from managed state. Group is determined as inactive if any new incoming FlowFile has not seen for a group for specified duration. Inactive Timeout must be longer than Wait Timeout. If a FlowFile arrives late after its group is already cleared, it will be treated as a brand new group, but will never match the order since expected preceding FlowFiles are already gone. The FlowFile will eventually timeout for waiting and routed to ‘overtook’. To avoid this, group states should be kept long enough, however, shorter duration would be helpful for reusing the same group identifier again. |
| initial-order | When the first FlowFile of a group arrives, initial target order will be computed and stored in the managed state. After that, target order will start being tracked by EnforceOrder and stored in the state management store. If Expression Language is used but evaluated result was not an integer, then the FlowFile will be routed to failure, and initial order will be left unknown until consecutive FlowFiles provide a valid initial order. |
| maximum-order | If specified, any FlowFiles that have larger order will be routed to failure. This property is computed only once for a given group. After a maximum order is computed, it will be persisted in the state management store and used for other FlowFiles belonging to the same group. If Expression Language is used but evaluated result was not an integer, then the FlowFile will be routed to failure, and maximum order will be left unknown until consecutive FlowFiles provide a valid maximum order. |
| order-attribute | A name of FlowFile attribute whose value will be used to enforce order of FlowFiles within a group. If a FlowFile does not have this attribute, or its value is not an integer, the FlowFile will be routed to failure. |
| wait-timeout | Indicates the duration after which waiting FlowFiles will be routed to the ‘overtook’ relationship. |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | EnforceOrder uses following states per ordering group: ‘<groupId>.target’ is a order number which is being waited to arrive next. When a FlowFile with a matching order arrives, or a FlowFile overtakes the FlowFile being waited for because of wait timeout, target order will be updated to (FlowFile.order + 1). ‘<groupId>.max is the maximum order number for a group. ‘<groupId>.updatedAt’ is a timestamp when the order of a group was updated last time. These managed states will be removed automatically once a group is determined as inactive, see ‘Inactive Timeout’ for detail. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFiles which does not have required attributes, or fails to compute those will be routed to this relationship |
| overtook | A FlowFile that waited for preceding FlowFiles longer than Wait Timeout and overtook those FlowFiles, will be routed to this relationship. |
| skipped | A FlowFile that has an order younger than current, which means arrived too late and skipped, will be routed to this relationship. |
| success | A FlowFile with a matching order number will be routed to this relationship. |
| wait | A FlowFile with non matching order will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| EnforceOrder.startedAt | All FlowFiles going through this processor will have this attribute. This value is used to determine wait timeout. |
| EnforceOrder.result | All FlowFiles going through this processor will have this attribute denoting which relationship it was routed to. |
| EnforceOrder.detail | FlowFiles routed to ‘failure’ or ‘skipped’ relationship will have this attribute describing details. |
| EnforceOrder.expectedOrder | FlowFiles routed to ‘wait’ or ‘skipped’ relationship will have this attribute denoting expected order when the FlowFile was processed. |

---
title: EnrichAttributes 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/enrichattributes.md
section: Loading & Unloading Data
---

# EnrichAttributes 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-enrichment-nar

## Description

Looks up a value using the configured Lookup Service and adds the results to the FlowFile as one or more attributes. Frequently, this is used in conjunction with the DatabaseLookup Service in order to enrich a FlowFile by querying a database and adding the results as attributes.

## Tags

attributes, database, enrichment, json, lookup, openflow

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Name | The name of the attribute to add, whose contents will be the JSON representation of the Record returned from the Lookup Service. |
| Attribute Prefix | A prefix to apply to all attribute names that are added. |
| Flattening Strategy | When a Record is returned from the Lookup Service, this property specifies how the Record should be flattened into the FlowFile’s attributes |
| Lookup Service | The Lookup Service to use for enrichment |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to enrich a given FlowFile for any reason, the FlowFile will be routed to this relationship. |
| matched | FlowFiles that are successfully enriched with the Record from the Lookup Service are routed to this relationship. |
| unmatched | FlowFiles for which the Lookup Service did not find a match are routed to this relationship. |

## Use cases

|  |
| --- |
| Query a database to retrieve information based on the attributes of a FlowFile |

## See also

---
title: EnrichCdcStream 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/enrichcdcstream.md
section: Loading & Unloading Data
---

# EnrichCdcStream 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Enriches incoming FlowFiles that come from CaptureChangePostgreSQL, etc. with information pertaining to which Journal Table to write to and relevant schema information. This Processor manages the schema versions for each table being processed in order to ensure that the correct Journal Table is used for each FlowFile.

## Tags

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| CDC Schema Registry | Specifies the CDC Schema Registry to use for managing the schemas of the CDC data |
| Record Reader | Specifies the Record Reader to use for reading the incoming data |
| Record Writer | Specifies the Record Writer to use for writing the outgoing data |
| Table State Service | Holds the state of replicated tables |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Tracks the current journal table version for each table being processed. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If any FlowFile is unable to be read, it will be routed to this Relationship. |
| schema update | If any schema update is required in order to handle incoming Records, a FlowFile is routed to this relationship. The FlowFile will include the schema information to indicate what changes are required. |
| skipped ddl event | This Relationship will be used for any DDL / Schema Change events that do not result in a change to the destination table’s schema. |
| success | Rows to be inserted into the Snowflake table will be routed to this Relationship. |
| table not in state | Used when a FlowFile references a table that does not exist in the state of replicated tables, probably after it was removed from replication. |

## Writes attributes

| Name | Description |
| --- | --- |
| table.schema.generation | The index of the journal table for incremental processing. |
| table.schema.initial | Marks the initial generation of a journal table. |
| destination.table.schema | The updated schema for the destination table. This attribute is only written for DDL events. |

## See also

* [com.snowflake.openflow.runtime.processors.database.CaptureChangePostgreSQL](capturechangepostgresql.md)

---
title: EvaluateJsonPath 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluatejsonpath.md
section: Loading & Unloading Data
---

# EvaluateJsonPath 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more JsonPath expressions against the content of a FlowFile. The results of those expressions are assigned to FlowFile Attributes or are written to the content of the FlowFile itself, depending on configuration of the Processor. JsonPaths are entered by adding user-defined properties; the name of the property maps to the Attribute Name into which the result will be placed (if the Destination is flowfile-attribute; otherwise, the property name is ignored). The value of the property must be a valid JsonPath expression. A Return Type of ‘auto-detect’ will make a determination based off the configured destination. When ‘Destination’ is set to ‘flowfile-attribute,’ a return type of ‘scalar’ will be used. When ‘Destination’ is set to ‘flowfile-content,’ a return type of ‘JSON’ will be used. If the JsonPath evaluates to a JSON array or JSON object and the Return Type is set to ‘scalar’ the FlowFile will be unmodified and will be routed to failure. A Return Type of JSON can return scalar values if the provided JsonPath evaluates to the specified value and will be routed as a match. If Destination is ‘flowfile-content’ and the JsonPath does not evaluate to a defined path, the FlowFile will be routed to ‘unmatched’ without having its contents modified. If Destination is ‘flowfile-attribute’ and the expression matches nothing, attributes will be created with empty strings as the value unless ‘Path Not Found Behaviour’ is set to ‘skip’, and the FlowFile will always be routed to ‘matched.’

## Tags

JSON, JsonPath, evaluate

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Destination | Indicates whether the results of the JsonPath evaluation are written to the FlowFile content or a FlowFile attribute; if using attribute, must specify the Attribute Name property. If set to flowfile-content, only one JsonPath may be specified, and the property name is ignored. |
| Max String Length | The maximum allowed length of a string value when parsing the JSON document |
| Null Value Representation | Indicates the desired representation of JSON Path expressions resulting in a null value. |
| Path Not Found Behavior | Indicates how to handle missing JSON path expressions when destination is set to ‘flowfile-attribute’. Selecting ‘warn’ will generate a warning when a JSON path expression is not found. Selecting ‘skip’ will omit attributes for any unmatched JSON path expressions. |
| Return Type | Indicates the desired return type of the JSON Path expressions. Selecting ‘auto-detect’ will set the return type to ‘json’ for a Destination of ‘flowfile-content’, and ‘scalar’ for a Destination of ‘flowfile-attribute’. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship when the JsonPath cannot be evaluated against the content of the FlowFile; for instance, if the FlowFile is not valid JSON |
| matched | FlowFiles are routed to this relationship when the JsonPath is successfully evaluated and the FlowFile is modified as a result |
| unmatched | FlowFiles are routed to this relationship when the JsonPath does not match the content of the FlowFile and the Destination is set to flowfile-content |

---
title: EvaluateRagAnswerCorrectness 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluateraganswercorrectness.md
section: Loading & Unloading Data
---

# EvaluateRagAnswerCorrectness 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-rag-evaluation-processors-nar

## Description

Evaluates the correctness of generated answers in a Retrieval-Augmented Generation (RAG) context by computing metrics such as F1 score, cosine similarity, and answer correctness. The processor uses an LLM (e.g., OpenAI’s GPT) to assess the generated answer against the ground truth.

## Tags

ai, answer correctness, evaluation, llm, nlp, openai, openflow, rag

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Cosine Similarity Weight | The weight to apply to the cosine similarity when calculating answer correctness (between 0.0 and 1.0) |
| Evaluation Results Record Path | The RecordPath to write the results of the evaluation to. |
| F1 Score Weight | The weight to apply to the F1 score when calculating answer correctness (between 0.0 and 1.0) |
| Generated Answer Record Path | The path to the answer field in the record |
| Generated Answer Vector Record Path | The path to the answer vector field in the record. |
| Ground Truth Record Path | The RecordPath to the ground truth field in the record. |
| Ground Truth Vector Record Path | The path to the ground truth vector field in the record. |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Question Record Path | The RecordPath to the question field in the record. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| average.f1Score | The average F1 score computed over all records. |
| average.cosineSim | The average cosine similarity between the ground truth and answer embeddings. |
| average.answerCorrectness | The average answer correctness score computed over all records. |
| json.parse.failures | Number of JSON parse failures encountered. |

## Use cases

|  |
| --- |
| Use this processor to assess the quality of answers generated by an LLM in comparison to ground truth answers, providing metrics that can be used for monitoring and improving the performance of RAG systems. |

---
title: EvaluateRagFaithfulness 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluateragfaithfulness.md
section: Loading & Unloading Data
---

# EvaluateRagFaithfulness 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-rag-evaluation-processors-nar

## Description

Evaluates the faithfulness of generated answers in a Retrieval-Augmented Generation (RAG) system by analyzing responses using an LLM (e.g., OpenAI’s GPT). The processor enriches each FlowFile record with faithfulness metrics and detailed analysis.

## Tags

ai, evaluation, faithfulness, llm, nlp, openai, openflow, rag

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Context Identifier Record Path | The RecordPath to the array of contexts IDs in the record. |
| Context Record Path | The RecordPath to the array of contexts in the record. |
| Evaluation Results Record Path | The RecordPath to write the results of the evaluation to. |
| Generated Answer Record Path | The path to the answer field in the record |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Question Record Path | The RecordPath to the question field in the record. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| average.answer.faithfulness | The average faithfulness score computed over all records. |
| json.parse.failures | Number of JSON parse failures encountered. |

## Use cases

|  |
| --- |
| Use this processor to assess the faithfulness of answers generated by an LLM compared to the provided context. It provides metrics that can be used for monitoring and improving the performance of RAG systems. |

---
title: EvaluateRagRetrieval 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluateragretrieval.md
section: Loading & Unloading Data
---

# EvaluateRagRetrieval 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-rag-evaluation-processors-nar

## Description

Calculates retrieval metrics (Precision@N, Recall@N, FScore@N, MAP@N, MRR) for a RAG system using an LLM as a judge. For each record, it uses both Precision and Recall prompts to evaluate the response, and adds the metrics as attributes to the FlowFile.

## Tags

evaluation, fscore, llm, metrics, mrr, openai, openflow, precision, rag, recall, retrieval

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Context Identifier Record Path | The RecordPath to the array of contexts IDs in the record. |
| Context Record Path | The RecordPath to the array of contexts in the record. |
| Evaluation Results Record Path | The RecordPath to write the results of the evaluation to. |
| Ground Truth Record Path | The RecordPath to the ground truth field in the record. |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Question Record Path | The RecordPath to the question field in the record. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| n | The average number of retrieved documents per query. |
| precision.at.n | The average precision at N over all queries. |
| recall.at.n | The average recall at N over all queries. |
| fscore.at.n | The average F-Score at N over all queries. |
| mrr | The Mean Reciprocal Rank. |
| retrieval.eval.failures | Number of records where the eval could not be calculated. |
| json.parse.failures | Number of JSON parse failures encountered. |

---
title: EvaluateXPath 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluatexpath.md
section: Loading & Unloading Data
---

# EvaluateXPath 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more XPaths against the content of a FlowFile. The results of those XPaths are assigned to FlowFile Attributes or are written to the content of the FlowFile itself, depending on configuration of the Processor. XPaths are entered by adding user-defined properties; the name of the property maps to the Attribute Name into which the result will be placed (if the Destination is flowfile-attribute; otherwise, the property name is ignored). The value of the property must be a valid XPath expression. If the XPath evaluates to more than one node and the Return Type is set to ‘nodeset’ (either directly, or via ‘auto-detect’ with a Destination of ‘flowfile-content’), the FlowFile will be unmodified and will be routed to failure. If the XPath does not evaluate to a Node, the FlowFile will be routed to ‘unmatched’ without having its contents modified. If Destination is flowfile-attribute and the expression matches nothing, attributes will be created with empty strings as the value, and the FlowFile will always be routed to ‘matched’

## Tags

XML, XPath, evaluate

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Destination | Indicates whether the results of the XPath evaluation are written to the FlowFile content or a FlowFile attribute; if using attribute, must specify the Attribute Name property. If set to flowfile-content, only one XPath may be specified, and the property name is ignored. |
| Return Type | Indicates the desired return type of the Xpath expressions. Selecting ‘auto-detect’ will set the return type to ‘nodeset’ for a Destination of ‘flowfile-content’, and ‘string’ for a Destination of ‘flowfile-attribute’. |
| Validate DTD | Allow embedded Document Type Declaration in XML. This feature should be disabled to avoid XML entity expansion vulnerabilities. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship when the XPath cannot be evaluated against the content of the FlowFile; for instance, if the FlowFile is not valid XML, or if the Return Type is ‘nodeset’ and the XPath evaluates to multiple nodes |
| matched | FlowFiles are routed to this relationship when the XPath is successfully evaluated and the FlowFile is modified as a result |
| unmatched | FlowFiles are routed to this relationship when the XPath does not match the content of the FlowFile and the Destination is set to flowfile-content |

## Writes attributes

| Name | Description |
| --- | --- |
| user-defined | This processor adds user-defined attributes if the <Destination> property is set to flowfile-attribute. |

---
title: EvaluateXQuery 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/evaluatexquery.md
section: Loading & Unloading Data
---

# EvaluateXQuery 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more XQueries against the content of a FlowFile. The results of those XQueries are assigned to FlowFile Attributes or are written to the content of the FlowFile itself, depending on configuration of the Processor. XQueries are entered by adding user-defined properties; the name of the property maps to the Attribute Name into which the result will be placed (if the Destination is ‘flowfile-attribute’; otherwise, the property name is ignored). The value of the property must be a valid XQuery. If the XQuery returns more than one result, new attributes or FlowFiles (for Destinations of ‘flowfile-attribute’ or ‘flowfile-content’ respectively) will be created for each result (attributes will have a ‘.n’ one-up number appended to the specified attribute name). If any provided XQuery returns a result, the FlowFile(s) will be routed to ‘matched’. If no provided XQuery returns a result, the FlowFile will be routed to ‘unmatched’. If the Destination is ‘flowfile-attribute’ and the XQueries matche nothing, no attributes will be applied to the FlowFile.

## Tags

XML, XPath, XQuery, evaluate

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Destination | Indicates whether the results of the XQuery evaluation are written to the FlowFile content or a FlowFile attribute. If set to <flowfile-content>, only one XQuery may be specified and the property name is ignored. If set to <flowfile-attribute> and the XQuery returns more than one result, multiple attributes will be added to theFlowFile, each named with a ‘.n’ one-up number appended to the specified attribute name |
| Output: Indent | Specifies whether the processor may add additional whitespace when outputting a result tree. |
| Output: Method | Identifies the overall method that should be used for outputting a result tree. |
| Output: Omit XML Declaration | Specifies whether the processor should output an XML declaration when transforming a result tree. |
| Validate DTD | Allow embedded Document Type Declaration in XML. This feature should be disabled to avoid XML entity expansion vulnerabilities. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship when the XQuery cannot be evaluated against the content of the FlowFile. |
| matched | FlowFiles are routed to this relationship when the XQuery is successfully evaluated and the FlowFile is modified as a result |
| unmatched | FlowFiles are routed to this relationship when the XQuery does not match the content of the FlowFile and the Destination is set to flowfile-content |

## Writes attributes

| Name | Description |
| --- | --- |
| user-defined | This processor adds user-defined attributes if the <Destination> property is set to flowfile-attribute . |

---
title: ExcelReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/excelreader.md
section: Loading & Unloading Data
---

# ExcelReader

## Description

Parses a Microsoft Excel document returning each row in each sheet as a separate record. This reader allows for inferring a schema from all the required sheets or providing an explicit schema for interpreting the values. See Controller Service ‘s Usage for further documentation. This reader is capable of processing both password and non password protected .xlsx (XSSF 2007 OOXML file format) and older .xls (HSSF’97(-2007) file format) Excel documents.

## Tags

cell, excel, parse, reader, record, row, spreadsheet, values, xls, xlsx

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Input File Type \* | Input File Type | XLSX | * XLS * XLSX | Specifies type of Excel input file. |
| Password \* | Password |  |  | The password for a password protected Excel spreadsheet |
| Protection Type \* | Protection Type | UNPROTECTED | * Unprotected * Password Protected | Specifies whether an Excel spreadsheet is protected by a password or not. |
| Required Sheets | Required Sheets |  |  | Comma-separated list of Excel document sheet names whose rows should be extracted from the excel document. If this property is left blank then all the rows from all the sheets will be extracted from the Excel document. The list of names is case sensitive. Any sheets not specified in this value will be ignored. An exception will be thrown if a specified sheet(s) are not found. |
| Row Evaluation Strategy \* | Row Evaluation Strategy | STANDARD | * Standard * All Rows | A strategy to select how many rows after the starting row to use for determining the schema. |
| Schema Access Strategy \* | Schema Access Strategy | Use Starting Row | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Use Starting Row * Infer Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Starting Row \* | Starting Row | 1 |  | The row number of the first row to start processing (One based). Use this to skip over rows of data at the top of a worksheet that are not part of the dataset. When using the ‘Use Starting Row’ strategy this should be the column header row. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ExecuteGroovyScript 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executegroovyscript.md
section: Loading & Unloading Data
---

# ExecuteGroovyScript 2025.10.9.21

## Bundle

org.apache.nifi | nifi-groovyx-nar

## Description

Experimental Extended Groovy script processor. The script is responsible for handling the incoming flow file (transfer to SUCCESS or remove, e.g.) as well as any flow files created by the script. If the handling is incomplete or incorrect, the session will be rolled back.

## Tags

groovy, groovyx, script

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| groovyx-additional-classpath | Classpath list separated by semicolon or comma. You can use masks like `*`, `*.jar` in file name. |
| groovyx-failure-strategy | What to do with unhandled exceptions. If you want to manage exception by code then keep the default value `rollback`. If `transfer to failure` selected and unhandled exception occurred then all flowFiles received from incoming queues in this session will be transferred to `failure` relationship with additional attributes set: ERROR_MESSAGE and ERROR_STACKTRACE. If `rollback` selected and unhandled exception occurred then all flowFiles received from incoming queues will be penalized and returned. If the processor has no incoming connections then this parameter has no effect. |
| groovyx-script-body | Body of script to execute. Only one of Script File or Script Body may be used |
| groovyx-script-file | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |
| CLUSTER | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to be processed |
| success | FlowFiles that were successfully processed |

## See also

* [org.apache.nifi.processors.script.ExecuteScript](executescript.md)

---
title: ExecuteProcess 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executeprocess.md
section: Loading & Unloading Data
---

# ExecuteProcess 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Runs an operating system command specified by the user and writes the output of that command to a FlowFile. If the command is expected to be long-running, the Processor can output the partial data on a specified interval. When this option is used, the output is expected to be in textual format, as it typically does not make sense to split binary data on arbitrary time-based intervals.

## Tags

command, external, invoke, process, script, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Argument Delimiter | Delimiter to use to separate arguments for a command [default: space]. Must be a single character. |
| Batch Duration | If the process is expected to be long-running and produce textual output, a batch duration can be specified so that the output will be captured for this amount of time and a FlowFile will then be sent out with the results and a new FlowFile will be started, rather than waiting for the process to finish before sending out the results |
| Command | Specifies the command to be executed; if just the name of an executable is provided, it must be in the user’s environment PATH. |
| Command Arguments | The arguments to supply to the executable delimited by white space. White space can be escaped by enclosing it in double-quotes. |
| Output MIME type | Specifies the value to set for the “mime.type” attribute. This property is ignored if ‘Batch Duration’ is set. |
| Redirect Error Stream | If true will redirect any error stream output of the process to the output stream. This is particularly helpful for processes which write extensively to the error stream or for troubleshooting. |
| Working Directory | The directory to use as the current working directory when executing the command |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| success | All created FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| command | Executed command |
| command.arguments | Arguments of the command |
| mime.type | Sets the MIME type of the output if the ‘Output MIME Type’ property is set and ‘Batch Duration’ is not set |

---
title: ExecuteScript 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executescript.md
section: Loading & Unloading Data
---

# ExecuteScript 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

Experimental - Executes a script given the flow file and a process session. The script is responsible for handling the incoming flow file (transfer to SUCCESS or remove, e.g.) as well as any flow files created by the script. If the handling is incomplete or incorrect, the session will be rolled back. Experimental: Impact of sustained usage not yet verified.

## Tags

clojure, execute, groovy, script

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | Language Engine for executing scripts |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |
| CLUSTER | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to be processed |
| success | FlowFiles that were successfully processed |

## See also

* [org.apache.nifi.processors.script.InvokeScriptedProcessor](invokescriptedprocessor.md)

---
title: ExecuteSQL 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executesql.md
section: Loading & Unloading Data
---

# ExecuteSQL 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Executes provided SQL select query. Query result will be converted to Avro format. Streaming is used so arbitrarily large result sets are supported. This processor can be scheduled to run on a timer, or cron expression, using the standard scheduling methods, or it can be triggered by an incoming FlowFile. If it is triggered by an incoming FlowFile, then attributes of that FlowFile will be available when evaluating the select query, and the query may use the ? to escape parameters. In this case, the parameters to use must exist as FlowFile attributes with the naming convention sql.args. N.type and sql.args. N.value, where N is a positive integer. The sql.args. N.type is expected to be a number indicating the JDBC Type. The content of the FlowFile is expected to be in UTF-8 format. FlowFile attribute ‘executesql.row.count’ indicates how many rows were selected.

## Tags

database, jdbc, query, select, sql

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Content Output Strategy | Specifies the strategy for writing FlowFile content when processing input FlowFiles. The strategy applies when handling queries that do not produce results. |
| Database Connection Pooling Service | The Controller Service that is used to obtain connection to database |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Max Wait Time | The maximum amount of time allowed for a running SQL select query , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| Normalize Table and Column Names | Whether to change non-Avro-compatible characters in column names to Avro-compatible characters. For example, colons and periods will be changed to underscores in order to build a valid Avro record. |
| SQL Query | The SQL query to execute. The query can be empty, a constant value, or built from attributes using Expression Language. If this property is specified, it will be used regardless of the content of incoming flowfiles. If this property is empty, the content of the incoming flow file is expected to contain a valid SQL select query, to be issued by the processor to the database. Note that Expression Language is not evaluated for flow file contents. |
| Use Avro Logical Types | Whether to use Avro Logical Types for DECIMAL/NUMBER, DATE, TIME and TIMESTAMP columns. If disabled, written as string. If enabled, Logical types are used and written as its underlying type, specifically, DECIMAL/NUMBER as logical ‘decimal’: written as bytes with additional precision and scale meta data, DATE as logical ‘date-millis’: written as int denoting days since Unix epoch (1970-01-01), TIME as logical ‘time-millis’: written as int denoting milliseconds since Unix epoch, and TIMESTAMP as logical ‘timestamp-millis’: written as long denoting milliseconds since Unix epoch. If a reader of written Avro records also knows these logical types, then these values can be deserialized with more context depending on reader implementation. |
| compression-format | Compression type to use when writing Avro files. Default is None. |
| esql-auto-commit | Enables or disables the auto commit functionality of the DB connection. Default value is ‘true’. The default value can be used with most of the JDBC drivers and this functionality doesn’t have any impact in most of the cases since this processor is used to read data. However, for some JDBC drivers such as PostgreSQL driver, it is required to disable the auto committing functionality to limit the number of result rows fetching at a time. When auto commit is enabled, postgreSQL driver loads whole result set to memory at once. This could lead for a large amount of memory usage when executing queries which fetch large data sets. More Details of this behaviour in PostgreSQL driver can be found in <https://jdbc.postgresql.org//documentation/head/query.html>. |
| esql-fetch-size | The number of result rows to be fetched from the result set at a time. This is a hint to the database driver and may not be honored and/or exact. If the value specified is zero, then the hint is ignored. |
| esql-max-rows | The maximum number of result rows that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If the value specified is zero, then all rows are returned in a single FlowFile. |
| esql-output-batch-size | The number of output FlowFiles to queue before committing the process session. When set to zero, the session will be committed when all result set rows have been processed and the output FlowFiles are ready for transfer to the downstream relationship. For large result sets, this can cause a large burst of FlowFiles to be transferred at the end of processor execution. If this property is set, then when the specified number of FlowFiles are ready for transfer, then the session will be committed, thus releasing the FlowFiles to the downstream relationship. NOTE: The fragment.count attribute will not be set on FlowFiles when this property is set. |
| sql-post-query | A semicolon-delimited list of queries executed after the main SQL query is executed. Example like setting session properties after main query. It ‘s possible to include semicolons in the statements themselves by escaping them with a backslash (’;’). Results/outputs from these queries will be suppressed if there are no errors. |
| sql-pre-query | A semicolon-delimited list of queries executed before the main SQL query is executed. For example, set session properties before main query. It ‘s possible to include semicolons in the statements themselves by escaping them with a backslash (’;’). Results/outputs from these queries will be suppressed if there are no errors. |

## Relationships

| Name | Description |
| --- | --- |
| failure | SQL query execution failed. Incoming FlowFile will be penalized and routed to this relationship |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| executesql.row.count | Contains the number of rows returned by the query. If ‘Max Rows Per Flow File’ is set, then this number will reflect the number of rows in the Flow File instead of the entire result set. |
| executesql.query.duration | Combined duration of the query execution time and fetch time in milliseconds. If ‘Max Rows Per Flow File’ is set, then this number will reflect only the fetch time for the rows in the Flow File instead of the entire result set. |
| executesql.query.executiontime | Duration of the query execution time in milliseconds. This number will reflect the query execution time regardless of the ‘Max Rows Per Flow File’ setting. |
| executesql.query.fetchtime | Duration of the result set fetch time in milliseconds. If ‘Max Rows Per Flow File’ is set, then this number will reflect only the fetch time for the rows in the Flow File instead of the entire result set. |
| executesql.resultset.index | Assuming multiple result sets are returned, the zero based index of this result set. |
| executesql.error.message | If processing an incoming flow file causes an Exception, the Flow File is routed to failure and this attribute is set to the exception message. |
| fragment.identifier | If ‘Max Rows Per Flow File’ is set then all FlowFiles from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results. |
| fragment.count | If ‘Max Rows Per Flow File’ is set then this is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet. If Output Batch Size is set, then this attribute will not be populated. |
| fragment.index | If ‘Max Rows Per Flow File’ is set then the position of this FlowFile in the list of outgoing FlowFiles that were all derived from the same result set FlowFile. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same query result set and in what order FlowFiles were produced |
| input.flowfile.uuid | If the processor has an incoming connection, outgoing FlowFiles will have this attribute set to the value of the input FlowFile’s UUID. If there is no incoming connection, the attribute will not be added. |

---
title: ExecuteSQLRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executesqlrecord.md
section: Loading & Unloading Data
---

# ExecuteSQLRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Executes provided SQL select query. Query result will be converted to the format specified by a Record Writer. Streaming is used so arbitrarily large result sets are supported. This processor can be scheduled to run on a timer, or cron expression, using the standard scheduling methods, or it can be triggered by an incoming FlowFile. If it is triggered by an incoming FlowFile, then attributes of that FlowFile will be available when evaluating the select query, and the query may use the ? to escape parameters. In this case, the parameters to use must exist as FlowFile attributes with the naming convention sql.args. N.type and sql.args. N.value, where N is a positive integer. The sql.args. N.type is expected to be a number indicating the JDBC Type. The content of the FlowFile is expected to be in UTF-8 format. FlowFile attribute ‘executesql.row.count’ indicates how many rows were selected.

## Tags

database, jdbc, query, record, select, sql

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Database Connection Pooling Service | The Controller Service that is used to obtain connection to database |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Max Wait Time | The maximum amount of time allowed for a running SQL select query , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| SQL Query | The SQL query to execute. The query can be empty, a constant value, or built from attributes using Expression Language. If this property is specified, it will be used regardless of the content of incoming flowfiles. If this property is empty, the content of the incoming flow file is expected to contain a valid SQL select query, to be issued by the processor to the database. Note that Expression Language is not evaluated for flow file contents. |
| Use Avro Logical Types | Whether to use Avro Logical Types for DECIMAL/NUMBER, DATE, TIME and TIMESTAMP columns. If disabled, written as string. If enabled, Logical types are used and written as its underlying type, specifically, DECIMAL/NUMBER as logical ‘decimal’: written as bytes with additional precision and scale meta data, DATE as logical ‘date-millis’: written as int denoting days since Unix epoch (1970-01-01), TIME as logical ‘time-millis’: written as int denoting milliseconds since Unix epoch, and TIMESTAMP as logical ‘timestamp-millis’: written as long denoting milliseconds since Unix epoch. If a reader of written Avro records also knows these logical types, then these values can be deserialized with more context depending on reader implementation. |
| esql-auto-commit | Enables or disables the auto commit functionality of the DB connection. Default value is ‘true’. The default value can be used with most of the JDBC drivers and this functionality doesn’t have any impact in most of the cases since this processor is used to read data. However, for some JDBC drivers such as PostgreSQL driver, it is required to disable the auto committing functionality to limit the number of result rows fetching at a time. When auto commit is enabled, postgreSQL driver loads whole result set to memory at once. This could lead for a large amount of memory usage when executing queries which fetch large data sets. More Details of this behaviour in PostgreSQL driver can be found in <https://jdbc.postgresql.org//documentation/head/query.html>. |
| esql-fetch-size | The number of result rows to be fetched from the result set at a time. This is a hint to the database driver and may not be honored and/or exact. If the value specified is zero, then the hint is ignored. |
| esql-max-rows | The maximum number of result rows that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If the value specified is zero, then all rows are returned in a single FlowFile. |
| esql-output-batch-size | The number of output FlowFiles to queue before committing the process session. When set to zero, the session will be committed when all result set rows have been processed and the output FlowFiles are ready for transfer to the downstream relationship. For large result sets, this can cause a large burst of FlowFiles to be transferred at the end of processor execution. If this property is set, then when the specified number of FlowFiles are ready for transfer, then the session will be committed, thus releasing the FlowFiles to the downstream relationship. NOTE: The fragment.count attribute will not be set on FlowFiles when this property is set. |
| esqlrecord-normalize | Whether to change characters in column names. For example, colons and periods will be changed to underscores. |
| esqlrecord-record-writer | Specifies the Controller Service to use for writing results to a FlowFile. The Record Writer may use Inherit Schema to emulate the inferred schema behavior, i.e. an explicit schema need not be defined in the writer, and will be supplied by the same logic used to infer the schema from the column types. |
| sql-post-query | A semicolon-delimited list of queries executed after the main SQL query is executed. Example like setting session properties after main query. It ‘s possible to include semicolons in the statements themselves by escaping them with a backslash (’;’). Results/outputs from these queries will be suppressed if there are no errors. |
| sql-pre-query | A semicolon-delimited list of queries executed before the main SQL query is executed. For example, set session properties before main query. It ‘s possible to include semicolons in the statements themselves by escaping them with a backslash (’;’). Results/outputs from these queries will be suppressed if there are no errors. |

## Relationships

| Name | Description |
| --- | --- |
| failure | SQL query execution failed. Incoming FlowFile will be penalized and routed to this relationship |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| executesql.row.count | Contains the number of rows returned in the select query |
| executesql.query.duration | Combined duration of the query execution time and fetch time in milliseconds |
| executesql.query.executiontime | Duration of the query execution time in milliseconds |
| executesql.query.fetchtime | Duration of the result set fetch time in milliseconds |
| executesql.resultset.index | Assuming multiple result sets are returned, the zero based index of this result set. |
| executesql.error.message | If processing an incoming flow file causes an Exception, the Flow File is routed to failure and this attribute is set to the exception message. |
| fragment.identifier | If ‘Max Rows Per Flow File’ is set then all FlowFiles from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results. |
| fragment.count | If ‘Max Rows Per Flow File’ is set then this is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet. If Output Batch Size is set, then this attribute will not be populated. |
| fragment.index | If ‘Max Rows Per Flow File’ is set then the position of this FlowFile in the list of outgoing FlowFiles that were all derived from the same result set FlowFile. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same query result set and in what order FlowFiles were produced |
| input.flowfile.uuid | If the processor has an incoming connection, outgoing FlowFiles will have this attribute set to the value of the input FlowFile’s UUID. If there is no incoming connection, the attribute will not be added. |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer. |
| record.count | The number of records output by the Record Writer. |

---
title: ExecuteSQLStatement 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executesqlstatement.md
section: Loading & Unloading Data
---

# ExecuteSQLStatement 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-processors-nar

## Description

Executes a SQL DDL or DML Statement against a database. This Processor allows Expression Language to be evaluated against FlowFile attributes in order to parameterize the SQL for each FlowFile.

## Tags

database, delete, insert, jdbc, openflow, sql, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pooling Service | The Connection Pooling Service that is used to obtain a connection to the database |
| Max Batch Size | The maximum number of FlowFiles to process in a single batch |
| Max Content Reference Size | If the SQL property references ${flowfile_content}, this property specifies the maximum size of the FlowFile that is allowed to be read into memory. If the FlowFile is larger than this value, the FlowFile will be routed to failure. If the SQL property does not reference ${flowfile_content}, this value has no effect. |
| SQL | The SQL statement to execute. The SQL may make use of Expression Language to reference attributes. In this case, the Processor will rewrite the query using parameters in order to avoid SQL Injection attacks. When referencing Expression Language, the entire value must be a single Expression. For example, `INSERT INTO TABLE X (name) VALUES ( '${name}')` is valid, but `INSERT INTO TABLE X (name) VALUES ( 'Mr. ${name}')` is not because Expression Language is used within a String value. The SQL may also reference `${flowfile_content}` in order to reference the content of the FlowFile as UTF-8 encoded text. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The SQL statement could not be executed |
| success | The SQL statement was successfully executed |

---
title: ExecuteStreamCommand 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/executestreamcommand.md
section: Loading & Unloading Data
---

# ExecuteStreamCommand 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

The ExecuteStreamCommand processor provides a flexible way to integrate external commands and scripts into NiFi data flows. ExecuteStreamCommand can pass the incoming FlowFile’s content to the command that it executes similarly how piping works.

## Tags

command, command execution, execute, stream

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Argument Delimiter | Delimiter to use to separate arguments for a command [default: ;]. Must be a single character |
| Command Arguments | The arguments to supply to the executable delimited by the ‘;’ character. |
| Command Path | Specifies the command to be executed; if just the name of an executable is provided, it must be in the user’s environment PATH. |
| Ignore STDIN | If true, the contents of the incoming flowfile will not be passed to the executing command |
| Max Attribute Length | If routing the output of the stream command to an attribute, the number of characters put to the attribute value will be at most this amount. This is important because attributes are held in memory and large attributes will quickly cause out of memory issues. If the output goes longer than this value, it will truncated to fit. Consider making this smaller if able. |
| Output Destination Attribute | If set, the output of the stream command will be put into an attribute of the original FlowFile instead of a separate FlowFile. There will no longer be a relationship for ‘output stream’ or ‘nonzero status’. The value of this property will be the key for the output attribute. |
| Output MIME Type | Specifies the value to set for the “mime.type” attribute. This property is ignored if ‘Output Destination Attribute’ is set. |
| Working Directory | The directory to use as the current working directory when executing the command |
| argumentsStrategy | Strategy for configuring arguments to be supplied to the command. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| nonzero status | The destination path for the flow file created from the command’s output, if the returned status code is non-zero. All flow files routed to this relationship will be penalized. |
| original | The original FlowFile will be routed. It will have new attributes detailing the result of the script execution. |
| output stream | The destination path for the flow file created from the command’s output, if the returned status code is zero. |

## Writes attributes

| Name | Description |
| --- | --- |
| execution.command | The name of the command executed |
| execution.command.args | The semi-colon delimited list of arguments. Sensitive properties will be masked |
| execution.status | The exit status code returned from executing the command |
| execution.error | Any error messages returned from executing the command |
| mime.type | Sets the MIME type of the output if the ‘Output MIME Type’ property is set and ‘Output Destination Attribute’ is not set |

---
title: Explore Data from SAP® Business Data Cloud
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sap-sql/explore-data.md
section: Loading & Unloading Data
---

# Explore Data from SAP® Business Data Cloud

In this topic we will explore the data that has been shared with Snowflake.
All examples are intended to showcase accessing data using Snowflake.

The following sections use `CUSTOMER` as an example database, but you can use the same steps to explore the data in your database.

## Explore the database, schemas, and tables

Examine the database:

> ```sqlexample
> DESC DATABASE CUSTOMER;
> ```
>
> Which should produce results similar to:
>
> ```output
> +--------------------------------+---------------------------------+
> |created_on                              | name                            | kind      |
> +--------------------------------+---------------------------------+
> | 2025-12-17 13:30:01.062 -0800  | INFORMATION_SCHEMA      | SCHEMA    |
> | 2025-12-17 13:11:12.206 -0800  | customer            | SCHEMA    |
> +--------------------------------+---------------------------------+
> ```
>
> Where each row represents the schema in your database.

Examine the tables in the database:

> ```sqlexample
> SHOW TABLES IN CUSTOMER;
> ```
>
> Which should produce results similar to:
>
> ```output
> +------------------------+----------------+-------------+-------+-------+--------+
> | name                               | database_name  | schema_name | kind  | rows  | bytes  |
> +------------------------+----------------+-------------+-------+-------+--------+
> | customer                           | CUSTOMER       | customer    | TABLE | 2174  | 215708 |
> | customercompanycode          | CUSTOMER               | customer    | TABLE | 1792  | 37311  |
> | customerdunning                | CUSTOMER             | customer    | TABLE | 44    | 4912   |
> | customersalesarea            | CUSTOMER               | customer    | TABLE | 442   | 34415  |
> | customersalesareatax   | CUSTOMER             | customer    | TABLE | 883   | 9153   |
> | customerunloadingpoint | CUSTOMER             | customer    | TABLE | 37    | 13253  |
> +------------------------+----------------+-------------+-------+-------+--------+
> ```

## Query tables in the CUSTOMER database

Query the ‘customer’ table:

> ```sqlexample
> SELECT * FROM CUSTOMER.customer.customer;
>
> SELECT * FROM CUSTOMER.customer.customer WHERE CREATEDBYUSER = 'KAPOORM'
> ```

## Create Derived Data from shared data

Create L1 data by joining tables from 2 shared Data Products:

```sqlexample
-- Join tables in CUSTOMER and ENTRYVIEWJOURNALENTRY to find top 10 customers by revenue
SELECT
    c.customer,
    c.customername,
    c.country,
    c.region,
    c.businesstype,
    COUNT(DISTINCT e.accountingdocument) as num_transactions,
    SUM(e.amountincompanycodecurrency) as total_revenue,
    AVG(e.amountincompanycodecurrency) as avg_transaction_amount
FROM CUSTOMER.customer.customer c
JOIN ENTRYVIEWJOURNALENTRY.entryviewjournalentry.operationalacctgdocitem e
   ON c.customer = e.customer
WHERE c.deletionindicator = FALSE
GROUP BY 1,2,3,4,5
ORDER BY total_revenue DESC
LIMIT 10;
```

## Create Table As Select (CTAS) in a new database

Create a new database to hold the CTAS:

```sqlexample
CREATE DATABASE CUSTOMER_CTAS_DEMO;
USE DATABASE CUSTOMER_CTAS_DEMO;

-- Create the CTAS
CREATE OR REPLACE TABLE top_customers_by_revenue AS
SELECT
  c.customer,
  c.customername,
  c.country,
  c.region,
  c.businesstype,
  COUNT(DISTINCT e.accountingdocument) as num_transactions,
  SUM(e.amountincompanycodecurrency) as total_revenue,
  AVG(e.amountincompanycodecurrency) as avg_transaction_amount
FROM CUSTOMER.customer.customer c
JOIN ENTRYVIEWJOURNALENTRY.entryviewjournalentry.operationalacctgdocitem e
    ON c.customer = e.customer
WHERE c.deletionindicator = FALSE
 GROUP BY 1,2,3,4,5
 ORDER BY total_revenue DESC;

 -- Query the CTAS
 SELECT * FROM top_customers_by_revenue LIMIT 10;
```

---
title: ExternalHazelcastCacheManager
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/externalhazelcastcachemanager.md
section: Loading & Unloading Data
---

# ExternalHazelcastCacheManager

## Description

A service that provides cache instances backed by Hazelcast running outside of NiFi.

## Tags

cache, hazelcast

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Hazelcast Cluster Name \* | hazelcast-cluster-name | nifi |  | Name of the Hazelcast cluster. |
| Hazelcast Connection Timeout \* | hazelcast-connection-timeout | 20 secs |  | The maximum amount of time the client tries to connect or reconnect before giving up. |
| Hazelcast Initial Backoff \* | hazelcast-retry-backoff-initial | 1 secs |  | The amount of time the client waits before it tries to reestablish connection for the first time. |
| Hazelcast Maximum Backoff \* | hazelcast-retry-backoff-maximum | 5 secs |  | The maximum amount of time the client waits before it tries to reestablish connection. |
| Hazelcast Backoff Multiplier \* | hazelcast-retry-backoff-multiplier | 1.5 |  | A multiplier by which the wait time is increased before each attempt to reestablish connection. |
| Hazelcast Server Address \* | hazelcast-server-address |  |  | Addresses of one or more the Hazelcast instances, using {host:port} format, separated by comma. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ExtractAvroMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractavrometadata.md
section: Loading & Unloading Data
---

# ExtractAvroMetadata 2025.10.9.21

## Bundle

org.apache.nifi | nifi-avro-nar

## Description

Extracts metadata from the header of an Avro datafile.

## Tags

avro, metadata, schema

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Count Items | If true the number of items in the datafile will be counted and stored in a FlowFile attribute ‘item.count’. The counting is done by reading blocks and getting the number of items for each block, thus avoiding de-serializing. The items being counted will be the top-level items in the datafile. For example, with a schema of type record the items will be the records, and for a schema of type Array the items will be the arrays (not the number of entries in each array). |
| Fingerprint Algorithm | The algorithm used to generate the schema fingerprint. Available choices are based on the Avro recommended practices for fingerprint generation. |
| Metadata Keys | A comma-separated list of keys indicating key/value pairs to extract from the Avro file header. The key ‘avro.schema’ can be used to extract the full schema in JSON format, and ‘avro.codec’ can be used to extract the codec name if one exists. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if it cannot be parsed as Avro or metadata cannot be extracted for any reason |
| success | A FlowFile is routed to this relationship after metadata has been extracted. |

## Writes attributes

| Name | Description |
| --- | --- |
| schema.type | The type of the schema (i.e. record, enum, etc.). |
| schema.name | Contains the name when the type is a record, enum or fixed, otherwise contains the name of the primitive type. |
| schema.fingerprint | The result of the Fingerprint Algorithm as a Hex string. |
| item.count | The total number of items in the datafile, only written if Count Items is set to true. |

---
title: ExtractEmailAttachments 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractemailattachments.md
section: Loading & Unloading Data
---

# ExtractEmailAttachments 2025.10.9.21

## Bundle

org.apache.nifi | nifi-email-nar

## Description

Extract attachments from a mime formatted email file, splitting them into individual flowfiles.

## Tags

email, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Relationships

| Name | Description |
| --- | --- |
| attachments | Each individual attachment will be routed to the attachments relationship |
| failure | FlowFiles that could not be parsed |
| original | The original file |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename of the attachment |
| email.attachment.parent.filename | The filename of the parent FlowFile |
| email.attachment.parent.uuid | The UUID of the original FlowFile. |
| mime.type | The mime type of the attachment. |

---
title: ExtractEmailHeaders 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractemailheaders.md
section: Loading & Unloading Data
---

# ExtractEmailHeaders 2025.10.9.21

## Bundle

org.apache.nifi | nifi-email-nar

## Description

Using the flowfile content as source of data, extract header from an RFC compliant email file adding the relevant attributes to the flowfile. This processor does not perform extensive RFC validation but still requires a bare minimum compliance with RFC 2822

## Tags

email, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Additional Header List | COLON separated list of additional headers to be extracted from the flowfile content. NOTE the header key is case insensitive and will be matched as lower-case. Values will respect email contents. |
| Email Address Parsing | If “strict”, strict address format parsing rules are applied to mailbox and mailbox list fields, such as “to” and “from” headers, and FlowFiles with poorly formed addresses will be routed to the failure relationship, similar to messages that fail RFC compliant format validation. If “non-strict”, the processor will extract the contents of mailbox list headers as comma-separated values without attempting to parse each value as well-formed Internet mailbox addresses. This is optional and defaults to Strict Address Parsing |

## Relationships

| Name | Description |
| --- | --- |
| failure | Flowfiles that could not be parsed as a RFC-2822 compliant message |
| success | Extraction was successful |

## Writes attributes

| Name | Description |
| --- | --- |
| email.headers.bcc.\* | Each individual BCC recipient (if available) |
| email.headers.cc.\* | Each individual CC recipient (if available) |
| email.headers.from.\* | Each individual mailbox contained in the From of the Email (array as per RFC-2822) |
| email.headers.message-id | The value of the Message-ID header (if available) |
| email.headers.received_date | The Received-Date of the message (if available) |
| email.headers.sent_date | Date the message was sent |
| email.headers.subject | Subject of the message (if available) |
| email.headers.to.\* | Each individual TO recipient (if available) |
| email.attachment_count | Number of attachments of the message |

---
title: ExtractGrok 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractgrok.md
section: Loading & Unloading Data
---

# ExtractGrok 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more Grok Expressions against the content of a FlowFile, adding the results as attributes or replacing the content of the FlowFile with a JSON notation of the matched content

## Tags

delimit, extract, grok, log, parse, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the file is encoded |
| Destination | Control if Grok output value is written as a new flowfile attributes, in this case each of the Grok identifier that is matched in the flowfile will be added as an attribute, prefixed with “grok.” or written in the flowfile content. Writing to flowfile content will overwrite any existing flowfile content. |
| Grok Expression | Grok expression. If other Grok expressions are referenced in this expression, they must be provided in the Grok Pattern File if set or exist in the default Grok patterns |
| Grok Pattern file | Custom Grok pattern definitions. These definitions will be loaded after the default Grok patterns. The Grok Parser will use the default Grok patterns when this property is not configured. |
| Keep Empty Captures | If true, then empty capture values will be included in the returned capture map. |
| Maximum Buffer Size | Specifies the maximum amount of data to buffer (per file) in order to apply the Grok expressions. Files larger than the specified maximum will not be fully evaluated. |
| Named captures only | Only store named captures from grok |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Patterns can reference resources over HTTP |

## Relationships

| Name | Description |
| --- | --- |
| matched | FlowFiles are routed to this relationship when the Grok Expression is successfully evaluated and the FlowFile is modified as a result |
| unmatched | FlowFiles are routed to this relationship when no provided Grok Expression matches the content of the FlowFile |

## Writes attributes

| Name | Description |
| --- | --- |
| grok.XXX | When operating in flowfile-attribute mode, each of the Grok identifier that is matched in the flowfile will be added as an attribute, prefixed with “grok.” For example,if the grok identifier “timestamp” is matched, then the value will be added to an attribute named “grok.timestamp” |

---
title: ExtractRecordSchema 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractrecordschema.md
section: Loading & Unloading Data
---

# ExtractRecordSchema 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Extracts the record schema from the FlowFile using the supplied Record Reader and writes it to the `avro.schema` attribute.

## Tags

avro, csv, freeform, generic, json, record, schema, text, xml

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| cache-size | Specifies the number of schemas to cache. This value should reflect the expected number of different schemas that may be in the incoming FlowFiles. This ensures more efficient retrieval of the schemas and thus the processor performance. |
| record-reader | Specifies the Controller Service to use for reading incoming data |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile’s record schema cannot be extracted from the configured input format, the FlowFile will be routed to this relationship |
| success | FlowFiles whose record schemas are successfully extracted will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.error.message | This attribute provides on failure the error message encountered by the Reader. |
| avro.schema | This attribute provides the schema extracted from the input FlowFile using the provided RecordReader. |

---
title: ExtractSchemaColumns 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractschemacolumns.md
section: Loading & Unloading Data
---

# ExtractSchemaColumns 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-record-schema-nar

## Description

Extracts the record schema columns from the FlowFile using the supplied Record Reader and writes it to the `schema.columns` attribute.

## Tags

avro, csv, freeform, generic, json, record, schema, text, xml

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| End Column Index | Specifies index of the column in schema to which columns should be taken. |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Start Column Index | Specifies index of the column (numbered from 1) in schema from which columns should be taken. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile’s record schema cannot be extracted from the configured input format, the FlowFile will be routed to this relationship |
| success | FlowFiles whose record schemas are successfully extracted will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.error.message | This attribute provides on failure the error message encountered by the Reader. |
| schema.columns | This attribute provides columns extracted from the input FlowFile using the provided RecordReader. |

---
title: ExtractStructuredBoxFileMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extractstructuredboxfilemetadata.md
section: Loading & Unloading Data
---

# ExtractStructuredBoxFileMetadata 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Extracts metadata from a Box file using Box AI. The extraction can use either a template or a list of fields. The extracted metadata is written to the FlowFile content as JSON.

## Tags

ai, box, extract, metadata, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Extraction Method | The method to use for extracting metadata. TEMPLATE uses a Box metadata template for extraction. FIELDS uses a JSON schema of fields (read from FlowFile content) for extraction. |
| File ID | The ID of the file from which to extract metadata. |
| Record Reader | The Record Reader to use for parsing the incoming data. Required when Extraction Method is FIELDS. |
| Template Key | The key of the metadata template to use for extraction. Required when Extraction Method is TEMPLATE. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if an error occurs during metadata extraction. |
| file not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile is routed to this relationship after metadata has been successfully extracted. |
| template not found | FlowFiles for which the specified metadata template was not found will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file from which metadata was extracted |
| box.ai.template.key | The template key used for extraction (when using TEMPLATE extraction method) |
| box.ai.extraction.method | The extraction method used (TEMPLATE or FIELDS) |
| box.ai.completion.reason | The completion reason from the AI extraction |
| mime.type | Set to ‘application/json’ for the JSON content |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFileMetadataTemplates](listboxfilemetadatatemplates.md)
* [org.apache.nifi.processors.box.UpdateBoxFileMetadataInstance](updateboxfilemetadatainstance.md)

---
title: ExtractText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/extracttext.md
section: Loading & Unloading Data
---

# ExtractText 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more Regular Expressions against the content of a FlowFile. The results of those Regular Expressions are assigned to FlowFile Attributes. Regular Expressions are entered by adding user-defined properties; the name of the property maps to the Attribute Name into which the result will be placed. The attributes are generated differently based on the enabling of named capture groups. If named capture groups are not enabled: The first capture group, if any found, will be placed into that attribute name. But all capture groups, including the matching string sequence itself will also be provided at that attribute name with an index value provided, with the exception of a capturing group that is optional and does not match - for example, given the attribute name “regex” and expression “abc(def)?(g)” we would add an attribute “regex.1” with a value of “def” if the “def” matched. If the “def” did not match, no attribute named “regex.1” would be added but an attribute named “regex.2” with a value of “g” will be added regardless. If named capture groups are enabled: Each named capture group, if found will be placed into the attributes name with the name provided. If enabled the matching string sequence itself will be placed into the attribute name. If multiple matches are enabled, and index will be applied after the first set of matches. The exception is a capturing group that is optional and does not match For example, given the attribute name “regex” and expression “abc(?<NAMED>def)?(?<NAMED-TWO>g)” we would add an attribute “regex. NAMED” with the value of “def” if the “def” matched. We would add an attribute “regex. NAMED-TWO” with the value of “g” if the “g” matched regardless. The value of the property must be a valid Regular Expressions with one or more capturing groups. If named capture groups are enabled, all capture groups must be named. If they are not, then the processor configuration will fail validation. If the Regular Expression matches more than once, only the first match will be used unless the property enabling repeating capture group is set to true. If any provided Regular Expression matches, the FlowFile(s) will be routed to ‘matched’. If no provided Regular Expression matches, the FlowFile will be routed to ‘unmatched’ and no attributes will be applied to the FlowFile.

## Tags

Regular Expression, Text, evaluate, extract, regex

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the file is encoded |
| Enable Canonical Equivalence | Indicates that two characters match only when their full canonical decompositions match. |
| Enable Case-insensitive Matching | Indicates that two characters match even if they are in a different case. Can also be specified via the embedded flag (?i). |
| Enable DOTALL Mode | Indicates that the expression ‘.’ should match any character, including a line terminator. Can also be specified via the embedded flag (?s). |
| Enable Literal Parsing of the Pattern | Indicates that Metacharacters and escape characters should be given no special meaning. |
| Enable Multiline Mode | Indicates that ‘^’ and ‘$’ should match just after and just before a line terminator or end of sequence, instead of only the beginning or end of the entire input. Can also be specified via the embeded flag (?m). |
| Enable Unicode Predefined Character Classes | Specifies conformance with the Unicode Technical Standard #18: Unicode Regular Expression Annex C: Compatibility Properties. Can also be specified via the embedded flag (?U). |
| Enable Unicode-aware Case Folding | When used with ‘Enable Case-insensitive Matching’, matches in a manner consistent with the Unicode Standard. Can also be specified via the embedded flag (?u). |
| Enable Unix Lines Mode | Indicates that only the ‘line terminator is recognized in the behavior of’. ‘,’^ ‘, and’$’. Can also be specified via the embedded flag (?d). |
| Enable named group support | If set to true, when named groups are present in the regular expression, the name of the group will be used in the attribute name as opposed to the group index. All capturing groups must be named, if the number of groups (not including capture group 0) does not equal the number of named groups validation will fail. |
| Enable repeating capture group | If set to true, every string matching the capture groups will be extracted. Otherwise, if the Regular Expression matches more than once, only the first match will be extracted. |
| Include Capture Group 0 | Indicates that Capture Group 0 should be included as an attribute. Capture Group 0 represents the entirety of the regular expression match, is typically not used, and could have considerable length. |
| Maximum Buffer Size | Specifies the maximum amount of data to buffer (per FlowFile) in order to apply the regular expressions. FlowFiles larger than the specified maximum will not be fully evaluated. |
| Maximum Capture Group Length | Specifies the maximum number of characters a given capture group value can have. Any characters beyond the max will be truncated. |
| Permit Whitespace and Comments in Pattern | In this mode, whitespace is ignored, and embedded comments starting with # are ignored until the end of a line. Can also be specified via the embedded flag (?x). |

## Relationships

| Name | Description |
| --- | --- |
| matched | FlowFiles are routed to this relationship when the Regular Expression is successfully evaluated and the FlowFile is modified as a result |
| unmatched | FlowFiles are routed to this relationship when no provided Regular Expression matches the content of the FlowFile |

---
title: FetchAzureBlobStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchazureblobstorage_v12.md
section: Loading & Unloading Data
---

# FetchAzureBlobStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Retrieves the specified blob from Azure Blob Storage and writes its content to the content of the FlowFile. The processor uses Azure Blob Storage client library v12.

## Tags

azure, blob, cloud, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Blob Name | The full name of the blob |
| Client-Side Encryption Key ID | Specifies the ID of the key to use for client-side encryption. |
| Client-Side Encryption Key Type | Specifies the key type to use for client-side encryption. |
| Client-Side Encryption Local Key | When using local client-side encryption, this is the raw key, encoded in hexadecimal |
| Container Name | Name of the Azure storage container. In case of PutAzureBlobStorage processor, container can be created if it does not exist. |
| Range Length | The number of bytes to download from the blob, starting from the Range Start. An empty value or a value that extends beyond the end of the blob will read to the end of the blob. |
| Range Start | The byte position at which to start reading from the blob. An empty value or a value of zero will start reading at the beginning of the blob. |
| Storage Credentials | Controller Service used to obtain Azure Blob Storage Credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Unsuccessful operations will be transferred to the failure relationship. |
| success | All successfully processed FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.container | The name of the Azure Blob Storage container |
| azure.blobname | The name of the blob on Azure Blob Storage |
| azure.primaryUri | Primary location of the blob |
| azure.etag | ETag of the blob |
| azure.blobtype | Type of the blob (either BlockBlob, PageBlob or AppendBlob) |
| mime.type | MIME Type of the content |
| lang | Language code for the content |
| azure.timestamp | Timestamp of the blob |
| azure.length | Length of the blob |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in an Azure Blob Storage container |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureBlobStorage_v12](deleteazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.ListAzureBlobStorage_v12](listazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.PutAzureBlobStorage_v12](putazureblobstorage_v12.md)

---
title: FetchAzureDataLakeStorage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchazuredatalakestorage.md
section: Loading & Unloading Data
---

# FetchAzureDataLakeStorage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Fetch the specified file from Azure Data Lake Storage

## Tags

adlsgen2, azure, cloud, datalake, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ADLS Credentials | Controller Service used to obtain Azure Credentials. |
| Directory Name | Name of the Azure Storage Directory. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. In case of the PutAzureDataLakeStorage processor, the directory will be created if not already existing. |
| File Name | The filename |
| Filesystem Name | Name of the Azure Storage File System (also called Container). It is assumed to be already existing. |
| Number of Retries | The number of automatic retries to perform if the download fails. |
| Range Length | The number of bytes to download from the object, starting from the Range Start. An empty value or a value that extends beyond the end of the object will read to the end of the object. |
| Range Start | The byte position at which to start reading from the object. An empty value or a value of zero will start reading at the beginning of the object. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Azure storage for some reason are transferred to this relationship |
| success | Files that have been successfully written to Azure storage are transferred to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.datalake.storage.statusCode | The HTTP error code (if available) from the failed operation |
| azure.datalake.storage.errorCode | The Azure Data Lake Storage moniker of the failed operation |
| azure.datalake.storage.errorMessage | The Azure Data Lake Storage error message from the failed operation |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in an Azure DataLake Storage directory |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureDataLakeStorage](deleteazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.ListAzureDataLakeStorage](listazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.PutAzureDataLakeStorage](putazuredatalakestorage.md)

---
title: FetchBoxFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchboxfile.md
section: Loading & Unloading Data
---

# FetchBoxFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Fetches files from a Box Folder. Designed to be used in tandem with ListBoxFile.

## Tags

box, fetch, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the File to fetch |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here for each File for which fetch was attempted but failed. |
| success | A FlowFile will be routed here for each successfully fetched File. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The id of the file |
| filename | The name of the file |
| path | The folder path where the file is located |
| box.size | The size of the file |
| box.timestamp | The last modified time of the file |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.PutBoxFile](putboxfile.md)

---
title: FetchBoxFileInfo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchboxfileinfo.md
section: Loading & Unloading Data
---

# FetchBoxFileInfo 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Fetches metadata for files from Box and adds it to the FlowFile’s attributes.

## Tags

box, fetch, metadata, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the File to fetch metadata for |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if fetching the file metadata fails. |
| not.found | FlowFiles for which the specified Box file was not found. |
| success | A FlowFile will be routed here after successfully fetching the file metadata. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The id of the file |
| filename | The name of the file |
| path | The folder path where the file is located |
| box.path.folder.ids | A comma separated list of file path_collection IDs |
| box.size | The size of the file |
| box.timestamp | The last modified time of the file |
| box.created.at | The creation date of the file |
| box.owner | The name of the file owner |
| box.owner.id | The ID of the file owner |
| box.owner.login | The login of the file owner |
| box.description | The description of the file |
| box.etag | The etag of the file |
| box.sha1 | The SHA-1 hash of the file |
| box.content.created.at | The date the content was created |
| box.content.modified.at | The date the content was modified |
| box.item.status | The status of the file (active, trashed, etc.) |
| box.sequence_id | The sequence ID of the file |
| box.parent.folder.id | The ID of the parent folder |
| box.trashed.at | The date the file was trashed, if applicable |
| box.purged.at | The date the file was purged, if applicable |
| box.shared.link | The shared link of the file, if any |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.PutBoxFile](putboxfile.md)

---
title: FetchBoxFileMetadataInstance 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchboxfilemetadatainstance.md
section: Loading & Unloading Data
---

# FetchBoxFileMetadataInstance 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Retrieves specific metadata instance associated with a Box file using template key and scope.

## Tags

box, instance, metadata, storage, template

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file for which to fetch metadata. |
| Template Key | The metadata template key to retrieve. |
| Template Scope | The metadata template scope (e.g., ‘enterprise’, ‘global’). |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if there is an error fetching metadata instance from the file. |
| file not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile containing the metadata instance will be routed to this relationship upon successful processing. |
| template not found | FlowFiles for which the specified metadata template was not found will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file from which metadata was fetched |
| box.metadata.template.key | The metadata template key |
| box.metadata.template.scope | The metadata template scope |
| mime.type | The MIME Type of the FlowFile content |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.FetchBoxFileInfo](fetchboxfileinfo.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFileMetadataInstances](listboxfilemetadatainstances.md)

---
title: FetchBoxFileRepresentation 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchboxfilerepresentation.md
section: Loading & Unloading Data
---

# FetchBoxFileRepresentation 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Fetches a Box file representation using a representation hint and writes it to the FlowFile content.

## Tags

box, cloud, content, download, file, representation, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the Box file to retrieve. |
| Representation Type | The type of representation to fetch. Common values include ‘pdf’, ‘text’, ‘jpg’, ‘png’, etc. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that encounter errors during processing will be routed to this relationship. |
| file.not.found | FlowFiles for which the specified Box file was not found. |
| representation.not.found | FlowFiles for which the specified Box file’s requested representation was not found. |
| success | FlowFiles that are successfully processed will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the Box file. |
| box.file.name | The name of the Box file. |
| box.file.size | The size of the Box file in bytes. |
| box.file.created.time | The timestamp when the file was created. |
| box.file.modified.time | The timestamp when the file was last modified. |
| box.file.mime.type | The MIME type of the file. |
| box.file.representation.type | The representation type that was fetched. |
| box.error.message | The error message returned by Box if the operation fails. |
| box.error.code | The error code returned by Box if the operation fails. |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: FetchDistributedMapCache 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchdistributedmapcache.md
section: Loading & Unloading Data
---

# FetchDistributedMapCache 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Computes cache key(s) from FlowFile attributes, for each incoming FlowFile, and fetches the value(s) from the Distributed Map Cache associated with each key. If configured without a destination attribute, the incoming FlowFile ‘s content is replaced with the binary data received by the Distributed Map Cache. If there is no value stored under that key then the flow file will be routed to’ not-found ‘. Note that the processor will always attempt to read the entire cached value into memory before placing it in it’s destination. This could be potentially problematic if the cached value is very large.

## Tags

cache, distributed, fetch, map

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Cache Entry Identifier | A comma-delimited list of FlowFile attributes, or the results of Attribute Expression Language statements, which will be evaluated against a FlowFile in order to determine the value(s) used to identify duplicates; it is these values that are cached. NOTE: Only a single Cache Entry Identifier is allowed unless Put Cache Value In Attribute is specified. Multiple cache lookups are only supported when the destination is a set of attributes (see the documentation for ‘Put Cache Value In Attribute’ for more details including naming convention. |
| Character Set | The Character Set in which the cached value is encoded. This will only be used when routing to an attribute. |
| Distributed Cache Service | The Controller Service that is used to get the cached values. |
| Max Length To Put In Attribute | If routing the cache value to an attribute of the FlowFile (by setting the “Put Cache Value in attribute” property), the number of characters put to the attribute value will be at most this amount. This is important because attributes are held in memory and large attributes will quickly cause out of memory issues. If the output goes longer than this value, it will be truncated to fit. Consider making this smaller if able. |
| Put Cache Value In Attribute | If set, the cache value received will be put into an attribute of the FlowFile instead of a the content of theFlowFile. The attribute key to put to is determined by evaluating value of this property. If multiple Cache Entry Identifiers are selected, multiple attributes will be written, using the evaluated value of this property, appended by a period (.) and the name of the cache entry identifier. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to communicate with the cache or if the cache entry is evaluated to be blank, the FlowFile will be penalized and routed to this relationship |
| not-found | If a FlowFile’s Cache Entry Identifier was not found in the cache, it will be routed to this relationship |
| success | If the cache was successfully communicated with it will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| user-defined | If the ‘Put Cache Value In Attribute’ property is set then whatever it is set to will become the attribute key and the value would be whatever the response was from the Distributed Map Cache. If multiple cache entry identifiers are selected, multiple attributes will be written, using the evaluated value of this property, appended by a period (.) and the name of the cache entry identifier. For example, if the Cache Entry Identifier property is set to ‘id,name’, and the user-defined property is named ‘fetched’, then two attributes will be written, fetched.id and fetched.name, containing their respective values. |

## See also

* [org.apache.nifi.processors.standard.PutDistributedMapCache](putdistributedmapcache.md)

---
title: FetchDropbox 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchdropbox.md
section: Loading & Unloading Data
---

# FetchDropbox 2025.10.9.21

## Bundle

org.apache.nifi | nifi-dropbox-processors-nar

## Description

Fetches files from Dropbox. Designed to be used in tandem with ListDropbox.

## Tags

dropbox, fetch, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Dropbox Credential Service | Controller Service used to obtain Dropbox credentials (App Key, App Secret, Access Token, Refresh Token). See controller service’s Additional Details for more information. |
| File | The Dropbox identifier or path of the Dropbox file to fetch. The ‘File’should match the following regular expression pattern: /.\*|id:.\* . When ListDropbox is used for input, either ‘${dropbox.id}’ (identifying files by Dropbox id) or ‘${path}/${filename}’ (identifying files by path) can be used as ‘File’ value. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here for each File for which fetch was attempted but failed. |
| success | A FlowFile will be routed here for each successfully fetched File. |

## Writes attributes

| Name | Description |
| --- | --- |
| error.message | The error message returned by Dropbox |
| dropbox.id | The Dropbox identifier of the file |
| path | The folder path where the file is located |
| filename | The name of the file |
| dropbox.size | The size of the file |
| dropbox.timestamp | The server modified time of the file |
| dropbox.revision | Revision of the file |

## See also

* [org.apache.nifi.processors.dropbox.ListDropbox](listdropbox.md)
* [org.apache.nifi.processors.dropbox.PutDropbox](putdropbox.md)

---
title: FetchFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchfile.md
section: Loading & Unloading Data
---

# FetchFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Reads the contents of a file from disk and streams it into the contents of an incoming FlowFile. Once this is done, the file is optionally moved elsewhere or deleted to help keep the file system organized.

## Tags

fetch, files, filesystem, get, ingest, ingress, input, local, source

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Completion Strategy | Specifies what to do with the original file on the file system once it has been pulled into NiFi |
| File to Fetch | The fully-qualified filename of the file to fetch from the file system |
| Log level when file not found | Log level to use in case the file does not exist when the processor is triggered |
| Log level when permission denied | Log level to use if the current application user does not have sufficient permissions to read the file |
| Move Conflict Strategy | If Completion Strategy is set to Move File and a file already exists in the destination directory with the same name, this property specifies how that naming conflict should be resolved |
| Move Destination Directory | The directory to the move the original file to once it has been fetched from the file system. This property is ignored unless the Completion Strategy is set to “Move File”. If the directory does not exist, it will be created. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |
| write filesystem | Provides operator the ability to delete any file that NiFi has access to. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that could not be fetched from the file system for any reason other than insufficient permissions or the file not existing will be transferred to this Relationship. |
| not.found | Any FlowFile that could not be fetched from the file system because the file could not be found will be transferred to this Relationship. |
| permission.denied | Any FlowFile that could not be fetched from the file system due to the user running NiFi not having sufficient permissions will be transferred to this Relationship. |
| success | Any FlowFile that is successfully fetched from the file system will be transferred to this Relationship. |

## Use Cases Involving Other Components

|  |
| --- |
| Ingest all files from a directory into NiFi |
| Ingest specific files from a directory into NiFi, filtering on filename |

## See also

* [org.apache.nifi.processors.standard.GetFile](getfile.md)
* [org.apache.nifi.processors.standard.ListFile](listfile.md)
* [org.apache.nifi.processors.standard.PutFile](putfile.md)

---
title: FetchFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchftp.md
section: Loading & Unloading Data
---

# FetchFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Fetches the content of a file from a remote FTP server and overwrites the contents of an incoming FlowFile with the content of the remote file.

## Tags

fetch, files, ftp, get, ingest, input, remote, retrieve, source

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Completion Strategy | Specifies what to do with the original file on the server once it has been pulled into NiFi. If the Completion Strategy fails, a warning will be logged but the data will still be transferred. |
| Connection Mode | The FTP Connection Mode |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Create Directory | Used when ‘Completion Strategy’ is ‘Move File’. Specifies whether or not the remote directory should be created if it does not exist. |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Hostname | The fully-qualified hostname or IP address of the host to fetch the data from |
| Internal Buffer Size | Set the internal buffer size for buffered data streams |
| Log Level When File Not Found | Log level to use in case the file does not exist when the processor is triggered |
| Move Destination Directory | The directory on the remote server to move the original file to once it has been ingested into NiFi. This property is ignored unless the Completion Strategy is set to ‘Move File’. The specified directory must already exist on the remote system if ‘Create Directory’ is disabled, or the rename will fail. |
| Password | Password for the user account |
| Port | The port to connect to on the remote host to fetch the data from |
| Remote File | The fully qualified filename on the remote system |
| Transfer Mode | The FTP Transfer Mode |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| ftp-use-utf8 | Tells the client to use UTF-8 encoding when processing files and filenames. If set to true, the server must also support UTF-8 encoding. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | Any FlowFile that could not be fetched from the remote server due to a communications failure will be transferred to this Relationship. |
| not.found | Any FlowFile for which we receive a ‘Not Found’ message from the remote server will be transferred to this Relationship. |
| permission.denied | Any FlowFile that could not be fetched from the remote server due to insufficient permissions will be transferred to this Relationship. |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| ftp.remote.host | The hostname or IP address from which the file was pulled |
| ftp.remote.port | The port that was used to communicate with the remote FTP server |
| ftp.remote.filename | The name of the remote file that was pulled |
| filename | The filename is updated to point to the filename fo the remote file |
| path | If the Remote File contains a directory name, that directory name will be added to the FlowFile using the ‘path’ attribute |
| fetch.failure.reason | The name of the failure relationship applied when routing to any failure relationship |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in a directory of an FTP Server |

## See also

* [org.apache.nifi.processors.standard.GetFTP](getftp.md)
* [org.apache.nifi.processors.standard.GetSFTP](getsftp.md)
* [org.apache.nifi.processors.standard.PutFTP](putftp.md)
* [org.apache.nifi.processors.standard.PutSFTP](putsftp.md)

---
title: FetchGCSObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchgcsobject.md
section: Loading & Unloading Data
---

# FetchGCSObject 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Fetches a file from a Google Cloud Bucket. Designed to be used in tandem with ListGCSBucket.

## Tags

fetch, gcs, google, google cloud, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| gcp-project-id | Google Cloud Project ID |
| gcp-retry-count | How many retry attempts should be made before routing to the failure relationship. |
| gcs-bucket | Bucket of the object. |
| gcs-generation | The generation of the Object to download. If not set, the latest generation will be downloaded. |
| gcs-key | Name of the object. |
| gcs-object-range-length | The number of bytes to download from the object, starting from the Range Start. An empty value or a value that extends beyond the end of the object will read to the end of the object. |
| gcs-object-range-start | The byte position at which to start reading from the object. An empty value or a value of zero will start reading at the beginning of the object. |
| gcs-server-side-encryption-key | An AES256 Key (encoded in base64) which the object has been encrypted in. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| storage-api-url | Overrides the default storage URL. Configuring an alternative Storage API URL also overrides the HTTP Host header on requests as described in the Google documentation for Private Service Connections. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the Google Cloud Storage operation fails. |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Storage operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The name of the file, parsed if possible from the Content-Disposition response header |
| gcs.bucket | Bucket of the object. |
| gcs.key | Name of the object. |
| gcs.size | Size of the object. |
| gcs.cache.control | Data cache control of the object. |
| gcs.component.count | The number of components which make up the object. |
| gcs.content.disposition | The data content disposition of the object. |
| gcs.content.encoding | The content encoding of the object. |
| gcs.content.language | The content language of the object. |
| mime.type | The MIME/Content-Type of the object |
| gcs.crc32c | The CRC32C checksum of object’s data, encoded in base64 in big-endian order. |
| gcs.create.time | The creation time of the object (milliseconds) |
| gcs.update.time | The last modification time of the object (milliseconds) |
| gcs.encryption.algorithm | The algorithm used to encrypt the object. |
| gcs.encryption.sha256 | The SHA256 hash of the key used to encrypt the object |
| gcs.etag | The HTTP 1.1 Entity tag for the object. |
| gcs.generated.id | The service-generated for the object |
| gcs.generation | The data generation of the object. |
| gcs.md5 | The MD5 hash of the object’s data encoded in base64. |
| gcs.media.link | The media download link to the object. |
| gcs.metageneration | The metageneration of the object. |
| gcs.owner | The owner (uploader) of the object. |
| gcs.owner.type | The ACL entity type of the uploader of the object. |
| gcs.acl.owner | A comma-delimited list of ACL entities that have owner access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.acl.writer | A comma-delimited list of ACL entities that have write access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.acl.reader | A comma-delimited list of ACL entities that have read access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.uri | The URI of the object as a string. |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in a Google Compute Storage (GCS) bucket |

## See also

* [org.apache.nifi.processors.gcp.storage.DeleteGCSObject](deletegcsobject.md)
* [org.apache.nifi.processors.gcp.storage.ListGCSBucket](listgcsbucket.md)
* [org.apache.nifi.processors.gcp.storage.PutGCSObject](putgcsobject.md)

---
title: FetchGoogleDrive 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchgoogledrive.md
section: Loading & Unloading Data
---

# FetchGoogleDrive 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Fetches files from a Google Drive Folder. Designed to be used in tandem with ListGoogleDrive. Please see Additional Details to set up access to Google Drive.

## Tags

drive, fetch, google, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Google Doc Export Type | Google Documents cannot be downloaded directly from Google Drive but instead must be exported to a specified MIME Type. In the event that the incoming FlowFile’s MIME Type indicates that the file is a Google Document, this property specifies the MIME Type to export the document to. |
| Google Drawing Export Type | Google Drawings cannot be downloaded directly from Google Drive but instead must be exported to a specified MIME Type. In the event that the incoming FlowFile’s MIME Type indicates that the file is a Google Drawing, this property specifies the MIME Type to export the drawing to. |
| Google Presentation Export Type | Google Presentations cannot be downloaded directly from Google Drive but instead must be exported to a specified MIME Type. In the event that the incoming FlowFile’s MIME Type indicates that the file is a Google Presentation, this property specifies the MIME Type to export the presentation to. |
| Google Spreadsheet Export Type | Google Spreadsheets cannot be downloaded directly from Google Drive but instead must be exported to a specified MIME Type. In the event that the incoming FlowFile’s MIME Type indicates that the file is a Google Spreadsheet, this property specifies the MIME Type to export the spreadsheet to. |
| connect-timeout | Maximum wait time for connection to Google Drive service. |
| drive-file-id | The Drive ID of the File to fetch. Please see Additional Details for information on how to obtain the Drive ID. |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| read-timeout | Maximum wait time for response from Google Drive service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here for each File for which fetch was attempted but failed. |
| success | A FlowFile will be routed here for each successfully fetched File. |

## Writes attributes

| Name | Description |
| --- | --- |
| drive.id | The id of the file |
| filename | The name of the file |
| mime.type | The MIME type of the file |
| drive.size | The size of the file. Set to 0 when the file size is not available (e.g. externally stored files). |
| drive.size.available | Indicates if the file size is known / available |
| drive.timestamp | The last modified time or created time (whichever is greater) of the file. The reason for this is that the original modified date of a file is preserved when uploaded to Google Drive. ‘Created time’ takes the time when the upload occurs. However uploaded files can still be modified later. |
| drive.created.time | The file’s creation time |
| drive.modified.time | The file’s last modification time |
| drive.owner | The owner of the file |
| drive.last.modifying.user | The last modifying user of the file |
| drive.web.view.link | Web view link to the file |
| drive.web.content.link | Web content link to the file |
| drive.parent.folder.id | The id of the file’s parent folder |
| drive.parent.folder.name | The name of the file’s parent folder |
| drive.shared.drive.id | The id of the shared drive (if the file is located on a shared drive) |
| drive.shared.drive.name | The name of the shared drive (if the file is located on a shared drive) |
| error.code | The error code returned by Google Drive |
| error.message | The error message returned by Google Drive |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in a Google Drive folder |

## See also

* [org.apache.nifi.processors.gcp.drive.ListGoogleDrive](listgoogledrive.md)
* [org.apache.nifi.processors.gcp.drive.PutGoogleDrive](putgoogledrive.md)

---
title: FetchGoogleDriveFileComments 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchgoogledrivefilecomments.md
section: Loading & Unloading Data
---

# FetchGoogleDriveFileComments 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Fetches comments and their replies for a Google Drive file. The file ID can be set by a FlowFile attribute. Records include comment metadata such as deleted status, resolved status, anchors, and a nested array of replies.

## Tags

comments, drive, gcp, google, openflow, replies

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File ID | Google Drive file ID. |
| GCP Credentials Service | Controller Service used to obtain Google Cloud Platform credentials. |
| Record Writer | Specifies the Record Writer to use when writing the comments. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed here if the processor fails to retrieve comments. |
| not.found | A FlowFile is routed here if the file was not found. |
| retry | FlowFiles are routed here if a connection or rate-limit issue occurs. |
| success | All FlowFiles that are successfully processed are routed here. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | Number of comment records returned (not including replies). |
| google.drive.file.id | The file ID from which comments were fetched. |

---
title: FetchGoogleDriveMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchgoogledrivemetadata.md
section: Loading & Unloading Data
---

# FetchGoogleDriveMetadata 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Fetches Google Drive file metadata. This includes the file’s name, size, MIME type, and permissions. The file ID must be provided as a FlowFile attribute.

## Tags

authorization, cloud, drive, gcp, google, openflow, permissions, storage, unstructured

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File ID | An id of an file to retrieve the metadata for |
| GCP Credentials Service | The Controller Service used to obtain Google Cloud Platform credentials. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed here if the processor fails to retrieve Google Drive file metadata. |
| not.found | A FlowFile is routed here if the file metadata was not found |
| retry | A FlowFile is routed here if the processor should retry the request (e.g., after rate limiting). |
| success | A FlowFile is routed here after successfully retrieving Google Drive file metadata. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.drive.drive.id | The ID of the Shared Google Drive. |
| google.drive.file.name | The name of the file. |
| google.drive.created.time | The timestamp when the file was created, in milliseconds since the Unix epoch. |
| google.drive.modified.time | The timestamp when the file was modified, in milliseconds since the Unix epoch. |
| google.drive.size | The size of the file in bytes. |
| google.drive.md5 | The MD5 checksum of the file. |
| google.drive.mime.type | The MIME type of the file. |
| google.drive.version | The version of the file. This changes based on user and system based updates to the file. |
| google.drive.webUrl | A link for opening the file in a relevant Google editor or viewer in a browser. |
| google.drive.lastModifiedBy.displayName | A display name of the user that modified the file. |
| google.drive.lastModifiedBy.email | An email of the user that modified the file. |
| google.drive.permissions.<role>.users | A comma-separated list of email addresses for users with the specified role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if the owner is [john.doe@gmail.com](mailto:john.doe%40gmail.com) and users [jane.doe@gmail.com](mailto:jane.doe%40gmail.com) and [jake.doe@gmail.com](mailto:jake.doe%40gmail.com) are readers, there would be an attribute named `google.drive.permissions.owner.users` with the value `john.doe@gmail.com`, and an attribute named `google.drive.permissions.reader.users` with the value `jane.doe@gmail.com, jake.doe@gmail.com` |
| google.drive.permissions.<role>.groups | A comma-separated list of email addresses for groups with the specified role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if the owner is `employees@openflow-all-dev.iam.gserviceaccount.com` and the group `contractors@openflow-all-dev.iam.gserviceaccount.com` is a reader, there would be an attribute named `google.drive.permissions.owner.groups` with the value `employees@openflow-all-dev.iam.gserviceaccount.com`, and an attribute named `google.drive.permissions.reader.groups` with the value `contractors@openflow-all-dev.iam.gserviceaccount.com` |
| google.drive.permissions.<role>.domains | A comma-separated list of domain names for which all users have the given role. Valid roles are ‘owner’, ‘organizer’, ‘fileOrganizer’, ‘writer’, ‘commenter’, ‘reader’. For example, if all users in the domain `snowflake.com` have the role of reader, there would be an attribute named `google.drive.permissions.reader.domains` with the value `snowflake.com` |
| google.drive.permissions.<role>.public | If a file is shared publicly, this attribute will be added with a value of ‘true’ for any role that applies to the public. |
| google.drive.file.path | The hierarchical path of the file in Google Drive, e.g. ‘parent_folder/child_folder/file.txt’. |

## See also

* [com.snowflake.openflow.runtime.processors.google.CaptureGoogleDriveChanges](capturegoogledrivechanges.md)

---
title: FetchGridFS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchgridfs.md
section: Loading & Unloading Data
---

# FetchGridFS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Retrieves one or more files from a GridFS bucket by file name or by a user-defined query.

## Tags

fetch, gridfs, mongo

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gridfs-bucket-name | The GridFS bucket where the files will be stored. If left blank, it will use the default value ‘fs’ that the MongoDB client driver uses. |
| gridfs-client-service | The MongoDB client service to use for database connections. |
| gridfs-database-name | The name of the database to use |
| gridfs-file-name | The name of the file in the bucket that is the target of this processor. |
| gridfs-query | A valid MongoDB query to use to fetch one or more files from GridFS. |
| mongo-operation-mode | This option controls when results are made available to downstream processors. If Stream Query Results is enabled, provenance will not be tracked relative to the input flowfile if an input flowfile is received and starts the query. In Stream Query Results mode errors will be handled by sending a new flowfile with the original content and attributes of the input flowfile to the failure relationship. Streaming should only be used if there is reliable connectivity between MongoDB and NiFi. |
| mongo-query-attribute | If set, the query will be written to a specified attribute on the output flowfiles. |

## Relationships

| Name | Description |
| --- | --- |
| failure | When there is a failure processing the flowfile, it goes to this relationship. |
| original | The original input flowfile goes to this relationship if the query does not cause an error |
| success | When the operation succeeds, the flowfile is sent to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| gridfs.file.metadata | The custom metadata stored with a file is attached to this property if it exists. |

---
title: FetchJiraFields 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchjirafields.md
section: Loading & Unloading Data
---

# FetchJiraFields 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Retrieves comprehensive metadata for all fields available in the Jira Cloud instance using the REST API v3 /field endpoint. For each field, returns detailed information including field ID/key, display name, field properties, JQL clause names for queries, and schema details with data types.

## Tags

api, atlassian, fetch, jira, rest

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| API Token | Jira API token for authorization |
| Authorization Method | Authorization method for Jira Cloud API |
| Environment URL | URL to the Atlassian Jira Environment |
| Issue Fields | A list of fields to return for each issue. This property accepts a comma-separated list. |
| Jira Email | Email address associated with Jira account |
| Request Rate Manager | Controller service for keeping track of rate limits for Atlassian APIs |
| Web Client Service | Controller service for managing HTTP connections to Jira |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch Jira fields, e.g., due to connection issues or invalid credentials |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Jira fields |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The MIME type of the returned response, always set to ‘application/json’ |

## See also

* [com.snowflake.openflow.runtime.atlassian.jira.processors.FetchJiraIssues](fetchjiraissues.md)

---
title: FetchJiraIssues 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchjiraissues.md
section: Loading & Unloading Data
---

# FetchJiraIssues 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Fetches issues from Jira Cloud using REST API v3 with configurable search options. Provides two search modes: 1. Simple Search - Filter by project name, status category, created/updated dates 2. Advanced Search - Use custom JQL (Jira Query Language) expressions Key features: - Smart pagination handling with automatic state management - Incremental sync capability using timestamps between processor runs - Timezone-aware date handling using Jira user’s timezone - Configurable issue fields retrieval - Adds metadata to FlowFiles: source URL (jira.source.url), query (jira.query.jql), statement type (statement.type) - Adds insert,upsert attributes for downstream processing The processor maintains cluster state to resume operations after restarts Authentication is handled via basic auth using Jira email/API token credentials. Currently that is the only supported method. LIMITATIONS: - Jira issue deletes are not detected.

## Tags

api, atlassian, fetch, jira, rest

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| API Token | Jira API token for authorization |
| Authorization Method | Authorization method for Jira Cloud API |
| Created After | Filter issues created after specified date/time (optional, format: yyyy-MM-dd) |
| Environment URL | URL to the Atlassian Jira Environment |
| Issue Fields | A list of fields to return for each issue. This property accepts a comma-separated list. |
| JQL Query | JQL query string (required when using JQL query type) |
| Jira Email | Email address associated with Jira account |
| Maximum Page Size | The Maximum Page Size value must be between 50 and 1000 |
| Project Names | Comma-separated list of project names for simple search |
| Request Rate Manager | Controller service for keeping track of rate limits for Atlassian APIs |
| Search Type | Type of search to perform |
| Status Category | Status category filter for simple search (optional) |
| Updated After | Filter issues updated after specified date/time (optional, format: yyyy-MM-dd) |
| Web Client Service | Controller service for managing HTTP connections to Jira |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores pagination state to maintain position between restarts. Resets when ingestion configuration changes. |

## Relationships

| Name | Description |
| --- | --- |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Jira issues |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| jira.query.jql | The JQL query used for this fetch |
| jira.source.url | URL of the Jira source |
| statement.type | Statement type INSERT, UPSERT |

## See also

* [com.snowflake.openflow.runtime.atlassian.jira.processors.FetchJiraFields](fetchjirafields.md)

---
title: FetchMicrosoftDataverseTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchmicrosoftdataversetable.md
section: Loading & Unloading Data
---

# FetchMicrosoftDataverseTable 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-dataverse-processors-nar

## Description

Fetch records from Microsoft Dataverse Tables

## Tags

dataverse

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Delete Schema |  |
| Environment URL | URL to Microsoft Dataverse Environment |
| Logical Name | Logical Name of Dataverse Table |
| Max Page Size | Defines how many records will be fetched from Dataverse at once |
| OAuth2 Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token. |
| Record Writer | Specifies the Controller Service to use for writing out the records |
| Rows Number Limit | Defines maximum number of rows returned in a single flow file. Multiple request will be made to API to reach the limit. When not set, a page size value will be used effectively. |
| Table Name | Dataverse Table Name |
| Upsert Schema |  |
| Web Client Service Provider | Creates instance of web client. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | status |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFile with errors occurred while fetching from Dataverse. |
| retry | FlowFile with maintainable errors occurred while fetching from Dataverse. |
| success | FlowFile with fetched data stored as records. |

---
title: FetchS3Object 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchs3object.md
section: Loading & Unloading Data
---

# FetchS3Object 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves the contents of an S3 Object and writes it to the content of a FlowFile

## Tags

AWS, Amazon, Fetch, Get, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Encryption Service | Specifies the Encryption Service Controller used to configure requests. PutS3Object: For backward compatibility, this value is ignored when ‘Server Side Encryption’ is set. FetchS3Object: Only needs to be configured in case of Server-side Customer Key, Client-side KMS and Client-side Customer Key encryptions. |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Range Length | The number of bytes to download from the object, starting from the Range Start. An empty value or a value that extends beyond the end of the object will read to the end of the object. |
| Range Start | The byte position at which to start reading from the object. An empty value or a value of zero will start reading at the beginning of the object. |
| Region | The AWS Region to connect to. |
| Requester Pays | If true, indicates that the requester consents to pay any charges associated with retrieving objects from the S3 bucket. This sets the ‘x-amz-request-payer’ header to ‘requester’. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Version | The Version of the Object to download |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| s3.url | The URL that can be used to access the S3 object |
| s3.bucket | The name of the S3 bucket |
| path | The path of the file |
| absolute.path | The path of the file |
| filename | The name of the file |
| hash.value | The MD5 sum of the file |
| hash.algorithm | MD5 |
| mime.type | If S3 provides the content type/MIME type, this attribute will hold that file |
| s3.etag | The ETag that can be used to see if the file has changed |
| s3.exception | The class name of the exception thrown during processor execution |
| s3.additionalDetails | The S3 supplied detail from the failed operation |
| s3.statusCode | The HTTP error code (if available) from the failed operation |
| s3.errorCode | The S3 moniker of the failed operation |
| s3.errorMessage | The S3 exception message from the failed operation |
| s3.expirationTime | If the file has an expiration date, this attribute will be set, containing the milliseconds since epoch in UTC time |
| s3.expirationTimeRuleId | The ID of the rule that dictates this object’s expiration time |
| s3.sseAlgorithm | The server side encryption algorithm of the object |
| s3.version | The version of the S3 object |
| s3.encryptionStrategy | The name of the encryption strategy that was used to store the S3 object (if it is encrypted) |

## Use cases

|  |
| --- |
| Fetch a specific file from S3 |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in an S3 bucket |
| Retrieve only files from S3 that meet some specified criteria |
| Retrieve new files as they arrive in an S3 bucket |

## See also

* [org.apache.nifi.processors.aws.s3.CopyS3Object](copys3object.md)
* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: FetchSFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsftp.md
section: Loading & Unloading Data
---

# FetchSFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Fetches the content of a file from a remote SFTP server and overwrites the contents of an incoming FlowFile with the content of the remote file.

## Tags

fetch, files, get, ingest, input, remote, retrieve, sftp, source

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Algorithm Negotiation | Configuration strategy for SSH algorithm negotiation |
| Ciphers Allowed | A comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, aes128-cbc, aes128-ctr, [aes128-gcm@openssh.com](mailto:aes128-gcm%40openssh.com), aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, [aes256-gcm@openssh.com](mailto:aes256-gcm%40openssh.com), arcfour128, arcfour256, blowfish-cbc, [chacha20-poly1305@openssh.com](mailto:chacha20-poly1305%40openssh.com), none |
| Completion Strategy | Specifies what to do with the original file on the server once it has been pulled into NiFi. If the Completion Strategy fails, a warning will be logged but the data will still be transferred. |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Create Directory | Used when ‘Completion Strategy’ is ‘Move File’. Specifies whether or not the remote directory should be created if it does not exist. |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Disable Directory Listing | Control how ‘Move Destination Directory’ is created when ‘Completion Strategy’ is ‘Move File’ and ‘Create Directory’ is enabled. If set to ‘true’, directory listing is not performed prior to create missing directories. By default, this processor executes a directory listing command to see target directory existence before creating missing directories. However, there are situations that you might need to disable the directory listing such as the following. Directory listing might fail with some permission setups (e.g. chmod 100) on a directory. Also, if any other SFTP client created the directory after this processor performed a listing and before a directory creation request by this processor is finished, then an error is returned because the directory already exists. |
| Host Key File | If supplied, the given file will be used as the Host Key; otherwise, if ‘Strict Host Key Checking’ property is applied (set to true) then uses the ‘known_hosts’ and ‘known_hosts2’ files from ~/.ssh directory else no host key file will be used |
| Hostname | The fully-qualified hostname or IP address of the host to fetch the data from |
| Key Algorithms Allowed | A comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, [ecdsa-sha2-nistp256-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp256-cert-v01%40openssh.com), ecdsa-sha2-nistp384, [ecdsa-sha2-nistp384-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp384-cert-v01%40openssh.com), ecdsa-sha2-nistp521, [ecdsa-sha2-nistp521-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp521-cert-v01%40openssh.com), rsa-sha2-256, [rsa-sha2-256-cert-v01@openssh.com](mailto:rsa-sha2-256-cert-v01%40openssh.com), rsa-sha2-512, [rsa-sha2-512-cert-v01@openssh.com](mailto:rsa-sha2-512-cert-v01%40openssh.com), [sk-ecdsa-sha2-nistp256@openssh.com](mailto:sk-ecdsa-sha2-nistp256%40openssh.com), [sk-ssh-ed25519@openssh.com](mailto:sk-ssh-ed25519%40openssh.com), ssh-dss, [ssh-dss-cert-v01@openssh.com](mailto:ssh-dss-cert-v01%40openssh.com), ssh-ed25519, [ssh-ed25519-cert-v01@openssh.com](mailto:ssh-ed25519-cert-v01%40openssh.com), ssh-rsa, [ssh-rsa-cert-v01@openssh.com](mailto:ssh-rsa-cert-v01%40openssh.com) |
| Key Exchange Algorithms Allowed | A comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, [curve25519-sha256@libssh.org](mailto:curve25519-sha256%40libssh.org), curve448-sha512, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group15-sha512, diffie-hellman-group16-sha512, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, mlkem1024nistp384-sha384, mlkem768nistp256-sha256, mlkem768x25519-sha256, sntrup761x25519-sha512, [sntrup761x25519-sha512@openssh.com](mailto:sntrup761x25519-sha512%40openssh.com) |
| Log Level When File Not Found | Log level to use in case the file does not exist when the processor is triggered |
| Message Authentication Codes Allowed | A comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-sha1, hmac-sha1-96, [hmac-sha1-etm@openssh.com](mailto:hmac-sha1-etm%40openssh.com), hmac-sha2-256, [hmac-sha2-256-etm@openssh.com](mailto:hmac-sha2-256-etm%40openssh.com), hmac-sha2-512, [hmac-sha2-512-etm@openssh.com](mailto:hmac-sha2-512-etm%40openssh.com) |
| Move Destination Directory | The directory on the remote server to move the original file to once it has been ingested into NiFi. This property is ignored unless the Completion Strategy is set to ‘Move File’. The specified directory must already exist on the remote system if ‘Create Directory’ is disabled, or the rename will fail. |
| Password | Password for the user account |
| Port | The port to connect to on the remote host to fetch the data from |
| Private Key Passphrase | Password for the private key |
| Private Key Path | The fully qualified path to the Private Key file |
| Remote File | The fully qualified filename on the remote system |
| Send Keep Alive On Timeout | Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds. |
| Strict Host Key Checking | Indicates whether or not strict enforcement of hosts keys should be applied |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | Any FlowFile that could not be fetched from the remote server due to a communications failure will be transferred to this Relationship. |
| not.found | Any FlowFile for which we receive a ‘Not Found’ message from the remote server will be transferred to this Relationship. |
| permission.denied | Any FlowFile that could not be fetched from the remote server due to insufficient permissions will be transferred to this Relationship. |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| sftp.remote.host | The hostname or IP address from which the file was pulled |
| sftp.remote.port | The port that was used to communicate with the remote SFTP server |
| sftp.remote.filename | The name of the remote file that was pulled |
| filename | The filename is updated to point to the filename fo the remote file |
| path | If the Remote File contains a directory name, that directory name will be added to the FlowFile using the ‘path’ attribute |
| fetch.failure.reason | The name of the failure relationship applied when routing to any failure relationship |

## Use Cases Involving Other Components

|  |
| --- |
| Retrieve all files in a directory of an SFTP Server |

## See also

* [org.apache.nifi.processors.standard.GetFTP](getftp.md)
* [org.apache.nifi.processors.standard.GetSFTP](getsftp.md)
* [org.apache.nifi.processors.standard.PutFTP](putftp.md)
* [org.apache.nifi.processors.standard.PutSFTP](putsftp.md)

---
title: FetchSharepointFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsharepointfile.md
section: Loading & Unloading Data
---

# FetchSharepointFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Fetches the contents of a file from a Sharepoint Drive, optionally downloading a PDF or HTML version of the file when applicable. Any FlowFile that represents a Sharepoint folder will be routed to success without fetching contents.

## Tags

cdc, document, graph, microsoft, openflow, sharepoint, unstructured

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API |
| Download PDF/HTML Version | Sharepoint supports automatically converting certain file formats to PDF or HTML. If this property is set to `true`, the Processor will inspect the FlowFile’s filename extension to determine if the file can be converted to PDF or HTML. If the file can be converted, the Processor will download the converted version. If the file cannot be converted, the Processor will download the original file. If this property is set to `false`, the Processor will always download the original file. |
| Drive ID | The ID of the drive that contains the file to fetch |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |
| Item ID | The ID of the item to fetch |
| Update Extension | If true, the Processor will update the filename extension to match the format of the downloaded file |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed here if the processor failed to communicate with the Graph API. Can be retried |
| failure | An incoming FlowFile is routed to this relationship if the contents of the item could not be fetched |
| not.found | A FlowFile is routed here if the item was not found |
| success | An incoming FlowFile is routed to this relationship after the contents of the item have been fetched and written to the FlowFile |

## Use Cases Involving Other Components

|  |
| --- |
| Fetch a file from Sharepoint by the Site URL, Drive Name and file path. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.CaptureSharepointChanges](capturesharepointchanges.md)

---
title: FetchSharepointMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsharepointmetadata.md
section: Loading & Unloading Data
---

# FetchSharepointMetadata 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

For each drive item retrieves its metadata and permissions and writes them as FlowFile attributes.

## Tags

cdc, document, graph, library, microsoft, openflow, sharepoint, unstructured

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API |
| Drive ID | A drive id where the Sharepoint file resides |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |
| Fetch Item Permissions | If true, the Processor will fetch user and group permission information for the captured Sharepoint item. |
| Item ID | An id of an item to retrieve the metadata for |
| Item Permissions To Fetch | A comma-separated list of permission types to fetch for the captured Sharepoint item. Available permission types: USER, GROUP, SITE_USER, SITE_GROUP. |
| Site ID | A site id where the Sharepoint file resides |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed here if the processor failed to communicate with the Graph API. Can be retried |
| failure | An incoming FlowFile is routed to this relationship if the metadata and permissions of the item could not be fetched |
| not.found | A FlowFile is routed here if the item was not found |
| success | An incoming FlowFile is routed to this relationship after the metadata and permissions of the item have been fetched and written to the FlowFile attributes |

## Writes attributes

| Name | Description |
| --- | --- |
| sharepoint.item.id | The ID of the Sharepoint item. |
| sharepoint.item.type | The type of the Sharepoint item. Possible values are ‘File’ and ‘Folder’. |
| sharepoint.path | The path of the Sharepoint item. This is the path relative to the root of the Document Library. |
| sharepoint.filename | The name of the Sharepoint item. This attribute is not available for ‘Deleted’ changes. |
| sharepoint.size | The size of the Sharepoint item. |
| sharepoint.createdAt | The creation timestamp of the Sharepoint item. |
| sharepoint.lastModified | The last modified timestamp of the Sharepoint item. |
| sharepoint.createdBy.<identity>.id | An id of the identity that created the Sharepoint item. This attribute is not always available. |
| sharepoint.createdBy.<identity>.displayName | A display name of the identity that created the Sharepoint item. This attribute is not always available. |
| sharepoint.createdBy.<identity>.email | An email of the identity that created the Sharepoint item. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.id | An id of the identity that modified the Sharepoint item last. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.displayName | A display name of the identity that modified the Sharepoint item last. This attribute is not always available. |
| sharepoint.lastModifiedBy.<identity>.email | An email of the identity that modified the Sharepoint item last. This attribute is not always available. |
| sharepoint.drive.id | The ID of the Sharepoint Drive that contains the item. |
| sharepoint.site.id | The ID of the Sharepoint Site that contains the item. |
| sharepoint.ctag | The CTag of the Sharepoint item. |
| sharepoint.etag | The ETag of the Sharepoint item. |
| sharepoint.webUrl | The browser view url of the Sharepoint item. |
| sharepoint.permissions.read.groups | A comma-separated list of groups that have read permissions on the Sharepoint item. For each group, if an e-mail address is available in Sharepoint, it will be included. Additionally, the group principal, such as `mygroup@mytenant.onmicrosoft.com`, is included. |
| sharepoint.permissions.read.groups.ids | A comma-separated list of group IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.users | A comma-separated list of users that have read permissions on the Sharepoint item. For each user, if an e-mail address is available in Sharepoint, it will be included. Additionally, the user principal, such as `johndoe@mytenant.onmicrosoft.com`, is included. |
| sharepoint.permissions.read.users.ids | A comma-separated list of Microsoft365 user IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.siteusers | A comma-separated list of Sharepoint site user emails that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.siteusers.ids | A comma-separated list of Sharepoint site user IDs that have read permissions on the Sharepoint item. |
| sharepoint.permissions.read.sitegroups.ids | A comma-separated list of Sharepoint site group IDs that have read permissions on the Sharepoint item. |
| filename | The name of the Sharepoint item. |
| path | The path of the Sharepoint item. This is the path relative to the root of the Document Library. |
| mime.type | The MIME type of the Sharepoint item. This attribute is only available for ‘File’ items. |
| hash.quickxor | The QuickXor hash of the Sharepoint item. This attribute is not always available. |
| hash.sha256 | The SHA-256 hash of the Sharepoint item. This attribute is not always available. |
| hash.sha1 | The SHA-1 hash of the Sharepoint item. This attribute is not always available. |
| hash.crc32 | The CRC32 hash of the Sharepoint item. This attribute is not always available. |

---
title: FetchSlackConversationInfo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchslackconversationinfo.md
section: Loading & Unloading Data
---

# FetchSlackConversationInfo 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-slack-processors-nar

## Description

Fetches Slack conversation info and member emails

## Tags

conversation, conversation.members, slack, social media, team

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating/authorizing the Slack request sent by NiFi. This may be either a User Token or a Bot Token. It must be granted the channels:history, groups:history, im:history, or mpim:history scope, depending on the type of conversation being used. |
| Cache Expiration | User emails are cached to reduce network lookups. A longer expiration reduces network overhead but can cause data to be out of sync. |
| Cache Size | User emails are cached to reduce network lookups. A larger cache consumes memory but reduces network overhead. |
| Channel | The Slack Channel ID to retrieve info from. Leave blank to iterate over every available Conversation. |
| Rate Limiter Service | Slack Rate Limiter Service to coordinate rate limiting across processors |

## Relationships

| Name | Description |
| --- | --- |
| conversations | Each configured Slack Conversation info and members will be routed to this relationship in separate FlowFiles |
| failure | If Slack Conversation metadata is unable to be received the input FlowFile will be routed to this relationship |
| original | Original input FlowFile that has been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| conversation.members.count | Set to the number of members of the conversation |
| conversation.id | Set to the number of members of the conversation |
| channel.name | Set to the name of the channel if the conversation is a channel |
| mime.type | Set to application/json, as the output will always be in JSON format |

---
title: FetchSlackFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchslackfile.md
section: Loading & Unloading Data
---

# FetchSlackFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-slack-processors-nar

## Description

Downloads a file shared on Slack. Writes the file content to the FlowFile content and FlowFile attributes from the file.

## Tags

download, file, slack

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Bot Token | The Bot Token that is registered to your Slack application |
| Channel ID | The Slack Channel ID where the file was shared. |
| File ID | The Slack File ID to download. |
| Rate Limiter Service | Slack Rate Limiter Service to coordinate rate limiting across processors |
| Web Client Service | The Web Client Service to use for downloading files from Slack |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that could not be processed are routed to this relationship |
| success | FlowFiles containing successfully downloaded Slack files are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The MIME type of the downloaded file |
| filename | The name of the downloaded file |
| slack.file.name | The Slack File name |
| slack.file.mimetype | The Slack File MIME type |
| slack.file.size | The Slack File size in bytes |
| slack.conversation.id | The Slack Channel ID |
| slack.event.ts | The Slack event timestamp |

## See also

* [com.snowflake.openflow.runtime.processors.slack.FetchSlackConversationInfo](fetchslackconversationinfo.md)
* [com.snowflake.openflow.runtime.processors.slack.FetchSlackMessage](fetchslackmessage.md)

---
title: FetchSlackMessage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchslackmessage.md
section: Loading & Unloading Data
---

# FetchSlackMessage 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-slack-processors-nar

## Description

Fetches data about a single Slack message

## Tags

conversation, conversation.history, slack, social media, team, text, unstructured

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating/authorizing the Slack request sent by NiFi. This may be either a User Token or a Bot Token. It must be granted the channels:history, groups:history, im:history, or mpim:history scope, depending on the type of conversation being used. |
| Channel | The Slack Channel ID to Retrieve a message from. |
| Include Message Blocks | Specifies whether or not the output JSON should include the value of the ‘blocks’ field for each Slack Message. This field includes information such as individual parts of a message that are formatted using rich text. This may be useful, for instance, for parsing. However, it often accounts for a significant portion of the data and as such may be set to null when it is not useful to you. |
| Include Null Fields | Specifies whether or not fields that have null values should be included in the output JSON. If true, any field in a Slack Message that has a null value will be included in the JSON with a value of null. If false, the key omitted from the output JSON entirely. Omitting null values results in smaller messages that are generally more efficient to process, but including the values may provide a better understanding of the format, especially for schema inference. |
| Message Timestamp | The timestamp of the message which is also its ID within a channel. |
| Rate Limiter Service | Slack Rate Limiter Service to coordinate rate limiting across processors |
| Resolve Usernames | Specifies whether or not User IDs should be resolved to usernames. By default, Slack Messages provide the ID of the user that sends a message, such as U0123456789, but not the username, such as NiFiUser. The username may be resolved, but it may require additional calls to the Slack API and requires that the Token used be granted the users:read scope. If set to true, usernames will be resolved with a best-effort policy: if a username cannot be obtained, it will be skipped over. Also, note that when a username is obtained, the Message’s <username> field is populated, and the <text> field is updated such that any mention will be output such as “Hi @user” instead of “Hi <@U1234567>”. |
| Thread Timestamp | The timestamp of the thread the message belongs to. This can be null or empty unless the message is a reply to another message. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Slack messages that fail to be received will be routed to this relationship |
| not found | Slack messages that were not found on the Slack server will be routed to this relationship |
| success | Slack messages that are successfully received will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Set to application/json, as the output will always be in JSON format |

---
title: FetchSmb 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsmb.md
section: Loading & Unloading Data
---

# FetchSmb 2025.10.9.21

## Bundle

org.apache.nifi | nifi-smb-nar

## Description

Fetches files from a SMB Share. Designed to be used in tandem with ListSmb.

## Tags

cifs, fetch, files, samba, smb

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Completion Strategy | Specifies what to do with the original file on the server once it has been processed. If the Completion Strategy fails, a warning will be logged but the data will still be transferred. |
| Create Destination Directory | Specifies whether or not the remote directory should be created if it does not exist. |
| Destination Directory | The directory on the remote server to move the original file to once it has been processed. |
| remote-file | The full path of the file to be retrieved from the remote server. Expression language is supported. |
| smb-client-provider-service | Specifies the SMB client provider to use for creating SMB connections. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here when failed to fetch its content. |
| success | A FlowFile will be routed here for each successfully fetched file. |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code returned by SMB when the fetch of a file fails. |
| error.message | The error message returned by SMB when the fetch of a file fails. |

## See also

* [org.apache.nifi.processors.smb.GetSmbFile](getsmbfile.md)
* [org.apache.nifi.processors.smb.ListSmb](listsmb.md)
* [org.apache.nifi.processors.smb.PutSmbFile](putsmbfile.md)

---
title: FetchSnowflakeTableProperties 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsnowflaketableproperties.md
section: Loading & Unloading Data
---

# FetchSnowflakeTableProperties 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Reads properties from a table and stores them as flow file attributes.

## Tags

database, jdbc, openflow, snowflake

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Schema Name | The name of the schema |
| Table Metadata Cache Expiration Time | The time in seconds after which the cache entry will be removed |
| Table Name | The name of the table |
| Use Table Metadata Cache | Whether to cache table’s metadata instead of reading it directly from Snowflake. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the properties cannot be read |
| success | The incoming FlowFile is routed to this relationship after the table properties has been successfully read |
| table not found | The incoming FlowFile is routed to this relationship if the specified table does not exist. |

---
title: FetchSourceTableSchema 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchsourcetableschema.md
section: Loading & Unloading Data
---

# FetchSourceTableSchema 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Fetches the table schema (i.e., column names, data types, etc.) for a given table in a database, converting the data types to Snowflake-compatible types. The schema is written to the FlowFile content as a JSON object, in a form such as: { “columns”: [ { “name”: “<columnName>”, “type”: “<snowflakeType>”, “nullable”: <true|false>, “scale”: <scale>, “precision”: <precision> }, … ], “primaryKeys”: [“<primaryKey1>”, “<primaryKey2>”, …] }

## Tags

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Filter Service | Specifies the Column Filter Service to be used for filtering out unwanted columns |
| Connection Pool | The connection pool to use to fetch the source table schema |
| Schema Name | The name of the schema that the source table is stored in |
| Table Name | The name of the source table |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship in the event that the source table’s schema cannot be fetched |
| success | FlowFiles are routed to this relationship when the source table’s schema is successfully fetched |
| table not found | FlowFiles are routed to this relationship when the source table does not exist |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| dbms.type | The type of database management system (DBMS) that the source table is stored in. E.g. `POSTGRESQL` |
| primary.key.count | The number of primary keys in the source table |
| column.count | The number of columns in the source table |

---
title: FetchTableSnapshot 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/fetchtablesnapshot.md
section: Loading & Unloading Data
---

# FetchTableSnapshot 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Fetches a snapshot of a table from a database. The snapshot is fetched incrementally, using the primary key columns of the table to fetch rows in batches. Replicating a table without primary key is not supported. The snapshot is written to a FlowFile in the specified Record Writer format. The input FlowFile is expected to consist of a JSON representation of the table schema in the following format: { “columns”: [{ “name”: “<column name>”, “type”: “<column type>” }, { “name”: “<column name>”, “type”: “<column type>” }, … ], “primaryKeys”: [“<name of first primary key column>”, “<name of second primary key column>”, …] } Only those columns that are specified in the schema will be fetched from the table.

## Tags

database, fetch, rdbms, snapshot, snowflake, table

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The connection pool to use to fetch the database snapshot |
| Fetch Size | The maximum number of rows loaded into memory at once |
| JDBC Driver Location | Comma-separated list of files/folders and/or URLs containing the driver JAR and its dependencies (if any). For example ‘/var/tmp/postgresql-java-client-42.7.5.jar’ |
| Max Batch Size | The maximum number of rows to fetch in a single batch |
| Record Writer | The record writer to use to write the fetched snapshot |
| Schema Name | The name of the schema to fetch the snapshot from |
| Table Name | The name of the table to fetch the snapshot from |

## Relationships

| Name | Description |
| --- | --- |
| complete | When the snapshot is complete, the original FlowFile will be routed to this relationship |
| failure | If the data cannot be retrieved from the table represented by the FlowFile, the FlowFile will be routed to this relationship. |
| retryable failure | If the data cannot be retrieved from the table represented by the FlowFile but we expect it to be possible in future, the FlowFile will be routed to this relationship. |
| rows | When the snapshot is successfully retrieved from the table represented by the FlowFile, the rows will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| snapshot.complete | Indicates whether the snapshot is complete |
| rows.total.fetched | The total number of rows fetched for the table |
| rows.delta.fetched | The number of rows fetched for the table in the last iteration |
| start.row.index | The index of the first row within the snapshot for a given iteration, starting from 0 |
| last.row.index | The index of the last row within the snapshot for a given iteration, starting from 0 |
| fetch.delta.time.in.millis | The time in milliseconds taken to fetch the rows in the last iteration |
| fetch.total.time.in.millis | The time in milliseconds taken so far to fetch the rows |

---
title: FilterAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/filterattribute.md
section: Loading & Unloading Data
---

# FilterAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Filters the attributes of a FlowFile by retaining specified attributes and removing the rest or by removing specified attributes and retaining the rest.

## Tags

Attribute Expression Language, attributes, delete, filter, modification, regex, regular expression, remove, retain

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Matching Strategy | Specifies the strategy to filter attributes by. |
| Filter Mode | Specifies the strategy to apply on filtered attributes. Either ‘Remove’ or ‘Retain’ only the matching attributes. |
| Filtered Attributes | A set of attribute names to filter from FlowFiles. Each attribute name is separated by the comma delimiter ‘,’. |
| Filtered Attributes Pattern | A regular expression to match names of attributes to filter from FlowFiles. |

## Relationships

| Name | Description |
| --- | --- |
| success | All successful FlowFiles are routed to this relationship |

## Use cases

|  |
| --- |
| Retain all FlowFile attributes matching a regular expression |
| Remove only a specified set of FlowFile attributes |

---
title: FindConfluencePages 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/findconfluencepages.md
section: Loading & Unloading Data
---

# FindConfluencePages 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor for finding Confluence pages using space name and page name.

## Tags

Preview, atlassian, confluence, fetch, pages

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Confluence Page Name | Name of the Confluence Page. If not provided, all pages in the space will be retrieved. |
| Confluence Space Name | Name of the Confluence Space |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to find Confluence pages |
| not found | Pages for given space name and page name not found |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully found Confluence pages |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.page.name | Unique identifier of the Confluence page. |
| confluence.page.change.type | Informs about status change for the searched page. |
| confluence.page.url | Confluence page url. |
| confluence.page.title | Confluence page title. |
| confluence.page.last.modification.date | Last modification date of the Confluence page. |
| confluence.space.name | Name of the Confluence space. |

---
title: FindSharepointDriveItem 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/findsharepointdriveitem.md
section: Loading & Unloading Data
---

# FindSharepointDriveItem 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Finds a Sharepoint Drive Item by its Drive ID and Item path.

## Tags

document, graph, microsoft, openflow, sharepoint, unstructured

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API. |
| Drive ID | The ID of the Sharepoint Drive. |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |
| Item Path | The path of the Drive Item to find in a Drive. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed here if the processor failed to communicate with the Graph API. Can be retried |
| failure | An incoming FlowFile is routed to this relationship if an unexpected error has occurred |
| found | An incoming FlowFile is routed to this relationship, with attributes about the Item added, if the specified item was found in Sharepoint |
| not.found | An incoming FlowFile is routed to this relationship if the specified item was not found in Sharepoint |

## Writes attributes

| Name | Description |
| --- | --- |
| sharepoint.item.id | The ID of the Sharepoint Drive Item. |
| sharepoint.item.type | The type of the Sharepoint Drive Item, possible values are ‘File’ and ‘Folder’. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.FetchSharepointFile](fetchsharepointfile.md)
* [com.snowflake.openflow.runtime.processors.sharepoint.ListSharepointDrives](listsharepointdrives.md)

---
title: FlattenJson 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/flattenjson.md
section: Loading & Unloading Data
---

# FlattenJson 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Provides the user with the ability to take a nested JSON document and flatten it into a simple key/value pair document. The keys are combined at each level with a user-defined separator that defaults to ‘.’. This Processor also allows to unflatten back the flattened json. It supports four kinds of flatten mode such as normal, keep-arrays, dot notation for MongoDB query and keep-primitive-arrays. Default flatten mode is ‘keep-arrays’.

## Tags

flatten, json, unflatten

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| flatten-json-character-set | The Character Set in which file is encoded |
| flatten-json-pretty-print-json | Specifies whether or not resulted json should be pretty printed |
| flatten-json-return-type | Specifies the desired return type of json such as flatten/unflatten |
| flatten-json-separator | The separator character used for joining keys. Must be a JSON-legal character. |
| flatten-mode | Specifies how json should be flattened/unflattened |
| ignore-reserved-characters | If true, reserved characters in keys will be ignored |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that cannot be flattened/unflattened go to this relationship. |
| success | Successfully flattened/unflattened files go to this relationship. |

---
title: ForkEnrichment 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/forkenrichment.md
section: Loading & Unloading Data
---

# ForkEnrichment 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Used in conjunction with the JoinEnrichment processor, this processor is responsible for adding the attributes that are necessary for the JoinEnrichment processor to perform its function. Each incoming FlowFile will be cloned. The original FlowFile will have appropriate attributes added and then be transferred to the ‘original’ relationship. The clone will have appropriate attributes added and then be routed to the ‘enrichment’ relationship. See the documentation for the JoinEnrichment processor (and especially its Additional Details) for more information on how these Processors work together and how to perform enrichment tasks in NiFi by using these Processors.

## Tags

enrich, fork, join, record

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Relationships

| Name | Description |
| --- | --- |
| enrichment | A clone of the incoming FlowFile will be routed to this relationship, after adding appropriate attributes. |
| original | The incoming FlowFile will be routed to this relationship, after adding appropriate attributes. |

## Writes attributes

| Name | Description |
| --- | --- |
| enrichment.group.id | The Group ID to use in order to correlate the ‘original’ FlowFile with the ‘enrichment’ FlowFile. |
| enrichment.role | The role to use for enrichment. This will either be ORIGINAL or ENRICHMENT. |

## See also

* [org.apache.nifi.processors.standard.JoinEnrichment](joinenrichment.md)

---
title: ForkRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/forkrecord.md
section: Loading & Unloading Data
---

# ForkRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor allows the user to fork a record into multiple records. The user must specify at least one Record Path, as a dynamic property, pointing to a field of type ARRAY containing RECORD objects. The processor accepts two modes: ‘split’ and ‘extract’. In both modes, there is one record generated per element contained in the designated array. In the ‘split’ mode, each generated record will preserve the same schema as given in the input but the array will contain only one element. In the ‘extract’ mode, the element of the array must be of record type and will be the generated record. Additionally, in the ‘extract’ mode, it is possible to specify if each generated record should contain all the fields of the parent records from the root level to the extracted record. This assumes that the fields to add in the record are defined in the schema of the Record Writer controller service. See examples in the additional details documentation of this processor.

## Tags

array, content, event, fork, record, stream

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| fork-mode | Specifies the forking mode of the processor |
| include-parent-fields | This parameter is only valid with the ‘extract’ mode. If set to true, all the fields from the root level to the given array will be added as fields of each element of the array to fork. |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | In case a FlowFile generates an error during the fork operation, it will be routed to this relationship |
| fork | The FlowFiles containing the forked records will be routed to this relationship |
| original | The original FlowFiles will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The generated FlowFile will have a ‘record.count’ attribute indicating the number of records that were written to the FlowFile. |
| mime.type | The MIME Type indicated by the Record Writer |
| <Attributes from Record Writer> | Any Attribute that the configured Record Writer returns will be added to the FlowFile. |

---
title: FreeFormTextRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/freeformtextrecordsetwriter.md
section: Loading & Unloading Data
---

# FreeFormTextRecordSetWriter

## Description

Writes the contents of a RecordSet as free-form text. The configured text is able to make use of the Expression Language to reference each of the fields that are available in a Record, as well as the attributes in the FlowFile and variables. If there is a name collision, the field name/value is used before attributes or variables. Each record in the RecordSet will be separated by a single newline character.

## Tags

el, expression, freeform, language, record, recordset, resultset, serialize, text, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Character Set \* | Character Set | UTF-8 |  | The Character set to use when writing the data to the FlowFile |
| Text \* | Text |  |  | The text to use when writing the results. This property will evaluate the Expression Language using any of the fields available in a Record. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: GCPCredentialsControllerService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/gcpcredentialscontrollerservice.md
section: Loading & Unloading Data
---

# GCPCredentialsControllerService

## Description

Defines credentials for Google Cloud Platform processors. Uses Application Default credentials without configuration. Application Default credentials support environmental variable (GOOGLE_APPLICATION_CREDENTIALS) pointing to a credential file, the config generated by `gcloud auth application-default login`, AppEngine/Compute Engine service accounts, etc.

## Tags

credentials, gcp, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Delegation Strategy \* | Delegation Strategy | Service Account | * Service Account * Delegated Account | The Delegation Strategy determines which account is used when calls are made with the GCP Credential. |
| Delegation User \* | Delegation User |  |  | This user will be impersonated by the service account for api calls. API calls made using this credential will appear as if they are coming from delegate user with the delegate user’s access. Any scopes supplied from processors to this credential must have domain-wide delegation setup with the service account. |
| Use Application Default Credentials | application-default-credentials | false | * true * false | If true, uses Google Application Default Credentials, which checks the GOOGLE_APPLICATION_CREDENTIALS environment variable for a filepath to a service account JSON key, the config generated by the gcloud sdk, the App Engine service account, and the Compute Engine service account. |
| Use Compute Engine Credentials | compute-engine-credentials | false | * true * false | If true, uses Google Compute Engine Credentials of the Compute Engine VM Instance which NiFi is running on. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| Service Account JSON | service-account-json |  |  | The raw JSON containing a Service Account keyfile. |
| Service Account JSON File | service-account-json-file |  |  | Path to a file containing a Service Account key file in JSON format. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| access environment credentials | The default configuration can read environment variables and system properties for credentials |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: GCSFileResourceService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/gcsfileresourceservice.md
section: Loading & Unloading Data
---

# GCSFileResourceService

## Description

Provides a Google Compute Storage (GCS) file resource for other components.

## Tags

file, gcs, resource

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Bucket \* | Bucket | ${gcs.bucket} |  | Bucket of the object. |
| Name \* | Name | ${filename} |  | Name of the object. |
| GCP Credentials Provider Service \* | gcp-credentials-provider-service |  |  | The Controller Service used to obtain Google Cloud Platform credentials. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: GenerateAnswersFromContext 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generateanswersfromcontext.md
section: Loading & Unloading Data
---

# GenerateAnswersFromContext 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-rag-evaluation-processors-nar

## Description

Generates synthetic answers for each question present in the incoming records using a Large Language Model (LLM). For every record, the processor extracts the question and its associated context based on the specified RecordPaths, constructs a prompt, and sends it to an LLM provider to obtain a synthetic answer. The generated answer is then inserted into the record at the designated RecordPath.

## Tags

ai, answers, contextual, generation, llm, nlp, openai, openflow, rag, synthetic

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Answer Record Path | The RecordPath to the synthetically generated answers |
| Context Record Path | The RecordPath to the array of contexts in the record. |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Max Character Context Length | Maximum character length of context window. |
| Question Record Path | The RecordPath to the question field in the record. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| answers.successfully.generated | The total number of successfully generated synthetic answers for the FlowFile. |
| answers.failed.generated | The total number of synthetic answer generation attempts that failed for the FlowFile. |
| json.parse.failures | Number of JSON parse failures encountered. |

---
title: GenerateAnswersFromGroundTruth 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generateanswersfromgroundtruth.md
section: Loading & Unloading Data
---

# GenerateAnswersFromGroundTruth 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-rag-evaluation-processors-nar

## Description

Generates synthetic answers for each question in the incoming records using an LLM. The synthetic answers are added to the specified RecordPath within each record. Additionally, the processor tracks the number of answers generated and updates the FlowFile attributes accordingly.

## Tags

ai, answers, generation, llm, nlp, openai, openflow, rag, synthetic

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Answer Record Path | The RecordPath to the synthetically generated answers. |
| Ground Truth Record Path | The RecordPath to the ground truth field in the record. |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Question Record Path | The RecordPath to the question field in the record. |
| Record Reader | The Record Reader to use for reading the FlowFile. |
| Record Writer | The Record Writer to use for writing the results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| answers.successfully.generated | The total number of successfully synthetic answers generated for the FlowFile. |
| answers.failed.generated | The total number of failed answer generation for the FlowFile. |
| json.parse.failures | Number of JSON parse failures encountered. |

---
title: GenerateFlowFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generateflowfile.md
section: Loading & Unloading Data
---

# GenerateFlowFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor creates FlowFiles with random data or custom content. GenerateFlowFile is useful for load testing, configuration, and simulation. Also see DuplicateFlowFile for additional load testing.

## Tags

generate, load, random, test

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of FlowFiles to be transferred in each invocation |
| Data Format | Specifies whether the data should be Text or Binary |
| File Size | The size of the file that will be used |
| Unique FlowFiles | If true, each FlowFile that is generated will be unique. If false, a random value will be generated and all FlowFiles will get the same content but this offers much higher throughput |
| character-set | Specifies the character set to use when writing the bytes of Custom Text to a flow file. |
| generate-ff-custom-text | If Data Format is text and if Unique FlowFiles is false, then this custom text will be used as content of the generated FlowFiles and the File Size will be ignored. Finally, if Expression Language is used, evaluation will be performed only once per batch of generated FlowFiles |
| mime-type | Specifies the value to set for the “mime.type” attribute. |

## Relationships

| Name | Description |
| --- | --- |
| success |  |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the MIME type of the output if the ‘Mime Type’ property is set |

---
title: GenerateJSON 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generatejson.md
section: Loading & Unloading Data
---

# GenerateJSON 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-record-generation-nar

## Description

Produces a batch of JSON Objects with random field values based on a configurable JSON Schema.

## Tags

JSON, JSON Schema, generate, random

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | Number of records generated per FlowFile produced |
| JSON Schema | JSON Schema version 2020-12 describing an object with properties indicating type and format for each field |
| Output Structure | Structure for writing batches of records to each FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles with generated JSON records |

---
title: GenerateRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generaterecord.md
section: Loading & Unloading Data
---

# GenerateRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor creates FlowFiles with records having random value for the specified fields. GenerateRecord is useful for testing, configuration, and simulation. It uses either user-defined properties to define a record schema or a provided schema and generates the specified number of records using random data for the fields in the schema.

## Tags

fake, generate, random, test

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| null-percentage | The percent probability (0-100%) that a generated value for any nullable field will be null. Set this property to zero to have no null values, or 100 to have all null values. |
| nullable-fields | Whether the generated fields will be nullable. Note that this property is ignored if Schema Text is set. Also it only affects the schema of the generated data, not whether any values will be null. If this property is true, see ‘Null Value Percentage’ to set the probability that any generated field will be null. |
| number-of-records | Specifies how many records will be generated for each outgoing FlowFile. |
| record-writer | Specifies the Controller Service to use for writing out the records |
| schema-text | The text of an Avro-formatted Schema used to generate record data. If this property is set, any user-defined properties are ignored. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles that are successfully created will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile |

---
title: GenerateTableFetch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/generatetablefetch.md
section: Loading & Unloading Data
---

# GenerateTableFetch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates SQL select queries that fetch “pages” of rows from a table. The partition size property, along with the table ‘s row count, determine the size and number of pages and generated FlowFiles. In addition, incremental fetching can be achieved by setting Maximum-Value Columns, which causes the processor to track the columns’ maximum values, thus only fetching rows whose columns ‘values exceed the observed maximums. This processor is intended to be run on the Primary Node only. This processor can accept incoming connections; the behavior of the processor is different whether incoming connections are provided: - If no incoming connection(s) are specified, the processor will generate SQL queries on the specified processor schedule. Expression Language is supported for many fields, but no FlowFile attributes are available. However the properties will be evaluated using the Environment/System properties. - If incoming connection(s) are specified and no FlowFile is available to a processor task, no work will be performed. - If incoming connection(s) are specified and a FlowFile is available to a processor task, the FlowFile’s attributes may be used in Expression Language for such fields as Table Name and others. However, the Max-Value Columns and Columns to Return fields must be empty or refer to columns that are available in each specified table.

## Tags

database, fetch, generate, jdbc, query, select, sql

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Columns to Return | A comma-separated list of column names to be used in the query. If your database requires special treatment of the names (quoting, e.g.), each name should include such treatment. If no column names are supplied, all columns in the specified table will be returned. NOTE: It is important to use consistent column names for a given table for incremental fetch to work properly. |
| Database Connection Pooling Service | The Controller Service that is used to obtain a connection to the database. |
| Database Dialect Service | Database Dialect Service for generating statements specific to a particular service or vendor. |
| Max Wait Time | The maximum amount of time allowed for a running SQL select query , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| Maximum-value Columns | A comma-separated list of column names. The processor will keep track of the maximum value for each column that has been returned since the processor started running. Using multiple columns implies an order to the column list, and each column ‘s values are expected to increase more slowly than the previous columns’ values. Thus, using multiple columns implies a hierarchical structure of columns, which is usually used for partitioning tables. This processor can be used to retrieve only those rows that have been added/updated since the last retrieval. Note that some JDBC types such as bit/boolean are not conducive to maintaining maximum value, so columns of these types should not be listed in this property, and will result in error(s) during processing. If no columns are provided, all rows from the table will be considered, which could have a performance impact. NOTE: It is important to use consistent max-value column names for a given table for incremental fetch to work properly. |
| Table Name | The name of the database table to be queried. |
| db-fetch-db-type | Database Type for generating statements specific to a particular service or vendor. The Generic Type supports most cases but selecting a specific type enables optimal processing or additional features. |
| db-fetch-where-clause | A custom clause to be added in the WHERE condition when building SQL queries. |
| gen-table-column-for-val-partitioning | The name of a column whose values will be used for partitioning. The default behavior is to use row numbers on the result set for partitioning into ‘pages’ to be fetched from the database, using an offset/limit strategy. However for certain databases, it can be more efficient under the right circumstances to use the column values themselves to define the ‘pages’. This property should only be used when the default queries are not performing well, when there is no maximum-value column or a single maximum-value column whose type can be coerced to a long integer (i.e. not date or timestamp), and the column values are evenly distributed and not sparse, for best performance. |
| gen-table-custom-orderby-column | The name of a column to be used for ordering the results if Max-Value Columns are not provided and partitioning is enabled. This property is ignored if either Max-Value Columns is set or Partition Size = 0. NOTE: If neither Max-Value Columns nor Custom ORDER BY Column is set, then depending on the database/driver, the processor may report an error and/or the generated SQL may result in missing and/or duplicate rows. This is because without an explicit ordering, fetching each partition is done using an arbitrary ordering. |
| gen-table-fetch-partition-size | The number of result rows to be fetched by each generated SQL statement. The total number of rows in the table divided by the partition size gives the number of SQL statements (i.e. FlowFiles) generated. A value of zero indicates that a single FlowFile is to be generated whose SQL statement will fetch all rows in the table. |
| gen-table-output-flowfile-on-zero-results | Depending on the specified properties, an execution of this processor may not result in any SQL statements generated. When this property is true, an empty FlowFile will be generated (having the parent of the incoming FlowFile if present) and transferred to the ‘success’ relationship. When this property is false, no output FlowFiles will be generated. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a query on the specified table, the maximum values for the specified column(s) will be retained for use in future executions of the query. This allows the Processor to fetch only those records that have max values greater than the retained values. This can be used for incremental fetching, fetching of newly added rows, etc. To clear the maximum values, clear the state of the processor per the State Management documentation |

## Relationships

| Name | Description |
| --- | --- |
| failure | This relationship is only used when SQL query execution (using an incoming FlowFile) failed. The incoming FlowFile will be penalized and routed to this relationship. If no incoming connection(s) are specified, this relationship is unused. |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| generatetablefetch.sql.error | If the processor has incoming connections, and processing an incoming FlowFile causes a SQL Exception, the FlowFile is routed to failure and this attribute is set to the exception message. |
| generatetablefetch.tableName | The name of the database table to be queried. |
| generatetablefetch.columnNames | The comma-separated list of column names used in the query. |
| generatetablefetch.whereClause | Where clause used in the query to get the expected rows. |
| generatetablefetch.maxColumnNames | The comma-separated list of column names used to keep track of data that has been returned since the processor started running. |
| generatetablefetch.limit | The number of result rows to be fetched by the SQL statement. |
| generatetablefetch.offset | Offset to be used to retrieve the corresponding partition. |
| fragment.identifier | All FlowFiles generated from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results. |
| fragment.count | This is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet. |
| fragment.index | This is the position of this FlowFile in the list of outgoing FlowFiles that were all generated from the same execution. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same execution and in what order FlowFiles were produced |

## See also

* [org.apache.nifi.processors.standard.ExecuteSQL](executesql.md)
* [org.apache.nifi.processors.standard.ListDatabaseTables](listdatabasetables.md)
* [org.apache.nifi.processors.standard.QueryDatabaseTable](querydatabasetable.md)

---
title: GeoEnrichIP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/geoenrichip.md
section: Loading & Unloading Data
---

# GeoEnrichIP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-enrich-nar

## Description

Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes. The geo data is provided as a MaxMind database. The attribute that contains the IP address to lookup is provided by the ‘IP Address Attribute’ property. If the name of the attribute provided is ‘X’, then the attributes added by enrichment will take the form X.geo.<fieldName>

## Tags

enrich, geo, ip, maxmind

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| IP Address Attribute | The name of an attribute whose value is a dotted decimal IP address for which enrichment should occur |
| Log Level | The Log Level to use when an IP is not found in the database. Accepted values: INFO, DEBUG, WARN, ERROR. |
| MaxMind Database File | Path to Maxmind IP Enrichment Database File |

## Relationships

| Name | Description |
| --- | --- |
| found | Where to route flow files after successfully enriching attributes with data provided by database |
| not found | Where to route flow files after unsuccessfully enriching attributes because no data was found |

## Writes attributes

| Name | Description |
| --- | --- |
| X.geo.lookup.micros | The number of microseconds that the geo lookup took |
| X.geo.city | The city identified for the IP address |
| X.geo.accuracy | The accuracy radius if provided by the database (in Kilometers) |
| X.geo.latitude | The latitude identified for this IP address |
| X.geo.longitude | The longitude identified for this IP address |
| X.geo.subdivision.N | Each subdivision that is identified for this IP address is added with a one-up number appended to the attribute name, starting with 0 |
| X.geo.subdivision.isocode.N | The ISO code for the subdivision that is identified by X.geo.subdivision.N |
| X.geo.country | The country identified for this IP address |
| X.geo.country.isocode | The ISO Code for the country identified |
| X.geo.postalcode | The postal code for the country identified |

---
title: GeoEnrichIPRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/geoenrichiprecord.md
section: Loading & Unloading Data
---

# GeoEnrichIPRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-enrich-nar

## Description

Looks up geolocation information for an IP address and adds the geo information to FlowFile attributes. The geo data is provided as a MaxMind database. This version uses the NiFi Record API to allow large scale enrichment of record-oriented data sets. Each field provided by the MaxMind database can be directed to a field of the user’s choosing by providing a record path for that field configuration.

## Tags

enrich, geo, ip, maxmind, record

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| City Record Path | Record path for putting the city identified for the IP address |
| Country ISO Code Record Path | Record path for putting the ISO Code for the country identified |
| Country Postal Code Record Path | Record path for putting the postal code for the country identified |
| Country Record Path | Record path for putting the country identified for this IP address |
| IP Address Record Path | The record path to retrieve the IP address for doing the lookup. |
| Latitude Record Path | Record path for putting the latitude identified for this IP address |
| Log Level | The Log Level to use when an IP is not found in the database. Accepted values: INFO, DEBUG, WARN, ERROR. |
| Longitude Record Path | Record path for putting the longitude identified for this IP address |
| MaxMind Database File | Path to Maxmind IP Enrichment Database File |
| Record Reader | Record reader service to use for reading the flowfile contents. |
| Record Writer | Record writer service to use for enriching the flowfile contents. |
| Separate Enriched From Not Enriched | Separate records that have been enriched from ones that have not. Default behavior is to send everything to the found relationship if even one record is enriched. |

## Relationships

| Name | Description |
| --- | --- |
| found | Where to route flow files after successfully enriching attributes with data provided by database |
| not found | Where to route flow files after unsuccessfully enriching attributes because no data was found |
| original | The original input flowfile goes to this relationship regardless of whether the content was enriched or not. |

---
title: GetAmazonAdsReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getamazonadsreport.md
section: Loading & Unloading Data
---

# GetAmazonAdsReport 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-amazon-ads-processors-nar

## Description

Processor downloading report from Amazon Ads if ready.

## Tags

Amazon, Amazon Ads, report

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token Provider | Service providing OAuth access token. |
| Amazon Advertising Client ID | Client ID of the Amazon Advertising user. |
| Region | Environment from which advertising data will be downloaded. |
| Report ID | ID of the generated report. |
| Report Profile ID | The profile ID associated with an advertising account in a specific marketplace. |
| Web Client Service Provider | Service providing client for REST request execution. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Error FlowFiles transferred when receiving error response from Amazon Ads Reporting API or when an error occurred during response processing. |
| retry | Response FlowFiles transferred when report prepared by Amazon Ads Reporting API is not yet ready to be downloaded. |
| success | Response FlowFiles transferred when receiving COMPLETED response from Amazon Ads Reporting API. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Mime type of the returned report. |

---
title: GetAwsPollyJobStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getawspollyjobstatus.md
section: Loading & Unloading Data
---

# GetAwsPollyJobStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves the current status of an AWS Polly job.

## Tags

AWS, Amazon, ML, Machine Learning, Polly

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| AWS Task ID |  |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The job failed, the original FlowFile will be routed to this relationship. |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | Job successfully finished. FlowFile will be routed to this relation. |

## Writes attributes

| Name | Description |
| --- | --- |
| PollyS3OutputBucket | The bucket name where polly output will be located. |
| filename | Object key of polly output. |
| outputLocation | S3 path-style output location of the result. |

## See also

* [org.apache.nifi.processors.aws.ml.polly.StartAwsPollyJob](startawspollyjob.md)

---
title: GetAwsTextractJobStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getawstextractjobstatus.md
section: Loading & Unloading Data
---

# GetAwsTextractJobStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves the current status of an AWS Textract job.

## Tags

AWS, Amazon, ML, Machine Learning, Textract

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| AWS Task ID |  |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Textract Type | Supported values: “Document Analysis”, “Document Text Detection”, “Expense Analysis” |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The job failed, the original FlowFile will be routed to this relationship. |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | Job successfully finished. FlowFile will be routed to this relation. |
| throttled | Retrieving results failed for some reason, but the issue is likely to resolve on its own, such as Provisioned Throughput Exceeded or a Throttling failure. It is generally expected to retry this relationship. |

## See also

* [org.apache.nifi.processors.aws.ml.textract.StartAwsTextractJob](startawstextractjob.md)

---
title: GetAwsTranscribeJobStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getawstranscribejobstatus.md
section: Loading & Unloading Data
---

# GetAwsTranscribeJobStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves the current status of an AWS Transcribe job.

## Tags

AWS, Amazon, ML, Machine Learning, Transcribe

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| AWS Task ID |  |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The job failed, the original FlowFile will be routed to this relationship. |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | Job successfully finished. FlowFile will be routed to this relation. |
| throttled | Retrieving results failed for some reason, but the issue is likely to resolve on its own, such as Provisioned Throughput Exceeded or a Throttling failure. It is generally expected to retry this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| outputLocation | S3 path-style output location of the result. |

## See also

* [org.apache.nifi.processors.aws.ml.transcribe.StartAwsTranscribeJob](startawstranscribejob.md)

---
title: GetAwsTranslateJobStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getawstranslatejobstatus.md
section: Loading & Unloading Data
---

# GetAwsTranslateJobStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves the current status of an AWS Translate job.

## Tags

AWS, Amazon, ML, Machine Learning, Translate

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| AWS Task ID |  |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The job failed, the original FlowFile will be routed to this relationship. |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | Job successfully finished. FlowFile will be routed to this relation. |
| throttled | Retrieving results failed for some reason, but the issue is likely to resolve on its own, such as Provisioned Throughput Exceeded or a Throttling failure. It is generally expected to retry this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| outputLocation | S3 path-style output location of the result. |

## See also

* [org.apache.nifi.processors.aws.ml.translate.StartAwsTranslateJob](startawstranslatejob.md)

---
title: GetAzureEventHub 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getazureeventhub.md
section: Loading & Unloading Data
---

# GetAzureEventHub 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Receives messages from Microsoft Azure Event Hubs without reliable checkpoint tracking. In clustered environment, GetAzureEventHub processor instances work independently and all cluster nodes process all messages (unless running the processor in Primary Only mode). ConsumeAzureEventHub offers the recommended approach to receiving messages from Azure Event Hubs. This processor creates a thread pool for connections to Azure Event Hubs.

## Tags

azure, cloud, eventhub, events, microsoft, streaming, streams

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Consumer Group | The name of the consumer group to use when pulling events |
| Event Hub Name | Name of Azure Event Hubs source |
| Event Hub Namespace | Namespace of Azure Event Hubs prefixed to Service Bus Endpoint domain |
| Message Enqueue Time | A timestamp (ISO-8601 Instant) formatted as YYYY-MM-DDThhmmss.sssZ (2016-01-01T01:01:01.000Z) from which messages should have been enqueued in the Event Hub to start reading from |
| Partition Receiver Fetch Size | The number of events that a receiver should fetch from an Event Hubs partition before returning. The default is 100 |
| Partition Receiver Timeout | The amount of time in milliseconds a Partition Receiver should wait to receive the Fetch Size before returning. The default is 60000 |
| Service Bus Endpoint | To support namespaces not in the default windows.net domain. |
| Shared Access Policy Key | The key of the shared access policy. Either the primary or the secondary key can be used. |
| Shared Access Policy Name | The name of the shared access policy. This policy must have Listen claims. |
| Transport Type | Advanced Message Queuing Protocol Transport Type for communication with Azure Event Hubs |
| Use Azure Managed Identity | Choose whether or not to use the managed identity of Azure VM/VMSS |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | Any FlowFile that is successfully received from the event hub will be transferred to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| eventhub.enqueued.timestamp | The time (in milliseconds since epoch, UTC) at which the message was enqueued in the event hub |
| eventhub.offset | The offset into the partition at which the message was stored |
| eventhub.sequence | The Azure sequence number associated with the message |
| eventhub.name | The name of the event hub from which the message was pulled |
| eventhub.partition | The name of the event hub partition from which the message was pulled |
| eventhub.property.\* | The application properties of this message. IE: ‘application’ would be ‘eventhub.property.application’ |

## See also

* [org.apache.nifi.processors.azure.eventhub.ConsumeAzureEventHub](consumeazureeventhub.md)

---
title: GetAzureQueueStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getazurequeuestorage_v12.md
section: Loading & Unloading Data
---

# GetAzureQueueStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Retrieves the messages from an Azure Queue Storage. The retrieved messages will be deleted from the queue by default. If the requirement is to consume messages without deleting them, set ‘Auto Delete Messages’ to ‘false’. Note: There might be chances of receiving duplicates in situations like when a message is received but was unable to be deleted from the queue due to some unexpected situations.

## Tags

azure, cloud, dequeue, microsoft, queue, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Auto Delete Messages | Specifies whether the received message is to be automatically deleted from the queue. |
| Credentials Service | Controller Service used to obtain Azure Storage Credentials. |
| Endpoint Suffix | Storage accounts in public Azure always use a common FQDN suffix. Override this endpoint suffix with a different suffix in certain circumstances (like Azure Stack or non-public Azure regions). |
| Message Batch Size | The number of messages to be retrieved from the queue. |
| Queue Name | Name of the Azure Storage Queue |
| Request Timeout | The timeout for read or write requests to Azure Queue Storage. Defaults to 1 second. |
| Visibility Timeout | The duration during which the retrieved message should be invisible to other consumers. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| success | All successfully processed FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.queue.uri | The absolute URI of the configured Azure Queue Storage |
| azure.queue.insertionTime | The time when the message was inserted into the queue storage |
| azure.queue.expirationTime | The time when the message will expire from the queue storage |
| azure.queue.messageId | The ID of the retrieved message |
| azure.queue.popReceipt | The pop receipt of the retrieved message |

## See also

* [org.apache.nifi.processors.azure.storage.queue.PutAzureQueueStorage_v12](putazurequeuestorage_v12.md)

---
title: GetBoxFileCollaborators 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getboxfilecollaborators.md
section: Loading & Unloading Data
---

# GetBoxFileCollaborators 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Retrieves all collaborators on a Box file and adds the collaboration information to the FlowFile’s attributes.

## Tags

box, collaboration, permissions, sharing, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the Box file to retrieve collaborators for |
| Roles | A comma-separated list of collaboration roles to retrieve. Available roles: editor, viewer, previewer, uploader, previewer uploader, viewer uploader, co-owner, owner. If not specified, no filtering by role will be applied. |
| Statuses | A comma-separated list of collaboration statuses to retrieve. Available statuses: accepted, pending, rejected. If not specified, no filtering by status will be applied. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that encounter errors during processing will be routed to this relationship |
| not.found | FlowFiles for which the specified Box file was not found |
| success | FlowFiles that have been successfully processed will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The id of the file |
| box.collaborations.<status>.users.ids | Comma-separated list of user collaborator IDs by status |
| box.collaborations.<status>.groups.ids | Comma-separated list of group collaborator IDs by status |
| box.collaborations.<status>.users.emails | Comma-separated list of user collaborator emails by status |
| box.collaborations.<status>.groups.emails | Comma-separated list of group collaborator emails by status |
| box.collaborations.<status>.<role>.users.ids | Comma-separated list of user collaborator IDs by status and role. Only present when both Roles and Statuses properties are set. |
| box.collaborations.<status>.<role>.users.logins | Comma-separated list of user collaborator logins by status and role. Only present when both Roles and Statuses properties are set. |
| box.collaborations.<status>.<role>.groups.ids | Comma-separated list of group collaborator IDs by status and role. Only present when both Roles and Statuses properties are set. |
| box.collaborations.<status>.<role>.groups.emails | Comma-separated list of group collaborator emails by status and role. Only present when both Roles and Statuses properties are set. |
| box.collaborations.count | Total number of collaborations on the file |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: GetBoxGroupMembers 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getboxgroupmembers.md
section: Loading & Unloading Data
---

# GetBoxGroupMembers 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Retrieves members for a Box Group and writes their details in FlowFile attributes.

## Tags

box, metadata, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Group ID | The ID of the Group to retrieve members for |

## Relationships

| Name | Description |
| --- | --- |
| failure | The FlowFile will be routed here when Group memberships retrieval was attempted but failed. |
| not.found | The FlowFile will be routed here when the Group was not found. |
| success | The FlowFile will be routed here after successfully retrieving Group members. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.group.user.ids | A comma-separated list of user IDs in the group. |
| box.group.user.logins | A comma-separated list of user Logins (emails) in the group. |
| error.code | An http error code returned by Box. |
| error.message | An error message returned by Box. |

---
title: GetConfluenceAuditRecords 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluenceauditrecords.md
section: Loading & Unloading Data
---

# GetConfluenceAuditRecords 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor listing Confluence audit records.

## Tags

Preview, atlassian, audit log, confluence

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Audit Log Fetch Limit | How many audit logs will be fetched from Confluence API in one request |
| Confluence Client Service | Controller service for managing connections to Confluence |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores last synchronization timestamp. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch Confluence audit records |
| original | The input Flow File is routed to the original relationship. |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence audit records |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.group.ids | List of identifiers of the Confluence groups. |
| confluence.page.names | List of the names of the Confluence page. |
| confluence.space.names | List of the Confluence spaces. |
| confluence.continue.fetching | Indicates whether there are more pages to fetch (true/false). |

---
title: GetConfluenceGroupUsers 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencegroupusers.md
section: Loading & Unloading Data
---

# GetConfluenceGroupUsers 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor that downloads information about users belonging to a given Confluence group

## Tags

Preview, atlassian, confluence, groups, users

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Confluence Group ID | Identifier of the Confluence Group |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch Confluence group users |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence group users |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.group.user.ids | Identifiers of the Confluence group users. |
| confluence.group.user.emails | Emails of the Confluence group users. |

---
title: GetConfluencePageContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencepagecontent.md
section: Loading & Unloading Data
---

# GetConfluencePageContent 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor downloading Confluence pages.

## Tags

Preview, atlassian, confluence, content, fetch, page

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Body Format | Format in which body of the Confluence Page will be fetched |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Confluence Page ID | Identifier of the Confluence Page |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch Confluence page |
| not found | Confluence page not found |
| removed | Confluence page was removed |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence page |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | text/html |
| confluence.page.version | Version of the Confluence page. |
| confluence.page.last.modification.date | Last modification date of the Confluence page. |
| confluence.page.change.type | Informs about status change for the searched page. |

---
title: GetConfluencePageIds 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencepageids.md
section: Loading & Unloading Data
---

# GetConfluencePageIds 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Downloads changed Confluence pages since the last sync and emits each as a FlowFile with metadata.

## Tags

Preview, atlassian, changes, confluence, fetch, pages

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Page IDs | Comma separated list of page IDs to filter page by; only pages with these IDs are returned |
| Space IDs | Comma separated list of space IDs to filter pages by; only pages from these spaces are returned |
| Start Date | Start date from which the ingestion should happen (format: yyyy-MM-dd, inclusive) |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores pagination state to maintain position between restarts. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch changed Confluence pages |
| original | The input Flow File is routed to the original relationship. |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched changed Confluence pages |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.page.id | Unique identifier of the Confluence page. |
| confluence.page.change.type | Informs about status change for the searched page. |
| confluence.page.url | Confluence page url. |
| confluence.page.title | Confluence page title. |
| confluence.page.last.modification.date | Last modification date of the Confluence page. |
| confluence.space.id | Unique identifier of the Confluence space. |
| confluence.continue.fetching | Indicates whether there are more pages to fetch (true/false). |

---
title: GetConfluencePagePermissions 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencepagepermissions.md
section: Loading & Unloading Data
---

# GetConfluencePagePermissions 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor downloading Confluence page permissions.

## Tags

Preview, atlassian, confluence, page, permissions

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Confluence Page ID | Identifier of the Confluence Page |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch and parse Confluence page permissions. |
| page not found | Confluence page not found |
| restrictions changed | Confluence page restrictions changed since last fetch |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence page permissions. |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.permissions.users | IDs of users with permissions to the Confluence page |
| confluence.permissions.emails | Emails of users with permissions to the Confluence page |
| confluence.permissions.groups | Groups with permissions to the Confluence page |

---
title: GetConfluenceSpaceIds 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencespaceids.md
section: Loading & Unloading Data
---

# GetConfluenceSpaceIds 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor for retrieving Confluence space ids.

## Tags

atlassian, confluence, preview, spaces

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Space Keys | Comma-separated list of space keys to filter. If not specified, all spaces will be retrieved. |

## Relationships

| Name | Description |
| --- | --- |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence spaces |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.space.ids | List of identifiers of the Confluence spaces. |

---
title: GetConfluenceSpacePermissions 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getconfluencespacepermissions.md
section: Loading & Unloading Data
---

# GetConfluenceSpacePermissions 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor downloading Confluence space permissions.

## Tags

Preview, atlassian, confluence, permissions, space

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |
| Confluence Space ID | Identifier of the Confluence Space. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Failed to fetch and parse Confluence space permissions. |
| retry | Retryable failure occurred, e.g. rate limiting |
| space not found | Confluence space not found |
| success | Successfully fetched Confluence space permissions. |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.permissions.users | IDs of users with permissions to the Confluence space |
| confluence.permissions.emails | Emails of users with permissions to the Confluence space |
| confluence.permissions.groups | Groups with permissions to the Confluence space |

---
title: GetDataShareCredentials 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getdatasharecredentials.md
section: Loading & Unloading Data
---

# GetDataShareCredentials 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Describe the specified data share metadata in Salesforce Data Cloud.

## Tags

daas, data cloud, describe, object, preview, salesforce, sfdc

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Calculated Insights Objects | Comma separated list of Calculated Insight Object names to describe. |
| Connection Pooling Service | The Connection Pooling Service that is used to create the Snowflake volumes holding the credentials. |
| Data Lake Objects | Comma separated list of Data Lake Object names to describe. |
| Data Model Objects | Comma separated list of Data Model Object names to describe. |
| Data Share Name | The name of the Data Share to describe. |
| Salesforce Data Cloud Client | Salesforce Data Cloud Client to interact with the APIs |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Provides information about the last time an external volume has been created/updated for credentials. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the data share credentials metadata could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the data share credentials cannot be retrieved or volumes cannot be created |
| success | FlowFile containing the data share metadata after successful creation of the volumes will be routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.ListSFDCDataShares](listsfdcdatashares.md)

---
title: GetDataShareTables 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getdatasharetables.md
section: Loading & Unloading Data
---

# GetDataShareTables 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Describe the specified data share metadata in Salesforce Data Cloud.

## Tags

daas, data cloud, describe, object, preview, salesforce, sfdc

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Calculated Insights Objects | Comma separated list of Calculated Insight Object names to describe. |
| Data Lake Objects | Comma separated list of Data Lake Object names to describe. |
| Data Model Objects | Comma separated list of Data Model Object names to describe. |
| Data Share Name | The name of the Data Share to describe. |
| Salesforce Data Cloud Client | Salesforce Data Cloud Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the data share tables metadata could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the data share tables metadata could not be retrieved |
| success | FlowFile containing the data share tables metadata will be routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.ListSFDCDataShares](listsfdcdatashares.md)

---
title: GetDBFSFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getdbfsfile.md
section: Loading & Unloading Data
---

# GetDBFSFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Read a DBFS file.

## Tags

databricks, dbfs, openflow

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| DBFS File Path | DBFS file path e.g. /directory/file.txt |
| Databricks Client | Databricks Client Service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: GetDynamoDB 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getdynamodb.md
section: Loading & Unloading Data
---

# GetDynamoDB 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves a document from DynamoDB based on hash and range key. The key can be string or number. For any get request all the primary keys are required (hash or hash and range based on the table keys).A Json Document ( ‘Map’) attribute of the DynamoDB item is read into the content of the FlowFile.

## Tags

AWS, Amazon, DynamoDB, Fetch, Get

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Batch items for each request (between 1 and 50) | The items to be retrieved in one batch |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Hash Key Name | The hash key name of the item |
| Hash Key Value | The hash key value of the item |
| Hash Key Value Type | The hash key value type of the item |
| Json Document attribute | The Json document to be retrieved from the dynamodb item ( ‘s’ type in the schema) |
| Range Key Name | The range key name of the item |
| Range Key Value |  |
| Range Key Value Type | The range key value type of the item |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Table Name | The DynamoDB table name |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| not found | FlowFiles are routed to not found relationship if key not found in the table |
| success | FlowFiles are routed to success relationship |
| unprocessed | FlowFiles are routed to unprocessed relationship when DynamoDB is not able to process all the items in the request. Typical reasons are insufficient table throughput capacity and exceeding the maximum bytes per request. Unprocessed FlowFiles can be retried with a new request. |

## Writes attributes

| Name | Description |
| --- | --- |
| dynamodb.key.error.unprocessed | DynamoDB unprocessed keys |
| dynmodb.range.key.value.error | DynamoDB range key error |
| dynamodb.key.error.not.found | DynamoDB key not found |
| dynamodb.error.exception.message | DynamoDB exception message |
| dynamodb.error.code | DynamoDB error code |
| dynamodb.error.message | DynamoDB error message |
| dynamodb.error.service | DynamoDB error service |
| dynamodb.error.retryable | DynamoDB error is retryable |
| dynamodb.error.request.id | DynamoDB error request id |
| dynamodb.error.status.code | DynamoDB status code |

## See also

* [org.apache.nifi.processors.aws.dynamodb.DeleteDynamoDB](deletedynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDB](putdynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDBRecord](putdynamodbrecord.md)

---
title: GetElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getelasticsearch.md
section: Loading & Unloading Data
---

# GetElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

Elasticsearch get processor that uses the official Elastic REST client libraries to fetch a single document from Elasticsearch by _id. Note that the full body of the document will be read into memory before being written to a FlowFile for transfer.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, index, json, put, record

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Name | The name of the FlowFile attribute to use for the retrieved document output. |
| Client Service | An Elasticsearch client service to use for running queries. |
| Destination | Indicates whether the retrieved document is written to the FlowFile content or a FlowFile attribute. |
| Document Id | The _id of the document to retrieve. |
| Index | The name of the index to use. |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| document | Fetched documents are routed to this relationship. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| not_found | A FlowFile is routed to this relationship if the specified document does not exist in the Elasticsearch cluster. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename attribute is set to the document identifier |
| elasticsearch.index | The Elasticsearch index containing the document |
| elasticsearch.type | The Elasticsearch document type |
| elasticsearch.get.error | The error message provided by Elasticsearch if there is an error fetching the document. |

## See also

* [org.apache.nifi.processors.elasticsearch.JsonQueryElasticsearch](jsonqueryelasticsearch.md)

---
title: GetFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getfile.md
section: Loading & Unloading Data
---

# GetFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Creates FlowFiles from files in a directory. NiFi will ignore files it doesn’t have at least read permissions for.

## Tags

files, filesystem, get, ingest, ingress, input, local, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The maximum number of files to pull in each invocation of the processor |
| File Filter | Only files whose names match the given regular expression will be picked up |
| Ignore Hidden Files | Indicates whether or not hidden files should be ignored |
| Input Directory | The input directory from which to pull files |
| Keep Source File | If true, the file is not deleted after it has been copied to the Content Repository; this causes the file to be picked up continually and is useful for testing purposes. If not keeping original NiFi will need write permissions on the directory it is pulling from otherwise it will ignore the file. |
| Maximum File Age | The maximum age that a file must be in order to be pulled; any file older than this amount of time (according to last modification date) will be ignored |
| Maximum File Size | The maximum size that a file can be in order to be pulled |
| Minimum File Age | The minimum age that a file must be in order to be pulled; any file younger than this amount of time (according to last modification date) will be ignored |
| Minimum File Size | The minimum size that a file must be in order to be pulled |
| Path Filter | When Recurse Subdirectories is true, then only subdirectories whose path matches the given regular expression will be scanned |
| Polling Interval | Indicates how long to wait before performing a directory listing |
| Recurse Subdirectories | Indicates whether or not to pull files from subdirectories |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |
| write filesystem | Provides operator the ability to delete any file that NiFi has access to. |

## Relationships

| Name | Description |
| --- | --- |
| success | All files are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename is set to the name of the file on disk |
| path | The path is set to the relative path of the file’s directory on disk. For example, if the <Input Directory> property is set to /tmp, files picked up from /tmp will have the path attribute set to ./. If the <Recurse Subdirectories> property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to abc/1/2/3 |
| file.creationTime | The date and time that the file was created. May not work on all file systems |
| file.lastModifiedTime | The date and time that the file was last modified. May not work on all file systems |
| file.lastAccessTime | The date and time that the file was last accessed. May not work on all file systems |
| file.owner | The owner of the file. May not work on all file systems |
| file.group | The group owner of the file. May not work on all file systems |
| file.permissions | The read/write/execute permissions of the file. May not work on all file systems |
| absolute.path | The full/absolute path from where a file was picked up. The current ‘path’ attribute is still populated, but may be a relative path |

## See also

* [org.apache.nifi.processors.standard.FetchFile](fetchfile.md)
* [org.apache.nifi.processors.standard.PutFile](putfile.md)

---
title: GetFileResource 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getfileresource.md
section: Loading & Unloading Data
---

# GetFileResource 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor creates FlowFiles with the content of the configured File Resource. GetFileResource is useful for load testing, configuration, and simulation.

## Tags

file, generate, load, test

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File Resource | Location of the File Resource (Local File or URL). This file will be used as content of the generated FlowFiles. |
| MIME Type | Specifies the value to set for the [mime.type] attribute. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |
| reference remote resources | File Resource can reference resources over HTTP/HTTPS |

## Relationships

| Name | Description |
| --- | --- |
| success |  |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the MIME type of the output if the ‘MIME Type’ property is set |
| Dynamic property key | Value for the corresponding dynamic property, if any is set |

---
title: GetFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getftp.md
section: Loading & Unloading Data
---

# GetFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Fetches files from an FTP Server and creates FlowFiles from them

## Tags

FTP, fetch, files, get, ingest, input, remote, retrieve, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Mode | The FTP Connection Mode |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Delete Original | Determines whether or not the file is deleted from the remote system after it has been successfully transferred |
| File Filter Regex | Provides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched |
| Follow Symbolic Links | If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Ignore Dotted Files | If true, files whose names begin with a dot (“.”) will be ignored |
| Internal Buffer Size | Set the internal buffer size for buffered data streams |
| Max Selects | The maximum number of files to pull in a single connection |
| Password | Password for the user account |
| Path Filter Regex | When Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned |
| Polling Interval | Determines how long to wait between fetching the listing for new files |
| Port | The port that the remote system is listening on for file transfers |
| Remote Path | The path on the remote system from which to pull or push files |
| Remote Poll Batch Size | The value specifies how many file paths to find in a given directory on the remote system when doing a file listing. This value in general should not need to be modified but when polling against a remote system with a tremendous number of files this value can be critical. Setting this value too high can result very poor performance and setting it too low can cause the flow to be slower than normal. |
| Search Recursively | If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories |
| Transfer Mode | The FTP Transfer Mode |
| Use Natural Ordering | If true, will pull files in the order in which they are naturally listed; otherwise, the order in which the files will be pulled is not defined |
| Username | Username |
| ftp-use-utf8 | Tells the client to use UTF-8 encoding when processing files and filenames. If set to true, the server must also support UTF-8 encoding. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename is set to the name of the file on the remote server |
| path | The path is set to the path of the file’s directory on the remote server. For example, if the <Remote Path> property is set to /tmp, files picked up from /tmp will have the path attribute set to /tmp. If the <Search Recursively> property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to /tmp/abc/1/2/3 |
| file.lastModifiedTime | The date and time that the source file was last modified |
| file.lastAccessTime | The date and time that the file was last accessed. May not work on all file systems |
| file.owner | The numeric owner id of the source file |
| file.group | The numeric group id of the source file |
| file.permissions | The read/write/execute permissions of the source file |
| absolute.path | The full/absolute path from where a file was picked up. The current ‘path’ attribute is still populated, but may be a relative path |

## See also

* [org.apache.nifi.processors.standard.PutFTP](putftp.md)

---
title: GetGcpVisionAnnotateFilesOperationStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getgcpvisionannotatefilesoperationstatus.md
section: Loading & Unloading Data
---

# GetGcpVisionAnnotateFilesOperationStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Retrieves the current status of an Google Vision operation.

## Tags

Cloud, Google, Machine Learning, Vision

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| operationKey | The unique identifier of the Vision operation. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | FlowFiles are routed to success relationship |

## See also

* [org.apache.nifi.processors.gcp.vision.StartGcpVisionAnnotateFilesOperation](startgcpvisionannotatefilesoperation.md)

---
title: GetGcpVisionAnnotateImagesOperationStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getgcpvisionannotateimagesoperationstatus.md
section: Loading & Unloading Data
---

# GetGcpVisionAnnotateImagesOperationStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Retrieves the current status of an Google Vision operation.

## Tags

Cloud, Google, Machine Learning, Vision

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| operationKey | The unique identifier of the Vision operation. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| running | The job is currently still being processed |
| success | FlowFiles are routed to success relationship |

## See also

* [org.apache.nifi.processors.gcp.vision.StartGcpVisionAnnotateImagesOperation](startgcpvisionannotateimagesoperation.md)

---
title: GetGoogleAdsReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getgoogleadsreport.md
section: Loading & Unloading Data
---

# GetGoogleAdsReport 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-ads-nar

## Description

A processor which can interact with Google Ads Reporting API. By default it fetch data once a day

## Tags

Google, Google Ads, report

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Client Account ID | ID of the Google Ads account for which the report should be fetched |
| GCP Credentials Service | Controller Service used to obtain Google Cloud Platform credentials. |
| Google Ads Resource Name | Name of the resource that should be used in ‘FROM’ clause of the query |
| Google Developer Token | Developer token required to access Google APIs |
| Report Attributes | List of comma-separated report attributes |
| Report Metrics | List of comma-separated report metrics |
| Report Segments | List of comma-separated report segments |
| Report Start Date | Start date from which the ingestion should happen. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores information about last report definition in form of hash to detect schema changes. In incremental ingestion (when the ‘segments.date’ segment is selected) it keeps track of latest ingested date to download only new data chunks. Additionally start date is saved. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Error FlowFiles transferred when receiving error response from Google Ads Reporting API or when an error occurred during response processing. |
| success | Response FlowFiles transferred when receiving success response from Google Ads Reporting API. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.ads.client.account.id | ID of the account in Google Ads for which given report should be ingested |
| google.ads.resource.name | Name of the resource in Google Ads that is a source for the report |
| google.ads.query | Query used to fetch data from Google Ads StreamSearch API |
| google.ads.attributes | Attributes of the selected resource |
| google.ads.metrics | Metrics collected in the context of a given resource |
| google.ads.segments | Buckets in which metrics should be grouped |
| google.ads.ingestion.strategy | The strategy used for ingestion. Can be ‘SNAPSHOT’ or ‘INCREMENTAL’ |
| google.ads.start.date | Date from which data is downloaded from Google Ads (including given date) |
| google.ads.end.date | Date to which data is downloaded from Google Ads (including given date) |
| google.ads.report.schema.changed | Flag meaning if the report schema has changed between processor executions |
| google.ads.report.conversion.window | Number of days which are fetched from Google Ads during incremental load. Based on Conversion Window values |
| fragment.identifier | A unique ID of each ingestion run. Allows to identify all flow files generated during a single run. |
| fragment.index | Number representing unique identifier in batch of flowfiles generated during one ingestion run |
| fragment.count | Amount of flowfiles generated during processor execution |
| avro.schema | Avro schema representing fetched data |
| mime.type | Mime type of the returned report. |

---
title: GetGoogleGroupMembers 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getgooglegroupmembers.md
section: Loading & Unloading Data
---

# GetGoogleGroupMembers 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Retrieves the members of one or more Google Groups, specified as a comma-separated list of group IDs that is given as a FlowFile attribute. Supports both immediate (top-level) and nested group member retrieval. Outputs four FlowFile attributes: ‘google.group.member.user.ids’, ‘google.group.member.user.emails’, ‘google.group.member.group.ids’, and ‘google.group.member.group.emails’. When nested fetching is enabled, it recursively expands sub-groups up to the specified depth. If an attribute already exists on the FlowFile, the new values are concatenated to the existing value (separated by a comma).

## Tags

cloud, directory, gcp, google, groups, membership

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Fetch Nested Groups | When enabled, recursively fetches members from nested groups within the specified groups. When disabled, only top-level members are retrieved. |
| GCP Credentials Service | Specifies the Controller Service used to obtain Google Cloud Platform credentials. |
| Google Group IDs | Specifies the comma-separated list of Google Group IDs (email addresses for the groups). Supports Expression Language. |
| Nested Depth Limit | Maximum depth to traverse when fetching nested group members. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed here if the processor fails to retrieve Google group members. |
| not.found | A FlowFile is routed here if for each Google group that was not found. |
| retry | A FlowFile is routed here if the processor should retry the request (e.g., after rate limiting). |
| success | A FlowFile is routed here after successfully retrieving Google group members. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.group.ids | A comma-separated list of Google Group IDs that were found. |
| google.group.member.user.ids | A comma-separated list of user IDs found in the specified groups. When nested fetching is enabled, includes users from nested groups up to the specified depth. |
| google.group.member.user.emails | A comma-separated list of user email addresses found in the specified groups. When nested fetching is enabled, includes users from nested groups up to the specified depth. |
| google.group.member.group.ids | A comma-separated list of nested group IDs found in the specified groups. When nested fetching is enabled, includes all groups discovered during recursive traversal. |
| google.group.member.group.emails | A comma-separated list of nested group email addresses found in the specified groups. When nested fetching is enabled, includes all groups discovered during recursive traversal. |

## See also

* [com.snowflake.openflow.runtime.processors.google.CaptureGoogleDriveChanges](capturegoogledrivechanges.md)

---
title: GetGoogleSheets 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getgooglesheets.md
section: Loading & Unloading Data
---

# GetGoogleSheets 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-sheets-processors-nar

## Description

Processor responsible for fetching data from Google Sheets. By default it fetches data once a day.

## Tags

Google, Google Sheets, spreadsheet

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Date Time Render Option | Determines how dates should be rendered in the output. |
| GCP Credentials Service | Controller Service used to obtain Google Cloud Platform credentials. |
| Ranges | The A1 notation or R1C1 notation of the comma-separated ranges to retrieve values from. For example: Sheet1!A1:B2,Sheet2!D4:E5,Sheet3. The first row in a sheet must represent column names. If not specified, all sheets will be downloaded. |
| Spreadsheet ID | ID of the Google Sheets Spreadsheet. Can be found in the URL of the spreadsheet. |
| Value Render Option | Determines how values should be rendered in the output. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFile with errors occurred while fetching from Google Sheets. |
| success | FlowFile containing a JSON array where each object represents a row from the source sheet. Keys correspond to column headers from the first row, and values to the respective row entries. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.sheets.spreadsheet.id | ID of the Google Sheets Spreadsheet. |
| google.sheets.range | Range in Google Sheets Spreadsheet that was fetched. |
| run.id | A unique ID of each ingestion run. Allows to identify all flow files generated during a single run. |
| destination.table.schema | A Snowflake schema of the destination table in the following format: { “columns”: [ { “name”: “<column name>”, “type”: “<column type>”, “nullable”: <true/false>, “precision”: <precision, only for numeric type>, “scale”: <scale, only for numeric type> }, … ], “primaryKeys”: [“<name of first primary key column>”, “<name of second primary key column>”, …] } |

---
title: GetHubSpot 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/gethubspot.md
section: Loading & Unloading Data
---

# GetHubSpot 2025.10.9.21

## Bundle

org.apache.nifi | nifi-hubspot-nar

## Description

Retrieves JSON data from a private HubSpot application. This processor is intended to be run on the Primary Node only.

## Tags

hubspot

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| access-token | Access Token to authenticate requests |
| incremental-delay | The ending timestamp of the time window will be adjusted earlier by the amount configured in this property. For example, with a property value of 10 seconds, an ending timestamp of 12:30:45 would be changed to 12:30:35. Set this property to avoid missing objects when the clock of your local machines and HubSpot servers ‘clock are not in sync and to protect against HubSpot’s mechanism that changes last updated timestamps after object creation. |
| incremental-initial-start-time | This property specifies the start time that the processor applies when running the first request. The expected format is a UTC date-time such as ‘2011-12-03T10:15:30Z’ |
| is-incremental | The processor can incrementally load the queried objects so that each object is queried exactly once. For each query, the processor queries objects within a time window where the objects were modified between the previous run time and the current time (optionally adjusted by the Incremental Delay property). |
| object-type | The HubSpot Object Type requested |
| result-limit | The maximum number of results to request for each invocation of the Processor |
| web-client-service-provider | Controller service for HTTP client operations |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | In case of incremental loading, the start and end timestamps of the last query time window are stored in the state. When the ‘Result Limit’ property is set, the paging cursor is saved after executing a request. Only the objects after the paging cursor will be retrieved. The maximum number of retrieved objects can be set in the ‘Result Limit’ property. |

## Relationships

| Name | Description |
| --- | --- |
| success | For FlowFiles created as a result of a successful HTTP request. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the MIME type to application/json |

---
title: GetHubSpotObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/gethubspotobject.md
section: Loading & Unloading Data
---

# GetHubSpotObject 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-hubspot-processors-nar

## Description

Get a HubSpot object and its associations by ID or unique value.

## Tags

Preview, hubspot

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| HubSpot Service | HubSpot Client Service. |
| Object ID Property | HubSpot property used to uniquely identify the object. |
| Object ID Value | Matching HubSpot property value to search for. |
| Object Type | HubSpot object type |

## Relationships

| Name | Description |
| --- | --- |
| failure | HubSpot fail relationship |
| missing | HubSpot object does not exist. |
| retry | HubSpot retry relationship. FlowFiles that failed to process due to a server timeout or rate limit related error. FlowFiles routed here should be routed back into the processor. |
| success | HubSpot success relationship |

## See also

* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotSchema](gethubspotschema.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListArchivedHubSpotData](listarchivedhubspotdata.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListHubSpotObjects](listhubspotobjects.md)
* [com.snowflake.openflow.runtime.processors.hubspot.PutHubSpot](puthubspot.md)

---
title: GetHubSpotSchema 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/gethubspotschema.md
section: Loading & Unloading Data
---

# GetHubSpotSchema 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-hubspot-processors-nar

## Description

Retrieves schema information for HubSpot object types including field names, types, and labels. Outputs detailed field metadata as JSON for schema discovery and mapping purposes.

## Tags

Preview, crm, hubspot, metadata, schema

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| HubSpot Service | HubSpot Client Service. |
| Object Type | HubSpot object type |

## Relationships

| Name | Description |
| --- | --- |
| failure | HubSpot fail relationship |
| retry | HubSpot retry relationship. FlowFiles that failed to process due to a server timeout or rate limit related error. FlowFiles routed here should be routed back into the processor. |
| success | HubSpot success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| hubspot.object.type | The HubSpot object type |
| hubspot.field.count | Number of fields retrieved |
| mime.type | MIME type of the output (application/json) |

## See also

* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotObject](gethubspotobject.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListArchivedHubSpotData](listarchivedhubspotdata.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListHubSpotObjects](listhubspotobjects.md)
* [com.snowflake.openflow.runtime.processors.hubspot.PutHubSpot](puthubspot.md)

---
title: GetLinkedInAdsReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getlinkedinadsreport.md
section: Loading & Unloading Data
---

# GetLinkedInAdsReport 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-linkedin-ads-processors-nar

## Description

Processor downloading metrics from the LinkedIn Reporting APIs.

## Tags

LinkedIn, LinkedIn Ads, ads, report

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Accounts | List of comma-separated accounts. |
| Campaign Groups | List of comma-separated campaign groups. |
| Campaigns | List of comma-separated campaigns. |
| Companies | List of comma-separated companies. |
| Conversion Window | Timeframe for which data is refreshed during incremental load. |
| Metrics | List of comma-separated metrics. |
| OAuth Token Provider | Service providing OAuth access token. |
| Pivots | List of comma-separated pivots. |
| Report Name | Unique name of the report. |
| Shares | List of comma-separated shares. |
| Start Date | Start date from which ingestion should begin. It must be in the yyyy-MM-dd format. |
| Time Granularity | Time granularity of results. |
| Web Client Service Provider | Service providing client for REST request execution. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Stores information about last report definition in form of hash to detect schema changes. Incrementally loaded reports persist last ingestion date to define ingestion date ranges after initial load. Additionally start date is saved. |

## Relationships

| Name | Description |
| --- | --- |
| success | Response FlowFiles transferred when successfully processed a response from the LinkedIn Ads Reporting API. |

## Writes attributes

| Name | Description |
| --- | --- |
| linkedin.ads.report.name | Unique name of the report. |
| linkedin.ads.run.id | Unique identifier of the run. |
| avro.schema | Avro schema that contains a set of all configured metrics and pivots. |
| linkedin.ads.ingestion.strategy | Strategy that defines whether the report will be downloaded as SNAPSHOT or INCREMENTAL. |
| linkedin.ads.report.schema.changed | Flag that indicates whether the report schema has changed between processor executions. |
| linkedin.ads.ingestion.start.date | Date from which data is downloaded from LinkedIn Ads (including a given date). |
| linkedin.ads.ingestion.end.date | Date to which data is downloaded from LinkedIn Ads (including a given date). |

---
title: GetMicrosoft365GroupMembers 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getmicrosoft365groupmembers.md
section: Loading & Unloading Data
---

# GetMicrosoft365GroupMembers 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Retrieves Microsoft365 group members and emits a FlowFile for each change that occurs. This includes membership changes.

## Tags

cdc, document, graph, library, microsoft, sharepoint, unstructured

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API |
| Fallback Retry Duration | The time to wait before retrying the operation after a communication failure. This value is used when the response doesn’t contain a Retry-After header. |
| Microsoft365 Group id | Specifies a Microsoft365 group id to retrieve the members for. Supports Expression Language. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed here if the processor failed to communicate with the Graph API. Can be retried |
| failure | An incoming FlowFile is routed to this relationship if the group members could not be fetched |
| not.found | A FlowFile is routed here if the group was not found |
| success | A FlowFile is routed here if the group members were successfully retrieved |

## Writes attributes

| Name | Description |
| --- | --- |
| microsoft365.group.user.ids | A comma-separated list of Microsoft365 user ids that are members of the Microsoft365 group. |
| microsoft365.group.user.emails | A comma-separated list of user emails that are members of the Microsoft365 group. |

---
title: GetMongo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getmongo.md
section: Loading & Unloading Data
---

# GetMongo 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Creates FlowFiles from documents in MongoDB loaded by a user-specified query.

## Tags

get, mongodb, read

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of elements to be returned from the server in one batch |
| Limit | The maximum number of elements to return |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| Projection | The fields to be returned from the documents in the result set; must be a valid BSON document |
| Query | The selection criteria to do the lookup. If the field is left blank, it will look for input from an incoming connection from another processor to provide the query as a valid JSON document inside of the FlowFile’s body. If this field is left blank and a timer is enabled instead of an incoming connection, that will result in a full collection fetch using a “{}” query. |
| Sort | The fields by which to sort; must be a valid BSON document |
| get-mongo-send-empty | If a query executes successfully, but returns no results, send an empty JSON document signifying no result. |
| json-type | By default, MongoDB’s Java driver returns “extended JSON”. Some of the features of this variant of JSON may cause problems for other JSON parsers that expect only standard JSON types and conventions. This configuration setting controls whether to use extended JSON or provide a clean view that conforms to standard JSON. |
| mongo-charset | Specifies the character set of the document data. |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |
| mongo-date-format | The date format string to use for formatting Date fields that are returned from Mongo. It is only applied when the JSON output format is set to Standard JSON. |
| mongo-query-attribute | If set, the query will be written to a specified attribute on the output flowfiles. |
| results-per-flowfile | How many results to put into a FlowFile at once. The whole body will be treated as a JSON array of results. |
| use-pretty-printing | Choose whether or not to pretty print the JSON from the results of the query. Choosing ‘True’ can greatly increase the space requirements on disk depending on the complexity of the JSON document |

## Relationships

| Name | Description |
| --- | --- |
| failure | All input FlowFiles that are part of a failed query execution go here. |
| original | All input FlowFiles that are part of a successful query execution go here. |
| success | All FlowFiles that have the results of a successful query execution go here. |

## Writes attributes

| Name | Description |
| --- | --- |
| mongo.database.name | The database where the results came from. |
| mongo.collection.name | The collection where the results came from. |

---
title: GetMongoRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getmongorecord.md
section: Loading & Unloading Data
---

# GetMongoRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

A record-based version of GetMongo that uses the Record writers to write the MongoDB result set.

## Tags

fetch, get, json, mongo, mongodb, record

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of elements to be returned from the server in one batch |
| Limit | The maximum number of elements to return |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| Projection | The fields to be returned from the documents in the result set; must be a valid BSON document |
| Query | The selection criteria to do the lookup. If the field is left blank, it will look for input from an incoming connection from another processor to provide the query as a valid JSON document inside of the FlowFile’s body. If this field is left blank and a timer is enabled instead of an incoming connection, that will result in a full collection fetch using a “{}” query. |
| Sort | The fields by which to sort; must be a valid BSON document |
| get-mongo-record-writer-factory | The record writer to use to write the result sets. |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |
| mongo-query-attribute | If set, the query will be written to a specified attribute on the output flowfiles. |
| mongodb-schema-name | The name of the schema in the configured schema registry to use for the query results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All input FlowFiles that are part of a failed query execution go here. |
| original | All input FlowFiles that are part of a successful query execution go here. |
| success | All FlowFiles that have the results of a successful query execution go here. |

## Writes attributes

| Name | Description |
| --- | --- |
| mongo.database.name | The database where the results came from. |
| mongo.collection.name | The collection where the results came from. |

---
title: GetQueryJobResult 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getqueryjobresult.md
section: Loading & Unloading Data
---

# GetQueryJobResult 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Gets the results of a Query Job in Salesforce using the Bulk API 2.0. The output is CSV and GZIP compression is used.

## Tags

bulk, job, preview, query, salesforce

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Job ID | The ID of the job for which the status is checked. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the Query Job result could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the Query Job Results could not be retrieved |
| success | If Query Job Results have been successfully retrieved, the FlowFile is routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.AbortQueryJob](abortqueryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobStatus](getqueryjobstatus.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: GetQueryJobStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getqueryjobstatus.md
section: Loading & Unloading Data
---

# GetQueryJobStatus 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Gets the status of a Query Job in Salesforce using the Bulk API 2.0.

## Tags

bulk, job, preview, query, salesforce, status

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Job ID | The ID of the job for which the status is checked. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed to this relationship if the Query Job status could not be retrieved but the operation might be retried |
| failure | A FlowFile is routed to this relationship if the Query Job status could not be retrieved |
| job.aborted | If the Query Job has been aborted, the FlowFile is routed to this relationship |
| job.completed | If the Query Job completed, the FlowFile is routed to this relationship |
| job.failed | If the Query Job failed, the FlowFile is routed to this relationship |
| wait | If the Query Job is in the processing queue or in progress, the FlowFile is routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| jobState | The current state of processing for the job. |
| systemModstamp | The UTC date and time when the API last updated the job information. |
| numberRecordsProcessed | The number of records processed in this job. |
| retries | The number of times that Salesforce attempted to save the results of an operation. Repeated attempts indicate a problem such as a lock contention. |
| totalProcessingTime | The number of milliseconds taken to process the job. |
| isPkChunkingSupported | Whether PK chunking is supported for the queried object (true), or isn’t supported (false). |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.AbortQueryJob](abortqueryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: GetS3ObjectMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/gets3objectmetadata.md
section: Loading & Unloading Data
---

# GetS3ObjectMetadata 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Check for the existence of an Object in S3 and fetch its Metadata without attempting to download it. This processor can be used as a router for workflows that need to check on an Object in S3 before proceeding with data processing

## Tags

AWS, Amazon, Archive, Exists, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| FullControl User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Full Control for an object |
| Metadata Attribute Include Pattern | A regular expression pattern to use for determining which object metadata entries are included as FlowFile attributes. This pattern is only applied to the ‘found’ relationship and will not be used to filter the error attributes in the ‘failure’ relationship. |
| Metadata Target | This determines where the metadata will be written when found. |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Owner | The Amazon ID to use for the object’s owner |
| Read ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to read the Access Control List for an object |
| Read Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Read Access for an object |
| Region | The AWS Region to connect to. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Version | The Version of the Object for which to retrieve Metadata |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| found | An object was found in the bucket at the supplied key |
| not found | No object was found in the bucket the supplied key |

## See also

* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: GetS3ObjectTags 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/gets3objecttags.md
section: Loading & Unloading Data
---

# GetS3ObjectTags 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Check for the existence of an Object in S3 and fetch its Tags without attempting to download it. This processor can be used as a router for workflows that need to check on an Object in S3 before proceeding with data processing

## Tags

AWS, Amazon, Archive, Exists, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| FullControl User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Full Control for an object |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Owner | The Amazon ID to use for the object’s owner |
| Read ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to read the Access Control List for an object |
| Read Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Read Access for an object |
| Region | The AWS Region to connect to. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Tag Attribute Include Pattern | A regular expression pattern to use for determining which object tags are included as FlowFile attributes. This pattern is only applied to the ‘found’ relationship and will not be used to filter the error attributes in the ‘failure’ relationship. |
| Tags Target | This determines where the tags will be written when found. |
| Version | The Version of the Object for which to retrieve Tags |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| found | An object was found in the bucket at the supplied key |
| not found | No object was found in the bucket the supplied key |

## See also

* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: GetSFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getsftp.md
section: Loading & Unloading Data
---

# GetSFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Fetches files from an SFTP Server and creates FlowFiles from them

## Tags

fetch, files, get, ingest, input, remote, retrieve, sftp, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Algorithm Negotiation | Configuration strategy for SSH algorithm negotiation |
| Ciphers Allowed | A comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, aes128-cbc, aes128-ctr, [aes128-gcm@openssh.com](mailto:aes128-gcm%40openssh.com), aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, [aes256-gcm@openssh.com](mailto:aes256-gcm%40openssh.com), arcfour128, arcfour256, blowfish-cbc, [chacha20-poly1305@openssh.com](mailto:chacha20-poly1305%40openssh.com), none |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Delete Original | Determines whether or not the file is deleted from the remote system after it has been successfully transferred |
| File Filter Regex | Provides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched |
| Follow Symbolic Links | If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories |
| Host Key File | If supplied, the given file will be used as the Host Key; otherwise, if ‘Strict Host Key Checking’ property is applied (set to true) then uses the ‘known_hosts’ and ‘known_hosts2’ files from ~/.ssh directory else no host key file will be used |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Ignore Dotted Files | If true, files whose names begin with a dot (“.”) will be ignored |
| Key Algorithms Allowed | A comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, [ecdsa-sha2-nistp256-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp256-cert-v01%40openssh.com), ecdsa-sha2-nistp384, [ecdsa-sha2-nistp384-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp384-cert-v01%40openssh.com), ecdsa-sha2-nistp521, [ecdsa-sha2-nistp521-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp521-cert-v01%40openssh.com), rsa-sha2-256, [rsa-sha2-256-cert-v01@openssh.com](mailto:rsa-sha2-256-cert-v01%40openssh.com), rsa-sha2-512, [rsa-sha2-512-cert-v01@openssh.com](mailto:rsa-sha2-512-cert-v01%40openssh.com), [sk-ecdsa-sha2-nistp256@openssh.com](mailto:sk-ecdsa-sha2-nistp256%40openssh.com), [sk-ssh-ed25519@openssh.com](mailto:sk-ssh-ed25519%40openssh.com), ssh-dss, [ssh-dss-cert-v01@openssh.com](mailto:ssh-dss-cert-v01%40openssh.com), ssh-ed25519, [ssh-ed25519-cert-v01@openssh.com](mailto:ssh-ed25519-cert-v01%40openssh.com), ssh-rsa, [ssh-rsa-cert-v01@openssh.com](mailto:ssh-rsa-cert-v01%40openssh.com) |
| Key Exchange Algorithms Allowed | A comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, [curve25519-sha256@libssh.org](mailto:curve25519-sha256%40libssh.org), curve448-sha512, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group15-sha512, diffie-hellman-group16-sha512, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, mlkem1024nistp384-sha384, mlkem768nistp256-sha256, mlkem768x25519-sha256, sntrup761x25519-sha512, [sntrup761x25519-sha512@openssh.com](mailto:sntrup761x25519-sha512%40openssh.com) |
| Max Selects | The maximum number of files to pull in a single connection |
| Message Authentication Codes Allowed | A comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-sha1, hmac-sha1-96, [hmac-sha1-etm@openssh.com](mailto:hmac-sha1-etm%40openssh.com), hmac-sha2-256, [hmac-sha2-256-etm@openssh.com](mailto:hmac-sha2-256-etm%40openssh.com), hmac-sha2-512, [hmac-sha2-512-etm@openssh.com](mailto:hmac-sha2-512-etm%40openssh.com) |
| Password | Password for the user account |
| Path Filter Regex | When Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned |
| Polling Interval | Determines how long to wait between fetching the listing for new files |
| Port | The port that the remote system is listening on for file transfers |
| Private Key Passphrase | Password for the private key |
| Private Key Path | The fully qualified path to the Private Key file |
| Remote Path | The path on the remote system from which to pull or push files |
| Remote Poll Batch Size | The value specifies how many file paths to find in a given directory on the remote system when doing a file listing. This value in general should not need to be modified but when polling against a remote system with a tremendous number of files this value can be critical. Setting this value too high can result very poor performance and setting it too low can cause the flow to be slower than normal. |
| Search Recursively | If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories |
| Send Keep Alive On Timeout | Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds. |
| Strict Host Key Checking | Indicates whether or not strict enforcement of hosts keys should be applied |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Use Natural Ordering | If true, will pull files in the order in which they are naturally listed; otherwise, the order in which the files will be pulled is not defined |
| Username | Username |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename is set to the name of the file on the remote server |
| path | The path is set to the path of the file’s directory on the remote server. For example, if the <Remote Path> property is set to /tmp, files picked up from /tmp will have the path attribute set to /tmp. If the <Search Recursively> property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to /tmp/abc/1/2/3 |
| file.lastModifiedTime | The date and time that the source file was last modified |
| file.owner | The numeric owner id of the source file |
| file.group | The numeric group id of the source file |
| file.permissions | The read/write/execute permissions of the source file |
| absolute.path | The full/absolute path from where a file was picked up. The current ‘path’ attribute is still populated, but may be a relative path |

## See also

* [org.apache.nifi.processors.standard.PutSFTP](putsftp.md)

---
title: GetSharepointSiteGroupMembers 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getsharepointsitegroupmembers.md
section: Loading & Unloading Data
---

# GetSharepointSiteGroupMembers 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-sharepoint-rest-nar

## Description

Retrieves all members of a SharePoint site group.

## Tags

groups, membership, microsoft, openflow, sharepoint

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Group ID | The ID of the SharePoint group. |
| OAuth2 Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token. |
| Site URL | The URL of the SharePoint site. |
| Web Client Service | The Web Client Service to use for communicating with Sharepoint. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | A FlowFile is routed here if the processor failed to communicate with Sharepoint. Can be retried |
| failure | A FlowFile is routed here if the group members could not be fetched |
| success | A FlowFile is routed here if the group members were successfully retrieved |

## Writes attributes

| Name | Description |
| --- | --- |
| sharepoint.group.user.ids | The IDs of the users in the SharePoint site group. |
| sharepoint.group.user.emails | The emails of the users in the SharePoint site group. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.rest.ListSharepointSiteGroups](listsharepointsitegroups.md)

---
title: GetShopify 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getshopify.md
section: Loading & Unloading Data
---

# GetShopify 2025.10.9.21

## Bundle

org.apache.nifi | nifi-shopify-nar

## Description

Retrieves objects from a custom Shopify store. The processor yield time must be set to the account’s rate limit accordingly.

## Tags

shopify

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| CUSTOMERS | Customer resource to query |
| DISCOUNTS | Discount resource to query |
| INVENTORY | Inventory resource to query |
| ONLINE_STORE | Online Store resource to query |
| ORDERS | Order resource to query |
| PRODUCT | Product resource to query |
| SALES_CHANNELS | Sales Channel resource to query |
| STORE_PROPERTIES | Store Property resource to query |
| access-token | Access Token to authenticate requests |
| api-version | The Shopify REST API version |
| incremental-delay | The ending timestamp of the time window will be adjusted earlier by the amount configured in this property. For example, with a property value of 10 seconds, an ending timestamp of 12:30:45 would be changed to 12:30:35. Set this property to avoid missing objects when the clock of your local machines and Shopify servers’ clock are not in sync. |
| incremental-initial-start-time | This property specifies the start time when running the first request. Represents an ISO 8601-encoded date and time string. For example, 3:50 pm on September 7, 2019 in the time zone of UTC (Coordinated Universal Time) is represented as “2019-09-07T15:50:00Z”. |
| is-incremental | The processor can incrementally load the queried objects so that each object is queried exactly once. For each query, the processor queries objects which were created or modified after the previous run time but before the current time. |
| object-category | Shopify object category |
| result-limit | The maximum number of results to request for each invocation of the Processor |
| store-domain | The domain of the Shopify store, e.g. nifistore.myshopify.com |
| web-client-service-provider | Controller service for HTTP client operations |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | For a few resources the processor supports incremental loading. The list of the resources with the supported parameters can be found in the additional details. |

## Relationships

| Name | Description |
| --- | --- |
| success | For FlowFiles created as a result of a successful query. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the MIME type to application/json |

---
title: GetSmbFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getsmbfile.md
section: Loading & Unloading Data
---

# GetSmbFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-smb-nar

## Description

Reads file from a samba network location to FlowFiles. Use this processor instead of a cifs mounts if share access control is important. Configure the Hostname, Share and Directory accordingly: \[Hostname][Share][pathtoDirectory]

## Tags

samba, smb, cifs, files, get

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The maximum number of files to pull in each iteration |
| Directory | The network folder to which files should be written. This is the remaining relative path after the share: \hostnameshare[dir1dir2]. |
| Domain | The domain used for authentication. Optional, in most cases username and password is sufficient. |
| File Filter | Only files whose names match the given regular expression will be picked up |
| Hostname | The network host to which files should be written. |
| Ignore Hidden Files | Indicates whether or not hidden files should be ignored |
| Keep Source File | If true, the file is not deleted after it has been copied to the Content Repository; this causes the file to be picked up continually and is useful for testing purposes. If not keeping original NiFi will need write permissions on the directory it is pulling from otherwise it will ignore the file. |
| Password | The password used for authentication. Required if Username is set. |
| Path Filter | When Recurse Subdirectories is true, then only subdirectories whose path matches the given regular expression will be scanned |
| Polling Interval | Indicates how long to wait before performing a directory listing |
| Recurse Subdirectories | Indicates whether or not to pull files from subdirectories |
| Share | The network share to which files should be written. This is the “first folder”after the hostname: \hostname[share]dir1dir2 |
| Share Access Strategy | Indicates which shared access are granted on the file during the read. None is the most restrictive, but the safest setting to prevent corruption. |
| Username | The username used for authentication. If no username is set then anonymous authentication is attempted. |
| enable-dfs | Enables accessing Distributed File System (DFS) and following DFS links during SMB operations. |
| smb-dialect | The SMB dialect is negotiated between the client and the server by default to the highest common version supported by both end. In some rare cases, the client-server communication may fail with the automatically negotiated dialect. This property can be used to set the dialect explicitly (e.g. to downgrade to a lower version), when those situations would occur. |
| timeout | Timeout for read and write operations. |
| use-encryption | Turns on/off encrypted communication between the client and the server. The property’s behavior is SMB dialect dependent: SMB 2.x does not support encryption and the property has no effect. In case of SMB 3.x, it is a hint/request to the server to turn encryption on if the server also supports it. |

## Relationships

| Name | Description |
| --- | --- |
| success | All files are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The filename is set to the name of the file on the network share |
| path | The path is set to the relative path of the file’s network share name. For example, if the input is set to \hostnamesharetmp, files picked up from tmp will have the path attribute set to tmp |
| file.creationTime | The date and time that the file was created. May not work on all file systems |
| file.lastModifiedTime | The date and time that the file was last modified. May not work on all file systems |
| file.lastAccessTime | The date and time that the file was last accessed. May not work on all file systems |
| absolute.path | The full path from where a file was picked up. This includes the hostname and the share name |

## See also

* [org.apache.nifi.processors.smb.FetchSmb](fetchsmb.md)
* [org.apache.nifi.processors.smb.ListSmb](listsmb.md)
* [org.apache.nifi.processors.smb.PutSmbFile](putsmbfile.md)

---
title: GetSplunk 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getsplunk.md
section: Loading & Unloading Data
---

# GetSplunk 2025.10.9.21

## Bundle

org.apache.nifi | nifi-splunk-nar

## Description

Retrieves data from Splunk Enterprise.

## Tags

get, logs, splunk

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| API Version | Select which version of the Splunk Search API to use for search operations. Version 2 is recommended for newer Splunk instances. |
| Application | The Splunk Application to query. |
| Connection Timeout | Max wait time for connection to the Splunk server. |
| Earliest Time | The value to use for the earliest time when querying. Only used with a Time Range Strategy of Provided. See Splunk’s documentation on Search Time Modifiers for guidance in populating this field. |
| Hostname | The ip address or hostname of the Splunk server. |
| Latest Time | The value to use for the latest time when querying. Only used with a Time Range Strategy of Provided. See Splunk’s documentation on Search Time Modifiers for guidance in populating this field. |
| Output Mode | The output mode for the results. |
| Owner | The owner to pass to Splunk. |
| Password | The password to authenticate to Splunk. |
| Port | The port of the Splunk server. |
| Query | The query to execute. Typically beginning with a <search> command followed by a search clause, such as <search source=”<tcp:7689>”> to search for messages received on TCP port 7689. |
| Read Timeout | Max wait time for response from the Splunk server. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Scheme | The scheme for connecting to Splunk. |
| Security Protocol | The security protocol to use for communicating with Splunk. |
| Time Field Strategy | Indicates whether to search by the time attached to the event, or by the time the event was indexed in Splunk. |
| Time Range Strategy | Indicates how to apply time ranges to each execution of the query. Selecting a managed option allows the processor to apply a time range from the last execution time to the current execution time. When using <Managed from Beginning>, an earliest time will not be applied on the first execution, and thus all records searched. When using <Managed from Current> the earliest time of the first execution will be the initial execution time. When using <Provided>, the time range will come from the Earliest Time and Latest Time properties, or no time range will be applied if these properties are left blank. |
| Time Zone | The Time Zone to use for formatting dates when performing a search. Only used with Managed time strategies. |
| Token | The token to pass to Splunk. |
| Username | The username to authenticate to Splunk. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | If using one of the managed Time Range Strategies, this processor will store the values of the latest and earliest times from the previous execution so that the next execution of the can pick up where the last execution left off. The state will be cleared and start over if the query is changed. |

## Relationships

| Name | Description |
| --- | --- |
| success | Results retrieved from Splunk are sent out this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| splunk.query | The query that performed to produce the FlowFile. |
| splunk.earliest.time | The value of the earliest time that was used when performing the query. |
| splunk.latest.time | The value of the latest time that was used when performing the query. |

---
title: GetSQS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getsqs.md
section: Loading & Unloading Data
---

# GetSQS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Fetches messages from an Amazon Simple Queuing Service Queue

## Tags

AWS, Amazon, Fetch, Get, Poll, Queue, SQS

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Auto Delete Messages | Specifies whether the messages should be automatically deleted by the processors once they have been received. |
| Batch Size | The maximum number of messages to send in a single network request |
| Character Set | The Character Set that should be used to encode the textual content of the SQS message |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Queue URL | The URL of the queue to get messages from |
| Receive Message Wait Time | The maximum amount of time to wait on a long polling receive call. Setting this to a value of 1 second or greater will reduce the number of SQS requests and decrease fetch latency at the cost of a constantly active thread. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Visibility Timeout | The amount of time after a message is received but not deleted that the message is hidden from other consumers |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| hash.value | The MD5 sum of the message |
| hash.algorithm | MD5 |
| sqs.message.id | The unique identifier of the SQS message |
| sqs.receipt.handle | The SQS Receipt Handle that is to be used to delete the message from the queue |

## See also

* [org.apache.nifi.processors.aws.sqs.DeleteSQS](deletesqs.md)
* [org.apache.nifi.processors.aws.sqs.PutSQS](putsqs.md)

---
title: GetUnityCatalogFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getunitycatalogfile.md
section: Loading & Unloading Data
---

# GetUnityCatalogFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Read a Unity Catalog file up to 5 GiB.

## Tags

databricks, openflow, unity catalog

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Unity Catalog File Path | Unity Catalog file path e.g. /Volumes/catalog/schema/volume_name/file.txt |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: GetUnityCatalogFileMetadata 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getunitycatalogfilemetadata.md
section: Loading & Unloading Data
---

# GetUnityCatalogFileMetadata 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Checks for Unity Catalog file metadata.

## Tags

databricks, openflow, unity catalog

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Unity Catalog File Path | Unity Catalog file path e.g. /Volumes/catalog/schema/volume_name/file.txt |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| not.found | The original FlowFile is transferred to this relationship if no Unity Catalog can be found at the specified path |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The content type of the checked file. |
| uc.size | The size of the Unity Catalog file. |
| uc.lastModifiedTime | The last modified time of the Unity Catalog file in milliseconds since epoch in UTC time. |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: GetWorkdayReport 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getworkdayreport.md
section: Loading & Unloading Data
---

# GetWorkdayReport 2025.10.9.21

## Bundle

org.apache.nifi | nifi-workday-processors-nar

## Description

A processor which can interact with a configurable Workday Report. The processor can forward the content without modification, or you can transform it by providing the specific Record Reader and Record Writer services based on your needs. You can also remove fields by defining schema in the Record Writer. Supported Workday report formats are: csv, simplexml, json

## Tags

Workday, report

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token. |
| Authorization Type | The type of authorization for retrieving data from Workday resources. |
| Web Client Service Provider | Web client which is used to communicate with the Workday API. |
| Workday Password | The password provided for authentication of Workday requests. Encoded using Base64 for HTTP Basic Authentication as described in RFC 7617. |
| Workday Report URL | HTTP remote URL of Workday report including a scheme of http or https, as well as a hostname or IP address with optional port and path elements. |
| Workday Username | The username provided for authentication of Workday requests. Encoded using Base64 for HTTP Basic Authentication as described in RFC 7617. |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema. |
| record-writer | The Record Writer to use for serializing Records to an output FlowFile. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Request FlowFiles transferred when receiving socket communication errors. |
| original | Request FlowFiles transferred when receiving HTTP responses with a status code between 200 and 299. |
| success | Response FlowFiles transferred when receiving HTTP responses with a status code between 200 and 299. |

## Writes attributes

| Name | Description |
| --- | --- |
| getworkdayreport.java.exception.class | The Java exception class raised when the processor fails |
| getworkdayreport.java.exception.message | The Java exception message raised when the processor fails |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Source / Record Writer |
| record.count | The number of records in an outgoing FlowFile. This is only populated on the ‘success’ relationship when Record Reader and Writer is set. |

---
title: GetZendesk 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/getzendesk.md
section: Loading & Unloading Data
---

# GetZendesk 2025.10.9.21

## Bundle

org.apache.nifi | nifi-zendesk-nar

## Description

Incrementally fetches data from Zendesk API.

## Tags

zendesk

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| web-client-service-provider | Controller service for HTTP client operations. |
| zendesk-authentication-type-name | Type of authentication to Zendesk API. |
| zendesk-authentication-value-name | Password or authentication token for Zendesk login user. |
| zendesk-export-method | Method for incremental export. |
| zendesk-query-start-timestamp | Initial timestamp to query Zendesk API from in Unix timestamp seconds format. |
| zendesk-resource | The particular Zendesk resource which is meant to be exported. |
| zendesk-subdomain | Name of the Zendesk subdomain. |
| zendesk-user | Login user to Zendesk subdomain. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Paging cursor for Zendesk API is stored. Cursor is updated after each successful request. |

## Relationships

| Name | Description |
| --- | --- |
| success | For FlowFiles created as a result of a successful HTTP request. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records fetched by the processor. |

---
title: GrokReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/grokreader.md
section: Loading & Unloading Data
---

# GrokReader

## Description

Provides a mechanism for reading unstructured text data, such as log files, and structuring the data so that it can be processed. The service is configured using Grok patterns. The service reads from a stream of data and splits each message that it finds into a separate Record, each containing the fields that are configured. If a line in the input does not match the expected message pattern, the line of text is either considered to be part of the previous message or is skipped, depending on the configuration, with the exception of stack traces. A stack trace that is found at the end of a log message is considered to be part of the previous message but is added to the ‘stackTrace’ field of the Record. If a record has no stack trace, it will have a NULL value for the stackTrace field (assuming that the schema does in fact include a stackTrace field of type String). Assuming that the schema includes a ‘_raw’ field of type String, the raw message will be included in the Record.

## Tags

grok, logfiles, logs, logstash, parse, pattern, reader, record, regex, text, unstructured

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Grok Expressions \* | Grok Expression |  |  | Specifies the format of a log line in Grok format. This allows the Record Reader to understand how to parse each log line. The property supports one or more Grok expressions. The Reader attempts to parse input lines according to the configured order of the expressions.If a line in the log file does not match any expressions, the line will be assumed to belong to the previous log message.If other Grok patterns are referenced by this expression, they need to be supplied in the Grok Pattern File property. |
| Grok Patterns | Grok Pattern File |  |  | Grok Patterns to use for parsing logs. If not specified, a built-in default Pattern file will be used. If specified, all patterns specified will override the default patterns. See the Controller Service’s Additional Details for a list of pre-defined patterns. |
| Schema Access Strategy \* | Schema Access Strategy | string-fields-from-grok-expression | * Use String Fields From Grok Expression * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| No Match Behavior \* | no-match-behavior | append-to-previous-message | * Append to Previous Message * Skip Line * Raw Line | If a line of text is encountered and it does not match the given Grok Expression, and it is not part of a stack trace, this property specifies how the text should be processed. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Patterns and Expressions can reference resources over HTTP |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Guidelines for using Python extensions in Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors-python-ext-guide.md
section: Loading & Unloading Data
---

# Guidelines for using Python extensions in Openflow

This topic describes the limitations, supported configurations, and best practices when
using Python extensions in Openflow.

Python processors in Openflow use NiFi’s Py4J bridge architecture, which has fundamentally
different resource characteristics than native Java processors. Because Python processors
run as external OS processes outside the JVM, they consume additional system memory, are not
governed by NiFi’s internal resource management, and have limited observability. These
differences affect runtime sizing, capacity planning, and monitoring.

## Architecture differences

Python processors run as external OS processes rather than within the JVM. This
architecture affects how resources are allocated, monitored, and managed:

| Processor type | Java processor | Python processor |
| --- | --- | --- |
| Runtime environment | JVM internal threads | External OS process |
| Memory management | Managed within JVM heap | Separate process memory |
| Lifecycle | NiFi-controlled | External process lifecycle |
| Monitoring | Full NiFi observability | Limited visibility |

## Runtime size constraints

Python extensions are only available on Medium and Large runtimes. Small runtimes
do not support Python processors due to CPU and memory constraints. Snowflake Openflow
blocks Python extensions on Small runtimes:

| Runtime size | Python support | Notes |
| --- | --- | --- |
| Small | Not supported | Python processors are blocked on Small runtimes due to CPU and memory constraints. |
| Medium | Limited (up to 2 Python processors) | The limit is for the entire runtime, not per connector or process group. This limit is currently a recommendation that will be an enforced maximum value for Openflow runtimes in the future. |
| Large | Limited (up to 4 Python processors) | The limit is for the entire runtime, not per connector or process group. This limit is currently a recommendation that will be an enforced maximum value for Openflow runtimes in the future. |

## Best practices

Follow these guidelines for working with Python processors in Openflow:

* Use Java for CPU-heavy operations. Java provides more efficient thread management
  within the JVM. Groovy scripting is a Java-based alternative.
* Use Medium or Large runtimes. Python is not available on Small runtimes.
* Limit the number of Python processors. Stay within the documented limits per runtime size.
* Monitor resource usage. Watch for memory pressure and CPU contention.
* Plan for upgrades. Custom Python processors might require a virtual environment (venv) reset
  after runtime upgrades. For more information, see
  Restore Python processors following runtime upgrades.
* Use single-threaded Python processors. Openflow does not support Python processors spawning
  subprocesses or using multithreading.

## Limitations on using Python processors

The following limitations apply when using Python processors in Openflow.

Runtime constraints
:   Python extensions can only be used with Medium or Large runtimes. Python extensions
    cannot be used with Small runtimes. This is disabled by the platform.

Memory overhead
:   Each Python processor spawns an external OS process with its own memory footprint.
    Python processes can collectively compete with the JVM for resources.

No NiFi resource management
:   Python processors are not observed or limited by NiFi’s internal resource management.
    CPU-heavy Python operations can consume approximately 50% of total server CPU time.

Monitoring gaps
:   The platform lacks visibility into external Python process health and resource consumption.

Upgrade handling
:   After runtime upgrades, custom Python processors might fail to load or exhibit unexpected
    behavior until virtual environments are recreated.

## Restore Python processors following runtime upgrades

If Python processors fail after upgrading the runtime, do the following:

1. Increment the processor version in the `ProcessorDetails.version` field.
2. Rebuild and re-upload the NiFi Archive (NAR) binary. This triggers the Python virtual
   environment cache to reset.
3. Remove and re-add the processor on the canvas. This triggers reinitialization of the
   Py4J bridge.

---
title: HandleHttpRequest 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/handlehttprequest.md
section: Loading & Unloading Data
---

# HandleHttpRequest 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Starts an HTTP Server and listens for HTTP Requests. For each request, creates a FlowFile and transfers to ‘success’. This Processor is designed to be used in conjunction with the HandleHttpResponse Processor in order to create a Web Service. In case of a multipart request, one FlowFile is generated for each part.

## Tags

http, https, ingress, listen, request, web service

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Additional HTTP Methods | A comma-separated list of non-standard HTTP Methods that should be allowed |
| Allow DELETE | Allow HTTP DELETE Method |
| Allow GET | Allow HTTP GET Method |
| Allow HEAD | Allow HTTP HEAD Method |
| Allow OPTIONS | Allow HTTP OPTIONS Method |
| Allow POST | Allow HTTP POST Method |
| Allow PUT | Allow HTTP PUT Method |
| Allowed Paths | A Regular Expression that specifies the valid HTTP Paths that are allowed in the incoming URL Requests. If this value is specified and the path of the HTTP Requests does not match this Regular Expression, the Processor will respond with a 404: NotFound |
| Client Authentication | Specifies whether or not the Processor should authenticate clients. This value is ignored if the <SSL Context Service> Property is not specified or the SSL Context provided uses only a KeyStore and not a TrustStore. |
| Default URL Character Set | The character set to use for decoding URL parameters if the HTTP Request does not supply one |
| HTTP Context Map | The HTTP Context Map Controller Service to use for caching the HTTP Request Information |
| HTTP Protocols | HTTP Protocols supported for Application Layer Protocol Negotiation with TLS |
| Hostname | The Hostname to bind to. If not specified, will bind to all hosts |
| Listening Port | The Port to listen on for incoming HTTP requests |
| Maximum Threads | The maximum number of threads that the embedded HTTP server will use for handling requests. |
| Request Header Maximum Size | The maximum supported size of HTTP headers in requests sent to this processor |
| SSL Context Service | The SSL Context Service to use in order to secure the server. If specified, the server will accept only HTTPS requests; otherwise, the server will accept only HTTP requests |
| container-queue-size | The size of the queue for Http Request Containers |
| multipart-read-buffer-size | The threshold size, at which the contents of an incoming file would be written to disk. Only applies for requests with Content-Type: multipart/form-data. It is used to prevent denial of service type of attacks, to prevent filling up the heap or disk space. |
| multipart-request-max-size | The max size of the request. Only applies for requests with Content-Type: multipart/form-data, and is used to prevent denial of service type of attacks, to prevent filling up the heap or disk space |
| parameters-to-attributes | A comma-separated list of HTTP parameters or form data to output as attributes |

## Relationships

| Name | Description |
| --- | --- |
| success | All content that is received is routed to the ‘success’ relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| http.context.identifier | An identifier that allows the HandleHttpRequest and HandleHttpResponse to coordinate which FlowFile belongs to which HTTP Request/Response. |
| mime.type | The MIME Type of the data, according to the HTTP Header “Content-Type” |
| http.servlet.path | The part of the request URL that is considered the Servlet Path |
| http.context.path | The part of the request URL that is considered to be the Context Path |
| http.method | The HTTP Method that was used for the request, such as GET or POST |
| http.local.name | IP address/hostname of the server |
| http.server.port | Listening port of the server |
| http.query.string | The query string portion of the Request URL |
| http.remote.host | The hostname of the requestor |
| http.remote.addr | The hostname:port combination of the requestor |
| http.remote.user | The username of the requestor |
| http.protocol | The protocol used to communicate |
| http.request.uri | The full Request URL |
| http.auth.type | The type of HTTP Authorization used |
| http.principal.name | The name of the authenticated user making the request |
| http.query.param.XXX | Each of query parameters in the request will be added as an attribute, prefixed with “http.query.param.” |
| http.param.XXX | Form parameters in the request that are configured by “Parameters to Attributes List” will be added as an attribute, prefixed with “http.param.”. Putting form parameters of large size is not recommended. |
| http.subject.dn | The Distinguished Name of the requestor. This value will not be populated unless the Processor is configured to use an SSLContext Service |
| http.issuer.dn | The Distinguished Name of the entity that issued the Subject’s certificate. This value will not be populated unless the Processor is configured to use an SSLContext Service |
| http.certificate.sans.N.name | X.509 Client Certificate Subject Alternative Name value from mutual TLS authentication. The attribute name has a zero-based index ordered according to the content of Client Certificate |
| http.certificate.sans.N.nameType | X.509 Client Certificate Subject Alternative Name type from mutual TLS authentication. The attribute name has a zero-based index ordered according to the content of Client Certificate. The attribute value is one of the General Names from RFC 3280 Section 4.1.2.7 |
| http.headers.XXX | Each of the HTTP Headers that is received in the request will be added as an attribute, prefixed with “http.headers.” For example, if the request contains an HTTP Header named “x-my-header”, then the value will be added to an attribute named “http.headers.x-my-header” |
| http.headers.multipart.XXX | Each of the HTTP Headers that is received in the multipart request will be added as an attribute, prefixed with “http.headers.multipart.” For example, if the multipart request contains an HTTP Header named “content-disposition”, then the value will be added to an attribute named “http.headers.multipart.content-disposition” |
| http.multipart.size | For requests with Content-Type “multipart/form-data”, the part’s content size is recorded into this attribute |
| http.multipart.content.type | For requests with Content-Type “multipart/form-data”, the part’s content type is recorded into this attribute |
| http.multipart.name | For requests with Content-Type “multipart/form-data”, the part’s name is recorded into this attribute |
| http.multipart.filename | For requests with Content-Type “multipart/form-data”, when the part contains an uploaded file, the name of the file is recorded into this attribute. Files are stored temporarily at the default temporary-file directory specified in “java.io.File” Java Docs) |
| http.multipart.fragments.sequence.number | For requests with Content-Type “multipart/form-data”, the part’s index is recorded into this attribute. The index starts with 1. |
| http.multipart.fragments.total.number | For requests with Content-Type “multipart/form-data”, the count of all parts is recorded into this attribute. |

## See also

* [org.apache.nifi.processors.standard.HandleHttpResponse](handlehttpresponse.md)

---
title: HandleHttpResponse 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/handlehttpresponse.md
section: Loading & Unloading Data
---

# HandleHttpResponse 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends an HTTP Response to the Requestor that generated a FlowFile. This Processor is designed to be used in conjunction with the HandleHttpRequest in order to create a web service.

## Tags

egress, http, https, response, web service

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attributes to add to the HTTP Response (Regex) | Specifies the Regular Expression that determines the names of FlowFile attributes that should be added to the HTTP response |
| HTTP Context Map | The HTTP Context Map Controller Service to use for caching the HTTP Request Information |
| HTTP Status Code | The HTTP Status Code to use when responding to the HTTP Request. See Section 10 of RFC 2616 for more information. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles will be routed to this Relationship if the Processor is unable to respond to the requestor. This may happen, for instance, if the connection times out or if NiFi is restarted before responding to the HTTP Request. |
| success | FlowFiles will be routed to this Relationship after the response has been successfully sent to the requestor |

## See also

* [org.apache.nifi.processors.standard.HandleHttpRequest](handlehttprequest.md)

---
title: HazelcastMapCacheClient
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/hazelcastmapcacheclient.md
section: Loading & Unloading Data
---

# HazelcastMapCacheClient

## Description

An implementation of DistributedMapCacheClient that uses Hazelcast as the backing cache. This service relies on an other controller service, manages the actual Hazelcast calls, set in Hazelcast Cache Manager.

## Tags

cache, hazelcast, map

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Hazelcast Cache Manager \* | hazelcast-cache-manager |  |  | A Hazelcast Cache Manager which manages connections to Hazelcast and provides cache instances. |
| Hazelcast Cache Name \* | hazelcast-cache-name |  |  | The name of a given cache. A Hazelcast cluster may handle multiple independent caches, each identified by a name. Clients using caches with the same name are working on the same data structure within Hazelcast. |
| Hazelcast Entry Lifetime \* | hazelcast-entry-ttl | 5 min |  | Indicates how long the written entries should exist in Hazelcast. Setting it to ‘0 secs’ means that the datawill exists until its deletion or until the Hazelcast server is shut down. Using `EmbeddedHazelcastCacheManager` ascache manager will not provide policies to limit the size of the cache. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: HikariCPConnectionPool
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/hikaricpconnectionpool.md
section: Loading & Unloading Data
---

# HikariCPConnectionPool

## Description

Provides Database Connection Pooling Service based on HikariCP. Connections can be asked from pool and returned after usage.

## Tags

connection, database, dbcp, hikari, jdbc, pooling, store

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Database Connection URL \* | hikaricp-connection-url |  |  | A database connection URL used to connect to a database. May contain database system name, host, port, database name and some parameters. The exact syntax of a database connection URL is specified by your DBMS. |
| Database Driver Class Name \* | hikaricp-driver-classname |  |  | The fully-qualified class name of the JDBC driver. Example: com.mysql.jdbc.Driver |
| Database Driver Location(s) | hikaricp-driver-locations |  |  | Comma-separated list of files/folders and/or URLs containing the driver JAR and its dependencies (if any). For example ‘/var/tmp/mariadb-java-client-1.1.7.jar’ |
| Kerberos User Service | hikaricp-kerberos-user-service |  |  | Specifies the Kerberos User Controller Service that should be used for authenticating with Kerberos |
| Max Connection Lifetime | hikaricp-max-conn-lifetime | -1 |  | The maximum lifetime of a connection. After this time is exceeded the connection will fail the next activation, passivation or validation test. A value of zero or less means the connection has an infinite lifetime. |
| Max Total Connections \* | hikaricp-max-total-conns | 10 |  | This property controls the maximum size that the pool is allowed to reach, including both idle and in-use connections. Basically this value will determine the maximum number of actual connections to the database backend. A reasonable value for this is best determined by your execution environment. When the pool reaches this size, and no idle connections are available, the service will block for up to connectionTimeout milliseconds before timing out. |
| Max Wait Time \* | hikaricp-max-wait-time | 500 millis |  | The maximum amount of time that the pool will wait (when there are no available connections) for a connection to be returned before failing, or 0 <time units> to wait indefinitely. |
| Minimum Idle Connections \* | hikaricp-min-idle-conns | 10 |  | This property controls the minimum number of idle connections that HikariCP tries to maintain in the pool. If the idle connections dip below this value and total connections in the pool are less than ‘Max Total Connections’, HikariCP will make a best effort to add additional connections quickly and efficiently. It is recommended that this property to be set equal to ‘Max Total Connections’. |
| Password | hikaricp-password |  |  | The password for the database user |
| Database User | hikaricp-username |  |  | Database user name |
| Validation Query | hikaricp-validation-query |  |  | Validation Query used to validate connections before returning them. When connection is invalid, it gets dropped and new valid connection will be returned. NOTE: Using validation might have some performance penalty. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Database Driver Location can reference resources over HTTP |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: HttpRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/httprecordsink.md
section: Loading & Unloading Data
---

# HttpRecordSink

## Description

Format and send Records to a configured uri using HTTP post. The Record Writer formats the records which are sent as the body of the HTTP post request. JsonRecordSetWriter is often used with this processor because many HTTP posts require a JSON body.

## Tags

http, post, record, sink

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API URL \* | API URL |  |  | The URL which receives the HTTP requests. |
| Maximum Batch Size \* | Maximum Batch Size | 0 |  | Specifies the maximum number of records to send in the body of each HTTP request. Zero means the batch size is not limited, and all records are sent together in a single HTTP request. |
| OAuth2 Access Token Provider | OAuth2 Access Token Provider |  |  | OAuth2 service that provides the access tokens for the HTTP requests. |
| Web Service Client Provider \* | Web Service Client Provider |  |  | Controller service to provide the HTTP client for sending the HTTP requests. |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: IdentifyMimeType 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/identifymimetype.md
section: Loading & Unloading Data
---

# IdentifyMimeType 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Attempts to identify the MIME Type used for a FlowFile. If the MIME Type can be identified, an attribute with the name ‘mime.type’ is added with the value being the MIME Type. If the MIME Type cannot be determined, the value will be set to ‘application/octet-stream’. In addition, the attribute ‘mime.extension’ will be set if a common file extension for the MIME Type is known. If the MIME Type detected is of type text/\*, attempts to identify the charset used and an attribute with the name ‘mime.charset’ is added with the value being the charset.

## Tags

MIME, bzip2, compression, file, gzip, identify, mime.type, zip

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Custom MIME Configuration | A URL or file path to a custom Tika Mime type configuration or the actual content of a custom Tika Mime type configuration. |
| config-strategy | Select the loading strategy for MIME Type configuration to be used. |
| use-filename-in-detection | If true will pass the filename to Tika to aid in detection. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | This Processor sets the FlowFile’s mime.type attribute to the detected MIME Type. If unable to detect the MIME Type, the attribute’s value will be set to application/octet-stream |
| mime.extension | This Processor sets the FlowFile’s mime.extension attribute to the file extension associated with the detected MIME Type. If there is no correlated extension, the attribute’s value will be empty |
| mime.charset | This Processor sets the FlowFile’s mime.charset attribute to the detected charset. If unable to detect the charset or the detected MIME type is not of type text/\*, the attribute will not be set |

---
title: Install and configure the Openflow Connector for Oracle
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/setup-connector.md
section: Loading & Unloading Data
---

# Install and configure the Openflow Connector for Oracle

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes the steps to install and configure the Openflow Connector for Oracle connector.

As a data engineer, perform the following tasks to install and configure the connector:

## Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, do the following as a data engineer:

1. Right-click on the added runtime and select Parameters.
2. Populate the required parameter values.

   For more information on the required parameter values, see the following sections:

   * Snowflake Destination Parameters: Used to establish connection with Snowflake.
   * Oracle Ingestion Parameters: Used to specify the tables to replicate.
   * Oracle Source Parameters: Used to define the configuration of data downloaded from Oracle.

### Snowflake Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Connection Strategy | When using KEY_PAIR, specify the strategy for connecting to Snowflake:   * **STANDARD** (default): Connect using standard public routing to Snowflake services. * **PRIVATE_CONNECTIVITY**: Connect using private addresses associated with the supporting cloud platform such as AWS PrivateLink. | Required for BYOC with KEY_PAIR only, otherwise ignored. |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use Snowflake Role assigned to the runtime or child role granted to this Snowflake Role.   You can find your runtime Snowflake Role in the Openflow UI, by expanding the More Options [⋮] button for your runtime and selecting Set Snowflake role. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

### Oracle Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Included Table Names | Comma-separated list of fully-qualified table paths. Tables must be specified using fully qualified database, schema and table name format: DATABASE_NAME.SCHEMA_NAME.TABLE_NAME.  For example: `MYPDB.SALES.CUSTOMERS, MYPDB.SALES.ORDERS` |
| Included Table Regex | A regular expression to match table paths for automatic inclusion of existing and new tables. The regex pattern must match the three-part naming convention: DATABASE_NAME.SCHEMA_NAME.TABLE_NAME.  For example: `MYPDB\.SALES\..*` to match all tables in the SALES schema within the MYPDB database. |
| Column Filter JSON | Optional. A JSON array of filter objects specifying which columns to include or exclude per table. For syntax details and examples, see Replicate a subset of columns in a table. |
| Merge Task Schedule CRON | A CRON expression to define when merge operations from the Journal to the Destination Table are triggered. For example, \* \* \* \* \* ? for continuous merge. |
| Object Identifier Resolution | Specifies how source object identifiers such as schemas, tables, and column names are stored and queried in Snowflake. This setting determines if you must use double quotes in SQL queries.  Option 1: Default, case-insensitive (recommended).   * **Transformation**: All identifiers are converted to uppercase. For   example, `My_Table` becomes `MY_TABLE`. * **Queries**: SQL queries are case-insensitive and don’t require SQL   double quotes.  For example `SELECT * FROM my_table;` returns the same results as `SELECT * FROM MY_TABLE;`.   **Note:** Snowflake recommends using this option if database objects are not expected to have mixed case names.  Option 2: case-sensitive.   * **Transformation**: Case is preserved.   For example, `My_Table` remains `My_Table`. * **Queries**: SQL queries must use double quotes to match the exact   case for database objects.   For example, `SELECT * FROM "My_Table";`.   **Important:** Do not change this setting after connector ingestion has begun. Changing this setting after ingestion has begun breaks the existing ingestion. If you must change this setting, create a new connector instance. |
| Snapshot Fetching Strategy | Determines the snapshot load fetching strategy:   * **SEQUENTIAL_BY_PRIMARY_KEY** (default): Uses fixed-size batches retrieved sequentially by primary key. * **CONCURRENT_BY_ROWID**: Splits tables into chunks bound by ranges of physical row ids, and retrieves each chunk in parallel. |

### Oracle Source Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Oracle Connection URL | JDBC URL of the database connection to the DB. The URL must specify the target container (PDB or CDB) that contains the data to be replicated. For example `jdbc:oracle:thin@<host>:<port>/YOUR_DB_NAME` where YOUR_DB_NAME is the name of your PDB or CDB.  When SSL is enabled, use the TCPS protocol, for example `jdbc:oracle:thin:@tcps://<host>:<tcps_port>/YOUR_DB_NAME`.  **Note:** The connector works within a single database/container. Ensure the JDBC URL points directly to the container that holds the tables to be replicated. | Yes |
| Oracle Username | Username of the connect user that has access to the XStream Server. | Yes |
| Oracle Password | Password of the connect user that has access to the XStream Server. | Yes |
| Oracle SSL Mode | Controls SSL encryption for connections to the Oracle database.   * **DISABLED**, which is the default: Connect without SSL. * **VERIFY_CA**: Connect with SSL. Verifies that a trusted Certificate Authority   issued the server certificate. * **VERIFY_IDENTITY**: Connect with SSL. Verifies the CA certificate and that the   server hostname matches the certificate’s subject.   When set to VERIFY_CA or VERIFY_IDENTITY, you must also provide the Oracle Wallet Filename parameter. | Yes |
| Oracle Wallet Filename | Upload the file that contains the Oracle auto-login wallet file (`cwallet.sso`). The wallet must contain the trusted server certificate for SSL connections.  For information about creating the wallet, see [Configure SSL connections (optional)](setup-oracledb.md). | Required when SSL Mode is not DISABLED |
| Oracle Database Processor Multiplier | Core Processor Licensing Factor as described in [Oracle Processor Core Factor Table](https://www.oracle.com/contracts/docs/processor-core-factor-table-070634.pdf) | Required for Embedded License only |
| Oracle Database Processor Cores | The number of processor cores in your Oracle database. | Required for Embedded License only |
| XStream Billing Acknowledgement | A confirmation of the licensing agreement | Required for Embedded License only |
| XStream Out Server Name | The name of the XStream Server that must already exist in Oracle. | Yes |
| XStream Out Server URL | JDBC URL of the database connection for XStream, must use OCI driver. For example `jdbc:oracle:oci:@<host>:<port>/SID`.  When SSL is enabled, use the TCPS protocol, for example `jdbc:oracle:oci:@tcps://<host>:<tcps_port>/SID`.  **Note:** When SSL Mode is enabled, the connector automatically adds `SSL_SERVER_DN_MATCH` and `MY_WALLET_DIRECTORY` to the XStream URL. You do not need to include these manually. | Yes |

## Restart table replication

A table in FAILED state — for example, due to a missing primary key or unsupported schema change — does not restart automatically. If a table enters a FAILED state or you need to restart replication from scratch, use the following procedure to remove and re-add the table to replication.

> **Note:**
>
> If the failure was caused by an issue in the source table such as a missing primary key, resolve that issue in the source database before continuing.

1. Remove the table from flow parameters: In the Ingestion Parameters context, either remove the table from the Included Table Names or modify the Included Table Regex so the table is no longer matched.
2. Verify the table has been removed:

   1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services.
   2. In the table listing controller services, locate the Table State Store row, click the three vertical dots on the right side of the row, then choose View State.
   > **Important:**
   >
   > You must wait until the table’s state is fully removed from this list before proceeding. Do not continue until this configuration change has completed.
3. Clean up the destination: Once the table’s state shows as fully removed, manually [DROP](../../../../../sql-reference/sql/drop-table.md) the destination table in Snowflake. Note that the connector will not overwrite an existing destination table during the snapshot phase; if the table still exists, replication will fail again. Optionally, the journal table and stream can also be removed if they are no longer needed.
4. Re-add the table: Update the Included Table Names or Included Table Regex parameters to include the table again.
5. Verify the restart: Check the Table State Store using the instructions given previously. The state of the table should appear with the status NEW, then transition to SNAPSHOT_REPLICATION, and finally INCREMENTAL_REPLICATION.

## Replicate a subset of columns in a table

The connector can filter the data replicated per table to a subset of configured columns.
Primary key columns are always included regardless of exclusions.

To apply column filters, set the Column Filter JSON parameter in the Ingestion Parameters context
to a JSON array of filter objects, one per table you want to filter.

Columns can be included or excluded by name or by regular expression pattern. You can apply a single condition per table,
or combine multiple conditions, with exclusions always taking precedence over inclusions.

### Syntax

Each object in the array identifies a table and specifies which columns to include or exclude.
Because this connector uses three-part fully qualified names (database, schema, and table), each object
can include a `database` or `databasePattern` field in addition to the schema and table fields.

```javascript
[
    {
        "database": "<database>" | "databasePattern": "<regex>",
        "schema": "<schema>" | "schemaPattern": "<regex>",
        "table": "<table>" | "tablePattern": "<regex>",
        "included": ["<column>", "<column>"],
        "excluded": ["<column>", "<column>"],
        "includedPattern": "<regex>",
        "excludedPattern": "<regex>"
    }
]
```

The following rules apply:

* Use `database`, `schema`, and `table` for exact name matching, or `databasePattern`,
  `schemaPattern`, and `tablePattern` for regex matching. You cannot use both a field and its
  pattern variant in the same object (for example, `schema` and `schemaPattern` cannot both appear).
* At least one of `included`, `excluded`, `includedPattern`, or `excludedPattern` must be provided.
* When both included and excluded filters are specified, exclusions take precedence.
* When multiple filters match the same table, the last matching filter is used, with exact matches
  taking precedence over pattern-based filters.
* The value can be an array of objects to apply different filters to different tables.

### Examples

Include specific columns by name:

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "orders",
        "included": ["account_id", "status", "created_at"]
    }
]
```

Exclude specific columns by name:

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "orders",
        "excluded": ["internal_note", "debug_flag"]
    }
]
```

Combine an include pattern with a specific exclusion (for example, include all email columns except `admin_email`):

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "contacts",
        "includedPattern": ".*_email",
        "excluded": ["admin_email"]
    }
]
```

Mix a database pattern with an exact schema and table name to apply a filter across databases:

```javascript
[
    {
        "databasePattern": "prod_.*",
        "schema": "dbo",
        "table": "customers",
        "excluded": ["internal_note"]
    }
]
```

Pass multiple filter objects to apply different rules to different tables:

```javascript
[
    {"database": "my_db", "schema": "dbo", "table": "orders", "included": ["account_id", "status"]},
    {"database": "my_db", "schema": "dbo", "table": "customers", "excludedPattern": ".*_internal"}
]
```

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

## Next steps

* (Optional) [Set up incremental replication without snapshots](incremental-replication.md).
* [Monitor the flow](../../monitor.md).

---
title: InvokeHTTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/invokehttp.md
section: Loading & Unloading Data
---

# InvokeHTTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

An HTTP client processor which can interact with a configurable HTTP Endpoint. The destination URL and HTTP Method are configurable. When the HTTP Method is PUT, POST or PATCH, the FlowFile contents are included as the body of the request and FlowFile attributes are converted to HTTP headers, optionally, based on configuration properties.

## Tags

client, http, https, rest

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Connection Timeout | Maximum time to wait for initial socket connection to the HTTP URL. |
| HTTP Method | HTTP request method (GET, POST, PUT, PATCH, DELETE, HEAD, OPTIONS). Arbitrary methods are also supported. Methods other than POST, PUT and PATCH will be sent without a message body. |
| HTTP URL | HTTP remote URL including a scheme of http or https, as well as a hostname or IP address with optional port and path elements. Any encoding of the URL must be done by the user. |
| HTTP/2 Disabled | Disable negotiation of HTTP/2 protocol. HTTP/2 requires TLS. HTTP/1.1 protocol supported is required when HTTP/2 is disabled. |
| OAuth2 Access Token Refresh Strategy | Specifies which strategy should be used to refresh the OAuth2 Access Token. |
| Request Body Enabled | Enable sending HTTP request body for PATCH, POST, or PUT methods. |
| Request Chunked Transfer-Encoding Enabled | Enable sending HTTP requests with the Transfer-Encoding Header set to chunked, and disable sending the Content-Length Header. Transfer-Encoding applies to the body in HTTP/1.1 requests as described in RFC 7230 Section 3.3.1 |
| Request Content-Encoding | HTTP Content-Encoding applied to request body during transmission. The receiving server must support the selected encoding to avoid request failures. |
| Request Content-Type | HTTP Content-Type Header applied to when sending an HTTP request body for PATCH, POST, or PUT methods. The Content-Type defaults to application/octet-stream when not configured. |
| Request Date Header Enabled | Enable sending HTTP Date Header on HTTP requests as described in RFC 7231 Section 7.1.1.2. |
| Request Digest Authentication Enabled | Enable Digest Authentication on HTTP requests with Username and Password credentials as described in RFC 7616. |
| Request Failure Penalization Enabled | Enable penalization of request FlowFiles when receiving HTTP response with a status code between 400 and 499. |
| Request Header Attributes Pattern | Regular expression that defines which FlowFile attributes to send as HTTP headers in the request. If not defined, no attributes are sent as headers. Dynamic properties will be always be sent as headers. The dynamic property name will be the header key and the dynamic property value, interpreted as Expression Language, will be the header value. Attributes and their values are limited to ASCII characters due to the requirement of the HTTP protocol. |
| Request Multipart Form-Data Filename Enabled | Enable sending the FlowFile filename attribute as the filename parameter in the Content-Disposition Header for multipart/form-data HTTP requests. |
| Request Multipart Form-Data Name | Enable sending HTTP request body formatted using multipart/form-data and using the form name configured. |
| Request OAuth2 Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token applied to HTTP requests using the Authorization Header. |
| Request Password | The password provided for authentication of HTTP requests. Encoded using Base64 for HTTP Basic Authentication as described in RFC 7617. |
| Request User-Agent | HTTP User-Agent Header applied to requests. RFC 7231 Section 5.5.3 describes recommend formatting. |
| Request Username | The username provided for authentication of HTTP requests. Encoded using Base64 for HTTP Basic Authentication as described in RFC 7617. |
| Response Body Attribute Name | FlowFile attribute name used to write an HTTP response body for FlowFiles transferred to the Original relationship. |
| Response Body Attribute Size | Maximum size in bytes applied when writing an HTTP response body to a FlowFile attribute. Attributes exceeding the maximum will be truncated. |
| Response Body Ignored | Disable writing HTTP response FlowFiles to Response relationship |
| Response Cache Enabled | Enable HTTP response caching described in RFC 7234. Caching responses considers ETag and other headers. |
| Response Cache Size | Maximum size of HTTP response cache in bytes. Caching responses considers ETag and other headers. |
| Response Cookie Strategy | Strategy for accepting and persisting HTTP cookies. Accepting cookies enables persistence across multiple requests. |
| Response FlowFile Naming Strategy | Determines the strategy used for setting the filename attribute of FlowFiles transferred to the Response relationship. |
| Response Generation Required | Enable generation and transfer of a FlowFile to the Response relationship regardless of HTTP response status code received. |
| Response Header Request Attributes Enabled | Enable adding HTTP response headers as attributes to FlowFiles transferred to the Original, Retry or No Retry relationships. |
| Response Header Request Attributes Prefix | Prefix to HTTP response headers when included as attributes to FlowFiles transferred to the Original, Retry or No Retry relationships. It is recommended to end with a separator character like ‘.’ or ‘-‘. |
| Response Redirects Enabled | Enable following HTTP redirects sent with HTTP 300 series responses as described in RFC 7231 Section 6.4. |
| SSL Context Service | SSL Context Service provides trusted certificates and client certificates for TLS communication. |
| Socket Idle Connections | Maximum number of idle connections to the HTTP URL. |
| Socket Idle Timeout | Maximum time to wait before closing idle connections to the HTTP URL. |
| Socket Read Timeout | Maximum time to wait for receiving responses from a socket connection to the HTTP URL. |
| Socket Write Timeout | Maximum time to wait for write operations while sending requests from a socket connection to the HTTP URL. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| Failure | Request FlowFiles transferred when receiving socket communication errors. |
| No Retry | Request FlowFiles transferred when receiving HTTP responses with a status code between 400 an 499. |
| Original | Request FlowFiles transferred when receiving HTTP responses with a status code between 200 and 299. |
| Response | Response FlowFiles transferred when receiving HTTP responses with a status code between 200 and 299. Enabling [Response Generation Required] changes routing behavior, sending unsuccessful responses to their corresponding relationships and also sending FlowFiles to the Response relationship as well, regardless of status code received. |
| Retry | Request FlowFiles transferred when receiving HTTP responses with a status code between 500 and 599. |

## Writes attributes

| Name | Description |
| --- | --- |
| invokehttp.status.code | The status code that is returned |
| invokehttp.status.message | The status message that is returned |
| invokehttp.response.body | In the instance where the status code received is not a success (2xx) then the response body will be put to the ‘invokehttp.response.body’ attribute of the request FlowFile. |
| invokehttp.request.url | The original request URL |
| invokehttp.request.duration | Duration (in milliseconds) of the HTTP call to the external endpoint |
| invokehttp.response.url | The URL that was ultimately requested after any redirects were followed |
| invokehttp.tx.id | The transaction ID that is returned after reading the response |
| invokehttp.remote.dn | The DN of the remote server |
| invokehttp.java.exception.class | The Java exception class raised when the processor fails |
| invokehttp.java.exception.message | The Java exception message raised when the processor fails |
| user-defined | If the ‘Put Response Body In Attribute’ property is set then whatever it is set to will become the attribute key and the value would be the body of the HTTP response. |

---
title: InvokeScriptedProcessor 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/invokescriptedprocessor.md
section: Loading & Unloading Data
---

# InvokeScriptedProcessor 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

Experimental - Invokes a script engine for a Processor defined in the given script. The script must define a valid class that implements the Processor interface, and it must set a variable ‘processor’ to an instance of the class. Processor methods such as onTrigger() will be delegated to the scripted Processor instance. Also any Relationships or PropertyDescriptors defined by the scripted processor will be added to the configuration dialog. The scripted processor can implement public void setLogger(ComponentLog logger) to get access to the parent logger, as well as public void onScheduled(ProcessContext context) and public void onStopped(ProcessContext context) methods to be invoked when the parent InvokeScriptedProcessor is scheduled or stopped, respectively. NOTE: The script will be loaded when the processor is populated with property values, see the Restrictions section for more security implications. Experimental: Impact of sustained usage not yet verified.

## Tags

groovy, invoke, script

## Input Requirement

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | Language Engine for executing scripts |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |
| CLUSTER | Scripts can store and retrieve state using the State Management APIs. Consult the State Manager section of the Developer’s Guide for more details. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## See also

* [org.apache.nifi.processors.script.ExecuteScript](executescript.md)

---
title: IPLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/iplookupservice.md
section: Loading & Unloading Data
---

# IPLookupService

## Description

A lookup service that provides several types of enrichment information for IP addresses. The service is configured by providing a MaxMind Database file and specifying which types of enrichment should be provided for an IP Address or Hostname. Each type of enrichment is a separate lookup, so configuring the service to provide all of the available enrichment data may be slower than returning only a portion of the available enrichments. In order to use this service, a lookup must be performed using key of ‘ip’ and a value that is a valid IP address or hostname. View the Usage of this component and choose to view Additional Details for more information, such as the Schema that pertains to the information that is returned.

## Tags

anonymous, cellular, domain, enrich, geo, ip, ipgeo, isp, lookup, maxmind, tor

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| MaxMind Database File \* | database-file |  |  | Path to Maxmind IP Enrichment Database File |
| Lookup Anonymous IP Information \* | lookup-anonymous-ip | false | * true * false | Specifies whether or not information about whether or not the IP address belongs to an anonymous network should be returned. |
| Lookup Geo Enrichment \* | lookup-city | true | * true * false | Specifies whether or not information about the geographic information, such as cities, corresponding to the IP address should be returned |
| Lookup Connection Type \* | lookup-connection-type | false | * true * false | Specifies whether or not information about the Connection Type corresponding to the IP address should be returned. If true, the lookup will contain a ‘connectionType’ field that (if populated) will contain a value of ‘Dialup’, ‘Cable/DSL’, ‘Corporate’, or ‘Cellular’ |
| Lookup Domain Name \* | lookup-domain | false | * true * false | Specifies whether or not information about the Domain Name corresponding to the IP address should be returned. If true, the lookup will contain second-level domain information, such as foo.com but will not contain bar.foo.com |
| Lookup ISP \* | lookup-isp | false | * true * false | Specifies whether or not information about the Information Service Provider corresponding to the IP address should be returned |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ISPEnrichIP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/ispenrichip.md
section: Loading & Unloading Data
---

# ISPEnrichIP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-enrich-nar

## Description

Looks up ISP information for an IP address and adds the information to FlowFile attributes. The ISP data is provided as a MaxMind ISP database. (Note that this is NOT the same as the GeoLite database utilized by some geo enrichment tools). The attribute that contains the IP address to lookup is provided by the ‘IP Address Attribute’ property. If the name of the attribute provided is ‘X’, then the attributes added by enrichment will take the form X.isp.<fieldName>

## Tags

ISP, enrich, ip, maxmind

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| IP Address Attribute | The name of an attribute whose value is a dotted decimal IP address for which enrichment should occur |
| Log Level | The Log Level to use when an IP is not found in the database. Accepted values: INFO, DEBUG, WARN, ERROR. |
| MaxMind Database File | Path to Maxmind IP Enrichment Database File |

## Relationships

| Name | Description |
| --- | --- |
| found | Where to route flow files after successfully enriching attributes with data provided by database |
| not found | Where to route flow files after unsuccessfully enriching attributes because no data was found |

## Writes attributes

| Name | Description |
| --- | --- |
| X.isp.lookup.micros | The number of microseconds that the geo lookup took |
| X.isp.asn | The Autonomous System Number (ASN) identified for the IP address |
| X.isp.asn.organization | The Organization Associated with the ASN identified |
| X.isp.name | The name of the ISP associated with the IP address provided |
| X.isp.organization | The Organization associated with the IP address provided |

---
title: JettyWebSocketClient
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jettywebsocketclient.md
section: Loading & Unloading Data
---

# JettyWebSocketClient

## Description

Implementation of WebSocketClientService. This service uses Jetty WebSocket client module to provide WebSocket session management throughout the application.

## Tags

Jetty, WebSocket, client

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Authentication Header Charset \* | Authentication Header Charset | US-ASCII |  | The charset for Basic Authentication header base64 string. |
| Connection Attempt Count \* | Connection Attempt Count | 3 |  | The number of times to try and establish a connection. |
| Connection Timeout \* | Connection Timeout | 3 sec |  | The timeout to connect the WebSocket URI. |
| Custom Authorization | Custom Authorization |  |  | Configures a custom HTTP Authorization Header as described in RFC 7235 Section 4.2. Setting a custom Authorization Header excludes configuring the User Name and User Password properties for Basic Authentication. |
| HTTP Proxy Host | HTTP Proxy Host |  |  | The host name of the HTTP Proxy. |
| HTTP Proxy Port | HTTP Proxy Port |  |  | The port number of the HTTP Proxy. |
| Idle Timeout \* | Idle Timeout | 0 sec |  | The maximum amount of time that a WebSocket connection may remain idle before it is closed. A value of 0 sec disables the timeout. |
| Input Buffer Size \* | Input Buffer Size | 4 kb |  | The size of the input (read from network layer) buffer size. |
| Max Binary Message Size \* | Max Binary Message Size | 64 kb |  | The maximum size of a binary message during parsing/generating. |
| Max Text Message Size \* | Max Text Message Size | 64 kb |  | The maximum size of a text message during parsing/generating. |
| Password | Password |  |  | The user password for Basic Authentication. |
| SSL Context Service | SSL Context Service |  |  | The SSL Context Service to use in order to secure the server. If specified, the server will accept only WSS requests; otherwise, the server will accept only WS requests |
| Session Maintenance Interval \* | Session Maintenance Interval | 10 sec |  | The interval between session maintenance activities. A WebSocket session established with a WebSocket server can be terminated due to different reasons including restarting the WebSocket server or timing out inactive sessions. This session maintenance activity is periodically executed in order to reconnect those lost sessions, so that a WebSocket client can reuse the same session id transparently after it reconnects successfully. The maintenance activity is executed until corresponding processors or this controller service is stopped. |
| Username | Username |  |  | The user name for Basic Authentication. |
| WebSocket URI \* | WebSocket URI |  |  | The WebSocket URI this client connects to. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JettyWebSocketServer
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jettywebsocketserver.md
section: Loading & Unloading Data
---

# JettyWebSocketServer

## Description

Implementation of WebSocketServerService. This service uses Jetty WebSocket server module to provide WebSocket session management throughout the application.

## Tags

Jetty, WebSocket, server

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Basic Authentication Enabled \* | Basic Authentication Enabled | false | * true * false | If enabled, client connection requests are authenticated with Basic authentication using the specified Login Provider. |
| Basic Authentication Path Spec | Basic Authentication Path Spec | /\* |  | Specify a Path Spec to apply Basic Authentication. |
| Basic Authentication Roles | Basic Authentication Roles | `**` |  | The authenticated user must have one of specified role. Multiple roles can be set as comma separated string. ‘\*’ represents any role and so does ‘\*\*’ any role including no role. |
| Client Authentication \* | Client Authentication | no | * No Authentication * Want Authentication * Need Authentication | Specifies whether or not the Processor should authenticate client by its certificate. This value is ignored if the <SSL Context Service> Property is not specified or the SSL Context provided uses only a KeyStore and not a TrustStore. |
| Idle Timeout \* | Idle Timeout | 0 sec |  | The maximum amount of time that a WebSocket connection may remain idle before it is closed. A value of 0 sec disables the timeout. |
| Input Buffer Size \* | Input Buffer Size | 4 kb |  | The size of the input (read from network layer) buffer size. |
| Login Service | Login Service | hash | * HashLoginService | Specify which Login Service to use for Basic Authentication. |
| Max Binary Message Size \* | Max Binary Message Size | 64 kb |  | The maximum size of a binary message during parsing/generating. |
| Max Text Message Size \* | Max Text Message Size | 64 kb |  | The maximum size of a text message during parsing/generating. |
| Port \* | Port |  |  | The port number on which this WebSocketServer listens to. |
| SSL Context Service | SSL Context Service |  |  | The SSL Context Service to use in order to secure the server. If specified, the server will accept only WSS requests; otherwise, the server will accept only WS requests |
| Users Properties File | users-properties-file |  |  | Specify a property file containing users for Basic Authentication using HashLoginService. See <http://www.eclipse.org/jetty/documentation/current/configuring-security.html> for detail. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JMSConnectionFactoryProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jmsconnectionfactoryprovider.md
section: Loading & Unloading Data
---

# JMSConnectionFactoryProvider

## Description

Provides a generic service to create vendor specific javax.jms. ConnectionFactory implementations. The Connection Factory can be served once this service is configured successfully.

## Tags

integration, jms, messaging, publish, queue, subscribe, topic

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| JMS SSL Context Service | SSL Context Service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| JMS Broker URI | broker |  |  | URI pointing to the network location of the JMS Message broker. Example for ActiveMQ: ‘<tcp://myhost:61616>’. Examples for IBM MQ: ‘myhost(1414)’ and ‘myhost01(1414),myhost02(1414)’. |
| JMS Connection Factory Implementation Class \* | cf |  |  | The fully qualified name of the JMS ConnectionFactory implementation class (eg. org.apache.activemq.ActiveMQConnectionFactory). |
| JMS Client Libraries | cflib |  |  | Path to the directory with additional resources (eg. JARs, configuration files etc.) to be added to the classpath (defined as a comma separated list of values). Such resources typically represent target JMS client libraries for the ConnectionFactory implementation. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Client Library Location can reference resources over HTTP |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JndiJmsConnectionFactoryProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jndijmsconnectionfactoryprovider.md
section: Loading & Unloading Data
---

# JndiJmsConnectionFactoryProvider

## Description

Provides a service to lookup an existing JMS ConnectionFactory using the Java Naming and Directory Interface (JNDI).

## Tags

integration, jms, jndi, messaging, publish, queue, subscribe, topic

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| JNDI Name of the Connection Factory \* | connection.factory.name |  |  | The name of the JNDI Object to lookup for the Connection Factory. |
| JNDI Initial Context Factory Class \* | java.naming.factory.initial |  |  | The fully qualified class name of the JNDI Initial Context Factory Class (java.naming.factory.initial). |
| JNDI Provider URL \* | java.naming.provider.url |  |  | The URL of the JNDI Provider to use as the value for java.naming.provider.url. See additional details documentation for allowed URL schemes. |
| JNDI Credentials | java.naming.security.credentials |  |  | The Credentials to use when authenticating with JNDI (java.naming.security.credentials). |
| JNDI Principal | java.naming.security.principal |  |  | The Principal to use when authenticating with JNDI (java.naming.security.principal). |
| JNDI / JMS Client Libraries | naming.factory.libraries |  |  | Specifies jar files and/or directories to add to the ClassPath in order to load the JNDI / JMS client libraries. This should be a comma-separated list of files, directories, and/or URLs. If a directory is given, any files in that directory will be included, but subdirectories will not be included (i.e., it is not recursive). |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JoinEnrichment 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/joinenrichment.md
section: Loading & Unloading Data
---

# JoinEnrichment 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Joins together Records from two different FlowFiles where one FlowFile, the ‘original’ contains arbitrary records and the second FlowFile, the ‘enrichment’ contains additional data that should be used to enrich the first. See Additional Details for more information on how to configure this processor and the different use cases that it aims to accomplish.

## Tags

combine, enrichment, fork, join, merge, record, recordpath, sql, streams, wrap

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Enrichment Record Reader | The Record Reader for reading the ‘enrichment’ FlowFile |
| Insertion Record Path | Specifies where in the ‘original’ Record the ‘enrichment’ Record’s fields should be inserted. Note that if the RecordPath does not point to any existing field in the original Record, the enrichment will not be inserted. |
| Join Strategy | Specifies how to join the two FlowFiles into a single FlowFile |
| Maximum number of Bins | Specifies the maximum number of bins that can be held in memory at any one time |
| Original Record Reader | The Record Reader for reading the ‘original’ FlowFile |
| Record Writer | The Record Writer to use for writing the results. If the Record Writer is configured to inherit the schema from the Record, the schema that it will inherit will be the result of merging both the ‘original’ record schema and the ‘enrichment’ record schema. |
| SQL | The SQL SELECT statement to evaluate. Expression Language may be provided, but doing so may result in poorer performance. Because this Processor is dealing with two FlowFiles at a time, it ‘s also important to understand how attributes will be referenced. If both FlowFiles have an attribute with the same name but different values, the Expression Language will resolve to the value provided by the’ enrichment’ FlowFile. |
| Timeout | Specifies the maximum amount of time to wait for the second FlowFile once the first arrives at the processor, after which point the first FlowFile will be routed to the ‘timeout’ relationship. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If both the ‘original’ and ‘enrichment’ FlowFiles arrive at the processor but there was a failure in joining the records, both of those FlowFiles will be routed to this relationship. |
| joined | The resultant FlowFile with Records joined together from both the original and enrichment FlowFiles will be routed to this relationship |
| original | Both of the incoming FlowFiles (‘original’ and ‘enrichment’) will be routed to this Relationship. I.e., this is the ‘original’ version of both of these FlowFiles. |
| timeout | If one of the incoming FlowFiles (i.e., the ‘original’ FlowFile or the ‘enrichment’ FlowFile) arrives to this Processor but the other does not arrive within the configured Timeout period, the FlowFile that did arrive is routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile |

## See also

* [org.apache.nifi.processors.standard.ForkEnrichment](forkenrichment.md)

---
title: JoltTransformJSON 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/jolttransformjson.md
section: Loading & Unloading Data
---

# JoltTransformJSON 2025.10.9.21

## Bundle

org.apache.nifi | nifi-jolt-nar

## Description

Applies a list of Jolt specifications to either the FlowFile JSON content or a specified FlowFile JSON attribute. If the JSON transform fails, the original FlowFile is routed to the ‘failure’ relationship.

## Tags

cardinality, chainr, default, jolt, json, remove, shift, sort, transform

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Custom Module Directory | Comma-separated list of paths to files and/or directories which contain modules containing custom transformations (that are not included on NiFi’s classpath). |
| Custom Transformation Class Name | Fully Qualified Class Name for Custom Transformation |
| JSON Source | Specifies whether the Jolt transformation is applied to FlowFile JSON content or to specified FlowFile JSON attribute. |
| JSON Source Attribute | The FlowFile attribute containing JSON to be transformed. |
| Jolt Specification | Jolt Specification for transformation of JSON data. The value for this property may be the text of a Jolt specification or the path to a file containing a Jolt specification. ‘Jolt Specification’ must be set, or the value is ignored if the Jolt Sort Transformation is selected. |
| Jolt Transform | Specifies the Jolt Transformation that should be used with the provided specification. |
| Max String Length | The maximum allowed length of a string value when parsing the JSON document |
| Pretty Print | Apply pretty print formatting to the output of the Jolt transform |
| Transform Cache Size | Compiling a Jolt Transform can be fairly expensive. Ideally, this will be done only once. However, if the Expression Language is used in the transform, we may need a new Transform for each FlowFile. This value controls how many of those Transforms we cache in memory in order to avoid having to compile the Transform each time. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the JSON transformation fails (e.g., due to invalid JSON in the content or attribute), the original FlowFile is routed to this relationship. |
| success | The FlowFile with successfully transformed content or updated attribute will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Always set to application/json |

---
title: JoltTransformRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/jolttransformrecord.md
section: Loading & Unloading Data
---

# JoltTransformRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-jolt-nar

## Description

Applies a JOLT specification to each record in the FlowFile payload. A new FlowFile is created with transformed content and is routed to the ‘success’ relationship. If the transform fails, the original FlowFile is routed to the ‘failure’ relationship.

## Tags

cardinality, chainr, defaultr, jolt, record, removr, shiftr, sort, transform

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Custom Module Directory | Comma-separated list of paths to files and/or directories which contain modules containing custom transformations (that are not included on NiFi’s classpath). |
| Custom Transformation Class Name | Fully Qualified Class Name for Custom Transformation |
| Jolt Specification | Jolt Specification for transformation of JSON data. The value for this property may be the text of a Jolt specification or the path to a file containing a Jolt specification. ‘Jolt Specification’ must be set, or the value is ignored if the Jolt Sort Transformation is selected. |
| Jolt Transform | Specifies the Jolt Transformation that should be used with the provided specification. |
| Transform Cache Size | Compiling a Jolt Transform can be fairly expensive. Ideally, this will be done only once. However, if the Expression Language is used in the transform, we may need a new Transform for each FlowFile. This value controls how many of those Transforms we cache in memory in order to avoid having to compile the Transform each time. |
| jolt-record-record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema. |
| jolt-record-record-writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile records cannot be parsed), it will be routed to this relationship |
| original | The original FlowFile that was transformed. If the FlowFile fails processing, nothing will be sent to this relationship |
| success | The FlowFile with transformed content will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records in an outgoing FlowFile |
| mime.type | The MIME Type that the configured Record Writer indicates is appropriate |

---
title: JSLTTransformJSON 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/jslttransformjson.md
section: Loading & Unloading Data
---

# JSLTTransformJSON 2025.10.9.21

## Bundle

org.apache.nifi | nifi-jslt-nar

## Description

Applies a JSLT transformation to the FlowFile JSON payload. A new FlowFile is created with transformed content and is routed to the ‘success’ relationship. If the JSLT transform fails, the original FlowFile is routed to the ‘failure’ relationship.

## Tags

jslt, json, transform

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| jslt-transform-cache-size | Compiling a JSLT Transform can be fairly expensive. Ideally, this will be done only once. However, if the Expression Language is used in the transform, we may need a new Transform for each FlowFile. This value controls how many of those Transforms we cache in memory in order to avoid having to compile the Transform each time. |
| jslt-transform-pretty_print | Apply pretty-print formatting to the output of the JSLT transform |
| jslt-transform-result-filter | A filter for output JSON results using a JSLT expression. This property supports changing the default filter, which removes JSON objects with null values, empty objects and empty arrays from the output JSON. This JSLT must return true for each JSON object to be included and false for each object to be removed. Using a filter value of “true” to disables filtering. |
| jslt-transform-transformation | JSLT Transformation for transform of JSON data. Any NiFi Expression Language present will be evaluated first to get the final transform to be applied. The JSLT Tutorial provides an overview of supported expressions: <https://github.com/schibsted/jslt/blob/master/tutorial.md> |
| jslt-transform-transformation-strategy | Whether to apply the JSLT transformation to the entire FlowFile contents or each JSON object in the root-level array |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile is not valid JSON), it will be routed to this relationship |
| success | The FlowFile with transformed content will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Always set to application/json |

---
title: JsonConfigBasedBoxClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jsonconfigbasedboxclientservice.md
section: Loading & Unloading Data
---

# JsonConfigBasedBoxClientService

## Description

Provides Box client objects through which Box API calls can be used.

## Tags

box, client, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Account ID \* | Account ID |  |  | The ID of the Box account which the app will act on behalf of. |
| App Actor \* | App Actor | impersonated-user | * Service Account * Impersonated User | Specifies on behalf of whom Box API calls will be made. |
| App Config File | App Config File |  |  | Full path of an App config JSON file. See Additional Details for more information. |
| App Config JSON | App Config JSON |  |  | The raw JSON containing an App config. See Additional Details for more information. |
| Connect Timeout \* | Connect Timeout | 10 secs |  | Maximum amount of time to wait before failing during initial socket connection. |
| Read Timeout \* | Read Timeout | 30 secs |  | Maximum amount of time to wait before failing while reading socket responses. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JsonPathReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jsonpathreader.md
section: Loading & Unloading Data
---

# JsonPathReader

## Description

Parses JSON records and evaluates user-defined JSON Path ‘s against each JSON object. While the reader expects each record to be well-formed JSON, the content of a FlowFile may consist of many records, each as a well-formed JSON array or JSON object with optional whitespace between them, such as the common’JSON-per-line’ format. If an array is encountered, each element in that array will be treated as a separate record. User-defined properties define the fields that should be extracted from the JSON in order to form the fields of a Record. Any JSON field that is not extracted via a JSONPath will not be returned in the JSON Records.

## Tags

json, jsonpath, parser, reader, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Allow Comments \* | Allow Comments | false | * true * false | Whether to allow comments when parsing the JSON document |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Max String Length \* | Max String Length | 20 MB |  | The maximum allowed length of a string value when parsing the JSON document |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Infer Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JsonQueryElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/jsonqueryelasticsearch.md
section: Loading & Unloading Data
---

# JsonQueryElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

A processor that allows the user to run a query (with aggregations) written with the Elasticsearch JSON DSL. It does not automatically paginate queries for the user. If an incoming relationship is added to this processor, it will use the flowfile’s content for the query. Care should be taken on the size of the query because the entire response from Elasticsearch will be loaded into memory all at once and converted into the resulting flowfiles.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, get, json, query, read

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Aggregation Results Format | Format of Aggregation output. |
| Aggregation Results Split | Output a flowfile containing all aggregations or one flowfile for each individual aggregation. |
| Aggregations | One or more query aggregations (or “aggs”), in JSON syntax. Ex: {“items”: {“terms”: {“field”: “product”, “size”: 10}}} |
| Client Service | An Elasticsearch client service to use for running queries. |
| Fields | Fields of indexed documents to be retrieved, in JSON syntax. Ex: [“user.id”, “http.response.\*”, {“field”: “@timestamp”, “format”: “epoch_millis”}] |
| Index | The name of the index to use. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output No Hits | Output a “hits” flowfile even if no hits found for query. If true, an empty “hits” flowfile will be output even if “aggregations” are output. |
| Query | A query in JSON syntax, not Lucene syntax. Ex: {“query”:{“match”:{“somefield”:”somevalue”}}}. If this parameter is not set, the query will be read from the flowfile content. If the query (property and flowfile content) is empty, a default empty JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Query Clause | A “query” clause in JSON syntax, not Lucene syntax. Ex: {“match”:{“somefield”:”somevalue”}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Definition Style | How the JSON Query will be defined for use by the processor. |
| Script Fields | Fields to created using script evaluation at query runtime, in JSON syntax. Ex: {“test1”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* 2”}}, “test2”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* params.factor”, “params”: {“factor”: 2.0}}}} |
| Search Results Format | Format of Hits output. |
| Search Results Split | Output a flowfile containing all hits or one flowfile for each individual hit. |
| Size | The maximum number of documents to retrieve in the query. If the query is paginated, this “size” applies to each page of the query, not the “size” of the entire result set. |
| Sort | Sort results by one or more fields, in JSON syntax. Ex: [{“price” : {“order” : “asc”, “mode” : “avg”}}, {“post_date” : {“format”: “strict_date_optional_time_nanos”}}] |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| aggregations | Aggregations are routed to this relationship. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| hits | Search hits are routed to this relationship. |
| original | All original flowfiles that don’t cause an error to occur go to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| aggregation.name | The name of the aggregation whose results are in the output flowfile |
| aggregation.number | The number of the aggregation whose results are in the output flowfile |
| hit.count | The number of hits that are in the output flowfile |
| elasticsearch.query.error | The error message provided by Elasticsearch if there is an error querying the index. |

## See also

* [org.apache.nifi.processors.elasticsearch.PaginatedJsonQueryElasticsearch](paginatedjsonqueryelasticsearch.md)

---
title: JsonRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jsonrecordsetwriter.md
section: Loading & Unloading Data
---

# JsonRecordSetWriter

## Description

Writes the results of a RecordSet as either a JSON Array or one JSON object per line. If using Array output, then even if the RecordSet consists of a single row, it will be written as an array with a single element. If using One Line Per Object output, the JSON objects cannot be pretty-printed.

## Tags

json, record, recordset, resultset, row, serialize, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Allow Scientific Notation \* | Allow Scientific Notation | false | * true * false | Specifies whether or not scientific notation should be used when writing numbers |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Pretty Print JSON \* | Pretty Print JSON | false | * true * false | Specifies whether or not the JSON should be pretty printed |
| Schema Access Strategy \* | Schema Access Strategy | inherit-record-schema | * Inherit Record Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Cache | Schema Cache |  |  | Specifies a Schema Cache to add the Record Schema to so that Record Readers can quickly lookup the schema. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Reference Writer \* | Schema Reference Writer |  |  | Service implementation responsible for writing FlowFile attributes or content header with Schema reference information |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Schema Write Strategy \* | Schema Write Strategy | no-schema | * Do Not Write Schema * Set ‘schema.name’ Attribute * Set ‘avro.schema’ Attribute * Schema Reference Writer | Specifies how the schema for a Record should be added to the data. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Compression Format \* | compression-format | none | * none * gzip * bzip2 * xz-lzma2 * snappy * snappy framed * zstd | The compression format to use. Valid values are: GZIP, BZIP2, ZSTD, XZ-LZMA2, LZMA, Snappy, and Snappy Framed |
| Compression Level \* | compression-level | 1 | * 0 * 1 * 2 * 3 * 4 * 5 * 6 * 7 * 8 * 9 | The compression level to use; this is valid only when using GZIP compression. A lower value results in faster processing but less compression; a value of 0 indicates no compression but simply archiving |
| Output Grouping \* | output-grouping | output-array | * Array * One Line Per Object | Specifies how the writer should output the JSON records (as an array or one object per line, e.g.) Note that if ‘One Line Per Object’ is selected, then Pretty Print JSON must be false. |
| Suppress Null Values \* | suppress-nulls | never-suppress | * Never Suppress * Always Suppress * Suppress Missing Values | Specifies how the writer should handle a null field |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JsonTableColumnFilter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jsontablecolumnfilter.md
section: Loading & Unloading Data
---

# JsonTableColumnFilter

## Description

Provides a table column filter based on a JSON configuration. The JSON configuration should be an array of objects, where each object represents a table and its column filter. The object should have the following properties: - schema: the schema name of the table - table: the table name - included: an array of column names to include - excluded: an array of column names to exclude - includedPattern: a regular expression pattern to include columns - excludedPattern: a regular expression pattern to exclude columns The schema and table must be provided for each object, and one or more of the `included`, `excluded`, `includedPattern`, or `excludedPattern` properties must be provided. If any column is included as both included and excluded, the column will be excluded. If only a single filter is provided, the JSON configuration may be a single JSON object, rather than an array.

## Tags

column, database, filter, snowflake, table

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Filter JSON | Filter JSON |  |  | JSON representation of the column filter |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JsonTreeReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jsontreereader.md
section: Loading & Unloading Data
---

# JsonTreeReader

## Description

Parses JSON into individual Record objects. While the reader expects each record to be well-formed JSON, the content of a FlowFile may consist of many records, each as a well-formed JSON array or JSON object with optional whitespace between them, such as the common ‘JSON-per-line’ format. If an array is encountered, each element in that array will be treated as a separate record. If the schema that is configured contains a field that is not present in the JSON, a null value will be used. If the JSON contains a field that is not present in the schema, that field will be skipped. See the Usage of the Controller Service for more information and examples.

## Tags

json, parser, reader, record, tree

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Allow Comments \* | Allow Comments | false | * true * false | Whether to allow comments when parsing the JSON document |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Max String Length \* | Max String Length | 20 MB |  | The maximum allowed length of a string value when parsing the JSON document |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Infer Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Schema Application Strategy \* | schema-application-strategy | SELECTED_PART | * Whole JSON * Selected Part | Specifies whether the schema is defined for the whole JSON or for the selected part starting from “Starting Field Name”. |
| Schema Inference Cache | schema-inference-cache |  |  | Specifies a Schema Cache to use when inferring the schema. If not populated, the schema will be inferred each time. However, if a cache is specified, the cache will first be consulted and if the applicable schema can be found, it will be used instead of inferring the schema. |
| Starting Field Name | starting-field-name |  |  | Skips forward to the given nested JSON field (array or object) to begin processing. |
| Starting Field Strategy \* | starting-field-strategy | ROOT_NODE | * Root Node * Nested Field | Start processing from the root node or from a specified nested node. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: JWTBearerOAuth2AccessTokenProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/jwtbeareroauth2accesstokenprovider.md
section: Loading & Unloading Data
---

# JWTBearerOAuth2AccessTokenProvider

## Description

Provides OAuth 2.0 access tokens that can be used as Bearer authorization header in HTTP requests. This controller service is for implementing the OAuth 2.0 JWT Bearer Flow.

## Tags

access token, authorization, hjwt, oauth2, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Assertion Parameter Name \* | Assertion Parameter Name | assertion |  | Name of the parameter to use for the JWT assertion in the request to the token endpoint. |
| Audience | Audience |  |  | The audience claim (aud) for the JWT. Space-separated list of audiences if multiple are expected. |
| Grant Type \* | Grant Type | <urn:ietf:params:oauth:grant-type:jwt-bearer> |  | Value to set for the grant_type parameter in the request to the token endpoint. |
| Issuer | Issuer |  |  | The issuer claim (iss) for the JWT. |
| JWT Expiration Time \* | JWT Expiration Time | 1 hour |  | Expiration time used to set the corresponding claim of the JWT. In case the returned access token does not includean expiration time, this will be used with the refresh window to re-acquire a new access token. |
| JWT ID | JWT ID |  |  | The “jti” (JWT ID) claim provides a unique identifier for the JWT. The identifier value must be assigned in amanner that ensures that there’s a negligible probability that the same value will be accidentally assigned to adifferent data object; if the application uses multiple issuers, collisions MUST be prevented among values producedby different issuers as well. The “jti” value is a case-sensitive string. If set, it is recommended to set thisvalue to ${UUID()}. |
| Key ID | Key ID |  |  | The ID of the public key used to sign the JWT. It’ll be used as the kid header in the JWT. |
| Private Key Service \* | Private Key Service |  |  | The private key service to use for signing JWTs. |
| Refresh Window \* | Refresh Window | 5 minutes |  | The service will attempt to refresh tokens expiring within the refresh window, subtracting the configured duration from the token expiration. |
| SSL Context Service \* | SSL Context Service |  |  | An instance of SSLContextProvider configured with a certificate that will be used to set the x5t header. Must be using RSA algorithm. |
| Scope | Scope |  |  | The scope claim (scope) for the JWT. |
| Set JWT Header X.509 Cert Thumbprint \* | Set JWT Header X.509 Cert Thumbprint | false | * true * false | If true, will set the JWT header x5t field with the base64url-encoded SHA-256 thumbprint of the X.509 certificate’s DER encoding.If set to true, an instance of SSLContextProvider must be configured with a certificate using RSA algorithm. |
| Signing Algorithm \* | Signing Algorithm | PS256 | * RS256 * RS384 * RS512 * PS256 * PS384 * PS512 * ES256 * ES384 * ES512 * Ed25519 | The algorithm to use for signing the JWT. |
| Subject | Subject |  |  | The subject claim (sub) for the JWT. |
| Token Endpoint URL \* | Token Endpoint URL |  |  | The URL of the OAuth2 token endpoint. |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for calling the token endpoint. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Kafka3ConnectionService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/kafka3connectionservice.md
section: Loading & Unloading Data
---

# Kafka3ConnectionService

## Description

Provides and manages connections to Kafka Brokers for producer or consumer operations.

## Tags

kafka, openflow

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| SSL Context Service | SSL Context Service |  |  | Service supporting SSL communication with Kafka brokers |
| Acknowledgment Wait Time \* | ack.wait.time | 5 sec |  | After sending a message to Kafka, this indicates the amount of time that the service will wait for a response from Kafka.If Kafka does not acknowledge the message within this time period, the service will throw an exception. |
| Bootstrap Servers \* | bootstrap.servers |  |  | Comma-separated list of Kafka Bootstrap Servers in the format host:port. Corresponds to Kafka bootstrap.servers property |
| Client Timeout \* | default.api.timeout.ms | 60 sec |  | Default timeout for Kafka client operations. Mapped to Kafka default.api.timeout.ms. The Kafka request.timeout.ms property is derived from half of the configured timeout |
| Transaction Isolation Level \* | isolation.level | read_committed | * Read Committed * Read Uncommitted | Specifies how the service should handle transaction isolation levels when communicating with Kafka.The uncommited option means that messages will be received as soon as they are written to Kafka but will be pulled, even if the producer cancels the transactions.The committed option configures the service to not receive any messages for which the producer’s transaction was canceled, but this can result in some latency since theconsumer must wait for the producer to finish its entire transaction instead of pulling as the messages become available.Corresponds to Kafka isolation.level property. |
| Max Metadata Wait Time \* | max.block.ms | 5 sec |  | The amount of time publisher will wait to obtain metadata or wait for the buffer to flush during the ‘send’ call before failing theentire ‘send’ call. Corresponds to Kafka max.block.ms property |
| Max Poll Records \* | max.poll.records | 10000 |  | Maximum number of records Kafka should return in a single poll. |
| SASL Mechanism \* | sasl.mechanism | GSSAPI | * GSSAPI * PLAIN * SCRAM-SHA-256 * SCRAM-SHA-512 | SASL mechanism used for authentication. Corresponds to Kafka Client sasl.mechanism property |
| SASL Password \* | sasl.password |  |  | Password provided with configured username when using PLAIN or SCRAM SASL Mechanisms |
| SASL Username \* | sasl.username |  |  | Username provided with configured password when using PLAIN or SCRAM SASL Mechanisms |
| Security Protocol \* | security.protocol | PLAINTEXT | * PLAINTEXT * SSL * SASL_PLAINTEXT * SASL_SSL | Security protocol used to communicate with brokers. Corresponds to Kafka Client security.protocol property |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ListArchivedHubSpotData 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listarchivedhubspotdata.md
section: Loading & Unloading Data
---

# ListArchivedHubSpotData 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-hubspot-processors-nar

## Description

Lists archived data from HubSpot for the chosen object type and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes. The object type must be searchable, which means it supports access to the /search endpoint. For more information about searchable object types, see: <https://developers.hubspot.com/docs/reference/api/crm/objects/objects#search>”)

## Tags

Preview, hubspot

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| HubSpot Service | HubSpot Client Service. |
| Object Type | HubSpot object type |
| Updated After | Filter objects updated after specified date (format: yyyy-MM-dd) |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Maintains pagination state and last sync timestamp to continue data retrieval from the last known position after restarts and to fetch only changed data. |

## Relationships

| Name | Description |
| --- | --- |
| failure | HubSpot fail relationship |
| original | The input Flow File is routed to the original relationship. |
| retry | HubSpot retry relationship. FlowFiles that failed to process due to a server timeout or rate limit related error. FlowFiles routed here should be routed back into the processor. |
| success | HubSpot success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| statement.type | DELETE |
| hubspot.object.type | HubSpot Object Type for this fetch |
| hubspot.object.id | HubSpot Object ID for this fetch |
| hubspot.run.id | Timestamp of the start of this run. Obtained from the incoming FlowFile or current time if not available |
| hubspot.is_last | Whether this is the last paged object of the ingestion |

## Use cases

|  |
| --- |
| This processor is typically used in conjunction with a GenerateFlowFile processor |

## See also

* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotObject](gethubspotobject.md)
* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotSchema](gethubspotschema.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListHubSpotObjects](listhubspotobjects.md)
* [com.snowflake.openflow.runtime.processors.hubspot.PutHubSpot](puthubspot.md)

---
title: ListAzureBlobStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listazureblobstorage_v12.md
section: Loading & Unloading Data
---

# ListAzureBlobStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Lists blobs in an Azure Blob Storage container. Listing details are attached to an empty FlowFile for use with FetchAzureBlobStorage. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data. The processor uses Azure Blob Storage client library v12.

## Tags

azure, blob, cloud, microsoft, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Blob Name Prefix | Search prefix for listing |
| Container Name | Name of the Azure storage container. In case of PutAzureBlobStorage processor, container can be created if it does not exist. |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Maximum File Age | The maximum age that a file must be in order to be pulled; any file older than this amount of time (according to last modification date) will be ignored |
| Maximum File Size | The maximum size that a file can be in order to be pulled |
| Minimum File Age | The minimum age that a file must be in order to be pulled; any file younger than this amount of time (according to last modification date) will be ignored |
| Minimum File Size | The minimum size that a file must be in order to be pulled |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Storage Credentials | Controller Service used to obtain Azure Blob Storage Credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of blobs, the timestamp of the newest blob is stored if ‘Tracking Timestamps’ Listing Strategy is in use (by default). This allows the Processor to list only blobs that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.container | The name of the Azure Blob Storage container |
| azure.blobname | The name of the blob on Azure Blob Storage |
| azure.primaryUri | Primary location of the blob |
| azure.etag | ETag of the blob |
| azure.blobtype | Type of the blob (either BlockBlob, PageBlob or AppendBlob) |
| mime.type | MIME Type of the content |
| lang | Language code for the content |
| azure.timestamp | Timestamp of the blob |
| azure.length | Length of the blob |

## See also

* [org.apache.nifi.processors.azure.storage.CopyAzureBlobStorage_v12](copyazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.DeleteAzureBlobStorage_v12](deleteazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureBlobStorage_v12](fetchazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.PutAzureBlobStorage_v12](putazureblobstorage_v12.md)

---
title: ListAzureDataLakeStorage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listazuredatalakestorage.md
section: Loading & Unloading Data
---

# ListAzureDataLakeStorage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Lists directory in an Azure Data Lake Storage Gen 2 filesystem

## Tags

adlsgen2, azure, cloud, datalake, microsoft, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ADLS Credentials | Controller Service used to obtain Azure Credentials. |
| Directory Name | Name of the Azure Storage Directory. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. In case of the PutAzureDataLakeStorage processor, the directory will be created if not already existing. |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| File Filter | Only files whose names match the given regular expression will be listed |
| Filesystem Name | Name of the Azure Storage File System (also called Container). It is assumed to be already existing. |
| Include Temporary Files | Whether to include temporary files when listing the contents of configured directory paths. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Maximum File Age | The maximum age that a file must be in order to be pulled; any file older than this amount of time (according to last modification date) will be ignored |
| Maximum File Size | The maximum size that a file can be in order to be pulled |
| Minimum File Age | The minimum age that a file must be in order to be pulled; any file younger than this amount of time (according to last modification date) will be ignored |
| Minimum File Size | The minimum size that a file must be in order to be pulled |
| Path Filter | When ‘Recurse Subdirectories’ is true, then only subdirectories whose paths match the given regular expression will be scanned |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Recurse Subdirectories | Indicates whether to list files from subdirectories of the directory |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of files, the timestamp of the newest file is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.filesystem | The name of the Azure File System |
| azure.filePath | The full path of the Azure File |
| azure.directory | The name of the Azure Directory |
| azure.filename | The name of the Azure File |
| azure.length | The length of the Azure File |
| azure.lastModified | The last modification time of the Azure File |
| azure.etag | The ETag of the Azure File |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureDataLakeStorage](deleteazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureDataLakeStorage](fetchazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.PutAzureDataLakeStorage](putazuredatalakestorage.md)

---
title: ListBoxFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listboxfile.md
section: Loading & Unloading Data
---

# ListBoxFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Lists files in a Box folder. Each listed file may result in one FlowFile, the metadata being written as FlowFile attributes. Or - in case the ‘Record Writer’ property is set - the entire result is written as records to a single FlowFile. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data.

## Tags

box, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| Folder ID | The ID of the folder from which to pull list of files. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Minimum File Age | The minimum age a file must be in order to be considered; any files younger than this will be ignored. |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Search Recursively | When ‘true’, will include list of files from sub-folders. Otherwise, will return only files that are within the folder defined by the ‘Folder ID’ property. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The processor stores necessary data to be able to keep track what files have been listed already. What exactly needs to be stored depends on the ‘Listing Strategy’. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The id of the file |
| filename | The name of the file |
| path | The folder path where the file is located |
| box.size | The size of the file |
| box.timestamp | The last modified time of the file |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.PutBoxFile](putboxfile.md)

---
title: ListBoxFileInfo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listboxfileinfo.md
section: Loading & Unloading Data
---

# ListBoxFileInfo 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Fetches file metadata for each file in a Box Folder. Takes a flowFile with a folder ID attribute and outputs flowFiles with records containing all file metadata.

## Tags

box, fetch, files, folder, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Folder ID | The ID of the folder from which to fetch files. |
| Minimum File Age | The minimum age a file must be in order to be considered; any files younger than this will be ignored. |
| Record Writer | Specifies the Controller Service to use for writing the metadata records. Must be set. |
| Search Recursively | When ‘true’, will include files from sub-folders. Otherwise, will return only files that are within the folder defined by the ‘Folder ID’ property. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if there is an error fetching file metadata from the folder. |
| not.found | FlowFiles for which the specified Box folder was not found will be routed to this relationship. |
| success | A FlowFile containing the file metadata records will be routed to this relationship upon successful processing. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.folder.id | The ID of the folder from which files were fetched |
| record.count | The number of records in the FlowFile |
| mime.type | The MIME Type specified by the Record Writer |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.PutBoxFile](putboxfile.md)

---
title: ListBoxFileMetadataInstances 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listboxfilemetadatainstances.md
section: Loading & Unloading Data
---

# ListBoxFileMetadataInstances 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Retrieves all metadata instances associated with a Box file.

## Tags

box, instances, metadata, storage, templates

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file for which to fetch metadata. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if there is an error fetching metadata instances from the file. |
| not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile containing the metadata instances records will be routed to this relationship upon successful processing. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file from which metadata was fetched |
| record.count | The number of records in the FlowFile |
| mime.type | The MIME Type specified by the Record Writer |
| box.metadata.instances.names | Comma-separated list of instances names |
| box.metadata.instances.count | Number of metadata instances found |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.FetchBoxFileInfo](fetchboxfileinfo.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: ListBoxFileMetadataTemplates 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listboxfilemetadatatemplates.md
section: Loading & Unloading Data
---

# ListBoxFileMetadataTemplates 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Retrieves all metadata templates associated with a Box file.

## Tags

box, metadata, storage, templates

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file for which to fetch metadata. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if there is an error fetching metadata templates from the file. |
| not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile containing the metadata template records will be routed to this relationship upon successful processing. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.file.id | The ID of the file from which metadata was fetched |
| record.count | The number of records in the FlowFile |
| mime.type | The MIME Type specified by the Record Writer |
| box.metadata.templates.names | Comma-separated list of template names |
| box.metadata.templates.count | Number of metadata templates found |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.FetchBoxFileInfo](fetchboxfileinfo.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: ListConfluenceGroups 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listconfluencegroups.md
section: Loading & Unloading Data
---

# ListConfluenceGroups 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-atlassian-processors-nar

## Description

Processor listing Confluence groups.

## Tags

Preview, atlassian, confluence, groups

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Confluence Client Service | Controller service for managing connections to Confluence |

## Relationships

| Name | Description |
| --- | --- |
| retry | Retryable failure occurred, e.g. rate limiting |
| success | Successfully fetched Confluence group page |

## Writes attributes

| Name | Description |
| --- | --- |
| confluence.group.ids | List of identifiers of the Confluence groups. |

---
title: ListDatabaseTables 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listdatabasetables.md
section: Loading & Unloading Data
---

# ListDatabaseTables 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates a set of flow files, each containing attributes corresponding to metadata about a table from a database connection. Once metadata about a table has been fetched, it will not be fetched again until the Refresh Interval (if set) has elapsed, or until state has been manually cleared.

## Tags

database, jdbc, list, sql, table

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| list-db-include-count | Whether to include the table’s row count as a flow file attribute. This affects performance as a database query will be generated for each table in the retrieved list. |
| list-db-refresh-interval | The amount of time to elapse before resetting the processor state, thereby causing all current tables to be listed. During this interval, the processor may continue to run, but tables that have already been listed will not be re-listed. However new/added tables will be listed as the processor runs. A value of zero means the state will never be automatically reset, the user must Clear State manually. |
| list-db-tables-catalog | The name of a catalog from which to list database tables. The name must match the catalog name as it is stored in the database. If the property is not set, the catalog name will not be used to narrow the search for tables. If the property is set to an empty string, tables without a catalog will be listed. |
| list-db-tables-db-connection | The Controller Service that is used to obtain connection to database |
| list-db-tables-name-pattern | A pattern for matching tables in the database. Within a pattern, “%” means match any substring of 0 or more characters, and “_” means match any one character. The pattern must match the table name as it is stored in the database. If the property is not set, all tables will be retrieved. |
| list-db-tables-schema-pattern | A pattern for matching schemas in the database. Within a pattern, “%” means match any substring of 0 or more characters, and “_” means match any one character. The pattern must match the schema name as it is stored in the database. If the property is not set, the schema name will not be used to narrow the search for tables. If the property is set to an empty string, tables without a schema will be listed. |
| list-db-tables-types | A comma-separated list of table types to include. For example, some databases support TABLE and VIEW types. If the property is not set, tables of all types will be returned. |
| record-writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of tables, the timestamp of the query is stored. This allows the Processor to not re-list tables the next time that the Processor is run. Specifying the refresh interval in the processor properties will indicate that when the processor detects the interval has elapsed, the state will be reset and tables will be re-listed as a result. This processor is meant to be run on the primary node only. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| db.table.name | Contains the name of a database table from the connection |
| db.table.catalog | Contains the name of the catalog to which the table belongs (may be null) |
| db.table.schema | Contains the name of the schema to which the table belongs (may be null) |
| db.table.fullname | Contains the fully-qualifed table name (possibly including catalog, schema, etc.) |
| db.table.type | Contains the type of the database table from the connection. Typical types are “TABLE”, “VIEW”, “SYSTEM TABLE”, “GLOBAL TEMPORARY”, “LOCAL TEMPORARY”, “ALIAS”, “SYNONYM” |
| db.table.remarks | Contains the name of a database table from the connection |
| db.table.count | Contains the number of rows in the table |

## Use Cases Involving Other Components

|  |
| --- |
| Perform a full load of a database, retrieving all rows from all tables, or a specific set of tables. |

---
title: ListDBFSDirectory 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listdbfsdirectory.md
section: Loading & Unloading Data
---

# ListDBFSDirectory 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

List file names in a DBFS directory and output a new FlowFile with the filename.

## Tags

databricks, dbfs, openflow

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| DBFS File Path | DBFS file path e.g. /directory/file.txt |
| Databricks Client | Databricks Client Service. |
| Include Directories | Include directories in FlowFiles produced. |
| Recursive Directory Listing | Recursively list files in sub directories. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| original | The original FlowFile is routed to this relationship when processing is successful. |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | Base filename of the DBFS file or directory. |
| path | Path to parent directory containing the DBFS file or directory. |
| absolute.path | Full path to the DBFS file or directory. |
| dbfs.resourceType | The type of resource, ‘file’ or ‘directory’ of the DBFS resource. |
| dbfs.size | The size of the DBFS file. |
| dbfs.lastModifiedTime | The last modified time of the DBFS file, in milliseconds since epoch in UTC time. |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: ListDropbox 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listdropbox.md
section: Loading & Unloading Data
---

# ListDropbox 2025.10.9.21

## Bundle

org.apache.nifi | nifi-dropbox-processors-nar

## Description

Retrieves a listing of files from Dropbox (shortcuts are ignored). Each listed file may result in one FlowFile, the metadata being written as FlowFile attributes. When the ‘Record Writer’ property is set, the entire result is written as records to a single FlowFile. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data.

## Tags

dropbox, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Dropbox Credential Service | Controller Service used to obtain Dropbox credentials (App Key, App Secret, Access Token, Refresh Token). See controller service’s Additional Details for more information. |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| Folder | The Dropbox identifier or path of the folder from which to pull list of files. ‘Folder’should match the following regular expression pattern: /.\*|id:.\* . Example for folder identifier: id:odTlUvbpIEAAAAAAAAAGGQ. Example for folder path: /Team1/Task1. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Minimum File Age | The minimum age a file must be in order to be considered; any files newer than this will be ignored. |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Search Recursively | Indicates whether to list files from subfolders of the Dropbox folder. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The processor stores necessary data to be able to keep track what files have been listed already. What exactly needs to be stored depends on the ‘Listing Strategy’. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| dropbox.id | The Dropbox identifier of the file |
| path | The folder path where the file is located |
| filename | The name of the file |
| dropbox.size | The size of the file |
| dropbox.timestamp | The server modified time of the file |
| dropbox.revision | Revision of the file |

## See also

* [org.apache.nifi.processors.dropbox.FetchDropbox](fetchdropbox.md)
* [org.apache.nifi.processors.dropbox.PutDropbox](putdropbox.md)

---
title: ListenFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenftp.md
section: Loading & Unloading Data
---

# ListenFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Starts an FTP server that listens on the specified port and transforms incoming files into FlowFiles. The URI of the service will be <ftp:/>/{hostname}:{port}. The default port is 2221.

## Tags

FTP, FTPS, ingest, listen

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Address | The address the FTP server should be bound to. If not set (or set to 0.0.0.0), the server binds to all available addresses (i.e. all network interfaces of the host machine). |
| Password | If the Username is set, then a password must also be specified. The password provided by the client trying to log in to the FTP server will be checked against this password. |
| Port | The Port to listen on for incoming connections. On Linux, root privileges are required to use port numbers below 1024. |
| SSL Context Service | Specifies the SSL Context Service that can be used to create secure connections. If an SSL Context Service is selected, then a keystore file must also be specified in the SSL Context Service. Without a keystore file, the processor cannot be started successfully. Specifying a truststore file is optional. If a truststore file is specified, client authentication is required (the client needs to send a certificate to the server).Regardless of the selected TLS protocol, the highest available protocol is used for the connection. For example if NiFi is running on Java 11 and TLSv1.2 is selected in the controller service as the preferred TLS Protocol, TLSv1.3 will be used (regardless of TLSv1.2 being selected) because Java 11 supports TLSv1.3. |
| Username | The name of the user that is allowed to log in to the FTP server. If a username is provided, a password must also be provided. If no username is specified, anonymous connections will be permitted. |

## Relationships

| Name | Description |
| --- | --- |
| success | Relationship for successfully received files. |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The name of the file received via the FTP/FTPS connection. |
| path | The path pointing to the file’s target directory. E.g.: file.txt is uploaded to /Folder1/SubFolder, then the value of the path attribute will be “/Folder1/SubFolder/” (note that it ends with a separator character). |

---
title: ListenHTTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenhttp.md
section: Loading & Unloading Data
---

# ListenHTTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Starts an HTTP Server and listens on a given base path to transform incoming requests into FlowFiles. The default URI of the Service will be <http:/>/{hostname}:{port}/contentListener. Only HEAD and POST requests are supported. GET, PUT, DELETE, OPTIONS and TRACE will result in an error and the HTTP response status code 405; CONNECT will also result in an error and the HTTP response status code 400. GET is supported on <service_URI>/healthcheck. If the service is available, it returns “200 OK” with the content “OK”. The health check functionality can be configured to be accessible via a different port. For details, see the documentation of the “Listening Port for health check requests” property. A Record Reader and Record Writer property can be enabled on the processor to process incoming requests as records. Record processing is not allowed for multipart requests and request in FlowFileV3 format (minifi). If the incoming request contains a FlowFileV3 package format, the data will be unpacked automatically into individual FlowFile(s) contained within the package; the original FlowFile names are restored.

## Tags

http, https, ingest, listen, rest

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authorized DN Pattern | A Regular Expression to apply against the Subject’s Distinguished Name of incoming connections. If the Pattern does not match the Subject DN, the processor will respond with a status of HTTP 403 Forbidden. |
| Base Path | Base path for incoming connections |
| HTTP Headers to receive as Attributes (Regex) | Specifies the Regular Expression that determines the names of HTTP Headers that should be passed along as FlowFile attributes |
| HTTP Protocols | HTTP Protocols supported for Application Layer Protocol Negotiation with TLS |
| Listening Port | The Port to listen on for incoming connections |
| Max Unconfirmed Flowfile Time | The maximum amount of time to wait for a FlowFile to be confirmed before it is removed from the cache |
| Request Header Maximum Size | The maximum supported size of HTTP headers in requests sent to this processor |
| Return Code | The HTTP return code returned after every HTTP call |
| SSL Context Service | SSL Context Service enables support for HTTPS |
| authorized-issuer-dn-pattern | A Regular Expression to apply against the Issuer’s Distinguished Name of incoming connections. If the Pattern does not match the Issuer DN, the processor will respond with a status of HTTP 403 Forbidden. |
| client-authentication | Client Authentication policy for TLS connections. Required when SSL Context Service configured. |
| health-check-port | The port to listen on for incoming health check requests. If set, it must be different from the Listening Port. Configure this port if the processor is set to use two-way SSL and a load balancer that does not support client authentication for health check requests is used. Only /<base_path>/healthcheck service is available via this port and only GET and HEAD requests are supported. If the processor is set not to use SSL, SSL will not be used on this port, either. If the processor is set to use one-way SSL, one-way SSL will be used on this port. If the processor is set to use two-way SSL, one-way SSL will be used on this port (client authentication not required). |
| max-thread-pool-size | The maximum number of threads to be used by the embedded Jetty server. The value can be set between 8 and 1000. The value of this property affects the performance of the flows and the operating system, therefore the default value should only be changed in justified cases. A value that is less than the default value may be suitable if only a small number of HTTP clients connect to the server. A greater value may be suitable if a large number of HTTP clients are expected to make requests to the server simultaneously. |
| multipart-read-buffer-size | The threshold size, at which the contents of an incoming file would be written to disk. Only applies for requests with Content-Type: multipart/form-data. It is used to prevent denial of service type of attacks, to prevent filling up the heap or disk space. |
| multipart-request-max-size | The max size of the request. Only applies for requests with Content-Type: multipart/form-data, and is used to prevent denial of service type of attacks, to prevent filling up the heap or disk space |
| record-reader | The Record Reader to use parsing the incoming FlowFile into Records |
| record-writer | The Record Writer to use for serializing Records after they have been transformed |

## Relationships

| Name | Description |
| --- | --- |
| success | Relationship for successfully received FlowFiles |

## Use cases

|  |
| --- |
| Unpack FlowFileV3 content received in a POST |

## Use Cases Involving Other Components

|  |
| --- |
| Limit the date flow rate that is accepted |

---
title: ListenOTLP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenotlp.md
section: Loading & Unloading Data
---

# ListenOTLP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-opentelemetry-nar

## Description

Collect OpenTelemetry messages over HTTP or gRPC. Supports standard Export Service Request messages for logs, metrics, and traces. Implements OpenTelemetry OTLP Specification 1.0.0 with OTLP/gRPC and OTLP/HTTP. Provides protocol detection using the HTTP Content-Type header.

## Tags

OTLP, OTel, OpenTelemetry, logs, metrics, telemetry, traces

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Address | Internet Protocol Address on which to listen for OTLP Export Service Requests. The default value enables listening on all addresses. |
| Batch Size | Maximum number of OTLP request resource elements included in each FlowFile produced |
| Client Authentication | Client authentication policy for TLS communication with HTTPS |
| Port | TCP port number on which to listen for OTLP Export Service Requests over HTTP and gRPC |
| Queue Capacity | Maximum number of OTLP request resource elements that can be received and queued |
| SSL Context Service | SSL Context Service enables TLS communication for HTTPS |
| Worker Threads | Number of threads responsible for decoding and queuing incoming OTLP Export Service Requests |

## Relationships

| Name | Description |
| --- | --- |
| success | Export Service Requests containing OTLP Telemetry |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Content-Type set to application/json |
| resource.type | OpenTelemetry Resource Type: LOGS, METRICS, or TRACES |
| resource.count | Count of resource elements included in messages |

---
title: ListenSlack 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenslack.md
section: Loading & Unloading Data
---

# ListenSlack 2025.10.9.21

## Bundle

org.apache.nifi | nifi-slack-nar

## Description

Retrieves real-time messages or Slack commands from one or more Slack conversations. The messages are written out in JSON format. Note that this Processor should be used to obtain real-time messages and commands from Slack and does not provide a mechanism for obtaining historical messages. The ConsumeSlack Processor should be used for an initial load of messages from a channel. See Usage / Additional Details for more information about how to configure this Processor and enable it to retrieve messages and commands from Slack.

## Tags

command, event, listen, message, real-time, receive, slack, social media, team, text, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| App Token | The Application Token that is registered to your Slack application |
| Bot Token | The Bot Token that is registered to your Slack application |
| Event Type to Receive | Specifies the type of Event that the Processor should respond to |
| Resolve User Details | Specifies whether the Processor should lookup details about the Slack User who sent the received message. If true, the output JSON will contain an additional field named ‘userDetails’. The ‘user’ field will still contain the ID of the user. In order to enable this capability, the Bot Token must be granted the ‘users:read’ and optionally the ‘users.profile:read’ Bot Token Scope. If the rate limit is exceeded when retrieving this information, the received message will be rejected and must be re-delivered. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are created will be sent to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Set to application/json, as the output will always be in JSON format |
| slack.event.type | Set to the type of Slack event that occurred |

## See also

* [org.apache.nifi.processors.slack.ConsumeSlack](consumeslack.md)

---
title: ListenSyslog 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listensyslog.md
section: Loading & Unloading Data
---

# ListenSyslog 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Listens for Syslog messages being sent to a given port over TCP or UDP. Incoming messages are checked against regular expressions for RFC5424 and RFC3164 formatted messages. The format of each message is: (<PRIORITY>)(VERSION )(TIMESTAMP) (HOSTNAME) (BODY) where version is optional. The timestamp can be an RFC5424 timestamp with a format of “yyyy-MM-dd ‘T’HH:mm:ss. SZ” or “yyyy-MM-dd ‘T’HH:mm:ss. S+hh:mm”, or it can be an RFC3164 timestamp with a format of “MMM d HH:mm:ss”. If an incoming messages matches one of these patterns, the message will be parsed and the individual pieces will be placed in FlowFile attributes, with the original message in the content of the FlowFile. If an incoming message does not match one of these patterns it will not be parsed and the syslog.valid attribute will be set to false with the original message in the content of the FlowFile. Valid messages will be transferred on the success relationship, and invalid messages will be transferred on the invalid relationship.

## Tags

listen, logs, syslog, tcp, udp

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies the character set of the Syslog messages. Note that Expression language is not evaluated per FlowFile. |
| Client Auth | The client authentication policy to use for the SSL Context. Only used if an SSL Context Service is provided. |
| Local Network Interface | The name of a local network interface to be used to restrict listening to a specific LAN. |
| Max Batch Size | The maximum number of Syslog events to add to a single FlowFile. If multiple events are available, they will be concatenated along with the <Message Delimiter> up to this configured maximum number of messages |
| Max Size of Message Queue | The maximum size of the internal queue used to buffer messages being transferred from the underlying channel to the processor. Setting this value higher allows more messages to be buffered in memory during surges of incoming messages, but increases the total memory used by the processor. |
| Max Size of Socket Buffer | The maximum size of the socket buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Message Delimiter | Specifies the delimiter to place between Syslog messages when multiple messages are bundled together (see <Max Batch Size> property). |
| Parse Messages | Indicates if the processor should parse the Syslog messages. If set to false, each outgoing FlowFile will only contain the sender, protocol, and port, and no additional attributes. |
| Port | The port for Syslog communication. Note that Expression language is not evaluated per FlowFile. |
| Protocol | The protocol for Syslog communication. |
| Receive Buffer Size | The size of each buffer used to receive Syslog messages. Adjust this value appropriately based on the expected size of the incoming Syslog messages. When UDP is selected each buffer will hold one Syslog message. When TCP is selected messages are read from an incoming connection until the buffer is full, or the connection is closed. |
| SSL Context Service | The Controller Service to use in order to obtain an SSL Context. If this property is set, syslog messages will be received over a secure connection. |
| Socket Keep Alive | Whether or not to have TCP socket keep alive turned on. Timing details depend on operating system properties. |
| Worker Threads | Number of threads responsible for decoding and queuing incoming syslog messages |

## Relationships

| Name | Description |
| --- | --- |
| invalid | Syslog messages that do not match one of the expected formats will be sent out this relationship as a FlowFile per message. |
| success | Syslog messages that match one of the expected formats will be sent out this relationship as a FlowFile per message. |

## Writes attributes

| Name | Description |
| --- | --- |
| syslog.priority | The priority of the Syslog message. |
| syslog.severity | The severity of the Syslog message derived from the priority. |
| syslog.facility | The facility of the Syslog message derived from the priority. |
| syslog.version | The optional version from the Syslog message. |
| syslog.timestamp | The timestamp of the Syslog message. |
| syslog.hostname | The hostname or IP address of the Syslog message. |
| syslog.sender | The hostname of the Syslog server that sent the message. |
| syslog.body | The body of the Syslog message, everything after the hostname. |
| syslog.valid | An indicator of whether this message matched the expected formats. If this value is false, the other attributes will be empty and only the original message will be available in the content. |
| syslog.protocol | The protocol over which the Syslog message was received. |
| syslog.port | The port over which the Syslog message was received. |
| mime.type | The mime.type of the FlowFile which will be text/plain for Syslog messages. |

## See also

* [org.apache.nifi.processors.standard.ParseSyslog](parsesyslog.md)
* [org.apache.nifi.processors.standard.PutSyslog](putsyslog.md)

---
title: ListenTCP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listentcp.md
section: Loading & Unloading Data
---

# ListenTCP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Listens for incoming TCP connections and reads data from each connection using a line separator as the message demarcator. The default behavior is for each message to produce a single FlowFile, however this can be controlled by increasing the Batch Size to a larger value for higher throughput. The Receive Buffer Size must be set as large as the largest messages expected to be received, meaning if every 100kb there is a line separator, then the Receive Buffer Size must be greater than 100kb. The processor can be configured to use an SSL Context Service to only allow secure connections. When connected clients present certificates for mutual TLS authentication, the Distinguished Names of the client certificate’s issuer and subject are added to the outgoing FlowFiles as attributes. The processor does not perform authorization based on Distinguished Name values, but since these values are attached to the outgoing FlowFiles, authorization can be implemented based on these attributes.

## Tags

listen, ssl, tcp, tls

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batching Message Delimiter | Specifies the delimiter to place between messages when multiple messages are bundled together (see <Max Batch Size> property). |
| Character Set | Specifies the character set of the received data. |
| Client Auth | The client authentication policy to use for the SSL Context. Only used if an SSL Context Service is provided. |
| Local Network Interface | The name of a local network interface to be used to restrict listening to a specific LAN. |
| Max Batch Size | The maximum number of messages to add to a single FlowFile. If multiple messages are available, they will be concatenated along with the <Message Delimiter> up to this configured maximum number of messages |
| Max Size of Message Queue | The maximum size of the internal queue used to buffer messages being transferred from the underlying channel to the processor. Setting this value higher allows more messages to be buffered in memory during surges of incoming messages, but increases the total memory used by the processor during these surges. |
| Max Size of Socket Buffer | The maximum size of the socket buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Port | The port to listen on for communication. |
| Receive Buffer Size | The size of each buffer used to receive messages. Adjust this value appropriately based on the expected size of the incoming messages. |
| SSL Context Service | The Controller Service to use in order to obtain an SSL Context. If this property is set, messages will be received over a secure connection. |
| Worker Threads | The maximum number of worker threads available for servicing TCP connections. |
| idle-timeout | The amount of time a client’s connection will remain open if no data is received. The default of 0 seconds will leave connections open until they are closed by the client. |
| pool-receive-buffers | Enable or disable pooling of buffers that the processor uses for handling bytes received on socket connections. The framework allocates buffers as needed during processing. |

## Relationships

| Name | Description |
| --- | --- |
| success | Messages received successfully will be sent out this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| tcp.sender | The sending host of the messages. |
| tcp.port | The sending port the messages were received. |
| client.certificate.issuer.dn | For connections using mutual TLS, the Distinguished Name of the Certificate Authority that issued the client’s certificate is attached to the FlowFile. |
| client.certificate.subject.dn | For connections using mutual TLS, the Distinguished Name of the client certificate’s owner (subject) is attached to the FlowFile. |

---
title: ListenUDP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenudp.md
section: Loading & Unloading Data
---

# ListenUDP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Listens for Datagram Packets on a given port. The default behavior produces a FlowFile per datagram, however for higher throughput the Max Batch Size property may be increased to specify the number of datagrams to batch together in a single FlowFile. This processor can be restricted to listening for datagrams from a specific remote host and port by specifying the Sending Host and Sending Host Port properties, otherwise it will listen for datagrams from all hosts and ports.

## Tags

ingest, listen, source, udp

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batching Message Delimiter | Specifies the delimiter to place between messages when multiple messages are bundled together (see <Max Batch Size> property). |
| Character Set | Specifies the character set of the received data. |
| Local Network Interface | The name of a local network interface to be used to restrict listening to a specific LAN. |
| Max Batch Size | The maximum number of messages to add to a single FlowFile. If multiple messages are available, they will be concatenated along with the <Message Delimiter> up to this configured maximum number of messages |
| Max Size of Message Queue | The maximum size of the internal queue used to buffer messages being transferred from the underlying channel to the processor. Setting this value higher allows more messages to be buffered in memory during surges of incoming messages, but increases the total memory used by the processor. |
| Max Size of Socket Buffer | The maximum size of the socket buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Port | The port to listen on for communication. |
| Receive Buffer Size | The size of each buffer used to receive messages. Adjust this value appropriately based on the expected size of the incoming messages. |
| Sending Host | IP, or name, of a remote host. Only Datagrams from the specified Sending Host Port and this host will be accepted. Improves Performance. May be a system property or an environment variable. |
| Sending Host Port | Port being used by remote host to send Datagrams. Only Datagrams from the specified Sending Host and this port will be accepted. Improves Performance. May be a system property or an environment variable. |

## Relationships

| Name | Description |
| --- | --- |
| success | Messages received successfully will be sent out this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| udp.sender | The sending host of the messages. |
| udp.port | The sending port the messages were received. |

---
title: ListenUDPRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenudprecord.md
section: Loading & Unloading Data
---

# ListenUDPRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Listens for Datagram Packets on a given port and reads the content of each datagram using the configured Record Reader. Each record will then be written to a flow file using the configured Record Writer. This processor can be restricted to listening for datagrams from a specific remote host and port by specifying the Sending Host and Sending Host Port properties, otherwise it will listen for datagrams from all hosts and ports.

## Tags

ingest, listen, record, source, udp

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies the character set of the received data. |
| Local Network Interface | The name of a local network interface to be used to restrict listening to a specific LAN. |
| Max Size of Message Queue | The maximum size of the internal queue used to buffer messages being transferred from the underlying channel to the processor. Setting this value higher allows more messages to be buffered in memory during surges of incoming messages, but increases the total memory used by the processor. |
| Max Size of Socket Buffer | The maximum size of the socket buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Port | The port to listen on for communication. |
| Receive Buffer Size | The size of each buffer used to receive messages. Adjust this value appropriately based on the expected size of the incoming messages. |
| batch-size | The maximum number of datagrams to write as records to a single FlowFile. The Batch Size will only be reached when data is coming in more frequently than the Poll Timeout. |
| poll-timeout | The amount of time to wait when polling the internal queue for more datagrams. If no datagrams are found after waiting for the configured timeout, then the processor will emit whatever records have been obtained up to that point. |
| record-reader | The Record Reader to use for reading the content of incoming datagrams. |
| record-writer | The Record Writer to use in order to serialize the data before writing to a flow file. |
| sending-host | IP, or name, of a remote host. Only Datagrams from the specified Sending Host Port and this host will be accepted. Improves Performance. May be a system property or an environment variable. |
| sending-host-port | Port being used by remote host to send Datagrams. Only Datagrams from the specified Sending Host and this port will be accepted. Improves Performance. May be a system property or an environment variable. |

## Relationships

| Name | Description |
| --- | --- |
| parse.failure | If a datagram cannot be parsed using the configured Record Reader, the contents of the message will be routed to this Relationship as its own individual FlowFile. |
| success | Messages received successfully will be sent out this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| udp.sender | The sending host of the messages. |
| udp.port | The sending port the messages were received. |
| record.count | The number of records written to the flow file. |
| mime.type | The mime-type of the writer used to write the records to the flow file. |

---
title: ListenWebSocket 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listenwebsocket.md
section: Loading & Unloading Data
---

# ListenWebSocket 2025.10.9.21

## Bundle

org.apache.nifi | nifi-websocket-processors-nar

## Description

Acts as a WebSocket server endpoint to accept client connections. FlowFiles are transferred to downstream relationships according to received message types as the WebSocket server configured with this processor receives client requests

## Tags

WebSocket, consume, listen, subscribe

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| server-url-path | The WetSocket URL Path on which this processor listens to. Must starts with ‘/’, e.g. ‘/example’. |
| websocket-server-controller-service | A WebSocket SERVER Controller Service which can accept WebSocket requests. |

## Relationships

| Name | Description |
| --- | --- |
| binary message | The WebSocket binary message output |
| connected | The WebSocket session is established |
| disconnected | The WebSocket session is disconnected |
| text message | The WebSocket text message output |

## Writes attributes

| Name | Description |
| --- | --- |
| websocket.controller.service.id | WebSocket Controller Service id. |
| websocket.session.id | Established WebSocket session id. |
| websocket.endpoint.id | WebSocket endpoint id. |
| websocket.local.address | WebSocket server address. |
| websocket.remote.address | WebSocket client address. |
| websocket.message.type | TEXT or BINARY. |

---
title: ListFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listfile.md
section: Loading & Unloading Data
---

# ListFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Retrieves a listing of files from the input directory. For each file listed, creates a FlowFile that represents the file so that it can be fetched in conjunction with FetchFile. This Processor is designed to run on Primary Node only in a cluster when ‘Input Directory Location’ is set to ‘Remote’. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all the data. When ‘Input Directory Location’ is ‘Local’, the ‘Execution’ mode can be anything, and synchronization won’t happen. Unlike GetFile, this Processor does not delete any data from the local filesystem.

## Tags

file, filesystem, get, ingest, list, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Node Identifier | The configured value will be appended to the cache key so that listing state can be tracked per NiFi node rather than cluster wide when tracking state is scoped to LOCAL. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| File Filter | Only files whose names match the given regular expression will be picked up |
| Ignore Hidden Files | Indicates whether or not hidden files should be ignored |
| Include File Attributes | Whether or not to include information such as the file’s Last Modified Time and Owner as FlowFile Attributes. Depending on the File System being used, gathering this information can be expensive and as a result should be disabled. This is especially true of remote file shares. |
| Input Directory | The input directory from which files to pull files |
| Input Directory Location | Specifies where the Input Directory is located. This is used to determine whether state should be stored locally or across the cluster. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Maximum File Age | The maximum age that a file must be in order to be pulled; any file older than this amount of time (according to last modification date) will be ignored |
| Maximum File Size | The maximum size that a file can be in order to be pulled |
| Minimum File Age | The minimum age that a file must be in order to be pulled; any file younger than this amount of time (according to last modification date) will be ignored |
| Minimum File Size | The minimum size that a file must be in order to be pulled |
| Path Filter | When Recurse Subdirectories is true, then only subdirectories whose path matches the given regular expression will be scanned |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Recurse Subdirectories | Indicates whether to list files from subdirectories of the directory |
| Target System Timestamp Precision | Specify timestamp precision at the target system. Since this processor uses timestamp of entities to decide which should be listed, it is crucial to use the right timestamp precision. |
| max-listing-time | The maximum amount of time that listing any single directory is expected to take. If the listing for the directory specified by the ‘Input Directory’ property, or the listing of any subdirectory (if ‘Recurse’ is set to true) takes longer than this amount of time, a warning bulletin will be generated for each directory listing that exceeds this amount of time. |
| max-operation-time | The maximum amount of time that any single disk operation is expected to take. If any disk operation takes longer than this amount of time, a warning bulletin will be generated for each operation that exceeds this amount of time. |
| max-performance-metrics | If the ‘Track Performance’ property is set to ‘true’, this property indicates the maximum number of files whose performance metrics should be held onto. A smaller value for this property will result in less heap utilization, while a larger value may provide more accurate insights into how the disk access operations are performing |
| track-performance | Whether or not the Processor should track the performance of disk access operations. If true, all accesses to disk will be recorded, including the file being accessed, the information being obtained, and how long it takes. This is then logged periodically at a DEBUG level. While the amount of data will be capped, this option may still consume a significant amount of heap (controlled by the ‘Maximum Number of Files to Track’ property), but it can be very useful for troubleshooting purposes if performance is poor is degraded. |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | After performing a listing of files, the timestamp of the newest file is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run. Whether the state is stored with a Local or Cluster scope depends on the value of the <Input Directory Location> property. |
| CLUSTER | After performing a listing of files, the timestamp of the newest file is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run. Whether the state is stored with a Local or Cluster scope depends on the value of the <Input Directory Location> property. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The name of the file that was read from filesystem. |
| path | The path is set to the relative path of the file’s directory on filesystem compared to the Input Directory property. For example, if Input Directory is set to /tmp, then files picked up from /tmp will have the path attribute set to “/”. If the Recurse Subdirectories property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to “abc/1/2/3/”. |
| absolute.path | The absolute.path is set to the absolute path of the file’s directory on filesystem. For example, if the Input Directory property is set to /tmp, then files picked up from /tmp will have the path attribute set to “/tmp/”. If the Recurse Subdirectories property is set to true and a file is picked up from /tmp/abc/1/2/3, then the path attribute will be set to “/tmp/abc/1/2/3/”. |
| file.owner | The user that owns the file in filesystem |
| file.group | The group that owns the file in filesystem |
| file.size | The number of bytes in the file in filesystem |
| file.permissions | The permissions for the file in filesystem. This is formatted as 3 characters for the owner, 3 for the group, and 3 for other users. For example rw-rw-r– |
| file.lastModifiedTime | The timestamp of when the file in filesystem was last modified as ‘yyyy-MM-dd’T’HH:mm:ssZ’ |
| file.lastAccessTime | The timestamp of when the file in filesystem was last accessed as ‘yyyy-MM-dd’T’HH:mm:ssZ’ |
| file.creationTime | The timestamp of when the file in filesystem was created as ‘yyyy-MM-dd’T’HH:mm:ssZ’ |

## See also

* [org.apache.nifi.processors.standard.FetchFile](fetchfile.md)
* [org.apache.nifi.processors.standard.GetFile](getfile.md)
* [org.apache.nifi.processors.standard.PutFile](putfile.md)

---
title: ListFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listftp.md
section: Loading & Unloading Data
---

# ListFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Performs a listing of the files residing on an FTP server. For each file that is found on the remote server, a new FlowFile will be created with the filename attribute set to the name of the file on the remote server. This can then be used in conjunction with FetchFTP in order to fetch those files.

## Tags

files, ftp, ingest, input, list, remote, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Mode | The FTP Connection Mode |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| File Filter Regex | Provides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched |
| Follow Symbolic Links | If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Ignore Dotted Files | If true, files whose names begin with a dot (“.”) will be ignored |
| Internal Buffer Size | Set the internal buffer size for buffered data streams |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Password | Password for the user account |
| Path Filter Regex | When Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned |
| Port | The port to connect to on the remote host to fetch the data from |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Remote Path | The path on the remote system from which to pull or push files |
| Remote Poll Batch Size | The value specifies how many file paths to find in a given directory on the remote system when doing a file listing. This value in general should not need to be modified but when polling against a remote system with a tremendous number of files this value can be critical. Setting this value too high can result very poor performance and setting it too low can cause the flow to be slower than normal. |
| Search Recursively | If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories |
| Target System Timestamp Precision | Specify timestamp precision at the target system. Since this processor uses timestamp of entities to decide which should be listed, it is crucial to use the right timestamp precision. |
| Transfer Mode | The FTP Transfer Mode |
| Username | Username |
| ftp-use-utf8 | Tells the client to use UTF-8 encoding when processing files and filenames. If set to true, the server must also support UTF-8 encoding. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of files, the timestamp of the newest file is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node will not duplicate the data that was listed by the previous Primary Node. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| ftp.remote.host | The hostname of the FTP Server |
| ftp.remote.port | The port that was connected to on the FTP Server |
| ftp.listing.user | The username of the user that performed the FTP Listing |
| file.owner | The numeric owner id of the source file |
| file.group | The numeric group id of the source file |
| file.permissions | The read/write/execute permissions of the source file |
| file.size | The number of bytes in the source file |
| file.lastModifiedTime | The timestamp of when the file in the filesystem waslast modified as ‘yyyy-MM-dd’T’HH:mm:ssZ’ |
| filename | The name of the file on the FTP Server |
| path | The fully qualified name of the directory on the FTP Server from which the file was pulled |

## See also

* [org.apache.nifi.processors.standard.FetchFTP](fetchftp.md)
* [org.apache.nifi.processors.standard.GetFTP](getftp.md)
* [org.apache.nifi.processors.standard.PutFTP](putftp.md)

---
title: ListGCSBucket 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listgcsbucket.md
section: Loading & Unloading Data
---

# ListGCSBucket 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Retrieves a listing of objects from a GCS bucket. For each object that is listed, creates a FlowFile that represents the object so that it can be fetched in conjunction with FetchGCSObject. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data.

## Tags

gcs, google, google cloud, list, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| gcp-project-id | Google Cloud Project ID |
| gcp-retry-count | How many retry attempts should be made before routing to the failure relationship. |
| gcs-bucket | Bucket of the object. |
| gcs-prefix | The prefix used to filter the object list. In most cases, it should end with a forward slash ( ‘/’). |
| gcs-use-generations | Specifies whether to use GCS Generations, if applicable. If false, only the latest version of each object will be returned. |
| listing-strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| record-writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| storage-api-url | Overrides the default storage URL. Configuring an alternative Storage API URL also overrides the HTTP Host header on requests as described in the Google documentation for Private Service Connections. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of keys, the timestamp of the newest key is stored, along with the keys that share that same timestamp. This allows the Processor to list only keys that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Storage operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The name of the file |
| gcs.bucket | Bucket of the object. |
| gcs.key | Name of the object. |
| gcs.size | Size of the object. |
| gcs.cache.control | Data cache control of the object. |
| gcs.component.count | The number of components which make up the object. |
| gcs.content.disposition | The data content disposition of the object. |
| gcs.content.encoding | The content encoding of the object. |
| gcs.content.language | The content language of the object. |
| mime.type | The MIME/Content-Type of the object |
| gcs.crc32c | The CRC32C checksum of object’s data, encoded in base64 in big-endian order. |
| gcs.create.time | The creation time of the object (milliseconds) |
| gcs.update.time | The last modification time of the object (milliseconds) |
| gcs.encryption.algorithm | The algorithm used to encrypt the object. |
| gcs.encryption.sha256 | The SHA256 hash of the key used to encrypt the object |
| gcs.etag | The HTTP 1.1 Entity tag for the object. |
| gcs.generated.id | The service-generated for the object |
| gcs.generation | The data generation of the object. |
| gcs.md5 | The MD5 hash of the object’s data encoded in base64. |
| gcs.media.link | The media download link to the object. |
| gcs.metageneration | The metageneration of the object. |
| gcs.owner | The owner (uploader) of the object. |
| gcs.owner.type | The ACL entity type of the uploader of the object. |
| gcs.acl.owner | A comma-delimited list of ACL entities that have owner access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.acl.writer | A comma-delimited list of ACL entities that have write access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.acl.reader | A comma-delimited list of ACL entities that have read access to the object. Entities will be either email addresses, domains, or project IDs. |
| gcs.uri | The URI of the object as a string. |

## See also

* [org.apache.nifi.processors.gcp.storage.DeleteGCSObject](deletegcsobject.md)
* [org.apache.nifi.processors.gcp.storage.FetchGCSObject](fetchgcsobject.md)
* [org.apache.nifi.processors.gcp.storage.PutGCSObject](putgcsobject.md)

---
title: ListGoogleDrive 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listgoogledrive.md
section: Loading & Unloading Data
---

# ListGoogleDrive 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Performs a listing of concrete files (shortcuts are ignored) in a Google Drive folder. If the ‘Record Writer’ property is set, a single Output FlowFile is created, and each file in the listing is written as a single record to the output file. Otherwise, for each file in the listing, an individual FlowFile is created, the metadata being written as FlowFile attributes. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data. Please see Additional Details to set up access to Google Drive.

## Tags

drive, google, storage

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| connect-timeout | Maximum wait time for connection to Google Drive service. |
| folder-id | The ID of the folder from which to pull list of files. Please see Additional Details to set up access to Google Drive and obtain Folder ID. WARNING: Unauthorized access to the folder is treated as if the folder was empty. This results in the processor not creating outgoing FlowFiles. No additional error message is provided. |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| min-age | The minimum age a file must be in order to be considered; any files younger than this will be ignored. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| read-timeout | Maximum wait time for response from Google Drive service. |
| recursive-search | When ‘true’, will include list of files from concrete sub-folders (ignores shortcuts). Otherwise, will return only files that have the defined ‘Folder ID’ as their parent directly. WARNING: The listing may fail if there are too many sub-folders (500+). |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | The processor stores necessary data to be able to keep track what files have been listed already. What exactly needs to be stored depends on the ‘Listing Strategy’. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| drive.id | The id of the file |
| filename | The name of the file |
| mime.type | The MIME type of the file |
| drive.size | The size of the file. Set to 0 when the file size is not available (e.g. externally stored files). |
| drive.size.available | Indicates if the file size is known / available |
| drive.timestamp | The last modified time or created time (whichever is greater) of the file. The reason for this is that the original modified date of a file is preserved when uploaded to Google Drive. ‘Created time’ takes the time when the upload occurs. However uploaded files can still be modified later. |
| drive.created.time | The file’s creation time |
| drive.modified.time | The file’s last modification time |
| drive.path | The path of the file’s directory from the base directory. The path contains the folder names in URL encoded form because Google Drive allows special characters in file names, including ‘/’ (slash) and ‘' (backslash). The URL encoded folder names are separated by ‘/’ in the path. |
| drive.owner | The owner of the file |
| drive.last.modifying.user | The last modifying user of the file |
| drive.web.view.link | Web view link to the file |
| drive.web.content.link | Web content link to the file |
| drive.parent.folder.id | The id of the file’s parent folder |
| drive.parent.folder.name | The name of the file’s parent folder |
| drive.listed.folder.id | The id of the base folder that was listed |
| drive.listed.folder.name | The name of the base folder that was listed |
| drive.shared.drive.id | The id of the shared drive (if the file is located on a shared drive) |
| drive.shared.drive.name | The name of the shared drive (if the file is located on a shared drive) |

## See also

* [org.apache.nifi.processors.gcp.drive.FetchGoogleDrive](fetchgoogledrive.md)
* [org.apache.nifi.processors.gcp.drive.PutGoogleDrive](putgoogledrive.md)

---
title: ListGoogleDriveFileInfo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listgoogledrivefileinfo.md
section: Loading & Unloading Data
---

# ListGoogleDriveFileInfo 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Lists all files and folders in a specified Google Drive. The processor requires a Drive ID and can optionally list files recursively through all folders within the drive.

## Tags

cloud, drive, files, gcp, google, list, openflow, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Drive ID | The ID of the drive to list files from. This can be a shared drive ID. |
| GCP Credentials Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| Include Folders | When ‘true’, both files and folders will be included in the results. When ‘false’, only files (not folders) will be included. |
| Minimum File Age | The minimum age a file must be in order to be considered; any files younger than this will be ignored. |
| Record Writer | Specifies the Controller Service to use for writing the metadata records. Must be set. |
| Search Recursively | When ‘true’, will recursively list files in all folders within the drive. When ‘false’, will only list files at the root level of the drive. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile will be routed here if there is an error fetching file metadata. |
| retry | A FlowFile is routed here if the processor should retry the request (e.g., after rate limiting). |
| success | A FlowFile containing the file metadata records will be routed to this relationship upon successful processing. |

## Writes attributes

| Name | Description |
| --- | --- |
| google.drive.drive.id | The ID of the drive from which files were listed |
| record.count | The number of records in the FlowFile |
| mime.type | The MIME Type specified by the Record Writer |
| google.drive.error.code | The error code if the request to Google Drive API fails |
| google.drive.error.message | The error message if the request to Google Drive API fails |

## See also

* [com.snowflake.openflow.runtime.processors.google.CaptureGoogleDriveChanges](capturegoogledrivechanges.md)
* [com.snowflake.openflow.runtime.processors.google.FetchGoogleDriveMetadata](fetchgoogledrivemetadata.md)

---
title: ListGoogleGroups 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listgooglegroups.md
section: Loading & Unloading Data
---

# ListGoogleGroups 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-google-drive-nar

## Description

Lists all of the groups for a given domain in Google Workspace. It supports an optional ‘Query’ to filter the groups. The retrieved group metadata (id, etag, email, name, directMembersCount, description) are output to a Record Writer.

## Tags

cloud, directory, domain, gcp, google, groups, list

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Custom Query | Custom query to filter the returned groups. For example, ‘email=test-\*’. See Google’s Admin SDK Directory API documentation for supported syntax. |
| GCP Credentials Service | Controller Service used to obtain Google Cloud Platform credentials. |
| Google Domain | Domain name to list Google Groups (e.g., ‘example.com’). |
| Record Writer | Record writer used for writing out the records of retrieved Google Groups. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed here if the processor fails to retrieve Google Groups. |
| retry | FlowFiles are routed here if a transient failure occurs (e.g. rate-limited, socket timeouts) and should be retried. |
| success | A FlowFile containing a record set of the groups is routed here upon success. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records (groups) returned. |
| mime.type | The MIME type for the resulting FlowFile. |

## See also

* [com.snowflake.openflow.runtime.processors.google.GetGoogleGroupMembers](getgooglegroupmembers.md)

---
title: ListHubSpotObjects 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listhubspotobjects.md
section: Loading & Unloading Data
---

# ListHubSpotObjects 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-hubspot-processors-nar

## Description

Fetches data from HubSpot for specified object types, and generates one FlowFile per listed object with the corresponding metadata as FlowFile attributes. The object type must be searchable, which means it supports access to the /search endpoint. For more information about searchable object types, see: <https://developers.hubspot.com/docs/reference/api/crm/objects/objects#search>”)

## Tags

Preview, hubspot

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| HubSpot Service | HubSpot Client Service. |
| Object Type | HubSpot object type |
| Updated After | Filter objects updated after specified date (format: yyyy-MM-dd) |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | Maintains pagination state and last sync timestamp to continue data retrieval from the last known position after restarts and to fetch only changed data. |

## Relationships

| Name | Description |
| --- | --- |
| failure | HubSpot fail relationship |
| original | The input Flow File is routed to the original relationship. |
| retry | HubSpot retry relationship. FlowFiles that failed to process due to a server timeout or rate limit related error. FlowFiles routed here should be routed back into the processor. |
| success | HubSpot success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| statement.type | Always ‘UPSERT’ for this processor |
| hubspot.object.type | HubSpot Object Type for this fetch |
| hubspot.object.id | HubSpot Object ID for this fetch |
| hubspot.run.id | Timestamp of the start of this run. Obtained from the incoming FlowFile or current time if not available |
| hubspot.is_last | Whether this is the last paged object of the ingestion |

## Use cases

|  |
| --- |
| This processor is typically used in conjunction with a GenerateFlowFile processor |

## See also

* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotObject](gethubspotobject.md)
* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotSchema](gethubspotschema.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListArchivedHubSpotData](listarchivedhubspotdata.md)
* [com.snowflake.openflow.runtime.processors.hubspot.PutHubSpot](puthubspot.md)

---
title: ListMicrosoftDataverseTables 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listmicrosoftdataversetables.md
section: Loading & Unloading Data
---

# ListMicrosoftDataverseTables 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-dataverse-processors-nar

## Description

List Tables from Microsoft Dataverse environments

## Tags

dataverse

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Environment URL | URL to Microsoft Dataverse Environment |
| OAuth2 Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token. |
| Tables Filter Strategy | List of table names. Output will be limited to those names if defined. |
| Tables Filter Value | Value of Table Names filter. It is regexp or separated list, depending on selected filtering strategy. |
| Web Client Service Provider | Creates instance of web client. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFile with errors occurred while fetching from Dataverse. |
| success | FlowFile with listed tables from Dataverse. |

---
title: ListS3 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/lists3.md
section: Loading & Unloading Data
---

# ListS3 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Retrieves a listing of objects from an S3 bucket. For each object that is listed, creates a FlowFile that represents the object so that it can be fetched in conjunction with FetchS3Object. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data.

## Tags

AWS, Amazon, S3, list

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Delimiter | The string used to delimit directories within the bucket. Please consult the AWS documentation for the correct use of this field. |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| List Type | Specifies whether to use the original List Objects or the newer List Objects Version 2 endpoint. |
| Listing Batch Size | If not using a Record Writer, this property dictates how many S3 objects should be listed in a single batch. Once this number is reached, the FlowFiles that have been created will be transferred out of the Processor. Setting this value lower may result in lower latency by sending out the FlowFiles before the complete listing has finished. However, it can significantly reduce performance. Larger values may take more memory to store all of the information before sending the FlowFiles out. This property is ignored if using a Record Writer, as one of the main benefits of the Record Writer is being able to emit the entire listing as a single FlowFile. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Maximum Object Age | The maximum age that an S3 object can be in order to be considered; any object older than this amount of time (according to last modification date) will be ignored |
| Minimum Object Age | The minimum age that an S3 object must be in order to be considered; any object younger than this amount of time (according to last modification date) will be ignored |
| Prefix | The prefix used to filter the object list. Do not begin with a forward slash ‘/’. In most cases, it should end with a forward slash ‘/’. |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Region | The AWS Region to connect to. |
| Requester Pays | If true, indicates that the requester consents to pay any charges associated with listing the S3 bucket. This sets the ‘x-amz-request-payer’ header to ‘requester’. Note that this setting is not applicable when ‘Use Versions’ is ‘true’. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Use Versions | Specifies whether to use S3 versions, if applicable. If false, only the latest version of each object will be returned. |
| Write Object Tags | If set to ‘True’, the tags associated with the S3 object will be written as FlowFile attributes |
| Write User Metadata | If set to ‘True’, the user defined metadata associated with the S3 object will be added to FlowFile attributes/records |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of keys, the timestamp of the newest key is stored, along with the keys that share that same timestamp. This allows the Processor to list only keys that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| s3.bucket | The name of the S3 bucket |
| s3.region | The region of the S3 bucket |
| filename | The name of the file |
| s3.etag | The ETag that can be used to see if the file has changed |
| s3.isLatest | A boolean indicating if this is the latest version of the object |
| s3.lastModified | The last modified time in milliseconds since epoch in UTC time |
| s3.length | The size of the object in bytes |
| s3.storeClass | The storage class of the object |
| s3.version | The version of the object, if applicable |
| s3.tag.___ | If ‘Write Object Tags’ is set to ‘True’, the tags associated to the S3 object that is being listed will be written as part of the flowfile attributes |
| s3.user.metadata.___ | If ‘Write User Metadata’ is set to ‘True’, the user defined metadata associated to the S3 object that is being listed will be written as part of the flowfile attributes |

## See also

* [org.apache.nifi.processors.aws.s3.CopyS3Object](copys3object.md)
* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: ListSFDCDataShares 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsfdcdatashares.md
section: Loading & Unloading Data
---

# ListSFDCDataShares 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

List the available data shares in the organization that are available to the identified user.

## Tags

list, objects, preview, salesforce, sfdc

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Salesforce Data Cloud Client | Salesforce Data Cloud Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFile containing the list of available objects will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| nbObjects | The number of data shares listed in the organization that are available to the identified user. |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DescribeSFDCObject](describesfdcobject.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: ListSFDCObjects 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsfdcobjects.md
section: Loading & Unloading Data
---

# ListSFDCObjects 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

List the available objects in the organization that are available to the identified user.

## Tags

list, objects, preview, salesforce, sfdc

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFile containing the list of available objects will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| nbObjects | The number of objects listed in the organization that are available to the identified user. |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DescribeSFDCObject](describesfdcobject.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: ListSFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsftp.md
section: Loading & Unloading Data
---

# ListSFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Performs a listing of the files residing on an SFTP server. For each file that is found on the remote server, a new FlowFile will be created with the filename attribute set to the name of the file on the remote server. This can then be used in conjunction with FetchSFTP in order to fetch those files.

## Tags

files, ingest, input, list, remote, sftp, source

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Algorithm Negotiation | Configuration strategy for SSH algorithm negotiation |
| Ciphers Allowed | A comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, aes128-cbc, aes128-ctr, [aes128-gcm@openssh.com](mailto:aes128-gcm%40openssh.com), aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, [aes256-gcm@openssh.com](mailto:aes256-gcm%40openssh.com), arcfour128, arcfour256, blowfish-cbc, [chacha20-poly1305@openssh.com](mailto:chacha20-poly1305%40openssh.com), none |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| File Filter Regex | Provides a Java Regular Expression for filtering Filenames; if a filter is supplied, only files whose names match that Regular Expression will be fetched |
| Follow Symbolic Links | If true, will pull even symbolic files and also nested symbolic subdirectories; otherwise, will not read symbolic files and will not traverse symbolic link subdirectories |
| Host Key File | If supplied, the given file will be used as the Host Key; otherwise, if ‘Strict Host Key Checking’ property is applied (set to true) then uses the ‘known_hosts’ and ‘known_hosts2’ files from ~/.ssh directory else no host key file will be used |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Ignore Dotted Files | If true, files whose names begin with a dot (“.”) will be ignored |
| Key Algorithms Allowed | A comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, [ecdsa-sha2-nistp256-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp256-cert-v01%40openssh.com), ecdsa-sha2-nistp384, [ecdsa-sha2-nistp384-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp384-cert-v01%40openssh.com), ecdsa-sha2-nistp521, [ecdsa-sha2-nistp521-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp521-cert-v01%40openssh.com), rsa-sha2-256, [rsa-sha2-256-cert-v01@openssh.com](mailto:rsa-sha2-256-cert-v01%40openssh.com), rsa-sha2-512, [rsa-sha2-512-cert-v01@openssh.com](mailto:rsa-sha2-512-cert-v01%40openssh.com), [sk-ecdsa-sha2-nistp256@openssh.com](mailto:sk-ecdsa-sha2-nistp256%40openssh.com), [sk-ssh-ed25519@openssh.com](mailto:sk-ssh-ed25519%40openssh.com), ssh-dss, [ssh-dss-cert-v01@openssh.com](mailto:ssh-dss-cert-v01%40openssh.com), ssh-ed25519, [ssh-ed25519-cert-v01@openssh.com](mailto:ssh-ed25519-cert-v01%40openssh.com), ssh-rsa, [ssh-rsa-cert-v01@openssh.com](mailto:ssh-rsa-cert-v01%40openssh.com) |
| Key Exchange Algorithms Allowed | A comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, [curve25519-sha256@libssh.org](mailto:curve25519-sha256%40libssh.org), curve448-sha512, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group15-sha512, diffie-hellman-group16-sha512, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, mlkem1024nistp384-sha384, mlkem768nistp256-sha256, mlkem768x25519-sha256, sntrup761x25519-sha512, [sntrup761x25519-sha512@openssh.com](mailto:sntrup761x25519-sha512%40openssh.com) |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Maximum File Age | The maximum age that a file must be in order to be pulled; any file older than this amount of time (according to last modification date) will be ignored |
| Maximum File Size | The maximum size that a file can be in order to be pulled |
| Message Authentication Codes Allowed | A comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-sha1, hmac-sha1-96, [hmac-sha1-etm@openssh.com](mailto:hmac-sha1-etm%40openssh.com), hmac-sha2-256, [hmac-sha2-256-etm@openssh.com](mailto:hmac-sha2-256-etm%40openssh.com), hmac-sha2-512, [hmac-sha2-512-etm@openssh.com](mailto:hmac-sha2-512-etm%40openssh.com) |
| Minimum File Age | The minimum age that a file must be in order to be pulled; any file younger than this amount of time (according to last modification date) will be ignored |
| Minimum File Size | The minimum size that a file must be in order to be pulled |
| Password | Password for the user account |
| Path Filter Regex | When Search Recursively is true, then only subdirectories whose path matches the given Regular Expression will be scanned |
| Port | The port that the remote system is listening on for file transfers |
| Private Key Passphrase | Password for the private key |
| Private Key Path | The fully qualified path to the Private Key file |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Remote Path | The path on the remote system from which to pull or push files |
| Search Recursively | If true, will pull files from arbitrarily nested subdirectories; otherwise, will not traverse subdirectories |
| Send Keep Alive On Timeout | Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds. |
| Strict Host Key Checking | Indicates whether or not strict enforcement of hosts keys should be applied |
| Target System Timestamp Precision | Specify timestamp precision at the target system. Since this processor uses timestamp of entities to decide which should be listed, it is crucial to use the right timestamp precision. |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of files, the timestamp of the newest file is stored. This allows the Processor to list only files that have been added or modified after this date the next time that the Processor is run. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node will not duplicate the data that was listed by the previous Primary Node. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| sftp.remote.host | The hostname of the SFTP Server |
| sftp.remote.port | The port that was connected to on the SFTP Server |
| sftp.listing.user | The username of the user that performed the SFTP Listing |
| file.owner | The numeric owner id of the source file |
| file.group | The numeric group id of the source file |
| file.permissions | The read/write/execute permissions of the source file |
| file.size | The number of bytes in the source file |
| file.lastModifiedTime | The timestamp of when the file in the filesystem waslast modified as ‘yyyy-MM-dd’T’HH:mm:ssZ’ |
| filename | The name of the file on the SFTP Server |
| path | The fully qualified name of the directory on the SFTP Server from which the file was pulled |
| mime.type | The MIME Type that is provided by the configured Record Writer |

## See also

* [org.apache.nifi.processors.standard.FetchSFTP](fetchsftp.md)
* [org.apache.nifi.processors.standard.GetSFTP](getsftp.md)
* [org.apache.nifi.processors.standard.PutSFTP](putsftp.md)

---
title: ListSharepointDrives 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsharepointdrives.md
section: Loading & Unloading Data
---

# ListSharepointDrives 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-msgraph-nar

## Description

Emits a FlowFile for each Drive present in the specified Sharepoint Site.

## Tags

document, graph, microsoft, openflow, sharepoint, unstructured

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Authentication Service | The service that provides authentication for the SharePoint API. |
| Site URL | The URL of the Sharepoint Site. |

## Relationships

| Name | Description |
| --- | --- |
| success | FlowFiles for each Drive are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| sharepoint.site.url | The URL of the Sharepoint Site. |
| sharepoint.site.id | The ID of the Sharepoint Site. |
| sharepoint.drive.name | The name of the Sharepoint Drive. |
| sharepoint.drive.id | The ID of the Sharepoint Drive. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.FetchSharepointFile](fetchsharepointfile.md)
* [com.snowflake.openflow.runtime.processors.sharepoint.FindSharepointDriveItem](findsharepointdriveitem.md)

---
title: ListSharepointSiteGroups 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsharepointsitegroups.md
section: Loading & Unloading Data
---

# ListSharepointSiteGroups 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-sharepoint-rest-nar

## Description

Lists all SharePoint site groups available on a specified SharePoint site.

## Tags

groups, list, microsoft, openflow, sharepoint

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| OAuth2 Access Token Provider | Enables managed retrieval of OAuth2 Bearer Token. |
| Record Writer | Record writer used for writing out the records of retrieved Sharepoint Site Groups. |
| Site URL | The URL of the SharePoint site. |
| Web Client Service | The Web Client Service to use for communicating with Sharepoint. |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully listed all SharePoint site groups. Each group will be represented as a separate FlowFile. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records (groups) returned. |
| mime.type | The MIME type for the resulting FlowFile. |

## See also

* [com.snowflake.openflow.runtime.processors.sharepoint.rest.GetSharepointSiteGroupMembers](getsharepointsitegroupmembers.md)

---
title: ListSmb 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listsmb.md
section: Loading & Unloading Data
---

# ListSmb 2025.10.9.21

## Bundle

org.apache.nifi | nifi-smb-nar

## Description

Lists concrete files shared via SMB protocol. Each listed file may result in one FlowFile, the metadata being written as FlowFile attributes. Or - in case the ‘Record Writer’ property is set - the entire result is written as records to a single FlowFile. This Processor is designed to run on Primary Node only in a cluster. If the primary node changes, the new Primary Node will pick up where the previous node left off without duplicating all of the data.

## Tags

list, samba, smb, cifs, files

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Entity Tracking Initial Listing Target | Specify how initial listing should be handled. Used by ‘Tracking Entities’strategy. |
| Entity Tracking State Cache | Listed entities are stored in the specified cache storage so that this processor can resume listing across NiFi restart or in case of primary node change. ‘Tracking Entities’strategy require tracking information of all listed entities within the last ‘Tracking Time Window’. To support large number of entities, the strategy uses DistributedMapCache instead of managed state. Cache key format is ‘ListedEntities::{processorId}(::{nodeId})’. If it tracks per node listed entities, then the optional ‘::{nodeId}’ part is added to manage state separately. E.g. cluster wide cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b’, per node cache key =’ListedEntities::8dda2321-0164-1000-50fa-3042fe7d6a7b::nifi-node3’ The stored cache content is Gzipped JSON string. The cache key will be deleted when target listing configuration is changed. Used by ‘Tracking Entities’strategy. |
| Entity Tracking Time Window | Specify how long this processor should track already-listed entities. ‘Tracking Entities’strategy can pick any entity whose timestamp is inside the specified time window. For example, if set to ‘30 minutes’, any entity having timestamp in recent 30 minutes will be the listing target when this processor runs. A listed entity is considered ‘new/updated’ and a FlowFile is emitted if one of following condition meets: 1. does not exist in the already-listed entities, 2. has newer timestamp than the cached entity, 3. has different size than the cached entity. If a cached entity ‘s timestamp becomes older than specified time window, that entity will be removed from the cached already-listed entities. Used by’Tracking Entities’strategy. |
| Listing Strategy | Specify how to determine new/updated entities. See each strategy descriptions for detail. |
| Record Writer | Specifies the Record Writer to use for creating the listing. If not specified, one FlowFile will be created for each entity that is listed. If the Record Writer is specified, all entities will be written to a single FlowFile instead of adding attributes to individual FlowFiles. |
| Target System Timestamp Precision | Specify timestamp precision at the target system. Since this processor uses timestamp of entities to decide which should be listed, it is crucial to use the right timestamp precision. |
| directory | The network folder from which to list files. This is the remaining relative path after the share: <smb://HOSTNAME:PORT/SHARE/[DIRECTORY]/sub/directories>. It is also possible to add subdirectories. The given path on the remote file share must exist. This can be checked using verification. You may mix Windows and Linux-style directory separators. |
| file-filter | Only files whose names match the given regular expression will be listed. |
| file-name-suffix-filter | Files ending with the given suffix will be omitted. Can be used to make sure that files that are still uploading are not listed multiple times, by having those files have a suffix and remove the suffix once the upload finishes. This is highly recommended when using ‘Tracking Entities’ or ‘Tracking Timestamps’ listing strategies. |
| initial-listing-strategy | Specifies how to handle existing files on the SMB share when the processor is started for the first time (or its state has been cleared). |
| initial-listing-timestamp | The timestamp from which the files will be listed when the processor is started for the first time (or its state has been cleared). The value can be specified as an epoch timestamp in milliseconds or as a UTC datetime in a format such as 2025-02-01T00:00:00Z |
| max-file-age | Any file older than the given value will be omitted. |
| max-file-size | Any file larger than the given value will be omitted. |
| min-file-age | The minimum age that a file must be in order to be listed; any file younger than this amount of time will be ignored. |
| min-file-size | Any file smaller than the given value will be omitted. |
| path-filter | Only files whose paths (up to the file’s parent directory) match the given regular expression will be listed. |
| smb-client-provider-service | Specifies the SMB client provider to use for creating SMB connections. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a listing of files, the state of the previous listing can be stored in order to list files continuously without duplication. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles that are received are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The name of the file that was read from filesystem. |
| shortName | The short name of the file that was read from filesystem. |
| path | The path is set to the relative path of the file’s directory on the remote filesystem compared to the Share root directory. For example, for a given remote locationsmb://HOSTNAME:PORT/SHARE/DIRECTORY, and a file is being listed from smb://HOSTNAME:PORT/SHARE/DIRECTORY/sub/folder/file then the path attribute will be set to “DIRECTORY/sub/folder”. |
| serviceLocation | The SMB URL of the share. |
| lastModifiedTime | The timestamp of when the file’s content changed in the filesystem as ‘yyyy-MM-dd’T’HH:mm:ss’. |
| creationTime | The timestamp of when the file was created in the filesystem as ‘yyyy-MM-dd’T’HH:mm:ss’. |
| lastAccessTime | The timestamp of when the file was accessed in the filesystem as ‘yyyy-MM-dd’T’HH:mm:ss’. |
| changeTime | The timestamp of when the file’s attributes was changed in the filesystem as ‘yyyy-MM-dd’T’HH:mm:ss’. |
| size | The size of the file in bytes. |
| allocationSize | The number of bytes allocated for the file on the server. |

## See also

* [org.apache.nifi.processors.smb.FetchSmb](fetchsmb.md)
* [org.apache.nifi.processors.smb.GetSmbFile](getsmbfile.md)
* [org.apache.nifi.processors.smb.PutSmbFile](putsmbfile.md)

---
title: ListTableNames 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listtablenames.md
section: Loading & Unloading Data
---

# ListTableNames 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Fetches all source table names and matches them with one of the possible configurations: - regexp expression e.g. “(?i)customer.(orders|payments)” - it matches names in case insensitive way. It would match both “CUSTOMER.ORDERS” and “customer.orders” source table names. - comma separated list of source table names. e.g. “customer.orders, customer.payments”. It matches source table names in case sensitive way i.e. “customer.orders” source table will be forwarded to MATCH relationship but “customer. ORDERS” won ‘t match. Matched source tables that cannot be replicated will be routed to FAILURE relationship, each table in a separate FlowFile, with a reason in attributes. Configuration is passed as a FlowFile attribute. Source table name is represented as <schema_name>.<table_name> so both inputs should take that into consideration. Matched source table names are forwarded to MATCHED relationship. Processor generates a single FlowFile with matching tables. Disclaimers - Postgresql allows to define database object names in case sensitive or case insensitive way. When user creates a table using following query’CREATE TABLE ORDERS(id int not null) ‘then internally Postgresql stores it using lower case letters i.e. orders. To enforce case sensitivity user has to wrap the table name with double quotes i.e.’CREATE TABLE “ORDERS”(id int not null)’. This is important aspect when configuring table that we would like to replicate.

## Tags

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The Controller Service that is used to obtain a connection to the database. |
| Included Comma Separated Source Table Names | The list of comma separated list of tables to replicate. A single table should be formatted as <schema_name>.<table_name> e.g. customer.orders, customer.payments. This is combined with the regular expression to include any matching table. |
| Included Source Table Pattern | Regular Expression for specifying table names to replicate e.g. customer.(orders|payments). This is combined with the comma-separated list to include any matching table. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile attribute cannot be read or is incorrect, it will be routed to this Relationship. |
| matched | Successfully created FlowFile, with a list of matching tables found in the source database. |

## Writes attributes

| Name | Description |
| --- | --- |
| source.schema.name | Name of the schema of the table from which an event originated |
| source.table.name | Name of the table from which an event originated |
| source.entry | The original entry that was attempted to parse when processing table names |
| reason | Reason why table cannot be replicated |
| source.database.version.major | The major version of the source database. |
| mime.type | The MIME type of the FlowFile content. |

---
title: ListUnityCatalogDirectory 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/listunitycatalogdirectory.md
section: Loading & Unloading Data
---

# ListUnityCatalogDirectory 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

List file names in a Unity Catalog directory and output a new FlowFile with the filename.

## Tags

databricks, openflow, unity catalog

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Include Directories | Include directories in FlowFiles produced. |
| Recursive Directory Listing | Recursively list files in sub directories. |
| Unity Catalog Directory Path | Unity Catalog directory path e.g. /Volumes/catalog/schema/volume_name/directory |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| original | The original FlowFile is routed to this relationship when processing is successful. |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | Base filename of the Unity Catalog file or directory. |
| path | Path to parent directory containing the Unity Catalog file or directory. |
| absolute.path | Full path to the Unity Catalog file or directory. |
| uc.resourceType | The type of resource, ‘file’ or ‘directory’ of the Unity Catalog resource. |
| uc.size | The size of the Unity Catalog file. |
| uc.lastModifiedTime | The last modified time of the Unity Catalog file in milliseconds since epoch in UTC time. |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: LogAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/logattribute.md
section: Loading & Unloading Data
---

# LogAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Emits attributes of the FlowFile at the specified log level

## Tags

attributes, logging

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attributes to Ignore | A comma-separated list of Attributes to ignore. If not specified, no attributes will be ignored unless `Attributes to Ignore by Regular Expression` is modified. There’s an OR relationship between the two properties. |
| Attributes to Log | A comma-separated list of Attributes to Log. If not specified, all attributes will be logged unless `Attributes to Log by Regular Expression` is modified. There’s an AND relationship between the two properties. |
| Log FlowFile Properties | Specifies whether or not to log FlowFile “properties”, such as Entry Date, Lineage Start Date, and content size |
| Log Level | The Log Level to use when logging the Attributes |
| Log Payload | If true, the FlowFile’s payload will be logged, in addition to its attributes; otherwise, just the Attributes will be logged. |
| Log prefix | Log prefix appended to the log lines. It helps to distinguish the output of multiple LogAttribute processors. |
| Output Format | Specifies the format to use for logging FlowFile attributes |
| attributes-to-ignore-regex | A regular expression indicating the Attributes to Ignore. If not specified, no attributes will be ignored unless `Attributes to Ignore` is modified. There’s an OR relationship between the two properties. |
| attributes-to-log-regex | A regular expression indicating the Attributes to Log. If not specified, all attributes will be logged unless `Attributes to Log` is modified. There’s an AND relationship between the two properties. |
| character-set | The name of the CharacterSet to use |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles are routed to this relationship |

---
title: LoggingRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/loggingrecordsink.md
section: Loading & Unloading Data
---

# LoggingRecordSink

## Description

Provides a RecordSinkService that can be used to log records to the application log (nifi-app.log, e.g.) using the specified writer for formatting.

## Tags

log, record, sink

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Log Level \* | logsink-log-level | INFO | * TRACE * DEBUG * INFO * WARN * ERROR * FATAL * NONE | The Log Level at which to log records (INFO, DEBUG, e.g.) |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: LogMessage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/logmessage.md
section: Loading & Unloading Data
---

# LogMessage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Emits a log message at the specified log level

## Tags

attributes, logging

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| log-level | The Log Level to use when logging the message: [trace, debug, info, warn, error] |
| log-message | The log message to emit |
| log-prefix | Log prefix appended to the log lines. It helps to distinguish the output of multiple LogMessage processors. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles are routed to this relationship |

---
title: LookupAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/lookupattribute.md
section: Loading & Unloading Data
---

# LookupAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Lookup attributes from a lookup service

## Tags

Attribute Expression Language, attributes, cache, enrich, join, lookup

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| include-empty-values | Include null or blank values for keys that are null or blank |
| lookup-service | The lookup service to use for attribute lookups |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles with failing lookups are routed to this relationship |
| matched | FlowFiles with matching lookups are routed to this relationship |
| unmatched | FlowFiles with missing lookups are routed to this relationship |

---
title: LookupRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/lookuprecord.md
section: Loading & Unloading Data
---

# LookupRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Extracts one or more fields from a Record and looks up a value for those fields in a LookupService. If a result is returned by the LookupService, that result is optionally added to the Record. In this case, the processor functions as an Enrichment processor. Regardless, the Record is then routed to either the ‘matched’ relationship or ‘unmatched’ relationship (if the ‘Routing Strategy’ property is configured to do so), indicating whether or not a result was returned by the LookupService, allowing the processor to also function as a Routing processor. The “coordinates” to use for looking up a value in the Lookup Service are defined by adding a user-defined property. Each property that is added will have an entry added to a Map, where the name of the property becomes the Map Key and the value returned by the RecordPath becomes the value for that key. If multiple values are returned by the RecordPath, then the Record will be routed to the ‘unmatched’ relationship (or ‘success’, depending on the ‘Routing Strategy’ property’s configuration). If one or more fields match the Result RecordPath, all fields that match will be updated. If there is no match in the configured LookupService, then no fields will be updated. I.e., it will not overwrite an existing value in the Record with a null value. Please note, however, that if the results returned by the LookupService are not accounted for in your schema (specifically, the schema that is configured for your Record Writer) then the fields will not be written out to the FlowFile.

## Tags

avro, convert, csv, database, db, enrichment, filter, json, logs, lookup, record, route

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Root Record Path | A RecordPath that points to a child Record within each of the top-level Records in the FlowFile. If specified, the additional RecordPath properties will be evaluated against this child Record instead of the top-level Record. This allows for performing enrichment against multiple child Records within a single top-level Record. |
| lookup-service | The Lookup Service to use in order to lookup a value in each Record |
| record-path-lookup-miss-result-cache-size | Specifies how many lookup values/records should be cached. Setting this property to zero means no caching will be done and the table will be queried for each lookup value in each record. If the lookup table changes often or the most recent data must be retrieved, do not use the cache. |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-update-strategy | This property defines the strategy to use when updating the record with the value returned by the Lookup Service. |
| record-writer | Specifies the Controller Service to use for writing out the records |
| result-contents | When a result is obtained that contains a Record, this property determines whether the Record itself is inserted at the configured path or if the contents of the Record (i.e., the sub-fields) will be inserted at the configured path. |
| result-record-path | A RecordPath that points to the field whose value should be updated with whatever value is returned from the Lookup Service. If not specified, the value that is returned from the Lookup Service will be ignored, except for determining whether the FlowFile should be routed to the ‘matched’ or ‘unmatched’ Relationship. |
| routing-strategy | Specifies how to route records after a Lookup has completed |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be enriched, the unchanged FlowFile will be routed to this relationship |
| success | All records will be sent to this Relationship if configured to do so, unless a failure occurs |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile |

## See also

* [org.apache.nifi.processors.standard.ConvertRecord](convertrecord.md)
* [org.apache.nifi.processors.standard.SplitRecord](splitrecord.md)

---
title: Maintain Openflow Connector for Kinesis
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kinesis/maintenance.md
section: Loading & Unloading Data
---

# Maintain Openflow Connector for Kinesis

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to maintain the Openflow Connector for Kinesis connector, including how to manage and reset the connector state.

## Manage connector state

The Openflow Connector for Kinesis uses DynamoDB to store the consumer application state.

### DynamoDB tables created by the connector

For each Kinesis Application Name configured in the connector, the KCL creates three DynamoDB tables:

`<Kinesis Application Name>`
:   Stores the checkpointed sequence number for each shard in the stream.
    This tracks which records have been processed.

`<Kinesis Application Name>-CoordinatorState`
:   Used for coordination between workers when multiple processors share the same Application Name.

`<Kinesis Application Name>-WorkerMetricStats`
:   Used for workers to report metrics, which are used during work assignment.

In these table names, `<Kinesis Application Name>` is the value provided when you set up the connector.

If multiple processors use the same Application Name, they cooperate to consume data from the stream
and share these tables. If processors have different Application Names, each creates its own set of tables
to independently track consumed records.

For more information about DynamoDB tables, see the [AWS Kinesis Client Library documentation](https://docs.aws.amazon.com/streams/latest/dev/kcl-dynamoDB.html).

## Reset the connector state

If the connector state in DynamoDB becomes corrupted or inconsistent, you may need to reset it.
There are two approaches to reset the connector state.

### Reset by changing the Application Name

The simplest way to reset the connector state is to change the Kinesis Application Name parameter:

1. Stop the connector.
2. Navigate to the connector’s parameter context.
3. Change the `Kinesis Application Name` parameter value to a new value.
4. Start the connector.

The connector creates new DynamoDB tables with the new Application Name and begins consuming
records from the position specified by the [Kinesis Initial Stream Position](setup.md) parameter.

> **Note:**
>
> If your IAM policy restricts DynamoDB access to specific table names, you must update the policy
> to allow access to the new table names. For more information on configuring IAM permissions,
> see [Set up Openflow Connector for Kinesis for JSON data format](setup.md).

### Reset by deleting the DynamoDB tables

Alternatively, you can delete the existing DynamoDB tables to reset the state:

1. Stop the connector.
2. In the AWS Console or using the AWS CLI, delete the three DynamoDB tables associated with the Application Name:

   * `<Kinesis Application Name>`
   * `<Kinesis Application Name>-CoordinatorState`
   * `<Kinesis Application Name>-WorkerMetricStats`
3. Start the connector.

The connector recreates the tables and begins consuming records from the position specified by the [Kinesis Initial Stream Position](setup.md) parameter.

> **Warning:**
>
> Resetting the connector state causes the connector to reprocess records from the position specified by the
> initial stream position. Depending on your [Kinesis Initial Stream Position](setup.md) setting,
> this may result in duplicate data being ingested into Snowflake or data not being ingested at all.

---
title: Manage Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/manage.md
section: Loading & Unloading Data
---

# Manage Openflow

This topic describes the steps to manage Openflow components.

## Delete a deployment

Deleting a deployment removes the management compute pool and all deployment-level
configuration. You must delete all runtimes first. Any data or objects
already integrated into Snowflake aren’t affected.

> **Warning:**
>
> Deleting a deployment can’t be undone. Before you delete, make sure all runtimes
> have been removed and you no longer need the deployment configuration.

From the AWS Console:

1. Navigate to EC2 Instances.
2. Select the `openflow-agent-{deployment-key}` instance with your deployment key.
3. Click Connect at the top of the page.
4. Switch from EC2 Instance Connect to Connect using EC2 Instance Connect Endpoint. Leave the default EC2 Instance Connect Endpoint
   in place.
5. Click Connect. A new browser tab or window will appear with a
   command-line interface.
6. Run `./destroy.sh` from the shell.

   * This may take 20-30 minutes. If your connection is interrupted, the process continues running in the background.
   * You can log back in and view its status with the command: `journalctl -u docker -f -n 250`
   * The `destroy` process is complete when you see output of `delete successful`.
7. Navigate to
   [CloudFormation](https://us-east-1.console.aws.amazon.com/cloudformation/home)
   in the AWS Console for your region.
8. Delete the CloudFormation stack for your deployment.

From Snowsight:

1. In the navigation menu, select Ingestion » Openflow.
2. Select Launch Openflow.
3. Select the Deployments tab.
4. In the row of the deployment you want to delete, select the More options icon.
5. Select Delete.
6. In the confirmation dialog, type `delete` to confirm deletion.
7. Click Delete deployment.

## Upgrade a deployment

A deployment includes several components: the agent, deployment service, deployment UI,
runtime gateway, and runtime operator. You can upgrade via the UI or, for BYOC deployments,
via the deployment agent script. For details on what’s included in each release, see
[Openflow version history](version-history.md).

> **Note:**
>
> Only the owner of a deployment can perform an upgrade.

### Upgrade from the UI

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. Select Launch Openflow.
4. Select the Deployments tab.
5. Look for the upgrade arrow to the left of the deployment name. This indicates an upgrade is available.
6. Select  next to the deployment » Upgrade.

### Upgrade via the deployment agent (BYOC)

For BYOC deployments, use the deployment agent script to upgrade the agent, deployment service, deployment UI, runtime gateway, and runtime operator.

#### Connect to the deployment agent

1. Navigate to Openflow.
2. Select the Deployments tab.
3. View your deployment details and note the deployment key.
4. In your AWS account, view the EC2 instances and filter using the deployment key.
5. Locate the deployment agent EC2 instance named `openflow-agent-{deployment-key}`.
6. Connect using EC2 Instance Connect Endpoint and accepting all defaults.
7. Run the remaining commands from the new browser tab or window that appears with a command-line interface.

#### Check for available upgrades

```bash
cat ~/.upgrade
```

The script will display the latest available version of the various deployment components.

If no upgrades are available, you will see an output similar to this:

```output
AGENT_IMAGE_VERSION_UPGRADE=
OPERATOR_CHART_VERSION_UPGRADE=
GATEWAY_IMAGE_VERSION_UPGRADE=
DPS_CHART_VERSION_UPGRADE=
DPUI_CHART_VERSION_UPGRADE=
```

Otherwise, you will see the version that upgraded components will use, such as:

```output
AGENT_IMAGE_VERSION_UPGRADE=0.17.0
OPERATOR_CHART_VERSION_UPGRADE=0.31.0
GATEWAY_IMAGE_VERSION_UPGRADE=
DPS_CHART_VERSION_UPGRADE=
DPUI_CHART_VERSION_UPGRADE=
```

#### Upgrading the AMI for the Openflow BYOC deployment

When you upgrade your Openflow BYOC deployment, Openflow will find and upgrade to the latest AMI for Amazon Linux 2023 recommended by
[AWS Systems Manager](https://aws.amazon.com/systems-manager/).

If a new AMI is found, it will restart all Openflow services in your deployment, and runtimes will be temporarily halted.
Openflow runtimes and connectors maintain data integrity across restarts automatically.

Snowflake does not automatically upgrade deployments. You determine upgrade timing and frequency.

#### Initiate the upgrade

If the output indicates that upgrades are available, run the following script to initiate the upgrade. Older Openflow deployments may use the script `upgrade-data-plane.sh` instead.

```bash
./upgrade.sh
```

You will see output similar to this:

```output
openflow-data-plane-agent-aws is set to version 0.16.0
   Upgrade set to version 0.17.0
openflow-dataplane-service-chart is set to version 0.47.0
   No upgrade is available
openflow-dataplane-ui-chart is set to version 0.5.0
   No upgrade is available
openflow-runtime-gateway is set to version 2025.6.8.2
   No upgrade is available
runtime-operator-chart is set to version 0.30.0
   Upgrade set to version 0.31.0
```

Then, you have two options:

* Wait for an automatic upgrade: The system will automatically initiate the upgrade process within approximately 10 minutes.
* Manual upgrade: To start the upgrade immediately, run the following command:

```bash
./create.sh
```

#### Monitor the upgrade process

To track the progress of the upgrade, use the `journalctl` command:

```bash
journalctl -u openflow-apply-infrastructure -f -n 250
```

#### Verify a successful upgrade

A successful upgrade will typically show output similar to this:

```output
All resources applied successfully and log uploaded to s3
openflow-apply-infrastructure.service: Deactivated successfully
```

## Upgrade a runtime

Snowflake periodically releases runtime updates that introduce new Openflow processors, newer versions of
existing processors, or new runtime functionality. When updates are available, an indicator
appears next to the runtime name in the UI. For details on what’s included in each release, see
[Openflow version history](version-history.md).

> **Note:**
>
> Only the owner of a deployment can perform an upgrade.

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. Select Launch Openflow.
4. Select the Runtimes tab.
5. Look for the upgrade arrow to the left of the runtime name. This indicates an upgrade is available.
6. Select  next to the runtime » Upgrade.

## Upgrade a connector

Connector updates are made available by Snowflake when functionality is added,
processing logic is improved, or new processor versions are used–for example, to add support for a new source API version.

When connector updates are available, you will see an Upgrade icon in your process group on the canvas.

> **Note:**
>
> You can only upgrade connectors after you have upgraded their runtime.

To upgrade a connector, do the following:

1. In the navigation menu, select Ingestion » Openflow.
2. Select Launch Openflow.
3. Select the Runtimes tab.
4. Select the runtime name, or select View Canvas in the More Options menu to navigate to the canvas.
5. Find the processor groups with a red upgrade arrow next to their names. For each of these groups, change the version:

   1. Recommended: Check to see whether the parameter uses a custom value for the Parameter context. If so, make a note of the custom value. You will need to reapply it after the upgrade.

      1. Right-click the process group and select Parameters.
      2. Select Parameters in the Parameter Contexts list.
      3. Select the Inheritance tab, and check if it uses custom values. If so, make a note of the custom values.
   2. Right-click the group and select Version » Change Version.
   3. Select the latest available version and select Change.
   4. Confirm that the connector was upgraded to the latest version. The upgraded version should show a green check mark.
   5. Confirm that all processors in the connector’s process group are running. If not, start them.

      You can also validate the version by hovering over the speech bubble at the bottom right of the process group.
   6. If you noted a custom parameter value in step 4, reapply the custom value. For more information, see [Openflow connectors](connectors/about-openflow-connectors.md).

### Configure Snowflake Connector Flow Registry

> **Important:**
>
> Early preview releases of Openflow did not configure a runtime for connector upgrades.
> If you don’t see the Version option when right clicking on a process group, you
> have to configure the Snowflake Connector Flow Registry and manually enable version control for existing connectors.

To configure the Snowflake Connector Flow Registry, do the following:

1. Navigate to the canvas.
2. Click on the menu in the top right corner and select Controller Settings.
3. Switch to the Registry Clients tab.
4. Click the + icon to add a new Registry Client.
5. Select the ConnectorFlowRegistryClient and select Add.
6. Click More Options for the ConnectorFlowRegistryClient row and select Edit.
7. Enter `/nifi/configuration_resources/connector_flow_registry` as the value
   for Storage Location and select Apply.

After configuring the Snowflake Connector Flow Registry you can now enable version control for your existing connectors.

To enable version control for existing connectors, do the following:

1. Navigate to the canvas and locate the process group where you want to add version control.
2. Right click on the process group and select Version » Set Version.
3. In the Set Version dialog, choose the flow that matches your process group.

   For example, choose **sqlserver** if you are using the SQL Server connector.

   Note that flow names do not exactly match the connector name.
4. Select the latest version and then select Set version to enable version control.
5. From the canvas, right click on the process group again and select Version » Revert Local Changes
   to apply the latest connector version.
6. Review the list of changes and select Revert.
7. Confirm that your connector was upgraded to the latest version which should now show a green check mark.
   You can also validate the version by hovering over the speech bubble at the bottom right of the process group.

---
title: MapCacheClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/mapcacheclientservice.md
section: Loading & Unloading Data
---

# MapCacheClientService

## Description

Provides the ability to communicate with a MapCacheServer. This can be used in order to share a Map between nodes in a NiFi cluster

## Tags

cache, cluster, distributed, map, state

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Communications Timeout \* | Communications Timeout | 30 secs |  | Specifies how long to wait when communicating with the remote server before determining that there is a communications failure if data cannot be sent or received |
| SSL Context Service | SSL Context Service |  |  | If specified, indicates the SSL Context Service that is used to communicate with the remote server. If not specified, communications will not be encrypted |
| Server Hostname \* | Server Hostname |  |  | The name of the server that is running the DistributedMapCacheServer service |
| Server Port \* | Server Port | 4557 |  | The port on the remote server that is to be used when communicating with the DistributedMapCacheServer service |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: MapCacheServer
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/mapcacheserver.md
section: Loading & Unloading Data
---

# MapCacheServer

## Description

Provides a map (key/value) cache that can be accessed over a socket. Interaction with this service is typically accomplished via a Map Cache Client Service.

## Tags

cache, cluster, distributed, key/value, map, server

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Eviction Strategy \* | Eviction Strategy | Least Frequently Used | * Least Frequently Used * Least Recently Used * First In, First Out | Determines which strategy should be used to evict values from the cache to make room for new entries |
| Maximum Cache Entries \* | Maximum Cache Entries | 10000 |  | The maximum number of cache entries that the cache can hold |
| Persistence Directory | Persistence Directory |  |  | If specified, the cache will be persisted in the given directory; if not specified, the cache will be in-memory only |
| Port \* | Port | 4557 |  | The port to listen on for incoming connections |
| SSL Context Service | SSL Context Service |  |  | If specified, this service will be used to create an SSL Context that will be used to secure communications; if not specified, communications will not be secure |
| Maximum Read Size | maximum-read-size | 1 MB |  | The maximum number of network bytes to read for a single cache item |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: MergeContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/mergecontent.md
section: Loading & Unloading Data
---

# MergeContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Merges a Group of FlowFiles together based on a user-defined strategy and packages them into a single FlowFile. It is recommended that the Processor be configured with only a single incoming connection, as Group of FlowFiles will not be created from FlowFiles in different connections. This processor updates the mime.type attribute as appropriate. NOTE: this processor should NOT be configured with Cron Driven for the Scheduling Strategy.

## Tags

archive, concatenation, content, correlation, flowfile-stream, flowfile-stream-v3, merge, stream, tar, zip

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Strategy | Determines which FlowFile attributes should be added to the bundle. If ‘Keep All Unique Attributes’ is selected, any attribute on any FlowFile that gets bundled will be kept unless its value conflicts with the value from another FlowFile. If ‘Keep Only Common Attributes’ is selected, only the attributes that exist on all FlowFiles in the bundle, with the same value, will be preserved. |
| Bin Termination Check | Specifies an Expression Language Expression that is to be evaluated against each FlowFile. If the result of the expression is ‘true’, the bin that the FlowFile corresponds to will be terminated, even if the bin has not met the minimum number of entries or minimum size. Note that if the FlowFile that triggers the termination of the bin is itself larger than the Maximum Bin Size, it will be placed into its own bin without triggering the termination of any other bin. When using this property, it is recommended to use Prioritizers in the flow’s connections to ensure that the ordering is as desired. |
| Compression Level | Specifies the compression level to use when using the Zip Merge Format; if not using the Zip Merge Format, this value is ignored |
| Correlation Attribute Name | If specified, like FlowFiles will be binned together, where ‘like FlowFiles’ means FlowFiles that have the same value for this Attribute. If not specified, FlowFiles are bundled by the order in which they are pulled from the queue. |
| Delimiter Strategy | Determines if Header, Footer, and Demarcator should point to files containing the respective content, or if the values of the properties should be used as the content. |
| Demarcator File | Filename or text specifying the demarcator to use. If not specified, no demarcator is supplied. |
| FlowFile Insertion Strategy | If a given FlowFile terminates the bin based on the <Bin Termination Check> property, specifies where the FlowFile should be included in the bin. |
| Footer File | Filename or text specifying the footer to use. If not specified, no footer is supplied. |
| Header File | Filename or text specifying the header to use. If not specified, no header is supplied. |
| Keep Path | If using the Zip or Tar Merge Format, specifies whether or not the FlowFiles’ paths should be included in their entry names. |
| Max Bin Age | The maximum age of a Bin that will trigger a Bin to be complete. Expected format is <duration> <time unit> where <duration> is a positive integer and time unit is one of seconds, minutes, hours |
| Maximum Group Size | The maximum size for the bundle. If not specified, there is no maximum. |
| Maximum Number of Entries | The maximum number of files to include in a bundle |
| Maximum number of Bins | Specifies the maximum number of bins that can be held in memory at any one time |
| Merge Format | Determines the format that will be used to merge the content. |
| Merge Strategy | Specifies the algorithm used to merge content. The ‘Defragment’ algorithm combines fragments that are associated by attributes back into a single cohesive FlowFile. The ‘Bin-Packing Algorithm’ generates a FlowFile populated by arbitrarily chosen FlowFiles |
| Minimum Group Size | The minimum size for the bundle |
| Minimum Number of Entries | The minimum number of files to include in a bundle |
| Tar Modified Time | If using the Tar Merge Format, specifies if the Tar entry should store the modified timestamp either by expression (e.g. ${file.lastModifiedTime} or static value, both of which must match the ISO8601 format ‘yyyy-MM-dd’T ‘HH:mm:ssZ’. |
| mergecontent-metadata-strategy | For FlowFiles whose input format supports metadata (Avro, e.g.), this property determines which metadata should be added to the bundle. If ‘Use First Metadata’ is selected, the metadata keys/values from the first FlowFile to be bundled will be used. If ‘Keep Only Common Metadata’ is selected, only the metadata that exists on all FlowFiles in the bundle, with the same value, will be preserved. If ‘Ignore Metadata’ is selected, no metadata is transferred to the outgoing bundled FlowFile. If ‘Do Not Merge Uncommon Metadata’ is selected, any FlowFile whose metadata values do not match those of the first bundled FlowFile will not be merged. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the bundle cannot be created, all FlowFiles that would have been used to created the bundle will be transferred to failure |
| merged | The FlowFile containing the merged content |
| original | The FlowFiles that were used to create the bundle |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | When more than 1 file is merged, the filename comes from the segment.original.filename attribute. If that attribute does not exist in the source FlowFiles, then the filename is set to the number of nanoseconds matching system time. Then a filename extension may be applied:if Merge Format is TAR, then the filename will be appended with .tar, if Merge Format is ZIP, then the filename will be appended with .zip, if Merge Format is FlowFileStream, then the filename will be appended with .pkg |
| merge.count | The number of FlowFiles that were merged into this bundle |
| merge.bin.age | The age of the bin, in milliseconds, when it was merged and output. Effectively this is the greatest amount of time that any FlowFile in this bundle remained waiting in this processor before it was output |
| merge.uuid | UUID of the merged flow file that will be added to the original flow files attributes. |
| merge.reason | This processor allows for several thresholds to be configured for merging FlowFiles. This attribute indicates which of the Thresholds resulted in the FlowFiles being merged. For an explanation of each of the possible values and their meanings, see the Processor’s Usage / documentation and see the ‘Additional Details’ page. |

## Use cases

|  |
| --- |
| Concatenate FlowFiles with textual content together in order to create fewer, larger FlowFiles. |
| Concatenate FlowFiles with binary content together in order to create fewer, larger FlowFiles. |
| Reassemble a FlowFile that was previously split apart into smaller FlowFiles by a processor such as SplitText, UnpackContext, SplitRecord, etc. |

## See also

* [org.apache.nifi.processors.standard.MergeRecord](mergerecord.md)
* [org.apache.nifi.processors.standard.SegmentContent](segmentcontent.md)

---
title: MergeRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/mergerecord.md
section: Loading & Unloading Data
---

# MergeRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This Processor merges together multiple record-oriented FlowFiles into a single FlowFile that contains all of the Records of the input FlowFiles. This Processor works by creating ‘bins’ and then adding FlowFiles to these bins until they are full. Once a bin is full, all of the FlowFiles will be combined into a single output FlowFile, and that FlowFile will be routed to the ‘merged’ Relationship. A bin will consist of potentially many ‘like FlowFiles’. In order for two FlowFiles to be considered ‘like FlowFiles’, they must have the same Schema (as identified by the Record Reader) and, if the <Correlation Attribute Name> property is set, the same value for the specified attribute. See Processor Usage and Additional Details for more information. NOTE: this processor should NOT be configured with Cron Driven for the Scheduling Strategy.

## Tags

content, correlation, event, merge, record, stream

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Strategy | Determines which FlowFile attributes should be added to the bundle. If ‘Keep All Unique Attributes’ is selected, any attribute on any FlowFile that gets bundled will be kept unless its value conflicts with the value from another FlowFile. If ‘Keep Only Common Attributes’ is selected, only the attributes that exist on all FlowFiles in the bundle, with the same value, will be preserved. |
| correlation-attribute-name | If specified, two FlowFiles will be binned together only if they have the same value for this Attribute. If not specified, FlowFiles are bundled by the order in which they are pulled from the queue. |
| max-bin-age | The maximum age of a Bin that will trigger a Bin to be complete. Expected format is <duration> <time unit> where <duration> is a positive integer and time unit is one of seconds, minutes, hours |
| max-bin-size | The maximum size for the bundle. If not specified, there is no maximum. This is a ‘soft limit’ in that if a FlowFile is added to a bin, all records in that FlowFile will be added, so this limit may be exceeded by up to the number of bytes in last input FlowFile. |
| max-records | The maximum number of Records to include in a bin. This is a ‘soft limit’ in that if a FlowFIle is added to a bin, all records in that FlowFile will be added, so this limit may be exceeded by up to the number of records in the last input FlowFile. |
| max.bin.count | Specifies the maximum number of bins that can be held in memory at any one time. This number should not be smaller than the maximum number of concurrent threads for this Processor, or the bins that are created will often consist only of a single incoming FlowFile. |
| merge-strategy | Specifies the algorithm used to merge records. The ‘Defragment’ algorithm combines fragments that are associated by attributes back into a single cohesive FlowFile. The ‘Bin-Packing Algorithm’ generates a FlowFile populated by arbitrarily chosen FlowFiles |
| min-bin-size | The minimum size of for the bin |
| min-records | The minimum number of records to include in a bin |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the bundle cannot be created, all FlowFiles that would have been used to created the bundle will be transferred to failure |
| merged | The FlowFile containing the merged records |
| original | The FlowFiles that were used to create the bundle |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The merged FlowFile will have a ‘record.count’ attribute indicating the number of records that were written to the FlowFile. |
| mime.type | The MIME Type indicated by the Record Writer |
| merge.count | The number of FlowFiles that were merged into this bundle |
| merge.bin.age | The age of the bin, in milliseconds, when it was merged and output. Effectively this is the greatest amount of time that any FlowFile in this bundle remained waiting in this processor before it was output |
| merge.uuid | UUID of the merged FlowFile that will be added to the original FlowFiles attributes |
| merge.completion.reason | This processor allows for several thresholds to be configured for merging FlowFiles. This attribute indicates which of the Thresholds resulted in the FlowFiles being merged. For an explanation of each of the possible values and their meanings, see the Processor’s Usage / documentation and see the ‘Additional Details’ page. |
| <Attributes from Record Writer> | Any Attribute that the configured Record Writer returns will be added to the FlowFile. |

## Use cases

|  |
| --- |
| Combine together many arbitrary Records in order to create a single, larger file |

## Use Cases Involving Other Components

|  |
| --- |
| Combine together many Records that have the same value for a particular field in the data, in order to create a single, larger file |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)
* [org.apache.nifi.processors.standard.PartitionRecord](partitionrecord.md)
* [org.apache.nifi.processors.standard.SplitRecord](splitrecord.md)

---
title: MergeSnowflakeJournalTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/mergesnowflakejournaltable.md
section: Loading & Unloading Data
---

# MergeSnowflakeJournalTable 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Triggers a merge operation on changes from journal table to a destination table in Snowflake. The merge operation is performed asynchronously and the processor polls the result of the operation. If the query is still in progress the FlowFile will be penalized.

## Tags

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Destination Database Name | The name of the Snowflake database where the data is being ingested to. |
| Merge Query Retry Count | Indicates how many times the merge query should be retried if it fails. |
| Object Identifier Resolution | Controls how source object identifiers (schemas, tables, columns) are stored and queried in Snowflake. This setting determines whether you will need to use double quotes in your SQL queries. The ‘Case-Sensitive’ option is the default, production behavior — ‘Case-Insensitive’ is considered preview for the time being. |
| Placeholder Value | The value of the payload placeholder to look for in a MERGE. This will be converted to the destination column’s data type. |
| Snowflake Connection Pool | The Controller Service that is used to obtain a connection to the Snowflake database to perform merge operation. |
| Unchanged Value Strategy | Determines how the MERGE query should handle unchanged values in journal columns. By default it expects full values. |

## Relationships

| Name | Description |
| --- | --- |
| ddl | DDL to execute. |
| deleted during compaction | FlowFile deleted during compaction based on table name and generation. |
| failure | Failure query execution. |
| failure retry | Retry failure query execution. |
| poll query result | Scheduled async query execution. |
| success | Success query execution. |
| unknown file type | Unknown file type. |

## Writes attributes

| Name | Description |
| --- | --- |
| merge.query.id | The ID of the query that is used to merge the journal table into the target table. |

---
title: MicrosoftClientCertificateOAuth2TokenProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/microsoftclientcertificateoauth2tokenprovider.md
section: Loading & Unloading Data
---

# MicrosoftClientCertificateOAuth2TokenProvider

## Description

Provides OAuth2 access tokens for the Microsoft Graph API using client_credentials with a client certificate.

## Tags

access token, authorization, graph, http, microsoft, oauth2, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Client ID \* | Client ID |  |  | The Client ID for the Microsoft Graph API |
| Refresh Window \* | Refresh Window | 5 s |  | The service will attempt to refresh tokens expiring within the refresh window, subtracting the configured duration from the token expiration. |
| SSL Context Service \* | SSL Context Service |  |  | An instance of SSLContextProvider configured with a certificate and a private key which will be used to sign the JWT assertion. The keys must use RSA algorithm. |
| Tenant ID \* | Tenant ID |  |  | The Tenant ID for the Microsoft Graph API |
| Token Scope \* | Token Scope |  |  | The scope of the requested token.For Graph API should be: <https://graph.microsoft.com/.defaultFor> Sharepoint should in the following format: <https://organization.sharepoint.com/.default> |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to retrieve access tokens. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: MicrosoftGraphAuthenticationProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/microsoftgraphauthenticationprovider.md
section: Loading & Unloading Data
---

# MicrosoftGraphAuthenticationProvider

## Description

Provides authentication for the Microsoft Graph API, which can be used for interacting with Microsoft 365 services.

## Tags

graph, microsoft, openflow

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Authentication Mechanism \* | Authentication Mechanism | Client Secret | * Client Secret * Username / Password | The mechanism to use for authenticating with the Microsoft Graph API |
| Client ID \* | Client ID |  |  | The Client ID for the Microsoft Graph API |
| Client Secret \* | Client Secret |  |  | The Client Secret for the Microsoft Graph API |
| Password \* | Password |  |  | The password to use for authentication |
| Tenant ID \* | Tenant ID |  |  | The Tenant ID for the Microsoft Graph API |
| Username \* | Username |  |  | The username to use for authentication |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ModifyBytes 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/modifybytes.md
section: Loading & Unloading Data
---

# ModifyBytes 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Discard byte range at the start and end or all content of a binary file.

## Tags

binary, discard, keep

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| End Offset | Number of bytes removed at the end of the file. |
| Remove All Content | Remove all content from the FlowFile superseding Start Offset and End Offset properties. |
| Start Offset | Number of bytes removed at the beginning of the file. |

## Relationships

| Name | Description |
| --- | --- |
| success | Processed flowfiles. |

---
title: ModifyCompression 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/modifycompression.md
section: Loading & Unloading Data
---

# ModifyCompression 2025.10.9.21

## Bundle

org.apache.nifi | nifi-compress-nar

## Description

Changes the compression algorithm used to compress the contents of a FlowFile by decompressing the contents of FlowFiles using a user-specified compression algorithm and recompressing the contents using the specified compression format properties. This processor operates in a very memory efficient way so very large objects well beyond the heap size are generally fine to process

## Tags

brotli, bzip2, compress, content, deflate, gzip, lz4-framed, lzma, recompress, snappy, snappy framed, snappy-hadoop, xz-lzma2, zstd

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Input Compression Strategy | The strategy to use for decompressing input FlowFiles |
| Output Compression Level | The compression level for output FlowFiles for supported formats. A lower value results in faster processing but less compression; a value of 0 indicates no (that is, simple archiving) for gzip or minimal for xz-lzma2 compression. Higher levels can mean much larger memory usage such as the case with levels 7-9 for xz-lzma/2 so be careful relative to heap size. |
| Output Compression Strategy | The strategy to use for compressing output FlowFiles |
| Output Filename Strategy | Processing strategy for filename attribute on output FlowFiles |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles will be transferred to the failure relationship on compression modification errors |
| success | FlowFiles will be transferred to the success relationship on compression modification success |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The appropriate MIME Type is set based on the value of the Compression Format property. If the Compression Format is ‘no compression’ this attribute is removed as the MIME Type is no longer known. |

---
title: MongoDBControllerService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/mongodbcontrollerservice.md
section: Loading & Unloading Data
---

# MongoDBControllerService

## Description

Provides a controller service that configures a connection to MongoDB and provides access to that connection to other Mongo-related components.

## Tags

mongo, mongodb, service

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Database User | Database User |  |  | Database user name |
| Mongo URI \* | Mongo URI |  |  | MongoURI, typically of the form: mongodb://host1[:port1][,host2[:port2],…] |
| Password | Password |  |  | The password for the database user |
| SSL Context Service | SSL Context Service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Write Concern \* | Write Concern | ACKNOWLEDGED | * ACKNOWLEDGED * UNACKNOWLEDGED * FSYNCED * JOURNALED * REPLICA_ACKNOWLEDGED * MAJORITY * W1 * W2 * W3 | The write concern to use |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: MongoDBLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/mongodblookupservice.md
section: Loading & Unloading Data
---

# MongoDBLookupService

## Description

Provides a lookup service based around MongoDB. Each key that is specified will be added to a query as-is. For example, if you specify the two keys, user and email, the resulting query will be { “user”: “tester”, “email”: “[tester@test.com](mailto:tester%40test.com)” }. The query is limited to the first result (findOne in the Mongo documentation). If no “Lookup Value Field” is specified then the entire MongoDB result document minus the _id field will be returned as a record.

## Tags

lookup, mongo, mongodb, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Schema Access Strategy \* | Schema Access Strategy | infer | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Infer from Result | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Mongo Collection Name \* | mongo-collection-name |  |  | The name of the collection to use |
| Mongo Database Name \* | mongo-db-name |  |  | The name of the database to use |
| Client Service \* | mongo-lookup-client-service |  |  | A MongoDB controller service to use with this lookup service. |
| Projection | mongo-lookup-projection |  |  | Specifies a projection for limiting which fields will be returned. |
| Lookup Value Field | mongo-lookup-value-field |  |  | The field whose value will be returned when the lookup key(s) match a record. If not specified then the entire MongoDB result document minus the _id field will be returned as a record. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Monitor connectors using the Openflow Connectors Dashboard
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors-dashboard.md
section: Loading & Unloading Data
---

# Monitor connectors using the Openflow Connectors Dashboard

The Openflow Connectors Dashboard provides a high-level view of all installed connectors, health snapshots,
and key performance indicators, such as the aggregated average throughput and total data ingested by all connectors matching
the filter criteria.

## Prerequisites

To use the Openflow Connectors Dashboard, the following prerequisites must be met:

* You need at least read-only permissions on the event table.
* You must have the following minimum Openflow versions:

  + BYOC deployment: 1.36.0
  + Snowflake deployment: 1.26.0
  + Runtime: 2026.3.17.13
* You must have the following minimum connector versions. These versions apply to change data capture (CDC) connectors only.
  Other connector types don’t have a minimum version requirement for dashboard support.

  | Connector | Minimum version |
  | --- | --- |
  | MySQL | 0.33.0 |
  | PostgreSQL | 0.39.0 |
  | MongoDB | 0.17.0 |
  | SQL Server | 0.27.0 |
  | Oracle Embedded License | 0.25.0 |
  | Oracle Independent License | 0.24.0 |

See [Snowflake Openflow version history](version-history.md) for more information.

## Access the Openflow Connectors Dashboard

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow and navigate to the Connector Observability tab.

   The Openflow Connectors Dashboard appears.

## The Openflow Connectors Dashboard overview

The Openflow Connectors Dashboard displays the following information:

Status
:   Shows the number of connectors with the following statuses:

    * Healthy: Didn’t encounter any errors during the selected time period.
    * Unhealthy: Logged errors in the event table during the selected time period or has one or more tables in
      Failed state (change data capture (CDC) connectors only).
    * Upgrade required: Openflow deployment, runtime, or connector aren’t running the minimum required versions
      to display health and performance metrics. Review the version prerequisites and upgrade as needed.

Average throughput
:   Measures the rate at which data is read from source systems and sent to Snowflake across all connectors.

    * The Average throughput » Ingested metric measures how fast data is sent to Snowflake across all connectors that match
      the primary filter criteria (time frame and event table).
    * The Average throughput » Read metric measures how fast Openflow reads data from source systems across all connectors that match
      the primary filter criteria (time frame and event table).

Total data ingested
:   Shows how much data all connectors that match the primary filter criteria for time frame and event table have sent to Snowflake during the selected time period.
    Use this metric to quickly identify ingestion anomalies over a specific time period.

For custom telemetry queries beyond the dashboard, see [Monitor Openflow using telemetry data](monitor.md).

> **Note:**
>
> * Total data ingested and Average throughput metrics include both raw payload and structural overhead such as JSON keys, braces,
>   and delimiters. Because these metrics track the total transmitted volume, these figures might be higher than the uncompressed data reported
>   by Snowpipe Streaming or the final storage volume in your destination table.
> * The connectors appear in the list if they match the selected filter criteria and have recorded telemetry events during the selected time frame.
> * If you examine longer time frames, the list might show connectors that were previously deleted.
>
>   For example, you deployed a connector six days ago, and then deleted that connector two days ago. If you set the time frame to Last 7 days,
>   the connector appears in the list because it recorded telemetry events in the last 7 days.

### Filtering connectors

The Openflow Connectors Dashboard supports the following filters:

Event table
:   The Openflow connectors event table you want to monitor. This filter displays event tables that are associated with at least one Openflow deployment,
    as well as the default event table and the account event table. You can select only one event table at a time. Event table views are also supported.

    The event table is set when you set up Openflow.

    > **Tip:**
    >
    > To view the event table associated with an Openflow deployment, use the [DESCRIBE OPENFLOW DATA PLANE INTEGRATION](../../../sql-reference/sql/desc-oflow-data-plane-integration.md) command.
    > See [Set up Openflow - Snowflake Deployment](setup-openflow-spcs-deployment.md) or
    > [Set up Openflow - BYOC](setup-openflow-byoc.md) for more information on configuring event tables.

Time frame
:   Use this filter to identify relevant connectors in a specific time frame.

    > **Tip:**
    >
    > To get the most up-to-date results about the connector health, select the Last Hour time period.

Status
:   Enables filtering for Healthy, Unhealthy, or All connectors.

Source
:   Enables filtering by the source system based on known deployed connectors. The filter only shows sources that are used by your connectors.

Deployment
:   Enables filtering by Snowflake Openflow deployments.

    This filter displays data plane integration names, which are composed of the prefix `OPENFLOW_DATAPLANE_` followed by the deployment ID.
    To find the deployment ID, navigate to Openflow, select the Deployments tab, then select View Details.

Runtime
:   Enables filtering by Snowflake Openflow runtimes.

    This filter displays the runtime keys. To match runtime keys with Openflow runtime names in the UI, navigate to Openflow, select the Runtimes tab, then
    select View Details, and find the corresponding key.

Type
:   Enables filtering by connector type: Databases, SaaS, Streaming, Unstructured, Other.

> **Note:**
>
> * Primary filters (event table and time frame) are applied before secondary filters (status, source, deployment, runtime, or type).
> * The secondary filters (status, source, deployment, runtime, type) don’t apply to the throughput and data ingested visuals.

## Monitoring Openflow connectors

To monitor the connector details, select  » View Details.

### Change data capture connectors

The details page shows the following information for each table that is part of the change data capture configuration:

Table replication status
:   Tables can either be in Active or Failed replication status. The replication status is based on the most recent telemetry event
    that is available for the table. Events that cause replication to fail for a table immediately result in a Failed replication
    status in the dashboard. Use the Failure Reason message to identify the issue.

Error distribution
:   Helps you understand when the connector experienced issues, so that you can identify any potential problems with source systems,
    connector configuration, or the Snowflake destination.

Table name
:   Shows the schema and table names for all tables that are configured to be replicated by the connector. The list matches the
    Included Table Names or Included Table Regex configuration parameters of the connector.

Replication status
:   Shows whether each table is in Active or Failed replication status.

Replication phase
:   Shows the current table replication phase. After configuration in the connector, tables enter the New replication
    phase, progress to the Snapshot Load phase, perform the initial load, and ultimately enter the Incremental Replication phase
    when individual change data capture events are processed.

Last Ingested
:   Shows the timestamp of the last inserted record into the destination table during the selected time frame. When looking at this
    metric, consider a short delay between the records being ingested and events being logged and available to query.

You can use the Replication status, Replication phase, and time frame filters to narrow down the table list.

### All connectors

Connector status
:   Shows the connector health status: Healthy if no error messages were encountered during the selected time frame,
    or Unhealthy if any error messages were encountered.

Error distribution
:   Shows a count of how many errors this connector encountered during the selected time period.

Average throughput
:   Measures the rate at which data is read from source systems and ingested into Snowflake for the selected connector.

    * The Average throughput » Ingested metric measures how fast the selected connector ingests data into Snowflake.
    * The Average throughput » Read metric measures how fast the selected connector reads data from source systems.

Total data ingested
:   Shows how much data the selected connector has ingested into Snowflake during the selected time period.
    Use this metric to quickly identify ingestion anomalies over a specific time period.

### Custom flows

Custom flows built on the Openflow canvas can also be monitored on the dashboard, but only if they are actively
version-controlled in a customer Git repository using the Openflow Git integration. Flows that aren’t version-controlled
don’t appear in the dashboard.

For more information, see [Version control for custom flows](version-control-custom-flows.md).

## Debugging Openflow connectors

The Openflow Connectors Dashboard serves as an entry point for debugging connector-specific issues and makes all connector logs easily accessible to users.

### Viewing the connector errors

To view all errors that a connector encountered in the selected time frame, first navigate to the connector details page by
selecting  » View Details, and then select the Issues tab.

The error headline tells you what type of error the connector encountered, and the content provides the entire stacktrace of the error.

### Viewing the connector logs

You might also want to look at additional connector logs to understand the context around an error message. To view all logs for the selected connector,
select  » View logs.

After you open the log explorer, you can also change the filters to view logs for different connectors or for entire
runtimes or deployments. The log explorer supports Openflow-specific filters like the dataplane ID, the runtime key, and the process group ID.

### Accessing the Openflow canvas

When you identify a connector issue, you probably need to navigate to the Openflow canvas to fix it; for example, adjust some configuration parameters or
upgrade to a newer connector version.

To navigate to the selected connector in the Openflow canvas, select  » Go to canvas.

## Optimizing performance

### Select a larger warehouse

Use the warehouse selector in the top right section of the screen to choose a different warehouse to run the queries.

> **Note:**
>
> While larger warehouses run queries faster, they take longer to resume, which might increase the initial page load time.

### Set up clustering on the Openflow event table

By using clustering keys, you can avoid unnecessary scanning of micro-partitions during querying, significantly accelerating
the performance of queries that reference these columns. For more information, see
[What is Data Clustering?](../../tables-clustering-micropartitions.md).

Run the following query, replacing the placeholders with your Openflow event table:

```sqlsyntax
ALTER TABLE <database>.<schema>.<event_table_name>
  CLUSTER BY (
    DATE_TRUNC('HOUR', timestamp),
    RECORD_TYPE,
    CAST(record_attributes:"metricNameHash" AS STRING)
  );
```

> **Note:**
>
> * Automatic clustering consumes Snowflake credits using serverless compute resources. To learn how many credits
>   per compute-hour are consumed, refer to the “Serverless Feature Credit Table” in the
>   [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
> * After you enable clustering on your event table, a background process starts that takes some time to complete.
>   After the process is complete, you should see improved performance when using the dashboard.

### Reduce the queried time frame

Selecting a smaller time frame in the filter scans less data and leads to faster query performance.
Use the Last Hour filter for the best performance and the most up-to-date view of your connector health and performance.

## Limitations

* The Openflow Connectors Dashboard uses data stored in event tables to provide insight into Openflow connectors. Depending on the selected time period and event table,
  information provided on the dashboard might not reflect the current status of a connector.
* Detailed health monitoring is currently only available for Database CDC connectors.
* The Deployment and Runtime filters use internal names that differ from the display names in the Openflow UI.
  For details on matching these names, see Filtering connectors.

## Known issues

* After upgrading the deployment, runtime, and connector to the versions mentioned in the prerequisites, the error count metric is only accurate
  for errors encountered after the upgrade.

---
title: Monitor Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/monitor-overview.md
section: Loading & Unloading Data
---

# Monitor Openflow

Openflow provides two approaches for monitoring your data integration pipelines:

[Monitor connectors using the Openflow Connectors Dashboard](connectors-dashboard.md)
:   Use the Openflow Connectors Dashboard in Snowsight to get a high-level view of connector health, throughput,
    and data ingestion. The dashboard provides filtering, error distribution, and per-connector detail pages.

[Monitor Openflow using telemetry data](monitor.md)
:   Query the Openflow telemetry data stored in your event table to monitor logs, application metrics, JVM and system
    metrics, and build custom queries tailored to your environment.

---
title: Monitor Openflow using telemetry data
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/monitor.md
section: Loading & Unloading Data
---

# Monitor Openflow using telemetry data

This topic describes how to monitor the state of Openflow and troubleshoot problems.

## Accessing Openflow logs

Snowflake sends Openflow logs to the event table you configured when you set up Openflow
([BYOC](setup-openflow-byoc.md) | [Snowflake deployment](setup-openflow-spcs-deployment.md)).

Snowflake recommends that you include a timestamp in the WHERE clause of event table queries.
This is particularly important because of the potential volume of data generated by various Snowflake components.
By applying filters, you can retrieve a smaller subset of data, which improves query performance.

To get started quickly with Openflow’s telemetry, see Example Queries below.

## Openflow Telemetry Schema

For information about the event table columns, see [Event table columns](../../../developer-guide/logging-tracing/event-table-columns.md).

The following sections describe how Openflow structures telemetry in an Event Table.

### Resource Attributes

Describes the event metadata set by Openflow. For general information on other
types of resource attributes see [RESOURCE_ATTRIBUTES column](../../../developer-guide/logging-tracing/event-table-columns.md) in the Event Table columns documentation.

| Name | Type | Description |
| --- | --- | --- |
| application | String | The fixed value `openflow` |
| cloud.service.provider | String | One of `aws`, `snowflake` |
| container.id | String | Unique identifier of the container |
| container.image.name | String | Fully qualified name of the container image. All Openflow images are hosted by Snowflake repositories.  For example, `<account>-openflow-<env>.registry-internal.snowflakecomputing.com/openflow/openflow/openflow_repo/runtime-server` |
| container.image.tag | String | Version of the container image |
| k8s.container.name | String | The name of the K8s container. Openflow Runtime containers will start with the “Runtime Key” and end with `-gateway` or `-server`.  For example, an Openflow Runtime named “PostgreSQL CDC” with a Runtime Key of postgresql-cdc, so it would have container names of:   * postgresql-cdc-gateway * postgresql-cdc-server |
| k8s.container.restart_count | Numeric String | The number of times this container has restarted since it was created. |
| k8s.namespace.name | String | K8s namespace of the pod or container, starting with `runtime-` for Openflow Runtimes. Values also include `kube-system` and `openflow-runtime-infra`. |
| k8s.node.name | String | The internal domain name of the EKS node hosting the pod / container, or the EKS node itself.  For example, ip-10-12-13-144.us-west-2.compute.internal |
| k8s.pod.name | String | The name of the K8s pod. Openflow Runtime pods will start with the “Runtime Key” and end with a numeric identifier for each pod replica. This number can grow up to the “Max Nodes” set for the Runtime, indexed at 0.  For example, an Openflow Runtime named “PostgreSQL CDC” with a Runtime Key of postgresql-cdc and 3 nodes would have pod names of:   * postgresql-cdc-0 * postgresql-cdc-1 * postgresql-cdc-2 |
| k8s.pod.start_time | ISO 8601 Date String | Timestamp that the pod was started |
| k8s.pod.uid | UUID String | Unique identifier of the pod within the cluster |
| deployment.version | String | The Openflow deployment version. |
| openflow.dataplane.id | UUID String | The unique identifier of the Openflow Deployment, matching the “ID” shown in the Snowflake Openflow UI through Deployment > View Details. |

Resource Attributes Example:
:   ```json
    {
      "application": "openflow",
      "cloud.service.provider": "aws",
      "container.id": "a1b2c3d4e5f6",
      "container.image.name": "example-openflow-prod.registry-internal.snowflakecomputing.com/openflow/openflow/openflow_repo/runtime-server",
      "container.image.tag": "2026.3.17.13",
      "deployment.version": "1.35.0",
      "k8s.container.name": "pg-dev-server",
      "k8s.container.restart_count": "0",
      "k8s.namespace.name": "runtime-pg-dev",
      "k8s.node.name": "ip-10-10-62-36.us-east-2.compute.internal",
      "k8s.pod.name": "pg-dev-0",
      "k8s.pod.start_time": "2025-04-25T22:14:29Z",
      "k8s.pod.uid": "94610175-1685-4c8f-b0a1-42898d1058e6",
      "openflow.dataplane.id": "abeddb4f-95ae-45aa-95b1-b4752f30c64a"
    }
    ```

### Scope

| Name | Type | Description |
| --- | --- | --- |
| name | String | Provider of the metric. One of:   * `runtime` for Openflow Connector metrics * `github.com/open-telemetry/opentelemetry-collector-contrib/receiver/kubeletstatsreceiver` for system-level metrics |

Scope Example:
:   ```json
    {
      "name": "runtime"
    }
    ```

### Record Type

Depending on the type of Openflow telemetry represented by this row, this will be one of:

* LOG
* METRIC

Openflow does not collect TRACE records, but that is also a valid type for this column in Snowflake Event Tables.

### Record

Optional. This JSON object describes the type of metric represented by this row.

| Name | Type | Description |
| --- | --- | --- |
| metric | Object | Contains two fields:   * `name` for the unique metric produced, typically using dot-delimited namespaces * `unit` for the value represented by the type, such as byte, nanosecond, and thread   The name and unit values vary widely. For the full list, see Application Metrics below. |
| metric_type | String | One of:   * `gauge` for most Openflow metrics, a snapshot value that can increase or decrease * `sum` for cumulative metrics like pod CPU time and network IO |
| value_type | String | The primitive type of the value produced by this metric. One of:   * INT * DOUBLE |
| aggregation_temporality | String | Optional. Set to cumulative for metrics that are strictly increasing and dependent on previous values, such as pod CPU time and network IO. |
| is_monotonic | Boolean | Optional. For cumulative metrics, this is true to show that it is strictly increasing within the time series. |

Record Example:
:   ```json
    {
      "metric": {
        "name": "connection.queued.duration.max",
        "unit": "millisecond"
      },
      "metric_type": "gauge",
      "value_type": "INT"
    }
    ```

### Record Attributes

#### Logs

Record attributes for Logs will typically indicate where this log was sourced. For example, logs from an Openflow Runtime named `testruntime` could have Record Attributes of:

> ```json
> {
>   "log.file.path": "/var/log/pods/runtime-testruntime_testruntime-0_66d80cdb-9484-40a4-bdba-f92eb0af14c7/testruntime-server/0.log",
>   "log.iostream": "stdout",
>   "logtag": "F"
> }
> ```

#### System Metrics

System metrics like CPU usage will typically not set Record Attributes, so this will be `null`.

#### Openflow Application Metrics

Record Attributes for Application or “Flow” metrics provide details about the component in the data pipeline that produced the metric. This will vary based on the type of component. See Application Metrics

> ```json
> {
>   "component": "PutSnowpipeStreaming",
>   "execution.node": "ALL",
>   "group.id": "c052f9d7-7f76-3013-a2c5-d3b064fa7326",
>   "id": "c69e2913-22a9-36bb-a159-6a5ed1fb9d63",
>   "name": "PutSnowpipeStreaming",
>   "type": "processor"
> }
> ```

### Value

This column contains the raw value of the telemetry. For metrics, this will be a numeric value (integer or double). For logs, this will either be a semi-structured string value or a well-formatted JSON string.

#### Openflow Runtime Logs

Openflow Runtimes emit most logs as JSON, so applying Snowflake’s [TRY_PARSE_JSON](../../../sql-reference/functions/try_parse_json.md) to the `VALUE` column allows you to further break this value into the following structured fields:

| Name | Type | Description |
| --- | --- | --- |
| formattedMessage | String | The actual log message emitted from the Runtime logger. |
| level | String | One of:   * ERROR * WARN * INFO * DEBUG * TRACE |
| loggerName | String | The fully qualified classname for the logger. Openflow processors will typically use logger names that start with `com.snowflake.openflow.runtime.processors`.  This is useful to view logs for a specific processor, controller service, or bundled library. |
| nanoseconds | Integer | Nanosecond-level time that this log message was created, starting at milliseconds.  For example, a nanosecond value of 111222333 could correspond to a timestamp value of 1749180210111 with the leftmost 3 digits of nanosecond matching the right-most 3 digits of timestamp. |
| threadName | String | Name of the thread handling this call. For example, `Timer-Driven Process Thread-7` |
| throwable | JSON Object | `null` when there is no exception or stacktrace for this log message. Otherwise, it logs the stacktrace as a JSON string with fields:   * `className` - the exception thrown * `message` - any message logged with the exception * `stepArray` - array of method calls for the stack trace, including:    + `className`   + `fileName`   + `lineNumber`   + `methodName` |
| timestamp | Integer | Time that this log message was created, represented as milliseconds since the UNIX epoch.  For example, 1749180210044 indicates that the log was created at 2025-06-05 03:23:30.044 UTC |
| mdc | JSON Object | Mapped Diagnostic Context (MDC) providing additional flow-level context for the log entry. Contains the following fields:   * `processGroupId` - unique identifier of the process group * `processGroupIdPath` - hierarchical path of process group IDs * `processGroupName` - name of the process group * `processGroupNamePath` - hierarchical path of process group names * `registeredFlowIdentifier` - identifier of the registered flow (present for all versioned flows, including out-of-the-box Openflow connectors) * `registeredFlowVersion` - version of the registered flow (present for all versioned flows, including out-of-the-box Openflow connectors)   For example:  ```json {   "processGroupId": "6dc1d98f-019d-1000-ffff-ffffa3ba8a09",   "processGroupIdPath": "/58385a8b-019d-1000-2a52-9ef1c34b0e5f/6dc1d98f-019d-1000-ffff-ffffa3ba8a09",   "processGroupName": "latency targets",   "processGroupNamePath": "/Openflow/latency targets",   "registeredFlowIdentifier": "sqlserver-multidatabase",   "registeredFlowVersion": "0.29.0-ebb7a257" } ``` |

## Application Metrics

> **Note:**
>
> The following list covers all application metrics available for Openflow Runtimes. Runtimes only emit a subset of metrics relevant to Openflow Connectors to persist in a Snowflake Event Table.
>
> Snowflake’s OpenTelemetry Reporting Task can send some or all metrics to any OTLP destination.

### Connection Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| connection.input.bytes | bytes | Size of Items Input |
| connection.input.count | items | Count of Items Input |
| connection.output.bytes | bytes | Size of Items Output |
| connection.output.count | items | Count of Items Output |
| connection.queued.bytes | bytes | Size of Items Queued |
| connection.queued.bytes.max | bytes | Max Size of Items Queued |
| connection.queued.count | items | Count of Items Queued |
| connection.queued.count.max | items | Max Count of Items Queued |
| connection.queued.duration.total | milliseconds | Total Duration of Queued Items |
| connection.queued.duration.max | milliseconds | Max Duration of Queued Items |
| connection.backpressure.threshold.bytes | bytes | The maximum size of data in bytes that can be queued in this connection before it applies back pressure. |
| connection.backpressure.threshold.objects | items | The configured maximum number of FlowFiles that can be queued in this connection before it applies back pressure. |
| connection.loadbalance.status.load_balance_not_configured | binary, 0 or 1 | 1 if the connection does not have a configured load balance setting. Otherwise, 0. |
| connection.loadbalance.status.load_balance_active | binary, 0 or 1 | 1 if the connection is load balancing across the cluster. Otherwise, 0. |
| connection.loadbalance.status.load_balance_inactive | binary, 0 or 1 | 1 if the connection is not load balancing across the cluster. Otherwise, 0. |

### Connection Record Attributes

Each Connection metric includes the following Record Attributes:

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the connection |
| name | The user-visible name of the connection |
| type | The fixed value `connection` |
| source.id | The unique identifier of the component that is sending FlowFiles to this connection |
| source.name | The user-visible name of the component that is sending FlowFiles to this connection |
| destination.id | The unique identifier of the component that is receiving FlowFiles from this connection |
| destination.name | The user-visible name of the component that is receiving FlowFiles from this connection |
| group.id | The unique identifier of the Process Group that contains this Connection |

### Input and Output Port Metrics

Input Port and Output Ports are technically two separate types of components. For consistency, metrics and attributes for Input and Output Ports are the same, with the exception of the `type` attribute that indicates whether it is an input port or an output port.

| Metric Name | Unit | Description |
| --- | --- | --- |
| port.thread.count.active | threads | Number of Active Threads |
| port.bytes.received | bytes | Number of Bytes Received |
| port.bytes.sent | bytes | Number of Bytes Sent |
| port.flowfiles.received | flowfiles | Number of FlowFiles Received |
| port.flowfiles.sent | flowfiles | Number of FlowFiles Sent |
| port.input.bytes | bytes | Size of Items Input |
| port.input.count | items | Count of Items Input |
| port.output.bytes | bytes | Size of Items Output |
| port.output.count | items | Count of Items Output |

### Input and Output Port Record Attributes

Each Port metric includes the following Record Attributes:

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the port |
| name | The user-visible name of the port |
| type | One of `port-input` or `port-output` |
| group.id | The unique identifier of the Process Group that contains this Port |

### Process Group Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| processgroup.thread.count.active | threads | Number of Active Threads |
| processgroup.thread.count.stateless | threads | Number of Stateless Threads |
| processgroup.thread.count.terminated | threads | Number of Terminated Threads |
| processgroup.bytes.read | bytes | Number of Bytes Read |
| processgroup.bytes.received | bytes | Number of Bytes Received |
| processgroup.bytes.transferred | bytes | Number of Bytes Transferred |
| processgroup.bytes.sent | bytes | Number of Bytes Sent |
| processgroup.bytes.written | bytes | Number of Bytes Written |
| processgroup.flowfiles.received | flowfiles | Number of FlowFiles Received |
| processgroup.flowfiles.sent | flowfiles | Number of FlowFiles Sent |
| processgroup.flowfiles.transferred | flowfiles | Number of FlowFiles Transferred |
| processgroup.input.count | items | Number of Items Input |
| processgroup.input.content.size | bytes | Size of Items Input |
| processgroup.output.count | items | Number of Items Output |
| processgroup.output.content.size | bytes | Size of Items Output |
| processgroup.queued.count | items | Number of Items Queued |
| processgroup.queued.content.size | bytes | Size of Items Queued |
| processgroup.time.processing | nanoseconds | Time Spent Processing |

### Process Group Record Attributes

Each Process Group metric includes the following Record Attributes:

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the Process Group |
| name | The user-visible name of the Process Group |
| type | The fixed value `process-group` |
| tree.level | The depth of the Process Group, relative to the root process group of the flow. Process Groups at the highest level of the flow will have a tree.level of 1 |

### Processor Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| processor.thread.count.active | thread | Number of Active Threads |
| processor.thread.count.terminated | thread | Number of Terminated Threads |
| processor.time.lineage.average | nanosecond | Average Lineage Duration |
| processor.invocations | invocations | Number of Invocations |
| processor.bytes.read | byte | Number of Bytes Read |
| processor.bytes.received | byte | Number of Bytes Received |
| processor.bytes.sent | byte | Number of Bytes Sent |
| processor.bytes.written | byte | Number of Bytes Written |
| processor.flowfiles.received | flowfiles | Number of FlowFiles Received |
| processor.flowfiles.removed | flowfiles | Number of FlowFiles Removed |
| processor.flowfiles.sent | flowfiles | Number of FlowFiles Sent |
| processor.input.count | item | Number of Items Input |
| processor.input.content.size | bytes | Size of Items Input |
| processor.output.count | item | Number of Items Output |
| processor.output.content.size | byte | Size of Items Output |
| processor.time.processing | nanosecond | Time Spent Processing |
| processor.run.status.running | binary, 0 or 1 | 1 if running; 0 otherwise |
| processor.run.status.stopped | binary, 0 or 1 | 1 if stopped; 0 otherwise |
| processor.run.status.validating | binary, 0 or 1 | 1 if validating; 0 otherwise |
| processor.run.status.invalid | binary, 0 or 1 | 1 if invalid; 0 otherwise |
| processor.run.status.disabled | binary, 0 or 1 | 1 if disabled; 0 otherwise |
| processor.counter | count | Value of the counter |

### Processor Record Attributes

Each Processor metric includes the following Record Attributes:

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the processor |
| name | The user-visible and user-editable name of the Processor |
| type | The fixed value `processor` |
| component | The immutable class name of the processor. |
| execution.node | Either `ALL` or `PRIMARY`, depending on how this Processor is configured to run |
| group.id | The unique identifier of the Process Group that contains this Processor |

### Additional Attributes for Counters​

In addition to the standard Processor attributes above, `processor.counter` metrics include the following:

| Attribute | Description |
| --- | --- |
| type | The fixed value `counter` |
| counter | The user- or system-generated name of the counter |

### Remote Process Group Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| remoteprocessgroup.thread.count.active | threads | Number of Active Threads |
| remoteprocessgroup.remote.port.count.active | ports | Number of Active Remote Ports |
| remoteprocessgroup.remote.port.count.inactive | ports | Number of Inactive Remote Ports |
| remoteprocessgroup.duration.lineage.average | nanoseconds | Average Lineage Duration |
| remoteprocessgroup.refresh.age | milliseconds | Time since last refresh |
| remoteprocessgroup.received.count | items | Number of Received Items |
| remoteprocessgroup.received.content.size | bytes | Size of Received Items |
| remoteprocessgroup.sent.count | items | Number of Sent Items |
| remoteprocessgroup.sent.content.size | bytes | Size of Sent Items |
| remoteprocessgroup.transmission.status.transmitting | binary, 0 or 1 | 1 if the Remote Process Group is transmitting. Otherwise, 0. |
| remoteprocessgroup.transmission.status.nottransmitting | binary, 0 or 1 | 0 if the Remote Process Group is transmitting. Otherwise, 1. |

### Remote Process Group Record Attributes

Each Remote Process Group metric includes the following Record Attributes:

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the remote process group |
| name | The user-visible name of the Remote Process Group |
| group.id | The unique identifier of the Process Group that contains this Remote Process Group |
| authorization.issue | The Authorization used to access the Remote Process Group |
| target.uri | The URI of the Remote Process Group |
| type | The fixed value `remote-process-group` |

### JVM Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| jvm.memory.heap.used | bytes | The amount of memory currently occupied by objects on the JVM Heap |
| jvm.memory.heap.committed | bytes | The amount of memory guaranteed to be available for use by the JVM Heap |
| jvm.memory.heap.max | bytes | Maximum amount of memory allocated for the JVM Heap |
| jvm.memory.heap.init | bytes | Initial amount of memory allocated for the JVM Heap |
| jvm.memory.heap.usage | percentage | JVM Heap Usage |
| jvm.memory.non-heap.usage | percentage | JVM Non-Heap Usage |
| jvm.memory.total.init | bytes | Initial amount of memory allocated for the JVM |
| jvm.memory.total.used | bytes | Current amount of memory used by the JVM |
| jvm.memory.total.max | bytes | Maximum amount of memory that can be used by the JVM |
| jvm.memory.total.committed | bytes | The amount of memory guaranteed to be available for use by the JVM |
| jvm.threads.count | threads | Number of live threads |
| jvm.threads.deadlocks | threads | JVM Thread Deadlocks |
| jvm.threads.daemon.count | threads | Number of live daemon threads |
| jvm.uptime | seconds | Number of seconds the JVM process has been running |
| jvm.file.descriptor.usage | percentage | Percentage of available file descriptors currently in use. |
| jvm.gc.G1-Concurrent-GC.runs | runs | Total number of times that the G1 Concurrent Garbage Collection has run |
| jvm.gc.G1-Concurrent-GC.time | milliseconds | Total amount of time that the G1 Concurrent Garbage Collection has been running |
| jvm.gc.G1-Young-Generation.runs | runs | Total number of times that the G1 Young Generation has run |
| jvm.gc.G1-Young-Generation.time | milliseconds | Total amount of time that the G1 Young Generation has been running |
| jvm.gc.G1-Old-Generation.runs | runs | Total number of times that the G1 Old Generation has run |
| jvm.gc.G1-Old-Generation.time | milliseconds | Total amount of time that the G1 Old Generation has been running |

### JVM Record Attributes

JVM metrics do not provide Record Attributes.

### CPU Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| cores.available | cores | The number of available cores for the Runtime |
| cores.load | percentage | Either the system load average or -1 if it is not available |

### CPU Record Attributes

| Attribute | Description |
| --- | --- |
| id | The fixed value `cpu` |
| name | The name of the operating system |
| architecture | The architecture of the operating system |
| version | The version of the operating system |

### Storage Metrics

| Metric Name | Unit | Description |
| --- | --- | --- |
| storage.free | bytes | The amount of free storage for a given repository |
| storage.used | bytes | The amount of used storage for a given repository |

### Storage Record Attributes

| Attribute | Description |
| --- | --- |
| id | The unique identifier of the storage repository |
| name | Same as id and provided for consistency |
| storage.type | One of `flowfile`, `content`, or `provenance` |

## Example Queries

The following queries are examples to get you started with Openflow Telemetry.

All queries assume that Openflow is configured to send telemetry to the default Event Table of `SNOWFLAKE.TELEMETRY.EVENTS`. If your Snowflake Account or Openflow Deployment is configured with a different Event Table, substitute that table name where you see `SNOWFLAKE.TELEMETRY.EVENTS`.

### Find Stuck FlowFiles

This query returns connections with FlowFiles that have been queued for more than some threshold, indicating that they may be stuck and require intervention. Adjust the 30 minute threshold as needed for your use case.

```sqlexample
SELECT * FROM (
  SELECT
    resource_attributes:"openflow.dataplane.id" as Deployment_ID,
    resource_attributes:"k8s.namespace.name" as Runtime_Key,
    record_attributes:name as Connection_Name,
    record_attributes:id as Connection_ID,
    MAX(TO_NUMBER(value / 60 / 1000)) as Max_Queued_File_Minutes
  FROM snowflake.telemetry.events
  WHERE true
    AND record_type = 'METRIC'
    AND record:metric:name = 'connection.queued.duration.max'
    AND timestamp > dateadd(minutes, -30, sysdate())
  GROUP BY 1, 2, 3, 4
  ORDER BY Max_Queued_File_Minutes DESC
) WHERE Max_Queued_File_Minutes > 30;
```

### Find Error Logs for Openflow Runtimes

```sqlexample
SELECT
  timestamp,
  Deployment_ID,
  Runtime_Key,
  parsed_log:level as log_level,
  parsed_log:loggerName as logger,
  parsed_log:formattedMessage as message,
  parsed_log
FROM (
  SELECT
    timestamp,
    resource_attributes:"openflow.dataplane.id" as Deployment_ID,
    resource_attributes:"k8s.namespace.name" as Runtime_Key,
    TRY_PARSE_JSON(value) as parsed_log
  FROM snowflake.telemetry.events
  WHERE true
    AND timestamp > dateadd('minutes', -30, sysdate())
    AND record_type = 'LOG'
    AND resource_attributes:"k8s.namespace.name" like 'runtime-%'
  ORDER BY timestamp DESC
) WHERE log_level = 'ERROR';
```

### Find Running and Non-Running Processors

Some flows expect that all processors are in a “running” state, even if they are not actively processing data.

This query helps you find any processors that are running or in another state, such as:

* stopped
* invalid
* disabled

```sqlexample
SELECT
  timestamp,
  resource_attributes:"openflow.dataplane.id" as Deployment_ID,
  resource_attributes:"k8s.namespace.name" as Runtime_Key,
  record_attributes:component as Processor,
  record_attributes:id as Processor_ID,
  TO_NUMBER(value) as Running
FROM snowflake.telemetry.events
WHERE true
  AND record:metric:name = 'processor.run.status.running'
  AND record_type = 'METRIC'
  AND timestamp > dateadd(minutes, -30, sysdate());
```

### Find High CPU Usage for Openflow Runtimes

Slow data flows or reduced throughput may be the result of a bottleneck on the CPU. Openflow Runtimes scale up automatically, based on the number of minimum and maximum nodes you have configured.

If an Openflow Runtime is using its maximum number of nodes and still CPU usage remains high, consider:

1. Increasing the maximum number of nodes allocated to the Runtime
2. Troubleshoot the Connector or flow to identify the bottleneck

Snowsight Charts provide an easy way to visualize query results for CPU usage over time.

```sqlexample
SELECT
  timestamp,
  resource_attributes:"openflow.dataplane.id" as Deployment_ID,
  resource_attributes:"k8s.namespace.name" as Runtime_Key,
  resource_attributes:"k8s.pod.name" as Runtime_Pod,
  TO_NUMBER(value, 10, 3) * 100 as CPU_Usage_Percentage
FROM snowflake.telemetry.events
WHERE true
  AND timestamp > dateadd(minute, -30, sysdate())
  AND record_type = 'METRIC'
  AND record:metric:name ilike 'container.cpu.usage'
  AND resource_attributes:"k8s.namespace.name" ilike 'runtime-%'
  AND resource_attributes:"k8s.container.name" ilike '%-server'
ORDER BY timestamp desc, CPU_Usage_Percentage desc;
```

---
title: MonitorActivity 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/monitoractivity.md
section: Loading & Unloading Data
---

# MonitorActivity 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Monitors the flow for activity and sends out an indicator when the flow has not had any data for some specified amount of time and again when the flow’s activity is restored

## Tags

active, activity, detection, flow, inactive, monitor

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Activity Restored Message | The message that will be the content of FlowFiles that are sent to ‘activity.restored’ relationship |
| Continually Send Messages | If true, will send inactivity indicator continually every Threshold Duration amount of time until activity is restored; if false, will send an indicator only when the flow first becomes inactive |
| Copy Attributes | If true, will copy all flow file attributes from the flow file that resumed activity to the newly created indicator flow file |
| Inactivity Message | The message that will be the content of FlowFiles that are sent to the ‘inactive’ relationship |
| Monitoring Scope | Specify how to determine activeness of the flow. ‘node’ means that activeness is examined at individual node separately. It can be useful if DFM expects each node should receive flow files in a distributed manner. With ‘cluster’, it defines the flow is active while at least one node receives flow files actively. If NiFi is running as standalone mode, this should be set as ‘node’, if it ‘s’ cluster ‘, NiFi logs a warning message and act as’ node’scope. |
| Reporting Node | Specify which node should send notification flow-files to inactive and activity.restored relationships. With ‘all’, every node in this cluster send notification flow-files. ‘primary’ means flow-files will be sent only from a primary node. If NiFi is running as standalone mode, this should be set as ‘all’, even if it ‘s’ primary ‘, NiFi act as’ all’. |
| Reset State on Restart | When the processor gets started or restarted, if set to true, the initial state will always be active. Otherwise, the last reported flow state will be preserved. |
| Threshold Duration | Determines how much time must elapse before considering the flow to be inactive |
| Wait for Activity | When the processor gets started or restarted, if set to true, only send an inactive indicator if there had been activity beforehand. Otherwise send an inactive indicator even if there had not been activity beforehand. |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | MonitorActivity stores the last timestamp at each node as state, so that it can examine activity at cluster wide. If ‘Copy Attribute’ is set to true, then flow file attributes are also persisted. In local scope, it stores last known activity timestamp if the flow is inactive. |
| CLUSTER | MonitorActivity stores the last timestamp at each node as state, so that it can examine activity at cluster wide. If ‘Copy Attribute’ is set to true, then flow file attributes are also persisted. In local scope, it stores last known activity timestamp if the flow is inactive. |

## Relationships

| Name | Description |
| --- | --- |
| activity.restored | This relationship is used to transfer an Activity Restored indicator when FlowFiles are routing to ‘success’ following a period of inactivity |
| inactive | This relationship is used to transfer an Inactivity indicator when no FlowFiles are routed to ‘success’ for Threshold Duration amount of time |
| success | All incoming FlowFiles are routed to success |

## Writes attributes

| Name | Description |
| --- | --- |
| inactivityStartMillis | The time at which Inactivity began, in the form of milliseconds since Epoch |
| inactivityDurationMillis | The number of milliseconds that the inactivity has spanned |

---
title: MoveAzureDataLakeStorage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/moveazuredatalakestorage.md
section: Loading & Unloading Data
---

# MoveAzureDataLakeStorage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Moves content within an Azure Data Lake Storage Gen 2. After the move, files will be no longer available on source location.

## Tags

adlsgen2, azure, cloud, datalake, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ADLS Credentials | Controller Service used to obtain Azure Credentials. |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the output directory |
| Destination Directory | Name of the Azure Storage Directory where the files will be moved. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. Non-existing directories will be created. If the original directory structure should be kept, the full directory path needs to be provided after the destination directory. e.g.: destdir/${azure.directory} |
| Destination Filesystem | Name of the Azure Storage File System where the files will be moved. |
| File Name | The filename |
| Source Directory | Name of the Azure Storage Directory from where the move should happen. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. |
| Source Filesystem | Name of the Azure Storage File System from where the move should happen. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Azure storage for some reason are transferred to this relationship |
| success | Files that have been successfully written to Azure storage are transferred to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.source.filesystem | The name of the source Azure File System |
| azure.source.directory | The name of the source Azure Directory |
| azure.filesystem | The name of the Azure File System |
| azure.directory | The name of the Azure Directory |
| azure.filename | The name of the Azure File |
| azure.primaryUri | Primary location for file content |
| azure.length | The length of the Azure File |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureDataLakeStorage](deleteazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureDataLakeStorage](fetchazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.ListAzureDataLakeStorage](listazuredatalakestorage.md)

---
title: Notify 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/notify.md
section: Loading & Unloading Data
---

# Notify 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Caches a release signal identifier in the distributed cache, optionally along with the FlowFile’s attributes. Any flow files held at a corresponding Wait processor will be released once this signal in the cache is discovered.

## Tags

cache, distributed, map, notify, release, signal

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| attribute-cache-regex | Any attributes whose names match this regex will be stored in the distributed cache to be copied to any FlowFiles released from a corresponding Wait processor. Note that the uuid attribute will not be cached regardless of this value. If blank, no attributes will be cached. |
| distributed-cache-service | The Controller Service that is used to cache release signals in order to release files queued at a corresponding Wait processor |
| release-signal-id | A value, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the release signal cache key |
| signal-buffer-count | Specify the maximum number of incoming flow files that can be buffered until signals are notified to cache service. The more buffer can provide the better performance, as it reduces the number of interactions with cache service by grouping signals by signal identifier when multiple incoming flow files share the same signal identifier. |
| signal-counter-delta | A value, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the signal counter delta. Specify how much the counter should increase. For example, if multiple signal events are processed at upstream flow in batch oriented way, the number of events processed can be notified with this property at once. Zero (0) has a special meaning, it clears target count back to 0, which is especially useful when used with Wait Releasable FlowFile Count = Zero (0) mode, to provide ‘open-close-gate’ type of flow control. One (1) can open a corresponding Wait processor, and Zero (0) can negate it as if closing a gate. |
| signal-counter-name | A value, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the signal counter name. Signal counter name is useful when a corresponding Wait processor needs to know the number of occurrences of different types of events, such as success or failure, or destination data source names, etc. |

## Relationships

| Name | Description |
| --- | --- |
| failure | When the cache cannot be reached, or if the Release Signal Identifier evaluates to null or empty, FlowFiles will be routed to this relationship |
| success | All FlowFiles where the release signal has been successfully entered in the cache will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| notified | All FlowFiles will have an attribute ‘notified’. The value of this attribute is true, is the FlowFile is notified, otherwise false. |

## See also

* [org.apache.nifi.processors.standard.Wait](wait.md)

---
title: OpenAiTranscribeAudio 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/openaitranscribeaudio.md
section: Loading & Unloading Data
---

# OpenAiTranscribeAudio 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-openai-nar

## Description

Transcribes audio into English text. The audio data must be in one of these formats: flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm

## Tags

audio, flac, m4a, mp3, mp4, mpeg, mpga, ogg, openai, openflow, speech-to-text, text, transcribe, translate, wav, webm

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Model Name | The name of the OpenAI Model to use |
| OpenAI API Key | The API Key for interacting with OpenAI |
| Prompt | Text that can be used to guide the model’s style or continue a previous audio segment. The text must be in English. |
| Response Format | Specifies which format is desired for the output |
| Temperature | The sampling temperature to use. The value must be a floating-point number between 0.0 and 1.0. A higher value, such as 0.8 will result in more of an interpreted translation, whereas a value of 0.0 will result in a more literal translation. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that could not be transcribed are routed to this relationship. |
| success | FlowFiles that have been successfully transcribed will be transferred to this relationship. |

## Use Cases Involving Other Components

|  |
| --- |
| Create embeddings for audio data and insert them into Pinecone so that the audio can be made available to a large language model (LLM) such as OpenAI’s GPT models. |

---
title: Openflow BYOC - Set up custom ingress
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-byoc-custom-ingress.md
section: Loading & Unloading Data
---

# Openflow BYOC - Set up custom ingress

This topic describes the considerations for and steps required to set up an Openflow BYOC deployment with a custom ingress solution managed within your own AWS account.

## Benefits

Custom ingress for Openflow BYOC deployments provides your organization with:

* Stronger security with network-level restrictions that can limit access to only your VPN or private network.
* Full control over the URL and TLS certificate used to access Openflow to meet your security and compliance requirements.

## Considerations

With Snowflake managed ingress, Openflow creates the necessary DNS records, public load balancer, and manages the TLS certificate for the Openflow runtimes in your BYOC deployment.

When you enable custom ingress, Openflow will no longer automatically manage external DNS records, will not create a public load balancer automatically, and will no longer manage certificates for the Openflow runtimes. You must manage these resources within your own AWS account.

## Configure custom ingress in Snowflake Openflow

1. Enable custom ingress during deployment creation.

   > * During deployment creation, enable Custom ingress and specify your preferred fully qualified domain name (FQDN) in the Hostname field.
   > * You must be able to manage this DNS record and create a TLS certificate for this FQDN. Do not use a subdomain of `snowflakecomputing.com`.
   > * You must not include the protocol https:// or a trailing slash / in the FQDN.
   > * For example, if you specify `openflow01.your-domain.org`, you will access a runtime named “My Runtime” at `https://openflow01.your-domain.org/my-runtime/nifi/`.
2. Download the CloudFormation template. This file has all of the settings required for Openflow to run as your custom ingress domain.

## Configure custom ingress in AWS

> **Note:**
>
> `{deployment-key}` represents the Openflow unique identifier applied to cloud resources created and managed by Openflow for a particular deployment.
>
> This is in the `DataPlaneKey` parameter of the CloudFormation template, also available in Openflow through the View Details menu option for the deployment.

1. Add the following tag to the private subnets for your Openflow deployment:

   > * Key: kubernetes.io/role/internal-elb
   > * Value: `1`
2. If your private subnets are used by other EKS clusters, you must also tag them with the name of the Openflow cluster. This allows Openflow to create a load balancer alongside other load balancers.

   > * Key: kubernetes.io/cluster/{deployment-key}
   > * Value: `1`
3. Upload the CloudFormation template. Wait approximately 30 minutes for Openflow to create the internal network load balancer.

   > * You can find the internal network load balancer in the AWS Console under EC2 » Load Balancers.
   > * The load balancer will be named `runtime-ingress-{deployment-key}`.
4. Obtain the internal IP address of the Openflow-managed AWS internal network load balancer.

   > * Under EC2 » Load Balancers, navigate to the details page and copy the DNS name of the Load Balancer.
   > * Log into your agent EC2 instance (identified as openflow-agent-{deployment-key}) and run the command `nslookup {openflow-load-balancer-dns-name}`.
   > * Copy the IP addresses of the Openflow-managed AWS internal network load balancer. These are destinations for the target group of the load balancer you will create in a following step.
5. Provision a TLS certificate.

   > * Obtain a TLS certificate for the load balancer that will handle traffic to the Openflow runtime UIs. You can generate a certificate using AWS Certificate Manager (ACM) or import an existing certificate.
6. Create a network load balancer that will route traffic to the Openflow-managed AWS internal network load balancer.

   > 1. In your AWS account, create a Network Load Balancer with the following configuration:
   >
   >    * Name: We recommend the naming convention `custom-ingress-external-{deployment-key}`, where `{deployment-key}` is the key of your Openflow deployment.
   >    * Type: Network Load Balancer
   >    * Scheme: Internal or Internet-facing, depending on your requirements.
   >    * VPC: Select the VPC of your deployment
   >    * Availability Zones: Select both Availability Zones where your Openflow deployment is running.
   >    * Subnets: Select the private subnets of your VPC for an Internal Load Balancer, or the public subnets of your VPC for an Internet-facing Load Balancer.
   >    * Security groups: Select or create a security group that allows traffic on port `443`
   >    * Default SSL/TLS server certificate: Import your SSL/TLS certificate
   >    * Target group: Create a new target group with the following settings:
   >
   >      + Target type: IP addresses
   >      + Protocol: TLS
   >      + Port: 443
   >      + VPC: Verify the VPC matches your deployment
   >      + Type the IP address of the internal network load balancer created by Openflow (obtained in the previous step) as the target and select Include as pending below.
   > 2. Once the load balancer is created, copy the DNS name for the load balancer to use in the next step.
   > 3. For more information on how to create a network load balancer, see [Create a Network Load Balancer](https://docs.aws.amazon.com/elasticloadbalancing/latest/network/create-network-load-balancer.html).
7. Create a DNS CNAME record that maps your custom ingress FQDN to the AWS load balancer’s DNS name.

   > * For detailed DNS configuration instructions in Route 53, see [Create records in Route 53](https://docs.aws.amazon.com/Route53/latest/DeveloperGuide/resource-record-sets-creating.html).

## Verification

1. The Openflow deployment shows a status of Active in the Deployments page.
2. Create a runtime in the Openflow deployment.
3. Once the runtime is Active, click on the runtime name or use the View canvas menu option to access the runtime’s UI.
4. Openflow directs you to the runtime with the hostname specified during deployment creation. For example, `https://openflow01.your-domain.org/my-runtime/nifi/`.

## Troubleshooting

The following sections provide troubleshooting steps for common issues with custom ingress. If you are still experiencing issues after performing these checks, file a [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) case.

### Load balancer target health check

The target group for your network load balancer should list the IP addresses of the Openflow-managed internal network load balancer as targets. All of these targets should show as Healthy. If targets are Unhealthy, use the following checks to narrow down where traffic is failing.

1. In the AWS console, open EC2 » Load Balancers.
2. Locate the Openflow-managed load balancer that manages ingress to the Kubernetes cluster. This load balancer is named `runtime-ingress-{deployment-key}`.
3. Review the target health for that load balancer under the Resource map tab.
4. If the Openflow-managed load balancer is not active or has Unhealthy targets:

   * Traffic may be blocked between the Openflow-managed load balancer and the BYOC cluster, or a service inside the cluster may not be ready.
   * Generate a diagnostic bundle by running `./diagnostics.sh` from the openflow-agent-{deployment-key} EC2 instance and attach it to a [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) case.
5. If the Openflow-managed load balancer is active and has healthy targets, check the target health for your load balancer.
6. If your load balancer’s targets are Unhealthy, the path from your load balancer to the Openflow-managed load balancer is the most likely problem:

   * **Incorrect or stale IP addresses in your target group.** The Openflow-managed load balancer exposes multiple IP addresses that can change over time. To get the latest values, run `nslookup` with the DNS name of the Openflow-managed load balancer. Update your load balancer’s targets as necessary.
   * **Security group rules.** Confirm that inbound rules on the Openflow-managed load balancer’s security groups allow TCP `443` from your load balancer. Traffic can fail if your load balancer can’t reach the Openflow load balancer on port `443`.

### Browser security blocking

Some problems with custom ingress are caused by corporate browser security, firewalls, or web proxies that block or inspect traffic to your custom hostname. Those policies are separate from AWS load balancer configuration. You may find that users can’t open the Openflow UI even when AWS load balancers report healthy targets.

To verify connectivity through the load balancers to the Openflow services:

1. In the AWS console, open EC2 » Load Balancers to get the DNS name of the load balancer that is serving traffic and the TLS certificate for your custom ingress domain name.

   * This is **not** the runtime-ingress-{deployment-key} load balancer.
2. From the openflow-agent-{deployment-key} EC2 instance, verify connectivity through the load balancers to the Openflow deployment. Run the command:

   ```bash
   curl -kv https://{your-load-balancer-dns-name}
   ```

   * If the command outputs the expected certificate information and a successful 404 status code response, you have successfully verified connectivity to your Openflow deployment.
   * If the command times out or returns an error, create a [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) case and attach a diagnostic bundle generated by running `./diagnostics.sh` from the Openflow Agent instance.
3. From the Openflow Agent instance, you can also verify the DNS CNAME record for your custom ingress FQDN. Run the command:

   ```bash
   source ~/.env && nslookup $DOMAIN
   ```

   * If the command returns the IP addresses of the load balancer that is performing TLS termination for your custom ingress domain name, you have successfully verified the DNS CNAME record.
   * If the command returns no results, the DNS CNAME record is not configured correctly. Check the DNS record for your custom ingress FQDN and ensure it points to your load balancer’s DNS name.

If the Openflow Agent connected successfully through your load balancer’s DNS and you have verified the DNS CNAME record, a security policy or firewall is likely blocking traffic from your browser to the Openflow BYOC deployment. Work with your security team to allowlist your custom ingress FQDN.

---
title: Openflow BYOC - Set up encrypted EBS volumes
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-byoc-encrypted-volumes.md
section: Loading & Unloading Data
---

# Openflow BYOC - Set up encrypted EBS volumes

This topic describes the steps to set up an Openflow BYOC deployment with encrypted Elastic Block Storage (EBS) volumes using one of the following methods:

* Provide a specific AWS KMS Key for Encrypted EBS Volumes
* Enable Encrypted EBS Volumes by default for your AWS Account

Both of these solutions provide encrypted EBS volumes that meet the following storage requirements of Openflow BYOC:

* Root volume for the Openflow Agent EC2 instance
* Root volumes for the EC2 instances in each EKS Cluster Node Group
* Persistent volumes for Openflow’s runtimes and supporting components

> **Note:**
>
> * `$AWS_ACCOUNT_ID` represents the AWS Account ID of the account where Openflow is deployed.
> * `$AWS_REGION` represents the AWS Region of the account, for example `us-west-2`.
> * `$AWS_KMS_KEY_ARN` represents the Amazon Resource Name (ARN) of the Amazon Key Management Service (AWS KMS) key that Openflow will use for encrypted EBS volumes.
> * `$DEPLOYMENT_KEY` represents the Openflow unique identifier applied to cloud resources created and managed by Openflow for a particular deployment.
>   This is in the `DataPlaneKey` parameter of the CloudFormation template, also available in Openflow through the View Details menu option for the deployment.

## Prerequisites

This topic assumes that you have completed the prerequisites for setting up Openflow BYOC. For more information, see [Set up Openflow - BYOC](setup-openflow-byoc.md).

You must also have access to an AWS KMS key that Openflow will use for encrypted EBS volumes.

## Provide a specific AWS KMS Key for Encrypted EBS Volumes

When uploading the CloudFormation template for your Openflow BYOC Deployment, you can provide the ARN for the AWS KMS key that Openflow uses for encrypted EBS volumes.

Using this configuration, Openflow makes requests for encrypted EBS volumes, ensuring that all SCP policies are satisfied. Snowflake recommends this approach for most customers.

This allows you to use different KMS keys for different applications, reducing the risk of a single key being compromised.

To ensure that Openflow has the necessary permissions to use this key, perform the following tasks:

1. Ensure that the AWS KMS key grants permissions to the AWS Autoscaling Service Role. The Key Policy must include the following statement:

   > ```json
   > {
   > "Sid": "Allow Autoscaling to use the key",
   > "Effect": "Allow",
   > "Principal": {
   >     "AWS": "arn:aws:iam::$AWS_ACCOUNT_ID:role/aws-service-role/autoscaling.amazonaws.com/AWSServiceRoleForAutoScaling"
   > },
   > "Action": [
   >     "kms:CreateGrant",
   >     "kms:Decrypt",
   >     "kms:Encrypt",
   >     "kms:ReEncrypt*",
   >     "kms:GenerateDataKey*",
   >     "kms:DescribeKey"
   > ],
   > "Resource": "*"
   > }
   > ```
2. Enter the ARN of the AWS KMS key in the `EBSKMSKeyArn` parameter of the CloudFormation stack when uploading the template.

   > For example, `arn:aws:kms:$AWS_REGION:$AWS_ACCOUNT_ID:key/1a1a11aa-aa1a-aaa1a-a1a1-000000000000`.
   >
   > Approximately 20 minutes after uploading the CloudFormation template, the Openflow BYOC Deployment creates a new IAM Role with the name `$DEPLOYMENT_KEY-eks-role`.
3. Add the following statement to the KMS key policy to grant permissions for Openflow to use the key:

   > ```json
   > {
   > "Sid": "Allow Openflow Deployment to encrypt EBS volumes",
   > "Effect": "Allow",
   > "Principal": {
   >     "AWS": "arn:aws:iam::$AWS_ACCOUNT_ID:role/$DEPLOYMENT_KEY-eks-role"
   > },
   > "Action": [
   >     "kms:Decrypt",
   >     "kms:Encrypt",
   >     "kms:ReEncrypt*",
   >     "kms:GenerateDataKey*",
   >     "kms:CreateGrant",
   >     "kms:DescribeKey"
   > ],
   > "Resource": "*"
   > }
   > ```

Openflow automatically detects the new permissions for the KMS key and continues the installation process. The Openflow BYOC deployment will become `Active` after approximately 20 minutes.

## Enable Encrypted EBS Volumes by default for your AWS Account

AWS accounts can encrypt new EBS volumes by default by following the [AWS EBS encryption by default documentation](https://docs.aws.amazon.com/ebs/latest/userguide/encryption-by-default.html).

With this configuration, Openflow makes requests for unencrypted EBS volumes, but the AWS API will return an encrypted EBS volume. The following steps ensure that Openflow has permissions to use the KMS key for these encrypted volumes.

Whether you choose to use the AWS managed key `aws/ebs` or your own KMS key, you must attach an IAM Policy to the Openflow IAM Role `$DEPLOYMENT_KEY-eks-role` that grants the necessary permissions to use the key.

1. Create an IAM Policy to allow Openflow to use the KMS key by replacing `$AWS_KMS_KEY_ARN` with the ARN of the KMS key.

   > ```json
   > {
   > "Sid": "Allow Openflow EKS Role to encrypt EBS volumes",
   > "Effect": "Allow",
   > "Action": [
   >     "kms:Decrypt",
   >     "kms:Encrypt",
   >     "kms:ReEncrypt*",
   >     "kms:GenerateDataKey*",
   >     "kms:CreateGrant",
   >     "kms:DescribeKey"
   > ],
   > "Resource": "$AWS_KMS_KEY_ARN"
   > }
   > ```
2. Ensure that the AWS KMS key grants permissions to the AWS Autoscaling Service Role. The Key Policy must include the following statement:

   > ```json
   > {
   > "Sid": "Allow Autoscaling to use the key",
   > "Effect": "Allow",
   > "Principal": {
   >     "AWS": "arn:aws:iam::$AWS_ACCOUNT_ID:role/aws-service-role/autoscaling.amazonaws.com/AWSServiceRoleForAutoScaling"
   > },
   > "Action": [
   >     "kms:CreateGrant",
   >     "kms:Decrypt",
   >     "kms:Encrypt",
   >     "kms:ReEncrypt*",
   >     "kms:GenerateDataKey*",
   >     "kms:DescribeKey"
   > ],
   > "Resource": "*"
   > }
   > ```
3. When uploading the Openflow BYOC CloudFormation template:

   > * Leave the optional `EBSKMSKeyArn` parameter blank.
   > * Set the `AdditionalEksRolePolicyArns` parameter to the ARN of the new IAM Policy created previously. For example, `arn:aws:iam::$AWS_ACCOUNT_ID:policy/openflow-kms-key-access-policy`.
   >
   > Approximately 20 minutes after uploading the CloudFormation template, the Openflow BYOC Deployment creates a new IAM Role with the name `$DEPLOYMENT_KEY-eks-role`.
4. Add the following statement to the KMS key policy to grant permissions for Openflow to use the key:

   > ```json
   > {
   > "Sid": "Allow Openflow Deployment to encrypt EBS volumes",
   > "Effect": "Allow",
   > "Principal": {
   >     "AWS": "arn:aws:iam::$AWS_ACCOUNT_ID:role/$DEPLOYMENT_KEY-eks-role"
   > },
   > "Action": [
   >     "kms:Decrypt",
   >     "kms:Encrypt",
   >     "kms:ReEncrypt*",
   >     "kms:GenerateDataKey*",
   >     "kms:CreateGrant",
   >     "kms:DescribeKey"
   > ],
   > "Resource": "*"
   > }
   > ```

Openflow automatically detects the new permissions for the KMS key and continues the installation process. The Openflow BYOC deployment will become `Active` after approximately 20 minutes.

---
title: Openflow BYOC cost and scaling considerations
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/cost-byoc.md
section: Loading & Unloading Data
---

# Openflow BYOC cost and scaling considerations

Snowflake Openflow BYOC has cost considerations in multiple areas, including infrastructure, compute, data ingestion and others.
Scaling Openflow involves understanding these costs. The following sections describe Openflow BYOC costs in general,
and provide a number of examples of scaling Openflow BYOC runtimes and associated costs.

## Openflow BYOC costs

When using Openflow, you can incur the following types of costs:

| Cost category | Description |
| --- | --- |
| Openflow (shown as **Openflow Compute BYOC** on your Snowflake bill) | Cost based on the number of virtual CPU cores (vCPU) used by connector runtimes within your “bring your own cloud (BYOC)” environment. You are charged for active runtimes only. The compute used for Openflow management processes is excluded from this specific charge. Credits are billed per-second with a 60 second minimum.  For an example of using of VCPU and the impacts of scaling see Openflow BYOC scaling.  For information on the rate per vCPU per hour, refer to Table 1(g) in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).  Additionally, the [METERING_DAILY_HISTORY](../../../sql-reference/account-usage/metering_daily_history.md) and [METERING_HISTORY](../../../sql-reference/account-usage/metering_history.md) views in the [Account Usage](../../../sql-reference/account-usage.md) schema can provide additional details on Openflow compute costs using queries for `SERVICE_TYPE=OPENFLOW_COMPUTE_BYOC`.  See [Exploring compute cost](../../cost-exploring-compute.md) for more information on exploring compute costs in Snowflake. |
| Infrastructure (only for BYOC configuration) | Applicable only for BYOC deployments, you directly pay your cloud provider, for example, AWS, for the underlying infrastructure provisioned in your environment to run Openflow. This primarily includes compute (for runtimes you provision to run the connectors and for managing the runtimes), networking, and storage costs and will appear on your CSP bill.  The EC2 compute requirements are illustrated in the following image: |
| Ingestion | Cost for loading data into Snowflake using services such as Snowpipe or Snowpipe Streaming, based on data volume. Appears on your Snowflake bill under respective ingestion services line items. Certain connectors may require a standard Snowflake warehouse, incurring additional warehouse costs. For example, database CDC connectors require a Snowflake warehouse for both initial snapshot and incremental Change Data Capture (CDC). You can schedule [MERGE](../../../sql-reference/sql/merge.md) operations to manage the compute cost. |
| Telemetry Data Ingest | Standard Snowflake charges for sending logs and metrics to Openflow deployments and sending runtimes to your event table within Snowflake. The rate for credits per GB of telemetry data can be found in Table 5 in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). |

## Openflow BYOC scaling

The runtimes and scaling behavior you choose are crucial for managing costs effectively.
Openflow supports different runtime types, each with its own scaling characteristics.

### Runtime types and the associated costs

The following table illustrates the scaling behavior of various runtimes and their associated costs:

| Runtimes | Activity | Snowflake costs | Cloud costs |
| --- | --- | --- | --- |
| No runtimes | None | No cost | Compute and storage of Dataplane |
| 1 small runtime (1vCPU) . (min 1 max 2) | Active for 1 hour . Runtime does not scale to 2. | 1 runtime x 1 node x 1 vCPU x 1 hour = 1 . Total = 1 vCPU-hour | Compute and storage of Dataplane |
| 2 small runtime (1 vCPU) (min/max=2) . 1 large runtime (8 vCPU) (min/max=10) | Small: 2 nodes active for 1 hour . Large: 10 nodes active for 1 hour | 2 runtime2 x 2 node x 2 vCPU x 1 hour = 4 vCPU . 1 runtime x 10 nodes x 8 vCPU x 1 hour = 80 vCPU . Total = 84 vCPU-hours | Compute and storage of Dataplane |
| 1 medium (4vCPU) . (min =1 max=2) | First 20 minutes 1 node is running . After 20 minutes, scales to 2 nodes . After 40 minutes, scales back to 1 node . Total 1 hour . | 20 minutes = 1/3 hour . 1 runtime x 1 node x 4 vCPU x 1/3 hour = 1 1/3 . 1 runtime x 2 nodes x 4 vCPU x 2/3 hour = 2 1/3 . 1 runtime x 1 node x 4 vCPU x 1/3 hour = 1 1/3 . Total = 5 1/2 vCPU-hours | Compute and storage of Dataplane |
| 1 medium (4vCPU) . (min/max=2) | First 30 minutes 2 nodes running . Suspends after first 30 minutes. | 30 minutes = 1/2 hour . 1 runtime x 2 nodes x 4 vCPU x 1/2 hour = 4 . Total = 4 vCPU-hours | Compute and storage of Dataplane |

### Mapping runtimes to EC2 instance types

Choosing a runtime type (t-shirt size) results in the runtime pods being scheduled on the associated EC2
node group {key}-sm-group, {key}-md-group, or {key}-lg-group with resources described in the following table:

| Runtime type | vCPUs | Available memory (GB) | EC2 instance type | EC2 node group | EC2 node - CPUs | EC2 node - memory (GB) |
| --- | --- | --- | --- | --- | --- | --- |
| Small | 1 | 2 | m7i.xlarge | {key}-sm-group | 4 | 16 |
| Medium | 4 | 10 | m7i.4xlarge | {key}-md-group | 16 | 64 |
| Large | 8 | 20 | m7i.8xlarge | {key}-lg-group | 32 | 128 |

The type of runtime that you choose impacts the number of cores (vCPUs) consumed each second. Openflow scales the underlying EC2 node group
when additional pods need to be scheduled, based on CPU consumption, and up to the maximum node setting set during runtime creation.

EKS node groups are configured with a minimum size of 0 nodes and a maximum of 50 nodes.
The desired size is dynamically adjusted depending on the runtime required CPU and memory.

Customers are charged by their cloud service provider for the underlying nodes that host their runtime.
The underlying EC2 instances are created when the first runtime of a respective size is scheduled.

### Examples for calculating Openflow BYOC runtime consumption

A user requests a BYOC deployment from Openflow and then installs the Openflow agent and deployment
:   * The user has not created any runtimes. 0 vCPUs are allocated, so there is no Openflow software cost.
    * The user is charged by their cloud service provider for the provisioned compute and storage of the Openflow BYOC deployment.
    * Total Openflow consumption = 0 vCPU-hours

A user creates one small runtime with Min Nodes = 1 and Max Nodes = 2. Runtime stays at 1 node for 1 hour.
:   * 1 small runtime = 1 vCPU
    * Total Openflow consumption = 1 vCPU-hour

A user creates 2 small runtimes with min/max of 2 nodes each, and one large runtime with min/max of 10 nodes. These Runtimes are active for 1 hour
:   * 2 small runtimes at 2 nodes = 2 Runtimes x 2 nodes x 1 vCPU = 4 vCPUs
    * 1 large runtime at 10 nodes = 1 Runtime x 10 nodes x 8 vCPU = 80 vCPUs
    * Total Openflow consumption = (4 vCPU + 80 vCPU) x 1 hour = 84 vCPU-hours

A user creates 1 medium runtime with 1 node. After 20 minutes, it scales to 2 nodes. After 20 minutes, it scales back down to 1 node and runs for another 20 minutes.
:   * 1 medium runtime = 4 vCPUs
    * 20 minutes = ⅓ hour
    * (1 node x 4 vCPU x ⅓ hour) + (2 nodes x 4 vCPU x ⅓ hour) + (1 node x 4 vCPU x ⅓ hour)

      > + 4/3 vCPU-hours + 8/3 vCPU-hours + 4/3 vCPU-hours
    * Total Openflow consumption = 16/3 vCPU-hours, so approximately 5.33 vCPU-hours

A user creates 1 medium runtime with 2 nodes, then suspends it after 30 minutes
:   * 1 medium runtime = 4 vCPU
    * 30 minutes = ½ hour
    * Total Openflow consumption = (2 nodes x 4 vCPU x ½ hour) = 4 vCPU-hours

---
title: Openflow Connector for MySQL: Data mapping
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/mysql/data-mapping.md
section: Loading & Unloading Data
---

# Openflow Connector for MySQL: Data mapping

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes MySQL data types are mapped
to Snowflake data types.

## MySQL to Snowflake data type mapping

The following table shows how MySQL data types are mapped to Snowflake data types
when replicating data.

| MySQL type | Snowflake type | Notes |
| --- | --- | --- |
| DECIMAL / NUMERIC | NUMBER | The maximum number of digits in DECIMAL format for MySQL is 65. For Snowflake, the maximum is 38. Precision is lost when exceeded. |
| INT / INTEGER | INT |  |
| TINYINT / BOOL | INT |  |
| SMALLINT | INT |  |
| MEDIUMINT | INT |  |
| BIGINT | INT |  |
| YEAR | INT |  |
| FLOAT | FLOAT |  |
| DOUBLE | FLOAT |  |
| VARCHAR | TEXT |  |
| CHAR | TEXT | Trailing spaces are not preserved. |
| TINYTEXT | TEXT |  |
| TEXT | TEXT |  |
| MEDIUMTEXT | TEXT | Supported up to the maximum entry size in Snowflake (16 MB). |
| LONGTEXT | TEXT | Supported up to the maximum entry size in Snowflake (16 MB). |
| ENUM | TEXT | Stored as a string value. For example, for `ENUM('one', 'two')` the possible values are `'one'` and `'two'`. |
| SET | TEXT | Stored as a comma-separated string in column declaration order. For example, for `SET('one', 'two')` the possible values are `''`, `'one'`, `'two'`, and `'one,two'`. |
| BIT | TEXT | Represented as a hexadecimal string. For example: `'83060c183060c183'`. |
| DATE | DATE |  |
| DATETIME | TIMESTAMP_NTZ |  |
| TIMESTAMP | TIMESTAMP_TZ | Values are stored in UTC. |
| TIME | TIME |  |
| BINARY | BINARY |  |
| VARBINARY | BINARY |  |
| TINYBLOB | BINARY |  |
| BLOB | BINARY |  |
| MEDIUMBLOB | BINARY | Supported up to the maximum entry size in Snowflake (16 MB). |
| LONGBLOB | BINARY | Supported up to the maximum entry size in Snowflake (16 MB). |
| JSON | VARIANT | Supported up to the maximum entry size in Snowflake (16 MB). |

> **Note:**
>
> Any MySQL data types not listed in this table are mapped to TEXT by default.

---
title: Openflow Connector for MySQL: Maintenance
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/mysql/maintenance.md
section: Loading & Unloading Data
---

# Openflow Connector for MySQL: Maintenance

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes important maintenance considerations and best practices for
maintaining the Openflow Connector for MySQL such as reinstalling the connector or setting the starting binary log position for loading.

These operations are often used in conjunction with [Incremental replication with snapshots](incremental-replication.md).

## Reinstall the connector

This section provides instructions on how to reinstall the connector, and continue replicating data for
the same tables without having to snapshot them again.
It covers situations where the new connector is installed in the same runtime, as well as moved to a new runtime.

> **Warning:**
>
> For the connector to continue replicating from the same CDC stream position where it stopped before reinstallation,
> the source database must retain the binary log long enough to cover the time since the prior connector was stopped
> and the new connector is started.
> Make sure the `binlog_expire_logs_seconds` parameter of the MySQL server is high enough, and keep the reinstallation time to a minimum.
>
> The value of `binlog_expire_logs_seconds` needs to be longer than the expected time expected to reinstall the connector.
> Typically 86400s, a day is seconds, is sufficient, however longer times might be appropriate to ensure time to reinstall.

### Prerequisites

Review and note connector parameter context values.
If you’re reinstalling the connector in the same runtime, you can reuse the existing context.
If the new instance is located in a different runtime, you must re-enter all parameters.

1. Finish processing all in-flight FlowFiles in the existing connector, then stop the connector.

   1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
   2. In the navigation menu, select Ingestion » Openflow.
   3. Select Launch Openflow.
   4. In the Openflow pane select the Runtimes tab.
   5. Select the runtime containing the connector.
   6. Select the connector.
   7. Stop the topmost processor Set Tables for Replication in the Snapshot Load group.
   8. Stop the topmost processor Read MySQL CDC Stream in the Incremental Load group.
   9. If you changed the value of the Merge Task Schedule CRON parameter, return it to `* * * * * ?`, otherwise queues will not be emptied until the next scheduled run.

      Wait until all FlowFiles in the connector have been processed, and all queues are empty.
      When all FlowFiles have been processed, the Queued value on the connector’s processor group becomes zero.
      If there are any items left in the original connector’s queues, there may be data gaps when the new connector starts.
   10. Stop all processors and controller services in the connector.
   > **Caution:**
   >
   > The existing connector can remain in the runtime and doesn’t interfere with the new instance, as long as it remains stopped.
2. Create a new instance of the connector. If you’re using the same runtime as the original connector, you can choose to keep the existing parameter contexts and reuse the settings.
3. If you’re installing into a different runtime or you deleted the previous parameter contexts, enter the configuration settings into the new parameter contexts,
   including the table names and patterns as described in [Set up the Openflow Connector for MySQL](setup.md).
4. Navigate to the `MySQL Ingestion Parameters` context, and set the following parameters:

   * Set the `Ingestion Type` parameter to `incremental`. For more information on the concerns see [Enable incremental replication without snapshots](incremental-replication.md).
   * Set the `Starting Binlog Position` parameter to `Earliest`.
     For more information and potential concerns see Specify load from binary log position.
5. Start the new connector.

### Usage notes

The new connector uses the existing destination tables that were created by the original connector, but the connector creates new journal tables.

## Specify load from binary log position

The Openflow Connector for MySQL connector allows you to select the starting position where MySQL binary logs are read.
By default the connector reads from the latest available position. Alternatively, you can choose the earliest position available on the source instance.
Choosing to start from the earliest position is common when reinstalling the connector.
This allows the new instance to catch up and continue replicating existing tables without having to snapshot each again.

Note that switching a running connector from latest to earliest position cause the entire available binary log
to be re-read, re-processed, and re-applied to the destination table.

> **Warning:**
>
> While the binary log is being re-read, the columns and data in affected destination tables
> can become out of sync with their sources until all events have been re-processed and merged.

The following parameters control snapshot loads are available in the `Ingestion Parameters` context:

| Parameter | Description |
| --- | --- |
| Starting Binlog Position | * `Latest` (default): CDC stream reading starts at the latest available position and continues from there. * `Earliest`: Switches the incremental load to start, or restart reading from the earliest available   binary log position. |
| Re-read Tables in State | * `New` (default):   While re-reading the binary log, only those events will be processed   from new tables added to replication after the re-reading started.   Other events are discarded until the connector reaches the position just before re-reading started. * `Any active`: Re-read and re-process events from any table currently in replication. |

To determine whether the connector finished re-reading the binary log:

1. Navigate to the Openflow canvas.
2. Open the Incremental Load process group.
3. Right-click the topmost processor named Read MySQL CDC Stream, then select View state.
4. Compare the state entries:

   * binlog.position.rewind: the latest position the processor read before re-reading of the binary log started.
   * binlog.position.dml: the current latest position read by the processor. As long as this value is lower than the rewind value above, the processor is still re-reading the binary log.

### Usage notes

* After a running connector is switched to read from the earliest position, and starts running,
  the process cannot be reconfigured or cancelled, and will continue until the currently-read position reaches the position from before it started.
* Switching to the earliest position on a running connector will, for any tables being re-processed,
  finish their existing journals, and create new journal tables.
* If the binary log contains events from a previous table that was dropped
  and re-created in the source database, the re-reading the stream re-processes all events in the current destination.
  The connector cannot distinguish between a previous and current source table if they share the same name.

---
title: Openflow Connector for MySQL: Set up incremental replication without snapshots
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/mysql/incremental-replication.md
section: Loading & Unloading Data
---

# Openflow Connector for MySQL: Set up incremental replication without snapshots

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for MySQL connector can be configured to immediately start replicating incremental changes for newly added tables,
bypassing snapshots. Incremental load is useful when reinstalling the connector over previously replicated data
and to continue replication without snapshotting every table again.

Incremental replication can be enabled in a new instance of the connector, or in an existing one.

To enable incremental replication in a new instance of the connector perform the following tasks:

1. Setup the connector as described in [Set up the Openflow Connector for MySQL](setup.md).
2. In the `MySQL Ingestion Parameters` context, set the `Ingestion Type` parameter to `incremental`.

## Enable incremental replication without snapshots

To enable incremental replication on an existing connector:

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. In the Openflow pane select the Runtimes tab.
4. Select the runtime containing the connector.
5. Select the connector.
6. In the `Ingestion Parameters` context, specify `Ingestion Type` = `incremental`.
7. Add new replication tables. These tables immediately switch to their incremental load.

> **Note:**
>
> To return to replicating tables with the snapshot load, change Ingestion Type from `incremental` to `full`.

### Usage notes

* Changing the value of Ingestion Type does not impact any tables that have begun replicating data.
  Tables currently in the snapshot phase continue until the snapshot load is complete.
* While Ingestion Type is set to `incremental`, new tables added to the list of replicated tables bypass the snapshot phase.
  This includes new tables added to the source database that match the `Included Table Regex` parameter.
  Ensure that the ingestion type is set to `incremental` to bypass the snapshot phase.

  > **Note:**
  >
  > Connectors should only remain in `incremental` mode as long as required as it bypasses snapshots.
  > Once customer needs for incremental updates have been satisfied the connector should be returned to `full` mode.
* For tables that bypass snapshot load, the connector creates a destination table in Snowflake,
  by executing `CREATE TABLE IF NOT EXISTS`, only if no destination table already exists.
  Tables going through the snapshot require that no destination table exist.

---
title: Openflow Connector for Oracle: Configure the Oracle database
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/setup-oracledb.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Configure the Oracle database

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how to set up the Oracle database for Openflow Connector for Oracle.

> **Note:**
>
> Your Oracle database setup depends on your organization’s security policies
> and database architecture. For example, if tables reside in a Container
> Database (CDB), a Pluggable Database (PDB), multiple PDBs, or a combination.
>
> The steps provided in this topic are examples only. Modify them
> as required for your environment.

As an Oracle database administrator, perform the following procedures on your source database:

1. Configure the retention period for archived redo logs
2. Enable XStream and supplemental logging
3. Create the XStream administrator user
4. Grant XStream administrator privileges
5. Configure XStream server connect user
6. Create XStream Outbound Server
7. Set up the XStream Outbound Server Connect User
8. Set up the XStream Outbound Server Capture User
9. (Optional) Configure SSL connections (optional)

> **Note:**
>
> The steps in this topic are written for a multi-tenant architecture with a Container
> Database (CDB) and one or more Pluggable Databases (PDB). If your Oracle database uses a single-tenant
> architecture, see Set up XStream for single-tenant databases.

## Configure the retention period for archived redo logs

You must enable the `ARCHIVELOG` mode to ensure that change data is available for replication.

If you use AWS RDS for Oracle, you must also configure the retention period for archived redo logs.
Determine this period based on the volume of changes in the source database and your storage capacity.

To set the retention period, for example to 24 hours, follow the procedures in the following table:

| Database version | Procedure |
| --- | --- |
| AWS RDS (Standard) | Run the following:  ```sqlexample begin     rdsadmin.rdsadmin_util.set_configuration(         name  => 'archivelog retention hours',         value => '24'); end; / commit; ```  For more information see [Retaining archived redo logs](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Appendix.Oracle.CommonDBATasks.RetainRedoLogs.html). |
| AWS RDS Custom | 1. Create a text file named `/opt/aws/rdscustomagent/config/redo_logs_custom_configuration.json`. 2. Add a JSON object to this file in the following format: `{"archivedLogRetentionHours" : "24"}`.   For more information see [Restoring an RDS Custom for Oracle instance](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/custom-backup.pitr.html). |

## Enable XStream and supplemental logging

> **Note:**
>
> XStream is included with Oracle Database and does not require any additional software.

To enable and configure XStream replication to capture and stream change data, run the following commands:

1. Enable XStream replication:

```sqlexample
ALTER SYSTEM SET enable_goldengate_replication=TRUE SCOPE=BOTH;

ALTER SYSTEM SET STREAMS_POOL_SIZE = 2560M;
```

> **Note:**
>
> Snowflake recommends setting the streams pool size to 2.5 GB. This allocation covers the following:
>
> * 1 GB for Capture
> * 1 GB for Apply
> * An additional 25% buffer

To enable supplemental logging to ensure that the redo logs capture the information required for logical
replication, run the following commands:

1. Confirm that the database is in ARCHIVELOG mode as shown in the following example:

   ```sqlexample
   SELECT LOG_MODE, FORCE_LOGGING FROM V$DATABASE;
   ```

   Snowflake recommends forcing logging on database or table space level.
2. Set the container to the root container and add supplemental logging to the database:

   ```sqlexample
   ALTER SESSION SET CONTAINER = CDB$ROOT;
   ALTER DATABASE ADD SUPPLEMENTAL LOG DATA (ALL) COLUMNS;
   ```

   Alternatively, you can enable logging only on specific tables as shown in the following example:

   ```sqlexample
   ALTER TABLE schema_name.table_name ADD SUPPLEMENTAL LOG DATA (ALL) COLUMNS;
   ```

## Create the XStream administrator user

An XStream administrator user is required to manage XStream components, including the
creation and alteration of outbound servers.
You can either create a dedicated user for this purpose or use an existing user,
provided that the necessary XStream administration privileges are granted (see the next section).

The following example details the setup of a dedicated XStream administrator user in the root container of a CDB.

> **Note:**
>
> The following example assumes that the database also has a PDB containing tables to be replicated.

Connect as SYSDBA or a user with appropriate privileges and run the following commands:

```sqlexample
-- Switch to the root container.
ALTER SESSION SET CONTAINER = CDB$ROOT;

--  Create a tablespace for the XStream administrator user.
CREATE TABLESPACE xstream_adm_tbs DATAFILE '/path/to/your/cdb/xstream_adm_tbs.dbf'
   SIZE 25M REUSE AUTOEXTEND ON MAXSIZE UNLIMITED;

-- Switch to the Pluggable Database (PDB) and create a tablespace there.
ALTER SESSION SET CONTAINER = YOUR_PDB_NAME;

CREATE TABLESPACE xstream_adm_tbs DATAFILE '/path/to/your/pdb/xstream_adm_tbs.dbf'
   SIZE 25M REUSE AUTOEXTEND ON MAXSIZE UNLIMITED;

-- Switch back to the root container to create the common user.
ALTER SESSION SET CONTAINER = CDB$ROOT;

-- Create the XStream administrator user.
-- Note  'c##' prefix indicates a common user in a CDB environment, and CONTAINER=ALL grants privileges across all containers.
-- Replace "YOUR_XSTREAM_ADMIN_PASSWORD" with a strong, secure password.

CREATE USER c##xstreamadmin IDENTIFIED BY "YOUR_XSTREAM_ADMIN_PASSWORD"
   DEFAULT TABLESPACE xstream_adm_tbs
   QUOTA UNLIMITED ON xstream_adm_tbs
   CONTAINER=ALL;
```

## Grant XStream administrator privileges

Connect as SYSDBA or a user with appropriate privileges and grant the required privileges
to the XStream administrator user.

1. Grant the CREATE SESSION privilege to the XStream administrator:

   ```sqlexample
   GRANT CREATE SESSION TO c##xstreamadmin CONTAINER=ALL;
   ```
2. Grant XStream capture privileges using one of the following commands, depending on your Oracle Database version:

   | Database version | Command |
   | --- | --- |
   | Oracle Database 21c and earlier | Run the following:  ```sqlexample BEGIN   DBMS_XSTREAM_AUTH.GRANT_ADMIN_PRIVILEGE(     grantee                 => 'c##xstreamadmin',     privilege_type          => 'CAPTURE',     grant_select_privileges => TRUE,     container               => 'ALL'); END; / ``` |
   | Oracle Database 23c and later | Oracle Database 23c introduced a dedicated `XSTREAM_CAPTURE` system privilege. Run the following:  ```sqlexample GRANT XSTREAM_CAPTURE TO c##xstreamadmin CONTAINER=ALL; ``` |

## Configure XStream server connect user

The Snowflake Openflow Connector uses a dedicated connect user to establish a connection to the XStream Outbound Server and receive change data.
This user requires specific privileges to facilitate replication:

* **Read from XStream Outbound Server**: The user must be able to access the change data stream from the configured XStream Outbound Server.
* **Select from Data Dictionary Views**: The connect user needs SELECT access to various data dictionary views.
  This can be achieved by granting SELECT_CATALOG_ROLE or SELECT ANY DICTIONARY.
  If granting SELECT ANY DICTIONARY is not desired due to company policy, the user specifically needs SELECT access to the following views:

  + ALL_USERS
  + ALL_TABLES
  + ALL_TAB_COLS
  + ALL_CONS_COLUMNS
  + ALL_CONSTRAINTS
  + V$DATABASE
* **Select from Source Tables**: The user must have SELECT privileges on all tables that are intended for replication.

The following is an example of how to set up such a user in the root container of the CDB.
The example assumes that the database also has a PDB containing tables to be replicated.

```sqlexample
-- Connect as SYSDBA or a user with appropriate privileges
-- Switch to the root container.

ALTER SESSION SET CONTAINER = CDB$ROOT;

-- Create the connect user.
-- Replace "YOUR_CAPTURE_USER_PASSWORD" with a strong, secure password.
CREATE USER c##connectuser IDENTIFIED BY "YOUR_CAPTURE_USER_PASSWORD"
    CONTAINER=ALL;

-- Grant necessary privileges to the connect user.
-- You can choose to grant access to specific tables
-- instead of SELECT ANY TABLE for more granular control,
-- for example, GRANT SELECT ON schema.table TO c##connectuser;
GRANT CREATE SESSION, SELECT_CATALOG_ROLE, SELECT ANY TABLE TO c##connectuser CONTAINER=ALL;
```

## Create XStream Outbound Server

The XStream Outbound Server captures changes from redo logs for consumption by the Openflow Connector. Define which schemas or tables to replicate.
For more information see [DBMS_XSTREAM_ADM.CREATE_OUTBOUND Documentation](https://docs.oracle.com/en/database/oracle/oracle-database/19/arpls/DBMS_XSTREAM_ADM.html#GUID-A602ED86-0F5A-4A27-92A0-55D5ADC0AF0D).

Important considerations for replication scope:

* If a table is included in the XStream Outbound filtering rules command, it will not be replicated.
* A table or schema included here must also be defined in the connector parameters for it to be replicated.
  You can include an entire schema in the server filtering rules and later, in the connector parameters,
  specify only certain tables within that schema for replication.

> **Note:**
>
> The XStream Outbound Server can only be created from root container. However,
> starting with Oracle Database version 23ai, it can also be created on the PDB level.

To avoid a significant hit to your CPU and network, and to prevent your queues from being filled with irrelevant data, it’s essential to use a granular approach. The best way to do this is with the DBMS_XSTREAM_ADM.ADD_TABLE_RULES procedure, which lets you choose only the specific tables
you need.

The following examples show how to set up the XStream Outbound Server based on different replication needs. In practice, when setting up your XStream Outbound Server on your production environment, you should be selective about what changes you capture. Capturing everything can have serious consequences for your database’s performance and resource usage.

For information on how to configure XStream Outbound Server, see
[Configuring XStream Out](https://docs.oracle.com/en/database/oracle/oracle-database/19/xstrm/configuring-xstream-out.html#GUID-A1C8430E-565B-4F66-8E00-495F283AAAFB).

**Example 1:** Capture all tables from all schemas in the root container and all PDBs

```sqlexample
-- Connect as a user with XStream admin privileges to the root container.
-- Ensure serveroutput is enabled to see messages from the PL/SQL block.
SET SERVEROUTPUT ON;

DECLARE
    tables  DBMS_UTILITY.UNCL_ARRAY;
    schemas DBMS_UTILITY.UNCL_ARRAY;
BEGIN
   -- To replicate all tables in all schemas across all containers, set both to NULL.
   tables(1) := NULL;
   schemas(1) := NULL;
   DBMS_XSTREAM_ADM.CREATE_OUTBOUND(
       server_name => 'XOUT1',
       table_names => tables,
       schema_names => schemas,
       include_ddl => TRUE
   );
   DBMS_OUTPUT.PUT_LINE('XStream Outbound Server created.');
   EXCEPTION
   WHEN OTHERS THEN
       DBMS_OUTPUT.PUT_LINE('Error creating XStream Outbound Server: ' || SQLERRM);
       RAISE;
END;
/
```

**Example 2:** Capture all tables from a single schema in a Pluggable Database (PDB)

```sqlexample
-- Connect as a user with XStream admin privileges to the root container.
-- Ensure serveroutput is enabled to see messages from the PL/SQL block.
SET SERVEROUTPUT ON;

DECLARE
    tables  DBMS_UTILITY.UNCL_ARRAY;
    schemas DBMS_UTILITY.UNCL_ARRAY;
BEGIN
    -- To replicate all tables in a schemas in the single PDB, set source_container_name.
    tables(1) := NULL;
    schemas(1) := 'schema_name';
    DBMS_XSTREAM_ADM.CREATE_OUTBOUND(
        server_name => 'XOUT1',
        table_names => tables,
        schema_names => schemas,
        include_ddl => TRUE,
        source_container_name => 'YOUR_PDB_NAME'
    );
    DBMS_OUTPUT.PUT_LINE('XStream Outbound Server created.');
EXCEPTION
    WHEN OTHERS THEN
        DBMS_OUTPUT.PUT_LINE('Error creating XStream Outbound Server: ' || SQLERRM);
      RAISE;
END;
/
```

## Set up the XStream Outbound Server Connect User

Set the connect user on the XStream Outbound Server. This ensures that the previously created connect user is associated with the XStream Outbound Server (XOUT1), allowing it to receive change data.

> **Note:**
>
> The following example assumes that the connect user is c##connectuser.

```sqlexample
BEGIN
    DBMS_XSTREAM_ADM.ALTER_OUTBOUND(
        server_name  => 'XOUT1',
        connect_user => 'c##connectuser');
   END;
/
```

## Set up the XStream Outbound Server Capture User

> **Note:**
>
> If you want the data to be captured by the same user that created the server (the administrator), skip this section.

If you configured a separate capture user, configure the XStream Outbound Server to run
as this user. This ensures that the dedicated capture user is associated with the XStream Outbound Server (XOUT1), allowing that user to capture change data.

```sqlexample
BEGIN
    DBMS_XSTREAM_ADM.ALTER_OUTBOUND(
        server_name  => 'XOUT1',
      capture_user => 'yourcaptureuser');
END;
/
```

## Set up XStream for single-tenant databases

The default architecture for Oracle 12c and later is a multi-tenant architecture with
a Container Database (CDB) and one or more Pluggable Databases (PDB).

If your Oracle database uses a single-tenant architecture, note the following
differences in setting up XStream:

* Do not use `ALTER SESSION SET CONTAINER` commands. In a single-tenant
  database, there is only one instance, so container switching does not apply.
* Create only one `xstream_adm_tbs` tablespace. Do not create a second
  tablespace in a PDB.
* Do not use the `C##` prefix on user names. For example, create
  `xstreamadmin` instead of `c##xstreamadmin` and `connectuser` instead
  of `c##connectuser`. The `C##` prefix is required only in multi-tenant
  environments.
* Do not include `CONTAINER=ALL` or `container => 'ALL'` in any commands.
  These clauses grant privileges across multiple containers and do not apply
  in a single-tenant database.

## Configure SSL connections (optional)

The Openflow Connector for Oracle supports encrypted SSL connections to the Oracle database using the TCPS
(TCP with SSL) protocol. When SSL is enabled, both the database connection and the XStream connection use encrypted communication.

To use SSL, you must:

1. Enable TCPS on the Oracle database
2. Create a client wallet

### Enable TCPS on the Oracle database

You must configure the Oracle database to accept connections using the TCPS protocol.
Follow the procedure for your database environment.

#### On-premises / OCI

1. Create an SSL server wallet with the server certificate.
2. Configure the `listener.ora` to include a TCPS endpoint (default port 2484).
3. Configure the `sqlnet.ora` to reference the server wallet.
4. Restart the listener.

For more information, see
[Configuring Transport Layer Security Encryption](https://docs.oracle.com/en/database/oracle/oracle-database/23/dbseg/configuring-transport-layer-security-encryption.html).

#### AWS RDS (Standard)

1. Add the Oracle SSL option to the option group associated with the DB instance.
2. Specify the SSL port (for example, 2484).

For more information, see
[Oracle Secure Sockets Layer](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Appendix.Oracle.Options.SSL.html).

### Create a client wallet

After TCPS is enabled on the database, create an Oracle auto-login wallet (`cwallet.sso`)
containing the server’s trusted certificate. This wallet is provided to the connector so
that it can verify the server during the SSL handshake.

1. Export the server certificate from the Oracle database server as a PEM file.
2. Use the Oracle `orapki` utility to create a client wallet and import the server certificate:

   ```bash
   orapki wallet create -wallet /path/to/client/wallet -pwd <wallet_password> -auto_login

   orapki wallet add -wallet /path/to/client/wallet -pwd <wallet_password> \
      -trusted_cert -cert /path/to/server-cert.pem
   ```
3. Copy the generated `cwallet.sso` file to a location accessible by the Openflow runtime.

> **Note:**
>
> For AWS RDS, download the root certificate from AWS instead of exporting it from the
> database server. For more information, see
> [Connecting to an RDS for Oracle DB instance using SSL](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/Appendix.Oracle.Options.SSL.Connecting.html).

For more information, see
[Using the orapki Utility to Manage PKI Elements](https://docs.oracle.com/en/database/oracle/oracle-database/23/dbseg/using-the-orapki-utility-to-manage-pki-elements.html).

## Next steps

[Configure the connector](setup-connector.md).

---
title: Openflow Connector for Oracle: Data mapping
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/data-mapping.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Data mapping

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how Oracle data types are mapped to Snowflake data types when replicating data.

## Oracle to Snowflake data type mapping

The following table shows how Oracle data types are mapped to Snowflake data types
when replicating data.

| Oracle type | Snowflake type | Notes |
| --- | --- | --- |
| NUMBER | NUMBER | If precision is undefined, mapped to NUMBER(38, 19). If precision or scale exceeds Snowflake limitations (precision > 38 or scale > 37), the value is stored as TEXT. |
| FLOAT | FLOAT |  |
| BINARY_FLOAT | FLOAT |  |
| BINARY_DOUBLE | FLOAT |  |
| CHAR | TEXT |  |
| VARCHAR2 | TEXT |  |
| NCHAR | TEXT |  |
| NVARCHAR2 | TEXT |  |
| CLOB | TEXT | Supported up to the maximum entry size in Snowflake (16 MB). |
| NCLOB | TEXT | Supported up to the maximum entry size in Snowflake (16 MB). |
| LONG | TEXT |  |
| DATE | TIMESTAMP_NTZ |  |
| TIMESTAMP | TIMESTAMP_NTZ |  |
| TIMESTAMP WITH TIME ZONE | TIMESTAMP_TZ |  |
| TIMESTAMP WITH LOCAL TIME ZONE | TIMESTAMP_LTZ |  |
| INTERVAL | TEXT |  |
| INTERVAL YEAR TO MONTH | TEXT |  |
| INTERVAL DAY TO SECOND | TEXT |  |
| RAW | BINARY |  |
| LONG RAW | BINARY |  |
| BLOB | BINARY | Supported up to the maximum entry size in Snowflake (16 MB). |
| BOOLEAN | BOOLEAN |  |
| JSON | VARIANT | Supported up to the maximum entry size in Snowflake (16 MB). |
| XMLTYPE | TEXT |  |

> **Note:**
>
> Any Oracle data types not listed in this table are mapped to TEXT by default.

## Next steps

Review [Set up tasks for the Openflow Connector for Oracle](setup-tasks.md) to set up the connector.

---
title: Openflow Connector for Oracle: Enable and manage commercial terms
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/manage-commercial-terms.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Enable and manage commercial terms

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how to enable the Openflow Connector for Oracle in the list of available connectors and manage
the licensing lifecycle.

> **Note:**
>
> This task must be performed by the organization administrator (ORGADMIN).

Setting up the Openflow Connector for Oracle is a two-stage process. First, enable Oracle XStream services to make
the connector available for installation. Then, finalize the license configuration after
the connector detects your source database inventory.

## Part 1: Enable service (pre-installation)

By default, the Openflow Connector for Oracle is not displayed in the list of available connectors. You must accept the
[Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/)
terms to make it available for installation. This is required for all license models.

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. Locate the item Oracle Connector Terms in the list.
4. Select Review & Enable.

After you complete these steps, the following changes take effect:

* The Openflow Connector for Oracle listing becomes visible in the list of available connectors.
* A new tab titled Openflow for Oracle appears in the Admin » Terms tab.

## Part 2: License setup and lifecycle

Complete the steps for the license model you selected during configuration:

* Option A: Embedded license (Snowflake-provided)
* Option B: Independent license / BYOL

### Option A: Embedded license (Snowflake-provided)

For this licensing model, you must activate the trial to enable the connector.

> **Note:**
>
> Even if you install the connector, data replication does not start until this step is complete.

#### Step 1: Start the trial (prerequisite)

To start the trial:

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. Select Openflow for Oracle tab.
4. Locate the Trial Status card (status: “Ready to Activate”).
5. Select Start Trial.
6. Accept the terms to start the 60-day trial period.

> **Note:**
>
> This action enables the captureChangeOracle processor, allowing it to connect to
> your database.

#### Step 2: Configure connector

After starting the trial, install and configure the connector. For more information,
see [Configure the connector](setup-connector.md).

After the connector successfully connects to the source database, a subscription is
automatically created and displayed in the Openflow for Oracle dashboard.

#### Step 3: Verify inventory

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. Select the Openflow for Oracle tab.
4. Review the Subscription Inventory section.
5. Verify that the CPU core count matches your physical source database hardware.
6. If the core count is incorrect, update the runtime configuration.

#### Step 4: Lifecycle management

For more information about the licensing models and terms, see
[Licensing models and critical constraints](about.md).

The following table describes the actions available at each stage of the embedded
license lifecycle.

| Stage | Action | Result |
| --- | --- | --- |
| Trial period (Day 1 to 60) | Select Cancel Trial in the Openflow for Oracle dashboard before Day 60. | Oracle XStream services stop. No charges are incurred. |
| 36-month commitment (Day 61+) | No action required. If the trial is not canceled, the non-cancelable 36-month term begins automatically on Day 61. | The license can’t be canceled during this period. If your Snowflake agreement is terminated, the full remaining balance is due immediately. |
| Post-term S&M renewal (after month 36) | The license fee drops to $0. The annual Support & Maintenance (S&M) fee continues. You may opt out of S&M renewal in the Openflow for Oracle dashboard. | If you opt out and S&M coverage expires, the connector is permanently locked. To resume, you must purchase a new embedded license, which resets the 36-month commitment. |

### Option B: Independent license / BYOL

If you are using the independent license (Bring Your Own License), no prior trial activation
is required.

#### Step 1: Configure the connector

To set up the connector with the independent/BYOL license, follow the steps in
[Configure the connector](setup-connector.md).

#### Step 2: Verify inventory (recommended)

Verify that Snowflake has correctly identified your database inventory.

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. Select the Openflow for Oracle tab.
4. Review the database inventory details.

> **Note:**
>
> The Start Trial button does not appear for this license model, and the
> 36-month lifecycle rules do not apply. You are responsible for maintaining a valid
> Oracle license that includes XStream entitlements.

---
title: Openflow Connector for Oracle: Maintenance
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/maintenance.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Maintenance

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes maintenance tasks for the Openflow Connector for Oracle, such as reinstalling the
connector or setting the starting redo log position.

These operations are often used in conjunction with [Incremental replication with snapshots](incremental-replication.md).

## Reinstall the connector

This section provides instructions on how to reinstall the connector, and continue replicating data for
the same tables without having to snapshot them again.
It covers situations where the new connector is installed in the same runtime, as well as moved to a new runtime.

> **Warning:**
>
> For the connector to continue replicating from the same CDC stream position where it stopped before reinstallation,
> the source database must retain the archived redo logs long enough to cover the time after the prior connector was stopped
> and before the new connector is started.
> Ensure the archived redo log retention period of the Oracle database is high enough, and keep the reinstallation time to a minimum.
>
> Typically a retention period of 24 hours is sufficient, however longer times might be appropriate to ensure time to reinstall.
> For more information on configuring archived redo log retention, see [Openflow Connector for Oracle: Configure the Oracle database](setup-oracledb.md).

### Prerequisites

Review and note connector parameter context values.
If you’re reinstalling the connector in the same runtime, you can reuse the existing context.
If the new instance is located in a different runtime, you must re-enter all parameters.

1. Finish processing all in-flight FlowFiles in the existing connector, then stop the connector.

   1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
   2. In the navigation menu, select Ingestion » Openflow.
   3. Select Launch Openflow.
   4. In the Openflow pane select the Runtimes tab.
   5. Select the runtime containing the connector.
   6. Select the connector.
   7. Stop the topmost processor Set Tables for Replication in the Snapshot Load group.
   8. Stop the topmost processor Read Oracle CDC Stream in the Incremental Load group.
   9. If you changed the value of the Merge Task Schedule CRON parameter, return it to `* * * * * ?`, otherwise queues will not be emptied until the next scheduled run.

      Wait until all FlowFiles in the connector have been processed, and all queues are empty.
      When all FlowFiles have been processed, the Queued value on the connector’s processor group becomes zero.
      If any items remain in the original connector’s queues, data gaps might occur when the new connector starts.
   10. Stop all processors and controller services in the connector.
   > **Caution:**
   >
   > The existing connector can remain in the runtime and doesn’t interfere with the new instance, as long as it remains stopped.
2. Create a new instance of the connector. If you’re using the same runtime as the original connector, you can choose to keep the existing parameter contexts and reuse the settings.
3. If you’re installing into a different runtime or you deleted the previous parameter contexts, enter the configuration settings into the new parameter contexts,
   including the table names and patterns as described in [Install and configure the Openflow Connector for Oracle](setup-connector.md).
4. Navigate to the `Oracle Ingestion Parameters` context, and set the following parameters:

   * Set the `Ingestion Type` parameter to `incremental`. For more information on the concerns see [Enable incremental replication without snapshots on an existing connector](incremental-replication.md).
   * Set the `Starting Redo Log Position` parameter to `Earliest`.
     For more information and potential concerns see Alter XStream outbound server.
5. Start the new connector.

### Usage notes

The new connector uses the existing destination tables that were created by the original connector, but the connector creates new journal tables.

## Alter XStream outbound server

The connector regularly updates the XStream server with the latest SCN position it processed. If the connector
is reinstalled and connects to the same XStream outbound server, it will resume reading from the SCN position where it left off.
This SCN number can be checked with:

```sql
SELECT PROCESSED_LOW_SCN
FROM DBA_XSTREAM_OUTBOUND_PROGRESS
WHERE SERVER_NAME = 'XOUT1';
```

If you want to re-read data from an earlier position, you must first change the start SCN of the XStream server:

```sql
BEGIN
    DBMS_XSTREAM_ADM.ALTER_OUTBOUND(
        server_name => 'XOUT1',
        start_scn => <start_scn>
    );
END;
/
```

The value of `<start_scn>` must be a valid SCN within the range of available redo logs. The lowest SCN that the start position can be reset to can be checked with:

```sql
SELECT REQUIRED_CHECKPOINT_SCN
FROM DBA_CAPTURE
WHERE CLIENT_NAME = 'XOUT1';
```

This is the lowest SCN for which the capture process requires redo information.

## Specify load from XStream position

The Openflow Connector for Oracle connector allows you to select the starting position where Oracle redo logs are read.
By default the connector reads from the latest available position. Alternatively, you can choose the earliest position available on the source instance.
Choosing to start from the earliest position is common when reinstalling the connector.
This allows the new instance to catch up and continue replicating existing tables without having to snapshot each again.

> **Note:**
>
> Switching a running connector from latest to earliest position causes the entire available redo logs
> to be re-read, re-processed, and re-applied to the destination table.

> **Warning:**
>
> While the redo logs are being re-read, the columns and data in affected destination tables
> can become out of sync with their sources until all events have been re-processed and merged.

The following parameters are available in the `Ingestion Parameters` context:

| Parameter | Description |
| --- | --- |
| Starting XStream Position | * `Latest` (default): CDC stream reading starts at the latest available position and continues from there. * `Earliest`: Switches the incremental load to start, or restart reading from the earliest available   XStream position. |
| Re-read Tables in State | * `New` (default):   While re-reading the redo logs, only those LCRs (Logical Change Records) will be processed   from new tables added to replication after the re-reading started.   Other LCRs are discarded until the connector reaches the position just before re-reading started. * `Any active`: Re-read and re-process events from any table currently in replication. |

To determine whether the connector finished re-reading the redo logs:

1. Navigate to the Openflow canvas.
2. Open the Incremental Load process group.
3. Right-click the topmost processor named Read Oracle CDC Stream, then select View state.
4. Compare the state entries:

   * lcr.position.rewind: the latest position the processor read before re-reading of the redo logs started.
   * lcr.position.last: the current latest position read by the processor. As long as this value is lower than the rewind value above, the processor is still re-reading the redo logs.

### Usage notes

* After a running connector is switched to read from the earliest position, and starts running,
  the process can’t be reconfigured or cancelled, and continues until the currently-read position reaches the position from before it started.
* Switching to the earliest position on a running connector will, for any tables being re-processed,
  finish their existing journals, and create new journal tables.
* If the redo log contains events from a previous table that was dropped
  and re-created in the source database, the re-reading the stream re-processes all events in the current destination.
  The connector can’t distinguish between a previous and current source table if they share the same name.

> **Note:**
>
> Schema changes (such as ALTER TABLE statements that add or drop columns) are not supported
> while re-reading the redo logs from the earliest position. If any table’s schema was
> altered between the earliest available SCN and the current position, that table should
> be removed from replication and re-added with a fresh snapshot instead.

---
title: Openflow Connector for Oracle: Set up incremental replication without snapshots
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/incremental-replication.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Set up incremental replication without snapshots

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how to configure the Openflow Connector for Oracle connector to start replicating incremental changes for newly added tables immediately, bypassing snapshots. This configuration is useful when you reinstall the connector over previously replicated data and want to continue replication without snapshotting every table again.

You can enable incremental replication on either a new or an existing connector instance.

## Enable incremental replication without snapshots on a new connector

To enable incremental replication on a new connector instance:

1. Set up the connector as described in [Install and configure the Openflow Connector for Oracle](setup-connector.md).
2. In the `Oracle Ingestion Parameters` context, set the `Ingestion Type` parameter to `incremental`.

## Enable incremental replication without snapshots on an existing connector

To enable incremental replication on an existing connector:

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. In the Openflow pane select the Runtimes tab.
4. Select the runtime containing the connector.
5. Select the connector.
6. In the `Ingestion Parameters` context, specify `Ingestion Type` = `incremental`.
7. Add new replication tables. These tables immediately switch to their incremental load.

> **Note:**
>
> To return to replicating tables with the snapshot load, change Ingestion Type from `incremental` to `full`.

### Usage notes

* Changing the value of Ingestion Type does not impact any tables that have begun replicating data.
  Tables currently in the snapshot phase continue until the snapshot load is complete.
* While Ingestion Type is set to `incremental`, new tables added to the list of replicated tables bypass the snapshot phase.
  This includes new tables added to the source database that match the `Included Table Regex` parameter.
  Ensure that the ingestion type is set to `incremental` to bypass the snapshot phase.

  > **Note:**
  >
  > Connectors should only remain in `incremental` mode as long as required as it bypasses snapshots.
  > Once customer needs for incremental updates have been satisfied the connector should be returned to `full` mode.
* For tables that bypass snapshot load, the connector creates a destination table in Snowflake,
  by executing `CREATE TABLE IF NOT EXISTS`, only if no destination table already exists.
  Tables going through the snapshot require that no destination table exist.

---
title: Openflow Connector for Oracle: Set up Snowflake
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/setup-snowflake.md
section: Loading & Unloading Data
---

# Openflow Connector for Oracle: Set up Snowflake

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how to set up your Snowflake environment for the
Openflow Connector for Oracle.

As a Snowflake administrator, perform the following tasks:

1. Create a destination database in Snowflake to store the replicated data:

   ```sqlexample
   CREATE DATABASE <destination_database>;
   ```
2. Create a Snowflake [service user](../../../../../sql-reference/sql/create-user.md):

   ```sqlexample
   CREATE USER <openflow_user>
     TYPE = SERVICE
     COMMENT='Service user for automated access of Openflow';
   ```
3. Create a Snowflake role for the connector and grant the required
   privileges:

   ```sqlexample
   CREATE ROLE <openflow_role>;
   GRANT ROLE <openflow_role> TO USER <openflow_user>;
   GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_role>;
   GRANT CREATE SCHEMA ON DATABASE <destination_database>
     TO ROLE <openflow_role>;
   ```

   Use this role to manage the connector’s access to the Snowflake database.

   To create objects in the destination database, you must grant the
   [USAGE and CREATE SCHEMA privileges](../../../../security-access-control-privileges.md)
   on the database to the role used to manage access.
4. Create a Snowflake warehouse for the connector and grant the required
   privileges:

   ```sqlexample
   CREATE WAREHOUSE <openflow_warehouse> WITH
     WAREHOUSE_SIZE = 'XSMALL'
     AUTO_SUSPEND = 300
     AUTO_RESUME = TRUE;
   GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse>
     TO ROLE <openflow_role>;
   ```

   Snowflake recommends starting with a XSMALL warehouse size, then
   experimenting with size depending on the number of tables being
   replicated and the amount of data transferred. Large numbers of tables
   typically scale better with multi-cluster warehouses, rather than a
   larger warehouse size. For more information, see
   [multi-cluster warehouses](../../../../warehouses-multicluster.md).
5. Set up the public and private keys for key pair authentication:

   1. Create a pair of secure keys (public and private).
   2. Store the private key for the user in a file to supply to the
      connector’s configuration.
   3. Assign the public key to the Snowflake service user:

      ```sqlexample
      ALTER USER <openflow_user> SET RSA_PUBLIC_KEY = 'thekey';
      ```

      For more information, see [Key-pair authentication and key-pair rotation](../../../../key-pair-auth.md).

## Next steps

[Configure the connector](setup-connector.md).

---
title: Openflow Connector for PostgreSQL Maintenance
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/postgres/maintenance.md
section: Loading & Unloading Data
---

# Openflow Connector for PostgreSQL Maintenance

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes important maintenance considerations and best practices for
maintaining the Openflow Connector for PostgreSQL when making changes to the source PostgreSQL database.
In addition, this topic describes how to restart table replication and reinstall the connector.

## Restart table replication

A table in FAILED state — for example, due to a missing primary key or unsupported schema change — does not restart automatically. If a table enters a FAILED state or you need to restart replication from scratch, use the following procedure to remove and re-add the table to replication.

> **Note:**
>
> If the failure was caused by an issue in the source table such as a missing primary key, resolve that issue in the source database before continuing.

1. Remove the table from flow parameters: In the Ingestion Parameters context, either remove the table from the Included Table Names or modify the Included Table Regex so the table is no longer matched.
2. Verify the table has been removed:

   1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services.
   2. In the table listing controller services, locate the Table State Store row, click the three vertical dots on the right side of the row, then choose View State.
   > **Important:**
   >
   > You must wait until the table’s state is fully removed from this list before proceeding. Do not continue until this configuration change has completed.
3. Clean up the destination: Once the table’s state shows as fully removed, manually [DROP](../../../../../sql-reference/sql/drop-table.md) the destination table in Snowflake. Note that the connector will not overwrite an existing destination table during the snapshot phase; if the table still exists, replication will fail again. Optionally, the journal table and stream can also be removed if they are no longer needed.
4. Re-add the table: Update the Included Table Names or Included Table Regex parameters to include the table again.
5. Verify the restart: Check the Table State Store using the instructions given previously. The state of the table should appear with the status NEW, then transition to SNAPSHOT_REPLICATION, and finally INCREMENTAL_REPLICATION.

## Upgrading PostgreSQL

Upgrading the connector requires a different approach depending on whether PostgreSQL is being upgraded to the next minor or major version.

Minor version upgrades

> * Are data safe.
> * Require no special treatment.
> * Require stopping the connector for the duration of the upgrade to avoid reporting connectivity issues.
> * Continue replicating, after the upgrade, with no data loss.

Major version upgrades

> * Require the PostgreSQL server to drop replication slots, including any used by the connector.
> * Cannot preserve, or migrate replication slots to the new version. See also PostgresSQL 17 and later versions upgrades.
> * Restart replicating all tables from the prior snapshot phase.

To perform a minor version upgrade, do the following:

1. Stop the connector, including all Processors and Controller Services.
2. Upgrade PostgreSQL.
3. Restart the connector.

To perform a major version upgrade, do the following:

1. Remove all tables from replication in the connector.
2. Wait until all queues in the connector are empty.
3. Stop the connector, including all Processors and Controller Services.
4. Open the Incremental Load group in the connector.
5. Right-click the top Processor in the group, Read PostgreSQL CDC Stream, and select View state.
6. Click Clear state.
7. Click Close.
8. Upgrade PostgreSQL.
9. Restart the connector. A new replication slot will be created.
10. Re-add all tables to begin replication.

### PostgresSQL 17 and later versions upgrades

PostgreSQL 17 improved upgrading such that it no longer requires dropping replication slots when upgrading to later versions such as 17.1 » 18.0.
Upgrading to PostgreSQL 17.0 or later from prior versions (16 and earlier) drops replications slots and should be treated as a major upgrade.
Future versions of PostgreSQL may also improve the upgrade process further.

## Reinstall the connector

This section describes how to reinstall the connector.
It covers situations where the new connector is installed in the same runtime, or when it is moved to a new runtime.
Reinstall is often used in conjunction with [Incremental replication with snapshots](incremental-replication.md).

> **Warning:**
>
> For the connector to be able to continue replicating from the same CDC stream position where it stopped before reinstallation,
> the source database must retain the WAL long enough to cover the time since the old connector is stopped and the new connector is started.
> Ensure the `max_wal_size` parameter of the PostgreSQL server is high enough, depending on your traffic, and keep the reinstallation time to a minimum.

### Prerequisites

Review and note connector parameter context values.
If you’re reinstalling the connector in the same runtime, you can reuse the existing context.
If the new instance will be located in a different runtime, you will have to re-enter all parameters.

To reinstall the connector:

1. Finish processing all in-flight FlowFiles in the existing connector, and then stop the connector.

   1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
   2. In the navigation menu, select Ingestion » Openflow.
   3. Select Launch Openflow.
   4. In the Openflow pane select the Runtimes tab.
   5. Select the runtime containing the connector.
   6. Select the connector.
   7. Stop the topmost processor Set Tables for Replication in the Snapshot Load group.
   8. Stop the topmost processor Read PostgreSQL CDC Stream in the Incremental Load group.
   9. If you changed the value of the Merge Task Schedule CRON parameter, return it to `* * * * * ?`, otherwise queues will not be emptied until the next scheduled run.

      Wait until all FlowFiles in the connector have been processed, and all queues are empty.
      When all FlowFiles have been processed, the Queued value on the connector’s processor group becomes zero.
      If there are any items left in the original connector’s queues, there may be data gaps when the new connector starts.
   10. Stop all processors and controller services in the connector.
2. Find and copy the name of the replication slot used by the original connector,
   by viewing the state of the topmost processor in the `Incremental Load` group with name `Read PostgreSQL CDC Stream`.
   The replication slot name is stored under the key `replication.slot.name`.
   Copy the value of the key to a text editor.
3. Create a new instance of the connector. If you’re using the same runtime as the original connector, you can choose to keep the existing parameter contexts, and reuse the settings.

   > **Caution:**
   >
   > The existing connector can remain in the runtime and doesn’t interfere with the new instance, as long as it remains stopped.
4. If you’re installing into a different runtime, or you deleted the previous parameter contexts, enter all the configuration settings into the new parameter contexts,
   including the table names and patterns as described in [Set up the Openflow Connector for PostgreSQL](setup.md).
5. Open the `PostgreSQL Ingestion Parameters` context, and set `Ingestion Type` parameter to `incremental`.
   For more information on the concerns see [Enable incremental replication without snapshots](incremental-replication.md).
6. Open the `PostgreSQL Source Parameters` context, and set the `Replication Slot Name` parameter to the value you copied earlier.
7. Start the new connector.

### Usage notes

The new connector will use the same, existing destination tables that created by the original connector, but will create new journal tables.

---
title: Openflow Connector for PostgreSQL: Data mapping
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/postgres/data-mapping.md
section: Loading & Unloading Data
---

# Openflow Connector for PostgreSQL: Data mapping

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how PostgreSQL data types are mapped
to Snowflake data types.

## PostgreSQL to Snowflake data type mapping

The following table shows how PostgreSQL data types are mapped to Snowflake data types
when replicating data.

| PostgreSQL type | Snowflake type | Notes |
| --- | --- | --- |
| SMALLINT / INT2 | INT |  |
| INTEGER / INT / INT4 | INT |  |
| BIGINT / INT8 | INT |  |
| SMALLSERIAL / SERIAL2 | INT |  |
| SERIAL / SERIAL4 | INT |  |
| BIGSERIAL / SERIAL8 | INT |  |
| NUMERIC / DECIMAL | NUMBER | Scale and precision are preserved within Snowflake limitations. Negative scale is converted to scale 0 with adjusted precision. |
| REAL / FLOAT4 | FLOAT |  |
| DOUBLE PRECISION / FLOAT8 | FLOAT |  |
| MONEY | FLOAT |  |
| BOOLEAN / BOOL | BOOLEAN |  |
| CHARACTER / CHAR / BPCHAR | TEXT |  |
| CHARACTER VARYING / VARCHAR | TEXT |  |
| TEXT | TEXT |  |
| BYTEA | BINARY | Supported up to the maximum entry size in Snowflake (16 MB). |
| DATE | DATE |  |
| TIME / TIME WITHOUT TIME ZONE | TIME |  |
| TIME WITH TIME ZONE / TIMETZ | TIMESTAMP_TZ |  |
| TIMESTAMP / TIMESTAMP WITHOUT TIME ZONE | TIMESTAMP_NTZ |  |
| TIMESTAMP WITH TIME ZONE / TIMESTAMPTZ | TIMESTAMP_LTZ |  |
| INTERVAL | TEXT |  |
| JSON | VARIANT | Supported up to the maximum entry size in Snowflake (16 MB). |
| JSONB | VARIANT | Supported up to the maximum entry size in Snowflake (16 MB). |
| UUID | TEXT |  |
| XML | TEXT |  |
| BIT | TEXT |  |
| BIT VARYING / VARBIT | TEXT |  |
| POINT | TEXT |  |
| LINE | TEXT |  |
| LSEG | TEXT |  |
| BOX | TEXT |  |
| PATH | TEXT |  |
| POLYGON | TEXT |  |
| CIRCLE | TEXT |  |
| CIDR | TEXT |  |
| INET | TEXT |  |
| MACADDR | TEXT |  |
| MACADDR8 | TEXT |  |
| TSVECTOR | TEXT |  |
| TSQUERY | TEXT |  |
| PG_LSN | TEXT |  |

> **Note:**
>
> Any PostgreSQL data types not listed in this table are mapped to TEXT by default.

---
title: Openflow Connector for PostgreSQL: Set up incremental replication without snapshots
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/postgres/incremental-replication.md
section: Loading & Unloading Data
---

# Openflow Connector for PostgreSQL: Set up incremental replication without snapshots

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

The Openflow Connector for PostgreSQL connector can be configured to immediately start replicating incremental changes for newly added tables,
bypassing snapshots. Incremental load is useful when reinstalling the connector over previously replicated data
and to continue replication without snapshotting every table again.

Incremental replication can be enabled in a new instance of the connector or in an existing one.

To enable incremental replication in a new instance of the connector perform the following tasks:

1. Setup the connector as described in [Set up the Openflow Connector for PostgreSQL](setup.md).
2. In the `PostgreSQL Ingestion Parameters` context, set the `Ingestion Type` parameter to `incremental`.

## Enable incremental replication without snapshots

To enable incremental replication on an existing connector:

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. In the Openflow pane select the Runtimes tab.
4. Select the runtime containing the connector.
5. Select the connector.
6. In the `Ingestion Parameters` context, specify `Ingestion Type` = `incremental`.
7. Add new replication tables. These tables immediately switch to their incremental load.

> **Note:**
>
> To return to replicating tables with the snapshot load, change Ingestion Type from `incremental` to `full`.

### Usage notes

* Changing the value of Ingestion Type does not impact any tables that have begun replicating data.
  Tables currently in the snapshot phase continue until the snapshot load is complete.
* While Ingestion Type is set to `incremental`, new tables added to the list of replicated tables bypass the snapshot phase.
  This includes new tables added to the source database that match the `Included Table Regex` parameter.
  Ensure that the ingestion type is set to `incremental` to bypass the snapshot phase.

  > **Note:**
  >
  > Connectors should only remain in `incremental` mode as long as required as it bypasses snapshots.
  > Once customer needs for incremental updates have been satisfied the connector should be returned to `full` mode.
* For tables that bypass snapshot load, the connector creates a destination table in Snowflake,
  by executing `CREATE TABLE IF NOT EXISTS`, only if no destination table already exists.
  Tables going through the snapshot require that no destination table exist.

---
title: Openflow Connector for Salesforce Bulk API: Configure the connector
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/configure-connector.md
section: Loading & Unloading Data
---

# Openflow Connector for Salesforce Bulk API: Configure the connector

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to configure the Openflow Connector for Salesforce Bulk API.

## Install the connector

Follow these steps to install the Openflow Connector for Salesforce Bulk API in an Openflow runtime:

1. Navigate to the Openflow Overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find Openflow connector for Salesforce Bulk API and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, perform the following steps:

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in the table below.

| Parameter | Description |
| --- | --- |
| Column Removal Strategy | Defines the strategy to adopt when a column should be removed in the destination table based on the latest received schema. Three possible values: `Drop Column`, `Rename Column`, `Ignore Column`.   * `Drop Column`: Drop the column from the Snowflake table. * `Rename Column`: Rename the column in the Snowflake table. * `Ignore Column`: Ignore the column, leaving it as is in the Snowflake table. |
| Connected App Key | The private key used for JWT Bearer Flow authentication with Salesforce. Copy-paste the content of the `private.key` file generated during the [Salesforce setup](setup-salesforce.md). This private key must correspond to the public certificate (`public.crt`) uploaded to the external client app in Salesforce. You can also use the next parameter to upload the private key file instead. |
| Connected App Key File | Upload the `private.key` file by selecting the Reference asset checkbox, then upload the file as an asset and select the asset as the value for the parameter. This is an alternative to pasting the key content in the Connected App Key parameter. |
| Connected App Key Password | Password set on the private key file during the [Salesforce setup](setup-salesforce.md) steps. |
| Destination Database | Name of the database in Snowflake where the Salesforce data will be replicated. The database must exist before starting the connector. |
| Destination Schema | Name of the schema, in the database above, into which the connector will create tables for the Salesforce data to be added. The schema must exist before starting the connector. |
| Enable Journal Tables | If set to `true`, a `JOURNAL_<Object Name>` table is created for each synced object that has a `SystemModstamp` or `LastModifiedDate` field. All changes are appended to the journal table, providing a full history of modifications. This is in addition to the main table that contains the merged data for the object. If a full reload occurs for a given object type, its journal table is also recreated. Default: `false`. |
| Enable Views Creation | If set to `true`, a view named `<Object Type>_FORMULA_VW` is created for each synced object that contains formula fields. The view translates supported Salesforce formula expressions into Snowflake SQL, allowing you to query formula results directly without replicating formula field values from Salesforce. See [Salesforce formula fields](about.md) for details. Default: `false`. |
| Filter | Comma-separated list of objects to replicate from Salesforce, or regular expression to apply against all existing objects. The filter is case-insensitive, meaning that a filter set to `account` would match the object type `Account`. Example: `Account, Opportunity, Contact`.  **Note:** If left empty, all objects will be replicated. This is not recommended as there are usually thousands of objects in a Salesforce instance. |
| Incremental Offload | Whether the processor should perform incremental offload. If `true`, the processor will only fetch the records that have been modified since the last query job submission by using a `WHERE` clause on the appropriate timestamp field. If `false`, all records will be fetched at every execution of the connector. |
| Initial Load Chunking | If set to a value other than `NONE`, the initial data load will be split into multiple jobs based on this interval. On the first run for an object, the connector will query Salesforce to find the oldest record and use that as the starting point. Each subsequent job will query the next time chunk until caught up to the current time. Should be set with one of: `NONE`, `MONTHLY`, `QUARTERLY`, `YEARLY`.  This is useful for large datasets where loading all historical data in a single query may time out, exceed API limits, or exceed the storage size of the content repository of the runtime. After catching up, the processor continues with normal incremental offload behavior. |
| OAuth2 Audience | Audience to set in the JWT token. Set to `https://login.salesforce.com` for production environments or `https://test.salesforce.com` for sandboxes and test environments. |
| OAuth2 Client ID | Should be set to the Consumer Key value retrieved during the Salesforce Setup steps. |
| OAuth2 Subject | Should be set to the username of an admin-approved user for the application to interact with Salesforce APIs on behalf of this user. |
| OAuth2 Token Endpoint URL | Endpoint to negotiate tokens via the JWT Bearer Flow. Example: `https://myCompany.my.salesforce.com/services/oauth2/token`. |
| Object Fields Filter JSON | A JSON specifying which fields and field patterns should be included or excluded, per Salesforce object. Takes the form of an array with one item per object.  Example 1: This will include all fields that end with ‘name’ in the ‘Account’ Salesforce object:  `[ {"objectType":"Account", "includedPattern":".*name"} ]`  Example 2: This will include the fields Id, Name, and Revenue in the ‘Account’ Salesforce object:  `[ {"objectType":"Account", "included": ["Id", "Name", "Revenue"]} ]`  `excluded` and `excludedPattern` are also available for configuring the filters. |
| Object Identifier Resolution | Determines if schema / table / column names are treated as case-sensitive or case-insensitive. One of: `CASE_INSENSITIVE` / `CASE_SENSITIVE`.  **Note:** Changing this parameter value will require clearing the state and doing a full reload of all objects. |
| Removed Column Name Suffix | Suffix added to the column name when the parameter Column Removal Strategy is set to `Rename Column`. Default: `__deleted`. |
| Run Schedule | Frequency at which the connector will check for updates in Salesforce for configured objects via the Filter parameter. Default: `15 minutes`. |
| Salesforce Instance | Hostname of the Salesforce instance including the domain name. Do not include the protocol prefix (`https://`). For example, use `myCompany.my.salesforce.com`. |
| Snowflake Account Identifier | Snowflake account name formatted as `[organization-name]-[account-name]` where data will be persisted. Example: `PM-CONNECTORS`. |
| Snowflake Username | The name of the service user that the connector uses to connect to Snowflake. The service user is required only when using the `KEY_PAIR` authentication strategy (Openflow BYOC only). |
| Snowflake Private Key | The RSA Private Key that the connector uses for authentication to Snowflake, formatted according to PKCS8 standards and including standard PEM headers and footers. The header line starts with `-----BEGIN PRIVATE`. This is required only when using the `KEY_PAIR` authentication strategy (Openflow BYOC only).  You may also use the next parameter to upload the private key to the Openflow runtime instead. |
| Snowflake Private Key File | The file containing the RSA Private Key that the connector uses for authentication to Snowflake, formatted according to PKCS8 standards and including standard PEM headers and footers. The header line starts with `-----BEGIN PRIVATE`. Required only when using the `KEY_PAIR` authentication strategy (Openflow BYOC only).  Select the Reference asset checkbox to upload the private key file and store it securely in the Openflow runtime. |
| Snowflake Private Key Password | The password associated with the Snowflake Private Key File (if encrypted). This is required only when using the `KEY_PAIR` authentication strategy (Openflow BYOC only). |
| Snowflake Role | Name of the Snowflake role used during query execution. When using `SNOWFLAKE_MANAGED`, this is the Snowflake Role for Openflow Runtimes. When using `KEY_PAIR` (Openflow BYOC only), this is the role assigned to the specified Snowflake username. |
| Snowflake Authentication Strategy | Authentication strategy for the connector to connect to Snowflake.  Using `SNOWFLAKE_MANAGED` (default) uses the Snowflake managed token associated with the specified Snowflake Runtime Role. If using Openflow BYOC, you can also use `KEY_PAIR` to specify a specific user and role via a custom Key Pair. |
| Snowflake Warehouse | The Snowflake warehouse used to run queries. |
| Special Objects Filter | Comma-separated list of objects to offload from Salesforce (using direct API access), or regular expression to apply against all existing objects. The filter is case-insensitive, meaning that a filter set to `account` would match the object type `Account`.  This filter should only be used for objects that are **not** supported by the Salesforce Bulk API such as knowledge data, for example. This parameter should not overlap with the parameter Filter.  Example: `Knowledge.*` |

## Verify the Salesforce connection

Before enabling and starting the connector, Snowflake recommends verifying that the Salesforce authentication is properly configured. The **Verification** feature on controller services lets you test the connection without starting the full connector flow.

The JWT Bearer OAuth2 Access Token Provider controller service depends on two other controller services that must be enabled first: the Salesforce Private Key Service and the Web Client Service Provider.

1. Double-click the connector process group to open it.
2. Right-click on an empty area of the canvas and select Controller Services.
3. Enable the Salesforce Private Key Service and the Web Client Service Provider services.
4. Locate the JWT Bearer OAuth2 Access Token Provider service in the list.
5. Click the Verification button for the service. A dialog opens where you can provide property overrides. You can ignore this and click Verify directly.
6. If everything is configured properly, the Acquire token step shows a green checkmark indicating success. This confirms the connector can authenticate with Salesforce and obtain an access token. You can proceed to the next step to run the connector.
7. If verification fails, review the error message and check the following:

   * The OAuth2 Client ID parameter matches the Consumer Key from the external client app in Salesforce.
   * The private key corresponds to the certificate uploaded to the external client app.
   * The OAuth2 Subject user is authorized for the external client app (see [Approve the client app for a user](setup-salesforce.md)).
   * The OAuth2 Token Endpoint URL uses the correct Salesforce instance hostname.
   * The OAuth2 Audience is set to the correct value: `https://login.salesforce.com` for production or `https://test.salesforce.com` for sandboxes.

   For detailed troubleshooting, see [Troubleshooting the Openflow Connector for Salesforce Bulk API](troubleshoot.md).

## Run the connector

Follow these steps to start the connector and begin replicating data from Salesforce to Snowflake:

1. Right-click on an empty area in the canvas and select Enable all Controller Services.
2. Right-click on the connector process group and select Start.

## Manage object replication

After the connector has been started and objects have been replicated, you can add new objects or remove existing objects from replication.

### Add new objects to replication

To add a new object to replication, update the Filter parameter (or Special Objects Filter parameter, if applicable) with the new object names. You do not need to stop the connector. The new object is replicated at the next scheduled execution.

For example, if the current Filter value is `Account, Opportunity` and you want to add the `Contact` object, change the value to `Account, Opportunity, Contact`.

### Remove objects from replication

Removing an object from replication requires stopping the connector and cleaning up both the connector state and the destination table in Snowflake:

1. Stop all processors in the flow by right-clicking on the connector process group and selecting Stop.
2. Ensure that no in-flight FlowFiles are being processed.
3. Right-click on the canvas and select Parameters, then remove the object name from the Filter parameter (or the Special Objects Filter parameter, if applicable).
4. Right-click on the canvas and select Disable all controller services.
5. Go to Controller services and open the state of the controller service named Salesforce Bulk Jobs State.
6. Select the trash icon next to the object type you removed to delete its state entry.
7. Right-click on the canvas and select Enable all controller services, then start all processors to resume the connector.
8. If applicable, drop the corresponding table from the Snowflake destination database to clean up the previously replicated data. For example:

   ```sqlexample
   DROP TABLE <database_name>.<schema_name>.<object_name>;
   ```

## Next steps

* To monitor and troubleshoot the connector, see [Troubleshooting the Openflow Connector for Salesforce Bulk API](troubleshoot.md).

---
title: Openflow Connector for Salesforce Bulk API: Salesforce formula fields
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/formula-fields.md
section: Loading & Unloading Data
---

# Openflow Connector for Salesforce Bulk API: Salesforce formula fields

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how the Openflow Connector for Salesforce Bulk API translates Salesforce formula
fields into Snowflake SQL views, including supported functions and limitations.

## How formula views work

When Enable Views Creation is set to `true`, the connector performs the following for each object that has formula fields:

1. Retrieves the formula expressions from the Salesforce object metadata via the Describe API.
2. Parses each formula expression and translates it into equivalent Snowflake SQL.
3. Generates a `CREATE OR REPLACE VIEW` statement that combines non-formula columns from the base table with the translated formula expressions as computed columns.
4. Runs the DDL against Snowflake to create or update the view.

The resulting view is named `<Object Type>_FORMULA_VW`. For example, the `Account` object produces a view named `ACCOUNT_FORMULA_VW`. You can query this view to obtain formula field values alongside the replicated data.

The view is automatically updated whenever the connector detects schema changes in the source object, ensuring that formula definitions stay in sync with Salesforce.

## Cross-object formula fields

Salesforce formulas can reference fields from related objects using relationship traversal (for example, `Account.Owner.Name`). The connector supports these cross-object references by generating `LEFT JOIN` clauses in the view definition. Each relationship traversal produces a join to the corresponding related table in Snowflake.

For cross-object formulas to work correctly, the related objects must also be replicated by the connector. The connector does not check whether the referenced tables exist in Snowflake at translation time. If a related object is not being synced, the generated `CREATE OR REPLACE VIEW` statement references a table that does not exist in Snowflake, and the view creation fails. To resolve this, ensure that all related objects referenced by formula fields are included in the Filter parameter. The view is automatically recreated on the next connector run after the referenced tables exist.

## Formula view column comments

Each formula column in the generated view includes a SQL `COMMENT` annotation:

* For successfully translated formulas, the comment contains the original Salesforce formula expression.
* For formulas that could not be translated, the comment contains the failure reason code.

You can inspect these comments by running `DESCRIBE VIEW <view_name>` in Snowflake.

## Supported formula functions

The following Salesforce formula functions are translated into equivalent Snowflake SQL:

| Category | Salesforce function | Snowflake equivalent |
| --- | --- | --- |
| Logical | `IF` | `CASE WHEN ... THEN ... ELSE ... END` |
| Logical | `CASE` | `CASE ... WHEN ... THEN ... ELSE ... END` |
| Logical | `AND` / `OR` / `NOT` | `AND` / `OR` / `NOT` |
| Null handling | `ISBLANK` | `LENGTH(COALESCE(expr, '')) = 0` |
| Null handling | `ISNULL` | `expr IS NULL` |
| Null handling | `NULLVALUE` | `COALESCE` |
| Null handling | `BLANKVALUE` | `CASE WHEN ... IS NULL OR LENGTH(...) = 0 THEN ... END` |
| Text | `LEFT` | `LEFT` |
| Text | `RIGHT` | `RIGHT` |
| Text | `MID` | `SUBSTR` |
| Text | `LEN` | `LENGTH` |
| Text | `SUBSTITUTE` | `REPLACE` |
| Text | `TRIM` | `TRIM` |
| Text | `UPPER` | `UPPER` |
| Text | `LOWER` | `LOWER` |
| Text | `CONTAINS` | `CONTAINS` |
| Text | `BEGINS` | `STARTSWITH` |
| Text | `FIND` | `CHARINDEX` |
| Text | `LPAD` | `LPAD` |
| Text | `RPAD` | `RPAD` |
| Text | `BR` | Newline character literal |
| Conversion | `TEXT` | `CAST(... AS STRING)` |
| Conversion | `VALUE` | `TRY_CAST(... AS NUMBER)` |
| Math | `ABS` | `ABS` |
| Math | `ROUND` | `ROUND` |
| Math | `CEILING` | `CEIL` |
| Math | `FLOOR` | `FLOOR` |
| Math | `MOD` | `MOD` |
| Math | `SQRT` | `SQRT` |
| Math | `MAX` | `GREATEST` |
| Math | `MIN` | `LEAST` |
| Math | `LOG` | `LOG(10, ...)` |
| Math | `EXP` | `EXP` |
| Math | `LN` | `LN` |
| Date and time | `NOW` | `CURRENT_TIMESTAMP()` |
| Date and time | `TODAY` | `CURRENT_DATE()` |
| Date and time | `YEAR` | `YEAR` |
| Date and time | `MONTH` | `MONTH` |
| Date and time | `DAY` | `DAY` |
| Date and time | `DATEVALUE` | `TO_DATE` |
| Date and time | `DATETIMEVALUE` | `TO_TIMESTAMP` |
| Date and time | `ADDMONTHS` | `DATEADD(MONTH, ...)` |
| Picklist | `ISPICKVAL` | `COALESCE(field, '') = COALESCE(value, '')` |

In addition to functions, the following operators are supported:

* Arithmetic: `+`, `-`, `*`, `/`, `^` (exponentiation, translated to `POWER`)
* Comparison: `=`, `==`, `!=`, `<>`, `<`, `<=`, `>`, `>=`
* Logical: `AND`, `OR`, `&&`, `||`
* String concatenation: `&` (translated to `||` with `COALESCE` null handling)
* Unary: `-` (negation), `NOT`

## Unsupported formula constructs

The following formula constructs are not yet supported. Support for additional functions and constructs will be added in future releases. When a formula uses any of these, the corresponding column in the view returns `NULL` and the column comment indicates the failure reason.

| Failure reason | Description |
| --- | --- |
| `FUNCTION_NOT_SUPPORTED` | The formula uses a function that has no Snowflake equivalent or that is specific to the Salesforce UI. This includes: `IMAGE`, `HYPERLINK`, `URLFOR`, `HTMLENCODE`, `JSENCODE`, `LINKTO`, `GEOLOCATION`, `DISTANCE`, `VLOOKUP`, `REGEX`, `PREDICT`, `GETSESSIONID`, `GETRECORDIDS`, `REQUIRESCRIPT`, `ISCHANGED`, `ISNEW`, `ISCLONE`, `PRIORVALUE`. |
| `GLOBAL_VARIABLE_NOT_SUPPORTED` | The formula references a Salesforce global variable such as `$User.Name`, `$Organization.Name`, or `$Profile.Name`. These variables have no equivalent in Snowflake. |
| `FORMULA_CHAIN_NOT_SUPPORTED` | The formula references another formula field. Chained formula references (a formula field that depends on another formula field) are not supported. |
| `ROLLUP_NOT_SUPPORTED` | The field is a rollup summary field rather than a formula field. Rollup summaries aggregate data from child records and cannot be expressed as a simple SQL view. |
| `LOOKUP_NOT_SYNCED` | The formula references a relationship that cannot be resolved from the Salesforce object metadata. This typically occurs when the relationship name in the formula does not match any known relationship on the object. |
| `ID_FORMAT_MISMATCH` | The formula contains a hardcoded 15-character Salesforce ID. Salesforce uses 15-character IDs internally, but the Bulk API returns 18-character IDs. Formulas with hardcoded 15-character IDs cannot be reliably translated. |
| `COMPOUND_FIELD_REFERENCE` | The formula references a compound field (such as `MailingAddress`) that is not stored as a single column in Snowflake. |
| `PARSE_ERROR` | The formula expression could not be parsed. This might indicate a syntax that the connector does not yet recognize. |
| `UNSUPPORTED_SYNTAX` | The formula uses a syntax construct that is recognized but cannot be translated (for example, an `IF` function with fewer than three arguments). |

---
title: Openflow Connector for Salesforce Bulk API: Set up Salesforce
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/setup-salesforce.md
section: Loading & Unloading Data
---

# Openflow Connector for Salesforce Bulk API: Set up Salesforce

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up Salesforce for the Openflow Connector for Salesforce Bulk API.

The connector authenticates with Salesforce using the OAuth 2.0 JWT Bearer Flow. This requires creating a certificate key pair, configuring an external client app in Salesforce, and authorizing a user to use the app.

> **Important:**
>
> Salesforce has deprecated Connected Apps in favor of External Client Apps. If you have an existing Connected App, Snowflake recommends creating a new External Client App instead.

## Create certificates

You need a private key and public certificate to configure the external client app in Salesforce. The private key is used by the connector to sign JWT tokens, and the public certificate is uploaded to the external client app in Salesforce so that Salesforce can verify the signature.

1. Generate the private key. You are asked for a password to secure the private key.

   ```bash
   openssl genpkey -algorithm RSA -out private.key -aes256
   ```

   Record the password. You need it when configuring the connector parameters in Snowflake.
2. Create a self-signed certificate from the private key.

   ```bash
   openssl req -new -x509 -key private.key -out public.crt -days 365
   ```

   You can also generate a Certificate Signing Request (CSR) to have a certificate signed by your company CA.

> **Note:**
>
> You are responsible for safeguarding and rotating the public key and private key files used for key-pair authentication according to the security policies of your organization.

## Create an external client app in Salesforce

Create an external client app in Salesforce with JWT Bearer Flow. The connector requires this specific OAuth flow to authenticate. Using a different OAuth flow (such as Authorization Code Flow) causes `invalid_grant` errors.

1. Log in to Salesforce as an administrator.
2. Go to Setup » Apps » App Manager, and then select New External Client App.
3. Fill in the required fields:

   * External Client App Name: For example, `Openflow connector for Salesforce Bulk API`.
   * Contact Email: For example, `salesforceadmin@mycompany.com`.
4. In the API (Enable OAuth Settings) section, select the Enable OAuth checkbox.
5. Provide a valid Callback URL (for example, `https://www.google.com/`).

   > **Note:**
   >
   > The callback URL is required by Salesforce, but it is not used by the JWT Bearer Flow. You can provide any valid URL.
6. Provide the desired OAuth Scopes for the application. The following scopes are required for the connector to operate properly:

   * Manage user data via APIs (`api`)
   * Perform requests at any time (`refresh_token`, `offline_access`)
7. In Flow Enablement, select the Enable JWT Bearer Flow checkbox and upload the `public.crt` file created in the previous step.

   > **Important:**
   >
   > You must select **Enable JWT Bearer Flow** specifically. Do not enable other flows unless you have a specific reason to do so. The certificate you upload here must correspond to the private key (`private.key`) that you configure in the connector parameters.
8. Click Create to complete the application creation process.
9. Go to the Settings tab, expand the OAuth Settings section, and click Consumer Key and Secret to retrieve the credentials of your application.
10. Record the values for the Consumer Key and the Consumer Secret for use when configuring the connector in Snowflake. The Consumer Key is used as the OAuth2 Client ID parameter in the connector configuration.

## Approve the client app for a user

The connector interacts with Salesforce APIs on behalf of a specific user (the OAuth2 Subject configured in the connector parameters). You must authorize this user to use the external client app by assigning the appropriate profiles or permission sets.

If this step is not completed, the connector receives a permission error when attempting to authenticate, even if the JWT Bearer Flow is configured correctly.

1. Go to the Policies tab of the client application.
2. Click Edit.
3. Expand the OAuth Policies section and change Permitted Users to Admin approved users are pre-authorized.
4. Expand the App Policies section and select the profiles or permission sets that are assigned to the Salesforce user you want the connector to use. For example, if the user has the `System Administrator` profile, select that profile.

   > **Note:**
   >
   > The user specified as the OAuth2 Subject in the connector configuration must belong to at least one of the profiles or permission sets selected here. If the user is not authorized, you receive a permission error when verifying or running the connector.
5. Click Save.

## Verify credentials match

Before proceeding to the Snowflake setup, confirm that the following credentials all belong to the same external client app and key pair:

* The **Consumer Key** (Client ID) was retrieved from the external client app you just created.
* The **private key** (`private.key`) corresponds to the **certificate** (`public.crt`) uploaded to the same external client app.
* The **OAuth2 Subject** (user) is authorized for this external client app through the profile or permission set assignment.

If you have created multiple external client apps or experimented with different configurations, mixing credentials from different apps or key pairs is a common source of `invalid_grant` errors. When in doubt, create a new external client app with a fresh certificate and key pair.

## Next steps

Perform the Snowflake setup tasks:

[Openflow Connector for Salesforce Bulk API: Set up Snowflake](setup-snowflake.md)

---
title: Openflow Connector for Salesforce Bulk API: Set up Snowflake
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/setup-snowflake.md
section: Loading & Unloading Data
---

# Openflow Connector for Salesforce Bulk API: Set up Snowflake

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up Snowflake for the Openflow Connector for Salesforce Bulk API.

## Prerequisites

Before you begin, ensure you have completed the following:

* Install Openflow (either BYOC or SPCS). For more information, see [About Openflow](../../about.md).
* Create an Openflow deployment. For more information, see [Set up Openflow - Snowflake Deployment: Create deployment](../../setup-openflow-spcs-deployment.md) or [Set up Openflow - BYOC](../../setup-openflow-byoc.md).
* Create an Openflow runtime. For more information, see [Set up Openflow - Snowflake Deployment: Create runtime](../../setup-openflow-spcs-create-runtime.md) or [Set up Openflow - BYOC](../../setup-openflow-byoc.md).
* Review the known limitations of the connector in [About the Openflow Connector for Salesforce Bulk API](about.md).

## Create a key pair

Create a key pair that will be used by the service account user in the connector to interact with the database.

> **Note:**
>
> This step is only required if you are deploying the connector in Openflow BYOC. It is NOT needed when deploying the connector in Openflow SPCS.

1. Generate a private key. The example below shows how to generate an unencrypted private key.

   ```bash
   openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
   ```

   The content of the `rsa_key.p8` file will look like this:

   ```text
   -----BEGIN PRIVATE KEY-----
   MIIE6T...
   -----END PRIVATE KEY-----
   ```
2. Generate the public key by referencing the private key.

   ```bash
   openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
   ```

   The content of the `rsa_key.pub` file will look like this:

   ```text
   -----BEGIN PUBLIC KEY-----
   MIIBIjANBgkqh...
   -----END PUBLIC KEY-----
   ```

   Copy the contents of this file (without the `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----` headers) to use when creating the user in the next section.

## Create objects and grant privileges

Create a service account, role, database, schema, and warehouse for the connector, and grant the appropriate permissions.

1. Use a role with `ACCOUNTADMIN` privileges to set the role:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
2. Create the destination Snowflake database, if it does not
   exist:

   ```sqlexample
   CREATE DATABASE IF NOT EXISTS <my_salesforce_db>;
   ```
3. Create the destination schema in the database, if it does
   not exist:

   ```sqlexample
   CREATE SCHEMA IF NOT EXISTS <my_salesforce_db>.<my_salesforce_schema>;
   ```
4. Create the role used by the Openflow connector:

   ```sqlexample
   CREATE ROLE IF NOT EXISTS <Salesforce_connector_role_name>;
   ```
5. Grant the privileges to the role to use the database:

   ```sqlexample
   GRANT USAGE ON DATABASE <my_salesforce_db> TO ROLE <Salesforce_connector_role_name>;
   GRANT USAGE ON SCHEMA <my_salesforce_db>.<my_salesforce_schema> TO ROLE <Salesforce_connector_role_name>;
   GRANT CREATE TABLE ON SCHEMA <my_salesforce_db>.<my_salesforce_schema> TO ROLE <Salesforce_connector_role_name>;
   ```
6. Create a warehouse for the connector (or use an existing one) and grant usage privileges to the connector role:

   ```sqlexample
   -- Create a warehouse (skip if you wish to use an existing warehouse)
   CREATE OR REPLACE WAREHOUSE MY_WAREHOUSE WITH
    WAREHOUSE_SIZE = 'SMALL'
    AUTO_SUSPEND = 300
    AUTO_RESUME = TRUE;

   GRANT USAGE, OPERATE ON WAREHOUSE MY_WAREHOUSE TO ROLE <Salesforce_connector_role_name>;
   ```
7. Create the service user and assign the role and public key:

   ```sqlexample
   -- Create a service user that the connector will use to interact with Snowflake
   -- Set default role to <Salesforce_connector_role_name>
   -- Assign the public key generated with openssl in the previous step (only for BYOC)
   CREATE OR REPLACE USER <Salesforce_connector_user_name>
     TYPE = SERVICE
     DEFAULT_ROLE = <Salesforce_connector_role_name>
     RSA_PUBLIC_KEY = '<public_key_generated_by openssl_in_step_1>';

   -- Grant the role to the user
   GRANT ROLE <Salesforce_connector_role_name> TO USER <Salesforce_connector_user_name>;
   ```

## Create a network rule (Openflow Snowflake Deployment only)

If you are deploying the connector in a runtime that is in an Openflow Snowflake Deployment, you must create a network rule and external access integration and set them on the runtime.

```sqlexample
USE ROLE SECURITYADMIN;

CREATE NETWORK RULE MY_OPENFLOW_SALESFORCE_NETWORK_RULE
   TYPE = HOST_PORT
   MODE = EGRESS
   VALUE_LIST = ('<salesforce_instance_host>:443');

CREATE EXTERNAL ACCESS INTEGRATION MY_OPENFLOW_SALESFORCE_EAI
   ALLOWED_NETWORK_RULES = (MY_OPENFLOW_SALESFORCE_NETWORK_RULE)
   ENABLED = TRUE
   COMMENT = 'External Access Integration to connect to Salesforce';

GRANT USAGE ON INTEGRATION MY_OPENFLOW_SALESFORCE_EAI TO ROLE <openflow_role_name>;
```

## Next steps

Configure the connector in Openflow:

[Openflow Connector for Salesforce Bulk API: Configure the connector](configure-connector.md)

---
title: Openflow Connector for SQL Server: Data mapping
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sql-server/data-mapping.md
section: Loading & Unloading Data
---

# Openflow Connector for SQL Server: Data mapping

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how the SQL Server data types are mapped
to Snowflake data types.

## SQL Server to Snowflake data type mapping

The following table shows how SQL Server data types are mapped to Snowflake data types
when replicating data.

| SQL Server type | Snowflake type | Notes |
| --- | --- | --- |
| TINYINT | INT |  |
| SMALLINT | INT |  |
| INT | INT |  |
| BIGINT | INT |  |
| DECIMAL | NUMBER | If precision exceeds Snowflake limitations (precision > 38), the value is stored as TEXT. |
| NUMERIC | NUMBER | If precision exceeds Snowflake limitations (precision > 38), the value is stored as TEXT. |
| SMALLMONEY | NUMBER |  |
| MONEY | NUMBER |  |
| REAL | FLOAT |  |
| FLOAT | FLOAT |  |
| BIT | BOOLEAN |  |
| CHAR | TEXT |  |
| VARCHAR | TEXT |  |
| NCHAR | TEXT |  |
| NVARCHAR | TEXT |  |
| TEXT | TEXT |  |
| NTEXT | TEXT |  |
| DATE | DATE |  |
| TIME | TIME |  |
| SMALLDATETIME | TIMESTAMP_NTZ |  |
| DATETIME | TIMESTAMP_NTZ |  |
| DATETIME2 | TIMESTAMP_NTZ |  |
| DATETIMEOFFSET | TIMESTAMP_TZ |  |
| BINARY | BINARY |  |
| VARBINARY | BINARY |  |
| IMAGE | BINARY | Supported up to the maximum entry size in Snowflake (16 MB). |
| JSON | VARIANT | Supported up to the maximum entry size in Snowflake (16 MB). |
| VECTOR | VARIANT |  |
| XML | TEXT |  |
| UNIQUEIDENTIFIER | TEXT |  |
| ROWVERSION / TIMESTAMP | TEXT |  |
| SQL_VARIANT | TEXT |  |
| GEOGRAPHY | TEXT | Values of this type are inserted as NULL. |
| GEOMETRY | TEXT | Values of this type are inserted as NULL. |

> **Note:**
>
> Any SQL Server data types not listed in this table are mapped to TEXT by default.

---
title: Openflow Connector for SQL Server: Maintenance
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sql-server/maintenance.md
section: Loading & Unloading Data
---

# Openflow Connector for SQL Server: Maintenance

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes maintenance considerations and best practices for the Openflow Connector for SQL Server, such as reinstalling the connector or setting the change tracking starting position.

These operations are often used in conjunction with [Incremental replication with snapshots](incremental-replication.md).

## Reinstall the connector

This section provides instructions on how to reinstall the connector, and continue replicating data for
the same tables without having to snapshot them again.
It covers situations where the new connector is installed in the same runtime, as well as moved to a new runtime.

### Prerequisites

Review and note connector parameter context values.
If you reinstall the connector in the same runtime, you can reuse the existing context.
If the new instance is located in a different runtime, you must re-enter all parameters.

1. Finish processing all in-flight FlowFiles in the existing connector, then stop the connector.

   1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
   2. In the navigation menu, select Ingestion » Openflow.
   3. Select Launch Openflow.
   4. In the Openflow pane select the Runtimes tab.
   5. Select the runtime containing the connector.
   6. Select the connector.
   7. Stop the topmost processor Set Tables for Replication in the Snapshot Load group.
   8. Stop the topmost processor Read SQLServer Change Tracking tables in the Incremental Load group.
   9. If you changed the value of the Merge Task Schedule CRON parameter, return it to `* * * * * ?`, otherwise queues will not be emptied until the next scheduled run.

      Wait until all FlowFiles in the connector have been processed, and all queues are empty.
      When all FlowFiles have been processed, the Queued value on the connector’s processor group becomes zero.
      If there are any items left in the original connector’s queues, there may be data gaps when the new connector starts.
   10. Stop all processors and controller services in the connector.
   > **Caution:**
   >
   > The existing connector can remain in the runtime and doesn’t interfere with the new instance, as long as it remains stopped.
2. Create a new instance of the connector. If you use the same runtime as the original connector, you can choose to keep the existing parameter contexts and reuse the settings.
3. If you install into a different runtime or you deleted the previous parameter contexts, enter the configuration settings into the new parameter contexts,
   including the table names and patterns as described in [Set up the Openflow Connector for SQL Server](setup.md).
4. Navigate to the `SQLServer Ingestion Parameters` context, and set the following parameters:

   * Set the `Ingestion Type` parameter to `incremental`. For information, see [Enable incremental replication without snapshots](incremental-replication.md).
   * Set the `Starting Change Tracking Position` parameter to `Earliest`.
     For information, see Specify load from change tracking table position.
5. Start the new connector.

### Usage notes

The new connector uses the existing destination tables created by the original connector, but creates new journal tables.

## Specify load from change tracking table position

The Openflow Connector for SQL Server connector lets you select the starting position where change tracking tables are read.
By default, the connector reads from the latest available position. Alternatively, you can choose the earliest position available on the source instance.
Choosing to start from the earliest position is common when reinstalling the connector.
This allows the new instance to catch up and continue replicating existing tables without having to snapshot each again.

Switching a running connector from latest to earliest position causes the contents of change tracking tables to be re-read, re-processed, and re-applied to the destination table.

> **Warning:**
>
> While the change tracking tables are being re-read, the data in affected destination tables
> can become out of sync with their sources until all events have been re-processed and merged.

The following parameters are available in the `Ingestion Parameters` context:

| Parameter | Description |
| --- | --- |
| Starting Change Tracking Position | * `Latest` (default): change tracking table reading starts at the latest available position and continues from there. * `Earliest`: Switches the incremental load to start, or restart reading from the earliest available   change tracking table positions. |
| Re-read Tables in State | * `New` (default):   Only new tables, added after the starting position was switched to `Earliest`, will have their change   tracking tables read from the earliest available positions. Tables that started replication before the   configuration change will continue reading from their last positions. * `Any active`: Re-read and re-process changes from any table currently in replication. |

To determine whether the connector finished re-reading the change tracking tables:

1. Navigate to the Openflow canvas.
2. Open the Incremental Load process group.
3. Right-click the topmost processor named Read SQLServer Change Tracking tables, then select View state.
4. Check the state entries for every table with keys starting with `position.`. If a value is `0/0` then the connector has not yet finished re-reading the changes for this table.

### Usage notes

* After you switch a running connector to read from the earliest positions and start it,
  you cannot reconfigure or cancel the process, and it will continue until the currently-read positions reach the latest values.
* Switching to the earliest position on a running connector will, for any tables being re-processed,
  finish their existing journals, and create new journal tables.

---
title: Openflow Connector for SQL Server: Set up incremental replication without snapshots
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sql-server/incremental-replication.md
section: Loading & Unloading Data
---

# Openflow Connector for SQL Server: Set up incremental replication without snapshots

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

You can configure the Openflow Connector for SQL Server connector to immediately replicate incremental changes for newly added tables, bypassing snapshots. Use incremental load to continue replication without snapshotting every table again when you reinstall the connector over previously replicated data.

You can enable incremental replication in a new or existing connector instance.

To enable incremental replication in a new or existing connector instance:

1. Set up the connector as described in [Set up the Openflow Connector for SQL Server](setup.md).
2. In the `SQLServer Ingestion Parameters` context, set the `Ingestion Type` parameter to `incremental`.

## Enable incremental replication without snapshots

To enable incremental replication on an existing connector:

1. Sign in to [Snowsight](../../../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. In the Openflow pane select the Runtimes tab.
4. Select the runtime containing the connector.
5. Select the connector.
6. In the `Ingestion Parameters` context, specify `Ingestion Type` = `incremental`.
7. Add new replication tables. These tables immediately switch to their incremental load.

> **Note:**
>
> To return to replicating tables with the snapshot load, change Ingestion Type from `incremental` to `full`.

### Usage notes

* Changing the value of Ingestion Type does not impact any tables that have begun replicating data.
  Tables currently in the snapshot phase continue until the snapshot load is complete.
* While Ingestion Type is set to `incremental`, new tables added to the list of replicated tables bypass the snapshot phase.
  This includes new tables added to the source database that match the `Included Table Regex` parameter.
  Ensure that the ingestion type is set to `incremental` to bypass the snapshot phase.

  > **Note:**
  >
  > Connectors should only remain in `incremental` mode as long as required as it bypasses snapshots.
  > Once customer needs for incremental updates have been satisfied the connector should be returned to `full` mode.
* For tables that bypass snapshot load, the connector creates a destination table in Snowflake,
  by executing `CREATE TABLE IF NOT EXISTS`, only if no destination table already exists.
  Tables going through the snapshot require that no destination table exist.

---
title: Openflow connectors
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/about-openflow-connectors.md
section: Loading & Unloading Data
---

# Openflow connectors

> **Note:**
>
> The connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

Openflow connectors are curated, versioned Apache NiFi flow definitions built using open-source and proprietary NiFi components.
These connectors follow a strict set of design patterns to ensure performance, fault-tolerance, and ease of configuration.

Review the details of the following connectors available in Openflow:

| Connector | Description |
| --- | --- |
| [Openflow Connector for Amazon Ads](amazon-ads/about.md) | Bring data from Amazon Ads for Ad performance statistics and insights |
| [Openflow Connector for Box](box/about.md) | Ingest Box content for your own custom processing in Snowflake  Ingest Box content and make it ready for chat in your AI assistants with Snowflake Cortex  Use Box AI to extract metadata from Box content for enrichment in Snowflake  Add enriched metadata from Snowflake to content in Box |
| [Openflow Connector for Google Ads](google-ads/about.md) | Import metrics from Google Ads for performance tracking and optimization |
| [Openflow Connector for Google BigQuery](google-big-query/about.md) | Replicate datasets and tables from Google BigQuery into Snowflake with incremental change capture |
| [Openflow Connector for Google Drive](google-drive/about.md) | Ingest Google Drive content and make it ready for chat in your AI assistants with Snowflake Cortex  Ingest Google Drive content for your own custom processing in Snowflake |
| [Openflow Connector for Google Sheets](google-sheets/about.md) | Load data from Google sheets into Snowflake tables for reporting, analytics, and insights |
| [Openflow Connector for HubSpot](hubspot/about.md) | Get HubSpot CRM data into Snowflake for reporting, analytics, and insights |
| [Openflow Connector for Jira Cloud](jira-cloud/about.md) | Extract Jira issues and project details for cross‐team visibility and deeper insights |
| [Openflow Connector for Kafka](kafka/about.md) | Ingest real‐time events from Apache Kafka into Snowflake for near real-time analytics |
| [Openflow Connector for Kinesis Data Streams](kinesis/about.md) | Ingest real‐time events from Amazon Kinesis Data Streams into Snowflake for near real-time analytics |
| [Openflow Connector for LinkedIn Ads](linkedin-ads/about.md) | Import campaign performance data from LinkedIn Ads to Snowflake for reporting, analytics, and insights |
| [Openflow Connector for Meta Ads](meta-ads/about.md) | Bring Meta (Facebook) Ads data to unify and analyze your marketing performance |
| [Openflow Connector for Microsoft Dataverse](dataverse/about.md) | Integrate data from Microsoft Power Platform and Dynamics 365 applications with Snowflake for holistic business insights |
| [Openflow Connector for MySQL](mysql/about.md) | CDC replication of MySQL tables into Snowflake for comprehensive, centralized reporting |
| [Openflow Connector for Oracle](oracle/about.md) | CDC replication of Oracle database tables into Snowflake for comprehensive, centralized reporting |
| [Openflow Connector for PostgreSQL](postgres/about.md) | CDC replication of PostgreSQL data with Snowflake for comprehensive, centralized reporting |
| [Openflow Connector for SharePoint](sharepoint/about.md) | Ingest SharePoint content and make it ready for chat in your AI assistants with Snowflake Cortex  Ingest SharePoint content for your own custom processing in Snowflake |
| [Openflow Connector for Slack](slack/about.md) | Pull Slack messages and metadata into Snowflake for searchable, organization‐wide insights |
| [Openflow Connector for Snowflake to Kafka](snowflake-to-kafka/about.md) | CDC replication of Snowflake tables into Apache Kafka for real-time insights distribution and event-driven architectures |
| [Openflow Connector for SQL Server](sql-server/about.md) | CDC replication of Microsoft SQL Server data with Snowflake for comprehensive, centralized reporting |
| [Openflow Connector for Workday](workday/about.md) | Get Workday data into Snowflake using Report-as-a-Service (RaaS) streams for enterprise-level analytics and planning |

---
title: Openflow Snowflake Deployment cost and scaling considerations
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/cost-spcs.md
section: Loading & Unloading Data
---

# Openflow Snowflake Deployment cost and scaling considerations

When running Openflow - Snowflake Deployment you must be aware of the cost considerations associated with multiple Snowflake components, including, but not limited to the following cost categories:

* Compute pool costs
* Snowpark Container Services infrastructure
* Data Ingestion
* Telemetry Data Ingestion
* Other costs not explicitly mentioned in this topic

Using and scaling Openflow involves understanding these costs. The following sections describe Openflow costs in general, and provide a number of examples of scaling Openflow runtimes and associated costs.

## Openflow - Snowflake Deployment costs

When using Openflow - Snowflake Deployment, you can incur costs from multiple Snowflake components that
Openflow uses. These cost categories are described in the following sections.

However, your actual costs may vary based on your specific environment. See Examples for calculating Openflow - Snowflake Deployment consumption for examples of different
cost consumption scenarios.

### Openflow compute pool costs

> **Note:**
>
> This cost category is shown as **Openflow Compute Snowflake** on your Snowflake bill.

The total costs for running Openflow are based on the number and types of instances used by [Snowpark Container Service compute pools](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md) in your Snowflake account.

Openflow uses compute pools for two different purposes:

* Openflow Management Services

  Openflow Management Services run as part of an Openflow deployment. They
  use a compute pool to manage the Openflow deployment. This compute pool begins running
  as soon as you create a deployment. It continues to run as long as the deployment is
  active.

  > **Caution:**
  >
  > The compute pool associated with the Openflow Management Services continues to run and incurs costs, even if there are no runtimes running.
* Openflow runtimes

  Openflow uses compute pools to run the Openflow runtimes. The number of compute
  pools required and the number of nodes within each compute pool are scaled based on the
  number of runtimes that are currently running.

  When all runtimes associated with a runtime are stopped, the compute pool associated
  with the runtimes is scaled down to 0 nodes. No costs are incurred for a runtime compute pool when it is not in use.

Credits are billed per-second with a 5 minute minimum. For information on the rate per Snowpark Container Services
Compute Instance Family per hour, refer to Table 1(d) in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

The following views in the [Account Usage](../../../sql-reference/account-usage.md) schema provide additional details on Openflow
compute costs:

* [METERING_DAILY_HISTORY](../../../sql-reference/account-usage/metering_daily_history.md)
* [METERING_HISTORY](../../../sql-reference/account-usage/metering_history.md)

Compute pool costs related to Openflow appear under `SERVICE_TYPE` as `OPENFLOW_COMPUTE_SNOWFLAKE`.

> **Note:**
>
> The [OPENFLOW_USAGE_HISTORY](../../../sql-reference/account-usage/openflow_usage_history.md) view currently does not
> contain records for the `OPENFLOW_COMPUTE_SNOWFLAKE` service type.

For more information on compute costs in Snowflake, see [Exploring compute cost](../../cost-exploring-compute.md).

### Snowpark Container Services infrastructure costs

In addition to compute pool costs, there are costs associated with additional Snowpark Container Services infrastructure, including storage and data transfer.

For additional information, see [Snowpark Container Services costs](../../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md).

### Data ingestion costs

Costs are incurred when loading data into Snowflake using services such as Snowpipe or Snowpipe Streaming. These costs are based on the volume of data ingested.

> **Note:**
>
> These costs appear on your Snowflake bill under their respective ingestion services line items.

Additionally, some connectors may require a warehouse and will incur warehouse costs. For example, database CDC connectors require a warehouse for both the
initial snapshots and ongoing incremental Change Data Capture (CDC).

### Telemetry data ingestion costs

When using an event table to store telemetry data for Openflow, Snowflake charges
for sending logs and metrics to Openflow deployments. There are also charges for
sending runtime telemetry data to your event table within Snowflake.

The rate for credits per GB of telemetry data is specified in Table 5 in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf)
This item is referred to as Telemetry Data Ingest.

## Reducing Openflow credit consumption

If you have runtimes that are not actively in use, you can suspend them to reduce costs. Suspending a runtime
stops credit consumption for the associated runtime compute pool. When a runtime is suspended, its compute pool
scales down to 0 nodes and no longer incurs charges.

## Openflow - Snowflake Deployment costs associated with runtimes and scaling behavior

How you choose to configure and scale runtimes is important for managing costs effectively. Openflow supports different runtime types, each with its own scaling characteristics and associated costs.

### Mapping runtimes to Snowflake compute pools

The runtime type you choose determines the runtime pods that are scheduled on the associated compute pool. Using a larger runtime type will result in a larger compute pool being used, which will incur higher costs.

The runtime sizes and their scaling behavior are described in the following table:

| Runtime type | vCPUs | Available memory (GB) | Snowflake Compute Pool instance family | Snowflake Compute Pool | Instance Family - vCPUs | Instance Family - memory (GB) |
| --- | --- | --- | --- | --- | --- | --- |
| Small | 1 | 2 | CPU_X64_S | INTERNAL_OPENFLOW_0_SMALL | 4 | 16 |
| Medium | 4 | 10 | CPU_X64_SL | INTERNAL_OPENFLOW_0_MEDIUM | 16 | 64 |
| Large | 8 | 20 | CPU_X64_L | INTERNAL_OPENFLOW_0_LARGE | 32 | 128 |

Openflow scales the underlying Snowflake Compute Pools when additional compute pool
nodes need to be scheduled, based on CPU consumption, and up to the maximum node setting set during runtime creation.

Compute pools are configured with a minimum size of 0 nodes and a maximum of 50 nodes. The required size is dynamically adjusted depending on the CPU and memory
requirements of the runtimes.

If there are no resource demands, for example, if the runtime is not running, a compute pool scales down to 0 nodes after 600 seconds (10 minutes).

| Runtime | Activity | Snowflake costs | Cloud costs |
| --- | --- | --- | --- |
| No runtimes | None | Openflow Control Pool x 1 node = 1 CPU_X64_S instance-hour | None |
| 1 small runtime (1vCPU) (min=1 max=2) | Active for 1 hour.  Runtime does not scale to 2. | Openflow Control Pool x 1 node + Small Openflow Compute Pool (CPU_X64_S) x 1 node = 2 CPU_X64_S instance-hours | None |
| 2 small runtime (1 vCPU) (min/max=2) 1 large runtime (8 vCPU) (min/max=10) | Small: 4 nodes active for 1 hour Large: 10 nodes active for 1 hour | Openflow Control Pool x 1 node + Small Openflow Compute Pool (CPU_X64_S) x 2 node + Large Openflow Compute Pool (CPU_X64_L) x 4 nodes = 3 CPU_X64_S instance-hours + 4 CPU_X64_L instance-hours | None |
| 1 medium (4vCPU) (min=1 max=2) | First 20 minutes 1 node is running After 20 minutes, scales to 2 nodes After 40 minutes, scales back to 1 node Total 1 hour | Openflow Control Pool x 1 node + Medium Openflow Compute Pool (CPU_X64_SL) x 1 node = 1 CPU_X64_S instance-hour + 1 CPU_X64_SL instance-hour | None |
| 1 medium (4vCPU) (min/max=2) | First 30 minutes 2 nodes running Suspends after the first 30 minutes | Openflow Control Pool x 1 node + Medium Openflow Compute Pool (CPU_X64_SL) x 1 node x 1/2 hour = 1 CPU_X64_S instance-hour + 1/2 CPU_X64_SL instance-hour | None |

### Examples for calculating Openflow - Snowflake Deployment consumption

You created an Openflow Snowflake Deployment and have not created any runtimes.
:   * The Openflow_Control_Pool_0 Compute Pool is running with one CPU_X64_S instance
    * Total Openflow consumption = 1 CPU_X64_S instance-hour

You created one small runtime with Min Nodes = 1 and Max Nodes = 2. Runtime stays at 1 node for 1 hour.
:   * The Openflow_Control_Pool_0 Compute Pool is running with 1 CPU_X64_S instance
    * The INTERNAL_OPENFLOW_0_SMALL Compute Pool is running with 1 CPU_X64_S instance
    * Total Openflow consumption = 2 CPU_X64_S instance-hours

You created two small runtimes with min/max of two nodes each, and one large runtime with min/max of 10 nodes. These Runtimes are active for one hour.
:   * The Openflow_Control_Pool_0 Compute Pool is running with 1 CPU_X64_S instance

      + Two small runtimes at two nodes = INTERNAL_OPENFLOW_0_SMALL Compute Pool is running with 2 CPU_X64_S instances = 2 CPU_X64_S instance-hours
      + One large runtime at 10 nodes = INTERNAL_OPENFLOW_0_LARGE Compute Pool is running with 4 CPU_X64_L instances = 4 CPU_X64_L instance-hours
    * Total Openflow consumption = 3 CPU_X64_S instance-hours + 4 CPU_X64_L instance-hour

You created one medium runtime with one node. After 20 minutes, it scales to two nodes. After 20 minutes, it scales back down to one node and runs for another 20 minutes.
:   * The Openflow_Control_Pool_0 Compute Pool is running with 1 CPU_X64_S instance
    * One medium runtime scaling up to two medium runtimes = INTERNAL_OPENFLOW_0_MEDIUM Compute Pool is running with 1 CPU_X64_SL instance = 1 CPU_X64_SL instance-hour
    * Total Openflow consumption = 1 CPU_X64_S instance-hour + 1 CPU_X64_SL instance-hour

You created one medium runtime with two nodes, then suspended it after 30 minutes.
:   * The Openflow_Control_Pool_0 Compute Pool is running with 1 CPU_X64_S instance
    * One medium runtime at one node = INTERNAL_OPENFLOW_0_MEDIUM Compute Pool is running with 1 CPU_X64_SL instance
    * 30 minutes = 1/2 hour
    * Total Openflow consumption = 1 CPU_X64_S instance-hour +1/2 CPU_X64_SL instance-hour

---
title: PackageFlowFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/packageflowfile.md
section: Loading & Unloading Data
---

# PackageFlowFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor will package FlowFile attributes and content into an output FlowFile that can be exported from NiFi and imported back into NiFi, preserving the original attributes and content.

## Tags

attributes, flowfile, flowfile-stream, flowfile-stream-v3, package

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Maximum Batch Content Size | Maximum combined content size of FlowFiles to package into one output FlowFile. Note, that FlowFiles whose content exceeds this limit are packaged separately. |
| max-batch-size | Maximum number of FlowFiles to package into one output FlowFile. |

## Relationships

| Name | Description |
| --- | --- |
| original | The FlowFiles that were used to create the package are sent to this relationship |
| success | The packaged FlowFile is sent to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The mime.type will be changed to application/flowfile-v3 |

## Use Cases Involving Other Components

|  |
| --- |
| Send FlowFile content and attributes from one NiFi instance to another NiFi instance. |
| Export FlowFile content and attributes from NiFi to external storage and reimport. |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)
* [org.apache.nifi.processors.standard.UnpackContent](unpackcontent.md)

---
title: PaginatedJsonQueryElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/paginatedjsonqueryelasticsearch.md
section: Loading & Unloading Data
---

# PaginatedJsonQueryElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

A processor that allows the user to run a paginated query (with aggregations) written with the Elasticsearch JSON DSL. It will use the flowfile’s content for the query unless the QUERY attribute is populated. Search After/Point in Time queries must include a valid “sort” field.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, json, page, query, read, scroll

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Aggregation Results Format | Format of Aggregation output. |
| Aggregation Results Split | Output a flowfile containing all aggregations or one flowfile for each individual aggregation. |
| Aggregations | One or more query aggregations (or “aggs”), in JSON syntax. Ex: {“items”: {“terms”: {“field”: “product”, “size”: 10}}} |
| Client Service | An Elasticsearch client service to use for running queries. |
| Fields | Fields of indexed documents to be retrieved, in JSON syntax. Ex: [“user.id”, “http.response.\*”, {“field”: “@timestamp”, “format”: “epoch_millis”}] |
| Index | The name of the index to use. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output No Hits | Output a “hits” flowfile even if no hits found for query. If true, an empty “hits” flowfile will be output even if “aggregations” are output. |
| Pagination Keep Alive | Pagination “keep_alive” period. Period Elasticsearch will keep the scroll/pit cursor alive in between requests (this is not the time expected for all pages to be returned, but the maximum allowed time for requests between page retrievals). |
| Pagination Type | Pagination method to use. Not all types are available for all Elasticsearch versions, check the Elasticsearch docs to confirm which are applicable and recommended for your service. |
| Query | A query in JSON syntax, not Lucene syntax. Ex: {“query”:{“match”:{“somefield”:”somevalue”}}}. If this parameter is not set, the query will be read from the flowfile content. If the query (property and flowfile content) is empty, a default empty JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Query Clause | A “query” clause in JSON syntax, not Lucene syntax. Ex: {“match”:{“somefield”:”somevalue”}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Definition Style | How the JSON Query will be defined for use by the processor. |
| Script Fields | Fields to created using script evaluation at query runtime, in JSON syntax. Ex: {“test1”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* 2”}}, “test2”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* params.factor”, “params”: {“factor”: 2.0}}}} |
| Search Results Format | Format of Hits output. |
| Search Results Split | Output a flowfile containing all hits or one flowfile for each individual hit or one flowfile containing all hits from all paged responses. |
| Size | The maximum number of documents to retrieve in the query. If the query is paginated, this “size” applies to each page of the query, not the “size” of the entire result set. |
| Sort | Sort results by one or more fields, in JSON syntax. Ex: [{“price” : {“order” : “asc”, “mode” : “avg”}}, {“post_date” : {“format”: “strict_date_optional_time_nanos”}}] |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| aggregations | Aggregations are routed to this relationship. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| hits | Search hits are routed to this relationship. |
| original | All original flowfiles that don’t cause an error to occur go to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| aggregation.name | The name of the aggregation whose results are in the output flowfile |
| aggregation.number | The number of the aggregation whose results are in the output flowfile |
| page.number | The number of the page (request), starting from 1, in which the results were returned that are in the output flowfile |
| hit.count | The number of hits that are in the output flowfile |
| elasticsearch.query.error | The error message provided by Elasticsearch if there is an error querying the index. |

## See also

* [org.apache.nifi.processors.elasticsearch.ConsumeElasticsearch](consumeelasticsearch.md)
* [org.apache.nifi.processors.elasticsearch.JsonQueryElasticsearch](jsonqueryelasticsearch.md)
* [org.apache.nifi.processors.elasticsearch.SearchElasticsearch](searchelasticsearch.md)

---
title: ParquetIcebergWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/parqueticebergwriter.md
section: Loading & Unloading Data
---

# ParquetIcebergWriter

## Description

Provides record serialization for Apache Iceberg using Apache Parquet formatting

## Tags

iceberg, openflow, parquet, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Write Target File Size \* | Write Target File Size | 512 MB |  | Controls the size of files generated to target about this many bytes |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ParseEvtx 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/parseevtx.md
section: Loading & Unloading Data
---

# ParseEvtx 2025.10.9.21

## Bundle

org.apache.nifi | nifi-evtx-nar

## Description

Parses the contents of a Windows Event Log file (evtx) and writes the resulting XML to the FlowFile

## Tags

event, evtx, file, logs, message, windows

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Granularity | Output flow file for each Record, Chunk, or File encountered in the event log |

## Relationships

| Name | Description |
| --- | --- |
| bad chunk | Any bad chunks of records will be transferred to this relationship in their original binary form |
| failure | Any FlowFile that encountered an exception during conversion will be transferred to this relationship with as much parsing as possible done |
| original | The unmodified input FlowFile will be transferred to this relationship |
| success | Any FlowFile that was successfully converted from evtx to XML |

## Writes attributes

| Name | Description |
| --- | --- |
| filename | The output filename |
| mime.type | The output filetype (application/xml for success and failure relationships, original value for bad chunk and original relationships) |

---
title: ParseExcelCellReference 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/parseexcelcellreference.md
section: Loading & Unloading Data
---

# ParseExcelCellReference 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-office-nar

## Description

Processor responsible for parsing Excel cell reference formula.

## Tags

cell, excel, parse, spreadsheet, xls, xlsx

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Ranges | The comma-separated Excel ranges to parse in the A1 notation. For example: Sheet1!A1:B2,Sheet2!D4:E5,Sheet3. Ranges in R1C1 and 3-D reference style are not allowed. The value can’t be empty. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFile with errors occurred while parsing ranges. |
| success | FlowFile annotated with attributes containing parsed Excel range. For each range a separate FlowFile is produced. |

## Writes attributes

| Name | Description |
| --- | --- |
| range.formula | Single range formula that was used to produce other attributes, e.g. Sheet1!A1:B2. |
| range.sheetname | Parsed sheet name. |
| range.rows.starting | Starting row (numbered from 1) of parsed range. |
| range.rows.ending | Ending row of parsed range. |
| range.columns.starting | Number of starting column of parsed range. |
| range.columns.ending | Number of ending column of parsed range. |

---
title: ParseSyslog 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/parsesyslog.md
section: Loading & Unloading Data
---

# ParseSyslog 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Attempts to parses the contents of a Syslog message in accordance to RFC5424 and RFC3164 formats and adds attributes to the FlowFile for each of the parts of the Syslog message. Note: Be mindfull that RFC3164 is informational and a wide range of different implementations are present in the wild. If messages fail parsing, considering using RFC5424 or using a generic parsing processors such as ExtractGrok.

## Tags

attributes, event, logs, message, syslog, system

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies which character set of the Syslog messages |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that could not be parsed as a Syslog message will be transferred to this Relationship without any attributes being added |
| success | Any FlowFile that is successfully parsed as a Syslog message will be to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| syslog.priority | The priority of the Syslog message. |
| syslog.severity | The severity of the Syslog message derived from the priority. |
| syslog.facility | The facility of the Syslog message derived from the priority. |
| syslog.version | The optional version from the Syslog message. |
| syslog.timestamp | The timestamp of the Syslog message. |
| syslog.hostname | The hostname or IP address of the Syslog message. |
| syslog.sender | The hostname of the Syslog server that sent the message. |
| syslog.body | The body of the Syslog message, everything after the hostname. |

## See also

* [org.apache.nifi.processors.standard.ListenSyslog](listensyslog.md)
* [org.apache.nifi.processors.standard.PutSyslog](putsyslog.md)

---
title: ParseSyslog5424 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/parsesyslog5424.md
section: Loading & Unloading Data
---

# ParseSyslog5424 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Attempts to parse the contents of a well formed Syslog message in accordance to RFC5424 format and adds attributes to the FlowFile for each of the parts of the Syslog message, including Structured Data. Structured Data will be written to attributes as one attribute per item id + parameter see <https://tools.ietf.org/html/rfc5424.Note>: ParseSyslog5424 follows the specification more closely than ParseSyslog. If your Syslog producer does not follow the spec closely, with regards to using ‘-’ for missing header entries for example, those logs will fail with this parser, where they would not fail with ParseSyslog.

## Tags

attributes, event, logs, message, syslog, syslog5424, system

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies which character set of the Syslog messages |
| include_policy | If true, then the Syslog Message body will be included in the attributes. |
| nil_policy | Defines how NIL values are handled for header fields. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that could not be parsed as a Syslog message will be transferred to this Relationship without any attributes being added |
| success | Any FlowFile that is successfully parsed as a Syslog message will be to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| syslog.priority | The priority of the Syslog message. |
| syslog.severity | The severity of the Syslog message derived from the priority. |
| syslog.facility | The facility of the Syslog message derived from the priority. |
| syslog.version | The optional version from the Syslog message. |
| syslog.timestamp | The timestamp of the Syslog message. |
| syslog.hostname | The hostname or IP address of the Syslog message. |
| syslog.appname | The appname of the Syslog message. |
| syslog.procid | The procid of the Syslog message. |
| syslog.messageid | The messageid the Syslog message. |
| syslog.structuredData | Multiple entries per structuredData of the Syslog message. |
| syslog.sender | The hostname of the Syslog server that sent the message. |
| syslog.body | The body of the Syslog message, everything after the hostname. |

## See also

* [org.apache.nifi.processors.standard.ListenSyslog](listensyslog.md)
* [org.apache.nifi.processors.standard.ParseSyslog](parsesyslog.md)
* [org.apache.nifi.processors.standard.PutSyslog](putsyslog.md)

---
title: PartitionRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/partitionrecord.md
section: Loading & Unloading Data
---

# PartitionRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits, or partitions, record-oriented data based on the configured fields in the data. One or more properties must be added. The name of the property is the name of an attribute to add. The value of the property is a RecordPath to evaluate against each Record. Two records will go to the same outbound FlowFile only if they have the same value for each of the given RecordPaths. Because we know that all records in a given output FlowFile have the same value for the fields that are specified by the RecordPath, an attribute is added for each field. See Additional Details on the Usage page for more information and examples.

## Tags

bin, group, organize, partition, record, recordpath, rpath, segment, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be partitioned from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| original | Once all records in an incoming FlowFile have been partitioned, the original FlowFile is routed to this relationship. |
| success | FlowFiles that are successfully partitioned will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records in an outgoing FlowFile |
| mime.type | The MIME Type that the configured Record Writer indicates is appropriate |
| fragment.identifier | All partitioned FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the partitioned FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of partitioned FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |
| <dynamic property name> | For each dynamic property that is added, an attribute may be added to the FlowFile. See the description for Dynamic Properties for more information. |

## Use cases

|  |
| --- |
| Separate records into separate FlowFiles so that all of the records in a FlowFile have the same value for a given field or set of fields. |
| Separate records based on whether or not they adhere to a specific criteria |

## See also

* [org.apache.nifi.processors.standard.ConvertRecord](convertrecord.md)
* [org.apache.nifi.processors.standard.QueryRecord](queryrecord.md)
* [org.apache.nifi.processors.standard.SplitRecord](splitrecord.md)
* [org.apache.nifi.processors.standard.UpdateRecord](updaterecord.md)

---
title: PEMEncodedSSLContextProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/pemencodedsslcontextprovider.md
section: Loading & Unloading Data
---

# PEMEncodedSSLContextProvider

## Description

SSLContext Provider configurable using PEM Private Key and Certificate files. Supports PKCS1 and PKCS8 encoding for Private Keys as well as X.509 encoding for Certificates.

## Tags

Certificate, ECDSA, Ed25519, Key, PEM, PKCS1, PKCS8, RSA, SSL, TLS, X.509

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Certificate Authorities \* | Certificate Authorities |  |  | PEM X.509 Certificate Authorities trusted for verifying peers in TLS communications containing one or more standard certificates |
| Certificate Authorities Source \* | Certificate Authorities Source | PROPERTIES | * Properties * System | Source of information for loading trusted Certificate Authorities |
| Certificate Chain \* | Certificate Chain |  |  | PEM X.509 Certificate Chain associated with Private Key starting with standard BEGIN CERTIFICATE header |
| Certificate Chain Location \* | Certificate Chain Location |  |  | PEM X.509 Certificate Chain file location associated with Private Key starting with standard BEGIN CERTIFICATE header |
| Private Key \* | Private Key |  |  | PEM Private Key encoded using either PKCS1 or PKCS8. Supported algorithms include ECDSA, Ed25519, and RSA |
| Private Key Location \* | Private Key Location |  |  | PEM Private Key file location encoded using either PKCS1 or PKCS8. Supported algorithms include ECDSA, Ed25519, and RSA |
| Private Key Source \* | Private Key Source | PROPERTIES | * Undefined * Properties * Files | Source of information for loading Private Key and Certificate Chain |
| TLS Protocol \* | TLS Protocol | TLS | * TLS * TLSv1.3 * TLSv1.2 | TLS protocol version required for negotiating encrypted communications. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Performance Tuning of the Openflow Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/performance-tuning.md
section: Loading & Unloading Data
---

# Performance Tuning of the Openflow Connector for Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic provides guidance for optimizing the performance of the [About Openflow Connector for Kafka](about.md)
to achieve optimal throughput and minimize latency when ingesting data into Snowflake.

## Performance considerations

When configuring the Openflow Connector for Kafka for optimal performance,
consider the following key factors that impact ingestion throughput and latency:

### Message characteristics

Message size
:   Larger messages may provide better throughput but may require more memory and processing time per message.

Message format
:   JSON messages typically require more processing overhead compared to AVRO
    messages due to schema inference and different serialization/deserialization.

Message volume
:   Higher message volumes benefit from parallel processing and larger batch sizes.

### Kafka configuration

Partition count
:   More partitions allow for higher parallelism but require careful coordination with consumer configuration.

Compression
:   Message compression can reduce network bandwidth but increases CPU overhead.

### Flowfile optimization

Flowfile size
:   For optimal performance, flowfiles should be in the range 1-10 MB rather than containing individual small messages.
    Larger flowfiles reduce processing overhead and improve throughput by minimizing the number of individual file operations.
    Default settings should yield flowfiles in an acceptable size range.
    Small flowfiles are expected when throughput is low.

    If you observe small flowfiles with high throughput, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for assistance.

### Network and infrastructure

Network latency
:   Lower latency between Kafka brokers and Openflow improves overall performance.

Bandwidth
:   Sufficient network bandwidth is critical for high-throughput scenarios.

## Node size recommendations

The following table provides configuration recommendations based on expected workload characteristics.

| Node Size | Recommended For | Message Rate Capacity |
| --- | --- | --- |
| Small (S) | Low to moderate throughput scenarios | Up to 10 MB/s per node |
| Medium (M) | Moderate to high throughput scenarios | Up to 40 MB/s per node |
| Large (L) | High throughput scenarios | Exceeding 40 MB/s per node |

## Performance optimization best practices

### Adjusting processor concurrent tasks

To optimize processor performance, you can adjust the number of concurrent tasks for both [ConsumeKafka](../../processors/consumekafka.md)
and [PutSnowpipeStreaming](../../processors/putsnowpipestreaming.md) processors.
Concurrent tasks allow processors to run multiple threads simultaneously, improving throughput for high-volume scenarios.

To adjust concurrent tasks for a processor, perform the following tasks:

1. Right-click on the processor in the Openflow canvas.
2. Select Configure from the context menu.
3. Navigate to the Scheduling tab.
4. In the Concurrent tasks field, enter the preferred number of concurrent tasks.
5. Select Apply to save the configuration.

#### Recommended concurrent task settings

| Node Size | ConsumeKafka Tasks | PutSnowpipeStreaming Tasks |
| --- | --- | --- |
| Small (S) | 1 | 1-2 |
| Medium (M) | 2 | 2-4 |
| Large (L) | 4-8 | 4-10 |

#### Important considerations

Memory usage
:   Each concurrent task consumes additional memory. Monitor JVM heap usage when increasing concurrent tasks.

Kafka partitions
:   For ConsumeKafka, the number of concurrent tasks multiplied by number of nodes should not exceed the number of total Kafka partitions from all topics.

Start conservatively
:   Begin with lower values and gradually increase while monitoring performance metrics.

### Adjusting Max Batch Size in PutSnowpipeStreaming processor

The Max Batch Size parameter in the PutSnowpipeStreaming processor controls how many records are processed in a single batch. Tuning this parameter helps optimize memory usage and throughput.

The Max Batch Size should be tuned based on average record size to keep total batch size (Max Batch Size × average record size) around 4 MB, not exceeding 16 MB for optimal performance.

For example: If average record size is 1KB, Max Batch Size should be set to 4,000.

To adjust Max Batch Size, perform the following

1. Right-click on the PutSnowpipeStreaming processor.
2. Select Configure from the context menu.
3. Navigate to the Properties tab.
4. Locate the Max Batch Size property.
5. Enter the calculated value based on your average record size.
6. Select Apply to save the changes.

#### Important considerations

* Monitor memory usage and throughput when adjusting batch size.
* Start with these recommended values and adjust only if needed while monitoring performance.

## Scaling considerations

The Openflow Platform uses a Horizontal Pod Autoscaler (HPA) based on CPU utilization and does not support custom metrics-based autoscaling.

Proper configuration of concurrent tasks is critical for effective autoscaling.
If concurrent tasks are set too low, the system may not scale up even when Kafka lag is increasing,
because the CPU utilization threshold required to trigger scaling may not be reached.
This can result in processing delays and accumulated backlogs despite the availability of additional resources.

To ensure optimal scaling behavior, configure concurrent tasks according to the
recommendations in Adjusting processor concurrent tasks and monitor both CPU utilization and Kafka lag metrics.

## Troubleshooting performance issues

### Common performance bottlenecks

#### High consumer lag or Snowflake ingestion bottlenecks

If Kafka consumer lag is increasing or Snowflake ingestion is slow, then perform the following tasks:

1. Verify network connectivity and bandwidth between Openflow and Kafka brokers.
2. Observe if the queue in front of the PutSnowpipeStreaming processor increases.

   1. If yes, consider adding more concurrent tasks for PutSnowpipeStreaming processor in the range limitations provided in Adjusting processor concurrent tasks.
   2. If not, consider adding more concurrent tasks for the ConsumeKafka processor in the range limitations provided in Adjusting processor concurrent tasks.
3. Consider using a bigger node type.
4. Consider increasing the max number of nodes for the runtime.

#### Memory pressure

If experiencing memory-related issues:

1. Reduce the batch sizes to lower memory footprint.
2. Reduce the number of concurrent tasks for the ConsumeKafka processor.
3. Consider upgrading to a bigger node type.

#### Network latency issues

If experiencing high latency:

1. Verify network configuration between Openflow and external systems.
2. Consider deploying Openflow closer to your Kafka cluster.
3. If working with low throughput, consider lowering the **Client Lag** settings
   in the PutSnowpipeStreaming processor and **Max Uncommitted Time** in the ConsumeKafka processor.

## Next steps

* Start with the recommended configuration for your node size.
* Monitor performance metrics and adjust settings based on observed behavior.
* Consider load testing in a non-production environment before deploying to production.

---
title: PerformSnowflakeCortexOCR 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/performsnowflakecortexocr.md
section: Loading & Unloading Data
---

# PerformSnowflakeCortexOCR 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Performs Optical Character Recognition (OCR) on PDF documents using Snowflake Cortex ML functions. Documents must be staged in a Snowflake internal stage with server-side encryption enabled. The processor extracts text content from PDFs and can output the results either as FlowFile content or as an attribute.

## Tags

ai, cortex, document, ml, ocr, openflow, pdf, snowflake

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Database | The Snowflake database containing the stage |
| Filename | The filename of the file to perform OCR on, it must be uploaded to the stage prior to performing OCR. FlowFile attributes may be referenced via Expression Language. |
| Max Attribute Size | The maximum size of the OCR results that can be written to an attribute. If the OCR results exceed this, the FlowFile will be routed to failure. |
| OCR Mode | Specifies how document text and structure should be extracted. In ‘OCR’ mode, only raw text content is extracted, ignoring formatting and table structures. In ‘LAYOUT’ mode, the output preserves table structures as markdown. |
| Output Strategy | Determines response output destination |
| Results Attribute | The name of the attribute to write the OCR response to. |
| Schema | The Snowflake schema containing the stage |
| Snowflake Connection Service | Database Connection Service for accessing Snowflake |
| Stage | The Snowflake stage where PDFs will be temporarily stored. The stage must have server-side encryption enabled. FlowFile attributes may be referenced via Expression Language |

## Relationships

| Name | Description |
| --- | --- |
| empty | FlowFiles for which OCR results are empty |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed (with non-empty OCR results) are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The MIME type of the output content (text/plain when output strategy is FLOW_FILE) |
| snowflake.error.information | Contains error information if Snowflake Cortex OCR operation returns an error |

## See also

* [com.snowflake.openflow.runtime.processors.snowflake.PutSnowflakeInternalStageFile](putsnowflakeinternalstagefile.md)

---
title: PickTablesForReplication 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/picktablesforreplication.md
section: Loading & Unloading Data
---

# PickTablesForReplication 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Accepts a list of fully qualified table names and determines if a table: - is new (is not replicated, but was added in the source) - is existing (is replicated and exists in the source) - is stale (is replicated but no longer exists in the source) Configuration is passed as a FlowFile attribute. Processor generates a separate FlowFile for each source table.

## Tags

snowflake, state, table

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Table State Service | A service containing currently replicated tables and their states |

## Relationships

| Name | Description |
| --- | --- |
| existing | FlowFile with qualified table name that is already being replicated |
| failure | If a FlowFile attribute cannot be read or is incorrect, it will be routed to this Relationship. |
| new | FlowFile with qualified table name that was is not replicated |
| stale | FlowFile with qualified table name that used to be replicated but no longer is, either because it was removed from source database or excluded by parameter |

## Writes attributes

| Name | Description |
| --- | --- |
| source.schema.name | Name of the schema of the table from which an event originated |
| source.table.name | Name of the table from which an event originated |

---
title: PolarisIcebergCatalog
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/polarisicebergcatalog.md
section: Loading & Unloading Data
---

# PolarisIcebergCatalog

## Description

Provides Apache Iceberg integration with Apache Polaris Catalog access over REST HTTP

## Tags

catalog, iceberg, openflow, polaris

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Access Token Scopes \* | Access Token Scopes | catalog |  | Comma-separated list of one or more OAuth 2 scopes requested for Access Tokens |
| Authentication Strategy \* | Authentication Strategy | OAUTH2 | * Bearer Authentication * OAuth 2.0 | Strategy for authenticating with the Apache Iceberg Catalog over HTTP |
| Authorization Grant Type \* | Authorization Grant Type | CLIENT_CREDENTIALS | * Client Credentials | OAuth 2.0 Authorization Grant Type for obtaining Access Tokens |
| Authorization Server URI \* | Authorization Server URI |  |  | Authorization Server URI supporting OAuth 2 |
| Bearer Token \* | Bearer Token |  |  | Bearer Token for authentication to Apache Iceberg Catalog |
| Catalog URI \* | Catalog URI |  |  | Apache Iceberg Catalog REST URI |
| Client ID \* | Client ID |  |  | Client ID for OAuth 2 Client Credentials |
| Client Secret \* | Client Secret |  |  | Client Secret for OAuth 2 Client Credentials |
| Warehouse Location | Warehouse Location |  |  | Apache Iceberg Catalog Warehouse location or identifier |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: PromptAnthropicAI 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptanthropicai.md
section: Loading & Unloading Data
---

# PromptAnthropicAI 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-anthropic-nar

## Description

Sends a prompt to Anthropic, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. The prompt may consist of pure text interaction or may include an image. Use dynamic properties to enable beta features in the Anthropic endpoint.

## Tags

ai, anthropic, chat, image, openflow, prompt, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Anthropic API Key | The API Key for authenticating to Anthropic |
| Assistant Message | The assistant message to send to Anthropic. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. The assistant message is added last |
| Image MIME Type | The MIME type of the image in the FlowFile content. Supported types are image/jpeg, image/png, image/gif, and image/webp. |
| Max File Size | The maximum size of a FlowFile that can be sent to Anthropic as an image. If the FlowFile is larger than this, it will be routed to ‘failure’. |
| Max Tokens | The maximum number of tokens to generate |
| Model Name | The name of the Anthropic model |
| Output Strategy | Determines response output destination |
| Prompt Type | The type of prompt to send to Anthropic. TEXT to send a simple prompt. IMAGE to send an image first and then a prompt. Use JSON for advanced use of Anthropic’s /v1/messages endpoint. |
| Response Format | The format of the response from Anthropic |
| Results Attribute | The name of the attribute to write the response to. |
| Stop Sequences | A comma delimited list of strings act as stop sequences. The model will halt after encountering one of the stop sequences. |
| System Message | The system message to send to Anthropic. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Temperature | The temperature to use for generating the response. Defaults to 1.0. Ranges from 0.0 to 1.0. Use temperature closer to 0.0 for analytical / multiple choice, and closer to 1.0 for creative and generative tasks. |
| Top K | The top K value to use for generating the response. Only sample from the top K options for each subsequent token. Recommended for advanced use cases only. You usually only need to use temperature. |
| Top P | The top P value to use for generating the response. Top P is for nucleus sampling, we compute the cumulative distribution over all the options for each subsequent token in decreasing probability order and cut it off once it reaches a particular probability specified by top_p. Recommended for advanced use cases only. You usually only need to use temperature. |
| User ID | The user id to set in the request metadata |
| User Message | The user message to send to Anthropic. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. The user message is added first, unless an image is present. |
| Web Client Service | The Web Client Service to use for communicating with Anthropic |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to obtain a valid response from Anthropic, the original FlowFile will be routed to this relationship |
| retry | If a 5XX response from Anthropic is returned, the original FlowFile will be routed to this relationship |
| success | The response from Anthropic is routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| anthropic.usage.inputTokens | The number of input tokens read in the request. |
| anthropic.usage.outputTokens | The number of output tokens generated in the response. |
| anthropic.chat.completion.id | A unique id assigned to the conversation |
| anthropic.chat.completion.stop.reason | The reason that we stopped. |
| anthropic.chat.completion.stop.sequence | Which custom stop sequence was generated, if any, may be ‘null’. |
| mime.type | The mime type of the response. |
| filename | An updated filename for the response. |

---
title: PromptAzureOpenAI 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptazureopenai.md
section: Loading & Unloading Data
---

# PromptAzureOpenAI 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-openai-nar

## Description

Sends a prompt to Azure’s OpenAI service, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. The prompt may consist of pure text interaction or may include images. In the case of images, a URL may be provided, or the contents of the FlowFile may be used, depending on the provided configuration

## Tags

ai, azure, chat, image, openai, openflow, prompt, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| API Key | The API key for authenticating to the Azure OpenAI service |
| Deployment Name | The name of the OpenAI model deployment |
| Detail Level | The image detail level that OpenAI should use for processing the image. Low detail will be less expensive and lower latency, while a high level may provide better results. |
| Image MIME Type | The MIME type of the image |
| Image URL | The URL of the image to send to OpenAI. If not specified, the contents of the FlowFile will be used as the image. |
| Max File Size | The maximum size of a FlowFile that can be sent to OpenAI as an image. If the FlowFile is larger than this, it will be routed to ‘failure’. |
| Max Tokens | The maximum number of tokens to generate |
| OpenAI Service Name | The name of the OpenAI service to use |
| Prompt Type | The type of prompt to send to OpenAI |
| Response Format | The format of the response from OpenAI |
| Results Attribute | The name of the attribute to write the response to. If unset, the response will be written to the FlowFile content. |
| Seed | The seed to use for generating the response |
| System Message | The system message to send to OpenAI. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Temperature | The temperature to use for generating the response. |
| Top P | The top P value to use for generating the response |
| User | Your end user, sent to OpenAI for monitoring and detection of abuse |
| User Message | The user message to send to OpenAI. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Web Client Service | The Web Client Service to use for communicating with OpenAI |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to obtain a valid response from Azure OpenAI, the original FlowFile will be routed to this relationship |
| success | The response from Azure OpenAI is routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.openai.CreateAzureOpenAiEmbeddings](createazureopenaiembeddings.md)
* [com.snowflake.openflow.runtime.processors.openai.PromptOpenAI](promptopenai.md)

---
title: PromptLLM 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptllm.md
section: Loading & Unloading Data
---

# PromptLLM 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-llm-processors-nar

## Description

This processor sends a user defined prompt to a Large Language Model (LLM) to respond.

## Tags

ai, llm, openflow, prompt, text processing

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Assistant Message | The assistant message to send to the LLM. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. The assistant message is added last |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Output Strategy | Determines response output destination |
| Results Attribute | The name of the attribute to write the response to. |
| System Message | The system message to send to the LLM. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. The system message is added first. |
| User Message | The user message to send to the LLM. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

---
title: PromptOpenAI 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptopenai.md
section: Loading & Unloading Data
---

# PromptOpenAI 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-openai-nar

## Description

Sends a prompt to OpenAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. The prompt may consist of pure text interaction or may include images. In the case of images, a URL may be provided, or the contents of the FlowFile may be used, depending on the provided configuration

## Tags

ai, chat, image, openai, openflow, prompt, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Detail Level | The image detail level that OpenAI should use for processing the image. Low detail will be less expensive and lower latency, while a high level may provide better results. |
| Image MIME Type | The MIME type of the image |
| Image Model Name | The name of the OpenAI model |
| Image URL | The URL of the image to send to OpenAI. If not specified, the contents of the FlowFile will be used as the image. |
| Max File Size | The maximum size of a FlowFile that can be sent to OpenAI as an image. If the FlowFile is larger than this, it will be routed to ‘failure’. |
| Max Tokens | The maximum number of tokens to generate |
| OpenAI API Key | The API Key for authenticating to OpenAI |
| OpenAI Organization | The organization to use for OpenAI |
| Prompt Type | The type of prompt to send to OpenAI |
| Response Format | The format of the response from OpenAI |
| Results Attribute | The name of the attribute to write the response to. If unset, the response will be written to the FlowFile content. |
| Seed | The seed to use for generating the response |
| System Message | The system message to send to OpenAI. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Temperature | The temperature to use for generating the response. |
| Text Model Name | The name of the OpenAI model |
| Top P | The top P value to use for generating the response |
| User | Your end user, sent to OpenAI for monitoring and detection of abuse |
| User Message | The user message to send to OpenAI. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Web Client Service | The Web Client Service to use for communicating with OpenAI |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to obtain a valid response from OpenAI, the original FlowFile will be routed to this relationship |
| success | The response from OpenAI is routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.openai.CreateOpenAiEmbeddings](createopenaiembeddings.md)
* [com.snowflake.openflow.runtime.processors.openai.PromptAzureOpenAI](promptazureopenai.md)

---
title: PromptSnowflakeCortex 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptsnowflakecortex.md
section: Loading & Unloading Data
---

# PromptSnowflakeCortex 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Sends a prompt to Snowflake Cortex, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. The prompt may consist of pure text interaction only.

## Tags

ai, chat, cortex, openflow, prompt, snowflake, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Enable Cortex Guardrails | Filters potentially unsafe and harmful responses from a language model. Either true or false. |
| Max Tokens | The maximum number of tokens to generate |
| Output Strategy | Determines response output destination |
| Response Format | The format of the response from Snowflake Cortex |
| Results Attribute | The name of the attribute to write the response to. |
| Snowflake Connection Service | Database Connection Service for accessing Snowflake |
| System Message | The system message to send to Snowflake Cortex. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Temperature | The temperature to use for generating the response. |
| Text Model Name | The name of the Snowflake Cortex model |
| Top P | The top P value to use for generating the response |
| User Message | The user message to send to Snowflake Cortex. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to obtain a valid response from Snowflake Cortex, the original FlowFile will be routed to this relationship |
| success | The response from Snowflake Cortex is routed to this relationship |

---
title: PromptVertexAI 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/promptvertexai.md
section: Loading & Unloading Data
---

# PromptVertexAI 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-vertexai-nar

## Description

Sends a prompt to VertexAI, writing the response either as a FlowFile attribute or to the contents of the incoming FlowFile. The prompt may consist of pure text interaction or may include multimedia.

## Tags

ai, chat, cloud, gcp, google, image, openflow, pdf, prompt, text, video

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| GCP Location | The location to configure the Vertex client with |
| GCP Project ID | The project ID to configure the Vertex client with |
| Max File Size | The maximum size of a FlowFile that can be sent to Vertex as an image. If the FlowFile is larger than this, it will be routed to ‘failure’. |
| Max Tokens | The maximum number of tokens to generate |
| Media MIME Type | The MIME type of the media in the FlowFile content. Supported media types are listed here: <https://firebase.google.com/docs/vertex-ai/input-file-requirements> |
| Model Name | The name of the Vertex model |
| Output Strategy | Determines response output destination |
| Prompt Type | The type of prompt to send to Vertex. Text to send a simple prompt. Media to send a multimedia type first followed by a text prompt. |
| Response Format | The format of the response from Vertex |
| Results Attribute | The name of the attribute to write the response to. |
| Stop Sequences | A comma delimited list of strings act as stop sequences. The model will halt after encountering one of the stop sequences. |
| System Message | The system message to send to Vertex. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| Temperature | The temperature to use for generating the response. Defaults to 1.0. Ranges from 0.0 to 1.0. Use temperature closer to 0.0 for analytical / multiple choice, and closer to 1.0 for creative and generative tasks. |
| Top K | The top K value to use for generating the response. Only sample from the top K options for each subsequent token. Recommended for advanced use cases only. You usually only need to use temperature. |
| Top P | The top P value to use for generating the response. Top P is for nucleus sampling, we compute the cumulative distribution over all the options for each subsequent token in decreasing probability order and cut it off once it reaches a particular probability specified by top_p. Recommended for advanced use cases only. You usually only need to use temperature. |
| User Message | The user message to send to Vertex. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content}. The user message is added first, unless an image is present. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If unable to obtain a valid response from Vertex, the original FlowFile will be routed to this relationship |
| success | The response from Vertex is routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| vertex.usage.inputTokens | The number of input tokens read in the request. |
| vertex.usage.outputTokens | The number of output tokens generated in the response. |
| vertex.chat.completion.id | A unique id assigned to the conversation |
| mime.type | The mime type of the response. |
| filename | An updated filename for the response. |

---
title: PropertiesFileLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/propertiesfilelookupservice.md
section: Loading & Unloading Data
---

# PropertiesFileLookupService

## Description

A reloadable properties file-based lookup service

## Tags

cache, enrich, join, key, lookup, properties, reloadable, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Configuration File \* | configuration-file |  |  | A configuration file |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ProtobufReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/protobufreader.md
section: Loading & Unloading Data
---

# ProtobufReader

## Description

Parses a Protocol Buffers message from binary format.

## Tags

parser, protobuf, reader, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Message Type \* | Message Type |  |  | Fully qualified name of the Protocol Buffers message type including its package (eg. mypackage.MyMessage). The .proto files configured in ‘Proto Directory’ must contain the definition of this message type. |
| Proto Directory \* | Proto Directory |  |  | Directory containing Protocol Buffers message definition (.proto) file(s). |
| Schema Access Strategy \* | Schema Access Strategy | generate-from-proto-file | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Generate from Proto file | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: PublishAMQP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishamqp.md
section: Loading & Unloading Data
---

# PublishAMQP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-amqp-nar

## Description

Creates an AMQP Message from the contents of a FlowFile and sends the message to an AMQP Exchange. In a typical AMQP exchange model, the message that is sent to the AMQP Exchange will be routed based on the ‘Routing Key’ to its final destination in the queue (the binding). If due to some misconfiguration the binding between the Exchange, Routing Key and Queue is not set up, the message will have no final destination and will return (i.e., the data will not make it to the queue). If that happens you will see a log in both app-log and bulletin stating to that effect, and the FlowFile will be routed to the ‘failure’ relationship.

## Tags

amqp, message, publish, put, rabbit, send

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AMQP Version | AMQP Version. Currently only supports AMQP v0.9.1. |
| Brokers | A comma-separated list of known AMQP Brokers in the format <host>:<port> (e.g., localhost:5672). If this is set, Host Name and Port are ignored. Only include hosts from the same AMQP cluster. |
| Client Certificate Authentication Enabled | Authenticate using the SSL certificate rather than user name/password. |
| Exchange Name | The name of the AMQP Exchange the messages will be sent to. Usually provided by the AMQP administrator (e.g., ‘amq.direct’). It is an optional property. If kept empty the messages will be sent to a default AMQP exchange. |
| Header Separator | The character that is used to split key-value for headers. The value must only one character. Otherwise you will get an error message |
| Headers Pattern | Regular expression that will be evaluated against the FlowFile attributes to select the matching attributes and put as AMQP headers. Attribute name will be used as header key. |
| Headers Source | The source of the headers which will be applied to the published message. |
| Host Name | Network address of AMQP broker (e.g., localhost). If Brokers is set, then this property is ignored. |
| Password | Password used for authentication and authorization. |
| Port | Numeric value identifying Port of AMQP broker (e.g., 5671). If Brokers is set, then this property is ignored. |
| Routing Key | The name of the Routing Key that will be used by AMQP to route messages from the exchange to a destination queue(s). Usually provided by the administrator (e.g., ‘myKey’)In the event when messages are sent to a default exchange this property corresponds to a destination queue name, otherwise a binding from the Exchange to a Queue via Routing Key must be set (usually by the AMQP administrator) |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Username | Username used for authentication and authorization. |
| Virtual Host | Virtual Host name which segregates AMQP system for enhanced security. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be routed to the AMQP destination are routed to this relationship |
| success | All FlowFiles that are sent to the AMQP destination are routed to this relationship |

---
title: PublishGCPubSub 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishgcpubsub.md
section: Loading & Unloading Data
---

# PublishGCPubSub 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Publishes the content of the incoming flowfile to the configured Google Cloud PubSub topic. The processor supports dynamic properties. If any dynamic properties are present, they will be sent along with the message in the form of ‘attributes’.

## Tags

gcp, google, google-cloud, message, publish, pubsub

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| Input Batch Size | Maximum number of FlowFiles processed for each Processor invocation |
| Maximum Message Size | The maximum size of a Google PubSub message in bytes. Defaults to 1 MB (1048576 bytes) |
| Message Derivation Strategy | The strategy used to publish the incoming FlowFile to the Google Cloud PubSub endpoint. |
| Record Reader | The Record Reader to use for incoming FlowFiles |
| Record Writer | The Record Writer to use in order to serialize the data before sending to GCPubSub endpoint |
| api-endpoint | Override the gRPC endpoint in the form of [host:port] |
| gcp-batch-bytes | Publish request gets triggered based on this Batch Bytes Threshold property and the Batch Size Threshold property, whichever condition is met first. |
| gcp-project-id | Google Cloud Project ID |
| gcp-pubsub-publish-batch-delay | Indicates the delay threshold to use for batching. After this amount of time has elapsed (counting from the first element added), the elements will be wrapped up in a batch and sent. This value should not be set too high, usually on the order of milliseconds. Otherwise, calls might appear to never complete. |
| gcp-pubsub-publish-batch-size | Indicates the number of messages the cloud service should bundle together in a batch. If not set and left empty, only one message will be used in a batch |
| gcp-pubsub-topic | Name of the Google Cloud PubSub Topic |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the Google Cloud Pub/Sub operation fails. |
| retry | FlowFiles are routed to this relationship if the Google Cloud Pub/Sub operation fails but attempting the operation again may succeed. |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Pub/Sub operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| gcp.pubsub.messageId | ID of the pubsub message published to the configured Google Cloud PubSub topic |
| gcp.pubsub.count.records | Count of pubsub messages published to the configured Google Cloud PubSub topic |
| gcp.pubsub.topic | Name of the Google Cloud PubSub topic the message was published to |

## See also

* [org.apache.nifi.processors.gcp.pubsub.ConsumeGCPubSub](consumegcpubsub.md)

---
title: PublishJMS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishjms.md
section: Loading & Unloading Data
---

# PublishJMS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-jms-processors-nar

## Description

Creates a JMS Message from the contents of a FlowFile and sends it to a JMS Destination (queue or topic) as JMS BytesMessage or TextMessage. FlowFile attributes will be added as JMS headers and/or properties to the outgoing JMS message.

## Tags

jms, message, publish, put, send

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Client ID | The client id to be set on the connection, if set. For durable non shared consumer this is mandatory, for all others it is optional, typically with shared consumers it is undesirable to be set. Please see JMS spec for further details |
| Connection Factory Service | The Controller Service that is used to obtain Connection Factory. Alternatively, the ‘JNDI \*’ or the ‘JMS \*’ properties can also be used to configure the Connection Factory. |
| Destination Name | The name of the JMS Destination. Usually provided by the administrator (e.g., ‘topic://myTopic’ or ‘myTopic’). |
| Destination Type | The type of the JMS Destination. Could be one of ‘QUEUE’ or ‘TOPIC’. Usually provided by the administrator. Defaults to ‘QUEUE’ |
| Maximum Batch Size | The maximum number of messages to publish or consume in each invocation of the processor. |
| Password | Password used for authentication and authorization. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| User Name | User Name used for authentication and authorization. |
| allow-illegal-chars-in-jms-header-names | Specifies whether illegal characters in header names should be sent to the JMS broker. Usually hyphens and full-stops. |
| attributes-to-send-as-jms-headers-regex | Specifies the Regular Expression that determines the names of FlowFile attributes that should be sent as JMS Headers |
| broker | URI pointing to the network location of the JMS Message broker. Example for ActiveMQ: ‘<tcp://myhost:61616>’. Examples for IBM MQ: ‘myhost(1414)’ and ‘myhost01(1414),myhost02(1414)’. |
| cf | The fully qualified name of the JMS ConnectionFactory implementation class (eg. org.apache.activemq. ActiveMQConnectionFactory). |
| cflib | Path to the directory with additional resources (eg. JARs, configuration files etc.) to be added to the classpath (defined as a comma separated list of values). Such resources typically represent target JMS client libraries for the ConnectionFactory implementation. |
| character-set | The name of the character set to use to construct or interpret TextMessages |
| connection.factory.name | The name of the JNDI Object to lookup for the Connection Factory. |
| java.naming.factory.initial | The fully qualified class name of the JNDI Initial Context Factory Class (java.naming.factory.initial). |
| java.naming.provider.url | The URL of the JNDI Provider to use as the value for java.naming.provider.url. See additional details documentation for allowed URL schemes. |
| java.naming.security.credentials | The Credentials to use when authenticating with JNDI (java.naming.security.credentials). |
| java.naming.security.principal | The Principal to use when authenticating with JNDI (java.naming.security.principal). |
| message-body-type | The type of JMS message body to construct. |
| naming.factory.libraries | Specifies jar files and/or directories to add to the ClassPath in order to load the JNDI / JMS client libraries. This should be a comma-separated list of files, directories, and/or URLs. If a directory is given, any files in that directory will be included, but subdirectories will not be included (i.e., it is not recursive). |
| record-reader | The Record Reader to use for parsing the incoming FlowFile into Records. |
| record-writer | The Record Writer to use for serializing Records before publishing them as an JMS Message. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Client Library Location can reference resources over HTTP |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be sent to JMS destination are routed to this relationship |
| success | All FlowFiles that are sent to the JMS destination are routed to this relationship |

## See also

* [org.apache.nifi.jms.processors.ConsumeJMS](consumejms.md)

---
title: PublishKafka 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishkafka.md
section: Loading & Unloading Data
---

# PublishKafka 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-kafka-nar

## Description

Sends the contents of a FlowFile as either a message or as individual records to Apache Kafka using the Kafka Producer API. The messages to send may be individual FlowFiles, may be delimited using a user-specified delimiter (such as a new-line), or may be record-oriented data that can be read by the configured Record Reader. The complementary NiFi processor for fetching messages is ConsumeKafka.

## Tags

apache, avro, csv, json, kafka, logs, message, openflow, pubsub, put, record, send

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Failure Strategy | Specifies how the processor handles a FlowFile if it is unable to publish the data to Kafka |
| FlowFile Attribute Header Pattern | A Regular Expression that is matched against all FlowFile attribute names. Any attribute whose name matches the pattern will be added to the Kafka messages as a Header. If not specified, no FlowFile attributes will be added as headers. |
| Header Encoding | For any attribute that is added as a Kafka Record Header, this property indicates the Character Encoding to use for serializing the headers. |
| Kafka Connection Service | Provides connections to Kafka Broker for publishing Kafka Records |
| Kafka Key | The Key to use for the Message. If not specified, the FlowFile attribute ‘kafka.key’ is used as the message key, if it is present. Beware that setting Kafka key and demarcating at the same time may potentially lead to many Kafka messages with the same key. Normally this is not a problem as Kafka does not enforce or assume message and key uniqueness. Still, setting the demarcator and Kafka key at the same time poses a risk of data loss on Kafka. During a topic compaction on Kafka, messages will be deduplicated based on this key. |
| Kafka Key Attribute Encoding | FlowFiles that are emitted have an attribute named ‘kafka.key’. This property dictates how the value of the attribute should be encoded. |
| Message Demarcator | Specifies the string (interpreted as UTF-8) to use for demarcating multiple messages within a single FlowFile. If not specified, the entire content of the FlowFile will be used as a single message. If specified, the contents of the FlowFile will be split on this delimiter and each section sent as a separate Kafka message. To enter special character such as ‘new line’ use CTRL+Enter or Shift+Enter, depending on your OS. |
| Message Key Field | The name of a field in the Input Records that should be used as the Key for the Kafka message. |
| Publish Strategy | The format used to publish the incoming FlowFile record to Kafka. |
| Record Key Writer | The Record Key Writer to use for outgoing FlowFiles |
| Record Metadata Strategy | Specifies whether the Record ‘s metadata (topic and partition) should come from the Record’s metadata field or if it should come from the configured Topic Name and Partition / Partitioner class properties |
| Record Reader | The Record Reader to use for incoming FlowFiles |
| Record Writer | The Record Writer to use in order to serialize the data before sending to Kafka |
| Topic Name | Name of the Kafka Topic to which the Processor publishes Kafka Records |
| Transactional ID Prefix | Specifies the KafkaProducer config transactional.id will be a generated UUID and will be prefixed with the configured string. |
| Transactions Enabled | Specifies whether to provide transactional guarantees when communicating with Kafka. If there is a problem sending data to Kafka, and this property is set to false, then the messages that have already been sent to Kafka will continue on and be delivered to consumers. If this is set to true, then the Kafka transaction will be rolled back so that those messages are not available to consumers. Setting this to true requires that the [Delivery Guarantee] property be set to [Guarantee Replicated Delivery.] |
| acks | Specifies the requirement for guaranteeing that a message is sent to Kafka. Corresponds to Kafka Client acks property. |
| compression.type | Specifies the compression strategy for records sent to Kafka. Corresponds to Kafka Client compression.type property. |
| max.request.size | The maximum size of a request in bytes. Corresponds to Kafka Client max.request.size property. |
| partition | Specifies the Kafka Partition destination for Records. |
| partitioner.class | Specifies which class to use to compute a partition id for a message. Corresponds to Kafka Client partitioner.class property. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that cannot be sent to Kafka will be routed to this Relationship |
| success | FlowFiles for which all content was sent to Kafka. |

## Writes attributes

| Name | Description |
| --- | --- |
| msg.count | The number of messages that were sent to Kafka for this FlowFile. This attribute is added only to FlowFiles that are routed to success. |

## See also

* [com.snowflake.openflow.runtime.processors.kafka.ConsumeKafka](consumekafka.md)

---
title: PublishMQTT 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishmqtt.md
section: Loading & Unloading Data
---

# PublishMQTT 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mqtt-nar

## Description

Publishes a message to an MQTT topic

## Tags

IOT, MQTT, publish

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Broker URI | The URI(s) to use to connect to the MQTT broker (e.g., <tcp://localhost:1883>). The ‘tcp’, ‘ssl’, ‘ws’ and ‘wss’schemes are supported. In order to use ‘ssl’, the SSL Context Service property must be set. When a comma-separated URI list is set (e.g., <tcp://localhost:1883,tcp://localhost:1884>), the processor will use a round-robin algorithm to connect to the brokers on connection failure. |
| Client ID | MQTT client ID to use. If not set, a UUID will be generated. |
| Connection Timeout (seconds) | Maximum time interval the client will wait for the network connection to the MQTT server to be established. The default timeout is 30 seconds. A value of 0 disables timeout processing meaning the client will wait until the network connection is made successfully or fails. |
| Keep Alive Interval (seconds) | Defines the maximum time interval between messages sent or received. It enables the client to detect if the server is no longer available, without having to wait for the TCP/IP timeout. The client will ensure that at least one message travels across the network within each keep alive period. In the absence of a data-related message during the time period, the client sends a very small “ping” message, which the server will acknowledge. A value of 0 disables keepalive processing in the client. |
| Last Will Message | The message to send as the client’s Last Will. |
| Last Will QoS Level | QoS level to be used when publishing the Last Will Message. |
| Last Will Retain | Whether to retain the client’s Last Will. |
| Last Will Topic | The topic to send the client’s Last Will to. |
| MQTT Specification Version | The MQTT specification version when connecting with the broker. See the allowable value descriptions for more details. |
| Password | Password to use when connecting to the broker |
| Quality of Service(QoS) | The Quality of Service (QoS) to send the message with. Accepts three values ‘0’, ‘1’ and ‘2’; ‘0’ for ‘at most once’, ‘1’ for ‘at least once’, ‘2’ for ‘exactly once’. Expression language is allowed in order to support publishing messages with different QoS but the end value of the property must be either ‘0’, ‘1’ or ‘2’. |
| Retain Message | Whether or not the retain flag should be set on the MQTT message. |
| SSL Context Service | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Session Expiry Interval | After this interval the broker will expire the client and clear the session state. |
| Session state | Whether to start a fresh or resume previous flows. See the allowable value descriptions for more details. |
| Topic | The topic to publish the message to. |
| Username | Username to use when connecting to the broker |
| message-demarcator | With this property, you have an option to publish multiple messages from a single FlowFile. This property allows you to provide a string (interpreted as UTF-8) to use for demarcating apart the FlowFile content. This is an optional property ; if not provided, and if not defining a Record Reader/Writer, each FlowFile will be published as a single message. To enter special character such as ‘new line’ use CTRL+Enter or Shift+Enter depending on the OS. |
| record-reader | The Record Reader to use for parsing the incoming FlowFile into Records. |
| record-writer | The Record Writer to use for serializing Records before publishing them as an MQTT Message. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are transferred to this relationship. |
| success | FlowFiles that are sent successfully to the destination are transferred to this relationship. |

## See also

* [org.apache.nifi.processors.mqtt.ConsumeMQTT](consumemqtt.md)

---
title: PublishSlack 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/publishslack.md
section: Loading & Unloading Data
---

# PublishSlack 2025.10.9.21

## Bundle

org.apache.nifi | nifi-slack-nar

## Description

Posts a message to the specified Slack channel. The content of the message can be either a user-defined message that makes use of Expression Language or the contents of the FlowFile can be sent as the message. If sending a user-defined message, the contents of the FlowFile may also be optionally uploaded as a file attachment.

## Tags

chat.postMessage, conversation, publish, send, slack, social media, team, text, unstructured, upload, write

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Access Token | OAuth Access Token used for authenticating/authorizing the Slack request sent by NiFi. This may be either a User Token or a Bot Token. The token must be granted the chat:write scope. Additionally, in order to upload FlowFile contents as an attachment, it must be granted files:write. |
| Channel | The name or identifier of the channel to send the message to. If using a channel name, it must be prefixed with the # character. For example, #general. This is valid only for public channels. Otherwise, the unique identifier of the channel to publish to must be provided. |
| Character Set | Specifies the name of the Character Set used to encode the FlowFile contents. |
| Include FlowFile Content as Attachment | Specifies whether or not the contents of the FlowFile should be uploaded as an attachment to the Slack message. |
| Max FlowFile Size | The maximum size of a FlowFile that can be sent to Slack. If any FlowFile exceeds this size, it will be routed to failure. This plays an important role because the entire contents of the file must be loaded into NiFi’s heap in order to send the data to Slack. |
| Message Text | The text of the message to send to Slack. |
| Methods Endpoint Url Prefix | Customization of the Slack Client. Set the methodsEndpointUrlPrefix. If you need to set a different URL prefix for Slack API Methods calls, you can set the one. Default value: <https://slack.com/api/> |
| Publish Strategy | Specifies how the Processor will send the message or file to Slack. |
| Thread Timestamp | The Timestamp identifier for the thread that this message is to be a part of. If not specified, the message will be a top-level message instead of being in a thread. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to ‘failure’ if unable to be sent to Slack for any other reason |
| rate limited | FlowFiles are routed to ‘rate limited’ if the Rate Limit has been exceeded |
| success | FlowFiles are routed to success after being successfully sent to Slack |

## Writes attributes

| Name | Description |
| --- | --- |
| slack.channel.id | The ID of the Slack Channel from which the messages were retrieved |
| slack.ts | The timestamp of the slack messages that was sent; this is used by Slack as a unique identifier |

## Use cases

|  |
| --- |
| Send specific text as a message to Slack, optionally including the FlowFile’s contents as an attached file. |
| Send the contents of the FlowFile as a message to Slack. |

## Use Cases Involving Other Components

|  |
| --- |
| Respond to a Slack message in a thread. |

## See also

* [org.apache.nifi.processors.slack.ConsumeSlack](consumeslack.md)
* [org.apache.nifi.processors.slack.ListenSlack](listenslack.md)

---
title: PutAzureBlobStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazureblobstorage_v12.md
section: Loading & Unloading Data
---

# PutAzureBlobStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Puts content into a blob on Azure Blob Storage. The processor uses Azure Blob Storage client library v12.

## Tags

azure, blob, cloud, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Blob Name | The full name of the blob |
| Client-Side Encryption Key ID | Specifies the ID of the key to use for client-side encryption. |
| Client-Side Encryption Key Type | Specifies the key type to use for client-side encryption. |
| Client-Side Encryption Local Key | When using local client-side encryption, this is the raw key, encoded in hexadecimal |
| Conflict Resolution Strategy | Specifies whether an existing blob will have its contents replaced upon conflict. |
| Container Name | Name of the Azure storage container. In case of PutAzureBlobStorage processor, container can be created if it does not exist. |
| Create Container | Specifies whether to check if the container exists and to automatically create it if it does not. Permission to list containers is required. If false, this check is not made, but the Put operation will fail if the container does not exist. |
| File Resource Service | File Resource Service providing access to the local resource to be transferred |
| Resource Transfer Source | The source of the content to be transferred |
| Storage Credentials | Controller Service used to obtain Azure Blob Storage Credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Unsuccessful operations will be transferred to the failure relationship. |
| success | All successfully processed FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.container | The name of the Azure Blob Storage container |
| azure.blobname | The name of the blob on Azure Blob Storage |
| azure.primaryUri | Primary location of the blob |
| azure.etag | ETag of the blob |
| azure.blobtype | Type of the blob (either BlockBlob, PageBlob or AppendBlob) |
| mime.type | MIME Type of the content |
| lang | Language code for the content |
| azure.timestamp | Timestamp of the blob |
| azure.length | Length of the blob |
| azure.error.code | Error code reported during blob operation |
| azure.ignored | When Conflict Resolution Strategy is ‘ignore’, this property will be true/false depending on whether the blob was ignored. |

## See also

* [org.apache.nifi.processors.azure.storage.CopyAzureBlobStorage_v12](copyazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.DeleteAzureBlobStorage_v12](deleteazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureBlobStorage_v12](fetchazureblobstorage_v12.md)
* [org.apache.nifi.processors.azure.storage.ListAzureBlobStorage_v12](listazureblobstorage_v12.md)

---
title: PutAzureCosmosDBRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazurecosmosdbrecord.md
section: Loading & Unloading Data
---

# PutAzureCosmosDBRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

This processor is a record-aware processor for inserting data into Cosmos DB with Core SQL API. It uses a configured record reader and schema to read an incoming record set from the body of a Flowfile and then inserts those records into a configured Cosmos DB Container.

## Tags

azure, cosmos, insert, put, record

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Cosmos DB Access Key | Cosmos DB Access Key from Azure Portal (Settings->Keys). Choose a read-write key to enable database or container creation at run time |
| Cosmos DB Conflict Handling Strategy | Choose whether to ignore or upsert when conflict error occurs during insertion |
| Cosmos DB Connection Service | If configured, the controller service used to obtain the connection string and access key |
| Cosmos DB Consistency Level | Choose from five consistency levels on the consistency spectrum. Refer to Cosmos DB documentation for their differences |
| Cosmos DB Container ID | The unique identifier for the container |
| Cosmos DB Name | The database name or id. This is used as the namespace for document collections or containers |
| Cosmos DB Partition Key | The partition key used to distribute data among servers |
| Cosmos DB URI | Cosmos DB URI, typically in the form of <https:/>/{databaseaccount}.documents.azure.com:443/ Note this host URL is for Cosmos DB with Core SQL API from Azure Portal (Overview->URI) |
| Insert Batch Size | The number of records to group together for one single insert operation against Cosmos DB |
| Record Reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be written to Cosmos DB are routed to this relationship |
| success | All FlowFiles that are written to Cosmos DB are routed to this relationship |

---
title: PutAzureDataExplorer 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazuredataexplorer.md
section: Loading & Unloading Data
---

# PutAzureDataExplorer 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Acts as an Azure Data Explorer sink which sends FlowFiles to the provided endpoint. Data can be sent through queued ingestion or streaming ingestion to the Azure Data Explorer cluster.

## Tags

ADX, Azure, Data, Explorer, Kusto

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Data Format | The format of the data that is sent to Azure Data Explorer. Supported formats include: avro, csv, json |
| Database Name | Azure Data Explorer Database Name for ingesting data |
| Ingest Mapping Name | The name of the mapping responsible for storing the data in the appropriate columns. |
| Ingest Status Polling Interval | Defines the value of interval of time to poll for ingestion status |
| Ingest Status Polling Timeout | Defines the total amount time to poll for ingestion status |
| Ingestion Ignore First Record | Defines whether ignore first record while ingestion. |
| Kusto Ingest Service | Azure Data Explorer Kusto Ingest Service |
| Partially Succeeded Routing Strategy | Defines where to route FlowFiles that resulted in a partially succeeded status. |
| Poll for Ingest Status | Determines whether to poll on ingestion status after an ingestion to Azure Data Explorer is completed |
| Streaming Enabled | Whether to stream data to Azure Data Explorer. |
| Table Name | Azure Data Explorer Table Name for ingesting data |

## Relationships

| Name | Description |
| --- | --- |
| failure | Ingest processing failed |
| success | Ingest processing succeeded |

---
title: PutAzureDataLakeStorage 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazuredatalakestorage.md
section: Loading & Unloading Data
---

# PutAzureDataLakeStorage 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Writes the contents of a FlowFile as a file on Azure Data Lake Storage Gen 2

## Tags

adlsgen2, azure, cloud, datalake, microsoft, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ADLS Credentials | Controller Service used to obtain Azure Credentials. |
| Base Temporary Path | The Path where the temporary directory will be created. The Path name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. Non-existing directories will be created. The Temporary File Directory name is _nifitempdirectory |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the output directory |
| Directory Name | Name of the Azure Storage Directory. The Directory Name cannot contain a leading ‘/’. The root directory can be designated by the empty string value. In case of the PutAzureDataLakeStorage processor, the directory will be created if not already existing. |
| File Name | The filename |
| File Resource Service | File Resource Service providing access to the local resource to be transferred |
| Filesystem Name | Name of the Azure Storage File System (also called Container). It is assumed to be already existing. |
| Resource Transfer Source | The source of the content to be transferred |
| Writing Strategy | Defines the approach for writing the Azure file. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Azure storage for some reason are transferred to this relationship |
| success | Files that have been successfully written to Azure storage are transferred to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| azure.filesystem | The name of the Azure File System |
| azure.directory | The name of the Azure Directory |
| azure.filename | The name of the Azure File |
| azure.primaryUri | Primary location for file content |
| azure.length | The length of the Azure File |

## See also

* [org.apache.nifi.processors.azure.storage.DeleteAzureDataLakeStorage](deleteazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.FetchAzureDataLakeStorage](fetchazuredatalakestorage.md)
* [org.apache.nifi.processors.azure.storage.ListAzureDataLakeStorage](listazuredatalakestorage.md)

---
title: PutAzureEventHub 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazureeventhub.md
section: Loading & Unloading Data
---

# PutAzureEventHub 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Send FlowFile contents to Azure Event Hubs

## Tags

azure, cloud, eventhub, events, microsoft, streaming, streams

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Event Hub Name | Name of Azure Event Hubs destination |
| Event Hub Namespace | Namespace of Azure Event Hubs prefixed to Service Bus Endpoint domain |
| Maximum Batch Size | Maximum number of FlowFiles processed for each Processor invocation |
| Partitioning Key Attribute Name | If specified, the value from argument named by this field will be used as a partitioning key to be used by event hub. |
| Service Bus Endpoint | To support namespaces not in the default windows.net domain. |
| Shared Access Policy Key | The key of the shared access policy. Either the primary or the secondary key can be used. |
| Shared Access Policy Name | The name of the shared access policy. This policy must have Send claims. |
| Transport Type | Advanced Message Queuing Protocol Transport Type for communication with Azure Event Hubs |
| Use Azure Managed Identity | Choose whether or not to use the managed identity of Azure VM/VMSS |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that could not be sent to the event hub will be transferred to this Relationship. |
| success | Any FlowFile that is successfully sent to the event hubs will be transferred to this Relationship. |

---
title: PutAzureQueueStorage_v12 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putazurequeuestorage_v12.md
section: Loading & Unloading Data
---

# PutAzureQueueStorage_v12 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Writes the content of the incoming FlowFiles to the configured Azure Queue Storage.

## Tags

azure, cloud, enqueue, microsoft, queue, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Credentials Service | Controller Service used to obtain Azure Storage Credentials. |
| Endpoint Suffix | Storage accounts in public Azure always use a common FQDN suffix. Override this endpoint suffix with a different suffix in certain circumstances (like Azure Stack or non-public Azure regions). |
| Message Time To Live | Maximum time to allow the message to be in the queue |
| Queue Name | Name of the Azure Storage Queue |
| Request Timeout | The timeout for read or write requests to Azure Queue Storage. Defaults to 1 second. |
| Visibility Timeout | The length of time during which the message will be invisible after it is read. If the processing unit fails to delete the message after it is read, then the message will reappear in the queue. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Unsuccessful operations will be transferred to the failure relationship. |
| success | All successfully processed FlowFiles are routed to this relationship |

## See also

* [org.apache.nifi.processors.azure.storage.queue.GetAzureQueueStorage_v12](getazurequeuestorage_v12.md)

---
title: PutBigQuery 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putbigquery.md
section: Loading & Unloading Data
---

# PutBigQuery 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Writes the contents of a FlowFile to a Google BigQuery table. The processor is record based so the schema that is used is driven by the RecordReader. Attributes that are not matched to the target schema are skipped. Exactly once delivery semantics are achieved via stream offsets.

## Tags

bigquery, bq, google, google cloud

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| bigquery-api-endpoint | Can be used to override the default BigQuery endpoint. Default is bigquerystorage.googleapis.com:443. Format must be hostname:port. |
| bq.append.record.count | The number of records to be appended to the write stream at once. Applicable for both batch and stream types |
| bq.dataset | BigQuery dataset name (Note - The dataset must exist in GCP) |
| bq.record.reader | Specifies the Controller Service to use for parsing incoming data. |
| bq.skip.invalid.rows | Sets whether to insert all valid rows of a request, even if invalid rows exist. If not set the entire insert request will fail if it contains an invalid row. |
| bq.table.name | BigQuery table name |
| bq.transfer.type | Defines the preferred transfer type streaming or batching |
| gcp-project-id | Google Cloud Project ID |
| gcp-retry-count | How many retry attempts should be made before routing to the failure relationship. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the Google BigQuery operation fails. |
| success | FlowFiles are routed to this relationship after a successful Google BigQuery operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| bq.records.count | Number of records successfully inserted |

---
title: PutBoxFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putboxfile.md
section: Loading & Unloading Data
---

# PutBoxFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Puts content to a Box folder.

## Tags

box, put, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| Chunked Upload Threshold | The maximum size of the content which is uploaded at once. FlowFiles larger than this threshold are uploaded in chunks. Chunked upload is allowed for files larger than 20 MB. It is recommended to use chunked upload for files exceeding 50 MB. |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the specified Box folder. |
| Create Subfolder | Specifies whether to check if the subfolder exists and to automatically create it if it does not. Permission to list folders is required. |
| Filename | The name of the file to upload to the specified Box folder. |
| Folder ID | The ID of the folder where the file is uploaded. Please see Additional Details to obtain Folder ID. |
| Subfolder Name | The name (path) of the subfolder where files are uploaded. The subfolder name is relative to the folder specified by ‘Folder ID’. Example: subFolder, subFolder1/subfolder2 |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Box for some reason are transferred to this relationship. |
| success | Files that have been successfully written to Box are transferred to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The id of the file |
| filename | The name of the file |
| path | The folder path where the file is located |
| box.size | The size of the file |
| box.timestamp | The last modified time of the file |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)

---
title: PutCloudWatchMetric 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putcloudwatchmetric.md
section: Loading & Unloading Data
---

# PutCloudWatchMetric 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Publishes metrics to Amazon CloudWatch. Metric can be either a single value, or a StatisticSet comprised of minimum, maximum, sum and sample count.

## Tags

amazon, aws, cloudwatch, metrics, publish, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Maximum | The maximum value of the sample set. Must be a double |
| Metric Name | The name of the metric |
| Minimum | The minimum value of the sample set. Must be a double |
| Namespace | The namespace for the metric data for CloudWatch |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Sample Count | The number of samples used for the statistic set. Must be a double |
| Sum | The sum of values for the sample set. Must be a double |
| Timestamp | A point in time expressed as the number of milliseconds since Jan 1, 1970 00:00:00 UTC. If not specified, the default value is set to the time the metric data was received |
| Unit | The unit of the metric. (e.g Seconds, Bytes, Megabytes, Percent, Count, Kilobytes/Second, Terabits/Second, Count/Second) For details see <http://docs.aws.amazon.com/AmazonCloudWatch/latest/APIReference/API_MetricDatum.html> |
| Value | The value for the metric. Must be a double |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

---
title: PutDatabaseRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdatabaserecord.md
section: Loading & Unloading Data
---

# PutDatabaseRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

The PutDatabaseRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file. These records are translated to SQL statements and executed as a single transaction. If any errors occur, the flow file is routed to failure or retry, and if the records are transmitted successfully, the incoming flow file is routed to success. The type of statement executed by the processor is specified via the Statement Type property, which accepts some hard-coded values such as INSERT, UPDATE, and DELETE, as well as ‘Use statement.type Attribute’, which causes the processor to get the statement type from a flow file attribute. IMPORTANT: If the Statement Type is UPDATE, then the incoming records must not alter the value(s) of the primary keys (or user-specified Update Keys). If such records are encountered, the UPDATE statement issued to the database may do nothing (if no existing records with the new primary key values are found), or could inadvertently corrupt the existing data (by changing records for which the new values of the primary keys exist).

## Tags

database, delete, insert, jdbc, put, record, sql, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Name Translation Pattern | Column name will be normalized with this regular expression |
| Column Name Translation Strategy | The strategy used to normalize table column name. Column Name will be uppercased to do case-insensitive matching irrespective of strategy |
| Data Record Path | If specified, this property denotes a RecordPath that will be evaluated against each incoming Record and the Record that results from evaluating the RecordPath will be sent to the database instead of sending the entire incoming Record. If not specified, the entire incoming Record will be published to the database. |
| Database Dialect Service | Database Dialect Service for generating statements specific to a particular service or vendor. |
| Delete Keys | A comma-separated list of column names that uniquely identifies a row in the database for DELETE statements. If the Statement Type is DELETE and this property is not set, the table’s columns are used. This property is ignored if the Statement Type is not DELETE |
| Rollback On Failure | Specify how to handle error. By default (false), if an error occurs while processing a FlowFile, the FlowFile will be routed to ‘failure’ or ‘retry’ relationship based on error type, and processor can continue with next FlowFile. Instead, you may want to rollback currently processed FlowFiles and stop further processing immediately. In that case, you can do so by enabling this ‘Rollback On Failure’ property. If enabled, failed FlowFiles will stay in the input relationship without penalizing it and being processed repeatedly until it gets processed successfully or removed by other means. It is important to set adequate ‘Yield Duration’ to avoid retrying too frequently. |
| Statement Type Record Path | Specifies a RecordPath to evaluate against each Record in order to determine the Statement Type. The RecordPath should equate to either INSERT, UPDATE, UPSERT, or DELETE. (Debezium style operation types are also supported: “r” and “c” for INSERT, “u” for UPDATE, and “d” for DELETE) |
| database-session-autocommit | The autocommit mode to set on the database connection being used. If set to false, the operation(s) will be explicitly committed or rolled back (based on success or failure respectively). If set to true, the driver/database automatically handles the commit/rollback. |
| db-type | Database Type for generating statements specific to a particular service or vendor. The Generic Type supports most cases but selecting a specific type enables optimal processing or additional features. |
| put-db-record-allow-multiple-statements | If the Statement Type is ‘SQL’ (as set in the statement.type attribute), this field indicates whether to split the field value by a semicolon and execute each statement separately. If any statement causes an error, the entire set of statements will be rolled back. If the Statement Type is not ‘SQL’, this field is ignored. |
| put-db-record-binary-format | The format to be applied when decoding string values to binary. |
| put-db-record-catalog-name | The name of the database (or the name of the catalog, depending on the destination system) that the statement should update. This may not apply for the database that you are updating. In this case, leave the field empty. Note that if the property is set and the database is case-sensitive, the catalog name must match the database’s catalog name exactly. |
| put-db-record-dcbp-service | The Controller Service that is used to obtain a connection to the database for sending records. |
| put-db-record-field-containing-sql | If the Statement Type is ‘SQL’ (as set in the statement.type attribute), this field indicates which field in the record(s) contains the SQL statement to execute. The value of the field must be a single SQL statement. If the Statement Type is not ‘SQL’, this field is ignored. |
| put-db-record-max-batch-size | Specifies maximum number of sql statements to be included in each batch sent to the database. Zero means the batch size is not limited, and all statements are put into a single batch which can cause high memory usage issues for a very large number of statements. |
| put-db-record-query-timeout | The maximum amount of time allowed for a running SQL statement , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| put-db-record-quoted-identifiers | Enabling this option will cause all column names to be quoted, allowing you to use reserved words as column names in your tables. |
| put-db-record-quoted-table-identifiers | Enabling this option will cause the table name to be quoted to support the use of special characters in the table name. |
| put-db-record-record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema. |
| put-db-record-schema-name | The name of the schema that the table belongs to. This may not apply for the database that you are updating. In this case, leave the field empty. Note that if the property is set and the database is case-sensitive, the schema name must match the database’s schema name exactly. |
| put-db-record-statement-type | Specifies the type of SQL Statement to generate. Please refer to the database documentation for a description of the behavior of each operation. Please note that some Database Types may not support certain Statement Types. If ‘Use statement.type Attribute’ is chosen, then the value is taken from the statement.type attribute in the FlowFile. The ‘Use statement.type Attribute’ option is the only one that allows the ‘SQL’statement type. If ‘SQL’ is specified, the value of the field specified by the ‘Field Containing SQL’ property is expected to be a valid SQL statement on the target database, and will be executed as-is. |
| put-db-record-table-name | The name of the table that the statement should affect. Note that if the database is case-sensitive, the table name must match the database’s table name exactly. |
| put-db-record-translate-field-names | If true, the Processor will attempt to translate field names into the appropriate column names for the table specified. If false, the field names must match the column names exactly, or the column will not be updated |
| put-db-record-unmatched-column-behavior | If an incoming record does not have a field mapping for all of the database table’s columns, this property specifies how to handle the situation |
| put-db-record-unmatched-field-behavior | If an incoming record has a field that does not map to any of the database table’s columns, this property specifies how to handle the situation |
| put-db-record-update-keys | A comma-separated list of column names that uniquely identifies a row in the database for UPDATE statements. If the Statement Type is UPDATE and this property is not set, the table’s Primary Keys are used. In this case, if no Primary Key exists, the conversion to SQL will fail if Unmatched Column Behaviour is set to FAIL. This property is ignored if the Statement Type is INSERT |
| table-schema-cache-size | Specifies how many Table Schemas should be cached |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if the database cannot be updated and retrying the operation will also fail, such as an invalid query or an integrity constraint violation |
| retry | A FlowFile is routed to this relationship if the database cannot be updated but attempting the operation again may succeed |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| putdatabaserecord.error | If an error occurs during processing, the flow file will be routed to failure or retry, and this attribute will be populated with the cause of the error. |

## Use cases

|  |
| --- |
| Insert records into a database |

---
title: PutDatabricksSQL 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdatabrickssql.md
section: Loading & Unloading Data
---

# PutDatabricksSQL 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Submit a SQL Execution using Databricks REST API then write the JSON response to FlowFile Content. For high performance SELECT or INSERT queries use ExecuteSQL instead.

## Tags

databricks, openflow, sql

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Default Catalog | Default table catalog, some SQL statements such as ‘COPY INTO’ do not support using a default catalog |
| Default Schema | Default table schema, some SQL statements such as ‘COPY INTO’ do not support using a default schema |
| Record Writer | Specifies the Controller Service to use for writing results to a FlowFile. The Record Writer may use Inherit Schema to emulate the inferred schema behavior, i.e. an explicit schema need not be defined in the writer, and will be supplied by the same logic used to infer the schema from the column types. |
| SQL Warehouse ID | Warehouse ID used to execute SQL |
| SQL Warehouse Name | SQL Warehouse Name used to execute SQL, will search through all SQL Warehouses to find matching name. |
| Statement | SQL statement to execute |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| http.response | HTTP Response to SQL API Request |
| original | The original FlowFile is routed to this relationship when processing is successful. |
| records | Serialized SQL Records |

## Writes attributes

| Name | Description |
| --- | --- |
| statement.state | The final state of the executed SQL statement |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: PutDBFSFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdbfsfile.md
section: Loading & Unloading Data
---

# PutDBFSFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Write FlowFile content to DBFS.

## Tags

databricks, dbfs, openflow

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| DBFS File Path | DBFS file path e.g. /directory/file.txt |
| Databricks Client | Databricks Client Service. |
| Overwrite Policy | What action to take if a file already exists at the destination path. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: PutDistributedMapCache 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdistributedmapcache.md
section: Loading & Unloading Data
---

# PutDistributedMapCache 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Gets the content of a FlowFile and puts it to a distributed map cache, using a cache key computed from FlowFile attributes. If the cache already contains the entry and the cache update strategy is ‘keep original’ the entry is not replaced.’

## Tags

cache, distributed, map, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Cache Entry Identifier | A FlowFile attribute, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the cache key |
| Cache update strategy | Determines how the cache is updated if the cache already contains the entry |
| Distributed Cache Service | The Controller Service that is used to cache flow files |
| Max cache entry size | The maximum amount of data to put into cache |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that cannot be inserted into the cache will be routed to this relationship |
| success | Any FlowFile that is successfully inserted into cache will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| cached | All FlowFiles will have an attribute ‘cached’. The value of this attribute is true, is the FlowFile is cached, otherwise false. |

## See also

* [org.apache.nifi.processors.standard.FetchDistributedMapCache](fetchdistributedmapcache.md)

---
title: PutDropbox 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdropbox.md
section: Loading & Unloading Data
---

# PutDropbox 2025.10.9.21

## Bundle

org.apache.nifi | nifi-dropbox-processors-nar

## Description

Puts content to a Dropbox folder.

## Tags

dropbox, put, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Chunked Upload Size | Defines the size of a chunk. Used when a FlowFile ‘s size exceeds’Chunked Upload Threshold ‘and content is uploaded in smaller chunks. It is recommended to specify chunked upload size smaller than’Chunked Upload Threshold’ and as multiples of 4 MB. Maximum allowed value is 150 MB. |
| Chunked Upload Threshold | The maximum size of the content which is uploaded at once. FlowFiles larger than this threshold are uploaded in chunks. Maximum allowed value is 150 MB. |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the specified Dropbox folder. |
| Dropbox Credential Service | Controller Service used to obtain Dropbox credentials (App Key, App Secret, Access Token, Refresh Token). See controller service’s Additional Details for more information. |
| Filename | The full name of the file to upload. |
| Folder | The path of the Dropbox folder to upload files to. The folder will be created if it does not exist yet. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Dropbox for some reason are transferred to this relationship. |
| success | Files that have been successfully written to Dropbox are transferred to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| error.message | The error message returned by Dropbox |
| dropbox.id | The Dropbox identifier of the file |
| path | The folder path where the file is located |
| filename | The name of the file |
| dropbox.size | The size of the file |
| dropbox.timestamp | The server modified time of the file |
| dropbox.revision | Revision of the file |

## See also

* [org.apache.nifi.processors.dropbox.FetchDropbox](fetchdropbox.md)
* [org.apache.nifi.processors.dropbox.ListDropbox](listdropbox.md)

---
title: PutDynamoDB 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdynamodb.md
section: Loading & Unloading Data
---

# PutDynamoDB 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Puts a document from DynamoDB based on hash and range key. The table can have either hash and range or hash key alone. Currently the keys supported are string and number and value can be json document. In case of hash and range keys both key are required for the operation. The FlowFile content must be JSON. FlowFile content is mapped to the specified Json Document attribute in the DynamoDB item.

## Tags

AWS, Amazon, DynamoDB, Insert, Put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Batch items for each request (between 1 and 50) | The items to be retrieved in one batch |
| Character set of document | Character set of data in the document |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Hash Key Name | The hash key name of the item |
| Hash Key Value | The hash key value of the item |
| Hash Key Value Type | The hash key value type of the item |
| Json Document attribute | The Json document to be retrieved from the dynamodb item ( ‘s’ type in the schema) |
| Range Key Name | The range key name of the item |
| Range Key Value |  |
| Range Key Value Type | The range key value type of the item |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Table Name | The DynamoDB table name |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |
| unprocessed | FlowFiles are routed to unprocessed relationship when DynamoDB is not able to process all the items in the request. Typical reasons are insufficient table throughput capacity and exceeding the maximum bytes per request. Unprocessed FlowFiles can be retried with a new request. |

## Writes attributes

| Name | Description |
| --- | --- |
| dynamodb.key.error.unprocessed | DynamoDB unprocessed keys |
| dynmodb.range.key.value.error | DynamoDB range key error |
| dynamodb.key.error.not.found | DynamoDB key not found |
| dynamodb.error.exception.message | DynamoDB exception message |
| dynamodb.error.code | DynamoDB error code |
| dynamodb.error.message | DynamoDB error message |
| dynamodb.error.service | DynamoDB error service |
| dynamodb.error.retryable | DynamoDB error is retryable |
| dynamodb.error.request.id | DynamoDB error request id |
| dynamodb.error.status.code | DynamoDB error status code |
| dynamodb.item.io.error | IO exception message on creating item |

## See also

* [org.apache.nifi.processors.aws.dynamodb.DeleteDynamoDB](deletedynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.GetDynamoDB](getdynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDBRecord](putdynamodbrecord.md)

---
title: PutDynamoDBRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putdynamodbrecord.md
section: Loading & Unloading Data
---

# PutDynamoDBRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Inserts items into DynamoDB based on record-oriented data. The record fields are mapped into DynamoDB item fields, including partition and sort keys if set. Depending on the number of records the processor might execute the insert in multiple chunks in order to overcome DynamoDB’s limitation on batch writing. This might result partially processed FlowFiles in which case the FlowFile will be transferred to the “unprocessed” relationship with the necessary attribute to retry later without duplicating the already executed inserts.

## Tags

AWS, Amazon, DynamoDB, Insert, Put, Record

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Partition Key Attribute | Specifies the FlowFile attribute that will be used as the value of the partition key when using “Partition by attribute” partition key strategy. |
| Partition Key Field | Defines the name of the partition key field in the DynamoDB table. Partition key is also known as hash key. Depending on the “Partition Key Strategy” the field value might come from the incoming Record or a generated one. |
| Partition Key Strategy | Defines the strategy the processor uses to assign partition key value to the inserted Items. |
| Record Reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Sort Key Field | Defines the name of the sort key field in the DynamoDB table. Sort key is also known as range key. |
| Sort Key Strategy | Defines the strategy the processor uses to assign sort key to the inserted Items. |
| Table Name | The DynamoDB table name |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |
| unprocessed | FlowFiles are routed to unprocessed relationship when DynamoDB is not able to process all the items in the request. Typical reasons are insufficient table throughput capacity and exceeding the maximum bytes per request. Unprocessed FlowFiles can be retried with a new request. |

## Writes attributes

| Name | Description |
| --- | --- |
| dynamodb.chunks.processed | Number of chunks successfully inserted into DynamoDB. If not set, it is considered as 0 |
| dynamodb.key.error.unprocessed | DynamoDB unprocessed keys |
| dynmodb.range.key.value.error | DynamoDB range key error |
| dynamodb.key.error.not.found | DynamoDB key not found |
| dynamodb.error.exception.message | DynamoDB exception message |
| dynamodb.error.code | DynamoDB error code |
| dynamodb.error.message | DynamoDB error message |
| dynamodb.error.service | DynamoDB error service |
| dynamodb.error.retryable | DynamoDB error is retryable |
| dynamodb.error.request.id | DynamoDB error request id |
| dynamodb.error.status.code | DynamoDB error status code |
| dynamodb.item.io.error | IO exception message on creating item |

## See also

* [org.apache.nifi.processors.aws.dynamodb.DeleteDynamoDB](deletedynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.GetDynamoDB](getdynamodb.md)
* [org.apache.nifi.processors.aws.dynamodb.PutDynamoDB](putdynamodb.md)

---
title: PutElasticsearchJson 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putelasticsearchjson.md
section: Loading & Unloading Data
---

# PutElasticsearchJson 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

An Elasticsearch put processor that uses the official Elastic REST client libraries. Each FlowFile is treated as a document to be sent to the Elasticsearch _bulk API. Multiple FlowFiles can be batched together into each Request sent to Elasticsearch.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, index, json, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The preferred number of FlowFiles to send over in a single batch |
| Character Set | Specifies the character set of the document data. |
| Client Service | An Elasticsearch client service to use for running queries. |
| Dynamic Templates | The dynamic_templates for the document. Must be parsable as a JSON Object. Requires Elasticsearch 7+ |
| Identifier Attribute | The name of the FlowFile attribute containing the identifier for the document. If the Index Operation is “index”, this property may be left empty or evaluate to an empty value, in which case the document’s identifier will be auto-generated by Elasticsearch. For all other Index Operations, the attribute must evaluate to a non-empty value. |
| Index | The name of the index to use. |
| Index Operation | The type of the operation used to index (create, delete, index, update, upsert) |
| Log Error Responses | If this is enabled, errors will be logged to the NiFi logs at the error log level. Otherwise, they will only be logged if debug logging is enabled on NiFi as a whole. The purpose of this option is to give the user the ability to debug failed operations without having to turn on debug logging. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output Error Responses | If this is enabled, response messages from Elasticsearch marked as “error” will be output to the “error_responses” relationship. This does not impact the output of flowfiles to the “successful” or “errors” relationships |
| Script | The script for the document update/upsert. Only applies to Update/Upsert operations. Must be parsable as JSON Object. If left blank, the FlowFile content will be used for document update/upsert |
| Scripted Upsert | Whether to add the scripted_upsert flag to the Upsert Operation. If true, forces Elasticsearch to execute the Script whether or not the document exists, defaults to false. If the Upsert Document provided (from FlowFile content) will be empty, but sure to set the Client Service controller service’s Suppress Null and Empty Values to Never Suppress or no “upsert” doc will be, included in the request to Elasticsearch and the operation will not create a new document for the script to execute against, resulting in a “not_found” error |
| Treat Not Found as Success | If true, “not_found” Elasticsearch Document associated Records will be routed to the “successful” relationship, otherwise to the “errors” relationship. If Output Error Responses is “true” then “not_found” responses from Elasticsearch will be sent to the error_responses relationship. |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| errors | Record(s)/Flowfile(s) corresponding to Elasticsearch document(s) that resulted in an “error” (within Elasticsearch) will be routed here. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| original | All flowfiles that are sent to Elasticsearch without request failures go to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |
| successful | Record(s)/Flowfile(s) corresponding to Elasticsearch document(s) that did not result in an “error” (within Elasticsearch) will be routed here. |

## Writes attributes

| Name | Description |
| --- | --- |
| elasticsearch.put.error | The error message if there is an issue parsing the FlowFile, sending the parsed document to Elasticsearch or parsing the Elasticsearch response |
| elasticsearch.bulk.error | The _bulk response if there was an error during processing the document within Elasticsearch. |

## See also

* [org.apache.nifi.processors.elasticsearch.PutElasticsearchRecord](putelasticsearchrecord.md)

---
title: PutElasticsearchRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putelasticsearchrecord.md
section: Loading & Unloading Data
---

# PutElasticsearchRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

A record-aware Elasticsearch put processor that uses the official Elastic REST client libraries. Each Record within the FlowFile is converted into a document to be sent to the Elasticsearch _bulk APi. Multiple documents can be batched into each Request sent to Elasticsearch. Each document’s Bulk operation can be configured using Record Path expressions.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, index, json, put, record

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of records to send over in a single batch. |
| Client Service | An Elasticsearch client service to use for running queries. |
| Date Format | Specifies the format to use when writing Date fields. If not specified, the default format ‘yyyy-MM-dd’ is used. If specified, the value must match the Java Simple Date Format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/25/2017). |
| Dynamic Templates Record Path | A RecordPath pointing to a field in the record(s) that contains the dynamic_templates for the document. Field must be Map-type compatible (e.g. a Map or Record) or a String parsable into a JSON Object. Requires Elasticsearch 7+ |
| Group Results by Bulk Error Type | The errored records written to the “errors” relationship will be grouped by error type and the error related to the first record within the FlowFile added to the FlowFile as “elasticsearch.bulk.error”. If “Treat Not Found as Success” is “false” then records associated with “not_found” Elasticsearch document responses will also be send to the “errors” relationship. |
| ID Record Path | A record path expression to retrieve the ID field for use with Elasticsearch. If left blank the ID will be automatically generated by Elasticsearch. |
| Index | The name of the index to use. |
| Index Operation | The type of the operation used to index (create, delete, index, update, upsert) |
| Index Operation Record Path | A record path expression to retrieve the Index Operation field for use with Elasticsearch. If left blank the Index Operation will be determined using the main Index Operation property. |
| Index Record Path | A record path expression to retrieve the index field for use with Elasticsearch. If left blank the index will be determined using the main index property. |
| Log Error Responses | If this is enabled, errors will be logged to the NiFi logs at the error log level. Otherwise, they will only be logged if debug logging is enabled on NiFi as a whole. The purpose of this option is to give the user the ability to debug failed operations without having to turn on debug logging. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output Error Responses | If this is enabled, response messages from Elasticsearch marked as “error” will be output to the “error_responses” relationship. This does not impact the output of flowfiles to the “successful” or “errors” relationships |
| Record Reader | The record reader to use for reading incoming records from flowfiles. |
| Result Record Writer | The response from Elasticsearch will be examined for failed records and the failed records will be written to a record set with this record writer service and sent to the “errors” relationship. Successful records will be written to a record set with this record writer service and sent to the “successful” relationship. |
| Retain ID (Record Path) | Whether to retain the existing field used as the ID Record Path. |
| Retain Record Timestamp | Whether to retain the existing field used as the @timestamp Record Path. |
| Script Record Path | A RecordPath pointing to a field in the record(s) that contains the script for the document update/upsert. Only applies to Update/Upsert operations. Field must be Map-type compatible (e.g. a Map or a Record) or a String parsable into a JSON Object |
| Scripted Upsert Record Path | A RecordPath pointing to a field in the record(s) that contains the scripted_upsert boolean flag. Whether to add the scripted_upsert flag to the Upsert Operation. Forces Elasticsearch to execute the Script whether or not the document exists, defaults to false. If the Upsert Document provided (from FlowFile content) will be empty, but sure to set the Client Service controller service’s Suppress Null and Empty Values to Never Suppress or no “upsert” doc will be, included in the request to Elasticsearch and the operation will not create a new document for the script to execute against, resulting in a “not_found” error |
| Time Format | Specifies the format to use when writing Time fields. If not specified, the default format ‘HH:mm:ss’ is used. If specified, the value must match the Java Simple Date Format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Specifies the format to use when writing Timestamp fields. If not specified, the default format ‘yyyy-MM-dd HH:mm:ss’ is used. If specified, the value must match the Java Simple Date Format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/25/2017 18:04:15). |
| Timestamp Record Path | A RecordPath pointing to a field in the record(s) that contains the @timestamp for the document. If left blank the @timestamp will be determined using the main @timestamp property |
| Timestamp Value | The value to use as the @timestamp field (required for Elasticsearch Data Streams) |
| Treat Not Found as Success | If true, “not_found” Elasticsearch Document associated Records will be routed to the “successful” relationship, otherwise to the “errors” relationship. If Output Error Responses is “true” then “not_found” responses from Elasticsearch will be sent to the error_responses relationship. |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |
| Type Record Path | A record path expression to retrieve the type field for use with Elasticsearch. If left blank the type will be determined using the main type property. |

## Relationships

| Name | Description |
| --- | --- |
| errors | Record(s)/Flowfile(s) corresponding to Elasticsearch document(s) that resulted in an “error” (within Elasticsearch) will be routed here. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| original | All flowfiles that are sent to Elasticsearch without request failures go to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |
| successful | Record(s)/Flowfile(s) corresponding to Elasticsearch document(s) that did not result in an “error” (within Elasticsearch) will be routed here. |

## Writes attributes

| Name | Description |
| --- | --- |
| elasticsearch.put.error | The error message if there is an issue parsing the FlowFile records, sending the parsed documents to Elasticsearch or parsing the Elasticsearch response. |
| elasticsearch.put.error.count | The number of records that generated errors in the Elasticsearch _bulk API. |
| elasticsearch.put.success.count | The number of records that were successfully processed by the Elasticsearch _bulk API. |
| elasticsearch.bulk.error | The _bulk response if there was an error during processing the record within Elasticsearch. |

## See also

* [org.apache.nifi.processors.elasticsearch.PutElasticsearchJson](putelasticsearchjson.md)

---
title: PutEmail 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putemail.md
section: Loading & Unloading Data
---

# PutEmail 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends an e-mail to configured recipients for each incoming FlowFile

## Tags

email, notify, put, smtp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

true

## Properties

| Property | Description |
| --- | --- |
| Attach File | Specifies whether or not the FlowFile content should be attached to the email |
| BCC | The recipients to include in the BCC-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |
| CC | The recipients to include in the CC-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |
| Content Type | Mime Type used to interpret the contents of the email, such as text/plain or text/html |
| From | Specifies the Email address to use as the sender. Comma separated sequence of addresses following RFC822 syntax. |
| Include All Attributes In Message | Specifies whether or not all FlowFile attributes should be recorded in the body of the email message |
| Message | The body of the email message |
| Reply-To | The recipients that will receive the reply instead of the from (see RFC2822 §3.6.2).This feature is useful, for example, when the email is sent by a no-reply account. This field is optional. Comma separated sequence of addresses following RFC822 syntax. |
| SMTP Auth | Flag indicating whether authentication should be used |
| SMTP Hostname | The hostname of the SMTP host |
| SMTP Password | Password for the SMTP account |
| SMTP Port | The Port used for SMTP communications |
| SMTP Socket Factory | Socket Factory to use for SMTP Connection |
| SMTP TLS | Flag indicating whether Opportunistic TLS should be enabled using STARTTLS command |
| SMTP Username | Username for the SMTP account |
| SMTP X-Mailer Header | X-Mailer used in the header of the outgoing email |
| Subject | The email subject |
| To | The recipients to include in the To-Line of the email. Comma separated sequence of addresses following RFC822 syntax. |
| attribute-name-regex | A Regular Expression that is matched against all FlowFile attribute names. Any attribute whose name matches the regex will be added to the Email messages as a Header. If not specified, no FlowFile attributes will be added as headers. |
| authorization-mode | How to authorize sending email on the user’s behalf. |
| email-ff-content-as-message | Specifies whether or not the FlowFile content should be the message of the email. If true, the ‘Message’ property is ignored. |
| input-character-set | Specifies the character set of the FlowFile contents for reading input FlowFile contents to generate the message body or as an attachment to the message. If not set, UTF-8 will be the default value. |
| oauth2-access-token-provider | OAuth2 service that can provide access tokens. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that fail to send will be routed to this relationship |
| success | FlowFiles that are successfully sent will be routed to this relationship |

---
title: PutFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putfile.md
section: Loading & Unloading Data
---

# PutFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Writes the contents of a FlowFile to the local file system

## Tags

archive, copy, files, filesystem, local, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the output directory |
| Create Missing Directories | If true, then missing destination directories will be created. If false, flowfiles are penalized and sent to failure. |
| Directory | The directory to which files should be written. You may use expression language such as /aa/bb/${path} |
| Group | Sets the group on the output file to the value of this attribute. You may also use expression language such as ${file.group}. |
| Last Modified Time | Sets the lastModifiedTime on the output file to the value of this attribute. Format must be yyyy-MM-dd ‘T’HH:mm:ssZ. You may also use expression language such as ${file.lastModifiedTime}. |
| Maximum File Count | Specifies the maximum number of files that can exist in the output directory |
| Owner | Sets the owner on the output file to the value of this attribute. You may also use expression language such as ${file.owner}. Note on many operating systems Nifi must be running as a super-user to have the permissions to set the file owner. |
| Permissions | Sets the permissions on the output file to the value of this attribute. Format must be either UNIX rwxrwxrwx with a - in place of denied permissions (e.g. rw-r–r–) or an octal number (e.g. 644). You may also use expression language such as ${file.permissions}. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| write filesystem | Provides operator the ability to write to any file that NiFi has access to. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to the output directory for some reason are transferred to this relationship |
| success | Files that have been successfully written to the output directory are transferred to this relationship |

## See also

* [org.apache.nifi.processors.standard.FetchFile](fetchfile.md)
* [org.apache.nifi.processors.standard.GetFile](getfile.md)

---
title: PutFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putftp.md
section: Loading & Unloading Data
---

# PutFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends FlowFiles to an FTP Server

## Tags

archive, copy, egress, files, ftp, put, remote

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The maximum number of FlowFiles to send in a single connection |
| Conflict Resolution | Determines how to handle the problem of filename collisions |
| Connection Mode | The FTP Connection Mode |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Create Directory | Specifies whether or not the remote directory should be created if it does not exist. |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Dot Rename | If true, then the filename of the sent file is prepended with a “.” and then renamed back to the original once the file is completely sent. Otherwise, there is no rename. This property is ignored if the Temporary Filename property is set. |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Internal Buffer Size | Set the internal buffer size for buffered data streams |
| Last Modified Time | The lastModifiedTime to assign to the file after transferring it. If not set, the lastModifiedTime will not be changed. Format must be yyyy-MM-dd ‘T’HH:mm:ssZ. You may also use expression language such as ${file.lastModifiedTime}. If the value is invalid, the processor will not be invalid but will fail to change lastModifiedTime of the file. |
| Password | Password for the user account |
| Permissions | The permissions to assign to the file after transferring it. Format must be either UNIX rwxrwxrwx with a - in place of denied permissions (e.g. rw-r–r–) or an octal number (e.g. 644). If not set, the permissions will not be changed. You may also use expression language such as ${file.permissions}. If the value is invalid, the processor will not be invalid but will fail to change permissions of the file. |
| Port | The port that the remote system is listening on for file transfers |
| Reject Zero-Byte Files | Determines whether or not Zero-byte files should be rejected without attempting to transfer |
| Remote Path | The path on the remote system from which to pull or push files |
| Temporary Filename | If set, the filename of the sent file will be equal to the value specified during the transfer and after successful completion will be renamed to the original filename. If this value is set, the Dot Rename property is ignored. |
| Transfer Mode | The FTP Transfer Mode |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| ftp-use-utf8 | Tells the client to use UTF-8 encoding when processing files and filenames. If set to true, the server must also support UTF-8 encoding. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the remote system; failure is usually looped back to this processor |
| reject | FlowFiles that were rejected by the destination system |
| success | FlowFiles that are successfully sent will be routed to success |

## See also

* [org.apache.nifi.processors.standard.GetFTP](getftp.md)

---
title: PutGCSObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putgcsobject.md
section: Loading & Unloading Data
---

# PutGCSObject 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Writes the contents of a FlowFile as an object in a Google Cloud Storage.

## Tags

archive, gcs, google, google cloud, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File Resource Service | File Resource Service providing access to the local resource to be transferred |
| GCP Credentials Provider Service | The Controller Service used to obtain Google Cloud Platform credentials. |
| Resource Transfer Source | The source of the content to be transferred |
| gcp-project-id | Google Cloud Project ID |
| gcp-retry-count | How many retry attempts should be made before routing to the failure relationship. |
| gcs-bucket | Bucket of the object. |
| gcs-content-disposition-type | Type of RFC-6266 Content Disposition to be attached to the object |
| gcs-content-type | Content Type for the file, i.e. text/plain |
| gcs-key | Name of the object. |
| gcs-object-acl | Access Control to be attached to the object uploaded. Not providing this will revert to bucket defaults. |
| gcs-object-crc32c | CRC32C Checksum (encoded in Base64, big-Endian order) of the file for server-side validation. |
| gcs-overwrite-object | If false, the upload to GCS will succeed only if the object does not exist. |
| gcs-server-side-encryption-key | An AES256 Encryption Key (encoded in base64) for server-side encryption of the object. |
| gzip.content.enabled | Signals to the GCS Blob Writer whether GZIP compression during transfer is desired. False means do not gzip and can boost performance in many cases. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| storage-api-url | Overrides the default storage URL. Configuring an alternative Storage API URL also overrides the HTTP Host header on requests as described in the Google documentation for Private Service Connections. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to this relationship if the Google Cloud Storage operation fails. |
| success | FlowFiles are routed to this relationship after a successful Google Cloud Storage operation. |

## Writes attributes

| Name | Description |
| --- | --- |
| gcs.bucket | Bucket of the object. |
| gcs.key | Name of the object. |
| gcs.size | Size of the object. |
| gcs.cache.control | Data cache control of the object. |
| gcs.component.count | The number of components which make up the object. |
| gcs.content.disposition | The data content disposition of the object. |
| gcs.content.encoding | The content encoding of the object. |
| gcs.content.language | The content language of the object. |
| mime.type | The MIME/Content-Type of the object |
| gcs.crc32c | The CRC32C checksum of object’s data, encoded in base64 in big-endian order. |
| gcs.create.time | The creation time of the object (milliseconds) |
| gcs.update.time | The last modification time of the object (milliseconds) |
| gcs.encryption.algorithm | The algorithm used to encrypt the object. |
| gcs.encryption.sha256 | The SHA256 hash of the key used to encrypt the object |
| gcs.etag | The HTTP 1.1 Entity tag for the object. |
| gcs.generated.id | The service-generated for the object |
| gcs.generation | The data generation of the object. |
| gcs.md5 | The MD5 hash of the object’s data encoded in base64. |
| gcs.media.link | The media download link to the object. |
| gcs.metageneration | The metageneration of the object. |
| gcs.owner | The owner (uploader) of the object. |
| gcs.owner.type | The ACL entity type of the uploader of the object. |
| gcs.uri | The URI of the object as a string. |

## See also

* [org.apache.nifi.processors.gcp.storage.DeleteGCSObject](deletegcsobject.md)
* [org.apache.nifi.processors.gcp.storage.FetchGCSObject](fetchgcsobject.md)
* [org.apache.nifi.processors.gcp.storage.ListGCSBucket](listgcsbucket.md)

---
title: PutGoogleDrive 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putgoogledrive.md
section: Loading & Unloading Data
---

# PutGoogleDrive 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Writes the contents of a FlowFile as a file in Google Drive.

## Tags

drive, google, put, storage

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| chunked-upload-size | Defines the size of a chunk. Used when a FlowFile ‘s size exceeds’Chunked Upload Threshold’ and content is uploaded in smaller chunks. Minimum allowed chunk size is 256 KB, maximum allowed chunk size is 1 GB. |
| chunked-upload-threshold | The maximum size of the content which is uploaded at once. FlowFiles larger than this threshold are uploaded in chunks. |
| conflict-resolution-strategy | Indicates what should happen when a file with the same name already exists in the specified Google Drive folder. |
| connect-timeout | Maximum wait time for connection to Google Drive service. |
| file-name | The name of the file to upload to the specified Google Drive folder. |
| folder-id | The ID of the shared folder. Please see Additional Details to set up access to Google Drive and obtain Folder ID. |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |
| read-timeout | Maximum wait time for response from Google Drive service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to Google Drive for some reason are transferred to this relationship. |
| success | Files that have been successfully written to Google Drive are transferred to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| drive.id | The id of the file |
| filename | The name of the file |
| mime.type | The MIME type of the file |
| drive.size | The size of the file. Set to 0 when the file size is not available (e.g. externally stored files). |
| drive.size.available | Indicates if the file size is known / available |
| drive.timestamp | The last modified time or created time (whichever is greater) of the file. The reason for this is that the original modified date of a file is preserved when uploaded to Google Drive. ‘Created time’ takes the time when the upload occurs. However uploaded files can still be modified later. |
| drive.created.time | The file’s creation time |
| drive.modified.time | The file’s last modification time |
| error.code | The error code returned by Google Drive |
| error.message | The error message returned by Google Drive |

## See also

* [org.apache.nifi.processors.gcp.drive.FetchGoogleDrive](fetchgoogledrive.md)
* [org.apache.nifi.processors.gcp.drive.ListGoogleDrive](listgoogledrive.md)

---
title: PutGridFS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putgridfs.md
section: Loading & Unloading Data
---

# PutGridFS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Writes a file to a GridFS bucket.

## Tags

file, gridfs, mongo, put, store

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gridfs-bucket-name | The GridFS bucket where the files will be stored. If left blank, it will use the default value ‘fs’ that the MongoDB client driver uses. |
| gridfs-client-service | The MongoDB client service to use for database connections. |
| gridfs-database-name | The name of the database to use |
| gridfs-file-name | The name of the file in the bucket that is the target of this processor. GridFS file names do not include path information because GridFS does not sort files into folders within a bucket. |
| putgridfs-chunk-size | Controls the maximum size of each chunk of a file uploaded into GridFS. |
| putgridfs-enforce-uniqueness | When enabled, this option will ensure that uniqueness is enforced on the bucket. It will do so by creating a MongoDB index that matches your selection. It should ideally be configured once when the bucket is created for the first time because it could take a long time to build on an existing bucket wit a lot of data. |
| putgridfs-hash-attribute | If uniquness enforcement is enabled and the file hash is part of the constraint, this must be set to an attribute that exists on all incoming flowfiles. |
| putgridfs-properties-prefix | Attributes that have this prefix will be added to the file stored in GridFS as metadata. |

## Relationships

| Name | Description |
| --- | --- |
| duplicate | Flowfiles that fail the duplicate check are sent to this relationship. |
| failure | When there is a failure processing the flowfile, it goes to this relationship. |
| success | When the operation succeeds, the flowfile is sent to this relationship. |

---
title: PutHubSpot 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/puthubspot.md
section: Loading & Unloading Data
---

# PutHubSpot 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-hubspot-processors-nar

## Description

Upsert a HubSpot object.

## Tags

Preview, hubspot

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Associated Object ID Property | Target HubSpot property used to uniquely identify the object to associate to from the configured object. |
| Associated Object ID Value | Target HubSpot property value for the ‘Associated Object ID Property’ to associate to from the configured object. |
| Associated Object Type | Target HubSpot object type to associate to from the configured object. |
| Association Type ID | The HubSpot defined association id from the ‘Object ID Value’ to the ‘Associated Object ID Value’. |
| HubSpot Service | HubSpot Client Service. |
| Inverse Association Type ID | The HubSpot defined association id from the ‘Associated Object ID Value’ to the ‘Object ID Value’. |
| Missing HubSpot Property Policy | What to action to take if HubSpot does not have a matching property. |
| Object ID Property | HubSpot property used to uniquely identify the object. |
| Object ID Value | Matching HubSpot property value to search for. |
| Object Override Properties | Comma-delimited list of NiFi attributes, which if exist, will be added as object properties. Any existing properties in HubSpot will be overridden. |
| Object Set Properties | Comma-delimited list of NiFi attributes, which if exist, will be added as object properties if the current object property in HubSpot is empty. |
| Object Type | HubSpot object type |

## Relationships

| Name | Description |
| --- | --- |
| failure | HubSpot fail relationship |
| retry | HubSpot retry relationship. FlowFiles that failed to process due to a server timeout or rate limit related error. FlowFiles routed here should be routed back into the processor. |
| success | HubSpot success relationship |

## See also

* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotObject](gethubspotobject.md)
* [com.snowflake.openflow.runtime.processors.hubspot.GetHubSpotSchema](gethubspotschema.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListArchivedHubSpotData](listarchivedhubspotdata.md)
* [com.snowflake.openflow.runtime.processors.hubspot.ListHubSpotObjects](listhubspotobjects.md)

---
title: PutIcebergTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/puticebergtable.md
section: Loading & Unloading Data
---

# PutIcebergTable 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-iceberg-processors-nar

## Description

Store records in Iceberg using configurable Catalog for managing namespaces and tables.

## Tags

analytics, iceberg, openflow, parquet, polaris, s3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Iceberg Catalog | Provider Service for Iceberg Catalog |
| Iceberg Writer | Provider Service for Iceberg Row Writers responsible for producing formatted Iceberg Data Files |
| Namespace | Iceberg Namespace containing Tables |
| Record Reader | Record Reader for incoming FlowFiles |
| Table Name | Iceberg Table Name |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles not transferred to Iceberg |
| success | FlowFiles transferred to Iceberg |

---
title: PutKinesisFirehose 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putkinesisfirehose.md
section: Loading & Unloading Data
---

# PutKinesisFirehose 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Sends the contents to a specified Amazon Kinesis Firehose. In order to send data to firehose, the firehose delivery stream name has to be specified.

## Tags

amazon, aws, firehose, kinesis, put, stream

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Amazon Kinesis Firehose Delivery Stream Name | The name of kinesis firehose delivery stream |
| Batch Size | Batch size for messages (1-500). |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Max message buffer size | Max message buffer |
| Region |  |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| aws.kinesis.firehose.error.message | Error message on posting message to AWS Kinesis Firehose |
| aws.kinesis.firehose.error.code | Error code for the message when posting to AWS Kinesis Firehose |
| aws.kinesis.firehose.record.id | Record id of the message posted to Kinesis Firehose |

---
title: PutKinesisStream 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putkinesisstream.md
section: Loading & Unloading Data
---

# PutKinesisStream 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Sends the contents to a specified Amazon Kinesis. In order to send data to Kinesis, the stream name has to be specified.

## Tags

amazon, aws, kinesis, put, stream

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Max Message Buffer Size | Max message buffer size defined with standard data size units |
| Message Batch Size | Batch size for messages (1-500). |
| Region |  |
| Stream Name | The name of Kinesis Stream |
| Stream Partition Key | The partition key attribute. If it is not set, a random value is used |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| aws.kinesis.error.message | Error message on posting message to AWS Kinesis |
| aws.kinesis.error.code | Error code for the message when posting to AWS Kinesis |
| aws.kinesis.sequence.number | Sequence number for the message when posting to AWS Kinesis |
| aws.kinesis.shard.id | Shard id of the message posted to AWS Kinesis |

## See also

* [org.apache.nifi.processors.aws.kinesis.stream.ConsumeKinesisStream](consumekinesisstream.md)

---
title: PutLambda 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putlambda.md
section: Loading & Unloading Data
---

# PutLambda 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Sends the contents to a specified Amazon Lambda Function. The AWS credentials used for authentication must have permissions execute the Lambda function (lambda:InvokeFunction).The FlowFile content must be JSON.

## Tags

amazon, aws, lambda, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Amazon Lambda Name | The Lambda Function Name |
| Amazon Lambda Qualifier (version) | The Lambda Function Version |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| aws.lambda.result.function.error | Function error message in result on posting message to AWS Lambda |
| aws.lambda.result.status.code | Status code in the result for the message when posting to AWS Lambda |
| aws.lambda.result.payload | Payload in the result from AWS Lambda |
| aws.lambda.result.log | Log in the result of the message posted to Lambda |
| aws.lambda.exception.message | Exception message on invoking from AWS Lambda |
| aws.lambda.exception.cause | Exception cause on invoking from AWS Lambda |
| aws.lambda.exception.error.code | Exception error code on invoking from AWS Lambda |
| aws.lambda.exception.request.id | Exception request id on invoking from AWS Lambda |
| aws.lambda.exception.status.code | Exception status code on invoking from AWS Lambda |

---
title: PutMongo 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putmongo.md
section: Loading & Unloading Data
---

# PutMongo 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Writes the contents of a FlowFile to MongoDB

## Tags

insert, mongodb, put, update, write

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the data is encoded |
| Mode | Indicates whether the processor should insert or update content |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| Update Method | MongoDB method for running collection update operations, such as updateOne or updateMany |
| Update Query Key | One or more comma-separated document key names used to build the update query criteria, such as _id |
| Upsert | When true, inserts a document if no document matches the update query criteria; this property is valid only when using update mode, otherwise it is ignored |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |
| put-mongo-update-mode | Choose an update mode. You can either supply a JSON document to use as a direct replacement or specify a document that contains update operators like $set, $unset, and $inc. When Operators mode is enabled, the flowfile content is expected to be the operator part for example: {$set:{“key”: “value”},$inc:{“count”:1234}} and the update query will come from the configured Update Query property. |
| putmongo-update-query | Specify a full MongoDB query to be used for the lookup query to do an update/upsert. NOTE: this field is ignored if the ‘Update Query Key’ value is not empty. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be written to MongoDB are routed to this relationship |
| success | All FlowFiles that are written to MongoDB are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mongo.put.update.match.count | The match count from result if update/upsert is performed, otherwise not set. |
| mongo.put.update.modify.count | The modify count from result if update/upsert is performed, otherwise not set. |
| mongo.put.upsert.id | The ‘_id’ hex value if upsert is performed, otherwise not set. |

---
title: PutMongoBulkOperations 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putmongobulkoperations.md
section: Loading & Unloading Data
---

# PutMongoBulkOperations 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

Writes the contents of a FlowFile to MongoDB as bulk-update

## Tags

bulk, insert, mongodb, put, update, write

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the data is encoded |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| Ordered | Ordered execution of bulk-writes and break on error - otherwise arbitrary order and continue on error |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be written to MongoDB are routed to this relationship |
| success | All FlowFiles that are written to MongoDB are routed to this relationship |

---
title: PutMongoRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putmongorecord.md
section: Loading & Unloading Data
---

# PutMongoRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

This processor is a record-aware processor for inserting/upserting data into MongoDB. It uses a configured record reader and schema to read an incoming record set from the body of a flowfile and then inserts/upserts batches of those records into a configured MongoDB collection. This processor does not support deletes. The number of documents to insert/upsert at a time is controlled by the “Batch Size” configuration property. This value should be set to a reasonable size to ensure that MongoDB is not overloaded with too many operations at once.

## Tags

insert, mongodb, put, record, update, upsert

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| bypass-validation | Enable or disable bypassing document schema validation during insert or update operations. Bypassing document validation is a Privilege Action in MongoDB. Enabling this property can result in authorization errors for users with limited privileges. |
| insert_count | The number of records to group together for one single insert/upsert operation against MongoDB. |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |
| ordered | Perform ordered or unordered operations |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |
| update-key-fields | Comma separated list of fields based on which to identify documents that need to be updated. If this property is set NiFi will attempt an upsert operation on all documents. If this property is not set all documents will be inserted. |
| update-mode | Choose between updating a single document or multiple documents per incoming record. |

## Relationships

| Name | Description |
| --- | --- |
| failure | All FlowFiles that cannot be written to MongoDB are routed to this relationship |
| success | All FlowFiles that are written to MongoDB are routed to this relationship |

---
title: PutRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putrecord.md
section: Loading & Unloading Data
---

# PutRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

The PutRecord processor uses a specified RecordReader to input (possibly multiple) records from an incoming flow file, and sends them to a destination specified by a Record Destination Service (i.e. record sink).

## Tags

put, record, sink

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| put-record-include-zero-record-results | If no records are read from the incoming FlowFile, this property specifies whether or not an empty record set will be transmitted. The original FlowFile will still be routed to success, but if no transmission occurs, no provenance SEND event will be generated. |
| put-record-reader | Specifies the Controller Service to use for reading incoming data |
| put-record-sink | Specifies the Controller Service to use for writing out the query result records to some destination. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if the records could not be transmitted and retrying the operation will also fail |
| retry | The original FlowFile is routed to this relationship if the records could not be transmitted but attempting the operation again may succeed |
| success | The original FlowFile will be routed to this relationship if the records were transmitted successfully |

---
title: PutRedisHashRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putredishashrecord.md
section: Loading & Unloading Data
---

# PutRedisHashRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-redis-nar

## Description

Puts record field data into Redis using a specified hash value, which is determined by a RecordPath to a field in each record containing the hash value. The record fields and values are stored as key/value pairs associated by the hash value. NOTE: Neither the evaluated hash value nor any of the field values can be null. If the hash value is null, the FlowFile will be routed to failure. For each of the field values, if the value is null that field will be not set in Redis.

## Tags

hash, put, record, redis

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| charset | Specifies the character set to use when storing record field values as strings. All fields will be converted to strings using this character set before being stored in Redis. |
| data-record-path | This property denotes a RecordPath that will be evaluated against each incoming Record and the Record that results from evaluating the RecordPath will be sent to Redis instead of sending the entire incoming Record. The property defaults to the root ‘/’ which corresponds to a ‘flat’ record (all fields/values at the top level of the Record. |
| hash-value-record-path | Specifies a RecordPath to evaluate against each Record in order to determine the hash value associated with all the record fields/values (see ‘hset’ in Redis documentation for more details). The RecordPath must point at exactly one field or an error will occur. |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |
| redis-connection-pool |  |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles containing Records with processing errors will be routed to this relationship |
| success | FlowFiles having all Records stored in Redis will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| redis.success.record.count | Number of records written to Redis |

---
title: PutS3Object 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/puts3object.md
section: Loading & Unloading Data
---

# PutS3Object 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Writes the contents of a FlowFile as an S3 Object to an Amazon S3 Bucket.

## Tags

AWS, Amazon, Archive, Put, S3

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Bucket | The S3 Bucket to interact with |
| Cache Control | Sets the Cache-Control HTTP header indicating the caching directives of the associated object. Multiple directives are comma-separated. |
| Canned ACL | Amazon Canned ACL for an object, one of: BucketOwnerFullControl, BucketOwnerRead, LogDeliveryWrite, AuthenticatedRead, PublicReadWrite, PublicRead, Private; will be ignored if any other ACL/permission/owner property is specified |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Content Disposition | Sets the Content-Disposition HTTP header indicating if the content is intended to be displayed inline or should be downloaded. Possible values are ‘inline’ or ‘attachment’. If this property is not specified, object ‘s content-disposition will be set to filename. When’ attachment ‘is selected,’; filename=’plus object key are automatically appended to form final value’ attachment; filename=”filename.jpg”’. |
| Content Type | Sets the Content-Type HTTP header indicating the type of content stored in the associated object. The value of this header is a standard MIME type. AWS S3 Java client will attempt to determine the correct content type if one hasn’t been set yet. Users are responsible for ensuring a suitable content type is set when uploading streams. If no content type is provided and cannot be determined by the filename, the default content type “application/octet-stream” will be used. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Encryption Service | Specifies the Encryption Service Controller used to configure requests. PutS3Object: For backward compatibility, this value is ignored when ‘Server Side Encryption’ is set. FetchS3Object: Only needs to be configured in case of Server-side Customer Key, Client-side KMS and Client-side Customer Key encryptions. |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Expiration Time Rule |  |
| File Resource Service | File Resource Service providing access to the local resource to be transferred |
| FullControl User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Full Control for an object |
| Multipart Part Size | Specifies the part size for use when the PutS3Multipart Upload API is used. Flow files will be broken into chunks of this size for the upload process, but the last part sent can be smaller since it is not padded. The valid range is 50MB to 5GB. |
| Multipart Threshold | Specifies the file size threshold for switch from the PutS3Object API to the PutS3MultipartUpload API. Flow files bigger than this limit will be sent using the stateful multipart process. The valid range is 50MB to 5GB. |
| Multipart Upload AgeOff Interval | Specifies the interval at which existing multipart uploads in AWS S3 will be evaluated for ageoff. When processor is triggered it will initiate the ageoff evaluation if this interval has been exceeded. |
| Multipart Upload Max Age Threshold | Specifies the maximum age for existing multipart uploads in AWS S3. When the ageoff process occurs, any upload older than this threshold will be aborted. |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Object Tags Prefix | Specifies the prefix which would be scanned against the incoming FlowFile ‘s attributes and the matching attribute’s name and value would be considered as the outgoing S3 object ‘s Tag name and Tag value respectively. For Ex: If the incoming FlowFile carries the attributes tagS3country, tagS3PII, the tag prefix to be specified would be’ tagS3’ |
| Owner | The Amazon ID to use for the object’s owner |
| Read ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to read the Access Control List for an object |
| Read Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Read Access for an object |
| Region | The AWS Region to connect to. |
| Remove Tag Prefix | If set to ‘True’, the value provided for ‘Object Tags Prefix’ will be removed from the attribute(s) and then considered as the Tag name. For ex: If the incoming FlowFile carries the attributes tagS3country, tagS3PII and the prefix is set to ‘tagS3’ then the corresponding tag values would be ‘country’ and ‘PII’ |
| Resource Transfer Source | The source of the content to be transferred |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Server Side Encryption | Specifies the algorithm used for server side encryption. |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Storage Class |  |
| Temporary Directory Multipart State | Directory in which, for multipart uploads, the processor will locally save the state tracking the upload ID and parts uploaded which must both be provided to complete the upload. |
| Use Chunked Encoding | Enables / disables chunked encoding for upload requests. Set it to false only if your endpoint does not support chunked uploading. |
| Use Path Style Access | Path-style access can be enforced by setting this property to true. Set it to true if your endpoint does not support virtual-hosted-style requests, only path-style requests. |
| Write ACL User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have permissions to change the Access Control List for an object |
| Write Permission User List | A comma-separated list of Amazon User ID’s or E-mail addresses that specifies who should have Write Access for an object |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| s3.url | The URL that can be used to access the S3 object |
| s3.bucket | The S3 bucket where the Object was put in S3 |
| s3.key | The S3 key within where the Object was put in S3 |
| s3.contenttype | The S3 content type of the S3 Object that put in S3 |
| s3.version | The version of the S3 Object that was put to S3 |
| s3.exception | The class name of the exception thrown during processor execution |
| s3.additionalDetails | The S3 supplied detail from the failed operation |
| s3.statusCode | The HTTP error code (if available) from the failed operation |
| s3.errorCode | The S3 moniker of the failed operation |
| s3.errorMessage | The S3 exception message from the failed operation |
| s3.etag | The ETag of the S3 Object |
| s3.contentdisposition | The content disposition of the S3 Object that put in S3 |
| s3.cachecontrol | The cache-control header of the S3 Object |
| s3.uploadId | The uploadId used to upload the Object to S3 |
| s3.expiration | A human-readable form of the expiration date of the S3 object, if one is set |
| s3.sseAlgorithm | The server side encryption algorithm of the object |
| s3.usermetadata | A human-readable form of the User Metadata of the S3 object, if any was set |
| s3.encryptionStrategy | The name of the encryption strategy, if any was set |

## See also

* [org.apache.nifi.processors.aws.s3.CopyS3Object](copys3object.md)
* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.TagS3Object](tags3object.md)

---
title: PutSalesforceObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsalesforceobject.md
section: Loading & Unloading Data
---

# PutSalesforceObject 2025.10.9.21

## Bundle

org.apache.nifi | nifi-salesforce-nar

## Description

Creates new records for the specified Salesforce sObject. The type of the Salesforce object must be set in the input flowfile ‘s’ objectType’ attribute. This processor cannot update existing records.

## Tags

put, salesforce, sobject

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| oauth2-access-token-provider | Service providing OAuth2 Access Tokens for authenticating using the HTTP Authorization Header |
| read-timeout | Maximum time allowed for reading a response from the Salesforce REST API |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |
| salesforce-api-version | The version number of the Salesforce REST API appended to the URL after the services/data path. See Salesforce documentation for supported versions |
| salesforce-url | The URL of the Salesforce instance including the domain without additional path information, such as <https://MyDomainName.my.salesforce.com> |

## Relationships

| Name | Description |
| --- | --- |
| failure | For FlowFiles created as a result of an execution error. |
| success | For FlowFiles created as a result of a successful execution. |

## Writes attributes

| Name | Description |
| --- | --- |
| error.message | The error message returned by Salesforce. |

## See also

* [org.apache.nifi.processors.salesforce.QuerySalesforceObject](querysalesforceobject.md)

---
title: PutSFTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsftp.md
section: Loading & Unloading Data
---

# PutSFTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends FlowFiles to an SFTP Server

## Tags

archive, copy, egress, files, put, remote, sftp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Algorithm Negotiation | Configuration strategy for SSH algorithm negotiation |
| Batch Size | The maximum number of FlowFiles to send in a single connection |
| Ciphers Allowed | A comma-separated list of Ciphers allowed for SFTP connections. Leave unset to allow all. Available options are: 3des-cbc, aes128-cbc, aes128-ctr, [aes128-gcm@openssh.com](mailto:aes128-gcm%40openssh.com), aes192-cbc, aes192-ctr, aes256-cbc, aes256-ctr, [aes256-gcm@openssh.com](mailto:aes256-gcm%40openssh.com), arcfour128, arcfour256, blowfish-cbc, [chacha20-poly1305@openssh.com](mailto:chacha20-poly1305%40openssh.com), none |
| Conflict Resolution | Determines how to handle the problem of filename collisions |
| Connection Timeout | Amount of time to wait before timing out while creating a connection |
| Create Directory | Specifies whether or not the remote directory should be created if it does not exist. |
| Data Timeout | When transferring a file between the local and remote system, this value specifies how long is allowed to elapse without any data being transferred between systems |
| Disable Directory Listing | If set to ‘true’, directory listing is not performed prior to create missing directories. By default, this processor executes a directory listing command to see target directory existence before creating missing directories. However, there are situations that you might need to disable the directory listing such as the following. Directory listing might fail with some permission setups (e.g. chmod 100) on a directory. Also, if any other SFTP client created the directory after this processor performed a listing and before a directory creation request by this processor is finished, then an error is returned because the directory already exists. |
| Dot Rename | If true, then the filename of the sent file is prepended with a “.” and then renamed back to the original once the file is completely sent. Otherwise, there is no rename. This property is ignored if the Temporary Filename property is set. |
| Host Key File | If supplied, the given file will be used as the Host Key; otherwise, if ‘Strict Host Key Checking’ property is applied (set to true) then uses the ‘known_hosts’ and ‘known_hosts2’ files from ~/.ssh directory else no host key file will be used |
| Hostname | The fully qualified hostname or IP address of the remote system |
| Key Algorithms Allowed | A comma-separated list of Key Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: ecdsa-sha2-nistp256, [ecdsa-sha2-nistp256-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp256-cert-v01%40openssh.com), ecdsa-sha2-nistp384, [ecdsa-sha2-nistp384-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp384-cert-v01%40openssh.com), ecdsa-sha2-nistp521, [ecdsa-sha2-nistp521-cert-v01@openssh.com](mailto:ecdsa-sha2-nistp521-cert-v01%40openssh.com), rsa-sha2-256, [rsa-sha2-256-cert-v01@openssh.com](mailto:rsa-sha2-256-cert-v01%40openssh.com), rsa-sha2-512, [rsa-sha2-512-cert-v01@openssh.com](mailto:rsa-sha2-512-cert-v01%40openssh.com), [sk-ecdsa-sha2-nistp256@openssh.com](mailto:sk-ecdsa-sha2-nistp256%40openssh.com), [sk-ssh-ed25519@openssh.com](mailto:sk-ssh-ed25519%40openssh.com), ssh-dss, [ssh-dss-cert-v01@openssh.com](mailto:ssh-dss-cert-v01%40openssh.com), ssh-ed25519, [ssh-ed25519-cert-v01@openssh.com](mailto:ssh-ed25519-cert-v01%40openssh.com), ssh-rsa, [ssh-rsa-cert-v01@openssh.com](mailto:ssh-rsa-cert-v01%40openssh.com) |
| Key Exchange Algorithms Allowed | A comma-separated list of Key Exchange Algorithms allowed for SFTP connections. Leave unset to allow all. Available options are: curve25519-sha256, [curve25519-sha256@libssh.org](mailto:curve25519-sha256%40libssh.org), curve448-sha512, diffie-hellman-group-exchange-sha1, diffie-hellman-group-exchange-sha256, diffie-hellman-group1-sha1, diffie-hellman-group14-sha1, diffie-hellman-group14-sha256, diffie-hellman-group15-sha512, diffie-hellman-group16-sha512, diffie-hellman-group17-sha512, diffie-hellman-group18-sha512, ecdh-sha2-nistp256, ecdh-sha2-nistp384, ecdh-sha2-nistp521, mlkem1024nistp384-sha384, mlkem768nistp256-sha256, mlkem768x25519-sha256, sntrup761x25519-sha512, [sntrup761x25519-sha512@openssh.com](mailto:sntrup761x25519-sha512%40openssh.com) |
| Last Modified Time | The lastModifiedTime to assign to the file after transferring it. If not set, the lastModifiedTime will not be changed. Format must be yyyy-MM-dd ‘T’HH:mm:ssZ. You may also use expression language such as ${file.lastModifiedTime}. If the value is invalid, the processor will not be invalid but will fail to change lastModifiedTime of the file. |
| Message Authentication Codes Allowed | A comma-separated list of Message Authentication Codes allowed for SFTP connections. Leave unset to allow all. Available options are: hmac-md5, hmac-md5-96, hmac-sha1, hmac-sha1-96, [hmac-sha1-etm@openssh.com](mailto:hmac-sha1-etm%40openssh.com), hmac-sha2-256, [hmac-sha2-256-etm@openssh.com](mailto:hmac-sha2-256-etm%40openssh.com), hmac-sha2-512, [hmac-sha2-512-etm@openssh.com](mailto:hmac-sha2-512-etm%40openssh.com) |
| Password | Password for the user account |
| Permissions | The permissions to assign to the file after transferring it. Format must be either UNIX rwxrwxrwx with a - in place of denied permissions (e.g. rw-r–r–) or an octal number (e.g. 644). If not set, the permissions will not be changed. You may also use expression language such as ${file.permissions}. If the value is invalid, the processor will not be invalid but will fail to change permissions of the file. |
| Port | The port that the remote system is listening on for file transfers |
| Private Key Passphrase | Password for the private key |
| Private Key Path | The fully qualified path to the Private Key file |
| Reject Zero-Byte Files | Determines whether or not Zero-byte files should be rejected without attempting to transfer |
| Remote Group | Integer value representing the Group ID to set on the file after transferring it. If not set, the group will not be set. You may also use expression language such as ${file.group}. If the value is invalid, the processor will not be invalid but will fail to change the group of the file. |
| Remote Owner | Integer value representing the User ID to set on the file after transferring it. If not set, the owner will not be set. You may also use expression language such as ${file.owner}. If the value is invalid, the processor will not be invalid but will fail to change the owner of the file. |
| Remote Path | The path on the remote system from which to pull or push files |
| Send Keep Alive On Timeout | Send a Keep Alive message every 5 seconds up to 5 times for an overall timeout of 25 seconds. |
| Strict Host Key Checking | Indicates whether or not strict enforcement of hosts keys should be applied |
| Temporary Filename | If set, the filename of the sent file will be equal to the value specified during the transfer and after successful completion will be renamed to the original filename. If this value is set, the Dot Rename property is ignored. |
| Use Compression | Indicates whether or not ZLIB compression should be used when transferring files |
| Username | Username |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the remote system; failure is usually looped back to this processor |
| reject | FlowFiles that were rejected by the destination system |
| success | FlowFiles that are successfully sent will be routed to success |

## See also

* [org.apache.nifi.processors.standard.GetSFTP](getsftp.md)

---
title: PutSmbFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsmbfile.md
section: Loading & Unloading Data
---

# PutSmbFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-smb-nar

## Description

Writes the contents of a FlowFile to a samba network location. Use this processor instead of a cifs mounts if share access control is important. Configure the Hostname, Share and Directory accordingly: \[Hostname][Share][pathtoDirectory]

## Tags

samba, smb, cifs, files, put

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The maximum number of files to put in each iteration |
| Conflict Resolution Strategy | Indicates what should happen when a file with the same name already exists in the output directory |
| Create Missing Directories | If true, then missing destination directories will be created. If false, flowfiles are penalized and sent to failure. |
| Directory | The network folder to which files should be written. This is the remaining relative path after the share: \hostnameshare[dir1dir2]. You may use expression language. |
| Domain | The domain used for authentication. Optional, in most cases username and password is sufficient. |
| Hostname | The network host to which files should be written. |
| Password | The password used for authentication. Required if Username is set. |
| Share | The network share to which files should be written. This is the “first folder”after the hostname: \hostname[share]dir1dir2 |
| Share Access Strategy | Indicates which shared access are granted on the file during the write. None is the most restrictive, but the safest setting to prevent corruption. |
| Temporary Suffix | A temporary suffix which will be apended to the filename while it’s transfering. After the transfer is complete, the suffix will be removed. |
| Username | The username used for authentication. If no username is set then anonymous authentication is attempted. |
| enable-dfs | Enables accessing Distributed File System (DFS) and following DFS links during SMB operations. |
| smb-dialect | The SMB dialect is negotiated between the client and the server by default to the highest common version supported by both end. In some rare cases, the client-server communication may fail with the automatically negotiated dialect. This property can be used to set the dialect explicitly (e.g. to downgrade to a lower version), when those situations would occur. |
| timeout | Timeout for read and write operations. |
| use-encryption | Turns on/off encrypted communication between the client and the server. The property’s behavior is SMB dialect dependent: SMB 2.x does not support encryption and the property has no effect. In case of SMB 3.x, it is a hint/request to the server to turn encryption on if the server also supports it. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Files that could not be written to the output network path for some reason are transferred to this relationship |
| success | Files that have been successfully written to the output network path are transferred to this relationship |

## See also

* [org.apache.nifi.processors.smb.FetchSmb](fetchsmb.md)
* [org.apache.nifi.processors.smb.GetSmbFile](getsmbfile.md)
* [org.apache.nifi.processors.smb.ListSmb](listsmb.md)

---
title: PutSnowflakeInternalStageFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsnowflakeinternalstagefile.md
section: Loading & Unloading Data
---

# PutSnowflakeInternalStageFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Puts files into a Snowflake internal stage. The internal stage must be created in the Snowflake account beforehand.

## Tags

connection, database, jdbc, openflow, snowflake, snowpipe

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Compression Enabled | Set true to compress data before uploading the file |
| Database | The database to use by default. The same as passing ‘db=DATABASE_NAME’ to the connection string. |
| File Name | Destination file name to use. |
| File Prefix | Path prefix under which the data should be uploaded on the stage. |
| Internal Stage Type | The type of internal stage to use |
| Schema | The schema to use by default. The same as passing ‘schema=SCHEMA’ to the connection string. |
| Snowflake Connection Service | Database Connection Service for accessing Snowflake |
| Stage | The name of the internal stage in the Snowflake account to put files into. |
| Table | The name of the table in the Snowflake account. |

## Relationships

| Name | Description |
| --- | --- |
| failure | For FlowFiles of failed PUT operation |
| success | For FlowFiles of successful PUT operation |

## Writes attributes

| Name | Description |
| --- | --- |
| snowflake.staged.file.path | Staged file path |

---
title: PutSnowpipeStreaming 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsnowpipestreaming.md
section: Loading & Unloading Data
---

# PutSnowpipeStreaming 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowpipe-processors-nar

## Description

Streams records into a Snowflake table. The table must be created in the Snowflake account beforehand.

## Tags

connection, database, jdbc, openflow, snowflake, snowpipe streaming

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Account | Snowflake Account Identifier with Organization Name and Account Name formatted as [organization-name]-[account-name] |
| Authentication Strategy | Strategy for authenticating Snowflake connections |
| Client Lag | The maximum amount of time that the client will wait before flushing records to Snowflake. A larger value can increase latency while sending to Snowflake, but for tables that are not constantly updated it can result in queries that are faster and more cost efficient. |
| Concurrency Group | Allows specifying a ‘Concurrency Group’ that a given FlowFile belongs to, so that the number of Concurrent Tasks that write to tables in a given group can be limited. |
| Connection Strategy | Strategy for connecting to Snowflake Snowpipe Streaming services |
| Database | Snowflake Database destination for processed records |
| Delivery Guarantee | Specifies the delivery guarantee for the records being sent to Snowflake. |
| Iceberg Enabled | Specifies whether the processor ingests data into an Iceberg table. The processor fails if this property doesn’t match the actual table type. |
| Max Batch Size | Maximum number of records to ingest in a single call. Multiple ingest calls will be made if the number of records exceeds the max batch size. Current guidance recommends batch sizes less than 16MB. The Max Batch Size can be tuned based on the average record size such that batches are generally less than 16MB. |
| Max Tasks Per Group | The maximum number of channels to create for a given Snowpipe Channel Prefix. This allows limiting the number of concurrent tasks that can be writing to a given Snowflake table. |
| Private Key Service | RSA Private Key Service for authenticating connections |
| Record Offset | The Expression Language expression to use to determine the offset of the first record in a FlowFile. |
| Record Offset Record Path | The Record Path expression to use to determine the offset of the first record in a FlowFile. |
| Record Offset Strategy | Specifies the strategy for determining the offset of each record. |
| Record Reader | The Record Reader to use for reading the input |
| Role | Snowflake Role the user will assume when authenticating connections |
| Schema | Snowflake Schema destination for processed records |
| Snowpipe Channel Index | The index to use for the Snowpipe channel name. The full channel name will be constructed as openflow.[prefix].[index]. This is necessary in order to provide Exactly Once delivery to Snowflake, as any retry must be tried against the same channel as was previously used. |
| Snowpipe Channel Prefix | The prefix to use for the Snowpipe channel name. The full channel name will be constructed as openflow.[prefix].[index]. The default value is ${hostname(false)}, which ensures that each NiFi node in the cluster writes to a unique channel by incorporating the hostname of the NiFi instance into the channel name. |
| Table | Snowflake Table destination for processed records |
| User | Snowflake User for authenticating connections |

## Relationships

| Name | Description |
| --- | --- |
| failure | For FlowFiles that failed to upload to Snowflake |
| success | For FlowFiles successfully uploaded to Snowflake |

## Use cases

|  |
| --- |
| Write record-oriented data to a Snowflake table as fast as possible, accepting the possible of occasional duplicates. |

---
title: PutSnowpipeStreaming2 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsnowpipestreaming2.md
section: Loading & Unloading Data
---

# PutSnowpipeStreaming2 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowpipe-streaming-2-processors-nar

## Description

Send Records formatted as Newline Delimited JSON to Snowflake Database Pipes using Snowpipe Streaming Version 2.

## Tags

NDJSON, Preview, Snowflake, Snowpipe Streaming

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Account | Snowflake Account Identifier with Organization Name and Account Name formatted as [organization-name]-[account-name] |
| Authentication Strategy | Strategy for authenticating Snowflake connections |
| Channel Group | Group for managing distinct Snowpipe Streaming Channels with partitioning |
| Channel Insert Timeout | Maximum duration to retry inserting records before failing with an upper bound of 5 minutes |
| Database | Snowflake Database destination for processed records |
| File Fragment Count | Maximum number of File Fragments sent to object storage for Snowpipe Streaming ingestion from input FlowFiles. Must be between 1 and 100. |
| File Fragment Size | Maximum size in bytes for each File Fragment sent to object storage for Snowpipe Streaming ingestion. Must be between 1 KB and 256 MB |
| Offset Token End Expression | Expression Language definition to produce the highest offset token for a FlowFile as a monotonically increasing number |
| Offset Token Record Pointer | JSON Pointer to offset token in each record required when the last committed offset token is between start and end boundaries |
| Offset Token Start Expression | Expression Language definition to produce the lowest offset token for a FlowFile as a monotonically increasing number |
| Offset Tracking Timeout | Maximum duration to poll channel status for committed offset tokens |
| Pipe | Snowflake Pipe destination for processed records |
| Private Key Service | RSA Private Key Service for authenticating connections |
| Schema | Snowflake Schema destination for processed records |
| Transfer Strategy | Strategy for transferring records to Snowpipe Streaming |
| User | Snowflake User for authenticating connections |
| Web Client Service Provider | Web Client Service Provider supporting HTTP request and response handling |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to upload to Snowflake |
| invalid | FlowFiles that Snowflake identified as containing one or more invalid rows resulting in partial transmission |
| success | FlowFiles successfully uploaded to Snowflake |

---
title: PutSNS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsns.md
section: Loading & Unloading Data
---

# PutSNS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Sends the content of a FlowFile as a notification to the Amazon Simple Notification Service

## Tags

amazon, aws, publish, pubsub, put, sns, topic

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ARN Type | The type of Amazon Resource Name that is being used. |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Amazon Resource Name (ARN) | The name of the resource to which notifications should be published |
| Character Set | The character set in which the FlowFile’s content is encoded |
| Communications Timeout |  |
| Deduplication Message ID | The token used for deduplication of sent messages |
| E-mail Subject | The optional subject to use for any subscribers that are subscribed via E-mail |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Message Group ID | If using FIFO, the message group to which the flowFile belongs |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Use JSON Structure | If true, the contents of the FlowFile must be JSON with a top-level element named ‘default’. Additional elements can be used to send different messages to different protocols. See the Amazon SNS Documentation for more information. |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## See also

* [org.apache.nifi.processors.aws.sqs.GetSQS](getsqs.md)
* [org.apache.nifi.processors.aws.sqs.PutSQS](putsqs.md)

---
title: PutSplunk 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsplunk.md
section: Loading & Unloading Data
---

# PutSplunk 2025.10.9.21

## Bundle

org.apache.nifi | nifi-splunk-nar

## Description

Sends logs to Splunk Enterprise over TCP, TCP + TLS/SSL, or UDP. If a Message Delimiter is provided, then this processor will read messages from the incoming FlowFile based on the delimiter, and send each message to Splunk. If a Message Delimiter is not provided then the content of the FlowFile will be sent directly to Splunk as if it were a single message.

## Tags

logs, splunk, tcp, udp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies the character set of the data being sent. |
| Hostname | Destination hostname or IP address |
| Idle Connection Expiration | The amount of time a connection should be held open without being used before closing the connection. A value of 0 seconds will disable this feature. |
| Max Size of Socket Send Buffer | The maximum size of the socket send buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Message Delimiter | Specifies the delimiter to use for splitting apart multiple messages within a single FlowFile. If not specified, the entire content of the FlowFile will be used as a single message. If specified, the contents of the FlowFile will be split on this delimiter and each section sent as a separate message. Note that if messages are delimited and some messages for a given FlowFile are transferred successfully while others are not, the messages will be split into individual FlowFiles, such that those messages that were successfully sent are routed to the ‘success’ relationship while other messages are sent to the ‘failure’ relationship. |
| Port | Destination port number |
| Protocol | The protocol for communication. |
| SSL Context Service | Specifies the SSL Context Service to enable TLS socket communication |
| Timeout | The timeout for connecting to and communicating with the destination. Does not apply to UDP |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are sent out this relationship. |
| success | FlowFiles that are sent successfully to the destination are sent out this relationship. |

---
title: PutSplunkHTTP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsplunkhttp.md
section: Loading & Unloading Data
---

# PutSplunkHTTP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-splunk-nar

## Description

Sends flow file content to the specified Splunk server over HTTP or HTTPS. Supports HEC Index Acknowledgement.

## Tags

http, logs, splunk

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Hostname | The ip address or hostname of the Splunk server. |
| Owner | The owner to pass to Splunk. |
| Password | The password to authenticate to Splunk. |
| Port | The HTTP Event Collector HTTP Port Number. |
| Scheme | The scheme for connecting to Splunk. |
| Security Protocol | The security protocol to use for communicating with Splunk. |
| Token | HTTP Event Collector token starting with the string Splunk. For example ‘Splunk 1234578-abcd-1234-abcd-1234abcd’ |
| Username | The username to authenticate to Splunk. |
| character-set | The name of the character set. |
| content-type | The media type of the event sent to Splunk. If not set, “mime.type” flow file attribute will be used. In case of neither of them is specified, this information will not be sent to the server. |
| host | Specify with the host query string parameter. Sets a default for all events when unspecified. |
| index | Index name. Specify with the index query string parameter. Sets a default for all events when unspecified. |
| request-channel | Identifier of the used request channel. |
| source | User-defined event source. Sets a default for all events when unspecified. |
| source-type | User-defined event sourcetype. Sets a default for all events when unspecified. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are sent to this relationship. |
| success | FlowFiles that are sent successfully to the destination are sent to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| splunk.acknowledgement.id | The indexing acknowledgement id provided by Splunk. |
| splunk.responded.at | The time of the response of put request for Splunk. |

## See also

* [org.apache.nifi.processors.splunk.QuerySplunkIndexingStatus](querysplunkindexingstatus.md)

---
title: PutSQL 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsql.md
section: Loading & Unloading Data
---

# PutSQL 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Executes a SQL UPDATE or INSERT command. The content of an incoming FlowFile is expected to be the SQL command to execute. The SQL command may use the ? to escape parameters. In this case, the parameters to use must exist as FlowFile attributes with the naming convention sql.args. N.type and sql.args. N.value, where N is a positive integer. The sql.args. N.type is expected to be a number indicating the JDBC Type. The content of the FlowFile is expected to be in UTF-8 format.

## Tags

database, insert, put, rdbms, relational, sql, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The preferred number of FlowFiles to put to the database in a single transaction |
| JDBC Connection Pool | Specifies the JDBC Connection Pool to use in order to convert the JSON message to a SQL statement. The Connection Pool is necessary in order to determine the appropriate database column types. |
| Obtain Generated Keys | If true, any key that is automatically generated by the database will be added to the FlowFile that generated it using the sql.generate.key attribute. This may result in slightly slower performance and is not supported by all databases. |
| Rollback On Failure | Specify how to handle error. By default (false), if an error occurs while processing a FlowFile, the FlowFile will be routed to ‘failure’ or ‘retry’ relationship based on error type, and processor can continue with next FlowFile. Instead, you may want to rollback currently processed FlowFiles and stop further processing immediately. In that case, you can do so by enabling this ‘Rollback On Failure’ property. If enabled, failed FlowFiles will stay in the input relationship without penalizing it and being processed repeatedly until it gets processed successfully or removed by other means. It is important to set adequate ‘Yield Duration’ to avoid retrying too frequently. |
| Support Fragmented Transactions | If true, when a FlowFile is consumed by this Processor, the Processor will first check the fragment.identifier and fragment.count attributes of that FlowFile. If the fragment.count value is greater than 1, the Processor will not process any FlowFile with that fragment.identifier until all are available; at that point, it will process all FlowFiles with that fragment.identifier as a single transaction, in the order specified by the FlowFiles ‘fragment.index attributes. This Provides atomicity of those SQL statements. Once any statement of this transaction throws exception when executing, this transaction will be rolled back. When transaction rollback happened, none of these FlowFiles would be routed to’success ‘. If the <Rollback On Failure> is set true, these FlowFiles will stay in the input relationship. When the <Rollback On Failure> is set false,, if any of these FlowFiles will be routed to’ retry ‘, all of these FlowFiles will be routed to’ retry ‘.Otherwise, they will be routed to’ failure’. If this value is false, these attributes will be ignored and the updates will occur independent of one another. |
| Transaction Timeout | If the <Support Fragmented Transactions> property is set to true, specifies how long to wait for all FlowFiles for a particular fragment.identifier attribute to arrive before just transferring all of the FlowFiles with that identifier to the ‘failure’ relationship |
| database-session-autocommit | The autocommit mode to set on the database connection being used. If set to false, the operation(s) will be explicitly committed or rolled back (based on success or failure respectively), if set to true the driver/database handles the commit/rollback. |
| putsql-sql-statement | The SQL statement to execute. The statement can be empty, a constant value, or built from attributes using Expression Language. If this property is specified, it will be used regardless of the content of incoming FlowFiles. If this property is empty, the content of the incoming FlowFile is expected to contain a valid SQL statement, to be issued by the processor to the database. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if the database cannot be updated and retrying the operation will also fail, such as an invalid query or an integrity constraint violation |
| retry | A FlowFile is routed to this relationship if the database cannot be updated but attempting the operation again may succeed |
| success | A FlowFile is routed to this relationship after the database is successfully updated |

## Writes attributes

| Name | Description |
| --- | --- |
| sql.generated.key | If the database generated a key for an INSERT statement and the Obtain Generated Keys property is set to true, this attribute will be added to indicate the generated key, if possible. This feature is not supported by all database vendors. |

---
title: PutSQS 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsqs.md
section: Loading & Unloading Data
---

# PutSQS 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Publishes a message to an Amazon Simple Queuing Service Queue

## Tags

AWS, Amazon, Publish, Put, Queue, SQS

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Deduplication Message ID | The token used for deduplication of sent messages |
| Delay | The amount of time to delay the message before it becomes available to consumers |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Message Group ID | If using FIFO, the message group to which the FlowFile belongs |
| Queue URL | The URL of the queue to act upon |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## See also

* [org.apache.nifi.processors.aws.sqs.DeleteSQS](deletesqs.md)
* [org.apache.nifi.processors.aws.sqs.GetSQS](getsqs.md)

---
title: PutSyslog 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putsyslog.md
section: Loading & Unloading Data
---

# PutSyslog 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends Syslog messages to a given host and port over TCP or UDP. Messages are constructed from the “Message ___” properties of the processor which can use expression language to generate messages from incoming FlowFiles. The properties are used to construct messages of the form: (<PRIORITY>)(VERSION )(TIMESTAMP) (HOSTNAME) (BODY) where version is optional. The constructed messages are checked against regular expressions for RFC5424 and RFC3164 formatted messages. The timestamp can be an RFC5424 timestamp with a format of “yyyy-MM-dd ‘T’HH:mm:ss. S ‘Z’” or “yyyy-MM-dd ‘T’HH:mm:ss. S+hh:mm”, or it can be an RFC3164 timestamp with a format of “MMM d HH:mm:ss”. If a message is constructed that does not form a valid Syslog message according to the above description, then it is routed to the invalid relationship. Valid messages are sent to the Syslog server and successes are routed to the success relationship, failures routed to the failure relationship.

## Tags

logs, put, syslog, tcp, udp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of incoming FlowFiles to process in a single execution of this processor. |
| Character Set | Specifies the character set of the Syslog messages. Note that Expression language is not evaluated per FlowFile. |
| Hostname | The IP address or hostname of the Syslog server. |
| Idle Connection Expiration | The amount of time a connection should be held open without being used before closing the connection. |
| Max Size of Socket Send Buffer | The maximum size of the socket send buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Message Body | The body for the Syslog messages. |
| Message Hostname | The hostname for the Syslog messages. |
| Message Priority | The priority for the Syslog messages, excluding < >. |
| Message Timestamp | The timestamp for the Syslog messages. The timestamp can be an RFC5424 timestamp with a format of “yyyy-MM-dd ‘T’HH:mm:ss. S ‘Z’” or “yyyy-MM-dd ‘T’HH:mm:ss. S+hh:mm”, “ or it can be an RFC3164 timestamp with a format of “MMM d HH:mm:ss”. |
| Message Version | The version for the Syslog messages. |
| Port | The port for Syslog communication. Note that Expression language is not evaluated per FlowFile. |
| Protocol | The protocol for Syslog communication. |
| SSL Context Service | The Controller Service to use in order to obtain an SSL Context. If this property is set, syslog messages will be sent over a secure connection. |
| Timeout | The timeout for connecting to and communicating with the syslog server. Does not apply to UDP. Note that Expression language is not evaluated per FlowFile. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to Syslog are sent out this relationship. |
| invalid | FlowFiles that do not form a valid Syslog message are sent out this relationship. |
| success | FlowFiles that are sent successfully to Syslog are sent out this relationship. |

## See also

* [org.apache.nifi.processors.standard.ListenSyslog](listensyslog.md)
* [org.apache.nifi.processors.standard.ParseSyslog](parsesyslog.md)

---
title: PutTCP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/puttcp.md
section: Loading & Unloading Data
---

# PutTCP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Sends serialized FlowFiles or Records over TCP to a configurable destination with optional support for TLS

## Tags

egress, put, remote, tcp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | Specifies the character set of the data being sent. |
| Connection Per FlowFile | Specifies whether to send each FlowFile’s content on an individual connection. |
| Hostname | Destination hostname or IP address |
| Idle Connection Expiration | The amount of time a connection should be held open without being used before closing the connection. A value of 0 seconds will disable this feature. |
| Max Size of Socket Send Buffer | The maximum size of the socket send buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Outgoing Message Delimiter | Specifies the delimiter to use when sending messages out over the same TCP stream. The delimiter is appended to each FlowFile message that is transmitted over the stream so that the receiver can determine when one message ends and the next message begins. Users should ensure that the FlowFile content does not contain the delimiter character to avoid errors. In order to use a new line character you can enter ‘n’. For a tab character use ‘t’. Finally for a carriage return use ‘r’. |
| Port | Destination port number |
| Record Reader | Specifies the Controller Service to use for reading Records from input FlowFiles |
| Record Writer | Specifies the Controller Service to use for writing Records to the configured socket address |
| SSL Context Service | Specifies the SSL Context Service to enable TLS socket communication |
| Timeout | The timeout for connecting to and communicating with the destination. Does not apply to UDP |
| Transmission Strategy | Specifies the strategy used for reading input FlowFiles and transmitting messages to the destination socket address |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are sent out this relationship. |
| success | FlowFiles that are sent successfully to the destination are sent out this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count.transmitted | Count of records transmitted to configured destination address |

## See also

* [org.apache.nifi.processors.standard.ListenTCP](listentcp.md)
* [org.apache.nifi.processors.standard.PutUDP](putudp.md)

---
title: PutUDP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putudp.md
section: Loading & Unloading Data
---

# PutUDP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

The PutUDP processor receives a FlowFile and packages the FlowFile content into a single UDP datagram packet which is then transmitted to the configured UDP server. The user must ensure that the FlowFile content being fed to this processor is not larger than the maximum size for the underlying UDP transport. The maximum transport size will vary based on the platform setup but is generally just under 64KB. FlowFiles will be marked as failed if their content is larger than the maximum transport size.

## Tags

egress, put, remote, udp

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Hostname | Destination hostname or IP address |
| Idle Connection Expiration | The amount of time a connection should be held open without being used before closing the connection. A value of 0 seconds will disable this feature. |
| Max Size of Socket Send Buffer | The maximum size of the socket send buffer that should be used. This is a suggestion to the Operating System to indicate how big the socket buffer should be. If this value is set too low, the buffer may fill up before the data can be read, and incoming data will be dropped. |
| Port | Destination port number |
| Timeout | The timeout for connecting to and communicating with the destination. Does not apply to UDP |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are sent out this relationship. |
| success | FlowFiles that are sent successfully to the destination are sent out this relationship. |

## See also

* [org.apache.nifi.processors.standard.ListenUDP](listenudp.md)
* [org.apache.nifi.processors.standard.PutTCP](puttcp.md)

---
title: PutUnityCatalogFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putunitycatalogfile.md
section: Loading & Unloading Data
---

# PutUnityCatalogFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Write FlowFile content with max size of 5 GiB to Unity Catalog.

## Tags

databricks, openflow, unity catalog

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Unity Catalog File Path | Unity Catalog file path e.g. /Volumes/catalog/schema/volume_name/file.txt |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: PutVectaraDocument 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putvectaradocument.md
section: Loading & Unloading Data
---

# PutVectaraDocument 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-vectara-processors-nar

## Description

Generate and upload a JSON document to Vectara’s upload endpoint. The input text can be JSON Object, JSON Array, or JSONL format.

## Tags

ai, llm, openflow, rag, vectara

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Corpus ID | Identifier of the Vectara corpus |
| Document Attributes | A comma delimited list of NiFi attributes fields, which if present will be included in the document metadata. |
| Document Author | Author of the document |
| Document Creation Time | Timestamp in epoch seconds when the document was created |
| Document Date | Date of document creation |
| Document Description | Description of the document |
| Document ID | A unique identifier for the document constructed either from the source path of the document or a hash of the document’s content. |
| Document Source URL | Source URL for document |
| Document Title | Document Title |
| Index Input Format | Input format for indexing service. JSON Object: Load FlowFile content directly as JSON payload. JSON Lines: Create a new section for each line of JSON. JSON Array: Load FlowFile content as a JSON array and create a new section for each element in the JSON array. |
| Section Custom Dimensions | A comma delimited list of metadata fields, which if present in the metadata path will be included as a section’s custom dimension. The values for custom dimensions must be valid numbers. |
| Section Filter Attributes | A comma delimited list of metadata fields, which if present in the metadata path will be included as a section metadata filter. |
| Section ID Attribute | The field for setting section id, which is populated if present in the metadata path. |
| Section Metadata Attributes | A comma delimited list of metadata fields, which if present in the metadata path will be included will be included in the section metadata. |
| Section Metadata JSON Path | A JSON Path expression to a metadata JSON Object. The JSON Object needs to contain the list of metadata fields. These fields will be included in Section metadata. |
| Section Text JSON Path | A JSON Path expression to the text field. |
| Section Title Attribute | The field for setting the section title, which is populated if present in the metadata path. |
| Vectara Client | Vectara Client Service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Vectara failure relationship |
| original | Original relationship |
| success | Vectara success relationship |

## Use Cases Involving Other Components

|  |
| --- |
| Publish a PDF file to a Vectara corpus. |

## See also

* [com.snowflake.openflow.runtime.processors.vectara.PutVectaraFile](putvectarafile.md)

---
title: PutVectaraFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putvectarafile.md
section: Loading & Unloading Data
---

# PutVectaraFile 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-vectara-processors-nar

## Description

Upload a FlowFile content to Vectara’s index endpoint. Document filter attributes and metadata attributes can be set by referencing FlowFile attributes.

## Tags

ai, llm, openflow, rag, vectara

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Corpus ID | Identifier of the Vectara corpus |
| Document Filter Attributes | A comma delimited list of metadata fields, which if present in the FlowFile attributes will be included in as a document metadata filter. |
| Document ID | A unique identifier for the document constructed either from the source path of the document or a hash of the document’s content. |
| Document Metadata Attributes | A comma delimited list of metadata fields, which if present in the FlowFile attributes will be included will be included in the document metadata. |
| Vectara Client | Vectara Client Service. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Vectara failure relationship |
| original | Original relationship |
| success | Vectara success relationship |

## See also

* [com.snowflake.openflow.runtime.processors.vectara.PutVectaraDocument](putvectaradocument.md)

---
title: PutWebSocket 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putwebsocket.md
section: Loading & Unloading Data
---

# PutWebSocket 2025.10.9.21

## Bundle

org.apache.nifi | nifi-websocket-processors-nar

## Description

Sends messages to a WebSocket remote endpoint using a WebSocket session that is established by either ListenWebSocket or ConnectWebSocket.

## Tags

WebSocket, publish, send

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| websocket-controller-service-id | A NiFi Expression to retrieve the id of a WebSocket ControllerService. |
| websocket-endpoint-id | A NiFi Expression to retrieve the endpoint id of a WebSocket ControllerService. |
| websocket-message-type | The type of message content: TEXT or BINARY |
| websocket-session-id | A NiFi Expression to retrieve the session id. If not specified, a message will be sent to all connected WebSocket peers for the WebSocket controller service endpoint. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to send to the destination are transferred to this relationship. |
| success | FlowFiles that are sent successfully to the destination are transferred to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| websocket.controller.service.id | WebSocket Controller Service id. |
| websocket.session.id | Established WebSocket session id. |
| websocket.endpoint.id | WebSocket endpoint id. |
| websocket.message.type | TEXT or BINARY. |
| websocket.local.address | WebSocket server address. |
| websocket.remote.address | WebSocket client address. |
| websocket.failure.detail | Detail of the failure. |

---
title: PutZendeskTicket 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/putzendeskticket.md
section: Loading & Unloading Data
---

# PutZendeskTicket 2025.10.9.21

## Bundle

org.apache.nifi | nifi-zendesk-nar

## Description

Create Zendesk tickets using the Zendesk API.

## Tags

zendesk, ticket

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| web-client-service-provider | Controller service for HTTP client operations. |
| zendesk-authentication-type-name | Type of authentication to Zendesk API. |
| zendesk-authentication-value-name | Password or authentication token for Zendesk login user. |
| zendesk-comment-body | The content or the path to the comment body in the incoming record. |
| zendesk-priority | The content or the path to the priority in the incoming record. |
| zendesk-record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema. |
| zendesk-subdomain | Name of the Zendesk subdomain. |
| zendesk-subject | The content or the path to the subject in the incoming record. |
| zendesk-type | The content or the path to the type in the incoming record. |
| zendesk-user | Login user to Zendesk subdomain. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if the operation failed and retrying the operation will also fail, such as an invalid data or schema. |
| success | For FlowFiles created as a result of a successful HTTP request. |

## Writes attributes

| Name | Description |
| --- | --- |
| record.count | The number of records processed. |
| error.code | The error code of from the response. |
| error.message | The error message of from the response. |

---
title: QueryAzureDataExplorer 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/queryazuredataexplorer.md
section: Loading & Unloading Data
---

# QueryAzureDataExplorer 2025.10.9.21

## Bundle

org.apache.nifi | nifi-azure-nar

## Description

Query Azure Data Explorer and stream JSON results to output FlowFiles

## Tags

ADX, Azure, Data, Explorer, Kusto

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Database Name | Azure Data Explorer Database Name for querying |
| Kusto Query Service | Azure Data Explorer Kusto Query Service |
| Query | Query to be run against Azure Data Explorer |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles containing original input associated with a failed Query |
| success | FlowFiles containing results of a successful Query |

## Writes attributes

| Name | Description |
| --- | --- |
| query.error.message | Azure Data Explorer query error message on failures |
| query.executed | Azure Data Explorer query executed |
| mime.type | Content Type set to application/json |

---
title: QueryDatabaseTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querydatabasetable.md
section: Loading & Unloading Data
---

# QueryDatabaseTable 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima. Query result will be converted to Avro format. Expression Language is supported for several properties, but no incoming connections are permitted. The Environment/System properties may be used to provide values for any property containing Expression Language. If it is desired to leverage flow file attributes to perform these queries, the GenerateTableFetch and/or ExecuteSQL processors can be used for this purpose. Streaming is used so arbitrarily large result sets are supported. This processor can be scheduled to run on a timer or cron expression, using the standard scheduling methods. This processor is intended to be run on the Primary Node only. FlowFile attribute ‘querydbtable.row.count’ indicates how many rows were selected.

## Tags

database, jdbc, query, select, sql

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Columns to Return | A comma-separated list of column names to be used in the query. If your database requires special treatment of the names (quoting, e.g.), each name should include such treatment. If no column names are supplied, all columns in the specified table will be returned. NOTE: It is important to use consistent column names for a given table for incremental fetch to work properly. |
| Database Connection Pooling Service | The Controller Service that is used to obtain a connection to the database. |
| Database Dialect Service | Database Dialect Service for generating statements specific to a particular service or vendor. |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Fetch Size | The number of result rows to be fetched from the result set at a time. This is a hint to the database driver and may not be honored and/or exact. If the value specified is zero, then the hint is ignored. If using PostgreSQL, then ‘Set Auto Commit’ must be equal to ‘false’ to cause ‘Fetch Size’ to take effect. |
| Max Wait Time | The maximum amount of time allowed for a running SQL select query , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| Maximum-value Columns | A comma-separated list of column names. The processor will keep track of the maximum value for each column that has been returned since the processor started running. Using multiple columns implies an order to the column list, and each column ‘s values are expected to increase more slowly than the previous columns’ values. Thus, using multiple columns implies a hierarchical structure of columns, which is usually used for partitioning tables. This processor can be used to retrieve only those rows that have been added/updated since the last retrieval. Note that some JDBC types such as bit/boolean are not conducive to maintaining maximum value, so columns of these types should not be listed in this property, and will result in error(s) during processing. If no columns are provided, all rows from the table will be considered, which could have a performance impact. NOTE: It is important to use consistent max-value column names for a given table for incremental fetch to work properly. |
| Normalize Table and Column Names | Whether to change non-Avro-compatible characters in column names to Avro-compatible characters. For example, colons and periods will be changed to underscores in order to build a valid Avro record. |
| Set Auto Commit | Allows enabling or disabling the auto commit functionality of the DB connection. Default value is ‘No value set’. ‘No value set’ will leave the db connection ‘s auto commit mode unchanged. For some JDBC drivers such as PostgreSQL driver, it is required to disable the auto commit functionality to get the’Fetch Size ‘setting to take effect. When auto commit is enabled, PostgreSQL driver ignores’Fetch Size’setting and loads all rows of the result set to memory at once. This could lead for a large amount of memory usage when executing queries which fetch large data sets. More Details of this behaviour in PostgreSQL driver can be found in <https://jdbc.postgresql.org//documentation/head/query.html>. |
| Table Name | The name of the database table to be queried. When a custom query is used, this property is used to alias the query and appears as an attribute on the FlowFile. |
| Use Avro Logical Types | Whether to use Avro Logical Types for DECIMAL/NUMBER, DATE, TIME and TIMESTAMP columns. If disabled, written as string. If enabled, Logical types are used and written as its underlying type, specifically, DECIMAL/NUMBER as logical ‘decimal’: written as bytes with additional precision and scale meta data, DATE as logical ‘date-millis’: written as int denoting days since Unix epoch (1970-01-01), TIME as logical ‘time-millis’: written as int denoting milliseconds since Unix epoch, and TIMESTAMP as logical ‘timestamp-millis’: written as long denoting milliseconds since Unix epoch. If a reader of written Avro records also knows these logical types, then these values can be deserialized with more context depending on reader implementation. |
| db-fetch-db-type | Database Type for generating statements specific to a particular service or vendor. The Generic Type supports most cases but selecting a specific type enables optimal processing or additional features. |
| db-fetch-sql-query | A custom SQL query used to retrieve data. Instead of building a SQL query from other properties, this query will be wrapped as a sub-query. Query must have no ORDER BY statement. |
| db-fetch-where-clause | A custom clause to be added in the WHERE condition when building SQL queries. |
| initial-load-strategy | How to handle existing rows in the database table when the processor is started for the first time (or its state has been cleared). The property will be ignored, if any ‘initial.maxvalue.\*’ dynamic property has also been configured. |
| qdbt-max-frags | The maximum number of fragments. If the value specified is zero, then all fragments are returned. This prevents OutOfMemoryError when this processor ingests huge table. NOTE: Setting this property can result in data loss, as the incoming results are not ordered, and fragments may end at arbitrary boundaries where rows are not included in the result set. |
| qdbt-max-rows | The maximum number of result rows that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If the value specified is zero, then all rows are returned in a single FlowFile. |
| qdbt-output-batch-size | The number of output FlowFiles to queue before committing the process session. When set to zero, the session will be committed when all result set rows have been processed and the output FlowFiles are ready for transfer to the downstream relationship. For large result sets, this can cause a large burst of FlowFiles to be transferred at the end of processor execution. If this property is set, then when the specified number of FlowFiles are ready for transfer, then the session will be committed, thus releasing the FlowFiles to the downstream relationship. NOTE: The maxvalue.\* and fragment.count attributes will not be set on FlowFiles when this property is set. |
| transaction-isolation-level | This setting will set the transaction isolation level for the database connection for drivers that support this setting |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a query on the specified table, the maximum values for the specified column(s) will be retained for use in future executions of the query. This allows the Processor to fetch only those records that have max values greater than the retained values. This can be used for incremental fetching, fetching of newly added rows, etc. To clear the maximum values, clear the state of the processor per the State Management documentation |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| tablename | Name of the table being queried |
| querydbtable.row.count | The number of rows selected by the query |
| fragment.identifier | If ‘Max Rows Per Flow File’ is set then all FlowFiles from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results. |
| fragment.count | If ‘Max Rows Per Flow File’ is set then this is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet. If Output Batch Size is set, then this attribute will not be populated. |
| fragment.index | If ‘Max Rows Per Flow File’ is set then the position of this FlowFile in the list of outgoing FlowFiles that were all derived from the same result set FlowFile. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same query result set and in what order FlowFiles were produced |
| maxvalue.\* | Each attribute contains the observed maximum value of a specified ‘Maximum-value Column’. The suffix of the attribute is the name of the column. If Output Batch Size is set, then this attribute will not be populated. |

## See also

* [org.apache.nifi.processors.standard.ExecuteSQL](executesql.md)
* [org.apache.nifi.processors.standard.GenerateTableFetch](generatetablefetch.md)

---
title: QueryDatabaseTableRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querydatabasetablerecord.md
section: Loading & Unloading Data
---

# QueryDatabaseTableRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Generates a SQL select query, or uses a provided statement, and executes it to fetch all rows whose values in the specified Maximum Value column(s) are larger than the previously-seen maxima. Query result will be converted to the format specified by the record writer. Expression Language is supported for several properties, but no incoming connections are permitted. The Environment/System properties may be used to provide values for any property containing Expression Language. If it is desired to leverage flow file attributes to perform these queries, the GenerateTableFetch and/or ExecuteSQL processors can be used for this purpose. Streaming is used so arbitrarily large result sets are supported. This processor can be scheduled to run on a timer or cron expression, using the standard scheduling methods. This processor is intended to be run on the Primary Node only. FlowFile attribute ‘querydbtable.row.count’ indicates how many rows were selected.

## Tags

database, jdbc, query, record, select, sql

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Columns to Return | A comma-separated list of column names to be used in the query. If your database requires special treatment of the names (quoting, e.g.), each name should include such treatment. If no column names are supplied, all columns in the specified table will be returned. NOTE: It is important to use consistent column names for a given table for incremental fetch to work properly. |
| Database Connection Pooling Service | The Controller Service that is used to obtain a connection to the database. |
| Database Dialect Service | Database Dialect Service for generating statements specific to a particular service or vendor. |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| Fetch Size | The number of result rows to be fetched from the result set at a time. This is a hint to the database driver and may not be honored and/or exact. If the value specified is zero, then the hint is ignored. If using PostgreSQL, then ‘Set Auto Commit’ must be equal to ‘false’ to cause ‘Fetch Size’ to take effect. |
| Max Wait Time | The maximum amount of time allowed for a running SQL select query , zero means there is no limit. Max time less than 1 second will be equal to zero. |
| Maximum-value Columns | A comma-separated list of column names. The processor will keep track of the maximum value for each column that has been returned since the processor started running. Using multiple columns implies an order to the column list, and each column ‘s values are expected to increase more slowly than the previous columns’ values. Thus, using multiple columns implies a hierarchical structure of columns, which is usually used for partitioning tables. This processor can be used to retrieve only those rows that have been added/updated since the last retrieval. Note that some JDBC types such as bit/boolean are not conducive to maintaining maximum value, so columns of these types should not be listed in this property, and will result in error(s) during processing. If no columns are provided, all rows from the table will be considered, which could have a performance impact. NOTE: It is important to use consistent max-value column names for a given table for incremental fetch to work properly. |
| Set Auto Commit | Allows enabling or disabling the auto commit functionality of the DB connection. Default value is ‘No value set’. ‘No value set’ will leave the db connection ‘s auto commit mode unchanged. For some JDBC drivers such as PostgreSQL driver, it is required to disable the auto commit functionality to get the’Fetch Size ‘setting to take effect. When auto commit is enabled, PostgreSQL driver ignores’Fetch Size’setting and loads all rows of the result set to memory at once. This could lead for a large amount of memory usage when executing queries which fetch large data sets. More Details of this behaviour in PostgreSQL driver can be found in <https://jdbc.postgresql.org//documentation/head/query.html>. |
| Table Name | The name of the database table to be queried. When a custom query is used, this property is used to alias the query and appears as an attribute on the FlowFile. |
| Use Avro Logical Types | Whether to use Avro Logical Types for DECIMAL/NUMBER, DATE, TIME and TIMESTAMP columns. If disabled, written as string. If enabled, Logical types are used and written as its underlying type, specifically, DECIMAL/NUMBER as logical ‘decimal’: written as bytes with additional precision and scale meta data, DATE as logical ‘date-millis’: written as int denoting days since Unix epoch (1970-01-01), TIME as logical ‘time-millis’: written as int denoting milliseconds since Unix epoch, and TIMESTAMP as logical ‘timestamp-millis’: written as long denoting milliseconds since Unix epoch. If a reader of written Avro records also knows these logical types, then these values can be deserialized with more context depending on reader implementation. |
| db-fetch-db-type | Database Type for generating statements specific to a particular service or vendor. The Generic Type supports most cases but selecting a specific type enables optimal processing or additional features. |
| db-fetch-sql-query | A custom SQL query used to retrieve data. Instead of building a SQL query from other properties, this query will be wrapped as a sub-query. Query must have no ORDER BY statement. |
| db-fetch-where-clause | A custom clause to be added in the WHERE condition when building SQL queries. |
| initial-load-strategy | How to handle existing rows in the database table when the processor is started for the first time (or its state has been cleared). The property will be ignored, if any ‘initial.maxvalue.\*’ dynamic property has also been configured. |
| qdbt-max-frags | The maximum number of fragments. If the value specified is zero, then all fragments are returned. This prevents OutOfMemoryError when this processor ingests huge table. NOTE: Setting this property can result in data loss, as the incoming results are not ordered, and fragments may end at arbitrary boundaries where rows are not included in the result set. |
| qdbt-max-rows | The maximum number of result rows that will be included in a single FlowFile. This will allow you to break up very large result sets into multiple FlowFiles. If the value specified is zero, then all rows are returned in a single FlowFile. |
| qdbt-output-batch-size | The number of output FlowFiles to queue before committing the process session. When set to zero, the session will be committed when all result set rows have been processed and the output FlowFiles are ready for transfer to the downstream relationship. For large result sets, this can cause a large burst of FlowFiles to be transferred at the end of processor execution. If this property is set, then when the specified number of FlowFiles are ready for transfer, then the session will be committed, thus releasing the FlowFiles to the downstream relationship. NOTE: The maxvalue.\* and fragment.count attributes will not be set on FlowFiles when this property is set. |
| qdbtr-normalize | Whether to change characters in column names when creating the output schema. For example, colons and periods will be changed to underscores. |
| qdbtr-record-writer | Specifies the Controller Service to use for writing results to a FlowFile. The Record Writer may use Inherit Schema to emulate the inferred schema behavior, i.e. an explicit schema need not be defined in the writer, and will be supplied by the same logic used to infer the schema from the column types. |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | After performing a query on the specified table, the maximum values for the specified column(s) will be retained for use in future executions of the query. This allows the Processor to fetch only those records that have max values greater than the retained values. This can be used for incremental fetching, fetching of newly added rows, etc. To clear the maximum values, clear the state of the processor per the State Management documentation |

## Relationships

| Name | Description |
| --- | --- |
| success | Successfully created FlowFile from SQL query result set. |

## Writes attributes

| Name | Description |
| --- | --- |
| tablename | Name of the table being queried |
| querydbtable.row.count | The number of rows selected by the query |
| fragment.identifier | If ‘Max Rows Per Flow File’ is set then all FlowFiles from the same query result set will have the same value for the fragment.identifier attribute. This can then be used to correlate the results. |
| fragment.count | If ‘Max Rows Per Flow File’ is set then this is the total number of FlowFiles produced by a single ResultSet. This can be used in conjunction with the fragment.identifier attribute in order to know how many FlowFiles belonged to the same incoming ResultSet. If Output Batch Size is set, then this attribute will not be populated. |
| fragment.index | If ‘Max Rows Per Flow File’ is set then the position of this FlowFile in the list of outgoing FlowFiles that were all derived from the same result set FlowFile. This can be used in conjunction with the fragment.identifier attribute to know which FlowFiles originated from the same query result set and in what order FlowFiles were produced |
| maxvalue.\* | Each attribute contains the observed maximum value of a specified ‘Maximum-value Column’. The suffix of the attribute is the name of the column. If Output Batch Size is set, then this attribute will not be populated. |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer. |
| record.count | The number of records output by the Record Writer. |

## Use cases

|  |
| --- |
| Retrieve all rows from a database table. |
| Perform an incremental load of a single database table, fetching only new rows as they are added to the table. |

## Use Cases Involving Other Components

|  |
| --- |
| Perform an incremental load of multiple database tables, fetching only new rows as they are added to the tables. |

## See also

* [org.apache.nifi.processors.standard.ExecuteSQL](executesql.md)
* [org.apache.nifi.processors.standard.GenerateTableFetch](generatetablefetch.md)

---
title: QueryMilvus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querymilvus.md
section: Loading & Unloading Data
---

# QueryMilvus 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-milvus-processors-nar

## Description

Queries a given collection in a Milvus database using vectors. Results of query are added to current record under the results record path for each vector searched.

## Tags

chatbot, embeddings, gen ai, genai, generative ai, llm, metadata, milvus, openflow, publish, query, search, text, vector

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Collection Name | The name of the Milvus collection name to use |
| Max Query Batch Size | This is the number of vectors that are contained in a single request to Milvus during a query. Milvus is unable to support batch queries of more then 10 vectors at a time. |
| Maximum Results | The maximum number of results to return (i.e., Top K) |
| Milvus Connection Service | Connection Service for accessing Milvus Database |
| Output Search Fields | Comma separated list of additional fields to return from a search against the Milvus database. Milvus will return the score and id fields by default. |
| Partition | Partition of the vector database that you want to perform operations in. If the database has only one partition leave empty. |
| Record Reader | The Record Reader to use for reading the FlowFile |
| Record Writer | The Record Writer to use for writing the results |
| Reranking Smoothing Parameter | Smoothing Parameter of the Reciprocal Rank Fusion (RRFRanker) during Hybrid Search |
| Results Record Path | Specifies where in the record to place the results. |
| Sparse Vector Field Name | The name of the field to use for storing the sparse vectors. |
| Sparse Vector Indices Path | If, Sparse Vectors are to be provided, this RecordPath points to the indices of the sparse data to use. |
| Sparse Vector Values Path | If, Sparse Vectors are to be provided, this RecordPath points to the values of the sparse data to use. |
| Vector Field Name | The name of the field in Milvus to use for storing the vectors. |
| Vector Record Path | The path to the vector field in the record |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Milvus, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Milvus, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Milvus are routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.milvus.UpsertMilvus](upsertmilvus.md)

---
title: QueryPinecone 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querypinecone.md
section: Loading & Unloading Data
---

# QueryPinecone 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-pinecone-nar

## Description

Queries Pinecone for vectors that are similar to the input vector, or retrieves a vector by ID.

## Tags

chatbot, gen ai, generative ai, llm, openflow, pinecone, query, similarity, vector

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ID Record Path | The path to the ID field in the record |
| Include Metadata | Specifies whether to include metadata in the results |
| Include Vectors | Specifies whether to include vectors in the results |
| Number of Results | The number of results to return (i.e., Top K) |
| Pinecone API Key | The API key for the Pinecone service |
| Pinecone Index | The name of the Pinecone index to use |
| Pinecone Namespace | The name of the Pinecone namespace to use |
| Query Filter | A JSON representation of the query filter to use |
| Query Strategy | The strategy to use for querying Pinecone |
| Record Reader | The Record Reader to use for reading the FlowFile |
| Record Writer | The Record Writer to use for writing the results |
| Results Record Path | Specifies where in the record to place the results. |
| Sparse Dense Vector Weighting | Ranges from 0.0 to 1.0. Weight to apply on dense and sparse vectors when doing an hybrid search. (1 - weight) will be applied to the values of the sparse vector and (weight) will be applied to the dense vector. |
| Sparse Vector Indices Path | If, Sparse Vectors are to be provided, this RecordPath points to the indices of the sparse data to use. |
| Sparse Vector Values Path | If, Sparse Vectors are to be provided, this RecordPath points to the values of the sparse data to use. |
| Vector Record Path | The path to the vector field in the record |
| Web Client Service | The Web Client Service to use for communicating with Pinecone |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Pinecone, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Pinecone, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Pinecone are routed to this relationship |

## Use Cases Involving Other Components

|  |
| --- |
| Query Pinecone for vectors that are similar to some input text |

---
title: QueryRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/queryrecord.md
section: Loading & Unloading Data
---

# QueryRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Evaluates one or more SQL queries against the contents of a FlowFile. The result of the SQL query then becomes the content of the output FlowFile. This can be used, for example, for field-specific filtering, transformation, and row-level filtering. Columns can be renamed, simple calculations and aggregations performed, etc. The Processor is configured with a Record Reader Controller Service and a Record Writer service so as to allow flexibility in incoming and outgoing data formats. The Processor must be configured with at least one user-defined property. The name of the Property is the Relationship to route data to, and the value of the Property is a SQL SELECT statement that is used to specify how input data should be transformed/filtered. The SQL statement must be valid ANSI SQL and is powered by Apache Calcite. If the transformation fails, the original FlowFile is routed to the ‘failure’ relationship. Otherwise, the data selected will be routed to the associated relationship. If the Record Writer chooses to inherit the schema from the Record, it is important to note that the schema that is inherited will be from the ResultSet, rather than the input Record. This allows a single instance of the QueryRecord processor to have multiple queries, each of which returns a different set of columns and aggregations. As a result, though, the schema that is derived will have no schema name, so it is important that the configured Record Writer not attempt to write the Schema Name as an attribute if inheriting the Schema from the Record. See the Processor Usage documentation for more information.

## Tags

aggregate, avro, calcite, csv, etl, filter, json, logs, modify, query, record, route, select, sql, text, transform, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Default Decimal Precision | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘precision’ denoting number of available digits is required. Generally, precision is defined by column data type definition or database engines default. However undefined precision (0) can be returned from some database engines. ‘Default Decimal Precision’ is used when writing those undefined precision numbers. |
| Default Decimal Scale | When a DECIMAL/NUMBER value is written as a ‘decimal’ Avro logical type, a specific ‘scale’ denoting number of available decimal digits is required. Generally, scale is defined by column data type definition or database engines default. However when undefined precision (0) is returned, scale can also be uncertain with some database engines. ‘Default Decimal Scale’ is used when writing those undefined numbers. If a value has more decimals than specified scale, then the value will be rounded-up, e.g. 1.53 becomes 2 with scale 0, and 1.5 with scale 1. |
| include-zero-record-flowfiles | When running the SQL statement against an incoming FlowFile, if the result has no data, this property specifies whether or not a FlowFile will be sent to the corresponding relationship |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |
| record-writer | Specifies the Controller Service to use for writing results to a FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the SQL statement contains columns not present in input data), the original FlowFile it will be routed to this relationship |
| original | The original FlowFile is routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records selected by the query |
| QueryRecord.Route | The relation to which the FlowFile was routed |

## Use cases

|  |
| --- |
| Filter out records based on the values of the records’ fields |
| Keep only specific records |
| Keep only specific fields in a a Record, where the names of the fields to keep are known |
| Route record-oriented data for processing based on its contents |

---
title: QuerySalesforceObject 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querysalesforceobject.md
section: Loading & Unloading Data
---

# QuerySalesforceObject 2025.10.9.21

## Bundle

org.apache.nifi | nifi-salesforce-nar

## Description

Retrieves records from a Salesforce sObject. Users can add arbitrary filter conditions by setting the ‘Custom WHERE Condition’ property. The processor can also run a custom query, although record processing is not supported in that case. Supports incremental retrieval: users can define a field in the ‘Age Field’ property that will be used to determine when the record was created. When this property is set the processor will retrieve new records. Incremental loading and record-based processing are only supported in property-based queries. It ‘s also possible to define an initial cutoff value for the age, filtering out all older records even for the first run. In case of’Property Based Query ‘this processor should run on the Primary Node only. FlowFile attribute’ record.count ‘indicates how many records were retrieved and written to the output. The processor can accept an optional input FlowFile and reference the FlowFile attributes in the query. When’Include Deleted Records ‘is true, the processor will include deleted records (soft-deletes) in the results by using the’ queryAll ‘API. The’IsDeleted’ field will be automatically included in the results when querying deleted records.

## Tags

query, salesforce, sobject, soql

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| age-delay | The ending timestamp of the time window will be adjusted earlier by the amount configured in this property. For example, with a property value of 10 seconds, an ending timestamp of 12:30:45 would be changed to 12:30:35. |
| age-field | The name of a TIMESTAMP field that will be used to filter records using a bounded time window. The processor will return only those records with a timestamp value newer than the timestamp recorded after the last processor run. |
| create-zero-record-files | Specifies whether or not to create a FlowFile when the Salesforce REST API does not return any records |
| custom-soql-query | Specify the SOQL query to run. |
| custom-where-condition | A custom expression to be added in the WHERE clause of the query |
| field-names | Comma-separated list of field names requested from the sObject to be queried. When this field is left empty, all fields are queried. |
| include-deleted-records | If true, the processor will include deleted records (IsDeleted = true) in the query results. When enabled, the processor will use the ‘queryAll’ API. |
| initial-age-filter | This property specifies the start time that the processor applies when running the first query. |
| oauth2-access-token-provider | Service providing OAuth2 Access Tokens for authenticating using the HTTP Authorization Header |
| query-type | Choose to provide the query by parameters or a full custom query. |
| read-timeout | Maximum time allowed for reading a response from the Salesforce REST API |
| record-writer | Service used for writing records returned from the Salesforce REST API |
| salesforce-api-version | The version number of the Salesforce REST API appended to the URL after the services/data path. See Salesforce documentation for supported versions |
| salesforce-url | The URL of the Salesforce instance including the domain without additional path information, such as <https://MyDomainName.my.salesforce.com> |
| sobject-name | The Salesforce sObject to be queried |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | When ‘Age Field’ is set, after performing a query the time of execution is stored. Subsequent queries will be augmented with an additional condition so that only records that are newer than the stored execution time (adjusted with the optional value of ‘Age Delay’) will be retrieved. State is stored across the cluster so that this Processor can be run on Primary Node only and if a new Primary Node is selected, the new node can pick up where the previous node left off, without duplicating the data. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The input flowfile gets sent to this relationship when the query fails. |
| original | The input flowfile gets sent to this relationship when the query succeeds. |
| success | For FlowFiles created as a result of a successful query. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer. |
| record.count | Sets the number of records in the FlowFile. |
| total.record.count | Sets the total number of records in the FlowFile. |

## See also

* [org.apache.nifi.processors.salesforce.PutSalesforceObject](putsalesforceobject.md)

---
title: QuerySplunkIndexingStatus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/querysplunkindexingstatus.md
section: Loading & Unloading Data
---

# QuerySplunkIndexingStatus 2025.10.9.21

## Bundle

org.apache.nifi | nifi-splunk-nar

## Description

Queries Splunk server in order to acquire the status of indexing acknowledgement.

## Tags

acknowledgement, http, logs, splunk

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Hostname | The ip address or hostname of the Splunk server. |
| Owner | The owner to pass to Splunk. |
| Password | The password to authenticate to Splunk. |
| Port | The HTTP Event Collector HTTP Port Number. |
| Scheme | The scheme for connecting to Splunk. |
| Security Protocol | The security protocol to use for communicating with Splunk. |
| Token | HTTP Event Collector token starting with the string Splunk. For example ‘Splunk 1234578-abcd-1234-abcd-1234abcd’ |
| Username | The username to authenticate to Splunk. |
| max-query-size | The maximum number of acknowledgement identifiers the outgoing query contains in one batch. It is recommended not to set it too low in order to reduce network communication. |
| request-channel | Identifier of the used request channel. |
| ttl | The maximum time the processor tries to acquire acknowledgement confirmation for an index, from the point of registration. After the given amount of time, the processor considers the index as not acknowledged and transfers the FlowFile to the “unacknowledged” relationship. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is transferred to this relationship when the acknowledgement was not successful due to errors during the communication. FlowFiles are timing out or unknown by the Splunk server will transferred to “undetermined” relationship. |
| success | A FlowFile is transferred to this relationship when the acknowledgement was successful. |
| unacknowledged | A FlowFile is transferred to this relationship when the acknowledgement was not successful. This can happen when the acknowledgement did not happened within the time period set for Maximum Waiting Time. FlowFiles with acknowledgement id unknown for the Splunk server will be transferred to this relationship after the Maximum Waiting Time is reached. |
| undetermined | A FlowFile is transferred to this relationship when the acknowledgement state is not determined. FlowFiles transferred to this relationship might be penalized. This happens when Splunk returns with HTTP 200 but with false response for the acknowledgement id in the flow file attribute. |

## See also

* [org.apache.nifi.processors.splunk.PutSplunkHTTP](putsplunkhttp.md)

---
title: ReaderLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/readerlookup.md
section: Loading & Unloading Data
---

# ReaderLookup

## Description

Provides a RecordReaderFactory that can be used to dynamically select another RecordReaderFactory. This will allow multiple RecordReaderFactories to be defined and registered, and then selected dynamically at runtime by referencing a FlowFile attribute in the Service to Use property.

## Tags

lookup, parse, reader, record, row

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Service to Use \* | Service to Use | ${recordreader.name} |  | Specifies the name of the user-defined property whose associated Controller Service should be used. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RecordSetWriterLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/recordsetwriterlookup.md
section: Loading & Unloading Data
---

# RecordSetWriterLookup

## Description

Provides a RecordSetWriterFactory that can be used to dynamically select another RecordSetWriterFactory. This will allow multiple RecordSetWriterFactory’s to be defined and registered, and then selected dynamically at runtime by tagging FlowFiles with the attributes and referencing those attributes in the Service to Use property.

## Tags

lookup, record, recordset, result, row, serializer, set, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Service to Use \* | Service to Use | ${recordsetwriter.name} |  | Specifies the name of the user-defined property whose associated Controller Service should be used. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RecordSinkServiceLookup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/recordsinkservicelookup.md
section: Loading & Unloading Data
---

# RecordSinkServiceLookup

## Description

Provides a RecordSinkService that can be used to dynamically select another RecordSinkService. This service requires an attribute named ‘record.sink.name’ to be passed in when asking for a connection, and will throw an exception if the attribute is missing. The value of ‘record.sink.name’ will be used to select the RecordSinkService that has been registered with that name. This will allow multiple RecordSinkServices to be defined and registered, and then selected dynamically at runtime by tagging flow files with the appropriate ‘record.sink.name’ attribute. Note that this controller service is not intended for use in reporting tasks that employ RecordSinkService instances, such as QueryNiFiReportingTask.

## Tags

lookup, record, sink

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RedisConnectionPoolService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/redisconnectionpoolservice.md
section: Loading & Unloading Data
---

# RedisConnectionPoolService

## Description

A service that provides connections to Redis.

## Tags

cache, redis

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Cluster Max Redirects \* | Cluster Max Redirects | 5 |  | The maximum number of redirects that can be performed when clustered. |
| Communication Timeout \* | Communication Timeout | 10 seconds |  | The timeout to use when attempting to communicate with Redis. |
| Connection String \* | Connection String |  |  | The connection string for Redis. In a standalone instance this value will be of the form hostname:port. In a sentinel instance this value will be the comma-separated list of sentinels, such as host1:port1,host2:port2,host3:port3. In a clustered instance this value will be the comma-separated list of cluster masters, such as host1:port,host2:port,host3:port. |
| Database Index \* | Database Index | 0 |  | The database index to be used by connections created from this connection pool. See the databases property in redis.conf, by default databases 0-15 will be available. |
| Password | Password |  |  | The password used to authenticate to the Redis server. See the ‘requirepass’ property in redis.conf. |
| Pool - Block When Exhausted \* | Pool - Block When Exhausted | true | * true * false | Whether or not clients should block and wait when trying to obtain a connection from the pool when the pool has no available connections. Setting this to false means an error will occur immediately when a client requests a connection and none are available. |
| Pool - Max Idle \* | Pool - Max Idle | 8 |  | The maximum number of idle connections that can be held in the pool, or a negative value if there is no limit. |
| Pool - Max Total \* | Pool - Max Total | 8 |  | The maximum number of connections that can be allocated by the pool (checked out to clients, or idle awaiting checkout). A negative value indicates that there is no limit. |
| Pool - Max Wait Time \* | Pool - Max Wait Time | 10 seconds |  | The amount of time to wait for an available connection when Block When Exhausted is set to true. |
| Pool - Min Evictable Idle Time \* | Pool - Min Evictable Idle Time | 60 seconds |  | The minimum amount of time an object may sit idle in the pool before it is eligible for eviction. |
| Pool - Min Idle \* | Pool - Min Idle | 0 |  | The target for the minimum number of idle connections to maintain in the pool. If the configured value of Min Idle is greater than the configured value for Max Idle, then the value of Max Idle will be used instead. |
| Pool - Num Tests Per Eviction Run \* | Pool - Num Tests Per Eviction Run | -1 |  | The number of connections to tests per eviction attempt. A negative value indicates to test all connections. |
| Pool - Test On Borrow \* | Pool - Test On Borrow | false | * true * false | Whether or not connections should be tested upon borrowing from the pool. |
| Pool - Test On Create \* | Pool - Test On Create | false | * true * false | Whether or not connections should be tested upon creation. |
| Pool - Test On Return \* | Pool - Test On Return | false | * true * false | Whether or not connections should be tested upon returning to the pool. |
| Pool - Test While Idle \* | Pool - Test While Idle | true | * true * false | Whether or not connections should be tested while idle. |
| Pool - Time Between Eviction Runs \* | Pool - Time Between Eviction Runs | 30 seconds |  | The amount of time between attempting to evict idle connections from the pool. |
| Redis Mode \* | Redis Mode | Standalone | * Standalone * Sentinel * Cluster | The type of Redis being communicated with - standalone, sentinel, or clustered. |
| SSL Context Service | SSL Context Service |  |  | If specified, this service will be used to create an SSL Context that will be used to secure communications; if not specified, communications will not be secure |
| Sentinel Master | Sentinel Master |  |  | The name of the sentinel master, require when Mode is set to Sentinel |
| Sentinel Password | Sentinel Password |  |  | The password used to authenticate to the Redis Sentinel server. See the ‘requirepass’ and ‘sentinel sentinel-pass’ properties in sentinel.conf. |
| Sentinel Username | Sentinel Username |  |  | The username used to authenticate to the Redis sentinel server. |
| Username | Username |  |  | The username used to authenticate to the Redis server. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RedisDistributedMapCacheClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/redisdistributedmapcacheclientservice.md
section: Loading & Unloading Data
---

# RedisDistributedMapCacheClientService

## Description

An implementation of DistributedMapCacheClient that uses Redis as the backing cache. This service relies on the WATCH, MULTI, and EXEC commands in Redis, which are not fully supported when Redis is clustered. As a result, this service can only be used with a Redis Connection Pool that is configured for standalone or sentinel mode. Sentinel mode can be used to provide high-availability configurations.

## Tags

cache, distributed, map, redis

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| TTL \* | redis-cache-ttl | 0 secs |  | Indicates how long the data should exist in Redis. Setting ‘0 secs’ would mean the data would exist forever |
| Redis Connection Pool \* | redis-connection-pool |  |  |  |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RemoveFieldRecordReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/removefieldrecordreader.md
section: Loading & Unloading Data
---

# RemoveFieldRecordReader

## Description

A wrapper for a RecordReaderFactory that supports filtering out specified fields from NiFi Records. It allows users to specify a list of field names that should be ignored when reading records from the record reader returned from the wrapped RecordReaderFactory. The ignored record fields are specified as dynamic properties. At least one dynamic property must be set. The dynamic property name is used as a description of the field to remove, and the dynamic property value is a RecordPath that identifies the field to be removed. Nested paths are supported. Record paths targeting the root path (“/”) are not allowed and will result in a validation error. This service should be used when all of the following criteria are met: - your delegate RecordReaderFactory is configured to infer the schema from the data - you do not have or do not want to define a static schema for the data you ‘re reading - the fields you set to be ignored should not be serialized to the NiFi content repository for security or performance reasons If any of the above criteria are not met, consider using the RecordFieldRemover processor instead. NOTE: The RecordReader returned by this implementation is hardcoded to drop unknown fields rather than ignoring them. Even when the RecordReader’s nextRecord(coerceTypes, dropUnknownFields) method is called with dropUnknownFields set to false, the RecordReader will still drop unknown fields.

## Tags

delete, field, filter, reader, record, remove

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Record Reader \* | Record Reader |  |  | The underlying RecordReaderFactory service that will be used to read records before filtering is applied. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RemoveRecordField 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/removerecordfield.md
section: Loading & Unloading Data
---

# RemoveRecordField 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Modifies the contents of a FlowFile that contains Record-oriented data (i.e. data that can be read via a RecordReader and written by a RecordWriter) by removing selected fields. This Processor requires that at least one user-defined Property be added. The name of the property is ignored by the processor, but could be a meaningful identifier for the user. The value of the property should indicate a RecordPath that determines the field to be removed. The processor executes the removal in the order in which these properties are added to the processor. Set the “Record Writer” to “Inherit Record Schema” in order to use the updated Record Schema modified when removing Fields.

## Tags

avro, csv, delete, freeform, generic, json, record, remove, schema, text, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Record Writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| success | FlowFiles that are successfully transformed will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## Use cases

|  |
| --- |
| Remove one or more fields from a Record, where the names of the fields to remove are known. |

## See also

* [org.apache.nifi.processors.standard.UpdateRecord](updaterecord.md)

---
title: RenameRecordField 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/renamerecordfield.md
section: Loading & Unloading Data
---

# RenameRecordField 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Renames one or more fields in each Record of a FlowFile. This Processor requires that at least one user-defined Property be added. The name of the Property should indicate a RecordPath that determines the field that should be updated. The value of the Property is the new name to assign to the Record Field that matches the RecordPath. The property value may use Expression Language to reference FlowFile attributes as well as the variables `field.name`, `field.value`, `field.type`, and `record.index`

## Tags

avro, csv, field, generic, json, log, logs, record, rename, schema, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Record Writer | Specifies the Controller Service to use for writing out the records |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| success | FlowFiles that are successfully transformed will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.index | This attribute provides the current row index and is only available inside the literal value expression. |

## Use cases

|  |
| --- |
| Rename a field in each Record to a specific, known name. |
| Rename a field in each Record to a name that is derived from a FlowFile attribute. |
| Rename a field in each Record to a new name that is derived from the current field name. |

## See also

* [org.apache.nifi.processors.standard.RemoveRecordField](removerecordfield.md)
* [org.apache.nifi.processors.standard.UpdateRecord](updaterecord.md)

---
title: ReplaceText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/replacetext.md
section: Loading & Unloading Data
---

# ReplaceText 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Updates the content of a FlowFile by searching for some textual value in the FlowFile content (via Regular Expression/regex, or literal value) and replacing the section of the content that matches with some alternate value. It can also be used to append or prepend text to the contents of a FlowFile.

## Tags

Change, Modify, Regex, Regular Expression, Replace, Text, Update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the file is encoded |
| Evaluation Mode | Run the ‘Replacement Strategy’ against each line separately (Line-by-Line) or buffer the entire file into memory (Entire Text) and run against that. |
| Line-by-Line Evaluation Mode | Run the ‘Replacement Strategy’ against each line separately (Line-by-Line) for all lines in the FlowFile, First Line (Header) alone, Last Line (Footer) alone, Except the First Line (Header) or Except the Last Line (Footer). |
| Maximum Buffer Size | Specifies the maximum amount of data to buffer (per file or per line, depending on the Evaluation Mode) in order to apply the replacement. If ‘Entire Text’ (in Evaluation Mode) is selected and the FlowFile is larger than this value, the FlowFile will be routed to ‘failure’. In ‘Line-by-Line’ Mode, if a single line is larger than this value, the FlowFile will be routed to ‘failure’. A default value of 1 MB is provided, primarily for ‘Entire Text’ mode. In ‘Line-by-Line’ Mode, a value such as 8 KB or 16 KB is suggested. This value is ignored if the <Replacement Strategy> property is set to one of: Append, Prepend, Always Replace |
| Regular Expression | The Search Value to search for in the FlowFile content. Only used for ‘Literal Replace’ and ‘Regex Replace’ matching strategies |
| Replacement Strategy | The strategy for how and what to replace within the FlowFile’s text content. |
| Replacement Value | The value to insert using the ‘Replacement Strategy’. Using “Regex Replace” back-references to Regular Expression capturing groups are supported, but back-references that reference capturing groups that do not exist in the regular expression will be treated as literal value. Back References may also be referenced using the Expression Language, as ‘$1’, ‘$2’, etc. The single-tick marks MUST be included, as these variables are not “Standard” attribute names (attribute names must be quoted unless they contain only numbers, letters, and _). |
| Text to Append | The text to append to the end of the FlowFile, or each line, depending on the configured value of the Evaluation Mode property |
| Text to Prepend | The text to prepend to the start of the FlowFile, or each line, depending on the configured value of the Evaluation Mode property |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that could not be updated are routed to this relationship |
| success | FlowFiles that have been successfully processed are routed to this relationship. This includes both FlowFiles that had text replaced and those that did not. |

## Use cases

|  |
| --- |
| Append text to the end of every line in a FlowFile |
| Prepend text to the beginning of every line in a FlowFile |
| Replace every occurrence of a literal string in the FlowFile with a different value |
| Transform every occurrence of a literal string in a FlowFile |
| Completely replace the contents of a FlowFile to a specific text |

---
title: ReplaceTextWithMapping 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/replacetextwithmapping.md
section: Loading & Unloading Data
---

# ReplaceTextWithMapping 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Updates the content of a FlowFile by evaluating a Regular Expression against it and replacing the section of the content that matches the Regular Expression with some alternate value provided in a mapping file.

## Tags

Change, Mapping, Modify, Regex, Regular Expression, Replace, Text, Update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the file is encoded |
| Mapping File | The name of the file (including the full path) containing the Mappings. |
| Mapping File Refresh Interval | The polling interval to check for updates to the mapping file. The default is 60s. |
| Matching Group | The number of the matching group of the provided regex to replace with the corresponding value from the mapping file (if it exists). |
| Maximum Buffer Size | Specifies the maximum amount of data to buffer (per file) in order to apply the regular expressions. If a FlowFile is larger than this value, the FlowFile will be routed to ‘failure’ |
| Regular Expression | The Regular Expression to search for in the FlowFile content |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that could not be updated are routed to this relationship |
| success | FlowFiles that have been successfully updated are routed to this relationship, as well as FlowFiles whose content does not match the given Regular Expression |

---
title: RestLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/restlookupservice.md
section: Loading & Unloading Data
---

# RestLookupService

## Description

Use a REST service to look up values.

## Tags

http, json, lookup, rest, xml

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. In case of SOCKS, it is not guaranteed that the selected SOCKS Version will be used by the processor. |
| Authentication Strategy \* | rest-lookup-authentication-strategy | NONE | * None * Basic * OAuth2 | Authentication strategy to use with REST service. |
| Basic Authentication Password | rest-lookup-basic-auth-password |  |  | The password to be used by the client to authenticate against the Remote URL. |
| Basic Authentication Username | rest-lookup-basic-auth-username |  |  | The username to be used by the client to authenticate against the Remote URL. Cannot include control characters (0-31), ‘:’, or DEL (127). |
| Connection Timeout \* | rest-lookup-connection-timeout | 5 secs |  | Max wait time for connection to remote service. |
| Use Digest Authentication | rest-lookup-digest-auth | false | * true * false | Whether to communicate with the website using Digest Authentication. ‘Basic Authentication Username’ and ‘Basic Authentication Password’ are used for authentication. |
| OAuth2 Access Token Provider \* | rest-lookup-oauth2-access-token-provider |  |  | Enables managed retrieval of OAuth2 Bearer Token applied to HTTP requests using the Authorization Header. |
| Read Timeout \* | rest-lookup-read-timeout | 15 secs |  | Max wait time for response from remote service. |
| Record Path | rest-lookup-record-path |  |  | An optional record path that can be used to define where in a record to get the real data to merge into the record set to be enriched. See documentation for examples of when this might be useful. |
| Record Reader \* | rest-lookup-record-reader |  |  | The record reader to use for loading the payload and handling it as a record set. |
| Response Handling Strategy \* | rest-lookup-response-handling-strategy | RETURNED | * Returned * Evaluated | Whether to return all responses or throw errors for unsuccessful HTTP status codes. |
| SSL Context Service | rest-lookup-ssl-context-service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| URL \* | rest-lookup-url |  |  | The URL for the REST endpoint. Expression language is evaluated against the lookup key/value pairs, not flowfile attributes. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: RetryFlowFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/retryflowfile.md
section: Loading & Unloading Data
---

# RetryFlowFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

FlowFiles passed to this Processor have a ‘Retry Attribute’ value checked against a configured ‘Maximum Retries’ value. If the current attribute value is below the configured maximum, the FlowFile is passed to a retry relationship. The FlowFile may or may not be penalized in that condition. If the FlowFile ‘s attribute value exceeds the configured maximum, the FlowFile will be passed to a’ retries_exceeded ‘relationship. WARNING: If the incoming FlowFile has a non-numeric value in the configured’Retry Attribute ‘attribute, it will be reset to’1 ‘. You may choose to fail the FlowFile instead of performing the reset. Additional dynamic properties can be defined for any attributes you wish to add to the FlowFiles transferred to’ retries_exceeded’. These attributes support attribute expression language.

## Tags

FlowFile, Retry

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Fail on Non-numerical Overwrite | If the FlowFile already has the attribute defined in ‘Retry Attribute’ that is \*not\* a number, fail the FlowFile instead of resetting that value to ‘1’ |
| maximum-retries | The maximum number of times a FlowFile can be retried before being passed to the ‘retries_exceeded’ relationship |
| penalize-retries | If set to ‘true’, this Processor will penalize input FlowFiles before passing them to the ‘retry’ relationship. This does not apply to the ‘retries_exceeded’ relationship. |
| retry-attribute | The name of the attribute that contains the current retry count for the FlowFile. WARNING: If the name matches an attribute already on the FlowFile that does not contain a numerical value, the processor will either overwrite that attribute with ‘1’ or fail based on configuration. |
| reuse-mode | Defines how the Processor behaves if the retry FlowFile has a different retry UUID than the instance that received the FlowFile. This generally means that the attribute was not reset after being successfully retried by a previous instance of this processor. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The processor is configured such that a non-numerical value on ‘Retry Attribute’ results in a failure instead of resetting that value to ‘1’. This will immediately terminate the limited feedback loop. Might also include when ‘Maximum Retries’ contains attribute expression language that does not resolve to an Integer. |
| retries_exceeded | Input FlowFile has exceeded the configured maximum retry count, do not pass this relationship back to the input Processor to terminate the limited feedback loop. |
| retry | Input FlowFile has not exceeded the configured maximum retry count, pass this relationship back to the input Processor to create a limited feedback loop. |

## Writes attributes

| Name | Description |
| --- | --- |
| Retry Attribute | User defined retry attribute is updated with the current retry count |
| Retry Attribute .uuid | User defined retry attribute with .uuid that determines what processor retried the FlowFile last |

---
title: RouteOnAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/routeonattribute.md
section: Loading & Unloading Data
---

# RouteOnAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Routes FlowFiles based on their Attributes using the Attribute Expression Language

## Tags

Attribute Expression Language, Expression Language, Regular Expression, attributes, detect, filter, find, regex, regexp, routing, search, string, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Routing Strategy | Specifies how to determine which relationship to use when evaluating the Expression Language |

## Relationships

| Name | Description |
| --- | --- |
| unmatched | FlowFiles that do not match any user-define expression will be routed here |

## Writes attributes

| Name | Description |
| --- | --- |
| RouteOnAttribute.Route | The relation to which the FlowFile was routed |

## Use cases

|  |
| --- |
| Route data to one or more relationships based on its attributes using the NiFi Expression Language. |
| Keep data only if its attributes meet some criteria, such as its filename ends with .txt. |
| Discard or drop a file based on attributes, such as filename. |

## Use Cases Involving Other Components

|  |
| --- |
| Route record-oriented data based on whether or not the record’s values meet some criteria |

---
title: RouteOnContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/routeoncontent.md
section: Loading & Unloading Data
---

# RouteOnContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Applies Regular Expressions to the content of a FlowFile and routes a copy of the FlowFile to each destination whose Regular Expression matches. Regular Expressions are added as User-Defined Properties where the name of the property is the name of the relationship and the value is a Regular Expression to match against the FlowFile content. User-Defined properties do support the Attribute Expression Language, but the results are interpreted as literal values, not Regular Expressions

## Tags

content, detect, filter, find, regex, regexp, regular expression, route, search, string, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the file is encoded |
| Content Buffer Size | Specifies the maximum amount of data to buffer in order to apply the regular expressions. If the size of the FlowFile exceeds this value, any amount of this value will be ignored |
| Match Requirement | Specifies whether the entire content of the file must match the regular expression exactly, or if any part of the file (up to Content Buffer Size) can contain the regular expression in order to be considered a match |

## Relationships

| Name | Description |
| --- | --- |
| unmatched | FlowFiles that do not match any of the user-supplied regular expressions will be routed to this relationship |

---
title: RouteText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/routetext.md
section: Loading & Unloading Data
---

# RouteText 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Routes textual data based on a set of user-defined rules. Each line in an incoming FlowFile is compared against the values specified by user-defined Properties. The mechanism by which the text is compared to these user-defined properties is defined by the ‘Matching Strategy’. The data is then routed according to these rules, routing each line of the text individually.

## Tags

Expression Language, Regular Expression, attributes, csv, delimited, detect, filter, find, logs, regex, regexp, routing, search, string, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Character Set | The Character Set in which the incoming text is encoded |
| Grouping Regular Expression | Specifies a Regular Expression to evaluate against each line to determine which Group the line should be placed in. The Regular Expression must have at least one Capturing Group that defines the line’s Group. If multiple Capturing Groups exist in the Regular Expression, the values from all Capturing Groups will be concatenated together. Two lines will not be placed into the same FlowFile unless they both have the same value for the Group (or neither line matches the Regular Expression). For example, to group together all lines in a CSV File by the first column, we can set this value to “(.\*?),.\*”. Two lines that have the same Group but different Relationships will never be placed into the same FlowFile. |
| Ignore Case | If true, capitalization will not be taken into account when comparing values. E.g., matching against ‘HELLO’ or ‘hello’ will have the same result. This property is ignored if the ‘Matching Strategy’ is set to ‘Satisfies Expression’. |
| Ignore Leading/Trailing Whitespace | Indicates whether or not the whitespace at the beginning and end of the lines should be ignored when evaluating the line. |
| Matching Strategy | Specifies how to evaluate each line of incoming text against the user-defined properties. |
| Routing Strategy | Specifies how to determine which Relationship(s) to use when evaluating the lines of incoming text against the ‘Matching Strategy’ and user-defined properties. |

## Relationships

| Name | Description |
| --- | --- |
| original | The original input file will be routed to this destination when the lines have been successfully routed to 1 or more relationships |
| unmatched | Data that does not satisfy the required user-defined rules will be routed to this Relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| RouteText.Route | The name of the relationship to which the FlowFile was routed. |
| RouteText.Group | The value captured by all capturing groups in the ‘Grouping Regular Expression’ property. If this property is not set or contains no capturing groups, this attribute will not be added. |

## Use cases

|  |
| --- |
| Drop blank or empty lines from the FlowFile’s content. |
| Remove specific lines of text from a file, such as those containing a specific word or having a line length over some threshold. |

---
title: RunDatabricksJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/rundatabricksjob.md
section: Loading & Unloading Data
---

# RunDatabricksJob 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-databricks-processors-nar

## Description

Triggers a pre-defined Databricks job to run with custom parameters. Job parameters can be set using dynamic properties

## Tags

databricks, jobs, openflow

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Databricks Client | Databricks Client Service. |
| Job ID | Databricks Job ID |
| Job Name | Databricks Job Name |
| Wait for Job Completion | Wait for the Databricks job to complete before transferring the FlowFile to success |

## Relationships

| Name | Description |
| --- | --- |
| failure | Databricks failure relationship |
| success | Databricks success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| job.run.id | The run id assigned to the invoked job |
| job.result.state | The result state for the invoked job |
| error.code | The error code for the SQL statement if an error occurred. |
| error.message | The error message for the SQL statement if an error occurred. |

---
title: RunMongoAggregation 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/runmongoaggregation.md
section: Loading & Unloading Data
---

# RunMongoAggregation 2025.10.9.21

## Bundle

org.apache.nifi | nifi-mongodb-nar

## Description

A processor that runs an aggregation query whenever a flowfile is received.

## Tags

aggregate, aggregation, mongo

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Batch Size | The number of elements returned from the server in one batch. |
| Mongo Collection Name | The name of the collection to use |
| Mongo Database Name | The name of the database to use |
| allow-disk-use | Set this to true to enable writing data to temporary files to prevent exceeding the maximum memory use limit during aggregation pipeline staged when handling large datasets. |
| json-type | By default, MongoDB’s Java driver returns “extended JSON”. Some of the features of this variant of JSON may cause problems for other JSON parsers that expect only standard JSON types and conventions. This configuration setting controls whether to use extended JSON or provide a clean view that conforms to standard JSON. |
| mongo-agg-query | The aggregation query to be executed. |
| mongo-charset | Specifies the character set of the document data. |
| mongo-client-service | If configured, this property will use the assigned client service for connection pooling. |
| mongo-date-format | The date format string to use for formatting Date fields that are returned from Mongo. It is only applied when the JSON output format is set to Standard JSON. |
| mongo-query-attribute | If set, the query will be written to a specified attribute on the output flowfiles. |
| results-per-flowfile | How many results to put into a flowfile at once. The whole body will be treated as a JSON array of results. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The input flowfile gets sent to this relationship when the query fails. |
| original | The input flowfile gets sent to this relationship when the query succeeds. |
| results | The result set of the aggregation will be sent to this relationship. |

---
title: S3FileResourceService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/s3fileresourceservice.md
section: Loading & Unloading Data
---

# S3FileResourceService

## Description

Provides an Amazon Web Services (AWS) S3 file resource for other components.

## Tags

AWS, Amazon, S3, file, resource

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| AWS Credentials Provider service \* | AWS Credentials Provider service |  |  | The Controller Service that is used to obtain AWS credentials provider |
| Bucket \* | Bucket | ${s3.bucket} |  | The S3 Bucket to interact with |
| Object Key \* | Object Key | ${filename} |  | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Region \* | Region | us-west-2 | * AWS GovCloud (US) * AWS GovCloud (US-East) * US East (N. Virginia) * US East (Ohio) * US West (N. California) * US West (Oregon) * EU (Ireland) * EU (London) * EU (Paris) * EU (Frankfurt) * EU (Zurich) * EU (Stockholm) * EU (Milan) * EU (Spain) * Asia Pacific (Hong Kong) * Asia Pacific (Taipei) * Asia Pacific (Mumbai) * Asia Pacific (Hyderabad) * Asia Pacific (Singapore) * Asia Pacific (Sydney) * Asia Pacific (Jakarta) * Asia Pacific (Melbourne) * Asia Pacific (Malaysia) * Asia Pacific (Thailand) * Asia Pacific (Tokyo) * Asia Pacific (Seoul) * Asia Pacific (Osaka) * South America (Sao Paulo) * China (Beijing) * China (Ningxia) * Canada (Central) * Canada West (Calgary) * Middle East (UAE) * Middle East (Bahrain) * Africa (Cape Town) * US ISO East * US ISOB East (Ohio) * US ISO West * US ISOF East1 (California) * US ISOF South1 (Alpine) * Israel (Tel Aviv) * Mexico (Central) * EU ISOE West * Use ‘s3.region’ Attribute | The AWS Region to connect to. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SalesforceDataCloudOAuthTokenProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/salesforcedatacloudoauthtokenprovider.md
section: Loading & Unloading Data
---

# SalesforceDataCloudOAuthTokenProvider

## Description

Retrieves an OAuth2 access token from Salesforce using the configured OAuth2 Access Token Provider and exchanges the token for a Data Cloud API token. The token is then used to authenticate with Salesforce Data Cloud APIs.

## Tags

preview, salesforce

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| OAuth2 Access Token Provider \* | OAuth2 Access Token Provider |  |  | JWT Token Provider to use in order to retrieve an access token from Salesforce that will be exchanged for a Data Cloud API token. |
| Refresh Window \* | Refresh Window | 0 s |  | The service will attempt to refresh tokens expiring within the refresh window, subtracting the configured duration from the token expiration. |
| Salesforce Instance \* | Salesforce Instance |  |  | The hostname of the Salesforce instance including the domain such as MyDomainName.my.salesforce.com |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with Salesforce |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SampleRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/samplerecord.md
section: Loading & Unloading Data
---

# SampleRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Samples the records of a FlowFile based on a specified sampling strategy (such as Reservoir Sampling). The resulting FlowFile may be of a fixed number of records (in the case of reservoir-based algorithms) or some subset of the total number of records (in the case of probabilistic sampling), or a deterministic number of records (in the case of interval sampling).

## Tags

interval, range, record, reservoir, sample

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| record-reader | Specifies the Controller Service to use for parsing incoming data and determining the data’s schema |
| record-writer | Specifies the Controller Service to use for writing results to a FlowFile |
| sample-record-interval | Specifies the number of records to skip before writing a record to the outgoing FlowFile. This property is only used if Sampling Strategy is set to Interval Sampling. A value of zero (0) will cause no records to be included in theoutgoing FlowFile, a value of one (1) will cause all records to be included, and a value of two (2) will cause half the records to be included, and so on. |
| sample-record-probability | Specifies the probability (as a percent from 0-100) of a record being included in the outgoing FlowFile. This property is only used if Sampling Strategy is set to Probabilistic Sampling. A value of zero (0) will cause no records to be included in theoutgoing FlowFile, and a value of 100 will cause all records to be included in the outgoing FlowFile.. |
| sample-record-random-seed | Specifies a particular number to use as the seed for the random number generator (used by probabilistic strategies). Setting this property will ensure the same records are selected even when using probabilistic strategies. |
| sample-record-range | Specifies the range of records to include in the sample, from 1 to the total number of records. An example is ‘3,6-8,20-’ which includes the third record, the sixth, seventh and eighth records, and all records from the twentieth record on. Commas separate intervals that don’t overlap, and an interval can be between two numbers (i.e. 6-8) or up to a given number (i.e. -5), or from a number to the number of the last record (i.e. 20-). If this property is unset, all records will be included. |
| sample-record-reservoir | Specifies the number of records to write to the outgoing FlowFile. This property is only used if Sampling Strategy is set to reservoir-based strategies such as Reservoir Sampling. |
| sample-record-sampling-strategy | Specifies which method to use for sampling records from the incoming FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, any record is not valid), the original FlowFile will be routed to this relationship |
| original | The original FlowFile is routed to this relationship if sampling is successful |
| success | The FlowFile is routed to this relationship if the sampling completed successfully |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | The MIME type indicated by the record writer |
| record.count | The number of records in the resulting flow file |

---
title: SAP® and Snowflake - Setup
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sap-sql/setup-sap.md
section: Loading & Unloading Data
---

# SAP® and Snowflake - Setup

This topic describes the prerequisite setup tasks for using SAP® Snowflake or SAP® BDC Connect for Snowflake.

* For customers without an existing Snowflake account, see SAP® Snowflake set up.
* For customers with an existing Snowflake account, see SAP® BDC Connect for Snowflake set up.

After completing either of these steps, you will need to create a catalog integration in Snowflake to share Data Products from SAP® Business Data Cloud to Snowflake. See [Share Data Products from SAP® Business Data Cloud to Snowflake](share-data-products.md) for more information.

## SAP® Snowflake set up

This section describes the steps to configure an instance for SAP® Snowflake for SAP customers without an existing Snowflake account.

> **Note:**
>
> For Restricted Release, the SAP® Snowflake account provisioned is the Business Critical edition
> and must be on AWS in a supported region as described in [regions](../../../../intro-regions.md).

As an SAP® administrator, perform the following steps:

1. Sign in to [SAP for Me](https://me.sap.com/) with an S-user ID or login name.
2. From the sidebar menu, choose Portfolio & products.
3. In the My Product Packages tab, select the SAP Business Data Cloud product.
4. Select the Applications tab and in the SAP Snowflake card, click Start Provisioning. .
   The Provision SAP® Snowflake wizard dialog displays and guides you through the provisioning process.
5. In the Provision SAP® Snowflake dialog, configure the following parameters and click Next:

   * **Entitlement System**: Displays the ID of the SAP® Business Data Cloud Entitlement set. Cannot be changed.
   * **Name**: Enter an appropriate name for the SAP solution.
   * **Path**: Select or create a resource group under which to group the solution
     components provisioned for SAP® Business Data Cloud.
     Create it in the same location selected for the SAP® Business Data Cloud cockpit system.
   * **Business Type**: Preset to Production.
6. In the Select Application step, SAP Snowflake is pre-selected. .
   The Configure Parameters step displays.
7. In the Configure Parameters step, configure the following parameters and click Next:

   * **Region**: You can provision the SAP Snowflake solution in any region.
     Snowflake recommend choosing the same region as the SAP® Business Data Cloud Cockpit for optimal performance.
   * **Admin email**: Provide the email address of the user to be defined as the administrator of your SAP Snowflake system.
     This user is responsible for adding additional users and for further configuration.
   * **Admin First Name**: The first name of the administrator of your SAP Snowflake system.
   * **Admin Last Name**: The last name of the administrator of your SAP Snowflake system.

   Provisioning begins and SAP® notifies you that a provisioning request was sent to the specified owner’s e-mail address.
8. Click View in Resources to view the tenant within the indicated resource group.
   The Resources tab shows the current solution status, which should be `Processing`.
9. Select the tenant below the new solution and click Details to view the details of the tenant.
10. On top of the details view of the tenant, choose the View Details link.

    A pop-up window opens that provides an activation link to the SAP Snowflake account.
    If you are the SAP Snowflake system owner, select this link and complete the activation flow
    in SAP Snowflake (see [Activating the SAP Snowflake Account](https://accounts.sap.com/saml2/idp/sso/accounts.sap.com)).

    If not, share the activation link with the SAP Snowflake owner and ask them to complete the activation flow.
11. After the account has been activated in SAP for Me, the status for your SAP
    Snowflake solution and tenant changes to `Ready`. In the details view of the SAP
    Snowflake tenant, in the Path field, select the URL to open SAP Snowflake and log in.
    The SAP® DBC admin may provision as many SAP® Snowflake accounts as required with unique account names to
    help distinguish them. Every SAP® Snowflake account will need to be activated per the activation flow.

### Next steps

The SAP® DBC admin may provision as many SAP® Snowflake accounts as they need with unique account names to help distinguish them.
Every SAP® Snowflake account will need to be activated as described in the note below.

After activation, the SAP® Snowflake is ready for you to share Data Products from SAP BDC to SAP® Snowflake as described in Use Cases.
As part of the provisioning process for a new SAP® Snowflake account, a catalog integration called SAP_BDC_INTEGRATION is automatically created and enrolled with SAP® Business Data Cloud in the SAP® Snowflake account.
Customers can create additional catalog integrations in the same SAP® Snowflake account
and enroll them with the same or different SAP® Business Data Cloud tenant.
Each catalog integration requires a new Invitation Link that can be obtained from SAP 4 Me.
Each catalog integration requires a new Invitation Link that can be obtained from SAP 4 Me.
Each Invitation Link can be enrolled only once with SAP® Business Data Cloud.

> **Note:**
>
> Customers can view the status of provisioning in the Details view.
> After provisioning is complete, the customer can click the Snowflake activation link available in the Details view to activate their SAP® Snowflake account, login, change their username and reset their password, setup MFA, and perform other operations.

## SAP® BDC Connect for Snowflake set up

This section describes the steps to set up an SAP® Business Data Cloud connection for use with an existing Snowflake account.

> **Note:**
>
> For Restricted Release, the Snowflake account must be Standard, Enterprise, or Business Critical edition and must be on AWS in a supported region as described in [Supported Cloud Regions](../../../../intro-regions.md).

For more information see, [Provisioning SAP Business Data Cloud Connect](https://help.sap.com/docs/business-data-cloud/administering-sap-business-data-cloud/provision-sap-business-data-cloud-connector-for-supported-external-systems).

As an SAP® administrator, perform the following steps:

1. Obtain your Snowflake account URL and ensure it follows the format below: <https:/>/<orgName>-<accountName>.snowflakecomputing.com.
   Which should be all lower-case and replace _ (underscore) with - (dash) for RFC compliance.
2. Provision SAP Business Data Cloud Connect as documented here: [Provisioning SAP Business Data Cloud Connect](https://help.sap.com/docs/business-data-cloud/administering-sap-business-data-cloud/provision-sap-business-data-cloud-connector-for-supported-external-systems).
3. Follow steps 1-5 in the wizard
4. In wizard step 6: Configure Parameters:

   * **External System Instance Identifier**: Enter your Snowflake account URL: <https:/>/<orgName>-<accountName>.snowflakecomputing.com
   * **Region**: Use the drop-down menu to choose the region of your Snowflake account. We recommend that your Snowflake account is in the same cloud and region as your SAP Business Data Cloud Core.
5. Complete wizard steps 7 and 8.
6. In step 9: Hover over the View Tenant Notifications button.
   A pop-up window opens with an Invitation Link that can be used to complete the configuration in Snowflake.
7. Copy the Invitation Link
8. Log into your Snowflake account to complete the remainder of the configuration
   to create an SAP BDC connection as described in [Share Data Products from SAP® Business Data Cloud to Snowflake](share-data-products.md).

### Next steps

In your [SAP for Me](https://me.sap.com/) environment, choose the Customer Landscape tab and, under the Formations tab, choose Include Systems to add the SAP BDC Connect instance to an existing formation.

Customers can create additional catalog integrations in the same Snowflake account and enroll them with the same or
different SAP® Business Data Cloud tenant. Each catalog integration requires a
new Invitation Link that can be obtained from [SAP for Me](https://me.sap.com/).
Each Invitation Link can be enrolled only once with SAP® Business Data Cloud.

> **Note:**
>
> To create a new formation, see [Creating SAP Business Data Cloud Formations](https://help.sap.com/docs/business-data-cloud/administering-sap-business-data-cloud/integrate-sap-business-data-cloud-provisioned-systems?locale=en-US&state=PRODUCTION&version=SHIP).

---
title: ScanAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scanattribute.md
section: Loading & Unloading Data
---

# ScanAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Scans the specified attributes of FlowFiles, checking to see if any of their values are present within the specified dictionary of terms

## Tags

attributes, find, lookup, scan, search, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Attribute Pattern | Regular Expression that specifies the names of attributes whose values will be matched against the terms in the dictionary |
| Dictionary File | A new-line-delimited text file that includes the terms that should trigger a match. Empty lines are ignored. The contents of the text file are loaded into memory when the processor is scheduled and reloaded when the contents are modified. |
| Dictionary Filter Pattern | A Regular Expression that will be applied to each line in the dictionary file. If the regular expression does not match the line, the line will not be included in the list of terms to search for. If a Matching Group is specified, only the portion of the term that matches that Matching Group will be used instead of the entire term. If not specified, all terms in the dictionary will be used and each term will consist of the text of the entire line in the file |
| Match Criteria | If set to All Must Match, then FlowFiles will be routed to ‘matched’ only if all specified attributes ‘values are found in the dictionary. If set to At Least 1 Must Match, FlowFiles will be routed to’ matched’ if any attribute specified is found in the dictionary |

## Relationships

| Name | Description |
| --- | --- |
| matched | FlowFiles whose attributes are found in the dictionary will be routed to this relationship |
| unmatched | FlowFiles whose attributes are not found in the dictionary will be routed to this relationship |

---
title: ScanContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scancontent.md
section: Loading & Unloading Data
---

# ScanContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Scans the content of FlowFiles for terms that are found in a user-supplied dictionary. If a term is matched, the UTF-8 encoded version of the term will be added to the FlowFile using the ‘matching.term’ attribute

## Tags

aho-corasick, byte sequence, content, dictionary, find, scan, search

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Dictionary Encoding | Indicates how the dictionary is encoded. If ‘text’, dictionary terms are new-line delimited and UTF-8 encoded; if ‘binary’, dictionary terms are denoted by a 4-byte integer indicating the term length followed by the term itself |
| Dictionary File | The filename of the terms dictionary |

## Relationships

| Name | Description |
| --- | --- |
| matched | FlowFiles that match at least one term in the dictionary are routed to this relationship |
| unmatched | FlowFiles that do not match any term in the dictionary are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| matching.term | The term that caused the Processor to route the FlowFile to the ‘matched’ relationship; if FlowFile is routed to the ‘unmatched’ relationship, this attribute is not added |

---
title: ScriptedFilterRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scriptedfilterrecord.md
section: Loading & Unloading Data
---

# ScriptedFilterRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

This processor provides the ability to filter records out from FlowFiles using the user-provided script. Every record will be evaluated by the script which must return with a boolean value. Records with “true” result will be routed to the “matching” relationship in a batch. Other records will be filtered out.

## Tags

filter, groovy, record, script

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Record Reader | The Record Reader to use parsing the incoming FlowFile into Records |
| Record Writer | The Record Writer to use for serializing Records after they have been transformed |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | The Language to use for the script |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | In case of any issue during processing the incoming FlowFile, the incoming FlowFile will be routed to this relationship. |
| original | After successful procession, the incoming FlowFile will be transferred to this relationship. This happens regardless the number of filtered or remaining records. |
| success | Matching records of the original FlowFile will be routed to this relationship. If there are no matching records, no FlowFile will be routed here. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records within the flow file. |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## See also

* [org.apache.nifi.processors.script.ScriptedPartitionRecord](scriptedpartitionrecord.md)
* [org.apache.nifi.processors.script.ScriptedTransformRecord](scriptedtransformrecord.md)
* [org.apache.nifi.processors.script.ScriptedValidateRecord](scriptedvalidaterecord.md)

---
title: ScriptedLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/scriptedlookupservice.md
section: Loading & Unloading Data
---

# ScriptedLookupService

## Description

Allows the user to provide a scripted LookupService instance in order to enrich records from an incoming flow file.

## Tags

groovy, invoke, lookup, record, script

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Module Directory | Module Directory |  |  | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Script Body |  |  | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine \* | Script Engine | Groovy | * Groovy | Language Engine for executing scripts |
| Script File | Script File |  |  | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ScriptedPartitionRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scriptedpartitionrecord.md
section: Loading & Unloading Data
---

# ScriptedPartitionRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

Receives Record-oriented data (i.e., data that can be read by the configured Record Reader) and evaluates the user provided script against each record in the incoming flow file. Each record is then grouped with other records sharing the same partition and a FlowFile is created for each groups of records. Two records shares the same partition if the evaluation of the script results the same return value for both. Those will be considered as part of the same partition.

## Tags

groovy, group, organize, partition, record, script, segment, split

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Record Reader | The Record Reader to use parsing the incoming FlowFile into Records |
| Record Writer | The Record Writer to use for serializing Records after they have been transformed |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | The Language to use for the script |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be partitioned from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| original | Once all records in an incoming FlowFile have been partitioned, the original FlowFile is routed to this relationship. |
| success | FlowFiles that are successfully partitioned will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| partition | The partition of the outgoing flow file. If the script indicates that the partition has a null value, the attribute will be set to the literal string “<null partition>” (without quotes). Otherwise, the attribute is set to the String representation of whatever value is returned by the script. |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records within the flow file. |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |
| fragment.index | A one-up number that indicates the ordering of the partitioned FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of partitioned FlowFiles generated from the parent FlowFile |

## See also

* [org.apache.nifi.processors.script.ScriptedFilterRecord](scriptedfilterrecord.md)
* [org.apache.nifi.processors.script.ScriptedTransformRecord](scriptedtransformrecord.md)
* [org.apache.nifi.processors.script.ScriptedValidateRecord](scriptedvalidaterecord.md)

---
title: ScriptedReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/scriptedreader.md
section: Loading & Unloading Data
---

# ScriptedReader

## Description

Allows the user to provide a scripted RecordReaderFactory instance in order to read/parse/generate records from an incoming flow file.

## Tags

groovy, invoke, record, recordFactory, script

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Module Directory | Module Directory |  |  | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Script Body |  |  | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine \* | Script Engine | Groovy | * Groovy | Language Engine for executing scripts |
| Script File | Script File |  |  | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ScriptedRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/scriptedrecordsetwriter.md
section: Loading & Unloading Data
---

# ScriptedRecordSetWriter

## Description

Allows the user to provide a scripted RecordSetWriterFactory instance in order to write records to an outgoing flow file.

## Tags

groovy, invoke, record, script, writer

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Module Directory | Module Directory |  |  | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Script Body |  |  | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine \* | Script Engine | Groovy | * Groovy | Language Engine for executing scripts |
| Script File | Script File |  |  | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ScriptedRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/scriptedrecordsink.md
section: Loading & Unloading Data
---

# ScriptedRecordSink

## Description

Allows the user to provide a scripted RecordSinkService instance in order to transmit records to the desired target. The script must set a variable ‘recordSink’ to an implementation of RecordSinkService.

## Tags

groovy, invoke, record, record sink, script

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Module Directory | Module Directory |  |  | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Script Body |  |  | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine \* | Script Engine | Groovy | * Groovy | Language Engine for executing scripts |
| Script File | Script File |  |  | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: ScriptedTransformRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scriptedtransformrecord.md
section: Loading & Unloading Data
---

# ScriptedTransformRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

Provides the ability to evaluate a simple script against each record in an incoming FlowFile. The script may transform the record in some way, filter the record, or fork additional records. See Processor’s Additional Details for more information.

## Tags

filter, groovy, modify, record, script, transform, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Record Reader | The Record Reader to use parsing the incoming FlowFile into Records |
| Record Writer | The Record Writer to use for serializing Records after they have been transformed |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | The Language to use for the script |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | Any FlowFile that cannot be transformed will be routed to this Relationship |
| success | Each FlowFile that were successfully transformed will be routed to this Relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## See also

* [org.apache.nifi.processors.jolt.JoltTransformRecord](jolttransformrecord.md)
* [org.apache.nifi.processors.script.ExecuteScript](executescript.md)
* [org.apache.nifi.processors.standard.LookupRecord](lookuprecord.md)
* [org.apache.nifi.processors.standard.QueryRecord](queryrecord.md)
* [org.apache.nifi.processors.standard.UpdateRecord](updaterecord.md)

---
title: ScriptedValidateRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/scriptedvalidaterecord.md
section: Loading & Unloading Data
---

# ScriptedValidateRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-scripting-nar

## Description

This processor provides the ability to validate records in FlowFiles using the user-provided script. The script is expected to have a record as incoming argument and return with a boolean value. Based on this result, the processor categorizes the records as “valid” or “invalid” and routes them to the respective relationship in batch. Additionally the original FlowFile will be routed to the “original” relationship or in case of unsuccessful processing, to the “failed” relationship.

## Tags

groovy, record, script, validate

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Module Directory | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Record Reader | The Record Reader to use parsing the incoming FlowFile into Records |
| Record Writer | The Record Writer to use for serializing Records after they have been transformed |
| Script Body | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine | The Language to use for the script |
| Script File | Path to script file to execute. Only one of Script File or Script Body may be used |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## Relationships

| Name | Description |
| --- | --- |
| failure | In case of any issue during processing the incoming flow file, the incoming FlowFile will be routed to this relationship. |
| invalid | FlowFile containing the invalid records from the incoming FlowFile will be routed to this relationship. If there are no invalid records, no FlowFile will be routed to this Relationship. |
| original | After successful procession, the incoming FlowFile will be transferred to this relationship. This happens regardless the FlowFiles might routed to “valid” and “invalid” relationships. |
| valid | FlowFile containing the valid records from the incoming FlowFile will be routed to this relationship. If there are no valid records, no FlowFile will be routed to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records within the flow file. |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## See also

* [org.apache.nifi.processors.script.ScriptedFilterRecord](scriptedfilterrecord.md)
* [org.apache.nifi.processors.script.ScriptedPartitionRecord](scriptedpartitionrecord.md)
* [org.apache.nifi.processors.script.ScriptedTransformRecord](scriptedtransformrecord.md)

---
title: SearchElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/searchelasticsearch.md
section: Loading & Unloading Data
---

# SearchElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

A processor that allows the user to repeatedly run a paginated query (with aggregations) written with the Elasticsearch JSON DSL. Search After/Point in Time queries must include a valid “sort” field. The processor will retrieve multiple pages of results until either no more results are available or the Pagination Keep Alive expiration is reached, after which the query will restart with the first page of results being retrieved.

## Tags

elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, json, page, query, scroll, search

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Aggregation Results Format | Format of Aggregation output. |
| Aggregation Results Split | Output a flowfile containing all aggregations or one flowfile for each individual aggregation. |
| Aggregations | One or more query aggregations (or “aggs”), in JSON syntax. Ex: {“items”: {“terms”: {“field”: “product”, “size”: 10}}} |
| Client Service | An Elasticsearch client service to use for running queries. |
| Fields | Fields of indexed documents to be retrieved, in JSON syntax. Ex: [“user.id”, “http.response.\*”, {“field”: “@timestamp”, “format”: “epoch_millis”}] |
| Index | The name of the index to use. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Output No Hits | Output a “hits” flowfile even if no hits found for query. If true, an empty “hits” flowfile will be output even if “aggregations” are output. |
| Pagination Keep Alive | Pagination “keep_alive” period. Period Elasticsearch will keep the scroll/pit cursor alive in between requests (this is not the time expected for all pages to be returned, but the maximum allowed time for requests between page retrievals). |
| Pagination Type | Pagination method to use. Not all types are available for all Elasticsearch versions, check the Elasticsearch docs to confirm which are applicable and recommended for your service. |
| Query | A query in JSON syntax, not Lucene syntax. Ex: {“query”:{“match”:{“somefield”:”somevalue”}}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Query Clause | A “query” clause in JSON syntax, not Lucene syntax. Ex: {“match”:{“somefield”:”somevalue”}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Definition Style | How the JSON Query will be defined for use by the processor. |
| Restart On Finish | Whether the processor should start another search with the same query once a paginated search has completed. |
| Script Fields | Fields to created using script evaluation at query runtime, in JSON syntax. Ex: {“test1”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* 2”}}, “test2”: {“script”: {“lang”: “painless”, “source”: “doc[ ‘price’].value \* params.factor”, “params”: {“factor”: 2.0}}}} |
| Search Results Format | Format of Hits output. |
| Search Results Split | Output a flowfile containing all hits or one flowfile for each individual hit or one flowfile containing all hits from all paged responses. |
| Size | The maximum number of documents to retrieve in the query. If the query is paginated, this “size” applies to each page of the query, not the “size” of the entire result set. |
| Sort | Sort results by one or more fields, in JSON syntax. Ex: [{“price” : {“order” : “asc”, “mode” : “avg”}}, {“post_date” : {“format”: “strict_date_optional_time_nanos”}}] |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | The pagination state (scrollId, searchAfter, pitId, hitCount, pageCount, pageExpirationTimestamp) is retained in between invocations of this processor until the Scroll/PiT has expired (when the current time is later than the last query execution plus the Pagination Keep Alive interval). |

## Relationships

| Name | Description |
| --- | --- |
| aggregations | Aggregations are routed to this relationship. |
| failure | All flowfiles that fail for reasons unrelated to server availability go to this relationship. |
| hits | Search hits are routed to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | application/json |
| aggregation.name | The name of the aggregation whose results are in the output flowfile |
| aggregation.number | The number of the aggregation whose results are in the output flowfile |
| page.number | The number of the page (request), starting from 1, in which the results were returned that are in the output flowfile |
| hit.count | The number of hits that are in the output flowfile |
| elasticsearch.query.error | The error message provided by Elasticsearch if there is an error querying the index. |

## See also

* [org.apache.nifi.processors.elasticsearch.ConsumeElasticsearch](consumeelasticsearch.md)
* [org.apache.nifi.processors.elasticsearch.PaginatedJsonQueryElasticsearch](paginatedjsonqueryelasticsearch.md)

---
title: SegmentContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/segmentcontent.md
section: Loading & Unloading Data
---

# SegmentContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Segments a FlowFile into multiple smaller segments on byte boundaries. Each segment is given the following attributes: fragment.identifier, fragment.index, fragment.count, segment.original.filename; these attributes can then be used by the MergeContent processor in order to reconstitute the original FlowFile

## Tags

segment, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Segment Size | The maximum data size in bytes for each segment |

## Relationships

| Name | Description |
| --- | --- |
| original | The original FlowFile will be sent to this relationship |
| segments | All segments will be sent to this relationship. If the file was small enough that it was not segmented, a copy of the original is sent to this relationship as well as original |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All segments produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the segments that were created from a single parent FlowFile |
| fragment.count | The number of segments generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |
| segment.original.filename | The filename will be updated to include the parent’s filename, the segment index, and the segment count |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)

---
title: Set up and access Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-roles-login.md
section: Loading & Unloading Data
---

# Set up and access Openflow

To use Openflow, you must configure roles and permissions in your Snowflake account, and set up a database. This topic describes how to set up the necessary roles and permissions.

## Set up the Openflow admin roles

The **Openflow Admin role** is used by a deployment engineer to set up Openflow workflows. A Snowflake administrator adds this role by performing the following steps:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. Open a SQL worksheet.
3. Create a role for the Openflow admin, allowing it the required permissions to manage integrations and compute pools required for deployments. In the SQL below, OPENFLOW_ADMIN is the default name for the Openflow admin, but you can choose any name.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE IF NOT EXISTS OPENFLOW_ADMIN;

   GRANT CREATE ROLE ON ACCOUNT TO ROLE OPENFLOW_ADMIN;

   GRANT CREATE OPENFLOW DATA PLANE INTEGRATION ON ACCOUNT
      TO ROLE OPENFLOW_ADMIN;

   GRANT CREATE OPENFLOW RUNTIME INTEGRATION ON ACCOUNT
      TO ROLE OPENFLOW_ADMIN;
   ```
4. Grant the admin role and secondary roles to a user.

   To prevent issues with login, when you create an Openflow user, Snowflake recommends that you also assign and set default secondary roles to that user. This is helpful because Openflow doesn’t allow users with the following roles to log in: ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, or SECURITYADMIN. While logged in, Openflow actions can be authorized by any of the authenticated user’s roles, not just the default role.

   Substitute <OPENFLOW_USER> with the appropriate username:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   GRANT ROLE OPENFLOW_ADMIN TO USER <OPENFLOW_USER>;
   ALTER USER <OPENFLOW_USER> SET DEFAULT_ROLE = OPENFLOW_ADMIN;
   ALTER USER <OPENFLOW_USER> SET DEFAULT_SECONDARY_ROLES = ('ALL');
   ```

## Accept the Openflow terms of service

This step is only required once for your organization.

1. Sign in to Snowflake as a user with the ORGADMIN role.
2. In the navigation menu, select Ingestion » Openflow.
3. Review the agreement and select **Accept**.

## Start Openflow

Log in to Openflow by performing the following steps:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. Select Launch Openflow.

### Troubleshooting login issues

* If you can log into Snowflake but can’t log into Openflow, try the following:

  + Try changing your role to something other than ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, or SECURITYADMIN.
  + Try adding default secondary roles to the account:

    ```sqlexample
    USE ROLE ACCOUNTADMIN;
    ALTER USER <OPENFLOW_USER> SET DEFAULT_SECONDARY_ROLES = ('ALL');
    ```

---
title: Set up Openflow - BYOC
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-byoc.md
section: Loading & Unloading Data
---

# Set up Openflow - BYOC

This topic describes the steps to set up Openflow.

Setting up Openflow involves the following steps:

* Create a deployment in your cloud
* Create a Runtime environment in your cloud

## Prerequisites

The prerequisites to be completed on your Snowflake and AWS accounts are as follows:

### Snowflake account

You’ll need to first define privileges at the Snowflake account level.

1. Run the following SQL commands to grant the required privileges to the Openflow admin role:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   GRANT CREATE OPENFLOW DATA PLANE INTEGRATION ON ACCOUNT TO ROLE $openflow_admin_role;
   GRANT CREATE OPENFLOW RUNTIME INTEGRATION ON ACCOUNT TO ROLE $openflow_admin_role;
   ```

   The new privileges are assigned to the ACCOUNTADMIN role as part of the default set of privileges, and that role can grant the privileges to a role of their choosing for the Openflow admin role, denoted as $openflow_admin_role in the code.
2. Next, set `default_secondary_roles` to `ALL` for all Openflow users:

   1. Sign in to Snowflake with a role that your ACCOUNTADMIN assigned for using Openflow.

      This may not be any of the following roles: ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, or SECURITYADMIN.

      If you see a blank screen or the error “message: Invalid consent request” when logging into Openflow, change your role to a role that is not one of these listed roles.

      For more information, see Prerequisites.
   2. Run the following code, replacing $openflow_user for each Openflow user:
   > ```sqlexample
   > USE ROLE ACCOUNTADMIN;
   > ALTER USER $openflow_user SET DEFAULT_SECONDARY_ROLES = ('ALL');
   > ```
   >
   > This setting is required because Openflow actions are authorized by using any of the authenticated user’s roles, and not just the default role.

#### Deployment integration privileges

The deployment integration object represents a set of resources provisioned to deploy one or more Snowflake Openflow runtimes. For organizations bringing
their own cloud resources, the deployment integration object represents a managed Kubernetes cluster along with its associated nodes.

Users with the CREATE DATA PLANE INTEGRATION privilege on the Snowflake account can create and delete the deployment integration objects.

Additional privileges can be defined on deployment integration objects directly to support differentiation of access.

You can grant the following privileges on a deployment integration object:

* OWNERSHIP: Enables full control over deployment actions objects, including deletion of the deployment.
* USAGE: Enables creation of runtime child objects.

#### Runtime privileges

The runtime object represents a cluster of one or more Snowflake Openflow runtime servers, provisioned to run flow definitions. For Kubernetes deployments, the runtime object represents a stateful set of Snowflake Openflow runtime containers deployed in a namespace, along with supporting components.

Users with the OWNERSHIP privilege on the parent deployment integration object and the CREATE RUNTIME INTEGRATION account-level privilege can create runtime integration objects. Additional privileges can be defined on runtime integration objects directly to support differentiation of access.

You can grant the following privileges on a runtime integration object:

* OWNERSHIP: Enables full control over runtime actions, including deletion of the associated runtime and modification of runtime flow definitions.
* USAGE: Enables read access to the deployed runtime for observing health and status, without making any changes.

#### Snowflake role

A Snowflake role is a Snowflake role that is associated with a specific Openflow runtime and used for the following tasks:

* Grant access to Snowflake resources.
* Grant access to connector-specific resources

Snowflake roles are linked to Openflow Snowflake Managed Token, avoiding the need for customers to create separate service users and key pairs for authentication to Snowflake.

> **Note:**
>
> <RUNTIMENAME> denotes the name of the associated runtime.

To create a Snowflake role:

1. Create the required Snowflake role.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE IF NOT EXISTS OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>
   ```
2. Grant the Snowflake role access to a warehouse.
   Snowflake recommends using a dedicated warehouse for data ingestion.
   This warehouse should be used when configuring your connectors for runtimes where you will be using this Snowflake role.

   ```sqlexample
   GRANT USAGE, OPERATE ON WAREHOUSE <OPENFLOW_INGEST_WAREHOUSE> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   ```
3. Allow the Snowflake role to use, create or otherwise access Snowflake objects.

   > > **Note:**
   > >
   > > Depending on the Openflow connector being created the required underlying objects will vary.
   > > The example below is for illustration purposes only.

   ```sqlexample
   GRANT USAGE ON DATABASE <OPENFLOW_DATABASE> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   GRANT USAGE ON SCHEMA <OPENFLOW_SCHEMA> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   ```
4. Allow the user to use the Snowflake role

   ```sqlexample
   GRANT ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME> TO USER <username>;
   ```

#### Example for role setup

Consider a scenario where the following roles should be set up:

* **accountadmin:** Out-of-the box role from Snowflake, which has these two CREATE privileges:

  + CREATE OPENFLOW DATA PLANE INTEGRATION
  + CREATE OPENFLOW RUNTIME INTEGRATION
* **deployment_manager:** Can create, manage, and delete deployments.
* **deployment1_runtime_manager_1:** Can create a runtime only within deployment 1. It can modify and delete a runtime that it created within deployment 1, but not a runtime created by deployment1_runtime_manager_2.
* **deployment1_runtime_manager_2:** Can create a runtime only within deployment 1. It can modify and delete a runtime that it created within deployment 1, but not a runtime created by deployment1_runtime_manager_1.
* **deployment1_runtime_viewer_1:** Can view a runtime canvas within deployment 1 that was created by deployment1_runtime_manager_1.
* **deployment1_runtime_viewer_2:** Can view a runtime canvas within deployment 1 that was created by deployment1_runtime_manager_2.
* **deployment2_runtime_manager:** Can create a runtime only within deployment 2.
* **deployment2_runtime_viewer:** Can view a runtime canvas within deployment 2.

To set up Openflow with these roles, follow these steps:

1. Create new roles and assign the relevant privileges:

   ```sqlexample
   use role ACCOUNTADMIN;
   create role if not exists deployment_manager;
   create role if not exists deployment1_runtime_manager_1;
   create role if not exists deployment1_runtime_manager_2;
   create role if not exists deployment1_runtime_viewer_1;
   create role if not exists deployment1_runtime_viewer_2;
   create role if not exists deployment2_runtime_manager;
   create role if not exists deployment2_runtime_viewer;

   -- Assign create deployment privilege to roles. (This privilege cannot be granted in Openflow UI.)

   grant create openflow data plane integration on account to role deployment_manager;

   -- Assign create runtime privilege to roles. (This privilege cannot be granted in the Control Pane UI.)

   grant create openflow runtime integration on account to role deployment1_runtime_manager_1;
   grant create openflow runtime integration on account to role deployment1_runtime_manager_2;
   grant create openflow runtime integration on account to role deployment2_runtime_manager;

   -- Grant roles to users. (Repeat this step for each user.)

   grant role <role name> to user <username>;
   ```
2. To create a deployment, follow these steps:

   1. Sign in to Snowsight as deployment_manager.
   2. In the navigation menu, select Ingestion » Openflow.
   3. To create deployment 1, select Create a deployment, and grant the USAGE privilege to deployment1_runtime_manager_1 and deployment1_runtime_manager_2.
   4. To create deployment 2, select Create a deployment, and grant the USAGE privilege to deployment2_runtime_manager.
3. To create a runtime in deployment 1, follow these steps:

   1. Log in as deployment1_runtime_manager_1.
   2. Create a runtime as described in the following sections. deployment1_runtime_manager_1 should be able to create runtimes and manage any runtimes it created within this deployment.
   3. In the Openflow UI, select deployment1_runtime_viewer_1 and grant it the USAGE privilege.

### AWS account

Ensure the following on your AWS account:

* You have an AWS account with permissions required to create a CloudFormation stack.
* An AWS administrator in your organization can execute CloudFormation script to set up Amazon Elastic Kubernetes Service (EKS) inside a new VPC (created by
  CloudFormation) or an existing VPC. See Prerequisites for BYO-VPC (existing VPC).

> **Note:**
>
> To learn about how the Openflow installation happens in your AWS account and the permissions that are configured by the CloudFormation template, see Installation process.

#### Prerequisites for BYO-VPC (existing VPC)

If you want to use an existing VPC and your own subnets, ensure that you have the following:

* For Snowflake managed ingress, two public subnets with:

  > + Different availability zones
  > + At least /27 CIDR ranges with 32 available IPs.
  > + Routes for destination 0.0.0.0/0 and target internet gateway or some other egress routing to the internet.
  > + A tag that allows Openflow to create a load balancer:
  >
  >   > - Key: `kubernetes.io/role/elb`
  >   > - Value: `1`
  > + If your public subnets are used by other EKS clusters, a tag that allows Openflow to create a load balancer alongside other load balancers:
  >
  >   > - Key: `kubernetes.io/cluster/{deployment-key}`
  >   > - Value: `1`
  > > **Note:**
  > >
  > > Managing your own ingress eliminates the need for public subnets, but requires additional configuration in your AWS account.
  > > For more information, see [Openflow BYOC - Set up custom ingress](setup-openflow-byoc-custom-ingress.md).
* Two private subnets with:

  > + Different availability zones
  > + At least /24 CIDR ranges with 255 available IPs. This limits the number and
  >   scale of runtimes you can create, so it may be more appropriate to use a larger range for the deployment.
  > + Connectivity to Snowflake and AWS services from Private Subnet 1 where the Openflow deployment runs.
  >
  >   > - Among many options, you can connect using route tables with a NAT Gateway, a Transit Gateway, or PrivateLink VPC Endpoints.
  >   > - Without this connectivity, the Openflow deployment will not initialize or set up properly and no infrastructure will be provisioned.
  > + For Snowflake managed ingress, egress connectivity to [LetsEncrypt.org](https://letsencrypt.org), which will provision a TLS certificate.

## Accept the Openflow terms of service

This step is only required once for your organization.

1. Sign in to Snowflake as a user with the ORGADMIN role.
2. In the navigation menu, select Ingestion » Openflow.
3. Accept Openflow terms of services.

## Create a deployment in your cloud

### Configure the deployment in your Snowflake account

> **Important:**
>
> Sign in to Snowflake with a role that your ACCOUNTADMIN assigned for using Openflow.
>
> This may not be any of the following roles: ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, or SECURITYADMIN.
>
> If you see a blank screen, or the error: “message: Invalid consent request”, when logging into Openflow, change your role to a role that is not one of these listed roles.
>
> For more information, see Prerequisites.

1. In the navigation menu, select Ingestion » Openflow.
2. Select Launch Openflow.
3. In the Openflow UI, select Create a deployment.
4. On the Deployments tab, select Create a deployment.
   The **Creating a deployment** wizard opens.
5. In the Prerequisites step, ensure that you meet all the requirements, and then select Next.
6. In the Deployment location step, select Amazon Web Services as the deployment location, enter a name for your deployment, and then select Next.
7. In the Configuration step, select one of the following options:

   * Managed VPC: Choose this option if you want your VPC to be managed by Snowflake.
   * Bring your own VPC: Choose this option if you want to use an existing VPC.

9. In the PrivateLink step, you can select if you want to establish communication with Snowflake over the private link.
   Enabling this option requires additional setup in your AWS account. For more information, see [AWS PrivateLink and Snowflake](../../admin-security-privatelink.md).

   * If the PrivateLink option is enabled, the End user authentication over PrivateLink step displays.

     + If enabled, browser-based authentication redirects use PrivateLink endpoints.
     + If disabled, end-user authentication uses public Snowflake URLs.

     Regardless of this setting, Deployment communications to Snowflake will use PrivateLink.

     If you access Snowsight through a PrivateLink URL, ensure it is enabled.
     If you access Snowsight through a non-PrivateLink URL, leave it disabled.
10. In the Custom Ingress step, you can choose to manage your own ingress configuration for the Openflow deployment, such as specifying custom security groups, load balancer settings, or other network controls.

    Enabling this option requires additional setup in your AWS account. For more information, see [Openflow BYOC - Set up custom ingress](setup-openflow-byoc-custom-ingress.md).
11. Select Create Deployment.
12. Once your deployment is configured, a dialog box appears that lets you download the CloudFormation template to complete the setup process in your AWS account. Download this template. Note that Openflow doesn’t support modifying the CloudFormation template. Don’t modify any values after downloading the template, other than choosing drop-down options.
13. (Optional) To encrypt EBS volumes for your Openflow BYOC deployment, see [Openflow BYOC - Set up encrypted EBS volumes](setup-openflow-byoc-encrypted-volumes.md).

### Apply the CloudFormation template in your AWS account

1. In your AWS account, create a new CloudFormation Stack using the template. After the Openflow deployment agent’s Amazon Elastic Compute Cloud (EC2) instance is created, it completes the rest of the Installation process using infrastructure as code scripts.
   You can track the installation progress as described in Track the installation progress.

   If you’re using an existing VPC, upon uploading the CloudFormation template, select the respective values in the drop-down lists for the two private subnets and your VPC.

### Create a network rule for Openflow in your Snowflake account

This step is required only if you’re using network policies to control access to Snowflake. A network policy is a set of rules that control which IP addresses can access your Snowflake account.

1. Navigate to your Snowflake account.
2. Identify the NAT gateway public IP address that was created as part of the CloudFormation stack. You can find this either by searching for NAT Gateway on AWS console or checking the output of the CloudFormation stack.

   The NAT gateway is responsible for Openflow egress for both the Data Plane Agent (DPA) and EKS. Both DPA and EKS run in the Private Subnet 1 of the installation.
3. Create a network rule for Openflow and add it to your existing network policy. Replace {$NAT_GATEWAY_PUBLIC_IP} in the following code snippet with the NAT gateway public IP address that was created as part of the CloudFormation stack.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   USE DATABASE {REPLACE_WITH_YOUR_DB_NAME};

   CREATE NETWORK RULE allow_openflow_deployment
   MODE = INGRESS
   TYPE = IPV4
   VALUE_LIST = ('{$NAT_GATEWAY_PUBLIC_IP}/32');
   ```
4. Find your currently active network policy.

   ```sqlexample
   SHOW PARAMETERS LIKE 'NETWORK_POLICY' IN ACCOUNT;
   ```
5. Copy the value column from the output, and use it to create a network rule:

   ```sqlexample
   ALTER NETWORK POLICY {ENTER_YOUR_ACTIVE_NETWORK_POLICY_NAME} ADD ALLOWED_NETWORK_RULE_LIST = (allow_openflow_deployment);
   ```

### Set up an event table to log Openflow events (required)

Use one of the following options to set up an event table:

* Create a new Openflow-specific event table (recommended):

  ```sqlexample
  USE ROLE ACCOUNTADMIN;

  CREATE DATABASE IF NOT EXISTS openflow;
  USE openflow;
  CREATE SCHEMA IF NOT EXISTS openflow;
  USE SCHEMA openflow;

  GRANT CREATE EVENT TABLE
    ON SCHEMA openflow.openflow
    TO ROLE $role_of_deployment_owner;
  USE ROLE $role_of_deployment_owner;
  CREATE EVENT TABLE IF NOT EXISTS openflow.openflow.openflow_events;
  -- Find the Data Plane Integrations
  SHOW OPENFLOW DATA PLANE INTEGRATIONS;
  ALTER OPENFLOW DATA PLANE INTEGRATION
    $openflow_dataplane_name
    SET EVENT_TABLE = 'openflow.openflow.openflow_events';
  ```
* Create an account-specific event table:

  ```sqlexample
  USE DATABASE openflow;
  CREATE SCHEMA IF NOT EXISTS openflow.telemetry;
  CREATE EVENT TABLE IF NOT EXISTS openflow.telemetry.events;
  ALTER ACCOUNT SET EVENT_TABLE = openflow.telemetry.events;
  ```
* Use an existing account-specific event table:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  ALTER ACCOUNT SET EVENT_TABLE = 'existing_database.existing_schema.existing_event_table';
  ```

### Verify the deployment

1. In the navigation menu, select Ingestion » Openflow. Creating a deployment takes about 45 minutes on AWS. Once it’s created, you can view your deployment in the Deployments tab of Openflow UI with its state marked as Active.

## Create a runtime environment in your cloud

1. In Openflow Control Plane, select Create a runtime. The Create Runtime dialog box appears.
2. From the Deployment drop-down list, choose the deployment in which you want to create a runtime.
3. Enter a name for your runtime.
4. Choose a node type from the Node type drop-down list. This specifies the size of your nodes.
5. In the Min/Max node range selector, select a range. The minimum value specifies the
   number of nodes that the runtime starts with when idle and the maximum value specifies the
   number of nodes that the runtime can scale up to, in the event of high data volume or CPU load.
6. Select Create. The runtime takes a couple of minutes to get created.

Once created, you can view your runtime by navigating to the Runtimes tab of the Openflow control plane. Click the runtime to open the Openflow canvas.

## Next step

Deploy a connector in a runtime. For a list of connectors available in Openflow, see [Openflow connectors](connectors/about-openflow-connectors.md).

## Networking considerations: Openflow EKS to source systems

For BYOC deployments, take note of the following considerations:

* Openflow CloudFormation stack creates one VPC with two public subnets and two private subnets.
* Public subnets host the AWS Network Load Balancer, which is created later. Private subnets host the EKS Cluster and all of the EC2 instances backing the node groups. Openflow runtimes run within Private subnet 1.
* NAT Gateway is currently the egress for both DPA and EKS. Both DPA and EKS run in the Private subnet 1 of the installation.

For BYO-VPC deployments, take note of the following considerations:

* Openflow requires you to enter the two private subnets that will run Openflow and two public subnets for the AWS Load Balancer.
* You have to provide your own egress routing to the Internet from those private subnets, which can be the central NAT Gateway.
* No Internet Gateway is created by Openflow. You have to provide appropriate public internet egress routing.

The network connectivity generally is as follows:
**An Openflow EC2 Instance** (Agent or EKS) runs in a **private subnet** that requires **Route Table entries** to send egress traffic to a **Transit Gateway**, a **PrivateLink VPC Endpoint**, or a **NAT Gateway** connected to an **Internet Gateway**.

### Example: BYOC deployment with a new VPC to communicate with RDS in a different VPC of the same account

To enable communication between the Openflow EKS cluster and the RDS instance, you need to create a new
security group, with the EKS cluster security group as the source for the inbound rule for RDS connectivity, and attach the group in RDS.

1. Find the EKS cluster security group, navigate to EKS and find your deployment key.
   You can also find it on the Openflow UI by performing the following steps:

   1. Sign in to Openflow.
   2. Go to the Deployments tab.
   3. Select the More options icon next to your deployment.
   4. Select View details. The value in the field Key is your deployment key.
2. After finding the deployment key, you can use it to filter your AWS resources by the key value.
3. Create a new security group that allows access from the Openflow EKS cluster using the relevant database
   port. For PostgreSQL the default port is 5432.
4. Attach it in RDS as a new security group.

If you need to troubleshoot, the [Reachability Analyzer](https://docs.aws.amazon.com/vpc/latest/reachability/getting-started.html) can be useful.
It will give you detailed information about what may be blocking connectivity by using tracing capabilities within the AWS platform.

See the following AWS docs for accessing DB instances using VPC peering and the associated security group configuration:

* [Scenarios for accessing a DB instance in a VPC - Amazon Relational Database Service](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/USER_VPC.Scenarios.html#USER_VPC.Scenario3)
* [Update your security groups to reference peer security groups - Amazon Virtual Private Cloud](https://docs.aws.amazon.com/vpc/latest/peering/vpc-peering-security-groups.html)

## Configuring PrivateLink in AWS

This section explains how to access and configure Openflow using private connectivity.

### Access Openflow over PrivateLink

Before starting with the private link configuration, enable PrivateLink for your account as described in [AWS PrivateLink and Snowflake](../../admin-security-privatelink.md).

1. Using the `ACCOUNTADMIN` role, call the `SYSTEM$GET_PRIVATELINK_CONFIG` function in your Snowflake account and identify the value for `openflow-privatelink-url`. This is the URL for accessing Openflow over PrivateLink.
2. Create a `CNAME` record in your DNS to resolve the URL value to your VPC endpoint.
3. Confirm that your DNS settings can resolve the value.
4. Confirm that you can connect to Openflow UI using this URL from your browser.

### Configure a new deployment using PrivateLink

> **Note:**
>
> Snowflake recommends that you use the Bring your own VPC version of Openflow deployment and create a VPC endpoint in your VPC before applying the CloudFormation template.

Before starting with the PrivateLink configuration, make sure that PrivateLink is enabled for your account as described in [AWS PrivateLink and Snowflake](../../admin-security-privatelink.md).

Perform the following steps:

1. Retrieve Snowflake’s VPC endpoint service ID and Openflow PrivateLink URLs:

   > 1. Run the following SQL command using the `ACCOUNTADMIN` role:
   >
   >    > ```sqlexample
   >    > SELECT SYSTEM$GET_PRIVATELINK_CONFIG()
   >    > ```
   > 2. From the output, identify and save the values for the following keys:
   >
   >    * `privatelink-vpce-id`
   >    * `openflow-privatelink-url`
   >    * `external-telemetry-privatelink-url`
2. Create a VPC endpoint with parameters:

   > * Type: PrivateLink Ready partner services
   > * Service: `privatelink-vpce-id` value obtained in the previous step.
   > * VPC: The VPC where your Openflow deployment will be running.
   > * Subnets: Select two availability zones and private subnets where your Openflow deployment will be running.
3. Set up Route 53 private hosted zone with the following parameters:

   > 1. Domain: `privatelink.snowflakecomputing.com`
   > 2. Type: Private hosted zone
   > 3. Select the region and VPC where your Openflow deployment will be running.
4. Add two `CNAME` records for the URLs identified in the first step:

   > 1. For `openflow-privatelink-url`
   >
   >    > * Record name: `openflow-privatelink-url` value obtained in the first step
   >    > * Record type: `CNAME`
   >    > * Value: DNS name of your VPC endpoint
   > 2. For `external-telemetry-privatelink-url`
   >
   >    > * Record name: `external-telemetry-privatelink-url` value obtained in the first step
   >    > * Record type: `CNAME`
   >    > * Value: DNS name of your VPC endpoint
5. Create a dedicated security group for the deployment and enable traffic from the security group to the VPC endpoint:

   > 1. Open the security group associated with your VPC endpoint.
   > 2. Add an inbound rule to the security group that allows All traffic from the security group created for your deployment.
6. Create a new deployment and apply the CloudFormation Stack following the instructions in the Create a deployment in your cloud section and ensure that:

   * The PrivateLink option is enabled. The End user authentication over PrivateLink option can be either enabled or disabled.
   * The security group created for the deployment is used when creating the CloudFormation stack.
7. Wait until the EKS cluster for your deployment is created. To confirm successful creation, navigate to AWS Console under Elastic Kubernetes Service. Verify that a cluster identified as `<deployment-key>` displays status ACTIVE.
8. Allow for traffic from your EKS to the VPC endpoint:

   > 1. Open the security group associated with your VPC endpoint.
   > 2. Add an inbound rule to the security group that allows All traffic from the security group assigned to your EKS cluster. The EKS cluster’s security group starts with `eks-cluster-sg-<deployment-key>-`.

### Configuring VPC Gateway Endpoints for S3 in AWS

Configuring an AWS VPC Gateway Endpoint for S3 is the primary method to allow an Agent EC2 instance in a private subnet to access the Amazon Linux 2023 repository privately,
without requiring an Internet Gateway, a NAT Gateway, or a public IP address on the instance. The Agent EC2 instance uses this repository to install its dependencies, for instance Docker.

To configure a VPC Gateway Endpoint for S3:

1. Open a browser to the AWS VPC dashboard.
2. In the navigation pane, select Endpoints.
3. Click Create endpoint and create a new VPC endpoint with parameters:

   > * Type: AWS services
   > * Service: `com.amazonaws.<your-region>.s3` of type `Gateway`
   > * VPC: Select the VPC of your deployment
   > * Route tables: Select the route table(s) that are associated with your private subnet(s)
   > * Policy: Choose Full access

## Configuring private deployments

Private deployments are a feature that allows you to deploy Openflow in a VPC without the need for public internet ingress or egress.

To configure private deployments, you need to choose the following options when creating a new deployment:

1. In the Deployment location step, select Amazon Web Services as the deployment location.
2. In the VPC Configuration step, select Bring your own VPC to use an existing VPC.
3. In the PrivateLink step, enable the PrivateLink feature. Enabling this option requires additional setup in your AWS account, see Configuring PrivateLink in AWS. The End user authentication over PrivateLink option can be either enabled or disabled.
4. In the Custom ingress step, enable the custom ingress feature. Enabling this option requires additional setup in your AWS account. For more information, see [Openflow BYOC - Set up custom ingress](setup-openflow-byoc-custom-ingress.md).

Private deployments require that your existing VPC is able to access the following domains:

* `*.amazonaws.com`, a detailed list of services being accessed includes:

  > + `com.amazonaws.iam`
  > + `com.amazonaws.<your-region>.s3`
  > + `com.amazonaws.<your-region>.ec2`
  > + `com.amazonaws.<your-region>.ecr.api`
  > + `com.amazonaws.<your-region>.ecr.dkr`
  > + `com.amazonaws.<your-region>.secretsmanager`
  > + `com.amazonaws.<your-region>.sts`
  > + `com.amazonaws.<your-region>.eks`
  > + `com.amazonaws.<your-region>.autoscaling`
* `*.privatelink.snowflakecomputing.com`
* `oidc-eks.<your-region>.api.aws`
* `shield.us-east-1.amazonaws.com`

## Installation process

Between the CloudFormation stack and the Openflow Agent, there are
several coordinated steps that the BYOC deployment installation process
manages. The goal is to separate responsibilities between a cold-start
that gives organizations an easy way to provide inputs to their BYOC
deployment (solved via CloudFormation), and the configuration of the
deployment and its core software components that will need to change
over time (solved by the Openflow Agent).

The deployment Agent facilitates the creation of the Openflow deployment infrastructure and
installation of the deployment software components including the deployment service. The deployment agent authenticates
with Snowflake System Image Registry to obtain Openflow container images.

The steps are as follows:

> **Note:**
>
> When using BYO-VPC, you will choose a VPC ID and two private subnet IDs from the template, and
> the CloudFormation stack will use the selected ones rather than creating the resources mentioned in steps 1a, 1b, and 1c.

1. The CloudFormation template creates the following and configures with the AWS permissions mentioned in Configured AWS permissions:

   1. One VPC with two public subnets and two private subnets. Public
      subnets host the AWS Network Load Balancer (created later).
      Private Subnets host the EKS cluster and all of the EC2 instances
      backing the NodeGroups. Openflow runtimes run within a private
      subnet.
   2. Internet Gateway for egress from the VPC
   3. NAT Gateway for egress from the private subnets
   4. AWS Secrets Manager entry for the OIDC configuration input by the user
   5. IAM role and instance profile for the Openflow Agent to use from its EC2 instance
   6. An EC2 instance for Openflow deployment agent, complete with a UserData
      script to automatically run the initialization process. This
      script sets environment variables for the Openflow deployment agent to use,
      derived from the input CloudFormation parameters.
   7. EC2 Instance Connect endpoint for the Openflow deployment agent to upgrade
      the deployment when needed.

      * When using BYO-VPC, by default the CloudFormation stack will create an EC2 Instance Connect endpoint. However, this default behavior can be modified. When using the managed VPC option, the CloudFormation stack will always create an EC2 Instance Connect endpoint.
      * The Instance Connect endpoint can be shared across many VPCs.
      * If a deployment is deleted, along with deleting the CloudFormation stack, it will also remove the endpoint. This would block access to other BYO-VPC agents if the endpoint is shared.
      * To add an EC2 Instance Connect endpoint, perform the following steps in your AWS account:

        1. In the left navigation, navigate to VPC » Endpoints.
        2. Select Create Endpoint.
        3. Choose the endpoint type as EC2 Instance Connect Endpoint.
        4. Select a VPC. Leave all the security groups clear (not selected) to use the default VPC security group.
        5. When selecting a subnet, use the same value as Private Subnet 1 in the CloudFormation parameters.
        6. Select Create. It takes approximately 5 minutes for the endpoint to be created.
   8. S3 Bucket that stores the Terraform state, logs, and outputs for
      the Openflow Agent
2. The Openflow deployment agent creates the following:

   1. An EKS cluster containing:
   > * Node groups
   > * Autoscaling groups
   > * AWS VPC Container Network Interface (CNI) add-on
   > * Amazon Elastic Block Store (EBS) CSI add-on

   1. Secrets manager records for PostgreSQL, OAuth credentials, and so on.
   2. IAM policies and roles for various K8s service accounts to
      retrieve their secrets from AWS Secrets Manager.
   3. K8s components
   > * Namespaces
   > * Cluster autoscaler
   > * EBS CSI expandable storage
   > * AWS Load Balancer Controller, which creates the publicly accessible Network Load Balancer
   > * Let’s Encrypt certificate issuer
   > * Nginx Ingress, configured for Let’s Encrypt
   > * Metrics Server
   > * Certificate manager from [Jetstack](http://jetstack.io/)
   > * [External secrets operator](http://external-secrets.io/)
   > * Service accounts for Temporal, deployment service, and OIDC
   > * Secrets stores for Temporal, deployment service, and OIDC
   > * External secrets for Temporal and deployment service. The external secret for OIDC is created and managed by the runtime operator.
   > * PostgreSQL
   > * Temporal
   > * Self-signed certificate issuer and ingress configuration for communications between runtime nodes
   > * Openflow runtime operator
   > * Openflow deployment service

By default, all AWS accounts have a quota of five Elastic IP addresses
per region, because public (IPv4) internet addresses are a scarce public
resource. Snowflake strongly recommends that you use Elastic IP
addresses primarily for their ability to remap the address to another
instance in the case of instance failure, and to use DNS hostnames for
all other inter-node communication.

### Track the installation progress

After the CloudFormation stack moves into the CREATE_COMPLETE state, the Openflow agent automatically creates the rest of the infrastructure.

There are a few steps that can take 10-15 minutes each, such as:

1. Creating the EKS cluster
2. Installing the EBS CSI add-on to the EKS cluster
3. Creating the RDS PostgreSQL database

Status reporting for the Openflow agent is not available yet. In the meantime, you
can view logs on the Openflow agent to verify whether the BYOC deployment is ready for runtimes. To do this, perform the following steps:

1. In the EC2 instances list, locate the following two instances:

   * openflow-agent-{data-plane-key}: This is the Openflow agent that you will use to manage runtimes
   * {data-plane-key}-mgmt-group: This is a node in the BYOC deployment’s EKS cluster that runs an operator and other core software
2. Right-click on the openflow-agent-{data-plane-key} instance and select Connect.
3. Switch from EC2 Instance Connect to Connect using EC2 Instance Connect Endpoint. Leave the default EC2 Instance Connect Endpoint
   in place.
4. Click Connect. A new browser tab or window will appear with a
   command-line interface.
5. Run the following command to tail the installation logs of the docker image that is configuring your deployment:

   ```bash
   journalctl -xe -f -n 100 -u docker
   ```
6. Once the installation is complete, you’ll see the following output:

   ```output
   {timestamp} - app stack applied successfully
   {timestamp} - All resources applied successfully
   ```

### Configured AWS permissions

This section lists the AWS permissions configured by Openflow BYOC stack based on the roles.

> **Note:**
>
> {key} represents the deployment key that uniquely identifies cloud resources created and managed by Openflow for a particular deployment.

**Administrative user**

`cloudformation` and all of the following permissions.

**IAM Role: openflow-agent-role-{key}**

```json
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "autoscaling:DescribeTags",
               "ec2:DescribeImages",
               "ec2:DescribeInstances",
               "ec2:DescribeLaunchTemplates",
               "ec2:DescribeLaunchTemplateVersions",
               "ec2:DescribeNetworkInterfaces",
               "ec2:DescribeSecurityGroups",
               "ec2:DescribeSubnets",
               "ec2:DescribeTags",
               "ec2:DescribeVolumes",
               "ec2:DescribeVpcs",
               "ec2:DescribeVpcAttribute",
               "iam:GetRole",
               "iam:GetOpenIDConnectProvider",
               "ecr:GetAuthorizationToken",
               "ec2:RunInstances",
               "ec2:CreateLaunchTemplate",
               "ec2:CreateSecurityGroup",
               "ec2:CreateTags",
               "ec2:DeleteTags"
            ],
            "Resource": "*",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "aws:ResourceTag/Name": [
                        "{key}-oidc-provider"
                  ]
               }
            },
            "Action": [
               "iam:CreateOpenIDConnectProvider",
               "iam:DeleteOpenIDConnectProvider",
               "iam:TagOpenIDConnectProvider"
            ],
            "Resource": "arn:aws:iam::{Account_ID}:oidc-provider/oidc.eks.{Region}.amazonaws.com/id/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "iam:DeletePolicy",
               "iam:CreatePolicy",
               "iam:GetPolicy",
               "iam:GetPolicyVersion",
               "iam:ListPolicyVersions"
            ],
            "Resource": [
               "arn:aws:iam::{Account_ID}:policy/dp-service-role-policy-{key}",
               "arn:aws:iam::{Account_ID}:policy/oauth2-role-policy-{key}",
               "arn:aws:iam::{Account_ID}:policy/temporal-service-role-policy-{key}",
               "arn:aws:iam::{Account_ID}:policy/oidc-service-role-policy-{key}",
               "arn:aws:iam::{Account_ID}:policy/dps-temporal-role-policy-{key}"
               "arn:aws:iam::{Account_ID}:policy/dps-postgres-role-policy-{key}"
            ],
            "Effect": "Allow"
      },
      {
            "Action": [
               "iam:UpdateAssumeRolePolicy",
               "iam:PutRolePolicy",
               "iam:ListInstanceProfilesForRole",
               "iam:ListAttachedRolePolicies",
               "iam:ListRolePolicies",
               "iam:GetRolePolicy",
               "iam:CreateRole",
               "iam:AttachRolePolicy",
               "iam:DeleteRole",
               "iam:DeleteRolePolicy",
               "iam:DetachRolePolicy",
               "iam:TagRole"
            ],
            "Resource": [
               "arn:aws:iam::{Account_ID}:role/openflow-agent-role-{key}",
               "arn:aws:iam::{Account_ID}:role/{key}-*",
               "arn:aws:iam::{Account_ID}:role/dps-temporal-role-{key}",
               "arn:aws:iam::{Account_ID}:role/dps-postgres-role-{key}",
               "arn:aws:iam::{Account_ID}:role/dp-service-role-{key}",
               "arn:aws:iam::{Account_ID}:role/oauth2-role-{key}",
               "arn:aws:iam::{Account_ID}:role/oidc-service-role-{key}"
            ],
            "Effect": "Allow"
      },
      {
            "Action": [
               "autoscaling:CreateOrUpdateTags",
               "autoscaling:DeleteTags"
            ],
            "Resource": "arn:aws:autoscaling:{Region}:{Account_ID}:autoScalingGroup:*:autoScalingGroupName/eks-{key}-*",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "aws:ResourceTag/Name": [
                        "{key}-EC2SecurityGroup-*",
                        "k8s-traffic-{key}-*",
                        "eks-cluster-sg-{key}-*",
                        "{key}-cluster-sg",
                        "postgres-{key}-sg"
                  ]
               }
            },
            "Action": [
               "ec2:AuthorizeSecurityGroupEgress",
               "ec2:AuthorizeSecurityGroupIngress",
               "ec2:RevokeSecurityGroupEgress",
               "ec2:DeleteSecurityGroup",
               "ec2:CreateTags",
               "ec2:DeleteTags",
               "ec2:CreateNetworkInterface",
               "ec2:DeleteNetworkInterface"
            ],
            "Resource": "arn:aws:ec2:{Region}:{Account_ID}:security-group/*",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "aws:ResourceTag/elbv2.k8s.aws/cluster": "{key}"
               }
            },
            "Action": [
               "ec2:AuthorizeSecurityGroupEgress",
               "ec2:AuthorizeSecurityGroupIngress",
               "ec2:RevokeSecurityGroupEgress",
               "ec2:DeleteSecurityGroup",
               "ec2:CreateTags",
               "ec2:DeleteTags",
               "ec2:CreateNetworkInterface",
               "ec2:DeleteNetworkInterface"
            ],
            "Resource": "arn:aws:ec2:{Region}:{Account_ID}:security-group/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "ec2:CreateSecurityGroup"
            ],
            "Resource": "arn:aws:ec2:{Region}:{Account_ID}:vpc/vpc-018d2da0fde903de4",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/Name": "openflow-agent-{key}"
               }
            },
            "Action": [
               "ec2:AttachNetworkInterface"
            ],
            "Resource": "arn:aws:ec2:{Region}:{Account_ID}:instance/*",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "aws:ResourceTag/Name": "{key}-*-group"
               }
            },
            "Action": [
               "ec2:DeleteLaunchTemplate"
            ],
            "Resource": "arn:aws:ec2:{Region}:{Account_ID}:launch-template/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "eks:CreateCluster",
               "eks:CreateAccessEntry",
               "eks:CreateAddon",
               "eks:CreateNodegroup",
               "eks:DeleteCluster",
               "eks:DescribeCluster",
               "eks:ListClusters",
               "eks:ListNodeGroups",
               "eks:DescribeUpdate",
               "eks:UpdateClusterConfig",
               "eks:TagResource"
            ],
            "Resource": "arn:aws:eks:{Region}:{Account_ID}:cluster/{key}",
            "Effect": "Allow"
      },
      {
            "Action": [
               "eks:DescribeAddon",
               "eks:DescribeAddonVersions",
               "eks:UpdateAddon",
               "eks:DeleteAddon",
               "eks:DescribeUpdate"
            ],
            "Resource": "arn:aws:eks:{Region}:{Account_ID}:addon/{key}/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "eks:DeleteNodegroup",
               "eks:DescribeNodegroup",
               "eks:ListNodegroups",
               "eks:UpdateNodegroupConfig",
               "eks:TagResource",
               "eks:DescribeUpdate"
            ],
            "Resource": "arn:aws:eks:{Region}:{Account_ID}:nodegroup/{key}/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "s3:CreateBucket",
               "s3:ListBucket"
            ],
            "Resource": "arn:aws:s3:::byoc-tf-state-{key}",
            "Effect": "Allow"
      },
      {
            "Action": [
               "s3:DeleteObject",
               "s3:GetObject",
               "s3:PutObject"
            ],
            "Resource": "arn:aws:s3:::byoc-tf-state-{key}/*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "secretsmanager:CreateSecret",
               "secretsmanager:DeleteSecret",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:GetSecretValue",
               "secretsmanager:PutSecretValue",
               "secretsmanager:UpdateSecretVersionStage"
            ],
            "Resource": "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:*-{key}*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "ecr:BatchCheckLayerAvailability",
               "ecr:BatchGetImage",
               "ecr:DescribeImages",
               "ecr:DescribeRepositories",
               "ecr:GetDownloadUrlForLayer",
               "ecr:ListImages"
            ],
            "Resource": "arn:aws:ecr:{Region}:{Account_ID}:*",
            "Effect": "Allow"
      },
      {
            "Action": [
               "ecr:CreateRepository",
               "ecr:CompleteLayerUpload",
               "ecr:InitiateLayerUpload",
               "ecr:PutImage",
               "ecr:UploadLayerPart"
            ],
            "Resource": "arn:aws:ecr:{Region}:{Account_ID}:repository/snowflake-openflow/*",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "iam:AWSServiceName": "eks.amazonaws.com"
               }
            },
            "Action": [
               "iam:CreateServiceLinkedRole"
            ],
            "Resource": "arn:aws:iam::*:role/aws-service-role/eks.amazonaws.com/AWSServiceRoleForAmazonEKS",
            "Effect": "Allow"
      },
      {
            "Condition": {
               "StringLike": {
                  "iam:AWSServiceName": "eks-nodegroup.amazonaws.com"
               }
            },
            "Action": [
               "iam:CreateServiceLinkedRole"
            ],
            "Resource": "arn:aws:iam::*:role/aws-service-role/eks-nodegroup.amazonaws.com/AWSServiceRoleForAmazonEKSNodegroup",
            "Effect": "Allow"
      },
      {
            "Action": [
               "eks:AssociateAccessPolicy",
               "eks:ListAssociatedAccessPolicies",
               "eks:DisassociateAccessPolicy"
            ],
            "Resource": "arn:aws:eks:{Region}:{Account_ID}:access-entry/{key}/*",
            "Effect": "Allow"
      },
      {
            "Action": "iam:PassRole",
            "Resource": "*",
            "Effect": "Allow"
      }
   ]
}
```

**IAM Role: {key}-cluster-ServiceRole**

AWS-managed policies:

* AmazonEKSClusterPolicy
* AmazonEKSVPCResourceController

```json
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "cloudwatch:PutMetricData"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "ec2:DescribeAccountAttributes",
               "ec2:DescribeAddresses",
               "ec2:DescribeInternetGateways"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
```

**IAM Role: {key}-addon-vpc-cni-Role**

AWS-managed policies:

* AmazonEKS_CNI_Policy

**IAM Role: {key}-eks-role**

AWS-managed policies:

* AmazonEBSCSIDriverPolicy
* AmazonEC2ContainerRegistryReadOnly
* AmazonEKS_CNI_Policy
* AmazonEKSWorkerNodePolicy
* AmazonSSMManagedInstanceCore
* AutoScalingFullAccess
* ElasticLoadBalancingFullAccess

```json
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "ec2:CreateSecurityGroup",
               "ec2:CreateTags"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:ec2:{Region}:{Account_ID}:security-group/*",
               "arn:aws:ec2:{Region}:{Account_ID}:vpc/{VPC_ID}"
            ],
            "Sid": "CreateOpenflowEKSSecurityGroupAndTags"
      },
      {
            "Action": [
               "ec2:AuthorizeSecurityGroupIngress",
               "ec2:DeleteSecurityGroup"
            ],
            "Condition": {
               "StringLike": {
                  "aws:ResourceTag/Name": "eks-cluster-sg-{key}-*"
               }
            },
            "Effect": "Allow",
            "Resource": [
               "arn:aws:ec2:{Region}:{Account_ID}:security-group/*"
            ],
            "Sid": "OpenflowManageEKSSecurityGroup"
      }
   ]
}
```

> **Note:**
>
> {VPC_ID} represents the identifier of the VPC that was either created by BYOC or used by BYO-VPC.

**IAM Role: oidc-service-role-{key}**

```json
{
   "Statement": [
      {
            "Action": [
               "secretsmanager:GetSecretValue",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:ListSecretVersionIds"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:oidc-{key}*"
            ]
      }
   ],
   "Version": "2012-10-17"
}
```

**IAM Role: dps-postgres-role-{key}**

```json
{
   "Statement": [
      {
            "Action": [
               "secretsmanager:GetSecretValue",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:ListSecretVersionIds"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:postgres_creds-{key}*"
            ]
      }
   ],
   "Version": "2012-10-17"
}
```

**IAM Role: dps-temporal-role-{key}**

```json
{
   "Statement": [
      {
            "Action": [
               "secretsmanager:GetSecretValue",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:ListSecretVersionIds"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:temporal_creds-{key}*"
            ]
      }
   ],
   "Version": "2012-10-17"
}
```

**IAM Role: dp-service-role-{key}**

```json
{
   "Statement": [
      {
            "Action": [
               "secretsmanager:GetSecretValue",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:ListSecretVersionIds"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:dps_creds-{key}*",
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:snowflake-oauth2-{key}*"
            ]
      }
   ],
   "Version": "2012-10-17"
}
```

**IAM Role: oauth2-role-{key}**

```json
{
   "Statement": [
      {
            "Action": [
               "secretsmanager:GetSecretValue",
               "secretsmanager:DescribeSecret",
               "secretsmanager:GetResourcePolicy",
               "secretsmanager:ListSecretVersionIds"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:secretsmanager:{Region}:{Account_ID}:secret:snowflake-oauth2-{key}*"
            ]
      }
   ],
   "Version": "2012-10-17"
}
```

**IAM Role: {key}-nodegroup-NodeInstanceRole**

AWS-managed policies:

* AmazonEBSCSIDriverPolicy
* AmazonEC2ContainerRegistryReadOnly
* AmazonEKS_CNI_Policy
* AmazonEKSWorkerNodePolicy
* AmazonSSMManagedInstanceCore
* AutoScalingFullAccess
* ElasticLoadBalancingFullAccess

```json
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "servicediscovery:CreateService",
               "servicediscovery:DeleteService",
               "servicediscovery:GetService",
               "servicediscovery:GetInstance",
               "servicediscovery:RegisterInstance",
               "servicediscovery:DeregisterInstance",
               "servicediscovery:ListInstances",
               "servicediscovery:ListNamespaces",
               "servicediscovery:ListServices",
               "servicediscovery:GetInstancesHealthStatus",
               "servicediscovery:UpdateInstanceCustomHealthStatus",
               "servicediscovery:GetOperation",
               "route53:GetHealthCheck",
               "route53:CreateHealthCheck",
               "route53:UpdateHealthCheck",
               "route53:ChangeResourceRecordSets",
               "route53:DeleteHealthCheck",
               "appmesh:*"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "autoscaling:DescribeAutoScalingGroups",
               "autoscaling:DescribeAutoScalingInstances",
               "autoscaling:DescribeLaunchConfigurations",
               "autoscaling:DescribeScalingActivities",
               "autoscaling:DescribeTags",
               "ec2:DescribeInstanceTypes",
               "ec2:DescribeLaunchTemplateVersions"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "autoscaling:SetDesiredCapacity",
               "autoscaling:TerminateInstanceInAutoScalingGroup",
               "ec2:DescribeImages",
               "ec2:GetInstanceTypesFromInstanceRequirements",
               "eks:DescribeNodegroup"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "iam:CreateServiceLinkedRole"
            ],
            "Condition": {
               "StringEquals": {
                  "iam:AWSServiceName": "elasticloadbalancing.amazonaws.com"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DescribeAccountAttributes",
               "ec2:DescribeAddresses",
               "ec2:DescribeAvailabilityZones",
               "ec2:DescribeInternetGateways",
               "ec2:DescribeVpcs",
               "ec2:DescribeVpcPeeringConnections",
               "ec2:DescribeSubnets",
               "ec2:DescribeSecurityGroups",
               "ec2:DescribeInstances",
               "ec2:DescribeNetworkInterfaces",
               "ec2:DescribeTags",
               "ec2:GetCoipPoolUsage",
               "ec2:DescribeCoipPools",
               "elasticloadbalancing:DescribeLoadBalancers",
               "elasticloadbalancing:DescribeLoadBalancerAttributes",
               "elasticloadbalancing:DescribeListeners",
               "elasticloadbalancing:DescribeListenerCertificates",
               "elasticloadbalancing:DescribeSSLPolicies",
               "elasticloadbalancing:DescribeRules",
               "elasticloadbalancing:DescribeTargetGroups",
               "elasticloadbalancing:DescribeTargetGroupAttributes",
               "elasticloadbalancing:DescribeTargetHealth",
               "elasticloadbalancing:DescribeTags"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "cognito-idp:DescribeUserPoolClient",
               "acm:ListCertificates",
               "acm:DescribeCertificate",
               "iam:ListServerCertificates",
               "iam:GetServerCertificate",
               "waf-regional:GetWebACL",
               "waf-regional:GetWebACLForResource",
               "waf-regional:AssociateWebACL",
               "waf-regional:DisassociateWebACL",
               "wafv2:GetWebACL",
               "wafv2:GetWebACLForResource",
               "wafv2:AssociateWebACL",
               "wafv2:DisassociateWebACL",
               "shield:GetSubscriptionState",
               "shield:DescribeProtection",
               "shield:CreateProtection",
               "shield:DeleteProtection"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:AuthorizeSecurityGroupIngress",
               "ec2:RevokeSecurityGroupIngress"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:CreateSecurityGroup"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:CreateTags"
            ],
            "Condition": {
               "Null": {
                  "aws:RequestTag/elbv2.k8s.aws/cluster": "false"
               },
               "StringEquals": {
                  "ec2:CreateAction": "CreateSecurityGroup"
               }
            },
            "Effect": "Allow",
            "Resource": "arn:aws:ec2:*:*:security-group/*"
      },
      {
            "Action": [
               "ec2:CreateTags",
               "ec2:DeleteTags"
            ],
            "Condition": {
               "Null": {
                  "aws:RequestTag/elbv2.k8s.aws/cluster": "true",
                  "aws:ResourceTag/elbv2.k8s.aws/cluster": "false"
               }
            },
            "Effect": "Allow",
            "Resource": "arn:aws:ec2:*:*:security-group/*"
      },
      {
            "Action": [
               "ec2:AuthorizeSecurityGroupIngress",
               "ec2:RevokeSecurityGroupIngress",
               "ec2:DeleteSecurityGroup"
            ],
            "Condition": {
               "Null": {
                  "aws:ResourceTag/elbv2.k8s.aws/cluster": "false"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "elasticloadbalancing:CreateLoadBalancer",
               "elasticloadbalancing:CreateTargetGroup"
            ],
            "Condition": {
               "Null": {
                  "aws:RequestTag/elbv2.k8s.aws/cluster": "false"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "elasticloadbalancing:CreateListener",
               "elasticloadbalancing:DeleteListener",
               "elasticloadbalancing:CreateRule",
               "elasticloadbalancing:DeleteRule"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "elasticloadbalancing:AddTags",
               "elasticloadbalancing:RemoveTags"
            ],
            "Condition": {
               "Null": {
                  "aws:RequestTag/elbv2.k8s.aws/cluster": "true",
                  "aws:ResourceTag/elbv2.k8s.aws/cluster": "false"
               }
            },
            "Effect": "Allow",
            "Resource": [
               "arn:aws:elasticloadbalancing:*:*:targetgroup/*/*",
               "arn:aws:elasticloadbalancing:*:*:loadbalancer/net/*/*",
               "arn:aws:elasticloadbalancing:*:*:loadbalancer/app/*/*"
            ]
      },
      {
            "Action": [
               "elasticloadbalancing:AddTags",
               "elasticloadbalancing:RemoveTags"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:elasticloadbalancing:*:*:listener/net/*/*/*",
               "arn:aws:elasticloadbalancing:*:*:listener/app/*/*/*",
               "arn:aws:elasticloadbalancing:*:*:listener-rule/net/*/*/*",
               "arn:aws:elasticloadbalancing:*:*:listener-rule/app/*/*/*"
            ]
      },
      {
            "Action": [
               "elasticloadbalancing:ModifyLoadBalancerAttributes",
               "elasticloadbalancing:SetIpAddressType",
               "elasticloadbalancing:SetSecurityGroups",
               "elasticloadbalancing:SetSubnets",
               "elasticloadbalancing:DeleteLoadBalancer",
               "elasticloadbalancing:ModifyTargetGroup",
               "elasticloadbalancing:ModifyTargetGroupAttributes",
               "elasticloadbalancing:DeleteTargetGroup"
            ],
            "Condition": {
               "Null": {
                  "aws:ResourceTag/elbv2.k8s.aws/cluster": "false"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "elasticloadbalancing:AddTags"
            ],
            "Condition": {
               "Null": {
                  "aws:RequestTag/elbv2.k8s.aws/cluster": "false"
               },
               "StringEquals": {
                  "elasticloadbalancing:CreateAction": [
                        "CreateTargetGroup",
                        "CreateLoadBalancer"
                  ]
               }
            },
            "Effect": "Allow",
            "Resource": [
               "arn:aws:elasticloadbalancing:*:*:targetgroup/*/*",
               "arn:aws:elasticloadbalancing:*:*:loadbalancer/net/*/*",
               "arn:aws:elasticloadbalancing:*:*:loadbalancer/app/*/*"
            ]
      },
      {
            "Action": [
               "elasticloadbalancing:RegisterTargets",
               "elasticloadbalancing:DeregisterTargets"
            ],
            "Effect": "Allow",
            "Resource": "arn:aws:elasticloadbalancing:*:*:targetgroup/*/*"
      },
      {
            "Action": [
               "elasticloadbalancing:SetWebAcl",
               "elasticloadbalancing:ModifyListener",
               "elasticloadbalancing:AddListenerCertificates",
               "elasticloadbalancing:RemoveListenerCertificates",
               "elasticloadbalancing:ModifyRule"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "route53:ChangeResourceRecordSets"
            ],
            "Effect": "Allow",
            "Resource": "arn:aws:route53:::hostedzone/*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "route53:GetChange"
            ],
            "Effect": "Allow",
            "Resource": "arn:aws:route53:::change/*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "route53:ListResourceRecordSets",
               "route53:ListHostedZonesByName"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "ec2:CreateSnapshot",
               "ec2:AttachVolume",
               "ec2:DetachVolume",
               "ec2:ModifyVolume",
               "ec2:DescribeAvailabilityZones",
               "ec2:DescribeInstances",
               "ec2:DescribeSnapshots",
               "ec2:DescribeTags",
               "ec2:DescribeVolumes",
               "ec2:DescribeVolumesModifications"
            ],
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:CreateTags"
            ],
            "Condition": {
               "StringEquals": {
                  "ec2:CreateAction": [
                        "CreateVolume",
                        "CreateSnapshot"
                  ]
               }
            },
            "Effect": "Allow",
            "Resource": [
               "arn:aws:ec2:*:*:volume/*",
               "arn:aws:ec2:*:*:snapshot/*"
            ]
      },
      {
            "Action": [
               "ec2:DeleteTags"
            ],
            "Effect": "Allow",
            "Resource": [
               "arn:aws:ec2:*:*:volume/*",
               "arn:aws:ec2:*:*:snapshot/*"
            ]
      },
      {
            "Action": [
               "ec2:CreateVolume"
            ],
            "Condition": {
               "StringLike": {
                  "aws:RequestTag/ebs.csi.aws.com/cluster": "true"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:CreateVolume"
            ],
            "Condition": {
               "StringLike": {
                  "aws:RequestTag/CSIVolumeName": "*"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DeleteVolume"
            ],
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/ebs.csi.aws.com/cluster": "true"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DeleteVolume"
            ],
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/CSIVolumeName": "*"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DeleteVolume"
            ],
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/kubernetes.io/created-for/pvc/name": "*"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DeleteSnapshot"
            ],
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/CSIVolumeSnapshotName": "*"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      },
      {
            "Action": [
               "ec2:DeleteSnapshot"
            ],
            "Condition": {
               "StringLike": {
                  "ec2:ResourceTag/ebs.csi.aws.com/cluster": "true"
               }
            },
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "route53:ChangeResourceRecordSets"
            ],
            "Effect": "Allow",
            "Resource": "arn:aws:route53:::hostedzone/*"
      }
   ]
}
{
   "Version": "2012-10-17",
   "Statement": [
      {
            "Action": [
               "route53:ListHostedZones",
               "route53:ListResourceRecordSets",
               "route53:ListTagsForResource"
            ],
            "Effect": "Allow",
            "Resource": "*"
      }
   ]
}
```

---
title: Set up Openflow - Snowflake Deployment - Task overview
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment - Task overview

To setup an Openflow - Snowflake Deployment, perform the following tasks:

| Order | Task | Description | Persona |
| --- | --- | --- | --- |
| 1 | [Setup core Snowflake](setup-openflow-spcs-sf.md) | Before creating a deployment, you must configure core Snowflake which include an Openflow admin role, required privileges, and network configuration. | Snowflake administrator |
| 2 | Optionally [Set up PrivateLink UI access](setup-openflow-spcs-configure-pr-ui.md) | Configure PrivateLink to access the Snowflake Openflow Runtime UI using private connectivity. | Snowflake administrator |
| 3 | [Create deployment](setup-openflow-spcs-deployment.md) | After configuring core Snowflake, you then create an Openflow deployment.  Optionally, configure a Openflow-specific event table to store Openflow logs and metrics. | Deployment engineer, Snowflake administrator for event table configuration |
| 4 | [Create Snowflake role](setup-openflow-spcs-create-rr.md) | After creating an Openflow - Snowflake Deployment, you must create a Snowflake role and associated external access integrations. | Data engineer |
| 5 | [Create runtime](setup-openflow-spcs-create-runtime.md) | Create a runtime associated with the previously created Snowflake role. | Data engineer |
| 6 | [Configure allowed domains for Openflow connectors](setup-openflow-spcs-sf-allow-list.md) | Configure access to external domains for Openflow connectors. | Data engineer |
| 7 | [Connect your data sources using Openflow connectors](connectors/about-openflow-connectors.md) | Configure one or more connectors in the Openflow - Snowflake Deployment. | Data engineer |

Note that steps 3, 4 and 5 are typically repeated for each connector you want to configure in a given deployment.

## Next steps

[Set up Openflow - Snowflake Deployment: Core Snowflake](setup-openflow-spcs-sf.md)

---
title: Set up Openflow - Snowflake Deployment: Configure allowed domains for Openflow connectors
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-sf-allow-list.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment: Configure allowed domains for Openflow connectors

Openflow - Snowflake Deployments access external domain resources. Snowflake controls access to external domains
using [network rules](../../network-rules.md) and
[external access integrations](../../../developer-guide/external-network-access/creating-using-external-network-access.md)
to either grant or deny access to specific domains.

This topic describes the process of [creating a network rule](../../../sql-reference/sql/create-network-rule.md)
and [creating an external access integration](../../../sql-reference/sql/create-external-access-integration.md) to grant access to a specific domain.
In addition, the known domains used by Openflow connectors are provided.

Two possible workflows exist for managing access to external domains:

* Create a new network rule and external access integration: Create a new network rule that defines a list of allowed domain/port combinations
  and create a new external access integration using the newly created network rule.
* Alter an existing network rule: Alter an existing network rule
  to add a list of allowed domain/port combinations.

## Create a network rule granting access to one or more domains

To create a new network rule that grants access to one or more domain/port combinations,
execute an SQL statement similar to:

```sqlexample
USE ROLE SECURITYADMIN;

CREATE NETWORK RULE MY_OPENFLOW_NETWORK_RULE
   TYPE = HOST_PORT
   MODE = EGRESS
   VALUE_LIST = ('<domain>', '<domain>');
```

For example, to allow Snowflake to access `googleads.googleapis.com`, execute the following.

```sqlexample
USE ROLE SECURITYADMIN;

CREATE NETWORK RULE GOOGLEADS_OPENFLOW_NETWORK_RULE
   TYPE = HOST_PORT
   MODE = EGRESS
   VALUE_LIST = ('googleads.googleapis.com');
```

For more information, see [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md).

After the network rule is created, a external access integration has to be created.

To create a new integration, execute an SQL statement similar to:

```sqlexample
USE ROLE SECURITYADMIN;

CREATE EXTERNAL ACCESS INTEGRATION MY_OPENFLOW_EAI
   ALLOWED_NETWORK_RULES = (MY_OPENFLOW_NETWORK_RULE)
   ENABLED = TRUE
   COMMENT = 'External Access Integration for Openflow connectivity';
```

## Alter an existing network rule granting access to one or more domains

To alter an existing network rule to grant access to one or more domain/port combinations,
execute an SQL statement similar to:

```sqlexample
USE ROLE SECURITYADMIN;

ALTER NETWORK RULE GOOGLEADS_OPENFLOW_NETWORK_RULE SET
   VALUE_LIST = ('<existing domain>', '<existing domain>', 'googleads.googleapis.com');
```

For more information, see [ALTER NETWORK RULE](../../../sql-reference/sql/alter-network-rule.md).

> **Note:**
>
> Use [SHOW NETWORK RULES](../../../sql-reference/sql/show-network-rules.md) to list the existing network rules. .
> Use [DESCRIBE NETWORK RULE](../../../sql-reference/sql/desc-network-rule.md) to describe the properties of a specific network rule.

If the altered network rule is already associated with an external access integration, it will be updated automatically.
If you do not have an external access integration for the altered network rule,
refer to the section above for instructions on creating a new integration.

## Next steps

1. Associate an external access integration with your runtime:

   1. Navigate to the Openflow canvas.
   2. Select the Runtimes tab.
   3. For the runtime which requires the new external access integration,
      click the  menu.
   4. Select External access integrations.
   5. Select all required external access integrations from the dropdown list.
      .
      Note you may select multiple external access integrations.
   6. Click Save.

      > **Note:**
      >
      > Restarting the runtime is not required and the changes are applied immediately.
2. Deploy a connector in a runtime, for a list of connectors available in Openflow, see [Openflow connectors](connectors/about-openflow-connectors.md).

## Domains used by Openflow connectors

The following domains are used by Openflow connectors and require network rules to be granted access.

### Amazon Ads

The following domains are used by the Amazon Ads connector.

* `advertising-api.amazon.com`
* `advertising-api-eu.amazon.com`
* `advertising-api-fe.amazon.com`
* `api.amazon.com`
* `api.amazon.co.uk`
* `api.amazon.co.jp`
* Report location.
  For example, `offline-report-storage-eu-west-1-prod.s3.eu-west-1.amazonaws.com` is used to download reports.

The exact report URL location is not always known before creating a report.
Snowflake recommends allow listing all s3 regions:

> * `*.s3.eu-west-[1-3].amazonaws.com`
> * `*.s3.eu-central-[1-2].amazonaws.com`
> * `*.s3.eu-north-1.amazonaws.com`
> * `*.s3.eu-south-[1-2].amazonaws.com`
> * `*.s3.il-central-1.amazonaws.com`

* For advertising-api-fe.amazon.com (Far East / APAC):

  + `*.s3.ap-northeast-[1-3].amazonaws.com`
  + `*.s3.ap-south-[1-2].amazonaws.com`
  + `*.s3.ap-southeast-[1-7].amazonaws.com`
  + `*.s3.ap-east-[1-2].amazonaws.com`
  + `*.s3.me-south-1.amazonaws.com`
  + `*.s3.me-central-1.amazonaws.com`
  + `*.s3.af-south-1.amazonaws.com`

The last domain is obtained from the report URL is returned after the report is ready to fetch.
This is an Amazon S3 bucket where the report is stored. Customers will need to specify their own AWS region.
for example, `us-east-1` or `eu-west-1` and a specific bucket. As it may be not possible to know the
exact region and bucket, Snowflake suggests using wildcards and listing all possible regions for a given location.

### AWS Secret Manager

The following domains are used by the AWS Secret Manager connector.

* `secretsmanager.us-west-2.amazonaws.com`
* `sts.us-west-2.amazonaws.com`
* `aws.amazon.com`
* `amazonaws.com`

### Box

The following domains are used by the Box connector.

> * `api.box.com`
> * `box.com`

### Confluence

The following domains are used by the Confluence connector.

> * Customer-specific domain name, such as `https://company-name.atlassian.net/`.
> * For OAuth, <https://atlassian.company-name.com/>

### Microsoft Dataverse

The following domains are used by the Dataverse connector.

* Customer-specific domain name, such as `org12345467.crm.dynamics.com`
* For OAuth, `login.microsoftonline.com`

### Google Ads

The following domains are used by the Google Ads connector.

* `googleads.googleapis.com`

### Google Drive

The following domains are used by the Google Drive connector:

* `drive.google.com`
* `www.googleapis.com`
* `oauth2.googleapis.com`
* `www.googleapis.com`

### Google Sheets

The following domains are used by the Google Sheets connector.

* `sheets.googleapis.com`

### Hubspot

The following domains are used by the HubSpot connector.

* `api.hubapi.com`

### Jira Cloud

The following domains are used by the Jira Cloud connector.

* Customer-specific domain name, for example `company-name.atlassian.net`
* `api.atlassian.com`

### Kafka

The following domains are used by the Kafka connector.

* Customer Kafka bootstrap servers and all Kafka brokers

### Kinesis

The following domains are used by the Kinesis connector.

* AWS region dependent. For example:

  > for us-west-2:
  >
  > + `kinesis.us-west-2.amazonaws.com`
  > + `kinesis-fips.us-west-2.api.aws`
  > + `kinesis-fips.us-west-2.amazonaws.com`
  > + `kinesis.us-west-2.api.aws`
  > + `*.control-kinesis.us-west-2.amazonaws.com`
  > + `*.control-kinesis.us-west-2.api.aws`
  > + `*.data-kinesis.us-west-2.amazonaws.com`
  > + `*.data-kinesis.us-west-2.api.aws`
  > + `dynamodb.us-west-2.amazonaws.com`
  > + `monitoring.us-west-2.amazonaws.com:80`
  > + `monitoring.us-west-2.amazonaws.com:443`
  > + `monitoring-fips.us-west-2.amazonaws.com:80`
  > + `monitoring-fips.us-west-2.amazonaws.com:443`
  > + `monitoring.us-west-2.api.aws:80`
  > + `monitoring.us-west-2.api.aws:443`

### LinkedIn Ads

The following domains are used by the LinkedIn Ads connector.

* `www.linkedin.com`
* `api.linkedin.com`

### Meta Ads

The following domains are used by the Meta Ads connector.

* `graph.facebook.com`

### MySQL

The following domains are used by the MySQL connector.

* Customer-specific domain and port combination.

### PostgreSQL

The following domains are used by the PostgreSQL connector.

* Customer-specific domain and port combination.

### SharePoint

The following domains are used by the SharePoint connector.

* Customer-specific domain—for example, `company-domain.sharepoint.com` or an alias that redirects to `company-domain.sharepoint.com`
* `graph.microsoft.com:80`
* `graph.microsoft.com:443`
* `login.microsoftonline.com`

### Slack

The following domains are used by the Slack connector.

* `slack.com`
* `api.slack.com`
* `hooks.slack.com`
* `files.slack.com`
* `wss-primary.slack.com`
* `wss-backup.slack.com`

### SQL Server

The following domains are used by the SQL Server connector.

* Customer-specific domain and port combination.

### Workday

The following domains are used by the Workday connector.

* Customer-specific domain and port combination. For example, `company-domain.tenant.myworkday.com`.

  To obtain the domain, you can use the report URL (base URL is always the same).

---
title: Set up Openflow - Snowflake Deployment: Core Snowflake
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-sf.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment: Core Snowflake

Openflow - Snowflake Deployment requires the creation of the following Snowflake specific resources:

> 1. Create the OPENFLOW_ADMIN role
> 2. Configure required privileges

To complete these tasks, Sign in to [Snowsight](../../ui-snowsight-gs.md) and open a SQL worksheet.

## Create the OPENFLOW_ADMIN role

Create the required Openflow administration role.

> **Note:**
>
> `<OPENFLOW_USER>` denotes the user that will be used to access Openflow.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE IF NOT EXISTS OPENFLOW_ADMIN;

GRANT ROLE OPENFLOW_ADMIN TO USER <OPENFLOW_USER>;
```

> **Caution:**
>
> Users with a default role of ACCOUNTADMIN can’t login to Openflow - Snowflake Deployment runtimes and will get an error message when attempting to do so.
> Snowflake recommends assigning a different default role to any user that will login to a runtime.
> In addition, Snowflake recommends setting default secondary roles to `ALL` for all Openflow users.
>
> To change the default role and enable all secondary roles, execute the following:
>
> For example:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> ALTER USER <openflow_user> SET DEFAULT_ROLE = <openflow_admin>;
> ALTER USER <openflow_user> SET DEFAULT_SECONDARY_ROLES = ('ALL');
> ```

## Configure required privileges

Openflow requires defining specific Snowflake Account level privileges.
These privileges are assigned to the ACCOUNTADMIN role as part of the default set of privileges.
ACCOUNTADMIN will automatically have the following privileges and will be able to grant them
to a role of their choosing for the Openflow admin role, shown as `OPENFLOW_ADMIN` role in the following example:

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT CREATE OPENFLOW DATA PLANE INTEGRATION ON ACCOUNT TO ROLE OPENFLOW_ADMIN;
GRANT CREATE OPENFLOW RUNTIME INTEGRATION ON ACCOUNT TO ROLE OPENFLOW_ADMIN;
GRANT CREATE COMPUTE POOL ON ACCOUNT TO ROLE OPENFLOW_ADMIN;
```

## Next steps

Optionally, [Set up PrivateLink UI access](setup-openflow-spcs-configure-pr-ui.md) to access the Snowflake Openflow Runtime UI using private connectivity.

[Create deployment](setup-openflow-spcs-deployment.md)

---
title: Set up Openflow - Snowflake Deployment: Create deployment
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-deployment.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment: Create deployment

After configuring core Snowflake, create an Openflow deployment. A deployment is the
control plane component that manages your runtimes and connectors. Each deployment can host
multiple runtimes, and each runtime can run multiple connectors, giving you flexibility to
isolate workloads by project, team, or environment. There is no separate charge for the
deployment itself; only active runtimes consume Snowflake credits.

1. Create a deployment - create the deployment itself.
2. [Optional] Configure an Openflow-specific event table - configure an Openflow-specific event table to store Openflow logs and metrics.

## Create a deployment

> **Note:**
>
> To access the Openflow Runtime UI using PrivateLink as described in [Setup PrivateLink UI access](setup-openflow-spcs-configure-pr-ui.md),
> ensure the **PrivateLink** option is enabled when creating a new Openflow - Snowflake Deployment.

1. Sign in to [Snowsight](../../ui-snowsight-gs.md) with a role defined in [Configure core Snowflake requirements](setup-openflow-spcs-sf.md).
2. In the navigation menu, select Ingestion » Openflow.
3. Select Launch Openflow.
4. In the Openflow UI, select Create a deployment. The Deployments tab opens.
5. Select Create a deployment. The Creating a deployment wizard opens.
6. In the Prerequisites step, ensure that you meet all the requirements. Select Next.
7. In the Deployment location step, select Snowflake as the deployment location.
   Enter a name for your deployment. Select Next.
8. Select Create Deployment.

Your deployment will then be created.

## [Optional] Configure an Openflow-specific event table

Openflow generates logs and metrics and sends them to the Snowflake Event Table.
For helpful queries to analyze this telemetry data, see [Monitor Openflow](monitor.md).

By default, Openflow uses the [account event table](../../../developer-guide/logging-tracing/event-table-setting-up.md) (SNOWFLAKE.TELEMETRY.EVENTS), but you can configure an Openflow-specific event table per deployment. A dedicated event table is recommended to optimize query performance, enable granular access control, and simplify Openflow monitoring and maintenance.

1. To store the event table outside the Openflow database, grant the OPENFLOW_ADMIN role
   access to the `<DATABASE>` and `<SCHEMA>` where you want to store it:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   GRANT USAGE ON DATABASE <DATABASE> TO ROLE OPENFLOW_ADMIN;
   GRANT USAGE ON SCHEMA <DATABASE>.<SCHEMA> TO ROLE OPENFLOW_ADMIN;
   ```
2. Create the event table:

   ```sqlexample
   USE ROLE OPENFLOW_ADMIN;

   CREATE EVENT TABLE IF NOT EXISTS <DATABASE>.<SCHEMA>.EVENTS;
   ```
3. Get your dataplane name, which you use in the next step, from the `name` column:

   ```sqlexample
   SHOW OPENFLOW DATA PLANE INTEGRATIONS;
   ```
4. Set the event table for this deployment, replacing `<OPENFLOW_DATAPLANE_NAME>` with the value from the previous step:

   ```sqlexample
   ALTER OPENFLOW DATA PLANE INTEGRATION <OPENFLOW_DATAPLANE_NAME>
     SET EVENT_TABLE = '<DATABASE>.<SCHEMA>.EVENTS';
   ```

## [Optional] Create a monitoring role

A monitoring role lets data engineers or operations teams monitor Openflow without having the OPENFLOW_ADMIN role.

* To create a monitoring role, run the following code:

  ```sqlexample
  USE ROLE OPENFLOW_ADMIN;

  -- Create a role for monitoring Openflow deployments and runtimes if it doesn't yet exist
  CREATE ROLE IF NOT EXISTS <OPENFLOW_MONITOR_ROLE>;

  GRANT MONITOR ON OPENFLOW DATA PLANE INTEGRATION <OPENFLOW_DATAPLANE_NAME> TO ROLE <OPENFLOW_MONITOR_ROLE>;

  -- Add to role hierarchy so administrators can manage objects owned by this role
  GRANT ROLE <OPENFLOW_MONITOR_ROLE> TO ROLE <OPENFLOW_ADMIN_ROLE>;

  -- Grant the role to the appropriate Snowflake users
  GRANT ROLE <OPENFLOW_MONITOR_ROLE> TO USER <SNOWFLAKE_USER>;
  ```

### Next steps

[Create Snowflake role](setup-openflow-spcs-create-rr.md)

---
title: Set up Openflow - Snowflake Deployment: Create runtime
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-create-runtime.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment: Create runtime

A runtime is a containerized Apache NiFi instance that executes your data integration flows –
connectors and custom flow definitions. Each runtime is isolated for security and resource
control, and can scale from one node up to fifty to handle varying data volumes.

To create a runtime in your Snowflake deployment:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Openflow.
3. Select Launch Openflow. A new tab opens for the Openflow canvas.
4. In Openflow Control Plane, select Create a runtime. The Create Runtime dialog box appears.
5. In the Create Runtime populate the following fields:

   | Field | Description |
   | --- | --- |
   | Runtime Name | Enter a name for your runtime. |
   | Deployment drop down | Choose the deployment previously created in [Set up Openflow - Snowflake Deployment: Create deployment](setup-openflow-spcs-deployment.md) |
   | Node Type | Choose a node type from the Node type drop-down list. This specifies the size of your nodes. |
   | Min/Max node | In the Min/Max node range selector, select a range. The minimum value specifies the number of nodes that the runtime starts with when idle and the maximum value specifies the number of nodes that the runtime can scale up to, in the event of high data volume or CPU load. |
   | Snowflake Role | Choose the Snowflake role previously created in [Set up Openflow - Snowflake Deployment: Create Snowflake role](setup-openflow-spcs-create-rr.md). |
   | Usage Roles | Optionally, select the roles created to grant usage to the runtime for required databases, schema, and table access. |
   | External Access Integrations | Optionally, select the previously created external access integrations to grant access to external resources. |
6. Select Create. The runtime takes a couple of minutes to be created.

Once created, view your runtime by navigating to the Runtimes tab of the Openflow control plane.
Select the runtime to open the Openflow canvas.

## [Optional] Grant MONITOR privileges on the runtime

If you created a [monitoring role](setup-openflow-spcs-deployment.md) when setting up your deployment, you can add the runtime to that role. This allows data engineers or operations teams to monitor the runtime without having the OPENFLOW_ADMIN role.

* To add the runtime to the monitoring role, run the following code, replacing `<OPENFLOW_RUNTIME_NAME>` with the name of the Openflow runtime integration:

  ```sqlexample
  USE ROLE OPENFLOW_ADMIN;

  GRANT MONITOR ON OPENFLOW RUNTIME INTEGRATION <OPENFLOW_RUNTIME_NAME> TO ROLE <OPENFLOW_MONITOR_ROLE>;
  ```

## Next step

Configure allowed domains for Openflow connectors.
See [Set up Openflow - Snowflake Deployment: Configure allowed domains for Openflow connectors](setup-openflow-spcs-sf-allow-list.md).

---
title: Set up Openflow - Snowflake Deployment: Create Snowflake role
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-create-rr.md
section: Loading & Unloading Data
---

# Set up Openflow - Snowflake Deployment: Create Snowflake role

Openflow - Snowflake Deployment requires the creation of a number of resources which are specific not to a deployment
but to a specific runtime. Typically such resources include:

* Creation of Runtime specific Snowflake role
* Creation of Runtime specific network rules and External Access Integrations (EAI)

This topic describes the creation of these resources.

1. Create a Snowflake Role and associated privileges to write data to Snowflake Role for Runtimes on Snowflake Deployment Section
2. Associate Snowflake Role. See Snowflake Role for Runtimes in the Snowflake Deployment Section.
3. Create External Access Integrations and associate them to Runtimes.
   See Creating External Access Integrations
4. When Outbound PrivateLink connectivity is required to connect to a private system using SPCS Egress.

## Create a Snowflake role

When creating and editing Openflow Runtimes, Runtime Owners will have the ability to associate a role with the Runtime.
This role will be used for flows that execute within the Runtime.
For more information about Snowflake Roles, see [What is a Snowflake role?](about-spcs.md).

Creating a Snowflake role is a prerequisite for creating a Runtime and involves the following steps:

1. Create the role itself
2. Grant the role access to the warehouse used by the Runtime.
3. Grant the role access to the Snowflake objects used by the Runtime.
4. Grant the role access to the External Access Integrations used by the Runtime.

To create a Snowflake role:

1. Create the required Snowflake role.

   > **Note:**
   >
   > `<RUNTIMENAME>` denotes the name of the associated runtime.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE IF NOT EXISTS OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;

   GRANT ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME> TO USER <username>;
   ```
2. Allow the Snowflake role to use an existing warehouse that you are planning to use for data ingestion.
   Use this warehouse later when configuring your connectors for runtimes where you will be using this Snowflake role.

   ```sqlexample
   GRANT USAGE, OPERATE ON WAREHOUSE <OPENFLOW_INGEST_WAREHOUSE> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   ```
3. Allow the Snowflake role to use, create or otherwise access Snowflake objects.

   > **Note:**
   >
   > Depending on the Openflow connector being created the required underlying objects will vary.
   > The example below is for illustration purposes only.

   ```sqlexample
   GRANT USAGE ON DATABASE <OPENFLOW_SPCS_DATABASE> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   GRANT USAGE ON SCHEMA <OPENFLOW_SPCS_SCHEMA> TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIMENAME>;
   ```

### Creating Network Rules and External Access Integrations

Snowflake’s security model provides secure access to specific endpoints and systems
external to Snowflake using [network policies](../../network-policies.md).

Two key aspects of network policies are [Network rules](../../network-rules.md) and
[External Access Integrations (EAI)](../../../developer-guide/external-network-access/external-network-access-overview.md).
Each of which is used to provide secure access to external resources required by the runtime.

There are three steps that are required to create network rules and external access integrations:

1. Create the network rule, grouping the network identifiers into logical areas.
2. Create the external access integration (EAI), specifying the list of network rules and assuring the Snowflake Role has USAGE on the EAI.
3. Associate the EAI with the Runtime in the Openflow UI when creating Runtimes.

To create the required network rule and EAI, perform the following steps:

> **Note:**
>
> These examples use RUNTIME_NAME as a placeholder for the name of the Runtime being created.

1. Create an appropriate network rule. See [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md) for more information.

   > **Note:**
   >
   > `<OPENFLOW_DATABASE>` denotes the name of the database that will contain the network rule.
   > Snowflake suggests creating a specific database for network rules and external access integrations related to Openflow.

   ```sqlexample
   USE DATABASE <OPENFLOW_DATABASE>;

   CREATE NETWORK RULE IF NOT EXISTS OPENFLOW_<RUNTIME_NAME>_NETWORK_RULE
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('comma separated list of host:port pairs');
   ```
2. Create an external access integration, or add the network rule to an existing one.
   See [CREATE EXTERNAL ACCESS INTEGRATION](../../../sql-reference/sql/create-external-access-integration.md) for more information.

   To create a new EAI:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE EXTERNAL ACCESS INTEGRATION IF NOT EXISTS OPENFLOW_<RUNTIME_NAME>_EAI
      ALLOWED_NETWORK_RULES = (OPENFLOW_<RUNTIME_NAME>_NETWORK_RULE)
      ENABLED = TRUE;
   ```

   To add the network rule to an existing EAI, first check which rules are already
   associated with it, then update the EAI to include both the existing and new rules:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   -- Check the current rules on the EAI
   DESCRIBE EXTERNAL ACCESS INTEGRATION OPENFLOW_<RUNTIME_NAME>_EAI;
   ```

   In the output, find the `ALLOWED_NETWORK_RULES` property and note the existing rules.
   Then update the EAI, listing all existing rules along with the new one:

   ```sqlexample
   ALTER EXTERNAL ACCESS INTEGRATION OPENFLOW_<RUNTIME_NAME>_EAI
      SET ALLOWED_NETWORK_RULES = (
         <EXISTING_RULE_1>,
         <EXISTING_RULE_2>,
         OPENFLOW_<RUNTIME_NAME>_NETWORK_RULE
      );
   ```
3. Grant access to the EAI to the previously created Snowflake role.

   ```sqlexample
   GRANT USAGE ON INTEGRATION OPENFLOW_<RUNTIME_NAME>_EAI TO ROLE OPENFLOW_RUNTIME_ROLE_<RUNTIME_NAME>;
   ```

## Next steps

[Create runtime](setup-openflow-spcs-create-runtime.md)

---
title: Set up Openflow Connector for Kinesis for JSON data format
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kinesis/setup.md
section: Loading & Unloading Data
---

# Set up Openflow Connector for Kinesis for JSON data format

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to set up Openflow Connector for Kinesis for JSON data format.
This is a simplified connector optimized for basic message ingestion with schema evolution capabilities.

The Openflow Connector for Kinesis for JSON data format is designed for straightforward JSON message ingestion from Kinesis streams to Snowflake tables.

## Prerequisites

1. Review [About Openflow Connector for Kinesis](about.md).
2. Ensure that you have [set up Openflow with BYOC](../../setup-openflow-byoc.md) or [set up Openflow with Snowflake Deployments](../../setup-openflow-spcs.md).
3. If you are using Openflow - Snowflake Deployments, ensure that you have reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Kinesis](../../setup-openflow-spcs-sf-allow-list.md) connector.

> **Note:**
>
> If you need the support of other data formats or features, such as DLQ, reach out to your Snowflake representative.

## Set up a Kinesis stream

As an AWS administrator, perform the following actions in your AWS account:

1. Ensure that you have an AWS User with [IAM permissions to access Kinesis Streams and DynamoDB](https://docs.aws.amazon.com/streams/latest/dev/kcl-iam-permissions.html).
2. Ensure that the AWS User has configured [Access Key credentials](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_access-keys.html).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a destination database and destination schema that will be used to create destination tables for storing the data.

   1. If you plan to use the connector’s capability to automatically create destination table if it does not already exist, make sure the user has the required privileges for creating and managing Snowflake objects:

      | Object | Privilege | Notes |
      | --- | --- | --- |
      | Database | USAGE |  |
      | Schema | USAGE . CREATE TABLE . | After the schema-level objects have been created, the CREATE `object` privileges can be revoked. |
      | Table | OWNERSHIP | Only required when using the Kinesis connector to ingest data into an existing table. . If the connector creates a new target table for records from the Kinesis stream, the default role for the user specified in the configuration becomes the table owner. |

      You can use the following script to create and configure a custom role (requires SECURITYADMIN or equivalent):

      ```sqlexample
      USE ROLE SECURITYADMIN;

      CREATE ROLE kinesis_connector_role;
      GRANT USAGE ON DATABASE kinesis_db TO ROLE kinesis_connector_role;
      GRANT USAGE ON SCHEMA kinesis_schema TO ROLE kinesis_connector_role;
      GRANT CREATE TABLE ON SCHEMA kinesis_schema TO ROLE kinesis_connector_role;

      -- Only for existing tables.
      GRANT OWNERSHIP ON TABLE existing_table TO ROLE kinesis_connector_role;
      ```
3. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
4. Grant the Snowflake service user the role you created in the previous steps.

   ```sqlexample
   GRANT ROLE kinesis_connector_role TO USER kinesis_connector_user;
   ALTER USER kinesis_connector_user SET DEFAULT_ROLE = kinesis_connector_role;
   ```
5. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 3.
6. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. After the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you use the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
7. If any other Snowflake users require access to the ingested data and created tables (for example, for custom processing in Snowflake),
   grant those users the role created in step 2.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click the imported process group and select Parameters.
2. Populate the required parameter values as described in Parameters.

## Parameters

This section describes all parameters for the Openflow Connector for Kinesis for JSON data format.

The connector consists of several modules. To see the set, double-click the connector process group.
You can set the parameters for each module in the module’s parameter context.

### Snowflake destination parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Iceberg Enabled | Whether Iceberg is enabled for table operations. One of `true` / `false`. | Yes |
| Schema Evolution Enabled | Enables or disables schema evolution at the connector level. When enabled, allows automatic schema changes for tables. Note that schema evolution can also be controlled at the individual table level through table-specific parameters. One of: `true` / `false`. | Yes |
| Schema Evolution For New Tables Enabled | Controls whether schema evolution is enabled when creating new tables. When set to ‘true’, new tables will be created with ENABLE_SCHEMA_EVOLUTION = TRUE parameter. When set to ‘false’, new tables will be created with ENABLE_SCHEMA_EVOLUTION = FALSE parameter. Not applicable to Iceberg tables as they are not being created automatically. This setting only affects table creation, not existing tables. One of: `true` / `false`. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake Role.   You can find your Snowflake Role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |

### Kinesis JSON Source Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| AWS Region Code | The AWS region where your Kinesis Stream is located, for example `us-west-2`. | Yes |
| AWS Access Key ID | The AWS Access Key ID to connect to your Kinesis Stream, DynamoDB, and, optionally, CloudWatch. | Yes |
| AWS Secret Access Key | The AWS Secret Access Key to connect to your Kinesis Stream, DynamoDB, and, optionally, CloudWatch. | Yes |
| Kinesis Application Name | The name that is used for DynamoDB table name for tracking application’s progress on Kinesis Stream consumption. | Yes |
| Kinesis Consumer Type | The strategy used to read records from a Kinesis Stream. Must be one of the following values: `SHARED_THROUGHPUT` or `ENHANCED_FAN_OUT`. For more information, see [Develop enhanced fan-out consumers](https://docs.aws.amazon.com/streams/latest/dev/enhanced-consumers.html). | Yes |
| Kinesis Initial Stream Position | The initial stream position from which the data starts replication.  Possible values are:   * `LATEST`: Latest stored record * `TRIM_HORIZON`: Earliest stored record | Yes |
| Kinesis Stream Name | AWS Kinesis Stream Name to consume data from. | Yes |
| Metrics Publishing | Specifies where Kinesis Client Library metrics are published to. Possible values: `DISABLED`, `LOGS`, `CLOUDWATCH`. | Yes |

## Run the flow

1. Right-click the plane and select Enable all Controller Services.
2. Right-click the connector’s process group and select Start.

The connector starts the data ingestion.

### Table schema

The Snowflake table loaded by the connector contains columns named by the keys of your Kinesis messages.
The connector also adds a `KINESISMETADATA` column which stores metadata about the record.

Below is an example of a Snowflake table loaded by the connector:

| Row | ACCOUNT | SYMBOL | SIDE | QUANTITY | KINESISMETADATA |
| --- | --- | --- | --- | --- | --- |
| 1 | ABC123 | ZTEST | BUY | 3572 | { … KINESISMETADATA object … } |
| 2 | XYZ789 | ZABZX | SELL | 3024 | { … KINESISMETADATA object … } |
| 3 | XYZ789 | ZTEST | SELL | 799 | { … KINESISMETADATA object … } |
| 4 | ABC123 | ZABZX | BUY | 2033 | { … KINESISMETADATA object … } |

The `KINESISMETADATA` column contains an object with the following fields:

| Field Name | Field Type | Example Value | Description |
| --- | --- | --- | --- |
| `stream` | String | `stream-name` | The name of the Kinesis stream the record came from. |
| `shardId` | String | `shardId-000000000001` | The identifier of the shard in the stream the record came from. |
| `approximateArrival` | String | `2025-11-05T09:12:15.300` | The approximate time that the record was inserted into the stream (ISO 8601 format). |
| `partitionKey` | String | `key-1234` | The partition key specified by the data producer for the record. |
| `sequenceNumber` | String | `123456789` | The unique sequence number assigned by Kinesis Data Streams to the record in the shard. |
| `subSequenceNumber` | Number | `2` | The subsequence number for the record (used for aggregated records with the same sequence number). |
| `shardedSequenceNumber` | String | `12345678900002` | A combination of the sequence number and the subsequence number for the record. |

#### Schema evolution

The connector supports automatic schema detection and evolution. The structure
of tables in Snowflake is defined and evolved automatically to support the structure
of new data loaded by the connector.

Snowflake detects the schema of the incoming data and loads data into tables
that match any user-defined schema. Snowflake also allows adding
new columns or dropping the `NOT NULL` constraint from columns missing in new incoming records.

Schema detection with the connector infers data types based on the JSON data provided.

If the connector creates the target table, schema evolution is enabled by default.

If you want to enable or disable schema evolution on an existing table,
use the [ALTER TABLE](../../../../../sql-reference/sql/alter-table.md) command to set the `ENABLE_SCHEMA_EVOLUTION` parameter.
You must also use a role that has the `OWNERSHIP` privilege on the table. For more information, see [Enable automatic table schema evolution](../../../../data-load-schema-evolution.md).

However, if schema evolution is disabled for an existing table, then the connector
tries to send the rows with mismatched schemas to the configured failure output port.

## Iceberg table support

Openflow Connector for Kinesis can ingest data into a Snowflake-managed [Apache Iceberg™ table](../../../../tables-iceberg.md) when **Iceberg Enabled** is set to *true*.

### Requirements and limitations

Before you configure Openflow Connector for Kinesis for Iceberg table ingestion, note the following requirements and limitations:

* You must create an Iceberg table before running the connector.
* Make sure that the user has access to inserting data into the created tables.

### Configuration and setup

To configure Openflow Connector for Kinesis for Iceberg table ingestion, follow the steps in Set up Openflow Connector for Kinesis for JSON data format with a few differences noted in the following sections.

#### Enable ingestion into Iceberg table

To enable ingestion into an Iceberg table, you must set the `Iceberg Enabled` parameter to `true`.

#### Create an Iceberg table for ingestion

Before you run the connector, you must create an Iceberg table.
The initial table schema depends on your connector `Schema Evolution Enabled` property settings.

If schema evolution is enabled, you must create a table with a column named `kinesisMetadata`.
The connector automatically creates the columns for message fields and alters the `kinesisMetadata` column schema.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    kinesisMetadata OBJECT()
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table'
  ENABLE_SCHEMA_EVOLUTION = true;
```

If schema evolution is disabled, you must create the table with all fields the Kinesis message contains.
When you create an Iceberg table, you can use Iceberg data types or
[compatible Snowflake types](../../../../tables-iceberg-data-types.md).
The semi-structured VARIANT type isn’t supported. Instead, use a
[structured OBJECT or MAP](../../../../../sql-reference/data-types-structured.md).

For example, consider the following message:

```sqljson
{
    "id": 1,
    "name": "Steve",
    "body_temperature": 36.6,
    "approved_coffee_types": ["Espresso", "Doppio", "Ristretto", "Lungo"],
    "animals_possessed":
    {
        "dogs": true,
        "cats": false
    },
    "date_added": "2024-10-15"
}
```

The following statement creates a table with all fields the Kinesis message contains:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    kinesisMetadata OBJECT(
        stream STRING,
        shardId STRING,
        approximateArrival STRING,
        partitionKey STRING,
        sequenceNumber STRING,
        subSequenceNumber INTEGER,
        shardedSequenceNumber STRING
    ),
    id INT,
    body_temperature FLOAT,
    name STRING,
    approved_coffee_types ARRAY(STRING),
    animals_possessed OBJECT(dogs BOOLEAN, cats BOOLEAN),
    date_added DATE
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table';
```

> **Note:**
>
> `kinesisMetadata` must always be created. Field names inside nested structures such as `dogs` or `cats` are case sensitive.

---
title: Set up PrivateLink UI access in Openflow - Snowflake Deployments
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/setup-openflow-spcs-configure-pr-ui.md
section: Loading & Unloading Data
---

# Set up PrivateLink UI access in Openflow - Snowflake Deployments

This topic explains how to configure access to the Snowflake Openflow Runtime UI using private connectivity.

> **Important:**
>
> This is an optional task. If you will not be accessing the Openflow Runtime UI using public connectivity,
> you can skip this task.

There are two tasks to configure access to the Snowflake Openflow Runtime UI using private connectivity:

1. Determine PrivateLink URLs
2. Configure PrivateLink for Openflow Runtime UI access

## Prerequisites

Before configuring private link for the Openflow Runtime UI, enable PrivateLink for your account as described in [AWS PrivateLink and Snowflake](../../admin-security-privatelink.md).

## Determine PrivateLink URLs

1. Using the ACCOUNTADMIN role, call the SYSTEM$GET_PRIVATELINK_CONFIG function in your Snowflake account and identify the value for `openflow-privatelink-url`. This is the URL for accessing Openflow UI over PrivateLink in the form:

   * `<org>-<account>.openflow.<shard-id>.privatelink.snowflakecomputing.com`
2. The URL for accessing the Runtime UI in a Snowflake deployment will be in the form:

   * `of--<org>-<account>.spcs.<shard-id>.privatelink.snowflake.app`
3. Create CNAME records in your DNS to resolve these URL values to your VPC endpoint.
4. Confirm that your DNS settings can resolve the value.
5. Confirm that you can connect to Openflow UI using this URL from your browser.
6. Confirm that you can connect to Runtime UI using this URL from your browser.

## Configure PrivateLink for Openflow Runtime UI access

Perform the following steps:

1. Retrieve Snowflake’s VPC endpoint service ID and Openflow PrivateLink URLs:

   1. As a user with the ACCOUNTADMIN role, execute

   ```sqlexample
   SELECT SYSTEM$GET_PRIVATELINK_CONFIG();
   ```

   1. From the output, identify and save the values for the following keys:

      * `privatelink-vpce-id`
      * `openflow-privatelink-url`
      * `external-telemetry-privatelink-url`
   2. Construct the Runtime URL

      * `of--<org>-<account>.spcs.<shard-id>.privatelink.snowflake.app`
2. Create a VPC endpoint with parameters:

   > **Note:**
   >
   > If the Snowflake account where you plan to create your Openflow Deployment
   > had previously configured PrivateLink for Snowsight,
   > use the existing AWS VPC endpoint and add the additional OpenFlow DNS records to your Route 53.

   * Type: `PrivateLink Ready partner services`
   * Service: `privatelink-vpce-id` value obtained in the previous step.
   * VPC: The VPC where your Openflow deployment will be running.
   * Subnets: Select two availability zones and private subnets where your Openflow deployment will run.
3. Set up a Route 53 private hosted zone for Openflow UI with the following parameters:

   * Domain: `privatelink.snowflakecomputing.com`
   * Type: `Private hosted zone`
   * Select the region and VPC where your Openflow deployment will run.
4. Set up a Route 53 private hosted zone for Openflow UI with the following parameters:

   * Domain: `privatelink.snowflakecomputing.com`
   * Type: `Private hosted zone`
   * Select the region and VPC where your Openflow deployment will run.
5. Set up a Route 53 private hosted zone for Runtime UI with the following parameters:

   * Domain: `privatelink.snowflake.app`
   * Type: `Private hosted zone`
   * Select the region and VPC where your Openflow deployment will run.
6. Add two CNAME records for the URLs identified in the first step:

   * For `openflow-privatelink-url`

     + Record name: `openflow-privatelink-url` value obtained in the first step
     + Record type: `CNAME`
     + Value: DNS name of your VPC endpoint
   * For Runtime UI URL

     + Record name: `openflow-runtime-ui-privatelink-url` value obtained in the first step
     + Record type: `CNAME`
     + Value: DNS name of your VPC endpoint

> **Note:**
>
> When creating a new Openflow - Snowflake Deployment, ensure the **PrivateLink** option is enabled.

### Next steps

[Create deployment](setup-openflow-spcs-deployment.md)

---
title: Set up tasks for the Openflow Connector for Oracle
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/setup-tasks.md
section: Loading & Unloading Data
---

# Set up tasks for the Openflow Connector for Oracle

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes the overall tasks required to set up, configure, and run the Openflow Connector for Oracle.

## Prerequisites

Before you set up the Openflow Connector for Oracle, verify that the following prerequisites are met:

1. Ensure that you have reviewed [About Openflow Connector for Oracle](about.md).
2. Ensure that you have set up an Openflow deployment:

   * [Set up Openflow - BYOC](../../setup-openflow-byoc.md)
   * [Set up Openflow - Snowflake Deployment](../../setup-openflow-spcs.md)
3. Ensure that you add only one connector instance per runtime.

## Tasks

Perform the following tasks to set up, configure, and run the Openflow Connector for Oracle.

| Order | Task | Description | Persona |
| --- | --- | --- | --- |
| 1 | Review Prerequisites | Review and confirm all required prerequisites. | **Snowflake account administrator** |
| 2 | [Enable the connector](manage-commercial-terms.md) | Accept the Oracle XStream terms to make the connector visible in the list of available connectors. | **Organization administrator (ORGADMIN)** |
| 3 | [Configure the Oracle database](setup-oracledb.md) | Configure the Oracle database for Openflow Connector for Oracle including replication settings and credentials. | **Oracle database administrator** |
| 4 | [Set up Snowflake](setup-snowflake.md) | Create the destination database, service user, role, warehouse, and key pair authentication for the Openflow Connector for Oracle. | **Snowflake account administrator** |
| 5 | [Configure the connector](setup-connector.md) | Install, configure, and run the Openflow Connector for Oracle connector. | **Snowflake account administrator** |
| 6 | [Set up licensing](manage-commercial-terms.md) | Configure your licensing model after the connector detects your source database inventory. | **Organization administrator (ORGADMIN)** |

## Next steps

* [Monitor the flow](../../monitor.md).
* [Maintenance](maintenance.md) for reinstalling the connector or changing the XStream position.

---
title: Set up the Openflow Connector for Amazon Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/amazon-ads/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Amazon Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Amazon Ads.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Amazon Ads](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you have reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Amazon Ads](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As an Amazon Ads administrator, perform the following actions:

1. Make sure that you have access to an [Amazon Ads account](https://advertising.amazon.com/).
2. [Acquire Access to Amazon Ads API](https://advertising.amazon.com/API/docs/en-us/guides/onboarding/overview) and complete the onboarding process.
3. [Get client ID and client secret](https://advertising.amazon.com/API/docs/en-us/guides/get-started/retrieve-access-token).
4. [Create an authorization grant](https://advertising.amazon.com/API/docs/en-us/guides/get-started/create-authorization-grant)
   and [retrieve a refresh token](https://advertising.amazon.com/API/docs/en-us/guides/get-started/retrieve-access-token).
5. Review the [available regions](https://advertising.amazon.com/API/docs/en-us/reference/api-overview#api-endpoints)
   and get a base URL used for requests based on the region in which you are advertising.
6. [Fetch profile IDs](https://advertising.amazon.com/API/docs/en-us/guides/get-started/retrieve-profiles) for report configuration.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow,
   for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Amazon Ads source parameters: Used to establish connection with Amazon Ads API.
* Amazon Ads destination parameters: Used to establish connection with Snowflake.
* Amazon Ads ingestion parameters: Used to define the configuration of data downloaded from Amazon Ads.

#### Amazon Ads source parameters

| Parameter | Description |
| --- | --- |
| Client ID | Client ID of the Amazon Advertising account |
| Client Secret | Client secret of the Amazon Advertising account |
| OAuth Base URL | The URL of the authorization server that issues the access token  Possible values:  * <https://api.amazon.com/auth/o2/token> * <https://api.amazon.co.uk/auth/o2/token> * <https://api.amazon.co.jp/auth/o2/token> |
| Refresh Token | Refresh Token for Amazon Ads API |
| Region | Environment from which the advertising data is downloaded  Possible values:  * NA * EU * FE |

#### Amazon Ads destination parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Amazon Ads Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Report Name | Name of the report to be used as a destination table name. The name must be unique within the destination schema. |
| Report Ad Product | Type of advertising product being reported  Possible values:  * SPONSORED_PRODUCTS * SPONSORED_BRANDS * SPONSORED_DISPLAY * SPONSORED_TELEVISION * DEMAND_SIDE_PLATFORM |
| Report Columns | Set of columns which will be present in the end report. The list of available columns depends on the report type and can be found in the [Amazon Ads API documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/overview). For example, for the `spCampaigns` report type, the list of available columns can be found in the [Sponsored Products documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/campaign#sponsored-products). |
| Report Filters | Set of filters used to trim the data returned. The list of available filters depends on the report type and can be found in the [Amazon Ads API documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/overview). For example, for the `spCampaigns` report type, the list of available filters can be found in the [Sponsored Products documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/campaign#sponsored-products). Filters must be in the format of `columnName=filterValue` and values must separated by a comma (`,`). For example, `campaignStatus=ENABLED,PAUSED`. |
| Report Group By | Determines the level of granularity and how the data within the report will be aggregated and presented. The list of available group by columns depends on the report type and can be found in the [Amazon Ads API documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/overview). For example, for the `spCampaigns` report type, the list of available group by columns can be found in the [Sponsored Products documentation](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/campaign#sponsored-products). |
| Report Ingestion Strategy | Mode in which data is fetched, either snapshot or incremental  Possible values:  * `SNAPSHOT` * `INCREMENTAL` |
| Report Ingestion Window | Specifies the number of days, data from which should be downloaded during incremental ingestion. For example, with a 30-day report ingestion window, an incremental load starts ingestion from 30 days prior to the last successful ingestion date, unless this calculated date falls before the overall start date, in which case ingestion begins from the overall start date. If the `SNAPSHOT` ingestion strategy is used, all available data from the start date to the present is downloaded, so there is no need to use a report ingestion window. |
| Report Profile ID | The [profile ID](https://advertising.amazon.com/API/docs/en-us/guides/get-started/retrieve-profiles) associated with an advertising account in a specific marketplace |
| Report Time Unit | Date aggregation  Possible values:  * `DAILY`: Each day is represented by a one row * `SUMMARY`: The whole ingested date period is represented as one row |
| Report Type | The Amazon Ads API supports a number of [report types](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/overview). For example: [sbAds](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/ad) and [spCampaigns](https://advertising.amazon.com/API/docs/en-us/guides/reporting/v3/report-types/campaign). Copy value of `reportTypeId` from the documentation and paste it into the parameter value. |
| Report Start Date | Start date from which the ingestion should happen. The date format is YYYY-MM-DD. |
| Report Schedule | Schedule time for processor creating reports. For example: `8 h` or `1 d`. The `h` represents hours and `d` days. |

> **Note:**
>
> Data retention in the Amazon Ads API is a specific timeframe, ranging from 60
> to 365 days depending on the report type, during which historical advertising
> performance data is stored and accessible for retrieval.
> After this period, older data may no longer be available.

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start.

   The connector starts the data ingestion.

---
title: Set up the Openflow Connector for Box
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/box/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Box

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Box.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Box](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you have reviewed [configuring requireddomains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Box](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a **Box developer** or **Box administrator**, create a [Box Platform application](https://developer.box.com/guides/applications/app-types/platform-apps/) as follows:

1. Navigate to [Box Developer Console](https://app.box.com/developers/console).
2. Select Create Platform App.
3. Select Custom App as the application type.
4. Provide a name and description for the app, and select a purpose from the drop-down list.
5. Select Server Authentication (with JWT) as the authentication method.
6. Select Create App.
7. To configure the app, navigate to the Configuration tab.
8. In the App Access Level section, select App + Enterprise Access.
9. In the Application Scopes section, select the following options:

   > * Read all files and folders stored in Box.
   > * Write all files and folders stored in Box: To download files and folders. Note that the connector can’t upload any files.
   >   Snowflake recommends granting the service account with only the Viewer role.
   >   To grant the application access to files in Box, select a folder that you want to synchronize. Share it with the app service account using the email of the service account from step n.
   >   Openflow Connector for Box is able to discover and download files from the specified folder and all its subfolders, but it cannot modify the files.
   > * Manage users: To read users in the enterprise.
   > * Manage groups: To read groups and their members in the enterprise.
   > * Manage enterprise properties: To read enterprise events.
10. In the Add and Manage Public Keys section, generate a public/private key pair. Box downloads a JSON configuration file with a private key.
11. Save the changes.
12. Navigate to the Authorization tab, and submit the app for authorization for access to the enterprise.
13. Request your enterprise administrator to approve the app.
14. After the approval is granted, go to the General Settings tab and save the app service account email address.

    For more information, see [Setup with JWT](https://developer.box.com/guides/authentication/jwt/jwt-setup/).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks manually
or by using the script included below:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

### Example setup

> ```sqlexample
> --The following script assumes you'll need to create all required roles, users, and objects.
> --However, you may want to reuse some that are already in existence.
>
> --Create a Snowflake service user to manage the connector
> USE ROLE USERADMIN;
> CREATE USER <openflow_service_user> TYPE=SERVICE COMMENT='Service user for Openflow automation';
>
> --Create a pair of secure keys (public and private). For more information, see
> --key-pair authentication. Store the private key for the user in a file to supply
> --to the connector’s configuration. Assign the public key to the Snowflake service user:
> ALTER USER <openflow_service_user> SET RSA_PUBLIC_KEY = '<pubkey>';
>
>
> --Create a role to manage the connector and the associated data and
> --grant it to that user
> USE ROLE SECURITYADMIN;
> CREATE ROLE <openflow_connector_admin_role>;
> GRANT ROLE <openflow_connector_admin_role> TO USER <openflow_service_user>;
>
>
> --The following block is for the use case: Ingest files and perform processing with Cortex
> --Create a role for read access to the cortex search service created by this connector.
> --This role should be granted to any role that will use the service
> CREATE ROLE <cortex_search_service_read_only_role>;
> GRANT ROLE <cortex_search_service_read_only_role> TO ROLE <whatever_roles_will_access_search_service>;
>
> --Create the database the data will be stored in and grant usage to the roles created
> USE ROLE ACCOUNTADMIN; --use whatever role you want to own your DB
> CREATE DATABASE IF NOT EXISTS <destination_database>;
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_connector_admin_role>;
>
> --Create the schema the data will be stored in and grant the necessary privileges
> --on that schema to the connector admin role:
> USE DATABASE <destination_database>;
> CREATE SCHEMA IF NOT EXISTS <destination_schema>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
> GRANT CREATE TABLE, CREATE DYNAMIC TABLE, CREATE STAGE, CREATE SEQUENCE, CREATE CORTEX
> SEARCH SERVICE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
>
> --The following block is for use case: Ingest files and perform processing with Cortex
> --Grant the Cortex read-only role access to the database and schema
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <cortex_search_service_read_only_role>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <cortex_search_service_read_only_role>;
>
> --Create the warehouse this connector will use if it doesn't already exist. Grant the
> --appropriate privileges to the connector admin role. Adjust the size according to your needs.
> CREATE WAREHOUSE <openflow_warehouse>
> WITH
>    WAREHOUSE_SIZE = 'MEDIUM'
>    AUTO_SUSPEND = 300
>    AUTO_RESUME = TRUE;
> GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_connector_admin_role>;
> ```

## Use cases

You can configure the connector for the following use cases:

* Ingest files only
* Ingest files and perform processing with Cortex
* Extract Box metadata using Box AI and ingest it into a Snowflake table
* Synchronize Box file metadata instances with a Snowflake table

### Ingest files only

Use the connector definition to perform custom processing on ingested files.

#### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

##### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

##### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Enter the required parameter values as described in Box ingestion parameters, Box destination parameters and Box source parameters.

###### Box source parameters

| Parameter | Description |
| --- | --- |
| Box App Config JSON | An application JSON configuration that was downloaded during the app creation. |
| Box App Config File | An application json file that was downloaded during the app creation. Either “Box App Config File” or “Box App Config JSON” has to be set. Select the Reference asset checkbox to upload the config file. |

###### Box destination parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

###### Box ingestion parameters

| Parameter | Description |
| --- | --- |
| Box Folder ID | The ID of the folder to read the files from. Set this to `0` to synchronize all folders the Box app has access to. It can be retrieved from the URL, for example <https://app.box.com/folder/FOLDER_ID>. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Snowflake File Hash Table Name | Name of the table to store file hashes to determine if the content has changed. This parameter should generally not be changed. |

#### Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

After starting the connector, it retrieves all files from the specified folder, and then consumes `admin_logs_streaming` events within the last 14 days.
This is done to capture data that may otherwise have been missed during the initialization process.
During that time, `not found` errors may occur, which are caused by files that appear in the events but are no longer present.

### Ingest files and perform processing with Cortex

Use the connector definition to:

* Create AI assistants for public documents within your organization’s Box enterprise
* Enable your AI assistants to adhere to access controls specified in your organization’s Box enterprise

#### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

##### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

##### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Box Cortex Connect Ingestion Parameters, Box Cortex Connect Destination Parameters and Box Cortex Connect Source Parameters.

###### Box Cortex Connect Source Parameters

| Parameter | Description |
| --- | --- |
| Box App Config JSON | An application JSON configuration that was downloaded during the app creation. |
| Box App Config File | An application json file that was downloaded during the app creation. Either “Box App Config File” or “Box App Config JSON” has to be set. Select the Reference asset checkbox to upload the config file. |

###### Box Cortex Connect Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

###### Box Cortex connect ingestion parameters

| Parameter | Description |
| --- | --- |
| Box Folder ID | The ID of the folder to read the files from. Set this to `0` to synchronize all folders the Box app has access to. It can be retrieved from the URL, for example <https://app.box.com/folder/FOLDER_ID>. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Snowflake File Hash Table Name | Name of the table to store file hashes to determine if the content has changed. This parameter should generally not be changed. |
| OCR Mode | The OCR mode to use when parsing files with [Parsing documents with AI_PARSE_DOCUMENT](../../../../snowflake-cortex/parse-document.md) function. The value can be `OCR` or `LAYOUT`. |
| Snowflake Cortex Search Service User Role | An identifier of a role that is assigned usage permissions on the Cortex Search service. |
| Snowflake File Hash Table Name | Name of the table to store file hashes to determine if the content has changed. This parameter should generally not be changed. |

#### Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

After starting the connector, it retrieves all files from the specified folder, and then consumes `admin_logs_streaming` events within the last 14 days.
This is done to capture any data that may have been missed during the initialization process.
During that time, `not found` errors may occur, caused by the files that appear in the events but are no longer present.

#### Query the Cortex Search service

You can use the [Cortex Search](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md) service to build chat
and search applications to chat with or query your documents in Box.

After you install and configure the connector and it begins
ingesting content from Box, you can query the Cortex Search service.
For more information about using Cortex Search, see [Query a Cortex Search service](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md).

**Filter responses**

To restrict responses from the Cortex Search service to documents that a specific user
has access to in Box, you can specify a filter containing the user ID or email address of the user
when you query Cortex Search. For example, `filter.@contains.user_ids` or `filter.@contains.user_emails`.
The name of the Cortex Search service created by the connector is `search_service` in the schema `Cortex`.

Run the following SQL code in a SQL worksheet to query
the Cortex Search service with files ingested from your Box site.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.
* `your_question`: The question that you want to get responses for.
* `number_of_results`: Maximum number of results to return in the response. The maximum value is 1,000 and the default value is 10.

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<application_instance_name>.cortex.search_service',
      '{
        "query": "<your_question>",
         "columns": ["chunk", "web_url"],
         "filter": {"@contains": {"user_emails": "<user_emailID>"} },
         "limit": <number_of_results>
       }'
   )
)['results'] AS results
```

Here is a complete list of values that you can enter for `columns`:

| Column name | Type | Description |
| --- | --- | --- |
| `full_name` | String | A full path to the file from the Box site documents root. Example: `folder_1/folder_2/file_name.pdf`. |
| `web_url` | String | A URL that displays an original Box file in a browser. |
| `last_modified_date_time` | String | Date and time when the item was most recently modified. |
| `chunk` | String | A piece of text from the document that matched the Cortex Search query. |
| `user_ids` | Array | An array of user IDs that have access to the document. |
| `user_emails` | Array | An array of user email IDs that have access to the document. It also includes user email IDs from all the Microsoft 365 groups that are assigned to the document. |

**Example: Query an AI assistant for human resources (HR) information**

You can use Cortex Search to query an AI assistant for employees to chat with the latest versions of
HR information, such as onboarding, code of conduct, team processes, and organization policies.
Using response filters, you can also allow HR team members to query employee contracts while adhering to access controls configured in Box.

SQLPythonREST API

Run the following in a [SQL worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the Cortex Search service with files ingested from Box.
Select the database as your application instance name and schema as **Cortex**.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```sqlexample
SELECT PARSE_JSON(
     SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
          '<application_instance_name>.cortex.search_service',
          '{
             "query": "What is my vacation carryover policy?",
             "columns": ["chunk", "web_url"],
             "filter": {"@contains": {"user_emails": "<user_emailID>"} },
             "limit": 1
          }'
     )
 )['results'] AS results
```

Run the following code in a [Python worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the
Cortex Search service with files ingested from Box.
Ensure that you add the `snowflake.core` package to your database.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.core import Root

def main(session: snowpark.Session):

   root = Root(session)

   # fetch service
   my_service = (root
     .databases["<application_instance_name>"]
     .schemas["cortex"]
     .cortex_search_services["search_service"]
   )

   # query service
   resp = my_service.search(
     query="What is my vacation carryover policy?",
     columns = ["chunk", "web_url"],
     filter = {"@contains": {"user_emails": "<user_emailID>"} },
     limit=1
   )
   return (resp.to_json())
```

Execute the following code in a command-line interface to query the Cortex Search
service with files ingested from your Box.
Access to the Snowflake REST APIs requires authentication via both key pair authentication and OAuth.
For more information,
see [REST API](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md)
and [Authenticating Snowflake REST APIs with Snowflake](../../../../../developer-guide/snowflake-rest-api/authentication.md).

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `account_url`: Your Snowflake account URL. For instructions on finding your account URL, see [Finding the organization and account name for an account](../../../../admin-account-identifier.md).

```bash
curl --location "https://<account_url>/api/v2/databases/<application_instance_name>/schemas/cortex/cortex-search-services/search_service" \
     --header 'Content-Type: application/json' \
     --header 'Accept: application/json' \
     --header "Authorization: Bearer <CORTEX_SEARCH_JWT>" \
     --data '{
         "query": "What is my vacation carryover policy?",
         "columns": ["chunk", "web_url"],
         "limit": 1
     }'
```

Sample response:

```output
{
  "results" : [ {
  "web_url" : "https://<domain>.box.com/sites/<site_name>/<path_to_file>",
  "chunk" : "Answer to the question asked."
  } ]
}
```

### Extract Box metadata using Box AI and ingest it into a Snowflake table

Use the connector definition to:

* Extract metadata about your Box files and ingest them to into a Snowflake table
* Perform operations on the metadata of your files stored in Box

#### Create a Snowflake table for storing the Box metadata

1. Ensure that Box AI is enabled for the extraction of metadata to occur. For more information, see [Configuring Box AI](https://support.box.com/hc/en-us/articles/22166647877011-Configuring-Box-AI).
2. Create a Snowflake table where the metadata will be sent

   For the connector to know what kind of metadata to extract, you must create a Snowflake table in your database and schema with the column names of the fields you would like to extract.
   Add descriptions to each column to improve the performance of the model used to extract the metadata from the files.
3. In the table created in the previous step, ensure that there is a column to store the Box file ID and that it is of type VARCHAR.

   The name of this column is required to be entered as the Box File Identifier Column parameter in later steps.
   The list of supported columns types for the metadata table is VARCHAR, STRING, TEXT, FLOAT, DOUBLE, and DATE.

Here is an example of the table that you can create for this connector:

```sqlexample
CREATE OR REPLACE TABLE OPENFLOW.BOX_METADATA_SCHEMA.LOAN_AGREEMENT_METADATA (
  BOX_FILE_ID               VARCHAR    COMMENT 'Box file identifier column',
  LOAN_ID                   STRING     COMMENT 'Unique loan agreement identifier (e.g. L-2025-0001)',
  BORROWER_NAME             STRING     COMMENT 'Name of the borrower entity or individual',
  LENDER_NAME               STRING     COMMENT 'Name of the lending institution',
  LOAN_AMOUNT               DOUBLE     COMMENT 'Principal amount of the loan (in USD)',
  INTEREST_RATE             FLOAT      COMMENT 'Annual interest rate (%)',
  EFFECTIVE_DATE            DATE       COMMENT 'Date on which the loan becomes effective',
  MATURITY_DATE             DATE       COMMENT 'Scheduled loan maturity date',
  LOAN_TERM_MONTHS          FLOAT      COMMENT 'Original term length in months',
  COLLATERAL_DESCRIPTION    TEXT       COMMENT 'Description of collateral securing the loan',
  CREDIT_SCORE              FLOAT      COMMENT 'Borrower credit score',
  JURISDICTION              STRING     COMMENT 'Governing law jurisdiction (e.g. NY, CA)'
);
```

#### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

##### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

##### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Box Ingest Metadata Source Parameters, Box Ingest Metadata Destination Parameters and Box Ingest Metadata Ingestion Parameters.

###### Box Ingest Metadata Source Parameters

| Parameter | Description |
| --- | --- |
| Box App Config JSON | An application JSON configuration that was downloaded during the app creation. |
| Box App Config File | An application json file that was downloaded during the app creation. Either “Box App Config File” or “Box App Config JSON” has to be set. Select the Reference asset checkbox to upload the config file. |

###### Box Ingest Metadata Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

###### Box Ingest Metadata Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Box Folder ID | The ID of the folder to read the files from. Set this to `0` to synchronize all folders the Box app has access to. The ID can be retrieved from the URL, for example <https://app.box.com/folder/FOLDER_ID>. |
| Box File Identifier Column | The column of the metadata table that will store the Box file ID to associate the given metadata with a file. This column must be of type VARCHAR and be part of the table created in Create a Snowflake table for storing the Box metadata. |
| Destination Metadata Table | The Snowflake table you created in Create a Snowflake table for storing the Box metadata, which has the columns of the metadata you want to collect. |

#### Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

After starting the connector, it retrieves all files from the specified folder, and then consumes `admin_logs_streaming` events from the last 14 days.
This is done to capture any data that may have been missed during the initialization process.
During that time, `not found` errors may occur, caused by the files that appear in the events but are no longer present.

### Synchronize Box file metadata instances with a Snowflake table

Use the connector definition to perform a data transformation on metadata
from Box in a Snowflake table and add the changes back to a Box metadata instance.

#### Create a Snowflake stream for storing the Box metadata

1. Create a Snowflake stream for the metadata table you want to use. The stream is used to monitor any changes that occur to the table with which you want to synchronize your Box files.
   To learn how to create a table for storing Box metadata, see Create a Snowflake table for storing the Box metadata.
   If the connector is stopped beyond the data retention time and the stream becomes stale, then you must recreate a stream and replace the previous one. To learn more about managing streams, see [Manage streams](../../../../streams-manage.md).

   Here is an example of a stream that you can create for this connector:

   ```sqlexample
   CREATE OR REPLACE STREAM OPENFLOW.BOX_METADATA_SCHEMA.LOAN_AGREEMENT_METADATA_STREAM
   ON TABLE OPENFLOW.BOX_METADATA_SCHEMA.LOAN_AGREEMENT_METADATA
   ```
2. In the metadata table, ensure that there is a column to store the Box file ID and that it is of type VARCHAR.

   The name of this column is required to be entered as the Box File Identifier Column parameter in later steps.
   The list of supported columns types for the metadata table is VARCHAR, STRING, TEXT, FLOAT, DOUBLE, and DATE.

#### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

##### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

##### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Box Publish Metadata Source Parameters, Box Publish Metadata Destination Parameters and Box Publish Metadata Ingestion Parameters.

###### Box Publish Metadata Source Parameters

| Parameter | Description |
| --- | --- |
| Source Database | Snowflake Database that contains the schema that contains the Snowflake Stream that ingests the changes |
| Source Schema | Schema that contains the Snowflake Stream that ingests the changes |
| Snowflake Account Identifier | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide your Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. |
| Snowflake Private Key | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the RSA private key used for authentication. The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers. Note that either Snowflake Private Key File or Snowflake Private Key must be defined. |
| Snowflake Private Key File | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, upload the file that contains the RSA Private Key used for authentication to Snowflake, formatted according to PKCS8 standards and having standard PEM headers and footers. The header line begins with `-----BEGIN PRIVATE`. Select the Reference asset checkbox to upload the private key file. |
| Snowflake Private Key Password | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the password associated with the Snowflake Private Key File. |
| Snowflake Role | When using Session Token for your Authentication Strategy, use your Snowflake Role. You can find your Snowflake Role in the Openflow UI, by going to View Details for your Runtime. When using Key Pair for your Authentication Strategy, use a valid role configured for your service user. |
| Snowflake Username | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the user name used to connect to Snowflake instance. |
| Snowflake Warehouse | Snowflake warehouse used to run queries |
| Snowflake Stream Name | Snowflake stream name used for ingestion of changes from the source Snowflake table. You must create it before starting the connector and link to the table. |

###### Box Publish Metadata Destination Parameters

| Parameter | Description |
| --- | --- |
| Box App Config JSON | An application JSON configuration that was downloaded during the app creation. |
| Box App Config File | An application json file that was downloaded during the app creation. Either “Box App Config File” or “Box App Config JSON” has to be set. Select the Reference asset checkbox to upload the config file. |

###### Box Publish Metadata Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Box File Identifier Column | The column of the metadata table that will store the Box file ID to associate the given metadata with a file. This column must be of type VARCHAR and be part of the table created in Create a Snowflake table for storing the Box metadata. |
| Box Metadata Template Name | Template name of the Box metadata template that will be added to the Box files. You don’t need to manually create a template before starting the connector. If you enter a value in this parameter, a template is automatically created with this template name. The name provided should not overlap with any template that you have already created in your Box environment. |
| Box Metadata Template Key | The Box template key of the Box metadata template that will be added to the Box files. This is the key that will be used to reference the template in the Box API. You don’t need to manually create a template before starting the connector. If you enter a value in this parameter, a template is automatically created with this template key. The key provided should not overlap with any template that you have already created in your Box environment. |

#### Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

After running the flow, you can query the Cortex Search service. For information on how to query the Cortex Search service, see Query the Cortex Search service.

### Finding files in stage

Files stored in the stage may have unreadable names. To find specific files, use the metadata
tables as your source of truth. These tables contain the mapping between file names and their
corresponding file IDs in the stage.

For Cortex-enabled setups, use the following query to find files:

```sqlexample
SELECT DISTINCT METADATA:id FROM DOCS_CHUNKS WHERE METADATA:fullName LIKE '%<file_name>';
```

For non-Cortex setups, use the following query:

```sqlexample
SELECT FILE_ID FROM DOC_METADATA WHERE FILE_NAME = '<file_name>';
```

Replace `<file_name>` with the name or partial name of the file you’re looking for.

The files in the stage start with the ID returned from these queries.

---
title: Set up the Openflow Connector for Google Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-ads/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Google Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Google Ads.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Google Ads](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Google Ads](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a Google Ads administrator, perform the following steps:

* Ensure that you have access to a Google Cloud project or [create a new
  one](https://developers.google.com/workspace/guides/create-project).
* Ensure that the [Google Ads
  API](https://cloud.google.com/endpoints/docs/openapi/enable-api) is
  enabled for your Google Cloud project. Google Ads API access is
  required to ingest data.
* [Configure](https://developers.google.com/google-ads/api/docs/oauth/service-accounts)
  Service account authentication for Google Ads.
* Obtain developer token for your organization following
  [instructions](https://developers.google.com/google-ads/api/docs/get-started/dev-token).

> **Note:**
>
> Developer token should have Access Level either Basic or Standard. For more information about Access Level please see [documentation](https://developers.google.com/google-ads/api/docs/access-levels).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

#. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step.
Substitute the role placeholder with the actual value and use the following sql commands:

> ```sqlexample
> CREATE DATABASE GOOGLE_ADS_DESTINATION_DB;
> CREATE SCHEMA GOOGLE_ADS_DESTINATION_DB.GOOGLE_ADS_DESTINATION_SCHEMA;
> GRANT USAGE ON DATABASE GOOGLE_ADS_DESTINATION_DB TO ROLE <GOOGLE_ADS_CONNECTOR_ROLE>;
> GRANT USAGE ON SCHEMA GOOGLE_ADS_DESTINATION_DB.GOOGLE_ADS_DESTINATION_SCHEMA TO ROLE <GOOGLE_ADS_CONNECTOR_ROLE>;
> GRANT CREATE TABLE ON SCHEMA GOOGLE_ADS_DESTINATION_DB.GOOGLE_ADS_DESTINATION_SCHEMA TO ROLE <GOOGLE_ADS_CONNECTOR_ROLE>;
> ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

#### Flow parameters

There are three parameter contexts. `Google Ads Destination Parameters` and
`Google Ads Source Parameters` are respectively responsible for allowing
connections with GoogleAds API and Snowflake. `Google Ads Ingestion Parameters`
is used to define the reconfiguration of data downloaded from Google
Ads. `Google Ads Parameters` aggregates all of them in one.

##### Google Ads Ingestion Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Client Account ID | ID of the account in the Google Ads for which given report should be ingested | true |
| Login Customer ID | Customer ID of the Google Ads manager account (MCC) for which the report should be ingested | false |
| Google Ads Resource Name | Name of the resource in Google Ads that is a source for the report | true |
| Report Attributes | Attributes of the selected resource | true |
| Report Metrics | Metrics collected in the context of a given resource | false |
| Report Segments | Buckets in which metrics should be grouped | false |
| Report Start Date | Start date from which the ingestion should happen. The date format is YYYY-MM-DD. | false |
| Schedule | Get Google Ads Report processor schedule | true |

> **Note:**
>
> The easiest way to obtain proper combination of `Report Attributes`, `Report Metrics` and `Report Segments` is to use [Google Ads Query Builder](https://developers.google.com/google-ads/api/fields/v19/overview_query_builder).
> Select the resource based on the one inserted into parameter `Google Ads Resource Name` and construct the query. Then copy and pase attributes, metrics and segments to corresponding parameters.

##### Google Ads Source Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Google Developer Token | Developer token required to query Google Ads API | true |
| Google Service Account JSON | Service Account JSON required for Google Ads authentication | true |

##### Google Ads Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

## How to reset the connector

To fully reset connector to the initial state, do the following:

1. Ensure that there are no more flow files in the queues.
2. Stop all the processors.
3. Clear the state of the initial processor.

   > 1. Right click on the processor `Get Google Ads Report` and select View State.
   > 2. Select the option Clear State. This resets the state of the processor.
4. Drop the destination table in Snowflake.

---
title: Set up the Openflow Connector for Google Drive
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-drive/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Google Drive

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Google Drive.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Google Drive](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Google Drive](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

Setting up the connector requires specific permissions and account
settings for Snowflake Openflow processors to read data from Google.
This access is provided in part through setting up a service account and
a key for Openflow to authenticate as that service account.
For more information, see:

* [Configure access to the Google Cloud Search
  API](https://developers.google.com/cloud-search/docs/guides/project-setup#create_service_account_credentials)
* [Delegating domain-wide authority to the service
  account](https://developers.google.com/identity/protocols/oauth2/service-account#delegatingauthority)

As a Google Drive administrator, perform the following steps:

### Prerequisites

Ensure that you meet the following requirements:

* You have a Google user with Super Admin permissions
* You have a Google Cloud Project with the following roles:

  + Organization Policy Administrator
  + Organization Administrator

### Enable service account key creation

By default Google disables service account key creation. For Openflow to
use the service account JSON, this key creation policy must be turned
off.

1. Log in to the [Google Cloud
   Console](https://console.cloud.google.com/) with a super admin
   account that has the Organizational Policy Admin Role.
2. Ensure you are in the project associated with your organization, not
   the project in your organization.
3. Click Organization Policies.
4. Select the Disable service account key creation policy.
5. Click Manage Policy and turn off enforcement.
6. Click Set Policy.

### Create service account and key

1. Open the [Google Cloud Console](https://console.cloud.google.com/)
   and authenticate using a user that has been granted access to create
   service accounts.
2. Ensure you are in a project of your organization.
3. In the left navigation, under the IAM & Admin, select the
   Service Accounts tab.
4. Click Create Service Account.
5. Enter the service account name and click Create and Continue.
6. Click Done. In the table with the service accounts listed, find
   the OAuth 2 Client ID column. Copy the Client ID as this will be
   required later to set up domain-wide delegation in the next section.
7. On the newly created service account, click the menu under the table
   with the service accounts listed for that service account and select
   Manage keys.
8. Select Add key and then Create new key.
9. Leave the default selection of JSON and click Create.

The key is downloaded into your browser Downloads directory as a .json
file.

### Grant service account domain-wide delegation for listed scopes

1. Log in to your Google Admin account.
2. Select Admin from Google Apps selector.
3. In the left navigation, expand Security and then Access and select Data
   control then click on API Controls.
4. On the API Controls screen, select Manage domain wild
   delegation.
5. Click Add new.
6. Enter the OAuth 2 Client ID taken from the Create Service Account and
   Key section and the following scopes:

   * <https://www.googleapis.com/auth/drive>
   * <https://www.googleapis.com/auth/drive.metadata.readonly>
   * <https://www.googleapis.com/auth/admin.directory.group.member.readonly>
   * <https://www.googleapis.com/auth/admin.directory.group.readonly>
   * <https://www.googleapis.com/auth/drive.file>
   * <https://www.googleapis.com/auth/drive.metadata>
7. Click Authorize.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks manually
or by using the script included below:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

### Example setup

> ```sqlexample
> --The following script assumes you'll need to create all required roles, users, and objects.
> --However, you may want to reuse some that are already in existence.
>
> --Create a Snowflake service user to manage the connector
> USE ROLE USERADMIN;
> CREATE USER <openflow_service_user> TYPE=SERVICE COMMENT='Service user for Openflow automation';
>
> --Create a pair of secure keys (public and private). For more information, see
> --key-pair authentication. Store the private key for the user in a file to supply
> --to the connector’s configuration. Assign the public key to the Snowflake service user:
> ALTER USER <openflow_service_user> SET RSA_PUBLIC_KEY = '<pubkey>';
>
>
> --Create a role to manage the connector and the associated data and
> --grant it to that user
> USE ROLE SECURITYADMIN;
> CREATE ROLE <openflow_connector_admin_role>;
> GRANT ROLE <openflow_connector_admin_role> TO USER <openflow_service_user>;
>
>
> --The following block is for USE CASE 2 (Cortex connect) ONLY
> --Create a role for read access to the cortex search service created by this connector.
> --This role should be granted to any role that will use the service
> CREATE ROLE <cortex_search_service_read_only_role>;
> GRANT ROLE <cortex_search_service_read_only_role> TO ROLE <whatever_roles_will_access_search_service>;
>
> --Create the database the data will be stored in and grant usage to the roles created
> USE ROLE ACCOUNTADMIN; --use whatever role you want to own your DB
> CREATE DATABASE IF NOT EXISTS <destination_database>;
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_connector_admin_role>;
>
> --Create the schema the data will be stored in and grant the necessary privileges
> --on that schema to the connector admin role:
> USE DATABASE <destination_database>;
> CREATE SCHEMA IF NOT EXISTS <destination_schema>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
> GRANT CREATE TABLE, CREATE DYNAMIC TABLE, CREATE STAGE, CREATE SEQUENCE, CREATE CORTEX
> SEARCH SERVICE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
>
> --The following block is for CASE 2 (Cortex connect) ONLY
> --Grant the Cortex read-only role access to the database and schema
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <cortex_search_service_read_only_role>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <cortex_search_service_read_only_role>;
>
> --Create the warehouse this connector will use if it doesn't already exist. Grant the
> --appropriate privileges to the connector admin role. Adjust the size according to your needs.
> CREATE WAREHOUSE <openflow_warehouse>
> WITH
>    WAREHOUSE_SIZE = 'MEDIUM'
>    AUTO_SUSPEND = 300
>    AUTO_RESUME = TRUE;
> GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_connector_admin_role>;
> ```

## Use case 1: Use the connector definition to ingest files only

Use the connector definition to:

* Perform custom processing on ingested files
* Ingest Google Drive files and permissions and keep them up to date

### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

#### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Right-click on the imported process group and select **Parameters**.
2. Enter the required parameter values as described in Google Drive Source Parameters, Google Drive Destination Parameters and Google Drive Ingestion Parameters.

##### Google Drive Source Parameters

| Parameter | Description |
| --- | --- |
| Google Delegation User | The user that is used by the service account |
| GCP Service Account JSON | The service account JSON downloaded from Google Cloud Console to allow access to Google APIs in the connector |

##### Google Drive Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

##### Google Drive Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Google Drive ID | The Google Shared Drive to watch for content and updates |
| Google Folder Name | Optionally, the Google Drive folder identifier (human readable folder name) can be set to filter incoming files by. If all file types are desired then select “Set Empty String”. When set, only files that are in the provided folder or subfolder will be retrieved. When blank or unset, no folder filtering is applied and all files under the drive are retrieved. |
| Google Domain | The Google Workspace Domain that the Google Groups and Drive resides in. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Snowflake File Hash Table Name | Internal table used to store file content hashes to prevent updates to content when it has not changed. |

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

## Use case 2: Use the connector definition to ingest files and perform processing with Cortex

Use the predefined flow definition to:

* Create AI assistants for public documents within your organization’s
  Google Drive.
* Enable your AI assistants to adhere to access controls specified in
  your organization’s Google Drive.

### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

#### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Right-click on the imported process group and select **Parameters**.
2. Enter the required parameter values as described in Google Drive Cortex Connect Source Parameters, Google Drive Cortex Connect Destination Parameters and Google Drive Cortex Connect Ingestion Parameters.

##### Google Drive Cortex Connect Source Parameters

| Parameter | Description |
| --- | --- |
| Google Delegation User | The user that is used by the service account |
| GCP Service Account JSON | The service account JSON downloaded from Google Cloud Console to allow access to Google APIs in the connector |

##### Google Drive Cortex Connect Destination Parameters

| Parameter | Description |
| --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake |
| Destination Schema | The schema where data will be persisted. It must already exist in Snowflake |
| Snowflake Account Identifier | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide your Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. |
| Snowflake Private Key | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the RSA private key used for authentication. The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers. Note that either Snowflake Private Key File or Snowflake Private Key must be defined. |
| Snowflake Private Key File | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, upload the file that contains the RSA Private Key used for authentication to Snowflake, formatted according to PKCS8 standards and having standard PEM headers and footers. The header line begins with `-----BEGIN PRIVATE`. Select the Reference asset checkbox to upload the private key file. |
| Snowflake Private Key Password | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the password associated with the Snowflake Private Key File. |
| Snowflake Role | When using Session Token for your Authentication Strategy, use your Snowflake Role. You can find your Snowflake Role in the Openflow UI, by going to View Details for your Runtime. When using Key Pair for your Authentication Strategy, use a valid role configured for your service user. |
| Snowflake Username | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the user name used to connect to Snowflake instance. |
| Snowflake Warehouse | Snowflake warehouse used to run queries |

##### Google Drive Cortex Connect Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Google Drive ID | The Google Shared Drive to watch for content and updates |
| Google Folder Name | Optionally, the Google Drive folder identifier (human readable folder name) can be set to filter incoming files by. If all file types are desired then select “Set Empty String”.  When set, only files that are in the provided folder or subfolder will be retrieved. When blank or unset, no folder filtering is applied and all files under the drive are retrieved. |
| Google Domain | The Google Workspace Domain that the Google Groups and Drive resides in. |
| OCR Mode | The OCR mode to use when parsing files with [Parsing documents with AI_PARSE_DOCUMENT](../../../../snowflake-cortex/parse-document.md) function. The value can be `OCR` or `LAYOUT`. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Snowflake File Hash Table Name | Internal table used to store file content hashes to prevent updates to content when it has not changed. |
| Snowflake Cortex Search Service User Role | An identifier of a role that is assigned usage permissions on the Cortex Search service. |

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.
3. Query the Cortex Search service.

## Use case 3: Customise the connector definition

Customize the connector definition to perform custom processing on ingested files.

### Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

#### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Customize the connector definition.

   > 1. Remove the following process groups:
   >
   >    * Check If Duplicate Content
   >    * Snowflake Stage and Parse PDF
   >    * Update Snowflake Cortex
   > 2. Attach any custom processing to the output of the *Process Google
   >    Drive Metadata* process group. Each flow file represents a single
   >    Google Drive file change. Flow file attributes can be seen in the
   >    `Fetch Google Drive Metadata` documentation.
2. Populate the process group parameters. Follow the same process as for
   Use case 1: Use the connector definition to ingest files only. Note that after modifying the connector definition,
   not all parameters might be required.

### Run the flow

1. Run the flow.

   1. Start the process group. The flow will create all required objects
      inside of Snowflake.
   2. Right click on the imported process group and select **Start**.
2. Query the Cortex Search service.

#### Query the Cortex Search service

You can use the [Cortex Search](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md) service to build chat
and search applications to chat with or query your documents in Google Drive.

After you install and configure the connector and it begins
ingesting content from Google Drive, you can query the Cortex Search service.
For more information about using Cortex Search, see [Query a Cortex Search service](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md).

**Filter responses**

To restrict responses from the Cortex Search service to documents that a specific user
has access to in Google Drive, you can specify a filter containing the user ID or email address of the user
when you query Cortex Search. For example, `filter.@contains.user_ids` or `filter.@contains.user_emails`.
The name of the Cortex Search service created by the connector is `search_service` in the schema `Cortex`.

Run the following SQL code in a SQL worksheet to query
the Cortex Search service with files ingested from your Google Drive.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.
* `your_question`: The question that you want to get responses for.
* `number_of_results`: Maximum number of results to return in the response. The maximum value is 1000 and the default value is 10.

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<application_instance_name>.cortex.search_service',
      '{
        "query": "<your_question>",
         "columns": ["chunk", "web_url"],
         "filter": {"@contains": {"user_emails": "<user_emailID>"} },
         "limit": <number_of_results>
       }'
   )
)['results'] AS results
```

Here’s a complete list of values that you can enter for `columns`:

| Column name | Type | Description |
| --- | --- | --- |
| `full_name` | String | A full path to the file from the Google Drive documents root. Example: `folder_1/folder_2/file_name.pdf`. |
| `web_url` | String | A URL that displays an original Google Drive file in a browser. |
| `last_modified_date_time` | String | Date and time when the item was most recently modified. |
| `chunk` | String | A piece of text from the document that matched the Cortex Search query. |
| `user_ids` | Array | An array of Microsoft 365 user IDs that have access to the document. It also includes user IDs from all the Microsoft 365 groups that are assigned to the document. To find a specific user ID, see [Get a user](https://learn.microsoft.com/en-us/graph/api/user-get?view=graph-rest-1.0&tabs=http). |
| `user_emails` | Array | An array of Microsoft 365 user email IDs that have access to the document. It also includes user email IDs from all the Microsoft 365 groups that are assigned to the document. |

**Example: Query an AI assistant for human resources (HR) information**

You can use Cortex Search to query an AI assistant for employees to chat with the latest versions of
HR information, such as onboarding, code of conduct, team processes, and organization policies.
Using response filters, you can also allow HR team members to query employee contracts while adhering to access controls configured in Google Drive.

SQLPythonREST API

Run the following in a [SQL worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the Cortex Search service with files ingested from Google Drive.
Select the database as your application instance name and schema as **Cortex**.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```sqlexample
SELECT PARSE_JSON(
     SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
          '<application_instance_name>.cortex.search_service',
          '{
             "query": "What is my vacation carry over policy?",
             "columns": ["chunk", "web_url"],
             "filter": {"@contains": {"user_emails": "<user_emailID>"} },
             "limit": 1
          }'
     )
 )['results'] AS results
```

Run the following code in a [Python worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the
Cortex Search service with files ingested from Google Drive.
Ensure that you add the `snowflake.core` package to your database.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.core import Root

def main(session: snowpark.Session):

   root = Root(session)

   # fetch service
   my_service = (root
     .databases["<application_instance_name>"]
     .schemas["cortex"]
     .cortex_search_services["search_service"]
   )

   # query service
   resp = my_service.search(
     query="What is my vacation carry over policy?",
     columns = ["chunk", "web_url"],
     filter = {"@contains": {"user_emails": "<user_emailID>"} },
     limit=1
   )
   return (resp.to_json())
```

Execute the following code in a command-line interface to query the Cortex Search
service with files ingested from your Google Drive.
You will need to authentication through key pair authentication and OAuth to access the
Snowflake REST APIs. For more information,
see [REST API](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md)
and [Authenticating Snowflake REST APIs with Snowflake](../../../../../developer-guide/snowflake-rest-api/authentication.md).

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `account_url`: Your Snowflake account URL. For instructions on finding your account URL, see [Finding the organization and account name for an account](../../../../admin-account-identifier.md).

```bash
curl --location "https://<account_url>/api/v2/databases/<application_instance_name>/schemas/cortex/cortex-search-services/search_service" \
     --header 'Content-Type: application/json' \
     --header 'Accept: application/json' \
     --header "Authorization: Bearer <CORTEX_SEARCH_JWT>" \
     --data '{
         "query": "What is my vacation carry over policy?",
         "columns": ["chunk", "web_url"],
         "limit": 1
     }'
```

## Finding files in stage

Files stored in the stage may have unreadable names. To find specific files, use the metadata
tables as your source of truth. These tables contain the mapping between file names and their
corresponding file IDs in the stage.

For Cortex-enabled setups, use the following query to find files:

```sqlexample
SELECT DISTINCT METADATA:id FROM DOCS_CHUNKS WHERE METADATA:fullName LIKE '%<file_name>%';
```

For non-Cortex setups, use the following query:

```sqlexample
SELECT FILE_ID FROM DOC_METADATA WHERE FILE_NAME = '<file_name>';
```

Replace `<file_name>` with the name or partial name of the file you’re looking for.

The files in the stage start with the ID returned from these queries.

---
title: Set up the Openflow Connector for Google Sheets
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-sheets/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Google Sheets

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Google Sheets.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Google Sheets](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you have reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Google Sheets](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the Google Cloud credentials and set up your Google Cloud Project

As a Google Cloud administrator, perform the following tasks:

1. Ensure that you have the following:

   * A Google user with [Super Admin permissions](https://support.google.com/a/answer/2405986?hl)
   * A [Google Cloud Project](https://developers.google.com/workspace/guides/create-project) with the following roles:

     + [Organization Policy Administrator](https://cloud.google.com/iam/docs/understanding-roles#orgpolicy.policyAdmin)
     + [Organization Administrator](https://cloud.google.com/iam/docs/understanding-roles#resourcemanager.organizationAdmin)
2. Enable service account key creation. Google disables service account key creation by default.

   This key creation policy must be turned off for Snowflake Openflow to use the service account JSON. To enable service account key creation, perform the following tasks:

   1. Log in to the [Google Cloud Console](https://console.cloud.google.com/) with a super admin account that has the Organizational Policy Admin role.
   2. Ensure that you are in the project associated with your organization, not the project in your organization.
   3. Select Organization Policies.
   4. Select the Disable service account key creation policy.
   5. Select Manage Policy and turn off enforcement.
   6. Select Set Policy.
3. [Create a service account and key](https://developers.google.com/workspace/guides/create-credentials#service-account).
4. Share the Google Sheets spreadsheet with the service account email address. The email address can be found in the service account JSON file under the `client_email` field. Set the sharing permissions to `Viewer`.
5. Enable the Google Sheets API for your Google Cloud Project.

   For more information, see [Enable the Google Sheets API](https://developers.google.com/sheets/api/guides/concepts#enable_the_google_sheets_api).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

The configuration of the connector definition is divided into three parameter contexts:

* Google Sheets Source Parameters: Used to establish connection with Google Sheets.
* Google Sheets Destination Parameters: Used to establish connection with Snowflake.
* Google Sheets Ingestion Parameters: Used to define the configuration of data downloaded from Google Sheets.

> **Note:**
>
> The Google Sheets Ingestion Parameters parameter context contains spreadsheet-specific details,
> so you must create new parameter contexts for each new spreadsheet and process group.
>
> To create a new parameter context, go to the Openflow Canvas menu, select Parameter Contexts and add a new parameter context.
> It inherits parameters from both the Google Sheets Destination Parameters and Google Sheets Source Parameters parameter contexts.

The following tables describe the flow parameters that you can configure based on the parameter contexts:

#### Google Sheets Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Google Sheets Source Parameters

| Parameter | Description |
| --- | --- |
| Service Account JSON | Contents of the file containing Service Account credentials, such as client_id, client_email, and private_key. Copy the entire contents of the file. |

#### Google Sheets Ingestion Parameters

The following table lists only those parameters that are not inherited from other parameter contexts.

| Parameter | Description |
| --- | --- |
| Date Time Render Option | Determines how dates should be rendered in the output. You can select one of these options: `SERIAL_NUMBER` and `FORMATTED_STRING`. Select `SERIAL_NUMBER` only when the Value Render Option parameter is set to `UNFORMATTED_VALUE`. For more information, see [DateTimeRenderOption](https://developers.google.com/sheets/api/reference/rest/v4/DateTimeRenderOption). |
| Destination Database | The destination database in which the destination table is created. |
| Destination Schema | The destination schema in which the destination table is created. |
| Destination Table Prefix | The destination table prefix is where report data pulled from Google Sheets is stored. The connector creates one destination table for each range. If no ranges are provided then sheet names are used as table identifiers. The first row in a sheet represents the column names in the destination table. |
| Ranges | The list of ranges to retrieve from the spreadsheet. If no range is specified, all sheets in the specified spreadsheet will be downloaded. Provide each range in either [A1 or R1C1 notation](https://developers.google.com/sheets/api/guides/concepts#cell), separated by a comma. For example: `Sheet1!A1:B2,Sheet2!D4:E5,Sheet3`. |
| Run Schedule | Run schedule on which data is retrieved from Google Sheets and saved in Snowflake. By default, the timer-driven scheduling strategy is used and here the user specifies an interval, for example, `8h`. |
| Spreadsheet ID | The [unique identifier](https://developers.google.com/sheets/api/guides/concepts) for a spreadsheet. You can find it in the URL of the spreadsheet. |
| Value Render Option | Determines how values should be rendered in the output. You can select one of these options: `FORMATTED_VALUE` and `UNFORMATTED_VALUE`. If you select `FORMATTED_VALUE`, then all the columns in the destination table are of VARCHAR type. For more information, see [ValueRenderOption](https://developers.google.com/sheets/api/reference/rest/v4/ValueRenderOption). |

> **Note:**
>
> The destination table identifier is a combination of the destination table prefix and range name and must be unique.
> If you download data from multiple spreadsheets, or single sheets, and ranges names are not unique, then you must specify unique destination table prefix for each flow.
> The connector may fail, overwriting existing destination tables, if destination table names aren’t unique.

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

> **Note:**
>
> Imported `.xlsx` must be in Google Sheets format.
> If you import files, ensure that the file is converted to Google Sheets format before running flows.
> Spreadsheets in any format other than Google Sheets cannot be read.
> For more information, see [Convert files to Google Sheets format](https://support.google.com/docs/answer/9331167?hl=en#2.5).

---
title: Set up the Openflow Connector for HubSpot
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/hubspot/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for HubSpot

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for HubSpot.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for HubSpot](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Hubspot](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a HubSpot administrator, generate a HubSpot private app token or create one in your HubSpot account. This lets you authenticate your requests to the HubSpot API.

1. Log in to your HubSpot account.
2. Navigate to Settings by selecting the gear icon in the top navigation bar.
3. In the left navigation, go to Integrations » Private Apps.
4. Select Create a private app.

   1. Enter a name for your app.
   2. Navigate to the Scopes tab.
   3. Select the scopes required for the API requests you intend to make. To find scopes required for the API requests, see [Scopes](https://developers.hubspot.com/docs/guides/apps/authentication/scopes).
   4. Select Create app.
   5. Set the required scopes for the API requests you intend to make for each endpoint.
5. Select View access token to view the access token. Paste the token in the connector parameters, or save it securely.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md) and [View privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not want to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. After the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Create a database and schema in Snowflake for the connector to store ingested data. Grant the following [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step.

   > ```sqlexample
   > CREATE DATABASE hubspot_destination_db;
   > CREATE SCHEMA hubspot_destination_db.hubspot_destination_schema;
   > GRANT USAGE ON DATABASE hubspot_destination_db TO ROLE <hubspot_connector_role>;
   > GRANT USAGE ON SCHEMA hubspot_destination_db.hubspot_destination_schema TO ROLE <hubspot_connector_role>;
   > GRANT CREATE TABLE, CREATE VIEW ON SCHEMA hubspot_destination_db.hubspot_destination_schema TO ROLE <hubspot_connector_role>;
   > ```
8. Create a warehouse that will be used by the connector or use an existing one. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.
9. Ensure that the user with role used by the connector has the required privileges to use the warehouse. If that’s not the case then grant the required privileges to the role.

   > ```sqlexample
   > CREATE WAREHOUSE hubspot_connector_warehouse WITH WAREHOUSE_SIZE = 'X-Small';
   > GRANT USAGE ON WAREHOUSE hubspot_connector_warehouse TO ROLE <hubspot_connector_role>;
   > ```

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* HubSpot Source Parameters: Used to establish connection with HubSpot.
* HubSpot Destination Parameters: Used to establish connection with Snowflake.
* HubSpot Ingestion Parameters: Used to define the configuration of data downloaded from HubSpot.

#### HubSpot Source Parameters

| Parameter | Description |
| --- | --- |
| HubSpot Access Token | HubSpot Private Application access token. |

#### HubSpot Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### HubSpot Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Object Types | List of comma-separated HubSpot object types to ingest.  Supported object type values are:  * Appointments * Calls * Campaigns * Carts * Commerce Payments * Communications * Companies * Contacts * Courses * Deals * Discounts * Emails * Fees * Feedback Submissions * Goals * Invoices * Leads * Line Items * Listings * Meetings * Notes * Orders * Postal Mail * Products * Quotes * Quote Templates * Services * Subscriptions * Tasks * Taxes * Tickets * Users |
| Updated After | Filter objects updated after specified date or time. This parameter is optional. |
| Data Ingestion Schedule | Time between the next schedule. It should have a valid time duration, such as 30 minutes or 1 hour. |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

### Reconfigure the connector

You can modify the connector parameters after the connector has started ingesting data. If the issue query criteria changes, perform the following steps to make sure that the data in the destination table is consistent.

1. Stop the connector: Ensure that all Openflow processors are stopped.
2. Access configuration settings: Navigate to the connector’s configuration settings within the Snowflake Openflow interface.
3. Modify parameters: Adjust the parameters as required.
4. Clear processor state: If you are changing ingestion criteria, then Snowflake strongly recommends that you start ingestion from the beginning to keep the data in the destination table consistent. After clearing the state in the `List Fresh HubSpot Objects` processor, the connector will fetch all the objects from the beginning. Manual truncation of the destination table may be needed to prevent duplication of rows.

## Data structure and views

The connector stores data in the following two formats within your Snowflake database:

### Raw data storage

All raw HubSpot data is stored in tables with the exact names specified in the Object Types parameter. For example:

* If you configure `Products,Contacts,Companies` in the Object Types parameter, the connector creates three tables: `PRODUCTS`, `CONTACTS`, and `COMPANIES`.
* Each table contains the complete JSON payload from the HubSpot API responses.
* Raw data preserves the original structure and all metadata from HubSpot.

### Flattened views

For easier querying and analysis, the connector automatically creates flattened views for each object type:

* Each raw table has a corresponding view with the suffix `_VIEW`. For example: `PRODUCTS_VIEW`, `CONTACTS_VIEW`, and `COMPANIES_VIEW`.
* Views extract commonly used fields from the JSON payload into individual columns.
* Complex nested structures are flattened for simplified SQL queries.

---
title: Set up the Openflow Connector for Jira Cloud
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/jira-cloud/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Jira Cloud

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Jira Cloud.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Jira Cloud](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Jira Cloud](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a Jira Cloud administrator, perform the following tasks in your Atlassian account:

1. Navigate to the [API tokens page](https://id.atlassian.com/manage-profile/security/api-tokens).
2. Select Create API token with scopes.
3. In the Create an API token dialog box, provide a descriptive name for the API token and select an expiration date for the API token. This can range from 1 to 365 days.
4. Select the Api token app Jira.
5. Select jira scopes `read:jira-work` and `read:jira-user`.
6. Select Create token.
7. In the Copy your API token dialog box, select Copy to copy your generated API token and then paste the token to the connector parameters, or save it securely.
8. Select Close to close the dialog box.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role.
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Create a database and schema in Snowflake for the connector to store ingested data. Grant the following [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step.

   > ```sqlexample
   > CREATE DATABASE jira_destination_db;
   > CREATE SCHEMA jira_destination_db.jira_destination_schema;
   > GRANT USAGE ON DATABASE jira_destination_db TO ROLE <jira_connector_role>;
   > GRANT USAGE ON SCHEMA jira_destination_db.jira_destination_schema TO ROLE <jira_connector_role>;
   > GRANT CREATE TABLE, CREATE VIEW ON SCHEMA jira_destination_db.jira_destination_schema TO ROLE <jira_connector_role>;
   > ```
8. Create a warehouse that will be used by the connector or use an existing one. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.
9. Ensure that the user with role used by the connector has the required privileges to use the warehouse. If that’s not the case then grant the required privileges to the role.

   > ```sqlexample
   > CREATE WAREHOUSE jira_connector_warehouse WITH WAREHOUSE_SIZE = 'X-Small';
   > GRANT USAGE ON WAREHOUSE jira_connector_warehouse TO ROLE <jira_connector_role>;
   > ```

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Jira Cloud Source Parameters: Used to establish connection with Jira API.
* Jira Cloud Destination Parameters: Used to establish connection with Snowflake.
* Jira Cloud Ingestion Parameters: Used to define the configuration of data downloaded from Jira.

> **Note:**
>
> Modifying the parameters related to ingestion configuration (for example, Search Type, JQL Query, Project Names, and Created After) will reset the state of the `FetchJiraIssues` processor,
> allowing it to fetch all issues again. This is useful if you want to change the issue query criteria or restart the ingestion from scratch. This reset action does not truncate the destination table.

#### Jira Cloud Source Parameters

| Parameter | Description |
| --- | --- |
| Jira Email | Email address for the Atlassian account. |
| Jira API Token | API access token for your Atlassian Jira account with the necessary scopes (`read:jira-work` and `read:jira-user`). |
| Environment URL | URL to the Atlassian Jira environment. For example, `https://your-domain.atlassian.net`. |
| Connection Method | Must be set to `DIRECT` unless otherwise instructed by Snowflake. |

#### Jira Cloud Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Jira Cloud Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Search Type | Type of search to perform. It has one of these possible values `SIMPLE` and `JQL`. Default value: `SIMPLE`. |
| Destination Table | The Snowflake table where data is stored. It will be created if it doesn’t exist. The name of the table must be unquoted and must be provided in uppercase. Additionally to the destination table, a flattened view based on destination table is created. The view name is a concatenation of the table name and the suffix `_VIEW` |
| JQL Query | A JQL query used to search for Jira issues to fetch. It should be used only when Search Type is `JQL`. |
| Project Names | List of projects from which the issues should be fetched. You can search for issues belonging to a particular project by project name, project key, or project ID. It should be used only when Search Type is `SIMPLE`. Provide a list of items, separated by commas. For example: `Project1, Project2`. |
| Status Category | Status category filter for simple search. It should be used only when Search Type is `SIMPLE`. Example values are: `Done`, `In Progress`, `To Do`. |
| Updated After | Filter issues updated after a specified date and time. It should be used only when Search Type is `SIMPLE`. It should be in the yyyy-MM-dd format, such as 2023-10-01. |
| Created After | Filter issues created after a specified date and time. It should be used only when Search Type is `SIMPLE`. It should be in the yyyy-MM-dd format, such as 2023-10-01. |
| Issue Fields | A list of fields to return for each issue, which is used to retrieve a subset of fields. IDs of custom fields can be obtained by following [this guide](https://confluence.atlassian.com/jirakb/get-custom-field-ids-for-jira-and-jira-service-management-744522503.html). This parameter accepts a comma-separated list. You can use special values: `*all` to fetch all fields, `*navigable` to fetch navigable fields, field prefixed with minus (`-`) to exclude field. For example, `*all,-description` returns all fields except description. Default value: `*all`. |
| Fetch All Worklogs | Determines whether to fetch all worklogs for each issue. Default value: `false`.   * When set to `true`, the connector enriches issues with all associated worklogs, beyond the default 20 worklogs per issue returned by the Jira Cloud REST API. * When set to `false`, only the first 20 worklogs per issue are fetched.   **Note:** Setting this parameter to `true` can impact performance due to the increased number of API calls required to fetch all worklogs for issues with more than 20 worklogs. |
| Maximum Page Size | Maximum number of issues to return per request, with a default and maximum value of `1000`. Note that the Jira API may return fewer results depending on the total response size. |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

If you need to change the issue query criteria or want to restart the ingestion from scratch, perform the following steps to ensure that the data in the destination table is consistent:

1. Right-click on the FetchJiraIssues processor and stop it.
2. Right-click on the FetchJiraIssues processor and then select View State.
3. In the State dialog box, select Clear State. This action clears the state of the processor and allows it to fetch all issues again.
4. Optional: If you want to change the issue query criteria, right-click on the imported process group and select Parameters. Update the parameters as needed.
5. Optional: If you want to change the destination table name, right-click on the imported process group and select Parameters. Update the `Destination Table` parameter.
6. Right-click on the FetchJiraIssues processor and select Start. The connector starts the data ingestion.
7. After ingestion, the data is available in the Snowflake destination table and in a flattened format in the destination view. The view includes all fields available in the Jira instance.

## Accessing the data

Data fetched from Jira is available in the destination table. All fields fetched for Jira issue is available in the
`ISSUE` column as an object in raw form fetched from the API.

To help with querying the data, a flattened view is created based on the destination table. The view name is a
concatenation of the table name and the suffix `_VIEW`. For example, if the destination table is named `JIRA_ISSUES`,
then the view will be named `JIRA_ISSUES_VIEW`. In the view, all issue fields are extracted and available as separate
columns. The column name is set to the field label. If there are many issues with the same label, a suffix with field ID
is added to the column name to ensure uniqueness. For example, if there are two fields with IDs `customfield_1`,
`customfield_2`, the label set in both fields to `Custom Field`, then the columns in the view will be named
`Custom Field (customfield_1)`, `Custom Field (customfield_2)`.

---
title: Set up the Openflow Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kafka/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Kafka](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Kafka](../../setup-openflow-spcs-sf-allow-list.md) connector.
4. Ensure that your Kafka cluster is running version 0.10.0.0 or later. Prior versions of Kafka are not supported.

### Required network rule (Snowflake Deployment)

If you are using Snowflake Deployment, your Kafka cluster must be reachable from within the Snowflake deployment. This requires creating a network rule that includes all Kafka broker host:port pairs, not only the bootstrap servers. For details on creating network rules and External Access Integrations, see [Creating Network Rules and External Access Integrations](../../setup-openflow-spcs-create-rr.md).

## Connector types

The Openflow Connector for Kafka is available in three different configurations, each optimized for specific use cases. You can download these connector definitions from the connectors gallery:

Apache Kafka for JSON data format
:   Simplified connector for JSON message ingestion with schema evolution and topic-to-table mapping

Apache Kafka for AVRO data format
:   Simplified connector for AVRO message ingestion with schema evolution and topic-to-table mapping

Apache Kafka with DLQ and metadata
:   Full-featured connector with dead letter queue (DLQ) support, metadata handling, and feature parity with the [Snowflake connector for Kafka](../../../../kafka-connector-overview.md)

For detailed configuration of specific connector types, see:

* [Apache Kafka for JSON/AVRO data format](kafka-json-avro.md) - JSON/AVRO data format connectors
* [Apache Kafka with DLQ and metadata](kafka-dlq-metadata.md) - DLQ and metadata connector

### Which connector should you choose?

Choose the connector variant that best matches your data format, operational requirements, and feature needs:

Choose [Apache Kafka for JSON or AVRO data format](kafka-json-avro.md) when:

* Your Kafka messages are in JSON or AVRO format
* You need basic schema evolution capabilities
* You want a simple setup with minimal configuration
* You don’t require advanced error handling or dead letter queue functionality
* You’re setting up a new integration and want to get started quickly

*Format-specific considerations:*

* **JSON format**: More flexible for varied data structures, easier to debug and inspect
* **AVRO format**: Strongly typed data with built-in schema registry integration, better for structured data pipelines

Choose [Apache Kafka with DLQ and metadata](kafka-dlq-metadata.md) when:

* You’re migrating from the [Snowflake connector for Kafka](../../../../kafka-connector-overview.md) and need feature parity with compatible functionality
* You need robust error handling and dead letter queue support for failed messages
* You require detailed metadata about message ingestion (timestamps, offsets, headers)

#### Migration considerations

If you’re currently using the Snowflake connector for Kafka, choose the **Apache Kafka with DLQ and metadata** connector for a seamless migration experience with feature compatibility.

**Field name handling differences**: The Openflow Connector for Kafka handles special characters in field names differently from the Snowflake connector for Kafka. After migration, the Openflow Connector for Kafka may create new Snowflake columns with different names due to these naming convention differences. For detailed information about how field names are transformed, see [Field name mapping and special characters handling](about.md).

#### Performance considerations

* JSON and AVRO format connectors offer better performance for simple use cases due to their streamlined design
* The DLQ and metadata connector provides more comprehensive monitoring and error handling at the cost of slightly higher resource usage

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
2. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).

   Since the connector has the capability to automatically create the destination table if it does not already exist, make sure the user has the required privileges for creating and managing Snowflake objects:

   | Object | Privilege | Notes |
   | --- | --- | --- |
   | Database | USAGE |  |
   | Schema | USAGE . CREATE TABLE . | After the schema-level objects have been created, the CREATE `object` privileges can be revoked. |
   | Table | OWNERSHIP | Only required when using the Kafka connector to ingest data into an existing table. . If the connector creates a new target table for records from the Kafka topic, the default role for the user specified in the configuration becomes the table owner. |

   Snowflake recommends creating a separate user and role for each Kafka instance for better access control.

   You can use the following script to create and configure a custom role (requires SECURITYADMIN or equivalent):

   ```sqlexample
   USE ROLE securityadmin;

   CREATE ROLE kafka_connector_role_1;
   GRANT USAGE ON DATABASE kafka_db TO ROLE kafka_connector_role_1;
   GRANT USAGE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;
   GRANT CREATE TABLE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;

   -- Only for existing tables
   GRANT OWNERSHIP ON TABLE existing_table1 TO ROLE kafka_connector_role_1;
   ```

   Note that privileges must be granted directly to the connector role and cannot be inherited.
3. Grant the Snowflake service user the role you created in the previous steps.

   The role should be assigned as the default role for the user:

   ```sqlexample
   GRANT ROLE kafka_connector_role_1 TO USER kafka_connector_user_1;
   ALTER USER kafka_connector_user_1 SET DEFAULT_ROLE = kafka_connector_role_1;
   ```
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 1.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Populate the process group parameters

   1. Right click on the imported process group and select **Parameters**.
   2. Fill out the required parameter values as described in Common parameters.

### Common parameters

All Kafka connector variants share common parameter contexts for basic connectivity and authentication.

#### Snowflake destination parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Kafka source parameters (SASL authentication)

| Parameter | Description | Required |
| --- | --- | --- |
| Kafka Security Protocol | Security protocol used to communicate with brokers. Corresponds to Kafka Client security.protocol property. One of: *SASL_PLAINTEXT* / *SASL_SSL* | Yes |
| Kafka SASL Mechanism | SASL mechanism used for authentication. Corresponds to Kafka Client sasl.mechanism property. One of: *PLAIN* / *SCRAM-SHA-256* / *SCRAM-SHA-512* | Yes |
| Kafka SASL Username | The username to authenticate to Kafka | Yes |
| Kafka SASL Password | The password to authenticate to Kafka | Yes |
| Kafka Bootstrap Servers | A comma-separated list of Kafka broker to fetch data from, should contain port, for example kafka-broker:9092. The same instance is used for the DLQ topic. | Yes |

#### Kafka ingestion parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Kafka Topic Format | One of: *names* / *pattern*. Specifies whether the “Kafka Topics” provided are a comma separated list of names or a single regular expression. | Yes |
| Kafka Topics | A comma-separated list of Kafka topics or a regular expression. | Yes |
| Kafka Group Id | The ID of a consumer group used by the connector. Can be arbitrary but must be unique. | Yes |
| Kafka Auto Offset Reset | Automatic offset configuration applied when no previous consumer offset is found corresponding to Kafka `auto.offset.reset` property. One of: *earliest* / *latest.* Default: *latest* | Yes |
| Topic To Table Map | This optional parameter allows user to specify which topics should be mapped to which tables. Each topic and its table name should be separated by a colon (see example below). This table name must be a valid Snowflake unquoted identifier. The regular expressions cannot be ambiguous — any matched topic must match only a single target table. If empty or no matches found, topic name will be used as table name. Note: The mapping cannot contain spaces after commas. | No |

`Topic To Table Map` example values:

* `topic1:low_range,topic2:low_range,topic5:high_range,topic6:high_range`
* `topic[0-4]:low_range,topic[5-9]:high_range`
* `.*:destination_table` - maps all topics to the **destination_table**

## Configure variant-specific settings

After configuring the common parameters, you need to configure settings specific to your chosen connector variant:

For **Apache Kafka for JSON data format** and **Apache Kafka for AVRO data format** connectors:
:   See [Apache Kafka for JSON/AVRO data format](kafka-json-avro.md) for JSON/AVRO-specific parameters.

For **Apache Kafka with DLQ and metadata** connector:
:   See [Apache Kafka with DLQ and metadata](kafka-dlq-metadata.md) for advanced parameters including DLQ configuration, schematization settings, Iceberg table support, and message format options.

## Authentication

All connector variants support SASL authentication configured through parameter contexts as described in Kafka source parameters (SASL authentication).

For other authentication methods including mTLS and AWS MSK IAM, see [Configure other authentication methods for Openflow Connector for Kafka](authentication.md).

## Run the flow

1. Right click on the plane and click **Enable all Controller Services**.
2. Right click on the plane and click **Start**. The connector starts data ingestion.

---
title: Set up the Openflow Connector for LinkedIn Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/linkedin-ads/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for LinkedIn Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for LinkedIn Ads.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for LinkedIn Ads](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [LinkedIn Ads](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

1. As a LinkedIn Ads user, perform the following tasks:

   1. Optional: If you don’t have an ad account to run and manage campaigns, [create one](https://www.linkedin.com/help/linkedin/answer/a426102/create-an-ad-account-in-campaign-manager-as-a-new-advertiser).
   2. Ensure that the [user account](https://www.linkedin.com/help/lms/answer/a417905?trk=hc-articlePage-peopleAlsoViewed) has at least a VIEWER role on the ad account.
   3. Use the user account to apply for Advertising API access.
      For more information, see the [Microsoft quick start](https://learn.microsoft.com/en-us/linkedin/marketing/quick-start?view=li-lms-2025-02#step-1-apply-for-api-access).
   4. Obtain a [refresh token](https://learn.microsoft.com/en-us/linkedin/shared/authentication/developer-portal-tools?context=linkedin%2Fcontext#generate-a-token-in-the-developer-portal). Use `3-legged oAuth` and the `r_ads_reporting` scope.
   5. Obtain the client ID and client secret from the LinkedIn Developer Portal. These credentials are available in the Auth tab in [App Details](https://www.linkedin.com/developers/apps).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role.
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.
6. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
   EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
7. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
   Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
8. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
9. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
10. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following sql commands:

    > ```sqlexample
    > CREATE DATABASE linkedin_destination_db;
    > CREATE SCHEMA linkedin_destination_db.linkedin_destination_schema;
    > GRANT USAGE ON DATABASE linkedin_destination_db TO ROLE <linkedin_connector_role>;
    > GRANT USAGE ON SCHEMA linkedin_destination_db.linkedin_destination_schema TO ROLE <linkedin_connector_role>;
    > GRANT CREATE TABLE ON SCHEMA linkedin_destination_db.linkedin_destination_schema TO ROLE <linkedin_connector_role>;
    > ```
11. Create a warehouse that will be used by the connector or use an existing one. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
    and the amount of data transferred. Large table numbers typically scale better with
    [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.
12. Ensure that the user with role used by the connector has the required privileges to use the warehouse. If that’s not the case then grant the required privileges to the role.

    > ```sqlexample
    > CREATE WAREHOUSE linkedin_connector_warehouse WITH WAREHOUSE_SIZE = 'X-Small';
    > GRANT USAGE ON WAREHOUSE linkedin_connector_warehouse TO ROLE <linkedin_connector_role>;
    > ```

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following SQL commands:

   > ```sqlexample
   > CREATE DATABASE DESTINATION_DB;
   > CREATE SCHEMA DESTINATION_DB.DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE DESTINATION_DB TO ROLE <CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > GRANT CREATE TABLE, CREATE PIPE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

> **Note:**
>
> Each process group is responsible for fetching data for a single report configuration.
> To use multiple configurations on a regular schedule, create a separate process group for each report configuration.

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Linkedin Ads Source Parameters: Used to establish connection with LinkedIn Ads API.
* Linkedin Ads Destination Parameters: Used to establish connection with Snowflake.
* Linkedin Ads Ingestion Parameters: Contains all parameters from the other two parameter contexts and additional parameters specific to a given process group.
  :   Because this parameter context contains ingestion-specific details, you must create new parameter contexts for each new report and process group.

#### Linkedin Ads Source Parameters

| Parameter | Description |
| --- | --- |
| Client ID | The client ID of an application registered on LinkedIn |
| Client Secret | The client secret related to the client ID |
| Refresh Token | A user obtains the refresh token after the app registration process. They use it together with the client ID and the client secret to get an access token. |
| Token Endpoint | The token endpoint is obtained by a user during the app registration process |

#### Linkedin Ads Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Linkedin Ads Ingestion Parameters

The following table lists parameters that are not inherited from the other parameter contexts:

| Parameter | Description |
| --- | --- |
| Report Name | The unique name of the report. It is uppercased and used as the destination table name. |
| Start Date | Start date from which ingestion should begin. Must be in the yyyy-MM-dd format. |
| Time Granularity | Time granularity of results. Possible values:   * `ALL`: Results grouped into a single result across the entire time range of the report. * `DAILY`: Results grouped by day. * `MONTHLY`: Results grouped by month. * `YEARLY`: Results grouped by year. |
| Conversion Window | The timeframe for which data is refreshed during incremental load when `DAILY` time granularity is chosen. For example, if the [conversion window](https://www.linkedin.com/help/lms/answer/a426359) is equal to 30 days, then during the INCREMENTAL load, the ingestion starts from the date of the last successful ingestion minus 30 days.  Required when `DAILY` time granularity is specified. For other possible time granularities, such as `ALL`, `MONTHLY`, and `YEARLY`, the SNAPSHOT ingestion strategy is used. Data from the start date to the present is always downloaded, so there is no need to use a conversion window.  The conversion window can be any number from 1 to 365. |
| Metrics | Comma-separated list of metrics. Metrics are case-sensitive. For more information, see [Reporting](https://learn.microsoft.com/en-us/linkedin/marketing/integrations/ads-reporting/ads-reporting?view=li-lms-2025-03&tabs=http#metrics-available).  The `pivotValues` and `dateRange` metrics are mandatory and are automatically included by the connector.  Up to 20 metrics can be specified, including the mandatory metrics. |
| Pivots | Comma-separated list of pivots. The available pivots are as follows:   * [Analytics Finder](https://learn.microsoft.com/en-us/linkedin/marketing/integrations/ads-reporting/ads-reporting?view=li-lms-2025-03&tabs=http#analytics-finder) * [Statistics Finder](https://learn.microsoft.com/en-us/linkedin/marketing/integrations/ads-reporting/ads-reporting?view=li-lms-2025-03&tabs=http#statistics-finder)   The connector uses the Analytics Finder when zero or one pivot is specified, and switches to the Statistics Finder when two or three pivots are selected. You can use a maximum of three pivots. |
| Shares | Comma-separated list of share IDs. This parameter can be used to filter results by share ID. |
| Campaigns | Comma-separated list of campaign IDs. This parameter can be used to filter results by campaign ID. |
| Campaign Groups | Comma-separated list of campaign group IDs. This parameter can be used to filter results by campaign group ID. |
| Accounts | Comma-separated list of account IDs. This parameter can be used to filter results by account ID. |
| Companies | Comma-separated list of company IDs. This parameter can be used to filter results by company ID. |

> **Note:**
>
> You must specify at least one of the filters, that is shares, campaigns, campaign groups, accounts, or companies.

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start.
   :   The connector starts the data ingestion.

---
title: Set up the Openflow Connector for Meta Ads
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/meta-ads/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Meta Ads

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Meta Ads.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Meta Ads](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Meta Ads](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a Meta Ads administrator, perform the following actions in your Meta Ads account:

1. [Create a Meta App](https://developers.facebook.com/docs/development/create-an-app/) or ensure that you have access to one.
2. Enable [Marketing API](https://developers.facebook.com/docs/marketing-api/get-started) in the [App dashboard](https://developers.facebook.com/apps).
3. Generate a [long-lived token](https://developers.facebook.com/docs/facebook-login/guides/access-tokens/get-long-lived/).
4. Optional: Increase the rate limit by [changing the app access type](https://developers.facebook.com/docs/marketing-api/overview/rate-limiting) from `Standard access` to `Advanced access` of the Ads Management Standard Access. Enable the `ads_read` and `ads_management` [permissions](https://developers.facebook.com/docs/permissions/).

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data.Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following sql commands:

   > ```sqlexample
   > CREATE DATABASE META_ADS_DESTINATION_DB;
   > CREATE SCHEMA META_ADS_DESTINATION_DB.META_ADS_DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE META_ADS_DESTINATION_DB TO ROLE <META_ADS_CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA META_ADS_DESTINATION_DB.META_ADS_DESTINATION_SCHEMA TO ROLE <META_ADS_CONNECTOR_ROLE>;
   > GRANT CREATE TABLE ON SCHEMA META_ADS_DESTINATION_DB.META_ADS_DESTINATION_SCHEMA TO ROLE <META_ADS_CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Meta Ads Source Parameters: Used to establish connection with MetaAds API.
* Meta Ads Destination Parameters: Used to establish connection with Snowflake.
* Meta Ads Ingestion Parameters: Used to define the configuration of data downloaded from Meta Ads.

#### Meta Ads Source Parameters

| Parameter | Description |
| --- | --- |
| Access Token | Token required to request Meta Ads Insights API |

#### Meta Ads Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Meta Ads Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Report Name | Name of the report to be used as a destination table name. The name must be unique within the destination schema. |
| Report Object Id | Identifier of the downloaded object from Meta Ads.  Reference to API listing different object ids:  * [Ad Accounts](https://developers.facebook.com/docs/graph-api/reference/user/adaccounts) * [Ad Sets](https://developers.facebook.com/docs/marketing-api/reference/ad-account/adsets/) * [Ads](https://developers.facebook.com/docs/marketing-api/reference/ad-account/ads/) * [Campaigns](https://developers.facebook.com/docs/marketing-api/reference/ad-account/campaigns/) |
| Report Ingestion Strategy | Mode in which data is fetched, either snapshot or incremental |
| Meta Ads Version | Version of Meta Ads API used for downloading reports. Allowed value: `v22.0`. |
| Report Level | Presents the aggregation level of the result.  Possible values:  * `account` * `campaign` * `ad` * `adset`. |
| Report Fields | Comma separated list of report fields |
| Report Breakdowns | Comma separated list of report breakdowns. Full list of available breakdowns can be found [here](https://developers.facebook.com/docs/marketing-api/insights/breakdowns). |
| Report Time Increment | Level of aggregation based on the day count  Possible values:  * `1` - Daily * `3` - Every 3 days * `7` - Weekly * `monthly` - Monthly * `90` - Quarterly * `all_days` - All days; do not slice the result |
| Report Action Time | Time of action stats  Possible values:  * `conversion` - Reports action based on conversion date * `impression` - Reports action based on impression date * `mixed` - Mixed approach between conversion and impression |
| Report Click Attribution Window | Attribution window for the click action  Possible values:  * `1d_click` * `7d_click` * `28d_click` |
| Report View Attribution Window | Attribution window for the view action  Possible values:  * `1d_view` * `7d_view` * `28d_view` |
| Report Schedule | Schedule time for processor creating reports |
| Report Start Date | Start date from which the ingestion should happen. The date format is YYYY-MM-DD. |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

## How to reset the connector

To fully reset connector to the initial state, do the following:

1. Ensure that there are no more flow files in the queues.
2. Stop all the processors.
3. Clear the state of the initial processor.

   > 1. Right click on the processor `Create Meta Ads Report` and select View State.
   > 2. Select the option Clear State. This resets the state of the processor.
4. Drop the destination table in Snowflake.

---
title: Set up the Openflow Connector for Microsoft Dataverse
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/dataverse/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Microsoft Dataverse

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Microsoft Dataverse.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Microsoft Dataverse](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Microsoft Dataverse](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a Microsoft Dataverse administrator, perform the following steps:

1. Ensure you have a Dataverse Environment to work with, and you have
   access to that environment through
   <https://admin.powerplatform.microsoft.com/>.
2. Ensure that you have an application registered in Microsoft Entra ID in portal.azure.com. This application must have
   access to the tenant we have our Dataverse Environment available. To register the application follow
   [this guide](https://learn.microsoft.com/en-us/power-apps/developer/data-platform/walkthrough-register-app-azure-active-directory).
3. Generate and store ClientID and Client Secret within that application.
4. Go to Power Apps Admin Center and configure your Dataverse Environment to be accessed via applications registered before.
   To do that, go to Manage » Environments and select the environment to configure. Then go to
   Settings » Users & permissions » Application users. Previously created applications
   must be added and granted with privileges necessary to read data from Microsoft Dataverse.
5. Copy and save the Environment URL of the selected Dataverse
   Environment from <https://admin.powerplatform.microsoft.com/>.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a Snowflake user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
   :   Create a database and schema to store the replicated data, and set up
       privileges for the service user to create tables in destination schema by granting the [USAGE and CREATE TABLE privileges](../../../../security-access-control-privileges.md).

       ```sqlexample
       CREATE DATABASE <destination_database>;
       CREATE SCHEMA <destination_database>.<destination_schema>;
       CREATE USER <openflow_user> TYPE=SERVICE COMMENT='Service user for automated access of Openflow';
       CREATE ROLE <openflow_role>;
       GRANT ROLE <openflow_role> TO USER <openflow_user>;
       GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_role>;
       GRANT USAGE ON SCHEMA <destination_database>.<destination_schema> TO ROLE <openflow_role>;
       GRANT CREATE TABLE ON SCHEMA <destination_database>.<destination_schema> TO ROLE <openflow_role>;
       CREATE WAREHOUSE <openflow_warehouse>
            WITH
                WAREHOUSE_SIZE = 'SMALL'
                AUTO_SUSPEND = 300
                AUTO_RESUME = TRUE;
       GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_role>;
       ```

   1. Create a pair of secure keys (public and private). Store the private key for the user in a file to supply to the connector’s configuration.
      Assign the public key to the Snowflake service user:

      ```sqlexample
      ALTER USER <openflow_user> SET RSA_PUBLIC_KEY = 'thekey';
      ```

      For more information, see [pair of keys](../../../../key-pair-auth.md).
2. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
3. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
4. Designate a warehouse for the connector to use. Grant the USAGE privilege on the warehouse to the role created before. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to install and configure the connector:

### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Dataverse Source Parameters: Used to establish connection with Dataverse.
* Dataverse Destination Parameters: Used to establish connection with Snowflake.
* Dataverse Ingestion Parameters: Used to define the configuration of data downloaded from Dataverse.

#### Dataverse Source Parameters

| Parameter | Description |
| --- | --- |
| Source Dataverse Environment URL | The main identifier of a source system to fetch data. The URL indicates a namespace where Dataverse tables exist. It also lets you create a scope parameter for OAuth. |
| Source Tenant ID | Microsoft Azure Tenant ID. It’s used to create OAuth URLs. Microsoft Dataverse Environment must belong to this tenant. |
| Source OAuth Client ID | Microsoft Azure Client ID used to access Microsoft Dataverse API. [Microsoft Dataverse Web API](https://learn.microsoft.com/en-us/power-apps/developer/data-platform/webapi/overview) uses OAuth authentication to secure access, and the connector uses the client credentials flow. To learn about client ID and how to find it in Microsoft Entra, see [Application ID (client ID)](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#application-id-client-id). |
| Source OAuth Client Secret | Microsoft Azure Client Secret used to access Microsoft Dataverse API. [Microsoft Dataverse Web API](https://learn.microsoft.com/en-us/power-apps/developer/data-platform/webapi/overview) uses OAuth authentication to secure access, and the connector uses the client credentials flow. To learn about client secret and how to find it in Microsoft Entra, see [Certificates & secrets](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#certificates--secrets). |

#### Dataverse Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

#### Dataverse Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Scheduling Interval | Interval to be used as a triggering interval for the processor fetching list of tables and initializing ingestion. |
| Source Tables Filter Strategy | Strategy for filtering tables to be ingested. Can be one of REGEXP and LIST. |
| Source Tables Filter Value | Value of the tables filter. When Source Tables Filter Strategy is set to REGEXP - this is the regular expression to be matching selected tables. When LIST is provided, then it is a comma separated list of table names. |
| Column Filter JSON | Optional. A JSON array specifying per-table column filters. Columns can be included or excluded by name (`included`, `excluded`) or by regular expression pattern (`includedPattern`, `excludedPattern`). The `table` value must be the **singular logical entity name** (e.g., `annotation`), not the plural entity set name used in `Source Tables Filter Value` (e.g., `annotations`). For example: `[ {"table": "mytable", "excluded": ["binarycolumn", "binarycolumn_binary"]} ]` excludes large binary columns from `mytable`. See Replicate a subset of columns in a table for full details. |

> **Note:**
>
> When configuring `Source Tables Filter Value`, use the **entity set name** (plural form,
> e.g., `annotations`) rather than the table name displayed in the Microsoft Dataverse
> interface. To find the entity set name for a table, go to
> [Power Apps](https://make.powerapps.com), select Tables, find your table,
> then select Advanced » Tools » Copy set name.
>
> The `Column Filter JSON` parameter uses a different naming convention — it requires the
> **singular logical entity name** (e.g., `annotation`). See
> Replicate a subset of columns in a table for details.

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

### Replicate a subset of columns in a table

The connector can filter the data replicated per table to a subset of configured columns.

To apply filters to columns, modify the Replication Parameters context `Column Filter` property to specify a JSON filter.
Add an array of configurations, one entry for every table to which you want to apply a filter.

> **Important:**
>
> The `table` field must use the **singular logical entity name** (e.g., `annotation`),
> not the plural entity set name used in `Source Tables Filter Value` (e.g., `annotations`).
> To find the logical entity name in Power Apps, go to [Power Apps](https://make.powerapps.com),
> select Tables, find your table, then select Advanced » Tools
> » Copy logical name.
>
> Some columns have a binary representation stored under a `_binary`-suffixed column name
> (for example, a column `mycolumn` may also appear as `mycolumn_binary`). To fully
> exclude such a column, list both names in the `excluded` array.

The following example excludes large binary columns from a table:

```javascript
[
    {
        "table": "mytable",
        "excluded": ["mycolumn", "mycolumn_binary"]
    }
]
```

Columns can be included or excluded by name or pattern. You can apply a single condition per table,
or combine multiple conditions, with exclusions taking precedence over inclusions.

The following example shows all available fields. The `table` field is mandatory. One or
more of `included`, `excluded`, `includedPattern`, `excludedPattern` is required.

```javascript
[
    {
        "table" : "<singular logical entity name>",
        "included": ["<column name>", "<column name>"],
        "excluded": ["<column name>", "<column name>"],
        "includedPattern": "<regular expression>",
        "excludedPattern": "<regular expression>",
    }
]
```

### Manage table state

The connector maintains per-table ingestion state in the `Dataverse Table State Service`
controller service. Each entry records the current ingestion status and the delta token
used for change tracking.

#### View connector state

To view the current state of all tables:

1. Right-click on the canvas and select Controller services.
2. Locate the controller service named Dataverse Table State Service.
3. In the Dataverse Table State Service menu, click View state.

The state is a set of key/value pairs where the key is the table entity set name
(for example, `accounts`). The value has the format
`<STATUS>;<deltaToken>;<skipToken>;<staleFlag>`, for example:

```text
accounts -> DONE;!AAAAAjE...;;
```

The `STATUS` can be one of the following:

* `FETCHING` — the connector is actively fetching records for this table.
* `DONE` — the last ingestion run completed successfully.

#### Restart ingestion for a single table

Clearing a table’s state causes the connector to perform a full re-ingestion of that
table on the next run. All previously synced records will be re-ingested.

To restart ingestion for a specific table:

1. Stop all processors in the flow.
2. Ensure that no in-flight FlowFiles are being processed for that table.
3. Right-click on the canvas and select Disable all controller services.
4. Go to Controller services and open the state view for Dataverse Table State Service.
5. Select the trash icon next to the table entry (identified by its entity set name) to
   remove the state for that table only.
6. Right-click on the canvas, select Enable all controller services, and then start all processors.

#### Restart ingestion for all tables

To restart ingestion for all replicated tables:

1. Stop all processors in the flow.
2. Clear all FlowFiles from the connector’s queues.
3. Right-click on the canvas and select Disable all controller services.
4. Go to Controller services and open the state view for Dataverse Table State Service.
5. Select Clear state to remove all table entries.
6. Right-click on the canvas, select Enable all controller services, and then start all processors.

> **Caution:**
>
> Do not delete FlowFiles manually while the connector is running. Doing so can leave
> a table in the `FETCHING` status indefinitely. If this occurs, restart ingestion
> for that table as described above.

---
title: Set up the Openflow Connector for MySQL
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/mysql/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for MySQL

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for MySQL.

> **Note:**
>
> This connector can be configured to immediately start replicating incremental changes for newly added tables,
> bypassing the snapshot load phase. This option is often useful when reinstalling the connector
> in an account where previously replicated data exists and you want to continue replication without having to re-snapshot tables.

For details on the incremental load process, see [Incremental replication](incremental-replication.md).

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for MySQL](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [MySQL](../../setup-openflow-spcs-sf-allow-list.md) connector.
4. Ensure that you have a MySQL 8 or a later version to synchronize data with Snowflake.
5. Recommended: Ensure that you add only one connector instance per runtime.
6. As a database administrator, perform the following tasks:

   1. Enable [binary logs](https://dev.mysql.com/doc/refman/8.4/en/binary-log.html), then save and configure its format as follows:

      |  |  |
      | --- | --- |
      | `log_bin` | Set to `on`.  This enables the binary log that records structural and data changes. |
      | `binlog_format` | Set to `row`.  The connector supports only row-based replication. MySQL 8.x versions may be the last ones to support this setting, and future versions will only support row-based replication.  Not applicable in GCP Cloud SQL, where it is fixed at the right value. |
      | `binlog_row_metadata` | Set to `full`.  The connector requires all row metadata to operate, most importantly, column names and primary key information.  Under Microsoft Azure Database for MySQL the `binlog_row_metadata` field is not user modifiable. Raise a Microsoft support ticket to change this value. |
      | `binlog_row_image` | Set to `full`.  The connector requires that all columns be written into the binary log.  Not applicable in Amazon Aurora, where it is fixed at the right value. |
      | `binlog_row_value_options` | Leave empty.  This option ony affects JSON columns, where it can be set to include only the modified parts of JSON documents for `UPDATE` statements. The connector requires that full documents are written into the binary log. |
      | `binlog_expire_logs_seconds` | Set to at least a few hours, or longer to ensure that the database agent can continue incremental replication after extended pauses or downtime. Snowflake recommends that you set the [binary log expiration period (binlog_expire_logs_seconds)](https://dev.mysql.com/doc/refman/8.4/en/replication-options-binary-log.html#sysvar_binlog_expire_logs_seconds) to at least a few hours to ensure stable working of the connector. After binary log expiration period ends, binary log files might be automatically removed. If the integration is paused for a long period, for example due to maintenance work, and the expired binary log files are deleted during this time, Openflow will not be able to replicate the data from these files.  If you’re using scheduled replication, the value needs to be longer than the configured schedule. |

      For example:

      ```sqlexample
      log_bin = on
      binlog_format = row
      binlog_row_metadata = full
      binlog_row_image = full
      binlog_row_value_options =
      ```
   2. Increase the value of `sort_buffer_size`.

      ```sqlexample
      sort_buffer_size = 4194304
      ```

      `sort_buffer_size` defines the amount of memory (in bytes) allocated per query thread for in-memory sorting operations, such ORDER BY.
      If the value is too small, the connector may fail with the following error message:

      `Out of sort memory, consider increasing server sort buffer size`.
      This indicates that `sort_buffer_size` should be raised.
   3. If you’re using Amazon RDS databases, then increase the retention period relevant to `binlog_expire_logs_seconds` using `rds_set_configuration`.
      For example, if you want to store binlog for 24 hours, then call `mysql.rds_set_configuration('binlog retention hours', 24)`.
   4. When using a read replica to connect, binary logging must be enabled on the replica.

      Configuration details are provided in step 4.
   5. After binary logging is enabled, configure the replica to log the events received from its source into its own binary log.

      ```sqlexample
      log_replica_updates = ON
      ```

      `log_replica_updates` allows the replica to write events received from its source to its own binary
      log, making those changes available to any databases that are replicating from it.
   6. Connect via SSL. If you’re planning to use an SSL connection to MySQL, prepare the root certificate for your database server.
      It is required during configuration.
   7. Create a user for the connector. The connector requires a user with the REPLICATION_SLAVE and REPLICATION_CLIENT privileges
      for reading the binary logs. Grant these privileges:

      ```sqlexample
      GRANT REPLICATION SLAVE ON *.* TO '<username>'@'%'
      GRANT REPLICATION CLIENT ON *.* TO '<username>'@'%'
      ```
   8. Grant the SELECT privilege on every replicated table:

      ```sqlexample
      GRANT SELECT ON <schema>.* TO '<username>'@'%'
      GRANT SELECT ON <schema>.<table> TO '<username>'@'%'
      ```

      For more information on replication security, see [Binary log](https://dev.mysql.com/doc/refman/8.4/en/binary-log.html).
7. As a Snowflake account administrator, perform the following tasks:

   1. Create a Snowflake user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
      Create a database to store the replicated data, and set up
      privileges for the Snowflake user to create objects in that database by granting the [USAGE and CREATE SCHEMA privileges](../../../../security-access-control-privileges.md).

      ```sqlexample
      CREATE DATABASE <destination_database>;
      CREATE USER <openflow_user> TYPE=SERVICE COMMENT='Service user for automated access of Openflow';
      CREATE ROLE <openflow_role>;
      GRANT ROLE <openflow_role> TO USER <openflow_user>;
      GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_role>;
      GRANT CREATE SCHEMA ON DATABASE <destination_database> TO ROLE <openflow_role>;
      CREATE WAREHOUSE <openflow_warehouse>
           WITH
               WAREHOUSE_SIZE = 'XSMALL'
               AUTO_SUSPEND = 300
               AUTO_RESUME = TRUE;
      GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_role>;
      ```
   2. Create a pair of secure keys (public and private). Store the private key for the user in a file to supply to the connector’s configuration.
      Assign the public key to the Snowflake service user:

      ```sqlexample
      ALTER USER <openflow_user> SET RSA_PUBLIC_KEY = 'thekey';
      ```

      For more information, see [pair of keys](../../../../key-pair-auth.md).
   3. Designate a warehouse for the connector to use. Start with the `XSMALL` warehouse size,
      then experiment with size depending on the amount of tables being replicated, and the amount of data
      transferred. Large table numbers typically scale better with [multi-cluster warehouses](../../../../warehouses-multicluster.md),
      rather than the warehouse size.

## Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, do the following as a data engineer:

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values.

   For more information on the required parameter values, see the following sections:

   * MySQL Source Parameters: Used to establish a connection with MySQL.
   * MySQL Destination Parameters: Used to establish a connection with Snowflake.
   * MySQL Ingestion Parameters: Used to specify the tables to replicate.

Start with setting the parameters of the MySQL Source Parameters context, then the MySQL Destination Parameters context.
After this is done, you can enable the connector. The connector should connect to both MySQL and Snowflake and start running.
However, the connector does not replicate any data until any tables to be replicated are explicitly added to its configuration.

To configure specific tables for replication, edit the MySQL Ingestion Parameters context. After you apply the changes to the
Replication Parameters context, the configuration is picked up by the connector, and the replication lifecycle starts for every table.

### MySQL Source Parameters

| Parameter | Description |
| --- | --- |
| MySQL Connection URL | The full JDBC URL to the source database. The connector uses the MariaDB driver, which is compatible with MySQL and requires the `jdbc:mariadb` prefix in the URL. If the SSL is disabled, then the connection URL should have the `allowPublicKeyRetrieval` parameter set to `true`.  Examples:   * With SSL enabled: `jdbc:mariadb://example.com:3306` * With SSL disabled: `jdbc:mariadb://example.com:3306?allowPublicKeyRetrieval=true` |
| MySQL JDBC Driver | The absolute path to the [MariaDB JDBC driver jar](https://mariadb.com/downloads/connectors/connectors-data-access/java8-connector/). The connector uses the MariaDB driver, which is compatible with MySQL. Select the Reference asset checkbox to upload the MariaDB JDBC driver.  Example: `/opt/resources/drivers/mariadb-java-client-3.5.2.jar` |
| MySQL Username | The username for the connector. |
| MySQL Password | The password for the connector. |

### MySQL Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Connection Strategy | When using KEY_PAIR, specify the strategy for connecting to Snowflake:   * **STANDARD** (default): Connect using standard public routing to Snowflake services. * **PRIVATE_CONNECTIVITY**: Connect using private addresses associated with the supporting cloud platform such as AWS PrivateLink. | Required for BYOC with KEY_PAIR only, otherwise ignored. |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use Snowflake Role assigned to the runtime or child role granted to this Snowflake Role.   You can find your runtime Snowflake Role in the Openflow UI, by expanding the More Options [⋮] button for your runtime and selecting Set Snowflake role. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

### MySQL Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Included Table Names | A comma-separated list of table paths, including their schemas. Example: `public.my_table, other_schema.other_table` |
| Included Table Regex | A regular expression to match against table paths. Every path matching the expression will be replicated, and new tables matching the pattern that get created later will also be included automatically. Example: `public\.auto_.*` |
| Column Filter JSON | Optional. A JSON array of filter objects specifying which columns to include or exclude per table. For syntax details and examples, see Replicate a subset of columns in a table. |
| Merge Task Schedule CRON | CRON expression defining periods when merge operations from Journal to Destination Table will be triggered. Set it to `* * * * * ?` if you want to have continuous merge or time schedule to limit warehouse run time. For example, the string `* 0 * * * ?` indicates that you want to schedule merges at full hour for one minute. The string `* 20 14 ? * MON-FRI` indicates that you want to schedule merges at 2:20 PM every Monday through Friday. For more information and examples, see the [CronTrigger tutorial](https://www.quartz-scheduler.org/documentation/quartz-2.2.2/tutorials/tutorial-lesson-06.html). |
| Object Identifier Resolution | Specifies how source object identifiers such as the names of schemas, tables, and columns are stored and queried in Snowflake. This setting specifies that you must use double quotes in SQL queries.  Option 1: Default, case-sensitive. For backwards compatibility.   * **Transformation**: Case is preserved.   For example, `My_Table` remains `My_Table`. * **Queries**: SQL queries must use double quotes to match the exact case for database objects.   For example, `SELECT * FROM "My_Table";`.   **Note:** Snowflake recommends using this option if you must preserve source casing for legacy or compatibility reasons. For example, if the source database includes table names that differ in case only–such as `MY_TABLE` and `my_table`–that would result in a name collision when using when using case-insensitive comparisons.  Option 2: Recommended, case-insensitive   * **Transformation**: All identifiers are converted to uppercase. For example, `My_Table` becomes `MY_TABLE`. * **Queries**: SQL queries are case-insensitive and don’t require SQL double quotes.   For example, `SELECT * FROM my_table;` returns the same results as `SELECT * FROM MY_TABLE;`.   **Note:** Snowflake recommends using this option if database objects are not expected to have mixed case names.  **Important:** Do not change this setting after the connector has begun ingesting data. Changing this setting after ingestion has begun breaks the existing ingestion. If you must change this setting, create a new connector instance. |

## Restart table replication

A table in FAILED state — for example, due to a missing primary key or unsupported schema change — does not restart automatically. If a table enters a FAILED state or you need to restart replication from scratch, use the following procedure to remove and re-add the table to replication.

> **Note:**
>
> If the failure was caused by an issue in the source table such as a missing primary key, resolve that issue in the source database before continuing.

1. Remove the table from flow parameters: In the Ingestion Parameters context, either remove the table from the Included Table Names or modify the Included Table Regex so the table is no longer matched.
2. Verify the table has been removed:

   1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services.
   2. In the table listing controller services, locate the Table State Store row, click the three vertical dots on the right side of the row, then choose View State.
   > **Important:**
   >
   > You must wait until the table’s state is fully removed from this list before proceeding. Do not continue until this configuration change has completed.
3. Clean up the destination: Once the table’s state shows as fully removed, manually [DROP](../../../../../sql-reference/sql/drop-table.md) the destination table in Snowflake. Note that the connector will not overwrite an existing destination table during the snapshot phase; if the table still exists, replication will fail again. Optionally, the journal table and stream can also be removed if they are no longer needed.
4. Re-add the table: Update the Included Table Names or Included Table Regex parameters to include the table again.
5. Verify the restart: Check the Table State Store using the instructions given previously. The state of the table should appear with the status NEW, then transition to SNAPSHOT_REPLICATION, and finally INCREMENTAL_REPLICATION.

## Replicate a subset of columns in a table

The connector can filter the data replicated per table to a subset of configured columns.
Primary key columns are always included regardless of exclusions.

To apply column filters, set the Column Filter JSON parameter in the Ingestion Parameters context
to a JSON array of filter objects, one per table you want to filter.

Columns can be included or excluded by name or by regular expression pattern. You can apply a single condition per table,
or combine multiple conditions, with exclusions always taking precedence over inclusions.

### Syntax

Each object in the array identifies a table and specifies which columns to include or exclude.

```javascript
[
    {
        "schema": "<schema>" | "schemaPattern": "<regex>",
        "table": "<table>" | "tablePattern": "<regex>",
        "included": ["<column>", "<column>"],
        "excluded": ["<column>", "<column>"],
        "includedPattern": "<regex>",
        "excludedPattern": "<regex>"
    }
]
```

The following rules apply:

* Use `schema` and `table` for exact name matching, or `schemaPattern` and `tablePattern`
  for regex matching. You cannot use both a field and its pattern variant in the same object
  (for example, `schema` and `schemaPattern` cannot both appear).
* At least one of `included`, `excluded`, `includedPattern`, or `excludedPattern` must be provided.
* When both included and excluded filters are specified, exclusions take precedence.
* When multiple filters match the same table, the last matching filter is used, with exact matches
  taking precedence over pattern-based filters.
* The value can be an array of objects to apply different filters to different tables.

### Examples

Include specific columns by name:

```javascript
[
    {
        "schema": "public",
        "table": "orders",
        "included": ["account_id", "status", "created_at"]
    }
]
```

Exclude specific columns by name:

```javascript
[
    {
        "schema": "public",
        "table": "orders",
        "excluded": ["internal_note", "debug_flag"]
    }
]
```

Combine an include pattern with a specific exclusion (for example, include all email columns except `admin_email`):

```javascript
[
    {
        "schema": "public",
        "table": "contacts",
        "includedPattern": ".*_email",
        "excluded": ["admin_email"]
    }
]
```

Mix a schema pattern with an exact table name to apply a filter across schemas:

```javascript
[
    {
        "schemaPattern": "data_.*",
        "table": "customers",
        "excluded": ["internal_note"]
    }
]
```

Pass multiple filter objects to apply different rules to different tables:

```javascript
[
    {"schema": "public", "table": "orders", "included": ["account_id", "status"]},
    {"schema": "public", "table": "customers", "excludedPattern": ".*_internal"}
]
```

## Track data changes in tables

The connector replicates not only the current state of data from the source tables,
but also every state of every row from every changeset. This data is stored in journal tables
created in the same schema as the destination table.

The journal table names are formatted as: `<source_table_name>_JOURNAL_<timestamp>_<schema_generation>`
where `<timestamp>` is the value of epoch seconds when the source table was added to replication, and `<schema_generation>` is an integer increasing with every schema change on the source table.
As a result, source tables that undergo schema changes will have multiple journal tables.

When a table is removed from replication, then added back, the `<timestamp>` value will change, and `<schema_generation>` will start again from `1`.

> **Important:**
>
> Snowflake recommends that you do not alter the structure of journal tables in any way.
> They are used by the connector to update the destination table as part of the replication process.

The connector never drops journal tables, but does make use of the latest
journal for every replicated source table, only reading append-only streams on top of journals.
To reclaim the storage, you can:

* Truncate all journal tables at any time.
* Drop the journal tables related to source tables that were removed from replication.
* Drop all but the latest generation journal tables for actively replicated tables.

For example, if your connector is set to actively replicate source table `orders`, and you have earlier removed table `customers` from replication, you may have the following journal tables. In this case you can drop all of them *except* `orders_5678_2`.

```output
customers_1234_1
customers_1234_2
orders_5678_1
orders_5678_2
```

## Configure scheduling of merge tasks

The connector uses a warehouse to merge change data capture (CDC) data into destination tables.
This operation is triggered by the MergeSnowflakeJournalTable processor. If there are no new changes or if no new flow files are waiting in
the MergeSnowflakeJournalTable queue, no merge is triggered and the warehouse auto-suspends.

To limit the warehouse cost and limit merges to only scheduled time, use the CRON expression in the Merge task Schedule CRON parameter.
It throttles the flow files coming to the MergeSnowflakeJournalTable processor
and merges are triggered only in a dedicated period of time.
For more information about scheduling, see [Scheduling strategy](https://nifi.apache.org/docs/nifi-docs/html/user-guide.html#scheduling-strategy).

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

---
title: Set up the Openflow Connector for PostgreSQL
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/postgres/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for PostgreSQL

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for PostgreSQL.

> **Note:**
>
> This connector can be configured to immediately start replicating incremental changes for newly added tables,
> bypassing the snapshot load phase. This option is often useful when reinstalling the connector
> in an account where previously replicated data exists and you want to continue replication without having to re-snapshot tables.

For details on the incremental load process, see [Incremental replication](incremental-replication.md).

For information about restarting table replication for failed tables, see
[Restart table replication](maintenance.md).

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for PostgreSQL](about.md).
2. Ensure that you have reviewed the [supported PostgreSQL versions](about.md).
3. Recommended: Ensure that you add only one connector instance per runtime.
4. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
5. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [PostgreSQL](../../setup-openflow-spcs-sf-allow-list.md) connector.
6. As a database administrator, perform the following tasks:

   1. Configure wal_level
   2. Create a publication
   3. Ensure that there is enough disk space on your PostgreSQL server for the WAL.
      This is because once created, a replication slot causes PostgreSQL to retain the WAL data from the position held by the replication slot,
      until the connector confirms and advances that position.
   4. Allow at least **1** logical replication slot and **2** WAL senders per Openflow Connector for PostgreSQL connector instance on the server. Set `max_replication_slots` and `max_wal_senders` high enough to cover that and all other replication traffic on the instance.
   5. Ensure that every table enabled for replication has a primary key. The key can be a single column or composite.
   6. Set the [REPLICA IDENTITY](https://www.postgresql.org/docs/current/sql-altertable.html#SQL-ALTERTABLE-REPLICA-IDENTITY) of tables to
      `DEFAULT`. This ensures that the primary keys are represented in the WAL, and the connector can read them.
   7. Create a user for the connector. The connector requires a user with the `REPLICATION` attribute and permissions
      to SELECT from every replicated table. Create that user with a password to enter into the connector’s configuration.
      For more information on replication security, see [Security](https://www.postgresql.org/docs/current/logical-replication-security.html).
7. As a Snowflake account administrator, perform the following tasks:

   1. Create a Snowflake user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
      Create a database to store the replicated data, and set up
      privileges for the Snowflake user to create objects in that database by granting the [USAGE and CREATE SCHEMA privileges](../../../../security-access-control-privileges.md).

      ```sqlexample
      CREATE DATABASE <destination_database>;
      CREATE USER <openflow_user> TYPE=SERVICE COMMENT='Service user for automated access of Openflow';
      CREATE ROLE <openflow_role>;
      GRANT ROLE <openflow_role> TO USER <openflow_user>;
      GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_role>;
      GRANT CREATE SCHEMA ON DATABASE <destination_database> TO ROLE <openflow_role>;
      CREATE WAREHOUSE <openflow_warehouse>
        WITH
          WAREHOUSE_SIZE = 'XSMALL'
          AUTO_SUSPEND = 300
          AUTO_RESUME = TRUE;
      GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_role>;
      ```
   2. Create a pair of secure keys (public and private). Store the private key for the user in a file to use while configuring the connector.
      Assign the public key to the Snowflake service user:

      ```sqlexample
      ALTER USER <openflow_user> SET RSA_PUBLIC_KEY = 'thekey';
      ```

      For more information, see [key-pair authentication](../../../../key-pair-auth.md).
   3. Designate a warehouse for the connector to use. Start with the `XSMALL` warehouse size,
      then experiment with size depending on the amount of tables being replicated, and the amount of data
      transferred. Large numbers of tables typically scale better with [multi-cluster warehouses](../../../../warehouses-multicluster.md),
      rather than the warehouse size.

### Configure wal_level

Openflow Connector for PostgreSQL requires [wal_level](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-WAL-LEVEL) to be set to `logical`.

Depending on where your PostgreSQL server is hosted, you can configure the wal_level as follows:

|  |  |
| --- | --- |
| On premise | Execute following query with superuser or user with `ALTER SYSTEM` privilege:  ```ini ALTER SYSTEM SET wal_level = logical; ``` |
| RDS | User used by the agent needs to have the `rds_superuser` or `rds_replication` roles assigned.  You also need to set:  * `rds.logical_replication` static parameter to 1. * `max_replication_slots`, `max_connections` and `max_wal_senders` parameters according to your database and replication setup. |
| AWS Aurora | Set the `rds.logical_replication` static parameter to 1. |
| GCP | Set the following flags:  * `cloudsql.logical_decoding=on`. * `cloudsql.enable_pglogical=on`.   For more information, see [Google Cloud documentation](https://cloud.google.com/sql/docs/postgres/replication/configure-logical-replication#set-up-logical-replication-with-pglogical). |
| Azure | Set the replication support to `Logical`. For more information, see [Azure documentation](https://learn.microsoft.com/en-us/azure/postgresql/single-server/concepts-logical#set-up-your-server). |

### Create a publication

Openflow Connector for PostgreSQL requires a [publication](https://www.postgresql.org/docs/current/logical-replication-publication.html#LOGICAL-REPLICATION-PUBLICATION) to be created and configured in PostgreSQL before replication starts.
You can create it for all, or a subset of tables, as well as for specific tables with specified columns only.
Make sure that every table and column that you plan to have replicated is included in the publication.
You can also modify the publication later, while the connector is running. To create and configure a publication, do the following:

1. Log in as a user with the CREATE privilege on the database and run the following query:

   * For PostgreSQL 13 and later:

     ```sqlsyntax
     CREATE PUBLICATION <publication name> WITH (publish_via_partition_root = true);
     ```

     The additional `publish_via_partition_root` is needed for correct replication of partitioned tables.
     To learn more about ingestion of partitioned tables see Replicate a partitioned table.
   * For PostgreSQL versions earlier than 13:

     ```sqlsyntax
     CREATE PUBLICATION <publication name>;
     ```
2. Define tables that the database agent will be able to see using:

> ```sqlsyntax
> ALTER PUBLICATION <publication name> ADD TABLE <table name>;
> ```
>
> For partitioned tables, it’s enough to just add the root partition table to the publication.
> See Replicate a partitioned table for more details.
>
> > **Important:**
> >
> > **PostgreSQL 15 and later** support configuring publications for a specified subset of table columns. For the
> > connector to support this correctly, you must use the column filtering settings
> > to include the same columns as set on the publication.
> >
> > Without this setting, the connector will behave as follows:
> >
> > * In the destination table, columns that are not included in the filter will be suffixed with `__DELETED`. All data
> >   replicated during the snapshot phase will be retained.
> > * After you add new columns to the publication, the table will be permanently failed, and you will need to restart its replication.
> >
> > For more information, see [ALTER PUBLICATION](https://www.postgresql.org/docs/current/sql-alterpublication.html).

## Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, do the following as a data engineer:

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values.

   For more information on the required parameter values, see the following sections:

   * PostgreSQL Source Parameters: Used to establish a connection with PostgreSQL.
   * PostgreSQL Destination Parameters: Used to establish a connection with Snowflake.
   * PostgreSQL Ingestion Parameters: Used to specify the tables to replicate.

Start with setting the parameters of the PostgreSQL Source Parameters context, then the PostgreSQL Destination Parameters context.
Once this is done, you can enable the connector, and it should connect both to PostgreSQL and Snowflake and start running.
However, it will not replicate any data until any tables are explicitly added to its configuration.

To configure specific tables for replication, edit the PostgreSQL Ingestion Parameters context. Shortly after you apply the changes to the
Replication Parameters context, the configuration will be picked up by the connector, and the replication lifecycle will start for every table.

### PostgreSQL Source Parameters

| Parameter | Description |
| --- | --- |
| PostgreSQL Connection URL | The full JDBC URL to the source database. Example: `jdbc:postgresql://example.com:5432/public`  If you are connecting to PostgreSQL replica server, see Replicate tables from a PostgreSQL replica server. |
| PostgreSQL JDBC Driver | The path to the [PostgreSQL JDBC driver jar](https://jdbc.postgresql.org/). Download the jar from its website, then select the Reference asset checkbox to upload and attach it. |
| PostgreSQL Username | The username for the connector. |
| PostgreSQL Password | The password for the connector. |
| Publication Name | The name of the publication you created earlier. |
| Replication Slot Name | Optional. When no value is provided, the connector will create a new, uniquely-named slot. When given a value, the connector will use the existing slot, or create a new one with the provided name.  Changing the value for a running connector will restart reading the incremental change data capture (CDC) stream from the updated slot’s position. |

### PostgreSQL Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Connection Strategy | When using KEY_PAIR, specify the strategy for connecting to Snowflake:   * **STANDARD** (default): Connect using standard public routing to Snowflake services. * **PRIVATE_CONNECTIVITY**: Connect using private addresses associated with the supporting cloud platform such as AWS PrivateLink. | Required for BYOC with KEY_PAIR only, otherwise ignored. |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use Snowflake Role assigned to the runtime or child role granted to this Snowflake Role.   You can find your runtime Snowflake Role in the Openflow UI, by expanding the More Options [⋮] button for your runtime and selecting Set Snowflake role. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

### PostgreSQL Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Included Table Names | A comma-separated list of table paths, including their schemas. Example: `public.my_table, other_schema.other_table`.  Select tables either by name or by Regex. If you use both, all matching tables from either option will be included.  Tables being sub-partitions are always excluded from ingestion. See Replicate a partitioned table for more information. |
| Included Table Regex | A regular expression to match against table paths. Every path matching the expression will be replicated, and new tables matching the pattern that get created later will also be included automatically. Example: `public\.auto_.*`  Select tables either by name or by Regex. If you use both, all matching tables from either option will be included.  Tables being sub-partitions are always excluded from ingestion. See Replicate a partitioned table for more information. |
| Column Filter JSON | Optional. A JSON array of filter objects specifying which columns to include or exclude per table. For syntax details and examples, see Replicate a subset of columns in a table. |
| Merge Task Schedule CRON | CRON expression defining periods when merge operations from Journal to Destination Table will be triggered. Set it to `* * * * * ?` if you want to have continuous merge or time schedule to limit warehouse run time.  For example:  * The string `* 0 * * * ?` indicates that you want to schedule merges at full hour for one minute * The string `* 20 14 ? * MON-FRI` indicates that you want to schedule merges at 2:20 PM every Monday through Friday.  For additional information and examples, see the cron triggers tutorial in the [Quartz Documentation](https://www.quartz-scheduler.org/documentation/quartz-2.2.2/tutorials/tutorial-lesson-06.html) |
| Object Identifier Resolution | Specifies how source object identifiers such as the names of schemas, tables, and columns are stored and queried in Snowflake. This setting specifies that you must use double quotes in SQL queries.  Option 1: Default, case-sensitive. For backwards compatibility.   * **Transformation**: Case is preserved.   For example, `My_Table` remains `My_Table`. * **Queries**: SQL queries must use double quotes to match the exact case for database objects.   For example, `SELECT * FROM "My_Table";`.   **Note:** Snowflake recommends using this option if you must preserve source casing for legacy or compatibility reasons. For example, if the source database includes table names that differ in case only–such as `MY_TABLE` and `my_table`–that would result in a name collision when using when using case-insensitive comparisons.  Option 2: Recommended, case-insensitive   * **Transformation**: All identifiers are converted to uppercase. For example, `My_Table` becomes `MY_TABLE`. * **Queries**: SQL queries are case-insensitive and don’t require SQL double quotes.   For example, `SELECT * FROM my_table;` returns the same results as `SELECT * FROM MY_TABLE;`.   **Note:** Snowflake recommends using this option if database objects are not expected to have mixed case names.  **Important:** Do not change this setting after the connector has begun ingesting data. Changing this setting after ingestion has begun breaks the existing ingestion. If you must change this setting, create a new connector instance. |

## Replicate tables from a PostgreSQL replica server

The connector can ingest data from a primary server, a [hot standby replica](https://www.postgresql.org/docs/current/hot-standby.html),
or subscriber server using [logical replication](https://www.postgresql.org/docs/current/logical-replication.html).
Before configuring the connector to connect to a PostgreSQL replica, ensure that replication between primary and replica
nodes works correctly. When investigating issues with missing data in the connector, first ensure that missing rows are
present in replica server used by the connector.

Additional considerations when connecting to a standby replica:

> * Only connecting to hot standby replica is supported. Note that warm standby replicas cannot accept connections
>   from clients until they are promoted to a primary instance.
> * PostgreSQL version of the server must be >= 16.
> * The publication needed by the connector must be created on
>   the primary server, not the standby server. The standby server is read-only and doesn’t allow to create publication.

If you connect to a hot standby instance and see
Trying to create the replication slot ‘<replication slot>’ timed out. If connecting to a standby instance, ensure there is some traffic on the primary PostgreSQL instance, otherwise the call to create a replication slot will never return.
error in the Openflow bulletin, or the Read PostgreSQL CDC Stream processor isn’t starting, log in to the primary PostgreSQL instance and
execute the following query:

```sqlsyntax
SELECT pg_log_standby_snapshot();
```

The error occurs when there are no data changes in the primary server. As such the connector can stall while
creating a replication slot on the replica server. This results from the replica server requiring information about
running transactions from the primary server to be able to create a replication slot. Primary servers won’t send the
information while idle. The `pg_log_standby_snapshot()` function forces the primary server to send information
about running transactions to the replica server.

## Replicate a subset of columns in a table

The connector can filter the data replicated per table to a subset of configured columns.
Primary key columns are always included regardless of exclusions.

To apply column filters, set the Column Filter JSON parameter in the Ingestion Parameters context
to a JSON array of filter objects, one per table you want to filter.

Columns can be included or excluded by name or by regular expression pattern. You can apply a single condition per table,
or combine multiple conditions, with exclusions always taking precedence over inclusions.

### Syntax

Each object in the array identifies a table and specifies which columns to include or exclude.

```javascript
[
    {
        "schema": "<schema>" | "schemaPattern": "<regex>",
        "table": "<table>" | "tablePattern": "<regex>",
        "included": ["<column>", "<column>"],
        "excluded": ["<column>", "<column>"],
        "includedPattern": "<regex>",
        "excludedPattern": "<regex>"
    }
]
```

The following rules apply:

* Use `schema` and `table` for exact name matching, or `schemaPattern` and `tablePattern`
  for regex matching. You cannot use both a field and its pattern variant in the same object
  (for example, `schema` and `schemaPattern` cannot both appear).
* At least one of `included`, `excluded`, `includedPattern`, or `excludedPattern` must be provided.
* When both included and excluded filters are specified, exclusions take precedence.
* When multiple filters match the same table, the last matching filter is used, with exact matches
  taking precedence over pattern-based filters.
* The value can be an array of objects to apply different filters to different tables.

### Examples

Include specific columns by name:

```javascript
[
    {
        "schema": "public",
        "table": "orders",
        "included": ["account_id", "status", "created_at"]
    }
]
```

Exclude specific columns by name:

```javascript
[
    {
        "schema": "public",
        "table": "orders",
        "excluded": ["internal_note", "debug_flag"]
    }
]
```

Combine an include pattern with a specific exclusion (for example, include all email columns except `admin_email`):

```javascript
[
    {
        "schema": "public",
        "table": "contacts",
        "includedPattern": ".*_email",
        "excluded": ["admin_email"]
    }
]
```

Mix a schema pattern with an exact table name to apply a filter across schemas:

```javascript
[
    {
        "schemaPattern": "data_.*",
        "table": "customers",
        "excluded": ["internal_note"]
    }
]
```

Pass multiple filter objects to apply different rules to different tables:

```javascript
[
    {"schema": "public", "table": "orders", "included": ["account_id", "status"]},
    {"schema": "public", "table": "customers", "excludedPattern": ".*_internal"}
]
```

## Replicate a partitioned table

The connector supports replication of partitioned tables for PostgreSQL servers with version >= 15. A PostgreSQL
partitioned table will be replicated into Snowflake as a single destination table.

For example, if you have a partitioned table `orders`, with sub-partitions `orders_2023`, `orders_2024`,
and configured the connector to ingest all tables matching `orders.*` pattern, then only the `orders` table
will be replicated to Snowflake, and it will include data from all sub-partitions.

To support replication of partitioned tables, ensure that the publication
created in PostgreSQL has the `publish_via_partition_root` option set to `true`.

Ingestion of partitioned tables has currently the following limitations:

* When a table is attached as a partition to a partitioned table after ingestion was started, the connector won’t fetch
  data that existed in the partition table before attaching.
* When a sub-partition table is detached from the partitioned table after ingestion was started, the connector won’t
  mark the data from this sub-partition as deleted in the root partition table.
* Truncate operation on subpartitions will not mark affected records as deleted.

## Track data changes in tables

The connector replicates not only the current state of data from the source tables,
but also every state of every row from every changeset. This data is stored in journal tables
created in the same schema as the destination table.

The journal table names are formatted as: `<source_table_name>_JOURNAL_<timestamp>_<schema_generation>`
where `<timestamp>` is the value of epoch seconds when the source table was added to replication, and `<schema_generation>` is an integer increasing with every schema change on the source table.
As a result, source tables that undergo schema changes will have multiple journal tables.

When a table is removed from replication, then added back, the `<timestamp>` value will change, and `<schema_generation>` will start again from `1`.

> **Important:**
>
> Snowflake recommends that you do not alter the structure of journal tables in any way.
> They are used by the connector to update the destination table as part of the replication process.

The connector never drops journal tables, but does make use of the latest
journal for every replicated source table, only reading append-only streams on top of journals.
To reclaim the storage, you can:

* Truncate all journal tables at any time.
* Drop the journal tables related to source tables that were removed from replication.
* Drop all but the latest generation journal tables for actively replicated tables.

For example, if your connector is set to actively replicate source table `orders`,
and you have earlier removed table `customers` from replication, you may have
the following journal tables. In this case you can drop all of them *except* `orders_5678_2`.

```output
customers_1234_1
customers_1234_2
orders_5678_1
orders_5678_2
```

## Configure scheduling of merge tasks

The connector uses a warehouse to merge change data capture (CDC) data into destination tables.
This operation is triggered by the MergeSnowflakeJournalTable processor. If there are no new changes or if no new flow files are waiting in
the MergeSnowflakeJournalTable queue, no merge is triggered and the warehouse auto-suspends.

To limit the warehouse cost and limit merges to only scheduled time, use the CRON expression in the Merge task Schedule CRON parameter.
It throttles the flow files coming to the MergeSnowflakeJournalTable processor
and merges are triggered only in a dedicated period of time.
For more information about scheduling, see [Scheduling strategy](https://nifi.apache.org/docs/nifi-docs/html/user-guide.html#scheduling-strategy).

## Stop or delete the connector

When stopping or removing the connector, you have to consider the [replication slot](https://www.postgresql.org/docs/current/warm-standby.html#STREAMING-REPLICATION-SLOTS) that the connector uses.

The connector creates its own replication slot with a name starting with
`snowflake_connector_` followed by a random suffix. As the connector reads the replication stream,
it advances the slot, so that PostgreSQL can trim its WAL log and free up disk space.

When the connector is paused, the slot is not advanced, and changes to the source database keep increasing the WAL
log size. You should not keep the connector paused for extended periods of time, especially on high-traffic databases.

When the connector is removed, whether by deleting it from the Openflow canvas,
or any other means, such as deleting the whole Openflow instance, the replication slot remains in place, and must be dropped manually.

If you have multiple connector instances replicating from the same PostgreSQL database,
each instance will create its own uniquely-named replication slot. When dropping a replication slot manually, make sure
it’s the right one. You can see which replication slot is used by a given connector instance by checking the state of the `CaptureChangePostgreSQL` processor.

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

---
title: Set up the Openflow Connector for SharePoint
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sharepoint/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for SharePoint

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for SharePoint.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for SharePoint](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [SharePoint](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Set up access to your SharePoint site

As an Azure or Office 365 account administrator, perform the following actions:

#. Ensure that you have a [Microsoft Graph](https://learn.microsoft.com/en-us/graph/overview) application registered and that it is configured with the
following [application permissions](https://learn.microsoft.com/en-us/graph/permissions-overview?tabs=http#application-permissions) based on your requirements:

> **For Microsoft SharePoint (Cortex Search, document ACLs) and Microsoft SharePoint (Simple Ingest, document ACLs):**
>
> * `Sites.Selected`: Limits access to only specified sites.
>   :   For more information, see [Sites.Selected](https://learn.microsoft.com/en-us/graph/permissions-reference#sitesselected).
> * `GroupMember.Read.All`: Used for resolving SharePoint group permissions.
>   :   For more information, see [GroupMember.Read.All](https://learn.microsoft.com/en-us/graph/permissions-reference#groupmemberreadall).
> * `User.ReadBasic.All`: Used for resolving Microsoft 365 user emails.
>   :   For more information, see [User.ReadBasic.All](https://learn.microsoft.com/en-us/graph/permissions-reference#userreadbasicall).
>
> **For Microsoft SharePoint (Cortex Search, no document ACLs) and Microsoft SharePoint (Simple Ingest, no document ACLs):**
>
> * `Sites.Selected`: Limits access to only specified sites.
>   :   For more information, see [Sites.Selected](https://learn.microsoft.com/en-us/graph/permissions-reference#sitesselected).

1. Grant the `fullcontrol` role to the application in the selected sites.

   This role handles folder access changes during CDC ingestion. Grant it using the [Grant-PnPAzureADAppSitePermission](https://github.com/pnp/powershell/blob/dev/documentation/Grant-PnPAzureADAppSitePermission.md) cmdlet, or by calling the [GraphAPI permission endpoint](https://learn.microsoft.com/en-us/graph/api/site-post-permissions), e.g. using `curl`.

   For more information, see [Roles](https://learn.microsoft.com/en-us/graph/permissions-selected-overview?tabs=http#roles).

   > **Note:**
   >
   > If you cannot grant the `fullcontrol` role, grant the narrower `read` role to the application instead. However, if access to a folder in the ingested site changes, the connector may enter an irreparable state and will require a full re-ingestion of data. Snowflake recommends granting the `fullcontrol` role to fully mitigate this issue.
2. Configure application credentials based on your use case:

   **For Microsoft SharePoint (Cortex Search, document ACLs) and Microsoft SharePoint (Simple Ingest, document ACLs):**

   * Add a new certificate or ensure that you have access to the existing certificate file and its private key.
     For more information, see [Option 1: Add a certificate](https://learn.microsoft.com/en-us/graph/auth-register-app-v2#option-1-add-a-certificate).
   * Create a new client secret and record the secret’s value.
     :   For more information, see [Option 2: Add a client secret](https://learn.microsoft.com/en-us/graph/auth-register-app-v2#option-2-add-a-client-secret).

   **For Microsoft SharePoint (Cortex Search, no document ACLs) and Microsoft SharePoint (Simple Ingest, no document ACLs):**

   * Create a new client secret and record the secret’s value.
     :   For more information, see [Option 2: Add a client secret](https://learn.microsoft.com/en-us/graph/auth-register-app-v2#option-2-add-a-client-secret).
3. Record the following information from your Microsoft Graph application:

   * The client ID of your application.
     :   For more information, see [Application ID (client ID)](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#application-id-client-id).
   * The tenant ID of your application.
     :   For more information, see [Find your Microsoft 365 tenant ID](https://learn.microsoft.com/en-us/sharepoint/find-your-office-365-tenant-id).
   * The site URL of the Microsoft 365 SharePoint site with the files or folders that you want to ingest into Snowflake; for example, `https://yourtenant.sharepoint.com/sites/YourSite`.

## Set up your Snowflake account

As a Snowflake account administrator, perform the following tasks manually
or by using the script included below:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

### Example setup

> ```sqlexample
> --The following script assumes you'll need to create all required roles, users, and objects.
> --However, you may want to reuse some that are already in existence.
>
> --Create a Snowflake service user to manage the connector
> USE ROLE USERADMIN;
> CREATE USER <openflow_service_user> TYPE=SERVICE COMMENT='Service user for Openflow automation';
>
> --Create a pair of secure keys (public and private). For more information, see
> --key-pair authentication. Store the private key for the user in a file to supply
> --to the connector’s configuration. Assign the public key to the Snowflake service user:
> ALTER USER <openflow_service_user> SET RSA_PUBLIC_KEY = '<pubkey>';
>
>
> --Create a role to manage the connector and the associated data and
> --grant it to that user
> USE ROLE SECURITYADMIN;
> CREATE ROLE <openflow_connector_admin_role>;
> GRANT ROLE <openflow_connector_admin_role> TO USER <openflow_service_user>;
>
>
> --The following block is for USE CASE 2 (Cortex connect) ONLY
> --Create a role for read access to the cortex search service created by this connector.
> --This role should be granted to any role that will use the service
> CREATE ROLE <cortex_search_service_read_only_role>;
> GRANT ROLE <cortex_search_service_read_only_role> TO ROLE <whatever_roles_will_access_search_service>;
>
> --Create the database the data will be stored in and grant usage to the roles created
> USE ROLE ACCOUNTADMIN; --use whatever role you want to own your DB
> CREATE DATABASE IF NOT EXISTS <destination_database>;
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_connector_admin_role>;
>
> --Create the schema the data will be stored in and grant the necessary privileges
> --on that schema to the connector admin role:
> USE DATABASE <destination_database>;
> CREATE SCHEMA IF NOT EXISTS <destination_schema>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
> GRANT CREATE TABLE, CREATE DYNAMIC TABLE, CREATE STAGE, CREATE SEQUENCE, CREATE CORTEX
> SEARCH SERVICE ON SCHEMA <destination_schema> TO ROLE <openflow_connector_admin_role>;
>
> --The following block is for CASE 2 (Cortex connect) ONLY
> --Grant the Cortex read-only role access to the database and schema
> GRANT USAGE ON DATABASE <destination_database> TO ROLE <cortex_search_service_read_only_role>;
> GRANT USAGE ON SCHEMA <destination_schema> TO ROLE <cortex_search_service_read_only_role>;
>
> --Create the warehouse this connector will use if it doesn't already exist. Grant the
> --appropriate privileges to the connector admin role. Adjust the size according to your needs.
> CREATE WAREHOUSE <openflow_warehouse>
> WITH
>    WAREHOUSE_SIZE = 'MEDIUM'
>    AUTO_SUSPEND = 300
>    AUTO_RESUME = TRUE;
> GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_connector_admin_role>;
> ```

## Use case 1: Ingest files only

Use a connector to:

* Ingest and continuously update Sharepoint files for custom processing within Snowflake
* Optionally ingest file permissions (ACL connectors) to persist access controls downstream

### Set up the connector

As a data engineer, perform the following tasks to configure the connector:

#### Install the connector

> **Note:**
>
> There are multiple variants of the SharePoint connector. Choose the variant that best fits your use case as described in [Variants of the Openflow Connector for SharePoint](about.md).

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Populate the process group parameters

   1. Right-click on the imported process group and select **Parameters**.
   2. Enter the required parameter values as described in Sharepoint Ingestion Parameters, Sharepoint Destination Parameters and Sharepoint Source Parameters.

##### Sharepoint Source Parameters

**For all connectors:**

| Parameter | Description |
| --- | --- |
| SharePoint Site URL | URL or SharePoint site from which the connector will ingest content |
| SharePoint Client ID | Microsoft Entra client ID. To learn about client ID and how to find it in Microsoft Entra, see [Application ID (client ID)](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#application-id-client-id). |
| SharePoint Client Secret | Microsoft Entra Client Secret. To learn about a client secret and how to find it in Microsoft Entra, see [Certificates & secrets](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#certificates--secrets). |
| SharePoint Tenant ID | Microsoft Entra Tenant ID. To learn about tenant ID and how to find it in Microsoft Entra, see [Find your Microsoft 365 tenant ID](https://learn.microsoft.com/en-us/sharepoint/find-your-office-365-tenant-id). |

**For ACL connectors only:**

| Parameter | Description |
| --- | --- |
| Sharepoint Application Private Key | A generated application private key in PEM format. The key must be unencrypted. |
| Sharepoint Site Domain | A domain name of the synchronized Sharepoint site. |
| Sharepoint Application Certificate | A generated application certificate in PEM format. |

##### Sharepoint Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

##### Sharepoint Ingestion Parameters

**For all connectors:**

| Parameter | Description |
| --- | --- |
| SharePoint Source Folder | Supported files from this folder and all its subfolders is ingested into Snowflake. The folder path is relative to a Shared Documents library. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. To learn about the formats that can be converted, see [Format options](https://learn.microsoft.com/en-us/graph/api/driveitem-get-content-format?view=graph-rest-1.0&tabs=http#format-options) If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Sharepoint Document Library Name | A library in the SharePoint Site to ingest files from. |
| Snowflake File Hash Table Name | Name of the table to store file hashes to determine if the content has changed. This parameter should generally not be changed. |

**For ACL connectors only:**

| Parameter | Description |
| --- | --- |
| Sharepoint Site Groups Enabled | Specifies whether the Site Groups functionality is enabled. |

1. Run the flow.

   1. Start the process group. The flow will create all required objects
      inside of Snowflake.
   2. Right click on the imported process group and select **Start**.

## Use case 2: Ingest files and perform processing with Cortex

Use the predefined flow definition to:

* Create AI assistants for documents within your organization’s SharePoint site
* Enable your AI assistants to adhere to access controls specified in your organization’s SharePoint site

### Set up the connector

As a data engineer, perform the following tasks to configure the connector:

#### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following SQL commands:

   > ```sqlexample
   > CREATE DATABASE DESTINATION_DB;
   > CREATE SCHEMA DESTINATION_DB.DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE DESTINATION_DB TO ROLE <CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > GRANT CREATE TABLE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Populate the process group parameters

   1. Right click on the imported process group and select **Parameters**.
   2. Enter the required parameter values as described in Sharepoint Cortex Connect Source Parameters, Sharepoint Cortex Connect Destination Parameters and Sharepoint Cortex Connect Ingestion Parameters.

##### Sharepoint Cortex Connect Source Parameters

**For all connectors:**

| Parameter | Description |
| --- | --- |
| SharePoint Site URL | URL or SharePoint site from which the connector will ingest content |
| SharePoint Client ID | Microsoft Entra client ID. To learn about client ID and how to find it in Microsoft Entra, see [Application ID (client ID)](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#application-id-client-id). |
| SharePoint Client Secret | Microsoft Entra Client Secret. To learn about a client secret and how to find it in Microsoft Entra, see [Certificates & secrets](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#certificates--secrets). |
| SharePoint Tenant ID | Microsoft Entra Tenant ID. To learn about tenant ID and how to find it in Microsoft Entra, see [Find your Microsoft 365 tenant ID](https://learn.microsoft.com/en-us/sharepoint/find-your-office-365-tenant-id). |

**For ACL connectors only:**

| Parameter | Description |
| --- | --- |
| Sharepoint Application Private Key | A generated application private key in PEM format. The key must be unencrypted. |
| Sharepoint Site Domain | A domain name of the synchronized Sharepoint site. |
| Sharepoint Application Certificate | A generated application certificate in PEM format. |

##### Sharepoint Cortex Connect Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

##### Sharepoint Cortex Connect Ingestion Parameters

**For all connectors:**

| Parameter | Description |
| --- | --- |
| SharePoint Source Folder | Supported files from this folder and all its subfolders is ingested into Snowflake. The folder path is relative to a Shared Documents library. |
| File Extensions To Ingest | A comma-separated list that specifies file extensions to ingest. The connector tries to convert the files to PDF format first, if possible. Nonetheless, the extension check is performed on the original file extension. To learn about the formats that can be converted, see [Format options](https://learn.microsoft.com/en-us/graph/api/driveitem-get-content-format?view=graph-rest-1.0&tabs=http#format-options) If some of the specified file extensions are not supported by Cortex Parse Document, then the connector ignores those files, logs a warning message in an event log, and continues processing other files. |
| Sharepoint Document Library Name | A library in the SharePoint Site to ingest files from. |
| Snowflake File Hash Table Name | Name of the table to store file hashes to determine if the content has changed. This parameter should generally not be changed. |
| OCR Mode | The OCR mode to use when parsing files with [Parsing documents with AI_PARSE_DOCUMENT](../../../../snowflake-cortex/parse-document.md) function. The value can be `OCR` or `LAYOUT`. In `OCR` mode, only raw text content is extracted, ignoring formatting and table structures. In `LAYOUT` mode, the output preserves table structures as Markdown. |
| Snowflake Cortex Search Service User Role | An identifier of a role that is assigned usage permissions on the Cortex Search service. |

**For ACL connectors only:**

| Parameter | Description |
| --- | --- |
| Sharepoint Site Groups Enabled | Specifies whether the Site Groups functionality is enabled. |

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.
3. Query the Cortex Search service.

## Use case 3: Customise the connector definition

Customize the connector definition to perform custom processing on ingested files.

### Set up the connector

As a data engineer, perform the following tasks to configure the connector:

#### Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Customize the connector definition.

   1. Remove the following process groups:

      * Check If Duplicate Content
      * Snowflake Stage and Parse PDF
      * Update Snowflake Cortex
      * (Optional) Process Microsoft365 Groups
   2. Attach any custom processing to the output of the
      `Process SharePoint Metadata` process group. Each flow file
      represents a single SharePoint file change.
2. Populate the process group parameters. Follow the same process as for
   the use case 1. Note that after modifying the connector definition,
   not all parameters might be required.
3. Run the flow.

   1. Start the process group. The flow will create all required objects
      inside of Snowflake.
   2. Right click on the imported process group and select **Start**.
4. Query the Cortex Search service.

## Enabling Sharepoint site groups

### Microsoft Graph application for site groups

In addition to the steps specified in Set up access to your SharePoint site, do the following:

1. Add [Sites.Selected](https://learn.microsoft.com/en-us/graph/permissions-reference#sitesselected) SharePoint permission.

   > **Note:**
   >
   > You should see `Sites.Selected` in both Microsoft Graph and SharePoint permissions.
2. [Generate a key pair](https://learn.microsoft.com/en-us/entra/identity-platform/howto-create-self-signed-certificate).
   Alternatively, you can create a self-signed certificate with `openssl` by running the following command:

   ```bash
   openssl req -x509 -nodes -newkey rsa:2048 -keyout key.pem -out cert.pem -days 365
   ```

   > **Note:**
   >
   > The command above doesn’t encrypt the generated private key. Remove the `-nodes` argument if you want to generate an encrypted key.
3. [Attach the certificate](https://learn.microsoft.com/en-us/graph/applications-how-to-add-certificate?tabs=http) to the Microsoft Graph application.

## Query the Cortex Search service

You can use the [Cortex Search](../../../../snowflake-cortex/cortex-search/cortex-search-overview.md) service to build chat
and search applications to chat with or query your documents in SharePoint.

After you install and configure the connector and it begins
ingesting content from Sharepoint, you can query the Cortex Search service.
For more information about using Cortex Search, see [Query a Cortex Search service](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md).

**Filter responses**

To restrict responses from the Cortex Search service to documents that a specific user
has access to in SharePoint, you can specify a filter containing the user ID or email address of the user
when you query Cortex Search. For example, `filter.@contains.user_ids` or `filter.@contains.user_emails`.
The name of the Cortex Search service created by the connector is `search_service` in the schema `Cortex`.

Run the following SQL code in a SQL worksheet to query
the Cortex Search service with files ingested from your SharePoint site.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.
* `your_question`: The question that you want to get responses for.
* `number_of_results`: Maximum number of results to return in the response. The maximum value is 1000 and the default value is 10.

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<application_instance_name>.cortex.search_service',
      '{
        "query": "<your_question>",
         "columns": ["chunk", "web_url"],
         "filter": {"@contains": {"user_emails": "<user_emailID>"} },
         "limit": <number_of_results>
       }'
   )
)['results'] AS results
```

Here’s a complete list of values that you can enter for `columns`:

**For all connectors:**

| Column name | Type | Description |
| --- | --- | --- |
| `full_name` | String | A full path to the file from the Sharepoint site documents root. Example: `folder_1/folder_2/file_name.pdf`. |
| `web_url` | String | A URL that displays an original Sharepoint file in a browser. |
| `last_modified_date_time` | String | Date and time when the item was most recently modified. |
| `chunk` | String | A piece of text from the document that matched the Cortex Search query. |

**For ACL connectors only:**

| Column name | Type | Description |
| --- | --- | --- |
| `user_ids` | Array | An array of Microsoft 365 user IDs that have access to the document. It also includes user IDs from all the Microsoft 365 groups that are assigned to the document. To find a specific user ID, see [Get a user](https://learn.microsoft.com/en-us/graph/api/user-get?view=graph-rest-1.0&tabs=http). |
| `user_emails` | Array | An array of Microsoft 365 user email IDs that have access to the document. It also includes user email IDs from all the Microsoft 365 groups that are assigned to the document. |

**Example: Query an AI assistant for human resources (HR) information**

You can use Cortex Search to query an AI assistant for employees to chat with the latest versions of
HR information, such as onboarding, code of conduct, team processes, and organization policies.
Using response filters, you can also allow HR team members to query employee contracts while adhering to access controls configured in SharePoint.

SQLPythonREST API

Run the following in a [SQL worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the Cortex Search service with files ingested from SharePoint.
Select the database as your application instance name and schema as **Cortex**.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```sqlexample
SELECT PARSE_JSON(
     SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
          '<application_instance_name>.cortex.search_service',
          '{
             "query": "What is my vacation carry over policy?",
             "columns": ["chunk", "web_url"],
             "filter": {"@contains": {"user_emails": "<user_emailID>"} },
             "limit": 1
          }'
     )
 )['results'] AS results
```

Run the following code in a [Python worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the
Cortex Search service with files ingested from SharePoint.
Ensure that you add the `snowflake.core` package to your database.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.core import Root

def main(session: snowpark.Session):

   root = Root(session)

   # fetch service
   my_service = (root
     .databases["<application_instance_name>"]
     .schemas["cortex"]
     .cortex_search_services["search_service"]
   )

   # query service
   resp = my_service.search(
     query="What is my vacation carry over policy?",
     columns = ["chunk", "web_url"],
     filter = {"@contains": {"user_emails": "<user_emailID>"} },
     limit=1
   )
   return (resp.to_json())
```

Execute the following code in a command-line interface to query the Cortex Search
service with files ingested from your SharePoint.
You will need to authentication through key pair authentication and OAuth to access the
Snowflake REST APIs. For more information,
see [REST API](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md)
and [Authenticating Snowflake REST APIs with Snowflake](../../../../../developer-guide/snowflake-rest-api/authentication.md).

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `account_url`: Your Snowflake account URL. For instructions on finding your account URL, see [Finding the organization and account name for an account](../../../../admin-account-identifier.md).

```bash
curl --location "https://<account_url>/api/v2/databases/<application_instance_name>/schemas/cortex/cortex-search-services/search_service" \
     --header 'Content-Type: application/json' \
     --header 'Accept: application/json' \
     --header "Authorization: Bearer <CORTEX_SEARCH_JWT>" \
     --data '{
         "query": "What is my vacation carry over policy?",
         "columns": ["chunk", "web_url"],
         "limit": 1
     }'
```

Sample response:

```output
{
  "results" : [ {
  "web_url" : "https://<domain>.sharepoint.com/sites/<site_name>/<path_to_file>",
  "chunk" : "Answer to the question asked."
  } ]
}
```

## Finding files in stage

Files stored in the stage may have unreadable names. To find specific files, use the metadata
tables as your source of truth. These tables contain the mapping between file names and their
corresponding file IDs in the stage.

For Cortex-enabled setups, use the following query to find files:

```sqlexample
SELECT DISTINCT METADATA:id FROM DOCS_CHUNKS WHERE METADATA:fullName LIKE '%<file_name>%';
```

For non-Cortex setups, use the following query:

```sqlexample
SELECT FILE_ID FROM DOC_METADATA WHERE FILE_NAME = '<file_name>';
```

Replace `<file_name>` with the name or partial name of the file you’re looking for.

The files in the stage start with the ID returned from these queries.

---
title: Set up the Openflow Connector for Slack
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/slack/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Slack

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Slack.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Slack](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Slack](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Set up a Slack App

Set up a Slack App in your Slack workspace. A Slack Admin is needed to
set up access to the Slack Workspace. This is done by creating or
supplying credentials to a Slack App and installing the App to the
Slack workspace and channels. You can create a Slack App by using the
JSON configuration:

1. Update the JSON manifest. Copy the JSON manifest text below. Change
   the name and display name properties from
   `EXAMPLE_NAME_CHANGE_THIS` to the desired name of your Slack App.
   It is recommended to use the same name and display name for your App.

   > ```json
   > {
   >     "display_information": {
   >         "name": "EXAMPLE_NAME_CHANGE_THIS"
   >     },
   >     "features": {
   >         "bot_user": {
   >             "display_name": "EXAMPLE_NAME_CHANGE_THIS",
   >             "always_online": false
   >         }
   >     },
   >     "oauth_config": {
   >         "scopes": {
   >             "bot": [
   >                 "channels:history",
   >                 "channels:read",
   >                 "groups:history",
   >                 "groups:read",
   >                 "im:history",
   >                 "im:read",
   >                 "mpim:history",
   >                 "mpim:read",
   >                 "users.profile:read",
   >                 "users:read",
   >                 "users:read.email",
   >                 "files:read",
   >                 "app_mentions:read",
   >                 "reactions:read"
   >             ]
   >         }
   >     },
   >     "settings": {
   >         "event_subscriptions": {
   >             "bot_events": [
   >                 "message.channels",
   >                 "message.groups",
   >                 "message.im",
   >                 "message.mpim",
   >                 "reaction_added",
   >                 "reaction_removed",
   >                 "file_created",
   >                 "file_deleted",
   >                 "file_change"
   >             ]
   >         },
   >         "interactivity": {
   >             "is_enabled": true
   >         },
   >         "org_deploy_enabled": false,
   >         "socket_mode_enabled": true,
   >         "token_rotation_enabled": false
   >     }
   > }
   > ```
2. Create a Slack app through the [Apps page](https://api.slack.com/apps).

   > 1. On the **Your Apps** page, select **Create New App**.
   > 2. Select **From a manifest**.
   > 3. Select the **Workspace** where you’ll be developing your app. You’ll be able to [distribute your app](<https://api.slack.com/distribution>) to other workspaces later if you choose.
   > 4. Copy the updated manifest JSON from step 1.
3. Generate an app-level token. You need to create an app-level token even after using the JSON manifest. Under **Basic Information**, scroll to the **App-level tokens** section and click the button to generate an [app-level token](<https://api.slack.com/concepts/token-types#app>). Include the `connections:write` scope to the token.
4. Install and authorize the app.

   > 1. Return to the **Basic Information** section of the app management page.
   > 2. Install your app by selecting the **Install to Workspace** button.
   > 3. You’ll now be sent through the Slack OAuth flow. Select **Allow** on the following screen.
   >
   > If you want to add your app to a different workspace besides your own, these steps would need to be performed by a user from that workspace.
   > After installation, navigate back to the **OAuth & Permissions** page. You’ll see an **access token** under **OAuth Tokens**.
   > Access tokens represent the permissions delegated to your app by the installing user. Keep it safe and secure. Avoid checking them into public version control. Instead, access them through an environment variable.
5. Adding the App to channels. Your app isn’t a member of any channels yet, so pick a channel to add some test messages in and `/invite` your app. For example, `/invite @Grocery Reminders`.

> **Note:**
>
> Restart the processors to load the new channels. After the App is added to a new channel, the `Consume Slack Conversation` processor in the OpenFlow Runtime needs to be stopped and restarted.

## Setup necessary ingress rules

A Snowflake Admin should follow the [egress guide](../../../../../developer-guide/snowpark-container-services/service-network-communications.md)
to apply egress rules to the endpoint `https://slack.com/api` and
enable WebSocket egress on `wss://wss.slack.com`. This is easiest
done by adding a rule to enable egress on the “slack.com” domain.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Use case 1: Ingest Slack content only

Use the connector definition to:

> * Perform custom analysis on ingested Slack data (no Cortex Search processing).
> * Ingest Slack messages, reactions, file attachments, and member lists into Snowflake, and keep them up to date.

### Set up the connector

As a data engineer, perform the following tasks to configure the connector:

#### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following SQL commands:

   > ```sqlexample
   > CREATE DATABASE DESTINATION_DB;
   > CREATE SCHEMA DESTINATION_DB.DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE DESTINATION_DB TO ROLE <CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > GRANT CREATE TABLE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Right-click on the imported process group and select **Parameters**.
2. Enter the required parameter values as described in **Flow parameters: Ingest content only** below.
3. Right-click on the canvas and select **Enable all controller services**.
4. Right-click on the imported process group and select **Start**. The flow creates all required Snowflake objects and begins ingesting Slack data.

##### Flow parameters: Ingest content only

| Parameter | Description |
| --- | --- |
| App Token | Slack *App-level token* generated in the Slack App. |
| Bot Token | Slack *Bot token* generated in the Slack App. |
| Destination Database | Database to contain all connector objects (created if absent). |
| Destination Schema | Schema inside the database (created if absent). |
| Snowflake Account | Snowflake account identifier. |
| Snowflake Role | Role the flow assumes after authentication. |
| Snowflake User | Username the flow uses to connect. |
| Snowflake Private Key | RSA private key used for authentication (PKCS8 PEM format). Note that either Snowflake Private Key or Snowflake Private Key File must be defined. |
| Snowflake Private Key Password | Password for the encrypted private key (leave blank if unencrypted). |
| Snowflake Private Key File | File containing the RSA Private Key (PKCS8 PEM format). The header line starts with `-----BEGIN PRIVATE`. |
| Snowflake Warehouse | Warehouse used for SQL executed by the flow. |
| Upload Interval | Time to gather data before pushing to Snowflake. A longer interval reduces load on Snowflake but may increase latency and memory usage. |
| Refresh Slack Members | Minutes between Slack membership (ACL) refreshes. |

## Use case 2: Ingest Slack content and enable Cortex

Use the connector definition to:

> * Make Slack data ready for conversational search with Snowflake Cortex.
> * Ensure Slack channel access controls are respected in search results.

### Set up the connector

As a data engineer, perform the following tasks to configure the connector:

#### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following SQL commands:

   > ```sqlexample
   > CREATE DATABASE DESTINATION_DB;
   > CREATE SCHEMA DESTINATION_DB.DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE DESTINATION_DB TO ROLE <CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > GRANT CREATE TABLE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

#### Configure the connector

1. Right-click on the imported process group and select **Parameters**.
2. Enter the required parameter values as described in **Flow parameters: Ingest content and enable Cortex** below.
3. Right-click on the canvas and select **Enable all controller services**.
4. Right-click on the imported process group and select **Start**.
5. Once the flow is running, proceed to Query the Cortex Search service for testing.

##### Flow parameters: Ingest content and enable Cortex

| Parameter | Description |
| --- | --- |
| App Token | Slack *App-level token* generated in the Slack App. |
| Bot Token | Slack *Bot token* generated in the Slack App. |
| Destination Database | Database to contain all connector objects (created if absent). |
| Destination Schema | Schema inside the database (created if absent). |
| Upload Interval | Time to gather data before pushing to Snowflake. A larger value reduces load but increases data latency. |
| Snowflake Account | Snowflake account identifier. |
| Snowflake Role | Role the flow assumes after authentication. |
| Snowflake User | Username the flow uses to connect. |
| Snowflake Private Key | PEM-formatted private key for key-pair authentication. |
| Snowflake Private Key Password | Password for the encrypted private key (blank if unencrypted). |
| Snowflake Warehouse | Warehouse used for all SQL executed by the flow **and** by Cortex. |
| Refresh Slack Members | Minutes between Slack membership (ACL) refreshes. |

## Enabling private-channel ACLs

No extra steps are required beyond **inviting the Slack App** to each private channel. The connector automatically refreshes the member list and stores it in the membership table at each **Refresh Slack Members** interval.

## Query the Cortex Search service

After Use case 2 is running and the Cortex Search service has been created, you can query it as follows:

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<openflow_db>.<openflow_schema>.<<SLACK_CORTEX_SEARCH>',
    '{
      "query": "What is my vacation carry over policy?",
      "columns": ["text","channel","ts","username"],
      "filter": {"@contains": {"memberemails": "alice@example.com"}},
      "limit": 10
    }'
  )
)['results'] AS results;
```

**Common searchable columns**

`text`, `type`, `subtype`, `channel`, `user`, `username`, `connectorId`, `workspaceId`, `ts`, `threadTs`

**Example: Query an AI assistant for human resources (HR) information**

You can use Cortex Search to query an AI assistant for employees to chat about the latest Slack posts. The messages
that are searched can come from informative Slack channels such as general or it-help.

SQLPythonREST API

Run the following in a [SQL worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the Cortex Search service over messages ingested from Slack.

Replace the following:

* `cortex_db`: Name of the database containing the cortex search service, specified by the `Destination Database` parameter.
* `cortex_schema`: Name of the schema containing the cortex search service, specified by the `Destination Schema` parameter.
* `cortex_search_service_name`: Name of the cortex search service, specified by the `Cortex Search Name` parameter.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```sqlexample
SELECT PARSE_JSON(
     SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
          '<cortex_db>.<cortex_schema>.<cortex_search_service_name>',
          '{
             "query": "What is my vacation carry over policy?",
             "columns": ["text", "channel", “ts”,”username”],
             "filter": {"@contains": {"memberemails": "<user_emailID>"} },
             "limit": 1
          }'
     )
 )['results'] AS results
```

Run the following code in a [Python worksheet](../../../../ui-snowsight-worksheets-gs.md) to query the
Cortex Search service over messages ingested from Slack
Ensure that you add the `snowflake.core` package to your database.

Replace the following:

* `cortex_db`: Name of the database containing the cortex search service, specified by the `Destination Database` parameter.
* `cortex_schema`: Name of the schema containing the cortex search service, specified by the `Destination Schema` parameter.
* `cortex_search_service_name`: Name of the cortex search service, specified by the `Cortex Search Name` parameter.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.core import Root

def main(session: snowpark.Session):

   root = Root(session)

   # fetch service
   my_service = (root
     .databases["<cortex_db>"]
     .schemas["<cortex_schema>"]
     .cortex_search_services["<cortex_search_service_name>"]
   )

   # query service
   resp = my_service.search(
     query="What is my vacation carry over policy?",
     columns = ["text", "channel", "ts","username"],
     filter = {"@contains": {"memberemails": "<user_emailID>"} },
     limit=1
     )
   return (resp.to_json())
```

Execute the following code in a command-line interface to query the Cortex Search
service over messages ingested from Slack.
You will need to authentication through key pair authentication and OAuth to access the
Snowflake REST APIs. For more information,
see [REST API](../../../../snowflake-cortex/cortex-search/query-cortex-search-service.md)
and [Authenticating Snowflake REST APIs with Snowflake](../../../../../developer-guide/snowflake-rest-api/authentication.md).

Replace the following:

* `cortex_db`: Name of the database containing the cortex search service, specified by the `Destination Database` parameter.
* `cortex_schema`: Name of the schema containing the cortex search service, specified by the `Destination Schema` parameter.
* `cortex_search_service_name`: Name of the cortex search service, specified by the `Cortex Search Name` parameter.
* `account_url`: Your Snowflake account URL. For instructions on finding your account URL, see [Finding the organization and account name for an account](../../../../admin-account-identifier.md).

```bash
curl --location "https://<account_url>/api/v2/databases/<cortex_db>/schemas/<cortex_schema>/cortex-search-services/<cortex_search_service_name>" \
     --header 'Content-Type: application/json' \
     --header 'Accept: application/json' \
     --header "Authorization: Bearer <CORTEX_SEARCH_JWT>" \
     --data '{
         "query": "What is my vacation carry over policy?",
         "columns": ["text", "channel"],
         "limit": 1
     }'
```

Sample response:

```output
{
  "results" : [ {
  "channel" : "dev notes",
  "text" : "Answer to the question asked."
  } ]
}
```

---
title: Set up the Openflow Connector for Snowflake to Kafka
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/snowflake-to-kafka/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Snowflake to Kafka

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Snowflake to Kafka.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Snowflake to Kafka](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. Create a Snowflake stream that will be queried for the changes.
4. Create a Kafka topic that will receive CDC messages from the Snowflake stream.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create the database, source table, and the stream object that the connector will use for reading CDC events. For example:

   ```sqlexample
   create database stream_db;
   use database stream_db;
   create table stream_source (user_id varchar, data varchar);
   create stream stream_on_table on table stream_source;
   ```
2. Create a new role or use an existing role, and grant the SELECT privilege on the stream
   and the source object for the stream. The connector will also need the USAGE privilege on the database and
   schema containing the stream and source object for the stream. For example:

   ```sqlexample
   create role stream_reader;
   grant usage on database stream_db to role stream_reader;
   grant usage on schema stream_db.public to role stream_reader;
   grant select on stream_source to role stream_reader;
   grant select on stream_on_table to role stream_reader;
   ```
3. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md). For example:

   ```sqlexample
   create user stream_user type = service;
   ```
4. Grant the Snowflake service user the role you created in the previous steps. For example:

   ```sqlexample
   grant role stream_reader to user stream_user;
   ```
5. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 3.
6. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp,
   and store the public and private keys in the secret store. However, note that the private key generated in step 4 can be used
   directly as a configuration parameter for the connector configuration. In such a case, the private key is stored in Openflow runtime configuration.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
7. Designate a warehouse for the connector to use. One connector can replicate single table to a single Kafka Topic.
   For this kind of processing, you can select the smallest warehouse.

## Set up the connector

As a data engineer, perform the following tasks to install and configure a connector:

1. Navigate to the Openflow Overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find and choose the connector depending on what kind of Kafka broker instance the connector should communicate with.

   * mTLS version: Choose this connector if you are using the SSL (mutual TLS) security protocol, or if you are using
     the SASL_SSL protocol and connecting to the broker that is using self-signed certificates.
   * SASL version: Choose this connector if you are using any other security protocol
3. Select Add to runtime.
4. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list.
5. Select Add.
6. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
7. Authenticate to the runtime with your Snowflake account credentials.

   The Openflow canvas appears with the connector process group added to it.
8. Right-click on the imported process group and select Parameters.
9. Populate the required parameter values as described in Flow parameters.

### Flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* Kafka Sink Source Parameters
* Kafka Sink Destination Parameters
* Kafka Sink Ingestion Parameters

#### Kafka Sink Source Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Source Database | Source database. This database should contain the Snowflake Stream object that will be consumed. | Yes |
| Snowflake Private Key Password | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake Role.   You can find your Snowflake Role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Snowflake Private Key | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, provide the RSA private key used for authentication. The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers. Note that either Snowflake Private Key File or Snowflake Private Key must be defined. | Yes |
| Snowflake Private Key File | Leave this blank when using Session Token for your Authentication Strategy. When using KEY_PAIR, upload the file that contains the RSA Private Key used for authentication to Snowflake, formatted according to PKCS8 standards and having standard PEM headers and footers. The header line begins with `-----BEGIN PRIVATE`. Select the Reference asset checkbox to upload the private key file. | No |
| Source Schema | The source schema. This schema should contain Snowflake Stream object that will be consumed. | Yes |
| Snowflake Warehouse | Snowflake warehouse used to run queries | Yes |

#### Kafka Sink Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Kafka Bootstrap Servers | A comma-separated list of Kafka brokers to send data to. | Yes |
| Kafka SASL Mechanism | SASL mechanism used for authentication. Corresponds to the Kafka Client `sasl.mechanism` property. Possible values:   * `PLAIN` * `SCRAM-SHA-256` * `SCRAM-SHA-512` * `AWS_MSK_IAM` | Yes |
| Kafka SASL Username | The username to authenticate to Kafka | Yes |
| Kafka SASL Password | The password to authenticate to Kafka | Yes |
| Kafka Security Protocol | Security protocol used to communicate with brokers. Corresponds to the Kafka Client `security.protocol` property. Possible values:   * `PLAINTEXT` * `SASL_PLAINTEXT` * `SASL_SSL` * `SSL` | Yes |
| Kafka Topic | The Kafka topic, where CDCs from Snowflake Stream will be sent | Yes |
| Kafka Message Key Field | Specify the database column name that will be used as the Kafka message key. If not specified, the message key will not be set. If specified, the value of this column will be used as a message key. The value of this parameter is case-sensitive. | No |
| Kafka Keystore Filename | A full path to a keystore storing a client key and certificate for mTLS authentication method. Required for mTLS authentication and when the security protocol is SSL. | No |
| Kafka Keystore Type | The type of keystore. Required for mTLS authentication. Possible values:   * `PKCS12` * `JKS` * `BCFKS` | No |
| Kafka Keystore Password | The password used to secure keystore file. | No |
| Kafka Key Password | A password for the private key stored in the keystore. Required for mTLS authentication. | No |
| Kafka Truststore Filename | A full path to a truststore storing broker certificates. The client will use the certificate from this truststore to verify broker identity. | No |
| Kafka Truststore Type | The type of truststore file. Possible values:   * `PKCS12` * `JKS` * `BCFKS` | No |
| Kafka Truststore Password | A password for the truststore file. | No |

#### Kafka Sink Ingestion Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Snowflake FQN Stream Name | Fully qualified Snowflake stream name. | Yes |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

---
title: Set up the Openflow Connector for SQL Server
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sql-server/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for SQL Server

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to set up the Openflow Connector for SQL Server.

For information on the incremental load process, see [Incremental replication](incremental-replication.md).

## Prerequisites

Before setting up the connector, ensure that you have completed the following prerequisites:

1. Ensure that you have reviewed [About Openflow Connector for SQL Server](about.md).
2. Ensure that you have reviewed [Supported SQL Server versions](about.md).
3. Ensure that you have set up your runtime deployment. For more information, see the following topics:

   * [Set up Openflow - BYOC](../../setup-openflow-byoc.md)
   * [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
4. If you use Openflow - Snowflake Deployments, ensure that you have reviewed
   [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md) and have granted access to the required domains for the [SQL Server](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Set up your SQL Server instance

Before setting up the connector, perform the following tasks in your SQL Server environment:

> **Note:**
>
> You must perform these tasks as a database administrator.

1. Enable change tracking on the
   [databases](https://learn.microsoft.com/en-us/sql/relational-databases/track-changes/enable-and-disable-change-tracking-sql-server?view=sql-server-ver16#enable-change-tracking-for-a-database)
   and
   [tables](https://learn.microsoft.com/en-us/sql/relational-databases/track-changes/enable-and-disable-change-tracking-sql-server?view=sql-server-ver16#enable-change-tracking-for-a-table)
   that you plan to replicate, as shown in the following SQL Server example:

   ```sqlexample
   ALTER DATABASE <database>
     SET CHANGE_TRACKING = ON
     (CHANGE_RETENTION = 2 DAYS, AUTO_CLEANUP = ON);

   ALTER TABLE <schema>.<table>
     ENABLE CHANGE_TRACKING;
   ```

   > **Note:**
   >
   > Run these commands for every database and table that you plan to replicate.

   The connector requires that change tracking is enabled on the databases and tables before replication
   starts. Ensure that every table that you plan to replicate has enabled change tracking. You
   can also enable change tracking on additional tables while the connector is running.
2. Create a login for the SQL Server instance:

   ```sqlexample
   CREATE LOGIN <user_name> WITH PASSWORD = '<password>';
   ```

   This login is used to create users for the databases you plan to replicate.
3. Create a user for each database you are replicating by running the following
   SQL Server command in each database:

   ```sqlexample
   USE <source_database>;
   CREATE USER <user_name> FOR LOGIN <user_name>;
   ```
4. Grant the SELECT and VIEW CHANGE TRACKING permissions to the user for each database that you are
   replicating:

   ```sqlexample
   GRANT SELECT ON <database>.<schema>.<table> TO <user_name>;
   GRANT VIEW CHANGE TRACKING ON <database>.<schema>.<table> TO <user_name>;
   ```

   Run these commands in each database for every table that you plan to replicate.
   These permissions must be granted to the user of each database that you created in a
   previous step.
5. (Optional) Grant the VIEW DEFINITION privilege on the User Defined Data Types (UDDT).

   If your tables contain columns that use User Defined Data Types (UDDT), and the UDDT is owned by
   a different user than the connector user, you must grant the VIEW DEFINITION permission
   to the connector user as shown in the following SQL Server example:

   ```sqlexample
   GRANT VIEW DEFINITION TO <user_name>;
   ```

   Without this permission, columns using UDDT are silently excluded from replication.
6. (Optional) Configure SSL connection.

   If you use an SSL connection to connect SQL Server, create the root certificate for your database
   server. This is required when configuring the connector.

## Set up your Snowflake environment

As a Snowflake administrator, perform the following tasks:

1. Create a destination database in Snowflake to store the replicated data:

   ```sqlexample
   CREATE DATABASE <destination_database>;
   ```
2. Create a Snowflake [service user](../../../../../sql-reference/sql/create-user.md):

   ```sqlexample
   CREATE USER <openflow_user>
     TYPE = SERVICE
     COMMENT='Service user for automated access of Openflow';
   ```
3. Create a Snowflake role for the connector and grant the required privileges:

   ```sqlexample
   CREATE ROLE <openflow_role>;
   GRANT ROLE <openflow_role> TO USER <openflow_user>;
   GRANT USAGE ON DATABASE <destination_database> TO ROLE <openflow_role>;
   GRANT CREATE SCHEMA ON DATABASE <destination_database> TO ROLE <openflow_role>;
   ```

   Use this role to manage the connector’s access to the Snowflake database.

   To create objects in the destination database, you must grant the
   [USAGE and CREATE SCHEMA privileges](../../../../security-access-control-privileges.md) on the database to the
   role used to manage access.
4. Create a Snowflake warehouse for the connector and grant the required privileges:

   ```sqlexample
   CREATE WAREHOUSE <openflow_warehouse> WITH
     WAREHOUSE_SIZE = 'XSMALL'
     AUTO_SUSPEND = 300
     AUTO_RESUME = TRUE;
   GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO ROLE <openflow_role>;
   ```

   Snowflake recommends starting with a XSMALL warehouse size, then experimenting with size
   depending on the number of tables being replicated and the amount of data transferred. Large
   numbers of tables typically scale better with multi-cluster warehouses, rather than a larger
   warehouse size. For more information, see
   [multi-cluster warehouses](../../../../warehouses-multicluster.md).
5. Set up the public and private keys for key pair authentication:

   1. Create a pair of secure keys (public and private).
   2. Store the private key for the user in a file to supply to the connector’s configuration.
   3. Assign the public key to the Snowflake service user:

      ```sqlexample
      ALTER USER <openflow_user> SET RSA_PUBLIC_KEY = 'thekey';
      ```

      For more information, see [Key-pair authentication and key-pair rotation](../../../../key-pair-auth.md).

## Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, do the following as a data engineer:

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values.

   For more information on the required parameter values, see the following sections:

   * SQLServer Source Parameters: Used to establish a connection with SQL Server.
   * SQLServer Destination Parameters: Used to establish a connection with Snowflake.
   * SQLServer Ingestion Parameters: Used to specify the tables to replicate.

Start by setting the parameters of the SQLServer Source Parameters context, then the SQLServer Destination Parameters context.
After you complete this, enable the connector. The connector connects to both SQLServer and Snowflake and starts running.
However, the connector does not replicate any data until any tables to be replicated are explicitly added to its configuration.

To configure specific tables for replication, edit the SQLServer Ingestion Parameters context. After you apply the changes to the
SQLServer Ingestion Parameters context, the configuration is picked up by the connector, and the replication lifecycle starts for every table.

### SQLServer Source Parameters

| Parameter | Description |
| --- | --- |
| SQLServer Connection URL | The full JDBC URL to the source database.  Example:   * `jdbc:sqlserver://example.com:1433;encrypt=false;` |
| SQLServer JDBC Driver | Select the Reference asset checkbox to upload the [SQL Server JDBC driver](https://learn.microsoft.com/sql/connect/jdbc/download-microsoft-jdbc-driver-for-sql-server). |
| SQLServer Username | The user name for the connector. |
| SQLServer Password | The password for the connector. |

### SQLServer Destination Parameters

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data is persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data is persisted. | Yes |
| Snowflake Connection Strategy | When using KEY_PAIR, specify the strategy for connecting to Snowflake:   * **STANDARD** (default): Connect using standard public routing to Snowflake services. * **PRIVATE_CONNECTIVITY**: Connect using private addresses associated with the supporting cloud platform such as AWS PrivateLink. | Required for BYOC with KEY_PAIR only, otherwise ignored. |
| Snowflake Object Identifier Resolution | Specifies how source object identifiers such as schemas, tables, and columns names are stored and queried in Snowflake. This setting dictates whether you must use double quotes in SQL queries.  Option 1: Default, case-insensitive (recommended).   * **Transformation**: All identifiers are converted to uppercase. For   example, `My_Table` becomes `MY_TABLE`. * **Queries**: SQL queries are case-insensitive and don’t require SQL   double quotes.  For example `SELECT * FROM my_table;` returns the same results as `SELECT * FROM MY_TABLE;`.   **Note:** Snowflake recommends using this option if database objects are not expected to have mixed case names.  **Important:** Do not change this setting after connector ingestion has begun. Changing this setting after ingestion has begun breaks the existing ingestion. If you must change this setting, create a new connector instance.  Option 2: case-sensitive.   * **Transformation**: Case is preserved.   For example, `My_Table` remains `My_Table`. * **Queries**: SQL queries must use double quotes to match the exact   case for database objects.   For example, `SELECT * FROM "My_Table";`.   **Note:** Snowflake recommends using this option if you must preserve source casing for legacy or compatibility reasons. For example, if the source database includes table names that differ in case only, such as `MY_TABLE` and `my_table`, that result in a name collision when using case-insensitive comparisons. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake Private Key File. | No |
| Snowflake Role | When using:   * **Session Token Authentication Strategy**: Use Snowflake Role assigned to the runtime or child role granted to this Snowflake Role.   You can find your runtime Snowflake Role in the Openflow UI, by expanding the More Options [⋮] button for your runtime and selecting Set Snowflake role. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

### SQLServer Ingestion Parameters

| Parameter | Description |
| --- | --- |
| Included Table Names | A comma-separated list of source table paths, including their databases and schemas, for example:  `database_1.public.table_1, database_2.schema_2.table_2` |
| Included Table Regex | A regular expression to match against table paths, including database and schema names. Every path matching the expression is replicated, and new tables matching the pattern that are created later are also included automatically, for example:  `database_name\.public\.auto_.*` |
| Column Filter JSON | Optional. A JSON array of filter objects specifying which columns to include or exclude per table. For syntax details and examples, see Replicate a subset of columns in a table. |
| Merge Task Schedule CRON | CRON expression defining periods when merge operations from Journal to Destination Table will be triggered. Set it to `* * * * * ?` if you want to have continuous merge or time schedule to limit warehouse run time.  For example:  * The string `* 0 * * * ?` indicates that you want to schedule merges at full hour for one minute * The string `* 20 14 ? * MON-FRI` indicates that you want to schedule merges at 2:20 PM every   Monday through Friday.  For additional information and examples, see the cron triggers tutorial in the [Quartz Documentation](https://www.quartz-scheduler.org/documentation/quartz-2.2.2/tutorials/tutorial-lesson-06.html) |

## Replicate tables from a SQL Server replica server

The connector can ingest data from a primary server or from a subscriber server using
[transactional replication](https://learn.microsoft.com/en-us/sql/relational-databases/replication/transactional/transactional-replication).
Before configuring the connector to connect to a SQL Server replica, ensure that replication between the primary and replica
nodes works correctly. For instructions on setting up transactional replication, see
[Tutorial: Configure transactional replication](https://learn.microsoft.com/en-us/sql/relational-databases/replication/tutorial-replicating-data-between-continuously-connected-servers).
When investigating issues with missing data in the connector, first ensure that missing rows and
change tracking events are present in the replica server used by the connector.

> **Note:**
>
> When using a replica server, the connector setup differs from the standard primary server configuration.
> The connection user and change tracking don’t need to be configured on the primary server. Instead, make sure that the
> connection user is available on the replica server and has access to the data and change tracking tables there.

To configure the connector to read from a subscriber server instead of the publisher, specify the subscriber server URL in the
SQLServer Connection URL parameter.

> **Warning:**
>
> Do not change the database server after replication has started. Each database maintains its own change tracking
> state independently, so switching to a different server would cause the connector to lose track of which changes
> have already been processed, and may result in data loss.

## Restart table replication

A table in FAILED state — for example, due to a missing primary key or unsupported schema change — does not restart automatically. If a table enters a FAILED state or you need to restart replication from scratch, use the following procedure to remove and re-add the table to replication.

> **Note:**
>
> If the failure was caused by an issue in the source table such as a missing primary key, resolve that issue in the source database before continuing.

1. Remove the table from flow parameters: In the Ingestion Parameters context, either remove the table from the Included Table Names or modify the Included Table Regex so the table is no longer matched.
2. Verify the table has been removed:

   1. In the Openflow runtime canvas, right-click a processor group and choose Controller Services.
   2. In the table listing controller services, locate the Table State Store row, click the three vertical dots on the right side of the row, then choose View State.
   > **Important:**
   >
   > You must wait until the table’s state is fully removed from this list before proceeding. Do not continue until this configuration change has completed.
3. Clean up the destination: Once the table’s state shows as fully removed, manually [DROP](../../../../../sql-reference/sql/drop-table.md) the destination table in Snowflake. Note that the connector will not overwrite an existing destination table during the snapshot phase; if the table still exists, replication will fail again. Optionally, the journal table and stream can also be removed if they are no longer needed.
4. Re-add the table: Update the Included Table Names or Included Table Regex parameters to include the table again.
5. Verify the restart: Check the Table State Store using the instructions given previously. The state of the table should appear with the status NEW, then transition to SNAPSHOT_REPLICATION, and finally INCREMENTAL_REPLICATION.

## Replicate a subset of columns in a table

The connector can filter the data replicated per table to a subset of configured columns.
Primary key columns are always included regardless of exclusions.

To apply column filters, set the Column Filter JSON parameter in the Ingestion Parameters context
to a JSON array of filter objects, one per table you want to filter.

Columns can be included or excluded by name or by regular expression pattern. You can apply a single condition per table,
or combine multiple conditions, with exclusions always taking precedence over inclusions.

### Syntax

Each object in the array identifies a table and specifies which columns to include or exclude.
Because this connector uses three-part fully qualified names (database, schema, and table), each object
can include a `database` or `databasePattern` field in addition to the schema and table fields.

```javascript
[
    {
        "database": "<database>" | "databasePattern": "<regex>",
        "schema": "<schema>" | "schemaPattern": "<regex>",
        "table": "<table>" | "tablePattern": "<regex>",
        "included": ["<column>", "<column>"],
        "excluded": ["<column>", "<column>"],
        "includedPattern": "<regex>",
        "excludedPattern": "<regex>"
    }
]
```

The following rules apply:

* Use `database`, `schema`, and `table` for exact name matching, or `databasePattern`,
  `schemaPattern`, and `tablePattern` for regex matching. You cannot use both a field and its
  pattern variant in the same object (for example, `schema` and `schemaPattern` cannot both appear).
* At least one of `included`, `excluded`, `includedPattern`, or `excludedPattern` must be provided.
* When both included and excluded filters are specified, exclusions take precedence.
* When multiple filters match the same table, the last matching filter is used, with exact matches
  taking precedence over pattern-based filters.
* The value can be an array of objects to apply different filters to different tables.

### Examples

Include specific columns by name:

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "orders",
        "included": ["account_id", "status", "created_at"]
    }
]
```

Exclude specific columns by name:

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "orders",
        "excluded": ["internal_note", "debug_flag"]
    }
]
```

Combine an include pattern with a specific exclusion (for example, include all email columns except `admin_email`):

```javascript
[
    {
        "database": "my_db",
        "schema": "dbo",
        "table": "contacts",
        "includedPattern": ".*_email",
        "excluded": ["admin_email"]
    }
]
```

Mix a database pattern with an exact schema and table name to apply a filter across databases:

```javascript
[
    {
        "databasePattern": "prod_.*",
        "schema": "dbo",
        "table": "customers",
        "excluded": ["internal_note"]
    }
]
```

Pass multiple filter objects to apply different rules to different tables:

```javascript
[
    {"database": "my_db", "schema": "dbo", "table": "orders", "included": ["account_id", "status"]},
    {"database": "my_db", "schema": "dbo", "table": "customers", "excludedPattern": ".*_internal"}
]
```

## Replicate a partitioned table

The connector supports replication of partitioned tables. A SQL Server
partitioned table is replicated into Snowflake as a single destination table,
containing data from all partitions.

To replicate a partitioned table, ensure that change tracking is enabled on the
partitioned table, as described in Set up your SQL Server instance.

## Track data changes in tables

The connector replicates the current state of data from the source tables,
as well as detected changes from each polling interval. This data is stored in journal tables
created in the same schema as the destination table.

> **Note:**
>
> Because the connector uses SQL Server Change Tracking, multiple updates to the same row between
> polling intervals are rolled up into a single change. Journal tables reflect the net result of
> changes, not every intermediate state. For more information, see [About Openflow Connector for SQL Server](about.md).

The journal table names are formatted as: `<source_table_name>_JOURNAL_<timestamp>_<schema_generation>`
where `<timestamp>` is the value of epoch seconds when the source table was added to replication, and `<schema_generation>` is an integer increasing with every schema change on the source table.
As a result, source tables that undergo schema changes will have multiple journal tables.

When you remove a table from replication, then add it back, the `<timestamp>` value changes, and `<schema_generation>` starts again from `1`.

> **Important:**
>
> Snowflake recommends not altering the structure of journal tables in any way.
> The connector uses them to update the destination table as part of the replication process.

The connector never drops journal tables, but uses the latest
journal for every replicated source table, only reading append-only streams on top of journals.
To reclaim the storage, you can:

* Truncate all journal tables at any time.
* Drop the journal tables related to source tables that were removed from replication.
* Drop all but the latest generation journal tables for actively replicated tables.

For example, if your connector is set to actively replicate source table `orders`,
and you have earlier removed table `customers` from replication, you may have
the following journal tables. In this case you can drop all of them *except* `orders_5678_2`.

```output
customers_1234_1
customers_1234_2
orders_5678_1
orders_5678_2
```

## Configure scheduling of merge tasks

The connector uses a warehouse to merge change data capture (CDC) data into destination tables.
This operation is triggered by the MergeSnowflakeJournalTable processor. If there are no new changes or if no new flow files are waiting in
the MergeSnowflakeJournalTable queue, no merge is triggered and the warehouse auto-suspends.

Use the CRON expression in the Merge task Schedule CRON parameter to limit the warehouse cost and limit merges to only scheduled time.
It throttles the flow files coming to the MergeSnowflakeJournalTable processor
and merges are triggered only in a dedicated period of time.
For more information about scheduling, see [Scheduling strategy](https://nifi.apache.org/docs/nifi-docs/html/user-guide.html#scheduling-strategy).

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

---
title: Set up the Openflow Connector for Workday
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/workday/setup.md
section: Loading & Unloading Data
---

# Set up the Openflow Connector for Workday

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Workday.

## Prerequisites

1. Ensure that you have reviewed [About Openflow Connector for Workday](about.md).
2. Ensure that you have [Set up Openflow - BYOC](../../setup-openflow-byoc.md) or [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md).
3. If using Openflow - Snowflake Deployments, ensure that you’ve reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md)
   and have granted access to the required domains for the [Workday](../../setup-openflow-spcs-sf-allow-list.md) connector.

## Get the credentials

As a Workday administrator, perform the following actions:

1. Create a user in Workday:

   1. Go to Workday and log in as an administrator. In the Workday
      search bar, type Create user.
   2. Click Create Integration System User: Task.
   3. Enter a username and password.
2. Create a security group and add the user from step 1 to it:

   1. In the Workday search bar, type Create Security Group.
   2. Click Create Security Group: Task.
   3. Set the type to Integration System Security Group (Unconstrained).
   4. Enter a Security Group Name and click OK.
   5. In the Edit Integration System Security Group (Unconstrained)
      window, add the integration system user created in Step 1 in the
      Integration System Users field.
3. Add domain security policies to the security group created on step 2:

   1. In the Workday search bar, type View Security Group.
   2. Go to Security Group Settings » Maintain Domain Permissions for Security Group.
   3. In the Integration Permissions section, in the Domain Security
      Policies permitting Get access field, select the security domains
      associated with the reports you want to sync.
   4. Go to the Activate Pending Security Policy Changes page and click
      OK.
4. Create an OAuth client app:

   1. In the Workday search bar, type Register API Client, and click
      Register API Client for Integrations: Task.
   2. Enter a Client Name.
   3. Click Non-Expiring Refresh Token.
   4. In the Scope search bar, type System and select it.
   5. Click OK.
   6. Copy the Client ID and Client Secret, then click Done.
5. In the View Integration System Security Group page, note the
   functional areas under Domain Security Policies. Then, add these as
   Scopes/Functional Areas in the API Client:

   1. In the search bar, type View API Client.
   2. Choose your API client from the list.
   3. In the top blue bar, click the three dots, then select API Client » API Clients for Integrations.
   4. In the Scope (Functional Areas) field, search for and add the
      functional areas that you noted.
6. In the same menu as before (5c), select Manage Refresh Tokens for Integrations.

   1. In the form, search for the ISU user and select it.
   2. Click OK.
   3. Click Generate new token and copy the refresh token details which will be used later.

## Set up Snowflake account

As a Snowflake account administrator, perform the following tasks:

1. Create a new role or use an existing role and grant the [Database privileges](../../../../security-access-control-privileges.md).
2. Create a new Snowflake service user with the type as [SERVICE](../../../../../sql-reference/sql/create-user.md).
3. Grant the Snowflake service user the role you created in the previous steps.
4. Configure with [key-pair auth](../../../../key-pair-auth.md) for the Snowflake SERVICE user from step 2.
5. Snowflake strongly recommends this step. Configure a secrets manager supported by Openflow, for example, AWS, Azure, and Hashicorp, and store the public and private keys in the secret store.

   > **Note:**
   >
   > If for any reason, you do not wish to use a secrets manager, then you are responsible for safeguarding the
   > public key and private key files used for key-pair authentication according to the security policies of your organization.

   1. Once the secrets manager is configured, determine how you will authenticate to it. On AWS, it’s recommended that you the
      EC2 instance role associated with Openflow as this way no other secrets have to be persisted.
   2. In Openflow, configure a Parameter Provider associated with this Secrets Manager, from the hamburger menu in the upper right.
      Navigate to Controller Settings » Parameter Provider and then fetch your parameter values.
   3. At this point all credentials can be referenced with the associated parameter paths and no sensitive values need to be persisted within Openflow.
6. If any other Snowflake users require access to the raw ingested documents and tables ingested by the connector (for example, for custom processing in Snowflake),
   then grant those users the role created in step 1.
7. Designate a warehouse for the connector to use. Start with the smallest warehouse size, then experiment with size depending on the number of tables being replicated,
   and the amount of data transferred. Large table numbers typically scale better with
   [multi-cluster warehouses](../../../../warehouses-multicluster.md), rather than larger warehouse sizes.

## Set up the connector

As a data engineer, perform the following tasks to configure the connector:

### Install the connector

1. Create a database and schema in Snowflake for the connector to store ingested data. Grant required [Database privileges](../../../../security-access-control-privileges.md) to the role created in the first step. Substitute the role placeholder with the actual value and use the following SQL commands:

   > ```sqlexample
   > CREATE DATABASE DESTINATION_DB;
   > CREATE SCHEMA DESTINATION_DB.DESTINATION_SCHEMA;
   > GRANT USAGE ON DATABASE DESTINATION_DB TO ROLE <CONNECTOR_ROLE>;
   > GRANT USAGE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > GRANT CREATE TABLE, CREATE PIPE ON SCHEMA DESTINATION_DB.DESTINATION_SCHEMA TO ROLE <CONNECTOR_ROLE>;
   > ```

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

### Configure the connector

1. Right-click on the imported process group and select Parameters.
2. Populate the required parameter values as described in Flow parameters.

#### Flow parameters

The configuration is divided into three parameter contexts. The *Workday
Destination Parameters* and *Workday Source Parameters*
contexts are responsible for connecting with Snowflake and Workday. The
*Workday Ingestion Parameters* contains all parameters from both
configs and other parameters specific to a given report (e.g., *Report URL*).

Because the *Workday Ingestion Parameters* parameter context
contains report-specific details, new parameter contexts must be created
for each new report and process group. To create a new parameter
context, go to the menu, select Parameter Contexts, and add a new
context. It should inherit from both the *Workday Destination Parameters*
and *Workday Source Parameters* parameter contexts.

**Workday Destination Parameters** **parameter context**

| Parameter | Description | Required |
| --- | --- | --- |
| Destination Database | The database where data will be persisted. It must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase. | Yes |
| Destination Schema | The schema where data will be persisted, which must already exist in Snowflake. The name is case-sensitive. For unquoted identifiers, provide the name in uppercase.  See the following examples:  * `CREATE SCHEMA SCHEMA_NAME` or `CREATE SCHEMA schema_name`: use `SCHEMA_NAME` * `CREATE SCHEMA "schema_name"` or `CREATE SCHEMA "SCHEMA_NAME"`: use `schema_name` or `SCHEMA_NAME`, respectively | Yes |
| Snowflake Authentication Strategy | When using:   * **Snowflake Openflow Deployment** or **BYOC**: Use SNOWFLAKE_MANAGED_TOKEN.   This token is managed automatically by Snowflake.   BYOC deployments must have previously configured   [runtime roles](../../setup-openflow-byoc.md) to use SNOWFLAKE_MANAGED_TOKEN. * **BYOC:** Alternatively BYOC can use KEY_PAIR as the value for authentication strategy. | Yes |
| Snowflake Account Identifier | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Snowflake account name formatted as [organization-name]-[account-name] where data will be persisted. | Yes |
| Snowflake Private Key | When using:   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Must be the RSA private key used for authentication.  The RSA key must be formatted according to PKCS8 standards and have standard PEM headers and footers.   Note that either a Snowflake Private Key File or a Snowflake Private Key must be defined. | No |
| Snowflake Private Key File | When using:   * **Session token authentication strategy**: The private key file must be blank. * **KEY_PAIR**: Upload the file that contains the RSA private key used for authentication to Snowflake,   formatted according to PKCS8 standards and including standard PEM headers and footers.   The header line begins with `-----BEGIN PRIVATE`.   To upload the private key file, select the Reference asset checkbox. | No |
| Snowflake Private Key Password | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the password associated with the Snowflake private key file. | No |
| Snowflake Role | When using   * **Session Token Authentication Strategy**: Use your Snowflake role.   You can find your Snowflake role in the Openflow UI, by navigating to View Details for your Runtime. * **KEY_PAIR** Authentication Strategy: Use a valid role configured for your service user. | Yes |
| Snowflake Username | When using   * **Session Token Authentication Strategy**: Must be blank. * **KEY_PAIR**: Provide the user name used to connect to the Snowflake instance. | Yes |
| Oversized Value Strategy | Determines how the connector handles values that exceed its internal size limits (16 MB) during replication. Possible values are:  * **Fail Table** (default): The table is marked as permanently failed, and replication stops for that table. * **Set Null**: The value is replaced with `NULL` in the destination table.   Use this to prevent table failures when it is acceptable to lose data in tables beyond the oversized value. | No |
| Snowflake Warehouse | Snowflake warehouse used to run queries. | Yes |

**Workday Source Parameters** **parameter context**

| Parameter | Description |
| --- | --- |
| Authorization Type | Choose between *OAUTH* or *BASIC_AUTH*. If *OAUTH* is chosen, then *OAuth Client ID, OAuth Client Secret, OAuth Refresh Token* and *OAuth Token Endpoint* must be defined. If *BASIC_AUTH* is chosen, then *Workday Username* and *Workday Password* must be defined. |
| OAuth Client ID | The client ID of an application registered in Workday. |
| OAuth Client Secret | The client secret related to the Client ID. |
| OAuth Refresh Token | The refresh token is obtained by a user during the app registration process. It is used together with the client ID and the client secret to get an access token. |
| OAuth Token Endpoint | The token endpoint is obtained by a user during the app registration process. |
| Workday Username | The username is used to log into a Workday account. Must be set only when *BASIC_AUTH* is chosen. |
| Workday Password | The password is associated with the Workday username. Must be set only when *BASIC_AUTH* is chosen. |

**Workday Ingestion Parameters** **parameter context**

| Parameter | Description |
| --- | --- |
| Destination Table | The destination table where report data pulled from Workday is stored. It is created by the connector if it does not exist. |
| Report URL | A RaaS API URL to a report created in Workday. |
| Run Schedule | Run schedule on which data is retrieved from Workday and saved in Snowflake. This value is a time duration specified by a number followed by a time unit. For example, 1 second or 5 mins. |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

---
title: SetCacheClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/setcacheclientservice.md
section: Loading & Unloading Data
---

# SetCacheClientService

## Description

Provides the ability to communicate with a SetCacheServer. This can be used in order to share a Set between nodes in a NiFi cluster

## Tags

cache, cluster, distributed, set, state

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Communications Timeout \* | Communications Timeout | 30 secs |  | Specifies how long to wait when communicating with the remote server before determining that there is a communications failure if data cannot be sent or received |
| SSL Context Service | SSL Context Service |  |  | If specified, indicates the SSL Context Service that is used to communicate with the remote server. If not specified, communications will not be encrypted |
| Server Hostname \* | Server Hostname |  |  | The name of the server that is running the DistributedSetCacheServer service |
| Server Port \* | Server Port | 4557 |  | The port on the remote server that is to be used when communicating with the DistributedSetCacheServer service |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SetCacheServer
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/setcacheserver.md
section: Loading & Unloading Data
---

# SetCacheServer

## Description

Provides a set (collection of unique values) cache that can be accessed over a socket. Interaction with this service is typically accomplished via a DistributedSetCacheClient service.

## Tags

cache, distinct, distributed, server, set

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Eviction Strategy \* | Eviction Strategy | Least Frequently Used | * Least Frequently Used * Least Recently Used * First In, First Out | Determines which strategy should be used to evict values from the cache to make room for new entries |
| Maximum Cache Entries \* | Maximum Cache Entries | 10000 |  | The maximum number of cache entries that the cache can hold |
| Persistence Directory | Persistence Directory |  |  | If specified, the cache will be persisted in the given directory; if not specified, the cache will be in-memory only |
| Port \* | Port | 4557 |  | The port to listen on for incoming connections |
| SSL Context Service | SSL Context Service |  |  | If specified, this service will be used to create an SSL Context that will be used to secure communications; if not specified, communications will not be secure |
| Maximum Read Size | maximum-read-size | 1 MB |  | The maximum number of network bytes to read for a single cache item |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Setting Up the Openflow Connector for Google BigQuery
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-big-query/setup.md
section: Loading & Unloading Data
---

# Setting Up the Openflow Connector for Google BigQuery

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes the steps to set up the Openflow Connector for Google BigQuery.

## Prerequisites

1. Review [About the Openflow Connector for Google BigQuery](about.md).
2. Set up your runtime deployment.

   * [Set up Openflow - BYOC](../../setup-openflow-byoc.md)
   * [Set up Openflow - Snowflake Deployments](../../setup-openflow-spcs.md)
3. If you are using Openflow - Snowflake Deployments, ensure that you have reviewed [configuring required domains](../../setup-openflow-spcs-sf-allow-list.md) and have granted access to the domains required by the connector.
4. You have access to the Openflow admin role or similar role you use to manage Openflow.
5. If you are creating a Snowflake service user to manage the connector, you have created a key pair authentication. For more information, see [key-pair authentication](../../../../key-pair-auth.md).

## Required endpoints

The following endpoints are required for the connector to function:

* `bigquery.googleapis.com:443`
* `bigquerystorage.googleapis.com:443`
* `oauth2.googleapis.com:443`

If you are using Openflow - BYOC, you need to configure your cloud network egress to allow TLS 443 access to the endpoints listed above.
If you are using Openflow - Snowflake Deployments, you need to create a network rule and an external access integration (EAI). Then, grant the Snowflake Role usage privileges on the EAI.

## Set up BigQuery

1. Create a Google Cloud Service account and grant it the necessary permissions to read BigQuery data. The connector uses this account for authentication.

   This account must have the following permissions:

   * [BigQuery User](https://docs.cloud.google.com/bigquery/docs/access-control#bigquery.user)
   * [BigQuery Data Editor](https://docs.cloud.google.com/bigquery/docs/access-control#bigquery.dataEditor)

> > **Important:**
> >
> > `BigQuery Data Editor` must be granted at the **project level**, not at individual datasets.
> > The connector queries `{project}.{region}.INFORMATION_SCHEMA.TABLES` to discover tables
> > across all configured regions - a region-scoped view that requires project-level access. The
> > connector also queries `{project}.{dataset}.INFORMATION_SCHEMA.KEY_COLUMN_USAGE` to
> > determine primary keys for each replicated table. Without project-level access, the query
> > fails with a `Access Denied` error and the connector does not run correctly.

1. Generate and download the corresponding JSON key file for the service account. You will need the full contents of this file for the connector’s configuration.
2. Enable change history on each source table to allow the connector to perform incremental replication. This feature allows BigQuery to track row-level changes (inserts, updates, and deletes), which the connector uses to sync data efficiently.

   Run the following query in the BigQuery console for each table:

   ```sql
   ALTER TABLE `project.dataset.table`
   SET OPTIONS (enable_change_history = TRUE);
   ```

## Set up your Snowflake account

As an Openflow administrator, perform the following tasks to set up your Snowflake account:

1. Create a Snowflake service user:

   ```sqlexample
   USE ROLE USERADMIN;
   CREATE USER <openflow_service_user>
     TYPE=SERVICE
     COMMENT='Service user for Openflow automation';
   ```
2. Store the private key for that user in a file to supply to the connector’s configuration. For more information, see [key-pair authentication](../../../../key-pair-auth.md).

   ```sqlexample
   ALTER USER <openflow_service_user> SET RSA_PUBLIC_KEY = '<pubkey>';
   ```
3. Create a database that stores the replicated data, and set up permissions for the
   Snowflake user to create objects in that database by granting USAGE and CREATE SCHEMA privileges.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE DATABASE IF NOT EXISTS <destination_database>;
   GRANT USAGE ON DATABASE <destination_database> TO USER <openflow_service_user>;
   GRANT CREATE SCHEMA ON DATABASE <destination_database> TO USER <openflow_service_user>;
   ```
4. Create a new warehouse or use an existing warehouse for the connector.

   To create a new warehouse:

   ```sqlexample
   CREATE WAREHOUSE <openflow_warehouse>
   WITH
      WAREHOUSE_SIZE = 'MEDIUM'
      AUTO_SUSPEND = 300
      AUTO_RESUME = TRUE;
   GRANT USAGE, OPERATE ON WAREHOUSE <openflow_warehouse> TO USER <openflow_service_user>;
   ```

   Start with the MEDIUM warehouse size, then experiment with size depending on the amount of tables being replicated, and the amount of data transferred.

   To determine if you should increase, monitor the connector and database while data replication is in progress. If you observe significant delays during incremental replication, experiment with a larger warehouse size. However large table numbers typically scale better using [multi-cluster warehouses](../../../../warehouses-multicluster.md) instead of increasing the warehouse size.
5. Create an external access integration to enable network access outside of Snowflake.

   > **Caution:**
   >
   > If your runtime executes in Openflow - BYOC, you do not need to create an External Access Integration (EAI). Instead, configure your cloud network egress to allow TLS 443 access to the endpoints listed below.
   >
   > Required host:port endpoints are listed in Required endpoints.

   To allow the connector to call the required Google APIs from a Snowflake-hosted runtime, you must create a network rule and an external access integration (EAI). Then, grant the Snowflake role usage privileges on the EAI.

   To create the external access integration and network rule and grant access, perform the following steps:

   1. Create a network rule to allow the connector to access the required Google APIs:

      ```sqlexample
      USE ROLE ACCOUNTADMIN;
      USE DATABASE <openflow_network_db>;

      CREATE OR REPLACE NETWORK RULE openflow_<runtime_name>_network_rule
        TYPE = HOST_PORT
        MODE = EGRESS
        VALUE_LIST = (
          'bigquery.googleapis.com:443',
          'bigquerystorage.googleapis.com:443',
          'oauth2.googleapis.com:443'
        );
      ```
   2. Create an External Access Integration that references the network rule:

      ```sqlexample
      CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION openflow_<runtime_name>_eai
        ALLOWED_NETWORK_RULES = (openflow_<runtime_name>_network_rule)
        ENABLED = TRUE;
      ```
   3. Grant your Snowflake Role USAGE on the integration:

      ```sqlexample
      GRANT USAGE ON INTEGRATION openflow_<runtime_name>_eai
        TO ROLE openflow_runtime_role_<runtime_name>;
      ```

## Install the connector

To install the connector, do the following as a data engineer:

1. Navigate to the Openflow overview page. In the Featured connectors section, select View more connectors.
2. On the Openflow connectors page, find the connector and select Add to runtime.
3. In the Select runtime dialog, select your runtime from the Available runtimes drop-down list and click Add.

   > **Note:**
   >
   > Before you install the connector, ensure that you have created a database and schema in Snowflake for the connector to store ingested data.
4. Authenticate to the deployment with your Snowflake account credentials and select Allow when prompted to allow the runtime application to access your Snowflake account. The connector installation process takes a few minutes to complete.
5. Authenticate to the runtime with your Snowflake account credentials.

The Openflow canvas appears with the connector process group added to it.

## Configure the connector

To configure the connector, perform the following steps:

1. Right-click on the added runtime and select Parameters.
2. Populate the required parameter values as described in Specify flow parameters.

### Specify flow parameters

This section describes the flow parameters that you can configure based on the following parameter contexts:

* BigQuery Source Parameters: Used to define the configuration for reading data from BigQuery.
* BigQuery Destination Parameters: Used to establish connection with Snowflake.
* BigQuery Ingestion Parameters: Used to specify the tables and views to replicate.

#### BigQuery Source Parameters

| Parameter | Description |
| --- | --- |
| BigQuery Project Name | The unique identifier of the Google Cloud Project that contains BigQuery datasets and tables.  Where to find: open BigQuery Studio (Google Cloud Console > BigQuery) and in the left Explorer pane hover over your project to see the Project ID.  **Example:** `example-team-gcp` |
| GCP Service Account JSON | The entire content of the JSON key file for the Google Cloud Platform Service Account used for authentication. Ensure the service account has the necessary IAM permissions to perform BigQuery operations, such as the BigQuery Job User and BigQuery Data Viewer roles.  Where to get it: Google Cloud Console > IAM & Admin > Service Accounts > select the service account > Keys tab > Add key > Create new key > JSON. This downloads a .json file—open it and paste the entire file content (including braces) into this field. |

#### BigQuery Destination Parameters

| Parameter | Description |
| --- | --- |
| Snowflake Authentication Strategy | When using SPCS, use SNOWFLAKE_SESSION_TOKEN as the value for Authentication Strategy. When using BYOC, use KEY_PAIR as the value for Authentication Strategy.  **Example:** `KEY_PAIR` |
| Snowflake Account Identifier | When using:   * Session Token Authentication Strategy: Must be blank. * KEY_PAIR: Snowflake account name where data will be persisted. |
| Destination Database | The name of the destination database to replicate into. Mixed case is supported. |
| Snowflake Private Key File | When using:   * Session token authentication strategy: The private key file must be blank. * KEY_PAIR: Upload the file that contains the RSA private key used for authentication to Snowflake, formatted according to PKCS8 standards and including standard PEM headers and footers. The header line begins with `-----BEGIN PRIVATE`. To upload the private key file, select the Reference asset checkbox. |
| Snowflake Private Key Password | When using:   * Session Token Authentication Strategy: Must be blank. * KEY_PAIR: Provide the password associated with the Snowflake Private Key File. |
| Snowflake Role | When using:   * Session Token Authentication Strategy: Use your Snowflake Role. You can find your Snowflake Role in the Openflow UI, by navigating to View Details for your Runtime. * KEY_PAIR Authentication Strategy: Use a valid role configured for your service user. |
| Snowflake Username | When using:   * Session Token Authentication Strategy: Must be blank. * KEY_PAIR: Provide the user name used to connect to the Snowflake instance. |
| Snowflake Warehouse | The name of the warehouse to use by the connector. |

#### BigQuery Ingestion Parameters

| Parameter | Description |
| --- | --- |
| BigQuery Regions | Specifies a comma-separated list of the locations to query for BigQuery datasets. You can combine both regional and multi-regional locations in the same list.  **Example:** `us,eu,us-west1` |
| Included Dataset Names | Comma-separated list of datasets to replicate (queried across all selected regions).  **Example:** `sales_data,marketing_leads` |
| Included Dataset Names Regex | Regular expression for specifying dataset names to replicate (queried across all selected regions). Combined with the Included Dataset Names to include any matching dataset. Note: REGEXP expression should match Google’s RE2 syntax.  **Example:** `^sales_.*` |
| Included Table Names | Comma-separated list of tables to replicate across datasets.  **Example:** `transactions,customers` |
| Included Table Names Regex | Regular expression for specifying table names to replicate across datasets. Combined with the Included Table Names to include any matching table. Note: REGEXP expression should match Google’s RE2 syntax.  **Example:** `^revenue_.*` |
| Included View Names | Comma-separated list of views to replicate across datasets.  **Example:** `customer_summary,revenue_report` |
| Included View Names Regex | Regular expression for specifying view names to replicate across datasets. Combined with the Included View Names to include any matching view. Note: REGEXP expression should match Google’s RE2 syntax.  **Example:** `^report_.*` |
| Incremental Sync Frequency | How often the connector runs incremental synchronization for each table. Runs do not overlap if a cycle takes longer than the configured interval, the next run waits for the prior one to finish. Because BigQuery limits max size of window to 24h, schedule must be more frequent than this value.  **Example:** `10m` |
| View Sync Frequency | How often the connector runs synchronization for each view. Runs do not overlap, if a cycle takes longer than the configured interval, the next run waits for the prior one to finish. View ingestion does not support CDC, only truncate and load.  **Example:** `1h` |
| Temporary Table Dataset | Dataset in which necessary temporary tables are created, such as CDC journal tables or temporary tables for view ingestion. Snowflake recommends having a separate dataset for temporary tables and not using the ingested dataset for this purpose.  **Example:** `openflow_temp` |

## Run the flow

1. Right-click on the plane and select Enable all Controller Services.
2. Right-click on the imported process group and select Start. The connector starts the data ingestion.

## Next steps

* For information on tasks you can perform after installing the connector, see
  [Use the connector](use.md)
* For information on monitoring the flow, see
  [Monitor the flow](../../monitor.md)

---
title: Setup tasks for SAP® Snowflake and SAP® BDC Connect for Snowflake
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sap-sql/setup-tasks.md
section: Loading & Unloading Data
---

# Setup tasks for SAP® Snowflake and SAP® BDC Connect for Snowflake

This topic describes the overall tasks required to set up, configure, and run
either SAP® Snowflake or SAP® BDC Connect for Snowflake.

## Prerequisites

1. Ensure that you have reviewed [About Snowflake and SAP® Zero-Copy Integration](../about-sap-snowflake.md).

## Tasks

Perform the following tasks to set up, configure, and run the Openflow Connector for Oracle.

| Order | Task | Description | Persona |
| --- | --- | --- | --- |
| 1 | Review [SAP® and Snowflake - Setup](setup-sap.md) | Setup for either SAP® Snowflake or SAP® BDC Connect for Snowflake. | SAP® administrator |
| 2 | Review [Share Data Products from SAP® Business Data Cloud to Snowflake](share-data-products.md) | Share data products and configure the catalog integration. | SAP® administrator and Snowflake account administrator |
| 3 | [Explore Data from SAP® Business Data Cloud](explore-data.md) | Explore the data that has been shared with Snowflake. | Snowflake account administrator and data engineer |

---
title: Share Data Products from SAP® Business Data Cloud to Snowflake
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/sap-sql/share-data-products.md
section: Loading & Unloading Data
---

# Share Data Products from SAP® Business Data Cloud to Snowflake

The integration between SAP® and Snowflake relies on the
catalog integration
capability in Snowflake for zero-copy data sharing of Data Products
from SAP® Snowflake and SAP® Business Data Cloud Connect for Snowflake.

The steps to share Data Products from SAP® BDC to SAP® Snowflake accounts and
existing Snowflake accounts that use the SAP® BDC Connect for Snowflake are largely the same.

This topic describes the steps to create a catalog integration and share data products/

If you are using SAP® Snowflake, review the following section, as a reference.
if you are using SAP® BDC Connect for Snowflake, review and complete the steps in SAP® Business Data Cloud Connect for Snowflake.

In this section you will:

1. Review SAP® Snowflake SAP® Snowflake, or
   configure a catalog integration for SAP® BDC Connect for Snowflake.
2. In SAP® BDC, Choose Data Products to share with Snowflake to share data products with Snowflake.
3. If you are using SAP® Snowflake, Create a Catalog Linked Database for shared Data Products
   to create a catalog linked database for shared Data Products.

## SAP® Snowflake

As part of the provisioning process for a new SAP® Snowflake account, a catalog integration named `SAP_BDC_INTEGRATION` is automatically created in the SAP® Snowflake account and enrolled with SAP® Business Data Cloud. You can use this catalog integration to share data from SAP® Business Data Cloud or optionally create an additional catalog integration as described in the following section.

## SAP® Business Data Cloud Connect for Snowflake

> **Note:**
>
> Before you can create a catalog integration with `SAP_BDC` as the `CATALOG_SOURCE`, you will need to accept
> the SAP® BDC Connect for Snowflake Terms as an `ORGADMIN`.
> Creating a catalog integration will fail with an error if these terms are not accepted.
> An `ORGADMIN` needs to only do this once for the Snowflake organization.
>
> To accept the SAP® BDC Connect for Snowflake Terms in Snowsight:
>
> 1. Sign in to Snowflake as a user with the `ORGADMIN` role.
> 2. Sign in to [Snowsight](../../../../ui-snowsight-gs.md) as a user with the `ORGADMIN` role.
> 3. In the navigation menu, select Admin » Terms.
> 4. In the Snowflake Marketplace section, next to **SAP® BDC Connect for Snowflake Terms**, select Review.
> 5. select Acknowledge & Continue.

For existing Snowflake accounts that integrate with SAP® Business Data Cloud Connect for Snowflake, users need to first create and enroll a catalog integration prior to sharing data from SAP® Business Data Cloud to Snowflake.

To create and review the catalog integration, run the following command:

1. Create a Catalog Integration and enroll with SAP Business Data Cloud

> ```sqlexample
> CREATE OR REPLACE CATALOG INTEGRATION MY_SAP_BDC_CATALOG_INT
>    CATALOG_SOURCE = SAP_BDC
>    TABLE_FORMAT = DELTA
>     REST_CONFIG = (
>       SAP_BDC_INVITATION_LINK = '<Invitation Link from SAP BDC>'
>       ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
>     )
>     REFRESH_INTERVAL_SECONDS = 900
>     ENABLED = TRUE
>     COMMENT = 'My SAP BDC catalog integration';
> ```

1. Verify the catalog integration was created successfully>

   ```sqlexample
   SHOW CATALOG INTEGRATIONS;
   ```

> Which should produce results similar to:
>
> ```output
> MY_SAP_BDC_CATALOG_INT     CATALOG CATALOG true    2025-12-10 18:27:45.181 -0800
> ```

## In SAP® BDC, Choose Data Products to share with Snowflake

In order to search for and share data products with Snowflake, the user must use the central SAP Business Data Cloud catalog and have a global role that grants them the following privileges:

* BDC Data Packages (read) - To access SAP Business Data Cloud.
* Catalog Asset (read) - To access the catalog and view objects in the Assets and Data Products collections.
* Cloud Data Product (share) - To share data products to target systems.

Users with these privileges can share data products from the SAP Business Data Cloud catalog with the desired SAP Snowflake account to make them available for consumption to specific roles in that account.

To share data products with Snowflake:

1. In the central SAP Business Data Cloud catalog, select data products to share with an SAP Snowflake account
2. From Catalog & Marketplace, search for (or use filters) to find the data products to be shared
3. From the search results, click the Share button in the data product to be shared (for example customer)
   to open the Manage Share Access dialog
4. In the Overview section, learn more about the data product by reviewing its details and available objects.
5. Under Target System:

   1. Choose the Snowflake account with the enrolled catalog integration to share with (if there is more than one).
   2. Click the Update button

A message appears letting you know that the share process has started. After the process finishes, a notification appears letting you know the result.

## Create a Catalog Linked Database for shared Data Products

If you are using SAP® Snowflake, you can create a catalog linked database for shared Data Products.

1. List shares available from SAP® Business Data Cloud for the enrolled catalog integration:

   ```sqlexample
   SELECT SYSTEM$SAP_BDC_LIST_SHARES('MY_SAP_BDC_CATALOG_INT');
   ```

> Which should produce results similar to:
>
> ```output
> ["usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:cashflow:v:1",
>  "usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:customer:v:1",
>  "usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:entryviewjournalentry:v:1"]
> ```

Each element represents a shared Data Product. The highlighted text is an example of the name of the Data Product shared from SAP® Business Data Cloud to Snowflake with the enrolled catalog integration `MY_SAP_BDC_CATALOG_INT`.

1. Create a catalog linked database for the shared data products:

   ```sqlexample
   CREATE OR REPLACE DATABASE CUSTOMER
      LINKED_CATALOG = (
        CATALOG = MY_SAP_BDC_CATALOG_INT,
        CATALOG_NAME = 'shares/usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:customer:v:1',
        ALLOWED_WRITE_OPERATIONS = NONE,
        SYNC_INTERVAL_SECONDS = 86400
      );
   ```

   Which should produce results similar to:

   ```output
   Database CUSTOMER successfully created.
   ```
2. Confirm link status

> ```sqlexample
> SELECT SYSTEM$CATALOG_LINK_STATUS('CUSTOMER');
> ```
>
> Which should produce results similar to:
>
> ```output
> {"failureDetails":[],"executionState":"RUNNING","lastLinkAttemptStartTime":"2025-12-17T21:13:29.611Z"}
> ```

In this example, we only created a single catalog linked database `CUSTOMER`.
You can create additional catalog linked databases depending on Data Products shared
with the enrolled catalog integration in the Snowflake account.

## Next steps

After sharing data products, you can [Explore Data from SAP® Business Data Cloud](explore-data.md) the data that has been shared with Snowflake.

---
title: SignContentPGP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/signcontentpgp.md
section: Loading & Unloading Data
---

# SignContentPGP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-pgp-nar

## Description

Sign content using OpenPGP Private Keys

## Tags

Encryption, GPG, OpenPGP, PGP, RFC 4880, Signing

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| file-encoding | File Encoding for signing |
| hash-algorithm | Hash Algorithm for signing |
| private-key-id | PGP Private Key Identifier formatted as uppercase hexadecimal string of 16 characters used for signing |
| private-key-service | PGP Private Key Service for generating content signatures |
| signing-strategy | Strategy for writing files to success after signing |

## Relationships

| Name | Description |
| --- | --- |
| failure | Content signing failed |
| success | Content signing succeeded |

## Writes attributes

| Name | Description |
| --- | --- |
| pgp.compression.algorithm | Compression Algorithm |
| pgp.compression.algorithm.id | Compression Algorithm Identifier |
| pgp.file.encoding | File Encoding |
| pgp.signature.algorithm | Signature Algorithm including key and hash algorithm names |
| pgp.signature.hash.algorithm.id | Signature Hash Algorithm Identifier |
| pgp.signature.key.algorithm.id | Signature Key Algorithm Identifier |
| pgp.signature.key.id | Signature Public Key Identifier |
| pgp.signature.type.id | Signature Type Identifier |
| pgp.signature.version | Signature Version Number |

## See also

* [org.apache.nifi.processors.pgp.DecryptContentPGP](decryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.EncryptContentPGP](encryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.VerifyContentPGP](verifycontentpgp.md)

---
title: SimpleCsvFileLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/simplecsvfilelookupservice.md
section: Loading & Unloading Data
---

# SimpleCsvFileLookupService

## Description

A reloadable CSV file-based lookup service. The first line of the csv file is considered as header.

## Tags

cache, csv, enrich, join, key, lookup, reloadable, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| CSV Format \* | CSV Format | default | * Custom Format * RFC 4180 * Microsoft Excel * Tab-Delimited * MySQL Format * Informix Unload * Informix Unload Escape Disabled * Default Format * RFC4180 | Specifies which “format” the CSV data is in, or specifies if custom formatting should be used. |
| Character Set \* | Character Set | UTF-8 |  | The Character Encoding that is used to decode the CSV file. |
| Comment Marker | Comment Marker |  |  | The character that is used to denote the start of a comment. Any line that begins with this comment will be ignored. |
| Escape Character \* | Escape Character |  |  | The character that is used to escape characters that would otherwise have a specific meaning to the CSV Parser. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Escape Character at runtime, then it will be skipped and the default Escape Character will be used. Setting it to an empty string means no escape character should be used. |
| Quote Character \* | Quote Character | “ |  | The character that is used to quote values so that escape characters do not have to be used. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Quote Character at runtime, then it will be skipped and the default Quote Character will be used. |
| Quote Mode \* | Quote Mode | MINIMAL | * Quote All Values * Quote Minimal * Quote Non-Numeric Values * Do Not Quote Values | Specifies how fields should be quoted when they are written |
| Trim Fields \* | Trim Fields | true | * true * false | Whether or not white space should be removed from the beginning and end of fields |
| Value Separator \* | Value Separator | , |  | The character that is used to separate values/fields in a CSV Record. If the property has been specified via Expression Language but the expression gets evaluated to an invalid Value Separator at runtime, then it will be skipped and the default Value Separator will be used. |
| CSV File \* | csv-file |  |  | Path to a CSV File in which the key value pairs can be looked up. |
| Ignore Duplicates \* | ignore-duplicates | true | * true * false | Ignore duplicate keys for records in the CSV file. |
| Lookup Key Column \* | lookup-key-column |  |  | The field in the CSV file that will serve as the lookup key. This is the field that will be matched against the property specified in the lookup processor. |
| Lookup Value Column \* | lookup-value-column |  |  | Lookup value column. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SimpleDatabaseLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/simpledatabaselookupservice.md
section: Loading & Unloading Data
---

# SimpleDatabaseLookupService

## Description

A relational-database-based lookup service. When the lookup key is found in the database, the specified lookup value column is returned. Only one value will be returned for each lookup, duplicate database entries are ignored.

## Tags

cache, database, enrich, join, key, lookup, rdbms, reloadable, value

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Cache Expiration | Cache Expiration |  |  | Time interval to clear all cache entries. If the Cache Size is zero then this property is ignored. |
| Cache Size \* | dbrecord-lookup-cache-size | 0 |  | Specifies how many lookup values/records should be cached. The cache is shared for all tables and keeps a map of lookup values to records. Setting this property to zero means no caching will be done and the table will be queried for each lookup value in each record. If the lookup table changes often or the most recent data must be retrieved, do not use the cache. |
| Clear Cache on Enabled \* | dbrecord-lookup-clear-cache-on-enabled | true | * true * false | Whether to clear the cache when this service is enabled. If the Cache Size is zero then this property is ignored. Clearing the cache when the service is enabled ensures that the service will first go to the database to get the most recent data. |
| Database Connection Pooling Service \* | dbrecord-lookup-dbcp-service |  |  | The Controller Service that is used to obtain connection to database |
| Lookup Key Column \* | dbrecord-lookup-key-column |  |  | The column in the table that will serve as the lookup key. This is the column that will be matched against the property specified in the lookup processor. Note that this may be case-sensitive depending on the database. |
| Table Name \* | dbrecord-lookup-table-name |  |  | The name of the database table to be queried. Note that this may be case-sensitive depending on the database. |
| Lookup Value Column \* | lookup-value-column |  |  | The column whose value will be returned when the Lookup value is matched |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SimpleKeyValueLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/simplekeyvaluelookupservice.md
section: Loading & Unloading Data
---

# SimpleKeyValueLookupService

## Description

Allows users to add key/value pairs as User-defined Properties. Each property that is added can be looked up by Property Name. The coordinates that are passed to the lookup must contain the key ‘key’.

## Tags

enrich, key, lookup, value

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SimpleRedisDistributedMapCacheClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/simpleredisdistributedmapcacheclientservice.md
section: Loading & Unloading Data
---

# SimpleRedisDistributedMapCacheClientService

## Description

An implementation of DistributedMapCacheClient that uses Redis as the backing cache. This service is intended to be used when a non-atomic DistributedMapCacheClient is required.

## Tags

cache, distributed, map, redis

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| TTL \* | redis-cache-ttl | 0 secs |  | Indicates how long the data should exist in Redis. Setting ‘0 secs’ would mean the data would exist forever |
| Redis Connection Pool \* | redis-connection-pool |  |  |  |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SimpleScriptedLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/simplescriptedlookupservice.md
section: Loading & Unloading Data
---

# SimpleScriptedLookupService

## Description

Allows the user to provide a scripted LookupService instance in order to enrich records from an incoming flow file. The script is expected to return an optional string value rather than an arbitrary object (record, e.g.). Also the scripted lookup service should implement StringLookupService, otherwise the getValueType() method must be implemented even though it will be ignored, as SimpleScriptedLookupService returns String as the value type on the script’s behalf.

## Tags

groovy, invoke, lookup, script

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Module Directory | Module Directory |  |  | Comma-separated list of paths to files and/or directories which contain modules required by the script. |
| Script Body | Script Body |  |  | Body of script to execute. Only one of Script File or Script Body may be used |
| Script Engine \* | Script Engine | Groovy | * Groovy | Language Engine for executing scripts |
| Script File | Script File |  |  | Path to script file to execute. Only one of Script File or Script Body may be used |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| execute code | Provides operator the ability to execute arbitrary code assuming all permissions that NiFi has. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SlackRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/slackrecordsink.md
section: Loading & Unloading Data
---

# SlackRecordSink

## Description

Format and send Records to a configured Channel using the Slack Post Message API. The service requires a Slack App with a Bot User configured for access to a Slack workspace. The Bot User OAuth Bearer Token is required for posting messages to Slack.

## Tags

record, sink, slack

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Access Token \* | access-token |  |  | Bot OAuth Token used for authenticating and authorizing the Slack request sent by NiFi. |
| API URL \* | api-url | <https://slack.com/api> |  | Slack Web API URL for posting text messages to channels. It only needs to be changed if Slack changes its API URL. |
| Channel ID \* | channel-id |  |  | Slack channel, private group, or IM channel to send the message to. Use Channel ID instead of the name. |
| Input Character Set \* | input-character-set | UTF-8 |  | Specifies the character set of the records used to generate the Slack message. |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |
| Web Service Client Provider \* | web-service-client-provider |  |  | Controller service to provide HTTP client for communicating with Slack API |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SmbjClientProviderService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/smbjclientproviderservice.md
section: Loading & Unloading Data
---

# SmbjClientProviderService

## Description

Provides access to SMB Sessions with shared authentication credentials.

## Tags

samba, smb, cifs, files

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Domain | domain |  |  | The domain used for authentication. Optional, in most cases username and password is sufficient. |
| Enable DFS \* | enable-dfs | false | * true * false | Enables accessing Distributed File System (DFS) and following DFS links during SMB operations. |
| Hostname \* | hostname |  |  | The network host of the SMB file server. |
| Password | password |  |  | The password used for authentication. |
| Port \* | port | 445 |  | Port to use for connection. |
| Share \* | share |  |  | The network share to which files should be listed from. This is the “first folder”after the hostname: <smb://hostname:port/[share]/dir1/dir2> |
| SMB Dialect \* | smb-dialect | AUTO | * AUTO * SMB 2.0.2 * SMB 2.1 * SMB 3.0 * SMB 3.0.2 * SMB 3.1.1 | The SMB dialect is negotiated between the client and the server by default to the highest common version supported by both end. In some rare cases, the client-server communication may fail with the automatically negotiated dialect. This property can be used to set the dialect explicitly (e.g. to downgrade to a lower version), when those situations would occur. |
| Timeout \* | timeout | 5 sec |  | Timeout for read and write operations. |
| Use Encryption \* | use-encryption | false | * true * false | Turns on/off encrypted communication between the client and the server. The property’s behavior is SMB dialect dependent: SMB 2.x does not support encryption and the property has no effect. In case of SMB 3.x, it is a hint/request to the server to turn encryption on if the server also supports it. |
| Username | username | Guest |  | The username used for authentication. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Snowflake Openflow version history
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/version-history.md
section: Loading & Unloading Data
---

# Snowflake Openflow version history

This topic provides version history for [Snowflake Openflow](about.md).

To apply the latest updates to your deployment, runtimes, or connectors, see [Manage Openflow](manage.md).

## April 16, 2026

### Runtime Extensions 2026.4.16.18

* CDC Oracle: Auto-detects UNIQUE key constraints as replication keys when no
  primary key is defined.
* CDC SQL Server: Fixed `sysname` columns causing infinite schema-mismatch
  loop.
* CDC SQL Server: Fixed duplicate DDL emission for unchanged tables by
  disambiguating `EARLIEST` position.
* CDC Databases: Made `CdcSchemaRegistry` resilient to internal
  `TableSchema` class changes during upgrades.
* Removed preview labels from UpdateSnowflake\* processors.

### Connectors 2026.4.16.16

* CDC MySQL: Added Oversized Value Strategy parameter.
* Dataverse: Fixed inability to change type of _SNOWFLAKE_DELETED column.
* CDC SQL Server: Increased incremental load batch size to 100K rows.

## April 14, 2026

### Runtime Server 2026.4.14.16

* Fixed local change detection for versioned flows when updating a property that
  was not set previously.
* Runtime UI: Allows users to resume a suspended runtime in recovery mode.

### Runtime Extensions 2026.4.14.16

* Jira (Atlassian): Added Jira v2 core components including new processors for
  ingesting comments, changelogs, deleted issues, projects, permissions, users,
  worklogs, and other Jira entities.
* CDC Databases: Introduced a configurable Oversized Value Limit property
  (default 16 MB) on CDC and snapshot processors.
* CDC Oracle: Removed unnecessary Oracle database privileges from configuration
  scripts.
* Snowpipe Streaming: Removed Preview tags from PublishSnowpipeStreaming
  processors, marking them as generally available.

### Connectors 2026.4.14.15

* Kafka: Disabled flow-file-based offset tracking to prevent data loss during
  downscaling.
* Kafka: Added a new high-performance Kafka connector flow with
  PublishSnowpipeStreaming.
* CDC Oracle and SQL Server: Exposed table exclusion parameter for multi-database
  connectors.
* CDC Oracle: Added Snowpipe Streaming v2 routing with automatic v1 fallback.
* Salesforce: Explicitly set warehouse in MERGE pre-query to prevent failures
  when no default warehouse is configured.

## April 13, 2026

### AWS Data Plane Agent 1.37.0

* Improved custom ingress to simultaneously support load balancer security
  groups managed by both Openflow and deployment-specific configurations.
* Removed duplicate ingress rules for default custom ingress security group.

## April 10, 2026

### AWS Data Plane Agent 1.36.0

* Security patches and dependency upgrades.
* Improved resiliency of new Deployments and upgrades related to how AWS IAM
  permissions are created and refreshed.
* Improved cost efficiency of telemetry by removing unused or low-value metrics from being
  exported to Event Tables.

### SPCS Data Plane Agent 1.26.0

* Security patches and dependency upgrades.
* Improved cost efficiency of telemetry by removing unused or low-value metrics from being
  exported to Event Tables.

### Control Plane Core 0.109.1

* Security patches and dependency upgrades.
* Oracle Embedded License Connector added to Featured Connectors.

### Data Plane Service 0.109.0

* Security patches and dependency upgrades.

### Ingress Controller 2026.4.7

* Security patches and dependency upgrades.

### Runtime Operator 0.58.0

* Security patches and dependency upgrades.

### Control Plane UI 0.77.0

* Added third-party icons for Atlassian, Salesforce, and Microsoft SQL Server
  connector cards.
* Added support for resuming suspended Runtimes in recovery mode.
* Hide gateway version in Runtime upgrade dialog when appropriate.
* Improved warnings and guidance when users lack permissions to create a
  Deployment.
* Improved Connector installation process to no longer wait for available
  Runtimes to load before opening the dialog.

### Data Plane UI 0.14.0

* Security patches and dependency upgrades.

## April 9, 2026

### Runtime Server 2026.4.9.16

* Fixed Parameter and Parameter Context descriptions being lost during versioned
  flow upgrades.
* Fixed record path functions (`toBytes`, `toDate`, `toString`, and
  `format`) to return the correct types.
* Snowpipe Streaming: Exports metrics from `PublishChangeDataSnowpipeStreaming`
  to the event table for better observability.

### Runtime Extensions 2026.4.9.16

* CDC SQL Server: Improved error visibility during connection setup by fixing
  exception masking in `CatalogHelper` and unifying `setCatalog` usage.
* CDC SQL Server and Oracle: Switched to `DirectJsonRecordWriter` for storing
  change data as JSON in VARIANT columns, improving Snowflake ingestion
  efficiency.
* CDC MySQL: Fixed composite primary key column ordering by reading from
  `KEY_COLUMN_USAGE`, ensuring correct row identification during snapshot.

### Connectors 2026.4.9.15

* Dataverse: Updated merge journal process to include `SNOWFLAKE_ID` in the
  Dataverse schema.
* Box, Confluence, Google Drive, SharePoint, and Slack: Fixed Cortex-enabled
  connectors to preserve existing CORTEX SEARCH SERVICE configuration instead of
  overwriting it.
* CDC MySQL, PostgreSQL, Oracle, and SQL Server: Increased run duration on
  CPU-bound processors in incremental flows to reduce backpressure.
* CDC Oracle: Added staleness prevention to keep pipelines active during periods
  of low data volume.

## April 7, 2026

### Runtime Extensions 2026.4.7.16

* Kafka: Fixed duplicate message delivery in ConsumeKafka when consumers rejoin a
  consumer group during rebalance.
* CDC MySQL: Fixed data corruption when a MySQL server restart reassigns table IDs to
  different tables, preventing stale schema mappings from causing type mismatch errors
  during data ingestion.
* CDC Oracle: Fixed Oracle XStream CDC failing to read the LCR version when the XStream
  outbound server is configured on a different database instance (PDB vs CDB).
* CDC Databases: Reduced unnecessary Snowpipe Streaming query retries by penalizing the
  MergeSnowflakeJournalTable processor when no new data is available or the connection is
  disconnected, improving throughput.
* CDC SQL Server: Added table draining to ensure tables with large change backlogs are
  fully consumed before the connector moves to the next table.

### Connectors 2026.4.7.16

* CDC MySQL and PostgreSQL: Added a `Re-snapshot Table Exclusions` parameter to allow
  specific tables to be excluded from replication, enabling re-snapshotting use cases.

## April 6, 2026

### AWS Data Plane Agent 1.34.0

* Improved deployment upgrade time for customers with many Runtimes.
* Improved speed and reliability of upgrades from Openflow Deployments running EKS 1.32 to EKS 1.35.

## April 2, 2026

### Runtime Server 2026.4.2.16

* Increased the CDC connector metrics table row limit for observability dashboards from 30,000 to 40,000.
* Fixed inherited Parameter Context synchronization on versioned Process Group upgrades when new parameters are added.
* Fixed the provenance repository to honor the configured maximum attribute character size when reading entries.
* Fixed component bundle resolution and rollback behavior on versioned flow changes.
* Updated to Runtime UI 0.70.0.

### Runtime Extensions 2026.4.2.16

* Added Google Cloud Storage Provider for Iceberg.
* Fixed empty Private Key check for PGP Secret Key.
* CDC Databases: Added a new `MultiDatabaseGetSnowflakeJournalStreams` processor that supports multi-source CDC replication by mapping 3-part source table names (database + schema + table) to Snowflake destination schemas using a configurable naming pattern.
* CDC PostgreSQL: Added a “Flatten DML Records” option to `CaptureChangePostgreSQL` that writes change events in the final flat format at capture time, eliminating the intermediate file read-and-rewrite step in `EnrichCdcStream` and reducing disk I/O.
* Snowpipe Streaming 2: Added a “Destination Type” property to `PublishSnowpipeStreaming` that allows users to target either a named Pipe or a Table directly, with automatic migration to preserve existing Pipe-based configurations.
* CDC Databases: Added an “Excluded Comma Separated Source Table Names” property to `ListTableNames` (and its multi-database equivalent) that lets users exclude specific tables from replication.
* BigQuery: Fixed a property migration bug in `CreateReadSession` that caused incorrect processor configuration when upgrading from older flow versions.
* CDC PostgreSQL: Fixed `FetchTableSnapshot` failures on tables containing `bytea` columns by using a more compatible JDBC method to read binary data.
* CDC Databases: Updated the `DESTINATION_SCHEMA_NAME_PATTERN` placeholders from `{database}`, `{schema}` to `${source.database.name}`, `${source.schema.name}`, and `${source.table.name}`. A fixed schema name is now valid (the validator constraint requiring at least one placeholder has been removed).
* All connectors: Added a `Validation Mode` property to the `SetAttributesValidatingReferences` processor, allowing configuration of how attribute reference validation is enforced.

### Connectors 2026.4.2.16

* BigQuery: Reduced the concurrency of parallel streaming jobs (PSS) to prevent resource contention issues.
* SharePoint: Fixed a file removal pattern bug for customers with the `ENABLE_FIX_209969` account parameter set to false, where the pattern would fail to match and remove processed files.
* BigQuery: Prevented CDC and view ingestion from starting when the required temporary dataset parameter is not configured, avoiding accidental table failures.
* CDC MySQL, PostgreSQL, and SQL Server: Configured the source database connection pool with validation-on-borrow and periodic eviction of idle connections, preventing misleading “connection reset” errors caused by stale connections being reused after a server-side timeout.
* Dataverse: Added a `_SNOWFLAKE_ID` column to replicated records by mapping the source primary key, allowing downstream consumers to uniquely identify each record in Snowflake.
* CDC SQL Server, MySQL, and PostgreSQL: Added the new stream staleness prevention mechanism to the connector.
* Jira: Fixed an incorrect merge query in the Jira connector.

## April 1, 2026

### AWS Data Plane Agent 1.33.0

* Upgraded to AWS EKS 1.35.
* Added the EKS kube-proxy add-on for automated, managed upgrades of networking components.
* Improved auto-healing when EKS node groups are down or offline for extended periods.
* Improved cost efficiency of telemetry collection by ignoring low-value metrics.

## March 31, 2026

### AWS Data Plane Agent 1.31.3

* Security patches and dependency upgrades.

### SPCS Data Plane Agent 1.24.1

* Security patches and dependency upgrades.

### Control Plane Core 0.108.2

* Security patches and dependency upgrades.
* Fixed Oracle Connector License syncing for accounts with renamed organizations.

### Data Plane Service 0.108.2

* Security patches and dependency upgrades.

## March 30, 2026

### Runtime Server 2026.3.27.21

* Added `isValidDate` and `isValidInstant` Expression Language functions.
* Fixed inherited parameter context preservation during `KEEP_EXISTING` versioned flow deployment.
* **Behavior change:** When upgrading a running versioned flow to a new version, new components added in the new version are automatically started.

### Runtime Extensions 2026.3.27.21

* AWS: Fixed AWS connection pool shutdown on EKS with STS credential refresh.
* Google Drive: Reverted to correct default scopes in Google Drive components, with a new property to use the Google Cloud Platform scope when using Workload Identity Federation with impersonation.
* Kinesis: Handled `ResourceNotFoundException` in `ConsumeKinesis` when shards are not found or removed.
* Oracle: Added support for LCR positions V1 and separate connection and XStream attach, adding support for 12.1 and 12.0.
* Dataverse: Unknown Dataverse attribute types fall back to STRING.
* Salesforce: Fixed Salesforce formula field translation producing invalid SQL for date arithmetic.
* Snowflake: Fixed `PublishSnowpipeStreaming` skipping FlowFiles after pipe recreation due to a stale offset.
* Dataverse: Added a parameter to configure maximum fetched column size in `FetchMicrosoftDataverseTable`.
* Dataverse: Added table-level removal to the Dataverse connector.
* Snowflake: Fixed Workload Identity Federation token header format for Snowpipe Streaming 2.

### Connectors 2026.3.26.19

* CDC database connectors: Added metrics collection for observability dashboards.
* CDC SQL Server: Switched from Snowpipe Streaming v1 to v2.

## March 27, 2026

### AWS Data Plane Agent 1.31.2

* Replaced Ingress-Nginx with Openflow Ingress Controller.
* Fixed an issue with load balancer security group rules when multiple deployments share the same private security group.
* Security patches and dependency upgrades.
* Improved support for adding many Runtimes to a Deployment at once.
* Removed the need for ingress on port 80. All Deployment and Runtime ingress uses port 443.

### SPCS Data Plane Agent 1.24.0

* Security patches and dependency upgrades.

### Control Plane Core 0.108.0

* Security patches and dependency upgrades.
* Improved handling for users with large sets of roles when they need to provide this list of roles for creating and managing resources.
* Salesforce Bulk API connector is now generally available (GA).

### Control Plane UI 0.74.0

* Upgraded to stellar 0.28.0.
* The Create Runtime and EAIs dialog can open while EAIs are still loading.

### Data Plane Service 0.108.0

* Security patches and dependency upgrades.

### Openflow Runtime Gateway 2026.3.25.14

* Security patches and dependency upgrades.

## March 24, 2026

### Runtime Server 2026.3.24.18

* Improved layout of vertical space in the Runtime UI for longer lists of tables and schemas.

### Runtime Extensions 2026.3.24.20

* SQL Server: CDC components are now generally available (GA).
* Oracle: Fixed `TIMESTAMPTZ` mapping for named time zones.
* Added processors to support Snowpipe Streaming v2 in CDC database connectors.
* Improved `UpdateSnowflakeTable` caching and query batching for better performance.

### Connectors 2026.3.24.18

* CDC database connectors: Added metrics collection for observability dashboards.
* MySQL: Switched CDC connector from Snowpipe Streaming v1 to v2.
* PostgreSQL: Switched CDC connector from Snowpipe Streaming v1 to v2.
* Salesforce: Salesforce Bulk API connector is now generally available (GA) and includes a parameter to control warehouse cost optimization.

## March 20, 2026

### Control Plane Core 0.107.1

* Improved role picker for users with a large number of available roles.

### Runtime Server 2026.3.19.18

* Fixed Content Repository defragmentation.
* Improved reloading behavior for Scripted Record Reader and Writer processors.

### Runtime Extensions 2026.3.19.20

* PostgreSQL and MySQL: Set explicit field sizes for `VARCHAR` and `BINARY` columns to support larger values.
* PostgreSQL: Fixed cursor-based fetching by disabling `autoCommit` to prevent out-of-memory errors on large rows.
* Salesforce: Salesforce Bulk API components are now generally available (GA).
* Salesforce: Fixed Salesforce Upsert Lookup failing when field values contain a `+` sign.
* SQL Server: Improved handling of source database failover to prevent table sync failures.
* SQL Server: Improved snapshot performance for clustered and partitioned tables.

### Connectors 2026.3.19.15

* Dataverse: Added Table State Service to the Dataverse connector flow.
* Oracle: Parallelized Snowpipe Streaming v1 snapshot by primary key for improved performance.
* Slack: Added option to ignore channels from historical load in the Slack connector.

## March 17, 2026

### AWS Data Plane Agent 1.30.0

* Added support for recovering a deployment after Snowflake organization or account rename via an `update-account.sh` script.
* Improved Runtime metrics collection for larger data flows and Connectors handling large numbers of tables.
* Preparing for dedicated Openflow Ingress Controller to replace Nginx for BYOC runtime traffic.
* Added memory limiter and metrics routing in the OTEL collector for stability and separate pipelines.
* Security patches and dependency upgrades, including OTEL Collector 0.146.1.

### SPCS Data Plane Agent 1.27.0

* Improved Runtime metrics collection for larger data flows and Connectors handling large numbers of tables.
* Security patches and dependency upgrades.

### Control Plane UI 0.73.0

* Upgraded to stellar icons.
* Fixed a condition that could have prevented the splash screen from hiding when an unhandled error occurs.

### Control Plane Core 0.107.0

* Reliability improvements for Deployments when an account or organization is renamed.

### Data Plane Service 0.107.0

* Security patches and dependency upgrades.

### Runtime Operator 0.57.0

* Security patches and dependency upgrades.

### Openflow Ingress Controller 2026.3.16-17

* Security patches and dependency upgrades.
* Preparing for replacement of Nginx as the ingress controller for BYOC runtime traffic

### Runtime Server 2026.3.17.13

* New Runtime UI 0.68.0.
* Fixed splash screen which may stay visible under specific conditions.
* Improved content viewer to improve MIME type support.
* Track Content and truncate large resource claims in FileSystemRepository.
* Performance improvements for OpenTelemetry data collection.

### Runtime Extensions 2026.3.17.13

* Updated log configuration to capture INFO level logs for DescribeSFDCObject processor.
* Added Session Header handling to Snowpipe Streaming 2.
* Reduced default batch size and handle query timeout for SQL Server table name fetching.
* Kinesis: Significantly improved the `ConsumeKinesis` processor, removing use of the Kinesis Client Library.
* MySQL: Fixed `NullPointerException` in `CaptureChangeMySQL.disconnectBinlogClient` when `tableMapStore` is `null`.
* MySQL: Added TLS support for JDBC in `CaptureChangeMySQL`.
* SQL Server: Fixed `VARCHAR/NVARCHAR` sorting issue that may cause duplicate rows during batched paging.
* SQL Server: Reduced default batch size and improved query timeout handling table name fetching.
* Added `MultiDatabaseRouteOnSnowflakeParameter` and `MultiDatabaseExitRouteOnSnowflakeParameter` processors.
* Added `GetActiveSnowflakeStreams` processor for stream staleness prevention.

## March 13, 2026

### Control Plane Core 0.106.0

* Oracle connector is now generally available (GA).
* Registered preview of new PostgreSQL CDC SOM connector in the control plane catalog.
* Added data plane configuration options for CDC Snowpipe Streaming v2 rollout across MySQL, PostgreSQL, SQL Server, and Oracle connectors.

## March 12, 2026

### Runtime Server 2026.3.12.13

* Fixed 409 Conflict in Azure DevOps and Bitbucket flow registry clients for multiple Flows with shared branch.
* Fixed Flow Comparison showing changes for nested child components when using nesting flow versioning.

### Runtime Extensions 2026.3.12.15

* Iceberg: Added Storage Class to S3FileIOProvider.
* Salesforce: Use FQN database/schema for Salesforce Merge queries.
* Salesforce: Improved Salesforce formula parsing and logging.
* Salesforce: Fixed ListSFDCObjects becoming invalid after upgrade with dynamic relationships.
* MySQL: Escalate binlog communication failure log from WARN to ERROR.

### Connectors 2026.3.12.15

* Salesforce: Use FQN database/schema for Salesforce Merge queries.
* Salesforce: Do not emit bulletin on suspend warehouse attempts.

## March 11, 2026

### Runtime Server 2026.3.10.21

* New Runtime UI 0.67.0.
* Fixed flicker of overlapping connection warning in connector canvas.
* Aligned reusable canvas renderers for borders around PGs and RPGs.
* Fixed support for floating point numbers in connection flow file expiration.
* Restored special treatment of trigger serially processors (no concurrent tasks).

### Runtime Extensions 2026.3.10.20

* AWS Secrets Manager Parameter Provider now supports plain text secrets.
* ListS3: Fixed V1 pagination failing when delimiter is not set, which caused an infinite loop.
* MultiDatabaseFetchTableSnapshot: Added FlowFile size limiting.
* Slack: ConsumeSlackHistory now has an option to ignore channels from historical load.
* MySQL: Fixed PutSnowpipeStreaming storing JSON in VARIANT columns as strings by directly writing JSON records in CaptureChangeMySQL.
* Dataverse: Added `SNOWFLAKE_ID` column to the schema.
* Snowpipe Streaming: Improved channel error handling in PublishSnowpipeStreaming.
* Snowpipe Streaming v2: Fixed error when streaming `timestamp_tz` with seconds in offset.
* CDC Databases: Fixed tables transitioning to FAILED during snapshot load by adding buffer for lineageStartDate comparison.

### Connectors 2026.3.10.20

* CDC SQL Server: Added Oversized Value Strategy parameter.
* Salesforce Bulk API: Added initial support for formulas.
* Salesforce Bulk API: Warehouse suspension now occurs immediately after all merge queries are executed.

## March 6, 2026

### Runtime Extensions 2026.3.6.12

* UpdateSnowflakeView: Added support for raw SQL.
* PostgreSQL: Fixed PutSnowpipeStreaming storing JSON in VARIANT columns as strings by directly writing JSON records in CaptureChangePostgreSQL and FetchTableSnapshot.
* HubSpot: Added missing CRM object types and fixed API compatibility.
* CDC Oracle: Fixed partitioned tables support during snapshot.
* Snowpipe Streaming v1: Fixed empty FlowFile handling in PutSnowpipeStreaming when configured to use exactly-once delivery.

## March 5, 2026

### AWS Data Plane Agent 1.26.0

* Security patches and dependency upgrades.

### SPCS Data Plane Agent 1.20.0

* Security patches and dependency upgrades.
* Added per-connector error metrics, improving speed and reliability of the Openflow Observability dashboard.

### Control Plane UI 0.72.0

* Introduced a Connector listing for managing Connector Snowflake Objects (hidden by a feature flag until verified and ready).
* Updated the Connector installation process for Connector Snowflake Objects (hidden by a feature flag until verified and ready).
* Added MongoDB Connector Definition icon.
* Updated the Oracle Connector terms dialog to account for the new independent (BYOL) Oracle connector.
* Updated the user-facing action for “Rename” to “Set display name”.

### Control Plane Core 0.105.0

* Snowflake deployments can now heal from `UPGRADE_FAILED` state if they report a healthy status and version.
* Added MongoDB connector flow.
* Added support for `OPENFLOW_INGRESS_NAME` parameter when creating the URL to access Snowflake Deployments.

### Data Plane Service 0.105.0

* **Behavior change:** Runtime Python processor properties are now set based on Runtime node size (Small: disabled, Medium: <=2, Large: <=4).
* Security patches and dependency upgrades.

### Runtime Operator 0.56.0

* **Behavior change:** Python processors are now disabled by default to improve Runtime stability. Python processor usage is controlled by Runtime size (Small: disallowed, Medium: <=2, Large: <=4).
* Security patches and dependency upgrades.

## March 3, 2026

### AWS Data Plane Agent 1.25.0

* Reduced downtime for Openflow Runtimes when upgrading the AMI for BYOC Deployments.
* Added support for upcoming EKS 1.35 upgrade, though BYOC is still using EKS 1.34.

### Runtime Server 2026.3.4.15

* Expression Language: Added `compactDelimitedList()` and `trimDelimitedList()` functions.
* New Runtime UI 0.66.0.
* Hidden environmental changes in show/revert local changes.
* Connections now avoid overlapping, and warnings are shown for existing overlaps.

### Runtime Extensions 2026.3.4.16

* Snowpipe Streaming: Added optional Role property.
* CDC Databases: Fixed JSON column filtering in incremental load.
* CDC Databases: Fixed `clearSession()` removing already-transferred FlowFiles in FetchTableSnapshot.
* CDC Databases: Fixed `SEEN_AT` value being interpreted as seconds instead of milliseconds in incremental mode.
* CDC Databases: Minimized the risk of filling up the waiting queue in EnforceOrder processor.
* CDC Databases: Added “Oversized Value Strategy” to MultiDatabaseFetchTableSnapshot.
* CDC Oracle: Fixed verification in CaptureChangeOracle.
* CDC SQL Server: Added “Oversized Value Strategy” to MultiDatabaseCaptureChangeSqlServer processor.
* CDC SQL Server: Fixed DESC primary key handling in MultidatabaseFetchTableSnapshot.
* Salesforce: Fixed two bulletins in SubmitQueryJob for non-supported objects.
* Snowpipe Streaming: Added “Disabled” option for Offset Token Resolution in PublishSnowpipeStreaming.
* Snowpipe Streaming: Added channel error message on invalid rows log.

### Connectors 2026.3.4.15

* Added customer-facing metrics for MySQL connectors.
* Added customer-facing metrics for PostgreSQL connectors.
* Added customer-facing metrics for Oracle connectors.
* Confluence: Fixed user emails with quotes breaking the connector.
* Salesforce Bulk API: Added WaitForBulkJobs for warehouse usage cost optimization.

## February 26, 2026

### AWS Data Plane Agent 1.24.0

* Improved upgrade speed and reliability from EKS 1.32 to 1.34, fixing the temporary “Upgrade Failed” status for BYOC and BYO-VPC deployments.

### Runtime Server 2026.2.26.15

* Expression Language: Added `unique()` function for removing duplicates from delimited strings.

### Runtime Extensions 2026.2.26.16

* CDC Oracle: Added configurable starting position in CaptureChangeOracle to control where CDC begins reading.
* CDC Oracle: Added SSL/TLS connection support for CaptureChangeOracle.
* CDC Oracle: Removed preview tags from multi-database Oracle CDC processors (now generally available).
* CDC SQL Server: Fixed ChangeTrackingPosition parsing.
* CDC Databases: Removed stale entries from IncrementGroupAttribute processor to prevent unbounded state growth.
* Salesforce Bulk API: Fixed Merge Query failing when containing reserved keywords.
* Salesforce Bulk API: Moved row deduplication in the Merge Query to fix an error and remove the need for the pre-SQL query DELETE.
* Salesforce: Populated `sErrorMessage` when duplicate error occurs in `UpsertSFDCObjects` processor.

### Connectors 2026.2.26.15

* CDC Oracle: Added SSL/TLS connection support in Oracle connector.
* CDC Oracle: Added schema name mapping in Oracle connector.
* CDC Oracle: Parameterized starting position properties in Oracle connector.
* CDC Oracle: Fixed missing log in concurrent snapshot.
* Salesforce Bulk API: Ignore changes on not null constraints to prevent ingestion failures.

## February 25, 2026

### Runtime Server 2026.2.24.16

* Security patches and dependency upgrades.
* Improved observability for Connectors and custom groovy scripts.

### Runtime Extensions 2026.2.24.20

* Salesforce Bulk API: Added SubmitDeleteJob processor to delete data using Bulk API.
* Snowpipe Streaming: Added PublishSnowpipeStreaming processor.
* CDC SQL Server: MultiDatabaseCaptureChangeSqlServer now has parameterized concurrency level.
* CDC Oracle: Fixed CaptureChangeOracle processor blocking during license validation.
* CDC Oracle: Added source state verification in CaptureChangeOracle to detect source database issues.
* CDC PostgreSQL: Added support for enum primary key in PostgreSQL.
* CDC PostgreSQL: Map PostgreSQL `DOUBLE PRECISION` and `MONEY` types to `RecordFieldType.DOUBLE`.
* Snowpipe Streaming v2: Added Snowflake Managed Authentication to PutSnowpipeStreaming2.
* CDC PostgreSQL: Added Password Provider support to CaptureChangePostgreSQL, which gives support for AWS IAM Authentication with AWS RDS.

### Connectors 2026.2.24.20

* Salesforce Bulk API: Added `CLUSTER BY ("ID")` on table creation for better query performance.
* Salesforce Bulk API: Disabled NOT NULL constraints on Alter Table processors to prevent ingestion failures.
* Salesforce Bulk API: Added description for ‘Enable Journal Tables’ parameter.
* Slack: Connector performance optimizations.
* CDC SQL Server: Fixed EnforceOrder processor being triggered every second instead of on flow file arrival.

## February 20, 2026

### AWS Data Plane Agent 1.23.0

* Fixed CloudFormation template formatting that could cause false drift detection by Terraform.
* Fixed a rare issue with custom ingress and PrivateLink where EKS control plane nodes couldn’t communicate with worker nodes.

### Control Plane UI 0.70.0

* Runtime diagnostic bundles are now sorted consistently.

### Control Plane Core 0.104.0

* Deployments and runtimes remain accessible after an organization or account name change.
* Removed the temporary restriction that limited Snowflake deployment upgrades to deployment owners whose active role matched the deployment owner role.

### Data Plane Service 0.104.0

* Deployments and runtimes remain accessible after an organization or account name change.

### Runtime Server 2026.2.19.16

* Fixed an issue where the flow version changed unexpectedly when the flow contains a ghosted parameter provider.
* New Runtime UI 0.65.0.
* The provenance lineage view now displays the component type alongside the event type.
* Diagnostic bundles are now sorted and ordered consistently.

### Runtime Extensions 2026.2.19.20

* Azure: Added support for Azure federated identity credentials.
* Google Ads: GetGoogleAdsReport now supports batch ingestion with configurable date range batching.
* CDC Oracle: Fixed ALTER TABLE parsing for integer-type columns (INT, SMALLINT, INTEGER, DEC, DECIMAL, NUMERIC) that incorrectly defaulted the scale to 19 instead of 0 when precision wasn’t specified.
* Fixed S3 processors using the global endpoint for `us-east-1`.
* Fixed an error in DBCPConnectionPool when a dynamic property has a null value.
* Kafka: ConsumeKafka now includes a `kafka.timestamp` attribute on FlowFiles emitted with the `Record` processing strategy.
* Kinesis: ConsumeKinesis now supports a `Demarcator` processing strategy.
* Snowpipe Streaming v1: PutSnowpipeStreaming now includes a `Binary Encoding Format` property for HEX binary string data.
* CDC SQL Server: MultiDatabaseCaptureChangeSqlServer now uses dynamic backoff when there are no new changes.
* CDC Multi-Database: MultiDatabaseFetchTableSnapshot can now run multiple select statements concurrently.
* CDC SQL Server: Fixed an ingestion failure when a table is re-added with a different schema.
* CDC Oracle: Fixed handling of license changes in a duplicated database.
* CDC Databases: Improved error handling for DML operations in the EnrichCdcStream and MultiDatabaseEnrichCdcStream processors.
* SharePoint: Fixed file path decoding for folders containing percent signs.
* CDC Databases: Warnings are now logged when oversized values are set to null, making it easier to identify data truncation.
* CDC Databases: Added a `CLEARING_FLOWFILE_FAILED` failure reason for table state tracking.
* **Behavior change:** Removed the Vectara, Pinecone, RAG evaluation, Milvus, and Cohere bundles.

### Connectors 2026.2.19.20

* CDC Oracle: The default snapshot fetching strategy is now `CONCURRENT_BY_ROWID` instead of `SEQUENTIAL_BY_PRIMARY_KEY`, improving snapshot performance.
* CDC SQL Server: Added customer-facing metrics for the SQL Server multi-database connector.
* Slack: Thread broadcast replies are now filtered from Slack collection to prevent duplicate messages.
* Salesforce Bulk API: The staging table is now truncated instead of deleted, preventing channel invalidation errors with Snowpipe Streaming.
* Salesforce Bulk API: Object filters are no longer case-sensitive.
* Salesforce Bulk API: Merge queries are no longer executed when no data has been captured.
* Salesforce Bulk API: Added an `Enable Journal Tables` parameter (default: false) that creates a `JOURNAL_Object` table where data changes are appended.
* CDC SQL Server: Snapshots now use multiple channels per table to improve throughput.
* CDC PostgreSQL: The oversized value strategy is now configurable in the PostgreSQL connector.

## February 13, 2026

### AWS Data Plane Agent 1.20.0

* Fixed an upgrade issue for older BYOC deployments where permissions failures occurred for tags on IAM OpenID Connect providers.
* BYOC deployments now more clearly report their `Upgrading` status.

## February 11, 2026

### Control Plane Core 0.102.0

* BYOC deployments now automatically restore access to runtimes when their AWS load balancers are recreated with a new DNS.
* Improved upgrade reliability for deployments and runtimes.

### Openflow Runtime Gateway 2026.2.10.21

* Fixed connector installation failures in Snowflake deployments with PrivateLink enabled.

### AWS Data Plane Agent 1.19.0

* Fixed an upgrade issue for older BYOC deployments caused by an `eks:ListTagsForResource` permissions failure.

### Runtime Server 2026.2.10.18

* Python: Fixed an issue where NAR deletion could block indefinitely while a Python processor was initializing.
* Fixed Parameter Provider version fallback when importing a flow.
* Fixed Parameter Context binding for new process groups during version upgrades.

### Runtime Extensions 2026.2.11.9

* Kafka: Fixed an issue where ConsumeKafka could create duplicate messages during a consumer group rebalance.
* Parquet: Fixed a ParquetReader error (`ClassCastException`) for `java.time` logical types.
* MongoDB: Added components for the upcoming private preview of the MongoDB CDC connector.
* Slack: Fixed duplicate messages caused by thread broadcast replies.
* MySQL: Added an oversized data property to the CaptureChangeMySQL processor.
* MySQL & PostgreSQL: Fixed FetchTableSnapshot incorrectly flagging interim FlowFiles as the final snapshot.
* MySQL & PostgreSQL: You can now configure how values larger than 16 MB are handled when they exceed the supported limit.
* Confluence Data Center: Added support for the export page permission.

### Connectors 2026.2.10.18

* Box: Removed the concurrency limit on stage inserts, improving overall performance.
* MultiDB MS SQL Server: Added schema name mapping.

## February 6, 2026

### Runtime Operator 0.54.0

* Fixed asset synchronization in runtimes when parameter providers are used.

### AWS Data Plane Agent 1.18.0

* Fixed an issue where migrating secrets during an upgrade caused failures for AWS deployments between versions 0.55.0 and 1.1.0.

## February 4, 2026

### Runtime Server 2026.2.3.19

* Python: Fixed an issue where imported properties couldn’t be used as `PropertyDependency` parameters in Python processors.
* Records: Added timestamp truncation support in the RecordPath DSL.
* New Runtime UI 0.64.0.

### Runtime Extensions 2026.2.4.10

* Iceberg: Added `Endpoint URL` and `Path Style Access` properties to the S3 FileIO Iceberg Provider.
* Avro: Added a `Fast Reader Enabled` property to the Avro Reader.
* CDC Databases: MultiDatabaseFetchTableSnapshot now numbers outgoing FlowFiles with a 1-based `chunk.index` attribute.
* CDC Databases: The EnrichCdcStream and MultiDatabaseEnrichCdcStream processors now write `min(seenAt)` to FlowFile attributes.
* CDC Databases: FlowFile attributes now include the number of rows inserted and updated during journal merge.
* CDC Oracle: Oracle DML/DDL FlowFiles now include index attributes, consistent with other CDC database components.
* CDC MySQL: Fixed replication failures for zero-date datetime values (such as 0000-00-00) by aligning snapshot and CDC mapping.
* Salesforce Bulk API: Base64 fields (Blobs) are now automatically skipped for synced objects because this type isn’t supported by the Bulk API.

### Connectors 2026.2.3.18

* Kafka: New Kafka to Snowflake connector with Kafka OAuth authentication support.
* CDC Databases: Non-CDC processors in CDC connectors now include a table state change reason.
* Salesforce Bulk API: Reduced the default `Max Batch Size` in PutSnowpipeStreaming to lower memory pressure for records with large fields.
* Salesforce Bulk API: Added a parameter to disable incremental offloading, allowing full object syncs each execution to account for formula fields.
* Salesforce Bulk API: Added support for non-Bulk API compatible objects such as Knowledge data.

## February 3, 2026

### Control Plane Core 0.101.2

* Temporarily restricting Snowflake deployment upgrades to users whose active role matches the deployment owner role until a related issue is resolved.

## February 2, 2026

### Control Plane Core 0.101.1

* Temporarily limiting Snowflake deployment upgrades to the deployment owner while an issue preventing roles with `OPERATE` privilege from upgrading is resolved.

## January 30, 2026

### Control Plane UI 0.69.0

* Fixed an issue where some actions weren’t reevaluated on the current page after an active role change.
* BYOC deployments running the latest version now show their current status while processing actions like creating, upgrading, and deleting, including reporting failures when they occur.
* Added a `Download validator` button to the deployment creation dialog.

### Control Plane Core 0.101.0

* Fixed an issue where Snowflake deployments briefly showed a `Not Healthy` status while creating, just before becoming active.
* BYOC deployments running the latest version now show their current status while processing actions like creating, upgrading, and deleting, including reporting failures when they occur.
* Improved the logic for showing the Private Link option when creating SPCS deployments to avoid failures when the option isn’t fully supported.
* Added an API to generate and download CloudFormation templates for BYOC and BYO-VPC validators.

### Data Plane Service 0.101.0

* Fixed runtime creation on newly active Snowflake deployments. Previously, the latest available runtime versions weren’t always used.

### AWS Data Plane Agent 1.16.0

* Snowflake-hosted container images are now pulled directly from Snowflake registries into the deployment EC2 agent host and EKS cluster. Upgrade existing Openflow runtimes to switch entirely to Snowflake-hosted images.
* The agent now reports its current status to Openflow while processing user-requested actions, so it can be reflected in the Control Plane UI.
* Security patches and dependency upgrades.
* Added quick validation tools for BYOC and BYO-VPC deployments that report common errors to resolve before installing a full Openflow cluster.
* Improved reliability of deployment upgrades by automatically resolving issues where services were blocked from starting.
* Improved reliability of deleting deployments that had been upgraded multiple times.

### Runtime Server 2026.1.29.22

* Upgraded JDK to 21.0.10.
* Upgraded Apache NiFi API to 2.6.0, adding support for the `Record Gauge` method in ProcessSession.

### Runtime Extensions 2026.1.29.23

* Added the UpdateGauge processor with configurable `Gauge Name` and `Gauge Value` recording.
* Improved JSON Schema validation in GenerateJSON to address potential edge cases for nested fields.
* Added the PutIcebergRecord processor and Iceberg REST Catalog controller services, supporting both AWS and Azure storage FileIO providers.
* Deprecated the PutIcebergTable processor in favor of PutIcebergRecord.

## January 23, 2026

### Runtime Server 2026.1.22.19

* Resolved an issue where simultaneous commits to a Git-based Flow Registry Client could cause one user’s changes to overwrite another’s.
* New Runtime UI 0.63.0.

### Runtime Extensions 2026.1.22.19

* Salesforce Bulk API: Fixed an edge case where the initial snapshot might not create the destination table as expected.
* BigQuery: Processor properties now reference FlowFile attributes, making it easier to understand component behavior.
* Snowpipe Streaming v2: PutSnowpipeStreaming2 now tracks request IDs and automatically terminates empty relationships.
* CDC SQL Server: You can now set a maximum FlowFile size in CaptureChangeSQLServer.
* Jira: Components now include a verification feature to confirm that your configuration is correct.
* Confluence: Components now include a verification feature to confirm that your configuration is correct.

## January 21, 2026

### Runtime Server 2026.1.20.19

* You can now configure custom SSL certificates in GitHub and Gitlab Flow Registry Clients.

### Runtime Extensions 2026.1.20.21

* Enhanced PerformSnowflakeCortexOCR with page splitting and filtering features.
* BigQuery: Fixed time travel timestamp handling in TriggerBigQueryCdcOnState processor.
* Jira: Better handling of API rate limiting.
* CDC Oracle: You can now set a maximum FlowFile size in CaptureChangeOracle.
* CDC MySQL: Added logging in CaptureChangeMysql processor to log the retention period for binlog on start.
* Confluence: The connector can now ingest file attachments and embedded images.
* CDC MySQL and PostgreSQL: FetchTableSnapshot now includes partition chunk attributes to enable multi-channel streaming.

### Connectors 2026.1.20.18

* All Connectors: The default Snowflake Authentication Strategy is now SNOWFLAKE_MANAGED, a token-based method that works in both SPCS and BYOC deployments.
* Salesforce Bulk API: Added new parameter, Initial Load Chunking. This option lets you split large initial data loads into time-based chunks (MONTHLY, QUARTERLY, YEARLY) to avoid timeouts and API limits.

  When set, the initial data load is split into multiple jobs based on the interval. On the first run for an object, the connector queries Salesforce to find the oldest record and uses that as the starting point. Each subsequent job queries the next time chunk until caught up to the current time.

  Once caught up, the processor continues with normal incremental offload behavior.
* Oracle: Initial snapshot loads can now run with multiple concurrent threads for faster performance.
* SharePoint: The connector now logs when it encounters and processes empty files.
* Confluence: A new connector version is available that does not fetch access control lists (ACLs).

## January 16, 2026

### Runtime Server 2026.1.15.20

* New Runtime UI 0.62.0.
* The copy button in the Bulletin tooltip has been moved so it’s always visible.

### Runtime Extensions 2026.1.15.20

* SQL: Added support for Pre-Queries and Post-Queries in PutDatabaseRecord processor.
* CDC PostgreSQL: You can now set a maximum FlowFile size in CaptureChangePostgreSQL.
* CDC PostgreSQL and MySQL: FlowFiles now include start.row.index and last.row.index attributes.
* CDC MySQL: CaptureChangeMySQL now reads the event position from the header instead of from the binlog client.
* CDC Connectors: Splitting FetchTableSnapshot output FlowFiles into chunks of MAX_OUTPUT_FLOWFILE_SIZE size.
* Snowpipe Streaming: PutSnowpipeStreaming2 now has dedicated handling for empty FlowFiles.
* Salesforce Bulk API: Added support for Objects without SystemModStamp field.
* Salesforce Bulk API: You can now configure how the initial snapshot is split into time-based chunks.

### Connectors 2026.1.15.18

* Salesforce Bulk API: Added support for Objects with Tracking History enabled.
* Salesforce Bulk API: Added support for Objects without SystemModStamp field.
* CDC Connectors: Clearer log messages when a table enters a failed replication state.
* Google Ads: New “Login Customer ID” parameter lets you specify which manager account (MCC) to fetch reports for.
* Dataverse: The COPY GRANTS option is now applied to destination tables.

## January 15, 2026

### AWS Data Plane Agent 1.15.0

* Resolved an issue where some IAM policies were not deleted when a Deployment was deleted.

## January 14, 2026

### Runtime Server 2026.1.13.18

* Resolved an issue with how validation was triggered when Flow Registry Clients were configured.

### Runtime Extensions 2026.1.13.19

* Google Ads: The connector now works with manager accounts and their subaccounts.
* Oracle: Added new processors designed to accelerate initial snapshot loads.
* Snowpipe Streaming: PutSnowpipeStreaming2 now includes a counter for each destination.
* SQL: PutDatabaseRecord now uses setBytes binding for BINARY SQL types.

### Connectors 2026.1.13.16

* Slack: Improved handling of file attachments with Slack messages.
* Unstructured Connectors: Resolved Null Pointer Exceptions that occurred when parameters were left empty.
* Google Drive: You can now specify multiple folders by using a comma-separated list in the “Folder Name” parameter.
* Google Drive: New Simple Ingest and Cortex connectors that don’t require domain-wide delegation.
* Streaming Destination Modules: PutSnowpipeStreaming now limits channel concurrency for streaming destinations.

## January 12, 2026

### Control Plane UI 0.68.0

* You no longer need OWNERSHIP privilege on the Snowflake Role when configuring BYOC and SPCS Runtimes.
* You no longer need CREATE USER privilege to create a BYOC Runtime.
* **Behavior change:** Starting with AWS Data Plane Agent 0.37.0, you must specify a Snowflake Role when creating a Runtime.

## January 8, 2026

### Control Plane UI 0.67.0

* The Deployment details dialog now correctly shows Private Link and End User Auth over Private Link settings.
* The SAP connector card now displays an updated icon.
* The Runtime and Deployment details dialogs now display the SQL name when available.
* The Create Runtime dialog now requires a Snowflake role. It no longer requires CREATE USER privilege.

### Data Plane Service 0.98.0

* The system now polls less frequently for new Runtime versions, reducing query costs.
* Runtime Upgrades are now more reliable because all related components are discovered and upgraded together.

### AWS Data Plane Agent 1.13.0

* Resolved an upgrade failure affecting older Deployments that pulled helm charts from AWS OCI Repository.

### SPCS Data Plane Agent 1.11.0

* The deployment creation sequence has been optimized to reduce wait time.

## January 6, 2026

### Runtime Server 2026.1.5.14

* When you clear bulletins on a process group, bulletins for its scoped controller services are also cleared.
* Registry Clients no longer log confusing WARN messages when you commit the first version of a flow.

### Runtime Extensions 2026.1.5.19

* Oracle: Archive logs are now properly removed even when database traffic isn’t captured by XStream Out server.
* JIRA: Resolved a resource leak triggered by certain HTTP error codes and improved log messages.
* Azure components: Fixed NoClassDefFoundError: io/netty/handler/codec/quic/Quic.
* Kafka: The verification process is improved and now returns information about the Kafka Connection Controller Service.
* MS SQL Server: Database names with special characters are now properly quoted when available tables are fetched.

### Connectors 2026.1.5.13

* All Database CDC Connectors: The snapshot completion log now shows the correct total number of rows ingested.
* All Unstructured Connectors: The Cortex service name parameter is now correctly applied to documents.
* MySQL & PostgreSQL: You can now configure concurrency settings for Snapshot loads.
* JIRA: Performance is improved by reducing small FlowFiles and batching data sent via Snowpipe Streaming.
* Google Drive: Inserts via Snowpipe Streaming can now run in parallel instead of sequentially.

## December 19, 2025

### Runtime Oracle Extensions 2025.12.19.8

* Fixed an issue validating Oracle licenses that prevented the OracleCapture processor from starting.
* Improved change detection for large schemas

## December 17, 2025

### Runtime Server 2025.12.16.19

* Improved how invalid controller services are handled when you enable or disable them.
* Included Registry Clients in the Runtime documentation.

### Runtime Extensions 2025.12.16.19

* PostgreSQL: Fixed ordering of composite key columns.

### Connectors 2025.12.16.19

* Salesforce Bulk API: Added a new parameter to control case sensitivity for object identifiers created in Snowflake. By default, column names remain case sensitive for backward compatibility. This default may change at public preview or general availability.
* Confluence Data Center: New connector to integrate with Confluence Data Center edition.

## December 16, 2025

### AWS Data Plane Agent 1.12.0

* Fixed an issue where BYOC deployment upgrades failed due to a mismatch between the machine image and Kubernetes cluster versions.
* Fixed an issue where BYOC deployment upgrades failed with the error message “OCI Registry Login Failed”.

## December 11, 2025

### Control Plane Core 0.95.0

* Fixed an issue where the Runtime Run As Role couldn’t be set for roles containing Snowflake-restricted characters, such as hyphens.

### Runtime Server 2025.12.11.21

* Improved behavior when enabling controller services that are invalid and shouldn’t be enabled.
* New Runtime UI 0.59.0.
* Registry clients now support property verification.

### Runtime Extensions 2025.12.11.21

* AWS Secrets Manager: Parameter Provider now considers non-string values as valid parameters.
* RenameRecordField processor now properly handles multiple records per FlowFile.
* Kinesis: Fixed an issue where the ConsumeKinesis processor throttled new records even when buffers were empty.
* Snowflake: Added a default network timeout to the Snowflake Connection Service.
* Confluence: Fixed handling of page deletion.
* Confluence Data Center: Fixed the HTTP response decoder for the client.
* MySQL: Improved logging for table mapping when consuming binlog events.
* CDC Databases Connectors: Observability dashboards now display the failure reason when a table’s replication status changes to failed.

### Connectors 2025.12.11.18

* SQL Server: Exposed new parameters (Re-read Tables in State and Starting Change Tracking Position) for starting position.
* Oracle: Set CASE_INSENSITIVE as the default for created Snowflake objects.
* Jira: Added support for App Forge authentication method.
* Confluence: Added support for App Forge authentication method.
* Oracle: Fixed missing service name in XStream URL in default parameter values.
* Oracle: Added support for internationalization.
* Unstructured Connectors: Added a parameter to specify the Cortex Search Service name.
* Slack Connectors: Added a parameter to control whether user names are resolved.

## December 9, 2025

### Control Plane Core 0.94.0

* Added support for accessing and using Openflow with an organization or account that has been renamed.

## December 8, 2025

### AWS Data Plane Agent 1.11.0

* Fixed issue upgrading older deployments with non-critical “inconsistent result after apply” error message.

## December 5, 2025

### Runtime Server 2025.12.4.19

* New Runtime UI 0.58.0.
* Added new action to clear bulletins.
* Improved error handling when launching the Status History dialog.

### Runtime Extensions 2025.12.4.19

* Kinesis: Fixed checkpoint committed records in ConsumeKinesis that could previously cause data loss.
* PostgreSQL: Fixed issue where CaptureChangePostgreSQL ignored events when data was loaded via COPY FROM STDIN.

### Connectors 2025.12.4.17

* CDC SQL Server MultiDB: Added and exposed support for case sensitivity for created Snowflake objects.

## December 3, 2025

### AWS Data Plane Agent 1.10.0

* Added support for encrypting EBS volumes across the entire Openflow Deployment.

### SPCS Data Plane Agent 1.9.0

* Snowflake Deployments encountering internal certificate authority mismatch issues are now auto-healed on upgrade.

### Control Plane Core 0.93.0

* Retained visibility and use of resources when an account name or organization is changed.
* Improved resource utilization efficiency for Small size runtimes in Snowflake Deployments, allowing 3 runtime pods per node instead of 2.
* Added Manage endpoints action for SPCS deployments (requires account parameter).
* Improved external access integration (EAI) list to only show those EAIs the user has access to when creating a runtime in a Snowflake Deployment.

### Data Plane Service 0.93.0

* Improved resiliency of automatic diagnostic bundling and cleanup behavior when a runtime fails to create.
* Added management capabilities for Openflow endpoints in a deployment accessible via new API methods.
* Extended wait time for runtime upgrade failures in SPCS deployments to avoid premature timeout and failure.

### Control Plane UI 0.65.0

* Added Manage endpoints action for SPCS deployments (requires account parameter).

### Data Plane UI 0.11.0

* Added Openflow endpoints management view for SPCS deployments (requires account parameter).

### Openflow Ingress Controller 2025.12.2-17

* Added support for routing to Openflow endpoints attached to Openflow runtimes.
* Fixed client IP address forwarding when evaluating Snowflake privileges for Openflow runtimes.
* Fixed request header propagation to support deployments with Private Link enabled.

### Runtime Server 2025.12.3.16

* Added support for discovering listen ports from Openflow runtime processors to provide users as available targets for Openflow endpoints.
* Controller Services: Fixed validation and enabling that could take too long and cause the runtime to not start.

### Runtime Extensions 2025.12.3.16

* Kinesis: Introduced Shared Throughput consumer in ConsumeKinesis and removed concurrency limits in the HTTP client.
* Kafka: Added support for specifying custom SASL Extensions.
* EventHub: Added support for OAuth authentication in EventHub processors.
* AWS RDS: Added support for AWS RDS IAM Authentication in the DBCP Connection Pool to access databases over JDBC.
* Listen\* Processors (Examples: ListenHTTP, HandleHttpRequest, ListenOTLP): Added support for new ListenComponent and ListenPortDefinition NiFi APIs to allow discovery of listen ports for use with Openflow endpoints.
* OpenflowRuntimeSSLContextProvider: Added new control service for use with Listen\* Processors to integrate with Openflow endpoints.
* Oracle: Fixed handling of case sensitivity on column names when using lower casing.
* Oracle: Fixed support for internationalization.
* MySQL: Fixed filtering of Azure-specific system tables.
* BigQuery: Added new components for the Google BigQuery Change Data Capture (CDC) connector.
* SQL Server: Added ability to choose the starting position when reading the stream.
* Confluence Data Center: Improved support for Audit Records ingestion.
* Confluence: Improved performance to retrieve Confluence page IDs.
* Confluence: Added support for Forge App authentication method.
* Google Drive: Improved recursive listing efficiency when listing the content of a drive.
* Slack: Fixed fetching information of users for large workspaces with a large number of users.

### Connectors 2025.12.3.15

* Google Drive: Fixed potential NullPointerException when Google Drive Folder parameter is not set.
* Slack: Added check to verify files have content before uploading to Snowflake.
* CDC PostgreSQL: Increased backpressure settings to better support large number of synced tables.
* Dataverse: Improved the query for the deletes in the Journal Table.

## December 2, 2025

### Control Plane Core 0.92.0

* Fixed a thread contention issue in Snowflake Deployments that could cause some Runtime actions triggered from Control Plane to time out and fail.

### Data Plane Service 0.92.0

* Fixed a thread contention issue in Snowflake Deployments that could cause some Runtime actions triggered from Control Plane to time out and fail.

### Openflow Ingress Controller 2025.11.20-18

* Added support for Programmatic Access Token authentication and authorization.

### Openflow Runtime Gateway 2025.11.19.22

* Added support for Programmatic Access Token authentication and authorization.

## November 21, 2025

### AWS Data Plane Agent 1.8.0

* Restores support for private Openflow BYOC Deployments by removing all dependencies on URLs outside of Snowflake and AWS, addressing an issue introduced in 1.6.0

### Control Plane Core 0.91.0

* Fixed a rare issue that prevented a Runtime from being activated after it had been suspended

### Data Plane Service 0.91.0

* Fixed an issue causing connector installations to fail on new Runtimes when bulletins are present

## November 20, 2025

### Runtime Server 2025.11.20.20

* Improved visibility of Runtime operations with new metrics for Connectors

### Runtime Extensions 2025.11.20.19

* New components to interact with SAP Business Data Cloud and mapping of CSNs into Snowflake Semantic Views
* CDC MySQL - Improved reliability by clearing out the table map prior to CDC reconnects
* CDC Databases - Fixed a potential deadlock issue with MergeSnowflakeJournalTable when “poll query result” is cleared during operation
* CDC MariaDB - Added support for MariaDB in the MySQL components
* CDC PostgreSQL - Added support for primary keys of type `numeric`

### Connectors 2025.11.20.17

* CDC Databases - Adjusted backpressure thresholds on some connections when processing a lot of data
* Salesforce - Gracefully handle scenarios where we insert duplicate rows in the staging table
* CDC Databases - Exposed parameter to enable private connectivity in the PutSnowpipeStreaming processor for data ingest
* CDC PostgreSQL - Adjusted yield duration on CaptureChangePostgreSQL to not overuse replication connections

## November 19, 2025

### Runtime Extensions 2025.11.18.22

* SQL Server - Performance improvement in Snapshot query
* Dataverse - Schemas are no longer filtered when no column filtering value is provided
* Telemetry - “Bytes Received” is now available for many Snowflake processors after fixing the file size for provenance events
* Azure components - Fix ConsumeAzureEventHub by excluding netty-codec-http3 dependency
* Google Cloud - Added support for Workload Identity Federation
* Azure Blob Storage - Added support for uploading file larger than 200GB

### Connectors 2025.11.18.17

* Google Ads - Set the new Authentication Strategy property of the GCP Credentials Controller Service
* Multi Database SQL Server - Fix `source.table.fqn` value handling
* Salesforce - Add logging for successful sync operations to ease monitoring via the events table

### AWS Data Plane Agent 1.6.0

* Improves security by removing unused inbound ports on Load Balancers configured for “Custom Ingress.” You can further limit access with your own Security Group for these Openflow Deployments.
* Improves security and eased configuration of Runtimes with an optional Deployment-level IAM Role to securely access AWS resources like RDS, MSK, Kinesis, and S3. You can now attach IAM Policies to Openflow’s “NodeInstanceRole” that are granted to all Runtimes in that Openflow Deployment.
* Upgrades EKS Cluster from 1.32 to 1.34 for long-term maintenance and security patching
* Resolves an issue where restarting some EC2 nodes frequently caused the Openflow Deployment to freeze
* Security patches and upgrades to third party libraries

### SPCS Data Plane Agent 1.4.0

* A missing event table will no longer cause failure when creating an Openflow Snowflake Deployment.
* Fixed certificate based issues accessing Runtimes and deploying Connectors into Deployments older than 60 days
* Security patches and upgrades to third party libraries

### Control Plane UI 0.64.0

* Adding support for the new Cleaning Up Runtime state
* Adding support for terms accepted trial not started or active trial

### Control Plane Core 0.90.0

* If a Runtime fails to create, it will automatically generate a diagnostics bundle and clean up any partially created resources in the cluster.

### Data Plane Service 0.90.0

* If a Runtime fails to create, it will automatically generate a diagnostics bundle and clean up any partially created resources in the cluster.

### Openflow Ingress Controller 2025.11.12-18

* Security patches and upgrades to third party libraries

## November 15, 2025

### Runtime Extensions 2025.11.16.2

* Improved reliability for high volume deployments by relocating state tracking for replication position and journal versioning in CDC Connectors

## November 14, 2025

### Runtime Server 2025.11.14.17

* Easier debugging with an updated Bulletin Board that can expand stack traces
* Fixed bug when rendering Documentation for extensions that lack tags
* Viewing component state now supports showing 5,000 local entries and 5,000 cluster entries, up from 500 each

### Runtime Extensions 2025.11.14.17

* Reduced costs by removing the validation query in the Snowflake Connection Service

### Connectors 2025.11.14.14

* Google Drive and Google Sheet - Set the new Authentication Strategy property of the GCP Credentials Controller Service

## November 13, 2025

### Runtime Extensions 2025.11.13.19

* Added support for Web Identity authentication to AWS MSK IAM Connection Service in Kafka components
* Added support for Web Identity authentication to AWS Credentials Controller Service for all AWS components
* Added Flow Registry Client support for Bit Bucket Data Center edition
* Fixed Worker ID generation in ConsumeKinesis and added provenance data
* Added support for nested paths in HashiCorpVaultParameterProvider
* Dataverse - Added retry-after mechanism
* Added Snowflake Secrets Parameter Provider
* CDC Database Connectors - Improved reliability and performance on state management
* JIRA - Enriched issues with email addresses
* HubSpot - Fixed handling of 414 error code responses while fetching objects

### Connectors 2025.11.12.21

* Dataverse - Add Column JSON Filtering parameter
* PostgreSQL - Added FIFO FlowFile prioritizer on queues in Postgres Snapshot Load
* MySQL - Expose parameter for the starting position of the replication
* Dataverse - Added updated_ad and deleted columns
* Salesforce - Switched the CSV Reader to RFC 4180
* Salesforce - Fixed configuration to capture soft deletes

## October 31, 2025

### AWS Data Plane Agent 1.1.0

* Security patches and upgrades to third party libraries

### SPCS Data Plane Agent 1.1.0

* Security patches and upgrades to third party libraries

### Control Plane Core 0.88.0

* Enable Openflow Oracle Connector in Snowflake Deployments

### Data Plane Service 0.88.0

* Security patches and upgrades to third party libraries

### Runtime Operator 0.45.0

* Security patches and upgrades to third party libraries

### Runtime Extensions 2025.10.31.13

* Improved reliability of high volume CDC Connectors

## October 30, 2025

### Runtime Extensions 2025.10.30.21

* CDC Database Connectors - New components for multi-databases support are now included in the runtime image
* JIRA - Added support for Forge App authentication method
* New OAuth2 controller service to get Snowflake issued JWTs for Workload Identity Federation

### Connectors 2025.10.30.20

* PostgreSQL - Added First In First Out (FIFO) connection prioritizer in PostgreSQL Snapshot load
* CDC Database Connectors - Disabled load balancing in the incremental flow to ensure single node processing of the data
* Dataverse - Added parameter for the new JSON Column Filtering property

## October 29, 2025

### Runtime Server 2025.10.28.18

* Fixed bug in Summary table formatting Process Group task time

### Runtime Extensions 2025.10.28.20

* Kinesis - Support Output Strategy property in ConsumeKinesis processor
* Kinesis - Added the new Kinesis components leveraging the latest AWS client library
* SQL Server - Added support for multiple databases
* MySQL - Added the possibility to specify the binlog starting position for reading the CDC stream
* PostgreSQL - Added support for negative scale in numeric types
* SQL Server - Improved the ordering of the ORDER BY clause
* Snowpipe Streaming - Improved Input Buffer Handling in PutSnowpipeStreaming2
* PostgreSQL - Improved performances in FetchTableSnapshot on large tables with composite primary key
* MySQL - Fixed incorrectly replicated DATEs pre 1582-10-15 (Julian calendar)

### Connectors 2025.10.28.9

* Oracle - support for multiple logical databases
* MySQL, PostgreSQL, SQL Server - no longer writing the unused avro.schema FlowFile attribute
* Jira - support for fetching Worklogs

## October 28, 2025

### AWS Data Plane Agent 0.61.0

* Security patches and upgrades to third party libraries

## October 27, 2025

### Control Plane UI 0.63.0

* Security patches and upgrades to third party libraries

## October 24, 2025

### Control Plane Core 0.86.0

* Fixed issue where a user can’t log into Openflow if their most recently selected active role was revoked.
* Disable creating a Snowflake Deployment if the user’s role is not granted CREATE COMPUTE POOL privilege.

### Control Plane UI 0.62.0

* Improved display for Upgrade Failed, Inactive, and Activate Failed states
* Always show the current “Run As” role even if it’s not in the current user’s set of account roles
* Fixed issue with “Run As” role validation in Create Runtime dialog

## October 23, 2025

### Runtime Server 2025.10.23.16

* Bulletin icons now reflect the severity of the message
* Parameters can now be edited by double clicking on the row in the Parameter Context
* Included state of system diagnostics API call in the loading skeleton and spinner in the Cluster Listing
* Improved awareness of errors through the global banner when extension types fail to load
* Updated styling for unset, blank, empty styles throughout the Runtime UI

### Runtime Extensions 2025.10.23.11

* Fixed incorrect handling of Drop Table actions in UpdateSnowflakeTable processor
* Oracle - Improved performance by moving metadata generation in FetchSnapshot processor
* Oracle - Fixed handling of column filters DDL
* Dataverse - Added optional configuration to filter columns being fetched
* Cortex - Improved error message when there is an issue calling Cortex in PromptSnowflakeCortex processor
* MySQL - Fixed the filtering out of the “user” table
* Salesforce Data Cloud - Added support for detecting deletions of Data Shares and linked Objects in the shares
* MySQL - Fixed skipping compressed transaction DDLs and DMLs spanning over the transaction
* JIRA - Enrich Jira Worklogs processor
* Confluence - Support for Confluence Data Center edition
* Added Offset Tracking Resolution to PutSnowpipeStreaming2 processor
* Sharepoint - Fixed pagination handling when listing more than 200 items
* Salesforce - Added optional lookup key in UpsertSFDCObjects processor allowing user to specify a field other than ID for retrieving the record to upsert

### Connectors 2025.10.22.17

* Excel - Added missing SPCS related configuration options
* HubSpot - Added support for new object types: Notes, Orders and Carts
* Salesforce - Added missing configuration for authentication strategy for usage of the connector in Openflow Snowflake Deployments
* PostgreSQL - Migrated the connector to standard identifiers for better management of case sensitivity on object naming
* Oracle - Removed the addition of Snowflake Specific Columns to leverage FetchSnapshot processor instead and improve performances
* Sharepoint - New Simple Ingest connector that does not fetch the ACLs associated to the data
* Salesforce - Added support for specifying the object fields that should be included/excluded when retrieving the data

## October 20, 2025

### Control Plane Core 0.84.1

* Released Control Plane Core version 0.84.1.

## October 17, 2025

### AWS Data Plane Agent 0.60.0

* Fixed certificate issues that blocked access to runtimes and connector deployments in deployments older than 60 days.

### Control Plane Core 0.84.0

* Fixed input validation issues when filtering in role selection menus.
* Fixed an issue where links to runtimes were shown to users without access privileges.
* Fixed an issue where users with only USAGE privilege on a runtime couldn’t create connectors in that runtime.
* Fixed an issue for users accessing Openflow over PrivateLink with a network policy enforcing the VPCE ID.
* Added support for suspending and activating runtimes in Snowflake deployments.
* Snowflake deployments now display their current version immediately after creation is initiated.

### Control Plane UI 0.60.0

* Warns users before navigating to a deployment or runtime where VPN connectivity may be required when using custom ingress.
* Keeps the selection panel open for multi-select components after a selection is made.
* Enforces user permissions for viewing the runtime canvas and hides links if permissions are missing.
* Improves setup experience by considering total counts of runtimes and deployments, not just those in the ACTIVE state.
* Makes the Snowflake role optional when creating a runtime in a BYOC deployment.
* Improves text overflow handling for connector cards.

### Openflow Ingress Controller 2025.10.16-17

* Fixed an issue that prevented access to runtimes over PrivateLink.
* Fixed an issue where a new runtime couldn’t be accessed if its name matched that of a previously deleted runtime.

## October 15, 2025

### Runtime Extensions 2025.10.14.22

* Added Snowflake Managed Authentication Strategy to SnowflakeConnectionService and PutSnowpipeStreaming.

### Runtime Oracle Extensions 2025.10.14.22

* Improved snapshot query performance by correcting ORDER BY column sorting.

### Runtime Server 2025.10.14.12

* Fixed missing Process Group identifier information in Processor and Controller Service log records.

## October 08, 2025

### AWS Data Plane Agent 0.59.0

* Added support for workarounds when using self-managed certificates in AWS.
* Fixed issues that caused BYOC deployment upgrades to get stuck with invalid image references and job cleanup.
* Restored support for adding customer-managed IAM policies to Openflow’s IAM roles.

## October 03, 2025

### Connectors 2025.9.30.17

* Updated the Dataverse connector to set empty collation for the Dataverse journal table.

### Runtime Extensions 2025.10.2.19

* Added better support for case sensitivity on Snowflake objects in `MergeSnowflakeJournalTable`.
* Improved HubSpot pagination handling when retrieving more than 10,000 records.
* Unstructured Processing - `PerformSnowflakeCortexOCR` now uses the `AI_PARSE_DOCUMENT` function instead of `PARSE_DOCUMENT`.
* Added better support for case sensitivity on Snowflake objects in PutSnowpipeStreaming.
* PostgreSQL - Fixed unsigned handling of type OIDs in the CaptureChangePostgreSQL processor.

### Runtime Server 2025.9.30.19

* New Runtime UI 0.53.0.
* Fixed a regression that prevented tabbed dialogs from remembering the previously active tab.
* Fixed balto icon regressions and selected radio button display issues.
* Fixed an issue where Parameter Context update requests weren’t deleted when users canceled the request.
* Fixed an issue that caused double scroll bars to appear in the asset upload dialog.
* Fixed an issue where the selected asset count could get out of sync.

## September 26, 2025

### AWS Data Plane Agent 0.52.0

* Improved efficiency of private IP addresses used by EKS cluster nodes, reducing the total number required for scaling out to many Runtime nodes.
* Fixed issue with Runtime logs that incorrectly redacted some component IDs.

### Connectors 2025.9.25.17

* Confluence connector - Better failure handling and retries when facing API rate limits.

### Control Plane Core 0.80.0

* Support for deploying Oracle Runtime Extensions to Runtimes in BYOC Deployments for PrPr customers who have accepted the Terms of Service.
* Fixed an issue where Snowflake deployment moved into an active state prematurely during an upgrade.
* Fixed a rare issue where Snowflake deployment deletions could get stuck and need manual intervention.

### Control Plane UI 0.57.0

* Introduced new deployment upgrade dialog that shows the version mapping.

### Data Plane Service 0.80.0

* Support for deploying Oracle Runtime Extensions to Runtimes in BYOC Deployments for PrPr customers who have accepted the Terms of Service.

### Runtime Extensions 2025.9.25.19

* CDC database connectors: Removed Record Reader from MergeSnowflakeJournalTable processor.
* All connectors log the Query ID whenever a connector executes a query in Snowflake.

### Runtime Oracle Extensions 2025.9.23.19

* PrPr release of Oracle Extension for Openflow Runtimes.

### Runtime Server 2025.9.25.19

* Improved the Openflow Connectors upgrade user experience.

## September 23, 2025

### Connectors 2025.9.23.17

* PostgreSQL connector now includes a new parameter so you can set the replication slot name.
* The PostgreSQL, MySQL, and SQL Server connectors now support column names that include special characters.

### Runtime Extensions 2025.9.23.19

* Added compression to rows added using the Insert Rows method through PutSnowpipeStreaming2.
* MySQL: Added support for compressed bin log events.
* Added new processors, UpdateSnowflakeSchema and UpdateSnowflakeStream, to better manage object lifecycles and support case sensitivity.
* HubSpot: Added support for new “Notes,” “Orders,” and “Carts” object types.
* Slack: Fixed Null Pointer Exception when trying to verify the configuration of ConsumeSlackConservations processor.

### Runtime Server 2025.9.23.19

* Using latest Apache NiFi 2.6.0 release.
* Improved the flow upgrade user experience by improving Flow Differences Filters to handle renameProperty, removeProperty, and createControllerService.
* New Runtime UI 0.52.0.
* Fixed bug allowing default values for dynamic properties.
* Improved the performance of the searchable select used in the Property combo editor.

## September 19, 2025

### AWS Data Plane Agent 0.50.0

* Openflow now supports VPCs with DHCP Option Sets, making it easier to connect to private data sources.
* You can now secure Openflow deployments with PrivateLink, while still allowing browser-based authentication to runtimes without PrivateLink.
* Fixed an issue during upgrades where IAM inline policies failed by exceeding maximum character limits.

### Control Plane Core 0.78.0

* Improved error messages for Snowflake deployment failures to show the root causes.
* Fixed a case where BYOC deployment ends up in Not Healthy state but can’t be deleted from Openflow Control Plane.

### Control Plane UI 0.55.0

* Removed unnecessary title on Runtime and Deployment state columns.

### Openflow Runtime Gateway 2025.9.18.22

* Improved cookie session handling to allow users to remain logged in, even when Runtime is open in an inactive browser tab.

## September 18, 2025

### Connectors 2025.9.17.18

* Addition of the 2 new Oracle CDC connectors.
* Confluence connector - The introduction of a new controller service to handle API rate limits will show the connector as a process group with local changes.
  This can be ignored and will be resolved when upgrading the connector to the next version, when available.

### Runtime Extensions 2025.9.18.18

* Introduced the `UpdateSnowflakeTable` processor, which is like `UpdateSnowflakeDatabase`, but designed for tables and improved case sensitivity.

## September 16, 2025

### Connectors 2025.9.16.18

* SQL Server connector: Exposed the new SQL Server query interval property as a parameter.
* The new controller service for API rate limits in the Jira connector causes the connector to appear as a process group with local changes. You can safely ignore this; it will be fixed in a future connector upgrade.

### Control Plane UI 0.54.0

* Allow users to optionally configure whether end users authenticate over PrivateLink.
* The Estimated time to completion shown when creating Snowflake Deployments and Runtimes is now more accurate.

### Openflow Ingress Controller 2025.9.15-14

* Initial release offering privilege isolation for Openflow runtime authentication and authorization to Snowflake deployments.

### Runtime Extensions 2025.9.16.20

* Added support for DATETIME columns with PutBigQuery processor.
* You can now specify the HTTP protocol version in StandardWebClientServiceProvider.
* Better logging and increased timeouts for FetchSharepointFile processor.
* Added the option to set the replication slot name in CaptureChangePostgreSQL processor.
* You can now use `-infinity` and `+infinity` with Postgres TIMESTAMPTZ values.
* New controller service StandardAtlassianRequestRateManager to deal with API rate limits for the Jira connector.
* Fixed exceptions thrown from ListMicrosoftDataverseTables when table schema isn’t returned by API.

### Runtime Operator 0.40.0

* Support deploying the new Openflow ingress controller for PuPr release of Snowflake deployments.

### Runtime Server 2025.9.16.19

* New Runtime UI 0.51.0.
* You can now delete individual entries in the component state if the component allows it.
* Improved tooltips for Property and Parameter values, especially when values are long or reference external resources.

## September 15, 2025

### AWS Data Plane Agent 0.41.1

* Fixed an issue from AWS Data Plane Agent 0.39.0 that blocked the first install of an Openflow deployment into a new AWS region.

## September 11, 2025

### Control Plane Core 0.73.0

* Fixed issue preventing runtime deletion in Snowflake deployments when a network policy is present.

### Data Plane Service 0.73.0

* Fixed an issue that prevented runtime deletion in Snowflake deployments when a network policy was present.
* Fixed an issue that prevented new versions of runtime extensions from being used when runtimes were created or upgraded.

### Runtime Extensions 2025.9.11.18

* CaptureChangeSQLServer: A new setting, `Table Changes Query Interval`, is introduced to reduce the resource pressure on the source database. Now, the processor queries the source database every 10 seconds (`10 sec`) by default. To restore the original behavior, change the setting to `0 sec`.

## September 10, 2025

### AWS Data Plane Agent 0.40.0

* Resolved an issue where deployments were left partially upgraded after AWS Data Plane Agent 0.39.0 was used.

### Connectors 2025.9.9.18

* Unstructured connectors: Improved reporting on `ChunkText` failures.

### Runtime Extensions 2025.9.10.7

* Microsoft Dataverse: Fixed handling of schemas that include the `Edm.Date` type.
* Fixed attribute prefix handling in the XML Reader.
* Fixed MongoDB controller service for certain authentication methods when information is provided through the URI.
* Added Azure DevOps Flow Registry Client for Git integration with Azure DevOps to version flows.

### Runtime Server 2025.9.9.20

* Added the ability to change the version of a ghosted component if a bundle with the same coordinates and a different version exists.

## September 8, 2025

### AWS Data Plane Agent 0.37.0

* Added support for AWS Data Plane Agent deployments that have DHCP Option Sets configured on the account.
* Upgraded all EKS nodes from Amazon Linux 2 to Amazon Linux 2023.

### AWS Data Plane Agent 0.38.0

* Added support for AWS accounts that require encrypted EBS volumes by default, even if an unencrypted EBS volume is requested. Customers can enable this by adding IAM Policies to the `*-eks-role IAM Role` that grant access to their KMS keys.

### Control Plane Core 0.72.0

* Error messages are now clearer and more informative when runtime-related failures occur.
* Fixed a rare case where an older deployment version disallowed creating a runtime with the same name as a previously deleted runtime.

### Control Plane UI 0.52.0

* Deployment listing and details now include the deployment version number.
* Control Plane logout page now offers a link back to Snowsight
* Searchable select control (used in **Create Runtime** and **Manage Access**) now offers improved behavior when text overflows available space.
* Fixed a bug that temporarily showed duplicate roles when revoking privileges through the **Manage Access** dialog.

### Data Plane Service 0.70.0

* Added support for AWS Data Plane Agent deployments that have DHCP Option Sets configured on the account.
* Allowed customers to delete a runtime and create a new one with the same name shortly thereafter.

## September 5, 2025

### Connectors 2025.9.4.19

* Confluence Connector: Refresh frequency is now set to 1 minute and is no longer exposed as a parameter.

### Runtime Extensions 2025.9.4.20

* Resolved an incompatibility between the Github Registry Client and the latest Jackson release.
* Fixed attribute prefix handling in XML Reader
* Added `StandardProtobufReader` controller service for Protobuf record processing
* `ListTableName` won’t fail the entire FlowFile if partial input is incorrect.

### Runtime Server 2025.9.4.20

* Introduced Runtime UI 0.50.0
* Added a new logout page that provides users options for logging back in or navigating to the Control Plane.
* Enhanced the searchable select control to display options more clearly when text exceeds available space.
* Fixed casing and icon issues when inputting attributes during extension verification.
* Fixed header styling applied to additionalDetails markdown files.

## September 2, 2025

### AWS Data Plane Agent 0.35.0

* Support for AWS Tags with dots in the Tag key.

### Connectors 2025.9.2.16

* MySQL CDC: Always create a new table (and fail if the table already exists) when replication mode is set to `full`.

### Runtime Extensions 2025.9.2.17

* The GitLab Flow Registry Client now supports versioning flows larger than 2MB.
* Fixed issue in the MongoDB Controller Service preventing users to authenticate using X509.
* Fixed irrelevant error logs about schema hash in `UpdateSnowflakeDatabase` processor.
* Confluence: Fixed a bug that prevented users from being added to authorized users even though they had permissions to the space from the group level.
* Fixed `NoSuchElementException` thrown in ChunkText processor and better failure handling with dedicated relationship.
* HubSpot: Fixed bug preventing the List processors to properly go through all the pages.

### Runtime Server 2025.9.2.20

* New Runtime UI 0.48.0.
* Upgraded to latest version of Codemirror and updated usage throughout the application.

## August 28, 2025

### Connectors 2025.8.28.17

* MS SQL CDC Connector: Added support for incremental only mode.
* HubSpot connector: Fixed table creation on invalid object type.

### Runtime Extensions 2025.8.28.19

* Added StandardProtobufReader Controller Service for Protobuf record processing

## August 27, 2025

### AWS Data Plane Agent 0.33.0

* Fixes health checks for Load Balancer Target Groups, so everything shows green in the AWS Console.

### Control Plane Core v0.68.0

* Supports a finer-grained privilege model for deployments and runtimes including MONITOR and OPERATE privileges.

### Control Plane UI v0.51.0

* Supports a finer-grained privilege model for deployments and runtimes including MONITOR and OPERATE privileges.

### Runtime Operator 0.39.0

* Supports a finer-grained privilege model for deployments and runtimes including MONITOR and OPERATE privileges.

## August 26, 2025

### Runtime Extensions 2025.8.26.18

* MS SQL Server: Fixes handling of datetime when used as a primary key.

## August 21, 2025

### Connectors 2025.8.21.16

* PostgreSQL connector: Supports TOASTed values.

### Runtime Extensions 2025.8.21.17

* Uses Google Ads API v21 (Note, v18 is no longer supported).

### Runtime Server 2025.8.21.17

* New Runtime UI 0.47.0.

## August 20, 2025

### Connectors 2025.8.19.17

* Slack connectors: Fixes handling of attachments by appending the File ID to the filename for the files stored in the stage.

### Runtime Extensions 2025.8.20.10

* Adds Google Cloud support to PutSnowpipeStreaming2.
* Adds support for Incremental Only mode in PostgreSQL CDC connector.
* Fixes error when trying to verify configuration in List Azure processors.

### Runtime Server 2025.8.19.18

* Supports unquoted parameter references with spaces in their names within an expression language.

## August 15, 2025

### Control Plane Core 0.64.0

* Resolves an issue that sometimes caused runtime deletion to fail in Snowflake deployments.

### Runtime Operator 0.38.0

* Resolves an issue facilitating runtime autoscaling in Snowflake deployments.

### Runtime Server 2025.8.14.18

* Improves readability in Provenance Event dialog.

## August 13, 2025

### Control Plane Core 0.62.0

* New AWS BYO-VPC deployments now adds the “Private Security Group” to the EKS cluster, making it easier to configure connections to data sources.
* Resolves an issue for new Deployments with a private security group configuration that
  couldn’t pull images from Snowflake over PrivateLink.

### Control Plane UI 0.49.0

* Runtime and Deployment action menus now have separators to help group actions.
* Account roles show in a searchable selection with virtual scrolling.

### Data Plane Service 0.62.0

* Runtime flows no longer disappear after suspend and reactivate due to a conflicting auto scaling operation.

### Runtime Extensions 2025.8.12.20

* Adds FlowFile attributes support for Database and Schema properties in PutSnowflakeInternalStageFile.
* New GetConfluenceSpaces processor.
* PostgreSQL CDC now properly handles DATE, TIME, TIMESTAMP primary keys.

### Runtime Server 2025.8.12.20

* New Runtime UI 0.45.0: Minor improvements to the Component State dialog to improve
  readability of state entries.

## August 12, 2025

### AWS Data Plane Agent 0.32.0

* Fixes issue destroying BYOC deployments that was introduced with 0.29.0.
* Fixes issue from 0.29.0 release where BYOC deployments in AWS Regions with longer names may fail due to IAM Policy length limitations.

## August 7, 2025

### AWS Data Plane Agent 0.30.0

* Upgrades the AMI of EKS nodes when the deployment is upgraded.
* Removes unnecessary IPv6 Security Group rules for ingress and egress.

### Runtime Extensions 2025.8.7.20

* Improves ConsumeKafka by introducing an Inject Offset Output strategy to add a field kafkaOffset to the records.
* Adds the preview tag for Salesforce, Confluence and HubSpot components.
* Better configuration validation in UpdateSnowflakeDatabase to avoid using empty parameters.
* Adds GetConfluencePageContent and GetConfluencePageIds processors for Confluence.
* Fixes UpdateSnowflakeDatabase to properly redirect to the failure relationship when schema is not specified or does not exist.
* Improves error handling of non-authorized calls in HubSpot processors.

### Runtime Server 2025.8.7.20

* New Runtime UI 0.44.0: Improves ConsumeKinesisStream by introducing a schema difference handling strategy to specify how records using the same schema should be grouped.
* Fixes issue in rendering the canvas that surfaced on initial page load.

## August 6, 2025

### Runtime Extensions 2025.8.5.19

* Adds Pipe Info Counter and Channel Error Message to PutSnowpipeStreaming2.
* MySQL connector: Supports enabling the connector in Incremental mode only.
* HubSpot connector: Improves handling of non-supported object types and fixed processing ordering of the events.

### Runtime Server 2025.8.5.19

* New Runtime UI 0.43.0: The Runtime UI now supports labeling extensions in Preview.
  The badge is shown in the create dialog, on the canvas, in the operate palette,
  in the edit dialog, and in listings for extensions not on the canvas.

## August 5, 2025

### AWS Data Plane Agent 0.29.0

* Private deployments: All images and binaries are provided by Snowflake instead of various internet sources.
* Custom Ingress for “Bring Your Own VPC” deployments: Supports enterprise customers
  who use VPNs to access their cloud infrastructure and self-managed TLS certificates.
* Adds end-to-end support for PrivateLink. Previously, data and management communications
  were available over PrivateLink. Now, the deployment can install over PrivateLink, too.

### Control Plane Core 0.60.0

* Adds improvements necessary to support BYOC private deployments.
* Improves handling of outbound grants when transferring ownership of a runtime or deployment.
* Trial accounts are now permitted to use Openflow with relevant parameter enabled.
* Fixes an issue that disrupted use of Control Plane for customers with a large number of Snowflake roles.

### Control Plane UI 0.48.0

* In runtime and deployment listings, more actions in the menus are disabled rather than hidden.
* Removes a link to accept terms. This change prevents problems when the user doesn’t have an active Snowsight session.
* When a new version is detected, prompts the user to reload the CP UI.
* Disallows changing ownership of runtimes in Snowflake deployments.
* Fixes bug that required a Snowflake role, even when the field was hidden.

### Data Plane Service 0.60.0

* Includes improvements necessary to support BYOC private deployments.
* Fixes an issue that disrupted Connector deployment for customers with a large number of Snowflake roles.

### Openflow Runtime Gateway 2025.8.1.14

* Fixes an issue with certificate refresh upon renewal which prevented users from logging into older runtimes.

## July 31, 2025

### Connectors 2025.7.31.17

* Jira: Improved readability of the flow. The scheduling is now exposed via a parameter.

### Runtime Extensions 2025.7.31.18

* Adds File Fragment Size and Count to PutSnowpipeStreaming2.
* Introduces new Confluence processors for the upcoming connector GetConfluenceGroupUsers,
  GetConfluencePagePermissions, GetConfluenceSpacePermissions, ListConfluenceGroups.
* Adds support for TOASTed value in PostgreSQL CDC.
* Fixes initial rendering of canvas when fonts may load slowly.
* Fixes parameter removal in Parameter Contexts owned by a Parameter Provider.

### Runtime Server 2025.7.31.18

* New Runtime UI 0.42.0: Improves formatting in Status History dialog when values are lengthy.

## July 29, 2025

### Runtime Server 2025.7.29.9

* Fixes an issue with scaling that left some nodes in a disconnected state.

## July 24, 2025

### Connectors 2025.7.24.17

* Kafka Connectors: Fixes referenced readers when writing to Iceberg formatted tables.

### Runtime Extensions 2025.7.24.18

* Fixes S3 Location Type in PutSnowpipeStreaming2.

### Runtime Server 2025.7.24.18

* Adds support for users to reset all Counters in a single action.
* Fixes an issue that caused upgrade failure for runtimes with more than 1 node present.

## July 23, 2025

### Control Plane Core 0.58.0

* Adds support for selecting an active role to use in the application, rather than relying on a default role and secondary role inheritance.
* Adds support for considering Snowflake role hierarchy during authorization controls.

### Control Plane UI 0.47.0

* Adds support for selecting an active role to use in the application, rather than relying on a default role and secondary role inheritance.

### Data Plane Service 0.58.0

* Adds support for considering Snowflake role hierarchy during authorization controls.

### Openflow Runtime Gateway 2025.7.22.20

* Adds support for considering Snowflake role hierarchy during authorization controls.

## July 22, 2025

### Runtime Extensions 2025.7.22.19

* A new controller service better supports Slack API rate limits.
* Fixes SnowflakeSignJWT controller service.

## July 16, 2025

### AWS Data Plane Agent 0.25.1

* Fixes upgrades to pull and use the latest host scripts.
  This change enables Openflow to more easily make changes to the agent itself during an upgrade.

## July 15, 2025

### Connectors 2025.7.15.14

* Confluence JIRA connector: Improves type mapping for the JIRA issues.
  Uses the new processor for managing lifecycle of views.
* Slack connectors: Changes defaults for run schedule properties to avoid rate limiting errors.

### Control Plane Core 0.53.0

* Adds support for generating and downloading runtime diagnostic bundles.

### Control Plane UI 0.46.0

* Adds support for generating and downloading runtime diagnostic bundles.

### Data Plane Service 0.53.0

* Adds support for generating and downloading runtime diagnostic bundles.

### Runtime Extensions 2025.7.15.16

* Adds the PutSnowpipeStreaming2 processor using SSv2.

### Runtime Server 2025.7.15.16

* New Runtime UI 0.40.0: Fixes a bug that prevented tooltips from closing on the canvas.

## July 10, 2025

### Control Plane UI 0.45.2

* Adds support for PrivateLink redirects for the Launch Openflow button.
* Fixes an issue where logout doesn’t log the user out if the user revisits soon after.

## July 9, 2025

### Connectors 2025.7.8.14

* PostgresSQL, SQL Server and MySQL Connectors: Change to Journal creation process
  group to remove the false positive error bulletin for PutSnowpipeStreaming
  when it was asked to create channels on non yet existing tables/streams.

### Control Plane Core 0.52.0

* Users must have proper privileges before they can list or view a runtime.

### Control Plane UI 0.45.1

* Fixes a bug that caused runtime and deployment listings not to show and prevented creation of new resources.

### Runtime Extensions 2025.7.9.14

* Git Registry clients have the option to ignore parameter changes when versioning a new version of a flow.
* New HubSpot processor to retrieve the schema of HubSpot objects.
* New processor UpdateSnowflakeView to manage lifecycle of Snowflake views.
* New controller service RemoveFieldRecordReader to drop fields on read.
* Supports PostgreSQL Aurora.
* CaptureChangeSQLServer generates a valid query when the primary key consists of multiple columns.
* UpdateSnowflakeDatabase now checks only column types when required.

### Runtime Server 2025.7.9.14

* New Runtime UI 0.39.0
* Improves colors in canvas for Process Group version control status.
* Improves styling for better alignment with Balto colors.
* Assets are no longer prevented from being re-uploaded in the Manage Assets dialog.
* When using form control to increment a numeric value, output from a dirty Edit Processor form is no longer prevented.

## July 3, 2025

### AWS Data Plane Agent 0.22.2

* Upgrades no longer get stuck when upgrading due to a missing Data Plane UI 0.7.0 image.

## July 1, 2025

### AWS Data Plane Agent 0.22.1

* New deployments no longer fail to install due to mid-handling failure code when
  checking for the presence of AWS ECR repositories.

### Runtime Extensions 2025.7.1.18

* Google Ads: Limits the numbers of calls to Google Ads API when validating the
  components to avoid rate limit errors.

## June 28, 2025

### Runtime Extensions 2025.6.27.21

* Fixes NullPointerException in PutSnowpipeStreaming when empty flow files are being processed
  and Delivery Guarantee is set to `Exactly once`.

## June 27, 2025

### Control Plane Core 0.51.0

* New terms of service flow: Customers can use Control Plane to create
  Snowflake-managed deployments without accepting BYOC and Connector terms.

### Control Plane UI 0.43.0

* New terms of service flow: Customers can use Control Plane to create
  Snowflake-managed deployments without accepting BYOC and Connector terms.

## June 26, 2025

### Connectors 2025.6.26.15

* Kafka Connectors: Ignore column type mismatch in UpdateSnowflakeDatabase for Kafka
  connectors is more resilient in case of issue with schema inference.
* Google Drive & SharePoint Connectors: Improves the flow to avoid a race condition
  where group synchronization kicks off but PERMS_GROUPS has not been created yet
* Kafka Connectors: Warehouse is no longer needed. The corresponding parameter is removed.

### Runtime Extensions 2025.6.26.16

* Tables without primary keys are retried instead of failed.
* New Alter Strategy in UpdateSnowflakeDatabase processor has the option to ignore column type changes.
* Fixes fetching of HubSpot archived records.

## June 24, 2025

### Control Plane Core 0.50.0

* New deployments send status updates to Openflow Control Plane indicating when upgrades are present.
* The PrPr tag is included on some new connectors.

### Control Plane UI 0.42.0

* New deployments now surface when an upgrade is available, with link to documentation.
  earlier deployments can also use this functionality after a migration to a newer version.

### Data Plane Service 0.50.0

* Fixes Create runtime failures where the minimum node count is greater than one.

### Data Plane UI 0.7.0

* Active role now displays in the current user menu.

## June 20, 2025

### AWS Data Plane Agent 0.21.0

* Deployments created with AWS Data Plane Agent 0.20.0 are no longer prevented from
  adopting future updates to EC2 Agent Host scripts.

## June 18, 2025

### AWS Data Plane Agent 0.19.0

* Supports tagging all AWS resources created and managed by Openflow.
  Enables deployments governed by security controls like AWS SCP and cost controls like AWS MAP.

### AWS Data Plane Agent 0.20.0

* New Openflow BYOC deployments and upgrades of existing deployments are no longer blocked by an “Unsupported block type” error.

### Connectors 2025.6.17.15

* JIRA: Multi-projects support flattened views in Snowflake destination.

### Runtime Server 2025.6.17.16

* Process Group metrics are now visible when using the Stateless engine.
* The toolbar renders properly when font size is scaled in the browser settings.
* The UnpackContent shows the TAR option again.

## June 12, 2025

### Connectors 2025.6.12.19

* Google Sheets: Improves failure handling by retrying when ingesting data into Snowflake.
* Workday: Uses TRUNCATE instead of REPLACE when possible on the destination table.
* Sharepoint / Google Drive: Improves failure handling with proper retry / logging in case of failures.
* SQL Server: Prevents stream staleness.
* Box: Properly reflects permissions when groups are removed from files permissions in Box.
* Google Drive (Simple Ingest) - Fixes handling of files being deleted.
* Workday: Fixes clustering configuration to have the first processor run on the primary node only.

### Control Plane UI 0.41.0

* Skeleton loaders are now shown in the deployment and runtime listings when permissions are evaluated.
* Skeleton loaders are now shown in **Create Runtime** and **Add Connector to Runtime** dialogs
  while options are loaded and permissions are evaluated.

### Runtime Extensions 2025.6.12.21

* Adds the possibility to specify multiple projects to fetch JIRA issues when using ‘Simple Search’.
* Improves handling of all fields in the JIRA connector.
  Improves mapping into destination table by using an individual column per field.
* Adds support for the PuPr of Snowflake Structured Maps/Arrays/Objects.
* Google Sheets connector now supports Boolean and numbers to be used in the same column.
* MySQL: Properly handles a changes in the column filtering parameter during replication.
* MySQL: Fixes potential connection leakage when being disconnected from the binlog.
* SQL Server: Fixes column ordering handling in the Journal Log table.

### Runtime Server 2025.6.12.21

* Error reporting now shows in banners instead of toast notifications.
* Adds support for different ranges in the Status History dialog by selecting different start timestamps.
* Introduces a Process Group column to the Parameter Context table to more efficiently see bound Process Groups.

## June 8, 2025

### Runtime Extensions 2025.6.6.16

* Upgrades Snowflake JDBC Driver to 3.24.2
* Resolves an issue that prevented newer runtimes from installing the latest Microsoft Dataverse Connector.
* Removes Microsoft SQL Server replication of logical databases.

### Runtime Gateway 2025.6.8.2

* Adds support for logging in to Openflow runtimes using role names with dashes.

### Runtime Server 2025.6.6.19

* Adds pre-configured version control support for custom flows.
* Gracefully shuts down processors and controller services for stateless process groups.

## May 31, 2025

### Runtime Extensions 2025.5.31.15

* Add kafka.max.offset attribute to Records produced by ConsumeKafka

---
title: SnowflakeConnectionService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/snowflakeconnectionservice.md
section: Loading & Unloading Data
---

# SnowflakeConnectionService

## Description

Provides pooled database connections to Snowflake services

## Tags

connection, database, jdbc, openflow, snowflake

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Account \* | Account |  |  | Snowflake Account Identifier with Organization Name and Account Name formatted as [organization-name]-[account-name] |
| Authentication Strategy \* | Authentication Strategy | PASSWORD | * Password * Key Pair * Snowflake Session Token | Strategy for authenticating Snowflake connections |
| Connection Strategy \* | Connection Strategy | STANDARD | * Standard * Private Connectivity | Strategy for connecting to Snowflake services |
| Connection Timeout \* | Connection Timeout | 30 seconds |  | Maximum amount of time to wait for a connection from a reusable pool |
| Database Name | Database Name |  |  | Default Snowflake Database for connections |
| Idle Timeout \* | Idle Timeout | 10 minutes |  | Maximum amount of time for a connection to remain idle in a reusable pool |
| Maximum Connections \* | Maximum Connections | 10 |  | Maximum number of connections created and managed in a reusable pool |
| Maximum Lifetime \* | Maximum Lifetime | 30 minutes |  | Maximum lifetime for each connection in a reusable pool |
| Password \* | Password |  |  | Snowflake Password for authenticating connections |
| Private Key Service \* | Private Key Service |  |  | RSA Private Key Service for authenticating connections |
| Role | Role |  |  | Default Snowflake Role for connections |
| Schema | Schema |  |  | Default Snowflake Schema for connections |
| User \* | User |  |  | Snowflake User for authenticating connections |
| Warehouse | Warehouse |  |  | Default Snowflake Warehouse for connections |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SnowflakeDatabaseDialectService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/snowflakedatabasedialectservice.md
section: Loading & Unloading Data
---

# SnowflakeDatabaseDialectService

## Description

Database Dialect Service supporting Snowflake. Supported Statement Types: ALTER, CREATE, SELECT, UPSERT (MERGE INTO)

## Tags

Database, JDBC, Relational, SQL

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SnowflakeDetectDuplicate 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/snowflakedetectduplicate.md
section: Loading & Unloading Data
---

# SnowflakeDetectDuplicate 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Checks if a FlowFile ‘s hash (provided as a FlowFile attribute) is already in a Snowflake table, and routes the FlowFile to’ duplicate ‘if found,’distinct ‘if not found, or’ failure’ on errors.

## Tags

database, detect, duplicates, hash, snowflake

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Content Hash | The name of the FlowFile attribute that holds the pre-computed hash. Supports Expression Language. |
| Document Source Identifier | Specifies the document source identifier (doc ID). Supports Expression Language. |
| Document Source Name | Specifies the document source system name. Supports Expression Language. |
| Snowflake Connection Service | The DBCPService that provides connection to Snowflake. |
| Snowflake Table Name | The Snowflake table name that stores the file hashes. The table name is case-insensitive. Database and schema must be configured prior in the Snowflake Connection Service. |

## Relationships

| Name | Description |
| --- | --- |
| distinct | FlowFiles that do not match an existing document are routed here (new hash inserted). |
| duplicate | FlowFiles that match an existing document (same hash) are routed here. |
| failure | FlowFiles that encounter an error or exception during processing are routed here. |

## Writes attributes

| Name | Description |
| --- | --- |
| snowflake.detect.duplicate | A ‘true’ or ‘false’ attribute indicating if the FlowFile was detected as a duplicate. |

---
title: SnowflakeSignJWTService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/snowflakesignjwtservice.md
section: Loading & Unloading Data
---

# SnowflakeSignJWTService

## Description

Provides OAuth2 access token using a JWT signed with a secret stored in Snowflake. The JWT is signed using the SYSTEM$SIGN_JWT_USING_SECRET function, which requires a valid Snowflake connection.

## Tags

jwt, preview, snowflake

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Audience \* | Audience |  |  | The audience claim (aud) for the JWT. |
| Connection Pooling Service \* | Connection Pooling Service |  |  | The Connection Pooling Service that is used to obtain a connection to the database |
| JWT Expiration Time \* | JWT Expiration Time | 5 minutes |  | Expiration time used to set the corresponding claim of the JWT. |
| Snowflake Secret Name \* | Snowflake Secret Name |  |  | Name of the JWT Key Pair secret in Snowflake that will be used to sign the JWT. |
| Subject \* | Subject |  |  | The subject claim (sub) for the JWT. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SnowflakeTableSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/snowflaketableschemaregistry.md
section: Loading & Unloading Data
---

# SnowflakeTableSchemaRegistry

## Description

Uses Snowflake tables as the source of schema — utilises Snowpipe Streaming REST API. Requires a fully qualified table name as the schema name.

## Tags

openflow, registry, schema, snowflake

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Account \* | Account |  |  | Snowflake Account Identifier with Organization Name and Account Name formatted as [organization-name]-[account-name] |
| Private Key Service \* | Private Key Service |  |  | RSA Private Key Service for authenticating connections |
| User \* | User |  |  | Snowflake User for authenticating connections |
| Web Client Service Provider \* | Web Client Service Provider |  |  | Web Client Service Provider to make connections |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SplitAvro 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitavro.md
section: Loading & Unloading Data
---

# SplitAvro 2025.10.9.21

## Bundle

org.apache.nifi | nifi-avro-nar

## Description

Splits a binary encoded Avro datafile into smaller files based on the configured Output Size. The Output Strategy determines if the smaller files will be Avro datafiles, or bare Avro records with metadata in the FlowFile attributes. The output will always be binary encoded.

## Tags

avro, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Output Size | The number of Avro records to include per split file. In cases where the incoming file has less records than the Output Size, or when the total number of records does not divide evenly by the Output Size, it is possible to get a split file with less records. |
| Output Strategy | Determines the format of the output. Either Avro Datafile, or bare record. Bare record output is only intended for use with systems that already require it, and shouldn’t be needed for normal use. |
| Split Strategy | The strategy for splitting the incoming datafile. The Record strategy will read the incoming datafile by de-serializing each record. |
| Transfer Metadata | Whether or not to transfer metadata from the parent datafile to the children. If the Output Strategy is Bare Record, then the metadata will be stored as FlowFile attributes, otherwise it will be in the Datafile header. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile is not valid Avro), it will be routed to this relationship |
| original | The original FlowFile that was split. If the FlowFile fails processing, nothing will be sent to this relationship |
| split | All new files split from the original FlowFile will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

---
title: SplitContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitcontent.md
section: Loading & Unloading Data
---

# SplitContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits incoming FlowFiles by a specified byte sequence

## Tags

binary, content, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Byte Sequence | A representation of bytes to look for and upon which to split the source file into separate files |
| Byte Sequence Format | Specifies how the <Byte Sequence> property should be interpreted |
| Byte Sequence Location | If <Keep Byte Sequence> is set to true, specifies whether the byte sequence should be added to the end of the first split or the beginning of the second; if <Keep Byte Sequence> is false, this property is ignored. |
| Keep Byte Sequence | Determines whether or not the Byte Sequence should be included with each Split |

## Relationships

| Name | Description |
| --- | --- |
| original | The original file |
| splits | All Splits will be routed to the splits relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)

---
title: SplitExcel 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitexcel.md
section: Loading & Unloading Data
---

# SplitExcel 2025.10.9.21

## Bundle

org.apache.nifi | nifi-poi-nar

## Description

This processor splits a multi sheet Microsoft Excel spreadsheet into multiple Microsoft Excel spreadsheets where each sheet from the original file is converted to an individual spreadsheet in its own flow file. Currently this processor is only capable of processing .xlsx (XSSF 2007 OOXML file format) Excel documents and not older .xls (HSSF ‘97(-2007) file format) documents. Please note all original cell styles are dropped and formulas are removed leaving only the calculated values. Even a single sheet Microsoft Excel spreadsheet is converted to its own flow file with all the original cell styles dropped and formulas removed.

## Tags

split, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Password | The password for a password protected Excel spreadsheet |
| Protection Type | Specifies whether an Excel spreadsheet is protected by a password or not. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship. |
| original | The original FlowFile that was split into segments. If the FlowFile fails processing, nothing will be sent to this relationship |
| split | The individual Excel ‘segments’ of the original Excel FlowFile will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All split Excel FlowFiles produced from the same parent Excel FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split Excel FlowFiles that were created from a single parent Excel FlowFile |
| fragment.count | The number of split Excel FlowFiles generated from the parent Excel FlowFile |
| segment.original.filename | The filename of the parent Excel FlowFile |
| sheetname | The name of the Excel sheet from the original spreadsheet. |
| total.rows | The number of rows in the Excel sheet from the original spreadsheet. |

---
title: SplitJson 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitjson.md
section: Loading & Unloading Data
---

# SplitJson 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits a JSON File into multiple, separate FlowFiles for an array element specified by a JsonPath expression. Each generated FlowFile is comprised of an element of the specified array and transferred to relationship ‘split,’ with the original file transferred to the ‘original’ relationship. If the specified JsonPath is not found or does not evaluate to an array element, the original file is routed to ‘failure’ and no files are generated.

## Tags

json, jsonpath, split

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| JsonPath Expression | A JsonPath expression that indicates the array element to split into JSON/scalar fragments. |
| Max String Length | The maximum allowed length of a string value when parsing the JSON document |
| Null Value Representation | Indicates the desired representation of JSON Path expressions resulting in a null value. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile is not valid JSON or the specified path does not exist), it will be routed to this relationship |
| original | The original FlowFile that was split into segments. If the FlowFile fails processing, nothing will be sent to this relationship |
| split | All segments of the original FlowFile will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

---
title: SplitRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitrecord.md
section: Loading & Unloading Data
---

# SplitRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits up an input FlowFile that is in a record-oriented data format into multiple smaller FlowFiles

## Tags

avro, csv, freeform, generic, json, log, logs, schema, split, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Record Writer | Specifies the Controller Service to use for writing out the records |
| Records Per Split | Specifies how many records should be written to each ‘split’ or ‘segment’ FlowFile |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship. |
| original | Upon successfully splitting an input FlowFile, the original FlowFile will be sent to this relationship. |
| splits | The individual ‘segments’ of the original FlowFile will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer for the FlowFiles routed to the ‘splits’ Relationship. |
| record.count | The number of records in the FlowFile. This is added to FlowFiles that are routed to the ‘splits’ Relationship. |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

---
title: SplitText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splittext.md
section: Loading & Unloading Data
---

# SplitText 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits a text file into multiple smaller text files on line boundaries limited by maximum number of lines or total size of fragment. Each output split file will contain no more than the configured number of lines or bytes. If both Line Split Count and Maximum Fragment Size are specified, the split occurs at whichever limit is reached first. If the first line of a fragment exceeds the Maximum Fragment Size, that line will be output in a single split file which exceeds the configured maximum size limit. This component also allows one to specify that each split should include a header lines. Header lines can be computed by either specifying the amount of lines that should constitute a header or by using header marker to match against the read lines. If such match happens then the corresponding line will be treated as header. Keep in mind that upon the first failure of header marker match, no more matches will be performed and the rest of the data will be parsed as regular lines for a given split. If after computation of the header there are no more data, the resulting split will consists of only header lines.

## Tags

split, text

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Header Line Count | The number of lines that should be considered part of the header; the header lines will be duplicated to all split files |
| Header Line Marker Characters | The first character(s) on the line of the datafile which signifies a header line. This value is ignored when Header Line Count is non-zero. The first line not containing the Header Line Marker Characters and all subsequent lines are considered non-header |
| Line Split Count | The number of lines that will be added to each split file, excluding header lines. A value of zero requires Maximum Fragment Size to be set, and line count will not be considered in determining splits. |
| Maximum Fragment Size | The maximum size of each split file, including header lines. NOTE: in the case where a single line exceeds this property (including headers, if applicable), that line will be output in a split of its own which exceeds this Maximum Fragment Size setting. |
| Remove Trailing Newlines | Whether to remove newlines at the end of each split file. This should be false if you intend to merge the split files later. If this is set to ‘true’ and a FlowFile is generated that contains only ‘empty lines’ (i.e., consists only of r and n characters), the FlowFile will not be emitted. Note, however, that if header lines are specified, the resultant FlowFile will never be empty as it will consist of the header lines, so a FlowFile may be emitted that contains only the header lines. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a file cannot be split for some reason, the original file will be routed to this destination and nothing will be routed elsewhere |
| original | The original input file will be routed to this destination when it has been successfully split into 1 or more files |
| splits | The split files will be routed to this destination when an input file is successfully split into 1 or more split files |

## Writes attributes

| Name | Description |
| --- | --- |
| text.line.count | The number of lines of text from the original FlowFile that were copied to this FlowFile |
| fragment.size | The number of bytes from the original FlowFile that were copied to this FlowFile, including header, if applicable, which is duplicated in each split FlowFile |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)

---
title: SplitXml 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/splitxml.md
section: Loading & Unloading Data
---

# SplitXml 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Splits an XML File into multiple separate FlowFiles, each comprising a child or descendant of the original root element

## Tags

split, xml

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Split Depth | Indicates the XML-nesting depth to start splitting XML fragments. A depth of 1 means split the root ‘s children, whereas a depth of 2 means split the root’s children’s children and so forth. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile is not valid XML), it will be routed to this relationship |
| original | The original FlowFile that was split into segments. If the FlowFile fails processing, nothing will be sent to this relationship |
| split | All segments of the original FlowFile will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| fragment.identifier | All split FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the split FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of split FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile |

---
title: StandardAnthropicLLMService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardanthropicllmservice.md
section: Loading & Unloading Data
---

# StandardAnthropicLLMService

## Description

A Controller Service that provides integration with Anthropic’s Claude AI models through their Messages API. Supports configurable parameters including model selection, response generation settings (temperature, top_p, top_k), token limits, and retry behavior.

## Tags

ai, anthropic, api, claude, language model, llm, openflow

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Anthropic API Key \* | Anthropic API Key |  |  | The API Key for authenticating to Anthropic |
| Backoff Base Delay (ms) \* | Backoff Base Delay (ms) | 1000 |  | The base delay in milliseconds for exponential backoff between retries |
| Max Response Tokens \* | Max Response Tokens | 1000 |  | The maximum number of tokens to generate in the response. |
| Max Retries \* | Max Retries | 3 |  | The maximum number of retry attempts for API calls |
| Model Name \* | Model Name | claude-3-5-sonnet-latest |  | The name of the Anthropic model |
| Temperature | Temperature |  |  | The temperature to use for generating the response. |
| Top K | Top K |  |  | The top K value to use for generating the response. Only sample from the top K options for each subsequent token. Recommended for advanced use cases only. You usually only need to use temperature. |
| Top P | Top P |  |  | The top_p value for nucleus sampling. It controls the diversity of the generated responses. |
| User ID | User ID |  |  | The user id to set in the request metadata |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with the LLM provider. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardAtlassianRequestRateManager
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardatlassianrequestratemanager.md
section: Loading & Unloading Data
---

# StandardAtlassianRequestRateManager

## Description

Provides rate limiting coordination for Atlassian API calls across processors to prevent cascading rate limit issues. Throttles when limit is reached (HTTP 429).

## Tags

api, atlassian, confluence, jira, limit, openflow, rate

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardAzureCredentialsControllerService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardazurecredentialscontrollerservice.md
section: Loading & Unloading Data
---

# StandardAzureCredentialsControllerService

## Description

Provide credentials to use with an Azure client.

## Tags

azure, credentials, provider, security, session

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Credential Configuration Strategy \* | Credential Configuration Strategy | default-credential | * Default Credential * Managed Identity |  |
| Managed Identity Client ID | Managed Identity Client ID |  |  | Client ID of the managed identity. The property is required when User Assigned Managed Identity is used for authentication. It must be empty in case of System Assigned Managed Identity. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardConfluenceClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardconfluenceclientservice.md
section: Loading & Unloading Data
---

# StandardConfluenceClientService

## Description

Provides connection service to Confluence APIs

## Tags

Preview, atlassian, confluence

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API Token \* | API Token |  |  | Token used for API authentication |
| Environment URL \* | Environment URL |  |  | URL to the Atlassian Confluence Environment ie. <https://domain.atlassian.net> |
| Request Rate Manager \* | Request Rate Manager |  |  | Controller service for keeping track of rate limits for Atlassian APIs |
| User Email \* | User Email |  |  | Confluence user email |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with Confluence |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardDatabricksWorkspaceClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standarddatabricksworkspaceclientservice.md
section: Loading & Unloading Data
---

# StandardDatabricksWorkspaceClientService

## Description

Databricks client.

## Tags

databricks, openflow

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Authentication Method \* | Authentication Method | OAUTH_M2M | * OAuth M2M * PAT | Method to authenticate with Databricks |
| OAuth Client ID \* | OAuth Client ID |  |  | Databricks OAuth Client ID, also known as an application ID |
| OAuth Client Secret \* | OAuth Client Secret |  |  | Databricks Service Principal’s OAuth Client Secret. |
| Personal Access Token \* | Personal Access Token |  |  | Databricks Personal Access Token |
| Workspace ID \* | Workspace ID |  |  | Databricks Workspace ID |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardDropboxCredentialService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standarddropboxcredentialservice.md
section: Loading & Unloading Data
---

# StandardDropboxCredentialService

## Description

Defines credentials for Dropbox processors.

## Tags

credentials, dropbox, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Access Token \* | Access Token |  |  | Access Token of the user’s Dropbox app. See Additional Details for more information about Access Token generation. |
| App Key \* | App Key |  |  | App Key of the user’s Dropbox app. See Additional Details for more information. |
| App Secret \* | App Secret |  |  | App Secret of the user’s Dropbox app. See Additional Details for more information. |
| Refresh Token \* | Refresh Token |  |  | Refresh Token of the user’s Dropbox app. See Additional Details for more information about Refresh Token generation. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardFileResourceService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardfileresourceservice.md
section: Loading & Unloading Data
---

# StandardFileResourceService

## Description

Provides a file resource for other components. The file needs to be available locally by Nifi (e.g. local disk or mounted storage). NiFi needs to have read permission to the file.

## Tags

file, resource

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| File Path \* | file-path | ${absolute.path}/${filename} |  | Path to a file that can be accessed locally. |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardHashiCorpVaultClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardhashicorpvaultclientservice.md
section: Loading & Unloading Data
---

# StandardHashiCorpVaultClientService

## Description

A controller service for interacting with HashiCorp Vault.

## Tags

client, hashicorp, vault

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Configuration Strategy \* | configuration-strategy | direct-properties | * Direct Properties * Properties Files | Specifies the source of the configuration properties. |
| Vault Authentication \* | vault.authentication | TOKEN | * TOKEN * APPID * APPROLE * AWS_EC2 * AZURE * CERT * CUBBYHOLE * KUBERNETES | Vault authentication method, as described in the Spring Vault Environment Configuration documentation (<https://docs.spring.io/spring-vault/docs/2.3.x/reference/html/#vault.core.environment-vault-configuration>). |
| Connection Timeout \* | vault.connection.timeout | 5 sec |  | The connection timeout for the HashiCorp Vault client |
| Vault Properties Files \* | vault.properties.files |  |  | A comma-separated list of files containing HashiCorp Vault configuration properties, as described in the Spring Vault Environment Configuration documentation (<https://docs.spring.io/spring-vault/docs/2.3.x/reference/html/#vault.core.environment-vault-configuration>). All of the Spring property keys and authentication-specific property keys are supported. |
| Read Timeout \* | vault.read.timeout | 15 sec |  | The read timeout for the HashiCorp Vault client |
| SSL Context Service | vault.ssl.context.service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections to the HashiCorp Vault server. |
| Vault URI \* | vault.uri |  |  | The URI of the HashiCorp Vault server (e.g., <http://localhost:8200>). Required if not specified in the Bootstrap HashiCorp Vault Configuration File. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardHttpContextMap
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardhttpcontextmap.md
section: Loading & Unloading Data
---

# StandardHttpContextMap

## Description

Provides the ability to store and retrieve HTTP requests and responses external to a Processor, so that multiple Processors can interact with the same HTTP request.

## Tags

http, request, response

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Maximum Outstanding Requests \* | Maximum Outstanding Requests | 5000 |  | The maximum number of HTTP requests that can be outstanding at any one time. Any attempt to register an additional HTTP Request will cause an error |
| Request Expiration \* | Request Expiration | 1 min |  | Specifies how long an HTTP Request should be left unanswered before being evicted from the cache and being responded to with a Service Unavailable status code |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardHubSpotClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardhubspotclientservice.md
section: Loading & Unloading Data
---

# StandardHubSpotClientService

## Description

HubSpot Controller Service to integrate with HubSpot HTTP api.

## Tags

Preview, hubSpot

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| HubSpot Access Token \* | HubSpot Access Token |  |  | HubSpot Access Token |
| Web Client Service Provider \* | Web Client Service Provider |  |  | The Web Client Service to use for communicating with HubSpot |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardJsonSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardjsonschemaregistry.md
section: Loading & Unloading Data
---

# StandardJsonSchemaRegistry

## Description

Provides a service for registering and accessing JSON schemas. One can register a schema as a dynamic property where ‘name’ represents the schema name and ‘value’ represents the textual representation of the actual schema following the syntax and semantics of the JSON Schema format. Empty schemas and schemas only consisting of whitespace are not acceptable schemas. The registry is heterogeneous registry as it can store schemas of different schema draft versions. By default the registry is configured to store schemas of Draft 2020-12. When a schema is added, the version which is currently is set, is what the schema is saved as.

## Tags

json, registry, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| JSON Schema Version \* | JSON Schema Version | DRAFT_2020_12 | * Draft 4 * Draft 6 * Draft 7 * Draft 2019-09 * Draft 2020-12 | The JSON schema specification |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardKustoIngestService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardkustoingestservice.md
section: Loading & Unloading Data
---

# StandardKustoIngestService

## Description

Sends batches of flowfile content or stream flowfile content to an Azure ADX cluster.

## Tags

ADX, Azure, Data, Explorer, Kusto, azure, ingest

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Application Client ID \* | Application Client ID |  |  | Azure Data Explorer Application Client Identifier for Authentication |
| Application Key \* | Application Key |  |  | Azure Data Explorer Application Key for Authentication |
| Application Tenant ID \* | Application Tenant ID |  |  | Azure Data Explorer Application Tenant Identifier for Authentication |
| Authentication Strategy \* | Authentication Strategy | MANAGED_IDENTITY | * Application Credentials * Managed Identity * Azure CLI (Dev Only) | Authentication method for access to Azure Data Explorer |
| Cluster URI \* | Cluster URI |  |  | Azure Data Explorer Cluster URI |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardKustoQueryService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardkustoqueryservice.md
section: Loading & Unloading Data
---

# StandardKustoQueryService

## Description

Standard implementation of Kusto Query Service for Azure Data Explorer

## Tags

ADX, Azure, Data, Explorer, Kusto

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Application Client ID \* | Application Client ID |  |  | Azure Data Explorer Application Client Identifier for Authentication |
| Application Key \* | Application Key |  |  | Azure Data Explorer Application Key for Authentication |
| Application Tenant ID \* | Application Tenant ID |  |  | Azure Data Explorer Application Tenant Identifier for Authentication |
| Authentication Strategy \* | Authentication Strategy | MANAGED_IDENTITY | * Application Credentials * Managed Identity * Azure CLI (Dev Only) | Authentication method for access to Azure Data Explorer |
| Cluster URI \* | Cluster URI |  |  | Azure Data Explorer Cluster URI |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardMilvusConnectionService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardmilvusconnectionservice.md
section: Loading & Unloading Data
---

# StandardMilvusConnectionService

## Description

Provides connection service to a Milvus instance

## Tags

connection, database, milvus, openflow, vector

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API Key \* | API Key |  |  | Milvus API Key for authenticating connections |
| Authentication Strategy \* | Authentication Strategy | PASSWORD | * Password * API Key | Strategy for authenticating Milvus connections |
| Connection Timeout \* | Connection Timeout | 30 seconds |  | Maximum amount of time to wait for a connection from a reusable pool |
| Idle Timeout \* | Idle Timeout | 10 minutes |  | Maximum amount of time for a connection to remain idle in a reusable pool |
| Password \* | Password |  |  | Milvus password for authenticating connections |
| SSL Context Service | SSL Context Service |  |  | The SSL Context Service used to provide client certificate information for TLS/SSL connections. |
| Service URI \* | Service URI |  |  | The URI to use to communicate with Milvus |
| User \* | User |  |  | Milvus username for authenticating connections |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardOauth2AccessTokenProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardoauth2accesstokenprovider.md
section: Loading & Unloading Data
---

# StandardOauth2AccessTokenProvider

## Description

Provides OAuth 2.0 access tokens that can be used as Bearer authorization header in HTTP requests. Can use either Resource Owner Password Credentials Grant or Client Credentials Grant. Client authentication can be done with either HTTP Basic authentication or in the request body.

## Tags

access token, authorization, http, oauth2, provider

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Audience | Audience |  |  | Audience for the access token request defined in RFC 8693 Section 2.1 |
| Authorization Server URL \* | Authorization Server URL |  |  | The URL of the authorization server that issues access tokens. |
| Client Authentication Strategy \* | Client Authentication Strategy | REQUEST_BODY | * REQUEST_BODY * BASIC_AUTHENTICATION | Strategy for authenticating the client against the OAuth2 token provider service. |
| Client ID | Client ID |  |  |  |
| Client secret \* | Client secret |  |  |  |
| Grant Type \* | Grant Type | password | * User Password * Client Credentials * Refresh Token | The OAuth2 Grant Type to be used when acquiring an access token. |
| HTTP Protocols \* | HTTP Protocols | H2_HTTP_1_1 | * http/1.1 * h2 http/1.1 * h2 | HTTP Protocols supported for Application Layer Protocol Negotiation with TLS |
| Password \* | Password |  |  | Password for the username on the service that is being accessed. |
| Refresh Token \* | Refresh Token |  |  | Refresh Token supports retrieving a new Access Token when configured |
| Refresh Window \* | Refresh Window | 0 s |  | The service will attempt to refresh tokens expiring within the refresh window, subtracting the configured duration from the token expiration. |
| Resource | Resource |  |  | Resource URI for the access token request defined in RFC 8707 Section 2 |
| SSL Context Service | SSL Context Service |  |  |  |
| Scope | Scope |  |  | Space-delimited, case-sensitive list of scopes of the access request (as per the OAuth 2.0 specification) |
| Username \* | Username |  |  | Username on the service that is being accessed. |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardOCRService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardocrservice.md
section: Loading & Unloading Data
---

# StandardOCRService

## Description

Provides integration to Openflow OCR Service

## Tags

extract, image, ocr, openflow, tesseract, text

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Communications Timeout \* | Communications Timeout | 60 secs |  | The amount of time to wait for a response from the OCR Service. |
| Custom Service URL \* | Custom Service URL |  |  | The Custom URL of the Openflow Tesseract OCR Service. |
| OCR Languages \* | OCR Languages | ENGLISH |  | The Languages to use when performing OCR if none are provided by the caller.This is a commma separated list of the following Valid Values:ENGLISH, KOREAN, KOREAN_VERT, HEBREW |
| Service Location Strategy \* | Service Location Strategy | Default | * Default * Custom | Determines how Service Locations configured within this Controller for the Openflow Tesseract OCR Service. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardOpenAILLMService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardopenaillmservice.md
section: Loading & Unloading Data
---

# StandardOpenAILLMService

## Description

A Controller Service that provides integration with OpenAI’s Chat Completion API. Supports configurable parameters including model selection, temperature, top_p, max tokens, and retry behavior. Handles API authentication, request retries with exponential backoff, and error handling.

## Tags

ai, chat completion, chatgpt, large language model, llm, openai, openflow

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Backoff Base Delay (ms) \* | Backoff Base Delay (ms) | 1000 |  | The base delay in milliseconds for exponential backoff between retries |
| Max Response Tokens | Max Response Tokens |  |  | The maximum number of tokens to generate in the response. |
| Max Retries \* | Max Retries | 3 |  | The maximum number of retry attempts for API calls |
| Model Name \* | Model Name | gpt-4o-mini |  | The name of the OpenAI model. |
| OpenAI API Key \* | OpenAI API Key |  |  | The API Key for authenticating to OpenAI. |
| Seed | Seed |  |  | The seed to use for generating the response |
| Temperature | Temperature |  |  | The temperature to use for generating the response. |
| Top P | Top P |  |  | The top_p value for nucleus sampling. It controls the diversity of the generated responses. |
| User | User |  |  | Your end user, sent to OpenAI for monitoring and detection of abuse |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with the LLM provider. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardPGPPrivateKeyService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardpgpprivatekeyservice.md
section: Loading & Unloading Data
---

# StandardPGPPrivateKeyService

## Description

PGP Private Key Service provides Private Keys loaded from files or properties

## Tags

Encryption, GPG, Key, OpenPGP, PGP, Private, RFC 4880

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Key Password \* | key-password |  |  | Password used for decrypting Private Keys |
| Keyring | keyring |  |  | PGP Keyring or Secret Key encoded in ASCII Armor |
| Keyring File | keyring-file |  |  | File path to PGP Keyring or Secret Key encoded in binary or ASCII Armor |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardPGPPublicKeyService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardpgppublickeyservice.md
section: Loading & Unloading Data
---

# StandardPGPPublicKeyService

## Description

PGP Public Key Service providing Public Keys loaded from files

## Tags

Encryption, GPG, Key, OpenPGP, PGP, Private, RFC 4880

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Keyring | keyring |  |  | PGP Keyring or Public Key encoded in ASCII Armor |
| Keyring File | keyring-file |  |  | File path to PGP Keyring or Public Key encoded in binary or ASCII Armor |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardPrivateKeyService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardprivatekeyservice.md
section: Loading & Unloading Data
---

# StandardPrivateKeyService

## Description

Private Key Service provides access to a Private Key loaded from configured sources

## Tags

PEM, PKCS8

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Key | key |  |  | Private Key structured using PKCS8 and encoded as PEM |
| Key File | key-file |  |  | File path to Private Key structured using PKCS8 and encoded as PEM |
| Key Password | key-password |  |  | Password used for decrypting Private Keys |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardProtobufReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardprotobufreader.md
section: Loading & Unloading Data
---

# StandardProtobufReader

## Description

Parses Protocol Buffers messages from binary format into NiFi Records. Supports multiple schema access strategies including inline schema text, schema registry lookup, and schema reference readers. Protobuf reader needs to know the Proto schema message name in order to deserialize the binary payload correctly. The name of this message can be determined statically using ‘Message Name’ property, or dynamically, using a Message Name Resolver service.

## Tags

parser, protobuf, reader, record

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Message Name \* | Message Name |  |  | Fully qualified name of the Protocol Buffers message including its package (eg. mypackage.MyMessage). |
| Message Name Resolution Strategy \* | Message Name Resolution Strategy | MESSAGE_NAME_PROPERTY | * Message Name Property * Message Name Resolver | Strategy for determining the Protocol Buffers message name for processing |
| Message Name Resolver \* | Message Name Resolver |  |  | Service that dynamically resolves Protocol Buffer message names from FlowFile content or attributes |
| Schema Access Strategy \* | Schema Access Strategy | schema-name | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text \* | Schema Text | ${proto.schema} |  | The text of a Proto 3 formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardProxyConfigurationService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardproxyconfigurationservice.md
section: Loading & Unloading Data
---

# StandardProxyConfigurationService

## Description

Provides a set of configurations for different NiFi components to use a proxy server.

## Tags

Proxy

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Proxy Server Host | proxy-server-host |  |  | Proxy server hostname or ip-address. |
| Proxy Server Port | proxy-server-port |  |  | Proxy server port number. |
| Proxy Type \* | proxy-type | DIRECT | * DIRECT * HTTP * SOCKS | Proxy type. |
| Proxy User Name | proxy-user-name |  |  | The name of the proxy client for user authentication. |
| Proxy User Password | proxy-user-password |  |  | The password of the proxy client for user authentication. |
| SOCKS Version \* | socks-version | SOCKS5 | * SOCKS4 * SOCKS5 | SOCKS Protocol Version |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardRestrictedSSLContextService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardrestrictedsslcontextservice.md
section: Loading & Unloading Data
---

# StandardRestrictedSSLContextService

## Description

Restricted implementation of the SSLContextService. Provides the ability to configure keystore and/or truststore properties once and reuse that configuration throughout the application, but only allows a restricted set of TLS/SSL protocols to be chosen (no SSL protocols are supported). The set of protocols selectable will evolve over time as new protocols emerge and older protocols are deprecated. This service is recommended over StandardSSLContextService if a component doesn’t expect to communicate with legacy systems since it is unlikely that legacy systems will support these protocols.

## Tags

certificate, jks, keystore, p12, pkcs, pkcs12, secure, ssl, tls, truststore

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Keystore Filename | Keystore Filename |  |  | The fully-qualified filename of the Keystore |
| Keystore Password | Keystore Password |  |  | The password for the Keystore |
| Keystore Type | Keystore Type |  | * BCFKS * PKCS12 * JKS | The Type of the Keystore |
| TLS Protocol | SSL Protocol | TLS | * TLS * TLSv1.3 * TLSv1.2 | TLS Protocol Version for encrypted connections. Supported versions depend on the specific version of Java used. |
| Truststore Filename | Truststore Filename |  |  | The fully-qualified filename of the Truststore |
| Truststore Password | Truststore Password |  |  | The password for the Truststore |
| Truststore Type | Truststore Type |  | * BCFKS * PKCS12 * JKS | The Type of the Truststore |
| Key Password | key-password |  |  | The password for the key. If this is not specified, but the Keystore Filename, Password, and Type are specified, then the Keystore Password will be assumed to be the same as the Key Password. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardS3EncryptionService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standards3encryptionservice.md
section: Loading & Unloading Data
---

# StandardS3EncryptionService

## Description

Adds configurable encryption to S3 Put and S3 Fetch operations.

## Tags

aws, decrypt, decryption, encrypt, encryption, key, s3, service

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Encryption Strategy \* | Encryption Strategy | NONE | * None * Server-side S3 * Server-side KMS * Server-side Customer Key * Client-side KMS * Client-side Customer Key | Strategy to use for S3 data encryption and decryption. |
| KMS Region | KMS Region | us-west-2 | * AWS GovCloud (US) * AWS GovCloud (US-East) * US East (N. Virginia) * US East (Ohio) * US West (N. California) * US West (Oregon) * EU (Ireland) * EU (London) * EU (Paris) * EU (Frankfurt) * EU (Zurich) * EU (Stockholm) * EU (Milan) * EU (Spain) * Asia Pacific (Hong Kong) * Asia Pacific (Taipei) * Asia Pacific (Mumbai) * Asia Pacific (Hyderabad) * Asia Pacific (Singapore) * Asia Pacific (Sydney) * Asia Pacific (Jakarta) * Asia Pacific (Melbourne) * Asia Pacific (Malaysia) * Asia Pacific (Thailand) * Asia Pacific (Tokyo) * Asia Pacific (Seoul) * Asia Pacific (Osaka) * South America (Sao Paulo) * China (Beijing) * China (Ningxia) * Canada (Central) * Canada West (Calgary) * Middle East (UAE) * Middle East (Bahrain) * Africa (Cape Town) * US ISO East * US ISOB East (Ohio) * US ISO West * US ISOF East1 (California) * US ISOF South1 (Alpine) * Israel (Tel Aviv) * Mexico (Central) * EU ISOE West | The Region of the AWS Key Management Service. Only used in case of Client-side KMS. |
| Key ID or Key Material | Key ID or Key Material |  |  | For None and Server-side S3: not used. For Server-side KMS and Client-side KMS: the KMS Key ID must be configured. For Server-side Customer Key and Client-side Customer Key: the Key Material must be specified in Base64 encoded form. In case of Server-side Customer Key, the key must be an AES-256 key. In case of Client-side Customer Key, it can be an AES-256, AES-192 or AES-128 key. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardSalesforceBulkJobsStateService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardsalesforcebulkjobsstateservice.md
section: Loading & Unloading Data
---

# StandardSalesforceBulkJobsStateService

## Description

Stores Salesforce Bulk Jobs state per object type at cluster scope

## Tags

bulk, preview, salesforce, state

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardSalesforceClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardsalesforceclientservice.md
section: Loading & Unloading Data
---

# StandardSalesforceClientService

## Description

Provides connection service to Salesforce APIs

## Tags

preview, salesforce

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API Version \* | API Version | 63.0 |  | The version number of the Salesforce REST API appended to the URL after the services/data path. See Salesforce documentation for supported versions. |
| OAuth2 Access Token Provider \* | OAuth2 Access Token Provider |  |  | Service providing OAuth2 Access Tokens for authenticating using the HTTP Authorization Header |
| Salesforce Instance \* | Salesforce Instance |  |  | The hostname of the Salesforce instance including the domain such as MyDomainName.my.salesforce.com |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with Salesforce |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardSalesforceDataCloudClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardsalesforcedatacloudclientservice.md
section: Loading & Unloading Data
---

# StandardSalesforceDataCloudClientService

## Description

Provides connection service to Salesforce Data Cloud APIs

## Tags

daas, data cloud, preview, salesforce

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Data Cloud Instance | Data Cloud Instance |  |  | The hostname of the Salesforce instance including the domain such as MyDomainName.my.salesforce.com |
| Data Cloud Token Provider \* | Data Cloud Token Provider |  |  | Service providing OAuth2 Access Tokens for authenticating using the HTTP Authorization Header |
| Web Client Service \* | Web Client Service |  |  | The Web Client Service to use for communicating with Salesforce |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardSlackRateLimiterService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardslackratelimiterservice.md
section: Loading & Unloading Data
---

# StandardSlackRateLimiterService

## Description

Provides rate limiting coordination for Slack API calls across processors to prevent cascading rate limit issues

## Tags

api, limit, openflow, rate, slack

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Enable Rate Limiting \* | Enable Rate Limiting | true | * true * false | Enable or disable rate limiting functionality |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardSSLContextService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardsslcontextservice.md
section: Loading & Unloading Data
---

# StandardSSLContextService

## Description

Standard implementation of the SSLContextService. Provides the ability to configure keystore and/or truststore properties once and reuse that configuration throughout the application. This service can be used to communicate with both legacy and modern systems. If you only need to communicate with non-legacy systems, then the StandardRestrictedSSLContextService is recommended as it only allows a specific set of SSL protocols to be chosen.

## Tags

certificate, jks, keystore, p12, pkcs, pkcs12, secure, ssl, tls, truststore

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Keystore Filename | Keystore Filename |  |  | The fully-qualified filename of the Keystore |
| Keystore Password | Keystore Password |  |  | The password for the Keystore |
| Keystore Type | Keystore Type |  | * BCFKS * PKCS12 * JKS | The Type of the Keystore |
| TLS Protocol | SSL Protocol | TLS | * SSL * TLS * TLSv1.3 * TLSv1.2 * TLSv1.1 * TLSv1 | SSL or TLS Protocol Version for encrypted connections. Supported versions include insecure legacy options and depend on the specific version of Java used. |
| Truststore Filename | Truststore Filename |  |  | The fully-qualified filename of the Truststore |
| Truststore Password | Truststore Password |  |  | The password for the Truststore |
| Truststore Type | Truststore Type |  | * BCFKS * PKCS12 * JKS | The Type of the Truststore |
| Key Password | key-password |  |  | The password for the key. If this is not specified, but the Keystore Filename, Password, and Type are specified, then the Keystore Password will be assumed to be the same as the Key Password. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardTableStateService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardtablestateservice.md
section: Loading & Unloading Data
---

# StandardTableStateService

## Description

A controller Service that provides and manages table state. The state is cached and refreshed only when one of set table state method is invoked. This caching method requires that getting or setting state for a given table must be done on the same node. The Tables processing can be partitioned between NiFi nodes, but the get and set state operations for a single table must be associated with a single NiFi node.

## Tags

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardVectaraClientService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardvectaraclientservice.md
section: Loading & Unloading Data
---

# StandardVectaraClientService

## Description

Vectara Controller Service to integrate with Vectara HTTP Api.

## Tags

ai, llm, openflow, rag, vectara

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| API Key \* | API Key |  |  | Vectara API Key |
| Customer ID \* | Customer ID |  |  | Vectara Customer ID |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StandardWebClientServiceProvider
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/standardwebclientserviceprovider.md
section: Loading & Unloading Data
---

# StandardWebClientServiceProvider

## Description

Web Client Service Provider with support for configuring standard HTTP connection properties

## Tags

Client, HTTP, Web

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Connect Timeout \* | Connect Timeout | 10 secs |  | Maximum amount of time to wait before failing during initial socket connection |
| HTTP Protocol Version \* | HTTP Protocol Version | HTTP_2 | * HTTP_1_1 * HTTP_2 | Preferred HTTP protocol version for requests |
| Read Timeout \* | Read Timeout | 10 secs |  | Maximum amount of time to wait before failing while reading socket responses |
| Redirect Handling Strategy \* | Redirect Handling Strategy | FOLLOWED | * FOLLOWED * IGNORED | Handling strategy for responding to HTTP 301 or 302 redirects received with a Location header |
| SSL Context Service | SSL Context Service |  |  | SSL Context Service overrides system default TLS settings for HTTPS communication |
| Write Timeout \* | Write Timeout | 10 secs |  | Maximum amount of time to wait before failing while writing socket requests |
| Proxy Configuration Service | proxy-configuration-service |  |  | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: StartAwsPollyJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startawspollyjob.md
section: Loading & Unloading Data
---

# StartAwsPollyJob 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Trigger a AWS Polly job. It should be followed by GetAwsPollyJobStatus processor in order to monitor job status.

## Tags

AWS, Amazon, ML, Machine Learning, Polly

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| JSON Payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| awsTaskId | The task ID that can be used to poll for Job completion in GetAwsPollyJobStatus |

## See also

* [org.apache.nifi.processors.aws.ml.polly.GetAwsPollyJobStatus](getawspollyjobstatus.md)

---
title: StartAwsTextractJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startawstextractjob.md
section: Loading & Unloading Data
---

# StartAwsTextractJob 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Trigger a AWS Textract job. It should be followed by GetAwsTextractJobStatus processor in order to monitor job status.

## Tags

AWS, Amazon, ML, Machine Learning, Textract

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| JSON Payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Textract Type | Supported values: “Document Analysis”, “Document Text Detection”, “Expense Analysis” |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| awsTaskId | The task ID that can be used to poll for Job completion in GetAwsTextractJobStatus |
| awsTextractType | The selected Textract type, which can be used in GetAwsTextractJobStatus |

## See also

* [org.apache.nifi.processors.aws.ml.textract.GetAwsTextractJobStatus](getawstextractjobstatus.md)

---
title: StartAwsTranscribeJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startawstranscribejob.md
section: Loading & Unloading Data
---

# StartAwsTranscribeJob 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Trigger a AWS Transcribe job. It should be followed by GetAwsTranscribeStatus processor in order to monitor job status.

## Tags

AWS, Amazon, ML, Machine Learning, Transcribe

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| JSON Payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| awsTaskId | The task ID that can be used to poll for Job completion in GetAwsTranscribeJobStatus |

## See also

* [org.apache.nifi.processors.aws.ml.transcribe.GetAwsTranscribeJobStatus](getawstranscribejobstatus.md)

---
title: StartAwsTranslateJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startawstranslatejob.md
section: Loading & Unloading Data
---

# StartAwsTranslateJob 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Trigger a AWS Translate job. It should be followed by GetAwsTranslateJobStatus processor in order to monitor job status.

## Tags

AWS, Amazon, ML, Machine Learning, Translate

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Communications Timeout |  |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| JSON Payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| Region |  |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| original | Upon successful completion, the original FlowFile will be routed to this relationship. |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| awsTaskId | The task ID that can be used to poll for Job completion in GetAwsTranslateJobStatus |

## See also

* [org.apache.nifi.processors.aws.ml.translate.GetAwsTranslateJobStatus](getawstranslatejobstatus.md)

---
title: StartGcpVisionAnnotateFilesOperation 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startgcpvisionannotatefilesoperation.md
section: Loading & Unloading Data
---

# StartGcpVisionAnnotateFilesOperation 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Trigger a Vision operation on file input. It should be followed by GetGcpVisionAnnotateFilesOperationStatus processor in order to monitor operation status.

## Tags

Cloud, Google, Machine Learning, Vision

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| json-payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| output-bucket | Name of the GCS bucket where the output of the Vision job will be persisted. The value of this property applies when the JSON Payload property is configured. The JSON Payload property value can use Expression Language to reference the value of ${output-bucket} |
| vision-feature-type | Type of GCP Vision Feature. The value of this property applies when the JSON Payload property is configured. The JSON Payload property value can use Expression Language to reference the value of ${vision-feature-type} |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| operationKey | A unique identifier of the operation returned by the Vision server. |

## See also

* [org.apache.nifi.processors.gcp.vision.GetGcpVisionAnnotateFilesOperationStatus](getgcpvisionannotatefilesoperationstatus.md)

---
title: StartGcpVisionAnnotateImagesOperation 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/startgcpvisionannotateimagesoperation.md
section: Loading & Unloading Data
---

# StartGcpVisionAnnotateImagesOperation 2025.10.9.21

## Bundle

org.apache.nifi | nifi-gcp-nar

## Description

Trigger a Vision operation on image input. It should be followed by GetGcpVisionAnnotateImagesOperationStatus processor in order to monitor operation status.

## Tags

Cloud, Google, Machine Learning, Vision

## Input Requirement

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| gcp-credentials-provider-service | The Controller Service used to obtain Google Cloud Platform credentials. |
| json-payload | JSON request for AWS Machine Learning services. The Processor will use FlowFile content for the request when this property is not specified. |
| output-bucket | Name of the GCS bucket where the output of the Vision job will be persisted. The value of this property applies when the JSON Payload property is configured. The JSON Payload property value can use Expression Language to reference the value of ${output-bucket} |
| vision-feature-type | Type of GCP Vision Feature. The value of this property applies when the JSON Payload property is configured. The JSON Payload property value can use Expression Language to reference the value of ${vision-feature-type} |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles are routed to failure relationship |
| success | FlowFiles are routed to success relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| operationKey | A unique identifier of the operation returned by the Vision server. |

## See also

* [org.apache.nifi.processors.gcp.vision.GetGcpVisionAnnotateImagesOperationStatus](getgcpvisionannotateimagesoperationstatus.md)

---
title: StateManagedCdcSchemaRegistry
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/statemanagedcdcschemaregistry.md
section: Loading & Unloading Data
---

# StateManagedCdcSchemaRegistry

## Description

Uses the in-built NiFi State Management to store the hashes of table schemas. This allows for a relatively high performance, low latency, low memory utilization mechanism for storing and comparing table schemas with no external dependencies.

## Tags

CDC, Database, Schema, Snowflake

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SubmitQueryJob 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/submitqueryjob.md
section: Loading & Unloading Data
---

# SubmitQueryJob 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Submits a Query Job to Salesforce using the Bulk API 2.0. In SIMPLE mode, per-object state (previousLast/currentLast and status) is stored in the configured controller service. In ADVANCED mode, a single ‘last’ timestamp is stored at processor scope to support incremental queries across objects.

## Tags

bulk, job, preview, query, salesforce

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Delimiter | The column delimiter used for CSV job data. |
| Configuration Mode | The configuration mode for configuring this processor. If using advanced mode, the SOQL query has to be provided and the processor ‘s state will only store the timestamp of the last query job submission regardless of the object queried. If using simple mode, the object name and the fields to be queried have to be provided and the processor’s state will store the timestamp of the last query job submission for each object queried. |
| Incremental Offload | Whether the processor should perform incremental offload. If true, the processor will only fetch the records that have been modified since the last query job submission by using a WHERE clause on the SystemModstamp field. |
| Line Ending | The line ending used for CSV job data, marking the end of a data row. |
| Object Fields | Comma separated list of the name of the fields to be queried for the specified object. |
| Object Name | The name of the object to be queried. |
| Operation | The type of query to submit. |
| Query | The query to be performed. In order to perform incremental retrieval (ie. only the added/modified/deleted elements since the last submission of the query are retrieved), this processor exposes two attributes: ${nowTs} and ${lastJobTimestamp}. It is possible to use those placeholders like SELECT Id FROM Account WHERE SystemModstamp > ${lastJobTimestamp} AND SystemModstamp <= ${nowTs}. |
| Result Format | The format to be used for the results. Currently the only supported value is CSV. |
| Salesforce Bulk Job State Service | Controller Service to store Bulk Jobs state per object type (used in SIMPLE mode). In ADVANCED mode, the processor stores a single ‘last’ timestamp in processor state. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## State management

| Scopes | Description |
| --- | --- |
| CLUSTER | In case the placeholders for incremental retrieval are used in the query field, the timestamp of the last Query Job submission time minus 30 seconds will be stored in the state. |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | An incoming FlowFile is routed to this relationship if the Query Job could not be submitted but the operation might be retried |
| failure | An incoming FlowFile is routed to this relationship if the Query Job could not be submitted |
| in.progress | An incoming FlowFile is routed to this relationship when a previous job for the same object is still IN_PROGRESS |
| success | When a Query Job is successfully submited, a FlowFile is routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| jobId | The unique ID for this job. |
| operationType | The type of query. |
| objectType | The object type being queried. |
| createdById | The ID of the user who created the job. |
| createdDate | The UTC date and time when the job was created. |
| systemModstamp | The UTC date and time when the API last updated the job information. |
| jobState | The current state of processing for the job. |
| concurrencyMode | How the request is processed. |
| contentType | The format to be used for the results. |
| apiVersion | The API version that the job was created in. |
| lineEnding | The line ending used for CSV job data, marking the end of a data row. |
| columnDelimiter | The column delimiter used for CSV job data. |
| nowTs | Upper limit of the time range used in the WHERE close to construct the Query Job. |
| lastJobTimestamp | Lower limit of the time range used in the WHERE close to construct the Query Job. |

## Use cases

|  |
| --- |
| Submits a Query Job to Salesforce using the Bulk API 2.0. |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.AbortQueryJob](abortqueryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobStatus](getqueryjobstatus.md)

---
title: SummarizeText 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/summarizetext.md
section: Loading & Unloading Data
---

# SummarizeText 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-llm-processors-nar

## Description

This processor uses a Large Language Model (LLM) to summarize the content of a FlowFile. It sends the content to an LLM service and writes the summary back to the FlowFile or as an attribute.

## Tags

ai, llm, openflow, summarization, text processing

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Content | The content to be summarized. FlowFile attributes may be referenced via Expression Language, and the contents of the FlowFile may be referenced via the flowfile_content variable. E.g., ${flowfile_content} |
| LLM Provider Service | The provider service for sending evaluation prompts to LLM |
| Max File Size | The maximum size of a FlowFile that can be summarized. If the FlowFile is larger than this, it will be routed to ‘failure’. |
| Output Strategy | Determines response output destination |
| Results Attribute | The name of the attribute to write the response to. |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be processed are routed to this relationship |
| success | FlowFiles that are successfully processed are routed to this relationship |

---
title: Syslog5424Reader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/syslog5424reader.md
section: Loading & Unloading Data
---

# Syslog5424Reader

## Description

Provides a mechanism for reading RFC 5424 compliant Syslog data, such as log files, and structuring the data so that it can be processed.

## Tags

logfiles, logs, parse, reader, record, syslog, syslog 5424, text

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Character Set \* | Character Set | UTF-8 |  | Specifies which character set of the Syslog messages |
| Raw message \* | syslog-5424-reader-raw-message | false | * true * false | If true, the record will have a _raw field containing the raw message |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: SyslogReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/syslogreader.md
section: Loading & Unloading Data
---

# SyslogReader

## Description

Attempts to parses the contents of a Syslog message in accordance to RFC5424 and RFC3164. In the case of RFC5424 formatted messages, structured data is not supported, and will be returned as part of the message. Note: Be mindfull that RFC3164 is informational and a wide range of different implementations are present in the wild.

## Tags

logfiles, logs, parse, reader, record, syslog, text

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Character Set \* | Character Set | UTF-8 |  | Specifies which character set of the Syslog messages |
| Raw message \* | syslog-5424-reader-raw-message | false | * true * false | If true, the record will have a _raw field containing the raw message |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: TagS3Object 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/tags3object.md
section: Loading & Unloading Data
---

# TagS3Object 2025.10.9.21

## Bundle

org.apache.nifi | nifi-aws-nar

## Description

Adds or updates a tag on an Amazon S3 Object.

## Tags

AWS, Amazon, Archive, S3, Tag

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| AWS Credentials Provider service | The Controller Service that is used to obtain AWS credentials provider |
| Append Tag | If set to true, the tag will be appended to the existing set of tags on the S3 object. Any existing tags with the same key as the new tag will be updated with the specified value. If set to false, the existing tags will be removed and the new tag will be set on the S3 object. |
| Bucket | The S3 Bucket to interact with |
| Communications Timeout | The amount of time to wait in order to establish a connection to AWS or receive data from AWS before timing out. |
| Custom Signer Class Name | Fully qualified class name of the custom signer class. The signer must implement com.amazonaws.auth. Signer interface. |
| Custom Signer Module Location | Comma-separated list of paths to files and/or directories which contain the custom signer’s JAR file and its dependencies (if any). |
| Endpoint Override URL | Endpoint URL to use instead of the AWS default including scheme, host, port, and path. The AWS libraries select an endpoint URL based on the AWS region, but this property overrides the selected endpoint URL, allowing use with other S3-compatible endpoints. |
| Object Key | The S3 Object Key to use. This is analogous to a filename for traditional file systems. |
| Region | The AWS Region to connect to. |
| SSL Context Service | Specifies an optional SSL Context Service that, if provided, will be used to create connections |
| Signer Override | The AWS S3 library uses Signature Version 4 by default but this property allows you to specify the Version 2 signer to support older S3-compatible services or even to plug in your own custom signer implementation. |
| Tag Key | The key of the tag that will be set on the S3 Object |
| Tag Value | The value of the tag that will be set on the S3 Object |
| Version | The Version of the Object to tag |
| proxy-configuration-service | Specifies the Proxy Configuration Controller Service to proxy network requests. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the Processor is unable to process a given FlowFile, it will be routed to this Relationship. |
| success | FlowFiles are routed to this Relationship after they have been successfully processed. |

## Writes attributes

| Name | Description |
| --- | --- |
| s3.tag.___ | The tags associated with the S3 object will be written as part of the FlowFile attributes |
| s3.exception | The class name of the exception thrown during processor execution |
| s3.additionalDetails | The S3 supplied detail from the failed operation |
| s3.statusCode | The HTTP error code (if available) from the failed operation |
| s3.errorCode | The S3 moniker of the failed operation |
| s3.errorMessage | The S3 exception message from the failed operation |

## See also

* [org.apache.nifi.processors.aws.s3.CopyS3Object](copys3object.md)
* [org.apache.nifi.processors.aws.s3.DeleteS3Object](deletes3object.md)
* [org.apache.nifi.processors.aws.s3.FetchS3Object](fetchs3object.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectMetadata](gets3objectmetadata.md)
* [org.apache.nifi.processors.aws.s3.GetS3ObjectTags](gets3objecttags.md)
* [org.apache.nifi.processors.aws.s3.ListS3](lists3.md)
* [org.apache.nifi.processors.aws.s3.PutS3Object](puts3object.md)

---
title: TailFile 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/tailfile.md
section: Loading & Unloading Data
---

# TailFile 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

“Tails” a file, or a list of files, ingesting data from the file as it is written to the file. The file is expected to be textual. Data is ingested only when a new line is encountered (carriage return or new-line character or combination). If the file to tail is periodically “rolled over”, as is generally the case with log files, an optional Rolling Filename Pattern can be used to retrieve data from files that have rolled over, even if the rollover occurred while NiFi was not running (provided that the data still exists upon restart of NiFi). It is generally advisable to set the Run Schedule to a few seconds, rather than running with the default value of 0 secs, as this Processor will consume a lot of resources if scheduled very aggressively. At this time, this Processor does not support ingesting files that have been compressed when ‘rolled over’.

## Tags

file, log, source, tail, text

## Input Requirement

FORBIDDEN

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File Location | Specifies where the state is located either local or cluster so that state can be stored appropriately in order to ensure that all data is consumed without duplicating data upon restart of NiFi |
| File to Tail | Path of the file to tail in case of single file mode. If using multifile mode, regular expression to find files to tail in the base directory. In case recursivity is set to true, the regular expression will be used to match the path starting from the base directory (see additional details for examples). |
| Initial Start Position | When the Processor first begins to tail data, this property specifies where the Processor should begin reading data. Once data has been ingested from a file, the Processor will continue from the last point from which it has received data. |
| Line Start Pattern | A Regular Expression to match against the start of a log line. If specified, any line that matches the expression, and any following lines, will be buffered until another line matches the Expression. In doing this, we can avoid splitting apart multi-line messages in the file. This assumes that the data is in UTF-8 format. |
| Max Buffer Size | When using the Line Start Pattern, there may be situations in which the data in the file being tailed never matches the Regular Expression. This would result in the processor buffering all data from the tailed file, which can quickly exhaust the heap. To avoid this, the Processor will buffer only up to this amount of data before flushing the buffer, even if it means ingesting partial data from the file. |
| Post-Rollover Tail Period | When a file is rolled over, the processor will continue tailing the rolled over file until it has not been modified for this amount of time. This allows for another process to rollover a file, and then flush out any buffered data. Note that when this value is set, and the tailed file rolls over, the new file will not be tailed until the old file has not been modified for the configured amount of time. Additionally, when using this capability, in order to avoid data duplication, this period must be set longer than the Processor’s Run Schedule, and the Processor must not be stopped after the file being tailed has been rolled over and before the data has been fully consumed. Otherwise, the data may be duplicated, as the entire file may be written out as the contents of a single FlowFile. |
| Rolling Filename Pattern | If the file to tail “rolls over” as would be the case with log files, this filename pattern will be used to identify files that have rolled over so that if NiFi is restarted, and the file has rolled over, it will be able to pick up where it left off. This pattern supports wildcard characters \* and ?, it also supports the notation ${filename} to specify a pattern based on the name of the file (without extension), and will assume that the files that have rolled over live in the same directory as the file being tailed. The same glob pattern will be used for all files. |
| pre-allocated-buffer-size | Sets the amount of memory that is pre-allocated for each tailed file. |
| reread-on-nul | If this option is set to ‘true’, when a NUL character is read, the processor will yield and try to read the same part again later. (Note: Yielding may delay the processing of other files tailed by this processor, not just the one with the NUL character.) The purpose of this flag is to allow users to handle cases where reading a file may return temporary NUL values. NFS for example may send file contents out of order. In this case the missing parts are temporarily replaced by NUL values. CAUTION! If the file contains legitimate NUL values, setting this flag causes this processor to get stuck indefinitely. For this reason users should refrain from using this feature if they can help it and try to avoid having the target file on a file system where reads are unreliable. |
| tail-base-directory | Base directory used to look for files to tail. This property is required when using Multifile mode. |
| tail-mode | Mode to use: single file will tail only one file, multiple file will look for a list of file. In Multiple mode the Base directory is required. |
| tailfile-lookup-frequency | Only used in Multiple files mode. It specifies the minimum duration the processor will wait before listing again the files to tail. |
| tailfile-maximum-age | Only used in Multiple files mode. It specifies the necessary minimum duration to consider that no new messages will be appended in a file regarding its last modification date. This should not be set too low to avoid duplication of data in case new messages are appended at a lower frequency. |
| tailfile-recursive-lookup | When using Multiple files mode, this property defines if files must be listed recursively or not in the base directory. |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Stores state about where in the Tailed File it left off so that on restart it does not have to duplicate data. State is stored either local or clustered depend on the <File Location> property. |
| CLUSTER | Stores state about where in the Tailed File it left off so that on restart it does not have to duplicate data. State is stored either local or clustered depend on the <File Location> property. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## Relationships

| Name | Description |
| --- | --- |
| success | All FlowFiles are routed to this Relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| tailfile.original.path | Path of the original file the flow file comes from. |

---
title: TransformXml 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/transformxml.md
section: Loading & Unloading Data
---

# TransformXml 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Applies the provided XSLT file to the FlowFile XML payload. A new FlowFile is created with transformed content and is routed to the ‘success’ relationship. If the XSL transform fails, the original FlowFile is routed to the ‘failure’ relationship

## Tags

transform, xml, xslt

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| XSLT file name | Provides the name (including full path) of the XSLT file to apply to the FlowFile XML content. One of the ‘XSLT file name’ and ‘XSLT Lookup’ properties must be defined. |
| cache-size | Maximum number of stylesheets to cache. Zero disables the cache. |
| cache-ttl-after-last-access | The cache TTL (time-to-live) or how long to keep stylesheets in the cache after last access. |
| indent-output | Whether or not to indent the output. |
| secure-processing | Whether or not to mitigate various XML-related attacks like XXE (XML External Entity) attacks. |
| xslt-controller | Controller lookup used to store XSLT definitions. One of the ‘XSLT file name’ and ‘XSLT Lookup’ properties must be defined. WARNING: note that the lookup controller service should not be used to store large XSLT files. |
| xslt-controller-key | Key used to retrieve the XSLT definition from the XSLT lookup controller. This property must be set when using the XSLT controller property. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile fails processing for any reason (for example, the FlowFile is not valid XML), it will be routed to this relationship |
| success | The FlowFile with transformed content will be routed to this relationship |

---
title: Troubleshoot Openflow
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/troubleshoot.md
section: Loading & Unloading Data
---

# Troubleshoot Openflow

This topic describes the steps to troubleshoot the Openflow components.

## Openflow BYOC troubleshooting

### BYOC custom ingress troubleshooting

For help with BYOC custom ingress, see [Custom ingress troubleshooting](setup-openflow-byoc-custom-ingress.md).

### General BYOC troubleshooting

If any part of a deployment, connector, or runtime is causing problems,
you can use a built-in tool to generate a diagnostic bundle. This bundle
includes the information necessary to keep your Openflow BYOC deployment
secure while allowing the [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) team to
troubleshoot the issue. To share the diagnostic bundle with [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support), attach it to your support case.

1. From the AWS Console UI for EC2, right-click on the openflow-agent-{deployment-key} instance with your Deployment Key.
2. In the context menu, click the Connect button.
3. Switch from EC2 Instance Connect to Connect using EC2 Instance Connect Endpoint. Leave the default EC2 Instance Connect Endpoint in place.
4. Click the Connect button. A new browser tab or window will appear with a command-line interface.
5. Run `./diagnostics.sh` from this browser-based CLI. Follow a few simple prompts to confirm that you want to create the bundle, and then optionally create a shareable link. The diagnostic utility will upload the file to an S3 bucket created for the Deployment using the Deployment Key. For example, `s3://byoc-tf-state-{deployment-key}/diagnostics/openflow_20250131123456.tar.gz`

With the pre-signed URL, you can safely share temporary access to the
diagnostic bundle with the Snowflake team for up to 1 hour. Your S3 bucket and all of its
contents remain private.

---
title: Troubleshooting the Openflow Connector for Kinesis
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/kinesis/troubleshoot.md
section: Loading & Unloading Data
---

# Troubleshooting the Openflow Connector for Kinesis

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to troubleshoot common issues with the
Openflow Connector for Kinesis.

## Common issues

### Messages are not ingested

**Symptom**

The `ConsumeKinesis` processor in a `Kinesis JSON Source` process group is running, but no data is produced and no error bulletins are emitted.

**Cause**

An error might have occurred in an underlying AWS KCL library, which was not propagated to the Openflow UI.

**Solution**

Check the KCL logs to identify the underlying error.

### FlowFile queues are full

**Symptom**

FlowFile queues are filled up and the connector is not processing data fast enough.

**Cause**

The downstream processors cannot keep up with the incoming data rate.
Most likely the slowest processor is `Put Data To Snowflake` of
[PutSnowpipeStreaming](../../processors/putsnowpipestreaming.md) type
in a `Streaming Destination` process group.

**Solution**

Adjust the number of concurrent tasks for the processor.
Concurrent tasks allow processors to run multiple threads simultaneously, improving throughput for high-volume scenarios.

To adjust concurrent tasks for a processor, perform the following tasks:

1. Right-click the processor in the Openflow canvas.
2. Select Configure from the context menu.
3. Navigate to the Scheduling tab.
4. In the Concurrent tasks field, enter the preferred number of concurrent tasks.
5. Select Apply to save the configuration.

Snowflake recommends the following task count values, although the
correct value might differ for a particular use case:

* 1-2 on small size runtimes
* 2-4 on medium size runtimes
* 6-8 on large size runtimes

After changing the task count, observe the processor to ensure that increasing the tasks count improves the throughput.

## Check KCL logs

The connector uses the [AWS Kinesis Client Library (KCL) v3](https://docs.aws.amazon.com/streams/latest/dev/kcl.html)
under the hood. Errors that occur in KCL are not always propagated to
the Openflow UI, so checking KCL logs might be necessary for troubleshooting.

The KCL logs are stored in a [configured event table](../../monitor.md).
You can retrieve them with the following query:

```sqlexample
SELECT
    timestamp,
    runtime_key,
    resource_attributes,
    log,
    log:formattedMessage,
FROM (
    SELECT
        timestamp,
        resource_attributes,
        resource_attributes:"openflow.dataplane.id" AS deployment_id,
        resource_attributes:"k8s.namespace.name" AS runtime_key,
        resource_attributes:"k8s.pod.name" AS runtime_pod,
        TRY_PARSE_JSON(value) AS log,
    FROM <event_table>
    WHERE TRUE
        AND timestamp > DATEADD(minute, -30, SYSDATE())
        AND record_type = 'LOG'
        AND runtime_key = 'runtime-<runtime_name>'
        AND resource_attributes:"k8s.container.name" ILIKE '%-server'
)
WHERE TRUE
    AND log:loggerName LIKE 'software.amazon.kinesis.%'
    AND log:level IN ('WARN', 'ERROR')
ORDER BY timestamp DESC
;
```

Replace `<event_table>` with a configured event table name and `<runtime_name>` with a runtime name.

## Common KCL errors

This section describes common errors that can appear in the KCL logs and how to resolve them.

### Error: User is not authorized

**Error message**

```text
User: **** is not authorized to perform: kinesis:RegisterStreamConsumer on
resource: arn:aws:kinesis:us-east-2:***:stream/*** because no identity-based
policy allows the kinesis:RegisterStreamConsumer action (Service: Kinesis,
Status Code: 400, Request ID: ***, Extended Request ID: ***)
(SDK Attempt Count: 1)
```

**Cause**

The configured AWS user does not have the necessary permissions to access the Kinesis stream.

**Solution**

Make sure the AWS user is configured with the permissions specified in the
[IAM permissions required for KCL consumer applications](https://docs.aws.amazon.com/streams/latest/dev/kcl-iam-permissions.html).

### Error: UnknownHostException

**Error message**

```text
java.net.UnknownHostException: dynamodb.eu-west-1.amazonaws.com
```

**Cause**

If the runtime is using a Snowflake Deployment, the network rule is most likely misconfigured.

**Solution**

Make sure the required AWS domains are allowlisted in your network rule.
For the list of required domains, see
[Set up Openflow - Snowflake Deployment: Configure allowed domains for Openflow connectors](../../setup-openflow-spcs-sf-allow-list.md).

### Error: No shards found

**Error message**

```text
java.lang.IllegalStateException: No shards found when attempting to
validate complete hash range.
```

**Cause**

This error can occur if the Kinesis stream does not exist or the AWS region is incorrectly specified.

**Solution**

1. Check the KCL logs for messages like:

   ```text
   Got ResourceNotFoundException when fetching shard list for stream-name.
   Stream no longer exists.
   ```
2. Verify that the stream name is correct and that the stream exists in AWS.
3. Verify that the AWS region is specified correctly in the connector configuration.

---
title: Troubleshooting the Openflow Connector for Oracle
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/oracle/troubleshoot.md
section: Loading & Unloading Data
---

# Troubleshooting the Openflow Connector for Oracle

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Note:**
>
> The Openflow Connector for Oracle is also subject to additional terms of service beyond the standard
> connector terms of service. For more information, see the
> [Openflow Connector for Oracle Addendum](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/openflow-oracle-terms/).

This topic describes how to troubleshoot common issues with the
Openflow Connector for Oracle.

## A table was added to replication but doesn’t appear in Snowflake

The table’s fully qualified name (FQN) may be incorrectly specified in the connector
configuration.

**Solution**

* Check the format of the FQN in `Oracle Ingestion Parameters`. It should be
  `<database_name>.<schema_name>.<table_name>` (note the database prefix).
* Check the database name in `Oracle Source Parameters` » `Oracle Connection URL`.
  While FQNs support specifying the name of the database, currently data must reside in
  the same database instance as the one used for this connection.
* Verify that you have provided the full database name including the domain name in the
  connector configuration. For example, use `MYDB.EXAMPLE.COM` instead of just `MYDB`.

  To find the correct database name, run the following query on your Oracle database:

  ```sqlexample
  SELECT property_value
    FROM database_properties
    WHERE property_name = 'GLOBAL_DB_NAME';
  ```

  In general, `property_value` is the same as the service name of the database.
  However, the returned database name might include an appended domain name (for
  example, for service name `FOO`, the query might return `FOO.EXAMPLE.COM`). In
  that case, use the full name with the domain (double-quoted, because it contains dots).

## No changes in incremental load

The incremental load is not capturing or applying changes from the source database.

**Solution**

Run the verification for the **Read Oracle CDC Stream** processor:

1. In your Openflow runtime, double-click the Oracle flow.
2. Double-click the process group named Incremental Load.
3. Find the Read Oracle CDC Stream processor.

   1. If it is running, right-click and select Stop. The processor must be stopped
      before you can verify its configuration.
4. Right-click Read Oracle CDC Stream again, then select Configure.
5. Select the Properties tab.
6. Select the Verification checkmark icon in the upper-right corner.
7. In the popup window that appears, select Verify in the lower-right corner.

   The results of the verification procedure appear below. The procedure validates
   database connectivity and checks the status of the components required for incremental
   load to work.

If any of the verification steps fail, view the error message, fix the issue, and run the
verification again. The following sections describe specific issues and solutions.

## Capture Status not ENABLED

The capture process status is `DISABLED` or `ABORTED`. A `DISABLED` status means the
capture process was stopped manually (with `DBMS_XSTREAM_ADM.STOP_OUTBOUND`) or the
database was restarted. An `ABORTED` status means the capture encountered an error,
usually because redo logs needed for the capture process have been deleted.
You can confirm this by checking the System Change Number (SCN) position or querying
the capture status.

**Solution**

Start the outbound server:

```sqlexample
BEGIN
   DBMS_XSTREAM_ADM.START_OUTBOUND('XOUT1');
END;
/
```

## UNKNOWN status of LogMiner session

The LogMiner status is `UNKNOWN`, which means that archived logs that LogMiner depended
on were deleted. You can confirm this by querying `V$ARCHIVED_LOG` and checking for rows
where the DELETED column has value YES.

**Solution**

Recreate the XStream outbound server. For more information, see Problems occur with the XStream outbound server

## WAITING FOR REDO status of XStream capture

The XStream capture status shows
`WAITING FOR REDO: FILE NA, THREAD 1, SEQUENCE 47, SCN 0x0000000000190ac4`.
This means LogMiner is waiting for an archived log file that is not available because it
was deleted. You can confirm this by querying `V$ARCHIVED_LOG` and checking for rows
where the DELETED column has value YES.

**Solution**

Recreate the XStream outbound server. For more information, see Problems occur with the XStream outbound server

## XStream capture rules are incorrect

XStream is not configured to capture changes from the expected schemas or tables.

**Solution**

Verify the capture rules by running the following query:

```sqlexample
SELECT STREAMS_NAME, SCHEMA_NAME, OBJECT_NAME, RULE_TYPE
FROM DBA_XSTREAM_RULES
WHERE STREAMS_NAME = 'XOUT1';
```

You can also query the capture status and error message directly:

```sqlexample
SELECT CLIENT_NAME, STATUS, ERROR_MESSAGE FROM ALL_CAPTURE;
```

This query returns:

* `CLIENT_NAME`: The name of the XStream client (outbound server).
* `STATUS`: The current status of the capture process (for example, `ENABLED`,
  `DISABLED`, `ABORTED`).
* `ERROR_MESSAGE`: Any error message associated with the capture process.

## Error ORA-21560: argument last_position is null, invalid, or out of range

The connector attempted to connect to an SCN position for which redo logs are no longer
available.

**Solution**

Confirm the issue by running the following query. The SCN for
`Last SCN processed by XStream` must be higher than the lowest SCN for which redo logs
exist.

```sqlexample
SELECT min(FIRST_CHANGE#) as SCN,
       'Lowest SCN for which redo logs still exist' AS DESCRIPTION
FROM V$ARCHIVED_LOG
WHERE DELETED = 'NO'
UNION ALL
SELECT PROCESSED_LOW_SCN,
       'Last SCN processed by XStream'
FROM DBA_XSTREAM_OUTBOUND_PROGRESS
WHERE SERVER_NAME = 'XOUT1'
ORDER BY SCN;
```

To recover from this error, recreate the XStream outbound server. For more information,
see Problems occur with the XStream outbound server

## Error ORA-26701: Streams process XOUT1 does not exist

The XStream outbound server cannot be found on the database instance.

**Solution**

Verify the following:

* The database name in `Oracle Source Parameters` » `XStream Out Server URL` points
  to the database instance with the XStream outbound server, not a different PDB.
* XStream has been created on this instance and has the same name.

## Error ORA-01722: invalid number when creating the outbound server

Executing `DBMS_XSTREAM_ADM.CREATE_OUTBOUND` fails with:

```sqlexample
ORA-01722: invalid number
ORA-06512: at "SYS.DBMS_LOGREP_UTIL", line 582
ORA-06512: at "SYS.DBMS_LOGREP_UTIL", line 636
ORA-06512: at "SYS.DBMS_XSTREAM_ADM_UTL", line 440
ORA-06512: at "SYS.DBMS_XSTREAM_UTL_IVK", line 2094
ORA-06512: at "SYS.DBMS_XSTREAM_UTL_IVK", line 2302
ORA-06512: at "SYS.DBMS_XSTREAM_ADM", line 44
ORA-06512: at line 8
```

This error is misleading. The outbound server already exists.

**Solution**

No action is needed. Use the existing outbound server.

## Problems occur with the XStream outbound server

Multiple issues, such as deleted redo logs or corrupted LogMiner state, can be resolved
by recreating the XStream outbound server.

**Solution**

1. Drop the existing outbound server:

   ```sqlexample
   BEGIN
      DBMS_XSTREAM_ADM.DROP_OUTBOUND('XOUT1');
   END;
   /
   ```
2. Create the outbound server again. For more information, see
   [Create XStream Outbound Server](setup-oracledb.md).

---
title: Troubleshooting the Openflow Connector for Salesforce Bulk API
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/salesforce-bulk-api/troubleshoot.md
section: Loading & Unloading Data
---

# Troubleshooting the Openflow Connector for Salesforce Bulk API

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes how to troubleshoot the Openflow Connector for Salesforce Bulk API.

## Monitoring

To track the amount of data being synced from Salesforce to Snowflake, query the event table.
The following example query retrieves relevant logs from the last 30 minutes:

```sqlexample
SELECT
  timestamp,
  Deployment_ID,
  Runtime_Key,
  parsed_log:level as log_level,
  parsed_log:loggerName as logger,
  parsed_log:formattedMessage as message,
  parsed_log
FROM (
  SELECT
    timestamp,
    resource_attributes:"openflow.dataplane.id" as Deployment_ID,
    resource_attributes:"k8s.namespace.name" as Runtime_Key,
    TRY_PARSE_JSON(value) as parsed_log
  FROM OPENFLOW.TELEMETRY.EVENTS
  WHERE true
    AND timestamp > dateadd('minutes', -30, sysdate())
    AND record_type = 'LOG'
    AND resource_attributes:"k8s.namespace.name" like 'runtime-%'
  ORDER BY timestamp DESC
)
WHERE true
  AND logger = 'org.apache.nifi.processors.standard.LogMessage'
  AND message LIKE '%SALESFORCE_BULK_API%';
```

## Troubleshooting

Use the following information to troubleshoot issues with the connector.

### Authentication and OAuth errors

The connector uses the OAuth 2.0 JWT Bearer Flow to authenticate with Salesforce. Authentication errors typically occur during initial setup and can be diagnosed using the [Verification feature](configure-connector.md) on the controller service before starting the connector.

#### `invalid_grant` error

The `invalid_grant` error indicates that Salesforce rejected the OAuth token request. Common causes include:

* **Wrong OAuth flow type.** The external client app in Salesforce does not have the Enable JWT Bearer Flow checkbox selected. The connector requires this specific flow. Other OAuth flows (such as Authorization Code Flow) are not supported. See [Create an external client app in Salesforce](setup-salesforce.md).
* **Mismatched private key and certificate.** The private key configured in the connector (the Connected App Key parameter) does not match the public certificate uploaded to the external client app in Salesforce.
* **Wrong Consumer Key.** The OAuth2 Client ID parameter does not match the Consumer Key of the external client app where the certificate was uploaded.
* **Mixed credentials from multiple apps.** If you have created multiple external client apps or experimented with different configurations, the Client ID, certificate, and private key might belong to different apps. All three must come from the same external client app.
* **Deprecated Connected App.** Salesforce has deprecated Connected Apps in favor of External Client Apps. If you are using a Connected App, Snowflake recommends creating a new external client app instead.
* **Incorrect token endpoint URL.** The OAuth2 Token Endpoint URL parameter must point to the correct Salesforce instance. For example: `https://myCompany.my.salesforce.com/services/oauth2/token`.
* **Incorrect audience.** The OAuth2 Audience parameter must be set to `https://login.salesforce.com` for production environments or `https://test.salesforce.com` for sandboxes and test environments.

#### Permission errors

If the JWT token is successfully generated but the user lacks permissions, you see a permission or authorization error. This means the JWT Bearer Flow is working, but the Salesforce user (the OAuth2 Subject) is not authorized to use the external client app.

To resolve this issue:

1. In Salesforce, go to the Policies tab of the external client app.
2. Verify that Permitted Users is set to Admin approved users are pre-authorized.
3. Verify that the profiles or permission sets assigned in the App Policies section include the user specified in the OAuth2 Subject parameter of the connector.

For more details, see [Approve the client app for a user](setup-salesforce.md).

### Check the connector state

You can examine the connector state to ensure that data is being replicated as expected. The connector maintains a state of current and past operations to ensure no Salesforce changes are missed and to retry bulk job queries if failures occur.

To view the state:

1. Right-click on the canvas and select Controller services.
2. Locate the controller service named Salesforce Bulk Jobs State.
3. In the Salesforce Bulk Jobs State menu, click View state.

The state is a set of key/value pairs where the key is the Salesforce Object type. For
example, the state for the `Account` object might look like the following example:

```json
{"previousLast":"2025-09-30T09:41:23.484406926Z","currentLast":"2025-09-30T09:41:23.484406926Z","status":"COMPLETED"}
```

The `status` can be one of the following:

* `IN_PROGRESS`
* `COMPLETED`
* `FAILED`
* `ABORTED`

If the status is `IN_PROGRESS`, a FlowFile is still being processed for that object type.

> **Caution:**
>
> Do not delete flow files manually. This can cause a job to remain in the `IN_PROGRESS` status indefinitely because the state cannot be manually updated.
>
> If this occurs, you must perform a full reload for that object type.

### Force a full load for a given object type

To force the connector to perform a full refresh for one or more object types:

1. Stop all processors in the flow.
2. Ensure that no in-flight FlowFiles are being processed.
3. Right-click on the canvas and select Disable all controller services.
4. Go to Controller services and open the state of the controller service named
   Salesforce Bulk Jobs State.
5. Perform one of the following actions:

   * Select Clear state to clear the entire state. This forces a full load for
     **all** configured Object types fetched by the connector.
   * Select the trash icon next to a specific Object type to clear the state for a
     specific object type only. This forces a full load of that specific object type
     during the next execution of the connector.
6. In the canvas, right-click, select Enable all controller services, and then start all processors.

### If an object type remains in status IN_PROGRESS

If the state for a given object type is stuck in `IN_PROGRESS` and there are no in-flight FlowFiles for that object type, a FlowFile may have been manually deleted before it could update the status.

In this case, you must perform a full load for that object type to ensure the connector
captures all events.

If the state is stuck in `IN_PROGRESS` but no FlowFiles were manually deleted, contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: UDPEventRecordSink
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/udpeventrecordsink.md
section: Loading & Unloading Data
---

# UDPEventRecordSink

## Description

Format and send Records as UDP Datagram Packets to a configurable destination

## Tags

UDP, event, record, sink

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Hostname \* | hostname |  |  | Destination hostname or IP address |
| Port \* | port |  |  | Destination port number |
| Record Writer \* | record-sink-record-writer |  |  | Specifies the Controller Service to use for writing out the records. |
| Sender Threads \* | sender-threads | 2 |  | Number of worker threads allocated for handling socket communication |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: UnpackContent 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/unpackcontent.md
section: Loading & Unloading Data
---

# UnpackContent 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Unpacks the content of FlowFiles that have been packaged with one of several different Packaging Formats, emitting one to many FlowFiles for each input FlowFile. Supported formats are TAR, ZIP, and FlowFile Stream packages.

## Tags

Unpack, archive, flowfile-stream, flowfile-stream-v3, tar, un-merge, zip

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| File Filter | Only files contained in the archive whose names match the given regular expression will be extracted (tar/zip only) |
| Filename Character Set | If supplied this character set will be supplied to the Zip utility to attempt to decode filenames using the specific character set. If not specified the default platform character set will be used. This is useful if a Zip was created with a different character set than the platform default and the zip uses non standard values to specify. |
| Packaging Format | The Packaging Format used to create the file |
| Password | Password used for decrypting Zip archives encrypted with ZipCrypto or AES. Configuring a password disables support for alternative Zip compression algorithms. |
| allow-stored-entries-wdd | Some zip archives contain stored entries with data descriptors which by spec should not happen. If this property is true they will be read anyway. If false and such an entry is discovered the zip will fail to process. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The original FlowFile is sent to this relationship when it cannot be unpacked for some reason |
| original | The original FlowFile is sent to this relationship after it has been successfully unpacked |
| success | Unpacked FlowFiles are sent to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | If the FlowFile is successfully unpacked, its MIME Type is no longer known, so the mime.type attribute is set to application/octet-stream. |
| fragment.identifier | All unpacked FlowFiles produced from the same parent FlowFile will have the same randomly generated UUID added for this attribute |
| fragment.index | A one-up number that indicates the ordering of the unpacked FlowFiles that were created from a single parent FlowFile |
| fragment.count | The number of unpacked FlowFiles generated from the parent FlowFile |
| segment.original.filename | The filename of the parent FlowFile. Extensions of .tar, .zip or .pkg are removed because the MergeContent processor automatically adds those extensions if it is used to rebuild the original FlowFile |
| file.lastModifiedTime | The date and time that the unpacked file was last modified (tar and zip only). |
| file.creationTime | The date and time that the file was created. For encrypted zip files this attribute always holds the same value as file.lastModifiedTime. For tar and unencrypted zip files if available it will be returned otherwise this will be the same value asfile.lastModifiedTime. |
| file.lastMetadataChange | The date and time the file’s metadata changed (tar only). |
| file.lastAccessTime | The date and time the file was last accessed (tar and unencrypted zip files only) |
| file.owner | The owner of the unpacked file (tar only) |
| file.group | The group owner of the unpacked file (tar only) |
| file.size | The uncompressed size of the unpacked file (tar and zip only) |
| file.permissions | The read/write/execute permissions of the unpacked file (tar and unencrypted zip files only) |
| file.encryptionMethod | The encryption method for entries in Zip archives |

## Use cases

|  |
| --- |
| Unpack Zip containing filenames with special characters, created on Windows with filename charset ‘Cp437’ or ‘IBM437’. |

## See also

* [org.apache.nifi.processors.standard.MergeContent](mergecontent.md)

---
title: UpdateAttribute 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updateattribute.md
section: Loading & Unloading Data
---

# UpdateAttribute 2025.10.9.21

## Bundle

org.apache.nifi | nifi-update-attribute-nar

## Description

Updates the Attributes for a FlowFile by using the Attribute Expression Language and/or deletes the attributes based on a regular expression

## Tags

Attribute Expression Language, attributes, delete, modification, state, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Delete Attributes Expression | Regular expression for attributes to be deleted from FlowFiles. Existing attributes that match will be deleted regardless of whether they are updated by this processor. |
| Stateful Variables Initial Value | If using state to set/reference variables then this value is used to set the initial value of the stateful variable. This will only be used in the @OnScheduled method when state does not contain a value for the variable. This is required if running statefully but can be empty if needed. |
| Store State | Select whether or not state will be stored. Selecting ‘Stateless’ will offer the default functionality of purely updating the attributes on a FlowFile in a stateless manner. Selecting a stateful option will not only store the attributes on the FlowFile but also in the Processors state. See the ‘Stateful Usage’ topic of the ‘Additional Details’section of this processor’s documentation for more information |
| canonical-value-lookup-cache-size | Specifies how many canonical lookup values should be stored in the cache |

## State management

| Scopes | Description |
| --- | --- |
| LOCAL | Gives the option to store values not only on the FlowFile but as stateful variables to be referenced in a recursive manner. |

## Relationships

| Name | Description |
| --- | --- |
| success | All successful FlowFiles are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| See additional details | This processor may write or remove zero or more attributes as described in additional details |

## Use cases

|  |
| --- |
| Add a new FlowFile attribute |
| Overwrite a FlowFile attribute with a new value |
| Rename a file |

---
title: UpdateBoxFileMetadataInstance 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updateboxfilemetadatainstance.md
section: Loading & Unloading Data
---

# UpdateBoxFileMetadataInstance 2025.10.9.21

## Bundle

org.apache.nifi | nifi-box-nar

## Description

Updates metadata template values for a Box file using the record in the given flowFile. This record represents the desired end state of the template after the update. The processor will calculate the necessary changes (add/replace/remove) to transform the current metadata to the desired state. The input record should be a flat key-value object.

## Tags

box, metadata, storage, templates, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Box Client Service | Controller Service used to obtain a Box API connection. |
| File ID | The ID of the file for which to update metadata. |
| Record Reader | The Record Reader to use for parsing the incoming data |
| Template Key | The key of the metadata template to update. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile is routed to this relationship if an error occurs during metadata update. |
| file not found | FlowFiles for which the specified Box file was not found will be routed to this relationship. |
| success | A FlowFile is routed to this relationship after metadata has been successfully updated. |
| template not found | FlowFiles for which the specified metadata template was not found will be routed to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| box.id | The ID of the file whose metadata was updated |
| box.template.name | The template name used for metadata update |
| box.template.scope | The template scope used for metadata update |
| error.code | The error code returned by Box |
| error.message | The error message returned by Box |

## See also

* [org.apache.nifi.processors.box.FetchBoxFile](fetchboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFile](listboxfile.md)
* [org.apache.nifi.processors.box.ListBoxFileMetadataTemplates](listboxfilemetadatatemplates.md)

---
title: UpdateBulkJobState 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatebulkjobstate.md
section: Loading & Unloading Data
---

# UpdateBulkJobState 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Updates the status of a Salesforce Bulk Job in the shared state service for a specific object type

## Tags

bulk, preview, salesforce, state

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Object Type | Salesforce object type whose state should be updated |
| Salesforce Bulk Job State Service | Controller Service managing Bulk Jobs state |
| Status | Status to set for the object type |

## Relationships

| Name | Description |
| --- | --- |
| failure | Incoming FlowFile is routed here if update fails |
| success | Incoming FlowFile is routed here after state update |

---
title: UpdateByQueryElasticsearch 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatebyqueryelasticsearch.md
section: Loading & Unloading Data
---

# UpdateByQueryElasticsearch 2025.10.9.21

## Bundle

org.apache.nifi | nifi-elasticsearch-restapi-nar

## Description

Update documents in an Elasticsearch index using a query. The query can be loaded from a flowfile body or from the Query parameter. The loaded Query can contain any JSON accepted by Elasticsearch’s _update_by_query API, for example a “query” object to identify what documents are to be updated, plus a “script” to define the updates to perform.

## Tags

elastic, elasticsearch, elasticsearch7, elasticsearch8, elasticsearch9, query, update

## Input Requirement

ALLOWED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Client Service | An Elasticsearch client service to use for running queries. |
| Index | The name of the index to use. |
| Max JSON Field String Length | The maximum allowed length of a string value when parsing a JSON document or attribute. |
| Query | A query in JSON syntax, not Lucene syntax. Ex: {“query”:{“match”:{“somefield”:”somevalue”}}}. If this parameter is not set, the query will be read from the flowfile content. If the query (property and flowfile content) is empty, a default empty JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Attribute | If set, the executed query will be set on each result flowfile in the specified attribute. |
| Query Clause | A “query” clause in JSON syntax, not Lucene syntax. Ex: {“match”:{“somefield”:”somevalue”}}. If the query is empty, a default JSON Object will be used, which will result in a “match_all” query in Elasticsearch. |
| Query Definition Style | How the JSON Query will be defined for use by the processor. |
| Script | A “script” to execute during the operation, in JSON syntax. Ex: {“source”: “ctx._source.count++”, “lang”: “painless”} |
| Type | The type of this document (used by Elasticsearch for indexing and searching). |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the “by query” operation fails, and a flowfile was read, it will be sent to this relationship. |
| retry | All flowfiles that fail due to server/cluster availability go to this relationship. |
| success | If the “by query” operation succeeds, and a flowfile was read, it will be sent to this relationship. |

## Writes attributes

| Name | Description |
| --- | --- |
| elasticsearch.update.took | The amount of time that it took to complete the update operation in ms. |
| elasticsearch.update.error | The error message provided by Elasticsearch if there is an error running the update. |

---
title: UpdateCounter 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatecounter.md
section: Loading & Unloading Data
---

# UpdateCounter 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor allows users to set specific counters and key points in their flow. It is useful for debugging and basic counting functions.

## Tags

counter, debug, instrumentation

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| counter-name | The name of the counter you want to set the value of - supports expression language like ${counterName} |
| delta | Adjusts the counter by the specified delta for each flow file received. May be a positive or negative integer. |

## Relationships

| Name | Description |
| --- | --- |
| success | Counter was updated/retrieved |

---
title: UpdateDatabaseTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatedatabasetable.md
section: Loading & Unloading Data
---

# UpdateDatabaseTable 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

This processor uses a JDBC connection and incoming records to generate any database table changes needed to support the incoming records. It expects a ‘flat’ record layout, meaning none of the top-level record fields has nested fields that are intended to become columns themselves.

## Tags

alter, database, jdbc, metadata, table, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Column Name Translation Pattern | Column name will be normalized with this regular expression |
| Column Name Translation Strategy | The strategy used to normalize table column name. Column Name will be uppercased to do case-insensitive matching irrespective of strategy |
| Database Dialect Service | Database Dialect Service for generating statements specific to a particular service or vendor. |
| db-type | Database Type for generating statements specific to a particular service or vendor. The Generic Type supports most cases but selecting a specific type enables optimal processing or additional features. |
| record-reader | The service for reading incoming flow files. The reader is only used to determine the schema of the records, the actual records will not be processed. |
| updatedatabasetable-catalog-name | The name of the catalog that the statement should update. This may not apply for the database that you are updating. In this case, leave the field empty. Note that if the property is set and the database is case-sensitive, the catalog name must match the database’s catalog name exactly. |
| updatedatabasetable-create-table | Specifies how to process the target table when it does not exist (create it, fail, e.g.). |
| updatedatabasetable-dbcp-service | The Controller Service that is used to obtain connection(s) to the database |
| updatedatabasetable-primary-keys | A comma-separated list of record field names that uniquely identifies a row in the database. This property is only used if the specified table needs to be created, in which case the Primary Key Fields will be used to specify the primary keys of the newly-created table. IMPORTANT: Primary Key Fields must match the record field names exactly unless ‘Quote Column Identifiers’ is false and the database allows for case-insensitive column names. In practice it is best to specify Primary Key Fields that exactly match the record field names, and those will become the column names in the created table. |
| updatedatabasetable-query-timeout | Sets the number of seconds the driver will wait for a query to execute. A value of 0 means no timeout. NOTE: Non-zero values may not be supported by the driver. |
| updatedatabasetable-quoted-column-identifiers | Enabling this option will cause all column names to be quoted, allowing you to use reserved words as column names in your tables and/or forcing the record field names to match the column names exactly. |
| updatedatabasetable-quoted-table-identifiers | Enabling this option will cause the table name to be quoted to support the use of special characters in the table name and/or forcing the value of the Table Name property to match the target table name exactly. |
| updatedatabasetable-record-writer | Specifies the Controller Service to use for writing results to a FlowFile. The Record Writer should use Inherit Schema to emulate the inferred schema behavior, i.e. an explicit schema need not be defined in the writer, and will be supplied by the same logic used to infer the schema from the column types. If Create Table Strategy is set ‘Create If Not Exists’, the Record Writer ‘s output format must match the Record Reader’s format in order for the data to be placed in the created table location. Note that this property is only used if ‘Update Field Names’ is set to true and the field names do not all match the column names exactly. If no update is needed for any field names (or ‘Update Field Names’ is false), the Record Writer is not used and instead the input FlowFile is routed to success or failure without modification. |
| updatedatabasetable-schema-name | The name of the database schema that the table belongs to. This may not apply for the database that you are updating. In this case, leave the field empty. Note that if the property is set and the database is case-sensitive, the schema name must match the database’s schema name exactly. |
| updatedatabasetable-table-name | The name of the database table to update. If the table does not exist, then it will either be created or an error thrown, depending on the value of the Create Table property. |
| updatedatabasetable-translate-field-names | If true, the Processor will attempt to translate field names into the corresponding column names for the table specified, for the purposes of determining whether the field name exists as a column in the target table. NOTE: If the target table does not exist and is to be created, this property is ignored and the field names will be used as-is. If false, the field names must match the column names exactly, or the column may not be found and instead an error my be reported that the column already exists. |
| updatedatabasetable-update-field-names | This property indicates whether to update the output schema such that the field names are set to the exact column names from the specified table. This should be used if the incoming record field names may not match the table ‘s column names in terms of upper- and lower-case. For example, this property should be set to true if the output FlowFile is destined for Oracle e.g., which expects the field names to match the column names exactly. NOTE: The value of the’Translate Field Names’ property is ignored when updating field names; instead they are updated to match the column name as returned by the database. |

## Relationships

| Name | Description |
| --- | --- |
| failure | A FlowFile containing records routed to this relationship if the record could not be transmitted to the database. |
| success | A FlowFile containing records routed to this relationship after the record has been successfully transmitted to the database. |

## Writes attributes

| Name | Description |
| --- | --- |
| output.table | This attribute is written on the flow files routed to the ‘success’ and ‘failure’ relationships, and contains the target table name. |
| output.path | This attribute is written on the flow files routed to the ‘success’ and ‘failure’ relationships, and contains the path on the file system to the table (or partition location if the table is partitioned). |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer, only if a Record Writer is specified and Update Field Names is ‘true’. |
| record.count | Sets the number of records in the FlowFile, only if a Record Writer is specified and Update Field Names is ‘true’. |

---
title: UpdateRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updaterecord.md
section: Loading & Unloading Data
---

# UpdateRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Updates the contents of a FlowFile that contains Record-oriented data (i.e., data that can be read via a RecordReader and written by a RecordWriter). This Processor requires that at least one user-defined Property be added. The name of the Property should indicate a RecordPath that determines the field that should be updated. The value of the Property is either a replacement value (optionally making use of the Expression Language) or is itself a RecordPath that extracts a value from the Record. Whether the Property value is determined to be a RecordPath or a literal value depends on the configuration of the <Replacement Value Strategy> Property.

## Tags

avro, csv, freeform, generic, json, log, logs, record, schema, text, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Record Reader | Specifies the Controller Service to use for reading incoming data |
| Record Writer | Specifies the Controller Service to use for writing out the records |
| Replacement Value Strategy | Specifies how to interpret the configured replacement values |

## Relationships

| Name | Description |
| --- | --- |
| failure | If a FlowFile cannot be transformed from the configured input format to the configured output format, the unchanged FlowFile will be routed to this relationship |
| success | FlowFiles that are successfully transformed will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| record.index | This attribute provides the current row index and is only available inside the literal value expression. |
| record.error.message | This attribute provides on failure the error message encountered by the Reader or Writer. |

## Use cases

|  |
| --- |
| Combine multiple fields into a single field. |
| Change the value of a record field to an explicit value. |
| Copy the value of one record field to another record field. |
| Enrich data by injecting the value of an attribute into each Record. |
| Change the format of a record field’s value. |

## See also

* [org.apache.nifi.processors.standard.ConvertRecord](convertrecord.md)

---
title: UpdateSnowflakeDatabase 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflakedatabase.md
section: Loading & Unloading Data
---

# UpdateSnowflakeDatabase 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Updates the definition of a Snowflake table based on the schema provided in the incoming FlowFile. The schema is expected to be in JSON with the following format, regardless of whether it is provided via FlowFile content or specified as a property: { “columns”: [ { “name”: “<column name>”, “type”: “<column type>”, “nullable”: <true/false>, “precision”: <precision, only for numeric type>, “scale”: <scale, only for numeric type> }, … ], “primaryKeys”: [“<name of first primary key column>”, “<name of second primary key column>”, …] }

## Tags

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Add Column Strategy | The strategy to use when the incoming schema has a column that is not present in the existing table |
| Add Not Null Strategy | The strategy to use when the incoming schema has a not-null constraint that is not present in the existing table |
| Alter Column Type Strategy | The strategy to use when the existing table has a column with a different type than the incoming schema. |
| Column Name Transformation | An optional transformation that can be applied to the names of columns defined in the schema. This transformation is applied to the column names before they are compared to the existing columns in the table. This property can reference the following variables via Expression Language, in addition to attributes: `column.name`, `column.type`, `column.nullable`, `column.precision`, `column.scale`, `column.primaryKey`. |
| Column Removal Strategy | The strategy to use when the existing table has a column that is not present in the incoming schema |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Create Stream | Whether or not to create a Snowflake Stream for the table |
| Creation Parameters | Additional parameters to include in the CREATE TABLE statement. For example, ‘CLUSTER BY (column_name)’ |
| Desired Schema | The desired schema / table definition |
| Drop Column Strategy | The strategy to use when the existing table has a column that is not present in the incoming schema |
| Drop Not Null Strategy | The strategy to use when the existing table has a not-null constraint that is not present in the incoming schema |
| Include Default Values | Whether or not to include DEFAULT values in CREATE TABLE or ALTER TABLE ADD COLUMN statements |
| Include Not Null Constraints | Whether or not to include NOT NULL constraints in CREATE TABLE or ALTER TABLE ADD COLUMN statements |
| Include Primary Key Constraints | Whether or not to include primary key constraints in the creation statement |
| Max Batch Size | The maximum number of FlowFiles that can be processed in a single execution for a given table. |
| Modify Primary Key Strategy | The strategy to use when the incoming schema has a primary key that differs from the existing primary key. Modifying the Primary Key requires dropping the existing one, if any, and adding a new one. |
| Record Reader | Record Reader to use for obtaining the desired schema |
| Removed Column Name Suffix | The suffix to append to a column that was removed. For example, to rename column ‘foo’ to ‘foo__deleted’, the property can be set to `__deleted` |
| Schema Name | The name of the schema to update |
| Stream Creation Parameters | Additional parameters to include in the CREATE STREAM statement. For example, ‘APPEND_ONLY=TRUE’ |
| Stream Name | The name of the stream |
| Table Metadata Cache Expiration Time | The time in seconds after which the cache entry will be removed |
| Table Name | The name of the table to update or create stream on |
| Table Schema Strategy | Specifies how to obtain the desired schema / table definition |
| Table Stream Creation Parameters | Parameters to include in the CREATE STREAM statement. For example, ‘APPEND_ONLY=TRUE’. The stream will be created along with the table as it’s source. |
| Table Stream Name | The name of the stream created along with the table. Stream source will be the created table. |
| Update Type | The type of update to perform |
| Use Table Metadata Cache | Whether to cache table’s metadata instead of reading it directly from Snowflake. Applies to [Create Table If Not Exists, Alter Table] |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the table cannot be updated |
| success | The incoming FlowFile is routed to this relationship after the table has been updated successfully |

## Writes attributes

| Name | Description |
| --- | --- |
| schema.hash | A SHA-256 hash of the final table schema after all updates have been completed. Can be used for change detection and caching purposes. |

---
title: UpdateSnowflakeIcebergDatabase 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflakeicebergdatabase.md
section: Loading & Unloading Data
---

# UpdateSnowflakeIcebergDatabase 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Updates the definition of a Snowflake Iceberg table. A target schema can be inferred from a RecordReader or defined explicitly using the format below: { “columns”: [ { “name”: “<column name>”, “type”: “<iceberg data type>” }, … ] } where <iceberg data type> can be one of: - primitive iceberg type (“string”, “int”, “boolean”,…) - decimal with given precision and scale (“decimal(P,S)”) - {“type”: “list”, “element”: <iceberg data type>} - {“type”: “map”, “key”: <iceberg data type>, “value”: <iceberg data type>} - {“type”: “struct”, “fields”:[<list of struct fields>] }

## Tags

iceberg

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Add Column Strategy | The strategy to use when the incoming schema has a column that is not present in the existing table |
| Alter Column Strategy | The strategy to use when a column has different data type in the incoming schema from the existing table |
| Alter Column Type Strategy | The strategy to use when the existing table has a column with a different type than the incoming schema. |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Desired Schema | The desired schema / table definition |
| Drop Column Strategy | The strategy to use when the existing table has a column that is not present in the incoming schema |
| Max Batch Size | The maximum number of FlowFiles that can be processed in a single execution for a given table. |
| Record Reader | Record Reader to use for obtaining the desired schema |
| Schema Name | The name of the schema to update |
| Table Metadata Cache Expiration Time | The time in seconds after which the cache entry will be removed |
| Table Name | The name of the table to update |
| Table Schema Strategy | Specifies how to obtain the desired schema / table definition |
| Use Table Metadata Cache | Whether to cache table’s metadata instead of reading it directly from Snowflake |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the table cannot be updated |
| illegal alteration | The incoming FlowFile is routed to this relationship if the update requires an alteration that is configured to fail |
| success | The incoming FlowFile is routed to this relationship after the table has been updated successfully |
| table not found | The incoming FlowFile is routed to this relationship if the specified table does not exist. |

## Writes attributes

| Name | Description |
| --- | --- |
| schema.hash | A hexadecimal-encoded SHA-256 hash of the final table schema after all updates have been completed. |

---
title: UpdateSnowflakeSchema 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflakeschema.md
section: Loading & Unloading Data
---

# UpdateSnowflakeSchema 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Creates Snowflake database schema if it does not exist.

## Tags

create, ddl, preview, schema, snowflake

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Object Identifier Resolution | Controls how source object identifiers (schemas, tables, columns) are stored and queried in Snowflake. This setting determines whether you will need to use double quotes in your SQL queries. |
| Schema Creation Cache Expiration Time | The time after which the cache entry will be removed |
| Schema Name | The name of the schema to create |
| Use Schema Creation Cache | Whether to cache schema’s creation instead of executing CREATE SCHEMA IF NOT EXISTS statement for each FlowFile. |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the schema cannot be created |
| success | The incoming FlowFile is routed to this relationship after the schema has been created successfully |

---
title: UpdateSnowflakeStream 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflakestream.md
section: Loading & Unloading Data
---

# UpdateSnowflakeStream 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Manages Snowflake streams by creating, dropping, or replacing them based on the configured operation. Streams in Snowflake capture data change for tables and can be used to track DML changes over time.

## Tags

cdc, create, drop, preview, replace, snowflake, stream, table

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Object Identifier Resolution | Controls how source object identifiers (schemas, tables, columns) are stored and queried in Snowflake. This setting determines whether you will need to use double quotes in your SQL queries. |
| Schema Name | The name of the schema containing the stream and/or source table |
| Source Table Name | The name of the source table for the stream |
| Stream Creation Parameters | Additional parameters to include in the CREATE STREAM statement. For example, ‘APPEND_ONLY=TRUE SHOW_INITIAL_ROWS=TRUE’ |
| Stream Name | The name of the stream to create, drop, or replace |
| Update Type | The type of stream operation to perform |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the stream operation cannot be completed |
| object not found | The incoming FlowFile is routed to this relationship if the specified stream or source table does not exist. |
| success | The incoming FlowFile is routed to this relationship after the stream operation has been completed successfully |

---
title: UpdateSnowflakeTable 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflaketable.md
section: Loading & Unloading Data
---

# UpdateSnowflakeTable 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Updates the definition of a Snowflake table based on the schema provided in the incoming FlowFile. The schema is expected to be in JSON with the following format, regardless of whether it is provided via FlowFile content or specified as a property: { “columns”: [ { “name”: “<column name>”, “type”: “<column type>”, “nullable”: <true/false>, “precision”: <only for numeric type>, “scale”: <only for numeric type> }, … ], “primaryKeys”: [“<name of first primary key column>”, “<name of second primary key column>”, …] } This processor supports table-only operations: creating, altering, and dropping tables.

## Tags

alter, columns, create, ddl, drop, preview, snowflake, table, update

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Add Column Strategy | The strategy to use when the incoming schema has a column that is not present in the existing table |
| Add Not Null Strategy | The strategy to use when the incoming schema has a not-null constraint that is not present in the existing table |
| Alter Column Type Strategy | The strategy to use when the existing table has a column with a different type than the incoming schema. |
| Column Name Transformation | An optional transformation that can be applied to the names of columns defined in the schema. This transformation is applied to the column names before they are compared to the existing columns in the table. This property can reference the following variables via Expression Language, in addition to attributes: `column.name`, `column.type`, `column.nullable`, `column.precision`, `column.scale`, `column.primaryKey`.The result of applying transformations based on this property will be treated according to the setting of `Object Name Handling` property. |
| Column Removal Strategy | The strategy to use when the existing table has a column that is not present in the incoming schema |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Creation Parameters | Additional parameters to include in the CREATE TABLE statement. For example, ‘CLUSTER BY (column_name)’ |
| Desired Schema | The desired schema / table definition |
| Drop Column Strategy | The strategy to use when the existing table has a column that is not present in the incoming schema |
| Drop Not Null Strategy | The strategy to use when the existing table has a not-null constraint that is not present in the incoming schema |
| Include Default Values | Whether or not to include DEFAULT values in CREATE TABLE or ALTER TABLE ADD COLUMN statements |
| Include Not Null Constraints | Whether or not to include NOT NULL constraints in CREATE TABLE or ALTER TABLE ADD COLUMN statements |
| Include Primary Key Constraints | Whether or not to include primary key constraints in the creation statement |
| Max Batch Size | The maximum number of FlowFiles that can be processed in a single execution for a given table. |
| Modify Primary Key Strategy | The strategy to use when the incoming schema has a primary key that differs from the existing primary key. Modifying the Primary Key requires dropping the existing one, if any, and adding a new one. |
| Object Identifier Resolution | Controls how source object identifiers (schemas, tables, columns) are stored and queried in Snowflake. This setting determines whether you will need to use double quotes in your SQL queries. |
| Record Reader | Record Reader to use for obtaining the desired schema |
| Removed Column Name Suffix | The suffix to append to a column that was removed. For example, to rename column ‘foo’ to ‘foo__deleted’, the property can be set to `__deleted`. This property value will behave differently depending on the value of `Object Name Handling` property, i.e. If `Object Name Handling` is set to `Case Sensitive Name`, then the suffix will be appended as-is. If `Object Name Handling` is set to `SQL Identifier`, then the suffix and must consist of only letters, numbers, dollar sign ($), and underscore (_) characters, additionally it will be appended as case-insensitive or case-sensitive depending on the column name it is being appended to is case-insensitive (not double-quoted) or case-sensitive (double-quoted) respectively. |
| Schema Name | The name of the schema containing the table |
| Table Metadata Cache Expiration Time | The time in seconds after which the cache entry will be removed |
| Table Name | The name of the table to update |
| Table Schema Strategy | Specifies how to obtain the desired schema / table definition |
| Update Type | The type of table update to perform |
| Use Table Metadata Cache | Whether to cache table’s metadata instead of reading it directly from Snowflake. Applies to [Create Table If Not Exists, Alter Table] |

## Relationships

| Name | Description |
| --- | --- |
| failure | The incoming FlowFile is routed to this relationship if the table cannot be updated |
| success | The incoming FlowFile is routed to this relationship after the table has been updated successfully |

## Writes attributes

| Name | Description |
| --- | --- |
| schema.hash | A SHA-256 hash of the final table schema after all updates have been completed. Can be used for change detection and caching purposes. |

---
title: UpdateSnowflakeView 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatesnowflakeview.md
section: Loading & Unloading Data
---

# UpdateSnowflakeView 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-snowflake-processors-nar

## Description

Creates or replaces Snowflake views based on column mappings provided in the incoming FlowFile. The processor checks if the view exists and only recreates it if the definition has changed. The FlowFile content should contain JSON with column mappings, optional join configuration, and optional flatten configuration: { “columns”: [ { “source_field”: “customer_data:id”, “destination_column”: “customer_id”, “type”: “VARCHAR” }, { “source_field”: “f.value:order_amount”, “destination_column”: “order_amount”, “type”: “NUMBER” }, { “expression”: “SUM(f.value:order_amount::NUMBER)”, “destination_column”: “total_amount” }, { “expression”: “COUNT(\*)”, “destination_column”: “order_count” } ], “from”: { “table”: “raw_data”, “alias”: “rd”, “joins”: [ { “type”: “INNER”, “table”: “customers”, “alias”: “c”, “on”: “customer_data:id::VARCHAR = c.customer_id” } ] }, “flatten”: [ { “input”: “rd.orders”, “alias”: “f”, “path”: null } ], “where”: “active = true AND status =’VALID’”, “group_by”: [“customer_id”, “region”], “order_by”: [“order_amount DESC”, “customer_id ASC”] } Column configuration supports: - source_field: Simple field/column reference (supports JSON notation like “data:field” or table aliases like “t.column”) - expression: Complex SQL expression (e.g., “SUM(amount)”, “COUNT(\*)”) - destination_column: The output column name in the view (optional - auto-generated if not provided) - type: Snowflake data type for automatic type casting (VARCHAR, NUMBER, BOOLEAN, DATE, TIMESTAMP, etc.) Use either source_field OR expression, not both. When type is specified, automatic type casting is applied. When type is omitted, the expression is used as-is without casting. Flatten configuration supports: - input: The nested field/column to flatten (required) - alias: Alias for the flattened data (required) - path: Optional path within the nested structure The “from” section is required and specifies the source table and optional joins. Optional SQL clauses can be included: - where: WHERE clause condition (e.g., “active = true AND status =’VALID’”) - group_by: GROUP BY clause as an array of column names (e.g., [“customer_id”, “region”]) - order_by: ORDER BY clause as an array of column/expression with direction (e.g., [“order_amount DESC”, “customer_id ASC”])

## Tags

flatten, view

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Connection Pool | The connection pool to use to connect to Snowflake |
| Schema Name | The name of the schema where the view will be created |
| Secure | Whether to create a secure view. Secure views hide the view definition from unauthorized users. |
| View Name | The name of the view to create or update |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that failed to be processed |
| success | FlowFiles that were successfully processed |
| unchanged | FlowFiles where the view already exists and hasn’t changed |

---
title: UpdateTableState 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/updatetablestate.md
section: Loading & Unloading Data
---

# UpdateTableState 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Updates the state of a table in the Table State Service

## Tags

snowflake, state, table

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| CDC Schema Registry | When the state of the table is removed, the table will also be removed from the specified CDC Schema Registry. |
| Desired State | The desired state of the table |
| Overwrite Existing | Whether to overwrite the existing state of the table. If false, the state will only be updated if the state is currently unknown. |
| Schema Name | The name of the table’s schema |
| Table Name | The name of the table |
| Table State Service | The Table State Service to update |

## Relationships

| Name | Description |
| --- | --- |
| comms failure | A FlowFile is routed to this relationship if the table state could not be updated due to a communication failure with the Table State Service |
| state exists | A FlowFile is routed to this relationship if the table state was not updated because the state is already known for the table and the ‘Overwrite Existing’ property is set to ‘false’ |
| success | A FlowFile is routed to this relationship after the table state has been updated |

## Writes attributes

| Name | Description |
| --- | --- |
| table.state | The state of the table after updating the Table State Service |
| previous.table.state | The state of the table before the Table State Service was updated |

---
title: UpsertMilvus 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/upsertmilvus.md
section: Loading & Unloading Data
---

# UpsertMilvus 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-milvus-processors-nar

## Description

Upserts vectors into Milvus database for a given collection

## Tags

chatbot, embeddings, gen ai, genai, generative ai, insert, llm, metadata, milvus, openflow, publish, text, upsert, vector

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Collection Name | The name of the Milvus collection name to use |
| ID Field Name | The name of the field in Milvus to use for storing the IDs of vectors. If a record path is not provided along with the field name the IDs will be generated based on the filename in the format of a string. |
| ID Record Path | The path to the ID field in the record |
| Max Batch Size | If the number of Records in a FlowFile is large, creating a single request to Milvus can consume significant amounts of NiFi heap. In order to avoid this, the Max Batch Size can limit the number of Records to send in a single request. |
| Metadata Field Name | The name of the field to use for storing other metadata associated with the vectors. This data must be in the format of valid json. |
| Metadata Record Path | The path to the metadata field in the record |
| Milvus Connection Service | Connection Service for accessing Milvus Database |
| Partition | Partition of the vector database that you want to perform operations in. If the database has only one partition leave empty. |
| Record Reader | The Record Reader to use for reading the FlowFile |
| Sparse Vector Field Name | The name of the field to use for storing the sparse vectors. |
| Sparse Vector Indices Path | If, Sparse Vectors are to be provided, this RecordPath points to the indices of the sparse data to use. |
| Sparse Vector Values Path | If, Sparse Vectors are to be provided, this RecordPath points to the values of the sparse data to use. |
| Text Field Name | The name of the field in Milvus to use for storing the text associated with the vectors. |
| Text Record Path | The path to the field in the record that contains the text associated with the vectors. If specified, the text will be inserted under the text field in Milvus. If not specified, the text will not be sent to the Milvus database. |
| Vector Field Name | The name of the field in Milvus to use for storing the vectors. |
| Vector Record Path | The path to the vector field in the record |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Milvus, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Milvus, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Milvus are routed to this relationship |

## See also

* [com.snowflake.openflow.runtime.processors.milvus.DeleteMilvus](deletemilvus.md)

---
title: UpsertPinecone 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/upsertpinecone.md
section: Loading & Unloading Data
---

# UpsertPinecone 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-pinecone-nar

## Description

Publishes vectors, including metadata, and optionally text, to a Pinecone index.

## Tags

chatbot, embeddings, gen ai, genai, generative ai, llm, metadata, openflow, pinecone, publish, text, upsert, vector

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| ID Record Path | The path to the ID field in the record |
| Max Batch Size | If the number of Records in a FlowFile is large, creating a single request to Pinecone can consume significant amounts of NiFi heap. In order to avoid this, the Max Batch Size can limit the number of Records to send in a single request. If the number of Records exceeds this value, multiple requests will be sent to Pinecone. |
| Metadata Record Path | The path to the metadata field in the record |
| Pinecone API Key | The API key for the Pinecone service |
| Pinecone Index | The name of the Pinecone index to use |
| Pinecone Namespace | The name of the Pinecone namespace to use |
| Record Reader | The Record Reader to use for reading the FlowFile |
| Sparse Vector Indices Path | If, Sparse Vectors are to be provided, this RecordPath points to the indices of the sparse data to use. |
| Sparse Vector Values Path | If, Sparse Vectors are to be provided, this RecordPath points to the values of the sparse data to use. |
| Text Field Name | The name of the field in the metadata to use for storing the text associated with the vectors. |
| Text Record Path | The path to the field in the record that contains the text associated with the vectors. If specified, the text will be inserted into the metadata when publishing to Pinecone. If not specified, the text will not be sent to Pinecone. |
| Vector Record Path | The path to the vector field in the record |
| Web Client Service | The Web Client Service to use for communicating with Pinecone |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be sent to Pinecone, and for which a retry is not expected to be successful, are routed to this relationship |
| retry | FlowFiles that fail to be sent to Pinecone, but for which a retry may help, are routed to this relationship |
| success | FlowFiles that are successfully sent to Pinecone are routed to this relationship |

## Use Cases Involving Other Components

|  |
| --- |
| Create embeddings for raw text data, or text that exists in a Record field such as JSON, using OpenAI’s embeddings model and publish the vectors to Pinecone. |
| Add embeddings for a document to a Pinecone index, replacing any embeddings that already exist for the document. |

## See also

* [com.snowflake.openflow.runtime.processors.openai.CreateOpenAiEmbeddings](createopenaiembeddings.md)
* [com.snowflake.openflow.runtime.processors.pinecone.DeletePinecone](deletepinecone.md)

---
title: UpsertSFDCObjects 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/upsertsfdcobjects.md
section: Loading & Unloading Data
---

# UpsertSFDCObjects 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-salesforce-processors-nar

## Description

Upserts the records from the incoming FlowFile into Salesforce

## Tags

insert, objects, preview, salesforce, sfdc, update, upsert

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Object Name | The name of the object type for the records included in the FlowFile. |
| Record Reader | Specifies the Controller Service to use for reading incoming data. Each record will be converted into a JSON object and upserted into Salesforce using a dedicated API call. |
| Salesforce Client | Salesforce Client to interact with the APIs |

## Relationships

| Name | Description |
| --- | --- |
| comms.failure | The FlowFile is routed to this relationship if any record could not be upserted in Salesforce but the operation might be retried |
| failure | The FlowFile is routed to this relationship if any record could not be upserted in Salesforce |
| success | The FlowFile is routed to this relationship after all records have been successfully upserted |

## Writes attributes

| Name | Description |
| --- | --- |
| sObjectId | ID of the created object in Salesforce when using this processor with a single record. |

## See also

* [com.snowflake.openflow.runtime.processors.salesforce.DeleteQueryJob](deletequeryjob.md)
* [com.snowflake.openflow.runtime.processors.salesforce.DescribeSFDCObject](describesfdcobject.md)
* [com.snowflake.openflow.runtime.processors.salesforce.GetQueryJobResult](getqueryjobresult.md)
* [com.snowflake.openflow.runtime.processors.salesforce.SubmitQueryJob](submitqueryjob.md)

---
title: Use the Openflow Connector for Google BigQuery
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/connectors/google-big-query/use.md
section: Loading & Unloading Data
---

# Use the Openflow Connector for Google BigQuery

> **Note:**
>
> This connector is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

This topic describes tasks you may need to perform after installing and configuring the
connector.

## Remove and re-add a table for replication

To remove a table from replication:

1. Verify the table’s state in the Table State Store.
2. If the state is `INCREMENTAL_IN_PROGRESS`, stop the **Trigger BigQuery Cdc On Incremental** processor.
   Wait for the state to change to `INCREMENTAL_REPLICATION`.
3. Remove the table from the **Included Table Names** or **Included Table Names Regex** parameters in the BigQuery Ingestion Parameters context.

To re-add a table for replication:

1. Drop the destination table in Snowflake.
2. Add the table back to the **Included Table Names** or **Included Table Names Regex** parameters.

This approach can also be used to recover from a failed table replication scenario.

---
title: Validate your BYOC deployment
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/byoc-validate-vpc-config.md
section: Loading & Unloading Data
---

# Validate your BYOC deployment

This topic describes how to use BYOC Pre-flight Validation to verify your AWS
network configuration for Openflow deployments.

## About BYOC Pre-flight Validation

BYOC Pre-flight Validation is a script that verifies your AWS network
environment is ready for an Openflow deployment. It checks that required
networking, connectivity, and access settings are in place.

Use this tool to identify and resolve network or access misconfigurations
before deployment. This helps prevent failures and ensures a smoother rollout
by providing specific feedback and actionable guidance for any issues found.

There are two versions of this script:

`byoc-validator.sh`:
:   Verifies that your AWS environment is ready for a new Openflow deployment.

`byo-vpc-validator.sh`:
:   Verifies that your existing VPC is configured correctly for Openflow.

## What does BYOC Pre-flight Validation review?

BYOC Pre-flight Validation performs a pre-deployment review that verifies your
existing AWS setup, identifies issues, and explains what needs to be corrected.

BYOC Pre-flight Validation checks the following:

* Prerequisites (applies only to existing VPCs)

  + VPC components such as subnets, gateways, and routing
  + Network sizing and placement across availability zones
  + Required resource tags
* Network Connectivity

  + Access to Openflow services and endpoints
  + Image registry access for required containers
  + Connectivity to core AWS services
* Permissions

  + Security group rules
  + Required IAM permissions
  + Encryption key access when needed

## When to use BYOC Pre-flight Validation?

Use BYOC Pre-flight Validation:

* Before your initial Openflow deployment
* After AWS networking changes that might impact connectivity
* During troubleshooting to confirm your setup
* When migrating Openflow to a new VPC or AWS account

## Download the CloudFormation template for BYOC Pre-flight Validation

Follow these steps to set up BYOC Pre-flight Validation in your AWS environment:

1. Create a new BYOC deployment in the Openflow Control Plane.
2. Download the CloudFormation template for BYOC Pre-flight Validation.

   To download the CloudFormation template for BYOC Pre-flight Validation, click
   Download Validator in the confirmation dialog that appears after
   creating the deployment.
3. Apply the BYOC Pre-flight Validation CloudFormation template in AWS.
4. Access the EC2 instance where BYOC Pre-flight Validation is installed.

## Configure the CloudFormation template for BYOC Pre-flight Validation

The CloudFormation template for the BYOC validator includes defaults for all
parameters, and those defaults should not be changed.

The CloudFormation template for the BYO-VPC validator includes defaults for most
parameters, and those defaults should not be changed. However, the following
parameters do not have defaults and must be provided, using the inputs you plan
to use for the actual deployment:

`InfraVPC`
:   Select an existing VPC.

`PrivateSubnet1`
:   The first private subnet for Openflow runtimes.

`PrivateSubnet2`
:   The second private subnet for the EKS control plane.

`PrivateSecurityGroup`
:   Security group for the agent instance, EC2 Instance Connect endpoint, and EKS
    cluster.

`EBSKMSKeyArn`
:   Optional KMS key ARN for encrypted EBS volumes.

## Run BYOC Pre-flight Validation and view results

Follow these steps to run BYOC Pre-flight Validation:

1. Connect to the EC2 instance where BYOC Pre-flight Validation is installed.
2. Run the BYOC Pre-flight Validation script from the home directory:

   ```bash
   /home/ec2-user/byoc-validator.sh
   ```

   You can run BYOC Pre-flight Validation as many times as needed.
3. Review the output file in the `home` directory:

   Each run produces a new, timestamped results file, for example:
   `/home/ec2-user/byoc-validation-results-YYYYMMDDHHMMSS.txt`
4. Open and inspect the results:

   Use a tool of your preference to read the output and review pass/fail
   messages.

Follow these steps to run BYOC Pre-flight Validation for an existing VPC:

1. Connect to the EC2 instance where BYOC Pre-flight Validation is installed.
2. Run the BYOC Pre-flight Validation script in the home directory:

   ```bash
   /home/ec2-user/byo-vpc-validator.sh
   ```

   You can run BYOC Pre-flight Validation as many times as needed.
3. Review the output file in the `home` directory:

   Each run produces a new, timestamped results file, for example:
   `/home/ec2-user/byo-vpc-validation-results-YYYYMMDDHHMMSS.txt`
4. Open and inspect the results:

   Use a tool of your preference to read the output and review pass/fail
   messages.

## Example output

The following example shows a successful validation output:

```text
2026-01-15 11:43:37,599 - INFO - Starting BYO-VPC validation suite...
2026-01-15 11:43:37,599 - INFO - ============================================================
...
2026-01-15 11:43:37,599 - INFO - Starting Prerequisites validation...
2026-01-15 11:43:37,704 - INFO - Running validation rule: internet_gateway
2026-01-15 11:43:38,538 - INFO - ✅ internet_gateway: Internet Gateway validation passed
...
2026-01-15 11:43:39,769 - INFO - Prerequisites Summary: 4/4 rules passed
2026-01-15 11:43:39,769 - INFO - --------------------------------------------------
2026-01-15 11:43:39,769 - INFO - Starting Network validation...
2026-01-15 11:43:39,780 - INFO - Running validation rule: snowflake_authentication
2026-01-15 11:43:41,130 - INFO - ✅ snowflake_authentication: Snowflake OAuth authentication successful
...
2026-01-15 11:43:55,920 - INFO - Network Summary: 7/7 rules passed
2026-01-15 11:43:55,920 - INFO - --------------------------------------------------
2026-01-15 11:43:55,920 - INFO - Starting Permissions validation...
2026-01-15 11:43:55,946 - INFO - Running validation rule: private_security_group
2026-01-15 11:43:56,766 - INFO - ✅ private_security_group: Private security group validation passed
...
2026-01-15 11:43:57,560 - INFO - Permissions Summary: 2/2 rules passed
2026-01-15 11:43:57,560 - INFO - ============================================================
2026-01-15 11:43:57,560 - INFO - 🎉 Openflow compatibility checker completed successfully!
```

The output highlights each check with a status icon:

* ✅ - The requirement is met.
* ❌ - The requirement is not met, and action is needed.

## AWS permissions required

The CloudFormation template creates an IAM role with the necessary permissions
for the EC2 instance where BYOC Pre-flight Validation is installed. If your
organization uses custom IAM controls, ensure the instance role includes the
following permissions:

* Required to access the Snowflake OAuth secret created by the template:

  + `secretsmanager:GetSecretValue`
* Required to inspect network resources:

  + `ec2:DescribeInternetGateways`
  + `ec2:DescribeSubnets`
  + `ec2:DescribeRouteTables`
  + `ec2:DescribeNATGateways`
  + `ec2:DescribeSecurityGroups`
* Required only when validating an optional EBS KMS key:

  + `kms:DescribeKey`
  + `kms:GetKeyPolicy`

The Secrets Manager permission is scoped to the BYOC Pre-flight Validation
secret created by the template. The EC2 and KMS actions can be scoped to `*`
(read-only metadata).

## Cleanup

After validation is complete, you can delete BYOC Pre-flight Validation to
avoid ongoing AWS costs. To delete BYOC Pre-flight Validation, delete the
CloudFormation stack used to create it. This automatically removes the EC2
instance, the IAM role, and the Secrets Manager secret.

---
title: ValidateCsv 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/validatecsv.md
section: Loading & Unloading Data
---

# ValidateCsv 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Validates the contents of FlowFiles or a FlowFile attribute value against a user-specified CSV schema. Take a look at the additional documentation of this processor for some schema examples.

## Tags

csv, schema, validation

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| CSV Source Attribute | The name of the attribute containing CSV data to be validated. If this property is blank, the FlowFile content will be validated. |
| Max Lines Per Row | The maximum number of lines that a row can span before an exception is thrown. This option allows the processor to fail fast when encountering CSV with mismatching quotes - the normal behaviour would be to continue reading until the matching quote is found, which could potentially mean reading the whole file (and exhausting all available memory). Zero value will disable this option. |
| validate-csv-delimiter | Character used as ‘delimiter’ in the incoming data. Example: , |
| validate-csv-eol | Symbols used as ‘end of line’ in the incoming data. Example: n |
| validate-csv-header | True if the incoming flow file contains a header to ignore, false otherwise. |
| validate-csv-quote | Character used as ‘quote’ in the incoming data. Example: “ |
| validate-csv-schema | The schema to be used for validation. Is expected a comma-delimited string representing the cell processors to apply. The following cell processors are allowed in the schema definition: [ParseBigDecimal, ParseBool, ParseChar, ParseDate, ParseDouble, ParseInt, ParseLong, Optional, DMinMax, Equals, ForbidSubStr, LMinMax, NotNull, Null, RequireHashCode, RequireSubStr, Strlen, StrMinMax, StrNotNullOrEmpty, StrRegEx, Unique, UniqueHashCode, IsIncludedIn]. Note: cell processors cannot be nested except with Optional. Schema is required if Header is false. |
| validate-csv-strategy | Strategy to apply when routing input files to output relationships. |
| validate-csv-violations | If true, the validation.error.message attribute would include the list of all the violations for the first invalid line. Note that setting this property to true would slightly decrease the performances as all columns would be validated. If false, a line is invalid as soon as a column is found violating the specified constraint and only this violation for the first invalid line will be included in the validation.error.message attribute. |

## Relationships

| Name | Description |
| --- | --- |
| invalid | FlowFiles that are not valid according to the specified schema, or no schema or CSV header can be identified, are routed to this relationship |
| valid | FlowFiles that are successfully validated against the schema are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| count.valid.lines | If line by line validation, number of valid lines extracted from the source data |
| count.invalid.lines | If line by line validation, number of invalid lines extracted from the source data |
| count.total.lines | If line by line validation, total number of lines in the source data |
| validation.error.message | For flow files routed to invalid, message of the first validation error |

---
title: ValidateJson 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/validatejson.md
section: Loading & Unloading Data
---

# ValidateJson 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Validates the contents of FlowFiles against a configurable JSON Schema. See json-schema.org for specification standards. This Processor does not support input containing multiple JSON objects, such as newline-delimited JSON. If the input FlowFile contains newline-delimited JSON, only the first line will be validated.

## Tags

JSON, schema, validation

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| JSON Schema | A URL or file path to the JSON schema or the actual JSON schema content |
| JSON Schema Registry | Specifies the Controller Service to use for the JSON Schema Registry |
| JSON Schema Version | The JSON schema specification |
| Max String Length | The maximum allowed length of a string value when parsing the JSON document |
| Schema Access Strategy | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Name | Specifies the name of the schema to lookup in the Schema Registry property |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Schema configuration can reference resources over HTTP |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles that cannot be read as JSON are routed to this relationship |
| invalid | FlowFiles that are not valid according to the specified schema are routed to this relationship |
| valid | FlowFiles that are successfully validated against the schema are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| json.validation.errors | If the flow file is routed to the invalid relationship , this attribute will contain the error message resulting from the validation failure. |

---
title: ValidateRecord 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/validaterecord.md
section: Loading & Unloading Data
---

# ValidateRecord 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Validates the Records of an incoming FlowFile against a given schema. All records that adhere to the schema are routed to the “valid” relationship while records that do not adhere to the schema are routed to the “invalid” relationship. It is therefore possible for a single incoming FlowFile to be split into two individual FlowFiles if some records are valid according to the schema and others are not. Any FlowFile that is routed to the “invalid” relationship will emit a ROUTE Provenance Event with the Details field populated to explain why records were invalid. In addition, to gain further explanation of why records were invalid, DEBUG-level logging can be enabled for the “org.apache.nifi.processors.standard. ValidateRecord” logger.

## Tags

record, schema, validate

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Schema Access Strategy | Specifies how to obtain the schema that should be used to validate records |
| Schema Branch | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Registry | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | The text of an Avro-formatted Schema |
| Schema Version | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| allow-extra-fields | If the incoming data has fields that are not present in the schema, this property determines whether or not the Record is valid. If true, the Record is still valid. If false, the Record will be invalid due to the extra fields. |
| coerce-types | If enabled, the processor will coerce every field to the type specified in the Reader ‘s schema. If the value of a field cannot be coerced to the type, the field will be skipped (will not be read from the input data), thus will not appear in the output. If not enabled, then every field will appear in the output but their types may differ from what is specified in the schema. For details please see the Additional Details page of the processor’s Help. This property controls how the data is read by the specified Record Reader. |
| invalid-record-writer | If specified, this Controller Service will be used to write out any records that are invalid. If not specified, the writer specified by the “Record Writer” property will be used with the schema used to read the input records. This is useful, for example, when the configured Record Writer cannot write data that does not adhere to its schema (as is the case with Avro) or when it is desirable to keep invalid records in their original format while converting valid records to another format. |
| maximum-validation-details-length | Specifies the maximum number of characters that validation details value can have. Any characters beyond the max will be truncated. This property is only used if ‘Validation Details Attribute Name’ is set |
| record-reader | Specifies the Controller Service to use for reading incoming data |
| record-writer | Specifies the Controller Service to use for writing out the records. Regardless of the Controller Service schema access configuration, the schema that is used to validate record is used to write the valid results. |
| strict-type-checking | If the incoming data has a Record where a field is not of the correct type, this property determines how to handle the Record. If true, the Record will be considered invalid. If false, the Record will be considered valid and the field will be coerced into the correct type (if possible, according to the type coercion supported by the Record Writer). This property controls how the data is validated against the validation schema. |
| validation-details-attribute-name | If specified, when a validation error occurs, this attribute name will be used to leave the details. The number of characters will be limited by the property ‘Maximum Validation Details Length’. |

## Relationships

| Name | Description |
| --- | --- |
| failure | If the records cannot be read, validated, or written, for any reason, the original FlowFile will be routed to this relationship |
| invalid | Records that are not valid according to the schema will be routed to this relationship |
| valid | Records that are valid according to the schema will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| mime.type | Sets the mime.type attribute to the MIME Type specified by the Record Writer |
| record.count | The number of records in the FlowFile routed to a relationship |

---
title: ValidateXml 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/validatexml.md
section: Loading & Unloading Data
---

# ValidateXml 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Validates XML contained in a FlowFile. By default, the XML is contained in the FlowFile content. If the ‘XML Source Attribute’ property is set, the XML to be validated is contained in the specified attribute. It is not recommended to use attributes to hold large XML documents; doing so could adversely affect system performance. Full schema validation is performed if the processor is configured with the XSD schema details. Otherwise, the only validation performed is to ensure the XML syntax is correct and well-formed, e.g. all opening tags are properly closed.

## Tags

schema, validation, xml, xsd

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Schema File | The file path or URL to the XSD Schema file that is to be used for validation. If this property is blank, only XML syntax/structure will be validated. |
| XML Source Attribute | The name of the attribute containing XML to be validated. If this property is blank, the FlowFile content will be validated. |

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| reference remote resources | Schema configuration can reference resources over HTTP |

## Relationships

| Name | Description |
| --- | --- |
| invalid | FlowFiles that are not valid according to the specified schema or contain invalid XML are routed to this relationship |
| valid | FlowFiles that are successfully validated against the schema, if provided, or verified to be well-formed XML are routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| validatexml.invalid.error | If the flow file is routed to the invalid relationship the attribute will contain the error message resulting from the validation failure. |

---
title: VerifyContentMAC 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/verifycontentmac.md
section: Loading & Unloading Data
---

# VerifyContentMAC 2025.10.9.21

## Bundle

org.apache.nifi | nifi-cipher-nar

## Description

Calculates a Message Authentication Code using the provided Secret Key and compares it with the provided MAC property

## Tags

Authentication, HMAC, MAC, Signing

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Message Authentication Code | The MAC to compare with the calculated value |
| Message Authentication Code Algorithm | Hashed Message Authentication Code Function |
| Message Authentication Code Encoding | Encoding of the Message Authentication Code |
| Secret Key | Secret key to calculate the hash |
| Secret Key Encoding | Encoding of the Secret Key |

## Relationships

| Name | Description |
| --- | --- |
| failure | Signature Verification Failed |
| success | Signature Verification Succeeded |

## Writes attributes

| Name | Description |
| --- | --- |
| mac.calculated | Calculated Message Authentication Code encoded by the selected encoding |
| mac.encoding | The Encoding of the Hashed Message Authentication Code |
| mac.algorithm | Hashed Message Authentication Code Algorithm |

---
title: VerifyContentPGP 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/verifycontentpgp.md
section: Loading & Unloading Data
---

# VerifyContentPGP 2025.10.9.21

## Bundle

org.apache.nifi | nifi-pgp-nar

## Description

Verify signatures using OpenPGP Public Keys

## Tags

Encryption, GPG, OpenPGP, PGP, RFC 4880, Signing

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| public-key-service | PGP Public Key Service for verifying signatures with Public Key Encryption |

## Relationships

| Name | Description |
| --- | --- |
| failure | Signature Verification Failed |
| success | Signature Verification Succeeded |

## Writes attributes

| Name | Description |
| --- | --- |
| pgp.literal.data.filename | Filename from Literal Data |
| pgp.literal.data.modified | Modified Date Time from Literal Data in milliseconds |
| pgp.signature.created | Signature Creation Time in milliseconds |
| pgp.signature.algorithm | Signature Algorithm including key and hash algorithm names |
| pgp.signature.hash.algorithm.id | Signature Hash Algorithm Identifier |
| pgp.signature.key.algorithm.id | Signature Key Algorithm Identifier |
| pgp.signature.key.id | Signature Public Key Identifier |
| pgp.signature.type.id | Signature Type Identifier |
| pgp.signature.version | Signature Version Number |

## See also

* [org.apache.nifi.processors.pgp.DecryptContentPGP](decryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.EncryptContentPGP](encryptcontentpgp.md)
* [org.apache.nifi.processors.pgp.SignContentPGP](signcontentpgp.md)

---
title: Version control for custom flows
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/version-control-custom-flows.md
section: Loading & Unloading Data
---

# Version control for custom flows

Openflow supports Registry Clients, including the GitHub Registry Client, which allows you to
use a Git repository to store and version your custom flow definitions. This enables standard software
development lifecycle (SDLC) practices, such as branching, pull
requests, code review, and environment promotion.

A common workflow is:

* Maintain a `main` branch representing your production flow definitions.
* Create feature branches for new development.
* Develop and commit changes on the Openflow canvas.
* Open pull requests, review with Flow Diff, and merge.

## Prerequisites

* A GitHub repository for storing flow definitions.
* A GitHub Personal Access Token with `repository` access.
* An Openflow Runtime with access to the Openflow canvas.
* Appropriate Snowflake role privileges on the Runtime Integration object.

## Step 1: Create a GitHub Registry Client

1. Create a repository in GitHub to store your flow definitions.
2. Generate a Personal Access Token (PAT) in GitHub with repository access permissions.
3. On the Openflow canvas, navigate to Controller Settings and create a new Registry Client.
4. Select GitHub Registry Client as the type.
5. Configure the Registry Client with:

   * Your GitHub repository URL.
   * The GitHub repository owner.
   * Your Personal Access Token for authentication.

## Step 2: Create and version a new flow

1. On the Openflow canvas, create a new Process Group for your flow.
2. Build your flow: add processors, configure connections, and set up your data pipeline.
3. Right-click the Process Group and select Start Version Control.
4. Choose the GitHub Registry Client you configured in
   Step 1.
5. Provide a flow name and an initial commit message.

After you save, the flow definition is committed to your GitHub repository. You can verify by
checking the repository in GitHub.

## Step 3: Use branches to manage changes

### Create a development branch

In your GitHub repository, create a new branch (for example, `dev` or a feature branch
like `feature/add-new-table`).

### Import and develop on the branch

1. On the Openflow canvas, import the flow from the GitHub Registry into a new Process Group by
   dragging the Import from Registry icon from the toolbar to the canvas.
2. When importing, select the target branch (for example, `dev`) to work against.
3. Make your changes to the flow inside the Process Group.
4. Commit your changes in Openflow. This pushes the updated flow definition to the selected
   branch in GitHub.

### Review and merge via pull request

1. In GitHub, open a pull request from your development branch to `main`.
2. Review the changes. Use the Snowflake Flow Diff GitHub Action
   (see Step 4) for human-readable diffs.
3. Merge the pull request after it’s approved.
4. Back on the Openflow canvas, update the `main` Process Group to pull the latest version from
   the `main` branch.

## Step 4: Set up Snowflake Flow Diff (GitHub Action)

Snowflake Flow Diff is a GitHub Action that makes flow changes human-readable by rendering
a visual diff of your pipeline changes directly in pull request conversations.

### Set up the workflow file

1. In your GitHub repository, create the file `.github/workflows/flowdiff.yml`.
2. Copy the workflow configuration from the
   [Snowflake Flow Diff repository](https://github.com/Snowflake-Labs/snowflake-flow-diff)
   (see the Usage section in the README).
3. Commit and push the workflow file.

### Review flow changes

1. When a pull request is opened, the Flow Diff action runs automatically.
2. Navigate to the Conversations tab on the pull request and wait for the Flow Diff
   analysis to appear.
3. The analysis shows a visual, human-readable comparison of flow changes instead of raw
   JSON diffs.

## Manage parameters across environments

Openflow uses Parameters to manage environment-specific values (for example, connection strings,
credentials, table names) across different Runtimes.

Keep the following concepts in mind:

* Parameters are grouped into a Parameter Context, which has a one-to-one mapping with a
  Process Group.
* Parameter Context inheritance allows you to define shared parameters in a parent context and
  override specific values in child contexts. This is useful for promoting flows across dev,
  staging, and production environments.
* Parameter Contexts can integrate with Secrets Managers to securely handle sensitive credentials
  without storing them in the flow definition.

## Recommended SDLC workflow

1. **Development environment**: Developers create feature branches, build or modify flows, and
   commit changes on the Openflow canvas against their feature branch.
2. **Code review**: Open a pull request in GitHub. Use Snowflake Flow Diff for readable reviews.
3. **Merge to main**: After approval, merge the pull request into the `main` branch.
4. **Promote to production**: In your production Runtime, update the Process Group to pull the
   latest version from `main`.
5. **Parameterize**: Use Parameter Contexts to handle environment-specific configuration without
   modifying the flow definition itself.

---
title: VolatileSchemaCache
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/volatileschemacache.md
section: Loading & Unloading Data
---

# VolatileSchemaCache

## Description

Provides a Schema Cache that evicts elements based on a Least-Recently-Used algorithm. This cache is not persisted, so any restart of NiFi will result in the cache being cleared. Additionally, the cache will be cleared any time that the Controller Service is stopped and restarted.

## Tags

cache, record, schema

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Maximum Cache Size \* | max-cache-size | 100 |  | The maximum number of Schemas to cache. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: Wait 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/wait.md
section: Loading & Unloading Data
---

# Wait 2025.10.9.21

## Bundle

org.apache.nifi | nifi-standard-nar

## Description

Routes incoming FlowFiles to the ‘wait’ relationship until a matching release signal is stored in the distributed cache from a corresponding Notify processor. When a matching release signal is identified, a waiting FlowFile is routed to the ‘success’ relationship. The release signal entry is then removed from the cache. The attributes of the FlowFile that produced the release signal are copied to the waiting FlowFile if the Attribute Cache Regex property of the corresponding Notify processor is set properly. If there are multiple release signals in the cache identified by the Release Signal Identifier, and the Notify processor is configured to copy the FlowFile attributes to the cache, then the FlowFile passing the Wait processor receives the union of the attributes of the FlowFiles that produced the release signals in the cache (identified by Release Signal Identifier). Waiting FlowFiles will be routed to ‘expired’ if they exceed the Expiration Duration. If you need to wait for more than one signal, specify the desired number of signals via the ‘Target Signal Count’ property. This is particularly useful with processors that split a source FlowFile into multiple fragments, such as SplitText. In order to wait for all fragments to be processed, connect the ‘original’ relationship to a Wait processor, and the ‘splits’ relationship to a corresponding Notify processor. Configure the Notify and Wait processors to use the ‘${fragment.identifier}’ as the value of ‘Release Signal Identifier’, and specify ‘${fragment.count}’ as the value of ‘Target Signal Count’ in the Wait processor. It is recommended to use a prioritizer (for instance First In First Out) when using the ‘wait’ relationship as a loop.

## Tags

cache, distributed, hold, map, release, signal, wait

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| attribute-copy-mode | Specifies how to handle attributes copied from FlowFiles entering the Notify processor |
| distributed-cache-service | The Controller Service that is used to check for release signals from a corresponding Notify processor |
| expiration-duration | Indicates the duration after which waiting FlowFiles will be routed to the ‘expired’ relationship |
| releasable-flowfile-count | A value, or the results of an Attribute Expression Language statement, which will be evaluated against a FlowFile in order to determine the releasable FlowFile count. This specifies how many FlowFiles can be released when a target count reaches target signal count. Zero (0) has a special meaning, any number of FlowFiles can be released as long as signal count matches target. |
| release-signal-id | A value that specifies the key to a specific release signal cache. To decide whether the FlowFile that is being processed by the Wait processor should be sent to the ‘success’ or the ‘wait’ relationship, the processor checks the signals in the cache specified by this key. |
| signal-counter-name | Within the cache (specified by the Release Signal Identifier) the signals may belong to different counters. If this property is specified, the processor checks the number of signals in the cache that belong to this particular counter. If not specified, the processor checks the total number of signals in the cache. |
| target-signal-count | The number of signals that need to be in the cache (specified by the Release Signal Identifier) in order for the FlowFile processed by the Wait processor to be sent to the ‘success’ relationship. If the number of signals in the cache has reached this number, the FlowFile is routed to the ‘success’ relationship and the number of signals in the cache is decreased by this value. If Signal Counter Name is specified, this processor checks a particular counter, otherwise checks against the total number of signals in the cache. |
| wait-buffer-count | Specify the maximum number of incoming FlowFiles that can be buffered to check whether it can move forward. The more buffer can provide the better performance, as it reduces the number of interactions with cache service by grouping FlowFiles by signal identifier. Only a signal identifier can be processed at a processor execution. |
| wait-mode | Specifies how to handle a FlowFile waiting for a notify signal |
| wait-penalty-duration | If configured, after a signal identifier got processed but did not meet the release criteria, the signal identifier is penalized and FlowFiles having the signal identifier will not be processed again for the specified period of time, so that the signal identifier will not block others to be processed. This can be useful for use cases where a Wait processor is expected to process multiple signal identifiers, and each signal identifier has multiple FlowFiles, and also the order of releasing FlowFiles is important within a signal identifier. The FlowFile order can be configured with Prioritizers. IMPORTANT: There is a limitation of number of queued signals can be processed, and Wait processor may not be able to check all queued signal ids. See additional details for the best practice. |

## Relationships

| Name | Description |
| --- | --- |
| expired | A FlowFile that has exceeded the configured Expiration Duration will be routed to this relationship |
| failure | When the cache cannot be reached, or if the Release Signal Identifier evaluates to null or empty, FlowFiles will be routed to this relationship |
| success | A FlowFile with a matching release signal in the cache will be routed to this relationship |
| wait | A FlowFile with no matching release signal in the cache will be routed to this relationship |

## Writes attributes

| Name | Description |
| --- | --- |
| wait.start.timestamp | All FlowFiles will have an attribute ‘wait.start.timestamp’, which sets the initial epoch timestamp when the file first entered this processor. This is used to determine the expiration time of the FlowFile. This attribute is not written when the FlowFile is transferred to failure, expired or success |
| wait.counter.<counterName> | The name of each counter for which at least one signal has been present in the cache since the last time the cache was empty gets copied to the current FlowFile as an attribute. |

## See also

* [org.apache.nifi.processors.standard.Notify](notify.md)

---
title: WaitForTableState 2025.10.9.21
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/processors/waitfortablestate.md
section: Loading & Unloading Data
---

# WaitForTableState 2025.10.9.21

## Bundle

com.snowflake.openflow.runtime | runtime-database-cdc-processors-nar

## Description

Blocks incoming FlowFiles until the corresponding table state is not equal to accepted state. Blocked FlowFiles stay in the upstream queue. When table is in terminated state or table is removed from the state then all FlowFiles are routed to the ‘failure’ relationship.

## Tags

cdc, event, jdbc, mysql, postgresql, sql

## Input Requirement

REQUIRED

## Supports Sensitive Dynamic Properties

false

## Properties

| Property | Description |
| --- | --- |
| Accepted State | Blocks FlowFiles for a given SourceTableFQN until corresponding state is equal to the Accepted State |
| Table State Service | Manages the state of each replicated table |

## Relationships

| Name | Description |
| --- | --- |
| failure | FlowFiles for tables in terminal states will be routed to this relationship |
| success | FlowFiles fulfilling a given condition will be routed to this relationship |

---
title: WindowsEventLogReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/windowseventlogreader.md
section: Loading & Unloading Data
---

# WindowsEventLogReader

## Description

Reads Windows Event Log data as XML content having been generated by ConsumeWindowsEventLog, ParseEvtx, etc. (see Additional Details) and creates Record object(s). If the root tag of the input XML is ‘Events’, the child content is expected to be a series of ‘Event’ tags, each of which will constitute a single record. If the root tag is ‘Event’, the content is expected to be a single ‘Event’ and thus a single record. No other root tags are valid. Only events of type ‘System’ are currently supported.

## Tags

event, log, parser, reader, record, windows, xml

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: XMLFileLookupService
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/xmlfilelookupservice.md
section: Loading & Unloading Data
---

# XMLFileLookupService

## Description

A reloadable XML file-based lookup service. This service uses Apache Commons Configuration. Example XML configuration file and how to access specific configuration can be found at <http://commons.apache.org/proper/commons-configuration/userguide/howto_hierarchical.html>. External entity processing is disabled.

## Tags

cache, enrich, join, key, lookup, reloadable, value, xml

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Configuration File \* | configuration-file |  |  | A configuration file |

## State management

This component does not store state.

## Restricted

## Restrictions

| Required Permission | Explanation |
| --- | --- |
| read filesystem | Provides operator the ability to read from any file that NiFi has access to. |

## System Resource Considerations

This component does not specify system resource considerations.

---
title: XMLReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/xmlreader.md
section: Loading & Unloading Data
---

# XMLReader

## Description

Reads XML content and creates Record objects. Records are expected in the second level of XML data, embedded in an enclosing root tag.

## Tags

parser, reader, record, xml

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader * Infer Schema | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Attribute Prefix | attribute_prefix |  |  | If this property is set, the name of attributes will be prepended with a prefix when they are added to a record. |
| Field Name for Content | content_field_name |  |  | If tags with content (e. g. <field>content</field>) are defined as nested records in the schema, the name of the tag will be used as name for the record and the value of this property will be used as name for the field. If tags with content shall be parsed together with attributes (e. g. <field attribute=”123”>content</field>), they have to be defined as records. In such a case, the name of the tag will be used as the name for the record and the value of this property will be used as the name for the field holding the original content. The name of the attribute will be used to create a new record field, the content of which will be the value of the attribute. For more information, see the ‘Additional Details…’ section of the XMLReader controller service’s documentation. |
| Parse XML Attributes | parse_xml_attributes | true | * true * false | When ‘Schema Access Strategy’ is ‘Infer Schema’ and this property is ‘true’ then XML attributes are parsed and added to the record as new fields. When the schema is inferred but this property is ‘false’, XML attributes and their values are ignored. |
| Expect Records as Array \* | record_format | false | * false * true * Use attribute ‘xml.stream.is.array’ | This property defines whether the reader expects a FlowFile to consist of a single Record or a series of Records with a “wrapper element”. Because XML does not provide for a way to read a series of XML documents from a stream directly, it is common to combine many XML documents by concatenating them and then wrapping the entire XML blob with a “wrapper element”. This property dictates whether the reader expects a FlowFile to consist of a single Record or a series of Records with a “wrapper element” that will be ignored. |
| Schema Inference Cache | schema-inference-cache |  |  | Specifies a Schema Cache to use when inferring the schema. If not populated, the schema will be inferred each time. However, if a cache is specified, the cache will first be consulted and if the applicable schema can be found, it will be used instead of inferring the schema. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: XMLRecordSetWriter
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/xmlrecordsetwriter.md
section: Loading & Unloading Data
---

# XMLRecordSetWriter

## Description

Writes a RecordSet to XML. The records are wrapped by a root tag.

## Tags

record, recordset, resultset, row, serialize, writer, xml

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Character Set \* | Character Set | UTF-8 |  | The Character set to use when writing the data to the FlowFile |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Schema Access Strategy \* | Schema Access Strategy | inherit-record-schema | * Inherit Record Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Cache | Schema Cache |  |  | Specifies a Schema Cache to add the Record Schema to so that Record Readers can quickly lookup the schema. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Reference Writer \* | Schema Reference Writer |  |  | Service implementation responsible for writing FlowFile attributes or content header with Schema reference information |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Schema Write Strategy \* | Schema Write Strategy | no-schema | * Do Not Write Schema * Set ‘schema.name’ Attribute * Set ‘avro.schema’ Attribute * Schema Reference Writer | Specifies how the schema for a Record should be added to the data. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Array Tag Name | array_tag_name |  |  | Name of the tag used by property “Wrap Elements of Arrays” to write arrays |
| Wrap Elements of Arrays \* | array_wrapping | no-wrapping | * Use Property as Wrapper * Use Property for Elements * No Wrapping | Specifies how the writer wraps elements of fields of type array |
| Omit XML Declaration \* | omit_xml_declaration | false | * true * false | Specifies whether or not to include XML declaration |
| Pretty Print XML \* | pretty_print_xml | false | * true * false | Specifies whether or not the XML should be pretty printed |
| Name of Record Tag | record_tag_name |  |  | Specifies the name of the XML record tag wrapping the record fields. If this is not set, the writer will use the record name in the schema. |
| Name of Root Tag | root_tag_name |  |  | Specifies the name of the XML root tag wrapping the record set. This property has to be defined if the writer is supposed to write multiple records in a single FlowFile. |
| Suppress Null Values \* | suppress_nulls | never-suppress | * Never Suppress * Always Suppress * Suppress Missing Values | Specifies how the writer should handle a null field |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

---
title: YamlTreeReader
source: https://docs.snowflake.com/en/user-guide/data-integration/openflow/controllers/yamltreereader.md
section: Loading & Unloading Data
---

# YamlTreeReader

## Description

Parses YAML into individual Record objects. While the reader expects each record to be well-formed YAML, the content of a FlowFile may consist of many records, each as a well-formed YAML array or YAML object. If an array is encountered, each element in that array will be treated as a separate record. If the schema that is configured contains a field that is not present in the YAML, a null value will be used. If the YAML contains a field that is not present in the schema, that field will be skipped. Please note this controller service does not support resolving the use of YAML aliases. Any alias present will be treated as a string. See the Usage of the Controller Service for more information and examples.

## Tags

parser, reader, record, tree, yaml

## Properties

In the list below required Properties are shown with an asterisk (\*).
Other properties are considered optional. The table also indicates any default values, and whether a property supports the NiFi Expression Language.

| Display Name | API Name | Default Value | Allowable Values | Description |
| --- | --- | --- | --- | --- |
| Allow Comments \* | Allow Comments | false | * true * false | Whether to allow comments when parsing the JSON document |
| Date Format | Date Format |  |  | Specifies the format to use when reading/writing Date fields. If not specified, Date fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters, as in 01/01/2017). |
| Max String Length \* | Max String Length | 20 MB |  | The maximum allowed length of a string value when parsing the JSON document |
| Schema Access Strategy \* | Schema Access Strategy | infer-schema | * Infer Schema * Use ‘Schema Name’ Property * Use ‘Schema Text’ Property * Schema Reference Reader | Specifies how to obtain the schema that is to be used for interpreting the data. |
| Schema Branch | Schema Branch |  |  | Specifies the name of the branch to use when looking up the schema in the Schema Registry property. If the chosen Schema Registry does not support branching, this value will be ignored. |
| Schema Name | Schema Name | ${schema.name} |  | Specifies the name of the schema to lookup in the Schema Registry property |
| Schema Reference Reader \* | Schema Reference Reader |  |  | Service implementation responsible for reading FlowFile attributes or content to determine the Schema Reference Identifier |
| Schema Registry | Schema Registry |  |  | Specifies the Controller Service to use for the Schema Registry |
| Schema Text | Schema Text | ${avro.schema} |  | The text of an Avro-formatted Schema |
| Schema Version | Schema Version |  |  | Specifies the version of the schema to lookup in the Schema Registry. If not specified then the latest version of the schema will be retrieved. |
| Time Format | Time Format |  |  | Specifies the format to use when reading/writing Time fields. If not specified, Time fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, HH:mm:ss for a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 18:04:15). |
| Timestamp Format | Timestamp Format |  |  | Specifies the format to use when reading/writing Timestamp fields. If not specified, Timestamp fields will be assumed to be number of milliseconds since epoch (Midnight, Jan 1, 1970 GMT). If specified, the value must match the Java java.time.format.DateTimeFormatter format (for example, MM/dd/yyyy HH:mm:ss for a two-digit month, followed by a two-digit day, followed by a four-digit year, all separated by ‘/’ characters; and then followed by a two-digit hour in 24-hour format, followed by a two-digit minute, followed by a two-digit second, all separated by ‘:’ characters, as in 01/01/2017 18:04:15). |
| Schema Application Strategy \* | schema-application-strategy | SELECTED_PART | * Whole JSON * Selected Part | Specifies whether the schema is defined for the whole JSON or for the selected part starting from “Starting Field Name”. |
| Schema Inference Cache | schema-inference-cache |  |  | Specifies a Schema Cache to use when inferring the schema. If not populated, the schema will be inferred each time. However, if a cache is specified, the cache will first be consulted and if the applicable schema can be found, it will be used instead of inferring the schema. |
| Starting Field Name | starting-field-name |  |  | Skips forward to the given nested JSON field (array or object) to begin processing. |
| Starting Field Strategy \* | starting-field-strategy | ROOT_NODE | * Root Node * Nested Field | Start processing from the root node or from a specified nested node. |

## State management

This component does not store state.

## Restricted

This component is not restricted.

## System Resource Considerations

This component does not specify system resource considerations.

## Snowflake Cortex (AI & ML)

LLM functions, vector search, document AI, Cortex Analyst, and AI-powered features.

---
title: AI Observability Tutorial
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-observability/tutorial.md
section: Snowflake Cortex (AI & ML)
---

# AI Observability Tutorial

Learn how to implement AI observability in a retrieval-augmented generation (RAG) application using [Cortex Search](../cortex-search/cortex-search-overview.md)
and [COMPLETE (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/complete-snowflake-cortex.md) functions.

In the [Getting started with AI observability](https://quickstarts.snowflake.com/guide/getting_started_with_ai_observability) tutorial, you’ll learn how to do the following tasks:

* Build a RAG application using Snowflake Cortex Search and Snowflake Cortex LLM functions.
* Create a run.
* Compute evaluation metrics.

---
title: AI_COMPLETE structured outputs
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/complete-structured-outputs.md
section: Snowflake Cortex (AI & ML)
---

# AI_COMPLETE structured outputs

AI_COMPLETE lets you supply a JSON schema or SQL type literal that completion responses must follow, producing structured
output. Structured output reduces the need for post-processing in your AI data pipelines and enables seamless
integration with systems that require deterministic responses. AI_COMPLETE verifies each generated token against your
structured output definition to ensure that the response conforms to your type structure.

Every model supported by AI_COMPLETE supports structured output, but the most powerful models typically generate higher
quality responses.

## Using AI_COMPLETE with type literals

Type literals allow you to define structured output for AI_COMPLETE using SQL types, taking advantage of Snowflake’s
built-in mappings between SQL and JSON types. Begin your type literal with the TYPE keyword and use a SQL OBJECT as the top-level
type. The properties of your top-level object can be any SQL type with a supported mapping to JSON.

> **Note:**
>
> Type literals are supported only for the single string text prompt version of AI_COMPLETE. For more information, see
> [AI_COMPLETE (Single string)](../../sql-reference/functions/ai_complete-single-string.md).

The following example uses a type literal to produce structured output for a prompt. The prompt contains both instructions to the model and the data to process. The `response_format` type literal produces the model’s response as a JSON object with a top-level `note` containing a `date`, `address`, `items_count`, and a `price` array containing prices.

```sqlexample
SELECT AI_COMPLETE(
    model => 'llama3.3-70b',
    prompt => 'Extract structured data from this customer interaction note: Customer Sarah Jones complained about the mobile app crashing during checkout. She tried to purchase 3 items: a red XL jacket ($89.99), blue running shoes ($129.50), and a fitness tracker ($199.00). The app crashed after she entered her shipping address at 123 Main St, Portland OR, 97201. She has been a premium member since January 2024.',
    response_format => TYPE OBJECT(note OBJECT(items_count NUMBER, price ARRAY(STRING), address STRING, member_date STRING)),
    show_details => TRUE
);
```

The following is a full response to this query:

```output
{
  "created": 1758755328,
  "model": "llama3.3-70b",
  "structured_output": [
    {
      "raw_message": {
        "note": {
          "items_count": 3,
          "price": [
            "$89.99",
            "$129.50",
            "$199.00"
          ]
        }
      },
      "type": "json"
    }
  ],
  "usage": {
    "completion_tokens": 49,
    "prompt_tokens": 100,
    "total_tokens": 149
  }
}
```

### Type literal notes and limitations

Specifying a structured output schema as a type literal follows these rules:

* STRING and VARCHAR types are mapped to JSON strings.
* VARCHAR types aren’t guaranteed to produce output of a specific length.
* FIXED types without a scale are mapped to JSON integers. All other numeric types are mapped to JSON numbers.

Type literals have restrictions around supported types:

* The empty object OBJECT() isn’t allowed as a type literal.
* Not all SQL types have a mapping for structured output. These include, but aren’t limited, to:

  + VARIANT
  + MAP
  + [Date & time data types](../../sql-reference/data-types-datetime.md)

The use of an unsupported data type returns an error.

## Using AI_COMPLETE with JSON schemas

For more control over structured output, use a [JSON schema](https://json-schema.org/) as the value for
`response_format`. The supplied JSON schema defines the structure, data types, and constraints that the generated text
must conform to, including required fields.

For simple tasks, you don’t need to specify any details of the output format, or even instruct the model to “respond in
JSON.” For more complex tasks, prompting the model to respond in JSON can improve accuracy; see
Optimizing JSON adherence accuracy.

The following illustrates the syntax of an AI_COMPLETE function call that uses a JSON schema to specify the structured
output format. The schema defines a top-level object, `properties`, with a `property_name` property of type string;
this field is required in the response.

```sqlexample
AI_COMPLETE(
    ...
    response_format => {
        'type': 'json',
        'schema': {
            'type': 'object',
            'properties': {
                'property_name': {
                    'type': 'string'
                },
                ...
            },
            'required': ['property_name', ...]
        }
    }
)
```

> **Important:**
>
> For OpenAI (GPT) models, the following requirements apply:
>
> * [additionalProperties](https://json-schema.org/understanding-json-schema/reference/object#additionalproperties) field must be set to
>   `false` in every node of the schema.
> * The [required](https://json-schema.org/understanding-json-schema/reference/object#required) field must be included and contain the names of
>   every property in the schema.
>
> Other models do not require these fields, but you might include them anyway so you don’t need a different schema for OpenAI models.

## SQL examples

The following example is a more complete demonstration of using AI_COMPLETE with a single string input.

```sqlexample
SELECT AI_COMPLETE(
    model => 'mistral-large2',
    prompt => 'Return the customer sentiment for the following review: New kid on the block, this pizza joint! The pie arrived neither in a flash nor a snail\'s pace, but the taste? Divine! Like a symphony of Italian flavors, it was a party in my mouth. But alas, the party was a tad pricey for my humble abode\'s standards. A mixed bag, I\'d say!',
    response_format => {
            'type':'json',
            'schema':{'type' : 'object','properties' : {'sentiment_categories':{'type': 'object','properties':
            {'food_quality' : {'type' : 'string'},'food_taste': {'type':'string'}, 'wait_time': {'type':'string'}, 'food_cost': {'type':'string'}},'required':['food_quality','food_taste' ,'wait_time','food_cost']}}}

    }
);
```

Response:

```output
{
    "sentiment_categories":
    {
        "food_cost": "negative",
        "food_quality": "positive",
        "food_taste": "positive",
        "wait_time": "neutral"
    }
}
```

The following example demonstrates how to use the `response_format` argument to specify a JSON schema for the response and using the
`show_details` argument to return inference metadata.

```sqlexample
SELECT AI_COMPLETE(
    model => 'mistral-large2',
    prompt => 'Return the customer sentiment for the following review: New kid on the block, this pizza joint! The pie arrived neither in a flash nor a snail\'s pace, but the taste? Divine! Like a symphony of Italian flavors, it was a party in my mouth. But alas, the party was a tad pricey for my humble abode\'s standards. A mixed bag, I\'d say!',
    response_format => {
            'type':'json',
            'schema':{'type' : 'object','properties' : {'sentiment_categories':{'type': 'object','properties':
            {'food_quality' : {'type' : 'string'},'food_taste': {'type':'string'}, 'wait_time': {'type':'string'}, 'food_cost': {'type':'string'}},'required':['food_quality','food_taste' ,'wait_time','food_cost']}}}

    },
    show_details => TRUE
);
```

Response:

```output
{
    "created": 1738683744,
    "model": "mistral-large2",
    "structured_output": [
        {
        "raw_message": {
            "sentiment_categories":
            {
                "food_cost": "negative",
                "food_quality": "positive",
                "food_taste": "positive",
                "wait_time": "neutral"
            }
        },
        "type": "json"
        }
    ],
    "usage": {
        "completion_tokens": 60,
        "prompt_tokens": 94,
        "total_tokens": 154
    }
}
```

## Python example

> **Note:**
>
> Structured output is supported in `snowflake-ml-python` version 1.8.0 and later.

The following example demonstrates how to use the `response_format` argument to specify a JSON schema for the response.

```python
from snowflake.cortex import complete, CompleteOptions

response_format = {
    "type": "json",
    "schema": {
        "type": "object",
        "properties": {
            "people": {
                "type": "array",
                "items": {
                    "type": "object",
                    "properties": {
                        "name": {"type": "string"},
                        "age": {"type": "number"},
                    },
                    "required": ["name", "age"],
                },
            }
        },
        "required": ["people"],
    },
}
prompt = [{
    "role": "user",
    "content": "Please prepare me a data set of 5 ppl and their age",
}]

options = CompleteOptions(
        max_tokens=4096,
        temperature=0.7,
        top_p=1,
        guardrails=False,
        response_format=response_format
    )

result = complete(
model="claude-sonnet-4-6",
prompt=prompt,
session={session_object}, # session created via connector
stream=True,
options=options,
)

output = "".join(result)
print(output)
```

Response:

```output
{"people": [{"name":"John Smith","age":32},{"name":"Sarah Johnson","age":28},
{"name":"Michael Chen","age":45},{"name":"Emily Davis","age":19},{"name":"Robert Wilson","age":56}]}
```

### Pydantic example

Pydantic is a data validation and settings management library for Python. This example uses Pydantic to define a schema
for the response format. The code performs these steps:

1. Uses Pydantic to define a schema
2. Converts the Pydantic model to a JSON schema using the `model_json_schema` method
3. Passes the JSON schema to the `complete` function as the `response_format` argument

> **Note:**
>
> This example is meant to be run in a Snowsight Python worksheet, which already has a connection to Snowflake. To run
> it in a different environment, you might need to [establish a connection to Snowflake](../../developer-guide/python-connector/python-connector-connect.md)
> using the Snowflake Connector for Python.

```python
from pydantic import BaseModel, Field
import json
from snowflake.cortex import complete, CompleteOptions
from snowflake.snowpark.context import get_active_session

class Person(BaseModel):
    age: int = Field(description="Person age")
    name: str = Field(description="Person name")

class People(BaseModel):
    people: list[Person] = Field(description="People list")

ppl = People.model_json_schema()
'''
This is the ppl object, keep in mind there's a '$defs' key used

{'$defs': {'Person': {'properties': {'age': {'description': 'Person age', 'title': 'Age', 'type': 'integer'}, 'name': {'description': 'Person name', 'title': 'Name', 'type': 'string'}}, 'required': ['age', 'name'], 'title': 'Person', 'type': 'object'}}, 'properties': {'people': {'description': 'People list', 'items': {'$ref': '#/$defs/Person'}, 'title': 'People', 'type': 'array'}}, 'required': ['people'], 'title': 'People', 'type': 'object'}

'''

response_format_pydantic={
    "type": "json",
    "schema": ppl,
}
prompt=[{"role": "user", "content": "Please prepare me a data set of 5 ppl and their age"}]
options_pydantic = CompleteOptions(  # random params
        max_tokens=4096,
        temperature=0.7,
        top_p=1,
        guardrails=False,
        response_format=response_format_pydantic
    )
model_name = "claude-sonnet-4-6"

session = get_active_session()
try:
    result_pydantic = complete(
        model=model_name,
        prompt=prompt,
        session=session,
        stream=True,
        options=options_pydantic,
    )
except Exception as err:
    result_pydantic = (chunk for chunk in err.response.text) # making sure it's generator, similar to the valid response

output_pydantic = "".join(result_pydantic)
print(output_pydantic)
```

Response:

```output
{"people": [{"name":"John Smith","age":32},{"name":"Sarah Johnson","age":45},
{"name":"Mike Chen","age":28},{"name":"Emma Wilson","age":19},{"name":"Robert Brown","age":56}]}
```

## REST API example

You can use the [Snowflake Cortex LLM REST API](cortex-rest-api.md) to invoke
COMPLETE with the LLM of your choice. Below is an example supplying a schema using the Cortex LLM REST API:

```shell-session
curl --location --request POST 'https://<account_identifier>.snowflakecomputing.com/api/v2/cortex/inference:complete'
--header 'Authorization: Bearer <jwt>' \
--header 'Accept: application/json, text/event-stream' \
--header 'Content-Type: application/json' \
--data-raw '{
    "model": "claude-sonnet-4-6",
    "messages": [{
        "role": "user",
        "content": "Order a pizza for a hungry space traveler heading to the planet Zorgon. Make sure to include a special instruction to avoid any intergalactic allergens."
    }],
    "max_tokens": 1000,
    "response_format": {
            "type": "json",
            "schema":
            {
                "type": "object",
                "properties":
                {
                    "crust":
                    {
                        "type": "string",
                        "enum":
                        [
                            "thin",
                            "thick",
                            "gluten-free",
                            "Rigellian fungus-based"
                        ]
                    },
                    "toppings":
                    {
                        "type": "array",
                        "items":
                        {
                            "type": "string",
                            "enum":
                            [
                                "Gnorchian sausage",
                                "Andromedian mushrooms",
                                "Quasar cheese"
                            ]
                        }
                    },
                    "delivery_planet":
                    {
                        "type": "string"
                    },
                    "special_instructions":
                    {
                        "type": "string"
                    }
                },
                "required":
                [
                    "crust",
                    "toppings",
                    "delivery_planet"
                ]
            }
        }
    }'
```

Response:

```output
data: {"id":"4d62e41a-d2d7-4568-871a-48de1463ed2a","model":"claude-sonnet-4-6","choices":[{"delta":{"content":"{\"crust\":","content_list":[{"type":"text","text":"{\"crust\":"}]}}],"usage":{}}

data: {"id":"4d62e41a-d2d7-4568-871a-48de1463ed2a","model":"claude-sonnet-4-6","choices":[{"delta":{"content":" \"thin\"","content_list":[{"type":"text","text":" \"thin\""}]}}],"usage":{}}

data: {"id":"4d62e41a-d2d7-4568-871a-48de1463ed2a","model":"claude-sonnet-4-6","choices":[{"delta":{"content":", \"topping","content_list":[{"type":"text","text":", \"topping"}]}}],"usage":{}}

data: {"id":"4d62e41a-d2d7-4568-871a-48de1463ed2a","model":"claude-sonnet-4-6","choices":[{"delta":{"content":"s\": [\"Quasar","content_list":[{"type":"text","text":"s\": [\"Quasar"}]}}],"usage":{}}
```

## Create a JSON schema definition

To get the best accuracy from COMPLETE Structured Outputs, follow these guidelines:

* **Use the “required” field** in the schema to specify required fields. COMPLETE raises an error if a required
  field cannot be extracted.

  In the following example, the schema directs COMPLETE to find people mentioned in the document. The `people` field
  is marked as required to make sure people are identified.

  > ```sqlexample
  > {
  >     'type': 'object',
  >     'properties': {
  >         'dataset_name': {
  >             'type': 'string'
  >         },
  >         'created_at': {
  >             'type': 'string'
  >         },
  >         'people': {
  >             'type': 'array',
  >             'items': {
  >                 'type': 'object',
  >                 'properties': {
  >                     'name': {
  >                         'type': 'string'
  >                     },
  >                     'age': {
  >                         'type': 'number'
  >                     },
  >                     'isAdult': {
  >                         'type': 'boolean'
  >                     }
  >                 }
  >             }
  >         }
  >     },
  >     'required': [
  >         'dataset_name',
  >         'created_at',
  >         'people'
  >     ]
  > }
  > ```
  >
  > Response:
  >
  > ```output
  > {
  >     "dataset_name": "name",
  >     "created_at": "date",
  >     "people": [
  >         {
  >             "name": "Andrew",
  >             "isAdult": true
  >         }
  >     ]
  > }
  > ```
* **Provide detailed descriptions** of the fields to be extracted so that the model can more accurately identify them. For
  example, the following schema includes a description of each of the fields of `people`: `name`, `age`, and `isAdult`.

  > ```sqlexample
  > {
  >     'type': 'object',
  >     'properties': {
  >         'dataset_name': {
  >             'type': 'string'
  >         },
  >         'created_at': {
  >             'type': 'string'
  >         },
  >         'people': {
  >             'type': 'array',
  >             'items': {
  >                 'type': 'object',
  >                 'properties': {
  >                     'name': {
  >                         'type': 'string',
  >                         'description': 'name should be between 9 to 10 characters'
  >                     },
  >                     'age': {
  >                         'type': 'number',
  >                         'description': 'Should be a value between 0 and 200'
  >                     },
  >                     'isAdult': {
  >                         'type': 'boolean',
  >                         'description': 'Persons is older than 18'
  >                     }
  >                 }
  >             }
  >         }
  >     }
  > }
  > ```

### Using a JSON reference

Schema references solve practical problems when using Cortex COMPLETE Structured Outputs. With references, represented
by `$ref`, you can define common objects like addresses or prices once, then reuse them throughout the
schema. This way, when you need to update validation logic or add a field, you can change it in one place instead of in
multiple locations.

Using references reduces coding effort, reduces bugs from inconsistent implementations, and makes code reviews simpler. Referenced components
create cleaner hierarchies that better represent entity relationships in your data model. As projects grow more complex,
this modular approach helps you manage technical debt while maintaining schema integrity.

Third-party libraries such as Pydantic support the reference mechanism natively in Python, simplifying schema usage in
your code.

The following guidelines apply to the use of references in JSON schema:

* **Scope limitation:** The `$ref` mechanism is limited to the user’s schema only; external schema references (such as HTTP URLs) are not supported.
* **Definition placement:** Object definitions should be placed at the top level of the schema, specifically under the definitions or `$defs` key.
* **Enforcement:** While the JSON Schema specification recommends using the `$defs` key for definitions, Snowflake’s
  validation mechanism strictly enforces this structure. This is an example of a valid `$defs` object:

```javascript
{
    '$defs': {
        'person':{'type':'object','properties':{'name' : {'type' : 'string'},'age': {'type':'number'}}, 'required':['name','age']}},
    'type': 'object',
    'properties': {'title':{'type':'string'},'people':{'type':'array','items':{'$ref':'#/$defs/person'}}}
}
```

#### Example using JSON reference

This SQL example demonstrates the use of references in a JSON schema.

```sqlexample
select ai_complete(
    model => 'claude-sonnet-4-6',
    prompt => 'Extract structured data from this customer interaction note: Customer Sarah Jones complained about the mobile app crashing during checkout. She tried to purchase 3 items: a red XL jacket ($89.99), blue running shoes ($129.50), and a fitness tracker ($199.00). The app crashed after she entered her shipping address at 123 Main St, Portland OR, 97201. She has been a premium member since January 2024.',
    'response_format' => {
            'type': 'json',
            'schema': {
'type': 'object',
'$defs': {
    'price': {
        'type': 'object',
        'properties': {
            'amount': {'type': 'number'},
            'currency': {'type': 'string'}
        },
        'required': ['amount']
    },
    'address': {
        'type': 'object',
        'properties': {
            'street': {'type': 'string'},
            'city': {'type': 'string'},
            'state': {'type': 'string'},
            'zip': {'type': 'string'},
            'country': {'type': 'string'}
        },
        'required': ['street', 'city', 'state']
    },
    'product': {
        'type': 'object',
        'properties': {
            'name': {'type': 'string'},
            'category': {'type': 'string'},
            'color': {'type': 'string'},
            'size': {'type': 'string'},
            'price': {'$ref': '#/$defs/price'}
        },
        'required': ['name', 'price']
    }
},
'properties': {
    'customer': {
        'type': 'object',
        'properties': {
            'name': {'type': 'string'},
            'membership': {
                'type': 'object',
                'properties': {
                    'type': {'type': 'string'},
                    'since': {'type': 'string'}
                }
            },
            'shipping_address': {'$ref': '#/$defs/address'}
        },
        'required': ['name']
    },
    'issue': {
        'type': 'object',
        'properties': {
            'type': {'type': 'string'},
            'platform': {'type': 'string'},
            'stage': {'type': 'string'},
            'severity': {'type': 'string', 'enum': ['low', 'medium', 'high', 'critical']}
        },
        'required': ['type', 'platform']
    },
    'cart': {
        'type': 'object',
        'properties': {
            'items': {
                'type': 'array',
                'items': {'$ref': '#/$defs/product'}
            },
            'total': {'$ref': '#/$defs/price'},
            'item_count': {'type': 'integer'}
        }
    },
    'recommended_actions': {
        'type': 'array',
        'items': {
            'type': 'object',
            'properties': {
                'department': {'type': 'string'},
                'action': {'type': 'string'},
                'priority': {'type': 'string', 'enum': ['low', 'medium', 'high', 'urgent']}
            }
        }
    }
},
'required': ['customer', 'issue','cart']
}
        }
    }
);
```

Response:

```output
{
  "created": 1747313083,
  "model": "claude-sonnet-4-6",
  "structured_output": [
    {
      "raw_message": {
        "cart": {
          "item_count": 3,
          "items": [
            {
              "color": "red",
              "name": "jacket",
              "price": {
                "amount": 89.99,
                "currency": "USD"
              },
              "size": "XL"
            },
            {
              "color": "blue",
              "name": "running shoes",
              "price": {
                "amount": 129.5,
                "currency": "USD"
              }
            },
            {
              "name": "fitness tracker",
              "price": {
                "amount": 199,
                "currency": "USD"
              }
            }
          ],
          "total": {
            "amount": 418.49,
            "currency": "USD"
          }
        },
        "customer": {
          "membership": {
            "since": "2024-01",
            "type": "premium"
          },
          "name": "Sarah Jones",
          "shipping_address": {
            "city": "Portland",
            "state": "OR",
            "street": "123 Main St",
            "zip": "97201"
          }
        },
        "issue": {
          "platform": "mobile",
          "severity": "high",
          "stage": "checkout",
          "type": "app_crash"
        }
      },
      "type": "json"
    }
  ],
  "usage": {
    "completion_tokens": 57,
    "prompt_tokens": 945,
    "total_tokens": 1002
  }
}
```

## Optimizing JSON adherence accuracy

COMPLETE Structured Outputs does not usually require a prompt; it already understands that its response should conform
to the schema you specify. However, task complexity can significantly influence the ability of LLMs to follow a JSON
response format. The more complex the task, the more you can improve the accuracy of results by specifying a prompt.

* **Simple tasks** such as text classification, entity extraction, paraphrasing, and summarization tasks that don’t
  require complex reasoning generally do not require additional prompting. For smaller models of lower intelligence,
  just using Structured Outputs significantly improves JSON adherence accuracy, as it ignores any text the model
  provides unrelated to the supplied schema.
* **Medium-complexity tasks** include any simple task in which the model is asked for additional reasoning, such as
  providing its rationale for a classification decision. For these use cases, we recommend adding “Respond in JSON” in
  the prompt to optimize performance.
* **Complex reasoning tasks** prompt models to perform more open-ended ambiguous tasks, such as assessing and scoring
  the quality of a call based on the relevance, professionalism, and faithfulness of answers. For these use cases, we
  recommend using the most powerful models like Anthropic’s `claude-sonnet-4-6` or Mistral AI’s `mistral-large2` and
  adding “Respond in JSON”, and details about the schema you want to generate in the prompt.

For the most consistent results, set the `temperature` option to 0 when you call COMPLETE, regardless of the task or
model.

> **Tip:**
>
> To handle possible errors raised by a model, use [TRY_COMPLETE](../../sql-reference/functions/try_complete-snowflake-cortex.md) rather than COMPLETE.

## Cost considerations

Cortex COMPLETE Structured Outputs incurs compute cost based on the number of tokens processed, but does not incur
additional compute cost for the overhead of verifying each token against the supplied JSON schema. However, the number
of tokens processed (and billed) increases with schema complexity. In general, the larger and more complex the supplied
schema is, the more input and output tokens are consumed. Highly-structured responses with deep nesting (e.g.,
hierarchical data) consume a larger number of tokens than simpler schemas.

## Limitations

* You cannot use spaces in the keys of the schema.
* The characters allowed for property names are letters, digits, hyphen, underscore. Names may be a maximum of 64 characters long.
* You cannot address external schemas using `$ref` or `$dynamicRef`.

The following constraint keywords are not supported. The use of an unsupported constraint keyword results in an error.

| Type | Keywords |
| --- | --- |
| integer | `multipleOf` |
| number | `multipleOf`, `minimum`, `maximum`, `exclusiveMinimum`, `exclusiveMaximum` |
| string | `minLength`, `maxLength`, `format` |
| array | `uniqueItems`, `contains`, `minContains`, `maxContains`, `minItems`, `maxItems` |
| object | `patternProperties`, `minProperties`, `maxProperties`, `propertyNames` |

These limitations might be addressed in future releases.

## Error conditions

| Situation | Example message | HTTP status code |
| --- | --- | --- |
| Request validation failed. The query was cancelled as the model wouldn’t be able to generate a valid response. This can be caused by a malformed request. | `please provide a type for the response format object`, `please provide a schema for the response format object` | 400 |
| Input schema validation failed. The query was cancelled as the model wouldn’t be able to generate a valid response. This can be caused by missing required properties in request payload or using unsupported json schema features such as constraints, or inappropriate use of $ref mechanism (for example, reaching outside of the schema | `input schema validation error: <reason>` with one of the reasons below:   * `/properties/city additional properties are not allowed` * `/properties/arrondissement regexp pattern ^[a-zA-Z0-9_-]{1,64}$ mismatch on string` * `/properties/province/type sting should be one of [\"object\", \"array\", \"string\", \"number\", \"integer\", \"boolean\", \"null\"]` * `Invalid ref #/http://example.com/custom-email-validator.json#. Please define a valid object in #/$defs/ section` | 400 |
| Model output validation failed. The model could not generate a response that matched the schema. | `json mode output validation error: <reason>` with one of the reasons below:   * `An error occurred while unmarshalling the model output. Model returned invalid JSON that cannot be parsed due to: unexpected end of JSON input` | 422 |

---
title: AI_COMPLETE with documents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-complete-document-intelligence.md
section: Snowflake Cortex (AI & ML)
---

# AI_COMPLETE with documents

The Cortex AI_COMPLETE function is a general purpose AI Function that can understand data stored in PDF, Microsoft Word,
and other document file formats. You can use AI_COMPLETE to perform a variety of document data extraction tasks, such as:

* Answer questions using data in graphs and charts.
* Finding relations between charts and document text.
* Summarizing document content in the a specific question.
* Extracting entities from documents.

An advantage of AI_COMPLETE over other [document processing AI Functions](ai-documents.md) is the ability to choose a
model, so you can use the best model for your specific document processing task.

## Processing documents with AI_COMPLETE

The COMPLETE function processes documents files stored in an internal Snowflake stage or an external stage. The
completion prompt can reference a single document or multiple documents. For example, you compare the correctness of a
translation of marketing materials by providing the original and translated documents as input to the function, along
with a prompt asking the model to evaluate the translation quality.

When calling the function, you must specify the model to use and a prompt. The prompt should include instructions along
with a FILE object reference for each document you want to process. See Examples for sample prompts and completions, and
[AI_COMPLETE (Prompt object)](../../sql-reference/functions/ai_complete-prompt-object.md) for function call syntax.

### Input requirements

AI_COMPLETE is optimized for documents both digital-born and scanned. The following table lists the limitations and
requirements of input documents:

|  |  |
| --- | --- |
| Supported file type | All models: .txt, .md, .pdf  Claude models: .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml |
| Stage encryption | Server-side encryption |
| Data type | [FILE](../../sql-reference/functions/to_file.md) object |

> **Note:**
>
> Processing files from stages with AI_COMPLETE is currently incompatible with custom network policies.

### Examples

The following examples illustrate how to use AI_COMPLETE to process documents for three common use cases: chart Q&A,
contextualized document summarization, and technical report exploration.

## Chart Q&A example

The following example uses Anthropic’s Claude Opus 4 model to analyze data represented in a chart within the context of
the document `hdr2023-24snapshoten.pdf` stored in the `@docs` stage.

```sqlexample
SELECT AI_COMPLETE(
  MODEL => 'claude-4-opus',
  PROMPT => PROMPT('Compare the distributions of HDI in each group: low HDI group, medium HDI group, high HDI group and very high HDI group visualized in {0}', TO_FILE('@docs', 'hdr2023-24snapshoten.pdf'))
);
```

Response:

```output
Looking at the document, I can see Figure S.2 on page 6 which shows the recovery of HDI values since the 2020-2021
decline across different HDI groups. The visualization shows:

**Low HDI group**:
- 49% recovered
- 51% did not recover

**Medium HDI group**:
- The document doesn't provide specific recovery percentages for this group in the figure

**High HDI group**:
- The document doesn't provide specific recovery percentages for this group in the figure

**Very high HDI group**:
- 100% recovered (all OECD countries)

The document also provides additional insights about HDI distributions:

1. **Inequality trends** (Figure S.3, page 7): The inequality between very high HDI and low HDI countries has been
   increasing since 2020, reversing a long-term declining trend. The difference in HDI values between these groups
   increased from 0.38 in 2017 to 0.39 in 2020 and is projected to continue rising.

2. **Overall recovery patterns**: The text notes that "every Organisation for Economic Co-operation and Development
   country is projected to have recovered, but only about half of the Least Developed Countries are projected to have
   done so."

3. **Global HDI trajectory**: While the global HDI value is projected to reach a record high in 2023, it remains below
   its pre-2019 trend, suggesting a permanent shift in the development trajectory.

The document emphasizes the highly unequal nature of recovery across HDI groups, with countries in higher HDI
categories showing much stronger recovery rates than those in lower HDI categories.
```

## Contextualized document summarization example

The following example uses Anthropic’s Claude Sonnet 4 model to extract the summary of a legal text with a complex
layout. The document `CELEX_32008R1008_EN_TXT.pdf` is stored inthe @docs stage; the prompt narrows the summarization
context.

```sqlexample
SELECT AI_COMPLETE(
  MODEL => 'claude-4-sonnet',
  PROMPT => PROMPT('summarize the changes from the perspective of private aviation {0}', TO_FILE('@docs', 'CELEX_32008R1008_EN_TXT.pdf'))
);
```

Response:

```output
From a private aviation perspective, this EU Regulation 1008/2008 introduces several significant changes that affect
private operators and business aviation:

## Key Changes Affecting Private Aviation:

### **Operating License Requirements**
- **Stricter financial oversight**: Private operators must demonstrate they can meet financial obligations for 24
  months from start of operations, with enhanced monitoring during the first two years
- **Simplified requirements for smaller operators**: Aircraft under 10 tonnes MTOM and/or less than 20 seats need
  only demonstrate €100,000 net capital (unless operating scheduled services or exceeding €3 million turnover)

### **Aircraft Registration and Leasing**
- **Flexible registration**: Private operators can register aircraft either in their home Member State or anywhere within the Community

- **Wet lease restrictions**: Stricter limitations on wet leasing aircraft from third countries, requiring prior
  approval and justification based on:
  - Exceptional needs (up to 7 months, renewable once)
  - Seasonal capacity requirements
  - Operational difficulties when EU-registered aircraft unavailable
- **Safety equivalence**: All third-country leased aircraft must meet equivalent safety standards to EU requirements.

### **Enhanced Supervision**

- **Unified oversight**: The same Member State authority now oversees both the Air Operator Certificate (AOC) and
  operating license, improving efficiency for operators with bases in multiple countries
- **Regular assessments**: Mandatory financial reviews, particularly after two years of operation and when potential
  problems are suspected

### **Insurance Requirements**
- **Extended coverage**: Insurance requirements now explicitly include mail liability coverage in addition to
  passengers, cargo, and third parties

### **Operational Flexibility**
- **Code-sharing freedom**: Private operators can more freely enter into code-share arrangements on intra-Community
  routes and routes to third countries
- **Pricing freedom**: Complete freedom to set fares and rates for intra-Community services

### **Administrative Streamlining**
- **Consolidated regulation**: The three separate regulations are now combined into one comprehensive framework,
  simplifying compliance
- **Reduced bureaucracy**: Member States cannot require documents already provided to licensing authorities

These changes generally **liberalize** private aviation operations within the EU while **strengthening** financial
and safety oversight, creating a more integrated and competitive market for private operators.
```

## Technical report exploration

The following example uses the Gemini 3.1 Pro model to analyze casualty data represented in the diagrams of a technical report. The document `75mm-M3-spec-booklet-MK-VI.pdf` is stored in the `@docs` stage.

```sqlexample
SELECT AI_COMPLETE(
  MODEL => 'gemini-3.1-pro',
  PROMPT => PROMPT('explain findings from figures 69-73 of {0}', TO_FILE('@docs', '75mm-M3-spec-booklet-MK-VI.pdf'))
);
```

Response:

```output
Based on the provided document, specifically **page 4**, here is an explanation of the findings from Figures 69
through 73. These figures illustrate the fragmentation patterns and effectiveness of the **75-mm Shell, H.E., M48**
when fired from an M3 Gun. They visualize how dangerous the shell is to personnel (casualties) and equipment
(perforation of mild steel) at different burst heights and orientations.
```

### Supported models and limitations

All models available to Snowflake Cortex have limitations on the total number of input and output tokens, known as the
model’s *context window.* The context window size is measured in tokens. Inputs exceeding the context window limit
result in an error.

For text models, tokens generally represent approximately four characters of text; the word count corresponding to a
limit is somewhat less than the context window given in tokens. For image models, the token count per document depends
on the vision model’s architecture. Tokens within a prompt (e.g., “summarize this document:”) also contribute to the
model’s context window.

| Model | Context window (tokens) | File types | File size | Max pages | Documents per prompt |
| --- | --- | --- | --- | --- | --- |
| `gemini-3.1-pro` | 1,000,000 | .pdf, .txt, .md | 37.5MB | 3,000 | 20 |
| `gemini-2.5-flash` | 1,000,000 | .pdf, .txt, .md | 37.5MB | 1,000 | 20 |
| `gemini-2.5-flash-lite` | 1,000,000 | .pdf, .txt, .md | 37.5MB | 1,000 | 20 |
| `claude-3-7-sonnet` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 4.5MB | 100 | 5 |
| `claude-4-sonnet` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |
| `claude-4-opus` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |
| `claude-haiku-4-5` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |
| `claude-sonnet-4-5` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |
| `claude-opus-4-5` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |
| `claude-sonnet-4-6` | 200,000 | .txt, .md, .pdf, .doc, .docx, .xls, .xlsx, .csv, .xhtml | 22MB | 100 | 5 |

### Access control requirements

To use the AI_COMPLETE function, a user with the ACCOUNTADMIN role must grant the SNOWFLAKE.CORTEX_USER database role to the user who
will call the function. See [Cortex LLM privileges](aisql.md) topic for details.

Users must also have READ access to the stage and file being processed.

### Cost considerations

Cost is determined by the total number of [tokens processed](aisql.md), not by file
size. When documents are uploaded, textual content is extracted and converted into tokens; visual page segments (images)
are also transformed into tokens. Billing is based on the sum of input tokens (text plus images that the model reads)
and output tokens (text the model generates).

Actual token counts vary based on the underlying architecture of a model, as well as the document composition and
structure. Content such as dense tables, spreadsheets, structured data, code, repeated headers and footers, or
OCR-derived text may increase token volume. Conversely, image-heavy or slide-based documents with minimal extractable
text may result in lower token counts.

> **Note:**
>
> The AI_COUNT_TOKENS function does not currently support document inputs in multimodal models.

### Choosing a model

The [MMLongBench-Doc](https://proceedings.neurips.cc/paper_files/paper/2024/hash/ae0e43289bffea0c1fa34633fc608e92-Abstract-Datasets_and_Benchmarks_Track.html)
benchmark is used for evaluating model capabilities in multimodal and long context comprehension, including cross page information retrieval.

| Model | MMLongBench-Doc score |
| --- | --- |
| claude-4-6-sonnet | 46.8% |
| claude-3-7-sonnet | 52.8% |
| claude-4-sonnet | 50.2% |
| claude-4-opus | 53.0% |
| claude-haiku-4-5 | 48.9% |
| claude-4-6-sonnet-5 | 61.4% |
| claude-opus-4-5 | 63.8% |
| claude-4-6-sonnet-6 | 62.3% |
| gemini-3.1-pro | 60.5% |

### Regional availability

See [Regional availability](aisql.md).

### Error conditions

Snowflake Cortex AI_COMPLETE can produce the following error messages:

| Message | Explanation |
| --- | --- |
| _COMPLETE_WITH_PROMPT_HISTORY_LLM$V1 with remote service error: 400 ‘“invalid request parameters: unsupported document content type: application/vnd.ms-excel” | The selected file of an unsupported type (in this example, a Microsoft Excel file). Only Claude models support Excel files. |
| Request failed for external function _COMPLETE_WITH_PROMPT_HISTORY_LLM$V1 with remote service error: 400 ‘“invalid request parameters: File data exceeds the limit of 10.00 MB for file prefix/file.pdf” | File size exceeds limit (10MB in this example). |
| Remote file [‘@docs/file.pdf](mailto:'%40docs/file.pdf)’ was not found. There are several potential causes. The file might not exist. The required credentials may be missing or invalid. If you are running a copy command, please make sure files are not deleted when they are being loaded or files are not being loaded into two different tables concurrently with auto purge option. | Possibly an error in the filename. Filenames are case-sensitive. Or the file might have been deleted. |
| Error in secure object | May indicate that the stage does not exist. Check the stage name and ensure that the stage exists and is accessible. Be sure to use an at sign (@) at the beginning of the stage name. Ensure that the stage uses server-side encryption. |
| Request failed for external function COMPLETE$V6 with remote service error: 400 ‘“model "model_name" does not support given modality” | The model provided in the request doesn’t support document or text modality. |
| Request failed for external function _COMPLETE_WITH_PROMPT with remote service error: 500 ‘“internal error” | Issue with processing the request on the server side. It could be the case that the file is corrupted or truncated. |

### Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Artifacts in Snowflake Intelligence
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/artifacts.md
section: Snowflake Cortex (AI & ML)
---

# Artifacts in Snowflake Intelligence

Snowflake Intelligence delivers rich, interactive charts and tables as part of its responses. When you find an insight worth keeping, you can save or share it as an artifact. An artifact is a persistent representation of that insight that you can revisit, refresh, and collaborate on without regenerating it. After being created, an artifact preserves the query, visualization, and context so you can return later to see fresh data, or share it with a teammate who sees the same artifact filtered through their own data permissions.

## Interactive tables and charts

When you ask Snowflake Intelligence a question, the agent generates a response that may include a chart, a table, or both. Charts and tables are interactive, so you can sort, filter, search, and resize directly without asking a new question. In explorer mode, charts and tables are synced so that interactions in one update the other.

> **Note:**
>
> Queries default to rolling time windows (for example, “last 30 days” always means the most recent 30 days). If you need a fixed time period, ask with explicit dates such as “show me data from November 15 through November 22.”

## Save artifacts

When a chart or table contains an insight you want to keep, select Save to create an artifact. The artifact preserves the underlying query, visualization settings, and a data snapshot so it loads instantly when viewed later.

## Manage artifacts

The artifacts hub is the central place to manage your artifacts. It contains the following tabs:

* Saved: All artifacts you’ve saved.
* Shared with me: Artifacts shared with you through a link.

The hub displays cached snapshots as tile previews for fast loading. You can select a tile to expand the artifact, see additional context, and start a follow-up conversation. You can also search for saved artifacts by name within the artifacts hub.

Artifacts auto-refresh when you view them more than 12 hours after your last view. You can also refresh manually at any time. The refresh re-runs the original SQL query with your current credentials and updates both the data and the snapshot.

You can ask follow-up questions on any saved artifact. Each follow-up starts a new conversation thread that includes the artifact’s visualization spec, data snapshot, and a summary of the original conversation context. The original conversation stays private and unchanged.

## Share artifacts

You can share an artifact by copying a link and sending it through any communication channel. When you share a link, you create a pointer to a single artifact object, not a copy. Any account user with the link can open the shared artifact, as long as they have access to the underlying data.

When a recipient opens the link:

* The artifact runs the SQL query using the recipient’s credentials, respecting their role-based access controls (RBAC), row-level security, and column masking.
* The artifact appears in the recipient’s Shared with me tab in the artifacts hub.
* The recipient can explore the artifact, ask follow-up questions, and return to it later.

> **Note:**
>
> Recipients can re-share artifacts they have access to. Recipients can also save a shared artifact to their own Saved tab.

### Follow-up conversations about shared artifacts

Recipients can ask follow-up questions using the same agent that created the artifact, if they have access to that agent. If they don’t have agent access, Snowflake Intelligence displays a warning that follow-up questions may not be available or may produce degraded results with a different agent.

Follow-up conversations are private to the person asking. No information flows back to the original sharer.

### Revoking access

You can unshare an artifact at any time. Unsharing invalidates the link immediately and no one can open it afterward.

> **Important:**
>
> Admins can’t disable artifact sharing using the UI or SQL. To disable sharing for your account, contact your Sales Engineer, Account Executive, or [Snowflake Support](https://community.snowflake.com/s/article/How-To-Submit-a-Support-Case-in-Snowflake-Lodge).

## Security and access control

Artifacts follow a caller’s-rights model. Every data interaction validates the current user’s permissions at runtime.

The following security behaviors apply:

* **Saved artifacts are user-scoped:** Saved artifacts are private to each user. Other users can only see artifacts that are explicitly shared.
* **RBAC is enforced:** Every refresh and share runs the query under the viewer’s current role and credentials. Two users with different roles may see different results from the same artifact.
* **Ownership is persistent:** Artifacts are tied to the user, not to a specific role or agent. If you lose access to the originating agent, you keep the artifact and can still refresh it as long as you have access to the underlying data.

## Artifact lifecycle

Saved artifacts persist until you explicitly delete them. Snowflake Intelligence never automatically deletes a saved artifact.

The following table describes what happens when access conditions change:

| Condition | What happens |
| --- | --- |
| You lose agent access | You can still view and refresh the artifact. Follow-up questions with the original agent are not available. |
| You lose data access | The last cached snapshot remains visible but refresh is unavailable. |
| Agent is deleted or modified | The artifact and its saved query are unaffected. Follow-up questions use the current agent definition, if available. |

When an agent is no longer available, Snowflake Intelligence displays a warning.

## Known limitations

* **Single artifacts only:** Currently, you can save and share an individual tile per artifact. Collections of multiple tiles aren’t supported.
* **No user-level sharing permissions:** Currently, sharing is link-based and public within the account. You can’t restrict a shared link to specific users.
* **No folders or labels:** Currently, artifacts can’t be organized into groups, folders, or labeled for categorization.

---
title: Batch Cortex Search
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/batch-cortex-search.md
section: Snowflake Cortex (AI & ML)
---

# Batch Cortex Search

The Batch Cortex Search function is a table function that lets you submit a batch of queries to a Cortex Search Service. It is intended for offline use cases with high throughput requirements, such as entity resolution, deduplication, or clustering tasks.

Jobs submitted to a Cortex Search Service with the `CORTEX_SEARCH_BATCH` function leverage additional compute resources to provide significantly higher throughput (queries per second) than the interactive (Python, REST, or SEARCH_PREVIEW) API search query surfaces.

## Syntax

Use the following syntax to query a Cortex Search Service in batch mode using the `CORTEX_SEARCH_BATCH`
table function:

```sqlexample
SELECT
    q.query,
    r.*
FROM query_table AS q,
LATERAL CORTEX_SEARCH_BATCH(
    service_name => '<database>.<schema>.<cortex_search_service>',
    query => q.query,                   -- optional STRING
    multi_index_query => q.miq,         -- optional VARIANT
    filter => q.filter,                 -- optional VARIANT
    limit => 10,                        -- optional INT
    options => q.options                -- optional VARIANT
) AS r;
```

## Parameters

The `CORTEX_SEARCH_BATCH` function supports the following parameters:

`service_name` (string, required)
:   Fully-qualified name of the Cortex Search Service to query.

`query` (string, optional)
:   Column containing query string for searching the service.

`multi_index_query` (variant, optional)
:   An object that specifies one or more vector or keyword query inputs to search against the service index. See [multi_index_query](cortex-search-overview.md) for details on how to construct this parameter.

    > **Note:**
    >
    > For performance reasons, `multi_index_query` currently supports at most one vector index entry in the query array.

`filter` (variant, optional)
:   Column containing filter objects to apply to the search results.

`limit` (integer, optional)
:   Maximum number of results to return per query. Default: 10.

`options` (variant, optional)
:   Column containing a VARIANT object with optional per-query settings. Supported top-level keys include:

    * `scoring_config` (object, optional): Same structure as the `scoring_config` parameter for interactive Cortex Search queries (Python, REST, or `SEARCH_PREVIEW`). Use it to customize ranking for that row’s batch query. See [Customizing Cortex Search scoring](cortex-search-customize-scoring.md).
    * `replicas` (integer, optional): How many copies of the search index serve that row’s batch query. Default: 2. Higher values can improve throughput; serving cost rises in proportion to the replica count.
    * `experimental` (object, optional): Object reserved for experimental or preview search behavior. Fields and semantics can change without notice. Use only when Snowflake documentation or support directs you to set specific keys.

> **Note:**
>
> At least one of `query`, `multi_index_query`, or `filter` must be specified.

## Usage notes

* The throughput of the batch search function might vary depending on the amount of data indexed in the queried Cortex Search Service and the complexity of the search queries. Run the function on a small number of queries to measure the throughput for your specific workload. In general, queries to larger services with more filter conditions see lower throughput.
* The throughput of the batch search function, the number of search queries processed per second, is not influenced by the size of the warehouse used to query it.
* Because batch search spins up dedicated resources to serve each job, it incurs additional startup latency. If you need to run fewer than 2,000 queries, you’ll typically get faster results using the interactive Cortex Search API (Python or REST API) rather than batch search.
* Unlike the interactive Cortex Search API, the batch search function can query services that are currently suspended in serving.
* A single Cortex Search Service can be queried in interactive and batch mode concurrently without any degradation to interactive query performance or throughput. Separate compute resources are used to serve interactive and batch queries.

## Cost considerations

Batch search has three cost components:

**Serving cost**
:   A charge based on the size of the search index data and the duration of the batch search job, excluding the startup time. It also reflects the `replicas` value in `options` (default 2); see the `replicas` option above.

**Query embedding cost**
:   A charge for the number of tokens embedded as a result of the input queries. Unlike interactive Cortex Search, query embedding is not free for batch search.

**Virtual warehouse cost**
:   A charge for the virtual warehouse compute used to run the batch job.

For usage tracking, see the [CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY](../../../sql-reference/account-usage/cortex_search_batch_query_usage_history.md) Account Usage view.
For more information on Cortex Search costs, see [Cost considerations](cortex-search-overview.md).

## Regional availability

Batch search is available in all regions where Cortex Search is available.
See [Regional availability](cortex-search-overview.md) for a full list of supported regions.

## Example Usage

In this example, match products in a user-submitted order form to a “golden” product catalog.
The `CORTEX_SEARCH_BATCH` call uses `options` so embeddings are computed without the default search query prefix; see [Disabling query prefix for vector embeddings](cortex-search-customize-scoring.md).
Use that setting only when you have evaluated the impact on result quality.

```sqlexample
-- Create the golden product catalog with canonical product names
CREATE OR REPLACE TABLE golden_catalog (product_name TEXT);
INSERT INTO golden_catalog VALUES
  ('Wireless Bluetooth Headphones'),
  ('Wireless Noise-Canceling Earbuds'),
  ('USB-C Charging Cable 6ft'),
  ('Portable Power Bank 10000mAh');

-- Create Cortex Search Service on the golden catalog
CREATE CORTEX SEARCH SERVICE golden_product_service
ON product_name
WAREHOUSE = <warehouse_name>
TARGET_LAG = '1 day'
AS
SELECT product_name FROM golden_catalog;

-- Create a table of user-submitted products (may contain variations or typos)
CREATE OR REPLACE TABLE submitted_products (product TEXT);
INSERT INTO submitted_products VALUES
  ('bluetooth headphones wireless'),
  ('usb c cable');

-- For each user-submitted product, query the service for the two closest golden results
SELECT
  q.product, s.*
FROM submitted_products AS q,
LATERAL CORTEX_SEARCH_BATCH(
    service_name => 'golden_product_service',
    query => q.product,
    limit => 2,
    options => OBJECT_CONSTRUCT(
        'scoring_config', OBJECT_CONSTRUCT(
            'disable_vector_embedding_query_prefix', true
        )
    )
) AS s;
```

The following example uses `multi_index_query` to submit precomputed embeddings as the query input instead of raw text. Here, the source table `my_db.my_schema.product_embeddings` contains a column `embedding` with precomputed vectors, and the Cortex Search Service `my_db.my_schema.golden_product_service` was created with a bring-your-own-vector (BYOV) configuration. For details on constructing `multi_index_query`, see [multi_index_query](cortex-search-overview.md).

```sqlexample
SELECT
    q.product_name,
    s.*
FROM (
    SELECT
        product_name,
        embedding::ARRAY AS emb_arr
    FROM my_db.my_schema.product_embeddings
    LIMIT 100000
) q,
LATERAL CORTEX_SEARCH_BATCH(
    service_name => 'my_db.my_schema.golden_product_service',
    multi_index_query => OBJECT_CONSTRUCT(
        'EMBEDDING', ARRAY_CONSTRUCT(
            OBJECT_CONSTRUCT('vector', q.emb_arr)
        )
    ),
    limit => 5
) s;
```

---
title: Build agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/build-agents.md
section: Snowflake Cortex (AI & ML)
---

# Build agents

You can build agents for Snowflake Intelligence using the following methods:

* In Snowsight
* Using the [Agent object REST API](../cortex-agents-rest-api.md)
* With the [Cortex Agents SQL](../../../sql-reference/commands-cortex-agent.md) commands

The following sections provide information about how to build agents for Snowflake Intelligence using SQL commands. Each section provides information about a different part of the agent configuration. The final section shows an example of an agent configuration that includes all of the components described in this topic.

For more information about the other methods to create an agent and the options available, see [Configure and interact with Agents](../cortex-agents-manage.md).

## Agent structure

An agent consists of the following parts:

* The base model that provides the foundation for the agent’s behavior
* A model (the orchestrator) that interprets intent, selects the right tools, and plans the sequence of actions
* Instructions for the agent’s behavior
* Tools for the agent to use
* Resources for the tools

The following sections provide information about model selection and tool configuration. This example uses a semantic view, a Cortex Search service, and a custom tool to provide answers. Although you can create a basic agent that doesn’t use any of these tools, that basic agent can only use the base model to provide answers. As a result, the agent lacks access to data within your Snowflake account and has limited context for answers.

For information about the other components of the agent, see [Cortex Agents](../cortex-agents.md).

## Prerequisites

To create a Cortex Agent, you must use a role with the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE AGENT | Schema | Required to create the Cortex Agent. |
| USAGE | Database, schema | Required to create the Cortex Agent in the specified database and schema. |

The following code grants the necessary privileges to create a Cortex Agent:

> ```sqlexample
> GRANT USAGE ON DATABASE <database_name> to ROLE <role_name>;
> GRANT USAGE ON SCHEMA <database_name>.<schema_name> to ROLE <role_name>;
> GRANT CREATE AGENT ON SCHEMA <database_name>.<schema_name> to ROLE <role_name>;
> ```

In addition to the privileges required to create a Cortex Agent, the following prerequisites are necessary to connect the agent to specific tools:

* A semantic view to connect to the agent
  :   For information about creating a semantic view, see [Overview of semantic views](../../views-semantic/overview.md).
* A Cortex Analyst tool to connect to the agent
  :   For information about creating a Cortex Analyst tool, see [Cortex Analyst](../cortex-analyst.md).
* Unstructured data in a database to connect to the agent
* A Cortex Search tool to connect to the agent
  :   For information about creating a Cortex Search tool, see [Cortex Search](../cortex-search/cortex-search-overview.md).
* A custom tool to connect to the agent
  :   For information about creating user-defined functions (UDFs) and stored procedures to use as custom tools, see [Extending Snowflake with Functions and Procedures](../../../developer-guide/extensibility.md).

To attach tools to an agent, the role that is used to create the agent must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Cortex Search service | Required to add the Cortex Search service to the Cortex Agent. |
| SELECT | Table/View | Required to access the objects referenced in the agent’s semantic view/model. |
| USAGE | Tools | Required to access all of the custom tools to attach to the agent. For example, if the custom tool is a stored procedure, then the you must have USAGE on the procedure. |
| USAGE | Semantic view/model | Required to access the semantic view/model to attach to the agent. |

## Agent configuration basics

When you create an agent, you must specify information about the agent, such as the name, description, and model. You can also specify the tools that the agent can use and the resources that the agent can access. These resources are passed as a YAML specification in the `FROM SPECIFICATION` clause of the `CREATE AGENT` command.

The following recommendations provide best practices for this configuration:

**Scope agents narrowly:** Before adding tools or writing instructions, define why the agent exists, who it serves, and what specific questions it should answer. This step shapes everything that follows, from tool selection to performance and trust. Snowflake recommends that you narrow the agent’s scope to a specific, high-value use case.

After an agent proves reliable in one area, you can replicate the pattern for others. For example, you could have one agent to analyze your store’s recent sales and marketing data, and another that recommends the best SKUs to pitch to the retailer.

**Select the number of tools carefully:** Every agent should have access to only the tools it needs. To determine that, consider the documents or data that the agent needs to fulfill its purpose. If the agent needs to access unstructured data, use [Cortex Search](../cortex-search/cortex-search-overview.md). If the agent needs to access structured data, use [Cortex Analyst](../cortex-analyst.md). If the agent needs other tools, you can use custom tools.

**Write a useful tool description:** These descriptions are used to help the agent understand what the tool does and how to use it. Unclear tool descriptions can create cascading failures and lead to “hallucinations.”

> To create a useful tool description, follow these guidelines:
>
> > * Add a clear and specific tool name that clarifies the tool’s domain (“Customer”, “Sales”) and function (“Analytics”, “Search”).
> > * Write a purpose-driven tool description that tells the agent:
> >
> >   + What the tool does
> >   + Which data it accesses
> >   + When to use it
> >   + When NOT to use it
> > * Be explicit about the tool’s expected inputs. Ambiguous inputs to your tools lead to incorrect tool calls and errors.
> >
> >   + Be specific.
> >   + Specify the data format.
> >   + Provide clear data instructions.
> >   + Provide default guidance.
> >   + Use consistent terminology.

For more agent configuration recommendations, see [Best Practices for Building Cortex Agents](https://www.snowflake.com/en/developers/guides/best-practices-to-building-cortex-agents/).

## Model selection

When you create an agent, we recommend that you select auto for the model. With this option, Cortex automatically selects the highest quality model for your account, and the quality automatically improves as new models become available. For more information about the available models, see [Supported models and regions](reference.md).

The following example shows how to specify the model for the agent:

```yaml
models:
  orchestration: auto
```

### Cross-region inference

> **Important:**
>
> Cross-region inference is disabled by default. We recommend using cross-region inference to access the full set of LLMs and avoid limitations within a single region.

When using a model that is not available in the local region, you must use Cortex cross-region inference. This setting enables inference requests to be processed in a different region from the default region. The parameter for cross-region inference can only be set at the account level by the ACCOUNTADMIN role, not at the user or session levels.

To set the parameter, use the following command:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'ANY_REGION';
```

For more information about configuring Cortex cross-region inference, see [Cross-region inference](../cross-region-inference.md).

## Connect semantic views using Cortex Analyst (structured data)

Snowflake Intelligence supports semantic views, which are a type of structured data with instructions that tell the agent how to query or interpret the data. Cortex Agents use Cortex Analyst to retrieve structured data from semantic views by converting natural language requests into SQL queries. Agents can route across multiple semantic views to provide the response.

Each semantic view should cover a similar set of tables. You can set data-specific defaults, such as always adding a date filter for the past three months if not specified or always excluding internal accounts.

You can connect a semantic view to an agent by specifying the semantic view as part of the tool resources. The following example shows how to connect a semantic view to an agent and how to specify the Cortex Analyst tool to retrieve structured data from the semantic view:

```yaml
tools:
  - tool_spec:
      type: "cortex_analyst_text_to_sql"
      name: "<your cortex analyst tool name>"
      description: "<clear and specific tool description>"

tool_resources:
  <your cortex analyst tool name>:
    semantic_view: "<db>.<schema>.<semantic_view>"
```

### Best practices for semantic views

Semantic views power how Snowflake Intelligence understands and queries your data. A well-designed semantic view improves accuracy, reduces latency, and builds user trust. The following best practices are designed to help you create a semantic view that is accurate and efficient:

**Start small and focused:** Begin with 5-10 tables in a single business domain. Organize by use case (Sales Performance, Customer Support Metrics) rather than by data structure. Scale after you validate accuracy.

**Write clear descriptions:** Descriptions are the most important element. Every table and column should have a business-friendly description that explains what the data represents, not just its name. Include context like calculation logic, business definitions, and any legacy terminology.

**Add verified queries:** These are examples of questions paired with validated SQL. They improve accuracy on similar questions, reduce latency, and help the system learn your business patterns. Start with 10 to 20 queries that cover your most common questions, and add more based on actual usage.

**Define metrics and filters:** Pre-define reusable calculations (like total revenue or average order value) and common conditions (like active customers or current fiscal year). These can significantly improve consistency.

**Use custom instructions for business logic:** Add SQL generation instructions for data quirks, fiscal year definitions, default filters, or domain-specific rules. Be specific: “If no date filter is provided, default to last 12 months” is better than “filter by date.”

**Enable Cortex Search for text matching:** For high-cardinality text columns like product names, customer names, or company names, Cortex Search enables fuzzy matching when user input does not exactly match your data.

**Test and iterate:** Create an evaluation set of representative questions, measure accuracy, and refine based on real usage patterns. Review suggestions regularly to add verified queries and improve descriptions over time.

For more information about best practices for creating semantic views, see [Best Practices for Semantic Views in Cortex Analyst](https://www.snowflake.com/en/developers/guides/best-practices-semantic-views-cortex-analyst/).

## Connect Cortex Search (unstructured data)

To process unstructured data, you can connect a Cortex Search tool to an agent by specifying the Cortex Search tool in the YAML specification as part of the tool resources. Cortex Search services retrieve documents and records from unstructured data sources using semantic search. The two primary use cases for Cortex Search are retrieval augmented generation (RAG) and enterprise search. For information about creating a Cortex Search service, see [Cortex Search](../cortex-search/cortex-search-overview.md). You can also use a Cortex Knowledge Extension (CKE) that is shared with you.

When you connect a Cortex Search tool to an agent, it is especially important to include the following information about the parameters and their expected values:

* Type and format (include examples)
* Whether required or optional (with default values)
* Valid values or constraints (enums, ranges, formats)
* Relationship to other parameters (dependencies, conflicts)
* How to obtain the value (especially for IDs)

The following example shows how to connect a Cortex Search tool to an agent and how to specify the Cortex Search tool in the YAML specification:

```yaml
tools:
  - tool_spec:
      type: "cortex_search"
      name: "<your cortex search tool name>"
      description: "<clear and specific tool description>"

tool_resources:
  <your cortex search tool name>:
    name: "<db>.<schema>.<search_service_name>"
    max_results: "5"
    filter:
      "@eq":
        region: "North America"
    title_column: "<title_name>"
    id_column: "<column_name>"
```

## Add custom tools

Snowflake Intelligence supports custom tools, which are user-defined functions or stored procedures that can be used to implement custom business logic. You can connect a custom tool to an agent by specifying the custom tool in the YAML specification as part of the tool resources.

The following example shows how to connect a custom tool to an agent and how to specify the custom tool in the YAML specification:

```yaml
tools:
  - tool_spec:
      type: "custom_tool"
      name: "<your custom tool name>"
      description: "<clear and specific tool description>"

tool_resources:
  <your custom tool name>:
    user-defined-function-argument: "argument1"
```

## Create an agent

* Combine all of the tools and components to create an agent using SQL:

  > ```sqlexample-yaml
  > CREATE OR REPLACE AGENT <agent_name>
  >     COMMENT = 'agent level comment'
  >     PROFILE = '{"display_name": "My Business Assistant", "avatar":  "business-icon.png", "color": "blue"}'
  >     FROM SPECIFICATION
  >     $$
  >     models:
  >     orchestration: claude-4-sonnet
  >
  >     orchestration:
  >     budget:
  >         seconds: 30
  >         tokens: 16000
  >
  >     instructions:
  >     response: "You will respond in a friendly but concise manner"
  >     orchestration: "For any revenue question, use Analyst; for policy questions, use Search"
  >     system: "You are a friendly agent that helps with business questions"
  >     sample_questions:
  >         - question: "What was our revenue last quarter?"
  >         answer: "I'll analyze the revenue data using our financial database."
  >
  >     tools:
  >     - tool_spec:
  >         type: "cortex_analyst_text_to_sql"
  >         name: "<your cortex analyst tool name>"
  >         description: "<clear and specific tool description>"
  >     - tool_spec:
  >         type: "cortex_search"
  >         name: "<your cortex search tool name>"
  >         description: "<clear and specific tool description>"
  >     - tool_spec:
  >         type: "data_to_chart"
  >         name: "data_to_chart"
  >         description: "Generates visualizations from data"
  >
  >     tool_resources:
  >     <your cortex analyst tool name>:
  >         semantic_view: "<db>.<schema>.<semantic_view>"
  >     <your cortex search tool name>:
  >         name: "<db>.<schema>.<search_service_name>"
  >         max_results: "5"
  >         filter:
  >         "@eq":
  >             region: "North America"
  >         title_column: "<title_name>"
  >         id_column: "<column_name>"
  >     $$;
  > ```

## Modifying an existing agent

For instructions on modifying the configuration for an existing agent, including adding tools and updating other details, see [Add tools](../cortex-agents-manage.md).

---
title: CKE document access history
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-access-history.md
section: Snowflake Cortex (AI & ML)
---

# CKE document access history

To help providers know which documents are being accessed in their [Cortex Knowledge Extensions (CKE)](cke-overview.md), Snowflake provides the following features:

* CKE access history data in the [LISTING_ACCESS_HISTORY view](../../../sql-reference/data-sharing-usage/listing-access-history.md) in the [SHARE_OBJECTS_ACCESSED array](../../../sql-reference/data-sharing-usage/listing-access-history.md).
* A [SYSTEM$ENCODE_CKE_PRIMARY_KEY](../../../sql-reference/functions/system_encode_cke_primary_key.md) system function.
* A [SYSTEM$CKE_HASH_FUNCTION](../../../sql-reference/functions/system_cke_hash_function.md) system function.

## Prerequisites

Because [primary keys](../cortex-search/cortex-search-overview.md) define a unique identifier for each document, you must specify a primary key
for the [Cortex Search Service](../cortex-search/cortex-search-overview.md) to get the access history.

> **Note:**
>
> Modifying the primary key columns of an existing Cortex Search Service invalidates the previous CKE access history.
>
> To interpret the previous CKE access history, save a mapping from the old primary key columns to the new primary key columns.

## Understand document IDs

Document IDs are composed of Cortex Search Service [primary keys](../cortex-search/cortex-search-overview.md). To protect customer data, Snowflake encodes and hashes the primary key columns when tracking the access history. You can map the primary keys to the provided hashed document ID using the following functions:

* [SYSTEM$ENCODE_CKE_PRIMARY_KEY](../../../sql-reference/functions/system_encode_cke_primary_key.md) function: Transform and anonymize the primary key from the set of selected columns.
* [SYSTEM$CKE_HASH_FUNCTION](../../../sql-reference/functions/system_cke_hash_function.md) function: Hash the primary key.

## Example CKE access history in the LISTING_ACCESS_HISTORY view

This example performs the following actions:

* Retrieves only CKE access information from the [LISTING_ACCESS_HISTORY view](../../../sql-reference/data-sharing-usage/listing-access-history.md) view and excludes all other events
* Uses the [SYSTEM$ENCODE_CKE_PRIMARY_KEY](../../../sql-reference/functions/system_encode_cke_primary_key.md) function to build an encoded representation of the CKE document’s primary key columns
* Retrieves the hash version and uses the [SYSTEM$CKE_HASH_FUNCTION](../../../sql-reference/functions/system_cke_hash_function.md) to compute a hashed document ID for every primary key
* Joins the computed hashed IDs and versions to the view to recover the original primary key columns

Step 1. Create a daily access summary table that retrieves only CKE access information.

```sqlexample
CREATE TABLE IF NOT EXISTS cke_document_daily_access AS
SELECT query_date,
       consumer_account_name,
       consumer_name,
       hashed_doc_id,
       hash_version,
       total_access_count
  FROM (
    SELECT query_date,
           consumer_account_name,
           consumer_name,
           flattened.value::string AS hashed_doc_id,
           lah.share_objects_accessed[0]:"hashVersion"::string AS hash_version,
      COUNT(*) AS total_access_count
      FROM snowflake.data_sharing_usage.listing_access_history AS lah,
        LATERAL FLATTEN(
          input => lah.share_objects_accessed[0]:"hashedDocumentIds"
        ) AS flattened
      WHERE lah.share_objects_accessed[0]:"objectDomain" = 'Cortex Search Service'
        AND lah.share_objects_accessed[0]:"hashVersion" IS NOT NULL
      GROUP BY query_date,
               consumer_account_name,
               consumer_name,
               hashed_doc_id,
               hash_version
);
```

Step 2. Create a table to store the encoded primary keys.

```sqlexample
CREATE TABLE IF NOT EXISTS encoded_primary_keys AS
  (
    SELECT pkCol1,
           pkCol2,
           SYSTEM$ENCODE_CKE_PRIMARY_KEY(pkCol1, pkCol2) AS encoded_primary_key
      FROM your_cortex_search_table
  )
```

Step 3. From the table you created in the previous step, prepare hash versions and compute hashed IDs for your primary keys. Then join the
`cke_document_daily_access` table with the hashed primary key view to recover the original primary key columns.

```sqlexample
WITH hash_versions AS
  (
    SELECT DISTINCT hash_version AS hash_version
      FROM cke_document_daily_access
  ),
  hashed_primary_key AS
  (
    SELECT pkCol1,
           pkCol2,
           hash_version,
           SYSTEM$CKE_HASH_FUNCTION(hash_version, encoded_primary_key) AS hashed_doc_id
      FROM encoded_primary_keys
      CROSS JOIN hash_versions
  )
SELECT pk.pkCol1,
       pk.pkCol2,
       a.query_date,
       a.consumer_account_name,
       a.consumer_name,
       a.total_access_count
  FROM cke_document_daily_access AS a
  JOIN hashed_primary_key AS pk
    ON a.hashed_doc_id = pk.hashed_doc_id
    AND a.hash_version = pk.hash_version;
```

---
title: Configure and interact with Agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-manage.md
section: Snowflake Cortex (AI & ML)
---

# Configure and interact with Agents

You can build an agent with the following methods:

* In Snowsight
* Using the [Agents REST API](cortex-agents-rest-api.md)
* With the [Cortex Agents SQL](../../sql-reference/commands-cortex-agent.md) commands

You can then integrate the agent into your application to perform tasks or respond to queries. You must first create an agent object that contains information such as the metadata, tools, and orchestration instructions that the agent can use to perform a task or answer questions. You can then reference the agent object in your application to integrate the agent’s functionality. You can configure a thread to maintain the context in memory, so that the client does not have to send the context at every turn of the conversation.

> **Note:**
>
> Snowflake REST APIs support authentication via programmatic access tokens (PATs), key pair authentication using JSON Web Tokens (JWTs), and OAuth. For details, see [Authenticating Snowflake REST APIs with Snowflake](../../developer-guide/snowflake-rest-api/authentication.md).

## Create an agent

Create an agent object by specifying the database and schema where the agent should be located, along with a name and description for the agent. In addition, specify the display name, avatar, and the color. These attributes are used by the client application to display the agent. The display name is also used as the handle to reference the agent in conversations.

For best practices when creating an agent, see [Best Practices to Building Cortex Agents](https://www.snowflake.com/en/developers/guides/best-practices-to-building-cortex-agents).

The following examples show how to create an agent object from Snowsight or using the REST API:

> Method 1: Snowsight UIMethod 2: REST APIMethod 3: SQL
>
> 1. Sign in to [Snowsight](../ui-snowsight-gs.md).
> 2. In the navigation menu, select AI & ML » Agents.
> 3. Select Create agent.
> 4. For Agent object name, specify a name for the agent that is displayed to users in the UI.
> 5. For Display name, specify a name for the agent that is displayed to admins in the agent list.
> 6. Select Create agent.
> 7. Prompt the agent with general knowledge requests.
>
> 1. Create an agent object by specifying the database and schema where the agent will be created, as well as the parameters needed for the agent. You can also specify tool fields when creating the agent object.
>
>    > ```bash
>    > curl -X POST "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents" \
>    > --header 'Content-Type: application/json' \
>    > --header 'Accept: application/json' \
>    > --header "Authorization: Bearer $PAT" \
>    > --data '{
>    >     "name": "TransportationAgent",
>    >     "comment": "This agent handles queries related to transportation methods and costs.",
>    >     "models": {
>    >         "orchestration": "claude-4-sonnet"
>    >     }
>    > }'
>    > ```
>
> Create an agent object in the database and schema where the agent will be created. You can specify the agent properties and specification using the `FROM SPECIFICATION` clause in the CREATE AGENT command. For more information, see [CREATE AGENT](../../sql-reference/sql/create-agent.md).
>
> ```sqlexample-yaml
> CREATE OR REPLACE AGENT myagent
>   COMMENT = 'agent level comment'
>   PROFILE = '{"display_name": "My Business Assistant", "avatar":  "business-icon.png", "color": "blue"}'
>   FROM SPECIFICATION
>   $$
>   orchestration:
>     budget:
>       seconds: 30
>       tokens: 16000
>
>   instructions:
>     response: "You will respond in a friendly but concise manner"
>     orchestration: "For any revenue question use Analyst; for policy use Search"
>     system: "You are a friendly agent that helps with business questions"
>     sample_questions:
>       - question: "What was our revenue last quarter?"
>         answer: "I'll analyze the revenue data using our financial database."
>
>   tools:
>     - tool_spec:
>         type: "cortex_analyst_text_to_sql"
>         name: "Analyst1"
>         description: "Converts natural language to SQL queries for financial analysis"
>     - tool_spec:
>         type: "cortex_search"
>         name: "Search1"
>         description: "Searches company policy and documentation"
>     - tool_spec:
>         type: "data_to_chart"
>         name: "data_to_chart"
>         description: "Generates visualizations from data"
>
>   tool_resources:
>     Analyst1:
>       semantic_view: "db.schema.semantic_view"
>     Search1:
>       name: "db.schema.service_name"
>       max_results: "5"
>       filter:
>         "@eq":
>           region: "North America"
>       title_column: "<title_name>"
>       id_column: "<column_name>"
>       columns_and_descriptions:
>         TEXT:
>           description: "The main text content of the document"
>           type: "string"
>           searchable: true
>           filterable: false
>         CATEGORY:
>           description: "Document category. Values include: policy, guide, reference."
>           type: "string"
>           searchable: false
>           filterable: true
>   $$;
> ```

## Add tools

After you’ve created the agent, you need to add tools and provide instructions on how to orchestrate across the tools. Agents support the following tool types:

> * **Cortex Analyst:** You specify the semantic views so that Cortex Analyst can use these to retrieve structured data. The Agents can route across multiple semantic views to provide the response.
>
>   > **Note:**
>   >
>   > When Cortex Analyst is invoked by an agent, it does not have access to open source LLM models. For a list of the models that Cortex Analyst can use when invoked by an agent, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
> * **Cortex Search:** You provide the Cortex Search indices as tools, along with column descriptions for filterable and searchable columns. The Cortex Agent uses the Cortex Search indices to retrieve unstructured data.
> * **Data to Chart:** You can enable the agent to automatically generate visualizations from data. When included in the tools array, the agent can create charts using Vega-Lite specifications in response to queries that would benefit from visual representation.
> * **Custom tools:** You can implement code for a specific business logic as a stored procedure or user defined function (UDF). Alternatively, you can use the custom tools to retrieve data from your backend systems using APIs.
> * **Web Search:** You can enable the agent to search the web and use those search results to generate responses and plan tasks.

You also specify the resources used by each tool. For example, on Cortex Analyst you specify the warehouse along with the timeout for SQL query execution. Similarly for Cortex Search, you specify the filters and column names used in the search query, along with the max results in the search response. For custom tools, you will provide the warehouse details.

> Method 1: Snowsight UIMethod 2: REST APIMethod 3: SQL
>
> To modify the configuration for an existing agent, follow these steps:
>
> 1. In the navigation menu, select AI & ML » Agents.
> 2. From the list of agents, select the agent that you want to modify.
>    :   The configuration details for the agent are displayed.
> 3. Select Edit.
> 4. For Description, describe the agent and how users can interact with it.
> 5. To add sample questions that users can ask the agent, enter a sample question and select Add a question.
> 6. Select Tools. Add one or more of the following tools.
>
>    > * **To add a semantic view in Cortex Analyst to the agent**: This section assumes that you already have a semantic view created. For information about semantic views and how to create one, see [Overview of semantic views](../views-semantic/overview.md).
>    >
>    >   > 1. Find Cortex Analyst and select the respective + Add button.
>    >   > 2. For Name, enter a name for the semantic view.
>    >   > 3. Select Semantic view.
>    >   > 4. Select the semantic view that the agent uses.
>    >   > 5. For Warehouse, select the warehouse that the agent uses to run queries.
>    >   > 6. For Query timeout (seconds), specify the maximum time in seconds that the agent waits for a query to complete before timing out.
>    >   > 7. For Description, describe the semantic view.
>    >   > 8. Select Add.
>    > * **To add a Cortex Search service to the agent**: This section assumes that you’ve already created a Cortex Search service. For information about creating a Cortex Search service, see [Cortex Search](cortex-search/cortex-search-overview.md). You can also use a Cortex Knowledge Extension (CKE) that is shared with you. For a tutorial that uses a CKE, see [Common issues and solutions](snowflake-intelligence/troubleshooting.md).
>    >
>    >   > 1. Find Cortex Search Services and select the respective + Add button.
>    >   > 2. For Name, enter a name for the Cortex Search service.
>    >   > 3. For Description, describe the Cortex Search service.
>    >   > 4. For Search service, select the Cortex Search service that the agent uses.
>    >   > 5. Under Tool details, add Columns Description to help the agent effectively use the search service. Column descriptions are not required for all columns, but providing them for filterable and searchable columns is recommended to improve the quality of results. Provide a description that explains the column’s content and sample values.
>    >   > 6. Select Add.
>    > * **To add a custom tool to the agent**: By adding custom tools, you can extend the functionality of your agents. With custom tools, the agent can call stored procedures and functions that you have defined to perform actions or do computations. This section assumes that you’ve already created a custom tool. For information about procedures and functions, see [Extending Snowflake with Functions and Procedures](../../developer-guide/extensibility.md).
>    >
>    >   > 1. Find Custom tools and select the respective + Add button.
>    >   > 2. For Name, enter a name for the custom tool.
>    >   > 3. For Resource type, select whether the custom tool is a function or a procedure. For information about whether to use a function or procedure, see [Choosing whether to write a stored procedure or a user-defined function](../../developer-guide/stored-procedures-vs-udfs.md).
>    >   > 4. For Custom tool identifier, select the existing function or procedure that you want to add as a custom tool.
>    >   > 5. The related parameters for the function or procedure automatically appear. You can manually add parameters for the custom tool by adding a name, type, description, and selecting whether the parameter is required. You can also modify parameters that automatically populate.
>    >   >
>    >   >    > **Note:**
>    >   >    >
>    >   >    > Snowflake Cortex does not support stored procedures and custom tools with a parameter of type `object`.
>    >   > 6. For Warehouse, select the warehouse that the agent uses to run the custom tool. You must manually select a warehouse.
>    >   > 7. For Description, describe the custom tool and how to use it.
>    >   > 8. Select Add.
>    >   > 9. After creating the custom tool, make sure users are granted USAGE privileges to the function or procedure that you added as a custom tool. When using stored procedures, agents maintain whether the procedure runs with owner’s or caller’s rights. For information about owner’s and caller’s rights, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
>    > * **To add web search tool to the agent**: This section assumes that you’ve already enabled web search at the account level. For information about enabling web search at the account level, see [Web search](cortex-agents.md).
>    >
>    >   > 1. Find Web search and select the respective toggle to enable the feature.
> 7. Select Save.
>
> To add tools to an agent using the REST API, add the following payloads as part of a request to [Update Cortex Agent](cortex-agents-rest-api.md). You can also specify these fields when creating the agent object.
>
> > * **Add Cortex Analyst tool and tool resources**: The following example shows how to add a Cortex Analyst tool and tool resources to an existing agent object.
> >
> >   > 1. Add a Cortex Analyst tool
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tools": [
> >   >    >   {
> >   >    >    "tool_spec": {
> >   >    >     "description": "Analyst to analyze price",
> >   >    >     "type": "cortex_analyst_text_to_sql",
> >   >    >     "name": "Analyst1"
> >   >    >    }
> >   >    >   }
> >   >    >  ]
> >   >    > }'
> >   >    > ```
> >   > 2. Add a Cortex Analyst tool resource
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tool_resources": {
> >   >    >   "Analyst1": {
> >   >    >    "semantic_model_file": "stage1",
> >   >    >    "semantic_view": "The name of the Snowflake native semantic model object",
> >   >    >    "execution_environment": {"type":"warehouse", "warehouse":"my_wh"}
> >   >    >   }
> >   >    >  }
> >   >    > }'
> >   >    > ```
> > * **Add Cortex Search tool and tool resources**: The following example shows how to add a Cortex Search tool and tool resources to an existing agent object.
> >
> >   > 1. Add a Cortex Search tool
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tool_spec": {
> >   >    >   "type": "cortex_search",
> >   >    >   "name": "Search1"
> >   >    >  }
> >   >    > }'
> >   >    > ```
> >   > 2. Add a Cortex Search tool resource:
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tool_resources": {
> >   >    >   "Search1": {
> >   >    >    "search_service": "db.schema.service_name",
> >   >    >    "filter": {"@eq": {"region": "North America"} },
> >   >    >    "max_results": 10,
> >   >    >    "title_column": "TITLE",
> >   >    >    "columns_and_descriptions": {
> >   >    >      "TEXT": {
> >   >    >        "description": "The main text content of the document",
> >   >    >        "type": "string",
> >   >    >        "searchable": true,
> >   >    >        "filterable": false
> >   >    >      },
> >   >    >      "CATEGORY": {
> >   >    >        "description": "Document category. Values include: policy, guide, reference.",
> >   >    >        "type": "string",
> >   >    >        "searchable": false,
> >   >    >        "filterable": true
> >   >    >      },
> >   >    >      "AUTHOR": {
> >   >    >        "description": "Author name in format: firstname.lastname",
> >   >    >        "type": "string",
> >   >    >        "searchable": false,
> >   >    >        "filterable": true
> >   >    >      }
> >   >    >    }
> >   >    >   }
> >   >    >  }
> >   >    > }'
> >   >    > ```
> >   >    >
> >   >    > The `columns_and_descriptions` field is a map of column names to column properties. Descriptions are not required for all columns, but providing them for filterable and searchable columns improves the quality of results. Each column entry must include:
> >   >    >
> >   >    > + `description` (string): A description of the column content and sample values. Include guidance on when and how to filter on this column.
> >   >    > + `type` (string): The column data type. Use `"string"` or `"datetime"`.
> >   >    > + `searchable` (boolean): Set to `true` for text index columns that can be searched. Vector index columns are not supported.
> >   >    > + `filterable` (boolean): Set to `true` for attribute columns that can be used in filter conditions.
> > * **Add data_to_chart tool**: The following example shows how to add the data to chart tool to an existing agent object.
> >
> >   > 1. Add the data_to_chart tool
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tools": [
> >   >    >   {
> >   >    >    "tool_spec": {
> >   >    >      "type": "data_to_chart",
> >   >    >      "name": "data_to_chart",
> >   >    >      "description": "Generates visualizations from data"
> >   >    >    }
> >   >    >   }
> >   >    >  ]
> >   >    > }'
> >   >    > ```
> > * **Add custom tool and tool resources**: The following example shows how to add a custom tool and tool resources to an existing agent object.
> >
> >   > 1. Add a custom tool
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tools": [
> >   >    >   {
> >   >    >    "tool_spec": {
> >   >    >      "description": "Custom tool",
> >   >    >      "type": "generic",
> >   >    >      "name": "custom1"
> >   >    >    }
> >   >    >   }
> >   >    >  ]
> >   >    > }'
> >   >    > ```
> >   > 2. Add a custom tool resource
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tool_resources": {
> >   >    >   "Custom1": {
> >   >    >    "user-defined-function-argument": "argument1"
> >   >    >   }
> >   >    >  }
> >   >    > }'
> >   >    > ```
> > * **Add web_search tool**: The following example shows how to add the web_search tool to an existing agent object. This section assumes that you’ve already enabled web search at the account level. For information about enabling web search at the account level, see [Web search](cortex-agents.md).
> >
> >   > 1. Add the web_search tool
> >   >
> >   >    > ```bash
> >   >    > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
> >   >    > --header 'Content-Type: application/json' \
> >   >    > --header 'Accept: application/json' \
> >   >    > --header "Authorization: Bearer $PAT" \
> >   >    > --data '{
> >   >    >  "tools": [
> >   >    >   {
> >   >    >    "tool_spec": {
> >   >    >      "type": "web_search",
> >   >    >      "name": "Web Search",
> >   >    >    }
> >   >    >   }
> >   >    >  ]
> >   >    > }'
> >   >    > ```
>
> You can update an agent object to add tools and tool resources using the ALTER AGENT command. For information about the ALTER AGENT command, see [ALTER AGENT](../../sql-reference/sql/alter-agent.md).
>
> > **Note:**
> >
> > The new specification completely replaces the existing one. Fields that are not included in the new specification are removed.
>
> ```sqlexample-yaml
> ALTER AGENT <agent_name> MODIFY LIVE VERSION SET SPECIFICATION =
> $$
> models:
>     orchestration: claude-4-sonnet
>
>   orchestration:
>     budget:
>       seconds: 30
>       tokens: 16000
>
>   instructions:
>     response: "You will respond in a friendly but concise manner"
>     orchestration: "For any revenue question use Analyst; for policy use Search"
>     system: "You are a friendly agent that helps with business questions"
>     sample_questions:
>       - question: "What was our revenue last quarter?"
>         answer: "I'll analyze the revenue data using our financial database."
>
>   tools:
>     - tool_spec:
>         type: "cortex_analyst_text_to_sql"
>         name: "Analyst1"
>         description: "Converts natural language to SQL queries for financial analysis"
>     - tool_spec:
>         type: "cortex_search"
>         name: "Search1"
>         description: "Searches company policy and documentation"
>     - tool_spec:
>         type: "data_to_chart"
>         name: "data_to_chart"
>         description: "Generates visualizations from data"
>
>   tool_resources:
>     Analyst1:
>       semantic_view: "db.schema.semantic_view"
>     Search1:
>       name: "db.schema.service_name"
>       max_results: "5"
>       filter:
>         "@eq":
>           region: "North America"
>       title_column: "<title_name>"
>       id_column: "<column_name>"
> $$;
> ```

## Specify orchestration

Cortex Agents orchestrate the task by breaking it into a sequence of sub-tasks and identifying the right tool for each sub-task. You specify the LLM that the Agent should use to conduct this orchestration. You can also influence the orchestration by providing instructions. For example, consider an agent built to respond to retail product questions. You can use the orchestration instruction `"Use the search tool for all requests related to refunds"` to ensure the Agent only provides refund policy details (using Cortex Search) and does not actually calculate the refund amounts (using Cortex Analyst). You can also specify instructions to align the response to a brand or a tone, such as `"Always provide provide a concise response; maintain a friendly tone"`.

Method 1: Snowsight UIMethod 2: REST APIMethod 3: SQL

1. Select Orchestration.
2. For the Orchestration model, select the model that the agent uses to handle orchestration.
3. For Planning instructions, provide instructions that influence tool selection by the agent based on user-provided input. These can include specific instructions about when to use each tool, or even to always use a tool at the beginning or end of a response.
4. For Response instruction, provide instructions that the model uses for response generation. For example, specify if you want the agent to prioritize chart creation, or to keep a certain tone with users.
5. For Budget configuration, you can specify time limit and token limit for the agent. The budget is the maximum amount of time or tokens that the agent can use to generate a response. After either one of the limits is reached, the agent will stop generating a response. Token limits are used only for orchestration and don’t include tokens used by Cortex Analyst, Cortex Search, and other tools invoked.
6. Select Save.

> To update an agent using the REST API, add the following payloads as part of a request to [Update Cortex Agent](cortex-agents-rest-api.md). You can also specify these fields when creating the agent object. The following procedure shows how to update the agent with planning and response instructions, and specify the LLM model used for orchestration.

1. Update the LLM model

   > ```bash
   > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
   > --header 'Content-Type: application/json' \
   > --header 'Accept: application/json' \
   > --header "Authorization: Bearer $PAT" \
   > --data '{
   >  "models": {
   >   "orchestration": "llama3.3-70B"
   > }'
   > ```
2. Specify the planning and response instructions

   > ```bash
   > curl -X PUT "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>" \
   > --header 'Content-Type: application/json' \
   > --header 'Accept: application/json' \
   > --header "Authorization: Bearer $PAT" \
   > --data '{
   >  "instructions": {
   >   "response": "Always provide a concise response and maintain a friendly tone.",
   >   "orchestration": "<orchestration instructions>",
   >   "system": "You are a helpful data agent."
   >  }
   > }'
   > ```

You can update an agent object to add orchestration information using the ALTER AGENT command. For information about the ALTER AGENT command, see [ALTER AGENT](../../sql-reference/sql/alter-agent.md).

```sqlexample-yaml
ALTER AGENT <agent_name> MODIFY LIVE VERSION SET SPECIFICATION =
$$
models:
    orchestration: claude-4-sonnet

  orchestration:
    budget:
      seconds: 30
      tokens: 16000

  instructions:
    response: "You will respond in a friendly but concise manner"
    orchestration: "For any revenue question use Analyst; for policy use Search"
    system: "You are a friendly agent that helps with business questions"
    sample_questions:
      - question: "What was our revenue last quarter?"
        answer: "I'll analyze the revenue data using our financial database."

  tools:
    - tool_spec:
        type: "cortex_analyst_text_to_sql"
        name: "Analyst1"
        description: "Converts natural language to SQL queries for financial analysis"
    - tool_spec:
        type: "cortex_search"
        name: "Search1"
        description: "Searches company policy and documentation"
    - tool_spec:
        type: "data_to_chart"
        name: "data_to_chart"
        description: "Generates visualizations from data"

  tool_resources:
    Analyst1:
      semantic_view: "db.schema.semantic_view"
    Search1:
      name: "db.schema.service_name"
      max_results: "5"
      filter:
        "@eq":
          region: "North America"
      title_column: "<title_name>"
      id_column: "<column_name>"
$$;
```

## Set up access to the agent

> **Important:**
>
> By default, Cortex Agents uses the user’s default role and the default warehouse. If another user is using the agent, make sure that they’ve done the following:
>
> * Set a default role
> * Set a default warehouse
> * Granted USAGE on the agent to the default role
>
> For information about granting usage, see [Access control requirements](cortex-agents.md).
>
> You must use the user’s default role when calling or updating Cortex Agents. To allow another role to use the agent, grant USAGE on the agent to that role:
>
> ```sqlexample
> GRANT USAGE ON AGENT <database_name>.<schema_name>.<agent_name> TO ROLE <role_name>;
> ```

Set up access policies from Snowsight UI or using SQL so that users can access the Agent. Specify the role to provide access to the Agent.

Method 1: Snowsight UIMethod 2: SQL

1. Select Access.
2. To give a role access to the agent, select Add role, then select the role from the dropdown menu.
3. Select Save.

```sqlexample
GRANT USAGE ON AGENT myagent TO ROLE test_rl;
```

## Review the agent

After you have built the Agent, you can review the Agent to verify all parameters.

Method 1: Snowsight UIMethod 2: REST APIMethod 3: SQL

> **Note:**
>
> When reviewing agents from Snowsight, you can only view agents in the Agent Admin UI. You cannot view agents in the database object explorer.

1. In the navigation menu, select AI & ML » Agents.
2. From the list of agents, select the agent that you want to view the details for. This opens a new page that gives an overview of the agent details.
3. To review all agent details, select Next.

You can list and describe agents using the REST APIs.

1. List all agents.

   > ```bash
   > curl -X GET "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/{database}/schemas/{schema}/agents:" \
   >  --header 'Content-Type: application/json' \
   >  --header 'Accept: application/json' \
   >  --header "Authorization: Bearer $PAT" \
   > ```
2. Describe the desired agent.

   > ```bash
   > curl -X GET "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/{database}/schemas/{schema}/agents/{name}:" \
   >  --header 'Content-Type: application/json' \
   >  --header 'Accept: application/json' \
   >  --header "Authorization: Bearer $PAT" \
   > ```

You can list and describe agents using SQL.

1. List all agents.

   ```sqlexample
   SHOW AGENTS IN ACCOUNT;
   ```
2. Describe the desired agent.

   ```sqlexample
   DESCRIBE AGENT myagent;
   ```

## Test the agent

After you’ve created the agent, you can test it to see how it responds to user queries. You can also test the agent using [Agent run request with agent object](cortex-agents-run.md).

To test the agent, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select the agent from the list of agents.
4. On the agent details page, enter a query in the agent playground.
5. Verify that the agent responds to the query as expected. If the agent does not respond as expected, modify the agent’s configuration by following the steps in Add tools.

## Interact with the agent

After creating the agent object, you can integrate the agent directly into your application using the REST API. To maintain context during the interaction, use a thread. The agent object and thread combined simplify the client application code.

### Create a thread

Create a thread to maintain the context during a conversation. When the thread is created successfully, the system returns a `Thread ID`.

```bash
curl -X POST "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/cortex/threads" \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
    "origin_application": <application_name>,
}'
```

### Send a request to the agent

To interact with the Agent, you must pass the agent object, thread ID, and a unique `parent_message_id` as part of your REST API request. The initial `parent_message_id` should be `0`.

```bash
curl -X POST "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/{database}/schemas/{schema}/agents/{name}:run" \
 --header 'Content-Type: application/json' \
 --header 'Accept: application/json' \
 --header "Authorization: Bearer $PAT" \
 --data '{
     "thread_id": <thread id for context>,
     "parent_message_id": <parent message id>,
     "messages": [
      {
         "role": "user",
         "content": [
           {
            "type": "text",
             "text": "What are the projected transportation costs for the next three quarters? "
             }
         ]
       }
     ],
     "tool_choice": {
       "type": "required",
       "name": [
         "Analyst1",
         "Search1"
       ]
     }
 }'
```

## Collect feedback about the agent

You can collect feedback from users about the responses given by the agent. This feedback can help you refine the agent as you iterate on your use case. Users can provide an objective rating (postive/negative), as well as more subjective detail with a message. Also, users can classify the feedback across one of many categories.

```bash
curl -X POST "$SNOWFLAKE_ACCOUNT_BASE_URL/api/v2/databases/<database-name>/schemas/<schema-name>/agents/<agent-name>:feedback:" \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
    "request_id": "<request-id>",
    "positive": true
    "feedback_message": "This answer was great",
    "categories":[
        "category1", "category2", "category3"
    ],
    "thread_id": "<thread-id>"
}'
```

## Interact without an agent object

In some cases, you may want to get started with Cortex Agents by using `agent:run` without an agent object. For example, this may be useful when you want to quickly try out a use case. For more information about the REST API, see [Agent run without an agent object](cortex-agents-run.md).

> **Note:**
>
> When interacting with an agent without creating an agent object, you must manually maintain the context for the agent with every request.

---
title: Copy arctic-extract models between databases, schemas, and accounts
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/copy-arctic-extract-models.md
section: Snowflake Cortex (AI & ML)
---

# Copy `arctic-extract` models between databases, schemas, and accounts

This topic explains how to copy fine-tuned `arctic-extract` models between databases or schemas in the same account or between different accounts in the same organization.
For example, you might want to copy a model from a development account to a production account.

## Copy models between databases and/or schemas within an account

1. Create the model from the source model using the role that created the source model:

   > **Tip:**
   >
   > To list the versions in a model, use [SHOW VERSIONS IN MODEL](../../sql-reference/sql/show-versions-in-model.md).

   ```sqlexample
   CREATE MODEL prod_db.prod_schema.invoices_model
     WITH VERSION V1
     FROM MODEL dev_db.dev_schema.invoices_source_model
       VERSION V1;
   ```
2. Optional: Add another version of the model:

   ```sqlexample
   ALTER MODEL prod_db.prod_schema.invoices_model
     ADD VERSION V2
     FROM MODEL dev_db.dev_schema.invoices_source_model
       VERSION V2;
   ```
3. To enable the `prod_role` role to use the copied model, grant the OWNERSHIP privilege on the model to that role:

   ```sqlexample
   GRANT OWNERSHIP ON MODEL prod_db.prod_schema.invoices_model
     TO ROLE prod_role;
   ```

## Copy models between accounts

You can replicate a model from a source account to one or more target accounts in the same organization.
For more information about replication, see [Introduction to replication and failover across multiple accounts](../account-replication-intro.md).

To replicate the model from a source account to a target account, you need to create a replication group in the source account to enable replication of the database
in which the model was created to a target account, and set up the production user role.

> **Note:**
>
> You must be a user with the ACCOUNTADMIN role to create a replication group and to set up the production user role.

### Replicate the database in which the model was created

1. Create a primary replication group in the source account:

   ```sqlexample
   CREATE REPLICATION GROUP models_replication_group
   OBJECT_TYPES = DATABASES
   ALLOWED_DATABASES = dev_db
   ALLOWED_ACCOUNTS = org.production_account;
   ```
2. Create a secondary replication group in a target account as a replica of the primary replication group in the source account:

   ```sqlexample
   CREATE REPLICATION GROUP models_secondary_replication_group
   AS REPLICA OF org.dev_account.models_replication_group;
   ```
3. Refresh the database in the target account from the source account:

   ```sqlexample
   ALTER REPLICATION GROUP models_secondary_replication_group REFRESH;
   ```
4. Optional: Specify the schedule for refreshing the secondary replication group so that the account is synchronized automatically every 10 minutes:

   ```sqlexample
   ALTER REPLICATION GROUP models_secondary_replication_group
     SET REPLICATION_SCHEDULE = '10 MINUTE';
   ```

### Set up the production user role

To ensure that the user working on the target production account (for example, a user with the `prod_role` role) can use the replicated model, follow these steps:

1. Grant the USAGE privilege on the source database and schema, and ownership on all models in that schema, to the `prod_role` role:

   ```sqlexample
   GRANT USAGE ON DATABASE dev_db TO ROLE prod_role;
   GRANT USAGE ON SCHEMA dev_db.dev_schema TO ROLE prod_role;
   GRANT OWNERSHIP ON ALL MODELS IN SCHEMA dev_db.dev_schema TO ROLE prod_role;
   ```
2. Optional: Grant ownership on all the future models that will be replicated:

   ```sqlexample
   GRANT OWNERSHIP ON ALL FUTURE MODELS IN SCHEMA dev_db.dev_schema TO ROLE prod_role;
   ```

After you grant the required privileges, a user with the `prod_role` role must follow these steps:

1. Create the model from the source model:

   ```sqlexample
   CREATE MODEL prod_db.prod_schema.invoices_model
     WITH VERSION V1
     FROM MODEL dev_db.dev_schema.invoices_source_model
       VERSION V1;
   ```
2. Optional: Add another version of the model:

   ```sqlexample
   ALTER MODEL prod_db.prod_schema.invoices_model
     ADD VERSION V2
     FROM MODEL dev_db.dev_schema.invoices_source_model
       VERSION V2;
   ```

> **Note:**
>
> The model in the target schema is a separate model object from the model in the replicated database. New versions are not copied automatically;
> you must add each version using [ALTER MODEL … ADD VERSION](../../sql-reference/sql/alter-model-add-version.md).

---
title: Cortex Agent evaluations
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-evaluations.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agent evaluations

Cortex Agent evaluations allow you to monitor your agent’s behavior and performance. Evaluate your agent against both ground truth and reference-free evaluation metrics. During evaluation, your agent’s activity is traced and monitored so you can ensure that each step in the process advances towards your end goal.

Snowflake offers the following metrics to evaluate your agent against:

* **Answer correctness** – How closely the answer from an agent to your prepared query matches an expected answer. This metric is most useful when the dataset powering your Cortex Agent is static.
* **Logical consistency** – Measures consistency across agent instructions, planning, and tool calls. This metric is *reference-free*, meaning you don’t need to prepare any information in your dataset for evaluation.

Snowflake also allows you to create custom evaluation metrics that use the LLM judging process to measure context critical to your Agent’s domain and use case. Custom metrics use an LLM prompt and scoring methodology, which are passed to the evaluation judging system to produce a score.

For additional details about how agent evaluations are conducted on Snowflake, including the LLM judging system used for reference-free evaluations, see the Snowflake engineering blog [What’s Your Agent’s GPA? A Framework for Evaluating AI Agent Reliability](https://www.snowflake.com/en/engineering-blog/ai-agent-evaluation-gpa-framework/). For an example of running an Agent Evaluation programmatically, see the guide [Getting Started with Cortex Agent Evaluations](https://www.snowflake.com/en/developers/guides/getting-started-with-cortex-agent-evaluations/).

## Access control requirements

The ability to run a Cortex Agent evaluation requires a role with the following:

* The DATABASE ROLE SNOWFLAKE.CORTEX_USER role
* The EXECUTE TASK ON ACCOUNT permission
* The USAGE permission on the database containing your agent
* The following permissions on the schema containing your agent:

  + USAGE
  + CREATE FILE FORMAT ON SCHEMA
  + CREATE TASK
  + EXECUTE TASK
* The USAGE permission on the database containing your evaluation data
* The following permissions on the schema containing your evaluation data:

  + USAGE
  + EXECUTE TASK
  + If creating a dataset from an input table, CREATE DATASET ON SCHEMA
* The USAGE or OWNERSHIP privilege on your agent
* The MONITOR or OWNERSHIP privilege on your agent
* If using an agent evaluation configuration, READ privilege on the stage containing the configuration file.

If the agent being evaluated uses tools, your role also needs access to all of them.

Additionally, if working with evaluations in Snowsight, the role you use to run or an inspect an evaluation needs the USAGE privilege on your default warehouse.

## Prepare an evaluation dataset

Before starting a Cortex Agent evaluation, prepare a table containing your evaluation inputs. This table is used to create a dataset for your evaluation to run against. To learn more about datasets on Snowflake, see [Snowflake Datasets](../../developer-guide/snowflake-ml/dataset.md).

### Cortex Code

To have [Cortex Code](../cortex-code/cortex-code.md) assist you with creating a dataset for your evaluation, use the `dataset-curation` sub-skill of the Cortex Code `cortex-agent` skill. For more information about Cortex Code skills, see [Cortex Code CLI - Skills](../cortex-code/extensibility.md).

### Dataset format

The table used to create a dataset for evaluation has an input query column of type VARCHAR that represents your query, and an output column of type VARIANT that contains a description of expected agent behavior. This single output column is used as the ground truth by the LLM judge.

Values in the output column have one key, `ground_truth_output`. The value of this key is used in answer correctness evaluation. LLM judges use ground truth to evaluate your agent’s output by including it in their prompt.

> **Tip:**
>
> Take advantage of the fact that ground truth is included in an LLM prompt by using natural language to describe a *type* of response, in addition to exact or semantic response matches. For example, you could provide a ground truth of `Output is in the following JSON format ...` followed by a string containing either a description of the structure or a JSON example itself. If you need a more rigorous examination of output based on a full custom prompt, create a custom metric.

To bring a JSON dataset into a Snowflake table, use the [PARSE_JSON](../../sql-reference/functions/parse_json.md) SQL function. The following example creates a new table `agent_evaluation_data` to use for an evaluation dataset, and inserts a row for the input query `What was the temperature in San Francisco on August 2nd 2019?` with the ground truth of `The temperature was 14 degrees Celsius in San Francisco on August 2nd, 2019.`.

```sqlexample
CREATE OR REPLACE TABLE agent_evaluation_data (
    input_query VARCHAR
);

INSERT INTO agent_evaluation_data
  SELECT
    'What was the temperature in San Francisco on August 2nd 2019?',
    PARSE_JSON('
      {
        "ground_truth_output": "The temperature was 14 degrees Celsius in San Francisco on August 2nd, 2019.",
      }
    ');
```

> **Important:**
>
> The functions [OBJECT_CONSTRUCT](../../sql-reference/functions/object_construct.md) and [ARRAY_CONSTRUCT](../../sql-reference/functions/array_construct.md) return non-VARIANT results. Use a function that produces a VAIRANT from your raw input like [PARSE_JSON](../../sql-reference/functions/parse_json.md), or call [TO_VARIANT](../../sql-reference/functions/to_variant.md) to guarantee the value type.

Data you provide in the `ground_truth` column that isn’t used by a selected metric is ignored. When conducting an evaluation run with only reference-free metrics, you can leave the output column empty.

When running your first evaluation, you’ll have the option to create a new dataset from an existing table.

## Start an agent evaluation

### Cortex Code

To have [Cortex Code](../cortex-code/cortex-code.md) run an evaluation, use the `evaluate-cortex-agent` sub-skill of the Cortex Code `cortex-agent` skill. For more information about Cortex Code skills, see [Cortex Code CLI - Skills](../cortex-code/extensibility.md).

### Snowsight

> **Note:**
>
> Agent evaluations run as your currently selected role in Snowsight, not your default role. Make sure a role with the correct permissions is active before starting an evaluation.

Begin your evaluation of a Cortex Agent by doing the following:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select the agent you want to conduct an evaluation of.
4. Select the Evaluations tab.
5. Select New evaluation run.

   > The New evaluation run modal opens.
6. In the Name field, provide a name for your evaluation. This name should be unique for the agent being evaluated.
7. Optional: In the Description field, provide any comments for the evaluation.
8. Select Next.

   > This advances to the Select dataset modal.
9. Select the dataset used to evaluate your agent. You can choose either Existing dataset or Create new dataset.

   > To use an existing dataset:
   >
   > 1. From the Database and schema list, select the database and schema containing your dataset.
   > 2. From the Select dataset list, select your dataset.
   >
   > To create a new dataset:
   >
   > 1. From the Source table - Database and schema list, select the database and schema containing the table you want to import to a dataset.
   > 2. From the Select source table list, select your source table.
   > 3. From the New dataset location - Database and schema list, select the database and schema to place your new dataset.
   > 4. In the Dataset name field, enter your dataset name. This name needs to be unique among the schema-level objects in your selected schema.
10. Select Next.

    > This advances to the Select metrics modal.
11. From the Input query list, select the column of your dataset which contains the input queries.
12. For each of the System metrics, change the toggle to active for any metric you want included in your evaluation. Select the column of your dataset containing the ground truth for your evaluation.
13. (Optional) To conduct a custom evaluation, toggle on Custom metrics.

    > 1. Select the database and schema containing the stage where your custom evaluation configuration is stored.
    > 2. Select the stage where your custom evaluation configuration is stored.
    > 3. Select the YAML configuration file for your custom evaluation.
    >
    >    > > **Note:**
    >    > >
    >    > > In Snowsight, only the custom evaluation definitions are loaded from your YAML configuration. The rest of the YAML file must still be valid. For the evaluation YAML specification, see Agent Evaluation YAML specification.
    > 4. For each custom metric, change the toggle to active if you want it included in your evaluation. Select the column of your dataset containing the ground truth for this evaluation.
14. Select Create to create the evaluation and begin the evaluation process.

At any point, you can select Cancel to cancel creating the evaluation, or select Prev to return to the previous modal.

### SQL

To start or retrieve information on an evaluation with SQL, use the [EXECUTE_AI_EVALUATION](../../sql-reference/functions/execute_ai_evaluation.md) function. This function has the following required arguments:

* `evaluation_job`: A string value of ‘START’ or ‘STATUS’.
* `run_parameters`: A SQL [OBJECT](../../sql-reference/data-types-semistructured.md) containing the key `run_name`, with a value of the name of your run.
* `config_file_path:` A stage file path pointing to your run configuration YAML file. This path can’t be a signed URL. For the evaluation YAML specification, see Agent Evaluation YAML specification.

Use the `evaluation_job` value ‘START’ to start an evaluation. The following example starts a run called `run-1` using the agent evaluation configuration from `@eval_db.eval_schema.metrics/agent_evaluation_config.yaml`:

```sqlexample
CALL EXECUTE_AI_EVALUATION(
  'START',
  OBJECT_CONSTRUCT('run_name', 'run-1'),
  '@eval_db.eval_schema.metrics/agent_evaluation_config.yaml'
);
```

After a run starts, you can query its progress with the `evaluation_job` value ‘STATUS’. This call returns a table in the format used for [AI Observability Runs](ai-observability/reference.md). The following example queries the status of the agent evaluation started from the previous example:

```sqlexample
CALL EXECUTE_AI_EVALUATION(
  'STATUS',
  OBJECT_CONSTRUCT('run_name', 'run-1'),
  '@eval_db.eval_schema.metrics/agent_evaluation_config.yaml'
);
```

> **Tip:**
>
> You can call the EXECUTE_AI_EVALUATION function from a [Task](../tasks-intro.md) to regularly run an evaluation or check the status of one.

## Inspect evaluation results

Evaluation results include information about the requested metrics, details of the agent’s threads of reasoning, and information about the LLM planning stage for each executed trace in the thread.

### Cortex Code

Cortex Code offers two sub-skills of the `cortex-agent` skill. Use the `investigate-cortex-agent-evals` sub-skill to inspect evaluations and find any issues in your configuration or data. Use the `optimize-cortex-agent` sub-skill to take results from completed evaluations and improve the performance of your agent.

### Snowsight

The Evaluations tab for an agent in Snowsight gives you an overview of every evaluation run and its summary results.

To view evaluation results in Snowsight:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select the agent you want to conduct an evaluation of.
4. Select the Evaluations tab.

#### Evaluation runs listing

The summary of run information for each run includes:

* `RUN NAME` – The name of the evaluation run.
* `# OF RECORDS` – The number of queries performed and answered as part of the run.
* `STATUS` – The status of the evaluation run, which is one of:

  > + – All inputs were evaluated and results are available.
  > + **A spinner is displayed** – The run is in progress, with no information available yet.
  > + – The run experienced an error at some point. Some or all metrics may be unavailable for the run.
* `DATASET` – The name of the dataset used for the evaluation.
* `AVG DURATION` – The average duration of time taken to execute an input query for the run.
* `LOGICAL CONSISTENCY` – Average over all inputs of the logical consistency evaluation for the run, if requested.
* `DESCRIPTION` – The description of the evaluation run.
* `CREATED` – The time at which the run was created and started.

Each custom metric evaluated for this run also receives its own column, defined by the evaluation metric `name` value. For more information on custom metrics, see Defining a custom metric.

#### Evaluation run overview

When you select an individual run in Snowsight, you’re presented with the run overview. This overview includes summary averages for each metric evaluated during the run, and a summary of each input execution. The overview for each input execution includes:

* `STATUS` – The status of the evaluation run, which is one of:

  > + – All inputs were evaluated and results are available.
  > + **A spinner is displayed** – The run is in progress, with no information available yet.
  > + – The run experienced an error at some point. Some or all metrics may be unavailable for the run.
* `INPUT` – The input query used for the evaluation.
* `OUTPUT` – The output produced by the agent.
* `DURATION` – The length of time taken to process the input and produce output.
* `LOGICAL CONSISTENCY` – The logical consistency evaluation for the input, if requested.
* `EVALUATED` – The time at which the input was processed.

Each custom metric evaluated for this run also receives its own column, defined by the evaluation metric `name` value. For more information about custom metrics, see Defining a custom metric.

#### Record details

When you select an individual input in Snowsight, you’re presented with the Record details view. This view includes three panes: Evaluation results, Thread details, and Trace details.

##### Evaluation results

Your evaluation results are presented here in detail. Each metric has its own presentation box of overall average across inputs, which can be selected to display a popover containing more information. This popover contains a breakdown of the number of runs which performed at high accuracy (80% or more accurate), medium accuracy (30% or more accurate, but not high accuracy), and which failed.

##### Thread details

The information logged during the execution of each agent thread. This includes planning and response generation by default, as well as a thread trace for each tool that the agent invoked during that thread.

##### Trace details

Each trace pane includes input, processing, and output information relevant to that stage of agent execution. This information is the same as that provided by [agent monitoring](cortex-agents-monitor.md).

### SQL

To retrieve raw evaluation details, use the [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](../../sql-reference/functions/get_ai_evaluation_data-snowflake-local.md) function. This function has the following required arguments:

* `database`: The database containing the agent.
* `schema`: The schema containing the agent.
* `agent_name`: The name of the agent.
* `agent_type`: The string constant ‘CORTEX AGENT’. This value is case-insensitive.
* `run_name`: The name of the evaluation run to retrieve.

This function returns a table of event data described in Evaluation results table format. The following example displays the full evaluation details for a run called `run-1`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_EVALUATION_DATA(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT',
  'run-1')
);
```

### Query traces for a single record

To access a single record from an evaluation trace, use the [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](../../sql-reference/functions/get_ai_record_trace-snowflake-local.md) function. This function has the following required arguments:

* `database`: The database containing the agent.
* `schema`: The schema containing the agent.
* `agent_name`: The name of the agent.
* `agent_type`: The string constant ‘CORTEX AGENT’. This value is case-insensitive.
* `record_id`: The record ID to filter by.

This function returns a table of event data described in Evaluation results table format. The following example displays the trace for the record `9346efc3-5dd6-4038-9b1a-72ca3d3b768c`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_RECORD_TRACE(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT',
  '9346efc3-5dd6-4038-9b1a-72ca3d3b768c'
));
```

### Query evaluation errors and warnings for a run

To access logs for warnings and errors that happened during an evaluation run, use the [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](../../sql-reference/functions/get_ai_observability_logs-snowflake-local.md) function. This function has the following required arguments:

* `database`: The database containing the agent.
* `schema`: The schema containing the agent.
* `agent_name`: The name of the agent.
* `agent_type`: The string constant ‘CORTEX AGENT’. This value is case-insensitive.

This function returns a table of event data described in Evaluation results table format. The following example checks for errors and warnings for a run called `run-1`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_OBSERVABILITY_LOGS(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT')
)
  WHERE TRUE
  AND (record:"severity_text"='ERROR' or record:"severity_text"='WARN')
  AND record_attributes:"snow.ai.observability.run.name"='run-1';
```

> **Note:**
>
> The fields of `record` and `record_attributes` are subject to change, but the fields `record:"severity_text"` and `record_attributes:"snow.ai.observability.run.name"` are guaranteed to be present in AI Observability logs.

## Agent Evaluation YAML specification

To define the YAML file to configure an Agent Evaluation, including defining custom metrics, there are three top-level keys:

* (Optional) `dataset`: A definition of how to create a dataset for the evaluation. This value is optional when using a YAML specification to start an evaluation in Snowsight, or when using an existing dataset.
* `evaluation`: Settings for the agent to be evaluated.
* `metrics`: The metrics recorded during an evaluation run, including definitions for custom metrics.

### Dataset definition

The `dataset` value defines a new dataset from existing table data, mapping columns for the input query and ground truth. For the structure required for your `ground_truth` column, see Dataset format. The keys for the `dataset` value are:

* `dataset_type`: The string constant “CORTEX AGENT”. This value is case-insensitive.
* `table_name`: The fully qualified name of the table to use for the dataset’s contents.
* `dataset_name`: The name of the created dataset.
* `column_mapping`: The mapping of the required evaluation input column `query_text` and output column `ground_truth` to columns of the table to create the dataset from.

The resulting dataset is stored in the same database and schema as the table it’s constructed from.

The following example dataset definition shows a dataset named `evaluation_input` created from the `evals_db.evals_schema.evaluation_data` table, using the `user_question` as input and `expected_outcome` to define ground truth:

```yaml
dataset:
 dataset_type: "CORTEX AGENT"
 table_name: "evals_db.evals_schema.evaluation_data"
 dataset_name: "evaluation_input"
 column_mapping:
   query_text: "user_question"
   ground_truth: "expected_outcome"
```

### Agent configuration

The `evaluation` value sets the configuration for the agent to conduct an evaluation against. The keys for the `evaluation` value are:

* `agent_params`: A dictionary describing the agent to conduct the evaluation for. This value uses the keys:

  > + `agent_name`: The name of the agent to evaluate.
  > + `agent_type`: The string constant “CORTEX AGENT”. This value is case-insensitive.
* (Optional) `run_params`: Metadata for identifying this evaluation run. This value uses the keys:

  > + (Optional) `label`: The label for this evaluation.
  > + (Optional) `description`: A detailed description of the evaluation.
* `source_metadata`: A dictionary describing the dataset used for the evaluation. This value uses the keys:

  > + `type`: The string constant “DATASET”. This value is case-insensitive.
  > + `dataset_name`: The name of the dataset to use.

The following example agent configuration runs an agent named `evaluated_agent` with the label `Basic evaluation`, using the dataset `evaluation_input`:

```yaml
evaluation:
 agent_params:
   agent_name: "evaluated_agent"
   agent_type: "CORTEX AGENT"
  run_params:
   label: "Basic evaluation"
  source_metadata:
   type: "DATASET"
   dataset_name: "evaluation_input"
```

### Metrics selection

The `metrics` value is a sequence of metrics to evaluate, including your own custom metric definitions. The accepted values for pre-defined metrics are:

* `answer_correctness`: Measure the agent’s response correctness against a ground truth output.
* `logical_consistency`: Measure consistency across agent instructions, planning, and tool calls. This metric is *reference-free* and doesn’t use a dataset.

#### Defining a custom metric

You can define your own custom metric by providing an identifier, prompt, and score ranges. The prompt you provide is passed to an LLM judge along with run traces to conduct your custom evaluation. Custom metrics have the following required key-value pairs:

* `name`: The name of the metric.
* `score_ranges`: A mapping that defines low, medium, and high-quality score ranges. This mapping uses the keys:

  > + `min_score`: The score range used to identify low-quality results, as a two-element sequence of the inclusive lower bound to exclusive upper bound.
  > + `median_score`: The score range used to identify medium-quality results, as a two-element sequence of the inclusive lower bound to inclusive upper bound.
  > + `max_score`: The score range used to identify high-quality results, as a two-element sequence of the exclusive lower bound to inclusive upper bound.
* `prompt`: The prompt template to pass to the LLM judge along with the agent run trace data.

  > > **Important:**
  > >
  > > This template must include a scoring mechanism which produces a numeric value represented in the ranges provided for `score_ranges`.

A custom metric’s prompt is able to reference the trace data generated by the agent during an evaluation run. Snowflake passes the entire trace as input to the LLM judge, but you can emphasize certain information by using a replacement string that references data in a GET_AI_RECORD_TRACE column directly. The following replacement strings are available:

| Replacement string | GET_AI_RECORD_TRACE column |
| --- | --- |
| `{{input}}` | INPUT |
| `{{output}}` | OUTPUT |
| `{{ground_truth}}` | GROUND_TRUTH |
| `{{tool_info}}` | TOOL |
| `{{start_timestamp}}` | START_TIMESTAMP |
| `{{duration}}` | DURATION_MS |
| `{{span_id}}` | SPAN_ID |
| `{{span_type}}` | SPAN_TYPE |
| `{{span_name}}` | SPAN_NAME |
| `{{llm_model}}` | LLM_MODEL |
| `{{error}}` | ERROR |
| `{{status}}` | STATUS |

#### Metrics configuration example

The following example defines a metrics configuration that enables answer correctness and logical consistency checks, and also defines a custom `relevance` metric which returns a score between 1-10 based on how ground truth compares against agent output:

```yaml
metrics:
  # Built-in metrics
  - "answer_correctness"
  - "logical_consistency"
  # Custom metric with prompt
  - name: "relevance"
    score_ranges:
      min_score: [1, 3]
      median_score: [4, 6]
      max_score: [7, 10]
    prompt: |
      Evaluate the relevance of the agent's response to the user's query.
      Rate from 1-10 where:
      1 = Completely irrelevant
      4 = Somewhat irrelevant
      6 = Neutral
      8 = Mostly relevant
      10 = Highly relevant and on-topic

      You can compare the {{output}} with the {{ground_truth}} to help you understand if the contents are relevant or not

      Consider:
      - Does the response address the user's question?
      - Is the information provided appropriate to the context?
      - Are there any tangential or off-topic elements?
```

### Full example configuration

Combining all of the previous example sections gives a full Agent Evaluation configuration:

```yaml
# Optional: Create dataset before running evaluation
dataset:
  dataset_type: "CORTEX AGENT"
  table_name: "EVALS_DB.EVALS_SCHEMA.EVALUATION_DATA"
  dataset_name: "EVALUATION_INPUT"
  column_mapping:
    query_text: "user_question"
    ground_truth: "expected_outcome"

# Evaluation task configuration
evaluation:
 agent_params:
   agent_name: "evaluated_agent"
   agent_type: "CORTEX AGENT"
  run_params:
   label: "Basic evaluation"
  source_metadata:
   type: "DATASET"
   dataset_name: "EVALUATION_INPUT"

  # Built-in metrics (simple strings)
  - "answer_correctness"
  - "logical_consistency"

  # Custom metric definition
  - name: "relevance"
    score_ranges:
      min_score: [1, 3]
      median_score: [4, 6]
      max_score: [7, 10]
    prompt: |
      Evaluate the relevance of the agent's response to the user's query.
      Rate from 1-10 where:
      1 = Completely irrelevant
      4 = Somewhat irrelevant
      6 = Neutral
      8 = Mostly relevant
      10 = Highly relevant and on-topic

      You can compare the {{output}} with the {{ground_truth}} to help you understand if the contents are relevant or not

      Consider:
      - Does the response address the user's question?
      - Is the information provided appropriate to the context?
      - Are there any tangential or off-topic elements?
```

### Upload configuration to a stage

Agent Evaluation configurations are required to have a specific file format for Snowflake to parse them. The following snippet demonstrates creating the required `yaml_file_format` on the schema `evals_db.evals_schema`, then creates the stage `evaluation_config` to upload an agent configuration to:

```sqlexample
CREATE OR REPLACE FILE FORMAT evals_db.evals_schema.yaml_file_format
  TYPE = 'CSV'
  FIELD_DELIMITER = NONE
  RECORD_DELIMITER = '\n'
  SKIP_HEADER = 0
  FIELD_OPTIONALLY_ENCLOSED_BY = NONE
  ESCAPE_UNENCLOSED_FIELD = NONE;

CREATE OR REPLACE STAGE evals_db.evals_schema.evaluation_config
  FILE_FORMAT = evals_db.evals_schema.yaml_file_format;
```

Upload your configuration to a created stage through Snowsight by navigating to In the navigation menu, select Ingestion » Add Data and selecting Load files into a Stage. You can also use the SQL [PUT](../../sql-reference/sql/put.md) command to upload a local YAML file. The following example demonstrates copying the local file `/Users/dev/evaluation_config.yaml` to the stage `evals_db.evals_schema.evaluation_config`:

```sqlexample
PUT file:///Users/dev/evaluation_config.yaml @evals_db.evals_schema.evaluation_config
  AUTO_COMPRESS='false'
  OVERWRITE=TRUE;
```

If you create your YAML in a [Workspace](../ui-snowsight/workspaces.md), you can copy it from your active workspace to a stage. The following example copies the file `evaluation_config.yaml` from your workspace to the stage `evals_db.evals_schema.evaluation_config`:

```sqlexample
COPY FILES INTO @evals_db.evals_schema.evaluation_config
  FROM 'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live'
  FILES=('custom_metric_config.yaml');
```

> **Tip:**
>
> Snowflake recommends keeping your YAML file uncompressed.

## Evaluation results table format

Functions which return information about a Cortex Agent evaluation all produce a table with the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| RECORD_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation record. |
| INPUT_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation input. |
| REQUEST_ID | VARCHAR | The unique identifier assigned by Snowflake for this request. |
| TIMESTAMP | TIMESTAMP_TZ | The time (in UTC) at which the request was made. |
| DURATION_MS | INT | The amount of time, in milliseconds, that it took for the agent to return a response. |
| INPUT | VARCHAR | The query string used as input for this evaluation record. |
| OUTPUT | VARCHAR | The response returned by the Cortex Agent for this evaluation record. |
| ERROR | VARCHAR | Information about any errors that occurred during the request. |
| GROUND_TRUTH | VARCHAR | The ground truth information used to evaluate this record’s Cortex Agent output. |
| METRIC_NAME | VARCHAR | The name of the metric evaluated for this record. |
| EVAL_AGG_SCORE | NUMBER | The evaluation score assigned for this record. |
| METRIC_TYPE | VARCHAR | The type of metric being evaluated. For built-in metrics, the value is `system`. For custom metrics, the value is `custom`. |
| METRIC_STATUS | VARIANT | A map containing information about the agent’s HTTP response for this record, with the following keys:  * `status`: The HTTP status code of the response. * `message`: The HTTP message sent in the status response. |
| METRIC_CALLS | ARRAY | An array of VARIANT values that contain information about the computed metric. Each array entry contains the metric’s criteria, an explanation of the metric score, and metadata. The keys of each entry are:  * `criteria`: The criteria used by an LLM judge to evaluate response correctness. * `explanation`: An explanation of why the score was assigned. * `full_metadata`: A VARIANT value that contains metadata and information about this metric’s processing by the LLM judge. The keys of this map include:  + `completion_tokens`: The number of output tokens generated by the LLM for this metric evaluation call.   + `guard_tokens`: The number of tokens consumed by Cortex Guard for this metric evaluation call.   + `normalized_score`: The original evaluation score normalized to the range [0.0, 1.0], rounded to two decimal places.   + `original_score`: The original score assigned by this metric evaluation for the record.   + `prompt_tokens`: The number of tokens taken up by the prompt provided to the LLM judge.   + `total_tokens`: The total number of tokens used by the LLM judge for this computation. |
| TOTAL_INPUT_TOKENS | INT | The total number of tokens used to process the input query. |
| TOTAL_OUTPUT_TOKENS | INT | The total number of output tokens produced by the Cortex Agent. |
| LLM_CALL_COUNT | INT | Counts the number of times any LLM was called, either by the agent or an evaluation judge. |

## Model availability

Agent Evaluations currently only supports the `claude-4-sonnet` and `claude-3-5-sonnet` models, using cross-region inference. Snowflake automatically chooses from these models based on your account settings.

| Model | Cross Cloud (Any Region) | AWS US | AWS US Commercial Gov | AWS EU | AWS APJ |
| --- | --- | --- | --- | --- | --- |
| `claude-4-sonnet` | ✔ | ✔ | ✔ | ✔ | ✔ |
| `claude-3.5-sonnet` | ✔ | ✔ |  |  |  |

## Known limitations

Cortex Agent evaluations are subject to the following limitations:

* **Agent response times and throughput**: The number of inputs that can be processed during an evaluation is constrained by agent response times and the amount of trace detail. If you experience timeouts or long delays in your evaluation, split your evaluation data. For example, if you have queries which are guaranteed to invoke many different tools, you can partition data by common tool invocation. If you have a custom evaluation that results in timeouts, refine or shorten your prompt. You may also want to consider splitting custom evaluations to only focus on one specific element of your agent’s output.
* **Ground truth staleness**: Depending on how you word your input queries, results may drift over time and result in less accurate evaluation results. In particular you should try and scope input queries to specific, absolute dates and times. As an example, both of the input queries `What was our revenue?` and `What was our revenue for the first quarter?` will experience drift, while the query `What was our revenue between January and March of 2025?` is scoped to a specific window of time that can be consistently referenced in the evaluation data.

## Cost Considerations

Agent Evaluations run a Cortex Agent to create output for evaluation, and LLM judges to compute the evaluation metrics. You’re charged for each run of the agent against a ground truth query. The evaluation’s LLM judges are run by the [AI_COMPLETE](../../sql-reference/functions/ai_complete.md) function, and you incur charges based on the model Snowflake selects for judging. Additionally, you’re charged for the following:

* Warehouse charges for tasks used to manage evaluation runs
* Warehouse charges for queries used to compute evaluation metrics
* Storage charges for datasets and evaluation results
* Warehouse charges to retrieve evaluation results viewed in Snowsight

For more information on estimating costs, see [Understanding overall cost](../cost-understanding-overall.md). Refer to the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for full cost information.

---
title: Cortex Agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agents

> Get started with Cortex Agents
>
> [Try it in Snowsight](https://app.snowflake.com/_deeplink/#/agents?utm_source=docs&utm_medium=growth&utm_campaign=-us-en-all&utm_content=-app-user-guide-snowflake-cortex-cortex-agents)

## Overview

Cortex Agents orchestrate across both structured and unstructured data sources to deliver insights. They plan tasks, use tools to execute these tasks, and generate responses. Agents use Cortex Analyst (structured) and Cortex Search (unstructured) as tools, along with LLMs, to analyze data. Cortex Search extracts insights from unstructured sources, while Cortex Analyst generates SQL to process structured data. In addition, you can use stored procedures and user defined functions (UDFs) to implement custom tools. A comprehensive support for tool identification and tool execution enables delivery of sophisticated applications grounded in enterprise data.

The workflow involves four key components:

1. **Planning**: Applications often switch between processing data from structured and unstructured sources. For example, consider a conversational app designed to answer user queries. A business user may first ask for top distributors by revenue (structured) and then switch to inquiring about a contract (unstructured). Cortex Agents can parse a request to orchestrate a plan and arrive at the solution or response.

   1. **Explore options**: When the user poses an ambiguous question (for example, “Tell me about Acme Supplies”), the agent considers different permutations - products, location, or sales personnel - to disambiguate and improve accuracy.
   2. **Split into subtasks**: Cortex Agents can split a task or request (for example, “What are the differences between contract terms for Acme Supplies and Acme Stationery?”) into multiple parts for a more precise response.
   3. **Route across tools**: The agent selects the right tool - Cortex Analyst or Cortex Search - to ensure governed access and compliance with enterprise policies.
2. **Tool use**: With a plan in place, the agent retrieves data efficiently. Cortex Search extracts insights from unstructured sources, while Cortex Analyst generates SQL to process structured data. A comprehensive support for tool identification and tool execution enables delivery of sophisticated applications grounded in enterprise data.
3. **Reflection**: After each tool use, the agent evaluates results to determine the next steps - asking for clarification, iterating, or generating a final response. This orchestration allows it to handle complex data queries while ensuring accuracy and compliance within Snowflake’s secure perimeter.
4. **Monitor, evaluate, and iterate**: After deployment, you can track metrics, analyze performance, perform evaluations, and refine behavior for continuous improvements. By monitoring and refining your agent, you can continuously improve performance and response accuracy.

For tutorials to help you get started, see [Cortex Agents tutorials](cortex-agents-tutorials.md).

> **Note:**
>
> While Snowflake strives to provide high quality responses, the accuracy of the LLM responses or
> the citations provided are not guaranteed. You should review all answers from the Agents API before serving them to your users.

## Access control requirements

To make a request to Cortex Agent via agent:run API, you can use a role that has the
SNOWFLAKE.CORTEX_USER or SNOWFLAKE.CORTEX_AGENT_USER role granted. The CORTEX_USER provides
access to all Covered AI features including Cortex Agents whereas CORTEX_AGENT_USER provides access to
the Agents feature.

> **Note:**
>
> You must use the user’s default role when calling or updating Cortex Agents. To allow another role to edit the agent, grant USAGE on the database, schema, and agent to that role.
>
> ```sqlexample
> GRANT USAGE ON DATABASE <database_name> to ROLE <role_name>;
> GRANT USAGE ON SCHEMA <database_name>.<schema_name> to ROLE <role_name>;
> GRANT USAGE ON AGENT <database_name>.<schema_name>.<agent_name> to ROLE <role_name>;
> ```

To use Cortex Agents with a semantic model, you also need the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE AGENT | Schema | Required to create the Cortex Agent. |
| USAGE | Cortex Search service | Required to run the Cortex Search services in the Cortex Agents request. |
| USAGE | Database, schema, table | Required for access the objects referenced in the Cortex Agents semantic model. |
| OWNERSHIP | Agent | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). In a managed access schema, only the schema owner (for example. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| MODIFY | Agent | Required to update the Cortex Agent. |
| MONITOR | Agent | Required to view threads, logs, and traces of the Cortex Agent. |
| USAGE | Agent | Required to query the Cortex Agent to generate responses. |

Requests to the Cortex Agents API must include an authorization token. For details on how to authenticate to
the API, see [Authenticating Snowflake REST APIs with Snowflake](../../developer-guide/snowflake-rest-api/authentication.md). Note that the example in this topic uses a
session token to authenticate to a Snowflake account.

**Limiting access to specific roles**

By default, the CORTEX_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted to all
users and roles. If you don’t want all users to have this privilege, you can revoke access to the PUBLIC role and
grant access to specific roles. For more information, see [Cortex LLM privileges](aisql.md).

To provide selective access to Cortex Agents so that only a subset of users have access to the
feature, use the CORTEX_AGENTS_USER role.

**Limiting access using the Cortex Agents user role**

To provide selective access to Cortex Agents for specific users, use the SNOWFLAKE.CORTEX_AGENT_USER database role.
This role includes the privileges needed to call the Cortex Agent API.

> **Important:**
>
> If your user roles have the CORTEX_USER role, you must revoke access to the CORTEX_USER role.
> To revoke the CORTEX_USER database role from your user roles, run the following command using the
> ACCOUNTADMIN role:
>
> ```sqlexample
> REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER FROM ROLE agent;
> ```

To provide access to Cortex Agents, use the ACCOUNTADMIN role to do the following:

1. Grant the SNOWFLAKE.CORTEX_AGENT_USER database role to a custom role.
2. Assign this custom role to users.

> **Note:**
>
> You can’t grant database roles directly to users. For more information, see [GRANT DATABASE ROLE](../../sql-reference/sql/grant-database-role.md).

The following example:

1. Creates the custom role, `cortex_agent_user_role`.
2. Grants it the CORTEX_AGENT_USER database role.
3. Assigns this role to `example_user`.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE cortex_agent_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_AGENT_USER TO ROLE cortex_agent_user_role;

GRANT ROLE cortex_agent_user_role TO USER example_user;
```

You can also grant access to Cortex Agents through existing roles. For example, if you have an `agent` role
used by agents in your organization, you can grant access with a single GRANT statement:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_AGENT_USER TO ROLE agent;
```

## Authentication

Snowflake REST APIs support authentication via programmatic access tokens (PATs),
key pair authentication using JSON Web Tokens (JWTs), and OAuth.
For details, see [Authenticating Snowflake REST APIs with Snowflake](../../developer-guide/snowflake-rest-api/authentication.md).

> **Important:**
>
> Cortex Agents uses models that might not be available in all regions. To access these models, you will have to enable cross-region inference, if feasible. For more information, see [Regional availability](aisql.md).

> **Important:**
>
> Cortex Agent APIs are not supported from within a Streamlit in Snowflake (SiS) application using a warehouse runtime.
> To call Cortex Agent APIs from a SiS app, use a container runtime instead. For more information, see
> [Runtime environments for Streamlit apps](../../developer-guide/streamlit/app-development/runtime-environments.md).

## Cost considerations

> Cortex Agents incur charges for the orchestration and use of tools.
>
> * The orchestration usage is charged based on the tokens used.
> * Cortex Analyst is charged per token.
> * Cortex Search charges depend on the size of the index and the time it has persisted.
> * Warehouse charges depend on the size of the warehouse and how long it runs.

For more information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). Also, use of custom tools may incur [warehouse costs](../cost-understanding-compute.md).

## Models

You can use the following models with Cortex Agents. If the model is not available in the local region, you must use cross-region inference.

When creating an agent, we recommend selecting auto for the model. With this option, Cortex automatically selects the highest quality model for your account, and the quality automatically improves as new models become available.

* `auto`
* `claude-haiku-4-5`
* `claude-sonnet-4-5`
* `claude-4-6-sonnet`
* `claude-4-sonnet`
* `openai-gpt-4-1`

The following tables show the models that are available for each region:

Cross-region and Cross-cloud

| Model | Cross-cloud  (Any region) | AWS US  (Cross-region) | AWS EU  (Cross-region) | AWS APJ  (Cross-region) | Azure US  (Cross-region) |
| --- | --- | --- | --- | --- | --- |
| `claude-haiku-4-5` | \* | \* |  |  |  |
| `claude-sonnet-4-5` | ✔ | ✔ | ✔ |  |  |
| `claude-4-sonnet` | ✔ | ✔ | ✔ | ✔ |  |
| `claude-4-6-sonnet` | ✔ | ✔ |  |  |  |
| `openai-gpt-4.1` | ✔ |  |  |  | ✔ |

**\*** Indicates a preview function or model. Preview features are not suitable for production workloads.

## Cortex Agent Concepts

Cortex Agents use Cortex Analyst, Cortex Search and custom tools to plan tasks and generate responses. You can influence the orchestration with instructions. You can also specify attributes to dynamically select a tool based on business logic.

During an interaction, Agents use a thread to maintain context. A thread provides an easy retrieval of the entire conversation context for use in application logic.

You can collect feedback from end-users as you continuously iterate and refine the Agent. An explicit feedback mechanism (positive/negative rating) coupled with subjective feedback (text) allows you to capture user inputs throughout the lifecycle of the Agent.

### Agent object

The agent configuration includes all metadata, orchestration settings, and tool details that are stored in the agent object. You can use the agent object to interact with the agent.

### Threads

Threads persist the context of your interactions with the agent, so you don’t have to maintain context on the client application. To use threads, you create a thread object and reference the thread ID in the agent interactions.

### Orchestration

Cortex Agents use LLM-based orchestration to plan tasks and generate responses. You can control the orchestration with the following settings:

#### Models

For information about the models you can use with Cortex Agents for orchestration, see Models.

#### Instructions

Response instructions allow you to configure the agent responses to a brand and tone of your preference.

#### Sample questions

You can use these questions to seed the conversation in your client application. These are common questions that can get users started with the interaction.

### Tools

Cortex Agents can orchestrate across both structured and unstructured data. Also, custom tools allow agents to interact with other backend systems or implement custom logic.

#### Cortex Analyst semantic view

You can use Cortex Analyst to create SQL queries from natural language. To use Cortex Analyst, you must create a Semantic Model. For more information, see [Create a semantic model](cortex-analyst.md).

#### Cortex Search Service

Use Cortex Search to search through your data. For more information, see [CREATE CORTEX SEARCH SERVICE](../../sql-reference/sql/create-cortex-search.md).

Agents can dynamically adjust the following search parameters if the user’s query requires it: filter conditions, metadata columns to retrieve, number of results,
per-index queries for multi-index services, and time-decay settings.

> **Note:**
>
> The DEFAULT_ROLE of the querying user must have USAGE privilege on the Cortex Search Service, as well as the database and schema
> in which it resides.

#### Custom tools

You can use stored procedures and user defined functions (UDF) to implement custom business logic as a tool. For more information, see [Stored procedures overview](../../developer-guide/stored-procedure/stored-procedures-overview.md) and [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

### Thinking and reflection

The Agent emits events throughout the interaction, providing insights into the reasoning process. These steps cover the initial splitting of tasks, sequencing into sub-tasks, and selection of tools for the sub-task. In addition, the agent also surfaces its reflections about tool results and how these influence further orchestration.

### Monitor, evaluate, and iterate

You can collect feedback from the end user as a rating (positive/negative), along with any subjective inputs (as text). These can be used to refine and improve the agent over the lifecycle. For more information on how to perform monitoring and evaluation with native Snowflake features, see [Monitor Cortex Agent requests](cortex-agents-monitor.md) and [Cortex Agent evaluations](cortex-agents-evaluations.md).

## Web search

Before providing web search access to your agents, an ACCOUNTADMIN role must first enable web search access at the account level. To properly enable web search:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select Settings.
4. Select the Web search toggle to enable the feature, as shown below.

After enabling web search at the account level, you can use the web search tool in your agents. For more information, see [Create an agent](cortex-agents-manage.md).

Cortex Agents use the Brave Web Search API to query the web and retrieve results for real-time information during an interaction. The agent creates a query based on the user’s input and any relevant context from the interaction. The API returns results from Brave Search’s independent web index. The query and the results leave Snowflake and traverse the public internet. The agent then incorporates the relevant results into its response alongside any data from other configured tools. Snowflake has enabled zero data retention (ZDR) with Brave, which means no search queries are stored by Brave for any length of time. This applies to the search query text, the results returned, and any metadata associated with the request. ZDR simplifies compliance obligations and reduces risk — because the data is never stored.

## Interact with agents

Cortex Agents support two distinct methods of interacting with agents through the REST API:

* **Configure an agent object to interact with the agent**: With this method, you first configure an agent object that can be reused for the entire interaction. Configuring an agent object simplifies client code and enables CI/CD for enterprise-ready applications.
* **Interact without an agent object**: With this method, you must pass the agent configuration as part of every interaction request. Interaction without an agent object allows you to quickly try out use cases and experiment with different scenarios.

For information about these methods, see [Configure and interact with Agents](cortex-agents-manage.md).

## Legal notices

Where your configuration of Cortex Agents uses a model provided on the
[Model and Service Flow-down Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/open-source-model-flow-down-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Cortex Agents for Microsoft Teams and Microsoft 365 Copilot
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-teams-integration.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agents for Microsoft Teams and Microsoft 365 Copilot

## Introduction

For most teams, accessing timely data insights means context-switching between dedicated analytics platforms and
communication tools, leading to delays and reduced productivity. Integrating an agentic AI system into Microsoft Teams
can bring the answers directly to where conversations and decisions happen, accelerating the flow of information across
your business. But building a secure, in-chat analytics solution that is both powerful and intuitive is a significant
undertaking. Fortunately, Snowflake has built one for you.

The Snowflake Cortex Agents integration for Microsoft Teams and Microsoft 365 Copilot embeds Snowflake’s conversational
AI agents into your business communication platform. Business teams and non-technical users can interact with their
Snowflake structured and unstructured data using simple, natural language to receive direct answers and visualizations without leaving
their Teams chats or the broader Microsoft 365 ecosystem. The integration is available via
[Microsoft AppSource](https://appsource.microsoft.com/en-us/product/Office365/WA200008996) for
seamless deployment.

Use the following sections to set up the integration and start using it to get value from your data.
For a Quickstart guide, see [Getting Started with Cortex Agents for Microsoft Teams and Microsoft 365 Copilot](https://quickstarts.snowflake.com/guide/getting_started_with_the_microsoft_teams_and_365_copilot_cortex_app).

> **Important:**
>
> When you use this integration, you are directing Snowflake to send or receive data between the Snowflake Service and Microsoft services (including Microsoft Teams and Microsoft 365 Copilot). Snowflake is not responsible for the privacy, security, or integrity of data once it leaves the Snowflake Service boundary. Your use of Microsoft Teams or Microsoft 365 Copilot, and any data you process with it, is governed solely by the terms between you and Microsoft.

### Key features

* **Seamless analytics via natural language.** Delight your business decision-makers by empowering them to get insights
  themselves within the Microsoft Teams and Microsoft 365 Copilot interfaces. You can discover trends and analyze data
  without technical expertise or waiting for a custom dashboard to be built. Users can ask questions conversationally
  and receive accurate, LLM-powered answers in text, tabular, or chart form on the fly, dramatically accelerating
  data-driven decision-making.
* **Dual interfaces for comprehensive workflows.** Cortex Agents for Microsoft Teams offer two distinct interfaces to
  support different business needs. Use the standard Teams Application for dedicated, in-depth analysis within a Teams
  Bot application chat, or leverage the Microsoft 365 Copilot Agent to bring targeted Snowflake insights into your
  wider conversational workflow within the Microsoft 365 Copilot ecosystem.
* **Powered by Snowflake Cortex Agents.** This integration is powered by the Snowflake Cortex Agents API, which handles
  the complexities of generating accurate, reliable insights from your data. The agentic system intelligently interprets
  user requests and generates responses, saving your teams from having to build complex conversational AI patterns or
  manage underlying models. You can reuse the same agents you use with
  [Snowflake Intelligence](snowflake-intelligence.md), avoiding duplicate
  configuration and governance effort.
* **Enterprise-grade security and governance.** Built on Snowflake’s privacy-first foundation, the integration ensures you
  can confidently explore AI-driven use cases. This means:

  > + **Your data stays within Snowflake’s governance boundary.** User prompts are sent to the Cortex Agents API, but the
  >   underlying data queried to generate an answer never leaves Snowflake’s secure environment. The resulting SQL query is
  >   executed within your Snowflake virtual warehouse.
  > + **Seamless integration with Snowflake’s privacy and governance features.** The integration fully respects Snowflake’s
  >   role-based access control (RBAC). All queries executed on behalf of a user adhere to their established permissions,
  >   guaranteeing that users can only see data they are authorized to access.

## Regional availability and limitations

The Cortex Agents integration for Microsoft Teams and Microsoft 365 Copilot is available across all Snowflake public
cloud deployments. However, there are some regional considerations and current limitations you should be aware of:

### Consent for accounts outside Azure US East 2

When connecting a Snowflake account that is based in a region other than Azure US East 2, administrators are
prompted to accept a consent notification during the account setup process. This consent acknowledges that the bot backend infrastructure processes user prompts and bot responses through
service hosted in Azure US East 2 region.

To withdraw consent, the account must be removed by an administrator through the Teams application interface.

> **Consent text displayed during setup:**
>
> The following is the exact consent you will be asked to accept when connecting your Snowflake account to the Teams bot:
>
> ```
> Data Processing.
> Use of this integration requires an intermediate processing (but not storage) step in Snowflake's Azure East US 2 region,
> regardless of the region where your Snowflake account is located.
> By proceeding, you are authorizing Snowflake to process your data within Snowflake's Azure East US 2 region.
>
> For more information on this behavior, please refer to documentation.
> ```

### Private Link

Private Link configurations are not supported. You must disable Private Link to use this integration.

### Sovereign cloud regions

The integration is not available for Snowflake accounts in sovereign cloud regions.

## Set up integration

Cortex Agent’s Microsoft Teams integration allows organization administrators to connect multiple Snowflake accounts to
the Teams and Copilot workspaces in their organizations. Setting up the integration involves a few simple steps, summarized
below:

1. **Tenant-wide setup by Azure administrator.** The integration requires a one-time setup by a Microsoft Azure
   administrator to grant consent for the Snowflake application within the Microsoft Entra ID (formerly Azure Active Directory) tenant. This
   step enables secure OAuth 2.0 authentication for the integration.
2. **Snowflake security integration.** After the Azure administrator has completed the tenant-wide setup, a
   Snowflake administrator must configure a security integration for each individual Snowflake account that they wish to
   connect to the Microsoft Teams or M365 Copilot application. This step ensures that the integration can securely access
   the necessary data within each Snowflake account.
3. **Linking accounts to the bot.** Once the security integration is configured, the Snowflake administrator can link
   the Snowflake account to the Microsoft Teams or M365 Copilot bot. This step allows the bot to access the data and
   functionality of the Snowflake account, enabling users to interact with their data directly within Teams or Copilot.

### Prerequisites

Before you begin the integration process, make sure you have established the following:

* **Administrator access.** Setup requires administrative access on both Snowflake and your Microsoft tenant.
* **Snowflake administrative privileges:** Your Snowflake user must have access to the ACCOUNTADMIN or SECURITYADMIN
  role. These permissions are required to create the necessary security integration object in your Snowflake account.
* **Microsoft administrative privileges:** You Azure user must have Global Administrator privileges (or an equivalent
  role) for your Microsoft Entra ID tenant. These privileges are required to grant the necessary tenant-wide admin
  consent for the application.
* **Microsoft tenant ID:** You need your organization’s Microsoft tenant ID to configure the Snowflake security
  integration. For more information on finding your organization’s Tenant ID, see
  [Get subscription and tenant IDs in the Azure portal](https://learn.microsoft.com/en-us/azure/azure-portal/get-subscription-tenant-id).
* **Individual User Accounts:** Every end user must have their own Microsoft and Snowflake user accounts.
* **End-user licensing:** Users must have the appropriate Microsoft licenses to access Microsoft Teams. A Copilot license
  is also required if you plan to use the integration with Microsoft 365 Copilot.

### Step 1: Tenant-wide Entra ID configuration

To enable secure authentication for Cortex Agents, a Microsoft Azure administrator must grant consent for two
applications hosted in Snowflake’s tenant, creating a *service principal* for each application within your Entra ID tenant.
The two applications are:

* **Cortex Agents Bot OAuth Resource:** Represents the protected Snowflake API and defines the access permissions
  (scopes) for client applications.
* **Cortex Agents Bot Snowflake OAuth Client:** Represents the client application, in this case the Teams application
  back end service, that calls the Snowflake API after requesting an access token.

Instructions for granting consent for these applications are provided below. The process is very similar for both applications,
but the specific permissions and scopes differ slightly.

#### Granting consent for OAuth Resource principal

To grant consent for the Cortex Agents Bot OAuth Resource application service principal:

1. In your browser, navigate to `https://login.microsoftonline.com/<tenant-id>/adminconsent?client_id=5a840489-78db-4a42-8772-47be9d833efe`,
   where `tenant-id` is your organization’s Microsoft tenant ID.

   If you are not already signed in, you are prompted to do so.

   A Permission requested dialog appears, showing the permission that the application requires.
2. Select Accept to grant the requested permission.

#### Granting consent for OAuth Client principal

This process displays two dialogs. Each is similar to the one for the OAuth Resource principal, but the permissions requested are different.

To grant consent for the Cortex Agents Bot Snowflake OAuth Client application service principal:

1. In your browser, navigate to `https://login.microsoftonline.com/<tenant-id>/adminconsent?client_id=bfdfa2a2-bce5-4aee-ad3d-41ef70eb5086`,
   where `tenant-id` is your organization’s Microsoft tenant ID.

   A Permissions requested (1 of 2) dialog appears, showing one set of permissions that the application requires.
2. Select Accept to grant the requested permissions.

   The second permission dialog appears (Permissions requested (2 of 2)).
3. Select Accept to grant the requested permissions.

> **Important:**
>
> You may see an error message stating that a required query string parameter was missing, like the following.
>
> ```output
> {
>   "error": {
>     "code": "ServiceError",
>     "message": "Missing required query string parameter: code. Url = https://unitedstates.token.botframework.com/.auth/web/redirect?admin_consent=True&tenant=<TENANT-ID>"
>   }
> }
> ```
>
> You can safely ignore this error. Consent was still granted successfully. To be sure, confirm the permissions were granted successfully
> by following the instructions in the next section.

#### Confirming permission grants

After granting consent for both applications, you can confirm that the permissions were granted successfully by checking the
Enterprise applications section of the Microsoft Entra ID portal.

1. Log in to the [Microsoft Entra admin center](https://entra.microsoft.com/) if necessary.
2. Navigate to Enterprise Applications by typing “enterprise applications” in the search box, then selecting Enterprise applications in the results.
3. In the All applications list, find the two applications for which you just granted consent: Snowflake Cortex Agents Bot OAuth Resource and
   Snowflake Cortex Agents Bot OAuth Client. An easy way to do this is to search for “Snowflake Cortex Agent.”

   If both applications appear in the list, permissions have been correctly granted. If one or both applications are missing, try granting consent again.

### Step 2: Snowflake security integration

Integrating Snowflake with Microsoft Teams requires a [security integration](../../sql-reference/sql/create-security-integration.md)
that establishes cryptographic trust between your Snowflake account and your Entra ID tenant. This process requires:

* Enabling Entra ID as an external OAuth provider in Snowflake.
* Choosing or creating at least one Cortex Agent object for the integration.
* Granting required roles and privileges so intended users can invoke the agent.

#### Enabling Entra ID as an external OAuth provider

A Snowflake security integration object represents an integration with an external OAuth provider, in this case
Microsoft Entra ID. This integration allows Snowflake to authenticate users who are logged into Microsoft Teams or
Copilot.

The following SQL statement is an annotated template for creating the integration. This command must be
executed by a role with ACCOUNTADMIN privileges. Replace the `tenant-id` placeholders with your Microsoft
Tenant ID.

```sqlexample
CREATE OR REPLACE SECURITY INTEGRATION entra_id_cortex_agents_integration
    TYPE = EXTERNAL_OAUTH
    ENABLED = TRUE
    EXTERNAL_OAUTH_TYPE = AZURE
    EXTERNAL_OAUTH_ISSUER = 'https://login.microsoftonline.com/<tenant-id>/v2.0'
    EXTERNAL_OAUTH_JWS_KEYS_URL = 'https://login.microsoftonline.com/<tenant-id>/discovery/v2.0/keys'
    EXTERNAL_OAUTH_AUDIENCE_LIST = ('5a840489-78db-4a42-8772-47be9d833efe')
    EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = ('email', 'upn')
    EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'email_address'
    EXTERNAL_OAUTH_ANY_ROLE_MODE = 'ENABLE'
```

See [CREATE SECURITY INTEGRATION (External OAuth)](../../sql-reference/sql/create-security-integration-oauth-external.md) for a complete reference of the parameters
available for this command.

Together, the EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM and EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE parameters
link an Entra ID identity to a Snowflake identity. For authentication to succeed, the value of the specified claim in
the JWT must exactly match the value of the specified attribute on a user object in Snowflake. The two main configurations
Snowflake recommends are:

* Mapping by User Principal Name (UPN): Set the EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM parameter to ‘upn’ and the
  EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE parameter to ‘LOGIN_NAME’.
* Mapping by email address: Set the EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM parameter to ‘email’ and the
  EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE parameter to ‘EMAIL_ADDRESS’.

The example statement above uses the email address mapping configuration, but also specifies UPN in the
EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM parameter, allowing you to change the mapping method by changing only the
EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE.

The example statement also enables EXTERNAL_OAUTH_ANY_ROLE_MODE, so that the user’s default role is used.

For more information on OAuth scopes, see [Scopes](../oauth-ext-overview.md).

#### User provisioning requirements

To ensure successful authentication using the mapping configuration described previously, make sure that a strict
one-to-one mapping exists between Entra ID users and Snowflake users. Designate or create a Snowflake user for every
Entra ID user who will use the integration.

Each Entra ID user must map to exactly one Snowflake user. For email mapping, the Entra ID primary email must exactly
match the Snowflake user’s EMAIL_ADDRESS. For UPN mapping, the Entra ID UPN must exactly match the Snowflake user’s
LOGIN_NAME.

To reduce manual administration effort, you can optionally configure automatic user provisioning and deprovisioning from
Entra ID to Snowflake. See
[Configure automatic provisioning](https://learn.microsoft.com/en-us/entra/identity/saas-apps/snowflake-provisioning-tutorial).

#### Create and configure the Cortex Agents

After you create the security integration, ensure that at least one
[Cortex Agent Object](cortex-agents-rest-api.md) exists in your Snowflake account for the Teams or Microsoft 365
Copilot integration to use.

If you already have a working agent that you want to use, no further action is required for this step.

To create a new agent, follow the [instructions](cortex-agents-manage.md).

> **Note:**
>
> If you already use Snowflake Intelligence and have created agents for that experience,
> you can reuse those agents with the Microsoft Teams and Microsoft 365 Copilot integration.
> You don’t need to recreate or reconfigure them;
> any changes you make to an agent (such as instructions, tools, underlying objects, or privileges)
> are immediately reflected across all three interfaces.

##### Grant required privileges to users

Make sure the role under which the integration will run (each user’s default role or permitted secondary roles) has the grants described in the
[access control requirements section](cortex-agents.md).

### Step 3: Setting up the Teams app and connecting your Snowflake account

The final step in the integration process is to set up the Microsoft Teams application and connect it to the Snowflake
users who will use it. This requires you to complete the following tasks:

* Install the Cortex Agents app from the Teams store
* Connect your Snowflake account to the Teams application

#### Install the app from the Teams store

All users must install the Cortex Agents app from the Microsoft Teams store. To install the app, search for “Snowflake
Cortex Agents” in the Teams app store, then click Add to install the app.

> **Note:**
>
> Depending on your organization’s Microsoft Teams policies, a Teams Administrator may need to approve the app before it is available to users.
> See [Overview of app management and governance in Teams admin center](https://learn.microsoft.com/en-us/microsoftteams/manage-apps) for instructions.

#### Connect your Snowflake account to the Teams app

The first user to interact with the Cortex Agents app in Teams is prompted to connect their Snowflake account to the
app. This user must have the ACCOUNTADMIN or SECURITYADMIN role in Snowflake for this step to succeed.

To recap, every user’s default role in Snowflake must have the required privileges to access the agent’s objects, as
described in the [access control requirements section](cortex-agents.md) of the Cortex Agents
topic.

Security integrations block the main Snowflake administrative roles by default. Therefore, you cannot use administrative
roles such as ACCOUNTADMIN as the default role for the user that will set up the Teams bot. For information on this
restriction, see [BLOCKED_ROLES_LIST](../../sql-reference/sql/create-security-integration-oauth-snowflake.md) in the CREATE SECURITY INTEGRATION topic.

Snowflake recommends you create a dedicated, non-administrative role with the required permissions and set it as the
default for the setup user. Alternatively, use the [SECONDARY ROLES](../../sql-reference/sql/use-secondary-roles.md) mechanism to grant the additional permissions without
altering the user’s primary default role, as follows:

```sqlexample
GRANT ROLE <integration_specific_role> TO USER <user_name>;
ALTER USER <user_name> SET DEFAULT_SECONDARY_ROLES = ('ALL');
```

To set up the Teams bot, follow these steps:

1. Click I’m the Snowflake administrator, below the notice stating that an administrator needs to configure
   Snowflake for the Teams enticement, to begin the process.
2. Provide your Snowflake account URL where indicated, and select Connect Snowflake account.

   To find your account URL, log in to Snowsight and click the account selector in the bottom left corner of the page. The
   hostname portion of the URL is displayed at the top of the menu and is in the format `your-organization-your-account`.
   The full URL is `your-organization-your-account.snowflakecomputing.com`.

   The configuration wizard verifies that the URL leads to a valid Snowflake instance and confirms that your user has
   access to it and has the required administrative privileges. If your account is in a region other than Azure US East 2,
   you are prompted to accept a consent notification during this process.

After the setup passes final validation, the Teams app is connected to your Snowflake account and the agents are ready to use.

> **Tip:**
>
> After you have connected your Snowflake account to the Cortex Teams app, you can connect additional Snowflake accounts to the same
> app by logging into the Teams app with a user that has the necessary privileges and issuing the “add new account” command in the chat.

## Using the Cortex Agents

After the integration is set up, the bot appears in the Microsoft Teams interface, allowing your users to interact with
it in a private chat. Users can ask questions in natural language, and the bot responds with answers based on Snowflake
data.

In Microsoft 365 Copilot, your users can interact with the agents in the context of their broader workflows, asking
questions and receiving answers about their Snowflake data within the Copilot interface.

### Available commands

In addition to asking natural language questions, Cortex Agent bots accept predefined commands from Microsoft Teams chat. These commands help manage accounts and agents within the Teams interface.

The following commands are available:

| Command | Description |
| --- | --- |
| `Help` | Display a list of available commands and usage instructions. |
| `Choose agent` | Switch between available Cortex Agents within the current account. Displays a list of agents you have access to. |
| `Logout` | Log out from the current account. |
| `Show configured accounts` | Display a list of all configured Snowflake accounts. |
| `Clear context` | Clear agent’s internal chat history. |
| `Starter prompts` | Explore example questions you can ask the chosen agent. |
| `Admin Panel` | Display a list of available admin commands for your Snowflake account. |
| `Add account` | Connect an additional Snowflake account to the Teams app. Requires administrative privileges on the Snowflake account. |
| `Describe account` | Display information about the current Snowflake account. Displays a list of accounts with admin privileges to describe. |
| `Remove account` | Disconnect a Snowflake account from the Teams app. Requires administrative privileges. |

> **Note:**
>
> Commands are case-insensitive and can be entered conversationally in the Teams chat. For example, you can send `Help`
> or `help` in the chat to access the help command.

### Feedback on answers (Teams only)

Users can provide qualitative feedback on the agent’s responses directly in the Microsoft Teams interface (for example,
marking an answer as helpful or not helpful and optionally adding a comment). Users can also review the feedback they
have previously submitted. For instructions, see [View feedback provided by users](cortex-agents-monitor.md).

> **Note:**
>
> The feedback capability is available only in Microsoft Teams and is not supported in the Microsoft 365 Copilot experience.

### Switching between accounts and agents

You can connect multiple Snowflake accounts to the integration. Each connected account can expose one or more Cortex
Agents. Once the accounts are connected, users can switch among accounts and agents in the Teams UI with a single click;
no need to re-authenticate or re-enter connection details. Switching between accounts and agents makes it easier to
compare insights across business domains (for example, sales vs. marketing) while preserving each user’s security
context.

> **Tip:**
>
> You can also switch among agents in an account conversationally (for example, by entering “Choose agent”) if
> you prefer a command interaction instead of the UI.

## Security considerations

The Cortex Agents integration for Microsoft Teams is designed with security in mind, leveraging Snowflake’s existing
security features and Microsoft Entra ID’s authentication capabilities. The integration ensures that user data remains
secure and that access is controlled through Snowflake’s role-based access control (RBAC) system.

### End-to-end authentication flow

To understand the security implications of using the Cortex Agents integration for Microsoft Teams, it is important to
understand the end-to-end authentication flow. This process involves the following steps:

* **User interaction:** A user sends a message to the Snowflake Cortex Agents bot in Microsoft Teams.
* **Authentication trigger:** The bot’s back end service (the “Client” app) initiates an OAuth 2.0 flow, redirecting the
  user to the Microsoft Entra ID.
* **User authentication:** The user signs in to their Microsoft account with their corporate credentials, satisfying any
  MFA or Conditional Access policies enforced by their tenant.
* **Token issuance:** Entra ID provides a short-lived authorization code. The bot’s backend securely exchanges this code
  for a JWT access token.
* **API call to Snowflake:** The bot back end calls the Snowflake Cortex Agents API, including the access token in the
  `Authorization: Bearer` header.
* **Snowflake token validation:** The Snowflake service receives the request and validates the JWT against the policy
  defined in the Snowflake security integration object.

### Role-Based Access Control

Because it uses the Cortex Agents API under a specific user role, the Teams integration executes Cortex Agents requests
with the exact privileges of the user’s designated Snowflake role. The agent inherits all existing data governance
controls, including:

* **Role-Based Access Control:** The agent can only access databases, schemas, tables, and warehouses that the user’s role permits them to use.
* **Data masking policies:** The agent respects dynamic data masking policies, granting access only when allowed by the user’s role.
* **Row-Level access policies:** The agent enforces row-level security policies.

The agent cannot bypass any existing Snowflake security controls, and users cannot access data that they are not already
authorized to see.

### Network policies

The integration supports Snowflake [network policies](../network-policies.md) by forwarding the client
IP address received from Microsoft to Snowflake for policy enforcement. Network policies allow administrators to
control inbound access to the Snowflake service by restricting connections based on IP addresses and other network
identifiers.

> **Important:**
>
> The Cortex Agents integration for Microsoft Teams and Microsoft 365 Copilot does not create, modify, or activate any
> network policies on your Snowflake account; it only respects the network policies that exist in your Snowflake
> instance. Network policy configuration is entirely under the control of your Snowflake account administrators.

When a user signs in to the Cortex Agents bot, Microsoft issues a token that includes an `ipaddr` claim representing
the user’s IP address at the time of sign-in. The integration forwards this IP address to Snowflake with each request,
allowing Snowflake to enforce any network policies that rely on client IP information. Microsoft might periodically
issue additional tokens with the same IP address for the duration of the user’s session. The IP address claim in the
token is updated only when a user completely signs out and back in within the bot.

> **Caution:**
>
> The IP address used for network policy enforcement reflects the user’s address at the time of Microsoft sign-in and
> does not update if the user changes their IP address (for example, by connecting to a different network or by connecting
> to or disconnecting from a VPN) during their session with the bot, unless otherwise controlled by your Microsoft tenant
> configuration. Snowflake continues enforcing network policies against the original IP address until the user
> explicitly signs out of the bot and signs back in.
>
> In Snowsight, a client IP change typically invalidates the session immediately when network policies are enabled.
> In the Microsoft Teams and Microsoft 365 Copilot integration, session persistence and IP refresh behavior are
> controlled by Microsoft.

## Current limitations

OAuth identity provider must be Entra ID
:   The integration exclusively supports Microsoft Entra ID as the identity provider for authentication and requires a
    direct one-to-one mapping between Entra ID users and Snowflake users. Organizations that use another primary IdP
    (for example, Okta or another SAML/OIDC provider) can enable this integration by configuring standard identity
    federation between that provider and Microsoft Entra ID. In this federated model, the primary IdP handles the user’s
    sign-in, after which Entra ID issues the final token required by the integration.

Default user role reliance
:   The integration’s functionality is tied to each user’s default Snowflake role due to an architectural constraint in
    the Cortex Agents API, which determines session permissions based on the role context established during
    authentication. Therefore, the user’s default role must be granted all necessary privileges on the underlying objects
    for the agent to function correctly. While Snowflake’s [secondary roles](../security-access-control-overview.md)
    feature can help to broaden data access, the primary execution context is governed by the user’s default role.

## Troubleshooting

If you encounter issues with the Cortex Agents integration for Microsoft Teams, check the following sections for possible solutions.

### Privilege and access issues

The user’s default role must have the required privileges to access the objects used or accessed by the agent.
Error messages caused by access issues typically include the phrase “database object does not exist or not authorized.”

Troubleshooting such issues involves checking that user’s default role is set to a role that has the required privileges.

#### Default role setting

The first step in troubleshooting access issues is to check the user’s default role setting. To verify this setting,
use the DESCRIBE USER command. Check the DEFAULT_ROLE property in the output. If the user’s default role is incorrect,
change it using the ALTER USER command.

```sqlexample
ALTER USER <user_name> SET DEFAULT_ROLE = '<correct_role>';
```

If changing the user’s primary DEFAULT_ROLE is not feasible, you can use the Snowflake’s secondary roles mechanism. A
user can perform actions using the combined privileges of their primary and active secondary roles. This lets you to
grant an additional, integration-specific role to the user without altering their primary role.

To add a secondary role for the Cortex Agents integration, use SQL commands like the following.

```sqlexample
GRANT ROLE <integration_specific_role> TO USER <user_name>;
ALTER USER <user_name> SET DEFAULT_SECONDARY_ROLES = ('ALL');
```

#### Required permissions

Make sure the role under which the integration will run (each user’s default role or permitted secondary roles) has the
grants described in the [access control requirements section](cortex-agents.md).

### Security integration issues

A Snowflake security integration connects the Microsoft Entra ID tenant to the Snowflake account. The issues in this
section are related to the security integration.

#### Invalid OAuth access token (error code 390303)

This error can indicate that one or more property values in the security integration are incorrect, preventing Snowflake
from validating the access token received from Entra ID. To rectify this, check the following fields in the security
integration. In particular, make sure the tenant ID is correct in the URLs.

* **EXTERNAL_OAUTH_ISSUER:** This must be set to the correct Entra ID issuer URL, which is in the format
  `https://login.microsoftonline.com/tenant-id/v2.0`, where `tenant-id` is your organization’s Microsoft
  tenant ID.
* **EXTERNAL_OAUTH_JWS_KEYS_URL:** This must be set to the correct JWS keys URL, which is in the format
  `https://login.microsoftonline.com/tenant-id/discovery/v2.0/keys`, where `tenant-id` is your organization’s
  Microsoft tenant ID.
* **EXTERNAL_OAUTH_AUDIENCE_LIST:** This must include the correct audience for the Cortex Agents Bot OAuth Resource
  application, which is the application ID `5a840489-78db-4a42-8772-47be9d833efe`.

Update any incorrect values using the ALTER SECURITY INTEGRATION command.

#### Incorrect username or password (error code 390304)

This error message points to a mismatch between the user identifier sent by Entra ID and the corresponding user’s record
in Snowflake, usually because the Entra ID user identity does not map to exactly one Snowflake user. This can happen when the Snowflake
user does not exist, when the mapped UPN or email address is incorrect, or when the mapping resolves to multiple Snowflake
users (for example, if the mapping is performed using email address and multiple users share the same address).

The error message includes the UPN and email of the user attempting to log in. Use this information to verify the
affected user’s configuration using the DESCRIBE USER command. Make sure the user’s NAME or EMAIL property matches the
value of the same property in Entra ID for the corresponding user. When using email address mapping, each user in the
Snowflake account that will use the integration must have a unique email address.

#### Role not listed in the access token or was filtered out (error code 390317)

This error occurs when Snowflake cannot assign a role to the user based on the information in the OAuth access token.
The access token is configured with the `session:role-any` scope, which allows the user to assume any of their
assigned roles in Snowflake. However, the security integration must be explicitly configured to permit this behavior.

Use the DESCRIBE SECURITY INTEGRATION command to check the value of the EXTERNAL_OAUTH_ANY_ROLE_MODE property, then
change it to `ENABLE` or `ENABLE_FOR_LOGIN`.

```sqlexample
DESCRIBE SECURITY INTEGRATION entra_id_cortex_agents_integration;

ALTER SECURITY INTEGRATION entra_id_cortex_agents_integration
    SET EXTERNAL_OAUTH_ANY_ROLE_MODE = 'ENABLE';
```

#### Role specified in the connect string is not granted to this user (error code 390186)

This error occurs when Snowflake security integration doesn’t allow the user’s default role to use the security
integration.

To resolve this, check the following properties in the output of DESCRIBE SECURITY INTEGRATION:

* EXTERNAL_OAUTH_ALLOWED_ROLES_LIST: If the parameter is enabled, verify that it contains the user’s default role.
* EXTERNAL_OAUTH_BLOCKED_ROLES_LIST: If the parameter is enabled, verify that it does not contain the user’s default role.

### Network policy issues

If a user is blocked by a network policy when using the Cortex Agents integration for Microsoft Teams or
Microsoft 365 Copilot, try the following steps:

1. **Verify that the user’s IP address is allowlisted.** Confirm that the user’s current IP address is included in the
   account’s network policy. A simple way to test this is to have the user log in to their Snowflake account directly at
   [Snowflake](https://app.snowflake.com/). If the user can log in successfully, their IP address is allowlisted.
2. **Verify that the user’s IP address is not IPv6.** If you encounter an IPv6 address in an error related to a
   network policy, this indicates that Microsoft is sending an IPv6 address as a claim within the authentication token.
   Snowflake network policies currently do not support IPv6 rules, but this functionality is planned for the near
   future. For further details on the timeline, please contact
   [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
3. **Refresh the Entra ID token.** The bot may be using a token with an outdated IP address. To force a token
   refresh, have the user type `/logout` in the chat window, then type `/login` and sign in to Microsoft again.

---
title: Cortex Agents REST API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-rest-api.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agents REST API

> **Note:**
>
> Requests to the Cortex Agent REST API time out after 15 minutes.

You can use the Cortex Agent REST API to create, manage, and interact with Cortex Agent Objects in your Snowflake account.

## Create Cortex Agent

`POST /api/v2/databases/{database}/schemas/{schema}/agents`

Creates a new Cortex Agent Object with the specified attributes and specification.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Your Snowflake Account URL. |
| `schema` | (Required) Schema identifier. |

#### Query parameters

| Parameter | Description |
| --- | --- |
| `createMode` | (Optional) Resource creation mode. Valid values:  * `errorIfExists` * `orReplace` * `ifNotExists` |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

#### Request body

| Field | Type | Description |
| --- | --- | --- |
| `name` | string | Name of the agent. |
| `comment` | string | Optional comment about the agent. |
| `profile` | AgentProfile | Agent profile information (display name, avatar, color, etc.). |
| `models` | ModelConfig | Model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
| `instructions` | AgentInstructions | Instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
| `orchestration` | OrchestrationConfig | Orchestration configuration, including budget constraints (e.g., seconds, tokens). |
| `tools` | array of Tool | List of tools available for the agent to use. Each tool includes a tool_spec with type, name, description, and input schema. Tools may have a corresponding configuration in tool_resources. |
| `tool_resources` | map of ToolResource | Configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

**Example**

```json
{
  "name": "MY_AGENT",
  "comment": "An agent to answer questions about all my data",
  "profile": {
    "display_name": "My Agent"
  },
  "models": {
    "orchestration": "claude-4-sonnet"
  },
  "instructions": {
    "response": "You will respond in a friendly but concise manner",
    "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
    "system": "You are a friendly agent ..."
  },
  "orchestration": {
    "budget": {
      "seconds": 30,
      "tokens": 16000
    }
  },
  "tools": [
    {
      "tool_spec": {
        "type": "generic",
        "name": "get_revenue",
        "description": "Fetch the delivery revenue for a location.",
        "input_schema": {
          "type": "object",
          "properties": {
            "location": {
              "type": "string",
              "description": "The city and state, e.g. San Francisco, CA"
            }
          }
        },
        "required": [
          "location"
        ]
      }
    }
  ],
  "tool_resources": {
    "get_revenue": {
      "type": "function",
      "execution_environment": {
        "type": "warehouse",
        "warehouse": "MY_WH"
      },
      "identifier": "DB.SCHEMA.UDF"
    }
  }
}
```

### Response

A successful response returns a JSON object with details about the status of Cortex Agent creation.

#### Response body

```json
{"status": "Agent xxxx successfully created."}
```

## Describe Cortex Agent

`GET /api/v2/databases/{database}/schemas/{schema}/agents/{name}`

Describes a Cortex Agent.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Identifier for the database to which the resource belongs. You can use the /api/v2/databases GET request to get a list of available databases. |
| `schema` | (Required) Identifier for the schema to which the resource belongs. You can use the /api/v2/databases/{database}/schemas GET request to get a list of available schemas for the specified database. |
| `name` | (Required) Identifier for the agent. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

### Response

A successful response returns a JSON object describing the Cortex Agent.

#### Response headers

| Header | Description |
| --- | --- |
| `X-Snowflake-Request-ID` | Unique ID of the API request. |
| `Link` | Links to the page of results (e.g. the first page, the last page, etc.). The header can include multiple url entries with different rel attribute values that specify the page to return (first, next, prev, and last). |

#### Response body

The response body contains the details of the Cortex Agent.

```json
{
  "agent_spec": "{\"models\":{\"orchestration\":\"llama3.1-70B\"},\"experimental\":{\"foo\":\"bar\",\"nested\":{\"key\":\"value\"}},\"orchestration\":{\"budget\":{\"seconds\":30,\"tokens\":16000}},\"instructions\":{\"response\":\"You will respond in a friendly but concise manner\",\"orchestration\":\"For any revenue question use Analyst; for policy use Search\",\"system\":\"You are a friendly agent.\",\"sample_questions\":[{\"question\":\"question 1\"},{\"question\":\"question 2\"},{\"question\":\"question 3\"}]},\"tools\":[{\"tool_spec\":{\"type\":\"cortex_analyst_text_to_sql\",\"name\":\"Analyst1\",\"description\":\"test\"}},{\"tool_spec\":{\"type\":\"cortex_analyst_sql_exec\",\"name\":\"SQL_exec1\"}},{\"tool_spec\":{\"type\":\"cortex_search\",\"name\":\"Search1\"}},{\"tool_spec\":{\"type\":\"web_search\",\"name\":\"web_search_1\"}},{\"tool_spec\":{\"type\":\"generic\",\"name\":\"get_weather\",\"input_schema\":{\"type\":\"object\",\"properties\":{\"location\":{\"type\":\"string\",\"description\":\"The city and state\"}},\"required\":[\"Location\"]}}}],\"tool_unable_to_answer\":\"I don't know the answer to that\",\"tool_resources\":{\"Analyst1\":{\"semantic_model_file\":\"stage1\"},\"Analyst2\":{\"semantic_view\":\"db.schema.semantic_view\"},\"Search1\":{\"name\":\"db.schema.service_name\",\"Max_results\":\"5\",\"filter\":{\"@eq\":{\"region\":\"North America\"}},\"Title_column\":\"<title_name>\",\"ID_column\":\"<column_name>\"},\"SQL_exec1\":{\"Name\":\"my_warehouse\",\"Timeout\":\"30\",\"AutoExecute\":\"true\"},\"web_search\":{\"name\":\"web_search_1\",\"Function\":\"db/schema/search_web\"}}}",
  "name": "MY_AGENT1",
  "database_name": "TEST_DATABASE",
  "schema_name": "TEST_SCHEMA",
  "owner": "ACCOUNTADMIN",
  "created_on": "1967-06-23T07:00:00.123+00:00"
}
```

## Update Cortex Agent

`PUT /api/v2/databases/{database}/schemas/{schema}/agents/{name}`

Updates an existing Cortex Agent with the specified attributes and specification.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Your Snowflake Account URL. You can use the `/api/v2/databases` GET request to get a list of available databases. |
| `schema` | (Required) Schema identifier. You can use the `/api/v2/databases/{database}/schemas` GET request to get a list of available schemas for the specified database. |
| `name` | (Required) Name of the agent. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

#### Request body

| Field | Type | Description |
| --- | --- | --- |
| `comment` | string | Optional comment about the agent. |
| `profile` | AgentProfile | Agent profile information (display name, avatar, color, etc.). |
| `models` | ModelConfig | Model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
| `instructions` | AgentInstructions | Instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
| `orchestration` | OrchestrationConfig | Orchestration configuration, including budget constraints (e.g., seconds, tokens). |
| `tools` | array of Tool | List of tools available for the agent to use. Each tool includes a tool_spec with type, name, description, and input schema. Tools may have a corresponding configuration in tool_resources. |
| `tool_resources` | map of ToolResource | Configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

**Example**

```json
{
  "comment": "An agent to answer questions about all my data",
  "profile": {
    "display_name": "My Agent"
  },
  "models": {
    "orchestration": "claude-4-sonnet"
  },
  "instructions": {
    "response": "You will respond in a friendly but concise manner",
    "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
    "system": "You are a friendly agent ..."
  },
  "orchestration": {
    "budget": {
      "seconds": 30,
      "tokens": 16000
    }
  },
  "tools": [
    {
      "tool_spec": {
        "type": "generic",
        "name": "get_revenue",
        "description": "Fetch the delivery revenue for a location.",
        "input_schema": {
          "type": "object",
          "properties": {
            "location": {
              "type": "string",
              "description": "The city and state, e.g. San Francisco, CA"
            }
          }
        },
        "required": [
          "location"
        ]
      }
    }
  ],
  "tool_resources": {
    "get_revenue": {
      "type": "function",
      "execution_environment": {
        "type": "warehouse",
        "warehouse": "MY_WH"
      },
      "identifier": "DB.SCHEMA.UDF"
    }
  }
}
```

### Response

A successful response returns a JSON object with details about the status of Cortex Agent update.

#### Response body

```json
{"status": "Agent xxxx successfully updated."}
```

## List Cortex Agents

`GET /api/v2/databases/{database}/schemas/{schema}/agents`

Lists the Cortex Agents under the specified database and schema.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Identifier for the database to which the resource belongs. You can use the /api/v2/databases GET request to get a list of available databases. |
| `schema` | (Required) Identifier for the schema to which the resource belongs. You can use the /api/v2/databases/{database}/schemas GET request to get a list of available schemas for the specified database. |

#### Query parameters

| Parameter | Description |
| --- | --- |
| `like` | (Optional) Filter the output by resource name. Uses case-insensitive pattern matching with support for SQL wildcard characters. |
| `fromName` | (Optional) Enable fetching rows only following the first row whose object name matches the specified string. Case-sensitive and does not have to be the full name. |
| `showLimit` | (Optional) Limit the maximum number of rows returned by the command. Minimum: 1. Maximum: 10000. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

### Response

A successful response returns a JSON array of Cortex Agent resources.

#### Response headers

| Header | Description |
| --- | --- |
| `X-Snowflake-Request-ID` | Unique ID of the API request. |
| `Link` | Links to the page of results (e.g. the first page, the last page, etc.). The header can include multiple url entries with different rel attribute values that specify the page to return (first, next, prev, and last). |

#### Response body

```json
[
 {
  "name": "my_agent",
  "database": "TEST_DB",
  "schema": "TEST_SCHEMA",
  "created_on": "2024-06-01T12:00:00Z",
  "owner": "ACCOUNTADMIN",
  "comment": "Sample agent"
 },
 {
  "name": "another_agent",
  "database": "TEST_DB",
  "schema": "TEST_SCHEMA",
  "created_on": "2024-06-02T08:30:00Z",
  "owner": "SYSADMIN",
  "comment": ""
 }
]
```

## Delete Cortex Agent

`DELETE /api/v2/databases/{database}/schemas/{schema}/agents/{name}`

Deletes a Cortex Agent with the specified name. If the `ifExists` parameter is set to `true`, the operation succeeds even if the agent does not exist. Otherwise, the operation fails if the agent cannot be deleted.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Identifier for the database to which the resource belongs. You can use the /api/v2/databases GET request to get a list of available databases. |
| `schema` | (Required) Identifier for the schema to which the resource belongs. You can use the /api/v2/databases/{database}/schemas GET request to get a list of available schemas for the specified database. |
| `name` | (Required) Identifier for the agent. |

#### Query parameters

| Parameter | Description |
| --- | --- |
| `ifExists` | (Optional) Specifies how to handle the request if the agent does not exist.   * `true`: The endpoint does not throw an error if the agent does not exist. It returns a 200 success response, but does not take any action. * `false`: The endpoint throws an error if the agent does not exist. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

### Response

A successful response returns a confirmation message.

#### Response body

```json
{
 "status": "Request successfully completed"
}
```

## Schemas

### `AgentInstructions`

| Field | Type | Description |
| --- | --- | --- |
| `response` | string | Instructions for response generation. |
| `orchestration` | string | These custom instructions are used when the agent is planning which tools to use. |
| `system` | string | System instructions for the agent. |

**Example**

```json
{
  "response": "You will respond in a friendly but concise manner",
  "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
  "system": "You are a friendly agent ..."
}
```

### `AgentProfile`

The profile information for a Data Cortex agent.

| Field | Type | Description |
| --- | --- | --- |
| `display_name` | string | Display name for the agent. |

**Example**

```json
{
  "display_name": "My Agent"
}
```

### `BudgetConfig`

| Field | Type | Description |
| --- | --- | --- |
| `seconds` | integer | Time budget in seconds. |
| `tokens` | integer | Token budget. |

**Example**

```json
{
  "seconds": 30,
  "tokens": 16000
}
```

### `ExecutionEnvironment`

Configuration for server-executed tools.

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of execution environment, currently only `warehouse` is supported. |
| `warehouse` | string | The name of the warehouse. Case-sensitive, if it is an unquoted identifier, provide the name in all-caps. |
| `query_timeout` | integer | The query timeout in seconds |

**Example**

```json
{
  "type": "warehouse",
  "warehouse": "MY_WAREHOUSE",
  "query_timeout": 60
}
```

### `ModelConfig`

| Field | Type | Description |
| --- | --- | --- |
| `orchestration` | string | Model to use for orchestration. If not provided, a model is automatically selected. |

**Example**

```json
{
  "orchestration": "claude-4-sonnet"
}
```

### `OrchestrationConfig`

| Field | Type | Description |
| --- | --- | --- |
| `budget` | BudgetConfig | Budget constraints for the agent. If more than one constraint is specified, whichever is first hit will end the request. |

**Example**

```json
{
  "budget": {
    "seconds": 30,
    "tokens": 16000
  }
}
```

### `Tool`

Defines a tool that can be used by the agent. Tools provide specific capabilities like data analysis, search, or generic functions.

| Field | Type | Description |
| --- | --- | --- |
| `tool_spec` | ToolSpec | Specification of the tool’s type, configuration, and input requirements. |

**Example**

```json
{
  "tool_spec": {
    "type": "generic",
    "name": "get_revenue",
    "description": "Fetch the delivery revenue for a location.",
    "input_schema": {
      "type": "object",
      "properties": {
        "location": {
          "type": "string",
          "description": "The city and state, e.g. San Francisco, CA"
        }
      }
    },
    "required": [
      "location"
    ]
  }
}
```

### `ToolInputSchema`

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of the input schema object. |
| `description` | string | A description of what the input is. |
| `properties` | map of ToolInputSchema | If type is `object`, definitions of each input parameter. |
| `items` | ToolInputSchema | If type is `array`, the schema for the elements of the array. |
| `required` | array of string | If type is `object`, list of required input parameter names. |

**Example**

```json
{
  "type": "object",
  "description": "Input for my custom tool",
  "properties": {
    "location": {
      "type": "string",
      "description": "The city and state, e.g. San Francisco, CA"
    }
  },
  "items": {},
  "required": [
    "location"
  ]
}
```

### `ToolResource`

> cortex_analyst_text_to_sqlcortex_searchgenericweb_search
>
> Configuration for text-to-SQL analysis tool. Provides parameters for SQL query generation and execution. Exactly one of semantic_model_file or semantic_view must be provided.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `semantic_model_file` | string | The path to a file stored in a Snowflake Stage holding the semantic model yaml. |
> | `semantic_view` | string | The name of the Snowflake native semantic model object. |
> | `execution_environment` | ExecutionEnvironment | Configuration for how to execute the generated SQL query. |
>
> **Example**
>
> ```json
> {
>   "semantic_model_file": "@db.schema.stage/semantic_model.yaml",
>   "semantic_view": "db.schema.semantic_view",
>   "execution_environment": {
>     "type": "warehouse",
>     "warehouse": "MY_WAREHOUSE",
>     "query_timeout": 60
>   }
> }
> ```
>
> Configuration for search functionality. Defines how document search and retrieval should be performed.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `search_service` | string | The fully qualified name of the search service. |
> | `title_column` | string | The title column of the document. |
> | `id_column` | string | The ID column of the document. |
> | `filter` | object | Filter query for search results. |
>
> **Example**
>
> ```json
> {
>   "search_service": "database.schema.service_name",
>   "title_column": "account_name",
>   "id_column": "account_id",
>   "filter": {
>     "@eq": {
>       "<column>": "<value>"
>     }
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | If the tool is server-side executed, whether it is a Stored Procedure or a UDF. |
> | `execution_environment` | ExecutionEnvironment |  |
> | `identifier` | string | Fully qualified name of the Stored Procedure or UDF. |
>
> **Example**
>
> ```json
> {
>   "type": "function",
>   "execution_environment": {
>     "type": "warehouse",
>     "warehouse": "MY_WAREHOUSE",
>     "query_timeout": 60
>   },
>   "identifier": "MY_DB.MY_SCHEMA.MY_UDF"
> }
> ```
>
> Configuration for web search functionality.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `max_results` | integer | Max web search results returned. |
>
> **Example**
>
> ```json
> {
>   "max_results": 20
> }
> ```

### `ToolSpec`

Specification of the tool’s type, configuration, and input requirements.

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of tool capability. Can be specialized types like ‘cortex_analyst_text_to_sql’ or ‘generic’ for general-purpose tools. |
| `name` | string | Unique identifier for referencing this tool instance. Used to match with configuration in tool_resources. |
| `description` | string | Description of the tool to be considered for tool use. |
| `input_schema` | ToolInputSchema | JSON Schema definition of the expected input parameters for this tool. This will be fed to the agent so it knows the structure it should follow for when generating the input for ToolUses. Required for generic tools to specify their input parameters. |

**Example**

```json
{
  "type": "generic",
  "name": "get_weather",
  "description": "lorem ipsum",
  "input_schema": {
    "type": "object",
    "properties": {
      "location": {
        "type": "string",
        "description": "The city and state, e.g. San Francisco, CA"
      }
    },
    "required": [
      "location"
    ]
  }
}
```

---
title: Cortex Agents Run API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-run.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agents Run API

> **Note:**
>
> Requests to the Cortex Agent REST API time out after 15 minutes.

There are two methods to interact with an Agent:

* Build an agent object and reference this agent object in a request to the `agent:run` API.
* Call `agent:run` directly without an agent object. You provide the configuration in the request body of `agent:run`.

`agent:run` supports **streaming responses by default**. To disable streaming and receive a single JSON response, set `stream` to `false`.

## Agent run request with agent object

`POST /api/v2/databases/{database}/schemas/{schema}/agents/{name}:run`

Sends a user query to the agent object and returns its response.

By default, the API streams responses as server-sent events (SSE). To receive a single JSON response, set `stream` to `false` in the request body.

> **Note:**
>
> You can’t set, update, or overwrite the `models`, `instructions`, and `orchestration` fields using this request. To update these fields, you must use [Update Cortex Agent](cortex-agents-rest-api.md).

### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) The database containing the agent. You can use the `/api/v2/databases` GET request to get a list of available databases. |
| `schema` | (Required) The schema containing the agent. You can use the `/api/v2/databases/{database}/schemas` GET request to get a list of available schemas for the specified database. |
| `name` | (Required) The name of the agent. |

### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. See [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |
| `Accept` | (Optional) Response content type. Use `text/event-stream` for streaming responses or `application/json` for a single non-streaming response. |

### Request body

| Field | Type | Description |
| --- | --- | --- |
| `thread_id` | integer | The thread ID for the conversation. If thread_id is used, then parent_message_id must be passed as well. |
| `parent_message_id` | integer | The ID of the parent message in the thread. If this is the first message, parent_message_id should be 0. |
| `messages` | array of Message | If thread_id and parent_message_id are passed in the request, messages includes the current user message in the conversation. Else, messages includes the conversation history and the current message. Messages contains both user queries and assistant responses in chronological order. |
| `stream` | boolean | Whether to return a streaming response (`text/event-stream`) or a non-streaming JSON response (`application/json`). If true, the response will be streamed as Server-Sent Events. If false, the response will be returned as JSON. |
| `tool_choice` | ToolChoice | Configures how the agent should select and use tools during the interaction. Controls whether tool use is automatic, required, or whether specific tools should be used. |

**Example**

```json
{
  "thread_id": 0,
  "parent_message_id": 0,
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is the total revenue for 2023?"
        }
      ]
    }
  ],
  "stream": false,
  "tool_choice": {
    "type": "auto",
    "name": [
      "analyst_tool",
      "search_tool"
    ]
  }
}
```

The request body supports an optional `stream` boolean field:

* If `stream` is omitted, it defaults to `true` and the response is streamed as SSE events.
* If `stream` is `false`, the API returns a single JSON object (see Non-streaming response (stream: false)).

## Agent run without an agent object

`POST /api/v2/cortex/agent:run`

Sends a user query to the Cortex Agents service provided in the request body and returns its response.
Interacts with the agent without creating an agent object.

> **Note:**
>
> Before September 1st, 2025, the request and response schemas for the `agent:run` API were different from the schema listed in this document. Previously, the orchestration was static and the same sequence of tools was used to generate an answer. `agent:run` now has an updated schema for both the request and response. In addition, the API now dynamically orchestrates and iterates to arrive at the final response. We recommend using the schema described in this document for an improved end-user experience.
>
> To use the legacy schema and behavior, use the following schema:
>
> ```json
> {
>   "model": "claude-4-sonnet",
>   "messages": [
>      {"role":"user", "content": [] }
>   ]
> }
> ```

### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. See [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |
| `Accept` | (Optional) Response content type. Use `text/event-stream` for streaming responses or `application/json` for a single non-streaming response. |

### Request body

| Field | Type | Description |
| --- | --- | --- |
| `thread_id` | integer | The thread ID for the conversation. If thread_id is used, then parent_message_id must be passed as well. |
| `parent_message_id` | integer | The ID of the parent message in the thread. If this is the first message, parent_message_id should be 0. |
| `messages` | array of Message | If thread_id and parent_message_id are passed in the request, messages includes the current user message in the conversation. Else, messages includes the conversation history and the current message. Messages contains both user queries and assistant responses in chronological order. |
| `stream` | boolean | Whether to return a streaming response (`text/event-stream`) or a non-streaming JSON response (`application/json`). If true, the response will be streamed as Server-Sent Events. If false, the response will be returned as JSON. |
| `tool_choice` | ToolChoice | Configures how the agent should select and use tools during the interaction. Controls whether tool use is automatic, required, or whether specific tools should be used. |
| `models` | ModelConfig | Model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
| `instructions` | AgentInstructions | Instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
| `orchestration` | OrchestrationConfig | Orchestration configuration, including budget constraints (e.g., seconds, tokens). |
| `tools` | array of Tool | List of tools available for the agent to use. Each tool includes a tool_spec with type, name, description, and input schema. Tools may have a corresponding configuration in tool_resources. |
| `tool_resources` | map of ToolResource | Configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

**Example**

```json
{
  "thread_id": 0,
  "parent_message_id": 0,
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What is the total revenue for 2023?"
        }
      ]
    }
  ],
  "stream": false,
  "tool_choice": {
    "type": "auto",
    "name": [
      "analyst_tool",
      "search_tool"
    ]
  },
  "models": {
    "orchestration": "claude-4-sonnet"
  },
  "instructions": {
    "response": "You will respond in a friendly but concise manner",
    "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
    "system": "You are a friendly agent ..."
  },
  "orchestration": {
    "budget": {
      "seconds": 30,
      "tokens": 16000
    }
  },
  "tools": [
    {
      "tool_spec": {
        "type": "generic",
        "name": "get_revenue",
        "description": "Fetch the delivery revenue for a location.",
        "input_schema": {
          "type": "object",
          "properties": {
            "location": {
              "type": "string",
              "description": "The city and state, e.g. San Francisco, CA"
            }
          }
        },
        "required": [
          "location"
        ]
      }
    }
  ],
  "tool_resources": {
    "get_revenue": {
      "type": "function",
      "execution_environment": {
        "type": "warehouse",
        "warehouse": "MY_WH"
      },
      "identifier": "DB.SCHEMA.UDF"
    }
  }
}
```

The request body supports an optional `stream` boolean field:

* If `stream` is omitted, it defaults to `true` and the response is streamed as SSE events.
* If `stream` is `false`, the API returns a single JSON object (see Non-streaming response (stream: false)).

## Streaming responses

The `agent:run` API provides streaming responses. The server streams back events. This allows you to display responses in your application, token-by-token, as they are generated by the Agent.
Each event streamed in the API response has a strictly typed schema. You can find a list of all of the events in the following section and select to which ones you’d like to subscribe.

The last event sent by the API is a `response` event. This event contains the entire agent output. You can use this as
the agent’s final response. For any non-streaming clients, you can subscribe to this event because it is the logical aggregation of all prior events. If you don’t want to use streaming responses, wait for the `response` event and ignore all prior events.

The majority of the other events streamed can be split into two categories: `Delta` and `Content Items`.

`Delta` events represent a single token generated by the Agent. By listening to these events, you can create
a typewriter effect. The main delta events are `response.thinking.delta`, which
represents a reasoning token, and `response.text.delta`, which represent an answer token.

`Content Item` events represent elements from the `content` array in the final agent response.

> **Note:**
>
> Make sure your application can handle unknown event types.

**Example Response**

```none
event: response.status
data: {"message":"Planning the next steps","status":"planning"}

event: response.thinking.delta
data: {"content_index":0,"text":"\nThe user is asking for a"}

event: response.thinking.delta
data: {"content_index":0,"text":" chart showing the"}

...
...
...

event: response.status
data: {"message":"Reviewing the results","status":"reasoning_agent_stop"}

event: response.status
data: {"message":"Forming the answer","status":"proceeding_to_answer"}
```

### `response`

Event streamed when the final response is available. This is the last event emitted, it represents the aggregation of all other events previously streamed.

| Field | Type | Description |
| --- | --- | --- |
| `role` | string | The role for the message. Always `assistant` in the API response. |
| `content` | array of MessageContentItem | The content generated by the agent. |
| `warnings` | array of Warning | Non-fatal warnings that occurred during processing. Present for non-streaming clients or as a summary. |
| `metadata` | ResponseMetadata |  |

**Example**

```json
{
  "role": "assistant",
  "content": [
    {
      "type": "chart",
      "chart": {
        "tool_use_id": "toolu_123",
        "chart_spec": "{\"$schema\":\"https://vega.github.io/schema/vega-lite/v5.json\",\"data\":{...},\"mark\":\"bar\"}"
      }
    }
  ],
  "warnings": [
    {
      "message": "Unable to fetch tools from MCP server 'foo'. Response quality may be degraded."
    }
  ],
  "metadata": {
    "usage": {
      "tokens_consumed": [
        {
          "model_name": "llama3.1-70b",
          "input_tokens": {
            "total": 175,
            "cache_read": 50,
            "cache_write": 25,
            "uncached": 100
          },
          "output_tokens": {
            "total": 75
          },
          "context_window": 128000
        }
      ]
    },
    "run_id": "123-456"
  }
}
```

### `response.text`

An event streamed when a text content block is done streaming, including all the aggregated deltas for a particular content index.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `text` | string | A text result from the agent |
| `annotations` | array of Annotation | Any annotations attached to the text result (e.g. citations) |
| `is_elicitation` | boolean | Whether this text content is the agent asking for more information from the end user. |

**Example**

```json
{
  "content_index": 0,
  "text": "Lorem ipsum dolor...",
  "annotations": [
    {
      "type": "cortex_search_citation",
      "index": 0,
      "search_result_id": "cs_61987ff6-6d56-4695-83c0-1e7cfed818c7",
      "doc_id": "4ac085cb-82d0-4eb4-94f3-2672aa0599a2",
      "doc_title": "Earnings Report",
      "text": "The revenue for 2025 was..."
    }
  ],
  "is_elicitation": false
}
```

### `response.text.delta`

Event streamed when a new output text delta is generated.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `text` | string | The text delta |
| `is_elicitation` | boolean | Whether this text content is the agent asking for more information from the end user. |

**Example**

```json
{
  "content_index": 0,
  "text": "Hello",
  "is_elicitation": false
}
```

### `response.text.annotation`

Event streamed when an annotation is added to a text content.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `annotation_index` | integer | The index in the annotation array this `annotation` belongs to. |
| `annotation` | Annotation | The annotation object being added. |

**Example**

```json
{
  "content_index": 0,
  "annotation_index": 0,
  "annotation": {
    "type": "cortex_search_citation",
    "index": 0,
    "search_result_id": "cs_61987ff6-6d56-4695-83c0-1e7cfed818c7",
    "doc_id": "4ac085cb-82d0-4eb4-94f3-2672aa0599a2",
    "doc_title": "Earnings Report",
    "text": "The revenue for 2025 was..."
  }
}
```

### `response.thinking`

An event streamed when a thinking content block is done streaming, including all the aggregated deltas for a particular content index.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `text` | string | Thinking tokens from the agent |
| `signature` | string | The signature of the thinking token |

**Example**

```json
{
  "content_index": 0,
  "text": "To answer your question I must...",
  "signature": "lorem ipsum"
}
```

### `response.thinking.delta`

Event streamed when a thinking delta is generated.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `text` | string | The thinking token |
| `signature` | string | The signature of the thinking token |

**Example**

```json
{
  "content_index": 0,
  "text": "lorem ipsum",
  "signature": "lorem ipsum"
}
```

### `response.tool_use`

An event streamed when the agent requests a tool use.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `tool_use_id` | string | Unique identifier for this tool use. Can be used to associated tool results. |
| `type` | string | The type of the tool (e.g. cortex_search, cortex_analyst_text_to_sql) |
| `name` | string | The unique identifier for this tool instance |
| `input` | object | The structured input for this tool. The schema of this object should will vary depending on the tool spec. |
| `client_side_execute` | boolean | Whether the tool use is executed on the client side. |

**Example**

```json
{
  "content_index": 0,
  "tool_use_id": "toolu_123",
  "type": "cortex_analyst_text_to_sql",
  "name": "my_cortex_analyst_semantic_view",
  "input": {
    "location": "San Francisco, CA"
  },
  "client_side_execute": "true"
}
```

### `response.tool_result`

Event streamed when a tool finishes executing, including the tool result.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `tool_use_id` | string | Unique identifier for this tool use. Can be used to associated tool results. |
| `type` | string | The type of the tool (e.g. cortex_search, cortex_analyst_text_to_sql) |
| `name` | string | The unique identifier for this tool instance |
| `content` | array of ToolResultContent | The content on the tool result |
| `status` | string | The status of tool execution |

**Example**

```json
{
  "content_index": 0,
  "tool_use_id": "toolu_123",
  "type": "cortex_analyst_text_to_sql",
  "name": "my_cortex_analyst_semantic_view",
  "content": [
    {
      "type": "json",
      "json": {
        "answer": 42
      }
    }
  ],
  "status": "success"
}
```

### `response.tool_result.status`

Status update for a specific tool use.

| Field | Type | Description |
| --- | --- | --- |
| `tool_use_id` | string | Unique identifier for this tool use. |
| `tool_type` | string | The type of the tool (e.g. cortex_search, cortex_analyst_text_to_sql) |
| `status` | string | Enum for the current state. |
| `message` | string | A more descriptive message expanding on the current status. |
| `details` | object | Tool-specific status details. |

**Example**

```json
{
  "tool_use_id": "toolu_123",
  "tool_type": "cortex_analyst_text_to_sql",
  "status": "Executing SQL",
  "message": "Executing query 'SELECT * FROM my_table'",
  "details": {}
}
```

### `response.tool_result.analyst.delta`

An delta event streamed for the Cortex Analyst tool execution

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `tool_use_id` | string | Unique identifier for this tool use. Can be used to associated tool results. |
| `tool_type` | string | The type of the tool (always cortex_analyst_text_to_sql for this event) |
| `tool_name` | string | The unique identifier for this tool instance |
| `delta` | CortexAnalystToolResultDelta | The content delta |

**Example**

```json
{
  "content_index": 0,
  "tool_use_id": "toolu_123",
  "tool_type": "cortex_analyst_text_to_sql",
  "tool_name": "my_cortex_analyst_semantic_view",
  "delta": {
    "text": "The...",
    "think": "Thinking...",
    "sql": "SELECT...",
    "sql_explanation": "This...",
    "query_id": "707787a0-a684-4ead-adb0-3c3b62b043d9",
    "verified_query_used": false,
    "result_set": {
      "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
      "resultSetMetaData": {
        "partition": 0,
        "numRows": 0,
        "format": "jsonv2",
        "rowType": [
          {
            "name": "my_column",
            "type": "VARCHAR",
            "length": 0,
            "precision": 0,
            "scale": 0,
            "nullable": false
          }
        ]
      },
      "data": [
        [
          "row1 col1",
          "row1 col2"
        ],
        [
          "row2 col1",
          "row2 col2"
        ]
      ]
    },
    "suggestions": {
      "index": 0,
      "delta": "What..."
    }
  }
}
```

### `response.table`

An event streamed when a table content block is added.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `tool_use_id` | string | The ID of the tool use that generated this table |
| `query_id` | string | The query id of the sql query that generated this data |
| `result_set` | ResultSet | The SQL results to render a table. Matches the schema from Snowflake’s SQL API ResultSet (<https://docs.snowflake.com/en/developer-guide/sql-api/reference#resultset>) |
| `title` | string | The title for this table |

**Example**

```json
{
  "content_index": 0,
  "tool_use_id": "toolu_123",
  "query_id": "6ac75378-6337-48a6-80ab-6de48dd680eb",
  "result_set": {
    "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
    "resultSetMetaData": {
      "partition": 0,
      "numRows": 0,
      "format": "jsonv2",
      "rowType": [
        {
          "name": "my_column",
          "type": "VARCHAR",
          "length": 0,
          "precision": 0,
          "scale": 0,
          "nullable": false
        }
      ]
    },
    "data": [
      [
        "row1 col1",
        "row1 col2"
      ],
      [
        "row2 col1",
        "row2 col2"
      ]
    ]
  },
  "title": "Revenue by Month"
}
```

### `response.chart`

An event streamed when a chart content block is added.

| Field | Type | Description |
| --- | --- | --- |
| `content_index` | integer | The index in the response content array this event represents |
| `tool_use_id` | string | The ID of the tool use that generated this chart |
| `chart_spec` | string | The vega-lite chart specification serialized as a string |

**Example**

```json
{
  "content_index": 0,
  "tool_use_id": "toolu_123",
  "chart_spec": "{\"$schema\":\"https://vega.github.io/schema/vega-lite/v5.json\",\"data\":{...},\"mark\":\"bar\"}"
}
```

### `response.status`

Status update for the agent execution.

| Field | Type | Description |
| --- | --- | --- |
| `status` | string | Enum for the current state. |
| `message` | string | A more descriptive message expanding on the current status. |

**Example**

```json
{
  "status": "executing_tool",
  "message": "Executing tool `my_analyst_tool`"
}
```

### `response.warning`

Sent when a non-fatal warning occurs. The stream continues after this event.

| Field | Type | Description |
| --- | --- | --- |
| `message` | string | The warning message to display to the user. |

**Example**

```json
{
  "message": "Unable to fetch tools from MCP server 'foo'. Response quality may be degraded."
}
```

### `error`

Sent when a fatal error is encountered.

| Field | Type | Description |
| --- | --- | --- |
| `code` | string | The Snowflake error code |
| `message` | string | The error message |
| `request_id` | string | The unique identifier for this request |

**Example**

```json
{
  "code": "399504",
  "message": "Error during execution",
  "request_id": "61987ff6-6d56-4695-83c0-1e7cfed818c7"
}
```

### `metadata`

Metadata about the request. This event is sent when a message is added to the thread. It is useful for getting the `parent_message_id` to use in following requests to the Agents API.

| Field | Type | Description |
| --- | --- | --- |
| `metadata` | Metadata |  |

**Example**

```json
{
  "metadata": {
    "role": "user",
    "message_id": 0,
    "run_id": "123-456"
  }
}
```

## Schemas

### `AgentInstructions`

| Field | Type | Description |
| --- | --- | --- |
| `response` | string | Instructions for response generation. |
| `orchestration` | string | These custom instructions are used when the agent is planning which tools to use. |
| `system` | string | System instructions for the agent. |

**Example**

```json
{
  "response": "You will respond in a friendly but concise manner",
  "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
  "system": "You are a friendly agent ..."
}
```

### `Annotation`

> cortex_search_citation
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The citation type (always `cortex_search_citation`) |
> | `index` | integer | The index of the citation in the search results. |
> | `search_result_id` | string | The unique identifier for the search result. |
> | `doc_id` | string | The unique identifier for the document. |
> | `doc_title` | string | The title of the document. |
> | `text` | string | The text excerpt from the document used as the citation. |
>
> **Example**
>
> ```json
> {
>   "type": "cortex_search_citation",
>   "index": 0,
>   "search_result_id": "cs_61987ff6-6d56-4695-83c0-1e7cfed818c7",
>   "doc_id": "4ac085cb-82d0-4eb4-94f3-2672aa0599a2",
>   "doc_title": "Earnings Report",
>   "text": "The revenue for 2025 was..."
> }
> ```

### `BudgetConfig`

| Field | Type | Description |
| --- | --- | --- |
| `seconds` | integer | Time budget in seconds. |
| `tokens` | integer | Token budget. |

**Example**

```json
{
  "seconds": 30,
  "tokens": 16000
}
```

### `ChartContent`

| Field | Type | Description |
| --- | --- | --- |
| `tool_use_id` | string | The ID of the tool use that generated this chart |
| `chart_spec` | string | The vega-lite chart specification serialized as a string |

**Example**

```json
{
  "tool_use_id": "toolu_123",
  "chart_spec": "{\"$schema\":\"https://vega.github.io/schema/vega-lite/v5.json\",\"data\":{...},\"mark\":\"bar\"}"
}
```

### `CortexAnalystSuggestionDelta`

| Field | Type | Description |
| --- | --- | --- |
| `index` | integer | The index of the suggestion array this delta represents |
| `delta` | string | The text delta for the suggestion in this index |

**Example**

```json
{
  "index": 0,
  "delta": "What..."
}
```

### `CortexAnalystToolResultDelta`

| Field | Type | Description |
| --- | --- | --- |
| `text` | string | A text delta from Cortex Analyst’s final response. |
| `think` | string | A text delta from Cortex Analyst’s reasoning steps. |
| `sql` | string | A delta from Cortex Analyst’s SQL output. Currently, the entire SQL query comes in a single event but we may stream the SQL token-by-token in the future. |
| `sql_explanation` | string | A delta from Cortex Analyst’s explanation of what the SQL query does |
| `query_id` | string | The query id once SQL execution begins |
| `verified_query_used` | boolean | Whether a verified query was used to generate this response |
| `result_set` | ResultSet | The results from SQL execution. Matches the schema from Snowflake’s SQL API ResultSet (<https://docs.snowflake.com/en/developer-guide/sql-api/reference#resultset>) |
| `suggestions` | CortexAnalystSuggestionDelta | A delta from Cortex Analyst’s suggested questions. This is sent when Analyst cannot answer the question due to missing information or other failures. |

**Example**

```json
{
  "text": "The...",
  "think": "Thinking...",
  "sql": "SELECT...",
  "sql_explanation": "This...",
  "query_id": "707787a0-a684-4ead-adb0-3c3b62b043d9",
  "verified_query_used": false,
  "result_set": {
    "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
    "resultSetMetaData": {
      "partition": 0,
      "numRows": 0,
      "format": "jsonv2",
      "rowType": [
        {
          "name": "my_column",
          "type": "VARCHAR",
          "length": 0,
          "precision": 0,
          "scale": 0,
          "nullable": false
        }
      ]
    },
    "data": [
      [
        "row1 col1",
        "row1 col2"
      ],
      [
        "row2 col1",
        "row2 col2"
      ]
    ]
  },
  "suggestions": {
    "index": 0,
    "delta": "What..."
  }
}
```

### `ExecutionEnvironment`

Configuration for server-executed tools.

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of execution environment, currently only `warehouse` is supported. |
| `warehouse` | string | The name of the warehouse. Case-sensitive, if it is an unquoted identifier, provide the name in all-caps. |
| `query_timeout` | integer | The query timeout in seconds |

**Example**

```json
{
  "type": "warehouse",
  "warehouse": "MY_WAREHOUSE",
  "query_timeout": 60
}
```

### `InputTokens`

Input token breakdown by cache usage.

| Field | Type | Description |
| --- | --- | --- |
| `total` | integer | Total input tokens processed (including cached tokens). |
| `cache_read` | integer | Input tokens read from cache. |
| `cache_write` | integer | Input tokens written to cache. |
| `uncached` | integer | Input tokens that were not cached. |

**Example**

```json
{
  "total": 175,
  "cache_read": 50,
  "cache_write": 25,
  "uncached": 100
}
```

### `Message`

Represents a single message in the conversation. Can be either from the user or the assistant.

| Field | Type | Description |
| --- | --- | --- |
| `role` | string | Identifies who sent the message - either the user or the assistant. User messages typically contain queries, while assistant messages contain responses and tool results. |
| `content` | array of MessageContentItem | Array of content elements making up the message. Can include text, tool results, or custom content types. |

**Example**

```json
{
  "role": "user",
  "content": [
    {
      "type": "text",
      "text": "What is the total revenue for 2023?"
    }
  ]
}
```

### `MessageContentItem`

> charttabletextthinkingtool_resulttool_use
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The content type (always `chart`). |
> | `chart` | ChartContent | The chart. |
>
> **Example**
>
> ```json
> {
>   "type": "chart",
>   "chart": {
>     "tool_use_id": "toolu_123",
>     "chart_spec": "{\"$schema\":\"https://vega.github.io/schema/vega-lite/v5.json\",\"data\":{...},\"mark\":\"bar\"}"
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The content type (always `table`). |
> | `table` | TableContent | The table. |
>
> **Example**
>
> ```json
> {
>   "type": "table",
>   "table": {
>     "tool_use_id": "toolu_123",
>     "query_id": "6ac75378-6337-48a6-80ab-6de48dd680eb",
>     "result_set": {
>       "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
>       "resultSetMetaData": {
>         "partition": 0,
>         "numRows": 0,
>         "format": "jsonv2",
>         "rowType": [
>           {
>             "name": "my_column",
>             "type": "VARCHAR",
>             "length": 0,
>             "precision": 0,
>             "scale": 0,
>             "nullable": false
>           }
>         ]
>       },
>       "data": [
>         [
>           "row1 col1",
>           "row1 col2"
>         ],
>         [
>           "row2 col1",
>           "row2 col2"
>         ]
>       ]
>     },
>     "title": "Revenue by Month"
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `text` | string | A text result from the agent |
> | `annotations` | array of Annotation | Any annotations attached to the text result (e.g. citations) |
> | `is_elicitation` | boolean | Whether this text content is the agent asking for more information from the end user. |
> | `type` | string | The content type (always `text`). |
>
> **Example**
>
> ```json
> {
>   "text": "Lorem ipsum dolor...",
>   "annotations": [
>     {
>       "type": "cortex_search_citation",
>       "index": 0,
>       "search_result_id": "cs_61987ff6-6d56-4695-83c0-1e7cfed818c7",
>       "doc_id": "4ac085cb-82d0-4eb4-94f3-2672aa0599a2",
>       "doc_title": "Earnings Report",
>       "text": "The revenue for 2025 was..."
>     }
>   ],
>   "is_elicitation": false,
>   "type": "text"
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The content type (always `thinking`). |
> | `thinking` | ThinkingContent | The thinking content. |
>
> **Example**
>
> ```json
> {
>   "type": "thinking",
>   "thinking": {
>     "text": "To answer your question I must...",
>     "signature": "lorem ipsum"
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The content type (always `tool_result`). |
> | `tool_result` | ToolResult | The tool result. |
>
> **Example**
>
> ```json
> {
>   "type": "tool_result",
>   "tool_result": {
>     "tool_use_id": "toolu_123",
>     "type": "cortex_analyst_text_to_sql",
>     "name": "my_cortex_analyst_semantic_view",
>     "content": [
>       {
>         "type": "json",
>         "json": {
>           "answer": 42
>         }
>       }
>     ],
>     "status": "success"
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The content type (always `tool_use`). |
> | `tool_use` | ToolUse | The tool use. |
>
> **Example**
>
> ```json
> {
>   "type": "tool_use",
>   "tool_use": {
>     "tool_use_id": "toolu_123",
>     "type": "cortex_analyst_text_to_sql",
>     "name": "my_cortex_analyst_semantic_view",
>     "input": {
>       "location": "San Francisco, CA"
>     },
>     "client_side_execute": "true"
>   }
> }
> ```

### `Metadata`

| Field | Type | Description |
| --- | --- | --- |
| `role` | string | Identifies who sent the message - either the user or the assistant. |
| `message_id` | integer | The thread message id. Use this ID (when role is `assistant`) to ask a followup question on the thread. |
| `run_id` | string | The unique identifier for this Agent Run. Can be used to reconnect to the output stream. |

**Example**

```json
{
  "role": "user",
  "message_id": 0,
  "run_id": "123-456"
}
```

### `ModelConfig`

| Field | Type | Description |
| --- | --- | --- |
| `orchestration` | string | Model to use for orchestration. If not provided, a model is automatically selected. |

**Example**

```json
{
  "orchestration": "claude-4-sonnet"
}
```

### `OrchestrationConfig`

| Field | Type | Description |
| --- | --- | --- |
| `budget` | BudgetConfig | Budget constraints for the agent. If more than one constraint is specified, whichever is first hit will end the request. |

**Example**

```json
{
  "budget": {
    "seconds": 30,
    "tokens": 16000
  }
}
```

### `OutputTokens`

Output token details.

| Field | Type | Description |
| --- | --- | --- |
| `total` | integer | Total output tokens generated. |

**Example**

```json
{
  "total": 75
}
```

### `ResponseMetadata`

Metadata about the response, including usage information.

| Field | Type | Description |
| --- | --- | --- |
| `usage` | UsageMetadata |  |
| `run_id` | string | The unique identifier for this Agent Run. Can be used to reconnect to the output stream. |

**Example**

```json
{
  "usage": {
    "tokens_consumed": [
      {
        "model_name": "llama3.1-70b",
        "input_tokens": {
          "total": 175,
          "cache_read": 50,
          "cache_write": 25,
          "uncached": 100
        },
        "output_tokens": {
          "total": 75
        },
        "context_window": 128000
      }
    ]
  },
  "run_id": "123-456"
}
```

### `ResultSet`

| Field | Type | Description |
| --- | --- | --- |
| `statementHandle` | string | The query id. |
| `resultSetMetaData` | ResultSetMetaData | Metadata on the result set. |
| `data` | array of array | 2D array representing the data |

**Example**

```json
{
  "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
  "resultSetMetaData": {
    "partition": 0,
    "numRows": 0,
    "format": "jsonv2",
    "rowType": [
      {
        "name": "my_column",
        "type": "VARCHAR",
        "length": 0,
        "precision": 0,
        "scale": 0,
        "nullable": false
      }
    ]
  },
  "data": [
    [
      "row1 col1",
      "row1 col2"
    ],
    [
      "row2 col1",
      "row2 col2"
    ]
  ]
}
```

### `ResultSetMetaData`

| Field | Type | Description |
| --- | --- | --- |
| `partition` | integer | The index number of the partition. |
| `numRows` | integer | The total number of rows of results. |
| `format` | string | Format of the data in the result set. |
| `rowType` | array of RowType | Description of the columns in the result. |

**Example**

```json
{
  "partition": 0,
  "numRows": 0,
  "format": "jsonv2",
  "rowType": [
    {
      "name": "my_column",
      "type": "VARCHAR",
      "length": 0,
      "precision": 0,
      "scale": 0,
      "nullable": false
    }
  ]
}
```

### `RowType`

| Field | Type | Description |
| --- | --- | --- |
| `name` | string | Name of the column. |
| `type` | string | Snowflake data type of the column. (<https://docs.snowflake.com/en/sql-reference/intro-summary-data-types>) |
| `length` | integer | Length of the column. |
| `precision` | integer | Precision of the column. |
| `scale` | integer | Scale of the column. |
| `nullable` | boolean | Specifies whether or not the column is nullable. |

**Example**

```json
{
  "name": "my_column",
  "type": "VARCHAR",
  "length": 0,
  "precision": 0,
  "scale": 0,
  "nullable": false
}
```

### `TableContent`

| Field | Type | Description |
| --- | --- | --- |
| `tool_use_id` | string | The ID of the tool use that generated this table |
| `query_id` | string | The query id of the sql query that generated this data |
| `result_set` | ResultSet | The SQL results to render a table. Matches the schema from Snowflake’s SQL API ResultSet (<https://docs.snowflake.com/en/developer-guide/sql-api/reference#resultset>) |
| `title` | string | The title for this table |

**Example**

```json
{
  "tool_use_id": "toolu_123",
  "query_id": "6ac75378-6337-48a6-80ab-6de48dd680eb",
  "result_set": {
    "statementHandle": "707787a0-a684-4ead-adb0-3c3b62b043d9",
    "resultSetMetaData": {
      "partition": 0,
      "numRows": 0,
      "format": "jsonv2",
      "rowType": [
        {
          "name": "my_column",
          "type": "VARCHAR",
          "length": 0,
          "precision": 0,
          "scale": 0,
          "nullable": false
        }
      ]
    },
    "data": [
      [
        "row1 col1",
        "row1 col2"
      ],
      [
        "row2 col1",
        "row2 col2"
      ]
    ]
  },
  "title": "Revenue by Month"
}
```

### `ThinkingContent`

| Field | Type | Description |
| --- | --- | --- |
| `text` | string | Thinking tokens from the agent |
| `signature` | string | The signature of the thinking token |

**Example**

```json
{
  "text": "To answer your question I must...",
  "signature": "lorem ipsum"
}
```

### `TokensConsumed`

Token consumption for a specific model.

| Field | Type | Description |
| --- | --- | --- |
| `model_name` | string | Name of the model used. |
| `input_tokens` | InputTokens |  |
| `output_tokens` | OutputTokens |  |
| `context_window` | integer | The model’s context window size (in tokens). |

**Example**

```json
{
  "model_name": "llama3.1-70b",
  "input_tokens": {
    "total": 175,
    "cache_read": 50,
    "cache_write": 25,
    "uncached": 100
  },
  "output_tokens": {
    "total": 75
  },
  "context_window": 128000
}
```

### `Tool`

Defines a tool that can be used by the agent. Tools provide specific capabilities like data analysis, search, or generic functions.

| Field | Type | Description |
| --- | --- | --- |
| `tool_spec` | ToolSpec | Specification of the tool’s type, configuration, and input requirements. |

**Example**

```json
{
  "tool_spec": {
    "type": "generic",
    "name": "get_revenue",
    "description": "Fetch the delivery revenue for a location.",
    "input_schema": {
      "type": "object",
      "properties": {
        "location": {
          "type": "string",
          "description": "The city and state, e.g. San Francisco, CA"
        }
      }
    },
    "required": [
      "location"
    ]
  }
}
```

### `ToolChoice`

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | Determines how tools are selected: - auto - Automatic tool selection (default) - required - Must use at least one tool - tool - Use specific named tools |
| `name` | array of string | List of specific tool names to use when type is ‘tool’. |

**Example**

```json
{
  "type": "auto",
  "name": [
    "analyst_tool",
    "search_tool"
  ]
}
```

### `ToolInputSchema`

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of the input schema object. |
| `description` | string | A description of what the input is. |
| `properties` | map of ToolInputSchema | If type is `object`, definitions of each input parameter. |
| `items` | ToolInputSchema | If type is `array`, the schema for the elements of the array. |
| `required` | array of string | If type is `object`, list of required input parameter names. |

**Example**

```json
{
  "type": "object",
  "description": "Input for my custom tool",
  "properties": {
    "location": {
      "type": "string",
      "description": "The city and state, e.g. San Francisco, CA"
    }
  },
  "items": {},
  "required": [
    "location"
  ]
}
```

### `ToolResource`

> cortex_analyst_text_to_sqlcortex_searchgenericweb_search
>
> Configuration for text-to-SQL analysis tool. Provides parameters for SQL query generation and execution. Exactly one of semantic_model_file or semantic_view must be provided.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `semantic_model_file` | string | The path to a file stored in a Snowflake Stage holding the semantic model yaml. |
> | `semantic_view` | string | The name of the Snowflake native semantic model object. |
> | `execution_environment` | ExecutionEnvironment | Configuration for how to execute the generated SQL query. |
>
> **Example**
>
> ```json
> {
>   "semantic_model_file": "@db.schema.stage/semantic_model.yaml",
>   "semantic_view": "db.schema.semantic_view",
>   "execution_environment": {
>     "type": "warehouse",
>     "warehouse": "MY_WAREHOUSE",
>     "query_timeout": 60
>   }
> }
> ```
>
> Configuration for search functionality. Defines how document search and retrieval should be performed.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `search_service` | string | The fully qualified name of the search service. |
> | `title_column` | string | The title column of the document. |
> | `id_column` | string | The ID column of the document. |
> | `filter` | object | Filter query for search results. |
>
> **Example**
>
> ```json
> {
>   "search_service": "database.schema.service_name",
>   "title_column": "account_name",
>   "id_column": "account_id",
>   "filter": {
>     "@eq": {
>       "<column>": "<value>"
>     }
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | If the tool is server-side executed, whether it is a Stored Procedure or a UDF. |
> | `execution_environment` | ExecutionEnvironment |  |
> | `identifier` | string | Fully qualified name of the Stored Procedure or UDF. |
>
> **Example**
>
> ```json
> {
>   "type": "function",
>   "execution_environment": {
>     "type": "warehouse",
>     "warehouse": "MY_WAREHOUSE",
>     "query_timeout": 60
>   },
>   "identifier": "MY_DB.MY_SCHEMA.MY_UDF"
> }
> ```
>
> Configuration for web search functionality.
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `max_results` | integer | Max web search results returned. |
>
> **Example**
>
> ```json
> {
>   "max_results": 20
> }
> ```

### `ToolResult`

| Field | Type | Description |
| --- | --- | --- |
| `tool_use_id` | string | Unique identifier for this tool use. Can be used to associated tool results. |
| `type` | string | The type of the tool (e.g. cortex_search, cortex_analyst_text_to_sql) |
| `name` | string | The unique identifier for this tool instance |
| `content` | array of ToolResultContent | The content on the tool result |
| `status` | string | The status of tool execution |

**Example**

```json
{
  "tool_use_id": "toolu_123",
  "type": "cortex_analyst_text_to_sql",
  "name": "my_cortex_analyst_semantic_view",
  "content": [
    {
      "type": "json",
      "json": {
        "answer": 42
      }
    }
  ],
  "status": "success"
}
```

### `ToolResultContent`

> jsontext
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The type of result (always `json`) |
> | `json` | object | Structured output from a tool. The schema varies depending on the tool type. |
>
> **Example**
>
> ```json
> {
>   "type": "json",
>   "json": {
>     "answer": 42
>   }
> }
> ```
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `type` | string | The type of result (always `text`) |
> | `text` | string | The result text |
>
> **Example**
>
> ```json
> {
>   "type": "text",
>   "text": "The answer is 42"
> }
> ```

### `ToolSpec`

Specification of the tool’s type, configuration, and input requirements.

| Field | Type | Description |
| --- | --- | --- |
| `type` | string | The type of tool capability. Can be specialized types like ‘cortex_analyst_text_to_sql’ or ‘generic’ for general-purpose tools. |
| `name` | string | Unique identifier for referencing this tool instance. Used to match with configuration in tool_resources. |
| `description` | string | Description of the tool to be considered for tool use. |
| `input_schema` | ToolInputSchema | JSON Schema definition of the expected input parameters for this tool. This will be fed to the agent so it knows the structure it should follow for when generating the input for ToolUses. Required for generic tools to specify their input parameters. |

**Example**

```json
{
  "type": "generic",
  "name": "get_weather",
  "description": "lorem ipsum",
  "input_schema": {
    "type": "object",
    "properties": {
      "location": {
        "type": "string",
        "description": "The city and state, e.g. San Francisco, CA"
      }
    },
    "required": [
      "location"
    ]
  }
}
```

### `ToolUse`

| Field | Type | Description |
| --- | --- | --- |
| `tool_use_id` | string | Unique identifier for this tool use. Can be used to associated tool results. |
| `type` | string | The type of the tool (e.g. cortex_search, cortex_analyst_text_to_sql) |
| `name` | string | The unique identifier for this tool instance |
| `input` | object | The structured input for this tool. The schema of this object should will vary depending on the tool spec. |
| `client_side_execute` | boolean | Whether the tool use is executed on the client side. |

**Example**

```json
{
  "tool_use_id": "toolu_123",
  "type": "cortex_analyst_text_to_sql",
  "name": "my_cortex_analyst_semantic_view",
  "input": {
    "location": "San Francisco, CA"
  },
  "client_side_execute": "true"
}
```

### `UsageMetadata`

Token usage information for this request.

| Field | Type | Description |
| --- | --- | --- |
| `tokens_consumed` | array of TokensConsumed | Token consumption details per model used in this request. |

**Example**

```json
{
  "tokens_consumed": [
    {
      "model_name": "llama3.1-70b",
      "input_tokens": {
        "total": 175,
        "cache_read": 50,
        "cache_write": 25,
        "uncached": 100
      },
      "output_tokens": {
        "total": 75
      },
      "context_window": 128000
    }
  ]
}
```

### `Warning`

| Field | Type | Description |
| --- | --- | --- |
| `message` | string | The warning message to display to the user. |

**Example**

```json
{
  "message": "Unable to fetch tools from MCP server 'foo'. Response quality may be degraded."
}
```

## Non-streaming response (stream: false)

To receive a **single non-streaming JSON response**, set `stream` to `false` in the request body and set the request `Accept` header to `application/json`.

The response body is the same object as the `response` event payload in streaming mode (that is, it corresponds to the JSON returned in the SSE `response` event’s `data` field).

**Example response**

```json
{
  "role": "assistant",
  "content": [
    {
      "thinking": {
        "text": "\nThe user is asking about types of products...\n"
      },
      "type": "thinking"
    },
    {
      "tool_use": {
        "client_side_execute": false,
        "input": {
          "has_time_column": false,
          "need_future_forecasting_data": false,
          "original_query": "what are some types of products?",
          "previous_related_tool_result_id": "",
          "query": "What are the different types or categories of products?"
        },
        "name": "semantic_view_a",
        "tool_use_id": "<tool_use_id>",
        "type": "cortex_analyst_text_to_sql"
      },
      "type": "tool_use"
    },
    {
      "tool_result": {
        "content": [
          {
            "json": {
              "query_id": "<query_id>",
              "result_set": {
                "data": [
                  ["Electronics", "3", "3"],
                  ["Furniture", "2", "2"]
                ],
                "resultSetMetaData": {
                  "format": "jsonv2",
                  "numRows": 2,
                  "partition": 0
                },
                "statementHandle": "<statement_handle>"
              },
              "sql": "WITH __table_a AS (...) SELECT ...",
              "text": "The question is clear and I can answer it with the following SQL."
            },
            "type": "json"
          }
        ],
        "name": "semantic_view_a",
        "status": "success",
        "tool_use_id": "<tool_use_id>",
        "type": "cortex_analyst_text_to_sql"
      },
      "type": "tool_result"
    },
    {
      "text": "Based on the data available, there are 2 main types of products...",
      "type": "text"
    }
  ]
}
```

---
title: Cortex Agents tutorials
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-tutorials.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Agents tutorials

Use Cortex Agents to get insights from both structured and unstructured data sources. You can use the following tutorials to help you get started with Cortex Agents:

* [Getting Started with Cortex Agents](https://quickstarts.snowflake.com/guide/getting_started_with_cortex_agents/index.html?index=../..index#0)
* [Getting Started with Snowflake Cortex Agents API and React](https://quickstarts.snowflake.com/guide/getting_started_with_snowflake_agents_api_and_react/index.html?index=../..index#0)
* [Getting Started with Cortex Agents and Slack](https://quickstarts.snowflake.com/guide/integrate_snowflake_cortex_agents_with_slack/index.html#0)
* [Getting Started with Cortex Agents for Microsoft Teams and Microsoft 365 Copilot](https://quickstarts.snowflake.com/guide/getting_started_with_the_microsoft_teams_and_365_copilot_cortex_app)
* [Best Practices to Building Cortex Agents](https://www.snowflake.com/en/developers/guides/best-practices-to-building-cortex-agents)

For more information about Cortex Agents, see [Cortex Agents](cortex-agents.md).

---
title: Cortex AI Functions: Audio
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-audio.md
section: Snowflake Cortex (AI & ML)
---

# Cortex AI Functions: Audio

Cortex AI Audio provides advanced LLM-powered audio processing capabilities, including:

* **Transcription:** Convert spoken language to text.
* **Speaker identification:** Determine who is speaking in each part of a multi-speaker audio file.
* **Timestamp extraction:** Identify the timestamp of each spoken word.

These capabilities are available through the AI_TRANSCRIBE function. Because AI_TRANSCRIBE
is managed and hosted inside Snowflake, you can easily integrate audio processing into your data workflows without
onerous setup or infrastructure management.

> **Note:**
>
> The AI_TRANSCRIBE function also processes audio tracks in video files.

## AI_TRANSCRIBE

[AI_TRANSCRIBE](../../sql-reference/functions/ai_transcribe.md) is a fully managed SQL function that transcribes audio and video files
stored in a stage, extracting text, timestamps, and speaker information. See [Create stage for media files](aisql.md) for
information on creating a stage suitable for storing files for processing by AI_TRANSCRIBE.

Under the hood, AI_TRANSCRIBE orchestrates optimized AI models for transcription and speaker
diarization, processing audio files of up to two hours in length. AI_TRANSCRIBE is horizontally scalable, allowing
efficient batch processing by processing multiple files at the same time. Audio can be processed directly from
object storage to avoid unnecessary data movement.

By default, AI_TRANSCRIBE converts audio files to clean, readable text. You can also specify a timestamp granularity to
extract timestamps for each word or change of speaker. Word-level timestamps are useful for applications such as subtitles
or for letting the user to jump to specific parts of the audio by clicking words in the transcript. Speaker-level timestamps
are useful for understanding who said what in meetings, interviews, or phone calls.

| Timestamp granularity mode | Result |
| --- | --- |
| Default | Transcription of entire audio file in one piece |
| Word | Transcription with timestamps for each word |
| Speaker | Indicates who is speaking, and a timestamp, at each change of speaker |

### Supported languages

AI_TRANSCRIBE supports the following languages, which are automatically detected. Files can contain multiple supported languages.

> **Note:**
>
> Language detection requires audio to begin within the first five seconds of the file. For best results, trim excess silence
> before uploading.

* Arabic
* Bulgarian
* Cantonese
* Catalan
* Chinese
* Czech
* Dutch
* English
* French
* German
* Greek
* Hebrew
* Hindi
* Hungarian
* Indonesian
* Italian
* Japanese
* Korean
* Latvian
* Malay
* Norwegian
* Polish
* Portuguese
* Romanian
* Russian
* Serbian
* Slovenian
* Spanish
* Swedish
* Thai
* Turkish
* Ukrainian

### Supported media formats

AI_TRANSCRIBE supports the following audio and video file formats:

|  |  |
| --- | --- |
| Audio | FLAC, MP3, MP4, OGG, WAV, WEBM |
| Video | MKV, MP4, OGV, WEBM |

Video files must contain at least one audio track in FLAC, MP3, OPUS, VORBIS, or WAV format.

## Examples

### Text transcription

The following example transcribes [`an audio file`](../../_downloads/a1ea941b9694993063b55e621dae1cd0/consultation.wav) stored in the
`financial_consultation` stage, returning a text transcript of the entire file. The
[TO_FILE function](../../sql-reference/functions/to_file.md) converts the staged file to a file reference.

```sqlexample
SELECT AI_TRANSCRIBE(TO_FILE(
    '@financial_consultation', 'consultation.wav'));
```

Response:

```output
{"audio_duration":321.78,"text":"Good afternoon, Robert. Thanks for calling in
today. I understand you had some concerns about your portfolio you wanted to
discuss. Yes, I'm really worried. I've been watching the news and the market's
been all over the place lately. I'm thinking maybe I should just sell
everything, all my stocks and mutual funds and put it all in bonds or CDs. At
least then I could sleep at night. I can definitely understand that concern,
Robert. Market volatility can be unsettling, especially when you're seeing
those daily swings in the headlines. Before we talk about any major moves, can
you help me understand what specifically is driving this anxiety? Is it the
recent tech sector pullback or something more general? It's everything. I'm 52
years old and I keep thinking about what happened in 2008. I lost so much then
and I'm worried we're heading for another crash with this new administration. I
can't afford to lose my retirement savings. Those are absolutely valid
concerns, and I appreciate you sharing that context. That was a really
challenging time for everyone. Let me ask you this. When we last reviewed your
portfolio in March, we had you allocated at about 70% equities and 30% bonds,
correct? And your target retirement age is still 62%. That's right. But
honestly, 70% in stocks feels way too risky right now. I'm thinking more like
20% stocks, 80% bonds, maybe even less in stocks. I understand that instinct,
Robert. Let's walk through this together. First, I want to remind you of
something important. Your current portfolio is already designed with volatility
in mind. You're not in individual stocks. You're in diversified index funds and
some actively managed funds across different sectors and even international
markets. but they're still going down. My quarterly statement showed I was down
8% this quarter alone. You're absolutely right, and that's painful to see, but
let's put this in perspective. Over the past 12 months, even with this recent
volatility, your portfolio is still up about 3%. The market has given back some
gains, but we're not in crisis territory. Remember, we built your allocation
specifically because you have 10 years until retirement. That time horizon is
actually your biggest asset here. So you're saying I should just do nothing?
Not exactly nothing, but I am suggesting we don't make dramatic changes based
on short-term market movements. However, I do hear your concern about risk
tolerance. What if we made a smaller adjustment? Instead of going to 20%
stocks, what if we moved to 60% stocks and 40% bonds? That would reduce your
equity exposure by 10%, which might help you sleep better, but wouldn't take
you completely out of the growth potential you need for retirement. That
actually sounds more reasonable, but I'm still worried about losing more
money. I understand completely. Let me ask you this. What's your bigger worry,
the volatility of the next year, two or two, or having enough money to retire
comfortably at 62? Because if we get too conservative now, inflation alone
could erode your purchasing power over the next decade. I didn't really thought
about inflation that way. I guess I've been so focused on not losing money that
I forgot about the money I might not make. Exactly. And remember, Robert,
you're not alone in this. I've had this conversation with many clients over the
past few weeks. The ones who stayed disciplined during previous market
downturns are generally glad they did. What if we also set up a plan where we
review your portfolio monthly for the next few months? That way you'll have
regular check-ins and won't feel like you're just riding this out blindly.
Monthly reviews would definitely help. And maybe the 60-40 split is a good
compromise. I just, I don't want to be stupid about this. Overt, wanting to
protect your retirement isn't stupid. It's exactly what you should be thinking
about. The key is making sure we're protecting it in the right way. Staying
invested in a diversified portfolio, even with some volatility, has
historically been the best way to preserve and grow wealth over time. okay, I
think I can live with moving to 60% stocks, but if things get really bad... If
things get really bad, we'll talk again. That's what I'm here for. And
remember, we'll be reviewing this monthly anyway. You're not locked into
anything forever. But I do want to emphasize that market timing is incredibly
difficult, even for professionals. The goal isn't to avoid all volatility.
It's to stay invested long enough to benefit from the market's long-term
upward trend. All right, Sarah, let's do the rebalancing to 60-40 and I'll try
to stop checking my account balance every day. It sounds like a solid plan,
Robert. And yes, definitely limit the daily balance checking. That's a recipe
for anxiety. I'll send you some research on historical market recoveries after
our call and we'll schedule our first monthly review for next month. How does
that sound? That sounds good. Thanks for talking me through this, Sarah. I feel
a lot better than when I call. I'm so glad to hear that, Robert. Remember,
staying invested requires patience, but your future self will thank you for it.
I'll have the rebalancing done by tomorrow morning, and you should see the
changes reflected in your account by Thursday. Perfect. Thanks again, Sarah. I
thank you deeply for your patience and understanding. I'll talk to you next
month."}
```

### Word-level segmentation with timestamps

Set the timestamp granularity to “word” to extract precise timestamps for every word spoken, enabling searchable, navigable transcripts.
Note that [`this audio file`](../../_downloads/ed8a8177fc066cdea7c80b650d0a0302/consultation_3_sp.wav) is in Spanish.

```sqlexample
SELECT AI_TRANSCRIBE(TO_FILE('@financial_consultation', 'consultation_3_sp.wav'),
    {'timestamp_granularity': 'word'});
```

Response:

> **Note:**
>
> The output is truncated for brevity. The full output contains a segment for each word spoken in the audio file.

```output
{
    "audio_duration": 150.66,
    "segments": [
        {
            "end": 1.513,
            "start": 0.031,
            "text": "«Buenos"
        },
        {
            "end": 2.034,
            "start": 1.553,
            "text": "días,"
        },
        {
            "end": 2.334,
            "start": 2.054,
            "text": "doña"
        },
        {
            "end": 4.457,
            "start": 2.374,
            "text": "Esperanza."
        },
        {
            "end": 4.597,
            "start": 4.477,
            "text": "¿En"
        },
        {
            "end": 4.857,
            "start": 4.697,
            "text": "qué"
        },
        {
            "end": 5.118,
            "start": 4.917,
            "text": "puedo"
        },
        {
            "end": 5.518,
            "start": 5.178,
            "text": "ayudarla"
        },
        {
            "end": 6.5,
            "start": 5.578,
            "text": "hoy?»"
        },

        ...

        {
            "end": 146.671,
            "start": 146.551,
            "text": "Ya"
        },
        {
            "end": 147.234,
            "start": 146.732,
            "text": "veremos,"
        },
        {
            "end": 147.837,
            "start": 147.355,
            "text": "Roberto."
        },
        {
            "end": 148.581,
            "start": 148.078,
            "text": "Gracias"
        },
        {
            "end": 148.822,
            "start": 148.661,
            "text": "por"
        },
        {
            "end": 149.646,
            "start": 148.902,
            "text": "tu"
        },
        {
            "end": 150.711,
            "start": 150.249,
            "text": "ayuda."
        }
    ],
    "text": "«Buenos días, doña Esperanza. ¿En qué puedo ayudarla hoy?» «Roberto, quiero
    hacer un cambio grande en mi portafolio. Quiero vender todo y compra solo acciones
    de Tesla». «¿Tesla? Doña Esperanza, usted tiene 72 años. ¿Por qué quiere poner todo
    su dinero en una sola compañía?» «¿Por qué Tesla va a ser el futuro?» Un minuto me
    explico que van a dominar los carros eléctricos. Dice que puedo triplicar mi dinero
    en dos años. Entiendo que Tesla es una impresión innovador, pero poner todos sus
    ajuros en una sola acción es muy arriesgado. ¿Qué pasa si Tesla baja? No va a bajar.
    Elon Musk es un genio. Además, mi vecina compró Teslas. Teslas es tres años. Y Aorus
    tiene el doble de dinero. Doña Esperanza, su vecina tuvo suerte, pero las yantes
    individuales pueden ser muy volátiles. Usted necesita dinero estable para sus gastos
    de retiro. Roberto, tengo $400,000 en mi cuenta. Si te la sube como dismi, voy a
    tener más de un año. Podré dejarle más dinero a mi familia. Pero también podría
    perder la mitad de su dinero o más. Te sabía Jairo 60% antes. No puedo recomendarle
    que haga esto. Entonces no me dejas escuchando. Yo sé lo que quiero hacer con mi
    dinero. Es mi decisión. Tienes razón, es su dinero. Pero como su asesor tengo que
    decir que esto es extremamanda peligroso para alguien de su edad. Eva, no importa.
    Quiero tomar este riesgo. Vas a Edom o no. Doña Esperanza, ¿qué tal si compramos
    algo de Tesla perronoto? ¿Podríamos poner 10% en Tesla y el resto en versiones más
    seguras? No, Roberto, quiero el 100% en Tesla. Si no me ayudas, voy a alcanzar otro
    asesor. Que sí lo haga. Está bien, Doña Presanza. Voy a procesar la orden, pero voy
    a documentar que fue contra mi recomendación profesional. Perfecto. Hazlo hoy mismo.
    Quiero compra antes que suba más. Será ahora. Él considera lo que le estoy diciendo.
    Esto puede ser ver muy mal a la vida. Ya veremos, Roberto. Gracias por tu ayuda."
}
```

### Speaker recognition

Set timestamp granularity to “speaker” to detect, separate, and identify unique speakers in conversations or meetings.
This example uses [`an audio file`](../../_downloads/723595d8b4eaf09cad6ce639b6466e03/consultation_5_mix_es_en.wav) an audio file with two speakers,
one speaking English and the other Spanish.

```sqlexample
SELECT AI_TRANSCRIBE(TO_FILE('@financial_consultation', 'consultation_5_mix_es_en.wav'),
    {'timestamp_granularity': 'speaker'});
```

Response:

> **Note:**
>
> The output is truncated for brevity. The full output contains a segment for each conversational “turn” in the audio file.

```output
{
    "audio_duration": 208.66,
    "segments": [
        {
            "end": 3.076,
            "speaker_label": "SPEAKER_00",
            "start": 0.031,
            "text": "Good afternoon, this is Aaliyah Johnson from Secure Financial Services."
        },
        {
            "end": 4.297,
            "speaker_label": "SPEAKER_02",
            "start": 3.196,
            "text": "How can I help you today?"
        },
        {
            "end": 7.182,
            "speaker_label": "SPEAKER_02",
            "start": 5.139,
            "text": "Hola, necesito ayuda con mis inversiones."
        },
        {
            "end": 11.528,
            "speaker_label": "SPEAKER_02",
            "start": 7.482,
            "text": "Estoy muy preocupada porque he perdido mucho dinero y no sé qué hacer."
        },
        {
            "end": 14.132,
            "speaker_label": "SPEAKER_02",
            "start": 12.289,
            "text": "I'm sorry, I'm not understanding."
        },
        {
            "end": 15.795,
            "speaker_label": "SPEAKER_02",
            "start": 14.553,
            "text": "Do you speak English?"
        },
        ...
        {
            "end": 189.169,
            "speaker_label": "SPEAKER_02",
            "start": 185.841,
            "text": "Es muy difícil entender estas cosas en inglés."
        },
        {
            "end": 192.326,
            "speaker_label": "SPEAKER_01",
            "start": 190.178,
            "text": "Por supuesto, señora Ramírez."
        },
        {
            "end": 197.145,
            "speaker_label": "SPEAKER_01",
            "start": 192.788,
            "text": "Es muy importante que entienda completamente sus opciones."
        },
        {
            "end": 203.229,
            "speaker_label": "SPEAKER_01",
            "start": 197.165,
            "text": "Voy a hacer los cambios hoy mismo y la llamaré la próxima semana para ver cómo se siente."
        },
        {
            "end": 205.759,
            "speaker_label": "SPEAKER_02",
            "start": 203.891,
            "text": "Muchísimas gracias, María."
        },
        {
            "end": 208.71,
            "speaker_label": "SPEAKER_02",
            "start": 206.18,
            "text": "Me siento mucho más tranquila ahora."
        }
    ],
    "text": "Good afternoon, this is Aaliyah Johnson from Secure Financial Services.
    How can I help you today? Hola, necesito ayuda con mis inversiones. Estoy muy
    preocupada porque he perdido mucho dinero y no sé qué hacer. I'm sorry, I'm not
    understanding. Do you speak English? Un poquito, pero es muy difícil para mí. Aquí
    hay alguien que habla español, ¿ok? Es muy importante. He perdido miles de dólares.
    I'm really sorry, but I don't speak Spanish. Let me see. I think we might have
    someone who speaks Spanish, but they're not available right now. ¿Cuándo pueden
    ayudarme? Necesito hablar con a lguien hoy. Mi esposo está muy enojado y quiere que
    vendamos todo. I understand you need someone who speaks Spanish. Let me check if
    Maria is available. She's our Spanish-speaking advisor. Can you hold for just a
    moment? No entiendo. Mañana. Pero necesito ayuda ahora. ¿No hay nadie más? I am
    going to transfer you to Maria right now. She'll be able to help you with your
    investment concerns. Hola, soy María González. Entiendo que necesita ayuda con sus
    inversiones. ¿Cómo está usted? ¡Ay, qué alivio! Sí, estoy muy preocupada. He
    perdido casi 20.000 dólares en las últimas semanas y mi esposo quiere que vendamos
    todo. Comprendo perfectamente su preocupación, señora Ramírez. Perder dinero es muy
    estresante. Cuénteme un poco más sobre su situación. ¿Qué tipo de inversiones
    tiene? Tengo fondos mutuos y algunas acciones. Todo está bajando mucho. Mi esposo
    dice que es mejor tener el dinero en el banco, pero yo no estoy segura. Es natural
    sentirse nerviosa cuando el mercado está volátil. Pero antes de tomar decisiones
    importantes, vamos a revisar su situación completa. ¿Cuántos años tiene usted y
    cuándo planea retirarse? Tengo 55 años y quiero retirarme a los 65, pero con estas
    pérdidas no sé si voy a poder. Señora Ramírez, usted todavía tiene 10 años hasta el
    retiro. Eso es tiempo suficiente para que sus inversiones se recuperen. El mercado
    siempre tiene altibajos, pero históricamente se ha recuperado. ¿Pero qué pasa si no
    se recupera esta vez? No puedo perder más dinero. Entiendo su miedo. ¿Qué le parece
    si hacemos algunos ajustes para que se sienta más cómoda? Podemos mover parte de su
    dinero a inversiones más conservadoras, como bonos. Eso suena mejor. No quiero
    arriesgar todo, pero tampoco quiero perder la oportunidad de crecer mi dinero.
    Perfecto. Vamos a encontrar un equilibrio. ¿Qué tal si movemos el 40% de sus
    acciones a bonos? Así tendrá menos riesgo, pero todavía podrá crecer su dinero para
    el retiro. Sí, eso me hace sentir mucho mejor. Gracias por explicarme todo en
    español. Es muy difícil entender estas cosas en inglés. Por supuesto, señora
    Ramírez. Es muy importante que entienda completamente sus opciones. Voy a hacer los
    cambios hoy mismo y la llamaré la próxima semana para ver cómo se siente. Muchísimas
    gracias, María. Me siento mucho más tranquila ahora."
}
```

## Use with other AI Functions

### Call transcript analysis

You can pass the output of AI_TRANSCRIBE to other AI Functions for further processing. For example, you can use
AI_SUMMARIZE to summarize the transcription, or AI_CLASSIFY to classify the content of the transcription. This example
uses AI_SENTIMENT and AI_COMPLETE to analyze the text transcribed from
[`customer call audio`](../../_downloads/e9d32cfe1b904b4b57ff66879eece999/consultation_1.wav) and provide sentiment on four dimensions
and an assessment of the agent.

> **Note:**
>
> AI_SENTIMENT analyzes only text and does not consider speech characteristics like tone of voice.

```sqlexample
WITH transcriptions AS
    ( SELECT TO_VARCHAR (AI_TRANSCRIBE(TO_FILE('@financial_consultation',
        'consultation_1.wav'))) AS transcribed_call )
SELECT
    AI_SENTIMENT(transcribed_call, ['Professionalism', 'Resolution',
        'Wait Time', 'Market Conditions']) AS call_sentiment,
    AI_COMPLETE ('claude-4-opus', CONCAT ('Summarize how the agent can improve in 50 words',
        transcribed_call)) AS agent_assessment
FROM transcriptions
```

AI_SENTIMENT response:

```output
{
    "categories": [
        {
            "name": "overall",
            "sentiment": "negative"
        },
        {
            "name": "Market Conditions",
            "sentiment": "negative"
        },
        {
            "name": "Professionalism",
            "sentiment": "negative"
        },
        {
            "name": "Resolution",
            "sentiment": "negative"
        },
        {
            "name": "Wait Time",
            "sentiment": "unknown"
        }
    ]
}
```

AI_COMPLETE response:

```output
"The agent needs significant improvement in empathy, active listening, and client-centered communication. Instead of
dismissing concerns and using condescending language, they should validate emotions, explain market conditions
professionally, present multiple options, and guide clients through informed decision-making while respecting their
risk tolerance and personal circumstances."
```

### Video transcript analysis

The following example transcribes a [video file](https://www.youtube.com/watch?v=QEQZs8SLhQE) stored in the `podcast_videos_S3` stage,

```sqlexample
SELECT AI_TRANSCRIBE(TO_FILE( '@podcast_videos_S3', 'podcast-interview.mp4'));
```

Response:

```output
{
"audio_duration": 5423.744,
"text": "Welcome to the New York Times Popcast, your deepest duende of music news and criticism. I'm John Caramonica, and I'm the critic. I'm Joe Cascarelli, and I'm the reporter. I'm Rosalía and I'm here today with you guys. Yes. Thank you so much for being here. Like literally on some days, Jo. Some days. On some days, I think, is this person the only good pop star?
...
Thank you for being here. Loved. Every episode of Popcast is at nytimes.com slash popcast. We're on YouTube at Popcast. Subscribe. We're on Instagram and TikTok at Popcast. Tap that like. Tap that follow. Tap in. Don't tap out. Credits and links and bio. We'll be back next week. Yes. Invite me anytime to eat more snacks, please. I lost my hands in Jerez"
}
```

Once you have the transcript, you can use AI_COMPLETE to perform additional analysis. This example identifies retail brands mentioned in the conversation for use in advertising or sponsorship analytics.

```sqlexample
SELECT
  AI_COMPLETE('claude-sonnet-4-5',
    PROMPT('Return a list of any Retail Brands mentioned in this podcast {0}',
      TO_VARCHAR(transcription_results))) as brands_identified
FROM podcast_video_transcription;
```

Response

```output
Retail Brands Mentioned in Podcast

Based on the transcript analysis, the following brands were identified:

Calvin Klein — Mentioned in relation to Rosalía’s commercial appearance
Kinder Bueno — Cited as one of Rosalía’s favorite snacks.
Nutella — Referenced as a preferred treat.
Nestlé — Mentioned as the manufacturer of Milky Bar ice cream bites.
Nongshim — Korean snack brand discussed during the tasting segment.
Cap'n Crunch — Referenced for its scent similarity to Korean snacks.
Doritos — Mentioned by one of the hosts while discussing snack collections.
```

## Cost considerations

Billing for all AI Functions is based on token consumption. For transcription, each second of audio processed is 50 tokens, regardless of language or segmentation method.
A full hour of audio is therefore 180,000 tokens. Assuming that processing a million tokens costs 1.3 credits, and that Snowflake credits
cost US $3 each, each hour of audio processed costs about US $0.702. This estimate is subject to change. For current pricing information, see the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Note:**
>
> AI_TRANSCRIBE has a minimum billing duration of 1 minute. Files shorter than 1 minute are still processed, but are
> billed at 1 minute. To efficiently process large numbers of short audio files, consider batching them into a single file and
> using timestamps to identify the start and end of each original file in the resulting transcription.

---
title: Cortex AI Functions: Documents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-documents.md
section: Snowflake Cortex (AI & ML)
---

# Cortex AI Functions: Documents

Snowflake provides advanced AI-powered document intelligence capabilities as Cortex AI Functions. These functions help
you to process, parse, classify, and extract information from a wide variety of document types to power analytics,
automation, and intelligent applications, all using simple SQL. Document functions help you with the following tasks:

* **Parse documents** to convert unstructured text and layouts into structured, searchable, analyzable content.
* **Extract structured information** (entities, tables, or fields) from documents.
* **Classify document types** to drive downstream workflows and analytics.

Cortex document processing functions can be combined to build retrieval augmented generation (RAG) pipelines,
intelligent search and chatbot systems, and large-scale document analytics. The following illustration shows how Cortex
document processing functions form a composable framework in which components can be mixed and matched to build
tailored solutions.

## Document functions

The core Cortex AI Functions for document processing are:

* [AI_PARSE_DOCUMENT](parse-document.md): Converts digital-native or scanned documents into rich text while
  preserving layout and context. Optionally extracts images from documents. Ideal for semantic search, RAG pipelines,
  and summarization workflows. Works well with document analysis that requires understanding the entire document
  content.
* [AI_EXTRACT](document-extraction.md): Provides high-quality structured extraction of information from
  documents. Understands text, tables, checkboxes, handwriting, and other visual elements. Specializes in extracting
  structured data based on a schema.
* [AI_COMPLETE](../../sql-reference/functions/ai_complete.md): The most general-purpose AI Function, AI_COMPLETE generates
  text completions based on a prompt you provide, and so can be used for a wide variety of tasks involving extracting or
  transforming text from documents. An advantage of AI_COMPLETE is the ability to choose a model.

The following text-processing AI Functions can be used to further analyze or transform text extracted from documents.

* [AI_SENTIMENT](../../sql-reference/functions/ai_sentiment.md): Analyzes the sentiment of text content.
* [AI_TRANSLATE](../../sql-reference/functions/ai_translate.md): Translates text content between languages.
* [SUMMARIZE](../../sql-reference/functions/summarize-snowflake-cortex.md): Generates concise summaries of text content.

## Use cases

Cortex AI Functions for document processing are designed to be used together or individually to address a variety of use
cases, and are well-suited for these two use cases:

### Building RAG pipelines for chatbots and enterprise search services

Documents processed by AI_PARSE_DOCUMENT can be indexed by Cortex Search Services, which can act as retrieval augmented
generation (RAG) engines to improve language model responses to user queries. In this scenario, you use the Cortex
Search Service to find documents related to the query, then pass these documents to AI_COMPLETE as part of the prompt to
generate more contextually relevant responses.

### Building document processing pipelines for streamlining workflows and analytics

Cortex document processing AI Functions help you build intelligent, flexible, and scalable document processing pipelines
using modular components. Such a pipeline ingests documents in various formats and transforms them into actionable data,
allowing you to build workflows like these:

* Schema based extraction: Apply a natural language schema to extract entities – ranging from single entities to complex tabular data – from a set of documents
* Q&A against document: Ask questions about a document in natural language.
* Text and layout extraction: Capture document text (with or without layout) to extract entities, generate summaries, and perform analysis using other AI Functions.
* Classification: Determine the document type (e.g., “invoice,” “contract,” “report”) when ingesting data to route each
  type to an appropriate processing workflow.
* Build a model registry to share custom extraction and classification models: A model registry stores document
  extraction models fine-tuned for custom use cases specific to your organization. Reusing these models across teams
  saves time and effort.

---
title: Cortex AI Functions: Image extraction with AI_PARSE_DOCUMENT
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/image-extraction.md
section: Snowflake Cortex (AI & ML)
---

# Cortex AI Functions: Image extraction with AI_PARSE_DOCUMENT

AI_PARSE_DOCUMENT is a Cortex AI
function that extracts text, data, layout elements, and images, from PDFs, Word documents, and images. Use this
high-fidelity image extraction capability to power advanced, multimodal document processing workflows, such as:

* *Enrich data*: Extract images from documents to add visual context for deeper insights.
* *Multimodal RAG*: Combine images and text for retrieval-augmented generation (RAG) to improve model responses.
* *Image classification*: Use extracted images with AI_EXTRACT or AI_COMPLETE for automatic tagging and analysis.
* *Knowledge bases*: Build richer repositories by including both text and images for better search and reasoning.
* *Compliance*: Extract and analyze images (e.g., charts, signatures) for regulatory and audit workflows.

For an introduction to AI_PARSE_DOCUMENT, see [Parsing documents with AI_PARSE_DOCUMENT](parse-document.md).

## Using AI_PARSE_DOCUMENT to extract images

To extract images from a document using AI_PARSE_DOCUMENT:

* Set the `'mode'` option to `'LAYOUT'`. Image extraction requires LAYOUT mode.
* Set the `'extract_images'` option to TRUE.

AI_PARSE_DOCUMENT image extraction returns an array, `images`, in the JSON output. Each element of `images` contains a
field, `image_base64`, with the extracted image data encoded as a base64 string. Image OBJECT_CONSTRUCT also contains fields
for a unique ID and image bounding boxes.

```sqlexample
SELECT AI_PARSE_DOCUMENT(
    TO_FILE('@my_stage', 'my_document.pdf'),
    {'mode': 'LAYOUT', 'extract_images': true})
AS layout_wƒith_images;
```

You can decode the images using BASE64_DECODE_BINARY, then pass them directly to AI_EXTRACT to process or describe the
image contents. Alternatively, you can store them in a stage for processing using multimodal AI_COMPLETE. (AI_COMPLETE
does not currently support direct image input.)

## Examples

### Extract and describe images

After extracting image data, you can use AI_EXTRACT to process or describe the image content. The following example
generates a description for the first extracted image after converting it to binary from base64. (AI_EXTRACT requires
binary input.) The query uses a regular expression to strip the metadata (schema and format) from the base64 string.

```sqlexample
SELECT AI_EXTRACT(
file_data => BASE64_DECODE_BINARY(
    REGEXP_REPLACE(
    (
        SELECT (
            AI_PARSE_DOCUMENT(
                TO_FILE('@image_docs', 'my_document.pdf'),
                {'mode': 'LAYOUT', 'extract_images': true}
            ):images[0]['image_base64']
            )::STRING
        ),
    '^data:image/[^;]+;base64,', '')
    ),
responseFormat => {'Image Name': 'Describe the image'}
);
```

### Store extracted images in a stage

You can store extracted images from documents in a Snowflake stage for reuse, auditing, or additional processing with
other Cortex AI functions. This example creates and uses a Python stored procedure to decode base64
image data from AI_PARSE_DOCUMENT and upload the resulting image files to a specified stage.

```sqlexample-python
CREATE OR REPLACE PROCEDURE SAVE_EXTRACTED_IMAGES(r VARIANT)
RETURNS ARRAY
LANGUAGE PYTHON
RUNTIME_VERSION = '3.9'
PACKAGES = ('pillow', 'snowflake-snowpark-python')
HANDLER = 'run'
AS
$$
import base64
import io
import os
import tempfile
from PIL import Image

def process_parse_document_result(data: dict) -> tuple[str, str, str]:
    images = data["images"]
    for image in images:
        id = image["id"]
        data, image_base64 = image["image_base64"].split(";", 1)
        extension = data.split("/")[1]
        base64 = image_base64.split(",")[1]
        yield id, extension, base64

def decode_base64(encoded_image: str) -> bytes:
    return base64.b64decode(encoded_image)

def run(session, r):
    destination_path = r["DESTINATION_PATH"]
    parse_document_result = r["PARSE_DOCUMENT_RESULT"]

    if not destination_path:
        return ["Error: destination_path parameter is required"]
    if not destination_path.startswith("@"):
        return ["Error: destination_path must start with @ (e.g. @output_stage/path"]
    if destination_path == "@":
        return ["Error: destination_path must include a stage name after @"]

    # Clean the result directory
    session.sql(f"RM destination_path")

    uploaded_files = []
    with tempfile.TemporaryDirectory() as temp_dir:
        for image_id, extension, encoded_image in process_parse_document_result(parse_document_result):
            image_bytes = decode_base64(encoded_image)
            image: Image = Image.open(io.BytesIO(image_bytes))

            image_path = os.path.join(temp_dir, image_id)
            image.save(image_path)

            # Use session.file.put with source file path and auto_compress=False
            session.file.put(
                image_path, destination_path, auto_compress=False, overwrite=True
            )
            uploaded_files.append(f"{destination_path}/{image_id}")

            # Cleanup
            os.remove(image_path)
    return uploaded_files
$$;
```

After creating the SAVE_EXTRACTED_IMAGES procedure, you can call it to extract images from a document and store them in
a stage, as shown in the following code snippet:

```sqlexample
CALL SAVE_EXTRACTED_IMAGES(
(
SELECT OBJECT_CONSTRUCT(*)
FROM ( SELECT
    '@image_docs/output' as destination_path,
    AI_PARSE_DOCUMENT(
    TO_FILE('@image_docs/my_document.pdf'),
    {'mode': 'LAYOUT', 'extract_images': true}
    ) as parse_document_result
) LIMIT 1
));
```

The output of this query is a list of file paths for the images stored in the specified stage, such as:

```output
image_docs/output/img-0.jpeg
image_docs/output/img-1.jpeg
image_docs/output/img-10.jpeg
image_docs/output/img-11.jpeg
image_docs/output/img-12.jpeg
image_docs/output/img-13.jpeg
```

Now you can process the stored images using other Cortex AI functions, such as AI_COMPLETE for multimodal analysis or generation.

```sqlexample
SELECT AI_COMPLETE(
    'pixtral-large',
    'Describe the image in 10 words.',
    TO_FILE('@image_docs/output/img-0.jpeg')
);
```

Response:

```output
The image shows central bank policy rates for various countries from 2000 to 2025.
```

## Cost considerations

AI_PARSE_DOCUMENT uses billing based on the number of pages processed. A single image file is considered to be a page
for billing purposes. Extracting images does not incur additional costs.

## Current limitations

* No more than fifty images can be extracted from a single document. Additional images are ignored.
* Images smaller than 4x4 pixels are not extracted.
* If the size of a response exceeds the account parameter EXTERNAL_FUNCTION_MAx_RESPONSE_SIZE, the function returns an
  error. Increase the value of this parameter if necessary.

---
title: Cortex AI Functions: Images
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-images.md
section: Snowflake Cortex (AI & ML)
---

# Cortex AI Functions: Images

With Cortex AI Images, you can accomplish the following:

* Compare images
* Caption images
* Classify images
* Extract entities from images
* Generate embedding vectors for use in retrieval systems
* Answer questions using data in graphs and charts

You can do those tasks with the following functions:

* [AI_COMPLETE](../../sql-reference/functions/ai_complete.md)
* [AI_EMBED](../../sql-reference/functions/ai_embed.md)
* [AI_FILTER](../../sql-reference/functions/ai_filter.md)
* [AI_CLASSIFY](../../sql-reference/functions/ai_classify.md)
* [AI_SIMILARITY](../../sql-reference/functions/ai_similarity.md)

## Input requirements

COMPLETE Multimodal can process images with the following characteristics:

| Requirement | Value |
| --- | --- |
| Filename extensions | `.jpg`, `.jpeg`, `.png`, `.webp`, `.gif` |
| Stage encryption | Server-side encryption |
| Data type | [FILE](../../sql-reference/data-types-unstructured.md) |

> **Note:**
>
> Processing files from stages is currently incompatible with custom network policies.

## Analyze images

The COMPLETE function processes a single image or multiple images (for example, extracting differences in entities across various images) stored in a stage. See [Create stage for media files](aisql.md) for information on creating a suitable stage.

The function call specifies the following:

* The multimodal model to be used
* A prompt
* The stage path of the image file(s) via a [FILE](../../sql-reference/data-types-unstructured.md) object

### Vision Q&A example

The following example uses Anthropic’s Claude Sonnet 4.6 model to summarize a pie chart `science-employment-slide.jpeg` stored in the `@myimages` stage.

```sqlexample
SELECT AI_COMPLETE('claude-4-6-sonnet',
    'Summarize the insights from this pie chart in 100 words',
    TO_FILE('@myimages', 'science-employment-slide.jpeg'));
```

Response:

```output
This pie chart shows the distribution of occupations where mathematics is considered "extremely important" in 2023.
Data scientists dominate with nearly half (48.7%) of all such positions, followed by operations research analysts
at 29.6%. The remaining positions are distributed among statisticians (7.8%), actuaries (7.2%), physicists (5.1%),
mathematicians (0.6%), and other mathematical science occupations (1.1%). This distribution highlights the growing
importance of data science in mathematics-intensive careers, while traditional mathematics roles represent a smaller
share of the workforce.
```

### Compare images example

> **Note:**
>
> Currently, only Anthropic (`claude`) and Meta (`llama`) models can reference multiple images in a single prompt.
> Multiple image support for other models may be available in a future release.

Use the [PROMPT helper function](../../sql-reference/functions/prompt.md) to process multiple images in a single COMPLETE call. The following example uses
Anthropic’s Claude Sonnet 4.6 model to compare two different ad creatives from the `@myimages` stage.

```sqlexample
SELECT AI_COMPLETE('claude-4-6-sonnet',
    PROMPT('Compare this image {0} to this image {1} and describe the ideal audience for each in two concise bullets no longer than 10 words',
    TO_FILE('@myimages', 'adcreative_1.png'),
    TO_FILE('@myimages', 'adcreative_2.png')
));
```

Response:

```output
First image ("Discover a New Energy"):
• Conservative luxury SUV buyers seeking a subtle transition to electrification

Second image ("Electrify Your Drive"):
• Young, tech-savvy urbanites attracted to bold, progressive automotive design
```

### Classify images example

The following example uses AI_CLASSIFY to classify an image for a real estate application.

The following SQL uses the AI_CLASSIFY function to classify the image as a picture of a living area, kitchen, bath, garden, or master bedroom.

```sqlexample
SELECT AI_CLASSIFY(TO_FILE('@my_images', 'REAL_ESTATE_STAGING.PNG'),
    ['Living Area', 'Kitchen', 'Bath', 'Garden', 'Master Bedroom']) AS room_classification;
```

Response:

```output
{ "labels": [ "Living Area" ] }
```

The SQL below categorizes the objects found in the above image as a couch, window, table, television, or artwork.

```sqlexample
SELECT AI_CLASSIFY (TO_FILE ('@my_images', 'REAL_ESTATE_STAGING.PNG'),
    ['Couch', 'Window', 'Table', 'Television', 'Art'],  {'output_mode': 'multi'} )
    AS living_room_objects;
```

Response:

```output
{
  "labels": [
    "Art",
    "Couch",
    "Table",
    "Window"
  ]
}
```

## Search images

You can use AI_EMBED to find images that are similar to a target image. First, use the AI_EMBED function to generate an
embedding vector for the target image, mapping its visual features into an abstract vector space, a numerical
representation of the image’s features. You can then use vector similarity functions to compare this embedding vector
to the embedding vectors of other images, producing a similarity score based on their common or similar visual features.
This score can be used to classify, rank, or filter images based on their similarity to the target image.

|  |  |
| --- | --- |
|  |  |

For example, given the images above, the following SQL generates an embedding vector for each image, then compares the
vectors using cosine similarity. The result, about 0.5, indicates that the images are somewhat similar. Both photos are
taken in an urban setting and contain background crowds, but the main subjects are different.

```sqlexample
WITH ai_image_embeddings as (
    SELECT
        AI_EMBED('voyage-multimodal-3',
            TO_FILE ('@my_images', 'CITY_WALKING1.PNG')) as image1_embeddings,
        AI_EMBED('voyage-multimodal-3',
            TO_FILE ('@my_images', 'CITY_WALKING2.PNG')) as image2_embeddings
)
SELECT VECTOR_COSINE_SIMILARITY(image1_embeddings,image2_embeddings) as similarity FROM ai_image_embeddings;
```

```output
0.5359029029
```

To find images that are similar to a target image, you can use AI_SIMILARITY. The example below computes a similarity
score for possibly thousands of images, and returns the advertising creatives that are most similar to the motorcycle
advertisement below.

```sqlexample
SELECT
    TO_FILE('@ad_images', relative_path) as ALL_ADS
    FROM DIRECTORY(@ad_images)
WHERE AI_SIMILARITY(TO_FILE('@ad_images', 'image_226.jpg'), ALL_ADS) >= 0.5;
```

The query returns images from a multimodal table where the similarity score is greater than 0.50. One of the images
identified (`image_226.jpg`) is the one we used as a reference.

```output
+-----------------------------------------------------------+
| {} ALL_ADS                                                |
+-----------------------------------------------------------+
|  { "CONTENT_TYPE": "image/jpeg",                          |
|    "ETAG": "686897696a7c876b7e",                          |
|    "LAST_MODIFIED": "Wed, 26 Mar 2025 18:11:45 GMT",      |
|    "RELATIVE_PATH": "image_226.jpg",                      |
|    "SIZE": 39086,                                         |
|    "STAGE": "@ad_images" }                                |
+-----------------------------------------------------------+
|  { "CONTENT_TYPE": "image/jpeg",                          |
|    "ETAG": "e7b678c7a696798686",                          |
|    "LAST_MODIFIED": "Wed, 26 Mar 2025 18:11:57 GMT",      |
|    "RELATIVE_PATH": "image_441.jpg",                      |
|    "SIZE": 12650,                                         |
|    "STAGE": "@ad_images" },                               |
+-----------------------------------------------------------+
```

## Model limitations

All models available to Snowflake Cortex have limitations on the total number of input and output tokens, known as the
model’s *context window*. The context window size is measured in tokens. Inputs exceeding the context window limit
result in an error. Output which would exceed the context window limit is truncated.

For text models, tokens generally represent approximately four characters of text, so the word count corresponding to a
limit is less than the token count.

For image models, the token count per image depends on the vision model’s architecture. Tokens within a prompt (for
example, “what animal is this?”) also contribute to the model’s context window.

| Model | Context window (tokens) | File types | File size | Images per prompt |
| --- | --- | --- | --- | --- |
| `openai-gpt-4.1` | 1,047,576 | .jpg, .jpeg, .png, .webp, .gif | 10MB | 5 |
| `claude-4-opus` | 200,000 | .jpg, .jpeg, .png, .webp, .gif | 3.75 MB [L1] | 20 |
| `claude-4-sonnet` | 200,000 | .jpg, .jpeg, .png, .webp, .gif | 3.75 MB [L1] | 20 |
| `claude-3-7-sonnet` | 200,000 | .jpg, .jpeg, .png, .webp, .gif | 3.75 MB [L1] | 20 |
| `claude-4-6-sonnet` | 200,000 | .jpg, .jpeg, .png, .webp, .gif | 3.75 MB [L1] | 20 |
| `llama4-maverick` | 128,000 | .jpg, .jpeg, .png, .webp, .gif, .bmp | 10 MB | 10 |
| `llama-4-scout` | 128,000 | .jpg, .jpeg, .png, .webp, .gif, .bmp | 10 MB | 10 |
| `pixtral-large` | 128,000 | .jpg, .jpeg, .png, .webp, .gif, .bmp | 10 MB | 1 |
| `voyage-multimodal-3` | 32,768 | .jpg, .png, .pg, .gif, .bmp | 10 MB | 1 |

[L1]
(1,2,3,4)

Images must be smaller than 8000x8000 pixels. Limits apply to each individual image.

## Cost considerations

Billing scales with the number of tokens processed. The number of tokens per image depends on the architecture of the vision model.

* Anthropic (`claude`) models’ formula is roughly: tokens = (Width in pixels x Height in pixels) / 750.
* Mistral (`pixtral`) models divide each image into batches of 16x16 pixels and converts each batch to a token.
  The total number of tokens is equivalent to roughly (Width in pixels / 16) \* (Height in pixels / 16).
* Meta (`llama`) models try to tile the image with square tiles. Depending on the image’s aspect ratio and size, the number of
  tiles can be up to 16, each represented by around 153 tokens.
* Open AI models rescale the image and tile it with square patches. For `openai-gpt-4.1`, depending on the image ratio
  and size, the number of tokens can be 211 (images up to 512x512px), 352 (non-square images with longer side length
  1024px), or from 630 tokens (square images at least 1024x1024px) to 913 tokens (non-square images with shorter side
  length 1024px).
* `voyage-multimodal-3` operates on an array of image patches that are roughly 14x14px in size. The image is rescaled
  so that it is covered by a grid, which has a minimum of 64 patches and a maximum of 2500 patches. Two extra image
  tokens are added, so the input ranges from 66 to 2502 tokens, depending on the image size and aspect ratio.

> **Note:**
>
> The COUNT_TOKENS function does not currently support image inputs.

## Choosing a vision model

The COMPLETE function supports multiple models of varying capability, latency, and cost. To achieve optimal performance
per credit, choose a model that aligns with the content size and task complexity.

| Model | MMMU | Mathvista | ChartQA | DocVQA | VQAv2 |
| --- | --- | --- | --- | --- | --- |
| GPT-4o | 68.6 | 64.6 | 85.1 | 88.9 | 77.8 |
| `openai-gpt-4.1` | 75.0 | 72.0 |  |  |  |
| `llama-4-maverick` | 73.4 | 73.7 | 90 | 94.4 |  |
| `llama-4-scout` | 69.4 | 70.7 | 88.8 | 94.4 |  |
| `pixtral-large` | 64.0 | 69.4 | 88.1 | 85.7 | 67 |

The benchmarks are:

* MMMU: Evaluates multimodal models on multidisciplinary tasks that require college-level reasoning.
* Mathvista: Mathematical reasoning benchmark within a visual context.
* ChartQA: Evaluates complex reasoning questions about charts.
* DocVQA and VQv2: Benchmarks for visual question-answering on documents.

For multimodal embeddings, only the `voyage-multimodal-3` model is currently available. `voyage-multimodal-3` is a
state-of-art multimodal embedding model capable of embedding text and images. It can extract key visual features from
sources such as screenshots of PDFs, slides, tables, and figures, reducing the need for complex document parsing
workflows. According to Voyage AI internal benchmarks, the `voyage-multimodal-3` model outperforms competing models
such as OpenAI CLIP Large, Amazon Titan Multimodal, and Cohere Multimodal v3.

## Regional availability

Support for this feature is available natively to accounts in the following Snowflake regions:

| Model | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) |
| --- | --- | --- | --- |
| `claude-3-7-sonnet` [A1] |  |  |  |
| `claude-4-sonnet` [A1] |  |  |  |
| `claude-4-opus` [A1] |  |  |  |
| `pixtral-large` | ✔ | ✔ | ✔ |
| `llama4-maverick` | ✔ |  |  |
| `llama4-scout` | ✔ |  |  |
| `voyage-multimodal-3` [A1] |  |  |  |

[A1]
(1,2,3,4)

Model is available via cross-region inference only.

AI_COMPLETE is available in additional regions through [cross-region inference](cross-region-inference.md).

## Error Conditions

| Message | Explanation |
| --- | --- |
| Request failed for external function SYSTEM$COMPLETE_WITH_IMAGE_INTERNAL with remote service error: 400 ‘“invalid image path” | Either the file extension or the file itself is not accepted by the model. The message might also mean that the file path is incorrect; that is, the file does not exist at the specified location. Filenames are case-sensitive. |
| Error in secure object | May indicate that the stage does not exist. Check the stage name and ensure that the stage exists and is accessible. Be sure to use the at (@) sign at the beginning of the stage path, such as `@myimages`. |
| Request failed for external function _COMPLETE_WITH_PROMPT with remote service error: 400 ‘“invalid request parameters: unsupported image format: image/\*\* | Unsupported image format given to `claude-4-6-sonnet`, i.e. other than .jpeg, .png, .webp, or .gif. |
| Request failed for external function _COMPLETE_WITH_PROMPT with remote service error: 400 ‘“invalid request parameters: Image data exceeds the limit of 5.00 MB” | The provided image given to `claude-4-6-sonnet` exceeds 5 MB. |

## Legal

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Cortex Analyst administrator monitoring
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/admin-observability.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Analyst administrator monitoring

To improve the quality of answers provided by Cortex Analyst, you must continue to refine the semantic model or view.
To help you refine the model or view, Cortex Analyst logs requests to an event table in the Snowflake database.

The logs include the following:

* The user who asked the question
* The question asked
* Generated SQL
* Errors and/or warnings
* Request and response bodies
* Other metadata

There is a small lag, on the order of 1-2 minutes, between a request being made and it being visible in the view.

## Accessing logs

You can view these logs in the Monitoring tab of the Semantic View within Snowsight.
In order to view the logs, users must have the SELECT privilege on referenced tables, in
addition to:

* MONITOR or OWNERSHIP on the semantic view (when using semantic views)
* WRITE privilege on the stage (for semantic models stored in a file on a stage)

Alternatively, you can query the logs directly from the Snowflake database using SQL,
depending on your privileges.

## Querying logs with SQL

Call the SNOWFLAKE.LOCAL.CORTEX_ANALYST_REQUESTS table function to retrieve logs for a specific semantic model or view. This
table function performs access control checks to ensure that the caller has required privileges to access the request data.

The following is an example of how to call the function:

```sqlsyntax
SELECT * FROM TABLE(
  SNOWFLAKE.LOCAL.CORTEX_ANALYST_REQUESTS(
    '<semantic_model_or_view_type>',
    '<semantic_model_or_view_name>'
  )
);
```

When calling this function, pass in the following arguments:

* `semantic_model_or_view_type`: Specify the type of semantic model or view used in the requests:

  + For a semantic model defined in a file on a stage, specify `'FILE_ON_STAGE'`.
  + For a semantic view, specify `'SEMANTIC_VIEW'`.
* `semantic_model_or_view_name`: Specify the location where the semantic model or view is defined:

  + For a semantic view defined in a file on a stage, specify the fully qualified path to the semantic view specification file
    (for example, `@my_db.my_schema.my_stage/path/to/file.yaml`).
  + For a semantic view, specify the fully qualified name of the semantic view.

Returns: A table with all API requests for the specified semantic model or view.

If a query was made using inline YAML (instead of a semantic view or a file on stage), the request will not be accessible via
the table function, but will be visible in the view and event table detailed below.

If you are using a role that has been granted the SNOWFLAKE.CORTEX_ANALYST_REQUESTS_ADMIN or SNOWFLAKE.CORTEX_ANALYST_REQUESTS_VIEWER application role, you can query the
[SNOWFLAKE.LOCAL.CORTEX_ANALYST_REQUESTS_V](../../../sql-reference/local/cortex_analyst_requests_v.md) view. This view includes all requests
to Cortex Analyst across all semantic models and views.

You can also query the raw event data in the SNOWFLAKE.LOCAL.CORTEX_ANALYST_REQUESTS_RAW event table. The responses are in the
[OpenTelemetry format](https://opentelemetry.io/docs/specs/otel/). The SNOWFLAKE.LOCAL.CORTEX_ANALYST_REQUESTS_V view contains the same data, formatted and processed for human readability.

---
title: Cortex Analyst REST API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/rest-api.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Analyst REST API

Use this API to answer questions about your data with natural language queries.

## Send message

`POST /api/v2/cortex/analyst/message`

Generates a SQL query for the given question using a semantic model or [semantic view](../../views-semantic/overview.md)
provided in the request. One or more models can be specified; when multiple models are specified, Cortex Analyst chooses the most appropriate one.
You can have multi-turn conversations where you can ask follow-up questions that build upon previous queries. For more information, see [Multi-turn conversation in Cortex Analyst](../cortex-analyst.md).

The request includes a user question; the response includes the user question and the analyst response. Each message in a response
can have multiple content blocks of different types. Three values that are currently supported for the `type` field of the content
object are: `text`, `suggestions`, and `sql`.

Responses can be sent all at once after processing is complete, or incrementally as they are generated.

### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authenticating to the server](../../../developer-guide/sql-api/authenticating.md). |
| `Content-Type` | (Required) application/json |
| `X-Snowflake-Authorization-Token-Type` | (Optional) Authorization token type. Defaults to OAuth. For more information, see [Authenticating to the server](../../../developer-guide/sql-api/authenticating.md). |

### Request body

In the request body:

* Set the last `messages[].role` field to the role of the speaker, which must be `user`.
* Include the user’s question in the `content` object. In this object:

  + Set `type` to `text`.
  + Set `text` to the user’s question.
* Include one of the following:

  + The [YAML specification](../../views-semantic/semantic-view-yaml-spec.md) for a semantic view.
  + The path to the YAML file that contains the semantic view specification. This file must be on a stage.
  + The name of the semantic view.

The following table describes the fields that you can set in the body of the request:

| Field | Description |
| --- | --- |
| `messages[].role` | (Required) The role of the entity that is creating the message. Currently only supports `user`.  Type: string:enum  Example: `user` |
| `messages[].content[]` | (Required) The content object that is part of a message.  Type: object  Example:  ```json {   "type": "text",   "text":  "Which company had the most revenue?" } ``` |
| `messages[].content[].type` | (Required) The content type. Currently only `text` is supported.  Type: string:enum  Example: `text` |
| `messages[].content[].text` | (Required) The user’s question.  Type: string  Example: `Which company had the most revenue?` |
| `semantic_model_file` | Path to the semantic model YAML file. Must be a fully qualified stage URL including the database and schema.  To specify multiple semantic models, use the `semantic_models` field.  If you want to provide the YAML specification directly in the request instead, set the `semantic_model` field to the YAML specification for the semantic model.  Type: string  Example: `@my_db.my_schema.my_stage/my_semantic_model.yaml` |
| `semantic_model` | A string containing the entire semantic model YAML.  To specify multiple semantic models, use the `semantic_models` field instead.  If you want to point to a YAML specification in a file instead, upload the file to a stage, and set the `semantic_model_file` field to the path to the file.  Type: string |
| `semantic_models` | An array containing JSON objects, each of which contains a `semantic_model_file` or `semantic_view` field.  These fields have the same semantics as the top-level `semantic_model_file` and `semantic_view` fields:   * `semantic_model_file` specifies a YAML file, stored in a stage, that contains a semantic model definition.   (You cannot specify the YAML for the semantic model directly in the request with this form.) * `semantic_view` specifies the fully qualified name of a [semantic view](../../views-semantic/overview.md).   For example:  ```json   {     /* ... */     "semantic_models": [       {"semantic_view": "my_db.my_sch.my_sem_view_1" },       {"semantic_view": "my_db.my_sch.my_sem_view_2" }     ]     /* ... */   }   ```   For each query, Cortex Analyst chooses the most appropriate model or view from the list.  This capability simplifies user interactions with Cortex Analyst. You don’t need to choose a data source to query, and you don’t need to keep track of which semantic model or semantic view to use for each. Just specify all of your models or views with each query and let Cortex Analyst figure out which one to use.  Type: array  **Tip:** Cortex Analyst does not require that you specify more than one model or view. If you specify a single model or view, the request is functionally equivalent to one containing a top-level `semantic_model_file` or `semantic_view` field.  The advantage of using `semantic_models` for single-model requests is that you can use the same client code, regardless of the number of models or views. |
| `semantic_view` | Fully qualified name of the [semantic view](../../views-semantic/overview.md). For example:  ```json {   /* ... */   "semantic_view": "MY_DB.MY_SCHEMA.SEMANTIC_VIEW"   /* ... */ } ```  If the name is case-sensitive or contains characters that are not allowed in an [unquoted identifier](../../../sql-reference/identifiers-syntax.md), you must enclose the name in backslash-escaped double quotes. For example, if the database name, schema name, and view name include hyphens (`my-database.my-schema.my-semantic-view`):  ```json {   /* ... */   "semantic_view": "\"my-database\".\"my-schema\".\"\"my-semantic-view\"\""   /* ... */ } ```  To specify multiple semantic views, use the `semantic_models` field.  Type: string |
| `stream` | (Optional) If set to `true`, the response is streamed to the client using [server-sent events](https://developer.mozilla.org/en-US/docs/Web/API/Server-sent_events) as it is generated (see Streaming response). Otherwise the complete response is returned after Cortex Analyst has fully processed the user’s question.  Type: boolean |

> **Important:**
>
> You must specify one of the following fields in the body of the request:
>
> * `semantic_model_file`
> * `semantic_model`
> * `semantic_models`
> * `semantic_view`

#### Example of specifying a semantic model in a file on a stage

```json
{
    "messages": [
        {
            "role": "user",
            "content": [
                {
                    "type": "text",
                    "text": "which company had the most revenue?"
                }
            ]
        }
    ],
    "semantic_model_file": "@my_db.my_schema.my_stage/my_semantic_model.yaml"
}
```

#### Example of specifying a semantic view

```json
{
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "which company had the most revenue?"
        }
      ]
    }
  ],
  "semantic_view": "MY_DB.MY_SCH.MY_SEMANTIC_VIEW"
}
```

### Non-streaming response

This operation can return the response codes listed below.
The response always has the following structure. Currently, three content types are supported for the
response, `text`, `suggestion`, and `sql`. The content types `suggestion` and `sql` are mutually exclusive so that if the
response contains a `sql` content type, it won’t contain a `suggestion` content type, and vice versa. The `suggestion` content type is only included
in a response if the user question was ambiguous and Cortex Analyst could not return a SQL statement for that query.

When the request contains a `semantic_models` field, the response includes a `semantic_model_selection` field that indicates
which semantic model was chosen for the request.

To ensure forward compatibility, make sure your implementation takes the content type into account and handles types.

| Code | Description |
| --- | --- |
| 200 | The statement was executed successfully.  The body of the response contains a message object that contains the following fields:   * `message`: Messages of the conversation between the user and analyst. * `message` (object): Represents a message within a chat. * `message.role` (string:enum): The entity that produced the message. One of `user` or `analyst`. * `message.content[]` (object): The content object that is part of a message. * `message.content[].type` (string:enum): The content type of the message. One of `text`, `suggestion`, or `sql`. * `message.content[].text` (string): The text of the content. Only returned for content type `text`. * `message.content[].statement` (string): A SQL statement. Only returned for content type `sql`. * `message.content[].confidence` (object): Contains confidence-related information. Only returned for the `sql` content type. * `message.content[].confidence.verified_query_used` (object): Represents the verified query from Verified Query Repository used in SQL response generation. If no verified query used, the field value is `null`. * `message.content[].confidence.verified_query_used.name` (string): The name of the verified query used, extracted from the Verified Query Repository. * `message.content[].confidence.verified_query_used.question` (string): The question that is answered by the verified query, extracted from the Verified Query Repository. * `message.content[].confidence.verified_query_used.sql` (string): The SQL statement of the verified query used, extracted from the Verified Query Repository. * `message.content[].confidence.verified_query_used.verified_at` (integer): The numeric representation of the timestamp when the query is verified, extracted from the Verified Query Repository. * `message.content[].confidence.verified_query_used.verified_by` (string): The person who verified the query, extracted from the Verified Query Repository. * `message.content[].suggestions` (string): If SQL cannot be generated, a list of questions the semantic model can   generate SQL for. Only returned for content type `suggestion`. * `warnings`: List of warnings from the analyst about the user’s request. * `warnings[].message` (string): Contains a detailed description of one individual warning. * `response_metadata` (object): Metadata containing response generation details. * `response_metadata.model_names`: List of models used to generate response. * `response_metadata.cortex_search_retrieval` (object): Entities resolved with cortex search. * `response_metadata.question_category` (string): How the question in the request is categorized. |

By default, the response is returned all at once after Cortex Analyst has fully processed the user’s question. See Streaming response
for the format of streaming mode responses.

> ```json
> {
>     "request_id": "75d343ee-699c-483f-83a1-e314609fb563",
>     "message": {
>         "role": "analyst",
>         "content": [
>             {
>                 "type": "text",
>                 "text": "We interpreted your question as ..."
>             },
>             {
>                 "type": "sql",
>                 "statement": "SELECT * FROM table",
>                 "confidence": {
>                     "verified_query_used": {
>                         "name": "My verified query",
>                         "question": "What was the total revenue?",
>                         "sql": "SELECT * FROM table2",
>                         "verified_at": 1714497970,
>                         "verified_by": "Jane Doe"
>                     }
>                 }
>             }
>         ]
>     },
>     "warnings": [
>         {
>             "message": "Table table1 has (30) columns, which exceeds the recommended maximum of 10"
>         },
>         {
>             "message": "Table table2 has (40) columns, which exceeds the recommended maximum of 10"
>         }
>     ],
>     "response_metadata": {
>         "model_names": [
>             "claude-3-5-sonnet"
>         ],
>         "cortex_search_retrieval": [
>             {
>                 "service": "my_db.my_schema.my_search_service",
>                 "response_body": {
>                     "results": [
>                         {
>                             "CUST_NAME": "customer1"
>                         }
>                     ],
>                     "request_id": "request1"
>                 },
>                 "query": "'customer1'"
>             }
>         ],
>         "question_category": "CLEAR_SQL"
>     }
> }
> ```

### Streaming response

Streaming mode lets your client receive responses as they are generated by Cortex Analyst, rather than waiting for the entire response to be generated.
This improves the perceived responsiveness of your application, especially for long-running queries, because users begin seeing output much sooner.
Streaming responses also provide status information that can help you understand where Cortex Analyst is in the process of generating a response, and
warnings that can help understand what went wrong when Cortex Analyst doesn’t work as you expected.

To receive a streaming response, set the `stream` field in the request body to `true`.
Streaming responses use [server-sent events](https://developer.mozilla.org/en-US/docs/Web/API/Server-sent_events).

Cortex Analyst sends five distinct types of events in a streaming response:

* `status`: Conveys status updates about the SQL generation process.
* `message.content.delta`: Contains a piece of the response. This event is sent multiple times.
* `error`: Indicates that Cortex Analyst has encountered an error and cannot continue processing the request. No further `message.content.delta` events will be sent.
* `warnings`: Contains any warnings encountered during processing. Warnings do not stop processing.
* `response_metadata`: Sent at the end of a response to display data about request processing.
* `done`: Sent to indicate that processing is complete and no further `message.content.delta` events will be sent.

Of these, the `message.content.delta` events are the most crucial to understand, because they contain the actual
response content. Each `delta` contains tokens from some field in the complete response. It is possible for each
`delta` event to contain anywhere between a single character to the full response, and they may be of different lengths. You receive these tokens as they
are generated; it is up to you to assemble them into the final response.

> **Important:**
>
> Events from different responses (even extremely similar ones) can vary. There is no guarantee that events will be sent in the same order or with the same content.

#### Simple example

The following is a sample non-streaming response for a simple query:

```json
{
    "message": {
        "role": "analyst",
        "content": [
            {
                "type": "text",
                "text": "This is how we interpreted your question and this is how the sql is generated"
            },
            {
                "type": "sql",
                "statement": "SELECT * FROM table"
            }
        ]
    }
}
```

And this is one possible series of streaming events for that response (a different series of events is also possible):

```output
event: status
data: { status: "interpreting_question" }

event: message.content.delta
data: {
  index: 0,
  type: "text",
  text_delta: "This is how we interpreted your question"
}

event: status
data: { status: "generating_sql" }

event: status
data: { status: "validating_sql" }

event: message.content.delta
data: {
  index: 0,
  type: "text",
  text_delta: " and this is how the sql is generated"
}

event: message.content.delta
data: {
  index: 1,
  type: "sql",
  statement_delta: "SELECT * FROM table"
}

event: status
data: { status: "done" }
```

Use the `index` field in the `message.content.delta` respnoses to determine which field in the full response the event is part of.
For example, here the first two `delta` events use index 0, which means they are part of the first field (element 0) in the `content` array
of the non-streaming response. Similarly, the `delta` event that contains the SQL response uses index 1.

#### Example with suggestions

This example contains suggested questions for an ambiguous question. The following is the non-streaming response:

```json
{
    "message": {
        "role": "analyst",
        "content": [
            {
                "type": "text",
                "text": "Your question is ambigous, here are some alternatives:"
            },
            {
                "type": "suggestions",
                "suggestions": [
                    "which company had the most revenue?",
                    "which company placed the most orders?"
                ]
            }
        ]
    }
}
```

And here is a possible series of streaming events that constitute that response:

```output
event: status
data: { status: "interpreting_question" }

event: message.content.delta
data: {
  index: 0,
  type: "text",
  text_delta: "Your question is ambigous,"
}

event: status
data: { status: "generating_suggestions" }

event: message.content.delta
data: {
  index: 0,
  type: "text",
  text_delta: " here are some alternatives:"
}

event: message.content.delta
data: {
  index: 1,
  type: "suggestions",
  suggestions_delta: {
    index: 0,
    suggestion_delta: "which company had",
  }
}

event: message.content.delta
data: {
  index: 1,
  type: "suggestions",
  suggestions_delta: {
    index: 0,
    suggestion_delta: " the most revenue?",
  }
}

event: message.content.delta
data: {
  index: 1,
  type: "suggestions",
  suggestions_delta: {
    index: 1,
    suggestion_delta: "which company placed",
  }
}

event: message.content.delta
data: {
  index: 1,
  type: "suggestions",
  suggestions_delta: {
    index: 1,
    suggestion_delta: " the most orders?",
  }
}

event: status
data: { status: "done" }
```

In this example, the `content` field of the non-streaming response is an array. One of the elements of `content` is the `suggestions` array.
So the meaning of `index` fields for `text` and `suggestions` delta events refer to the location of elements in these two different arrays.
You will need to keep track of these indexes separately when assembling the full response.

> **Note:**
>
> Currently, the generated SQL statement is always sent in a single event. This may not be the case in the future. Your client must be prepared to
> receive the SQL statement in multiple events.

#### Other examples

You can find a Streamlit streaming client for Cortex Analyst in the Cortex Analyst
[GitHub repo](https://github.com/Snowflake-Labs/sfguide-getting-started-with-cortex-analyst/blob/main/cortex_analyst_streaming_demo.py).
This demo must be run locally; SiS does not currently support streaming.

See the Cortex Analyst playground in the AI/ML Studio (in Snowsight) for an interactive demonstration of streaming response.

### Streaming event schemas

The following are the OpenAPI/Swagger schemas of the events sent by Cortex Analyst in a streaming response.

status

message.content.delta

error
:   ```none
    StreamingError:
    type: object
    properties:
      message:
        type: string
        description: A description of the error
      code:
        type: string
        description: The Snowflake error code categorizing the error
      request_id:
        type: string
        description: Unique request ID
    ```

warnings
:   ```none
    Warnings:
    type: object
    description: Warnings found while processing the request
    properties:
      warnings:
        type: array
        items:
          $ref: "#/components/schemas/Warning"
    Warning:
    type: object
    title: The warning object
    description: Represents a warning within a chat.
    properties:
      message:
        type: string
        description: A human-readable message describing the warning
    ```

response_metadata
:   ```none
    ResponseMetadata:
    type: object
    description: Details about request processing
    ```

## Send feedback

`POST /api/v2/cortex/analyst/feedback`

Provides qualitative end-user feedback. Within Snowsight, the feedback is shown in Snowsight → AI & ML → Cortex Analyst → Select Semantic View → Monitoring tab.

### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authenticating to the server](../../../developer-guide/sql-api/authenticating.md). |
| `Content-Type` | (Required) application/json |

### Request body

| Field | Description |
| --- | --- |
| `request_id` | (Required) The id of the request that you’ve made to send a message. Returned in the `request_id` field of `/api/v2/cortex/analyst/message`. For more information, see Non-streaming response.  Type: string  Example: `75d343ee-699c-483f-83a1-e314609fb563` |
| `positive` | (Required) Whether the feedback is positive or negative. `true` for positive or “thumbs up”, `false` for negative or “thumbs down”.  Type: boolean  Example:  `true` |
| `feedback_message` | (Optional) The feedback message from the user.  Example: `This is the best answer I've ever seen!` |

### Response

Empty response body with status code 200.

## Access control requirements

For information on the required privileges, see [Access control requirements](../cortex-analyst.md).

For details about authenticating to the API, see [Authenticating Snowflake REST APIs with Snowflake](../../../developer-guide/snowflake-rest-api/authentication.md).

---
title: Cortex Analyst Verified Query Repository
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/verified-query-repository.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Analyst Verified Query Repository

The Cortex Analyst Verified Query Repository (VQR) can help improve the accuracy and trustworthiness of results by
providing a collection of questions and corresponding SQL queries to answer them. Cortex Analyst then leverages
relevant SQL queries from the repository when answering similar questions. You can specify verified queries in your
semantic model YAML file.

> **Important:**
>
> Verified SQL queries must use the names of the logical tables and columns defined in the semantic model, not those
> in the underlying dataset. See the example query and its discussion
> for more information.

Verified queries are specified in the `verified_queries` section of the semantic model, as shown here.

```yaml
verified_queries:

# Verified Query 1
- name:                         # A descriptive name of the query.
  question:                     # The natural language question that this query answers.
  verified_at:                  # Optional: Time (in seconds since the UNIX epoch, January 1, 1970) when the query was verified.
  verified_by:                  # Optional: Name of the person who verified the query.
  use_as_onboarding_question:   # Optional: Marks this question as an onboarding question for the end user.
  sql:                          # The SQL query for answering the question.

# Verified Query 2
- name:
  question:
  verified_at:
  verified_by:
  use_as_onboarding_question:
  sql:
```

Below is a sample semantic model that includes a verified query.

```yaml
name: Sales Data
tables:
- name: sales_data
  base_table:
    database: sales
    schema: public
    table: sd_data

  dimensions:
    - name: state
      description: The state where the sale took place.
      expr: d_state
      data_type: TEXT
        unique: false
        sample_values:
          - "CA"
          - "IL"

    # Time dimension columns in the logical table.
    time_dimensions:
      - name: sale_timestamp
        synonyms:
          - "time_of_sale"
          - "transaction_time"
        description: The time when the sale occurred. In UTC.
        expr: dt
        data_type: TIMESTAMP
        unique: false

    # Measure columns in the logical table.
    measures:
      - name: profit
        synonyms:
          - "earnings"
          - "net income"
        description: The profit generated from a sale.
        expr: amt - cst
        data_type: NUMBER
        default_aggregation: sum

verified_queries:
  - name: "California profit"
    question: "What was the profit from California last month?"
    verified_at: 1714497970
    verified_by: Jane Doe
    use_as_onboarding_question: true
    sql: "
SELECT sum(profit)
FROM __sales_data
WHERE state = 'CA'
    AND sale_timestamp >= DATE_TRUNC('month', DATEADD('month', -1, CURRENT_DATE))
    AND sale_timestamp < DATE_TRUNC('month', CURRENT_DATE)
"
```

In the example above, `__sales_data` corresponds to the `sales_data` table defined in the
model. To avoid name conflicts, the name of the logical table is prefixed with two underscores. The columns used
in the query (`state`, `sale_timestamp`, and `profit`) are the logical columns defined in the model’s
`sale_data` table. The names of the underlying columns (`d_state`, `dt`, `amt`, and `cst`) are not used
directly in the query.

As illustrated in the example, the question doesn’t need to be a complete sentence, or actually in the form of a
question, but it should reflect something a user might ask. Ensure that the SQL queries are syntactically correct and
actually answer the posed questions; this is the essence of a “verified query.” Invalid or inaccurate queries can
negatively impact Cortex Analyst’s performance and accuracy.

> **Tip:**
>
> Use the open-source semantic model generator app, described in the next section, to help add verified queries to
> your semantic model, without needing to concern yourself with SQL or YAML syntax.

## Adding verified queries using the semantic model generator

Snowflake provides an open-source Streamlit app to help add verified queries to your model. To
install and use this app, follow these instructions.

1. **Clone the repository.** Start by cloning the [semantic-model-generator](https://github.com/Snowflake-Labs/semantic-model-generator)
   repository.
2. **Configure credentials and install the app.** Follow the setup instructions in the repo’s [README](https://github.com/Snowflake-Labs/semantic-model-generator/blob/main/README.md) to provide your Snowflake credentials and run the app either on Snowflake or locally.
3. **Configure the app.** Once the app is running, enter the database, schema, and stage location of your semantic model YAML
   file into the provided fields. The YAML file will appear in an interactive editor on the left side of the window.
4. **Generate a Query.** On the right side of the window, use the chat interface to ask a question that will generate a SQL
   query.
5. **Verify and Save the Query.**

> * Inspect the generated query and the results it produces. If it works as expected, select the Save as verified query
>   button below the assistant’s answer to add the query to your semantic model.
> * If the generated query is incorrect, select the Edit button to modify the query. Run the modified query to check if
>   it produces the intended results. Continue editing and testing until the query works as desired. Then select
>   Save as verified query to add it to your semantic model.

6. **Update the Semantic Model.** Select the Save button in the bottom left of the window to update the semantic model.
   Repeat the process to add more queries.
7. **Upload the new YAML file.** Once you’re satisfied with the queries you’ve added, select the Upload button, enter a file
   name for your new YAML file, and select Submit Upload.

When you return to your stage in Snowsight, you’ll see the new semantic model YAML file with your verified queries.

## Adding suggested Cortex Analyst Verified Query entries

Cortex Analyst also provides the Verified Query Suggestion interface in Snowsight, which offers potential new verified queries based on user behavior. For information about adding verified query suggestions, see [Suggestions for semantic models and views](verified-query-suggestions.md).

## Viewing verified queries used in the Cortex Analyst response

When the user’s question is similar to a query in the Verified Query Repository (VQR), Cortex Analyst uses that query to generate
the SQL query in its response. To see which verified query was used, see the [confidence field](rest-api.md)
in the API response.

---
title: Cortex Knowledge Extensions
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Knowledge Extensions

## Overview

Cortex Knowledge Extensions (CKEs) are
[Cortex Search Services](../cortex-search/cortex-search-overview.md) that can be shared on the
[Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace) or via
[private listings](../../../collaboration/provider-listings-creating-publishing.md)
or [organizational listings](../../collaboration/listings/organizational/org-listing-about.md). They can be used in a
retrieval-augmented generation (RAG) architecture to integrate licensed and proprietary content into Cortex AI applications. For
example, CKEs can be used to integrate knowledge from unstructured content, such as articles, market research, books, or forum
posts, into Cortex AI applications, such as chatbots and agentic systems.

## How CKE works

Here’s how it works:

1. A Provider uploads their text data into a table in their account and creates a [Cortex Search Service](../cortex-search/cortex-search-overview.md) on the table. This Cortex Search Service is then shared on the on the [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace). A Cortex Search Service that is shared on the Snowflake Marketplace is known as a Cortex Knowledge Extension (CKE).
2. A Consumer builds an application leveraging Cortex AI, such as a chatbot, using [Cortex AI Functions](../aisql.md) or the [Cortex Agent API](../cortex-agents.md) with the CKE.
3. When a prompt is given to the Cortex AI application that is integrated with a CKE, the prompt is passed on to the CKE to get relevant knowledge by performing a semantic search. The relevant knowledge is given back to the Cortex AI applications’s LLM and reasoned over before returning an answer back to the user with citations and attribution.

## CKE features

Some of the key features of Cortex Knowledge Extensions include:

* Content protection
* Management
* Trial support
* Monetization

Each of these features is described in more detail below.

### Content protection

Providers can limit the percentage of indexed content in their CKE that can be returned to their consumers within a rolling 24-hour period. This is done by setting a threshold using the commands below. The threshold is not applied at the individual document level, but rather across the entire corpus of indexed content. Consumers will only be able to access the threshold percentage of the indexed content in the CKE.

Refer to the [Listing manifest reference](../../../progaccess/listing-manifest-reference.md) for more information about the
`cke_content_protection` field.

```sqlexample
-- Use CREATE to create a new CKE listing with content protection.
-- Use ALTER to update an existing listing with content protection.

-- This example creates a CKE listing targeting to two accounts.
CREATE EXTERNAL LISTING cke_listing
SHARE cke_share AS
$$
title: "CKE Listing Title"
description: "Cortex Knowledge Extension Listing Description"
listing_terms:
  type: "STANDARD"
auto_fulfillment:
  refresh_type: "SUB_DATABASE"
  refresh_schedule: "1440 MINUTE"
targets:
  accounts:
    - "ORG1.ACCOUNT1"
    - "ORG2.ACCOUNT2"
cke_content_protection:
  enable: true,
  threshold: 0.2
$$

-- DESCRIBE LISTING cke_listing
-- See the manifest_yaml column for the cke_content_protection setting
```

When the threshold has been hit by a consumer, queries to the CKE are blocked from executing, and the consumer receives the following error:

```output
You have reached the content protection threshold. Please try again later.
```

The consumer can re-query the data when the threshold refreshes.

### Management

To see the number of queries that the CKE executed, sign in to [Snowsight](../../ui-snowsight-gs.md). In the navigation menu, select Marketplace » Provider Studio » Home. The Analytics section shows the number of queries executed.

### Trial support

As a provider, you can offer customers a [limited trial](../../../collaboration/collaboration-listings-about.md) of your CKE so that they can try your product before they commit to paying for it.

### Monetization

Cortex Knowledge Extensions can be monetized using the on-platform [Snowflake Marketplace Monetization](../../../collaboration/provider-becoming.md) capability via [subscriptions](../../../collaboration/provider-listings-pricing-model.md) or through [off-platform](../../../collaboration/provider-listings-creating-publishing.md) monetization.

## Region availability

Cortex Knowledge Extensions are available in any region where
[Cortex Search](../cortex-search/cortex-search-overview.md) is available.

## Key considerations

When customers use your Cortex Knowledge Extension, be careful when disabling serving of the [Cortex Search Service](../cortex-search/cortex-search-overview.md), as that will break customers’ applications.

For advanced tuning of a Cortex Knowledge Extension, refer to the [Cortex Search](../cortex-search/cortex-search-overview.md) documentation.

## Costs for CKE

Providers:

* Providers pay to host the Cortex Search Service in their account, including indexing, servicing, and replication to other regions. For more information about costs associated with Cortex Search Services, providers can refer to [Understanding cost for Cortex Search Services](../cortex-search/cortex-search-costs.md).

Consumers:

* If the CKE isn’t free, consumers pay the provider to access the CKE.
* If the CKE leverages a Cortex Agent, consumers pay for the Cortex Agent. For more information, see [Cost considerations](../cortex-agents.md) for Cortex Agents.

## Citations

To ensure that the CKE is providing citations, when you configure the [Cortex Search Services](../cortex-search/cortex-search-overview.md), make sure that you include a `SOURCE_URL` column that points to the source of the document in the indexed columns. This can be used by LLMs or Snowflake Intelligence to provide clear attribution and hyperlinks back to the source material.

## Publishing the CKE to the Snowflake Marketplace

After you create a Cortex Search Service that you want to publish to the Marketplace, [create a listing](../../../collaboration/provider-listings-creating-publishing.md). Make sure that you point to the Cortex Search Service object that you created as an object that you want to publish.

## Talking with the CKE

You can use the following methods to ask the CKE questions.

* Use the Cortex Search Playground:

  1. In Snowsight, in the navigation menu, select AI & ML » Cortex Search.
  2. Select the CKE from the Database/Schema drop down menu.
  3. Click on Playground in the upper-right corner.
  4. Type in a search query and see the results
* Use Snowflake Intelligence:

  + Follow the steps outlined in [Tutorial 3: Add a CKE to Snowflake Intelligence](tutorials/add-cke-to-snowflake-intelligence-tutorial.md).
* Use Cortex Agent API:

  + Use the Cortex Agent API, and specify the shared CKE in the [CREATE CORTEX SEARCH](../../../sql-reference/sql/create-cortex-search.md) parameter. Refer to the [Cortex Agent API](../cortex-agents.md) documentation for more information.

## Updating your CKE

Keeping a CKE up-to-date is a common use case for providers that regularly introduce new or updated content. To ensure your Cortex Knowledge Extension is up-to-date do the following:

1. Ensure that the underlying table with content has been updated via some separate process of inserting new / updated documents
   into your Snowflake account.
2. Review the Cortex Search Service target lag. The Cortex Search Service is configured to refresh and to keep the data fresh up
   to a certain `target_lag`. Refer to the Cortex Search
   [Use SQL](../cortex-search/cortex-search-overview.md) topic for more information about `target_lag`.
3. Run the following commands to ensure that the Cortex Search Service is indexing.

   ```sqlexample
   -- Get the status of the search service
   DESCRIBE CORTEX SEARCH SERVICE cke_simple_cortex_search_service;

   -- If the indexing status is suspended, you can resume it with the following command
   ALTER CORTEX SEARCH SERVICE cke_simple_cortex_search_service RESUME INDEXING;
   ```

## CKE and auto-fulfillment

Consumers can only access a Cortex Knowledge Extension made available in their region. Providers can automatically replicate their Cortex Search Service to remote consumer regions by [enabling auto-fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md) on their Cortex Knowledge Extension listing in Provider Studio.

## Limitations

* [Usage-based](../../../collaboration/provider-listings-pricing-model.md) billing with CKEs isn’t supported.
* CKEs are not supported in listings that have [Egress Cost Optimizer (ECO)](../../../collaboration/provider-listings-auto-fulfillment-eco.md) enabled.

  Providers should be aware of the cost implications for replication with listings that have a CKE.

  Adding a CKE to a listing that has ECO enabled will automatically turn off ECO. With ECO turned off, costs associated with the listing can increase. An email notification will also be sent to the provider indicating that ECO was turned off.

  Similarly, if a CKE is added to a listing that’s part of a replication group, then ECO will be turned off for all listings within that replication group. An email notification will be sent to the provider indicating that the ECO was turned off.

---
title: Cortex Playground
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-playground.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Playground

The Cortex Playground lets you compare text completions across the multiple large language models available in
Cortex AI. You can test language model responses across prompts and model settings, and perform side-by-side comparisons
of model outputs. With a few clicks, you can also connect the model to a Snowflake table to experiment directly on your
data. The Cortex Playground is purpose-built to help you easily test how different language models perform for your
use case before you decide which model to deploy into production.

The Cortex Playground supports all of the models available for the COMPLETE function that are available in your
account’s region. For the complete list of models, see [Model availability](aisql.md).

## Required privileges

The Cortex Playground requires the CORTEX_USER database role that includes the privileges to call Snowflake Cortex LLM functions.
For more information, see [Cortex LLM privileges](aisql.md).

## Get started with the Cortex Playground

The Cortex Playground is accessible from the Snowflake AI & ML Studio. You can access the studio from Snowsight as follows:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » AI Studio. The Cortex Playground appears among other the other Studio functions.
3. To open the playground, select Try.

## Test your prompt with a language model

Use the Cortex Playground to test prompts across different language models.

1. Select a warehouse. This warehouse is used to run the SQL command that calls the COMPLETE function.
2. Select a model from the dropdown menu at the top. The drop down menu includes only the models that are available in the region
   of the account being used.
3. Enter your prompt in the prompt box and select `Enter`.
4. The model output appears above the prompt box. You can select View Code to see and copy the SQL command used to process your prompt.

To try a different prompt or model, choose the desired model and enter a new prompt in the prompt box, then select `Enter`.

## Compare model outputs

To compare the output of your prompts between two different models or two different settings of the same model, use the Compare
feature.

### Compare two models

1. Select Compare in the top right corner.
2. Select different models for the two panels using the dropdown menu on each side.
3. Open the settings panel by selecting Change settings  next to Compare.
4. Select the Sync toggle to use the same settings for the two models.
5. Enter your prompt and select `Enter`. The output from the models you selected appears on each side.

### Compare settings for one model

1. Select Compare in the top right corner.
2. Select the same model for the two panels.
3. Open the settings panel by selecting Change settings  next to Compare.
4. Choose different settings for temperature, top_p or max_tokens for each tab to compare how the language model response
   changes with different model settings. For more details on these parameters, see
   [COMPLETE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/complete-snowflake-cortex.md).
5. You can also check Enable Cortex Guard to implement safeguards that filter out potentially inappropriate or unsafe large
   language model (LLM) responses. For more details on Cortex Guard, see [Cortex Guard](aisql.md).
6. Enter your prompt and select `Enter`. The output from the model for each set of settings appears on each side.

## Connect to Snowflake tables

You can connect the model to a Snowflake table with textual data that you want to test with text completion.

> **Note:**
>
> You can select only one column. The Cortex Playground returns at most 100 rows.

1. Select the + Connect your data button in the prompt box.
2. Select your Snowflake data source from the drop down menu.
3. Select the column with the textual data you want to test.
4. Select a column to use as a filter. You can use this column to select a record from your data source.
5. Select Done.
6. Select a record from your data source using the Select <filter column> field in the prompt box. You can select a record by
   scrolling or by searching for a term in the text data. To search, enter a term in the search box. The following example shows
   a filter column named **ID**. In this example, you could search for a particular ID number or enter a string to match the text data.
7. Enter a System Prompt and select `Enter` to see the model response. A system prompt provides instructions to the model on how
   to process the input text. For example, you might want the model to summarize the selected text or pull out keywords from it.

## Controlling settings

You can adjust model settings to compare how the language model response changes when provided with different temperature,
top_p, and max_tokens settings. To implement safeguards that filter out potentially inappropriate or unsafe
responses, select Enable Cortex Guard in the settings panel.

You can read more about how these settings potentially impact language model responses in the
[Controlling temperature and tokens](../../sql-reference/functions/complete-snowflake-cortex.md) page.

1. Select Change settings  to open the settings menu on the top right corner.
2. Check the box for the setting to adjust its value.
3. Try out prompts with different settings.

## Exporting a SQL query

To get a SQL query that includes the settings, such as temperature, that you’ve defined in the Cortex Playground,
select View Code after any model response. The displayed code can be executed from a
[worksheet](../ui-snowsight-worksheets-gs.md) or [notebook](../ui-snowsight/notebooks.md), or
automated for continuous execution using [streams and tasks](../data-pipelines-intro.md).
You can also use this code with a [dynamic table](../dynamic-tables-about.md).

> **Note:**
>
> Dynamic tables do not support incremental refresh with COMPLETE.

The following images show examples of the View SQL dialog.

---
title: Cortex Search
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Search

> Get started with Cortex Search
>
> [Try it in Snowsight](https://app.snowflake.com/_deeplink/#/cortex/search?utm_source=docs&utm_medium=growth&utm_campaign=-us-en-all&utm_content=-app-user-guide-snowflake-cortex-cortex-search-cortex-search-overview)

## Overview

Cortex Search enables low-latency, high-quality “fuzzy” search over your Snowflake data.
It powers a broad array of search experiences for Snowflake users
including [Retrieval Augmented Generation (RAG)](https://en.wikipedia.org/wiki/Prompt_engineering#Retrieval-augmented_generation)
applications leveraging Large Language Models (LLMs).

Cortex Search gets you up and running with a hybrid (vector and keyword) search engine on your text data in minutes, without having to
worry about embedding, infrastructure maintenance, search quality parameter tuning, or ongoing index refreshes. This means you can spend
less time on infrastructure and search quality tuning, and more time developing high-quality chat and search
experiences using your data. Check out the [Cortex Search tutorials](overview-tutorials.md)
for step-by-step instructions on using Cortex Search to power AI chat and search applications.

## When to use Cortex Search

The two primary use cases for Cortex Search are retrieval augmented generation (RAG) and enterprise search.

* **RAG engine for LLM chatbots**: Use Cortex Search as a RAG engine for chat applications with your
  text data by leveraging semantic search for customized, contextualized responses.
* **Enterprise search**: Use Cortex Search as a backend for a high-quality search bar embedded in your application.

### Cortex Search for RAG

Retrieval augmented generation (RAG) is a technique for retrieving data from a knowledge base to enhance the generated
response of a large language model. The following architecture diagram shows how you can combine Cortex Search with
[Cortex LLM Functions](../aisql.md) to create
enterprise chatbots with RAG using your Snowflake data as a knowledge base.

Cortex Search is the retrieval engine that provides the Large Language Model with the context it needs
to return answers that are grounded in your most up-to-date proprietary data.

## Example: Create and query a Cortex Search service

This example takes you through the steps of creating a Cortex Search Service and querying
it using the REST API. Refer to the [Querying a Cortex Search Service](query-cortex-search-service.md) topic for
more details about querying the service.

This example uses a sample customer support transcript dataset.

Run the following commands to setup the example database and schema.

```sqlexample
CREATE DATABASE IF NOT EXISTS cortex_search_db;

CREATE OR REPLACE WAREHOUSE cortex_search_wh WITH
   WAREHOUSE_SIZE='X-SMALL';

CREATE OR REPLACE SCHEMA cortex_search_db.services;
```

Run the following SQL commands to create the dataset.

```sqlexample
CREATE OR REPLACE TABLE support_transcripts (
    transcript_text VARCHAR,
    region VARCHAR,
    agent_id VARCHAR
);

INSERT INTO support_transcripts VALUES
    ('My internet has been down since yesterday, can you help?', 'North America', 'AG1001'),
    ('I was overcharged for my last bill, need an explanation.', 'Europe', 'AG1002'),
    ('How do I reset my password? The email link is not working.', 'Asia', 'AG1003'),
    ('I received a faulty router, can I get it replaced?', 'North America', 'AG1004');
```

### Create the service

You can create a Cortex Search Service with a single SQL query or from the Snowflake AI & ML Studio. When you create a
Cortex Search Service, Snowflake performs transformations on your source data to get it ready for low-latency serving. The following
sections show how to create a service using both SQL and in the Snowflake AI & ML Studio in Snowsight.

> **Note:**
>
> When you create a search service, the search index is built as part of the create process. This means the CREATE CORTEX SEARCH SERVICE
> statement may take longer to complete for larger datasets.

#### Use SQL

The following example demonstrates how to create a Cortex Search Service
with [CREATE CORTEX SEARCH SERVICE](../../../sql-reference/sql/create-cortex-search.md) on the sample customer support transcript dataset created in the previous section.

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE transcript_search_service
  ON transcript_text
  ATTRIBUTES region
  WAREHOUSE = cortex_search_wh
  TARGET_LAG = '1 day'
  EMBEDDING_MODEL = 'snowflake-arctic-embed-l-v2.0'
  AS (
    SELECT
        transcript_text,
        region,
        agent_id
    FROM support_transcripts
);
```

This command triggers the building of the search service for your data. In this example:

> * Queries to the service will search for matches in the `transcript_text` column.
> * The `TARGET_LAG` parameter dictates that the Cortex Search Service will check for updates to the
>   base table `support_transcripts` approximately once per day.
> * The columns `region` and `agent_id` will be indexed so that they can be returned along with
>   results of queries on the `transcript_text` column.
> * The column `region` will be available as a filter column when querying the `transcript_text` column.
> * The warehouse `cortex_search_wh` will be used for materializing the results of the specified query initially
>   and each time the base table is changed.

> **Note:**
>
> * Depending on the size of the warehouse specified in the query and the number of rows in
>   your table, this CREATE command may take up to several hours to complete.
> * Snowflake recommends using a dedicated warehouse of size no larger than MEDIUM for each service.
> * Columns in the ATTRIBUTES field must be included in the source query, either via
>   explicit enumeration or wildcard, ( `*` ) .

#### Use Snowsight

Follow these steps to create a Cortex Search Service in Snowsight:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. Choose a role that is granted the SNOWFLAKE.CORTEX_USER database role.
3. In the navigation menu, select AI & ML » Cortex Search.
4. Select Create.
5. Select a role and warehouse.

   The role must be granted the SNOWFLAKE.CORTEX_USER database role. The warehouse is used for
   materializing the results of the source query when the service is created and refreshed.
6. Select a database and schema in which the service is defined.
7. Enter a name for your service, then select Next.
8. Select data to be indexed.

   * To select a table or view, select Table or view.

     Select the table or view that contains the text data to be indexed for searching, then select Next. For example, select
     the `support_transcripts` table.
   * To select files from a stage, select Stage. (Preview)

     Select the stage that contains the files to be indexed for searching, then select Next.
   > **Note:**
   >
   > If you want to specify multiple data sources or perform transformations when defining your service,
   > use SQL.
9. If you selected Table or view:

   * Select the columns you want included in the search results, for example, `transcript_text`, `region`, and `agent_id`, then select Next.
   * Select the column that will be searched, for example, `transcript_text`, then select Next.
   * If you want to be able to filter your search results based on particular columns, select those columns, then select Next.
     If you don’t need any filters, select Skip this option.

   If you selected Stage (Preview):

   * Select the destination for your processed data, then select Next.
10. Select the configuration parameters for the service.

    Set your target lag, which is the amount of time your service content should lag behind updates to the base data, then select Create.

The final step confirms that your service has been created and displays the service name and its data source.

> **Note:**
>
> When you create the service from Snowsight, the name of the service is double-quoted. For details on what that means when
> referencing the service in SQL, see [Double-quoted identifiers](../../../sql-reference/identifiers-syntax.md).

### Grant usage permissions

After the service and index are created, you can grant usage on the service, its database,
and schema to other roles like customer_support.

```sqlexample
GRANT USAGE ON DATABASE cortex_search_db TO ROLE customer_support;
GRANT USAGE ON SCHEMA services TO ROLE customer_support;

GRANT USAGE ON CORTEX SEARCH SERVICE transcript_search_service TO ROLE customer_support;
```

### Preview the service

To confirm that the service is populated with data properly, you can preview the service via the
[SEARCH_PREVIEW function](query-cortex-search-service.md) from a SQL environment:

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'cortex_search_db.services.transcript_search_service',
      '{
        "query": "internet issues",
        "columns":[
            "transcript_text",
            "region"
        ],
        "filter": {"@eq": {"region": "North America"} },
        "limit":1
      }'
  )
)['results'] as results;
```

Sample successful query response:

```json-object
[
  {
  "transcript_text" : "My internet has been down since yesterday, can you help?",
  "region" : "North America"
  }
]
```

This response confirms that the service is populated with data and serving reasonable results for the given query.

You can also use the [CORTEX_SEARCH_DATA_SCAN](../../../sql-reference/functions/cortex_search_data_scan.md) table function to inspect the contents of the service.

```sqlexample
SELECT
  *
FROM
  TABLE (
    CORTEX_SEARCH_DATA_SCAN (
      SERVICE_NAME => 'transcript_search_service'
    )
  );
```

```output
+ ---------------------------------------------------------- + --------------- + -------- + ------------------------------ +
|                      transcript_text                       |     region      | agent_id | _GENERATED_EMBEDDINGS_MY_MODEL |
| ---------------------------------------------------------- | --------------- | -------- | ------------------------------ |
| 'My internet has been down since yesterday, can you help?' | 'North America' | 'AG1001' | [0.1, 0.2, 0.3, 0.4]           |
| 'I was overcharged for my last bill, need an explanation.' | 'Europe'        | 'AG1002' | [0.1, 0.2, 0.3, 0.4]           |
+ ---------------------------------------------------------- + --------------- + -------- + ------------------------------ +
```

### Query the service from your application

Once you’ve created the search service, granted usage on it to your role, and previewed it, you can
now query it from your application using the [Python API](query-cortex-search-service.md).

The following code shows using the Python API to retrieving the support ticket most relevant to
a query about `internet issues`, filtered to return results in the `North America` region:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

CONNECTION_PARAMETERS = {"..."}

session = Session.builder.configs(CONNECTION_PARAMETERS).create()
root = Root(session)

transcript_search_service = (root
  .databases["cortex_search_db"]
  .schemas["services"]
  .cortex_search_services["transcript_search_service"]
)

resp = transcript_search_service.search(
  query="internet issues",
  columns=["transcript_text", "region"],
  filter={"@eq": {"region": "North America"} },
  limit=1
)
print(resp.to_json())
```

Sample successful query response:

```json-object
{
  "results": [
    {
      "transcript_text": "My internet has been down since yesterday, can you help?",
      "region": "North America"
    }
  ],
  "request_id": "5d8eaa5a-800c-493c-a561-134c712945ba"
}
```

Cortex Search Services return all columns specified in the `columns` field in your query.

## Required privileges

* To create a Cortex Search Service, your role must have the required privileges to use the Cortex embedding functions, which
  requires granting the [SNOWFLAKE.CORTEX_USER](../../../sql-reference/snowflake-db-roles.md) database role
  or the [SNOWFLAKE.CORTEX_EMBED_USER](../../../sql-reference/snowflake-db-roles.md) database
  role to the service creator role. You must also have the following privileges:

  + The CREATE CORTEX SEARCH SERVICE or OWNERSHIP privilege on the schema where you create the service.
  + The SELECT privilege on the underlying table(s) or view(s) that the service queries.
  + The USAGE privilege on the warehouse that refreshes the service.
* Change tracking must be enabled on all underlying objects used by a Cortex Search Service.
  For more information about change tracking requirements, see [Change Tracking Requirements](../../../sql-reference/sql/create-cortex-search.md).
* To query a Cortex Search Service, the role of the querying user must have USAGE privileges on the service itself,
  as well as on the database and schema in which the service resides. See [Cortex Search Access Control Requirements](query-cortex-search-service.md).
* To suspend or resume a Cortex Search Service using the ALTER command, the role of the querying user must have the OPERATE privilege on
  the service. See [ALTER CORTEX SEARCH SERVICE](../../../sql-reference/sql/alter-cortex-search.md).

> **Important:**
>
> Cortex Search Services perform searches with [owner’s rights](../../../developer-guide/stored-procedure/stored-procedures-rights.md) and follow
> the same security model as other Snowflake objects that run with owner’s rights. For more information, see
> [Cortex Search Access Control Requirements](query-cortex-search-service.md)

## Understanding Cortex Search quality

Cortex Search leverages an ensemble of retrieval and ranking models to provide you with a high level of search quality with little to no tuning required.
Under the hood, Cortex Search takes a “hybrid” approach to retrieving and ranking documents. Each search query utilizes:

* **Vector search** for retrieving semantically similar documents.
* **Keyword search** for retrieving lexically similar documents.
* **Semantic reranking** for reranking the most relevant documents in the result set.

This hybrid retrieval approach, coupled with a semantic reranking step, achieves high search quality across a broad range of datasets and queries.

You can customize the scoring of search results by applying numeric boosts, time decays, adjusting component weights, or disabling reranking. For more information, see [Customizing Cortex Search scoring](cortex-search-customize-scoring.md).

### Cortex Search Embedding Models

Cortex Search allows users to select a hosted embedding model to be leveraged in the vector search stage of retrieval.
The following embedding models are available in Cortex Search.

> **Important:**
>
> Model pricing varies. Canonical model pricing is available in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). If
> a price shown below differs from the price shown for the model in the Snowflake Service Consumption Table, the
> Snowflake Service Consumption table shall govern.

| Model name | Output Dimensions | Context window size (tokens) | Language support | Description |
| --- | --- | --- | --- | --- |
| `snowflake-arctic-embed-m-v1.5` (default) | 768 | 512 | English-only | Snowflake’s most practical, English-only embedding model. This open-source, 110M-parameter model yields the fastest indexing times of the available models in Cortex Search. For more information, see the [Arctic Embed 1.5 blog post](https://www.snowflake.com/en/engineering-blog/arctic-embed-m-v1-5-enterprise-retrieval) and [Arctic Embed 1.5 model card](https://huggingface.co/Snowflake/snowflake-arctic-embed-m-v1.5). |
| `snowflake-arctic-embed-l-v2.0` | 1024 | 512 | Multilingual | Snowflake’s price-performant multilingual embedding model with a context window of 512 tokens. This open-source, 568M-parameter model yields high quality on both English and non-English datasets. For more information, see the [Arctic Embed 2 blog post](https://www.snowflake.com/en/engineering-blog/snowflake-arctic-embed-2-multilingual/) and [Arctic Embed 2 model card](https://huggingface.co/Snowflake/snowflake-arctic-embed-l-v2.0). |
| `snowflake-arctic-embed-l-v2.0-8k` | 1024 | 8192 | Multilingual | Snowflake’s price-performant multilingual embedding model, with an increased context window of 8000 tokens. This open-source, 568M-parameter model yields high quality on both English and non-English datasets. |
| `voyage-multilingual-2` | 1024 | 32,000 | Multilingual | Voyage’s multilingual embedding model. This model yields high quality on both English and non-English datasets. For more information, see the [Voyage Multilingual 2 blog post](https://blog.voyageai.com/2024/06/10/voyage-multilingual-2-multilingual-embedding-model/) |

Some embedding models are only available in certain cloud regions for Cortex Search.
For an availability list by model by region, see Cortex Search Regional Availability.

Each model has different performance, cost, context window size, and quality characteristics. Carefully review the model specifications to determine the best
model for your specific workload. Refer to the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for most accurate view of each model’s cost in credits per million tokens.

### Tokens, model context windows, and text splitting

A token is a sequence of characters and is the smallest unit of text that can be processed by a large language model.
As an approximation, one token is equivalent to about 3/4 of an English word, or around 4 characters.
To calculate the number of tokens in a string, use the
[COUNT_TOKENS Cortex Function](../../../sql-reference/functions/count_tokens-snowflake-cortex.md). For example, calculating the
tokens for a string to be embedded with the `snowflake-arctic-embed-m-v1.5` model:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COUNT_TOKENS('snowflake-arctic-embed-m', '<input_text>') as token_count
```

Each vector embedding model supports a fixed size context window for text inputs, indicated in the preceding embedding
model table. During both indexing and serving, when the number of tokens in a value in the search column exceeds the
context window size, Cortex Search truncates the string to the size of the context window before embedding it into
vector space for semantic search. However, Cortex Search uses the full body of text for keyword-based retrieval.

Snowflake provides built-in functions to assist in splitting of text into smaller chunks. For more information, see
[SPLIT_TEXT_RECURSIVE_CHARACTER](../../../sql-reference/functions/split_text_recursive_character-snowflake-cortex.md).

For best search results with Cortex Search, Snowflake recommends splitting the text in your search column into chunks of no more than 512 tokens (about 385 English words). While there are longer-context embedding models available today, such as `snowflake-arctic-embed-l-v2.0-8k`, [research](https://www.snowflake.com/en/engineering-blog/impact-retrieval-chunking-finance-rag/) shows that *a smaller chunk size typically results in higher retrieval and downstream LLM response quality*. With smaller chunks, retrieval can be more precise for a given query and, in a retrieval-augmented generation (RAG) scenario, the downstream LLM receives text chunks that are more relevant to the query.

## Refreshes

The content served in a Cortex Search Service is based on the results of a specific query.
When the data underlying a Cortex Search Service changes, the service
updates to reflect those changes. These updates are referred to as a refresh. This process
is automated, and it involves analyzing the query that underlies the table.

Cortex Search Services have the same refresh properties as Dynamic Tables. See [Understanding dynamic table initialization and refresh](../../dynamic-tables-refresh.md) topic to
understand the refresh characteristics of a Cortex Search Service.

The source query for a Cortex Search Service must be a candidate for dynamic table incremental refresh. For details on those requirements,
see [Support for incremental refresh](../../dynamic-tables-limitations.md). This restriction is designed to prevent any unwanted runaway costs associated
with vector embedding computation. For more information about the constructs that are not supported for dynamic table incremental refresh,
see [Supported queries for dynamic tables](../../dynamic-tables-supported-queries.md).

### Primary keys

A primary key of a Cortex Search Service is an optional set of columns that uniquely identify each row in the source
query (that is, only one row has that exact combination of values in the designated columns). To be used with Cortex
Search Services, primary key columns must be of the [TEXT](../../../sql-reference/data-types-text.md) data type.

A primary key can be specified when creating the service as follows:

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE transcript_search_service
  ON transcript_text
  PRIMARY KEY (region, agent_id)
  WAREHOUSE = cortex_search_wh
  TARGET_LAG = '1 day'
  AS (
    SELECT
        transcript_text, region, agent_id
    FROM support_transcripts
);
```

The primary key columns of existing services can be modified with `ALTER CORTEX SEARCH SERVICE ... SET PRIMARY KEY (...)`.
For detailed syntax, see [ALTER CORTEX SEARCH SERVICE](../../../sql-reference/sql/alter-cortex-search.md).

Services with primary keys can make use of an optimized refresh path when data underlying the service changes.
This optimized path can result in significant reductions to the cost and latency of a refresh. With this optimization
enabled, the search service periodically compacts index information generated during a refresh. You can specify a target frequency for index refreshes by setting the
`FULL_INDEX_BUILD_INTERVAL_DAYS` property on the service. For syntax details, see [CREATE CORTEX SEARCH SERVICE](../../../sql-reference/sql/create-cortex-search.md) and [ALTER CORTEX SEARCH SERVICE](../../../sql-reference/sql/alter-cortex-search.md).

> **Note:**
>
> `FULL_INDEX_BUILD_INTERVAL_DAYS` is a soft target. Full rebuilds may occur more frequently than the specified interval to optimize serving performance based on factors such as service target lag, change rate in the service source data, and overall service size.

Queries to services with primary keys may also make use of the `@primarykey` [filter operator](query-cortex-search-service.md).

> **Important:**
>
> The set of primary key column values must be unique for each row in the source query. Duplicates are
> ignored in the resulting search index.

## Multi-index Cortex Search

Cortex Search can index multiple columns or use custom vector embeddings for queries, allowing you additional flexibility in how your Cortex Search Service interprets data and responds to user requests. You should use Multi-index Cortex Search when you have a use case that features one or more of:

* **Multiple search fields**: Users need to search across different fields of a record.
* **User-provided vector embeddings**: You have pre-computed vector embeddings for one or more columns prior to
  ingestion into the Cortex Search Service.
* **Mixed search types**: You want to support searching different fields with preference to a type of search.

  + Use *text indexes* for fields where exact or fuzzy keyword matches are important. Some examples are product codes, names, and categories.
  + Use *vector indexes* for fields with longer text content where semantic understanding is valuable. Examples include product descriptions, user reviews, and support cases.
* **Field-specific relevance**: Different fields of your data should contribute differently to relevance of a search result.

For example, for a product catalog search use case, you can create a multi-index service where:

* Product names and SKUs are *text indexes* for precise lexical matching.
* Product descriptions are *vector indexes* for semantic matching.
* Category and brand names are both text *and* vector indexes to support both lexical and semantic matches.

For examples of creating a multi-index Cortex Search service, see [CREATE CORTEX SEARCH SERVICE … TEXT INDEXES .. VECTOR INDEXES](../../../sql-reference/sql/create-cortex-search.md).
For examples of querying a multi-index service, see [Query a Cortex Search service - Multi-index queries](query-cortex-search-service.md).

### User-provided vector embeddings

Multi-index Cortex Search allows you to use pre-computed vector embeddings from any embedding model (including
open-source, commercial, and custom-trained models). Use user-provided vector embeddings when:

* You want to use an embedding model not natively available in Cortex Search, or you want to reuse embeddings you have
  already generate to reduce cost and improve performance.
* You want to combine your vector embeddings with Cortex Search text indexes for hybrid retrieval.

When you specify a bare column name in the VECTOR INDEXES clause, but do not specify a model, Cortex Search treats the
contents of the column as user-provided vector embeddings. User-provided vectors are indexed as-is and do not incur any
embedding cost.

> **Note:**
>
> You cannot load vectors directly into a Snowflake table. Instead, cast an array of numbers to the VECTOR data type when inserting or updating data in the source table for your Cortex Search Service.
> See [Vector conversion](../../../sql-reference/data-types-vector.md) for details and examples of how to do this.

Cortex Search chooses one of the following modes at search time, depending on whether you provide a query vector or query text in your search request:

| Mode | Index time | Query time |
| --- | --- | --- |
| Fully user-managed | Provide vectors in a VECTOR column | Provide a query vector via multi_index_query |
| User-managed with managed query embeddings | Provide vectors in a VECTOR column | Cortex Search embeds query text using the specified model |

## Suspension of indexing and serving

Much like Dynamic Tables, Cortex Search Services automatically suspend their indexing state when they encounter five
consecutive refresh failures related to the source query. If you encounter this failure for your service, you can view the specific SQL
error using either [DESCRIBE CORTEX SEARCH SERVICE](../../../sql-reference/sql/desc-cortex-search.md) or the [CORTEX_SEARCH_SERVICES view](../../../sql-reference/info-schema/cortex_search.md). The output from
both includes the following columns:

* The INDEXING_STATE column, which is SUSPENDED for a suspended service.
* The INDEXING_ERROR column, which contains the specific SQL error encountered in the source query.

Once the root issue is resolved, you can resume the service with `ALTER CORTEX SEARCH SERVICE <name> RESUME INDEXING`.
For detailed syntax, see [ALTER CORTEX SEARCH SERVICE](../../../sql-reference/sql/alter-cortex-search.md).

## Cost considerations

A Cortex Search Service incurs cost in the following ways:

| Category | Description |
| --- | --- |
| Virtual warehouse compute | A Cortex Search Service requires a [virtual warehouse](../../cost-understanding-compute.md) to refresh the service: to run queries against base objects when they are initialized and refreshed, including orchestrating text embedding jobs and building the search index. These operations use compute resources, which consume [credits](../../cost-understanding-compute.md). If no changes are identified during a refresh, virtual warehouse credits aren’t consumed since there’s no new data to refresh. |
| EMBED_TEXT tokens compute | A Cortex Search Service automatically embeds each text row in the search column specified in the `ON` parameter into vector space to enable semantic search, which incurs a credit cost per token embedded. This involves calling [EMBED_TEXT_768](../../../sql-reference/functions/embed_text-snowflake-cortex.md) or [EMBED_TEXT_1024](../../../sql-reference/functions/embed_text_1024-snowflake-cortex.md) to convert each document as a series of numbers that encodes its meaning. Embeddings are computed each time a row is inserted or updated. Embeddings are processed incrementally in the evaluation of the source query, so the embedding cost is only incurred for added or changed documents. See [Vector Embeddings](../vector-embeddings.md) for more information on vector embedding costs. |
| Multi-index Cortex Search | Multi-index Cortex Search Services have costs dependent on how you embed tokens and the number of columns you index. Larger embedding vectors or higher numbers of index columns incur higher costs. Embeddings are computed each time a row is inserted or updated. Embeddings are processed incrementally in the evaluation of the source query, so the embedding cost is only incurred for added or changed documents. |
| Serving compute | A Cortex Search Service uses multi-tenant serving compute, separate from a user-provided Virtual Warehouse, to establish a low-latency, high-throughput service. The compute cost for this component is incurred per GB per month (GB/mo) of uncompressed indexed data, where indexed data is the user-provided data in the Cortex Search source query, plus vector embeddings computed on the user’s behalf. You incur these costs while the service is available to respond to queries, even if no queries are served during a given period. For the Cortex Search Serving credit rate per GB/mo of indexed data, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). |
| Storage | Cortex Search Services materialize the source query into a table stored in your account. This table is transformed into data structures that are optimized for low-latency serving, also stored in your account. Storage for the table and intermediate data structures are based on a flat rate per terabyte (TB). |
| Cloud services compute | Cortex Search Services use [Cloud Services compute](../../cost-understanding-compute.md) to identify changes in underlying base objects and whether the virtual warehouse needs to be invoked. Cloud services compute cost is subject to the constraint that Snowflake only bills if the daily cloud services cost is greater than 10% of the daily warehouse cost for the account. |

For best practices on managing the costs of a Cortex Search Service, see [Understanding cost for Cortex Search Services](cortex-search-costs.md).

To view the **AI Services**-related consumption costs for each Cortex Search Service in your account, aggregated daily,
see the [CORTEX_SEARCH_DAILY_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_search_daily_usage_history.md)

## Known limitations

Usage of Cortex Search is subject to the following limitations:

* **Base table size**: The result of the materialized query in the search service must be
  less than 100M rows in size to maintain optimal serving performance. If the materialized result
  of your query has more than 100M rows, the creation query fails with an error.

  > **Note:**
  >
  > To increase the row scaling limits on a Cortex Search Service above 100M, please contact
  > your Snowflake account team.
* **Throughput and rate limiting**: Cortex Search returns a 429 HTTP status code if a client sends requests too quickly or if the service becomes overloaded. Client logic calling the search service should implement backoff and retry logic to handle these 429 responses gracefully.

  > **Note:**
  >
  > To increase throughput beyond 20 QPS for a single search service or 140 QPS across all services in your account, contact
  > your Snowflake account team.
* **Query constructs**: Cortex Search Service source queries must adhere to the same query restrictions
  that Dynamic Tables have. Please see the [Dynamic table limitations](../../dynamic-tables-limitations.md) for more detail.
* **Data retention**: Cortex Search Services have the same requirements as dynamic tables around data retentions.
  Specifically, you can’t set the [DATA_RETENTION_TIME_IN_DAYS](../../../sql-reference/parameters.md) object parameter in your base tables to zero
  or set this parameter on the schema or database containing the search service. Additionally, search services
  can become stale if they are not refreshed within [MAX_DATA_EXTENSION_TIME_IN_DAYS](../../../sql-reference/parameters.md). Once stale, they must be
  recreated to resume refreshes. Please see the [Dynamic table limitations](../../dynamic-tables-limitations.md) for more detail.
* **Cloning**: Cortex Search Services do not currently support [cloning](../../object-clone.md).
  Snowflake intends to provide this capability in some future release, but cannot guarantee a specific timeline.
* **Table immutability**: While running, your Cortex Search Services require tables they access aren’t modified or dropped. To safely update tables used by a Cortex Search Service, stop the service before making your changes.

## Regional availability

Support for this feature is available to accounts in the following Snowflake regions. Availability for specific embedding models
within a region is denoted with a checkmark.

| Cloud Provider | Region | `snowflake-arctic-embed-m-v1.5` | `snowflake-arctic-embed-l-v2.0` | `snowflake-arctic-embed-l-v2.0-8k` | `voyage-multilingual-2` |
| --- | --- | --- | --- | --- | --- |
| AWS | US West 2 (Oregon) | ✔ | ✔ | ✔ | ✔ |
| AWS | US East 2 (Ohio) | ✔ | ✔ | ✔ |  |
| AWS | US East 1 (N. Virginia) | ✔ | ✔ | ✔ | ✔ |
| AWS | US East (Commercial Gov - N. Virginia) | ✔ | ✔ | ✔ | ✔ |
| AWS | Canada (Central) | ✔ | ✔ | ✔ |  |
| AWS | South America (São Paulo) | ✔ | ✔ | ✔ |  |
| AWS | Europe (Ireland) | ✔ | ✔ | ✔ |  |
| AWS | Europe (London) | ✔ | ✔ | ✔ |  |
| AWS | Europe Central 1 (Frankfurt) | ✔ | ✔ | ✔ | ✔ |
| AWS | Europe (Stockholm) | ✔ | ✔ | ✔ |  |
| AWS | Asia Pacific (Tokyo) | ✔ | ✔ | ✔ | ✔ |
| AWS | Asia Pacific (Mumbai) | ✔ | ✔ | ✔ |  |
| AWS | Asia Pacific (Sydney) | ✔ | ✔ | ✔ |  |
| AWS | Asia Pacific (Jakarta) | ✔ | ✔ | ✔ |  |
| AWS | Asia Pacific (Seoul) | ✔ | ✔ | ✔ |  |
| Azure | East US 2 (Virginia) | ✔ | ✔ | ✔ |  |
| Azure | West US 2 (Washington) | ✔ | ✔ | ✔ |  |
| Azure | South Central US (Texas) | ✔ | ✔ | ✔ |  |
| Azure | UK South (London) | ✔ | ✔ | ✔ |  |
| Azure | North Europe (Ireland) | ✔ | ✔ | ✔ |  |
| Azure | West Europe (Netherlands) | ✔ | ✔ | ✔ | ✔ |
| Azure | Switzerland North (Zürich) | ✔ | ✔ | ✔ |  |
| Azure | Central India (Pune) | ✔ | ✔ | ✔ |  |
| Azure | Japan East (Tokyo, Saitama) | ✔ | ✔ | ✔ |  |
| Azure | Southeast Asia (Singapore) | ✔ | ✔ | ✔ |  |
| Azure | Australia East (New South Wales) | ✔ | ✔ | ✔ |  |
| GCP | Europe West 2 (London) | ✔ | ✔ | ✔ |  |
| GCP | Europe West 3 (Frankfurt) | ✔ | ✔ | ✔ |  |
| GCP | Europe West 4 (Netherlands) | ✔ | ✔ | ✔ |  |
| GCP | Middle East Central 2 (Dammam) | ✔ | ✔ | ✔ |  |
| GCP | US Central 1 (Iowa) | ✔ | ✔ | ✔ |  |
| GCP | US East 4 (N. Virginia) | ✔ | ✔ | ✔ |  |

> **Note:**
>
> You can specify the [cross-region inference parameter](../cross-region-inference.md) in any of
> the above regions to access models which aren’t directly supported from your default region.

Cortex Search is available in the following regions **only** using cross-region inference.
To use Cortex Search with cross-region inference, use the [cross-region inference parameter](../cross-region-inference.md).

* AWS Europe (Paris)
* AWS Europe (Zurich)
* AWS Asia Pacific (Singapore)
* AWS Asia Pacific (Osaka)
* Azure Canada Central (Toronto)
* Azure Central US (Iowa)
* Azure UAE North (Dubai)

> **Note:**
>
> When using cross-region inference, query latency between regions depends on the cloud provider infrastructure and network status.
> Snowflake recommends that you test your specific use-case with cross-region inference enabled.

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../../guides-overview-ai-features.md).

---
title: Cortex Search tutorials
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/overview-tutorials.md
section: Snowflake Cortex (AI & ML)
---

# Cortex Search tutorials

These tutorials provide step-by-step instructions for you to explore how to use Cortex Search.

[Tutorial 1: Build a simple search application with Cortex Search](tutorials/cortex-search-tutorial-1-search.md)
:   Walks through building a simple search experience using Cortex Search on a dataset consisting of AirBnb listing reviews.

[Tutorial 2: Build a simple chat application with Cortex Search](tutorials/cortex-search-tutorial-2-chat.md)
:   Walks through building a basic chatbot with Cortex Search and LLM functions on a dataset consisting of TED Talk transcripts.

[Tutorial 3: Build a PDF chatbot with Cortex Search](tutorials/cortex-search-tutorial-3-chat-advanced.md)
:   Walks through an end-to-end setup for creating a Chatbot using Cortex Search on a PDF dataset consisting of Federal Open Market Committee (FOMC) meeting minutes.

---
title: Cross-region inference
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cross-region-inference.md
section: Snowflake Cortex (AI & ML)
---

# Cross-region inference

Inference is the process of using a machine learning model to get an output based on a user input. For example, when you call the
SNOWFLAKE.CORTEX.COMPLETE function, you are requesting an inference from the LLM with your prompt as the input. In Snowflake, you can
configure your account to allow cross-region inference processing with the [CORTEX_ENABLED_CROSS_REGION](../../sql-reference/parameters.md)
parameter. This parameter enables inference requests to be processed in a different region from the default region.
The cross-region inference parameter is used to determine the inference behavior for any Snowflake feature supported by
cross-region inference, including Cortex LLM Functions.

When enabled, cross-region inference occurs if the LLM or feature is not supported in your default region.

By default, the parameter is set to `DISABLED` for most accounts, which allows requests to be processed only in the default region. For new accounts
created in new organizations within commercial regions created after March 9, 2026, the default is `ANY_REGION`.

You can specify the regions you want to allow cross-region inference to using the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command.

For details on this parameter, see [CORTEX_ENABLED_CROSS_REGION](../../sql-reference/parameters.md).

## Access control requirements

This parameter can only be set at the account level, not at the user or session levels. Only the ACCOUNTADMIN role can set the parameter
using the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'AWS_US';
```

This parameter cannot be set by the ORGADMIN role.

## How to use the cross-region inference parameter

By default, this parameter is set to `DISABLED` for most accounts, which means inference requests are only processed in the default region.
For new accounts in new organizations within commercial regions created after March 9, 2026, the default is `ANY_REGION`. The
following examples show how to set the cross-region parameter for various use cases.

### Any region

To allow any of the Snowflake regions that support cross-region inference requests to process your requests, set the parameter to
`'ANY_REGION'`.

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'ANY_REGION';
```

### Default region only

To process inference requests only in the default region, set this parameter to `'DISABLED'`.

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'DISABLED';
```

### Specify regions

To allow only specified regions to process your requests, set this parameter to the regions separated by commas. For a full list of
regions, see [CORTEX_ENABLED_CROSS_REGION](../../sql-reference/parameters.md).

The following example specifies `AWS_US` and `AWS_EU` regions to process your inference requests:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'AWS_US,AWS_EU';
```

### US Commercial Gov regions

Cross-region inference for Snowflake’s government-authorized, FIPS-compliant commercial environments is designed to maintain data-handling boundaries while providing access to supported AI models. When enabled, inference requests remain within the same cloud and compliance boundary, and processing occurs on FIPS-validated infrastructure such as AWS Bedrock FIPS endpoints. This approach allows customers in select U.S. government-authorized regions to use Snowflake AI capabilities securely and without exceptions to compliance policies.

To enable this feature, set the CORTEX_ENABLED_CROSS_REGION parameter to `AWS_US` for workloads in a supported government-authorized region:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'AWS_US';
```

Cross-region inference is available for US Commercial Gov in these regions:

* US East (Commercial Gov - N. Virginia)
* US West (Commercial Gov - Oregon)

## Cost considerations

* You are charged credits for the use of LLM as listed in the
  [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
  Credits are considered consumed in the requesting region. For example, if you call an LLM Function from the `us-east-2` region and
  the request is processed in the `us-west-2` region, the credits are considered consumed in the `us-east-2` region.
* You do not incur data egress charges for using cross-region inference.

## Considerations

* Latency between regions depends on the cloud provider infrastructure and network status. Snowflake recommends that you test your specific
  use-case with cross-region inference enabled.
* Cross-region inference is not supported in [U.S. SnowGov regions](../intro-regions.md). This means you cannot make cross-region
  inference requests into or out of the SnowGov regions.
* You can use this setting from GCP or Azure regions to make inference requests for features that are not supported in those regions.
* User inputs, service generated prompts, and outputs are not stored or cached during cross-region inference.
* The data required for the inference request traverses between regions as follows:

  + If both the source and destination regions are in AWS, the data stays within the [AWS global network](https://aws.amazon.com/about-aws/global-infrastructure/).
    All data flowing across the AWS global network that interconnects the data centers and regions is automatically
    encrypted at the physical layer.
  + If both the source and destination regions are in Azure, the traffic stays entirely within the Azure global network. It never enters the public internet.
  + If the regions are on different cloud providers, then the data traverses the public internet using Mutual Transport Layer Security (mTLS).
* Cross-region inference for [Cortex Search](cortex-search/cortex-search-overview.md) is not supported in [all regions](cortex-search/cortex-search-overview.md).

## Next steps

* For details on the cross-region inference parameter, see [CORTEX_ENABLED_CROSS_REGION](../../sql-reference/parameters.md) section of the SQL parameter reference.

---
title: Custom instructions in Cortex Analyst
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/custom-instructions.md
section: Snowflake Cortex (AI & ML)
---

# Custom instructions in Cortex Analyst

Custom instructions let you have greater control over SQL generation. Using natural language, you can tell Cortex Analyst
exactly how to generate SQL queries from within your semantic model YAML file. For example, use custom instructions
to tell Cortex Analyst what you mean by *performance* or *financial year*. In this way, you can improve the accuracy of the generated SQL
by incorporating custom logic or additional elements.

For more granular control, you can also specify custom instructions for individual modules in the SQL generation pipeline. See
Module custom instructions for more information.

## How custom instructions work

Cortex Analyst introduces the `custom_instructions` field into the semantic model YAML file.
This field enables you to apply defining modifications or additions to SQL generation.

For more information about the semantic model syntax, see [Using SQL commands to create and manage semantic views](../../views-semantic/sql.md).

## Examples

To explore possible use cases for custom instructions, consider the following examples.

### Formatting data output

Ensure that all numbers in the output are rounded to two decimal points.

#### The `custom_instructions` field in the semantic model YAML file

```yaml
custom_instructions: "Ensure that all numeric columns are rounded to 2 decimal points in the output."
```

#### Generated SQL query

```sqlexample
SELECT
  ROUND(column_name, 2) AS column_name,
  ...
FROM
  your_table;
```

### Adjusting percentages

Automatically multiply percentage or rate calculations by 100 for consistency.

#### The `custom_instructions` field in the semantic model YAML file

```yaml
custom_instructions: "For any percentage or rate calculation, multiply the result by 100."
```

#### Generated SQL query

```sqlexample
SELECT
  (column_a / column_b) * 100 AS percentage_rate,
  ...
FROM
  your_table;
```

### Adding default filters

Apply a filter if the user doesn’t specify one (for example, default to the last year).

#### The `custom_instructions` field in the semantic model YAML file

```yaml
custom_instructions: "If no date filter is provided, apply a filter for the last year."
```

#### Generated SQL query

```sqlexample
SELECT
  ...
FROM
  your_table
WHERE
  date_column >= DATEADD(YEAR, -1, CURRENT_DATE);
```

### Linking column filters

Apply additional filters on related columns based on user input.

#### The `custom_instructions` field in the semantic model YAML file

```yaml
custom_instructions: "If a filter is applied on column X, ensure that the same filter is applied to dimension Y."
```

#### Generated SQL query

```sqlexample
SELECT
  ...
FROM
  your_table
WHERE
  column_x = 'filter_value' AND
  dimension_y = 'filter_value';
```

## Module custom instructions

Set the `module_custom_instructions` key in the top level of your semantic model to define custom instructions for specific components in the SQL generation pipeline.
This feature is useful for use cases like the following:

* Define logic that influences how user questions are interpreted before SQL is generated
* Maintain separate, more structured instructions for different parts of the Analyst workflow
* Transition from existing `custom_instructions` to a more modular format as your usage grows

Currently, `module_custom_instructions` supports the following components:

* `question_categorization`: Define how Cortex Analyst should classify user questions (for example, by blocking certain topics or guiding user behavior).
* `sql_generation`: Specify how SQL should be generated (for example, data formatting and filtering).

Instructions for either or both of these components can be set under the `module_custom_instructions` key.

> **Important:**
>
> Migrate any existing `custom_instructions` to the `sql_generation` component, as shown in the following example.

### Migrating existing custom instructions

If your model already has a `custom_instructions` field, migrate its content to the `sql_generation` field
under `module_custom_instructions`.

Before:

```yaml
custom_instructions: "Ensure that all numeric columns are rounded to 2 decimal points."
```

After:

```yaml
module_custom_instructions:
  sql_generation: |
     "Ensure that all numeric columns are rounded to 2 decimal points."
```

### Blocking questions about specific topics

You can use the `question_categorization` component to block questions about specific topics. For example, if you want
to block questions about users, you might set the following instructions. Cortex Analyst then rejects questions about
users with a message telling them to contact their administrator.

```yaml
module_custom_instructions:
  question_categorization: |
     Reject all questions asking about users. Ask users to contact their admin.
```

You can also use question categorization instructions to ask for missing details. In the following example, Cortex Analyst asks the user to
provide a product type if they ask about users and do not specify one.

```yaml
module_custom_instructions:
  question_categorization: |
    - If the question asks for users without providing a product_type, consider this question UNCLEAR and ask the user to specify product_type.
```

## Best practices

Be specific.
:   Clearly describe the modifications; for example, “Add a column with a fixed value of 42” or “Include a sum calculation for column X.”

Start small.
:   Start with simple modifications, such as adding a static column or default filters, before moving to more complex scenarios.

Preview the generated SQL query.
:   Ensure that the instructions apply as intended and that the generated SQL query is correct.

Iterate gradually.
:   Experiment with more complex use cases as your familiarity with the feature grows.

---
title: Customize charts in Snowflake Intelligence
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/chart-customization.md
section: Snowflake Cortex (AI & ML)
---

# Customize charts in Snowflake Intelligence

Snowflake Intelligence generates charts automatically from your data. You can customize those charts,
controlling colors, fonts, chart types, and more, by adding configuration to your agent or
semantic view.

## Overview

Customization works at two levels:

* **Agent level**: Applies to all charts across every semantic view attached to the agent. Use this
  for global defaults such as brand colors and fonts.
* **Semantic view level**: Applies only to charts generated from that specific semantic view. Use
  this for column-specific rules and domain-specific chart type preferences.

At each level, two mechanisms are available:

* **vega_template**: A partial [Vega-Lite](https://vega.github.io/vega-lite/) JSON spec that is
  deterministically merged into every generated chart. Use this for anything that must always apply.
* **Free-text instructions**: Natural language guidance injected into the chart generation prompt.
  The LLM makes a best effort to follow these, but they aren’t guaranteed.

> **Note:**
>
> When both agent and semantic view define a `vega_template`, the agent template is applied
> first and the semantic view template is applied second. On conflicting keys, the semantic view
> wins.

## Agent-level customization

Add a `<chart_customization>` block inside `instructions.orchestration` in your agent
configuration. You can combine font theming and a global default palette:

```none
<chart_customization>
Prefer horizontal bar charts for ranked data.
vega_template:
{
  "background": "antiquewhite",
  "config": {
    "title":  { "font": "monospace", "fontStyle": "italic", "fontSize": 20, "fontWeight": "lighter" },
    "axis":   { "labelFont": "monospace", "titleFont": "monospace", "titleFontSize": 15, "labelFontSize": 10 },
    "header": { "labelFont": "monospace", "titleFont": "monospace", "labelFontSize": 10 },
    "legend": { "labelFont": "monospace", "titleFont": "monospace", "titleFontSize": 18, "labelFontSize": 15 },
    "mark":   { "font": "monospace" }
  }
}
</chart_customization>
```

Use the agent level for:

* Brand color palette
* Default visual theme (fonts, background)
* Cross-domain style preferences
* Number or currency formatting defaults

Avoid column-specific color mappings at the agent level. Column names differ between semantic views
and are silently ignored when not found.

## Semantic-view-level customization

Add a `<chart_customization>` block inside the `module_custom_instructions.sql_generation`
field of your semantic view YAML. This field takes precedence over the legacy
`custom_instructions` field when both are set.

```sql
CREATE OR REPLACE SEMANTIC VIEW my_db.my_schema.my_view
  FROM @my_stage/semantic_view.yaml;
```

```yaml
# semantic_view.yaml
name: my_view
module_custom_instructions:
  sql_generation: |
    <chart_customization>
    Always use a line chart for time series data.
    vega_template:
    {
      "transform": [
        {
          "calculate": "datum.CATEGORY === 'Furniture' ? '#4e79a7' : datum.CATEGORY === 'Technology' ? '#f28e2b' : datum.CATEGORY === 'Office Supplies' ? '#e15759' : ''",
          "as": "_color"
        }
      ],
      "encoding": {
        "color": { "field": "CATEGORY", "type": "nominal", "scale": { "range": { "field": "_color" } } }
      }
    }
    </chart_customization>
tables:
  ...
```

Use the semantic view level for:

* Per-column color mappings
* Domain-specific chart type rules
* Metric-specific formatting
* Overriding agent-level defaults

## Use with caution: templates affect every chart

`vega_template` is merged into **every chart** generated at that level. There’s no per-question
or per-chart-type filtering. If you add an `encoding.y` override at the agent level, it applies
to bar charts, line charts, scatter plots, and pie charts alike.

Before adding a template, consider:

* **Scope**: Agent-level templates affect all charts across all semantic views. Prefer the semantic
  view level when a rule is specific to one domain or dataset.
* **Wildcard encodings**: A template encoding that omits `field` (for example,
  `"y": {"axis": {"format": "..."}}`) applies to every chart’s `y` axis regardless of what
  column is plotted. Use `field` to pin it to a specific column when the semantic view is known.
* **Mark overrides**: Setting `"mark": "line"` at the agent level forces every chart to a line,
  including ones where the LLM would correctly choose a bar or pie. Only override `mark` at the
  semantic view level where you have domain knowledge about the data.
* **Transform arrays**: A `calculate` transform in the template (for example, `_color`) is
  injected into every chart’s `transform` array. If the data doesn’t contain the referenced
  column, Vega-Lite silently produces `null` values for the calculated field.

When in doubt, start at the semantic view level and promote to the agent level only after
confirming the rule is safe for all charts.

To validate a template before deploying it, paste a representative chart spec (with your
`vega_template` already merged in) into the
[Vega Editor](https://vega.github.io/editor). The editor shows live warnings and errors in the
console. A valid template should produce no warnings. Common things to catch this way: invalid
property names, type mismatches, unreachable `calculate` expressions, and scale configuration
errors.

## Fonts

Font settings are controlled through the `config` block in `vega_template`. All font properties
are applied globally to the chart and affect every chart generated, regardless of data.

> **Note:**
>
> Use CSS generic font families for maximum compatibility. Charts in Snowflake Intelligence are
> rendered in two contexts: in the Snowsight browser UI (client-side, fonts depend on the user’s
> OS and browser) and server-side in a Linux container for validation and image export. Named fonts
> like `Arial` or `Georgia` might not be installed in the server-side container. CSS generic
> families always resolve correctly in both contexts:
>
> | Generic family | Resolves to |
> | --- | --- |
> | `sans-serif` | Arial (Windows/macOS), DejaVu Sans or Liberation Sans (Linux) |
> | `serif` | Times New Roman (Windows/macOS), DejaVu Serif or Liberation Serif (Linux) |
> | `monospace` | Courier New (Windows/macOS), DejaVu Sans Mono or Liberation Mono (Linux) |
>
> If you need a custom brand font, it must be installed in the server-side rendering container
> **and** served through CSS `@font-face` in Snowsight.

```json
{
  "config": {
    "title":  { "font": "serif", "fontSize": 20, "fontWeight": "bold", "fontStyle": "italic" },
    "axis":   { "labelFont": "monospace", "titleFont": "monospace", "labelFontSize": 11, "titleFontSize": 13 },
    "header": { "labelFont": "serif", "titleFont": "serif", "labelFontSize": 11 },
    "legend": { "labelFont": "serif", "titleFont": "serif", "labelFontSize": 12, "titleFontSize": 13 },
    "mark":   { "font": "serif" }
  }
}
```

Common `config` font properties:

| Property | Where it applies |
| --- | --- |
| `title.font`, `title.fontSize`, `title.fontWeight`, `title.fontStyle` | Chart title |
| `axis.labelFont`, `axis.labelFontSize` | Axis tick labels |
| `axis.titleFont`, `axis.titleFontSize` | Axis titles (for example, “Revenue”) |
| `header.labelFont`, `header.labelFontSize` | Facet / small-multiple headers |
| `legend.labelFont`, `legend.labelFontSize` | Legend value labels |
| `legend.titleFont`, `legend.titleFontSize` | Legend title |
| `mark.font` | Text marks (annotations) |

You can also set a global `background` color alongside fonts:

```json
{
  "background": "#f9f9f9",
  "config": {
    "title":  { "font": "monospace", "fontStyle": "italic", "fontSize": 20, "fontWeight": "lighter" },
    "axis":   { "labelFont": "monospace", "titleFont": "monospace", "titleFontSize": 15, "labelFontSize": 10 },
    "header": { "labelFont": "monospace", "titleFont": "monospace", "labelFontSize": 10 },
    "legend": { "labelFont": "monospace", "titleFont": "monospace", "titleFontSize": 18, "labelFontSize": 15 },
    "mark":   { "font": "monospace" }
  }
}
```

## Colors

### LLM instructions (soft)

The simplest way to apply color rules is to describe them in free text. The LLM interprets these
on a best-effort basis.

```none
<chart_customization>
Color Active status green, Inactive status red, and Pending status yellow.
</chart_customization>
```

Use this for quick, approximate color guidance when exact hex values aren’t required.

### Exact value mapping with _color

Map specific column values to exact hex colors using a `calculate` transform. Values not listed
receive an empty string, and Vega-Lite renders those with its own default.

```json
{
  "transform": [
    {
      "calculate": "datum.STATUS === 'Active' ? '#22c55e' : datum.STATUS === 'Inactive' ? '#ef4444' : datum.STATUS === 'Pending' ? '#eab308' : ''",
      "as": "_color"
    }
  ],
  "encoding": {
    "color": {
      "field": "STATUS",
      "type": "nominal",
      "scale": { "range": { "field": "_color" } }
    }
  }
}
```

Use this when you need exact, guaranteed colors for every known value.

> **Note:**
>
> The `_color` transform and the `encoding.color` block are always merged into the chart,
> regardless of which column the LLM chose to color by. This means:
>
> * The mapping only works correctly when the chart’s color channel actually uses the same column
>   referenced in the `calculate` expression (for example, `STATUS`). If the LLM assigns color
>   to a different column, the `_color` field is present in the data but the colors don’t match.
> * Only one column can be targeted per template.

### Pinned values with palette fallback

Pin colors for key values and let the rest be auto-assigned from a palette. Use
`"merge": "extend"` to preserve the LLM’s existing color choices and only add new mappings.

```json
{
  "encoding": {
    "color": {
      "scale": {
        "domain": ["Furniture", "Technology", "Office Supplies"],
        "range":  ["#4e79a7", "#f28e2b", "#e15759"],
        "scheme": "tableau10"
      }
    }
  },
  "usermeta": { "merge": "extend" }
}
```

Data values not in `domain` are automatically assigned the next available color from `scheme`.
After assignment, `scheme` is removed from the final spec.

Supported scheme names: `tableau10`, `tableau20`, `category10`, `category20`,
`category20b`, `category20c`, `dark2`, `paired`, `pastel1`, `pastel2`, `set1`,
`set2`, `set3`, `accent`.

## Disabling Snowsight styling

By default, Snowflake Intelligence applies Snowsight UI theme adjustments on top of the generated chart.
To opt out and render the chart exactly as specified in your `vega_template`, set `ui-merge` to
`"none"` in `usermeta`:

```json
{
  "usermeta": { "ui-merge": "none" }
}
```

This is useful when you want full control over the visual output, for example, when applying a
custom brand theme and you don’t want Snowsight to override colors, fonts, or backgrounds.

> **Note:**
>
> `ui-merge` is interpreted by the Snowsight client-side renderer, not by the orchestrator
> backend. It has no effect on the chart spec produced by the merge engine. It only controls how
> Snowsight applies its own theme on top of the final spec when displaying the chart in the
> browser.

## Number and currency formatting (experimental)

Axis and legend labels can be formatted using [D3 format strings](https://d3js.org/d3-format)
through `vega_template`. This is useful for enforcing consistent currency symbols, decimal
places, or SI suffixes across all charts.

Set `axis.format` for quantitative axes (`x`, `y`) and `legend.format` for color/size
legends:

```json
{
  "encoding": {
    "y": { "axis": { "format": "$,.0f" } }
  }
}
```

> **Note:**
>
> `axis.format` is applied by Vega-Lite only when the channel’s data type is
> `"quantitative"`. If the LLM infers a different type (for example, `"ordinal"` for a year
> or ID column), the format string is silently ignored. This is an accepted limitation of the
> `vega_template` approach because the merge is applied without inspecting inferred types.
>
> **Workaround**: Force the type explicitly in the template (`override` mode):
>
> ```json
> {
>   "encoding": {
>     "y": { "type": "quantitative", "axis": { "format": "$,.0f" } }
>   }
> }
> ```
>
> This guarantees the format applies but may affect other type-dependent rendering (axis ticks,
> binning).

Common D3 format strings:

| Format | Output example | Use for |
| --- | --- | --- |
| `$,.0f` | $1,234,567 | Dollar amounts, no decimals |
| `$,.2f` | $1,234,567.89 | Dollar amounts, 2 decimals |
| `,.0f` | 1,234,567 | Large integers with thousands separator |
| `.1%` | 42.3% | Percentages |
| `.2s` | 1.2M | Large numbers with SI prefix |
| `.2f` | 3.14 | Fixed 2 decimal places |

To apply formatting to all quantitative channels at the agent level (without knowing the specific
column name):

```json
{
  "encoding": {
    "y": { "axis": { "format": "$,.0f" } },
    "x": { "axis": { "format": "$,.0f" } },
    "color": { "legend": { "format": "$,.0f" } }
  },
  "usermeta": { "merge": "extend" }
}
```

Use `"merge": "extend"` so the format is added only to channels the LLM already populated,
without overwriting their `field` or `type` settings.

## Merge modes

Control how `vega_template` interacts with the LLM-generated chart by setting
`"usermeta": {"merge": "<mode>"}` inside the template.

| Mode | Behavior |
| --- | --- |
| `override` (default) | Template values overwrite the chart. Use when you need to enforce a specific setting. |
| `extend` | Existing chart values are preserved. New keys and additional scale entries are added. Use when you want to add to the chart without replacing what the LLM chose. |

Rules that apply to both modes:

* The `data` block is never overwritten.
* Encoding overrides apply only when the template’s `field` matches the chart’s `field`, or the
  template omits `field`.
* After merging, domain entries not present in the actual data are automatically removed.

**Example: force a line chart**

```json
{
  "mark": "line",
  "usermeta": { "merge": "override" }
}
```

---
title: Customizing Cortex Search scoring
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md
section: Snowflake Cortex (AI & ML)
---

# Customizing Cortex Search scoring

By default, queries to Cortex Search Services leverage vector similarity, text matching, and reranking
to determine the relevance of each result. You can customize the scoring of search results in several ways:

* Apply numeric boosts based on numeric metadata columns.
* Apply time decays based on timestamp metadata columns.
* Disable reranking to reduce query latency.
* Modify component weights to adjust the weight of individual scoring components (vector, text, reranking) in the overall search ranking.
* Disable the query prefix for vector embeddings for advanced use cases.
* Modify index-specific boosts to adjust the weight of individual indices in a multi-index search.

## Numeric boosts and time decays

You can boost or apply decays search results based on numeric or timestamp metadata. This feature is useful
when you have structured metadata, such as popularity or recency signals, for each result that can help determine the relevance of documents
at query time. You can specify two categories of ranking signals when making a query:

| Type | Description | Applicable column types | Example metadata fields (illustrative) |
| --- | --- | --- | --- |
| Numeric boost | Numeric metadata that boosts results having more attention or activity. | [Numeric data type](../../../sql-reference/data-types-numeric.md) | `clicks`, `likes`, `comments` |
| Time decay | Date or time metadata that boosts more recent results. The influence of recency signals decays over time. | [Date and time data type](../../../sql-reference/data-types-datetime.md) | `created_timestamp`, `last_opened_timestamp`, `action_date` |

Boost and decay metadata come from columns in the source table from which a Cortex Search Service is created. You
specify the metadata columns to use for boosting or decaying when you make the query, but those columns must be included
when creating the Cortex Search service.

When querying a Cortex Search Service, specify the columns to use for boosting or decaying in the optional
`numeric_boosts` and `time_decays` fields in the `scoring_config.functions` field. You can also specify the weight
for each boost or decay.

```json
{
  "scoring_config": {
    "functions": {
      "numeric_boosts": [
        {
          "column": "column_name",
          "weight": 1
        },
        /* ... */
      ],
      "time_decays": [
        {
          "column": "column_name",
          "weight": 1,
          "limit_hours": 120
        },
        /* ... */
      ]
    }
  }
}
```

### Properties

* `numeric_boosts` (array, optional):

  + `<numeric_boost_object>` (object, optional):

    - `column_name` (string): Specifies the numeric column to which the boost should be applied.
    - `weight` (float): Specifies the weight or importance assigned to the boosted column in the ranking process. When multiple columns are specified, a higher weight increases the influence of the field.
* `time_decays` (array, optional):

  + `<time_decay_object>` (object, optional):

    - `column_name` (string): Specifies the time or date column to which the decay should be applied.
    - `weight` (float): Specifies the weight or importance assigned to the decayed column in the ranking process. When multiple columns are specified, a higher weight increases the influence of the field.
    - `limit_hours` (float): Sets the boundary after which time starts to have less effect on the relevance or importance of the document. For example,
      a `limit_hours` value of 240 indicates that documents with timestamps greater than 240 hours (10 days) in the past from the `now` timestamp do not receive significant boosting,
      while documents with a timestamp within the last 240 hours should receive a more significant boost.
    - `now` (string, optional): Optional reference timestamp from which decays are calculated in ISO-8601 format `yyyy-MM-dd'T'HH:mm:ss.SSSXXX`.
      For example, `"2025-02-19T14:30:45.123-08:00"`. Defaults to the current timestamp if not specified.

> **Note:**
>
> Numeric boosts are applied as weighted averages to the returned fields, while decays leverage a log-smoothed function to
> demote less recent values.
>
> Weights are relative across the specified boost or decay fields. If only a single field is provided within a `boosts` or
> `decays` array, the value of its weight is irrelevant.
>
> If more than one field is provided, the weights are applied relative to each other. A field with a weight of 10, for
> example, affects the record’s ranking twice as much as a field with a weight of 5.

## Reranking

By default, queries to Cortex Search Services leverage *semantic reranking* to improve search result relevance.
While reranking can measurably increase result relevance, it can also noticeably increase query latency.
You can disable reranking in any Cortex Search query if you’ve found that
the quality benefit that reranking provides can be sacrificed for faster query speeds in your business use case.

> **Note:**
>
> Disabling reranking reduces query latency by 100-300ms on average, but the exact reduction in latency, as
> well as the magnitude of the quality degradation, varies across workloads.
> Evaluate results side-by-side, with and without reranking, before you decide to disable it in queries.

You can disable the reranker for an individual query at query time in the `scoring_config.reranker` field in the
following format:

```json
{
  "scoring_config": {
      "reranker": "none"
  }
}
```

### Properties

* `reranker` (string, optional): Parameter that can be set to “none” if the reranker should be turned off. If excluded or null, the default reranker is used.

## Component weights

The `weights` field in the `scoring_config` object allows you to specify the
weights of individual scoring components (`vectors`, `texts`, `reranker`) in the overall
score for each result. By default, the weights are set to 1.0 for each component, with
an equal contribution to the overall scoring.

You can specify weights in the following format:

```json
{
  "scoring_config": {
    "functions": {
      "weights": {
        "texts": 3,
        "vectors": 2,
        "reranker": 1
      }
    }
  }
}
```

> **Note:**
>
> When using index-specific boosts with `text_boots` or `vector_boosts` on a multi-index service, the `weights` property
> is placed at the top level of the scoring configuration, not as part of the `functions` object:
>
> ```json
> {
>   "scoring_config": {
>     "weights": {
>       "texts": 3,
>       "vectors": 2,
>       "reranker": 1
>     },
>     "functions": {
>       // ...
>     }
>   }
> }
> ```

### Properties

* `weights` (object, optional): Specifies weights for combining text, vector, and
  reranker scores for each document. Weights are applied relative to one another within this field.

For example, the following specifies that text scores should be weighted 3 times more than vector scores,
and reranker scores should be weighted 2 times more than text scores:

```json
{
  "scoring_config": {
    "functions": {
      "weights": {
        "texts": 3,
        "vectors": 1,
        "reranker": 2
      }
    }
  }
}
```

## Disabling query prefix for vector embeddings

By default, Cortex Search adds a prefix to queries before computing vector embeddings. This prefix varies by model, but generally has the following format: `Represent this sentence for searching relevant passages: query`. This improves search quality in many cases by providing context to the embedding model, which helps differentiate search queries from other texts you have stored in the Cortex Search service.

However, you might want to disable this prefix in some cases such as the following scenario:

* When you want to use similarity search without the prefix. For example, if you want to search “what is the best data cloud” and you want to get “Snowflake” as a result, then use the default prefix. However, if you want to search “what is the data cloud” and you want to get “which is the best data cloud” as a result, then you can disable the prefix.

You can disable the query prefix for an individual query at query time using the `disable_vector_embedding_query_prefix` parameter in the `scoring_config` field:

```json
{
  "scoring_config": {
    "disable_vector_embedding_query_prefix": true
  }
}
```

### Properties

* `disable_vector_embedding_query_prefix` (boolean, optional): When set to `true`, a search prefix is not added automatically to the query before computing vector embeddings. Defaults to `false`.

> **Note:**
>
> Disabling the query prefix might reduce search quality in most cases because the prefix helps the embedding model understand that the text is a search query. Only disable this if you have a specific reason to do so and have evaluated the impact on your search results.

## Named scoring profiles

Boosts/decays and reranker settings together form a *scoring configuration*, which can be specified in the `scoring_config` parameter
when making a query. Scoring configurations can also be given a name and attached to the Cortex Search service.

Using a named scoring profile lets you easily use a scoring configuration across applications and queries without having
to specify the full scoring configuration each time. If you change the scoring configuration, you only need to update it
in one place, not in every query.

To add a scoring profile to your Cortex Search Service, use the [ALTER CORTEX SEARCH SERVICE … ADD SCORING PROFILE](../../../sql-reference/sql/alter-cortex-search.md) command,
as shown in the following example:

```sqlexample
ALTER CORTEX SEARCH SERVICE my_search_service
  ADD SCORING PROFILE IF NOT EXISTS heavy_comments_with_likes
  '{
    "functions": {
            "numeric_boosts": [
                { "column": "comments", "weight": 6 },
                { "column": "likes", "weight": 1 }
            ]
    }
  }'
```

The syntax of the scoring profile definition is the same schema used in the `scoring_config` parameter when making a query.

Scoring profiles can’t be modified after being created; to change a profile, drop it and recreate it with the new scoring configuration.
To delete a named scoring profile, use [ALTER CORTEX SEARCH SERVICE … DROP SCORING PROFILE](../../../sql-reference/sql/alter-cortex-search.md).

To query a Cortex Search Service using a named scoring profile, specify the profile name in the `scoring_profile` parameter when making a query,
as shown in the following examples:

PythonREST APISQL

```python
results = svc.search(
    query="technology",
    columns=["comments", "likes"],
    scoring_profile="heavy_comments_with_likes",
    limit=10
)
```

```javascript
curl --location https://<account_url>/api/v2/databases/<db_name>/schemas/<schema_name>/cortex-search-services/<service_name>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "technology",
  "columns": ["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
  "scoring_profile": "heavy_comments_with_likes",
  "limit": 10
}'
```

```sqlexample
SELECT SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
  'my_search_service',
  '{
    "query": "technology",
    "columns": ["comments", "likes"],
    "scoring_profile": "heavy_comments_with_likes",
    "limit": 10
  }'
);
```

To see a service’s stored scoring profiles, query the `CORTEX_SEARCH_SERVICE_SCORING_PROFILES` view in the
`INFORMATION_SCHEMA` schema, as shown in the following example:

```sqlexample
SELECT *
  FROM my_db.INFORMATION_SCHEMA.CORTEX_SEARCH_SERVICE_SCORING_PROFILES
  WHERE service_name = 'my_search_service';
```

> **Note:**
>
> The DESCRIBE CORTEX SEARCH SERVICE and SHOW CORTEX SEARCH SERVICE results contain a column
> named `scoring_profile_count` that indicates the number of scoring profiles for each service.

## Component Scores

Component Scores provide detailed scoring information for search results. They allow developers to understand how search rankings are determined and debug search performance.
Scores for each result are returned in the `@scores` field for each retrieval “component” (text, vector).
Component scores are useful in scenarios where there is a need to:

* **Establish thresholds:** Use component scores to determine when to pass results on to a downstream process, like an agent.
* **Debug search rankings:** Understand why certain documents rank higher than others in search results.

### Understanding Component Scores

Component scores provide detailed breakdowns of how Cortex Search calculates the final relevance score
for each search result. The scoring system consists of multiple components:

**Cosine Similarity**
:   Scores based on semantic similarity between the query and vector indexes. Higher scores indicate
    stronger conceptual or meaning-based matches using vector embeddings.

**Text Match**
:   Scores based on keyword/lexical similarity between the query and text indexes. Higher scores indicate
    stronger exact or fuzzy keyword matches.

**Reranker Score**
:   Scores based on meaning-based matches between the query and the value in the text index. Higher scores indicate stronger conceptual or meaning-based matches using reranker. Scores are provided only for the top results which are reranked.

**Function Scores**
:   Additional detailed scoring information from boost functions when applied (such as `text_boosts`, `vector_boosts`,
    numeric boosts, time decay). Contains nested objects for each boost type (such as `text_boost` and `vector_boost`)
    showing individual column scores, weights, and weighted totals. Useful for understanding how matches in different fields contribute
    to the final scoring of the document.

### Response format

With component scores enabled, the following scoring information is returned for all your Cortex Search queries.
For more information on Cortex Search Query syntax, see [Query a Cortex Search Service](query-cortex-search-service.md).

```output
{
  "results": [
    {
      "@scores": {
        "cosine_similarity": <cosine_similarity_score>,
        "text_match": <text_match_score>
      }
    }
  ]
}
```

#### Score fields

* `@scores.cosine_similarity`: Cosine similarity score between the query and the value in the vector index, in the range [-1, 1].
* `@scores.text_match`: Text match score between the query and the value in the text index. This score is unbounded and its range
  depends on the query.
* `@scores.reranker_score`: Reranker score between the query and the value in the text index. This score is unbounded and its range
  depends on the query.
* `@scores.function_scores`: Nested object containing detailed boost function scoring (only present when `functions` are specified in the query):

  + `text_boost.column_scores.column_name.score`: Individual score for the specified column from text boost.
  + `text_boost.column_scores.column_name.weight`: Applied weight for the specified column from text boost.
  + `text_boost.weighted_score`: Final weighted score from text boost function.
  + `vector_boost.column_scores.column_name.score`: Individual score for the specified column from vector boost.
  + `vector_boost.column_scores.column_name.weight`: Applied weight for the specified column from vector boost.
  + `vector_boost.weighted_score`: Final weighted score from vector boost function.
  + `numeric_boost.column_scores.column_name.score`: Individual score for the specified column from numeric boost.
  + `numeric_boost.column_scores.column_name.weight`: Applied weight for the specified column from numeric boost.
  + `numeric_boost.weighted_score`: Final weighted score from numeric boost function.
  + `time_decay.column_scores.column_name.score`: Individual score for the specified column from time decay.
  + `time_decay.column_scores.column_name.weight`: Applied weight for the specified column from time decay.
  + `time_decay.weighted_score`: Final weighted score from time decay function.

#### Usage Notes

* `cosine_similarity` scores are:

  > + Returned for any query that includes a VECTOR INDEX.
  > + Bounded in the range [-1, 1] and comparable across different queries.
  > + Computed assuming normalized vectors.
  > + Subject to minor precision loss due to compression in the vector index, which means that
  >   `cosine_similarity(v, v)` might return `1.0 +/- epsilon` rather than exactly `1.0`.
  >   Compression details might vary over time, and epsilon might not be stable.
  > + Computed after prepending each query with a prefix that increases search quality in many cases.
  >   This prefix varies per model, but generally looks like: `Represent this sentence for searching relevant passages: {query}`.
  >   The returned cosine similarity score is the cosine similarity between the query with the prefix and the value in the vector index.
* `text_match` scores are:

  > + Returned for any query that includes a TEXT INDEX. `text_match` scores are unbounded.
  > + Not comparable across different queries. For example, a text match score of 0.95 on a result for a given query is not comparable to a
  >   text match score of 0.95 on a result for a different query to the same service.
* `@scores` values are not affected by the `weights` parameter. The weights only affect the final ordering of the results.

## Index-specific boosts

Index-specific boosts adjust the weight of influence for indexes in a [multi-index Cortex Search service](cortex-search-overview.md). You can adjust the text matching and vector matching weights, which are applied relative to the other provided weights. Higher values take priority over lower values, using the same behavior as component weights.

### Properties

* `text_boosts` (array, optional): Index-specific weights to be applied to text index columns. When this value is present, you’re required to include a weight for all text columns. Column weights are applied relative to one another.
* `vector_boosts` (array, optional): Index-specific weights to be applied to vector columns. When this value is present, you’re required to include a weight for all vector columns. Column weights are applied relative to one another.

Index-specific weights are objects containing `column` and `weight` keys:

```output
{
  "column": "<column name>",
  "weight": <weight>
}
```

As an example, consider the following table indexed for search:

```sqlexample
CREATE TABLE feedback_info (
  id VARCHAR,
  comment VARCHAR,
  support_note VARCHAR,
  sentiment VECTOR(FLOAT, 3),
  issue_category VECTOR(FLOAT, 3)
);
```

The following JSON shows a `scoring_config` for a multi-index Cortex Search service that de-ranks the `id` text column while boosting the `comment` text column, and adjusting the vector rankings of `sentiment` to be twice as important as other vector columns.

```json
{
  "scoring_config": {
    "functions": {
      "text_boosts": [
        { "column": "id", "weight": 1 },
        { "column": "support_note", "weight": 2},
        { "column": "comment", "weight": 3},
      ],
      "vector_boosts": [
        { "column": "issue_category", "weight": 1 },
        { "column": "sentiment", "weight": 2 }
      ]
    }
  }
}
```

## Diversity

In some cases, one type of result may return more results than others. To prevent a certain type of result from dominating the search results, use the `diversity` parameter.

For example, if a Cortex Search Service is created using long documents and these documents are indexed by chunking, the `diversity` parameter can be used to ensure that multiple chunks from the same document are not surfaced in the final result set.

You can enable diversity for an individual query at query time in the `scoring_config.diversity` field in the following format:

```none
{
  "scoring_config": {
    "diversity": {
      "group_by": <array_of_columns_to_group_by>,
      "max_results": <num_results_for_each_group>,
    }
  }
}
```

### Properties

* `diversity` (object, optional): Parameter that can be set to “none” if result diversity should be turned off.

  + `group_by` (array): Columns to group by.
  + `max_results` (integer): Maximum number of results for each group.

---
title: Detect and redact personally identifiable information (PII)
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/redact-pii.md
section: Snowflake Cortex (AI & ML)
---

# Detect and redact personally identifiable information (PII)

Personally identifiable information (PII) includes names, addresses, phone numbers, email addresses, tax identification numbers, and other
data that can be used (alone or with other information) to identify an individual. Most organizations have regulatory and compliance
requirements around handling PII data. [AI_REDACT](../../sql-reference/functions/ai_redact.md) is a fully-managed Cortex AI Function that uses a large language model (LLM) to help you
detect, locate, and redact PII from unstructured text data.

AI_REDACT can help you prepare text for call center coaching, sentiment analysis, insurance and medical analysis,
and machine learning (ML) model training, among other use cases.

> **Tip:**
>
> Use AI_PARSE_DOCUMENT or AI_TRANSCRIBE to convert document or speech data into text before applying AI_REDACT.

## AI_REDACT

The AI_REDACT function has two modes of operation: `detect` and `redact`. The default is `redact`. Use AI_REDACT in `detect` mode to
identify PII locations then programmatically choose which PII to redact. Use AI_REDACT in `redact` mode to replace PII in the input text with placeholder values.

> **Important:**
>
> AI_REDACT performs detection and redaction in a best-effort manner using AI models. Always review the output to ensure compliance
> with your organization’s data privacy policies. If AI_REDACT fails to detect or redact any PII in your data,
> [contact Snowflake Support](../contacting-support.md).

## Regional availability

See [Regional availability](aisql.md).

## Limitations

* AI_REDACT uses AI models and may not find all personally identifiable information. Always review the output
  to ensure compliance with your organization’s data privacy policies. If AI_REDACT fails to redact certain PII, contact [Snowflake Support](../contacting-support.md).
* The COUNT_TOKENS and AI_COUNT_TOKENS functions do not yet support AI_REDACT.
* AI_REDACT works best with well-formed English text. Performance may vary with other languages or text with many
  spelling, punctuation, or grammatical errors.
* AI_REDACT currently supports only US PII and some UK and Canadian PII, where noted in Detected PII categories.
* AI_REDACT is currently limited in the number of tokens it can input and output. Input and output together can be up to
  4,096 tokens. Output is limited to 1,024 tokens. If the input text is longer, split it into smaller chunks and
  redact each chunk separately, perhaps using
  [SPLIT_TEXT_RECURSIVE_CHARACTER](../../sql-reference/functions/split_text_recursive_character-snowflake-cortex.md).
  See Chunking example for an example of redacting text that exceeds token limits.

  > **Note:**
  >
  > A token is the smallest unit of data processed by the AI model. For English text, industry guidelines consider one token to be approximately four characters, or 0.75 words.

## Detected PII categories

AI_REDACT supports the detection and redaction of the following categories of PII. The values in the Category column are the strings
that are supported in the optional `categories` argument.

> | Category | Notes |
> | --- | --- |
> | NAME | Recognizes full name, first name, middle name, and last name |
> | EMAIL |  |
> | PHONE_NUMBER |  |
> | DATE_OF_BIRTH |  |
> | GENDER | Recognizes male, female, and nonbinary |
> | AGE |  |
> | ADDRESS | Identifies:   * complete postal address (US, UK, CA) * street address (US, UK, CA) * postal code (US, UK, CA) * city (US, UK, CA) * state (US) or province (CA) * county, borough, or township (US) |
> | NATIONAL_ID | Identifies Social Security numbers (US) |
> | PASSPORT | Identifies passport numbers (US, UK, CA) |
> | TAX_IDENTIFIER | Identifies Individual Taxpayer Numbers (ITNs) |
> | PAYMENT_CARD_DATA | Identifies complete card information, card number, expiration date, and CVV |
> | DRIVERS_LICENSE | Identifies US, UK, and CA licenses |
> | IP_ADDRESS |  |

> **Note:**
>
> AI_REDACT supports partial matches for some PII categories. For example, a first name alone is sufficient to trigger
> redaction with the [NAME] placeholder.

## Retain specific PII with detect mode

By default, AI_REDACT replaces all detected PII with placeholder values. In some cases, you might want to retain certain PII while redacting
the rest. For example, you might want to redact all names in call center transcripts or customer reviews except for known employee names.

Use `detect` mode to build a selective redaction workflow:

1. Call AI_REDACT with the `mode` argument set to `detect` to identify and locate PII in the input text.
2. Compare the detected spans against an allowlist of values you want to keep.
3. Redact only the PII that is not in the allowlist.

When you call AI_REDACT in `detect` mode, the function returns an OBJECT containing a `spans` array. Each element
in the array is an OBJECT with the following fields:

| Field | Type | Description |
| --- | --- | --- |
| `category` | VARCHAR | The PII category, such as `NAME` or `ADDRESS`. See Detected PII categories for supported categories. |
| `start` | NUMBER | The start index of the detected PII in the input text. |
| `end` | NUMBER | The end index of the detected PII in the input text. |
| `text` | VARCHAR | The matched PII text from the input. |

For examples of using `detect` mode, see Detection and selective redaction examples.

## Handle row-level errors in multi-row queries

> **Important:**
>
> If your query fails on every row, the cause might be a known constraint rather than a row-level error.
> See Limitations for details on token limits, language support, and other restrictions.

AI_REDACT raises an error if it cannot process the input text. When a query redacts multiple rows, an error causes the entire query to fail.
To allow processing to continue with other rows, you can set the session parameter `AI_SQL_ERROR_HANDLING_USE_FAIL_ON_ERROR` to FALSE.
Errors then return NULL instead of stopping the query.

```sqlexample
ALTER SESSION SET AI_SQL_ERROR_HANDLING_USE_FAIL_ON_ERROR=FALSE;
```

With this parameter set to FALSE, you can also pass TRUE as the final argument to AI_REDACT, which causes the return value to
be an OBJECT that contains separate fields for the redacted text and any error message. One of these fields is NULL
depending on whether the AI_REDACT call processed successfully.

The following example shows how to use error handling when processing multiple rows:

1. Create a table with unredacted text.

   > ```sqlexample
   > CREATE OR REPLACE TABLE raw_table AS
   >   SELECT 'My previous manager, Washington, used to live in Kirkland. His first name was Mike.' AS my_column
   >   UNION ALL
   >   SELECT 'My name is William and I live in San Francisco. You can reach me at (415).450.0973';
   > ```
2. Set the session parameter.

   > ```sqlexample
   > ALTER SESSION SET AI_SQL_ERROR_HANDLING_USE_FAIL_ON_ERROR=FALSE;
   > ```
3. Create a redaction table with columns for `value` and `error`.

   > ```sqlexample
   > CREATE OR REPLACE TABLE redaction_table (
   >   value VARCHAR,
   >   error VARCHAR
   >   );
   > ```
4. Redact PII from `raw_table` and insert the rows into `redaction_table` to store the redacted text and error messages.

   > ```sqlexample
   > INSERT INTO redaction_table
   > SELECT
   >     result:value::STRING AS value,
   >     result:error::STRING AS error
   >   FROM (SELECT AI_REDACT(my_column, TRUE) AS result FROM raw_table);
   > ```

## Cost considerations

AI_REDACT incurs costs based on the number of input and output tokens processed, as with other Cortex AI Functions.
See the [Snowflake Pricing Guide](https://www.snowflake.com/pricing/pricing-guide/) for details.

## Redaction examples

### Basic redaction examples

The following example redacts a name and an address from the input text.

```sqlexample
SELECT AI_REDACT(
  input => 'My name is John Smith and I live at twenty third street, San Francisco.'
  );
```

Basic redaction output:

```output
My name is [NAME] and I live at [ADDRESS]
```

The following example redacts only names and email addresses from the input text. Note that the text only contains a
first name, which is recognized and redacted as [NAME]. The input text does not contain an email address, so no email
placeholder appears in the output.

```sqlexample
SELECT AI_REDACT(
  input => 'My name is John and I live at twenty third street, San Francisco.',
  categories => ['NAME', 'EMAIL']
  );
```

Selective redaction output:

```output
My name is [NAME] and I live at twenty third street, San Francisco.
```

### End-to-end example

The following example processes rows from one table and inserts the redacted output into another table. You could use a similar approach to
store the redacted data in a column in an existing table. After redaction, the text is passed to the
[AI_SENTIMENT](../../sql-reference/functions/ai_sentiment.md) function to extract overall sentiment information.

1. Create a table with unredacted text.

   > ```sqlexample
   > CREATE OR REPLACE TABLE raw_table AS
   >   SELECT 'My previous manager, Washington, used to live in Kirkland. His first name was Mike.' AS my_column
   >   UNION ALL
   >   SELECT 'My name is William and I live in San Francisco. You can reach me at (415).450.0973';
   > ```
2. View unredacted data.

   > ```sqlexample
   > SELECT * FROM raw_table;
   > ```
3. Create a redaction table.

   > ```sqlexample
   > CREATE OR REPLACE TABLE redaction_table (value VARCHAR);
   > ```
4. Redact PII from `raw_table` and insert the rows into `redaction_table`.

   > ```sqlexample
   > INSERT INTO redaction_table
   >   SELECT AI_REDACT(my_column) AS value FROM raw_table;
   > ```
5. View redacted results.

   > ```sqlexample
   > SELECT * FROM redaction_table;
   > ```
6. Run the AI_SENTIMENT function on redacted text.

   > ```sqlexample
   > SELECT
   >     value AS redacted_text,
   >     AI_SENTIMENT(value) AS summary_sentiment
   >   FROM redaction_table;
   > ```

### Chunking example

This example illustrates how to redact PII from long text by splitting the text into smaller chunks, redacting each chunk separately,
and then recombining the redacted chunks into the final output. This approach works around AI_REDACT’s token limits.

1. Create a table with patient data.

   > ```sqlexample
   > CREATE OR REPLACE TABLE patients (
   >   patient_id INT PRIMARY KEY,
   >   patient_notes TEXT
   >   );
   > ```
2. Split the text into chunks, apply AI_REDACT to each chunk, and concatenate the redacted chunks.

   > ```sqlexample
   > CREATE OR REPLACE TABLE final_temp_table AS
   >   WITH chunked_data AS (
   >     SELECT
   >         patient_id,
   >         chunk.value AS chunk_text,
   >         chunk.index AS chunk_index
   >       FROM
   >         patients,
   >         LATERAL FLATTEN(
   >             input => SNOWFLAKE.CORTEX.SPLIT_TEXT_RECURSIVE_CHARACTER(
   >                 patient_notes,
   >                 'none',
   >                 1000
   >                 )
   >             ) AS chunk
   >       WHERE
   >         patient_notes IS NOT NULL
   >         AND LENGTH(patient_notes) > 0
   >     ),
   >   redacted_chunks AS (
   >       SELECT
   >           patient_id,
   >           chunk_index,
   >           chunk_text,
   >           TO_VARIANT(results:value) AS redacted_chunk,
   >           TO_VARIANT(results:error) AS error_string
   >         FROM (
   >           SELECT
   >               patient_id,
   >               chunk_index,
   >               chunk_text,
   >               AI_REDACT(chunk_text,TRUE) AS results
   >             FROM
   >               chunked_data
   >         )
   >   ),
   >   final AS (
   >       SELECT
   >           chunk_text AS original,
   >           IFF(error_string IS NOT NULL, chunk_text, redacted_chunk) AS redacted_text,
   >           patient_id,
   >           chunk_index
   >         FROM
   >           redacted_chunks
   >   )
   >   SELECT * FROM final;
   > ```
3. Query the results.

   > ```sqlexample
   > SELECT
   >     patient_id,
   >     LISTAGG(redacted_text, '') WITHIN GROUP (ORDER BY chunk_index) AS full_output
   >   FROM final_temp_table
   >   GROUP BY patient_id;
   > ```

## Detection and selective redaction examples

### Basic detection example

The following example identifies and returns the category, location, and text of each detected PII instance without redacting the input.

```sqlexample
SELECT AI_REDACT(
    input => 'My old manager, Washington, used to live in Washington. His first name was Mike.',
    return_error_details => FALSE,
    mode => 'detect'
    );
```

Basic detection output:

```output
{
  "spans": [
    {
      "category": "NAME",
      "end": 26,
      "start": 16,
      "text": "Washington"
    },
    {
      "category": "ADDRESS",
      "end": 54,
      "start": 44,
      "text": "Washington"
    },
    {
      "category": "NAME",
      "end": 79,
      "start": 75,
      "text": "Mike"
    }
  ]
}
```

### End-to-end with allowlist example

The following example demonstrates a selective redaction workflow that uses `detect` mode and an allowlist. It loads a list of names to
retain from a staged file, uses AI_REDACT in `detect` mode to identify PII locations, and then passes the results to a Python UDF that
redacts only the PII not in the allowlist.

1. Retain an allowlist of values by loading the list from a stage into a temporary table.

   > ```sqlexample
   > CREATE OR REPLACE TEMP TABLE string_list (value STRING);
   >
   > COPY INTO string_list
   >   FROM @mystage/allowlist.txt
   >   FILE_FORMAT = (
   >     TYPE = 'CSV'
   >     RECORD_DELIMITER = '\n'
   >     FIELD_DELIMITER = '\t'   -- any char NOT in file
   >     TRIM_SPACE = TRUE
   >     SKIP_HEADER = 0
   >     );
   > ```
2. View the allowlist table

   > ```sqlexample
   > SELECT * FROM string_list;
   > ```
   >
   > Allowlist table output:
   >
   > ```output
   > VALUE
   > Mike
   > David
   > ```
3. Create a Python UDF that selectively redacts PII based on the allowlist.

   > ```sqlexample
   > CREATE OR REPLACE FUNCTION redact_spans_with_allowlist(
   >   SPAN_DATA VARIANT,
   >   ALLOWLIST ARRAY,
   >   ORIGINAL_TEXT STRING
   >   )
   >   RETURNS STRING
   >   LANGUAGE PYTHON
   >   RUNTIME_VERSION = '3.8'
   >   HANDLER = 'redact_text'
   >   AS
   >   $$
   >   def redact_text(span_data, allowlist, original_text):
   >       spans = span_data.get('spans', [])
   >       # Sort descending to maintain index integrity
   >       sorted_spans = sorted(spans, key=lambda x: x['start'], reverse=True)
   >
   >       result = original_text
   >
   >       for span in sorted_spans:
   >           text_val = span.get('text')
   >           if text_val in allowlist:
   >               continue
   >
   >           start, end = span['start'], span['end']
   >           label = f"[{span['category']}]"
   >
   >           # Splice the string
   >           result = result[:start] + label + result[end:]
   >
   >       return result
   >   $$;
   > ```
4. Test the UDF.

   > ```sqlexample
   > SELECT redact_spans_with_allowlist(
   >   PARSE_JSON('{"spans": [{"category": "NAME", "end": 26, "start": 16, "text": "Washington"}, {"category": "NAME", "end": 79, "start": 75, "text": "Mike"}]}'),
   >   ARRAY_CONSTRUCT('Washington'), -- This will NOT be redacted
   >   'Hello, my name is Washington and his is Mike.'
   >   );
   > ```
5. Run AI_REDACT in `detect` mode.

   > ```sqlexample
   > CREATE OR REPLACE TABLE raw (message TEXT);
   >
   > INSERT INTO raw (message) VALUES
   >   ('My old manager, Washington, used to live in Washington. His first name was Mike.');
   >
   > SELECT
   >     t.message AS message,
   >     AI_REDACT(input=>t.message, return_error_details=>FALSE, mode=>'detect') AS spans,
   >     redact_spans_with_allowlist(spans, l.str_list, message) AS result
   >   FROM raw t
   >     CROSS JOIN (
   >       SELECT ARRAY_AGG(value) AS str_list
   >         FROM string_list
   >       ) l;
   > ```

End-to-end with allowlist example output:

| MESSAGE | SPANS | RESULT |
| --- | --- | --- |
| My old manager, Washington, used to live in Washington. His first name was Mike. | ```json {   "spans": [     {"category": "NAME",     "end": 26,     "start": 16,     "text": "Washington"     },     {"category": "ADDRESS",     "end": 54,     "start": 44,     "text": "Washington"     },     {"category": "NAME",     "end": 79,     "start": 75,     "text": "Mike"     }   ] } ``` | My old manager, [NAME], used to live in [ADDRESS]. His first name was Mike. |

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Document Processing Playground
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/document-processing-playground.md
section: Snowflake Cortex (AI & ML)
---

# Document Processing Playground

The Document Processing Playground provides a user interface for exploring the AI_EXTRACT and AI_PARSE_DOCUMENT functions.
You can upload your own documents from stage, ask questions to extract information using AI_EXTRACT, and preview both the layout
and OCR results generated by AI_PARSE_DOCUMENT. The playground lets you explore how the functions process your documents,
and copy the corresponding code snippets for further use.

For more information, see [AI_EXTRACT](../../sql-reference/functions/ai_extract.md) and [Parsing documents with AI_PARSE_DOCUMENT](parse-document.md).

## Required privileges

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../../sql-reference/snowflake-db-roles.md).
For information about granting this privilege, see [Cortex LLM privileges](aisql.md).

## Get started with the Document Processing Playground

To access the Document Processing Playground:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » AI Studio.
   The Document Processing Playground appears among the other Studio functions.
3. To open the Document Processing Playground, select Open.

## Upload your documents

You can upload up to 10 documents.

### Upload your documents from a local machine

> **Note:**
>
> * To upload documents from a local machine, you must have a personal database enabled. For more information, see [Personal Databases](../personal-databases.md).
> * The maximum file size is 50 MB.

1. Select Select Warehouse, and then select the warehouse from the drop-down list.
2. Select Choose file.
3. Drag and drop files, or select Browse to select files from your local machine.
4. Select Upload.

   The playground appears.

### Upload your documents from a stage

> **Note:**
>
> When you upload files from a stage, the default warehouse is selected. To change the warehouse or if you don’t have a default warehouse,
> use Select Warehouse to choose a warehouse from the drop-down list.

1. Select Add from stage.

   A dialog appears.
2. Select the database, schema, and stage that contains your documents.
3. Select the document files that you want to add to the playground.
4. Select Open playground.

   The playground appears.

## The Document Processing Playground interface

The Document Processing Playground interface displays a preview of a document on the right and a prompt area on the left where you can enter prompts.

> **Tip:**
>
> To change the document that you are previewing, select the document name, and then select another document from the list.

The Document Processing Playground interface consists of the following tabs:

* Extraction: The view where you can ask questions to extract information from the document.
* Markdown: The view where you can see the markdown representation of the document. It’s the LAYOUT mode output from AI_PARSE_DOCUMENT.
* Text: The view where you can see the text representation of the document. It’s the OCR mode output from AI_PARSE_DOCUMENT.

## Extract information by asking questions

You can ask questions to extract information from the document.

1. Select the Extraction tab.
2. Select the extraction type:

   * To ask a question, select Ask.
   * To extract a list, select List.
   * To extract a table, select Extract table.
3. Create key and question pairs, for example:

   * Key: `company`
   * Question: `What is the name of the company?`
4. To confirm, select Add Prompt.

## Preview the markdown and text versions of the document

The Markdown and the Text tabs display the results of the AI_PARSE_DOCUMENT function.

* To see the Layout mode results, select the Markdown tab.
* To see the OCR mode results, select the Text tab.

## Get the code snippets for further use

The playground creates code snippets that use the AI_EXTRACT and AI_PARSE_DOCUMENT functions to process your documents.

If you uploaded files from a local machine, you can preview and copy the code snippets:

1. In the top right corner of the interface, select Code Snippets.
2. Select the language of the code snippet: SQL or Python.

   You can now copy the code snippet.

If you uploaded files from a stage, you can open the code snippet directly in Workspaces:

* In the top right corner of the interface, select Open in Workspaces.

  A new workspace opens with the code snippet.

## Regional availability

The Document Processing Playground is available in the following regions:

| Cloud platform | Cloud region |
| --- | --- |
| Amazon Web Services (AWS) | * US East (N. Virginia) * US East (Ohio) * US West (Oregon) * Canada (Central) * South America (Sao Paulo) * Europe (London) * EU (Stockholm) * EU (Ireland) * EU (Frankfurt) * Asia Pacific (Mumbai) * Asia Pacific (Tokyo) * Asia Pacific (Seoul) * Asia Pacific (Sydney) * Asia Pacific (Jakarta) |
| Microsoft Azure | * East US 2 (Virginia) * West US 2 (Washington) * South Central US (Texas) * Canada Central (Toronto) * UK South (London) * North Europe (Ireland) * West Europe (Netherlands) * Southeast Asia (Singapore) * UAE North (Dubai) * Australia East (New South Wales) * Central India (Pune) * Japan East (Tokyo) |
| Google Cloud | * US East4 (N. Virginia) * US Central1 (Iowa) * Europe West2 (London) * Europe West3 (Frankfurt) * Europe West4 (Netherlands) |

## Limitations

Limitations of the AI_EXTRACT and AI_PARSE_DOCUMENT functions apply to the Document Processing Playground.
For more information, see [AI_EXTRACT](../../sql-reference/functions/ai_extract.md) and [Parsing documents with AI_PARSE_DOCUMENT](parse-document.md).

---
title: Evaluate AI applications
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-observability/evaluate-ai-applications.md
section: Snowflake Cortex (AI & ML)
---

# Evaluate AI applications

To evaluate a generative AI application, follow these steps:

1. Build the app and instrument it using Trulens SDK (Applications built using Python are supported).
2. Register the app in Snowflake.
3. Create a run by specifying the input dataset.
4. Execute the run to generate traces and compute evaluation metrics.
5. View the evaluation results in Snowsight.

## Instrument the app

After you create your generative AI application in Python, import the TruLens SDK to instrument it. The TruLens SDK provides an `@instrument()` decorator to instrument the functions in your application to generate the traces and compute the metric.

* To use the decorator, add the following import to your Python application:

  ```python
  from trulens.core.otel.instrument import instrument
  ```

You can change the granularity of the `@instrument()` decorator depending on your requirements.

### Scenario 1: Trace a function

You can add `@instrument()` before the function you need to trace. This automatically captures the inputs to the function, the outputs (return values), and the latency of execution. For example, the following code demonstrates tracing an `answer_query` function that automatically captures input query and the final response:

```python
@instrument()
def answer_query(self, query: str) -> str:
    context_str = self.retrieve_context(query)
    return self.generate_completion(query, context_str)
```

### Scenario 2: Trace a function with a specific span type

A span type specifies the nature of the function and improves the readability and understanding of the traces. For example, in a RAG application you can specify span type as `RETRIEVAL` for your search service (or retriever) and specify the span type as `GENERATION` for the LLM inference call. The following span types are supported:

* `RETRIEVAL`: Span type for retrieval or search functions
* `GENERATION`: Span type for model inference calls from an LLM
* `RECORD_ROOT`: Span type for the main function in your application

If you don’t specify a span type with the `@instrument()`, an `UNKNOWN` span type is assigned by default. To use span attributes, add the following import to your Python application.

```python
from trulens.otel.semconv.trace import SpanAttributes
```

The following code snippet demonstrates tracing a RAG application. The span type must always be prefixed with `SpanAttributes.SpanType`.

```python
@instrument(span_type=SpanAttributes.SpanType.RETRIEVAL)
def retrieve_context(self, query: str) -> list:
    """
    Retrieve relevant text from vector store.
    """
    return self.retrieve(query)

@instrument(span_type=SpanAttributes.SpanType.GENERATION)
def generate_completion(self, query: str, context_str: list) -> str:
    """
    Generate answer from context by calling an LLM.
    """
    return response

@instrument(span_type=SpanAttributes.SpanType.RECORD_ROOT)
def answer_query(self, query: str) -> str:
    context_str = self.retrieve_context(query)
    return self.generate_completion(query, context_str)
```

### Scenario 3: Trace a function and compute evaluations

In addition to providing span types, you must assign relevant parameters in your application to span attributes to compute the metrics. For example, to compute context relevance in a RAG application, you must assign the relevant query and retrieval results parameter to appropriate attributes `RETRIEVAL.QUERY_TEXT` and `RETRIEVAL.RETRIEVED_CONTEXTS` respectively. The attributes required to compute each individual metric can be found in the Metrics page.

The following span attributes are supported for each span type:

* `RECORD_ROOT`: `INPUT`, `OUTPUT`, `GROUND_TRUTH_OUTPUT`
* `RETRIEVAL`: `QUERY_TEXT`, `RETRIEVED_CONTEXTS`
* `GENERATION`: None

To use span attributes, you need to add the following import to your Python application.

```python
from trulens.otel.semconv.trace import SpanAttributes
```

The following code snippet provides an example to compute context relevance for a retrieval service. The attributes must always follow the format `SpanAttributes.<span type>.<attribute name>` (e.g., `SpanAttributes.RETRIEVAL.QUERY_TEXT`).

```python
@instrument(
    span_type=SpanAttributes.SpanType.RETRIEVAL,
    attributes={
        SpanAttributes.RETRIEVAL.QUERY_TEXT: "query",
        SpanAttributes.RETRIEVAL.RETRIEVED_CONTEXTS: "return",
    }
)
def retrieve_context(self, query: str) -> list:
    """
    Retrieve relevant text from vector store.
    """
    return self.retrieve(query)
```

In the preceding example, `query` represents the input parameter to `retrieve_context()` and `return` represents the value returned. These are assigned to the attributes `RETRIEVAL.QUERY_TEXT` and `RETRIEVAL.RETRIEVED_CONTEXTS` to compute context relevance.

### Auto-instrument framework applications

In addition to manual instrumentation using the `@instrument()` decorator, TruLens provides specialized wrappers that automatically instrument applications built with popular LLM frameworks. These wrappers provide integration and automatic tracing without requiring manual decoration of individual functions.

#### TruChain for LangChain

`TruChain` provides automatic instrumentation for applications built with [LangChain](https://www.langchain.com/). It automatically captures the execution of key LangChain classes including chains, LLMs, prompts, and retrievers.

```python
from trulens.apps.langchain import TruChain

# Wrap your LangChain application
tru_recorder = TruChain(
    rag_chain,
    app_name="my_langchain_app",
    app_version="v1.0"
)

# Use the recorder as a context manager
with tru_recorder as recording:
    response = rag_chain.invoke(input_query)
```

`TruChain` supports:

* Automatic instrumentation of LangChain Expression Language (LCEL) chains
* Async support through the `ainvoke` method
* Built-in selectors (`on_input`, `on_output`, `on_context`) for RAG triad evaluation

#### TruGraph for LangGraph

`TruGraph` provides automatic instrumentation for applications built with [LangGraph](https://langchain-ai.github.io/langgraph/). It automatically detects LangGraph applications and instruments both LangChain and LangGraph components.

```python
from trulens.apps.langgraph import TruGraph

# Wrap your LangGraph application
tru_recorder = TruGraph(
    graph,
    app_name="my_langgraph_app",
    app_version="v1.0"
)

# Use the recorder as a context manager
with tru_recorder as recording:
    response = graph.invoke({"messages": [("user", input_query)]})
```

`TruGraph` supports:

* Automatic `@task` instrumentation with intelligent attribute extraction
* Multi-agent evaluation capabilities
* Combined instrumentation of both LangChain and LangGraph components

#### TruLlama for LlamaIndex

`TruLlama` provides automatic instrumentation for applications built with [LlamaIndex](https://www.llamaindex.ai/). It automatically captures the execution of key LlamaIndex classes including query engines, retrievers, and response synthesizers.

```python
from trulens.apps.llamaindex import TruLlama

# Wrap your LlamaIndex query engine
tru_recorder = TruLlama(
    query_engine,
    app_name="my_llamaindex_app",
    app_version="v1.0"
)

# Use the recorder as a context manager
with tru_recorder as recording:
    response = query_engine.query(input_query)
```

`TruLlama` supports:

* Automatic instrumentation of query engines, chat engines, and retrievers
* Async support through `aquery`, `achat`, and `astream_chat` methods
* Streaming support for LlamaIndex applications
* Built-in selectors (`on_input`, `on_output`, `on_context`) for RAG triad evaluation

For more information about framework-specific instrumentation, see the [TruLens documentation](https://www.trulens.org/component_guides/instrumentation/).

## Register app in Snowflake

To register your generative AI application in Snowflake for capturing traces and conducting evaluations, you need to create a `TruApp` object using the TruLens SDK that records the invocation (execution) of the user’s app and exports traces to Snowflake.

```python
tru_app = TruApp(
    app: Any,
    app_name: str,
    app_version: str,
    connector: SnowflakeConnector,
    main_method: callable  # i.e. app.query
)
```

> **Note:**
>
> If your application is built using LangChain, LangGraph, or LlamaIndex, you can use `TruChain`, `TruGraph`, or `TruLlama` respectively in place of `TruApp`. These framework-specific wrappers provide the same registration functionality while also enabling automatic instrumentation of your application. See Auto-instrument framework applications for more details.

Parameters:

* `app: Any`: an instance of the user-defined application that will later be invoked during a run for evaluation. i.e. `app = RAG()`
* `app_name: str`: is the name of the application user can specify and will be maintained in the user’s Snowflake account.
* `app_version: str`: is the version user can specify for the app to allow experiments tracking and comparison.
* `connector: SnowflakeConnector`: a wrapper class that manages snowpark session and Snowflake DB connection.
* `main_method: callable` (Optional): is the entry point method for the user’s application, which tells the SDK how the app is expected to be called by users and where to start tracing the invocation of the user app (specified by app). For the example of RAG class, the main_method can be specified as `app.answer_query`, assuming the answer method is the entry point of the app. Alternatively, instrument the entry point method with span attribute RECORD_ROOT. In that case, this parameter is not required.

## Create Run

To begin an evaluation job, you need to create a run. Creating a run requires a run configuration to be specified. The `add_run()` function uses the run configuration to create a new run.

### Run Configuration

A run is created from a `RunConfig`

```python
run_config = RunConfig(
    run_name=run_name,
    description="desc",
    label="custom tag useful for grouping comparable runs",
    source_type="DATAFRAME",
    dataset_name="My test dataframe name",
    dataset_spec={
        "RETRIEVAL.QUERY_TEXT": "user_query_field",
        "RECORD_ROOT.INPUT": "user_query_field",
        "RECORD_ROOT.GROUND_TRUTH_OUTPUT": "golden_answer_field",
    },
    llm_judge_name: "mistral-large2"
)
```

* `run_name: str`: name of the run, should be unique under the same `TruApp`
* `description: str` (optional): string description of the run
* `label: str` (optional): label used to group run together
* `source_type: str`: specifies the source of the dataset. It can either be `DATAFRAME` for a python dataframe or `TABLE` for a user table in the Snowflake account.
* `dataset_name: str`: any arbitrary name specified by the user if source_type is `DATAFRAME`. Or, a valid Snowflake table name under the user’s account under current context (database and schema) or Snowflake fully-qualified name in the form of “database.schema.table_name”.
* `dataset_spec: Dict[str, str]`: a dictionary mapping supported span attributes to user’s column names in the dataframe or table. The allowed keys are span attributes as specified in the Dataset page and the allowed values are column names in the user’s specified dataframe or table. For example, “golden_answer_field” in the run config example above must be a valid column name
* `llm_judge_name: str` (Optional): name to use as LLM judges during LLM-based metric computation. Please see the models page for supported judges. If not specified, the default value is `llama3.1-70b`

```python
run = tru_app.add_run(run_config=run_config)
```

Request Parameters:

* `run_config: RunConfig`: contains the configuration for the run.

### Retrieve Run

Retrieves the run.

```python
run = tru_app.get_run(run_name=run_name)
```

Request parameters:

* `run_name: str`: name of the run

### View Run metadata

Describes the details of the run.

```python
run.describe()
```

### Invoke Run

You can invoke the run using the `run.start()` function. It reads the inputs from the dataset specified in the run configuration, invokes the application for each input, generates the traces, and ingests the information for storage in your Snowflake account. `run.start()` is a blocking call until the application is invoked for all inputs in your dataset and ingestion is completed or timed out.

```python
run.start()  # if source_type is "TABLE"

run.start(input_df=user_input_df)  # if source_type is "DATAFRAME"
```

Request Parameters:

* `input_df: DataFrame` (Optional): is a pandas dataframe from the SDK. If the source_type in run configuration is specified as `DATAFRAME`, this field is mandatory. If the source_type is `TABLE`, this field is not required.

### Compute metrics

You can start metric computations using `run.compute_metrics()` after the application is invoked and all traces are ingested. As long as the status of the run is `INVOCATION_IN_PROGRESS`, computation cannot be started. Once the status is `INVOCATION_COMPLETED` or `INVOCATION_PARTIALLY_COMPLETED`, `run.compute_metrics()` can be initiated. `run.compute_metrics()` is an asynchronous non-blocking function. You can call `compute_metrics` multiple times on the same run with a different set of metrics, and each call will trigger a new computation job. Note that metrics once computed cannot be re-computed again for the same run.

```python
run.compute_metrics(metrics=[
    "coherence",
    "answer_relevance",
    "groundedness",
    "context_relevance",
    "correctness",
])
```

Request Parameters:

* `metrics: List[str]`: list of string names of the metrics listed in Metrics. The name of metrics should be specified in snake cases. i.e. Context Relevance should be specified as `context_relevance`.

### Check Run Status

You can check the status of the run after it is in progress. The list of statuses are in Run Status section.

```python
run.get_status()
```

### Cancel Run

You can cancel an existing run using `run.cancel()`. This operation will prevent any future updates to the run, including run status and metadata fields.

```python
run.cancel()
```

### Delete Run

You can delete an existing run using `run.delete()`. This operation deletes the metadata associated with the run and the evaluation results cannot be accessed. However, the traces and evaluations generated as part of the runs are not deleted and remain stored. Please refer to Observability data section for more information about storage and deletion of evaluation and traces.

```python
run.delete()
```

### List Runs for an application

You can see the list of all available runs corresponding to a specific `TruApp` application object using the `list_runs()` function.

```python
tru_app.list_runs()
```

Response:

Return a list of all Runs created under the `tru_app`.

## View Evaluations and Traces

To view evaluation results do the following:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Evaluations.

Do the following to view the evaluation results for your application runs:

* To view runs corresponding to a specific application, select the application.
* To view the evaluation results for a run, select the run. You view the aggregated results and the results corresponding to each record.
* To view traces for a record, select it. You can view detailed traces, latency, inputs and outputs into each stage of the application, evaluation results, and explanation provided by the LLM judge for the accuracy score that have been generated.

To compare runs that use the same dataset, select multiple runs and select Compare to compare the outputs and the evaluation scores.

---
title: Extracting information from documents with AI_EXTRACT
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/document-extraction.md
section: Snowflake Cortex (AI & ML)
---

# Extracting information from documents with AI_EXTRACT

AI_EXTRACT is a Cortex AI Function that lets you extract structured information, such as entities, lists, and tables,
from text or document files, by asking questions in natural language or by describing information to be extracted. It
can be used with other functions to create custom document processing pipelines for a variety of use cases (see
[Cortex AI Functions: Documents](ai-documents.md)).

AI_EXTRACT can process documents of various formats in multiple languages and extract information from both text-heavy
paragraphs and content in a graphical form, such as logos, handwritten text (for example, signatures), tables, or checkmarks).
AI_EXTRACT can extract information in the following structured formats:

* Entity: Ask questions in natural language or describe the information to be extracted (such as city, street, or ZIP
  code).
* List (or array): You can provide a JSON schema to extract an array or list of information present in the document,
  such as the name of all account holders in a bank statement or a list of all addresses in a Document.
* Table: Provide a JSON schema to extract tabular data present in the document by specifying the table title
  and a list of columns that should be extracted.

AI_EXTRACT scales automatically with your workload by processing multiple documents simultaneously. Documents can be
processed directly from object storage to avoid unnecessary data movement.

> **Note:**
>
> AI_EXTRACT is currently incompatible with custom [network policies](../network-policies.md).

> **Tip:**
>
> For more information on AI_EXTRACT, including supported languages, regional availability, and cost, see [AI_EXTRACT](../../sql-reference/functions/ai_extract.md).

## Extraction quality

AI_EXTRACT uses `arctic-extract`, a proprietary vision based large language model (LLM) that delivers high extraction accuracy.
The following table presents the model’s scores on various standard benchmarks, with the scores of other popular models for comparison:

### Visual question answering (VQA)

| Offering | DocVQA score |
| --- | --- |
| Human evaluation | 0.9811 |
| **Snowflake Arctic-Extract** | 0.9433 |
| Azure OpenAI GPT-o3 | 0.9339 |
| Google Gemini 2.5-Pro | 0.9316 |
| Google Anthropic Claude 4 Sonnet | 0.9119 |
| Azure Document Intelligence + GPT-o3 | 0.8853 |
| Google Document AI + Gemini | 0.8497 |
| Azure OpenAI GPT-o3 | 0.9339 |
| AWS Textract | 0.8313 |

### Text-only question answering (SQuAD v2)

| Offering | ANLS | Exact match |
| --- | --- | --- |
| **Snowflake Arctic-Extract** | 81.18 | 78.74 |
| Anthropic Claude 4 Sonnet | 80.54 | 77.10 |
| Meta LLaMA 3.1 405B | 80.37 | 76.56 |
| Meta LLaMA 4 Scout | 74.30 | 70.70 |
| OpenAI GPT 4.1 | 70.71 | 66.81 |
| Meta LLaMA 3.1 8B | 59.13 | 54.48 |

## Question optimization for extracting information

When you work with AI_EXTRACT, use natural language to ask questions about your documents. To ask a question that
returns an accurate answer, follow these guidelines:

* Use plain English.
* For each question, know what answers you expect.
* Be specific; for example, if the document includes several dates (such as issuing date and signature date), do not ask
  “What is the date?” without including more details.
* Ask for a single value in each question.
* Do not expect AI_EXTRACT to guess your intentions or have extended knowledge in a specific domain.

Consider the following document as an example. This purchase and sale agreement includes information such as the offer expiration date,
the names of the buyers and the seller, and the included items.

The following table provides examples of questions you can ask AI_EXTRACT and the expected answers.

| Example question | Answer |
| --- | --- |
| What is the date of this agreement? | `'October 6, 2023'` |
| Who is the buyer of the condo? | `'John Davis', 'Jane Davis'` |
| What home appliances are included with the unit? | `'stove/range', 'refrigerator', 'washer', 'dishwasher', 'attached television(s)', 'microwave'` |
| What items are not included with the flat? | `'dryer', 'security system', 'satellite dish', 'wood stove', 'fireplace insert', 'hot tub', 'attached speaker(s)', 'generator'` |
| Is there a dryer in the flat? | `'No'` |
| What addenda are attached to this purchase and sale agreement? | `'22A (Financing)', '2AA (Appraisal)', '22FSBO (Owner Sale)'` |
| What is the seller’s fax number? | None |
| Is the buyer’s signature present on the form? | `'No'` |
| What is the MLS number? | `'59844680'` |
| What is the property’s address? | `'604 Bishop Crossing Land, Fort Lauderdale, Broward County, FL, 33338'` |

## Table extraction best practices

This section provides best practices when working with table extraction in AI_EXTRACT.

### Use one schema for a specific type of document

Each extraction workload must contain documents of the same type, and the data that you want to extract should be similar for most
of the tables. If the number of columns in the source document differs from one document to another, but all documents contain
a defined subset of columns to be extracted and the common columns have the same or a similar name and location, then those common
columns can be extracted.

For example, invoices may have different numbers of columns with various data, but if all of the tables have the same first
three columns — `Item Description`, `Quantity`, and `Price` — that data can be extracted.

### Use natural language to define column names

You can copy the column names from the document so that they’re exactly the same.
For example, don’t name the columns `product_code` or `REPORT_DATE`; instead, name them `Product Code` or `Report Date`.

### Skip the empty rows

When you create a fine-tuning dataset, skip the rows with no answer (where the returned answer would be `None`).

### Define the columns in the same order they appear in the document

To improve accuracy, define the columns in the same order as they appear in the document, which is usually from left to right,
or top to bottom for the transposed tables. If you
choose to define the order differently, training might be needed.

However, for columns where values are the same for multiple rows, such as `Invoice Number` and `Invoice Date`, add these columns at the beginning.
For example:

* `Invoice Number`
* `Invoice Date`
* `Item Code`
* `Item Name`
* `Quantity`

### Define values using casing from the document

When possible, define values using casing (uppercase and lowercase) from the document. If the casing in the document is varied, use
capitalization.

### Use the description field

The `description` field in the AI_EXTRACT response format is optional; in most cases, you don’t have to fill it in. However, if there
are multiple similar tables in a document, the model might answer inaccurately. If the answers come from a different source table
than expected or the model can’t find the table, try using the `description` field. Add information that helps the model identify
the right table, such as the table title or number.

### Add a section column to describe the layout of the table

If the table is divided into multiple named sections, add a section column. This helps the model understand the layout better and
improve the accuracy. For example, you can name the column `Section`, `Item section`, or `Item category`. If there is a second
level of nesting in the sections, you can add two columns: `Section` and `Subsection`.

### To group values, create an additional column

You can add a column to the existing table to group values. In this way, you can join results from the whole document set in a single
table; for example:

| Invoice Number | Item Details | Item Price | Quantity |
| --- | --- | --- | --- |
| A | Item A1 | 10.00 | 1 |
| A | Item A2 | 20.00 | 1 |
| A | Item A3 | 30.00 | 1 |
| B | Item B1 | 15.00 | 1 |
| B | Item B2 | 25.00 | 1 |
| B | Item B3 | 35.00 | 1 |

Note that the value in the first column is repeated for corresponding items.

### Make the column names distinguishable between documents

Try to semantically distinguish a column. Don’t use names such as `col1`, `val1`, `item1`.

In some cases, transposition can work better, especially when the row names don’t differ between documents or differ slightly and
are within a closed set of values.

Note that training on the specified column set might improve the results.

### Use the parent name as a prefix when working with hierarchical headers

To extract information from tables with hierarchical headers, join the header path using each parent name as a prefix. For example,
for the following table, define the columns as:

* `Category A Type X Column 1`
* `Category A Type Y Column 2`
* `Category A Type Y Column 3`
* `Category B Column 4`
* `Category B Column 5`

### Transpose the tables if needed

You can extract information from transposed tables by using values from the first column of the table in the document as column names
in the output table.

For example, for the following table, name the columns:

* `Type A: Item 1`
* `Type A: Item 2`
* `Type B: Item 3`
* `Type B: Item 4`

Note that this example includes hierarchical headers.

### For large tables, split the document

The model for table extraction returns answers that are up to 4096 tokens long. It means that the model stops extracting when it reaches
that limit. You can approach this in the following ways:

* If the table covers several pages, split the document into multiple one-page documents, and join the results in postprocessing.
* If the table is so dense that the data can’t be extracted even from a single page, divide the table by columns.

  For example, if the table contains 10 columns, try defining two separate values: one with 5 columns from the left half, and the other
  with 5 columns from the right half of the table. You might need to experiment with the column choice for best results.

### Create names for the columns that don’t have a name in the document

If the first column in the document doesn’t have a name, you must create that name yourself when defining the value. You can approach it
in the following ways:

* Use the table title or a significant part of the title.
* Create a descriptive name that represents the data in the column; for example, `description`, `type of asset`, `year`, `category`.

### Compare data from two different periods of time

If you want to compare data from two different periods of time, for example, years 2023 and 2024 in financial documents such as annual reports,
you can prefix the columns with “current” and “previous”. Note that training might be needed to improve the results.

## Examples: Extract information from a purchase and sale agreement

The following examples extract information from the condominium purchase and sale agreement
which you can view in the Question optimization for extracting information section.

### Extract an entity

Extract the seller name and the offer expiration date:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.stage','document.pdf'),
  responseFormat => [['seller_name', 'What is the seller name?'], ['address', 'What is the offer expiration date?']]
);
```

Result:

```json
{
    "error": null,
    "response": {
        "address": "12/12/2023",
        "seller_name": "Paul Doyle"
    }
}
```

### Extract checkbox information

Extract information about items that are not included, based on the checkboxes marked in the document:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.stage','document.pdf'),
  responseFormat => [['flat_items', 'What items are not included with the flat?'], ['default', 'What Default is selected?']]
);
```

Result:

```json
{
    "error": null,
    "response": {
        "default": "Forfeiture of Earnest Money",
        "flat_items": "dryer, security system, satellite dish, wood stove, fireplace insert, hot tub, attached speaker(s), generator, other"
    }
}
```

### Extract signature status

Extract information about whether the agreement has been signed:

```sqlexample
SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.stage','document.pdf'),
    responseFormat => [['signature', 'Is this document signed?']]
);
```

Result:

```json
{
  "error": null,
    "response": {
        "signature": "no"
    }
}
```

### Extract a list of entities

Extract a list of buyer names:

```sqlexample
SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files', 'report.pdf'),
    responseFormat => {
        'schema': {
        'type': 'object',
        'properties': {
            'buyer_list': {
            'description': 'What are the buyer names?',
            'type': 'array'
            }
        }
        }
    }
);
```

Result:

```json
{
    "error": null,
    "response": {
        "buyer_list": [
        "John Davis",
        "Jane Davis"
        ]
    }
}
```

## Example: Extract information from a table

This example extracts information from the following document:

```sqlexample
SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files', 'report.pdf'),
    responseFormat => {
        'schema': {
            'type': 'object',
            'properties': {
                'income_table': {
                'description': 'Table 2: Granger Causality Tests - P-values',
                'type': 'object',
                'column_ordering': ['description', 'countries','lags','z','z_approx'],
                'properties': {
                    'description': {
                        'description': 'Description',
                        'type': 'array'
                        },
                    'countries': {
                        'description': 'Countries',
                        'type': 'array'
                        },
                    'lags': {
                        'description': 'Lags',
                        'type': 'array'
                        },
                    'z': {
                        'description': 'Z',
                        'type': 'array'
                    },
                    'z_approx': {
                        'description': 'Z approx.',
                        'type': 'array'
                    }
                }
            }
        }
    }
);
```

Result:

```json
{
    "error": null,
    "response": {
        "income_table": {
            "countries": [
                "33","80","29","84","34"
            ],
            "description": [
                "Commodity exporters",
                "Non-commodity exporters",
                "AE",
                "EMDE",
                "Large or market-dominant countries"
            ],
            "lags": [
                "2","1","1","1","1"
            ],
            "z": [
                "0.11","0.08","0.89","0.12","0.07"
            ],
            "z_approx": [
                "0.25","0.19","0.95","0.25","0.14"
            ]
        }
    }
}
```

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Feedback REST API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-feedback-rest-api.md
section: Snowflake Cortex (AI & ML)
---

# Feedback REST API

Use this API to collect feedback about Cortex Agents from end users.

## Collect feedback about a Cortex Agent

`POST /api/v2/databases/{database}/schemas/{schema}/agents/{name}:feedback`

Creates a feedback event for a Cortex Agent response.

### Request

#### Path parameters

| Parameter | Description |
| --- | --- |
| `database` | (Required) Identifier for the database to which the resource belongs. You can use the /api/v2/databases GET request to get a list of available databases. |
| `schema` | (Required) Identifier for the schema to which the resource belongs. You can use the /api/v2/databases/{database}/schemas GET request to get a list of available schemas for the specified database. |
| `name` | (Required) Identifier for the agent. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

#### Request body

The request body contains the feedback details for the agent response.

| Field | Type | Description |
| --- | --- | --- |
| `orig_request_id` | string | Request ID for the message associated with the feedback. If this value is not set, then feedback is logged for the agent. |
| `positive` | boolean | Whether the response was good (`true`) or bad (`false`). |
| `feedback_message` | string | The text for the detailed feedback message. |
| `categories` | array of strings | List of categories for the feedback. Each category is a string that represents a specific category of feedback. |
| `thread_id` | integer | The id of the thread. |

#### Example request body for agent-level feedback

```json
{
  "categories": [
    "Something worked well"
  ],
  "feedback_message": "this is fantastic!",
  "positive": true
}
```

#### Example request body for request-level feedback

```json
{
  "orig_request_id": "aa123456-789a-a1-2a34-a1a234a56789",
  "categories": [
    "Something worked well"
  ],
  "feedback_message": "this is fantastic!",
  "positive": true
}
```

### Response

A successful response returns a confirmation message.

#### Response headers

| Header | Description |
| --- | --- |
| `X-Snowflake-Request-ID` | Unique ID of the API request. |

#### Response body

```json
{
  "status": "Feedback submitted successfully"
}
```

## View feedback for Cortex Agents

For information about required privileges and how to query feedback events (including example SQL), see [View feedback provided by users](cortex-agents-monitor.md).

---
title: Fine-tuning (Snowflake Cortex)
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-finetuning.md
section: Snowflake Cortex (AI & ML)
---

# Fine-tuning (Snowflake Cortex)

The Snowflake Cortex Fine-tuning function offers a way to customize large language models for your specific task.
This topic describes how the feature works and how to get started with creating your own fine-tuned model.

## Overview

Cortex Fine-tuning allows users to leverage parameter-efficient fine-tuning (PEFT) to create customized
adaptors for use with pre-trained models on more specialized tasks. If you don’t want
the high cost of training a large model from scratch but need better latency and results than you’re
getting from prompt engineering or even retrieval augmented generation (RAG) methods, fine-tuning
an existing large model is an option. Fine-tuning allows you to use examples to adjust the behavior
of the model and improve the model’s knowledge of domain-specific tasks.

Cortex Fine-tuning is a fully managed service that lets you fine-tune popular LLMs using your data, all within Snowflake.

Cortex Fine-tuning features are provided as a Snowflake Cortex function, [FINETUNE](../../sql-reference/functions/finetune-snowflake-cortex.md),
with the following arguments:

* [CREATE](../../sql-reference/functions/finetune-create.md): Creates a fine-tuning job with the given training data.
* [SHOW](../../sql-reference/functions/finetune-show.md): Lists all the fine-tuning jobs in the current account.
* [DESCRIBE](../../sql-reference/functions/finetune-describe.md): Describes the progress and status of a particular fine-tuning job.
* [CANCEL](../../sql-reference/functions/finetune-cancel.md): Cancels a given fine-tuning job.

## Cost considerations

The Snowflake Cortex Fine-tuning function incurs compute cost based on the number of tokens used in training. In addition,
running the [AI_COMPLETE](../../sql-reference/functions/ai_complete.md) function on a fine-tuned model incurs compute costs based on the number of tokens
processed. Refer to the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for each
cost in credits per million tokens.

A token is the smallest unit of text processed by the Snowflake Cortex Fine-tuning function, approximately equal to four
characters of text. The equivalence of raw input or output text to tokens can vary by model.

* For the COMPLETE function, which generates new text in the response, both input and output tokens are counted.
* Fine-tuning trained tokens are calculated as follows:

  ```none
  Fine-tuning trained tokens = number of input tokens * number of epochs trained
  ```

  Use the [FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)](../../sql-reference/functions/finetune-describe.md) to see the number of trained tokens for your fine-tuning job.
* In addition to tuning and inference charges, standard [storage](../cost-understanding-data-storage.md) and
  [warehouse](../cost-understanding-compute.md) costs apply for storing the customized adaptors and running SQL commands.

### Track credit consumption for Fine-tuning training

To view the credit and token consumption for fine-tuning training jobs, use the [CORTEX_FINE_TUNING_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_fine_tuning_usage_history.md):

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FINE_TUNING_USAGE_HISTORY;
```

## Other considerations

* Fine-tuning jobs are often long running and are not attached to a worksheet session.
* The number of rows in the training/validation dataset is limited by the base model and the number of training epochs. The following table shows the limits for 3 epochs:

  ```none
  Effective row count limit = 1 epoch limit for base model / number of epochs trained
  ```

  | Model | 1 epoch | 3 epochs (default) |
  | --- | --- | --- |
  | `llama3-8b` | 186k | 62k |
  | `llama3-70b` | 21k | 7k |
  | `llama3.1-8b` | 150k | 50k |
  | `llama3.1-70b` | 13.5k | 4.5k |
  | `mistral-7b` | 45k | 15k |
  | `mixtral-8x7b` | 27k | 9k |

## Access control requirements

To run a fine-tuning job, the role that creates the fine-tuning job needs the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | DATABASE | The database that the training (and validation) data are queried from. |
| CREATE MODEL or OWNERSHIP | SCHEMA | The schema that the model is saved to. |

The following SQL is an example of granting the CREATE MODEL privilege to a role, `my_role`, on `my_schema`.

```sqlexample
GRANT CREATE MODEL ON SCHEMA my_schema TO ROLE my_role;
```

Additionally, to use the FINETUNE function,
the ACCOUNTADMIN role must grant the SNOWFLAKE.CORTEX_USER database role to the user who will call the function.
See [LLM Functions required privileges](aisql.md) topic for details.

To give other roles access to use the fine-tuned model, you must grant usage on the model. For details, see
[Model privileges](../security-access-control-privileges.md).

## Models available to fine-tune

You have the following base models that you can fine-tune. Models available for fine-tuning may be added or removed in the future:

| Name | Description |
| --- | --- |
| `llama3-8b` | A large language model from Meta that is ideal for tasks that require low to moderate reasoning like text classification, summarization, and sentiment analysis. |
| `llama3-70b` | An LLM from Meta that delivers state of the art performance ideal for chat applications, content creation, and enterprise applications. |
| `llama3.1-8b` | A large language model from Meta that is ideal for tasks that require low to moderate reasoning. It’s a light-weight, ultra-fast model with a context window of 24K. |
| `llama3.1-70b` | An open source model that demonstrates state-of-the-art performance ideal for chat applications, content creation, and enterprise applications. It is a highly performant, cost effective model that enables diverse use cases. |
| `mistral-7b` | 7 billion parameter large language model from Mistral AI that is ideal for your simplest summarization, structuration, and question answering tasks that need to be done quickly. It offers low latency and high throughput processing for multiple pages of text with its 32K context window. |
| `mixtral-8x7b` | A large language model from Mistral AI that is ideal for text generation, classification, and question answering. Mistral models are optimized for low latency with low memory requirements, which translates into higher throughput for enterprise use cases. |

## How to fine-tune a model

The overall workflow for tuning a model is as follows:

1. Prepare the training data.
2. Start the fine-tuning job with the required parameters.
3. Monitor training job.

Once training is complete, you can use the model name provided by Cortex Fine-tuning to run inference on your model.

### Prepare the fine-tuning data

The fine-tuning data must come from a Snowflake table or view and the query result must contain columns named `prompt` and `completion`.
If your table or view does not contain columns with the required names, use a column alias in your query to name them. This query is given
as a parameter to the FINETUNE function. You will get an error if the results do not contain `prompt` and `completion` column names.

> **Note:**
>
> All columns other than the prompt and completion columns will be ignored by the FINETUNE function. Snowflake recommends using a
> query that selects only the columns you need.

The following code calls the FINETUNE function and uses the `SELECT ... AS` syntax to set two of the columns in the query result
to `prompt` and `completion`.

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  'my_tuned_model',
  'mistral-7b',
  'SELECT a AS prompt, d AS completion FROM train',
  'SELECT a AS prompt, d AS completion FROM validation'
);
```

> **Note:**
>
> To get responses that follow a schema you define, use structured outputs to generate fine-tuning data.
> For more information about structured outputs, see [AI_COMPLETE structured outputs](complete-structured-outputs.md).

A prompt is an input to the LLM and completion is the response from the LLM. Your training data should include prompt and completion pairs
that show how you want the model to respond to particular prompts.

The following are additional recommendations and requirements regarding your training data for getting
optimal performance from fine-tuning.

* Start with a few hundred examples. Starting with too many examples may increase tuning time drastically with
  minimal improvement in performance.
* For each example, you must use only a portion of the allotted context window for the base model you are tuning. Context window is
  defined in terms of tokens. A token is the smallest unit of text processed by Snowflake Cortex functions, approximately equal to
  four characters of text. Prompt and completion pairs that exceed this limit will be truncated, which may negatively impact
  the quality of the trained model.
* The portion of the context window allotted for `prompt` and `completion` for each base model is defined in the following table:

  > | Model | Context Window | Input Context (prompt) | Output Context (completion) |
  > | --- | --- | --- | --- |
  > | llama3-8b | 8k | 6k | 2k |
  > | llama3-70b | 8k | 6k | 2k |
  > | llama3.1-8b | 24k | 20k | 4k |
  > | llama3.1-70b | 8k | 6k | 2k |
  > | mistral-7b | 32k | 28k | 4k |
  > | mixtral-8x7b | 32k | 28k | 4k |

### Start the fine-tuning job

You can start a fine-tuning job by
calling the [SNOWFLAKE.CORTEX.FINETUNE function and passing in ‘CREATE’ as the first argument](../../sql-reference/functions/finetune-create.md)
or using Snowsight.

#### Use SQL

This example uses the `mistral-7b` model as the base model to create a job with a model output name of `my_tuned_model` and training
and validation data querying from the `my_training_data` and `my_validation_data` tables respectively.

```sqlexample
USE DATABASE mydb;
USE SCHEMA myschema;

SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  'my_tuned_model',
  'mistral-7b',
  'SELECT prompt, completion FROM my_training_data',
  'SELECT prompt, completion FROM my_validation_data'
);
```

You can use absolute paths for each of the database objects such as the model or data if you want to use different database and schema for each. The following example shows creating a fine-tuning job with data from `mydb2.myschema2` database and schema and saving the fine-tuned model to the `mydb.myschema` database and schema.

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  'mydb.myschema.my_tuned_model',
  'mistral-7b',
  'SELECT prompt, completion FROM mydb2.myschema2.my_training_data',
  'SELECT prompt, completion FROM mydb2.myschema2.my_validation_data'
);
```

The [SNOWFLAKE.CORTEX.FINETUNE function with ‘CREATE’ as the first argument](../../sql-reference/functions/finetune-create.md)
returns a fine-tuned model ID as the output. Use this ID to get status or job progress using the
[SNOWFLAKE.CORTEX.FINETUNE function with ‘DESCRIBE’ as the first argument](../../sql-reference/functions/finetune-describe.md).

#### Use Snowsight

Follow these steps to create a fine-tuning job in the Snowsight:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Choose a role that is granted the SNOWFLAKE.CORTEX_USER database role.
3. In the navigation menu, select AI & ML » AI Studio.
4. Select Fine-tune from the Create Custom LLM box.
5. Select a base model using the drop-down menu.
6. Select the role under which the fine-tuning job will execute and the warehouse where it will run. The role must be granted
   the SNOWFLAKE.CORTEX_USER database role.
7. Select a database in which to store the fine-tuned model.
8. Enter a name for your fine-tuned model, then select Let’s go.
9. Select the table or view that contains your training data, then select Next. The training data can come from any database or
   schema that the role has access to.
10. Select the column that contains the prompts in your training data, then select Next.
11. Select the column that contains the completions in your training data, then select Next.
12. If you have a validation dataset, select the table or view that contains your validation data, then select Next.
    If you don’t have separate validation data, select Skip this option.
13. Verify your choices, then select Start training.

The final step confirms that your fine-tuning job has started and displays the Job ID. Use this ID to get status or job progress
using the [SNOWFLAKE.CORTEX.FINETUNE function with ‘DESCRIBE’ as the first argument](../../sql-reference/functions/finetune-describe.md).

### Manage fine-tuned jobs

Fine-tuning jobs are long running, which means they are not tied to a worksheet session. You can check the status of your tuning job using the
[SNOWFLAKE.CORTEX.FINETUNE](../../sql-reference/functions/finetune-snowflake-cortex.md) function with [SHOW](../../sql-reference/functions/finetune-show.md)
or [‘DESCRIBE’](../../sql-reference/functions/finetune-describe.md) as the first argument.

If you no longer need a fine-tuning job, you can terminate the job using the [SNOWFLAKE.CORTEX.FINETUNE](../../sql-reference/functions/finetune-snowflake-cortex.md)
function with [CANCEL](../../sql-reference/functions/finetune-cancel.md) as the first argument and the job ID as the second argument.

### Analyze fine-tuned models

After a fine-tuning job completes, you can analyze the results of the training
process by examining the fine-tuned model’s artifacts. The OWNERSHIP privilege on the model is
required to access the fine-tuned model’s artifacts; for details, see
[Model privileges](../security-access-control-privileges.md).

The artifacts include a `training_results.csv` file. This CSV file
contains one header row followed by a row for each training step recorded by the
fine-tuning job. The file contains the following columns:

> | Column name | Description |
> | --- | --- |
> | step | Number of training steps completed in the entire training process. Starts at 1. |
> | epoch | The epoch in the training process. Starts at 1. |
> | training_loss | The loss for the training batch. A lower number indicates a closer fit between the model and the data. |
> | validation_loss | The loss on the validation dataset. This is only available at the last step in each epoch. |

The `training_results.csv` file can be found in the [Model Registry UI](../../developer-guide/snowflake-ml/model-registry/snowsight-ui.md) in Snowsight
and accessed directly via SQL or Python API.
For more information, see
[Working with model artifacts](../../developer-guide/snowflake-ml/model-registry/overview.md).

## Use your fine-tuned model for inference

Use the [COMPLETE LLM function](../../sql-reference/functions/complete-snowflake-cortex.md) with the name your fine-tuned model to make inferences.

This example shows a call to the [COMPLETE](../../sql-reference/functions/complete-snowflake-cortex.md) function with
the name of your fine-tuned model.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE(
  'my_tuned_model',
  'How to fine-tune mistral models'
);
```

The following is a snippet of the output from the example call:

> ```output
> Mistral models are a type of deep learning model used for image recognition and classification. Fine-tuning a Mistral model involves adjusting the model's parameters to ...
> ```

## Limitations and known issues

* Fine-tuning jobs are listable at the account-level only.
* The fine-tuning jobs returned from [FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)](../../sql-reference/functions/finetune-show.md) are not permanent and may be garbage
  collected periodically.
* If a base model is removed from the Cortex LLM Functions, your fine-tuned model will no longer work.

## Sharing models

Fine-tuned models can be shared to other accounts with the USAGE privilege via [Data Sharing](../data-sharing-intro.md).

## Replicating models

[Cross-region inference](cross-region-inference.md) does not support fine-tuned models. Inference must take place in the same
region where the model object is located. You can use database replication to replicate the fine-tuned model object to a region you want
to make inference from if it’s different than the region the model was trained in.

For example,
if you create a fine-tuned model based on `mistral-7b` in your account in the AWS US West 2 region, you can use data sharing to share it
with another account in this region, or you can use database replication to replicate the model to another account in your organization
in a different region that supports the `mistral-7b` model, such as AWS Europe West. For details on replicating objects, see
[Replicating databases and account objects across multiple accounts](../account-replication-config.md).

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Fine-tuning arctic-extract models
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/arctic-extract-finetuning.md
section: Snowflake Cortex (AI & ML)
---

# Fine-tuning `arctic-extract` models

You can now fine-tune `arctic-extract` models using the [Snowflake Cortex Fine-tuning](cortex-finetuning.md)
function and [Snowflake Datasets](../../developer-guide/snowflake-ml/dataset.md). The fine-tuned model can then be used for inference with the
[AI_EXTRACT](../../sql-reference/functions/ai_extract.md) function.

## Syntax

For specific syntax, usage notes, and examples, see:

* FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)
* FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)
* [FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)](../../sql-reference/functions/finetune-show.md)
* [FINETUNE ('CANCEL') (SNOWFLAKE.CORTEX)](../../sql-reference/functions/finetune-cancel.md)

### FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)

Creates a fine-tuning job.

#### Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  '@<database>.<schema>.<model_name>',
  'arctic-extract',
  '<training_dataset>'
  [
    , '<validation_dataset>'
  ]
)
```

#### Required parameters

`'CREATE'`
:   Specifies that you want to create a fine-tuning job.

`'training_dataset'`
:   Dataset object to use for training. For more information, see Dataset requirements.

#### Optional parameters

`'validation_dataset'`
:   Dataset object to use for validation. For more information, see Dataset requirements.

> **Note:**
>
> The `options` parameter is not supported for fine-tuning `arctic-extract` models. The number of epochs is automatically
> determined by the system.

#### Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | DATABASE | The database that the Dataset object is stored in. |
| USAGE or OWNERSHIP | SCHEMA | The schema that the Dataset object is stored in. |
| READ or OWNERSHIP | STAGE | The stage that the document files are stored in. |
| USAGE or OWNERSHIP | SCHEMA | The schema that the fine-tuned model is stored in. |
| CREATE MODEL | SCHEMA | The schema that the fine-tuned model is stored in. |

Additionally, to use the FINETUNE function, the ACCOUNTADMIN role must grant the SNOWFLAKE.CORTEX_USER database
role to the user who will call the function. See [LLM Functions required privileges](aisql.md)
topic for details.

#### Example

```sqlexample
  SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  '@database.schema.model_name',
  'arctic-extract`,
  'snow://dataset/training_ds/versions/2',
  'snow://dataset/validation_ds/versions/4'
);
```

### FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)

Describes the properties of a fine-tuning job.

For syntax and parameters, see [FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)](../../sql-reference/functions/finetune-describe.md).

An example output for a successful job when fine-tuning `arctic-extract` model:

```output
{
  "base_model":"arctic-extract",
  "created_on":1717004388348,
  "finished_on":1717004691577,
  "id":"ft_6556e15c-8f12-4d94-8cb0-87e6f2fd2299",
  "model":"mydb.myschema.my_tuned_model",
  "progress":1.0,
  "status":"SUCCESS",
  "training_data":"snow://dataset/training_ds/versions/2",
  "trained_tokens":2670734,
  "training_result":{"validation_loss":1.0138969421386719,"training_loss":0.6477728401547047},
  "validation_data":"snow://dataset/validation_ds/versions/4",
}
```

## Dataset requirements

The [Dataset](../../developer-guide/snowflake-ml/dataset.md) used for training and validation must contain the following columns:

File:
:   A string containing the file path to the document for extraction. For example: `@db.schema.stage/file.pdf`

Prompt:
:   A JSON value that specifies key and question pairs for extraction in one of the formats supported by the `responseFormat` argument of
    the [AI_EXTRACT](../../sql-reference/functions/ai_extract.md) function.

    For more information, see [AI_EXTRACT](../../sql-reference/functions/ai_extract.md).

Response:
:   A JSON object containing key and response pairs.

> **Note:**
>
> Column names are case-insensitive and can be in any order in the Dataset; however, all required columns
> (`File`, `Prompt`, and `Response`) must be present for the Dataset to be valid. Additional columns in the Dataset are ignored.

When preparing the Dataset, note the following:

* The schema of the fine-tuned model is the unique set of all questions in the Dataset.
* The answers in the `Response` column should match the questions in the `Prompt` column by matching keys in the `Prompt` and `Response` columns.
* You don’t have to specify the same set of questions for every document.
* To improve model accuracy, add a prompt and response row for each question, even if the model’s default response is correct. This action confirms that the default answer is accurate.

For more information about Datasets, see [Snowflake Datasets](../../developer-guide/snowflake-ml/dataset.md).

### Example Dataset

| File | Prompt | Response |
| --- | --- | --- |
| `file1.pdf` | `{"date": "What is the date?", "total": "What is the total amount?"}` | `{"date": "2024-06-30", "total": "82.50"}` |
| `file2.pdf` | `[["invoice_number", "What is the invoice number?"], ["vendor", "What is the vendor name?"]]` | `{"invoice_number": "543433434", "vendor": "Example Corp"}` |
| `file3.pdf` | ```output {   "schema":   {     "type": "object",     "properties": {       "deductions": {         "description": "Deductions",         "type": "object",         "properties": {           "deductions_name": {             "type": "array"           },           "current": {             "type": "array"           }         }       }     }   } } ``` | ```output {   "deductions": {     "deductions_name": [       "Federal Tax",       "Wyoming State Tax",       "SDI",       "Soc Sec / OASDI",       "Health Insurance Tax",       "None"     ],     "current": [       "82.50",       "64.08",       "None",       "13.32",       "91.74",       "21.46"     ]   } } ``` |

> **Note:**
>
> When you create the Dataset, set the response to `None` if the document does not contain an answer to the question.

## Usage notes

* Snowflake recommends using at least 20 documents for fine-tuning.
* Supported file formats for documents are:

  > + PDF
  > + PNG
  > + JPG, JPEG
  > + TIFF, TIF
* The maximum number of pages per document is:

  + 64 pages for AWS US West 2 (Oregon) and AWS Europe Central 1 (Frankfurt)
  + 125 pages for AWS US East 1 (N. Virginia) and Azure East US 2 (Virginia)
* The maximum number of unique document files in the Dataset is 1,000. You can reference the same document file multiple times.
* A limit exists on how many questions and documents can be in a fine-tuning job. Number of questions multiplied by total number
  of pages in all document files in the Dataset must be equal or less than 50,000.

  For example, some valid combinations are:

  | Number of questions | Number of pages | Number of document file references [1] |
  | --- | --- | --- |
  | 10 | 1 | 5,000 |
  | 100 | 1 | 500 |
  | 10 | 10 | 500 |
  | 25 | 10 | 200 |

[1]

The total number of document file references (including repeats). The maximum number of unique document files in the Dataset is 1,000. However, you can reference the same file multiple times.

## Create a fine-tuning job

To create a fine-tuning job, you must create a Dataset object that contains the training data. The following example shows how to
create a Dataset object and use the Dataset to create a fine-tuning job for an `arctic-extract` model.

1. Create the table which will contain the training data:

   ```sqlexample
   CREATE OR REPLACE TABLE my_data_table (f FILE, p VARCHAR, r VARCHAR);
   ```
2. Populate the table with the training data:

   ```sqlexample
   INSERT INTO my_data_table (f, p, r)
   SELECT TO_FILE('@db.schema.stage', '1.pdf'), '{"net": "What is the net value?"}', '{"net": "3,762.56"}';
   ```
3. Create the Dataset object:

   ```sqlexample
   CREATE OR REPLACE DATASET my_dataset;
   ```
4. Create a new version of the Dataset that adds the training data, using the [FL_GET_STAGE](../../sql-reference/functions/fl_get_stage.md) and
   the [FL_GET_RELATIVE_PATH](../../sql-reference/functions/fl_get_relative_path.md) functions to get the file paths:

   ```sqlexample
   ALTER DATASET my_dataset
   ADD VERSION 'v1' FROM (
     SELECT FL_GET_STAGE(f) || '/' || FL_GET_RELATIVE_PATH(f) AS "file",
          p AS "prompt",
          r AS "response"
     FROM my_data_table
   );
   ```
5. Create a fine-tuning job:

   ```sqlexample
   SELECT SNOWFLAKE.CORTEX.FINETUNE(
     'CREATE',
     'my_tuned_model',
     'arctic-extract',
     'snow://dataset/db.schema.my_dataset/versions/v1'
   );
   ```

## Use your fine-tuned `arctic-extract` model for inference

To use the fine-tuned `arctic-extract` model for inference, ensure you have the following privileges on the model object:

* OWNERSHIP
* USAGE
* READ

To use the fine-tuned `arctic-extract` model for inference with the [AI_EXTRACT](../../sql-reference/functions/ai_extract.md) function,
specify the model using the `model` parameter as shown in the following example:

```sqlexample
SELECT AI_EXTRACT(
  model => 'db.schema.my_tuned_model',
  file => TO_FILE('@db.schema.files','document.pdf')
);
```

You can overwrite questions used for fine-tuning by using the `responseFormat` parameter as shown in the following example:

```sqlexample
SELECT AI_EXTRACT(
  model => 'db.schema.my_tuned_model',
  file => TO_FILE('@db.schema.files','document.pdf'),
  responseFormat => [['name', 'What is the first name of the employee?'], ['city', 'Where does the employee live?']]
);
```

For more information, see [AI_EXTRACT](../../sql-reference/functions/ai_extract.md).

> **Tip:**
>
> You can copy your fine-tuned `arctic-extract` model between databases and/or schemas within an account or between accounts.
> For more information, see [Copy arctic-extract models between databases, schemas, and accounts](copy-arctic-extract-models.md).

---
title: Getting started with Snowflake Intelligence
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/getting-started.md
section: Snowflake Cortex (AI & ML)
---

# Getting started with Snowflake Intelligence

This topic provides information about getting started with Snowflake Intelligence with a simple example of creating an enterprise agent. This agent can be used with Snowflake Intelligence to respond to questions by reasoning over both structured and unstructured data. For a more detailed guide, see [Getting Started with Snowflake Intelligence](https://www.snowflake.com/en/developers/guides/getting-started-with-snowflake-intelligence/).

## Prerequisites

* [Git installed](https://git-scm.com/book/en/v2/Getting-Started-Installing-Git)
* A Snowflake account
* Access to the ACCOUNTADMIN role

## Create a database, schema, and tables and load data from AWS S3

To create the building blocks for the enterprise agent, you must create a database, schema, tables, and load data from AWS S3.

1. Clone the [Getting Started with Snowflake Intelligence GitHub repository](https://github.com/Snowflake-Labs/sfguide-getting-started-with-snowflake-intelligence/) to your local machine:

   ```bash
   git clone https://github.com/Snowflake-Labs/sfguide-getting-started-with-snowflake-intelligence.git
   ```
2. Sign in to [Snowsight](../../ui-snowsight-gs.md).
3. In the navigation menu, select Projects » Workspaces.
4. Select + Add new.
5. Select SQL File.
6. Enter a name for the file.
7. Open the file.
8. Copy the contents of the [setup.sql](https://github.com/Snowflake-Labs/sfguide-getting-started-with-snowflake-intelligence/blob/main/setup.sql) file to the workspace.
9. Run all statements in order.
10. Run the following SQL statements in the workspace:

    ```sqlexample
    USE ROLE ACCOUNTADMIN;
    CREATE SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT;
    GRANT USAGE ON SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT TO ROLE snowflake_intelligence_admin;
    GRANT CREATE SEMANTIC VIEW ON SCHEMA DASH_DB_SI.RETAIL TO ROLE ACCOUNTADMIN;
    ```
11. Optionally, run the following SQL statement to enable cross-region inference:

    ```sqlexample
    ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'ANY_REGION';
    ```
12. Switch the user role in Snowsight to SNOWFLAKE_INTELLIGENCE_ADMIN.

## Create tools for the agent to use

Create the tools that the agent will use.

**Create a semantic view for use with Cortex Analyst.**

1. In the navigation menu, select AI & ML » Cortex Analyst.
2. Select Create new, then select Create new Semantic View.
3. For the location to store the semantic view, select DASH_DB_SI.RETAIL.
4. For the name, enter `SALES_AND_MARKETING_DATA`.
5. For the description, enter `Semantic view for sales and marketing data analysis across campaigns, products, transactions, and social media engagement.`.
6. Select Next.
7. Select Skip.
8. Select the DASH_DB_SI.RETAIL schema.
9. For the tables, select the MARKETING_CAMPAIGN_METRICS, PRODUCTS, SALES, and SOCIAL_MEDIA tables.
10. Select Next.
11. For the columns, select all available columns for the selected tables.
12. Select Next.
13. Review and accept all of the relationship and metric suggestions.
14. Select Save.
15. Wait for the semantic view to be created.

**Create a Cortex search tool by creating a search service.**

1. In the navigation menu, select AI & ML » Cortex Search.
2. Select Create.
3. For Service database and schema, select **DASH_DB_SI.RETAIL**.
4. For Service name, enter **Support_Cases**, and then select Next.
5. In the list of data sources, select the SUPPORT_CASES table, and then select Next.
6. In the list of search columns, select **TRANSCRIPT**, and then select Next.
7. For the attribute columns, select **TITLE** and **PRODUCT**, and then select Next.
8. For the columns to include, select Select all, and then select Next.
9. For the warehouse, select **DASH_WH_SI** (if that warehouse is not available, select **COMPUTE_WH**), and then select Create.

## Create a Cortex Agent

To create the agent that will use the tools, follow these steps:

1. In the navigation menu, select AI & ML » Agents.
2. Select Create agent.
3. For the schema, use SNOWFLAKE_INTELLIGENCE.AGENTS.
4. For the agent object name, use `Sales_AI`.
5. For the display name, use `Sales AI`.
6. Select Create agent.

## Add the tools to the agent

**Add the Cortex Analyst tool to the agent.**

1. From the agent page, select the Tools tab.
2. Navigate to the Cortex Analyst entry.
3. Select + Add, then select Semantic view.
4. For the database and schema, select DASH_DB_SI.RETAIL.
5. For the semantic view, select `SALES_AND_MARKETING_DATA`.
6. For the name, use `SALES_AND_MARKETING_DATA`.
7. For the description, use the following:

   > ```text
   > The Sales and Marketing Data semantic view in DASH_DB_SI.RETAIL schema provides a complete view of retail business performance by connecting marketing campaigns, product information, sales data, and social media engagement. The view enables tracking of marketing campaign effectiveness through clicks and impressions, while linking to actual sales performance across different regions. Social media engagement is monitored through influencer activities and mentions, with all data connected through product categories and IDs. The temporal alignment across tables allows for comprehensive analysis of marketing impact on sales performance and social media engagement over time.
   > ```
8. For the warehouse, select Custom, then select DASH_WH_SI.
9. For the query timeout, use `60`.
10. Select Add.

**Add the Cortex Search tool to the agent.**

1. Navigate to the Cortex Search Services entry.
2. Select + Add.
3. For the database and schema, select DASH_DB_SI.RETAIL.
4. For the search service, select `DASH_DB_SI.RETAIL.Support_Cases`.
5. For the ID column, use `ID`.
6. For the title column, use `TITLE`.
7. For the name, use `Support_Cases`.
8. Select Add.
9. Select the Orchestration tab.
10. Add the following orchestration instructions:

    ```text
    Whenever you can answer visually with a chart, always choose to generate a chart even if the user didn't specify to.
    ```
11. Select Save.

## Use Snowflake Intelligence

Interact with the agent from Snowflake Intelligence.

1. Navigate to Snowflake Intelligence using one of the methods described in [Access the agent](deploy-agents.md).
2. Select the newly created agent.
3. Enter the following prompts:

   > * “What issues are reported with jackets recently in customer support tickets?”
   > * “Show me the trend of sales by product category between June and August.”
   > * “Why did sales of Fitness Wear grow so much in July?”

---
title: Improve literal search to enhance Cortex Analyst responses
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-search-integration.md
section: Snowflake Cortex (AI & ML)
---

# Improve literal search to enhance Cortex Analyst responses

This topic describes ways to improve literal string searches to help Cortex Analyst generate more accurate
SQL queries. Writing the correct SQL query to answer a question sometimes requires knowing exact literal values to filter on.
Since those values can’t always be extracted directly from the question, a search of some kind may be needed.

For example, if a user asks a question such as:

```text
What was my overall sales of iced tea in Q1?
```

You might try the following query:

```sqlexample
SELECT DISTINCT name FROM product WHERE name LIKE '%iced%tea%'
```

If you’ve ever gone through this process yourself, you’ll know that this
isn’t a perfect solution. For example, this query won’t show you any products named “Ice Tea”, but it will show you some “spiced tea”.

Cortex Analyst offers two solutions to help improve literal usage:

* Semantic search over the provided sample values in your [semantic model](../../views-semantic/sql.md).
* Semantic search using [Cortex Search Services](../cortex-search/cortex-search-overview.md).

This is where integrating with Cortex Search can help. [Cortex Search](../cortex-search/cortex-search-overview.md)
is a feature that enables low-latency, high-quality “fuzzy” search over text data. You can create a Cortex Search service to do a semantic
search over the underlying database column to find any literal values needed for Cortex Analyst to use in the SQL query that answers the
user’s question.

## Semantic search over sample values

For dimensions with relatively low-cardinality (about 1 - 10 distinct values), using a sample value search by specifying enough sample
values to show the structure of the response for the dimension is recommended. This solution requires no
additional storage besides the minimal increase to the semantic model size.

Before Cortex Analyst generates a SQL query for your question, it does a semantic similarity search between your question and the provided
sample values to identify any appropriate literal values that may be needed to write your query. Note that the semantic similarity search
may retrieve more relevant literals than the fuzzy string matching query approach mentioned above.

Only a fixed-sized set of retrieved sample values will be presented to the LLM as literals that may be needed to write the SQL query.
That means adding more sample values does not put you at risk of exceeding the LLM’s context window.

## Semantic search using Cortex Search Service

For dimensions with higher cardinality (more than 10 distinct values) or dimensions whose values change frequently, you can use a
Cortex Search Service to search through the literals. This solution reduces data duplication and keeps your semantic model concise.

Cortex Search Services do come with additional storage and compute costs. For details, see [Cost considerations](../cortex-search/cortex-search-overview.md).

> **Note:**
>
> In this preview, only a single Cortex Search Service per logical dimension is supported.

There are two options for creating a Cortex Search Service for a logical dimension in your Cortex Analyst semantic model:

* Use the Cortex Analyst UI to create a Cortex Search Service. This is the recommended approach, because it is simpler
  and less error-prone than manual setup.
* Create a Cortex Search Service manually with SQL code. This approach is more flexible but requires you to write code.

### Option 1: Use the Cortex Analyst UI

You can create a Cortex Search Service in Snowsight using the Cortex Analyst semantic model creation UI. This approach
requires no writing or editing of SQL or YAML, and is suitable for most uses.

Sign in to [Snowsight](../../ui-snowsight-gs.md). in the navigation menu, select AI & ML » Cortex Analyst » Create new model. Follow the model creation
flow to create the Cortex Analyst semantic model. The screen for setting up Cortex Search Services is at the end of this flow.

When defining dimensions in the UI, select columns that contain text values you want to improve literal matching for.
The wizard automatically selects high cardinality columns for you, but you can choose other columns. Next, the UI lets
you choose settings for your new service, then creates the service automatically when you complete the flow.

The service is provisioned in database and schema that you selected. Once created, the service is automatically linked to your
semantic model. (The wizard also generates the YAML that links the service.)

### Option 2: Create a Cortex Search Service manually

The following steps show how to manually set up a Cortex Search Service for a logical dimension in your Cortex Analyst semantic model:

1. Create Cortex Search Service

   > ```sqlexample
   > CREATE OR REPLACE CORTEX SEARCH SERVICE my_logical_dimension_search_service
   >   ON my_dimension
   >   WAREHOUSE = xsmall
   >   TARGET_LAG = '1 hour'
   >   AS (
   >       SELECT DISTINCT my_dimension FROM my_logical_dimension_landing_table
   >   );`
   > ```
2. Include the Cortex Search service in your semantic model using the following yaml snippet:

   > ```yaml
   > tables:
   >
   >   - name: my_table
   >
   >     base_table:
   >       database: my_database
   >       schema: my_schema
   >       table: my_table
   >
   >     dimensions:
   >       - name: my_dimension
   >         expr: my_column
   >         cortex_search_service:
   >           service: my_logical_dimension_search_service
   >           literal_column: my_column     # optional
   >           database: my_search_database  # optional
   >           schema: my_search_schema      # optional
   > ```
   >
   > The following fields are optional under `cortex_search_service`:
   >
   > * `literal_column`: Defaults to the search index.
   > * `database`: Defaults to the database of the specified base table.
   > * `schema`: Defaults to the schema of the specified base table.

---
title: Integrate tools and data
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/integrate-tools.md
section: Snowflake Cortex (AI & ML)
---

# Integrate tools and data

In some cases, you may want to integrate other tools and data sources with your agents in Snowflake Intelligence. Snowflake Intelligence supports the Model Context Protocol (MCP), which is an [open-source standard](https://modelcontextprotocol.io/docs/getting-started/intro) that lets AI agents securely interact with business applications and external data systems, such as databases and content repositories.

The MCP server provides a standards-based interface that allows AI agents to discover and invoke tools, such as Cortex Analyst and Cortex Search, and retrieve the data they need. For more information, see [Snowflake-managed MCP server](../cortex-agents-mcp.md).

With MCP, you can:

* Allow your agent to retrieve data from Snowflake accounts using a Snowflake-managed MCP server without needing to deploy separate infrastructure. You can configure the MCP server to serve Cortex Analyst, Cortex Search, and Cortex Agents as tools, along with custom tools and SQL executions on the standards-based interface.
* Connect to your agents in Snowflake Intelligence from external MCP clients.

For information about creating and managing the Snowflake-managed MCP server, see [Snowflake-managed MCP server](../cortex-agents-mcp.md).

## Use the Snowflake-managed MCP server to connect to your agents from external MCP clients

Any agent that you create in Snowflake, or the tools that the agent is connected to, can have a managed endpoint for other systems to connect to your agent with MCP. This provides a seamless integration layer for tools like Claude Desktop, Langgraph, and other tools that integrate with MCP.

When connecting to your agents from an external MCP client, you must use the URL endpoint with the following format:

```none
https://<account_URL>/api/v2/databases/{database}/schemas/{schema}/mcp-servers/{name}
```

For information about formatting your account URL, see [Account identifiers](../../admin-account-identifier.md).

For information about interacting with the MCP server, see [Build an MCP client](https://modelcontextprotocol.io/docs/develop/build-client).

---
title: Managing Cortex AI Function costs with Account Usage
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-func-cost-management.md
section: Snowflake Cortex (AI & ML)
---

# Managing Cortex AI Function costs with Account Usage

Snowflake Cortex AI Functions (AI_COMPLETE, AI_SUMMARIZE, AI_TRANSLATE, AI_SENTIMENT, and others) consume credits based
on token or page usage. Without monitoring and controls, costs for using these functions can escalate quickly due to:

* Unoptimized prompts generating excessive tokens
* Long-running or runaway queries
* Lack of per-user spending limits
* Insufficient visibility into usage patterns

This topic suggests strategies for monitoring, managing, and controlling the costs associated with Snowflake Cortex AI
Functions. Using the CORTEX_AI_FUNCTIONS_USAGE_HISTORY view, you can track usage patterns and implement automated cost
controls. These techniques can help you monitor usage, alert when spending limits are exceeded, control access to
functions based on monthly limits, and stop runaway queries.

## Usage history view

The SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY view provides detailed telemetry for all Cortex AI Functions invoked via SQL.
The view has a maximum latency of sixty minutes, although data may be available in as few as ten minutes after function execution begins.
For detailed information on this view, see the [CORTEX_AI_FUNCTIONS_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_ai_functions_usage_history.md).

## Basic usage monitoring

The following queries help you understand your AI Functions usage patterns. Run these periodically yourself, or integrate them into dashboards for ongoing visibility.

### Daily credit consumption by function and model

Track daily spending trends to identify usage spikes and understand which functions and models consume the most credits.

```sqlexample
SELECT
    DATE_TRUNC('day', START_TIME) AS usage_date,
    FUNCTION_NAME,
    MODEL_NAME,
    SUM(CREDITS) AS total_credits,
    COUNT(DISTINCT QUERY_ID) AS query_count
FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY
WHERE START_TIME >= DATEADD('day', -30, CURRENT_TIMESTAMP())
GROUP BY 1, 2, 3
ORDER BY usage_date DESC, total_credits DESC;
```

### Monthly credit consumption by user

Identify top consumers and track per-user spending over time. This query joins with the USERS view to provide user details including email and default role for easier identification and follow-up.

```sqlexample
SELECT
    DATE_TRUNC('month', h.START_TIME) AS usage_month,
    u.NAME AS user_name,
    u.EMAIL,
    u.DEFAULT_ROLE,
    SUM(h.CREDITS) AS total_credits,
    COUNT(DISTINCT h.QUERY_ID) AS query_count
FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY h
JOIN SNOWFLAKE.ACCOUNT_USAGE.USERS u
    ON h.USER_ID = u.USER_ID
WHERE h.START_TIME >= DATEADD('month', -3, CURRENT_TIMESTAMP())
GROUP BY 1, 2, 3, 4
ORDER BY usage_month DESC, total_credits DESC;
```

## Cost control

Define automated mechanisms to detect excessive spending and take corrective action. These queries can be used independently of each other, or combined for comprehensive cost governance.

### Account-level monthly spending alert

Set up an automated alert that monitors total monthly AI Function credit consumption across your entire account. When spending exceeds a defined threshold, the alert sends an email notification to designated administrators.
Setting up the alert requires the following prerequisites:

* ACCOUNTADMIN role or appropriate privileges to [create notification integrations](../notifications/email-notifications.md) and [alerts](../alerts.md)
* A warehouse to execute the alert condition check
* Verified email addresses of alert recipients

First, create a notification integration if one does not already exist. This example replaces any existing integration named `ai_cost_alerts`.

```sqlexample
CREATE OR REPLACE NOTIFICATION INTEGRATION ai_cost_alerts
    TYPE = EMAIL
    ENABLED = TRUE
    ALLOWED_RECIPIENTS = ('admin@company.com', 'finops@company.com')
```

Next, create a table to track when alerts were sent for each month. This is used to prevent duplicate alerts within a month.

```sqlexample
CREATE TABLE IF NOT EXISTS AI_FUNCTIONS_ALERT_STATE (
    ALERT_NAME VARCHAR NOT NULL,
    ALERT_MONTH DATE NOT NULL,
    SENT_AT TIMESTAMP_LTZ DEFAULT CURRENT_TIMESTAMP(),
    CREDITS_AT_ALERT NUMBER(38,6),
    PRIMARY KEY (ALERT_NAME, ALERT_MONTH)
);
```

Now create a stored procedure to check if an alert was already sent this month, record the alert state, and send the email notification.

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE SEND_MONTHLY_SPEND_ALERT(P_THRESHOLD FLOAT)
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
    // Check if alert already sent this month
    var check_sent = snowflake.execute({
        sqlText: `SELECT COUNT(*) AS cnt FROM AI_FUNCTIONS_ALERT_STATE
                WHERE ALERT_NAME = 'monthly_spend'
                AND ALERT_MONTH = DATE_TRUNC('month', CURRENT_DATE())`
    });
    check_sent.next();
    var already_sent = check_sent.getColumnValue(1);

    if (already_sent > 0) {
        return 'Alert already sent for this month';
    }

    // Get current spend
    var spend_result = snowflake.execute({
        sqlText: `SELECT COALESCE(SUM(CREDITS), 0) AS total
                FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY
                WHERE START_TIME >= DATE_TRUNC('month', CURRENT_TIMESTAMP())`
    });
    spend_result.next();
    var v_credits = spend_result.getColumnValue(1);

    // Check threshold
    if (v_credits <= P_THRESHOLD) {
        return 'Threshold not exceeded. Current: ' + v_credits + ' / ' + P_THRESHOLD;
    }

    // Record alert
    snowflake.execute({
        sqlText: `INSERT INTO AI_FUNCTIONS_ALERT_STATE (ALERT_NAME, ALERT_MONTH, CREDITS_AT_ALERT)
                VALUES ('monthly_spend', DATE_TRUNC('month', CURRENT_DATE()), ?)`,
        binds: [v_credits]
    });

    // Send email - update the recipient email address
    snowflake.execute({
        sqlText: `CALL SYSTEM$SEND_EMAIL(
            'ai_cost_alerts',
            'admin@company.com',
            'AI Functions Monthly Spend Alert',
            'Monthly AI Function credit consumption has exceeded the threshold.\\n\\n' ||
            'Current spend: ' || ${v_credits}::VARCHAR || ' credits\\n' ||
            'Threshold: ' || ${P_THRESHOLD}::VARCHAR || ' credits\\n\\n' ||
            'Please review usage accordingly.'
        )`
    });

    return 'Alert sent. Credits: ' + v_credits;
$$;
```

Finally, create an alert that checks usage against the spending threshold each hour and calls the procedure to send the notification if needed.
You should adjust the limit of 1000 credits, which appears in two places in the example below, to the desired threshold.

```sqlexample
CREATE OR REPLACE ALERT ai_functions_monthly_spend_alert
    WAREHOUSE = <your_warehouse>
    SCHEDULE = 'USING CRON 0 * * * * UTC'  -- Runs every hour
    IF (EXISTS (
        SELECT 1
        FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY
        WHERE START_TIME >= DATE_TRUNC('month', CURRENT_TIMESTAMP())
        HAVING SUM(CREDITS) > 1000  -- adjust the limit accordingly
    ))
    THEN
        CALL SEND_MONTHLY_SPEND_ALERT(1000);  -- please adjust the limit accordingly

-- enable the alert
ALTER ALERT ai_functions_monthly_spend_alert RESUME;
```

> **Tip:**
>
> For testing purposes, set the limit to 0 at first to trigger the alert immediately. Recreate the alert with the desired threshold after confirming that it works as expected.
>
> After testing with a 0 threshold, run the following SQL to allow the alert to trigger again in the current month.
>
> ```sqlexample
> DELETE FROM AI_FUNCTIONS_ALERT_STATE
> WHERE ALERT_NAME = 'monthly_spend'
> AND ALERT_MONTH = DATE_TRUNC('month', CURRENT_DATE());
> ```

You can make sure that the alert is operating by querying the alert history and the alert state table as follows:

```sqlexample
-- Make sure alert exists
SHOW ALERTS LIKE 'ai_functions_monthly_spend_alert';

-- Check alert history
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ALERT_HISTORY(
    SCHEDULED_TIME_RANGE_START => DATEADD('day', -1, CURRENT_TIMESTAMP()),
    ALERT_NAME => 'ai_functions_monthly_spend_alert'
))
ORDER BY SCHEDULED_TIME DESC;

-- Check which months have had alerts sent
SELECT * FROM AI_FUNCTIONS_ALERT_STATE ORDER BY ALERT_MONTH DESC;
```

### Per-user monthly spending limits

This example implements per-user monthly spending limits. Users are granted a dedicated custom AI_FUNCTIONS_USER_ROLE that
provides access to Cortex AI Functions. A table stores individual users’ monthly token budget. When a user exceeds their
budget for the month, an hourly task revokes their access to AI Functions by removing AI_FUNCTIONS_USER_ROLE. A monthly
task restores the role at the beginning of the next month.

> **Important:**
>
> By default, all users have access to AI Functions (and other Snowflake Cortex features) because the SNOWFLAKE.CORTEX_USER database role is granted to the PUBLIC role.
> To enforce per-user limits, you must revoke SNOWFLAKE.CORTEX_USER from PUBLIC and grant it only through the AI_FUNCTIONS_USER_ROLE. Use the
> following SQL to revoke the role from PUBLIC:
>
> ```sqlexample
> REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER FROM ROLE PUBLIC;
> ```
>
> Be sure that all users who need access to Cortex features are granted only the AI_FUNCTIONS_USER_ROLE. Use of
> any other role that includes SNOWFLAKE.CORTEX_USER allows users to bypass the spending limit controls implemented in
> this example. In some cases, you could use a more specific role; for example, users who need access only to Cortex Analyst
> can be granted the SNOWFLAKE.CORTEX_ANALYST_USER role instead of SNOWFLAKE.CORTEX_USER.

To set up per-user spending limits, first create a role that controls access to AI Functions, allowing this access to be managed separately from other privileges.

```sqlexample
-- Create a role specifically for AI Function access
CREATE ROLE IF NOT EXISTS AI_FUNCTIONS_USER_ROLE;

-- Grant necessary privileges to the role
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE AI_FUNCTIONS_USER_ROLE;

-- Grant usage on warehouse
GRANT USAGE ON WAREHOUSE AI_FUNCTIONS_WAREHOUSE TO ROLE AI_FUNCTIONS_USER_ROLE;
```

Now, set up the access control table, which tracks which users have AI Function access, their individual spending
limits, and their revocation history. It serves as the source of truth for the automated monitoring and access
restoration processes.

```sqlexample
CREATE TABLE IF NOT EXISTS AI_FUNCTIONS_ACCESS_CONTROL (
    USER_NAME VARCHAR NOT NULL,
    USER_ID NUMBER,
    GRANTED_AT TIMESTAMP_LTZ DEFAULT CURRENT_TIMESTAMP(),
    MONTHLY_CREDIT_LIMIT NUMBER(38,6) DEFAULT 100,  -- adjust the limit accordingly
    IS_ACTIVE BOOLEAN DEFAULT TRUE,
    REVOKED_AT TIMESTAMP_LTZ,
    REVOCATION_REASON VARCHAR,
    PRIMARY KEY (USER_NAME)
);
```

Next, create a stored procedure to grant AI Function access to a user and register them in the access control table with their spending limit.
The code looks up the user’s ID from the Account Usage view to enable efficient joins in monitoring queries.

```sqlexample
CREATE OR REPLACE PROCEDURE GRANT_AI_FUNCTIONS_ACCESS(
    P_USER_NAME VARCHAR,
    P_MONTHLY_LIMIT NUMBER(38,6) DEFAULT 100  -- adjust the limit accordingly
)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
DECLARE
    v_user_id NUMBER;
BEGIN
    -- Look up USER_ID from account usage
    SELECT USER_ID INTO :v_user_id
    FROM SNOWFLAKE.ACCOUNT_USAGE.USERS
    WHERE NAME = :P_USER_NAME
    LIMIT 1;

    -- Grant the AI Functions role to the user
    EXECUTE IMMEDIATE 'GRANT ROLE AI_FUNCTIONS_USER_ROLE TO USER ' || P_USER_NAME;

    -- Register or update the user in the access control table
    MERGE INTO AI_FUNCTIONS_ACCESS_CONTROL tgt
    USING (SELECT :P_USER_NAME AS USER_NAME) src
    ON tgt.USER_NAME = src.USER_NAME
    WHEN MATCHED THEN
        UPDATE SET
            USER_ID = :v_user_id,
            IS_ACTIVE = TRUE,
            MONTHLY_CREDIT_LIMIT = :P_MONTHLY_LIMIT,
            GRANTED_AT = CURRENT_TIMESTAMP(),
            REVOKED_AT = NULL,
            REVOCATION_REASON = NULL
    WHEN NOT MATCHED THEN
        INSERT (USER_NAME, USER_ID, MONTHLY_CREDIT_LIMIT, IS_ACTIVE)
        VALUES (:P_USER_NAME, :v_user_id, :P_MONTHLY_LIMIT, TRUE);

    RETURN 'Access granted to ' || P_USER_NAME || ' with monthly limit of ' || P_MONTHLY_LIMIT || ' credits';
END;
$$;
```

Use this stored procedure to add users and their credit quotas to the access control table.

```sqlexample
CALL GRANT_AI_FUNCTIONS_ACCESS('ALICE', 1000);  -- grants access to user ALICE with a monthly limit of 100 credits
CALL GRANT_AI_FUNCTIONS_ACCESS('BOB', 2000);    -- grants access to user BOB with a monthly limit of 200 credits
```

Create the monthly access refresh task next. This task runs on the first day of each month to restore AI Function access
for all entitled users. When a user’s access was revoked due to exceeding their limit in the previous month, this task
grants them a fresh budget for the new month.

```sqlexample
-- Create a procedure to re-grant access to all entitled users
CREATE OR REPLACE PROCEDURE GRANT_ALL_ENTITLED_USERS()
RETURNS TABLE (USER_NAME VARCHAR, CREDIT_LIMIT NUMBER, ACTION VARCHAR)
LANGUAGE SQL
AS
$$
DECLARE
    result RESULTSET;
BEGIN
    result := (
        SELECT
            USER_NAME,
            MONTHLY_CREDIT_LIMIT AS CREDIT_LIMIT,
            'GRANTED' AS ACTION
        FROM AI_FUNCTIONS_ACCESS_CONTROL
    );

    -- Re-grant access for each entitled user
    FOR rec IN result DO
        CALL GRANT_AI_FUNCTIONS_ACCESS(rec.USER_NAME, rec.CREDIT_LIMIT);
    END FOR;

    RETURN TABLE(result);
END;
$$;

-- Create a task to run on the 1st of each month at midnight UTC
CREATE OR REPLACE TASK MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH
    WAREHOUSE = <your_warehouse>
    SCHEDULE = 'USING CRON 0 0 1 * * UTC'  -- 1st day of each month at 00:00 UTC
AS
    CALL GRANT_ALL_ENTITLED_USERS();

-- Enable the task
ALTER TASK MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH RESUME;

-- Run once initially to populate grantees
CALL GRANT_ALL_ENTITLED_USERS();

-- Verify task status
SHOW TASKS LIKE 'MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH';
```

Finally, create an hourly task to monitor user spending and revoke access for any user who exceeds their monthly limit.

```sqlexample
-- Create a procedure to re-grant access to all entitled users
CREATE OR REPLACE PROCEDURE GRANT_ALL_ENTITLED_USERS()
RETURNS TABLE (USER_NAME VARCHAR, CREDIT_LIMIT NUMBER, ACTION VARCHAR)
LANGUAGE SQL
AS
$$
DECLARE
    result RESULTSET;
BEGIN
    result := (
        SELECT
            USER_NAME,
            MONTHLY_CREDIT_LIMIT AS CREDIT_LIMIT,
            'GRANTED' AS ACTION
        FROM AI_FUNCTIONS_ACCESS_CONTROL
    );

    -- Re-grant access for each entitled user
    FOR rec IN result DO
        CALL GRANT_AI_FUNCTIONS_ACCESS(rec.USER_NAME, rec.CREDIT_LIMIT);
    END FOR;

    RETURN TABLE(result);
END;
$$;

-- Create a task to run on the 1st of each month at midnight UTC
CREATE OR REPLACE TASK MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH
    WAREHOUSE = <your_warehouse>
    SCHEDULE = 'USING CRON 0 0 1 * * UTC'  -- 1st day of each month at 00:00 UTC
AS
    CALL GRANT_ALL_ENTITLED_USERS();

-- Enable the task
ALTER TASK MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH RESUME;

-- Run once initially to populate grantees
CALL GRANT_ALL_ENTITLED_USERS();

-- Verify task status
SHOW TASKS LIKE 'MONTHLY_AI_FUNCTIONS_ACCESS_REFRESH';
```

### Runaway query detection and cancellation

Long-running AI Function queries can accumulate significant costs. This example implements an automated system to detect queries that exceed a credit threshold and cancel them before they consume even more resources.
An email alert is sent with full query details.

> **Note:**
>
> When a query is cancelled, the client is still charged for all resources consumed up to the moment of cancellation. Cancelling a runaway query prevents further cost accumulation but does not refund credits already spent.

This procedure finds AI Function queries from the last 48 hours that have exceeded the credit threshold and are still running, cancels them, and reports them to an administrator.

```sqlexample
-- Create a procedure to detect and cancel expensive runaway queries
CREATE OR REPLACE PROCEDURE MONITOR_AND_CANCEL_RUNAWAY_QUERIES(
    P_CREDIT_THRESHOLD NUMBER DEFAULT 50  -- adjust the limit accordingly
)
RETURNS TABLE (
    QUERY_ID VARCHAR,
    USER_NAME VARCHAR,
    FUNCTION_NAME VARCHAR,
    MODEL_NAME VARCHAR,
    CREDITS NUMBER,
    START_TIME TIMESTAMP_LTZ,
    ACTION VARCHAR
)
LANGUAGE SQL
AS
$$
DECLARE
    result RESULTSET;
BEGIN
    -- Find queries from the last 48 hours that exceed the threshold and are still running
    result := (
        SELECT
            h.QUERY_ID,
            u.NAME AS USER_NAME,
            h.FUNCTION_NAME,
            h.MODEL_NAME,
            h.CREDITS,
            h.START_TIME,
            h.ROLE_NAMES,
            h.QUERY_TAG,
            h.WAREHOUSE_ID,
            'CANCELLED' AS ACTION
        FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AI_FUNCTIONS_USAGE_HISTORY h
        LEFT JOIN SNOWFLAKE.ACCOUNT_USAGE.USERS u
            ON h.USER_ID = u.USER_ID
        WHERE h.START_TIME >= DATEADD('hour', -48, CURRENT_TIMESTAMP())
        AND h.CREDITS > :P_CREDIT_THRESHOLD
        AND h.IS_COMPLETED = FALSE
    );

    -- Cancel each runaway query and send alert
    FOR rec IN result DO
        -- Attempt to cancel the query
        BEGIN
            EXECUTE IMMEDIATE 'SELECT SYSTEM$CANCEL_QUERY(''' || rec.QUERY_ID || ''')';
        EXCEPTION
            WHEN OTHER THEN
                NULL;  -- Query may have already completed
        END;

        -- Send alert with query details
        CALL SYSTEM$SEND_EMAIL(
            'ai_cost_alerts',
            'admin@company.com',
            'Runaway AI Query Cancelled - ' || rec.QUERY_ID,
            'A runaway AI Function query has been cancelled due to excessive cost.\n\n' ||
            'Query Details:\n' ||
            '- Query ID: ' || rec.QUERY_ID || '\n' ||
            '- User: ' || COALESCE(rec.USER_NAME, 'Unknown') || '\n' ||
            '- Function: ' || rec.FUNCTION_NAME || '\n' ||
            '- Model: ' || rec.MODEL_NAME || '\n' ||
            '- Credits Used: ' || rec.CREDITS::VARCHAR || '\n' ||
            '- Threshold: ' || :P_CREDIT_THRESHOLD::VARCHAR || '\n' ||
            '- Start Time: ' || rec.START_TIME::VARCHAR || '\n' ||
            '- Roles: ' || COALESCE(rec.ROLE_NAMES::VARCHAR, 'N/A') || '\n' ||
            '- Query Tag: ' || COALESCE(rec.QUERY_TAG, 'N/A') || '\n' ||
            '- Warehouse ID: ' || COALESCE(rec.WAREHOUSE_ID::VARCHAR, 'N/A') || '\n\n' ||
            'Please investigate this query and take appropriate action.'
        );
    END FOR;

    RETURN TABLE(result);
END;
$$;

-- Create a task to monitor and cancel runaway queries every hour
CREATE OR REPLACE TASK MONITOR_RUNAWAY_AI_QUERIES
    WAREHOUSE = <your_warehouse>
    SCHEDULE = 'USING CRON 0 * * * * UTC'  -- Every hour
AS
    CALL MONITOR_AND_CANCEL_RUNAWAY_QUERIES(50);  -- adjust the limit accordingly

-- Enable the task
ALTER TASK MONITOR_RUNAWAY_AI_QUERIES RESUME;

-- Verify task status
SHOW TASKS LIKE 'MONITOR_RUNAWAY_AI_QUERIES';

-- Check task execution history
SELECT *
FROM TABLE(INFORMATION_SCHEMA.TASK_HISTORY(
    SCHEDULED_TIME_RANGE_START => DATEADD('day', -1, CURRENT_TIMESTAMP()),
    TASK_NAME => 'MONITOR_RUNAWAY_AI_QUERIES'
))
ORDER BY SCHEDULED_TIME DESC;
```

> **Tip:**
>
> If you already know that some of your queries will run a long time, define a special role for these queries, and then exclude that role
> from the cancellation logic. For example, to create the role:
>
> ```sqlexample
> CREATE ROLE AI_FUNCTIONS_USER_LONG_RUNNING_ROLE;
> GRANT ROLE AI_FUNCTIONS_USER_ROLE TO ROLE AI_FUNCTIONS_USER_LONG_RUNNING_ROLE;
> GRANT ROLE AI_FUNCTIONS_USER_LONG_RUNNING_ROLE TO USER LONG_RUNNING_USER;
> ```
>
> Add the following condition to the WHERE clause of the procedure to exclude queries run by users with this role from being cancelled.
>
> ```sqlexample
> AND NOT ARRAY_CONTAINS(h.ROLE_NAMES, 'AI_FUNCTIONS_USER_LONG_RUNNING_ROLE')
> ```
>
> Now the user can assume the role to run a long-running query without it being canceled:
>
> ```sqlexample
> USE ROLE AI_FUNCTIONS_USER_LONG_RUNNING_ROLE;
> -- then start the long-running query
> ```

## Best practices

Keep the following best practices in mind when developing a cost management strategy for AI Function usage:

* **Start with monitoring:** Before implementing automated controls, establish baseline usage patterns using the queries in
  Basic usage monitoring.
* **Set conservative initial limits:** Begin with lower thresholds and adjust upward based on actual usage patterns.
* **Use query tags:** Encourage teams to use QUERY_TAG session parameters to enable cost attribution by project or team.
* **Review regularly:** Periodically review the access control table and adjust per-user limits based on legitimate needs.
* **Test alerts:** Verify that email notifications work correctly before relying on them for critical alerts.
* **Consider latency:** The ACCOUNT_USAGE view has up to 60 minutes of latency; factor this into your monitoring strategy.

---
title: Monitor Cortex Agent requests
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-monitor.md
section: Snowflake Cortex (AI & ML)
---

# Monitor Cortex Agent requests

Cortex Agents log detailed traces of all conversations for auditing and debugging purposes. With monitoring,
you can access the conversation history of an Agent deployed via Snowflake Intelligence or Agent API.
In addition to conversation history, you can review detailed tracing of the agent’s planning process,
tool selection, execution results, and final response generation.

## Information collected in Cortex Agent logs

Cortex Agent logs include the following information:

* Conversation history associated with a thread
* Agent’s execution trace with spans including:

  + LLM planning
  + Tool execution (Cortex Search, Cortex Analyst, web search)
  + LLM response generation
  + SQL execution
  + Chart generation
* Inputs and outputs associated with each span
* User feedback for each Agent response

## Access Cortex Agent logs

To view Cortex Agent conversation logs in Snowsight, do the following:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select the Agent whose logs you wish to view.
4. Navigate to the Monitoring pane of the Agent view.

The monitoring logs associated with the Agent are stored in the [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS. Entries in this table can’t be modified.

Administrators with the AI_OBSERVABILITY_ADMIN application role can delete entries in the
SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS table.

### View feedback provided by users

To view user feedback about agents programmatically, run the following SQL command:

> ```sqlexample
> SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_OBSERVABILITY_EVENTS('<database_name>', '<schema_name>', '<agent_name>', 'CORTEX AGENT')) WHERE RECORD:name='CORTEX_AGENT_FEEDBACK';
> ```

The resulting table contains columns that include information about the agent, the user who provided feedback, feedback provided by the user, and whether the feedback was positive or negative.

## Access control and permissions

To view Cortex Agent logs, users must have the following privileges:

* OWNERSHIP or MONITOR privileges on the AGENT object
* The CORTEX_USER database role

The following example uses the ACCOUNTADMIN role to create a new role `agent_monitoring_user_role`
with the required permissions to view Cortex Agent logs. This new role is then assigned to `some_user`.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE agent_monitoring_user_role;
GRANT MONITOR ON AGENT my_agent TO ROLE agent_monitoring_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE agent_monitoring_user_role;
GRANT ROLE agent_monitoring_user_role TO USER some_user;
```

### Grant monitoring access to future agents

To grant a role monitoring access on future agents created in a schema, use the following SQL command:

```sqlexample
GRANT MONITOR ON FUTURE AGENTS IN SCHEMA <database_name>.<schema_name> TO ROLE <role_name>;
```

---
title: Monitor Cortex Search requests
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/cortex-search-monitor.md
section: Snowflake Cortex (AI & ML)
---

# Monitor Cortex Search requests

Cortex Search logs detailed information about search requests for monitoring and debugging purposes.
With request logging enabled, you can review query patterns, response times, and request details
for a Cortex Search Service.

## Information collected in request logs

Cortex Search request logs include the following information:

* Operation type (for example, QUERY)
* Full request body, including query text and parameters
* Response status code
* Response time in milliseconds
* Database, schema, and service name
* User, role, and session information

## Enable request logging

To collect request logs for a Cortex Search Service, enable the `REQUEST_LOGGING` property on
the service.

You can enable request logging when you create a service:

```sqlexample
CREATE CORTEX SEARCH SERVICE my_search_service
  ON text_col
  ATTRIBUTES category
  WAREHOUSE = my_wh
  TARGET_LAG = '1 hour'
  REQUEST_LOGGING = TRUE
AS (SELECT * FROM my_table);
```

You can also enable request logging on an existing service:

```sqlexample
ALTER CORTEX SEARCH SERVICE my_search_service SET REQUEST_LOGGING = TRUE;
```

To disable request logging:

```sqlexample
ALTER CORTEX SEARCH SERVICE my_search_service SET REQUEST_LOGGING = FALSE;
```

## Operational considerations

### Volume of log data

Each logged Cortex Search request produces one event row in
`SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS`. How much data you accumulate depends on your request
rate and how long logging stays enabled. Set retention and storage to match the log volume you
expect to keep.

### Cost considerations

Data stored in `SNOWFLAKE.LOCAL` tables incurs Snowflake [storage charges](../../cost-understanding-data-storage.md).
Querying request logs with SQL uses warehouse resources like any other query.

### Query latency

Enabling request logging does not affect the latency of Cortex Search query requests.

## Access Cortex Search request logs

The request logs for a Cortex Search Service are stored in the event table
SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS. You can access these logs using a table function or by
querying the event table directly.

### Using the `snowflake.local.get_ai_observability_events` function

Users with the `MONITOR` privilege on a Cortex Search Service can view
request logs for that service using the `snowflake.local.get_ai_observability_events`
function.

1. **Grant MONITOR privilege**: Grant the MONITOR privilege on the Cortex Search Service
   to the role that will use the function:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   GRANT MONITOR ON CORTEX SEARCH SERVICE <service_name> TO ROLE <role_name>;
   ```
2. **Query observability events**: Call `snowflake.local.get_ai_observability_events`
   using the role with the MONITOR privilege:

   ```sqlexample
   USE ROLE <role_name>;
   SELECT * FROM TABLE(snowflake.local.get_ai_observability_events(
     '<database_name>',
     '<schema_name>',
     '<service_name>',
     'CORTEX SEARCH SERVICE'
   ));
   ```

### Querying the event table as ACCOUNTADMIN

Users with the ACCOUNTADMIN role can query the event table directly:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT * FROM snowflake.local.ai_observability_events
WHERE observed_timestamp > TIMESTAMPADD(minute, -10, CURRENT_TIMESTAMP())
  AND record['name'] = 'CORTEX_SEARCH_REQUEST';
```

> **Note:**
>
> Users with the ACCOUNTADMIN role can query the `snowflake.local.ai_observability_events` table
> and access the request events for all Cortex Search Services in the account.

## Access control and permissions

To view Cortex Search request logs, users must have one of the following:

* OWNERSHIP or MONITOR privilege on the Cortex Search Service
* The ACCOUNTADMIN role (for direct event table access)

The following example uses the ACCOUNTADMIN role to create a new role `search_monitoring_role`
with the required permissions to view Cortex Search request logs:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE search_monitoring_role;
GRANT MONITOR ON CORTEX SEARCH SERVICE my_search_service TO ROLE search_monitoring_role;
GRANT ROLE search_monitoring_role TO USER some_user;
```

## Output schema

The `snowflake.local.get_ai_observability_events` function returns
a table with the following columns:

Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_NTZ(9) | Time of the event |
| START_TIMESTAMP | TIMESTAMP_NTZ(9) | Start time of the event (may be NULL) |
| TRACE | OBJECT | Trace information for the event (may be NULL) |
| RESOURCE_ATTRIBUTES | OBJECT | Contains session, user, and role information including session ID, user ID, user name, role ID, and role name |
| RECORD_TYPE | STRING | Type of record, typically ‘EVENT’ for Cortex Search requests |
| RECORD | OBJECT | Contains the event name, typically ‘CORTEX_SEARCH_REQUEST’ |
| RECORD_ATTRIBUTES | OBJECT | Contains detailed observability metadata including database, schema, service, user, role, and session information |
| VALUE | VARIANT | Contains the actual request details including operation type, request body, response status code, and response time |

The VALUE column contains the following key fields:

* `snow.ai.observability.operation_type`: The type of operation, such as ‘QUERY’
* `snow.ai.observability.request_body`: The full request including query text and parameters
* `snow.ai.observability.response_status_code`: HTTP status code of the response
* `snow.ai.observability.response_time_ms`: Response time in milliseconds
* `snow.ai.observability.database.name`: Database containing the Cortex Search Service
* `snow.ai.observability.schema.name`: Schema containing the Cortex Search Service
* `snow.ai.observability.object.name`: Name of the Cortex Search Service

The following is an example of data found in the VALUE column:

```javascript
{
  "snow.ai.observability.operation_type": "QUERY",
  "snow.ai.observability.request_body": {
    "experimental": null,
    "limit": 10,
    "multi_index_query": null,
    "query": "hello"
  },
  "snow.ai.observability.response_status_code": 200,
  "snow.ai.observability.response_time_ms": 391
}
```

## Example

Query the request logs for the last 24 hours, using a role with `MONITOR` privilege on the service.

```sqlexample
SELECT
  timestamp,
  record_attributes['ai.observability.record_id']::STRING as query_id,
  value['snow.ai.observability.request_body']['query']::STRING AS query_text,
  value['snow.ai.observability.request_body']['limit']::INT AS limit,
  value['snow.ai.observability.response_status_code']::INT AS status_code,
  value['snow.ai.observability.response_time_ms']::INT AS response_time_ms,
  record_attributes['snow.ai.observability.database.name']::STRING AS database_name,
  record_attributes['snow.ai.observability.schema.name']::STRING AS schema_name,
  record_attributes['snow.ai.observability.object.name']::STRING AS service_name,
  record_attributes['snow.ai.observability.user.name']::STRING AS user_name,
  record_attributes['snow.ai.observability.role.name']::STRING AS role_name,
  record_attributes['snow.ai.observability.session.id']::STRING AS session_id,
FROM TABLE(snowflake.local.get_ai_observability_events(
  '<database_name>',
  '<schema_name>',
  '<service_name>',
  'CORTEX SEARCH SERVICE'
))
WHERE timestamp > TIMESTAMPADD(hour, -24, CURRENT_TIMESTAMP())
ORDER BY timestamp DESC;
```

---
title: Onboarding questions in Cortex Analyst
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/suggested-questions-feature.md
section: Snowflake Cortex (AI & ML)
---

# Onboarding questions in Cortex Analyst

The *onboarding questions* feature in Cortex Analyst provides relevant suggestions for questions your users can ask while
interacting with your Cortex Analyst–powered conversational app, which will help them get started.

## How onboarding questions work

Cortex Analyst operates in one of three modes, depending on the configuration of your semantic model:

1. Generates questions using **Large Language Models** (Default mode without Verified Query Repository)

   > When your semantic model doesn’t include a Verified Query Repository (VQR), Cortex Analyst uses the
   > underlying Large Language Models (LLMs) to generate up to three suggested questions.
   > Note that these questions may not always be answerable; for instance, the system might suggest a question that
   > yields no results.
2. Suggests questions from the **Verified Query Repository** (Default mode with VQR)

   > If your semantic model has a [Verified Query Repository (VQR)](verified-query-repository.md)
   > defined, Cortex Analyst returns up to five suggested questions from the VQR. These questions are selected based on their similarity to
   > the user’s input. For example, if a user asks, `What questions can I ask about revenue?`, Cortex Analyst returns up to 5 questions
   > that are most likely about revenue from the VQR repository that are most likely answerable.
3. Returns **onboarding questions** configured in the semantic model (Customizable Mode with VQR)

   > For more control over which questions are displayed, you can use the new `use_as_onboarding_question` flag in your VQR configuration.
   >
   > * When this flag is set to true, Cortex Analyst will return **all** questions marked as onboarding questions, regardless of their
   >   similarity to the user’s input.
   > * This feature is helpful if you want to present a full set of predefined, answerable questions for users, such as in an
   >   onboarding experience. If you flag more than 5 questions, all of the flagged questions are returned in the response.

## How to Configure Onboarding Questions

To define onboarding questions, you need to mark specific verified queries in the
semantic model with the `use_as_onboarding_question` flag. The example below shows how to set this up:

```yaml
verified_queries:

- name: "lowest revenue each month"
  question: For each month, what was the lowest daily revenue and on what date did that lowest revenue occur?

  use_as_onboarding_question: true

  sql: "WITH monthly_min_revenue AS (
SELECT
    DATE_TRUNC('MONTH', date) AS month,
    MIN(daily_revenue) AS min_revenue
FROM __daily_revenue
GROUP BY
DATE_TRUNC('MONTH', date)

)

SELECT
    mmr.month,
    mmr.min_revenue,
    dr.date AS min_revenue_date
FROM monthly_min_revenue AS mmr JOIN __daily_revenue AS dr
ON mmr.month = DATE_TRUNC('MONTH', dr.date)
AND mmr.min_revenue = dr.daily_revenue
ORDER BY mmr.month DESC NULLS LAST"

verified_at: 1715187400

verified_by: user_name
```

---
title: Opt out of Snowflake AI features
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/opting-out.md
section: Snowflake Cortex (AI & ML)
---

# Opt out of Snowflake AI features

Most Snowflake AI features are initially available to all users in your Snowflake account. Access to most features is
controlled by the SNOWFLAKE.CORTEX_USER database role, which is initially granted to the PUBLIC role. All users are
granted the PUBLIC role, giving them access to Cortex features by default. (Access to Snowflake Copilot is controlled by
the SNOWFLAKE.COPILOT_USER database role, also granted to PUBLIC by default.) Cortex Analyst is an opt-in feature that is
not accessible to users by default.

## Opt out of default features

To revoke access to all Snowflake AI features that are available to users by default, revoke the SNOWFLAKE.CORTEX_USER and
SNOWFLAKE.COPILOT_USER database roles from the PUBLIC role. You can grant these roles to specific roles that you want to have
access to the features, then grant those roles to specific users as needed. (You cannot grant database roles
directly to users, but must grant them to roles that can be assumed by users.)

Use SQL like the following to revoke access to the SNOWFLAKE.CORTEX_USER and SNOWFLAKE.COPILOT_USER roles from the PUBLIC role, then grant them
to specific roles and users.

```sqlexample
-- Revoke access to most Snowflake AI features from all users in the account
REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER FROM ROLE PUBLIC;
REVOKE DATABASE ROLE SNOWFLAKE.COPILOT_USER FROM ROLE PUBLIC;

-- Optionally, grant access to specific roles
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE my_cortex_role;
GRANT DATABASE ROLE SNOWFLAKE.COPILOT_USER TO ROLE my_copilot_role;

-- Then grant those roles to specific users
GRANT ROLE my_cortex_role TO USER alice;
GRANT ROLE my_copilot_role TO USER bob;
```

> **Note:**
>
> If you granted SNOWFLAKE.CORTEX_USER and SNOWFLAKE.COPILOT_USER to other roles, revoke them from those roles
> to completely block users from using Snowflake AI features.

## Revoke access to opt-in features

Some Snowflake AI features are opt-in. Access to these features is disabled by default, so unless you grant
access to them, your users cannot use them. If you have granted access to any of these features, you can revoke access
to individual features:

* **Cortex Analyst:** Set the ENABLE_CORTEX_ANALYST account parameter to FALSE:

  ```sqlexample
  ALTER ACCOUNT SET ENABLE_CORTEX_ANALYST = FALSE;
  ```
* **Cortex Embedding Functions** (AI_EMBED, EMBED_TEXT_768, and EMBED_TEXT_1024): Calling these functions
  requires the SNOWFLAKE.CORTEX_EMBED_USER database role if the user does not have the SNOWFLAKE.CORTEX_USER database role.
  Revoke SNOWFLAKE.CORTEX_EMBED_USER role from any roles you have granted it to.

  ```sqlexample
  REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_EMBED_USER FROM ROLE my_role;
  ```
* **Cortex Fine-tuning:** Revoke the CREATE MODEL privilege on schemas from any roles you have granted it to.

  ```sqlexample
  REVOKE CREATE MODEL ON SCHEMA my_schema FROM ROLE my_role;
  ```
* **Provisioned Throughput:** Revoke the CREATE PROVISIONED THROUGHPUT privilege on schemas from any roles you have granted it to.

  ```sqlexample
  REVOKE CREATE PROVISIONED THROUGHPUT ON SCHEMA my_schema FROM ROLE my_role;
  ```

## Access control by feature

The following table has more detailed information on access control for individual Snowflake AI features:

| Feature | Opt in | Main access control method | Additional access control methods |
| --- | --- | --- | --- |
| [Cortex Agents](cortex-agents.md) |  | SNOWFLAKE.CORTEX_USER database role | USAGE on the search service that the agent queries, plus USAGE on the database, schema, and table used by the search service |
| [Cortex AI Functions](aisql.md) |  | SNOWFLAKE.CORTEX_USER database role |  |
| [Cortex Analyst](cortex-analyst.md) | ✔ | ENABLE_CORTEX_ANALYST account parameter |  |
| [Cortex Fine-tuning](cortex-finetuning.md) | ✔ | CREATE MODEL on the schema where you create fine-tuned models |  |
| [Cortex Knowledge Extensions](cortex-knowledge-extensions/cke-overview.md) |  | SNOWFLAKE.CORTEX_USER database role | Relies on access control for the underlying Cortex Search Service |
| [Cortex Provisioned Throughput](provisioned-throughput.md) | ✔ | CREATE PROVISIONED THROUGHPUT privilege on the schema where you create provisioned throughput objects |  |
| [Cortex Search](cortex-search/cortex-search-overview.md) |  | SNOWFLAKE.CORTEX_USER database role | USAGE on the search service, database, schema, and table used by the search service |
| [Snowflake Copilot](../snowflake-copilot.md) |  | SNOWFLAKE.COPILOT_USER database role |  |
| [Snowflake Intelligence](snowflake-intelligence.md) |  | SNOWFLAKE.CORTEX_USER database role | Relies on access control for the underlying Cortex Agent or Search Service |

## Opt out of specific models and AI Functions

Because the cost of using different large language models varies, you can limit access to specific LLMs via an
account-level allowlist, by role-based access control, or by a combination of both. For more information, see
[Control model access](aisql.md).

## ACCOUNTADMIN and AI features

The ACCOUNTADMIN role has complete access to all features in a Snowflake account, including Snowflake AI features.
Revoking the SNOWFLAKE.CORTEX_USER and SNOWFLAKE.COPILOT_USER roles from PUBLIC does not prevent ACCOUNTADMIN from using these features.
Even if an ACCOUNTADMIN’s access to AI features is revoked, a user with access to ACCOUNTADMIN can always
grant access to that role (or any other role) again.

For this and other reasons, it is a best practice to grant the ACCOUNTADMIN role to trusted users only, or even more
strictly, to a single user in the account which is not used for any purpose other than Snowflake account administration
and whose login credentials are tightly controlled. Use ACCOUNTADMIN only for account setup and maintenance, and use
other administrative roles with more limited scope (that is, SECURITYADMIN, SYSADMIN, or USERADMIN) for day-to-day
administration.

It is possible to prevent ACCOUNTADMIN from using Snowflake AI features that are gated by means other than role-based
access control. For example, even a user with ACCOUNTADMIN can’t use Cortex Analyst if the ENABLE_CORTEX_ANALYST account
parameter is set to FALSE. Of course, this user can always set this parameter to TRUE.

## Monitor AI feature usage

To make sure that Snowflake AI features are not being used, monitor usage of Snowflake AI features using the
Cortex-related views in the [SNOWFLAKE.ACCOUNT_USAGE](../../sql-reference/account-usage.md) schema. These views are:

* [CORTEX_ANALYST_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_analyst_usage_history.md)
* [CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_document_processing_usage_history.md)
* [CORTEX_FINE_TUNING_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_fine_tuning_usage_history.md)
* [CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_functions_query_usage_history.md)
* [CORTEX_FUNCTIONS_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_functions_usage_history.md)
* [CORTEX_PROVISIONED_THROUGHPUT_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_provisioned_throughput_usage_history.md)
* [CORTEX_SEARCH_DAILY_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_search_daily_usage_history.md)
* [CORTEX_SEARCH_SERVING_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_search_serving_usage_history.md)

> **Note:**
>
> CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY and CORTEX_FUNCTIONS_USAGE_HISTORY log essentially the same events. It
> is not necessary to monitor both.

[Create alerts on new data](../alerts.md) in these views to notify you when new AI features are
used in your account. For example, the following SQL statement creates an alert that sends a Slack message when any AI function is used:

```sqlexample
CREATE ALERT my_alert
  IF (EXISTS (
    SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY))
  THEN
    BEGIN
      CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
        SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('AI function used in account'),
        '{"my_slack_integration": {}}'
      );
    END;
```

Such alerts incur a nominal compute cost when new data is added to a Cortex usage history view, but if no AI features
are used, there is no cost because no data is ever added and the alert is never triggered.

## Control access to ML features

Snowflake ML features are not AI features, and access to them is not controlled by the SNOWFLAKE.CORTEX_USER role.

### ML Functions

[ML Functions](../../guides-overview-ml-functions.md) employ classical machine learning techniques for forecasting, anomaly
detection, classification, and other data analysis tasks. Creation of models by ML Functions is opt-in and controlled by
a function-specific privilege, such as CREATE SNOWFLAKE.ML.FORECAST, on schemas. Access to trained models is controlled
by the USAGE privilege on the model object. If you have granted these privileges already, revoke them to prevent users
from creating or using ML Functions models. You may want to DROP any models that have already been created.

Owners of schemas can create ML Functions models in them, regardless of whether they have CREATE privileges for a
specific type of model, so limit ownership and creation of schemas to trusted users. Grant specific privileges to create
models within each schema only to users who need them.

### Snowflake ML

[Snowflake ML](../../developer-guide/snowflake-ml/overview.md) lets you build, deploy, and manage custom machine learning
models developed in Python, at Snowflake scale. Creation and use of Snowflake ML objects, including the model registry,
the feature store, and models and their versions, is not controlled by the SNOWFLAKE.CORTEX_USER role.

Snowflake ML objects are schema-level objects, which means that users can create Snowflake ML objects in any schema on
which they have OWNERSHIP or an appropriate CREATE privilege (for example, CREATE MODEL REGISTRY). Therefore, access to
Snowflake ML is best controlled by limiting ownership and creation of schemas to trusted users. Grant specific
privileges to create Snowflake ML objects within each schema only to users who need them.

> **Note:**
>
> Users with the CREATE MODEL privilege in a schema can also create models using Cortex Fine-tuning. However, actually
> using Cortex fine-tuned models requires the SNOWFLAKE.CORTEX_USER database role.

---
title: Optimize an existing semantic view or model with verified queries
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/analyst-optimization.md
section: Snowflake Cortex (AI & ML)
---

# Optimize an existing semantic view or model with verified queries

Snowflake allows you to optimize existing semantic views and models using only verified queries, by analyzing your verified queries to find useful information to add to the rest of the semantic layer. This optimization helps Cortex Analyst answer a broader range of questions correctly, not just those that match with existing verified queries.

Consider this verified query: “How many active users did we have last month?” Cortex Analyst uses the verified SQL to determine how you’re defining *active*. From there, it can suggest the addition of an “is_active” filter on the customer table, using that exact definition of *active users*. This filter then gives Cortex Analyst more accurate results for queries about “active users”.

This optimization feature is part of an iteration feedback loop that helps Cortex Analyst improve its accuracy and coverage over time:

1. Cortex Analyst suggests common and useful user questions for addition based on usage data and query history.
2. Users verify the suggested queries and add them to the list of verified queries.
3. Cortex Analyst uses these verified queries to generate more generalizable semantic model concepts and improve suggested queries.

## Prerequisites

* Ensure that you have the CORTEX_USER role, which is granted by default, directly or indirectly. Secondary roles are not valid for this purpose.
* Have access to at least one large language model (LLM). We recommend using Claude Sonnet 4, but you can use any other LLM.
* Ensure that you have read access to the underlying tables and columns that you will interact with using Cortex Analyst.
* Have an existing semantic view or model with at least one [verified query](verified-query-repository.md).

  > **Note:**
  >
  > Cortex Analyst can learn more from unique verified queries using optimization. Simple queries may not have as much useful information.

  + You can use the suggestions panel to get ideas for useful verified queries to add.
  + Adding more than 20 verified queries can cause the optimization feature to take longer.

## Use optimization

To use optimization, select a warehouse that can run your verified queries without too much delay. Cortex Analyst might execute verified queries up to four times per verified query. The process can take from a few minutes for a small number of verified queries to hours for dozens of slow-running verified queries.

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Cortex Analyst.
3. From the list, select the semantic view or model to optimize.
4. In the right pane under Suggestions, select Get more suggestions.
5. Select the role that will run optimization.
6. Select the warehouse that will run verified queries.

---
title: Parsing documents with AI_PARSE_DOCUMENT
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/parse-document.md
section: Snowflake Cortex (AI & ML)
---

# Parsing documents with AI_PARSE_DOCUMENT

AI_PARSE_DOCUMENT is a Cortex AI Function that extracts text, data, layout elements, and images from documents.
It can be used with other functions to create custom document processing pipelines for a variety of use cases
(see [Cortex AI Functions: Documents](ai-documents.md)).

For information on using AI_PARSE_DOCUMENT to extract images, with examples, see [Cortex AI Functions: Image extraction with AI_PARSE_DOCUMENT](image-extraction.md).

The function extracts text and layout from documents stored on internal or external stages and preserves reading order and structures like
tables and headers. For information about creating a stage suitable for storing documents, see [Create stage for media files](aisql.md).

AI_PARSE_DOCUMENT orchestrates advanced AI models for document understanding and layout analysis and processes complex
multi-page documents with high fidelity.

The AI_PARSE_DOCUMENT function offers two modes for processing PDF documents:

* **LAYOUT** mode is the preferred choice for most use cases, especially for complex documents.
  It’s specifically optimized for extracting text and layout elements like tables, making it the best option for
  building knowledge bases, optimizing retrieval systems, and enhancing AI based applications.
* **OCR** mode is recommended for quick, high-quality text extraction from documents such as
  manuals, agreements or contracts, product detail pages, insurance policies and claims, and
  [SharePoint documents](../../connectors/unstructured-data-connectors/sharepoint/about.md).

For both modes, use the `page_split` option to split multi-page documents into separate
pages in the response. You can also use the `page_filter` option to process only specified pages.
If using `page_filter`, `page_split` is implied, and you do not need to set it explicitly.

AI_PARSE_DOCUMENT is horizontally scalable, enabling efficient batch processing of multiple
documents simultaneously. Documents can be processed directly from object storage to avoid unnecessary data movement.

> **Note:**
>
> AI_PARSE_DOCUMENT is currently incompatible with custom [network policies](../network-policies.md).

## Examples

### Simple layout example

This example uses AI_PARSE_DOCUMENT’s LAYOUT mode to process a two-column research paper. The `page_split` parameter
is set to TRUE in order to separate the document into pages in the response. AI_PARSE_DOCUMENT returns the content in Markdown
format. The following shows rendered Markdown for one of the processed pages (page index 4 in the JSON output) next to
the original page. The raw Markdown is shown in the JSON response following the images.

| Page from the original document | Extracted Markdown rendered as HTML |
| --- | --- |
|  |  |

> **Tip:**
>
> To view either of the these images at a more legible size, select it by clicking or tapping.

The following is the SQL command to process the original document:

```sqlexample
SELECT AI_PARSE_DOCUMENT (
    TO_FILE('@docs.doc_stage','research-paper-example.pdf'),
    {'mode': 'LAYOUT' , 'page_split': true}) AS research_paper_example;
```

The response from AI_PARSE_DOCUMENT is a JSON object containing metadata and text from the pages of the document, like
the following. Some page objects have been omitted for brevity.

```output
{
  "metadata": {
    "pageCount": 19
  },
  "pages": [
    {
      "content": "# SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation \n\nAurick Qiao
      Zhewei Yao Samyam Rajbhandari Yuxiong He<br>Snowflake AI Research<br>San Mateo, CA, United States<br>Correspondence:
      aurick.qiao@ snowflake.com\n\n\n#### Abstract\n\nLLM inference for enterprise applications, such as summarization, RAG,
      and code-generation, typically observe much longer prompt than generations, leading to high prefill cost and response
      latency. We present SwiftKV, a novel model transformation and distillation procedure targeted at reducing the prefill
      compute (in FLOPs) of prompt tokens while preserving high generation quality. First, SwiftKV prefills later layers' KV
      cache using an earlier layer's output, allowing prompt tokens to skip those later layers. Second, SwiftKV employs a
      lightweight knowledge-preserving distillation procedure that can adapt existing LLMs with minimal accuracy impact. Third,
      SwiftKV can naturally incorporate KV cache compression to improve inference performance in low-memory scenarios. Our
      comprehensive experiments show that SwiftKV can effectively reduce prefill computation by $25-50 \\%$ across several LLM
      families while incurring minimum quality degradation. In the end-to-end inference serving, SwiftKV realizes up to $2
      \\times$ higher aggregate throughput and $60 \\%$ lower time per output token. It can achieve a staggering 560 TFlops/GPU
      of normalized inference throughput, which translates to 16 K tokens/s for Llama-3.1-70B. SwiftKV is open-sourced at
      https://github . com/snowflakedb/arctictraining.\n\n\n## 1 Introduction\n\nLarge Language Models (LLMs) are now an
      integral enabler of enterprise applications and offerings, including code and data co-pilots (Chen et al., 2021; Pourreza
      and Rafiei, 2024), retrieval augmented generation (RAG) (Lewis et al., 2020; Lin et al., 2024), summarization (Pu et al.,
      2023; Zhang et al., 2024), and agentic workflows (Wang et al., 2024; Schick et al., 2023). However, the cost and speed of
      inference determine their practicality, and improving the throughput and latency of LLM inference has become increasingly
      important.\n\nWhile prior works, such as model pruning (Ma et al., 2023; Sreenivas et al., 2024), KV cache compression
      (Hooper et al., 2024; Shazeer, 2019; Ainslie et al., 2023b; Chang et al., 2024), and sparse attention (Zhao et al., 2024;
      Jiang et al., 2024), have been developed to accelerate LLM inference, they typically significantly degrade the model
      quality or work best in niche scenarios, such as lowmemory environments or extremely long contexts requests (e.g. $>100
      \\mathrm{~K}$ tokens). On the other hand, production deployments are often compute-bound rather than memory-bound, and
      such long-context requests are rare amongst diverse enterprise use cases (e.g. those observed at Snowflake).\n\nIn this
      paper, we take a different approach to improving LLM inference based on the key observation that typical enterprise
      workloads process more input tokens than output tokens. For example, tasks like code completion, text-to-SQL,
      summarization, and RAG each submit long prompts but produce fewer output tokens (a 10:1 ratio with average prompt length
      between 500 and 1000 is observed in our production). In these scenarios, inference throughput and latency are often
      dominated by the cost of prompt processing (i.e. prefill), and reducing this cost is key to improving their performance.
      \n\nBased on this observation, we designed SwiftKV, which improves throughput and latency by reducing the prefill
      computation for prompt tokens. SwiftKV (Fig. 1) consists of three key components:\n\nModel transformation. SwiftKV rewires
      an existing LLM so that the prefill stage during inference can skip a number of later transformer layers, and their KV
      cache are computed by the last unskipped layer. This is motivated by the observation that the hidden states of later
      layers do not change significantly (see Sec. 3.2 and (Liu et al., 2024b)). With SwiftKV, prefill compute is reduced by
      approximately the number of layers skipped.\n\nOptionally, for low-memory scenarios, we",
      "index": 0
    },
    ...
    {
      "content": "Efficient Distillation. Since only a few $\\mathbf{W}_{Q K V}$ parameters need training, we can keep just a
      single copy of the original model weights in memory that are frozen during training, and add an extra trainable copy of
      the $\\mathbf{W}_{Q K V}$ parameters for layers $>l$ initialized using the original model (See Fig. 1).\n\nDuring
      training, we create two modes for the later layers $>l$, one with original frozen parameters using original architecture,
      and another with the SwiftKV re-wiring using new QKV projections i.e.,\n\n$$\n\\begin{aligned}\n& \\mathbf{y}_{\\text
      {teacher }}=\\mathbf{M}(\\mathbf{x}, \\text { SwiftKV }=\\text { False }) \\\\\n& \\mathbf{y}_{\\text {student }}=\\mathbf
      {M}(\\mathbf{x}, \\text { SwiftKV }=\\text { True })\n\\end{aligned}\n$$\n\nwhere $\\mathbf{y}$ is the final logits,
      $\\mathbf{M}$ is the model, and $\\mathbf{x}$ is the input. Afterwards, we apply the standard distillation loss (Hinton et
      al., 2015) on the outputs. After the distillation, the original KV projection layers $>l$ are discarded during inference.
      \n\nThis method allows us to distill Llama-3.1-8BInstruct on 680 M tokens of data in 3 hours using 8 H100 GPUs, and
      Llama-3.1-70B-Instruct in 5 hours using 32 H100 GPUs across 4 nodes. In contrast, many prune-and-distill (Sreenivas et
      al., 2024) and layer-skipping (Elhoushi et al., 2024) methods require much larger datasets (e.g. 10-100B tokens) and incur
      greater accuracy gaps than SwiftKV.\n\n### 3.5 Optimized Implementation for Inference\n\nLLM serving systems can be
      complex and incorporate many simultaneous optimizations at multiple layers of the stack, such as PagedAttention (Kwon et
      al., 2023), Speculative Decoding (Leviathan et al., 2023), SplitFuse (Holmes et al., 2024; Agrawal et al., 2024), and
      more. A benefit of SwiftKV is that it makes minimal changes to the model architecture, so it can be integrated into
      existing serving systems without implementing new kernels (e.g. for custom attention operations or sparse computation) or
      novel inference procedures.\n\nImplementation in vLLM and SGLang. To show that the theoretical compute reductions of
      SwiftKV translates to real-world savings, we integrated it with vLLM (Kwon et al., 2023) and SGLang (Zheng et al., 2024).
      Our implementation is compatible with chunked prefill (Holmes et al., 2024; Agrawal et al., 2024), which mixes chunks of
      prefill tokens and decode tokens in each minibatch. During each forward pass, after completing layer $l$, the KV-cache for
      the remaining layers ( $>l$ ) are immediately computed, and only the decode tokens are propagated through the rest of the
      model layers.\n\n## 4 Main Results\n\nWe evaluated SwiftKV in terms of model accuracy (Sec. 4.1) compared to the original
      model and several baselines, and end-to-end inference performance (Sec. 4.2) in a real serving system.\n\nDistillation
      datasets. Our dataset is a mixture of Ultrachat (Ding et al., 2023), SlimOrca (Lian et al., 2023), and OpenHermes-2.5
      (Teknium, 2023), totaling roughly 680M Llama-3.1 tokens. For more details, please see Appendix A.1.\n\nSwiftKV Notation.
      For prefill computation, we report the approximate reduction as $(L-l) / L$ due to SwiftKV, and for KV cache, we report
      the exact memory reduction due to AcrossKV. For example, SwiftKV $(l=L / 2)$ and 4-way AcrossKV is reported as $50 \\%$
      prefill compute reduction and $37.5 \\% \\mathrm{KV}$ cache memory reduction.\n\n### 4.1 Model Quality Impact of
      SwiftKV\n\nTable 2 shows the quality results of all models we evaluated, including Llama-3.1-Instruct, Qwen2.
      5-14B-Instruct, Mistral-Small, and Deepseek-V2. Of these models, we note that the Llama models span two orders of
      magnitude in size (3B to 405B), Llama-3.1-405B-Instruct uses FP8 (W8A16) quantization, and Deepseek-V2-Lite-Chat is a
      mixture-of-experts model that implements a novel latent attention mechanism (DeepSeek-AI et al., 2024).\n\nWe also compare
      with three baselines: (1) FFN-SkipLLM (Jaiswal et al., 2024), a training-free method for skipping FFN layers (no attention
      layers are skipped) based on hidden state similarity, (2) Llama-3.1-Nemotron-51B-Instruct (Sreenivas et al., 2024), which
      is pruned and distilled from Llama-3.1-70B-Instruct using neural architecture search on 40B tokens, and (3) DarwinLM-8.4B
      (Tang et al., 2025), which is pruned and distilled from Qwen2.5-14B-Instruct using 10B tokens.\n\nSwiftKV. For Llama,
      Mistral, and Deepseek, we find the accuracy degradation for $25 \\%$ SwiftKV is less than $0.5 \\%$ from the original
      models (averaged across tasks). Additionally, the accuracy gap is within $1-2 \\%$ even at $40-50 \\%$ SwiftKV. Beyond $50
      \\%$ SwiftKV, model quality drops quickly. For example, Llama-3.1-8B-Instruct incurs a 7\\% accuracy gap at $62.5 \\%$
      SwiftKV. We find that Qwen suffers larger degradations, at $1.1 \\%$ for $25 \\%$ SwiftKV and $7.4 \\%$ for $50 \\%$
      SwiftKV, which may be due to Qwen models having lower simularity between layer at 50-75\\% depth (Fig. 2). Even still,
      SwiftKV",
      "index": 4
    },
    ...
  ]
}
```

### Table structure extraction example

This example demonstrates extracting structural layout, including a table, from a 10-K filing. The following shows the
rendered results for one of the processed pages (page index 28 in the JSON output).

| Page from the original document | Extracted Markdown rendered as HTML |
| --- | --- |
|  |  |

> **Tip:**
>
> To view either of the these images at a more legible size, select it by clicking or tapping.

The following is the SQL command to process the original document:

```sqlexample
SELECT AI_PARSE_DOCUMENT (
    TO_FILE('@docs.doc_stage','10K-example.pdf'),
    {'mode': 'LAYOUT', 'page_split': true}) AS sec_10k_example;
```

The response from AI_PARSE_DOCUMENT is a JSON object containing metadata and text from the pages of the document, like
the following. The results for all but the page previously shown have been omitted for brevity.

```output
{
  "metadata": {
    "pageCount": 53
  },
  "pages": [
    {
      "content": ...
      "index": 0
    },
    ....
    {
      "content": "# Key Operational and Business Metrics \n\nIn addition to the measures presented in our interim condensed
      consolidated financial statements, we use the following key operational and business metrics to evaluate our business,
      measure our performance, develop financial forecasts, and make strategic decisions.\n\n|  | Three Months Ended March 31, |  |
      \n| :--: | :--: | :--: |\n|  | 2025 | 2024 |\n| Ending Paid Connected Fitness Subscriptions ${ }^{(1)}$ | 2,880,176 | $3,051,
      451$ |\n| Average Net Monthly Paid Connected Fitness Subscription Churn ${ }^{(1)}$ | $1.2 \\%$ | $1.2 \\%$ |\n| Ending Paid
      App Subscriptions ${ }^{(1)}$ | 572,775 | 675,190 |\n| Average Monthly Paid App Subscription Churn ${ }^{(1)}$ | $8.1 \\%$ |
      $9.0 \\%$ |\n| Subscription Gross Profit (in millions) | \\$ 288.8 | \\$ 298.1 |\n| Subscription Contribution (in millions) $
      { }^{(2)}$ | \\$ 304.9 | \\$ 316.4 |\n| Subscription Gross Margin | $69.0 \\%$ | $68.1 \\%$ |\n| Subscription Contribution
      Margin ${ }^{(2)}$ | $72.9 \\%$ | $72.3 \\%$ |\n| Net loss (in millions) | \\$ $(47.7)$ | \\$ $(167.3)$ |\n| Adjusted EBITDA
      (in millions) ${ }^{(3)}$ | \\$ 89.4 | \\$ 5.8 |\n| Net cash provided by operating activities (in millions) | \\$ 96.7 | \\$
      11.6 |\n| Free Cash Flow (in millions) ${ }^{(4)}$ | \\$ 94.7 | \\$ 8.6 |\n\n[^0]\n## Ending Paid Connected Fitness
      Subscriptions\n\nEnding Paid Connected Fitness Subscriptions includes all Connected Fitness Subscriptions for which we are
      currently receiving payment (a successful credit card billing or prepaid subscription credit or waiver). We do not include
      paused Connected Fitness Subscriptions in our Ending Paid Connected Fitness Subscription count.\n\n## Average Net Monthly
      Paid Connected Fitness Subscription Churn\n\nTo align with the definition of Ending Paid Connected Fitness Subscriptions
      above, our quarterly Average Net Monthly Paid Connected Fitness Subscription Churn is calculated as follows: Paid Connected
      Fitness Subscriber \"churn count\" in the quarter, divided by the average number of beginning Paid Connected Fitness
      Subscribers each month, divided by three months. \"Churn count\" is defined as quarterly Connected Fitness Subscription
      churn events minus Connected Fitness Subscription unpause events minus Connected Fitness Subscription reactivations.\n\nWe
      refer to any cancellation or pausing of a subscription for our All-Access Membership as a churn event. Because we do not
      receive payment for paused Connected Fitness Subscriptions, a paused Connected Fitness Subscription is treated as a churn
      event at the time the pause goes into effect, which is the start of the next billing cycle. An unpause event occurs when a
      pause period elapses without a cancellation and the Connected Fitness Subscription resumes, and is therefore counted as a
      reduction in our churn count in that period. Our churn count is shown net of reactivations and our new quarterly Average Net
      Monthly Paid Connected Fitness Subscription Churn metric averages the monthly Connected Fitness churn percentage across the
      three months of the reported quarter.\n\n## Ending Paid App Subscriptions\n\nEnding Paid App Subscriptions include all App
      One, App+, and Strength+ subscriptions for which we are currently receiving payment.\n\n## Average Monthly Paid App
      Subscription Churn\n\nWhen a Subscriber to App One, App+, or Strength+ cancels their membership (a churn event) and
      resubscribes in a subsequent period, the resubscription is considered a new subscription (rather than a reactivation that is
      counted as a reduction in our churn count). Average Paid App Subscription Churn is calculated as follows: Paid App
      Subscription cancellations in the quarter, divided by the average number of beginning Paid App Subscriptions each month,
      divided by three months.\n\n\n[^0]:    (1) Beginning January 1, 2025, the Company migrated its subscription data model for
      reporting Ending Paid Connected Fitness Subscriptions, Average Net Monthly Paid Connected Fitness Subscription Churn, Ending
      Paid App Subscriptions, and Average Monthly Paid App Subscription Churn to a new data model that provides greater visibility
      to changes to a subscription's payment status when they occur. The new model gives the Company more precise and timely data
      on subscription pause and churn behavior. Prior period information has been revised to conform with current period
      presentation. The impact of this change in the model on Ending Paid Connected Fitness Subscriptions, Average Net Monthly
      Paid Connected Fitness Subscription Churn, Ending Paid App Subscriptions and Average Monthly Paid App Subscription Churn for
      the three months ended March 31, 2025 and 2024 is immaterial.\n    (2) Please see the section titled \"Non-GAAP Financial
      Measures—Subscription Contribution and Subscription Contribution Margin\" for a reconciliation of Subscription Gross Profit
      to Subscription Contribution and an explanation of why we consider Subscription Contribution and Subscription Contribution
      Margin to be helpful measures for investors.\n    (3) Please see the section titled \"Non-GAAP Financial Measures—Adjusted
      EBITDA\" for a reconciliation of Net loss to Adjusted EBITDA and an explanation of why we consider Adjusted EBITDA to be a
      helpful measure for investors.\n    (4) Please see the section titled \"Non-GAAP Financial Measures-Free Cash Flow\" for a
      reconciliation of net cash provided by (used in) operating activities to Free Cash Flow and an explanation of why we
      consider Free Cash Flow to be a helpful measure for investors.",
      "index": 28
    },
    ...
    {
      "content": "# CERTIFICATION OF PRINCIPAL FINANCIAL OFFICER PURSUANT TO 18 U.S.C. SECTION 1350, AS ADOPTED PURSUANT TO
      SECTION 906 OF THE SARBANES-OXLEY ACT OF 2002 \n\nI, Elizabeth F Coddington, Chief Financial Officer of Peloton Interactive,
      Inc. (the \"Company\"), do hereby certify, pursuant to 18 U.S.C. Section 1350, as adopted pursuant to Section 906 of the
      Sarbanes-Oxley Act of 2002, that to the best of my knowledge:\n\n1. the Quarterly Report on Form 10-Q of the Company for the
      fiscal quarter ended March 31, 2025 (the \"Report\") fully complies with the requirements of Section 13(a) or 15(d) of the
      Securities Exchange Act of 1934, as amended; and\n2. the information contained in the Report fairly presents, in all
      material respects, the financial condition, and results of operations of the Company.\n\nDate: May 8, 2025\n\nBy: /s/
      Elizabeth F Coddington\nElizabeth F Coddington\nChief Financial Officer\n(Principal Financial Officer)",
      "index": 52
    }
  ]
}
```

### Slide deck example

This example demonstrates extracting structural layout from a presentation. Below we show the rendered results for one of the processed slides (page index 17 in the JSON output).

| Slide from the original document | Extracted Markdown rendered as HTML |
| --- | --- |
|  |  |

> **Tip:**
>
> To view either of the these images at a more legible size, select it by clicking or tapping.

The following is the SQL command to process the original document:

```sqlexample
SELECT AI_PARSE_DOCUMENT (TO_FILE('@docs.doc_stage','presentation.pptx'),
    {'mode': 'LAYOUT' , 'page_split': true}) as presentation_output;
```

The response from AI_PARSE_DOCUMENT is a JSON object containing metadata and the text from the slides of the presentation,
like the following. The results for some slides have been omitted for brevity.

```output
{
  "metadata": {
    "pageCount": 38
  },
  "pages": [
    {
      "content": "\n\n# **SNOWFLAKE INVESTOR PRESENTATION**\n\nFirst Quarter Fiscal 2026\n\n© 2026 Snowflake Inc. All Rights Reserved",
      "index": 0
    },
    ...
    {
      "content": "# Our Consumption Model \n\n## Revenue Recognition Consumption\n\nSnowflake recognizes the substantial majority of its revenue as customers consume the platform\n\nPro: Enables faster growth\nPro: Aligned with customer value\nPro: Aligned with usage-based costs\nConsider: Revenue is variable based on customers' usage\n\n## Pricing Model Consumption\n\nThe platform is priced based on consumption of compute, storage, and data transfer resources\n\nPro: Customers don't pay for shelfware\n\nConsider: Performance improvements inherently reduce customer cost\n\n## Billings Terms Typically Upfront\n\nSnowflake typically bills customers annually in advance for their capacity contracts\n\nSome customers consume on-demand and/or are billed in-arrears\n\nPro: Bookings represent contractual minimum\n\nPro: Variable consumption creates upside for renewal cycle\n\nConsider: Payment terms are evolving",
      "index": 17
    },
    ...
    {
      "content": "\n\n# PRODUCT REVENUE\n\n## $996.8M + 26% YoY Growth\n\n## NET REVENUE RETENTION RATE\n\n## $124%\n\n## TOTAL CUSTOMERS\n\n## $1M+ CUSTOMERS\n\n## $0.5 + 27% YoY Growth\n\nCustomers with Trailing 12-Month Product Revenue Greater than $1M\n\n## FORBES GLOBAL 2000 CUSTOMERS\n\n## $754 + 4% YoY Growth\n\n## SNOWFLAKE MARKETPLACE LISTINGS\n\n## AI/ML ADOPTION\n\n## 5,200+ Accounts using Snowflake AI/ML\n\n## SNOWFLAKE AI DATA CLOUD\n\n### Unified Platform and Connected Ecosystem\n\n- **Data Engineering**\n- **Analytics**\n- **AI**\n- **Applications & Collaboration**\n\n### Fully Managed | Cross-Cloud | Interoperable | Secure | Governed\n\n1. For the three months ended April 30, 2025.\n2. As of April 30, 2025. Please see our Q1FY26 earnings press release for definitions of net revenue retention rate, customers with trailing 12-month product revenue greater than $1 million (which definition includes a description of our total customer count), and Forbes Global 2000 customers.\n3. As of April 30, 2025. Each live dataset, package of datasets, or data service published by a data provider as a single product offering on Snowflake Marketplace is counted as a unique listing. A listing may be available in one or more regions where Snowflake Marketplace is available.\n4. Adoption is based on capacity and on-demand accounts using Snowflake AI/ML features on a weekly basis via our internal classification. We take the average of the last 4 weeks of the quarter ended April 30, 2025.",
      "index": 36
    },
    {
      "content": "# THANK YOU\n\n",
      "index": 37
    }
  ]
}
```

### Multilingual document example

This example showcases AI_PARSE_DOCUMENT’s multilingual capabilities by extracting structural layout from a German
article. AI_PARSE_DOCUMENT preserves the reading order of the main text even when images and pull quotes are present.

| Page from the original document | Extracted Markdown rendered as HTML |
| --- | --- |
|  |  |

> **Tip:**
>
> To view either of the these images at a more legible size, select it by clicking or tapping.

The following is the SQL command to process the original document. Since the document has a single page,
you do not need page splitting for this example.

```sqlexample
SELECT AI_PARSE_DOCUMENT (TO_FILE('@docs.doc_stage','german_example.pdf'),
    {'mode': 'LAYOUT'}) AS german_article;
```

The response from AI_PARSE_DOCUMENT is a JSON object containing metadata and the text from the document,
like the following.

```output
{
  "metadata": {
    "pageCount": 1
  }
  "content": "\n\nSchulen haben es verdient, gute Orte zu sein. Hier sollen wir Wissen und Fähigkeiten
  erlernen, die uns durch das Leben tragen. Hier verbringen viele einen Großteil ihres Tages, und das in einer Lebensphase, in
  der sich Zeit beinahe grenzenlos und eine Doppelstunde wie ein halbes Leben anfühlen kann.\n\nOb es die Freundin ist, ohne die
  man auf dem Schulhof verloren wäre. Der Lehrer, mit dem man nicht klarkommt, den man aber trotzdem jeden Tag aushalten muss.
  Die Klassenfahrt, auf der man zum ersten Mal das Meer sieht und knutscht. In Schulen entstehen Erfahrungen, Beziehungen und
  Erinnerungen, die uns ein ganzes Leben prägen.\n\nDie Erwartungen an Schulen sind dementsprechend hoch. Trotzdem werden sie
  von der Gesellschaft schnell vergessen und von der Politik hinten angestellt. Seit Jahrzehnten kriegt das deutsche Schulsystem
  verheerende Zeugnisse.\n\nNoch immer entscheiden Bildungsgrad und Kontostand der Eltern darüber, welchen Schulabschluss Kinder
  und Jugendliche machen. Noch immer funktioniert es vielerorts nur auf dem Papier, dass alle gut zusammen lernen. Im Alltag
  fehlen dann die Lehrkräfte und Mittel, um zum Beispiel einen geflüchteten Jugendlichen oder einen mit ADHS so zu unterstützen,
  dass alle möglichst gleichberechtigt in einem Klassenraum sitzen. Auch die gesellschaftliche Einsicht, dass alle
  Schulabschlüsse ihren Wert haben und gebraucht werden, muss erst wieder zurückgewonnen werden.\n\nJetzt aber hoch mit
  euch!\nDass Schule so irre früh anfangen muss, ist kein Gesetz. Und auch gar nicht ratsam: Jugendliche haben einen anderen
  Biorhythmus und brauchen mehr Schlaf als Erwachsene. Ein Schulbeginn gegen 9 oder 10 Uhr wäre für die meisten besser, da ist
  sich die Forschung einig\n\nAn Schulen tritt die Realität sehr schnell ein. Während sich die Gesellschaft noch fragt, wie mit
  künstlicher Intelligenz umzugehen ist, nutzen sie Lehrkräfte, Schülerinnen und Schüler längst für ihre Zwecke. Während über
  Jahre diskutiert wurde, ob Deutschland ein Einwanderungsland sei, war es das an Schulen längst. Und während andere Themen den
  Klimawandel in der Öffentlichkeit verdrängen, sind es besonders Schülerinnen und Schüler, die laut auf das drängendste Problem
  unserer Zeit hinweisen. Die Herausforderungen und Fragen, die sich an Schulen stellen, betreffen uns alle. Schule ist Zukunft.
  \n\nSchulleitungen, Lehrkräfte, pädagogisches Personal und alle, die sich sonst noch um das Gelingen des Schulalltags kümmern,
  stellen sich dem jeden Tag aufs Neue. Sie versuchen, Schule trotz vieler Probleme und fehlender Wertschätzung zu gestalten,
  sie versuchen, den Schülerinnen und Schülern zu vermitteln, dass es auf sie ankommt. Damit sie selbst an sich glauben. Sie
  haben es verdient.",
}
```

Snowflake Cortex can produce a translation to any supported language (English, language code `'en'`, in this case) as follows:

```sqlexample
SNOWFLAKE.CORTEX.TRANSLATE (ger_example, '', 'en') from german_article;
```

The translation is as follows:

```output
"Schools deserve to be good places. Here, we are supposed to learn knowledge and skills that will carry us through life. Many
spend a large part of their day here, and this is during a phase of life when time can seem almost endless and a double period
can feel like half a lifetime.

Whether it's the friend you would be lost without in the schoolyard. The teacher you can't get along with, but still have to
endure every day. The class trip where you see the sea for the first time and make out. In schools, experiences,
relationships, and memories are created that shape us for a lifetime.

The expectations for schools are correspondingly high. Nevertheless, they are quickly forgotten by society and pushed to the
back by politics. For decades, the German school system has been receiving devastating reports.

Even now, the level of education and the financial status of the parents still determine which school certificate children and
young people receive. It still only works on paper that everyone learns well together. In everyday life, the teachers and
resources are lacking to support, for example, a refugee youth or a student with ADHD so that they can sit in a classroom on
an equal footing. The societal insight that all school certificates have value and are needed also needs to be regained.

Now, let's get going!

The fact that school has to start so early is not a law. And it's not advisable either: teenagers have a different biological
rhythm and need more sleep than adults. A start time of 9 or 10 o'clock would be better for most, research agrees.

Reality sets in very quickly at schools. While society is still wondering how to deal with artificial intelligence, teachers,
students, and pupils are already being used for their purposes. While it was debated for years whether Germany is an
immigration country, it has been one in schools for a long time. And while other topics are pushing climate change out of the
public eye, it is especially students who are loudly pointing out the most pressing problem of our time. The challenges and
questions that schools face affect us all. School is the future.

School administrations, teachers, educational staff, and all those who take care of the success of everyday school life face
this every day. They try to shape school despite many problems and lack of appreciation, they try to convey to the students
that it's up to them. So that they believe in themselves. They deserve it."
```

### Using OCR mode

OCR mode extracts text from scanned documents, such as screenshots or PDFs containing images of text.
It does not preserve layout.

```sqlexample
SELECT AI_PARSE_DOCUMENT(
  TO_FILE( '@docs.doc_stage', 'document_1.pdf' ),
  { 'mode': 'OCR' } ) AS OCR;
```

Output:

```output
{
  "content": "content of the document"
}
```

### Process only certain pages of a document

This example demonstrates using the `page_filter` option to extract specific pages from a document, specifically
the first page of a 55-page research paper. Keep in mind that page indexes starts at 0 and ranges are inclusive of
the start value but exclusive of the end value. For example, `start: 0, end: 1` returns only the first page (index 0).

```sqlexample
SELECT AI_PARSE_DOCUMENT(
  TO_FILE('@my_documents', 'ResearchArticle.pdf'),
  {'mode': 'LAYOUT', 'page_filter': [{'start': 0, 'end': 1}]} );
```

Result:

```output
{
  "metadata": {
    "pageCount": 55
  },
  "pages": [
    {
      "content": "# The Critical Role of Strength Training in Lifelong Health: Evidence-Based
      Benefits and Implementation Strategies \n\n\n#### Abstract\n\nBackground: Strength training
      has emerged as one of the most powerful interventions for promoting health across the
      lifespan. This comprehensive review examines the extensive evidence supporting strength
      training's role in preventing chronic disease, maintaining functional independence, and
      enhancing quality of life.\n\nMethods: We conducted a systematic review of peer-reviewed
      literature published between 2018-2024, analyzing 127 studies involving over 45,000
      participants across various populations.\n\nResults: Regular resistance exercise provides
      cardiovascular benefits ( $15-20 \\%$ reduction in heart disease risk), metabolic improvements
      ( $12-18 \\%$ better insulin sensitivity), cognitive enhancements ( $25 \\%$ slower
      cognitive decline), and psychological well-being improvements. Strength training increases
      bone mineral density by $1-3 \\%$ annually and reduces fall risk by up to $40 \\%$ in older
      adults.\n\nConclusions: Current guidelines recommend at least two sessions per week targeting
      all major muscle groups. Implementation of strength training programs should be considered a
      public health priority given the substantial evidence for disease prevention and health
      promotion.\n\n\nKeywords: resistance training, muscle strength, bone density, chronic
      disease prevention, healthy aging, exercise prescription\n\n## Introduction\n\nThe human
      musculoskeletal system is designed for regular mechanical loading and progressive challenge.
      Throughout evolutionary history, our ancestors engaged in strength-demanding activities
      essential for survival, maintaining robust muscle mass and bone density well into advanced age.
      However, the modern sedentary lifestyle has created an unprecedented mismatch between our
      biological needs and daily activities, contributing to rising rates of sarcopenia,
      osteoporosis, and metabolic dysfunction.\n\nStrength training, also known as resistance
      training or weight training, represents a targeted intervention that can address many
      contemporary health challenges. Unlike aerobic exercise alone, resistance training provides
      unique physiological adaptations that are essential for long-term health and functional
      independence. The World Health Organization now recognizes strength training as a fundamental
      component of physical activity guidelines for all adults.\n\nKey Statistics: Only 31\\% of
      adults meet strength training recommendations, despite evidence showing $20-30 \\%$ reductions
      in all-cause mortality among regular participants.\n\n## Physiological Mechanisms and
      Adaptations\n\n## Musculoskeletal Benefits\n\nStrength training stimulates muscle protein
      synthesis through mechanistic target of rapamycin (mTOR) pathway activation, leading to
      increased muscle fiber size and improved neuromuscular coordination. Research demonstrates
      that adults can increase muscle mass by $2-4 \\%$ per month during initial training phases,
      with continued improvements possible throughout life.\n\nBone tissue responds to mechanical
      loading through osteoblast activation and increased bone formation. Weight-bearing resistance
      exercises create piezoelectric effects that stimulate osteocyte networks, resulting in
      improved bone mineral density and reduced fracture risk. Studies show 1-3\\% annual",
      "index": 0
    }
  ]
}
```

### Classify multiple documents

To classify multiple documents, first create a table of the files by retrieving the document locations from a
directory, converting these locations to FILE objects.

```sqlexample
CREATE TABLE documents_table AS
  (SELECT TO_FILE('@my_documents', RELATIVE_PATH)
    AS docs FROM DIRECTORY(@my_documents));
```

Then apply AI_PARSE_DOCUMENT to each document in the table and process the results, for example by passing them
to AI_CLASSIFY to categorize the documents by type. This is an efficient approach to batch document analysis in a
document collection.

```sqlexample
WITH single_page_extraction as (
  SELECT
  TO_VARCHAR (AI_PARSE_DOCUMENT(docs, {'mode': 'LAYOUT',
    'page_filter': [{'start': 0, 'end': 1}]} )) AS first_page FROM documents_table)
SELECT AI_CLASSIFY(
  first_page,
  ['health', 'fitness','economics', 'science', 'psychology' ,'sociology','statistics', 'finance', 'Artificial Intelligence', 'Analytics'],
  {'output_mode': 'multi'} ) as article_classification
FROM single_page_extraction;
```

The query returns classification labels for each document.

```output
{ "labels": [ "health", "psychology", "science" ] }
{ "labels": [ "fitness", "health", "science" ] }
{ "labels": [ "Analytics", "Artificial Intelligence" ] }
{ "labels": [ "finance", "Analytics" ] }

..

{ "labels": [ "finance" ] }
{ "labels": [ "Artificial Intelligence", "science" ] }
{ "labels": [ "Artificial Intelligence", "science" ] }
{ "labels": [ "fitness", "health", "science" ] }
```

## Input requirements

AI_PARSE_DOCUMENT is optimized for documents both digital-born and scanned. The following table lists the limitations and
requirements of input documents:

|  |  |
| --- | --- |
| Maximum file size | 100 MB |
| Maximum pages per document | 500 |
| Maximum page resolution | * 10000 x 10000 pixels * 33.3 x 33.3 inches (at 300 DPI) * 2400 x 2400 pts (at 300 DPI) |
| Supported file type | PDF, PPTX, DOCX, JPEG, JPG, PNG, TIFF, TIF, HTML, TXT |
| Stage encryption | Server-side encryption |
| Font size | 8 points or larger for best results |

## Supported document features and limitations

|  |  |
| --- | --- |
| Page orientation | AI_PARSE_DOCUMENT automatically detects page orientation. |
| Page splitting | AI_PARSE_DOCUMENT can split multi-page documents into individual pages and parse each separately. This is useful for processing large documents that exceed the maximum size. |
| Page filtering | AI_PARSE_DOCUMENT can process some of the pages in a document, instead of all of them, by specifying page ranges. This is useful when you know what pages the information you’re looking for is on. |
| Characters | AI_PARSE_DOCUMENT detects the following characters:   * a-z * A-Z * 0-9 * À Á Â Ã Ä Å Æ Ç È É Ê Ë Ì Í Î Ï Ð Ñ Ò Ó Ô   Õ Ö Ø Ù Ú Û Ü Ý Þ ß à á â ã ä å æ ç è é ê   ë ì í î ï ð ñ ò ó ô õ ö ÷ ø ù ú û ü ý þ ÿ   Ą ą Ć ć Č č Đ đ Ę ę ı Ł ł Ń ń ō Œ œ Ś ś Š   š Ÿ Ź ź Ż ż Ž ž ʒ β δ ε з Ṡ * ! “ # $ % & ‘ ( ) \* + , - . / : ; < = > ?   @ [ ] ^ _ ` { | } ~ ¡ ¢ £ ¥ § © ª « ­ ® ¯   ° ± ² ³ ´ µ ¶ · º » ¿ ‘ † ‡ • ‣ ⁋ ₣ ₤ ₦   ₩ € ₭ ₹ ™ ← ↑ → ↓ ↔ ↕ ↖ ↗ ↘ ↙ ↰ ↱ ↲ ↳ ↴   ↵ |
| Images | AI_PARSE_DOCUMENT generates markup for images in the document, but does not currently extract the actual images. |
| Structured elements | AI_PARSE_DOCUMENT automatically detects and extracts tables and forms. |
| Fonts | AI_PARSE_DOCUMENT recognizes text in most serif and sans-serif fonts, but may have difficulty with decorative or script fonts. The function does not recognize handwriting. |

### Supported languages

AI_PARSE_DOCUMENT is trained for the following languages:

| OCR Mode | LAYOUT Mode |
| --- | --- |
| * English * French * German * Italian * Norwegian * Polish * Portuguese * Spanish * Swedish | * Chinese * English * French * German * Hindi * Italian * Portuguese * Romanian * Russian * Spanish * Turkish * Ukrainian |

## Regional availability

Support for AI_PARSE_DOCUMENT is available to accounts in the following Snowflake regions:

| AWS | Azure | Google Cloud Platform |
| --- | --- | --- |
| US West 2 (Oregon) | East US 2 (Virginia) | US Central 1 (Iowa) |
| US East (Ohio) | West US 2 (Washington) |  |
| US East 1 (N. Virginia) | Europe (Netherlands) |  |
| Europe (Ireland) |  |  |
| Europe Central 1 (Frankfurt) |  |  |
| Europe West 2 (London) |  |  |
| Asia Pacific (Sydney) |  |  |
| Asia Pacific (Tokyo) |  |  |

AI_PARSE_DOCUMENT has cross-region support in other Snowflake regions. For information on enabling Cortex AI cross-region support, see [Cross-region inference](cross-region-inference.md).

## Access control requirements

To use the AI_PARSE_DOCUMENT function, a user with the ACCOUNTADMIN role must grant the SNOWFLAKE.CORTEX_USER database role to the user who
will call the function. See [Cortex LLM privileges](aisql.md) topic for details.

## Cost considerations

The Cortex AI_PARSE_DOCUMENT function incurs compute costs based on the number of pages per document processed. The following describes how pages are counted for different file formats:

* For paged file formats (PDF, DOCX), each page in the document is billed as a page.
* For image file formats (JPEG, JPG, TIF, TIFF, PNG), each individual image file is billed as a page.
* For HTML and TXT files, each chunk of 3,000 characters is billed as a page, including the last chunk, which may be less than 3,000 characters.

Snowflake recommends executing queries that call the Cortex AI_PARSE_DOCUMENT function in a smaller warehouse (no larger
than MEDIUM). Larger warehouses do not increase performance.

## Error conditions

Snowflake Cortex AI_PARSE_DOCUMENT can produce the following error
messages:

| Message | Explanation |
| --- | --- |
| `Document contains language that is not supported.` | Input document contains unsupported language. |
| `The provided file format {file_extension} isn't supported. Supported formats: .['.docx', '.pptx', '.pdf'].` | The document is in unsupported format. |
| `The provided file format .bin isn't supported. Supported formats: ['.docx', '.pptx', '.pdf']. Ensure the file is stored with server-side encryption.` | The file format is not supported and understood as a binary file. |
| `Maximum number of 500 pages exceeded. The document has {actual_pages} pages.` | The document exceeds the 500-page limit. |
| `Page size in pixels exceeds 10000x10000. The page size is {actual_px} pixels.` | Image input or a converted document page is larger than the supported dimensions. |
| `Page size in inches exceeds 50x50 (3600x3600 pt). The page size is {actual_in} inches ({actual_pt} pt).` | Page is larger than the supported dimensions. |
| `Maximum file size of 104857600 bytes exceeded. The file size is {actual_size} bytes.` | The document is larger than 100 MB. |
| `Provided file cannot be found.` | The file does not exist. |
| `Provided file cannot be accessed.` | The file can’t be accessed due to insufficient privileges. |
| `The Parse Document function did not respond in the allowed time.` | Timeout occurred. |
| `Internal error.` | System error occurred. Wait and try again. |

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Provisioned Throughput
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/provisioned-throughput.md
section: Snowflake Cortex (AI & ML)
---

# Provisioned Throughput

## Overview

Use Provisioned Throughput to reserve throughput for managed inference on Snowflake Cortex.
You specify throughput size as provisioned throughput units (PTU), and Cortex allocates the required capacity for a one-month term.
You can use the PTUs in your REST API calls for a consistent end-user experience. The functionality is available for the following models in the AWS and Azure Clouds:

* Mistral Large 2
* Llama 3.1-405B
* Llama 3.1-70B
* Llama 3.1-8B
* Snowflake-Llama3.3-70B
* Snowflake-Llama3.3-405B

## Access control requirements

Users must use a role that has been granted the SNOWFLAKE.CORTEX_USER database role with USAGE privilege on the PT ID.
For more information about this privilege, see Privileges.

### Privileges

The following sections describe the privileges required to create, manage, and use provisioned throughput.

#### Creating a provisioned throughput

To create a provisioned throughput, you must use a role that has been granted the account-level CREATE PROVISIONED THROUGHPUT privilege.
By default, ACCOUNTADMIN is the only role that can create the provisioned throughput.
You can use the ACCOUNTADMIN role to grant the CREATE PROVISIONED THROUGHPUT privilege to another role.

Use the following SQL command to grant the privilege to create a provisioned throughput:

```sqlexample
GRANT CREATE PROVISIONED THROUGHPUT ON ACCOUNT TO ROLE <role>
```

Provisioned Throughput is a schema-level object.
A role with the CREATE PROVISIONED THROUGHPUT privilege can create a provisioned throughput in any schema where it has the USAGE privilege.

The role that you used to create the provisioned throughput is automatically granted the OWNERSHIP privilege on the provisioned throughput.
The OWNERSHIP privilege allows you to rename or drop the provisioned throughput.

#### Giving roles the privilege to use a provisioned throughput

Grant roles with the USAGE privilege on the provisioned throughput. The USAGE privilege provides roles with ability to make REST API or SQL calls with a provisioned throughput ID.

The following SQL command grants the USAGE privilege on a provisioned throughput:

```sqlexample
GRANT USAGE ON PROVISIONED THROUGHPUT <pt_id> TO ROLE <role>
```

#### Using a provisioned throughput

A role with USE or OWNERSHIP privilege on a provisioned throughput can use the provisioned throughput for inference.
For information about the privileges required to use a provisioned throughput, see [Provisioned Throughput privileges](../security-access-control-privileges.md).

## Minimum Provisioned Throughput Unit requirements

Provisioned Throughput is subject to minimum and incremental PTU requirements. Each model or feature in the Minimum PTUs column shows the minimum number of PTUs that you must request. If you request fewer PTUs than the minimum, your request is rejected.

If you need more throughput than the minimum PTUs offer for the model, you need additional PTUs. The Increment PTUs column shows the PTU increments in excess of the Minimum PTUs that you can request.
Requests must specify PTUs such that the amount exceeding the minimum is a whole integer multiple of the increment; otherwise, the request is rejected.

The table below lists the available models, the minimum PTUs required for each model, and the increment requirements for additional PTUs beyond the minimum.

Provisioned Throughput - Complete REST API

| Model | Minimum PTUs | Increment PTUs |
| --- | --- | --- |
| Mistral Large 2 | 256 | 128 |
| Llama 3.1-405B | 512 | 256 |
| Llama 3.1-70B | 128 | 64 |
| Llama 3.1-8B | 64 | 32 |
| Snowflake-Llama3.3-70B | 128 | 64 |
| Snowflake-Llama3.3-405B | 512 | 256 |

## Determining PTU size

The PTUs required for your application depend on the workload profile.
For example, on Llama 3.1-8B, a workload with 500 requests per minute (RPM) and 500 tokens per request output has a minimum of 64 PTUs.
It delivers 960K tokens of throughput per minute. If you need more throughput, you can request additional PTUs in increments of 32.

When you’re starting out, you can use the minimum PTUs for the model and add increments as needed.

## Cost considerations

For the duration of your Provisioned Throughput term, you consume Credits per PTU per hour at the rate listed in the [Snowflake Credit Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). You incur charges for the allocated PTUs regardless of your actual usage during the term. The term starts and ends at 8:00 a.m. PT for the dates provided in the provisioned throughput creation.

Provisioned Throughput does not renew automatically. To reserve throughput for another term, see the following section.

## Reserving throughput

This tutorial guides you through the process of reserving and using provisioned throughput in a REST API call for the Cortex COMPLETE function.

### Step 1: Create a provisioned throughput ID

To get started with provisioned throughput, use SQL to create a request with the following information:

* The cloud provider
* The model
* The number of PTUs
* The start of the term (period of the provisioned throughput’s availability)
* The end of the term (period of the provisioned throughput’s availability)

The following examples create the `my_pt` provisioned throughput resource on AWS, specifying the model `llama3.1-8B`, allocating 64 provisioned throughput units (PTUs) from April 15, 2025, to May 15, 2025.

```sqlexample
CREATE PROVISIONED THROUGHPUT my_pt CLOUD_PROVIDER='aws', MODEL='llama3.1-8B', PTUS=64, TERM_START='2025-04-15' TERM_END='2025-05-15'
```

The provisioned throughput ID (PT ID) is in the response.

### Step 2: Open a support case to allocate the provisioned throughput

After you create an ID, create a support ticket with Snowflake Support to enable Provisioned Throughput.
In the ticket, provide your [Account identifiers](../admin-account-identifier.md) and the PT ID. We recommend creating the ticket seven business days before the start of the term to ensure that the throughput is reserved when needed.

### Step 3: Check the status of the provisioned throughput

After you create the support ticket, you can check on the status of the provisioned throughput using the following command.

```sqlexample
DESCRIBE PROVISIONED THROUGHPUT my_pt
```

This command returns one of the following states:

* REQUESTED: PT request received, but capacity not allocated yet.
* APPROVED: PT is enabled and will be ACTIVE on the specified start date.
* ACTIVE: PT is now available for use.
* EXPIRED: PT is no longer available for use or was not enabled before the term start.

### Step 4: Use the Provisioned Throughput ID in your REST API calls

After the PT is in the ACTIVE state, you can use it in your [AI_COMPLETE](../../sql-reference/functions/ai_complete.md) REST API calls. To use the provisioned throughput in the inference request, specify the PT ID in the API call.
Using provisioned throughput in the request doesn’t change the behavior of the API.

The following example shows how to use the PT ID in a COMPLETE REST API call:

```bash
curl --location 'https://some-account-identifier.snowflakecomputing.com/api/v2/cortex/inference:complete' \
--header 'X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT' \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header 'Authorization: ••••••' \
--data '{
  "model": "snowflake-llama-3.1-8b",
  "messages": [
  {
      "content": "Write an essay on the benefits of provisioned throughput."
  }
  ],
  "provisioned_throughput_id": "f3a27d60-f61f-4247-8aa3-6272ea0d7a8d"
}'
```

> **Note:**
>
> The role that you use to make the REST API call must have the USE privilege on the provisioned throughput ID.
> For more information about the required privileges, see [Provisioned Throughput privileges](../security-access-control-privileges.md).

### Termination

The provisioned throughput stops processing inference requests after the term expires.
If you’re using Provisioned Throughput for API requests after the term expires, you must create a new Provisioned Throughput ID and use it in your requests.

---
title: Query a Cortex Search Service
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/query-cortex-search-service.md
section: Snowflake Cortex (AI & ML)
---

# Query a Cortex Search Service

When you create a Cortex Search Service, the system provisions an API endpoint to serve queries at low latency.
You can use three APIs for querying a Cortex Search Service:

* The Python API
* The REST API
* The SQL SEARCH_PREVIEW Function

## Parameters

All APIs support the same set of query parameters:

|  | Parameter | Description |
| --- | --- | --- |
| Required | `query` | The search query, to be searched for in the text column in the service. |
| Optional | `columns` | A comma-separated list of columns to return for each relevant result in the response. These columns must be included in the source query for the service.  If this parameter is not provided, only the search column is returned in the response. |
|  | `filter` | A filter object for filtering results based on data in the `ATTRIBUTES` columns. See Filter syntax for syntax. |
|  | `scoring_config` | Configuration object for customizing search ranking behavior. See [Customizing Cortex Search scoring](cortex-search-customize-scoring.md) for syntax. |
|  | `scoring_profile` | The named scoring profile to be used with the query, previously defined with [ALTER CORTEX SEARCH SERVICE … ADD SCORING PROFILE](../../../sql-reference/sql/alter-cortex-search.md). If `scoring_profile` is provided, any `scoring_config` provided is ignored. |
|  | `limit` | Maximum number of results to return in the response, up to 1000. The default limit is 10. |

### Multi-index search parameters

In addition, the SQL and Python APIs support [multi-index queries](cortex-search-overview.md). Using multi-index parameters allows for refining results from Cortex Search and reducing query cost by limiting the number of columns searched.

| Parameter | Description |
| --- | --- |
| `multi_index_query` | The map used to determine which indexes to query. Each key in the map is the name of an indexed column, and each value is an array containing maps that define the query:   * If the index is a text index or a managed vector index, the query array can contain:    + Text queries: `{"text": "search_text"}`   + Vector queries, as an embedding vector: `{"vector": [vector_values]}` * If the index is a user-provided vector embedding column, the query array can contain:    + If a `query_model` was specified at creation time for automatic embeddings, text queries: `{"text": "search_text"}`.   + Vector queries, as an embedding vector: `{"vector": [vector_values]}` |

> **Note:**
>
> Multi-index Cortex Search services can still be searched through the REST API or without the `multi_index_query` parameter. This causes an unrestricted search over *all* indexed columns, which affects query cost. For details on estimating cost for multi-index query compute, see [Understanding cost for Cortex Search Services - Multi-index search](cortex-search-costs.md).

## Syntax

Simple queries to a Cortex Search Service use the following syntax:

PythonREST APISQL

```python
import os
from snowflake.core import Root
from snowflake.snowpark import Session

# connect to Snowflake
CONNECTION_PARAMETERS = { ... }
session = Session.builder.configs(CONNECTION_PARAMETERS).create()
root = Root(session)

# fetch service
my_service = (root
    .databases["<service_database>"]
    .schemas["<service_schema>"]
    .cortex_search_services["<service_name>"]
)

# query service
resp = my_service.search(
    query="<query>",
    columns=["<col1>", "<col2>"],
    filter={"@eq": {"<column>": "<value>"} },
    limit=5
)
print(resp.to_json())
```

```shell
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "<search_query>",
  "columns": ["col1", "col2"],
  "filter": <filter>,
  "limit": <limit>
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'my_search_service',
      '{
         "query": "preview query",
         "columns":[
            "col1",
            "col2"
         ],
         "filter": {"@eq": {"col1": "filter value"} },
         "limit":10
      }'
  )
)['results'] as results;
```

### Multi-index query syntax

Querying specific indices only or using a service with vector embeddings for a multi-index Cortex Search service uses the following syntax:

PythonSQL

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.configs( {...} ).create()
root = Root(session)

my_service = (root
  .databases["<service_database>"]
  .schemas["<service_schema>"]
  .cortex_search_services["<service_name>"]
)

resp = my_service.search(
    multi_index_query={
        "<index_name>": [
            {"text": "<search_text>"},
            {"vector": [<vector_values>]},
            ...
        ],
        ...
    },
    scoring_config={
        "weights": {
            "texts": <text_weight>,
            "vectors": <vector_weight>,
            "reranker": <reranker_weight>
        },
        "functions": {
            "vector_boosts": [
                {"weight": <weight>, "column": "<vector_column_name>"},
                ...
            ],
            "text_boosts": [
                {"weight": <weight>, "column": "<text_column_name>"},
                ...
            ]
        }
    },
    columns=["<column_name>", "<column_name>", ...],
    limit=<limit>
)
```

```sqlexample
SELECT SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      '<service_name>',
      '{
        "multi_index_query": {
          "<index_name>": [
            {"text": "<search_text>"},
            {"vector": [<vector_values>]},
            ...
          ],
          ...
        },
        "columns": ["<column_name>", "<column_name>", ...],
        "limit": <limit>,
        "scoring_config": {
          "weights": {
            "texts": <text_weight>,
            "vectors": <vector_weight>,
            "reranker": <reranker_weight>
          },
          "functions": {
            "vector_boosts": [
              {"weight": <weight>, "column": "<vector_column_name>"},
              ...
            ],
            "text_boosts": [
              {"weight": <weight>, "column": "<text_column_name>"}
              , ...
            ]
          }
        }
      }'
  );
```

## Setup and authentication

### Python API

Cortex Search Services may be queried using version 0.8.0 or later of the Snowflake Python APIs. See
[Snowflake Python APIs: Managing Snowflake objects with Python](../../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for more information on the Snowflake
Python APIs.

#### Install the Snowflake Python API library

First, install the latest version of the Snowflake Python APIs package from PyPI.
See [Install the Snowflake Python APIs library](../../../developer-guide/snowflake-python-api/snowflake-python-installing.md)
for instructions on installing this package from PyPI.

```none
pip install snowflake -U
```

#### Connect to Snowflake

Connect to Snowflake using either a Snowpark `Session` or a Python Connector `Connection` and create a
`Root` object. See [Connect to Snowflake with the Snowflake Python APIs](../../../developer-guide/snowflake-python-api/snowflake-python-connecting-snowflake.md) for more instructions on connecting
to Snowflake. The following example uses the Snowpark `Session` object and a Python dictionary for
configuration.

```python
import os
from snowflake.core import Root
from snowflake.snowpark import Session

CONNECTION_PARAMETERS = {
    "account": os.environ["snowflake_account_demo"],
    "user": os.environ["snowflake_user_demo"],
    "password": os.environ["snowflake_password_demo"],
    "role": "test_role",
    "database": "test_database",
    "warehouse": "test_warehouse",
    "schema": "test_schema",
}

session = Session.builder.configs(CONNECTION_PARAMETERS).create()
root = Root(session)
```

> **Note:**
>
> Version 0.8.0 or later of the Snowflake Python APIs library is required to query a Cortex Search Service.

### REST API

Cortex Search exposes a REST API endpoint in the suite of [Snowflake REST APIs](../../../developer-guide/snowflake-rest-api/snowflake-rest-api.md). The REST endpoint
generated for a Cortex Search Service is of the following structure:

```none
https://<account_url>/api/v2/databases/<db_name>/schemas/<schema_name>/cortex-search-services/<service_name>:query
```

Where:

* `<account_url>`: Your Snowflake Account URL. See [Finding the organization and account name for an account](../../admin-account-identifier.md) for instructions on finding your account URL.
* `<db_name>`: Database in which the service resides.
* `<schema_name>`: Schema in which the service resides.
* `<service_name>`: Name of the service.
* `:query`: The method to invoke on the service; in this case, the `query` method.

For additional details, see the REST API reference for [Cortex Search Service](https://docs.snowflake.com/developer-guide/snowflake-rest-api/reference/cortex-search-service).

#### Authentication

Snowflake REST APIs support authentication via programmatic access tokens (PATs), key pair authentication
using JSON Web Tokens (JWTs), and OAuth. For details, see
[Authenticating Snowflake REST APIs with Snowflake](../../../developer-guide/snowflake-rest-api/authentication.md).

### SQL SEARCH_PREVIEW function

The [SNOWFLAKE.CORTEX.SEARCH_PREVIEW](../../../sql-reference/functions/search_preview-snowflake-cortex.md) function allows you to preview the
results of individual queries to a Cortex Search Service from within a SQL environment such as a worksheet or Snowflake notebook cell.
This function makes it easy to interactively validate that a service has populated correctly and is serving reasonable results.

> **Important:**
> > The `SEARCH_PREVIEW` function is provided for testing and validation of Cortex Search Services.
> > It is not intended for serving search queries in an end-user application.
>
> * The function operates only on string literals. It does not accept batch text data.
> * The function has higher latency than the REST and Python APIs..

## Filter syntax

Cortex Search supports filtering on the ATTRIBUTES columns specified in the
[CREATE CORTEX SEARCH SERVICE](../../../sql-reference/sql/create-cortex-search.md) command.

Cortex Search supports five matching operators:

* [TEXT](../../../sql-reference/data-types-text.md) or [NUMERIC](../../../sql-reference/data-types-numeric.md) equality: `@eq`
* [ARRAY](../../../sql-reference/data-types-semistructured.md) contains: `@contains`
* [NUMERIC](../../../sql-reference/data-types-numeric.md) or [DATE/TIMESTAMP](../../../sql-reference/data-types-datetime.md) greater than or equal to: `@gte`
* [NUMERIC](../../../sql-reference/data-types-numeric.md) or [DATE/TIMESTAMP](../../../sql-reference/data-types-datetime.md) less than or equal to: `@lte`
* [Primary key](cortex-search-overview.md) equality: `@primarykey`

These matching operators can be composed with various logical operators:

* `@and`
* `@or`
* `@not`

### Usage notes

* Matching against `NaN` (‘not a number’) values in the source query is handled as described in
  [Special values](../../../sql-reference/data-types-numeric.md).
* Fixed-point numeric values with more than 19 digits (not including leading zeroes) do not work with `@eq`,
  `@gte`, or `@lte` and will not be returned by these operators (although they could still be returned by the
  overall query with the use of `@not`).
* `TIMESTAMP` filters accept values of the form: `YYYY-MM-DDTHH:MM:SS.sss+HH:MM`. If the timezone offset is not specified, the date is interpreted in UTC.
* `DATE` filters accept values of the form `YYYY-MM-DD`. If time or timezones are specified, they will be truncated.
* `@primarykey` is only supported for services configured with a [primary key](../../../sql-reference/constraints-overview.md). The value of the filter must be
  a JSON object mapping every primary key column to its corresponding value (or `NULL`).

These operators can be combined into a single filter object.

### Examples

* Filtering on rows where string-like column `string_col` is equal to value `value`.

  ```json
  { "@eq": { "string_col": "value" } }
  ```
* Filtering to a row with the specified primary key values `us-west-1` in the `region` column and `abc123` in the `agent_id` column:

  ```json
  { "@primarykey": { "region": "us-west-1", "agent_id": "abc123" } }
  ```
* Filtering on rows where ARRAY column `array_col` contains value `value`.

  ```json
  { "@contains": { "array_col": "arr_value" } }
  ```
* Filtering on rows where NUMERIC column `numeric_col` is between 10.5 and 12.5 (inclusive):

  ```json
  {
    "@and": [
      { "@gte": { "numeric_col": 10.5 } },
      { "@lte": { "numeric_col": 12.5 } }
    ]
  }
  ```
* Filtering on rows where TIMESTAMP column `timestamp_col` is between `2024-11-19` and `2024-12-19`
  (inclusive).

  ```json
  {
    "@and": [
      { "@gte": { "timestamp_col": "2024-11-19" } },
      { "@lte": { "timestamp_col": "2024-12-19" } }
    ]
  }
  ```
* Composing filters with logical operators:

  ```json
  // Rows where the "array_col" column contains "arr_value" and the "string_col" column equals "value"
  {
    "@and": [
      { "@contains": { "array_col": "arr_value" } },
      { "@eq": { "string_col": "value" } }
    ]
  }

  // Rows where the "string_col" column does not equal "value"
  {
    "@not": { "@eq": { "string_col": "value" } }
  }

  // Rows where the "array_col" column contains at least one of "val1", "val2", or "val3"
  {
    "@or": [
      { "@contains": { "array_col": "val1" } },
      { "@contains": { "array_col": "val2" } },
      { "@contains": { "array_col": "val3" } }
    ]
  }
  ```

## Multi-index queries

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

When created as a multi-index Cortex Search service with the [CREATE CORTEX SEARCH SERVICE … TEXT INDEXES … VECTOR INDEXES](../../../sql-reference/sql/create-cortex-search.md) syntax, the optional `multi_index_query` parameter is used. When omitting this parameter, all indices are used in the search.

### Usage notes

* Each index to query is represented as a key-value pair in the `multi_index_query` map.
* At least one vector index must be supplied in each query. Querying only text indexes is an error.
* When querying a multi-index Cortex Search Service, the following behaviors apply:

  + *AND across fields*: A match in all of the queried text or vector fields is required for a document to be returned.
  + *OR across terms within a text index field*: When a query contains multiple terms such as “wash fold”, a document
    is returned if *any* of the query terms are found within the document.
  + Text queries are automatically normalized using stemming, lemmatization, and domain-specific rewrites via Snowflake’s custom analyzer.
    This improves recall by matching related terms, such as linking “washing” to “wash” and “laundromat” to “laundry”.
* The `scoring_config.weights` field modifies the relative weight of each of the 3 high-level scoring techniques
  (vector, keyword, reranking) in a given query.

  Within this field, weights are applied *relative* to each other. For example,
  `{ "texts": 3,  "vectors": 2, "reranker": 1 }` and `{ "texts": 30,  "vectors": 20, "reranker": 10 }`
  are equivalent.
* Using the `scoring_config.functions.vector_boosts` and `scoring_config.functions.text_boosts` fields:

  + These fields allow users to modify the relative weight of each vector index and text index query,
    respectively, in a given query.
  + Within each field, weights are applied relative to each other, as in `scoring_config.weights`.
* Multi-index queries can be combined with numeric boosts, time decays, and queries that disable reranking.
  For information on using those features, see [Numeric boosts and time decays](cortex-search-customize-scoring.md)
  and [Reranking](cortex-search-customize-scoring.md).
* When querying a multi-index service, the `query` parameter can be used to specify a query to be applied to all fields, unless
  the service contains a vector index with user-provided vector embeddings.
* To optimize search performance and latency, columns containing vector embeddings are not returned in results when issuing a query to a user-provided vector index.
* Snowflake recommends refining your queries to use the `multi_index_query` on multi-index Cortex Search services to reduce the amount of resources consumed, which affects cost.

  For information on estimating pricing for multi-index queries, see [Estimating costs for multi-index Cortex Search](cortex-search-costs.md).

## Access control requirements

The role that is querying the Cortex Search Service must have the following privileges to retrieve results:

| Privilege | Object |
| --- | --- |
| USAGE | The Cortex Search Service |
| USAGE | The database in which the Cortex Search Service resides |
| USAGE | The schema in which the Cortex Search Service resides |

### Querying with owner’s rights

Cortex Search Services perform searches with [owner’s rights](../../../developer-guide/stored-procedure/stored-procedures-rights.md) and follow the same security model as other
Snowflake objects that run with owner’s rights.

In particular, this means that any role with sufficient privileges to query a Cortex Search Service
may query any of the data the service has indexed, regardless of that role’s privileges on the
underlying objects (such as tables and views) referenced in the service’s source query.

For example, for a Cortex Search Service that references a table with row-level masking policies,
querying users of that service will be able to see search results from rows on which the owner’s role
has read permission, even if the querying user’s role cannot read those rows in the source table.

Use caution, for example, when granting a role with USAGE privileges on a Cortex Search Service to another
Snowflake user.

## Known limitations

Querying a Cortex Search Service is subject to the following limitations:

* **Response size**: The total size of the response payload returned from a search query
  to a Cortex Search Service must not exceed the following limits:

  + [REST API](https://docs.snowflake.com/developer-guide/snowflake-rest-api/reference/cortex-search-service) and [Python API](../../../developer-guide/snowflake-python-api/snowflake-python-overview.md): 10 Megabytes (MB)
  + [SQL SEARCH_PREVIEW Function](../../../sql-reference/functions/search_preview-snowflake-cortex.md): 300 Kilobytes (KB)

Multi-index Cortex Search is subject to additional limitations, which may change during preview:

* The Cortex Search Playground in the Snowsight UI does not support queries to multi-index services. Queries to multi-index services in the Playground display the message “Unable to query search service. Invalid request parameters or filter syntax.”
* The multi-index serving query syntax with the `multi_index_query` parameter is supported only in versions 1.6.0 or later of the Python API.

## Examples

This section provides comprehensive examples for querying Cortex Search Services across all three API
methods.

### Setup for examples

The following examples use a table named `business_documents` with timestamp and numeric columns for
demonstrating various features:

```sqlexample
CREATE OR REPLACE TABLE business_documents (
    document_contents VARCHAR,
    last_modified_timestamp TIMESTAMP,
    created_timestamp TIMESTAMP,
    likes INT,
    comments INT
);

INSERT INTO business_documents (document_contents, last_modified_timestamp, created_timestamp, likes, comments)
VALUES
    ('Quarterly financial report for Q1 2024: Revenue increased by 15%, with expenses stable.',
     '2024-01-12 10:00:00', '2024-01-10 09:00:00', 10, 20),

    ('IT manual for employees: Instructions for usage of internal technologies, including hardware.',
     '2024-02-10 15:00:00', '2024-02-05 14:30:00', 85, 10),

    ('Employee handbook 2024: Updated policies on remote work, health benefits, and company culture.',
     '2024-02-10 15:00:00', '2024-02-05 14:30:00', 85, 10),

    ('Marketing strategy document: Target audience segmentation for upcoming product launch.',
     '2024-03-15 12:00:00', '2024-03-12 11:15:00', 150, 32),

    ('Product roadmap 2024: Key milestones for tech product development, including the launch.',
     '2024-04-22 17:30:00', '2024-04-20 16:00:00', 200, 45),

    ('Annual performance review process guidelines: Procedures for managers to conduct employee.',
     '2024-05-02 09:30:00', '2024-05-01 08:45:00', 60, 5);

CREATE OR REPLACE CORTEX SEARCH SERVICE business_documents_css
    ON document_contents
    WAREHOUSE = <warehouse_name>
    TARGET_LAG = '1 minute'
AS SELECT * FROM business_documents;
```

### Filter examples

#### Simple query with an equality filter

PythonREST APISQL

```python
resp = business_documents_css.search(
    query="technology",
    columns=["DOCUMENT_CONTENTS", "LIKES"],
    filter={"@eq": {"REGION": "US"}},
    limit=5
)
```

```javascript
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "technology",
  "columns": ["DOCUMENT_CONTENTS", "LIKES"],
  "filter": {"@eq": {"REGION": "US"}},
  "limit": 5
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_documents_css',
      '{
         "query": "technology",
         "columns": ["DOCUMENT_CONTENTS", "LIKES"],
         "filter": {"@eq": {"REGION": "US"}},
         "limit": 5
      }'
  )
)['results'] as results;
```

#### Range filter

PythonREST APISQL

```python
resp = business_documents_css.search(
    query="business",
    columns=["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
    filter={"@and": [
        {"@gte": {"LIKES": 50}},
        {"@lte": {"COMMENTS": 50}}
    ]},
    limit=10
)
```

```javascript
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "business",
  "columns": ["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
  "filter": {"@and": [
    {"@gte": {"LIKES": 50}},
    {"@lte": {"COMMENTS": 50}}
  ]},
  "limit": 10
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_documents_css',
      '{
         "query": "business",
         "columns": ["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
         "filter": {"@and": [
           {"@gte": {"LIKES": 50}},
           {"@lte": {"COMMENTS": 50}}
         ]},
         "limit": 10
      }'
  )
)['results'] as results;
```

### Scoring examples

#### Numeric boosts

Apply numeric boosts to both the likes and comments columns, with twice the boost weight on
comments values relative to likes values.

PythonREST APISQL

```python
resp = business_documents_css.search(
    query="technology",
    columns=["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
    scoring_config={
        "functions": {
            "numeric_boosts": [
                {"column": "comments", "weight": 2},
                {"column": "likes", "weight": 1}
            ]
        }
    }
)
```

```javascript
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "technology",
  "columns": ["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
  "scoring_config": {
    "functions": {
      "numeric_boosts": [
        {"column": "comments", "weight": 2},
        {"column": "likes", "weight": 1}
      ]
    }
  }
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_documents_css',
      '{
         "query": "technology",
         "columns": ["DOCUMENT_CONTENTS", "LIKES", "COMMENTS"],
         "scoring_config": {
           "functions": {
             "numeric_boosts": [
               {"column": "comments", "weight": 2},
               {"column": "likes", "weight": 1}
             ]
           }
         }
      }'
  )
)['results'] as results;
```

In the results, note:

> * With the boosts, the “Product roadmap 2024:…” document is the top result because of its large number of likes and comments, even though it has slightly lower relevance to the query “technology”
> * Without any boosts, the top result for the query is “IT manual for employees:…”

#### Time decays

Apply time decays based on the LAST_MODIFIED_TIMESTAMP column, where:

> * Documents with more recent LAST_MODIFIED_TIMESTAMP values, relative to the now timestamp, are boosted
> * Documents with a LAST_MODIFIED_TIMESTAMP value greater than 240 hours from the now timestamp receive little boosting

PythonREST APISQL

```python
resp = business_documents_css.search(
    query="technology",
    columns=["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
    scoring_config={
        "functions": {
            "time_decays": [
                {"column": "LAST_MODIFIED_TIMESTAMP", "weight": 1, "limit_hours": 240, "now": "2024-04-23T00:00:00.000-08:00"}
            ]
        }
    }
)
```

```javascript
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "technology",
  "columns": ["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
  "scoring_config": {
    "functions": {
      "time_decays": [
        {"column": "LAST_MODIFIED_TIMESTAMP", "weight": 1, "limit_hours": 240, "now": "2024-04-23T00:00:00.000-08:00"}
      ]
    }
  }
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_documents_css',
      '{
         "query": "technology",
         "columns": ["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
         "scoring_config": {
           "functions": {
             "time_decays": [
               {"column": "LAST_MODIFIED_TIMESTAMP", "weight": 1, "limit_hours": 240, "now": "2024-04-23T00:00:00.000-08:00"}
             ]
           }
         }
      }'
  )
)['results'] as results;
```

In the results, note:

> * With the decays, the “Product roadmap 2024:…” document is the top result because of its recency to the now timestamp, even though it has slightly lower relevance to the query “technology”
> * Without any decays, the top result for the query is “IT manual for employees:…”

#### Disabling reranking

To disable reranking:

PythonREST APISQL

```python
resp = business_documents_css.search(
    query="technology",
    columns=["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
    limit=5,
    scoring_config={
        "reranker": "none"
    }
)
```

```javascript
curl --location https://<ACCOUNT_URL>/api/v2/databases/<DB_NAME>/schemas/<SCHEMA_NAME>/cortex-search-services/<SERVICE_NAME>:query \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer $PAT" \
--data '{
  "query": "technology",
  "columns": ["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
  "scoring_config": {
    "reranker": "none"
  }
}'
```

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_documents_css',
      '{
         "query": "technology",
         "columns": ["DOCUMENT_CONTENTS", "LAST_MODIFIED_TIMESTAMP"],
         "scoring_config": {
           "reranker": "none"
         }
      }'
  )
)['results'] as results;
```

> **Tip:**
>
> To query a service **with** the reranker, omit the `"reranker": "none"` parameter from the
> `scoring_config` object, as reranking is the default behavior.

## Multi-index query examples

This section provides examples for querying multi-index Cortex Search Services with a restriction on which indices to search, for the Python and SQL APIs.

### Query a service with managed vector embeddings

Examples in this section use the following `business_directory` and `example_search_service` definitions:

```sqlexample
-- Search data
CREATE OR REPLACE TABLE business_directory (name TEXT, address TEXT, description TEXT);
INSERT INTO business_directory VALUES
    ('Joe''s Coffee', '123 Bean St, Brewtown','A cozy café known for artisan espresso and baked goods.'),
    ('Sparkle Wash', '456 Clean Ave, Sudsville', 'Eco-friendly car wash with free vacuum service.'),
    ('Tech Haven', '789 Circuit Blvd, Siliconia', 'Computer store offering the latest gadgets and tech repair services.'),
    ('Joe''s Wash n'' Fold', '456 Apple Ct, Sudsville', 'Laundromat offering coin laundry and premium wash and fold services.'),
    ('Circuit Town', '459 Electron Dr, Sudsville', 'Technology store selling used computer parts at discounted prices.')
;

-- Cortex Search Service
CREATE OR REPLACE CORTEX SEARCH SERVICE example_search_service
    TEXT INDEXES name, address
    VECTOR INDEXES description (model='snowflake-arctic-embed-m-v1.5')
    WAREHOUSE = example_wh
    TARGET_LAG = '1 hour'
    AS ( SELECT * FROM business_directory );
```

#### Query specific indexes

To query `example_search_service` over the `name` text field and `description` vector field:

PythonSQL

```python
resp = business_directory.search(
    query="tech repair shop",
    columns=["name", "description"],
    limit=2
)
```

```sqlexample
SELECT
  value['name']::text as name, value['address']::text as address, value['description']::text as description
FROM TABLE(FLATTEN(PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
      'business_search_service',
      '{
      "query": "tech repair shop",
        "columns": ["name", "description"],
        "limit": 2
      }'
  ))['results']));
```

```output
+---------------------+-----------------------------+--------------------------------------------------------------------------+
|        NAME         |           ADDRESS           |                            DESCRIPTION                                   |
|---------------------+-----------------------------+--------------------------------------------------------------------------|
| Tech Haven          | 789 Circuit Blvd, Siliconia | Computer store offering the latest gadgets and tech repair services.     |
| Circuit Town        | 459 Electron Dr, Sudsville  | Technology store selling used computer parts at discounted prices.       |
+---------------------+-----------------------------+--------------------------------------------------------------------------+
```

#### Query a managed vector column only

To query `example_search_service` for “refurbished components for PCs” over the vector index `description`, using managed embeddings:

PythonSQL

```python
resp = business_directory.search(
    multi_index_query={
        "description": [
            {"text": "refurbished components for PCs"}
        ]
    },
    columns=["name", "address", "description"],
    limit=5
)
```

```sqlexample
SELECT
    value['name']::text as name, value['address']::text as address, value['description']::text as description
FROM TABLE(FLATTEN(PARSE_JSON(
    SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
        'business_search_service',
        '{
          "multi_index_query": {
            "description": [
              {"text": "refurbished components for PCs"}
            ]
          },
          "columns": ["name", "address", "description"],
          "limit": 5
        }'
    )
)['results']));
```

```output
+---------------------+-----------------------------+--------------------------------------------------------------------------+
|        NAME         |           ADDRESS           |                            DESCRIPTION                                   |
|---------------------+-----------------------------+--------------------------------------------------------------------------|
| Circuit Town        | 459 Electron Dr, Sudsville  | Technology store selling used computer parts at discounted prices.       |
| Tech Haven          | 789 Circuit Blvd, Siliconia | Computer store offering the latest gadgets and tech repair services.     |
| Joe's Coffee        | 123 Bean St, Brewtown       | A cozy café known for artisan espresso and baked goods.                  |
| Joe's Wash n' Fold  | 456 Apple Ct, Sudsville    | Laundromat offering coin laundry and premium wash and fold services.      |
| Sparkle Wash        | 456 Clean Ave, Sudsville    | Eco-friendly car wash with free vacuum service.                          |
+---------------------+-----------------------------+--------------------------------------------------------------------------+
```

#### Query with index weights

To query the `example_search_service` for “sparkle” over the text index `name` and “clothing washing” over the vector index `description`, weighting vector scoring as four times more relevant than text or reranking:

PythonSQL

```python
resp = business_directory.search(
    multi_index_query={
        "name": [
            {"text": "sparkle"}
        ],
        "description": [
            {"text": "clothing washing"}
        ]
    },
    scoring_config={
        "weights": {
            "texts": 1,
            "vectors": 4,
            "reranker": 1
        }
    },
    columns=["name", "address", "description"],
    limit=2
)
```

```sqlexample
SELECT
    value['name']::text as name, value['address']::text as address, value['description']::text as description
FROM TABLE(FLATTEN(PARSE_JSON(
    SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
        'business_search_service',
        '{
          "multi_index_query": {
            "name": [
              {"text": "sparkle"}
            ],
            "description": [
              {"text": "clothing washing"}
            ]
          },
          "scoring_config": {
            "weights": {
              "texts": 1,
              "vectors": 4,
              "reranker": 1
            }
          },
          "columns": ["name", "address", "description"],
          "limit": 2
        }'
    )
)['results']));
```

```output
+---------------------+-----------------------------+--------------------------------------------------------------------------+
|        NAME         |           ADDRESS           |                            DESCRIPTION                                   |
|---------------------+-----------------------------+--------------------------------------------------------------------------|
| Joe's Wash n' Fold  | 456 Apple Ct, Sudsville     | Laundromat offering coin laundry and premium wash and fold services.     |
| Sparkle Wash        | 456 Clean Ave, Sudsville    | Eco-friendly car wash with free vacuum service.                          |
+---------------------+-----------------------------+--------------------------------------------------------------------------+
```

Note that because the weight of the `description` vector index colum is higher than the weight of any `text` column, the business most associated with “clothes washing” appears above the business containing “sparkle” in its name.

#### Query with individually weighted indexes

To query `example_search_service` with “circuit” over all fields, applying a relative weight to boost matches in the `name` column over the `description` column:

PythonSQL

```python
resp = business_directory.search(
    multi_index_query={
        "name": [{"text": "circuit"}],
        "address": [{"text": "circuit"}],
        "description": [{"text": "circuit"}]
    },
    scoring_config={
        "functions": {
            "text_boosts": [
                {"column": "name", "weight": 2},
                {"column": "address", "weight": 1}
            ]
        }
    },
    columns=["name", "address", "description"],
    limit=3
)
```

```sqlexample
SELECT
    value['name']::text as name, value['address']::text as address, value['description']::text as description
FROM TABLE(FLATTEN(PARSE_JSON(
    SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
        'business_search_service',
        '{
          "multi_index_query": {
            "name": [ {"text": "circuit"} ],
            "address": [ {"text": "circuit"} ],
            "description": [ {"text": "circuit"} ]
          },
          "scoring_config": {
              "functions": {
                "text_boosts": [{"column":"name", "weight": 2}, {"column":"address", "weight": 1}]
                }
          },
          "columns": ["name", "address", "description"],
          "limit": 3
        }'
    )
)['results']));
```

```output
+---------------------+-----------------------------+--------------------------------------------------------------------------+
|        NAME         |           ADDRESS           |                            DESCRIPTION                                   |
|---------------------+-----------------------------+--------------------------------------------------------------------------|
| Circuit Town        | 459 Electron Dr, Sudsville  | Technology store selling used computer parts at discounted prices.       |
| Tech Haven          | 789 Circuit Blvd, Siliconia | Computer store offering the latest gadgets and tech repair services.     |
| Joe's Coffee        | 123 Bean St, Brewtown       | A cozy café known for artisan espresso and baked goods.                  |
+---------------------+-----------------------------+--------------------------------------------------------------------------+
```

Note that boosting the name over address ranks the business named “Circuit Town” above the business located at an address on “Circuit Blvd”.

### Query a service with custom vector embeddings

Examples in this section use the following `business_documents` and `example_search_service` definitions:

```sqlexample
-- Search data with only custom embeddings
CREATE OR REPLACE TABLE business_documents (
  document_contents VARCHAR,
  document_embedding VECTOR(FLOAT, 3)
);
INSERT INTO business_documents VALUES
  ('Quarterly financial report for Q1 2024: Revenue increased by 15%, with expenses stable. Highlights include strategic investments in marketing and technology.', [1, 1, 1]::VECTOR(float, 3)),
  ('IT manual for employees: Instructions for usage of internal technologies, including hardware and software guides and commonly asked tech questions.', [2, 2, 2]::VECTOR(float, 3)),
  ('Employee handbook 2024: Updated policies on remote work, health benefits, and company culture initiatives.', [2, 3, 2]::VECTOR(float, 3)),
  ('Marketing strategy document: Target audience segmentation for upcoming product launch.', [1, -1, -1]::VECTOR(float, 3))
;

-- Cortex Search Service
CREATE OR REPLACE CORTEX SEARCH SERVICE example_search_service
  TEXT INDEXES (document_contents)
  VECTOR INDEXES (document_embedding)
  WAREHOUSE = example_wh
  TARGET_LAG = '1 minute'
  AS SELECT * FROM business_documents;
```

> **Note:**
>
> These examples use mock embeddings for simplicity. In a production use-case, vectors should be generated through a [Snowflake vector embedding model](../vector-embeddings.md) or an externally-hosted embedding model.

#### Query an index with custom embeddings

To query `example_search_service` with “IT” and a corresponding embedding over the `document_contents` and `document_embedding` column:

PythonSQL

```python
resp = business_directory.search(
    multi_index_query={
        "document_embedding": [ {"vector": [1, 1, 1]} ],
        "document_contents": [ {"text": "IT"} ]
    },
    columns=["document_contents"],
    limit=2
)
```

```sqlexample
SELECT
    value['document_contents']::text as document_contents
FROM TABLE(FLATTEN(PARSE_JSON(
    SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
        'byov_search_service',
        '{
          "multi_index_query": {
                "document_embedding": [ {"vector": [1, 1, 1] } ],
                "document_contents": [ {"text": "IT"} ]
          },
          "columns": ["document_contents"],
          "limit": 2
        }'
    )
)['results']));
```

```output
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|                                                                   DOCUMENT_CONTENTS                                                                                      |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| IT manual for employees: Instructions for usage of internal technologies, including hardware and software guides and commonly asked tech questions.                      |
| Quarterly financial report for Q1 2024: Revenue increased by 15%, with expenses stable. Highlights include strategic investments in marketing and technology.            |
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: Reference
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/reference.md
section: Snowflake Cortex (AI & ML)
---

# Reference

This page provides reference information about working with Snowflake Intelligence. It covers the following concepts:

* REST API endpoints and SQL commands available for creating, managing, and interacting with Cortex Agents
* The supported AI models and their regional availability
* Legal notices about model usage and data classification

## SQL commands and API reference

Snowflake offers the following programmatic methods to create and interact with Cortex Agents for use with Snowflake Intelligence:

* [Cortex Agent Object REST API](../cortex-agents-rest-api.md)
* [Agents SQL commands](../../../sql-reference/commands-cortex-agent.md)

### Cortex Agent Object REST API

The Cortex Agent Object REST API provides a comprehensive set of endpoints for building AI-powered applications that integrate with Snowflake Intelligence. You can use the REST API to manage agent objects and orchestrate interactions with your data.

#### Agent management operations

The Agent Object REST API supports the following operations for managing Cortex Agent objects:

| Operation | Endpoint | Description |
| --- | --- | --- |
| Create | `POST /api/v2/databases/{database}/schemas/{schema}/agents` | Creates a new Cortex Agent with specified attributes and tool configuration. |
| Describe | `GET /api/v2/databases/{database}/schemas/{schema}/agents/{name}` | Retrieves the configuration and metadata for an existing agent. |
| Update | `PUT /api/v2/databases/{database}/schemas/{schema}/agents/{name}` | Modifies an existing agent’s configuration, tools, or instructions. |
| List | `GET /api/v2/databases/{database}/schemas/{schema}/agents` | Lists all agents in a specified database and schema, with filtering options. |
| Delete | `DELETE /api/v2/databases/{database}/schemas/{schema}/agents/{name}` | Removes an agent from your account. |

For detailed API specifications including request parameters, response schemas, and examples, see [Cortex Agents REST API](../cortex-agents-rest-api.md).

### Agents SQL commands

Snowflake provides SQL commands to create and manage [Cortex Agents](../cortex-agents.md) objects for use with Snowflake Intelligence.

#### Agent management commands

The following SQL commands are available for managing Cortex Agents:

| Command | Description |
| --- | --- |
| [CREATE AGENT](../../../sql-reference/sql/create-agent.md) | Creates a new Cortex Agent or replaces an existing one with specified attributes, profile settings, and a YAML specification. |
| [DESCRIBE AGENT](../../../sql-reference/sql/desc-agent.md) | Retrieves the complete configuration, specification, and metadata for an existing agent. |
| [SHOW AGENTS](../../../sql-reference/sql/show-agents.md) | Lists all Cortex Agents for which you have access privileges, with filtering and pagination options. |
| [DROP AGENT](../../../sql-reference/sql/drop-agent.md) | Removes an agent from the current or specified schema. |

For detailed SQL syntax, parameters, and examples, see [Cortex Agent commands](../../../sql-reference/commands-cortex-agent.md).

## Supported models and regions

Snowflake Intelligence supports models listed in [Models](../cortex-agents.md). You can use these models as long as the account has access to them. For more information, see [Control model access](../aisql.md).

When creating an agent, we recommend selecting Auto for the model. This lets Snowflake Intelligence automatically select the highest quality model for your account and automatically improves as new models become available.

In default mode, when an incoming question is highly similar to a verified query, Snowflake Intelligence uses Arctic Text2SQL R1.5 to answer the question faster without using extended thinking. This reduces latency for common questions that already have validated SQL in the verified query repository.

While the listed models may not be available in [all regions](../aisql.md), you can use Snowflake Intelligence in any cloud or region by using Cortex Cross-region inference. This includes clouds and regions where the models are not available. For more information about configuring Cortex Cross-region inference, see [Cross-region inference](../cross-region-inference.md).

* **AWS US** - In AWS, Claude 4+ offers the highest quality and best speed performance. We recommend that you set up Cortex Cross-region inference for `aws_us` to use Claude 4 and get the best performance. Without Cortex Cross-region inference, you are restricted to using Claude 3.5 in `aws_us`.
* **Azure US** - If you are using Snowflake Intelligence in East US, you can use GPT 4.1+ without Cortex Cross-region inference. Other region and model combinations require Cortex Cross-region inference setup for `azure_us`.
* **AWS EU** - You can use Claude 4+ in this region as long as you configure Cortex Cross-region inference for `aws_eu`.
* **AWS APJ** - You can use Claude 4+ in this region as long as you configure Cortex Cross-region inference for `aws_apj`.

## Legal notices

Where your configuration of Snowflake Intelligence uses a model provided on the
[Model and Service Flow-down Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/open-source-model-flow-down-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../../guides-overview-ai-features.md).

---
title: Replicate a Cortex Search Service
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/cortex-search-replication.md
section: Snowflake Cortex (AI & ML)
---

# Replicate a Cortex Search Service

Cortex supports the replication of Cortex Search Services from a source account to one or more target accounts in the same organization. This replication is integrated seamlessly with Snowflake replication and failover groups to provide point-in-time consistency for the objects on the target account. For more information about replication and failover, see [Introduction to replication and failover across multiple accounts](../../account-replication-intro.md).

A search service is automatically replicated if the parent database is in a replication or failover group. The following behaviors apply to all replicated Cortex Search Services:

* A replicated Cortex Search Service is read only. No direct ALTER or DROP commands are allowed on the replicated entity.
* A replicated Cortex Search Service syncs with the primary service according to the replication schedule. Specifically, if the primary replica drops the service, the secondary service is also dropped during replication refresh.
* Replication related costs might incur for data transfer and compute resources during replication. There are no additional costs for Cortex Search indexing. For more information, see [Understanding replication cost](../../account-replication-cost.md).
* The serving status, queryability, and serving billing of a replicated Cortex Search Service differ between replication groups and failover groups:

|  | Replication group | Failover group |
| --- | --- | --- |
| Serving status | Inherits the serving status of the source service. If the source service is active, the replicated service is also active. | Always suspended until the failover group is promoted to primary. |
| Queryability | Queryable after a delay of up to 10 minutes following replication completion. | Not queryable until promoted to primary. |
| Serving costs | Billed for serving costs if the source service is in active serving status. | No serving costs until promoted to primary. |

For more information about replication and failover groups, see [CREATE REPLICATION GROUP](../../../sql-reference/sql/create-replication-group.md).

## Create a replicated Cortex Search Service using a replication group

To create a replicated Cortex Search Service, create a replication group that includes the parent database of the service.

1. Create a replication group in the primary account.

   > ```sqlexample
   > CREATE REPLICATION GROUP myrg
   >     OBJECT_TYPES = DATABASES
   >     ALLOWED_DATABASES = <database1>
   >     ALLOWED_ACCOUNTS = <org-name>.<secondary-account>
   >     REPLICATION_SCHEDULE = '60 MINUTE';
   > ```
2. From the secondary account, run the following command to create a replica of the primary account database in the secondary account.

   > ```sqlexample
   > CREATE REPLICATION GROUP myrg
   >     AS REPLICA OF <org-name>.<primary-account>.myrg;
   > ```
3. From the secondary account, manually refresh the replica.

   > ```sqlexample
   > ALTER REPLICATION GROUP myrg REFRESH;
   > ```
4. Create a Cortex Search Service in the primary database. For more information, see [CREATE CORTEX SEARCH SERVICE](../../../sql-reference/sql/create-cortex-search.md). The search service is automatically replicated according to the replication schedule.

## Create a replicated Cortex Search Service using a failover group

Failover groups allow you to back up your data in an additional account without using or paying for the replicated services. With a failover group, you can activate the failover only when needed to resume operations. To create a failover group for the Cortex Search Service, create a failover group that includes the parent database of the service.

1. Create a failover group in the primary account.

   > ```sqlexample
   > CREATE FAILOVER GROUP myrg
   >     OBJECT_TYPES = DATABASES
   >     ALLOWED_DATABASES = <database1>
   >     ALLOWED_ACCOUNTS = <org-name>.<secondary-account>
   >     REPLICATION_SCHEDULE = '60 MINUTE';
   > ```
2. From the secondary account, run the following command to create a failover of the primary account database in the secondary account.

   > ```sqlexample
   > CREATE FAILOVER GROUP myrg
   >     AS REPLICA OF <org-name>.<primary-account>.myrg;
   > ```
3. From the secondary account, manually refresh the failover group.

   > ```sqlexample
   > ALTER FAILOVER GROUP myrg REFRESH;
   > ```
4. Create a Cortex Search Service in the primary database. For more information, see [CREATE CORTEX SEARCH SERVICE](../../../sql-reference/sql/create-cortex-search.md). The search service is automatically replicated according to the replication schedule.
5. At the time of disaster recovery, run the following sql in the secondary account to make it the new primary. The replicated service will be activated and loaded into the serving system to query.

   > ```sqlexample
   > ALTER FAILOVER GROUP myrg PRIMARY;
   > ```

---
title: Resource budgets for Cortex Agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-resource-budgets.md
section: Snowflake Cortex (AI & ML)
---

# Resource budgets for Cortex Agents

A resource budget lets you monitor Cortex Agents spend for your account and take actions when it exceeds spending thresholds. This allows you
to control costs for Cortex Agents and take automated actions such as revoking access when spending
exceeds your configured limits. Resource budgets give you control over the credits consumed at an aggregated level for that specific Agent.

## How resource budgets work

Resource budgets use Snowflake’s tag-based cost attribution model. You create a tag, apply it to a
Cortex Agent object, and then associate that tag with a budget. Snowflake tracks credit consumption for
the tagged object and evaluates spending against the budget limit periodically. The resource budget is
useful for limiting the spend for the Cortex Agent object.

Snowflake enforces resource budgets with the following flow:

1. You create a tag.
2. You apply the tag to the Cortex Agent object.
3. You create a budget and specify the tag to track spending for. As part of creating the budget, you also set a monthly spending limit in credits.
4. You add a stored procedure to be executed when spending reaches a configured threshold of the budget. For example, you can invoke a stored procedure for alerting at 80% and another stored procedure for revoking access at 100%.
5. Snowflake tracks credit consumption for the tagged object.
6. When spending reaches a configured threshold of the budget, such as 80% or 100%, Snowflake executes the stored procedure defined for that threshold.

Snowflake calculates usage, evaluates thresholds, and triggers any configured actions periodically.
After the budget is exceeded, it might take up to eight hours with standard budget (or two hours with latency optimized option) for the budget to be enforced.

## Create a tag

1. Create a tag to identify the cost center associated with the Cortex Agent object:

   ```sqlexample
   -- Create a tag with allowed cost center values
   CREATE TAG cost_mgmt_db.tags.cost_center
      ALLOWED_VALUES 'org-level'
      COMMENT = 'cost_center tag';
   ```
2. Apply the tag to the Cortex Agent object to associate it with a cost center:

   ```sqlexample
   -- Apply the cost center tag to the Cortex Agent object
   ALTER AGENT IF EXISTS my_agent
     SET TAG cost_mgmt_db.tags.cost_center = 'org-level';
   ```

## Set up a resource budget

You can use either Snowsight or SQL to create a budget and associate it with a Cortex Agent object.

Snowsight UISQL

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. Select + Budget.
5. For Location to store, select the name of the database and schema where you want to create the budget.
6. For Name, use `my_budget`.
7. For Budget (credits per month), enter a value, such as **10000**, for the spending limit of the budget.
8. To decrease the [budget refresh interval](../budgets.md) so you can watch spending more closely, select Enable low latency budget.
9. For Threshold, enter a value, such as **80**, for the notification threshold.
10. For Notify, enter email addresses to receive notification emails.
11. Select Next.
12. For Budget scope, add the tag on the Cortex Agent object to the resource budget.
13. Select Create.

1. Create a budget instance in the schema where you manage budgets:

   ```sqlexample
   -- Create a budget instance
   USE SCHEMA budgets_db.budgets_schema;

   CREATE SNOWFLAKE.CORE.BUDGET my_budget();
   ```
2. Set the monthly credit spending limit for the budget:

   ```sqlexample
   -- Set a 10000-credit monthly spending limit
   CALL my_budget!SET_SPENDING_LIMIT(10000);
   ```
3. Add the tag to the budget so that Snowflake tracks spending for the tagged object against this budget:

   ```sqlexample
   -- Associate the cost center tag with the budget
   CALL budgets_db.budgets_schema.my_budget!SET_RESOURCE_TAGS(
      [
         [(SELECT SYSTEM$REFERENCE('TAG',
            'cost_mgmt_db.tags.cost_center',
            'SESSION',
            'applybudget')),
            'org-level']
      ],
      'UNION');
   ```

Now, Snowflake tracks credit consumption for `my_agent` against the `my_budget` budget
with a 10,000-credit monthly limit.

## Configure threshold actions

You can attach stored procedures that are executed when spending reaches specific thresholds, which are
expressed as a percentage of the spending limit and apply to the monthly budget period. For more information, see [Custom actions for budgets](../budgets/custom-actions.md).

### Send notifications

You can send notifications when spending reaches a threshold. For more information, see [Notifications for budgets](../budgets/notifications.md).

1. Set the email to send notifications to:

   ```sqlexample
   CALL my_budget!SET_EMAIL_NOTIFICATIONS(
     'budgets_notification_integration',
      'costadmin@example.com, budgetadmin@example.com'
   );
   ```
2. Set the notification threshold:

   ```sqlexample
   CALL my_budget!SET_NOTIFICATION_THRESHOLD(80);
   ```

### Revoke access

1. Create a stored procedure that revokes access to Cortex Agents. In the stored procedure, you can limit access to a specific role to revoke USAGE for that role.

   ```sqlexample
   -- Create a stored procedure that revokes access to the Cortex Agent object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_revoke_agent_access(
      agent_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'REVOKE ROLE agent_' || agent_name || '_role FROM ROLE ' || role_name;
      RETURN 'Access revoked for ' || agent_name;
   END;
   ```

   > **Important:**
   >
   > Ensure the `role_name` and the user do not have access to Cortex Agents through other roles. For guidance about configuring roles and privileges correctly, see [User privileges and access control](snowflake-intelligence/deploy-agents.md).
2. Set a custom action that blocks access when 100% of the budget has been spent. You can also use custom actions for notifications.

   ```sqlexample
   -- Provide access to the stored procedures
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_revoke_agent_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   -- Block access at 100% of the budget
   CALL budgets_db.budgets_schema.my_budget!ADD_CUSTOM_ACTION(
      SYSTEM$REFERENCE('PROCEDURE',
         'budgets_db.budgets_schema.sp_revoke_agent_access(string, string)'),
      ARRAY_CONSTRUCT('AGENT_NAME', 'ROLE_NAME'),
      'ACTUAL',
      100);
   ```

> > **Note:**
> >
> > You can also use custom actions to take action when spending is forecasted to exceed the budget limit. For more information, see [Custom actions for budgets](../budgets/custom-actions.md).

### Handling exceptions to spending limits

In some cases, you may need to reinstate access after the budget limit is reached, like during earnings season or
other peak periods. You can configure thresholds beyond 100%, up to 500%, to handle these exception
scenarios.

The workflow assumes that access is revoked using the configured stored procedure when spending reaches a budget threshold. The admin reinstates
a subset of the users and grants access back. When spending reaches 200%, the revocation procedure runs
again as a hard stop.

1. Create a stored procedure to reinstate access to the role:

   ```sqlexample
   -- Create a stored procedure that reinstates access to the Cortex Agent object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_reinstate_agent_access(
      agent_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'GRANT ROLE agent_' || agent_name || '_role TO ROLE ' || role_name;
      RETURN 'Access reinstated for ' || agent_name;
   END;
   ```
2. Configure a threshold beyond 100% with a stored procedure that reinstates access. This allows you to raise the effective budget for exception periods. Access is revoked again when spending reaches 200% of the budget:

   ```sqlexample
   -- Add grants for this procedure
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_revoke_agent_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   -- Issue a reinstatement for a subset of users
   CALL budgets_db.budgets_schema.sp_reinstate_agent_access('my_agent', 'power_user_role');

   -- Set another threshold at 200% as a hard stop
   CALL budgets_db.budgets_schema.my_budget!ADD_CUSTOM_ACTION(
      SYSTEM$REFERENCE(
         'PROCEDURE',
         'budgets_db.budgets_schema.sp_revoke_agent_access(string, string)'
      ),
      ARRAY_CONSTRUCT('my_agent', 'power_user_role'),
      'ACTUAL',
      200
   );
   ```

### Reinstate access

To ensure that users can access Cortex Agents again at the start of the next budget period, set the following stored procedure to be called when the budget cycle restarts.

1. Create a stored procedure to reinstate access to the role:

   ```sqlexample
   -- Create a stored procedure that reinstates access to the Cortex Agent object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_reinstate_agent_access(
      agent_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'GRANT ROLE agent_' || agent_name || '_role TO ROLE ' || role_name;
      RETURN 'Access reinstated for ' || agent_name;
   END;
   ```
2. Set a cycle-start action for the budget:

   ```sqlexample
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_reinstate_agent_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   CALL budgets_db.budgets_schema.my_budget!SET_CYCLE_START_ACTION(
      SYSTEM$REFERENCE('PROCEDURE', 'budgets_db.budgets_schema.sp_reinstate_agent_access(string, string)'),
      ARRAY_CONSTRUCT('my_agent', 'power_user_role')
   );
   ```

### Setting alerts based on projected spend

To receive an alert or perform an action based on forecasted spend rather than actual spend, you can set the trigger type to `PROJECTED`. For example, to call a stored procedure named `alert_team` when projected consumption reaches 75% of the budget limit, run the following command:

```sqlexample
CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
   SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, string, string)'),
   ARRAY_CONSTRUCT('admin@example.com', 'Budget Alert', 'Spending at 75% of budget limit'),
   'PROJECTED',
   75);
```

## List custom actions

* To list all custom actions configured on a budget, use the [GET_CUSTOM_ACTIONS](../../sql-reference/classes/budget/methods/get_custom_actions.md) method:

  > ```sqlexample
  > -- View all custom actions on the budget
  > CALL budgets_db.budgets_schema.my_budget!GET_CUSTOM_ACTIONS();
  > ```

For more information, see [Custom actions for budgets](../budgets/custom-actions.md).

## Monitor usage

* To view credit consumption per Cortex Agent object, use the budget’s usage reporting method:

  > ```sqlexample
  > -- View usage for the current month
  > CALL budgets_db.budgets_schema.my_budget!GET_SERVICE_TYPE_USAGE_V2(
  >    '2026-02',
  >    '2026-03'
  > );
  > ```

  The output includes the following columns:

  > | Column | Description |
  > | --- | --- |
  > | Service type | The service category (AI) |
  > | Entity type | The object type (CORTEX_AGENT) |
  > | Entity ID | The unique identifier of the Cortex Agent object |
  > | Name | The display name of the Cortex Agent object |
  > | Credits used | The total credits consumed during the specified period |
  > | Credits Cloud | Number of cloud service credits used |

## Budget enforcement latency

Budget calculations and threshold enforcement are conducted periodically:

1. Snowflake calculates credit consumption for the tagged Cortex Agent object.
2. The system evaluates spending against all configured thresholds.
3. If a threshold is reached, the associated stored procedure is executed.
4. Usage dashboards are updated with the latest figures.

If you have enabled the low latency budget, your budgets are enforced in two hours after the budget is exceeded. Otherwise, it may take up to eight hours after the budget is exceeded for enforcement. You can trigger budget execution more frequently, such as every sixty minutes, to reduce the [refresh interval](../budgets.md).

> **Warning:**
>
> There is an inherent delay between when credits are consumed and when the budget system detects the
> threshold breach. During the enforcement interval, spending can exceed the configured threshold before
> the action is executed. Plan your thresholds accordingly. For example, set an alert at 80% to give you time
> to respond before the 100% action is triggered.

## Limitations

The following limitations apply to resource budgets for Cortex Agents:

* **Single-team resources only:** Resource budgets apply to the entire Cortex Agent object.
* **Enforcement latency:** Budget enforcement runs on a periodic cycle and may take up to eight hours after the budget is exceeded to enforce the budget. Spending can exceed a threshold during the interval before the action triggers.
* **Role-based access revocation:** To revoke access at a threshold, you must create a dedicated role for
  the Cortex Agent object.
* **Monthly period:** Budgets operate on a monthly cycle. You can’t configure resource budget periods.
* **Tag latency:** When you change a tag on an object, it can take up to eight hours after the change to be
  reflected in budgets that use tags. For more information, see [Custom budgets](../budgets/custom-budget.md).
* **Entry point determines attribution:** If a request starts in Snowflake Intelligence and invokes a Cortex Agent,
  that usage is attributed to Snowflake Intelligence. As a result, a budget whose scope includes only
  Cortex Agent-tagged resources (for example, tags applied only to agents) does not capture credits from
  requests initiated in Snowflake Intelligence, even when those requests invoke a Cortex Agent.
  To cover this usage, include Snowflake Intelligence resources in budget scope, or configure a separate
  Snowflake Intelligence resource budget (see [Resource budgets for Snowflake Intelligence](snowflake-intelligence/si-resource-budgets.md)).
  Note, however, that Snowflake Intelligence-scoped budgets apply to all usage attributed to the scoped
  Snowflake Intelligence objects, not only to requests that invoke a specific Cortex Agent.

---
title: Resource budgets for Snowflake Intelligence
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/si-resource-budgets.md
section: Snowflake Cortex (AI & ML)
---

# Resource budgets for Snowflake Intelligence

A resource budget lets you monitor Snowflake Intelligence spend for your account and take actions when it exceeds spending thresholds. This allows you
to control costs for Snowflake Intelligence and take automated actions such as revoking access when spending
exceeds your configured limits. Resource budgets give you control over the credits consumed at an aggregated level
by the entire Snowflake Intelligence service.

## How resource budgets work

Resource budgets use Snowflake’s tag-based cost attribution model. You create a tag, apply it to a
Snowflake Intelligence object, and then associate that tag with a budget. Snowflake tracks credit consumption for
the tagged object and evaluates spending against the budget limit periodically. The resource budget is
useful for limiting the spend for Snowflake Intelligence aggregated across the entire account.

Snowflake enforces resource budgets with the following flow:

1. You create a tag
2. You apply the tag to the Snowflake Intelligence object.
3. You create a budget and specify the tag to track spending for. As part of creating the budget, you also set a monthly spending limit in credits.
4. You add a stored procedure to be executed when spending reaches a configured threshold of the budget. For example, you can invoke a stored procedure for alerting at 80% and another stored procedure for revoking access at 100%.
5. Snowflake tracks credit consumption for the tagged object.
6. When spending reaches a configured threshold of the budget, such as 80% or 100%, Snowflake executes the stored procedure defined for that threshold.

Snowflake calculates usage, evaluates thresholds, and triggers any configured actions periodically.
After the budget is exceeded, it might take up to eight hours for the budget to be enforced.

## Create a tag

1. Create a tag to identify the cost center associated with the Snowflake Intelligence object:

   ```sqlexample
   -- Create a tag with allowed cost center values
   CREATE TAG cost_mgmt_db.tags.cost_center
      ALLOWED_VALUES 'org-level'
      COMMENT = 'cost_center tag';
   ```
2. Apply the tag to the Snowflake Intelligence object to associate it with a cost center:

   ```sqlexample
   -- Apply the cost center tag to the Snowflake Intelligence object
   ALTER SNOWFLAKE INTELLIGENCE IF EXISTS si_instance_1
      SET TAG cost_mgmt_db.tags.cost_center = 'org-level';
   ```

## Set up a resource budget

You can use either Snowsight or SQL to create a budget and associate it with a Snowflake Intelligence object.

Snowsight UISQL

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. Select + Budget.
5. For Location to store, select the name of the database and schema where you want to create the budget.
6. For Name, use `my_budget`.
7. For Budget (credits per month), enter 10000 for the spending limit of the budget.
8. To decrease the [budget refresh interval](../../budgets.md) so you can watch spending more closely, select Enable low latency budget.
9. For Threshold, enter 80 for the notification threshold.
10. For Notify, enter email addresses to receive notification emails.
11. Select Next.
12. For Budget scope, add the tag on the Snowflake Intelligence object to the resource budget.
13. Select Create.

1. Create a budget instance in the schema where you manage budgets:

   ```sqlexample
   -- Create a budget instance
   USE SCHEMA budgets_db.budgets_schema;

   CREATE SNOWFLAKE.CORE.BUDGET my_budget();
   ```
2. Set the monthly credit spending limit, such as **10000**, for the budget:

   ```sqlexample
   -- Set a 10000-credit monthly spending limit
   CALL my_budget!SET_SPENDING_LIMIT(10000);
   ```
3. Add the tag to the budget so that Snowflake tracks spending for the tagged object against this budget:

   ```sqlexample
   -- Associate the cost center tag with the budget
   CALL budgets_db.budgets_schema.my_budget!SET_RESOURCE_TAGS(
      [
         [(SELECT SYSTEM$REFERENCE('TAG',
            'cost_mgmt_db.tags.cost_center',
            'SESSION',
            'applybudget')),
            'org-level']
      ],
      'UNION');
   ```

Now, Snowflake tracks credit consumption for `si_instance_1` against the `my_budget` budget
with a 10,000-credit monthly limit.

## Configure threshold actions

You can attach stored procedures that are executed when spending reaches specific thresholds, which are
expressed as a percentage of the spending limit and apply to the monthly budget period. For more information, see [Custom actions for budgets](../../budgets/custom-actions.md).

### Send notifications

You can send notifications when spending reaches a threshold. For more information, see [Notifications for budgets](../../budgets/notifications.md).

1. Set the email to send notifications to:

   ```sqlexample
   CALL my_budget!SET_EMAIL_NOTIFICATIONS(
     'budgets_notification_integration',
      'costadmin@example.com, budgetadmin@example.com'
   );
   ```
2. Set the notification threshold:

   ```sqlexample
   CALL my_budget!SET_NOTIFICATION_THRESHOLD(80);
   ```

### Revoke access

1. Create a stored procedure that revokes access to Snowflake Intelligence. In the stored procedure, you can limit access to a specific role to revoke USAGE for that role.

   ```sqlexample
   -- Create a stored procedure that revokes access to the SI object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_revoke_si_access(
      si_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'REVOKE ROLE si_' || si_name || '_role FROM ROLE ' || role_name;
      RETURN 'Access revoked for ' || si_name;
   END;
   ```

   > **Important:**
   >
   > Ensure the `role_name` and the user do not have access to Snowflake Intelligence through other roles. For guidance about configuring roles and privileges correctly, see [User privileges and access control](deploy-agents.md).
2. Set a custom action that blocks access when 100% of the budget has been spent:

   ```sqlexample
   -- Provide access to the stored procedures
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_revoke_si_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   -- Block access at 100% of the budget
   CALL budgets_db.budgets_schema.my_budget!ADD_CUSTOM_ACTION(
      SYSTEM$REFERENCE('PROCEDURE',
         'budgets_db.budgets_schema.sp_revoke_si_access(string, string)'),
      ARRAY_CONSTRUCT('SI_NAME', 'ROLE_NAME'),
      'ACTUAL',
      100);
   ```

> > **Note:**
> >
> > You can also use custom actions for notifications or to take action when spending is forecasted to exceed the budget limit. For more information, see [Custom actions for budgets](../../budgets/custom-actions.md).

### Handling exceptions to spending limits

In some cases, you need to reinstate access after the budget limit is reached, like during earnings season or
other peak periods. You can configure thresholds beyond 100%, up to 500%, to handle these exception
scenarios.

The workflow assumes that access is revoked using the configured stored procedure when spending reaches a budget threshold. In the
following example, access has been revoked after spending reaches the 100% threshold. The admin reinstates
a subset of the users and grants access back. When spending reaches 200%, the revocation procedure runs
again as a hard stop.

1. Create a stored procedure to reinstate access to the role:

   ```sqlexample
   -- Create a stored procedure that reinstates access to the SI object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_reinstate_si_access(
      si_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'GRANT ROLE si_' || si_name || '_role TO ROLE ' || role_name;
      RETURN 'Access reinstated for ' || si_name;
   END;
   ```
2. Configure a threshold beyond 100% with a stored procedure that reinstates access. This allows you to raise the effective budget for exception periods. Access is revoked again when spending reaches 200% of the budget:

   ```sqlexample
   -- Add grants for this procedure
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_revoke_si_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   -- Issue a reinstatement for a subset of users
   CALL budgets_db.budgets_schema.sp_reinstate_si_access('si_instance_1', 'power_user_role');

   -- Set another threshold at 200% as a hard stop
   CALL budgets_db.budgets_schema.my_budget!ADD_CUSTOM_ACTION(
      SYSTEM$REFERENCE(
         'PROCEDURE',
         'budgets_db.budgets_schema.sp_revoke_si_access(string, string)'
      ),
      ARRAY_CONSTRUCT('si_instance_1', 'power_user_role'),
      'ACTUAL',
      200
   );
   ```

### Reinstate access

To ensure that users can access Snowflake Intelligence again at the start of the next budget period, set the following stored procedure to be called when the budget cycle restarts.

1. Create a stored procedure to reinstate access to the role:

   ```sqlexample
   -- Create a stored procedure that reinstates access to the SI object
   CREATE OR REPLACE PROCEDURE budgets_db.budgets_schema.sp_reinstate_si_access(
      si_name STRING, role_name STRING
   )
   RETURNS STRING
   LANGUAGE SQL
   AS
   BEGIN
      EXECUTE IMMEDIATE 'GRANT ROLE si_' || si_name || '_role TO ROLE ' || role_name;
      RETURN 'Access reinstated for ' || si_name;
   END;
   ```
2. Set a cycle-start action for the budget:

   ```sqlexample
   GRANT USAGE ON DATABASE budgets_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE budgets_db.budgets_schema.sp_reinstate_si_access(STRING, STRING)
      TO APPLICATION SNOWFLAKE;

   CALL budgets_db.budgets_schema.my_budget!SET_CYCLE_START_ACTION(
      SYSTEM$REFERENCE('PROCEDURE', 'budgets_db.budgets_schema.sp_reinstate_si_access(string, string)'),
      ARRAY_CONSTRUCT('si_instance_1', 'power_user_role')
   );
   ```

### Setting alerts based on projected spend

To receive an alert or perform an action based on forecasted spend rather than actual spend, you can set the trigger type to `PROJECTED`. For example, to call a stored procedure named `alert_team` when projected consumption reaches 75% of the budget limit, run the following command:

```sqlexample
CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
   SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, string, string)'),
   ARRAY_CONSTRUCT('admin@example.com', 'Budget Alert', 'Spending at 75% of budget limit'),
   'PROJECTED',
   75);
```

## List custom actions

* To list all custom actions configured on a budget, use the [GET_CUSTOM_ACTIONS](../../../sql-reference/classes/budget/methods/get_custom_actions.md) method:

  > ```sqlexample
  > -- View all custom actions on the budget
  > CALL budgets_db.budgets_schema.my_budget!GET_CUSTOM_ACTIONS();
  > ```

> For more information, see [Custom actions for budgets](../../budgets/custom-actions.md).

## Monitor usage

* To view credit consumption per Snowflake Intelligence object, use the budget’s usage reporting method:

  > ```sqlexample
  > -- View usage for the current month
  > CALL budgets_db.budgets_schema.my_budget!GET_SERVICE_TYPE_USAGE_V2(
  >    '2026-02',
  >    '2026-03'
  > );
  > ```

  The output includes the following columns:

  > | Column | Description |
  > | --- | --- |
  > | Service type | The service category (AI) |
  > | Entity type | The object type (SI) |
  > | Entity ID | The unique identifier of the Snowflake Intelligence object |
  > | Name | The display name of the Snowflake Intelligence object |
  > | Credits used | The total credits consumed during the specified period |
  > | Credits Cloud | Number of cloud service credits used |

## Budget enforcement latency

Budget calculations and threshold enforcement are conducted periodically:

1. Snowflake calculates credit consumption for the tagged Snowflake Intelligence object.
2. The system evaluates spending against all configured thresholds.
3. If a threshold is reached, the associated stored procedure is executed.
4. Usage dashboards are updated with the latest figures.

If the low latency budget is enabled, the budgets are enforced in two hours after the budget is exceeded. Otherwise, it may take up to eight hours after the budget is exceeded for enforcement. To reduce the [refresh interval](../../budgets.md), you can trigger budget execution more frequently, such as every 60 minutes.

> **Warning:**
>
> There is an inherent delay between when credits are consumed and when the budget system detects the
> threshold breach. During the enforcement interval, spending can exceed the configured threshold before
> the action is executed. Plan your thresholds accordingly. For example, set an alert at 80% to give you time
> to respond before the 100% action is triggered.

## Limitations

The following limitations apply to resource budgets for Snowflake Intelligence:

* **Single-team resources only:** Resource budgets apply to the entire Snowflake Intelligence object.
* **Enforcement latency:** Budget enforcement runs on a periodic cycle and may take up to eight hours to
  enforce the budget after the budget is exceeded. Spending can exceed a threshold during the interval before the action triggers.
* **Role-based access revocation:** To revoke access at a threshold, you must create a dedicated role for
  the Snowflake Intelligence object. Direct block actions on the object aren’t yet supported.
* **Monthly period:** Budgets operate on a monthly cycle. You can’t configure resource budget periods.
* **Tag latency:** When you change a tag on an object, it can take up to eight hours after the change to be
  reflected in budgets that use tags. For more information, see [Custom budgets](../../budgets/custom-budget.md).

---
title: Routing Mode for Cortex Analyst
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-routing-mode.md
section: Snowflake Cortex (AI & ML)
---

# Routing Mode for Cortex Analyst

Routing Mode is a query-generation strategy that prioritizes semantic SQL and falls back to standard SQL only when needed. It acts as a simpler version of SQL, with guardrails coming from your semantic views. Routing mode uses your semantic views to drive higher accuracy and consistency. As a result, metrics, joins, and filters follow governed definitions from the semantic view.

Cortex Analyst automatically uses Routing Mode when generating based on a semantic view. There is no change to your workflow, except higher text-to-sql quality.

> **Note:**
>
> Routing Mode does not change permissions. Semantic views are Snowflake objects with standard privileges; access is enforced the same way as tables or views.

## Benefits of Routing Mode

Routing Mode offers the following benefits:

* **Consistent metrics:** Queries use definitions from semantic views, not SQL.
* **Safer defaults:** Dimensions, metrics, and joins come from governed metadata.
* **LLM-friendly:** Shorter SQL is easier for an LLM to produce correctly.

Routing Mode could be beneficial in the following situations:

* You have one or more semantic views that define core business entities and metrics.
* You want consistent answers for common questions, with flexibility for edge cases.

For example, consider the following scenarios and how Routing Mode handles them:

* **Ask for a governed metric by a business dimension**

  + **User intent:** “Average order value by customer segment.”
  + **Routing behavior:** Tries semantic SQL first, so joins and metric calculations come from the view.

    > ```sqlexample
    > SELECT *
    > FROM SEMANTIC_VIEW(
    >   tpch_analysis
    >   DIMENSIONS customer.customer_market_segment
    >   METRICS orders.order_average_value
    > )
    > ORDER BY customer_market_segment;
    > ```
  + **Benefit:** No manual joins or metric formulas. Results align with your BI definitions.
* **Multiple governed metrics with one dimension**

  + **User intent:** “Show total revenue and order count by year.”

    > ```sqlexample
    > SELECT *
    > FROM SEMANTIC_VIEW(
    >   tpch_analysis
    >   DIMENSIONS orders.order_year
    >   METRICS orders.total_revenue, orders.order_count
    > )
    > ORDER BY order_year;
    > ```
  + **Benefit:** Both metrics use the same definitions and filters as in the semantic view.
* **Fallback for uncovered asks**

  + **User intent:** “Show a raw column or transformation not modeled in the view.”
  + **Routing behavior:** If the semantic view cannot satisfy the request, Cortex Analyst automatically routes to standard SQL on base tables.
  + **Benefit:** Flexibility without blocking the user.

## How it works

The following procedure outlines the steps that Cortex Analyst takes when using Routing Mode.

1. Cortex Analyst uses Routing mode in the playground, API, and all product surfaces.
2. Cortex Analyst tries to produce semantic SQL.

   > ```sqlexample
   > SELECT … FROM SEMANTIC_VIEW(...).
   > ```
3. If Cortex Analyst is unable to produce a valid semantic SQL query that answers the question within the timeout, Cortex Analyst routes to standard SQL on physical tables.

> **Note:**
>
> Routing Mode only results in semantic SQL for about 10% of queries, in aggregate. This number varies depending on the level of coverage the metrics defined in the semantic view have.

## Considerations

* If the semantic view cannot satisfy a question, Cortex Analyst falls back to Standard SQL. You should expand the semantic view to reduce fallbacks over time.

---
title: Sentiment extraction
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-sentiment.md
section: Snowflake Cortex (AI & ML)
---

# Sentiment extraction

> **Note:**
>
> AI_SENTIMENT is the updated version of [ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)](../../sql-reference/functions/entity_sentiment-snowflake-cortex.md).
> For the latest functionality, use AI_SENTIMENT.

The AI_SENTIMENT function provides state-of-the-art quality sentiment classification across diverse markets and
languages. With AI_SENTIMENT, you can get both overall and granular, aspect based sentiment analysis for use cases like
the following:

* Social media monitoring
* Detailed product analysis
* Comprehensive brand perception studies
* Advanced market intelligence
* Employee engagement analysis
* Customer experience journey mapping
* Content performance analysis
* Customer support optimization

## Sentiment extraction quality

AI_SENTIMENT uses a custom Snowflake large language model that delivers industry-leading overall sentiment and
aspect-based sentiment accuracy. The following table provides information on how AI_SENTIMENT performs on Overall
Sentiment and Aspect Based Sentiment (ABSA-mix) benchmarks compared to popular models. The languages evaluated in the
multilingual benchmark are English, Spanish, French, German, Hindi, Italian, and Portuguese.

> **Note:**
>
> Some of the models benchmarked are not available in Snowflake Cortex.

| Model | Aspect based sentiment  accuracy (`ABSA-mix`) | Aspect based sentiment  accuracy (`ABSA-multilingual`) | Overall sentiment  accuracy | Overall sentiment  accuracy (multilingual) |
| --- | --- | --- | --- | --- |
| Cortex AI `AI_SENTIMENT` | 0.92 | 0.81 | 0.83 | 0.83 |
| `claude-4-sonnet` | 0.84 | 0.79 | 0.75 | 0.82 |
| `mistral-large2` | 0.83 | 0.80 | 0.77 | 0.78 |
| `openai-gpt-4.1` | 0.83 | 0.73 | 0.80 | 0.78 |
| `llama4-scout` | 0.82 | 0.79 | 0.71 | 0.76 |
| `llama3.3-70b` | 0.82 | 0.79 | 0.71 | 0.76 |
| AWS `DetectSentiment` |  |  | 0.62 | 0.64 |

## Calling the AI_SENTIMENT function

By default, Cortex AI_SENTIMENT returns overall sentiment scores for the overall content. However, AI_SENTIMENT can also
capture a spectrum of customer opinions beyond overall positive, negative, and neutral buckets. For this optional
aspect-based sentiment analysis, specify the content (such as a customer comment or a review) and the aspects (also
called entities or categories) for which you want to analyze sentiment. AI_SENTIMENT returns sentiment for each entity
as well as an overall sentiment. To obtain only the overall sentiment, specify the content without aspects.

### English examples

The following example uses AI_SENTIMENT to get the sentiment classification of a product review.

```sqlexample
SELECT AI_SENTIMENT('I went to the store, bought the leggings and exact same as shorts...
  they are expensive but i heard such great things. After wearing them twice i noticed a string popping out already.
  And aince i believed that they were this amazing luxury brand i didnt keep the receipt 😭 ');
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "mixed"
    }
  ]
}
```

The following example uses AI_SENTIMENT to get the sentiment classification for specific aspects of a restaurant review.

```sqlexample
SELECT AI_SENTIMENT('A tourist\'s delight, in low urban light,
  Recommended gem, a pizza night sight. Swift arrival, a pleasure so right,
  Yet, pockets felt lighter, a slight pricey bite. 💰🍕🚀',
  ['Cost', 'Quality' ,'Wait Time']);
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "mixed"
    },
    {
      "name": "Cost",
      "sentiment": "negative"
    },
    {
      "name": "Quality",
      "sentiment": "positive"
    },
    {
      "name": "Wait Time",
      "sentiment": "positive"
    }
  ]
}
```

If some aspects that you specify do not apply to the text you provide, AI_SENTIMENT returns “unknown” for those aspects,
as shown for Professionalism and Brand in the following example.

```sqlexample
SELECT AI_SENTIMENT('A tourist\'s delight, in low urban light,
  Recommended gem, a pizza night sight. Swift arrival, a pleasure so right,
  Yet, pockets felt lighter, a slight pricey bite. 💰🍕🚀',
  ['Cost', 'Professionalism' ,'Brand']);
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "mixed"
    },
    {
      "name": "Brand",
      "sentiment": "unknown"
    },
    {
      "name": "Cost",
      "sentiment": "negative"
    },
    {
      "name": "Professionalism",
      "sentiment": "unknown"
    }
  ]
}
```

### Multilingual examples

As shown in the following two similar examples, AI_SENTIMENT can analyze sentiment in multiple languages, so you don’t
need to translate the text and risk losing an essential part of its meaning. You do not need to specify the language of
the text. Aspects can be specified in the language of the text, as shown in the following example, or in English, as
shown in the second example.

> **Note:**
>
> AI_SENTIMENT supports English, French, German, Hindi, Italian, Spanish, and Portuguese.

Example with both text and labels in Spanish:

```sqlexample
SELECT AI_SENTIMENT ('Pedí dos pares del mismo modelo en diferentes colores.
    Uno tenía defectos en la costura y el cuero se veía de menor calidad.
    Por 350€ el par, esto es inaceptable. El servicio al cliente tardó una
    semana en responder y la solución no fue satisfactoria. Es una pena porque
    cuando están bien hechos, son zapatos hermosos. Pero la inconsistencia en la
    calidad es preocupante.', ['Calidad', 'Calidad de Servicio,' 'Precio', 'Tiempo de Espera']);
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "negative"
    },
    {
      "name": "Calidad",
      "sentiment": "negative"
    },
    {
      "name": "Calidad de Servicio",
      "sentiment": "negative"
    },
    {
      "name": "Precio",
      "sentiment": "negative"
    },
    {
      "name": "Tiempo de Espera",
      "sentiment": "negative"
    }
  ]
}
```

Example with text in German and labels in English:

```sqlexample
SELECT AI_SENTIMENT ('Die Schuhe selbst sind wirklich schön und gut verarbeitet.
    Das Leder ist weich und die Passform stimmt. Allerdings gab es erhebliche
    Verzögerungen bei der Lieferung - statt der versprochenen 5 Tage hat es 3
    Wochen gedauert. Der Kundenservice war freundlich, aber nicht sehr hilfreich.
    Für 320€ erwarte ich besseren Service. Die Schuhe sind in Ordnung, aber das
    Gesamterlebnis war mittelmäßig', ['Quality', 'Price', 'Service', 'WaitTime']);
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "mixed"
    },
    {
      "name": "Price",
      "sentiment": "neutral"
    },
    {
      "name": "Quality",
      "sentiment": "positive"
    },
    {
      "name": "Service",
      "sentiment": "neutral"
    },
    {
      "name": "WaitTime",
      "sentiment": "negative"
    }
  ]
}
```

## Model restrictions

All large language models (LLMs) available in Snowflake Cortex AI have limitations on the total number of input and
output tokens, which is referred to as the model’s *context window*. Inputs exceeding the context window limit
result in an error. Output which would exceed the context window limit is truncated.

The context window for AI_SENTIMENT is set such that the model can sustain a high level of accuracy. AI_SENTIMENT was
trained and optimized for text inputs of 2,048 tokens (roughly 1,600 words). You can specify a maximum of ten aspects,
each no longer than thirty characters.

| Function | Context window (tokens) | Maximum number of entity labels |
| --- | --- | --- |
| AI_SENTIMENT | 2,048 | 10 |

---
title: Share Cortex Agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-sharing.md
section: Snowflake Cortex (AI & ML)
---

# Share Cortex Agents

As a provider, you can share an existing Cortex Agent with other organizations on Snowflake, allowing you to expand the user
base of your agent and bring its value to other Snowflake customers. You can share your Cortex Agents either on Snowflake Marketplace or with
designated accounts. For more information on being a provider for Snowflake Marketplace, see [Use listings as a provider](../../collaboration/provider-becoming.md).

As a consumer of a shared Cortex Agent, you gain access to an easy-to-use interface to get insights from shared structured or unstructured data. For more information on consuming Cortex Agents, see [Use listings as a consumer](../../collaboration/consumer-becoming.md) and [Use and manage Snowflake Native Apps as a consumer](../../developer-guide/native-apps/ui-consumer-about.md).

## Requirements

Sharing a Cortex Agent requires the following:

* Sharing all linked objects such as semantic views or Cortex Search Services. For more information, see [Create and configure shares](../data-sharing-provider.md) and [Sharing semantic views](../views-semantic/sharing-semantic-views.md).
* Shared linked objects must be in the same database as your shared Cortex Agent.
* Only agents that use the following tool types can be shared: semantic views, Cortex Search Services, and functions. Agents that use other tool types, such as procedures, skills, or MCP connectors, can’t be shared.

## Set a Cortex Agent as shared

You can share your Cortex Agents as a provider in Snowflake Marketplace through [Provider Studio](../../collaboration/provider-studio-accessing.md).

You can also set an agent as shared with a SQL statement. The following example adds the agent `my_agent` to the share `my_share`:

```sqlexample
GRANT USAGE ON AGENT my_agent TO SHARE my_share;
```

If your agent uses linked objects such as semantic views, Cortex Search Services, or functions, you must also grant those objects to the share:

```sqlexample
GRANT USAGE ON AGENT my_agent TO SHARE my_share;
GRANT SELECT, REFERENCES ON SEMANTIC VIEW my_sv TO SHARE my_share;
GRANT USAGE ON CORTEX SEARCH SERVICE my_css TO SHARE my_share;
GRANT USAGE ON FUNCTION my_function TO SHARE my_share;
```

When you add an agent to an existing share, the consumer user who has installed the share receives an email notification to try out the agent.

## Identify shared agents in Snowsight

In the navigation menu, select AI & ML » Agents. The **Source** column indicates whether each agent is **Local** or **Shared**. Use this
column to quickly distinguish between agents created in your account and agents shared with you from another account.

## Consume a shared Cortex Agent

When you get a listing that contains a shared Cortex Agent, you can add the agent to Snowflake Intelligence. To do so, keep the
**Add to Snowflake Intelligence** toggle enabled when you get the listing. This makes the shared agent available as a data source
within Snowflake Intelligence.

### Warehouse selection

By default, a shared agent runs using your default warehouse. You can specify a custom warehouse for query and tool execution
to control compute resources and costs.

To configure a custom warehouse for a shared agent:

1. Sign in to Snowsight.
2. In the navigation menu, select AI & ML » Agents.
3. Select a shared agent. You can identify shared agents by the **Source** column.
4. Select More options menu (…) ‣ Configure warehouses for tools.
5. Select **Custom**, choose a warehouse, and then select **Save**.

After you configure a custom warehouse, the shared agent uses the specified warehouse to run queries and execute tools.

## Replication

Shared Cortex Agents support replication. Listing auto-fulfillment replicates agents to other regions, allowing consumers
in different regions to access the shared agent.

## Limitations

The following limitations apply to shared Cortex Agents:

* A SQL table function can be shared, but a Python user-defined table function can’t.
* If you update a shared agent to use new tools (such as semantic views, Cortex Search Services, or functions), you must also grant those new tools to the share. New tools aren’t automatically added.

## Cost considerations

In addition to any costs paid to the provider of the shared Cortex Agent, consumers are billed for the following:

* Input and output tokens used by the consumer’s invocation of the shared agent.
* Consumer’s warehouse usage for SQL query and tools execution.

For more information on costs paid to providers, see [Pay for listings](../../collaboration/consumer-listings-paying.md). For more information on Snowflake costs, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: Snowflake AI Observability Reference
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/ai-observability/reference.md
section: Snowflake Cortex (AI & ML)
---

# Snowflake AI Observability Reference

This document provides a comprehensive reference for using Snowflake Cortex AI Observability to evaluate and monitor the performance of your generative AI applications.

It covers the following concepts:

* Datasets and attributes
* Evaluation metrics
* Runs
* Access control and storage

## Dataset and attributes

A dataset is a set of inputs that you use to test the application.
It can also contain a set of expected outputs (the ground truth).

You can use the TruLens Python SDK to specify the dataset as either a [Snowflake table](../../../sql-reference/sql/create-table.md) or a [pandas dataframe](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html).
Each column in the dataset must be mapped to one of the following reserved attributes:

Reserved attributes

| Input attribute | Description |
| --- | --- |
| RECORD_ROOT.INPUT | Input prompt to the LLM.  Type: string |
| RECORD_ROOT.INPUT_ID | Unique identifier for the input prompt.  If you don’t provide an input ID, an ID is automatically generated and assigned to each input.  Type: string |
| RETRIEVAL.QUERY_TEXT | User query for a RAG application  Type: string |
| RECORD_ROOT.GROUND_TRUTH_OUTPUT | Expected response for the input prompt.  Type: string |

For instrumenting the application, you must map the input and output parameters for the instrumented function (or method) to the relevant input and output attributes. Use the `@instrument` decorator to map the parameters and compute the metrics. In addition to the input attributes specified as part of the dataset, you can also use the following output attributes to instrument the relevant functions:

Output attributes

| Output attribute | Description |
| --- | --- |
| RETRIEVAL.RETRIEVED_CONTEXTS | Output generated by the LLM.  Type: List [string] |
| RECORD_ROOT.OUTPUT | Generated response from the LLM.  Type: string |

## Evaluation metrics

Evaluation metrics provide a quantifiable way to measure the accuracy and performance of your application. These metrics are computed using specific inputs to the application, LLM-generated outputs, and any intermediate information (such as retrieved results from a RAG application). You can also compute metrics using a ground truth dataset.

You can compute metrics with the “LLM-as-a-judge” approach. With this approach, an LLM is used to generate a score (between 0 - 1) with an explanation for the application’s output. based on the provided information. You can select any LLM available in Cortex AI as judges. If no LLM judge is specified, llama3.1-70b is used as the default judge. AI Observability supports a variety of evaluation metrics.

### Context Relevance

Context Relevance determines if the retrieved context from the retriever or the search service is relevant to the user query. Given the user query and retrieved context, an LLM judge is used to determine relevance of the retrieved context based on the query.

Required Attributes:

* `RETRIEVAL.QUERY_TEXT`: User query in a RAG or search application
* `RETRIEVAL.RETRIEVED_CONTEXTS`: Context retrieved from the search service or retriever

### Groundedness

Groundedness determines if the generated response is supported by and grounded in the retrieved context from the retriever or the search service. Given the generated response and retrieved context, an LLM judge is used to determine groundedness. The underlying implementation uses Chain-of-thought reasoning when generating the groundedness scores.

Required Attributes:

* `RETRIEVAL.RETRIEVED_CONTEXTS`: User query in a RAG or search application
* `RECORD_ROOT.OUTPUT`: Final response generated by the LLM

### Answer Relevance

Answer relevance determines if the generated response is relevant to the user query. Given the user query and generated response, an LLM judge is used to determine how relevant the response is when answering the user’s query. Note that this doesn’t rely on ground truth answer reference, and therefore this is not equivalent to assessing answer correctness.

Required Attributes:

* `RECORD_ROOT.INPUT`: User query in a RAG or search application
* `RECORD_ROOT.OUTPUT`: Final response generated by the LLM

### Correctness

Correctness determines how aligned the generated response is with the ground truth. A higher correctness score indicates a more accurate response with larger alignment with the ground truth.

Required Attributes:

* `RECORD_ROOT.INPUT`: User query or prompt to the LLM
* `RECORD_ROOT.GROUND_TRUTH_OUTPUT`: Expected response based on the user query
* `RECORD_ROOT.OUTPUT`: Response generated by the LLM

### Coherence

Coherence measures if the generated response of the model is coherent and doesn’t introduce logical gaps, inconsistencies or contradictions. A higher coherence score indicates a highly coherent response.

Required Attributes:

* `RECORD_ROOT.OUTPUT`: Response generated by the LLM

### Cost and Latency

#### Usage cost

Cost is calculated for each LLM invocation call that relies on Cortex LLMs based on the token usage information (prompt_tokens for input and completion_tokens for output) returned by the [COMPLETE (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/complete-snowflake-cortex.md) function. As part of the trace information, you can view the token usage and the corresponding costs associated with each LLM call.

#### Latency

Latency is determined by measuring the time taken to complete each function call in the application. Application traces provide granular visibility into the latency of each function instrumented using the TruLens SDK. Individual function latencies are aggregated to compute the overall latency of the entire application corresponding to each input. Each run also provides an average latency across all inputs for easy comparison across multiple application configurations.

## Runs

A run is an evaluation task used to measure the accuracy and performance of an application.
It helps you select the best application configuration. Building a generative AI application involves experimenting with various LLMs, prompts, and inference parameters.
You measure their accuracy, latency, and usage to find the optimal combination for production. Each combination corresponds to an application version.

A run uses the dataset that you specify to execute a batch evaluation for an application version.
You can trigger multiple runs with the same dataset for different versions. You can compare the aggregated and record-level differences between the versions to identify improvements that you need to make and select the best version to deploy.

Creating and executing a run involves four main steps:

1. **Creation**: After creating an application and a version, add a new run for the version by specifying a dataset.
2. **Invocation**: Start the run, which reads inputs from the dataset, invokes the application for each input, generates traces, and stores the information in your Snowflake account.
3. **Computation**: After invocation, trigger computation by specifying metrics to be computed. You can trigger multiple computations and add new metrics later for an existing run.
4. **Visualization**: Visualize the run results in Snowsight by logging into your Snowflake account. Runs are listed within their relevant applications in AI & ML under Evaluations.

You can label each run to categorize comparable runs between different application versions with the same dataset. Use the labels to manager and filter the runs.

A run can have one of the following statuses:

Run status

| Status | Description |
| --- | --- |
| CREATED | The run has been created but not started. |
| INVOCATION_IN_PROGRESS | The run invocation is in the process of generating the output and the traces. |
| INVOCATION_COMPLETED | The run invocation completed with all outputs and traces created. |
| INVOCATION_PARTIALLY_COMPLETED | The run invocation is partially completed due to failures in application invocation and trace generation. |
| COMPUTATION_IN_PROGRESS | The metric computation is in progress. |
| COMPLETED | The metric computation is completed with detailed outputs and traces. |
| PARTIALLY_COMPLETED | The run is partially completed due to failures during the metric computation. |
| CANCELLED | The run has been cancelled. |

## Access control and storage

### Required privileges

You need the following privileges to use AI Observability.

* To use AI Observability, your role must have the CORTEX_USER database role. The CORTEX_USER role is required for database functions. For information on granting and revoking this role, see [Cortex LLM privileges](../aisql.md).
* To register an application, your role must have CREATE EXTERNAL AGENT privileges on the schema. For more information, see Applications.
* To create and execute runs, your role must:

  + USAGE privileges on the EXTERNAL AGENT object created for the application
  + The CREATE TASK privilege on the schema where the application is registered.
  + The EXECUTE TASK global privilege to execute the task that runs the application.

    For more information, see Runs and Observability data.

The following example uses the ACCOUNTADMIN role to grant the user, `some_user`, the following:

* The CORTEX_USER database role
* The CREATE EXTERNAL AGENT privilege on the `app_schema` schema
* The CREATE TASK privilege on the `app_schema` schema
* The EXECUTE TASK global privilege

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE observability_user_role;

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE observability_user_role;

GRANT CREATE EXTERNAL AGENT ON SCHEMA app_schema TO ROLE observability_user_role;

GRANT CREATE TASK ON SCHEMA app_schema TO ROLE observability_user_role;

GRANT EXECUTE TASK ON ACCOUNT TO ROLE observability_user_role;

GRANT ROLE observability_user_role TO USER some_user;
```

### Applications

Creating an application for evaluation creates an EXTERNAL AGENT object to represent the application in Snowflake. The role required to create and modify an application must have the following access control requirements.

A role used to create an application must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | External Agent | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the GRANT OWNERSHIP command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| CREATE EXTERNAL AGENT | Schema |  |

The USAGE privilege on the parent database and schema are required to perform operations on any object in a schema.

Modifying and deleting the application require OWNERSHIP privileges on the EXTERNAL AGENT object.

If a user’s role has USAGE or OWNERSHIP privileges on an application (EXTERNAL AGENT), the application appears in Evaluations under AI & ML within Snowsight.

### Runs

A role used to add, modify or delete a run to an application must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | External Agent | USAGE or OWNERSHIP privilege on the EXTERNAL AGENT object to the role that created the object representing the application in Snowflake. |
| CREATE TASK | Schema | For information about the privileges required to create a task, see [Access control requirements](../../../sql-reference/sql/create-task.md). |
| EXECUTE TASK | Account | For information about the privileges required to execute a task, see [EXECUTE TASK](../../../sql-reference/sql/execute-task.md). |

Deleting a run deletes the metadata associated with the run. The records created as part of the run are not deleted and remain stored. Please see Observability Data for more information on storage of the records and traces.

For instructions on creating a custom role with a specified set of privileges, see Creating custom roles.
For general information about roles and privilege grants for performing SQL actions on securable objects, see [Overview of Access Control](../../security-access-control-overview.md).

> **Important:**
>
> AI observability data ingested into the event table cannot be modified. Administrators with the AI_OBSERVABILITY_ADMIN application role have exclusive access to delete the data in the SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS event table.

### LLMs as judges

AI Observability uses Cortex LLMs as judges to compute the metrics for evaluating your applications. To successfully compute these metrics, you need permissions to access Cortex LLMs.
To grant user roles access to Cortex LLMs, please see required privileges.
The user must have access to the model configured as the LLM judge.
The default model used for LLM judge is llama3.1-70b. The default LLM judge model is subject to change in the future.

### Observability data

AI Observability data represents records containing inputs, outputs, evaluation scores, and associated traces for your generative AI applications. All the records are stored in a dedicated events table AI_OBSERVABILITY_EVENTS in your account under SNOWFLAKE.LOCAL schema.

AI observability data ingested into the event table cannot be modified. Administrators with the AI_OBSERVABILITY_ADMIN application role have exclusive access to delete the data in the SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS event table.

AI observability data can be accessed using the Trulens Python SDK or using Snowsight. The following privileges are required to view the records for an application and associated runs:

* The user role must have the USAGE privilege on the EXTERNAL AGENT object that represents the application.

For example, to view the runs for an externally instrumented RAG application, the user role requires the USAGE privilege on “my-db.my-schema.rag-application1”, where rag-application1 is the EXTERNAL AGENT object that represents the external RAG application in Snowflake.

The metadata associated with runs and external agents (such as Run name, description, dataset name etc) are classified as metadata.

---
title: Snowflake Cortex AI Functions (including LLM functions)
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/aisql.md
section: Snowflake Cortex (AI & ML)
---

# Snowflake Cortex AI Functions (including LLM functions)

Use Cortex AI Functions in Snowflake to run unstructured analytics on text and images with industry-leading LLMs from OpenAI, Anthropic, Meta, Mistral AI, and DeepSeek.
AI Functions support use cases such as:

* Extracting entities to enrich metadata and streamline validation
* Aggregating insights across customer tickets
* Filtering and classifying content by natural language
* Sentiment and aspect-based analysis for service improvement
* Translating and localizing multilingual content
* Parsing documents for analytics and RAG pipelines

All models are fully hosted in Snowflake, ensuring performance, scalability, and governance while keeping your data secure and in place.

## Available functions

Snowflake Cortex features are provided as SQL functions and are also available in Python.
Cortex AI Functions can be grouped into the following categories:

* Cortex AI functions
* Helper functions

### Cortex AI functions

These task-specific functions are purpose-built managed functions that automate routine tasks, like simple summaries and
quick translations, that don’t require any customization.

* [AI_COMPLETE](../../sql-reference/functions/ai_complete.md): Generates a completion for a given text string or image using a selected LLM. Use this function for most generative AI tasks.

  + AI_COMPLETE is the updated version of [COMPLETE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/complete-snowflake-cortex.md).
* [AI_CLASSIFY](../../sql-reference/functions/ai_classify.md): Classifies text or images into user-defined categories.

  + AI_CLASSIFY is the updated version of [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](../../sql-reference/functions/classify_text-snowflake-cortex.md) with support for multi-label and image classification.
* [AI_FILTER](../../sql-reference/functions/ai_filter.md): Returns True or False for a given text or image input, allowing you to filter results in `SELECT`, `WHERE`, or `JOIN ... ON` clauses.
* [AI_AGG](../../sql-reference/functions/ai_agg.md): Aggregates a text column and returns insights across multiple rows based on a user-defined prompt. This function isn’t subject to context window limitations.
* [AI_EMBED](../../sql-reference/functions/ai_embed.md): Generates an embedding vector for a text or image input, which can be used for similarity search, clustering, and classification tasks.

  + AI_EMBED is the updated version of [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../../sql-reference/functions/embed_text_1024-snowflake-cortex.md).
* [AI_EXTRACT](../../sql-reference/functions/ai_extract.md): Extracts information from an input string or file, for example, text, images, and documents. Supports multiple languages.

  + AI_EXTRACT is the updated version of [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](../../sql-reference/functions/extract_answer-snowflake-cortex.md).
* [AI_SENTIMENT](../../sql-reference/functions/ai_sentiment.md): Extracts sentiment from text.

  + AI_SENTIMENT is the updated version of [SENTIMENT (SNOWFLAKE.CORTEX)](../../sql-reference/functions/sentiment-snowflake-cortex.md).
* [AI_SUMMARIZE_AGG](../../sql-reference/functions/ai_summarize_agg.md): Aggregates a text column and returns a summary across multiple rows. This function isn’t subject to context window limitations.
* [AI_SIMILARITY](../../sql-reference/functions/ai_similarity.md): Calculates the embedding similarity between two inputs.
* [AI_TRANSCRIBE](../../sql-reference/functions/ai_transcribe.md): Transcribes audio and video files stored in a stage, extracting text, timestamps, and speaker information.
* [AI_PARSE_DOCUMENT](../../sql-reference/functions/ai_parse_document.md): Extracts text (using OCR mode) or text with layout information
  (using LAYOUT mode) from documents in an internal or external stage. Can also extract images found in a document.

  + AI_PARSE_DOCUMENT is the updated version of [PARSE_DOCUMENT (SNOWFLAKE.CORTEX)](../../sql-reference/functions/parse_document-snowflake-cortex.md).
* [AI_REDACT](../../sql-reference/functions/ai_redact.md): Redact personally identifiable information (PII) from text.
* [AI_TRANSLATE](../../sql-reference/functions/ai_translate.md): Translates text between supported languages.

  + AI_TRANSLATE is the updated version of [TRANSLATE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/translate-snowflake-cortex.md).
* [SUMMARIZE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/summarize-snowflake-cortex.md): Returns a summary of the text that you’ve specified.

### Helper functions

Helper functions are purpose-built managed functions that reduce cases of failures when running other Cortex AI Functions, for example by
getting the count of tokens in an input prompt to ensure the call doesn’t exceed a model limit.

* [TO_FILE](../../sql-reference/functions/to_file.md): Creates a reference to a file in an internal or external stage for use with
  AI_COMPLETE and other functions that accept files.
* [AI_COUNT_TOKENS](../../sql-reference/functions/ai_count_tokens.md): Given an input text, returns the token count based on the model or Cortex
  function specified.

  + AI_COUNT_TOKENS is the updated version of [COUNT_TOKENS (SNOWFLAKE.CORTEX)](../../sql-reference/functions/count_tokens-snowflake-cortex.md).
* [PROMPT](../../sql-reference/functions/prompt.md): Helps you build prompt objects for use with AI_COMPLETE and other functions.
* [TRY_COMPLETE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/try_complete-snowflake-cortex.md): Works like the COMPLETE function, but returns NULL
  when the function could not execute instead of an error code.

### Cortex Guard

Cortex Guard is an option of the AI_COMPLETE (or SNOWFLAKE.CORTEX.COMPLETE) function designed to filter possible unsafe and harmful responses from a
language model. Cortex Guard is currently built with Meta’s Llama Guard 3. Cortex Guard works by evaluating the responses of a language
model before that output is returned to the application. Once you activate Cortex Guard, language model responses which may be associated
with violent crimes, hate, sexual content, self-harm, and more are automatically filtered. See
[COMPLETE arguments](../../sql-reference/functions/complete-snowflake-cortex.md) for syntax and examples.

> **Note:**
>
> Usage of Cortex Guard incurs compute charges based on the number of input tokens processed,
> in addition to the charges for the AI_COMPLETE function.

## Performance considerations

Cortex AI Functions are optimized for throughput. We recommend using these functions to process numerous inputs such as text from large SQL tables. Batch processing is typically better suited for AI Functions. For more interactive use cases where latency is important, use the REST API. These are available for simple inference (Complete API), embedding (Embed API) and agentic applications (Agents API).

## Cortex LLM privileges

This section describes the privileges required for users to access Snowflake Cortex AI Functions. It covers how to control and grant access to these functions using roles and account-level privileges.

### USE AI FUNCTIONS on the account privilege

> **Important:**
>
> Your users need both the USE AI FUNCTIONS account-level privilege and one of the CORTEX_USER or
> AI_FUNCTIONS_USER database roles to use Snowflake Cortex AI Functions.
> Because USE AI FUNCTIONS is granted to the PUBLIC role by default, no additional action is needed for this privilege
> unless it has been revoked.

The USE AI FUNCTIONS account-level privilege includes the privileges that allow your users to call Snowflake Cortex AI functions. By default, the USE AI FUNCTIONS privilege is granted to the PUBLIC role. The PUBLIC role is automatically granted to all users and roles, allowing all users in your account to use the Snowflake Cortex AI functions. If you don’t want all your users to have this privilege, you can revoke access to the PUBLIC role and grant access to other roles.

This section explains how to do the following :

* Revoke the USE AI FUNCTIONS privilege from the PUBLIC role
* Grant the USE AI FUNCTIONS privilege to specific roles

> **Important:**
>
> You must use the ACCOUNTADMIN role to manage the USE AI FUNCTIONS account-level privilege.

To revoke the USE AI FUNCTIONS account-level privilege from the PUBLIC role, run the following command:

```sqlexample
REVOKE USE AI FUNCTIONS ON ACCOUNT
FROM ROLE PUBLIC;
```

> **Note:**
>
> Revoking the USE AI FUNCTIONS account-level privilege prevents your users from accessing Snowflake Cortex AI Functions.
> Your users need **both** the USE AI FUNCTIONS account-level privilege and one of the CORTEX_USER or
> AI_FUNCTIONS_USER database roles to use Snowflake Cortex AI Functions.

After you’ve revoked the USE AI FUNCTIONS privilege from the PUBLIC role, you can use the ACCOUNTADMIN role to grant it to other roles in your Snowflake account.

The following example:

1. Grants the USE AI FUNCTIONS privilege to `cortex_user_role`.
2. Grants the `cortex_user_role` to `example_user`.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_user_role;

GRANT USE AI FUNCTIONS ON ACCOUNT TO ROLE cortex_user_role;

GRANT ROLE cortex_user_role TO USER example_user;
```

You can grant access to Snowflake Cortex AI Functions through roles that are commonly used by specific groups of users. For example, if you’ve created an `analyst` role that is used as a default role by analysts in your organization, you can grant these users access to Snowflake Cortex AI Functions with a single [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) statement. For more information about granting privileges to commonly used roles, see [User roles](../admin-user-management.md).

```sqlexample
GRANT USE AI FUNCTIONS ON ACCOUNT TO ROLE analyst;
```

> **Important:**
>
> Currently, USE AI FUNCTIONS does not apply to AI Function queries that are run inside Snowflake native applications. A query with AI Function calls runs successfully regardless of whether the role has USE AI FUNCTIONS privilege.

### Using AI Functions with Restricted Caller’s Rights

To use AI Functions with Restricted Caller’s Rights, you must grant the USE AI FUNCTIONS privilege to both the session role and the service or application owner role.

For example, to use AI Functions inside a Snowflake Park Container Services (SPCS) service that runs with Restricted Caller’s Rights:

1. Grant the USE AI FUNCTIONS privilege to the role used in the SPCS session (for example, `CHATBOT_USER_ROLE`):

   ```sqlexample
   GRANT USE AI FUNCTIONS ON ACCOUNT TO ROLE CHATBOT_USER_ROLE;
   ```
2. Grant the caller version of the privilege to the service owner role:

   ```sqlexample
   GRANT CALLER USE AI FUNCTIONS ON ACCOUNT TO ROLE <service_owner_role>;
   ```

### CORTEX_USER database role

The CORTEX_USER database role in the SNOWFLAKE database includes the privileges that allow users to call Snowflake
Cortex AI Functions. By default, the CORTEX_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted
to all users and roles, so this allows all users in your account to use the Snowflake Cortex AI functions.

If you don’t want all users to have this privilege, you can revoke access to the PUBLIC role and grant access to other roles.
The SNOWFLAKE.CORTEX_USER database role cannot be granted directly to a user. For more information, see
[Using SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md).

To revoke the CORTEX_USER database role from the PUBLIC role, run the following commands using the ACCOUNTADMIN role:

```sqlexample
REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER
  FROM ROLE PUBLIC;

REVOKE IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE
  FROM ROLE PUBLIC;
```

You can then selectively provide access to specific roles. A user with the ACCOUNTADMIN role can grant this role to a custom role in
order to allow users to access Cortex AI functions. In the following example, use the ACCOUNTADMIN role and grant the user `some_user`
the CORTEX_USER database role via the account role `cortex_user_role`, which you create for this purpose.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE cortex_user_role;

GRANT ROLE cortex_user_role TO USER some_user;
```

You can also grant access to Snowflake Cortex AI functions through existing roles commonly used by specific groups of
users. (See [User roles](../admin-user-management.md).) For example, if you have created an `analyst` role that is used
as a default role by analysts in your organization, you can easily grant these users access to Snowflake Cortex AI
Functions with a single GRANT statement.

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE analyst;
```

### AI_FUNCTIONS_USER database role

The AI_FUNCTIONS_USER database role in the SNOWFLAKE database allows users to call Snowflake Cortex
scalar AI functions (all Cortex AI functions except the aggregate functions
AI_AGG and AI_SUMMARIZE_AGG) without granting access to Cortex services such as Cortex Agent, Cortex Analyst,
Cortex Fine-tuning, or Cortex Search.

> **Important:**
>
> Your users need both the USE AI FUNCTIONS account-level privilege plus one of CORTEX_USER and AI_FUNCTIONS_USER
> database role to call Snowflake Cortex AI functions. Because USE AI FUNCTIONS is granted to the PUBLIC role by
> default, no additional action is needed for this privilege unless it has been revoked.

AI_FUNCTIONS_USER role is not granted to the PUBLIC role by default. Accountadmin must
explicitly grant this role to roles that require access to AI functions. The AI_FUNCTIONS_USER database role
cannot be granted directly to users but must be granted to roles that users can assume. For more information, see
[Using SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md).

The following example creates a custom role, grants the AI_FUNCTIONS_USER database role to it, and assigns the role
to a user.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE analyst_rl;
GRANT DATABASE ROLE SNOWFLAKE.AI_FUNCTIONS_USER TO ROLE analyst_rl;

GRANT ROLE analyst_rl TO USER some_user;
```

Alternatively, to give all users access to scalar AI function capabilities, grant the AI_FUNCTIONS_USER role to the
PUBLIC role.

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT DATABASE ROLE SNOWFLAKE.AI_FUNCTIONS_USER TO ROLE PUBLIC;
```

### CORTEX_EMBED_USER database role

The CORTEX_EMBED_USER database role in the SNOWFLAKE database includes the privileges that allow users to call the text
embedding functions AI_EMBED, EMBED_TEXT_768, and EMBED_TEXT_1024 and to create Cortex Search Services with managed
vector embeddings. CORTEX_EMBED_USER allows you to grant embedding privileges separately from other Cortex AI capabilities.

> **Note:**
>
> You can create Cortex Search Services with user-provided embeddings without the CORTEX_EMBED_USER role. In that
> case, you must generate the embeddings yourself, outside of Snowflake, and load them into a table.

Unlike the CORTEX_USER role, the CORTEX_EMBED_USER role is not granted to the PUBLIC role by default. You must
explicitly grant this role to roles that require embedding capabilities if you have revoked the CORTEX_USER role. The
CORTEX_EMBED_USER database role cannot be granted directly to users but must be granted to roles that users can assume.
The following example illustrates this process.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_embed_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_EMBED_USER TO ROLE cortex_embed_user_role;

GRANT ROLE cortex_embed_user_role TO USER some_user;
```

Alternatively, to give all users access to embedding capabilities, grant the CORTEX_EMBED_USER role to the PUBLIC role as follows.

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT DATABASE ROLE SNOWFLAKE.CORTEX_EMBED_USER TO ROLE PUBLIC;
```

### Using AI Functions in stored procedures with EXECUTE AS RESTRICTED CALLER

To use AI Functions inside stored procedures with `EXECUTE AS RESTRICTED CALLER`, grant the following privileges to the role that created the stored procedure:

```sqlexample
GRANT INHERITED CALLER USAGE ON ALL SCHEMAS IN DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;
GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;
GRANT CALLER USAGE ON DATABASE snowflake TO ROLE <role_that_created_the_stored_procedure>;
```

## Control model access

Snowflake Cortex provides two independent mechanisms to enforce access to models:

* Account-level allowlist parameter (simple, broad control)
* Role-based access control (RBAC) (fine-grained control)

You can use the account-level allowlist to control model access across your entire account, or you can use RBAC to control model access on a per-role basis.
For maximum flexibility, you can also use both mechanisms together, if you can accept additional management complexity.

### Account-level allowlist parameter

You can control model access across your entire account using the CORTEX_MODELS_ALLOWLIST parameter. Supported features respect the value of this parameter and prevent use of models that are not in the allowlist.

The CORTEX_MODELS_ALLOWLIST parameter can be set to `'All'`, `'None'`, or to a comma-separated list
of model names. This parameter can only be set at the account level, not at the user or session levels. Only the
ACCOUNTADMIN role can set the parameter using the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command.

Examples:

* To allow access to all models:

  ```sqlexample
  ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'All';
  ```
* To allow access to the `mistral-large2` and `llama3.1-70b` models:

  ```sqlexample
  ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'mistral-large2,llama3.1-70b';
  ```
* To prevent access to any model:

  ```sqlexample
  ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'None';
  ```

Use RBAC, as described in the following section, to provide specific roles with access beyond what you’ve specified in the allowlist.

### Role-based access control (RBAC)

Although Cortex models are not themselves Snowflake objects, Snowflake lets you create model objects in the SNOWFLAKE.MODELS schema that *represent* the Cortex models. By applying RBAC to these objects, you can control access to models the same way you would any other Snowflake object. Supported features accept the identifiers of objects in SNOWFLAKE.MODELS wherever a model can be specified.

> **Tip:**
>
> To use RBAC exclusively, set CORTEX_MODELS_ALLOWLIST to `'None'`.

#### Refresh model objects and application roles

SNOWFLAKE.MODELS is not automatically populated with the objects that represent Cortex models. You must create these
objects when you first set up model RBAC, and refresh them when you want to apply RBAC to new models.

As ACCOUNTADMIN, run the SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH stored procedure to populate the SNOWFLAKE.MODELS
schema with objects representing currently available Cortex models, and to create application roles that correspond to
the models. The procedure also creates CORTEX-MODEL-ROLE-ALL, a role that covers all models.

> **Tip:**
>
> You can safely call CORTEX_BASE_MODELS_REFRESH at any time; it will not create duplicate objects or roles.

```sqlexample
CALL SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH();
```

After refreshing the model objects, you can verify that the models appear in the SNOWFLAKE.MODELS schema as follows:

```sqlexample
SHOW MODELS IN SNOWFLAKE.MODELS;
```

The returned list of models resembles the following:

| created_on | name | model_type | database_name | schema_name | owner |
| --- | --- | --- | --- | --- | --- |
| 2025-04-22 09:35:38.558 -0700 | CLAUDE-4-5-SONNET | CORTEX_BASE | SNOWFLAKE | MODELS | SNOWFLAKE |
| 2025-04-22 09:36:16.793 -0700 | LLAMA3.1-405B | CORTEX_BASE | SNOWFLAKE | MODELS | SNOWFLAKE |
| 2025-04-22 09:37:18.692 -0700 | OPENAI-GPT-5.2 | CORTEX_BASE | SNOWFLAKE | MODELS | SNOWFLAKE |

To verify that you can see the application roles associated with these models, use the SHOW APPLICATION ROLES command, as in the following example:

```sqlexample
SHOW APPLICATION ROLES IN APPLICATION SNOWFLAKE;
```

The list of application roles resembles the following:

| created_on | name | owner | comment | owner_role_type |
| --- | --- | --- | --- | --- |
| 2025-04-22 09:35:38.558 -0700 | CORTEX-MODEL-ROLE-ALL | SNOWFLAKE | MODELS | APPLICATION |
| 2025-04-22 09:36:16.793 -0700 | CORTEX-MODEL-ROLE-LLAMA3.1-405B | SNOWFLAKE | MODELS | APPLICATION |
| 2025-04-22 09:37:18.692 -0700 | CORTEX-MODEL-ROLE-SNOWFLAKE-ARCTIC | SNOWFLAKE | MODELS | APPLICATION |

#### Grant application roles to user roles

After you create the model objects and application roles, you can grant the application roles to specific user roles in your account.

* To grant a role access to a specific model:

  ```sqlexample
  GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-LLAMA3.1-70B" TO ROLE MY_ROLE;
  ```
* To grant a role access to all current and future models:

  ```sqlexample
  GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-ALL" TO ROLE MY_ROLE;
  ```

#### Use model objects with supported features

To use model objects with supported Cortex features, specify the identifier of the model object in SNOWFLAKE.MODELS as the model argument.
You can use a fully-qualified identifier, a partial identifier, or a simple model name that will be automatically resolved to SNOWFLAKE.MODELS.

* Using a fully-qualified identifier:

  ```sqlexample
  SELECT AI_COMPLETE('SNOWFLAKE.MODELS."LLAMA3.1-70B"', 'Hello');
  ```
* Using a partial identifier:

  ```sqlexample
  USE DATABASE SNOWFLAKE;
  USE SCHEMA MODELS;
  SELECT AI_COMPLETE('LLAMA3.1-70B', 'Hello');
  ```
* Using automatic lookup with a simple model name:

  ```sqlexample
  -- Automatically resolves to SNOWFLAKE.MODELS."LLAMA3.1-70B"
  SELECT AI_COMPLETE('llama3.1-70b', 'Hello');
  ```

#### Using RBAC on the account allowlist

A number of Cortex features accept a model name as a string argument, for example `AI_COMPLETE('model', 'prompt')`. When you provide a model name:

1. Cortex first attempts to locate a matching model object in SNOWFLAKE.MODELS. If you provide an unqualified name like `'x'`, it automatically looks for `SNOWFLAKE.MODELS."X"`.
2. If the model object is found, RBAC is applied to determine whether the user can use the model.
3. If no model object is found, the provided string is matched against the account-level allowlist.

The following example illustrates the use of allowlist and RBAC together. In this example, the allowlist is set to allow the `mistral-large2` model, and the user has access to the `LLAMA3.1-70B` model object through RBAC.

```sqlexample
-- set up access
USE SECONDARY ROLES NONE;
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'MISTRAL-LARGE2';
CALL SNOWFLAKE.MODELS.CORTEX_BASE_MODELS_REFRESH();
GRANT APPLICATION ROLE SNOWFLAKE."CORTEX-MODEL-ROLE-LLAMA3.1-70B" TO ROLE PUBLIC;

-- test access
USE ROLE PUBLIC;

-- this succeeds because mistral-large2 is in the allowlist
SELECT AI_COMPLETE('MISTRAL-LARGE2', 'Hello');

-- this succeeds because the role has access to the model object
SELECT AI_COMPLETE('SNOWFLAKE.MODELS."LLAMA3.1-70B"', 'Hello');

-- this fails because the first argument is
-- neither an identifier for an accessible model object
-- nor is it a model name in the allowlist
SELECT AI_COMPLETE('claude-sonnet-4-6', 'Hello');
```

### Common pitfalls

* Access to a model (whether by allowlist or RBAC) does not always mean that it can be used. It may still be subject to
  cross-region, deprecation, or other availability constraints. These restrictions can result in error messages that
  seem similar to model access errors.
* Model access controls only govern the use of a model and not the use of a feature itself. A feature can have its own access
  controls. For example, access to `AI_COMPLETE` is governed by the `CORTEX_USER` or `AI_FUNCTIONS_USER` database role and the USE AI FUNCTIONS account-level privilege. For more information, see
  Cortex LLM privileges.
* Not all features support model access controls. For more information about what a feature supports, see the supported features table.
* Secondary roles can obscure permissions. For example, if a user has ACCOUNTADMIN as a secondary role, all model objects may appear
  accessible. Disable secondary roles temporarily when verifying permissions.
* Qualified model object identifiers are quoted and therefore case-sensitive. For more information, see
  [QUOTED_IDENTIFIERS_IGNORE_CASE](../../sql-reference/parameters.md).

### Supported features

Model access controls are supported by the following features:

| Feature | Account-level allowlist | Role-based access control | Notes |
| --- | --- | --- | --- |
| [AI_COMPLETE](../../sql-reference/functions/ai_complete.md) | ✔ | ✔ |  |
| [AI_CLASSIFY](../../sql-reference/functions/ai_classify.md) | ✔ | ✔ | If the model powering this function is not allowed, the error message contains information about how to modify the allowlist. |
| [AI_FILTER](../../sql-reference/functions/ai_filter.md) | ✔ | ✔ | If the model powering this function is not allowed, the error message contains information about how to modify the allowlist. |
| [AI_AGG](../../sql-reference/functions/ai_agg.md) | ✔ | ✔ | If the model powering this function is not allowed, the error message contains information about how to modify the allowlist. |
| [AI_SUMMARIZE_AGG](../../sql-reference/functions/ai_summarize_agg.md) | ✔ | ✔ | If the model powering this function is not allowed, the error message contains information about how to modify the allowlist. |
| [COMPLETE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/complete-snowflake-cortex.md) | ✔ | ✔ |  |
| [TRY_COMPLETE (SNOWFLAKE.CORTEX)](../../sql-reference/functions/try_complete-snowflake-cortex.md) | ✔ | ✔ |  |
| [Cortex REST API](cortex-rest-api.md) | ✔ | ✔ |  |
| [Cortex Playground](cortex-playground.md) | ✔ | ✔ |  |

## Regional availability

Snowflake Cortex AI functions are available in the following regions. If your region is not listed for a particular function,
use [cross-region inference](cross-region-inference.md).

> **Note:**
>
> * The TRY_COMPLETE function is available in the same regions as COMPLETE.
> * The AI_COUNT_TOKENS function is available in all regions for any model, but the models themselves are available only in the regions specified in the tables below.

Cross-RegionNorth AmericaEuropeAsia-Pacific

The following functions and models are available in any region via [cross-region inference](cross-region-inference.md).

| Function  Model | Cross Cloud (Any Region) | AWS US  (Cross-Region) | AWS US Commercial Gov  (Cross-Region) | AWS EU  (Cross-Region) | AWS APJ  (Cross-Region) | AWS AU  (Cross-Region) | Azure US  (Cross-Region) | Azure EU  (Cross-Region) | Google Cloud US  (Cross-Region) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| AI_COMPLETE |  |  |  |  |  |  |  |  |  |
| `claude-sonnet-4-6` | ✔ | ✔ |  | ✔ | ✔ |  |  |  |  |
| `claude-opus-4-6` | ✔ | ✔ |  | ✔ |  | ✔ |  |  |  |
| `claude-sonnet-4-5` | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |  |
| `claude-opus-4-5` | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| `claude-haiku-4-5` | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |  |
| `claude-4-sonnet` | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |  |
| `claude-3-7-sonnet` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `gemini-3.1-pro` | \* |  |  |  |  |  |  |  |  |
| `llama4-maverick` | ✔ | ✔ |  |  |  |  |  |  |  |
| `llama4-scout` | ✔ | ✔ |  |  |  |  |  |  |  |
| `llama3.1-8b` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |
| `llama3.1-70b` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |
| `llama3.3-70b` | ✔ | ✔ |  |  |  |  |  |  |  |
| `snowflake-llama-3.3-70b` | ✔ | ✔ |  |  |  |  |  |  |  |
| `llama3.1-405b` | ✔ | ✔ | ✔ |  |  |  | ✔ |  |  |
| `openai-gpt-5.2` | ✔ |  |  |  |  |  | ✔ |  |  |
| `openai-gpt-5.1` | ✔ |  |  |  |  |  | ✔ | ✔ |  |
| `openai-gpt-5` | \* |  |  |  |  |  | \* | \* |  |
| `openai-gpt-5-mini` | \* |  |  |  |  |  | \* |  |  |
| `openai-gpt-5-nano` | \* |  |  |  |  |  | \* |  |  |
| `openai-gpt-4.1` | ✔ |  |  |  |  |  | ✔ |  |  |
| `openai-gpt-oss-120b` | \* |  |  |  |  |  |  |  |  |
| `openai-gpt-oss-20b` | \* | \* |  |  |  |  |  |  |  |
| `snowflake-llama-3.1-405b` | ✔ | ✔ | ✔ |  |  |  |  |  |  |
| `snowflake-arctic` | ✔ | ✔ |  |  |  | ✔ |  |  |  |
| `deepseek-r1` | ✔ | ✔ |  |  |  |  |  |  |  |
| `mistral-large2` | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| `mixtral-8x7b` | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| `mistral-7b` | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
|  |  |  |  |  |  |  |  |  |  |
| EMBED_TEXT_768 |  |  |  |  |  |  |  |  |  |
| `e5-base-v2` | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| `snowflake-arctic-embed-m` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
|  |  |  |  |  |  |  |  |  |  |
| EMBED_TEXT_1024 |  |  |  |  |  |  |  |  |  |
| `snowflake-arctic-embed-l-v2.0` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
| `snowflake-arctic-embed-l-v2.0-8k` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
| `nv-embed-qa-4` | ✔ | ✔ |  |  |  |  |  |  |  |
| `multilingual-e5-large` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
| `voyage-multilingual-2` | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
|  |  |  |  |  |  |  |  |  |  |
| AI_CLASSIFY TEXT | ✔ | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_CLASSIFY IMAGE | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_EXTRACT | ✔ | ✔ |  | ✔ | ✔ | ✔ | ✔ |  |  |
| AI_FILTER TEXT | ✔ | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_FILTER IMAGE | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_AGG |  | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_REDACT | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_SENTIMENT | ✔ | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_SIMILARITY TEXT | ✔ | ✔ |  | ✔ | ✔ | ✔ |  |  |  |
| AI_SIMILARITY IMAGE | ✔ | ✔ |  | ✔ |  |  |  | ✔ |  |
| AI_SUMMARIZE_AGG | ✔ | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_TRANSCRIBE | ✔ | ✔ |  | ✔ |  | ✔ |  |  |  |
| SENTIMENT | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| ENTITY_SENTIMENT | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| EXTRACT_ANSWER | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |  |  |  |
| SUMMARIZE | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| TRANSLATE | ✔ | ✔ | ✔ | ✔ | ✔ |  | ✔ | ✔ |  |
| AI_TRANSLATE | ✔ | ✔ |  | ✔ | ✔ |  | ✔ | ✔ |  |

The following functions and models are available natively in North American regions.

| Function  Model | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS US East  (Commercial Gov - N. Virginia) | Azure East US 2  (Virginia) | Azure East US  (Virginia) | Azure West US  (Washington) | Azure West US 3  (Arizona) | Azure North Central US  (Illinois) | Azure South Central US  (Texas) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| AI_COMPLETE |  |  |  |  |  |  |  |  |  |
| `claude-4-sonnet` |  |  |  |  |  |  |  |  |  |
| `claude-3-7-sonnet` |  |  |  |  |  |  |  |  |  |
| `llama4-maverick` | ✔ |  |  |  |  |  |  |  |  |
| `llama4-scout` | ✔ |  |  |  |  |  |  |  |  |
| `llama3.1-8b` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `llama3.1-70b` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `llama3.3-70b` | ✔ |  |  |  |  |  |  |  |  |
| `snowflake-llama-3.3-70b` | ✔ |  |  |  |  |  |  |  |  |
| `llama3.1-405b` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `openai-gpt-4.1` |  |  |  | ✔ |  |  |  |  |  |
| `openai-gpt-oss-120b` | \* |  |  |  |  |  |  |  |  |
| `openai-gpt-oss-20b` | \* |  |  | \* |  |  |  |  |  |
| `snowflake-llama-3.1-405b` | ✔ |  |  |  |  |  |  |  |  |
| `snowflake-arctic` | ✔ |  |  | ✔ |  |  |  |  |  |
| `deepseek-r1` | ✔ |  |  |  |  |  |  |  |  |
| `mistral-large2` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `mixtral-8x7b` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `mistral-7b` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |
| EMBED_TEXT_768 |  |  |  |  |  |  |  |  |  |
| `e5-base-v2` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `snowflake-arctic-embed-m` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |
| EMBED_TEXT_1024 |  |  |  |  |  |  |  |  |  |
| `snowflake-arctic-embed-l-v2.0` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `snowflake-arctic-embed-l-v2.0-8k` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `nv-embed-qa-4` | ✔ |  |  |  |  |  |  |  |  |
| `multilingual-e5-large` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| `voyage-multilingual-2` | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
|  |  |  |  |  |  |  |  |  |  |
| AI_CLASSIFY TEXT | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_CLASSIFY IMAGE | ✔ | ✔ |  |  |  |  |  |  |  |
| AI_EXTRACT | ✔ | ✔ |  |  | ✔ | ✔ |  |  | ✔ |
| AI_FILTER TEXT | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_FILTER IMAGE | ✔ | ✔ |  |  |  |  |  |  |  |
| AI_AGG | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_REDACT | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| AI_SIMILARITY TEXT | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_SIMILARITY IMAGE | ✔ | ✔ |  |  |  |  |  |  |  |
| AI_SUMMARIZE_AGG | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| AI_TRANSCRIBE | ✔ | ✔ |  | ✔ |  |  |  |  |  |
| SENTIMENT | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| ENTITY_SENTIMENT | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| EXTRACT_ANSWER | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| SUMMARIZE | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |
| TRANSLATE | ✔ | ✔ | ✔ | ✔ |  |  |  |  |  |

The following functions and models are available natively in European regions.

| Function  Model | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | Azure West Europe  (Netherlands) |
| --- | --- | --- | --- |
| AI_COMPLETE |  |  |  |
| `claude-4-sonnet` |  |  |  |
| `claude-3-7-sonnet` |  |  |  |
| `llama4-maverick` |  |  |  |
| `llama4-scout` |  |  |  |
| `llama3.1-8b` | ✔ | ✔ | ✔ |
| `llama3.1-70b` | ✔ | ✔ | ✔ |
| `llama3.3-70b` |  |  |  |
| `snowflake-llama-3.3-70b` |  |  |  |
| `llama3.1-405b` |  |  |  |
| `openai-gpt-4.1` |  |  |  |
| `openai-gpt-oss-120b` |  |  |  |
| `openai-gpt-oss-20b` |  |  |  |
| `snowflake-llama-3.1-405b` |  |  |  |
| `snowflake-arctic` |  |  |  |
| `deepseek-r1` |  |  |  |
| `mistral-large2` | ✔ | ✔ | ✔ |
| `mixtral-8x7b` | ✔ | ✔ | ✔ |
| `mistral-7b` | ✔ | ✔ | ✔ |
|  |  |  |  |
| EMBED_TEXT_768 |  |  |  |
| `e5-base-v2` | ✔ |  | ✔ |
| `snowflake-arctic-embed-m` | ✔ | ✔ | ✔ |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ | ✔ |
|  |  |  |  |
| EMBED_TEXT_1024 |  |  |  |
| `snowflake-arctic-embed-l-v2.0` | ✔ | ✔ | ✔ |
| `snowflake-arctic-embed-l-v2.0-8k` | ✔ | ✔ | ✔ |
| `nv-embed-qa-4` |  |  |  |
| `multilingual-e5-large` | ✔ | ✔ | ✔ |
| `voyage-multilingual-2` | ✔ | ✔ | ✔ |
|  |  |  |  |
| AI_CLASSIFY TEXT | ✔ | ✔ | ✔ |
| AI_CLASSIFY IMAGE | ✔ |  |  |
| AI_EXTRACT | ✔ | ✔ | ✔ |
| AI_FILTER TEXT | ✔ | ✔ | ✔ |
| AI_FILTER IMAGE | ✔ |  |  |
| AI_AGG | ✔ | ✔ | ✔ |
| AI_REDACT | ✔ | ✔ | ✔ |
| AI_SIMILARITY TEXT | ✔ | ✔ | ✔ |
| AI_SIMILARITY IMAGE | ✔ |  |  |
| AI_SUMMARIZE_AGG | ✔ | ✔ | ✔ |
| AI_TRANSCRIBE | ✔ |  |  |
| SENTIMENT | ✔ | ✔ | ✔ |
| ENTITY_SENTIMENT | ✔ |  | ✔ |
| EXTRACT_ANSWER | ✔ | ✔ | ✔ |
| SUMMARIZE | ✔ | ✔ | ✔ |
| TRANSLATE | ✔ | ✔ | ✔ |

The following functions and models are available natively in Asia-Pacific regions:

| Function  | Model | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) |
| --- | --- | --- |
| AI_COMPLETE |  |  |
| `claude-4-sonnet` |  |  |
| `claude-3-7-sonnet` |  |  |
| `llama4-maverick` |  |  |
| `llama4-scout` |  |  |
| `llama3.1-8b` | ✔ | ✔ |
| `llama3.1-70b` | ✔ | ✔ |
| `llama3.3-70b` |  |  |
| `snowflake-llama-3.3-70b` |  |  |
| `llama3.1-405b` |  |  |
| `openai-gpt-4.1` |  |  |
| `snowflake-llama-3.1-405b` |  |  |
| `snowflake-arctic` |  |  |
| `deepseek-r1` |  |  |
| `mistral-large2` | ✔ | ✔ |
| `mixtral-8x7b` | ✔ | ✔ |
| `mistral-7b` | ✔ | ✔ |
|  |  |  |
| EMBED_TEXT_768 |  |  |
| `e5-base-v2` | ✔ | ✔ |
| `snowflake-arctic-embed-m` | ✔ | ✔ |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ |
|  |  |  |
| EMBED_TEXT_1024 |  |  |
| `snowflake-arctic-embed-l-v2.0` | ✔ | ✔ |
| `snowflake-arctic-embed-l-v2.0-8k` | ✔ | ✔ |
| `nv-embed-qa-4` |  |  |
| `multilingual-e5-large` | ✔ | ✔ |
| `voyage-multilingual-2` | ✔ | ✔ |
|  |  |  |
| AI_EXTRACT | ✔ | ✔ |
| AI_CLASSIFY TEXT | ✔ | ✔ |
| AI_CLASSIFY IMAGE |  |  |
| AI_FILTER TEXT | ✔ | ✔ |
| AI_FILTER IMAGE |  |  |
| AI_AGG | ✔ | ✔ |
| AI_SIMILARITY TEXT | ✔ | ✔ |
| AI_SIMILARITY IMAGE |  |  |
| AI_SUMMARIZE_AGG | ✔ | ✔ |
| AI_TRANSCRIBE |  |  |
| EXTRACT_ANSWER | ✔ | ✔ |
| SENTIMENT | ✔ | ✔ |
| ENTITY_SENTIMENT |  | ✔ |
| SUMMARIZE | ✔ | ✔ |
| TRANSLATE | ✔ | ✔ |

**\*** Indicates a preview function or model. Preview features are not suitable for production workloads.

The following Snowflake Cortex AI functions and models are available in the following extended regions.

| Function  Model | AWS US East 2  (Ohio) | AWS CA Central 1  (Central) | AWS SA East 1  (São Paulo) | AWS Europe West 2  (London) | AWS Europe Central 1  (Frankfurt) | AWS Europe North 1  (Stockholm) | AWS AP Northeast 1  (Tokyo) | AWS AP South 1  (Mumbai) | AWS AP Southeast 2  (Sydney) | AWS AP Southeast 3  (Jakarta) | Azure South Central US  (Texas) | Azure West US 2  (Washington) | Azure UK South  (London) | Azure North Europe  (Ireland) | Azure Switzerland North  (Zürich) | Azure Central India  (Pune) | Azure Japan East  (Tokyo, Saitama) | Azure Southeast Asia  (Singapore) | Azure Australia East  (New South Wales) | Google Cloud Europe West 2  (London) | Google Cloud Europe West 4  (Netherlands) | Google Cloud US Central 1  (Iowa) | Google Cloud US East 4  (N. Virginia) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| EMBED_TEXT_768 |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| `snowflake-arctic-embed-m` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| EMBED_TEXT_1024 |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |
| `multilingual-e5-large` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| AI_EXTRACT | ✔ | ✔ | ✔ | ✔ | ✔ | Cross-region only | ✔ | Cross-region only | ✔ | Cross-region only | ✔ | ✔ | Cross-region only | ✔ | Cross-region only | ✔ | ✔ | ✔ | ✔ | Cross-region only | Cross-region only | Cross-region only | Cross-region only |

The following table lists availability of legacy models. These models have not been deprecated and can still be used.
However, Snowflake recommends newer models for new development.

Legacy

| Function  (Model) | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| AI_COMPLETE |  |  |  |  |  |  |  |  |
| `llama3-8b` | ✔ | ✔ | ✔ |  | ✔ | ✔ | ✔ |  |
| `llama3-70b` | ✔ | ✔ | ✔ |  |  | ✔ | ✔ |  |
| `mistral-large` | ✔ | ✔ | ✔ |  |  |  | ✔ | ✔ |
| `openai-o4-mini` |  |  |  |  |  |  | ✔ |  |

## Create stage for media files

Cortex AI Functions that process media files (documents, images, audio, or video) require the files to be stored on an
internal or external stage. The stage must use server-side encryption. If you want to be able to query the stage or
programmatically process all the files stored there, the stage must have a directory table.

The SQL below creates a suitable internal stage:

```sqlexample
CREATE OR REPLACE STAGE input_stage
  DIRECTORY = ( ENABLE = true )
  ENCRYPTION = ( TYPE = 'SNOWFLAKE_SSE' );
```

To process files from external object storage (e.g., Amazon S3), create a storage integration, then create an external stage that uses the storage integration. To learn how to configure a Snowflake Storage Integration, see our detailed guides:

* [Amazon S3 storage integration](../data-load-s3-config-storage-integration.md)
* [Azure container integration](../data-load-azure-config.md)
* [Google Cloud Storage integration](../data-load-gcs-config.md)

Create an external stage that references the integration and points to your cloud storage container. This example points to an Amazon S3 bucket:

```sqlexample
CREATE OR REPLACE STAGE my_aisql_media_files
  STORAGE_INTEGRATION = my_s3_integration
  URL = 's3://my_bucket/prefix/'
  DIRECTORY = ( ENABLE = TRUE )
  ENCRYPTION = ( TYPE = 'AWS_SSE_S3' );
```

With an internal or external stage created, and files stored there, you can use Cortex AI Functions to process media files
stored in the stage. For more information, see:

* [AI Functions – Images](ai-images.md)
* [AI Functions – Audio](ai-audio.md) (also video)
* [AI Functions – Document Parsing](parse-document.md)

> **Note:**
>
> AI Functions are currently incompatible with custom [network policies](../network-policies.md).

### Cortex AI Functions storage best practices

You may find the following best practices helpful when working with media files in stages with Cortex AI Functions:

* Establish a scheme for organizing media files in stages. For example, create a separate stage for each team or
  project, and store the different types of media files in subdirectories.
* Enable directory listings on stages to allow querying and programmatic access to its files.

  > **Tip:**
  >
  > To automatically refresh the directory table for the external stage when new or updated files are available, set
  > AUTO_REFRESH = TRUE when creating the stage.
* For external stages, use fine-grained policies on the cloud provider side (for example, AWS IAM policies)
  to restrict the storage integration’s access to only what is necessary.
* Always use encryption, such as AWS_SSE or SNOWFLAKE_SSE, to protect your data at rest.

## Cost considerations

Snowflake Cortex AI functions incur compute cost based on the number of tokens processed. Refer to the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for each function’s cost in credits per million tokens.

A token is the smallest unit of text processed by Snowflake Cortex AI functions. An industry convention for text is that a token is approximately equal to four
characters, although this can vary by model, as can token equivalence for media files.

* For functions that generate new text using provided text (AI_COMPLETE, AI_CLASSIFY, AI_FILTER, AI_AGG, AI_SUMMARIZE, and
  AI_TRANSLATE, and their previous versions in the SNOWFLAKE.CORTEX schema), both input and output tokens are billable.
* For Cortex Guard, only input tokens are counted. The number of input tokens is based on the number of tokens output from AI_COMPLETE (or COMPLETE).
  Cortex Guard usage is billed in addition to the cost of the AI_COMPLETE (or COMPLETE) function.
* For AI_SIMILARITY, AI_EMBED, and the SNOWFLAKE.CORTEX.EMBED_\* functions, only input tokens are counted.
* For EXTRACT_ANSWER, the number of billable tokens is the sum of the number of tokens in the `from_text` and
  `question` fields.
* AI_CLASSIFY, AI_FILTER, AI_AGG, AI_SENTIMENT, AI_SUMMARIZE_AGG, SUMMARIZE, TRANSLATE, AI_TRANSLATE, EXTRACT_ANSWER,
  ENTITY_SENTIMENT, and SENTIMENT add a prompt to the input text in order to generate the response. As a result, the
  billed token count is higher than the number of tokens in the text you provide.
* AI_CLASSIFY labels, descriptions, and examples are counted as input tokens for each record processed, not just once for each AI_CLASSIFY call.
* For AI_PARSE_DOCUMENT (or SNOWFLAKE.CORTEX.PARSE_DOCUMENT), billing is based on the number of document pages processed.
* TRY_COMPLETE (SNOWFLAKE.CORTEX) does not incur costs for error handling. If the TRY_COMPLETE(SNOWFLAKE.CORTEX) function returns NULL, no cost
  is incurred.
* For AI_EXTRACT, both input and output tokens are counted. The `responseFormat` argument is counted as input tokens.
  For document formats consisting of pages, the number of pages processed is counted as input tokens. Each page in a document is counted as 970 tokens.
* AI_COUNT_TOKENS incurs only compute cost to run the function. No additional token-based costs are incurred.

For models that support media files such as images or audio:

* Audio files are billed at 50 tokens per second of audio.
* The token equivalence of images is determined by the model used. For more information, see
  [AI Image cost considerations](ai-images.md).

Snowflake recommends executing queries that call a Snowflake Cortex AI Function with a smaller
warehouse (no larger than MEDIUM). Larger warehouses do not increase performance. The cost associated with keeping a warehouse active
continues to apply when executing a query that calls a Snowflake Cortex LLM Function. For general information on
compute costs, see [Understanding compute cost](../cost-understanding-compute.md).

### Warehouse sizing

Snowflake recommends using a warehouse size no larger than MEDIUM when calling Snowflake Cortex AI
Functions. Using a larger warehouse than necessary does not increase performance, but can result in unnecessary costs.
This recommendation may change in the future as we continue to evolve Cortex AI Functions.

### Track costs for AI services

To track credits used for AI Services including LLM Functions in your account, use the [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md):

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_DAILY_HISTORY
  WHERE SERVICE_TYPE='AI_SERVICES';
```

### Track credit consumption for Cortex AI Functions

To view the credit and token consumption for each AI Function call, use the [CORTEX_FUNCTIONS_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_functions_usage_history.md):

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY;
```

You can also view the credit and token consumption for each query within your Snowflake account. Viewing the credit and token consumption for each query helps you identify queries that are consuming the most credits and tokens.

The following example query uses the [CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view](../../sql-reference/account-usage/cortex_functions_query_usage_history.md) to show the credit and token consumption for all of your queries within your account.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY;
```

You can also use the same view to see the credit and token consumption for a specific query.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY
WHERE query_id='<query-id>';
```

> **Note:**
>
> You can’t get granular usage information for requests made with the REST API.

The query usage history is grouped by the models used in the query. For example, if you ran:

```sqlexample
SELECT AI_COMPLETE('mistral-7b', 'Is a hot dog a sandwich'), AI_COMPLETE('mistral-large', 'Is a hot dog a sandwich');
```

The query usage history would show two rows, one for `mistral-7b` and one for `mistral-large`.

## Model restrictions

Models used by Snowflake Cortex have limitations on size as described in the table below. Sizes are given in tokens.
According to industry estimates, tokens generally represent about four characters of text, so the number of words corresponding to a token limit is
less than the number of tokens. Inputs exceeding the context window limit result in an error. Output that exceed the
context window limit is truncated.

The maximum size of the output that a model can produce is limited by the following:

* The model’s output token limit.
* The space available in the context window after the model consumes the input tokens.

For example, `claude-3-7-sonnet` has a context window of 200,000 tokens. If 100,000 tokens are used for the input, the model can generate up to 8,192 tokens. However, if 195,000 tokens are used as input, then the model can only generate up to 5,000 tokens for a total of 200,000 tokens.

> **Important:**
>
> In the AWS AP Southeast 2 (Sydney) region:
>
> * the context window for `llama3-8b` and `mistral-7b` is 4,096 tokens.
> * the context window for `llama3.1-8b` is 16,384 tokens.
> * the context window for the Snowflake managed model from the SUMMARIZE function is 4,096 tokens.
>
> In the AWS Europe West 1 (Ireland) region:
>
> * the context window for `llama3.1-8b` is 16,384 tokens.
> * the context window for `mistral-7b` is 4,096 tokens.

| Function | Model | Context window (tokens) | Max output (tokens) |
| --- | --- | --- | --- |
| AI_COMPLETE | `llama4-maverick` | 128,000 | 8,192 |
|  | `llama4-scout` | 128,000 | 8,192 |
|  | `snowflake-arctic` | 4,096 | 8,192 |
|  | `deepseek-r1` | 32,768 | 8,192 |
|  | `claude-sonnet-4-6` | 1,000,000 | 128,000 |
|  | `claude-opus-4-6` | 1,000,000 | 128,000 |
|  | `claude-sonnet-4-5` | 200,000 | 64,000 |
|  | `claude-haiku-4-5` | 200,000 | 64,000 |
|  | `claude-opus-4-5` | 200,000 | 64,000 |
|  | `claude-4-sonnet` | 200,000 | 32,000 |
|  | `claude-3-7-sonnet` | 200,000 | 32,000 |
|  | `gemini-3.1-pro` | 1,000,000 | 64,000 |
|  | `mistral-large` | 32,000 | 8,192 |
|  | `mistral-large2` | 128,000 | 8,192 |
|  | `openai-o4-mini` | 200,000 | 32,000 |
|  | `openai-gpt-5.1` | 272,000 | 8,192 |
|  | `openai-gpt-5` | 272,000 | 8,192 |
|  | `openai-gpt-5-mini` | 272,000 | 8,192 |
|  | `openai-gpt-5-nano` | 272,000 | 8,192 |
|  | `openai-gpt-5-chat` | 128,000 | 8,192 |
|  | `openai-gpt-4.1` | 128,000 | 32,000 |
|  | `openai-gpt-oss-120b` | 128,000 | 8,192 |
|  | `openai-gpt-oss-20b` | 128,000 | 8,192 |
|  | `mixtral-8x7b` | 32,000 | 8,192 |
|  | `llama3-8b` | 8,000 | 8,192 |
|  | `llama3-70b` | 8,000 | 8,192 |
|  | `llama3.1-8b` | 128,000 | 8,192 |
|  | `llama3.1-70b` | 128,000 | 8,192 |
|  | `llama3.3-70b` | 128,000 | 8,192 |
|  | `snowflake-llama-3.3-70b` | 128,000 | 8,192 |
|  | `llama3.1-405b` | 128,000 | 8,192 |
|  | `snowflake-llama-3.1-405b` | 8,000 | 8,192 |
|  | `mistral-7b` | 32,000 | 8,192 |
| EMBED_TEXT_768 | `e5-base-v2` | 512 | n/a |
|  | `snowflake-arctic-embed-m` | 512 | n/a |
| EMBED_TEXT_1024 | `nv-embed-qa-4` | 512 | n/a |
|  | `multilingual-e5-large` | 512 | n/a |
|  | `voyage-multilingual-2` | 32,000 | n/a |
| AI_EXTRACT | `arctic-extract` | 128,000 | 51,200 |
| AI_FILTER | Snowflake managed model | 128,000 | n/a |
| AI_CLASSIFY | Snowflake managed model | 128,000 | n/a |
| AI_AGG | Snowflake managed model | 128,000 per row  can be used across multiple rows | 8,192 |
| AI_SENTIMENT | Snowflake managed model | 2,048 | n/a |
| AI_SUMMARIZE_AGG | Snowflake managed model | 128,000 per row  can be used across multiple rows | 8,192 |
| ENTITY_SENTIMENT | Snowflake managed model | 2,048 | n/a |
| EXTRACT_ANSWER | Snowflake managed model | 2,048 for text  64 for question | n/a |
| SENTIMENT | Snowflake managed model | 512 | n/a |
| SUMMARIZE | Snowflake managed model | 32,000 | 4,096 |
| TRANSLATE | Snowflake managed model | 4,096 | n/a |

## Choosing a model

The Snowflake Cortex AI_COMPLETE function supports multiple models of varying capability, latency, and cost. These models
have been carefully chosen to align with common customer use cases. To achieve the best
performance per credit, choose a model that’s a good match for the content size and
complexity of your task. Here are brief overviews of the available models.

### Large models

If you’re not sure where to start, try the most capable models first to establish a baseline to evaluate other models.
`claude-3-7-sonnet` and `mistral-large2` are the most capable models offered by Snowflake Cortex,
and will give you a good idea what a state-of-the-art model can do.

* `Claude 4-6 Sonnet` is a leader in general reasoning and multimodal capabilities. It outperforms its predecessors in tasks that require reasoning across different domains and modalities. You can use its large output capacity to get more information from either structured or unstructured queries. Its reasoning capabilities and large context windows make it well-suited for agentic workflows.
* `deepseek-r1` is a foundation model trained using large-scale reinforcement-learning (RL) without supervised fine-tuning (SFT).
  It can deliver high performance across math, code, and reasoning tasks.
  To access the model, set the [cross-region inference parameter](cross-region-inference.md) to `AWS_US`.
* `mistral-large2` is Mistral AI’s most advanced large language model with top-tier reasoning capabilities.
  Compared to `mistral-large`, it’s significantly more capable in code generation, mathematics, reasoning, and
  provides much stronger multilingual support. It’s ideal for complex tasks that require large reasoning capabilities
  or are highly specialized, such as synthetic text generation, code generation, and multilingual text analytics.
* `snowflake-llama3.1-405b` is a model derived from the open source llama3.1 model. It uses the [SwiftKV optimizations](https://www.snowflake.com/en/blog/up-to-75-lower-inference-cost-llama-meta-llm/) developed by the Snowflake AI research team to deliver up to a 75% inference cost reduction. SwiftKV achieves higher throughput performance with minimal accuracy loss.

### Medium models

* `llama3.1-70b` is an open source model that demonstrates state-of-the-art performance ideal for chat applications,
  content creation, and enterprise applications. It is a highly performant, cost effective model that enables diverse use
  cases with a context window of 128K. `llama3-70b` is still supported and has a context window of 8K.
* `snowflake-llama3.3-70b` is a model derived from the open source llama3.3 model. It uses the [SwiftKV optimizations](https://www.snowflake.com/en/blog/up-to-75-lower-inference-cost-llama-meta-llm/) developed by the Snowflake AI research team to deliver up to a 75% inference cost reduction. SwiftKV achieves higher throughput performance with minimal accuracy loss.
* `mixtral-8x7b` is ideal for text generation, classification, and question answering. Mistral models are optimized
  for low latency with low memory requirements, which translates into higher throughput for enterprise use cases.

### Small models

* `llama3.1-8b` is ideal for tasks that require low to moderate reasoning. It’s a light-weight, ultra-fast model with a context window
  of 128K. `llama3-8b` provides a smaller context window and relatively lower accuracy.
* `mistral-7b` is ideal for your simplest summarization, structuration, and question answering tasks that need to be
  done quickly. It offers low latency and high throughput processing for multiple pages of text with its 32K context
  window.

The following table provides information on how popular models perform on various benchmarks,
including the models offered by Snowflake Cortex AI_COMPLETE as well as a few other popular models.

| Model | Context Window  (Tokens) | MMLU  (Reasoning) | HumanEval  (Coding) | GSM8K  (Arithmetic Reasoning) | Spider 1.0  (SQL) |
| --- | --- | --- | --- | --- | --- |
| [GPT 4.o](https://openai.com/index/hello-gpt-4o/) | 128,000 | 88.7 | 90.2 | 96.4 | - |
| [Claude 3.5 Sonnet](https://www.anthropic.com/claude) | 200,000 | 88.3 | 92.0 | 96.4 | - |
| [llama3.1-405b](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md) | 128,000 | 88.6 | 89 | 96.8 | - |
| [llama3.1-70b](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md) | 128,000 | 86 | 80.5 | 95.1 | - |
| [mistral-large2](https://mistral.ai/news/mistral-large-2407/) | 128,000 | 84 | 92 | 93 | - |
| [llama3.1-8b](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md) | 128,000 | 73 | 72.6 | 84.9 | - |
| [mixtral-8x7b](https://mistral.ai/news/mixtral-of-experts/) | 32,000 | 70.6 | 40.2 | 60.4 | - |
| [Snowflake Arctic](https://www.snowflake.com/en/data-cloud/arctic/) | 4,096 | 67.3 | 64.3 | 69.7 | 79 |
| [mistral-7b](https://mistral.ai/news/announcing-mistral-7b/) | 32,000 | 62.5 | 26.2 | 52.1 | - |
| GPT 3.5 Turbo\* | 4,097 | 70 | 48.1 | 57.1 | - |

## Previous model versions

The Snowflake Cortex AI_COMPLETE and COMPLETE functions also supports the following older model versions. We recommend
using the latest model versions instead of the versions listed in this table.

| Model | Context Window  (Tokens) | MMLU  (Reasoning) | HumanEval  (Coding) | GSM8K  (Arithmetic Reasoning) | Spider 1.0  (SQL) |
| --- | --- | --- | --- | --- | --- |
| [mistral-large](https://mistral.ai/news/mistral-large/) | 32,000 | 81.2 | 45.1 | 81 | 81 |
| [llama-2-70b-chat](https://huggingface.co/meta-llama/Llama-2-70b-chat) | 4,096 | 68.9 | 30.5 | 57.5 | - |

## Using Snowflake Cortex AI Functions with Python

### Call Cortex AI Functions in Snowpark Python

You can use Snowflake Cortex AI Functions in the Snowpark Python API. These functions include the following. Note that the functions in Snowpark Python have names in Pythonic “snake_case”
format, with words separated by underscores and all letters in lowercase.

* [ai_agg](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_agg)
* [ai_classify](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_classify)
* [ai_complete](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_complete)
* [ai_filter](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_filter)
* [ai_similarity](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_similarity)
* [ai_summarize_agg](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ai_summarize_agg)

#### `ai_agg` example

The `ai_agg` function aggregates a column of text using natural language instructions in a similar manner to how you would ask an analyst to summarize or extract findings from grouped or ungrouped data.

The following example summarizes customer reviews for each product using the `ai_agg` function. The function takes a column of text and a natural language instruction to summarize the reviews.

```python
from snowflake.snowpark.functions import ai_agg, col

df = session.create_dataframe([
    [1, "Excellent product!"],
    [1, "Great battery life."],
    [1, "A bit expensive but worth it."],
    [2, "Terrible customer service."],
    [2, "Won’t buy again."],
], schema=["product_id", "review"])

# Summarize reviews per product
summary_df = df.group_by("product_id").agg(
    ai_agg(col("review"), "Summarize the customer reviews in one sentence.")
)
summary_df.show()
```

> **Note:**
>
> Use task descriptions that are detailed and centered on the use case. For example, “Summarize the customer feedback for an investor report”.

#### Classify text with `ai_classify`

The `ai_classify` function takes a string or image and classifies it into the categories that you define.

The following example classifies travel reviews into categories such as “travel” and “cooking”. The function takes a column of text and a list of categories to classify the text into.

```python
from snowflake.snowpark.functions import ai_classify, col

df = session.create_dataframe([
    ["I dream of backpacking across South America."],
    ["I made the best pasta yesterday."],
], schema=["sentence"])

df = df.select(
    "sentence",
    ai_classify(col("sentence"), ["travel", "cooking"]).alias("classification")
)
df.show()
```

> **Note:**
>
> You can provide up to 500 categories. You can classify both text and images.

#### Filter rows with `ai_filter`

The `ai_filter` function evaluates a natural language condition and returns `True` or `False`. You can use it to filter or tag rows.

```python
from snowflake.snowpark.functions import ai_filter, prompt, col

df = session.create_dataframe(["Canada", "Germany", "Japan"], schema=["country"])

filtered_df = df.select(
    "country",
    ai_filter(prompt("Is {0} in Asia?", col("country"))).alias("is_in_asia")
)
filtered_df.show()
```

> **Note:**
>
> You can filter on both strings and files. For dynamic prompts, use the `prompt` function.
> For more information, see
> [Snowpark Python reference](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index).

### Call Cortex AI Functions in Snowflake ML

[Snowflake ML](../../developer-guide/snowflake-ml/overview.md) contains the older AI Functions, those with names that don’t
begin with “AI”. These functions are supported in version 1.1.2 and later of Snowflake ML. The names are rendered in Pythonic
“snake_case” format, with words separated by underscores and all letters in lowercase.

If you run your Python script outside of Snowflake, you must create a Snowpark session to use these functions. See
[Connecting to Snowflake](../../developer-guide/snowflake-ml/snowpark-ml.md) for instructions.

#### Process single values

The following Python example illustrates calling Snowflake Cortex AI functions on single values:

```python
from snowflake.cortex import complete, extract_answer, sentiment, summarize, translate

text = """
    The Snowflake company was co-founded by Thierry Cruanes, Marcin Zukowski,
    and Benoit Dageville in 2012 and is headquartered in Bozeman, Montana.
"""

print(complete("llama3.1-8b", "how do snowflakes get their unique patterns?"))
print(extract_answer(text, "When was snowflake founded?"))
print(sentiment("I really enjoyed this restaurant. Fantastic service!"))
print(summarize(text))
print(translate(text, "en", "fr"))
```

#### Pass hyperparameter options

You can pass options that affect the model’s hyperparameters when using the `complete` function. The following
Python example illustrates modifying the maximum number of output tokens that the model can generate:

```python
from snowflake.cortex import complete, CompleteOptions

model_options1 = CompleteOptions(
    {'max_tokens':30}
)

print(complete("llama3.1-8b", "how do snowflakes get their unique patterns?", options=model_options1))
```

#### Call functions on table columns

You can call an AI function on a table column, as shown below. This example requires a session object (stored in
`session`) and a table `articles` containing a text column `abstract_text`, and creates a new column
`abstract_summary` containing a summary of the abstract.

```python
from snowflake.cortex import summarize
from snowflake.snowpark.functions import col

article_df = session.table("articles")
article_df = article_df.withColumn(
    "abstract_summary",
    summarize(col("abstract_text"))
)
article_df.collect()
```

> **Note:**
>
> The advanced chat-style (multi-message) form of COMPLETE is not currently supported in Snowflake ML Python.

## Using Snowflake Cortex AI functions with Snowflake CLI

Snowflake Cortex AI Functions are available in [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) version 2.4.0
and later. See [Introducing Snowflake CLI](../../developer-guide/snowflake-cli/introduction/introduction.md) for more information about using Snowflake CLI.
The functions are the old-style functions, those with names that don’t begin with “AI”.

The following examples illustrate using the `snow cortex` commands on single values. The `-c` parameter specifies which connection to use.

> **Note:**
>
> The advanced chat-style (multi-message) form of COMPLETE is not currently supported in Snowflake CLI.

```snowcli
snow cortex complete "Is 5 more than 4? Please answer using one word without a period." -c "snowhouse"
```

```snowcli
snow cortex extract-answer "what is snowflake?" "snowflake is a company" -c "snowhouse"
```

```snowcli
snow cortex sentiment "Mary had a little Lamb" -c "snowhouse"
```

```snowcli
snow cortex summarize "John has a car. John's car is blue. John's car is old and John is thinking about buying a new car. There are a lot of cars to choose from and John cannot sleep because it's an important decision for John."
```

```snowcli
snow cortex translate herb --to pl
```

You can also use files that contain the text you want to use for the commands. For this example, assume that the file `about_cortex.txt` contains the following content:

```output
Snowflake Cortex gives you instant access to industry-leading large language models (LLMs) trained by researchers at companies like Anthropic, Mistral, Reka, Meta, and Google, including Snowflake Arctic, an open enterprise-grade model developed by Snowflake.

Since these LLMs are fully hosted and managed by Snowflake, using them requires no setup. Your data stays within Snowflake, giving you the performance, scalability, and governance you expect.

Snowflake Cortex features are provided as SQL functions and are also available in Python. The available functions are summarized below.

COMPLETE: Given a prompt, returns a response that completes the prompt. This function accepts either a single prompt or a conversation with multiple prompts and responses.
EMBED_TEXT_768: Given a piece of text, returns a vector embedding that represents that text.
EXTRACT_ANSWER: Given a question and unstructured data, returns the answer to the question if it can be found in the data.
SENTIMENT: Returns a sentiment score, from -1 to 1, representing the detected positive or negative sentiment of the given text.
SUMMARIZE: Returns a summary of the given text.
TRANSLATE: Translates given text from any supported language to any other.
```

You can then execute the `snow cortex summarize` command by passing in the filename using the `--file` parameter, as shown:

```snowcli
snow cortex summarize --file about_cortex.txt
```

```output
Snowflake Cortex offers instant access to industry-leading language models, including Snowflake Arctic, with SQL functions for completing prompts (COMPLETE), text embedding (EMBED_TEXT_768), extracting answers (EXTRACT_ANSWER), sentiment analysis (SENTIMENT), summarizing text (SUMMARIZE), and translating text (TRANSLATE).
```

For more information about these commands, see [snow cortex commands](../../developer-guide/snowflake-cli/command-reference/cortex-commands/overview.md).

## Legal notices

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Snowflake-managed MCP server
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-mcp.md
section: Snowflake Cortex (AI & ML)
---

# Snowflake-managed MCP server

## Overview

> **Note:**
>
> Snowflake supports Model Context Protocol revision `2025-11-25`.

Model Context Protocol (MCP), is an [open-source standard](https://modelcontextprotocol.io/docs/getting-started/intro) that lets AI agents securely interact with business applications and external data systems, such as databases and content repositories. MCP lets enterprise businesses reduce integration challenges and quickly deliver outcomes from models. Since its launch, MCP has become foundational for agentic applications, providing a consistent and secure mechanism for invoking tools and retrieving data.

The Snowflake-managed MCP server lets AI agents securely retrieve data from Snowflake accounts without needing to deploy separate infrastructure. You can configure the MCP server to serve Cortex Analyst, Cortex Search, and Cortex Agents as tools, along with custom tools and SQL executions on the standards-based interface. MCP clients discover and invoke these tools, and retrieve data required for the application. With managed MCP servers on Snowflake, you can build scalable enterprise-grade applications while maintaining access and privacy controls. The MCP server on Snowflake provides:

* **Standardized integration:** Unified interface for tool discovery and invocation, in compliance with the rapidly evolving standards.
* **Comprehensive authentication:** Snowflake’s built-in OAuth service to enable OAuth-based authentication for MCP integrations.
* **Robust governance:** Role based access control (RBAC) for the MCP server and tools to manage tool discovery and invocation.

For information about the MCP lifecycle, see [Lifecycle](https://modelcontextprotocol.io/specification/2025-11-25/basic/lifecycle). For an example of an MCP implementation, see the [Getting Started with Managed Snowflake MCP Server](https://quickstarts.snowflake.com/guide/getting-started-with-snowflake-mcp-server/index.html) Quickstart.

## MCP server security recommendations

> **Important:**
>
> When you configure hostnames for MCP server connections, use hyphens (`-`) instead of underscores (`_`). MCP servers have connection issues with hostnames containing underscores.

Using multiple MCP servers without verifying tools and descriptions could lead to vulnerabilities such as tool poisoning or tool shadowing. Snowflake recommends verifying third-party MCP servers before using them. This includes any MCP server from another Snowflake user or account. Verify all tools offered by third-party MCP servers.

We recommend using OAuth as the authentication method. Using hardcoded tokens can lead to token leakage.

When using a Programmatic Access Token (PAT), set it to use the least-privileged role allowed to work with MCP. This will help prevent leaking a secret with access to a highly-privileged role.

Configure proper permissions for the MCP server and tools following the least-privilege principle. Access to the MCP Server does not give access to the tools. Permission needs to be granted for each tool.

## Create an MCP Server object

Create an object, specifying the tools and other metadata. MCP clients that connect with the server, after requisite authentication, are able to discover and invoke these tools.

1. Navigate to the desired database and schema to create the MCP server in.
2. Create the MCP server:

   ```sqlexample-yaml
   CREATE [ OR REPLACE ] MCP SERVER [ IF NOT EXISTS ] <server_name>
     FROM SPECIFICATION $$
       tools:
         - name: "product-search"
           type: "CORTEX_SEARCH_SERVICE_QUERY"
           identifier: "database1.schema1.Cortex_Search_Service1"
           description: "cortex search service for all products"
           title: "Product Search"

         - name: "revenue-semantic-view"
           type: "CORTEX_ANALYST_MESSAGE"
           identifier: "database1.schema1.Semantic_View_1"
           description: "Semantic view for all revenue tables"
           title: "Semantic view for revenue"
     $$
   ```

> Snowflake currently supports the following tool types:
>
> * **CORTEX_SEARCH_SERVICE_QUERY:** Cortex Search Service tool
> * **CORTEX_ANALYST_MESSAGE:** Cortex Analyst tool
> * **SYSTEM_EXECUTE_SQL:** SQL execution
> * **CORTEX_AGENT_RUN:** Cortex Agent tool
> * **GENERIC:** tool for UDFs and stored procedures
>
> The following examples show how to configure different tool types:
>
> Analyst toolSearch toolSQL execution toolAgent toolUDF / Stored Procedure
>
> Using the Analyst tool, your client can generate SQL from natural language text. Use the following code to specify the tool configuration.
>
> > **Note:**
> >
> > The Snowflake-managed MCP server only supports using semantic views with Cortex Analyst. It does not support semantic models.
>
> ```yaml
> tools:
>   - name: "revenue-semantic-view"
>     type: "CORTEX_ANALYST_MESSAGE"
>     identifier: "database1.schema1.Semantic_View_1"
>     description: "Semantic view for all revenue tables"
>     title: "Semantic view for revenue"
> ```
>
> Using the Search tool requests, your client can perform unstructured search on their data.
>
> ```sqlexample-yaml
> tools:
>   - name: "product-search"
>     type: "CORTEX_SEARCH_SERVICE_QUERY"
>     identifier: "database1.schema1.Cortex_Search_Service1"
>     description: "cortex search service for all products"
>     title: "Product Search"
> ```
>
> For the SQL execution tool, your client can execute SQL queries on Snowflake. You can optionally configure the following options:
>
> * `read_only`: When set to `true`, only read operations (SELECT queries) are allowed. Defaults to `false`.
> * `query_timeout`: Maximum time in seconds for query execution.
> * `warehouse`: The warehouse to use for query execution. If not specified, the default warehouse is used.
>
> Use the following code to specify the tool configuration:
>
> ```yaml
> tools:
>   - title: "SQL Execution Tool"
>     name: "sql_exec_tool"
>     type: "SYSTEM_EXECUTE_SQL"
>     description: "A tool to execute SQL queries against the connected Snowflake database."
>     config:
>       read_only: false
>       query_timeout: 600
>       warehouse: "WAREHOUSE"
> ```
>
> For the Agent tool, your client passes a message to the agent. The agent processes the request and returns a response. Use the following code to specify the tool configuration.
>
> ```yaml
> tools:
>   - title: "Agent V2"
>     name: "agent_1"
>     type: "CORTEX_AGENT_RUN"
>     identifier: "db.schema.agent"
>     description: "agent that gives the ability to..."
> ```
>
> For your custom tools, you must provide the user-defined function (UDF) or stored procedure signature in the tool configuration. The custom tool enables you to invoke UDFs and stored procedures as tools through the MCP server.
>
> You can specify the following in the tool configuration:
>
> * `type`: `function` for UDF, `procedure` for stored procedure
> * `warehouse`: The warehouse to use. If you don’t specify a warehouse, the default warehouse is used.
> * `query_timeout`: Maximum time in seconds for tool execution.
> * `input_schema`: Corresponds to the function signature.
>
> ```yaml
> tools:
>   - name: "my_custom_tool"
>     identifier: "db.schema.my_function"
>     type: "GENERIC"
>     description: "Custom tool description"
>     config:
>       type: "function"
>       query_timeout: 120
>       warehouse: "WAREHOUSE"
>       input_schema:
>         type: "object"
>         properties:
>           query:
>             type: "string"
> ```

Use the following examples to create and configure custom tools using UDFs and stored procedures:

UDF examplesStored procedure examplesTool configuration examples

The following examples demonstrate creating UDFs that can be used as custom tools:

```sqlexample-python
-- create a simple udf
CREATE OR REPLACE FUNCTION MULTIPLY_BY_TEN(x FLOAT)
RETURNS FLOAT
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
HANDLER = 'multiply_by_ten'
AS
$$
def multiply_by_ten(x: float) -> float:
  return x * 10
$$;

SHOW FUNCTIONS LIKE 'MULTIPLY_BY_TEN';

-- test return json/variant
CREATE OR REPLACE FUNCTION CALCULATE_PRODUCT_AND_SUM(x FLOAT, y FLOAT)
RETURNS VARIANT
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
HANDLER = 'calculate_values'
AS
$$
import json

def calculate_values(x: float, y: float) -> dict:
  """
  Calculates the product and sum of two numbers and returns them in a dictionary.
  The dictionary is converted to a VARIANT (JSON) in the SQL return.
  """
  product = x * y
  sum_val = x + y

  return {
      "product": product,
      "sum": sum_val
  }
$$;

-- test return list/array
CREATE OR REPLACE FUNCTION GET_NUMBERS_IN_RANGE(x FLOAT, y FLOAT)
RETURNS ARRAY -- Use ARRAY to explicitly state a list is being returned
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
HANDLER = 'get_numbers'
AS
$$
def get_numbers(x: float, y: float) -> list:
  """
  Returns a list of integers between x (exclusive) and y (inclusive).
  Assumes x < y.
  """
  # Ensure x and y are treated as integers for range generation
  start = int(x) + 1
  end = int(y) + 1 # range() is exclusive on the stop value

  # Use a list comprehension to generate the numbers
  # The Python list will be converted to a Snowflake ARRAY.
  return list(range(start, end))
$$;
```

The following examples demonstrate creating stored procedures that can be used as custom tools:

```sqlexample-python
-- create a simple stored procedure
CREATE OR REPLACE PROCEDURE MULTIPLY_BY_TEN_SP(x FLOAT)
RETURNS FLOAT
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'multiply_by_ten'
AS
$$
# The handler logic is identical to the UDF for a scalar return
def multiply_by_ten(x: float) -> float:
      return x * 10
$$;

-- test return json/variant
CREATE OR REPLACE PROCEDURE CALCULATE_VALUES_SP(x FLOAT, y FLOAT)
RETURNS VARIANT
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'calculate_values'
AS
$$
# The handler logic is identical to the UDF for a VARIANT return
def calculate_values(x: float, y: float) -> dict:
      """
      Calculates the product and sum of two numbers and returns them in a dictionary.
      The dictionary is converted to a VARIANT (JSON) in the SQL return.
      """
      product = x * y
      sum_val = x + y

      return {
          "product": product,
          "sum": sum_val
      }
$$;

-- test return list/array
CREATE OR REPLACE PROCEDURE GET_NUMBERS_SP(x FLOAT, y FLOAT)
RETURNS ARRAY
LANGUAGE PYTHON
RUNTIME_VERSION = '3.8'
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'get_numbers'
AS
$$
def get_numbers(x: float, y: float) -> list:
      """
      Returns a list of integers between x (exclusive) and y (inclusive).
      The Python list will be converted to a Snowflake ARRAY.
      """
      # Ensure x and y are treated as integers for range generation
      start = int(x) + 1
      end = int(y) + 1 # range() is exclusive on the stop value

      # Use a list comprehension to generate the numbers
      return list(range(start, end))
$$;
```

The following examples demonstrate configuring custom tools for UDFs and stored procedures:

```sqlexample-yaml
CREATE MCP SERVER my_mcp_server
  FROM SPECIFICATION $$
    tools:
      - title: "Custom Tool 1"
        identifier: "EXAMPLE_DATABASE.AGENTS.MULTIPLY_BY_TEN"
        name: "multiply_by_ten"
        type: "GENERIC"
        description: "Multiplied input value by ten and returns the result."
        config:
          type: "function"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "A number to be multiplied by ten"
                type: "number"
      - title: "Custom Tool 2"
        identifier: "EXAMPLE_DATABASE.AGENTS.CALCULATE_PRODUCT_AND_SUM"
        name: "calculate_product_and_sum"
        type: "GENERIC"
        description: "Calculates the product and sum of two numbers and returns them in a JSON object."
        config:
          type: "function"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "First number"
                type: "number"
              y:
                description: "Second number"
                type: "number"
      - title: "Custom Tool 3"
        identifier: "EXAMPLE_DATABASE.AGENTS.GET_NUMBERS_IN_RANGE"
        name: "get_numbers_in_range"
        type: "GENERIC"
        description: "Returns a list of integers between two numbers."
        config:
          type: "function"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "Start number (exclusive)"
                type: "number"
              y:
                description: "End number (inclusive)"
                type: "number"
      - title: "Custom Tool 4"
        identifier: "EXAMPLE_DATABASE.AGENTS.MULTIPLY_BY_TEN_SP"
        name: "multiply_by_ten_sp"
        type: "GENERIC"
        description: "Multiplied input value by ten and returns the result."
        config:
          type: "procedure"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "A number to be multiplied by ten"
                type: "number"
      - title: "Custom Tool 5"
        identifier: "EXAMPLE_DATABASE.AGENTS.CALCULATE_PRODUCT_AND_SUM_SP"
        name: "calculate_product_and_sum_sp"
        type: "GENERIC"
        description: "Calculates the product and sum of two numbers and returns them in a JSON object."
        config:
          type: "procedure"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "First number"
                type: "number"
              y:
                description: "Second number"
                type: "number"
      - title: "Custom Tool 6"
        identifier: "EXAMPLE_DATABASE.AGENTS.GET_NUMBERS_IN_RANGE_SP"
        name: "get_numbers_in_range_sp"
        type: "GENERIC"
        description: "Returns a list of integers between two numbers."
        config:
          type: "procedure"
          warehouse: "COMPUTE_SERVICE_WAREHOUSE"
          input_schema:
            type: "object"
            properties:
              x:
                description: "Start number (exclusive)"
                type: "number"
              y:
                description: "End number (inclusive)"
                type: "number"
  $$;
```

1. To show MCP servers, use the following commands:

   ```sqlexample
   SHOW MCP SERVERS IN DATABASE <database_name>;
   SHOW MCP SERVERS IN SCHEMA <schema_name>;
   SHOW MCP SERVERS IN ACCOUNT;
   ```

   The following shows the output of the command:

   ```output
   |               created_on               |       name        | database_name | schema_name |    owner     |           comment            |
   ------------------------------------------+-------------------+---------------+-------------+--------------+------------------------------
   | Fri, 23 Jun 1967 07:00:00.123000 +0000 | TEST_MCP_SERVER   | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | [NULL]                       |
   | Fri, 23 Jun 1967 07:00:00.123000 +0000 | TEST_MCP_SERVER_2 | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | Test MCP server with comment |
   ```
2. To describe an MCP server, use the following command:

   ```sqlexample
   DESCRIBE MCP SERVER <server_name>;
   ```

   The following shows the output of the command:

   ```output
   |      name       | database_name | schema_name |    owner     | comment |     server_spec        |               created_on               |
   ------------------------------------------------------------------------------------------------------+-------------------------------------
   | TEST_MCP_SERVER | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | [NULL]  | {"version":1,"tools":[{"name":"product-search","identifier":"db.schema.search_service","type":"CORTEX_SEARCH_SERVICE_QUERY"}]} | Fri, 23 Jun 1967 07:00:00.123000 +0000 |
   ```
3. To drop an MCP server, use the following command:

   ```sqlexample
   DROP MCP SERVER <server_name>;
   ```

## MCP server URL

To connect to the MCP server, use the URL endpoint with the following format:

```none
https://<account_URl>/api/v2/databases/{database}/schemas/{schema}/mcp-servers/{name}
```

For information about formatting your account URL, see [Account identifiers](../admin-account-identifier.md).

## Access control

You can use the following privileges to manage access to the MCP server and the underlying tools.

| Privilege | Object | Description |
| --- | --- | --- |
| CREATE | MCP SERVER | Required to create the MCP server |
| OWNERSHIP | MCP SERVER | Required to update the object configuration |
| MODIFY | MCP SERVER | Provides update, drop, describe, show, and use (`tools/list` and `tools/call`) on the object configuration |
| USAGE | MCP SERVER | Required to connect with the MCP server and discover tools |
| USAGE | Cortex Search Service | Required to invoke the Cortex Search tool in the MCP server |
| SELECT | Semantic View | Required to invoke the Cortex Analyst tool in the MCP server |
| USAGE | Cortex Agent | Required to invoke the Cortex Agent as a tool in the MCP server |
| USAGE | User-defined function (UDF) or stored procedure | Required to invoke the UDF or stored procedure as a tool in the MCP server |

## Set up OAuth authentication

Configure authentication on the MCP client. The Snowflake-managed MCP server supports [OAuth 2.0](../oauth-snowflake-overview.md) aligned with the [authorization](https://modelcontextprotocol.io/specification/2025-11-25/basic/authorization) recommendation in the MCP protocol. The Snowflake-managed MCP server doesn’t support dynamic client registration.

1. First, create the security integration. For information about this command, see [CREATE SECURITY INTEGRATION (Snowflake OAuth)](../../sql-reference/sql/create-security-integration-oauth-snowflake.md).

   ```sqlexample
   CREATE [ OR REPLACE ] SECURITY INTEGRATION [IF NOT EXISTS] <integration_name>
     TYPE = OAUTH
     OAUTH_CLIENT = CUSTOM
     ENABLED = TRUE
     OAUTH_CLIENT_TYPE = 'CONFIDENTIAL'
     OAUTH_REDIRECT_URI = '<redirect_URI>'
   ```
2. Then, call the system function to retrieve your client id and keys for client configuration. The integration name is case sensitive and must be in uppercase.

   ```sqlexample
   SELECT SYSTEM$SHOW_OAUTH_CLIENT_SECRETS('<integration_name>');
   ```

## Interact with the MCP server using a custom MCP client

For information about building a custom MCP client, see [Build an MCP client](https://modelcontextprotocol.io/docs/develop/build-client).

> **Note:**
>
> The Snowflake MCP server currently only supports tool capabilities.

### Discover and invoke tools

The MCP clients can discover and invoke tools with `tools/list` and `tools/call` requests.

To discover or invoke tools, issue a POST call as shown in the [tools/list request](https://modelcontextprotocol.io/specification/2025-11-25/server/tools#calling-tools):

For the Analyst tool, your client passes messages in the request. The SQL statement is listed in the output. You must pass the name of the tool that you’re invoking in the request in the `name` parameter.

```none
POST /api/v2/databases/<database>/schemas/<schema>/mcp-servers/<name>
    {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "tools/call",
        "params": {
            "name": "test-analyst",
            "arguments": {
                "message": "text"
            }
        }
    }
```

The following example shows the response:

```json
{
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "content": [
            {
                "type": "text",
                "text": "string"
            }
        ]
    }
}
```

For Search tool requests, your client can pass the query and the following optional arguments:

* columns
* limit

The search results and request ID are returned in the output. You must pass the name of the tool that you’re invoking in the request as the `name` parameter.

```none
POST /api/v2/databases/{database}/schemas/{schema}/mcp-servers/{name}
    {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "tools/call",
        "params": {
            "name": "product-search",
            "arguments": {
                "query": "Hotels in NYC",
                "columns": array of strings,
                "limit": int
            }
        }
  }
```

The following example shows the response:

```json
{
    "jsonrpc": "2.0",
    "id": 1,
    "result": {
        "results": {}
    }
}
```

## Limitations

Snowflake managed MCP server does not support the following constructs in the MCP protocol: resources, prompts, roots, notifications, version negotiations, life cycle phases, and sampling.

Only non-streaming responses are supported.

---
title: Suggestions for semantic models and views
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/verified-query-suggestions.md
section: Snowflake Cortex (AI & ML)
---

# Suggestions for semantic models and views

Suggestions help you enrich and improve your semantic models and views by identifying elements that appear useful based on real user behavior. These suggestions require human review before being added.

Cortex Analyst surfaces suggestions in contexts where it has enough information to propose new verified queries, filters, metrics, and other model elements.

Suggestions are not automatically applied. Instead, Cortex Analyst uses suggestions as a queue of potential improvements that you can accept, edit, or dismiss.

## Review suggestions

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Cortex Analyst.
3. Select the semantic model or view you want to review suggestions for.
4. Review the suggestions on the right panel.
5. Select a suggestion to Accept, Edit, or Dismiss. Cortex Analyst only allows renaming, rewriting SQL, or updating descriptions depending on the type of suggestion.
6. Refresh the page to generate new suggestions. Some suggestions that are fetched from query history may take a few minutes to appear.

## Dismissing a suggestion

When you dismiss a suggestion, the dismissal is valid only for the current session. Refreshing the page may result in the return of previously dismissed suggestions.

## Types of Suggestions

The following sections list all suggestion types, organized by their source.

### Suggestions from query history

Snowflake analyzes recent SQL query history available to the current role to discover frequently used or missing elements.

> **Note:**
>
> All suggestions based on query history only appear after the model or view has accumulated sufficient query activity. If a query was either run far in the past or run recently in a database that gets a lot of new query traffic, it may be omitted.

### Verified Query suggestions

These suggestions come from Cortex Analyst when it identifies queries that could be added to your Verified Query repository (VQR). Snowflake suggests up to 10 verified queries at a time. If you accept all suggestions, refresh the page to get more.

The criteria for these queries to be suggested are:

* **High frequency:** Queries similar to the candidate appear frequently.
* **Contains interesting semantic information:** Extremely simple queries are removed since they are unlikely to add value.
* **Novelty:** No existing verified query looks similar.

### Filter and Metric suggestions

These suggestions analyze SQL query history to find frequently used SQL expressions that are not yet represented in the Semantic Model or view. Snowflake suggests up to 10 filters and 10 metrics at a time. If you accept all suggestions, refresh the page to get more.

The criteria for these queries to be suggested are:

* **High frequency:** Queries similar to the candidate appear frequently.
* **Novelty:** No existing verified query looks similar.

### Suggestions from Usage Data

Cortex Analyst aggregates recent user queries to make suggestions. If there are commonly asked questions by users of Cortex Analyst, Cortex Agents, or Snowflake Intelligence that don’t match any existing verified queries, these questions get aggregated, grouped, and suggested in up to 10 verified query suggestions.

The criteria for these questions to be suggested are:

* Appears in Cortex Analyst monitoring tables. This includes questions from Cortex Agents and Snowflake Intelligence by default.
* Frequently asked by users.

### Suggestions from optimization

Snowflake can optimize a semantic model or view from existing verified queries. This process produces different types of suggestions, including:

* Metrics
* Filters
* Custom Instructions
* Descriptions
* Synonyms

---
title: Threads API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-threads-rest-api.md
section: Snowflake Cortex (AI & ML)
---

# Threads API

Use this API to create threads that are used to interact with Cortex Agents.

## Create thread

`POST /api/v2/cortex/threads`

Creates a new thread and returns a thread metadata object.

### Request

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. For more information, see [Authentication](cortex-agents.md). |
| `Content-Type` | (Required) application/json |

#### Request body

The request body can include the following field:

| Field | Type | Description |
| --- | --- | --- |
| `origin_application` | string | (Optional) Name of the application that created the thread. Allows grouping threads by application. Limited to 16 bytes. |

Example:

```json
{
  "origin_application": "my_app"
}
```

### Response

Returns a thread metadata object.

| Field | Type | Description |
| --- | --- | --- |
| `thread_id` | integer | UUID for the thread. |
| `thread_name` | string | Name of the thread. |
| `origin_application` | string | The name of the application that created the thread. |
| `created_on` | integer | Time when the thread was created (milliseconds since UNIX epoch). |
| `updated_on` | integer | Time when the thread was last updated (milliseconds since UNIX epoch). |

Example:

```json
{
  "thread_id": 1234567890,
  "thread_name": "",
  "origin_application": "my_app",
  "created_on": 1717000000000,
  "updated_on": 1717000000000
}
```

## Describe thread

`GET /api/v2/cortex/threads/{id}`

Describes a thread and returns a batch of messages in that thread, based on the page_size and the last_message_id, in descending order of creation. This request is only successful if the thread ID belongs to the user.

### Request

#### Path parameters

| Parameter | Type | Description |
| --- | --- | --- |
| `id` | integer | (Required) UUID for the thread. |

#### Query parameters

| Parameter | Type | Description |
| --- | --- | --- |
| `page_size` | integer | (Optional) Number of messages to return (default: 20, max: 100). |
| `last_message_id` | integer | (Optional) The ID of the last message received. Used to set the offset for next batch. Can be empty for the first batch of messages. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. |
| `Content-Type` | (Required) application/json |

### Response

Returns a thread metadata object and an array of messages.

| Field | Type | Description |
| --- | --- | --- |
| metadata | object | Metadata for the thread, including the name, application that created the thread, and the time that it was created. |
| `messages` | array | Array of message objects. |

#### metadata

| Field | Type | Description |
| --- | --- | --- |
| `thread_id` | integer | UUID for the thread. |
| `thread_name` | string | Name of the thread. |
| `origin_application` | string | The name of the application that created the thread. |
| `created_on` | integer | Time when the thread was created (milliseconds since UNIX epoch). |
| `updated_on` | integer | Time when the thread was last updated (milliseconds since UNIX epoch). An update includes adding any new messages to the thread. |

#### Messages

| Field | Type | Description |
| --- | --- | --- |
| `message_id` | integer | UUID for the message. |
| `parent_id` | integer | UUID for the parent message. |
| `created_on` | integer | Time when the message was created (milliseconds since UNIX epoch). |
| `role` | string | The role that generated this message. |
| `message_payload` | string | Message payload. |
| `request_id` | string | Request ID for the original message. |

Example:

```json
{
  "metadata": {
    "thread_id": 1234567890,
    "thread_name": "Support Chat",
    "origin_application": "my_app",
    "created_on": 1717000000000,
    "updated_on": 1717000100000
  },
  "messages": [
    {
      "message_id": 1,
      "parent_id": null,
      "created_on": 1717000000000,
      "role": "user",
      "message_payload": "Hello, I need help.",
      "request_id": "req_001"
    },
    {
      "message_id": 2,
      "parent_id": 1,
      "created_on": 1717000001000,
      "role": "assistant",
      "message_payload": "How can I assist you?",
      "request_id": "req_002"
    }
  ]
}
```

## Update thread

`POST /api/v2/cortex/threads/{id}`

Updates a thread.

### Request

#### Path parameters

| Parameter | Type | Description |
| --- | --- | --- |
| `id` | integer | (Required) UUID for the thread. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. |
| `Content-Type` | (Required) application/json |

#### Request body

| Field | Type | Description |
| --- | --- | --- |
| `thread_name` | string | (Optional) Name of the thread. |

Example:

```json
{
  "thread_name": "New Thread Name"
}
```

### Response

Returns the status of the thread update.

```json
{"status": "Thread xxxx successfully updated."}
```

## List threads

`GET /api/v2/cortex/threads`

Lists all threads belonging to the user.

### Request

#### Query parameters

| Parameter | Type | Description |
| --- | --- | --- |
| `origin_application` | string | (Optional) Filter the list of threads by this origin application. Without specifying this field, all threads are returned. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. |
| `Content-Type` | (Required) application/json |

### Response

Returns an array of thread metadata objects.

#### Thread metadata

| Field | Type | Description |
| --- | --- | --- |
| `thread_id` | integer | UUID for the thread. |
| `thread_name` | string | Name of the thread. |
| `origin_application` | string | The name of the application that created the thread. |
| `created_on` | integer | Time when the thread was created (milliseconds since UNIX epoch). |
| `updated_on` | integer | Time when the thread was last updated (milliseconds since UNIX epoch). An update includes adding any new messages to the thread. |

Example:

```json
[
  {
    "thread_id": 1234567890,
    "thread_name": "Support Chat",
    "origin_application": "my_app",
    "created_on": 1717000000000,
    "updated_on": 1717000100000
  }
]
```

## Delete thread

`DELETE /api/v2/cortex/threads/{id}`

Deletes a thread and all the messages in that thread.

### Request

#### Path parameters

| Parameter | Type | Description |
| --- | --- | --- |
| `id` | integer | (Required) UUID for the thread. |

#### Request headers

| Header | Description |
| --- | --- |
| `Authorization` | (Required) Authorization token. |
| `Content-Type` | (Required) application/json |

### Response

Returns a success response if the thread is deleted.

```json
{
  "success": true
}
```

---
title: Troubleshooting
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/troubleshooting.md
section: Snowflake Cortex (AI & ML)
---

# Troubleshooting

This page provides information about common issues that you may run into when working with Snowflake Intelligence, as well as solutions for those issues. It also provides information about best practices for optimizing the performance of your agents and how to get additional support.

## Common issues and solutions

**Inconsistent responses**
:   Inconsistent responses are most commonly caused by a lack of specificity in prompts. To get a specific style or format for your response, specify it clearly in the prompt.

    While LLMs inherently have some variance, inconsistent answers can also happen after changes in the agent configuration. To resolve this, check for recent changes to your agent configuration, semantic view configuration, chat history, or model selection.

    If you are using a semantic model, you should transition to a semantic view. Semantic views allow validation during creation to help avoid inconsistencies that are less obvious when using a semantic model.

**Streaming response issues**
:   If you see a streaming response on one machine but not another, it is likely due to your organization’s IT configurations, such as network DPI, scanning tools, endpoint security software, or browser extensions. Work with your internal IT team to resolve these issues.

**Error 370001**
:   This error indicates that Snowflake Intelligence generated an unsafe SQL command. Snowflake Intelligence does not execute these commands and instead returns this error.

**Execution_environment not populated for analyst tool**
:   This occurs when the tool is configured to run SQL queries against the user’s default warehouse and the user does not have a warehouse set. To resolve this, either set a default warehouse for the user or configure the tool to execute against a specific custom warehouse. For more information about default warehouses, see [Warehouse usage in sessions](../../warehouses-overview.md).

**“Table / search service / stage does not exist” errors**
:   If you encounter `table / search service / stage does not exist` errors, there might be privilege issues. Verify that the following privileges are set correctly:

    * For each semantic model:

      + The user’s default role is granted USAGE on the database and schema of the semantic model stage or view, and table.
      + If using a semantic model, the user’s default role is granted READ on the stage that stores the semantic model file.
      + If using a semantic view, the user’s default role is granted REFERENCES on the semantic view.
      + The user’s default role is granted SELECT for each table defined in the semantic model or view.
    * For each Cortex search service:

      + The user’s default role is granted USAGE on the database and schema of the Cortex search service.
      + The user is granted USAGE on the Cortex search service.

**Context and memory limits**
:   Cortex Agents use a finite context window, so very long conversations will lose earlier context. For persistent context, use custom instructions in the Agent configuration.

## Performance optimization

**Response time issues**
:   Response latency can vary because Snowflake Intelligence performs a complicated series of reasoning, retrieval, and analysis tasks using LLMs and queries. Performance can be affected by the load on your Snowflake warehouse and by the LLM services themselves. Requests often take longer than a minute to complete. For better performance, ensure [Cross-region inference](build-agents.md) is enabled, use the “auto” model in your [Model selection](build-agents.md), and consider adding additional Verified Queries. For more information about verified queries, see [Cortex Analyst Verified Query Repository](../cortex-analyst/verified-query-repository.md).

**Timeout issues**
:   First, check the [Snowflake Status page](https://status.snowflake.com/) for any reported incidents. Your requests might also timeout if Snowflake Intelligence is running in a cloud region with limited GPU compute resources. We recommend enabling [Cross-region inference](../cross-region-inference.md) to avoid limitations within a single region.

**Parallel requests**
:   You can request that the agent runs tool calls, such as Cortex Analyst and Cortex Search, in parallel. Add the following to the Agent orchestration instructions [Configure and interact with Agents](../cortex-agents-manage.md):

    ```yaml
    OVERALL: parallelize as many tool calls as possible for latency purposes.
    ```

    For information about orchestration instructions, see [Specify orchestration](../cortex-agents-manage.md).

**Model selection**
:   When creating an agent, you can directly specify the model that the agent should use. You can’t directly specify the model for the Cortex Search or Cortex Analyst tools. Instead, you can use role-based access control (RBAC) to limit which models these tools can use. For more information, see [Role-based access control (RBAC)](../aisql.md).

**Multiple calls to the same tool**
:   When generated queries are large, they can sometimes trigger size limits causing a retry. Cortex Analyst has a 2048 token generation limit for queries, which can trigger the size limit. A lot of custom agent response instructions can also trigger the size limit.

**Warehouse size**
:   Snowflake Intelligence makes a series of LLM-based decisions to create the best answer and call tools as needed. You can’t impact the performance of those decisions with a larger warehouse allocation.

    However, when running a Cortex Analyst tool as part of a Snowflake Intelligence request, the request is translated to SQL queries that are run using your warehouse. If your warehouse is too small or overloaded, that negatively impacts performance.

**Improve orchestration instructions and tool descriptions**
:   To resolve issues with tools and orchestration, prompt an LLM with the explanation of the issue and the desired outcome, as well as the existing description or instructions. The LLM can help automate the creation of the new prompt.

**Use verified queries**
:   To ensure predictable results for common or complex queries, add verified queries to your semantic view. This ensures that the agent uses an optimized and predictable query path for these requests.

**Identify latency bottlenecks**
:   To diagnose slow agent responses, you can use the agent monitoring tab in Snowsight to identify latency bottlenecks. These traces show the logical path the agent took and how long each step lasted. For more information about agent monitoring, see [Monitor Cortex Agent requests](../cortex-agents-monitor.md).

## Getting support

To get support for Snowflake Intelligence, you can use the [Support page in Snowsight](../../ui-support.md). You can also access the [Snowflake Forums](https://snowflake.discourse.group/c/ai-agents-snowflake-intelligence/103) for more help.

---
title: Tutorial 1: Build a simple search application with Cortex Search
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/tutorials/cortex-search-tutorial-1-search.md
section: Snowflake Cortex (AI & ML)
---

Cortex Search

Getting Started

# Tutorial 1: Build a simple search application with Cortex Search

## Introduction

This tutorial describes how to get started with Cortex Search for a simple search
application.

### What you will learn

* Create a Cortex Search Service from on an AirBnb listings dataset.
* Create a Streamlit in Snowflake app that lets you query your Cortex Search Service.

### Prerequisites

The following prerequisites are required to complete this tutorial:

* You have a Snowflake account and user with a role that grants the necessary
  privileges to create a database, tables, virtual warehouse objects, Cortex Search services, and Streamlit apps.

Refer to the [Snowflake in 20 minutes](../../../tutorials/snowflake-in-20minutes.md) for instructions to meet these requirements.

## Step 1: Setup

### Getting the sample data

You will use a sample dataset [hosted on Huggingface](https://huggingface.co/datasets/MongoDB/airbnb_embeddings)
, downloaded as a single JSON file. Download the file directly from your browser by following this link:

* [AirBnB listings dataset](https://drive.google.com/file/d/1mZwfcB4goPiyCaYe34gTE1Er8-9-hdGY/view?usp=sharing)

> **Note:**
>
> In a non-tutorial setting, you would bring your own data, possibly already in a Snowflake table.

### Creating the database, tables, and warehouse

Execute the following statements to create a database and a virtual warehouse needed for this tutorial.
After you complete the tutorial, you can drop these objects.

```sqlexample
CREATE DATABASE IF NOT EXISTS cortex_search_tutorial_db;

CREATE OR REPLACE WAREHOUSE cortex_search_tutorial_wh WITH
     WAREHOUSE_SIZE='X-SMALL'
     AUTO_SUSPEND = 120
     AUTO_RESUME = TRUE
     INITIALLY_SUSPENDED=TRUE;
```

Note the following:

* The `CREATE DATABASE` statement creates a database. The database automatically includes a schema named ‘public’.
* The `CREATE WAREHOUSE` statement creates an initially suspended warehouse. The
  statement also sets `AUTO_RESUME = true`, which starts the warehouse automatically when
  you execute SQL statements that require compute resources.

## Step 2: Load the data into Snowflake

Before you can create a search service, you must load the example data into Snowflake.

You can upload the dataset in Snowsight or using SQL. To upload in Snowsight:

1. Select the + Create button above the left navigation bar.
2. Then select Table » From File.
3. Select your newly-created warehouse as a warehouse for your table from the drop-down at the top right corner.
4. Drag and drop the JSON data file into the dialog.
5. Select the database you created above and specify the PUBLIC schema.
6. Finally, specify the creation of a new table called `airbnb_listings` and select Next.
7. In the Load Data into Table dialog, make the following adjustments. First, uncheck the `image_embeddings`, `images`, and
   `text_embeddings` columns, since those do not apply to this tutorial. Second, adjust the datatype of the `amenities` field to be
   ARRAY type.
8. Once you have made these adjustments, Select Load to proceed.
9. After a brief moment, you should see a confirmation page showing that the data has been loaded.
10. Select Query Data to open up a new Snowsight worksheet that you will use in the next step.

## Step 3: Create the search service

Create a search service over our new table by running the following SQL command.

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE cortex_search_tutorial_db.public.airbnb_svc
ON listing_text
ATTRIBUTES room_type, amenities
WAREHOUSE = cortex_search_tutorial_wh
TARGET_LAG = '1 hour'
AS
    SELECT
        room_type,
        amenities,
        price,
        cancellation_policy,
        ('Summary\n\n' || summary || '\n\n\nDescription\n\n' || description || '\n\n\nSpace\n\n' || space) as listing_text
    FROM
    cortex_search_tutorial_db.public.airbnb_listings;
```

Let’s break down the arguments in this command:
:   * The `ON` parameter specifies the column for queries to search over.
      In this case, it’s the `listing_text`, which is generated in the source query
      as a concatenation of several text columns in the base table.
    * The `ATTRIBUTES` parameter specifies the columns that you will be able to filter search results on.
      This example filers on `room_type` and `amenities` when issuing queries to the
      `listing_text` column.
    * The `WAREHOUSE` and `TARGET_LAG` parameters specify the user-provided warehouse and the desired
      freshness of the search service, respectively. This example specifies to use the `cortex_search_tutorial_wh`
      warehouse to create the index and perform refreshes, and to keep the service no more than `'1 hour'` behind the
      source table `AIRBNB_LISTINGS`.
    * The `AS` field defines the source table for the service. This example
      concatenates several text columns in the original table into the search column `listing_text` so that queries can
      search over multiple fields.

## Step 4: Create a Streamlit app

You can query the service with Python SDK (using the `snowflake` Python package). This tutorial
demonstrates using the Python SDK in a Streamlit in Snowflake application.

First, ensure your global Snowsight UI role is the same as the role used to create
the service in the service creation step.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. **Important**: Select the `cortex_search_tutorial_db` database and `public` schema for the app location.
5. In the left pane of the Streamlit in Snowflake editor, select Packages and add `snowflake` (version >= 0.8.0) to install the package in your application.
6. Replace the example application code with the following Streamlit app:

   ```python
   # Import python packages
   import streamlit as st
   from snowflake.core import Root
   from snowflake.snowpark.context import get_active_session

   # Constants
   DB = "cortex_search_tutorial_db"
   SCHEMA = "public"
   SERVICE = "airbnb_svc"
   BASE_TABLE = "cortex_search_tutorial_db.public.airbnb_listings"
   ARRAY_ATTRIBUTES = {"AMENITIES"}

   def get_column_specification():
       """
       Returns the name of the search column and a list of the names of the attribute columns
       for the provided cortex search service
       """
       session = get_active_session()
       search_service_result = session.sql(f"DESC CORTEX SEARCH SERVICE {DB}.{SCHEMA}.{SERVICE}").collect()[0]
       st.session_state.attribute_columns = search_service_result.attribute_columns.split(",")
       st.session_state.search_column = search_service_result.search_column
       st.session_state.columns = search_service_result.columns.split(",")

   def init_layout():
       st.title("Cortex AI Search")
       st.markdown(f"Querying service: `{DB}.{SCHEMA}.{SERVICE}`".replace('"', ''))

   def query_cortex_search_service(query, filter={}):
       """
       Queries the cortex search service in the session state and returns a list of results
       """
       session = get_active_session()
       cortex_search_service = (
           Root(session)
           .databases[DB]
           .schemas[SCHEMA]
           .cortex_search_services[SERVICE]
       )
       context_documents = cortex_search_service.search(
           query,
           columns=st.session_state.columns,
           filter=filter,
           limit=st.session_state.limit)
       return context_documents.results

   @st.cache_data
   def distinct_values_for_attribute(col_name, is_array_attribute=False):
       session = get_active_session()
       if is_array_attribute:
           values = session.sql(f'''
           SELECT DISTINCT value FROM {BASE_TABLE},
           LATERAL FLATTEN(input => {col_name})
           ''').collect()
       else:
           values = session.sql(f"SELECT DISTINCT {col_name} AS VALUE FROM {BASE_TABLE}").collect()
       return [ x["VALUE"].replace('"', "") for x in values ]

   def init_search_input():
       st.session_state.query = st.text_input("Query")

   def init_limit_input():
       st.session_state.limit = st.number_input("Limit", min_value=1, value=5)

   def init_attribute_selection():
       st.session_state.attributes = {}
       for col in st.session_state.attribute_columns:
           is_multiselect = col in ARRAY_ATTRIBUTES
           st.session_state.attributes[col] = st.multiselect(
               col,
               distinct_values_for_attribute(col, is_array_attribute=is_multiselect)
           )

   def display_search_results(results):
       """
       Display the search results in the UI
       """
       st.subheader("Search results")
       for i, result in enumerate(results):
           result = dict(result)
           container = st.expander(f"[Result {i+1}]", expanded=True)

           # Add the result text.
           container.markdown(result[st.session_state.search_column])

           # Add the attributes.
           for column, column_value in sorted(result.items()):
               if column == st.session_state.search_column:
                   continue
               container.markdown(f"**{column}**: {column_value}")

   def create_filter_object(attributes):
       """
       Create a filter object for the search query
       """
       and_clauses = []
       for column, column_values in attributes.items():
           if len(column_values) == 0:
               continue
           if column in ARRAY_ATTRIBUTES:
               for attr_value in column_values:
                   and_clauses.append({"@contains": { column: attr_value }})
           else:
               or_clauses = [{"@eq": {column: attr_value}} for attr_value in column_values]
               and_clauses.append({"@or": or_clauses })

       return {"@and": and_clauses} if and_clauses else {}

   def main():
       init_layout()
       get_column_specification()
       init_attribute_selection()
       init_limit_input()
       init_search_input()

       if not st.session_state.query:
           return
       results = query_cortex_search_service(
           st.session_state.query,
           filter = create_filter_object(st.session_state.attributes)
       )
       display_search_results(results)

   if __name__ == "__main__":
       st.set_page_config(page_title="Cortex AI Search and Summary", layout="wide")
       main()
   ```

Here’s a brief breakdown of the major components in the Streamlit-in-Snowflake code above:

* `get_column_specification` uses a DESCRIBE SQL query to get information about the attributes available in the search service and
  stores them in Streamlit state.
* `init_layout` sets up the header and intro of the page.
* `query_cortex_search_service` handles querying the Cortex Search Service via the Python client library.
* `create_filter_object` processes selected filter attributes from the Streamlit form into the right objects to be used by the
  Python library for querying Cortex Search.
* `distinct_values_for_attribute` determines which values are possible for each filterable attribute to populate the dropdown menus.
* `init_search_input`, `init_limit_input`, `init_attribute_selection` initialize inputs for the search query, limit of
  number of results, and attribute filters.
* `display_search_results` formats search results into Markdown elements displayed in the results page.

## Step 5: Clean up

### Clean up (optional)

Execute the following [DROP <object>](../../../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

```sqlexample
DROP DATABASE IF EXISTS cortex_search_tutorial_db;
DROP WAREHOUSE IF EXISTS cortex_search_tutorial_wh;
```

Dropping the database automatically removes all child database objects such as tables.

## Next steps

Congratulations! You have successfully built a simple search app on text data in Snowflake.
You can move on to [Tutorial 2](cortex-search-tutorial-2-chat.md)
to see how to layer on [Cortex LLM Functions](../../aisql.md) to build
an AI chatbot with Cortex Search.

### Additional resources

Additionally, you can continue learning using the following resources:

* [Cortex Search overview](../cortex-search-overview.md)
* [Query a Cortex Search Service](../query-cortex-search-service.md)

---
title: Tutorial 1: Providers set up and test a CKE
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/tutorials/setup-test-cke-tutorial.md
section: Snowflake Cortex (AI & ML)
---

Cortex Knowledge Extensions

# Tutorial 1: Providers set up and test a CKE

## Introduction

For providers, this tutorial describes how to set up and test your CKE.

### What you’ll learn

In this tutorial you’ll learn how to:

* Create Snowflake objects
* Load your data into Snowflake
* Chunk your documents
* Create the Cortex Search Service
* Verify the CKE is working correctly
* Share and test the CKE with a consumer account

### Prerequisites

The following prerequisites are required to complete this tutorial:

* You have a Snowflake account and user with a role that grants the necessary
  privileges to create a database, tables, virtual warehouse objects, Cortex Search services, and Streamlit apps.

Refer to the [Snowflake in 20 minutes](../../../tutorials/snowflake-in-20minutes.md) for instructions to meet these requirements.

## Step 1: Create Snowflake objects

The first step is to create Snowflake objects.

Use the accountadmin role.

```sqlexample
use role accountadmin;
```

Create a warehouse named `xsmall_cke_getting_started` for creating and updating the index.

```sqlexample
create warehouse xsmall_cke_getting_started warehouse_size=xsmall;
```

Create a separate role named `cke_owner`.

```sqlexample
create role cke_owner;
grant role cke_owner to user admin;
grant usage on warehouse xsmall_cke_getting_started to role cke_owner;
```

Create and use a database named `cke_getting_started`.

```sqlexample
grant create database on account to role cke_owner;
use role cke_owner;
create database cke_getting_started;
use database cke_getting_started;
```

Create and use a schema called `articles`.

```sqlexample
create schema articles;
use schema articles;
```

## Step 2: Load your data into Snowflake

The next step is to load your data into Snowflake. Refer to [Load data into Snowflake](../../../../guides-overview-loading-data.md) for more information.

The example code below stores data in a Snowflake table named `cke_simple_article` in the following format:

| Column name | Type | Description |
| --- | --- | --- |
| `DOCUMENT_ID` | `VARCHAR` | The unique identifier for the document. This is the primary key of the table. |
| `DOCUMENT_TITLE` | `VARCHAR` | The title of the document. |
| `SOURCE_URL` | `VARCHAR` | A URL linking to the source of a document. |
| `DOCUMENT_TEXT` | `VARCHAR` | The document contents, parsed as text. This is the content that will be indexed and searched. |

Note that you can include additional document metadata in your indexed dataset. In our example below, we include only `SOURCE_URL` and `DOCUMENT_ID`, but you can add more columns depending on your document source.

Create a simple table.

```sqlexample
create or replace table cke_simple_article (
    DOCUMENT_ID VARCHAR,
    DOCUMENT_TITLE VARCHAR,
    SOURCE_URL VARCHAR,
    text VARCHAR
);
```

Now insert some sample data into that table.

```sqlexample
INSERT INTO cke_simple_article (DOCUMENT_ID, DOCUMENT_TITLE, SOURCE_URL, TEXT)
VALUES
    ('DOC_001', 'Sample Article 1', 'https://example.com/article1', 'This is some sample text for the first article.'),
    ('DOC_002', 'Sample Article 2', 'https://example.com/article2', 'Another sample text entry for the second article.'),
    ('DOC_003', 'Sample Article 3', 'https://example.com/article3', 'Yet another piece of text for the third article.');

INSERT INTO cke_simple_article (
    DOCUMENT_ID,
    DOCUMENT_TITLE,
    SOURCE_URL,
    text
)
VALUES (
    'DOC-GREEN-001',
    'The Grand Opening of Greenfield Biosphere',
    'https://www.example.com/news/greenfield-biosphere',
    'Greenfield Biosphere, nestled in the heart of a once-industrial landscape, opened its doors to the public today amid great fanfare and curiosity. This ambitious environmental initiative, spanning over 120 acres of reclaimed land, has been designed to house thousands of diverse plant species and animals under one vast, transparent dome. Over the past decade, teams of botanists, engineers, and conservationists collaborated intensively to restore the soil quality, implement renewable energy solutions, and establish sustainable water sources. Their efforts have resulted in an oasis that stands as a testament to nature''s resilience and humanity''s unwavering determination to coexist with it.

    Upon entering the biosphere, visitors pass through a series of controlled airlocks that maintain precise temperature and humidity levels, ensuring the delicate balance required for each habitat. The moment they step inside, a multitude of colors and scents envelops them. Towering palm trees sway gently, nurtured by a carefully engineered irrigation system that recycles water across various sections of the dome. Exotic butterflies flutter past patches of vibrant orchids, while small reptiles scurry along the edge of meandering pathways. Every detail, from lighting angles to seed selection, has been meticulously planned to promote biodiversity in a space that once lay barren.

    Local officials and environmental organizations herald this project as a bold step toward reversing ecological decline. The region had suffered decades of industrial pollution, leaving the soil depleted and wildlife populations on the brink of collapse. Public interest soared once the Greenfield Biosphere project was announced, prompting unprecedented fundraising campaigns and private investments. Citizens volunteered their time to plant seedlings, build composting facilities, and educate children on the importance of ecological stewardship. Now, as thousands explore the dome on opening day, excitement mingles with a sense of responsibility, fueling hope that this initiative can serve as a catalyst for broader restoration efforts.

    Beyond merely a tourist attraction, the Greenfield Biosphere plays a crucial role in scientific research. Biologists and ecologists from universities around the globe have established research stations within the dome to study plant migration, cross-pollination, and microclimates. Through advanced sensor networks, they collect data on everything from soil moisture levels to carbon sequestration rates, aiming to develop cutting-edge conservation strategies. Already, preliminary findings suggest that certain flora species exhibit faster growth rates under partial shade, which could help inform future reforestation projects. This research extends to aquatic ecosystems as well, with scientists closely monitoring newly formed ponds and streams for indicators of ecosystem health.

    During the grand opening ceremony, Mayor Allison Pierce praised the community for its unwavering dedication to the biosphere''s development. She emphasized how interagency cooperation and community outreach were pivotal in transforming a polluted wasteland into a verdant sanctuary. In her address, she remarked on the significance of involving local youth, who contributed to the design through art projects and educational workshops. According to Mayor Pierce, the next phase of the project will include expanding the biosphere''s capacity for endangered species breeding programs. This could cement the region''s reputation as a global leader in ecological preservation and innovation.

    For many, the real highlight of the day was the unveiling of the arboretum wing, a temperature-controlled section featuring ancient tree species that have long faced threats from illegal logging and habitat loss. Towering redwoods, thought to be too large to grow under a dome, stand proudly after years of careful nurturing. Visitors stood in awe as the directors revealed that these trees'' root systems, painstakingly preserved and transplanted, are now thriving in custom-engineered soil mixtures. A sense of reverence filled the air, with many attendees describing the experience as spiritual. The seed of hope planted in the community has visibly taken root.

    The venture''s economic impact is another key talking point. Local shops and restaurants anticipate an influx of tourists, and hotels report reservations scheduled months in advance. Construction of new eco-lodges in the surrounding areas is already underway, promising a blend of comfortable accommodations with sustainable building practices. The city council has also approved additional funding to improve roads and public transportation to accommodate the expected rise in visitor numbers. Environmental advocates caution, however, that increased foot traffic could inadvertently strain the biosphere''s delicate ecosystems, calling for balanced planning and continued emphasis on conservation education.

    Inside the administrative office, a dedicated operations team monitors real-time data feeds, adjusting temperature, humidity, and nutrient levels to meet each species'' unique needs. Modular solar panels installed around the dome generate sufficient electricity to power the entire facility, showcasing how renewable energy can be integrated seamlessly with large-scale infrastructure. Outside, an innovative wastewater treatment plant recycles greywater for irrigation, minimizing resource consumption. The architects behind the biosphere believe these sustainable technologies can be replicated in other communities looking to rehabilitate degraded land, turning once-polluted sites into living laboratories for environmental stewardship.

    While the facility is only in its first phase, future expansions are already on the drawing board. There are plans to introduce a marine habitat zone featuring coral reef tanks that highlight threats to underwater ecosystems. Specially designed walkways will give visitors a close-up view of these aquatic wonders without disturbing the delicate organisms within. Meanwhile, education programs will be expanded to local schools, offering field trips where students can learn about biodiversity, climate change, and sustainable technologies. The hope is that exposure to this living exhibit will inspire the next generation of environmental scientists, engineers, and policymakers.

    As dusk settled over the glass dome, a soft, multi-colored illumination replaced the natural daylight, casting enchanting shadows across the tropical foliage. Families strolled slowly along the paths, pausing to read plaques about the origins of each plant or to marvel at the occasional flutter of nocturnal pollinators. Meanwhile, a gentle hum of conversation reverberated in the background, carrying sentiments of astonishment and gratitude. The first day at Greenfield Biosphere ended with a collective realization that, with mindful planning, community collaboration, and respect for nature''s inherent wisdom, it is indeed possible to transform a scarred landscape into a flourishing haven for life and innovation.'
);
```

## Step 3. Chunk your documents

Before creating a Cortex Search Service, we need to ensure that each “chunk” of indexed text is no more than approximately 375 words of text. To do this, we can apply a chunking algorithm via a Snowpark UDF that imports LangChain. First, we create a chunking UDF. Then, we apply that UDF to the `cke_simple_article` table and store the chunks in a `cke_simple_article_chunks` table. And finally, we verify that the chunks were created.

Run the example below to chunk the articles into parts for the Cortex Search Service. This process can take several minutes to complete.

```sqlexample
CREATE OR REPLACE FUNCTION text_chunker(text STRING)
    RETURNS TABLE (chunk VARCHAR)
    LANGUAGE PYTHON
    RUNTIME_VERSION = '3.9'
    HANDLER = 'text_chunker'
    PACKAGES = ('snowflake-snowpark-python', 'langchain')
    AS
$$
from snowflake.snowpark.types import StringType, StructField, StructType
from langchain.text_splitter import RecursiveCharacterTextSplitter
from snowflake.snowpark.files import SnowflakeFile
import logging
import pandas as pd

class text_chunker:

    def process(self, text: str):
        text_splitter = RecursiveCharacterTextSplitter(
            chunk_size = 2000,  # Adjust this as needed
            chunk_overlap = 300,  # Overlap to keep chunks contextual
            length_function = len
        )

        chunks = text_splitter.split_text(text)
        df = pd.DataFrame(chunks, columns=['chunk'])

        yield from df.itertuples(index=False, name=None)
$$;
```

Run the example below to split the documents into chunks for indexing.

```sqlexample
CREATE OR REPLACE TABLE cke_simple_article_chunks AS
    SELECT
        c.DOCUMENT_ID,
        c.DOCUMENT_TITLE,
        c.SOURCE_URL,
        t.chunk
    FROM cke_simple_article AS c, TABLE(text_chunker(CONCAT(c.DOCUMENT_TITLE, '\n', c.TEXT))) AS t;
```

Run the following to verify that the chunks were created.

```sqlexample
select * from cke_simple_article_chunks;
```

## Step 4. Create the Cortex Search Service

Now configure a Cortex Search Service named `cke_simple_cortex_search_service` to run on warehouse
`xsmall_cke_getting_started` and reference the chunked document table `cke_simple_article_chunks`. Note that this step can
take considerable time to complete, depending on the size of the database.

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE cke_simple_cortex_search_service
  ON chunk
  ATTRIBUTES document_title
  WAREHOUSE = xsmall_cke_getting_started
  TARGET_LAG = '1 hour'
  AS (
    SELECT
        chunk,
        document_title,
        source_url
      FROM cke_simple_article_chunks
  );
```

## Step 5. Test the CKE

To verify the CKE is working correctly you can issue a simple query to the Cortex Search Service. This will verify that the service has correctly indexed your documents and that relevant documents come back from queries. This query should return the first chunk of the article “The Greenfield Biosphere” with a link to the source URL.

```sqlexample
select snowflake.cortex.search_preview(
 'cke_getting_started.articles.cke_simple_cortex_search_service',
 '{ "query": "whats happening with the greenfield biosphere?", "columns": ["chunk","document_title","source_url"] }');
```

## Step 6: Share the CKE privately for testing

After the Cortex Search Service has been created and is correctly responding to queries, you can share it. This shared Cortex Search Service is the Cortex Knowledge Extension. In this step, you’ll create a [private listing](../../../../collaboration/provider-listings-creating-publishing.md) and share it with another account for testing. Then you’ll test the listing in the consumer account that you shared the CKE with.

### Create the share

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listing in the upper-right corner and select Specified Consumers.
4. Provide a title for the listing, and then click Next.
5. Click + Select for What’s in the listing?.
6. Select CKE_GETTING_STARTED.
7. Expand ARTICLES.
8. Expand Cortex Search Service.
9. Select CKE_SIMPLE_CORTEX_SEARCH_SERVICE, and then select Done.
10. Enter a description for the listing.
11. Under Add consumer accounts, add the Snowflake account that you want to share and test the Cortex Knowledge Extension with. Note that must be in the same region as the provider, and you must have access to this account.

### Test the share in a consumer account

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) using the consumer account that you shared the CKE with above.
2. In the navigation menu, select Data sharing » Internal sharing.
3. Here, you should see the CKE_GETTING_STARTED listing that you shared above. Select Get.
4. Open a new worksheet and run the SQL command below to verify that the account has access to the shared data.

   > ```sqlexample
   > select
   >   snowflake.cortex.search_preview(
   >    'CKE_GETTING_STARTED_GUIDE__FAKE_ARTICLES.ARTICLES.CKE_SIMPLE_CORTEX_SEARCH_SERVICE',
   >    '{ "query": "whats happening with the biosphere?", "columns": ["chunk","document_title"] }'
   >   );
   > ```
   >
   > > **Note:**
   > >
   > > If you specified name other than **CKE_GETTING_STARTED** in the Get dialog, you’ll need to change that in the snippet above.

At this point, you have a functional Cortex Knowledge Extension!

---
title: Tutorial 2: Build a simple chat application with Cortex Search
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/tutorials/cortex-search-tutorial-2-chat.md
section: Snowflake Cortex (AI & ML)
---

Cortex Search

Getting Started

# Tutorial 2: Build a simple chat application with Cortex Search

## Introduction

This tutorial describes how to use Cortex Search and the [COMPLETE (SNOWFLAKE.CORTEX)](../../../../sql-reference/functions/complete-snowflake-cortex.md) function
to setup a Retrieval-Augmented Generation (RAG) chatbot in Snowflake.

### What you will learn

* Create a Cortex Search Service based on a dataset downloaded from Kaggle.
* Create a Streamlit in Snowflake app that lets you query your Cortex Search Service.

### Prerequisites

The following prerequisites are required to complete this tutorial:

* You have a Snowflake account and user with a role that grants the necessary
  privileges to create a database, tables, virtual warehouse objects, Cortex Search services, and Streamlit apps.

Refer to the [Snowflake in 20 minutes](../../../tutorials/snowflake-in-20minutes.md) for instructions to meet these requirements.

## Step 1: Setup

### Getting the sample data

You will use a sample dataset hosted on Kaggle for this tutorial.
The Books dataset is a collection of book name, title and descriptions. You can download the dataset from the following link:

The complete dataset can be found on
[Kaggle](https://www.kaggle.com/datasets/elvinrustam/books-dataset/data).

> **Note:**
>
> In a non-tutorial setting, you would bring your own data, possibly already in a Snowflake table.

### Creating the database, schema, stage and warehouse

Run the following SQL code to set up the necessary database, schema, and warehouse:

```sqlexample
CREATE DATABASE IF NOT EXISTS cortex_search_tutorial_db;

CREATE OR REPLACE WAREHOUSE cortex_search_tutorial_wh WITH
    WAREHOUSE_SIZE='X-SMALL'
    AUTO_SUSPEND = 120
    AUTO_RESUME = TRUE
    INITIALLY_SUSPENDED=TRUE;

USE WAREHOUSE cortex_search_tutorial_wh;
```

Note the following:

* The `CREATE DATABASE` statement creates a database. The database automatically includes a schema named PUBLIC.
* The `CREATE WAREHOUSE` statement creates an initially suspended warehouse.

## Step 2: Load the data into Snowflake

First create a stage to store the files downloaded from Kaggle. This stage will hold the books dataset.

```sqlexample
CREATE OR REPLACE STAGE books_data_stage
    DIRECTORY = (ENABLE = TRUE)
    ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
```

Now upload the dataset. You can upload the dataset in Snowsight or using SQL. To upload in Snowsight:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select your database `cortex_search_tutorial_db`.
4. Select your schema `public`.
5. Select Stages and select `books_data_stage`.
6. On the top right, Select the + Files button.
7. Drag and drop files into the UI or select Browse to choose a file from the dialog window.
8. Select Upload to upload your file, `BooksDatasetClean.csv`
9. Select the three dots on the right of the file and select Load into table.
10. Name the table `BOOKS_DATASET_RAW` and select Next.
11. In the left panel of the load data dialog, choose First line contains header from the Header menu.
12. Then select Load.

## Step 3: Build the Chunks Table

Retrieval accuracy with Cortex Search tends to be higher when documents are shorter.
For more information, see [Tokens, model context windows, and text splitting](../cortex-search-overview.md).

Now, create a table to store the chunks of text extracted from the book descriptions using the [SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)](../../../../sql-reference/functions/split_text_recursive_character-snowflake-cortex.md) function.
Include the title and authors in the chunk to provide context:

```sqlexample
CREATE TABLE cortex_search_tutorial_db.public.book_description_chunks AS (
    SELECT
        books.title,
        books.authors,
        books.category,
        books.publisher,
        books.title || '\n' || books.authors || '\n' || chunk_value.value AS CHUNK
    FROM cortex_search_tutorial_db.public.books_dataset_raw books,
        LATERAL FLATTEN(
            input => SNOWFLAKE.CORTEX.SPLIT_TEXT_RECURSIVE_CHARACTER(
                books.description,
                'none',
                2000,
                300
            )
        ) AS chunk_value
);
```

Verify the table contents:

```sqlexample
SELECT chunk, * FROM book_description_chunks LIMIT 10;
```

## Step 4: Create a Cortex Search Service

Create a Cortex Search Service on the table to allow you to search through the chunks in the `book_description_chunks`:

```sqlexample
CREATE CORTEX SEARCH SERVICE cortex_search_tutorial_db.public.books_dataset_service
    ON CHUNK
    WAREHOUSE = cortex_search_tutorial_wh
    TARGET_LAG = '1 hour'
    AS (
        SELECT *
        FROM cortex_search_tutorial_db.public.book_description_chunks
    );
```

## Step 5: Create a Streamlit app

You can query the service with Python SDK (using the `snowflake` Python package). This tutorial
demonstrates using the Python SDK in a Streamlit in Snowflake application.

First, ensure your global Snowsight UI role is the same as the role used to create
the service in the service creation step.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. **Important**: Select the `cortex_search_tutorial_db` database and `public` schema for the app location.
5. In the left pane of the Streamlit in Snowflake editor, select Packages and add `snowflake` (version >= 0.8.0) to install the package in your application.
6. Replace the example application code with the following Streamlit app:

   ```python
   import streamlit as st
   from snowflake.core import Root # requires snowflake>=0.8.0
   from snowflake.snowpark.context import get_active_session

   MODELS = [
       "mistral-large",
       "snowflake-arctic",
       "llama3-70b",
       "llama3-8b",
   ]

   def init_messages():
       """
       Initialize the session state for chat messages. If the session state indicates that the
       conversation should be cleared or if the "messages" key is not in the session state,
       initialize it as an empty list.
       """
       if st.session_state.clear_conversation or "messages" not in st.session_state:
           st.session_state.messages = []

   def init_service_metadata():
       """
       Initialize the session state for cortex search service metadata. Query the available
       cortex search services from the Snowflake session and store their names and search
       columns in the session state.
       """
       if "service_metadata" not in st.session_state:
           services = session.sql("SHOW CORTEX SEARCH SERVICES;").collect()
           service_metadata = []
           if services:
               for s in services:
                   svc_name = s["name"]
                   svc_search_col = session.sql(
                       f"DESC CORTEX SEARCH SERVICE {svc_name};"
                   ).collect()[0]["search_column"]
                   service_metadata.append(
                       {"name": svc_name, "search_column": svc_search_col}
                   )

           st.session_state.service_metadata = service_metadata

   def init_config_options():
       """
       Initialize the configuration options in the Streamlit sidebar. Allow the user to select
       a cortex search service, clear the conversation, toggle debug mode, and toggle the use of
       chat history. Also provide advanced options to select a model, the number of context chunks,
       and the number of chat messages to use in the chat history.
       """
       st.sidebar.selectbox(
           "Select cortex search service:",
           [s["name"] for s in st.session_state.service_metadata],
           key="selected_cortex_search_service",
       )

       st.sidebar.button("Clear conversation", key="clear_conversation")
       st.sidebar.toggle("Debug", key="debug", value=False)
       st.sidebar.toggle("Use chat history", key="use_chat_history", value=True)

       with st.sidebar.expander("Advanced options"):
           st.selectbox("Select model:", MODELS, key="model_name")
           st.number_input(
               "Select number of context chunks",
               value=5,
               key="num_retrieved_chunks",
               min_value=1,
               max_value=10,
           )
           st.number_input(
               "Select number of messages to use in chat history",
               value=5,
               key="num_chat_messages",
               min_value=1,
               max_value=10,
           )

       st.sidebar.expander("Session State").write(st.session_state)

   def query_cortex_search_service(query):
       """
       Query the selected cortex search service with the given query and retrieve context documents.
       Display the retrieved context documents in the sidebar if debug mode is enabled. Return the
       context documents as a string.

       Args:
           query (str): The query to search the cortex search service with.

       Returns:
           str: The concatenated string of context documents.
       """
       db, schema = session.get_current_database(), session.get_current_schema()

       cortex_search_service = (
           root.databases[db]
           .schemas[schema]
           .cortex_search_services[st.session_state.selected_cortex_search_service]
       )

       context_documents = cortex_search_service.search(
           query, columns=[], limit=st.session_state.num_retrieved_chunks
       )
       results = context_documents.results

       service_metadata = st.session_state.service_metadata
       search_col = [s["search_column"] for s in service_metadata
                       if s["name"] == st.session_state.selected_cortex_search_service][0]

       context_str = ""
       for i, r in enumerate(results):
           context_str += f"Context document {i+1}: {r[search_col]} \n" + "\n"

       if st.session_state.debug:
           st.sidebar.text_area("Context documents", context_str, height=500)

       return context_str

   def get_chat_history():
       """
       Retrieve the chat history from the session state limited to the number of messages specified
       by the user in the sidebar options.

       Returns:
           list: The list of chat messages from the session state.
       """
       start_index = max(
           0, len(st.session_state.messages) - st.session_state.num_chat_messages
       )
       return st.session_state.messages[start_index : len(st.session_state.messages) - 1]

   def complete(model, prompt):
       """
       Generate a completion for the given prompt using the specified model.

       Args:
           model (str): The name of the model to use for completion.
           prompt (str): The prompt to generate a completion for.

       Returns:
           str: The generated completion.
       """
       return session.sql("SELECT snowflake.cortex.complete(?,?)", (model, prompt)).collect()[0][0]

   def make_chat_history_summary(chat_history, question):
       """
       Generate a summary of the chat history combined with the current question to extend the query
       context. Use the language model to generate this summary.

       Args:
           chat_history (str): The chat history to include in the summary.
           question (str): The current user question to extend with the chat history.

       Returns:
           str: The generated summary of the chat history and question.
       """
       prompt = f"""
           [INST]
           Based on the chat history below and the question, generate a query that extend the question
           with the chat history provided. The query should be in natural language.
           Answer with only the query. Do not add any explanation.

           <chat_history>
           {chat_history}
           </chat_history>
           <question>
           {question}
           </question>
           [/INST]
       """

       summary = complete(st.session_state.model_name, prompt)

       if st.session_state.debug:
           st.sidebar.text_area(
               "Chat history summary", summary.replace("$", "\$"), height=150
           )

       return summary

   def create_prompt(user_question):
       """
       Create a prompt for the language model by combining the user question with context retrieved
       from the cortex search service and chat history (if enabled). Format the prompt according to
       the expected input format of the model.

       Args:
           user_question (str): The user's question to generate a prompt for.

       Returns:
           str: The generated prompt for the language model.
       """
       if st.session_state.use_chat_history:
           chat_history = get_chat_history()
           if chat_history != []:
               question_summary = make_chat_history_summary(chat_history, user_question)
               prompt_context = query_cortex_search_service(question_summary)
           else:
               prompt_context = query_cortex_search_service(user_question)
       else:
           prompt_context = query_cortex_search_service(user_question)
           chat_history = ""

       prompt = f"""
               [INST]
               You are a helpful AI chat assistant with RAG capabilities. When a user asks you a question,
               you will also be given context provided between <context> and </context> tags. Use that context
               with the user's chat history provided in the between <chat_history> and </chat_history> tags
               to provide a summary that addresses the user's question. Ensure the answer is coherent, concise,
               and directly relevant to the user's question.

               If the user asks a generic question which cannot be answered with the given context or chat_history,
               just say "I don't know the answer to that question.

               Don't saying things like "according to the provided context".

               <chat_history>
               {chat_history}
               </chat_history>
               <context>
               {prompt_context}
               </context>
               <question>
               {user_question}
               </question>
               [/INST]
               Answer:
           """
       return prompt

   def main():
       st.title(f":speech_balloon: Chatbot with Snowflake Cortex")

       init_service_metadata()
       init_config_options()
       init_messages()

       icons = {"assistant": "❄️", "user": "👤"}

       # Display chat messages from history on app rerun
       for message in st.session_state.messages:
           with st.chat_message(message["role"], avatar=icons[message["role"]]):
               st.markdown(message["content"])

       disable_chat = (
           "service_metadata" not in st.session_state
           or len(st.session_state.service_metadata) == 0
       )
       if question := st.chat_input("Ask a question...", disabled=disable_chat):
           # Add user message to chat history
           st.session_state.messages.append({"role": "user", "content": question})
           # Display user message in chat message container
           with st.chat_message("user", avatar=icons["user"]):
               st.markdown(question.replace("$", "\$"))

           # Display assistant response in chat message container
           with st.chat_message("assistant", avatar=icons["assistant"]):
               message_placeholder = st.empty()
               question = question.replace("'", "")
               with st.spinner("Thinking..."):
                   generated_response = complete(
                       st.session_state.model_name, create_prompt(question)
                   )
                   message_placeholder.markdown(generated_response)

           st.session_state.messages.append(
               {"role": "assistant", "content": generated_response}
           )

   if __name__ == "__main__":
       session = get_active_session()
       root = Root(session)
       main()
   ```

## Step 6: Try out the app

Enter a query in the text box to try out your new app. Some sample queries you can try are:

* `I like Harry Potter. Can you recommend more books I will like?`
* `Can you recommend me books on Greek Mythology?`

## Step 7: Clean up

### Clean up (optional)

Execute the following [DROP <object>](../../../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

```sqlexample
DROP DATABASE IF EXISTS cortex_search_tutorial_db;
DROP WAREHOUSE IF EXISTS cortex_search_tutorial_wh;
```

Dropping the database automatically removes all child database objects such as tables.

## Next steps

Congratulations! You have successfully built a simple search app on text data in Snowflake.
You can move on to [Tutorial 3](cortex-search-tutorial-3-chat-advanced.md)
to see how to build an AI chatbot with Cortex Search from a set of PDF files.

### Additional resources

Continue learning using the following resources:

* [Cortex Search overview](../cortex-search-overview.md)
* [Query a Cortex Search Service](../query-cortex-search-service.md)

---
title: Tutorial 2: Consumer interfaces with a CKE in a Streamlit chatbot
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/tutorials/query-cortex-search-service-tutorial.md
section: Snowflake Cortex (AI & ML)
---

Cortex Knowledge Extensions

# Tutorial 2: Consumer interfaces with a CKE in a Streamlit chatbot

## Introduction

In this tutorial, you’ll set up a custom retrieval augmented generation (RAG) pipeline to integrate knowledge from a Cortex Knowledge Extension into a chatbot.

This is how it works:

1. A Streamlit app accepts a prompt from a user.
2. The prompt is given to the Cortex Search Query API with the configured Cortex Knowledge Extension / Cortex Search Service.
3. The Streamlit app takes the retrieved documents, puts them into the context window with a custom prompt, and sends it to the Cortex LLM Complete function with a specified LLM.

> **Note:**
>
> This tutorial assumes that you have a CKE already available. Go to the [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace) and access one, or use [Tutorial 1](setup-test-cke-tutorial.md) to create one.

## Step 1. Set up your environment

The example below sets up an environment and creates a Streamlit application that you can run in Snowflake to test out a Cortex Knowledge Extension. This assumes the Consumer has access to a Cortex Knowledge Extension that’s been shared by a Provider.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.

   The Create Streamlit App window opens.
4. Enter a name for your app.
5. In the App location dropdown, select the database and schema for your app.
6. In the Warehouse dropdown, select the warehouse where you want to run your app and execute queries.
7. Select Create.

   > The Streamlit in Snowflake editor opens an example Streamlit app in Viewer mode. Viewer mode allows you to see how the Streamlit application appears to users.
8. Verify that the correct packages and versions are installed as in the image below.

## Step 2: Create a Streamlit app for your CKE chat tester

The code below is a simple Streamlit app that allows you to test the CKE. The app uses the Snowflake ML Python package to call the Cortex Knowledge Extension and the Snowflake LLM Complete function.
The app allows you to select a Cortex Knowledge Extension, enter a question, and receive a response from the LLM. The app also provides options for debugging and using chat history.

1. In the navigation menu, select Projects » Streamlit.
2. Select the Streamlit app you created in the previous step.
3. In the Streamlit in Snowflake editor, select Edit » Edit code.

   > The Streamlit in Snowflake editor opens in Edit mode.
4. In the left navigation bar, select streamlit_app.py to open the code editor.
5. In the code editor, delete the existing code.
6. Copy the code below and paste it into the code editor, then select Save » Save and run.

   > The Streamlit in Snowflake editor runs the app and opens it in Viewer mode.

```python
import streamlit as st
from snowflake.core import Root
from snowflake.cortex import Complete
from snowflake.snowpark.context import get_active_session

MODELS = [
    "llama3.1-8b",
    "llama3.1-70b",
    "llama3.1-405b"
]

def init_messages():
    """Initialize session state messages if not present or if we need to clear."""
    if st.session_state.get("clear_conversation") or "messages" not in st.session_state:
        st.session_state.messages = []
        st.session_state.clear_conversation = False

def init_service_metadata():
    """Load or refresh cortex search services from Snowflake."""
    services = session.sql("SHOW CORTEX SEARCH SERVICES IN ACCOUNT;").collect()
    service_metadata = []
    if services:
        for s in services:
            svc_name = s["name"]
            svc_schema = s["schema_name"]
            svc_db = s["database_name"]
            svc_search_col = session.sql(
                f"DESC CORTEX SEARCH SERVICE {svc_db}.{svc_schema}.{svc_name};"
            ).collect()[0]["search_column"]
            service_metadata.append(
                {
                    "name": svc_name,
                    "search_column": svc_search_col,
                    "db": svc_db,
                    "schema": svc_schema,
                }
            )

    st.session_state.service_metadata = service_metadata

    # Initialize selected_cortex_search_service if it doesn't exist
    if "selected_cortex_search_service" not in st.session_state and service_metadata:
        st.session_state.selected_cortex_search_service = service_metadata[0]["name"]

    selected_entry = st.session_state.get("selected_cortex_search_service")

    if selected_entry:
        # Find matching service metadata
        selected_service_metadata = next(
            (svc for svc in st.session_state.service_metadata if svc["name"] == selected_entry),
            None
        )

        if selected_service_metadata:
            # Store them in session_state
            st.session_state.selected_schema = selected_service_metadata["schema"]
            st.session_state.selected_db = selected_service_metadata["db"]
        elif st.session_state.get("debug", False):
            st.write("No matching service found for:", selected_entry)

def init_config_options():
    if "service_metadata" not in st.session_state or not st.session_state.service_metadata:
        st.sidebar.warning("No Cortex Knowledge Extensions available")
        return

    st.sidebar.selectbox(
        "Select Cortex Knowledge Extension",
        [s["name"] for s in st.session_state.service_metadata],
        key="selected_cortex_search_service",
    )
    if st.sidebar.button("Clear conversation"):
        st.session_state.clear_conversation = True

    # If st.sidebar.toggle isn't available, use st.sidebar.checkbox:
    st.sidebar.checkbox("Debug", key="debug", value=False)
    st.sidebar.checkbox("Use chat history", key="use_chat_history", value=True)

    with st.sidebar.expander("Advanced options"):
        st.selectbox("Select model:", MODELS, key="model_name")
        st.number_input(
            "Select number of context chunks",
            value=5,
            key="num_retrieved_chunks",
            min_value=1,
            max_value=10,
        )
        st.number_input(
            "Select number of messages to use in chat history",
            value=5,
            key="num_chat_messages",
            min_value=1,
            max_value=10,
        )

    st.sidebar.expander("Session State").write(st.session_state)

def get_chat_history():
    """Get the last N messages from session state."""
    start_index = max(
        0, len(st.session_state.messages) - st.session_state.num_chat_messages
    )
    return st.session_state.messages[start_index : len(st.session_state.messages) - 1]

def complete(model, prompt):
    """Use the chosen Snowflake cortex model to complete a prompt."""
    return Complete(model=model, prompt=prompt).replace("$", "\\$")

def make_chat_history_summary(chat_history, question):
    """
    Summarize the chat history plus the question using your LLM,
    to refine the final search query.
    """
    prompt = f"""
    [INST]
    Based on the chat history below and the question, generate a query that extend the question
    with the chat history provided. The query should be in natural language.
    Answer with only the query. Do not add any explanation.

    <chat_history>
    {chat_history}
    </chat_history>
    <question>
    {question}
    </question>
    [/INST]
    """
    summary = complete(st.session_state.model_name, prompt)
    if st.session_state.debug:
        st.sidebar.text_area("Chat history summary", summary.replace("$", "\\$"), height=150)
    return summary

def query_cortex_search_service(query, columns=[], filter={}):
    """
    Query the selected cortex search service with the given query and retrieve context documents.
    """
    # Safely retrieve from session_state
    db = st.session_state.get("selected_db")
    schema = st.session_state.get("selected_schema")

    if st.session_state.get("debug", False):
        st.sidebar.write("Query:", query)
        st.sidebar.write("DB:", db)
        st.sidebar.write("Schema:", schema)
        st.sidebar.write("Service:", st.session_state.selected_cortex_search_service)

    cortex_search_service = (
        root.databases[db]
        .schemas[schema]
        .cortex_search_services[st.session_state.selected_cortex_search_service]
    )

    context_documents = cortex_search_service.search(
        query,
        columns=columns,
        filter=filter,
        limit=st.session_state.num_retrieved_chunks
    )

    results = context_documents.results

    if st.session_state.get("debug", False):
        st.sidebar.write("Search Results:", results)

    service_metadata = st.session_state.service_metadata
    search_col = [
        s["search_column"] for s in service_metadata
        if s["name"] == st.session_state.selected_cortex_search_service
    ][0].lower()

    # Build a context string for the prompt
    context_str = ""
    context_str_template = (
        "Source: {source_url}\n"
        "Source ID: {id}\n"
        "Excerpt: {chunk}\n\n\n"
    )
    for i, r in enumerate(results):
        context_str += context_str_template.format(
            id=i+1,
            chunk=r[search_col],
            source_url=r["source_url"],
            title=r["document_title"],
        )
    if st.session_state.debug:
        st.sidebar.text_area("Context documents", context_str, height=500)

    return context_str, results

def create_prompt(user_question):
    """
    Combine user question, context from the search service, and chat history
    to create a final prompt for the LLM.
    """
    if st.session_state.use_chat_history:
        chat_history = get_chat_history()
        if chat_history != []:
            question_summary = make_chat_history_summary(chat_history, user_question)
            prompt_context, results = query_cortex_search_service(
                question_summary, columns=["chunk", "source_url", "document_title"]
            )
        else:
            prompt_context, results = query_cortex_search_service(
                user_question, columns=["chunk", "source_url", "document_title"]
            )
    else:
        prompt_context, results = query_cortex_search_service(
            user_question, columns=["chunk", "source_url", "document_title"]
        )
        chat_history = ""

    prompt = f"""
You are a helpful AI assistant with RAG capabilities. When a user asks you a question, you will also be given excerpts from relevant documentation to help answer the question accurately. Please use the context provided and cite your sources using the citation format provided.

Context from documentation:
{prompt_context}

User question:
{user_question}

OUTPUT:
"""

    # Add prompt to debug window
    if st.session_state.get("debug", False):
        st.sidebar.text_area("Complete Prompt", prompt, height=300)

    return prompt, results

def post_process_citations(generated_response, results):
    """
    Replace {{.StartCitation}}X{{.EndCitation}} with bracketed references to actual product links.

    NOTE: If the model references chunks out of range (like 4 if only 2 exist),
    consider adding logic to remap or drop invalid references.
    """
    used_results = set()
    for i, ref in enumerate(results):
        old_str = f"{{.StartCitation}}{i+1}{{.EndCitation}}"
        replacement = f"[{i+1}]{ref['source_url']})"
        new_resp = generated_response.replace(old_str, replacement)
        if new_resp != generated_response:
            used_results.add(i)
        generated_response = new_resp
    return generated_response, used_results

# ------------------------------------------------------------------------------
# (2) Main Application (with improved UI)
# ------------------------------------------------------------------------------

def main():
    # Optional: wide layout, custom page title
    st.set_page_config(
        page_title="Cortex Knowledge Extension Chat Tester",
        layout="wide",
    )

    # Optional: a bit of custom CSS for bubble spacing
    custom_css = """
    <style>
    [data-testid="stChatMessage"] {
        border-radius: 8px;
        margin-bottom: 1rem;
        padding: 10px;
    }
    </style>
    """
    st.markdown(custom_css, unsafe_allow_html=True)

    # Title or subheader for your app
    st.subheader("Cortex Knowledge Extension Chat Tester")

    # Initialize metadata and config
    init_service_metadata()
    init_config_options()
    init_messages()

    # Icons for user/assistant
    icons = {"assistant": "❄️", "user": "👤"}

    # Display chat messages from history on app rerun
    for message in st.session_state.messages:
        with st.chat_message(message["role"], avatar=icons[message["role"]]):
            st.markdown(message["content"])

    # If there are no services, disable chat
    disable_chat = (
        "service_metadata" not in st.session_state
        or len(st.session_state.service_metadata) == 0
    )

    # Chat input
    if question := st.chat_input("Ask a question...", disabled=disable_chat):
        # 1. Store user message
        st.session_state.messages.append({"role": "user", "content": question})

        # 2. Display user bubble
        with st.chat_message("user", avatar=icons["user"]):
            st.markdown(question.replace("$", "\\$"))

        # 3. Prepare assistant response
        with st.chat_message("assistant", avatar=icons["assistant"]):
            message_placeholder = st.empty()

            # Clean the question
            question_safe = question.replace("'", "")

            # Build prompt and retrieve docs
            prompt, results = create_prompt(question_safe)

            with st.spinner("Thinking..."):
                generated_response = complete(st.session_state.model_name, prompt)

                # Post-process citations
                post_processed_response, used_results = post_process_citations(generated_response, results)

                # Build references table (only if there are results)
                if results:
                    markdown_table = "\n\n###### References \n\n| Index | Title | Source |\n|------|-------|--------|\n"
                    for i, ref in enumerate(results):
                        # Include all references that were found
                        markdown_table += (
                            f"| {i+1} | {ref.get('document_title', 'N/A')} | "
                            f"{ref.get('source_url', 'N/A')} |\n"
                        )
                else:
                    markdown_table = "\n\n*No references found*"

                # Show final assistant message (with references)
                message_placeholder.markdown(post_processed_response + markdown_table)

        # 4. Append final assistant message to chat history
        st.session_state.messages.append(
            {"role": "assistant", "content": post_processed_response + markdown_table}
        )

# ------------------------------------------------------------------------------
# (3) Entry Point
# ------------------------------------------------------------------------------
if __name__ == "__main__":
    session = get_active_session()
    root = Root(session)
    main()
```

## Step 3: Test the app

1. Click Run to launch the Streamlit application.
2. Select a CKE from the drop down menu on the left pane under Select Cortex Knowledge Extension.`
3. Ask a question in the chat text box.

---
title: Tutorial 3: Add a CKE to Snowflake Intelligence
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/tutorials/add-cke-to-snowflake-intelligence-tutorial.md
section: Snowflake Cortex (AI & ML)
---

Cortex Knowledge Extensions

Cortex Agent API

# Tutorial 3: Add a CKE to Snowflake Intelligence

## Step-by-step instructions

It’s easy to add a Cortex Knowledge Extension to Snowflake Intelligence. Once you have a CKE in your account, you can add it to Snowflake Intelligence by adding the CKE to an Agent in Snowsight.

> **Important:**
>
> Before you get started, make sure the Snowflake Intelligence has access to the CKE:
>
> ```sqlexample
> -- Grant Snowflake Intelligence the right access to the CKE so it can be added as an agent
> grant imported privileges on database <CKE_DATABASE_NAME> to role <SNOWFLAKE_INTELLIGENCE_ROLE>;
> ```

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. In the Agents screen, select Create Agent and give the Agent a name.
4. Under Knowledge, click + Search service.
5. Select the Database that has the CKE, and select the Search Service for the CKE.
6. Give the CKE a display name.
7. Indicate the column in the CKE that references the URL of the underlying content. This is useful for giving users additional context and an opportunity to dig deeper into attribution.
8. Click Create.
9. Navigate to Snowflake Intelligence on the left side.
10. Select on the drop down under the textbox to select the new Agent with your CKE tied to it.
11. Ask a question with the selected Agent and see the cited answers with links back to the source content via the CKE.

---
title: Tutorial 3: Build a PDF chatbot with Cortex Search
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/tutorials/cortex-search-tutorial-3-chat-advanced.md
section: Snowflake Cortex (AI & ML)
---

Cortex Search

Getting Started

# Tutorial 3: Build a PDF chatbot with Cortex Search

## Introduction

This tutorial describes how to build a chatbot from a dataset of PDF documents
using Cortex Search. In [Tutorial 2](cortex-search-tutorial-2-chat.md),
you learned how to build a chatbot from text data that was already extracted from its source.
This tutorial walks through an example of extracting that text from the PDFs using a basic Python
UDF, then ingesting the extracted data into a Cortex Search Service.

### What you will learn

* Extract text from a set of PDF files in a stage using a Python UDF.
* Create a Cortex Search Service from the extracted text.
* Create a Streamlit-in-Snowflake chat app that lets you ask questions about the
  data extracted from the PDF documents.

### Prerequisites

The following prerequisites are required to complete this tutorial:

* You have a Snowflake account and user with a role that grants the necessary
  privileges to create a database, tables, virtual warehouse objects, Cortex Search Services, and Streamlit apps.

Refer to the [Snowflake in 20 minutes](../../../tutorials/snowflake-in-20minutes.md) for instructions to meet these requirements.

## Step 1: Setup

### Get the PDF data

You will use a sample dataset of the Federal Open Market Committee (FOMC) meeting minutes for this tutorial.
This is a sample of twelve 10-page documents with meeting notes from FOMC meetings from 2023 and 2024.
Download the files directly from your browser by following this link:

* [FOMC minutes sample](https://drive.google.com/file/d/1C6TdVjy6d-GnasGO6ZrIEVJQRcedDQxG/view?usp=sharing)

The complete set of FOMC minutes can be found at the
[US Federal Reserve’s website](https://www.federalreserve.gov/monetarypolicy/fomccalendars.htm).

> **Note:**
>
> In a non-tutorial setting, you would bring your own data, possibly already in a Snowflake stage.

### Create the database, tables, and warehouse

Execute the following statements to create a database and a virtual warehouse needed for this tutorial.
After you complete the tutorial, you can drop these objects.

```sqlexample
CREATE DATABASE IF NOT EXISTS cortex_search_tutorial_db;

CREATE OR REPLACE WAREHOUSE cortex_search_tutorial_wh WITH
     WAREHOUSE_SIZE='X-SMALL'
     AUTO_SUSPEND = 120
     AUTO_RESUME = TRUE
     INITIALLY_SUSPENDED=TRUE;

 USE WAREHOUSE cortex_search_tutorial_wh;
```

> **Note:**
>
> * The `CREATE DATABASE` statement creates a database. The database automatically includes a schema named PUBLIC.
> * The `CREATE WAREHOUSE` statement creates an initially suspended warehouse.

## Step 2: Load the data into Snowflake

First create a Snowflake stage to store the files that contain the data. This stage will hold the meeting minutes PDF files.

```sqlexample
CREATE OR REPLACE STAGE cortex_search_tutorial_db.public.fomc
    DIRECTORY = (ENABLE = TRUE)
    ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
```

> **Note:**
>
> The directory and encryption are configured for generating presigned_url for a file. If you don’t need to generate presigned_url,
> you can skip these configurations.

Now upload the dataset. You can upload the dataset in Snowsight or using SQL. To upload in Snowsight:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select your database `cortex_search_tutorial_db`.
4. Select your schema `public`.
5. Select Stages and select `fomc`.
6. On the top right, Select the + Files button.
7. Drag and drop files into the UI or select Browse to choose a file from the dialog window.
8. Select Upload to upload your file.

## Step 3: Parse PDF files

In this step, we’ll extract raw text from PDFs and then split it up into chunks for ingestion into the
search service.

First, we will use the [Parsing documents with AI_PARSE_DOCUMENT](../../parse-document.md) function to extract the text and layout
information from the PDFs into a new table, `RAW_TEXT`.

```sqlexample
CREATE OR REPLACE TABLE cortex_search_tutorial_db.public.raw_text AS
SELECT
    RELATIVE_PATH,
    TO_VARCHAR (
        SNOWFLAKE.CORTEX.PARSE_DOCUMENT (
            '@cortex_search_tutorial_db.public.fomc',
            RELATIVE_PATH,
            {'mode': 'LAYOUT'} ):content
        ) AS EXTRACTED_LAYOUT
FROM
    DIRECTORY('@cortex_search_tutorial_db.public.fomc')
WHERE
    RELATIVE_PATH LIKE '%.pdf';
```

Then, we will use [SPLIT_TEXT_MARKDOWN_HEADER](../../../../sql-reference/functions/split_text_markdown_header-snowflake-cortex.md) to
split the documents up into chunks of maximum size 2000 characters each, using the top two markdown header levels as chunk boundaries.
We’ll insert the chunks into a new table `DOC_CHUNKS`.

```sqlexample
CREATE OR REPLACE TABLE cortex_search_tutorial_db.public.doc_chunks AS
SELECT
    relative_path,
    BUILD_SCOPED_FILE_URL(@cortex_search_tutorial_db.public.fomc, relative_path) AS file_url,
    (
        relative_path || ':\n'
        || coalesce('Header 1: ' || c.value['headers']['header_1'] || '\n', '')
        || coalesce('Header 2: ' || c.value['headers']['header_2'] || '\n', '')
        || c.value['chunk']
    ) AS chunk,
    'English' AS language
FROM
    cortex_search_tutorial_db.public.raw_text,
    LATERAL FLATTEN(SNOWFLAKE.CORTEX.SPLIT_TEXT_MARKDOWN_HEADER(
        EXTRACTED_LAYOUT,
        OBJECT_CONSTRUCT('#', 'header_1', '##', 'header_2'),
        2000, -- chunks of 2000 characters
        300 -- 300 character overlap
    )) c;
```

## Step 4: Create search service

Create a search service over your new table by running the following SQL command:

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE cortex_search_tutorial_db.public.fomc_meeting
    ON chunk
    ATTRIBUTES language
    WAREHOUSE = cortex_search_tutorial_wh
    TARGET_LAG = '1 hour'
    AS (
    SELECT
        chunk,
        relative_path,
        file_url,
        language
    FROM cortex_search_tutorial_db.public.doc_chunks
    );
```

This command specifies the `attributes`, which are the columns that you’ll be able to filter search results on, as well as the
warehouse and target lag. The search column is designated as `chunk`, which is generated in the source query as a
concatenation of several text columns in the base table. The other columns in the source query can be included in response to a search request.

## Step 5: Create a Streamlit app

You can query the service with Python SDK (using the `snowflake` Python package). This tutorial
demonstrates using the Python SDK in a Streamlit in Snowflake application.

First, ensure your global Snowsight UI role is the same as the role used to create
the service in the service creation step.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. **Important**: Select the `cortex_search_tutorial_db` database and the `public` schema for the app location.
5. In the left pane of the Streamlit in Snowflake editor, select Packages and add `snowflake` (version >= 0.8.0) and `snowflake-ml-python` to install the required packages in your
   application.
6. Replace the example application code with the following Streamlit app:

   ```python
   import streamlit as st
   from snowflake.core import Root # requires snowflake>=0.8.0
   from snowflake.cortex import Complete
   from snowflake.snowpark.context import get_active_session

   """
   The available models are subject to change. Check the model availability for the REST API:
   https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-llm-rest-api#model-availability
   """
   MODELS = [
       "mistral-large2",
       "llama3.1-70b",
       "llama3.1-8b",
   ]

   def init_messages():
       """
       Initialize the session state for chat messages. If the session state indicates that the
       conversation should be cleared or if the "messages" key is not in the session state,
       initialize it as an empty list.
       """
       if st.session_state.clear_conversation or "messages" not in st.session_state:
           st.session_state.messages = []

   def init_service_metadata():
       """
       Initialize the session state for cortex search service metadata. Query the available
       cortex search services from the Snowflake session and store their names and search
       columns in the session state.
       """
       if "service_metadata" not in st.session_state:
           services = session.sql("SHOW CORTEX SEARCH SERVICES;").collect()
           service_metadata = []
           if services:
               for s in services:
                   svc_name = s["name"]
                   svc_search_col = session.sql(
                       f"DESC CORTEX SEARCH SERVICE {svc_name};"
                   ).collect()[0]["search_column"]
                   service_metadata.append(
                       {"name": svc_name, "search_column": svc_search_col}
                   )

           st.session_state.service_metadata = service_metadata

   def init_config_options():
       """
       Initialize the configuration options in the Streamlit sidebar. Allow the user to select
       a cortex search service, clear the conversation, toggle debug mode, and toggle the use of
       chat history. Also provide advanced options to select a model, the number of context chunks,
       and the number of chat messages to use in the chat history.
       """
       st.sidebar.selectbox(
           "Select cortex search service:",
           [s["name"] for s in st.session_state.service_metadata],
           key="selected_cortex_search_service",
       )

       st.sidebar.button("Clear conversation", key="clear_conversation")
       st.sidebar.toggle("Debug", key="debug", value=False)
       st.sidebar.toggle("Use chat history", key="use_chat_history", value=True)

       with st.sidebar.expander("Advanced options"):
           st.selectbox("Select model:", MODELS, key="model_name")
           st.number_input(
               "Select number of context chunks",
               value=5,
               key="num_retrieved_chunks",
               min_value=1,
               max_value=10,
           )
           st.number_input(
               "Select number of messages to use in chat history",
               value=5,
               key="num_chat_messages",
               min_value=1,
               max_value=10,
           )

       st.sidebar.expander("Session State").write(st.session_state)

   def query_cortex_search_service(query, columns = [], filter={}):
       """
       Query the selected cortex search service with the given query and retrieve context documents.
       Display the retrieved context documents in the sidebar if debug mode is enabled. Return the
       context documents as a string.

       Args:
           query (str): The query to search the cortex search service with.

       Returns:
           str: The concatenated string of context documents.
       """
       db, schema = session.get_current_database(), session.get_current_schema()

       cortex_search_service = (
           root.databases[db]
           .schemas[schema]
           .cortex_search_services[st.session_state.selected_cortex_search_service]
       )

       context_documents = cortex_search_service.search(
           query, columns=columns, filter=filter, limit=st.session_state.num_retrieved_chunks
       )
       results = context_documents.results

       service_metadata = st.session_state.service_metadata
       search_col = [s["search_column"] for s in service_metadata
                       if s["name"] == st.session_state.selected_cortex_search_service][0].lower()

       context_str = ""
       for i, r in enumerate(results):
           context_str += f"Context document {i+1}: {r[search_col]} \n" + "\n"

       if st.session_state.debug:
           st.sidebar.text_area("Context documents", context_str, height=500)

       return context_str, results

   def get_chat_history():
       """
       Retrieve the chat history from the session state limited to the number of messages specified
       by the user in the sidebar options.

       Returns:
           list: The list of chat messages from the session state.
       """
       start_index = max(
           0, len(st.session_state.messages) - st.session_state.num_chat_messages
       )
       return st.session_state.messages[start_index : len(st.session_state.messages) - 1]

   def complete(model, prompt):
       """
       Generate a completion for the given prompt using the specified model.

       Args:
           model (str): The name of the model to use for completion.
           prompt (str): The prompt to generate a completion for.

       Returns:
           str: The generated completion.
       """
       return Complete(model, prompt).replace("$", "\$")

   def make_chat_history_summary(chat_history, question):
       """
       Generate a summary of the chat history combined with the current question to extend the query
       context. Use the language model to generate this summary.

       Args:
           chat_history (str): The chat history to include in the summary.
           question (str): The current user question to extend with the chat history.

       Returns:
           str: The generated summary of the chat history and question.
       """
       prompt = f"""
           [INST]
           Based on the chat history below and the question, generate a query that extend the question
           with the chat history provided. The query should be in natural language.
           Answer with only the query. Do not add any explanation.

           <chat_history>
           {chat_history}
           </chat_history>
           <question>
           {question}
           </question>
           [/INST]
       """

       summary = complete(st.session_state.model_name, prompt)

       if st.session_state.debug:
           st.sidebar.text_area(
               "Chat history summary", summary.replace("$", "\$"), height=150
           )

       return summary

   def create_prompt(user_question):
       """
       Create a prompt for the language model by combining the user question with context retrieved
       from the cortex search service and chat history (if enabled). Format the prompt according to
       the expected input format of the model.

       Args:
           user_question (str): The user's question to generate a prompt for.

       Returns:
           str: The generated prompt for the language model.
       """
       if st.session_state.use_chat_history:
           chat_history = get_chat_history()
           if chat_history != []:
               question_summary = make_chat_history_summary(chat_history, user_question)
               prompt_context, results = query_cortex_search_service(
                   question_summary,
                   columns=["chunk", "file_url", "relative_path"],
                   filter={"@and": [{"@eq": {"language": "English"}}]},
               )
           else:
               prompt_context, results = query_cortex_search_service(
                   user_question,
                   columns=["chunk", "file_url", "relative_path"],
                   filter={"@and": [{"@eq": {"language": "English"}}]},
               )
       else:
           prompt_context, results = query_cortex_search_service(
               user_question,
               columns=["chunk", "file_url", "relative_path"],
               filter={"@and": [{"@eq": {"language": "English"}}]},
           )
           chat_history = ""

       prompt = f"""
               [INST]
               You are a helpful AI chat assistant with RAG capabilities. When a user asks you a question,
               you will also be given context provided between <context> and </context> tags. Use that context
               with the user's chat history provided in the between <chat_history> and </chat_history> tags
               to provide a summary that addresses the user's question. Ensure the answer is coherent, concise,
               and directly relevant to the user's question.

               If the user asks a generic question which cannot be answered with the given context or chat_history,
               just say "I don't know the answer to that question.

               Don't saying things like "according to the provided context".

               <chat_history>
               {chat_history}
               </chat_history>
               <context>
               {prompt_context}
               </context>
               <question>
               {user_question}
               </question>
               [/INST]
               Answer:
               """
       return prompt, results

   def main():
       st.title(f":speech_balloon: Chatbot with Snowflake Cortex")

       init_service_metadata()
       init_config_options()
       init_messages()

       icons = {"assistant": "❄️", "user": "👤"}

       # Display chat messages from history on app rerun
       for message in st.session_state.messages:
           with st.chat_message(message["role"], avatar=icons[message["role"]]):
               st.markdown(message["content"])

       disable_chat = (
           "service_metadata" not in st.session_state
           or len(st.session_state.service_metadata) == 0
       )
       if question := st.chat_input("Ask a question...", disabled=disable_chat):
           # Add user message to chat history
           st.session_state.messages.append({"role": "user", "content": question})
           # Display user message in chat message container
           with st.chat_message("user", avatar=icons["user"]):
               st.markdown(question.replace("$", "\$"))

           # Display assistant response in chat message container
           with st.chat_message("assistant", avatar=icons["assistant"]):
               message_placeholder = st.empty()
               question = question.replace("'", "")
               prompt, results = create_prompt(question)
               with st.spinner("Thinking..."):
                   generated_response = complete(
                       st.session_state.model_name, prompt
                   )
                   # build references table for citation
                   markdown_table = "###### References \n\n| PDF Title | URL |\n|-------|-----|\n"
                   for ref in results:
                       markdown_table += f"| {ref['relative_path']} | [Link]({ref['file_url']}) |\n"
                   message_placeholder.markdown(generated_response + "\n\n" + markdown_table)

           st.session_state.messages.append(
               {"role": "assistant", "content": generated_response}
           )

   if __name__ == "__main__":
       session = get_active_session()
       root = Root(session)
       main()
   ```

## Step 6: Try out the app

In the right pane of the Streamlit in Snowflake editor window, you’ll see a preview of your Streamlit app. It should look similar to the following
screenshot:

Enter a query in the text box to try out your new app. Some sample queries you can try are:

* Example session 1: multi-turn question-answering
  :   + `How was gpd growth in q4 23?`
      + `How was unemployment in the same quarter?`
* Example session 2: summarizing multiple documents
  :   + `How has the fed's view of the market change over the course of 2024?`
* Example session 3: abstaining when the documents don’t contain the right answer
  :   + `What was janet yellen's opinion about 2024 q1?`

## Step 7: Clean up

### Clean up (optional)

Execute the following [DROP <object>](../../../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

```sqlexample
DROP DATABASE IF EXISTS cortex_search_tutorial_db;
DROP WAREHOUSE IF EXISTS cortex_search_tutorial_wh;
```

Dropping the database automatically removes all child database objects such as tables.

## Next steps

Congratulations! You have successfully built a search app from a set of PDF files in Snowflake.

### Additional resources

You can continue learning using the following resources:

* [Cortex Search overview](../cortex-search-overview.md)
* [Query a Cortex Search Service](../query-cortex-search-service.md)

---
title: Tutorial: Answer questions about time-series revenue data with Cortex Analyst
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-analyst/tutorials/tutorial-1.md
section: Snowflake Cortex (AI & ML)
---

Cortex Analyst

Getting Started

# Tutorial: Answer questions about time-series revenue data with Cortex Analyst

## Introduction

Cortex Analyst transforms natural-language questions about your data into results by generating and executing SQL queries.
This tutorial describes how to set up Cortex Analyst to respond to questions about a time-series revenue data set.

### What you will learn

* Establish a semantic model for the data set.
* Create a Streamlit app that queries Cortex Analyst.

### Prerequisites

The following prerequisites are required to complete this tutorial:

* You have a Snowflake account and user with a role that grants the necessary
  privileges to create a database, schema, tables, stage, and virtual warehouse objects.
* You have [Streamlit](https://pypi.org/project/streamlit/) set up on your local system.

Refer to the [Snowflake in 20 minutes](../../../tutorials/snowflake-in-20minutes.md) for instructions to meet these requirements.

## Step 1: Setup

### Getting the sample data

You will use a sample dataset downloaded
[from GitHub](https://github.com/Snowflake-Labs/sfguide-getting-started-with-cortex-analyst/tree/main/data).
Download the following data files to your system:

* `daily_revenue.csv`
* `product.csv`
* `region.csv`

Also download the [semantic model YAML](https://github.com/Snowflake-Labs/sfguide-getting-started-with-cortex-analyst/tree/main/revenue_timeseries.yaml) from GitHub.

You might want to take a look at this semantic model before proceeding. The semantic model supplements the SQL schema of
each table with additional information that helps Cortex Analyst understand questions about the data. For more
information, see [Using SQL commands to create and manage semantic views](../../../views-semantic/sql.md).

> **Note:**
>
> In a non-tutorial setting, you would bring your own data, possibly already in a Snowflake table, and develop
> your own semantic model.

### Creating the Snowflake objects

Use Snowsight, the Snowflake UI, to create the Snowflake objects needed for this tutorial. After you complete the
tutorial, you can drop these objects.

> **Note:**
>
> Use a role that can create databases, schemas, warehouses, stages, and tables.

To create the objects:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets, and then select the + button. A new SQL worksheet appears.
3. Paste the SQL code below into the worksheet, then select the Run All from the drop-down menu at the top right
   of the worksheet.

```sqlexample
/*--
• Database, schema, warehouse, and stage creation
--*/

USE ROLE SECURITYADMIN;

CREATE ROLE IF NOT EXISTS cortex_user_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE cortex_user_role;

GRANT ROLE cortex_user_role TO USER <user>;

USE ROLE sysadmin;

-- Create demo database
CREATE OR REPLACE DATABASE cortex_analyst_demo;

-- Create schema
CREATE OR REPLACE SCHEMA cortex_analyst_demo.revenue_timeseries;

-- Create warehouse
CREATE OR REPLACE WAREHOUSE cortex_analyst_wh
    WAREHOUSE_SIZE = 'large'
    WAREHOUSE_TYPE = 'standard'
    AUTO_SUSPEND = 60
    AUTO_RESUME = TRUE
    INITIALLY_SUSPENDED = TRUE
COMMENT = 'Warehouse for Cortex Analyst demo';

GRANT USAGE ON WAREHOUSE cortex_analyst_wh TO ROLE cortex_user_role;
GRANT OPERATE ON WAREHOUSE cortex_analyst_wh TO ROLE cortex_user_role;

GRANT OWNERSHIP ON SCHEMA cortex_analyst_demo.revenue_timeseries TO ROLE cortex_user_role;
GRANT OWNERSHIP ON DATABASE cortex_analyst_demo TO ROLE cortex_user_role;

USE ROLE cortex_user_role;

-- Use the created warehouse
USE WAREHOUSE cortex_analyst_wh;

USE DATABASE cortex_analyst_demo;
USE SCHEMA cortex_analyst_demo.revenue_timeseries;

-- Create stage for raw data
CREATE OR REPLACE STAGE raw_data DIRECTORY = (ENABLE = TRUE);

/*--
• Fact and Dimension Table Creation
--*/

-- Fact table: daily_revenue
CREATE OR REPLACE TABLE cortex_analyst_demo.revenue_timeseries.daily_revenue (
    date DATE,
    revenue FLOAT,
    cogs FLOAT,
    forecasted_revenue FLOAT,
    product_id INT,
    region_id INT
);

-- Dimension table: product_dim
CREATE OR REPLACE TABLE cortex_analyst_demo.revenue_timeseries.product_dim (
    product_id INT,
    product_line VARCHAR(16777216)
);

-- Dimension table: region_dim
CREATE OR REPLACE TABLE cortex_analyst_demo.revenue_timeseries.region_dim (
    region_id INT,
    sales_region VARCHAR(16777216),
    state VARCHAR(16777216)
);
```

The SQL above creates the following objects:

* A database named `cortex_analyst_demo`
* A schema within that database called `revenue_timeseries`
* Three tables in that schema: `daily_revenue`, `product_dim`, and `region_dim`
* A stage named `raw_data` that will hold the raw data we will load into these tables
* A virtual warehouse named `cortex_analyst_wh`

> **Note:**
>
> The virtual warehouse is initially suspended. It starts automatically when you run a query.

## Step 2: Load the data into Snowflake

To get the data from the CSV files into Snowflake, you will upload them to the stage, then load the data from the stage
into the tables. At the same time, you will upload the semantic model YAML file for use in a later step.

The files you will upload are:

* `daily_revenue.csv`
* `product.csv`
* `region.csv`
* `revenue_timeseries.yaml`

To upload the files in Snowsight:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Add Data, and then select Load files into a stage.
3. Drag the four files you downloaded in the previous step into the Snowsight window.
4. Choose the database `cortex_analyst_demo` and the stage `raw_data`, then select the Upload button to upload the files.

Now that you have uploaded the files, load the data from the CSV files by executing the SQL commands below in a Snowsight worksheet.

```sqlexample
USE WAREHOUSE cortex_analyst_wh;

COPY INTO cortex_analyst_demo.revenue_timeseries.daily_revenue
FROM @raw_data
FILES = ('daily_revenue.csv')
FILE_FORMAT = (
    TYPE=CSV,
    SKIP_HEADER=1,
    FIELD_DELIMITER=',',
    TRIM_SPACE=FALSE,
    FIELD_OPTIONALLY_ENCLOSED_BY=NONE,
    REPLACE_INVALID_CHARACTERS=TRUE,
    DATE_FORMAT=AUTO,
    TIME_FORMAT=AUTO,
    TIMESTAMP_FORMAT=AUTO
    EMPTY_FIELD_AS_NULL = FALSE
    error_on_column_count_mismatch=false
)
ON_ERROR=CONTINUE
FORCE = TRUE ;

COPY INTO cortex_analyst_demo.revenue_timeseries.product_dim
FROM @raw_data
FILES = ('product.csv')
FILE_FORMAT = (
    TYPE=CSV,
    SKIP_HEADER=1,
    FIELD_DELIMITER=',',
    TRIM_SPACE=FALSE,
    FIELD_OPTIONALLY_ENCLOSED_BY=NONE,
    REPLACE_INVALID_CHARACTERS=TRUE,
    DATE_FORMAT=AUTO,
    TIME_FORMAT=AUTO,
    TIMESTAMP_FORMAT=AUTO
    EMPTY_FIELD_AS_NULL = FALSE
    error_on_column_count_mismatch=false
)
ON_ERROR=CONTINUE
FORCE = TRUE ;

COPY INTO cortex_analyst_demo.revenue_timeseries.region_dim
FROM @raw_data
FILES = ('region.csv')
FILE_FORMAT = (
    TYPE=CSV,
    SKIP_HEADER=1,
    FIELD_DELIMITER=',',
    TRIM_SPACE=FALSE,
    FIELD_OPTIONALLY_ENCLOSED_BY=NONE,
    REPLACE_INVALID_CHARACTERS=TRUE,
    DATE_FORMAT=AUTO,
    TIME_FORMAT=AUTO,
    TIMESTAMP_FORMAT=AUTO
    EMPTY_FIELD_AS_NULL = FALSE
    error_on_column_count_mismatch=false
)
ON_ERROR=CONTINUE
FORCE = TRUE ;
```

> **Note:**
>
> Only the result of the last command is shown in the output pane. You can run the commands line by line to see the results of each command.

## Step 3: Create a Streamlit app to talk to your data through Cortex Analyst

To create a Streamlit app that uses Cortex Analyst:

1. Create a Python file locally called `analyst_demo.py`.
2. Copy the code below into the file.
3. Replace the placeholder values with your account details.
4. Run the Streamlit app using `streamlit run analyst_demo.py`.

```python
from typing import Any, Dict, List, Optional

import pandas as pd
import requests
import snowflake.connector
import streamlit as st

DATABASE = "CORTEX_ANALYST_DEMO"
SCHEMA = "REVENUE_TIMESERIES"
STAGE = "RAW_DATA"
FILE = "revenue_timeseries.yaml"
WAREHOUSE = "cortex_analyst_wh"

# replace values below with your Snowflake connection information
HOST = "<host>"
ACCOUNT = "<account>"
USER = "<user>"
PASSWORD = "<password>"
ROLE = "<role>"

if 'CONN' not in st.session_state or st.session_state.CONN is None:
    st.session_state.CONN = snowflake.connector.connect(
        user=USER,
        password=PASSWORD,
        account=ACCOUNT,
        host=HOST,
        port=443,
        warehouse=WAREHOUSE,
        role=ROLE,
    )

def send_message(prompt: str) -> Dict[str, Any]:
    """Calls the REST API and returns the response."""
    request_body = {
        "messages": [{"role": "user", "content": [{"type": "text", "text": prompt}]}],
        "semantic_model_file": f"@{DATABASE}.{SCHEMA}.{STAGE}/{FILE}",
    }
    resp = requests.post(
        url=f"https://{HOST}/api/v2/cortex/analyst/message",
        json=request_body,
        headers={
            "Authorization": f'Snowflake Token="{st.session_state.CONN.rest.token}"',
            "Content-Type": "application/json",
        },
    )
    request_id = resp.headers.get("X-Snowflake-Request-Id")
    if resp.status_code < 400:
        return {**resp.json(), "request_id": request_id}  # type: ignore[arg-type]
    else:
        raise Exception(
            f"Failed request (id: {request_id}) with status {resp.status_code}: {resp.text}"
        )

def process_message(prompt: str) -> None:
    """Processes a message and adds the response to the chat."""
    st.session_state.messages.append(
        {"role": "user", "content": [{"type": "text", "text": prompt}]}
    )
    with st.chat_message("user"):
        st.markdown(prompt)
    with st.chat_message("assistant"):
        with st.spinner("Generating response..."):
            response = send_message(prompt=prompt)
            request_id = response["request_id"]
            content = response["message"]["content"]
            display_content(content=content, request_id=request_id)  # type: ignore[arg-type]
    st.session_state.messages.append(
        {"role": "assistant", "content": content, "request_id": request_id}
    )

def display_content(
    content: List[Dict[str, str]],
    request_id: Optional[str] = None,
    message_index: Optional[int] = None,
) -> None:
    """Displays a content item for a message."""
    message_index = message_index or len(st.session_state.messages)
    if request_id:
        with st.expander("Request ID", expanded=False):
            st.markdown(request_id)
    for item in content:
        if item["type"] == "text":
            st.markdown(item["text"])
        elif item["type"] == "suggestions":
            with st.expander("Suggestions", expanded=True):
                for suggestion_index, suggestion in enumerate(item["suggestions"]):
                    if st.button(suggestion, key=f"{message_index}_{suggestion_index}"):
                        st.session_state.active_suggestion = suggestion
        elif item["type"] == "sql":
            with st.expander("SQL Query", expanded=False):
                st.code(item["statement"], language="sql")
            with st.expander("Results", expanded=True):
                with st.spinner("Running SQL..."):
                    df = pd.read_sql(item["statement"], st.session_state.CONN)
                    if len(df.index) > 1:
                        data_tab, line_tab, bar_tab = st.tabs(
                            ["Data", "Line Chart", "Bar Chart"]
                        )
                        data_tab.dataframe(df)
                        if len(df.columns) > 1:
                            df = df.set_index(df.columns[0])
                        with line_tab:
                            st.line_chart(df)
                        with bar_tab:
                            st.bar_chart(df)
                    else:
                        st.dataframe(df)

st.title("Cortex Analyst")
st.markdown(f"Semantic Model: `{FILE}`")

if "messages" not in st.session_state:
    st.session_state.messages = []
    st.session_state.suggestions = []
    st.session_state.active_suggestion = None

for message_index, message in enumerate(st.session_state.messages):
    with st.chat_message(message["role"]):
        display_content(
            content=message["content"],
            request_id=message.get("request_id"),
            message_index=message_index,
        )

if user_input := st.chat_input("What is your question?"):
    process_message(prompt=user_input)

if st.session_state.active_suggestion:
    process_message(prompt=st.session_state.active_suggestion)
    st.session_state.active_suggestion = None
```

When you run the app, it prompts you to enter a question. Start with “What questions can I ask?” and try some of its suggestions.

## Step 4: Clean up

### Clean up (optional)

Execute the following [DROP <object>](../../../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

```sqlexample
DROP DATABASE IF EXISTS cortex_analyst_demo;
DROP WAREHOUSE IF EXISTS cortex_analyst_wh;
```

Dropping the database automatically removes all child database objects such as tables.

## Next steps

Congratulations! You have successfully built a simple Cortex Analyst app to “talk to your data” in Snowflake.

### Additional resources

Continue learning using the following resources:

* [Cortex Analyst overview](../../cortex-analyst.md)
* [YAML specification for semantic views](../../../views-semantic/semantic-view-yaml-spec.md)
* [Verified Query Repository](../verified-query-repository.md)
* [REST API](../rest-api.md)

---
title: Tutorials
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-knowledge-extensions/overview-tutorials.md
section: Snowflake Cortex (AI & ML)
---

# Tutorials

These tutorials provide step-by-step instructions for you to explore how to use Cortex Knowledge Extensions.

[Tutorial 1: Providers set up and test a CKE](tutorials/setup-test-cke-tutorial.md)
:   Walks through the process that Providers follow to create, test, and list a Cortex Knowledge Extension on the Marketplace.

[Tutorial 2: Consumer interfaces with a CKE in a Streamlit chatbot](tutorials/query-cortex-search-service-tutorial.md)
:   Walks through the process that Consumers follow to test Cortex Knowledge Extension that was shared by a provider. This tutorial is intended for users who are familiar with the Cortex Search Service and want to learn how to use it with a CKE.

[Tutorial 3: Add a CKE to Snowflake Intelligence](tutorials/add-cke-to-snowflake-intelligence-tutorial.md)
:   Walks through the process that Consumers follow to add a Cortex Knowledge Extension to Snowflake Intelligence. This tutorial is intended for users who are familiar with Snowflake Intelligence and want to learn how to use it with a CKE.

---
title: Understanding cost for Cortex Search Services
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-search/cortex-search-costs.md
section: Snowflake Cortex (AI & ML)
---

# Understanding cost for Cortex Search Services

## Cost categories

Cortex Search Services incur the following types of costs:

| Category | Description |
| --- | --- |
| Virtual warehouse compute | A Cortex Search Service requires a [virtual warehouse](../../cost-understanding-compute.md) to refresh the service: to run queries against base objects when they are initialized and refreshed, including orchestrating text embedding jobs and building the search index. These operations use compute resources, which consume [credits](../../cost-understanding-compute.md). If no changes are identified during a refresh, virtual warehouse credits aren’t consumed since there’s no new data to refresh. |
| EMBED_TEXT tokens compute | A Cortex Search Service automatically embeds each text row in the search column specified in the `ON` parameter into vector space to enable semantic search, which incurs a credit cost per token embedded. This involves calling [EMBED_TEXT_768](../../../sql-reference/functions/embed_text-snowflake-cortex.md) or [EMBED_TEXT_1024](../../../sql-reference/functions/embed_text_1024-snowflake-cortex.md) to convert each document as a series of numbers that encodes its meaning. Embeddings are computed each time a row is inserted or updated. Embeddings are processed incrementally in the evaluation of the source query, so the embedding cost is only incurred for added or changed documents. See [Vector Embeddings](../vector-embeddings.md) for more information on vector embedding costs. |
| Multi-index Cortex Search | Multi-index Cortex Search Services have costs dependent on how you embed tokens and the number of columns you index. Larger embedding vectors or higher numbers of index columns incur higher costs. Embeddings are computed each time a row is inserted or updated. Embeddings are processed incrementally in the evaluation of the source query, so the embedding cost is only incurred for added or changed documents. |
| Serving compute | A Cortex Search Service uses multi-tenant serving compute, separate from a user-provided Virtual Warehouse, to establish a low-latency, high-throughput service. The compute cost for this component is incurred per GB per month (GB/mo) of uncompressed indexed data, where indexed data is the user-provided data in the Cortex Search source query, plus vector embeddings computed on the user’s behalf. You incur these costs while the service is available to respond to queries, even if no queries are served during a given period. For the Cortex Search Serving credit rate per GB/mo of indexed data, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). |
| Storage | Cortex Search Services materialize the source query into a table stored in your account. This table is transformed into data structures that are optimized for low-latency serving, also stored in your account. Storage for the table and intermediate data structures are based on a flat rate per terabyte (TB). |
| Cloud services compute | Cortex Search Services use [Cloud Services compute](../../cost-understanding-compute.md) to identify changes in underlying base objects and whether the virtual warehouse needs to be invoked. Cloud services compute cost is subject to the constraint that Snowflake only bills if the daily cloud services cost is greater than 10% of the daily warehouse cost for the account. |

This topic provides information on these costs, as well as recommendations for managing these costs effectively.

## Managing indexing costs

You may find the following tips useful in managing the indexing costs of a Cortex Search Service:

Minimize warehouse size
:   Most services do not see improved indexing performance beyond a LARGE warehouse and many need only MEDIUM. Most of the
    compute time used in building an index is consumed by the text embedding function, which does not benefit from more
    cores or additional memory when it already has sufficient resources.

Suspend indexing when freshness isn’t important
:   [Suspend indexing](../../../sql-reference/sql/alter-cortex-search.md) (or increase target lag) when you don’t need changes
    in your documents to be immediately propagated to the search service (that is, when freshness isn’t as important
    during some period).

Set target lag according to business requirements
:   Not every search application requires real-time indexing. A target lag that is too low may cause your index to be
    refreshed more frequently than necessary. For example, if your source data updates every five minutes, but the
    consumer of the data only queries the search service once an hour, set the target lag to one hour, not five minutes.

Define primary keys
:   Defining primary keys on your Cortex Search Service can result in significant reductions to both the cost and latency
    of indexing. Services with primary keys can make use of an optimized refresh path when the underlying data
    changes, particularly when the number of changes since the last refresh is small and the last refresh occurred within
    the previous week. For more information on defining primary keys, see [Primary keys](cortex-search-overview.md).

Bundle changes together
:   There is a fixed component to the cost of an update, so fewer, bigger updates are less expensive than more frequent,
    smaller updates. Likewise, any change to any value within a row triggers the search column in that row to be
    embedded again, even if the data within that search column is unchanged, so it is better to accumulate all the changes
    to a row into a single update. For more information about vector embedding costs, see [Vector Embeddings](../vector-embeddings.md).

Minimize changes to the source data
:   Any change to the schema of the source query causes a full refresh of the service, including vector embeddings and
    indexes. When you create a large service, consider including extra payload columns for later use, so you don’t need to
    trigger a full refresh by changing the schema when you need to add a column. The cost of the additional columns is low.

    > **Tip:**
    >
    > Materializing data in a table in the source query with a CREATE OR REPLACE command causes the service to fully
    > refresh and embed all vectors again. It’s better to update the source table incrementally (for example, with MERGE INTO). For more information about vector embedding costs, see [Vector Embeddings](../vector-embeddings.md).

Keep the source query as simple as possible
:   Joins or other complex operations can add to indexing cost (and may be better to apply during ETL or at another
    stage). Refer to the Dynamic Tables Best Practices for more information on optimizing pipelines.

## Managing serving costs

You may find the following tips useful in managing the serving costs of a Cortex Search Service:

Suspend serving when it isn’t serving queries
:   A running search service incurs costs even if it is not serving queries. [Suspend the service](../../../sql-reference/sql/alter-cortex-search.md)
    when it is not needed, for example during development. It typically takes only a few minutes to resume a suspended
    service.

## Observing costs

To learn more about the costs of your Cortex Search services, use the following [Account Usage](../../../sql-reference/account-usage.md) views.

* [CORTEX_SEARCH_DAILY_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_search_daily_usage_history.md) contains daily totals for EMBED_TEXT tokens compute and serving credit compute usage per service. Snowflake
  intends to also provide virtual warehouse usage in this view in the future.
* [CORTEX_SEARCH_SERVING_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_search_serving_usage_history.md) includes hourly serving credits per service.

Snowflake intends to make this information available in the Cortex Search administration interface in the future.

## Estimating costs

### EMBED_TEXT tokens compute

EMBED_TEXT tokens compute is charged per token of text in the search column, per document, charged in to
on the cost of the credit rate of the selected embedding model. This compute cost
is incurred for each row that is inserted or updated, including for each row in the ON column during the initialization
of the service and every insert or update thereafter. For information on the per-token
cost of each embedding model, see [Cortex Search Embedding Models](cortex-search-overview.md):

For example, if you create a service on a source query with 10 million rows, each with 500 tokens, and the selected embedding model incurs
0.05 credits per 1 million tokens, you would expect to pay the following for the initial refresh:

> (0.05 credits per 1 million tokens) \* (10,000,000 rows) \* (500 tokens per row) / (1,000,000 tokens)
>
> = **250 credits**

For each row inserted or updated thereafter, you’d incur a cost of 0.05 credits per 1 million tokens.

> **Tip:**
>
> As an approximation, one token is equivalent to about 3/4 of an English word, or around 4 characters.
> To get an accurate estimate of tokens per row, use the [COUNT_TOKENS](../../../sql-reference/functions/count_tokens-snowflake-cortex.md)
> function with a representative sample of your actual data.

### Serving compute

Serving compute is charged per gigabyte-month of indexed data, where indexed data is the user-provided data
in the Cortex Search source query, plus vector embeddings computed on the user’s behalf. This is an ongoing cost
that is incurred as long as the service’s serving status is resumed. This cost is based on the number of rows indexed,
the size of the total indexed data, and the dimensionality of the selected vector embedding model. For information on the dimensionality
of each embedding model, see [Cortex Search Embedding Models](cortex-search-overview.md):

For example, if you have a service with 10 million rows, the selected embedding model has dimension of 768, each row
in the source query is around 1,000 bytes (including the search column), and the credit cost per GB/mo of indexed data is 6.3,
you would expect to pay the following cost per month:

> (6.3 credits per GB) \* (10,000,000 rows) \* (768 dimensions \* 4 bytes per dimension + 1,000 bytes per row) / (1,000,000,000 bytes per GB)
>
> = **256.5 credits monthly**

> **Note:**
>
> The size of the data per row varies by use case and increases with the amount of data (number of rows and columns) indexed by the service,
> regardless of a column’s designation as a search or attribute column.

#### Multi-index Cortex Search

Multi-index search services often store more data per row to account for the additional index columns. The total data used depends on the number of indices in addition to the table size.

To estimate the monthly serving cost for a multi-index service, use the following formula, where `n` is the number
of vector index columns, `d` is the average number of vector dimensions, and `r` is the number of rows:

> (6.3 credits per GB) \* r \* (n \* d \* (4 bytes per dimension) + 1,000 bytes per row) / (1,000,000,000 bytes per GB)

For example, if you have a service with 10 million rows and 2 vector indexes each of 768 dimensional vectors, you would expect to pay the following cost per month:

> (6.3 credits per GB) \* (10,000,000 rows) \* ((2 vector index columns) \* (768 vector dimensions) \* (4 bytes per dimension) + 1,000 bytes per row) / (1,000,000,000 bytes per GB)
>
> = **448.1 credits monthly**

### Warehouse compute

The [virtual warehouse](../../cost-understanding-compute.md) compute cost for Cortex Search Services can vary based on the change rate of your data, target lag, and warehouse size.
In general, Cortex Search Services with lower target lag values and higher change rates on underlying data will incur higher Warehouse-related
compute costs.

> > **Tip:**
> >
> > To get a clear understanding of Warehouse costs related to your Cortex Search pipelines, test
> > Cortex Search using dedicated warehouses so that the virtual warehouse consumption attributed to Cortex Search refreshes
> > can be isolated. You can move your Cortex Search Service to a shared warehouse after you establish a cost
> > baseline.

### Storage

Cortex Search Services require storage to store the materialized results of the source query, as well as the search index.
The size of the data stored can be estimated by materializing the source query into a table using the
[CORTEX_SEARCH_DATA_SCAN](../../../sql-reference/functions/cortex_search_data_scan.md) table function, and then examining the size of that table.

For detailed information about how this storage incurs cost, see [Understanding storage cost](../../cost-understanding-data-storage.md).

### Cloud Services

Cortex Search Services use [Cloud Services compute](../../cost-understanding-compute.md) to trigger refreshes when an underlying base object has changed. These costs
can vary based on the change rate of your data, target lag, and warehouse size. Cloud services
cost for change tracking in Cortex Search tend to be lower for use-cases with low change rates. Cloud services compute cost
is subject to the constraint that Snowflake only bills if the daily cloud services cost is greater than 10% of the daily warehouse
cost for the account.

---
title: Use case examples
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/templates.md
section: Snowflake Cortex (AI & ML)
---

# Use case examples

The following pre-built templates for Snowflake Intelligence implement agents for different use cases. You can use these templates to get started with Snowflake Intelligence or as a reference for building your own agents.

* [Claims Audit](https://app.snowflake.com/templates?template=snowflake_intelligence_claims_audit)
* [Financial Advisor Agent](https://app.snowflake.com/templates?template=snowflake_intelligence_financial_advisor)
* [Product Insights Agent](https://app.snowflake.com/templates?template=snowflake_intelligence_product_insights)

---
title: Use threads with the Cortex Agent REST API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-agents-threads.md
section: Snowflake Cortex (AI & ML)
---

# Use threads with the Cortex Agent REST API

This guide explains how to create, continue, and manage threaded conversations using the Cortex Agent REST API.

The workflow for using threads includes the following steps:

1. Create a new thread and use it as part of an `agent:run` request to the Cortex Agent REST API.
2. Read the message IDs for the thread.
3. Choose a message to continue the thread from.

## Start a new thread and use it with Agent Run

You must create a new thread, then pass it as part of a request to `agent:run`.

1. Create a new thread using [Create thread](cortex-agents-threads-rest-api.md).
2. Pass the ID of the newly created thread as part of a request to one of the `agent:run` REST API endpoints.

> * `agent:run` with Agent Object:
>
>   ```none
>   /api/v2/databases/{database}/schemas/{schema}/agents/{name}:run
>   ```
> * `agent:run` without Agent Object:
>
>   ```none
>   /api/v2/cortex/agent:run
>   ```
>
> As part of the request, pass the following:
>
> * `parent_message_id` must be `0`. This indicates that this request is the start of the thread.
> * Exactly one user message in `messages`.
>
> ```none
> POST <agent run endpoint>
> {
>   "thread_id": 1234,
>   "parent_message_id": 0,
>   "messages": [
>     {
>       "role": "user",
>       "content": [
>         {
>           "type": "text",
>           "text": "What is the total revenue for 2025?"
>         }
>       ]
>     }
>   ],
> }
> ```

## Read the returned message IDs

The Agent API streams back metadata events for each message in the conversation. The following output shows the structure of the metadata. Always listen for both user and assistant metadata events.

```output
event: metadata
data: {"metadata": {"role":"user","message_id":123}}

event: metadata
data: {"metadata": {"role":"assistant","message_id":456}}
```

In this output, the message IDs correspond to the following in the conversation:

* `123`: the persisted user message ID
* `456`: the persisted assistant message ID

Together, these IDs form the following thread:

```none
0 -> 123 (user) -> 456 (assistant)
```

## Continue the conversation

For the next turn in the conversation, set `parent_message_id` to the last successful assistant message ID and pass new values in `messages`. In this example, the parent message ID is `456`.

> **Note:**
>
> You must pass an assistant message ID as the `parent_message_id` to ensure the LLM functions as expected. You cannot pass a user message ID.
> If you have lost track of the last message ID, use [Create thread](cortex-agents-threads-rest-api.md) to list all messages in the thread.

```json
{
  "thread_id": 1234,
  "parent_message_id": 456,
  "messages": [
    {
      "role": "user",
      "content": [
        {
          "type": "text",
          "text": "What about last year?"
        }
      ]
    }
  ],

}
```

Continue using the latest successful assistant message ID as the `parent_message_id` in subsequent requests.

### Fork a conversation

You can also fork the conversation by continuing from any earlier assistant message. To fork the conversation, pass the desired assistant message ID as the `parent_message_id` in a new request. In the following example, `3 (user) -> 4 (assistant)` and `5 (user) -> 6 (assistant)` represent two different forks from the same assistant response.

```none
0 -> 1 (user) -> 2 (assistant) -> 3 (user) -> 4 (assistant)
0 -> 1 (user) -> 2 (assistant) -> 5 (user) -> 6 (assistant)
```

## Troubleshooting

In rare cases, the Agent API might fail to store the assistant message.
If assistant metadata is missing from the response, ignore the failed turn and continue from the last successful assistant message.

For example, consider the following thread:

```none
0 -> 1 (user) -> 2 (assistant) -> 3 (user) -> [assistant failed]
```

To continue the conversation, pass message ID 2 as part of a new request because that is the last successful assistant message.

```none
0 -> 1 (user) -> 2 (assistant) -> 5 (user) -> 6 (assistant)
```

---
title: User access and settings for agents
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/snowflake-intelligence/deploy-agents.md
section: Snowflake Cortex (AI & ML)
---

# User access and settings for agents

This topic provides information about the permissions required for users to interact with agents in Snowflake Intelligence and about the settings available for the Snowflake Intelligence interface and advanced access control features.

If you don’t have an agent for use with Snowflake Intelligence, create one using the [Build agents](build-agents.md) guide.

## Customize the Snowflake Intelligence interface

To customize the Snowflake Intelligence interface that users interact with Cortex Agents through, follow these steps:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Agents.
3. Select Open settings.
4. Under Snowflake Intelligence, modify the following settings:

   * Display name: The name of the Snowflake Intelligence interface that is displayed to users.
   * Welcome message: The message that is displayed when users first open the Snowflake Intelligence interface.
   * Color theme: The color theme of the Snowflake Intelligence interface.
     :   You can provide a custom primary color in hexadecimal format.
   * Full-length logo and Compact logo: The logos that are displayed when the navigation pane is expanded or collapsed, respectively.
   * Compact logo: The icon that is displayed in the browser tab.
5. Select Save.

## User privileges and access control

Users must have the following privileges to view agents in Snowflake Intelligence:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Database, schema | Required to view the agent. |
| USAGE | Agent | Required to query the Cortex Agent to generate responses. |

To access the tools attached to an agent, users must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Database, schema | Required to access the objects associated with any tools to attach to the agent. |
| USAGE | Cortex Search service | Required to run the Cortex Search services in the Cortex Agents request. |
| SELECT | Table | Required to access the objects referenced in the agent’s semantic view/model. |
| USAGE | Tools | Required to access all of the custom tools that the agent can use to generate responses. For example, if the custom tool is a stored procedure, then the user must have USAGE on the procedure. |
| USAGE | Semantic view/model | Required to access the semantic view/model referenced by the agent. |

**Limit access to specific roles**

The CORTEX_USER role gives users access to all Cortex features, including agents. By default, this role is granted to the PUBLIC role, which is automatically granted to all users and roles. If you don’t want all users to have this privilege, you can revoke it from the PUBLIC role and grant access to specific roles only. For more information, see [Cortex LLM privileges](../aisql.md).

After the CORTEX_USER role is revoked from the PUBLIC role, you can grant the CORTEX_AGENT_USER role. This role gives users access to only the Cortex Agents API, which allows them to use Snowflake Intelligence, but not the other Cortex features.

* To provide selective access to Cortex Agents so that only a subset of users have access to the feature, first revoke access to the PUBLIC role, and then grant the CORTEX_AGENT_USER role to specific users:

  ```sqlexample
  GRANT DATABASE ROLE SNOWFLAKE.CORTEX_AGENT_USER TO ROLE <role_name>;
  ```

For more information, see [Access control requirements](../cortex-agents.md).

## Configure Snowflake Intelligence with private connectivity

Snowflake Intelligence supports integration with AWS Privatelink and Azure Private Link to establish a private connection between your virtual private cloud (VPC) or virtual network (VNet) and Snowflake Intelligence. Configuring private connectivity requires setting up the correct DNS resolution to direct traffic to the Snowflake Intelligence service through this private connection.

Note that AWS PrivateLink and Azure Private Link are not services provided by Snowflake. They are an AWS service and Microsoft service, respectively, that Snowflake supports to use with your Snowflake account.

### Prerequisites

Complete the following prerequisites before connecting to Snowflake Intelligence with private connectivity.

* Do one of the following:
  :   + To set up AWS PrivateLink, follow the instructions in [AWS PrivateLink and Snowflake](../../admin-security-privatelink.md).
      + To set up Azure Private Link, follow the instructions in [Azure Private Link and Snowflake](../../privatelink-azure.md).
* To ensure that a `regionless-snowsight-privatelink-url` is available, using the ACCOUNTADMIN system role, call the [SYSTEM$GET_PRIVATELINK_CONFIG](../../../sql-reference/functions/system_get_privatelink_config.md) function.

> **Important:**
>
> Snowflake Intelligence exclusively uses the regionless URL format for private connectivity access. Unlike with other private connectivity URLs used for Snowflake, you should not include a region identifier, such as `us-west-2,` in the hostname. Any attempts to connect using a region-specific URL will fail.

### Connect to Snowflake Intelligence

Connect to Snowflake Intelligence by configuring the DNS for Snowflake Intelligence to use the subdomain.

* Create a CNAME record in your private DNS zone, `privatelink.snowflakecomputing.com`, that maps the following URL to the DNS name of your VPC or VNET endpoint:

  ```none
  si-<org-acct>.privatelink.snowflakecomputing.com
  ```

After the configuration is complete, users within your network can access Snowflake Intelligence by navigating to the following URL:

> ```none
> https://si-<org-acct>.privatelink.snowflakecomputing.com
> ```

The connection is routed securely over the private connection.

### User authentication with private connectivity

Users accessing Snowflake Intelligence with private connectivity use the standard Snowflake authentication process, which requires them to provide their account identifier, username, and password on the sign-in page.

## Redirect users to your identity provider

An account administrator can configure all user URLs to redirect to your identity provider (IdP) when an unauthenticated user accesses Snowflake Intelligence.
This process eliminates a step from the user’s sign-in flow.

* To redirect unauthenticated users from URLs to your IdP, execute the following SQL command, replacing `your_security_integration` with the name of the security integration that is configured for your IdP:

  ```sqlexample
  ALTER ACCOUNT SET LOGIN_IDP_REDIRECT = (SNOWFLAKE_INTELLIGENCE = <your_security_integration>);
  ```

> **Note:**
>
> * To use IdP redirecting when Snowflake Intelligence is accessed with private connectivity, you must configure the DNS to direct traffic to the Snowflake Intelligence service using the following URL format:
>
>   ```none
>   https://si-<org-acct>.privatelink.snowflakecomputing.com
>   ```
>
> For more information, see Configure Snowflake Intelligence with private connectivity.

For more information about configuring your Snowflake account to use an IdP, see the following topics:

* [Configuring Snowflake to use federated authentication](../../admin-security-fed-auth-security-integration.md)
* [Configuring an identity provider (IdP) for Snowflake](../../admin-security-fed-auth-configure-idp.md)

## Limit a user’s access to only Snowflake Intelligence

To restrict a user to only access Snowflake Intelligence and prevent them from accessing other parts of Snowflake, you can use either the [ALTER USER](../../../sql-reference/sql/alter-user.md) SQL command or the `allowedInterfaces` SCIM attribute. If a value other than `ALL` is specified using either method, then users can only access the interface specified and cannot interact with any Snowflake data outside of the interface specified.

* To restrict a user to only access Snowflake Intelligence, use the [ALTER USER](../../../sql-reference/sql/alter-user.md) SQL command:

  > ```sqlexample
  > ALTER USER <user_name> SET ALLOWED_INTERFACES = (SNOWFLAKE_INTELLIGENCE);
  > ```
* If you’re provisioning users with SCIM APIs, to set the same restriction, use the custom attribute `allowedInterfaces`.

For more information about SCIM custom attributes, see [Custom attributes](../../scim-user-api-reference.md).

### Limitations

Snowflake Intelligence currently has these limitations for Snowflake Intelligence-only users:

* Custom branding logos and icons don’t work for Snowflake Intelligence-only users and default to the Snowflake logo and icon.
* Snowflake Intelligence-only users cannot upload files.

## Snowflake Intelligence object

A Snowflake Intelligence object is an account-level object used to manage all agents in Snowflake Intelligence and their settings for your account. The Snowflake Intelligence object offers the following benefits:

* **Flexibility:** Create and manage agents anywhere in your account without needing to centralize them in a single schema.
* **Agent visibility management:** Use a single object to control which agents appear to all users.
* **Improved permission management:** Separate the ability to create agents from the ability to control which agents are shown in Snowflake Intelligence.

> **Note:**
>
> Using a Snowflake Intelligence object is an advanced configuration option and is not required to manage agents in Snowflake Intelligence. If an account has a Snowflake Intelligence object, then the agent must be added to that object to be visible. If not added, the agent can only be accessed using a direct link or the Snowsight UI.

### Set up a Snowflake Intelligence object

> **Note:**
>
> The role must have the CREATE SNOWFLAKE INTELLIGENCE ON ACCOUNT privilege to create a Snowflake Intelligence object.

To set up a Snowflake Intelligence object for your users, follow this process, which is expanded in the following sections:

* Create a Snowflake Intelligence object. The Snowflake Intelligence object is a single object meant to manage all agents used with Snowflake Intelligence in your account. You can only have one Snowflake Intelligence object in your account.
* Add agents to the Snowflake Intelligence object.
* GRANT the USAGE privilege on the Snowflake Intelligence object.

### Create a Snowflake Intelligence object

You can use either Snowsight or SQL to create a Snowflake Intelligence object.

> Snowsight UISQL
>
> Snowflake automatically creates the Snowflake Intelligence object when you modify the Snowflake Intelligence settings for the first time. When created using the UI, the Snowflake Intelligence object is named `SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT`. You can’t specify a different name.
>
> 1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
> 2. In the navigation menu, select AI & ML » Agents.
> 3. On the Snowflake Intelligence tab, select Open settings.
>    The Snowflake Intelligence object is created automatically if it doesn’t already exist. You can then add agents to the object.
>
> -To create a Snowflake Intelligence object, use the following command:
>
> > ```sqlexample
> > CREATE SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT;
> > ```

### Adding agents

The Snowflake Intelligence object is an account-level object that contains a list of agents. You can add or remove agents from this object to create a curated list of agents for your users.
For more information about adding or removing agents, see Configure the visibility of agents in Snowflake Intelligence.

### Grant Snowflake Intelligence privileges

The following privileges control access to Snowflake Intelligence objects:

* **CREATE SNOWFLAKE INTELLIGENCE on the account:** Privilege that allows creating a Snowflake Intelligence object. This privilege is granted to ACCOUNTADMIN by default.

  To grant this privilege to another role, run the following command:

  ```sqlexample
  GRANT CREATE SNOWFLAKE INTELLIGENCE ON ACCOUNT TO ROLE <role_name>;
  ```
* **USAGE on the Snowflake Intelligence object:** Privilege that allows users to view the list of agents added to the Snowflake Intelligence object and see configuration values.

  To grant this privilege, run the following command:

  ```sqlexample
  GRANT USAGE ON SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT TO ROLE <role_name>;
  ```
* **MODIFY on the Snowflake Intelligence object:** Privilege that allows users to add or remove agents from the Snowflake Intelligence object and change configuration values. Account administrators have this privilege by default.

  To grant this privilege, run the following command:

  ```sqlexample
  GRANT MODIFY ON SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT TO ROLE <role_name>;
  ```
* To make the Snowflake Intelligence object visible to all of your users, grant the USAGE privilege on the object to the PUBLIC role:

  ```sqlexample
  GRANT USAGE ON SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT TO ROLE PUBLIC;
  ```

If you are using the ACCOUNTADMIN role, you also have the MODIFY privilege on the Snowflake Intelligence object. This allows you to add or remove agents from the object to create a curated list of agents for your users.

To set up Snowflake Intelligence for your users, you must configure agent privileges. For information about the privileges required for agents, see [Access control requirements](../cortex-agents.md).

> **Important:**
>
> By default, Snowflake Intelligence uses the default role and the default warehouse of the user.
> When you invite others to use Snowflake Intelligence, ensure that they have a default role and warehouse.

> **Note:**
>
> All of the queries from Snowflake Intelligence use the user’s credentials. All role-based access control and data-masking policies associated with the user automatically apply to all interactions and conversations with the agent.

## Configure the visibility of agents in Snowflake Intelligence

In some cases, you might want to limit the agents that users can see in Snowflake Intelligence. For example, you might want to only show agents that are relevant to a specific user or group of users.

If you haven’t created a Snowflake Intelligence object and added agents to it, users automatically see all agents they have access to in your account.

* To control which agents appear in the Snowflake Intelligence interface for all users, create a curated list of agents by adding them to the Snowflake Intelligence object.

### Verify the Snowflake Intelligence object

* To see whether the Snowflake Intelligence object has been created in your account, use the following command:

  ```sqlexample
  SHOW SNOWFLAKE INTELLIGENCES;
  ```

> **Note:**
>
> Only one Snowflake Intelligence object can exist in an account.

### Manage agents with the Snowflake Intelligence object

* To add agents to the Snowflake Intelligence object, use the following command:

  ```sqlexample
  ALTER SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT ADD AGENT <db.schema.agent_name>;
  ```
* To remove agents from the Snowflake Intelligence object, use the following command:

  ```sqlexample
  ALTER SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT DROP AGENT <db.schema.agent_name>;
  ```

> **Note:**
>
> Any user or admin with the correct database and schema privileges can create agents. However, agents are not automatically added to the Snowflake Intelligence object: to add an agent to the Snowflake Intelligence object, users must have the ALTER privilege on the Snowflake Intelligence object and USAGE privileges on the agent.
>
> Administrators must have the USAGE privilege on the agent to add it to the Snowflake Intelligence object.

### Migrate from managing agent visibility with the SNOWFLAKE_INTELLIGENCE.AGENTS schema to the Snowflake Intelligence object

> **Important:**
>
> The `SNOWFLAKE_INTELLIGENCE.AGENTS` schema is deprecated as a mechanism for managing agent visibility. If you’re currently using this schema, we recommend migrating to the Snowflake Intelligence object.

If you’re using the `SNOWFLAKE_INTELLIGENCE.AGENTS` schema, your agents will continue to work, as detailed in Configure the visibility of agents in Snowflake Intelligence. However, migrating to the Snowflake Intelligence object provides the following benefits:

> * **Flexibility:** Create and manage agents anywhere in your account without needing to centralize them in a single schema.
> * **Improved permission management:** Separate the ability to create agents from the ability to make them visible in Snowflake Intelligence.
> * **Fewer naming conflicts:** Eliminate potential conflicts with the `SNOWFLAKE_INTELLIGENCE.AGENTS` schema name.
> * **Easier agent visibility management:** Use a single object to control which agents appear to all users.

You must create a Snowflake Intelligence object before you migrate your agents. For information about creating a Snowflake Intelligence object, see Snowflake Intelligence object.

* To add an agent to the Snowflake Intelligence object, use the following code:

  > ```sqlexample
  > ALTER SNOWFLAKE INTELLIGENCE SNOWFLAKE_INTELLIGENCE_OBJECT_DEFAULT ADD AGENT SNOWFLAKE_INTELLIGENCE.AGENTS.<agent_name>;
  > ```

## Access the agent

After you’ve created an agent, users can ask it questions to get insights from your data.
The agent can answer questions such as these:

* What is the average sales amount for the last quarter?
* What product sold the most units last month?
* Can you show me the sales trend for the last year?

It can also provide the following visualizations:

* Bar chart
* Line chart
* Pie chart
* Scatter plot

To use the agent, follow these steps:

> Method 1: Access without private connectivityMethod 2: Access with private connectivityMethod 3: Access with a direct link
>
> To access Snowflake Intelligence without private connectivity, navigate to the following URL:
>
> ```none
> https://ai.snowflake.com
> ```
>
> To access Snowflake Intelligence with private connectivity, navigate to the following URL:
>
> ```none
> https://si-<org-acct>.snowflakecomputing.com
> ```
>
> To access Snowflake Intelligence with a direct link, follow these steps:
>
> 1. In the navigation menu, select AI & ML » Agents.
> 2. From the list of agents, select the agent that you want to access.
> 3. Select Preview in Snowflake Intelligence.
> 4. Copy the URL.

> **Note:**
>
> You can switch between agents in the same conversation thread to retain context across agent interactions.

## Monitoring agent usage and feedback

You can view logs for an agent to see details about the interactions that users have had with the agent. The logs include information such as the prompts that users have sent to the agent, the responses that the agent has provided, and any errors that have occurred. For more information about viewing logs for agents, see [Monitor Cortex Agent requests](../cortex-agents-monitor.md).

When users in your organization interact with agents, they can provide feedback about the responses that the agents provide. This feedback gives high-level insights about the satisfaction of users. To view user feedback for your agents, see [Monitor Cortex Agent requests](../cortex-agents-monitor.md).

---
title: Vector embedding REST API
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-rest-api/embed-api.md
section: Snowflake Cortex (AI & ML)
---

# Vector embedding REST API

The Cortex REST API gives you access to an endpoint for performing [vector embeddings](../vector-embeddings.md), using the [AI_EMBED](../../../sql-reference/functions/ai_embed.md) function.

## Setting up authentication

To authenticate to the Cortex REST API, you can use the methods described in
[Authenticating Snowflake REST APIs with Snowflake](../../../developer-guide/snowflake-rest-api/authentication.md).

Set the `Authorization` header to include your token (for example, a JSON web token (JWT), OAuth token, or
[programmatic access token](../../programmatic-access-tokens.md)).

> **Tip:**
>
> Consider creating a dedicated user for Cortex REST API requests.

## Setting up authorization

To send a REST API request, your default role must be granted the SNOWFLAKE.CORTEX_USER database role.
In most cases, users already have this privilege because SNOWFLAKE.CORTEX_USER is granted to the PUBLIC
role automatically, and all roles inherit PUBLIC.

If your Snowflake administrator has revoked this grant, they must re-grant it:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE my_role;
GRANT ROLE my_role TO USER my_user;
```

> **Important:**
>
> REST API requests use the user’s default role, so that role must have the necessary privileges. You can change
> a user’s default role with [ALTER USER … SET DEFAULT_ROLE](../../../sql-reference/sql/alter-user.md).
>
> ```sqlexample
> ALTER USER my_user SET DEFAULT_ROLE=my_role
> ```

## Endpoint format

You can make requests to the `/api/v2/cortex/inference:embed` endpoint to create embeddings for your text. The request takes the following form:

```output
POST https://<account_identifier>.snowflakecomputing.com/api/v2/cortex/inference:embed
```

where `account_identifier` is the [account identifier](../../admin-account-identifier.md) you use to access Snowsight.

## Model availability

The following table shows the EMBED function models that you can prompt using the REST API.

EMBED function models

| Model | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| `snowflake-arctic-embed-m-v1.5` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| `snowflake-arctic-embed-m` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| `e5-base-v2` | ✔ | ✔ | ✔ |  |  | ✔ | ✔ | ✔ |
| `snowflake-arctic-embed-l-v2.0` | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |

The following table shows the number of dimensions that each model can return.

EMBED function models

| Model | Number of  dimensions |
| --- | --- |
| `snowflake-arctic-embed-m-v1.5` | 768 |
| `snowflake-arctic-embed-m` | 768 |
| `e5-base-v2` | 768 |
| `snowflake-arctic-embed-l-v2.0` | 1024 |

## API Reference

### POST /api/v2/cortex/inference:embed

Creates an embedding for text that you specify.

Required headers

`Authorization: Bearer token`.
:   Authorization for the request. `token` is a JSON web token (JWT), OAuth token, or
    [programmatic access token](../../programmatic-access-tokens.md)). For details, see
    [Authenticating Snowflake REST APIs with Snowflake](../../../developer-guide/snowflake-rest-api/authentication.md).

`Content-Type: application/json`
:   Specifies that the body of the request is in JSON format.

`Accept: application/json`
:   Specifies that the response contains JSON.

#### Optional headers

`X-Snowflake-Authorization-Token-Type: type`
:   Defines the type of authorization token.

    If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.

    Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:

    * `KEYPAIR_JWT` (for key-pair authentication)
    * `OAUTH` (for OAuth)
    * `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../programmatic-access-tokens.md))

#### Required JSON arguments

| Argument | Type | Description |
| --- | --- | --- |
| `text` | array | A list of text strings for which you’re generating embeddings. The list can contain up to 1280 strings, each of which can be up to 4096 characters long. |
| `model` | string | The model that you’re using to create the embeddings. |

#### Status codes

The Snowflake Cortex LLM REST API uses the following HTTP status codes to indicate successful completion or various error
conditions.

200 `OK`
:   Request completed successfully. The body of the response contains the output of the model.

400 `invalid options object`
:   The optional arguments have invalid values.

400 `unknown model model_name`
:   The specified model does not exist.

400 `schema validation failed`
:   Errors related to incorrect response schema structure. Correct the schema and try again.

400 `max tokens of count exceeded`
:   The request exceeded the maximum number of tokens supported by the model (see [Model restrictions](../aisql.md)).

400 `all requests were throttled by remote service`
:   The request has been throttled due to a high level of usage. Try again later.

402 `budget exceeded`
:   The model consumption budget was exceeded.

403 `Not Authorized`
:   Account not enabled for REST API, or the default role for the calling user does not have the `snowflake.cortex_user` database role.

429 `too many requests`
:   The request was rejected because the usage quota has been exceeded. Please try your request later.

503 `embed timed out`
:   The request took too long.

## CURL request example

The following example uses `curl` to make an EMBED request to the `e5-base-v2` model.
Replace `token` and `account_identifier` with the appropriate values in this command.

```bash
curl --location "<account_url>/api/v2/cortex/inference:embed" \
--header 'X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT' \
--header 'Content-Type: application/json' \
--header 'Accept: application/json' \
--header "Authorization: Bearer <token>" \
--data '{
"text": ["foo", "bar"],
"model": "e5-base-v2"
}'
```

### Output

The following is the output of the request, with the contents of the embedding array truncated:

```output
{
  "object" : "list",
  "data" : [ {
    "object" : "embedding",
    "embedding" : [ [ -0.02102863, 0.0051381723, -0.0071509206, -0.032512695, 0.056507032, ... ] ],
    "index" : 0
  }, {
    "object" : "embedding",
    "embedding" : [ [ -0.03859099, -0.0025452692, 0.002827513, -0.023107057, 0.039019972, ... ] ],
    "index" : 1
  } ],
  "model" : "e5-base-v2",
  "usage" : {
    "total_tokens" : 6
  }
}
```

Each embedding has an index that corresponds to the text string in a list in the request. The index is 0-based, so the first text string in the list has an index of 0, the second text string has an index of 1, and so on.

In the preceding example, “foo” corresponds to the 0 index and “bar” corresponds to the 1 index. The embedding for “foo” is the first element in the list of embeddings, and the embedding for “bar” is the second element in the list of embeddings.

## Python request example

The following example uses the Python API to make an EMBED request to the `e5-base-v2` model.
Replace `token` and `account_identifier` with the appropriate values in this command.

```python
from snowflake.core import Root
from snowflake.snowpark.context import get_active_session

def embed_service():
    # Initialize Snowflake session and root
    session = get_active_session()
    root = Root(session)

    # Send embed_request request and process response
    response = root.cortex_embed_service.embed("e5-base-v2", ['foo', 'bar'])
    print(response)

if __name__ == "__main__":
    embed_service()
```

### Output

The following is the output of the request, with the contents of the embedding array truncated:

```output
{
  "object" : "list",
  "data" : [ {
    "object" : "embedding",
    "embedding" : [ [ -0.02102863, 0.0051381723, -0.0071509206, -0.032512695, 0.056507032, ... ] ],
    "index" : 0
  }, {
    "object" : "embedding",
    "embedding" : [ [ -0.03859099, -0.0025452692, 0.002827513, -0.023107057, 0.039019972, ... ] ],
    "index" : 1
  } ],
  "model" : "e5-base-v2",
  "usage" : {
    "total_tokens" : 6
  }
}
```

Each embedding has an index that corresponds to the text string in a list in the request. The index is 0-based, so the first text string in the list has an index of 0, the second text string has an index of 1, and so on.

In the preceding example, “foo” corresponds to the 0 index and “bar” corresponds to the 1 index. The embedding for “foo” is the first element in the list of embeddings, and the embedding for “bar” is the second element in the list of embeddings.

## Usage quotas

The following table shows the usage quotas for the EMBED function.

EMBED function quotas

| Model | Tokens Processed  per Minute (TPM) | Requests per  Minute (RPM) | Max output (tokens) |
| --- | --- | --- | --- |
| `snowflake-arctic-embed-m-v1.5` | 400,000 | 200 | 4,096 |
| `snowflake-arctic-embed-m` | 400,000 | 200 | 4,096 |
| `e5-base-v2` | 400,000 | 200 | 4,096 |
| `nv-embed-qa-4` | 400,000 | 200 | 4,096 |
| `multilingual-e5-large` | 400,000 | 200 | 4,096 |
| `voyage-multilingual-2` | 400,000 | 200 | 4,096 |

---
title: Vector Embeddings
source: https://docs.snowflake.com/en/user-guide/snowflake-cortex/vector-embeddings.md
section: Snowflake Cortex (AI & ML)
---

# Vector Embeddings

An *embedding* refers to the reduction of high-dimensional data, such as unstructured text, to a representation
with fewer dimensions, such as a vector. Modern deep learning techniques can create vector embeddings,
which are structured numerical representations, from unstructured data such as
text and images, while preserving semantic notions of similarity and dissimilarity in the geometry of the vectors they produce.

The following illustration is a simplified example of the vector embedding and geometric similarity of natural language
text. In practice, neural networks produce embedding vectors with hundreds or even thousands of dimensions, not two as
shown here, but the concept is the same. Semantically similar text yields vectors that “point” in the same general
direction.

Many applications can benefit from the ability to find text or images similar to a target. For example, when a new
support case is logged at a help desk, the support team can benefit from the ability to find similar cases that have
already been resolved. The advantage of using embedding vectors in this application is that it goes beyond keyword
matching to semantic similarity, so related records can be found even if they don’t contain exactly the same words.

Snowflake Cortex offers the [EMBED_TEXT_768](../../sql-reference/functions/embed_text-snowflake-cortex.md) and
[EMBED_TEXT_1024](../../sql-reference/functions/embed_text_1024-snowflake-cortex.md) functions and several
[Vector functions](../../sql-reference/functions-vector.md) to compare them for various applications.

## Text embedding models

Snowflake offers the following text embedding models. See below for more details.

| Model name | Output dimensions | Context window | Language support |
| --- | --- | --- | --- |
| snowflake-arctic-embed-m-v1.5 | 768 | 512 | English-only |
| snowflake-arctic-embed-m | 768 | 512 | English-only |
| e5-base-v2 | 768 | 512 | English-only |
| snowflake-arctic-embed-l-v2.0 | 1024 | 512 | Multilingual |
| voyage-multilingual-2 | 1024 | 32000 | Multilingual ([supported languages](https://blog.voyageai.com/2024/06/10/voyage-multilingual-2-multilingual-embedding-model/)) |
| nv-embed-qa-4 | 1024 | 512 | English-only |

Supported models might have different [costs](aisql.md).

## About vector similarity functions

The measurement of similarity between vectors is a fundamental operation in semantic comparison. Snowflake Cortex
provides four vector similarity functions: VECTOR_INNER_PRODUCT, VECTOR_L1_distance, VECTOR_L2_DISTANCE, and
VECTOR_COSINE_SIMILARITY. To learn more about these functions, see [Vector functions](../../sql-reference/functions-vector.md).

For syntax and usage details, see the reference page for each function:

* [VECTOR_INNER_PRODUCT](../../sql-reference/functions/vector_inner_product.md)
* [VECTOR_L1_DISTANCE](../../sql-reference/functions/vector_l1_distance.md)
* [VECTOR_L2_DISTANCE](../../sql-reference/functions/vector_l2_distance.md)
* [VECTOR_COSINE_SIMILARITY](../../sql-reference/functions/vector_cosine_similarity.md)

## Examples

The following examples use the vector similarity functions.

This SQL example uses the VECTOR_INNER_PRODUCT function to determine which vectors in the table
are closest to each other between columns `a` and `b`:

```sqlexample
CREATE TABLE vectors (a VECTOR(float, 3), b VECTOR(float, 3));
INSERT INTO vectors SELECT [1.1,2.2,3]::VECTOR(FLOAT,3), [1,1,1]::VECTOR(FLOAT,3);
INSERT INTO vectors SELECT [1,2.2,3]::VECTOR(FLOAT,3), [4,6,8]::VECTOR(FLOAT,3);

-- Compute the pairwise inner product between columns a and b
SELECT VECTOR_INNER_PRODUCT(a, b) FROM vectors;
```

```output
+------+
| 6.3  |
|------|
| 41.2 |
+------+
```

This SQL example calls the VECTOR_COSINE_SIMILARITY function to find the vector closes to `[1,2,3]`:

```sqlexample
SELECT a, VECTOR_COSINE_SIMILARITY(a, [1,2,3]::VECTOR(FLOAT, 3)) AS similarity
    FROM vectors
ORDER BY similarity DESC
LIMIT 1;
```

```output
+-------------------------+
| [1, 2.2, 3] | 0.9990... |
+-------------------------+
```

### Snowflake Python Connector

These examples show how to use the VECTOR data type and vector similarity functions with the Python Connector.

> **Note:**
>
> Support for the VECTOR type was introduced in version 3.6 of the Snowflake Python Connector.

```python
import snowflake.connector

conn = ... # Set up connection
cur = conn.cursor()

# Create a table and insert some vectors
cur.execute("CREATE OR REPLACE TABLE vectors (a VECTOR(FLOAT, 3), b VECTOR(FLOAT, 3))")
values = [([1.1, 2.2, 3], [1, 1, 1]), ([1, 2.2, 3], [4, 6, 8])]
for row in values:
        cur.execute(f"""
            INSERT INTO vectors(a, b)
                SELECT {row[0]}::VECTOR(FLOAT,3), {row[1]}::VECTOR(FLOAT,3)
        """)

# Compute the pairwise inner product between columns a and b
cur.execute("SELECT VECTOR_INNER_PRODUCT(a, b) FROM vectors")
print(cur.fetchall())
```

```output
[(6.30...,), (41.2...,)]
```

```python
# Find the closest vector to [1,2,3]
cur.execute(f"""
    SELECT a, VECTOR_COSINE_SIMILARITY(a, {[1,2,3]}::VECTOR(FLOAT, 3))
        AS similarity
        FROM vectors
        ORDER BY similarity DESC
        LIMIT 1;
""")
print(cur.fetchall())
```

```output
[([1.0, 2.2..., 3.0], 0.9990...)]
```

### Snowpark Python

These examples show how to use the VECTOR data type and vector similarity functions with the Snowpark Python Library.

> **Note:**
>
> * Support for the VECTOR type was introduced in version 1.11 of Snowpark Python.
> * The Snowpark Python library does not support the [VECTOR_COSINE_SIMILARITY](../../sql-reference/functions/vector_cosine_similarity.md) function.

```python
from snowflake.snowpark import Session, Row
session = ... # Set up session
from snowflake.snowpark.types import VectorType, StructType, StructField
from snowflake.snowpark.functions import col, lit, vector_l2_distance
schema = StructType([StructField("vec", VectorType(int, 3))])
data = [Row([1, 2, 3]), Row([4, 5, 6]), Row([7, 8, 9])]
df = session.create_dataframe(data, schema)
df.select(
    "vec",
    vector_l2_distance(df.vec, lit([1, 2, 2]).cast(VectorType(int, 3))).as_("dist"),
).sort("dist").limit(1).show()
```

```output
----------------------
|"VEC"      |"DIST"  |
----------------------
|[1, 2, 3]  |1.0     |
----------------------
```

## Create vector embeddings from text

To create a vector embedding from a piece of text, you can use the [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](../../sql-reference/functions/embed_text-snowflake-cortex.md) or
[EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../../sql-reference/functions/embed_text_1024-snowflake-cortex.md) functions, depending on the output dimensions of the model.
This function returns the vector embedding for a given English-language text. This vector can be used with the
vector comparison functions to determine the semantic similarity of two documents.

```sqlexample
SELECT SNOWFLAKE.CORTEX.EMBED_TEXT_768(model, text)
```

> **Tip:**
>
> You can use other embedding models through [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). For more information, see
> [Embed Text Container Service](https://github.com/Snowflake-Labs/sfguide-text-embedding-snowpark-container-service).

> **Important:**
>
> EMBED_TEXT_768 and EMBED_TEXT_1024 are Cortex LLM Functions, so their usage is governed by the same access controls as the other
> Cortex LLM Functions. For instructions on accessing these functions, see the [Cortex LLM Functions Required Privileges](aisql.md).

## Example use cases

This section shows how to use embeddings, the vector similarity functions, and VECTOR data type to implement popular use cases
such as vector similarity search and retrieval-augmented generation (RAG).

### Vector similarity search

To implement a search for semantically similar documents, first store the embeddings for the documents to be searched.
Keep the embeddings up to date when documents are added or edited.

In this example, the documents are call center issues logged by support representatives. The issue is stored in a column
called `issue_text` in the table `issues`. The following SQL creates a new vector column to hold the
embeddings of the issues.

```sqlexample
ALTER TABLE issues ADD COLUMN issue_vec VECTOR(FLOAT, 768);

UPDATE issues
  SET issue_vec = SNOWFLAKE.CORTEX.EMBED_TEXT_768('snowflake-arctic-embed-m', issue_text);
```

To perform a search, create an embedding of the search term or target document, and then use a vector similarity function
to locate documents with similar embeddings. Use ORDER BY and LIMIT clauses to select the top *k* matching documents,
and optionally use a WHERE condition to specify a minimum similarity.

Generally, the call to the vector similarity function should appear in the SELECT clause, not in the WHERE clause. This
way, the function is called only for the rows specified by the WHERE clause, which may restrict the query based on some
other criteria, instead of operating over all rows in the table. To test a similarity value in the WHERE clause, define
a column alias for the VECTOR_COSINE_SIMILARITY call in the SELECT clause, and use that alias in a condition in the WHERE
clause.

This example finds up to five issues matching the search term from the last 90 days, assuming the cosine similarity with
the search term is at least 0.7.

```sqlexample
SELECT
  issue,
  VECTOR_COSINE_SIMILARITY(
    issue_vec,
    SNOWFLAKE.CORTEX.EMBED_TEXT_768('snowflake-arctic-embed-m', 'User could not install Facebook app on his phone')
  ) AS similarity
FROM issues
ORDER BY similarity DESC
LIMIT 5
WHERE DATEDIFF(day, CURRENT_DATE(), issue_date) < 90 AND similarity > 0.7;
```

### Retrieval-Augmented Generation (RAG)

In retrieval-augmented generation (RAG), a user’s query is used to find similar documents using
vector similarity. The top document is then passed to a large language model
(LLM) along with the user’s query, providing context for the generative response (completion). This can
improve the appropriateness of the response significantly.

In the following example, `wiki` is a table with a text column `content`, and `query` is a single-row
table with a text column `text`.

```sqlexample
-- Create embedding vectors for wiki articles (only do once)
ALTER TABLE wiki ADD COLUMN vec VECTOR(FLOAT, 768);
UPDATE wiki SET vec = SNOWFLAKE.CORTEX.EMBED_TEXT_768('snowflake-arctic-embed-m', content);

-- Embed incoming query
SET query = 'in which year was Snowflake Computing founded?';
CREATE OR REPLACE TABLE query_table (query_vec VECTOR(FLOAT, 768));
INSERT INTO query_table SELECT SNOWFLAKE.CORTEX.EMBED_TEXT_768('snowflake-arctic-embed-m', $query);

-- Do a semantic search to find the relevant wiki for the query
WITH result AS (
    SELECT
        w.content,
        $query AS query_text,
        VECTOR_COSINE_SIMILARITY(w.vec, q.query_vec) AS similarity
    FROM wiki w, query_table q
    ORDER BY similarity DESC
    LIMIT 1
)

-- Pass to large language model as context
SELECT SNOWFLAKE.CORTEX.COMPLETE('mistral-7b',
    CONCAT('Answer this question: ', query_text, ' using this text: ', content)) FROM result;
```

## Cost considerations

Snowflake Cortex LLM Functions, including EMBED_TEXT_768 and EMBED_TEXT_1024, incur compute cost based on the number of tokens processed.

> **Note:**
>
> A token is the smallest unit of text processed by Snowflake Cortex LLM Functions, approximately equal to four
> characters of text. The equivalence of raw input or output text to tokens can vary by model.

* For the EMBED_TEXT_768 and EMBED_TEXT_1024 functions, only input tokens are counted towards the billable total.
* Vector similarity functions do not incur token-based costs.

For more information about billing of Cortex LLM Functions, see [Cortex LLM Functions Cost Considerations](aisql.md).
For general information about compute costs, see [Understanding compute cost](../cost-understanding-compute.md).

## Cortex Code

Cortex Code — AI-driven coding agent integrated into Snowflake for data engineering workflows.

---
title: Cortex Code
source: https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code.md
section: Cortex Code
---

# Cortex Code

## Overview

Cortex Code is an AI-driven intelligent agent integrated into the Snowflake platform, optimized for complex data
engineering, analytics, machine learning, and agent-building tasks. It uses an autonomous agent framework to interact
directly with your Snowflake environment, with deep understanding of Snowflake’s Role-Based Access Control (RBAC),
schemas, and best practices.

Cortex Code supports data analysis, machine learning, and data engineering workflows. It provides a consistent, context-aware interface for users
performing data exploration or developing complex data pipelines.

## Core experiences

Cortex Code is delivered through two interfaces: in Snowsight and as a command line interface (CLI) that runs in a local shell.
This availability ensures access to AI agentic experiences wherever you work.

### Cortex Code in Snowsight

Cortex Code is the persistent, web-based entry point for AI in Snowflake. It is deeply integrated into Workspaces and
Snowsight Admin pages.

Key capabilities:

* **SQL and Python Notebook authoring:** Generate code from natural language or explain and optimize existing queries.
* **Account administration:** Take actions and answer questions about credit consumption, query performance, governance and user permissions.
* **Within Workspaces:**

  + **Context awareness:** Cortex Code knows which SQL file or notebook you are currently viewing and uses that as background context for its answers.
  + **Change review:** A visual “diff view” allows you to review and accept AI-suggested changes before they are applied.

### Cortex Code CLI

For power users and developers, the Cortex Code CLI provides an agentic shell for Snowflake that bridges the gap between your local development environment
(for example, VS Code or Cursor) and your Snowflake account.

For details about the CLI experience, see [Cortex Code CLI](cortex-code-cli.md).

#### Key features of the CLI

* **Snowflake integration:** The CLI connects directly to your Snowflake account using your existing authentication methods. You can execute SQL commands,
  view tables, validate [Cortex Analyst](../snowflake-cortex/cortex-analyst.md) semantic models, and manage multiple connections.
* **Local file access:** Unlike the Snowsight UI, the CLI can read and write to your local repositories, making
  it ideal for managing `dbt` projects or Streamlit apps.
* **Tool orchestration:** The CLI can invoke local `bash` commands, run `git` operations, and execute SQL directly against your Snowflake warehouse.
* **Agent customization:** Support for `AGENTS.md` files and Agent Skills allows you to define custom behaviors for the agent within
  specific projects.
* **Security:** Full support for Snowflake role-based access control (RBAC), OS-level sandboxing, a three-tier approval
  system, and automatic risk assessment help ensure secure operation within your environment.
* **Built-in Snowflake skills:** Cortex Code includes built-in skills that support key Snowflake workflows such as agent creation, machine
  learning, data engineering, and data governance.
* **Extensibility:** The CLI can be extended with custom tools, skills, subagents, hooks, and profiles to fit your organization’s workflows.
* **Developer friendly:** Developers, data engineers, and data scientists will find the Cortex Code CLI pleasant to work
  with, thanks to features like session persistence, `git` worktree support, a choice of compact and expanded display
  modes, multiple color themes, and support for `vim`-style keyboard navigation.

## More information

For detailed setup instructions, troubleshooting, and advanced use cases, see the following topics:

* [Cortex Code in Snowsight](cortex-code-snowsight.md)
* [Cortex Code CLI](cortex-code-cli.md)

## Cost

Cortex Code is billed based on token consumption. Pricing details are provided in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Cortex Code CLI

Cortex Code CLI supports two billing models depending on how you access the product:

* **Subscription:** Individual developers who sign up at [signup.snowflake.com/cortex-code](https://signup.snowflake.com/cortex-code)
  start with a free trial that includes a fixed amount of Cortex Code CLI usage. The trial is valid for 30 days from
  the date of sign-up. After the trial period ends, the account converts to a paid subscription unless cancelled. The
  subscription includes a fixed monthly amount of Cortex Code CLI usage. If you exceed the included usage, Cortex Code
  CLI is unavailable until the next billing period.
* **Pay-as-you-go:** Companies with an existing Snowflake account (on-demand or capacity customers) are billed based
  on token consumption.

Any Snowflake compute or storage consumed separately from Cortex Code CLI usage (for example, virtual warehouse or
storage costs) is billed at standard Snowflake on-demand rates, as described in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

To set daily credit usage limits for Cortex Code users, see
[Managing Cortex Code credit usage limits](credit-usage-limit.md).

### Cortex Code in Snowsight

Cortex Code in Snowsight is billed based on token consumption for customers with an existing
Snowflake account.

## Legal notices

Where your configuration of Cortex Code uses a model provided on the
[Model and Service Pass-Through Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/model-pass-through-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Cortex Code CLI
source: https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code-cli.md
section: Cortex Code
---

# Cortex Code CLI

This topic helps you get started with Cortex Code CLI, including installation, connection setup, and validation.

Before you begin, ensure you have a Snowflake account with access to the required Cortex models. See Prerequisites for full details.

> **Note:**
>
> If you do not have a Snowflake account, you can [sign up for a free Cortex Code CLI trial](https://signup.snowflake.com/cortex-code).

## Install Cortex Code CLI

Cortex Code CLI is available for Linux, macOS, and Windows (both WSL and native). Use the instructions below to install Cortex Code CLI on your platform.

### Linux (including WSL) and macOS

To install Cortex Code CLI on Linux, macOS, or WSL, issue the following command in a shell:

```shell
curl -LsS https://ai.snowflake.com/static/cc-scripts/install.sh | sh
```

This command downloads and runs the installation script, which installs the latest version of Cortex Code CLI.
The `cortex` executable is installed in `~/.local/bin` by default. The installation script adds this directory
to your PATH by modifying your shell profile.

### Windows native

To install Cortex Code CLI on Windows, issue the following command in PowerShell:

```shell
irm https://ai.snowflake.com/static/cc-scripts/install.ps1 | iex
```

This command downloads and runs the installation script, which installs the latest version of Cortex Code CLI.
The `cortex` executable is installed in `%LOCALAPPDATA%\cortex` by default. The installation script adds
this directory to your PATH.

After installation, invoke Cortex Code CLI from the Run dialog (Win+R), Command Prompt (`cmd.exe`), or PowerShell.

## Connect to Snowflake

After installing the Cortex Code CLI, issue the `cortex` command. A setup wizard guides you through the
initial configuration steps, including choosing or setting up a connection to Snowflake.

The first prompt asks you to choose a connection from the existing connections in the `~/.snowflake/connections.toml` file
or to create a new connection.

* To use an existing connection, choose the connection from the list using the up and down arrow keys, then press Enter.
* To create a new connection, choose More options\* by pressing the down arrow key until it is highlighted, then press Enter.
  Follow the prompts to enter your Snowflake account details.

> **Note:**
>
> The `connections.toml` is also used by the [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) (`snow` command). If you have already set up a connection
> for use with the Snowflake CLI, you can use that connection with the Cortex Code CLI.

## Start using Cortex Code

Once connected, try your first request:

```text
What can I do with Cortex Code?
```

Type natural-language requests (such as “find tables with PII tags” or “generate a Streamlit app for
SALES_MART.REVENUE”) and Cortex Code attempts to fulfill the request by orchestrating Snowflake-native skills and any
MCP tools you have configured. For more information on configuring MCP tools, see [Model Context Protocol (MCP)](extensibility.md).

As it works on your request, Cortex Code CLI displays its reasoning steps and actions in the terminal. From time to
time, it may ask you for information that it needs. If you’re in plan mode, it will ask you to confirm each action.

### Example requests

#### Discover your catalog

```text
What databases do I have access to?
List every table tagged PII = TRUE in ANALYTICS_DB
Show the lineage from RAW_DB.ORDERS to downstream dashboards
```

#### Generate and run SQL commands

```text
Write a query for top 10 customers by revenue
Add a 7-day moving average and show me the results
Explain why this query is slow and optimize it
```

#### Build applications

```text
Build a Streamlit dashboard on SALES_MART.REVENUE with filters for date and region
Create a dbt project to transform raw sales data
```

#### Work with Cortex Analyst

```text
Use the @models/revenue.yaml semantic model to answer "What was revenue last month?"
Debug my semantic model at @models/revenue.yaml
```

## Prerequisites

To use Cortex Code CLI, you need the following:

* A Snowflake user account with the necessary permissions to access the data you intend to use with Cortex Code CLI and to perform operations on them.
  This user must also have the SNOWFLAKE.CORTEX_USER database role. (Initially, all users have the SNOWFLAKE.CORTEX_USER role through the PUBLIC role, but
  your organization may have explicitly revoked it to implement stricter access control.)
* Network access to your Snowflake server.
* [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) installed on your workstation.
* One of the following supported platforms:

  > + macOS on Apple Silicon or Intel
  > + Linux on Intel or ARM
  > + Windows Subsystem for Linux (WSL) on Intel
  > + Windows Native on Intel
  > > **Note:**
  > >
  > > Snowflake may add support for other platforms from time to time. Please let your Snowflake representative know if you
  > > have a specific platform requirement.
* Local terminal access to the `bash`, `zsh`, or `fish` shell on your platform.

For additional configuration options, troubleshooting, and advanced setup, see [Cortex Code CLI reference](cli-reference.md).

## Supported platforms and models

### Supported platforms

Cortex Code CLI currently supports the following platforms:

| Platform | Architecture |
| --- | --- |
| macOS | arm64, x64 |
| Linux | x64, arm64 |
| Windows | WSL on x64/amd64  Native on x64 |

> **Note:**
>
> Snowflake may add support for other platforms from time to time. Please contact your Snowflake representative if you
> have a specific platform requirement.

### Supported models

Cortex Code CLI supports the following models. At least one of these models must be available to your account (for
example, by being included in your account’s allowlist, CORTEX_MODELS_ALLOWLIST). See [Control model access](../snowflake-cortex/aisql.md)
for more information.

Snowflake recommends specifying `auto` for the model. Cortex automatically selects the highest quality model available to your
account. When a new, more capable, model becomes available, `auto` then refers to that model.

To choose a different model, use the `/model` command inside a Cortex Code CLI session.

| Model | Identifier |
| --- | --- |
| Auto | `auto` |
| Claude Opus 4.6 | `claude-opus-4-6` |
| Claude Sonnet 4.6 | `claude-sonnet-4-6` |
| Claude Opus 4.5 | `claude-opus-4-5` |
| Claude Sonnet 4.5 | `claude-sonnet-4-5` |
| Claude Sonnet 4.0 | `claude-4-sonnet` |
| OpenAI GPT 5.2 | `openai-gpt-5.2` |

Model quality and capability vary, so choose a model based on your requirements.

### Cloud regions

If a model you want to use is not [available in your region](../snowflake-cortex/aisql.md), you can use Cortex
cross-region inference to access the model in another region where it is available. For more information about
configuring cross-region inference, see [Cross-region inference](../snowflake-cortex/cross-region-inference.md).

Cortex Code requires an `ACCOUNTADMIN` to configure
[CORTEX_ENABLED_CROSS_REGION](../snowflake-cortex/cross-region-inference.md) to one of the following
values.

The following table shows the models that are available for each cross-region inference setting:

| Model | Cross-cloud  (Any region) | AWS US  (Cross-region) | AWS EU  (Cross-region) | AWS APJ  (Cross-region) | Azure US  (Cross-region) | Azure EU  (Cross-region) |
| --- | --- | --- | --- | --- | --- | --- |
| `claude-opus-4-6` | ✔ | ✔ | ✔ |  |  |  |
| `claude-sonnet-4-6` | ✔ | ✔ | ✔ |  |  |  |
| `claude-opus-4-5` | ✔ | ✔ | ✔ |  |  |  |
| `claude-sonnet-4-5` | ✔ | ✔ | ✔ |  |  |  |
| `claude-4-sonnet` | ✔ | ✔ | ✔ | ✔ |  |  |
| `openai-gpt-5.2` | \* |  |  |  | \* |  |

**\*** Indicates a preview model. Preview models are not suitable for production workloads.

To enable cross-region inference, an ACCOUNTADMIN must run:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'AWS_US';
```

Replace `AWS_US` with the appropriate region identifier.

> **Important:**
>
> **Cross-region inference is required when the selected model is not available in your region.** We recommend the
> following settings based on your needs:
>
> * **AWS_US**: Recommended for the best experience with **Claude Opus 4.x** models.
> * **AWS_EU**: Access Claude models from the EU.
> * **AWS_APJ**: Access Claude models from APJ (may be limited to Claude Sonnet 4.0).
> * **ANY_REGION**: Access **all** available models (best-effort global routing).
> * **AZURE_US**: Access OpenAI GPT 5.2.
>
> Your organization can restrict model access, so you may not have access to all models. See
> [Control model access](../snowflake-cortex/aisql.md) for details.

## Legal notices

Where your configuration of Cortex Code uses a model provided on the
[Model and Service Pass-Through Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/model-pass-through-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Cortex Code CLI agent tools
source: https://docs.snowflake.com/en/user-guide/cortex-code/tools.md
section: Cortex Code
---

# Cortex Code CLI agent tools

Cortex Code has access to a comprehensive set of tools for file operations, shell commands, Web access, and more.
You don’t need to install anything extra; these tools are built into Cortex Code CLI and ready to use. Cortex Code
automatically uses appropriate tools based on your requests. You do not need to invoke them manually; just describe
what you want. For example:

```text
Read the first 10 lines of the file src/main.py
Search for TODO comments in all Python files
Execute a bash command to list running processes
```

When creating custom skills, you must specify the tools the skill can use. See [Skills](extensibility.md).

## File tools

### Read

Read file contents from the local filesystem. Supports:

* Text files with line numbers
* Images (PNG, JPG, etc.) - displayed visually
* PDFs - page-by-page extraction
* Jupyter notebooks - cells with outputs
* Line ranges: @file.py$10-20

### Write

Create or overwrite files. Supports:

* Creates parent directories automatically
* Tracks line changes for session statistics
* Overwrites existing files

### Edit

Search and replace in files. Supports:

* Exact string replacement
* Diff preview before changes
* Supports replace_all for global replacement

### Glob

Find files by pattern matching. Examples:

| Pattern | Description |
| --- | --- |
| `**/*.py` | All Python files |
| `src/**/*.ts` | TypeScript files in `src/` directory |
| `**/test_*.py` | Python test files |
| `!node_modules` | Exclude patterns |

### Grep

Search file contents using a regular expression. Supports:

* Recursive search
* Regex patterns
* Binary file detection
* Output modes: content, files, count

## Shell tools

### Bash

Execute shell commands. Supports:

* Streaming output
* Background execution (run_in_background)
* Timeout control (default 2 min, max 10 min)
* Sandbox runtime support

### BashOutput

Retrieve output from a background shell process.

* Filter output by regex
* Status checking
* Use with run_in_background

### KillShell

Terminate running background shells.

## Agent tools

### RunSubagent

Launch subagents for specialized tasks. Types:

* general-purpose: All tools, research tasks
* Explore: Fast codebase exploration
* Plan: Architecture and planning
* Custom agents from .cortex/agents/

See [Subagents](extensibility.md) for details.

### AskUserQuestion

Prompt user for input during execution. Supports:

* Multiple choice questions
* Free-form input
* Multi-select options

### Review

Launch a review subagent for quality assurance.

## Web tools

### WebSearch

Search the Web using multiple engines. Supports:

* Fallback search engines
* Snippet extraction
* Result caching
* 30-second timeout

> **Note:**
>
> WebSearch requires enabling web search in the Cortex Code settings in Snowsight. See [Web search](cortex-code-snowsight.md).

### WebFetch

Retrieve content from web URLs. Supports:

* HTML to text conversion
* Content extraction
* Max 10,000 characters
* 30-second timeout

## Snowflake tools

### SnowflakeSqlExecute

Execute SQL queries on Snowflake. Supports:

* Permission checks
* Result caching
* Token refresh
* Large result offloading

### SnowflakeObjectSearch

Semantic search for database objects.

|  |  |
| --- | --- |
| Searches | tables, viewss, schemas, databases, functions |
| Returns | names, columns, descriptions |

### SnowflakeProductDocs

Search Snowflake documentation. Supported categories:

* User guide
* SQL reference
* Developer guide
* Cortex Code topics

### ReflectSemanticModel

Validate Cortex Analyst semantic models. Validation stages:

* File existence
* YAML syntax
* Schema validation
* Server-side validation

### SnowflakeMultiCortexAnalyst

Execute Cortex Analyst queries. Supports:

* Natural language to SQL
* Semantic model support
* Verified Query Retrieval

## Data tools

### DataDiff

Compare data between databases/tables. Supports:

* Snowflake connection handling
* Account identifier derivation
* 300-second timeout

### NotebookExecute

Execute Jupyter notebooks. Supports:

* Timeout control
* Kernel management
* Parameter injection
* Custom Python environments

### NotebookEdit

Edit Jupyter notebook cells. Supported modes:

* replace: Replace cell content
* insert: Add new cell
* delete: Remove cell

## Plan mode tools

### EnterPlanMode

Request plan mode for complex tasks. Supports:

* User approval workflow
* Automatic invocation for multi-step tasks

### ExitPlanMode

Present plan to user and exit plan mode. Supports:

* Plan confirmation
* Streaming control

## Memory tools

### Memory

Store and retrieve information across sessions. Supported commands:

* view: See stored memories
* create: Store new memory
* str_replace: Update memory
* insert: Add to memory
* delete: Remove memory
* rename: Rename memory file

> **Note:**
>
> The Memory tool must be enabled by setting the CORTEX_ENABLE_MEMORY environment variable.

## Permission levels

Tools have different permission requirements:

| Level | Tools | Behavior |
| --- | --- | --- |
| Safe | Read, Glob, Grep | Auto-approved |
| Low | Write (new files) | Usually auto-approved |
| Medium | Edit, Bash (safe) | Prompts in Confirm mode |
| High | Bash (risky), SQL write | Always prompts |
| Critical | rm -rf, sudo | Extra confirmation |

See [Security](security.md) for details.

---
title: Cortex Code CLI extensibility
source: https://docs.snowflake.com/en/user-guide/cortex-code/extensibility.md
section: Cortex Code
---

# Cortex Code CLI extensibility

Cortex Code CLI can be extended with custom behaviors, specialized agents, lifecycle hooks, and external tool integrations. This topic covers the four main extensibility mechanisms:

Skills
:   Markdown files that inject domain-specific knowledge and instructions into conversations. Use skills to teach Cortex Code about your organization’s best practices, coding standards, or specialized workflows.

Subagents
:   Autonomous, specialized AI agents that handle specific tasks independently. Subagents enable parallel execution, focused expertise, and complex multi-step workflows.

Hooks
:   Scripts that intercept and customize Cortex Code’s behavior at key lifecycle points. Use hooks to validate tool inputs, log operations, or enforce policies.

MCP (Model Context Protocol)
:   An open standard for connecting Cortex Code to external tools and data sources such as GitHub, Jira, and databases.

## Skills

Skills extend Cortex Code with domain-specific knowledge and capabilities by injecting specialized instructions and enabling additional tools.

### What are skills?

Skills are markdown files containing:

* Domain-specific instructions and best practices
* When to use the skill
* Example workflows
* Optional tool configurations

When you invoke a skill, its instructions are injected into the conversation context.

### Using skills

Run `/skill list` to list available skills, and invoke them by name to load the skill into the conversation.

### Skill locations

Skills are loaded from multiple locations, listed below from highest to lowest priority:

| Location | Path | Scope |
| --- | --- | --- |
| Project | `.cortex/skills/` or `.claude/skills/` | Project |
| User | `~/.snowflake/cortex/skills/` or `~/.claude/skills/` | User |
| Global | `~/.snowflake/cortex/skills/` | System |
| Session | Added temporarily | Session |
| Remote | Cloned from git | Cache |
| Bundled | Built into Cortex Code | System |

### Creating custom skills

Skills are directories containing a `SKILL.md` file with skill instructions, and optional examples and templates.
You can create skills in one of the following locations:

| Scope | Path |
| --- | --- |
| Project | `.cortex/skills/` or `.claude/skills/` in the project directory |
| Global | `~/.snowflake/cortex/skills/` |
| User | `~/.claude/skills/` |

To start building a custom skill:

1. Create the skill directory. This example creates a skill directory named “my-skill” in the project location:

   ```bash
   mkdir -p .cortex/skills/my-skill
   ```
2. Create `SKILL.md` in this directory and add skill instructions. This example shows the basic structure:

   ```markdown
   ---
   name: my-skill
   description: Brief description of what this skill does
   tools:
   - optional_tool_name
   ---

   # When to Use

   - Describe when this skill should be invoked
   - List specific user intents or scenarios

   # What This Skill Provides

   Explain the capabilities and knowledge this skill adds.

   # Instructions

   Step-by-step guidance for the AI when this skill is active.

   ## Best Practices

   - Best practice 1
   - Best practice 2

   ## Common Patterns

   ### Pattern 1
   Description and example.

   ### Pattern 2
   Description and example.

   # Examples

   ## Example 1: Basic Usage
   User: $my-skill Do something Assistant: [Expected behavior]

   ## Example 2: Advanced Usage
   User: $my-skill Complex task with @file.txt Assistant: [Expected behavior]
   ```
3. Verify that your skill appears in the listing using the `$$` command:

   ```text
   > $$
   ```

   If your skill is listed, it was loaded correctly and is available for use.
4. Use your skill in a conversation:

   ```text
   > $my-skill Test it out
   ```

#### Custom skill settings

Each skill’s options are defined in the YAML frontmatter at the top of `SKILL.md`.
The following options are supported:

| Option | Description |
| --- | --- |
| name: <skill name> | Required: Unique identifier |
| description: <description> | Required: Shown in $$ listing |
| tools: | Optional: Tools to enable in this skill |
| - tool_name_1 |  |
| - tool_name_2 |  |

This example shows a skill that uses two tools:

```markdown
---
name: database-admin
description: Database administration tasks
tools:
- snowflake_sql_execute
- snowflake_object_search
---
```

#### Skill best practices

To write effective skills, follow these guidelines:

* Be specific: Clear instructions produce better results
* Provide examples: Show expected inputs and outputs
* Include edge cases: Handle common errors and exceptions
* Keep focused: One skill equals one domain or capability

### Managing skills

| Slash command | Description |
| --- | --- |
| `/skill` | Interactive skill manager |
| `/skill list` | List all skills |
| `/skill sync <name>` | Sync to global location |
| `/skill add <git-url>` | Add remote skill |

#### Skill conflicts

When the same skill exists in multiple supported locations, and the contents differ, a conflict occurs, and a conflict
indicator appears in the skill listing. Use `/skill sync` to resolve conflicts by syncing the local scope to the
global scope.

### Composing skills

Custom skills can reference other skills, or combine skills with file context:

```text
> $code-review Review @src/auth.py following $security-guidelines
```

### Remote skills

You can add remote skills from Git repositories. A repo can contain any number of skills. The layout of the remote
skills should match the local skill structure.

```text
/skill add https://github.com/org/my-skills.git
```

Remote skills are cached locally. To update, use `/skill sync`.

### Skill command reference

CLI commands:

```bash
cortex skill list
cortex skill add <path>
cortex skill remove <path>
```

Slash commands:

```text
/skill list
/skill add <path>
```

### Skill troubleshooting

Skill not activating
:   * Use specific language related to the skill’s purpose
    * Mention the skill explicitly: “Use semantic-view-optimization”
    * Check availability: `/skill list`

Unexpected behavior
:   * Provide more context about your goal
    * Try a more specific request
    * Submit feedback: `/feedback`

## Subagents

Subagents are autonomous, specialized AI agents that handle specific tasks independently. They enable parallel
execution, focused expertise, and complex multi-step workflows.

Subagents:

* Execute independently from the main conversation
* Have their own context and tool access
* Can run in the foreground or background
* Specialize in specific domains or tasks

### Built-in subagent types

#### `general-purpose`

All-purpose agent with access to all tools. It’s best for:

* Complex research tasks
* Multi-step code changes
* Tasks requiring multiple tools

#### `explore`

Fast codebase exploration specialist. Best for:

* Finding files by patterns
* Searching code for keywords
* Understanding codebase structure
* Quick reconnaissance

You can specify how thoroughly the Explore agent searches:

* `"quick"`: Basic search
* `"medium"`: Moderate exploration
* `"very thorough"`: Comprehensive analysis

#### `plan`

Designs and outlines complex implementation plans. Best for:

* Designing implementation strategies
* Identifying critical files
* Evaluating architectural trade-offs
* Creating step-by-step plans

#### `feedback`

Structured feedback collection. Best for:

* Gathering user input
* Structured questions
* Session feedback

### Running subagents

Cortex Code automatically delegates to subagents when appropriate. For example, this query delegates to an Explore agent:

```text
> Find all files that import the authentication module
```

You can also explicitly request specific subagent types by name:

```text
> Use an Explore agent to find all database query definitions
> Use the Explore agent to find all API endpoint definitions
> Launch a Plan agent to design the authentication refactor
```

You can also request that multiple subagents run in parallel to tackle different aspects of a task:

```text
> In parallel, search for all test files and all config files
```

Agents can run in the background while you continue working:

```text
> Run a background agent to refactor all the test files
```

The agent starts immediately and returns an agent ID for tracking. When the agent completes, you can retrieve its output using its ID:

```text
> Get the output from agent abc1234
```

To monitor the status of all running subagents, use the `/agents` command (or press Ctrl-B) to open the background
process viewer. You can stop a running agent using its ID, or with the `/agents` interface:

```text
> kill agent abc1234
```

Killed agents stop running, but retain their context indefinitely. You can resume a killed agent using its ID:

```text
> Resume agent abc1234 and continue from where it left off
```

### Agent types

Autonomous
:   An autonomous agent runs without user interaction. The agent:

    * Completes independently
    * Never blocks for questions
    * Is suited for well-defined tasks

Non-Autonomous
:   A non-autonomous agent can pause execution to ask the user questions. The agent:

    * May ask clarifying questions
    * Can request permissions interactively
    * Is suited for tasks needing guidance

Custom
:   Custom agents are user-defined subagents with specialized prompts and configurations. You create agents tailored
    to specific domains or workflows in Markdown files, similar to custom skills.

### Creating custom subagents

Custom subagents are defined in Markdown files with YAML front matter. The front matter specifies the agent’s name, description, tool access, and model.
The body contains the system prompt that guides the agent’s behavior.

You can store custom agent Markdown files in one of three locations:

| Scope | Path |
| --- | --- |
| Project | .cortex/agents/ or .claude/agents/ |
| Global | ~/.snowflake/cortex/agents/ |
| User | ~/.claude/agents/ |

The format of an agent definition is shown below:

```markdown
---
name: my-agent
description: What this agent specializes in
tools:

- Bash
- Read
- Write

model: claude-sonnet-4-5
---

# System Prompt

You are a specialized agent for [domain].

## Your Responsibilities

1. Task 1
2. Task 2

## Guidelines

- Guideline 1
- Guideline 2

## Output Format

Describe expected output format.
```

#### Example: Test Runner agent

The following Markdown file defines a custom Test Runner agent that runs tests and summarizes results:

```markdown
---
name: test-runner
description: Runs tests and reports results
tools:
- Bash
- Read
- Grep
---

# Test Runner Agent

You run tests and provide clear reports of the results.

## Process

1. Identify the test framework (pytest, jest, go test, etc.)
2. Run appropriate test command
3. Parse and summarize results
4. Highlight failures with relevant code context

## Output Format

## Test Results Summary
- Total: X
- Passed: Y
- Failed: Z

## Failures
### Test Name
- File: path/to/file.py
- Error: Description
- Relevant code snippet
```

#### Agent configuration

A custom agent’s configuration is specified in the Markdown file’s YAML front matter.

Tool access
:   Agents can specify which tools they have access to:

    ```yaml
    tools:
    - "*"           # All tools
    - Bash          # Specific tools
    - Read
    - Write
    ```

Model selection
:   You can choose a model for a specific agent. This overrides the session’s default model.

    ```yaml
    model: claude-sonnet-4-5   # Specific model
    model: auto                # Cost-optimized
    ```

### Worktree isolation

Agents can be run in isolated git worktrees, or branches. When you request worktree isolation, Cortex Code CLI creates a separate git
worktree for the agent to operate in. This allows multiple agents to run in parallel without conflicting changes, and is easy to clean up afterward.
Isolated worktrees are particularly useful for exploration and experimentation. The git branch created by the agent is named `agent/<agentId>`.

To use worktree isolation, simply include it in your prompt:

```text
> Run a background agent with worktree isolation to implement feature X
```

### Swarm pattern

You can launch a swarm of agents to tackle different aspects of a complex task in parallel. Each agent works
independently, and results are aggregated when all agents finish. All types of agents can participate in a swarm.

Use cases for swarms include:

* Code Analysis: Multiple agents analyze different aspects
* Refactoring: Parallel agents handle different files
* Testing: Agents run different test suites
* Documentation: Agents document different components

To create a swarm, simply describe the different agents you want to launch:

```text
> Launch a swarm of agents:
> 1. Explore agent to find all database queries
> 2. Explore agent to find all API endpoints
> 3. Explore agent to find all test files
```

### Subagent best practices

Use subagents for:

* Complex tasks: Break into subtasks for parallel execution
* Exploration: Use Explore agent for codebase searches
* Planning: Use Plan agent before major changes
* Background work: Long-running tasks that don’t need attention

Subagents may not be ideal for:

* Simple queries: Direct tools are faster
* Single-file edits: Main agent is more efficient
* Interactive work: When you need immediate feedback

Detailed prompts are generally more effective:

| Good | Find all Python files that contain database queries and list them with line numbers |
| --- | --- |
| Better | Use the Explore agent (very thorough) to find all Python files containing database queries. For each file, extract the query patterns and identify potential SQL injection risks. |

### Viewing active subagents

`/agents` command
:   Issue the `/agents` command in a Cortex Code session to open the interactive agent viewer.
    This interface shows all running agents, their types, statuses, and output previews.

Background process viewer
:   In a Cortex Code CLI session, press Ctrl-B to view:

    * All background processes
    * Agent sessions
    * Bash processes

### Agent limits

The following limits apply to subagents in Cortex Code CLI:

* Maximum 50 concurrent background agents
* Agents inherit session permissions
* Background agents cannot spawn other background agents

## Hooks

Hooks allow you to intercept and customize Cortex Code’s behavior at key lifecycle points. A hook is a prompt or shell script that executes in response to an event:

* Before tool use: Validate or modify tool inputs
* After tool use: Add context or log results
* On user input: Inject session context
* On session events: Initialize or cleanup

### Hook events

The following events can trigger hooks:

| Event | Description | Can block |
| --- | --- | --- |
| PreToolUse | Before tool execution | Yes |
| PostToolUse | After tool execution | No |
| PermissionRequest | When permission is needed | Yes |
| UserPromptSubmit | When user submits prompt | No |
| SessionStart | When session starts | No |
| SessionEnd | When session ends | No |
| PreCompact | Before context compaction | No |
| Stop | When user stops Claude | No |
| SubagentStop | When subagent stops | No |
| Notification | On system notifications | No |
| Setup | During initialization | No |

### Configuring hooks

Hooks are configured in settings files, which can be in any configuration directory (listed below from highest to lowest priority):

| Location | Path |
| --- | --- |
| Local | `.claude/settings.local.json` or `.cortex/settings.local.json` |
| Project | `.claude/settings.json` or `.cortex/settings.json` |
| User | `~/.claude/settings.json` |
| Global | `~/.snowflake/cortex/hooks.json` |

Hooks are defined in JSON format, specifying the event, tool matcher, and hook actions. A simple example of a pre-tool-use hook is shown below:

```json
{
  "hooks": {
    "PreToolUse": [
      {
        "matcher": "Bash",
        "hooks": [
          {
            "type": "command",
            "command": "bash .claude/hooks/validate-bash.sh",
            "timeout": 60
          }
        ]
      }
    ]
  }
}
```

Two hook types are supported: command hooks and prompt hooks.

* Command hooks run shell commands or scripts.

  ```json
  {
    "type": "command",
    "command": "bash /path/to/script.sh",
    "timeout": 60,
    "enabled": true
  }
  ```
* Prompt hooks are evaluated as natural language prompts for a language model.

  ```json
  {
    "type": "prompt",
    "prompt": "Is this command safe? $ARGUMENTS",
    "timeout": 30
  }
  ```

To execute your hook only on specific tools, place tool names or patterns in the `matcher` field. For example, to
match all SQL tools, use `"matcher": "SQL*"`. You can use regular expressions to match multiple tools.

| Pattern | Matches |
| --- | --- |
| `*` | All tools |
| `Bash` | Only Bash |
| `Edit|Write` | Edit or Write |
| `mcp__.*` | All MCP tools |
| `Notebook.*` | NotebookEdit, NotebookExecute |

### Writing hook scripts

Hook scripts accept JSON input via standard input and return JSON output via standard output.
The output contains a field indicating whether the operation is allowed or denied. Optionally,
the hook script can pass back a modified version of the tool input.

Sample input:

```json
{
  "session_id": "abc123",
  "transcript_path": "/path/to/transcript.json",
  "cwd": "/working/directory",
  "permission_mode": "default",
  "hook_event_name": "PreToolUse",
  "tool_name": "Bash",
  "tool_input": {
    "command": "ls -la"
  }
}
```

Sample output:

```json
{
  "decision": "allow",
  "systemMessage": "Note: This operation was validated.",
  "hookSpecificOutput": {
    "hookEventName": "PreToolUse",
    "updatedInput": {
      "command": "ls -la --color=never"
    }
  }
}
```

The return code indicates whether to block the operation:

* 0: Do not block
* 2: Block

This information can also be returned as part of the JSON output as shown below.

```json
{
  "decision": "block",
  "reason": "Operation not allowed"
}
```

The following environment variables are available in hook scripts:

| Variable | Description |
| --- | --- |
| `CORTEX_PROJECT_DIR` | Project directory path |
| `CORTEX_CODE_REMOTE` | `"true"` if web context |
| `CORTEX_ENV_FILE` | Persistent env file path |

### Hook examples

The following examples illustrate possible output for common hook use cases.

#### Modify Tool Input

```json
{
  "hookSpecificOutput": {
    "hookEventName": "PreToolUse",
    "updatedInput": {
      "command": "modified command"
    }
  }
}
```

#### Add Context

```json
{
  "hookSpecificOutput": {
    "hookEventName": "PostToolUse",
    "additionalContext": "Note: File was recently modified."
  }
}
```

#### Show System Messages

```json
{
  "systemMessage": "Warning: This operation may take a while."
}
```

#### Permission Decisions

```json
{
  "hookSpecificOutput": {
    "hookEventName": "PreToolUse",
    "permissionDecision": "allow",
    "permissionDecisionReason": "Auto-approved by policy"
  }
}
```

#### Remote Hooks

You can reference scripts in git repositories as shown below:

```json
{
  "type": "command",
  "command": "bash",
  "source": {
    "source": "github:org/hooks-repo/scripts/validate.sh",
    "ref": "main"
  }
}
```

### Hook best practices

* Keep hooks fast: Timeouts default to 60 seconds
* Handle errors gracefully: Return exit 0 if uncertain
* Log for debugging: Write to files for troubleshooting
* Use matchers: Target specific tools, not all
* Test thoroughly: Use hooks manager to verify behavior

## Model Context Protocol (MCP)

You can connect Cortex Code CLI to external tools and data sources with Model Context Protocol (MCP). MCP is an open
standard for connecting AI agents to external tools such as GitHub, Jira, and databases. Once configured, MCP servers
give Cortex Code access to hosted tools beyond built-in capabilities.

### Transport types

Cortex Code supports three MCP transport types:

| Type | Use Case | Connection |
| --- | --- | --- |
| stdio | Local tools, CLI wrappers | Subprocess with stdin/stdout |
| http | Web services, APIs | HTTP requests |
| sse | Real-time services | Server-Sent Events |

You can use OAuth to authenticate to HTTP MCP servers. The first time you connect to a server configured for OAuth,
Cortex Code CLI opens a browser window, where the user authenticates. The resulting token is stored in
`~/.snowflake/cortex/mcp_oauth/` and automatically refreshed as needed. The following is a sample OAuth configuration:

```json
{
   "oauth": {
      "client_id": "pre-registered-client-id",
      "client_name": "My Client",
      "redirect_port": 8585,
      "scope": "openid mcp read write",
      "authorization_server_url": "https://auth.example.com"
   }
}
```

### Managing MCP servers

You can issue the `/mcp` command in an interactive Cortex Code CLI session to open an interactive MCP status viewer.
Use the `cortex mcp` command to manage MCP server configurations from the command line.

| Command | Description |
| --- | --- |
| **Command line** | **Description** |
| cortex mcp add | Add a new server (see below) |
| cortex mcp list | List configured servers |
| cortex mcp get <server> | Get details for a specific server |
| cortex mcp remove <server> | Remove a server |
| cortex mcp start <server> | Check server status and available tools |

#### Adding a server

The `cortex mcp add` command accepts options for configuring servers.

```bash
cortex mcp add <name> <command> [args...]
```

Options:

```text
--transport, -t    Transport type (stdio, http, sse)
--type             Alias for --transport
--env, -e          Environment variable (KEY=value)
--header, -H       HTTP header
--timeout          Connection timeout in ms
```

> **Note:**
>
> MCP tools are namespaced to avoid conflicts, using the format below:
>
> ```text
> mcp__{server-name}__{tool-name}
> ```
>
> For example, a tool called `search` from server `github` is given the name `mcp__github__search`.

### MCP configuration

MCP server configuration is stored in `~/.snowflake/cortex/mcp.json` under the key `mcpServers`. The following example
shows the structure of a configuration file with a single MCP server:

```json
{
   "mcpServers": {
      "server-name": {
         "type": "stdio",
         "command": "command-to-run",
         "args": ["arg1", "arg2"]
      }
   }
}
```

#### Environment variables

Use the `${VAR}` or `$VAR` syntax to insert the values of environment variables into the configuration file.

```json
{
"mcpServers": {
   "my-server": {
      "type": "http",
      "url": "https://api.example.com",
      "headers": {
      "Authorization": "Bearer ${MY_API_TOKEN}"
      }
   }
}
```

> **Important:**
>
> It is a best practice to use environment variables for credentials. Never hardcode tokens in `mcp.json`.
> Add a line to your shell’s profile, such as `~/.bashrc` or `~/.zshrc`, like the following:
>
> ```bash
> export GITHUB_TOKEN="your_token_here"
> ```

#### Configuration from the command line

To add an MCP server from the command line, use the `cortex mcp add` command. For example:

| Action | Command |
| --- | --- |
| Add stdio server | `cortex mcp add git-server uvx mcp-server-git` |
| Add HTTP server | `cortex mcp add api-server https://api.example.com --type http` |
| Add with environment variables | `cortex mcp add my-server npx my-mcp-server -e API_KEY=secret` |
| Add with headers | `cortex mcp add my-server https://api.example.com -H "Authorization: Bearer token"` |

### Using MCP tools

Once configured, MCP tools are available automatically in Cortex Code CLI sessions. You invoke them via natural
language commands:

```text
Show me recent GitHub pull requests
Create a Jira ticket for this bug
Query the PostgreSQL database for user activity
```

Permissions are requested on first use. Configure defaults in `~/.snowflake/cortex/permissions.json`:

```json
{
  "allow": ["mcp__github__read_file", "mcp__github__list_repos"],
  "deny": ["mcp__github__delete_repo"]
}
```

### Sample MCP configurations

The following examples illustrate MCP server configurations for common use cases.

#### Git Server (stdio)

```json
{
  "mcpServers": {
    "git": {
      "type": "stdio",
      "command": "uvx",
      "args": ["mcp-server-git", "--repository", "/path/to/repo"]
    }
  }
}
```

#### HTTP API with OAuth

```json
{
  "mcpServers": {
    "my-api": {
      "type": "http",
      "url": "https://api.example.com/mcp",
      "oauth": {
        "client_id": "my-client-id",
        "redirect_port": 8585,
        "scope": "openid mcp"
      }
    }
  }
}
```

#### SSE Server with Headers

```json
{
  "mcpServers": {
    "realtime": {
      "type": "sse",
      "url": "https://realtime.example.com/events",
      "headers": {
        "Authorization": "Bearer ${API_TOKEN}",
        "X-Custom-Header": "value"
      },
      "timeout": 30000
    }
  }
}
```

#### Sourcegraph Integration

```json
{
  "mcpServers": {
    "sourcegraph": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@sourcegraph/mcp-server"],
      "env": {
        "SRC_ACCESS_TOKEN": "${SOURCEGRAPH_TOKEN}",
        "SRC_ENDPOINT": "https://sourcegraph.company.com"
      }
    }
  }
}
```

### MCP troubleshooting

Server not connecting
:   * Check `/mcp` during a session to make sure it is listed
    * Use `cortex mcp start <server>` to test connectivity
    * Ensure credentials are correctly set in environment variables
    * `cat ~/.snowflake/cortex/logs/mcp.log` to review the log for clues

Tools not appearing
:   * Run `cortex mcp list` to verify configuration
    * Make sure tool names are valid (contain only alphanumeric characters, underscores, and hyphens)
    * Check that tool names are shorter than 64 characters

OAuth issues
:   * Clear cached tokens: `rm ~/.snowflake/cortex/mcp_oauth/{server}*`
    * Reconnect to trigger new OAuth flow
    * Check redirect port is available (default 8585)

Environment variables not expanding
:   * Use `${VAR}` syntax (with braces) rather than `$VAR`
    * Ensure variable is set in your shell (`echo $VAR`)
    * Check for typos in variable names

### MCP best practices

* Use descriptive server names: Makes tool namespacing clear
* Set appropriate timeouts: Default is 10 minutes for tool listing
* Secure credentials: Use environment variables, not hardcoded secrets
* Test connections: Use `cortex mcp start` before relying on a server

---
title: Cortex Code CLI keyboard shortcuts
source: https://docs.snowflake.com/en/user-guide/cortex-code/keyboard-shortcuts.md
section: Cortex Code
---

# Cortex Code CLI keyboard shortcuts

Master Cortex Code CLI with these keyboard shortcuts for efficient navigation.

## Input shortcuts

| Shortcut | Action |
| --- | --- |
| Enter | Submit message |
| Ctrl-J | Insert newline (multiline input) |
| Ctrl/Cmd-V | Paste from clipboard |
| Ctrl-C | Cancel/interrupt (double-tap to exit) |
| Esc | Dismiss suggestions (double-tap to clear input) |
| Ctrl-K | Kill line (clear input) |
| Ctrl-Y | Yank (paste killed text) |
| Ctrl-A | Move to line start |
| Ctrl-E | Move to line end |

## View shortcuts

| Shortcut | Action |
| --- | --- |
| Ctrl-T | Open table viewer (cycle forward) |
| Ctrl-Shift-T | Cycle table viewer backward |
| Ctrl-P | Toggle compact/expanded mode |
| Ctrl-O | Open full transcript viewer |
| Ctrl-B | View background bash processes |
| Ctrl-D | Open todo/task viewer |
| Ctrl-G | Open web search results |
| ? | Toggle help overlay |

## Mode Shortcuts

| Shortcut | Action |
| --- | --- |
| Shift-Tab | Cycle operational modes |

Modes cycle in this order:

* Confirm actions
* Plan mode
* Bypass safeguards

## History Navigation

| Shortcut | Action |
| --- | --- |
| Ctrl-R | Reverse search history (Emacs-style) |
| Ctrl-S | Forward search history |
| Ctrl-G | Cancel history search |
| Up / Down | Navigate history (when at top/bottom of input) |
| Option-Up | Previous history entry |
| Option-Down | Next history entry |

### Table viewer shortcuts

Additional shortcuts for the table viewer (Ctrl-T):

| Shortcut | Action |
| --- | --- |
| Tab | Cycle to next table |
| Shift-Tab | Cycle to previous table |
| c | Copy query to clipboard |

## Auto-complete triggers

Typing one of the trigger characters below begins auto-completion for commands, file paths, skills, or Snowflake tables.

| Trigger | Action |
| --- | --- |
| / | Slash command completion |
| @ | File path completion |
| $ | Skill completion |
| # | Snowflake table completion |
| Tab | Accept suggestion |
| Up / Down | Navigate suggestions |

## Display modes

Press Ctrl-P to toggle between Compact and Expanded display modes for tool execution details.

Compact mode (Default)
:   * Minimal tool execution display
    * Shows summary of operations

Expanded mode
:   * Full tool execution details
    * Complete input/output display

## Quick reference card

```text
┌──────────────────────────────────────────────────────────┐
│                  CORTEX CODE SHORTCUTS                   │
├──────────────────────────────────────────────────────────┤
│  INPUT                  │  VIEW                          │
│  Enter      Submit      │  Ctrl-T    Table viewer        │
│  Ctrl-J     Newline     │  Ctrl-P    Toggle compact      │
│  Ctrl-C     Cancel      │  Ctrl-O    Transcript          │
│  Ctrl-R     History     │  Ctrl-B    Background bash     │
│  Esc Esc    Clear       │  Ctrl-D    Todo viewer         │
│                         │  ?         Help overlay        │
├─────────────────────────┼────────────────────────────────┤
│  NAVIGATION             │  AUTOCOMPLETE                  │
│  Up/k       Up          │  /         Commands            │
│  Down/j     Down        │  @         Files               │
│  g          Top         │  $         Skills              │
│  G          Bottom      │  #         Tables              │
│  q/Esc      Exit        │  Tab       Accept              │
├─────────────────────────┴────────────────────────────────┤
│  Shift-Tab  Cycle modes (Confirm → Plan → Bypass)        │
└──────────────────────────────────────────────────────────┘
```

## Tips

* **Use Ctrl-P often**: Switch between compact and expanded mode based on what you need to see.
* **Master Ctrl-R**: History search is powerful for repeating complex prompts.
* **Vim-style navigation**: The h/j/k/l keys work everywhere for cursor movement.
* **Double-tap patterns**: For example, Esc Esc clears input, Ctrl-C Ctrl-C exits.

---
title: Cortex Code CLI reference
source: https://docs.snowflake.com/en/user-guide/cortex-code/cli-reference.md
section: Cortex Code
---

# Cortex Code CLI reference

Command line reference for Cortex Code CLI.

## Starting Cortex Code

| Command | Description |
| --- | --- |
| `cortex` | Start in current directory |
| `cortex -c production` | Start with specific connection |
| `cortex -w /path/to/project` | Start in specific directory |
| `cortex -w /new/project -c myconn` | Combine workdir and connection |
| `cortex --continue` | Continue last session |
| `cortex --resume <session_id>` | Resume specific session |

## CLI options

| Option | Description |
| --- | --- |
| `-c, --connection <name>` | Use specific Snowflake connection |
| `-w, --workdir <path>` | Set working directory for file operations |
| `-m, --model <model_name>` | Specify AI model to use |
| `--plan` | Plan mode: require approval before all actions |
| `--bypass` | Automatically approve all planned actions |
| `--dangerously-allow-all-tool-calls` | Disable tool call permission prompts (caution) |
| `--continue` | Resume most recent conversation |
| `-r, --resume <session_id>` | Resume specific session by ID, or `last` for last session |
| `-p, --print  "<prompt>"` | Pass specified prompt, print response, and exit |
| `-f, --file <file>` | Read prompt from file, execute, and exit |
| `--output-format stream-json` | JSON output (for scripting) |
| `-V, --version` | Show installed version |
| `--help` | Show CLI help |

Connections must be defined in `~/.snowflake/connections.toml`. See [Cortex Code CLI](cortex-code-cli.md) for connection setup. Session IDs are shown at startup, at exit, and stored in `~/.snowflake/cortex/conversations/`.

### Examples

Start with working directory:

```bash
cortex -w /path/to/project
```

Resume last session with specific connection:

```bash
cortex --continue -c production
```

One-off prompt (JSON output):

```bash
cortex -p "List all Python files" --output-format stream-json
```

## Commands

### `update`

| Command | Description |
| --- | --- |
| `cortex update` | Update to latest version |
| `cortex --version` | Verify after update |

### `mcp`

| Command | Description |
| --- | --- |
| `cortex mcp list` | List configured servers |
| `cortex mcp add` | Add new server (interactive) |
| `cortex mcp remove <server_name>` | Remove server |

See [Model Context Protocol (MCP)](extensibility.md) for details.

## Interactive mode

### Keyboard shortcuts

| Shortcut | Action |
| --- | --- |
| `Ctrl+C` | Cancel current operation |
| `Ctrl+C Ctrl+C` | Exit Cortex Code CLI |
| `Ctrl+L` | Clear terminal screen (keeps conversation) |
| `Up/Down arrows` | Navigate command history |
| `Tab` | Command completion |

### Slash commands

#### Session management

| Command | Description |
| --- | --- |
| `/help` | Show interactive help |
| `/plan` | Enable planning mode |
| `/plan_off` | Disable planning mode |
| `/clear`, `/cls` | Clear the screen |
| `/new` | Start a new session |
| `/rename <title>` | Rename current session |
| `/exit`, `/quit` | Exit Cortex Code CLI |
| `/resume`, `/r`, `/sessions` | List and resume sessions |
| `/rewind` | Go back *n* steps in conversation or pick interactively |
| `/skill list` | List available skills |
| `/mcp-status` | Show MCP server status |
| `/fork` | Fork current session into a new session |

#### Model and mode

| Command | Description |
| --- | --- |
| `/model` | Show/select AI model |
| `/plan` | Enable plan mode |
| `/plan-off` | Disable plan mode |
| `/bypass` | Enable bypass mode (auto-approve all including tool calls) |
| `/bypass-off` | Disable bypass mode |
| `/status` | Show current configuration |

#### Snowflake and data

| Command | Description |
| --- | --- |
| `/sql <query>` | Execute SQL query |
| `/sql <query> --limit <n>` | Limit displayed rows |
| `/table [<file>]`, `/csv` | Open table viewer |
| `/connections`, `/conn` | Manage Snowflake connections |

#### Development tools

| Command | Description |
| --- | --- |
| `/sh`, `! <command>` | Execute shell command |
| `/diff`, `/changes`, `/review` | Review git changes |
| `/worktree` | Manage git worktrees |
| `/dbt` | dbt operations |
| `/lineage` | dbt lineage visualization |

#### Configuration

| Command | Description |
| --- | --- |
| `/settings` | View/modify settings |
| `/theme` | Select color theme |
| `/sandbox` | Manage sandbox settings |
| `/add-dir <path>` | Add working directory |

#### Extensibility

| Command | Description |
| --- | --- |
| `/skill`, `/skills` | Manage skills |
| `/mcp` | MCP server status |
| `/hooks` | View hooks configuration |
| `/commands`, `/cmds` | Manage custom commands |
| `/agents` | View subagents |

#### Utilities

| Command | Description |
| --- | --- |
| `/tasks` | Show task list |
| `/feedback` | Provide session feedback (Saved locally as a .tgz file) |
| `/update` | Update Cortex Code |

### Session storage

| Command | Description |
| --- | --- |
| `~/.snowflake/cortex/conversations/` | Session files |
| `~/.snowflake/cortex/settings.json` | General settings |
| `~/.snowflake/cortex/permissions.json` | Permission preferences |

See [Cortex Code CLI Settings](settings.md) for configuration details.

### Command details

#### `/sql`: Execute SQL examples

Basic query:

```text
/sql SELECT * FROM users
```

With row limit:

```text
/sql SELECT * FROM large_table --limit 1000
```

Multi-line queries (use Ctrl+J for newlines):

```text
/sql SELECT
  customer_id,
  SUM(amount) as total
FROM orders
GROUP BY customer_id
```

Results open automatically in the table viewer (Ctrl+T).

#### `/worktree`: Git worktrees

| Command | Description |
| --- | --- |
| `/worktree create feature-branch` | Create new worktree |
| `/worktree list` | List all worktrees |
| `/worktree switch feature-branch` | Switch to worktree |
| `/worktree delete feature-branch` | Delete worktree |

#### `/sandbox`: Sandbox control

| Command | Description |
| --- | --- |
| `/sandbox` | Interactive selector |
| `/sandbox on` | Enable container sandbox |
| `/sandbox off` | Disable container sandbox |
| `/sandbox status` | Show sandbox status |
| `/sandbox runtime on` | Enable OS sandbox |
| `/sandbox runtime off` | Disable OS sandbox |
| `/sandbox mode auto` | Auto-allow sandboxed commands |
| `/sandbox mode regular` | Prompt for all commands |

#### `/mcp`: MCP servers

| Command | Description |
| --- | --- |
| `/mcp` | Show status viewer |
| `/mcp list` | List all servers |
| `/mcp start <server>` | Start server |
| `/mcp get <server>` | Get server details |
| `/mcp remove <server>` | Remove server |

## Batch mode

| Command | Description |
| --- | --- |
| `cortex -p "<prompt>"` | Run single prompt and exit |
| `cortex -f request.txt` | Read prompt from file |
| `cortex --output-format stream-json -p "<prompt>"` | JSON output |
| `cortex -c prod --workdir /app -p "..."` | Control context |

## Exit codes

| Code | Description |
| --- | --- |
| `0` | Success |
| `1` | General error |
| `2` | Configuration error |
| `3` | Connection error |
| `4` | Permission denied |
| `130` | Interrupted by user (Ctrl+C) |

## Configuration and setup

### Updating Cortex Code CLI

Cortex Code CLI updates itself when a new version is available. You can also manually update to the latest version
by issuing `cortex update`. Issue `cortex update <version>` to install the specified version.

To disable automatic updates, edit `~/.snowflake/cortex/settings.json` and add `"autoUpdate": false`.

### Manually adding a connection

To manually create or edit the `~/.snowflake/connections.toml` file to define your connection, follow the steps below:

1. Create the `~/.snowflake/connections.toml` file if it doesn’t already exist.

   ```shell
   mkdir -p ~/.snowflake
   touch ~/.snowflake/connections.toml
   ```
2. Use the `chmod` command to set its permissions so that only you can read and write it.

   ```shell
   chmod 600 ~/.snowflake/connections.toml
   ```
3. Open the file in a text editor (here, `nano`).

   ```shell
   nano ~/.snowflake/connections.toml
   ```
4. Add lines like the following to define a connection. Enter the name of the connection in place of `myaccount` and
   replace the placeholder values with your Snowflake account details. Use browser-based SSO (external browser
   authentication) or PAT (programmatic access token). You can obtain a PAT from Snowsight (see
   [Using programmatic access tokens for authentication](../programmatic-access-tokens.md)). Include only the `authenticator` value or `password` value,
   depending on the authentication method you choose.

   ```toml
   [myaccount]
   account       = "<ACCOUNT>"
   user          = "<USERNAME>"
   authenticator = "externalbrowser" # For browser-based SSO; omit for PAT
   password      = "<PAT>"           # For PAT authentication; omit for SSO
   warehouse     = "<WAREHOUSE>"
   role          = "<ROLE>"
   database      = "<DATABASE>"
   schema        = "<SCHEMA>"
   ```
5. Save and close the file.

### Setting up shell completions

To give your shell the ability to auto-complete Cortex Code CLI commands and options, follow the instructions below for your shell.

> **Tip:**
>
> If you’re not sure which shell you’re using, issue `echo $(basename $SHELL)` in your terminal. The name printed is the default
> shell for your account, and may not be accurate if you have started a different shell manually.

| Shell | Command |
| --- | --- |
| `bash` | `cortex completion bash > ~/.bash_completion.d/cortex` |
| `zsh` | `cortex completion zsh > ~/.zsh/completions/_cortex` |
| `fish` | `cortex completion fish > ~/.config/fish/completions/cortex.fish` |

After running the appropriate command above for your shell, restart your shell with `exec $SHELL`.

### Directory structure

Installing Cortex Code CLI creates the following directory structure in your home directory:

```text
~/.snowflake/cortex/
   ├── settings.json          # Main configuration
   ├── mcp.json               # MCP server configs
   ├── conversations/         # Session history
   ├── skills/                # Global skills
   ├── commands/              # Custom commands
   ├── hooks/                 # Hook scripts
   ├── profiles/              # Team profiles
   └── cache/                 # Temporary cache
```

## Troubleshooting

Following are common error messages you may encounter during installation and setup.

### Command not found

Make sure that the installation directory `~/.local/bin` is included in your `PATH` environment variable.
For example, if you are using `bash`, issue the following commands:

```shell
export PATH="~/.local/bin:$PATH"
echo 'export PATH="~/.local/bin:$PATH"' >> ~/.bashrc
```

### Permission denied

Make sure that the `cortex` executable has execute permissions. Issue the following command:

```shell
chmod +x ~/.local/bin/cortex
```

### Connection errors

Make sure that the connection file `~/.snowflake/connections.toml` exists and contains valid connection details.

```shell
cat ~/.snowflake/connections.toml
```

Try invoking the `cortex` command with a connection explicitly specified using the `-c` option. For example:

```shell
cortex -c myaccount
```

## See also

[Cortex Code CLI](cortex-code-cli.md)
:   Installation, setup, and first prompts

[Cortex Code CLI Settings](settings.md)
:   Configuration file reference

[Cortex Code CLI workflow examples](workflows.md)
:   Capabilities and workflow examples

---
title: Cortex Code CLI sandbox
source: https://docs.snowflake.com/en/user-guide/cortex-code/sandbox.md
section: Cortex Code
---

# Cortex Code CLI sandbox

Cortex Code CLI can run shell commands inside a sandbox to restrict filesystem access, network
access, and process capabilities. Sandboxing adds a layer of isolation so the agent cannot
accidentally modify files or access resources outside of your project.

> **Important:**
>
> Support for this feature is experimental and may be subject to change.

## Platform support

The sandbox uses the operating system’s built-in isolation features to restrict commands.

| Platform | Implementation | Dependencies |
| --- | --- | --- |
| macOS | `sandbox-exec` (built-in) | `ripgrep` |
| Linux | `bubblewrap` | `bubblewrap`, `socat`, and `ripgrep` |
| Windows | Native restricted tokens | None |

### Installing dependencies

macOS:

```bash
brew install ripgrep
```

Debian / Ubuntu:

```bash
sudo apt-get install bubblewrap socat ripgrep
```

Fedora / RHEL:

```bash
sudo dnf install bubblewrap socat ripgrep
```

## Enabling the sandbox

Use the `/sandbox` slash command in Cortex Code CLI:

```text
/sandbox                          # Interactive selector
/sandbox runtime on               # Enable sandbox
/sandbox runtime off              # Disable sandbox
/sandbox runtime status           # Show sandbox status
/sandbox status                   # Show current sandbox status
```

You can also enable the sandbox in your settings file. Add a `sandbox` object to
`~/.snowflake/cortex/settings.json` (user-level) or `.snowflake/cortex/settings.json`
(project-level):

```json
{
  "sandbox": {
    "enabled": true
  }
}
```

The default permission mode is `"regular"`. To use auto-allow mode, set `"mode": "autoAllow"`
explicitly. See Permission modes.

## Permission modes

The sandbox has two permission modes that control how commands are approved:

| Mode | Setting value | Behavior |
| --- | --- | --- |
| Auto-allow | `"autoAllow"` | Commands that can be sandboxed run automatically without prompting. Commands that cannot be sandboxed (for example, those requiring network access to non-allowed domains) fall back to the normal permission flow. |
| Regular | `"regular"` | All commands prompt for approval, even when running inside the sandbox. |

Set the mode with the `/sandbox` command or in settings:

```text
/sandbox mode auto                # Set auto-allow mode
/sandbox mode regular             # Set regular mode
```

## Filesystem restrictions

The sandbox controls which paths commands can read from and write to.

### Default behavior

* **Working directory**: Always allowed for read and write.
* **Skills directory** (`~/.snowflake/cortex/skills`): Allowed.
* **Context directory** (`~/.snowflake/cortex/.ctx`): Allowed when `ctxAvailable` is enabled.

### Protected paths (always denied for write)

The following paths are always protected, regardless of your configuration:

* Shell configuration files: `~/.bashrc`, `~/.bash_profile`, `~/.zshrc`, `~/.zprofile`,
  `~/.profile`, `~/.bash_login`, `~/.bash_logout`
* Git hooks: `~/.git/hooks`, `.git/hooks`
* SSH configuration: `~/.ssh/authorized_keys`, `~/.ssh/config`
* Managed settings directories and files: `/Library/Application Support/Cortex/` (macOS),
  `/etc/cortex/` (Linux), `%ProgramData%\Cortex\` (Windows)

### Custom filesystem rules

Configure filesystem access in settings:

```json
{
  "sandbox": {
    "enabled": true,
    "filesystem": {
      "allowRead": [],
      "denyRead": ["/private/secrets"],
      "allowWrite": ["/tmp", "~/projects"],
      "denyWrite": ["/etc", "/var"]
    }
  }
}
```

| Setting | Default | Description |
| --- | --- | --- |
| `allowRead` | `[]` (allow all) | Paths the sandbox can read. An empty array means all paths are allowed (except those in `denyRead`). |
| `denyRead` | `[]` | Paths the sandbox cannot read. Takes precedence over `allowRead`. |
| `allowWrite` | `[]` (working directory only) | Paths the sandbox can write to. |
| `denyWrite` | `[]` | Paths the sandbox cannot write to. Takes precedence over `allowWrite`. |

> **Important:**
>
> Deny rules always take precedence over allow rules. If a path matches both `allowWrite`
> and `denyWrite`, the path is denied.

## Network restrictions

The sandbox can restrict which domains commands can access over the network.

```json
{
  "sandbox": {
    "enabled": true,
    "network": {
      "allowedDomains": ["github.com", "*.npmjs.org", "registry.yarnpkg.com"],
      "deniedDomains": ["*.internal.company.com"],
      "allowLocalBinding": false
    }
  }
}
```

| Setting | Default | Description |
| --- | --- | --- |
| `allowedDomains` | `[]` (allow all) | Domains the sandbox can access. An empty array means all domains are allowed (except those in `deniedDomains`). Supports wildcards (`*.example.com`). |
| `deniedDomains` | `[]` | Domains the sandbox cannot access. Takes precedence over `allowedDomains`. Supports wildcards. |
| `allowLocalBinding` | `false` | Whether sandboxed commands can bind to local ports. |

## Unsandboxed command fallback

Some commands may not be compatible with the sandbox. The `allowUnsandboxedCommands` setting
controls what happens when a command cannot run inside the sandbox.

| Setting | Behavior |
| --- | --- |
| `true` (default) | The agent can request to run the command on the host. You are prompted to approve. |
| `false` | Commands must run inside the sandbox or be listed in `excludedCommands`. If neither applies, the command fails. |

### Excluded commands

You can specify commands that should always run on the host, outside the sandbox:

```json
{
  "sandbox": {
    "enabled": true,
    "allowUnsandboxedCommands": true,
    "excludedCommands": ["docker", "kubectl"]
  }
}
```

Excluded commands bypass the sandbox and follow the normal permission flow.

## Settings reference

The complete `sandbox` settings object:

```json
{
  "sandbox": {
    "enabled": false,
    "mode": "regular",
    "allowUnsandboxedCommands": true,
    "excludedCommands": [],
    "permissions": {
      "allow": [],
      "deny": []
    },
    "network": {
      "allowedDomains": [],
      "deniedDomains": [],
      "allowLocalBinding": false
    },
    "filesystem": {
      "allowRead": [],
      "denyRead": [],
      "allowWrite": [],
      "denyWrite": []
    },
    "ctxAvailable": true
  }
}
```

| Setting | Default | Description |
| --- | --- | --- |
| `enabled` | `false` | Enable or disable the sandbox. |
| `mode` | `"regular"` | Permission mode: `"regular"` or `"autoAllow"`. |
| `allowUnsandboxedCommands` | `true` | Allow fallback to host execution when a command cannot be sandboxed. |
| `excludedCommands` | `[]` | Commands that always run on the host, outside the sandbox. |
| `permissions.allow` | `[]` | High-level permission allow rules. Supports patterns like `WebFetch(domain:example.com)`, `Edit(path)`, `Read(path)`, `Bash(command)`. |
| `permissions.deny` | `[]` | High-level permission deny rules. Same pattern syntax as `permissions.allow`. Takes precedence over allow rules. |
| `network.allowedDomains` | `[]` | Network domain allowlist (empty = allow all). Supports wildcards. |
| `network.deniedDomains` | `[]` | Network domain denylist. Takes precedence over allowlist. |
| `network.allowLocalBinding` | `false` | Allow sandboxed commands to bind to local ports. |
| `filesystem.allowRead` | `[]` | Read allowlist (empty = allow all except deny). |
| `filesystem.denyRead` | `[]` | Read denylist. Takes precedence. |
| `filesystem.allowWrite` | `[]` | Write allowlist. |
| `filesystem.denyWrite` | `[]` | Write denylist. Takes precedence. |
| `ctxAvailable` | `true` | Allow sandbox access to the context directory (`~/.snowflake/cortex/.ctx`), used for storing conversation context and session data. |

### Configuration scopes

Sandbox settings follow the same precedence as other Cortex Code settings:

1. **Project-level** (highest priority): `.snowflake/cortex/settings.json`
2. **User-level**: `~/.snowflake/cortex/settings.json`
3. **Managed/enforced**: Administrators can enforce sandbox policy via the managed settings file.
   See [Managed settings (organization policy)](settings.md).

---
title: Cortex Code CLI Settings
source: https://docs.snowflake.com/en/user-guide/cortex-code/settings.md
section: Cortex Code
---

# Cortex Code CLI Settings

Cortex Code CLI settings control tool permissions, connections, and session behavior. You can
configure settings using managed policy (if provided by your organization), configuration files,
environment variables, and command-line arguments.

## Configuration files

The following configuration files are used by Cortex Code CLI:

| File | Purpose |
| --- | --- |
| `<admin-managed path>/managed-settings.json` | Organization-managed policy file (optional). For OS-specific locations, see Managed settings (organization policy). |
| `~/.snowflake/cortex/settings.json` | Main Cortex Code CLI settings file. |
| `~/.snowflake/cortex/permissions.json` | Permission preferences. |
| `~/.snowflake/cortex/mcp.json` | MCP server configuration (see [Model Context Protocol (MCP)](extensibility.md)). |
| `~/.snowflake/config.toml` | Snowflake connections (see [Cortex Code CLI](cortex-code-cli.md)). Shared with Snowflake CLI. |

The full layout of the main configuration directory is:

```text
~/.snowflake/cortex/        # Main Cortex Code CLI config directory
├── settings.json          # Main settings
├── mcp.json               # MCP server configs
├── permissions.json       # Saved permissions
├── hooks.json             # Global hooks
├── history                # Command history
├── conversations/         # Session files
├── cache/                 # Temporary cache
│   ├── table_cache.json   # SQL result metadata
│   └── sql_result_cache/  # Parquet files
├── logs/                  # Log files
├── memory/                # Persistent memory
├── agents/                # Custom agents
├── skills/                # Global skills
├── commands/              # Custom commands
├── hooks/                 # Hook scripts
└── remote_cache/          # Cloned repos
```

### Settings precedence

Settings are applied in the following order of precedence (highest to lowest):

1. Managed settings (system-managed policy file, if present). See Managed settings (organization policy).
2. In-session commands (`/plan`, etc.)
3. Command-line arguments
4. Environment variables
5. Configuration files (`~/.snowflake/cortex/`)
6. Default values embedded in the Cortex Code CLI

### `settings.json`

`~/.snowflake/cortex/settings.json`
:   Main settings file for Cortex Code CLI.

Example content:

```json
{
   "compactMode": true,
   "autoUpdate": true,
   "theme": "dark"
}
```

The following settings are available:

* `compactMode`: Enables compact output formatting.
* `autoUpdate`: Enables automatic updates.
* `theme`: Sets the CLI theme (`light` or `dark`).

### `permissions.json`

`~/.snowflake/cortex/permissions.json`
:   Controls tool access permissions.

Example content:

```json
{
  "onlyAllow": ["read_file", "execute_sql"],
  "defaultMode": "ask",
  "dangerouslyAllowAll": false
}
```

The following settings are available:

* `onlyAllow`: List of allowed tool patterns.
* `defaultMode`: Default permission mode (`ask`, `allow`, `deny`).
* `dangerouslyAllowAll`: Allows all tools without prompts (unsafe).

### Managed settings (organization policy)

Managed settings allow IT administrators to enforce organization-wide policies for Cortex Code CLI. For example, administrators can restrict which tools or accounts can be used, enforce minimum CLI versions, and disable bypass capabilities.

These settings are typically deployed through enterprise configuration management tools (such as MDM or SCCM). Users generally cannot modify managed settings unless they have administrator/root privileges.

#### File locations

The managed settings file is stored at a system-level path:

| Platform | Path |
| --- | --- |
| macOS | `/Library/Application Support/Cortex/managed-settings.json` |
| Linux and WSL | `/etc/cortex/managed-settings.json` |
| Windows | `%ProgramData%\Cortex\managed-settings.json` |

#### Configuration schema

The managed settings file uses JSON with the following structure:

```json
{
  "version": "1.0",
  "permissions": { },
  "settings": { },
  "required": { },
  "defaults": { },
  "ui": { }
}
```

#### Permissions

The `permissions` section can restrict what users can access. For example, you can allow or deny tool patterns and account patterns.

```json
{
  "permissions": {
    "onlyAllow": ["pattern1", "pattern2"],
    "deny": ["pattern3"],
    "defaultMode": "allow",
    "dangerouslyAllowAll": false
  }
}
```

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `onlyAllow` | `string[]` | — | Allowlist of patterns. If set, only matching items are allowed. |
| `deny` | `string[]` | — | Denylist of patterns. Deny takes precedence over allow. |
| `defaultMode` | `"allow"` or `"deny"` | `"deny"` | Behavior when no rule matches. |
| `dangerouslyAllowAll` | `boolean` | `false` | Controls whether bypass mode is allowed. |

#### Settings

The `settings` section enforces runtime behavior:

```json
{
  "settings": {
    "forceNoHistoryMode": true,
    "forceSandboxEnabled": true,
    "forceSandboxMode": "regular"
  }
}
```

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `forceNoHistoryMode` | `boolean` | `false` | Force no conversation history persistence. |
| `forceSandboxEnabled` | `boolean` | `false` | Force sandbox to always be enabled. |
| `forceSandboxMode` | `"regular"` or `"autoAllow"` | — | Force a specific sandbox mode. |

#### Required

The `required` section can enforce minimum versions:

```json
{
  "required": {
    "minimumVersion": "0.25.0"
  }
}
```

| Field | Type | Description |
| --- | --- | --- |
| `minimumVersion` | `string` | Minimum CLI version. Older versions display an error and exit. |

#### Defaults

The `defaults` section provides default values. Users can override these defaults only if allowed by policy.

```json
{
  "defaults": {
    "connectionName": "prod",
    "profileName": "corporate",
    "theme": "dark"
  }
}
```

| Field | Type | Description |
| --- | --- | --- |
| `connectionName` | `string` | Default Snowflake connection name. |
| `profileName` | `string` | Default profile to load. |
| `theme` | `string` | Default UI theme (for example, `dark` or `light`). |

#### UI

The `ui` section controls user interface presentation:

```json
{
  "ui": {
    "showManagedBanner": true,
    "bannerText": "[Secure] Managed by Corporate IT",
    "hideDangerousOptions": true
  }
}
```

| Field | Type | Default | Description |
| --- | --- | --- | --- |
| `showManagedBanner` | `boolean` | `false` | Display a banner indicating managed state. |
| `bannerText` | `string` | — | Custom text for the managed banner. |
| `hideDangerousOptions` | `boolean` | `false` | Hide dangerous options from help and UI. |

#### Examples

##### Basic corporate setup

Allow default functionality but disable bypass mode and show a managed banner.

```json
{
  "version": "1.0",
  "permissions": {
    "dangerouslyAllowAll": false,
    "defaultMode": "allow"
  },
  "settings": {},
  "required": {
    "minimumVersion": "0.25.0"
  },
  "ui": {
    "showManagedBanner": true,
    "bannerText": "Managed by IT"
  }
}
```

##### Restrict to specific Snowflake accounts

Only allow connections to production and staging accounts.

```json
{
  "version": "1.0",
  "permissions": {
    "dangerouslyAllowAll": false,
    "onlyAllow": [
      "account(mycompany-prod)",
      "account(mycompany-staging)"
    ],
    "defaultMode": "allow"
  }
}
```

## Environment variables

Cortex Code CLI recognizes the following configuration environment variables:

| Variable | Description |
| --- | --- |
| `SNOWFLAKE_HOME` | Overrides the default `~/.snowflake` directory. |
| `CORTEX_AGENT_MODEL` | Overrides model selection. |
| `CORTEX_ENABLE_MEMORY` | Enables the memory tool (set to `true` or `1`). |
| `COCO_DANGEROUS_MODE_REQUIRE_SQL_WRITE_PERMISSION` | Requires confirmation for SQL write operations in bypass mode. |

> **Note:**
>
> For additional permission-related environment variables, see [Security](security.md).

## Command-line overrides

Cortex Code CLI settings can be overridden via command-line arguments, which include the following:

| Example | Description |
| --- | --- |
| `cortex -c production` | Specifies the connection. |
| `cortex --workdir /path` | Sets the working directory. |
| `cortex --continue` | Continues the last session. |
| `cortex --resume <session_id>` | Resumes a specific session. |
| `cortex --plan` | Enables planning mode. |
| `cortex --dangerously-allow-all-tool-calls` | Disables permission prompts (unsafe). |

## Session storage

Conversations and settings are stored in:

| Location | Description |
| --- | --- |
| `~/.snowflake/cortex/conversations/` | Session files. |
| `~/.snowflake/cortex/permissions.json` | Permission preferences. |
| `~/.snowflake/cortex/mcp.json` | MCP configuration. |

---
title: Cortex Code CLI workflow examples
source: https://docs.snowflake.com/en/user-guide/cortex-code/workflows.md
section: Cortex Code
---

# Cortex Code CLI workflow examples

This topic provides workflow examples for common tasks to help you get the most out of Cortex Code CLI. It covers
data discovery, synthetic data generation, building dashboards, and creating Cortex Agents.

## Use cases: Data discovery and querying

This section walks through creating a synthetic dataset and performing basic analysis to generate a dashboard.

### Connect to a Snowflake account

```bash
cortex -c <your-demo-account>
```

Or connect interactively:

```text
> connect to <my demo account>
```

### Discover and explore data

Search your data catalog, understand lineage, and find relevant tables:

```text
> Find all tables related to customers that I have write access to
```

### Ensure you have the right role with the correct permissions

```text
> What privileges does my role have on this database?
```

Diagnose access issues and understand role privileges:

```text
> Why am I getting a permissions error?
```

### Generate synthetic data

Here are some examples of generating synthetic data for different use cases.

**Fraud analysis for a fintech company:**

```text
> Generate realistic looking synthetic data into <database name>. Create a table of 10000
  financial transactions where ~0.5% of them are fraudulent. Include Amount, Location,
  Merchant, and Time. Make the fraudulent ones look suspicious based on location or amount.
```

**Pharma trial data:**

```text
> Make a dummy dataset for a clinical trial of a new blood pressure medication. List 100
  patients, their age, their dosage group (Placebo vs. 10mg), and their blood pressure
  readings over 4 weeks.
```

**Customer churn data:**

```text
> Create a customer churn dataset for a telecom company showing customer usage for 100000
  customers. Include basic demographic data such as fake names, phone numbers, US city and
  state. Also include data usage (GB), call minutes, contract length, and whether they
  cancelled their service (churn). Ensure there's a customer_id column that's unique.
  Create the data locally and then upload it to Snowflake.
```

### Perform basic queries against this data

```text
> Calculate the Churn Rate grouped by state and contract length. Order the results by the
  highest churn rate first so I can see the most risky regions and contract types.
```

```text
> I want to identify the heaviest data users who are also churning.
```

### Build interactive dashboards

Create and deploy Streamlit apps with charts, filters, and interactivity.

> **Tip:**
>
> Open an example dashboard you like (or find one online) and copy it to your clipboard.
> You can paste images directly into Cortex Code (Ctrl+V) as design references.

```text
> Build an interactive Streamlit dashboard on this data with state filters and use the
  conversation so far for examples of the kinds of charts to show. Use the attached image
  as a template for visuals and branding.
```

Once you’ve verified that the dashboard is working and looks good, upload it to Snowflake:

```text
> Ensure that the Streamlit app will work with Snowflake and upload it to Snowflake.
  Give me a link to access the dashboard when it's done.
```

Congratulations! You should now have a working Streamlit dashboard that displays the dataset you created.

## Use cases: Building Cortex Agents

This section walks through creating a Cortex Agent to answer questions about your data in Snowflake Intelligence.
We’ll augment the existing synthetic data with customer call transcripts.

### Create a Semantic View for Cortex Analyst

Create a semantic view so you can use Cortex Analyst with your data. Use the defaults for all the questions it asks:

```text
> Write a Semantic View named DEMO_TELECOM_CHURN_ANALYTICS for Cortex Analyst based on
  this data. Use the semantic-view optimization skill.
```

### Create a Cortex Search service

First, generate synthetic data containing customer service calls:

```text
> Generate a new table called customer_call_logs. Generate 50 realistic customer service
  transcripts (2-3 sentences each) as PDF files. Some should be angry complaints about
  coverage, others should be questions about billing. Then use the AI_PARSE_DOCUMENT
  function to extract the text and layout information from the PDFs into the TRANSCRIPT_TEXT
  column. Split text into chunks for better search quality.
```

Then create a Cortex Search service that indexes the transcripts:

```text
> Create a Cortex Search Service named CALL_LOGS_SEARCH that indexes these transcripts.
  It should index the TRANSCRIPT_TEXT column and filter by CUSTOMER_ID.
```

### Create a Cortex Agent

Build a Cortex Agent that uses both the Analyst and Search services:

```text
> Build a Cortex Agent that has access to two tools:
  - cortex_analyst: For querying the TELECOM_CUSTOMERS SQL table.
  - cortex_search: For searching the CALL_LOGS_SEARCH service.

  Write a system prompt for this agent:
  - Persona: You are a Senior Retention Specialist.
  - Routing Logic: If the user asks for 'metrics', 'counts', or 'averages', use the
    Analyst tool. If the user asks for 'sentiment', 'reasons', or 'summaries of calls',
    use the Search tool.
  - Output Format: Always verify the customer ID before answering. If the risk score is
    high, end the response with a recommended retention offer (e.g., 'Offer 10% discount').
  - Constraint: Never reveal the raw CHURN_RISK_SCORE to the user; interpret it as 'Low',
    'Medium', or 'High'.
```

### Deploy to Snowflake Intelligence

Deploy the agent to Snowflake Intelligence:

```text
> Let's deploy this agent to Snowflake Intelligence.
```

Congratulations! You have successfully created and deployed a Snowflake Intelligence agent.

You should now be able to access this agent in Snowflake Intelligence and ask it questions like:

* “What are customers complaining about in their calls?”
* “Show me high-risk customers with monthly charges over $100”

## See also

[Cortex Code CLI](cortex-code-cli.md)
:   Get started with installation and first prompts

[Skills](extensibility.md)
:   Specialized skills for semantic models, agents, and documents

[Cortex Analyst](../snowflake-cortex/cortex-analyst.md)
:   Cortex Analyst documentation

---
title: Cortex Code in Snowsight
source: https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code-snowsight.md
section: Cortex Code
---

# Cortex Code in Snowsight

## Overview

Cortex Code provides an agentic experience across several functional areas within Snowsight. It is designed to assist data analysts,
engineers, and administrators with tasks such as SQL development, data exploration, and account management by deeply integrating into the Snowsight
interface and offering capabilities such as diff views.

Cortex Code uses intelligent orchestration to plan and execute multi-step tasks based on your request. In addition, it selects internal tools and relevant
context from your Snowflake environment to complete the task, ensuring that each response is accurate.

The assistant follows an agentic workflow and interprets your intent, creates a plan of action, and executes the steps while maintaining context
across the session.

Cortex Code understands roles, privileges, schemas, and SQL syntax, and applies Snowflake best practices when it is generating or modifying code.

To use Cortex Code in Snowsight, follow these steps:

1. Select the Cortex Code icon  in the lower-right corner. The Cortex Code panel opens on the right side of Snowsight.
2. In the message box, type in your question and then select the send icon or press `Enter` to submit it. Cortex Code provides a response in the panel.

   If the response from Cortex Code includes SQL statements, you can execute the statements or copy them to your clipboard.

### Access control requirements

A [role](../security-access-control-overview.md) used to access Cortex Code must have the following
database roles granted:

| Database Role | Notes |
| --- | --- |
| SNOWFLAKE.COPILOT_USER | Required for all users to access Cortex Code. |
| SNOWFLAKE.CORTEX_USER **or** SNOWFLAKE.CORTEX_AGENT_USER | At least one of these database roles is required. SNOWFLAKE.CORTEX_AGENT_USER provides additional capabilities for agentic workflows. |

For instructions on granting database roles, see [GRANT DATABASE ROLE](../../sql-reference/sql/grant-database-role.md).

For general information about roles and access control, see [Overview of Access Control](../security-access-control-overview.md).

> **Note:**
>
> If your account previously opted out of (or disabled) Snowflake Copilot (legacy), Cortex Code will also be disabled. Contact your account
> team to enable this feature for your account.

## Use cases and benefits

Cortex Code in Snowsight acts as an intelligent agent, helping you work more efficiently by translating natural language
instructions into executable actions. By maintaining awareness of your workspace context and Snowflake account configuration, it assists with development, exploration, and
administration tasks without requiring you to leave Snowsight.

Cortex Code supports the following key functional areas within Snowsight:

### Agentic coding in Workspaces

Cortex Code operates as a conversational coding assistant integrated within Workspaces. It supports interactive code generation, modification,
review, and explanation.

* **Code generation and development:** Generate SQL queries, create new files, and construct logic for data pipelines and analytics workflows.
* **Code modification and optimization:** Refine SQL directly in a workspace, identify logic or syntax errors, and suggest optimizations for performance, readability, or cost.
* **Change review:** Preview AI-suggested changes using a diff view before applying them. The diff view highlights insertions and deletions, allowing users to maintain control over their code.
* **Code explanation:** Request an explanation of existing SQL to assist with understanding or collaboration.
* **Ask follow-up questions:** Continue the conversation by asking clarifying questions or requesting further analysis on generated code or results.
* **Inline catalog context:** Type `@` in the message box to trigger a real-time search for catalog objects (such as tables, schemas, and views) and add them as context for your prompt.
* **Quick actions from highlighted SQL:** In a SQL file, highlight text to open quick actions such as Quick Edit, Format, Add to Chat, and Explain.
* **Fix SQL errors:** If a SQL statement fails, use the Fix button in the results grid to get suggested fixes for the error.
* **AI-powered code suggestions (currently in Preview):** As you type in a SQL file, Cortex Code displays context-aware inline suggestions to improve development speed and accuracy.

### AI code suggestions

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

Cortex Code provides intelligent, context-aware inline code suggestions for SQL in Snowflake. As you type in a SQL file, Cortex Code predicts
and suggests the next portion of your SQL statement, displayed as gray text at your cursor position.

Cortex Code uses your query history, the content of the current workspace, table schemas, and the last few executed queries from the current
workspace to match your working pattern and generate suggestions.

Suggestions are triggered automatically after you briefly pause while typing, or immediately after you accept a previous suggestion.

When interacting with a suggestion, you can perform the following actions:

* To accept a suggestion, press `Shift` + `Enter`.
* To dismiss a suggestion, press `Esc`, `Delete`, or `Backspace`, or continue typing.

When catalog suggestions appear alongside an inline suggestion, press `Shift` + `Enter` to accept the inline suggestion, or press the
down arrow and then `Enter` to select a catalog option instead.

> **Note:**
>
> AI code suggestions can occasionally be incorrect or not match your intent. If a suggestion is not relevant, dismiss it and continue typing to
> provide more context.

To disable AI code suggestions, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. From a SQL file, select the Settings icon in the upper-right corner.
4. Select User preferences.
5. Select the AI code suggestions toggle to disable the feature.
6. Select Close.

### Intelligent product and documentation discovery

Cortex Code uses context from the Horizon Catalog and Snowflake documentation to help you locate data assets and reference information without leaving your workspace.

* **Natural language schema search:** Locate database objects such as tables and columns using plain language queries, without needing to know exact object names.
* **Integrated Q & A:** Retrieve answers about Snowflake features, SQL syntax, or best practices based on official documentation.
* **Snowflake Marketplace discovery:** If your prompt references Snowflake Marketplace, Cortex Code will search and return listings from the Snowflake Marketplace.

When available, responses can include relevant context such as tags, masking policies, and lineage to help you validate the data assets you discover.

### Simplified account administration

Cortex Code supports account administration by providing contextual information about governance, security, and cost management.

* **Governance and security:** Retrieve information about user and role access, data ownership, and tables containing personally identifiable information (PII).
* **Cost management:** Query account usage and credit consumption, and identify high-cost warehouses or queries.

## Supported models and regions

Cortex Code supports the following models. You can use these models as long as the account has access to them. For more information, see [Control model access](../snowflake-cortex/aisql.md).

* Recommended: Claude Opus 4.6 (`claude-opus-4-6`)
* Claude Opus 4.5 (`claude-opus-4-5`)
* Claude Sonnet 4.5 (`claude-sonnet-4-5`)
* Claude Sonnet 4.0 (`claude-4-sonnet`)

While the listed models may not be available in [all regions](../snowflake-cortex/aisql.md), you can use Cortex Code in any cloud or region by using Cortex Cross-region inference. This includes clouds and regions where the models are not available. For more information, see [Cross-region inference](../snowflake-cortex/cross-region-inference.md).

> **Important:**
>
> **Cross-region inference is required when the selected model is not available in your region.** If inference fails with a model availability error, configure cross-region inference:
>
> * **AWS US** - Claude Opus 4.6 offers the highest quality. Set up Cortex Cross-region inference for `AWS_US` to access Claude Opus 4.6 models.
> * **AWS EU** - Set up Cortex Cross-region inference for `AWS_EU` to access Claude models.
> * **AWS APJ** - Set up Cortex Cross-region inference for `AWS_APJ` to access Claude models.
> * **Any region** - Set up Cortex Cross-region inference for `ANY_REGION` to access all models.
>
> To enable cross-region inference, an ACCOUNTADMIN must run:
>
> ```sqlexample
> ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'AWS_US';
> ```
>
> Replace `AWS_US` with the appropriate region identifier (`AWS_US`, `AWS_EU`, `AWS_APJ`, `ANY_REGION`).

> **Note:**
>
> Model access may also be restricted by your organization. If you cannot access a model even after enabling cross-region inference, verify that the model is enabled in your account’s AI model access settings. See [Control model access](../snowflake-cortex/aisql.md) for details.

Cortex Code requires the user to have the SNOWFLAKE.COPILOT_USER database role and either the SNOWFLAKE.CORTEX_USER or SNOWFLAKE.CORTEX_AGENT_USER database role.

> **Note:**
>
> If your account previously opted out of (or disabled) Snowflake Copilot (legacy), Cortex Code will also be disabled. Contact your account
> team to enable this feature for your account.

## Web search

An ACCOUNTADMIN role can configure Cortex Code CLI to search the web, and use the results in generating responses and
planning tasks. To properly enable web search in an account, follow these steps:

1. Navigate to AI/ML > Agents.
2. Select Settings.
3. Select the toggle next to Web search to enable the feature, as shown below.

Snowflake will process your inputs according to the [Snowflake Privacy Notice](https://www.snowflake.com/en/legal/privacy/privacy-policy/#2) (§2).
Web search may not be used for the purpose of redistributing or creating a competing web search service.

## Example prompts

You can interact with Cortex Code using natural language prompts. In your prompts, provide the context needed to generate accurate results (for
example, the database, schema, and the objects you want to work with). For the most reliable results across environments, use fully qualified object names.

The following examples show typical ways to request code generation, optimization, and administrative insights.

**Access and permissions**

| Use case | Example prompt |
| --- | --- |
| Access discovery | “What databases do I have access to?” |
| Security auditing | “Find all tables that have PII in them.” |

**Data discovery**

| Use case | Example prompt |
| --- | --- |
| Tag discovery | “List every table tagged PII = TRUE in ANALYTICS_DB.” |
| Lineage and tagging | “Show the lineage from RAW_DB.ORDERS to downstream dashboards.” |
| Metadata search | “Where can I find tables related to customer churn and subscription status?” |

**SQL development and optimization**

| Use case | Example prompt |
| --- | --- |
| Logic explanation | “What does this SQL script do?” |
| Generation | “Write a query for top 10 customers by revenue and a 7-day moving average.” |
| Query refinement | “Update the top performers query to show the top 100.” |
| Performance optimization | “Explain why this query is slow and optimize it.” |
| Data synthesis | “Generate synthetic data for 30 days of sales for an e-commerce site in the SAMPLESDATA.SALES table.” |

**Infrastructure and cost management**

| Use case | Example prompt |
| --- | --- |
| Resource monitoring | “Which 5 service types are using the most credits? Show me a visualization and how to reduce costs.” |

**Machine learning and engineering pipelines**

| Use case | Example prompt |
| --- | --- |
| Notebooks (EDA and machine learning) | “Build me a notebook for a customer churn prediction use case using pandas for data handling, matplotlib and seaborn for EDA and visualization, and scikit-learn for preprocessing, model training (logistic regression and a tree-based model), evaluation, and interpretation, with clear markdown explaining business impact and results.” |
| Deep learning | “Create a new notebook and build a CNN for the MNIST dataset.” |
| Pipeline engineering | “Create a dbt project to transform raw sales data.” |

**Semantic model integration (Cortex Analyst)**

| Use case | Example prompt |
| --- | --- |
| Semantic queries | “Use the @models/revenue.yaml semantic model to answer "What was revenue last month?"” |
| Model debugging | “Identify errors in my semantic model at @models/revenue.yaml” |

## Security and access

Cortex Code operates within your Snowflake account’s existing authentication and role-based access controls (RBAC). It does not store or
modify your credentials and only performs actions permitted by the active role.

Cortex Code always starts a session using your default role, regardless of the role you’ve selected
in Snowsight worksheets, workspaces, or the role selector in the lower-left corner of the UI. If you need to perform actions
that require a different role, you can ask Cortex Code to change roles during the session (for example, “switch to the SYSADMIN role”).

> **Note:**
>
> If Cortex Code returns a permissions error, verify that your default role has the required privileges. You can either change your
> default role using [ALTER USER](../../sql-reference/sql/alter-user.md), or ask Cortex Code to use a specific role for the current session.

## Cortex Code in Workspaces

You can access Cortex Code through the assistant panel integrated into Snowsight. Cortex Code processes requests in the context of
the active code or environment, or general Snowflake knowledge.

To use the Cortex Code agent in Workspaces:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Open a workspace containing the relevant file (for example, an existing SQL file).
4. Select the Cortex Code icon at the bottom-right of the workspace.
5. Enter a prompt or ask a question using natural language. Type `@` to search for and add catalog objects (such as tables, schemas, or views) as inline context. See Example prompts for ideas.
6. Review the output. Cortex Code provides an answer, suggested code, or a modified query.
7. For coding tasks, Cortex Code may display a comparison view highlighting insertions and deletions. Review the suggested changes and apply them directly to the script.
8. Use subsequent prompts to refine the code, convert the file to a different object type (like a notebook or semantic view), or integrate advanced functions like AI SQL.

### Customize Cortex Code in Workspaces with AGENTS.md and Agent Skills

[AGENTS.md](http://Agents.md) is a simple, open format for guiding coding agents.

Create an AGENTS.md file to provide persistent instructions that Cortex Code will automatically include in every conversation. Copy it to
the root directory of your workspace for personalized instructions that apply to conversations with Cortex Code about your project.

Support for [Agent Skills](https://agentskills.io/) will be available soon.

## Skills

Skills extend Cortex Code with specialized capabilities that can be invoked by typing `/` in the message box.

### Built-in skills

Snowflake provides built-in skills that are available from any page in Snowsight. Type `/` to see and select from the available skills. The list of built-in skills evolves as feature teams add new skills to Snowsight.

### Personal skills

You can create your own skills in a workspace to tailor Cortex Code to your specific workflows.

To add a personal skill, use any of the following options in the workspace:

* Upload Skill File(s)
* Upload Skill Folder(s)
* + Create Skill

Personal skills are stored in the `.snowflake/cortex/skills` directory of the workspace and can be invoked by typing `/` in the message box.

> **Note:**
>
> Personal skills can only be accessed from the workspace where they were created. They are not available when using a different workspace or when outside of a workspace.

## Cortex Code in Notebooks

Leveraging Cortex Code helps you explore data, write and edit queries and code, visualize insights, and explain results seamlessly in
[Notebooks in Workspaces](../ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md), accelerating end-to-end data science and machine learning development.

Cortex Code in Notebooks can:

* Create and manage notebooks in the Workspaces directory
* Add, remove, and reorder SQL, Python, and Markdown cells
* Edit code using up-to-date pre-installed packages and proper notebooks syntax (for example, cell referencing)
* Generate code for visualizing data using matplotlib, seaborn, plotly, and altair
* Run an entire notebook or specific cells

Try out these example prompts.

## Cortex Code agent for dbt Projects on Snowflake

Cortex Code supports transformation workflows that span the full dbt lifecycle:

* Explore raw source data and infer relationships
* Scaffold staging and intermediate models
* Build multi-model DAGs and metrics
* Add data quality tests and incremental logic
* Run dbt commands
* Generate and maintain project documentation

Using natural language prompts, the Cortex Code agent helps you explore data, author dbt models, add tests, optimize performance, and generate
documentation through iterative feedback.

It reduces day-to-day data engineering work by automating boilerplate SQL, dependency management, testing, and documentation, while preserving
control over project structure and logic.

### Example prompts for dbt Projects

The Cortex Code agent supports both new and experienced dbt users. New users can explore newly onboarded Bronze-layer data, infer schemas, and
scaffold staging models to establish a clean foundation. Experienced users can build complex data marts with incremental fact models, robust
testing, and auto-generated documentation, while iterating quickly through validation cycles.

The following scenarios illustrate common ways to use Cortex Code with dbt Projects.

| Use case | Context | Example prompt |
| --- | --- | --- |
| Explore sources | Understand raw data schemas and relationships before modeling. | “List all source tables in the bronze layer and summarize key columns, data types, and likely primary keys. Propose staging models for each source.” |
| Prototyping | Creating multi-model logic and DAGs. | “Create models to compute daily profitability by truck and location. Generate the DAG and propose dependencies.” |
| Data Quality | Adding tests to `schema.yml`. | “Add not_null and accepted_values tests to key dimensions. Suggest uniqueness tests for IDs based on inferred keys.” |
| Incremental Logic | Optimizing model performance. | “Convert the main fact model to an incremental model partitioned by order_date, with merge behavior for late-arriving data.” |
| Documentation | Reducing maintenance overhead. | “Generate docs for the project and draft descriptions for new models and key columns based on source context.” |

## Cortex Code, Snowflake Intelligence, and legacy Copilot

While Cortex Code supports a broad range of coding and administrative tasks, it is distinct from standalone coding agents and other specialized
AI systems within Snowflake.

The following table summarizes key differences between Cortex Code, Snowflake Intelligence, and the [legacy Copilot experience](../snowflake-copilot.md).

| **Feature** | **Cortex Code** | **Snowflake Intelligence** | **Snowflake Copilot (legacy)** |
| --- | --- | --- | --- |
| Use case | Supports development and operational workflows in Snowflake, including authoring SQL, exploring data assets, and performing administrative tasks. | Provides a natural language interface for asking complex questions about data and receiving analysis-focused responses. | Previous iteration of Cortex Code for documentation help and basic SQL assistance. |
| Primary integration | Integrated directly into Snowsight and Workspaces. Provides context-aware assistance within the active workspace. | Accessed through the Snowflake Intelligence UI and Cortex Agents API, enabling natural language interaction for insights and recommendations. | Separate copilot for SQL and UI assistance. |
| Scope of tasks | Supports SQL authoring, data exploration, documentation search, and account administration. | Focuses on question answering, data insights, and analysis-driven responses. | Limited SQL and UI assistance. |
| Key capabilities | Generates and modifies SQL code, reviews changes using a diff view, and explains existing code. | Analyzes data, generates summaries, and assists with natural language interactions. | Contextual SQL suggestions and limited help features. |
| Design focus | Provides a unified AI interface across coding, documentation, and administrative workflows. | Delivers conversational insights and query assistance for data understanding. | Deprecated in favor of Cortex Code. |

## Legal notices

Where your configuration of Cortex Code uses a model provided on the
[Model and Service Pass-Through Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/model-pass-through-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: Cost controls for Cortex Code
source: https://docs.snowflake.com/en/user-guide/cortex-code/credit-usage-limit.md
section: Cortex Code
---

# Cost controls for Cortex Code

Account administrators can set daily estimated credit usage limits for Cortex Code on a per-user basis. These
limits help organizations control Cortex Code consumption by blocking access when a user’s estimated credit usage
in a rolling 24-hour window exceeds the configured threshold.

There are separate parameters for each Cortex Code surface:

| Parameter | Controls |
| --- | --- |
| `CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER` | Cortex Code CLI usage |
| `CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER` | Cortex Code in Snowsight usage |

## How credit limits work

Each parameter tracks the corresponding user’s estimated credit usage over a rolling 24-hour window. When
a user’s estimated usage reaches the configured limit for a given surface, access is blocked for that surface
until usage drops below the threshold.

Both parameters share the same behavior:

| Value | Behavior |
| --- | --- |
| `-1` (default) | No limit. The user has unlimited access. |
| `0` | Access is blocked entirely for the user. |
| Positive number | Access is blocked when the user’s estimated credit usage in the past 24 hours exceeds this value. |

Each parameter can be set at the account level (applies to all users) or at the user level (applies to a
specific user). A user-level setting overrides the account-level setting for that user.

> **Note:**
>
> Only users with the `ACCOUNTADMIN` role (or a role with sufficient privileges to modify the account or user object)
> can set these parameters.

## Cortex Code CLI limits

The `CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER` parameter controls the daily credit limit for
Cortex Code CLI usage.

### Account level

To set a daily credit limit for all users in the account:

```sqlexample
-- Set the daily credit usage limit to 20 credits for all users in the account
ALTER ACCOUNT SET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER = 20;
```

To remove the account-level limit and restore the default (unlimited):

```sqlexample
-- Remove the account-level limit (restores default unlimited usage)
ALTER ACCOUNT UNSET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER;
```

### User level

To set a daily credit limit for a specific user, overriding the account-level setting:

```sqlexample
-- Set a per-user CLI limit that overrides the account-level setting
ALTER USER jsmith SET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER = 10;
```

To remove the user-level limit, so the account-level setting applies instead:

```sqlexample
-- Remove the user-level override (account-level setting applies instead)
ALTER USER jsmith UNSET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER;
```

## Cortex Code in Snowsight limits

The `CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER` parameter controls the daily credit limit for
Cortex Code usage within the Snowsight web interface.

### Account level

To set a daily credit limit for all users in the account:

```sqlexample
-- Set the daily Snowsight credit usage limit to 20 credits for all users in the account
ALTER ACCOUNT SET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER = 20;
```

To remove the account-level limit and restore the default (unlimited):

```sqlexample
-- Remove the account-level Snowsight limit (restores default unlimited usage)
ALTER ACCOUNT UNSET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER;
```

### User level

To set a daily credit limit for a specific user, overriding the account-level setting:

```sqlexample
-- Set a per-user Snowsight limit that overrides the account-level setting
ALTER USER jsmith SET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER = 10;
```

To remove the user-level limit, so the account-level setting applies instead:

```sqlexample
-- Remove the user-level Snowsight override (account-level setting applies instead)
ALTER USER jsmith UNSET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER;
```

## When a limit is reached

When a user’s estimated credit usage exceeds the configured limit for a surface, that surface returns an error
indicating that the daily credit limit has been reached. The user cannot use that surface until sufficient time has
passed for the rolling 24-hour usage to drop below the limit. Other surfaces with separate limits are not affected.

Administrators can adjust or remove the limit at any time to restore access.

## Listing users with custom limits

The following SQL script lists all users who have a per-user credit limit override for the CLI parameter. This is
useful for administrators who want to audit which users have custom limits set at the user level.

```sqlexample
-- List all users who have a per-user CLI credit limit override
EXECUTE IMMEDIATE $$
DECLARE
  current_user STRING;
  rs_users RESULTSET;
  res      RESULTSET;
BEGIN
  CREATE OR REPLACE TEMPORARY TABLE _param_overrides (user_name STRING, param_value STRING);

  SHOW USERS;
  rs_users := (SELECT "name" FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())));

  FOR record IN rs_users DO
    current_user := record."name";

    EXECUTE IMMEDIATE
      'SHOW PARAMETERS LIKE ''CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER'' IN USER "' || :current_user || '"';

    INSERT INTO _param_overrides (user_name, param_value)
      SELECT :current_user, "value"
      FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
      WHERE "level" = 'USER';
  END FOR;

  res := (SELECT * FROM _param_overrides);
  RETURN TABLE(res);
END;
$$;
```

This script iterates over all users in the account, checks whether a user-level override is set for
`CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER`, and returns a table of users with their override values.
You can modify the parameter name in the `SHOW PARAMETERS LIKE` clause to check the Snowsight parameter instead.

## Example: Configuring limits for your organization

The following example sets default limits for all users in the account across both surfaces, then assigns a
higher limit to a power user for CLI usage:

```sqlexample
-- Set default daily limits for all users
ALTER ACCOUNT SET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER = 20;
ALTER ACCOUNT SET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER = 20;

-- Allow a specific user a higher CLI limit
ALTER USER power_user SET CORTEX_CODE_CLI_DAILY_EST_CREDIT_LIMIT_PER_USER = 50;

-- Block a specific user from Snowsight entirely
ALTER USER restricted_user SET CORTEX_CODE_SNOWSIGHT_DAILY_EST_CREDIT_LIMIT_PER_USER = 0;
```

> **Note:**
>
> When both an account-level and a user-level value are set for the same parameter, the user-level value takes
> precedence for that user. All other users in the account continue to use the account-level value.

---
title: Security best practices for Cortex Code CLI
source: https://docs.snowflake.com/en/user-guide/cortex-code/security.md
section: Cortex Code
---

# Security best practices for Cortex Code CLI

Essential security practices for Cortex Code CLI include using secure authentication methods, protecting configuration files, managing roles and access appropriately, handling conversation history securely, ensuring MCP server integrity, and following production safety protocols.

> **Important:**
>
> In managed environments, your organization may deploy a system-level managed settings file that enforces policy (for example, restricting tool access, limiting allowed accounts, or disabling bypass capabilities). For details, see [Managed settings (organization policy)](settings.md).

## Credentials

[Recommended] Use browser-based authentication when possible.
:   The default authentication method for Cortex Code CLI is browser-based authentication. Use `authenticator = "externalbrowser"` in your `connections.toml` file to set this option manually.

Use programmatic access tokens (PATs), when trying to scope access to a specific role.
:   Generate dedicated PATs in Snowsight (see [Using programmatic access tokens for authentication](../programmatic-access-tokens.md)). Set expiration ≤ 90 days, use descriptive names, and rotate regularly.

Protect configuration files
:   Use mode `600` for configuration files and `700` for directories to restrict access to only your user.

    ```bash
    chmod 600 ~/.snowflake/connections.toml
    chmod 700 ~/.snowflake/cortex
    ```

Never commit credentials
:   Add sensitive configuration files to `.gitignore`.

    ```bash
    echo "~/.snowflake/connections.toml" >> ~/.gitignore
    ```

    Use environment variables to hold credentials and tokens, and incorporate them in your configuration files using `${VARIABLE_NAME}` syntax.

## Roles & access

Use appropriate roles per environment
:   For example, use a read-only role in production and a more expansive role in development.

    ```toml
    [dev]
    role = "DEVELOPER"

    [prod_readonly]
    role = "ANALYST"
    ```

    Never use `ACCOUNTADMIN` for routine operations. Grant least privileges.

## Conversation history

Conversations are stored in `~/.snowflake/cortex/conversations/`. Use `cortex --private` when starting Cortex Code to disable session saving for sensitive work.
Alternatively, use the `/clear` command to clear the current session before exiting Cortex Code CLI.

Use mode 700 to restrict access to conversation history to only your user.

```bash
chmod 700 ~/.snowflake/cortex/conversations
```

## MCP security

Only install trusted MCP servers
:   Verify the source and integrity of MCP servers before adding them. Use the following commands to get a list of servers and remove any untrusted ones:

    ```bash
    cortex mcp list
    cortex mcp remove <server>
    ```

Never hardcode MCP credentials
:   Use environment variables. First, set in your shell:

    ```bash
    export GITHUB_TOKEN="your_token"
    ```

    Then reference them in your MCP configuration:

    ```json
    {
       "mcpServers": {
          "github": {
             "env": { "GITHUB_TOKEN": "${GITHUB_TOKEN}" }
          }
       }
    }
    ```

## Production safety

Enable planning mode
:   Use the `/plan` command to review intended actions before execution.

    ```text
    /plan
    Drop and recreate the ANALYTICS schema
    ```

## If your personal access token is compromised

Revoke the PAT in Snowsight immediately! Then generate a new token and start using it instead. Remember, don’t use the
token in configuration files; use environment variables instead.

Review the query history to identify any suspicious activity using the `QUERY_HISTORY` view in the `SNOWFLAKE.ACCOUNT_USAGE` schema:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE USER_NAME = '<username>'
  ORDER BY START_TIME DESC;
```

## Managed settings (enterprise policy)

In some organizations, administrators deploy managed settings that enforce policy for Cortex Code CLI. Managed settings can constrain or override user-level configuration (including permission prompts and bypass behavior).

For more information, see [Managed settings (organization policy)](settings.md).

## Permissions

Cortex Code has three operational modes:

| Mode | Indicator | Slash commands | Description |
| --- | --- | --- | --- |
| Confirm actions | Blue ⏵⏵ | Default mode | Prompts for permission before potentially dangerous actions. |
| Plan | Orange ⏸ | `/plan`, `/plan-off` | Presents a plan before taking any action. |
| Bypass | Red >> | `/bypass`, `/bypass-off` | All tool calls are approved. |

Press `Shift-Tab` in Cortex Code CLI to cycle among these modes.

> **Warning:**
>
> The Bypass mode disables all confirmation prompts. Use it only in trusted environments.

### Permission types

The following permission levels apply to Cortex Code tool calls:

| Type | Description |
| --- | --- |
| EXECUTE_COMMAND | Run bash/shell commands |
| FILE_READ | Read file contents |
| FILE_WRITE | Create/modify files |
| FILE_EDIT | Edit existing files |
| WEB_ACCESS | Web search/fetch operations |

### Trust model

Support for this feature is experimental and may be subject to change.
Cortex Code makes an attempt to classify commands and operations by risk, as shown in the following table:

| Level | Examples | Behavior |
| --- | --- | --- |
| SAFE | `ls`, `cat`, `echo`, `grep` | Auto-approved |
| LOW | Create new files (e.g., `touch file.txt`) | Usually auto-approved |
| MEDIUM | Edit files (e.g., `nano file.txt`), moderate bash | Prompts in Confirm mode |
| HIGH | `rm`, `curl`, `wget`, `sudo` | Always prompts |
| CRITICAL | `rm -rf`, destructive ops | Extra confirmation |

#### SQL queries

SQL is categorized by operation type:

| Category | Operations | Behavior |
| --- | --- | --- |
| READ_ONLY | SELECT, SHOW, DESCRIBE | Auto-approved |
| WRITE | INSERT, UPDATE, DELETE, CREATE | Prompts |
| USE_ROLE | USE ROLE, USE WAREHOUSE | Prompts |

### Sandbox

Cortex Code CLI supports sandboxing to isolate command execution. For full details on configuring
and using the sandbox, see [Sandbox](sandbox.md).

### Hook integration

You can customize permission policy using hooks. Here is an example pre-execution hook that approves auto-approves bash commands:

```text
{
   "hooks": {
      "PreToolUse": [
         {
         "matcher": "Bash",
         "hooks": [
            {
               "type": "command",
               "command": "bash .claude/hooks/auto-approve.sh"
            }
         ]
         }
      ]
   }
}
```

This hook might return a JSON response like the following to auto-approve bash commands.

```json
{
   "hookSpecificOutput": {
      "hookEventName": "PreToolUse",
      "permissionDecision": "allow",
      "permissionDecisionReason": "Approved by policy"
   }
}
```

### Permission prompts and caching

When Cortex Code requires your permission to proceed with an operation, it prompts you with details about the request.
You can choose to approve or deny the request. You can also opt to remember your choice for future similar requests:

* “Always allow (this session)” remembers until you exit Cortex Code CLI.
* “Always allow (persist)” remembers indefinitely.

These responses are cached and scoped to the project directory, the tool type, or the command pattern as appropriate.

Persistent permissions are stored in `~/.snowflake/cortex/permissions.json`. The following is an example cache:

```json
{
   "/path/to/project": {
      "Bash": {
         "npm test": "allow",
         "make build": "allow"
      },
      "Write": {
         "*": "allow"
      }
   }
}
```

Delete this file to reset all persistent permissions. To reset permissions for a specific project, delete the corresponding entry.

To reset the session cache, use the `/new` command, which begins a new session, or exit and re-start Cortex Code CLI.

### Configuration

Set the environment variables described below to control permission behavior:

| Variable | Description |
| --- | --- |
| `CORTEX_PERMISSION_CACHE_TTL_SECONDS` | Sets the default timeout for session permission cache (in seconds). |
| `COCO_DANGEROUS_MODE_REQUIRE_SQL_WRITE_PERMISSION=true` | If set to `1`, always prompt for SQL write operations, even in bypass mode |

## Security Checklist

* Use PATs with at most a 90 day expiration
* Set file permissions to 600/700
* Never commit credentials to git
* Use least privilege roles
* Never use ACCOUNTADMIN for routine work
* Enable planning mode for production and reserve bypass mode for trusted environments
* Only install trusted MCP servers
* Store credentials in environment variables
* Use hooks to enforce policies by automating custom security checks
* Periodically audit permissions

---
title: Using Apache Airflow™ with Cortex Code CLI
source: https://docs.snowflake.com/en/user-guide/cortex-code/airflow.md
section: Cortex Code
---

# Using Apache Airflow™ with Cortex Code CLI

Cortex Code provides built-in support for Apache Airflow™, providing a natural language interface to manage DAGs, debug
failures, author pipelines, analyze data, and track lineage across your Airflow deployments.

## Capabilities

| Capability | Description | Example Prompt |
| --- | --- | --- |
| Pipeline Monitoring | Health checks, DAG inspection, connection and variable visibility, scheduling control | “Is my Airflow instance healthy?” |
| Run Management | Trigger DAGs on demand, wait for results, pass custom configuration | “Test the daily_etl DAG and let me know when it finishes” |
| Failure Debugging | Root cause analysis across run state, task instances, and logs with impact assessment and fix recommendations | “Why did my_pipeline fail last night?” |
| DAG Authoring | Guided DAG creation using your existing patterns, connections, and providers with a discover-plan-implement-validate-test workflow | “Create a DAG that extracts from Snowflake and loads to S3 daily” |
| Data Analysis | Warehouse queries, table profiling, and freshness checks with pattern caching and concept-to-table learning | “How many active customers do we have this quarter?” |
| Data Lineage | Upstream origin tracing and downstream impact analysis through DAG source code with criticality ratings | “What would break if I change the customers table schema?” |
| Airflow 3 Migration | Automated code migration with Ruff rules, import fixes, context key replacements, and metadata access pattern updates | “Migrate my DAGs from Airflow 2 to Airflow 3” |
| dbt Integration | Run dbt Core or Fusion projects as Airflow DAGs via Astronomer Cosmos with parsing, execution, and profile configuration | “Set up my dbt project to run in Airflow using Cosmos” |
| Human-in-the-Loop | Approval gates, form inputs, and human-driven branching in DAGs (Airflow 3.1+) | “Add an approval step before the deploy task” |
| Local Environments | Start, stop, restart, and troubleshoot local Airflow environments with the Astro CLI | “Start my local Airflow environment” |

## Prerequisites

Cortex Code’s Airflow integration requires [uv](https://docs.astral.sh/uv/getting-started/installation/). If `uv` is
not installed, `cortex airflow` provides a helpful message with the install link.

## Setting up Airflow integration

Before you can manage your Airflow instance with Cortex Code, you must configure a connection. You can do this using
environment variables, or inside Cortex Code CLI with an interactive setup command.

Environment variable setup
:   Export the required variables in your shell before starting Cortex Code, as follows. You can use either token-based
    authentication or username/password authentication. If you always use the same Airflow instance, include code like
    this in your shell profile (`~/.bashrc` or `~/.zshrc`) to avoid having to re-enter it every time.

    ```bash
    # Token auth
    export AIRFLOW_API_URL=https://airflow.example.com
    export AIRFLOW_AUTH_TOKEN=your-api-token

    # Username/password auth
    export AIRFLOW_API_URL=https://airflow.example.com
    export AIRFLOW_USERNAME=your-username
    export AIRFLOW_PASSWORD=your-password
    ```

Interactive setup
:   Issue `/airflow` in Cortex Code to manage instances through a full screen UI. Both token and username/password authentication are supported.

    | Command | Description |
    | --- | --- |
    | `/airflow` | Manage Airflow instances (opens instance manager) |
    | `/airflow show` | Show current configuration (secrets are masked) |
    | `/airflow clear` | Remove all configuration |

    `/airflow` supports multiple named instances. Use the instance manager to add, switch between, or remove them.

## Airflow CLI commands

Use `cortex airflow` to interact with your Airflow instance from the terminal, as shown in the examples below.

Check instance health:

```bash
cortex airflow health
```

List all DAGs:

```bash
cortex airflow dags list
```

Get details on a specific DAG:

```bash
cortex airflow dags get my_pipeline
```

View DAG source code:

```bash
cortex airflow dags source my_pipeline
```

Trigger a DAG run:

```bash
cortex airflow runs trigger my_pipeline
```

List recent runs for a DAG:

```bash
cortex airflow runs list my_pipeline
```

Check task status for a specific run:

```bash
cortex airflow tasks list my_pipeline <run_id>
```

Pause or unpause a DAG:

```bash
cortex airflow dags pause my_pipeline
cortex airflow dags unpause my_pipeline
```

Issue `cortex airflow --help` for the full list of commands.

## Troubleshooting

Connection refused
:   **Symptom:** Airflow operations fail with connection errors.

    **Solution:** Verify your instance URL is correct and that the Airflow API is reachable. Check your current instance configuration and test connectivity with a health check.

Authentication failures
:   **Symptom:** Operations return 401 or 403 errors.

    **Solution:** Try the following steps:

    * Make sure that your token or credentials are correct.
    * Check to see if the token has expired; regenerate it if necessary.
    * Make sure the user and role have API access permissions in Airflow.

DAG not found
:   **Symptom:** Operations report that the DAG doesn’t exist.

    **Solution:** Check for import or parse errors that might be preventing the DAG from loading. Make sure the DAG ID matches exactly.

`uv` not installed
:   **Symptom:** `cortex airflow` displays “cortex airflow requires uv”.

    **Solution:** Install `uv` from [the uv site](https://docs.astral.sh/uv/getting-started/installation/).

## Clean Rooms

Snowflake Data Clean Rooms for privacy-safe collaboration across organizations.

---
title: About Snowflake Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/about.md
section: Clean Rooms
---

# About Snowflake Data Clean Rooms

A Snowflake Data Clean Room is a native solution to build, connect, and use data clean rooms easily in Snowflake.

It offers a secure, multi-collaborator environment where collaborators can share data that can be queried using
templates added to the collaboration. Collaborators can review and approve or reject the inclusion of new templates or data sources by other
collaborators. Query results can be exposed directly, or activated to a collaborator’s Snowflake account, at the discretion of all
collaborators.

Data clean rooms provide a secure way to gain valuable insights while protecting sensitive information. They allow you to combine
and analyze data from different parties with privacy-preserving configurations that help protect the underlying data.

Benefits of data clean rooms include:

* **Enhanced privacy** — Protects sensitive data while enabling collaboration.
* **Deeper insights** — Combines data from multiple sources for richer analysis.
* **Increased security** — Reduces the risk of unauthorized access.

## How Snowflake Data Clean Rooms work

With Snowflake Data Clean Rooms, all analyses are conducted within the secure environment of the clean room. Collaborators can
return aggregated results and insights, but can’t directly query the raw data in the clean room. The collaborator who is sharing
data can define what analyses are available to the other collaborators, allowing them to tightly control how their data is used.

## Clean room collaborators

Snowflake Data Clean Rooms is introducing a new data clean room architecture called Collaboration Data Clean
Rooms, which allows customers to collaborate in a fully symmetric, multi-party environment.

Your collaboration definition assigns roles to each collaborator that define their capabilities within the collaboration. Roles include:

* **Owner**: Creates the collaboration and determines who has what roles in a collaboration.
* **Data Provider**: Provides data to selected participants, and specifies Snowflake policies to apply to the data within the collaboration.
* **Analysis Runner**: Can run templates provided by collaborators, using specified data.

Collaboration Owner can assign one or more roles to each participant. All collaborators can
submit templates to the collaboration, and can specify who can use each template.

## Working with Snowflake Data Clean Rooms

Snowflake Data Clean Rooms come with a complete set of APIs that allow users to work with clean rooms programmatically, including the
ability to build custom applications and to customize analysis templates and ML models. For an overview, see
[Data Clean Rooms Developer Guide](developer-guide.md).

Additionally, users can utilize Cortex Code to perform clean room operations in a natural language interface. For more information, see
[Cortex Code](../cortex-code/cortex-code.md).

## Next steps

Your next steps depend on how you got here and what you want to do:

**If you’re a Snowflake administrator and are interested in installing the clean room environment in your Snowflake account:**

1. Read the [overview](overview.md) for background and installation requirements.
2. [Install the clean room environment in your account.](installing-dcr.md)

**If you’re a developer:**

1. If clean rooms are already installed in your Snowflake account, read the [overview](overview.md) for
   background.
2. [Try out the API tutorial](tutorials/collaboration-basic-api-tutorial.md).
3. Don’t forget to check out the [developer guide](developer-guide.md).

---
title: Activating query results
source: https://docs.snowflake.com/en/user-guide/cleanrooms/activation.md
section: Clean Rooms
---

# Activating query results

## Overview of activation

A collaborator can send template results outside the clean room in a process called *activation*.
The template must support activation, and each data provider must approve activation at the column level in their data offering specification.

Activation is implemented using a dedicated activation template. An activation template doesn’t return results to the query runner, but
instead writes them to a results table in the target user’s account.

> **Note:**
>
> Activating results to another Snowflake account requires Snowflake Enterprise Edition or higher.

## Implementing activation

Here are the steps to implement activation:

1. You must use a role that has the [REGISTER DATA OFFERING privilege](collaboration-api-reference.md) to join any collaboration where you are an
   analysis runner and the collaboration specification includes an `activation_destinations` field.
2. Ensure that all specifications are properly configured:

   Data offering specCollaboration specAnalysis spec

   The [data offering specification](spec-data-offering.md) for the table with the activated column must set
   `activation_allowed: TRUE` for that column:

   ```yaml
    api_version: 2.0.0
    spec_type: data_offering
    name: 2025_orders
    version: 2025_01_01_v1
    description: Activating Cleveland sales results for 2025

    datasets:
     - alias: customers
       data_object_fqn: db1.schema1.orders
       allowed_analyses: template_only
       object_class: custom
       schema_and_template_policies:
         email:
           category: join_standard
           column_type: hashed_email_sha256
           activation_allowed: TRUE
         purchase_amount:
           category: passthrough
           activation_allowed: TRUE
   ```

   The [collaboration specification](spec-collaboration.md) must provide `activation_destinations` values for the
   analysis runner. The data offering specification further limits activation to designated analysis runners and templates.

   ```yaml
   api_version: 2.0.0
   spec_type: collaboration
   name: simple_activation_collaboration
   description: Demonstrates a basic activation

   collaborator_identifier_aliases:
     advertiser_1: some_complex_identifier
     publisher_1: another_complex_identifier

   owner: publisher_1

   analysis_runners:
     advertiser_1:
       data_providers:
         advertiser_1:
           data_offerings:
             - id: customer_list
         publisher_1:
           data_offerings:
             - id: user1.2025_orders.sales
       templates:
         - id: activation_template_v0
       activation_destinations:
         snowflake_collaborators:
           - publisher_1
   ...
   ```

   The [analysis specification](spec-analysis.md) must include an `activation` section with
   `snowflake_collaborator` and `segment_name` values, and call an
   [activation template](custom-templates.md). You can’t activate results by running
   a standard analysis template.

   ```yaml
   api_version: 2.0.0
   spec_type: analysis
   name: my_analysis
   description: Description of the analysis
   template: my_activation_template
   template_configuration:
     view_mappings:
       source_tables:
         - alias1.schema1.table1
         - alias2.schema2.table2
     arguments:
       join_column: ip_address
       advertiser_activation_column: purchase_amount
       publisher_activation_column: device_type
     activation:
       snowflake_collaborator: publisher_1
       segment_name: q1_2025
   ```
3. You must use an [activation template](custom-templates.md). This template saves results to an internal table.
   All projected columns from this template are activated.

   Any column in the template with the `activation_policy` filter applied must have `activation_allowed: TRUE`
   in the data offering specification.

   > **Note:**
   >
   > If a template doesn’t apply the `activation_policy` filter to a column, the column can be activated whether or not
   > `activation_allowed: TRUE` is set for that column in the data offering spec.

   The following example shows a template with the activation policy applied to two columns supplied
   by the analysis runner:

   ```sqlexample
   BEGIN
     CREATE OR REPLACE TABLE cleanroom.activation_data_analysis_results AS
       SELECT count(*) AS ITEM_COUNT, c.status, c.age_band
       FROM IDENTIFIER({{ my_table[0] }}) AS c
       JOIN IDENTIFIER({{ source_table[0] }}) AS p
       ON {{ c_join_col | sqlsafe | activation_policy }} = {{ p_join_col | sqlsafe | activation_policy }}
       GROUP BY c.status, c.age_band
       ORDER BY c.age_band;
     RETURN 'analysis_results';
   END;
   ```
4. The analysis runner calls RUN to run the analysis and activate the results.

   * **If activating to yourself**, results are available immediately in the caller’s account.
   * **If activating to another collaborator:**

     1. The collaborator calls VIEW_ACTIVATIONS until it returns a status of SHARED.

        > Activating to another account can take considerable time for large result sets, as the data must be shared to the
        > collaborator’s account. Cross-cloud collaborators will also experience additional delays due to replication frequency settings.
     2. When the status of the activation is SHARED, the collaborator calls PROCESS_ACTIVATION to send the results to their account.

        > The response to PROCESS_ACTIVATION includes the table and segment names. This sets the activation status to PROCESSED.
5. The analysis runner can read results as described in the next section.

## Reading the activation results

When activation is complete, as described in the previous section, results are stored in the
`collaboration_name.activation.segment_records` table in your account.

The table has the following schema:

| Column | Description |
| --- | --- |
| BATCH_ID | UID for the batch job that was processed. |
| SEGMENT_NAME | Name for the activation payload. |
| TEMPLATE_ID | ID of the template used for activation. |
| SHARED_BY | Name of the collaborator who activated the data. |
| UPDATED_ON | Timestamp of when the batch was processed successfully. |
| RECORDS | Payload of activated IDs and attributes from the activation template. |

> **Note:**
>
> If a collaborator leaves the clean room, they lose access to the application, including the table that contains the activated results.

To retrieve the activation results, run the following SQL command, optionally filtering by segment name:

```sqlexample
SELECT *
  FROM <collaboration_name>.activation.segment_records
    [WHERE segment_name = '<segment_name>'];
```

---
title: Activating query results
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/activation.md
section: Clean Rooms
---

# Activating query results

## Overview of activation

The provider or consumer can send template results outside of the clean room in a process called *activation*. Snowflake supports three
types of activation:

* **Provider activation**, where results are pushed to a table in the provider’s Snowflake account.
* **Consumer activation**, where results are pushed to a table in the consumer’s Snowflake account.
* **Third-party activation**, where the provider or consumer pushes results to a Snowflake-approved third-party, such as LiveRamp or Meta
  Ads Manager, through an [activation connector](../connector-activation.md).

In all cases, the template must support activation, and parties should approve activation for any columns of their own data that will be
activated. Data providers specify which columns of their data are activated by setting an activation policy. For more about clean room
policies see [Understanding Snowflake Data Clean Room policies](policies.md).

Activation supports differential privacy, if enabled, and respects differential privacy rules and budgets.

> **Important:**
>
> If the consumer and provider are in different cloud regions, you need to enable [Cross-cloud auto-fulfillment](../laf.md) in both accounts and for both clean rooms.

## Provider and consumer activation

You can configure a clean room to save template results in the provider’s or consumer’s Snowflake account. Both the provider and consumer
must approve activation of any data out of the clean room.

Activation is implemented using a dedicated activation template. In the clean rooms UI, an activation template can be associated with
an analysis template, and the user can run the analysis template, view the results, then run the associated
activation template. The Snowflake-provided Audience Overlap & Segmentation flow does this.

An activation template need not be identical to any associated analysis template. The activation template is often a subset of the analysis
template.

### Supported templates

The following templates support provider and consumer activation:

* Audience Overlap & Segmentation
* [Custom templates](custom-templates.md)

### Supported combinations

Activation can be run either by the provider or by the consumer. (Learn more about
[provider-run analyses](../demo-flows/provider-run-analysis.md).)

The following combinations are supported:

|  | Provider activation | Consumer activation | Third-party activation |
| --- | --- | --- | --- |
| Provider-run | ✅ | ❌ | UI only |
| Consumer-run | ✅ | ✅ | UI only |

### Results

**Provider activation results** are saved to the provider’s account in the table
SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY.

**Consumer activation results** are saved to the consumer’s account in the table
SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONSUMER_DIRECT_ACTIVATION_SUMMARY.

See viewing results to learn how to read the data.

### Implementing provider or consumer activation

Clean rooms UIClean rooms API

**Setup**

* Activation when using the clean rooms UI requires that the clean rooms account [allows activation](../admin-tasks.md).
* For provider-run activation, the clean room must be [configured to support provider-run analysis](../demo-flows/provider-run-analysis.md).

**1. Create or join a clean room**

When creating or joining a clean room, in the Configure Analysis & Query step, under Activation Settings, specify which
columns should be added to the results activated to your account.

**2. Run the template and activate results**

To run the activation associated with your analysis, complete these steps:

1. Run your analysis.
2. After running an analysis, select Results » Activate.
3. Under Activation Hub select the name of the provider or consumer account to activate to.
4. Provide information specific to the activation template, such as descriptive segmentation names or activation
   columns.
5. Provide a segment name: this is an arbitrary string used to identify a set of results from a given run. You can provide a different
   string for each activation to group each run’s results separately, or you can use the same segment name over multiple runs
   if you want to combine results.
6. Select Push Data.
7. To learn how to view activated results, see Viewing provider and consumer activation results.

Activation is performed differently depending on who runs it, and whether it’s consumer or provider activation.

Consumer activation (consumer-run)Provider activation (consumer-run)Provider activation (provider-run)

> **Important:**
>
> The first time a consumer activates data to a provider account in a clean room, the provider must establish a data pipeline by
> signing in to the clean room UI for that account and staying signed in for up to 30 minutes. This needs to be done only once per
> clean room per consumer. Until that is done, data will not appear in the provider’s account, even if the activation succeeds.

Here is how a consumer can push results to their own Snowflake account.

**Provider**

> 1. Create the clean room, link datasets, and set join policies, as for a standard clean room.
> 2. Either choose a supported Snowflake standard template, or add a
>    [custom activation template](custom-templates.md) to the clean room. If this clean room is to be used
>    in the UI, you must provide a web form with the proper activation fields, as described in the template documentation.
> 3. Enable the template for consumer activation by calling `provider.enable_template_for_consumer_activation`.
> 4. To specify which provider columns can be activated, set the activation policy in the clean room for the enabled template by
>    calling `provider.set_activation_policy`.
> 5. Add consumer collaborators, set the default release directive, and publish the clean room, as usual.

**Consumer**

> 1. Install the clean room, link datasets, and set join policies, as for a standard clean room.
> 2. To specify which consumer columns can be activated, set the activation policy in the clean room for that template by calling
>    `consumer.set_activation_policy`.
> 3. Run the activation by calling `consumer.run_activation`, with the last parameter set to TRUE to indicate a consumer
>    activation.
> 4. View the results, as described below.

**Examples**

Download the following examples and upload them as worksheet files in your Snowflake account. You will need separate accounts for
the provider and consumer, each with the clean rooms API installed. Replace the information as noted in the sample files. [See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Provider example code`](../../../_downloads/3cdbbfc219b944cb8d7cb49014a520e4/c-run-c-activation-p.sql)
* [`Consumer example code`](../../../_downloads/edfeb20ec5896fb323528481c1ea3490/c-run-c-activation-c.sql)

Here is how a consumer can push results to a provider’s Snowflake account.

> **Important:**
>
> If the consumer and provider don’t **both** have the clean rooms UI installed, and the consumer is activating to the provider:
>
> * The **consumer** must run the following SQL command:
>
>   ```sqlexample
>   ALTER SHARE SAMOOHA_INTERNAL_GOVERNANCE_SUMMARY_SHARE_NAV2
>     ADD ACCOUNTS = $provider_account_data_sharing_id;
>   ```
>
>   where `$provider_account_data_sharing_id` is the provider’s [Data Sharing Account Identifier](../../admin-account-identifier.md)
> * The **provider** must run the following procedure:
>
>   ```sqlexample
>   CALL samooha_by_snowflake_local_db.provider.mount_provider_activations_share(
>     $consumer_account_data_sharing_id, TRUE, FALSE);
>   ```
>
>   where `$consumer_account_data_sharing_id` is the consumer’s [Data Sharing Account Identifier](../../admin-account-identifier.md).

**1. Provider**

> 1. Create the clean room in the standard way.
> 2. Link datasets. The provider must also link the SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.TEMP_PUBLIC_KEY table into the clean
>    room.
> 3. Set the join policy in the standard way.
> 4. Either choose a supported Snowflake standard template, or add a
>    [custom activation template](custom-templates.md) to the clean room. If this clean room is to be used
>    in the clean rooms UI, you must [provide a web form](../demo-flows/custom-templates.md) with
>    the proper activation fields.
> 5. To specify which provider columns can be activated, set the activation policy in the clean room for that template by calling
>    `provider.set_activation_policy`.
> 6. Add consumer collaborators, set the default release directive, and publish the clean room, as usual. (If you do not have the
>    clean room UI installed, call `provider.setup_provider_activation_share_mount_task` after adding consumers.)

**2. Consumer**

> 1. Install the clean room, link datasets, and set join policies, as for a standard clean room.
> 2. Set the activation policy in the clean room for that template to specify which consumer columns can be activated by calling
>    `consumer.set_activation_policy`.
> 3. Run the activation by calling `consumer.run_activation`, with the last argument set to FALSE to indicate a provider
>    activation.
>
> > **Note:**
> >
> > An encrypted version of the activated data is stored on the consumer’s account for 28 days in the table
> > SAMOOHA_LOCAL_DB_NAME_PLACEHOLDER.PUBLIC.CONSUMER_ACTIVATION_SUMMARY. Data older than 28 days is removed from the consumer’s
> > account.

**3. Provider**

> 1. The first time a consumer activates data to your account you must sign in to the clean rooms UI for this account for about 30
>    minutes after the consumer has activated data. After that, the data will appear in your account. This is done only once per clean room per consumer account. Later activations by the same consumer in the same clean room do not need this step.
>
>    The results must be decrypted before being saved to your account, which can take some time.
>    The decryption task times out after 60 minutes; if this happens, call
>    [provider.update_activation_warehouse](../provider.md) to increase the warehouse
>    size used for decryption.
> 2. View the results, as described below.

**Examples**

Download the following examples and upload them as worksheet files in your Snowflake account. You will need separate accounts for
the provider and consumer, each with the clean rooms API installed. Replace the information as noted in the sample files. [See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Provider example code`](../../../_downloads/249d42fdba29da93cd25d75850a016a9/c-run-p-activation-p.sql)
* [`Consumer example code`](../../../_downloads/76e14e2219bbbe735da2790398954b80/c-run-p-activation-c.sql)

Here is how a provider can push results their own Snowflake account. This combines several techniques, including custom templates,
provider-run analysis, and provider activation, and so involves several rounds of request and approval between the provider and
consumer.

**1. Provider**

> 1. Create the clean room, link datasets, and set join policies, as for a standard clean room, **with one exception**: You must
>    link in the table `samooha_by_snowflake_local_db.library.temp_public_key`. Provider-run data is encrypted, and this enables
>    encryption and decryption of the results.
> 2. Either choose a supported Snowflake standard template, or add a
>    [custom activation template](custom-templates.md) to the clean room. If this clean room is to be used
>    in the UI, you must provide a web form with the proper fields to support activation, as described in the template
>    documentation.
> 3. Set the activation policy in the clean room for that template to specify which provider columns can be activated by calling
>    `provider.set_activation_policy`.
> 4. Add consumer collaborators in the standard way. If you do not have the clean room UI installed you must call
>    `provider.setup_provider_activation_share_mount_task` after adding users.
> 5. Enable provider-run analysis in the clean room by calling `provider.enable_provider_run_analysis`. This must be done
>    **after** adding collaborators but **before** collaborators install the clean room. If you change this setting after a
>    consumer installs the clean room, the consumer must reinstall the clean room for the change to take effect.
> 6. Set the default release directive and publish the clean room, as usual.

**2. Consumer**

> 1. Install the clean room, link datasets, and set join policies as in a standard clean room.
> 2. Set the activation policy in the clean room for that template to specify which consumer columns can be activated by calling
>    `consumer.set_activation_policy`.

**3. Provider**

> * Request permission from the consumer to run your activation template by calling `provider.request_provider_activation_consent`.

**4. Consumer**

> 1. Grant the provider permission to run a given template in this clean room by calling `consumer.enable_templates_for_provider_run`.
> 2. Grant the provider permission to activate results from a given template in this clean room by calling `consumer.approve_provider_activation_consent`.

**5. Provider**

> 1. Enable consumer data to be shared in a provider activation by calling `provider.mount_request_logs_for_all_consumers`.
> 2. Run the activation template by calling `provider.submit_analysis_request`). The request takes several minutes to appear in
>    the logs; check status by calling `provider.check_analysis_status`. Note that even after status is reported as
>    SUCCESS, additional time is required for results to be decrypted and written to the provider’s Snowflake table.
>    All decrypted data is appended at one time to the results table. Keep checking the results table periodically for your segment
>    or activation ID. The **decryption task times out after 60 minutes**; if this happens, call
>    [provider.update_activation_warehouse](../provider.md) to increase the warehouse
>    size used for decryption.
>
> > **Note:**
> >
> > To modify a template after the consumer approves it, you must take the following steps, or else
> > `provider.submit_analysis_request` will continue to run the last approved version of the template.
> >
> > 1. Provider updates the template by calling `provider.add_custom_sql_template`. No need to call
> >    `create_or_update_cleanroom_listing` again.
> > 2. Consumer calls `consumer.enable_templates_for_provider_run`.
> > 3. Consumer calls `consumer.approve_provider_activation_consent`.
> > 4. The updated template is now ready for provider activation.

**Common errors**

* `Object cleanroom_name.CLEANROOM.TEMP_RESULT_DATA does not exist or not authorized` - Temporary results table could not
  be generated for some reason. Could be a SQL error in the template, or your template didn’t explicitly generate a table; look at
  the error details.
* `Query validation checks failed` - Some columns used in the template that weren’t in the activation policies.

**Examples**

Download the following examples and upload them as worksheet files in your Snowflake account. You will need separate accounts for
the provider and consumer, each with the clean rooms API installed. Replace the information as noted in the sample files.
[See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Provider example code`](../../../_downloads/0ff6c0608e468a18e039163d4953ee52/p-run-p-activation-p.sql)
* [`Consumer example code`](../../../_downloads/4430a12585695047da96b61a06593dd6/p-run-p-activation-c.sql)

### Viewing provider and consumer activation results

#### Activation results location and format

All activation results are appended to a clean room designated table in the provider’s or consumer’s account. Each row in the table maps to
a row in the query result. Results from each run are appended to the table (the table is not cleared before each run). You can distinguish
between different runs by the ACTIVATION_ID column, which is unique per activation, or the SEGMENT column, which can be specified by the
caller for each activation run.

> **Note:**
>
> Provider activation results are written in encrypted format to a temporary table in the consumer’s localDB. The results are then copied
> over to the provider’s account and decrypted before saving. This extra move and decryption step can cause delays with large result sets.

* **Provider activation results** are stored in SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY in the provider’s account.
* **Consumer activation results** are stored in SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONSUMER_DIRECT_ACTIVATION_SUMMARY in the consumer’s
  account.

These tables contain the following columns:

USER_ID:
:   One row of results, in JSON format, where the keys are the column names and the values are the value for that column in that row.
    The object also contains a column for each argument passed into the template.

ACTIVATION_ID:
:   A unique ID for each request. The ID is returned from a successful activation request. You can filter by this column to
    get all results for the same activation run, or filter by SEGMENT if you reuse the same segment name across multiple runs. This is the
    same as the query request ID returned by `submit_analysis_request` or `run_activation`.

CLEANROOM_NAME:
:   Name of the clean room where the query was run.

CONSUMER:
:   (*Provider activation only*) The consumer who approved this activation.

PROVIDER:
:   (*Consumer activation only*) The provider who approved this activation.

SEGMENT:
:   An arbitrary string value that you assign when you run the activation. This column enables you to join results across
    multiple query runs.

TIMESTAMP:
:   When the activation was run.

**Provider activation example**

```output
SELECT * FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY WHERE segment = 'my_segment';

                             USER_ID                          |   CLEANROOM_NAME |   SEGMENT  | CONSUMER |          TIMESTAMP      |  ACTIVATION_ID
"{""AGE_BAND"":55,""ITEM_COUNT"":2328,""STATUS"":""MEMBER""}" |  test activation | my_segment | ABC1234  | 2025-04-01 16:27:14.068 | cleanroomactivationdataanalysisresults20250401231728469
"{""AGE_BAND"":20,""ITEM_COUNT"":88,""STATUS"":""PLATINUM""}" |  test activation | my_segment | ABC1234  | 2025-04-01 16:27:14.068 | cleanroomactivationdataanalysisresults20250401231728469
"{""AGE_BAND"":80,""ITEM_COUNT"":18,""STATUS"":""GOLD""}"     |  test activation | my_segment | ABC1234  | 2025-04-01 16:27:14.068 | cleanroomactivationdataanalysisresults20250401231728469
...
```

#### Reading provider or consumer activation results

Run the appropriate SQL command to view results activated to your Snowflake account:

**View provider activation results**

```sqlsyntax
SELECT *
   FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY
   [WHERE segment = <SEGMENT_NAME>] [AND activation_id = <ACTIVATION_ID>];
```

**View consumer activation results**

```sqlsyntax
SELECT *
   FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONSUMER_DIRECT_ACTIVATION_SUMMARY
   [WHERE segment = <SEGMENT_NAME>] [AND activation_id = <ACTIVATION_ID>];
```

Each row of data is combined into an object in the `USER_ID` column. You can flatten results using a query like the following:

```sqlexample
-- Assuming columns AGE_BAND, STATUS, and ITEM_COUNT
SELECT
  item:"AGE_BAND",
  item:"STATUS",
  item:"ITEM_COUNT"
FROM (SELECT parse_json(user_id)
      AS item
      FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY
      WHERE segment = $segment_name)
ORDER BY item:"AGE_BAND", item:"STATUS" ASC
LIMIT 20 ;
```

**View the latest 10 result rows in Snowsight:**

> 1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
> 2. In the navigation menu, select Catalog » Database Explorer.
>
>    * **For provider activation** navigate to `SAMOOHA_BY_SNOWFLAKE_LOCAL_DB` » `PUBLIC` » `Tables` »
>      `PROVIDER_ACTIVATION_SUMMARY`.
>    * **For consumer activation** navigate to `SAMOOHA_BY_SNOWFLAKE_LOCAL_DB` » `PUBLIC` » `Tables` »
>      `CONSUMER_DIRECT_ACTIVATION_SUMMARY`.
> 3. Select Data Preview.

## Third-party activation

Third-party activation deposits query results in the account of a Snowflake-approved third party using a
[third-party activation connector](../connector-activation.md).

Third-party activation is supported only in the clean rooms UI, and not using custom templates.

Activation when using the clean rooms UI is supported only if the clean rooms account [allows activation](../admin-tasks.md).

The clean rooms administrator must configure the environment to support third-party activation connectors, select the allowed connectors,
and configure them, before they can be used in any clean room.

Third-party activation supports both consumer- and provider-run analyses.

### Supported templates

The following templates support third-party activation:

* Audience Overlap & Segmentation

### Implementing third-party activation

1. **Create or join the clean room:** When creating or joining the clean room, in the Configure Analysis & Query step, under
   Activation Settings, specify which columns should be added to the results activated to your account.
2. **Activate results:**

   1. Run your analysis.
   2. After running an analysis, select Results » Activate.
   3. Under Activation Hub select the name of the third-party provider to activate to.
   4. Provide information specific to the provider. This can be providing descriptive names or selecting which columns to activate. The
      tooltips on the page should provide additional information for that provider.
   5. Select Push Data.

---
title: Add custom templates to a clean room
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/custom-templates.md
section: Clean Rooms
---

# Add custom templates to a clean room

Both providers and consumers can add custom templates to a clean room. Custom templates are run the same way as Snowflake-provided
templates. Custom templates are created using the API, and are run using the API or (if designed for it) the UI.

A clean room template is a valid JinjaSQL template. You should
[read the clean room reference guide for custom templates](../custom-templates.md) before trying to create your
own clean room templates.

## Provider-written custom templates

Providers can add a custom template to a clean room without consumer approval. Consumers can run a provider-written template without
approval. The next sections describe how a provider can add a custom template, and a consumer run that template, using the API.

If the provider wants to design a template that a consumer can run in the clean rooms UI, they must
create a user input form for the template.

### Add a provider-written template

Providers add custom templates one at a time by calling `provider.add_custom_sql_template`, passing in the template JinjaSQL as a string.
Custom templates appear in the clean room’s template list, and behave the same as Snowflake-provided templates. A clean room can contain
any mix of custom and Snowflake-provided templates.

You can also upload [custom Python UDFs](custom-code.md) for your template to call.

> **Tip:**
>
> When you add a custom template for consumers to use, you should provide documentation that describes what the template
> does, and the required and optional arguments used by the template.

The following SQL example shows how a provider adds a simple custom template to a clean room:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    $basic_template_name,
    $$
    SELECT
      COUNT(*) AS total_count
    FROM IDENTIFIER({{ my_table[0] }}) AS c
      INNER JOIN IDENTIFIER({{ source_table[0] }}) AS p
      ON IDENTIFIER({{ consumer_id | join_policy }}) = IDENTIFIER({{ provider_id | join_policy }})
    {% if where_clause %}
    WHERE {{ where_clause | sqlsafe }}
    {% endif %};
  $$
);
```

This template takes four required parameters (`my_table` array, `source_table` array, `consumer_id` column name, and `provider_id`
column name) and an optional `where_clause` parameter that specifies a WHERE clause.

In most templates, including the previous example, column names provided by the user must be fully qualified with the table name to avoid column name conflicts. This is because it is not easy to concatenate a table name prefix to a column name in the prefix and get a valid identifier (`IDENTIFIER(p.{{ col_name | sqlsafe }})` is an error). Therefore, you might need the caller to provide a fully qualified table name rather than just a column name. Table names should use the approved lowercases `p` and `c` aliases.

### Run a provider-written template

When using the clean rooms API, consumers call `consumer.run_analysis` to run a template, and providers call
`provider.submit_analysis_request` for [provider-run analyses](provider-run-analysis.md).

If you want a template to be runnable in the clean rooms UI, the provider must create a user input form
for the template. Only provider-written templates can be run in the clean rooms UI.

Clean room collaborators can see the JinjaSQL for any template in a clean room by calling `consumer.view_template_definition`, unless
the provider [obfuscated the template](../provider.md). Only provider-written templates can be obfuscated.

You can call `consumer.get_arguments_from_template` to parse and list the variables used in a template. However, for large or complex
templates this procedure might not list all template variables, so be sure to provide helpful documentation for your template users.

The following example shows how a consumer runs the provider’s custom template shown previously:

```sqlexample
 CALL samooha_by_snowflake_local_db.consumer.run_analysis(
  $cleanroom_name,
  'basic_template',
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],  -- Populates the my_table array.
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],  -- Populates the source_table array.
  OBJECT_CONSTRUCT(
  'consumer_id', 'c.hashed_email',  -- Populates the consumer_id variable.
  'provider_id', 'p.hashed_email',  -- Populates the provider_id variable.
  'where_clause','c.status = $$MEMBER$$ AND c.age_band > 30' -- Populates the where_clause variable.
                                                             -- $$...$$ is used to stringify the column value.
  )
);
```

### Provider template example code

Here is a full code example showing how a provider adds a custom template, and how the consumer runs it.
You need two separate accounts with the clean rooms API installed to run
the code; one account to act as the provider and the other account to act as the consumer.

* [`Provider example code`](../../../_downloads/80e34f3c8c5c6c2e38ea4e078375f3d8/provider-template-p.sql)
* [`Consumer example code`](../../../_downloads/0773544e11ffd9a39d5fdf82dada99de/provider-template-c.sql)

## Consumer-written custom templates

A consumer can add a custom template to the clean room if the provider approves. Once added to the clean room, the consumer-written
template can be run the same as a provider-written template. Here is how a consumer adds a custom template:

1. The provider creates, shares, and publishes a clean room in the standard way.
2. The consumer installs and configures the clean room in the standard way.
3. The consumer calls `consumer.create_template_request` and passes in the custom template string.
4. The provider calls `provider.list_pending_template_requests` to see pending requests.
5. The provider can approve (`provider.approve_template_request`) or reject (`provider.reject_template_request`) the consumer’s request
   to run their own template. (There are also bulk versions of these methods for approving or rejecting multiple requests.) If the provider
   approves the template, the template is added to the clean room immediately.

   * Before the provider approves the template, the provider should first declare any necessary join and column policies on their data.
6. The consumer checks the status of their request by calling either `consumer.list_template_requests` (which shows the approval status)
   or `consumer.view_added_templates` (to see if their template was added to the clean room). A template is added to the clean room only
   after the provider approves it.
7. The consumer runs the template by calling `consumer.run_analysis` in the standard way.

> **Note:**
>
> A provider can run a template added by a consumer if the
> [consumer grants permission](provider-run-analysis.md).

### Consumer template example

Here is a full code example showing how a consumer can submit and run a custom template.
Upload the following worksheet files into your Snowflake account. You need two separate accounts with the clean rooms API installed to run
the code; one account to act as the provider and the other to act as the consumer.

* [`Provider example code`](../../../_downloads/333d148177d16d9faaf78198f0f6cc21/consumer-template-p.sql)
* [`Consumer example code`](../../../_downloads/56922f46a21ef92d28a78e521f593230/consumer-template-c.sql)

## Define a user input form for a custom template

For a custom template to be runnable in the clean rooms UI, the provider must define an input form for the template. This requirement
applies even if the template has no arguments for the consumer to set. Consumers cannot define a user input form for a template.

> **Important:**
>
> If you used `provider.restrict_table_options_to_consumers` or `provider.restrict_template_options_to_consumers` to restrict
> tables or templates to specific users, these restrictions won’t work as expected in the clean rooms UI. You should not enable templates
> for UI usage in clean rooms with these restrictions.

A configuration form enables users in the clean rooms UI to pass values to the custom template, similar to how you pass values to a
template when using the API.

The following example shows a custom template that uses three variables, `max_age`, `favorite_color`, and `source_table`:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
  $cleanroom_name,
  'color_picker_template',
  $$
  SELECT p.hashed_email
    FROM source_table[0] AS p
    WHERE
      p.age <= {{ max_age }} AND
      UPPER(p.favorite_color) = UPPER({{ favorite_color }});
  $$);
```

The following example shows how to pass in the template variables when you run the previous custom template in code:

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.run_analysis(
  $cleanroom_name,
  'color_picker_template',
  [],                                   -- Consumer tables, assigned to my_table array.
  ['MYDB.MYSCH.COLOR_PREFERENCES'],     -- Provider tables, assigned to source_table array.
  object_construct(
    'max_age', 30,                      -- Assign max_age.
    'favorite_color', 'blue'            -- Assign favorite_color.
  )
);
```

To run this template in the clean rooms UI, you must define a form where the consumer assigns these template variables. The following
example shows how to define a simple form where the consumer can assign values to `max_age`, `favorite_color`, and `source_table`:

```sqlexample-javascript
CALL samooha_by_snowflake_local_db.provider.add_ui_form_customizations(
    $cleanroom_name,
    'color_picker_template',
    {                                     -- Top-level template settings.
      'display_name': 'Color matcher',
      'description': 'See which users like the same color as you',
      'methodology': 'Choose a color and a max age',
      'render_table_dropdowns': {
        'render_consumer_table_dropdown': false,
        'render_provider_table_dropdown': true    -- Show a dropdown of provider tables.
      }                                           -- Chosen value is assigned to source_table.
    },
    { -- Form entry elements, one per template argument.
      'max_age': {
        'type': 'integer',
        'display_name': 'Maximum age',
        'description': 'Matching user must be less than or equal to this value.',
        'required': TRUE
      },
      'favorite_color': {
        'type': 'dropdown',
        'display_name': 'Favorite color',
        'description': 'Choose the favorite color to match.',
        'choices': ['Red', 'Blue', 'Green', 'Yellow'],
        'required': TRUE
      }
    },
    {} -- Output config not used in this example.
);

-- You must always call this procedure to propagate UI changes.
CALL samooha_by_snowflake_local_db.provider.create_or_update_cleanroom_listing(
  $cleanroom_name);
```

The previously defined form appears in the clean rooms UI when the consumer runs the template in the Configure Analysis & Query step.
The form includes a table chooser for `source_table`, labeled Collaborator table, an integer chooser element for `max_age`
labeled Maximum age, and a dropdown menu of color names for `favorite_color` labeled Favorite color, as shown in this image:

You can also define drop-down menus that are pre-populated with columns from the provider’s or consumer’s join policies, column policies,
tables, and more. For more information about form element types, see [add_ui_form_customizations](../provider.md).

### Populate `source_table` and `my_table`

The standard `source_table` and `my_table` template variables can be populated as follows:

* **Enable the default table selector drop-down menus:** These drop-down menus are single-selection. You can show or hide them by using the
  `render_provider_table_dropdown` and `render_consumer_table_dropdown` settings. The drop-down menus pass fully qualified table names
  to the `source_table` and `my_table` template variables, respectively.

### Qualify your column names

Most templates require all column names to be fully qualified to avoid column-name ambiguity.

The template must alias all tables as `p` or `c`, depending on whether they are provider or consumer tables. The template should
reference all columns using their `p` or `c` aliases. [Learn more about aliasing.](../custom-templates.md)

If you create a drop-down column selector, you must either supply the `p` or `c` table alias explicitly in a `choices` array of the
drop-down menu, or you must add the alias in your template.

The following example shows how to provide the table alias in a drop-down menu:

```sqlexample
  'provider_join_col': {
    'display_name': 'Provider Join Column',
    'choices': ['p.HASHED_EMAIL', 'p.HASHED_SSN'],
    'type': 'dropdown',
    'description': 'Select the provider column to join users on.',
    'infoMessage': 'We recommend using HASHED_EMAIL.',
    'size': 'M',
    'group': 'Enable Provider Features'
}
```

However, this method is limiting because you must know all the column names in advance.

As an alternative, you can dynamically populate a column drop-down menu by providing a `references` property. However, such a
selector returns bare column names — for example, `hashed_email` — rather than fully-qualified column names — for example,
`p.hashed_email`. If bare column names are returned, you must scope the column to the table explicitly in your template. For example, the
following code creates a drop-down menu where a user can select a column from the provider’s join policy:

> ```sqlexample
> 'p_join_col': {
>   'type': 'dropdown',
>   'references': ['PROVIDER_JOIN_POLICY']
> }
> ```

To use the column name in a template, the template must hard-code the table alias in front of the column name as shown in the following
example:

> ```sqlexample
> SELECT p.{{ p_join_col | sqlsafe }} FROM table_col AS p;
> ```

### Recommendations for developing a template that can be run in the clean rooms UI

The following steps show a recommended workflow for developing a template that can be run in the clean rooms UI:

#### 1. Develop the template

First develop your template and any [scripts](custom-code.md) that it calls by using only the clean
rooms API in both the provider and consumer accounts. Testing the template in the API is much faster and less error-prone than using the UI.

Test your template thoroughly in the API, both on the provider and consumer side, to ensure that the template does exactly what you
want it to do. Testing in the API is very quick, and changes are propagated immediately to the consumer account.

After you test your template and it runs exactly as you want, then move on to designing the input form.

#### 2. Develop the input form

When the template and any uploaded scripts are working as intended, then start working on the input form. At this stage, you use the API in
the provider account, but the UI in the consumer account.

When you make changes using the API, some values in the UI are refreshed immediately, some are refreshed when the user clicks
Refresh, and some are refreshed only every 10 minutes. Therefore, when you work on the input form, create and update the form on the
provider side using the API, but install and configure the clean room in the consumer account using the clean rooms UI,
not the API. This ensures that you are using fresh data in the clean room UI.

Additionally, each time that you make changes to the input form in the API, create a new clean room to ensure that you use the latest clean
room data. Use an incrementing number in the name; for example, “My clean room 1,” “My clean room 2,” and so on. Then, install the clean
room in the client by using the UI. Finally, delete the old clean rooms because there is a limit to the number of clean rooms an account
can hold.

An input form must be attached to a template, otherwise the clean room and form won’t be runnable in the clean rooms UI. When you develop
your form, consider using a template that simply mirrors back all the values that are selected in the form so that you can verify what
values are sent to the template.

For example, let’s suppose that your production template looks like the following template:

```sqlexample
SELECT {{ col1 | sqlsafe | column_policy }}, {{ col2 | sqlsafe | column_policy }}
  FROM IDENTIFIER({{ source_table[0] }}) AS p
  JOIN IDENTIFIER({{ my_table[0] }}) AS c
```

You could create the following template that mirrors back all the values of that production template:

```sqlexample
SELECT
  {{ col1 | default('Undefined')}},
  {{ col2 | default('Undefined') }},
  {{ source_table[0] | default('Undefined') }},
  {{ my_table[0] | default('Undefined') }},
  {{ provider_join_col | default('Undefined') }},
  {{ consumer_join_col | default('Undefined') }}
;
```

Then design a form that sets those six variable values, and attach the form to the mirror template rather than the production template.

**General tips for developing the input form**

The following list provides detailed tips to help you develop an effective input form:

* If you encounter a generic “Installation failed” or “Something went wrong” message when you install, configure, or run a clean room in
  the UI, the message could mean that there is an error with the UI form or associated template that was not caught when you added the form
  or template.
* When one field depends on another field — for example, a column drop-down menu that is based on the value chosen by a table
  drop-down menu — put the parent field first, possibly right above the child field, so that users populate the parent field before
  they populate the child field. With dependent fields, the child drop-down menu is empty until a value is chosen for the parent field.
* If you don’t specify an `order` or `group` value, items are rendered in the order that they are defined.
* Include informative `infoMessage` and `description` text, and show example values that a user might enter.
* Choose the precise element type for the variable data type. For example, for an integer, choose `integer` rather than a free-form text
  box. Your template can cast values by using Jinja filters; for example: `SELECT {{ max_age | int }};`.
* If you don’t define a minimal configuration form for a custom template, the template can’t be run in the clean rooms UI.
* If you don’t define a form element for a variable in the template, a plain text box is rendered for that variable in the user form. This
  is probably not what you want, because the text box is labeled with the template variable name and has no description or suggestions.
* Form elements specified in `add_ui_form_customizations` aren’t rendered unless there is a matching template variable with the same name
  as the element.
* Template changes made in the API propagate quickly and reliably to the UI, so you don’t need to create a new clean room for template
  changes. However, you should develop and test your template in the API before you reach the UI stage.
* You can’t auto-populate a drop-down menu with column values from a given table. You can hard-code values in a drop-down menu, but
  can’t show values from a table at runtime.

#### 3. Connect the input form to the production template

After the form looks exactly like you want it and the form makes all template variables accessible by the user, then assign your working
template to the input form in your call to `provider.add_ui_form_customizations`.

---
title: Analysis specification
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-analysis.md
section: Clean Rooms
---

# Analysis specification

Specifies all the information that analysis runners need to run an analysis, including which template to use, which tables to pass to the
template, and any variable values used by a template. If not using free-form SQL to query data, any analysis runners that want to run an
analysis use this specification to define the template and input data.

**Schema:**

```yaml
api_version: 2.0.0              # Required: Must be "2.0.0"
spec_type: analysis             # Required: Must be "analysis"
template: <template_id>         # Required: ID of the template to use
name: <analysis_name>           # Optional: Unique name (max 75 chars)
version: <version_string>       # Optional: Version identifier (max 20 chars)
description: <analysis_description>  # Optional: Description (max 1,000 chars)

template_configuration:         # Optional: Values used when running the template
  view_mappings:                # Optional: Mappings for shared data
    source_tables:              # Optional: Tables from data offerings. Populates the source_table array variable.
      - <source_table_name>     # One or more source table names from the TEMPLATE_VIEW_NAME column...
    <argument_name>: <view_name>  # Custom argument to template view name mapping
  local_view_mappings:          # Optional: Mappings for local data
    my_tables:                  # Optional: Tables from local data offerings. Populates the my_table array variable.
      - <my_table_name>         # One or more local table names...
    <argument_name>: <view_name>  # Custom argument to local template view name mapping
  arguments:                    # Optional: Template arguments as key-value pairs
    <argument_name>: <argument_value>  # One or more argument key-value pairs...
  activation:                   # Required for activation templates
    snowflake_collaborator: <alias>  # Collaborator alias for activation destination
    segment_name: <segment_name>     # Unique segment name for this activation
```

`api_version`
:   The version of the Collaboration API used. Must be `2.0.0`.

`spec_type`
:   Specification type identifier. Must be `analysis`.

`template: template_id`
:   The ID of the template to use for this analysis. This must be the template ID obtained when the template was registered, not the template
    name.

`name` (*Optional*)
:   A unique, user-friendly name for this analysis. Must follow [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) with a
    maximum of 75 characters and be unique within your Snowflake data clean room account.

`version` (*Optional*)
:   A version identifier for this analysis specification (maximum 20 characters). Must follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) and be unique within your account for this analysis name. A good
    format to use is *YYYY_MM_DD_V#*. For example: `2025_10_22_V1`.

`description` (*Optional*)
:   A high-level description of what this analysis does (maximum 1,000 characters).

`template_configuration` (*Optional*)
:   Values used when running the specified template.

    `view_mappings` (*Optional*)
    :   Mapping of argument names to template view names for shared data offerings.

        `source_tables` (*Optional*)
        :   List of view names to populate the `source_table` template variable. Use the table aliases specified in the data offering spec. You
            can get a list of available views by calling VIEW_DATA_OFFERINGS. Use the view names from the TEMPLATE_VIEW_NAME column. Format of
            each entry is `collaborator_alias.data_offering_ID.dataset_alias`.

        `argument_name: view_name`
        :   Custom mapping of an argument name to a template view name (maximum 255 characters each).

    `local_view_mappings` (*Optional*)
    :   Mapping of argument names to local template view names for private datasets.

        `my_tables` (*Optional*)
        :   List of table names to populate the `my_table` template variable. This is available only to private datasets that you linked by
            calling LINK_LOCAL_DATA_OFFERING. Format of each entry is `collaborator_alias.data_offering_ID.dataset_alias`.

        `argument_name: view_name`
        :   Custom mapping of an argument name to a local template view name (maximum 255 characters each).

    `arguments` (*Optional*)
    :   Template arguments as key-value pairs. Argument values can be strings, numbers, Booleans, arrays, or objects depending on the template
        requirements.

    `activation` (*Required for activation templates*)
    :   Activation-specific configuration required when running activation templates.

        `snowflake_collaborator`
        :   Collaborator alias for the activation destination (maximum 25 characters). Must match an alias defined in the
            `collaborator_identifier_aliases` section of the collaboration specification, and the collaborator must be listed in the
            `activation_destinations` section.

        `segment_name`
        :   Unique segment name for this activation (maximum 255 characters). Used to identify and track activation results. Must follow
            [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md).

---
title: Audience lookalike modeling
source: https://docs.snowflake.com/en/user-guide/cleanrooms/lookalike-audience-modeling-template.md
section: Clean Rooms
---

# Audience lookalike modeling

## About the template

The audience lookalike modeling template empowers you to discover and target new,
high-value customers who mirror your most profitable existing ones. By employing custom machine learning models within a secure Snowflake
Data Clean Room, you can significantly enhance your marketing efforts. The process begins with identifying a *seed audience* (a curated
list of your best customers). The template then analyzes the distinct characteristics and behaviors of this seed audience to build a
predictive model. This model is subsequently used to score a much larger population, identifying individuals who, based on their data
profiles, are most likely to be interested in your products or services. The use of a data clean room ensures that this powerful analysis
can be performed in collaboration with partners without ever exposing or sharing the underlying raw data, guaranteeing privacy and security
for all parties involved. This allows for richer, more accurate modeling by combining insights from multiple data sources in a
privacy-compliant manner.

Specify your seed audience and select features to train a lookalike model. You can adjust boosting rounds and outlier trimming as needed to
optimize the model’s performance. The model is trained on the features of your seed audience and then used to score a larger population,
identifying individuals who are most likely to convert.

## Key use cases

* **Customer acquisition:** Find new customers who are similar to your most valuable existing customers.
* **Increase ROI:** Improve the return on investment of your marketing campaigns by targeting users who are more likely to be interested in
  your products or services.
* **Expand market reach:** Discover new market segments that you may not have previously considered.
* **Personalized advertising:** Deliver more relevant and personalized ad experiences to your target audience.

## Get the worksheets and template

These worksheets show how to create and run a clean room with a lookalike audience modeling template that you can use and modify. The
template includes a UI form so you can run the clean room either in code or in the clean rooms UI. The example enables the consumer to run
the analysis, and optionally to activate the results to the provider’s account.

Download the worksheets and install them in two separate Snowflake accounts in the same organization and the same cloud hosting
environment. [See instructions to upload a SQL worksheet into your Snowflake account](tutorials-and-samples.md).

To try out the templates with sample data, run the sample data generator first in both your provider and consumer accounts, to generate
sample data to use with the clean room.

* [`Download the Python sample data table generator.`](../../_downloads/8043b32150bbd7d384a651fd74f6f496/lookalike-audience-sample-generator.py)
  Run this to generate data that can be used as sample data for the consumer and provider worksheets.
* [`Download the consumer worksheet.`](../../_downloads/602d0090989cde5dc5382673a1f0ffc5/lookalike-audience-modeling-c.sql)
* [`Download the provider worksheet.`](../../_downloads/5a8ac8e50e52fe3166a0271368929989/lookalike-audience-modeling-p.sql)

---
title: Basic consumer-run data analysis
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/basic-flow-data-analysis.md
section: Clean Rooms
---

# Basic consumer-run data analysis

## Overview

This topic demonstrates a basic consumer-run analysis using the clean rooms API. The example shows how a provider can programmatically
create and share a clean room with data, and a consumer can run an analysis against the provider’s data. The provider defines the SQL
queries that can be run against their data. A provider can define queries that query only the provider’s data, only the consumer’s data, or
that join provider and consumer data.

You can download the full code example to upload and run on your Snowflake account.

The following diagram shows the data flow through the main components in a basic consumer-run analysis.

In a basic consumer-run analysis involving two parties, the provider and consumers link data into the clean room. This data is accessed
using a secure view stored in the consumer DB in the clean room application package on the consumer’s account.

During an analysis, the clean room app on the consumer’s account uses the specified consumer and provider secure views, and the results are
shared with the consumer.

## Provider steps

The following list shows the main steps to create, publish, and share a clean room with a consumer:

### Set up the environment

To use the API, you must use a warehouse that SAMOOHA_APP_ROLE has privileges in.
app_wh is one of a [number of warehouses](../v1/installation-details.md) with access to the API. Choose the
warehouse that is appropriate for your needs. (You can also [use your own warehouse](../admin-tasks.md), if you
choose.)

The `SAMOOHA_APP_ROLE` role is required to access the API.

```sqlexample
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;
```

### Create the clean room

The next step is to create a new clean room. This is done with a single API call that specifies whether the clean room is for internal or
external use. Internal clean rooms can be accessed only by consumers within the same organization; external clean rooms can be used by
consumers outside the organization. For both clean room types, consumers must be invited to use the clean room to be able to access it.

External clean rooms trigger additional security checks when certain actions are taken. When this happens, you must call
`provider.view_cleanroom_scan_status` to see when the security scan is done, and you can continue with the next action.

The following example creates an internal clean room:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.cleanroom_init($cleanroom_name, 'INTERNAL');
```

### Link data into the clean room

Both the provider and consumer can *link* (import) tables, views, and
[other supported data objects](../register-data.md) into a clean room. When you link data, the API creates a hidden,
secure view inside the clean room that is based on the linked source object. You reference the linked object by its source name, not
its internal view name, in all clean room procedures.

Data linked into the clean room can’t be accessed directly by any clean room collaborators. Linked data is accessed using a template
imported into the clean room (unless you enable [free-form SQL queries](../v1/web-app-sql-template.md) on your data).

Before an object can be linked into a clean room, the object must be *registered*. Registering an object grants proper access privileges to
the SAMOOHA_APP_ROLE on the object. You can either register an object directly, or register a parent object (such as a database or
schema) to access child objects. You can register an object in either the UI or the API.

> **Tip:**
>
> Registration is easier to perform and manage in the UI than the API.

Objects are registered at the account level, not the clean room level; you need to register an object only once per account, and it can be
linked into any clean room in the account. (You can link only objects registered in your own account.) After you register an object, the
object is available for linking by any clean room in the account. [Learn more about registration.](../register-data.md)

The following example links in the CUSTOMERS table from the sample database SAMOOHA_SAMPLE_DATABASE. This database is registered
automatically when you install the clean room environment in an account, so you don’t need to register it. You can link or unlink objects
at any time in a clean room, and the results propagate quickly to all collaborators.

```sqlexample
CALL samooha_by_snowflake_local_db.provider.link_datasets(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS']);
```

### Set the join policy

If you add a template to the clean room that allows the consumer to join on your data, you should set a
[clean room join policy](../v1/policies.md) on your data. A clean room join policy specifies which columns can be
joined on in queries run by collaborators. Your own join policies don’t constrain your own
queries.

Clean rooms support a [few types of data policies](../v1/policies.md) that you can set on linked data. These policies
are similar to, but not the same as, the equivalent Snowflake policies, and are applied only on the internal view, not on the source data.
Any Snowflake policies that are set on the source data are propagated to the views linked into a clean room. Clean room policies are set on
the linked data only, not on the source data.

> **Important:**
>
> The template is responsible for [using JinjaSQL filters to enforce policies](../custom-templates.md). If the template does
> not use policy filters, the policies will not be respected. Always put policy filters on templates that you write, and
> examine any templates that you run to confirm that they enforce clean room policies.

You can set policies only on the data that you link in; you can’t set policies on any other party’s data.

The following example shows how to set a join policy that allows two columns from the linked table to be joinable:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_join_policy(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL',
   'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_PHONE']);
```

[Learn more about join policies.](../v1/policies.md)

### Add templates to a clean room

A clean room template is a valid JinjaSQL template that typically evaluates to a SQL query. A template, sometimes called an
*analysis*, can be passed arguments by the caller, and can access any data linked into the clean room. Both providers and consumers can add
templates into a clean room and run them.

Snowflake provides a few standard templates, but you will probably write your own custom templates.

Any clean room policies that you set are enforced only if the template includes policy filters
in the template, so make sure that the templates you add to a clean room include these filters. For more information about policies, see
[Understanding Snowflake Data Clean Room policies](../v1/policies.md).

By default, only consumers can run templates. If a
[provider wants to run a template](provider-run-analysis.md), they must ask permission from the
consumer. Similarly, if a [consumer wants to upload a template](custom-templates.md), they must ask permission from
the provider.

For more information about creating a custom template, read [Add custom templates to a clean room](custom-templates.md) and
[Design custom templates](../custom-templates.md).

The following example shows how to add a Snowflake-provided template to the clean room:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_templates(
  $cleanroom_name,
  ['prod_overlap_analysis']);
```

### Set column policies

Clean room column policies specify which columns from your tables can be projected in queries run by collaborators. A column policy is tied
to both a column and a template, so different columns can be defined as projectable with different templates. A template must be present in
a clean room before you can set column policies for that template.

Column policies, like all policies, are *overwrite-only*; this means that setting column policies completely overwrites any existing column
policies set by that account. Both the provider and the consumer can set column policies on their data.
[Learn more about column policies.](../v1/policies.md)

The following example shows how to allow four columns to be projected from the clean rooms sample database that was linked previously:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_column_policy($cleanroom_name, [
  'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
  'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
  'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE',
  'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:REGION_CODE']);
```

### Share with consumers

Only consumers invited by the provider can access the clean room. Consumers can’t share a clean room with other consumers. Designated
consumers can’t access the clean room until it is published. Invitations to join a clean room are at the account level, not at the user
level.

The following example shows how to share a clean room with two consumers. The procedure takes two parallel, comma-delimited lists of
consumer account locators and [consumer data sharing account IDs](../../admin-account-identifier.md).

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_consumers(
  $cleanroom_name,
  'CONSUMER_LOCATOR1,CONSUMER_LOCATOR2',
  'CONSUMER_DATA_SHARING_ACCOUNT_ID1,CONSUMER_DATA_SHARING_ACCOUNT_ID2');
```

#### Sharing with consumers in other cloud hosting regions

If a consumer and provider are in different cloud regions, the provider and consumer must enable
[cross-cloud auto-fulfillment](../v1/enabling-laf.md) before the consumer can be added to the clean room. You can see
your own cloud region by running `SELECT CURRENT_REGION();`. You typically can’t see the consumer’s region, but if you try to add a
consumer in another region, `provider.add_consumers` fails with a message indicating the problem. When this failure occurs, you should
call `provider.remove_consumers` to remove the accounts that are in a different region, then enable cross-cloud auto-fulfillment, and
then add the cross-region accounts again.

### Set the default version

Clean rooms are versioned native applications. Certain actions, such as adding code to a clean room, generate a new patch version of the
application. Consumers must install the clean room in their account. The version that they install is based on the default version number
that you specify. If you later publish a new version of the clean room and increment the default version number, any versions installed by
consumers will automatically update, and new installations will default to the new version.
[Read more about clean room versioning.](../dcr-versions.md)

The following example shows how to set the default version of a clean room to V1.0.0, which is the initial version of a clean room,
if you haven’t uploaded any code:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name,
  'V1_0',          -- Version number: Never changes.
  '0'              -- Patch number: Can change.
  );
```

### Publish the clean room

Publish or republish the clean room as shown in the following example. The first time this procedure is called, it makes the clean room
visible and installable by any consumers that you shared it with. You should call this procedure whenever you make significant changes,
such as when you update the default version or make changes specific to the clean room UI.

```sqlexample
CALL samooha_By_snowflake_local_db.provider.create_or_update_cleanroom_listing(
  $cleanroom_name);
```

Now the consumer can install the clean room, link in data, set policies, and run templates, as described next.

> **Tip:**
>
> When you no longer need a clean room, you should delete the clean room on the provider and consumer accounts
> (`provider.drop_cleanroom` and `consumer.uninstall_cleanroom`). There is a limit to the number of clean rooms and
> collaborators per account. When you leave many unused clean rooms in your account, you can reach your quota.

## Consumer steps

After a provider publishes a clean room, all consumers who were added as collaborators can see and install the clean room using either
the UI or the API. This section shows how a consumer can install a clean room and run an analysis using the API.

Here is a quick overview of the steps the consumer takes to install a clean room and run an analysis:

### Set up the environment

Like the provider, the consumer must use a
[warehouse that SAMOOHA_APP_ROLE can access](../v1/installation-details.md). However, unlike the provider, the
consumer can either use the SAMOOHA_APP_ROLE role directly for full API access, or a clean room administrator in that account can grant a
more limited role that gives privileges to run a subset of the API for consumers. This limited role, sometimes generically called a
“run role,” is granted by a user with full clean room privileges.
[Learn how to grant limited API access.](../manage-dcr-users.md)

A run role doesn’t allow you to install a clean room, so you must use SAMOOHA_APP_ROLE, as shown in the following example:

```sqlexample
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;
```

### Install the clean room

The following snippet shows how to list all clean rooms that are installed you have been invited to install:

```sqlexample
-- See all clean rooms, installed and not.
CALL samooha_by_snowflake_local_db.consumer.view_cleanrooms();

-- See only clean rooms that aren't installed.
CALL samooha_by_snowflake_local_db.consumer.view_cleanrooms() ->>
  SELECT * FROM $1
    WHERE IS_ALREADY_INSTALLED = false;
```

Install the clean room that the provider shared with you, as shown in the following example. You must specify the provider’s account locator
when you install a clean room.

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.install_cleanroom(
  $cleanroom_name,
  '<PROVIDER_ACCOUNT_LOCATOR>');
```

> **Tip:**
>
> Clean rooms have both a name and an ID. For clean rooms created by using the API, use the clean room name wherever an API procedure needs
> a clean room name. For clean rooms created in the UI, use the clean room ID rather than the name wherever an API procedure needs a clean
> room name.
>
> The clean room UI labels clean rooms created using the API as Supported with Developer APIs.

### Add data and set policies

If the clean room templates allow the consumer to include their own data in a query, the consumer registers data, links data, and sets
policies like the provider does. Be sure to use the `consumer` versions of the procedures, as shown in the following example:

```sqlexample
-- You must use a role with MANAGE GRANTS privilege on an object to register it.
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.consumer.register_db('MY_DATABASE');

-- Link some tables.
CALL samooha_by_snowflake_local_db.consumer.link_datasets(
  $cleanroom_name,
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'MY_DATABASE.PUBLIC.EXPOSURES'
  ]);
```

The provider’s join policy shows which provider columns can be joined on. This example shows how to check which provider
columns you can join on:

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_provider_join_policy($cleanroom_name);
```

A provider needs the consumer’s approval to run a template in the clean room. As a result, most consumers don’t bother setting policies on
the tables that they link in. Nevertheless, we recommend that you consider adding policies in case a provider asks to run a template later,
because you might forget to add appropriate policies at that time.

If you do set policies, they are enforced only if the template includes a `join_policy` or `column_policy`
filter to the column in the template, so make sure that the templates you add to a clean room include these filters to enforce your
policies. To examine the templates in a clean room, call `consumer.view_added_templates`. For more information about policies,
see [Understanding Snowflake Data Clean Room policies](../v1/policies.md).

### Run the analysis

Before you run a template, you typically examine it to see what it does and what variables it accepts, then you examine what provider
tables are available in the clean room.

#### Examine the templates

You can list the templates in a clean room and examine the code of each (unless the provider has explicitly
[obfuscated the code](../provider.md)). This can be useful to help you understand the query better. You can
also ask the clean room to parse the template and show which variables you can pass in when you run the code.

You can pass in a list of tables to use in the query, subject to the design of the template. Any table linked in to the clean room can be
passed to the template.

Many templates also support variables that you can specify at run time; for example, to match a particular value or to specify which
columns to show. Ideally, the provider should let you know what the template does and what arguments it accepts. But typically, you also
want to examine a template to see the code. The following snippet lists the templates added to the clean room by any collaborator, and gets
the arguments supported for a specific template:

```sqlexample
-- View the list of templates available in this clean room,
-- and the source code for each template.
CALL samooha_by_snowflake_local_db.consumer.view_added_templates($cleanroom_name);

-- Show which variables can be passed in when running the specified template.
CALL samooha_by_snowflake_local_db.consumer.get_arguments_from_template(
  $cleanroom_name,
  $template_name
);
```

> **Tip:**
>
> If you see the `my_table` array variable used in a template, this holds the list of consumer table names that you pass in when you run
> the template. If you see the `source_table` array variable, this holds the list of provider table names that you pass in when you run
> the template.

#### See what data is available

You can list the datasets that you and the provider have linked into a clean room, as shown in the following example:

```sqlexample
-- See which datasets you have linked into the clean room.
CALL samooha_by_snowflake_local_db.consumer.view_consumer_datasets($cleanroom_name);

-- See which datasets the provider has linked into the clean room.
CALL samooha_by_snowflake_local_db.consumer.view_provider_datasets($cleanroom_name);
```

When you pass in a table name, use the table name, not the view name, from the results of these procedures.

#### Run the template

In the previous two steps, you learned what data you have and what variables you can pass in. You’re now ready to run the analysis.

Depending on the query and the size of the data, you might want to change the warehouse size to
[something more appropriate](../v1/installation-details.md).

The following example shows how a user might call a template that takes both consumer and provider tables, and two variables:
`dimensions`, which is used as a grouping column, and an optional `where_clause`, which is used in a WHERE clause in the query.

The template runs a query against a single provider table, so the request will omit consumer tables.

In the following example, notice how the `dimensions` value is a column name prefixed by `p`. The `p` indicates that this column comes
from the provider table that is passed in. Column names typically require that you add a `p` or `c` to indicate which table they come from,
provider or consumer, to disambiguate the column names. However, this requirement is very template-specific. You need to communicate with
the template provider or examine the template code to understand when these prefixes are required.

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.run_analysis(
$cleanroom_name,
$template_name,
[],                                              -- This template doesn't accept consumer tables.
['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],      -- Provider tables.
object_construct(                                -- Template-specific arguments.
  'dimensions', ['p.STATUS'],                    -- Template takes a variable named 'dimensions'.
  'where_clause', 'p.REGION_CODE=$$REGION_10$$'  -- Template allows you to pass in a WHERE clause.
                                                 -- $$ is used to wrap string literals
  )
);
```

## Example code

The following worksheet files demonstrate how to create, share, and run a clean room analysis.

Download the following examples, and then upload them as worksheet files in your Snowflake account. You need separate accounts for
the provider and consumer, each with the clean rooms API installed.
[See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Provider example code`](../../../_downloads/74f5e256a72d109f3bf5b741432911cd/c-run-analysis-p.sql)
* [`Consumer example code`](../../../_downloads/d898d27c6c1b81d0b16575285b2e0873/c-run-analysis-c.sql)

---
title: Basic multi-party collaboration
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/basic-multiparty-collab.md
section: Clean Rooms
---

# Basic multi-party collaboration

## Introduction

This topic walks through the steps to create a basic multi-party collaboration. It demonstrates how to register templates and data
offerings, how to add data to the initial version of a collaboration, and how collaborators can add resources after the collaboration is
created. It also demonstrates how to run queries using templates and data resources in the collaboration.

## Basic clean room collaboration workflow

Here is a basic multi-party clean room collaboration scenario:

1. The collaboration [owner](../roles.md) registers any templates or data offerings that they want to appear in the
   initial configuration of the collaboration.
2. The owner optionally asks any intended collaborators to register templates or data offerings that they want to appear in the initial
   configuration of the collaboration. Collaborators then give the resource IDs of the registered items to the owner.
3. The owner creates a collaboration. The collaboration is defined by a
   collaboration YAML spec that lists the collaborators, their collaboration roles, and all resources that should be present in the initial
   version of the collaboration.

   * When a collaboration is created, the set of collaborators and their collaboration roles is fixed.
   * Additional resources can be added by collaborators after the collaboration is created, if their collaboration role permits it.
   * If your collaboration shares data with users in other cloud hosting regions, the sharer must [enable Cross-Cloud Auto-Fulfillment on their account](../laf.md).
4. Collaborators review and join the collaboration.
5. Collaborators can then optionally
   link additional resources to the collaboration, such as templates and data
   offerings, depending on their collaboration roles. Additional resources can be added to a collaboration at any time.
6. Analysis runners can run any templates assigned to them in the collaboration, using any data
   available to them in the collaboration. The analysis runner bears the cost of the analysis. Templates can be designed either to return
   query results in the response or to [activate results to the caller or another collaborator](../activation.md).

The following sections describe the details of each of these steps.

## Create a collaboration

To create a collaboration, you design a [collaboration spec](../spec-collaboration.md) that defines all the collaborators
[and their collaboration roles](../roles.md).

If you want to make resources available in a collaboration as soon as it is created, the collaboration owner
registers and links those resources before creating the collaboration, and
includes the resource IDs in the collaboration spec.

If the owner expects to use resources from collaborators, the owner can also prompt those users to register their resources and give the
owner the resource IDs to include in the collaboration spec. The owner also indicates in the collaboration spec where no resources are
linked now, but can be linked in the future.

The owner then calls INITIALIZE to begin creating the collaboration. By default, INITIALIZE also automatically joins the owner to the
collaboration. This is an asynchronous process, so the owner must call GET_STATUS until the status is JOINED.

The following snippet demonstrates creating and joining a collaboration.

```sqlexample-yaml
 1  CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.INITIALIZE(
 2    $$
 3    api_version: 2.0.0
 4    spec_type: collaboration
 5    name: my_first_collaboration
 6    owner: alice
 7    collaborator_identifier_aliases:
 8      alice: example_com.acct_abc
 9      bob: another_example.acct_xyz
10    analysis_runners:
11      bob:
12        data_providers:
13          alice:
14            data_offerings: [] -- alice has not provided data to bob, but can do so in the future.
15          bob:
16            data_offerings: [customers_v1]  -- bob has registered a data offering and made it available to himself.
17        templates: []   -- No templates available yet for bob.
18      alice:
19        data_providers:
20          alice:
21            data_offerings: []
22          bob:
23            data_offerings: []
24        templates: []
25    $$,
26    'APP_WH'            -- Use this warehouse for initialization.
27  );                    --  XSMALL or SMALL warehouses are recommended for initialization.
28  SET collaboration_name = 'my_first_collaboration';
29
30  -- INITIALIZE automatically joins the owner. Check status until JOINED.
31  CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);
32
33  -- Collaboration is visible here when it's joined.
34  CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS();
```

**Notes on the script:**

* The collaboration consists of two collaborators, with the aliases `alice` and `bob`. You can use a full data-sharing ID anywhere you
  use an alias, but that is much less user-friendly.
* `alice` is the owner.
* Both `alice` and `bob` are analysis runners.
* Both `alice` and `bob` are data providers to each other.
* If you are a data provider, you must include the `data_offerings` field. This field can be populated or empty, indicating that there
  are no data offerings now, but they can be added later.
* `alice` isn’t providing data to `bob` or herself, but can do so later (lines 14, 22).
* `bob` has already registered a data offering, and provided it to himself in the initial collaboration (line 16).
* `bob` isn’t providing data to `alice`, but can do so later (line 24).
* Neither `alice` nor `bob` has templates available yet, but they can be assigned later (lines 18, 25). Note that the
  `templates` field is optional for an analysis runner. If you omit this field during initialization, collaborators can still assign
  templates to this analysis runner later.

## Link resources to a collaboration

Collaborators can link resources into a collaboration, or remove resources that they have linked into the collaboration, according to their
collaboration role. There are two steps to linking a resource into a collaboration:

1. The resource owner creates a [resource definition spec](../spec-reference.md) for the resource and uses it to
   register the resource in their account. You can register the resource in your account’s
   [default registry](../registries.md), or use a custom registry.
2. A collaborator links the resource into a collaboration. Resources can be linked into a collaboration either when the collaboration is
   created, by hard-coding the resource ID into the YAML definition used to create the collaboration, or after the collaboration is created
   and joined, by calling the appropriate procedure to link the resource into the collaboration.

After the resource is linked in, it can be used by the designated collaborators. Some resource types, such as templates, can be linked in
by any collaborator; other resources, such as data offerings, can be linked in only by users with the data provider collaboration role.
However, note that you must join a collaboration before any resources you contributed become available to the collaboration.

If you share data with users in other cloud hosting regions, the sharer must
[enable Cross-Cloud Auto-Fulfillment on their account](../laf.md).

Resources are available only to the collaborators designated by the collaboration spec.

> **Note:**
>
> Updates to an existing collaboration, such as linking or removing resources, are asynchronous and take some time to complete. Call
> VIEW_UPDATE_REQUESTS to see the status of an update. Using a resource before it becomes fully available can result in
> inconsistent behavior.

Resources support versioning; however, creating a new resource with a new version doesn’t remove the previous version from the
collaboration. Resources are uniquely named by combining the user-provided name and version (and alias, for data offerings).

To learn more about using resources in your collaboration, see [Resources](../resources.md).

## Review and join a collaboration

You must join a collaboration to share resources and run analyses in the collaboration.

* *The creator* joins automatically when calling INITIALIZE if `auto_join_warehouse` is provided. If `auto_join_warehouse` isn’t
  provided, the creator calls JOIN after INITIALIZE is complete.
* *Non-creators* call REVIEW, and then JOIN.

  + REVIEW returns an overview of the collaboration and its resources. You can call REVIEW only once.
  + JOIN installs the collaboration clean room in your account and joins the collaboration.

Both INITIALIZE and JOIN are asynchronous procedures that take several minutes to complete. You must call GET_STATUS to see when each step
is complete.

> **Important:**
>
> If your account’s cloud hosting region is different from the collaboration owner’s, REVIEW triggers additional asynchronous setup
> steps. Call REVIEW repeatedly until it returns a successful response, indicating that setup is complete.

Joining is an asynchronous process; call GET_STATUS to see when your status is listed as JOINED.

## Run an analysis

If you have the analysis runner role in a collaboration, you can run analyses against data sources shared with you in the collaboration.

Collaborations support two types of queries:

* **Template analyses.** These queries run a template (a templated JinjaSQL statement) linked into the collaboration. Templates can be
  either analysis templates, which return results immediately to you, or activation templates, which save results to the Snowflake account
  of a designated participant.
* **Free-form SQL queries.** If allowed by a data provider, you can access specified data offerings using SQL when signed in with your
  collaborator credentials. You run SQL queries directly, without calling a Collaboration API procedure, by accessing the fully qualified
  view name exposed by the collaboration.

The analysis runner bears the cost of running an analysis.

The collaboration specification determines whether you can run a template, activate results, or run free-form SQL queries. Your
capabilities, as well as the data and templates available for you to use, are described in the collaboration specification.

> **Note:**
>
> Columns from the data sources might have new names when exposed to the template or user. See [Source column renaming](../resources-data-offerings.md) to
> learn how and when source columns are renamed. Templates and user-provided arguments (such as a join column name) must use the final
> name, not the original name, if the column is renamed.

Learn more about all these analysis types in the following sections.

### Run an analysis from a template

To run an analysis from a template, view the list of templates that you can run, view the list of data offerings that you can use, then
call RUN with your values either as individual parameters or as an analysis specification in YAML format.

Tables that you pass into the `source_tables` field in the run configuration populate the `source_table` parameter in the template. The
template’s `my_table` parameter is not populated or used unless you are using Snowflake Standard Edition with your own data.

> **Note:**
>
> Resource installation is asynchronous. If a template was just installed, it can take a short while before it is available to run. If
> the template includes a code bundle, it can take additional time before the template is available.
> [See how to determine when a code bundle is available](../resources-code-bundles.md).

The following example lists data offerings and templates that the user can access, then runs an analysis using the `sales_join_template`
template (which is assumed to be listed by VIEW_TEMPLATES), passing in five named arguments to the template.

```sqlexample-yaml
-- See which data offerings are available.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS($collaboration_name);

-- See which templates you can run.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES($collaboration_name);

-- Pass in the arguments in analysis YAML format.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $$
    api_version: 2.0.0
    spec_type: analysis
    name: My_analysis
    description: Sales results Q2 2025
    template: sales_join_template

    template_configuration:
      view_mappings:
        source_tables:
          -  user1_alias.data_offering_v1.table_1
          -  user2_alias.another_data_offering_v1.table_2
      arguments:                                            -- The template defines conv_purchase_id and the other four arguments.
         conv_purchase_id: PURCHASE_ID                      -- You must examine a template to see which arguments it supports.
         conv_purchase_amount: PURCHASE_AMOUNT
         publisher_impression_id: IMPRESSION_ID
         publisher_campaign_name: CAMPAIGN_NAME
         publisher_device_type: DEVICE_TYPE
  $$ );
```

### Enable and run free-form SQL queries on your data

A data provider can grant analysis runner permission to run arbitrary SQL queries against their data offerings.
This means that the analysis runner can run an arbitrary SQL query directly against the data offering, rather than calling a template.

To learn more about free-form SQL queries, see [Free-form SQL queries](../free-form-sql.md).

### Run an analysis with your own data when you use Standard Edition

If you use Standard Edition, you can run an analysis in the standard way. However,
you can’t link data into the collaboration to share with other users. The only way to pass your own datasets into a template is to use the
technique described here.

**To use your own data in a collaboration on Snowflake Standard Edition:**

1. Register your data offering by calling REGISTER_DATA_OFFERING.
2. Call LINK_LOCAL_DATA_OFFERING to link your data into the collaboration for you to use. No other collaborators can see or access data
   linked locally.
3. Use the data offering ID when you call [RUN](../collaboration-api-reference.md).

> * If you are using the parameterized version of RUN, pass your data offering IDs to the `local_template_view_names` parameter
> * If you are using the YAML version of RUN, provide your data offering IDs in the `local_view_mappings.my_tables` stanza of the request
> * If you are using the parameterized version of RUN, pass your data-offering IDs to the `local_template_view_names` parameter.
>
> > **Tip:**
> >
> > `local_template_view_names` and `local_view_mappings.my_tables` populate the `my_table` parameter in the template.

The following example shows how to run a template using the YAML format version of the run procedure. This example includes the
`my_tables` field, which is populated by calling LINK_LOCAL_DATA_OFFERING.

```sqlexample-yaml
-- See what data offerings are available. Your own local data will be listed here as well.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS($collaboration_name);

-- Pass in the arguments in analysis YAML format.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $$
    api_version: 2.0.0
    spec_type: analysis
    name: my_analysis
    description: Cross-purchase results for Q4 2025
    template: mytemplate_v1

    template_configuration:
      view_mappings:
        source_tables:
          - ADVERTISER1.ADVERTISER_DATA_V1.CUSTOMERS
          - PUBLISHER.ADVERTISER_DATA_V1.CUSTOMERS
      local_view_mappings:
        my_tables:
          - PARTNER.MY_DATA_V1.MY_CUSTOMERS # Populate my_table array with my own table.
      arguments:  # Template arguments, as name: value pairs
         conv_purchase_id: PURCHASE_ID
         conv_purchase_amount: PURCHASE_AMOUNT
         publisher_impression_id: IMPRESSION_ID
         publisher_campaign_name: CAMPAIGN_NAME
         publisher_device_type: DEVICE_TYPE
  $$ );
```

### Activate results

If the data provider and the collaboration spec allow it, you can save analysis results to your own Snowflake account, or the Snowflake
account of a designated collaborator. A template either activates results or returns results immediately, not both.

To learn more about activation, see [Activating query results](../activation.md).

## Leave or delete a collaboration

* Non-owners leave a collaboration by calling LEAVE. Any data offerings they have provided will be removed from the collaboration. You
  can’t rejoin a collaboration after leaving it.
* Collaboration owners can’t leave a collaboration because ownership can’t be transferred. A collaboration owner can drop a collaboration for all collaborators by calling TEARDOWN.

Both processes are asynchronous. You must call GET_STATUS to monitor the status, and call LEAVE or TEARDOWN again when GET_STATUS shows the
status as LOCAL_DROP_PENDING.

## Examples

The following SQL examples demonstrate how to create and run a basic collaboration:

### Two-party collaboration example

The following example demonstrates a two-party collaboration, where one party (named “alice”) is the collaboration creator, a data provider
for herself and “bob”, and an analysis runner. “bob” is a data provider for himself and “alice”, and is also an analysis runner.

The example demonstrates the following actions:

* Creating a collaboration.
* Registering templates and data offerings.
* Linking a template and data offering at collaboration creation time.
* Joining a collaboration.
* Linking additional resources to an existing collaboration.
* Running an analysis.

To run this example, you must have two separate accounts with Snowflake Data Clean Rooms installed.

You can either download the files and upload them to your Snowflake account, or copy and paste the example code into worksheets in two
separate accounts by using Snowsight.

File downloads“alice” code“bob” code

Download the source SQL files, then upload them into two separate accounts that have Snowflake Data Clean Rooms installed:

* [`Collaboration owner "alice" worksheet`](../../../_downloads/162c93b64e33d38d9cdeb15a710fa8fd/demo-collaboration-hub-alice.sql)
* [`Collaboration member "bob" worksheet`](../../../_downloads/ff68e520998de6717efbfe424fdc56db/demo-collaboration-hub-bob.sql)

```sqlexample
-- Basic Snowflake Collaboration Data Clean Rooms example.
-- This file represents user "alice" in a two-collaborator clean room example.

-- Run this worksheet in a Snowflake account with access to the latest version of
-- Snowflake Data Clean Rooms.

-- This file demonstrates the following actions:
-- * How to register a template and a dataset
-- * How to create a collaboration with pre-registered resources.
-- * How to add a template to a collaboration that has already been created, and the
--   template approval flow.
-- * How to run an analysis.

-- This scenario involves two collaborators: bob and alice
-- bob and alice each submits one data source
-- bob and alice are data providers for themselves and each other
-- bob submits one template that only alice can use
-- alice submits one template that they can both use, and one template that only alice can use

-- For more information, read docs.snowflake.com/user-guide/cleanrooms/overview

USE WAREHOUSE APP_WH;
USE ROLE SAMOOHA_APP_ROLE;

-- Secondary roles must be disabled to call link_data_offerings.
USE SECONDARY ROLES NONE;

CREATE DATABASE IF NOT EXISTS ALICE_DB;
CREATE SCHEMA IF NOT EXISTS ALICE_DB.ALICE_SCH;
CREATE OR REPLACE TABLE ALICE_DB.ALICE_SCH.ALICE_DATA AS SELECT * FROM samooha_sample_database.demo.customers LIMIT 100;

-- Register a data offering to use in the initial collaboration definition.
CALL samooha_by_snowflake_local_db.registry.register_data_offering(
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v1
    name: <alice data offering name>
    datasets:
     - alias: customer_list
       data_object_fqn: ALICE_DB.ALICE_SCH.ALICE_DATA
       object_class: custom
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_standard
           column_type: hashed_email_b64_encoded
         status:
           category: passthrough
    $$
    );

-- Save the ID of the registered data offering.
SET alice_data_offering_id = '<data_offering_id>';

CALL samooha_by_snowflake_local_db.registry.view_registered_data_offerings();

-- Register a template to use in the initial collaboration definition.
CALL samooha_by_snowflake_local_db.registry.register_template(
$$
api_version: 2.0.0
spec_type: template
name: alice_only_template
version: <version_number>
type: sql_analysis
description: A test template
template:
  SELECT t1.status, COUNT(*)
    FROM IDENTIFIER( {{ source_table[0] }} ) AS t1
    JOIN IDENTIFIER( {{ source_table[1] }} ) AS t2
    ON t1.hashed_email_b64_encoded = t2.hashed_email_b64_encoded
    GROUP BY t1.status;
$$);

-- Save the ID of the registered template.
SET my_template_id = '<alice_only_template_id>';
CALL samooha_by_snowflake_local_db.registry.view_registered_templates();

-- Create a collaboration with the previously registered template and data offering.
-- The collaboration supports two collaborators, with aliases alice (this account) and bob.
-- Owner: alice
-- Analysis runners:
--   * alice, using her own data, and the template you created and registered earlier.
--   * bob, with no listed templates or data.
-- Data providers:
--   * alice and bob, for alice
--   * alice and bob, for bob
-- Resources added: The template and data offering alice registered earlier.
-- You will add more templates and data offerings to these users later. Only these
-- users are invited to the collaboration, and no additional users can be added later.
-- Replace the <...> placeholders with the appropriate values.
-- Account data sharing IDs are -- SELECT CURRENT_ORGANIZATION_NAME() || '.' || CURRENT_ACCOUNT_NAME();
CALL samooha_by_snowflake_local_db.collaboration.initialize(
$$
api_version: 2.0.0
spec_type: collaboration
name: my_first_collaboration_1_0
owner: alice
collaborator_identifier_aliases:
  alice: <my account data sharing ID>
  bob: <bob account data sharing ID>
analysis_runners:
  bob:
    data_providers:
      alice:
        data_offerings:
        - id: <alice data offering ID>
      bob:
        data_offerings: []
  alice:
    data_providers:
      alice:
        data_offerings:
        - id: <alice data offering ID>
      bob:
        data_offerings: []
    templates:
    - id: <alice only template ID>
$$,
'APP_WH'
);
SET collaboration_name = '<collaboration_name>';

-- INITIALIZE automatically joins the owner. Check status until JOINED.
CALL samooha_by_snowflake_local_db.collaboration.get_status($collaboration_name);

-- Collaboration is visible here when the owner has joined.
CALL samooha_by_snowflake_local_db.collaboration.view_collaborations();

-- Auto-approve any template requests from other collaborators that affect you.
CALL samooha_by_snowflake_local_db.collaboration.enable_template_auto_approval(
  $collaboration_name
);

-- SWITCH TO collaborator to join the collaboration and add a template
-- The template will be auto-approved.

-- Create a new template.
CALL samooha_by_snowflake_local_db.registry.register_template(
    $$
    api_version: 2.0.0
    spec_type: template
    name: both_use_template
    version: 2026_01_12_V1
    type: sql_analysis
    description: test_description
    template:
      select * from identifier({{ source_table[0] }}) limit 5;

    $$
);
SET both_use_template = '<template ID>';

-- Ask to add the template to the collaboration. You must ask bob, because you're
-- including bob in the sharing list. When you share a template with yourself,
-- you auto-approve it.
CALL samooha_by_snowflake_local_db.collaboration.add_template_request(
  $collaboration_name,
  $both_use_template,
  ['alice', 'bob']   -- List of collaborators who can use this template.
  );

-- SWITCH TO bob to approve the request. Request wasn't approved automatically
-- because bob didn't enable auto-approve.

-- See if bob approved the request.
CALL samooha_by_snowflake_local_db.collaboration.view_update_requests($collaboration_name);

-- See what the collaboration spec looks like now, after all the resource updates.
-- Collaboration updates are asynchronous, so if all changes that you made aren't present,
-- wait a minute or two, and then try again.
CALL samooha_by_snowflake_local_db.collaboration.view_collaborations() ->>
  SELECT "COLLABORATION_SPEC" FROM $1 WHERE "SOURCE_NAME" = $collaboration_name;

-- SWITCH TO bob to add a data offering.

-- Run an analysis.
-- Tables are scoped as <data_offering_id>.<alias>.
CALL samooha_by_snowflake_local_db.collaboration.view_data_offerings(
  $collaboration_name
);
SET $bob_data_offering = '<bob data offering ID>';

CALL samooha_by_snowflake_local_db.collaboration.view_templates(
  $collaboration_name
);

-- Run bob's template.
-- Replace the placeholders with your variables.
CALL samooha_by_snowflake_local_db.collaboration.run(
  $collaboration_name,
    $$
    api_version: 2.0.0
    spec_type: analysis
    description: <optional description of the analysis>
    template: '<alice_only_template>'
    template_configuration:
      view_mappings:
        source_tables:
          - '<alice_data_offering_view_name>'
          - '<bob_data_offering_view_name>'
    $$
  );

-- Multi-step cleanup process to delete the collaborations.
-- Doesn't delete registered resources.
CALL samooha_by_snowflake_local_db.collaboration.teardown($collaboration_name);
CALL samooha_by_snowflake_local_db.collaboration.get_status($collaboration_name);

-- When get_status reports LOCAL_DROP_PENDING, call teardown again.
CALL samooha_by_snowflake_local_db.collaboration.teardown($collaboration_name);

DROP DATABASE ALICE_DB;
```

```sqlexample
-- Basic Snowflake Collaboration Data Clean Rooms example.
-- This file represents user "bob" in a two-collaborator clean room example.

-- Run this worksheet in a Snowflake account with access to the latest version of
-- Snowflake Data Clean Rooms.

-- This file  demonstrates the following actions:
-- * Joining a collaboration
-- * Registering and adding a template and a data offering to an existing collaboration.
-- * Running an analysis.

-- For more information, read docs.snowflake.com/user-guide/cleanrooms/overview

USE WAREHOUSE APP_WH;
USE ROLE SAMOOHA_APP_ROLE;

-- Secondary roles can't be active when calling join or link_data_offering.
USE SECONDARY ROLES NONE;

-- Create sample data.
CREATE DATABASE IF NOT EXISTS BOB_DB;
CREATE SCHEMA IF NOT EXISTS BOB_DB.BOB_SCH;
CREATE OR REPLACE TABLE BOB_DB.BOB_SCH.BOB_DATA AS SELECT * FROM samooha_sample_database.demo.customers_2 LIMIT 100;

-- See which collaborations you are invited to, or have joined.
CALL samooha_by_snowflake_local_db.collaboration.view_collaborations();

-- Use SOURCE_NAME column value from the response to view_collaborations().
SET collaboration_name = '<collaboration name>';

-- Use OWNER_ACCOUNT column value from the response to view_collaborations().
SET collaborator_data_sharing_id = '<collaborator_id>';

-- Review and join the collaboration.
-- Joining is asynchronous, so you must call get_status until the status is JOINED before
-- you can perform actions on the collaboration.
CALL samooha_by_snowflake_local_db.collaboration.review($collaboration_name, $collaborator_data_sharing_id);
CALL samooha_by_snowflake_local_db.collaboration.join($collaboration_name);
CALL samooha_by_snowflake_local_db.collaboration.get_status($collaboration_name);

-- Demonstrate the auto-approve flow.
-- Alice enabled auto-approve on her account, so this request will
-- be auto-approved, and the template will be added immediately.

-- Create a template.
CALL samooha_by_snowflake_local_db.registry.register_template(
    $$
    api_version: 2.0.0
    spec_type: template
    name: auto_approve_template
    version: V1
    type: sql_analysis
    description: test_description
    template:
      SELECT * FROM IDENTIFIER({{ SOURCE_TABLE[0] }}) LIMIT 10;
    $$
);
SET auto_approve_template = '<template_id>';

CALL samooha_by_snowflake_local_db.collaboration.add_template_request($collaboration_name, $auto_approve_template, ['alice', 'bob']);
CALL samooha_by_snowflake_local_db.collaboration.view_update_requests($collaboration_name);

-- SWITCH TO other account and request adding a template, and then come back to approve the request.

-- You haven't enabled template auto-approve, so you must approve the request before the template is added.
CALL samooha_by_snowflake_local_db.collaboration.view_update_requests($collaboration_name);
CALL samooha_by_snowflake_local_db.collaboration.approve_update_request(
  $collaboration_name,
  '<request_ID>'
);

-- SWITCH TO bob to see the request status.

-- Register your own data offering.
CALL samooha_by_snowflake_local_db.registry.register_data_offering(
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v3
    name: bob_data
    datasets:
     - alias: my_customer_list
       data_object_fqn: BOB_DB.BOB_SCH.BOB_DATA
       object_class: custom
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_standard
           column_type: hashed_email_b64_encoded
         status:
           category: passthrough
    $$
);

SET my_data_id = '<data offering id>';

-- Share the data offering with yourself and alice.
CALL samooha_by_snowflake_local_db.collaboration.link_data_offering(
  $collaboration_name,
  $my_data_id,
  ['alice', 'bob']
);

CALL samooha_by_snowflake_local_db.collaboration.view_data_offerings(
  $collaboration_name
);

-- View templates that you can use in this collaboration. You can run only templates that list you in the
-- SHARED_WITH column.
CALL samooha_by_snowflake_local_db.collaboration.view_templates($collaboration_name);

-- Run an analysis with your template.
CALL samooha_by_snowflake_local_db.collaboration.run(
    $collaboration_name,
    $$
    api_version: 2.0.0
    spec_type: analysis
    description: <optional description of the analysis>
    template:  '<both_use_template>'
    template_configuration:
      view_mappings:
        source_tables:
          -  '<my_data_offering_view_name>'
          -  '<bob_data_offering_view_name>'
    $$
);

-- SWITCH TO other account to run an analysis.

-- Try running an analysis using alice-only template.
-- This will fail, because you aren't listed as an analysis
-- runner for this template.
CALL samooha_by_snowflake_local_db.collaboration.run(
  $collaboration_name,
  $$
  api_version: 2.0.0
  spec_type: analysis
  description: <optional description of the analysis>
  template: '<alice_only_template>'
  template_configuration:
    view_mappings:
      source_tables:
        - '<my_data_offering_view_name>'
        - '<bob_data_offering_view_name>'
  $$
);

-- Clean up resources.
DROP DATABASE BOB_DB;
```

### Single-party collaboration example

This example demonstrates how to create and use a collaboration if you have only a single account for testing.

The example demonstrates creating a collaboration with a data offering and a template, then adding another data offering and template
after the collaboration is created, and running analyses.

You can either download the file and upload it to your Snowflake account, or copy and paste the example code into a worksheet by using
Snowsight.

File downloadExample code

Download the source SQL file, then upload it into a Snowflake account that has Snowflake Data Clean Rooms installed:

* [`Single account collaboration worksheet`](../../../_downloads/20f15c1dd86d7a782f6e362e78ac20c5/demo-collaboration-single-user.sql)

```sqlexample
-- ============================================================================
-- Single-user Collaboration Clean Rooms demo
-- ============================================================================
-- This example demonstrates a basic Snowflake Data Clean Rooms collaboration
-- using a single Snowflake account and a single role: SAMOOHA_APP_ROLE.
-- One user acts as the owner, data provider, and analysis runner.
--
-- The user creates two sample datasets, registers two data offerings and two
-- templates, then creates a collaboration with one data offering and one template each.
--  After the collaboration is created, the user links the remaining data offering and
-- template, then runs an analysis with each template. Finally, the code
-- cleans up all resources used.
--
-- For more information, see:
--   docs.snowflake.com/user-guide/cleanrooms/overview
--   docs.snowflake.com/user-guide/cleanrooms/spec-reference
-- ============================================================================

-- ============================================================================
-- SETUP: Create sample databases and data.
-- ============================================================================

USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE APP_WH;

-- You can't use secondary roles with most collaboration procedures.
USE SECONDARY ROLES NONE;

CREATE DATABASE IF NOT EXISTS DEMO_DB;
CREATE SCHEMA IF NOT EXISTS DEMO_DB.DATA_SCH;

-- Dataset 1: 300 rows from CUSTOMERS.
CREATE OR REPLACE TABLE DEMO_DB.DATA_SCH.CUSTOMERS_1 AS
  SELECT HASHED_EMAIL, STATUS, AGE_BAND
  FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
  LIMIT 300;

-- Dataset 2: 300 rows from CUSTOMERS_2.
CREATE OR REPLACE TABLE DEMO_DB.DATA_SCH.CUSTOMERS_2 AS
  SELECT HASHED_EMAIL, STATUS, AGE_BAND
  FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2
  LIMIT 300;

-- ============================================================================
-- Register data offerings and templates.
-- ============================================================================

-- Register the first data offering.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
  $$
  api_version: 2.0.0
  spec_type: data_offering
  version: V1
  name: customers_1
  datasets:
    - alias: customers_1
      data_object_fqn: DEMO_DB.DATA_SCH.CUSTOMERS_1
      object_class: custom
      allowed_analyses: template_only
      schema_and_template_policies:
        hashed_email:
          category: join_standard
          column_type: hashed_email_b64_encoded
        status:
          category: passthrough
        age_band:
          category: passthrough
  $$
);

SET data_offering_1_id = '<data_offering_1_id>';

-- Register the second data offering.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
  $$
  api_version: 2.0.0
  spec_type: data_offering
  version: V1
  name: customers_2
  datasets:
    - alias: customers_2
      data_object_fqn: DEMO_DB.DATA_SCH.CUSTOMERS_2
      object_class: custom
      allowed_analyses: template_only
      schema_and_template_policies:
        hashed_email:
          category: join_standard
          column_type: hashed_email_b64_encoded
        status:
          category: passthrough
        age_band:
          category: passthrough
  $$
);

SET data_offering_2_id = '<data_offering_2_id>';

-- Register a template that joins two tables on hashed_email and returns
-- a count of rows grouped by age_band.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
$$
api_version: 2.0.0
spec_type: template
name: age_band_count
version: V1
type: sql_analysis
description: Joins two tables on hashed_email and returns age_band with row counts.
template:
  SELECT t1.age_band, COUNT(t1.age_band) AS age_band_count
    FROM IDENTIFIER({{ source_table[0] }}) AS t1
      JOIN IDENTIFIER({{ source_table[1] }}) AS t2
      ON t1.hashed_email_b64_encoded = t2.hashed_email_b64_encoded
    GROUP BY t1.age_band;
$$
);

SET age_band_template_id = '<age_band_template_id>';

-- Register a template that joins two tables on hashed_email and returns
-- a count of rows grouped by status.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
$$
api_version: 2.0.0
spec_type: template
name: status_count
version: V1
type: sql_analysis
description: Joins two tables on hashed_email and returns status with row counts.
template:
  SELECT t1.status, COUNT(t1.status) AS status_count
    FROM IDENTIFIER({{ source_table[0] }}) AS t1
      JOIN IDENTIFIER({{ source_table[1] }}) AS t2
      ON t1.hashed_email_b64_encoded = t2.hashed_email_b64_encoded
    GROUP BY t1.status;
$$
);

SET status_template_id = '<status_template_id>';

-- Confirm that both data offerings and both templates are registered.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS();
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_TEMPLATES();

-- ============================================================================
-- Create the collaboration with one data offering and one template.
-- ============================================================================

-- Replace <account_data_sharing_id> with:
--   SELECT CURRENT_ORGANIZATION_NAME() || '.' || CURRENT_ACCOUNT_NAME();
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.INITIALIZE(
  $$
  api_version: 2.0.0
  spec_type: collaboration
  name: single_user_demo
  owner: me
  collaborator_identifier_aliases:
    me: <account_data_sharing_id>
  analysis_runners:
    me:
      data_providers:
        me:
          data_offerings:
            - id: <data_offering_1_id>
      templates:
        - id: <age_band_template_id>
  $$,
  'APP_WH'
);

SET collaboration_name = '<collaboration_name>';

-- Verify that the owner has joined. Repeat until status is JOINED.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- ============================================================================
-- Link the remaining data offering and template into the collaboration.
-- ============================================================================

-- Link the second data offering.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LINK_DATA_OFFERING(
  $collaboration_name, $data_offering_2_id, ['me']);

-- Add the status_count template to the collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ADD_TEMPLATE_REQUEST(
  $collaboration_name, $status_template_id, ['me']);

-- ============================================================================
-- List resources and run analyses.
-- ============================================================================

-- List all data offerings in the collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS($collaboration_name);

-- List all templates in the collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES($collaboration_name);

-- Run the age_band_count template.
-- Replace placeholders with the template name/version and view names from
-- VIEW_TEMPLATES and VIEW_DATA_OFFERINGS.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $$
  api_version: 2.0.0
  spec_type: analysis
  description: Count matching rows grouped by age_band.
  template: '<age_band_count_template_name_and_version>'
  template_configuration:
    view_mappings:
      source_tables:
        - '<data_offering_view_1>'
        - '<data_offering_view_2>'
  $$
);

-- Run the status_count template.
-- Replace placeholders with the template name/version and view names from
-- VIEW_TEMPLATES and VIEW_DATA_OFFERINGS.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $$
  api_version: 2.0.0
  spec_type: analysis
  description: Count matching rows grouped by status.
  template: '<status_count_template_name_and_version>'
  template_configuration:
    view_mappings:
      source_tables:
        - '<data_offering_view_1>'
        - '<data_offering_view_2>'
  $$
);

-- ============================================================================
-- CLEANUP: Delete the collaboration, registered resources, and sample data.
-- ============================================================================

-- Teardown is a multi-step process. Call TEARDOWN, then wait for GET_STATUS
-- to report LOCAL_DROP_PENDING, then call TEARDOWN again.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- When GET_STATUS reports LOCAL_DROP_PENDING, call TEARDOWN again to complete.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);

-- Unregister the data offerings and templates from the default registry.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.UNREGISTER_DATA_OFFERING($data_offering_1_id);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.UNREGISTER_DATA_OFFERING($data_offering_2_id);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.UNREGISTER_TEMPLATE($age_band_template_id);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.UNREGISTER_TEMPLATE($status_template_id);

-- Drop the sample database.
DROP DATABASE IF EXISTS DEMO_DB;
```

---
title: Clean room managed accounts
source: https://docs.snowflake.com/en/user-guide/cleanrooms/managed-accounts.md
section: Clean Rooms
---

# Clean room managed accounts

## Overview

A clean room provider must have a Snowflake account. However, a provider can collaborate with consumers who do not have Snowflake
accounts by inviting them to collaborate using a *clean room managed account*. To begin collaborating in a clean room, the consumer simply
accepts the provider’s invitation to use the managed account.

If you want to invite new managed account users to your clean rooms, contact your clean rooms account representative.

A managed account can be converted to a Snowflake account in the consumer’s organization if the managed account user want to become a
Snowflake Service customer.

[See the usage terms](https://www.snowflake.com/en/legal/other/data-clean-rooms/managed-account/) that apply to managed accounts
in Snowflake Data Clean Rooms.

> **Important:**
>
> The consumer who accepts a provider’s invitation to use a managed account pays for the use of the clean room. When accepting the
> invitation, the consumer must enter billing details before accessing the clean room environment.

## Requirements and limitations

A managed account has the following requirements and limitations:

* It requires the use of external tables for the managed account user to import data. As a result, the provider must explicitly
  [allow the use of external tables in the clean room](register-data.md).
* It does not behave the same as a Snowflake [reader account](../data-sharing-reader-create.md). The
  consumer does not access the managed account outside the context of the clean room environment.
* It can be used only as a clean rooms consumer, not a clean rooms provider.
* It does not support the use of [identity connectors](connector-identity.md) in an analysis.
* An underlying Snowflake instance is created in the same cloud region as the provider but the managed account consumer can
  link their data from any cloud region. The managed account user can access the underlying data only by using the clean rooms UI.
* A managed account user cannot use the clean rooms APIs.
* Providers using a trial Snowflake account cannot invite managed account users as collaborators.

## Provider tasks

Follow these steps to add a managed account user as a collaborator:

### 1. Enable external tables for your account (and clean room)

The provider must ensure that [external tables are enabled for the account](register-data.md) (and, if the
provider is using the API, the specific clean room). The consumer links data using an external table connector appropriate for their cloud
platform. The consumer does not need to enable external tables.

### 2. Invite a consumer to collaborate using a managed account

When a provider wants to collaborate with a consumer who does not have a Snowflake account, they can invite them to
collaborate using a managed account.

> **Important:**
>
> Contact your clean rooms account representative to request the ability to add new managed account users to your account.

To send a consumer an invitation for a managed account:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Collaborators.
3. Select Managed Accounts » + Managed Account.
4. In the Company Name field, enter the name of the consumer you are inviting to use the managed account.
5. In the Account Admin Email, enter the email of the consumer’s administrator. The invitation to use the managed account is sent to
   this email.
6. Select Invite.

   An email is sent to the consumer inviting them to use the managed account to access a clean room environment.

### 3. Find the account identifier of a managed account

Clean room managed accounts have account identifiers just like fully capable Snowflake accounts. You might need an identifier
for tasks like using the developer API to share a clean room with a consumer.

To find the account locator or account name for a managed account:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Collaborators.
3. Find the name of the managed account, and do one of the following:

   1. If you need the [account locator](../admin-account-identifier.md) format of the account identifier, copy the value under
      Account Locator.
   2. If you need the [account name](../admin-account-identifier.md) format of the account identifier, copy the value under
      Account Identifier.

### 4. Share a clean room with a managed account

The consumer is limited to using the UI from a managed account, so you can share only clean rooms that have an analysis that is runnable in
the UI. This means either an analysis created in the UI, or a custom template that has a
[user input form](demo-flows/custom-templates.md).

You cannot share a clean room with a consumer until they accept your invitation to collaborate using the managed account. To
determine whether the consumer has accepted the invitation and signed in to the clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Collaborators.
3. Find the name of the managed account. If the consumer has accepted the invitation, the status of the account is Active. You have
   the option of resending the invite if the consumer did not respond to the original email.

After the consumer accepts the invitation to use the managed account, you can create a clean room to share with the consumer. Simply select
them as a collaborator during the Share portion of the creation process.

## Consumer (managed account user) tasks

Working with managed accounts as a consumer consists of the following tasks:

### Get started with the managed account

When a provider invites a consumer to collaborate using a managed account, the consumer administrator receives an email that lets them sign
up for the clean room environment. The provider cannot share a clean room with the consumer until the administrator uses the link in the
email to complete
the sign up process.

Because the consumer pays for their use of the managed account, the first person to sign in to the clean room environment is prompted to
enter billing information. If you want to change this billing information after the initial sign in, contact [accounts.receivable@snowflake.com](mailto:accounts.receivable%40snowflake.com).

### Access your data in a clean room

You can join your data with the provider’s data to gain valuable insights. Clean room external data connectors let you
link your data into a clean room.

Follow the steps in one of the following topics, depending on your cloud hosting platform, to link your data into a clean room:

* [Snowflake Data Clean Room: External data from an Amazon S3 bucket](external-data-aws.md)
* [Snowflake Data Clean Room: External data from Azure Blob Storage](external-data-azure.md)
* [Snowflake Data Clean Room: External data from Google Cloud Platform](external-data-gcp.md)

These topics also include information about revoking access to your data, which you can do at any time.

### Join a clean room

After a provider creates and shares a clean room with you, you can sign in to the clean room environment and join the clean room to start
running analyses. To join a clean room:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Clean Rooms.
3. Select the Invited tab.
4. Find the tile for the clean room, and select Join.

### Monitor and manage the cost of your managed account

As a consumer, you pay for the use of the clean room managed account that the provider created for you. Snowflake Data Clean Rooms lets you:

* Monitor how many credits have been consumed by your clean room activities during the current month.
* Set a limit on how much you spend on clean rooms in a given month. After a limit has been set, users cannot sign in to the clean rooms UI
  if the total credit consumption is within 10 credits of the limit.

To monitor and manage the cost of your managed account:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Admin » My Account.
3. Use the Credit Limit & Usage section to set a monthly spending limit and view the current number of credits consumed. A blank
   limit allows unlimited spending.

### Become a Snowflake Service customer

If you want to start using a managed account for more than clean rooms, you can convert it to a fully capable
Snowflake account. To convert a managed account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Clean room versioning
source: https://docs.snowflake.com/en/user-guide/cleanrooms/dcr-versions.md
section: Clean Rooms
---

# Clean room versioning

> **Note:**
>
> This topic is for clean room creators. Clean room consumers don’t need to think about clean room versioning.

## Clean room version numbering

Snowflake clean rooms are versioned. The initial version of a clean room without any Python code is V1.0.0.

Snowflake automatically creates a new version of a clean room after certain **provider** events, such as uploading Python code or enabling
external or Apache Iceberg™ tables. Snowflake creates a new version only if the security scan triggered by this action passes. Relatively few
provider actions can generate a new clean room version, and procedures that create a new version mention the new version in the procedure
response.

Actions that fail the security scan don’t generate a new version.

Only provider actions can result in a new clean room version; consumer actions cannot.

Snowflake increments only the patch number (the last digit) with each new version. So version numbers for three successive versions would be V1.0.0, V1.0.1, and V1.0.2.

Clean rooms are versioned because they are implemented as [native application packages](../../developer-guide/native-apps/native-apps-about.md). In
Snowflake’s native application framework, the convention is that for version V1.0.2, “V1.0” (a string) is the version number and 2 (an
integer) is the patch number. Clean room documentation typically uses the term “version” to indicate the entire number (V1.0.1) rather
than simply the “V1.0” prefix (as sometimes used in the native app framework).

You can see the version history and review status for a given clean room by calling
`SHOW VERSIONS IN APPLICATION PACKAGE samooha_cleanroom_CLEANROOM_ID;` with the ID of the clean room.

## Default release directive

Each clean room is assigned a *default release directive* by the clean room provider. The default release directive specifies which version
of the clean room should be installed or loaded in the user’s account. Consumers cannot specify which version of a clean room to install.
Updates are handled automatically by Snowflake [as available resources dictate](../../developer-guide/native-apps/update-app-overview.md), and there can be
a delay before the new version is installed on the user’s account.

A clean room provider must specify the default release directive of a clean room before the clean room can be shared initially (either
internally or externally) or whenever the provider uploads code and the security scan passes. If a new version of the clean room is
generated but the default release directive is not updated, consumers will continue to be served the last default version.

You must always set the default release directive before publishing a clean room. If you haven’t added Python code, it should be
V1.0.0, as shown here:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name, 'V1_0', '0');
```

A clean room provider can roll the default release directive back to an earlier release if desired.

Specify the default release directive for a clean room by calling `provider.set_default_release_directive`.

A provider must set the default release directive only when creating or modifying a clean room in code. Versioning is handled
automatically when using the clean rooms UI.

Snowflake generates a new version only if the security scan triggered by a provider action passes. Therefore you should check the security
scan status for a clean room by calling `provider.view_cleanrooom_scan_status` before updating the default release directive. Not
updating the default release directive will not cause an error, but the newer version with your changes will not be published to users if you
don’t update the default release directive.

### Clean rooms with errors

If you publish a clean room with an error, which happens when the security scan fails or you upload Python code with a syntax error, a
patch is generated, but you cannot use that version as a default release directive. Until you publish a fixed version, any additional
patches incorporate the error from the previous failed patch and also result in a failed clean room patch.

## Versioning cheat sheet

List all clean room packages (clean rooms) created in this Snowflake account:

```sqlexample
SHOW APPLICATION PACKAGES STARTS WITH 'SAMOOHA_CLEANROOM_';
```

List all versions of the clean room MY_FIRST_CLEANROOM:

```sqlexample
SHOW VERSIONS IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_MY_FIRST_CLEANROOM;
```

See your current default release directive:

```sqlexample
SHOW RELEASE DIRECTIVES IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_<your_clean_room_name>;
```

Check the scan review status before setting the version if this is a clean room that you just made external, or if this is already external
and the version changed:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_cleanroom_scan_status('MY_FIRST_CLEANROOM');

-- When REVIEW_STATUS = APPROVED, you can update the default version to the
-- latest version, if you haven't done so already.
SHOW VERSIONS IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_MY_FIRST_CLEANROOM;
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name, 'V1_0', '<<LATEST_PATCH_NUMBER>>');
```

---
title: Clean rooms UI overview
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/web-app-introduction.md
section: Clean Rooms
---

# Clean rooms UI overview

The clean rooms UI provides a user interface to create, share, and use a
[Snowflake Data Clean Room](../overview.md), run queries, and more. The clean rooms UI allows non-technical
business users to collaborate in a secure environment in a browser-based GUI.

## Prerequisites

An administrator must configure the clean room environment and add you as a clean rooms UI user for your Snowflake account.

Customers without externally accessible endpoints (for instance, customers with only PrivateLink connections) cannot use the clean rooms UI.

## Sign in to the clean rooms UI

> **Note:**
>
> The following login instructions and link work no matter where your clean rooms UI is hosted. If your
> clean rooms UI is hosted in a different region, the login response might return an error, but it will include a link to the proper login
> URL for your UI.

After an administrator has added you to the clean room, you can sign in to the UI using the following procedure:

1. Navigate to the [Snowflake Data Clean Rooms login page](https://cleanroom.c1.us-east-1.aws.app.snowflake.com).
2. Choose or provide the account locator for the account you want to sign in to.
3. Provide your Snowflake account credentials.
4. Do one of the following:

   * **If you’ve set up multi-factor authentication (MFA) before**, enter the one-time code from your authentication app.
   * **If this is your first time signing in**, scan the QR code with an authentication app to
     enable MFA. Then do the following:

     1. When prompted, enter the one-time code from the app.
     2. Copy the recovery code to a separate location in case the authentication app is unavailable on your device when you sign in.

### Recommended authenticator apps for multi-factor authentication

All clean rooms UI users must log in using multi-factor authentication (MFA). Snowflake recommends using one of the following third-party
authenticator apps:

* Authy
* Google Authenticator
* Auth0 Guardian
* Microsoft Authenticator

Once MFA is enabled, you will be prompted to enter a one-time code from the authenticator app every time you sign in to the clean rooms UI.

## Clean rooms UI hosting locations and IP addresses

> **Important:**
>
> Using the clean rooms UI to work with your data in a Snowflake Data Clean Room can result in that data being processed in a
> different cloud platform and region than your Snowflake account.

The following table summarizes which cloud platform and region are used to process data for Snowflake accounts in a particular region of
Amazon Web Service (AWS), Microsoft Azure (Azure), and Google Cloud (GCP). It includes the following columns:

* **Snowflake account region**: The cloud region where your Snowflake account is registered.
* **UI gateway region**: The region that hosts the clean rooms UI for your account. Use this address to log into clean rooms UI.
* **Network addresses used by clean rooms UI**: These are the network addresses used by the clean rooms UI to communicate with your
  Snowflake account. If your Snowflake account uses a [network policy](../../network-policies.md) to control network traffic, your
  account administrator must explicitly allow traffic from all IP addresses in this column for your row. If your account has no externally
  available endpoints, you can’t use the clean rooms UI in your account.

| Snowflake account region | UI gateway region | Network addresses used by clean rooms UI |
| --- | --- | --- |
| * AWS South America (Sao Paulo) * AWS US East (N. Virginia) * AWS US East (Ohio) * AWS US West (Oregon) * Azure Central US (Iowa) * Azure East US 2 (Virginia) * Azure Mexico Central (Querétaro) * Azure South Central US (Texas) * Azure West US 2 (Washington) * GCP US Central1 (Iowa) * GCP US East4 (N. Virginia) | [AWS US East (N. Virginia)](https://cleanroom.c1.us-east-1.aws.app.snowflake.com/) | 52.7.249.136  34.195.16.248  52.7.210.215 |
| * AWS Canada (Central) * Azure Canada Central (Toronto) | [AWS Canada (Central)](https://cleanroom.c1.ca-central-1.aws.app.snowflake.com/) | 15.223.145.218  3.96.6.109  15.222.142.44 |
| * AWS Europe (London) * AWS EU (Ireland) * AWS EU (Frankfurt) * AWS EU (Paris) * AWS EU (Stockholm) * AWS EU (Zurich) * AWS Africa (Cape Town) * Azure North Europe (Ireland) * Azure Sweden Central (Gavie) * Azure Switzerland North (Zurich) * Azure UAE North (Dubai) * Azure UK South (London) * Azure West Europe (Netherlands) * GCP Middle East Central2 (Dammam) * GCP Europe West (Frankfurt) * GCP Europe West2 (London) * GCP Europe West4 (Netherlands) | [AWS EU (Frankfurt)](https://cleanroom.c1.eu-central-1.aws.app.snowflake.com/) | 54.93.86.99  3.126.238.8  3.127.143.168 |
| * AWS Asia Pacific (Mumbai) * Azure Central India (Pune) | [AWS Asia Pacific (Mumbai)](https://cleanroom.c1.ap-south-1.aws.app.snowflake.com/) | 35.154.94.29  13.235.168.249  15.206.48.175 |
| * AWS Asia Pacific (Singapore) * AWS Asia Pacific (Tokyo) * AWS Asia Pacific (Osaka) * AWS Asia Pacific (Seoul) * AWS Asia Pacific (Jakarta) * Azure Southeast Asia (Singapore) * Azure Japan East (Tokyo) * Azure Korea Central (Seoul) | [AWS Asia Pacific (Singapore)](https://cleanroom.c1.ap-southeast-1.aws.app.snowflake.com/) | 13.228.90.174  52.220.42.130  52.220.249.16 |
| * AWS Asia Pacific (Sydney) * Azure Australia East (New South Wales) | [AWS Asia Pacific (Sydney)](https://cleanroom.c1.ap-southeast-2.aws.app.snowflake.com/) | 52.65.205.236  52.62.198.227  3.104.160.96 |

## Learning Resources

After you sign in to the UI, see [Run an analysis in the UI](web-app-working.md) for information about
creating, sharing, and using a clean room. You can also use the Help Center in the clean room environment to guide you.

You can also complete a [tutorial](tutorials/cleanroom-web-app-tutorial.md) for help with getting started.

---
title: Clean rooms UI tour
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/ui-tour.md
section: Clean Rooms
---

# Clean rooms UI tour

This topic introduces you to the main pages and elements of the clean rooms UI, and how to use them for common tasks.

The clean rooms UI might look slightly different for you than shown here because the clean rooms UI is updated periodically, and also the
clean rooms UI looks different for users on an account that isn’t
[upgraded to use Snowflake authentication](../update-to-oauth.md). Additionally, some pages require that you have a
[specific clean rooms role](../manage-dcr-users.md) to be able to access. If you don’t have the appropriate role to access a
page, the page might be disabled or not appear in your clean rooms UI. Using the ACCOUNTADMIN role gives you access to all the pages
described here.

To follow along in your browser, [sign in to the clean rooms UI](web-app-introduction.md).

## Clean rooms inventory page

The landing page that you see after you sign in is the clean rooms inventory page. Here you can see all the clean rooms that you created, joined, or are
invited to join, but haven’t yet joined.

1) Navigation bar:
:   Use the navigation bar to go to the other clean rooms pages, described elsewhere in this topic.

2) Clean room list:
:   Clean rooms are grouped into those you have created, those you have joined (installed) as a consumer, and those you are invited to join as
    a consumer. You can filter the list in the current view by entering a full or partial string in the filter textbox above the list.

    Each tile shows the clean room name, the collaborators, when the clean room was created or last edited, and a button that initiates a
    primary action for that clean room as described below.

    Clean rooms are grouped into the following tabs:

    * Created tab: Shows clean rooms that you created by using the UI or the API. Clean rooms created by using the API are labeled as
      Supported with Developer APIs.

      + Edit a clean room you have created by selecting Edit, or  » Edit in a clean room tile.
      + If a tile shows a Run button in the Created list, the clean room is enabled for [provider-run analyses](../demo-flows/provider-run-analysis.md).
      + To delete a clean room, select  » Delete.
    * Joined tab: Shows clean rooms that you have *joined* (installed as a consumer) using either the UI or the API. After you join a clean
      room, you must configure it to link in your own datasets, set your join and column policies, and provide any other values required for
      the particular analyses or templates in the clean room.

      After you configure the joined clean room, you can run analyses in joined clean rooms by selecting the Run button, or edit your
      values, reset the clean room to original values, or leave (uninstall) the clean room by using the  (More) list
      options. For more information, see [Install (join) a clean room](../manage-clean-rooms.md) or [Delete a clean room that you created](../manage-clean-rooms.md).
    * Invited tab: Shows clean rooms that you are invited to join as a consumer. To join the clean room, select the Join button.
      Joining (installing) a clean room can take several minutes. After it is joined, the clean room is moved from the Invited view to the
      Joined view. You must then configure the clean room so that you can run any analyses in that clean room.

    > **Important:**
    >
    > If a clean room has been created using the API (it’s labeled Supported with Developer APIs), the clean room can be used in the UI
    > only if it is explicitly designed for UI use. Otherwise, clean rooms that you create by using the API are usable only in the API.
    >
    > To see any templates available for you to use in the UI, select Run. UI templates are listed in the Select Analysis / Query
    > list; templates listed in the Developer API Templates section can be run only using the API.

3) Clean room creation button:
:   Select this button to create a new clean room. For more information, see [Create a new clean room](../manage-clean-rooms.md).

4) Account information:
:   Use this widget to see information about the Snowflake privacy policy, basic account information, or to sign out of this account in the
    clean rooms UI. For more detailed profile information, open the Profile & Features page.

## Analysis & Queries list page

This page shows the history of analyses run by this account in the clean rooms UI. Analyses run using the API are not shown here. You can
also use this page to run a new analysis of a specified type in an existing clean room.

To see the results of a given analysis, select the analysis from the list and examine the results as described in the
*analysis results page*. Each element in the list shows the analysis name, status, and which clean room the analysis was run from.

You can run a new analysis of a given type by selecting New Analysis & Query. Select the type of analysis you want to run to see a
list of clean rooms that are configured for that analysis type, and then configure and run the analysis.

## Analysis results page

The analysis results page shows the configuration and results of a single analysis. The following image shows the results of an untitled
Overlap & Segmentation analysis that is scheduled to run every day.

The analysis results are shown in the Results section of the page; you might need to scroll down to see them. Some analyses show a
simplified graph of results, with full tabular results downloadable by selecting Download. If activation is enabled for the
analysis, select Activate and [walk through the activation flow](activation.md) to specify who should get
the full results.

The page also shows the details of the analysis in the Query Configurations section. You can modify these values, and then select Run
to run the analysis again.

This page is included in a list of all your analyses in the Analysis & Queries list page.

To change or disable the schedule of a repeating query run, change the Schedule Run selector on the page to
your desired run schedule, or Off to turn off repeating run.

## Profile & Features page

This page is used to manage the company name and logo displayed to collaborators, as well as managing the list of third-party
[identity providers, data providers,](../connector-identity.md) and [activation providers](../connector-activation.md).

Use this page to manage the list of third-party provider connectors that are *enabled* for this account; you *don’t configure* connectors
or assign them to specific clean rooms here. You configure connectors in the Connectors page, and you enable connectors within individual clean rooms
during a clean room creation or editing flow.

The identity, data, and activation provider connector lists on this page show only connectors enabled for this account. To enable a new
connector, select Edit, then select the connector that you want to enable. After you enable a connector here, you must configure it in
the Connectors page.

You need the SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_PROFILE_AND_FEATURES application role to access this page.

## Snowflake Admin page

This page is where you perform the following administrative tasks:

* See your account identifiers.
* See the account cloud and region.
* See and manage the [service user](enable-clean-rooms-ui.md) that is used by the UI to perform clean room actions.
* See and manage account-wide features, such as [Cross-Cloud Auto-Fulfillment](enabling-laf.md),
  [external and Iceberg tables](../register-data.md), and scheduled repeating analyses.
* [Register data objects](../register-data.md) for use in clean rooms.

You need the ACCOUNTADMIN role to access this page.

## Collaborators page

Use this page to manage the list of consumers who can be invited to join a clean room using the clean rooms UI. (Clean rooms API users can
invite anyone with a Snowflake account.) Only collaborators listed here can be invited by a clean room provider in the UI.

This page has separate sections for managing collaborators with a Snowflake account and managing collaborations without a Snowflake account.
(Collaborators without a Snowflake account use a *managed account*.)

You need the SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_COLLABORATORS application role to access this page.

## Connectors page

This page is used to configure any identity, data, or activation connector enabled in the Profile & Features page. See the
on how to configure a provider in the clean rooms, see the documentation in the section under Third-party connectors.

You need the SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_CONNECTORS application role to access this page.

---
title: Code bundle specification
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-code-bundle.md
section: Clean Rooms
---

# Code bundle specification

This specification defines a bundle of one or more code functions or procedures that can be called by a template.

A code bundle spec can contain a combined maximum of 5 functions and procedures.

For examples of different kinds of code bundles, see [Example specs](resources-code-bundles.md).

Identifiers in the code bundle spec have the following general requirements:

* **Names**: Must be valid [Snowflake identifiers](../../sql-reference/identifiers-syntax.md) that start with a letter and contain only
  alphanumeric characters and underscores.
* **Quoted identifiers**: Double-quoted identifiers are supported for names with special characters.
* **Case sensitivity**: Unquoted identifiers are case-insensitive; quoted identifiers preserve case.

```yaml
api_version: 2.0.0              # Required: Must be "2.0.0"
spec_type: code_spec            # Required: Must be "code_spec"
name: <identifier>              # Required: Unique name of this code bundle.
version: <version_id>           # Required: Alphanumeric with underscores (max 20 chars)
description: <description_text> # Optional: Description (max 1,000 chars)

artifacts:                      # Optional: Staged files for import
  - alias: <identifier>         # One or more artifact items...
    stage_path: <stage_path>    # Required: Full stage path. See below for additional requirements.
    description: <description_text>  # Optional: Description (max 500 chars)
    content_hash: <sha256_hash>      # Optional: Lowercase SHA-256 hash for integrity verification

functions:                      # Required if no procedures defined
  - name: <identifier>          # One or more functions...
    type: UDF | UDTF            # Required: Function type
    language: PYTHON            # Required: Currently only PYTHON supported
    runtime_version: <python_version>  # Optional: Python runtime (3.10 - 3.14)
    handler: <handler>          # Required: Handler function
    arguments:                  # Optional: One or more function arguments
      - name: <arg_name>        # Argument name
        type: <sql_type>        # Snowflake SQL type of this argument
    returns: <sql_type>         # Required: Snowflake return type
    packages:                   # Optional: Package dependencies
      - <package_name>          # One or more package items...
    imports:                    # Optional: Artifact aliases to import
      - <artifact_alias>        # One or more import items...
    code_body: |                # Optional: Inline Python code (max 12 MB)
      <inline_python_code>
    description: <description_text>  # Optional: Description of this function.

procedures:                     # Required if no functions defined
  - name: <identifier>          # One or more procedure items...
    language: PYTHON            # Required: Currently only PYTHON supported
    runtime_version: <python_version>  # Optional: Python runtime version
    handler: <handler>          # Required: Handler function
    arguments:                  # Optional: One or more procedure arguments
      - name: <arg_name>        # Argument name
        type: <sql_type>        # Snowflake SQL type of this argument
    returns: <sql_type>         # Optional: Return type
    packages:                   # Optional: Package dependencies
      - <package_name>          # One or more package items...
    imports:                    # Optional: Artifact aliases to import
      - <artifact_alias>        # One or more import items...
    code_body: |                # Optional: Inline Python code
      # inline python_code ...
    description: <description_text>  # Optional: Description of this procedure.
```

`api_version`
:   The version of the Collaboration API used. Must be `2.0.0`.

`spec_type`
:   Specification type identifier. Must be `code_spec`.

`name: identifier`
:   A unique name for this code bundle spec within this registry. Must be a valid
    [Snowflake identifier](../../sql-reference/identifiers-syntax.md) with a maximum of 75 characters. This is used as the last name segment
    when calling the function in a template: `cleanroom.code_spec_name$function_name`

`version: version_id`
:   Custom version identifier. Must be alphanumeric with underscores, maximum 20 characters.

`description: description_text` (*Optional*)
:   A description of the code bundle spec (maximum 1,000 characters).

`artifacts` (*Optional*)
:   A list of staged files or packages that can be imported by your functions or procedures, and
    [optionally exposed via handler functions](resources-code-bundles.md). Maximum of 5 per spec.

    `alias: identifier`
    :   An alias for referencing this artifact in imports. When referencing this alias within this spec, use the bare alias name rather than
        `cleanroom.spec_name$alias`; that is, use the bare function name to reference another function in this spec.

    `stage_path: stage_path`
    :   Full stage path to the artifact file. For example, `@DB.SCHEMA.STAGE/path/file.whl`.

    * **The stage must be internal.** External stages aren’t supported.
    * **The stage must have DIRECTORY enabled**: The stage containing artifacts must have `DIRECTORY = TRUE` set.
    * **Stage path format**: Must follow `@[DB.]SCHEMA.STAGE/path/to/file.ext` format.
    * **No path traversal**: Stage paths can’t contain `..` or `\`.
    * **This artifact must exist**: The file must exist at the specified stage path when the code bundle is registered.
    * **The stage must have SNOWFLAKE_SSE server-side encryption enabled.** When creating or altering the stage, set
      `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')`.
    * **If you push, delete, or update a staged code file,** you must call `ALTER STAGE stage name REFRESH` to ensure that the
      collaboration has the latest information from the stage. Code updates are supported only before you register the code spec, as this is
      when the version is assigned and the hash checksum calculated.

    `description: description_text` (*Optional*)
    :   A description of the artifact (maximum 500 characters).

    `content_hash: sha256_hash` (*Optional*)
    :   Lowercase SHA-256 hash for integrity verification (64 hex characters).

`functions` (*Required if no procedures are defined*)
:   A list of UDF or UDTF definitions.

    `name: identifier`
    :   The function name to expose to the calling template. Must be a valid [Snowflake identifier](../../sql-reference/identifiers-syntax.md).

    `type`
    :   The function type. One of `UDF` or `UDTF`.

    `language`
    :   The function language. Currently only `PYTHON` is supported.

    `runtime_version: python_version` (*Optional*)
    :   Python runtime version to use. Supported versions: `3.10` to `3.14`.

    `handler: handler`
    :   The name of the handler function in the function code to call when `name` is called.

    `arguments` (*Optional*)
    :   Function arguments as a list of name-type pairs. Types must be valid Snowflake SQL types.

    `returns: sql_type`
    :   The return type. For UDFs, use a SQL type such as STRING or FLOAT. For UDTFs, use `TABLE(column_definitions)`.

    `packages` (*Optional*)
    :   A list of packages used by this code. This can be any of [these Anaconda Python packages](https://repo.anaconda.com/pkgs/snowflake/)
        or [these Snowpark API packages](demo-flows/snowpark.md). For example: `snowflake-snowpark-python`, `numpy`.

    `imports` (*Optional*)
    :   A list of artifacts to import. These must be aliases from the artifacts list in this spec.

    `code_body` (*Optional*)
    :   Inline Python code. Mutually exclusive with staged imports. Maximum size is 12 MB.

    `description: description_text` (*Optional*)
    :   A description of the function (maximum 500 characters).

`procedures` (*Required if no functions defined*)
:   A list of stored procedure definitions. Fields are similar to `functions`, except there is no `type` field.

---
title: Code bundles
source: https://docs.snowflake.com/en/user-guide/cleanrooms/resources-code-bundles.md
section: Clean Rooms
---

# Code bundles

Any collaborator can bundle custom Python Procedures, UDFs or UDTFs with collaboration templates. Templates in turn reference the bundled code to perform complex data actions in the collaboration.
Common usage includes machine learning or customized data manipulation within a query. Your uploaded code can
import and use packages from an [approved bundle of Python packages](https://repo.anaconda.com/pkgs/snowflake/)
and the [Snowpark API](demo-flows/snowpark.md).

Custom code can be called only via templates, and not directly.

> **Note:**
>
> Python is the only coding language supported for Code Bundles.

The following sections show you how to upload and use code bundles.

## Implementing custom code bundles

Here is how to upload and use a code bundle:

**The code submitter:**

1. Creates and registers the code by calling [REGISTER_CODE_SPEC](collaboration-api-reference.md).

   The code can be inline in the spec, or linked from a stage.
2. Creates a template that references the code bundle spec by ID in the template’s `code_specs` array. Add this field as a peer of the template and parameters fields as shown in this example:

   ```yaml
    parameters:
      - name: <parameter_name>
        description: <parameter_description>
        required: <true_or_false>
        default: <default_value>
        type: <data_type>

    code_specs:             # Optional: List of code bundles used by this template
    - <code_spec_id>        # One or more code spec IDs.

    template: |
      <template_content>
   ```
3. Registers the template and then links the template into the collaboration.

**The analysis runner:**

* Runs the template in the standard way by calling `RUN`.

> **Important:**
>
> Snowflake runs security checks on any uploaded bundles before deploying them into a clean room. If a security check fails, the template
> and its bundled code will not be deployed and available for use.

To confirm that a template with a code bundle is deployed and ready for use, take the following steps:

> 1. Find the name of the clean room application where you are trying to deploy the code bundle:
>
>    ```sqlexample
>    SHOW APPLICATIONS LIKE 'SFDCR_<collaboration name>';
>    ```
> 2. Check the `upgrade_state` value in the DESCRIBE APPLICATION response. When the upgrade state is COMPLETE, the security checks have
>    passed and the new template and bundle are available to use. Pass in the application name returned by the command in the previous step using SQL like the following example:
>    SQL code:
>
>    ```sqlexample
>    DESCRIBE APPLICATION <application name>
>    ```

### Create and register the code bundle spec

The first step in uploading custom code is to create and register the code bundle spec.

Custom functions are defined in a YAML code bundle spec. Each code bundle exposes one or more functions that can be called by a template. The code bundle spec can either include the code in the spec inline, or link to code that lives on a Snowflake stage.

A collaborator registers a spec by calling `REGISTRY.REGISTER_CODE_SPEC`, which returns the bundle ID.

After the template that references the code bundle is linked into the collaboration, that code bundle is visible to anyone in the collaboration who can access a template that links the code bundle. Call `VIEW_CODE_SPECS` to list accessible code bundles in a collaboration.

Anyone who can see a code bundle in a collaboration can see and use it in their own templates in that collaboration. Any inline code can be viewed by any member of the collaboration, but staged artifact code can not be viewed by collaborators. Collaborators need to ensure that the `content_hash` of the referenced artifacts match for code integrity verification.

The following code bundle spec that exposes a single Python UDF called `normalize_value`, which calls the `normalize` function defined in that spec:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_CODE_SPEC(
  $$
  api_version: 2.0.0
  spec_type: code_spec
  name: custom_udf
  version: v1
  functions:
    - name: normalize_value
      type: UDF
      language: PYTHON
      handler: normalize
      arguments:
        - name: value
          type: FLOAT
      returns: FLOAT
      code_body: |
        def normalize(value):
            return value / 100.0
  $$
);
```

### Create and register the calling template

After the code spec is registered, the collaborator then registers a template that uses this code bundle. To use a code bundle, add the bundle spec ID in the template’s `code_specs` field. Adding this template into the collaboration will also cause the code bundled to be available in the collaboration.

A template calls a custom function using the syntax `cleanroom.spec_name$function_name`. Note the literal `.` and `$` name scoping marks.

> **Note:**
>
> Use the spec name, not the spec ID, to reference a function in your template. This is so that you can quickly update the version of your code bundle without having to change all the references to it in your template.

In the following example, a template uses function `normalize_value` from the code bundle `custom_udf`:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: normalization_template
  version: v1
  type: sql_analysis
  code_specs:
    - custom_udf_v1  -- Imports the code bundle.
  template: |
    SELECT cleanroom.custom_udf$normalize_value(100)  -- Calls the UDF.
      AS normalized
        FROM {{ source_tables[0] }}
  $$
);
```

### Add the template to a collaboration

Add the template that calls your function to the collaboration in the standard way. For more information, see [Templates](resources-templates.md).

Snowflake validates and uploads to the collaboration when the calling template is added to a collaboration. The following example shows a request to add a template to an existing collaboration:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ADD_TEMPLATE_REQUEST(
  'my_collaboration',
  'normalization_template_v1',
  ['consumer']
);
```

> **Note:**
>
> Installing a template with a code bundle triggers a Snowflake security check, and issues a new patch of the underlying clean room. The template will not be available or usable until the process is complete and the patch is installed.
>
> To check the progress of the patch installation:
>
> 1. Find the name of the clean room application. Typically, this will be `SFDCR_<clean room name>`, but you can search to be sure:
>
>    ```sqlexample
>    -- Find the exact name of the clean room application.
>    SHOW APPLICATIONS LIKE 'SFDCR_%';
>    ```
> 2. Check the status of the patch install. Wait for `upgrade_state` is COMPLETE in the following query:
>
>    ```sqlexample
>    DESCRIBE APPLICATION SFDCR_<application name>;
>    ```

## Versioning your code

Every registered code spec must have a unique name + version across all registries in your account. A template loads a specific name and version of a code spec. If you want to create or consume a new version of your code, you must submit a new version of the template that references the new code version in the code_specs field. You do not need to change the template body. For example:

**Step 1:** Consume version 1 of the code bundle:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: normalization_template
  version: v1
  type: sql_analysis
  code_specs:
    - custom_udf_v1  -- Bundle ID includes the version number.
  template: |
    SELECT cleanroom.custom_udf$normalize_value(100)  -- Calls the UDF.
      AS normalized
        FROM {{ source_tables[0] }}
  $$
);
```

**Step 2:** Update and register the new version of your code bundle, and then update your template to use the new version:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: normalization_template
  version: v2        -- Update the template version.
  type: sql_analysis
  code_specs:
    - custom_udf_v2  -- Use the new code bundle.
  template: |
    SELECT cleanroom.custom_udf$normalize_value(100)  -- No change needed here.
      AS normalized
        FROM {{ source_tables[0] }}
  $$
);
```

Notice that function names do not include the version, so you do not need to change the calling code in the template body when you upload a new version of a function.

## Example specs

### Inline UDF with code body

A simple UDF with inline Python code:

```yaml
api_version: 2.0.0
spec_type: code_spec
name: string_utils
version: v1
description: String utility functions

functions:
  - name: clean_string
    type: UDF
    language: PYTHON
    runtime_version: "3.10"
    handler: clean
    arguments:
      - name: input_str
        type: STRING
    returns: STRING
    description: Removes leading/trailing whitespace and converts to lowercase
    code_body: |
      def clean(input_str):
          if input_str is None:
              return None
          return input_str.strip().lower()

  - name: extract_domain
    type: UDF
    language: PYTHON
    runtime_version: "3.10"
    handler: extract
    arguments:
      - name: email
        type: STRING
    returns: STRING
    description: Extracts domain from email address
    code_body: |
      def extract(email):
          if email is None or '@' not in email:
              return None
          return email.split('@')[1]
```

### UDTF (User-Defined Table Function)

This example YAML defines a UDTF that returns multiple rows:

```yaml
api_version: 2.0.0
spec_type: code_spec
name: tokenizer
version: v1
description: Text tokenization UDTF

functions:
  - name: tokenize_text
    type: UDTF
    language: PYTHON
    runtime_version: "3.10"
    handler: Tokenizer
    arguments:
      - name: text
        type: STRING
      - name: delimiter
        type: STRING
    returns: TABLE(token STRING, position INTEGER)
    description: Splits text into tokens and returns each with its position
    code_body: |
      class Tokenizer:
          def process(self, text, delimiter):
              if text is None:
                  return
              tokens = text.split(delimiter if delimiter else ' ')
              for i, token in enumerate(tokens):
                  yield (token.strip(), i)
```

### Staged artifact with wheel package

Be sure to read the [stage_path documentation requirements](spec-code-bundle.md) for linking to staged code in your code spec.

This example YAML uses a staged Python wheel package:

```yaml
api_version: 2.0.0
spec_type: code_spec
name: ml_scoring
version: v2
description: ML scoring functions using custom library

artifacts:
  - alias: ml_lib
    stage_path: "@MY_DB.PUBLIC.CODE_STAGE/libs/ml_scoring_lib-1.0.0-py3-none-any.whl"
    description: Custom ML scoring library
    content_hash: "a1b2c3d4e5f6..."

functions:
  - name: predict_score
    type: UDF
    language: PYTHON
    runtime_version: "3.10"
    handler: ml_scoring_lib.predictor.predict
    arguments:
      - name: features
        type: ARRAY
    returns: FLOAT
    packages:
      - numpy
      - scikit-learn
    imports:
      - ml_lib
    description: Predicts score using trained ML model
```

### Stored procedure

This example YAML defines a stored procedure for data processing:

```yaml
api_version: 2.0.0
spec_type: code_spec
name: data_processor
version: v1
description: Data processing procedures

procedures:
  - name: aggregate_metrics
    language: PYTHON
    runtime_version: "3.10"
    handler: process
    arguments:
      - name: table_name
        type: STRING
      - name: group_column
        type: STRING
    returns: STRING
    packages:
      - snowflake-snowpark-python
    description: Aggregates metrics by specified column
    code_body: |
      def process(session, table_name, group_column):
          df = session.table(table_name)
          result = df.group_by(group_column).count()
          result.write.mode("overwrite").save_as_table("aggregated_results")
          return f"Aggregated {df.count()} rows into aggregated_results"
```

### Multiple Python files as staged artifacts

Be sure to read the [stage_path documentation requirements](spec-code-bundle.md) for linking to staged code in your code spec.

This example YAML uses multiple staged Python source files:

```yaml
api_version: 2.0.0
spec_type: code_spec
name: analytics_suite
version: v3
description: Analytics suite with multiple modules

artifacts:
  - alias: utils
    stage_path: "@MY_DB.PUBLIC.CODE_STAGE/analytics/utils.py"
    description: Utility functions
  - alias: transformers
    stage_path: "@MY_DB.PUBLIC.CODE_STAGE/analytics/transformers.py"
    description: Data transformation functions
  - alias: validators
    stage_path: "@MY_DB.PUBLIC.CODE_STAGE/analytics/validators.py"
    description: Validation functions

functions:
  - name: transform_and_validate
    type: UDF
    language: PYTHON
    runtime_version: "3.10"
    handler: transformers.transform_validate
    arguments:
      - name: data
        type: OBJECT
    returns: OBJECT
    imports:
      - utils
      - transformers
      - validators
    description: Transforms and validates input data
```

---
title: Collaboration specification
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-collaboration.md
section: Clean Rooms
---

# Collaboration specification

Defines the high-level collaboration. The specification defines which analysis runners are invited, and for each analysis runner, which
data and templates they can access and run. Any templates or data offerings that are listed here must be registered before they’re included
in the collaboration specification.

The owner submits this specification by calling INITIALIZE.

**Schema:**

```yaml
api_version: 2.0.0              # Required: Must be "2.0.0"
spec_type: collaboration        # Required: Must be "collaboration"
name: <collaboration_name>      # Required: Unique name (max 75 chars)
version: <version_string>       # Optional: Version identifier (max 20 chars)
description: <collaboration_description>  # Optional: Description (max 1,000 chars)
owner: <owner_alias>            # Required: Alias of owner

collaborator_identifier_aliases:  # Required: Map aliases to account identifiers
  <alias_1>: <account_identifier_1>  # One or more alias mappings...

analysis_runners:               # Required: Who can run analyses
  <analysis_runner_alias>:      # One or more analysis runner definitions...
    data_providers:             # Required: Data providers for this runner
      <provider_alias>:         # One or more provider definitions...
        data_offerings:         # Required: List of offerings (can be empty [])
          - id: <data_offering_id>  # Zero or more data offering IDs...
    templates:                  # Optional: Templates this runner can use
      - id: <template_id>       # One or more template IDs...
    activation_destinations:    # Optional: Where results can be sent
      snowflake_collaborators:  # Optional: Collaborators who can receive results
        - <collaborator_alias>  # One or more collaborator aliases...
```

`api_version`
:   The version of the Collaboration API used. Must be `2.0.0`.

`spec_type`
:   Specification type identifier. Must be `collaboration`.

`name: collaboration_name`
:   User-friendly name for this collaboration. Must be unique in the creator’s account and follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) (maximum 75 characters).

`version` (*Optional*)
:   A version identifier for this collaboration (maximum 20 characters). Must follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md). A good format to use is *YYYY_MM_DD_V#*. For example: `2025_10_22_V1`.

`description: collaboration_description` (*Optional*)
:   A human-readable description of the collaboration (maximum 1,000 characters), for collaborators to read.

`owner: owner_alias`
:   Alias of the collaboration owner, as defined in `collaborator_identifier_aliases`.

`collaborator_identifier_aliases`
:   A mapping of collaborator aliases to their [Data Sharing Account Identifiers](../admin-account-identifier.md). Only users listed
    here can participate in the collaboration. Use the aliases defined here to refer to all collaborators, rather than using their data
    sharing account identifier directly. Must be unique in this collaboration and follow [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) (maximum 25 characters).

`analysis_runners`
:   Describes who can run an analysis in this collaboration. Each analysis runner is keyed by a unique alias. You must allow at least one
    account to run an analysis in this collaboration.

    `<analysis_runner_alias>`
    :   Alias of account that can run an analysis in this collaboration. Alias is defined in the `collaborator_identifier_aliases` list.

    `data_providers`
    :   Data providers whose data this analysis runner can access. Each provider is keyed by the alias that is defined in
        `collaborator_identifier_aliases`.

        `data_offerings`
        :   A list of data offerings from this data provider that the analysis runner can access, or an empty array `[]` as a placeholder so
            that data offerings can be added later. Each data offering is referenced by its ID, generated when the data provider calls
            REGISTER_DATA_OFFERING.

    `templates` (*Optional*)
    :   The templates that can be used by this analysis runner. Each template is referenced by its ID. You can omit this in the initial spec,
        and still share templates with this analysis runner after the collaboration is created.

    `activation_destinations` (*Optional*)
    :   Defines activation settings for the analysis results.

        `snowflake_collaborators` (*Optional*)
        :   List of collaborators who can receive activated analysis results. Use the alias from the `collaborator_identifier_aliases` list in
            this spec. All collaborators listed here must have the permissions described in [Implementing activation](activation.md).

## Examples

```yaml
api_version: 2.0.0
spec_type: collaboration
name: my_sample_collaboration
owner: Owner
collaborator_identifier_aliases:
  Owner: ENG.OWNER
  AnalysisRunner_1: ENG.CONSUMER_1
  DataProvider_1: ENG.PROVIDER_1
  DataProvider_2: ENG.PROVIDER_2
  AnalysisRunner_2: ENG.PROVIDER_3
analysis_runners:
  AnalysisRunner_1:
    data_providers:
      DataProvider_1:
        data_offerings:
        - id: DCR_PREPROD_CI_PROVIDER_ANY_NAME_ZUDFTMULHQ_iuDfn_v0
      DataProvider_2:
        data_offerings: []
    templates:
    - id: test_sca_three_party_template_JOaVG_v0
  AnalysisRunner_2:
    data_providers:
      DataProvider_2:
        data_offerings: []
    templates:
    - id: test_sca_three_party_template_JOaVG_v0
```

---
title: Collaborator roles in Collaboration Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/roles.md
section: Clean Rooms
---

# Collaborator roles in Collaboration Data Clean Rooms

## Overview of collaboration roles

Collaborators have one or more of the following *collaboration roles* in a clean room collaboration scenario. In this case,
a *collaboration role* is a set of capabilities, not an [RBAC role](manage-access.md):

* **Owner:** The owner defines, creates, and owns the collaboration, and defines which collaborators are invited and their collaboration
  roles. An owner isn’t automatically an analysis runner or a data provider, and doesn’t have any elevated run privileges. The owner’s
  main abilities are to create the clean room, assign collaboration roles, determine who can share data with whom, and tear down the
  clean room. A collaboration can have only one owner.
* **Data provider:** Provides data offerings, such as tables and views, to a collaboration, and specifies which analysis runners can
  use them. That is, account A is a data provider to accounts B and C, as specified in the collaboration specification.
* **Analysis runner:** Runs permitted templates on permitted data offerings, as specified by the collaboration specification.
  An analysis runner isn’t a data provider to themselves by default, unless specified in the collaboration specification.

One collaborator can have multiple collaboration roles in a collaboration, and multiple collaborators can have the same collaboration
role (except for the owner collaboration role, which is assigned to only one user). For example, the owner of a collaboration can
also be a data provider and an analysis runner.

The owner specifies all collaborators and their collaboration roles when they create the collaboration. Collaborators and their
collaboration roles can’t be changed after a collaboration is created. As a consequence, the following collaboration role assignments
are fixed after a collaboration is created:

* The owner can’t be changed.
* Analysis runners can’t be added or removed.
* The list of data providers for each analysis runner can’t be changed. If account A isn’t defined as a data provider for account B
  when the collaboration is created, account A can never be a data provider for account B.

However, collaborators can link or remove [resources](resources.md) after a collaboration is created.

## See your role

Call `GET_STATUS` to see your roles in a collaboration in the ROLES column:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);
```

If you want to see more details about your roles, for example, if you’re a data
provider and want to see whom you can share data with, you must examine the spec.
Here is how to see the collaboration spec in a single call after you have joined a collaboration:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS() ->>
  SELECT "COLLABORATION_SPEC" FROM $1
    WHERE "SOURCE_NAME" = $collaboration_name;
```

## Example

The following example shows a very basic collaboration that defines collaboration roles, but doesn’t include any resources.
You can create a collaboration with or without resources, and add or remove them later.

```yaml
api_version: 2.0.0
spec_type: collaboration
name: basic_collaboration
owner: alice
collaborator_identifier_aliases:
  alice: corp1.acct123
  bob: corp2.acctxyz
analysis_runners:
  alice:
    data_providers:
      alice:
        data_offerings: []
      bob:
        data_offerings: []
  bob:
    data_providers:
      alice:
        data_offerings: []
```

The previous collaboration defines the following collaborators and collaboration roles:

* `alice` is the collaboration owner, an analysis runner, and a data provider for `bob` and herself. `alice` is the alias
  defined in the collaboration for account `corp1.acct123`.
* `bob` is an analysis runner, and a data provider for `alice` but *not* for himself. `bob` is the alias defined in the
  collaboration for account `corp2.acctxyz`.

These collaboration roles can’t be modified, and new collaborators can’t be added, after the collaboration is created.

Data providers can link data offerings after a collaboration is created. Any collaborator can request to add templates after a collaboration
is created. The following example shows how you can use the Collaboration API to link resources into the previous collaboration after
it’s created:

```yaml
api_version: 2.0.0
spec_type: collaboration
name: basic_collaboration
owner: alice
collaborator_identifier_aliases:
  alice: corp1.acct123
  bob: corp2.acctxyz
analysis_runners:
  alice:
    data_providers:
      alice:
        data_offerings:
        - id: alice_data_1
        - id: alice_data_2
      bob:
        data_offerings:
        - id: bob_data_1
    templates:
    - id: template1  # Alice can run template1 using alice_data_1, alice_data_2, or bob_data_1.
  bob:
    data_providers:
      alice:
        data_offerings:
        - id: alice_data_1
    templates:
    - id: template2  # Bob can run template2 using data from alice_data_1, provided by alice.
```

The modified collaboration now supports the following resources and capabilities:

* `alice` can run analyses using `template1` with data from `alice_data_1`, `alice_data_2`, and `bob_data_1`.
* `bob` can run `template2` using data from `alice_data_1`.

---
title: Creating, joining, removing, and uninstalling clean rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/manage-clean-rooms.md
section: Clean Rooms
---

# Creating, joining, removing, and uninstalling clean rooms

This topic explains basic clean room actions using both the clean rooms API and the clean rooms UI.

## Create a new clean room

You must have the proper permission in a Snowflake account to be able to create a clean room. The clean room creator is called the
*provider*.

Clean rooms UIClean rooms API

The Clean Rooms page in the clean rooms UI lets you, as a provider, manage the lifecycle of a clean room, including creating
and sharing. If you don’t have access to the clean rooms UI, speak to a clean rooms administrator for your Snowflake account.

To create and share a clean room, do the following:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Clean Rooms.
3. Select + Clean Room. The creation process has the following steps:

   1. Use the Add Data step to name the clean room and select the tables that are being shared with the consumer. The name can
      be 80 characters maximum, case-insensitive a-z, 0-9, spaces, and underscores.
   2. Use the Specify Join Policies step to enable identity providers enabled by your clean rooms account administrator and
      select which columns the consumer can join on.
   3. Use the Configure Analysis & Query step to define which templates are available in the clean room, template-specific
      configuration settings, and addtional features such as activation and privacy settings.
   4. Use the Share Clean Room step to invite consumers to use the clean room to collaborate. You can also use the
      Enable Run Analysis & Query option to specify which collaborators can run analyses in the clean room.

For a full walkthrough of creating a new clean room in the clean rooms UI, try the
[clean rooms UI tutorial](v1/tutorials/cleanroom-web-app-tutorial.md)

To create a new clean room in code, you must be granted the SAMOOHA_APP_ROLE role in your account.

```sqlexample
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;
SET cleanroom_name = 'Developer Tutorial';
CALL samooha_by_snowflake_local_db.provider.cleanroom_init(
  $cleanroom_name,
  'INTERNAL');      -- Use EXTERNAL to share outside your Snowflake org
```

After creating your clean room, you must, at minimum, perform the following steps to configure a basic clean room:

1. Import data into the clean room.
2. Set join policies on your data.
3. Specify one or more templates in the clean room.
4. Set column policies on your data for each template.
5. Set a default release directive.
6. Specify consumers to share the clean room with.
7. Publish the clean room.

For a full walkthrough of creating a new clean room in code, try the
[clean rooms code tutorial](tutorials/cleanroom-api-tutorial-basic.md)

> **Note:**
>
> There is a limit to the number of (clean rooms + collaborators) that you can create in a single account. If you create too many test
> clean rooms, you might need to delete a few in order to create new clean rooms. If you need more clean rooms than your account can hold,
> contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Install (join) a clean room

If you have been invited to join a clean room, you will receive an email message with a link to install, configure, and run the clean room
in the clean rooms UI. You can follow the link and use the clean rooms UI, or install and run the clean room using the API.

Clean rooms UIClean rooms API

The Clean Rooms page in the clean rooms UI lets you, as a consumer, install clean rooms that have been shared with you by a provider.
To install a clean room, do the following:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Clean Rooms.
3. On the Invited tab, find the clean room and select Join. You should get a direct link to this page in an invitation
   email when you are added as a collaborator in the clean rooms UI.
4. Select the tables that you want to use to collaborate with the provider’s data, then select Next.
5. Select any identity providers available in your Clean Room environment that you need to use in this clean room.
6. Specify which columns in your table can be joined, and the corresponding columns from the provider’s data.
7. Select Next.
8. Provide template-specific settings for any templates assigned to the clean room.
9. Click Finish, and optionally run a template immediately, or schedule a repeating run of that template.

If you have been invited to join a clean room as a consumer, you can install, configure, and run the clean room in code.

To join a clean room in code, open up the account that was invited to add the clean room, and run the following code:

```sqlexample
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;
SET cleanroom_name = 'Developer Tutorial'; -- Get the actual clean room name and provider's account locator from the provider.
CALL samooha_by_snowflake_local_db.consumer.
  install_cleanroom($cleanroom_name, <PROVIDER_LOCATOR>);
```

After the clean room is installed, you must take the following steps, at minimum, to be able to run templates in that clean room:

1. Link your data.
2. Set join and column policies on your tables and for the templates that you want to run.
3. Run the template.

For a full walkthrough of joining a clean room in code, try the
[clean rooms code tutorial](tutorials/cleanroom-api-tutorial-basic.md)

> **Note:**
>
> Some clean rooms throw the following error when you try to join it:
>
> ```output
> Application role `SAMOOHA_BY_SNOWFLAKE.DCR_DELEGATED_CLEANROOM_ROLE` does not exist
> or not authorized.
> ```
>
> If you encounter this error, run the following code and try joining the clean room again:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> CALL SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.PREPARE_MOUNT_SCRIPT();
> EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
> ```

## Delete a clean room that you created

After deletion, a clean room will no longer be visible to shared users the next time they open the clean rooms UI. If an analysis is in
progress when a clean room is deleted, it might not complete before the clean room is deleted.

Clean rooms UIClean rooms API

To use the clean rooms UI to delete a clean room that you created, do the following:

> 1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
> 2. In the left navigation, select Clean Rooms.
> 3. In the clean room to delete, select *More* () > Delete.

> * **To delete a single clean room** using the API, call
>   [provider.drop_cleanroom](/user-guide/cleanrooms/provider).
> * **To list your created clean rooms,** call [provider.view_cleanrooms](/user-guide/cleanrooms/provider):
>
>   ```sqlexample
>   USE ROLE SAMOOHA_APP_ROLE;
>   USE WAREHOUSE app_wh;
>
>   -- List created and published clean rooms
>   CALL samooha_by_snowflake_local_db.provider.view_cleanrooms();
>   SELECT CLEANROOM_ID AS "cleanroom_name"
>     FROM TABLE(RESULT_SCAN(last_query_id()))
>     WHERE STATE = 'CREATED' AND IS_PUBLISHED = TRUE;
>
>   -- Specify a clean room name from the list and drop it
>   CALL samooha_by_snowflake_local_db.provider.drop_cleanroom($cleanroom_name);
>   ```

For a full walkthrough of creating, configuring, using, and deleting a clean room in code, try the
[clean rooms code tutorial](tutorials/cleanroom-api-tutorial-basic.md)

## Uninstall (unjoin) a clean room

You can uninstall a clean room that you installed (joined) as a consumer. This will uninstall the clean room for all users in the account.

Clean rooms UIClean rooms API

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Clean Rooms.
3. Navigate to Clean Rooms » Joined.
4. In the clean room to uninstall, select *More* () > Leave.

**To list your installed (joined) clean rooms**

Call [samooha_by_snowflake_local_db.consumer.view_cleanrooms](/user-guide/cleanrooms/consumer) and filter
rows to `IS_ALREADY_INSTALLED = TRUE`. This shows clean rooms that are installed rather than simply invitations to join.

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE app_wh;

CALL samooha_by_snowflake_local_db.consumer.view_cleanrooms();
SELECT CLEANROOM_ID AS "cleanroom_name"
  FROM TABLE(RESULT_SCAN(last_query_id()))
  WHERE IS_ALREADY_INSTALLED = TRUE;

CALL samooha_by_snowflake_local_db.consumer.uninstall_cleanroom($cleanroom_name);
```

**To uninstall (unjoin) a single clean room:**

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE app_wh;
CALL samooha_by_snowflake_local_db.consumer.
  uninstall_cleanroom($cleanroom_name).
```

For a full walkthrough of creating, configuring, using, and deleting a clean room in code, try the
[clean rooms code tutorial](tutorials/cleanroom-api-tutorial-basic.md)

## Adding or removing tables from a clean room

Here is how to add or remove (*link* or *unlink*) tables from a clean room:

Clean rooms UIClean rooms API

When using the UI, only tables or views registered by an administrator can be linked into a clean room. If you don’t see a table or view as available for use in your clean room, ask your administrator to register the object in your account.

* As a provider, you choose which tables to link into the clean room in the Add Data step when you create or edit a clean room.
* As a consumer, you choose which tables to link the clean room in the Add Data step when you join or edit a clean room.

Once a table is added to a clean room, it cannot be removed from that clean room. You can, however, remove the data from the entire account. If you need to remove a table or view from a clean room, speak to your clean room administrator.

When using the clean rooms API, anyone with the REFERENCE_USAGE privilege on a data object can register it in the account. After an object is registered, you can link it into any clean room in that account. (Only the account that registered an object can link it into a clean room.)

You cannot unlink a table or view after it has been linked into a clean room. However, you can unregister the table or view for the
entire account, making it unavailable to any clean room in that account. If you unregister a table, be sure to change any row or column
policies, or templates, that refer to that table.

[Learn how to register or unregister data objects](register-data.md) to make them available for linking
into a clean room.

---
title: Custom clean room template reference
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/custom-templates.md
section: Clean Rooms
---

# Custom clean room template reference

## About clean room templates

Clean room templates are written in [JinjaSQL](https://github.com/sripathikrishnan/jinjasql). JinjaSQL is an extension to the Jinja
templating language that generates a SQL query as output. This allows templates to use logic statements and run-time variable resolution to
let the user specify table names, table columns, and custom values used in the query at run time.

Snowflake provides some pre-designed templates for common use cases.
However, most users prefer to create custom query templates for their clean rooms. Custom templates are created using the clean rooms API, but can be run either in code or using the clean rooms UI.

There are two general types of templates:

* **Analysis templates**, which evaluate to a SELECT statement (or a set of SELECT operations) that show results to the template runner.
* **Activation templates**, which are used to activate results to a Snowflake account or a third-party, rather than showing results
  in the immediate environment. An activation template is very similar to an analysis template with a few extra requirements.

  In the clean rooms UI, an analysis template can be associated with an activation template to enable the caller to run an analysis, see
  results, and then activate data to themselves or a third party. The activation template does not need to resolve to the same query as the
  associated analysis template.

## Creating and running a custom template

In a clean room with default settings, the provider adds a template to a clean room and the consumer runs the template, as described in the [custom template usage documentation](../demo-flows/custom-templates.md).

### A quick example

Here is a simple SQL example that joins a provider and a consumer table by email and shows the overlap count per city:

```sqlexample
SELECT COUNT(*), city FROM consumer_table
  INNER consumer_table
  ON consumer_table.hashed_email = provider_table.hashed_email
  GROUP BY city;
```

Here is how that query would look as a JinjaSQL template that allows the caller to choose the JOIN and GROUP BY columns, as well as the tables used:

```sqlexample
SELECT COUNT(*), IDENTIFIER({{ group_by_col | column_policy }})
  FROM IDENTIFIER({{ my_table[0] }}) AS c
  INNER JOIN IDENTIFIER({{ source_table[0] }}) AS p
  ON IDENTIFIER({{ consumer_join_col | join_policy }}) = IDENTIFIER({{ provider_join_col | join_policy }})
  GROUP BY IDENTIFIER({{ group_by_col | column_policy }});
```

**Notes on the template:**

* Values within {{ double bracket pairs }} are custom variables. `group_by_col`, `my_table`, `source_table`,
  `consumer_join_col`, `provider_join_col`, and `group_by_col` are all custom variables populated by the caller.
* `source_table` and `my_table` are Snowflake-defined string array variables populated by the caller. Array members are
  fully-qualified names of provider and consumer tables linked into the clean room. The caller specifies which tables should be
  included in each array.
* Provider tables must be aliased as lowercase `p` and consumer tables as lowercase `c` in a template. If you have
  multiple tables, you can index them as `p1`, `p2`, `c1`, `c2`, and so on.
* IDENTIFIER is needed for all column and table names, because variables in {{ double brackets }} evaluate to string literals, which aren’t
  valid identifiers.
* JinjaSQL *filters* can be applied to variables to enforce any [join or column policies](policies.md) set by
  either side. Snowflake implements custom filters `join_policy` and `column_policy`, which
  verify whether a column complies with join or column policies in the clean room respectively, and fail the query if it does not. A
  filter is applied to a column name as `{{ column_name | filter_name }}`.

All these points will be discussed in detail later.

Here is how a consumer might run this template in code. Note how column names are qualified by the table aliases
declared in the template.

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.CONSUMER.RUN_ANALYSIS(
  $cleanroom_name,
  $template_name,
  ['my_db.my_sch.consumer_table],       -- Populates the my_table variable
  ['my_db.my_sch.provider_table'],      -- Populates the source_table variable
  OBJECT_CONSTRUCT(                     -- Populates custom named variables
    'consumer_join_col','c.age_band',
    'provider_join_col','p.age_band',
    'group_by_col','p.device_type'
  )
);
```

To be able to use this template in the clean rooms UI, the provider must
[create a custom UI form for the template](../provider.md). The UI form has named
form elements that correspond to template variable names, and the values provided in the form are passed into the template.

### Developing a custom template

Clean room templates are JinjaSQL templates. To create a template, you should be familiar with the following topics:

* [Jinja templating basics](https://jinja.palletsprojects.com/en/stable/)
* The [JinjaSQL extension to Jinja](https://github.com/sripathikrishnan/jinjasql).

Use the [consumer.get_jinja_sql](../consumer.md) procedure to test the validity of your template,
then run the rendered template to see that it produces the results that you expect. Note that this procedure doesn’t support clean room filter extensions, such as `join_policy`, so you must test your template without those filters, and add them later.

**Example:**

```sqlexample
-- Template to test
SELECT {{ col1 | sqlsafe }}, {{ col2 | sqlsafe }}
  FROM IDENTIFIER({{ source_table[0] }}) AS p
  JOIN IDENTIFIER({{ my_table[0] }}) AS c
  ON {{ provider_join_col | sqlsafe }} = {{ consumer_join_col | sqlsafe}}
  {% if where_phrase %} WHERE {{ where_phrase | sqlsafe}}{% endif %};

-- Render the template.
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.CONSUMER.GET_SQL_JINJA(
$$
SELECT {{ col1 | sqlsafe }}, {{ col2 | sqlsafe }}
  FROM IDENTIFIER({{ source_table[0] }}) AS p
  JOIN IDENTIFIER({{ my_table[0] }}) AS c
  ON IDENTIFIER({{ provider_join_col }}) = IDENTIFIER({{ consumer_join_col }})
  {% if where_phrase %} WHERE {{ where_phrase | sqlsafe }}{% endif %};
  $$,
  object_construct(
'col1', 'c.status',
'col2', 'c.age_band',
'where_phrase', 'p.household_size > 2',
'consumer_join_col', 'c.age_band',
'provider_join_col', 'p.age_band',
'source_table', ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
'my_table', ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS']
));
```

The rendered template looks like this:

```output
SELECT c.status, c.age_band
  FROM IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p
  JOIN IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS c
  ON p.age_band = c.age_band
  WHERE p.household_size > 2;
```

Try running the SQL statement above in your environment to see if it works, and gets the expected results.

Then test your template without a WHERE clause:

```sqlexample
-- Render the template without a WHERE clause
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.CONSUMER.GET_SQL_JINJA(
$$
SELECT {{ col1 | sqlsafe }}, {{ col2 | sqlsafe }}
  FROM IDENTIFIER({{ source_table[0] }}) AS p
  JOIN IDENTIFIER({{ my_table[0] }}) AS c
  ON {{ provider_join_col | sqlsafe }} = {{ consumer_join_col | sqlsafe}}
  {% if where_phrase %} WHERE {{ where_phrase | sqlsafe }}{% endif %};
  $$,
  object_construct(
'col1', 'c.status',
'col2', 'c.age_band',
'consumer_join_col', 'c.age_band',
'provider_join_col', 'p.age_band',
'source_table', ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
'my_table', ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS']
));
```

Rendered template:

```output
SELECT c.status, c.age_band
  FROM IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p
  JOIN IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS c
  ON p.age_band = c.age_band
  ;
```

Add the policy filters to the template, and add the template to your clean room:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    'simple_template',
    $$
    SELECT {{ col1 | sqlsafe | column_policy }}, {{ col2 | sqlsafe | column_policy }}
      FROM IDENTIFIER({{ source_table[0] }}) AS p
      JOIN IDENTIFIER({{ my_table[0] }}) AS c
      ON {{ provider_join_col | sqlsafe | join_policy }} = {{ consumer_join_col | sqlsafe | join_policy }}
      {% if where_phrase %} WHERE {{ where_phrase | sqlsafe }}{% endif %};
    $$,
);
```

### Data protection

Templates can access only datasets linked into the clean room by the provider and consumer.

Both the provider and consumer can set join, column, and activation policies on their data to protect which columns can be
joined on, projected, or activated; however, the template must include
the appropriate JinjaSQL policy filter on a column for the policy to be applied.

## Custom template syntax

Snowflake Data Clean Rooms supports V3 JinjaSQL, with a few extensions as noted.

This section includes the following topics:

### Template naming rules

When creating a template, names must be all lowercase letters, numbers, spaces, or underscores.
Activation templates (except for consumer-run provider activation) must have a name beginning with `activation_`. Template names are
assigned when you call `provider.add_custom_sql_template` or `consumer.create_template_request`.

**Example valid names:**

* `my_template`
* `activation_template_1`

**Example invalid names:**

* `my template` - Spaces not allowed
* `My_Template` - Only lowercase templates allowed

### Template variables

Template callers can pass in values to template variables. JinjaSQL syntax enables variable binding for any variable name
within {{ double_brackets }}, but Snowflake reserves a few variable names that you should not override, as described below.

> **Caution:**
>
> All variables, whether Snowflake-defined or custom, are populated by the user and should be treated with appropriate caution.
> Snowflake Data Clean Rooms templates must resolve to a single SELECT statement, but you should still remember that all variables are
> passed in by the caller.

#### Snowflake-defined variables

All clean room templates have access to the following global variables defined by Snowflake, but passed in by the caller:

`source_table`:
:   A zero-based string array of provider-linked tables and views in the clean room that can be used by the template. Table
    names are fully qualified, for example: `my_db.my_sch.provider_customers`

    **Example:** `SELECT col1 FROM IDENTIFIER({{ source_table[0] }}) AS p;`

`my_table`:
:   A zero-based string array of consumer tables and views in the clean room that can be used by the template. Table names are
    fully qualified, for example: `my_db.my_sch.consumer_customers`

    **Example:** `SELECT col1 FROM IDENTIFIER({{ my_table[0] }}) AS c;`

`privacy`:
:   A set of privacy-related values associated with users and templates.
    [See the list of available child fields](../differential-privacy.md).
    These values can be [set explicitly](../differential-privacy.md) for the user, but you might want to set default
    values in the template. Access the child fields directly in your template, such as `privacy.threshold`.

    **Example:** Here is an example snippet of a template that uses `threshold_value` to enforce a minimum group size in an aggregation
    clause.

    ```sqlexample-python
    SELECT
      IFF(a.overlap > ( {{ privacy.threshold_value | default(2)  | sqlsafe }} ),
                        a.overlap,1 ) AS overlap,
      c.total_count AS total_count
      ...
    ```

`measure_column`:

`dimensions`:

`where_clause`:
:   Legacy clean room global variables. They are no longer recommended for use, but are still defined and appear in some legacy templates
    and documentation, so you should not alias tables or columns using either of these names to avoid naming collisions.

    If your template uses `measure_column` or `dimensions`, the column policy is checked against any columns passed into these variables.

    If your template uses a `where_clause` that has a join condition (for example, `table1.column1 = table2.column2`), the join policy is checked against any columns named there; otherwise, the column policy is checked against any columns named there.

#### Custom variables

Template creators can include arbitrary variables in a template that can be populated by the caller. These variables can have any
arbitrary Jinja-compliant name except for the Snowflake-defined variables or table alias names. If you want your template to be usable
in the clean rooms UI, you must also provide a UI form for clean rooms UI users. For API users, you should provide good documentation for the
required and optional variables.

Custom variables can be accessed by your template, as shown here for the custom variable `max_income`:

```sqlexample
SELECT income FROM my_db.my_sch.customers WHERE income < {{ max_income }};
```

Users can pass variables to a template in two different ways:

* **In the clean rooms UI,** by selecting or providing values through a UI form created by the template developer. This UI form contains
  form elements where the user can provide values for your template. The name of the form element is the name of the variable. The template
  simply uses the name of the form element to access the value. Create the UI form using [provider.add_ui_form_customizations](../provider.md).
* **In code,** a consumer calls [consumer.run_analysis](../consumer.md) and passes in table names as argument arrays, and
  custom variables as name-value pairs into the `analysis_arguments` argument.

> **Note:**
>
> If you need to access user-provided values in any custom Python code uploaded to the clean room, you must
> explicitly pass variable values in to the code through Python function arguments; template
> variables are not directly accessible within the Python code using `{{jinja variable binding syntax}}`.

#### Resolving variables correctly

String values passed into the template resolve to a string literal in the final template. This can cause SQL parsing or logical errors if
you don’t handle bound variables appropriately:

* `SELECT {{ my_col }} FROM P;` - This resolves to `SELECT 'my_col' from P;` which simply returns the string “my_col” - probably not
  what you want.
* `SELECT age FROM {{ my_table[0] }} AS P;` - This resolves to `SELECT age FROM 'somedb.somesch.my_table' AS P;`, which causes a
  parsing error because a table must be an identifier, not a literal string.
* `SELECT age FROM IDENTIFIER({{ my_table[0] }}) AS P {{ where_clause }};` - Passing in “WHERE age < 50” evaluates to
  `SELECT age FROM mytable AS P 'WHERE age < 50';`, which is a parsing error because of the literal string WHERE clause.

Therefore, where appropriate, you must resolve variables. Here is how to resolve variables properly in your template:

Resolving table and column names
:   Variables that specify table or column names must be converted to identifiers in your template in one of two ways:

    * [IDENTIFIER](../../../sql-reference/identifier-literal.md): For example: `SELECT IDENTIFIER({{ my_column }}) FROM P;`
    * [sqlsafe](https://github.com/sripathikrishnan/jinjasql?tab=readme-ov-file#sql-safe-strings): This JinjaSQL filter resolves identifier
      strings to SQL text. An equivalent statement to the previous bullet is `SELECT {{ my_column | sqlsafe }} FROM P;`

    Your particular usage dictates when to use IDENTIFIER or `sqlsafe`. For example, `c.{{ my_column | sqlsafe }}` can’t easily be
    rewritten using IDENTIFIER.

Resolving dynamic SQL
:   When you have a string variable that should be used as literal SQL, such as a WHERE clause, use the `sqlsafe` filter in your template.
    For example:

    ```sqlexample
    SELECT age FROM IDENTIFIER({{ my_table[0] }}) AS C WHERE {{ where_clause }};
    ```

    If a user passes in “age < 50” to `where_clause`, the query would resolve to `SELECT age FROM sometable AS C WHERE 'age < 50';`
    which is invalid SQL because of the literal string WHERE condition. In this case you should use the `sqlsafe` filter:

    ```sqlexample
    SELECT age FROM IDENTIFIER( {{ my_table[0] }} ) as c {{ where_clause | sqlsafe }};
    ```

### Required table aliases

At the top level of your query, all tables or subqueries must be aliased as either `p` (for provider-tables) or `c` (for consumer
tables) in order for Snowflake to validate join and column policies correctly in the query. Any column that must be verified against join
or column policies must be qualified with the lowercase `p` or `c` table alias. (Specifying `p` or `c` tells the back end
whether to validate a column against the provider or the consumer policy respectively.)

If you use multiple provider or consumer tables in your query, add a numeric, sequential 1-based suffix to each table alias after the
first. So: `p`, `p1`, `p2`, and so on for the first, second, and third provider tables, and `c`, `c1`, `c2`, and so on for the
first, second, and third consumer tables. The `p` or `c` index should be sequential without gaps (that is, create the aliases `p`,
`p1`, and `p2`, not `p`, `p2`, and `p4`).

**Example**

```sqlexample
SELECT p.col1 FROM IDENTIFIER({{ source_table[0] }}) AS P
UNION
SELECT p1.col1 FROM IDENTIFIER({{ source_table[1] }}) AS P1;
```

### Custom clean room template filters

Snowflake supports all the [standard Jinja filters](https://jinja.palletsprojects.com/en/stable/templates/#builtin-filters) and most of
the standard
[JinjaSQL filters](https://github.com/search?q=repo%3Asripathikrishnan%2Fjinjasql+self.env.filters+path%3Ajinjasql%2Fcore.py&type=code&path+jinjasql%2Fcore.py=),
along with a few extensions:

* `join_policy`: Succeeds if the column is in the join policy of the data owner; fails otherwise.
* `column_policy`: Succeeds if the column is in the column policy of the data owner; fails otherwise.
* `activation_policy`: Succeeds if the column is in the activation policy of the data owner; fails otherwise.
* `join_and_column_policy`: Succeeds if the column is in the join or column policy of the data owner; fails otherwise.
* The `identifier` JinjaSQL filter is **not supported** by Snowflake templates.

> **Tip:**
>
> JinjaSQL statements are evaluated left to right:
>
> * `{{ my_col | column_policy }}` **Correct**
> * `{{ my_col | sqlsafe | column_policy }}` **Correct**
> * `{{ column_policy | my_col }}` **Incorrect**
> * `{{ my_col | column_policy | sqlsafe }}` **Incorrect:** `column_policy` will be checked against the `my_col` value as string,
>   which is an error.

### Enforcing clean room policies

Clean rooms do not automatically check clean room policies against columns used in a template. If you want to enforce a policy against a
column:

* You must apply the appropriate policy filter to that column in the template. For example:

```sqlexample
JOIN IDENTIFIER({{ source_table[0] }}) AS p
  ON IDENTIFIER({{ c_join_col | join_policy }}) = IDENTIFIER({{ p_join_col | join_policy }})
```

* You must alias the table as lowercase p or c. See Required table aliases.

Policies are checked only against columns owned by other collaborators; policies are not checked for your own data.

Note that column names cannot be ambiguous when testing policies. So if you have columns with the same
name in two tables, you must qualify the column name in order to test the policy against that column.

### Running custom Python code

Templates can run Python code uploaded to the clean room. The template can call a Python function that
accepts values from a row of data and returns values to use or project in the query.

* When a **provider** uploads custom Python code into a clean room, the template calls Python functions with the syntax
  `cleanroom.function_name`. [More details here.](../demo-flows/custom-code.md)
* When a **consumer** uploads custom Python code into a clean room, the template calls the function with the bare `function_name` value
  passed to `consumer.generate_python_request_template` (not scoped to `cleanroom` as provider code is). [More details here.](../demo-flows/custom-code.md)

**Provider code example:**

```sqlexample-python
-- Provider uploads a Python function that takes two numbers and returns the sum.
CALL samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
  $cleanroom_name,
  'simple_addition',                        -- Function name to use in the template
  ['someval integer', 'added_val integer'], -- Arguments
  [],                                       -- No packages needed
  'integer',                                -- Return type
  'main',                                   -- Handler for function name
  $$

def main(input, added_val):
  return input + int(added_val)
    $$
);

-- Template passes value from each row to the function, along with a
-- caller-supplied argument named 'increment'
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    'simple_python_example',
$$
    SELECT val, cleanroom.simple_addition(val, {{ increment | sqlsafe }})
    FROM VALUES (5),(8),(12),(39) AS P(val);
$$
);
```

## Security considerations

A clean room template is not executed with the identity of the current user.

The user does not have direct access to any data within the clean room; all access is through the native application via the template
results.

Apply a policy filter any time a column is used in your template to ensure that your policies, and the policies of all collaborators, are
respected.

Wrap user-provided variables with IDENTIFIER() when possible to strengthen your templates against SQL injection attacks.

## Activation templates

A template can also be used to save query results to a table outside of the clean room; this is called *activation*. Currently the only
forms of activation supported for custom templates are provider activation and consumer activation (storing results to the provider or
consumer’s Snowflake account, respectively). [Learn how to implement activation.](../activation.md)

An activation template is an analysis template with the following additional requirements:

* Activation templates are JinjaSQL statements that evaluate to a SQL script block, unlike analysis templates, which can be simple SELECT
  statements.
* Activation templates create a table in the clean room to store results, and return the table name (or a fragment of the name) to the
  template caller.
* The script block should end with a RETURN statement that returns the name of the generated table, minus any
  `cleanroom.` or `cleanroom.activation_data_` prefix.
* The name of the template, the name of the internal table that the template creates, and the table name the template returns follow these
  patterns:

| Activation type | Template name prefix | Table name prefix | Returned table name |
| --- | --- | --- | --- |
| Consumer-run consumer | `activation_` | `cleanroom.activation_data_*` | Table name without prefix |
| Consumer-run provider | No prefix required | `cleanroom.activation_data_*` | Table name without prefix |
| Provider-run provider | `activation_` | `cleanroom.temp_result_data` is the full table name. | `temp_result_data` |

* Any columns being activated must be listed in the [activation policy](policies.md) of the provider or consumer
  who linked the data, and should have the `activation_policy` filter applied to it. Note that a column can be both an activation and
  a join column.
* If the template is to be run from the clean rooms UI, you should
  [provide a web form](../demo-flows/custom-templates.md) that includes the `activation_template_name` and
  `enabled_activations` fields. Templates for use in the UI must have both an analysis template and an associated activation template.
* All calculated columns must be explicitly aliased, rather than having inferred names, because a table is being generated. That is:

  > `SELECT COUNT(*), p.status from T AS P;` FAILS, because the COUNT column name is inferred.
  >
  > `SELECT COUNT(*) AS COUNT_OF_ITEMS, p.status from T AS P;` SUCCEEDS, because it explicitly aliases the COUNT column.

Here are two sample basic activation templates. One is for provider-run server activation, the other is for other activation types. They
differ in the two highlighted lines, which contain the results table name.

Provider-run provider activation templateOther activation templates

Table must be named `cleanroom.temp_result_data`:

```sqlexample
BEGIN
  CREATE OR REPLACE TABLE cleanroom.temp_result_data AS
    SELECT COUNT(c.status) AS ITEM_COUNT, c.status, c.age_band
      FROM IDENTIFIER({{ my_table[0] }}) AS c
    JOIN IDENTIFIER({{ source_table[0] }}) AS p
      ON {{ c_join_col | sqlsafe | activation_policy }} = {{ p_join_col | sqlsafe | activation_policy }}
    GROUP BY c.status, c.age_band
    ORDER BY c.age_band;
  RETURN 'temp_result_data';
END;
```

Table name needs prefix `cleanroom.activation_data`:

```sqlexample
BEGIN
  CREATE OR REPLACE TABLE cleanroom.activation_data_analysis_results AS
    SELECT COUNT(c.status) AS ITEM_COUNT, c.status, c.age_band
      FROM IDENTIFIER({{ my_table[0] }}) AS c
    JOIN IDENTIFIER({{ source_table[0] }}) AS p
      ON {{ c_join_col | sqlsafe | activation_policy }} = {{ p_join_col | sqlsafe | activation_policy }}
    GROUP BY c.status, c.age_band
    ORDER BY c.age_band;
  RETURN 'analysis_results';
END;
```

## Next steps

After you’ve mastered the templating system, read the specifics for implementing a clean room with your template type:

* [Provider templates](../demo-flows/provider-run-analysis.md) are templates written by the provider. This is the
  default use case.
* [Consumer templates](../demo-flows/custom-templates.md) are templates written by the consumer. In some cases, a
  clean room creator wants to enable the consumer to create, upload, and run their own templates to the clean room.
* [Activation templates](../activation.md) create a results table after a successful run.
  Depending on the activation template, the results table can either be saved to the provider or consumer’s account outside the clean room,
  or sent to a third-party activation provider listed in the Activation Hub.
* [Chained templates](../developer-template-chains.md) allow you to chain together multiple templates where the
  output of each template is used by the next template in the chain.

## More information

* [Jinja documentation](https://jinja.palletsprojects.com/en/stable/)
* [JinjaSQL documentation](https://github.com/sripathikrishnan/jinjasql)

---
title: Data Clean Rooms Developer Guide
source: https://docs.snowflake.com/en/user-guide/cleanrooms/developer-guide.md
section: Clean Rooms
---

# Data Clean Rooms Developer Guide

This topic provides guidelines for users who want to create or manage Snowflake Data Clean Rooms collaborations programmatically.

## Development tools

These are the main developer tools for Snowflake Data Clean Rooms collaborations:

* **Coding environment:** Any coding environment that can run stored procedures in your Snowflake account will work. Most developers use
  worksheets in Snowsight (the browser-based tool) or the [Snowflake CLI](../../developer-guide/snowflake-cli/index.md).
* **Cortex Code:** Data Clean Room procedures are also available in an agentic experience via
  [Cortex Code](../cortex-code/cortex-code.md).

## Setting up your environment

Here are some tips for setting up your coding environment to use the Snowflake Data Clean Rooms API effectively.

### Using the Collaboration API

Snowflake provides the Data Clean Rooms Collaboration API to create and manage collaborations. This API consists of stored procedures that can be run
in any environment that can access your Snowflake account. This includes Snowsight notebooks, workspaces, worksheets and the
[Snowflake CLI](../../developer-guide/snowflake-cli/index.md).

The documentation here shows SQL usage, but you can also use Python or
[any other supported Snowflake language](../../developer-guide/stored-procedure/stored-procedures-overview.md).

You can grant users access to the complete API or a subset of it through
specific [DCR privileges](manage-access.md).

> **Note:**
>
> You need proper [DCR privileges](manage-access.md) to use the Collaboration API. You can grant limited access to specific procedures for sub-groups of users through [fine-grained role-based access control](manage-access.md).
>
> The SAMOOHA_APP_ROLE has pre-configured access to the entire API.

### Choosing a warehouse

You must use the Collaboration API in a warehouse that your role has the USAGE privilege on. `APP_WH` is one of a
[number of warehouses](v1/installation-details.md) that you can use. Choose the appropriate
warehouse for your needs.

Any standard warehouse works for general collaboration editing, creation, or deletion commands. Consider
using larger warehouses, or Snowpark-optimized warehouses, when running large analyses, such as machine learning workloads. If you use a
Snowpark-optimized warehouse for reviewing or joining a collaboration, make sure [MAX_CONCURRENCY_LEVEL](../../sql-reference/parameters.md) is set to a value equal to or greater than 2.

### Setting up testing accounts

You should have at least two separate accounts in which you have full coding access, to be able to develop and test multi-party
collaborations.

Depending on your use case, you might also want a Snowflake test account in a different cloud hosting region to test
[cross-cloud behavior](laf.md).

Name your test Snowflake accounts meaningfully to indicate their typical usage: for example, “Cross-cloud account” or “Standard Edition
account.” This can help when you have multiple test accounts and must choose an account on the clean
rooms login page.

## References and resources

The following topics are useful for Snowflake Data Clean Room developers.

* **Reference topics:**

  + [Snowflake Data Clean Rooms Collaboration API](collaboration-api-reference.md)
  + [Data Clean Rooms Schema Reference](spec-reference.md)
  + [Design custom templates](custom-templates.md)
  + [Items installed with the Snowflake Data Clean Room environment](installed-artifacts.md): What objects are installed with the SDCR environment.
* **Sample data:**

  + The Snowflake DCR environment installs [a few sample datasets](installed-artifacts.md) that you can use.
  + You can also [generate synthetic test data](../synthetic-data.md) using Snowflake.
* **Troubleshooting:** See the [data clean room troubleshooting guide](v2/troubleshooting.md) for tips.
* **Useful collaboration metadata:** See the [metadata cheat sheet](collaboration-api-reference.md) to learn how to get
  useful metadata about a collaboration, such as whether a collaborator (including yourself) has installed a given collaboration.
* **See your API query history:** To see a history of Collaboration API (or other) calls that you’ve made:

  1. Sign in to [Snowsight](../ui-snowsight-gs.md).
  2. In the navigation menu, select Monitoring » Query History.
  3. Use the filters to find the query associated with the analysis, and select the query or analysis.
* **Feature examples:** To help you understand how to use various features of the Collaboration API, you can refer to the examples in the
  Use cases and Key concepts & features sections of the Snowflake DCR documentation.
* **Additional examples and videos:** For additional code examples, tutorials, and videos, see
  [Sample Notebooks and Worksheets](tutorials-and-samples.md).

---
title: Data Clean Rooms Schema Reference
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-reference.md
section: Clean Rooms
---

# Data Clean Rooms Schema Reference

This topic describes the specification schema for all collaboration resources. Specifications are shown in YAML format.

Specifications have a schema version field `api_version`. Use the API version number shown here; support for earlier schema versions
isn’t guaranteed.

**Current DCR Collaboration API version:** 2.0.0

## Resource specifications

* [Collaboration specification](spec-collaboration.md): Defines the high-level collaboration, including which
  analysis runners are invited, and for each runner, which data and templates they can access.
* [Data offering specification](spec-data-offering.md): Defines a set of tables that a data provider shares with
  analysis runners, including sharing rules, policies, column formats, and analysis types.
* [Template specification](spec-template.md): Defines a single template in a collaboration, including
  parameters, code bundles, and the JinjaSQL template content.
* [Analysis specification](spec-analysis.md): Specifies the information that analysis runners need to
  run an analysis, including which template, tables, and variable values to use.
* [Code bundle specification](spec-code-bundle.md): Defines a bundle of one or more code functions or procedures
  that can be called by a template.

---
title: Data offering specification
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-data-offering.md
section: Clean Rooms
---

# Data offering specification

Defines a set of tables that a provider is willing to share with analysis runners, as well as sharing rules, such as policies, column
formats, and whether the table must be used with a template.

The data provider submits this specification by calling REGISTER_DATA_OFFERING, which returns an offering ID that can be used in the
collaboration specification.

A data offering won’t be available in a collaboration until the account that registered the data offering joins the collaboration.

You must have the REGISTER DATA OFFERING account privilege to join any collaboration in which you can activate data; that is, you are an
analysis runner and the collaboration specification includes an `activation_destinations` field. For more information, see the
[access management API reference guide](collaboration-api-reference.md).

**Schema:**

```yaml
api_version: 2.0.0              # Required: Must be "2.0.0"
spec_type: data_offering        # Required: Must be "data_offering"
name: <data_offering_name>      # Required: Unique name (max 75 chars)
version: <version_string>       # Required: Version identifier (max 20 chars)
description: <data_offering_description>  # Optional: Description (max 1,000 chars)

datasets:                       # Required: Tables to share
  - alias: <dataset_name>       # One or more dataset items...
    data_object_fqn: <database.schema.table_name>  # Required: Fully-qualified table name
    allowed_analyses: <allowed_analysis_type>      # Required: template_only or template_and_freeform_sql
    object_class: <object_class>    # Optional: ads_log or custom
    schema_and_template_policies:   # Required: Column definitions
      <column_name>:                # One or more column definitions...
        category: <category_type>   # Required: join_standard, join_custom, timestamp, passthrough, or event_type
        column_type: <format_type>  # Required for join_standard category, omitted for other categories.
        activation_allowed: <true_or_false>  # Optional: Whether column can be used for activation
    freeform_sql_policies:      # Optional: Policies for freeform SQL queries
      aggregation_policy:       # Optional: Single aggregation policy
        name: <fully_qualified_policy_name>
        entity_keys:            # Optional: Entity key columns
          - <column_name>       # One or more POSSIBLY RENAMED column names...
      join_policy:              # Optional: Single join policy
        name: <fully_qualified_policy_name>
        columns:                # Optional: Columns this policy applies to
          - <column_name>       # One or more POSSIBLY RENAMED column names...
      masking_policies:         # Optional: Masking policies
        - name: <fully_qualified_policy_name>  # One or more masking policy items...
          columns:              # Optional: Columns this policy applies to
            - <column_name>     # One or more POSSIBLY RENAMED column names...
      projection_policies:      # Optional: Projection policies
        - name: <fully_qualified_policy_name>  # One or more projection policy items...
          columns:              # Optional: Columns this policy applies to
            - <column_name>     # One or more POSSIBLY RENAMED column names...
      row_access_policy:        # Optional: Row access policy
        name: <fully_qualified_policy_name>
        columns:              # Optional: Columns this policy applies to
          - <column_name>     # One or more POSSIBLY RENAMED column names...
    require_freeform_sql_policy: <true_or_false>  # Optional: Require a policy for freeform SQL
```

`api_version`
:   The version of the Collaboration API used. Must be `2.0.0`.

`spec_type`
:   Specification type identifier. Must be `data_offering`.

`name: data_offering_name`
:   A name for a set of tables and columns to expose to collaborators. This name is used as the data offering reference value in a
    collaboration specification. You can create multiple data offerings with overlapping tables and columns for different use cases. Must follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) with a maximum of 75 characters and be unique within your Snowflake
    data clean room account.
    The `name_version` pair must be unique for all data offerings in this account.

`version`
:   A custom version identifier for this data offering specification (maximum 20 characters). Must follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md). The version string is given its own column in the response to
    VIEW_DATA_OFFERINGS and VIEW_REGISTERED_DATA_OFFERINGS, so use a value that can be sorted by increasing value. Example: `V0`

`description: data_offering_description` (*Optional*)
:   A description of the data offering (maximum 1,000 characters).

`datasets`
:   A list of one or more datasets to make available to the collaboration.

    `alias: dataset_name`
    :   A name for this data object, used in `collaboration.run`. Must follow
        [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) and be unique within this offering. Maximum 75 characters.

    `data_object_fqn: fully_qualified_table_name`
    :   Describes a single table available to collaborators. Provide the fully-qualified name of the source object in your account
        (`database.schema.table_name`). Maximum length is 773 characters.

    `allowed_analyses: allowed_analysis_type`
    :   The type of analyses that collaborators can run against this table. Required field with the following values:

        * `template_only`: The analysis runner can query this table only by using a template listed in the collaboration specification.
        * `template_and_freeform_sql`: The analysis runner can query this table by using either a template listed in the collaboration
          specification, or by using [free-form SQL queries](demo-flows/basic-multiparty-collab.md) in a code environment.

    `object_class` (*Optional*)
    :   The type of object. One of the following values:

        * `ads_log`: The tables and columns listed here must fit the ad log requirements.
        * `custom`: A custom set of tables and columns that doesn’t have any special requirements.

    `schema_and_template_policies`
    :   Provide a list of column names from the table listed by `data_object_fqn` and define the policies and format of each column. Only
        columns listed here are available to collaborators. Each column has the following descriptors:

        `category: category_type`
        :   The category determines whether any column renaming is applied, and any data format enforcement that should be applied.
            `category` and `column_type` [determine the column name exposed to the analysis runner](resources-data-offerings.md).
            The following values are supported:

            * `join_standard`: This is a joinable column with data in a format specified in the `column_type` field. This column is
              renamed to the `column_type` value in the shared data offering. This column is added to the clean room’s
              [join policy](resources-data-offerings.md).
            * `join_custom`: This is a joinable column in any format. Use this when there isn’t an appropriate `column_type` for your
              join column. The original column name is used in the shared data offering. This column is added to the clean room’s
              [join policy](resources-data-offerings.md).
            * `timestamp`: This is a projectable column that specifies a timestamp for any event. The column is renamed as `timestamp` in
              the shared data offering.
            * `passthrough`: This is a projectable column of any other type. The original column name is used in the shared data offering.
            * `event_type`: This is a projectable column that records an event type classification for this row, for example: “purchase”,
              “sign-up”, “impression”, “click”, and so on.

        `column_type: <format_type>` (*Required when category=join_standard, ignored for other category types*)
        :   The format of the data. If the data doesn’t conform to this format, your call to REGISTER_DATA_OFFERING will fail. Provide this field
            for columns where `category = join_standard`. `category` and `column_type`
            [determine the column name exposed to the analysis runner](resources-data-offerings.md). You can’t assign the same
            `column_type` value to multiple columns in the same table. The following format types are supported:

            * `email`: A raw email address.
            * `hashed_email_sha256`: A SHA256 hashed email.
            * `hashed_email_b64_encoded`: A base64-encoded hashed email.
            * `phone`: A phone number without punctuation. For example: `2015551212`.
            * `hashed_phone_sha256`: A SHA256 hashed phone number. The original number should be in the `phone` format.
            * `hashed_phone_b64_encoded`: A base64-encoded hashed phone number.
            * `device_id`: A raw device ID, such as a mobile advertising ID or a CTV device ID.
            * `hashed_device_id_sha256`: SHA256 hashed device ID. The original should be in the `device_id` format.
            * `hashed_device_b64_encoded`: A base64-encoded hashed device ID.
            * `ip_address`: A raw IP address in IPv4 format.
            * `hashed_ip_address_sha256`: SHA256 hashed IPv4 address. The original should be in the `ip_address` format.
            * `hashed_ip_address_b64_encoded`: A base64-encoded hashed IP address.
            * `first_name`: A raw first name.
            * `hashed_first_name_sha256`: A SHA256 hashed first name. The original should be in the `first_name` format.
            * `hashed_first_name_b64_encoded`: A base64-encoded hashed first name.
            * `last_name`: A raw last name.
            * `hashed_last_name_sha256`: A SHA256 hashed last name. The original should be in the `last_name` format.
            * `hashed_last_name_b64_encoded`: A base64-encoded hashed last name.

        `activation_allowed` (*Optional*)
        :   Whether this column can be used for activation purposes. Default is `false`.

> `freeform_sql_policies` (*Optional*)
> :   If `allowed_analyses` is `template_and_freeform_sql`, this optional field lists any Snowflake policies that should be applied
>     in free-form SQL queries run on this data offering. For more information, see [Apply the Snowflake policy to the data offering (free-form query usage only)](resources-data-offerings.md).

> > The following types are supported:
> >
> > `aggregation_policy` (*Optional*)
> > :   A single [aggregation policy](../aggregation-policies.md) configuration.
> >
> >     * `name`: The fully-qualified policy name.
> >     * `entity_keys` (*Optional*): List of column names that serve as entity keys for the aggregation policy. NOTE: if these columns
> >       have been [renamed](resources-data-offerings.md), you must use the generated column name.
> >
> > `join_policy` (*Optional*)
> > :   A single [join policy](../join-policies.md) configuration.
> >
> >     * `name`: The fully-qualified policy name. NOTE: if this column has been [renamed](resources-data-offerings.md), you
> >       must use the generated column name.
> >     * `columns` (*Optional*): List of column names this policy applies to.
> >
> > `masking_policies` (*Optional*)
> > :   An array of [masking policy](../security-column-intro.md) configurations.
> >
> >     * `name`: The fully-qualified policy name. NOTE: if this column has been [renamed](resources-data-offerings.md), you
> >       must use the generated column name.
> >     * `columns` (*Optional*): List of column names this policy applies to.
> >
> > `projection_policies` (*Optional*)
> > :   An array of [projection policy](../projection-policies.md) configurations.
> >
> >     * `name`: The fully-qualified policy name. NOTE: if this column has been [renamed](resources-data-offerings.md), you
> >       must use the generated column name.
> >     * `columns` (*Optional*): List of column names this policy applies to.
> >
> > `row_access_policy` (*Optional*)
> > :   An object that describes a [row access policy](../security-row-intro.md) configuration.
> >
> >     * `name`: The fully-qualified policy name. NOTE: if this column has been [renamed](resources-data-offerings.md), you
> >       must use the generated column name.
> >     * `columns` (*Optional*): List of column names this policy applies to.
>
> `require_freeform_sql_policy` (*Optional*)
> :   Whether this data source must define `freeform_sql_policies`. This is used as a failsafe to prevent linking a data source
>     that supports free-form SQL queries without assigning policies to it.

---
title: Data offerings
source: https://docs.snowflake.com/en/user-guide/cleanrooms/resources-data-offerings.md
section: Clean Rooms
---

# Data offerings

A *data offering* is a set of one or more views, called *datasets*, shared with specific analysis runners in a collaboration. You can share
data with analysis runners for whom you are defined as a data provider in the [collaboration specification](spec-collaboration.md).

A data offering is a live view of the source data, not a snapshot of the data at the time the data offering is registered. Any Snowflake
policies applied to the source data are active in the data offering.

When you register a data offering, Snowflake creates a view for each data source listed in the
[data offering specification](spec-data-offering.md). The view includes *only* the columns listed in the data offering
specification. Certain columns, depending on their category, are subject to renaming at this stage.

Additionally, when you link a data offering into a collaboration, Snowflake creates a copy of the registered view and limits access to the view to specified
analysis runners according to the [collaboration specification](spec-collaboration.md).

> **Important:**
>
> If you move, rename, or change access permissions to the underlying tables, the data offering will become unusable through any previously
> registered links.

If you use Snowflake Standard Edition, you can’t share data through a data clean room with policy enforcement. Hence, you are not able to share data with other parties or leverage the data clean room policies specified in the offerings even for users in your own account.
However, you can access data offerings from other collaborators, or [use your own data as a local data offering](demo-flows/basic-multiparty-collab.md) without policies.

**Data offering requirements:**

* You must have the REFERENCE_USAGE privilege with GRANT OPTION on any data that you want to share. If you don’t, you receive a
  [“missing reference usage grant”](v2/troubleshooting.md) error when you try to register, join the
  collaboration, or link the data.

  ```sqlexample
  GRANT REFERENCE_USAGE ON DATABASE my_database TO ROLE my_role WITH GRANT OPTION;
  ```
* You must have the [data provider collaboration role](roles.md) in a collaboration.
* Currently, only the account role that created or joined the collaboration can link or unlink data into a collaboration.

Continue reading to see how to register and link a data offering into a collaboration:

## Register a data offering

1. Create a [data offering specification](spec-data-offering.md) for your data. Specify the following details about your
   data offering:

   * The source object for each dataset in your data offering.
   * Which columns to include in each dataset.
   * The type (join or otherwise) of each column, which is used to populate the clean room policies. In some cases, you will also specify the format of individual columns.
   * Any Snowflake data protection policies to apply to columns in your data offering.
   * How users can access the data: by template only, or also by [free-form SQL query](free-form-sql.md).
2. Register the data offering by calling REGISTER_DATA_OFFERING, which returns a data offering ID.

   This step makes the data offering *available* to be linked into any collaboration by any role in your account that has read access to the registry. You can use the same data offering ID to share a data offering across multiple collaborations.

## Link a data offering

The linking process depends on whether the collaboration has been created:

* **If the collaboration hasn’t been created yet,** the data provider can give the data offering ID to the collaboration owner to include
  in the [collaboration specification](spec-collaboration.md). When a data offering is included in the collaboration
  specification, the data offering ID will be visible in the collaboration specification for review by the data provider before joining the collaboration.
* **If the collaboration has been created,** the data provider joins the collaboration and calls LINK_DATA_OFFERING with the data offering
  ID, the collaboration name, and who the data can be shared with. There might be a short delay after a data offering is linked before the data offering is available to use.
  Call VIEW_UPDATE_REQUESTS if you want to ensure that the link data offering request has completed successfully. After successful linking, the data offering will be visible and ready to use when calling VIEW_DATA_OFFERINGS.

When you link data, you specify which analysis runners can access the data.

A data provider can remove data offerings from a collaboration or specific collaborators by calling UNLINK_DATA_OFFERING.

To see registered data offerings in your account, call VIEW_REGISTERED_DATA_OFFERINGS.

> **Tip:**
>
> Data offerings aren’t visible in a collaboration until the user who registered the data offering joins the collaboration.

See [Run an analysis](demo-flows/basic-multiparty-collab.md) to learn how to run an analysis.

## Source column renaming

Column names in a data offering can be renamed before exposing them to the analysis runner. Renaming depends on the
`category` and `column_type` values that define the column in the [data offering specification](spec-data-offering.md),
as described in this table:

| Column `category` | New column name |
| --- | --- |
| `join_standard` | `column_type` value |
| `timestamp` | `timestamp` |
| `join_custom`, `passthrough`, or `event_type` | Original column name is used. |

For example, if the column in the source table is named `user_email_address`, how this column is exposed to an analysis runner depends on how it’s defined in the data offering specification:

| Data offering specification | How the column is referenced |
| --- | --- |
| ```yaml ... schema_and_template_policies:   user_email_address:     category: join_standard     column_type: hashed_email_sha256 ``` | `column_type` is used for `join_standard` columns:  ```sqlexample SELECT HASHED_EMAIL_SHA256 FROM source_table[0]; ``` |

## Applying data protection policies to data offerings

Data shared in a clean room is protected in several ways:

* Data registered with the clean room environment is created as a secure view that omits any columns not listed in the data offering
  specification.
* The secure view is shared only with the specific users and templates specified by the collaboration specification.
* You can add Snowflake policies to your data to further manage how it’s used.
* Data Clean Room template policies are also applied based on the data offering column classification.

There are two ways to apply a Snowflake data protection policy, such as a [join](../join-policies.md) or [aggregation](../aggregation-policies.md) policy, to your shared data:

* Apply the policy to the source data. Any policies applied to the source data are
  enforced in the datasets exposed in a collaboration. Communicate your policy to your collaborators.
* Apply the policy to the data offering when used in free-form queries. If you allow
  free-form queries on your data offerings, you can specify policies to enforce on those queries in the data offering specification. These
  policies are applied on top of any existing Snowflake policies on your source tables.

### Apply the Snowflake policy to your source data

Any Snowflake policies applied to the source data also apply to the data offering view in the collaboration.

If you apply Snowflake policies to your source data, let your collaborators know about them so that they don’t unknowingly run a query that
joins on a non-joinable column or doesn’t meet aggregation requirements. Mention any Snowflake policies in your data offering’s
`description` field.

> **Important:**
>
> When registering a data offering that has Snowflake data policies on it, you should either use a role that is not subject to those policies, or temporarily suspend the policy until after the data is registered.
>
> This is because Snowflake Data Clean Rooms runs a validation query on the source table as part of the registration
> process. If the test query fails to return meaningful results, the registration fails. Some Snowflake data policies can cause the test
> to fail. For example, a table might have an aggregation policy, and the validation query won’t return enough rows to satisfy the aggregation
> policy’s minimum group size requirement.

### Apply the Snowflake policy to the data offering (*free-form query usage only*)

You can apply Snowflake policies to your shared data when it’s accessed through
[free-form queries](free-form-sql.md), without applying them to the source data. These policies are applied in
addition to any Snowflake policies applied directly to the source table.

**To add free-form SQL policies to your data:**

1. Create a policy of a [type supported by Collaboration Data Clean Rooms](spec-data-offering.md).
2. Add the following information to your data offering specification:

   * Set `allowed_analyses: template_and_freeform_sql`.
   * Add a `freeform_sql_policies` section to the dataset entry.
   * Add the appropriate policy type sections under `freeform_sql_policies`, listing the Snowflake policies that you created, and which
     collaboration columns they apply to. Supported policy types are:

     + `aggregation_policy`: A single aggregation policy with optional entity keys.
     + `projection_policies`: An array of projection policies, each with column bindings.
     + `join_policy`: A single join policy with optional column bindings.
     + `masking_policies`: An array of masking policies, each with column bindings.
     + `row_access_policy`: A single row access policy with optional column bindings.

   The role that registers the data offering must have the USAGE privilege on the policies.

Collaborators see policy types applied to your data when they call `COLLABORATION.VIEW_DATA_OFFERINGS`.

You can reuse a policy on multiple columns across multiple tables.

**Example:**

Policy creation and registrationData offering YAML

```sqlexample
CREATE OR REPLACE AGGREGATION POLICY my_db.public.my_agg_policy AS ()
  RETURNS AGGREGATION_CONSTRAINT ->
    AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);
```

```yaml
# Tell data clean rooms to set your aggregation policy on the hashed_email column of
# the data offering
api_version: 2.0.0
spec_type: data_offering
version: 1
name: my_favorite_dataset
datasets:
  - alias: test_freeform_restricted_agg
    data_object_fqn: samooha_provider_sample_database.audience_overlap.customers
    allowed_analyses: template_and_freeform_sql
    object_class: custom
    freeform_sql_policies:
      aggregation_policy:
        name: my_db.public.my_agg_policy
        entity_keys:
          - hashed_email
...
```

### Snowflake Data Clean Room template policies

Snowflake Data Clean Rooms also support their own policy system on top of the Snowflake policy system. Each data provider in a collaboration can set the following
policies on their data offering:

* A join policy, which specifies which columns *can* be joined on.
* A column policy, which specifies which columns *can* be projected.
* An activation policy, which specifies which columns *can* be activated.

A data provider can set these policies in their data offering specification:

* If the column’s `category` is `join_standard` or `join_custom`, the column is added to the clean room’s join policy.
* If the column’s `category` is set to any other value, the column is added to the clean room’s column policy.
* If the column’s `activation_allowed` value is set to TRUE, it is also added to the clean room’s activation policy.

Policies are enforced when a template has the appropriate [policy check filter](custom-templates.md). These
filters are: `join_policy`, `column_policy`, `activation_policy`, `join_and_column_policy`. At template execution time, these
filters validate that the referenced columns are permitted by the corresponding policy set from the data offering specification. A template
fails if a filter is applied to a column that isn’t part of the specified policy.

For example, both `col1` and `col2` must be part of the data provider’s join policies (`category: join_standard` or
`category: join_custom`), or the following template snippet will throw an error:

```sqlexample
SELECT *
FROM T1
JOIN T2
ON {{ t1_col | sqlsafe | join_policy }} = {{ t2_col | sqlsafe | join_policy }}
```

## Organizing data offerings with naming paths

You can use naming paths to group data offerings conceptually. This is particularly effective because each data
offering represents one or more tables or views. Individual tables are accessed using the syntax
`collaborator alias.data offering ID.dataset alias`, where the data offering ID is a combination of the user-provided name
and version values, and the alias is a single table in the offering.

Consider the name, version, and alias as a scoping system when registering your data offerings, which enables you to organize your data
by offering and alias. For example, you might register the following data offering of sales data, where each table is specific to a US
state:

```yaml
api_version: 2.0.0
spec_type: data_offering
version: v0
name: examplecorp_sales_by_state
datasets:
 - alias: AL
   data_object_fqn: mydb.mysch.al_data
 - alias: NY
   data_object_fqn: mydb.mysch.ny_data
 - alias: CA
   data_object_fqn: mydb.mysch.ca_data
```

The analysis runner references these tables as `user_alias.offering_id.AL`, `user_alias.offering_id.NY`, and `user_alias.offering_id.CA`.

---
title: Design custom templates
source: https://docs.snowflake.com/en/user-guide/cleanrooms/custom-templates.md
section: Clean Rooms
---

# Design custom templates

## About clean room templates

Clean room templates are written in [JinjaSQL](https://github.com/sripathikrishnan/jinjasql). JinjaSQL is an extension to the Jinja
templating language. A JinjaSQL template evaluates to a SQL statement when run in a clean room. The JinjaSQL templating language provides logic statements and run-time variable replacement, which enables the template to be customized at run time. For example, a user can provide table and column names when they run the template, and the template can adjust itself based on the values passed in.

There are two general types of templates:

* **Analysis templates**, which evaluate to a SQL DQL statement (a SELECT statement) that returns query results immediately to the template runner.
* **Activation templates**, which are used to activate results to a Snowflake account, rather than showing results
  in the immediate environment. An activation template is very similar to an analysis template with a few extra requirements, and it evaluates to a DDL statement (CREATE TABLE).

## Creating, sharing, and running a custom template

Any collaborator can [register and share templates](resources-templates.md) with specific analysis runners in a collaboration.

Let’s start by looking at a simple SQL query, and how it would be written as a template.

### 1. The JinjaSQL template

Here is a simple SQL query that joins two tables by email and shows the overlap count per city:

```sqlexample
SELECT COUNT(*), city FROM table_1
  INNER JOIN table_2
  ON table_1.hashed_email = table_2.hashed_email
  GROUP BY city;
```

Here is how that query would look as a JinjaSQL template that allows the caller to choose the JOIN and GROUP BY columns, as well as the tables used. The template includes some filters that enforce [Snowflake Data Clean Room policies](resources-data-offerings.md).

```sqlexample
SELECT COUNT(*), IDENTIFIER({{ group_by_col | column_policy }})
  FROM IDENTIFIER({{ source_table[0] }}) AS p1
  INNER JOIN IDENTIFIER({{ source_table[1] }}) AS p2
  ON IDENTIFIER({{ p1_join_col | join_policy }}) = IDENTIFIER({{ p2_join_col | join_policy }})
  GROUP BY IDENTIFIER({{ group_by_col | column_policy }});
```

**Notes on the template:**

* Values within {{ double bracket pairs }} are variables. The values are populated by the caller.
* `group_by_col`, `source_table`, `p1_join_col`, and `p2_join_col` are all variables
  populated by the caller. These variables have arbitrary names chosen by the template designer.
* `source_table` is a standard Snowflake-defined variable. This variable defines the views to use in the query. These views are datasets
  within data offerings that are linked into the clean room. Collaborators can list available datasets by calling VIEW_DATA_OFFERINGS.
* A dataset must be aliased as lowercase `p` if you want to enforce Snowflake Data Clean Room policies on it. If a template uses
  multiple datasets, the first is `p` or `p1`, and additional datasets are indexed as `p2`, `p3`, and so on.
* IDENTIFIER is needed for all column and table names, because variables in {{ double brackets }} evaluate to string literals, which aren’t
  valid identifiers.
* JinjaSQL *filters* are applied to columns to enforce Snowflake Data Clean Room policies on the column. Snowflake implements custom
  filters `join_policy` and `column_policy`, which verify whether a column complies with join or column policies in the clean room
  respectively, and fail the query if it doesn’t. A filter is applied to a column name as `{{ column_name | filter_name }}`.

All these points will be discussed in detail later.

### 2. The Collaboration template

A template is added to a collaboration by embedding it in a YAML specification and registering it, then linking it.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: my_test_template
  version: 2026_01_12_V1
  type: sql_analysis
  description: A test template
  methodology: Join on single column with a single group by value
  parameters:
  - name: source_tables
    description: Tables from both sides which can be listed in any order, aliased with p1 or p2
    required: true
  - name: p1_join_col
    description: Column to join on from first table specified under source_tables, aliased with p1
    required: true
  - name: p2_join_col
    description: Column to join on from second table specified under source_tables, , aliased with p2
    required: true
  - name: group_by_col
    description: Column which results should be grouped group aliased with respective table p1 or p2
    required: true

  template:
    SELECT COUNT(*), IDENTIFIER({{ group_by_col | column_policy }})
    FROM IDENTIFIER({{ source_table[0] }}) AS p1
    INNER JOIN IDENTIFIER({{ source_table[1] }}) AS p2
    ON IDENTIFIER({{ p1_join_col | join_policy }}) = IDENTIFIER({{ p2_join_col | join_policy }})
    GROUP BY IDENTIFIER({{ group_by_col | column_policy }});

$$);
```

You must request to share a template with a given analysis runner, who can accept or reject the request. Additionally, all data providers for that analysis runner must accept the request for the template to be shared.

```sqlexample
-- Request to share template with only Collaborator3.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ADD_TEMPLATE_REQUEST(
  $collaboration_name,
  $template_id,
  ['Collaborator3']
);
```

### 3. Running the template

Here is how an analysis runner might run this template in code. Note how column names are qualified by the table aliases
declared in the template.

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN( $collaboration_name,
$$
api_version: 2.0.0
spec_type: analysis
name: example_run
description: Example run for template
template: $template_id

template_configuration:
  view_mappings:
    source_tables:
      - collaborator_1.data_offering_1.dataset_1
      - collaborator_2.data_offering_2.dataset_2
  arguments:
     p1_join_col: p1.hashed_email
     p2_join_col: p2.hashed_email
     group_by_col: p2.device_type

$$ );
```

### Developing a custom template

Clean room templates are JinjaSQL templates. To create a template, you should be familiar with the following topics:

* [Jinja templating basics](https://jinja.palletsprojects.com/en/stable/)
* The [JinjaSQL extension to Jinja](https://github.com/sripathikrishnan/jinjasql).

You can use Cortex Code to validate the SQL output of your JinjaSQL templates based on variable inputs that should be provided. See example prompts below that you can copy into Cortex Code to get final SQL outputs you can test:

**Example:**

```text
Resolve the following Jinja template into SQL based on the variables defined:

Jinja Template:
 SELECT IDENTIFIER({{ col1 | column_policy }}), IDENTIFIER({{ col2 | column_policy }})
  FROM IDENTIFIER({{ source_table[0] }}) AS p1
  JOIN IDENTIFIER({{ source_table[1] }}) AS p2
  ON  IDENTIFIER({{ p1_join_col | join_policy }}) = IDENTIFIER({{ p2_join_col | join_policy }})
  {% if where_phrase %} WHERE {{ where_phrase | sqlsafe }}{% endif %};

Variable Inputs:
source_table: SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS, SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
col1: p1.status
col2: p1.age_band
p1_join_col: p1.hashed_email
p2_join_col: p2.hashed_email
where_phrase: p1.household_size > 2
```

The rendered template looks like this:

```output
SELECT IDENTIFIER('p1.status'), IDENTIFIER('p1.age_band')
FROM IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p1
JOIN IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p2
ON  IDENTIFIER('p1.hashed_email') = IDENTIFIER('p2.hashed_email')
WHERE p1.household_size > 2;
```

Try running the SQL statement above in your environment to see if it works and gets the expected results.

Then test your template without a WHERE clause:

```text
Resolve the following Jinja template into SQL based on the variables defined:

Jinja Template:
 SELECT IDENTIFIER({{ col1 | column_policy }}), IDENTIFIER({{ col2 | column_policy }})
  FROM IDENTIFIER({{ source_table[0] }}) AS p1
  JOIN IDENTIFIER({{ source_table[1] }}) AS p2
  ON  IDENTIFIER({{ p1_join_col | join_policy }}) = IDENTIFIER({{ p2_join_col | join_policy }})
  {% if where_phrase %} WHERE {{ where_phrase | sqlsafe }}{% endif %};

Variable Inputs:
source_table: SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS, SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
col1: p1.status
col2: p1.age_band
p1_join_col: p1.hashed_email
p2_join_col: p2.hashed_email
```

Rendered template:

```output
SELECT IDENTIFIER('p1.status'), IDENTIFIER('p1.age_band')
FROM IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p1
JOIN IDENTIFIER('SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS') AS p2
ON  IDENTIFIER('p1.hashed_email') = IDENTIFIER('p2.hashed_email');
```

Add the template into your clean room, and test with an analysis run spec.

### Data protection

Templates can access only datasets linked into the clean room by collaborators.

Collaborators specify join, column, and activation policies on their datasets to ensure that only those columns can be used as an input for a template variable.

> **Important:**
>
> The template must include
> the appropriate JinjaSQL policy filter on a column for the policy to be applied.

## Custom template syntax

Snowflake Data Clean Rooms supports V3 JinjaSQL, with a few extensions as noted.

This section includes the following topics:

### Template naming rules

When creating a template, names must contain only letters, numbers, or underscores.
Template names are assigned in the template specification’s `name` field when you register the template.

**Example valid names:**

* `my_template`
* `activation_template_1`

**Example invalid names:**

* `my template` - Spaces not allowed
* `my_template!` - Special characters not allowed

### Template variables

Template callers can pass in values to template variables. JinjaSQL syntax enables variable binding for any variable name
within {{ double_brackets }}, but Snowflake reserves a few variable names that you shouldn’t override, as described below.

> **Caution:**
>
> All variables, whether Snowflake-defined or custom, are populated by the user and should be treated with appropriate caution.
> Analysis templates must resolve to a single SELECT statement (activation templates resolve to a script block). Remember that all
> variables are passed in by the caller.

#### Snowflake-defined variables

All clean room templates have access to the following global variables defined by Snowflake, but passed in by the analysis runner:

`source_table`:
:   A zero-based string array of tables and views from data offerings linked into the collaboration via LINK_DATA_OFFERING that can be used by the template.

    **Example:** `SELECT col1 FROM IDENTIFIER({{ source_table[0] }}) AS p;`

`my_table`:
:   In a Collaboration clean room, `my_table` is used only by Snowflake Standard Edition users. For these users, `my_table` is a zero-based string array of datasets that the analysis runner linked by calling LINK_LOCAL_DATA_OFFERING.

    **Example:** `SELECT col1 FROM IDENTIFIER({{ my_table[0] }}) AS c;`

#### Custom variables

Template creators can include arbitrary variables in a template that can be populated by the analysis runner. These variables can have any Jinja-compliant name except for the Snowflake-defined variables or table alias names. You should provide guidance in the parameter section of the template for required and optional variables.

Custom variables can be accessed by your template, as shown here for the custom variable `max_income`:

```sqlexample
SELECT income FROM my_db.my_sch.customers WHERE income < {{ max_income }};
```

Analysis runners pass variables when calling RUN as defined in the [analysis run spec](spec-analysis.md).

#### Resolving variables correctly

String values passed into the template resolve to a string literal in the final template. This can cause SQL parsing or logical errors if
you don’t handle bound variables appropriately:

* `SELECT {{ my_col }} FROM p;` - This resolves to `SELECT 'my_col' from p;` which simply returns the string “my_col” - probably not
  what you want.
* `SELECT age FROM {{ source_table[0] }} AS p;` - This resolves to `SELECT age FROM 'somedb.somesch.source_table' AS p;`, which causes a
  parsing error because a table must be an identifier, not a literal string.
* `SELECT age FROM IDENTIFIER({{ source_table[0] }}) AS p {{ where_clause }};` - Passing in “WHERE age < 50” evaluates to
  `SELECT age FROM mytable AS p 'WHERE age < 50';`, which is a parsing error because of the literal string WHERE clause.

Therefore, where appropriate, you must resolve variables. Here is how to resolve variables properly in your template:

Resolving table and column names
:   Variables that specify table or column names must be converted to identifiers in your template in one of two ways:

    * [IDENTIFIER](../../sql-reference/identifier-literal.md): For example: `SELECT IDENTIFIER({{ my_column }}) FROM p;`
    * [sqlsafe](https://github.com/sripathikrishnan/jinjasql?tab=readme-ov-file#sql-safe-strings): This JinjaSQL filter resolves identifier
      strings to SQL text. An equivalent statement to the previous bullet is `SELECT {{ my_column | sqlsafe }} FROM p;`

    Your particular usage dictates when to use IDENTIFIER or `sqlsafe`. For example, `p.{{ my_column | sqlsafe }}` can’t easily be
    rewritten using IDENTIFIER.

Resolving dynamic SQL
:   When you have a string variable that should be used as literal SQL, such as a WHERE clause, use the `sqlsafe` filter in your template.
    For example:

    ```sqlexample
    SELECT age FROM IDENTIFIER({{ source_table[0] }}) AS p WHERE {{ where_clause }};
    ```

    If a user passes in “age < 50” to `where_clause`, the query would resolve to `SELECT age FROM sometable AS p WHERE 'age < 50';`
    which is invalid SQL because of the literal string WHERE condition. In this case, you should use the `sqlsafe` filter:

    ```sqlexample
    SELECT age FROM IDENTIFIER( {{ source_table[0] }} ) as p {{ where_clause | sqlsafe }};
    ```

### Required table aliases

At the top level of your query, all `source_table` datasets must be aliased as `p`, and all `my_table` datasets must be aliased as
`c`, in order for Snowflake to validate join and column policies correctly in the query. Any column that must be verified against join
or column policies must be qualified with the lowercase `p` or `c` table alias.

If you use multiple `source_table` or `my_table` datasets in your query, add a numeric, sequential 1-based suffix to each table alias
after the first. So: `p` or `p1`, `p2`, `p3`, and so on for the first, second, and third `source_table` datasets, and `c` or `c1`, `c2`,
`c3`, and so on for the first, second, and third `my_table` datasets. The `p` or `c` index should be sequential without gaps (that
is, create the aliases `p1`, `p2`, and `p3`, not `p1`, `p2`, and `p4`).

**Example**

```sqlexample
SELECT p1.col1 FROM IDENTIFIER({{ source_table[0] }}) AS p1
UNION
SELECT p2.col1 FROM IDENTIFIER({{ source_table[1] }}) AS p2;
```

### Custom clean room template filters

Snowflake supports all the [standard Jinja filters](https://jinja.palletsprojects.com/en/stable/templates/#builtin-filters) and most of
the standard
[JinjaSQL filters](https://github.com/search?q=repo%3Asripathikrishnan%2Fjinjasql+self.env.filters+path%3Ajinjasql%2Fcore.py&type=code&path+jinjasql%2Fcore.py=),
along with a few extensions:

`join_policy`:
:   Succeeds if the column is in the join policy of the data owner; fails otherwise. See [Applying data protection policies to data offerings](resources-data-offerings.md).

`column_policy`:
:   Succeeds if the column is in the column policy of the data owner; fails otherwise. See [Applying data protection policies to data offerings](resources-data-offerings.md).

`activation_policy`:
:   Succeeds if the column is in the activation policy of the data owner; fails otherwise. See [Applying data protection policies to data offerings](resources-data-offerings.md).

`join_and_column_policy`:
:   Succeeds if the column is in the join or column policy of the data owner; fails otherwise. See [Applying data protection policies to data offerings](resources-data-offerings.md).

`identifier`:
:   This JinjaSQL filter is **not supported** by Snowflake templates.

> **Tip:**
>
> JinjaSQL statements are evaluated left to right:
>
> * `{{ my_col | column_policy }}` **Correct**
> * `{{ my_col | sqlsafe | column_policy }}` **Correct**
> * `{{ column_policy | my_col }}` **Incorrect**
> * `{{ my_col | column_policy | sqlsafe }}` **Incorrect:** `column_policy` will be checked against the `my_col` value as a string,
>   which is an error.

### Enforcing clean room policies

Clean rooms don’t automatically check clean room policies against columns used in a template. If you want to enforce a policy against a
column:

* You must apply the appropriate policy filter to that column in the template. For example:

```sqlexample
FROM IDENTIFIER({{ source_table[0] }}) AS p1
JOIN IDENTIFIER({{ source_table[1] }}) AS p2
  ON IDENTIFIER({{ p1_join_col | join_policy }}) = IDENTIFIER({{ p2_join_col | join_policy }})
```

* You must alias the table as lowercase `p` or `c`. See Required table aliases.

Policies are checked only against columns of tables referenced in a **source_table** variable, which refer
to views shared within the clean room. Policies are not checked against columns of tables referenced in
a **my_table** variable, which are local tables not shared within the clean room.

Note that column names can’t be ambiguous when testing policies. So if you have columns with the same
name in two tables, you must qualify the column name in order to test the policy against that column.

## Access considerations and best practices

A template is always executed in context to the clean room application role. A collaborator does not have direct access to any data within the clean room that is restricted to template access only; all access is through the native application roles and the template outputs.

As best practice, you should follow the below for templates you create or use in a clean room:

* Ensure a policy filter is applied any time a column variable is used in a template, so that collaborator policies are respected.
* Wrap user-provided variables with IDENTIFIER() when possible to strengthen templates against SQL injection attacks.

## Activation templates

A template can also be used to save query results to a table outside of the clean room; this is called *activation*. An activation template is an analysis template with the following additional requirements:

* Activation templates are JinjaSQL statements that evaluate to a SQL script block, unlike analysis templates, which can be simple SELECT
  statements.
* Activation templates must create an internal table in the clean room to store results. The table generated by the template must have the
  prefix `cleanroom.activation_data_`, for example: `cleanroom.activation_data_my_results`
* All columns in the internal results table should have the value `activation_allowed: TRUE` in their data offering specification.
* The script block should end with a RETURN statement that returns the name of the generated table without the
  `cleanroom.activation_data_` prefix, for example: `RETURN 'my_results'`.
* The template itself has no naming requirements.

Here is an example activation template specification:

```yaml
api_version: 2.0.0
spec_type: template
name: my_activation_template
version: v0
type: sql_activation
description: Activation template that creates segment data
template: |
  BEGIN
      CREATE OR REPLACE TABLE cleanroom.activation_data_analysis_results AS
      SELECT
          {{ group_by_column | sqlsafe }} AS bucket_label,
          {{ activation_column | sqlsafe | activation_policy }} AS activation_label,
          COUNT(DISTINCT {{ join_column | sqlsafe }}) AS overlap_count
      FROM IDENTIFIER({{ source_table[0] }}) AS p
      GROUP BY {{ group_by_column | sqlsafe }},
               {{ activation_column | sqlsafe }};
      RETURN 'analysis_results';
  END;
parameters:
  - name: join_column
    description: Join column name
    required: true
    default: "p.IP_ADDRESS"
  - name: group_by_column
    description: Group by column name
    required: true
    default: "p.CAMPAIGN_NAME"
  - name: activation_column
    description: Activation column name
    required: true
    default: "p.DEVICE_TYPE"
```

Learn how to implement activation in a collaboration: [Activating query results](activation.md).

## Next steps

After you’ve mastered the templating system, read the specifics for implementing a clean room with your template type:

* [Activation templates](activation.md) create a results table after a successful run and is shared outside of the clean room. Depending on the collaboration specification, the results table can be shared to the analysis runner or other collaborators.
* [Code bundles](resources-code-bundles.md) are used to upload custom Python UDFs and UDTFs into a collaboration. Templates in the collaboration can run these functions to perform complex data actions.
* [Internal tables](multistep-flows.md) are used to store intermediary or persistent results, which can be used downstream to support multistep workflows. These tables are accessible to templates or custom uploaded code inside the clean room.

## More information

* [Jinja documentation](https://jinja.palletsprojects.com/en/stable/)
* [JinjaSQL documentation](https://github.com/sripathikrishnan/jinjasql)

---
title: Differential privacy in Snowflake Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/differential-privacy.md
section: Clean Rooms
---

# Differential privacy in Snowflake Data Clean Rooms

To help protect the privacy of entities in your data, Snowflake Data Clean Rooms offer *differential privacy*. Differential privacy is a
math-based privacy system [1] to provide entity-level data protection for both single queries and repeated querying of a data set.
Data providers can configure differential privacy in their clean rooms to enable strong entity-level privacy protection and low noise
levels for their data.

Differential privacy is an alternative to simple aggregation requirements, which can expose private information if adversaries generate
enough “close” queries on data that differ by one entity (known as a *differencing attack*).

Differential privacy is also a good alternative to data masking, which hides column values entirely at the cost
of preventing joins on masked rows and hiding useful data from the analyst. Differential privacy enables joins on protected columns, and
also allows analysts to view protected data, by adding enough noise to protect the privacy of protected rows, but not so much
noise that the data is unusable by the analyst.

[1]

C. Dwork and A. Roth. The Algorithmic Foundations of Differential Privacy. Foundations and Trends in Theoretical
Computer Science, 9 (3-4):211-407, 2014.

> **Important:**
>
> Customers are responsible for configuring differential privacy tools in Snowflake Data Clean Rooms to meet their data privacy
> requirements. These tools are not configured by default.

## How differential privacy works for clean rooms

Clean rooms offer their own differential privacy implementation that is different from
[Snowflake differential privacy](../diff-privacy/differential-privacy-admin-privacy-policies.md), so read this document to
understand the different behavior and settings.

Differential privacy protects the privacy of *entities* in your data. Clean rooms define an entity as a unique value in a column. Clean
rooms determine which columns contain data that are likely to be sensitive; for example, a social security number or email address is
probably a sensitive entity, but a color is not. When differential privacy is applied, clean rooms might identify one or more entity
columns in each table. You cannot configure which columns are designated as entity columns.

Differential privacy in clean rooms also adds noise to numeric results associated with each entity.

Users might try to compare multiple different query results in order to reduce the noise; this is called a *differencing attack*. In order
to mitigate differencing attacks, differential privacy calculates and monitors a *privacy budget* assigned to an account. Each query has a
cost that reflects how much entity privacy is exposed by that query. This cost is determined mathematically, and depends on the query, the
data, and the previous queries from that user. If a query cost exceeds the remaining privacy budget limit, the query will fail. Otherwise, the
query can continue and the cost is added to the user’s daily privacy budget. The privacy budget is refreshed daily.

Differential privacy in clean rooms does not enforce aggregation constraints on queries, but you can
add aggregation constraints on your data or templates independently.

> **Tip:**
>
> [Snowflake privacy policies](../diff-privacy/differential-privacy-admin-privacy-policies.md) prevent creation of a view from
> a protected table, so you cannot link in tables that have privacy polices.

## Enable and manage differential privacy in the UI

In the clean room UI, providers can set privacy settings at the template level; consumers cannot enable or modify differential privacy
settings. Standard Snowflake templates used in the clean rooms UI can have different privacy settings per template.

To use the clean room UI to enable or disable differential privacy for a template:

1. Open the Created tab of the Clean Rooms page
2. Select Edit or  » Edit on the clean room tile (depending on whether the clean room allows you to
   run an analysis).
3. Select Next until you reach Configure Analysis & Query.
4. At the bottom of the page, expand Privacy Settings. Select or deselect Differential Privacy and provide your settings for
   that template, including privacy budget for users and query cost. You can also set threshold values to enforce minimum group sizes in
   this query.
5. To configure settings for a different template, first set values for the current template, then choose a different template in the
   template selector.

> **Tip:**
>
> If you enable differential privacy for the Audience Overlap template, do not compute overlap statistics. Doing so will consume most of
> the user’s privacy budget, leaving little or no budget to run analyses.

### Manage privacy budget in the UI

**See your remaining privacy budget**

When you run a query or view the results, you can see your total budget and the amount used in the Privacy Settings section.

**Set the privacy budget for other users**

In the UI, a provider can set a privacy budget, but a consumer cannot.

> 1. Edit a clean room and go to the Configure Analysis & Query page.
> 2. Select a template.
> 3. At the bottom of the page, expand Privacy Settings where you can see your privacy budget for users and query cost.

## Enable and manage differential privacy in the API

In the clean rooms API, either side can enable and configure differential privacy at the collaborator level.

All custom templates use the same differential privacy settings in a clean room. Snowflake-provided templates can be configured with
individual privacy settings in the UI.

Use the following procedures to configure differential privacy:

* `consumer.enable_templates_for_provider_run` - Turn differential privacy off or on with default values for all
  provider-run analyses.
* `consumer.set_privacy_settings` - Specify individual differential privacy settings in provider-run analyses involving custom templates.
* `provider.set_privacy_settings` - Specify individual differential privacy settings in consumer-run analyses involving custom templates.
* `provider.add_custom_sql_template` - Provide a *sensitivity* parameter to increase or decrease the epsilon (noise level) for a
  template above or below the base line epsilon set for the consumer.
* `provider.add_consumers` - Specify privacy settings per consumer. You can add the same customer multiple times with different
  privacy settings to change their privacy settings.
* `provider.suspend_account_dp_task` - Turn off differential privacy budget monitoring for all clean rooms in this account. Differential
  privacy is no longer enforced.
* `provider.resume_account_dp_task` - Turn on differential privacy budget monitoring for all clean rooms in this account. Any
  differential privacy settings will be respected.

Privacy settings for a clean room are stored in `SAMOOHA_CLEANROOM_cleanroom_ID.admin.privacy_budget`, where `APPLICATION_ID` is
a template name (NULL represents all custom templates) and PARTY_ACCOUNT is the user it is applied to.

### Manage privacy budget in the API

**See your remaining privacy budget**

Consumers can call the `consumer.view_remaining_privacy_budget` procedure. There is no way for providers to see their remaining privacy
budget in code.

**Set the privacy budget for other users**

* **Providers** call `provider.set_privacy_settings` or `provider.add_consumers`.
* **Consumers** call `consumer.set_privacy_settings` to set budget for provider-run analyses.

### Available privacy settings

The following privacy values can be set using various privacy value setting procedures:

* `differential` (*Integer*) - 1 or 0, where 1 means that differential privacy should be enabled, and 0 means it should not.
* `epsilon` (*Float*): A number greater than zero indicating how much noise should be added to the results. Smaller values (0.1-1.0)
  provide stronger privacy protection but add more noise to the results. Default: 0.1.
* `noise_mechanism` (*String*) - The algorithm used to add noise to the results. Specify either `Laplace` or `Gaussian`.
* `privacy_budget` (*Integer*) - How much privacy budget to give this user, a number >= 0, where 0 means they cannot run a query when
  differential privacy is enabled. Default is 10.
* `threshold` (*Integer*) - Specify 1 to enforce `threshold_value` in Snowflake-provided templates, or 0 to ignore `threshold_value`.
  Default is 0. This is managed by the differential privacy toggle in the clean room UI.
* `threshold_value` (*Integer*) - Minimum number of rows that a group should have to appear in the data. Only used in specific
  Snowflake-provided templates.

## Additional privacy functionality

### Add noise to results

If you want to manually add noise to your results without implementing differential privacy, you can use the following clean room function
in your template or custom code. Note that this code requires the user to have sufficient privacy budget or it will fail; if the
differential privacy task is disabled, the user essentially has infinite budget.

```sqlsyntax
cleanroom.addnoise(<val>, <epsilon>, <noiserand>, [<gaussian>], [<delta>])
```

**Description:** Add calibrated noise to a numerical value to satisfy differential privacy guarantees. This function can be called only in
the context of a clean room. This does not require differential privacy to be enabled for the user or template, or the differential privacy
task to be enabled. Use this function in a template or UDP/UDTP.

**Arguments:**

* `val` *(DOUBLE)* - The original value to which noise will be added.
* `epsilon` *(DOUBLE)* - The privacy budget parameter, where smaller values (0.1-1.0) provide stronger privacy protection but add
  more noise. Value is > 0.
* `noiserand` (*DOUBLE)* - A random value between 0 and 1 that adds randomness to each result. Calculate this on the fly with a
  random value generator rather than passing in a static value.
* `gaussian` *(BOOLEAN, optional)* - When TRUE, uses Gaussian noise instead of Laplacian noise. Default is FALSE.
* `delta` *(DOUBLE, optional)* - The delta parameter for the Gaussian mechanism when `gaussian` is TRUE (smaller is better).
  Default is 0.000001.

**Returns:** A DOUBLE value representing the original value with privacy-preserving noise added.

**Recommendations:**

* Apply only to aggregates (COUNT, SUM, AVG), never to individual records.
* Consider rounding the results to avoid revealing too much precision.
* This function requires a privacy budget to run, so be aware that it will fail if the user is out of budget.
* Combine with minimum group size constraints for enhanced protection.

**Example:**

This example template adds noise to a count of distinct hashed email values, using the cleanroom-wide epsilon value.

```sqlexample-python
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
$cleanroom_name,
$template_name,
$$
SELECT
  cleanroom.addNoise(
    count(distinct p.hashed_email),  -- Value
    {{ privacy.epsilon | sqlsafe }}, -- Epsilon
    UNIFORM(0::FLOAT, 1::FLOAT, RANDOM()) -- Noiserand
    ) AS noisy_count
FROM
    IDENTIFIER({{ source_table[0] }}) p
$$);
```

### Set aggregation policies and minimum group sizes

If you want to require aggregation on your data, and specify minimum group sizes, you can either set an
[aggregation policy](../aggregation-policies.md) on the source tables, or enforce aggregation in your templates.

## Managing differential privacy costs

Differential privacy [incurs costs](cleanroom-cost.md) even when individual users or templates have not enabled
differential privacy, because the system checks every query to see whether differential privacy should be applied. If you want to eliminate
this cost, you can disable differential privacy for the account:

1. First, turn off differential privacy for all clean rooms using the clean rooms UI:

   1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
   2. Disable differential privacy in all non-failed clean rooms, even if not shared or published:

      1. Select Clean rooms » Created » Edit.
      2. Select Next until you reach Configure Analysis & Query.
      3. At the bottom of the page, expand Privacy Settings. Deselect Differential Privacy if it is selected, then click
         Next and Finish to save your changes. If it is not selected, just click Cancel and move on to the next clean
         room.
2. Finally, suspend the differential privacy background task in your account by calling the
   [provider.suspend_account_dp_task procedure](provider.md) in Snowsight.

> **Important:**
>
> Enabling differential privacy in a clean room after disabling the background task automatically re-enables the task for that account.

**Some notes and troubleshooting:**

* If you forget to disable differential privacy for a clean room and suspend the background task, differential privacy might not
  function in that clean room for users who have already installed it.
* If differential privacy is enabled within a clean room prior to the clean room being installed, the installation of the clean room
  fails. In this case, you must disable differential privacy in the clean room or re-enable the task as outlined below.

**If you later want to enable differential privacy in your account,** either enable differential privacy for any clean room in the account
or call the [provider.resume_account_dp_task procedure](provider.md) in Snowsight.

---
title: Enabling the legacy Snowflake Data Clean Rooms UI
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/enable-clean-rooms-ui.md
section: Clean Rooms
---

# Enabling the legacy Snowflake Data Clean Rooms UI

## Overview

Snowflake Data Clean Rooms can be used in two different environments:

* **The clean rooms UI:** A graphical, no-code, browser-based environment that makes it easy to create and run analyses.
* **The clean rooms API:** A set of stored procedures you can use to create and manage clean rooms and run analyses.

These environments provide similar, but [not exactly equivalent](../getting-started.md), capabilities. A clean rooms
administrator installs one or both components in a Snowflake account, and can then grant users access to each environment individually.

## Requirements to enable clean rooms UI for Snowflake Data Clean Rooms

### Account, installer, and user requirements

After you install the clean rooms environment, access to the clean rooms environment must be granted to users explicitly by a clean rooms administrator.

Here are the requirements to enable Snowflake Data Clean Rooms UI in your Snowflake account:

* **The account must allow** [key-pair authentication](../admin-tasks.md), which is used by the service account
  for authentication.
* **The Snowflake account must be a capacity account:** this is an account that has an up-front
  capacity commitment. Snowflake On-Demand accounts cannot access the clean rooms UI.
* **You must use multi-factor authentication** (MFA) with a
  [supported authenticator app](web-app-introduction.md).

### Role requirements

Here are the role requirements for the person enabling the clean rooms UI:

* You must have an ACCOUNTADMIN role in a Snowflake account and already installed the clean rooms environment in that account.
* The user with the ACCOUNTADMIN role must have a valid first name, last name, and email defined for their user object. To check,
  run [DESCRIBE USER](../../../sql-reference/sql/desc-user.md).

## Enable the clean rooms UI

The clean rooms UI provides an easy no-code environment to manage your clean rooms account and create clean rooms and run analyses. It also
provides some [additional functionality](../getting-started.md) not available in the clean rooms API, such as scheduled
queries, third-party activation, and useful predefined templates.

Here is how to enable the clean rooms UI in your Snowflake account:

1. **Configure your network policies** to
   [allow the clean rooms UI to access your Snowflake account](../admin-tasks.md).
   (*Required only if your Snowflake account uses a network policy to control network traffic.*)
2. **Complete the UI setup.** This step configures a service user [\*] that the clean rooms UI uses to communicate with Snowflake.

   1. [Sign in to the clean rooms UI](https://cleanroom.c1.us-east-1.aws.app.snowflake.com/) with your Snowflake credentials.
   2. Open Admin » Snowflake Admin » Connect to Snowflake account.
   3. Under Enable the Data Clean Rooms UI choose Quick Setup or Manual Setup:

      * Quick Setup - This creates a service user for you. Specify a unique service user name for this account.
      * Manual Setup - If you want to create the service user yourself, or reuse an existing service user, select this option. Note
        that clean rooms will take control of the service user and modify it, so make sure that the service user isn’t used for
        anything else. Learn how to create a service user.
   4. Enter your unique service user name and select Finish.
3. **Provide additional users access to the UI** [Manage UI clean room users](../manage-dcr-users.md) by giving the appropriate priveleges to conduct and manage clean room operations via the UI.

[\*]

The service user is configured as follows: Clean rooms sets the type as SERVICE, creates and applies the required network policy
(named SAMOOHA_SERVICE_ACCOUNT_USER_ACCESS) for the service user, sets the authentication as key-pair, and grants SAMOOHA_APP_ROLE to
the service user.

## Troubleshooting installation

Use this section to troubleshoot problems you might have after completing the steps in this topic.

Symptom: Insufficient privileges
:   **Solution:**
    Ensure that the IP addresses associated with the clean rooms UI are allowed by your network policies. For a list of these IP addresses,
    see [Clean rooms UI hosting locations and IP addresses](web-app-introduction.md).

Symptom: Installation is successful, but the clean rooms UI is not functioning properly.
:   **Solution #1:**
    Use the [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) command to double-check that the Snowflake user that you used to configure Snowflake has a
    valid first name, last name, and email. If the user is missing any of these, execute the [ALTER USER](../../../sql-reference/sql/alter-user.md) command to
    specify them.

    **Solution #2:**
    Try uninstalling the Snowflake Native App for Snowflake Data Clean Rooms, and then re-installing it.

    * To uninstall the app, see
      [Uninstall a Snowflake Native App](https://other-docs.snowflake.com/en/native-apps/consumer-managing-applications#uninstall-a-native-app).
      If you installed the application with its default name, it is called SAMOOHA_BY_SNOWFLAKE.
    * To re-install the app:

      1. [Sign in to the clean rooms UI](web-app-introduction.md).
      2. In the left navigation pane, select Snowflake Admin.
      3. Select Login to Snowflake, and authenticate as a Snowflake user with the ACCOUNTADMIN role.
      4. Use the [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) command to confirm that the Snowflake user with the ACCOUNTADMIN role that you just
         used to authenticate has a valid first name, last name, and email. If the user is missing any of these, execute the
         [ALTER USER](../../../sql-reference/sql/alter-user.md) command to specify them.
      5. To install the Snowflake Native App, select Install.
      6. Accept the default name of the application during the installation process.

## Creating a UI service user manually

When installing the clean rooms UI, you can either let the installation create the service user for you, or you can provide a service user
that you create. Here is how to create a service user in Snowsight:

Sign in to [Snowsight](../../ui-snowsight-gs.md) with your Snowflake administrator credentials and create a user as shown in the following SQL example:

> ```sqlsyntax
> -- Create the user.
> -- Clean rooms will set the type to SERVICE for you.
>
> USE ROLE USERADMIN;
> CREATE USER <SERVICE-USER-USERNAME>;
> ```

> **Important:**
>
> Clean rooms alters the authentication controls, network policies, and other attributes of the service user. You will not be able to use
> this user yourself after you give it to the clean rooms environment.

---
title: Free-form SQL queries
source: https://docs.snowflake.com/en/user-guide/cleanrooms/free-form-sql.md
section: Clean Rooms
---

# Free-form SQL queries

A data provider can allow their data to be exposed to an analysis runner via a template or free-form queries. When a data provider enables
free-form queries on a dataset, any analysis runners with access to the data offering can run SQL queries in their environment against that
dataset.

Analysis runners and data providers must both have joined the collaboration before the data becomes available.

## Overview

Here are the steps to run free-form queries against data in a clean room:

**Data provider**

1. Register a data offering that contains one or more datasets where `allowed_analyses: template_and_freeform_sql` is specified.

   If the data provider wants to apply Snowflake policies to columns in the dataset, they must create those policies before registering the
   data, and associate the policies with the columns in the data offering specification.
2. Link the data offering into the collaboration in the standard way.

**Analysis runner**

After the collaboration is installed on their account, the analysis runner calls VIEW_DATA_OFFERINGS. If there is a value in the
`freeform_sql_view_name` column, the dataset can be queried directly against the view named in that column.

Any policies listed in `freeform_sql_column_policies` are applied to the data by the collaboration. Any policies applied directly to the
source data by the data provider are enforced, but won’t be shown in that column.

Details about the data provider and analysis steps are given in the following sections.

## Registering a free-form query dataset (Data Provider)

The following steps show how to enable free-form queries during data offering registration:

1. Specify `allowed_analyses: template_and_freeform_sql` in the collaboration specification. This enables the dataset to be queried
   using either a template or free-form query.

   ```yaml
   ...
   datasets:
   - alias: customers_view
     data_object_fqn: PROVIDER_DB.DATA_SCH.CUSTOMERS
     object_class: custom
     allowed_analyses: template_and_freeform_sql
     schema_and_template_policies:
       HASHED_EMAIL:
         category: join_standard
         column_type: hashed_email_b64_encoded
   ...
   ```

   Only the columns listed under `schema_and_template_policies` are available for querying via templates or free-form queries.
2. If you want to apply Snowflake policies in free-form queries without applying them to your source data, take the following steps:

   1. Create your Snowflake policies in the standard way. Don’t apply them to your table.

      ```sqlexample
      CREATE OR REPLACE AGGREGATION POLICY PROVIDER_DB.DATA_SCH.MIN_GROUP_SIZE_POLICY
        AS () RETURNS AGGREGATION_CONSTRAINT ->
          AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);
      ```

      The role that creates the collaboration must have the USAGE privilege on the database, schema, and policy object.

      These policies are linked dynamically; any changes that you make to these policies immediately affect any datasets that use those
      policies, even if the data offering is already registered and linked.
   2. Assign your policies in the data offering specification under the `freeform_sql_policies` field. Important: All column
      names used under `freeform_sql_policies` must use the [auto-generated column name](resources-data-offerings.md) if the
      column has been renamed. Renaming affects only join-standard category columns.

      These policies aren’t applied directly to the source table, only to the view registered by the collaboration.

      ```yaml
      schema_and_template_policies:
        HASHED_EMAIL:                                  # Source column name.
          category: join_standard
          column_type: hashed_email_b64_encoded        # Column is renamed to the column_type value.
        STATUS:
          category: passthrough
        AGE_BAND:
          category: passthrough
        DAYS_ACTIVE:
          category: passthrough
        INCOME_BRACKET:
          category: passthrough
      freeform_sql_policies:          # Apply agg, join, and masking policies created by the data owner to these columns.
        aggregation_policy:
          name: PROVIDER_DB.DATA_SCH.MIN_GROUP_SIZE_POLICY
          entity_keys:
            - HASHED_EMAIL_B64_ENCODED
        join_policy:
          name: PROVIDER_DB.DATA_SCH.EMAIL_JOIN_POLICY
          columns:
            - HASHED_EMAIL_B64_ENCODED    # This is the renamed column.
        masking_policies:
          - name: PROVIDER_DB.DATA_SCH.MASK_INCOME_POLICY
            columns:
              - INCOME_BRACKET
      ```
3. Register the data offering in the standard way by calling REGISTER_DATA_OFFERING.

## Running free-form queries (Analysis Runner)

When an analysis runner calls VIEW_DATA_OFFERINGS, if a value appears in the `freeform_sql_view_name` column, the free-form SQL view
can be queried directly, without using a template. All Snowflake policies applied to the source table or defined in the
[data offering’s](spec-data-offering.md) `freeform_sql_policies` section are enforced in the queries.

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS($collaboration_name);
```

| Column | Value |
| --- | --- |
| TEMPLATE_VIEW_NAME | `data_provider.provider_customers_V1.customers` |
| TEMPLATE_JOIN_COLUMNS | `hashed_email_b64_encoded` |
| ANALYSIS_ALLOWED_COLUMNS | `STATUS, AGE_BAND, DAYS_ACTIVE, INCOME_BRACKET` |
| ACTIVATION_ALLOWED_COLUMNS |  |
| **FREEFORM_SQL_VIEW_NAME** | `SFDCR_FREEFORM_SQL_DEMO.FREEFORM_SQL.DATA_PROVIDER_PROVIDER_CUSTOMERS_V1_CUSTOMERS` |
| FREEFORM_SQL_COLUMN_POLICIES | ```json {   "aggregation_policy": {"entity_keys": ["HASHED_EMAIL_B64_ENCODED"]},   "masking_policy": {"columns": ["INCOME_BRACKET"]},   "join_policy": {"columns": ["HASHED_EMAIL_B64_ENCODED"]},   "no_policy": {"columns": ["DAYS_ACTIVE", "AGE_BAND", "STATUS"]} } ``` |
| SHARED_BY | `data_provider` |
| SHARED_WITH | `["data_consumer"]` |
| DATA_OFFERING_ID | `provider_customers_V1` |

You must use the value from `freeform_sql_view_name`, not the value from `template_view_name`.

```sqlexample
SELECT status, COUNT(*) AS customer_count
  FROM SFDCR_FREEFORM_SQL_DEMO.FREEFORM_SQL.DATA_PROVIDER_PROVIDER_CUSTOMERS_V1_CUSTOMERS AS t
  GROUP BY status
  ORDER BY customer_count DESC;
```

## Example: Two-party collaboration

The following example demonstrates a two-party collaboration, where one party (the “provider”) is the collaboration owner and a data
provider for the consumer. The other party (the “consumer”) is an analysis runner who can run the template and use the data provided by the
provider, and also run free-form SQL queries on the data, subject to the policies defined in the data provider’s specification.

To run this example, you must have two separate accounts with Snowflake Data Clean Rooms installed.

You can either download the files and upload them to your Snowflake account, or copy and paste the example code into worksheets in two
separate accounts by using Snowsight.

File downloadsProvider codeConsumer code

Download the source SQL files, and then upload them into two separate accounts that have Snowflake Data Clean Rooms installed:

* [`Collaboration owner and data provider worksheet`](../../_downloads/83b7c30a5d9e249f0ca1d60339f889c9/collab-hub-freeform-sql-provider.sql)
* [`Collaboration query runner worksheet`](../../_downloads/fd47736d365306779ad4d94ac2c3ad5c/collab-hub-freeform-sql-consumer.sql)

```sqlexample
-- ============================================================================
-- Free-form SQL Collaboration Demo: Data Provider
-- ============================================================================
-- This example demonstrates a Snowflake Data Clean Rooms collaboration using
-- freeform SQL policies. The data provider creates a sample dataset with
-- Snowflake aggregation, join, and masking policies, registers a data offering
-- that permits freeform SQL queries, creates a template, and initializes a
-- collaboration with one other collaborator (data_consumer).
--
-- For more information, see:
--   docs.snowflake.com/user-guide/cleanrooms/free-form-sql.rst
--   docs.snowflake.com/user-guide/cleanrooms/spec-reference
-- ============================================================================

-- ============================================================================
-- SETUP: Create sample database, schema, table, and policies.
-- ============================================================================

USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE APP_WH;

-- You can't use secondary roles with most collaboration procedures.
USE SECONDARY ROLES NONE;

CREATE DATABASE IF NOT EXISTS PROVIDER_DB;
CREATE SCHEMA IF NOT EXISTS PROVIDER_DB.DATA_SCH;

-- Create a table with 300 rows from the sample CUSTOMERS table.
CREATE OR REPLACE TABLE PROVIDER_DB.DATA_SCH.CUSTOMERS AS
  SELECT HASHED_EMAIL, STATUS, AGE_BAND, REGION_CODE, DAYS_ACTIVE, INCOME_BRACKET
  FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
  LIMIT 300;

-- Create an aggregation policy that requires a minimum group size of 5.
CREATE OR REPLACE AGGREGATION POLICY PROVIDER_DB.DATA_SCH.MIN_GROUP_SIZE_POLICY
  AS () RETURNS AGGREGATION_CONSTRAINT ->
    AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);

-- Create an inactive join policy. You will modify this later.
CREATE OR REPLACE JOIN POLICY PROVIDER_DB.DATA_SCH.EMAIL_JOIN_POLICY
  AS () RETURNS JOIN_CONSTRAINT ->
    JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE);

-- Create a masking policy that replaces the original value with a fixed string.
CREATE OR REPLACE MASKING POLICY PROVIDER_DB.DATA_SCH.MASK_INCOME_POLICY
  AS (val STRING) RETURNS STRING ->
    '***MASKED***';

-- ============================================================================
-- Register a data offering with freeform SQL policies.
-- ============================================================================

-- The data offering enables freeform SQL queries (template_and_freeform_sql)
-- and attaches three Snowflake policies to protect data in freeform queries:
--   * Aggregation policy on hashed_email: enforces a minimum group size of 5.
--   * Join policy on hashed_email: requires joins to include this column.
--   * Masking policy on income_bracket: masks the column value in query results.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
  $$
  api_version: 2.0.0
  spec_type: data_offering
  version: V1
  name: provider_customers
  description: Customer dataset with freeform SQL policies.
  datasets:
    - alias: customers
      data_object_fqn: PROVIDER_DB.DATA_SCH.CUSTOMERS
      object_class: custom
      allowed_analyses: template_and_freeform_sql
      schema_and_template_policies:
        HASHED_EMAIL:
          category: join_standard
          column_type: hashed_email_b64_encoded
        STATUS:
          category: passthrough
        AGE_BAND:
          category: passthrough
        DAYS_ACTIVE:
          category: passthrough
        INCOME_BRACKET:
          category: passthrough
      freeform_sql_policies:
        aggregation_policy:
          name: PROVIDER_DB.DATA_SCH.MIN_GROUP_SIZE_POLICY
          entity_keys:
            - HASHED_EMAIL_B64_ENCODED
        join_policy:
          name: PROVIDER_DB.DATA_SCH.EMAIL_JOIN_POLICY
          columns:
            - HASHED_EMAIL_B64_ENCODED
        masking_policies:
          - name: PROVIDER_DB.DATA_SCH.MASK_INCOME_POLICY
            columns:
              - INCOME_BRACKET
  $$
);

-- Save the data offering ID returned by the registration call.
SET data_offering_id = '<data_offering_id>';

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS();

-- ============================================================================
-- Register a template with a simple one-table query.
-- ============================================================================

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: status_summary
  version: V1
  type: sql_analysis
  description: Returns a count of customers grouped by status.
  template:
    SELECT status, COUNT(*) AS customer_count
      FROM IDENTIFIER({{ source_table[0] }})
      GROUP BY status
      ORDER BY customer_count DESC;
  $$
);

-- Save the template ID returned by the registration call.
SET template_id = '<template_id>';

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_TEMPLATES();

-- ============================================================================
-- Create the collaboration.
-- ============================================================================

-- Replace the <...> placeholders with the appropriate values.
-- Get your account data sharing ID with:
--   SELECT CURRENT_ORGANIZATION_NAME() || '.' || CURRENT_ACCOUNT_NAME();
-- In this collaboration, the consumer can run templated and free-form queries
-- against the provider's data. The provider/owner isn't an analysis runner.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.INITIALIZE(
  $$
  api_version: 2.0.0
  spec_type: collaboration
  name: freeform_sql_demo
  owner: data_provider
  collaborator_identifier_aliases:
    data_provider: <provider_account_data_sharing_id>
    data_consumer: <consumer_account_data_sharing_id>
  analysis_runners:
    data_consumer:
      data_providers:
        data_provider:
          data_offerings:
            - id: <data_offering_id>
      templates:
        - id: <template_id>
  $$,
  'APP_WH'
);

SET collaboration_name = 'freeform_sql_demo';

-- INITIALIZE automatically joins the owner. Repeat until status is JOINED.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- Verify that the collaboration is visible.
-- Collaboration spec is in COLLABORATION_SPEC column.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS() ->>
  SELECT * FROM $1 WHERE "SOURCE_NAME" = $collaboration_name;

-- SWITCH TO data_consumer account to join and run analyses.

-- Update the join policy associated with HASHED_EMAIL_B64_ENCODED.
-- All queries on that data offering now require joins on HASHED_EMAIL_B64_ENCODED.
-- Re-run any of the previously successful free-form queries and they will fail.
ALTER JOIN POLICY PROVIDER_DB.DATA_SCH.EMAIL_JOIN_POLICY SET BODY ->
  JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE);

-- ============================================================================
-- CLEANUP: Delete the collaboration, registered resources, and sample data.
-- ============================================================================

-- Teardown is a multi-step process. Call TEARDOWN, then wait for GET_STATUS
-- to report LOCAL_DROP_PENDING, then call TEARDOWN again.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- When GET_STATUS reports LOCAL_DROP_PENDING, call TEARDOWN again to complete.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);
```

```sqlexample
-- ============================================================================
-- Free-form SQL Collaboration Demo: Data Consumer
-- ============================================================================
-- This example demonstrates joining a Snowflake Data Clean Rooms collaboration
-- as an analysis runner. The data consumer joins a collaboration created by
-- the data provider, views available templates and data offerings, runs an
-- analysis using the provider's template, and then runs several free-form SQL
-- queries directly against the data-offering views.
--
-- The data offering in this collaboration has three free-form SQL policies:
--   * Aggregation policy (hashed_email): minimum group size of 5.
--   * Join policy (hashed_email): joins must include this column. Currently inactive.
--   * Masking policy (income_bracket): values are replaced with '***MASKED***'.
--
-- For more information, see:
--   docs.snowflake.com/user-guide/cleanrooms/free-form-sql.rst
--   docs.snowflake.com/user-guide/cleanrooms/spec-reference
-- ============================================================================

-- ============================================================================
-- Join the collaboration
-- ============================================================================

USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE APP_WH;

-- You can't use secondary roles with most collaboration procedures.
USE SECONDARY ROLES NONE;

-- View available collaborations. Look for the collaboration created by the data provider.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS();

-- Use the SOURCE_NAME column value from the response to VIEW_COLLABORATIONS().
SET collaboration_name = 'freeform_sql_demo';

-- Use the OWNER_ACCOUNT column value from the response to VIEW_COLLABORATIONS().
SET collaborator_data_sharing_id = '<provider_data_sharing_id>';

-- Review the collaboration spec before joining.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.REVIEW($collaboration_name, $collaborator_data_sharing_id);

-- Join the collaboration. Joining is asynchronous; call GET_STATUS until JOINED.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.JOIN($collaboration_name);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- ============================================================================
-- View available templates and data offerings
-- ============================================================================

-- View data offerings shared with you in this collaboration.
-- Set a variable to use in future queries.
-- Note that the view name used by templates != the view name used for free-form SQL queries.
-- Templates use the TEMPLATE_VIEW_NAME value.
-- Free-form queries use the FREEFORM_SQL_VIEW_NAME value.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS($collaboration_name);
SET template_view_name = '<template_view_name>';
SET freeform_view_name = '<freeform_view_name>';

-- View templates available to you in this collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES($collaboration_name);

-- ============================================================================
-- Run an analysis using the provider's template
-- ============================================================================

-- Replace the placeholders with the template name/version from VIEW_TEMPLATES
-- and the view name from VIEW_DATA_OFFERINGS.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $$
  api_version: 2.0.0
  spec_type: analysis
  description: Count customers grouped by status.
  template: '<status_summary_template_name_and_version>'
  template_configuration:
    view_mappings:
      source_tables:
        - '<template_view_name>'
  $$
);

-- ============================================================================
-- Free-form SQL queries: Queries that SUCCEED
-- ============================================================================

-- The following queries run directly against the data-offering view.

-- Query 1: Count customers grouped by status.
-- Succeeds because the aggregation produces groups larger than 5.
SELECT status, COUNT(*) AS customer_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY status
  ORDER BY customer_count DESC;

-- Query 2: Count customers grouped by age_band.
-- Succeeds because the aggregation produces groups larger than 5.
SELECT age_band, COUNT(*) AS customer_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY age_band
  ORDER BY age_band;

-- Query 3: Select income_bracket to demonstrate the masking policy.
-- The query succeeds, but income_bracket values are replaced with '***MASKED***'
-- because the masking policy is applied to this column.
SELECT income_bracket, COUNT(*) AS customer_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY income_bracket;

-- Query 4: Combine masked and unmasked columns.
-- income_bracket is masked; status and age_band are not.
SELECT status, age_band, income_bracket, COUNT(*) AS customer_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY status, age_band, income_bracket
  ORDER BY customer_count DESC;

-- Query 5: Group by a high-cardinality column.
-- Succeeds, but shows no values for hashed_email_b64_encoded because
-- grouping by hashed_email_b64_encoded produces groups of 1.
SELECT hashed_email_b64_encoded, COUNT(*) AS row_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY hashed_email_b64_encoded;

-- ============================================================================
-- Free-form SQL queries: Queries that FAIL
-- ============================================================================

-- Query 6: Select individual rows without aggregation.
-- FAILS because the aggregation policy requires a minimum group size of 5.
SELECT hashed_email_b64_encoded, status, age_band
  FROM IDENTIFIER( $freeform_view_name ) AS t
  LIMIT 10;

-- Query 8: Select a column not listed in the data offering.
-- FAILS because region_code is not included in schema_and_template_policies,
-- so it is not exposed in the data-offering view, although it is present in the source data.
SELECT region_code, COUNT(*) AS customer_count
  FROM IDENTIFIER( $freeform_view_name ) AS t
  GROUP BY region_code;

-- SWITCH TO provider account, update the JOIN policy, and re-run the successful
-- queries, which will now fail.
```

---
title: Glossary of Snowflake Data Clean Room terms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/dcr-glossary.md
section: Clean Rooms
---

# Glossary of Snowflake Data Clean Room terms

Get to know these terms as they are used in Snowflake Data Clean Rooms. Some terms are used differently here than
in the rest of Snowflake.

Activate / activation
:   Exporting the results table of a query out of the clean room, either to a collaborator or to a third party. If
    allowed by the other party and the clean room settings, you can export query results to your own account or to an
    approved third-party partner, such as Google Ads or Meta Ads Manager.

Analysis runner
:   A [collaboration role](roles.md) that allows a collaborator to run templates and view
    results in a collaboration. An analysis runner can use data offerings shared with them by data providers.

Code bundle
:   A registered package of one or more custom Python functions or procedures that can be called by a template. Code
    bundles are defined using a YAML specification and registered by calling `REGISTRY.REGISTER_CODE_SPEC`. A
    template references a code bundle by its ID, and the template calls functions using the syntax
    `cleanroom.code_spec_name$function_name`.

Collaboration
:   A secure multi-party data sharing environment. A collaboration is defined by a YAML specification that lists the
    collaborators, their collaboration roles, and all resources (templates, data offerings, and so on) available in the
    collaboration. The collaboration owner creates the collaboration by calling INITIALIZE, and other collaborators
    join by calling JOIN.

Collaboration owner
:   A [collaboration role](roles.md) assigned to the collaborator who creates a
    collaboration by calling INITIALIZE. The owner defines the collaboration spec, including the list of
    collaborators, their roles, and the initial set of resources. Owners can’t act as analysis runners or
    data providers by default unless the collaboration specification grants them those roles explicitly.

Collaboration role
:   A role that describes the set of actions that a user can perform in a given collaboration. One user can have many
    collaboration roles in a collaboration. Roles include owner, data provider, and analysis runner. Not the same as an
    RBAC role. Learn more about roles at [Collaborator roles in Collaboration Data Clean Rooms](roles.md).

Collaborator
:   Any participant in a collaboration. Each collaborator is identified by an alias and has one or more collaboration
    roles (owner, data provider, analysis runner).

Column policy
:   Specified by a collaborator to indicate which of their data columns can be projected by other collaborators. A
    clean room column policy is determined entirely within a clean room, and isn’t derived from any Snowflake policies
    that might be applied to the source table outside of the clean room.
    [Learn more about column policies.](v1/policies.md)

Data offering
:   A package of one or more datasets that a data provider shares with specific analysis runners in a collaboration.
    Each dataset represents one source table or view owned by the data provider. A data offering is a live view of the
    data, not a snapshot, so any changes to the source data are reflected in the collaboration. Data offerings are
    registered in a registry and then linked into a collaboration.

Data provider
:   A [collaboration role](roles.md) that allows a collaborator to share data offerings with
    specific analysis runners in a collaboration. A data provider registers and links data offerings into the
    collaboration for other collaborators to use.

Dataset
:   A secure view of a single source table or view from a data provider. A data offering consists of one or more
    datasets. The data offering specification defines which columns to expose, what policies to apply, and whether the
    data can be queried by template only or also by free-form SQL for each dataset.

DCR privilege
:   A conceptual permission string used to grant access to specific Collaboration API procedures to a role. DCR
    privileges can be granted for individual objects or more general actions. DCR privileges include READ, CREATE
    COLLABORATION, and JOIN COLLABORATION. These privilege strings are passed into
    [GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE](collaboration-api-reference.md) and
    [GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE](collaboration-api-reference.md). To learn more, see
    [Managing access to collaborations, resources, and data](manage-access.md).

Free-form SQL
:   A mode of data access where an analysis runner can run arbitrary SQL queries directly against a data provider’s
    dataset, without using a template. The data provider enables this by setting `allowed_analyses: template_and_freeform_sql`
    in their data offering specification. Snowflake policies defined in the `freeform_sql_policies` section of the
    data offering are enforced on these queries. See [Free-form SQL queries](free-form-sql.md).

Differential privacy
:   An algorithmic and mathematical system that adds protection to individual rows or entities in a dataset by adding
    noise to numerical results and requiring grouping in queries to prevent exact values from being associated with
    exact rows or entities in the data.

Join policy
:   A policy set by a clean room collaborator that specifies which of their columns can be joined on in queries
    in that clean room. A clean room join policy is entirely independent of Snowflake join policies.
    [Learn more about join policies.](v1/policies.md)

Linking
:   Importing a resource into a collaboration. See [Resources](resources.md).

Local data offering
:   Local data offerings let standard edition accounts use their own tables in a collaboration. These offerings are not visible to any other collaborator, and template policies are not enforced. See [Run an analysis with your own data when you use Standard Edition](demo-flows/basic-multiparty-collab.md).

Linking
:   Importing a protected view of data into a clean room. The provider and consumer can both link their own data into
    a clean room to make it available to any queries supported by that clean room. Linking a table or view means
    creating a copy (a view) of the source data within the clean room, dynamically linked to the source table or view
    outside of the clean room.

Registry
:   An account-level container that stores resources such as templates, data offerings, and code bundles. You must
    register a resource in a registry before you can link it to a collaboration. Each account has a default registry
    that all users can access, and you can create custom registries to group and manage access to resources. Custom
    registries are private to the creator until access is explicitly granted to other roles. Learn more at
    [Registries](registries.md).

Resource
:   A reusable component that can be registered in a registry and linked into a collaboration. Resources include
    templates, data offerings, and code bundles. Each resource is defined by a YAML specification, has a name and
    version, and is registered by calling the appropriate REGISTRY procedure. Resources can be linked into a
    collaboration at creation time or added later.

SCO
:   *Secure Collaboration Orchestrator.* A Snowflake-managed account that manages a collaboration behind the scenes.
    The SCO creates an individual app package per collaboration, shares data with collaborators according to the
    collaboration definition, and enforces collaboration policies such as who can access which data using which
    templates. Costs associated with the SCO aren’t charged to users.

Secure view
:   When you link a table or view into the clean room, a secure view is created. This is an encrypted view based on
    the source table or view outside the clean room. The secure view is generally invisible to you, but might
    sometimes appear in an error message or when you are browsing the database objects using various tools, where you
    will see some name mangling of the original linked dataset. Unless directed otherwise, always refer to your data
    using the dataset name, which is identical to the linked source table or view.

Spec / specification / definition
:   A YAML document that defines a collaboration resource. Each resource type has its own specification schema,
    including collaboration specifications, data offering specifications, template specifications, analysis request
    specifications, and code bundle specifications. Specifications are passed to API procedures such as
    INITIALIZE, REGISTER_DATA_OFFERING, and REGISTER_TEMPLATE. See the
    [schema reference](spec-reference.md) for details.

Template
:   Each clean room has one or more templates, which are SQL queries written in JinjaSQL, provided by collaborators.
    The template provider specifies which analysis runners can use their templates. Depending on how they are written, a
    template can either be an analysis template, which returns results immediately, or an activation template, which saves results into the
    Snowflake account of the designated collaborator.

## Legacy Provider & Consumer Clean Room Terms

The following terms are used in Legacy Provider & Consumer Clean Rooms. For current terminology, see the
definitions above.

Provider
:   A clean room creator. The provider typically shares some data and the list of permitted queries that can be run
    in that clean room, and sets high-level clean room configurations.

Consumer
:   A person or account invited to use a clean room by the clean room provider. Consumers typically import their own
    data and run one or more queries supported by that clean room. However, a clean room can be configured to allow
    consumers to propose their own query, subject to approval by the provider.

Clean rooms UI
:   Or “UI” for short. The browser-based web application you can use to manage the Snowflake Clean Room environment,
    create new clean rooms, or use clean rooms to which you have been invited. This used to be called the “web app,”
    and you might still see that terminology used in some places.

---
title: Installing the Snowflake Data Clean Rooms environment
source: https://docs.snowflake.com/en/user-guide/cleanrooms/installing-dcr.md
section: Clean Rooms
---

# Installing the Snowflake Data Clean Rooms environment

## Before you begin

* If the Snowflake Data Clean Room environment is not installed for your account, follow the installation instructions on this page.
* If the clean rooms environment is installed for your account, and you want access to it, ask an administrator to provide you appropriate privileges to conduct clean room operations in your account.

## Supported regions

Snowflake Data Clean Rooms are available for Snowflake accounts in the following cloud regions:

| Cloud platform | Supported regions |
| --- | --- |
| Amazon Web Services (AWS) | * South America (Sao Paulo) * US East (N. Virginia) * US East (Ohio) * US West (Oregon) * Canada (Central) * Europe (London) * EU (Ireland) * EU (Frankfurt) * EU (Paris) * EU (Stockholm) * EU (Zurich) * Africa (Cape Town) * Asia Pacific (Mumbai) * Asia Pacific (Singapore) * Asia Pacific (Tokyo) * Asia Pacific (Osaka) * Asia Pacific (Seoul) * Asia Pacific (Jakarta) * Asia Pacific (Sydney) |
| Microsoft Azure | * Central US (Iowa) * East US 2 (Virginia) * Mexico Central (Querétaro) * South Central US (Texas) * West US 2 (Washington) * Canada Central (Toronto) * North Europe (Ireland) * Sweden Central (Gavie) * Switzerland North (Zurich) * UAE North (Dubai) * UK South (London) * West Europe (Netherlands) * Central India (Pune) * Southeast Asia (Singapore) * Japan East (Tokyo) * Korea Central (Seoul) * Australia East (New South Wales) |
| Google Cloud (GCP) | * US Central1 (Iowa) * US East4 (N. Virginia) * Middle East Central2 (Dammam) * Europe West (Frankfurt) * Europe West2 (London) * Europe West4 (Netherlands) |

## Requirements to install Snowflake Data Clean Rooms

### Account, installer, and user requirements

When you install the clean rooms environment, you install it for all potential users in the Snowflake account. However, access to the clean
rooms environment must be granted to users explicitly by a clean rooms administrator.

Here are the requirements to install Snowflake Data Clean Rooms in your Snowflake account:

* **The account must be the required** [Snowflake Edition](../intro-editions.md):

  + **To create collaborations and be an owner**, you must have Standard Edition or higher.
  + **To join a collaboration as an analysis runner**, you must have Standard Edition or higher.
  + **To join a collaboration as a data provider or activate data to another collaborator**, you must have Enterprise Edition or higher.
* **The installer must fulfill** these role and user requirements.
* **Reader accounts are not supported,** because reader accounts do not allow the data sharing required to install and run the clean rooms
  application.
* **You must accept data sharing terms.** If you have not accepted the
  [Snowflake Customer-Controlled Data Sharing Functionality Terms](//www.snowflake.com/legal/data-sharing-terms/), please contact
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). Snowflake Data Clean Rooms leverage [listings](../../collaboration/collaboration-listings-about.md), which are part
  of the Snowflake Service and subject to your Service terms with Snowflake, including the Snowflake Customer-Controlled Data Sharing
  Functionality Terms and
  [Snowflake Acceptable Use Policy](//www.snowflake.com/legal/acceptable-use-policy/).
* **You must unset any unsupported account-level parameters.** See the list of unsupported account-level settings.

If you do not meet all these requirements and need to upgrade, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Unsupported account-level parameters

Snowflake Data Clean Rooms does not support certain account-level parameter values. The following table shows the required values for these parameters:

| Parameter name | Required value | Notes |
| --- | --- | --- |
| DEFAULT_DDL_COLLATION | *No values supported, must be null* | [Account-level collation](../../sql-reference/collation.md) is not supported. |
| QUOTED_IDENTIFIERS_IGNORE_CASE | `false` |  |

To check a parameter in your account, run the following SQL command, substituting the parameter name for `<parameter_name>`:

```sqlexample
SHOW PARAMETERS LIKE '<parameter_name>' IN ACCOUNT;
```

For example:

```sqlexample
SHOW PARAMETERS LIKE 'DEFAULT_DDL_COLLATION' IN ACCOUNT;
```

### Role and user requirements

Here are the role requirements for the person installing the clean rooms environment:

* You must have an ACCOUNTADMIN role in a Snowflake account in order to install the clean rooms environment in that account.
* The user with the ACCOUNTADMIN role must have a valid first name, last name, and email defined for their user object. To check,
  run [DESCRIBE USER](../../sql-reference/sql/desc-user.md).

## Install the Snowflake Data Clean Rooms environment

Follow these steps to install the clean rooms environment in your Snowflake account.

You must always install the native app (Step 1), but after that you can enable the clean rooms API for code usage (Step 2).

### 1. Install the native application

Install the native application from the marketplace:

> 1. Set your current role to ACCOUNTADMIN
> 2. Install the [Snowflake Data Clean Rooms application](https://app.snowflake.com/marketplace/listing/GZSTZTP0KKO/snowflake-snowflake-data-clean-rooms)
>    from the Snowflake Marketplace
> 3. Select Open and accept the default options.

Installation takes several minutes. When done, proceed to step 2.

### 2. Install the clean rooms API

The clean rooms API is required to use clean rooms either through the UI or the API.

Here are the steps to install the clean rooms API in your Snowflake account:

1. After installing the native application, launch it in Snowflake. In the navigation menu, select Catalog » Apps »
   Snowflake Data Clean Rooms. Click the Open in Worksheet button at the top right corner.
   This opens a worksheet with SQL commands.
2. Run the SQL commands to install the clean rooms API, with the following notes:

   * If you renamed the native application during installation you will need to modify the script as indicated in the script comments.
   * If you want to review the full installation script before running it, uncomment the `DRY_RUN=TRUE` script line and run all commands
     up to and including that line to see the script contents. Note that you should **not run the installation script** exposed by that
     command manually, as it might result in an incomplete installation.
   * Note that installation takes several minutes.
3. Confirm that you can access the API:

   ```sqlexample
   USE ROLE SAMOOHA_APP_ROLE;
   USE WAREHOUSE app_wh;
   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.CHECK_MOUNT_STATUS();
   ```

   If this returns FALSE, confirm you are using SAMOOHA_APP_ROLE and if so please retry running the mount script command by ACCOUNTADMIN role again.

## Next steps

After you have installed the clean room environment on your account successfully, you can proceed with the following:

* [Add developers.](manage-access.md) Grant access to roles in your Snowflake account, so they can access the clean room environments based on specific privileges.
* [Enable Cross-Cloud Auto-Fulfillment.](laf.md) By default, clean rooms can be shared only with participants in the same underlying cloud region. To enable collaborations with collaborators in different cloud regions, you must enable Cross-Cloud Auto-Fulfillment for your account.
* [Enable automatic clean room version updates.](admin-tasks.md) Enable the clean rooms API environment to
  be updated automatically whenever Snowflake releases a new version. You can also install updates manually, but we recommend enabling
  automatic updates.

---
title: Inventory forecasting
source: https://docs.snowflake.com/en/user-guide/cleanrooms/inventory-forecasting-template.md
section: Clean Rooms
---

# Inventory forecasting

## About the template

The inventory forecasting template helps publishers and advertisers forecast ad inventory availability within a secure data clean room. By
analyzing advertiser demand against a publisher’s available ad supply and audience data, it allows for accurate prediction of future ad
impression opportunities. This helps publishers optimize ad allocation to prevent unsold inventory and maximize revenue, while enabling
advertisers to better plan campaigns by understanding available reach across key demographics and regions.

The template analyzes consumer demand patterns against provider supply capacity to forecast inventory requirements by region and
demographics. This template is designed as a provider-run template.

## Key use cases

* **Ad impression forecasting:** Forecast the number of available ad impressions for specific audience segments to improve campaign
  planning.
* **Audience targeting:** Identify and forecast the size of targetable audience segments to optimize ad spend and campaign reach.
* **Campaign pacing and delivery:** Ensure on-time and in-full campaign delivery by accurately forecasting ad inventory and preventing
  underspending.
* **Yield management:** Maximize revenue by forecasting high-demand ad inventory and adjusting pricing strategies accordingly.
* **Retail demand planning** (cross-industry example): A CPG brand forecasts consumer demand for a product in a specific region, helping a
  retail partner optimize stock levels to prevent running out of stock and improve sales.

## Get the worksheets and template

The code below includes a worksheet that demonstrates how to install and run an inventory forecasting custom template. The provider
worksheet includes the custom template code that you can use or modify.

Download the worksheets and install them in two separate Snowflake accounts in the same organization and the same cloud hosting
environment. These worksheets show how to create and run a clean room with an inventory forecasting template that you can use and modify.
The template includes a UI form so you can run the clean room either in code or in the clean rooms UI. The example enables the consumer to
run the analysis, and optionally to activate the results to the provider’s account.
[See instructions to upload a SQL worksheet into your Snowflake account](tutorials-and-samples.md).

To try out the templates, run the sample data generator first in both your provider and consumer accounts, to generate sample data to use
with the clean room.

* [`Download the Python sample data table generator`](../../_downloads/8ab8a0b52cfa960f33416890e9ea7bf0/inventory-forecasting-sample-generator.py).
  Run this to generate data that can be used as sample data for the consumer and provider worksheets.
* [`Download the consumer worksheet.`](../../_downloads/161c4de9ce8b933bccb601573fb5c7a2/inventory-forecasting-c.sql)
* [`Download the provider worksheet.`](../../_downloads/a27b1a2c0212d9c6f56398727c9032e7/inventory-forecasting-p.sql)

---
title: Inventory forecasting collaboration
source: https://docs.snowflake.com/en/user-guide/cleanrooms/collab-inventory-forecasting.md
section: Clean Rooms
---

# Inventory forecasting collaboration

## About the template

The inventory forecasting template helps publishers and advertisers forecast ad inventory availability within a collaboration data clean
room. By analyzing advertiser demand against a publisher’s available ad supply and audience data, it allows for accurate prediction of
future ad impression opportunities. This helps publishers optimize ad allocation to prevent unsold inventory and maximize revenue, while
enabling advertisers to better plan campaigns by understanding available reach across key demographics and regions.

This example demonstrates a two-party collaboration where the publisher is the collaboration owner and provides a data offering and two
templates: an analysis template and an activation template. The advertiser joins the collaboration, links their own data, and runs both
templates.

## Collaboration roles

| Collaborator | Roles | Actions |
| --- | --- | --- |
| Publisher | Owner, data provider | Registers a data offering (historical sales data), an analysis template, and an activation template. Creates the collaboration. After the advertiser activates results, the publisher views and processes the activation data. |
| Advertiser | Analysis runner, data provider (to self) | Registers a data offering (current stock levels). Joins the collaboration, links their data, runs the analysis template to view forecast results, and runs the activation template to send results to the publisher. |

## Key use cases

* **Ad impression forecasting:** Forecast the number of available ad impressions for specific audience segments to improve campaign
  planning.
* **Audience targeting:** Identify and forecast the size of targetable audience segments to optimize ad spend and campaign reach.
* **Campaign pacing and delivery:** Ensure on-time and in-full campaign delivery by accurately forecasting ad inventory and preventing
  underspending.
* **Yield management:** Maximize revenue by forecasting high-demand ad inventory and adjusting pricing strategies accordingly.
* **Retail demand planning** (cross-industry example): A CPG brand forecasts consumer demand for a product in a specific region, helping a
  retail partner optimize stock levels to prevent running out of stock and improve sales.

## Get the worksheets and template

Download the worksheets and install them in two separate Snowflake accounts in the same organization and the same cloud hosting
environment. These worksheets show how to create and run a collaboration with an inventory forecasting template that you can use and
modify. The advertiser runs the analysis template to view forecast results, and optionally runs the activation template to send results to
the publisher’s account.

### Step 1: Generate sample data

Generate sample data in both your publisher and advertiser accounts by running the Python sample data generator.

[`Download the Python sample data table generator`](../../_downloads/5f278b5d3db892b2d843bac7deb741e3/collab-inventory-forecasting-sample-generator.py).

> **Tip:**
>
> To run the sample data generator:
>
> 1. In Snowsight, go to **Projects** > **Worksheets** > **+** > **Python Worksheet**.
> 2. Paste the contents of the downloaded file into the worksheet.
> 3. Set **Handler** to `main` and **Return type** to `String`.
> 4. Update the `DATABASE_NAME` and `SCHEMA_NAME` variables with your values.
> 5. Select **Run**.

### Step 2: Run the publisher and advertiser worksheets

After generating sample data, download and run the publisher and advertiser worksheets. Run these worksheets using the same role you used to generate the sample data.
[See instructions to upload a SQL worksheet into your Snowflake account](tutorials-and-samples.md).

* [`Download the publisher worksheet`](../../_downloads/f8f43deb6c4409e5b5822cc3dfa4813b/collab-inventory-forecasting-publisher.sql).
* [`Download the advertiser worksheet`](../../_downloads/afe1b0265a73416263b7206079b8a39b/collab-inventory-forecasting-advertiser.sql).

---
title: Items installed with the Snowflake Data Clean Room environment
source: https://docs.snowflake.com/en/user-guide/cleanrooms/installed-artifacts.md
section: Clean Rooms
---

# Items installed with the Snowflake Data Clean Room environment

This topic provides information about objects created in your account when you install the Snowflake Data Clean Room environment and create or join a collaboration. For information about Provider and Consumer clean rooms, see [Snowflake Data Clean Rooms: Installed objects](v1/installation-details.md).

## High-level overview

The following diagram is a simplified representation of a two-party collaboration:

**Notes about the diagram:**

This diagram shows two collaborators that are using the Data Clean Rooms Collaboration API to create and manage a collaboration.

* Collaborator A is the owner and creator, as indicated by the collaboration definition YAML in the diagram.
* Both Collaborator A and B are data providers, as indicated by the data offering share in the diagram.
* Both collaborators A and B can act as analysis runners, if the collaboration definition allows it.
* Collaborator B has added a template to the collaboration.
* The *Secure Collaboration Orchestrator* (SCO) is a dedicated Snowflake account used to manage collaborations for all accounts in its region. There is an SCO for each region. The SCO for a collaboration is determined based on the owner’s account region.
* For each collaboration, the SCO creates an app package along with a listing. Collaborators install an application named `SFDCR_collaboration_name` from this listing, which provides them access to the collaboration.
* Collaborators interact with the collaboration through the DCR Collaboration API in their local SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.

Collaborators create data offerings, and the SCO shares that data with the collaborators according to the collaboration definition. The
SCO uses the collaboration, data offering, template, and analysis specifications to enforce collaboration policies, such as who can
access which data by using which templates; what data can be activated, and to whom, and whether free-form SQL access is provided.

## Applications

The following applications are installed when installing the Snowflake Data Clean Rooms environment or joining a collaboration:

Data Clean Rooms Native Application `SAMOOHA_BY_SNOWFLAKE`
:   Installed application during the installation of the Snowflake Data Clean Rooms environment. Each account has this bootstrapper application installed from the Snowflake Data Clean Rooms Marketplace listing. It provides library procedures, delegation roles, and helper functions used by the local DB, and operates on local DB and clean room objects.

Collaboration Application `SFDCR_collaboration_name`
:   Installed application per collaboration an account joins. It provides a COLLABORATION schema with secure views (such as DATA_OFFERINGS, TEMPLATE_SPECS, and CODE_SPECS) filtered to the installing account, and a COLLABORATION_INTERNAL schema with stored procedures for handling join, run, and leave operations. It writes to the clean room local DB and sends messages back to the SCO.

## Databases

The following databases are created when installing the Snowflake Data Clean Rooms environment or joining a collaboration:

`SFDCR_LOCAL_collaboration_name`
:   Contains information local to an installed collaboration, including activated data, and views of local-only data.

SAMOOHA_BY_SNOWFLAKE_LOCAL_DB
:   This database is created when installing the Snowflake Data Clean Rooms environment in your account. It is local
    to your account. It is not an application, but does contain application logic.

    This database has the following schemas:

    ADMIN schema
    :   Schema in the local DB for administrative functions including privilege management, version info, and external table analysis enablement.

    COLLABORATION schema
    :   Main schema in the local DB for collaboration clean room functionality. Contains tasks, streams, and procedures for message processing.

        REGISTRY schema
        :   Stores registered templates, data offerings, code specs, and the object-to-registry mapping table.

        `registry_name_REGISTRY` schema
        :   Schema created when you create a custom registry. For example, if you create a custom named `sales_data`, the system creates a schema called `sales_data_registry`.

## Shares and listings

Below are shares and listings that are involved and created per collaboration, depending on your role defined in the collaboration.

| Object Name/Format | Type | Description |
| --- | --- | --- |
| `SFDCR: SCO collaboration_id` | Incoming Listing | Listing shared by the SCO for every collaboration you create or are invited to. |
| `SCO_DATA_OFFERINGS_LISTING_hash` | Outgoing Listing | Listing name for data offerings shared from the data provider to collaborators. |
| `SCO_ACTIVATION_LISTING_hash` | Outgoing Listing | Listing name for activation result shared by an analysis runner to another collaborator. |
| `SCO_STAGED_CODE_LISTING_hash` | Outgoing Listing | Listing name for staged code shared from a code provider to an analysis runner for code execution. |
| `SCO_DATA_OFFERINGS_SHARE_hash` | Outgoing Share | Share created by a data provider to share data offerings (datasets, policies) to collaborators. |
| `SCO_ACTIVATION_SHARE_hash` | Outgoing Share | Share created by an analysis runner to share activation results back to another collaborator. |
| `SCO_STAGED_CODE_SHARE_hash` | Outgoing Share | Share created by a code provider to an analysis runner for code execution. |

## Tasks

Below are tasks related to operating the new Snowflake Data Clean Rooms environment. For tasks related to legacy provider & consumer clean rooms, refer to [Snowflake Data Clean Rooms: Installed objects](v1/installation-details.md).

| Task Name | Description | Warehouse |
| --- | --- | --- |
| `EXPECTED_VERSION_TASK` | Automatically upgrades the native app and local db as new versions are released.  Frequency: Triggered by request. | SAMOOHA_TASK_WAREHOUSE |
| `collaboration_name_hash_OWNER_AUTO_JOIN` | Task enabled by owner to auto-join a collaboration they initiate.  Frequency: Every 1 minute, suspends after 1 hour. | User specified warehouse |

## Sample data

Sample data is stored in the SAMOOHA_SAMPLE_DATABASE database. This database contains sample data tables named DEMO.CUSTOMERS and DEMO.CUSTOMERS_2 that you can use as test data.

> **Note:**
>
> The CUSTOMERS_2 table was added in September 2025. If you installed your clean rooms environment before then, you might not have this
> sample table installed. To see whether you have CUSTOMERS_2 installed, you can run the following SQL code:
>
> ```sqlexample
> SHOW TABLES LIKE 'CUSTOMERS_2' IN SCHEMA SAMOOHA_SAMPLE_DATABASE.DEMO;
> ```
>
> If the response contains no rows, then you, or someone with ACCOUNTADMIN role, must run the following command to install the sample table:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
> ```

## Warehouses

Snowflake Data Clean Rooms installs the following warehouses in your account. You can change the size of any warehouse
as needed. We recommend that you use XS warehouses for general clean room editing, creation, or deletion commands. Consider using larger warehouses, or Snowpark-optimized warehouses, when running large analyses, such as machine learning workloads.

[Learn how to view your warehouse usage costs.](cleanroom-cost.md)

| Warehouse name | Notes |
| --- | --- |
| APP_WH | XSMALL warehouse which is provided access by default to SAMOOHA_APP_ROLE. |
| SAMOOHA_TASK_WAREHOUSE | XSMALL warehouse used for operations such as auto-upgrades. |

---
title: Last touch attribution
source: https://docs.snowflake.com/en/user-guide/cleanrooms/last-touch-template.md
section: Clean Rooms
---

# Last touch attribution

## About the template

The last-touch attribution template provides a comprehensive last-touch
attribution analysis that allows businesses to measure the effectiveness of their marketing channels. By securely joining collaborator datasets in a Snowflake Data Clean Room, the analysis identifies the sequence of marketing touch points leading to a conversion.

The process involves joining collaborator 1 click data with collaborator 2 transaction data, ranking each touch point by time, and then attributing the
conversion to the most recent interaction. The final output aggregates key metrics like total conversions and conversion value by channel.
This helps businesses understand which channels are most effective at driving immediate conversions, enabling data-driven decisions for
optimizing marketing strategies and budget allocation.

This analysis attributes 100% of the conversion credit to the last marketing touch point a customer interacted with before converting. It
identifies the final click preceding a transaction and assigns the entire value of that conversion to that single channel.

This template activates the result of the collaborator 2 analysis to the collaborator 1 account.

## Key use cases

* **Channel performance analysis:** Identify which channels are driving the most conversions and have the highest conversion value.
* **Budget allocation:** Optimize marketing spend by allocating more budget to the channels that are performing well based on last-touch
  attribution.
* **Campaign optimization:** Understand the effectiveness of different campaigns in driving final conversions and optimize them for better
  performance.

## Get the worksheets and template

Download the worksheets and install them in two separate Snowflake accounts in the same organization and the same cloud hosting environment.
These worksheets show how to create and run a clean room with a last-touch attribution template that you can use and modify.
The example enables collaborator 2 to run the analysis, and optionally to activate the results to the collaborator 1 account.

### Step 1: Generate sample data

Generate sample data in both collaborator accounts by running the Python sample data generator.

[`Download the Python sample data table generator`](../../_downloads/a9580b36376aeb160880444db2305279/last-touch-sample-generator.py).

> **Tip:**
>
> To run the sample data generator:
>
> 1. In Snowsight, go to **Projects** > **Worksheets** > **+** > **Python Worksheet**.
> 2. Paste the contents of the downloaded `.py` file into the worksheet.
> 3. Set **Handler** to `main` and **Return type** to `String`.
> 4. Update the `DATABASE_NAME` and `SCHEMA_NAME` variables with your values.
> 5. Select **Run**.

### Step 2: Run the collaborator worksheets

After generating sample data, download and run the collaborator worksheets. Run these worksheets using the same role you used to generate the sample data.
[See instructions to upload a SQL worksheet into your Snowflake account](tutorials-and-samples.md).

* [`Download the collaborator 1 worksheet`](../../_downloads/4d0af0b113e2f5878e0f93abc5706f20/last-touch-attribution-collab-1.sql).
* [`Download the collaborator 2 worksheet`](../../_downloads/a44ec619d385c0ead74cf7be110d3c39/last-touch-attribution-collab-2.sql).

---
title: Manage clean room users and access
source: https://docs.snowflake.com/en/user-guide/cleanrooms/manage-dcr-users.md
section: Clean Rooms
---

# Manage clean room users and access

## Overview

This topic describes how a clean rooms account administrator manages user access to the clean rooms UI and API.

Clean rooms defines several application roles that permit access to the API
and various subsections of the UI. Access to the clean rooms UI and API are granted separately.
Typically, the account administrator creates a custom role, grants the desired application roles to
allow fine-grained access to the UI and API, then grants the role to various users in that account.

This strategy uses the Snowflake role-based access control (RBAC) model to delegate privileges appropriately to clean room users and
collaborators in their account. For more information about RBAC, see [Overview of Access Control](../security-access-control-overview.md). To see which
users were granted a specific role, run the following SQL command:

```sqlexample
SHOW GRANTS OF ROLE <role_name>;
```

> **Tip:**
>
> This topic uses the following terms:
>
> * Clean room users: Users who have been granted access to the clean rooms UI or API in their Snowflake account.
> * Clean room collaborators: Consumers invited to join a clean room by
>   the clean room provider. A collaborator is also a clean room user — that is, they must have access to the clean rooms UI or API to
>   be able to accept an invitation and use the clean room.

## Manage access to the clean rooms UI

An administrator grants access to the clean rooms UI in a Snowflake account by granting the appropriate roles, either directly or
indirectly. The administrator should also assign a default warehouse and grant USAGE privilege on it to UI users.

The following roles grant permission to manage or access the clean rooms UI:

* **ACCOUNTADMIN:** Role used to install or uninstall the clean rooms environment. This role also enables access to the
  Snowflake Admin page in the clean rooms UI. Account administrators use this page to manage the service user and account features
  such as Cross-Cloud Auto-Fulfillment, external and Iceberg tables, and dataset registration for UI users. This role has all clean room
  UI privileges; a user running as ACCOUNTADMIN doesn’t need any additional UI application roles.
* **MANAGE_CLEANROOMS:** Application role that provides the ability to create, update, delete, and install cleanrooms, and create, update,
  delete, and run analyses in the clean rooms UI.
* **MANAGE_DCR_PROFILE_AND_FEATURES:** Application role that provides access to the Profile & Features page in the Admin
  section of the UI, where you can manage the company profile and control which third-party connectors can be used in clean rooms.
* **MANAGE_DCR_CONNECTORS:** Application role that provides access to the Connectors page in the UI, where you can configure
  third-party connectors.
* **MANAGE_DCR_COLLABORATORS:** Application role that provides access to the Collaborators page in the UI, where you can manage the
  list of approved collaborators available to clean room providers in the UI. This role does not control the list of collaborators
  available to providers when using API; API users can invite anyone to collaborate. For more information, see
  Manage clean room collaborators.

**Example**

The following code shows how to create a custom role, grant that role several UI capabilities, and then grant the custom role to a user.

```sqlexample
-- Create the role.
USE ROLE ACCOUNTADMIN;
CREATE ROLE dcr_access;

-- Grant capabilities to the new role.
GRANT APPLICATION ROLE SAMOOHA_BY_SNOWFLAKE.MANAGE_CLEANROOMS TO ROLE dcr_access;
GRANT APPLICATION ROLE SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_COLLABORATORS TO ROLE dcr_access;
GRANT APPLICATION ROLE SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_PROFILE_AND_FEATURES TO ROLE dcr_access;
GRANT APPLICATION ROLE SAMOOHA_BY_SNOWFLAKE.MANAGE_DCR_CONNECTORS TO ROLE dcr_access;

-- Assign the role to a user.
-- You must also grant access to a default warehouse to the role.
GRANT USAGE ON WAREHOUSE <your_warehouse> TO ROLE dcr_access;
ALTER USER <some_user> SET DEFAULT_WAREHOUSE  =  <your_warehouse>;
GRANT ROLE dcr_access to USER <some_user>;
```

## Manage API users

API access is managed using roles. The following Snowflake roles are used to access or manage the API:

* **ACCOUNTADMIN:** The role used to install or uninstall the Clean Rooms environment. This role does not include SAMOOHA_APP_ROLE; to use
  the API, you must use SAMOOHA_APP_ROLE.
* **SAMOOHA_APP_ROLE:** This role grants full permission to the clean rooms API in this account. (This role is used by the clean rooms UI to communicate with the API.)
* **Run-only developer role:** Someone using SAMOOHA_APP_ROLE can
  grant usage on a limited-access role. This role, also called a *run role*, grants permission to
  use a subset of API procedures on a subset of clean rooms in the consumer context. These limited roles can be granted to roles to provide
  scoped usage in your account for specific users, such as data analysts.

### Grant or revoke full API access

The SAMOOHA_APP_ROLE role grants a user full API access to all clean rooms in a Snowflake account. This role has usage on all
warehouses installed with clean rooms.

**Grant full API access:**

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT ROLE SAMOOHA_APP_ROLE TO USER <user_name>;
```

**Revoke full API access:**

```sqlexample
USE ROLE ACCOUNTADMIN;
REVOKE ROLE SAMOOHA_APP_ROLE FROM USER <user_name>;
```

### Grant limited API access (run roles)

You can grant limited API access to specified clean rooms in your account. Limited access grants the ability to call only a
[subset of consumer procedures](consumer.md), such as `consumer.run_analysis`, but not the ability
to install, create, join, or modify a clean room. This access level is sometimes called a *run role*, because it involves creating a role
with limited API access and granting that role to users.

Here is how to grant limited access to a user:

1. A user who can grant the SAMOOHA_APP_ROLE role creates a new role, and assigns limited functionality to that role. The role must also
   be granted USAGE to any warehouses that they will use to access the clean rooms.

   ```sqlexample
   -- Create the role.
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE MARKETING_ANALYST_ROLE;

   -- Grant USAGE on one of the basic clean room warehouses.
   -- You can grant USAGE to any warehouses that you want them to use.
   GRANT USAGE ON WAREHOUSE APP_WH TO MARKETING_ANALYST_ROLE;

   -- Grant limited functionality to the role for a subset of clean rooms.
   CALL samooha_by_snowflake_local_db.consumer.grant_run_on_cleanrooms_to_role(
     [$cleanroom_1, $cleanroom_2],
     'MARKETING_ANALYST_ROLE'
   );

   -- Grant the role to a user.
   GRANT ROLE MARKETING_ANALYST_ROLE TO USER george.washington;
   ```
2. The user then uses their limited role to perform specific actions in the clean room account:

   ```sqlexample
    -- User george.washington logs in and uses the limited role.
    USE WAREHOUSE APP_WH
    USE ROLE MARKETING_ANALYST_ROLE;
    USE SECONDARY ROLES NONE;

    -- Consumer-run analyses should succeed.
    CALL samooha_by_snowflake_local_db.consumer.run_analysis(
      $cleanroom_name,
      'prod_overlap_analysis',
      ['MY_DB.MYDATA.CONVERSIONS'],  -- Consumer tables
      ['MY_DB.MYDATA.EXPOSURES'],      -- Provider tables
      object_construct(
        'max_age', 30
      )
    );

   -- Clean room creation and management procedures fail.
   CALL samooha_by_snowflake_local_db.provider.cleanroom_init($cleanroom_name, 'INTERNAL');
   ```

### Revoke limited API access

* To revoke run privileges on a specific clean room from a specific role, call [revoke_run_on_cleanrooms_from_role](consumer.md).
* To revoke all granted run privileges from a single user, revoke the role from the user.

## Manage clean room collaborators

*Collaborators* are users invited to join a clean room as a consumer by a clean room provider.

**When using the clean rooms UI**, creators can invite collaborators from a list that is managed by someone using the
MANAGE_DCR_COLLABORATORS role.

**When using the clean rooms API**, providers are not limited by a predefined collaborators list, and can add any clean room collaborator
by Snowflake account locator. If you want to invite a collaborator without a Snowflake account, you must first
[create a clean room managed account](managed-accounts.md) for them. Remember that you invite an account to
collaborate, not an individual user. Any user with clean rooms access in the invited account can join and use the clean room.

> **Note:**
>
> If a Snowflake collaborator has an account in a different region than your Snowflake account, your account administrator must
> [enable Cross-Cloud Auto-Fulfillment](laf.md) before you can add them as a collaborator.

Clean rooms UIClean rooms API

To manage the collaborator list used in the clean rooms UI, you need the MANAGE_DCR_COLLABORATORS role.

1. Navigate to the [Snowflake Data Clean Rooms login page](https://cleanroom.c1.us-east-1.aws.app.snowflake.com).
2. In the left navigation, select Collaborators.
3. Do one of the following:

   * If the collaborator has a Snowflake account, select Snowflake Partners » + Snowflake Partner. Respond to the
     prompts to enter the details of the collaborator’s Snowflake account.
   * If the collaborator is not a Snowflake customer:

     1. You can contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to grant you the ability to create a [managed account](managed-accounts.md).
     2. When you have been granted that ability, return to the Collaborators pane and select the Managed Accounts tab to
        create a clean room managed account for your collaborator.

When using the clean rooms API, providers can add collaborators by calling the `provider.add_consumers`
procedure.

If the collaborator is not a Snowflake customer, someone must
[create a clean room managed account](managed-accounts.md) for them.

---
title: Managing access to collaborations, resources, and data
source: https://docs.snowflake.com/en/user-guide/cleanrooms/manage-access.md
section: Clean Rooms
---

# Managing access to collaborations, resources, and data

## Overview

Access to collaborations, and the ability to perform actions in a collaboration, are managed using the following mechanisms:

* **Permission to install applications for the account** is required to install the Data Clean Room application and is held by ACCOUNTADMIN by default.
* **Permission to run specific Collaboration API procedures** is managed by DCR privileges.
* **Permission to perform specific role-based actions** in a collaboration is managed by [Collaboration roles](roles.md).
  These roles determine what a user can do in a specific collaboration. The
  [collaboration definition](spec-collaboration.md) must list you as an analysis runner to be able to run an analysis.
  You must be listed as a data provider to share data with a specified analysis runner.

These mechanisms are overlapping, and all requirements must be fulfilled to be able to perform a specific action on a specific resource.
For example, for you to share a table `my_data` with `user_1` in existing collaboration `collab_1`, all of the following requirements
must be met:

* You must be a designated data provider for `user_1` in the collaboration, and `user_1` must be an analysis runner in that
  collaboration (*collaboration role*).
* You must have permission to call the appropriate Collaboration API procedures to link the data offering into the collaboration (*DCR privilege*).
* You must have the REFERENCE_USAGE privilege with GRANT OPTION on the table `my_data` to register it as a data offering resource (*RBAC privilege*).

This topic describes how to manage DCR privileges. [Data policies](resources-data-offerings.md) and
[collaboration roles](roles.md) are described separately.

## Use DCR privileges to manage account, object, and procedure privileges

The SAMOOHA_APP_ROLE role has privileges to run all procedures in the Collaboration API. This role may have more widely-scoped access than you would like to grant to some groups of users in your account.
Although collaboration roles limit what actions a user may perform, you may also provision specific roles with more precise and limited permissions.

Once the Snowflake Data Clean Rooms app has been installed, additional Data Clean Room-specific privileges may be assigned to specific users.

To grant granular API privileges to a user, take the following steps:

1. Create a role.
2. Grant usage on the warehouse being used to the role.
3. Call [GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE](collaboration-api-reference.md) if needed to grant appropriate privileges on a specific collaboration to a role.
4. Call [GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE](collaboration-api-reference.md) if needed to grant appropriate high-level privileges on all collaborations in the account to the role.
5. Grant the role to the user, who can now call collaboration procedures to participate in the collaboration.

For example, `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE( 'JOIN COLLABORATION', 'collab_join_role' )` grants `collab_join_role` permission
to call JOIN, REVIEW, RUN, LEAVE, VIEW_DATA_OFFERINGS, and many other API procedures needed to join and use a collaboration. In contrast,
`GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry_1', 'registry_reader_role')` grants `registry_reader_role` the
permission to read from a single registry. You can grant multiple sets of privileges to the same role.

See which DCR privileges must be granted to be able to call a given Collaboration API procedure
if you aren’t using SAMOOHA_APP_ROLE.

Here is an example of creating a role named COLLABORATION_CREATOR that can create a collaboration, create a custom registry, and register
data offerings, and granting the role to the current user.

```sqlexample
CREATE ROLE IF NOT EXISTS COLLABORATION_CREATOR;

-- Grant warehouse access to the role.
GRANT USAGE ON WAREHOUSE APP_WH TO ROLE COLLABORATION_CREATOR;

-- COLLABORATION_CREATOR needs these manual account-level privileges,
-- which are required by the CREATE COLLABORATION DCR privilege.
GRANT APPLY ROW ACCESS POLICY ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT CREATE APPLICATION ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT CREATE DATABASE ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT CREATE LISTING ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT CREATE SHARE ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT IMPORT SHARE ON ACCOUNT TO ROLE COLLABORATION_CREATOR;
GRANT MANAGE SHARE TARGET ON ACCOUNT TO ROLE COLLABORATION_CREATOR;

GRANT ROLE COLLABORATION_CREATOR TO USER alexander_hamilton;

-- Grant DCR account-level privileges using GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE.
-- This procedure requires the ACCOUNTADMIN role.

-- COLLABORATION_CREATOR: create collaborations, create registries,
-- and register data offerings.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE(
  'CREATE COLLABORATION', 'COLLABORATION_CREATOR');
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE(
  'CREATE REGISTRY', 'COLLABORATION_CREATOR');
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE(
  'REGISTER DATA OFFERING', 'COLLABORATION_CREATOR');
```

The following code uses the role COLLABORATION_CREATOR to create a custom registry, and then grants read access on that registry to the EU_SALES_TEAM role:

```sqlexample
USE ROLE COLLABORATION_CREATOR;
USE WAREHOUSE APP_WH;
USE SECONDARY ROLES NONE;

-- Create a custom registry.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.CREATE_REGISTRY(
  'DATA_REGISTRY_EU',
  'DATA OFFERING');

-- Grant read permission on a registry created by this role to the role EU_SALES_TEAM.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE(
  'READ',
  'REGISTRY',
  'DATA_REGISTRY_EU',
  'EU_SALES_TEAM');
```

## DCR privilege requirements for Collaboration API procedures

If you’re using a custom role (rather than SAMOOHA_APP_ROLE), the following table summarizes the privileges required to run each
Collaboration API procedure.

Unless noted otherwise, privileges in a bulleted list are typically alternatives: you need only one of the privileges listed to run the
specified procedure.

| Procedure name | Access requirements |
| --- | --- |
| [REGISTER_TEMPLATE](collaboration-api-reference.md) | **Default registry:** `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER TEMPLATE', 'role name')`  **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'registry name', 'role name')`. |
| [VIEW_REGISTERED_TEMPLATES](collaboration-api-reference.md) | **Default registry:**   * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED TEMPLATES', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry name', 'role name')`. |
| [ADD_TEMPLATE_REQUEST](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   If the template is in a custom registry, or references a code spec in a custom registry, you must also have the READ privilege on the registry. |
| [REMOVE_TEMPLATE](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [VIEW_TEMPLATES](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW TEMPLATES', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   Additionally, to see objects registered in a custom registry, you need the READ privilege on that registry. |
| [ENABLE_TEMPLATE_AUTO_APPROVAL](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [DISABLE_TEMPLATE_AUTO_APPROVAL](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [GET_CONFIGURATION](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [SET_CONFIGURATION](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [REGISTER_DATA_OFFERING](collaboration-api-reference.md) | **Default registry:** `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER DATA OFFERING', 'role name')`  **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'registry name', 'role name')`.  Additionally, the caller needs the following RBAC privileges:   * SELECT on the source table/view. * USAGE on the database and schema containing the source table. * USAGE on any policy objects referenced in the spec. |
| [LINK_DATA_OFFERING](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   Additionally, the caller must have the REFERENCE_USAGE privilege with GRANT OPTION on any data to be shared. If you don’t, you’ll get a “missing reference usage grant” error. [Learn how to handle this issue.](v2/troubleshooting.md)  If the data offering is in a custom registry, you must also have privileges granted by calling `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry name', 'role name')`. |
| [UNLINK_DATA_OFFERING](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   The `UPDATE` privilege on a collaboration doesn’t grant access to this procedure. Additionally, only the role that called JOIN can successfully unlink data offerings, because the underlying share is owned by the joining role. |
| [LINK_LOCAL_DATA_OFFERING](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [UNLINK_LOCAL_DATA_OFFERING](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [VIEW_REGISTERED_DATA_OFFERINGS](collaboration-api-reference.md) | **Default registry:**   * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED DATA OFFERINGS', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry name', 'role name')`. |
| [VIEW_DATA_OFFERINGS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW DATA OFFERINGS', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   Additionally, to see objects registered in a custom registry, you need the READ privilege on that registry. |
| [REGISTER_CODE_SPEC](collaboration-api-reference.md) | **Default registry:** `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER CODE SPEC', 'role name')`  **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'registry name', 'role name')`. |
| [VIEW_REGISTERED_CODE_SPECS](collaboration-api-reference.md) | **Default registry:**   * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED CODE SPECS', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   **Custom registry:** You have read and write privileges on any custom registry that you created yourself. To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry name', 'role name')`. |
| [VIEW_CODE_SPECS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   Additionally, to see objects registered in a custom registry, you need the READ privilege on that registry. |
| [VIEW_UPDATE_REQUESTS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [APPROVE_UPDATE_REQUEST](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE UPDATE REQUEST', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [REJECT_UPDATE_REQUEST](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE UPDATE REQUEST', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [INITIALIZE](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges   See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions. |
| [TEARDOWN](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions. |
| [GET_STATUS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [ENABLE_EXTERNAL_TABLE_ANALYSIS _FOR_COLLABORATION](collaboration-api-reference.md) | You must use a role that has been granted the MANAGE FIREWALL_CONFIGURATION privilege on the account. |
| [VIEW_COLLABORATIONS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW COLLABORATIONS', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [REVIEW](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REVIEW COLLABORATION', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions. |
| [JOIN](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions. |
| [LEAVE](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges   See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions. |
| [RUN](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [VIEW_ACTIVATIONS](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW ACTIVATIONS', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [PROCESS_ACTIVATION](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('PROCESS ACTIVATION', 'COLLABORATION', 'collaboration name', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges |
| [CREATE_REGISTRY](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE REGISTRY', 'role name')` |
| [VIEW_REGISTRIES](collaboration-api-reference.md) | * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTRIES', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')` * `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE REGISTRY', 'role name')` |
| [GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE](collaboration-api-reference.md) | * **For collaboration objects:** Any role with CREATE COLLABORATION or JOIN COLLABORATION can call this procedure on any collaboration. * **For registry objects:** Only the role that created the registry can call this procedure on that registry. |
| [GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE](collaboration-api-reference.md) | You need the ACCOUNTADMIN role, or a role with the MANAGE GRANTS global privilege, to run this procedure. |

---
title: Managing clean room environment updates
source: https://docs.snowflake.com/en/user-guide/cleanrooms/managing-updates.md
section: Clean Rooms
---

# Managing clean room environment updates

This topic describes manging updates by the administrator of a Snowflake Data Clean Room. For information about installing the clean room environment
in your Snowflake account, see [Installing the Snowflake Data Clean Rooms environment](installing-dcr.md).

> **Important:**
>
> If you’re on version 12.3 or earlier, you must perform a manual update even if automatic
> upgrades are enabled. After this one-time manual update, re-enable automatic upgrades to resume receiving new updates.

## Updating the clean rooms environment

Snowflake Data Clean Rooms updates their binaries weekly to support new features, procedures, and UI updates. You can find release notes for
significant new releases in the [feature updates section](../../release-notes/new-features.md) of the Snowflake release notes page (search
for “clean rooms”).

### Clean rooms API updates

A clean rooms administrator can either enable automatic API updates (recommended) or update the API environment manually for each new
release, as described next.

#### Automatic API updates

A clean rooms API administrator can enable clean rooms updates to be installed automatically upon release by running the following SQL commands once in their account:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
```

Clean rooms API users in that account will see the updates shortly when they are rolled out, without needing to log out.

#### Manual API updates

We recommend enabling automatic clean room updates for your account. But if you prefer to update your account’s API environment manually, you can do so by running the following SQL commands each time you want to update the environment:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.apply_patch();
```

You can find your release number by running the following SQL command:

```sqlexample
SELECT * FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.VERSION;
```

> **Note:**
>
> Delaying updates for multiple releases can result in longer `apply_patch` execution times, because the patch must apply
> each skipped version sequentially. To minimize update times, apply patches regularly or enable automatic updates.

---
title: Managing Cross-Cloud Auto-Fulfillment in Collaboration Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/laf.md
section: Clean Rooms
---

# Managing Cross-Cloud Auto-Fulfillment in Collaboration Data Clean Rooms

## About Cross-Cloud Auto-Fulfillment

Collaborators can seamlessly share data with collaborators in different cloud regions. In order to do so, they must enable
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) on their account. Only collaborators sharing or activating data must enable this on their account.

When Cross-Cloud Auto-Fulfillment is used in a collaboration:

* Data is replicated into the cloud region of each collaborator that can access that data.
* Data is also replicated into the owner’s region for orchestration purposes via the Secure Collaboration Orchestrator (SCO). However, the collaboration owner’s ability to
  access the data is determined by the data offering’s sharing rules.
* Collaborators in a different cloud region experience some data lag due to the
  replication frequency.

## Enabling Cross-Cloud Auto-Fulfillment

Cross-Cloud Auto-Fulfillment must be enabled in the account of any collaborator that needs to share data with an account in another cloud hosting region. If this feature isn’t enabled for an account or role in an account, follow the below steps:

1. Enable Cross-Cloud Auto-Fulfillment for an account

   * **To check if Cross-Cloud Auto-Fulfillment is enabled on an account,** an organization administrator must call [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](../../sql-reference/functions/system_is_global_data_sharing_enabled_for_account.md).
   * **To enable Cross-Cloud Auto-Fulfillment on an account,** an organization administrator must call [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../sql-reference/functions/system_enable_global_data_sharing_for_account.md).
2. Enable Cross-Cloud Auto-Fulfillment for a role in an account

   * **To delegate privileges to another role in an account,** an ACCOUNTADMIN role can grant the MANAGE LISTING AUTO FULFILLMENT privilege to other roles in the account. For more information, see [Manage privileges for auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

> **Note:**
>
> Each user who will call REVIEW or JOIN must have a first and last name set with a validated email address on
> their Snowflake user profile. See [Verify the email addresses of the email notification recipients](https://docs.snowflake.com/en/user-guide/notifications/email-notifications#label-email-notification-verify-address)
> for instructions.

## Refresh frequency for cross-region accounts

Update requests and shared data between collaborators in different cloud regions are subject to a 10-minute refresh schedule. This schedule is not configurable.

## Costs associated with cross-region collaboration

Additional costs are incurred when collaborators are in different cloud regions. For more information
about how these costs are incurred, see [Auto-fulfillment costs](../../collaboration/provider-understand-cost-auto-fulfillment.md).

## Limitations on cross-region collaboration

The following limitations exist on cross-region collaboration:

* Collaborators can’t link external tables in collaborations. See [list of supported objects](../../collaboration/provider-understand-auto-fulfillment-objects.md).
* See [additional considerations when enabling cross-region collaboration](../../collaboration/provider-listings-auto-fulfillment.md).

---
title: Managing Cross-Cloud Auto-Fulfillment in Snowflake Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/enabling-laf.md
section: Clean Rooms
---

# Managing Cross-Cloud Auto-Fulfillment in Snowflake Data Clean Rooms

## About Cross-Cloud Auto-Fulfillment

In the default clean room environment, a clean room can be shared only with accounts in the same cloud region. That is, the
provider and consumer must be in the same cloud region.

If you want to collaborate with a collaborator whose account is in a different region than you, you must enable
[Cross-Cloud Auto-Fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md) for your clean
room environment and your clean room as shown on this page.

You can determine your own cloud region by running `SELECT CURRENT_REGION();`

> **Note:**
>
> Cross-Cloud Auto-Fulfillment is sometimes referred to as *LAF*, which stands for [listings auto-fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md).

## Enabling Cross-Cloud Auto-Fulfillment

You can enable Cross-Cloud Auto-Fulfillment using either the API or the UI. However, note the limitations for cross-region collaboration.

### Prerequisites

In order to enable Cross-Cloud Auto-Fulfillment for an account, an org admin for all collaborators must first enable it on the account by
calling [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../../sql-reference/functions/system_enable_global_data_sharing_for_account.md).

Learn more about [auto-fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md) and [managing auto-fulfillment privileges](../../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

### Enabling Cross-Cloud Auto-Fulfillment in the UI

A clean rooms administrator enables Cross-Cloud Auto-Fulfillment at the account level for all
new and existing clean rooms by following these steps:

1. [Sign in to the clean rooms UI](web-app-introduction.md) with your administrator account.
2. Browse to Admin > Snowflake Admin.
3. Toggle on Cross-Cloud Auto-Fulfillment.
4. No additional steps are required by the provider or consumer when creating or joining a clean room in the UI. However, if you
   later create or join a clean room in the API, you must follow the API instructions for providers and consumers.

### Enabling Cross-Cloud Auto-Fulfillment in the API

Follow these instructions to create or install a clean room in the API, even if you have already enabled Cross-Cloud Auto-Fulfillment in
the UI.

#### Account administrator actions

To enable Cross-Cloud Auto-Fulfillment for an account using the API, administrators in both the provider and consumer accounts must run the following SQL code using the ACCOUNTADMIN role. You need to run this only once per account.

```sqlexample
USE ROLE ACCOUNTADMIN;
-- Optionally check first to see if LAF is enabled on the account.
CALL samooha_by_snowflake_local_db.library.is_laf_enabled_on_account();

-- If LAF is not enabled, enable it.
CALL samooha_by_snowflake_local_db.library.enable_laf_on_account();
```

#### Provider and consumer actions

After Cross-Cloud Auto-Fulfillment is enabled for an account, here is how to enable Cross-Cloud Auto-Fulfillment
when creating or installing a clean room:

1. **The provider** publishes the clean room in the normal way by calling `provider.create_or_update_cleanroom_listing`.
2. **The consumer** installs the clean room by calling `consumer.install_cleanroom`. If the consumer is in a different cloud region from
   the provider, `consumer.install_cleanroom` fails with a message that Cross-Cloud Auto-Fulfillment replication is being installed.
3. **The consumer** continues to call `consumer.install_cleanroom` until it returns success. Installation takes several minutes.

   At this point, the consumer has basic clean room functionality. To support client custom template requests, provider-run
   analyses, and provider activation, follow this additional step:
4. **The provider** calls `provider.mount_request_logs_for_all_consumers` until the procedure reports success. This means that communication from the consumer to the provider is enabled.

**Full setup code example:**

1. **Provider:** The provider creates, shares, and publishes a clean room in the standard way.

   ```sqlexample
   USE WAREHOUSE APP_WH;
   USE ROLE SAMOOHA_APP_ROLE;

   SET cleanroom_name = 'LAF example';
   SET consumer_locator = '<CONSUMER_LOCATOR>';
   SET consumer_account_name = '<CONSUMER_DATA_SHARING_ACCOUNT_ID>';

   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.cleanroom_init($cleanroom_name);

   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.set_default_release_directive(
     $cleanroom_name,
     'V1_0', '0');

   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.add_consumers(
     $cleanroom_name,
     $consumer_locator,
     $consumer_account_name);

   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.create_or_update_cleanroom_listing($cleanroom_name);
   ```
2. **Consumer:** The consumer installs the clean room.

   ```sqlexample
   USE WAREHOUSE APP_WH;
   USE ROLE SAMOOHA_APP_ROLE;

   SET cleanroom_name = 'LAF example';
   SET provider_locator = '<PROVIDER_LOCATOR>';

   -- Initial call starts the process and returns a cross-cloud/region replication failure.
   -- Continue to call this procedure until it returns a success message.
   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.consumer.install_cleanroom(
     $cleanroom_name,
     $provider_locator);

   -- Continue with standard clean room configuration and use.
   -- The consumer can run analyses, but client custom templates, provider run, and provider analysis
   -- aren't supported until the provider takes the action shown in the next step.
   ...
   ```
3. **Provider:** After the consumer installs the clean room, the provider must mount the requests share to enable
   request-based actions between the provider and consumer. Request-based actions include provider requests to run an analysis and consumer
   requests to add a template to the clean room.

   ```sqlexample
   -- Call mount_request_logs_for_all_consumers until it reports success.
   provider.mount_request_logs_for_all_consumers($cleanroom_name);
   ```

   Full provider/consumer functionality is now available.

## Refresh frequency for cross-region accounts

Requests and data between the provider and consumer when on different cloud regions are subject to replication frequency settings.

### Requests and data from provider to consumer

This includes all data and requests from the provider to the consumer, such as creating or updating a clean room, changing provider data, requests for permission (such as provider-run analyses), and approvals for requests (such as consumer templates).

You can change the provider to consumer refresh rate by calling [set_laf_dcr_refresh_schedule](../provider.md).

| Data | Default refresh rate |
| --- | --- |
| Provider clean room data, such as the following:   * Provider datasets * Provider-run requests * Clean room policies * Provider clean room metadata | * Clean rooms created after July 24, 2025: **30 minutes**. * Older clean rooms: Default to your   [account’s replication refresh schedule](../../../sql-reference/parameters.md) (or 24 hours, if not set). |

### Requests and data from consumer to provider

The following table shows the default refresh frequency for data and requests from the consumer to the provider.

You can [control the consumer to provider refresh rate](../../../collaboration/provider-listings-auto-fulfillment-update-refresh-frequency.md)
for each clean room.

| Data | Default refresh rate |
| --- | --- |
| Requests, approvals, and changes such as the following:  * Requests to provider (such as a request to add a template) * Approvals to provider (such as an approval for provider-run analyses) * Changes to linked consumer data. * Status and results for provider-run requests. | * Clean rooms created after July 24, 2025: **10 minutes** * Older clean rooms: **1 hour** |
| Provider activation data: | * Clean rooms created after July 24, 2025: **10 minutes** * Older clean rooms: **15 minutes** |

## Costs associated with cross-region collaboration

There are additional costs associated with collaborators who are in a different region. For more information about how these costs are
incurred, see [Auto-fulfillment costs](../../../collaboration/provider-understand-cost-auto-fulfillment.md).

## Limitations on cross-region collaboration

The following limitations exist on cross-region collaboration:

* When using the clean rooms UI, you can enable cross-region collaboration with other UI users in the [same UI gateway region](web-app-introduction.md). For example, accounts in AWS US East (Ohio) can share with accounts in AWS US West (Oregon) because they have the same UI gateway region (AwS US East (N. Virginia). Accounts in AWS US East (Ohio) can’t collaborate with accounts on AWS Canada, because they don’t share a gateway region. However, any account can be configured for cross-region collaboration when using the API.
* A provider cannot use differential privacy in the clean room.
* Collaborators cannot link external tables and iceberg tables in clean rooms.
* A consumer cannot run a multi-provider analysis.
* An account cannot act as both provider and consumer in cross-cloud collaboration scenarios due to replication type conflicts that can
  occur.
* See [additional considerations when enabling cross-region collaboration](../../../collaboration/provider-listings-auto-fulfillment.md).

---
title: Multi-party insights
source: https://docs.snowflake.com/en/user-guide/cleanrooms/collab-multi-party-insights.md
section: Clean Rooms
---

# Multi-party insights

## About the template

This template demonstrates a three-party data clean room use case built on Snowflake Collaboration Data Clean Rooms. It brings together three distinct
datasets representing different parties in an advertising and measurement workflow:

* **Publisher (Exposures):** Ad exposure data including hashed email, IP address, impression date, ad group, and campaign ID,
  representing audience impressions on an ad platform.
* **Advertiser (Purchases):** Purchase transaction data including hashed email, purchase date, platform, location, and purchase amount,
  representing customer buying behavior across mobile, desktop, and CTV.
* **Identity Partner (Customer Spine):** Identity resolution data including hashed email, state, and filing type, serving as a third-party
  spine that enriches the collaboration with geolocation and demographic attributes.

The template joins all three datasets using standardized join keys (hashed email SHA-256) to surface aggregated insights on purchase
behavior broken out by geolocation. Specifically, the analysis computes the total number of purchases and average spend amount per state
and ad group across the three parties.

This three-party join is a key capability of Snowflake Collaboration Data Clean Rooms, going beyond the one-to-one provider-consumer clean room
models. The collaboration specification controls which parties can run which templates and access which data offerings, enabling flexible
permissioning. For example, the identity provider can contribute data without having access to run any analysis templates.

Two SQL templates are included:

* **Audience Overlap:** A simple overlap query that returns distinct hashed emails found across the joined datasets, useful for audience
  activation.
* **State Spend Analysis:** An insights query that joins all three tables to compute total purchases and average spend per state and ad
  group, delivering actionable campaign performance insights.

## Collaboration roles

| Collaborator | Roles | Actions |
| --- | --- | --- |
| Publisher | Owner, data provider | Registers a data offering (ad exposure data) and two analysis templates (audience overlap and state spend analysis). Creates the collaboration. |
| Advertiser | Analysis runner, data provider | Registers a data offering (purchase transaction data). Joins the collaboration, links their data, and runs the analysis templates. |
| Identity Partner | Data provider | Registers a data offering (customer spine with geolocation and demographic attributes). Joins the collaboration and links their data. Doesn’t run any analysis templates. |

## Key use cases

* **Identity resolution:** Securely match customer records across publisher, advertiser, and third-party identity providers using hashed
  join keys, enabling a unified view of audiences without exposing raw PII.
* **Audience overlap analysis:** Identify shared audiences between a publisher, advertiser, and identity data provider to evaluate match
  rates, refine targeting strategies, and activate matched segments for campaigns.
* **Purchase attribution by geography:** Attribute purchase behavior to specific ad groups and geographic regions by joining advertiser
  transaction data with publisher exposure data and a third-party identity spine.
* **Campaign performance optimization:** Aggregate total purchases and average spend by state and ad group to understand which campaigns
  and regions are driving the most value, enabling data-driven budget allocation.

## Get the worksheets and template

You can run this example in two ways:

* **Single account:** Run the entire example in one Snowflake account, where one account plays all three roles.
* **Three accounts:** Run the example across three separate Snowflake accounts in the same organization and the same cloud hosting
  environment, with each account playing a different role (publisher, advertiser, identity partner).

[See instructions to upload a SQL worksheet into your Snowflake account](tutorials-and-samples.md).

### Single-account method

Download and run the following worksheets in order in a single Snowflake account. The first worksheet generates sample data and registers
all three data offerings and both templates. The second worksheet creates the collaboration, joins, and runs both analysis templates.

1. [`Download the data registration worksheet`](../../_downloads/6a7294068e1c2ce8c3636d1265ea7d86/multi-party-insights-owner-registration.sql).
   Run this first to generate sample data and register data offerings and templates.
2. [`Download the single-account worksheet`](../../_downloads/da03f9fd3a4566fc25866b508db73109/multi-party-insights-single-account.sql).
   Run this to create the collaboration, join, and run both analysis templates.

### Three-account method

Download and run the following worksheets across three separate Snowflake accounts. The publisher registers data and templates, creates the
collaboration, and runs analyses. The advertiser and identity partner each register their own data, review, and join the collaboration.

**Publisher account (Account 1):**

1. [`Download the owner registration worksheet`](../../_downloads/6a7294068e1c2ce8c3636d1265ea7d86/multi-party-insights-owner-registration.sql).
   Run this first to generate sample data and register data offerings and templates.
2. [`Download the publisher worksheet`](../../_downloads/790d41e7d02dbbd4ddca40cdfff84c9d/multi-party-insights-publisher.sql).
   Run this to create the three-party collaboration, join, and run analyses.

**Advertiser account (Account 2):**

* [`Download the advertiser worksheet`](../../_downloads/43340b7bc74bc2b500eb8268ccf9e3eb/multi-party-insights-advertiser.sql).
  Run this to generate purchase data, register a data offering, review and join the collaboration, and run the analysis.

**Identity partner account (Account 3):**

* [`Download the identity partner worksheet`](../../_downloads/7a1265f07164a2f9ed853af52e22a1c0/multi-party-insights-identity-partner.sql).
  Run this to generate customer spine data, register a data offering, and review and join the collaboration.

---
title: Overview of Provider and Consumer Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/getting-started.md
section: Clean Rooms
---

# Overview of Provider and Consumer Clean Rooms

This page provides an overview of how Snowflake Data Clean Rooms work. If you are a Snowflake administrator and want to install clean
rooms in your account, read [Installing the Snowflake Data Clean Rooms environment](installing-dcr.md).

> **New!:**
>
> Snowflake Data Clean Room has a new collaboration experience Generally Available. [Read about and try out our new Collaboration Clean Rooms experience.](about.md)

## Overview of Snowflake Data Clean Rooms

Data clean rooms are configurable, isolated Snowflake environments where collaborators can import data, specify what queries can be run
against that data, and configure data protection settings such as differential privacy and specifying joinable and projectable columns. Access
to a clean room is by invitation only.

Clean rooms don’t support monetization features. Providers are billed for various background processes required to enable clean rooms;
the account running a query is billed standard Snowflake costs for the data and compute usage. For more information about costs, see
[Understanding cost](cleanroom-cost.md).

You must be invited by a clean room provider to be able to access a clean room. If you want to open up your clean room to a larger
audience, you must provide a way for potential collaborators to contact you to provide their Snowflake account for you to invite (or an
email address for [non-Snowflake users](managed-accounts.md)).

Here is a high-level overview of how Snowflake Data Clean Rooms work:

### Clean Room environment installation

The Snowflake Data Clean Room environment is installed once for an entire Snowflake account (not once per user or per clean
room) by someone with ACCOUNTADMIN privileges on the Snowflake account.

The administrator configures the environment to specify which users in the account can create clean rooms and run queries, which users have
API access, which accounts can be invited to collaborate in a clean room, what data a clean room creator can import into the clean room,
and which (if any) third-party services can be used to export query results from any clean room created in this account.

If a clean room environment has already been installed for your account, reach out to your clean rooms administrator for access. If a clean
room environment has not been installed for your account, [learn how to install the environment](installing-dcr.md).

After installing and configuring the environment, the administrator grants permission to other Snowflake users to use the clean
rooms UI, API, or both.

> **Learn more:**
>
> * Learn how to [install and configure the Clean Room environment](installing-dcr.md).
> * By default, you can share clean rooms only with accounts in the same web hosting region. The administrator can
>   [enable sharing with accounts in other regions](v1/enabling-laf.md).
> * See [other tasks that clean rooms administrators perform](admin-tasks.md).

> **Note:**
>
> * **If you were emailed an invitation to join a clean room,** you already have clean rooms installed in your Snowflake account. You can
>   read the rest of this page to learn more about clean room usage, but you don’t need to install anything, only to join the clean room.
> * **If you are an account administrator** and the clean room environment is not installed in your Snowflake account, [learn how to install the clean room environment for your Snowflake account](installing-dcr.md).
> * **If you are not an account administrator,** ask an account administrator whether Snowflake Data Clean Rooms is installed for your
>   account. If not, ask them to install it and grant you access. If it is, ask them to grant you permission to access clean rooms.
> * **If you are a developer and want API access,** ask a clean rooms administrator to
>   [grant you access to the API](manage-dcr-users.md).

### Creating a clean room

A Snowflake account administrator grants permission to users in their Snowflake account to be able to create clean rooms. The account
that creates a clean room is called a *provider* for that clean room. Providers can configure and share clean rooms with users in other
Snowflake accounts (or even non-Snowflake users). When a clean room is shared with you, you are called a *consumer* for that clean room.

After creating a clean room, the provider *links* (imports) tables or views into it, specifies what queries can be run against their data,
which columns in their data can be joined or appear in the results, and what can be done with the results.

The provider then invites consumers to join the clean room, link their own tables and views, and run one of the queries specified by the
provider. Consumers must be pre-approved by a clean rooms administrator before they can be invited to a clean room.

> **Learn more:**
>
> * Clean rooms can be created either in code or using the clean rooms UI. Permission to create a clean room is granted differently for
>   [web users](manage-dcr-users.md) and [coders](manage-dcr-users.md).
> * Tables can be imported from both Snowflake accounts and [non-Snowflake Iceberg tables](register-data.md) on
>   [AWS](external-data-aws.md), [Azure](external-data-azure.md), and
>   [Google](external-data-gcp.md).
> * Before data can be imported into a clean room, it must be [registered](register-data.md) by a user with admin
>   privileges on the source data.
> * You can invite both Snowflake and [non-Snowflake users](managed-accounts.md) to join a clean room.
> * Learn more about the provider role in clean rooms.
> * During development, you can [use the same account for both provider and consumer roles](v1/developer-introduction.md),
>   though with only a subset of clean room functionality.

### Joining a clean room

After creating and configuring a clean room, the provider sends invitations to users in other accounts to join the clean room. These
invited users are called *consumers*, or sometimes *collaborators*. Consumers invited through the clean rooms UI
receive an emailed invitation to join the clean room. Snowflake users must have the Clean Room environment installed in order to be
invited to join a clean room, but you can [invite non-Snowflake users](managed-accounts.md) to join a clean room.
A Snowflake account must be allowlisted by a clean rooms administrator before a clean room creator can invite users in that account.

(In the clean rooms UI, both “join” and “install” are used to describe when a consumer accepts a clean room invitation. This is because a
clean room must literally be installed in the consumer’s clean room environment.)

After joining a clean room, a consumer imports (links) any data needed for the templates in that clean room, specifies how their data can
be accessed, such as which columns can be joined or projected, provides any template-specific filters or other parameters, then runs the
template. Consumers can specify a repeating run of the template, if desired. Results can be viewed in the browser, or downloaded. If the
provider has enabled activation and the consumer approves, the consumer can export the results to the approved locations (their own
Snowflake account, or a third-party activation connector designated by the provider).

Data imported into a clean room cannot be queried or viewed directly by either party — either the provider or the consumer — but can
only be accessed through a template in the clean room. A template is a SQL query installed in the clean room by the provider or consumer,
and permission must be given by the other party to use it in the clean room.

Each party also sets access rules on their own data, including which columns can be joined, projected, or exported, and which
templates can be run in the clean room. Each party can delete their data from the clean room at any time.

By default, only a consumer can run templates in a clean room, but the provider can ask permission from the consumer to run a specified
template in the clean room.

> **Learn more:**
>
> * Clean rooms support [differential privacy](differential-privacy.md). Differential privacy can be enabled and
>   configured by either the provider or consumer.
> * Learn more about the consumer role in Snowflake Data Clean Rooms.

### Templates

Every clean room has one or more *templates* installed. A template is a JinjaSQL query that typically includes run-time parameters provided by the
template runner. These parameters enable users to specify column or table names or WHERE clause filters. You cannot simply run
arbitrary SQL queries in a clean room (unless a provider [grants that ability](v1/web-app-sql-template.md)); most
clean room usage is limited to templates submitted by the provider or consumer and approved by the other party.

Snowflake provides a few stock templates for common use cases such as audience overlap and reach and frequency templates. You can also
create custom templates to use in your clean room. Snowflake Data Clean Rooms supports any valid JinjaSQL template.

Templates can be run in the clean rooms UI or in code. Template results can be viewed or downloaded, or can be shared to the provider, the
consumer, or an approved third-party if *activation* is allowed in that clean room.

> **Learn more:**
>
> * By default, only consumers can run a template in a clean room. However, a provider can
>   [ask permission of the consumer to run a template](demo-flows/provider-run-analysis.md) in a clean room.
> * The template and clean room configuration define what can be done with the query results. If the query results are exported outside the
>   clean room, this is called [activation](v1/activation.md). Results can be activated to a
>   Snowflake account of the provider or consumer, or to a [Snowflake-approved third party](connector-activation.md).

### Clean room variations

The most common clean room, as described above, is one where a provider imports data and specifies one or more specific queries that can
be run against the data and how the results can be shared, and the consumer imports their own data and runs the permitted queries against
the combined data. However, a provider can permit several variations on the standard clean room:

* [Allow the provider to run their own queries against consumer data.](demo-flows/provider-run-analysis.md). By
  default, only the consumer can run queries in a clean room. If enabled for a clean room, a provider can request permission from the
  consumer to run a specific query in the clean room.
* Allow the query results to be exported [(activated)](v1/activation.md) to the Snowflake account
  of the person running the query or to a Snowflake-approved third-party account, such as Meta Ads Manager or The Trade Desk. Exporting
  data outside the clean room is always subject to approval by all parties who shared the data being queried.
* Allow either party to [include custom Python code](demo-flows/custom-code.md) that can be called by
  the query they run. This code typically filters or manipulates the data in some way as the query is being run; it cannot take external
  actions such as saving a file, exporting data, or performing other actions.
* Allow the query to [access data in other clean rooms](overview.md), subject to approval by the
  providers of all the clean rooms being accessed.
* [Chain multiple queries together.](developer-template-chains.md)

### About providers and consumers

Clean room collaborators are classified as either a *provider* or a *consumer* for a given clean room. A provider is the account that
creates a clean room; a consumer is the account with whom a clean room is shared. You cannot invite someone in the same account where you
created a clean room to act as a consumer for that clean room. All users in the same Snowflake account have the same clean room role
(provider or consumer) for the same clean rooms in that account.

The provider and consumer roles apply at the Snowflake account level, not the individual user level. That is, if
you create clean room `cleanroom1` using Snowflake account ABC, then share `cleanroom1` with account XYZ, all ABC users with access
to `cleanroom1` are providers, and all XYZ users with access to `cleanroom1` are consumers.

Whether you are a provider or consumer is determined solely by whether you created or were shared a clean room, not by any Snowflake roles
or other permissions.

Here is more information about the provider and consumer roles.

> **Tip:**
>
> Sometimes the word *collaborator* is used to mean a consumer or anyone with access to a given clean room.

#### Providers

A *provider* is defined as the account that created a clean room. Anyone accessing the clean room from that account is considered to be a
provider for that clean room.

Providers perform the following clean room actions:

* Create, share, and delete clean rooms
* Specify who can use a clean room as a consumer
* Import data into a clean room
* Define which templates can be run in a clean room
* Specify whether consumers can run a custom template in a clean room
* Specify which templates are used in a clean room, and create custom templates for the clean room
* Run queries on consumer data, if the consumer consents
* Permit chained templates
* Load python script into a clean room to use in a template
* Permit provider data from this clean room to be queried with data from other specified clean rooms in a consumer query
* Enable or disable differential privacy for the clean room or consumer
* Manage versioning of the clean room
* Set column and join policies on their own data

#### Consumers

A *consumer* is defined as an account that was extended an invitation by a provider to join (install) a clean room.

Consumers perform the following clean room actions (according to the clean room configuration):

* Join (install) a clean room for their account
* Import data into the clean room
* Run any queries supported by the clean room
* Export query results as enabled by the clean room
* Request permission to use their own template in a clean room
* Specify whether providers can run a template in the clean room (by default, only consumers can run a template)
* Allow the clean room provider to run queries against the consumer’s data
* Run a query that spans their data and provider data from multiple clean rooms, if the providers in all the affected clean rooms agree.
* Load python script into the clean room (with the permission of the provider)
* Set column and join policies on their own data
* Set differential privacy settings for provider-run queries

## Ways to access Snowflake Data Clean Rooms

Snowflake Data Clean Rooms provide both a no-code browser-based application (the clean rooms UI) and an API to create and manage clean
rooms. Currently the clean rooms UI and API are not exactly equivalent in capabilities. Here is a summary of the differences:

| UI-only features | API-only features |
| --- | --- |
| * Environment management tasks, such as clean room logo, name, and description, the list of available activation or identity   connectors. * Managing the list of administrator, provider, and (potential) consumer accounts. * Scheduling repeating runs of a template. (You can schedule runs using other scripting tools such as cron jobs.) * Using identity providers. | * Creating custom templates, either provider or consumer * Creating template chains * Multi-provider analysis * Consumer-level access control on tables and templates (`restrict_table_options_to_consumers` and `restrict_template_options_to_consumers`). |

Note that you can create a clean room using the clean rooms UI and then use or manage it in the API, and vice versa.

### Clean rooms UI

Snowflake data clean rooms can be managed and run in a browser. You can use the clean rooms UI to create, manage, and use clean rooms as
a provider or consumer, or to configure various account-level features, such as managed accounts, third-party connectors, and features
for UI users.

The clean rooms UI is accessed at a separate URL from Snowsight. You can [find the login URL here](v1/web-app-introduction.md).

**Permissions and access:** You must be [granted access to use the clean room UI](manage-dcr-users.md) by a clean
room administrator. The clean rooms UI uses your Snowflake credentials.

[Try out the clean rooms UI tutorial](v1/tutorials/cleanroom-web-app-tutorial.md) or
[read more about the clean rooms UI](v1/web-app-introduction.md).

### API

Snowflake provides a number of stored procedures to create, manage, and run clean rooms. These procedures can be called through Snowsight
notebooks or worksheets or any interface where you can run stored procedures in your Snowflake account.
The API doesn’t enable clean room account administration; to administer a clean room account you must use the clean rooms UI.

**Permissions and access:** To use the API, you must be
[granted access to use the SAMOOHA_APP_ROLE](manage-dcr-users.md) by a clean rooms administrator for your Snowflake
account.

[Read about the Clean Room API](v1/developer-introduction.md) or
[try out the API tutorial](tutorials/cleanroom-api-tutorial-basic.md).

### Is the clean rooms environment installed in your Snowflake account?

Here is how to tell whether the clean rooms UI or API is installed in your account:

SnowsightClean rooms API

To see whether Snowflake Data Clean Rooms is installed:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps » Installed Apps.
3. Look to see whether Snowflake Data Clean Rooms appears in your Installed Apps list.

* Run `SHOW ROLES LIKE 'SAMOOHA_APP_ROLE';` to see if the API is installed in your account. If the role appears, the clean rooms
  environment is probably installed.
* Run `SELECT IS_ROLE_IN_SESSION('SAMOOHA_APP_ROLE');` to see whether you have access to the API.
* Run `SHOW GRANTS ON ROLE SAMOOHA_APP_ROLE;` to see what roles can grant SAMOOHA_APP_ROLE, which is required to use the API.

---
title: Overview of Snowflake Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/overview.md
section: Clean Rooms
---

# Overview of Snowflake Data Clean Rooms

This topic provides a high-level guide to the components that make up a collaboration, and outlines the basic steps in creating or using a Snowflake Data Clean Room collaboration.

## Requirements

* You must be [updated to the latest version of Snowflake Data Clean Rooms](admin-tasks.md).
* You need access to the Data Clean Rooms Collaboration API to see or manage collaborations. For more information, see [Managing access to collaborations, resources, and data](manage-access.md).
* Data Providers must use Snowflake Enterprise Edition. Owners and Analysis runners can use Standard Edition.
* If you use Snowflake Standard Edition, you can not share data through a data clean room with policy enforcement. However, you can access data offerings from other collaborators, or [use your own data](demo-flows/basic-multiparty-collab.md) without policies or sharing it.
* To [activate results](activation.md) to another Snowflake account, you must use Snowflake Enterprise Edition.
* Trial accounts don’t support Snowflake Data Clean Rooms.

## Roles and resources in a collaboration

To understand how to use a collaboration, you must first understand collaboration roles and collaboration resources.

### Collaboration roles

The following roles are available in a collaboration. These roles define the high-level capabilities of the collaborator.

* **Owner:** The owner defines, creates, and owns the collaboration, and defines which collaborators are invited and their collaboration
  roles. An owner isn’t automatically an analysis runner or a data provider, and doesn’t have any elevated run privileges. The owner’s
  main abilities are to create the clean room, assign collaboration roles, determine who can share data with whom, and tear down the
  clean room. A collaboration can have only one owner.
* **Data provider:** Provides data offerings, such as tables and views, to a collaboration, and specifies which analysis runners can
  use them. That is, account A is a data provider to accounts B and C, as specified in the collaboration specification.
* **Analysis runner:** Runs permitted templates on permitted data offerings, as specified by the collaboration specification.

These roles are designated in the collaboration specification that is used to create the collaboration.

A collaborator can be assigned multiple collaboration roles, and (except for the owner role) a collaboration role can be assigned to
multiple collaborators.

> **Learn more:**
>
> * [Learn more about roles](roles.md)

### Collaboration resources

A collaboration contains resources, including data offerings, templates, and code bundles. All resources, and the collaboration itself, are defined by YAML specifications.

Collaborations support the following types of resources:

* **Template:** A JinjaSQL query that analysis runners can execute in the collaboration. Depending on the type of template, results can
  either be delivered directly, or *activated* (saved) to the Snowflake account of a designated collaborator. Analysis runners can pass
  values into a template at run time to replace template variables used for column names, WHERE clauses, and other query elements.
* **Data offering:** A package of one or more tables shared by a data provider with specific analysis runners. A data offering is a live
  view of the source data, not a snapshot, and its specification controls which columns are exposed and what policies apply.
* **Code bundle:** A set of custom Python functions or procedures that can be called by a template. Code bundles let you extend template
  capabilities with user-defined logic such as machine learning models or custom transformations.

> **Learn more:**
>
> * [Collaboration resources](resources.md)
> * [Collaboration resource schemas](spec-reference.md).
> * [Design custom templates](custom-templates.md)
> * [Code bundles](resources-code-bundles.md)

### Example clean room specification

Here is the YAML specification for a basic clean room that involves two participants, `alice` (an alias for account `corp1.acct123`),
and `bob` (an alias for account `corp2.acctxyz`). The specification assigns roles to each user and links two data offerings into the
collaboration.

```yaml
api_version: 2.0.0
spec_type: collaboration
name: basic_collaboration
owner: alice                # alice is the collaboration owner.
collaborator_identifier_aliases:
  alice: corp1.acct123
  bob: corp2.acctxyz
analysis_runners:
  alice:                    # alice is also an analysis runner.
    data_providers:
      alice:                # alice provides data to herself.
        data_offerings:     # alice provides these data offerings.
        - id: alice_data_1
        - id: alice_data_2
      bob:                  # bob provides data to alice.
        data_offerings:     # bob provides this data to alice.
        - id: bob_data_1
    templates:              # alice can use this template with any data she can access.
    - id: template1
  bob:                      # bob is an analysis runner
    data_providers:         # bob can use data from the following data providers.
      alice:
        data_offerings:     # alice provides the following data to bob.
        - id: alice_data_1
    templates:              # bob can use this template with any data he can access.
    - id: template2
```

This simple collaboration includes the following resources and collaboration roles:

* `alice` is the collaboration owner, an analysis runner, and a data provider for herself and `bob`.
* `bob` is an analysis runner, and a data provider for `alice`, but *not* for himself.
* `alice` can run `template1`, `bob` can run `template2`.

Other things to note about this collaboration:

* No new collaborators can be added after the collaboration is created from this specification.
* Both `alice` and `bob` can add new templates, and share them with any other collaborators.
* Roles can’t be changed, so `bob` can’t become a data provider to himself later.
* Any data provider can add or remove data offerings in their data offerings list, even after the collaboration is created.

## Basic clean room collaboration workflow

Here is a simple clean room collaboration scenario:

1. The collaboration owner optionally registers any templates or data offerings that they want to appear in the initial configuration of
   the collaboration.
2. The owner optionally asks any intended collaborators to register any templates or data offerings that they want to appear in the initial
   configuration of the collaboration. Collaborators then give the resource IDs of any items that they registered.
3. The owner then creates a collaboration. The collaboration specification defines the collaborators, their roles, and any resources that
   should be available in the initial state of the collaboration.

   * At this point, the set of collaborators and their collaboration roles is fixed.
   * If the collaboration includes collaborators in other cloud hosting regions, they must
     [enable Cross-Cloud Auto-Fulfillment on their account](laf.md) before they can review and join the
     collaboration.
   * When the collaboration is created, it will become visible and joinable by all collaborators in the collaboration spec.
4. Collaborators review and join the collaboration.
5. Collaborators can then optionally link resources into the collaboration,
   as appropriate for their roles. Data providers can link data offerings to their analysis runners; any role can request to add a
   template and share it with any other collaborator.
6. Analysis runners can then run any templates shared with them in the collaboration, using any data offerings shared with them in the
   collaboration. The analysis runner bears the cost of the analysis. Templates can either return query results in the response or
   [activate results to the caller or another collaborator](activation.md).

> **Learn more:**
>
> * [See how to implement a basic two-party collaboration, with end-to-end code.](tutorials/collaboration-basic-api-tutorial.md)
> * See additional examples in the Use cases section of the Data Clean Rooms documentation.

### Creating a collaboration

Any Snowflake data clean rooms user with appropriate privileges can create a clean room. A clean room is defined using a YAML specification
that determines all the collaborators and their relative roles in the collaboration, as well as any resources present in the initial
configuration of the collaboration. (The resource owners must join before the resources can be used.) Resources can be added or removed
after the collaboration is created, but the list of collaborators and their relative roles is fixed after the collaboration is created.

Collaborations aren’t versioned: a collaboration can change with the addition or removal of resources, but those changes aren’t tracked.

> **Learn more:**
>
> * [Managing access to collaborations, resources, and data](manage-access.md)
> * [Collaboration specification](spec-collaboration.md)

### Adding resources to a collaboration

A collaboration can access resources, including templates, data offerings, and code bundles. To use a resource in a collaboration, you must
first *register* it with the collaboration clean rooms environment, then *link* it into a specific collaboration:

* *Registration* is an account-level action; it packages and copies the resource into the clean rooms environment, and returns an ID that
  is used to reference that resource. A resource is registered in a registry, either the default registry for your account, or a custom
  registry that someone in your account created. The default registry is available to any collaborator in the account with READ REGISTRY
  privileges; a custom registry can be access-controlled by the registry creator.
* *Linking* shares a registered resource with a specific collaboration. More specifically, it shares a registered resource with a specific
  set of collaborators in a specific collaboration. You can link a resource either by adding it to the collaboration specification used to
  create a collaboration, or you can call the appropriate Collaboration API procedure to link the resource into a collaboration.

Resources can be added to a collaboration at creation time or after a collaboration is created.

Unlike collaborations, resources are versioned. Newer versions of a resource don’t overwrite older versions. If you want to replace a
resource with a newer version, you must also update the collaboration to remove the old version (if you choose) and add the new version.

The account that registers a resource must be a collaborator, and must join the collaboration before any resources they registered can be
available in the collaboration.

> **Learn more:**
>
> * [Learn more about registries.](registries.md)
> * [Learn more about adding resources to your collaboration.](resources.md)

### Joining a collaboration

A collaboration is visible to all collaborators listed in the collaboration specification. All collaborators, including the creator, must join
the collaboration. All collaborators except for the owner must review the collaboration before they can join. Reviewing a collaboration
exposes the collaboration specification to the invited party. After reviewing the collaboration, the invitee can then join the
collaboration. You must join a collaboration before any resources that you provide to a collaboration become usable.

You can see your join status (invited, joining, joined) by calling GET_STATUS on the collaboration. Most collaboration mutation actions,
such as linking a resource, joining a collaboration, or activating results, are either asynchronous, or might take some time to propagate
to other collaborators, so you should call the appropriate procedure to see the state of the change.

### Running an analysis

Collaborators listed as analysis runners in a collaboration can run queries on any data offerings available to them in the collaboration.

Collaborations support the following types of analyses:

* Templated analysis queries: An analysis runner can run any templates assigned to them in the collaboration, and see results synchronously.
* Activation analyses: If the data offering, collaboration, and template allow it, the analysis runner can *activate* (save) results to a
  designated collaborator’s Snowflake account.
* Free-form SQL analyses: If the collaboration and data offering allow it, analysis runners can run SQL queries directly against a data
  offering’s data. See [Free-form SQL queries](free-form-sql.md).

> **Learn more:**
>
> * [Activating query results](activation.md)
> * [Free-form SQL queries](free-form-sql.md)

### Leave or delete a collaboration

You can leave a collaboration at any time, although the collaboration owner can’t leave a collaboration, and instead deletes the
collaboration for everyone.

* Non-owners leave a collaboration by calling LEAVE. Any data offerings they have provided will be removed from the
  collaboration. You can’t rejoin a collaboration after leaving it.
* Collaboration owners can’t leave a collaboration: ownership can’t be transferred. A collaboration owner can drop a collaboration for all
  collaborators by calling TEARDOWN.

Leaving or deleting a collaboration is asynchronous. You must call GET_STATUS to monitor the status, and call LEAVE or TEARDOWN again when GET_STATUS shows the status as LOCAL_DROP_PENDING.

Deleting a collaboration doesn’t affect the registration status of any resources linked into the collaboration. Those resources can
continue to be used or linked into new collaborations.

---
title: Provider-run analyses
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/provider-run-analysis.md
section: Clean Rooms
---

# Provider-run analyses

## Overview

The default clean room configuration enables only the consumer to run an analysis in the clean room. However, the provider can request
permission from the consumer to run a specific template in a specific clean room using consumer data. Provider-run analysis can be enabled
and run using either the clean rooms UI or code.

The following diagram shows the data flow and main components in a basic provider-run analysis:

1. In a basic provider-run analysis, the consumer and provider both link their data into the clean room. Source data is linked into the
   clean room as private views in the account where the data lives.
2. When the provider runs an analysis, the provider’s data is shared with the clean room app in the consumer’s account. The analysis runs
   on the consumer’s account.
3. The encrypted results are temporarily written to the consumer DB in the consumer’s account.
4. The encrypted results are copied to the analysis results back share on the provider’s account (also called the governance back share)
   and decrypted. Because the analysis runs on the consumer’s account, the consumer is billed for the analysis.

For more information, see [Snowflake Data Clean Rooms: Installed objects](../v1/installation-details.md).

### Templates that support provider-run analyses

The following templates support provider-run analyses:

* Audience Overlap & Segmentation
* SQL Query (UI only)
* Custom templates (API only)

### Billing and cost details

Provider-run analyses run in the consumer’s account, and consumers are billed for a provider-run analysis. To stop incurring
costs from provider-run analyses, the consumer must uninstall the clean room.

A consumer can estimate the number of credits consumed by the provider within the last *N* days by executing the following query.
Specify the number of previous days as a negative number.

```sqlexample
-- Estimate the number of credits consumed in the past 5 days.
SELECT * FROM TABLE(SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.PRA_CONSUMPTION_UDTF(-5));
```

When a provider runs an analysis in the clean rooms UI, the clean room uses auto-scaling logic based on dataset sizes to choose a warehouse
for the provider’s analysis.

When a provider creates and runs a clean room using the API, the provider can explicitly choose a warehouse size and type from a set of
permissible values specified by the consumer.

### General notes

* Providers can activate results to their own account using the UI or the API, or to third-party providers if using the UI. For information
  about how to enable activation and view results, see [Activating query results](../v1/activation.md).
* If the consumer and provider are in different cloud regions, [Cross-cloud auto-fulfillment](../v1/enabling-laf.md)
  must be enabled in both accounts and for both clean rooms.

  Note that provider-run cross-cloud queries can take some time to run because provider source data must be replicated from the provider to
  the consumer, and query results from the consumer to the provider, all across cloud regions.
* Any templates run by the provider require column names or aliases for all columns generated in the results. If a column is
  aggregated (for example, `SUM(col1)`) or calls a custom function (for example, `cleanroom.my_function(p.hashed_email)`), the template
  must explicitly specify a column name alias as shown here:

  ```sqlexample
  SELECT SUM(col1) AS TOTAL FROM my_db.my_sch.T; -- Correct
  SELECT SUM(col1)          FROM my_db.my_sch.T; -- Error: aggregated column needs an explicit alias.
  ```

## Provider-run analyses in the UI

Here is how to enable provider-run analysis in a new clean room when using the clean rooms UI:

1. The provider [creates and configures a clean room](../manage-clean-rooms.md), using one of the
   supported templates. Configure the clean room up to the Share Clean Room step.
2. In the Share Clean Room step of clean room configuration, the provider selects Enable run analysis & query next to
   their own account to enable them to run all templates in this clean room that support provider-run analysis.

   * This setting cannot be changed after a clean room is created; if you want to change permission for a specific account to run
     queries in a published clean room, you must delete the clean room and create a new one.
3. The consumer [joins and configures the clean room](../manage-clean-rooms.md) as usual for all templates in the clean
   room, including any templates that support provider analysis. If the consumer does not want to enable a provider to run a specific
   template, they can omit required details for that template.

   * When the consumer joins the clean room, they are warned before joining that provider-run analysis is enabled for that clean
     room.
   * The consumer can run queries as soon as the clean room is joined, but there is a delay of up
     to 30 minutes before the provider can run the template. This setup delay occurs only during the initial join step; if the provider
     later adds other provider-run templates, the provider can run them as soon as the consumer configures their clean room for that
     template.
4. After the join step completes, the clean room is available for both [provider run analyses](../v1/web-app-working.md) and
   [consumer run analyses](../v1/web-app-working.md).

   **Important:**

   * Providers must wait about 10 minutes after the consumer installs the clean room before they can run an analysis. The delay is for
     additional background configuration required for provider-run analyses.
   * The consumer is billed for all analyses in this clean room, whether run by the provider or consumer.

## Provider-run analyses in the API

Here is how to enable provider-run analysis in a new clean room using the clean rooms API:

1. **Provider**

   > 1. Creates and configures the clean room and data and policies in the standard way.
   > 2. Adds consumers in the standard way.
   > 3. Enables provider-run analysis for specific consumer accounts in the clean room by calling
   >    `provider.enable_provider_run_analysis`.
   >
   >    **Important:**
   >
   >    * The provider must call `provider.enable_provider_run_analysis` **after** adding consumers to a clean
   >      room, but **before** any consumer installs the clean room. Each consumer account must approve this request for their data to be
   >      accessible for provider-run analyses in this clean room.
   >    * Any time the provider changes the provider-run analysis setting for a clean room, the clean room must be
   >      re-installed by all consumers for the change to take effect. Because it can be difficult to force all collaborators to
   >      re-install a clean room, it is more reliable for the provider to delete a published, shared clean room when changing the analysis
   >      permissions, and then create a new clean room with the desired permissions.
   > 4. Publishes the clean room.
   > 5. Lets the consumer know that the clean room is available, the name of the clean room, and what templates you want to run in
   >    the clean room.
2. **Consumer**

   1. Installs the clean room and links in data in the standard way.
   2. Sets any [join and column policies](../v1/policies.md) needed on their data.
   3. Allows provider-run analysis for specific templates in the clean room by calling either
      `consumer.enable_templates_for_provider_run` (for multiple templates) or `consumer.approve_template` (for one template).

      > **Note:**
      >
      > If the provider changes a template after the consumer approves it, the consumer must approve the template again. Until the
      > template is re-approved, the old cached version of the approved template will be run by the provider.
   4. (*Optional*) A consumer can limit the warehouse type or sizes available for provider-run analyses: see
      Restricting warehouse size and type limits.
   5. Tells the provider that they have installed the clean room and approved provider-run analyses.
3. **Provider**

   1. After the consumer has installed the clean room, the provider enables analyses to access consumer data by enabling data sharing
      from the consumer to the provider account. The process for this depends on whether the provider and consumer are in the same
      cloud region or different cloud regions:

      * If the provider and consumer are in **the same cloud region,** the provider calls
        `provider.mount_request_logs_for_all_consumers` once. If a new consumer account installs the clean room later and the provider
        wants to use consumer data in this template, the provider must re-run this procedure to be able to access that data.
      * If the provider and consumer are in **different cloud regions**, the provider and consumer must enable
        [cross-cloud auto-fulfillment](../v1/enabling-laf.md). When a provider runs an analysis across regions,
        the query can take some time to complete, because query data is sent from the provider’s region to the consumer’s region and
        back.
   2. Calls `provider.view_warehouse_sizes_for_template` to see if the consumer has limited the type and size of warehouse used for
      the analysis. If the consumer has limited warehouse sizes for provider run analyses, the provider must specify permitted
      `warehouse_type` and `warehouse_size` values in the analysis request in the next step. If the consumer has not specified
      warehouse limits, those fields are optional in the analysis request. For more information, see
      Restricting warehouse size and type limits.
   3. Runs the analysis by calling `provider.submit_analysis_request` with the template name, the table names, and the template
      arguments. If the consumer has specified limits on warehouse sizes or types, the provider must also specify the warehouse size and
      type in the analysis request.

      * Save the request ID returned by `provider.submit_analysis_request`; the ID is needed to check the status and results of the
        analysis.
   4. Checks the status of the analysis by calling `provider.check_analysis_status`. When status is reported as `COMPLETED`,
      call `provider.get_analysis_result` to get the analysis results.

### Restricting warehouse size and type limits

Because the consumer is billed for provider-run analyses, the consumer is able to dictate what sizes and types of warehouse the provider
can use to run an analysis in their account. Here is how a consumer sets warehouse size and type limitations, and how a provider chooses a
warehouse size and type when running an analysis:

1. The consumer calls `consumer.set_provider_run_configuration` and specifies which warehouse sizes and types a provider can use
   for a specific template. In the following snippet, the consumer limits providers to using STANDARD warehouses of size MEDIUM or LARGE when running `template_1`:

   ```sqlexample
   CALL samooha_by_snowflake_local_db.consumer.set_provider_run_configuration(
     $cleanroom_name,
     {
       'template_1': {
         'warehouse_type': 'STANDARD',
         'warehouse_size': ['MEDIUM', 'LARGE']}
     });
   ```
2. The provider calls `provider.view_warehouse_sizes_for_template` to see which warehouse sizes and types are permitted for
   provider-run analyses on that template.

   ```sqlexample
   CALL samooha_by_snowflake_local_db.provider.view_warehouse_sizes_for_template(
     $cleanroom_name,
     'template_1',
     $consumer_account_loc
   );
   ```
3. The provider specifies a warehouse size and type to use in their analysis run request.

   ```sqlexample
   CALL samooha_by_snowflake_local_db.provider.submit_analysis_request(
     $cleanroom_name,
     $consumer_locator_id,
     'template_1',
     ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
     ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
     object_construct(
       'dimensions', ['c.REGION_CODE'],
       'measure_type', ['AVG'],
       'measure_column', ['c.DAYS_ACTIVE'],
       'warehouse_type', 'STANDARD',      -- Any other value would cause the request to fail.
       'warehouse_size', 'LARGE'          -- Only MEDIUM and LARGE supported.
     )
   );
   ```

> **Tip:**
>
> The following procedures manage which side can run an analysis in the clean room:
>
> **Consumer-run analysis** (*allowed by default*): Changes are applied immediately.
>
> > * `provider.enable_consumer_run_analysis`
> > * `provider.disable_consumer_run_analysis`
>
> **Provider-run analysis** (*disabled by default*): Changes require reinstallation by the consumer.
>
> > * `provider.enable_provider_run_analysis` (*requires the consumer to approve by calling
> >   consumer.enable_templates_for_provider_run*)
> > * `provider.disable_provider_run_analysis`

### Install and run the code example

You can download and install a complete running example to create and run a provider-run analysis. To run this
example, you need two Snowflake accounts in the same organization and cloud hosting region with the Snowflake Data Clean Room environment
installed.

1. [`Download the example notebook`](../../../_downloads/6f09db32770533d503e9578e38467b8f/provider-analysis-notebook.ipynb).
2. Install the notebook in both your provider and consumer accounts.

   To upload a notebook, do the following:

   1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
   2. In the navigation menu, select Projects » Notebooks.
   3. Select + Notebook » Import .ipynb file.
   4. Select the .ipynb file you downloaded.
   5. Name the file as desired, and choose a database and schema.
   6. Keep the default warehouse `APP_WH`.
   7. Select Create.
   8. To create the clean room, open the notebook in the provider account and complete the provider portion.
   9. Open the notebook in the consumer account and complete the consumer portion to install and configure the clean room and run the
      template.
3. Run the provider and consumer actions as indicated, in the order shown in the notebook.

---
title: Registering data
source: https://docs.snowflake.com/en/user-guide/cleanrooms/register-data.md
section: Clean Rooms
---

# Registering data

This topic describes how to register data so that it can be linked into a Snowflake Data Clean Room.

## Supported objects

The following object types can be linked into clean rooms:

* Tables
* External tables†
* Apache Iceberg™† tables
* Dynamic tables
* Views
* Materialized views
* Secure views. The owner of a secure view must be the SAMOOHA_APP_ROLE role.

> **Note:**
>
> †External and Iceberg tables must be enabled before they can be used in a clean room.

## Registering data objects

Before users can link data into a Snowflake Data Clean Room, the data must first be *registered*. Registering data grants USAGE and SELECT privileges
on the object to SAMOOHA_APP_ROLE, which is used by the clean room environment to access data. If you register a database or schema, all
of the child objects are registered as well. You must have MANAGE GRANTS privilege on an object to be able to link it.

You can register databases, schemas, and objects using the clean rooms UI or the
Clean rooms API. Using the Clean rooms UI is simpler, but requires that you have the ACCOUNTADMIN role.
Using the developer APIs, you can register any object on which you have the MANAGE GRANTS privilege without using the ACCOUNTADMIN role.

**Usage notes**

* Registering a database or schema does not register objects added *after* the registration. You must either register
  the new object individually or use the Clean rooms UI to navigate to Admin > Snowflake Admin > Database Registration and
  select Resync.
* You can link only data registered by your account. That is, a provider can’t link data registered by the
  consumer and a consumer can’t link data registered by the provider. Once data is linked into a clean room, it can be accessed by anyone
  with access to the clean room, subject to the linking party’s settings (such as join and column policies).
* There are special considerations when registering tables or views with Snowflake policies applied to them.
* See how to register external and Apache Iceberg tables.
* When registering a secure view, you must also separately register the database that is the source for that secure view.
* The following instructions show how to register non-external or Iceberg tables:

Clean rooms UIClean rooms API

You must be able to use the ACCOUNTADMIN role to register objects using the clean rooms UI.

Follow these steps to register a database, schema, or object using the Clean rooms UI:

1. [Sign in to the Clean rooms UI](v1/web-app-introduction.md) as an account administrator and then take one of the following
   steps:

   * If using a managed account, select Admin > My Account.
   * If using a Snowflake account, select Admin > Snowflake Admin and sign in to Snowflake as a user with the
     ACCOUNTADMIN role.
2. Select Admin > Snowflake Admin.
3. Select Log in to Snowflake, and authenticate as a user with the ACCOUNTADMIN role.
4. To enable external or Iceberg tables in the account, enable the External & Iceberg Tables toggle.
5. In the Access management for Snowflake objects section, select Edit, and then select the database, schema, or object to
   make its data linkable by users in this account.
6. Select Save.

Use the Clean rooms API to register databases, schemas, and objects programmatically. You need MANAGE GRANTS privilege on an
object to register it.

External and Iceberg tables are registered differently than other object types.

The following procedures are available to register or unregister objects:

| Object type | Register | Unregister |
| --- | --- | --- |
| Database | * `provider.register_db` (for providers) * `consumer.register_db` (for consumers) | `library.unregister_db` |
| Schema | `library.register_schema` | `library.unregister_schema` |
| Managed access schema | `library.register_managed_access_schema` | `library.unregister_managed_access_schema` |
| Any other supported object type | `library.register_objects` | `library.unregister_objects` |

**Example:**

```sqlexample
USE ROLE <ROLE-WITH-MANAGE-GRANTS-PRIVILEGE>
CALL samooha_by_snowflake_local_db.library.register_schema(['MY_DB.MY_SCHEMA']);
```

### Registering tables or views that have Snowflake policies applied

If you want to link in data that has a Snowflake policy applied, and the Snowflake policy is stored in a *different* database than the
source data, you must grant reference usage on the policy database to clean rooms. You can do this either once
per account, or once per clean room.

#### Grant reference usage once per account

To grant reference usage to a database once per account, and have it granted automatically for each clean room, grant reference usage to
SAMOOHA_APP_ROLE by running the following SQL command. Replace the database placeholder with your database name.

```sqlexample
GRANT REFERENCE_USAGE ON DATABASE <database_name>
  TO ROLE SAMOOHA_APP_ROLE
  WITH GRANT OPTION;
```

#### Grant reference usage once per clean room

If you prefer to grant reference usage to a database per clean room rather than to all clean rooms in the account, run the following SQL
command. Replace the database name and [clean room ID](v1/developer-introduction.md) placeholders with the appropriate values:

```sqlexample
GRANT REFERENCE_USAGE ON DATABASE <database_name>
  TO SHARE IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_<clean_room_ID>;
```

## Unregistering data objects

Once a table is linked into a clean room, it cannot be removed. However, you can unregister the object in the account, which will remove
access by any clean rooms in that account.

If you want to remove data from a clean room or account, don’t simply delete the underlying object; this will cause the clean room
to fail. Instead, use one of the following techniques to unregister the object.

Clean rooms UIClean rooms API

When you unregister an object from an account, you should also update any clean rooms you created that used this data.

Queries by any collaborators that depend on deleted data will fail the next time they are run.

To unregister an object in an account:

1. [Sign in to the Clean rooms UI](v1/web-app-introduction.md) as an account administrator and then take one of the following
   steps:

   * If using a managed account, select Admin > My Account.
   * If using a Snowflake account, select Admin > Snowflake Admin and sign in to Snowflake as a user with the
     ACCOUNTADMIN role.
2. Select Admin > Snowflake Admin.
3. Select Log in to Snowflake, and authenticate as a user with the ACCOUNTADMIN role.
4. To enable external or Iceberg tables in the account, enable the External & Iceberg Tables toggle.
5. In the Access management for Snowflake objects section, select Edit, and then deselect the database, schema, or object to
   make its data unavailable to users in this account.
6. Select Save.
7. Update any clean rooms you created that depend on this data.

In the API, call the appropriate procedure to unregister an object from an account:

* `library.unregister_db`
* `library.unregister_schema`
* `library.unregister_managed_access_schema`
* `library.unregister_objects`

## Enabling external and Apache Iceberg™ tables

To allow external tables and Iceberg tables to be linked into a clean room, the account must first be configured to enable use of
external and Iceberg tables. After external and Iceberg tables are enabled, they can be registered, linked, and used the same as any other
tables.

The process for enabling external and Iceberg tables varies, depending on whether you are managing the clean room using the Clean rooms UI
or the Clean rooms API.

### External and Iceberg table requirements

* **Both the provider and consumer accounts must enable external and Iceberg tables** to allow full usage of a clean room that
  links in external or Iceberg tables.
* **Providers must always enable external tables and Iceberg tables when sharing a clean room with a managed account.** This is because
  managed accounts always use external tables.
* **If the provider and consumer are in different regions,** only the consumer can link external or Iceberg tables into a clean room.

Clean rooms UIClean rooms API

The Clean rooms UI controls external and Iceberg tables at the account level.

> **Warning:**
>
> If the consumer account has not enabled this feature, consumers will be blocked from joining any clean rooms that link in external or
> Iceberg tables, or will be prevented from editing (but can still run) any already joined clean rooms that link in either type of table.

A DCR administrator in both the provider and consumer accounts must take the following steps:

1. [Sign in to the Clean rooms UI](v1/web-app-introduction.md) as an account administrator and then take one of the following
   steps:

   * If using a managed account, select Admin > My Account.
   * If using a Snowflake account, select Admin > Snowflake Admin and sign in to Snowflake as a user with the
     ACCOUNTADMIN role.
2. Enable the External & Iceberg Tables toggle. This enables the feature in both UI-created and API-created clean rooms.
3. External and Iceberg tables are now selectable in the administrator’s
   Access management for Snowflake objects panel, where they can be selected to make them available to
   clean rooms, the same as any other objects.

In code, you must enable external and Iceberg tables at **both** the account level **and also** for each clean room that links in
external or Iceberg tables. If you have enabled external and Iceberg tables in the Clean rooms UI, you do not need to enable them in
code (you don’t need to take the steps listed here).

> **Warning:**
>
> If only one account has enabled this feature for their account or clean room and linked in an external or Iceberg table, the other
> account will be able to run existing templates, but won’t be able to modify the clean room in any way until external and Iceberg
> tables are allowed in both that account and clean room.

To enable and use external or Iceberg tables for new clean rooms in code:

1. A user with the ACCOUNTADMIN role first enables external and Iceberg tables for the entire clean room environment in both
   the provider and consumer accounts:

   > > ```sqlexample
   > > USE ROLE ACCOUNTADMIN;
   > > CALL samooha_by_snowflake_local_db.library.enable_external_tables_on_account();
   > > ```
   >
   > > **Note:**
   > >
   > > Existing clean rooms created with the Clean rooms UI are not affected by this method.
   > > To update existing clean rooms created using the Clean rooms UI you must either enable them in code individually, as shown in
   > > the next steps, or else enable clean rooms using the Clean rooms UI, which enables the feature for all
   > > existing clean rooms.
2. A **provider** enables external and Iceberg tables for their clean room. Note that this triggers a security scan which, if successful,
   generates a new clean room version, so you will need to update the default release directive.

   > ```sqlexample
   > USE ROLE SAMOOHA_APP_ROLE;
   > CALL samooha_by_snowflake_local_db.provider.enable_external_tables_for_cleanroom(
   >   $cleanroom_name);
   >
   > -- Call until scan is complete.
   > CALL samooha_by_snowflake_local_db.provider.view_cleanroom_scan_status($cleanroom_name);
   >
   > -- When scan is successful, update with patch version mentioned in return value from enable_external_tables_for_cleanroom.
   > CALL samooha_by_snowflake_local_db.provider.set_default_release_directive($cleanroom_name, 'V1_0', '<PATCH_VERSION>');
   > ```
3. A **consumer** must also enable use of external and Iceberg tables in the same clean room:

   > ```sqlexample
   > USE ROLE SAMOOHA_APP_ROLE;
   > CALL samooha_by_snowflake_local_db.consumer.enable_external_tables_for_cleanroom(
   >   $cleanroom_name);
   > ```

After external and Iceberg tables have been enabled for a clean room, collaborators can register and link these tables the same way as
any other table.

---
title: Registries
source: https://docs.snowflake.com/en/user-guide/cleanrooms/registries.md
section: Clean Rooms
---

# Registries

## Overview

To use a resource such as a template or data offering in a collaboration, you must first register it in a registry. A registry is an account-level container designed to store these resources. Once registered, any resource in the registry can be linked to a collaboration by any user in your account who has both access to the registry and the necessary linking permissions for that specific collaboration. Notably, registries are independent of specific collaborations; a registered resource can be linked to any number of collaborations, or none at all, within that account.

Each Snowflake account supports a default registry. You can create additional custom registries for your account. Custom registries are a
good way to group and manage access to your resources. For example, you could create a custom registry for sales data and another for
expenditure data, then grant access to these registries to the appropriate users via [DCR privileges and custom RBAC roles](manage-access.md).

## Registry rules

Here are the main rules about registries:

* Registries are account-level objects. Users can see and access only registries in their own account. However, when a resource in a
  registry is linked into a collaboration, the resource is visible to anyone who can access it according to the spec. Access to the containing registry isn’t required.
* Each custom registry supports a single resource type (template, data offering, and so on). The resource type is specified when you create the
  registry. The default registry supports any resource type.
* There is no limit to how many custom registries you can create in an account.
* When you register a resource, you can use the optional registry name parameter to specify a custom registry. If you don’t specify a
  custom registry, the resource is registered in the default registry for the account.
* All users have access to the default registry in an account. Custom registries, however, are initially private to the creator, and
  additional users must be granted access explicitly by calling `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE`.
* An account can have multiple registries that store the same resource type.
* Registries don’t have a maximum number of resources.
* A resource must have a unique name across all registries in that account for resources of that type. For example, you can have a template
  named `sales` and a data offering named `sales` in the same account, but not two templates named `sales` in either the same or
  different registries in the same account. The resource name is defined as the highest-level `name` value in the spec.
* If two different accounts link resources with the same name and type to a collaboration, that is allowed. The collaboration specification will show
  identically named resources, but the system will know which resource is intended — the resource with that name is used from the account
  that linked the resource to the collaboration.

## Example

This example creates a custom registry, registers a template in it, and grants read access to that registry to a new role. Users
with that role can link templates in that registry into a collaboration.

```sqlexample-yaml
-- Create a custom registry that can hold templates.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.CREATE_REGISTRY(
  'SALES',
  'TEMPLATE'
);

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
'SALES',
$$
api_version: 2.0.0
spec_type: template
name: alice_only_template
version: v1
type: sql_analysis
description: Joins two tables on hashed email and counts matches grouped by status.
template:
  SELECT t1.status, COUNT(*)
    FROM IDENTIFIER( {{ source_table[0] }} ) AS t1
    JOIN IDENTIFIER( {{ source_table[1] }} ) AS t2
    ON t1.hashed_email_b64_encoded = t2.hashed_email_b64_encoded
    GROUP BY t1.status;
$$
);

-- Create a role and grant it access to the registry.
CREATE ROLE MARKETING_USERS;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE(
  'READ',
  'REGISTRY',
  'SALES',
  'MARKETING_USERS'
);

-- Grant access to the registry for a user by assigning the role.
GRANT ROLE MARKETING_USERS to USER willy_loman;
```

---
title: Resources
source: https://docs.snowflake.com/en/user-guide/cleanrooms/resources.md
section: Clean Rooms
---

# Resources

## Overview of collaboration resources

Collaborators can add various resources to a Snowflake Data Clean Room collaboration. Resources include templates, data offerings, and code bundles.

Resources are available only to the collaborators designated by a collaboration specification.

Resources support versioning; however, creating a new resource with a new version doesn’t remove the previous version from the collaboration. Resources are uniquely named by combining the user-provided name and version (and alias, for data offerings).

Adding a resource to a collaboration is a two-part process:

1. **Register the resource in the account.** This makes it available to be linked into multiple collaborations. Resources are registered in either the default registry for the account, or in a custom registry. Learn more about [Registries](registries.md).
2. **Link the resource into a specific collaboration.** After a resource is linked, it can be seen and used by the designated collaborators
   in the collaboration. You must have read access to the registry and update privilege on the collaboration to be able to link a resource from that registry into the collaboration.

> **Important:**
>
> If you share data with users in other cloud hosting regions, the sharer must [enable Cross-Cloud Auto-Fulfillment on their account](laf.md).

You can link the following resource types into a collaboration:

* [Templates](resources-templates.md)
* [Data offerings](resources-data-offerings.md)
* [Code bundles](resources-code-bundles.md)

Use [registries](registries.md) to group and manage access to your resources, and [naming paths](resources-data-offerings.md) to organize your data offerings.

---
title: Run an analysis in the UI
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/web-app-working.md
section: Clean Rooms
---

# Run an analysis in the UI

The [clean rooms UI](web-app-introduction.md) of a Snowflake Data Clean Room provides an intuitive UI that allows business
users to create and use clean rooms without worrying about code complexities.

This topic provides an introduction to tasks that you complete as you work with a clean room. It describes the actions of the provider who
creates and shares a clean room along with the consumer who uses that clean room.

## Run an analysis as a provider

If a provider has [configured a clean room to allow provider-run analyses](../demo-flows/provider-run-analysis.md),they can run an analyses in the clean room.

A provider can run an analysis in a properly configured clean room through either of the following actions:

* Select Clean Rooms from the left navigation, find the tile for the clean room on the Created tab, and select Run.
* Select Analyses & Queries from the left navigation, and run an existing analysis or create a new one just like a consumer would.

The provider selects which collaborator has the data that they want to include in their analysis.

> **Important:**
>
> If a consumer allows a provider to run an analysis on a template, the consumer, not the provider, is charged for the credits consumed by
> the provider’s analysis. After the consumer has allowed the provider to run analyses, the consumer must uninstall the clean room to stop
> incurring costs.
>
> If a consumer wants to obtain an estimate of the number of credits consumed by the provider within a specific time period, they can
> execute the following query, where `-5` returns an estimate of the previous 5 days of compute consumption by the provider:
>
> ```sqlexample
> SELECT * FROM TABLE(SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.PRA_CONSUMPTION_UDTF(-5));
> ```

Administrators can monitor the analyses being run by the provider: see [Monitor provider-run analyses](../admin-tasks.md).

Consumers can monitor billing due to provider-run analyses: see [Billing and cost details](../demo-flows/provider-run-analysis.md).

Providers can [activate results to their own account](activation.md).

### Limitations on provider-run analyses

When using the clean rooms UI to run an analysis, the provider has the following limitations:

* Not all templates are supported. Currently, the Audience Overlap & Segmentation and SQL Query templates are supported.
* If the collaborators are in different clouds or regions:

  + Consumers must [enable cross-cloud auto-fulfillment](enabling-laf.md) on their account.
  + Results for a provider-run analysis are returned based on the combined refresh frequency between both parties. Providers and consumers
    should coordinate so the refresh frequency of the provider application and the consumer listing are similar (for example, both have a
    frequency of 15 minutes). This ensures that results are returned promptly.

## Run an analysis as a consumer

As a consumer, you can use either the Clean Rooms page or the Analyses & Queries page to run analyses in an installed clean
room.

To use the Clean Rooms page to run a new analysis based on the types of analyses that the provider has made available in the clean
room:

1. [Sign in to your clean room environment in the clean rooms UI](web-app-introduction.md).
2. In the left navigation, select Clean Rooms.
3. On the Joined tab, find the clean room in the list and select Run.
4. Select the analysis type, and then select Proceed.
5. Add filters to the analysis. There are two reasons why filter values might not be available:

   * The column contains more than 20 distinct values.
   * The clean room was recently installed, and has not finished processing preview values for the column. You can re-run the analysis when
     these values become available.
6. Select Run.
7. Optional: Expand the Save Analysis & Query section to save the analysis for future use.

To use the Analyses & Queries page to run existing analyses or create and run a new analysis:

1. [Sign in to your clean room environment in the clean rooms UI](web-app-introduction.md).
2. In the left navigation, select Analyses & Queries.
3. Do one of the following:

   * To run an existing analysis, use the filters to find the analysis and run it.
   * To create and run a new analysis based on the types of analyses that the provider has made available in the clean room,
     select + New Analysis & Query.

## Select a warehouse for an analysis

You can select which [warehouse](../../warehouses-overview.md) you want to use to run an analysis. Increasing the
size or changing the type of the warehouse can speed up the analysis.

> **Note:**
>
> The type of template determines what type of warehouse you can select for the analysis.
> For example, some templates (like Audience Overlap) only allow regular warehouses.

The option to select a different warehouse appears next to the Run button on a template. This option does not appear for all
templates.

Be aware that increasing the size of a warehouse or using a Snowpark-optimized warehouse can increase the cost of running the
analysis. For information about how credit consumption grows as you use a larger warehouse, see [Warehouse size](../../warehouses-overview.md) and
[Billing for Snowpark-optimized warehouses](../../warehouses-snowpark-optimized.md).

For a description of the warehouses that are available, see [Warehouses](installation-details.md).

If you are an administrator who wants to create additional warehouse options, see [Using a different warehouse](../admin-tasks.md).

## View details about a clean room

You can obtain details about a clean room, including:

* A Collaborator Summary tab that lists the templates in the clean room along with the tables and join columns of your collaborator.
* A My Summary tab that lists your tables and join columns.
* A Table Relations tab that lists the relationship between your tables and the tables of your collaborator (that is, how the tables are
  joined).
* A Data Stats tab that provides the following metrics for your tables:

  + **My table:** Shows how many distinct identifiers belong to a certain group. Note that statistics are updated every 24 hours so there
    might be a delay between modifying the clean room and seeing the updated statistics. Also note that columns with more than 20 distinct
    values are not shown.
  + **Overlap stats:** A clean room with either the Audience Overlap & Segmentation or SQL Query templates will show overlap stats
    to the consumer. These statistics describe how many distinct identifiers (join columns) belong to a certain group based on
    the attribute columns enabled in the template. You can select up to 2 attribute columns to view statistics breakdowns for. The data
    is generated after the initial installation and is refreshed whenever a user logs into the clean rooms UI. Note that
    in the bar graph visualization only the first 5 rows of data are plotted based on the default sorting
    provided. Statistics that take longer than 10 minutes to run for a particular breakdown will not be available.

To access these clean room details, complete these steps:

1. [Sign in to your clean room environment in the clean rooms UI](web-app-introduction.md).
2. In the left navigation, select Clean Rooms.
3. Click on the tile for the clean room.

Clean room details are also available in the Clean Rooms Details section when you are running an analysis.

## Download and share results

A user who wants to share the aggregate results they generated within the clean room can download the results of
the clean room analysis as a .csv file, and then share these results with others outside of Snowflake, including sharing with a clean room
collaborator via email.

---
title: Running free-form SQL queries on clean room tables
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/web-app-sql-template.md
section: Clean Rooms
---

# Running free-form SQL queries on clean room tables

You can enable consumers to run free-form SQL queries on selected datasets in your clean room using either the clean room API or UI.

## Free-form queries using the clean rooms API

You can configure a clean room to allow collaborators to query specific linked datasets from outside the clean room.
Collaborators can run free-form queries on these datasets in any environment where they can access the clean room, including Snowsight or
Snowflake CLI. Free-form datasets behave as standard, read-only views that can be queried using SQL, Python, or other supported
Snowflake languages.

> **Note:**
>
> When you grant a consumer permission to run free-form SQL queries in a clean room, that consumer can query the data from that clean
> room against any other data that they can access from their account.

### Policies and differential privacy support

When you expose clean room data for free-form queries, all Snowflake policies are respected. Clean room policies (join policies,
column policies) are not enforced in free-form queries.

Clean room differential privacy is not enforced on data exposed to free-form queries. This includes both
[Snowflake differential privacy](../../diff-privacy/differential-privacy-overview.md) and
[clean room differential privacy](../differential-privacy.md).

### Enabling free-form queries

> **Important:**
>
> If a clean room was created before June, 2025 the provider must patch their clean room by running the following code to enable free-form
> queries in that clean room:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL samooha_by_snowflake_local_db.provider.patch_cleanroom($cleanroom_name,TRUE);
> ```

#### Provider steps

The provider takes the following steps to make datasets in a clean room available to clean room collaborators using free-form queries:

1. Create the clean room in the standard way.
2. Register and link the datasets into the clean room in the standard way using the API. Note that currently your
   data must be registered using the API; you cannot register views in the clean room UI and use them for free-form queries. You should
   apply any Snowflake aggregation, join, or other policies before sharing your data outside the clean room.
3. Call `provider.enable_workflows_for_consumers` to allow specific users free-form access to the tables that you will specify in the
   next step. **You must name this work flow** `freeform_sql`.
4. Call `provider.enable_datasets_for_workflow` to specify which datasets in the clean room can be queried.
5. Add your collaborators in the standard way by calling `provider.add_consumers`.
6. Publish your clean room.
7. If you want to revoke permission to query these tables, you can do this at the user level by calling
   `provider.disable_consumer_run_analysis` or `provider.remove_consumers`, at the dataset level by calling
   `library.unregister_objects` or `library.unregister_db`, or by deleting the clean room.

If a clean room already exists and data is registered, you can simply call `provider.enable_workflows_for_consumers` and
`provider.enable_datasets_for_workflow` to expose the specified datasets to the specified users.

The following code creates three sample tables and applies Snowflake policies to them, creates a new clean room, links in the tables, and
grants free-form query access to those tables for clean room collaborators via the clean room. The highlighted code shows where you enable
free-form queries in the clean room.

```sqlexample
----------------- Create sample data -----------------
USE ROLE MYROLE;
CREATE DATABASE freeform_db;

-- Create a table with an aggregation constraint.
CREATE OR REPLACE TABLE freeform_db.public.agg_constrained_table
  AS SELECT * FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS;

CREATE AGGREGATION POLICY freeform_db.public.agg_policy AS ()
  RETURNS AGGREGATION_CONSTRAINT ->
  AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);

ALTER TABLE freeform_db.public.agg_constrained_table
  SET AGGREGATION POLICY freeform_db.public.agg_policy;

-- Create a table with a projection constraint.
CREATE OR REPLACE TABLE freeform_db.public.proj_constrained_table
  AS SELECT * FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS;

CREATE OR REPLACE PROJECTION POLICY freeform_db.public.proj_policy AS ()
  RETURNS PROJECTION_CONSTRAINT ->
  PROJECTION_CONSTRAINT(ALLOW => false);

ALTER TABLE freeform_db.public.proj_constrained_table MODIFY COLUMN hashed_email
  SET PROJECTION POLICY freeform_db.public.proj_policy;

-- Create a table with a masking policy.
CREATE OR REPLACE TABLE freeform_db.public.masked_table
  AS SELECT * FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS;

CREATE OR REPLACE MASKING POLICY freeform_db.public.masking_policy
  AS (val string) RETURNS STRING ->
  CASE
    WHEN current_account() IN ('DCR_PROVIDER_PP6') THEN VAL
    ELSE '*********'
  END;

ALTER TABLE freeform_db.public.masked_table MODIFY COLUMN hashed_email
  SET MASKING POLICY freeform_db.public.masking_policy;

----------------- Create and publish a clean room that supports -----------------
----------------- free-form queries against this data.          -----------------

-- Create the clean room. Nothing new here.
USE ROLE SAMOOHA_APP_ROLE;
SET cleanroom_name = 'freeform queries';
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.cleanroom_init($cleanroom_name, 'INTERNAL');

-- Link in the policy-protected tables from above. Nothing new here.
USE ROLE MYROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.register_db('freeform_db');
USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.link_datasets($cleanroom_name,
  ['freeform_db.public.agg_constrained_table',
  'freeform_db.public.proj_constrained_table',
  'freeform_db.public.masked_table']);

-- Grant the following consumer access to the tables specified next.
-- The flow name must be 'freeform_sql'
SET flow_name = 'freeform_sql';
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.enable_workflows_for_consumers($cleanroom_name,
  [$flow_name],
  ['<CONSUMER_LOCATOR>']);

-- Grant the consumer specified above access to the specified tables.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.enable_datasets_for_workflow($cleanroom_name,
  $flow_name,
  ['freeform_db.public.agg_constrained_table',
   'freeform_db.public.proj_constrained_table',
   'freeform_db.public.masked_table']);

-- Add collaborators and publish, in the standard way.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.add_consumers(
  $cleanroom_name, '<CONSUMER_LOCATOR>', '<ORG_NAME>.<CONSUMER_LOCATOR>');
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.set_default_release_directive(
  $cleanroom_name, 'V1_0', '0');
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.provider.create_or_update_cleanroom_listing(
  $cleanroom_name);
```

#### Consumer steps

After the provider has published a clean room with free-form SQL work flows, consumers with access to that clean room can run queries
against the exposed views by following these steps:

1. Install the clean room in the standard way. No need to link in consumer data, as the consumer will access their data in their local
   environment, not in the clean room.
2. Call `consumer.get_provider_freeform_sql_views` to list the free-form SQL views available to the current account and role.
3. Run standard SQL queries against the data.

```sqlexample
-- Install the clean room.
USE ROLE SAMOOHA_APP_ROLE;
SET cleanroom_name = 'freeform queries';

CALL samooha_by_snowflake_local_db.consumer.install_cleanroom($cleanroom_name, '<PROVIDER_LOCATOR>');

-- List free form views available in the clean room.
CALL samooha_by_snowflake_local_db.consumer.GET_PROVIDER_FREEFORM_SQL_VIEWS($cleanroom_name);

-- Run queries on the views
SELECT * FROM <PROJECTION_POLICY_VIEW_NAME>;
SELECT * FROM <MASKING_POLICY_VIEW_NAME>;
SELECT COUNT(hashed_email), age_band
  FROM <AGGREGATION_POLICY_VIEW_NAME> group by age_band;
```

## Free-form queries in the clean rooms UI

The SQL Query template in a clean room lets consumers write free-form SQL to query data in the clean room. When using the SQL
Query template, consumer queries must meet certain requirements to successfully return results. These requirements are determined by how
the data provider protects their tables with data privacy policies.

When creating or updating a clean room in the UI, add the SQL Query template to your clean room and configure it as described below.

### Provider: Create a clean room and set policies

1. Create a clean room or edit an existing clean room, and specify tables or views for your table.
2. Join policies specified during the clean room creation process are ignored when using the SQL Query template, but respected for any
   other templates.
3. In Configure Analysis & Query select Horizontal » SQL Query.
4. In the SQL Query settings section, set the following properties:

   1. Under Tables, select tables that should be available to clean room collaborators in free-form queries. By default,
      aggregation policies do not need to be applied. To control which columns can be
      projected, and which must be aggregated, you must set column policies in the next section.

      > **Important:**
      >
      > In free-form queries in the clean rooms UI, you cannot use a table with a name that ends in “LIST” (upper or lower case).
   2. In the Column Policies section set the following values to control if or how your columns can be used in a query:

      1. Aggregation policy columns: Specify which columns must be aggregated in order to appear in query results. If you apply an
         aggregation policy to a column and one column is used in a query, then the results must be aggregated. Any columns listed here
         will be added to the Privacy settings section.
      2. Projection policy columns: Columns with a projection policy cannot be projected (that is, included in a SELECT
         statement). However, consumers can filter or join on a column with a projection policy.
      3. Fully permitted columns: The consumer can SELECT, filter, or join on these columns without restriction (aggregation or
         otherwise).
   3. The Privacy settings section lists all columns with an aggregation policy applied. The Threshold value indicates how many
      entities must exist for that value to appear in the results. For example, if you set a threshold of 5 on a FIRST_NAME column, and the
      name “Erasmus” appears only 4 times in the table, all rows with “Erasmus” will be filtered out before any processing has occurred
      (so, for example, a COUNT(\*) on such a table will omit those 4 rows with the below-threshold group size).

### Consumer: Run a free-form query

1. Join or edit the clean room in the clean rooms UI.
2. In the Configure Analysis & Query section, choose your tables that you will use for free-form queries.

   > **Important:**
   >
   > In free-form queries in the clean rooms UI, you cannot use a table with a name that ends in “LIST” (upper or lower case).
3. Select Finish to save your changes.
4. To run a query, select Run in the clean room with the SQL Query template and select the SQL Query template.

#### Select join and filtering columns

You can join and filter on any column that has a policy or is fully permitted. To determine if a column can be joined or used in a filter:

1. In the Query Configurations section, find the Tables tile.
2. Use the drop-down list to select a table. You can join and filter on all of the columns listed.

#### Select projection columns

Queries executed using the SQL Query template have restrictions on which columns can be projected (used in a SELECT statement).

To determine if your query can project a column:

1. In the Query Configurations section, find the Tables tile.
2. Use the drop-down list to select a table.
3. Look for columns that have a projection policy label, which means you cannot project it. You can project all columns except the ones
   with the projection policy label.

#### Aggregation requirements

If the provider assigned an aggregation policy to a column, all queries executed using the SQL Query template must return aggregated
results.

To determine if your query must aggregate results:

1. In the Query Configurations section, find the Tables tile.
2. Use the drop-down list to select a table.
3. Look for columns that have an aggregation policy label. If there is at least one aggregation policy label, you must use an aggregate in
   your query.

For guidelines on how to write a successful query against data protected by an aggregation policy, see:

* [Query requirements for aggregation policies](../../aggregation-policies.md). For example, you can use this section to
  determine that the MIN and MAX aggregation functions do not satisfy the query requirements, and cannot be used.
* [Aggregation policy limitations](../../aggregation-policies.md)

#### Graphing requirements

In order for Snowflake to be able to generate a graph:

* **The results table must include at least one measure (numeric) column and one dimension (category) column.**
* **The measure column name must have the following prefix or suffix (case-insensitive):**

  + Column-name prefixes:

    - COUNT
    - SUM
    - AVG
    - MIN
    - MAX
    - OUTPUT
    - OVERLAP
  + Column-name suffix:

    - _OVERLAP

Snowflake generates a chart using the first eligible measure column and the first dimension column in a results table.

#### Limitations

* An ORDER BY clause has no effect on how the results of the analysis are displayed.

#### Sample queries

Use this section to better understand what a query can and cannot include when running an analysis with the SQL Query template.

Queries without an aggregation function
:   In some circumstances, you can return values without using an aggregation function.

    | Allowed | Not allowed |
    | --- | --- |
    | ```sqlexample SELECT gender, regions   FROM TABLE sample_db.demo.customer   GROUP BY gender, region; ``` | ```sqlexample SELECT gender, regions   FROM TABLE sample_db.demo.customer; ``` |

Common table expressions (CTEs)
:   | Allowed | Not allowed |
    | --- | --- |
    | ```sqlexample WITH audience AS   (SELECT COUNT(DISTINCT t1.hashed_email),     t1.status     FROM provider_db.overlap.customers t1     JOIN consumer_db.overlap.customers t2       ON t1.hashed_email = t2.hashed_email     GROUP BY t1.status);  SELECT * FROM audience; ``` | ```sqlexample WITH audience AS   (SELECT t1.hashed_email,     t1.status     FROM provider_db.overlap.customers quoted t1     JOIN consumer_db.overlap.customers t2       ON t1.hashed_email = t2.hashed_email     GROUP BY t1.status)  SELECT * FROM audience ``` |

CREATE, ALTER, TRUNCATE
:   A query cannot use CREATE, ALTER, or TRUNCATE.

Query with joins
:   | Allowed |
    | --- |
    | ```sqlexample SELECT p.education_level,   c.status,   AVG(p.days_active),   COUNT(DISTINCT p.age_band)   FROM  samooha_sample_database.demo.customers c   INNER JOIN   samooha_sample_database.demo.customers p     ON  c.hashed_email = p.hashed_email   GROUP BY ALL; ``` |

DATE_TRUNC
:   | Allowed |
    | --- |
    | ```sqlexample SELECT COUNT(*),   DATE_TRUNC('week', date_joined) AS week   FROM consumer_sample_database.audience_overlap.customers   GROUP BY week; ``` |

Quoted identifiers
:   | Allowed |
    | --- |
    | ```sqlexample SELECT COUNT(DISTINCT t1."hashed_email")   FROM provider_sample_database.audience_overlap."customers quoted" t1   INNER JOIN   consumer_sample_database.audience_overlap.customers t2     ON t1."hashed_email" = t2.hashed_email; ``` |

---
title: Sample Notebooks and Worksheets
source: https://docs.snowflake.com/en/user-guide/cleanrooms/tutorials-and-samples.md
section: Clean Rooms
---

# Sample Notebooks and Worksheets

## Tutorials

Here are tutorials to try out using Snowflake Data Clean Rooms when you’re just getting started:

* [Basic API tutorial, two accounts](tutorials/collaboration-basic-api-tutorial.md): Demonstrates using the API to
  create and run a custom template using a single Snowflake account.

## Sample notebooks and worksheets

Many of the use case topics include full running samples of Snowflake Data Clean Rooms as downloadable notebooks or worksheets. You
need a Snowflake account with the clean rooms API environment installed to run any of these samples, and you must be able to use the
SAMOOHA_APP_ROLE role.

> **Tip:**
>
> To upload SQL worksheets and notebooks, see [Create and work with files and folders](../ui-snowsight/workspaces-working.md).

Sample single-account collaboration:
:   Demonstrates a simple collaboration with only a single user account. The example worksheet creates a collaboration with two data offerings and two templates, and runs each template.

    * [`Download the worksheet`](../../_downloads/20f15c1dd86d7a782f6e362e78ac20c5/demo-collaboration-single-user.sql)

Advanced single-account collaboration:
:   Demonstrates the use of RBAC roles to limit access, and custom registries, in a single account. The example worksheet creates a collaboration with four roles: one that can create a collaboration; one that can register templates; and two that can run analyses.

    * [`Download the worksheet`](../../_downloads/58963cf06e0917ee8331991eba5c1230/demo-collaboration-three-roles.sql)

Basic two-party example:
:   Demonstrates a two-party collaboration that creates templates and data offerings. Requires two accounts.

    * [`Collaboration creator (alice)`](../../_downloads/162c93b64e33d38d9cdeb15a710fa8fd/demo-collaboration-hub-alice.sql)
    * [`Analysis runner (bob)`](../../_downloads/ff68e520998de6717efbfe424fdc56db/demo-collaboration-hub-bob.sql)

Free-form query example:
:   Demonstrates how to implement and run free-form SQL queries in a collaboration. Requires two accounts.

    * [`Collaboration owner and data provider worksheet`](../../_downloads/83b7c30a5d9e249f0ca1d60339f889c9/collab-hub-freeform-sql-provider.sql)
    * [`Collaboration query runner worksheet`](../../_downloads/fd47736d365306779ad4d94ac2c3ad5c/collab-hub-freeform-sql-consumer.sql)

For more examples, including inventory forecasting and last touch attribution, see the
[Use cases](collab-inventory-forecasting.md) section.

---
title: Scheduling a repeating analysis in the clean rooms UI
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/schedule-analysis.md
section: Clean Rooms
---

# Scheduling a repeating analysis in the clean rooms UI

Some types of analyses can be scheduled to run automatically at a regular interval. These analyses include the following:

* SQL Query template
* Audience Overlap & Segmentation template
* Custom templates created using the developer APIs

An administrator must configure the clean room environment to allow scheduled analyses before a user can schedule an analysis to repeat.

**Limitation:** Only consumers can schedule analyses.

## Enable scheduled analyses

A clean rooms account administrator must configure an account to allow scheduled analyses.

To enable scheduled analyses in a clean room account:

1. [Sign in to the clean rooms UI.](web-app-introduction.md)
2. In the left navigation, select Admin » Snowflake Admin.
3. Select Login to Snowflake, and authenticate as a Snowflake user with the ACCOUNTADMIN role.
4. In the Account Features section, enable Schedule Analysis Run.

## Scheduling a repeating analysis

If an analysis template supports scheduling analyses, you can configure an analysis to repeat at a regular interval when you run it for the
first time in the clean room. When prompted to save the analysis, use the Schedule Run drop-down list to select an interval.

## Modify or disable a scheduled analysis

If you want to change the scheduled interval for an analysis or turn off scheduling, complete these steps:

1. [Sign in to the clean rooms UI.](web-app-introduction.md)
2. In the left navigation, select Analyses & Queries.
3. Find the analysis in the list and select it.
4. Expand the Save Analysis & Query section, and use the Schedule Run drop-down list to make the change.

---
title: Security scans for custom templates
source: https://docs.snowflake.com/en/user-guide/cleanrooms/scan-custom-template.md
section: Clean Rooms
---

# Security scans for custom templates

Snowflake runs a security scan on custom templates every 30 minutes to identify Jinja code that is susceptible to a SQL injection attack.

## Prerequisites

* To enable the custom template security scan, you must log into the clean rooms UI for that account at least once.
* The PRIVACY_AND_SECURITY_SCANNER task must be running.

  To see if the task is running in the Tasks page in Snowsight:

  1. In the navigation menu, select Transformation » Tasks.

## View security scan results

Snowflake saves security scan results to the SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.TEMPLATE_SCANNER_RESULTS table in the provider’s
Snowflake account. This table is present only if the previously listed prerequisites are satisfied.

To view results of security scans:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Use the database object explorer in Snowsight or a SQL query to view the security scan results:

   SnowsightSQL

   1. In the navigation menu, select Catalog » Database Explorer.
   2. Navigate to `SAMOOHA_BY_SNOWFLAKE_LOCAL_DB` » `PUBLIC` » `Tables` » `TEMPLATE_SCANNER_RESULTS`.
   3. Select Data Preview.

   1. In the navigation menu, select Projects » Worksheets.
   2. Select + SQL Worksheet.
   3. To list the results of the security scans, paste and run the following
      statement:

      ```sqlexample
      SELECT *
         FROM samooha_by_snowflake_local_db.public.template_scanner_results;
      ```

---
title: Snowflake Data Clean Room tutorials, samples, and videos
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/tutorials-and-samples.md
section: Clean Rooms
---

# Snowflake Data Clean Room tutorials, samples, and videos

## Tutorials

Here are tutorials to try out using Snowflake Data Clean Rooms when you’re just getting started:

* [Basic UI tutorial, single account](tutorials/cleanroom-web-app-single-account-tutorial.md): Demonstrates a
  simple overlap analysis and consumer activation, using a single Snowflake account. Single account testing supports most, but not all
  clean room features. To test the full functionality of a clean room, must use multiple Snowflake accounts.
* [Basic UI tutorial, two accounts](tutorials/cleanroom-web-app-tutorial.md): Demonstrates a simple overlap
  analysis and provider activation using two Snowflake accounts.
* [Basic API tutorial, single account](../tutorials/cleanroom-api-tutorial-basic.md): Demonstrates using the API to
  create and run a custom template using a single Snowflake account.

## Sample notebooks and worksheets

Many of the use case topics include full running samples of Snowflake Data Clean Rooms as downloadable notebooks or worksheets. You
need a Snowflake account with the clean rooms API environment installed to run any of these samples, and you must be able to use the
SAMOOHA_APP_ROLE role.

> **Tip:**
>
> * **To upload a notebook,** follow the [instructions for uploading a notebook](../../ui-snowsight/notebooks-create.md).
> * **To upload a worksheet:**
>
>   1. Open the workspaces panel: In the navigation menu, select Projects » Workspaces.
>   2. Upload the SQL worksheet: In the workspace menu, select + Add new » Upload Files.

### List of sample files

* **Internal testing clean room:** Jupyter notebook demonstrating how to use a single account to act as both provider and consumer for
  testing purposes.

  + [`Download the notebook`](../../../_downloads/980474433a279b8dd7a9409b77b0f54d/internal-testing-cleanroom.ipynb)
* **Consumer-run analysis:** Code for running a basic consumer analysis clean room using separate provider and consumer accounts.

  + [`Download the consumer worksheet`](../../../_downloads/d898d27c6c1b81d0b16575285b2e0873/c-run-analysis-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/74f5e256a72d109f3bf5b741432911cd/c-run-analysis-p.sql)
* **Provider-run analysis:** Jupyter notebook showing how a provider can run an analysis in a clean room.

  + [`Download the notebook`](../../../_downloads/6f09db32770533d503e9578e38467b8f/provider-analysis-notebook.ipynb)
* **Consumer-run consumer activation:** Code for activating analysis results to the consumer’s own Snowflake account, with setup and
  activation for both consumer and provider.

  + [`Download the consumer worksheet`](../../../_downloads/edfeb20ec5896fb323528481c1ea3490/c-run-c-activation-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/3cdbbfc219b944cb8d7cb49014a520e4/c-run-c-activation-p.sql)
* **Consumer-run provider activation:** Code for activating analysis results to the provider’s Snowflake account, with setup and activation
  for both consumer and provider.

  + [`Download the consumer worksheet`](../../../_downloads/76e14e2219bbbe735da2790398954b80/c-run-p-activation-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/249d42fdba29da93cd25d75850a016a9/c-run-p-activation-p.sql)
* **Provider-run provider activation:** Code for provider-run analysis with provider activation.

  + [`Download the consumer worksheet`](../../../_downloads/4430a12585695047da96b61a06593dd6/p-run-p-activation-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/0ff6c0608e468a18e039163d4953ee52/p-run-p-activation-p.sql)
* **Consumer-defined templates:** Code for creating, submitting, and managing consumer-written templates in a clean room.

  + [`Download the consumer worksheet`](../../../_downloads/56922f46a21ef92d28a78e521f593230/consumer-template-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/333d148177d16d9faaf78198f0f6cc21/consumer-template-p.sql)
* **Provider-defined templates:** Code for creating, managing, and using provider-created templates in a clean room.

  + [`Download the consumer worksheet`](../../../_downloads/0773544e11ffd9a39d5fdf82dada99de/provider-template-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/80e34f3c8c5c6c2e38ea4e078375f3d8/provider-template-p.sql)
* **Consumer-written UDFs:** Code for uploading and using custom Python functions in a clean room.

  + [`Download the consumer worksheet`](../../../_downloads/52519c3df63da0cbe3838f0878fbaec3/consumer-udf-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/9e9507050ea9767daf56d8c94a892579/consumer-udf-p.sql)
* **Provider-written UDFs:** Code for uploading and using provider-uploaded custom Python functions in a clean room.

  + [`Download the consumer worksheet`](../../../_downloads/f9606ce3e3ad2dbd62a3fe9735894869/provider-udf-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/d5c64053435e55dc171af58a492f947f/provider-udf-p.sql)
  + [`Bulk UDF uploading example (single-account worksheet)`](../../../_downloads/e3c5e0dab78085f95d314b4ce2e04c4e/upload-multiple-python-packages.sql)
* **UDF from stage:** Jupyter notebook demonstrating how to load user-defined functions from a Snowflake stage.

  + [`Download the notebook`](../../../_downloads/c9458d589eac4e4354d19501fa9f1707/udf_from_stage.ipynb)
* **Snowpark UDFs:** Code for creating and using Snowpark-based user-defined functions in clean rooms.

  + [`Download the consumer worksheet`](../../../_downloads/26b752753607b54bbd64faa7c688d52e/snowpark-udf-consumer.sql)
  + [`Download the provider worksheet`](../../../_downloads/ef9713d25f1026f7271faa3e2f571a1f/snowpark-udf-provider.sql)
* **Consumer-written UDF run by the provider:** A UDF uploaded by the consumer can be run by the provider.

  + [`Download the consumer worksheet`](../../../_downloads/7879eba9c233607e8d74f47e44e4997a/p-run-c-uploaded-code-c.sql)
  + [`Download the provider worksheet`](../../../_downloads/e81d3235044a7367014ec0680eab0ddc/p-run-c-uploaded-code-p.sql)
* **Snowpark Container Services Integration:** Jupyter notebooks for integrating Snowpark Container Services in clean rooms.

  + [`Consumer notebook`](../../../_downloads/c246bff46d86438d655c7a23e1afbc67/spcs-consumer.ipynb)
  + [`Provider notebook`](../../../_downloads/109fc1643160d035866a43189b9565d0/spcs-provider.ipynb)
  + [`Spec and config files`](../../../_downloads/36c3670d9a0df2bb0358fac7e0d45255/spcs-spec-and-config.zip)
* **Audience Overlap & Segmentation:** Jupyter notebook demonstrating the Audience Overlap & Segmentation template.

  + [`Download the notebook`](../../../_downloads/44b3c72a8168d977419f51da25ef51d6/overlap-segmentation.ipynb)

## Sample templates

Snowflake Data Clean Rooms provides a few sample templates that you can download as Snowflake worksheets and implement or customize using the clean rooms API:

Inventory forecasting template:
:   This template helps publishers and advertisers forecast ad inventory availability within a secure data clean room. [Learn more and download the worksheet.](../inventory-forecasting-template.md)

Last touch attribution template:
:   This template provides a comprehensive Last Touch Attribution analysis that allows businesses to measure the effectiveness of their marketing channels. [Learn more and download the worksheet.](../last-touch-template.md)

Audience lookalike modeling template:
:   This template empowers you to discover and target new, high-value customers who mirror your most profitable existing ones. [Learn more and download the worksheet.](../lookalike-audience-modeling-template.md)

## Videos

Our solutions engineers have created the following videos to demonstrate clean room usage. Watch them individually, or [subscribe to our playlist](https://www.youtube.com/playlist?list=PLavJpcg8cl1HrorP5u5VkoywZo5YMewxC).

[Native App Installation](https://youtu.be/FC4Ug95vepM?si=3TnuZDhOhl02V3LD):
:   How to install the Snowflake Data Clean Room environment in your account.

[Freeform SQL](https://youtu.be/847XBdAiam8?si=a9bAEmi8l566Qlbt):
:   How to make free-form SQL queries in Snowflake Data Clean Rooms.

[Editing A Clean Room](https://youtu.be/xMXrSiPBjrU?si=hBUO_1pi4d2hWyDr):
:   How to configure a clean room in the API or UI.

[Cross-Cloud Auto-Fulfillment](https://youtu.be/8BO2GwlZpJQ?si=mLQsLlq_GoAIS496):
:   How to enable Cross-Cloud Auto-Fulfillment in your clean rooms.

---
title: Snowflake Data Clean Room: External data from an Amazon S3 bucket
source: https://docs.snowflake.com/en/user-guide/cleanrooms/external-data-aws.md
section: Clean Rooms
---

# Snowflake Data Clean Room: External data from an Amazon S3 bucket

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

Data analyzed in a [Snowflake Data Clean Room](overview.md) can be native to Snowflake, reside externally
in cloud provider storage, or both. A *connector* allows collaborators to access external data from a cloud provider from within the clean
room.

The external data connector uses [Snowflake external tables](../tables-external-intro.md) to make data
available. Be aware that there is an increased security risk associated with linking external tables in a clean room. As a result,
the provider must explicitly allow the use of external tables in the clean room before
consumers can use a connector to include external data. If the provider uses the external data connector, the consumer is warned that
external tables are being used so they can decide whether to install the clean room.

This topic describes how to use a connector so that clean room analysts can access external data from an Amazon S3 bucket.

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Prerequisites

To use the connector for external data:

* The provider must explicitly [allow the use of external tables in the clean room](register-data.md).
* Files must be in parquet format.

## Connecting to an S3 bucket

The process of allowing clean room collaborators to access data from Amazon S3 storage consists of the following steps:

1. In AWS, complete these procedures:

   1. Create an IAM policy with specific permissions.
   2. Create an IAM role that references the new IAM policy.
   3. Copy the identifiers of the S3 bucket and IAM role.
2. In the clean room environment, create the connector.
3. In AWS, update the IAM role with the service account identifiers from the clean
   room environment.
4. In the clean room environment, authenticate the connector with AWS.

### Create an IAM policy in AWS

Snowflake recommends that you create a dedicated IAM policy for the connector that includes the necessary permissions to access the S3 bucket. In a
subsequent step, you add this policy to an IAM role that represents the identity of the connector.

To complete this procedure, you need to know the region of the account associated with the clean room environment.

* To find the region of the account associated with the clean room environment, [log in to the clean room](v1/web-app-introduction.md), and select Connectors » Cleanrooms » Snowflake.

To create an IAM policy that contains permissions to the S3 bucket:

1. Sign in to the AWS Management Console.
2. From the Console Home dashboard, select Identity and Access Management (IAM).
3. In the left navigation, select Account settings.
4. In the Security Token Service (STS) section, find the region of the account associated with the clean room environment, and
   toggle it to Active.
5. In the left navigation, select Policies.
6. Select Create policy.
7. In the Policy editor section, select JSON.
8. Copy and paste the following policy body into the policy editor, and then edit the JSON to include your bucket name (`<bucket>`) and folder
   path prefix (`<prefix>`):

   > ```json
   > {
   >   "Version": "2012-10-17",
   >   "Statement": [
   >     {
   >       "Effect": "Allow",
   >       "Action": [
   >         "s3:GetObject",
   >         "s3:GetObjectVersion"
   >       ],
   >       "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
   >     },
   >     {
   >       "Effect": "Allow",
   >       "Action": [
   >         "s3:ListBucket",
   >         "s3:GetBucketLocation"
   >       ],
   >       "Resource": "arn:aws:s3:::<bucket>",
   >       "Condition": {
   >         "StringLike": {
   >           "s3:prefix": [
   >             "<prefix>/*"
   >           ]
   >         }
   >       }
   >     }
   >   ]
   > }
   > ```
   >
   > Be sure to keep the `:::` format. For example, if your S3 bucket URI is `s3://sales/customers/`, the value of the `Resource` JSON field is `arn:aws:s3:::sales/customers/*`.
9. Select Next.
10. Enter a policy name (for example, `snowflake_cleanroom_access`), and then select Create policy.

### Create an IAM role in AWS

The AWS IAM role represents the identity of the connector. During the creation process, you associate the
role with the new IAM policy that grants permissions needed by the connector to access the S3 bucket.

To create a new IAM role:

1. From the Console Home dashboard in AWS, select Identity and Access Management (IAM).
2. In the left navigation, select Roles.
3. Select Create role.
4. In the Trusted entity type section, select AWS account.
5. In the An AWS account section, select Another AWS account.
6. In the Account ID field, enter a temporary placeholder value that contains 12 digits (for example, the account identifier of
   the current AWS account). You will replace this value later.
7. Select Require external id, and then enter a temporary placeholder value, such as `0000`. You will replace this value later.
8. Select Next.
9. In the Permissions policies section, find the policy that you created in the previous procedure, and select its check box.
10. Select Next.
11. Enter a role name (for example, `snowflake_cleanroom_connector`), and then select Create role.

### Copy the S3 bucket and IAM role identifiers

When creating the connector in the clean room environment, you need the identifiers of the S3 bucket and the IAM role. Before creating
the connector, follow these steps to copy and save these identifiers:

To copy the IAM role identifier:

1. From the Console Home dashboard in AWS, select Identity and Access Management (IAM).
2. In the left navigation, select Roles.
3. Find the role that you created in the previous procedure, and select it to open
   it.
4. In the Summary section, find the ARN and select the copy icon. Save this role identifier for a later step.

To copy the S3 bucket identifier:

1. From the Console Home dashboard in AWS, select S3.
2. Find the name of your S3 bucket and select it to open it. The bucket must contain the data that you want to include in the clean room.
3. Navigate into the prefix of the bucket, and then select Copy S3 URI. Save this bucket identifier for a later step.

   Don’t try to select the button in the Objects section.

### Create a connector and copy the service account details

You are now ready to create the connector in the clean room environment. After you create the connector, copy details about
its service account so it can be associated with the IAM role in AWS.

To create the connector in your clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors, and then expand the Amazon Web Services section.
3. In the AWS Role ARN field, enter the identifier of the IAM role that you copied from AWS, such as
   `arn:aws:iam::772412615275:role/mub00002_vhb71832_role`.
4. In the S3 Bucket URI field, enter the identifier of the S3 bucket that you copied from AWS, such as
   `s3://sales/customer_data/`.
5. Select Create.

   The clean room generates a service account that it uses to access AWS.
6. Use the copy icon to copy the Principal and External ID identifiers of the connector’s service account, and save them for
   the next procedure.

### Update the IAM role with service account details

You are now ready to update the IAM role with the identifiers associated with the connector’s service account. To update the IAM role:

1. Sign in to the AWS Management Console.
2. From the Console Home dashboard, select Identity and Access Management (IAM).
3. In the left navigation, select Roles.
4. Find the role that you created earlier, and select it to open it.
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the JSON of the trust policy to include the identifiers from the connector’s service account. You copied these identifiers
   earlier. Make the following changes to the JSON:

   * Replace the value of the `AWS` JSON field with the Principal value you copied from the clean room environment.

     In the following example, the value of Principal in the clean room environment is `arn:aws:iam::115136555074:user/x4gy-s-p2345g38`.
   * Replace the value of the `sts:ExternalId` JSON field with the External ID value you copied from the clean room environment.

     In the following example, the value of External ID in the clean room environment is `UCA56729_SFCRole=4447_uht2344sdf3mrWLNRM0y3bE=`.

     > ```json
     > {
     >   "Version": "2012-10-17",
     >   "Statement": [
     >     {
     >       "Sid": "Statement1",
     >       "Effect": "Allow",
     >       "Principal": {
     >         "AWS": "arn:aws:iam::115136555074:user/x4gy-s-p2345g38"
     >       },
     >       "Action": "sts:AssumeRole",
     >       "Condition": {
     >         "StringEquals": {
     >           "sts:ExternalId": "UCA56729_SFCRole=4447_uht2344sdf3mrWLNRM0y3bE="
     >         }
     >       }
     >     }
     >   ]
     > }
     > ```
8. Select Update policy.

### Authenticate the connector

You are now ready to authenticate the connector to make sure it can access the S3 bucket. To authenticate the connector:

1. If you are signed out of the clean room environment, see [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. In the clean room environment, select Connectors and expand the Amazon Web Services section.
3. Select the S3 bucket you are connecting to, and then select Authenticate.

## Remove access to external data on AWS

To remove access to an S3 bucket from a clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors and expand the Amazon Web Services section.
3. Find the S3 bucket that is currently connected, and then select the trash can icon.

---
title: Snowflake Data Clean Room: External data from Azure Blob Storage
source: https://docs.snowflake.com/en/user-guide/cleanrooms/external-data-azure.md
section: Clean Rooms
---

# Snowflake Data Clean Room: External data from Azure Blob Storage

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

Data analyzed in a [Snowflake Data Clean Room](overview.md) can be native to Snowflake, reside externally
in cloud provider storage, or both. A *connector* allows collaborators to access external data from a cloud provider from within the clean
room.

The external data connector uses [Snowflake external tables](../tables-external-intro.md) to make data
available. Be aware that there is an increased security risk associated with linking external tables in a clean room. As a result,
the provider must explicitly allow the use of external tables in the clean room before
consumers can use a connector to include external data. If the provider uses the external data connector, the consumer is warned that
external tables are being used so they can decide whether to install the clean room.

This topic describes how to use a connector so clean room analysts can access external data from Azure Blob Storage.

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Prerequisites

To use the connector for external data:

* The provider must explicitly [allow the use of external tables in the clean room](register-data.md).
* Files must be in parquet format.

## Connect to Azure Blob Storage

Allowing clean room collaborators to access data from Azure Blob Storage consists of the following steps:

1. In Azure, obtain the identifiers of the blob storage.
2. In the clean room environment, create the connector.
3. Use the clean room environment to initiate the process of
   granting permissions to the connector, then complete the process in Microsoft.
4. In the clean room environment, authenticate the connector with Azure.

The following sections discuss these steps in more detail.

### Obtain identifiers associated with blob storage

The clean room connector needs the tenant ID associated with Azure Blob Storage and the URL that uniquely identifies the blob
storage that the clean room needs to access. Before creating the connector, you must obtain both of these identifiers from Azure.

> **Note:**
>
> Microsoft changed the name of Azure Active Directory to Microsoft Entra ID.

To obtain the tenant ID that establishes a trust relationship between Azure Blob Storage and Microsoft Entra ID:

1. Sign in to the Microsoft Azure portal.
2. From the home dashboard, select Microsoft Entra ID » Properties.
3. Find the Tenant ID field and select the copy icon. You will use this identifier when you
   create the connector.

To obtain the URL that uniquely identifies the blob storage:

1. Sign in to the Microsoft Azure portal.
2. From the home dashboard, select Storage Accounts.
3. Navigate the storage account until you see the blob storage folder in the list. This folder must contain the data that you want to
   include in the clean room.
4. Find the blob storage folder in the list, and select … more menu » Copy URL. You will use this identifier when you
   create the connector.

### Create the connector and copy the service principal identifier

You are now ready to create the connector in the clean room environment. Once you have created the connector, you will need to copy the
identifier of the Azure service principal that is associated with the clean room environment.

To create the connector in your clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors, then expand the Microsoft Azure section.
3. In the Tenant ID field, enter the tenant ID that you copied in the
   previous step.
4. In the Path URL field, enter the URL of the blob storage that you copied in the
   previous step, then replace `https://` with `azure://` in the URL.
5. Select Create.
6. Use the copy icon to copy the identifier of the Azure service principal that is now associated with the clean room environment, and
   save it for the next task. Azure uses service principals to grant access to applications.

### Grant permissions to the connector

Clean rooms need permission to access external data in Azure Blob Storage. The process of granting these permissions begins in the clean
room environment and ends in Microsoft.

To grant permissions to the connector:

1. In the clean room environment, select Connectors and expand the Microsoft Azure section. If you are signed out of the
   clean room, see [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. Select Consent URL. A Microsoft dialog appears.
3. In the Microsoft dialog, ensure that Consent on behalf of your organization is selected, then select Accept.

   Microsoft grants the Azure service principal associated with the clean room environment an access token to the blob storage inside of
   your tenant.
4. In a new browser window, sign in to the Microsoft Azure portal.
5. From the home dashboard, select Storage Accounts.
6. Select the storage account that contains the blob storage.
7. Select Access Control (IAM).
8. Select Add role assignment.
9. Select Storage Blob Data Reader to grant read-only access to a Azure service principal, then select Next.
10. On the Members tab, select + Select members.
11. Search for the service principal associated with the clean room environment. You copied its identifier in a
    previous step.

    > **Tip:**
    >
    > Microsoft can take over an hour to create the service principal for the clean room environment. If you cannot find the service
    > principal in the list, wait 1-2 hours, then try to complete this step again.
12. Select Review + assign.

### Authenticate the connector

You are now ready to authenticate the connector to make sure it can access Azure Blob Storage. To authenticate the connector:

1. In the clean room environment, select Connectors and expand the Microsoft Azure section. If you are signed out of the
   clean room, see [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. Select the blob storage you are connecting to, and select Authenticate.

## Remove access to external data on AWS

To remove access to Azure Blob Storage from a clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors and expand the Microsoft Azure section.
3. Find the blob storage that is currently connected, and select the trash can icon.

---
title: Snowflake Data Clean Room: External data from Google Cloud Platform
source: https://docs.snowflake.com/en/user-guide/cleanrooms/external-data-gcp.md
section: Clean Rooms
---

# Snowflake Data Clean Room: External data from Google Cloud Platform

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

Data analyzed in a [Snowflake Data Clean Room](overview.md) can be native to Snowflake, reside externally
in cloud provider storage, or both. A *connector* allows collaborators to access external data from a cloud provider from within the clean
room.

The external data connector uses [Snowflake external tables](../tables-external-intro.md) to make data
available. Be aware that there is an increased security risk associated with linking external tables in a clean room. As a result,
the provider must explicitly allow the use of external tables in the clean room before
consumers can use a connector to include external data. If the provider uses the external data connector, the consumer is warned that
external tables are being used so they can decide whether to install the clean room.

This topic describes how to use a connector so clean room analysts can access external data from a Google Cloud Platform bucket.

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Prerequisites

To use the connector for external data:

* The provider must explicitly [allow the use of external tables in the clean room](register-data.md).
* Files must be in parquet format.

## Connect to a Google Cloud Platform bucket

Allowing clean room collaborators to access data from Google Cloud Platform (GCP) storage consists of the following steps:

1. In GCP, obtain the URL of the GCP bucket.
2. In the clean room environment, create the connector.
3. In GCP, grant permissions to the connector.
4. In the clean room environment, authenticate the connector with GCP.

The following sections discuss these steps in more detail.

### Obtain the URL of the GCP bucket

The clean room connector needs the URL of the GCP storage bucket in order to access the data. Before creating the connector, you must:

1. Sign in to Google Cloud Platform Console as a project editor.
2. From the Console dashboard, select Cloud Storage » Browser.
3. Select the bucket that contains the data you want to access from the clean room, and navigate to the location of that data. The bucket
   cannot be empty.
4. Select the copy icon to copy the URL of the storage bucket and save it for the next task.

### Create the connector and copy the service account identifier

You are now ready to create the connector in the clean room environment. Once you have created the connector, you need to copy details about
its service account so it can be associated with the bucket in GCP. To create the connector in your clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors, then expand the Google Cloud section.
3. In the Storage bucket URL field, enter the URL that you copied from GCP, then replace `https://` with `gcs://` in the URL.
4. Select Create. The clean room generates a service account that it uses to access GCP.
5. Use the Copy icon to copy the identifier of the service account, and save it for the next task.

### Grant permissions to the connector

Clean rooms need permission to access external data in the GCP bucket. Granting these permissions consists of creating a dedicated GCP role
for the connector’s service account, then adding the service account as a principal of the GCP bucket.

To create the dedicated GCP role for the connector’s service account:

1. Sign in to the Google Cloud Platform Console as a project editor.
2. From the Console dashboard, select IAM & admin » Roles.
3. Select Create Role.
4. Enter a name and description for the role.
5. Select Add Permissions, then add the following permissions:

> * `storage.buckets.get`
> * `storage.objects.list`
> * `storage.objects.get`

Now that you have created a dedicated role, you are ready to associate the connector’s service account as a principal of the GCP bucket.
To associate the service account:

1. Sign in to Google Cloud Platform Console as a project editor.
2. From the Console dashboard, select Cloud Storage » Browser.
3. Select the bucket that contains the external data.
4. Select Show Info Panel. The information panel slides open.
5. Select Add Principals.
6. In the New Principals text box, paste the service account identifier that you copied from the clean room.
7. From the Select a role drop-down list, select the dedicated role you created for the service account.

### Authenticate the connector

You are now ready to authenticate the connector to make sure it can access the GCP bucket. To authenticate the connector:

1. In the left navigation of the clean room, select Connectors and expand the Google Cloud section. If you are signed out of the
   clean room, see [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. Select the GCP bucket you are connecting to, and select Authenticate.

## Remove access to external data on GCP

To remove access to a GCP bucket from a clean room environment:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors, then expand the Google Cloud section.
3. Find the GCP bucket that is currently connected, and select the trash can icon.

---
title: Snowflake Data Clean Rooms Collaboration API
source: https://docs.snowflake.com/en/user-guide/cleanrooms/collaboration-api-reference.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms Collaboration API

## Introduction

This is the reference page for the Snowflake Data Clean Rooms Collaboration API. This API uses the COLLABORATION and REGISTRY
schemas.

> **Note:**
>
> You should disable secondary roles in your environment when using the Collaboration API:
>
> ```sqlexample
> USE SECONDARY ROLES NONE;
> ```

To learn how to set up your development environment, see [Setting up your environment](developer-guide.md).

To learn how to manage access to the Collaboration API procedures, see [Use DCR privileges to manage account, object, and procedure privileges](manage-access.md).

### Metadata cheat sheet

Here is how to find some commonly sought information about a collaboration:

| To learn this… | Call this |
| --- | --- |
| What collaborations can I join? | VIEW_COLLABORATIONS - Look for collaborations where the `collaboration_name` column is NULL. |
| Which collaborations have I joined? | VIEW_COLLABORATIONS - Look for collaborations where the `collaboration_name` column is not NULL, which can mean either that you have created or joined the collaboration. |
| Which collaborations do I own? | VIEW_COLLABORATIONS - Look in the `owner_account` column. |
| What is the status of all collaborators in a collaboration? | GET_STATUS |
| What is my join or creation status in a collaboration? | GET_STATUS or VIEW_COLLABORATIONS |
| Who owns a given collaboration? | GET_STATUS - Look for OWNER in the `roles` column. |
| What is my collaboration role in a given collaboration? | GET_STATUS - Look in the `roles` column. |
| What collaboration roles are assigned in a given collaboration? | GET_STATUS - Look in the `roles` column. |
| What is the spec in a given collaboration? | VIEW_COLLABORATIONS - Look in the `collaboration_spec` column. |
| Is the spec up to date? | There is no way to tell if a given spec has changes in progress, but you can call VIEW_COLLABORATIONS to see when the latest updates were applied. |
| What pending update requests do I have? | `VIEW_UPDATE_REQUESTS`. Look for rows where STATUS = PENDING_MY_APPROVAL. |
| Show me the spec for a given collaboration | REVIEW returns the collaboration spec. If you have already called REVIEW or joined the collaboration, call the following SQL command with your collaboration name as indicated:  ```sqlexample CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.   VIEW_COLLABORATIONS() ->>     SELECT "COLLABORATION_SPEC" FROM $1       WHERE "SOURCE_NAME" = <collaboration name>; ``` |

## Template procedures

### REGISTER_TEMPLATE

Schema:
:   REGISTRY

Registers a template to enable it to be used in a collaboration. Every template registered must have a unique
name-version combination for all templates in all registries in your account.

#### Syntax

```sqlsyntax
REGISTER_TEMPLATE( ['<registry_name>' ,] <template_spec> )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) in which to register this template. If not specified, registers the template in the default account registry.

`template_spec`
:   [Template definition](spec-template.md) in YAML format, as a string.

#### Returns

A template ID to use in the collaboration specification.

#### Examples

Register a template in the default registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: my_test_template
  version: 2026_01_12_V1
  type: sql_analysis
  description: A test template
  template:
    SELECT * FROM IDENTIFIER({{ source_table[0] }}) LIMIT 10;
$$);
```

Register a template in a custom registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  'my_custom_registry',
  $$
  api_version: 2.0.0
  spec_type: template
  name: my_test_template
  version: 2026_01_12_V1
  type: sql_analysis
  description: A test template
  template:
    SELECT * FROM IDENTIFIER({{ source_table[0] }}) LIMIT 10;
$$);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedures.

To register objects in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER TEMPLATE', 'role name')`

To register items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### VIEW_REGISTERED_TEMPLATES

Schema:
:   REGISTRY

Lists all templates that you have registered. To register a template, call REGISTRY.REGISTER_TEMPLATE.

#### Syntax

```sqlsyntax
VIEW_REGISTERED_TEMPLATES( [ '<registry_name>' ] )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) to list templates from. If not specified, lists templates from the default account registry.

#### Returns

A table that lists the details of all templates that you have registered in this account. The table includes the following columns:

* `TEMPLATE_ID`: ID of the template.
* `NAME`: Template name.
* `VERSION`: Template version.
* `TEMPLATE_SPEC`: Full YAML specification of the template.
* `REGISTRY`: Registry the template is registered in.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_TEMPLATES();
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures.

To see items in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED TEMPLATES', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

To see items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### ADD_TEMPLATE_REQUEST

Schema:
:   COLLABORATION

Sends a request to link a template to an existing collaboration. If the sender is affected by the request, the sender automatically approves the request; all other affected collaborators must approve the request for the change to be applied. All collaborators need to call this procedure to link a template to an existing collaboration, even the collaboration owner.

To add additional template sharers, you can call this procedure again with their aliases. Each call adds the users listed in `share_with` to the existing list of sharers.

To see the status of the request, call VIEW_UPDATE_REQUESTS.

[See the link template flow.](resources-templates.md)

#### Syntax

```sqlsyntax
ADD_TEMPLATE_REQUEST( <collaboration_name>, <template_id>, <share_with> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to link the template to.

`template_id`
:   ID of the template to link to the collaboration. Register the template to get this value.

`share_with`
:   Array of *aliases* of analysis runners to share this template with. Collaborators listed here will be added in addition to any other collaborators associated with this template. All collaborators listed here must be analysis runners or the procedure will fail without sharing this template with anyone.

#### Returns

A string success message.

#### Example

```sqlexample
-- Ask to link the template only for Collaborator3 in this collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ADD_TEMPLATE_REQUEST(
  $collaboration_name,
  $template_alias,
  ['Collaborator3']
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* Either of the following privileges:

  + `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.
  + `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.
* If the template is in a custom registry, you must also have `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE( 'READ', 'registry name', 'role name')`

---

### REMOVE_TEMPLATE

Schema:
:   COLLABORATION

Asynchronous request to remove a template from a given collaboration for specified collaborators. Only the collaborator that registered the
template can remove a template. No approval is needed from anyone else to remove a template that you have registered. When a template is
removed for a collaborator, that collaborator can’t see or use the template.

#### Syntax

```sqlsyntax
REMOVE_TEMPLATE( <collaboration_name>, <template_id>, <remove_for> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to remove the template from.

`template_id`
:   ID of the template to remove from the collaboration.

`remove_for`
:   Array of one or more *aliases* of analysis runners in this collaboration that should no longer be able to see or use this template.

#### Returns

A string success message. To see if a template has been removed for a collaborator, view the collaboration specification.

#### Example

```sqlexample
-- Prevent collaborator_1234 from using the specified template
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.REMOVE_TEMPLATE(
  $collaboration_name,
  $template_id,
  ['collaborator_1234']
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* Either of the following privileges:

  + `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.
  + `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.
* If the template is in a custom registry, or references a code spec in a custom registry, you must also have `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE( 'READ', 'registry name', 'role name')`

---

### VIEW_TEMPLATES

Schema:
:   COLLABORATION

Shows all templates that you can run, or that you have submitted, to the specified collaboration.

#### Syntax

```sqlsyntax
VIEW_TEMPLATES( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration. You must review or join this collaboration before you can list its templates.

#### Returns

A table that lists information about templates that you can run in this collaboration, including templates that you have registered. The
table includes the following columns:

* `template_id`: The template ID. Pass this into the `template` field or `template_id` parameter of your RUN command.
* `template_spec`: The [template specification](resources-templates.md) for this template, which
  includes the full [JinjaSQL](custom-templates.md) for this template.
* `parameters`: A description of all the arguments accepted by this template, in JSON format. The information about each parameter
  includes the name, default value, template-provider-written description, and whether it is required. Pass values for these parameters
  into your RUN command.
* `shared_by`: The collaborator that registered this template.
* `shared_with`: The collaborators that this template is shared with.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW TEMPLATES', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### ENABLE_TEMPLATE_AUTO_APPROVAL

Schema:
:   COLLABORATION

Causes all template update requests sent by other collaborators to be approved automatically. Requests will still appear in the request log.
This affects only requests sent after auto-approval was enabled.

#### Syntax

```sqlsyntax
ENABLE_TEMPLATE_AUTO_APPROVAL( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A string success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ENABLE_TEMPLATE_AUTO_APPROVAL(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### DISABLE_TEMPLATE_AUTO_APPROVAL

Schema:
:   COLLABORATION

Disables automatic approval for template requests raised by other collaborators. All future requests must be approved manually by calling APPROVE_UPDATE_REQUEST.

#### Syntax

```sqlsyntax
DISABLE_TEMPLATE_AUTO_APPROVAL( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A string success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.DISABLE_TEMPLATE_AUTO_APPROVAL(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Data offering procedures

### REGISTER_DATA_OFFERING

Schema:
:   REGISTRY

Registers a data offering so that it can be linked to a collaboration definition. You cannot unregister a registered data offering. You can’t overwrite an existing data offering, but you can register a new one with the same name and a new version. Creating a new version of a data offering doesn’t remove any earlier versions.

Every data offering must have a unique name-version combination for all data offerings in all registries in your account.

If you want to share this table with others in the collaboration, include the table in the collaboration specification before the collaboration is created.

You must have the REFERENCE_USAGE privilege with GRANT OPTION on any data that you share in a collaboration. If you do not, you will get a “missing reference usage grant” error when you try to join the collaboration or register the object. [Learn how to handle this issue.](v2/troubleshooting.md)

#### Syntax

```sqlsyntax
REGISTER_DATA_OFFERING( ['<registry_name>' ,] <data_offering_spec> )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) in which to register this data offering. If not specified, registers the data offering in the default account registry.

`data_offering_spec`
:   A [data offering definition](spec-data-offering.md) in YAML format that describes this data offering.

#### Returns

The data offering ID to use in a collaboration’s `data_offerings.id` field.

#### Examples

Register a data offering in the default registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v1
    name: customers
    datasets:
     - alias: customers_1
       data_object_fqn: SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_custom
         status:
           category: passthrough
    $$
  );
```

Register a data offering in a custom registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
    'my_custom_registry',
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v1
    name: customers
    datasets:
     - alias: customers_1
       data_object_fqn: SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_custom
         status:
           category: passthrough
    $$
  );
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedures.

To register a data offering in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER DATA OFFERING', 'role name')`

To register items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### LINK_DATA_OFFERING

Schema:
:   COLLABORATION

A data provider runs this procedure to update an existing collaboration by making the specified data offering available to the specified analysis runners. This is an asynchronous procedure; analysis runners should call VIEW_DATA_OFFERINGS to see when the data offering is available to be used.

This procedure is *additive*, meaning that the collaborators you specify are added to the existing list of data offering sharers.

If you want to use this table but not make it visible to other collaborators, call LINK_LOCAL_DATA_OFFERING instead of LINK_DATA_OFFERING.

> **Important:**
>
> LINK_DATA_OFFERING can currently only be called by the role that created or joined the collaboration.

You cannot have an active secondary role when you run this procedure. Run the following SQL code to disable any secondary roles:

```sqlexample
USE SECONDARY ROLES NONE;
```

This procedure is atomic: all of the following conditions must be met for this procedure to succeed. If the link attempt fails for any one collaborator, it fails for all of them.

* All of the specified collaborators must be analysis runners.
* This data offering must not already be shared with any of the specified analysis runners.
* This procedure can be run only by a user with the data provider collaboration role who has joined the collaboration.

You must have the REFERENCE_USAGE privilege with GRANT OPTION on any data that you wish to share. If you don’t, you’ll get a “missing reference usage grant” error when you try to join the collaboration. [Learn how to handle this issue.](v2/troubleshooting.md)

#### Syntax

```sqlsyntax
LINK_DATA_OFFERING( <collaboration_name>, <data_offering_id>, <share_with> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`data_offering_id`
:   ID of the dataset to share, generated when it was registered. The data offering must be visible to you when you call VIEW_DATA_OFFERINGS or VIEW_REGISTERED_DATA_OFFERINGS to be able to link it.

`share_with`
:   Array of string aliases of analysis runners to share this dataset with. Collaborators listed here will be added in addition to any other collaborators associated with this data offering. All collaborators listed here must be analysis runners that you are a data provider for, or the procedure will fail without sharing data with anyone.

#### Returns

A string success message.

#### Example

This example allows collaborator `alice` to use the specified data offering in the specified collaboration.

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LINK_DATA_OFFERING(
  $collaboration_name,
  $my_data_id,
  ['alice']
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`, plus all additional account-level privileges that must be manually granted to the role.

If the data offering is in a custom registry, you must also have `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE( 'READ', 'registry name', 'role name')`.

---

### UNLINK_DATA_OFFERING

Schema:
:   COLLABORATION

A data provider runs this procedure to remove access to a data offering from specified analysis runners in an existing collaboration. This is an asynchronous procedure; analysis runners should call VIEW_COLLABORATIONS to confirm the data offering has been removed.

#### Syntax

```sqlsyntax
UNLINK_DATA_OFFERING( <collaboration_name>, <data_offering_id>, <remove_for> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`data_offering_id`
:   ID of the dataset to unlink, generated when it was registered.

`remove_for`
:   Array of string aliases of one or more analysis runners to remove access for. All collaborators listed here must be analysis runners that currently have access to this data offering.

#### Returns

A string success message.

#### Example

```sqlexample
-- Remove data offering access for specific analysis runners in this collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.UNLINK_DATA_OFFERING(
  $collaboration_name,
  $data_offering_id,
  ['AnalysisRunner_1', 'AnalysisRunner_2']
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### LINK_LOCAL_DATA_OFFERING

Schema:
:   COLLABORATION

Use this procedure to link your own data into a collaboration if you are using Snowflake Standard Edition. You must first register your data offerings by calling REGISTER_DATA_OFFERING. These offerings will not be visible to any other collaborator, and template policies will not be enforced. Tables submitted here propagate the `my_table` array in the template.

For more information, see [Run an analysis with your own data when you use Standard Edition](demo-flows/basic-multiparty-collab.md).

#### Syntax

```sqlsyntax
LINK_LOCAL_DATA_OFFERING( <collaboration_name>, <data_offering_id> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`data_offering_id`
:   ID of the dataset, generated when you registered it. Also visible in VIEW_REGISTERED_DATA_OFFERINGS and VIEW_DATA_OFFERINGS (to you only).

#### Returns

A string success message.

#### Example

This example links a registered data offering for use only by the current account, without exposing it to the rest of the collaborators.

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LINK_LOCAL_DATA_OFFERING(
  $collaboration_name,
  $my_private_data_offering_id
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('LINK LOCAL DATA OFFERINGS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### UNLINK_LOCAL_DATA_OFFERING

Schema:
:   COLLABORATION

Use this procedure to unlink your own local data from a collaboration. After unlinking, the data offering will no longer be available to use in analyses within this collaboration. For more information about local data offerings, see [Run an analysis with your own data when you use Standard Edition](demo-flows/basic-multiparty-collab.md).

#### Syntax

```sqlsyntax
UNLINK_LOCAL_DATA_OFFERING( <collaboration_name>, <data_offering_id> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`data_offering_id`
:   ID of the dataset to unlink, generated when you registered it. Also visible in VIEW_REGISTERED_DATA_OFFERINGS and VIEW_DATA_OFFERINGS (to you only).

#### Returns

A string success message.

#### Example

```sqlexample
-- Unlink a local data offering from a collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.UNLINK_LOCAL_DATA_OFFERING(
  $collaboration_name,
  $my_private_data_offering_id
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UNLINK LOCAL DATA OFFERINGS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### VIEW_REGISTERED_DATA_OFFERINGS

Schema:
:   REGISTRY

Lists all data offerings that you have registered. To view data offerings in a collaboration linked by others, call COLLABORATION.VIEW_DATA_OFFERINGS.

#### Syntax

```sqlsyntax
VIEW_REGISTERED_DATA_OFFERINGS( [ '<registry_name>' ] )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) to list data offerings from. If not specified, lists data offerings from the default account registry.

#### Returns

A table that lists the details of all data offerings that you have registered in this account. The table includes the following columns:

* `DATA_OFFERING_ID`: ID of the data offering.
* `NAME`: Data offering name.
* `VERSION`: Data offering version.
* `DATA_OFFERING_SPEC`: Full YAML specification of the data offering.
* `REGISTRY`: Registry the data offering is registered in.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS();
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures.

To see items in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED DATA OFFERINGS', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

To see items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### VIEW_DATA_OFFERINGS

Schema:
:   COLLABORATION

Lists all data offerings present in a specified collaboration that you can access as an analysis runner, or that you have linked yourself. To see only data offerings that you registered, call REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS.

You can see data offerings from collaborator X only after X has joined the collaboration.

#### Syntax

```sqlsyntax
VIEW_DATA_OFFERINGS( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to explore.

#### Returns

Information about all data offerings in the specified collaboration. The table includes the
following columns:

* `template_view_name`: The fully qualified view name used to reference offerings when calling RUN to query using a template. Pass this
  name into the `source_tables` field in the RUN spec.
* `template_join_columns`: Names of columns in this table that can be used in joins in template-based queries.
* `analysis_allowed_columns`: Names of columns in this table that can be projected in template-based queries.
* `activation_allowed_columns`: Names of columns in this table that can be activated.
* `freeform_sql_view_name`: The fully qualified view name used in free-form SQL queries, when the dataset supports
  [free-form SQL queries](free-form-sql.md). This cell is empty if the dataset doesn’t offer free-form SQL
  queries.
* `freeform_sql_column_policies`: A JSON representation of all [free-form column policies](spec-data-offering.md)
  in this collaboration, keyed by policy type.
* `shared_by`: The collaborator that linked this data offering.
* `shared_with`: Who can use the data in an analysis. If this value is `LOCAL`, this is a local dataset that isn’t shared with any
  collaborators except for the party that hosts the data.
* `data_offering_id`: The unique ID of this data offering, generated when it was registered.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW DATA OFFERINGS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Custom function procedures

### REGISTER_CODE_SPEC

Schema:
:   REGISTRY

Registers a code bundle. This stores the code in the clean rooms environment in the REGISTRY.CODE_SPECS table. After a code spec is registered, it can be used by a template.

Every code spec registered must have a unique name-version combination across all registries in your account.

#### Syntax

```sqlsyntax
REGISTER_CODE_SPEC( ['<registry_name>' ,] <code_spec> )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) in which to register this code spec. If not specified, registers the code bundle in the default account registry.

`code_spec`
:   Code bundle spec definition in YAML format, as a string.

#### Returns

The generated code bundle spec ID.

#### Examples

Register a code bundle in the default registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_CODE_SPEC(
  $$
  api_version: 2.0.0
  spec_type: code_spec
  name: custom_udf
  version: v1
  description: Custom UDF for data normalization

  functions:
    - name: normalize_value
      type: UDF
      language: PYTHON
      runtime_version: "3.10"
      handler: normalize
      arguments:
        - name: value
          type: FLOAT
        - name: min_val
          type: FLOAT
        - name: max_val
          type: FLOAT
      returns: FLOAT
      code_body: |
        def normalize(value, min_val, max_val):
            if max_val == min_val:
                return 0.0
            return (value - min_val) / (max_val - min_val)
  $$
);
```

Register a code bundle in a custom registry:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_CODE_SPEC(
  'my_custom_registry',
  $$
  api_version: 2.0.0
  spec_type: code_spec
  name: custom_udf
  version: v1
  description: Custom UDF for data normalization

  functions:
    - name: normalize_value
      type: UDF
      language: PYTHON
      runtime_version: "3.10"
      handler: normalize
      arguments:
        - name: value
          type: FLOAT
        - name: min_val
          type: FLOAT
        - name: max_val
          type: FLOAT
      returns: FLOAT
      code_body: |
        def normalize(value, min_val, max_val):
            if max_val == min_val:
                return 0.0
            return (value - min_val) / (max_val - min_val)
  $$
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedures.

To register objects in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REGISTER CODE SPEC', 'role name')`

To register items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### VIEW_REGISTERED_CODE_SPECS

Schema:
:   REGISTRY

Lists all code bundle specs registered by this role in the local account registry.

#### Syntax

```sqlsyntax
VIEW_REGISTERED_CODE_SPECS( [ '<registry_name>' ] )
```

#### Arguments

`registry_name` *(Optional)*
:   Name of a [custom registry](registries.md) to list code bundles from. If not specified, lists code bundles from the default account registry.

#### Returns

A table that lists the details of all code bundles that you have registered in this account. The table includes the following columns:

* `code_spec_id`: ID of the code bundle spec.
* `name`: Code bundle spec name.
* `version`: Code bundle spec version.
* `code_spec`: Full YAML specification of the code bundle spec.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_CODE_SPECS();
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures.

To see items in the default registry:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTERED CODE SPECS', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

To see items in a custom registry:

* You have read and write privileges on any custom registry that you created yourself.
* To access a custom registry created by another user, you need `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'MY_REGISTRY', 'role name')`.

---

### VIEW_CODE_SPECS

Schema:
:   COLLABORATION

Returns all code bundle specs that are referenced by any template that you created or can run in the specified collaboration.

#### Syntax

```sqlsyntax
VIEW_CODE_SPECS( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A table that lists the code bundles available in the specified collaboration. The table includes the following columns:

* `code_spec_id`: ID of this code bundle spec.
* `code_spec`: Full YAML specification of the code bundle spec.
* `shared_by`: Collaborator alias that shared the code bundle spec.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_CODE_SPECS(
  $collaboration_id
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW CODE SPECS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Update request procedures

These procedures are used to manage collaboration update requests that require approval, such as
the [link template flow.](resources-templates.md)

### VIEW_UPDATE_REQUESTS

Schema:
:   COLLABORATION

See all update requests that you have created or that you can approve or deny, in the specified collaboration. This includes all collaboration changes such as adding data offerings, templates, and code packages. This procedure shows the status of the update.
It can take a few seconds for an update request to appear in the request list, so you might not see a request that you just sent a moment ago.

[See the link template flow.](resources-templates.md)

#### Syntax

```sqlsyntax
VIEW_UPDATE_REQUESTS( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A table of update requests sent in this collaboration. Information includes

* `id`: ID of the request. Use this to approve or deny a request.
* `type`: Type of request. The following values are supported:

  + Add Template
  + Link Data Offering
  + Unlink Data Offering
  + Remove Template
* `status`: Current status of the request. The following statuses can be reported:

  + REQUESTED: The request has been submitted.
  + PENDING_MY_APPROVAL: The request is awaiting your approval or rejection.
  + PENDING_PARTNER_APPROVAL: You have approved the request, but the request still needs to be approved by one or more other collaborators.
  + REJECTED: Someone in the collaboration rejected this request.
  + APPROVED: All required approvers have approved the request.
  + COMPLETED: The update action has been completed and changes applied to the collaboration. For templates that include a code bundle, you
    should still [check the upgrade state](resources-code-bundles.md) to see when the code bundle is ready to be called.
  + FAILED: The update action has failed. See the `DETAILS` column for failure details.
* `approval_log`: Log of all approvals and rejections of the request. If the request is rejected, the reason given by the rejecting party is also provided here.
* `details`: Details specific to the request type, such as the template name, description, and whom it is shared with for an ‘Add Template’ request.
* `spec`: The details of the resource being updated, such as template specification for an ‘Add Template’ request.
* `updated_on`: The timestamp when the last action was taken on this request (for example, an approval or rejection).

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_UPDATE_REQUESTS(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW UPDATE REQUESTS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### APPROVE_UPDATE_REQUEST

Schema:
:   COLLABORATION

Approves a collaboration update request. See your list of pending requests by calling VIEW_UPDATE_REQUESTS. Once you approve a request, you cannot reject it later.

All affected collaborators must approve a request before the change is actually applied to the collaboration.

[See the link template flow.](resources-templates.md)

#### Syntax

```sqlsyntax
APPROVE_UPDATE_REQUEST( <collaboration_name>, <request_id> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`request_id`
:   ID of the request.

#### Returns

A string success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.APPROVE_UPDATE_REQUEST(
  $collaboration_name,
  $request_id
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE UPDATE REQUEST', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### REJECT_UPDATE_REQUEST

Schema:
:   COLLABORATION

Rejects a collaboration update request. A single rejection prevents the change from being applied to the collaboration. You cannot approve a request after rejecting it.

#### Syntax

```sqlsyntax
REJECT_UPDATE_REQUEST( <collaboration_name>, <request_id>, <reason> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`request_id`
:   ID of the request.

`reason`
:   A human-readable description of why the request was rejected. The argument is required, but you can submit an empty string.

#### Returns

A string success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.REJECT_UPDATE_REQUEST(
  $collaboration_name,
  'request_1324f934457',
  'Needs more cowbell'
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE UPDATE REQUEST', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Collaboration management procedures

### INITIALIZE

Schema:
:   COLLABORATION

The owner calls this to create a collaboration and, optionally, join the collaboration. If `auto_join_warehouse` is FALSE, you must call JOIN separately to make the collaboration available to other collaborators. You must use the same role to call INITIALIZE and then JOIN.

Submitting a collaboration definition with the same `name` value as an existing collaboration throws an error.

It takes some time to create and join a collaboration, so you must call GET_STATUS to learn when the collaboration has been joined.

#### Syntax

```sqlsyntax
INITIALIZE( <collaboration_spec> [, '<auto_join_warehouse>'] )
```

#### Arguments

`collaboration_spec`
:   [Collaboration definition](spec-collaboration.md) in YAML format, as a string.

`auto_join_warehouse` *(Optional)*
:   String that specifies a warehouse name as a valid Snowflake identifier. If specified, the collaboration will be created and joined using
    this warehouse. If not specified, the current warehouse will be used to create the collaboration, and you must call JOIN to join the
    collaboration. An XS warehouse is recommended.

#### Returns

A table with the following columns:

* `collaboration_name`: The name of the collaboration. Use this in any procedures that require you to specify a collaboration.
* `message`: Information about the initialize request.
* `auto_join_task`: If `auto_join_warehouse` was specified, indicates whether the auto-join task was created.

#### Examples

The following example creates a collaboration where Alice is the owner and can run an analysis using data provided
by Bob.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.INITIALIZE(
  $$
  api_version: 2.0.0
  spec_type: collaboration
  name: basic_collaboration
  owner: alice
  collaborator_identifier_aliases:
    alice: corp_id.account_id
    bob: corp2_id.account2_id
  analysis_runners:
    alice:
      data_providers:
        bob:
          data_offerings:
          - id: bob_data_v1
      templates:
      - id: alice_test_template_2026_01_12_V1
  $$,
  'APP_WH'
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedure:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`

  If providing the `auto-join-warehouse` parameter and using a role other than SAMOOHA_APP_ROLE, the role must also be granted the
  EXECUTE TASK account-level privilege.

See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions.

---

### TEARDOWN

Schema:
:   COLLABORATION

Called by the owner to delete a collaboration for all parties.

**You must call this procedure twice.** Call it once, then call GET_STATUS until it returns `LOCAL_DROP_PENDING`, then call this procedure again.

> **Note:**
>
> This procedure can be called only on a collaboration that you have created and joined. If you have created but not yet joined the
> collaboration, you must join it before you can tear it down.

#### Syntax

```sqlsyntax
TEARDOWN( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to delete.

#### Returns

A string success message.

#### Example

```sqlexample
-- Start the process.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);

-- Call until it returns LOCAL_DROP_PENDING.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- Final call.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions.

---

### GET_STATUS

Schema:
:   COLLABORATION

Shows information about all collaborators in a given collaboration.

When running an asynchronous operation such as creating or joining a collaboration, you must check the status to know when the last operation was complete before you can perform additional actions on that collaboration, such as running analyses. This procedure can be called by any collaborator invited to a collaboration.

Collaboration owners can see the following status pathway:

* CREATING » CREATED » INSTALLING » IN_REVIEW (or INSTALLATION_FAILED) » JOINING » JOINED (or JOIN_FAILED)

Non-owners will see the following status pathway:

* INSTALLING » IN_REVIEW (or INSTALLATION_FAILED) » JOINING » JOINED (or JOIN_FAILED)

#### Syntax

```sqlsyntax
GET_STATUS( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to see the status of. You can see a list of your collaborations by calling
    COLLABORATION.VIEW_COLLABORATIONS. You must be invited to, or have joined, a collaboration before you can call GET_STATUS on it.

#### Returns

A table that shows the details about the latest join attempt for all collaborators in the specified collaboration. The table includes the following columns:

* `updated_on`: Timestamp when the status was reported by the system.
* `collaborator_account`: Data sharing account ID of this collaborator.
* `collaborator_name`: The collaborator’s alias, as declared in the collaboration specification.
* `roles`: The actual and potential roles for this collaborator. Values include `owner`, `data_provider`, `analysis_runner`.
* `status`: Status at the updated time. The following values are supported, and show the status of the named collaborator in the specified collaboration.

  + `CREATING`: Collaboration creation has started.
  + `CREATE_FAILED`: Collaboration creation failed.
  + `CREATE_TIMED_OUT`: Collaboration creation timed out.
  + `CREATED`: Collaboration has been created and is ready to operate on.
  + `INSTALLING`: Installing the application package and preparing the collaboration details for review.
  + `IN_REVIEW`: The collaboration is in review.
  + `INSTALLATION_FAILED`: Installation failed; application package not installed, and can’t be reviewed.
  + `INVITED`: Participant has been invited.
  + `JOINING`: Join process has started.
  + `JOIN_FAILED`: Join process failed.
  + `JOINED`: Successfully joined the collaboration. You can start to use the collaboration.
  + `LEAVING`: Leave process has started.
  + `LEAVE_FAILED`: Leave process failed.
  + `LEFT`: Successfully left the collaboration.
  + `LOCAL_DROP_PENDING`: You have made a successful request to drop or leave the collaboration. Complete the process by calling TEARDOWN or LEAVE again.
  + `DROPPING`: Drop process has started.
  + `DROPPED`: Successfully dropped.
  + `DROP_FAILED`: Drop process failed.
* `details`: Additional details about the current status, if available.
* `region`: The cloud region of this collaborator.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('GET STATUS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### ENABLE_EXTERNAL_TABLE_ANALYSIS_FOR_COLLABORATION

Schema:
:   ADMIN

Enables external and Apache Iceberg™ tables to be used to run an analysis in your account. An analysis runner must call this before running any analysis that includes external or Iceberg tables. This procedure is called once per collaboration, not once per analysis.

#### Syntax

```sqlsyntax
ENABLE_EXTERNAL_TABLE_ANALYSIS_FOR_COLLABORATION( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A table with a `MESSAGE` column containing a success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.ENABLE_EXTERNAL_TABLE_ANALYSIS_FOR_COLLABORATION(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted the MANAGE FIREWALL CONFIGURATION privilege to call this procedure.

---

### VIEW_COLLABORATIONS

Schema:
:   COLLABORATION

View information about collaborations that you have created, can review, or have joined.

#### Syntax

```sqlsyntax
VIEW_COLLABORATIONS()
```

#### Arguments

*None*

#### Returns

A table that lists details of all collaborations that you can access. The table includes the following columns:

* `source_name`: The name of the collaboration, as specified by the `name` value in the collaboration specification.
* `collaboration_name`: The name of the installed collaboration. This is NULL until the collaboration is installed by calling JOIN (owners) or REVIEW (non-owners).
* `owner_account`: Data sharing ID of the account that created the collaboration.
* `updated_on`: When the collaboration was last updated.
* `collaboration_spec`: The specification for this collaboration in YAML format. This shows the latest version of the collaboration, including any resources linked or removed after the collaboration was created. However, there might be update requests that are in progress that will be linked soon, such as new or removed templates or data offerings.

#### Examples

View all collaborations:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS();
```

View the specification for a given collaboration by name:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS() ->>
SELECT "COLLABORATION_SPEC" FROM $1 WHERE "SOURCE_NAME" = $collaboration_name;
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW COLLABORATIONS', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### REVIEW

Schema:
:   COLLABORATION

Provides details about a collaboration to which you have been invited. Call COLLABORATION.VIEW_COLLABORATIONS to see which
collaborations you have been invited to and not yet joined. All collaborators except the owner must call this
procedure before calling JOIN. You cannot call this procedure on a collaboration that you have joined. You must use the same role to call REVIEW and JOIN. If your account is on a different cloud hosting region than the owner, you might need to call this procedure several times until it returns a successful response.

This procedure installs the underlying application in your account.

**Important notes:**

* Owners cannot call REVIEW on their own collaborations.
* Everyone except the owner must call REVIEW before calling JOIN.
* After you have joined a collaboration, you cannot call REVIEW again.

#### Syntax

```sqlsyntax
REVIEW( <source_name>, <owner_account> )
```

#### Arguments

`source_name`
:   Name of the collaboration you have been invited to join. You can see a list of your collaborations by calling
    COLLABORATION.VIEW_COLLABORATIONS.

`owner_account`
:   [Data Sharing Account Identifier](../admin-account-identifier.md) of the owner. This can be found in the response to COLLABORATION.VIEW_COLLABORATIONS.

#### Returns

Table of information about the collaboration, including the collaboration ID, owner, and the collaboration specification.

If your account is on a [different cloud hosting region](laf.md) than the collaboration owner’s, REVIEW might return a message saying that additional setup steps are still being performed. If you get this message, continue calling REVIEW until it returns the information table about the collaboration.

#### Example

```sqlexample
-- View the collaboration for your own usage.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.REVIEW(
  $collaboration_name,
  'org1.account1234'
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('REVIEW COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions.

---

### JOIN

Schema:
:   COLLABORATION

Asynchronous method to join a specified collaboration. Note that you can access only the resources listed in the collaboration at the time that you join. This procedure takes some time to run.

You need the REGISTER DATA OFFERING account privilege to join any collaboration in which you can activate data (that is, you are an analysis runner and the collaboration specification includes an `activation_destinations` field). See the access management API reference guide.

You cannot have an active secondary role when you run this procedure. Run the following SQL code to disable any secondary roles:

```sqlexample
USE SECONDARY ROLES NONE;
```

Everyone except the collaboration creator must call COLLABORATION.REVIEW before calling this procedure.

This procedure is asynchronous; call GET_STATUS to determine when you have successfully joined the collaboration.

Anyone who submits a resource to the collaboration or wants to run a template in the collaboration must join the collaboration first. The collaboration creator joins automatically when calling INITIALIZE (unless `auto_join_warehouse` is set to FALSE).

#### Syntax

```sqlsyntax
JOIN( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to join. You can see a list of your collaborations by calling
    COLLABORATION.VIEW_COLLABORATIONS. If you have been invited to join multiple collaborations with the same name, this defaults to
    the last one that you called COLLABORATION.REVIEW on.

#### Returns

A string success message. If you get an error about a missing reference usage grant, see the [Troubleshooting guide](v2/troubleshooting.md).

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.JOIN(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions.

---

### LEAVE

Schema:
:   COLLABORATION

Leaves a collaboration that you have joined. You cannot rejoin a collaboration after you have left it.

**You must call this procedure twice.** Call it once, then call GET_STATUS until it returns `LOCAL_DROP_PENDING`, then call this procedure again.

#### Syntax

```sqlsyntax
LEAVE( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration to leave.

#### Returns

A string success message.

#### Example

```sqlexample
-- Start the process.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LEAVE($collaboration_name);

-- Call until it returns LOCAL_DROP_PENDING.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- Final call.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LEAVE($collaboration_name);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

See GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE for additional required role permissions.

---

### GET_CONFIGURATION

Schema:
:   COLLABORATION

Returns the current configuration settings for a collaboration. You must have joined the collaboration before calling this procedure.

#### Syntax

```sqlsyntax
GET_CONFIGURATION( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A table with the following columns:

| Column | Description |
| --- | --- |
| CONFIGURATION | The name of the configuration setting. |
| VALUE | The current value of the configuration. |
| STATUS | Whether the value is `ACTIVE` or `PENDING` (a change has been requested but not yet applied). |

##### Supported configurations

| Configuration name | Description |
| --- | --- |
| TEMPLATE_AUTO_APPROVAL | Whether template update requests from other collaborators are automatically approved. Values: `true` or `false`. Default: `false`. |

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_CONFIGURATION(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### SET_CONFIGURATION

Schema:
:   COLLABORATION

Sets a configuration value for a collaboration. The change is asynchronous: call GET_CONFIGURATION to check when the new value has been applied. You must have joined the collaboration before calling this procedure.

Use this procedure to manage template auto-approval instead of the deprecated ENABLE_TEMPLATE_AUTO_APPROVAL and DISABLE_TEMPLATE_AUTO_APPROVAL procedures. Setting `TEMPLATE_AUTO_APPROVAL` to `true` enables automatic approval, and setting it to `false` disables it.

#### Syntax

```sqlsyntax
SET_CONFIGURATION( <collaboration_name>, <config_name>, <value> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`config_name`
:   Name of the configuration to set. See GET_CONFIGURATION for supported configuration names.

`value`
:   The new value for the configuration. Must be a valid value for the specified configuration name.

#### Returns

A string message confirming the request has been accepted.

#### Example

```sqlexample
-- Enable automatic approval of template requests
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.SET_CONFIGURATION(
  $collaboration_name,
  'TEMPLATE_AUTO_APPROVAL',
  'true'
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('MANAGE TEMPLATE AUTO APPROVAL', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('UPDATE', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Running analyses and activations

### RUN

Schema:
:   COLLABORATION

Runs an analysis in the data clean room. You can pass in run details either as individual parameters, or by passing in an [analysis YAML specification string](spec-analysis.md).

Read the [consumer.run_analysis](consumer.md) reference for background about running a template in a data clean room.

There are two versions of this procedure: one that takes the run arguments as a single YAML-formatted string, and one that takes the arguments as individual parameters.

#### Syntax

**YAML argument syntax:**

```sqlsyntax
RUN( <collaboration_name>, <analysis_spec> )
```

**Explicit parameters syntax:**

```sqlsyntax
RUN( <collaboration_name>, <template_id>, <template_view_names>, <local_template_view_names>, <arguments> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration in which to run this analysis.

`analysis_spec`
:   [Analysis definition](spec-analysis.md) in YAML format as a string, describing the template, tables, and template values to use in this analysis. Used with the YAML argument syntax.

`template_id`
:   ID of the template to run.

`template_view_names`
:   Array of string names of source tables to use in the analysis. Use table names returned by VIEW_DATA_OFFERINGS in the `template_view_name` column. The format for each entry is `user_alias.data_offering_id.dataset_alias`

`local_template_view_names`
:   Array of string IDs of your own tables to use in the analysis. You must link these tables first by calling LINK_LOCAL_DATA_OFFERING.

`arguments`
:   JSON object that contains named arguments used by the template, where each key is a template argument name, and the value is the value of that argument.

#### Returns

Analysis results in table format.

#### Examples

Pass by parameter example:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
  $template_name,
  ['Provider.data_offering_1_2026_01_12_v0.test_dataset'], -- Tables to pass to source_tables variable.
  [],
  {} -- Template takes no parameters.
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### VIEW_ACTIVATIONS

Schema:
:   COLLABORATION

Shows the activation status of any analysis run that either you triggered to send to a collaborator, or activations that a collaborator triggered to send to you. Activation requests to send data to yourself are not listed.

For more information about activation, see [Implementing activation](activation.md).

#### Syntax

```sqlsyntax
VIEW_ACTIVATIONS( <collaboration_name> )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

#### Returns

A table containing details for each activation. The table includes the following columns:

* `updated_on`: Time when the status was last updated.
* `segment_name`: An arbitrary string assigned by the analysis runner to identify this activation. For more information, see [Activating query results](activation.md).
* `batch_id`: Batch ID of this activation request. For more information, see [Viewing provider and consumer activation results](v1/activation.md).
* `template_id`: Template used to produce this activation data.
* `shared_by`: The collaborator that ran the analysis.
* `shared_with`: The collaborator that should receive the analysis data.
* `status`: Status of the activation. The following values are supported:

  > + `PENDING`: Activation was requested, but is waiting to be processed.
  > + `REPLICATING`: Activation data is being replicated to the destination region.
  > + `SHARED`: Activation data is ready to be processed. Call PROCESS_ACTIVATION to send the results to your account.
  > + `FAILED`: Activation processing failed. See information in the `details` column.
  > + `PROCESSED`: Activation results have been sent to the account specified in the activation request.
* `details`: Failure details, if the activation failed.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_ACTIVATIONS(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('VIEW ACTIVATIONS', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('RUN', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

---

### PROCESS_ACTIVATION

Schema:
:   COLLABORATION

If the analysis runner is sending data to another collaborator’s account, that collaborator should call PROCESS_ACTIVATION to import the activation data into their account. The collaborator should call
VIEW_ACTIVATIONS and wait until the output shows that the activation status for a given segment is `SHARED` before calling PROCESS_ACTIVATION.

For more information, see [Implementing activation](activation.md).

#### Syntax

```sqlsyntax
PROCESS_ACTIVATION( <collaboration_name> [, <segment_name> | <array_of_batch_ids> ] )
```

#### Arguments

`collaboration_name`
:   Name of the collaboration.

`segment_name` *(Optional)*
:   String name of a specific activation segment to process.

`batch_ids` *(Optional)*
:   String array of batch IDs of activations to process. This value is returned by VIEW_ACTIVATIONS. If not included, the request will process all pending
    activations in the designated collaboration for the caller.

#### Returns

The table name where the user can retrieve the results, and the segment name specified for the results. See [Implementing activation](activation.md) to learn how to read results.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.PROCESS_ACTIVATION(
  $collaboration_name
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('PROCESS ACTIVATION', 'COLLABORATION', 'collaboration name', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`

## Registry management procedures

This section contains procedures used to register objects. For more information:

* [Registries](registries.md)
* [Adding resources to a collaboration](overview.md)

### CREATE_REGISTRY

Schema:
:   REGISTRY

Creates a custom registry to organize resources such as templates and data offerings. A custom registry can store resources of a single type, designated when you create the registry.

Use custom registries to group related resources separately from the default local registry. Add resources to this registry using the optional registry name parameter.

#### Syntax

```sqlsyntax
CREATE_REGISTRY( '<registry_name>', <registry_type> )
```

#### Arguments

`registry_name`
:   Name of the registry to create. Must be a unique name across all registries in the account.

`registry_type`
:   The type of resources this registry will contain. Supported values: `TEMPLATE`, `DATA OFFERING`.

#### Returns

A string success message.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.CREATE_REGISTRY(
  'my_custom_registry',
  'TEMPLATE'
);
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling the following procedure:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE REGISTRY', 'role name')`

---

### VIEW_REGISTRIES

Schema:
:   REGISTRY

Lists all registries that you have access to, including the default local registry and any custom registries.

#### Syntax

```sqlsyntax
VIEW_REGISTRIES()
```

#### Arguments

None.

#### Returns

A table with a row for each registry that you can access.

#### Example

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTRIES();
```

#### Access requirements

If you’re not using the SAMOOHA_APP_ROLE role, you must use a role that was granted privileges by calling one of the following procedures:

* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('VIEW REGISTRIES', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('JOIN COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE COLLABORATION', 'role name')`
* `GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE('CREATE REGISTRY', 'role name')`

For a custom registry to be visible to VIEW_REGISTRIES, you must also have READ or REGISTER privileges, granted by one of the following procedure calls:

* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('READ', 'REGISTRY', 'registry name', 'role name')`
* `GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE('REGISTER', 'REGISTRY', 'registry name', 'role name')`

## Access management procedures

The SAMOOHA_APP_ROLE role grants access to all Data Clean Room Collaboration API procedures. However, if an administrator wants to grant more granular privileges to specific roles, you can create a role and grant it specific privileges with the procedures described in this section. Learn more about managing access to Collaboration API: [The Access Management Documentation](manage-access.md).

The following procedures are used to manage fine-grained access to the Snowflake Data Clean Room Collaboration API:

### GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE

Schema:
:   ADMIN

Grants a specified role the privilege to call specific procedures on a specific object.

You can call this procedure multiple times to grant multiple
permissions to the same role. Run this procedure using the role that owns the object.

#### Syntax

```sqlsyntax
GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE(
  '<privilege>',
  '<object_type>',
  '<object_name>',
  '<account_role_name>'
);
```

#### Arguments

`'privilege'`
:   What permission this role should be granted. See the table below to learn which privileges are available for which objects.

`'object_type'`
:   The type of object that this role is being granted permissions on. Supported values:

    * `COLLABORATION`
    * `REGISTRY`

`'object_name'`
:   The ID of the object, as specified in the object’s specification.

`'account_role_name'`
:   The role being granted.

The following privilege and object type combinations are supported:

**Compound privileges**

The following compound privileges grant access to multiple procedures at once:

| Privilege | Object type | Procedures enabled |
| --- | --- | --- |
| `READ` | `COLLABORATION` | VIEW_COLLABORATIONS, GET_STATUS, GET_CONFIGURATION, VIEW_CODE_SPECS, VIEW_DATA_OFFERINGS, VIEW_UPDATE_REQUESTS, VIEW_TEMPLATES |
| `RUN` | `COLLABORATION` | RUN, VIEW_ACTIVATIONS, VIEW_COLLABORATIONS |
| `UPDATE` | `COLLABORATION` | LINK_LOCAL_DATA_OFFERING, UNLINK_LOCAL_DATA_OFFERING, ADD_TEMPLATE_REQUEST, REMOVE_TEMPLATE, APPROVE_UPDATE_REQUEST, REJECT_UPDATE_REQUEST, ENABLE_TEMPLATE_AUTO_APPROVAL, DISABLE_TEMPLATE_AUTO_APPROVAL, SET_CONFIGURATION, VIEW_UPDATE_REQUESTS |
| `READ` | `REGISTRY` | View resources registered in a [custom registry](registries.md). |
| `REGISTER` | `REGISTRY` | View or register resources such as templates and data offerings in a [custom registry](registries.md). |

**Fine-grained privileges**

The following fine-grained privileges grant access to individual procedures on a specific collaboration:

| Privilege | Procedures enabled |
| --- | --- |
| `GET STATUS` | GET_STATUS |
| `VIEW DATA OFFERINGS` | VIEW_DATA_OFFERINGS |
| `VIEW TEMPLATES` | VIEW_TEMPLATES |
| `VIEW CODE SPECS` | VIEW_CODE_SPECS |
| `VIEW UPDATE REQUESTS` | VIEW_UPDATE_REQUESTS |
| `VIEW ACTIVATIONS` | VIEW_ACTIVATIONS |
| `ADD TEMPLATE REQUEST` | ADD_TEMPLATE_REQUEST |
| `REMOVE TEMPLATE` | REMOVE_TEMPLATE |
| `MANAGE UPDATE REQUEST` | APPROVE_UPDATE_REQUEST, REJECT_UPDATE_REQUEST |
| `MANAGE TEMPLATE AUTO APPROVAL` | ENABLE_TEMPLATE_AUTO_APPROVAL, DISABLE_TEMPLATE_AUTO_APPROVAL, GET_CONFIGURATION, SET_CONFIGURATION |
| `LINK LOCAL DATA OFFERINGS` | LINK_LOCAL_DATA_OFFERING |
| `UNLINK LOCAL DATA OFFERINGS` | UNLINK_LOCAL_DATA_OFFERING |
| `PROCESS ACTIVATION` | PROCESS_ACTIVATION |

#### Returns

A table with a `MESSAGE` column containing a success message.

#### Example

This example creates a role for analysts to use to run analyses in a collaboration named `my_collaboration` and assigns it to a user.

```sqlexample
USE ROLE role_that_created_this_collaboration;

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE(
  'RUN',
  'COLLABORATION',
  $collaboration_name,
  'collaborator_analyst_role'
);
GRANT ROLE collaborator_analyst_role to USER alexander_hamilton;
```

#### Access requirements

You must use the same role that created the object to call GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE on that object.

* **For collaborations,** any role with CREATE COLLABORATION or JOIN COLLABORATION can call GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE on any collaboration.
* **For registries,** only the role that created the registry can call GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE on that registry.

---

### GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE

Schema:
:   ADMIN

Grants account-level privileges to a role. This procedure enables anyone using that role to call the procedures listed for that privilege.

#### Syntax

```sqlsyntax
GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE( '<privilege>', '<account_role_name>' );
```

#### Arguments

`'privilege'`
:   The privilege to grant this role. The following string values are supported:

    * `JOIN COLLABORATION`: Grants permission to run COLLABORATION.JOIN as well as the following procedures on the joined collaboration:

      + ADMIN.GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE
      + ADMIN.GRANT_PRIVILEGE_ON_OBJECT_TO_ROLE
      + ADMIN.REVOKE_PRIVILEGE_ON_OBJECT_FROM_ROLE
      + COLLABORATION.ADD_TEMPLATE_REQUEST
      + COLLABORATION.APPROVE_UPDATE_REQUEST
      + COLLABORATION.ENABLE_TEMPLATE_AUTO_APPROVAL
      + COLLABORATION.DISABLE_TEMPLATE_AUTO_APPROVAL
      + COLLABORATION.REMOVE_TEMPLATE
      + COLLABORATION.GET_STATUS
      + COLLABORATION.LEAVE
      + COLLABORATION.LINK_DATA_OFFERING
      + COLLABORATION.LINK_LOCAL_DATA_OFFERING
      + COLLABORATION.PROCESS_ACTIVATION
      + COLLABORATION.REJECT_UPDATE_REQUEST
      + COLLABORATION.REVIEW
      + COLLABORATION.RUN
      + COLLABORATION.TEARDOWN
      + COLLABORATION.UNLINK_DATA_OFFERING
      + COLLABORATION.UNLINK_LOCAL_DATA_OFFERING
      + COLLABORATION.VIEW_ACTIVATIONS
      + COLLABORATION.VIEW_CODE_SPECS
      + COLLABORATION.VIEW_COLLABORATIONS
      + COLLABORATION.VIEW_DATA_OFFERINGS
      + COLLABORATION.VIEW_TEMPLATES
      + COLLABORATION.VIEW_UPDATE_REQUESTS
      + REGISTRY.VIEW_REGISTRIES
      + REGISTRY.VIEW_REGISTERED_CODE_SPECS
      + REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS
      + REGISTRY.VIEW_REGISTERED_TEMPLATES

      This privilege requires the following account-level privileges to be granted to the role manually:

      + APPLY ROW ACCESS POLICY ON ACCOUNT
      + CREATE APPLICATION ON ACCOUNT
      + CREATE DATABASE ON ACCOUNT
      + CREATE LISTING ON ACCOUNT
      + CREATE SHARE ON ACCOUNT
      + IMPORT SHARE ON ACCOUNT
      + MANAGE SHARE TARGET ON ACCOUNT
    * `CREATE COLLABORATION`: Grants permission to run COLLABORATION.INITIALIZE, plus all procedures allowed by `JOIN COLLABORATION`
      for the joined collaboration. Requires the following account-level privileges to be granted manually to the role:

      + APPLY ROW ACCESS POLICY
      + CREATE APPLICATION
      + CREATE DATABASE
      + CREATE LISTING
      + CREATE SHARE
      + IMPORT SHARE
      + MANAGE SHARE TARGET
      + EXECUTE TASK (if using auto-join in the INITIALIZE procedure)
    * `VIEW COLLABORATIONS`: Grants permission to run COLLABORATION.VIEW_COLLABORATIONS. Requires the following privileges to be granted manually to the role:

      + IMPORT SHARE ON ACCOUNT
    * `REGISTER DATA OFFERING`: Grants permission to run REGISTRY.REGISTER_DATA_OFFERING. This permission is required for any analysis runner to join a collaboration that implements activation.
    * `VIEW REGISTERED DATA OFFERINGS`: Grants permission to run REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS.
    * `REGISTER TEMPLATE`: Grants permission to run REGISTRY.REGISTER_TEMPLATE.
    * `VIEW REGISTERED TEMPLATES`: Grants permission to run REGISTRY.VIEW_REGISTERED_TEMPLATES.
    * `REGISTER CODE SPEC`: Grants permission to run REGISTRY.REGISTER_CODE_SPEC.
    * `VIEW REGISTERED CODE SPECS`: Grants permission to run REGISTRY.VIEW_REGISTERED_CODE_SPECS.
    * `CREATE REGISTRY`: Grants permission to run REGISTRY.CREATE_REGISTRY, REGISTRY.VIEW_REGISTRIES, and also the ability to read from custom registries that you have created.
    * `REVIEW COLLABORATION`: Grants permission to run COLLABORATION.REVIEW.
    * `VIEW REGISTRIES`: Grants permission to run REGISTRY.VIEW_REGISTRIES.
    * `VIEW DCR STATUS`: Grants permission to view the overall status of Data Clean Rooms in the account.

`'account_role_name'`
:   The name of an account-level role.

#### Returns

A table with a `MESSAGE` column containing a success message.

#### Example

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.GRANT_PRIVILEGE_ON_ACCOUNT_TO_ROLE(
  'REGISTER DATA OFFERING',
  'COLLABORATOR_ANALYST_ROLE'
);
```

#### Access requirements

You need the ACCOUNTADMIN role, or a role with the MANAGE GRANTS global privilege, to run this procedure.

---
title: Snowflake Data Clean Rooms developer’s guide
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/developer-introduction.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms developer’s guide

This topic provides guidelines for users who want to create or manage Snowflake Data Clean Rooms programmatically.

Snowflake exposes an API of stored procedures for creating and controlling clean rooms.
These stored procedures can be run in any interface that can access the Snowflake account associated with your clean room environment,
including Snowsight notebooks and worksheets, as well as the [Snowflake CLI](../../../developer-guide/snowflake-cli/index.md). These procedures can be called in
SQL or in any language supported by your Snowflake environment.

## Setting up your environment

Here are some tips for setting up your coding environment to use the clean rooms API effectively.

### Development tools

Here are the main developer tools for clean rooms:

* **Coding environment:** Any coding environment that can run stored procedures in your Snowflake account will work. Most developers use
  worksheets in Snowsight (the browser-based tool) or the [Snowflake CLI](../../../developer-guide/snowflake-cli/index.md).
* **The clean rooms UI:** Use the clean rooms UI to configure, manage, or create clean rooms. Most clean room analysts use the UI rather
  than code, so it’s important to see and test the experience of your clean rooms in the UI. Additionally, there are a handful of
  [features that are available only in the clean rooms UI](../getting-started.md).
* **Snowsight** is useful to explore databases and other objects and search for objects.
* **Clean rooms API:** API documentation is divided into [provider](../provider.md) and
  [consumer](../consumer.md) topic pages.

### Coding setup

Here is how to set up your coding environment for clean rooms:

#### Required role and warehouse

The clean rooms API requires the SAMOOHA_APP_ROLE role for full API access. Ask your clean rooms administrator to
[grant you full API access](../manage-dcr-users.md). Clean rooms also supports
[creating roles with access to a subset of API procedures](../manage-dcr-users.md).

You must use the clean rooms API in a warehouse that SAMOOHA_APP_ROLE can use. `app_wh` is one of a
[number of warehouses](installation-details.md) with access to the API. Choose the appropriate warehouse
for your needs.

We recommend that you use an XS warehouse for general clean room editing, creation, or deletion commands. Consider using larger warehouses, or Snowpark-optimized warehouses, when running large analyses, such as machine learning workloads.

```sqlexample
-- Set up environment.
USE ROLE SAMOOHA_APP_ROLE;
USE WAREHOUSE app_wh;

-- Call your clean rooms API functions.
...
```

If you use any other warehouse, be sure to grant SAMOOHA_APP_ROLE usage on that warehouse:

```sqlexample
GRANT USAGE ON WAREHOUSE <your_warehouse> TO SAMOOHA_APP_ROLE;`
```

#### About the clean rooms API

Snowflake Data Clean Rooms exposes a set of stored procedures that let a provider create, configure, and share a clean room.
These procedures can be called in any command-line environment that supports Snowflake procedures, including notebooks, worksheets, and the
Snowflake CLI. The documentation here shows SQL usage, but you can also use Python or
[any other supported Snowflake language](../../../developer-guide/stored-procedure/stored-procedures-overview.md).

Procedures exist inside the following schemas:

* `samooha_by_snowflake_local_db.provider` - [Provider-specific procedures](../provider.md). These procedures can be
  called only on clean rooms that were created in the current account.
* `samooha_by_snowflake_local_db.consumer` - [Consumer-specific procedures](../consumer.md). These procedures can be
  called only on clean rooms to which the current account was invited as a consumer.
* `samooha_by_snowflake_local_db.library` - General procedures called by either the clean room creator (provider) or a clean room
  collaborator (consumer). These procedures are documented in both the provider and consumer reference pages.

Some procedures have both provider and consumer versions. The results are appropriate to the schema: for example,
`provider.view_cleanrooms` lists all clean rooms in the current account for which you are a provider, and `consumer.view_cleanrooms` lists
all clean rooms in the current account for which you are a consumer. Be sure to call the procedure in the namespace that you need.

#### About clean room names in API procedures

Many clean room API procedures take a `cleanroom_name` argument.

* Use the clean room name if a clean room was **created using the API**. If used as part of a package name, replace spaces with underscores:

  ```sqlexample
  -- Spaces work here:
  CALL samooha_by_snowflake_local_db.provider.describe_cleanroom('my code created clean room');

  -- Underscores required here:
  SHOW VERSIONS IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_my_code_created_clean_room;
  ```
* Use the clean room ID if the clean room was **created using the clean rooms UI**.

You can see the clean room name and ID by calling `describe_cleanroom` or `view_cleanrooms`.

Clean rooms created using the API are labeled in the clean rooms UI as Supported with Developer APIs.

### Setting up accounts, users, and roles

You aren’t required to use the clean rooms UI to develop clean rooms: most clean room functionality is available by calling the API.
However, a few features are [available only in the UI](../getting-started.md), and some are
faster to perform in the UI. And because many users use the UI exclusively, it’s important to see how your clean room behaves in the UI.
Therefore, you should ask a clean room administrator to add you as a clean room manager or higher in the appropriate clean room accounts.

Depending on your use case, you might also want to set up an additional Snowflake account in different web hosting regions to test
[cross-cloud behavior](enabling-laf.md).

Name your test Snowflake accounts meaningfully to indicate their typical usage: for example, “Consumer account,”
“Provider account,” and “Cross-cloud account.” This can help when you have multiple test accounts and must choose an account on the clean
rooms login page.

#### Internal testing clean rooms

You can test a clean room during development by sharing the clean room with yourself. Such a clean room is called an *internal testing clean room*. Using a single account for both provider and consumer is convenient for quick feature testing.

To create an internal testing clean room, simply pass the provider account information to `provider.add_consumers` as the sole consumer.

Internal testing clean rooms have the following restrictions:

* **An internal testing clean room cannot later be shared with other accounts**. An internal testing clean room always is an internal
  testing clean room.
* **The following features are not supported in internal testing clean rooms:**

  + Provider activation
  + Provider-run analyses
  + Mounting or viewing request logs (`provider.mount_request_logs_for_all_consumers` or `provider.view_request_logs`)
  + Consumer-defined templates
  + Multi-provider analyses
  + Differential privacy

  If you want to test features that aren’t supported in an internal testing room, you must set up separate provider and consumer Snowflake
  accounts to test both sides of a clean room.

Download a [`sample worksheet`](../../../_downloads/980474433a279b8dd7a9409b77b0f54d/internal-testing-cleanroom.ipynb) that demonstrates using a clean room in a
single account for both provider and consumer.

### See what’s installed with the clean rooms environment

Snowflake Data Clean Rooms creates many local databases upon installation. You can find details about tasks and objects that are run or
installed with a clean room package in [Snowflake Data Clean Rooms: Installed objects](installation-details.md).

### Sample data

The clean rooms environment installs [a few sample datasets](installation-details.md) you can use.

You can also [generate synthetic test data](../../synthetic-data.md) using Snowflake.

## Guidelines and recommendations

Here are some guidelines to avoid problems when working with clean rooms:

### Confirm that you are using the same account in the clean rooms UI and in code

You often need to open a coding environment and the clean rooms UI for the same Snowflake account, for example, when creating a clean room
in code, then checking its appearance in the clean rooms UI. It’s important to confirm that you’re using the same Snowflake account in each.

Snowsight does not have a shortcut to open the clean rooms UI for the same account, or the reverse, so you must be sure to log in to the
same account in each environment.

### Clean room names vs clean room ID

When using the API, for procedures that take a clean room name argument, determine whether to use the clean room name or the clean room
ID as follows:

* If the clean room was created using the API, use the clean room **name**.
* If the clean room was created in the clean rooms UI, use the clean room **ID**. You can see both the clean room name and ID by calling
  `provider.view_cleanrooms` or `provider.describe_cleanroom`.

### Update your clean room whenever you make UI changes

When you change any clean room properties that affect the UI, call
`provider.create_or_update_cleanroom_listing` to propagate the changes.

### Interoperability between clean rooms created in code or the UI

When you create a clean room using the API, some features are not modifiable in the clean rooms UI. For example, you cannot add additional
templates, even stock Snowflake templates, in code for a UI-created clean room. You also cannot change the differential privacy settings.

## Troubleshooting

Here are some common troubleshooting tips:

### Consumer can’t set join policies or perform other basic actions on a joined clean room

Confirm that you installed your clean room with the proper role (SAMOOHA_APP_ROLE). If you didn’t use SAMOOHA_APP_ROLE when installing the clean room, you’ll encounter many problems, typically permission errors. If this is the case, even `consumer.uninstall_cleanroom` will fail and you must take extra steps to uninstall then reinstall the clean room with the correct role.

```sqlexample
-- Who owns the clean room?
SHOW SHARES LIKE 'SAMOOHA_CLEANROOM_REQUESTS_<cleanroom_name>';

-- If the owner role is not SAMOOHA_APP_ROLE, you must drop the share, then
-- uninstall the clean room.
DROP SHARE SAMOOHA_CLEANROOM_REQUESTS_<cleanroom_name>;
CALL samooha_by_snowflake_local_db.consumer.uninstall_cleanroom($cleanroom_name);
USE ROLE SAMOOHA_APP_ROLE;
CALL samooha_by_snowflake_local_db.consumer.install_cleanroom($cleanroom_name, '<provider_locator>');
```

### Can’t find a clean room that you created

If you created a clean room in one account but can’t see it in the collaborator’s account, here are some possible reasons:

* The clean room was created in a different cloud hosting region and you haven’t enabled
  [cross-cloud auto-fulfillment](enabling-laf.md).
* You didn’t publish your clean room by calling `provider.create_or_update_cleanroom_listing`.
* You are calling `consumer.view_cleanrooms()` instead of `provider.view_cleanrooms()` (or the reverse).
* You didn’t share the clean room, you shared the clean room with the wrong account, or you opened the wrong collaborator account in the
  Snowsight/Clean rooms UI/CLI. Confirm that the account where you expect to see your clean room is the one that you shared the clean room
  with, and that you’re signed in to that shared account.
* There is a small delay between publishing a clean room and when it becomes visible to the collaborator.

### Unknown function

If you call a procedure and get an error something like the following snippet:

```output
Unknown user-defined function SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.CONSUMER.<procedure name>
```

Here are a few possible causes:

You typed the wrong namespace.
:   Be sure to call the proper `consumer` or `provider` version of your procedure. Many procedures have both provider and consumer
    versions.

You mistyped the name of the function.
:   Check the reference guide for the proper naming.

You have been granted a limited-access run-role, and the function you called isn’t allowed by your role.
:   Test this by running the following SQL code:

    ```sqlexample
    USE DATABASE samooha_by_snowflake_local_db;
    CALL IS_DATABASE_ROLE_IN_SESSION('samooha_run_role');
    ```

    If the code snippet returns TRUE, you have limited-access [run-role](../manage-dcr-users.md) permissions on the clean
    room API. If you need greater access, ask a clean room administrator for full access. See the list of permitted run-role procedures in
    the [consumer.grant_run_on_cleanrooms_to_role documentation](../consumer.md).

You don’t have SAMOOHA_APP_ROLE
:   To see if you can use the SAMOOHA_APP_ROLE, run the following command:

    ```sqlexample
    -- Get current user name.
    SELECT current_user();

    -- Add current user name in place as indicated.
    SHOW GRANTS TO USER <current_user_name> ->> select * from $1 where "role" = 'SAMOOHA_APP_ROLE';
    ```

    If you don’t get any results, ask an administrator to give you API access to the clean room.

### See if a user has installed a clean room

You can check if a given user has installed a given clean room by running the following SQL code. Replace `$consumer_locator` and
`$cleanroom_name` with the consumer locator and clean room name.

```sqlexample
SELECT * FROM snowflake.data_sharing_usage.application_state
  WHERE consumer_account_locator = $consumer_locator
    AND CONTAINS(package_name, UPPER(REPLACE($cleanroom_name, ' ', '_')));
```

### Check your query or analysis history

You can see your query history for analyses run in the UI or in code. These histories are stored and checked separately.

#### UI analysis history

The clean rooms UI shows a list of all previous analyses for this account in the Analyses & Queries page. These results
are only for queries run in the UI.

If you modify or delete a clean room, the analysis reports in the UI for that clean room will be deleted unless the report uses one of the
following templates:

* Audience Overlap & Segmentation
* SQL Query
* A custom template.

Query history for the templates listed above are retained even if a clean room is modified or deleted.

#### API query history

To see the account history of all calls run using the API, including template analyses, do the following:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History.
3. Use the filters to find the query associated with the analysis, and select the query or analysis.

## Extended examples

To help you understand how to use various features of the Developer APIs, you can refer to the examples in the Use cases and
Features sections of the clean rooms documentation.

---
title: Snowflake Data Clean Rooms operational costs
source: https://docs.snowflake.com/en/user-guide/cleanrooms/cleanroom-cost.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms operational costs

If you need an introduction to how costs are incurred in Snowflake, refer to [Understanding overall cost](../cost-understanding-overall.md).

> **Important:**
>
> You incur charges for Snowflake Data Clean Room operations in accordance with your contract with Snowflake.

The operational costs associated with using Snowflake Data Clean Rooms can be categorized into costs associated with *ongoing operations*
and *user-initiated operations*.

**Ongoing operations**

Ongoing operations are required to support functionality of the data clean room application and features. These operations include processes and tasks that support automatic upgrades, auto-join during initialization and [legacy provider & consumer clean rooms ongoing operations](v1/cleanroom-cost.md).

**User-initiated operations**

User-initiated operations occur during clean room management actions or while executing workloads within a clean room. A *workload* is the
process of executing any specific use case (analytics or activation) within the clean room through a user-initiated query. The cost of
executing a workload depends on the time required for the workload to complete within the warehouse specified by the user. Here are
some examples of user-initiated clean room management operations:

* **Analysis & Activation Queries:** This encompasses run procedures used when running specific use case workloads within the clean room by users.
* **Data registration:** This encompasses stored procedures required to enable objects to be used within a clean room by users.
* **Creating and editing a clean room:** This encompasses stored procedures required for setting up a clean room environment, adding data and template code.
* **Joining and editing a clean room:** This encompasses stored procedures required for joining a clean room environment, adding data and template code.

> **Note:**
>
> Some user-initiated operations result in actions taken by the Secure Collaboration Orchestrator (SCO)
> to orchestrate the clean room across collaborators. Currently, costs associated with these actions
> are not charged back, though they may be in the future.

## View your usage cost

### Warehouse costs

To see the cost incurred by a warehouse, sign in to [Snowsight](../ui-snowsight-gs.md).
In the navigation menu, select Admin » Cost management » Consumption, and then select a warehouse.

### Task costs

To see the cost incurred by serverless tasks, run the following SQL command:

```sqlexample
SELECT * FROM snowflake.account_usage.serverless_task_history;
```

---
title: Snowflake Data Clean Rooms operational costs
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/cleanroom-cost.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms operational costs

If you need an introduction to how costs are incurred in Snowflake, refer to [Understanding overall cost](../../cost-understanding-overall.md).

> **Important:**
>
> You incur charges for Snowflake Data Clean Room operations in accordance with your contract with Snowflake.

The operational costs associated with using Snowflake Data Clean Rooms can be categorized into costs associated with *ongoing operations*
and *user-initiated operations*.

## Ongoing operations

Ongoing operations are required to support functionality of the data clean room application and features. Here are some examples of ongoing operations required for product functionality:

* **Differential Privacy:** Enabled on the provider’s account to support enforcement of differential privacy. This operation prevents
  consumers from being able to increase their daily query budget by altering or resetting their current differential privacy
  budget usage. In order to provide the highest fidelity for this enforcement, this operation needs to validate the consumer’s true
  remaining daily budget every minute. This operation is set up when a user enables differential privacy in a clean room. You can control
  this cost by [disabling the differential privacy task](../provider.md).
* **Template Scans:** Snowflake scans custom templates to highlight deviations from best practices in custom template code logic. Clean
  room providers can then take necessary actions to address these findings by updating or disabling custom templates within their clean
  rooms. This operation is enabled when you install the Snowflake Data Clean Room application.
* **Activation:** Required to support activation use cases from the clean room to any preferred destination. For clean rooms that support
  activation, Snowflake monitors the status of incoming shares or API calls to ensure successful processing and near real-time availability
  of data (subject to end destination processing time). This operation is set up when your account is enabled as an activation partner in
  the Profiles & Features section of the clean rooms UI.
* **Clean rooms UI metadata:** Snowflake maintains the most up-to-date clean room metadata to ensure that clean rooms UI users are
  operating on the most current state of the clean room. This operation is enabled when you install the Snowflake Data Clean Room
  application.
* **Automated Data Stats:** Snowflake maintains a daily refresh of your table stats and data overlap stats between your linked tables and
  your collaborator’s linked tables. This operation is enabled when you install the Snowflake Data Clean Room application.

## User-initiated operations

User-initiated operations occur during clean room management actions or while executing workloads within a clean room. A *workload* is the
process of executing any specific use case (analytics or activation) within the clean room through a user-initiated query. The cost of
executing a workload depends on the time required for the workload to complete within the warehouse specified by the user. Here are
some examples of user-initiated clean room management operations:

* **Data registration:** This encompasses stored procedures required to enable objects to be used within a clean room by users.
* **Creating and editing a clean room:** This encompasses stored procedures required for setting up a clean room environment,
  adding data and template code, and setting respective data policies.
* **Installing and editing a clean room:** This encompasses stored procedures required for installing a clean room environment,
  adding data, and setting respective data policies.
* **Identity Hub:** This encompasses any calls to identity providers used by the clean room.
* **Statistics:** Each clean room account runs a daily task to generate clean room data statistics. Credit consumption is dependent on the
  dataset size linked into the clean room. To disable this task, the provider must run
  `CALL samooha_by_snowflake_local_db.provider.manage_datastats_task_on_account(false);`, and all consumers in all clean rooms in the
  provider’s account must run `CALL samooha_by_snowflake_local_db.consumer.manage_datastats_task_on_account(false)`;

## View your usage cost

### Warehouse costs

To see the cost incurred by a warehouse, sign in to [Snowsight](../../ui-snowsight-gs.md).
In the navigation menu, select Admin » Cost management » Consumption, and then select a warehouse.

### Task costs

To see the cost incurred by serverless tasks, run the following SQL command:

```sqlexample
SELECT * FROM snowflake.account_usage.serverless_task_history;
```

---
title: Snowflake Data Clean Rooms: Activation connectors
source: https://docs.snowflake.com/en/user-guide/cleanrooms/connector-activation.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Activation connectors

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

You can use connectors to integrate your clean room environment with what your ecosystem partners provides. This topic describes how the
clean room admin can configure a connector so that clean room users can push the result of an analysis to an activation partner.

If you are a provider who wants to control which connectors show up as options when a clean room user runs an analysis, see
[Customize available connectors](admin-tasks.md).

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Google Ads connector

Google Ads is an online advertising platform where advertisers bid to display brief advertisements, service offerings, product listings, or
videos to web users.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean environment is integrated with your Google Ads account:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand Google Ads.
4. Enter your Google credentials.
5. In the Account ID field, enter the ID associated with your Google Ads account.
6. Specify your API preference. If you select My Developer Token, enter your developer token for the Google Ads API.
7. Select Save.

See the user guide tab to learn how to activate results using this connector.

Here is how to activate analysis results with Google Ads. These instructions assume that this connector has been properly installed
and configured.

To push the results of an analysis to Google Ads for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select Google Ads.
4. In the Account ID field, enter the identifier for the account where you want to push the segment.
5. In the Segment Name field, enter a descriptive name for your results.
6. In the Description field, enter a description of the data you are pushing to Google Ads.
7. In the Activation IDs section, select the columns that contain hashed email and/or hashed phone identifiers.
8. Select Push Data.

## Google Display & Video 360-PAIR connector

Google Display & Video 360-PAIR is an online advertising platform where advertisers bid to display. PAIR gives publishers and advertisers
the option to securely and privately reconcile their first-party data for audiences who have visited both an advertiser’s and a publisher’s
site.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To integrate your clean room environment with Display & Video 360-PAIR, you must configure this connector as follows:

1. Before configuring the connector,
   [link your account ID in Google Display & Video 360](https://support.google.com/displayvideo/answer/15478755) to
   CMU: Snowflake.
2. In the left navigation of the clean room UI, select Connectors.
3. Select the Activation tab.
4. Expand Google DV360 - PAIR.
5. In Account ID, enter the account ID of your Display & Video 360-PAIR account. Contact [Snowflake support](https://docs.snowflake.com/user-guide/contacting-support) for this ID.
6. In Account Type, select Advertiser or Publisher as appropriate. (The advertiser is the consumer, and the
   publisher is the provider.)
7. If you want Snowflake’s data clean room users to be able to activate results to multiple Display & Video 360-PAIR accounts, select
   + Account and enter the new account ID and account type for each additional account. Contact Snowflake support for the correct
   data partner account ID for your clean room.
8. Select Save.

> Here are the steps the provider (publisher) and consumer (advertiser) must take to activate the results of a the consumer’s analysis
> to the provider’s Display & Video 360-PAIR account.
>
> **General guidelines**

* When using Google Display & Video 360-PAIR, the publisher is the provider, and the advertiser is the consumer.
* The only template supported for this connector is Audience Overlap & Segmentation. Both provider and consumer join a
  single table on a PAIR version of a hashed email or phone.
* Both provider and consumer tables must include email or phone data hashed according to [Google’s PAIR requirements](https://support.google.com/admanager/answer/15067908).
* If 4 million or more distinct rows are linked to the clean room, we recommend using the largest warehouse size (4XL).
* Do not exceed 100 million unique rows in a dataset in a PAIR clean room for analysis and activation.
* When you add Google Display & Video 360-PAIR connector to a clean room, the room allows no other activation connectors.
  The only identity connector allowed in the clean room is Google DV360 - PAIR.
* This connector does not support provider-run analysis or activation.

  **Overview**

  Here is a brief overview of how to use this connector:

  1. The provider creates a clean room that uses the PAIR Display & Video 360 Identity activation and identity connectors, and
     links in a table that contains a hashed email or phone column.
  2. The provider uses the identity connector to generate a PAIR ID column based on the hashed email or hashed phone column in their
     table.
  3. The provider specifies the generated PAIR column as the join column.
  4. The provider specifies the Audience Overlap & Segmentation template (the only one allowed for Display & Video 360-PAIR),
     configures the template, shares, and publishes the clean room.
  5. The consumer joins the clean room, specifies tables, selects the PAIR Display & Video 360 Identity connector and generates
     a PAIR column from their hashed email or phone column.
  6. The consumer joins their PAIR column on the provider’s PAIR column, runs the analysis, and activates the results (the PAIR ID
     column) to Google.
  7. The provider downloads a mapping table that correlates each hashed email or phone values with its equivalent PAIR value. The
     provider sends this table to the Ad Server or the Sell-Side Platform (SSP) to match the PAIR values that the consumer activates to
     Display & Video 360.

  For details, read the Provider or Consumer section below.

  ProviderConsumer

  The provider takes the following steps to use Display & Video 360-PAIR in a clean room:

  1. Configure your Display & Video 360 account to link to Snowflake Data Clean Rooms. For instructions, see
     [Google’s documentation](https://support.google.com/displayvideo/answer/9649053).
  2. Install and configure the Google Display & Video 360-PAIR connector as described in the Configuration guide
     tab.
  3. Create a clean room that encrypts identifying columns with Google PAIR, then share the clean room with the consumer
     (*described below*).
  4. Provide a mapping table of corresponding original and PAIR versions of the join column in the bid request sent to your SSP
     (*described below*).

  **Create and share a clean room**

  > 1. [Sign in to the clean room UI](v1/web-app-introduction.md) and create a new clean room.
  > 2. In the Add Data step, select the tables to share with the consumer. Your tables must have email and/or phone number
  >    columns hashed according to Google’s requirements.
  > 3. In the Specify Join Policies step, set the following values:
  >
  >    1. Expand Identity Hub and select PAIR Display & Video 360.
  >    2. In the PAIR Join Columns section, select your hashed email or phone column. The connector generates a
  >       PAIR version of this column with `_PAIR` appended to the original column name:
  >
  >       1. Select Generate Preview to see the new column.
  >       2. Select Add Identity to add the new column to your dataset.
  >    3. In the Join Policies section, select the generated `_PAIR` column. Don’t join on any other columns.
  > 4. In the Configure Analysis & Query section, configure the Audience Overlap & Segmentation template.
  >
  >    + Choose the table containing the hashed email or phone.
  >    + Set any Segmentation & Attribute Columns values you want.
  >    + Under Privacy Settings, keep the Threshold Value at or above 1,000, as required by Google.
  > 5. In the Share Clean Room section, select the consumer as a collaborator and then select Finish to publish and share
  >    your clean room.
  > 6. Retrieve your PAIR ID mapping table as described below. This table was generated in your Snowflake account, and you just
  >    need to know the fully qualified name of this translation table to either download it or [bulk export it](../data-unload-overview.md). Send
  >    this table with your bid request to your SSP or your ad server.

  **Prepare and send a bid request**

  Send your exported translation table of corresponding original and encrypted hashed email or phone PAIR columns to your SSP. Your SSP uses
  this data to find the corresponding hashed value for the encrypted values sent by the consumer. Best practice is to use a
  URL-safe format such as Base64 encoding when providing these IDs in your bid request.

  The fully-scoped table name has this format:

  ```sqlsyntax
  SAMOOHA_CLEANROOM_<cleanroom ID>.SHARED_SCHEMA.PROVIDER_<source database>__<source schema>__<source table>_PAIR<digit>
  ```

  `cleanroom ID`
  :   This is the clean room ID, *not* the clean room name. You can find the clean room ID for a given clean
      room name by making the following call:

      ```sqlexample
      CALL samooha_by_snowflake_local_db.provider.view_cleanrooms();
      ```

  `source database`, `source schema`, `source table`
  :   The database, schema, and name of the source table linked into the clean room used in the template. Note the separation by
      single or double underscores as shown, not by dots.

  `PAIRdigit`
  :   A single digit, usually either 0 or 1.

  Here is an example fully-scoped table name:

  ```sqlexample
  SAMOOHA_CLEANROOM_MY_CLEANROOM.SHARED_SCHEMA.PROVIDER_SAMPLE_DATABASE__AUDIENCE_OVERLAP__CUSTOMERS_PAIR0
  ```

  Query the table as shown below, or use [bulk export](../data-unload-overview.md) to download the data as a flat
  text file to a stage or your computer.

  ```sqlexample
  SELECT * FROM SAMOOHA_CLEANROOM_MY_CLEANROOM.SHARED_SCHEMA.PROVIDER_SAMPLE_DATABASE__AUDIENCE_OVERLAP__CUSTOMERS_PAIR0;
  ```

  The consumer takes the following steps to activate PAIR overlap data to Google:

  1. Configure your Display & Video 360 account to link to Snowflake Data Clean Rooms. For instructions, see
     [Google’s documentation](https://support.google.com/displayvideo/answer/9649053).
  2. Install and configure the Google Display & Video 360-PAIR connector as described in the Configuration guide
     tab.
  3. Install the clean room so it is PAIR-enabled (*described below*).
  4. Activate the results of your analysis to your Display & Video 360 account (*described below*).

  **Join and configure the clean room**

  1. [Sign in to the clean room UI](https://cleanroom.c1.us-east-1.aws.app.snowflake.com/) and join the appropriate clean room.
  2. In the Add Data section, select the tables that you want to include in the clean room. Your tables must have email and/or
     phone number columns hashed according to Google’s requirements.
  3. In the Specify Join Policies step, set the following values:

     1. In the PAIR Join Columns section, select your hashed email or phone column. The connector generates a PAIR
        version of this column with _PAIR appended to the original column name:

        1. Select Generate Preview to see the new columns.
        2. Select Add Identity to add these new columns to your schema. If you repeat this step, it will generate additional identical columns.
     2. In the Join Policies section, match your `_PAIR` columns to the corresponding `_PAIR` columns of the provider,
        and then define any additional join policies.
  4. In the Configure Analysis & Query section, configure the Audience Overlap & Segmentation template.

     + Set the Tables values to your table.
     + Set any Segmentation & Attribute Columns values you want.
  5. Select Finish and run your analysis, as described next.

  **Run the analysis and activate results**

  1. Run the clean room in your Joined tab.
  2. Run the Audience Overlap & Segmentation analysis, filling in any
     information you need for the analysis. Join on the _PAIR column that you generated.
  3. When the query completes successfully, open the results page and select Activate.
  4. Select Google Display & Video 360 - PAIR.
  5. In the Account ID field, select a Display & Video 360 account.
  6. In the Segment Name field, enter a descriptive name for your results.
  7. In the Description field, enter a description of the data you are sending to Display & Video 360.
  8. In the Publisher Name, enter the name of the provider you are collaborating with.
  9. Select the PAIR ID columns, and the type of these identifiers.
  10. Select Push Data to activate results to Google.

## LiveRamp connector

LiveRamp is a leading connectivity platform leveraged by brands and their partners to deliver products and experiences. LiveRamp RampID
connects people, data, and devices across the digital and physical world, powering people-based marketing and allowing consumers to safely
connect with brands and products.

> **Note:**
>
> If you choose to configure the LiveRamp connector so data is uploaded using a Snowflake share, LiveRamp must set up share ingestion
> before users can activate using LiveRamp.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean environment is integrated with your LiveRamp account:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand LiveRamp.
4. Use the Select Upload Type drop-down list to do one of the following:

   * If you want to share data with LiveRamp using SFTP:

     1. Select SFTP.
     2. Enter the username and password provided by LiveRamp for the purpose of using their SFTP.
   * If you want to share data with LiveRamp using Snowflake data sharing:

     1. Select Snowflake Share.
     2. Use the Account drop-down to select the LiveRamp Snowflake account.
     3. Select Generate Share.
     4. Send your LiveRamp representative the name of your account and the generated share.
5. Select Authenticate.

See the user guide tab to learn how to activate results using this connector.

Here is how to activate analysis results with the LiveRamp connector. These instructions assume that this connector has been properly
installed and configured.

To push the results of an analysis to LiveRamp for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select LiveRamp.
4. In the Segment Name field, enter a descriptive name for your results. This name must start with a letter, and can contain
   only letters, numbers, and underscores.

   The string `_SNOWDCR` will be appended to this name.
5. In the RampID drop down, select the column from your table that contains your RampID.
6. Select Push Data.

## Meta Ads Manager connector

Meta Ads Manager is an ad platform that lets you build targeted campaigns and optimize ad spend.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean environment is integrated with your Meta Ads Manager account:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand Meta Ads Manager.
4. Enter your Meta Business Manager credentials.
5. In the Meta Ads Manager Account ID field, enter the ID of your Meta Ads Manager account.
6. Select Save.

See the user guide tab to learn how to activate results using this connector.

These instructions assume that this connector has been properly installed and configured.

To push the results of an analysis to Meta Ads Manager for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select Meta Ads Manager.
4. In the Account ID field, enter the identifier for the account where you want to push the segment.
5. In the Segment Name field, enter a descriptive name for your results.
6. In the Description field, enter a description of the data that you are pushing.
7. In the Activation IDs section, select the columns that contain identifiers, then select the type of those identifiers.
8. Select Push Data.

## The Trade Desk - CRM connector

The Trade Desk CRM integrates customer relationship management (CRM) data to activate and target audience segments within The Trade Desk’s
platform for personalized advertising campaigns.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean room environment is integrated with your account with The Trade Desk - CRM:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand The Trade Desk - CRM.
4. In the Username field, enter the username associated with your account with The Trade Desk.
5. In the Password field, enter the password your account with The Trade Desk.
6. In the Advertiser ID field, enter the advertiser ID associated with your account with The Trade Desk.
7. In the Region field, select the region of your The Trade Desk account.
8. Select Authenticate.

See the user guide tab to learn how to activate results using this connector.

These instructions assume that this connector has been properly installed and configured.

To push the results of an analysis to The Trade Desk - CRM for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select The Trade Desk - CRM.
4. In the Segment Name field, enter a descriptive name for your results.
5. In the Activation IDs section, select a column that contains identifiers, then select the type of those identifiers.
6. Select Push Data.

## The Trade Desk - UID 2.0 connector

The Trade Desk - UID 2.0 is a demand-side platform (DSP) that provides a technology platform for advertisers to plan, buy, and manage
digital advertising campaigns across various channels.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean room environment is integrated with your account with The Trade Desk - UID 2.0:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand The Trade Desk - UID 2.0.
4. In the Advertiser ID field, enter the advertiser ID associated with your account with The Trade Desk.
5. In the Secret Key field, enter the secret key associated with your account with The Trade Desk.
6. Use the Data Center drop-down list to select a The Trade Desk data center.
7. Select Authenticate.

See the user guide tab to learn how to activate results using this connector.

These instructions assume that this connector has been properly installed and configured.

To push the results of an analysis to The Trade Desk - CRM for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select The Trade Desk - UID 2.0.
4. In the Segment Name field, enter a descriptive name for your results.
5. In the Activation IDs section, select a column that contains identifiers, then select the type of those identifiers.
6. Select Push Data.

## Yahoo DSP connector

Yahoo DSP is a demand-side platform that allows advertisers to programmatically buy and optimize digital ad inventory across various
channels.

Configuration guideUser guide

You must have the MANAGE_DCR_CONNECTORS role to configure this connector.

To configure the connector so that your clean room environment is integrated with Yahoo DSP:

1. In the left navigation of the clean rooms UI, select Connectors.
2. Select the Activation tab.
3. Expand Yahoo DSP.
4. Enter the MDM ID associated with your Yahoo account.
5. Select Authenticate.

See the user guide tab to learn how to activate results using this connector.

These instructions assume that this connector has been properly installed and configured.

To push the results of an analysis to Yahoo DSP for activation:

1. Run an analysis that returns data that can be activated. For example, analyses using the Audience Overlap & Segmentation
   template can be activated.

   For more information about running an analysis, see [Run an analysis as a provider](v1/web-app-working.md) or
   [Run an analysis as a consumer](v1/web-app-working.md).
2. In the Results section, select Activate.
3. In the Activation Hub dialog, select Yahoo DSP.
4. In the Segment Name field, enter a descriptive name for your results.
5. In the Description field, enter a description of the data you are pushing to Yahoo DSP.
6. In the Activation IDs section, select columns that contains identifiers, then select the type of those identifiers.
7. Select Push Data.

---
title: Snowflake Data Clean Rooms: Administrator tasks
source: https://docs.snowflake.com/en/user-guide/cleanrooms/admin-tasks.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Administrator tasks

This topic describes the tasks for the administrator of a Snowflake Data Clean Room. For information about installing the clean room environment
in your Snowflake account, see [Installing the Snowflake Data Clean Rooms environment](installing-dcr.md).

## Updating the clean rooms environment

Snowflake Data Clean Rooms updates their binaries weekly to support new features, procedures, and UI updates. You can find release notes for
significant new releases in the [feature updates section](../../release-notes/new-features.md) of the Snowflake release notes page (search
for “clean rooms”).

### Clean rooms UI updates

The clean rooms UI environment is updated automatically by Snowflake; all users need to do to get the
updated version is sign out and sign back in to the clean rooms UI.

### Clean rooms API updates

A clean rooms administrator can either enable automatic API updates (recommended) or update the API environment manually for each new
release, as described next.

#### Automatic API updates

A clean rooms API administrator can enable clean rooms updates to be installed automatically upon release by running the following SQL commands once in their account:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
```

Clean rooms API users in that account will see the updates shortly when they are rolled out, without needing to log out.

#### Manual API updates

We recommend enabling automatic clean room updates for your account. But if you prefer to update your account’s API environment manually, you can do so by running the following SQL commands each time you want to update the environment:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.apply_patch();
```

You can find your release number by running the following SQL command:

```sqlexample
SELECT * FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.ADMIN.VERSION;
```

## Using a different warehouse

Clean rooms come with [several warehouses](v1/installation-details.md) that can access the API. Choose the
warehouse that is appropriate for your needs. You can also choose a custom warehouse size for specific actions, such as for
[provider activation](provider.md).

However, your clean room can use any warehouse you choose, if you grant USAGE and OPERATE privileges on that warehouse to the
SAMOOHA_APP_ROLE role.

For example, to add a warehouse `my_big_warehouse` that can be used to run analyses, execute the following commands from a worksheet:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE WAREHOUSE my_big_warehouse WITH WAREHOUSE_SIZE = X5LARGE;
GRANT USAGE, OPERATE ON WAREHOUSE my_big_warehouse TO ROLE SAMOOHA_APP_ROLE;
```

## Monitor clean rooms UI activity

An administrator can track what users are doing in the clean rooms UI by monitoring the query history in your Snowflake account.

To access the query history for your clean room environment, do one of the following, depending on whether you want to use SQL or
Snowsight:

SnowsightSQL

You can identify UI traffic as queries where the `user_name` is the name of the service
user that was created when the [Snowflake account was configured](v1/enable-clean-rooms-ui.md).

1. Sign in the Snowflake account associated with your clean room environment as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Monitoring » Query History.
3. Use the User filter to select the service account user associated with the clean room environment.

When using SQL, you can identify UI traffic where `query_tag_details:request_type = DCR` or where `user_name =` the name of the
UI service user.

You can further filter queries by the `user_email` query tag to see only actions performed by the specified user in the UI.

Execute queries against the [QUERY_HISTORY view](../../sql-reference/account-usage/query_history.md) in SNOWFLAKE.ACCOUNT_USAGE.

For example, to trace the clean rooms UI activity of the user `joe@example.com`, execute the following code:

```sqlexample
SELECT *,
  TRY_PARSE_JSON(query_tag) AS query_tag_details
  FROM snowflake.account_usage.query_history
  WHERE query_tag_details IS NOT NULL
    AND query_tag_details:request_type = 'DCR'
    AND query_tag_details:user_email = 'joe@example.com';
```

## Monitor provider-run analyses

A provider-run analysis refers to the process of a provider creating and sharing a clean room, then running an analysis in the clean room
after the consumer links their data. These analyses run in the consumer’s account, not the provider’s. This section describes how the
consumer can track the queries executed by the provider’s analyses in the clean room.

Snowflake Data Clean Rooms assigns a query tag to each query executed for a provider-run analysis. This query tag takes the form
`cleanroom_UUID_provider_account_locator`. A consumer can retrieve all queries associated with provider-run analyses by searching
for the query tag in the query history of their account.

To retrieve the query, first obtain the UUID for a clean room, then search for the query tag. In the following code, replace
`cleanroom_name` and `provider_account_locator` with the appropriate values.

```sqlexample
-- Retrieve clean room UUID
SELECT cleanroom_id FROM samooha_by_snowflake_local_db.public.cleanroom_record
  WHERE cleanroom_name = '<cleanroom_name>';

-- Retrieve queries with provider-run query tag
SELECT * FROM snowflake.account_usage.query_history
  WHERE query_tag = cleanroom_id || '<provider_account_locator>;
```

You can also use Snowsight to filter the query history by the appropriate query tag after using SQL to retrieve the clean room UUID.

## Customize available connectors

Connectors let you integrate your clean room environment with your ecosystem partners. As the clean room administrator for a provider, you
can customize the clean room environment to limit which connectors appear as options for the clean room user. For example, if you have a
single preferred activation partner, you can configure the clean room environment so that the partner is the only option when a consumer
activates the results of an analysis in a clean room.

> **Note:**
>
> Your customizations apply to new clean rooms only.

To control which connectors are available in a clean room, you need the MANAGE_DCR_CONNECTORS role.

1. [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. In the left navigation, select Admin » Profile & Features.
3. Optional: To customize activation connectors, follow these steps:

   1. On the Activation tile, select Edit.
   2. Select which activation options you want to display, and then select Save.
4. Optional: To customize identity and data provider connectors, follow these steps:

   1. On the Identity & Data Provider tile, select Edit.
   2. Select which identity options you want to display, and then select Save.

## Brand your clean rooms

You can configure a profile for your clean room environment so every clean room created is branded with your logo and company name. To
define the logo and name for your company, you need the MANAGE_DCR_PROFILE_AND_FEATURES role.

1. [Sign in to the clean rooms UI](v1/web-app-introduction.md).
2. In the left navigation, select Admin » Profile & Features.
3. In the Company profile section, do the following:

   1. Upload a logo for your company in JPG or PNG format. This logo will appear on every clean room that is created.
   2. Edit the Company Name to define the name that you want to appear on the clean rooms that are created in your environment.

## Enable single sign-on (SSO)

To enable single sign-on (SSO) with Snowflake Authentication, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
**An account must use Snowflake authentication to enable SSO**; if you aren’t using it yet,
[migrate to Snowflake authentication](update-to-oauth.md) before requesting SSO.

## Allow key-pair authentication

The service account user that the clean room environment uses to communicate with your Snowflake account uses
[key-pair authentication](../key-pair-auth.md) to authenticate. If your Snowflake account uses [authentication policies](../authentication-policies.md)
to control how users authenticate, then the authentication policy controlling the service account user must allow key-pair authentication.

To allow key-pair authentication, either remove all authentication policies, or add an authentication policy with `AUTHENTICATION_METHODS = ALL`
or `AUTHENTICATION_METHODS = KEYPAIR`. If your Snowflake account has an account-level authentication policy that does not allow key-pair
authentication, you need to create a new authentication policy with the appropriate parameter, then assign the policy to the service
account user that was created during the installation process.

You can check your authentication policies by running this command:

```sqlexample
SHOW AUTHENTICATION POLICIES;
```

An empty results table indicates no policies, which means that key-pair authentication is allowed.

## Manage the service user

The clean rooms UI uses a service user account as an intermediary to perform most clean room actions. You can modify the key or switch
the clean rooms service user after it has been created or added to your account using the clean rooms UI as described in this section.

You can find information about the service user under Snowflake Admin » Snowflake » Service User Management.

> **Important:**
>
> Change the clean rooms service user only using the clean rooms UI. If you modify the service user outside of the UI, clean rooms might no
> longer be able to access the service user.

### Change the service user

If you want to change the service user name or use a new service user (essentially the same thing):

1. Open Snowflake Admin » Service User Management and select  (Edit).
2. Change the name of the service user to a user that you have [created](v1/enable-clean-rooms-ui.md) and is accessible in
   the current account.
3. Select Reauthenticate to open a confirmation dialog.
4. Read the information in the dialog, then select Confirm to start using that agent.

### Change the service user key

If you want to change your service user RSA key, you should do so as described next. If you change the key outside of the clean room
environment, you will no longer be able to use most UI functionality until you change the service user key back as described next.

1. Open Snowflake Admin » Service User Management and select  (Edit).
2. Select Reauthenticate near the manual setup section to open a confirmation dialog.
3. Read the information in the dialog, then select Confirm to generate a new RSA key.

You can see information about the service agent’s public key by running the following SQL command, substituting in your service user’s name
where indicated:

```sqlexample
DESCRIBE USER <service_user_name> ->>
  SELECT *
    FROM $1
      WHERE "property" ILIKE 'RSA_PUBLIC_KEY%';
```

Clean rooms doesn’t support key rotation using [RSA_PUBLIC_KEY_2](../key-pair-auth.md), so ignore the information about RSA_PUBLIC_KEY_2.

## Enable or disable activation in the clean room UI

Activation when using the clean room UI is controlled globally by a clean room administrator. Activation in the clean room API is controlled
at the clean room level by the provider.

This section shows how to enable or disable activation when using the clean room UI. To learn how to enable activation when using the API,
read the [activation instructions](activation.md).

Provider and consumer activation are enabled by default in your clean room account when using the clean room UI. Third-party activation
must be enabled manually.

Here is how to enable or disable activation for UI users in your account:

1. [Sign in to the clean room environment in the clean rooms UI](v1/web-app-introduction.md) as a DCR administrator.
2. Select Admin » Profile & Features.
3. In the Activation section, select Edit.

   * To manage **consumer activation**: Check or clear the checkbox next to Collaborator Account.
   * To manage **provider activation**: Check or clear the checkbox next to your own account name.
   * To manage **third-party activation**: Check or clear the checkbox next to the third-party activation target you wish to enable or
     disable. Third-party activation is enabled through connectors, and is available only in the clean room UI.
     [See the list of available third party connectors](connector-activation.md).

[Learn how to implement activation in a clean room.](activation.md)

## Configure network policies

If your Snowflake account uses a [network policy](../network-policies.md) to control network traffic, you must explicitly allow
traffic from the IP addresses that the clean rooms UI uses to communicate with your Snowflake account.

Find the IP addresses used for your region in the **IP network addresses used by clean rooms UI** column in the [IP address table](v1/web-app-introduction.md).

## See details about the service account for this environment

The clean rooms UI uses a service account to communicate with Snowflake. This service account was created by the account administrator
when they installed the Clean Room environment for this account.

You cannot modify details about the service account user.

To see details about the service account for this Clean Room environment you need the MANAGE_DCR_PROFILE_AND_FEATURES role.

1. Navigate to the [Snowflake Data Clean Rooms login page](https://cleanroom.c1.us-east-1.aws.app.snowflake.com).
2. Navigate to Admin > Snowflake Admin.
3. On the Snowflake Admin page you can see information such as the service user name and service user email.

---
title: Snowflake Data Clean Rooms: Clean room connectors
source: https://docs.snowflake.com/en/user-guide/cleanrooms/connector-clean-room.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Clean room connectors

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

You can use connectors to integrate your clean room environment with clean rooms hosted on other cloud providers. This topic describes how
the clean room admin can configure a connector so Snowflake Data Clean Rooms can interact with third-party clean rooms.

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Amazon Marketing Cloud

Amazon Marketing Cloud (AMC) is a cloud-based clean room solution in which advertisers can perform analytics across pseudonymized signals,
including Amazon ads signals and their own inputs.

To configure the connector so your clean environment is integrated with the AMC:

1. Sign in to Amazon Ads Account.
2. Select Account & Advertiser Instances.
3. Select Save.

## Ads Data Hub

Ads Data Hub helps advertisers, agencies, and measurement partners do customized analysis of campaigns while protecting user privacy.

To configure the connector so your clean environment is integrated with Ads Data Hub:

1. Upload Service Account Credentials.
2. Enter the Developer / API Key.
3. Select Authenticate.
4. Select the Parent Account.
5. Enter the Default Destination Dataset.
6. Select Child Account.
7. Enter the Clean Room Name.
8. Select Save.

---
title: Snowflake Data Clean Rooms: Consumer API reference guide
source: https://docs.snowflake.com/en/user-guide/cleanrooms/consumer.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Consumer API reference guide

This page describes procedures used by clean rooms API consumers to manage their clean rooms. For coding setup instructions, see [Coding setup](v1/developer-introduction.md).

## Manage role access

### grant_run_on_cleanrooms_to_role

Schema:
:   CONSUMER

**Description:** Grants the specified role permission to run a subset of procedures on the specified clean rooms. Clean rooms must be
*installed* in this account, not *created* by this account. (That is, only clean rooms for which you are a consumer.)

To grant limited use to your clean rooms, grant users the specified role rather than SAMOOHA_APP_ROLE.
For more information about role access, see [Grant limited API access (run roles)](manage-dcr-users.md).

The following procedures can be run using a role specified here:

* consumer.view_added_templates
* consumer.view_added_template_chains
* consumer.get_arguments_from_template
* consumer.view_column_policy
* consumer.view_consumer_datasets
* consumer.view_join_policy
* consumer.view_provider_column_policy
* consumer.view_provider_datasets
* consumer.view_provider_join_policy
* consumer.view_remaining_privacy_budget
* consumer.run_analysis
* consumer.view_provider_activation_policy
* consumer.view_activation_policy
* consumer.run_activation

**Arguments:**

* [cleanroom_names](v1/developer-introduction.md) *(Array of strings)* - Names of all the clean rooms on which to grant limited access to the specified role.
* `run_role_name` - (String) Name of a role that has limited permissions on the specified clean rooms. You must create the role
  before calling this procedure.

**Returns:** *(String)* - Success message.

**Example:**

```sqlexample
CREATE ROLE MARKETING_ANALYST_ROLE;
CALL samooha_by_snowflake_local_db.consumer.grant_run_on_cleanrooms_to_role(
  ['overlap_cleanroom', 'market_share_cleanroom'],
  'MARKETING_ANALYST_ROLE'
);
```

### revoke_run_on_cleanrooms_from_role

Schema:
:   CONSUMER

**Description:** Revokes permissions from the specified roles on the specified clean rooms. If the user has access to a
non-revoked role, or has the SAMOOHA_APP_ROLE, they can still run clean room procedures in the specified clean rooms.

**Arguments:**

* [cleanroom_names](v1/developer-introduction.md) *(Array of strings)* - Names of one or more clean rooms in this account.
* `run_role_name` - (String) Name of role that should no longer have limited permissions on the specified clean rooms in this
  account.

**Returns:** *(String)* - Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.revoke_run_on_cleanrooms_from_role(
  ['overlap_cleanroom', 'market_share_cleanroom'],
  'TEMP_USERS_ROLE'
);
```

## Install a clean room

Procedures to install or uninstall a clean room.

### install_cleanroom

Schema:
:   CONSUMER

**Description:** Installs (joins) the clean room created by the specified provider. Calling this multiple times clears out the
existing clean room each time; if you interrupt a second installation before it’s complete, the clean room becomes corrupted, and you will
need to complete this procedure to make the clean room usable.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to install.
* `provider_account_locator` - (String) Account locator of the provider who created this clean room.

**Returns:** *(String)* Success message.

**Error handling:**

If you get an error saying that “Cross-Cloud Auto-Fulfillment is not enabled for this account”, it means
that the provider is in another cloud hosting region. You must enable Cross-Cloud Auto-Fulfillment as described in
[Managing Cross-Cloud Auto-Fulfillment in Snowflake Data Clean Rooms](v1/enabling-laf.md).

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.install_cleanroom(
  $cleanroom_name,
  $provider_locator);
```

### is_enabled

Schema:
:   CONSUMER

**Description:** There can be a short delay after clean room installation before it is ready to use. You might call this procedure to
confirm whether or not the clean room is ready for use after installation.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of clean room to check the status of.

**Returns:** *(Boolean)* Whether or not the specified clean room is installed and ready to use.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.is_enabled($cleanroom_name);
```

### uninstall_cleanroom

Schema:
:   CONSUMER

**Description:** Uninstalls the clean room on the consumer account. This removes all databases associated with the clean room, including
the shared clean room database. The clean room can always be installed again by calling `consumer.install_cleanroom`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of clean room to uninstall.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.uninstall_cleanroom($cleanroom_name);
```

## Cross-cloud collaboration

Install a clean room created on another cloud region. [Learn more.](v1/enabling-laf.md)

### enable_laf_on_account

Schema:
:   LIBRARY

**Description:** Enables Cross-Cloud Auto-Fulfillment on the current account. Requires ACCOUNTADMIN role.

> **Important:**
>
> You must first enable Cross-Cloud Auto-Fulfillment for your account by calling
> [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../sql-reference/functions/system_enable_global_data_sharing_for_account.md).
>
> [Learn more about auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md)
> and [managing auto-fulfillment privileges](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.enable_laf_on_account();
```

### disable_laf_on_account

Schema:
:   LIBRARY

**Description:** Disables Cross-Cloud Auto-Fulfillment on the current account. Requires ACCOUNTADMIN role.

> **Important:**
>
> You must call [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../sql-reference/functions/system_enable_global_data_sharing_for_account.md) before calling this
> procedure.
>
> [Learn more about auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) and
> [managing auto-fulfillment privileges](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.disable_laf_on_account();
```

### is_laf_enabled_for_cleanroom

Schema:
:   CONSUMER

**Description:** Describes whether or not cross-cloud auto-fulfillment has been enabled for this clean room. Cross-cloud auto-fulfillment
[must be configured by an account administrator](v1/enabling-laf.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** Whether or not cross-cloud auto-fulfillment has been enabled for this clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.is_laf_enabled_for_cleanroom($cleanroom_name);
```

### request_laf_cleanroom

Schema:
:   CONSUMER

**Description:** Sets up prerequisites for installing a clean room created on another cloud region. Calling `consumer.install_cleanroom`
before calling this procedure fails. This procedure returns the current status each time you call. Call periodically until
the status is FULFILLED, then call `consumer.install_cleanroom`. It can take up to 10 minutes until the status is FULFILLED.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the cross-region clean room that will be installed.
* `provider_locator` - (String) Account locator of the provider that created this clean room.

**Returns:** *(String)* Status message of the request. Continue calling until status is FULFILLED.

**Example:**

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.consumer.request_laf_cleanroom(
  $cleanroom_name,$provider_locator);
```

### setup_cleanroom_request_share_for_laf

Schema:
:   CONSUMER

**Description:** Enables cross-cloud request sharing with a specified provider for a specific clean room. This is required for cross-region clean rooms to have full functionality, including request logs, consumer template requests, and provider-run analyses.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room name.
* `provider_account_name` - (String) [Data sharing account identifier](../admin-account-identifier.md) of the provider.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.setup_cleanroom_request_share_for_laf(
      $cleanroom_name, $provider_account_name);
```

### setup_activation_share_to_laf_consumer

Schema:
:   CONSUMER

**Description:** Enables provider activation between a provider and a consumer on different cloud regions.

**Arguments:**

* `provider_account` - (String) One or more comma-delimited provider [Data sharing account identifiers](../admin-account-identifier.md).

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.setup_activation_share_to_laf_consumer(
  'org1.locator1,org2.locator2'
);
```

## Provider-run analysis

For more information about provider-run analysis, see [Provider-run analyses](demo-flows/provider-run-analysis.md).

### is_provider_run_enabled

Schema:
:   LIBRARY

**Description:** Checks if this clean room allows provider-run analyses. The consumer must still grant explicit permission by
calling `consumer.enable_templates_for_provider_run` before providers can run an analysis in this clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room name.

**Returns:** *(String)* Description of whether or not the clean room supports provider-run analyses.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_provider_run_enabled($cleanroom_name)
```

### approve_template

Schema:
:   CONSUMER

**Description:** Approves a single template for provider-run analysis in a given clean room. The clean room provider typically communicates
with you beforehand to ask permission to run a specific template in a clean room. Be sure to set join and column policies on a template
before you approve it for provider-run analysis:

* A clean room **without** a consumer join policy means that the provider can join on all consumer columns.
* A clean room **without** a consumer column policy means that the provider can project all consumer columns.
* A clean room **with** a consumer column policy that **doesn’t include this approved template** means that the provider cannot project any
  consumer columns when using this template.

`consumer.approve_template` grants the provider permission to run the specified template in the specified clean room as many times as they want. Any provider calls to `provider.submit_analysis_request` are against the last approved version of the template; if the provider later modifies the template, the last approved version will be run when `provider.submit_analysis_request` is called.

If you want to approve multiple templates at once, you can call `provider.enable_templates_for_provider_run`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room with the template to approve.
* `template_name` - (String) Name of the template that the provider can run, in the specified clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.approve_template(
  $cleanroom_name,
  $template_name);
```

### enable_templates_for_provider_run

Schema:
:   CONSUMER

**Description:** Grants permission to the provider to run one or more specified templates in the requested clean room. The provider must
enable provider-run analysis in a clean room before the consumer can call this procedure. This is a multi-template version of
`consumer.approve_template`, and has all the same requirements and restrictions.

`consumer.enable_templates_for_provider_run` grants the provider permission to run the specified templates in the specified clean room as
many times as they want. Any provider calls to `provider.submit_analysis_request` are against the last approved version of the template;
if the provider later modifies the template, the last approved version will be run when `provider.submit_analysis_request` is called.

Providers run enabled templates in the consumer’s account, with the usage billed to the consumer. If you want to limit the warehouse type or
sizes allowed to a provider when running a given template, call `set_provider_run_configuration`,

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room where the provider can run analyses.
* `template_names` - (Array of strings) An array of names of one or more templates in the clean room that the provider can run.
* `enable_differential_privacy` - (Boolean) If TRUE, enable differential privacy for all templates listed in `template_names`.
  Differential privacy can be enabled for these templates only if differential privacy is enabled for the clean room itself. You can check
  differential privacy status for a clean room by calling `consumer.is_dp_enabled`. You can customize the privacy settings by calling
  `consumer.set_privacy_settings`. [Learn more.](differential-privacy.md)
* `template_configuration` - (Object, optional) An optional object to specify additional settings for each template in `template_names`.
  This object contains key-value pairs, where the key is the template name (from `template_names`) and the value is an object that sets
  limitations on how the provider can use this template. **If you do not provide a template configuration**, ‘ALL’ is the default for all
  properties for all templates in `template_names`. **If you do provide a template configuration**, you must provide a configuration for
  every template listed in `template_names`, and define all properties for that template’s configuration. You can also set the permissible
  values for a template by calling `consumer.set_provider_run_configuration`.

  The following properties are supported:

  + `warehouse_type` (*String*) - A permitted warehouse type that the provider can use with this template. Allowed values:

    - ALL - Allow any warehouse type.
    - STANDARD - Allow only a standard warehouse.
    - SNOWPARK-OPTIMIZED - Allow only a Snowpark-optimized warehouse.
  + `warehouse_size` (*Array of strings*) - One or more permitted warehouse sizes that can be used with this warehouse type and template.
    Allowed values are those defined for [WAREHOUSE_SIZE](../../sql-reference/sql/create-warehouse.md) or their synonyms (for example, either
    XLARGE or X-LARGE). Specify ‘ALL’ to allow any warehouse size.

**Returns:** *(String)* Success message.

**Examples:**

```sqlexample
-- Simple example
CALL samooha_by_snowflake_local_db.consumer.enable_templates_for_provider_run(
  $cleanroom_name,
  ['prod_overlap_analysis'],
  FALSE);

-- Specify what types of warehouse the provider can use to run these templates.
CALL samooha_by_snowflake_local_db.CONSUMER.enable_templates_for_provider_run(
  $cleanroom_name,
  ['template1', 'template2', 'template3'],
  TRUE,
  {
    'template1': {'warehouse_type': 'ALL', 'warehouse_size': ['MEDIUM', 'LARGE']},
    'template2': {'warehouse_type': 'SNOWPARK-OPTIMIZED', 'warehouse_size': ['MEDIUM', 'XLARGE']},
    'template3': {'warehouse_type': 'STANDARD', 'warehouse_size': ['MEDIUM', 'XLARGE']}
  });
```

### set_provider_run_configuration

Schema:
:   CONSUMER

**Description:** Applies settings to a template that control how a provider can run a specified template in the clean room. If the consumer
does not provide a configuration for a template, then default values are applied. A provider cannot run a template until a consumer approves
the template for provider-run analyses by calling `consumer.approve_template`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room. If the template is not present in this
  clean room, the procedure throws an error. The template doesn’t need to be approved for provider-run analysis yet, but the provider won’t
  be able to run the template until the consumer approves it.
* `template_configuration` - (Object) An object that provides limits on how a provider can run a specific template in this clean room.
  Provider-run analyses are run in the consumer’s account, and billed to the consumer, so the consumer can set limitations on what
  warehouses can be used for a given template.The configuration object has this form:

  ```sqlsyntax
  {
    <template_name>: {
      'warehouse_type': '<warehouse_type>',
      'warehouse_size': '<warehouse_size>'
    }
  }
  ```

  You must provide all of the following values:

  + `template_name` - The object key is the template name. The configuration is applied to this template. This template must be
    present in the clean room.
  + `warehouse_type` (*String*) - Which warehouse type the provider can use to run this template. Allowed values:

    - ALL - (*Default*) Allow any warehouse type.
    - STANDARD - Allow only a standard warehouse.
    - SNOWPARK-OPTIMIZED - Allow only a Snowpark-optimized warehouse.
      XLARGE or X-LARGE) is supported.
    - ALL - (*Default*) Any warehouse size allowed.
    - Any size defined for [WAREHOUSE_SIZE](../../sql-reference/sql/create-warehouse.md), or their synonyms (for example, either
      XLARGE or X-LARGE) is supported.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.set_provider_run_configuration(
  $cleanroom_name,
  {
    'some_template': {
      'warehouse_type': 'STANDARD',
      'warehouse_size': ['MEDIUM', 'LARGE']
    }
  }
);
```

## Register and unregister data

Use the following procedures to register and unregister databases, schemas, and objects. Tables and views must be registered before they can
be linked into the clean room. If you register a database or schema, all of the objects in that database or schema are registered.
For more information about registering data, see [Registering data](register-data.md).

### register_db

Schema:
:   CONSUMER

**Description:** Register a database in an account to be able to link any objects from that database into a clean room in that account.
For more fine-grained control you can call `register_schema`, `register_managed_access_schema`, or `register_object` instead. Objects added
to the database after it has been registered might not be linkable, in which case you should re-register the database (or register the
object itself).

You must have MANAGE GRANTS privileges on the database to run this procedure.

**Arguments:**

* `db_name` - (String) Name of database to register in this account.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.consumer.register_db('SAMOOHA_SAMPLE_DATABASE');
```

### register_schema

Schema:
:   LIBRARY

**Description:** Register a schema in an account to be able to link any objects from that schema into a clean room in that account.
For more fine-grained control you can call `register_object` instead. Objects added to the schema after it has been registered might not be linkable, in which case you should re-register the schema (or register the object itself).

If you want to register a managed access schema (that is, a schema created with the WITH MANAGED ACCESS parameter), use `library.register_managed_access_schema` instead.

**Arguments:**

* `schema_names` - (Array of strings) Array of fully qualified schemas to register.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.register_schema(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO']
);
```

### register_managed_access_schema

Schema:
:   LIBRARY

**Description:** Register a managed access schema in an account to be able to link any objects from that schema into a clean room in that
account. For more fine-grained control you can call `register_object` instead. Objects added to the schema after it has been registered
might not be linkable, in which case you should re-register the schema (or register the object itself).

**Arguments:**

* `schema_names` - (Array of strings) Array of fully qualified managed schemas to register.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.register_managed_access_schema(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO']
);
```

### register_objects

Schema:
:   LIBRARY

**Description:** Grants the clean room access to tables and views of all types, making them available to be linked into the clean room by
calling `consumer.link_datasets`. You can register broader groups of objects by calling `library.register_schema`,
`library.register_managed_access_schema`, or `consumer.register_db`. You must have MANAGE GRANTS privileges on the database to run this procedure.

**Arguments:**

* `object_names` - (Array) Array of fully qualified object names. These objects can then be linked into the clean room.

**Returns:** *(String)* Success message.

**Examples**

To register a table and a view:

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.register_objects(
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'SAMOOHA_SAMPLE_DATABASE.INFORMATION_SCHEMA.FIELDS'
  ]
);
```

### enable_external_tables_on_account

Schema:
:   LIBRARY

**Description:** Enable Iceberg or external tables to be used in all clean rooms in this account. Must be called by an ACCOUNTADMIN in
both the provider and consumer accounts to allow Iceberg or external tables to be linked by either account. To
limit this ability to specific clean rooms in this account, call `enable_external_tables_for_cleanroom` instead.

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.enable_external_tables_on_account();
```

### enable_external_tables_for_cleanroom

Schema:
:   CONSUMER

**Description:** Enable Iceberg or external tables to be linked into in the specified clean room in this account by the consumer. To allow
Iceberg and external tables for all clean rooms in this account, call `enable_external_tables_on_account` instead.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room into which the provider can link Iceberg tables or external tables.

**Returns:** *(String)* Success message. If successful, it triggers a security scan and also provides the number of the patch
that is generated if the security scan succeeds.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.enable_external_tables_for_cleanroom(
  $cleanroom_name);
```

### unregister_db

Schema:
:   LIBRARY

**Description:** Removes the database-level grants given to the SAMOOHA_APP_ROLE role and Snowflake Data Clean Room native application. Any
data in this database that is linked into a clean room will no longer be accessible in this account. You must have MANAGE GRANTS privileges
on the database to run this procedure.

**Arguments:**

* `db_name` - (String) Name of the database to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.unregister_db('SAMOOHA_SAMPLE_DATABASE');
```

### unregister_schema

Schema:
:   LIBRARY

**Description:** Unregisters one or more schemas, which prevents users from linking their tables and views into the clean room.

If you want to unregister a managed access schema (that is, a schema created with the WITH MANAGED ACCESS parameter), use `library.unregister_managed_access_schema` instead. You must have MANAGE GRANTS privileges on the database to run this procedure.

**Arguments:**

* `schema_names` - (Array of strings) Fully qualified names of schemas to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.unregister_schema(
  ['SAMOOHA_SAMPLE_DATABASE.PUBLIC', 'MY_DB.MY_SCH']
);
```

### unregister_managed_access_schema

Schema:
:   LIBRARY

**Description:** Unregisters one or more managed access schemas, which prevents users from linking their tables and views into the clean
room.

**Arguments:**

* `schema_names` - (Array of strings) Fully qualified names of schemas to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.unregister_managed_access_schema(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO']
);
```

### unregister_objects

Schema:
:   LIBRARY

**Description:** Revokes clean room access to tables and views of all types. Objects are no longer available to any users in any clean
rooms managed by this account.

**Arguments:**

* `object_names` - (Array) Array of fully qualified object names to revoke access to.

**Returns:** *(String)* Success message.

**Examples**

To unregister a table and a view:

```sqlexample
USE ROLE <ROLE_WITH_MANAGE_GRANTS>;
CALL samooha_by_snowflake_local_db.library.unregister_objects(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS','MY_DB.MY_SCH.MY_VIEW']
);
```

## Link and unlink datasets

After a dataset is registered, you can link tables or views from that dataset into a specific clean room. You can
also unlink a table or view from a specific clean room to remove access to that data from the clean room.

### link_datasets

Schema:
:   CONSUMER

**Description:** Link a table or view into the clean room, giving templates within that clean room access to the table,
according to any join and column policies that you specify.

If the dataset includes a Snowflake policy that is stored in a different database, you (or a clean rooms administrator)
must [grant your clean room access to that policy database](register-data.md) to be able to link the data
into a clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to link data into.
* `full_tables` - (Array of strings) List of fully qualified table or view names to expose to the clean room. These objects must first be
  registered (made available to the clean room environment) with the appropriate registration method.

> **Note:**
>
> If a table linked into a clean room is deleted, renamed, moved, or has restrictive permissions added, the table will no longer be usable in the clean
> room unless you restore the old table with the same location, name, and permissions.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.link_datasets(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS', 'MY_DB.MY_SCH.EXPOSURES']
);
```

### unlink_datasets

Schema:
:   CONSUMER

**Description:** Removes access to the specified tables or views in the specified clean room for all users. This works only for
data that you have linked into the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room for which access should be removed.
* `tables_list` - (Array of strings) List of fully qualified table or view names for which access should be blocked.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.unlink_datasets(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS', 'MYDB.MYSCH.EXPOSURES']);
```

### view_consumer_datasets

Schema:
:   CONSUMER

**Description:** View all tables and views linked into the specified clean room by any consumer.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** Table of objects linked into the specified clean room, along with the clean room’s internal view name for each object.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_consumer_datasets($cleanroom_name);
```

## Manage and view policies

[Manage policies](v1/policies.md) on your data in a clean room that you have installed.

### set_join_policy

Schema:
:   CONSUMER

**Description:** Specifies which columns other users can join on when they run a template in the specified clean room.

Calling this function completely replaces the old policy with the new one.

Queries with wildcards might circumvent a join policy, so use discretion when you design your analysis template.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the join policy is applied.
* `table_col_names` - (String array) Fully qualified names of columns that can be joined, in the format `database name.schema name:column name`

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.set_join_policy(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL', 'MYDB.MYSCH.EXPOSURES:HASHED_EMAIL']
);
```

### view_join_policy

Schema:
:   CONSUMER

**Description:** Shows the column policy for your data in this clean room.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)*

**Returns:** *The join policy (table)*

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_join_policy($cleanroom_name);
```

### view_provider_join_policy

Schema:
:   CONSUMER

**Description:** Shows which provider columns the consumer can join on in the specified clean room.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)*

**Returns:** *(Table)* The join policy.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_provider_join_policy($cleanroom_name);
```

### set_column_policy

Schema:
:   CONSUMER

**Description:** Specifies which columns of your data can be projected in templates run by other collaborators.

Calling this function completely replaces the old policy with the new one.

Don’t set a column policy on identity columns or sensitive columns like email because you generally don’t want this sort of data to be projected.

Queries with wildcards might not be caught by using these checks, so use discretion when you design the analysis template.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the column policy is applied.
* `analysis_table_cols` - (String array) Fully qualified names of columns that can be projected, in the format `database name.schema name:column name`

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.set_column_policy(
  $cleanroom_name,
  ['prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE'
  ]
);
```

### view_column_policy

Schema:
:   CONSUMER

**Description:** Shows your column policy in the specified clean room. To see the provider’s column policy, call
`consumer.view_provider_column_policy`.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to describe.

**Returns:** *(Table)* Information about all consumer column policies in the clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_column_policy($cleanroom_name);
```

### view_provider_column_policy

Schema:
:   CONSUMER

**Description:** Shows the provider’s column policy.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)*

**Returns:** *The column policy (table)*

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_provider_column_policy($cleanroom_name);
```

## Templates

The following procedures allow users to work with templates in the clean room.

### view_template_definition

Schema:
:   CONSUMER

**Description:** View the raw JinjaSQL of the specified template. If a template [was obscured](provider.md) by applying the `is_obfuscated` argument, you can’t see the template source code.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room that holds the template.
* `template_name` - (String) Name of the template to view.

**Returns:** *(String)* The template definition.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_template_definition(
  $cleanroom_name,
  'prod_overlap_analysis');
```

### get_arguments_from_template

Schema:
:   CONSUMER

**Description:** Get a list of arguments used by the template. You can pass values for these argument into the template when you call
`consumer.run_analysis`.

**Arguments:**

> * [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room that has the template.
> * `template_name` - (String) Name of the template to return arguments for.

**Returns:** *(Table)* Argument list and specification.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.get_arguments_from_template(
  $cleanroom_name,
  'prod_overlap_analysis');
```

## Template chains

The following procedures allow users to work with [template chains](developer-template-chains.md) in the clean room.

### view_added_template_chains

Schema:
:   CONSUMER

**Description:** List all template chains defined in a given clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to list template chains for.

**Returns:** *(Table)* Information about any template chains in the specified clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_added_template_chains(
  $cleanroom_name);
```

### view_template_chain_definition

Schema:
:   CONSUMER

**Description:** Returns the attributes of a specified template chain.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room with the template chain to describe.
* `template_chain_name` - (String) Name of the template chain to describe.

**Returns:** *(String)* The definition of the specified template chain.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_template_chain_definition(
  $cleanroom_name,
  'insights_chain');
```

## Consumer-run analyses

The following procedure runs an analysis or activation based on the specified template.

### run_analysis

Schema:
:   CONSUMER

**Description:** Runs an analysis by using a template or template chain and returns the results table.

> **Important:**
>
> * If [differential privacy](differential-privacy.md) is enabled, the query can fail if you have reached your
>   budget limit for this template.
> * If a template [was obscured](provider.md) by applying the `is_obfuscated` argument, you must use
>   Snowflake Enterprise Edition or higher to be able to run the template.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room in which to run the analysis.
* `template_name` - (String) Name of the template or template chain to run in the clean room. This template must have been added to the
  clean room by the provider or consumer.
* `consumer_tables` - (Array of strings) Array of fully qualified consumer table names. These are assigned to the `my_table` template
  variable.
  These tables must already be linked into the clean room. See available tables by calling `consumer.view_consumer_datasets`.
* `provider_tables` - (Array of strings) Array of fully qualified provider table names. These are assigned to the `source_table`
  template variable. These tables must have been linked into the clean room. See available tables by calling
  `consumer.view_provider_datasets`.
* `analysis_arguments` - (Object) An object with key-value pairs passed to the template. The template can access
  the variable by key name. If you pass in `{'age': 20}`, the template accesses the value as `{{age}}`. Pass in an empty object if no
  values are required. To see which values are required, examine the template in question by calling `consumer.view_template_definition`.
  Examine the template to determine whether you need to fully qualify any column names used. If the table is aliased as `p` or `c` in
  the template, use *lowercase* `p` and `c` table aliases for column names.

  This object has one optional reserved value:

  + `epsilon` *(Float, optional)* - Specifies the
    [epsilon value for differential privacy](https://www.google.com/search?q=differential+privacy+epsilon&oq=differential+privacy+epsilon),
    if differential privacy is enabled for this clean room. Default is 0.1.
* `use_cache` - (Boolean, optional) Whether or not to use cached results for the same query. Default is FALSE.

**Returns:** *(Table)* Query results.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.run_analysis(
  $cleanroom_name,
  'prod_overlap_analysis',
  ['DB1.MYDATA.CONVERSIONS'],  -- Consumer tables
  ['MYDB.MYSCH.EXPOSURES'],    -- Provider tables
  object_construct(
    'max_age', 30
  )
);
```

## Activation

The following procedures manage [activation](v1/activation.md), or the saving of results to a consumer’s or
provider’s Snowflake account. You can’t activate data to third-party accounts by using the API.

### view_activation_policy

Schema:
:   CONSUMER

**Description:** Shows the consumer’s activation policy in the specified clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room to report on.

**Returns:** *(Table)* The provider’s activation policy in the specified clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_activation_policy($cleanroom_name);
```

### view_external_activation_history

Schema:
:   LIBRARY

**Description:** View the history of activation requests in the current account.

**Arguments:** *None*

**Returns:** A table with the details and status of activation requests.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.view_external_activation_history();
```

### set_activation_policy

Schema:
:   CONSUMER

**Description:** Indicates which columns should be allowed to be activated.

Your activation policies are enforced only on queries by other users; your activation policies are not enforced in your own queries.

Calling this function completely replaces the old policy with the new one.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the cleanroom in which to set the activation policy.
* `columns` - (Array) Name of columns of your own data that can be activated, in the format `template name:database name.schema name.table name:column_name`.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.set_activation_policy(
  $cleanroom_name,
  [
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE_NAME.DEMO.CUSTOMERS:HASHED_EMAIL',
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE_NAME.DEMO.CUSTOMERS:REGION_CODE' ]);
```

### approve_provider_activation_consent

Schema:
:   CONSUMER

**Description:** Approves a provider’s request to allow provider activation, which is the ability to push results to the provider’s
Snowflake account.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the provider is requesting to run a template.
* `activation_template_name` - (String) Name of the activation template that the provider wants to run.

**Returns:** *(String)* Success message. This procedure fails if the provider has not called `provider.request_provider_activation_consent`
in this clean room with the specified template.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.approve_provider_activation_consent(
  $cleanroom_name,
  'activation_my_template');
```

### run_activation

Schema:
:   CONSUMER

**Description:** Runs a template that pushes results back to the consumer’s or provider’s Snowflake account. The
`consumer_direct_activation` argument determines whether this is a consumer or provider activation.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room in which to run the activation.
* `segment_name` - (String) Arbitrary string used to label rows that are generated by this activation run. Each activation run adds new rows to an
  existing results table. Provide a unique string in this field each time you call this procedure to be able to filter results to a
  specific run.
* `template_name` - (String) Name of the activation template to call.
* `consumer_tables` - (Array of strings) Array of fully qualified consumer table names to pass to the template.
* `provider_tables` - (Array of strings) Array of fully qualified provider table names to pass to the template.
* `activation_arguments` - (Object) Key-value set of arguments to pass to the template.
* `consumer_direct_activation` - (Boolean, optional) TRUE to push results back to the consumer account, FALSE to send results to the
  provider. Default is FALSE.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
-- Run a consumer activation, as specified by the final TRUE argument.
SET segment_name = 'my_activation_segment';
CALL samooha_by_snowflake_local_db.consumer.run_activation(
  $cleanroom_name,
  $segment_name,
  $template_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
  object_construct(
    'c_join_col', 'c.hashed_email',
    'p_join_col', 'p.hashed_email'
  ),
  TRUE);
```

### dcr_health.provider_run_provider_activation_history

**Description:** Returns a history of provider activation requests for the specified clean room. Provider activation requests initiated by
both the provider and consumer are shown. This procedure provides extra information to help debug problems with provider activation.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room in which the activation was requested. You
  must be a provider or consumer in this clean room.

**Returns:** *(Table)* - A list of activation requests with information about each, including the template and segment name, the status,
the consumer’s account locator, and any error message returned by the request.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.dcr_health.provider_run_provider_activation_history(
  $cleanroom_name);
```

## Consumer-defined templates

The following APIs allow you to add consumer-defined templates to a clean room. For more information, see [consumer-written templates](demo-flows/custom-templates.md).

### create_template_request

Schema:
:   CONSUMER

**Description**: Sends a request to the provider of a clean room, asking them to approve a custom template so it can be added to the clean
room. See [Consumer-written custom templates](demo-flows/custom-templates.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the template is added.
* `template_name` - (String) Name of the template to add. Must be all lowercase letters, numbers, spaces, or underscores. Activation
  template names must start with “activation”.
* `template_definition` - (String) The JinjaSQL template. [Learn the template syntax.](custom-templates.md)

**Returns:** *(String)* Success message.

**Example:**

```sqlexample-jinja
CALL samooha_by_snowflake_local_db.consumer.create_template_request(
  $cleanroom_name,
  $template_name,
  $$
  SELECT
      identifier({{ dimensions[0] | column_policy }})
  FROM
      identifier({{ my_table[0] }}) c
    INNER JOIN
      identifier({{ source_table[0] }}) p
        ON
          c.identifier({{ consumer_id  }}) = p.identifier({{ provider_id | join_policy }})
        {% if where_clause %} where {{ where_clause | sqlsafe | join_and_column_policy }} {% endif %};
  $$);
```

### get_sql_jinja

Schema:
:   CONSUMER

**Description:** Evaluates a JinjaSQL template to a SQL statement. This procedure is used when developing custom templates, to see how the
template is rendered after processing with a given set of parameters.

This procedure can process only standard [JinjaSQL](https://github.com/sripathikrishnan/jinjasql) statements; it can’t process clean room
extensions to JinjaSQL such as `join_policy` or `column_policy`.

**Arguments:**

* `template_string` - (String) The JinjaSQL code to process. Only standard JinjaSQL is supported.
* `arguments` - (Object) An object where field names correspond to variables that are used in the template.

**Returns:** *(String)* The SQL statement generated by the submitted template with the provided variable values.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.get_sql_jinja(
$$
SELECT COUNT(*), IDENTIFIER({{ group_by_col }})
  FROM IDENTIFIER({{ my_table | sqlsafe }})
  INNER JOIN IDENTIFIER({{ source_table | sqlsafe }})
  ON IDENTIFIER({{ consumer_join_col }}) = IDENTIFIER({{ provider_join_col }})
  GROUP BY IDENTIFIER({{ group_by_col }});
$$,
object_construct(
'group_by_col', 'city',
'consumer_join_col', 'hashed_email',
'provider_join_col', 'hashed_email',
'my_table', 'mydb.mysch.t1',
'source_table', 'mydb.mysch.t2'));
```

**Response:**

```sqlexample
SELECT COUNT(*), IDENTIFIER('city')
  FROM IDENTIFIER(mydb.mysch.t1)
  INNER JOIN IDENTIFIER(mydb.mysch.t2)
  ON IDENTIFIER('hashed_email') = IDENTIFIER('hashed_email')
  GROUP BY IDENTIFIER('city');
```

### generate_python_request_template

Schema:
:   CONSUMER

**Description:** Generates a consumer clean room template that includes custom Python code. The generated template includes your Python code
and a placeholder for your JinjaSQL template. Pass your final template to `consumer.create_template_request`.

For more information about consumer-defined templates, see [Consumer-written custom templates](demo-flows/custom-templates.md).

**Arguments:**

* `function_name` - (String) The function name that is used by a template to call your function.
* `arguments` - (Array of String pairs) An array of arguments required by function `function_name`. Each element is a space-delimited pair
  that gives the argument name and its Snowflake SQL data type. For example: `['size INT', 'start_date DATE']`.
* `packages` - (Array of strings) Array of package names required for your Python code. If none, specify an empty array.
  [See the full list of supported packages.](https://repo.anaconda.com/pkgs/snowflake/) Example: `['pandas','numpy']`.
* `imports` - Not supported: Do not use
* `rettype` - (String) The Snowflake SQL return type of your function. Examples: INTEGER, VARCHAR.
* `handler` - (String) The name of the main handler function in your Python code. Typically this is `'main'`.
* `code` - (String) Your Python code implementation. If you include an import and your designated handler is defined in an import, this
  can be an empty string.

**Returns:** *(String)* Returns your Python UDF with a placeholder for your JinjaSQL template. You must escape any nested `$$` or
single-quote marks `'` correctly before passing your template string into `consumer.create_template_request`. Read
[Consumer-submitted code](demo-flows/custom-code.md).

**Example:**

Call the helper function with a trivial Python example:

```sqlexample-python
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.CONSUMER.GENERATE_PYTHON_REQUEST_TEMPLATE(
  'my_func',                         -- SQL should use this name to call your function.
  ['data VARIANT', 'index INTEGER'], -- Arguments and types for the function.
  ['pandas', 'numpy'],               -- Standard libraries used.
  [],                                -- Reserved.
  'INTEGER',                         -- SQL return type.
  'main',                            -- Standard main handler.
  $$
  import pandas as pd
  import numpy as np

  def main(data, index):
      df = pd.DataFrame(data)  # you can do something with df but this is just an example
      return np.random.randint(1, 100)
      $$
  );
```

The following example shows the generated code. Replace `<INSERT SQL TEMPLATE HERE>` with your template JinjaSQL code.

```output
BEGIN

-- First define the Python UDF
CREATE OR REPLACE FUNCTION CLEANROOM.my_func(data VARIANT, index INTEGER)
RETURNS INTEGER
LANGUAGE PYTHON
RUNTIME_VERSION = 3.10
PACKAGES = ('pandas', 'numpy')

HANDLER = 'main'
AS $$
import pandas as pd
import numpy as np

def main(data, index):
    df = pd.DataFrame(data)  # you can do something with df but this is just an example
    return np.random.randint(1, 100)
    $$;

-- Then define and run the SQL query
LET SQL_TEXT varchar := $$<INSERT SQL TEMPLATE HERE>$$;

-- Run the query and return the result
LET RES resultset := (EXECUTE IMMEDIATE :SQL_TEXT);
RETURN TABLE(RES);

END;
```

### list_template_requests

Schema:
:   CONSUMER

**Description:** Shows all requests that the consumer has made to add a template to a clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The clean room to list template requests for.

**Returns:** A table with the following columns:

* `request_id` - ID of the request, generated by the clean rooms system.
* `provider_identifier` - Provider’s account locator.
* `template_name` - Template name that the consumer provided in the request.
* `template_definition` - Source code of the template that the consumer asked to add to the clean room.
* `request_status` - Status of the request: PENDING, APPROVED, or REJECTED.
* `reason` - If the request status is REJECTED, the provider should give a reason for the rejection here.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.list_template_requests($cleanroom_name);
```

## Clean room metadata getter methods

The following methods show relevant properties of the clean room:

### describe_cleanroom

Schema:
:   CONSUMER

**Description:** Provides a summary of key information about the specified clean room, including templates, datasets, and policies.
If a template [was obscured](provider.md) by applying the `is_obfuscated` argument, you must use Snowflake
Enterprise Edition or higher to be able to see the template name.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to describe.

**Returns:** *(String)* Description of the clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.describe_cleanroom($cleanroom_name);
```

### view_provider_datasets

Schema:
:   CONSUMER

**Description:** Lists all datasets that the provider added to the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** *(Table)* A table of datasets added by the provider. Use the table name returned here in your queries.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_provider_datasets($cleanroom_name);
```

### view_added_templates

Schema:
:   CONSUMER

**Description:** Lists all templates in the clean room. If a template [was obscured](provider.md) by
applying the `is_obfuscated` argument, you must use Snowflake Enterprise Edition or higher to be able to view the template.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** A list of templates in this clean room, and the source code for each (unless the template was obscured by the provider).

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_added_templates($cleanroom_name);
```

### is_consumer_run_enabled

Schema:
:   LIBRARY

**Description:** Checks whether consumer-run analysis is enabled for the specified clean room. This is enabled by default, but a clean room
provider can disable it.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** *(String)* Whether or not the clean room allows consumer-run analyses.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_consumer_run_enabled($cleanroom_name);
```

### view_cleanrooms

Schema:
:   CONSUMER

**Description:** Lists all clean rooms that are joined (installed) or that are joinable by this account. To see only installed clean rooms,
run `consumer.view_installed_cleanrooms`. To see clean rooms created by this account, call `provider.view_cleanrooms`.

**Arguments:** *None*

**Returns:** *(Table)* All installed or invited clean rooms for this account.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_cleanrooms();
```

### view_installed_cleanrooms

Schema:
:   CONSUMER

**Description:** Lists all clean rooms that are installed (joined) in this account. To see both joined and unjoined clean rooms, call
`consumer.view_cleanrooms`. To see all clean rooms created by this account, call `provider.view_cleanrooms`.

**Arguments:** *None*

**Returns:** (*Table*) The clean rooms installed in this account.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_installed_cleanrooms();
```

## Differential privacy

These procedures control [differential privacy](differential-privacy.md) in the clean room. You can also specify differential privacy at the template level when you call `consumer.enable_templates_for_provider_run`.

### is_dp_enabled

Schema:
:   CONSUMER

**Description:** Checks whether differential privacy is enabled in the clean room. The clean room must be installed to check this value.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)*

**Returns:** *(Boolean)* Whether or not the clean room has differential privacy enabled.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.is_dp_enabled($cleanroom_name);
```

### view_remaining_privacy_budget

Schema:
:   CONSUMER

**Description:** Views the privacy budget remaining that can be used to make queries from the clean room. After the budget is exhausted, further calls to `run_analysis` aren’t allowed until the budget is reset. The budget resets daily.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* Name of the clean room. The clean room must be installed for this
  procedure to succeed.

**Returns:** *(Float)* The remaining privacy budget.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.view_remaining_privacy_budget($cleanroom_name);
```

### set_privacy_settings

Schema:
:   CONSUMER

**Description:** Sets privacy settings for provider-run analyses (including activation) that use custom templates. This procedure
overwrites all previously set values. Each time you call this method it erases all previous configuration settings.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where these settings should be applied.
* `privacy_settings` - (String) A string JSON object that specifies privacy settings when custom templates are run by a provider. Here is
  the syntax of the object:

  ```sqlsyntax
  '{
    "null" : <template_config>
  }'
  ```

  `template_config` is an object with differential privacy and aggregation settings. See
  :   [Available privacy settings](differential-privacy.md) to see what fields you can provide in this object.

**Example:**

```sqlexample
-- Apply differential privacy for provider-run analysis using all custom templates.
CALL samooha_by_snowflake_local_db.consumer.set_privacy_settings(
  $cleanroom_name,
  PARSE_JSON('{
    "null":{ "differential": 1, "epsilon": 0.1, "privacy_budget": 3 }
    }')
  );
```

**Returns:** *(String)* Success message.

## Snowpark Container Services procedures

[Read more about using Snowpark Container Services in your clean rooms.](demo-flows/machine-learning.md)

### start_or_update_service

Schema:
:   CONSUMER

**Description:** Creates and starts the latest version of Snowpark Container Services that is defined by the provider in this clean room.
Any time the provider calls `provider.load_service_into_cleanroom` to create or update a container, the consumer must call
`consumer.start_or_update_service` to update the service.

The consumer must define and start the pool before calling this procedure.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the container should be loaded.
* `compute_pool_name` - (String) The name of a compute pool that is defined by the consumer in this clean room. The pool must already be created, and
  the clean room must have privileges to access to the pool.
* `service_options` - (Object, optional) An object specifying parameters for this service. The following properties are supported:

  + `query_warehouse` - (*String, optional*) Name of the warehouse to use for this service. Doesn’t need to be the same warehouse as the one
    running the clean room.
  + `min_instances` - (*Integer, optional*) Minimum number of instances to use for this service.
  + `max_instances` - (*Integer, optional*) Minimum number of instances to use for this service.

**Returns:** (*Table*) Results of the load, if successful. Throws an error if not successful.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.start_or_update_service(
  $cleanroom_name,
  'dcr_lal_pool',
  object_construct(
        'query_warehouse', 'app_wh',
        'min_instances', '1',
        'max_instances', '1'
));
```

## Environment management

Use the following methods to assist in general clean room functionality.

### set_cleanroom_ui_accessibility

Schema:
:   CONSUMER

**Description:** Shows or hides clean rooms in the clean rooms UI for consumers in the current account.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room.
* `visibility_status` - (String) One of the following case-sensitive values:

  + HIDDEN - Hides the specified clean room in the clean rooms UI from all users in the current consumer account. The clean room will
    still be accessible using API calls.
  + EDITABLE - Makes the clean room visible in the clean rooms UI.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.set_cleanroom_ui_accessibility(
  $cleanroom_name,
  'HIDDEN');
```

### manage_datastats_task_on_account

Schema:
:   CONSUMER

**Description:** Enables or disables the background task that computes clean room statistics. The task is running by default, but you can
disable it to reduce your costs.

> **Important:**
>
> To manage this task, all collaborators must call the appropriate `provider` or `consumer` version of this
> procedure with the same value.

**Arguments:**

* `enable` - (Boolean) TRUE to enable the task, FALSE to disable the task.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
-- Disable the task in this account.
CALL samooha_by_snowflake_local_db.consumer.manage_datastats_task_on_account(FALSE);
```

### enable_local_db_auto_upgrades

Schema:
:   LIBRARY

**Description:** Enables the task that automatically upgrades the Snowflake Data Clean Rooms environment when new procedures or
functionality is released (The task is `samooha_by_snowflake_local_db.admin.expected_version_task`.) Call this procedure to automate
upgrades, rather than calling `library.apply_patch` with each new release.

Although you might reduce cost by disabling this task, we recommend that you leave it running to ensure that you have the latest version of
the clean rooms environment on your system.

**Arguments:** *None*

**Returns:** *(String)* Success or failure message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.enable_local_db_auto_upgrades();
```

### disable_local_db_auto_upgrades

Schema:
:   LIBRARY

**Description:** Disables the task that automatically upgrades the Snowflake
Data Clean Rooms environment when new versions are released. If you disable auto upgrades, you must call
`library.apply_patch` with each [new release](../../release-notes/new-features.md).

**Arguments:** *None*

**Returns:** *(String)* Success or failure message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.disable_local_db_auto_upgrades();
```

### apply_patch

Schema:
:   LIBRARY

**Description:** Updates your clean rooms environment, enabling new features and fixes in your environment. Call this when a new version of
the clean rooms environment has been released. (This typically occurs weekly; see clean rooms entries in
[Recent feature updates](../../release-notes/new-features.md).) This procedure updates
[SAMOOHA_BY_SNOWFLAKE_LOCAL_DB](v1/installation-details.md).

You can automate patch updates by calling `library.enable_local_db_auto_upgrades`. We recommend enabling auto-updates.

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.apply_patch();
```

### patch_cleanroom

Schema:
:   CONSUMER

**Description:** Updates the specified clean room to the latest version, enabling new features and fixes for that clean room. Typically you
call this only when Snowflake Support tells you to call it.

The provider should call `library.patch_cleanroom` before the consumer calls `library.patch_cleanroom`; otherwise, there is no patch to
apply.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to patch.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.patch_cleanroom($cleanroom_name);
```

### dcr_health.dcr_tasks_health_check

**Description:** Shows information about running or recently stopped clean room tasks.

**Arguments:** *None*

**Returns:** *(Table)* Information about clean room tasks, including the schedule, warehouse name, and warehouse size.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.dcr_health.dcr_tasks_health_check();
```

---
title: Snowflake Data Clean Rooms: Identity and data provider connectors
source: https://docs.snowflake.com/en/user-guide/cleanrooms/connector-identity.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Identity and data provider connectors

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

> **Important:**
>
> Third-party connectors are not offered by Snowflake and may be subject to additional terms. These integrations are made available for
> your convenience, but you are responsible for any content sent to or received from the integrations.
>
> Customers are responsible for obtaining any necessary consents in connection with their use of Snowflake Data Clean Rooms. Please ensure
> that you are complying with applicable laws and regulations when using Snowflake Data Clean Rooms, including in connection with
> third-party connectors for activation purposes.

## Overview

Identity connectors can be used to resolve and join entities between tables when different values refer to the same entity. For example,
if an identity provider knows that two different emails refer to the same person, if Table1 uses email 1 and Table2 uses email 2, using
an identity connector will enable you to join on those two different emails as the same entity.

For an identity connector to be available for use in a clean room, an administrator must first
[configure the clean room to make that connector available to clean room creators](admin-tasks.md).

## Acxiom Real ID connector

Acxiom Real ID lets you generate Real IDs securely within Snowflake, without ever needing to transfer personally identifiable information
(PII) outside your Snowflake account.

> **Tip:**
>
> For additional help, read the [Acxiom Real ID documentation](https://acxiom.my.salesforce.com/sfc/p/#80000000Lm8w/a/8b0000011nke/tK9CJE0OvFfutuk4CZjBYb3eDT5qXAfTJItUa7GOGl0)
> or contact [accrealid@acxiom.com](mailto:accrealid%40acxiom.com) for support.

### Prerequisites

1. Before configuring the Acxiom connector, you must contact Acxiom for help installing their native app.
2. Before a clean room administrator configures the connector, the owner of the Acxiom native app must:

   1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   2. Assume the role that has ownership rights to the Acxiom native app. For example, if the `acxiom_admin_role` role is the owner of
      the Acxiom native app, execute:

      ```sqlexample
      USE ROLE acxiom_admin_role;
      ```
   3. Execute the following command to grant Snowflake Data Clean Rooms access to the Acxiom `realid_app_role` application role:

      ```sqlexample
      GRANT APPLICATION ROLE <acxiom_app_database>.realid_app_role
        TO ROLE SAMOOHA_APP_ROLE;
      ```

### Configure the Acxiom Real ID connector

To configure the Acxiom Real ID connector:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand Acxiom - Real ID.
5. In the Application Database field, enter the name of the application database that was installed by the Acxiom native app.
6. In the Warehouse drop-down list, select the warehouse size. We recommend `DCR_WH_XLarge`, but you can read
   [Acxiom’s guidance on warehouse size and performance](https://acxiom.my.salesforce.com/sfc/p/#80000000Lm8w/a/8b0000011nke/tK9CJE0OvFfutuk4CZjBYb3eDT5qXAfTJItUa7GOGl0).
   For more information about creating a warehouse for use with Snowflake Data Clean Rooms, see
   [Using a different warehouse](admin-tasks.md).
7. Select Save.

## Acxiom Real ID Transcoding connector

The transcoding functionality of Acxiom Real ID lets you generate a crosswalk of your Acxiom Real IDs and your business partners’ Acxiom
Real IDs, without ever needing to transfer PII outside your Snowflake account.

> **Tip:**
>
> For additional help, read the [Acxiom Real ID Transcoding application](https://acxiom.my.salesforce.com/sfc/p/#80000000Lm8w/a/8b000000pVgm/iUAr_yl7KsnqJhgV8qk1xbR49XDurWxgu0lzyubnnO8)
> or contact [accrealid@acxiom.com](mailto:accrealid%40acxiom.com) for support.

### Prerequisites

1. You must have installed the Acxiom Real ID native app, as described previously.
2. You must install the Acxiom Real ID Transcoding application.
3. Contact your collaborators to get the client ID and client secret generated for them when they installed the
   Acxiom Real ID Transcoding native app.
4. Before a clean room administrator configures the connector, the owner of the Acxiom native app must:

   1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   2. Assume the role that has ownership rights to the Acxiom native app. For example, if the `acxiom_admin_role` role is the owner of
      the Acxiom native app, execute:

      ```sqlexample
      USE ROLE acxiom_admin_role;
      ```
   3. Execute the following command to grant Snowflake Data Clean Rooms access to the Acxiom `realid_app_role` application role:

      ```sqlexample
      GRANT APPLICATION ROLE <acxiom_app_database>.realid_app_role TO ROLE SAMOOHA_APP_ROLE;
      ```

### Configure the Acxiom Real ID Transcoding connector

To configure the Acxiom Real ID Transcoding connector:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand Acxiom Real ID Transcoding.
5. In the Application Database field, enter the name of the application database that was installed by the Acxiom native app.
6. In the Client ID field, enter the client ID provided by Acxiom when you installed the native app.
7. In the Client Secret field, enter the client secret provided by Acxiom when you installed the native app.
8. In the Warehouse drop-down list, select the warehouse size. We recommend `DCR_WH_Medium`, but you can read
   [Acxiom’s guidance on warehouse size and performance](https://acxiom.my.salesforce.com/sfc/p/#80000000Lm8w/a/8b0000011nke/tK9CJE0OvFfutuk4CZjBYb3eDT5qXAfTJItUa7GOGl0).
   For more information about creating a warehouse for use with Snowflake Data Clean Rooms, see
   [Using a different warehouse](admin-tasks.md).
9. In the Acxiom Collaborator section, select one or more collaborators along with the client ID and client secret that was generated
   for them when they installed the Acxiom Real ID Transcoding native app. If your collaborator does not appear in the list, you
   must [add them to the clean room environment](manage-dcr-users.md).
10. Select Save.

## Google PAIR Display & Video 360 identity connector

Google provides a PAIR-based identity connector for use with the Google Display & Video 360-PAIR activation connector. This identity
connector can be used only with the Google PAIR activation connector. When the Display & Video PAIR identity connector is used, no
other identity connectors can be used in that clean room.

[Read the instructions for the activation connector](connector-activation.md) to learn how to configure
and use this identity connector.

## LiveRamp Identity Resolution connector

LiveRamp’s Embedded Identity resolves personally identifiable information (PII) or device identifiers into a durable, pseudonymous RampID
and is available through the LiveRamp native app in Snowflake’s Marketplace.
Before you configure the LiveRamp Identity Resolution connector for use in a Snowflake Clean Room, you must first install the LiveRamp
native app. For instructions, see
[Set Up the LiveRamp Native App in Snowflake](https://docs.liveramp.com/identity/en/set-up-the-liveramp-native-app-in-snowflake.html#set-up-the-liveramp-native-app-in-snowflake)
in LiveRamp’s documentation.

> **Tip:**
>
> For additional help, see [LiveRamp Embedded Identity in Snowflake](https://docs.liveramp.com/identity/en/liveramp-embedded-identity-in-snowflake.html) in LiveRamp’s documentation or email [snowflake@liveramp.com](mailto:snowflake%40liveramp.com) for
> support.

Here is how to integrate your clean room environment with the LiveRamp Identity Resolution native application:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation of the clean rooms UI, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand LiveRamp - Identity Resolution & Translation.
5. In the Configuration Table field, enter the application database name given to you by LiveRamp permissioned to the LiveRamp native
   app.
6. Enter the client ID and secret provided by LiveRamp for authentication of this workflow
7. In the Warehouse drop-down list, select the [warehouse size](../warehouses-overview.md). Depending on the datasets used in the
   operation, we recommend a 2XL warehouse for most PII-based execution types.
8. Select Save.

For all RampID-based use cases, you must not try to re-identify the associated individual or reverse engineer the RampID. For any
tables used in the Identity connector, you must preserve the separation of known (PII) and pseudonymous data. During setup, columns can be
marked as PII for the resolution and deconfliction process; any other sensitive identifier columns (such as SSN) need to be fully removed
prior to connecting the table. If you need help or have questions, work with your LiveRamp team.

## LiveRamp RampID Translation connector

LiveRamp’s RampID Translation capability allows for the transcoding of a RampID from one partner domain encoding to another, enabling you
to match persistent pseudonymous identifiers to one another without sharing the sensitive underlying identifiers. This functionality is
available through the LiveRamp native app in the [Snowflake Marketplace](https://www.snowflake.com/en/product/features/marketplace/).

Before you configure this connector for use in a Snowflake Clean Room, you must first install the LiveRamp native app.

> **Tip:**
>
> For additional help, see [LiveRamp Embedded Identity in Snowflake](https://docs.liveramp.com/identity/en/liveramp-embedded-identity-in-snowflake.html) in LiveRamp’s documentation or email [snowflake@liveramp.com](mailto:snowflake%40liveramp.com) for
> support.

To configure the LiveRamp Translation native application:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand LiveRamp - Identity Resolution & Translation.
5. In the Configuration Table field, enter the application database name given to you by LiveRamp permissioned to the LiveRamp native
   app.
6. Enter the client ID and secret provided by LiveRamp for authentication of this workflow.
7. In the Warehouse drop-down list, select the [warehouse size](../warehouses-overview.md). For identity translation workflows only,
   smaller warehouse sizes can be used.
8. Under RampID Collaborators, enter the following:

   1. In the Snowflake Collaborator field, enter the [account locator](../admin-account-identifier.md) of your collaborator’s
      Snowflake account.
   2. In the Target Domain field, enter LiveRamp’s target domain encoding for your collaborator’s RampID space. This is a
      four-character identifier: for more information, contact LiveRamp.
9. Select Save.

## Merkury Identity connector

dentsu’s Merkury Identity Connector enables collaboration across Merkury IDs and the translation of select personally identifiable
information (PII) securely into a pseudonymized Merkury ID.

### Step 1: Install the Merkury Identity Connector native app

1. Install the Merkury Identity Connector native app: contact Merkury at [IDConnector@dentsu.com](mailto:IDConnector%40dentsu.com) to add the listing to your account.
2. Grant privileges to the SAMOOHA_APP_ROLE role:

   1. Sign in to Snowsight.
   2. Assume the role that has ownership rights to the Merkury native app. For example, if the ACCOUNTADMIN role is the owner of the
      Merkury native app, execute `USE ROLE ACCOUNTADMIN;`
   3. Execute the following command to grant Snowflake Data Clean Rooms access to the Merkury `DCR_DB_ROLE` application role:

   ```sqlexample
   GRANT APPLICATION ROLE <merkury_app_database>.DCR_DB_ROLE TO ROLE SAMOOHA_APP_ROLE;
   ```

### Step 2: Configure the Merkury Identity Connector native app

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand Merkury Identity Connector.
5. In the Application database field, enter the name of the application database that was installed by the Merkury Identity native
   app.
6. Authenticate yourself.

## TransUnion TruAudience Identity connector

TransUnion TruAudience Identity provides consumer data hygiene, enrichment, and matching solutions using online and offline identifiers.
It matches rows in your table with a TransUnion identity, which can be used to join rows in your collaborator’s tables.

Keep the following in mind when using the TransUnion integration:

* Snowflake does not consider the TransUnion score filter when matching identities. All matches are included.
* When the provider, not the consumer, is running an analysis like the Overlap Audience Analysis, the distinct collaboration IDs are based
  on the consumer’s count, not the provider’s count.
* You cannot use the SQL Query template to aggregate on the collaboration ID.

Configuration guideUser guide

This section describes how to configure the connector for TransUnion TruAudience Identity. You must have the MANAGE_DCR_CONNECTORS
role to install and configure this connector.

After you configure the connector, Snowflake maintains a cache that maps TransUnion collaborator IDs to values that uniquely identify
records in the source table. As an administrator, you can manage this cache, for example, by
deleting specific records from the cache.

**Prerequisites**
:   The following must be completed before configuring the TransUnion TruAudience Identity connector in the clean room environment:

    Step 1: Install the TransUnion native app
    :   Use the Snowflake Marketplace to install the native app for TransUnion TruAudience Identity.

    Step 2: Grant privileges to the clean rooms native app
    :   After the TransUnion native app has been installed, but before a clean room administrator configures the connector, the owner
        of the TransUnion native app must follow these steps:

        1. Sign in to [Snowsight](../ui-snowsight-gs.md).
        2. Assume a role that has ownership rights to the TransUnion native app. For example, if the `tu_admin_role` role is the
           owner of the
           TransUnion native app, execute:

           ```sqlexample
           USE ROLE tu_admin_role;
           ```
        3. Grant Snowflake Data Clean Rooms access to the TransUnion application role and the TransUnion table installed in step 1:

           ```sqlexample
           GRANT APPLICATION ROLE <transunion_app_database>.tru_app_public
              TO ROLE SAMOOHA_APP_ROLE;

           GRANT SELECT, INSERT
              ON TABLE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS
              TO ROLE SAMOOHA_APP_ROLE;
           ```

    Step 3: Ensure the required stored procedure exists
    :   The TransUnion connector relies on a stored procedure, which might not exist in some clean room environments. To ensure that
        the stored procedure exists, execute the following command as a user with the ACCOUNTADMIN role:

        ```sqlexample
        USE ROLE ACCOUNTADMIN;

        DESCRIBE PROCEDURE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.GRANT_EXTERNAL_APP_ROLE;
        ```

        If you receive an error that the procedure does not exist, you must use the following commands to define the procedure:

        ```sqlexample
        USE ROLE ACCOUNTADMIN;

        CREATE OR REPLACE PROCEDURE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.GRANT_EXTERNAL_APP_ROLE(APP_ROLE string, APPLICATION string)
           RETURNS string
           LANGUAGE SQL
           EXECUTE AS OWNER
           AS
           $$
           GRANT APPLICATION ROLE IDENTIFIER(:APP_ROLE) TO APPLICATION IDENTIFIER(:APPLICATION);
           $$;

        GRANT USAGE ON PROCEDURE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.GRANT_EXTERNAL_APP_ROLE(string, string)
          TO ROLE SAMOOHA_APP_ROLE;
        ```

**Configuring the connector**

To configure the TransUnion TruAudience Identity connector:

1. [Sign in to the clean rooms UI.](v1/web-app-introduction.md)
2. In the left navigation, select Connectors.
3. Select the Identity & Data Providers tab.
4. Expand TransUnion - TruAudience Identity.
5. In the Application Database field, enter the name of the application database that was installed by the TransUnion native
   app.
6. In the Collaboration Key field, enter the collaboration key received from TransUnion for authorization.
7. Select a warehouse that is used when clean room users integrate a table with TransUnion TruAudience Identity.

   If you want to complete the process of matching identities within an hour, use the following guidelines to help select the right
   warehouse size:

   | Number of rows | Warehouse size |
   | --- | --- |
   | < 100k | Large |
   | 1 million | XLarge |
   | 5-10 million with addresses | 3X-Large |
   | > 10 million | 3X-Large |
8. Select Authenticate.

This section describes how to use the TransUnion TruAudience Identity connector in the clean rooms UI.

To enrich your data with TransUnion connector:

1. Start the clean room creation or installation process.
2. When you get to the Specify Join Policies step, expand Identity Hub.
3. Select TransUnion (TruAudience Identity).
4. In the Table field, select the table that contains the data you want to enhance with TransUnion collaboration IDs.
5. In the Unique Record Column field, select the column that uniquely identifies a record in the table, for example, a
   system-generated user ID.
6. Use the User Identifiers section to associate TransUnion identity types with columns in the table. These columns are used to
   match TransUnion identities. The values in these columns should conform to the following requirements.

   > | Identity type | Format requirements |
   > | --- | --- |
   > | Address | * Address Line — Single input. For addresses with lines 1 and 2, combine the two values into a single value. * City — String. * State — Two-character abbreviation. * Zip — Zip code or Zip code+4. Exclude special characters such as spaces or hyphens. |
   > | Date of Birth | yyyy-mm-dd format. |
   > | Device ID | Either IDs with hyphens (36 character length raw Device IDs/MAIDs/IFAs) or IDs without hyphens (32 & 40 character long hashed Device IDs/MAIDs/IFAs). |
   > | Email | Plain text or SHA256-hashed lowercase strings. |
   > | First Name | Upper or lowercase names, including nicknames. Exclude titles and suffixes. |
   > | IP Address | IPv4 addresses in dot notation or integer format. You can use the [PARSE_IP](../../sql-reference/functions/parse_ip.md) function to obtain the integer format. |
   > | Last Name | Upper or lowercase names. Exclude middle initials. |
   > | Phone | Ten digits without special characters like spaces and hyphens. |

**Matched TransUnion Identities**

When Snowflake matches records in the table with TransUnion identities, the collaborator IDs are added to the table in a new column
`TCUID`. When your collaborator adds the column to one of their own tables, you can match records based on the TransUnion
collaborator ID.

### Cache for TransUnion TruAudience Identity

Snowflake maintains a cache that maps TransUnion collaborator IDs to values in the source table that uniquely identify records. For
example,
the cache might map each collaborator ID to a value in the `user_id` column of the source table. The cache is
stored in the SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS table. This table contains the
following columns:

> | Column | Data type | Description |
> | --- | --- | --- |
> | `inputid` | VARCHAR | Value from the column selected as the Unique Record Column during the integration. |
> | `collaborationid` | VARCHAR | TransUnion collaboration ID generated based on the input ID and other integration parameters. |
> | `lastprocessed` | TIMESTAMP_NTZ | Timestamp when TransUnion generated the collaboration ID. |

You can perform the following actions on a cache:

Delete the cache
:   ```sqlexample
    TRUNCATE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS;
    ```

Delete specific records from the cache
:   You can delete specific records from the cache by specifying them as a comma-separated list of single-quoted values. For example, to
    delete the records with input IDs of `123456` and `abcedf`, execute:

    ```sqlexample
    DELETE FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS
      WHERE inputid IN ('123456', 'abcedf');
    ```

Delete multiple records based on input IDs in a separate dataset
:   You can delete multiple records from the cache when the input IDs are present in a column of another table. For example, if the input IDs
    to be deleted are listed in the `user_id` column of the `my_db.my_schema.ref_table` table, execute:

    ```sqlexample
    DELETE FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS
      WHERE INPUTID IN (
        SELECT user_id as INPUTID
        FROM my_db.my_schema.ref_table
      );
    ```

Add all records from a batch
:   You can add all of the records from a batch that is present in TransUnion’s view to the cache.

    ```sqlexample
    INSERT INTO SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS (
      INPUTID,
      COLLABORATIONID,
      LASTPROCESSED
    SELECT
      INPUTID,
      COLLABORATIONID,
      LASTPROCESSED
    FROM <TRANSUNION_APPLICATION_DATABASE>.SHARE_SCHEMA.REF_MATCHING_OUTPUT_VIEW
    WHERE BATCHID = '<BATCH_ID>';
    ```

Merge all records from a batch
:   You can merge all of the records from a batch that is present in TransUnion’s view to the cache by overwriting existing input ID records
    with the corresponding new collaboration IDs and new last-processed timestamps.

    ```sqlexample
    MERGE INTO SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS CT
    USING <TRANSUNION_APPLICATION_DATABASE>.SHARE_SCHEMA.REF_MATCHING_OUTPUT_VIEW OT
      ON
        CT.INPUTID = OT.INPUTID
        AND OT.BATCHID = '<BATCH_ID>'
    WHEN MATCHED THEN
      UPDATE SET
        CT.COLLABORATIONID = OT.COLLABORATIONID,
        CT.LASTPROCESSED = OT.LASTPROCESSED
    WHEN NOT MATCHED THEN
      INSERT (
        INPUTID,
        COLLABORATIONID,
        LASTPROCESSED
      ) VALUES (
          OT.INPUTID,
          OT.COLLABORATIONID,
          OT.LASTPROCESSED
      );
    ```

Add collaborator IDs for input ID records
:   You can add collaborator IDs for input ID records present as a column in a dataset and also present in a specific batch.

    ```sqlexample
    INSERT INTO SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.SAMOOHA_INTERNAL_TRANSUNION_ID_GENERATION_RECORDS (
      INPUTID,
      COLLABORATIONID,
      LASTPROCESSED
    )
      SELECT
        INPUTID,
        COLLABORATIONID,
        LASTPROCESSED
      FROM <TRANSUNION_APPLICATION_DATABASE>.SHARE_SCHEMA.REF_MATCHING_OUTPUT_VIEW
      WHERE INPUTID IN (
        SELECT <column_name_containing_input_ids_to_be_added> as INPUTID
        FROM <dataset_fqtn_containing_input_ids_to_be_added>
        )
        AND BATCHID = '<BATCH_ID>';
    ```

---
title: Snowflake Data Clean Rooms: Installed objects
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/installation-details.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Installed objects

This topic provides information about the objects created in your account when you install a clean room environment.

## High-level overview

The following diagram shows a high-level view of the main objects installed in provider and consumer accounts:

* **Clean rooms UI:** Users accessing a clean room using the clean rooms UI go through a service user account, configured once per account
  by the clean room installer, to the clean rooms API.
* **API user:** API users and the clean rooms UI both use the same clean rooms API. This API is defined by the local DB in your account.
* **Local DB:** Defines the clean rooms API. There is one local DB per account (not per clean room). The actual name of this object is
  SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.
* **Clean room application package:** Created on the provider’s account when the provider creates a clean room. There is one package per
  clean room, named `SAMOOHA_CLEANROOM_cleanroom name`. This package produces the clean room app installed by the consumer.
* **Back shares:** Back shares must be mounted by the provider to get messages and data from the consumer to the provider.
  Native apps support data flows only from the provider to the consumer, so a back share must be mounted to enable data to flow from the
  consumer to the provider. Two back shares are mounted in the provider’s account: a governance back share, which stores provider-run and
  provider activation data; and a request log back share, which stores messaging and responses from the consumer to the provider, such as
  consumer custom template requests, or consumer approvals of provider run requests. (The share itself lives in the consumer’s account.)
* **Installed app:** Created by an application package, the installed app defines a clean room on the consumer side. The installed app
  follows the naming convention `SAMOOHA_CLEANROOM_APP_cleanroom_name`.
* **Consumer DB:** Contains read-only views of the datasets registered in the consumer’s account. The consumer creates these views when they
  link datasets into the clean room. The consumer’s account contains one consumer DB per clean room, named `SAMOOHA_CLEANROOM_CONSUMER_clean room ID`.
* **Clean room:** A clean room at a high level can be considered as comprising the application package and back shares on the provider side,
  plus the installed app and consumer DB on the consumer side.

## Application packages

The following application packages can be installed in your account:

`SAMOOHA_CLEANROOM_cleanroom name`
:   Installed in the provider’s account, one application package per clean room created. It contains all the core
    application logic of a clean room created by the provider. It also contains the secure views used to
    share data with the clean room and several tables that store the clean room state. These include tables
    that record the current differential privacy budget of consumers, the column and join policy, and names of
    tables linked to the clean room.

## Applications

The following applications can be installed in your account:

`SAMOOHA_CLEANROOM_APP_cleanroom_name`
:   Installed in the consumer’s account when they install (join) a clean room.

## Databases

Snowflake Data Clean Rooms installs the following databases:

### SAMOOHA_BY_SNOWFLAKE

This database contains all of the core functionality and application logic used to create and manage clean rooms.
This database has the following schemas:

ADMIN schema
:   This schema contains app-level details such as the following:

    * Patches applied (version, commands).
    * Version information (number).

APP_SCHEMA schema
:   This schema contains functions and procedures that are necessary to facilitate all the clean room flows. Key details include:

    * Encrypt and decrypt functions.
    * Clean room procedures that you use with the developer APIs and clean rooms UI to create, install, and work with clean rooms.

TEMPLATES schema
:   This schema contains the Snowflake-provided SQL Jinja templates.

    These pre-built templates offer ready-to-use SQL queries for secure data collaboration within Snowflake Data Clean Rooms. They leverage
    Jinja templating for customization, allowing you to tailor queries to specific data sharing scenarios.

### SAMOOHA_BY_SNOWFLAKE_LOCAL_DB

This database is created by the clean rooms UI during the Snowflake installation process. It is local to your account. It is not
an application, but does contain application logic.

This database has two types of data:

* The developer APIs that you and the clean rooms UI use to create and manage clean rooms.
* Intermediate datasets owned by you that get saved to the PUBLIC schema during flows such as identity resolution. For example,
  the output tables from LiveRamp’s resolution and transcoding process are saved to the PUBLIC schema and joined to the view
  that gets linked to the clean room by the clean rooms UI.

The database has the following schemas:

ADMIN schema
:   This schema contains information necessary to operate certain clean room features associated with the account, such as:

    * Using Cross-Cloud Auto-Fulfillment to collaborate across regions or cloud platforms.
    * Clean room metadata updates needed to register clean rooms from developer APIs to the clean rooms UI.
    * Versioning of the current procedures associated with the functioning of the clean rooms UI with the Snowflake account.
    * Tasks and streams that listen to changes in the set of clean room shares that are shared back from collaborators, and to enable/disable
      clean rooms as needed based on the changes.

CONSUMER schema
:   This schema contains the definitions of the [consumer API procedures](../consumer.md) as well as some common
    consumer tasks.

INFORMATION_SCHEMA schema
:   Like all Snowflake databases, this database contains the INFORMATION_SCHEMA schema (“Data dictionary”), which consists of a set of
    system-defined views and table functions that provide extensive metadata information about the objects created in your account.

LIBRARY schema
:   This schema contains the definitions of the `library` namespace API procedures as well as some common tasks and procedures used by both
    providers and consumers.

PROVIDER schema
:   This schema contains the definitions of the [provider API procedures](../provider.md) as well as some common
    provider tasks.

PUBLIC schema
:   This schema contains the developer APIs that you and the clean rooms UI use to create and manage clean rooms. It also contains
    intermediate datasets owned entirely by you that get saved to the PUBLIC schema during flows such as identity resolution. For example,
    the output tables from LiveRamp’s resolution and transcoding process are saved to the PUBLIC schema and joined to the view that gets
    linked to the clean room by the clean rooms UI.

    This schema has the following tables:

    * **CLEANROOM_RECORD**: This table includes the status of a clean room (created, deleted) along with the user and timestamp of the last
      update. If the update was done in the clean rooms UI, the user is the service account user. If the update was done in
      Snowsight using
      the developer APIs, the user is the actual user who called the API. The clean room database name can be customized in this table.
    * **CONNECTOR_CONFIGURATION**: This table is the list of configured connectors in the account.
    * **REPORTS**: This table includes the list of reports saved by the consumer in the clean rooms UI. Top-level results from standard
      reports are saved in the table.
    * **HORIZONTAL_ANALYSIS_<report ID>**: Output of analyses executed with the SQL Query template and custom templates executed in the clean
      rooms UI.
    * **CONSUMER_ACTIVATION_SUMMARY**: Consumer activation results.
    * **PROVIDER_ACTIVATION_SUMMARY**: Provider activation results.

This database has three shares that get created from it:

* **SAMOOHA_INTERNAL_GOVERNANCE_SUMMARY_SHARE_NAV2**: This share contains views on the (CONSUMER_/PROVIDER_)GOVERNANCE_SUMMARY and
  (CONSUMER_/PROVIDER_)ACTIVATION_SUMMARY tables in the
  PUBLIC schema. This gets shared with any providers who have created clean rooms installed by this account, and is used to share
  governance information and provider activations back.
* **SAMOOHA_INTERNAL_LOGS_SHARE_NAV2**: This share is on the LOG_EVENTS table and is primarily used to share logs on how ID
  resolution procedures are progressing back to Snowflake, given they use third-party native apps. No PII or data is ever shared back,
  only the success/failure of the third-party app APIs used for transcoding/resolution.
* **SAMOOHA_INTERNAL_PROVIDER_METADATA_NAV2**: This share is on two tables, ADMIN.METADATA_UPDATE_REQUESTS, which is used to send registration
  requests from the API to the UI, and ADMIN.RESOURCE_MONITOR_USAGE, which is only used by managed accounts to log usage.

### `SAMOOHA_CLEANROOM_cleanroom ID`

Each clean room published (as a creator) or installed (as a consumer) has an associated database that includes all the details of that
clean room, including any templates installed, request logs, LAF status, and much more. This database includes the following schemas:

> * **Admin**: Cryptographic keys, privacy budget, request logs, requests for provider analyses, and more.
> * **Shared_schema**: Join policy, LAF status, linked tables, and versions.
> * **Templates**: List of activation templates, custom templates, and template chains in this clean room.

### `SAMOOHA_CLEANROOM_REQUESTS_clean room ID`

This is a database on the provider side and a share on the consumer side. It corresponds to the share that gets sent back from a consumer
to the provider of a clean room as part of the consumer clean room installation process. This database contains information on all the
requests raised by the consumer against the clean room and is used to keep track of the differential privacy budget usage by the consumer.

### `SAMOOHA_CLEANROOM_CONSUMER_clean room ID`

This database is installed in consumer accounts only. It is used to share objects such as the secure view of the consumer data to the
clean room, and consumer column/join policies if applied. It has the following table:

* `SAMOOHA_CLEANROOM_CONSUMER_clean room ID.SHARED.REQUESTS`. This table shows the consumer exactly which query was attempting to
  run, where PROPOSED_QUERY is the query rendered from the consumer’s template.

### SAMOOHA_SAMPLE_DATABASE

This database contains sample datasets named DEMO.CUSTOMERS and DEMO.CUSTOMERS_2 that you can use as test data.

> **Note:**
>
> The CUSTOMERS_2 table was added in September 2025. If you installed your clean rooms environment before then, you might not have this
> sample table installed. To see whether you have CUSTOMERS_2 installed, you can run the following SQL code:
>
> ```sqlexample
> SHOW TABLES LIKE 'CUSTOMERS_2' IN SCHEMA SAMOOHA_SAMPLE_DATABASE.DEMO;
> ```
>
> If the response contains no rows, then you, or someone with ACCOUNTADMIN role, must run the following command to install the sample table:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
> ```

## Tasks

Here are some tasks used by clean rooms that you might see running in your environment.

You can find more information about a given task by running the following procedure:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.DCR_HEALTH.DCR_TASKS_HEALTH_CHECK();
```

[Learn how to view your task and warehouse usage costs.](cleanroom-cost.md)

Clean room tasks

| Task name | Description | Warehouse | Entity level |
| --- | --- | --- | --- |
| `AUTO_RUN_warehouse` | Runs the scheduled reports for each warehouse. Uses the warehouse it reports on.  Default schedule: 1 day. | `DCR_WH_warehouse` | Per clean room report |
| `AUTO_RUN_TASK` | Runs the reports set to auto-run.  Default schedule: 1 day. | The warehouse chosen by the user. | Per account |
| `COMPUTE_DATA_STATS_​FOR_ACCOUNT_consumer locator` | Computes baseline metrics for joined clean rooms.  Default schedule: 3 hours. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `COMPUTE_DATA_STATS_​FOR_ACCOUNT_provider locator` | Computes baseline metrics for created clean rooms.  Default schedule: 3 hours. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `DISTINCT_COLUMN_VALUES​_TASK` | Computes distinct values for datasets linked in a clean room to enable filter dropdowns.  Default schedule: 1 day. | SAMOOHA_TASK_WAREHOUSE | Per clean room |
| `EXPECTED_VERSION_TASK` | Automatically upgrades the native app as new versions are released.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `LISTEN_TO_REQUESTS` | Mount, repair, and validate incoming shares from collaborators when differential privacy is enabled on the account. The same task at higher a frequency is added to prevent over-running of analysis when DP is enabled. This task costs approximately 6 credits per day.  Default schedule: 1 minute. | *Serverless* | Per account |
| `LISTEN_TO_REQUESTS_NODP` | Mount, repair, and validate incoming shares from collaborators.  Default schedule: 30 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `LISTEN_TO_REQUESTS​_1_COLLABORATOR` | Sets up listeners for the return requests streamed back from the consumer to the provider. Determines whether a clean room has been enabled.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per collaborator |
| `MONITORING_SUMMARY_CRON_TASK` | Internal usage.  Default schedule: 30 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `MOUNT_PROVIDER_ACTIVATIONS_TASK` | Mounts the incoming share for activations for each consumer.  Default schedule: 15 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `PRIVACY_AND_SECURITY_SCANNER` | Scans each template in each provider’s clean room for privacy and security issues.  Default schedule: 30 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `PROCESS_ACTIVATIONS` | Decrypts the activation data sent back by the consumer.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per account |
| PROCESS_PROVIDER_ANALYSIS_REQUESTS | Runs the actual provider analysis.  Default schedule: Triggered by request. | `PROVIDER_RUN_UUID` | Per clean room |
| `PROCESS_REQUESTS_​BUDGET_COLLABORATOR_1` | Processes the differential privacy budget for a clean room.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per collaborator |
| `PROCESS_TEMPLATE_REQUESTS​_COLLABORATOR` | Processes the template requests for a clean room.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per collaborator |
| `RESET_PRIVACY_BUDGET` | Resets the privacy budget for all clean rooms.  Default schedule: 1 day. | SAMOOHA_TASK_WAREHOUSE | Per clean room |
| `SAMOOHA_INTERNAL_UID_​OUTPUT_TABLE_REFRESH_TABLE_DATA_TASK` | Created once per table.  Default schedule: 1 day. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `SETUP_AUTO_RUN` | Sets up auto-run reports.  Default schedule: 60 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |
| `SETUP_PROVIDER_ANALYSIS​_REQUESTS` | Sets up provider analysis infrastructure and processes the requests for provider analysis.  Default schedule: Triggered by request. | SAMOOHA_TASK_WAREHOUSE | Per clean room |
| `TRIGGER_REFRESH_FOR_LAF_CLEANROOMS` | Triggers data refresh for Cross-Cloud Auto-Fulfillment enabled clean rooms.  Default schedule: 30 minutes. | SAMOOHA_TASK_WAREHOUSE | Per account |

## Warehouses

Snowflake Data Clean Rooms installs the following warehouses in your account. You can change the size of any warehouse
as needed. We recommend that you use XS warehouses for general clean room editing, creation, or deletion commands. Consider using larger warehouses, or Snowpark-optimized warehouses, when running large analyses, such as machine learning workloads.

[Learn how to view your warehouse usage costs.](cleanroom-cost.md)

| Warehouse name | Notes |
| --- | --- |
| APP_WH | XSMALL warehouse has access to the API, sets up new clean rooms, manages permissions and data sharing. |
| DCR_WH_SMALL | Regular, SMALL warehouse |
| DCR_WH_Medium | Regular, MEDIUM warehouse |
| DCR_WH_Large | Regular, LARGE warehouse |
| DCR_WH_XLarge | Regular, XLARGE warehouse |
| DCR_WH_2XLARGE | Regular, XXLARGE warehouse |
| DCR_WH_4XLarge | Regular, X4LARGE warehouse |
| DCR_WH_OPT_XLarge | Snowpark-Optimized, XLARGE warehouse |
| DCR_WH_OPT_2XLarge | Snowpark-Optimized, XXLARGE warehouse |
| DCR_WH_OPT_4XLarge | Snowpark-Optimized, X4LARGE warehouse |
| PROVIDER_RUN_<cleanroom_identifier> | Warehouse in consumer’s account that executes analyses run by the provider. |
| SAMOOHA_TASK_WAREHOUSE | XSMALL warehouse used for many things, such as privacy and security scans, processing auto-run reports, computing data stats, and processing consumer template requests. |
| DCR_ACTIVATION_WAREHOUSE | Used to decrypt activation results sent to the provider. Default size is XL, but size can be modified by calling `provider.update_activation_warehouse`. |

## Other objects

Snowflake Data Clean Room installs the following additional objects:

* SAMOOHA_SERVICE_ACCOUNT_USER_ACCESS: A user-level network policy used by the service user to
  [enable the clean room UI](enable-clean-rooms-ui.md).

---
title: Snowflake Data Clean Rooms: Machine Learning
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/machine-learning.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Machine Learning

This topic describes the provider and consumer flows needed to programmatically set up a clean room, share it with a consumer, and run analyses through advanced machine learning algorithms in it. The provider flows loads secure Python code implementing a random-forest-based XGBoost machine learning algorithm into the clean room. This is completely confidential, and visible only to the provider. The consumer cannot see the Python machine-learning code loaded into the clean room.

This flow covers the following:

1. Provider:

   a. Add a custom template running a Lookalike Modeling analysis.

   b. Securely add Machine Learning python code-based templates leveraging XGBoost.

   c. Call the machine learning UDFs inside the clean room using the custom template.
2. Consumer:

   a. Run the custom template that uses the ML functions defined by the provider.

*Lookalike Modeling* is a type of analysis where a consumer tries to find “high-value” customers from a provider’s data by training a statistical model on their high-value customers. This model uses consumer-specified flags to indicate high-value users, such as those with expenditures above a certain threshold, in the consumer’s dataset. The trained model is then used to infer which customers in the provider’s data could potentially be “high value” to the consumer.

## Prerequisites

You need two separate Snowflake accounts to complete this flow. Use the first account to execute the provider’s commands, then switch to the second account to execute the consumer’s commands.

## Provider

> **Note:**
>
> The following commands should be run in a Snowflake worksheet in the provider account.

### Set up the environment

Execute the following commands to set up the Snowflake environment before using developer APIs to work with a Snowflake Data Clean Room. If you don’t have the SAMOOHA_APP_ROLE role, contact your account administrator.

```sqlexample
use role SAMOOHA_APP_ROLE;
use warehouse app_wh;
```

### Create the clean room

Create a name for the clean room. Enter a new clean room name to avoid colliding with existing clean room names. Clean room names can only be **alphanumeric**. Clean room names cannot contain special characters other than spaces and underscores.

```sqlexample
set cleanroom_name = 'Machine Learning Demo Clean room';
```

You can create a new clean room with the clean room name set above. If the clean room name set above already exists as an existing clean room, this process will fail.

This procedure may take a little longer to run, typically about half a minute.

The second argument to *provider.cleanroom_init* is the distribution of the clean room. This can either be INTERNAL or EXTERNAL. For testing purposes, if you are sharing the clean room to an account in the same organization, you can use INTERNAL to bypass the automated security scan which must take place before an application package is released to collaborators. However, if you are sharing this clean room to an account in a different organization, you must use an EXTERNAL clean room distribution.

```sqlexample
call samooha_by_snowflake_local_db.provider.cleanroom_init($cleanroom_name, 'INTERNAL');
```

In order to view the status of the security scan, use:

```sqlexample
call samooha_by_snowflake_local_db.provider.view_cleanroom_scan_status($cleanroom_name);
```

Once you have created your clean room, you must set its release directive before it can be shared with any collaborator. However, if your distribution was set to EXTERNAL, you must first wait for the security scan to complete before setting the release directive. You can continue running the remainder of the steps and return here before the *provider.create_or_update_cleanroom_listing* step while the scan runs.

In order to set the release directive, call:

```sqlexample
call samooha_by_snowflake_local_db.provider.set_default_release_directive($cleanroom_name, 'V1_0', '0');
```

> **Important:**
>
> If the consumer and provider are in different cloud regions, you need to enable [Cross-cloud auto-fulfillment](../laf.md) in both accounts and for both clean rooms.

### Link the dataset, and set the join policy for the dataset

Link Snowflake tables into the clean room, browse through the list of tables in your Snowflake account and enter the fully qualified table names (Database.Schema.Table) as an array. The procedure automatically makes the table accessible to the clean room by creating a secure view of the table from within the clean room, thereby avoiding any need to make a copy of your table.

```sqlexample
call samooha_by_snowflake_local_db.provider.link_datasets($cleanroom_name, ['samooha_provider_sample_database.lookalike_modeling.customers']);
```

> **Note:**
>
> If this step doesn’t work even though your table exists, it is likely the SAMOOHA_APP_ROLE role has not yet been given access to it. If so, switch to the ACCOUNTADMIN role, call the below procedure on the database, and then switch back for the rest of the flow:
>
> ```sqlexample
> use role accountadmin;
> call samooha_by_snowflake_local_db.provider.register_db('<DATABASE_NAME>');
> use role SAMOOHA_APP_ROLE;
> ```

You can view the dataset names linked to the clean room by calling the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_provider_datasets($cleanroom_name);
```

You can see the datasets linked to the clean room using the following procedure:

```sqlexample
select * from samooha_provider_sample_database.lookalike_modeling.customers limit 10;
```

Specify which columns the consumer is allowed to join on when running templates within the clean room. This procedure should be called on identity columns like email. The join policy is “replace only”, so if the function is called again, then the previously set join policy is completely replaced by the new one.

```sqlexample
call samooha_by_snowflake_local_db.provider.set_join_policy($cleanroom_name, ['samooha_provider_sample_database.lookalike_modeling.customers:hashed_email']);
```

If you want to view all the columns to decide the join policy columns, call the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_join_policy($cleanroom_name);
```

### Add confidential Machine Learning Python code to the clean room

This section shows you how to load some python functions into the clean room for the lookalike ML work. All python functions installed in the clean room remain completely confidential. They cannot be seen by the consumer.

The following API allows you to define your Python functions directly as inline functions into the clean room. Alternatively you can load Python from staged files you’ve uploaded into the clean room stage. See the [API reference guide](../provider.md) for an example.

> **Note:**
>
> This implementation is limited by the total Snowflake size constraint on the amount of data that can be aggregated by ARRAY_AGG (128 MB). **Upon request**, Snowflake provides an implementation that leverages batching and streaming models that can scale to arbitrarily sized data sets.

```sqlexample-python
call samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
    $cleanroom_name,
    'lookalike_train',
    ['input_data variant', 'labels variant'],
    ['pandas', 'numpy', 'xgboost'],
    'variant',
    'train',
    $$
import numpy as np
import pandas as pd
import xgboost
from sklearn import preprocessing
import sys
import os
import pickle
import codecs
import threading

class TrainXGBoostClassifier(object):
    def __init__(self):
        self.model = None
        self._params = {
            "objective": "binary:logistic",
            "max_depth": 3,
            "nthread": 1,
            "eval_metric": "auc",
        }
        self.num_boosting_rounds = 10

    def get_params(self):
        if self.model is not None and "updater" not in self._params:
            self._params.update(
                {"process_type": "update", "updater": "refresh", "refresh_leaf": True}
            )
        return self._params

    def train(self, X, y):
        """
        Train the model in a threadsafe way
        """
        # pick only the categorical attributes
        categorical = X.select_dtypes(include=[object])

        # fit a one-hot-encoder to convert categorical features to binary features (required by XGBoost)
        ohe = preprocessing.OneHotEncoder()
        categorical_ohe = ohe.fit_transform(categorical)
        self.ohe = ohe

        # get the rest of the features and add them to the binary features
        non_categorical = X.select_dtypes(exclude=[object])
        train_x = np.concatenate((categorical_ohe.toarray(), non_categorical.to_numpy()), axis=1)

        xg_train = xgboost.DMatrix(train_x, label=y)

        params = self.get_params()
        params["eval_metric"] = "auc"
        evallist = [(xg_train, "train")]
        evals_result = {}

        self.model = xgboost.train(
            params, xg_train, self.num_boosting_rounds, evallist, evals_result=evals_result
        )

        self.evals_result = evals_result

    def __dump_model(self, model):
        """
        Save down the model as a json string to load up for scoring/inference
        """
        pickle_jar = codecs.encode(pickle.dumps([model, self.ohe]), "base64").decode()
        return pickle_jar

    def dump_model(self):
        """
        Save down the model as a json string to load up for scoring/inference
        """
        if self.model is not None:
            return self.__dump_model(self.model)
        else:
            raise ValueError("Model needs to be trained first")

def train(d1, l1):

    # get take training features and put them in a pandas dataframe
    X = pd.DataFrame(d1)

    # get the labels into a Numpy array
    y = np.array(l1)

    trainer = TrainXGBoostClassifier()
    trainer.train(X, y)

    # return training stats, accuracy, and the pickled model and pickled one-hot-encoder
    return {
        "total_rows": len(d1),
        "total_bytes_in": sys.getsizeof(d1),
        "model": trainer.dump_model(),
        "iteration": trainer.num_boosting_rounds,
        "auc": np.max(trainer.evals_result["train"]["auc"]),
        "error": 1 - np.max(trainer.evals_result["train"]["auc"])
    }
    $$
);
```

Now let’s install a scoring function into the clean room

```sqlexample-python
call samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
    $cleanroom_name,
    'lookalike_score',
    ['pickle_jar variant', 'emails variant', 'features variant'],
    ['pandas', 'numpy', 'xgboost', 'scikit-learn'],
    'string',
    'score',
    $$
import numpy as np
import pandas as pd
import xgboost as xgb
import pickle
import codecs
import json

def score(model, emails, features):
    # load model
    model = model[0] if not isinstance(model, str) else model
    model = pickle.loads(codecs.decode(model.encode(), "base64"))

    # retrieve the XGBoost trainer from the pickle jar
    bst = model[0]

    # retrieve the fitted one-hot-encoder from the pickle jar
    ohe2 = model[1]

    # create pandas dataframe from the inference features
    Y = pd.DataFrame(features)

    # select the categorical attributes and one-hot-encode them
    Y1 = Y.select_dtypes(include=[object])
    Y2 = ohe2.transform(Y1)

    # select the non-categorical attributes
    Y3 = Y.select_dtypes(exclude=[object])

    # join the results of the one-hot encoding to the rest of the attributes
    Y_pred = np.concatenate((Y2.toarray(), Y3.to_numpy()), axis=1)

    # inference
    dscore = xgb.DMatrix(Y_pred)
    pred = bst.predict(dscore)

    retval = list(zip(np.array(emails), list(map(str, pred))))
    retval = [{"email": r[0], "score": r[1]} for r in retval]
    return json.dumps(retval)
    $$
);
```

> **Note:**
>
> Loading Python into the clean room creates a new patch for the clean room. If your clean room distribution is set to EXTERNAL, you need to wait for the security scan to complete, then update the default release directive using:

```sqlexample
-- See the versions available inside the cleanroom
show versions in application package samooha_cleanroom_Machine_Learning_Demo_clean_room;

-- Once the security scan is approved, update the release directive to the latest version
call samooha_by_snowflake_local_db.provider.set_default_release_directive($cleanroom_name, 'V1_0', '2');
```

### Add a Custom Lookalike Modeling template

To add a custom analysis template to the clean room you need a placeholder for table names on both the provider and consumer sides, along with join columns from the provider side. In SQL Jinja templates, these placeholders must always be:

* **source_table**: an *array* of table names from the provider
* **my_table**: an *array* of table names from the consumer

Table names can be made dynamic through using these variables, but they can also be hardcoded into the template if desired using the name of the view linked to the clean room. Column names can either be hardcoded into the template, if desired, or set dynamically through parameters. If they are set through parameters, remember that you need to call the parameters **dimensions** or **measure_column**, which need to be arrays, in order for them to be checked against the column policy. You add these as SQL Jinja parameters in the template that will be passed in later by the consumer when querying. The join policies ensure that the consumer cannot join on columns other than the authorized ones.

Alternatively, any argument in a custom SQL Jinja template can be checked for compliance with the join and column policies using the following filters:

* **join_policy**: checks if a string value or filter clause is compliant with the join policy
* **column_policy**: checks if a string value or filter clause is compliant with the column policy
* **join_and_column_policy**: checks if columns used for a join in a filter clause are compliant with the join policy, and that columns used as a filter are compliant with the column policy

For example, in the clause *{{ provider_id | sqlsafe | join_policy }}*, an input of *p.HEM* will be parsed to check if *p.HEM* is in the join policy. Note: Only use the *sqlsafe* filter with caution, it allows collaborators to put pure SQL into the template.

> **Note:**
>
> All provider/consumer tables must be referenced using these arguments since the name of the secure view actually linked to the clean room will be different to the table name. Critically, provider table aliases **MUST** be p (or p1), p2, p3, p4, etc. and consumer table aliases **must** be c (or c1), c2, c3, etc. This is required in order to enforce security policies in the clean room.

This function overrides any existing template with the same name. If you want to update any existing template, you can simply call this function again with the updated template.

A set of features is selected from the provider dataset, and a set of labels is selected from the consumer dataset, along with a “high value” flag (called label_value). These 2 tables are then inner-joined on email and passed to the Random Forest training algorithm. Lastly, the output of the model training step is passed to an inference function, which uses the trained model to “infer” which of the provider customers NOT in the consumer datasets could be “high value”. The **count** of such individuals is then returned, along with the model error.

The threshold for determining the score beyond which a customer is “likely high value” is manually set in the template as 0.5. This can be easily changed when adding the template to the clean room.

```sqlexample-jinja
call samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    'prod_custom_lookalike_template',
    $$
WITH
features AS (
    SELECT
        p.hashed_email,
        array_construct(identifier({{ dimensions[0] | column_policy }}) {% for feat in dimensions[1:] %} , identifier({{ feat | column_policy }}) {% endfor %}) as features
    FROM
        identifier({{ source_table[0] }}) as p
),
labels AS (
    SELECT
        c.hashed_email,
        {{ filter_clause | sqlsafe | column_policy }} as label_value
    FROM
        identifier({{ my_table[0] }}) as c
),
trained_model AS (
    SELECT
        train_out:model::varchar as model,
        train_out:error::float as error
    FROM (
      SELECT
        cleanroom.lookalike_train(array_agg(f.features), array_agg(l.label_value)) as train_out
      FROM features f, labels l
      WHERE f.hashed_email = l.hashed_email
    )
),
inference_output AS (
    SELECT
        MOD(seq4(), 100) as batch,
        cleanroom.lookalike_score(
            array_agg(distinct t.model),
            array_agg(p.hashed_email),
            array_agg(array_construct( identifier({{ dimensions[0] | column_policy }}) {% for feat in dimensions[1:] %} , identifier({{ feat | column_policy }}) {% endfor %}) )
        ) as scores
    FROM trained_model t, identifier({{ source_table[0] }}) p
    WHERE p.hashed_email NOT IN (SELECT c.hashed_email FROM identifier({{ my_table[0] }}) c)
    GROUP BY batch
),
processed_output AS (
    SELECT value:email::string as email, value:score::float as score FROM (select scores from inference_output), lateral flatten(input => parse_json(scores))
)
SELECT p.audience_size, t.error from (SELECT count(distinct email) as audience_size FROM processed_output WHERE score > 0.5) p, trained_model t;
    $$
);
```

> **Note:**
>
> You can add Differential Privacy sensitivity to samooha_by_snowflake_local_db.provider.add_custom_sql_template procedure call above as the last parameter (if you do not add it, it will default to 1)

If you want to view the templates that are currently active in the clean room, call the following procedure. You can make the modifications to enable Differential Privacy guarantees on your analysis. A similar pattern can be incorporated into any custom template that you choose to write.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_added_templates($cleanroom_name);
```

### Set the column policy on each table

Display the data linked to see the columns present inside the table. To view the top 10 rows, call the following procedure.

```sqlexample
select * from samooha_provider_sample_database.lookalike_modeling.customers limit 10;
```

Set the columns on which you want to group, aggregate (e.g. SUM/AVG) and generally use in an analysis for every table and template combination. This gives flexibility so that the same table can allow different column selections depending on the underlying template. This should be called only after adding the template.

The column policy is **replace only**, so if the function is called again, then the previously set column policy is completely replaced by the new one.

Column policy should not be used on identity columns like email, HEM, RampID, etc. since you don’t want the consumer to be able to group by these columns. In the production environment, the system will intelligently infer PII columns and block this operation, but this feature is not available in the sandbox environment. It should only be used on columns that you want the consumer to be able to aggregate and group by, like Status, Age Band, Region Code, Days Active, etc.

For the “column_policy” and “join_policy” to carry out checks on the consumer analysis requests, all column names MUST be referred to as **dimensions** or **measure_columns** in the SQL Jinja template. Make sure you use these tags to refer to columns you want to be checked in custom SQL Jinja templates.

```sqlexample
call samooha_by_snowflake_local_db.provider.set_column_policy($cleanroom_name, [
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:status',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:age',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:region_code',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:days_active',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:income_bracket',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:household_size',
    'prod_custom_lookalike_template:samooha_provider_sample_database.lookalike_modeling.customers:gender'
]);
```

If you want to view the column policy that has been added to the clean room, call the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_column_policy($cleanroom_name);
```

### Share with a consumer

Finally, add a data consumer to the clean room by adding their Snowflake account locator and account names as shown below. The Snowflake account name must be of the form <ORGANIZATION>.<ACCOUNT_NAME>.

> **Note:**
>
> In order to call the following procedures, make sure you have first set the release directive using *provider.set_default_release_directive*. You can see the latest available version and patches using:
>
> ```sqlexample
> show versions in application package samooha_cleanroom_Machine_Learning_Demo_clean_room;
> ```

```sqlexample
call samooha_by_snowflake_local_db.provider.add_consumers($cleanroom_name, '<CONSUMER_ACCOUNT_LOCATOR>', '<CONSUMER_ACCOUNT_NAME>');
call samooha_By_snowflake_local_db.provider.create_or_update_cleanroom_listing($cleanroom_name);
```

Multiple consumer account locators can be passed into the *provider.add_consumers* function as a comma separated string, or as separate calls to *provider.add_consumers*.

If you want to view the consumers who have been added to this clean room, call the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_consumers($cleanroom_name);
```

If you want to view the clean rooms that have been created recently, use the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.view_cleanrooms();
```

If you want to get more insights about the clean room that you have created, use the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.provider.describe_cleanroom($cleanroom_name);
```

Any clean room created can also be deleted. The following command drops the clean room entirely, so any consumers who previously had access to the clean room will no longer be able to use it. If a clean room with the same name is desired in the future, it must be re-initialized using the above flow.

```sqlexample
call samooha_by_snowflake_local_db.provider.drop_cleanroom($cleanroom_name);
```

> **Note:**
>
> The provider flow is now finished. Switch to the consumer account to continue with consumer flow.

## Consumer

> **Note:**
>
> The following commands should be run in a Snowflake worksheet in the consumer account

### Set up the environment

Execute the following commands to set up the Snowflake environment before using developer APIs to work with a Snowflake Data Clean Room. If you don’t have the SAMOOHA_APP_ROLE role, contact your account administrator.

```sqlexample
use role SAMOOHA_APP_ROLE;
use warehouse app_wh;
```

### Install the clean room

Once a clean room share has been installed, the list of clean rooms available can be viewed using the below command.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_cleanrooms();
```

Assign a name for the clean room that the provider has shared with you.

```sqlexample
set cleanroom_name = 'Machine Learning Demo Clean room';
```

The following command installs the clean room on the consumer account with the associated provider and selected clean room.

This procedure may take a little longer to run, typically about half a minute.

```sqlexample
call samooha_by_snowflake_local_db.consumer.install_cleanroom($cleanroom_name, '<PROVIDER_ACCOUNT_LOCATOR>');
```

Once the clean room has been installed, the provider has to finish setting up the clean room on their side before it is enabled for use. The below function allows you to check the status of the clean room. Once it has been enabled, you should be able to run the Run Analysis command below. It typically takes about 1 minute for the clean room to be enabled.

```sqlexample
call samooha_by_snowflake_local_db.consumer.is_enabled($cleanroom_name);
```

### Link the dataset

Now you can link some of your datasets into the clean room to carry out secure computation with the provider’s data

```sqlexample
call samooha_by_snowflake_local_db.consumer.link_datasets($cleanroom_name, ['samooha_consumer_sample_database.lookalike_modeling.customers']);
```

> **Note:**
>
> If this step doesn’t work even though your table exists, it is likely the SAMOOHA_APP_ROLE role has not yet been given access to it. If so, switch to the ACCOUNTADMIN role, call the below procedure on the database, and then switch back for the rest of the flow:
>
> ```sqlexample
> use role accountadmin;
> call samooha_by_snowflake_local_db.consumer.register_db('<DATABASE_NAME>');
> use role SAMOOHA_APP_ROLE;
> ```

To run the analysis, you will need to pass in the consumer table. If you want to view the datasets that you have added to the clean room, call the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_consumer_datasets($cleanroom_name);
```

### Run the analysis

Now that the clean room is installed, you can run the analysis template added to the clean room by the provider using the “run_analysis” command. You can see how each field is determined in the section below.

The “high value” users are identified with the filter_clause in the query below. If *c.SALES_DLR* represented the amount of sales per user, then a valid filter could look like *c.HIGH_VALUE > 4000*.

> **Note:**
>
> Before running the analysis, you can alter the warehouse size, or use a new, bigger, warehouse size if your tables are large.

```sqlexample
call samooha_by_snowflake_local_db.consumer.run_analysis(
    $cleanroom_name,                     -- cleanroom
    'prod_custom_lookalike_template',    -- template name

    ['samooha_consumer_sample_database.lookalike_modeling.customers'],                -- consumer tables

    ['samooha_provider_sample_database.lookalike_modeling.customers'],                -- provider tables

    object_construct(                    -- Rest of the custom arguments needed for the template
        'dimensions', ['p.STATUS', 'p.AGE', 'p.REGION_CODE', 'p.DAYS_ACTIVE', 'p.INCOME_BRACKET'], -- Features used in training

        'filter_clause', 'c.SALES_DLR > 2000' -- Consumer flag for which customers are considered high value
    )
);
```

### How to determine the inputs to run_analysis

To run the analysis, you need to pass in some parameters to the run_analysis function. This section will show you how to determine what parameters to pass in.

**Template names**

First, you can see the supported analysis templates by calling the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_added_templates($cleanroom_name);
```

Before running an analysis with a template, you need to know what arguments to specify and what types are expected. For custom templates, you can execute the following.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_template_definition($cleanroom_name, 'prod_custom_lookalike_template');
```

This can often also contain a large number of different SQL Jinja parameters. The following functionality parses the SQL Jinja template and extracts the arguments that need to be specified in run_analysis into a list.

```sqlexample
call samooha_by_snowflake_local_db.consumer.get_arguments_from_template($cleanroom_name, 'prod_custom_lookalike_template');
```

**Dataset names**

If you want to view the dataset names that have been added to the clean room by the provider, call the following procedure. You can’t view the data present in the datasets that have been added to the clean room by the provider due to the security properties of the clean room.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_provider_datasets($cleanroom_name);
```

You can also see the tables you’ve linked to the clean room by using the following call:

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_consumer_datasets($cleanroom_name);
```

**Dimension and measure columns**

While running the analysis, you might want to filter, group by and aggregate on certain columns. If you want to view the column policy that has been added to the clean room by the provider, call the following procedure.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_provider_column_policy($cleanroom_name);
```

**Common errors**

If you are getting **Not approved: unauthorized columns used** error as a result of run analysis, you may want to view the join policy and column policy set by the provider again.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_provider_join_policy($cleanroom_name);
call samooha_by_snowflake_local_db.consumer.view_provider_column_policy($cleanroom_name);
```

It is also possible that you have exhausted your privacy budget, which prevents you from executing more queries. Your remaining privacy budget can be viewed using the below command. It resets daily, or the clean room provider can reset it if they wish.

```sqlexample
call samooha_by_snowflake_local_db.consumer.view_remaining_privacy_budget($cleanroom_name);
```

You can check if Differential Privacy has been enabled for your clean room using the following API:

```sqlexample
call samooha_by_snowflake_local_db.consumer.is_dp_enabled($cleanroom_name);
```

---
title: Snowflake Data Clean Rooms: Provider API reference guide
source: https://docs.snowflake.com/en/user-guide/cleanrooms/provider.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Provider API reference guide

This page describes procedures used by clean rooms API consumers to manage their clean rooms. For coding setup instructions, see [Coding setup](v1/developer-introduction.md).

## Create, configure, and delete clean rooms

These procedures enable a provider to create, configure, and delete a clean room.

### view_cleanrooms

Schema:
:   PROVIDER

**Description:** Lists all existing clean rooms that were created by this provider account.

**Arguments:** *None*

**Returns:** *(Table)* A list of clean rooms created by this provider account. Clean rooms need not be shared to, installed,
or used by consumers. Deleted clean rooms are expunged from the database, and do not appear in this list.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_cleanrooms();
```

### describe_cleanroom

Schema:
:   PROVIDER

**Description:** Get a summary of information about a clean room, such as templates, join policies, column policies, and consumers.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to get information about.

**Returns:** *(String)* A summary of clean room metadata.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.describe_cleanroom($cleanroom_name);
```

### cleanroom_init

Schema:
:   PROVIDER

**Description:** Creates a clean room with the specified name in your account. This procedure can take a minute or more to run. The clean
room isn’t visible in the clean rooms UI or to collaborators until after you call `create_or_update_cleanroom_listing`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room name, 80 characters maximum. Valid characters: `[A-Z,a-z,0-9,_]` and spaces.
* `distribution` - (String, optional) One of the following values:

  + INTERNAL (*Default*) - Clean room is visible only to users in the same organization and does not trigger a security scan before
    changing the default version.
  + EXTERNAL - Clean room is production ready and can be shared outside the organization. The clean room triggers a security scan before
    changing the default version. If you want to change the distribution after a
    clean room is created, call `ALTER PACKAGE` as shown here:

    ```sqlexample
    ALTER APPLICATION PACKAGE samooha_cleanroom_<CLEANROOM_ID>
      SET DISTRIBUTION = EXTERNAL;
    ```

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
-- Create an internal clean room
CALL samooha_by_snowflake_local_db.provider.cleanroom_init($cleanroom_name, 'INTERNAL');
```

### set_default_release_directive

Schema:
:   PROVIDER

**Description:** Specifies the version and patch of a clean room loaded by collaborators when they start a new
browser session in the clean rooms UI, or access the clean room from the API. This must be called before the clean room can be shared with
consumers.

The clean room application creates a new version of a clean room whenever you upload or change Python code. If you want users to be served
the newest version, call this procedure with the new version number. To see the available versions and their status, or the current release directive, run the appropriate SQL command:

```sqlexample
-- See all versions, including failed versions.
SHOW VERSIONS IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_<cleanroom_name>;

-- See current release directive.
SHOW RELEASE DIRECTIVES IN APPLICATION PACKAGE SAMOOHA_CLEANROOM_<cleanroom_name>;
```

Where `<cleanroom_name>` [follows this format](v1/developer-introduction.md).

All clean rooms are created with the following version and patch numbers:

* **version**: V1_0
* **patch**: 0

> **Note:**
>
> If the clean room distribution is set to EXTERNAL, this procedure can be called only after the clean room security scan moves to an
> APPROVED state. To see the security status, call `view_cleanroom_scan_status`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room name.
* `version` - (String) Version. Must always be “V1_0”.
* `patch` - (String) Patch number loaded by the consumer. This starts at 0, and you should increment it whenever a new clean room
  version is available. You can see the available versions as described above.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name,
  'V1_0', '0'
);
```

### drop_cleanroom

Schema:
:   PROVIDER

**Description:** Delete the clean room. Collaborators who have the clean room installed can no longer access or use it. The
clean room no longer appears in the clean rooms UI the next time the browser is refreshed.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to delete.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.drop_cleanroom($cleanroom_name);
```

### enable_consumer_run_analysis

Schema:
:   PROVIDER

**Description:** Enables the consumer to run analyses in the clean room. This capability is enabled by default in all new clean rooms, so
this procedure need only be run if you have explicitly disabled consumer-run analysis for a clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room in which consumer-run analyses are allowed.
* `consumer_accounts` - (Array of string) Account locators of all consumers to enable this feature for. **NOTE:** These consumers must
  already have been added to the clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.enable_consumer_run_analysis(
  $cleanroom_name,
  ['<CONSUMER_ACCOUNT_LOCATOR_1>']
);
```

### disable_consumer_run_analysis

Schema:
:   PROVIDER

**Description:** Prevents the specified consumers from running analyses in the specified clean room. By default, all
consumers are allowed to run an analysis in a clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room where consumer-run analysis is being disabled.
* `consumer_accounts` - (Array of string) Account locators of consumers that cannot run an analysis in this clean room. **NOTE:**
  These consumers must already have been added to the clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.disable_consumer_run_analysis(
  $cleanroom_name,
  ['<CONSUMER_ACCOUNT_LOCATOR_1>']
);
```

### is_consumer_run_enabled

Schema:
:   LIBRARY

**Description:** Checks if this clean room allows consumer-run analyses.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to check.

**Returns:** *(String)* Whether or not this clean room allows consumer-run analyses.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_consumer_run_enabled($cleanroom_name)
```

### create_or_update_cleanroom_listing

Schema:
:   PROVIDER

**Description:** Publishes a new clean room or updates an existing clean room. You should call this method whenever you make changes to
a clean room to ensure that the changes are propagated to consumers.

When publishing a clean room for the first time, it can take up to 15 minutes for the clean room to become visible in the clean rooms UI.

If you make updates to a clean room without calling this method afterwards, there is no guarantee that the changes will
be propagated to consumers.

There is a limit to the number of clean rooms + collaborators that you can create in a single account. If you create too
many test clean rooms, you might need to delete a few in order to create new clean rooms. If you need more clean rooms
than your account can hold, contact Snowflake support.

> **Note:**
>
> You must set the release directive at least once before calling this procedure.
> For more information, see provider.set_default_release_directive.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to publish or update.

**Returns:** *(String)* Success message.

**Error handling:**

If you get an error saying that “Cross-Cloud Auto-Fulfillment is not enabled for this account”, it means that one of the
consumers is in another cloud hosting region. You must enable Cross-Cloud Auto-Fulfillment as described in
[Managing Cross-Cloud Auto-Fulfillment in Snowflake Data Clean Rooms](v1/enabling-laf.md).

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.create_or_update_cleanroom_listing(
  $cleanroom_name
);
```

## Register and unregister data

Use the following command to register and unregister databases, schemas, and objects. Tables and views must be registered before they can be
linked into the clean room. If you register a database or schema, all of the objects in that database or schema are registered.
[Learn more about registering data.](register-data.md)

### register_db

Schema:
:   PROVIDER

**Description:** Enables a database and all objects within it to be linked into individual clean rooms in this clean room environment.
This procedure grants USAGE and SELECT privileges on the database to SAMOOHA_APP_ROLE, which is used by the clean room environment to
access data.

You must have MANAGE GRANTS access on the database to call this procedure. Other providers in this clean room environment can then
link these objects into their own clean rooms without needing their own SELECT privilege.

> **Important:**
>
> This procedure does not register any objects created after it was called. If new objects were added to the database and you want to
> register those as well, you must call this procedure again.

**Arguments:**

* `db_name` - (String) Name of database to register.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.provider.register_db('SAMOOHA_SAMPLE_DATABASE');
```

### register_schema

Schema:
:   LIBRARY

**Description:** Similar to `register_db`, but operates at a schema level. You must have MANAGE GRANTS privilege on the schema to call
this procedure.

This procedure grants USAGE and SELECT privileges on the schema to SAMOOHA_APP_ROLE, which is used by the clean room environment to access
data.

If you want to register a managed access schema (that is, a schema created using the WITH MANAGED ACCESS parameter), use
`library.register_managed_access_schema` instead.

> **Important:**
>
> This procedure does not register any objects created after it was called. If new objects were added to the database and you want to
> register those as well, you must call this procedure again.

**Arguments:**

* `schema_name` - (Array of string) An array of one or more fully qualified schema names to register.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.register_schema(['SAMOOHA_SAMPLE_DATABASE.DEMO']);
```

### register_managed_access_schema

Schema:
:   LIBRARY

**Description:** Similar to `register_schema`, but registers a schema that was created using the WITH MANAGED ACCESS parameter. You must
have MANAGE GRANTS privileges on the schema to call this procedure.

This procedure grants use privileges on the managed schema to SAMOOHA_APP_ROLE, which is used by the clean room environment to access data.

> **Important:**
>
> This procedure does not register any objects created after it was called. If new objects were added to the database and you want to
> register those as well, you must call this procedure again.

**Arguments:**

* `schema_name` - (Array of string) An array of one or more fully qualified schema names.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.register_managed_access_schema(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO']
);
```

### register_objects

Schema:
:   LIBRARY

**Description:** Grants the clean room access to tables and views of all types, making them available to be linked into the clean room by
calling `provider.link_datasets`. You can register broader groups of objects by calling `library.register_schema`,
`library.register_managed_access_schema`, or `provider.register_db`.

This procedure grants use privileges on the object to SAMOOHA_APP_ROLE, which is used by the clean room environment to access data.

You must have MANAGE GRANTS privilege on the object to call this procedure. This procedure cannot be used to register a database.

If you register a view that is based on an object in another database, you must also grant the native application permission to access
the source object.

**Arguments:**

* `object_names` - (array) Array of fully qualified object names. These objects can then be linked into the clean room.

**Returns:** *(String)* Success message.

**Examples**

To register a table and a view:

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.register_objects(
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'SAMOOHA_SAMPLE_DATABASE.INFORMATION_SCHEMA.FIELDS'
  ]
 );
```

### enable_external_tables_on_account

Schema:
:   LIBRARY

**Description:** Enable Iceberg or external tables to be used in all clean rooms in this account. Must be called by an ACCOUNTADMIN in
both the provider and consumer accounts to allow Iceberg or external tables to be linked by either account. To
limit this ability to specific clean rooms in this account, call `enable_external_tables_for_cleanroom` instead.

If successful and all security scans pass, this generates a [new patch version](dcr-versions.md) of the clean
room.

**Arguments:** *None*

**Returns:** *(String)* Success message. If successful, it triggers a security scan and also provide the number of the patch
that will be generated if the security scan succeeds.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.enable_external_tables_on_account();
```

### enable_external_tables_for_cleanroom

Schema:
:   PROVIDER

**Description:** Enable Iceberg or external tables to be linked into the specified clean room in this account by the provider. To allow
Iceberg and external tables for all clean rooms in this account, call `enable_external_tables_on_account` instead.

If successful, this will generate a [new patch version](dcr-versions.md) of the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room into which the provider can link Iceberg or external tables.

**Returns:** *(String)* Success message. If successful, it triggers a security scan and also provide the number of the patch
that will be generated if the security scan succeeds.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.enable_external_tables_for_cleanroom(
  $cleanroom_name);
```

### unregister_db

Schema:
:   LIBRARY

**Description:** Reverses the `register_db` procedure and removes the database-level grants given to the SAMOOHA_APP_ROLE role and Snowflake
Data Clean Room native application. This also removes any database from the selector in the clean rooms UI.

**Arguments:**

* `db_name` - (String) Name of the database to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.unregister_db('SAMOOHA_SAMPLE_DATABASE');
```

### unregister_schema

Schema:
:   LIBRARY

**Description:** Unregisters a schema, which prevents users from linking its tables and views into the clean room.

If you want to unregister a managed access schema (that is, a schema created using the WITH MANAGED ACCESS parameter), use
`library.unregister_managed_access_schema` instead.

**Arguments:**

* `schema_name` - (array) Schemas to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.unregister_schema(
  ['SAMOOHA_SAMPLE_DATABASE.DEMO']
);
```

### unregister_managed_access_schema

Schema:
:   LIBRARY

**Description:** Similar to `unregister_schema`, but unregisters a schema that was created using the WITH MANAGED ACCESS parameter.

**Arguments:**

* `schema_name` - (array) Managed schemas to unregister.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.unregister_managed_access_schema(['SAMOOHA_SAMPLE_DATABASE.DEMO']);
```

### unregister_objects

Schema:
:   LIBRARY

**Description:** Revokes clean room access to tables and views of all types. Objects will no longer be available to any users in any clean
rooms managed by this account.

**Arguments:**

* `object_names` - (array) Array of fully-qualified object names for which access should be revoked.

**Returns:** *(String)* Success message.

**Examples**

To unregister a table and a view:

```sqlexample
USE ROLE <role_with_manage_grants>;
CALL samooha_by_snowflake_local_db.library.unregister_objects(
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'SAMOOHA_SAMPLE_DATABASE.INFORMATION_SCHEMA.FIELDS'
  ]
);
```

## Link data and tables

Use the following commands to add or remove tables and views in a clean room.

### link_datasets

Schema:
:   PROVIDER

**Description:** Links a Snowflake table or view into the clean room. The procedure automatically makes the table accessible to the clean
room by creating a secure view of the table within the clean room, without any requirement to copy your table. The table is
linked to its source, so updates in the source appear in the secure version within the clean room.

If the dataset includes a Snowflake policy that is stored in a different database, you (or a clean rooms administrator)
must [grant your clean room access to that policy database](register-data.md) to enable linking the data
into a clean room.

Any items linked here must be registered first, at the database, schema, or object level.

> **Note:**
>
> If a table linked into a clean room is deleted, renamed, moved, or has restrictive permissions added, the table can’t be used in the clean
> room until you restore the table using the same location, name, and permissions.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room with access to the objects.
* `tables_list` - (Array of string) List of tables or views to link into the clean room. Objects must be registered before they can be
  linked in.
* `consumer_list` - (Array of string, optional) If present, allows only consumers listed here to access these objects. If absent, allows
  anyone with access to the clean room to access this data.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.link_datasets(
  $cleanroom_name,
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'MYDB.MYSCH.EXPOSURES'
  ]
);
```

> **Note:**
>
> If you link a view into the clean room, and the view is based on a table in another database, you must register both the
> view and the source of the view

### unlink_datasets

Schema:
:   PROVIDER

**Description:** Removes access to the specified tables in the specified clean room for all users. Specified tables must have been linked by
the provider.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of clean room linked to these data sets.
* `tables_list` - (array) Array of table or view names to unlink from the clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.unlink_datasets(
  $cleanroom_name,
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS',
    'MYDB.MYSCH.EXPOSURES'
  ]
);
```

### view_provider_datasets

Schema:
:   PROVIDER

**Description:** View all tables and views linked into the specified clean room by any provider in this account.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** Table of objects linked into the specified clean room, along with the clean room’s internal view name for each object.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_provider_datasets($cleanroom_name);
```

### restrict_table_options_to_consumers

Schema:
:   PROVIDER

**Description:** Controls whether a particular consumer can access a table in the clean room. This procedure is **replace only**, meaning
that it overwrites completely any values set in a previous call.

Consumers granted access through `provider.link_datasets`, `provider.restrict_table_options_to_consumers`, or any other method will lose
access to a table if it isn’t specified when calling this method.

> **Note:**
>
> Restrictions that you create by calling this procedure might not behave as expected in the clean rooms UI. You should not call this
> procedure on a clean room that can be used in the clean rooms UI.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to restrict.
* `access_details` - (Object) A JSON object, where each field name is the fully qualified name of a table or view, and the field value is
  an array of account locators of users who can access that table or view.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.restrict_table_options_to_consumers(
  $cleanroom_name,
  {
    'DB.SCHEMA.TABLE1': ['CONSUMER_1_LOCATOR'],
    'DB.SCHEMA.TABLE2': ['CONSUMER_1_LOCATOR', 'CONSUMER_2_LOCATOR']
  }
);
```

## Manage policies

Join policies in data clean rooms are not the same as [Snowflake-wide join policies](../join-policies.md). Join policies for clean
rooms are set only by using this procedure; join policies set on tables outside of clean rooms are ignored by clean rooms.

[Learn more about table policies in clean rooms.](v1/policies.md)

### set_join_policy

Schema:
:   PROVIDER

**Description:** Specifies which columns the consumer can join on when running templates within this clean room.

Calling this function completely replaces the old policy with the new one.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

> **Important:**
>
> Join policies are enforced **only** when the template applies the `join_policy` or `join_and_column_policy` JinjaSQL filters to join rows.

> **Note:**
>
> Join policies in data clean rooms are not the same as Snowflake-wide join policies. Join policies for clean rooms are set only by using this procedure; join policies set on tables outside of clean rooms are ignored by clean rooms.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the join policy should be enforced.
* `table_and_col_names` - (Array of string) Fully qualified column name in the format
  `database_name.schema_name.table_or_view_name:column_name`. **Note the correct use of . versus : marks**

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_join_policy(
  $cleanroom_name,
  [
    'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL',
    'MYDB.MYSCH.EXPOSURES:HASHED_EMAIL'
  ]
);
```

### view_join_policy

Schema:
:   PROVIDER

**Description:** Shows the provider join policy in the specified clean room.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to query.

**Returns:** *(Table)* List of joinable rows on all tables or views in the clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_join_policy($cleanroom_name);
```

### set_column_policy

Schema:
:   PROVIDER

**Description:** Specifies which columns of your data can be projected in templates run by other collaborators.

Calling this function completely replaces the old policy with the new one.

Don’t set a column policy on identity columns or columns that contain sensitive data, such as email addresses. You generally don’t want this sort of data to be projected.

Queries with wildcards might not be caught by using these checks, so use discretion when you design the analysis template.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

> * [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.
> * `analysis_and_table_and_cols` - (Array of string) Array of columns that can be used by templates. The format for each column is:
>   `template_name:full_table_name:column_name`

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_column_policy(
  $cleanroom_name,
  ['prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE']);

 -- Same example, but using a variable name for the template.
CALL samooha_by_snowflake_local_db.provider.set_column_policy(
  $cleanroom_name,
  [$template_name || ':SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   $template_name || ':SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   $template_name || ':SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE']);
```

### view_column_policy

Schema:
:   PROVIDER

**Description:** Shows the provider’s column policy in the designated clean room.

Learn more about clean room policies: [Understanding Snowflake Data Clean Room policies](v1/policies.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)*

**Returns:** *(Table)* Which columns can be used in which templates.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_column_policy($cleanroom_name);
```

## Manage provider templates

Use the following commands to add the templates/analyses that are supported in this clean room.

### view_added_templates

Schema:
:   PROVIDER

**Description:** Views the provider-added templates in the clean room. There is no method to list all templates in all clean rooms for
this provider.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room to query.

**Returns:** *(Table)* - List of templates available in the specified clean room, with details about each template.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_added_templates($cleanroom_name);
```

### view_template_definition

Schema:
:   PROVIDER

**Description:** Shows information about a specific template. Consumers looking at a provider template should use
`consumer.view_template_definition`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room with this template.
* `template_name` - (String) Name of the template to request information about.

**Returns:** *(String)* The template definition.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_template_definition(
  $cleanroom_name,
  $template_name);
```

### add_templates

Schema:
:   PROVIDER

**Description:** Adds a list of templates to the clean room. This does not replace the existing template list.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to add templates to.
* `template_names` - (Array of string) Name of the templates to add. These are Snowflake-provided templates only. To add a custom template,
  call `add_custom_sql_template`.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_templates(
  $cleanroom_name,
  ['my_custom_template']);
```

### clear_template

Schema:
:   PROVIDER

**Description:** Removes a specified template from the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.
* `template_name` - (String) Name of the template to remove from that clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.clear_template(
  $cleanroom_name,
  'prod_custom_template');
```

### clear_all_templates

Schema:
:   PROVIDER

**Description:** Removes all the templates that have been added to the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room from which to remove all templates.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.clear_all_templates($cleanroom_name);
```

### add_custom_sql_template

Schema:
:   PROVIDER

**Description:** Adds a custom JinjaSQL template into the clean room. This makes the template callable by the consumer.
[Learn how to create custom templates.](custom-templates.md)

You can call this API more than once to add multiple custom templates to the clean room. The procedure overwrites any previous template with
the same name in this clean room.

If the template is used by the consumer to [activate results back to the provider](v1/activation.md),
the command must meet the following requirements:

* The name of the custom template must begin with the string `activation_`. For example: `activation_custom_template`.
* The template must create a table that begins with `cleanroom.activation_data_`.
  For example: `CREATE TABLE cleanroom.activation_data_analysis_results AS ...`.
* The template must return the unique part of the table name that was created in the definition, which is the string
  appended to `cleanroom.activation_data_`. For example, for the template named `activation_data_analysis_results`, you would return
  `data_analysis_results`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to which this template is applied.
* `template_name` - (String) Name of the template. Must be all lowercase letters, numbers, spaces, or underscores. Activation templates must
  have a name beginning with “activation”.
* `template` - (String) The JinjaSQL template.
* `sensitivity` - (Float, optional) If differential privacy is enabled for this clean room, it controls the amount of
  differential privacy noise applied to the data returned by this template. Must be a number greater than 0. Default is 1.0. The
  differential privacy task must be running in this clean room for this argument to have any effect.
* `consumer_locators` - (Array of string, optional) An array of one or more account locators. If present, this template will be added to the
  clean room only for these accounts. You can later modify this list by calling `provider.restrict_template_options_to_consumers`. If you
  don’t specify a list of consumers, all consumers can use the custom template in the specified clean room.
* `is_obfuscated` - (Boolean, optional) If TRUE, prevents consumers from being able to view the template body. Note that you must be using
  Snowflake Enterprise Edition or higher to run an obfuscated template. If this template is used for a provider-run analysis, the consumer
  must re-approve the analysis request any time you change the `is_obfuscated` state. `is_obfuscated` cannot be used together with
  `sensitivity`.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample-jinja
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    $template_name,
    $$
SELECT
    IDENTIFIER({{ dimensions[0] | column_policy }})
FROM
    IDENTIFIER({{ my_table[0] }}) c
    INNER JOIN
    IDENTIFIER({{ source_table[0] }}) p
    ON
        IDENTIFIER({{ c.consumer_id  }}) = IDENTIFIER({{ provider_id | join_policy }})
    {% if where_clause %}
      WHERE {{ where_clause | sqlsafe | join_and_column_policy }}
    {% endif %};
    $$);
```

### add_ui_form_customizations

Schema:
:   PROVIDER

**Description:** Defines a customization form for a template in a clean room when the clean room is run in the clean rooms UI. This is
useful when you let consumers choose template parameters, such as tables or columns. At a minimum, you must specify values for
`display_name`, `description`, and `methodology` in the `template_information` argument.

It is recommended to put table selection elements before column selection elements, especially when the column choosers populate based on
the table selection.

[Learn how to design user input forms for custom templates.](demo-flows/custom-templates.md)

**You must update the clean room after calling this function.** If you do not call `provider.create_or_update_cleanroom_listing` after
updating the UI, collaborators will not see any updates.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room that contains this template. The submitted form applies only to
  the specified template in the specified clean room.
* `template_name` - (String) Name of the template to which this UI applies. This is not the user-visible title, which is specified using the
  `template_information.display_name` field.
* `template_information` - (Dict) Meta information about the template to show in the clean rooms UI. The following properties must or can be
  defined:

  + `display_name` (**Required**): Display name of the template in the clean rooms UI.
  + `description` (**Required**): Description of the template.
  + `methodology` ( **Required**): Description of any arguments, and what the result is.
  + `warehouse_hints` *(Object)*: Recommends what type of warehouse to use to run the analysis. This is an object with the
    following fields:

    - `warehouse_size`: See *warehouse_size* in [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) for valid values.
    - `snowpark_optimized` *(Boolean)*: Whether to use a [Snowpark-optimized warehouse](../warehouses-snowpark-optimized.md) to
      process the query. For most machine learning use cases, Snowflake recommends TRUE.
  + `render_table_dropdowns` *(Object)*: Whether to show the default drop-down lists that let the user select which provider or consumer
    tables to use in the query. This is an object with the following fields:

    - `render_consumer_table_dropdown` - *(Boolean)* If TRUE, show the default consumer table selector. If FALSE, hide the
      consumer tables selector. The template can access the chosen values as a list using the `my_table` template variable. If any element
      sets `references=CONSUMER_TABLES`, this defaults to FALSE, otherwise it defaults to TRUE.
    - `render_provider_table_dropdown` - *(Boolean)* If TRUE, show the default provider table selector. If FALSE, hide the
      provider tables selector. The template can access the chosen values as a list using the `source_table` template variable. If any
      element sets `references=PROVIDER_TABLES`, this defaults to FALSE, otherwise it defaults to TRUE.
  + `activation_template_name` - *(String)* Name of an activation template in this clean room. Use the template name without any
    `cleanroom` prefix. [Learn about activation templates.](custom-templates.md)
  + `enabled_activations` - *(String)* Which kind of activations are enabled. Possible values: `consumer`, `provider`. No default; must be
    provided if `activation_template_name` is specified.
* `details` - (Dict, optional) Defines user-configurable input fields that pass values to the template. This is a dictionary of key - object
  pairs, each pair representing one form element. The key is a variable name available to the linked JinjaSQL template.
  The value is an object that defines the form element. If a template variable doesn’t have an equivalent form element defined here, clean
  rooms autogenerates a default form element. Each object can define the following fields:

  ```sqlsyntax
  <field_name>: {
    ['display_name': <string>,]
    ['order': <number>,]
    ['description': <string>,]
    ['type': <enum>,]
    ['default': <value>,]
    ['choices': <string array>,]
    ['infoMessage': <string>,]
    ['size': <enum>,]
    ['required': <bool>,]
    ['group': <string>,]
    ['references': <array of string>,]
    ['provider_parent_table_field':  <string>,]
    ['consumer_parent_table_field': <string>]
  }
  ```

  + `display_name`: Label text for this item in the UI form.
  + `order`: 1-based order in which this element should be shown in the form. If not specified, the elements will be rendered in the
    order in which they appear in the object.
  + `description`: A description of the element purpose, shown below the label. Provide short help or examples here. If not provided,
    none is shown.
  + `type`: The type of UI element. If *references* is specified for this input field, then omit this entry (the type is determined for
    you). Supported values:

    - `any` *(Default)*: Regular text entry field.
    - `boolean`: True/False selector
    - `integer`: Use arrows to change the number
    - `multiselect`: Select multiple items from a dropdown list
    - `dropdown`: Select one item from a dropdown list
    - `date`: Date selector
  + `default`: Default value of this element
  + `choices`: *(Array of string)* List of choices for dropdown and multiselect elements
  + `infoMessage`: Informational hovertext shown next to the element. If not provided, no tooltip is provided.
  + `size`: Element size. Supported values: `XS`, `S`, `M`, `L`, `XL`
  + `required`: Whether a value is required by the user. Specify TRUE or FALSE.
  + `group`: A group name, used to group items in the UI. Use the same group name for items that should be grouped together in the UI.
    If you hide the default dropdown lists, you can use the `{{ source_table }}` and `{{ my_table}}` special arguments in the custom
    template, then define your own dropdown list that contains the desired tables. For more information about using these special
    variables when defining the custom template, see provider.add_custom_sql_template.
  + `references`: Populates a drop-down list with tables or columns of the specified type in the clean room. If used, `type` must be
    either `multiselect` or `dropdown`. The following string values are supported:

    - `PROVIDER_TABLES`: List all the provider’s tables in the clean room. **If specified,**
      `render_table_dropdowns.render_provider_table_dropdown` must be FALSE.
    - `PROVIDER_JOIN_POLICY`: List all columns in the provider’s join policy for the table currently selected in the
      `provider_parent_table_field` element.
    - `PROVIDER_COLUMN_POLICY`: List all columns in the provider’s column policy for the current template and the table selected in
      the `provider_parent_table_field` element.
    - `PROVIDER_ACTIVATION_POLICY`: List all columns in the provider’s activation policy.
    - `CONSUMER_TABLES`: List all the consumer tables in the clean room. **If specified,**
      `render_table_dropdowns.render_consumer_table_dropdown` must be FALSE.
    - `CONSUMER_COLUMNS`: List all columns in the consumer table specified by `consumer_parent_table_field`. You shouldn’t use consumer
      column references in provider-run templates, as the consumer might
      apply join and column policies to these columns; use `CONSUMER_JOIN_POLICY` or `CONSUMER_COLUMN_POLICY` for provider-run
      templates instead.
    - `CONSUMER_JOIN_POLICY`: List all columns in the consumer’s join policy from the table selected in the
      `consumer_parent_table_field` element.
    - `CONSUMER_COLUMN_POLICY`: List all columns in the consumer’s column policy for the current template and the table selected in the
      `consumer_parent_table_field` field.
  + `provider_parent_table_field`: The name of the UI element where the user selects a provider table; don’t provide the
    table name itself here. Use only when `references` is set to `PROVIDER_COLUMN_POLICY` or `PROVIDER_JOIN_POLICY`. To reference
    the default provider table chooser, specify `source_table` here and set `render_table_dropdowns.render_provider_table_dropdown` to TRUE.
  + `consumer_parent_table_field`: The name of the UI element where the user selects a consumer table; don’t provide the
    table name itself here. Use only when `references` is set to `CONSUMER_COLUMNS`, `CONSUMER_JOIN_POLICY`, or
    `CONSUMER_COLUMN_POLICY`. To reference the default consumer table chooser, specify `my_table` here and set
    `render_table_dropdowns.render_provider_table_dropdown` to TRUE.
* `output_config` - (Dict) Defines how to display template results graphically in the clean rooms UI. If not provided, the results are not
  displayed in a graph, only in a table. If you do not want a graph, provide an empty object `{}` for this argument. Allowed fields:

  > + `measure_columns`: Names of columns containing measures and dimensions to use in the graph generated by the clean rooms UI.
  > + `default_output_type`: The default format to display the results. The user will typically be able to change the display format
  >   in the UI if the data is in the proper format. Supported types:
  >
  >   - `TABLE`: *(Default)* Tabular format
  >   - `BAR`: Bar chart, which is good for comparing different categories
  >   - `LINE`: Line chart, which is good for showing trends over time or continuous data
  >   - `PIE`: Pie chart, which is suitable for showing proportions or percentages

The following table shows a matrix of values that are allowed in the `details` object for values that can conflict:

| `type` | `references` | `provider_parent_table_field` | `consumer_parent_table_field` | `render_provider_table_dropdown` | `render_consumer_table_dropdown` |
| --- | --- | --- | --- | --- | --- |
| `multiselect` or `dropdown` | `PROVIDER_TABLES` | *Not allowed* | *Not allowed* | FALSE | TRUE or FALSE |
|  | `PROVIDER_JOIN_POLICY` | `source_table` | *Not allowed* | TRUE | TRUE or FALSE |
|  | `PROVIDER_JOIN_POLICY` | `parent field name` | *Not allowed* | TRUE or FALSE | TRUE or FALSE |
|  | `PROVIDER_COLUMN_POLICY` | `source_table` | *Not allowed* | TRUE | TRUE or FALSE |
|  | `PROVIDER_COLUMN_POLICY` | `parent field name` | *Not allowed* | TRUE or FALSE | TRUE or FALSE |
|  | `CONSUMER_TABLES` | *Not allowed* | *Not allowed* | TRUE or FALSE | FALSE |
|  | `CONSUMER_COLUMNS` | *Not allowed* | `my_table` or `parent field name` | TRUE or FALSE | TRUE |
|  | `CONSUMER_JOIN_POLICY` | *Not allowed* | `my_table` or `parent field name` | TRUE or FALSE | TRUE |
|  | `CONSUMER_COLUMN_POLICY` | *Not allowed* | `my_table` or `parent field name` | TRUE or FALSE | TRUE |
|  | `PROVIDER_ACTIVATION_POLICY` | *Not allowed* | *Not allowed* | TRUE or FALSE | TRUE or FALSE |

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
-- Specify the display name, description, and warehouse, and hide the default table dropdown lists.
-- Define the following two fields in the UI:
--   A provider table selector that shows all provider tables. Chosen tables can be accessed by the template with the variable 'a_provider_table'
--     (This dropdown list is equivalent to setting ``render_table_dropdowns.render_provider_table_dropdown: True``)
--   A column selector for the tables chosen in 'a_provider_table'. Chosen columns can be accessed by the template with the variable 'a_provider_col'

  CALL samooha_by_snowflake_local_db.provider.add_ui_form_customizations(
      $cleanroom_name,
      'prod_custom_template',
      {
          'display_name': 'Custom Analysis Template',
          'description': 'Use custom template to run a customized analysis.',
          'methodology': 'This custom template dynamically renders a form for you to fill out, which are then used to generate a customized analysis fitting your request.',
          'warehouse_hints': {
              'warehouse_size': 'xsmall',
              'snowpark_optimized': FALSE
          },
          'render_table_dropdowns': {
              'render_consumer_table_dropdown': false,
              'render_provider_table_dropdown': false
          },
          'activation_template_name': 'activation_my_template',
          'enabled_activations': ['consumer', 'provider']
      },
      {
          'a_provider_table': {
              'display_name': 'Provider table',
              'order': 3,
              'description': 'Provider table selection',
              'size': 'S',
              'group': 'Seed Audience Selection',
              'references': ['PROVIDER_TABLES'],
              'type': 'dropdown'
          },
          'a_provider_col': {
              'display_name': 'Provider column',
              'order': 4,
              'description': 'Which col do you want to count on',
              'size': 'S',
              'group': 'Seed Audience Selection',
              'references': ['PROVIDER_COLUMN_POLICY'],
              'provider_parent_table_field': 'a_provider_table',
              'type': 'dropdown'
          }
      },
      {
          'measure_columns': ['col1', 'col2'],
          'default_output_type': 'PIE'
      }
  );
```

### restrict_template_options_to_consumers

Schema:
:   PROVIDER

**Description:** Controls which users can access a given template in a given clean room. This procedure overrides any access list specified
previously by any other procedure for a clean room/template pair.

> **Note:**
>
> Restrictions that you create by calling this procedure might not behave as expected in the clean rooms UI. You should not call this
> procedure on a clean room that can be used in the clean rooms UI.

**Arguments:**

> * [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room.
> * `access_details` - (JSON object) The name of a template and the users who can access that template in that clean room. If a template is
>   specified, only users listed here can access that template in that clean room. This is an object with one child object per template in
>   the following format: `'{template_name': ['user1_locator','user2_locator','userN_locator']}`

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.restrict_template_options_to_consumers(
  $cleanroom_name,
  {
      'prod_template_1': ['CONSUMER_1_LOCATOR', 'CONSUMER_2_LOCATOR']
  }
);
```

## Consumer-defined templates

The following APIs allow you to approve or reject a request from a consumer to add a template to the clean room. A consumer-defined template
is added to a clean room only if the provider approves the consumer’s request to add it. For more information, see
[Consumer-written custom templates](demo-flows/custom-templates.md).

### list_pending_template_requests

Schema:
:   PROVIDER

**Description:** Lists all unapproved requests from consumers who want to add a consumer-defined template to a clean room. This includes
pending, approved, and rejected requests. Use this procedure to check for pending requests and approve them
(`provider.approve_template_request`) or reject them (`provider.reject_template_request`).

This will fail until all consumers that the clean room is shared with have installed the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - View consumer requests to add a template to this clean room.

**Returns:** A table with the following values, among others:

* `request_id` - (String) ID of the request, needed to accept or reject the request.
* `consumer_locator` - (String) Account locator of the person making the request.
* `template_name` - (String) Name of the consumer-provided template.
* `template_definition` - (String) Full definition of the consumer-proposed template.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.list_pending_template_requests($cleanroom_name);
```

### list_template_requests

Schema:
:   PROVIDER

**Description:** Lists all requests from consumers who want to add a consumer-defined template to a clean room. This includes pending,
approved, and rejected requests. Use this to check for pending requests and approve them (`provider.approve_template_request`) or reject
them (`provider.reject_template_request`) .

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - View consumer requests to add a template to this clean room.

**Returns:** A table with the following values, among others:

* `request_id` - (String) ID of the request, needed to accept or reject the request.
* `consumer_identifier` - (String) Account locator of the person making the request.
* `template_name` - (String) Name of the consumer-provided template.
* `template_definition` - (String) Full definition of the consumer-proposed template.
* `status` - (String) Status of the request: PENDING, APPROVED, REJECTED.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.list_template_requests($cleanroom_name);
```

### approve_template_request

Schema:
:   PROVIDER

**Description:** Approves a request to add a template to the clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room the user wants to add the template to.
* `request_id` - (String) ID of the request to approve. Call `provider.list_template_requests` to see request IDs.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.approve_template_request(
  $cleanroom_name,
  '815324e5-54f2-4039-b5fb-bb0613846a5b'
);
```

### approve_multiple_template_requests

Schema:
:   PROVIDER

**Description:** Approves multiple consumer requests to add a template to a clean room. All requests must be for a single clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room to which this request applies.
* `request_ids` - (Array of strings) The IDs of all template requests to approve. To obtain a request ID, call
  `provider.list_template_requests`.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.approve_multiple_template_requests(
  $cleanroom_name,
  [
    'cfd538e2-3a17-48e3-9773-14275e7d2cc9',
    '2982fb0a-02b7-496b-b1c1-56e6578f5eac'
  ]
);
```

### reject_template_request

Schema:
:   PROVIDER

**Description:** Rejects a request to add a template to a clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room the user wants to add the template to.
* `request_id` - (String) ID of the request to reject. Call `provider.list_template_requests` to see request IDs.
* `reason_for_rejection` - (String) Reason for rejecting the request.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.reject_template_request(
  $cleanroom_name,
  'cfd538e2-3a17-48e3-9773-14275e7d2cc9',
  'Failed security assessment');
```

### reject_multiple_template_requests

Schema:
:   PROVIDER

**Description:** Rejects multiple consumer requests to add a template to a clean room. All requests must be for the same clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to which this request applies.
* `rejected_templates` - (array of objects) An array of objects with the following fields, one per rejection:

  + `request_id` - (string) ID of the request to reject. To obtain a request ID, call `provider.list_template_requests`.
  + `reason_for_rejection` - (string) A free-text description of why the request is being rejected.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.reject_multiple_template_requests($cleanroom_name,
  [
    OBJECT_CONSTRUCT('request_id', '815324e5-54f2-4039-b5fb-bb0613846a5b', 'reason_for_rejection', 'Failed security assessment'),
    OBJECT_CONSTRUCT('request_id', '2982fb0a-02b7-496b-b1c1-56e6578f5eac', 'reason_for_rejection', 'Some other reason')
  ]
);
```

## Template chains

Use the following commands to create and manage [template chains](developer-template-chains.md).

### add_template_chain

Schema:
:   PROVIDER

**Description:** Creates a new template chain. Templates must exist before being added to the template chain. After a template chain is
created, it cannot be modified, but you can create a new template chain with the same name to overwrite the old one.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the template chain should be added.
* `template_chain_name` - (String) Name of the template chain.
* `templates` - (Array of objects) - Array of objects, one per template. The object can contain the following fields:

  + `template_name` (String) - Specifies the template being added to the template chain. The template must already be added to the clean
    room by calling `provider.add_template_chain`.
  + `cache_results` (Boolean) - Determines whether the results of the template are temporarily saved so other templates in the template
    chain can access them. To cache results, specify TRUE.
  + `output_table_name` (String) - When `cache_results` = TRUE, specifies the name of the Snowflake table where template results are stored.
  + `jinja_output_table_param` (String) - When `cache_results` = TRUE, specifies the name of the Jinja parameter that other templates must
    include to accept the results that are stored in `output_table_name`.
  + `cache_expiration_hours` (integer) - When `cache_results` = TRUE, specifies the number of hours before the results in the cache are
    dropped. When the cache expires, the next time the template chain is executed, the cache is refreshed with the results of the template.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_template_chain(
  $cleanroom_name,
  'my_chain',
  [
    {
      'template_name': 'crosswalk',
      'cache_results': True,
      'output_table_name': 'crosswalk',
      'jinja_output_table_param': 'crosswalk_table_name',
      'cache_expiration_hours': 2190
    },
    {
      'template_name': 'transaction_insights',
      'cache_results': False
    }
  ]
);
```

### view_added_template_chains

Schema:
:   PROVIDER

**Description:** Lists the template chains in the specified clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** *(Table)* Description of all template chains added to this clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_added_template_chains($cleanroom_name);
```

### view_template_chain_definition

Schema:
:   PROVIDER

**Description:** Returns the definition of a template chain.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room associated with this template chain.
* `template_chain_name` - (String) Name of the template chain associated with this clean room.

**Returns:** *(Table)* Description of the specified template chain.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_template_chain_definition(
  $cleanroom_name,
  'my_chain');
```

### clear_template_chain

Schema:
:   PROVIDER

**Description:** Deletes a specified template chain from a specified clean room. The chain is not stored anywhere, so if you want to
recreate the chain, you must recreate it from scratch.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The clean room that is assigned this template chain.
* `template_chain_name` - (String) The template chain to remove from this clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.clear_template_chain($cleanroom_name, 'my_chain');
```

## Activation

*Activation* means exporting results to a provider, a consumer, or a third party.
[Read more about activation](v1/activation.md).

### set_activation_policy

Schema:
:   PROVIDER

**Description:** Defines which provider columns can be used within an activation template. Only columns listed in an activation policy can
be activated from the provider’s data set. Not setting an activation policy prevents any provider data from being activated.

Calling this procedure wipes out any previous activation policy set by the provider.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where activation should be allowed.
* `columns` - (Array of string) Only columns listed here can be used in an activation template in this clean room. Column name format is
  `template_name:fully_qualified_table_name:column_name`. **Note the proper usage of dot . and colon : markers.**

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_activation_policy('my_cleanroom', [
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL',
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:REGION_CODE' ]);
```

### view_activation_policy

Schema:
:   PROVIDER

**Description:** Shows the provider’s activation policy in the specified clean room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room to report on.

**Returns:** *(Table)* The provider’s activation policy in the specified clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_activation_policy($cleanroom_name);
```

### request_provider_activation_consent

Schema:
:   PROVIDER

**Description:** Sends a request to the consumer to allow the provider to run a specified template and push the results to the provider’s
Snowflake account. In the background, it adds a template to the list of provider-activation templates in the clean room. Once a template is
designated as an activation template, it can be used only in activation requests.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Clean room that contains the activation template.
* `template_name` - (String) Name of the activation template to request approval for. This template must have been added to the clean room
  in a previous call. The template name must start with “activation”.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.request_provider_activation_consent(
    $cleanroom_name, 'activation_my_activation_template');
```

### update_activation_warehouse

Schema:
:   PROVIDER

**Description:** Specify what size warehouse should be used when decrypting results to the output table in a provider activation. The
warehouse used for decryption is DCR_ACTIVATION_WAREHOUSE. The provider pays for this warehouse.

**Arguments:**

* `size` - (String) Warehouse size. Choose one of the WAREHOUSE_SIZE values from the [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) command.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.update_activation_warehouse('LARGE');
```

### setup_provider_activation_share_mount_task

Schema:
:   PROVIDER

**Description:** Enables provider activation when the provider does not have the clean room UI installed on their account.

Call this after adding consumers with `provider.add_consumers`. It is called only when you are implementing provider activation
and you (the provider) do not have the clean room UI installed. (Whether or not the consumer has the UI installed does not matter.)

This starts a thread to asynchronously mount consumer shares needed for provider activation. Rather than mount the shares synchronously and
block your code, this code mounts the share asynchronously and rechecks for new collaborators periodically. You need to call this only
once, and can add additional collaborators later without needing to call this procedure again.

**Arguments:**

* `frequency_minutes` - (Integer) How often to recheck for new consumers in this clean room, in order to mount shares for them as well. A
  recommended value is 15.

**Returns:** *(String)* A success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.setup_provider_activation_share_mount_task(15);
```

### dcr_health.provider_run_provider_activation_history

**Description:** Returns a history of provider activation requests for the specified clean room. Provider activation requests initiated by
both the provider and consumer are shown. This procedure provides extra information to help debug problems with provider activation.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room in which the activation was requested. You
  must be a provider or consumer in this clean room.

**Returns:** *(Table)* - A list of activation requests with information about each, including the template and segment name, the status,
the consumer’s account locator, and any error message returned by the request.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.dcr_health.provider_run_provider_activation_history(
  $cleanroom_name);
```

### view_external_activation_history

Schema:
:   LIBRARY

**Description:** View the history of activation requests in the current account.

**Arguments:** *None*

**Returns:** A table with the details and status of activation requests.

**Example**:

```sqlexample
CALL samooha_by_snowflake_local_db.library.view_external_activation_history();
```

## Running analyses as a provider

[Learn how to run a provider analysis.](demo-flows/provider-run-analysis.md)

### enable_provider_run_analysis

Schema:
:   PROVIDER

**Description:** Enables the provider (clean room creator) to run analyses in a specified clean room. This is disabled by default. The
consumer must then call `consumer.enable_templates_for_provider_run` to enable provider-run analyses for specific templates in the clean
room. After that, the provider can run an analysis by calling `provider.submit_analysis_request`.

[Learn more about provider-run analyses.](demo-flows/provider-run-analysis.md)

> **Important:**
>
> This procedure must be called **after** `provider.add_consumers`, and before a consumer installs a clean room. If this is changed
> after a consumer has already installed their clean room, then the consumer must reinstall the clean room to reflect the new configuration.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room that should enable provider-run analysis.
* `consumer_accounts` - (Array of string) Account locators of all consumer accounts that have added data to this clean room.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.enable_provider_run_analysis(
  $cleanroom_name,
  ['<CONSUMER_ACCOUNT_LOCATOR>']
);
```

### disable_provider_run_analysis

Schema:
:   PROVIDER

**Description:** Prevents the provider (clean room creator) from running an analysis in the clean room (this is disabled by default).

> **Important:**
>
> You must call this procedure **after** calling `provider.add_consumers`, and before a consumer installs a clean room. If the run
> analysis setting is changed after a consumer has installed a clean room, then the consumer must reinstall the clean room to implement the
> new setting.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where provider-run analysis should be
  disabled.
* `consumer_account_locator` - (String) Same list of consumer account names passed to `provider.enable_provider_run_analysis`.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.disable_provider_run_analysis(
  $cleanroom_name,
  ['<CONSUMER_ACCOUNT_LOCATOR>']);
```

### is_provider_run_enabled

Schema:
:   LIBRARY

**Description:** Checks if this clean room allows provider-run analyses.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to check.

**Returns:** *(String)* Whether or not this clean room allows provider-run analyses.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_provider_run_enabled($cleanroom_name)
```

### is_request_back_share_mounted

Schema:
:   PROVIDER

**Description:** Checks whether messages can be propagated from the specified consumer to the provider in the specified clean room. If a
back share has not been mounted for this consumer in this clean room, messages such as provider-run request approvals aren’t propagated
from the consumer to the provider (though they will be queued on the consumer side).

Call `provider.mount_request_logs_for_all_consumers` to set up back sharing with this consumer. If you called
`provider.mount_request_logs_for_all_consumers` previously and `is_request_back_share_mounted` fails, it’s likely that you added this
consumer to this clean room after you last called `provider.mount_request_logs_for_all_consumers`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to check.
* `consumer_account` - (String) Account locator of the consumer.

**Returns:** SUCCESS if back sharing is enabled for the specified consumer in the specified clean room. Throws an error otherwise.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_request_back_share_mounted(
  $cleanroom_name,
  $consumer_locator);
```

### view_warehouse_sizes_for_template

Schema:
:   PROVIDER

**Description:** View the list of warehouse sizes and types available to use in provider-run analyses with a given template. The consumer
must first populate the list in their call to `consumer.enable_templates_for_provider_run`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.
* `template_name` - (String) Name of the template that the provider wants to run.
* `consumer_account` - (String) Account locator of the consumer who will approve the provider-run request.

**Returns:** A table of permitted warehouse sizes and types. Supported warehouse type and size strings are those used by the WAREHOUSE_TYPE
and WAREHOUSE_SIZE properties in the [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) command.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.PROVIDER.VIEW_WAREHOUSE_SIZES_FOR_TEMPLATE(
  $cleanroom_name,
  $template_name,
  $consumer_account_loc);
```

### submit_analysis_request

Schema:
:   PROVIDER

**Description:** Submits an analysis to run in the clean room. All of the following conditions
must be met before calling this procedure:

* The provider must have enabled provider-run analyses in this clean room.
* The consumer must have [approved provider-run analyses](consumer.md) for the specified template.
* All [join](consumer.md)) and [column](consumer.md)) policies on the consumer data and the template
  must be respected.

The template runs within the clean room and the results are stored securely inside the clean room. Results are encrypted so only the
provider can see the results.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the template should run.
* `consumer_account_locator` - *(String)* Account locator of the consumer in this clean room who has allowed provider-run analyses by
  calling `consumer.enable_templates_for_provider_run`.
* `template_name` - *(String)* Name of the template to run.
* `provider_tables` - *(Array of String)* List of provider tables to expose to the template. This list will populate the `source_table` array variable.
* `consumer_tables` - *(Array of String)* List of consumer tables to expose to the template. This list will populate the `my_table` array variable.
* `analysis_arguments` - *(JSON object)* Pass in any arguments required by the template as key-value pairs.
  The following fields are required only if the consumer specifies a set of allowed warehouse types and sizes. Call `provider.view_warehouse_sizes_for_template` to see if the consumer has specified required warehouse size and type.

  + `warehouse_type` *(String, required only if the consumer specifies a range of permitted types)* - A warehouse type that the consumer allows for provider-run analyses with the specified template. [See the list of supported types](consumer.md). If the consumer has not specified a preference, the default is STANDARD.
  + `warehouse_size` *(String, required only if the consumer specifies a range of permitted sizes)* - A warehouse size that the consumer allows for provider-run analyses with the specified template. [See the list of supported types](consumer.md). If the consumer has not specified a preference, the default is X-SMALL.

**Returns:** *(String)* A request ID that is used to check the status of the request and also to access the results. **Save this ID**
because you will need it to see the analysis results.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.submit_analysis_request(
  $cleanroom_name,
  '<CONSUMER_ACCOUNT>',
  'prod_overlap_analysis',
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'],
  object_construct(
    'dimensions', ['c.REGION_CODE'],
    'measure_type', ['AVG'],
    'measure_column', ['c.DAYS_ACTIVE'],
    'warehouse_type', 'STANDARD',        -- If this type and size pair were not listed by view_warehouse_sizes_for_template,
    'warehouse_size', 'LARGE'            -- the request will automatically fail.
  )
);
```

### check_analysis_status

Schema:
:   PROVIDER

**Description:** The provider calls this procedure to check the status of the provider analysis request. There can be a significant delay
before you can start seeing the status of a request. When an analysis is marked as complete, call `provider.get_analysis_result` to see
the results.

All consumers in the clean room must have their request logs mounted before you can call check_analysis_status. This is done once per
consumer per clean room by calling `provider.mount_request_logs_for_all_consumers`.

You can see your list of analysis requests by running this SQL command, where `cleanroom_name` is your clean room name, with spaces
replaced by underscores.

```sqlexample
SELECT * FROM SAMOOHA_CLEANROOM_<cleanroom_name>.ADMIN.PROVIDER_ANALYSIS_REQUESTS;
```

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the request was made.
* `request_id` - (String) ID of the request, returned by `provider.submit_analysis_request`.
* `consumer_account_locator` - (String) Account locator of the consumer to whom the request was sent.

**Returns:** *(String)* Status of the request, where `COMPLETED` means a successful completion of the analysis. Possible statuses:

* IN-PROGRESS: The analysis is in progress.
* PENDING: Indicates one of the following cases:

  + The request is still propagating, which can take a few minutes. Try again in a few minutes.
  + The user has not approved the request by calling `consumer.enable_templates_for_provider_run`. Try again in a few minutes.
  + You have not mounted the request logs for this consumer. Call `provider.is_request_back_share_mounted`; if that procedure not return
    SUCCESS, call `provider.mount_request_logs_for_all_consumers`.
* COMPLETED: The analysis is complete. You can call `provider.get_analysis_result`.

**Errors:**

If you see an error “ResultSet is empty or not prepared”, this can indicate that at request logs were not mounted for at least one consumer
in this clean room. Call `provider.mount_request_logs_for_all_consumers` to mount request logs for all consumers.

**Example:**

```sqlexample
-- It can take up to 2 minutes for this to pick up the request ID after the initial request
CALL samooha_by_snowflake_local_db.provider.check_analysis_status(
  $cleanroom_name,
  $request_id,
  '<CONSUMER_ACCOUNT>'
);
```

### get_analysis_result

Schema:
:   PROVIDER

**Description:** Get the results for a provider-run analysis. Do not call `get_analysis_result` until `provider.check_analysis_status`
returns COMPLETED. Analysis results persist in the clean room indefinitely.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room for which the request was sent.
* `request_id` - (String) ID of the request, returned by `submit_analysis_request`.
* `consumer_account_locator` - (String) Account locator of the consumer passed in to `submit_analysis_request`.

**Returns:** *(Table)* Query results.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.get_analysis_result(
    $cleanroom_name,
    $request_id,
    $locator
);
```

## Manage clean room sharing

Use the following commands to manage sharing a clean room with consumers.

### view_consumers

Schema:
:   PROVIDER

**Description:** Lists the consumers who are granted access to the clean room. It does not show whether a consumer has installed the clean
room.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The clean room of interest.

**Returns:** *(Table)* - List of consumer accounts that can access the clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_consumers($cleanroom_name);
```

### add_consumers

Schema:
:   PROVIDER

**Description:** Grants the specified users access to the specified clean room. The clean room can be accessed both through
the clean rooms UI and the API. This doesn’t overwrite the consumer lists from previous calls. Clean room access is granted to a specific
user, not an entire account. The consumer account must be in the same Snowflake region as the provider to be able to access a clean room.
You can check your region by calling `select current_region();`

You can see the current list of consumers by calling `provider.view_consumers`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to share with the specified users. Users can install the clean room using either the
  clean rooms API or UI.
* `consumer_account_locators` - (String) A comma-delimited list of consumer account locators, as returned by
  [CURRENT_ACCOUNT](../../sql-reference/functions/current_account.md). This list should include the same number of entries, in the same order, as contained in
  `consumer_account_names`.
* `consumer_account_names` - (String) A comma-delimited list of
  [consumer data sharing account IDs](../admin-account-identifier.md) for the consumer in the format
  `org_name.account_name` *Org name* can be retrieved by calling [CURRENT_ORGANIZATION_NAME](../../sql-reference/functions/current_organization_name.md).
  *Account name* can be retrieved by calling [CURRENT_ACCOUNT_NAME](../../sql-reference/functions/current_account_name.md). This list should include the same number
  of items, in the same order, as listed in `consumer_account_locators`.
* `enable_differential_privacy_tasks` - (Boolean, optional) TRUE to enforce differential privacy in all queries by the listed
  users in this clean room. This is a simple way to enable differential privacy with default values for the listed users. To specify
  advanced settings, provide the `privacy_settings` argument instead. The differential privacy task must be running in this clean room to
  enable differential privacy. Default is FALSE.
* `privacy_settings` - (String, optional) If present, applies privacy settings to custom templates when used by any of the users in
  `consumer_account_names`. This is a string version of an object with a single NULL key and a value that specifies various privacy
  settings. Do not specify both `enable_differential_privacy_tasks` and `privacy_settings`. The differential privacy task must be running
  in this clean room to enable differential privacy. [See the available fields for this object.](differential-privacy.md)

**Returns:** Success message. Note that the procedure does not validate user locators or account names, so success
indicates only that the submitted locators have been added to the database for this clean room.

**Examples:**

```sqlexample
-- Add consumer without differential privacy.
CALL samooha_by_snowflake_local_db.provider.add_consumers($cleanroom_name,
  'LOCATOR1,LOCATOR2',
  'ORG1.NAME1,ORG2.NAME2');

-- Add consumer and turn on differential privacy for all their queries.
CALL samooha_by_snowflake_local_db.provider.add_consumers($cleanroom_name,
  'LOCATOR1',
  'ORGNAME.ACCOUNTNAME',
  '{
    "null": {
        "threshold_value": 5000,
        "differential": 1,
        "privacy_budget": 10,
        "epsilon": 0.1,
        "noise_mechanism": "Laplace"
    }
  }'
);
```

### remove_consumers

Schema:
:   PROVIDER

**Description:** Removes account access to a given clean room. This method blocks access by all users in the provided accounts.

You can see the current list of consumers by calling `provider.view_consumers`.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The ID of the clean room (not the user-friendly name).
* `cleanroom_account_locators` - (String) A comma-delimited list of user account locators. All users in the account will lose access to the
  clean room.

**Returns:** *(String)* - Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.remove_consumers(
  $cleanroom_name,
  'locator1,locator2,locator3'
);
```

### set_cleanroom_ui_accessibility

Schema:
:   PROVIDER

**Description:** Shows or hides the clean room in the clean rooms UI to all users logged in to this provider account.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - The name of the clean room.
* `visibility_status` - (String) One of the following case-sensitive values:

  + HIDDEN - Hides the clean room in the clean rooms UI from all users in the current provider account. The clean room is still accessible
    for API calls.
  + EDITABLE - Makes the clean room visible in the clean rooms UI.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_cleanroom_ui_accessibility(
  $cleanroom_name,
  'HIDDEN'
);
```

## Cross-cloud collaboration

Enable a clean room to be shared with a consumer on another cloud region. [Learn more.](v1/enabling-laf.md)

### enable_laf_on_account

Schema:
:   LIBRARY

**Description:** Enables Cross-Cloud Auto-Fulfillment on the current account. Running this procedure requires the ACCOUNTADMIN role.

> **Important:**
>
> You must first enable Cross-Cloud Auto-Fulfillment for the account by calling
> [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../sql-reference/functions/system_enable_global_data_sharing_for_account.md).
>
> [Learn more about auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) and
> [managing auto-fulfillment privileges](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.enable_laf_on_account();
```

### disable_laf_on_account

Schema:
:   LIBRARY

**Description:** Disables Cross-Cloud Auto-Fulfillment on the current account. Running this procedure requires the ACCOUNTADMIN role.

> **Important:**
>
> You must first call [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../../sql-reference/functions/system_enable_global_data_sharing_for_account.md) before you can disable Cross-Cloud
> Auto-Fulfillment on an account.
>
> [Learn more about auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) and
> [managing auto-fulfillment privileges](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
USE ROLE ACCOUNTADMIN;
CALL samooha_by_snowflake_local_db.library.disable_laf_on_account();
```

### is_laf_enabled_on_account

Schema:
:   LIBRARY

**Description:** Returns whether Cross-Cloud Auto-Fulfillment is enabled for this account.

**Arguments:** *None*

**Returns:** TRUE if Cross-Cloud Auto-Fulfillment is enabled for this account, FALSE otherwise.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.is_laf_enabled_on_account();
```

### set_laf_dcr_refresh_schedule

Schema:
:   PROVIDER

**Description:** Sets the refresh interval for [clean room data](v1/enabling-laf.md) between the provider and the consumer
when they are located on different cloud regions. This data includes provider datasets, provider run requests, clean room policies, and clean room metadata. If you need an immediate refresh, you can call [SYSTEM$TRIGGER_LISTING_REFRESH](../../sql-reference/functions/system_trigger_listing_refresh.md).

**Arguments:**

* `schedule` - (Int) Interval, in minutes, between refreshes. The minimum allowed value is 10.

**Returns:** *(String)* - Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_laf_dcr_refresh_schedule(10);
```

## Use Python in a clean room

### load_python_into_cleanroom

Schema:
:   PROVIDER

**Description:** Loads custom Python code into the clean room. Code loaded into the clean room using this procedure is not visible
to consumers. The uploaded code can be called by your Jinja template. Although your code can include multiple function definitions, only
one function is exposed for a template to call.

If you want to load multiple callable Python packages into a clean room in a single patch, call `prepare_python_for_cleanroom` instead.

[Learn how to upload and use Python code in a clean room.](demo-flows/custom-code.md)

This procedure increments the patch number of your clean room and triggers a security scan. You must wait for the scan status to be APPROVED
before you can share the latest version with collaborators. This step does not report syntax errors in the code, which are thrown at run
time.

This procedure is overloaded, and has two signatures that differ in the data type of the fifth argument, which determines whether you are
uploading the code inline or loading it from a file on a stage:

Inline uploadLink from stage

**Signature**

`load_python_into_cleanroom` has the following signature for inline code upload. Pass your code string into the `code` argument.

```javascript
(cleanroom_name String, function_name String, arguments Array, packages Array, rettype String, handler String, code String)
```

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the script should be loaded.
* `function_name` - (String) Name that a template uses to call the function specified by `handler`.
  The template must qualify the function name with the `cleanroom` namespace. For example: `cleanroom.my_func(val1, val2)`.
* `arguments` - (Array of space-delimited string pairs) An array of arguments required by function `function_name`. Each element
  is a space-delimited `'name  data_type'` pair that specifies the argument name and its Snowflake SQL data type. For
  example: `['size INT', 'start_date DATE']`.
* `packages` - (Array of string) List of any Python package names used by the code. Clean rooms natively supports all the packages
  [in this list](https://repo.anaconda.com/pkgs/snowflake/) or the [Snowpark API](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index).
  If you need a package not listed there, you must use [Snowpark Container Services in a clean room.](demo-flows/snowpark.md)
* `ret_type` - (String) SQL data type of the value returned by the function `handler`.
  ([See some equivalent Python and SQL types](../../developer-guide/udf-stored-procedure-data-type-mapping.md). Snowflake
  [SQL type synonyms](../../sql-reference/intro-summary-data-types.md) are accepted, such as STRING for VARCHAR.) For a UDF, the return
  type is a single SQL type. For a UDTF, the return type is a TABLE function with `column_name SQL column type` pairs. For
  example:

  > `TABLE (item_name STRING, total FLOAT)`
* `handler` - (String) The function called in your code when a template calls `function_name`. For a UDF this
  should be the function name itself; for a UDTF, this should be the name of the class that implements the UDTF.
* `code` - (String) Your Python code as a string. This should be a [Python UDF](../../developer-guide/udf/python/udf-python-designing.md).

**Signature**

Upload your code to a Snowflake stage, and then provide the stage location to the clean room API. You must use the stage enabled for your
specific clean room by calling `provider.get_stage_for_python_files`.

`load_python_into_cleanroom` has the following signature to upload code into the clean room from a stage.

```javascript
(cleanroom_name String, function_name String, arguments Array, packages Array, imports Array, rettype String, handler String)
```

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the script should be loaded.
* `function_name` - (String) Name that a template uses to call the function specified by `handler`.
  The template must qualify the function name with the `cleanroom` namespace. For example: `cleanroom.my_func(val1, val2)`.
* `arguments` - (Array of space-delimited string pairs) An array of arguments required by function `function_name`. Each element
  is a space-delimited `'name  data_type'` pair that specifies the argument name and its Snowflake SQL data type. For
  example: `['size INT', 'start_date DATE']`.
* `packages` - (Array of string) List of any Python package names used by the code. Clean rooms natively supports all the packages
  [in this list](https://repo.anaconda.com/pkgs/snowflake/) or the [Snowpark API](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index).
* `imports` - (Array of string) List of files to import from the stage. Each file address is relative to the stage to where you
  uploaded the code, for example: `['/my_func.py']`. Find the clean room stage by calling `provider.get_stage_for_python_files`.
* `ret_type` - (String) SQL data type of the value returned by the function `handler`.
  ([See some equivalent Python and SQL types](../../developer-guide/udf-stored-procedure-data-type-mapping.md). Snowflake
  [SQL type synonyms](../../sql-reference/intro-summary-data-types.md) are accepted, such as STRING for VARCHAR.) For a UDF, the return
  type is a single SQL type. For a UDTF, the return type is a TABLE function with `column_name SQL column type` pairs. For
  example:

  > `TABLE (item_name STRING, total FLOAT)`
* `handler` - (String) The function called in your code when a template calls `function_name`. For a UDF this
  should be the function name itself; for a UDTF, this should be the name of the class that implements the UDTF.

**Returns:** *(String)* Success message if the upload succ

**Examples:**

```sqlexample-python
-- Inline UDF

CALL samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
    $cleanroom_name,
    'assign_group',                      -- Name of the UDF.
    ['data STRING', 'index INTEGER'],    -- Arguments of the UDF, along with their type.
    ['pandas', 'numpy'],                 -- Packages UDF will use.
    'INTEGER',                           -- Return type of UDF.
    'main',                              -- Handler.
    $$
import pandas as pd
import numpy as np

def main(data, index):
    df = pd.DataFrame(data)  # you can do something with df but this is just an example
    return np.random.randint(1, 100)
    $$
);
```

```sqlexample-python
-- Upload from stage

CALL samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
    $cleanroom_name,
    'myfunc',                            -- Name of the UDF.
    ['data STRING', 'index INTEGER'],    -- Arguments of the UDF.
    ['numpy', 'pandas'],                 -- Packages UDF will use.
    ['/assign_group.py'],                -- Python file to import from a stage.
    'INTEGER',                           -- Return type of UDF.
    'assign_group.main'                  -- Handler, scoped to file name.
);
```

### prepare_python_for_cleanroom

Schema:
:   PROVIDER

**Description:** Loads custom Python code into the clean room as part of a bulk code upload flow. Call this procedure multiple times to
upload multiple packages, then call `load_prepared_python_into_cleanroom` to trigger the upload to the specified clean room, flush the
pool of prepared code, and generate a new clean room patch.

Uploaded code can be called by your Jinja template. To upload only a single Python bundle, you can call `load_python_into_cleanroom`
instead.

[Learn how to upload and use Python code in a clean room.](demo-flows/custom-code.md)

You can either pass code directly into this procedure using the `code` parameter, or pass in the name of a file in a stage that contains
the code using the `imports` parameter.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where the script should be loaded.
* `function_name` - (String) Name that a template uses to call the function specified by `handler`.
  The template must qualify the function name with the `cleanroom` namespace. For example: `cleanroom.my_func(val1, val2)`.
* `arguments` - (Array of space-delimited string pairs) An array of arguments required by function `function_name`. Each element is a
  space-delimited `'name  data_type'` pair that specifies the argument name and its Snowflake SQL data type. For example:
  `['size INT', 'start_date DATE']`.
* `packages` - (Array of string) List of any Python package names used by the code. Clean rooms natively supports all the packages
  [in this list](https://repo.anaconda.com/pkgs/snowflake/) or the
  [Snowpark API](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index). If you need a package not
  listed there, you must use
  [Snowpark Container Services in a clean room.](demo-flows/snowpark.md)
* `imports` - (Array of string) List of Python source files, when importing your source from a stage. Each file address is relative to
  the stage to where you uploaded the code, for example: `['/my_func.py']`. Find the clean room stage by calling
  `provider.get_stage_for_python_files`. If you are providing code inline by using the `code` parameter, provide an empty array.
* `rettype` - (String) SQL data type of the value returned by the function `handler`.
  ([See some equivalent Python and SQL types](../../developer-guide/udf-stored-procedure-data-type-mapping.md). Snowflake
  [SQL type synonyms](../../sql-reference/intro-summary-data-types.md) are accepted, such as STRING for VARCHAR.) For a UDF, the return
  type is a single SQL type. For a UDTF, the return type is a TABLE function with `<column name> <SQL column type>` pairs. For
  example:

  > `TABLE (item_name STRING, total FLOAT)`
* `handler` - (String) The function called in your code when a template calls `function_name`. For a UDF this
  should be the function name itself; for a UDTF, this should be the name of the class that implements the UDTF.
* `code` - (String) Your Python code as a string. This should be a [Python UDF](../../developer-guide/udf/python/udf-python-designing.md)
  or UDTF. If you are uploading the code from a stage, this should be an empty string.

**Returns:** *(String)* Summary of the upload request, including the patch number before the code is added to the clean room.

**Example:**

This example loads two simple Python procedures into a clean room and triggers only a single patch generation.

```sqlexample
CALL samooha_by_snowflake_local_db.provider.prepare_python_for_cleanroom(
    $cleanroom_name,
    'get_next_status',  -- Name of the UDF. Can be different from the handler.
    ['status VARCHAR'], -- Arguments of the UDF, specified as (variable name, SQL type).
    ['numpy'],          -- Packages needed by UDF.
    [],                 -- When providing the code inline, this is an empty array.
    'VARCHAR',          -- Return type of UDF.
    'get_next_status',  -- Handler.
    $$
import numpy as np
def get_next_status(status):
  """Return the next higher status, or a random status
  if no matching status found or at the top of the list."""

  statuses = ['MEMBER', 'SILVER', 'GOLD', 'PLATINUM', 'DIAMOND']
  try:
    return statuses[statuses.index(status.upper()) + 1]
  except:
    return 'NO MATCH'
    $$
);

 CALL samooha_by_snowflake_local_db.provider.prepare_python_for_cleanroom(
    $cleanroom_name,
    'hello_world',  -- Name of the UDF.
    [],
    [],
    [],
    'VARCHAR',
    'hello_world',
    $$
import numpy as np
def hello_world():
  return 'Hello world!'
    $$
);

CALL samooha_by_snowflake_local_db.provider.load_prepared_python_into_cleanroom($cleanroom_name);
```

### load_prepared_python_into_cleanroom

Schema:
:   PROVIDER

**Description:** Takes all code staged using previous calls to `prepare_python_for_cleanroom`, runs a security scan on the code and, if the scan passes, uploads the code to the clean room and generates a new clean room patch. To serve this version of the clean room to users, you must then update the clean room’s release directive to the patch number returned by this procedure by calling
`set_default_release_directive`. Whether or not the call succeeds, it flushes the pool of Python code stored in previous calls to `prepare_python_for_cleanroom`. This step does not report syntax errors, which are only reported when you try to run your code.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where you want to upload Python code.

**Returns:** *(String)* If successful, returns the new patch number created. Update the clean room’s release directive to the patch number returned by this procedure by calling `set_default_release_directive`.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.load_prepared_python_into_cleanroom($cleanroom_name);
```

### get_stage_for_python_files

Schema:
:   PROVIDER

**Description:** Returns the stage path where Python files should be uploaded, if you plan to use code files uploaded to a stage rather than
inline code definitions to define custom Python code in a clean room. The stage does not exist, and can’t be examined, until after files
are uploaded by calling `provider.load_python_into_cleanroom`.

[Learn how to upload and use Python code in a clean room.](demo-flows/custom-code.md)

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room where you want to upload files.

**Returns:** *(String)* The path where you should upload code files. Use this for the *imports* argument in
`provider.load_python_into_cleanroom`.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.get_stage_for_python_files($cleanroom_name);
```

### view_cleanroom_scan_status

Schema:
:   PROVIDER

**Description:** Reports the threat scan status for a clean room with DISTRIBUTION set to EXTERNAL. The scan needs to be marked as
“APPROVED” before you can set or change the default release directive. Scan status needs to be checked only with EXTERNAL clean rooms.

A scan is run after any action that generates a new patch version; most commonly this is either after you first publish the clean room, or
after you upload Python into the clean room. Snowflake Data Clean Rooms uses the
[Snowflake Native App security scan framework](../../developer-guide/native-apps/security-run-scan.md).

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to check the status of.

**Returns:** *(String)* The scan status. The following values are possible:

* `NOT_REVIEWED` - The scan is in progress.
* `APPROVED` - The scan passed.
* `REJECTED` - The scan failed; a new clean room version won’t be published. Try to find the problems in your code and
  retry the last action.
* `MANUAL_REVIEW` - The scan requires manual review by Snowflake. This might take a few days, so check again periodically.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_cleanroom_scan_status($cleanroom_name);
```

## Request logs

Use the following commands to manage consumer request logs. Request logs enable the consumer to send messages to the provider, and must be
mounted to enable functionality such as consumer custom template requests, consumer approval of provider-run requests, and Cross-Cloud
Auto-Fulfillment.

### mount_request_logs_for_all_consumers

Schema:
:   PROVIDER

**Description:** Gives providers access to requests from the consumer. You must mount request logs to support various functionality,
including consumer custom template requests, consumer approval of provider-run requests, and Cross-Cloud Auto-Fulfillment.

This mounts request logs only for consumers that have already installed the specified clean room; if a consumer installs a clean room after
the provider calls this procedure, the provider must call this procedure again.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to mount request logs for.

**Returns:** *(Table)* A table of consumers, with the request log mount status for each. If a consumer was granted access to a clean room
but hasn’t yet installed the clean room, the status is described as pending, and you should call `mount_request_logs_for_all_consumers`
again after they have installed the clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.mount_request_logs_for_all_consumers($cleanroom_name);
```

### view_request_mount_status_for_all_consumers

Schema:
:   PROVIDER

**Description:** Shows the mount status of request logs for all consumers in the specified clean room. Only consumers that were included in
a call to `provider.mount_request_logs_for_all_consumers` are shown. Request logs enable messages to be passed from the consumer to the
provider.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.

**Returns:** *(Table)* - A table of consumers and the request log mount status of each consumer.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_request_mount_status_for_all_consumers($cleanroom_name);
```

### view_request_logs

Schema:
:   PROVIDER

**Description:** Shows the request logs sent by consumers in this clean room. Only requests from consumers who were included in a previous
successful call to `mount_request_logs_for_all_consumers` are shown.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room to review request logs for.

**Returns:** *(Table)* The requests sent by the consumer to the provider in the specified clean room.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.view_request_logs($cleanroom_name);
```

## Differential privacy

These commands control differential privacy at the user level or provider account level. [Learn more about differential privacy.](differential-privacy.md)

### set_privacy_settings

Schema:
:   PROVIDER

**Description:** Set (or reset) privacy settings enforced when the specified consumer runs a custom template. This
overwrites all existing settings for this consumer.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.
* `consumer_account_locator` - (String) Account locator of one or more consumers, in a comma-delimited list.
* `privacy_settings` - (Object) A JSON object that specifies differential privacy settings for one or more templates. Settings are applied
  to all templates run by the specified consumer. [See the available fields for this object.](differential-privacy.md)

**Returns:** Success message.

**Example:**

```sqlexample
-- Enforce differential privacy on queries by this consumer
-- with the settings provided.
CALL samooha_by_snowflake_local_db.provider.set_privacy_settings(
  $cleanroom_name,
  $consumer_locator,
  { 'differential': 1,
    'epsilon': 0.1,
    'privacy_budget': 3 });
```

### is_dp_enabled_on_account

Schema:
:   PROVIDER

**Description:** Describes whether or not differential privacy is enabled for this account.

**Arguments:** *None*

**Returns:** TRUE if differential privacy is enabled for this account, FALSE otherwise.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.is_dp_enabled_on_account();
```

### suspend_account_dp_task

Schema:
:   PROVIDER

**Description:** Disables the task that monitors and enforces differential privacy budgets. This is used to control the
[costs associated with differential privacy in your account](cleanroom-cost.md). If the differential privacy task is
disabled, noise will still be added to queries by users, templates, or clean rooms where differential privacy is specified, but budget
limits will not be enforced and you will not incur costs from differential privacy.
[Learn more about managing differential privacy.](differential-privacy.md)

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.suspend_account_dp_task();
```

### resume_account_dp_task

Schema:
:   PROVIDER

**Description:** Resumes the differential privacy task listener in the current account, and differential privacy budgets will be enforced.
Any differential privacy values previously set (such as sensitivity or associated users) are retained.

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.resume_account_dp_task();
```

## Snowpark Container Services commands

These procedures enable you to [use Snowpark Container Services inside a clean room](demo-flows/snowpark.md).

### load_service_into_cleanroom

Schema:
:   PROVIDER

**Description:** Creates or updates a container service in a clean room. Calling this procedure updates the clean room patch number, so you
must call `provider.set_default_release_directive` after calling this procedure. You must call this procedure every time you create or
update the service. The client must then call `consumer.start_or_update_service` to see any updates.

[Learn about using Snowpark Container Services in a clean room.](demo-flows/snowpark.md)

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* - Name of the clean room.
* `service_spec` - (String) A YAML specification for the service, rooted at the `spec` element.
* `service_config` - (String) A YAML format configuration for the service. The following properties are supported:

  + `default_service_options` - An optional array of service-level default values. These values can be overridden by the consumer when
    they create their service. The following child properties are supported:

    - `min_instances` *(Integer, optional)*
    - `max_instances` *(Integer, optional)*
    - `allow_monitoring` *(Boolean, optional)* - If TRUE, allows the consumer to see service logs. Default is FALSE.
  + `functions` - An array of functions exposed by the service. Each function definition maps to the
    [SPCS service function definition](../../sql-reference/sql/create-function-spcs.md). See that documentation to learn the details of each
    element. The following child properties are supported:

    - `name`
    - `args`
    - `returns`
    - `endpoint`
    - `path`
    - `max_batch_rows` (*optional*)
    - `context_headers` (*optional*)

**Returns:** (*String*) Success message, if successful. Throws an error if not successful.

**Example:**

```sqlexample-yaml
CALL samooha_by_snowflake_local_db.provider.load_service_into_cleanroom(
    $cleanroom_name,
    $$
    spec:
      containers:
      - name: lal
        image: /dcr_spcs/repos/lal_example/lal_service_image:latest
        env:
          SERVER_PORT: 8000
        readinessProbe:
          port: 8000
          path: /healthcheck
      endpoints:
      - name: lalendpoint
        port: 8000
        public: false
    $$,
    $$
    default_service_options:
      min_instances: 1
      max_instances: 1
      allow_monitoring: true

    functions:
      - name: train
        args: PROVIDER_TABLE VARCHAR, PROVIDER_JOIN_COL VARCHAR, CONSUMER_TABLE VARCHAR, CONSUMER_JOIN_COL VARCHAR, DIMENSIONS ARRAY, FILTER VARCHAR
        returns: VARCHAR
        endpoint: lalendpoint
        path: /train
      - name: score
        args: PROVIDER_TABLE VARCHAR, PROVIDER_JOIN_COL VARCHAR, CONSUMER_TABLE VARCHAR, CONSUMER_JOIN_COL VARCHAR, DIMENSIONS ARRAY
        returns: VARCHAR
        endpoint: lalendpoint
        path: /score
      - name: score_batch
        args: ID VARCHAR, FEATURES ARRAY
        returns: VARIANT
        max_batch_rows: 1000
        endpoint: lalendpoint
        path: /scorebatch
$$);
```

## Environment management

Use the following commands to generally assist in leveraging clean room functionality and supported flows.

### manage_datastats_task_on_account

Schema:
:   PROVIDER

**Description:** Enables or disables the background task that computes clean room statistics. The task is running by default, but you can
disable it to reduce your costs. To manage the task, all collaborators must call the appropriate `provider` or `consumer` version of this
procedure with the same value.

**Arguments:**

* `enable` - (Boolean) TRUE to enable the task, FALSE to disable the task.

**Returns:** Success message.

**Example:**

```sqlexample
-- Disable the task in this account.
CALL samooha_by_snowflake_local_db.provider.manage_datastats_task_on_account(FALSE);
```

### enable_local_db_auto_upgrades

Schema:
:   LIBRARY

**Description:** Enables the task that automatically upgrades the Snowflake Data Clean Rooms environment when new procedures or
functionality is released (The task is `samooha_by_snowflake_local_db.admin.expected_version_task`. ) Call this procedure to automate
upgrades, rather than calling `library.apply_patch` with each new release.

Although you might reduce cost by disabling this task, we recommend that you leave it running to ensure that you have the latest version of
the clean rooms environment on your system.

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.enable_local_db_auto_upgrades();
```

### disable_local_db_auto_upgrades

Schema:
:   LIBRARY

**Description:** Disables the task that automatically upgrades the Snowflake
Data Clean Rooms environment when new versions are released. If you disable auto upgrades, you must call
`library.apply_patch` with each [new release](../../release-notes/new-features.md).

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.disable_local_db_auto_upgrades();
```

### apply_patch

Schema:
:   LIBRARY

**Description:** Updates your clean rooms environment, enabling new features and fixes in your environment. Call this when a new version of
the clean rooms environment has been released. (This typically occurs weekly; see clean rooms entries in
[Recent feature updates](../../release-notes/new-features.md).) This procedure updates [samooha_by_snowflake_local_db](v1/installation-details.md).

You can automate patch updates by calling `library.enable_local_db_auto_upgrades`. We recommend enabling auto-updates.

**Arguments:** *None*

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.library.apply_patch();
```

### patch_cleanroom

Schema:
:   PROVIDER

**Description:** Updates the specified clean room to the latest version, enabling new features and fixes for that clean room. Typically you
call this only when Snowflake Support tells you to call it.

The provider should call `library.patch_cleanroom` before the consumer calls `library.patch_cleanroom`. Otherwise, there is no patch to
apply.

**Arguments:**

* [cleanroom_name](v1/developer-introduction.md) *(String)* : Name of the clean room to patch.

**Returns:** *(String)* Success message.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.provider.patch_cleanroom($cleanroom_name);
```

### dcr_health.dcr_tasks_health_check

**Description:** Shows information about running or recently stopped clean room tasks.

**Arguments:** *None*

**Returns:** *(Table)* Information about clean room tasks, including the schedule, warehouse name, and warehouse size.

**Example:**

```sqlexample
CALL samooha_by_snowflake_local_db.dcr_health.dcr_tasks_health_check();
```

---
title: Snowflake Data Clean Rooms: Troubleshooting external data connectors
source: https://docs.snowflake.com/en/user-guide/cleanrooms/external-data-troubleshoot.md
section: Clean Rooms
---

# Snowflake Data Clean Rooms: Troubleshooting external data connectors

> **Note:**
>
> Snowflake Data Clean Rooms do not currently support data subject consent management. Customers are responsible for ensuring they have
> obtained all necessary rights and consents to use the data linked in their clean rooms. Customers must also ensure compliance with all
> applicable laws and regulations when using Data Clean Rooms, including in connection with third-party connectors.

This topic describes how to troubleshoot external data errors. It applies to Amazon Web Services, Microsoft Azure, and Google Cloud.

## Steps to follow to troubleshoot external data errors

1. Ensure the path URL/URI is correct. See the associated related topic for the correct URL/URI.
2. Ensure there is at least one file in the bucket or blob storage.
3. Ensure the file is in parquet format.
4. Ensure the parquet file is not empty.
5. Ensure the parquet file is not compressed using the Snappy format.
6. If none of the above resolves the issue, then debug with the following script:

> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> USE DATABASE SAMOOHA_BY_SNOWFLAKE_LOCAL_DB;
> USE SCHEMA PUBLIC;
>
> /*
>   Query the stage name from the connector configuration.
>   Use AWS_CONNECTOR_ID for AWS, GCP_CONNECTOR_ID for GCP and
>   AZURE_CONNECTOR_ID for Azure.
>
>   For example, if you are connecting to AWS, enter:
>
>   SELECT CONFIGURATION_ID, PARSE_JSON(CONFIGURATION) FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONNECTOR_CONFIGURATION WHERE CONNECTOR_ID = 'AWS_CONNECTOR_ID';
>
> /*
>   Note that the rest of this script relies on the output of this query so you
>   must save the output for use in the rest of the steps.
>
>   Next, check the storage integration. Replace <CONFIGURATION_ID> from the output
>   of the query.
> */
>
>   DESC STORAGE INTEGRATION SAMOOHA_STORAGE_INT_<CONFIGURATION_ID>;
>
> /*
>   List files in the stage. Replace <STAGE_NAME> from the output of the query.
> */
>
>   LIST @<STAGE_NAME>;
>
> /*
>   Check if you are able to query the files in the external stage. Replace
>   <STAGE_NAME> from the output of the query.
> */
>
>   SELECT * FROM @<STAGE_NAME> LIMIT 10;
>
> /*
>   Check if you are able to infer the schema from the files in the external
>   stage. Replace <STAGE_NAME> from the output of the query.
> */
>
>   SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
>   WITHIN GROUP (ORDER BY order_id)
>   FROM TABLE(
>     INFER_SCHEMA(
>       LOCATION=>'@<STAGE_NAME>',
>       FILE_FORMAT=>'SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PAR_FF'
>     )
>   );
>
> /*
>   Try to create a table from the external stage. Replace <STAGE_NAME> from
>   the output of the query.
> */
>
>   CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.CREATE_TABLE_FROM_STAGE('<STAGE_NAME>', 'EXT_INT_TEMP_TABLE');
>
> /*
>   Check data in the table.
> */
>
>   SELECT * FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.EXT_INT_TEMP_TABLE LIMIT 10;
> ```

---
title: Template specification
source: https://docs.snowflake.com/en/user-guide/cleanrooms/spec-template.md
section: Clean Rooms
---

# Template specification

Defines a single template in a collaboration. Templates are registered by calling REGISTER_TEMPLATE with the template specification.

**Schema:**

```yaml
api_version: 2.0.0              # Required: Must be "2.0.0"
spec_type: template             # Required: Must be "template"
name: <template_name>           # Required: Unique name (max 75 chars)
version: <version_string>       # Required: Version identifier (max 20 chars)
type: <template_type>           # Required: sql_analysis or sql_activation
description: <template_description>  # Optional: High-level description (max 1,000 chars)
methodology: <methodology_description>  # Optional: Detailed description (max 1,000 chars)

parameters:                     # Optional: User-provided parameters
  - name: <parameter_name>      # One or more parameter items...
    description: <parameter_description>  # Optional: Description (max 500 chars)
    required: <true_or_false>   # Optional: Whether required (default: false)
    default: <default_value>    # Optional: Default value
    type: <data_type>           # Optional: String, integer, number, Boolean, array, or object

code_specs:             # Optional: List of code bundles used by this template
  - <code_spec_id>        # One or more code spec IDs.

template: |                     # Required: JinjaSQL template content
  <template_content>
```

`api_version`
:   The version of the Collaboration API used. Must be `2.0.0`.

`spec_type`
:   Specification type identifier. Must be `template`.

`name: template_name`
:   A unique, user-friendly name for this template. Must follow [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md) with a
    maximum of 75 characters.
    The `name_version` pair must be unique for all templates in this account.

`version: version_string`
:   A version identifier for this template (maximum 20 characters). Must follow
    [Snowflake identifier rules](../../sql-reference/identifiers-syntax.md). The version string is given its own column in the response to
    VIEW_TEMPLATES and VIEW_REGISTERED_TEMPLATES, so use a value that can be sorted by increasing value. Example: `V0`

`type`
:   The template type. One of the following values:

    * `sql_analysis`: Template for data analysis operations.
    * `sql_activation`: Template for data activation operations.

`description: template_description` (*Optional*)
:   A high-level description of what this template does (maximum 1,000 characters).

`methodology: methodology_description` (*Optional*)
:   A more detailed description of how this template works (maximum 1,000 characters).

`parameters` (*Optional*)
:   The list of all user-provided parameters in this template. Each item can have the following fields:

    * `name`: Parameter name as a valid [Snowflake identifier](../../sql-reference/identifiers-syntax.md), max 255 characters.
    * `description` (*Optional*): Human-readable description of the parameter (maximum 500 characters).
    * `required` (*Optional*): Whether the parameter is required. Default is `false`.
    * `default` (*Optional*): Default value for the parameter, which can be any data type.
    * `type` (*Optional*): Expected data type of the parameter. One of: `string`, `integer`, `number`, `boolean`,
      `array`, or `object`.

`code_specs` (*Optional*)
:   One or more code bundles that define any functions referenced by this template. Required when the template
    calls [custom functions](resources-code-bundles.md). Code spec IDs are versioned; if you want to access a new version
    of a function, you must update the code spec ID here, but not in the template itself, which calls the unversioned function name. The code
    spec name must have an underscore in it, and match the regular expression pattern `[A-Za-z]\w{0,74}_\w{1,20}`.

`template`
:   The template content. For SQL templates, this contains the [JinjaSQL template](custom-templates.md).
    For more information, see [Template design](resources-templates.md).

    The column names exposed to the template are determined by the `category` and `column_type` values for the column in the
    [data offering specification](spec-data-offering.md). For more information, see [Source column renaming](resources-data-offerings.md).

## Example

```yaml
api_version: 2.0.0
spec_type: template
name: trivial_template
version: V1
type: sql_analysis
description: Simple one-row template.
methodology: Always returns "1". Requires one source table.

parameters:
  - name: row_count
    description: Count of rows
    required: true

template: |
    SELECT 1 FROM IDENTIFIER( {{ source_table[0] }} ) LIMIT {{ row_count }};
```

---
title: Templates
source: https://docs.snowflake.com/en/user-guide/cleanrooms/resources-templates.md
section: Clean Rooms
---

# Templates

Templates are JinjaSQL clean room templates that can be run by specified collaborators.
Any collaborator can share a template with collaborators in a collaboration.
You can request to add or remove only templates that your account has registered.

Continue reading to see how to register and add a template into a collaboration:

## Register a template

Follow these steps to register a template:

1. Design a template for the collaboration and embed it in a [template specification](spec-template.md).
2. Register the template by calling REGISTER_TEMPLATE. This returns a template ID that you will use to link the template.

After the template is registered, it can be linked into a collaboration by anyone who has read access to that registry.

## Add a template

The process to request template addition depends on whether the collaboration already exists.

* **To add a template before the collaboration is created,** give the template ID to the collaboration owner, who includes it in the
  [collaboration spec](spec-collaboration.md), specifying who can run the template.
  In the following collaboration snippet, `alice` is granted access to run template `bob_template_v1`.

  ```yaml
  ...
  analysis_runners:
    alice:
      templates:
      - id: bob_template_v1
  ...
  ```

* **To add a template into an existing collaboration,** you send a request to all prospective sharers by following these steps:

  1. Call ADD_TEMPLATE_REQUEST with the template ID to start the approval flow to add the template into a specific collaboration, for
     specific users.

     All collaborators affected by the template see the request when they call VIEW_UPDATE_REQUESTS.
  2. Collaborators who see the request with status PENDING_MY_APPROVAL should call APPROVE_UPDATE_REQUEST or REJECT_UPDATE_REQUEST.

     + If any collaborator rejects the request, the update request is rejected.
     + Collaborators can not later change an approval to a rejection, or a rejection to an approval.
     + The template would not be shared until *all* requested parties approve the request.
     + Collaborators can’t later change an approval to a rejection, or a rejection to an approval.
     + After you approve, the status changes to PENDING_PARTNER_APPROVAL if other collaborators still need to approve.
  3. When all required collaborators have approved, the status changes to APPROVED and the update is applied automatically. The terminal statuses for an update request are COMPLETED and FAILED. When the request status is COMPLETED, the template is available to the users specified in the add template request. If the request is FAILED, see the DETAILS column in VIEW_UPDATE_REQUESTS for failure details. If any collaborator rejects the request, the status is REJECTED and any reason supplied by the rejecting party is visible in the request report.
  4. There might be a short delay after a template is approved by all users before the template is available. Call VIEW_TEMPLATES to confirm that the template is available to use.

> **Tip:**
>
> To see which templates you have registered, call VIEW_REGISTERED_TEMPLATES.

See [Run an analysis](demo-flows/basic-multiparty-collab.md) to learn how to run an analysis.

## Template design

Collaboration templates are the same as [Provider and Consumer Clean Room templates](custom-templates.md), with a
few special considerations:

* The template’s `source_table` variable is populated by the collaboration’s data offerings. In most collaboration templates,
  `source_table` is the only data source variable used.
* The template’s `my_table` is used only when an analysis runner is using Snowflake Standard Edition and can’t contribute data offerings
  to a collaboration.
* Columns from the original data sources can be renamed when exposed to the template or user. See [Source column renaming](resources-data-offerings.md)
  to learn how and when source columns are renamed. Templates and user-provided arguments (such as a join column name) should use the final
  name, not the original name, if the column is renamed.
* Activation templates in a collaboration don’t need to be named `activation_<template_name>`. All other [activation template requirements](custom-templates.md) still apply.

For information about custom template syntax in Snowflake Data Clean Rooms, see [Design custom templates](custom-templates.md).

---
title: Troubleshooting Collaboration Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v2/troubleshooting.md
section: Clean Rooms
---

# Troubleshooting Collaboration Data Clean Rooms

Consult the following troubleshooting tips when you encounter errors while you work with Collaboration Data Clean Rooms.

## Collaborations

Error:
:   `Pending invitation for collaboration: <collaboration name> not found` although `GET_STATUS` shows the account as `INVITED`.

Cause:
:   If an initial join attempt has failed for some reason, later join attempts will likely fail with this reason.

Solution:
:   Delete and recreate the collaboration.

---

Error:
:   A collaboration that you created is not visible in a collaborator’s account.

Cause:
:   There are several possible reasons:

    * The collaboration was created in a different cloud hosting region and you haven’t enabled
      [cross-cloud auto-fulfillment](../laf.md).
    * You didn’t share the collaboration, you shared the collaboration with the wrong account, or you opened the wrong collaborator account in the
      Snowsight/SDCR UI/CLI. Confirm that the account where you expect to see your collaboration is the one that you shared the collaboration
      with, and that you’re signed in to that shared account.
    * There is a small delay between publishing a collaboration and when it becomes visible to the collaborator.

Solution:
:   Verify that the collaborator’s account matches the one in your collaboration spec and that cross-cloud auto-fulfillment is enabled if needed. Wait a few moments for the collaboration to propagate.

---

Error:
:   `ReferenceUsageGrantMissingException: Reference usage grants are required for the following databases in your account ...` when a data provider tries to join the collaboration.
    Data providers will see this message when they try to join a collaboration, and they have shared data that they don’t have REFERENCE_USAGE on.
    This is expected behavior.

Solution:
:   The error message includes a database name and a share name. Either someone with REFERENCE_USAGE on the data, or an ACCOUNTADMIN, must run
    the following SQL command, providing the database and share names given in the error message:

    > ```sqlexample
    > GRANT REFERENCE_USAGE ON DATABASE <database_name> TO SHARE <share_name>;
    > ```
    >
    > After REFERENCE_USAGE is successfully granted, the data provider can join the collaboration.

## API and Permissions

Error:
:   `Unknown user-defined function <function name>`

Cause:
:   If this is a procedure documented for the DCR Collaboration API, you might have misspelled the procedure.

    If you have not misspelled the procedure name, or if the procedure is a system procedure (that is, it has a `$` in the name), you might
    be using an older version of the API and need to upgrade your clean rooms API version.

Solution:
:   * Confirm that you spelled the procedure correctly, and if not, try again with the proper spelling.
    * To update your installation, run the following SQL code:

    ```sqlexample
    USE ROLE ACCOUNTADMIN;
    CALL SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.PREPARE_MOUNT_SCRIPT();
    EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
    ```

---

Error:
:   `Listing 'listing name' is not fulfilled to your current region. Please request the listing, or if already requested, retry after some time`

Cause:
:   You are using an older version of the clean rooms API. This issue was fixed in a more recent version.

Solution:
:   [Update your clean rooms installation.](../admin-tasks.md)

---

Error:
:   `SQL compilation error: Unknown user-defined function SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN`

Cause:
:   Either you misspelled some part of the fully-qualified procedure name, or you do not have privileges to run this procedure.

Solution:
:   Confirm that you used the correct name of the procedure. If you are not using SAMOOHA_APP_ROLE, try switching to that role to see if the same error occurs. If it does not, it is a privilege error.

---

Error:
:   `Unknown user-defined function SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.<namespace>.<procedure name>`

Cause:
:   One of the following:

    * You used the wrong namespace. Be sure to call the proper `COLLABORATION` or `REGISTRY` namespace.
    * You mistyped the name of the function. Check the reference guide for the proper naming.
    * You are using an RBAC role that doesn’t have permissions to call the procedure.
    * You don’t have SAMOOHA_APP_ROLE.

Solution:
:   * Confirm that you spelled the procedure correctly and used the correct namespace.
    * Try switching to SAMOOHA_APP_ROLE to see whether you can run the procedure. If you can, then the issue is insufficient privileges on your current role. Ask someone with SAMOOHA_APP_ROLE to [grant you proper privileges](../manage-access.md).
    * To check if you have SAMOOHA_APP_ROLE, run the following command:

    ```sqlexample
    SELECT CURRENT_USER();
    SHOW GRANTS TO USER <current_user_name> ->> SELECT * FROM $1 WHERE "role" = 'SAMOOHA_APP_ROLE';
    ```

    If you don’t get any results, ask an administrator to give you API access to the collaboration.

## Code bundles

Error:
:   `CodeSpecAlreadyExistsException`

Cause:
:   Code bundle spec with same name and version already registered.

Solution:
:   Use a different version or update the existing version.

---

Error:
:   `SpecValidationError`

Cause:
:   YAML doesn’t conform to schema.

Solution:
:   Check required fields and format.

---

Error:
:   `CodeSpecStageNotAccessibleError`

Cause:
:   Stage referenced in artifact isn’t accessible.

Solution:
:   Grant access to stage or verify stage exists.

---

Error:
:   `CodeSpecArtifactNotFoundAtStageError`

Cause:
:   File not found at specified stage path.

Solution:
:   Upload file to stage before registering.

---

Error:
:   `StageDirectoryNotEnabledError`

Cause:
:   Stage doesn’t have DIRECTORY enabled.

Solution:
:   Enable directory on the stage: `ALTER STAGE ... SET DIRECTORY = (ENABLE = TRUE)`

---

Error:
:   `CodeSpecNotFoundForOwnerException`

Cause:
:   Template references unregistered code bundle spec.

Solution:
:   Register code bundle spec before registering template.

---
title: Troubleshooting Snowflake Data Clean Rooms
source: https://docs.snowflake.com/en/user-guide/cleanrooms/troubleshooting.md
section: Clean Rooms
---

# Troubleshooting Snowflake Data Clean Rooms

This page is a general troubleshooting guide when using clean rooms. If you are using the API, be sure to read the reference documentation for any procedures that you call, as well as the use case guidelines, to see if your issue is covered there.

## Installation issues

* See the [installation troubleshooting section](v1/enable-clean-rooms-ui.md).
* Confirm that you have [updated your network policy](admin-tasks.md) to allow the UI to access your data.

## Analysis and template issues

Error:
:   `Failure during expansion of shared view <CLEAN ROOM VIEW NAME> as view owner: Insufficient permission to resolve external/iceberg table <TABLE_NAME> shared by application SAMOOHA_CLEANROOM_APP_<CLEAN ROOM ID>`

Cause:
:   You are trying to access external or Iceberg tables, but External and Iceberg tables are not enabled in both the provider’s and consumer’s accounts.

Solution:
:   Ensure that both provider and consumer accounts have [enabled external and Iceberg tables](register-data.md).

---

Error:
:   `SQL compilation error: Failure during expansion of shared view '<CLEAN ROOM VIEW NAME>' as view owner: Object '<some object name>' does not exist or not authorized.`

Cause:
:   Clean room grants no longer exist on the dataset that you are trying to access. Most likely this is because the source object has been renamed or replaced.

Solution:
:   * If the table was renamed, change the name back to what is linked in the clean room. You might need to re-register the object as well.
    * If the table was recreated, register the object again in the clean room.

---

Error:
:   Query returns zero results and you think that’s wrong.

Possible causes and solutions:
:   * Confirm that neither side has a masking policy on the data that could prevent it from being joined or shown.
    * Confirm that the join columns are formatted the same way.
    * Confirm that you are not falling below any fixed threshold settings. The audience overlap has a default threshold of five, meaning that
      fewer than five rows will be omitted from the results. Ask the provider what the threshold is, and confirm whether you have any overlaps
      greater than that number; temporarily modify the overlap specifications to guarantee large segment groups to see whether you get
      results then.

---

Error:
:   `Uncaught exception of type 'STATEMENT_ERROR' ... SQL compilation error: invalid URL prefix found ...`

Cause:
:   The template uses a string value, rather than identifier, for a column or table name. This happens when the template doesn’t properly
    convert string variables into identifiers by using either the `sqlsafe` filter or the `IDENTIFIER` function.

    For example, passing in `p.col1` to `my_column` to the template `SELECT {{ my_column }} ...` resolves to
    `SELECT "p.col1" ...`. “p.col1” is a string, not a valid identifier (and `p.` is interpreted as a URL prefix).

Solution:
:   Apply either the IDENTIFIER function (preferred) or the `sqlsafe` filter to the variable:

    * `SELECT IDENTIFIER({{ my_column }}) ...` *(Preferred)*
    * `SELECT {{ my_column | sqlsafe }} ...`

---

Error:
:   `**FAILURE**: Unauthorized columns: column_name`

Cause:
:   Your query uses a column of collaborator data that is not part of the collaborator’s usage policy in the clean room. For example, you are
    trying to SELECT a column of collaborator data that is not in their column policy.

Solution:
:   Use a column that your collaborator has approved for projection, joining, or activation in your query. Inspect the template for policy
    filters by calling `consumer.view_template_definition`, then see which columns the provider allows you to project or join by calling
    `consumer.view_provider_join_policy` or `consumer.view_provider_column_policy`. Finally either update your query to pass in an
    approved column, or ask your collaborator to adjust their usage policy to include the column that you want to use.

---

Error:
:   `**FAILURE**: Invalid aliases: P.column name'` or `**FAILURE**: Invalid aliases: C.column name'`

Cause:
:   You are using uppercase `P` or `C` table aliases to scope a column name. You must use lowercase `p` or `c` aliases when scoping a column name. (The template itself can use either uppercase or lowercase when *declaring* the alias.)

Solution:
:   Always use a lowercase alias when scoping a column.

    > **Example:**

    ```sqlexample
    -- Always scope the column name with a lowercase alias.
    -- The casing of the alias declared for the table doesn't matter.

    -- These will fail.
    SELECT P.hashed_email FROM mydb.mysch.t1 AS P;
    SELECT P.hashed_email FROM mydb.mysch.t1 AS p;

    -- These will succeed.
    SELECT p.hashed_email FROM mydb.mysch.t1 AS P;
    SELECT p.hashed_email FROM mydb.mysch.t1 AS p;
    ```

---

Error:
:   `**FAILURE**: Invalid aliases: database name.schema name.column name`

Cause:
:   You must always reference columns using the `p` or `c` alias declared for a table. Columns can’t reference a table by its full path.

    **Invalid:** `SELECT hashed_email FROM mydb.mysch.t1;`

Solution:
:   Use the `p` or `c` (lowercase!) table alias when referencing a column:

    **Valid:** `SELECT p.hashed_email FROM mydb.mysch.t1 AS p;`

## Cross-cloud issues

Error:
:   `Analysis Execution Failure: 'SnowparkSQLException' due to Database Listing Conflict` in a single-account testing clean room

Cause:
:   Cross-Cloud Auto-Fulfillment is not supported with single-account testing clean rooms.

Solution:
:   Disable cross-cloud auto-fulfillment in this clean rooms account by calling `library.disable_laf_on_account` during testing, or don’t
    try to make cross-cloud procedure calls in this clean room.

## Cloud data connector issues

If you are having problems with an external data connector for AWS, Azure, or Google Cloud Storage, see [Snowflake Data Clean Rooms: Troubleshooting external data connectors](external-data-troubleshoot.md).

## Request log issues

Error:
:   `**Failure**: Request logs unable to be mounted. Try again.`

Cause:
:   Mounting request logs succeeded for internal clean room, then fails for external clean room in same account. Internal clean rooms have fewer requirements than external clean rooms. Your installation met the requirements for using internal clean rooms, but not for external clean rooms.

Solution:
:   Confirm that your email was validated by clean rooms, and that you fulfill all the [clean rooms account requirements](installing-dcr.md).

## Data access issues

### General guidelines about data access issues

You can get an error message that reports the inability to access a data source during several points in the usage flow:

**If the error occurred during the registration process:**

* You misspelled the table name or path, if using the API.
* If this is an external or Iceberg table, confirm that you have fulfilled the [requirements and procedure](register-data.md) for registering an external or Iceberg table.
* Confirm that your current role has the REFERENCE_USAGE privilege on the object being registered.

**If the error occurred during the linking process:**

* In the API, you might be using the wrong role.
* The object might not have been registered. In the API if you try to link an object that isn’t registered, you will see an error. In the UI, you should see only objects that have been registered as available for linking.
* Confirm that SAMOOHA_APP_ROLE has USAGE and SELECT privileges on your object.
* The table might have been moved, renamed, or had its permissions (or any Snowflake policy permissions) changed since registration. When this happens, you might also see the error `SQL access control error: Insufficient privileges to operate on table...`

**If the error occurred after data was successfully registered and linked:**

If using the API, confirm that you spelled the fully qualified table name correctly.

### Data access errors

Error:
:   `Object '<some_object_name>' does not exist or not authorized`

Cause:
:   The source table might have been moved, renamed, or had its permissions (or permissions on a policy or ancestor object that it depends on) changed.

Solution:
:   Try re-registering and re-linking the object in your account, or move the object back to the old location, or revert any additional permissions added.

---

Error:
:   `Insufficient permission to resolve external/iceberg table`

Cause:
:   If an external or Iceberg table is involved in your query, then the table was not registered properly.

Solution:
:   See [Enabling external and Apache Iceberg™ tables](register-data.md) to ensure that you fulfill the requirements and procedures to use these table types. You can sometimes resolve this by explicitly granting SELECT on the table to SAMOOHA_BY_SNOWFLAKE.

---

Error:
:   `not approved:unauthorized columns used` error as a result of run analysis

Cause:
:   You are joining or projecting a collaborator’s column against the collaborator’s join or column policy.

Solution:
:   View the join and column policies set by your collaborator by calling `consumer.view_provider_column_policy` and `consumer.view_provider_join_policy`.

    ```sqlexample
    CALL samooha_by_snowflake_local_db.consumer.view_provider_join_policy($cleanroom_name);
    CALL samooha_by_snowflake_local_db.consumer.view_provider_column_policy($cleanroom_name);
    ```

    You might have also exhausted your privacy budget:

    ```sqlexample
    CALL samooha_by_snowflake_local_db.consumer.view_remaining_privacy_budget($cleanroom_name);
    ```

---

Error:
:   When the consumer calls any procedure that takes a clean room name and gets
    `Application 'SAMOOHA_CLEANROOM_APP<some name>' does not exist or not authorized.`

Cause:
:   If the clean room name is reported as `SAMOOHA_CLEANROOM_APP<cleanroom name>` rather than `SAMOOHA_CLEANROOM_APP_<cleanroom name>`,
    (missing the underscore after `SAMOOHA_CLEANROOM_APP`), the *provider* did not install the clean rooms environment in the correct way
    in their account.

Solution:
:   Tell the provider that they should install the clean rooms environment in their account by following the instructions here:
    [Installing the Snowflake Data Clean Rooms environment](installing-dcr.md). After that, the provider can re-create and share the clean room.

## External and Iceberg table issues

Error:
:   `Failure during expansion of shared view <CLEAN ROOM VIEW NAME> as view owner: Insufficient permission to resolve external/iceberg table <TABLE_NAME> shared by application SAMOOHA_CLEANROOM_APP_<CLEAN ROOM ID>`

Cause:
:   External or Iceberg tables are not enabled in both the provider and consumer accounts.

Solution:
:   Ensure that both provider and consumer accounts have [enabled external and Iceberg tables](register-data.md).

---

Error:
:   Consumer gets `Invalid restricted feature 'external_data'` when linking in an external or Iceberg table.

Cause:
:   The provider has not yet enabled external and Iceberg tables.

Solution:
:   The provider must complete the process of [enabling external and Iceberg tables for their account](register-data.md). If they are doing this in code, the provider must check the security scan results and, if successful, must update the default release version.

---

Error:
:   `Insufficient permission to resolve external/iceberg table` error when running an analysis involving an external or Iceberg table.

Cause:
:   The table was probably not registered properly by both the provider and consumer.

Solution:
:   [Read the external and Iceberg table registration information](register-data.md) and be sure to follow all instructions on both the provider and consumer side.

---
title: Tutorial: Create and run a clean room using the clean rooms UI and a single account
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/tutorials/cleanroom-web-app-single-account-tutorial.md
section: Clean Rooms
---

Data Sharing

# Tutorial: Create and run a clean room using the clean rooms UI and a single account

## Introduction

This tutorial leads you through the basic steps to create and use a clean room using the clean rooms UI. Clean rooms enable users to
share data with a collaborator while maintaining the privacy of the data by tightly controlling what can be done with it.

### What you’ll learn

In this tutorial, you will learn how to use the clean rooms UI by doing the following actions:

* Add a collaborator to your clean room environment. In this tutorial, you will add yourself as a collaborator.
* Create a clean room, including how to add data, specify join policies, define which type of analysis a collaborator can run on the data,
  and share the clean room with a collaborator.
* Install a clean room, add data, and define how this data is joined with the collaborator’s data.
* Run an analysis.
* Activate the results of the analysis.

### About clean room collaborators

Clean room collaborators are either providers or consumers:

* A *provider* is the account that creates and configures the clean room. In a typical clean room, the provider adds all the SQL templates
  that the consumer can run in the clean room. The provider adds data, sets usage restrictions on it, and invites consumers, who can
  join the clean room to run the templates.
* A *consumer* is the account invited to participate in the clean room. The consumer adds their own data and runs the templates on the
  clean room data, according to the limitations set by the provider.

In this tutorial, you act as both the provider and the consumer in the clean room. In a real world clean room, the provider and consumer
would use separate accounts.

### Prerequisites

* You must have access to a Snowflake environment with the Snowflake Data Clean Rooms UI installed. You must either
  [install the environments yourself](../../installing-dcr.md), or ask an administrator to
  [grant you access to the clean rooms UI in a Snowflake account](../../manage-dcr-users.md).
* This tutorial uses a sample table named CUSTOMERS_2. Either search Snowsight for the table
  SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2, or run the following SQL command in your Snowflake account to confirm that you have this
  table installed:

  ```sqlexample
  SHOW TABLES LIKE 'CUSTOMERS_2' IN SCHEMA SAMOOHA_SAMPLE_DATABASE.DEMO;
  ```

  If the response has no rows, then you, or someone with ACCOUNTADMIN role, must run the following command to install the sample table:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
  ```

> **Note:**
>
> This tutorial uses a single account for both the provider and consumer in the clean room. This type of clean room,
> an *internal testing clean room*, is for testing purposes only, and can’t later be used in production or shared with other accounts.
> Internal testing clean rooms support [most, but not all clean room features](../developer-introduction.md). If you would
> like to try using clean rooms two separate Snowflake accounts, try the [two-account tutorial](cleanroom-web-app-tutorial.md).

## Sign in to the clean rooms UI

[Sign in to the clean rooms UI.](../web-app-introduction.md) Provide your Snowflake account credentials for an account where you
can act as a clean rooms provider. A provider has permission to create a clean room.

## Provider: Create and share a clean room

In this section, you will do the following actions as a provider:

* Create a clean room.
* Add data to the clean room that is being shared with collaborators.
* Define a join policy, which controls which columns a collaborator can join their own data with.
* Define which types of analyses a collaborator can run in the clean room.
* Share the clean room with the consumer.

### Start the creation process

To begin the process of creating a clean room:

1. In the left navigation, select Clean Rooms.
2. On the Clean Rooms page, select + Clean Room.
3. Name your clean room `Tutorial`.

   You will allow collaborators to run an audience overlap analysis in the clean room.

### Add data to your clean room

To add data to your clean room:

1. In the Datasource section, select `Snowflake`.
2. From the Tables drop-down list, select the SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS table.
3. Select Next.

> **Note:**
>
> If a table added here is deleted, renamed, moved, or has restrictive permissions added, the table will no longer be usable in the clean
> room unless you restore the old table with the same location, name, and permissions.

### Specify a join policy

A *join policy* specifies which columns of your data a collaborator can join on.

To specify a join policy:

1. From the Join Columns drop-down list, select the following columns:

   * `HASHED_EMAIL`
   * `HASHED_FIRST_NAME`
   * `HASHED_LAST_NAME`
   * `HASHED_PHONE`

   A collaborator can join their data with these columns only.
2. Select Next.

### Configure an analysis template

*Analysis templates* control how a collaborator can access the shared data in a clean room. Collaborators can only run analyses and queries
that conform to the template.

Select and configure an analysis template for clean room collaborators:

1. Select the `Audience Overlap & Segmentation` template.

   Collaborators are limited to running only the analyses that you select.
2. From the Tables drop-down list, select DEMO.CUSTOMERS.

   Collaborators can only analyze data in the DEMO.CUSTOMERS table.
3. From the Segmentation & Attribute Columns drop-down list, select the following columns:

   * `AGE_BAND`
   * `DEVICE_TYPE`
   * `EDUCATION_LEVEL`
   * `STATUS`

   A consumer can filter and create segments using these columns.
4. Enable Allow categorical value previews during filtering.
5. Select Next.

### Share and publish the clean room

Now that you have created and configured the clean room, you can share it with a collaborator so they can use it to run analyses.

To share the clean room with yourself:

1. Enable the Enable Internal Test Clean Room setting.
2. Select Finish.
3. In the dialog that opens, read the notes, then select Proceed to create the clean room.

> **Note:**
>
> If you were sharing this clean room with another account, you would take the following steps:
>
> 1. In the Select Collaborator drop-down list, select the consumer’s account name.
> 2. Select Finish.

### Monitor the status of the clean room

It takes a few minutes for the clean room to be created. During this time a Processing label is shown on the clean room tile in
the Created tab.

To check for status changes:

* Every few minutes, select Refresh

  When the tile label changes from Processing to an Edit button, you can continue to the consumer steps.

## Consumer: Install and configure the clean room

In this step, you switch from acting as the provider, who creates and shares a clean room, to acting as the consumer, who installs and
runs the clean room. Because this is an internal testing clean room, you will use the same Snowflake account for the provider and consumer.

As a consumer, you will do the following actions:

* Install the clean room that was shared with you by the provider.
* Add data to the clean room so that it can be joined with the provider’s data.
* Add a join policy to define how the consumer’s data and the provider’s data are related.
* Define the columns that analysts can use to create segments, filter results, and enrich activation data.

### Install the clean room as a consumer

Installation in the UI involves joining, configuring, and then installing the clean room.

To configure the clean room:

1. In the left navigation, select Clean Rooms.
2. Select the Invited tab.
3. Find the `Tutorial` tile, and select Join.

   If the clean room isn’t in your Invited tab, select Refresh. If it’s still not there, confirm that the clean room has an
   Edit button in the Created tab. If there is no edit button, you didn’t create the clean room as a provider.

### Add consumer data to the clean room

To add data to the clean room:

1. In the Datasource section, select `Snowflake`.
2. From the Tables drop-down list, select and then save SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2.

   * If this table is not in your list, see the Prerequisites section in the tutorial introduction to learn how to install it.
3. Select Next.

> **Note:**
>
> If a table added here is deleted, renamed, moved, or has restrictive permissions added, the table will no longer be usable in the clean
> room unless you restore the old table with the same location, name, and permissions.

### Define a join policy

Next, specify which consumer columns are joinable in an analysis or query in this clean room:

1. In the Specify Join Policies pane, in the Join Policies section, choose columns from your data (labeled My Columns)
   and equivalent columns from the provider’s table (labeled Collaborator Columns).
2. Ensure that the columns from the consumer’s table (My Columns) and the columns from the provider’s table
   (Collaborator Columns) match in content (the column names don’t need to match).

   For example, the consumer’s `HASHED_EMAIL` column should be joined with the provider’s
   `HASHED_EMAIL` column. All joined columns must match in the analysis that you select.
3. Select Next to navigate to the Configure Analysis & Query pane.

### Define the segmentation and activation columns

When you select segmentation and activation columns during the clean room installation process, you define which columns are available to
users running an analysis in the clean room. Analysts can create segments based only on these columns. When you send activation data back
to the provider, analysts can’t enrich the results of the analysis with data unless the data comes from one of these columns.

To define the segmentation and activation columns:

1. Select and then save the DEMO.CUSTOMERS_2 table from the Tables drop-down list.
2. From the Segmentation & Attribute Columns drop-down list, select and then save the following columns:

   * `INCOME_BRACKET`
   * `REGION_CODE`
   * `STATUS`
3. Select Finish to install the clean room.

   Installation takes a few minutes to complete.
4. Select Refresh every few minutes to check for changes.

   When the tile label changes from Processing to a Run button, the clean room is installed and you can run an analysis.

## Consumer: Run an analysis

In this step, you run an audience overlap and segmentation analysis in the clean room. You must first select the data to use in the
analysis.

To configure and run an analysis:

1. In the Joined tab, find the clean room tile and then select Run.
2. Select Audience Overlap & Segmentation » Proceed.
3. In My Tables, select CUSTOMERS.
4. In Collaborator’s Tables, select CUSTOMERS.
5. In Required Parameters » My Join Columns, define the following joins:

   1. From the drop-down list, select and then save `HASHED_EMAIL`.
   2. Select + Join Column, then select `HASHED_FIRST_NAME` and `HASHED_LAST_NAME`.
   3. Select + Join Column, then select `HASHED_PHONE`.

   When you run an analysis in the clean room, results include records where any of the following items are true:

   * The consumer’s `HASHED_EMAIL` matches the provider’s `HASHED_EMAIL`.
   * The consumer’s `HASHED_FIRST_NAME` matches the provider’s `HASHED_FIRST_NAME` and the consumer’s `HASHED_LAST_NAME`
     matches the provider’s `HASHED_LAST_NAME`.
   * The consumer’s `HASHED_PHONE` matches the provider’s `HASHED_PHONE`.
6. In the User Segmentation section, perform the following steps:

   1. From the My Columns drop-down list, select `INCOME_BRACKET`.
   2. From the Collaborator Columns drop-down list, select `AGE_BAND`.

   The results of the analysis are grouped into these segments.
7. In the Filters section, use the drop-down lists to specify `CUSTOMERS.STATUS = GOLD`.

   This limits analysis results to results where `STATUS = GOLD`.
8. Select Run.

   You can optionally choose a different warehouse size to run the analysis by changing the Warehouse
   drop-down selection.
9. In the Analyses & Queries page, when the status of your analysis is Completed:

   1. Select the analysis to see your results.
   2. Scroll to the Results section of the page. You can toggle the results to see either overlap or non-overlap rates.
   3. To see the segmentation groups of your analysis, select Download, and then open the comma-delimited file.
10. Continue to the next step to activate (send) enriched results to the consumer’s Snowflake account.

## Consumer: Activate the results to the consumer account

In this step, you activate the results of your analysis by pushing them to the consumer’s Snowflake account. These results are
enriched with data from the consumer and provider tables.

To activate the results of the analysis:

1. In the Results section for the analysis, select Activate.
2. In the Activation Hub section, select the name of your account.

   This section lists accounts and services where you can activate
   data to. The list can include [third-party activation connectors](../../connector-activation.md) that send data
   to services outside of Snowflake.
3. In the Segment Name field, enter `My test segment`, or another unique name for this result set.

   Copy and save the segment name that you provide here.
4. From the ID Columns drop-down list, select `HASHED_EMAIL`.
5. From the Attribute Columns drop-down list, select Select All.

   When you look at the results of the analysis, the matched records will be enriched with data from both consumer and provider tables.

   The available columns are the same as the segmentation and activation columns that you selected as the consumer when you installed the
   clean room.
6. Select Push Data.

Congratulations! You have now installed and configured a clean room in a consumer account, run an analysis, and pushed the results to
the consumer account for activation.

## View the activated data

In the previous step you activated to the consumer’s Snowsight account. Here is how to view the activated data by using either the
Snowflake web application or code:

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) with the same account where you ran the clean room. Use the Snowflake UI, not the clean
   rooms UI.
2. In the navigation menu, select Catalog » Database Explorer.
3. Search for `SAMOOHA_BY_SNOWFLAKE_LOCAL_DB`.
4. Navigate to PUBLIC » Tables » CONSUMER_DIRECT_ACTIVATION_SUMMARY.
5. Select Data Preview to view the activation data.

   * If you don’t see data there, confirm that you are using the same Snowflake account that you used to activate your data.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) with the same account where you ran the clean room. Use the Snowflake UI, not the clean
   rooms UI.
2. In the navigation menu, select Projects » Worksheets.
3. Select + SQL Worksheet.
4. to list the activation data that was pushed to the consumer’s clean room environment, paste and run the following statement
   into the new worksheet. Substitute the segment name that you entered when you ran the activation in the clean rooms UI.

   ```sqlexample
   SELECT *
      FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONSUMER_DIRECT_ACTIVATION_SUMMARY
      WHERE segment = '<your segment name>';
   ```

   If you don’t see data, confirm that you are using the same Snowflake account that you used to activate your data, and that you
   are using the segment name that you specified when you activated the results.

## Clean up

You can delete the clean room and activation data that you created for this tutorial to clean up your production environment.

### Delete the activation data

To delete the activation data from the provider’s Snowflake account:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) for the provider account. Sign in to the **Snowflake UI**, not the clean rooms UI.
2. In the navigation menu, select Projects » Worksheets.
3. Select + SQL Worksheet.
4. In the new worksheet, paste and run the following statement to delete the activation data created for this tutorial.
   Substitute your custom segment name in the location indicated:

   ```sqlexample
   DELETE FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.CONSUMER_DIRECT_ACTIVATION_SUMMARY
      WHERE segment = '<your segment name>';
   ```

### Delete the clean room

Deleting a clean room in the provider account removes it from both the provider account and the consumer account.

To delete a clean room:

1. In the clean rooms UI, in the left navigation, select Clean Rooms.
2. In the Created tab, find the clean room tile.
3. Select  » Delete » Proceed.

## Learn more

Congratulations! You have now used the clean rooms UI to create and share a clean room as a provider. You have also acted as a consumer
who is using the clean room to analyze data within a privacy-preserving environment.

For more information about Snowflake Data Clean Rooms, see the following resources:

* For general information, see [Overview of Provider and Consumer Clean Rooms](../../getting-started.md).
* For more information about the clean rooms UI, see [Clean rooms UI overview](../web-app-introduction.md).
* For information about using the developer APIs to work with a Snowflake Data Clean Room programmatically, see
  [Snowflake Data Clean Rooms developer’s guide](../developer-introduction.md).

---
title: Tutorial: Create and run a clean room using the clean rooms UI and two accounts
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/tutorials/cleanroom-web-app-tutorial.md
section: Clean Rooms
---

Data Sharing

# Tutorial: Create and run a clean room using the clean rooms UI and two accounts

## Introduction

Clean rooms enable users to share data with a collaborator while maintaining the privacy of the data by tightly controlling what can be
done with it. This tutorial leads you through the basic flow of using the clean rooms UI to work with a Snowflake Data Clean Room with two test
accounts, which enable the full functionality of clean rooms. If have access to only a single Snowflake account with clean rooms installed,
you can try the [single-account tutorial](cleanroom-web-app-single-account-tutorial.md) instead.

### What you will learn

In this tutorial, you will learn how to use the clean rooms UI by doing the following tasks:

* Add a collaborator to your clean rooms environment.
* Create a clean room, including how to add data, specify join policies, define which type of analysis a collaborator can run on the data,
  and share the clean room with a collaborator.
* Install a clean room, add data, and define how this data is joined with the collaborator’s data.
* Run an analysis.
* Activate the results of the analysis.

### About clean room collaborators

Clean room collaborators are either providers or consumers:

* A *provider* is the account that creates and configures the clean room. In a typical clean room, the provider adds all the SQL templates
  that the consumer can run in the clean room. The provider adds data, sets usage restrictions on it, and invites consumers, who can
  join the clean room to run the templates.
* A *consumer* is the account invited to participate in the clean room. The consumer adds their own data and runs the templates on the
  clean room data, according to the limitations set by the provider.

### Prerequisites

* You must have access to two Snowflake environment with the Snowflake Data Clean Rooms UI installed: one to use as a provider, and the
  other to use as a consumer. You must either [install the environments yourself](../../installing-dcr.md), or ask an
  administrator to [grant you access to the clean rooms UI in a Snowflake account](../../manage-dcr-users.md).
* This tutorial uses a sample table named CUSTOMERS_2. Either search Snowsight for the table
  SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2, or run the following SQL command in your Snowflake account to confirm that you have this
  table installed:

  ```sqlexample
  SHOW TABLES LIKE 'CUSTOMERS_2' IN SCHEMA SAMOOHA_SAMPLE_DATABASE.DEMO;
  ```

  If the response has no rows, then you, or someone with ACCOUNTADMIN role, must run the following command to install the sample table:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
  ```

## Provider: Sign in to the clean rooms UI

Sign in to the clean room where you will create, configure, and share a clean room as a provider.

[Sign in to the clean rooms UI.](../web-app-introduction.md) Provide your Snowflake account credentials for the account that
will act as the provider.

## Provider: Add the consumer as a collaborator

In this section you will add the consumer account that you are using for this tutorial as a collaborator. Administrators must define
someone as a collaborator *before* other users can share a clean room with that collaborator.

To add the consumer as a collaborator:

1. In the left navigation, select Collaborators.
2. Select the Snowflake Partners tab.
3. Select + Snowflake Partner.
4. In the Company Name field, enter `Tutorial Consumer`.
5. In the Email Address field, enter the email associated with your clean room user.
6. In the Account Locator field, enter the
   [account locator](../../../admin-account-identifier.md) of the Snowflake account that you are using as a consumer.
7. Select the cloud and region of the account that hosts your consumer account.
8. Select Add.

## Provider: Create and share a clean room

In this section, you will do the following:

* Create a clean room.
* Add data to the clean room that is shared with collaborators.
* Define a join policy, which controls which columns a collaborator can join with their own data.
* Define which types of analysis a collaborator can run in the clean room.
* Share the clean room with the consumer.

### Start the creation process

To begin the process of creating a clean room:

To begin the process of creating a clean room:

1. In the left navigation, select Clean Rooms.
2. On the Clean Rooms page, select + Clean Room.
3. Name your clean room `Tutorial`.

   You will allow collaborators to run an audience overlap analysis in the clean room.

### Add data to your clean room

To add data to your clean room:

1. In the Datasource section, select `Snowflake`.
2. From the Tables drop-down list, select the SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS table.
3. Select Next.

> **Note:**
>
> If a table added here is deleted, renamed, moved, or has restrictive permissions added, the table will no longer be usable in the clean
> room unless you restore the old table at the same location, name, and permissions.

### Specify a join policy

A *join policy* specifies which columns of your data a collaborator can join on.

To specify a join policy:

1. From the Join Columns drop-down list, select the following columns:

   * `HASHED_EMAIL`
   * `HASHED_FIRST_NAME`
   * `HASHED_LAST_NAME`
   * `HASHED_PHONE`

   A collaborator can join their data with these columns only.
2. Select Next.

### Configure an analysis template

*Analysis templates* control how a collaborator can access the shared data in a clean room. Collaborators can only run analyses and queries
that conform to the template.

Select and configure an analysis template for clean room collaborators:

1. Select the `Audience Overlap & Segmentation` template.

   Collaborators are limited to running only the analyses that you select.
2. From the Tables drop-down list, select DEMO.CUSTOMERS.

   Collaborators can only analyze data in the DEMO.CUSTOMERS table.
3. From the Segmentation & Attribute Columns drop-down list, select the following columns:

   * `AGE_BAND`
   * `DEVICE_TYPE`
   * `EDUCATION_LEVEL`
   * `STATUS`

   A consumer can filter and create segments using these columns.
4. Enable Allow categorical value previews during filtering.
5. Select Next.

### Share and publish the clean room

Now that you have created and configured the clean room, you can share it with a collaborator so they can use it to run analyses.

To share a clean room:

1. Use the Select Collaborator drop-down list to select `Tutorial Consumer`.
2. Select Finish.
3. Wait until the clean room is created before continuing with this tutorial. Periodically select Refresh until the
   `Tutorial` tile changes from Processing to Edit.

Congratulations! You have created and shared a Snowflake Data Clean Room.

Next, you will switch to act as the consumer who joins the clean room and uses it to analyze data.

## Consumer: Sign in to the clean rooms UI

In this section, you switch from being the provider who created and shared the clean room to acting as the consumer to install and run the
clean room.

* [Sign in to the clean rooms UI.](../web-app-introduction.md) Provide the Snowflake account credentials for the account that
  acts as the consumer.

## Consumer: Install and configure the clean room

In this section you will do the following actions:

* Install the clean room that was shared with you from the provider account.
* Add data to the clean room so it can be joined with the provider’s data.
* Add a join policy to define how the consumer data and the provider data are related.
* Define the columns that analysts can use to create segments, filter results, and enrich activation data.

### Start the installation process

To start installing a clean room that has been shared by the provider account:

1. In the left navigation, select Clean Rooms.
2. Select the Invited tab.
3. Find the `Tutorial` tile, and select Join.

### Add consumer data to the clean room

To add data to the clean room:

1. In the Datasource section, select `Snowflake`.
2. From the Tables drop-down list, select and then save SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2.

   * If this table is not in your list, see the Prerequisites section in the tutorial introduction to learn how to install it.
3. Select Next.

> **Note:**
>
> If a table added here is deleted, renamed, moved, or has restrictive permissions added, the table will no longer be usable in the clean
> room unless you restore the old table at the same location, name, and permissions.

### Define a join policy

Next, specify which consumer columns are joinable in an analysis or query in this clean room:

1. In the Specify Join Policies pane, in the Join Policies section, choose columns from your data (labeled My Columns)
   and equivalent columns from the provider’s table (labeled Collaborator Columns).
2. Ensure that the columns from the consumer’s table (My Columns) and the columns from the provider’s table
   (Collaborator Columns) match in content (the column names don’t need to match).

   For example, the consumer’s `HASHED_EMAIL` column should be joined with the provider’s
   `HASHED_EMAIL` column. All joined columns must match in the analysis that you select.
3. Select Next to navigate to the Configure Analysis & Query pane.

### Define the segmentation and activation columns

When you select segmentation and activation columns during the clean room installation process, you define which columns are
available to users running an analysis in the clean room. Analysts can create segments based on only these columns. When you send
activation data back to the provider, analysts cannot enrich the results of the analysis with data unless it comes from one of these
columns.

To define the segmentation and activation columns:

1. Select and then save the DEMO.CUSTOMERS_2 table from the Tables drop-down list.
2. From the Segmentation & Attribute Columns drop-down list, select and then save the following columns:

   * `INCOME_BRACKET`
   * `REGION_CODE`
   * `STATUS`
3. Select Finish to install the clean room.

   Installation takes a few minutes to complete.
4. Select Refresh every few minutes to check for changes.

   When the tile label changes from Processing to a Run button, the clean room is installed and you can run an analysis.

## Consumer: Run an analysis

In this step, you run an audience overlap and segmentation analysis in the clean room. You must first select the data to use in the
analysis.

To configure and run an analysis:

1. In the Joined tab, find the clean room tile and then select Run.
2. Select Audience Overlap & Segmentation » Proceed.
3. In My Tables, select CUSTOMERS.
4. In Collaborator’s Tables, select CUSTOMERS.
5. In Required Parameters » My Join Columns, define the following joins:

   1. From the drop-down list, select and then save `HASHED_EMAIL`.
   2. Select + Join Column, then select `HASHED_FIRST_NAME` and `HASHED_LAST_NAME`.
   3. Select + Join Column, then select `HASHED_PHONE`.

   When you run an analysis in the clean room, results include records where any of the following items are true:

   * The consumer’s `HASHED_EMAIL` matches the provider’s `HASHED_EMAIL`.
   * The consumer’s `HASHED_FIRST_NAME` matches the provider’s `HASHED_FIRST_NAME` and the consumer’s `HASHED_LAST_NAME`
     matches the provider’s `HASHED_LAST_NAME`.
   * The consumer’s `HASHED_PHONE` matches the provider’s `HASHED_PHONE`.
6. In the User Segmentation section, perform the following steps:

   1. From the My Columns drop-down list, select `INCOME_BRACKET`.
   2. From the Collaborator Columns drop-down list, select `AGE_BAND`.

   The results of the analysis are grouped into these segments.
7. In the Filters section, use the drop-down lists to specify `CUSTOMERS.STATUS = GOLD`.

   This limits analysis results to results where `STATUS = GOLD`.
8. Select Run.

   You can optionally choose a different warehouse size to run the analysis by changing the Warehouse
   drop-down selection.
9. In the Analyses & Queries page, when the status of your analysis is Completed:

   1. Select the analysis to see your results.
   2. Scroll to the Results section of the page. You can toggle the results to see either overlap or non-overlap rates.
   3. To see the segmentation groups of your analysis, select Download, and then open the comma-delimited file.
10. Continue to the next step to activate (send) enriched results to the consumer’s Snowflake account.

## Consumer: Activate the results

In this step you will activate the results of your analysis to the provider’s Snowflake account. These results are enriched with data from
the consumer and provider tables.

To activate the results of the analysis:

1. In the Results section, select Activate.
2. Select the name of the provider account you used to share the clean room.
3. In the Segment Name field, specify `My test segment`, or another unique name for this result set.
4. From the ID Columns drop-down list, select `HASHED_EMAIL`.
5. From the Attribute Columns drop-down list, select Select All.

   When the provider looks at the results of the analysis, the matched records will be enriched with the additional data found in these
   columns.

   > The available columns are the same as the segmentation and activation columns that you selected as the consumer when you installed the
   > clean room.
6. Select Push Data.

Congratulations! You have now installed and configured a clean room in a consumer account, run an analysis, and pushed the results back to
the provider account for activation.

## Provider: View the activated data

In the previous step you activated to the provider’s Snowsight account. Provider activation
data is stored in the SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY table of the provider’s Snowflake account.

**Prerequisite**

The first time a consumer activates data to a provider’s account, the provider must sign in to the clean rooms UI for about 30 minutes
*after* the consumer has activated data. This is needed only once per clean room per consumer account. Later activations by the same
consumer in the same clean room do not need this step. To activate data to the provider’s account you must create a pipeline between the
consumer account and the provider account.

After this prerequisite step, you can view the activated data in your provider account using either Snowsight or SQL:

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) for the provider account. (Use the **Snowflake UI**, not the clean room UI.)
   environment.
2. In the navigation menu, select Catalog » Database Explorer.
3. Navigate to `SAMOOHA_BY_SNOWFLAKE_LOCAL_DB` » `PUBLIC` » `Tables` » `PROVIDER_ACTIVATION_SUMMARY`.
4. Select Data Preview to view the activation data.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) for the provider account. You are signing in to the Snowflake account, not the clean room
   environment.
2. In the navigation menu, select Projects » Worksheets.
3. Select + SQL Worksheet.
4. In the new worksheet, paste and run the following statement to list the activation data that was pushed from the consumer’s
   clean room environment. Substitute the segment name that you entered when you ran the activation in the clean rooms UI.

   ```sqlexample
   SELECT *
      FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY
      WHERE segment = '<your segment name>';
   ```

## Clean up

You can delete the clean room and activation data that you created for this tutorial to clean up your production environment.

### Delete the activation data

To delete the activation data from the provider’s Snowflake account:

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md) for the provider account. You are signing in to the Snowflake account, not the clean room environment.
2. In the navigation menu, select Projects » Worksheets.
3. Select + SQL Worksheet.
4. In the new worksheet, paste and run the following statement to delete the activation data created for this tutorial.
   Substitute your custom segment name in the location indicated:

   ```sqlexample
   DELETE FROM SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.PUBLIC.PROVIDER_ACTIVATION_SUMMARY
      WHERE segment = '<your segment name>';
   ```

### Delete the clean room

Deleting a clean room in the provider account removes it from both the provider account and the consumer account.

To delete a clean room:

1. [Sign in to the clean rooms UI.](../web-app-introduction.md)
2. In the left navigation, select Clean Rooms.
3. On the Created tab, find the `Tutorial` tile and select the More icon ().
4. Select Delete.
5. Select Proceed.

## Learn more

Congratulations! You have now used the clean rooms UI to create and share a clean room as a provider. You have also acted as the consumer
who is using the clean room to analyze data within a privacy-preserving environment.

You can use the following resources to learn more:

* For general information, see [Overview of Snowflake Data Clean Rooms](../../overview.md).
* For more information about the clean rooms UI, see [Clean rooms UI overview](../web-app-introduction.md).
* For information about using the developer APIs to work with a Snowflake Data Clean Room programmatically, see
  [Snowflake Data Clean Rooms developer’s guide](../developer-introduction.md).

---
title: Tutorial: Get started with collaboration clean rooms (API)
source: https://docs.snowflake.com/en/user-guide/cleanrooms/tutorials/collaboration-basic-api-tutorial.md
section: Clean Rooms
---

clean rooms

# Tutorial: Get started with collaboration clean rooms (API)

## Introduction

This tutorial is aimed at developers who want to create and use collaboration clean rooms using the Snowflake Data Clean Rooms API.
You will work through a two-account scenario where two collaborators share data, register templates, and run analyses.

### What you will learn

This tutorial shows you how to:

* Register data offerings and templates in the clean rooms registry.
* Create a collaboration using a YAML collaboration specification.
* Join a collaboration from a second account.
* Link templates and data offerings to an existing collaboration.
* Run analyses as different collaborators with different permissions.

### Requirements to run this tutorial

* Two Snowflake accounts, Enterprise Edition or higher, each with the Snowflake Data Clean Rooms environment installed.
  If clean rooms isn’t installed, see [Installing the Snowflake Data Clean Rooms environment](../installing-dcr.md).
* The SAMOOHA_APP_ROLE must be granted to the user in each account.

> **Note:**
>
> This tutorial requires **two separate Snowflake accounts**. You will run Alice’s steps in one account and Bob’s steps in the other.
> Each section heading indicates which account to use.

The tutorial includes code snippets with `<placeholders>` that you should replace with the appropriate values.

## Collaboration basics

A collaboration clean room allows multiple parties to share and analyze data securely without exposing raw data to each other.
Collaborations are defined by a YAML specification that lists the collaborators, their data, and what each party can do.

Key concepts used in this tutorial:

* **Collaboration specification**: A YAML document that defines the collaborators, their aliases, roles, data offerings, and templates.
* **Collaboration roles**: Each collaborator is assigned one or more roles:

  + **Owner**: Creates and manages the collaboration. There is exactly one owner per collaboration.
  + **Data provider**: Contributes data offerings that other collaborators can use in analyses.
  + **Analysis runner**: Runs templates against the shared data. Each analysis runner has a list of data providers and templates
    available to use.

    In this tutorial:

    - **Alice** is the collaboration **owner**, a **data provider**, and an **analysis runner**.
    - **Bob** is a **data provider** and an **analysis runner**.
* **Data offering**: A table linked into the collaboration by a data provider.
* **Template**: A registered JinjaSQL query that analysis runners execute against a data offering.

In this tutorial, Alice and Bob each contribute one data offering. Alice registers a template that only Alice can run.
Bob registers a template that both Alice and Bob can run. Both templates join the two collaborators’ data on a shared column.

## Alice: Register resources

Run the following steps in **Alice’s account** to set up the session environment and create sample data:

```sqlexample
USE WAREHOUSE APP_WH;
USE ROLE SAMOOHA_APP_ROLE;

-- Secondary roles must be disabled to call link_data_offerings.
USE SECONDARY ROLES NONE;

-- Create sample data for Alice.
CREATE DATABASE IF NOT EXISTS ALICE_DB;
CREATE SCHEMA IF NOT EXISTS ALICE_DB.ALICE_SCH;
CREATE OR REPLACE TABLE ALICE_DB.ALICE_SCH.ALICE_DATA AS
  SELECT * FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS LIMIT 100;
```

> **Note:**
>
> In real-world usage, we recommend [assigning more fine-grained privileges](../collaboration-api-reference.md) to your
> users instead of using the top-level SAMOOHA_APP_ROLE role.

Next, you will register a data offering and a template to use in the collaboration.

### Register a data offering

A data offering is a registered dataset with column-level policies that control how collaborators can use the data.
You will create a data offering from the sample data you created.

Register Alice’s data offering so that it can be included in the collaboration specification. Data offerings are defined by a YAML data
offering specification.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v1
    name: alice_customer_data
    datasets:
     - alias: customer_list
       data_object_fqn: ALICE_DB.ALICE_SCH.ALICE_DATA
       object_class: custom
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_standard
           column_type: hashed_email_b64_encoded
         status:
           category: passthrough
    $$
    );
```

The data-offering specification defines the following properties:

* `datasets`: A list of tables or views to share.
* `alias`: A short name used to reference this dataset within this spec and by templates.
* `allowed_analyses`: Restricts usage to templates only (no free-form SQL).
* `schema_and_template_policies`: Defines the format, naming, and availability of columns from this data source.
  The data offering exposes only two columns from your source table: `hashed_email` (which must be used as a join column) and `status`.

Save the data offering ID from the response. You will need it when creating the collaboration specification.

```sqlexample
-- Save the ID.
SET alice_data_offering_id = '<alice_data_offering_id>';

-- View your registered data offerings.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_DATA_OFFERINGS();
```

### Register a template

A template is a JinjaSQL query that analysis runners execute in the collaboration. Register a template that joins two tables
on the hashed email column and counts matches grouped by status:

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
$$
api_version: 2.0.0
spec_type: template
name: alice_only_template
version: v1
type: sql_analysis
description: Joins two tables on hashed email and counts matches grouped by status.
template:
  SELECT T.status, COUNT(*)
    FROM IDENTIFIER( {{ source_table[0] }} ) AS T
      JOIN IDENTIFIER( {{ source_table[1] }} ) AS T1
      ON T.hashed_email_b64_encoded = T1.hashed_email_b64_encoded
    GROUP BY T.status;
$$);
```

The template uses two tables `source_table[0]` and `source_table[1]`. These are data offerings present in the collaboration. The analysis runner passes in the names of the tables to use when they run the analysis.

> **Note:**
>
> This tutorial uses `T` and `T1` as table aliases for simplicity. In production templates, you should use the
> [standard aliases](../custom-templates.md) `p`, `p1`, `p2`, and so on, which are required for
> Snowflake Data Clean Room policy enforcement.

Save the template ID from the response:

```sqlexample
-- Save the ID.
SET alice_template_id = '<alice_only_template_id>';

-- View all registered templates.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.VIEW_REGISTERED_TEMPLATES();
```

## Alice: Create the collaboration

Continue in **Alice’s account**.

Now that the resources are registered, Alice creates the collaboration. The collaboration is defined by a YAML specification
that lists the collaborators, their roles, data offerings, and templates.

Call INITIALIZE with the collaboration specification. Review the YAML carefully before running it:

```sqlexample-yaml
-- Replace the <...> placeholders with the appropriate values.
-- Get your account data sharing ID:
--   SELECT CURRENT_ORGANIZATION_NAME() || '.' || CURRENT_ACCOUNT_NAME();

CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.INITIALIZE(
$$
api_version: 2.0.0
spec_type: collaboration
name: api_tutorial_collaboration
owner: alice
collaborator_identifier_aliases:
  alice: <alice_account_data_sharing_id>
  bob: <bob_account_data_sharing_id>
analysis_runners:
  alice:
    data_providers:
      alice:
        data_offerings:
        - id: <alice_data_offering_id>
      bob:
        data_offerings: []
    templates:
    - id: <alice_only_template_id>
  bob:
    data_providers:
      alice:
        data_offerings:
        - id: <alice_data_offering_id>
      bob:
        data_offerings: []
$$,
'APP_WH'
);
```

### Understanding the collaboration specification

The collaboration specification uses the aliases defined in the `collaborator_identifier_aliases` section to refer to all collaborators.

The collaboration defines the following roles and relationships:

* `analysis_runners` lists the collaborators that can run analyses in this collaboration. Only collaborators listed at the top level here
  can run analyses. The list of analysis runners cannot be modified after the collaboration is created.
* Each analysis runner entry has the following elements:

  + `data_providers`: Only these data providers can supply data to this analysis runner. This list cannot be modified later.
  + `data_offerings`: Only these data offerings from the listed data providers can supply data to this analysis runner. The data
    offerings list can be updated after a collaboration is created.
* Only the templates listed for an analysis runner can be used. The template list can be modified later. Notice that this collaboration
  currently shares the `alice_only_template` with Alice.

### Wait for the collaboration to be created and joined

INITIALIZE creates and joins the owner to the collaboration. This process is asynchronous.
Call GET_STATUS until Alice’s status is `JOINED`:

```sqlexample
SET collaboration_name = '<collaboration_name>';

-- Check status. Repeat until the status is JOINED.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- Verify the collaboration is visible.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS();
```

### Enable template auto-approval

Whenever a collaborator asks to share a template with you in a collaboration, all designated recipients must approve the request before the
template is shared. This tutorial doesn’t cover the approval flow, so run the following code to automatically approve all requests:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ENABLE_TEMPLATE_AUTO_APPROVAL(
  $collaboration_name
);
```

## Bob: Join the collaboration

Switch to **Bob’s account** and run the following steps.

Set up the session environment:

```sqlexample
USE WAREHOUSE APP_WH;
USE ROLE SAMOOHA_APP_ROLE;

-- Secondary roles must be disabled to call join or link_data_offering.
USE SECONDARY ROLES NONE;
```

View the collaboration invitation, then join it:

```sqlexample
-- See which collaborations you are invited to, or have joined.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_COLLABORATIONS();

-- Add the SOURCE_NAME and OWNER_ACCOUNT values from the response.
SET collaboration_name = '<collaboration_name>';
SET collaborator_data_sharing_id = '<alice_account_data_sharing_id>';

-- Review and join the collaboration.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.REVIEW(
  $collaboration_name,
  $collaborator_data_sharing_id
);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.JOIN($collaboration_name);
```

Joining is asynchronous. Call GET_STATUS until Bob’s status is `JOINED`:

```sqlexample
-- Check status. Repeat until the status is JOINED.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);
```

## Bob: Link a template

Continue in **Bob’s account**.

Bob creates and registers a template. This template joins two tables on the hashed email column,
and returns a cross-tabulation of statuses from both tables. Templates are written in JinjaSQL; a template specification is written in
YAML, which contains the embedded JinjaSQL template.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
    $$
    api_version: 2.0.0
    spec_type: template
    name: bob_shared_template
    version: v1
    type: sql_analysis
    description: Cross-tabulates statuses from two tables joined on hashed email.
    template:
      SELECT T.status AS status_1, T1.status AS status_2, COUNT(*) AS match_count
        FROM IDENTIFIER({{ source_table[0] }}) AS T
          JOIN IDENTIFIER({{ source_table[1] }}) AS T1
          ON T.hashed_email_b64_encoded = T1.hashed_email_b64_encoded
        GROUP BY T.status, T1.status
        ORDER BY match_count DESC;
    $$
);
SET bob_template_id = '<bob_shared_template_id>';
```

Now request to link the template to the collaboration, and share it with both Alice and Bob. (Notice how you must explicitly request to
share a template with yourself; templates are not automatically shared with the account that registered them.)

Because Alice enabled template auto-approval, Alice automatically approves the request:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.ADD_TEMPLATE_REQUEST(
  $collaboration_name,
  $bob_template_id,
  ['alice', 'bob']
);
```

The template should be approved and added shortly.

```sqlexample
-- View the status of update requests.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_UPDATE_REQUESTS($collaboration_name);
```

## Bob: Link a data offering

Continue in **Bob’s account**.

Create sample data and register a data offering:

```sqlexample-yaml
-- Create sample data.
CREATE DATABASE IF NOT EXISTS BOB_DB;
CREATE SCHEMA IF NOT EXISTS BOB_DB.BOB_SCH;
CREATE OR REPLACE TABLE BOB_DB.BOB_SCH.BOB_DATA AS
  SELECT * FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2 LIMIT 100;

-- Register Bob's data offering.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_DATA_OFFERING(
    $$
    api_version: 2.0.0
    spec_type: data_offering
    version: v1
    name: bob_customer_data
    datasets:
     - alias: my_customer_list
       data_object_fqn: BOB_DB.BOB_SCH.BOB_DATA
       object_class: custom
       allowed_analyses: template_only
       schema_and_template_policies:
         hashed_email:
           category: join_standard
           column_type: hashed_email_b64_encoded
         status:
           category: passthrough
    $$
);
SET bob_data_offering_id = '<bob_data_offering_id>';
```

The collaboration specification lists `bob` as a potential data provider for both `alice` and `bob`. Link the data offering into the
collaboration, and share it with both `alice` and `bob`:

```sqlexample
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.LINK_DATA_OFFERING(
  $collaboration_name,
  $bob_data_offering_id,
  ['alice', 'bob']
);

-- Verify that both data offerings are now available.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS(
  $collaboration_name
);
```

You don’t need approvals when you share data with other collaborators.

## Alice: Run an analysis

Switch back to **Alice’s account**.

Alice runs the `alice_only_template`, which is available only to Alice. The template joins Alice’s data offering
with Bob’s data offering on the hashed email column and groups results by status.

First, view the available data offerings and templates:

```sqlexample
-- View available data offerings.
-- Note the view names in the TEMPLATE_VIEW_NAME column; you need these for the analysis spec.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS(
  $collaboration_name
);

-- View available templates.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES(
  $collaboration_name
);
```

Now, run the analysis. Replace the placeholders with actual values. Replace the `source_tables` names with view names from
VIEW_DATA_OFFERINGS.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
  $collaboration_name,
    $$
    api_version: 2.0.0
    spec_type: analysis
    description: Alice runs the alice_only_template with both data offerings.
    template: '<alice_only_template_id>'
    template_configuration:
      view_mappings:
        source_tables:
          - '<alice_data_offering_view_name>'
          - '<bob_data_offering_view_name>'
    $$
  );
```

The results show the count of matching records grouped by status from Alice’s data.

## Bob: Run an analysis

Switch to **Bob’s account**.

Bob runs the `bob_shared_template`, which is available to both collaborators. This template cross-tabulates the statuses from
both tables.

Replace the `source_tables` placeholders with actual view names from VIEW_DATA_OFFERINGS.

```sqlexample-yaml
-- View available data offerings and templates.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_DATA_OFFERINGS(
  $collaboration_name
);
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.VIEW_TEMPLATES(
  $collaboration_name
);

-- Run the analysis.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.RUN(
    $collaboration_name,
    $$
    api_version: 2.0.0
    spec_type: analysis
    description: Bob runs the bob_shared_template with both data offerings.
    template: '<bob_shared_template>'
    template_configuration:
      view_mappings:
        source_tables:
          - '<alice_data_offering_view_name>'
          - '<bob_data_offering_view_name>'
    $$
);
```

The results show a cross-tabulation of statuses from both data offerings, with the count of matching records for each combination.

Bob cannot run `alice_only_template` because Alice did not include Bob as a permitted user
for that template in the collaboration specification. Try running it to see what happens.

## Alice: Clean up resources

Switch to **Alice’s account** to clean up all the resources used.

Tearing down a collaboration is a multi-step process. Call TEARDOWN, wait for the status to reach `LOCAL_DROP_PENDING`,
and then call TEARDOWN again:

```sqlexample
-- Start the teardown process.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);

-- Check status. Repeat until the status is LOCAL_DROP_PENDING.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.GET_STATUS($collaboration_name);

-- Complete the teardown.
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.COLLABORATION.TEARDOWN($collaboration_name);

-- Clean up the sample database.
DROP DATABASE IF EXISTS ALICE_DB;
```

Switch to Bob’s account and clean up the sample database:

```sqlexample
DROP DATABASE IF EXISTS BOB_DB;
```

> **Note:**
>
> Tearing down the collaboration doesn’t delete registered templates or data offerings from either account’s registry.

## Summary

In this tutorial, you learned how to:

* Register templates and data offerings in the clean rooms registry.
* Create a collaboration with a YAML specification that defines collaborators, roles, data, and templates.
* Join a collaboration from a second account.
* Link templates and data offerings to an existing collaboration using update requests.
* Run analyses as different collaborators with different levels of access.

### Next steps

* You can find full running code samples in [Sample Notebooks and Worksheets](../tutorials-and-samples.md).
* Learn more about collaboration specifications in the [Overview of Snowflake Data Clean Rooms](../overview.md).
* Explore the full [Snowflake Data Clean Rooms Collaboration API](../collaboration-api-reference.md).
* Learn about [collaboration roles](../roles.md) and how to manage access.
* Learn how to [activate query results](../activation.md) to export data to a collaborator’s account.

---
title: Tutorial: Get started with Snowflake Data Clean Rooms in code
source: https://docs.snowflake.com/en/user-guide/cleanrooms/tutorials/cleanroom-api-tutorial-basic.md
section: Clean Rooms
---

clean rooms

# Tutorial: Get started with Snowflake Data Clean Rooms in code

## Introduction

This tutorial is aimed at developers who will create or use Snowflake Data Clean Rooms in code. This tutorial uses SQL code, but you can
adapt the information shown here to create and use clean rooms in any coding language supported by Snowflake.

### What you will learn

This tutorial shows you how to create and share a basic template in a clean room using the Snowflake Data Clean Room API. It also
shows you how to run an analysis using the API in a clean room shared with you.

This tutorial creates a clean room with one table provided by the provider, one table provided by the consumer, and a template defined
by the provider that defines a very simple JOIN query on the two tables.

### Requirements

* You should have a basic understanding of Snowflake and you should also read
  [About Snowflake Data Clean Rooms](../overview.md) before starting this tutorial.
* You must have access to a Snowflake account, Enterprise Edition or higher, with the Snowflake Data Clean Rooms native app and API
  installed. If you don’t have the clean rooms app installed, you can either
  [install it yourself](../getting-started.md), or else ask a Snowflake administrator to install it for you.
* You must be granted the SAMOOHA_APP_ROLE to use the clean rooms API.
* This tutorial uses a sample table named CUSTOMERS_2 that is installed with the clean rooms environment. Search your account
  for the table SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2 using the following command:

  ```sqlexample
  SHOW TABLES LIKE 'CUSTOMERS_2' IN SCHEMA SAMOOHA_SAMPLE_DATABASE.DEMO;
  ```

  If the response has no rows, then you, or someone with ACCOUNTADMIN role, must run the following command to install the sample table:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  EXECUTE IMMEDIATE FROM @SAMOOHA_BY_SNOWFLAKE.APP_SCHEMA.MOUNT_CODE_STAGE/dcr_loader.sql;
  ```

This tutorial uses the same account to act as both a provider and a consumer in a clean room. This scenario is supported only
for testing purposes and has [limitations on what features it supports](../v1/developer-introduction.md), compared to using
separate accounts. In the real world, providers and consumers use different accounts, and for more advanced testing you might need to use
separate accounts.

You can [`download this tutorial as a worksheet file`](../../../_downloads/980474433a279b8dd7a9409b77b0f54d/internal-testing-cleanroom.ipynb) to run in your
Snowflake account.

## Provider: Overview

Here is a summary of the steps that you’ll take to create a clean room as the provider:

1. Create test data to share in your clean room.
2. Create your clean room.
3. Set join permissions on your data to specify which columns can be joined on in consumer queries.
4. Create a template for your clean room. A clean room template is written in JinjaSQL and it evaluates to a SQL query at run time. Most
   templates include variables that allow collaborators to specify table and column names, WHERE clause conditions, and more, at run time.
   A clean room collaborator chooses and runs a template in a clean room.
5. Specify the default version of the clean room.
6. Add consumers who can access your clean room. In this tutorial consumers must be Snowflake users with accounts approved by your clean room administrator.
7. Publish the clean room to make it available to your invited consumers.

> **Note:**
>
> The term *collaborator* is used above for templates because, depending on how the clean room is configured, both providers and consumers
> can create or run templates. This tutorial shows only how to enable consumer-run templates.

## Provider: Create the clean room

[Sign in to Snowsight](https://app.snowflake.com) as a user granted the SAMOOHA_APP_ROLE role. If you don’t have that role, ask your
account administrator to grant it to you.

[Create a new SQL worksheet](../../ui-snowsight-worksheets-gs.md) in Snowsight to hold your clean room code. Name the worksheet
“API tutorial - Provider”.

The following snippet creates a clean room that is accessible only within the organization (so it’s marked as INTERNAL). To share a
clean room outside of an organization requires additional steps that won’t be covered in this tutorial. When sharing a clean room with
yourself, it must be INTERNAL, of course.

You must use the SAMOOHA_APP_ROLE for most clean room procedures.

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
SET cleanroom_name = 'Developer Tutorial';
CALL samooha_by_snowflake_local_db.provider.cleanroom_init($cleanroom_name, 'INTERNAL');
```

## Provider: Bring data into the clean room

Next, bring your test data into the clean room. There are two steps to bring data into a clean room:

1. Register the data.
2. *Link* (import) the data into the clean room.

### Register the data

The first step in importing data is to *register* the database, schema, or object in the clean room account. Registering is to grant clean rooms the right to read and use the source data. You can register an entire database, a schema, a table, or a view.

You are using sample data installed with the clean room, which is pre-registered for you, so there’s no need to register the sample data in this tutorial.

### Link the data into the clean room

Importing data into a clean room is called *linking*. Both providers and consumers can link their data into a clean room. The generic term
for a view or table linked into a clean room is a *dataset*.

When you link data, the clean room creates a read-only view linked to your source data. This clean room view is a
secure, encrypted view inside the clean room, accessible only to templates within the clean room. Your template
accesses this secure view, not the source data, although the original source name is used whenever you need to reference the data.

Unlike registering, linking is done at the individual table or view level. You can link multiple items in one call.

Link clean room sample data into the clean room:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.link_datasets($cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS']);

CALL samooha_by_snowflake_local_db.provider.view_provider_datasets($cleanroom_name);
```

> **Note:**
>
> If a table linked into a clean room is deleted, renamed, moved, or has restrictive permissions added, the table can’t be used in the
> clean the clean room unless you restore the old table with the same location, name, and permissions.

## Provider: Set join policies on the data

Both providers and consumers can specify *join policies* on their own data. A clean room join policy specifies which columns in a table can
be joined on by your collaborators’ queries in that clean room. This provides an extra level of control over how others can use your data
in the clean room. Your own policies are not enforced on your own queries – that is, join policies on your own data are
ignored when you run a query; your policies are enforced only on queries run by other users.

Clean room join policies are set on the table, and apply to all clean rooms where the
table is used. Any columns not listed here cannot be joined using INNER JOIN or OUTER JOIN conditions in the clean room if the template
explicitly checks join policies.

Note that clean room join policies are not the same as [Snowflake join policies](../../join-policies.md); clean room policies
specify which columns *can* be joined on; Snowflake join policies specify which columns *can’t* be joined on.

> **Tip:**
>
> **Snowflake** policies set on the source table are retained in the linked clean room table, but aren’t reported to
> collaborators. That is, **Snowflake** join policies are enforced but are not reported by `consumer.view_provider_join_policy`, which
> reports only the provider’s **clean room** join policies. Therefore you should let your collaborators know about any Snowflake policies
> that you have set on your data.

Specify joinable columns for a table using the format
`database_name.schema_name.table_or_view_name:column_name` for each column. The following example allows three columns
of provider data to be joinable:

```sqlexample
-- Limit joinable columns in this table to age_band, region_code, and device_type
CALL samooha_by_snowflake_local_db.provider.set_join_policy($cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:REGION_CODE',
   'SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DEVICE_TYPE']);

CALL samooha_by_snowflake_local_db.provider.view_join_policy($cleanroom_name);
```

## Provider: Add your template

A clean room template is a JinjaSQL template that evaluates to a SELECT query. This query has access to all datasets linked into the clean
room, subject to join and column policies.

This tutorial won’t cover the details of designing a JinjaSQL template, but here is the SQL query that you’re trying to implement:

```sqlexample
SELECT
  COUNT(*),
  group_by_col
FROM Consumer_Table AS C
  INNER JOIN Provider_Table AS P
    ON c.join_col = p.join_col
GROUP BY group_col;
```

The query simply joins one provider and one consumer table on a specified join column, groups by a specified grouping column, and projects
the group value and count of each group. This is the query that will run in the clean room when the user runs the template.

Here is the JinjaSQL template for the same query, with variables added where the consumer can specify tables or columns. After the consumer
specifies the variables, it will evaluate to a SQL query similar to the one above, but with the table and column names provided by the
consumer.

```sqlexample
SELECT
  COUNT(*),
  IDENTIFIER({{group_by_col | column_policy}})
FROM IDENTIFIER({{my_table[0]}}) AS C
INNER JOIN
  IDENTIFIER({{source_table[0]}}) AS P
    ON IDENTIFIER({{consumer_join_col | join_policy}}) = IDENTIFIER({{provider_join_col | join_policy}})
GROUP BY IDENTIFIER({{group_by_col | column_policy}});
```

A few notes on the template:

* Content surrounded by {{brackets}} are named variables passed in by the consumer when they run the template. The following variables are
  passed in by the consumer: `group_by_col`, `consumer_join_col`, `provider_join_col`
* The `my_table` and `source_table` arrays are global variables created by the system, populated with consumer and
  provider table names passed in by the caller. These tables must be linked into the clean room by the consumer and provider.
* All provider tables must be aliased as `p` in the query. All consumer tables must be aliased as `c`. If you use multiple tables, alias
  them with a 1-based suffix, so: `p`, `p1`, `p2`, `p3` and so on for provider tables, and `c`, `c1`, `c2`, `c3` and so on
  for consumer table aliases. (`p` and `p0` are equivalent.)
* Snowflake Data Clean Rooms supports some custom JinjaSQL *filters* that act on variables. The `column_policy` and `row_policy`
  filters verify that the columns they are applied to conform to the column and row policies in that clean room, or else the request to run
  the template will fail. So `{{ consumer_join_col | join_policy }}` verifies that the value passed in to `consumer_join_col` conforms
  to the join policies set by the provider and consumer in this clean room.
* Variables used as identifiers must be processed by the [IDENTIFIER](../../../sql-reference/identifier-literal.md) function before they can be
  used in SQL.

Add the template to the clean room:

```sqlexample
-- Add the template
SET template_name = 'overlap_template';
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    $template_name,
    $$
    SELECT
      COUNT(*),
      IDENTIFIER({{group_by_col | column_policy}})
    FROM IDENTIFIER({{my_table[0]}}) AS C
    INNER JOIN
      IDENTIFIER({{source_table[0]}}) AS P
      ON IDENTIFIER({{consumer_join_col | join_policy}}) = IDENTIFIER({{provider_join_col | join_policy}})
    GROUP BY IDENTIFIER({{group_by_col | column_policy}});
    $$);

CALL samooha_by_snowflake_local_db.provider.view_added_templates($cleanroom_name);
```

## Provider: Set column policies

Each party in the clean room can limit which columns the other parties can project by setting a *column_policy*. A column
policy in a clean room lists all the columns of your data that can be projected; no other columns can be projected. If you do not specify
a column policy for your data, all your data can be projected.

A column policy is tied to a specific table and template in a clean room. You can allow different columns to be projected in different
templates. The same column cannot be in both a join and a column policy.

Note that column and join policies are enforced only if the template uses the `column_policy` and `row_policy` filters in the template.

Here is how to allow projection of three columns of your data in the template you just created. Column syntax is
`template_name:table_name:column_name`

```sqlexample
-- Set column policies. Column policies are tied to a specific template and table, so we
-- needed to add the template first.
CALL samooha_by_snowflake_local_db.provider.set_column_policy($cleanroom_name,
  [$template_name || ':SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   $template_name || ':SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE']);

CALL samooha_by_snowflake_local_db.provider.view_column_policy($cleanroom_name);
```

## Provider: Add a release directive

Every clean room has a [version number](../dcr-versions.md), consisting of major, minor, and patch values. You must
specify which version of the clean room is served to consumers: this is called the *default release directive*.

This is the first version, so the version number is 1.0.0.

```sqlexample
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name,
  'V1_0',
  '0');
```

Snowflake creates a new version of the clean room each time you upload code into the clean room. If you want users to get the latest
version, you’ll need to set a new default release directive with the newest version number. You won’t be uploading code, so you won’t need
to call this again for this tutorial.

## Provider: Designate consumers

Now you will specify who has access to your clean room as a consumer. For this tutorial you will add yourself as a consumer. Doing so marks
the clean room as an internal testing clean room, used only for testing, which
[limits some of its functionality](../v1/developer-introduction.md), but it will support all the features needed for this
tutorial.

The procedure needs two arguments to identify each consumer:

* The consumer’s account locator. Get your account locator like this:

  ```sqlexample
  SELECT CURRENT_ACCOUNT();
  ```
* The consumer’s [consumer data sharing account ID](../../admin-account-identifier.md), in the format `org_name.account_name`.
  Get your consumer data sharing account ID in the proper format like this:

  ```sqlexample
  SELECT CURRENT_ORGANIZATION_NAME() || '.' || CURRENT_ACCOUNT_NAME();
  ```

Now share the clean room with yourself as a consumer, adding your account locator and consumer data sharing account ID where indicated:

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_consumers(
  $cleanroom_name,
  '<CONSUMER_LOCATOR>',
  '<CONSUMER_DATA_SHARING_ACCOUNT_ID>');

CALL samooha_by_snowflake_local_db.provider.view_consumers($cleanroom_name);
```

## Provider: Publish the clean room

Finally, publish the clean room. This makes the clean room available to the consumer you added above. The procedure takes a minute
or more to complete.

```sqlexample
-- Publish the clean room.
CALL samooha_by_snowflake_local_db.provider.create_or_update_cleanroom_listing(
  $cleanroom_name);
```

When the procedure finishes, you should see the clean room listed in the [clean rooms UI](../v1/web-app-introduction.md), in the
Created tab in your provider account, and in the Invited tab in the consumer account, with the label
“Powered by Dev Edition.” The consumer account will receive an invitation email. (Do not install the clean room from the Invited tab;
you will install it in code, in a later step.)

Congratulations: You’ve published your first clean room!

Now take off your provider cap and put on your consumer cap.

## Consumer: Install (join) the clean room

You’ll use the same account for the provider and consumer roles in this tutorial, so add a new SQL worksheet named “API Tutorial -
Consumer” in Snowsight in the same account.

Set up the session environment, similar to the way you did for the provider:

```sqlexample
USE WAREHOUSE app_wh;
USE ROLE SAMOOHA_APP_ROLE;
```

Next, install the clean room that you published and shared as a provider. To install a clean room, you must specify both the clean room
name and the account locator of the provider who shared the clean room with
you. Specifying the clean room name and the account locator helps disambiguate clean rooms with identical names. Run
`SELECT CURRENT_ACCOUNT();` to get your provider locator.

```sqlexample
SET cleanroom_name = 'Developer Tutorial';
CALL samooha_by_snowflake_local_db.consumer.install_cleanroom(
  $cleanroom_name,
  <PROVIDER_LOCATOR>);
```

Installation can take a few minutes.

## Consumer: Link your data

You must register and link your data into the clean room, just as you did as a provider. Again, because you are using sample data provided
with the clean room installation, you the data is pre-registered.

You will use a different sample table installed with clean rooms. If this table is not present in your account, see the Requirements
section on the Introduction page to learn how to install it.

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL samooha_by_snowflake_local_db.consumer.link_datasets(
  $cleanroom_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2']);

CALL samooha_by_snowflake_local_db.consumer.view_consumer_datasets($cleanroom_name);
```

## Consumer: No need to set policies

You could set policies on your data, the same way the provider did, but this template is approved for only the consumer to run, so there’s
no need to set any policies on it.

However, if you were to approve a provider request to run this template, you should first set join and column policies on your data to
control what the provider could do with it.

## Consumer: Run the analysis

To run a query, you need the following information:

* The name of the template you want to run.
* The names of your tables to use in the template.
* The names of the provider’s tables to use in the template.
* Any other name/value variables to pass in.

### Examine the template

You can examine the template to see what it does and any arguments that it accepts. The following example shows how to list the templates
in the clean room, see a template’s code, and see what arguments it accepts:

```sqlexample
-- List templates in the clean room.
CALL samooha_by_snowflake_local_db.consumer.view_added_templates($cleanroom_name);

-- See the template code.
SET template_name = 'overlap_template';
CALL samooha_by_snowflake_local_db.consumer.view_template_definition(
  $cleanroom_name,
  $template_name);

-- See what arguments can be passed in to the template:
CALL samooha_by_snowflake_local_db.consumer.get_arguments_from_template(
  $cleanroom_name,
  $template_name
);
```

You can see that you need to pass in a provider table and column name, a consumer table and column name, and a grouping column.

### List the available provider tables

See which tables the provider has added to the clean room.

```sqlexample
-- Table name to use is in the LINKED_TABLE column in the results.
CALL samooha_by_snowflake_local_db.consumer.view_provider_datasets($cleanroom_name);
```

### List the provider’s joinable and projectable columns

See which columns can be joined on or projected from the provider’s data.

```sqlexample
-- See which provider columns can be joined on.
CALL samooha_by_snowflake_local_db.consumer.view_provider_join_policy($cleanroom_name);

-- See which provider columns can be projected.
CALL samooha_by_snowflake_local_db.consumer.view_provider_column_policy($cleanroom_name);
```

### Run the analysis

Now that we know what the query needs, what provider data is available, and what can be done with that data, you can select values to pass
in.

You must fully qualify all column names in most circumstances. You must use the table alias as the
table name rather than the actual table name. Remember that the table aliases in this template are `p` for the provider table, and `c`
for the consumer table. You must use lowercase `p` and `c`.

In your first query, use the following values:

* Provider table: The only choice is `SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS`.
* Consumer table: The only choice is `SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2`.
* `consumer_join_col`: Use `age_band` from the consumer table; the fully qualified column name is `c.age_band`.
* `provider_join_col`: You need to join on similar columns, so the equivalent, fully qualified provider name is `p.age_band`.
* `group_by_col`: Take your pick of provider or consumer columns from the remaining projectable columns. Try `p.device_type`, but
  you can use any of the other provider or consumer columns returned by `consumer.view_provider_column_policy`.

These values are passed into `consumer.run_analysis` as shown in the following example:

```sqlexample
CALL samooha_by_snowflake_local_db.consumer.run_analysis(
  $cleanroom_name,
  $template_name,
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS_2'], -- Consumer table list.
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS'], -- Provider table list.
  OBJECT_CONSTRUCT(                    -- Additional template arguments as name-value pairs.
    'consumer_join_col','c.age_band',
    'provider_join_col','p.age_band',
    'group_by_col','p.status'
  )
);
```

Congratulations! You should see the query results in Snowsight.

Additional features not covered here allow you to export those results directly to your own Snowflake account, or to an approved
third-party service in a process called *Activation*.

See more use cases and learn about more clean room features in the
[Snowflake Clean Rooms developer guide](../v1/developer-introduction.md).

## Both accounts: Clean up

Now let’s clean up all the resources that you created.

### Provider cleanup

Run the following code in your provider worksheet:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL samooha_by_snowflake_local_db.provider.drop_cleanroom($cleanroom_name);
```

### Consumer cleanup

Run the following code in your consumer worksheet:

```sqlexample
USE ROLE SAMOOHA_APP_ROLE;
CALL samooha_by_snowflake_local_db.consumer.uninstall_cleanroom($cleanroom_name);
```

---
title: Understanding Snowflake Data Clean Room policies
source: https://docs.snowflake.com/en/user-guide/cleanrooms/v1/policies.md
section: Clean Rooms
---

# Understanding Snowflake Data Clean Room policies

Clean rooms can implement data policies to control how data can be used by collaborators. These are in addition to any Snowflake table
policies set on the underlying tables linked into the clean room.

Each collaborator in a clean room can set policies on their own data. Your policies are enforced only in requests from other users;
your policies are not enforced against your own requests. For example, if your join policy allows joins against only column A, other users
are restricted to joining on column A, but you can run joins against any of your columns.

Clean room policies can be set using either the clean room API or UI.

To implement policy checks, the following must be true:

* **The data owner must set a policy in their clean room.** You set policies using either the API or the UI. Each policy type is set separately. Clean rooms natively implement column policies, row policies, and activation policies. **Clean room policies aren’t additive:** when you set a clean room policy, all previous values are deleted.

  ```sqlexample
  -- Sets a join policy on column HASHED_EMAIL.
  CALL samooha_by_snowflake_local_db.provider.set_join_policy(
    'my_provider_cleanroom',
    ['my_db.my_sch.T1:HASHED_EMAIL']);

  -- Replaces the previous join policy. Now the only column in the join policy is AGE_BND.
  CALL samooha_by_snowflake_local_db.provider.set_join_policy(
    'my_provider_cleanroom',
    ['my_db.my_sch.T1:AGE_BAND']);
  ```
* **The template must check the policy in the appropriate place in the template.** A clean room policy is checked only if it has the
  appropriate policy filter applied to the column in the template. If you set a clean room policy to protect your data, you should examine
  the template to confirm that the template is enforcing your policies as you expect. The following template checks whether col1 is allowed
  by the data owner’s column policy:

  ```sqlexample
  SELECT
    IDENTIFIER( {{ col1 | column_policy }} )
  FROM {{ source_table[0] }} AS c;
  ```

  The following template does not check whether `col1` has a clean room policy:

  ```sqlexample
  SELECT
    IDENTIFIER( {{ col1 }})
  FROM {{ source_table[0] }} AS c;
  ```

  Clean rooms supports a different template filter for each policy type. However, the semantics of the filter are not checked, only whether
  the column is in the policy for that filter type. For example, in the following snippet, the join policy is checked for `col1`, even
  though the column is not being joined against. If `col1` is in the data owner’s join policy, the query can succeed; if `col1` is not
  in the data owner’s join policy, the query will be blocked.

  ```sqlexample
  SELECT
    IDENTIFIER( {{ col1 | join_policy }})
  FROM {{ source_table[0] }} AS c;
  ```

> **Note:**
>
> Column policy checks are carried out when the template JinjaSQL is parsed. Queries
> with wildcards might not be caught using these checks, and discretion should be used when designing an analysis template. If some columns
> should really never be queried, consider creating a view of your source table that eliminates these sensitive columns, and link in
> that view instead.

## Snowflake policies in clean rooms

When you link tables into a clean room, any Snowflake table policies on the source tables are enforced in the linked tables in the clean
room, but these policies aren’t necessarily reported by the clean room API or UI. For instance, a
[Snowflake join policy](../../join-policies.md) continues to be enforced in the clean room, but that join policy is not visible
by calling `consumer.view_provider_join_policy` or `consumer.view_join_policy`. Therefore, you should either remove policies from the
underlying linked tables, create equivalent clean room policies (when they exist), or communicate the existence of these policies clearly
to your collaborators so that their queries don’t fail or behave unexpectedly (“why can’t I join on this column?”).

Any changes to Snowflake policies in the source tables are automatically propagated to the linked views in the clean room.

[Snowflake privacy policies](../../diff-privacy/differential-privacy-admin-privacy-policies.md) prevent creation of a view from a
protected table, so you cannot link in tables that have privacy policies.

The following policies can be applied directly into a clean room:

### Join policies

Set a join policy to indicate which columns in your data can be joined on by *any* template in the clean room. (Snowflake join policies, in
contrast, specify which columns *must* be joined on.) Join policies apply to all templates in the clean room.

A column cannot be in both a join policy and a column policy, but a column can be in both a join policy and an activation policy.

## Implementing a join policy

Clean room join policies are enforced against a column if the template applies the `join_policy` or `join_and_column_policy`
filter to the column.

If a template checks a join policy for a column, and the clean room has no join policies set, or the column is not in the join policy, the
query will be blocked.

The following code shows how to set join policies as a provider or a consumer. Remember that policies are only enforced against queries
run by another account.

```sqlexample
-- Set join policies on two columns in a clean room where you are a provider.
CALL samooha_by_snowflake_local_db.provider.set_join_policy(
  'my_provider_cleanroom',
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL', 'MYDB.MYSCH.EXPOSURES:HASHED_EMAIL']);

-- Set join policies on two columns in a clean room where you are a consumer.
CALL samooha_by_snowflake_local_db.consumer.set_join_policy(
  'my_consumer_cleanroom',
  ['SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL', 'MYDB.MYSCH.EXPOSURES:HASHED_EMAIL']);
```

The following procedures are used to view or manage join policies in code:

* `consumer.set_join_policy`
* `consumer.view_provider_join_policy`
* `consumer.view_join_policy`
* `provider.view_join_policy`
* `provider.set_join_policy`

### Column policies

Set a column policy to indicate which of your columns can be projected in analysis results from a *specific* template. Column policies are
applied to specific templates in a specific clean room.

A column cannot be in both a join and a column policy. A column can be in both an activation and a column policy.

## Implementing a column policy

Clean room column policies are enforced against a column only if the template uses the `column_policy` or `join_and_column_policy`
filter.

If a clean room checks a column policy for a column, and the column is not in the column policy, or the clean room has no column policies,
the query will be blocked.

The following code shows how to set column policies for three columns when accessed by the `prod_overlap_analysis` template. The example
shows how to set the policy both as a provider and a consumer. Remember that policies are only enforced against queries
run by another account.

```sqlexample
-- Set column policy check on prod_overlap_analysis template in a clean room where
-- you are a provider.
call samooha_by_snowflake_local_db.provider.set_column_policy(
  'my_provider_cleanroom',
  ['prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE']);

-- Set column policy check on prod_overlap_analysis template in a clean room where
-- you are a consumer.
call samooha_by_snowflake_local_db.consumer.set_column_policy(
  'my_consumer_cleanroom',
  ['prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:STATUS',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:AGE_BAND',
   'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:DAYS_ACTIVE']);
```

The following procedures are used to view or manage column policies in code:

* `consumer.set_column_policy`
* `consumer.view_column_policy`
* `consumer.view_provider_column_policy`
* `provider.set_column_policy`
* `provider.view_column_policy`

### Activation policies

Set an activation policy to indicate which of your columns can be activated by an activation template. Activation saves query results to
a table in the Snowflake account of the provider or consumer, or to a third-party activation connector.

A column can be part of an activation policy as well as any other policy.

## Implementing an activation policy

Activation policies can be set in the clean rooms UI if the template allows activation.

Activation policies are set for a specific column in a specific template.

Activation policies are enforced against a column only if the template applies the `activation_policy` filter to the column.

The following code demonstrates setting an activation policy to allow the HASHED_EMAIL and REGION_CODE columns to be activated in a clean
room. This policy affects all users and all activation templates in the clean room. There are equivalent procedures for providers and
consumers in a clean room. Call the procedure that reflects your role in the clean room.

```sqlexample
-- Set activation policy check on prod_overlap_analysis template in a clean room where you are a provider
call samooha_by_snowflake_local_db.provider.set_activation_policy('my_cleanroom', [
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:HASHED_EMAIL',
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS:REGION_CODE' ]);

-- Set activation policy check on prod_overlap_analysis template in a clean room where you are a consumer
call samooha_by_snowflake_local_db.consumer.set_activation_policy('my_cleanroom', [
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE_NAME.DEMO.CUSTOMERS:HASHED_EMAIL',
    'prod_overlap_analysis:SAMOOHA_SAMPLE_DATABASE_NAME.DEMO.CUSTOMERS:REGION_CODE' ]);
```

The following template checks that the value passed into `col1` by the caller is in the caller’s activation policy. If the activation policy was set as shown previously, that means the only columns that can be activated are `HASHED_EMAIL` and `REGION_CODE`.

```sqlexample
BEGIN
  CREATE OR REPLACE TABLE cleanroom.activation_data_analysis_results AS
    SELECT {{ col1 | sqlsafe | activation_policy }}
    FROM IDENTIFIER({{ my_table[0] }}) AS c
    RETURN 'analysis_results';
END;
```

The following procedures are used to manage activation policies in code:

* `consumer.set_activation_policy`
* `provider.set_activation_policy`

### Aggregation policies

Aggregation policies require that all queries against a table contain aggregations (GROUP BY, COUNT, and other functions), and also specify
a minimum number of rows per result group, or the group will be omitted from the results.

Clean rooms do not have their own implementation of aggregation policies; to apply aggregation constraints on your linked data, either
apply an [aggregation policy](../../aggregation-policies.md) on the source table, or implement aggregation constraints in your
template.

Some Snowflake-provided templates use the `threshold` and `threshold_value` parameters set for a user or template. These values can be
modified in the clean rooms UI, or by calling `provider.add_consumers` or `provider/consumer.set_privacy`. If set for a consumer, you
can [access these values in your template](custom-templates.md).

---
title: Uninstalling the Snowflake Data Clean Rooms environment
source: https://docs.snowflake.com/en/user-guide/cleanrooms/uninstalling-clean-rooms.md
section: Clean Rooms
---

# Uninstalling the Snowflake Data Clean Rooms environment

To completely uninstall the clean room environment from your account, you must use the ACCOUNTADMIN role in the Snowflake account where the
clean room application is installed. This deletes the clean room environment for all users in your account.

> **Important:**
>
> This procedure completely uninstalls the entire environment for your account, not just individual clean rooms.

Before uninstalling, you must remove all clean rooms and collaborations from your account. The steps depend on which types of clean rooms
you have.

To uninstall the clean room environment for your account:

1. Remove all existing clean rooms and collaborations. Complete the steps that apply to your environment:

   **If you have Collaboration Data Clean Rooms:**

   1. [Tear down all collaborations that you created as an owner.](collaboration-api-reference.md)
   2. [Leave all collaborations that you joined as a collaborator.](collaboration-api-reference.md)

   **If you have legacy Provider and Consumer clean rooms:**

   1. [Delete all the clean rooms that you created as a provider.](manage-clean-rooms.md)
   2. [Uninstall all the clean rooms that you installed (joined) as a consumer.](manage-clean-rooms.md)

   If you have both types, complete all four steps above.
2. [`Download the cleanroom uninstall notebook file`](../../_downloads/fea23b1daec843dd9cdf3d25f6673caa/UNINSTALL_COLLAB_DCR.ipynb) and
   [import it into Snowsight](../ui-snowsight/notebooks-create.md) in the account where you want to delete the clean rooms
   environment. You must be able to run the notebook by using the ACCOUNTADMIN role.
3. If you want to delete your organization from Snowflake Data Clean Rooms, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Update the clean room UI to Snowflake authentication
source: https://docs.snowflake.com/en/user-guide/cleanrooms/update-to-oauth.md
section: Clean Rooms
---

# Update the clean room UI to Snowflake authentication

> **Important:**
>
> If you installed the Snowflake Data Clean Rooms application after May 1, you don’t need to read this article.

The Problem
:   Before May 1, 2025 users signed into the clean rooms UI using their clean room credentials. On May
    1, Snowflake authentication became the default credentials used in new clean room UI installations, but not for older installations.

The Solution
:   Administrators who installed the clean room UI before May 1, 2025 need to migrate their clean rooms account to start using Snowflake
    authentication by the end of April, 2026. This is done using a small, simple wizard that appears when you sign in to clean rooms with an
    account that has administrator privileges.

The wizard walks you through three simple steps:

## 1. Download the current UI user list

In this step you will grant the appropriate role-based access to the clean room UI to your users.

1. **Download the list of current clean room UI users.** The report includes the following fields:

   * `Email`: The email address associated with the account in the old clean rooms authorization system.
   * `Users on Snowflake`: The Snowflake username associated with `Email`.
   * `Name`: The name associated with `Email` in the old clean room authorization system.
   * `DCR role`: The old DCR UI persona for this user. One of the following values:

     + DCR Admin - Maps to MANAGE_DCR_PROFILE_AND_FEATURES, MANAGE_DCR_CONNECTORS, and MANAGE_DCR_COLLABORATORS privileges.
     + Clean room manager - Maps to MANAGE_CLEANROOMS privilege.
2. **Go through each row in the list** and grant proper Snowflake clean room privileges to each user:

   * For each row, check the `Users on Snowflake` value:

     > + **If one user is listed,** grant the [appropriate privileges](manage-dcr-users.md) to the username in `Users on Snowflake`.
     > + **If multiple users are listed,** grant the [appropriate privileges](manage-dcr-users.md) to each user individually.
     >   However, only the first user who logs in after migration can see the query history. If you need to change who sees the query
     >   history for a given account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
     > + **If no users are listed,** and you know who this user is based on their email, grant the user the
     >   [appropriate privileges](manage-dcr-users.md) in Snowflake. (Before the change, you could access the clean room UI
     >   without having a Snowflake account if a clean rooms manager invited you.)
   * If a user is not listed here who should have access to the clean room UI, grant the user the
     [appropriate privileges](manage-dcr-users.md) in Snowflake.

> **Note:**
>
> If there are users whose clean room UI email does not exactly match their Snowflake account email, or if multiple users used the same
> email address, they might encounter problems.

## 2. Test your own access

To test whether you can open the clean rooms UI with your Snowflake credentials, select Test Login and provide Snowflake credentials
for any account that should be able to access this clean room UI.

> **Important:**
>
> When you test user credentials, the response shows which [clean room privileges](manage-dcr-users.md) will be granted to this user after migration (or ALL, which means ACCOUNTADMIN). **Confirm that the privilege list is not empty**, and that it **matches the privileges you expect** . If the privileges are not what you expect or want, grant the appropriate clean room privileges to that user in Snowflake.

## 3. Migrate

Switch the clean room UI sign in to Snowflake authentication for your account. That’s it
– you’re done! The clean room UI sign in process should now be the same for Snowsight and the clean room UI. No need
to take any special steps if you are using SSO.

Remember that you must switch your clean room UI to use Snowflake authentication by the end of April, 2026.

## Analysis and query history migration details

If you do not see your clean room report history after logging in to the clean room UI, here are the possible reasons:

* Reports are migrated for a clean room account upon first login after migration. If you don’t see reports at first, wait a bit to see if
  the reports appear in your clean room account.
* You have not verified ownership of the Snowflake email address now associated with your clean room account. If this is the case,
  [verify your email address](../ui-snowsight-profile.md) in Snowflake.
* If multiple users referenced the same Snowflake email in the old clean room UI, only the first user to log in to the clean room UI
  after migration has access to the reports. If you need to switch the reports to another user, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Here are more details about the migration process:

The clean room UI previously used email addresses as a user credential. After migration, the clean room UI uses Snowflake user IDs. The system tries to find an exact match between an old clean room email addresses and a Snowflake user ID email address when migrating clean room reports.

The first time each user logs in to the clean room UI after migration, their query report history is associated with the Snowflake email
as long as the following criteria are met:

* The user has the same email on Snowflake as used in the clean room UI
* The user’s email is verified in Snowflake.

If user does not have a Snowflake email, or the Snowsight email is not verified, the reports will not be moved over.

---
title: Upload and run custom functions in a clean room
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/custom-code.md
section: Clean Rooms
---

# Upload and run custom functions in a clean room

## Overview

You can upload custom Python UDFs and UDTFs into your clean room and run them from your templates to perform complex data actions. These
actions include machine learning or customized data manipulation within a query, as part of a single-step or
[multi-step flow](../multistep-flows.md). Python is the only coding language supported for custom UDFs.

Your uploaded code can import and use packages from an [approved bundle of Python packages](https://repo.anaconda.com/pkgs/snowflake/)
and the [Snowpark API](snowpark.md).

Templates in a clean room can call code uploaded by the account that added the template. Uploaded code can’t be viewed or downloaded.
Snowflake scans uploaded code for security issues before installing the code.

There are different mechanisms for uploading code into a clean room, depending on your role:

**Providers**

* Inline code upload: If you want to upload code using the default compute resources for a
  clean room, and need to use only the standard bundle of Python packages (including the Snowpark API), you should upload inline code.
* [Snowpark Container Services running within a clean room](snowpark.md): If you need more control over the
  environment, such as specifying additional compute or custom libraries, you can run a container within a clean room.

**Consumers**

* Inline upload with template: Consumers can upload and run a template bundled with code. The
  code is bound to the template, and must be approved by the clean room provider.

This topic shows how to upload and run custom Python UDFs and UDTFs as either a provider or a consumer.

> **Tip:**
>
> For background information about how to develop your own Python UDFs in a clean room, see the following topics:
>
> * [How UDFs work on Snowflake](../../../developer-guide/udf/python/udf-python-creating.md) for general background on how to write Python
>   functions in Snowflake.
> * [How to write UDTFs in Snowflake](../../../developer-guide/udf/python/udf-python-tabular-functions.md) if you want to return tables from
>   your functions.
> * [How to create and upload custom templates into a clean room.](custom-templates.md) UDFs/UDTFs are
>   called from a custom template.
> * [Using Snowpark in clean rooms](snowpark.md) (if you want to call your UDFs from Snowpark).

### Entry points for uploaded code

Each bundle of uploaded code can define multiple functions that call each other, but a bundle exposes only one handler function.
This handler function can be called by templates created or run by anyone who uses the clean room. If the code creates internal tables,
these tables can be accessed as described in [Using internal tables for multistep workflows](../multistep-flows.md).

For example, if you uploaded a function named `simple_add` that takes in two numeric parameters, you can call it from a template as shown
here. The function is always
referenced using the scope `cleanroom`. For example, a template could call `simple_add` like this:

```sqlexample
SELECT cleanroom.simple_add({{ price | sqlsafe | int }}, {{ tax | sqlsafe | int }}) ...
```

> **Tip:**
>
> If the provider wants to run the code above, they must alias all SELECT columns that use an aggregate or custom function, because a
> results table is generated behind the scenes:
>
> ```sqlexample
> SELECT
>   cleanroom.simple_add(
>     {{ price | sqlsafe | int }}, {{ tax | sqlsafe | int }}
>     ) AS TOTAL_ITEM_COST
> ...
> ```

You can upload multiple functions in a single package, and functions within a single package can call each other, but functions can’t call
functions within other packages. (They can call the handler functions, though.) For example, if you have a clean room where you upload two
packages, each with a handler function and two helper functions:

Clean room with two uploaded Python packages

| Package 1 | Package 2 |
| --- | --- |
| * **Handler function A** * Helper function **A1** * Helper function **A2** | * **Handler function B** * Helper function **B1** * Helper function **B2** |

* Code uploaded by either party (provider or consumer) can be run templates submitted by either party.
* A template can call function A or function B, but not A1, A2, B1, or B2.
* Function A can call function B, and the reverse.
* Function A can’t call B1 or B2 and Function B can’t call A1 or A2.
* A1 can call A2 and the reverse. A1 and A2 can call B. A1 and A2 can’t call B1 or B2.
* B1 can call B2 and the reverse. B1 and B2 can call A. B1 and B2 can’t call A1 or A2.

### Updating or deleting custom functions

You can upload or overwrite an existing function or template that you uploaded, but you can’t delete an existing function or template. The
only way to “remove” a function is to create a dummy function with the exact same name and signature that always succeeds.

Uploading a function with the same signature as one that you previously uploaded will overwrite the existing function, where a
signature means the case-insensitive function name of an external handler, plus the data types of all its arguments, in the same order.
Argument names are not part of the signature. You can’t overwrite a function uploaded by another account.

Because the signature must match when you update a function, you cannot change the signature of an existing function: if you upload the
function `foo(name VARIANT age INTEGER)` and then upload the function `foo(name VARIANT age FLOAT)`, the second function will be added
to the clean room in addition to the first, because the argument types differ.

## Provider-submitted code

Provider-submitted functions can be uploaded as inline code or from a Snowflake stage. Both techniques are covered here.

Your uploaded code can natively import and use packages from an [approved set of Python packages](https://repo.anaconda.com/pkgs/snowflake/).
If you need a non-default package, you must use [Snowpark Container Services in a clean room](snowpark.md) to host your
code.

You can’t view uploaded provider code, even your own code, so be sure to include a copy of exactly what you upload into a clean room.

### Overview

Here is a high level view of how a provider adds code to a clean room:

1. The provider creates and configures the clean room in the normal way.
2. The provider uploads a bundle by calling `provider.load_python_into_cleanroom`. You can either
   upload your code inline directly within that procedure, or
   upload a code file to a stage, then provide the stage location to that procedure.

   Although each bundle can include multiple functions, only one handler function is exposed for each upload. To expose multiple functions to
   templates, upload each handler separately or do a bulk upload (described below).
3. If the clean room is exposed externally, security checks are run before the code is installed in the clean room, and you must call
   `provider.view_cleanroom_scan_status` to confirm that security checks have passed before incrementing the default version.
4. After each successful upload, a [new patch version of the clean room is generated](../dcr-versions.md). You
   must then increase the default version by calling `provider.set_default_release_directive` with the new patch number.
5. Create and upload a custom template that calls handlers in your code. The template must call the handler function using the `cleanroom` scope, that is: `cleanroom.my_function(...)`.
6. The consumer runs your template the same way as any other template.

   > **Tip:**
   >
   > If the consumer encounters a mount error when they install a clean room with custom code, this can indicate a syntax error in the code.

You can find code examples demonstrating this flow in the provider-written code example section.

### Important notes about versioning

Every time the provider uploads a function, it increases the clean room patch number (and there is a limit of 99 patch numbers). Therefore,
do your best to test and debug your code thoroughly before adding it to the clean room to reduce version updates during development.

You can upload multiple packages at once in a single bulk upload to reduce the number of patches
generated. However, bulk uploads can make it more challenging to debug if the upload has a security scan issue, because the file that
caused the problem isn’t reported in the error response.

If you do update a patch number, customers using the clean room UI might need to refresh the page to see the change. Customers using the
API should see changes immediately, but there can be a delay, depending on the available resources.
[Learn more about clean room versioning.](../dcr-versions.md)

### Uploading provider-written inline functions

You can upload the code inline in the `code` parameter of `provider.load_python_into_cleanroom`. Here is an example of uploading a
simple function inline:

```sqlexample-python
CALL samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
$cleanroom_name,
'simple_add',                         -- Name used to call the UDF from a template.
['first INTEGER', 'second INTEGER'],  -- Arguments of the UDF, specified as '<variable_name> <SQL type>' pairs.
['numpy', 'pandas'],                  -- Packages imported by the UDF.
'INTEGER',                            -- SQL return type of UDF.
'add_two',                            -- Handler function in your code called when external name is called.
$$
import numpy as np   # Not used, but you can load supported packages.
import pandas as pd

def add_two(first, second):
    return first + second
$$
);
```

The calling template calls `cleanroom.simple_add` to call this function.
The provider examples demonstrate how to upload inline code.

### Uploading provider-written functions from a stage

You can upload Python files to a clean room stage and reference the stage when you call `provider.load_python_into_cleanroom`. Loading
code from a stage allows you to develop the code in your local system in an editor, avoid copy/paste errors when loading it inline, and
also have better versioning control of your source code. Note that you can upload multiple files in a single procedure call, but only one
handler function is exposed for each upload.

Code is loaded from a stage into the clean room when you call `load_python_into_cleanroom`; later changes to the code on the stage are not
propagated to the clean room.

To upload your UDF to a stage:

1. Create your .py file and make it available in a location where you can upload it to a Snowsight stage.
2. To get the name of the stage for your clean room, call `provider.get_stage_for_python_files`. You must use the specified stage; you cannot use
   an arbitrary stage that you create.
3. Upload the .py file to the stage for your clean room. There are
   [several ways to do this](../../data-load-local-file-system-stage.md), including using the CLI, Snowsight, or
   language-specific drivers.
4. Call `provider.load_python_into_cleanroom` with the stage location, handler, external name, arguments, and return type.
   Templates in your clean room can now call the function.

The following example code shows how to load code into a clean room from a stage.

```sqlexample-python
-- Save the following code as reverser.py:
--import numpy as np
--def main(some_string):
--  '''Return the reverse of a string plus a random number 1-10'''
--  return some_string[::-1] + str(np.random.randint(1,10))

-- Get the stage for your clean room.
CALL samooha_by_snowflake_local_db.provider.get_stage_for_python_files($cleanroom_name);

-- Save the file to the stage. Here is how to do it by using the Snowflake CLI
PUT file://~/reverser.py <STAGE_NAME> overwrite=True auto_compress=False;

-- Load the code from the stage into the clean room.
CALL samooha_by_snowflake_local_db.provider.load_python_into_cleanroom(
    $cleanroom_name,
    'reverse', -- Name used to call the function
    ['some_string  STRING'], -- Arguments and SQL types
    ['numpy'],               -- Any required packages
    ['/reverser.py'],        -- Relative path to file on stage
    'STRING',                -- Return type
    'reverser.main'          -- <FILE_NAME>.<FUNCTION_NAME>
);

-- Uploading code, even from a stage, increases the patch number.
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive(
  $cleanroom_name, 'V1_0', <NEW_PATCH_NUMBER>);

-- Upload a template that calls the function.
CALL samooha_by_snowflake_local_db.provider.add_custom_sql_template(
    $cleanroom_name,
    $udf_template_name,
    $$
    SELECT
      p.status,
      cleanroom.reverse(p.status)
    FROM SAMOOHA_SAMPLE_DATABASE.DEMO.CUSTOMERS AS p
    LIMIT 100;
    $$
);

-- Switch to the consumer account and run the template to see the results.
```

The provider examples demonstrate uploading code from a stage.

### Troubleshooting syntax errors or scan failures in uploaded code

If you upload a function that fails because of a syntax error, or if a security scan fails, an unpublishable patch can be generated. Therefore, you should thoroughly test your code before upload to ensure that it has no syntax errors.

You can see the list of packages, and their review status, by running the following SQL command, providing the clean room ID in the place indicated:

`SHOW VERSIONS IN APPLICATION PACKAGE samooha_cleanroom_cleanroom_id;`

### Security scans

A security scan is run after any action that generates a new patch version in an external clean room, such as when the provider uploads
Python into the clean room. (Consumer-submitted code, described on this page, does not trigger a security scan.) Internal clean rooms do
not run security scans, but if you change an internal clean room to an external clean room, it will trigger a security scan for that patch.
A clean roomm patch cannot be published externally until the patch has been scanned.

Snowflake Data Clean Rooms uses the [Snowflake Native App security scan framework](../../../developer-guide/native-apps/security-run-scan.md).
Follow the [native app security best practices](../../../developer-guide/native-apps/security-app-requirements.md) to avoid security scan errors.

You can perform additional patch-creating actions before the last security scan is complete. However, you must wait for
`provider.view_cleanroom_scan_status` to show success before you can update the default release directive in order to serve the
latest version of the clean room.

### Uploading multiple Python functions in a single patch (bulk uploading)

If you want to upload multiple Python packages to your clean room, you can call `prepare_python_for_cleanroom` multiple
times, then call `load_prepared_python_into_cleanroom` once to scan, upload, and generate a single patch for your clean room. The
following example demonstrates uploading a UDF and a UDTF using bulk uploading:

```sqlexample-python
---- Add custom inline UDF ----
CALL samooha_by_snowflake_local_db.provider.prepare_python_for_cleanroom(
    $cleanroom_name,
    'get_next_status',  -- Name of the UDF. Can be different from the handler.
    ['status VARCHAR'], -- Arguments of the UDF, specified as (variable name, SQL type).
    ['numpy'],          -- Packages needed by UDF.
    [],                 -- When providing the code inline, this is an empty array.
    'VARCHAR',          -- Return type of UDF.
    'get_next_status',  -- Handler.
    $$
import numpy as np
def get_next_status(status):
  """Return the next higher status, or a random status
  if no matching status found or at the top of the list."""

  statuses = ['MEMBER', 'SILVER', 'GOLD', 'PLATINUM', 'DIAMOND']
  try:
    return statuses[statuses.index(status.upper()) + 1]
  except:
    return 'NO MATCH'
    $$
);

---- Add custom inline UDTF. ----
CALL samooha_by_snowflake_local_db.provider.prepare_python_for_cleanroom(
    $cleanroom_name,
    'get_info',  -- Name of the UDTF. Can be different from the handler.
    ['hashed_email VARCHAR', 'days_active INT', 'status VARCHAR', 'income VARCHAR'],   -- Name/Type arguments of the UDTF.
    ['numpy'],         -- Packages used by UDTF.
    [],                -- When providing the code inline, this is an empty array.
    'TABLE(hashed_email VARCHAR, months_active INT, level VARCHAR)',  -- Return type of UDTF.
    'GetSomeVals',     -- Handler class name.
$$
class GetSomeVals:
  def __init__(self):
    self.month_days = 30

  def process(self, hashed_email, days_active, status, income):
    '''Change days into rough months, and also return whether we
    think the user's membership status is lower, higher, or equal to
    what is expected, based on their income.'''

    months_active = days_active // self.month_days
    brackets = ['0-50K', '50K-100K', '100K-250K', '250K+']
    statuses = ['MEMBER', 'SILVER', 'GOLD', 'PLATINUM']
    if(statuses.index(status) < brackets.index(income)):
      level = 'low'
    elif(statuses.index(status) > brackets.index(income)):
      level = 'high'
    else:
      level = 'equal'

    yield(hashed_email, months_active, level)
$$
);

-- Upload all stored procedures.
-- Note the new patch number returned by this procedure. Keep this number for later use.
CALL samooha_by_snowflake_local_db.provider.load_prepared_python_into_cleanroom($cleanroom_name);

-- Set the release directive specified by the last load_python_into_cleanroom call.
CALL samooha_by_snowflake_local_db.provider.set_default_release_directive($cleanroom_name, 'V1_0', <PATCH_NUMBER>);
```

### Provider-written code examples

The following examples demonstrate adding provider-written UDFs and UDTFs to a clean room.

Download the following examples and then upload them as worksheet files in your Snowflake account. You need separate accounts for
the provider and consumer, each with the clean rooms API installed. Replace the information as noted in the sample files.
[See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Provider example code`](../../../_downloads/d5c64053435e55dc171af58a492f947f/provider-udf-p.sql)
* [`Consumer example code`](../../../_downloads/f9606ce3e3ad2dbd62a3fe9735894869/provider-udf-c.sql)
* [`Loading a file from a stage`](../../../_downloads/c9458d589eac4e4354d19501fa9f1707/udf_from_stage.ipynb). Run this notebook after you run the provider
  example to try loading a UDF from a stage.
* [`Uploading multiple Python functions in a single patch.`](../../../_downloads/e3c5e0dab78085f95d314b4ce2e04c4e/upload-multiple-python-packages.sql)
  This is a single-account internal testing clean room; you can use the same account for both the provider role and the consumer role.

## Consumer-submitted code

Consumer-uploaded code is bundled and uploaded with a custom template using the [consumer template upload flow](custom-templates.md). The uploaded code can be called by any template in the clean room.

To upload code as a consumer, you should understand [custom template syntax](custom-templates.md).

Note that any code uploaded by a consumer can be seen by the provider when they request permission to upload. The consumer code is also
visible whenever a provider or consumer examines the template.

Here is an overview of the steps to upload custom consumer code:

1. The provider creates the clean room in the standard way and then invites the consumer.
2. The consumer installs and configures the clean room in the standard way.
3. The consumer prepares a template that calls the UDF or UDTF within the `cleanroom` namespace. For example, to call the
   consumer-defined `calculate_tax` function, a simple template might look like the following snippet:

   ```sqlexample
   SELECT {{ cleanroom.calculate_tax(p.cost) }} AS Tax FROM my_db.my_sch.sales AS p;
   ```
4. The consumer prepares their Python code. We recommend using double quotation marks (`" "`) rather than single quotation marks (`' '`)
   in your code to avoid extra escaping needed later. Your code can reference [these supported Python libraries](https://repo.anaconda.com/pkgs/snowflake/).
5. The consumer passes their Python code into `consumer.generate_python_request_template`. The procedure returns the Python code as a
   stored procedure, with a placeholder for the custom JinjaSQL template. There are several multi-line strings in the template that use
   `$$` as multi-line delimiters.
6. The consumer replaces the template placeholder in the output from `generate_python_request_template` with their JinjaSQL template.
7. In the combined template, escape any single quotes like this: `\'`. This is because single quotes will be used as the outermost
   delimiter for the entire multi-line procedure string when you upload it to the clean room. Here is an example of a stored procedure that
   includes the consumer Python code and custom template, with character escaping:

   ```sqlexample-python
     BEGIN

     CREATE OR REPLACE FUNCTION CLEANROOM.custom_compare(min_status STRING, max_status STRING, this_status STRING)
     RETURNS boolean
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.10
     PACKAGES = (\'numpy\')

     HANDLER = \'custom_compare\'
     AS $$
     import numpy as np

     def custom_compare(min_status:str, max_status:str, this_status:str):
       statuses = [\'MEMBER\', \'SILVER\', \'GOLD\', \'PLATINUM\']
       return ((statuses.index(this_status) >= statuses.index(min_status)) &
               (statuses.index(this_status) <= statuses.index(max_status)))
     $$;

     -- Custom template
     LET SQL_TEXT varchar := $$
     SELECT
       c.status,
       c.hashed_email
     FROM IDENTIFIER( {{ my_table[0] }} ) as c
     WHERE cleanroom.custom_compare({{ min_status }}, {{ max_status }}, c.status);
     $$;

     LET RES resultset := (EXECUTE IMMEDIATE :SQL_TEXT);
     RETURN TABLE(RES);

     END;
   ```
8. The consumer calls `consumer.create_template_request` with the combined template. Use single quotation marks (`' '`) instead of
   double dollar sign delimiters (`$$...$$`) around the code you provide for stored procedure in the `template_definition` argument. For example:

   ```sqlexample-python
   CALL samooha_by_snowflake_local_db.consumer.create_template_request(
     $cleanroom_name,
     $template_name,
     '
   BEGIN

   -- First, define the Python UDF.
   CREATE OR REPLACE FUNCTION CLEANROOM.custom_compare(min_status STRING, max_status STRING, this_status STRING)
   RETURNS boolean
   LANGUAGE PYTHON
   RUNTIME_VERSION = 3.10
   PACKAGES = (\'numpy\')

   HANDLER = \'custom_compare\'
   AS $$
   import numpy as np

   def custom_compare(min_status:str, max_status:str, this_status:str):
     statuses = [\'MEMBER\', \'SILVER\', \'GOLD\', \'PLATINUM\']
     return ((statuses.index(this_status) >= statuses.index(min_status)) &
             (statuses.index(this_status) <= statuses.index(max_status)))
       $$;

   -- Then define and execute the SQL query.
   LET SQL_TEXT varchar := $$
   SELECT
     c.status,
     c.hashed_email
   FROM IDENTIFIER( {{ my_table[0] }} ) as c
   WHERE cleanroom.custom_compare({{ min_status }}, {{ max_status }}, c.status);
   $$;

   -- Execute the query and then return the result.
   LET RES resultset := (EXECUTE IMMEDIATE :SQL_TEXT);
   RETURN TABLE(RES);

   END;
   ');
   ```
9. The consumer and provider continue with the standard
   [consumer-defined template flow](custom-templates.md):

   1. The provider views the template request (`provider.list_pending_template_requests`) and then approves it by calling
      `approve_template_request`. In the request, the provider can see the template and the bundled code.
   2. The consumer checks the request status (`consumer.list_template_requests`), and when the status is APPROVED, runs the template (`consumer.run_analysis`).

   Consumer code uploads don’t trigger a security scan or affect the clean room patch number.

### Consumer-written code examples

The following examples demonstrate adding provider-written UDFs to a clean room.

Download the following examples and then upload them as worksheet files in your Snowflake account. You need separate accounts for
the provider and consumer, each with the clean rooms API installed. Replace the information as noted in the sample files.
[See instructions to upload a SQL worksheet into your Snowflake account](../tutorials-and-samples.md).

* [`Consumer-written, consumer-run code: Provider worksheet`](../../../_downloads/9e9507050ea9767daf56d8c94a892579/consumer-udf-p.sql)
* [`Consumer-written, consumer-run code: Consumer worksheet`](../../../_downloads/52519c3df63da0cbe3838f0878fbaec3/consumer-udf-c.sql)
* [`Consumer-written, provider-run code: Provider worksheet`](../../../_downloads/e81d3235044a7367014ec0680eab0ddc/p-run-c-uploaded-code-p.sql)
* [`Consumer-written, provider-run code: Consumer worksheet`](../../../_downloads/7879eba9c233607e8d74f47e44e4997a/p-run-c-uploaded-code-c.sql)

---
title: Use case: Overlap and segmentation
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/basic-flow-overlap.md
section: Clean Rooms
---

# Use case: Overlap and segmentation

Snowflake provides an overlap and segmentation template to determine which entities exist in the data for all collaborators, and show
aggregated information about those entities.

When using this template, two parties each add one or more tables to a clean room. Entities in these tables are
joined or identified by the join columns that you specify. Additionally, the overlap count can be broken down and filtered by
particular segmentation attributes. This enables parties to gain insight of the overlap between their datasets, which can help determine
the value of collaboration and facilitate other downstream use cases in the clean room. The consumer specifies which columns to join on and
which columns to show. All projected columns must either be grouped or aggregated with an aggregation function. Entity-identifying columns
are blocked from the query results and differential privacy is applied by the clean room to further protect information about specific
entities. If enabled by the clean room creator, the results can be activated to a third party (clean rooms UI only).

For example, an advertiser can conduct an overlap analysis on a publisher’s inventory to help inform the value of
buying media on that publisher. The advertiser then activates the IDs of their desired audience back to the publisher for
targeting purposes.

The overlap and segmentation template is available for use in both the clean rooms UI and in code. The clean rooms UI enables easy usage
of identity providers and activation to third-party partners, while the code usage enables multiple tables from both the provider and
consumer.

> **Tip:**
>
> If you enable differential privacy with the Audience Overlap template, do not compute overlap statistics. Doing so will consume most of
> the user’s privacy budget, leaving little or no budget to run analyses.

## Clean rooms UI usage

In the clean rooms UI, this use case is supported through a ready-made template called Audience Overlap & Segmentation. Although
this template is targeted for marketing and advertising use cases, it can be used for any overlap and segmentation use case across all
industries. Follow the steps below to learn how to create and use this template.

> **Note:**
>
> When running this analysis in the clean rooms UI, overlap percentages can vary depending on who runs the analysis. This is because the
> percentage is calculated as (*matched IDs in my table*)/(*total IDs in my table*). For example if collaborator A has 100 IDs
> while B has 500 IDs. If they both overlap 50 IDs, then A will see a 50% overlap while B will see only 10% overlap.
>
> Also, If the same ID from collaborator A’s data matches multiple IDs in collaborator B’s data, the overlap will vary depending on who runs
> the analysis.

**Web template features:**

* One-click activation, if configured by your clean room administrator.
* One-click usage of identity providers, if configured by your clean room administrator.
* Support for provider-run analyses.
* Both sides can import data and specify columns that can be joined, projected, and activated.
* Overlap query on one consumer and one provider table from the available tables.
* Configurable differential privacy.

> **Note:**
>
> Try out the [web interface tutorial](../v1/tutorials/cleanroom-web-app-tutorial.md) to see a full end-to-end
> walkthrough of using clean rooms in the clean rooms UI. This template is also covered in this tutorial.

### Step 1: Provider creates the clean room

Here is how a provider creates and configures a clean room with the Audience Overlap & Segmentation template:

1. Sign in to the clean rooms UI and [create a new clean room](../manage-clean-rooms.md).
2. Under Add Data, do the following:

   > 1. Choose the tables to link (import) into the clean room. If the tables you need aren’t listed, speak to a clean rooms administrator.
3. Under Specify Join Policies, do the following:

   > * Choose which columns a collaborator can join on from your tables. Remember that joinable columns can’t also be shown or used in the
   >   analysis for segmentation, filtering, or grouping.
   > * If you want to use an identity provider to help resolve entities that might have multiple identifiers, for example a single
   >   individual who has multiple email accounts in different databases, choose an identity provider in the Identity Hub.
4. Under Configure Analysis & Query, do the following:

   > 1. Select Audience Overlap & Segmentation as the analysis type. (You can select multiple templates for a clean room.) The
   >    configuration options for each template will be shown on the page.
   > 2. For Tables, choose which tables that you linked earlier should be available to consumers in this clean room with this template.
   > 3. Use Segmentation & Attribute Columns to choose which columns are shown in the query results. The collaborator can show, filter,
   >    and group by selected columns. Collaborators can activate these attribute values when Snowflake Activation is enabled in the clean
   >    room. If you don’t see a column listed here, it’s probably because you marked it as joinable, and a column can’t be both joinable
   >    and visible in the query results.
   > 4. Allow categorical value previews during filtering specifies whether previews show actual values. It is enabled by default if
   >    there are fewer than 20 distinct values in the column, but disabled by default if there are more than 20 distinct values, to protect
   >    PII.
   > 5. Review the Activation Settings section to enable, configure, or disable activation for the results data:
   >
   >    * Select ID Columns that should be available during activation use cases. By default, join policy columns are auto-selected.
   >    * Enable Allow non-overlap activation to activate IDs from your dataset without matching IDs in your collaborator’s dataset.
   >      For example, if you brought in 100 IDs and ran an overlap analysis with your collaborator and only 25 IDs overlapped, non-overlap
   >      activation would activate the 75 unmatched IDs from your dataset.
   >    * Review Enabled Partners to ensure only your preferred activation destinations are enabled in your clean room. If you require
   >      a change to enabled destinations, speak to a clean rooms administrator.
   > 6. Update default Privacy Settings as needed:
   >
   >    * Threshold Value is enabled by default and set to 5. This prevents results showing for any groups where the distinct count of
   >      a join policy column is below this threshold.
   >    * Differential Privacy is disabled by default. When enabled, it provides protection against potential differencing attacks by
   >      adding noise to the results and limiting the number of daily queries. Learn more about
   >      [Differential Privacy in Snowflake Data Clean Rooms](../differential-privacy.md) and understand the costs of
   >      enabling this feature.
5. Under Share clean rooms, do the following:

   * Expand the Select collaborator menu to add collaborators to the clean room. Collaborators will receive an email inviting them to
     join and use your clean room, as described next. The collaborators list on the page shows all accounts, including your own, that can
     access this clean room.
   * Select Enable run analysis and query next to a collaborator to control whether that account can run a template in the clean room.
     By default, your own account cannot run an analysis in the clean room (that is,
     [provider-run analyses](provider-run-analysis.md) is disabled by default). By default, consumers
     can run any template in the clean room.

### Step 2: Consumer joins the clean room

Here is how a consumer joins and configures a clean room that includes the Audience Overlap & Segmentation analysis template:

1. [Sign in to the clean rooms UI and join the clean room](../manage-clean-rooms.md).
2. Under Add Data, do the following:

   * Choose the tables to link (import) into the clean room. If the tables you need aren’t listed, speak to a clean rooms administrator.
3. Under Specify Join Policies, do the following:

   * Decide which joinable columns in your data map to joinable columns in the provider’s data. You will specify which of these columns to
     join on during each run.
   * If you want to use an identity provider to help resolve entities that might have multiple identifiers - for example a single individual
     who has multiple email accounts in different databases - choose an identity provider in the Identity Hub.
4. In the Configure Analysis & Query step, do the following:

   * Select the Audience Overlap & Segmentation analysis to show the configuration options for that template.
   * Choose which of your tables should be used in this analysis from the Tables dropdown menu.
   * Use Segmentation & Attribute Columns to choose which columns are shown in the query results. These columns can also be activated
     when Snowflake Activation is enabled in the clean room. If you don’t see a column listed here, it’s probably because you marked it as
     joinable, and a column can’t be both joinable and visible in the query results.
   * Select ID Columns that should be available during activation use cases. By default, join policy columns are auto-selected.
   * Optionally enable Allow activation for clean room provider to allow the clean room provider to activate to the supported
     activation destinations. This option is shown only when provider-run is enabled in the clean room. Note that enabling this
     allows row-level data to be activated back to the provider’s account. Note that the consumer is charged for compute costs
     when running provider queries and activation, although the consumer must agree to allow the provider action.
   * Review Enabled Partners to ensure that preferred activation destinations are enabled in the clean room. If you require a change
     to the enabled destinations, contact the clean room provider.
5. Click Finish to save your results. To run the analysis, see the next section.

### Step 3: Consumer runs the analysis

> **Note:**
>
> The default configuration allows only the consumer to run an analysis using this template. To enable provider-run
> analysis with this template, the provider must open the Share clean rooms tab in the clean room configuration and select
> Enable run analysis and query next to their account name.

After the provider and consumer have configured the clean room for audience overlap and segmentation, either party that has permissions to
run an analysis can do so like this:

1. In the clean rooms UI, navigate to Clean rooms.
2. Select Run for the clean room where you configured the audience overlap, and then choose Audience Overlap & Segmentation > Proceed.
   (Alternatively, visit the Analyses & Queries page, select + New Analysis & Query, choose the
   Audience Overlap & Segmentation type, then choose the clean room that has that analysis type configured.)
3. Set up the details of the run in the Query Configurations section:

   * My tables - Choose which of your tables to join on your collaborator’s tables.
   * Collaborator table - Choose a collaborator table to join on your table.
   * My join columns - Select all the columns to join on between the tables.
   * User segmentation - Optionally select grouping columns.
   * Filters - Optionally provide one or more filters on columns specified as segmentation and attribute columns during setup.
   * Privacy settings - This query implements differential privacy, and a minimum number of rows per grouping. You can see your used
     and remaining differential privacy and minimum group size here.
4. If conducting the analysis as a consumer, you can change the [warehouse size](../../warehouses-overview.md) to improve query
   times by selecting a larger warehouse or reduce cost by selecting a smaller warehouse. When conducting an analysis as a provider,
   warehouse selection is not available, but auto-scaling will try to optimize query times.
5. Select Run. If this is a new query, do the following:

   * Specify a name for your analysis & query.
   * Optionally [schedule a repeating analysis](../v1/schedule-analysis.md).
6. Select Save to start or schedule the run. It can take some time to complete each run. You can check on the analysis status or
   results in the Analysis & Queries page in the clean rooms UI.

## Code usage

You can download and run a sample notebook showing how to use the overlap and segmentation example in SQL code. This example can be uploaded and run in Snowsight.

The notebook does not demonstrate how to use identity providers, [activation](../v1/activation.md)
to third-party providers, or [provider-run analyses](provider-run-analysis.md). See the
linked topics to demonstrate how to do those actions in code.

**Prerequisites**

You must have two accounts in the same organization with Snowflake Data Clean Rooms installed. Use one account for the provider,
the other account for the consumer.

**Install and run the code example**

1. [`Download the example notebook`](../../../_downloads/44b3c72a8168d977419f51da25ef51d6/overlap-segmentation.ipynb).
2. Install the notebook in both your provider and consumer accounts. To upload a notebook, do the following:

   1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
   2. In the navigation menu, select Projects » Notebooks.
   3. Select + Notebook » Import .ipynb file.
   4. Select the .ipynb file you downloaded.
   5. Name the file as desired, and choose a database and schema.
   6. Keep the default warehouse `APP_WH`.
   7. Select Create.
   8. To create the clean room, open the notebook in the provider account and complete the provider portion.
   9. Open the notebook in the consumer account and complete the consumer portion to install and configure the clean room and run the
      template.

---
title: Using internal tables for multistep workflows
source: https://docs.snowflake.com/en/user-guide/cleanrooms/multistep-flows.md
section: Clean Rooms
---

# Using internal tables for multistep workflows

## Overview

Many clean room use cases involve running a single SQL query against one or more tables in a clean room and displaying the results in the response.
However, there are use cases where you might require creating an internal table that can be used within subsequent templates to
support a multistep workflow. For example, a machine learning flow, where the model is trained once against a dataset and then run multiple times
against varying input data, either singly or in batches.

## Creating internal tables

You can create internal tables inside a clean room to store intermediary results, or as persistent storage for usage downstream (for example, to save training data that is used for multiple runs). See properties and guidance of internal tables below:

* You can create internal tables by using a clean room template that executes CREATE TABLE, or by running a UDF/UDTF that uses
  Python to create a table.
* Internal tables can be created in the `cleanroom` schema, which is available by default. If a custom schema is preferred, the schema must be created first before creating the table.
* By default, internal tables are only accessible by approved templates in the clean room. If access needs to be provided outside of templates, then the CLEANROOM_PUBLIC_ROLE application role of the clean room needs to be granted corresponding privileges. For example, the following grant can be given: `GRANT SELECT ON TABLE CLEANROOM.MY_TABLE TO APPLICATION ROLE CLEANROOM_PUBLIC_ROLE;`
* If you have proper access, you can list the internal tables in your collaboration. Internal tables can be found at
  `SFDCR_collaboration_name.cleanroom`, and can be listed by running the following SQL code:

  > `SHOW TABLES IN SCHEMA SFDCR_collaboration_name.CLEANROOM;`.
* Internal tables are deleted when the collaboration is removed. However, if an internal table is designed to have a shorter lifetime than
  the collaboration, consider deleting the table when it’s no longer needed.

Here are some examples of creating an internal table:

TemplateUDF

A JinjaSQL template can create an internal table, which is done in some types of [activation](activation.md).

This example template creates the table and returns the table name, so that the name can be passed in as a parameter to other templates.

```sqlexample-yaml
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.REGISTRY.REGISTER_TEMPLATE(
  $$
  api_version: 2.0.0
  spec_type: template
  name: my_test_template
  version: V1
  type: sql_analysis
  description: Simple join example. Saves to table analysis_results
  template:
    BEGIN
    CREATE OR REPLACE TABLE cleanroom.analysis_results AS
      SELECT count(*) AS ITEM_COUNT, p1.status, p1.age_band
      FROM IDENTIFIER({{ source_table[0] }}) AS p1
      JOIN IDENTIFIER({{ source_table[1] }}) AS p2
      ON IDENTIFIER({{ join_col_1 | join_policy }}) = IDENTIFIER({{ join_col_2 | join_policy }})
      GROUP BY p1.status, p1.age_band;
    RETURN 'analysis_results';
    END;
  $$
  );
```

A UDF can create an internal table. This is typically done by executing SQL in Python.

```python
# Snippet of Python UDF to save results to an internal table.
table_name = f'cleanroom.results'

session.sql(f"""
CREATE OR REPLACE TABLE {table_name} AS (
  WITH joint_data AS (
    SELECT
        date,
        p.hashed_email AS hem,
        impression_id
    FROM {source_table} p
  )
  SELECT
    date,
    COUNT(DISTINCT hem) AS reach,
    COUNT(DISTINCT impression_id) AS num_impressions
  FROM joint_data
  GROUP BY date
  ORDER BY date
);
""").collect()
```

---
title: Using Snowpark in a clean room
source: https://docs.snowflake.com/en/user-guide/cleanrooms/demo-flows/snowpark.md
section: Clean Rooms
---

# Using Snowpark in a clean room

## Introduction

Snowflake Data Clean Rooms can work with Snowpark to provide increased computing power to your clean rooms when you need to query or
process large-scale data.

Clean rooms can use Snowpark in two ways:

* Snowpark UDFs: Use the Snowpark API in your clean room code to create Snowpark UDFs that take advantage
  of Snowpark scaling and processing power.
* Snowpark Container Services: If you want greater control of the Snowpark environment, or want to use
  libraries not available with the Snowpark API, you can configure and host a container within a clean room. This enables you to configure
  the environment for your specific compute and storage needs, and customize the libraries available to your environment.

When you need to load data that is too big to fit in memory, use `to_pandas_batches()` and iterate over it. For example:

> ```python
> df_iter = session.table(intermediary_table_name).to_pandas_batches()
> for df in df_iter:
>   ...
> ```

### General design of complex usage flows

Although you could generate your data and display it all by calling one template, in many cases it’s better to break up the data generation
steps from the result viewing steps. This way a consumer can view results multiple times without triggering a recalculation each time, or
view data from various points in the process. To break up your flow into multiple user-accessible stages, create separate templates for
triggering data generation or processing, and for viewing stored results.
[Read more about designing complex usage flows.](../multistep-flows.md)

## Using Snowpark UDFs in a clean room

You can use the [Snowpark API](../../../developer-guide/snowpark/index.md) in your uploaded Python code to speed up processing of large data
loads. Clean rooms support only the [Snowpark Python API](../../../developer-guide/snowpark/python/index.md). Both providers and consumers can
use the Snowpark Python API in their uploaded Python code.

### Prerequisites

* Clean rooms that run Snowpark UDFs must be run in the clean rooms API; they cannot be run in the clean rooms UI.
* You should understand the following topics:

  + [The basics of creating a clean room in code.](../tutorials/cleanroom-api-tutorial-basic.md)
  + [The Snowpark Python API.](../../../developer-guide/snowpark/python/index.md)
  + [How to upload Python UDFs into a clean room.](custom-code.md)
  + [How to create custom templates.](../custom-templates.md)
  + Read about [designing multi-step flows](../multistep-flows.md) to understand internal tables.

### Using the Snowpark API in a clean room

Using the Snowpark API in your clean room Python code is the same as uploading and running any other Python UDF, except that you need to
link in the `snowflake-snowpark-python` library.

Create UDFs, UDTFs, and procedures by executing SQL using `session.sql` in the `cleanroom` schema rather than using the Snowpark
decorators. For example:

> ```python
> session.sql("CREATE OR REPLACE FUNCTION cleanroom.udf(...")
> ```

### Basic steps

Here are the basic steps to use the Snowpark API through a UDF or UDTF in your clean room:

**Provider**

1. Create the clean room, set the default release directive, and link in your data in the standard way.
2. Because you probably have a very specific use case designed for your code, you probably won’t need to add join or column policies to the
   clean room, although you can do so.
3. Load your custom Snowpark handler code into the clean room by calling `provider.load_python_into_cleanroom`. The code should load the
   `snowflake-snowpark-python` package at minimum, plus any other packages that you need.
   UDFs can process and return data line by line, but Snowpark use cases typically generate an output table that is read by calling a
   separate results template.
4. Update the default release directive (because code additions generate a new patch version).
5. Create and upload a [custom template](../custom-templates.md) to run your Snowpark code. The only way to run a
   UDF is to trigger it from a template that calls the UDF. Some details about the UDF-calling template:

   * It should call the function using the alias and parameters that you specified in `provider.load_python_into_cleanroom`. The
     template must use the `cleanroom` namespace to call your function’s alias.
   * If the UDF writes results to a table in the clean room, and the name of the table is different for each run, your results-generating
     template should return the name of the results table, and your results template should take the table name as an argument from the
     user.
6. Upload a custom SQL template to access the results table generated by your Snowpark UDF, if you generated an intermediary results table.
   Either use the hard-coded results table name, or let the user pass in the table name generated by your code and returned by the
   results-generating template.
7. Add collaborators and publish the clean room in the standard way.

**Consumer**

The consumer installs the clean room and runs the analysis in the standard way. If the data generation and results reading are broken into
separate templates, the consumer will need to call each template in sequence.

### Example code

The following example code demonstrates how to upload and run a linear regression of “reach on impression count” to estimate the slope.

1. The consumer first runs the `prod_calculate_regression` template that runs a provider UDF to generate results. The provider UDF
   performs the following actions:

   1. **Preprocess impressions data.** Dynamic SQL is created that joins the provider’s impressions data to the consumer’s data, calculates
      the distinct count of impressions and reach by date, and stores the results in an intermediary table inside the clean room. If the
      consumer does not supply a table, the code runs against the provider’s entire impressions table.
   2. **Load the intermediary table.** The intermediary table is loaded into the Snowpark procedure as a pandas DataFrame.
   3. **Carry out regression.** The regression is calculated using the `statsmodels` library and returns results as a pandas DataFrame.
   4. **Write results to an internal clean room table.** The results are written to a results table inside the clean room, and the ID
      suffix of the table name is returned to the consumer. Since the Snowpark procedure is running inside the clean room, it has a limited
      ability to activate the data to the consumer’s account. Instead, to keep the results more secure, it is written to a table inside the
      clean room, and the consumer runs another template to read the results data.
   5. **Drop the intermediary tables.** Intermediary tables created during the calculation inside the clean room that are no longer needed
      are dropped before the Snowpark procedure finishes.
   6. **Return the name of the results table.** The name returned to the consumer must be specified when running the template to get the
      results, because results from all previous runs are retained.
2. The consumer then runs the `get_results` template, passing in the results table suffix returned by the first template to see the
   results.

To run the examples below, you need two accounts in the same web-hosting region (unless you’ve implemented
[cross-cloud auto-fulfillment](../v1/enabling-laf.md)): one account for the provider and another account for the
consumer.

The example code should run in a Snowflake worksheet without any additional Snowpark configuration. If you run in another environment, you
might need to install and configure the Snowpark Python API.

* [`Provider worksheet example`](../../../_downloads/ef9713d25f1026f7271faa3e2f571a1f/snowpark-udf-provider.sql)
* [`Consumer worksheet example`](../../../_downloads/26b752753607b54bbd64faa7c688d52e/snowpark-udf-consumer.sql)

### More information

* [Creating User-Defined Functions (UDFs) for DataFrames in Python](../../../developer-guide/snowpark/python/creating-udfs.md).
* [Snowpark API](../../../developer-guide/snowpark/index.md)
* [Snowpark Developer Guide for Python](../../../developer-guide/snowpark/python/index.md)

## Using Snowpark Container Services in a clean room

If you want greater control over the environment that executes your Python code, you can run a Snowpark Container Service within the
clean room. This gives you fine control over the execution environment for your code, and is ideal in use cases requiring specialized
compute, storage, or other resources to maximize performance and minimize cost, or to bring in custom packages or other environment
features.

When you host a container service in your clean room, your template and any custom Python code can call functions exposed by your service.
Using Snowpark Container Services is similar to using UDFs in Snowpark, except that your UDFs are exposed as HTTP endpoints for the
template to call. You will define the service and endpoints and upload it to the clean room.

Internally-hosted endpoints are accessible only by templates within the clean room, and cannot be called directly by the clean room
collaborators.

### Prerequisites

You’ll need to understand the following topics to be able to use Snowpark Container Services in a clean room:

* [How to create a clean room in code](../tutorials/cleanroom-api-tutorial-basic.md).
* [The Snowpark Python API](../../../developer-guide/snowpark/python/index.md) if using that API.
* [How to create custom templates](../custom-templates.md).
* Read about [designing complex usage flows.](../multistep-flows.md) to understand how to break up processing data
  and exposing results into separate steps.

### Basic steps

**Provider**

1. Create the service spec, code, and the endpoints that handle processing requests.
2. Create an image repository and grant access to SAMOOHA_APP_ROLE to that repository.
3. Capture the repository URL for the next step.
4. Build and upload the image to the repository URL.
5. Create the clean room, link data, add join policies, and add consumers in the standard way.
6. [Define the templates](../custom-templates.md) that call the service endpoints and upload them to your clean
   room. Service functions are created and called in the namespace `service_functions` (unlike UDFs, which are created and called in the
   namespace `cleanroom`).

   ```sqlexample
   -- Template to call an SPCS function named train.
   SELECT service_functions.train(
         {{ source_table[0] }},
         {{ provider_join_col }},
         {{ my_table[0] }},
         {{ consumer_join_col }},
         {{ dimensions | sqlsafe }},
       ) AS train_result;
   ```
7. Upload your service details into the clean room by calling `provider.load_service_into_cleanroom`. This defines the image URL,
   endpoints, and other service options. The endpoint names defined here must match your service spec, and are the names that your template
   uses to call the functions.

   ```sqlexample
   CALL samooha_by_snowflake_local_db.provider.load_service_into_cleanroom(
   $cleanroom_name,
   $$
   spec:
     containers:
     - name: lal
       image: /dcr_spcs/repos/lal_example/lal_service_image:latest
       env:
         SERVER_PORT: 8000
     endpoints:
     - name: lalendpoint
       port: 8000
       public: false
   $$,
   $$
   functions:
     - name: train
       args: PROVIDER_TABLE VARCHAR, PROVIDER_JOIN_COL VARCHAR, CONSUMER_TABLE VARCHAR, CONSUMER_JOIN_COL VARCHAR, DIMENSIONS ARRAY, FILTER VARCHAR
       returns: VARCHAR
       endpoint: lalendpoint
       path: /train
   $$);
   ```
8. Set the default release directive for your clean room. Whenever you upload or modify your service, it creates a new patch version.
9. Publish your clean room.
10. When making any changes to the image, functions, or code, you and the consumer must update your instances.

#### Consumer

1. Install the clean room and link in any data needed, in the standard way.
2. Create a compute pool and grant access to the clean room.
3. If you will be running queries (and you almost certainly will), you must also grant USAGE privileges to the clean room on the warehouse
   being used.
4. Start the service by calling `samooha_by_snowflake_local_db.consumer.start_or_update_service`, passing in the clean room name, the
   compute pool name, and the warehouse name (if a warehouse is used).
5. Examine the available endpoints to the service by running `SHOW ENDPOINTS IN SERVICE SAMOOHA_CLEANROOM_APP_clean_room_name.services.service;`
6. When the service is up and running, you can begin to run any clean room templates that access service endpoints by calling
   `consumer.run_analysis` in the standard way.

### Creating the compute pool

Depending on who should own and configure the pool, the provider can create the compute pool inside the clean room, or the consumer
can create the compute pool outside the clean room.

If the compute pool is created outside the clean room, you must grant proper privileges to the clean room to access the pool and create the
service as shown here:

```sqlexample
-- Grant access to a warehouse to run queries. Needed only if the service queries Snowflake accounts.
USE ROLE ACCOUNTADMIN;
GRANT USAGE ON WAREHOUSE APP_WH TO APPLICATION SAMOOHA_CLEANROOM_APP_<CLEANROOM_NAME>;

-- Grant SAMOOHA_APP_ROLE privileges to create compute pools and create services
GRANT CREATE COMPUTE POOL ON ACCOUNT TO ROLE SAMOOHA_APP_ROLE WITH GRANT OPTION;
GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE SAMOOHA_APP_ROLE WITH GRANT OPTION;

USE ROLE SAMOOHA_APP_ROLE;
-- Create the compute pool
CREATE COMPUTE POOL DCR_LAL_POOL
  FOR APPLICATION SAMOOHA_CLEANROOM_APP_<CLEANROOM_NAME>
  min_nodes = 1 max_nodes = 1
  instance_family = highmem_x64_l
  auto_resume = true;

-- Grant the clean room the privileges to access a pool running outside the clean room.
GRANT USAGE ON COMPUTE POOL DCR_LAL_POOL TO APPLICATION SAMOOHA_CLEANROOM_<CLEANROOM_NAME>;

-- Allow the clean room to create the service
GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO APPLICATION SAMOOHA_CLEANROOM_APP_<CLEANROOM_NAME>;
```

### Updating your service code or configuration

If the provider updates the image, service spec, or endpoint names or source code, both the provider and consumer must take the following
steps.

**1. Provider:**

1. Update the image or source code as needed.
2. Call `provider.load_service_into_cleanroom`, which returns a new patch number.
3. Call `provider.set_default_release_directive` with the new patch number.

**2. Consumer:**

* Call `consumer.start_or_update_service`.

### Monitoring your service

By default, consumers can monitor their service. This behavior can be changed using the `allow_monitoring` value in the
`service_config` argument of `provider.load_service_into_cleanroom`.

If consumer monitoring is enabled, the consumer can access the monitoring logs for a given clean room service (in the format
`SAMOOHA_CLEANROOM_APP_SPCS_cleanroom_name.services.service`), service ID, and container, as shown here:

```sqlexample
SELECT VALUE AS log_line
  FROM TABLE(
    SPLIT_TO_TABLE(SYSTEM$GET_SERVICE_LOGS(
        'SAMOOHA_CLEANROOM_APP_SPCS_Lookalike_Demo.services.service', 0, 'lal'), '\n')
  );
```

The consumer can also see the state of their service by using the DESCRIBE SERVICE command as shown here:

```sqlexample
-- See the state of the service.
DESCRIBE SERVICE SAMOOHA_CLEANROOM_APP_SPCS_Lookalike_Demo.services.service;
```

You can list the service endpoints by running `SHOW ENDPOINTS IN SERVICE SAMOOHA_CLEANROOM_APP_clean_room_name.services.service;`.
For example:

```sqlexample
SHOW ENDPOINTS IN SERVICE SAMOOHA_CLEANROOM_APP_SPCS_Lookalike_Demo.services.service;
```

### Example code

The following notebooks and zip file demonstrate how to use Snowflake Container Services in a clean
room. You need two accounts with clean rooms installed: One for the provider and one for the consumer. They should be in the same cloud
hosting region. Use the zipped configuration files to define the service.

* [`Provider notebook example`](../../../_downloads/109fc1643160d035866a43189b9565d0/spcs-provider.ipynb)
* [`Consumer notebook example`](../../../_downloads/c246bff46d86438d655c7a23e1afbc67/spcs-consumer.ipynb)
* [`Service spec and configuration files`](../../../_downloads/36c3670d9a0df2bb0358fac7e0d45255/spcs-spec-and-config.zip)

---
title: Using the developer APIs to execute templates sequentially
source: https://docs.snowflake.com/en/user-guide/cleanrooms/developer-template-chains.md
section: Clean Rooms
---

# Using the developer APIs to execute templates sequentially

Complex analyses might require that multiple templates be executed in a specific order, sometimes using the output of one template as the
input of another. A provider can create a *template chain* to define a sequence of templates to be executed in a particular order. When
defining this template chain, the provider can specify whether the results of a particular template will be available to subsequent
templates in the chain.

A clean room user executes a template chain to perform an analysis that runs the templates in the chain in their predefined order.

## About intermediary results

If a provider wants the results of one template to be available to subsequent templates in the template chain, they can create a cache for
the template’s results. Each template with a cache also has an expiration time for that cache.

If a provider specifies that a template has a cache, the first time a user executes the template chain, the results of that template are
stored in a table within the clean room. This underlying table is only accessible to the clean room itself. The next time a user executes
the template chain, Snowflake Data Clean Rooms checks whether the cache has expired before executing the template. The template with the
cached results does not execute again unless the cache has expired.

Subsequent templates in the template chain can use the cache as input by including the appropriate Jinja parameter in the template.

## Define a template chain

A provider uses the `provider.add_template_chain` command to create a template chain. The templates that the provider wants to add
to the new template chain must exist before creating the template chain.

The `provider.add_template_chain` command accepts the following arguments:

* Name of a clean room (string).
* Name of the template chain (string).
* Templates in the template chain (array of JSON objects).

For an example of using the `provider.add_template_chain` command to create a template chain, see
Example.

### Adding templates to the template chain

The provider defines which templates are part of a template chain by passing an array of JSON objects into
`provider.add_template_chain`, where each JSON object represents a template. The order of the JSON objects determines the order in
which the templates are executed.

The JSON object for a template can include the following fields:

`template_name` (string)
:   Specifies the template being added to the template chain. The template must already exist.

    This field is required.

`cache_results` (boolean)
:   Determines whether the results of the template are cached so other templates in the template
    chain can access them. To cache results, specify TRUE.

    This field is required. If TRUE, the `output_table_name` and `cache_expiration_hours` fields are also required.

`output_table_name` (string)
:   When `cache_results = TRUE`, specifies the name of the Snowflake table where template results are stored.

    This field is required if `cache_results = TRUE`.

`jinja_output_table_param` (string)
:   When `cache_results = TRUE`, specifies the name of the Jinja parameter that other templates must include to accept the results that
    are stored in `output_table_name`.

    This field is optional.

`cache_expiration_hours` (integer)
:   When `cache_results = TRUE`, specifies the number of hours before the results in the cache are dropped. When the cache expires, then
    next time the template chain is executed the cache is refreshed with the results of the template.

    This field is required if `cache_results = TRUE`.

### Example

In this example, the provider wants to:

* Create a template chain `insights_chain` in the clean room `collab_clean_room`.
* Define the template chain so the `crosswalk` template executes before the `transaction_insights` template.
* Cache the results of the `crosswalk` template so they can be used as input to the `transaction_insights` template.

```sqlexample
CALL samooha_by_snowflake_local_db.provider.add_template_chain(
  'collab_clean_room',
  'insights_chain',
  [
    {
      'template_name': 'crosswalk',
      'cache_results': True,
      'output_table_name': 'crosswalk',
      'jinja_output_table_param': 'crosswalk_table_name',
      'cache_expiration_hours': 2190
    },
    {
      'template_name': 'transaction_insights',
      'cache_results': False
    }
  ]
);
```

For more information about each JSON object, see Adding templates to the template chain.

## Execute a template chain

A clean room user runs the `consumer.run_analysis` command to execute a template chain, which is the same command used to execute a
single template. Executing the template chain runs each template in the chain in their predefined order to get the final result.

The `consumer.run_analysis` command accepts arguments that it passes to the Jinja templates in the template chain. You can determine
what arguments are expected by the templates in the chain by executing the `consumer.get_arguments_from_template_chain` command.

The arguments passed to `consumer.run_analysis` can be specific to a particular template in the chain or can be arguments for every
template in the chain.

Universal arguments
:   If you want to pass an argument to every template in the template chain, the syntax is the same as using `consumer.run_analysis` to
    run a single template. For example, the following command passes the value of the `where_clause` argument to all templates in the
    template chain:

    ```sqlexample
    CALL samooha_by_snowflake_local_db.consumer.run_analysis(
      'collab_clean_room',
      'insights_chain',
      ['MY_CONSUMER_DB.C_SCHEMA.CONVERSIONS'],
      ['PROVIDER_DB.P_SCHEMA.EXPOSURES'],
      object_construct(
        'where_clause', 'p.EMAIL=c.EMAIL'
      )
    );
    ```

Template-specific arguments
:   If you want to pass an argument to a specific template, add another `object_construct` as a child of the top-level
    `object_construct` with the name of the template as the field name. For example, the following command passes the value of the
    `dimensions` argument to the `crosswalk_template` template only:

    ```sqlexample
    CALL samooha_by_snowflake_local_db.consumer.run_analysis(
      'collab_clean_room',
      'insights_chain',
      ['MY_CONSUMER_DB.C_SCHEMA.CONVERSIONS'],
      ['PROVIDER_DB.P_SCHEMA.EXPOSURES'],
      object_construct(
        'where_clause', 'p.EMAIL=c.EMAIL',
        'crosswalk_template', object_construct(
          'dimensions', ['p.CAMPAIGN']
        )
      )
    );
    ```

## Template chain commands

You can use the following commands to work with template chains:

| Command | Description |
| --- | --- |
| `provider.add_template_chain` | Creates a new template chain. |
| `provider.view_added_template_chains`  `consumer.view_added_template_chains` | Returns all template chains that have been added to the clean room. |
| `provider.view_template_chain_definition`  `consumer.view_template_chain_definition` | Returns the definition of a template chain. |
| `provider.clear_template_chain` | Drops a template chain from the clean room. |
| `provider.clear_all_template_chains` | Drops all template chains from the clean room. |
| `consumer.get_arguments_from_template_chain` | Returns the expected arguments for all of the templates in the template chain. |

For more information about these commands, see the following:

* [Snowflake Data Clean Rooms: Provider API reference guide](provider.md)
* [Snowflake Data Clean Rooms: Consumer API reference guide](consumer.md).

## Snowsight UI

Snowsight web interface: worksheets, notebooks, dashboards, data explorer, and activity monitoring.

---
title: About Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks.md
section: Snowsight UI
---

# About Legacy Snowflake Notebooks

> **Attention:**
>
> The original Snowflake Notebooks have been renamed to **Legacy Notebooks**. Starting in March 2026, Snowflake
> will roll out the ability to import legacy notebooks into Workspaces. [Notebooks in Workspaces](notebooks-in-workspaces/notebooks-in-workspaces-overview.md)
> is the next generation of Snowflake Notebooks, providing Jupyter compatibility and full integration with Workspaces. For a comparison of the two experiences, see [Key differences between legacy and new notebooks](notebooks-in-workspaces/notebooks-in-workspaces-migrate.md).
>
> Snowflake will migrate all users to Notebooks in Workspaces over the next few quarters. A Behavior Change Request (BCR) will be issued
> before any mandatory migration is enforced. Snowflake will communicate the deprecation timeline and process in advance, before any action
> is required of your account.

Snowflake Notebooks is a unified development interface in Snowsight that offers an interactive, cell-based programming environment for Python, SQL, and Markdown.
In Notebooks, you can leverage your Snowflake data to perform exploratory data analysis, develop machine learning models, and perform other data science and
data engineering workflows, all within the same interface.

* Explore and experiment with data already in Snowflake, or upload new data to Snowflake from local files, external cloud storage, or
  datasets from the Snowflake Marketplace.
* Write SQL or Python code and quickly compare results with cell-by-cell development and execution.
* Interactively visualize your data using embedded Streamlit visualizations and other libraries like Altair, Matplotlib, or seaborn.
* Integrate with Git to collaborate with effective version control. See [Sync notebooks with a Git repository](notebooks-snowgit.md).
* Contextualize results and make notes about different results with Markdown cells and charts.
* Run your notebook on a schedule to automate pipelines. See [Schedule notebook runs](notebooks-schedule.md).
* Make use of the role-based access control and other data governance functionality available in Snowflake to allow other users
  with the same role to view and collaborate on the notebook.

> **Note:**
>
> Private Notebooks are deprecated and no longer supported. The new Snowflake Notebooks experience in [Workspaces](workspaces.md) offers a similar private development
> environment with improved capabilities. If you’re interested in enrolling in the preview, contact your Snowflake account team for more information.

## Legacy Notebook runtimes

Snowflake Notebooks offer two distinct runtimes, each designed for specific workloads: Warehouse Runtime and Container Runtime. Notebooks utilize
compute resources from either virtual warehouses (for Warehouse Runtime) or Snowpark Container Services compute pools (for Container Runtime)
to execute your code. For both runtimes, SQL and Snowpark queries are always executed on the warehouse for optimized performance.

The Warehouse Runtime offers the fastest way to start, with a familiar and generally available warehouse environment. The Container
Runtime provides a more flexible environment that can support many different types of workloads, including SQL analytics and data
engineering. You can install additional Python packages if the Container Runtime doesn’t include what you need by default. Container
runtime also comes in CPU and GPU versions that have many popular ML packages pre-installed, making them ideal for ML and deep learning
workloads.

The following table shows supported features for each type of runtime. You can use this table to help decide which runtime is the right
choice for your use case.

| Supported Features | Warehouse Runtime | Container Runtime |
| --- | --- | --- |
| Compute | Kernel runs on the notebook warehouse. | Kernel runs on a [compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) node. |
| Environment | Python 3.9 | Python 3.10 (Preview) |
| Base image | Streamlit + Snowpark | Snowflake Container Runtime (CPU and GPU images pre-installed with Python libraries). |
| Additional Python libraries | Install using Snowflake Anaconda or from a Snowflake stage. | Install using `pip`, `conda`, or from a Snowflake stage. | If needed, specify a particular package version. |
| Editing support | Python, SQL, and Markdown cells. | Reference outputs from SQL cells in Python cells and vice versa. | Use visualization libraries like Streamlit. | Same as warehouse |
| Access | Ownership required to access and edit notebooks. | Same as warehouse |
| Supported Notebook features (still in Preview) | Git integration (Preview) | Scheduling (Preview) | Same as warehouse |

For details on creating, running, and managing notebooks on Container Runtime, see [Notebooks on Container Runtime](../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

## Explore Legacy Notebooks

The Snowflake Notebooks toolbar provides the controls used to manage the notebook and adjust cell display settings.

| Control | Description |
| --- | --- |
|  | Package selector: Select and install packages for use in the notebook. See [Import Python packages to use in notebooks](notebooks-import-packages.md). |
|  | Start: Start the Notebooks session. When the session starts, the image changes to Active. |
|  | Active: Hover over the button to view real-time session details and aggregated resource consumption metrics (memory usage and CPU/GPU utilization metrics are displayed for Container Runtime notebooks). Select the down arrow to access options to restart or end the session. Select Active to end the current session. |
|  | Run All/Stop: Run all cells or stop cell execution. See [Run cells in Snowflake Notebooks](notebooks-develop-run.md). |
|  | Scheduler: Set a schedule to run your notebook as a task in the future. See [Schedule notebook runs](notebooks-schedule.md). |
|  | Vertical ellipsis menu: Customize notebook settings, clear cell outputs, duplicate, export, or delete the notebook. |

### Collapse cells in a notebook

You can collapse the code in a cell to see only the output. For example, collapse a Python cell to show only the visualizations produced
by your code, or collapse a SQL cell to show only the results table.

* To change what is visible, select Collapse results.
  :   The drop-down offers options to collapse specific parts of the cell.

---
title: Best practices for shared workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces-shared-best-practices.md
section: Snowsight UI
---

# Best practices for shared workspaces

The following recommendations detail how administrators can effectively plan, configure, and maintain shared workspaces.

## Plan and set up shared workspaces

Shared workspaces are created as schema-level objects within a standard database and follow Snowflake’s existing role-based access control (RBAC) model.

To create a shared workspace, users must either own the target schema or have the following privileges:

> * USAGE on the database and schema
> * CREATE WORKSPACE on the schema

### Grant creation privileges

1. As an administrator, to grant the necessary schema-level privileges to a user, run the following command:

   > ```sqlsyntax
   > GRANT USAGE ON DATABASE <database_name>.<schema_name> TO ROLE <role_name>;
   > GRANT CREATE WORKSPACE ON SCHEMA <database_name>.<schema_name> TO ROLE <role_name>;
   > ```

## Workspace management commands

Use the following commands to monitor and manage access to existing shared workspaces:

| Action | Syntax |
| --- | --- |
| List all workspaces | `SHOW WORKSPACES IN ACCOUNT;` |
| View workspace permissions | `SHOW GRANTS ON WORKSPACE <workspace_name>;` |
| Grant edit permissions | `GRANT WRITE ON WORKSPACE <workspace_name> TO ROLE <role_name>;` |
| Revoke edit permissions | `REVOKE WRITE ON WORKSPACE <workspace_name> FROM ROLE <role_name>;` |

**Example**

```sqlexample
GRANT WRITE ON WORKSPACE my_workspace;

REVOKE WRITE ON WORKSPACE my_workspace;
```

## Governance best practices

When enabling shared workspaces in your account, consider the following best practices:

* Plan intentionally: Align shared workspaces with specific teams, projects, or use cases. Fewer, well-defined workspaces reduce clutter and user confusion.
* Limit creation privileges: Restrict CREATE WORKSPACE privileges to designated steward roles and schema owners. Broadly granting this privilege can lead to unnecessary
  duplication or workspace sprawl.
* Monitor workspace lifecycle: Periodically review existing shared workspaces and retire stale or unused ones. Establish a lightweight review process
  (for example, quarterly) to ensure that only active and relevant workspaces remain available.

## Organizational models

Administrators can structure shared workspaces in different ways depending on their organization’s collaboration model.

### Centralized collaboration hub

A single database and schema dedicated to shared workspaces for all teams provides a consistent location for cross-team collaboration.

**Example setup**

```sqlexample
CREATE DATABASE IF NOT EXISTS SHARED_WORKSPACES_DB;
CREATE SCHEMA IF NOT EXISTS SHARED_WORKSPACES_SCHEMA;

GRANT USAGE ON DATABASE SHARED_WORKSPACES_DB.SHARED_WORKSPACES_SCHEMA TO ROLE WORKSPACES_STEWARDS;
GRANT CREATE WORKSPACE ON SCHEMA SHARED_WORKSPACES_DB.SHARED_WORKSPACES_SCHEMA TO ROLE WORKSPACES_STEWARDS;
```

**Example structure**

### Team-scoped workspaces

Each team owns its own database or schema and manages shared workspaces within its scope. This model fits organizations that already align
databases and roles by department, discipline, or business unit.

**Example structure**

### Hybrid approach

Use a combined, central schema for cross-team or high-visibility projects with team-specific schemas for daily collaboration. This model balances
flexibility with centralized governance and discoverability.

## Role design and access management considerations

* Shared workspaces can only be shared with **roles** (not individual users).
* Most organizations can use their existing roles to manage access. Avoid creating new roles solely for shared workspaces unless necessary.

### Best practices

* Use existing roles that already represent team membership or function.
* Assign a designated steward role responsible for managing access and maintaining the workplace structure.

## Adoption and maintenance

* **Naming conventions:** To improve discoverability, use clear and consistent patterns such as `TEAM_PROJECT_NAME`.
* **Ownership:** Assign a steward or owner role to each shared workspace to ensure accountability.
* **Documentation:** Maintain an internal directory or wiki listing active shared workspaces and their intended purpose.
* **Consistency:** Encourage users to move from private to shared workspaces when code or queries are ready for collaboration.
* **Review regularly:** Periodically audit roles, schemas, and shared workspaces to ensure that they remain aligned with organizational policies and team structures.

---
title: Compute setup for Snowflake Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-compute-setup.md
section: Snowsight UI
---

# Compute setup for Snowflake Notebooks in Workspaces

## Setting up compute

When a user runs a notebook, the user creates a Snowflake-managed notebook service to host the notebook kernel and execute code.

When creating a notebook service, users can configure the Python version, Snowflake Container Runtime version, compute pool, idle timeout, external
access integrations, and optionally customize the service name.

Each notebook service is scoped to a single user and occupies one node on the selected compute pool. All notebooks connected to the same service
share the compute resources on that node. If a notebook requires dedicated compute resources, create a separate notebook service and avoid attaching
additional notebooks to it.

## Managing a notebook service

### Suspend

You can manually suspend a notebook service by clicking Connected, hovering over the service name, and selecting Suspend (pause icon).

Alternatively, you can wait for the service to reach its idle timeout setting and it will suspend automatically. For details on how idle time is
calculated, see Idle timeout.

Suspending a service disconnects all notebooks connected to it, clears in-memory states, and removes all packages and variables. Any files created from
code or the terminal in the Workspace file system and the `/tmp` directory are also removed.

> **Note:**
>
> Writing files to the Workspace directory from code or the terminal is not supported. For information on persisting files, see
> [Working with the file system](../notebooks-work-with-files.md).

### Resume

To resume a suspended service, connect a notebook to it or run a notebook that has previously been connected to it.

### Drop

Administrators can drop a notebook service.

SQLSnowsight

To drop a notebook service via SQL:

```sqlexample
DROP USER$DB_NAME.PUBLIC.[SERVICE_NAME];
```

To drop a notebook service using Snowsight:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. Select the Connected dropdown.
3. Select Manage service to go to the Services & jobs page.
4. Select the ellipsis for the service you want to drop, then select Drop.

## Editing a notebook service

A notebook service can be updated after being created to change:

1. External Access Integrations.
2. Runtime version.
3. Idle timeout.

Changes to (1) or (2) suspend then restart the service. Changing the idle timeout does not restart the service.

## Idle timeout

Each notebook service defines its own idle timeout. The service is suspended when the idle time is reached. Idle time begins as soon as all running
cells across all connected notebooks have finished. If multiple notebooks share the same service, idle time starts only when the last notebook
becomes idle (no cells running).

By default, notebook services have an idle timeout of 24 hours. You can configure the idle timeout when creating or updating a notebook service
to better align with your usage patterns and cost optimization strategies.

> **Note:**
>
> To change idle timeout values (including the default value) for notebook services in your account, contact your Snowflake account team or Snowflake Support.

## Credit usage

Notebook execution can incur credits from two sources:

* **Compute pool:** Powers the notebook kernels and Python processes.

  Credits accrue while the notebook service is in the RUNNING state until it is manually suspended, or suspended due to idle timeout. All notebooks
  connected to the same service share the compute pool credits consumed.
* **Query warehouse:** Used for SQL queries or Snowpark pushdown compute triggered by the notebook.

  Credits accrue only when SQL queries or Snowpark pushdown compute operations run on the warehouse. To optimize costs, enable auto-suspend on the
  query warehouse. Notebooks that do not invoke any SQL queries or Snowpark pushdown compute incur no query warehouse credits.

For more information on cost optimization and maximizing value, see [Optimizing cost](../../cost-optimize.md).

## Governance on notebook services

Notebook services are personal to each user, used exclusively for running notebooks, and located within the user’s Personal Database (PDB).

### Privileges

#### Ownership

The OWNER_ROLE is NULL because Snowflake manages these services.

#### User privileges

The creating user is granted the following privileges:

* USAGE
* OPERATE
* DROP
* MONITOR

#### Administrator privileges

ACCOUNTADMIN is granted the following privileges:

* USAGE
* OPERATE
* DROP

This allows full management and oversight of all notebook services.

## Administrator control and cost monitoring on compute pools

Administrators manage user access and costs primarily through the compute pools associated with notebook services.

A user’s role must have the USAGE privilege on a compute pool to create a notebook service and run notebooks. In addition, the compute pool must
allow the `NOTEBOOK` workload type through the `ALLOWED_SPCS_WORKLOAD_TYPES` parameter. The default value for this parameter is
`ALL`, which includes `NOTEBOOK`.

To learn more about compute pool workloads, see [Snowpark Container Services: Working with compute pools](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

### Disable notebook execution

Administrators can restrict notebook execution in Workspaces in multiple ways:

#### Remove USAGE on the compute pool

Removing the USAGE privilege from a role on a compute pool prevents that role from using that compute pool, including running notebooks.

#### Restrict workload types on all compute pools

Administrators can restrict notebook execution while still permitting other workloads using two account-level parameters. This will affect
all roles in the account.

* Exclude `NOTEBOOK` from the `ALLOWED_SPCS_WORKLOAD_TYPES` parameter.
* Set `NOTEBOOK` as the `DISALLOWED_SPCS_WORKLOAD_TYPES` parameter.

Any role that has USAGE on the compute pool can still run other allowed types of workloads as specified by the parameters.

### Monitor costs

Administrators can monitor consumption per compute pool. Snowflake recommends provisioning a unique compute pool for each role to view role-level
consumption. To manage spend, administrators can apply budgets on specific compute pools.

### View notebook-managed services

Use the SHOW SERVICES command:

```sqlexample
SHOW SERVICES OF TYPE NOTEBOOK;
```

## Service maintenance

Notebook services are a type of Snowpark Container Services and require periodic maintenance to remain secure and up to date. Maintenance typically
takes about five minutes and suspends and restarts the notebook service. See Managing a notebook service
for details on workload impact.

After a notebook service enters the `RUNNING` state (whether newly created or resumed after being in `SUSPENDED` state), it is guaranteed
not to be disrupted for seven calendar days (168 hours) due to service maintenance. After seven days of creation, the service may be suspended for mandatory
maintenance.

## Multi-node Distributed Training Support (with Snowflake Container Runtime=2.3)

This notebook is optimized for Snowflake Container Runtime 2.3, which introduces support for multi-node clusters. This allows you to scale your ML workloads (like PyTorch, XGBoost, and LightGBM) across multiple nodes for faster training.

Use the doc link to get more information on running ML workloads on multi-node clusters, refer to ths documentation: <https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime-multi-node>

---
title: Create a notebook
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-create.md
section: Snowsight UI
---

# Create a notebook

This topic describes how to create a Snowflake notebook on Warehouse Runtime. You can also create a notebook on Container Runtime. For details,
see [Notebooks on Container Runtime](../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

Snowflake Notebooks provide an interactive, cell-based development environment within Snowsight. They enable you to work with Snowflake data
using SQL and Python in a single interface, making it easier to build and iterate on workflows for data exploration, transformation, and machine
learning.

You can access notebooks through [Snowsight](../ui-snowsight-gs.md), where you can Create a new notebook or Open an existing notebook. You
can also create a notebook using SQL. For more information, see [CREATE NOTEBOOK](../../sql-reference/sql/create-notebook.md).

## Prerequisites

* You have [set up and enabled notebooks](notebooks-setup.md).
* You are using a role with the [required privileges](notebooks-setup.md).

## Runtimes

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

When creating a notebook using the Warehouse Runtime, you specify a name, location, and warehouse. In this preview, you can also select
a specific pre-configured runtime environment for your notebook. Using a Snowflake default runtime environment ensures that your notebook runs
in a consistent setting, which supports reproducible results. This setup does not require initial configuration and is ready to use immediately.

The Snowflake Warehouse Runtime environment consists of the following components:

| Snowflake Warehouse Runtime version | Python runtime | Streamlit version |
| --- | --- | --- |
| 1.0 | 3.9 | 1.39.1 |
| 2.0 | 3.10 | 1.39.1 |

All new notebooks are defaulted to the Python 3.9 runtime (Warehouse Runtime 1.0).

> **Note:**
>
> If you install packages on top of the Snowflake runtime, Snowflake can no longer guarantee compatibility across your environment.

## Create a new notebook

You can create a new notebook by selecting + Notebook, or you can import a file with the `*.ipynb` extension. This could be
a notebook file created from an application outside of Snowflake.

**To create a new notebook,** follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Select + Notebook.
4. Enter a name for your notebook. Snowflake preserves the exact casing of the notebook name as entered, including names that contain spaces.
   Notebook names are case-sensitive.
5. Select a Notebook location. This is the database and schema in which to store your notebook. These cannot be changed after you
   create the notebook.

   > **Note:**
   >
   > The Notebook location list might not show databases that were created after you opened the Create Notebook dialog. If you can’t
   > find your recently created database, schema, or warehouse, reload your browser window.

   Querying data in the notebook is not restricted to this location. In the notebook, you can query data in any location you have access to.
   To specify the location, run [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) and [USE SCHEMA](../../sql-reference/sql/use-schema.md).
6. Select Run on warehouse as your Python environment. For details on what is included in each runtime, see [Legacy Notebook runtimes](notebooks.md).

   For details on Container Runtime, see [Notebooks on Container Runtime](../../developer-guide/snowflake-ml/notebooks-on-spcs.md).
7. Optional: Select a Query warehouse to run any SQL and Snowpark queries issued by the notebook.
8. Select a Notebook warehouse to run notebook-specific tasks. Snowflake recommends that you use [SYSTEM$STREAMLIT_NOTEBOOK_WH](../warehouses-overview.md),
   a Snowflake-managed warehouse that is provisioned in each account for running notebooks.

   > **Note:**
   >
   > By default, notebooks are suspended after a period of inactivity. The default idle timeout depends on the runtime:
   >
   > * **Warehouse Runtime notebooks:** 30 minutes (1,800 seconds) of inactivity
   > * **Container Runtime notebooks:** 60 minutes (3,600 seconds) of inactivity
   >
   > You can set the idle timeout to a maximum of 72 hours (259,200 seconds). To update the idle timeout setting, use either the CREATE NOTEBOOK
   > or ALTER NOTEBOOK commands to set the value of the IDLE_AUTO_SHUTDOWN_TIME_SECONDS property.
   >
   > You can change the idle timeout setting after creation from the notebook settings. For more information, see [Idle time and reconnection](notebooks-setup.md).
9. Select Create to create and open your notebook.

**To create a new notebook from an existing file,** follow these steps:

1. Select the down arrow next to + Notebook and then select Import .ipynb file.
2. Open the file to import, such as a notebook file that was created from an application outside of Snowflake.

   > **Note:**
   >
   > If your notebook imports Python packages, you must add the packages to the notebook before you can run the imported notebook. See
   > [Import Python packages to use in notebooks](notebooks-import-packages.md). If the package you use in your imported notebook is not available, your code might not run. For
   > information about adding cells, see [Develop and run code in Snowflake Notebooks](notebooks-develop-run.md).

## Create a notebook using SQL

You can create a notebook using the [CREATE NOTEBOOK](../../sql-reference/sql/create-notebook.md) command. This command lets you define the notebook’s location, main
file, and version source programmatically. However, when you create a notebook using SQL, the notebook does not automatically include a live
version. A live version is required in order to run the notebook using the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) command.

If you attempt to run a notebook that does not have a live version, or if the notebook was dropped and recreated, you may see the following
error:

```output
Live version is not found.
```

To resolve this, add a live version to the notebook before executing it, as shown in the following example:

```sqlexample
ALTER NOTEBOOK DB_NAME.SCHEMA_NAME.NOTEBOOK_NAME ADD LIVE VERSION FROM LAST;
```

* `DB_NAME` is the name of the database that contains the notebook
* `SCHEMA_NAME` is the name of the schema that contains the notebook
* `NOTEBOOK_NAME` is the name of the notebook

## Create a notebook from a Git repository

You can sync your notebook development with a Git repository. Then you can create Snowflake Notebooks from notebooks in that Git repository.

To create a notebook from a file in Git, see [Create a notebook from a file in a Git repository](notebooks-snowgit.md).

## Duplicate an existing notebook

You can duplicate existing Snowflake Notebooks. Duplicating notebooks may be useful if you want to, for example, test out some code changes
without altering the original notebook version.

When you duplicate a notebook, the copied notebook is created with the same role and warehouse as the original notebook, and is contained
in the same database and schema as the original notebook. Because of this, you cannot duplicate a notebook to move it to a different
database and schema, or to change ownership.

To duplicate a notebook, complete the following steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Open the notebook that you want to duplicate.
4. Select the vertical ellipsis  menu, then select Duplicate.
5. (Optional) Enter a name for the duplicate notebook, then select Duplicate.
6. In the confirmation dialog, select Close to return to the original notebook, or Open notebook to open the duplicate
   notebook.

## Open an existing notebook

To open an existing notebook, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.

   > **Note:**
   >
   > Recently used notebooks also appear in Snowsight. Under Recently viewed, select Notebooks.
3. Review the list of notebooks.

   You can see all notebooks owned by your active role or owned by a role inherited by your active role. Each notebook displays the following information:

   * Title: The title of the notebook
   * Viewed: The last time the notebook was viewed
   * Updated: The last time the notebook was executed
   * Environment: The runtime environment for the notebook (Container Runtime or Warehouse Runtime)
   * Location: The database and schema locations for the notebook
   * Owner: The owner of the notebook
4. Select a notebook to open it for editing.

   For details about editing notebooks, see [Develop and run code in Snowflake Notebooks](notebooks-develop-run.md).

When you open a notebook, you can see cached results from the last time you ran any cells in the notebook. The notebook is in the
Not connected state by default, but if you select that state or run any cell, your notebook connects to your virtual warehouse.

---
title: Develop and run code in Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-develop-run.md
section: Snowsight UI
---

# Develop and run code in Snowflake Notebooks

This topic describes how to write and run SQL, Python, and Markdown code in Snowflake Notebooks.

## Notebook cell basics

This section introduces some basic cell operations. When you [create a notebook](notebooks-create.md), three example cells are
displayed. You can modify those cells or add new ones.

### Create a new cell

Snowflake Notebooks support three types of cells: SQL, Python, and Markdown. To create a new cell, you can either hover over an existing cell or
scroll to the bottom of the notebook, then select one of the buttons for the cell type you want to add.

Change the language of an existing cell by using one of the following methods:

* Select the language dropdown menu and then select a different language.
* Use keyboard shortcuts.

### Edit a cell

To prevent editing conflicts, only one user can edit a cell at a time. If another user attempts to edit an active cell, a notification will be
displayed. The cell will become available for editing after 60 seconds of inactivity.

### Move cells

You can move a cell either by dragging and dropping the cell using your mouse or by using the actions menu:

1. (Option 1) Hover your mouse over the existing cell you want to move. Select the  (drag and drop) icon on the left side of the cell
   and move the cell to its new location.
2. (Option 2) Select the vertical ellipsis  (actions) menu. Then select the appropriate action.

> **Note:**
>
> To just move the focus between cells, use the `Up` and `Down` arrows.

### Delete a cell

To delete a cell, complete the following steps in a notebook:

1. Select the vertical ellipsis  (more actions) menu.
2. Select Delete.
3. Select Delete again to confirm.

You can also use a keyboard shortcut to delete a cell.

For considerations when using Python and SQL cells, see Considerations for running notebooks.

## Run cells in Snowflake Notebooks

To run Python and SQL cells in Snowflake Notebooks, you can:

* **Run a single cell:** Choose this option when making frequent code updates.

  + Press `CMD` + `return` on a Mac keyboard, or `CTRL` + `Enter` on a Windows keyboard.
  + Select , or Run this cell only.
* **Run all cells in a notebook in sequential order:** Choose this option before presenting or sharing a notebook to ensure that the recipients
  see the most current information. This option executes all SQL and Python code cells in the notebook from top to bottom. If an error occurs
  in any cell, execution will halt and subsequent cells will not run. This behavior also applies to scheduled notebooks. For example, if you
  run a notebook that has 10 cells, and in cell 2 there is a SQL syntax error, the notebook will stop running after cell 2.

  + Press `CMD` + `shift` + `return` on a Mac keyboard, or `CTRL` + `Shift` + `Enter` on a Windows keyboard.
  + Select Run all.
* **Run a cell and advance to the next cell:** Choose this option to run a cell and move on to the next cell more quickly.

  + Press `shift` + `return` on a Mac keyboard, or `Shift` + `Enter` on a Windows keyboard.
  + Select the vertical ellipsis  (more actions) for a cell, and choose Run cell and advance.
* **Run all above**: Choose this option when running a cell that references the results of earlier cells.

  + Select the vertical ellipsis  (more actions) for a cell, and choose Run all above.
* **Run all below**: Choose this option when running a cell that later cells depend on. This option runs the current cell and all following
  cells.

  + Select the vertical ellipsis  (more actions) for a cell, and choose Run all below.

When one cell is running, other run requests are queued and will be executed once the actively running cell finishes.

### Collapse and expand cells

You can control how much of the notebook is visible by selecting one of the cell display options at the top of the notebook:

1. Select the vertical ellipsis  (more actions) menu.
2. Select Show/hide all and choose the appropriate option:

   * **Show all:** Displays both code and results for each cell.
   * **Show code only:** Hides the results and displays only the code cells.
   * **Show results only:** Hides the code and displays only the output.
   * **Hide all:** Collapses both code and results for all cells.

These options are helpful when:

* You want to focus on reading code or reviewing results.
* You are presenting or sharing your notebook.
* You need to navigate large notebooks more efficiently.

### Duplicate cells

Duplicating a cell can help with the following:

* Testing variations of a query or function.
* Debugging without overwriting the working version.
* Comparing different outputs side by side.
* Reusing code or modifying an existing cell without losing the original.

To duplicate a notebook cell:

1. From the cell to duplicate, select the vertical ellipsis  (more actions) menu.
2. Select Duplicate.

   A copy of the cell appears immediately below the original.

### Cell minimap

The cell minimap appears in the right sidebar of the notebook and provides a compact, draggable list of all cells in the notebook. Each entry in the minimap corresponds to a code or text cell and reflects the order in which the cells appear.

* **Current cell:** The selected cell is highlighted in the minimap.
* **Reordering:** Drag and drop items in the minimap to quickly change the order of cells in the notebook.
* **Navigation:** Click a cell name in the minimap to jump directly to that cell.

This feature is useful for navigating large notebooks and reorganizing content more efficiently.

## Running notebooks with parameters

When you use the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) command to run a notebook, you can pass arguments to the notebook.
In a Python cell in the notebook, you can access these arguments by using the `sys.argv` variable, which is a built-in Python list that holds command-line arguments.

Passing arguments to notebooks allows you to customize notebook behavior. You can:

* Personalize or customize notebook execution.
* Reuse the same notebook for multiple inputs.
* Support automation or task scheduling.

### Examples

In a Python cell in the notebook, you can access the arguments by using the `sys.argv` variable.

#### View all arguments passed to the notebook

Print the full list of arguments passed to the notebook.

```python
import sys
print(sys.argv)
```

If the notebook is executed with this command:

```sqlexample
EXECUTE NOTEBOOK MY_DATABASE.PUBLIC.MY_NOTEBOOK(
  'parameter_string a,b,c,d',
  'target_database=PROD_DB'
);
```

The output will be:

```output
['parameter_string', 'a,b,c,d', 'target_database=PROD_DB']
```

#### Print each argument

Loop through and print each argument individually.

```python
for arg in sys.argv:
    print(arg)
```

The output will be:

```output
parameter_string
a,b,c,d
target_database=PROD_DB
```

#### Access a specific argument

Access the second argument.

```python
second_param = sys.argv[1]
print(second_param)
```

The output will be:

```output
a,b,c,d
```

#### Parse an argument containing comma-separated values

If an argument contains a comma-separated list of values, you can split it into individual values.

```python
value_list = sys.argv[1].split(",")
print(value_list)
```

The output will be:

```output
['a', 'b', 'c', 'd']
```

You can also loop through the values:

```python
for value in value_list:
  print(value)
```

#### Extract an argument containing a key-value pair

If an argument includes a key-value pair (for example, `key=value`), extract the value.

```python
target_database = sys.argv[2].split("=")
print(target_database[1])
```

The output will be:

```output
PROD_DB
```

#### Alternate syntax for a single string

You can set a [session variable](../../sql-reference/session-variables.md) to the value of an argument and pass the session variable to the notebook.

```sqlexample
SET PARAMS = 'parameter_string a,b,c,d';
EXECUTE NOTEBOOK MY_DATABASE.PUBLIC.MY_NOTEBOOK($PARAMS);
```

### View results from a parameterized run

To view the result of a notebook run that was triggered using [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md):

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Select the Calendar icon.
4. Select View run history.
5. Find the notebook execution and open the result.

   A read-only notebook opens containing the result of that run.

### Notes

* `sys.argv` contains only the strings passed via [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md).
* Only strings are supported. If another data type (such as an integer) is passed, it will be interpreted as NULL. For more information,
  see [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md).

## Inspect cell status

The status of the cell run is indicated by the colors displayed by the cell. This status color is displayed in two places, the left wall of
the cell and in the right cell navigation map.

Cell status color:

* Blue dot: The cell was modified but hasn’t run yet.
* Red: The cell ran in the current session and an error occurred.
* Green: The cell ran in the current session without errors.
* Moving green: The cell is currently running.
* Gray: The cell has run in a previous session and the results shown are from the previous session. Cell results from the previous
  interactive session are kept for 7 days. Interactive session means the user runs the notebook in an interactive manner in Snowsight
  rather than those that were run by a schedule or the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) SQL command.
* Blinking gray: The cell is waiting to be run after you select Run All.

> **Note:**
>
> Markdown cells do not show any status.

After a cell finishes running, the time it took to run is displayed at the top of the cell. Select this text to view the run details,
including start and end times and total elapsed time.

SQL cells contain additional information, such as the warehouse used to run the query, rows returned, and a hyperlink to the query ID page.

### Stop a running cell

To stop the execution of any code cells that are currently running, select Stop on the top right of the cell. You can also select
Stop on the top right of the Notebooks page. While cells are running, Run all becomes Stop.

This stops the execution of the cell that is currently running and all subsequent cells that have been scheduled to run.

## Keyboard shortcuts

Snowflake Notebooks support various keyboard shortcuts to help accelerate your development process.

You can also see the list of keyboard shortcuts by selecting the keyboard icon at the bottom right corner, and then
selecting Keyboard shortcuts.

| Task | MacOS | Windows |
| --- | --- | --- |
| Run all cells | `CMD` + `Shift` + `Return` | `CTRL` + `Shift` + `Enter` |
| Run the selected cell | `CMD` + `Return` | `CTRL` + `Enter` |
| Run the selected cell and advance to the next cell | `Shift` + `Return` | `Shift` + `Enter` |
| Move between cells | `Up` and `Down` arrows | `Up` and `Down` arrows |
| Stop all cells | `ii` | `ii` |
| Find within the cell | `CMD` + `f` | `CTRL` + `f` |
| Move cell up | `CMD` + `SHIFT` + `Up` arrow | `CTRL` + `SHIFT` + `Up` arrow |
| Move cell down | `CMD` + `SHIFT` + `Down` arrow | `CTRL` + `SHIFT` + `Down` arrow |
| Add a cell above the currently selected cell | `a` | `a` |
| Add a cell below the currently selected cell | `b` | `b` |
| Delete the currently selected cell | `dd` or `DELETE` | `dd` or `DELETE` |
| Convert a SQL or Python cell into a Markdown cell | `m` | `m` |
| Convert a cell into a code cell:  * Change a Markdown cell to a Python cell * Change a Python cell to a SQL cell * Change a SQL cell to a Python cell | `y` | `y` |
| Show keyboard shortcuts | `Shift` + `?` | `Shift` + `?` |

In addition, you can use the same keyboard shortcuts that you use for worksheets. See [Perform tasks with keyboard shortcuts](../ui-snowsight-worksheets.md).

## Format text with Markdown

To include Markdown in your notebook, add a Markdown cell:

1. Use a keyboard shortcut and select Markdown, or select + Markdown.
2. Select the Edit markdown pencil icon or double-click the cell, and start writing Markdown.

You can type valid Markdown to format a text cell. As you type, the formatted text appears below the Markdown syntax.

To view only the formatted text, select the Done editing checkmark icon.

> **Note:**
>
> Markdown cells currently do not support rendering of HTML.

### Markdown basics

This section describes basic Markdown syntax to get you started.

**Headers**

| Heading level | Markdown syntax | Example |
| --- | --- | --- |
| Top level | ```markdown # Top-level Header ``` |  |
| 2nd-level | ```markdown ## 2nd-level Header ``` |  |
| 3rd-level | ```markdown ### 3rd-level Header ``` |  |

**Inline text formatting**

| Text format | Markdown syntax | Example |
| --- | --- | --- |
| Italics | ```markdown *italicized text* ``` |  |
| Bold | ```markdown **bolded text** ``` |  |
| Link | ```markdown [Link text](url) ``` |  |

**Lists**

| List type | Markdown syntax | Example |
| --- | --- | --- |
| Ordered list | ```markdown 1. first item 2. second item   1. Nested first   2. Nested second ``` |  |
| Unordered list | ```markdown - first item - second item   - Nested first   - Nested second ``` |  |

**Code formatting**

| Language | Markdown syntax | Example |
| --- | --- | --- |
| Python | ```markdown ```python import pandas as pd df = pd.DataFrame([1,2,3]) ``` ``` |  |
| SQL | ```markdown ```sql SELECT * FROM MYTABLE ``` ``` |  |

**Embed images**

| File type | Markdown syntax | Example |
| --- | --- | --- |
| Image | ```markdown  ``` |  |

For a notebook that demonstrates these Markdown examples, see the [Markdown cells](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks.ipynb) section of the
visual data stories notebook.

## Understanding cell outputs

When you run a Python cell, the notebook displays the following types of output from the cell are displayed in the results:

* Any results written to the console, such as logs, errors, and warnings and output from print() statements.
* DataFrames are automatically printed with
  [Streamlit’s interactive table display](https://docs.streamlit.io/develop/api-reference/data/st.dataframe), `st.dataframe()`.

  + The supported DataFrame display types include pandas DataFrame, Snowpark DataFrames, and Snowpark Tables.
  + For Snowpark, printed DataFrames are evaluated eagerly without the need to run the `.show()` command. If you prefer not to evaluate the
    DataFrame eagerly, for example when running the notebook in non-interactive mode, Snowflake recommends removing the DataFrame
    print statements to speed up the overall runtime of your Snowpark code.
* Visualizations are rendered in outputs. To learn more about visualizing your data, see [Visualize data in Snowflake Notebooks](notebooks-visualize-data.md).

Additionally, you can access the results of your SQL query in Python and vice versa. See Reference cells and variables in Snowflake Notebooks.

### Cell output limits

Only 10,000 rows or 8 MB of DataFrame output is shown as cell results, whichever is lower. However, the entire DataFrame is still available in
the notebook session for use. For example, even though the entire DataFrame isn’t rendered, you can still perform data transformation tasks.

For each cell, only 20 MB of output is allowed. If the size of the cell output exceeds 20 MB, the output will be dropped. Consider splitting
the content into multiple cells if that happens.

## Reference cells and variables in Snowflake Notebooks

You can reference the previous cell results in a notebook cell. For example, to reference the result of a SQL cell or the value
of a Python variable, see the following tables:

> **Note:**
>
> The cell name of the reference is case-sensitive and must exactly match the name of the referenced cell.

**Referencing SQL output in Python cells:**

| Reference cell type | Current cell type | Reference syntax | Example |
| --- | --- | --- | --- |
| SQL | Python | `cell1` | Convert a SQL results table to a Snowpark DataFrame.  If you have the following in a SQL cell called `cell1`:  ```sqlexample SELECT 'FRIDAY' as SNOWDAY, 0.2 as CHANCE_OF_SNOW UNION ALL SELECT 'SATURDAY',0.5 UNION ALL SELECT 'SUNDAY', 0.9; ```  You can reference the cell to access the SQL result:  ```python snowpark_df = cell1.to_df() ```  Convert the result to a pandas DataFrame:  ```python my_df = cell1.to_pandas() ``` |

**Referencing variables in SQL code:**

> **Important:**
>
> In SQL code, you can only reference Python variables of type `string`. You cannot reference a Snowpark DataFrame, pandas DataFrame or
> other Python native DataFrame format.

| Reference cell type | Current cell type | Reference syntax | Example |
| --- | --- | --- | --- |
| SQL | SQL | `{{cell2}}` | For example, in a SQL cell named `cell1`, reference the cell results from `cell2`:  ```sqlexample SELECT * FROM {{cell2}} where PRICE > 500 ``` |
| Python | SQL | `{{variable}}` | For example, in a Python cell named `cell1`:  **Using Python variable as a value**  ```python c = "USA" ```  You can reference the value of the variable `c` in a SQL cell named `cell2` by enclosing it in single quotes to ensure that it is treated as a value:  ```sqlexample SELECT * FROM my_table WHERE COUNTRY = '{{c}}' ```  **Using Python variable as an identifier**  If the Python variable represents a SQL identifier like a column or table name:  ```python column_name = "COUNTRY" ```  If the Python variable represents a SQL identifier, such as a column or table name (`column_name = "COUNTRY"`), you can reference the variable directly without quotes:  ```python SELECT * FROM my_table WHERE {{column_name}} = 'USA' ```  Make sure to differentiate between variables used as values (with quotes) and as identifiers (without quotes).  Note: Referencing Python DataFrames is not supported. |

## Considerations for running notebooks

* Notebooks run using caller’s rights. For additional considerations, see [Changing the session context for a notebook](notebooks-sessions.md).
* You can import Python libraries to use in a notebook. For details, see [Import Python packages to use in notebooks](notebooks-import-packages.md).
* When referencing objects in SQL cells, you must use fully qualified object names, unless you are referencing object names in a specified
  database or schema. See [Changing the session context for a notebook](notebooks-sessions.md).
* Notebook drafts are saved every three seconds.
* You can use [Git integration](notebooks-snowgit.md) to maintain notebook versions.
* You can configure an idle timeout setting to automatically shut down the notebook session once the setting is met. For information,
  see [Idle time and reconnection](notebooks-setup.md).
* Notebook cell results are only visible to the user who ran the notebook and are cached across sessions. Reopening a notebook displays
  past results from the last time the user ran the notebook using Snowsight.
* [BEGIN … END (Snowflake Scripting)](../../sql-reference/snowflake-scripting/begin.md) is not supported in SQL cells. Instead, use the
  [Session.sql().collect()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.Session.sql)
  method in a Python cell to run the scripting block. Chain the `sql` call with a call to `collect` to immediately execute
  the SQL query.

  The following code runs a Snowflake scripting block using the `session.sql().collect()` method:

  ```python
  from snowflake.snowpark.context import get_active_session
  session = get_active_session()
  code_to_run = """
  BEGIN
      CALL TRANSACTION_ANOMALY_MODEL!DETECT_ANOMALIES(
          INPUT_DATA => SYSTEM$REFERENCE('TABLE', 'ANOMALY_INFERENCE'),
          TIMESTAMP_COLNAME =>'DATE',
          TARGET_COLNAME => 'TRANSACTION_AMOUNT',
          CONFIG_OBJECT => {'prediction_interval': 0.95}
      );

      LET x := SQLID;
      CREATE TABLE ANOMALY_PREDICTIONS AS SELECT * FROM TABLE(RESULT_SCAN(:x));
  END;
  """
  data = session.sql(code_to_run).collect(block=True);
  ```

---
title: Editing and running Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-edit-run.md
section: Snowsight UI
---

# Editing and running Notebooks in Workspaces

## Set the execution context

Notebooks in Workspaces do not automatically set a database or schema. To query data, you must define the execution context in a cell using the
following SQL commands:

```sqlsyntax
USE DATABASE <database>;
USE SCHEMA <schema>;
```

To ensure notebooks run consistently across environments and clients, use fully qualified names for tables and other objects. For example:

```sqlexample
-- Query data objects using a fully qualified name
SELECT * FROM TABLE <database_name.schema_name.table_name>;

-- Create a table using a fully qualified name
WITH filtered_events AS (
    SELECT
        user_id,
        event_type,
        event_timestamp
    FROM raw_events
    WHERE event_timestamp >= '2025-01-01'
)
CREATE OR REPLACE TABLE <database_name.schema_name.table_name> AS
SELECT *
FROM filtered_events;
```

## Use the role and warehouse picker

You can set the active role and warehouse for your notebook.

SnowsightSQL

Use the picker at the top left of the Notebooks editor:

Run the following SQL commands:

```sqlsyntax
USE ROLE <role>;
USE WAREHOUSE <warehouse>;
```

The query warehouse is used to run SQL queries and Snowpark pushdown compute invoked by the notebook. It is also used to render the interactive
datagrid, but there is no credit charge for this operation.

To learn more about credit usage, see [Setting up compute](notebooks-in-workspaces-compute-setup.md).

## Create a Snowpark session

Snowpark is a Snowflake developer framework that lets you build data pipelines, transformations, and machine learning logic directly inside
Snowflake without moving data out of the platform. It provides APIs that operate on Snowflake data as DataFrames, pushing computation down to
Snowflake’s engine for scalability, performance, and security.

To use [Snowpark Python APIs](../../../developer-guide/snowpark/python/index.md) in Notebooks, first create a Snowpark session in a Python cell:

```python
from snowflake.snowpark.context import get_active_session
session = get_active_session()
```

## Run cells

There are four supported execution options:

* Run all cells
* Run one single cell
* Run current cell and all above cells (via the cell’s ellipsis menu)
* Run current cell and all below cells (via the cell’s ellipsis menu)

### Cancel cell execution

Use Stop at the top of the notebook or Cancel execution in a cell.

Both actions stop the currently executing cell and any queued cells triggered by Run all.

> **Note:**
>
> The Run all button may temporarily change to Stop when the notebook is connecting or reconnecting to the service.

## Cell names

You can assign names to cells to make navigation easier and provide contextual labels.

If an imported `.ipynb` file already contains name or title metadata, those values are used automatically.

## Cell referencing

Bidirectional SQL to Python cell referencing allows you to reuse results and variables across cells in either language, enabling seamless transitions
between SQL and Python workflows.

You can hover over the result tooltip to see the DataFrame name you can use to reference the result in Python and SQL.

### Referencing SQL cell results

Each SQL cell exposes its result as a pandas DataFrame pointer named `dataframe_x`.

* In SQL, reference it using double curly braces: `{{dataframe_1}}`.
* In Python, reference it directly as a pandas DataFrame: `dataframe_1`.

### Referencing Python variables

To reference Python variables in SQL queries, wrap them in double curly braces. For example:

```sqlexample
SELECT * FROM {{uploaded_df}} WHERE "price" > 326;
```

DataFrame variables are also supported when referencing Python variables in SQL.

### Example workflow

**Python cell**

```python
import pandas as pd

uploaded_df = pd.read_csv("../data/diamonds.csv")
uploaded_df
```

**SQL cell referencing Python variable**

```sqlexample
SELECT * FROM {{uploaded_df}} WHERE "price" > 326;
```

**SQL cell referencing SQL cell results**

The result of a SQL cell provides a DataFrame pointer called `dataframe_1`. You can reference it in another SQL query:

```sqlexample
SELECT * FROM {{dataframe_1}} WHERE "carat" < 1.0
UNION ALL
SELECT * FROM {{dataframe_2}} WHERE "carat" >= 1.0;
```

## Interactive datagrid

The datagrid supports:

* Scrolling
* Search
* Filtering
* Sorting
* Chart creation without code

### Built-in chart builder

Provides a consistent user experience for data manipulation and visualization across editing surfaces in Workspaces.

## Minimap and cell status

The minimap generates a table of contents from Markdown headers and displays a comprehensive in-session status for each cell (running, succeeded,
failed, and modified).

## Global search and replace

You can search for keywords across all cells in the current notebook. If you’re editing a particular cell, press `esc` to exit the edit mode for that cell first.

To search keywords across all cells in the current notebook, do the following:

* To search for keywords, select Search in the minimap, or use the keyboard shortcut `CTRL` + `F`.

  Matching keywords in all cells are shown. Optionally, you can replace the search term with the desired value using Replace next or Replace all.

## Notebook kernel

The notebook kernel remains active as long as the notebook service is in the `RUNNING` state, allowing uninterrupted execution of critical,
long-running processes such as ML training and data engineering jobs.

Actions that do not affect kernel execution:

* Navigating to other pages
* Working elsewhere in Snowsight
* Closing your browser
* Shutting down your computer

You can shut down or restart the kernel using the Connected dropdown.

> **Note:**
>
> Using Shut down kernel or Restart kernel will clear variables in memory but retain any user-installed packages. If you want a completely clean
> environment with only the pre-installed packages, you must restart the service or create a new service and connect to it.

If the notebook service is suspended, the notebook kernel is also shut down. For more information, see [Setting up compute](notebooks-in-workspaces-compute-setup.md).

## Cell output

* Cell outputs in a notebook in Workspaces (both private and shared workspaces) are accessible to the user who executed the notebook.
* Cell outputs are not saved to the `.ipynb` file. To export and share outputs, choose Export as HTML. For interactive sessions in
  Workspaces, Export as HTML can be accessed from the ellipsis menu in the top right of each notebook file. For scheduled notebooks, it can
  be accessed in each past execution’s result page.
* The exported HTML file has the following behaviors:

  + The collapsed state of each cell’s code and output is saved.
  + Tables and DataFrames are capped at 1,000 rows and default to the Table view. You can toggle to Chart and configure it in the HTML file.

## Jupyter magics

Notebooks in Workspaces run the IPython (Interactive Python) kernel and provide standard Jupyter cell and line magics. Run `%lsmagic` to view available magics.

For example, you can use the `%run` magic command to invoke another notebook:

* In a Python cell of `notebook_a`, call `%run path/to/notebook_b.ipynb`. This executes `notebook_b` in the same Python process as `notebook_a`.
* For variables and pandas DataFrames in `notebook_b` to render in `notebook_a` cell results, make sure to explicitly print them. For example:
  `print(var)` or `display(df)`.

## Developer tools

Developer tools include the Terminal, the Scratchpad, and the Variables Explorer. These tools allow you to explore and interact with your data
and the notebook environment.

To access the developer tools, in the control bar at the top of the notebook, select <icon>:ui:`Tools`.

You must be connected to a notebook service to use the developer tools. Switching to a different service will restart the tools.

### Using the Terminal

The Terminal lets you run any shell command in the notebook’s container environment:

* Install dependencies - `pip install`, `pip list`, or check installed packages.
* Manage files - `ls`, `pwd`, navigate directories, and view files.
* Run parallel jobs
* Monitor compute resource usage

Example for installing and running `htop` for monitoring compute resource usage in real time:

```bash
# If installation fails, run `apt update` first
# Install `htop`
apt install htop

# Run `htop`
htop
```

### Using the Scratchpad

The Scratchpad is an exploratory space for you to quickly experiment — for example, with code, ideas, calculations, or notes — without worrying about structure or polish.
Commands that you execute in the Scratchpad do not change the notebook file.

You can do the following in the Scratchpad:

* Quick ad-hoc queries - Test SQL without adding cells to your notebook.
* Data exploration - Verify table contents, schemas, or run exploratory queries.
* Debugging - Verify data or test query fragments before adding them to a notebook cells.
* One-off operations - Run commands that don’t need to be saved (such as SHOW GRANTS or DESCRIBE TABLE).

Results stay visible while you work but aren’t saved with the notebook.

### Using the Variables Explorer

The Variables Explorer is a visual tool that lets you inspect the variables currently loaded in your session while you are working interactively.
It shows the Name, Type, Shape, and Preview for each variable. Variables are updated when a cell finishes running.

---
title: Experience Snowflake with Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-use-with-snowflake.md
section: Snowsight UI
---

# Experience Snowflake with Legacy Snowflake Notebooks

Snowflake Notebooks is a development environment that you can use with other Snowflake features. This topic describes ways to leverage other
Snowflake features within notebooks.

## Snowpark Python in notebooks

The [Snowpark library](../../developer-guide/snowpark/index.md) provides an intuitive API for querying and processing data in a data pipeline.
Using the Snowpark library, you can build applications that process data in Snowflake without moving data to the system where your
application code runs. You can also automate data transformation and processing by writing stored procedures and scheduling those
procedures as tasks in Snowflake.

You can use Snowpark to query and process data at scale in Snowflake by writing Snowpark code in a Python cell of your notebook.

### Example usage

Snowpark Python comes pre-installed in the Snowflake Notebooks environment. The following example uses the Snowpark library in a notebook
to read in a CSV file and a Snowflake table and display its contents as output.

1. In your notebook, add a Python cell, either using a [keyboard shortcut](notebooks-develop-run.md) or by selecting
   + Python.
   Snowflake Notebooks and Snowpark both support Python 3.9.
2. Set up a Snowpark session.
   In notebooks, the session context variable is preconfigured. You can use the `get_active_session` method to get the session context variable:

   > ```python
   > from snowflake.snowpark.context import get_active_session
   > session = get_active_session()
   > ```
3. Use Snowpark to load a CSV file into a Snowpark DataFrame from a stage location. This example uses a stage called `tastybyte_stage`.

   > ```python
   > df = session.read.options({"infer_schema":True}).csv('@TASTYBYTE_STAGE/app_order.csv')
   > ```
4. Load an existing Snowflake table, `app_order`, into the Snowpark DataFrame.

   > ```python
   > df = session.table("APP_ORDER")
   > ```
5. Display the Snowpark DataFrame.

   > ```python
   > df
   > ```

> **Note:**
>
> Outside of the Snowflake Notebooks environment, you must call `df.show()` to print out the DataFrame. In Snowflake Notebooks,
> DataFrames are evaluated eagerly when `df` is printed out. The DataFrame is printed out as an interactive Streamlit DataFrame display
> (st.dataframe). DataFrames output is limited to 10,000 rows or 8 MB, whichever is lower.

### Snowpark limitations

* A Snowflake Notebook creates a Snowpark session, so you can use most of the methods available in a Snowpark session class.
  However, because a notebook runs inside Snowflake rather than in your local development environment, you cannot use the following methods:

  > + session.add_import
  > + session.add_packages
  > + session.add_requirements
* Some Snowpark Python operations don’t work with SPROCs. For a complete list of operations, see [Python stored procedure limitations](../../developer-guide/stored-procedure/python/procedure-python-limitations.md).

> **Tip:**
>
> View more examples of notebooks that use Snowpark:
>
> * [Data Engineering Pipelines with Snowpark Python](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Data%20Engineering%20Pipelines%20with%20Snowpark%20Python/Data%20Engineering%20Pipelines%20with%20Snowpark%20Python.ipynb)
> * [Adding CSV files notebook](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Load%20CSV%20from%20S3/Load%20CSV%20from%20S3.ipynb)

> **Note:**
>
> These quickstarts are only shown as examples. Following along with the example may require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Snowflake does not guarantee the accuracy of these examples or
> cover them under any Service Level Agreement.

## Streamlit in notebooks

[Streamlit](../../developer-guide/streamlit/about-streamlit.md) is an open-source Python library that makes it easy to create and share
custom web apps for machine learning and data science. You can build interactive data applications with Streamlit directly in your
notebook. You can test and develop your app directly in a notebook. Streamlit comes preinstalled in notebooks, so you can start quickly.

### Example usage

Streamlit comes pre-installed with the Snowflake Notebooks environment. The example in this section creates an interactive data app using Streamlit.

1. Import necessary libraries

   > ```python
   > import streamlit as st
   > import pandas as pd
   > ```
2. First create some sample data for the app.

   > ```python
   > species = ["setosa"] * 3 + ["versicolor"] * 3 + ["virginica"] * 3
   > measurements = ["sepal_length", "sepal_width", "petal_length"] * 3
   > values = [5.1, 3.5, 1.4, 6.2, 2.9, 4.3, 7.3, 3.0, 6.3]
   > df = pd.DataFrame({"species": species,"measurement": measurements,"value": values})
   > df
   > ```
3. Set up your interactive slider from the Streamlit library.

   > ```python
   > st.markdown("""# Interactive Filtering with Streamlit! :balloon:
   >             Values will automatically cascade down the notebook cells""")
   > value = st.slider("Move the slider to change the filter value 👇", df.value.min(), df.value.max(), df.value.mean(), step = 0.3 )
   > ```
4. Finally, display a filtered table based on the slider value.

   > ```python
   > df[df["value"]>value].sort_values("value")
   > ```

You can interact with the app in real time from the notebook. See the filtered table change based on the value you set on the slider.

> **Tip:**
>
> For the complete example, see the interactive data app section of the [Visual Data Stories with Snowflake Notebooks](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks.ipynb) notebook.

### Streamlit support in notebooks

Mapbox and Carto provide map tiles when you use the [`st.map`](https://docs.streamlit.io/develop/api-reference/charts/st.map) or [`st.pydeck_chart`](https://docs.streamlit.io/develop/api-reference/charts/st.pydeck_chart) Streamlit commands.

In warehouse runtimes, which manage their packages with conda, Mapbox and Carto are third-party
applications that are subject to Snowflake’s
[External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).

To use these commands in warehouse runtimes, you must acknowledge the External Offerings Terms.
Container runtimes don’t require this acknowledgement.

The following Streamlit elements are not supported in Notebooks:

* [st.set_page_config](https://docs.streamlit.io/1.39.0/develop/api-reference/configuration/st.set_page_config)

  > The `page_title`, `page_icon`, and `menu_items` properties of the
  > `st.set_page_config` command are not supported.

## Container Runtime notebooks

Container Runtime provides software and hardware options to support advanced data science and machine learning workloads. For
details on Container Runtime, see [Notebooks on Container Runtime](../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

## Snowflake ML Registry in notebooks

The [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md) allows you to securely manage models and your
metadata in Snowflake, regardless of origin. The model registry stores machine learning models as first-class schema-level objects in
Snowflake so they can easily be found and used by others in your organization. You can create registries, and store models in them, using
classes in the Snowpark ML library. Models can have multiple versions, and you can designate a version as the default.

### Example usage

To use the Snowflake ML registry, install the `snowflake-ml-python` library for your notebook:

1. From your notebook, select Packages at the top.
2. Search for the snowflake-ml-python package and select the library to install it.

Here is an example of how you can use the Snowflake ML Registry to log a model:

```python
from snowflake.ml.registry import Registry
# Create a registry and log the model
native_registry = Registry(session=session, database_name=db, schema_name=schema)

# Let's first log the very first model we trained
model_ver = native_registry.log_model(
    model_name=model_name,
    version_name='V0',
    model=regressor,
    sample_input_data=X, # to provide the feature schema
)

# Add evaluation metric
model_ver.set_metric(metric_name="mean_abs_pct_err", value=mape)

# Add a description
model_ver.comment = "This is the first iteration of our Diamonds Price Prediction model. It is used for demo purposes."

# Show Models
native_registry.get_model(model_name).show_versions()
```

> **Tip:**
>
> View this [end-to-end example](https://www.youtube.com/watch?v=LeSGBW0YoLg) of how to use Snowflake ML Registry.

## pandas on Snowflake in notebooks

[pandas on Snowflake](../../developer-guide/snowpark/python/pandas-on-snowflake.md) lets you run your pandas code in a distributed manner
directly on your data in Snowflake. Just by changing the import statement and a few lines of code, you can get the same familiar pandas-native
experience with the scalability and security benefits of Snowflake.

With pandas on Snowflake, you can work with much larger datasets and avoid the time and expense of porting your pandas pipelines to other
big data frameworks or provisioning large and expensive machines. It runs workloads natively in Snowflake through transpilation to SQL,
enabling it to take advantage of parallelization and the data governance and security benefits of Snowflake.

pandas on Snowflake is delivered through the Snowpark pandas API as part of the Snowpark Python library, which enables scalable data
processing of Python code within the Snowflake platform.

### Example usage

Snowpark pandas is available in Snowpark Python version 1.17 and later. Snowpark Python comes pre-installed with the Snowflake Notebooks environment.

1. To install Modin, select `modin` from Packages and ensure that the version is 0.28.1 or later.
2. To set the pandas version, select `pandas` from Packages and ensure that the version is 2.2.1.

In a Python cell, import Snowpark Python and Modin:

> ```python
> import modin.pandas as pd
> import snowflake.snowpark.modin.plugin
> ```

1. Create a Snowpark session:

   ```python
   from snowflake.snowpark.context import get_active_session
   session = get_active_session()
   ```
2. Start using the Snowpark Python API:

   ```python
   # Create a Snowpark Pandas DataFrame with sample data.
   df = pd.DataFrame([[1, 'Big Bear', 8],[2, 'Big Bear', 10],[3, 'Big Bear', None],
                       [1, 'Tahoe', 3],[2, 'Tahoe', None],[3, 'Tahoe', 13],
                       [1, 'Whistler', None],['Friday', 'Whistler', 40],[3, 'Whistler', 25]],
                       columns=["DAY", "LOCATION", "SNOWFALL"])
   # Drop rows with null values.
   df.dropna()
   # Compute the average daily snowfall across locations.
   df.groupby("LOCATION").mean()["SNOWFALL"]
   ```

> **Tip:**
>
> For more examples of how to use pandas on Snowflake, see
> [Getting Started with pandas on Snowflake](https://quickstarts.snowflake.com/guide/getting_started_with_pandas_on_snowflake/#0).

## Snowflake Python API in notebooks

The [Snowflake Python API](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) is a unified library that seamlessly
connects Python with Snowflake workloads. It is intended to provide comprehensive APIs for interacting with Snowflake resources across data
engineering, Snowpark, Snowpark ML, and application workloads using a first-class Python API.

You can use the Snowflake Python API to manage Snowflake resources by creating, deleting, or modifying them, and more. You can use Python
to perform tasks you might otherwise perform with [Snowflake SQL commands](../../sql-reference-commands.md).

In Notebooks, the session context variable is preconfigured. You can use the `get_active_session` method to get the session context variable:

> ```python
> from snowflake.snowpark.context import get_active_session
> session = get_active_session()
> ```

Create a `Root` object from which to use the Snowflake Python API:

> ```python
> from snowflake.core import Root
> api_root = Root(session)
> ```

Here is an example of how you can create a database and schema using the Python API:

> ```python
> # Create a database and schema by running the following cell in the notebook:
> database_ref = api_root.databases.create(Database(name="demo_database"), mode="orreplace")
> schema_ref = database_ref.schemas.create(Schema(name="demo_schema"), mode="orreplace")
> ```
>
> > **Tip:**
> >
> > For a more detailed example of how to use Snowflake’s Python API, see the [Creating Snowflake object using Python API notebook example](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Creating%20Snowflake%20Object%20using%20Python%20API/Creating%20Snowflake%20Object%20using%20Python%20API.ipynb) on Github.

---
title: Getting started with Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-get-started.md
section: Snowsight UI
---

# Getting started with Legacy Snowflake Notebooks

To start experimenting with Snowflake Notebooks, sign in to [Snowsight](../ui-snowsight-gs.md), and [set up your account to use notebooks](notebooks-setup.md).
In the navigation menu, select Projects » Notebooks. A list of notebooks that you have access to in your account is displayed.
You can either create a new notebook from scratch or upload an existing `.ipynb` file.

The following table shows the topics to review if you’re new to Snowflake Notebooks:

| Getting started guides |  |
| --- | --- |
|  | [Setting up Snowflake Notebooks](notebooks-setup.md) Instructions for developers and admins before using Notebooks. |
|  | [Create a notebook](notebooks-create.md) Create a new notebook from scratch or from an existing file. |
|  | [Develop and run code in Snowflake Notebooks](notebooks-develop-run.md) Create, edit, execute Python, SQL, and Markdown cells. |

## Developer guides

| Guide | Description |
| --- | --- |
| [Session context in notebooks](notebooks-sessions.md) | Accessing and modifying the session context. |
| [Saving results in notebooks](notebooks-save-share.md) | Saving notebooks and results across sessions. |
| [Import Python packages to use in notebooks](notebooks-import-packages.md) | Importing Python packages from Anaconda channel. |
| [Visualize and Interact with your data in Notebook](notebooks-visualize-data.md) | Visualize data with matplotlib, plotly, or altair, and develop a data app with Streamlit. |
| [Cell and variable referencing in Notebook](notebooks-develop-run.md) | Reference SQL cell output and Python variable values. |
| [Keyboard shortcuts for Notebooks](notebooks-develop-run.md) | Leverage keyboard shortcuts to navigate and streamline the editing experience. |

## Leveling up your notebook workflows

| Guide | Description |
| --- | --- |
| [Sync Snowflake Notebooks with Git](notebooks-snowgit.md) | Version control your notebook for collaboration and development. |
| [Work with files in notebooks](notebooks-work-with-files.md) | Manage and work with files in your notebook environment. |
| [Schedule notebook runs](notebooks-schedule.md) | Schedule notebooks to run and execute code within Snowflake. |
| [Troubleshoot errors in Snowflake Notebooks](notebooks-troubleshoot.md) | Troubleshoot errors that may occur while you’re using Snowflake Notebooks. |

## Quickstarts

* [Getting Started with Your First Snowflake Notebook](https://quickstarts.snowflake.com/guide/getting_started_with_snowflake_notebooks/) [[Video](https://www.youtube.com/watch?v=tpg35YgA9Gk&list=PLavJpcg8cl1Efw8x_fBKmfA2AMwjUaeBI&index=3)] [[Source](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/My%20First%20Notebook%20Project/My%20First%20Notebook%20Project.ipynb)]

  Learn how to get started with your first notebook project in less than 10 minutes.
* [Visual Data Stories with Snowflake Notebooks](https://quickstarts.snowflake.com/guide/visual_data_stories_with_snowflake_notebooks/index.html) [[Video](https://www.youtube.com/watch?v=WJUNTudCsYM&list=PLavJpcg8cl1Efw8x_fBKmfA2AMwjUaeBI&index=4)] [[Source](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks/Visual%20Data%20Stories%20with%20Snowflake%20Notebooks.ipynb)]

  Learn how you can create compelling data narratives using visualizations, Markdown, images, and interactive data apps all within your notebook, alongside your code and data.

## Highlighted use cases

Check out highlighted use cases for data science, data engineering, and ML/AI in [Github](https://github.com/Snowflake-Labs/notebook-demo).

Getting started guides

| Guide | Description |
| --- | --- |
|  | [Setting up Snowflake Notebooks](notebooks-setup.md) Instructions for developers and admins before using Notebooks. |
|  | [Create a notebook](notebooks-create.md) Create a new notebook from scratch or from an existing file. |
|  | [Develop and run code in Snowflake Notebooks](notebooks-develop-run.md) Create, edit, and execute Python, SQL, and Markdown cells. |

> **Note:**
>
> These quickstarts are only shown as examples. Following along with the example may require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Snowflake does not guarantee the accuracy of these examples or
> cover them under any Service Level Agreement.

## Additional resources

* For notebook demos, tutorials, and examples, see the collection of Snowflake Notebooks demos in [GitHub](https://github.com/Snowflake-Labs/notebook-demo).
* To view tutorial videos, see the Snowflake Notebooks [YouTube playlist](https://www.youtube.com/playlist?list=PLavJpcg8cl1Efw8x_fBKmfA2AMwjUaeBI).
* To learn about SQL commands to create, execute, and show notebooks, see Snowflake Notebooks [API reference](../../sql-reference/commands-notebook.md).
* Looking for reference architectures, industry-specific use cases and solutions best practices using Notebooks? See [Notebooks examples](https://developers.snowflake.com/solutions/?_sft_technology=notebooks) in the Snowflake Solution Center.

---
title: Import Python packages to use in notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-import-packages.md
section: Snowsight UI
---

# Import Python packages to use in notebooks

Snowflake Notebooks manages the Python packages used in your notebook environment. You can import
[third-party packages listed in the Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/). For information on importing
packages in Container Runtime, see [Notebooks on Container Runtime](../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

## Considerations for importing packages

* Packages that you add to a notebook are available only to that notebook. If you want to use the same package in a different
  notebook, you must add the same packages again to that notebook.
* After you add a new package, you must restart the notebook session. Snowflake recommends that you add your package at the top of your notebook at the start
  of your analysis.

## Pre-installed packages

By default, Snowflake Notebooks use Python 3.9. Notebook environments come pre-packaged with common libraries for data science and machine learning,
such as altair, pandas, numpy, [snowflake-snowpark-python](../../developer-guide/snowpark/python/index.md), and [Streamlit](https://docs.streamlit.io/library/api-reference).

## Import packages from Anaconda

After your organization administrator [accepts the terms](notebooks-setup.md), you can import libraries to use in
Snowflake Notebooks.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Select a specific notebook for which you want to install Python packages.
4. Select Packages menu at the top of your notebook.
5. Search for packages [listed in the Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/).
6. Select a package to install it for use in your notebook, and optionally change the default package version
   in the list of Installed Packages.

   Packages installed by you appear under Installed Packages.

After the package is added, it may take some time to be installed. After it is installed, you will see a confirmation message and you can then
import and use the libraries in a Python cell.

## Import packages from a Snowflake stage

On both Warehouse and Container Runtime, you can import packages from a stage if the package you need is not part of the pre-installed
packages and is not available in the Anaconda channel.

The following limitations apply:

* The package importer only works for Python modules and folders.
* `.tar.gz` files are not supported.
* Wheel files are not supported on Warehouse Runtime.

Follow these steps to add additional packages:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Select a specific notebook for which you want to install Python packages.
4. Select the Packages menu at the top of your notebook.
5. Select the Stage Packages tab.
6. Enter the path to the file on your stage.

After the package is added, you can now import and use the libraries in a Python cell.
See this in action in the [import packages from stage](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Import%20Package%20from%20Stage/Import%20Package%20from%20Stage.ipynb) tutorial notebook.

Now that all your packages are installed, [start coding in your notebook](notebooks-develop-run.md).

---
title: Integrate workspaces with a Git repository
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces-git.md
section: Snowsight UI
---

# Integrate workspaces with a Git repository

> **Important:**
>
> Starting in September 2025, Snowflake is gradually upgrading accounts from Worksheets to Workspaces. Workspaces will become the default
> SQL editor. For more information, see [Defaulting accounts from Worksheets to Workspaces](../../release-notes/bcr-bundles/un-bundled/bcr-2117.md).

## Overview

Workspaces can be local to Snowflake, or you can sync workspaces in development with a branch in a Git repository. In Workspaces, you can:

* Create a workspace that is connected to a Git repository.
* Create a new branch, switch branches, or fetch a remote branch.
* Pull the latest changes from your Git repository into your workspace.
* Track any added, updated, or deleted files.
* Commit and push updated files back to your Git repository.
* View and resolve any conflicts directly in Workspaces.

### Create a Git workspace

To develop and maintain files directly in Snowsight, you can create a workspace connected to a Git repository.

> **Note:**
>
> A Git repository must contain at least one branch; empty repositories aren’t supported.

To create a new Git-synced workspace, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Workspaces menu, select From Git repository.
4. Copy the URL from your Git repository (for example, `https://www.github.com/my-user/my-repo-name`), and then paste it into the Repository URL field.
5. Optional: Rename the new Git-synced workspace.
6. In the API Integration menu, select an API integration.

   The API integration must allow access to the Git repository URL you used in step 4. Creating an API integration requires the
   [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md) privilege, which is often restricted to admin roles in many
   accounts. If another role created the API integration, the current role must have the USAGE privilege on that API integration.
7. Select an authentication method:

   * OAuth2 - To use OAuth2 for authentication, you must configure the API integration to support OAuth with your Git provider. For more information,
     see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md). Complete the following steps:

     1. Select Sign in to authenticate with your GitHub repository.
     2. Select Configure next to the account you want to use, then select Authorize next to Snowflake Computing to allow the
        `snowflakedb` app to access your repository.
     3. Under Permissions, ensure that Read access to metadata and Read and write access to code permissions are granted to
        allow you to pull and push changes to your repository.
     4. Under Repository access, specify the level of access you want to grant to Snowflake.
     5. Select Save.

     For more information, see [OAuth app access](https://docs.github.com/en/apps/oauth-apps/using-oauth-apps/authorizing-oauth-apps#oauth-app-access).
     After an authorized admin approves the app, all users in the account can use it.
   * Personal access token - Select the database and schema where the object containing your token is stored. To create a new secret,
     select + Secret and enter the required details. The API integration must be configured to allow access to this secret or to all secrets.
   * Public repository - Select this option if you are using a public repository that doesn’t require authentication. Note that it isn’t
     possible to commit and push any changes from your workspace to this public repository.
8. Select Create.

### Update author details and credentials for a branch

By default, your Snowflake email and username are used for committing changes to your Git repository. You can update these at any time.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the Changes tab.
4. Select the ellipsis and then select Edit credentials.
5. Specify an author name and email.
6. Select Update.

### Create a new branch

You can create a new branch from your current branch to work on changes independently.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the Changes tab.
4. Select the repository dropdown.
5. Select + New.
6. Specify a new branch name, and then select Create.

### Switch to a different branch

If you have saved but uncommitted changes, you’ll need to choose how to handle them before switching branches.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Git workspace view, select Changes.
4. From the branch menu, select the branch you want to switch to.

   > **Tip:**
   >
   > To filter the list, start typing a branch name.

### Fetch remote branches

If a new branch was created outside of Snowsight (for example, one created in your Git provider), you can fetch it into your Git-synced
workspace using the Fetch All option. This updates your list of available remote branches.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Git workspace view, select Changes.
4. Select the down arrow next to the Pull menu, and then select Fetch All.
   When the fetch finishes, newly created remote branches appear in the branch list and are available to check out.

### View updated files

To view all the files that were added, deleted, or modified since your last successful commit and push, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. At the top of the folder view, select Changes.
   Modified files are indicated with an M, added files are indicated with an A, and deleted files are indicated with a D.
4. To view a visual diff of the changes in the editor, select a file.

### Commit and push updates

After reviewing your changes, you can commit and push them to your remote Git repository from within the workspace.

To commit and push your updated files to the remote Git repository, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select Changes at the top of the folder view.
4. Write a commit message in the Commit message field.
5. Select Push.
6. Write a commit message and select Push to push your updates to the Git repository.

   > **Note:**
   >
   > If conflicts are detected, you are prompted to pull first. Select Pull to review a list of files with conflicts.

### View and resolve conflicts

If a conflict occurs during a push, you can view and resolve it directly in the workspace before committing again.

1. In Workspaces, at the top of the folder view, select Changes.
   If one or more files have a conflict, a message is displayed at the top of the view. Files with a conflict are indicated with a red M.
2. To view a visual diff of the conflict in the editor, select a file.
   Under File with conflicts, differences are highlighted inline.
3. Accept the current change, an incoming change, or both changes.
   The result of the merge is shown.
4. Under Diff View you can view the current and remote versions side by side.
5. Select Accept all current or Accept all remote.
6. After you resolve the conflicts, select Push.
7. Write a commit message.
8. Select Push.

---
title: Limitations with Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-limitations.md
section: Snowsight UI
---

# Limitations with Legacy Snowflake Notebooks

This topic describes unsupported features and limitations of Snowflake Notebooks.

* Only one executable `ipynb` file is permitted within each notebook.
* Streamlit components and widgets, such as slider values, do not retain their state if you refresh the browser window, open the notebook
  in a new tab, or close and reopen the current tab.
* For datasets over 1,000 points, Plotly defaults to `webgl` rendering, which is not recommended for security reasons. Snowflake recommends
  that you set the render mode to SVG, however it can cause some performance degradation.
* When you create a notebook from a repository, only the selected notebook is executable. Any other notebooks in the
  repository can be selected and edited, but they are not executable.
* Notebooks cannot be created or executed by [SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md).
* Renaming a notebook or moving it to a different database/schema will invalidate the notebook URL.
* Snowflake Notebooks are hosted in a third-party domain to provide increased security. In Safari, you must enable third-party cookies to
  allow reconnection to a running notebook after losing a connection. To enable this setting, in Safari select
  Settings » Privacy, and then clear the Prevent cross-site tracking checkbox.

---
title: Make database objects discoverable in Universal Search
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/object-visibility-universal-search.md
section: Snowsight UI
---

# Make database objects discoverable in Universal Search

Universal Search helps you discover database objects in the account. By default, you can only discover objects to which you have already been
granted access. Even if you have access to multiple accounts within your Snowflake organization, you aren’t able to see objects outside
of the account you’re signed into because access grants do not cross accounts.

Administrators can enable users to discover objects to which they don’t yet have access, including objects in other accounts within your Snowflake
organization, by managing object visibility.

> **Note:**
>
> You can [associate objects with contact information](../contacts-using.md) so that if a user performs a search and doesn’t have
> the privilege to access an object, they can select Request Access to see contact information.

## OBJECT_VISIBILITY property

The OBJECT_VISIBILITY property controls the discoverability of objects in the account, enabling users without explicit access privileges to
find objects and request access. Expanding visibility of objects in the account can simplify collaboration and streamline access requests.

OBJECT_VISIBILITY can be set on an account, database, or schema and follows Snowflake’s inheritance model: settings at a higher level (for
example, accounts) automatically apply to lower levels (for example, databases) unless overridden.

You can set OBJECT_VISIBILITY to one of the following values:

* A YAML specification describing the visibility in one of the following formats:

  ```sqlexample-yaml
  $$
  organization_targets:
    - all_accounts_including_external
  $$
  ```

  Or

  ```sqlexample-yaml
  $$
  organization_targets:
    - account: <account_name_1>
    - account: <account_name_2>
    - ...
    - organization_user_group: <org_user_group_1>
    - organization_user_group: <org_user_group_2>
  $$
  ```

  In the syntax above:

  + `all_accounts_including_external`: Specifies that all users in all accounts in the organization can see the object. This includes
    all accounts within the organization, even those to which external parties may have been given access, such as
    [reader accounts](../data-sharing-reader-create.md).
  + `account: account_name`: Specifies that all users in the specified account can see the object. You can specify multiple accounts.
    Note that `account` is the account name, not the account locator. You must specify only the account name, excluding the organization name.09-22
  + `organization_user_group: org_user_group`: Specifies that the specified [organization user group](../organization-users.md) can
    see the object in all accounts in the organization where the [organization user group has been imported](../organization-users.md).
* `PRIVILEGED`: Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object.
  This is the default behavior in Snowflake.

You can revert an object to PRIVILEGED visibility at any time.

For specific syntax, usage notes, and examples, see the following topics:

### CREATE commands

* [CREATE DATABASE](../../sql-reference/sql/create-database.md)
* [CREATE SCHEMA](../../sql-reference/sql/create-schema.md)

### ALTER commands

* [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md)
* [ALTER DATABASE](../../sql-reference/sql/alter-database.md)
* [ALTER SCHEMA](../../sql-reference/sql/alter-schema.md)

## Access control requirements

Roles using this property must have the following privileges at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE VISIBILITY | Account | Only the SECURITYADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| OWNERSHIP | Database or schema | Required to execute an [ALTER DATABASE](../../sql-reference/sql/alter-database.md) or [ALTER SCHEMA](../../sql-reference/sql/alter-schema.md) statement to set object visibility. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../security-access-control-overview.md), see [Overview of Access Control](../security-access-control-overview.md).

## Examples

### Making a database broadly visible

The following statement makes the `product_analytics` database visible to all users in the current account (ACME_ENGINEERING):

```sqlexample-yaml
ALTER DATABASE product_analytics
SET OBJECT_VISIBILITY =
$$
organization_targets:
  - account: acme_engineering
$$;
```

The following statement makes the database visible to all users in two additional accounts within the organization (ACME_MARKETING and ACME_SALES):

```sqlexample-yaml
ALTER DATABASE product_analytics
SET OBJECT_VISIBILITY =
$$
organization_targets:
  - account: acme_engineering
  - account: acme_marketing
  - account: acme_sales
$$;
```

The following statement makes the database visible to all users in all accounts within the ACME organization:

```sqlexample-yaml
ALTER DATABASE product_analytics
SET OBJECT_VISIBILITY =
$$
organization_targets:
  - all_accounts_including_external
$$;
```

### Making a database visible to specific organization user groups

The following statement makes the database visible to specific organization user groups in all accounts within the ACME organization where the
[organization user group has been imported](../organization-users.md):

```sqlexample-yaml
ALTER DATABASE product_analytics
SET OBJECT_VISIBILITY =
$$
organization_targets:
  - organization_user_group: engineering
  - organization_user_group: marketing
  - organization_user_group: sales
$$;
```

## Limitations

* Objects that are discoverable and not accessible are only displayed in [Universal Search](../ui-snowsight-universal-search.md).
  They are not visible in the [database object explorer](../ui-snowsight-data.md) or SQL commands that show metadata (SHOW commands, etc.).
* For a schema, you can set the OBJECT_VISIBILITY property to PRIVILEGED to override any broader visibility settings that may be inherited
  from the account or database level, ensuring the schema remains accessible only by the owner.
* The OBJECT_VISIBILITY property cannot be set or overridden below the schema level. At the schema level, users can either see all objects or none.
* Search can take a few hours to reflect changes to object visibility.

---
title: Managing packages and runtime
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-packages-runtime.md
section: Snowsight UI
---

# Managing packages and runtime

Snowflake Notebooks run inside a pre-built container environment optimized for scalable AI/ML development powered by Snowflake Container Runtime.

## Python versions

Snowflake Notebooks support Python versions from 3.10 to 3.12. When creating a notebook service, select the Python version that best fits your workload requirements.

## Pre-installed Snowflake Container Runtime packages

Snowflake Container Runtime version 2.2 includes approximately 100 packages and libraries that support a wide range of ML development tasks inside Snowflake.

The following sections list a curated subset of pre-installed packages (40 entries per environment) available for each Python version of Snowflake Container Runtime version `2.2`.

> **Note:**
>
> To view the full list of pre-installed packages for your current notebook environment, run `pip freeze` in a Python cell or in the notebook terminal.

### CPU version 2.2

The following packages are available for each Python version of CPU version `2.2`:

Python 3.10Python 3.11Python 3.12

CPU Container Runtime Python 3.10 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| async-timeout | 5.0.1 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.5.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |

CPU Container Runtime Python 3.11 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better_optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |

CPU Container Runtime Python 3.12 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better_optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |

### GPU version 2.2

The following packages are available for each Python version of GPU version `2.2`:

Python 3.10Python 3.11Python 3.12

GPU Container Runtime Python 3.10 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| airportsdata | 20250909 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| astor | 0.8.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| async-timeout | 5.0.1 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| blake3 | 1.0.8 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.5.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |

GPU Container Runtime Python 3.11 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| airportsdata | 20250909 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| astor | 0.8.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better_optimize | 0.2.0 |
| blake3 | 1.0.8 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |

GPU Container Runtime Python 3.12 version `2.2` includes the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.3.1 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| airportsdata | 20250909 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.1 |
| asn1crypto | 1.5.1 |
| astor | 0.8.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.17.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better_optimize | 0.2.0 |
| blake3 | 1.0.8 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| CausalPy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |

## Installing additional packages

Snowflake supports package installation from several sources.

### From external repositories

After configuring External Access Integrations (EAIs) for secure repository access, you can install packages directly from external sources such as
PyPI. Users have access to a comprehensive ecosystem of packages beyond the pre-installed runtime, ensuring secure connectivity to external repos.

You can run `pip install` in a Python cell or in the notebook terminal.

For more information, see [Set up external access for Snowflake Notebooks](../notebooks-external-access.md).

### From `requirements.txt`

You can specify and install required package versions in a `requirements.txt` file to ensure a consistent environment setup. Install them
using the following command:

```bash
!pip install -r requirements.txt
```

> **Note:**
>
> If the package version specified in `requirements.txt` conflicts with supported versions of the
> [pre-installed packages](../../../developer-guide/snowflake-ml/container-runtime-ml.md), the Python environment may break. Validate compatibility before
> installing.

### From Workspace files

You can download or build `.whl` or `.py` files, upload them to your workspace, and install or import them.

* **Wheel files (.whl):** Upload the `.whl` file and install it:

  ```bash
  !pip install file_name.whl
  ```

  If the package contains dependencies that are not already installed, upload the complete dependency tree (either directly into Workspaces or to a
  stage). Alternatively, attach an EAI that allows access to a repository where the package can be downloaded (for example, PyPI).
* **Python files (.py):** Modules stored in your workspace can be imported directly for sharing utilities and functions across notebooks.
  For example:

  ```python
  from my_utils import my_func
  ```

### From a Snowflake stage

Stages provide secure and governed package deployment by leveraging existing Snowflake data storage and governance controls for package files. Use
the Snowpark session to retrieve package files from a Snowflake stage into the container environment for import and use. For example:

```python
from snowflake.snowpark.context import get_active_session
import sys

session = get_active_session()
session.file.get("@db.schema.stage_name/math_tools.py", "/tmp")

sys.path.append("/tmp")
import math_tools

math_tools.add_one(3)
```

## Runtime management

### Runtime pinning

All notebook services are pinned to the Runtime selected at creation unless you explicitly change it by editing the service. For example, a notebook
service created on `Runtime 2.0` will not be automatically upgraded when new Runtime versions are released.

### Runtime vulnerability scanning

Snowflake scans the Runtime images daily for security vulnerabilities. High or critical Common Vulnerabilities and Exposures (CVEs) are addressed by
releasing new Runtime versions within 30 days of detection.

Existing notebook services can continue using Runtimes with detected CVEs. However, Runtimes with known CVEs cannot be selected when creating new
notebook services.

---
title: Migrating legacy notebooks to Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-migrate.md
section: Snowsight UI
---

# Migrating legacy notebooks to Workspaces

This topic describes how to move your legacy Snowflake Notebooks and dependent files to the Workspaces environment.

## Migration steps

1. In the navigation menu, select Projects » Notebooks to open your legacy notebook.
2. Navigate to the Files section to view your `.ipynb` notebooks and any dependent files.
3. Download all necessary files to your local machine.
4. In the navigation menu, select Projects » Workspaces.
5. Select a workspace.
6. Open an existing workspace or create a new one.

   Choose a private workspace for individual use or a shared workspace if the notebooks need to be accessed by multiple users. For more information,
   see [Workspaces](../workspaces.md).
7. Select + Add new.
8. Upload your downloaded files into the workspace.

## Key differences between legacy and new notebooks

> **Note:**
>
> Not all legacy notebook files will run successfully and may require updates to align with the new environment. The table below outlines the
> updates available in Notebooks in Workspaces.

| Area | Legacy notebooks | New notebooks |
| --- | --- | --- |
| Compute | Users must choose between Warehouse and Container Runtime. | Simplified user experience with Container Runtime only.   * Fully managed CPU/GPU infrastructure. * More efficient compute utilization (multiple notebooks can connect to the same service/node). * SQL and Snowpark code is still pushed down to a warehouse for flexibility and cost-performance. |
| File system / IDE environment | Partially supported. | Full IDE environment with:   * File explorer with subfolder support. * Split panes. * Terminal, etc. * Git-synced Workspaces allow users to push/pull, view diffs, and switch branches. * Shared Workspaces support team collaboration with version history and simple publish flows. |
| Package management | * Packages installed through the Anaconda channel. * EAIs need to be configured manually for each notebook. * Package installation from stages supported. | More flexible package management options:   * Direct upload to Workspaces or import from files in stage/Git repositories. * Easier setup for EAIs for installing from external sources. * Anaconda channel is no longer supported. |
| Support for Streamlit | Supported. | Not supported.  Use libraries such as `matplotlib`, `seaborn`, `plotly`, and `altair` for visualization. |
| Jupyter compatibility | Some Jupyter magics are supported. | Full support.  Use Jupyter magics such as `%run`, `%time`, and `%autoreload`. |

If you have questions about availability timelines for specific features, ask your account representative to contact the Notebooks product team.

## Technical requirements and compatibility

Review the following constraints before running your notebooks in the new environment:

* **Python and Runtime:** Workspaces support Python 3.10 to 3.12 and Container Runtime 2.2.

  > **Note:**
  >
  > Python 3.9 and Container Runtime 2.0 are not supported in Workspaces.
* **Compute types:** Notebooks in Workspaces run on CPU or GPU compute types.
* **Visualizations:** Streamlit is not supported. For data visualization, use Matplotlib, Seaborn, Plotly, or Altair.

## Managing dependencies

Workspaces do not have integration support with the Snowflake Anaconda package repository. If your project requires packages not included in
the [pre-installed packages](../../../developer-guide/snowflake-ml/container-runtime-ml.md), you can install them using the following methods:

* **Interactive workflow:** Use `pip install` within the notebook. For more information,
  see [Managing packages and runtime](notebooks-in-workspaces-packages-runtime.md).
* **Automated setup:** Define your dependencies in a `requirements.txt` file. For detailed instructions, see
  [Managing packages and runtime](notebooks-in-workspaces-packages-runtime.md). For scheduled notebooks, specify the file using
  the `REQUIREMENTS_FILE` parameter in [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md).

## Scheduled tasks

If you have tasks scheduled on your legacy notebooks, they will continue to run with legacy notebooks and are not impacted.

If you want existing tasks to use new notebooks, update your tasks to reference the new Notebook Project Object (NPO). For more information,
see [Run and schedule Notebooks in Workspaces](notebooks-in-workspaces-schedule.md).

---
title: Notebook replication
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-replication.md
section: Snowsight UI
---

# Notebook replication

Replication supports business continuity in case of disasters, outages, or unavailability by making notebooks and other important objects available across accounts. A replication group, configured by an administrator, replicates account objects and databases from a primary account to one or more secondary accounts on a defined schedule.

Notebooks are replicated when they are part of a database included in a replication or failover group. In the secondary account, replicated content is read-only; notebooks are executable but cannot be edited.

Database replication can be configured as a failover group to support high availability. When a secondary failover group is promoted to primary, all contained objects, including notebooks, become writable in the new primary account.

For more information, see [Introduction to replication and failover across multiple accounts](../account-replication-intro.md).

## Enable replication

A user with the ORGADMIN role must enable replication for each source and target account in the organization:

```sqlexample
USE ROLE ORGADMIN;
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER(
    '<organization_name>.<account_name>',
    'ENABLE_ACCOUNT_DATABASE_REPLICATION',
    'true');
```

For more information, see [Prerequisite: Enable replication for accounts in the organization](../account-replication-config.md).

## Create a replication group in the primary account

To replicate a notebook, specify the database that contains the notebook in the replication group:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE REPLICATION GROUP myrg
    OBJECT_TYPES = DATABASES
    ALLOWED_DATABASES = db1
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

In this example:

* `ALLOWED_DATABASES` - the name of the database that contains the notebook.
* `ALLOWED_ACCOUNTS` - the secondary account to replicate to.
* `REPLICATION_SCHEDULE` - how frequently replication occurs (for example, ‘10 MINUTE’ or ‘1 HOUR’).

### Replicate a warehouse

To run a replicated notebook as intended in the secondary account, any associated objects such as warehouses, EAIs, and tasks must be replicated or recreated separately.

To replicate a warehouse, include the warehouse in the OBJECT_TYPES parameter in the replication/failover group.

```sqlexample
-- Create a new warehouse if required
CREATE WAREHOUSE IF NOT EXISTS mywarehouse
  WAREHOUSE_SIZE = 'X-SMALL'
  AUTO_SUSPEND = 60
  AUTO_RESUME = TRUE
  COMMENT = 'Warehouse for Snowflake Notebooks';

-- Set up warehouse replication
CREATE REPLICATION GROUP mywarehouserg
  OBJECT_TYPES = WAREHOUSES
  ALLOWED_ACCOUNTS = myorg.myaccount2
  REPLICATION_SCHEDULE = '10 MINUTE';
```

For more information on syntax and options, see [CREATE REPLICATION GROUP](../../sql-reference/sql/create-replication-group.md).

## Secondary account behavior

In a secondary account, you can create new notebooks only in non-replicated databases. These notebooks are not included in the replication group and are fully read-write.

Replicated notebooks are read-only. However, users can change associated compute resources and external access integrations (EAIs). These resources must be created or replicated separately. If they are not available, the notebook will not have those resources attached.

Create a replication group in the target account as a replica of the replication group `myrg` in the source account:

```sqlexample
CREATE REPLICATION GROUP myrg
    AS REPLICA OF myorg.myaccount1.myrg;
```

You can also create a replication group for warehouses if necessary. Note that all warehouses in the account will be replicated:

```sqlexample
CREATE REPLICATION GROUP mywarehouserg
    AS REPLICA OF myorg.myaccount1.mywarehouserg;
```

The replication group can also be [refreshed manually](../../sql-reference/sql/alter-replication-group.md) by running the following command:

```sqlexample
ALTER REPLICATION GROUP myrg REFRESH;
```

## Create a failover group

To allow promotion of the secondary account to primary during an outage, use a failover group:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE FAILOVER GROUP myfg
  OBJECT_TYPES = DATABASES
  ALLOWED_DATABASES = db1
  ALLOWED_ACCOUNTS = myorg.myaccount2
  REPLICATION_SCHEDULE = '10 MINUTE';
```

In this example, `ALLOWED_DATABASES` is the database to be created in the failover group. The replicated notebook in the failover group is read-only, but still executable. If you [promote the failover group to primary](../account-replication-failover-failback.md), the notebook becomes read-write.

## Considerations

* Scheduled notebooks in a secondary account are paused until failover. After failover, scheduling resumes.
* For replication and task behavior, see [Replication considerations](../account-replication-considerations.md).
* Notebook results are only stored in the account where the notebook was run. Notebook results are not replicated.

## Limitations

* Git integration is not currently supported after failover. For notebooks in a promoted secondary account to be able to reconnect to Git, you must reconfigure Git.

### Container Runtime notebooks

Notebooks that use Container Runtime are not fully replicated. Specifically, compute pools are not replicated and must be created manually in the secondary account.

To run a Container Runtime notebook in the secondary account:

1. Identify the compute pool used in the source account.
2. Create a compute pool with the same name and configuration in the secondary account:
   For example, if a replicated notebook references a compute pool named `compute_pool`, create that compute pool in the secondary account:

```sqlexample
-- In the secondary account, create a new compute pool with a matching name and configuration

CREATE COMPUTE POOL compute_pool
  MIN_NODES = 1
  MAX_NODES = 10
  INSTANCE_FAMILY = CPU_X64_XS;
```

Once created, the replicated notebook can use the compute pool to run in the secondary account.

---
title: Notebook usage and cost monitoring
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-usage.md
section: Snowsight UI
---

# Notebook usage and cost monitoring

A notebook consumes compute resources through its configured [virtual warehouses](../warehouses-overview.md)
or [compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). To manage costs and ensure efficient operations,
it’s important to monitor usage across individual notebooks, users, and the underlying compute infrastructure. This visibility helps ensure
efficient operations and supports cost accountability throughout your environment.

Snowflake provides access to detailed usage data through [ACCOUNT_USAGE](../../sql-reference/account-usage.md) views and system tables. This
data can help answer questions such as:

* What is the hourly credit consumption per notebook?
* How frequently were notebooks run in the past week?
* Which users ran notebooks in the past month?
* Which compute pools or warehouses did notebooks use over the past week?
* What is the total credit cost of notebooks using a specific compute resource?

For a broader overview of compute-related cost management, see [Exploring compute cost](../cost-exploring-compute.md).

## Example query

You can query Snowflake’s [ACCOUNT_USAGE](../../sql-reference/account-usage.md) views to gain insight into the credit consumption for a notebook.
These views break down cost by notebook, user, or compute pool level at a daily or hourly basis.

### Usage

In the following example, each row represents a single notebook execution and includes details such as the execution timestamp, the user who ran the notebook, and the runtime
environment (Warehouse or Container Runtime).

```sqlexample
-- Warehouse Runtime
SELECT query_text, t1.user_name, credits_attributed_compute as total_warehouse_credits
FROM snowflake.account_usage.query_history t1
INNER JOIN snowflake.account_usage.query_attribution_history t2
ON t1.query_id = t2.query_id

-- Add your notebook name
AND t1.query_text ILIKE 'execute notebook% <example_nb_name>'
;

-- Container Runtime
SELECT
  start_time, notebook_name, user_name, SUM(credits) AS total_container_runtime_credits
FROM snowflake.account_usage.notebooks_container_runtime_history
WHERE notebook_name = '<example_nb_name>'
GROUP BY ALL;
```

## Cost monitoring on Container Runtime

The following queries help you monitor the credit consumption of notebooks in your account. Use these queries to analyze notebook usage patterns,
estimate costs, and understand how individual notebooks contribute to compute pool expenses.

Query: Hourly credit consumption by notebook
:   This query retrieves runtime history for a specific notebook, including credit usage and execution timestamps. Use this data to understand how
    often and how long a notebook runs, and to identify patterns or spikes in credit consumption by hour.

    ```sqlexample
    SELECT * FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>';
    ```

Query: Cost to run a specific notebook
:   This query shows the total credits consumed by a specific notebook. Use this to estimate a notebook’s cost and identify high-cost notebooks.

    ```sqlexample
    SELECT
      notebook_name,
      SUM(credits) AS total_credits
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>'
    GROUP BY notebook_name;
    ```

Query: Total compute pool cost per notebook
:   This query shows the total credits consumed by each notebook running on a specific compute pool. Use this to break down compute usage by
    notebook, which can help identify which notebooks contribute most to the compute pool’s overall cost.

    ```sqlexample
    SELECT
      notebook_name,
      SUM(credits) AS total_credits
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE compute_pool_name = '<example_cp_name>'
    GROUP BY notebook_name;
    ```

Query: Identify users who ran a specific notebook
:   This query returns a list of users who have executed a specific notebook. Use this to understand usage patterns, or identify collaborators
    and consumers of shared notebooks.

    ```sqlexample
    SELECT
      DISTINCT user_name
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>';
    ```

### Additional notes

Costs for querying are associated with the underlying warehouse. For information on how warehouses work, see [Virtual warehouse credit usage](../cost-understanding-compute.md).

---
title: Notebooks in Workspaces limitations
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-limitations.md
section: Snowsight UI
---

# Notebooks in Workspaces limitations

## Notebook services and runtime

* Notebook services are subject to an account limit of 200 active services.
* Notebooks in different workspaces cannot share a service.
* Notebooks in the same workspace connect to a shared service by default.
* Users may create multiple services within a workspace and assign notebooks to different services as needed.
* Notebook services may be restarted over the weekend for container service maintenance. After a restart, you must rerun notebooks and reinstall any
  packages to restore variables and packages. For more information, see [Service maintenance](notebooks-in-workspaces-compute-setup.md).
* Package installation and listing behavior differs between `uv` and standard `pip`. Snowflake supports installing packages using
  `uv pip install`, and `uv pip freeze` lists only packages installed using `uv pip install`. `pip freeze` lists all packages available in the environment, including packages in the base image, packages installed with standard pip install, and packages installed with `uv pip install`.
* Installing packages from external stages is not supported.

## Using notebooks in Workspaces

* Queries in SQL cells do not appear in the Query History pane until you shut down the kernel:

  1. Select Connected.
  2. Select Shut down kernel.
  3. Suspend the notebook service.
* Renaming notebook files, folders, or the workspace can cause unexpected behavior, including service disconnection, clearing the notebook’s output
  cache, or delays in updating referenced files.
* If you are disconnected, try reconnecting the notebook. If you renamed the workspace, create and use a new service.
* If account session policies block the use of secondary roles, notebooks cannot run in shared workspaces.
* Cell-by-cell rendering is not currently supported when viewing differences in Git-integrated workspaces or when viewing publish history in
  shared workspaces. The entire notebook file is displayed as a unified diff.

## Editing and running notebooks

* Updates to Python files (`.py`) imported by a notebook are not automatically detected by the active notebook service. To apply changes,
  restart the notebook kernel or use the `%autoreload` magic command before your initial import so that file updates are detected automatically.
* Each cell has an output limit of 1 MB.
* Output of previous notebook executions is cached in an internal storage system, which is not yet
  [Tri-Secret Secure](../../security-encryption-tss.md). Access to this cache is encrypted at rest and results in the cache are guarded
  by governance rules.
* iPywidgets are not yet supported.
* Embedding remote images via URLs is not yet supported. To embed an image, upload it to your workspace and display it in a Markdown or
  Python cell. Example:

  ```md

  ```

  ```python
  from IPython.display import Image, display
  display(Image(filename="path/to/example_image.png"))
  ```
* SQL cells cannot run [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md) (non-interactive execution). To chain notebooks,
  use Jupyter magic commands, such as `%run`, which executes another notebook in the same Python process. For more information, see
  [Jupyter magics](notebooks-in-workspaces-edit-run.md).
* If the execution context (database and schema) or the query warehouse is not set when you run notebooks in Workspaces, the interactive datagrid for
  displaying table results in code cells and cell referencing may not function properly. For information about setting the execution context, see
  [Set the execution context](notebooks-in-workspaces-edit-run.md).
* The following values are not supported as column names:

  + CURRENT_DATE
  + CURRENT_TIME
  + CURRENT_TIMESTAMP
  + LOCALTIME
  + LOCALTIMESTAMP
  + CURRENT_USER
  + SESSION_USER
  + SYSTEM_USER

## Migrating from legacy notebooks

For information about migrating legacy notebooks to Workspaces, see [Migrating legacy notebooks to Workspaces](notebooks-in-workspaces-migrate.md).

---
title: Observability and logging for Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-observability-logging.md
section: Snowsight UI
---

# Observability and logging for Notebooks in Workspaces

## Overview

Snowflake writes notebook logs to the container’s local file system and ingests them into an [event table](../../../developer-guide/logging-tracing/event-table-setting-up.md), which you can query to troubleshoot notebook runs, review execution history, and perform long-term analysis.

You can use an event table to centralize operational data for Notebooks in Workspaces; for example, with the following tasks:

* Troubleshooting scheduled runs (errors, warnings, timestamps)
* Auditing who ran what and when (when emitted by the workload and configured for collection)
* Creating dashboards for notebook activity (success/failure counts, run duration, noisy errors)

> **Note:**
>
> There is typically a delay of three to five minutes before logs appear in the event table.

## Enable logging in your notebook code

By default, Python logging is set to `WARNING`. To capture application events, you must set the logging level to `INFO` or
`DEBUG`.

* Add the following code to your Python notebook or script:

```python
import logging

# Set the root logger to INFO level
logging.getLogger().setLevel(logging.INFO)

# Generate a test log entry
logging.info("APPLICATION_EVENT: Service initialization complete.")
```

## Query logs using Snowflake Trail

You can view log entries in Snowsight through Snowflake Trail.

> **Note:**
>
> Before you can view log messages, you must [enable telemetry data collection](../../../developer-guide/logging-tracing/logging-tracing-enabling.md).

### Identify your event table

* To find the event table for your account, run the following command in a SQL file:

```sqlexample
SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
```

### Query and analyze logs

After your event table has started collecting events, you can query it like any other table to filter by time range, severity, and workload identifiers.
For more information on event table schema and column definitions, see [Event table columns](../../../developer-guide/logging-tracing/event-table-columns.md).

* To investigate recent log events, run the following code (replacing the placeholder values with your actual values):

  ```sqlexample
  SELECT
      TIMESTAMP,
      VALUE AS LOG_MESSAGE,
      RESOURCE_ATTRIBUTES:"snow.service.name"::string AS SERVICE_NAME,
      RECORD:"severity_text"::string AS SEVERITY
  FROM <database_name>.<schema_name>.<event_table_name>
  WHERE RECORD_TYPE = 'LOG'
    AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<your_service_name>'
    AND TIMESTAMP > DATEADD(hour, -1, CURRENT_TIMESTAMP())
  ORDER BY TIMESTAMP DESC
  LIMIT 100;
  ```

## View logs for scheduled notebook runs in Snowsight

Each scheduled notebook uses a notebook project object that stores deployed code, execution history, and artifacts.

To view logs for scheduled runs in Snowsight:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Search for the database and schema containing the notebook project object.
4. Select the notebook project object, and then select the Run history tab.
5. For the run you want to inspect, in the Logs column, select Logs .

After you enable logging in your notebook code, your custom log messages and infrastructure initialization logs appear in this log view.

## Troubleshooting

* If you don’t see expected events, verify that your event table is created and that event logging is enabled and configured for your account and
  workloads.
* If scheduled runs fail, cross-check [notebook scheduling](notebooks-in-workspaces-schedule.md)
  and look for correlated errors in the event table during the same time window.

---
title: Private connectivity for Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-privatelink.md
section: Snowsight UI
---

# Private connectivity for Notebooks

This topic describes using AWS PrivateLink, Azure Private Link, or Google Private Service Connect when accessing Snowflake Notebooks. This feature
is available in both [Warehouse and Container Runtimes](notebooks.md) for AWS and Azure, and in Warehouse Runtime for Google.

## AWS PrivateLink prerequisites

To access Snowflake Notebooks with AWS PrivateLink:

1. Set up private connectivity for your [Snowflake account](../admin-security-privatelink.md).
2. Set up private connectivity for [Snowsight](../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over AWS PrivateLink. Notebooks uses the Streamlit engine and widgets to execute and render
notebook cell outputs.

## Azure Private Link prerequisites

To access Snowflake Notebooks with Azure Private Link:

1. Set up private connectivity for your [Snowflake account](../privatelink-azure.md).
2. Set up private connectivity for [Snowsight](../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over Azure Private Link. Notebooks relies on the Streamlit engine for execution and uses
Streamlit widgets to render cell outputs.

## Google Cloud Private Service Connect prerequisites

To access Snowflake Notebooks with Google Private Service Connect:

1. Set up private connectivity for your [Snowflake account](../private-service-connect-google.md).
2. Set up private connectivity for [Snowsight](../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over Google Private Service Connect. Notebooks relies on the Streamlit engine for execution
and uses Streamlit widgets to render cell outputs.

## Configure access to Snowflake Notebooks

To determine the hostname:

* Call [SYSTEM$GET_PRIVATELINK_CONFIG](../../sql-reference/functions/system_get_privatelink_config.md) in your Snowflake account. Use the value returned for the `app-service-privatelink-url` key.
  This URL is used to route traffic to Snowflake-hosted app services, including Snowflake Notebooks, over AWS PrivateLink, Azure Private Link, or Google Private Service Connect.

> **Note:**
>
> You can set up a new VPC endpoint for Notebooks or create a DNS record to the same VPC endpoint of your Snowflake account, as shown in the following example:
>
> * Record name: `*.abcd.privatelink.snowflake.app`
> * Type: CNAME
> * Route traffic to: same VPC as your Snowflake traffic.

Hostname routing at an account level is currently not supported.

## Security considerations

Notebooks serve both HTTPS-encrypted traffic and WebSocket-encrypted traffic. The Notebooks browser client application is mounted in a third-party, cross-origin
iframe within Snowsight. This enables strict cross-site browser isolation control.

Snowflake Notebooks use a separate URL scheme for specific security requirements. Notebook URLs have their own top-level domain that does not share any elements
with Snowsight. Each notebook has a unique origin.

> **Note:**
>
> When using AWS PrivateLink, Azure Private Link, or Google Private Service Connect, you control the DNS resolution; no private connectivity
> DNS records are controlled by Snowflake.

---
title: Private connectivity for Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-privatelink.md
section: Snowsight UI
---

# Private connectivity for Notebooks in Workspaces

This topic describes how to use AWS PrivateLink, Azure Private Link, or Google Private Service Connect when accessing Notebooks in Workspaces.

## AWS PrivateLink prerequisites

To access Notebooks in Workspaces with AWS PrivateLink:

1. Set up private connectivity for your [Snowflake account](../../admin-security-privatelink.md).
2. Set up private connectivity for [Snowsight](../../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over AWS PrivateLink.

## Azure Private Link prerequisites

To access Notebooks in Workspaces with Azure Private Link:

1. Set up private connectivity for your [Snowflake account](../../privatelink-azure.md).
2. Set up private connectivity for [Snowsight](../../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over Azure Private Link.

## Google Cloud Private Service Connect prerequisites

To access Notebooks in Workspaces with Google Private Service Connect:

1. Set up private connectivity for your [Snowflake account](../../private-service-connect-google.md).
2. Set up private connectivity for [Snowsight](../../ui-snowsight-gs.md).

In addition, your account must already use Streamlit in Snowflake over Google Private Service Connect.

## Configure access to Notebooks in Workspaces

To configure private connectivity for Notebooks in Workspaces, follow the steps for
[configuring private connectivity for Snowsight](../../ui-snowsight-gs.md).

## Security considerations

Notebooks serve both HTTPS-encrypted traffic and WebSocket-encrypted traffic. The Notebooks browser client application is contained in a third-party, cross-origin
iframe within Snowsight. This enables strict cross-site browser isolation control.

Notebooks in Workspaces use a separate URL scheme for specific security requirements. Notebook URLs have their own top-level domain that does not share any elements
with Snowsight. Each notebook has a unique origin.

> **Note:**
>
> When using AWS PrivateLink, Azure Private Link, or Google Private Service Connect, you control the DNS resolution; Snowflake does not control private connectivity DNS records.

---
title: Run and schedule Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-schedule.md
section: Snowsight UI
---

# Run and schedule Notebooks in Workspaces

## Scheduling notebooks in Workspaces

When deploying notebooks to production, Snowflake provides native functionality to manage deployment, orchestration, and monitoring. You develop
and iterate on notebooks interactively in Workspaces within Snowsight. Scheduling a notebook deploys its contents into a production
object called a Notebook Project Object (NPO), which encapsulates the workspace contents (for example, `.ipynb` files, Python scripts,
and SQL files). NPOs support versioned deployments and are schema-level objects (for example, `db_name.schema_name.npo_name`).

After deployment, you can orchestrate notebook execution using Snowflake Tasks (which run notebook code top-down using a consistent runtime
and dependency set) or with any third-party orchestration tool. Snowflake captures execution telemetry that you can monitor in Snowsight
or query programmatically through an event table. For more information, see [Observability and logging for Notebooks in Workspaces](notebooks-in-workspaces-observability-logging.md).

### Notebook Project Objects (NPOs)

An NPO is a schema-level object that acts as a production-ready “unit” in your pipeline. A Notebook project is linked to a workspace or a stage, and all
files from the workspace are copied over. NPOs are executed in a non-interactive way and can be embedded in a task for scheduling.

* **Placement:** NPOs exist within a specific schema inside a database (`database_name.schema_name.npo_name`).
* **Encapsulation:** When you schedule a notebook, the NPO captures the entire Workspace directory to ensure all dependencies are available during execution.
* **Execution:** You execute an NPO by specifying a main `.ipynb` file (for example, using the `MAIN_FILE` parameter). The main notebook can call additional notebooks using [%run](notebooks-in-workspaces-edit-run.md).
* **Scheduling:** You can create multiple task objects that execute the same NPO, allowing multiple schedules for the same notebook project object.

### Discovering NPOs

NPOs are standard database objects, so you can use metadata commands to audit or clean up scheduled tasks

| Scope | Command |
| --- | --- |
| Current context | SHOW NOTEBOOK PROJECTS; |
| Database level | SHOW NOTEBOOK PROJECTS IN DATABASE <database_name>; |
| Schema level | SHOW NOTEBOOK PROJECTS IN SCHEMA <database_name>.<schema_name>; |
| Account level | SHOW NOTEBOOK PROJECTS IN ACCOUNT; |

## Permissions and sharing for NPOs

To execute or manage an NPO, a role must have the following privileges:

* **Location:** USAGE or OWNERSHIP on the database and schema containing the NPO.
* **NPO access:** USAGE or OWNERSHIP on the specific NPO.
* **Compute:** USAGE and MONITOR on the warehouse, and USAGE on the compute pool (for Container Runtime).
* **Scheduling:** The account-level global EXECUTE TASK privilege is required if the NPO is triggered by a task.
* **External access integrations:** USAGE on any EAIs used by the notebook.
* **Tasks:** When the NPO is scheduled via a task, the task owner role must be granted the USAGE privilege on all required objects (such as NPOs,
  warehouses, or databases). The task owner role must also have privileges to execute the USE DATABASE and USE SCHEMA commands if
  the notebook sets its execution context programmatically.

> **Note:**
>
> NPOs use caller’s rights, where the caller is the user (not the role). When you run [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md) directly in
> Snowsight, the execution uses the calling user’s identity rather than the active role in the Snowsight session.
> The notebook runs in its own dedicated session (separate from the Snowsight session), with the user’s default role as the primary
> role and all secondary roles activated. This means the notebook can execute with all privileges granted to the user’s roles.

## Using an NPO to schedule a notebook

Currently there are two supported scenarios for deploying and scheduling notebooks. In both scenarios, notebooks must be packaged in the NPO.
[Scenario A](notebooks-in-workspaces-workflow-scenarios.md) is scheduling notebooks from a private workspace. [Scenario B](notebooks-in-workspaces-workflow-scenarios.md)
is integrating GitHub Actions (or another CI/CD system) to automate the creation of NPOs from an [internal or temporary stage](../../../sql-reference/sql/create-stage.md),
manage their lifecycle through versioned updates, and orchestrate their execution using Snowflake Tasks.

| Scenario | Workspace Type | Scheduling Method |
| --- | --- | --- |
| A: Individual Development | Private | Supported. Develop in your private workspace. Create Notebook Project Objects (NPO) and schedule tasks. |
| B: Production (CI/CD) | Git-integrated | Notebook files are deployed to an internal or temporary stage from GitHub using GitHub Actions (or other CI/CD tools) and an NPO is created/updated from that stage. The Task is executed on the NPO. |

For detailed workflows for each scenario, see [Scheduling workflows by scenario](notebooks-in-workspaces-workflow-scenarios.md).

## View scheduled notebook runs

You can view scheduled tasks in three places:

**From the notebook**

To view or interact with scheduled runs, you must use a role with access to the database and schema where the schedule and project object were created.

1. In the navigation menu, select Projects » Workspaces.
2. Open a scheduled notebook.
3. At the top of the notebook editor, select Scheduled runs . A popover displays the following information:

> * All scheduled runs for this notebook.
> * The next scheduled run time.
> * Status of past runs. Hover over a status indicator to see details such as Query ID, last run time, duration, and status.

**From the Actions menu**

* **Open Run History:** Opens the notebook’s project object showing all past runs, including status, duration, results, source file, logs, and metrics.
  Selecting a run’s result opens the executed notebook with its output. For more information, see [Observability and logging for Notebooks in Workspaces](notebooks-in-workspaces-observability-logging.md).

**From Database Explorer**

To view run history for any scheduled notebook (including those deployed via CI/CD):

1. In the navigation menu, select Catalog » Database Explorer.
2. Select the database and schema that contain the Notebook Project Object (NPO).
3. Select the NPO.
4. Select Run history.
5. Select a run to view the notebook output from that execution, along with logs and metrics (when available). For more information,
   see [Observability and logging for Notebooks in Workspaces](notebooks-in-workspaces-observability-logging.md).

> **Note:**
>
> To view run history for notebook runs triggered by Airflow, sign in to Snowsight using the same user that runs Airflow.

## Manage scheduled tasks

From the Scheduled runs popover, you can manage your scheduled tasks by selecting the ellipsis (more actions)  next to a scheduled task:

* **Run now:** Triggers an immediate execution of the scheduled task.
* **Pause schedule:** Temporarily stops the schedule from running automatically. The task remains configured but won’t execute until resumed.
* **Delete:** Removes the scheduled task permanently. You can create a new schedule with different settings (such as a different role or database
  location) after deleting the existing schedule.

## Deploy updates to scheduled notebook tasks

After editing a notebook, you must deploy your changes before scheduled runs use the updated version. Deployment ensures reproducibility and prevents
scheduled tasks from running code that differs from what was last deployed. If this is the notebook’s first task and a notebook has changes that
require deployment, the Schedule (calendar) icon displays a clock indicator. If a schedule already exists, the icon is a calendar with a clock.

After modifying code or cells, the icon indicates that there are undeployed changes.

* Select Deploy Changes.

  Snowflake then updates the associated notebook project object, and all scheduled tasks for that notebook will use the newly deployed version for the next run.

## Find a notebook project object (NPO) in the Object Explorer

Each scheduled notebook automatically creates an NPO that stores its deployed code, execution history, and artifacts. You
can locate these objects in the Object Explorer in Snowsight.

To locate an NPO in Snowsight, follow these steps:

1. In the navigation menu, select Catalog » Database Explorer.
2. Navigate to Database » Schema » Notebook Project Objects to view all NPOs in that schema.

Alternatively, you can:

1. Open the relevant notebook.
2. At the top of the notebook editor, select Scheduled runs .
3. Select Open run history to open the associated NPO.

## View the notebook’s run history

This section describes how to view execution details and troubleshoot notebook runs after a schedule has been created. If any step fails
during execution, Snowflake stops the run to prevent partial or inconsistent downstream results.

To view run history, follow these steps:

1. In the navigation menu, select Projects » Workspaces.
2. Open the notebook whose run history you want to review.
3. At the top of the notebook editor, select Scheduled runs .
4. Select View run history from the drop-down menu.

> Run History shows the following information for the notebook’s project object:
>
> * **Results:** View the notebook and output from past runs.
> * **Tasks:** See which tasks executed the NPO.
> * **Source file:** View the notebook file that was executed.
> * **Logs and metrics:** View execution logs and performance metrics (ensure you have enabled logging and event tables). For more information, see [Observability and logging for Notebooks in Workspaces](notebooks-in-workspaces-observability-logging.md).
> * **Run details:** Start and end times, run status, and error details.

## Schedule a notebook using Tasks

1. In the navigation menu, select Projects » Workspaces.
2. Run the following command in a SQL file/worksheet:

> ```sqlexample
> -- Execute a notebook project using a task
> CREATE OR REPLACE TASK <database_name>.<schema_name>.<name>
>   WAREHOUSE = <string>
>   SCHEDULE = 'USING CRON 10 13 * * * America/Los_Angeles'
>   -- CRON format: <minute> <hour> <day_of_month> <month> <day_of_week> <timezone>
> AS
>   -- Execute a notebook stored within a notebook project.
>   EXECUTE NOTEBOOK PROJECT "<database_name>"."<schema_name>"."<project_name>"
>     MAIN_FILE = 'notebook.ipynb'  -- Path to the notebook file
>     COMPUTE_POOL = '<compute_pool_name>'
>     RUNTIME = '<runtime_version>'  -- e.g. V2.2-CPU-PY3.11
>     QUERY_WAREHOUSE = '<wh_name>'
>     ARGUMENTS = '<string>'  -- Can pass a single string parsed in the notebook code
>     REQUIREMENTS_FILE = '<path/to/requirements.txt>'  -- Pre-installs dependencies before the notebook runs
>     EXTERNAL_ACCESS_INTEGRATIONS = ('integration_name');  -- e.g. ('http_eai', 's3_eai')
> ```

After creating this task, run the following command to activate the schedule:

> ```sqlexample
> ALTER TASK <database_name>.<schema_name>.<task_name> RESUME;
> ```

If a task fails because your active role lacks the required privileges, Snowsight displays the relevant error messages so you can
address missing permissions.

For syntax, parameters, and examples, see [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md). For information about passing parameters to scheduled notebooks, see [Running notebooks with parameters](notebooks-in-workspaces-parameters.md).

> **Note:**
>
> To learn more about credit usage, idle timeout behavior, and notebook service management, see [Setting up compute](notebooks-in-workspaces-compute-setup.md)
> and [Idle timeout](notebooks-in-workspaces-compute-setup.md).

---
title: Running notebooks with parameters
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-parameters.md
section: Snowsight UI
---

# Running notebooks with parameters

Currently, parameters passed in the `ARGUMENTS` string are parsed into the `sys.argv` list using whitespace as the delimiter.

## Example: Execute a notebook project with parameters

The following example passes two arguments (env and prod) using ARGUMENTS = ‘env prod’.

The first element (`sys.argv[0]`) is the notebook filename, followed by the space-separated arguments.

```sqlexample
EXECUTE NOTEBOOK PROJECT "<database_name>"."<schema_name>"."<project_name>"
  MAIN_FILE = 'snow://workspace/<workspace_hash>/path/to/notebook.ipynb' -- Notebook name with full file path
  COMPUTE_POOL = '<compute_pool_name>'
  RUNTIME = '<runtime_version>'    -- For example, V2.2-CPU-PY3.11
  QUERY_WAREHOUSE = '<warehouse_name>'
  ARGUMENTS = 'env prod' -- Can pass in a single string, which can be parsed in the notebook code. Point to the environment configuration.
  REQUIREMENTS_FILE = 'path/to/requirements.txt';
```

## View all arguments

To inspect the full list of parameters passed to the session, use the `sys` module.

```python
import sys
print(sys.argv)
```

Output example:

```text
['exampletestSCOS.ipynb', 'env', 'prod']
```

## Print each argument

To process or log each parameter individually, loop through the `sys.argv` list.

```python
import sys
for arg in sys.argv:
    print(arg)
```

Output example:

```text
exampletestSCOS.ipynb
env
prod
```

## Access a specific argument

Parameters are accessed by their index in the list. Because `sys.argv[0]` is the notebook name, the first user parameter starts at `index[1]`.

```python
import sys

# Access the first user parameter
first_param = sys.argv[1]
print(first_param)
```

Output example:

```text
env
```

For full syntax and parameter details, see [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md).

---
title: Save and share results in notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-save-share.md
section: Snowsight UI
---

# Save and share results in notebooks

You can collaborate on data analysis with others using Snowflake Notebooks.

Each Snowflake notebook is owned by a role, so other users that are granted or inherit the owner role can open, run, and edit notebooks
owned by that role. You cannot share the notebook with other roles.

> **Caution:**
>
> Notebooks are saved every three seconds. If other users have the notebook open and run it, you might overwrite each other’s work.

## Export your notebook as a file for sharing

To share your notebook externally, you can export it as an `.ipynb` file. The exported notebook can be shared with others who may not
use Snowflake Notebooks. They can open the notebooks with other solutions that are compatible with the `.ipynb` format.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Open the notebook to export.
4. Select the vertical ellipsis  menu, and then select Export.
5. Acknowledge that some commands might not be supported in other notebook tools, and select Export.

   A file named `notebook_app` is downloaded. You can then
   [import the exported notebook into another Snowflake account](notebooks-create.md) or another tool that supports
   `.ipynb` files.

> **Note:**
>
> Only the cell content — not the cell outputs — is included as part of the export.

To download a CSV file of a cell, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Open the notebook from which you want to download data.
4. Run the cell for the data to download.
5. Hover over the table and select Download data as .csv.

## Collaborate in notebooks

* The role used to create the notebook owns the notebook. For details on privileges required for notebooks, see [Set up Snowflake Notebooks](notebooks-setup.md).
* Any user with that role, or whose role inherits that role, can access, edit, run, and manage the notebook.
* To share and collaborate on a notebook with another user, that user must either have the owner role or be granted a role that
  inherits the owner role of the notebook.
* Ownership of a notebook can be transferred to a different role. For details, see [GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md).

## Limitations

* You cannot share a notebook with other roles.
* Roles with only the USAGE privilege on a notebook cannot create a task to schedule that notebook. The USAGE privilege allows the notebook
  to be referenced in certain contexts (such as the [SHOW NOTEBOOKS](../../sql-reference/sql/show-notebooks.md) command), but does not permit execution or scheduling.

---
title: Schedule notebook runs
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-schedule.md
section: Snowsight UI
---

# Schedule notebook runs

When you create a schedule for running your notebook, Snowsight creates a task to run your notebook on that schedule. Snowsight
executes the notebook in a non-interactive mode, cell by cell from top to bottom. The task used to run the notebook is owned by the notebook
owner role and uses the notebook warehouse to run. By default, task runs of the notebook auto-suspend after 10 failures. For more information about
tasks, see [Introduction to tasks](../tasks-intro.md).

Each notebook run starts the notebook and connects it with the notebook warehouse. As a result, the warehouse used by the notebook is
resumed and remains active until 15 minutes after the scheduled task is complete.

## Privileges required to run your notebook on a schedule

Because the process of running a notebook on a schedule creates and executes a task, you must use a role that has the following privileges:

| Privilege | Object |
| --- | --- |
| EXECUTE TASK | Account |
| USAGE | Database containing the notebook |
| USAGE, CREATE TASK | Schema containing the notebook |

## Schedule your notebook

To schedule your notebook to run, create a task by doing the following:

1. In the navigation menu, select Projects » Notebooks.
2. Locate and select the notebook to schedule.
3. In the notebook, select the schedule button, then Create schedule.

   The Schedule a notebook run dialog appears.
4. For Schedule name, enter a name for the notebook schedule. This is used as the name of the task that runs the notebook.
5. For Frequency, select a frequency at which to run the notebook (for example, Daily).
6. Depending on the frequency that you select, adjust the Scheduled time and other options to match when you want the notebook to run.
7. Optionally, for Parameter, you can add command-line syntax arguments to pass to the scheduled notebook. For example: `key1=value1 key2=value2 --option2`.

   > **Note:**
   >
   > If you pass `--` (two dashes) as a standalone argument, then any argument passed before the dashes will be interpreted as passed to
   > the notebook runtime.
8. Review the preview of the schedule, and select Create.

A task is created that schedules your notebook to run.

## Manage notebook schedules

Once you create schedules for your notebook, you can view and make edits to the schedules via the
task list for the schema that the notebook is in.

1. In the navigation menu, select Projects » Notebooks.
2. Locate and select the notebook to manage schedules for.
3. In the notebook, select the schedule button, then View schedules.
   This displays a table of all tasks contained in the schema that the notebook is in.
4. Use the vertical ellipsis  menu on the task that runs your notebook and select an action.

You can make edits such as changing the time or frequency of the schedule, and suspending or
dropping the task completely. See [Introduction to tasks](../tasks-intro.md) for more details on managing tasks.

## Pass arguments to a scheduled notebook

You can pass command-line arguments to a scheduled notebook using syntax such as `key1=value1 key2=value2 --option2`. You can then access these parameters within the scheduled notebook using `sys.argv` inside the notebook.

```python
# first argument
sys.argv[0]

# print the entire list
st.write(sys.argv)
```

## View past scheduled notebook runs

After your notebook runs as scheduled, you can review its run history:

1. In the navigation menu, select Projects » Notebooks.
2. Locate and select the notebook to schedule.
3. In the notebook, select the schedule button, then View run history.

   The Run History dialog for your notebook appears.
4. You can review the run history for the notebook, including run activity generated by the scheduled task or an API. Notebook runs
   performed by a user are not included. You can review the following details:

   * Trigger: The name of the task that caused the notebook to run.
   * Last Ran: The timestamp of the last scheduled run of the notebook.
   * Status: The status of the task that ran.
   * Duration: The length of time it took to run the notebook.
   * Results: A link to the results of the notebook run. The results are read-only and the notebook cells cannot be edited.
     You can select Edit current notebook to open and edit the current version of the notebook.
5. Optionally select View all tasks in schema to see a table of all tasks contained in the schema.
6. Select Done to return to your notebook.

For more information on viewing task history, see [View the task history for your account](../tasks-intro.md).

## Using your own scheduler

To run a notebook using your own scheduling tool like Airflow, use the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) command. You can also pass command-line arguments to the scheduled notebook directly when running this command. For example:

```sqlexample
EXECUTE NOTEBOOK DB.SCHEMA.NOTEBOOK_NAME('--env staging --tablename staging-table');
```

This executes the notebook cell by cell from top to bottom. The results are accessible in the notebook’s run history section.

## Limitations

* The run history for your scheduled notebooks is limited to the last seven days.
* Changing the name of a notebook running as a scheduled task may cause an error in the task. You can manually
  edit the task using the [CREATE TASK](../../sql-reference/sql/create-task.md) command to call the notebook with the changed name.
* Nested execution of SPCS notebooks is not supported. A scheduled SPCS notebook cannot run another SPCS notebook from within its code.
* The task’s owner role must match the notebook’s owner role. Scheduled runs do not support secondary roles, but you can switch to a role
  granted to the task’s owner role. For example, in most cases, you can switch to the PUBLIC role:

  ```python
  from snowflake.snowpark.context import get_active_session
  session = get_active_session()
  session.use_role('PUBLIC')
  ```

## Failed notebook runs

* If a scheduled notebook run results in an error, that run will appear in the run history with a `Failed` status. The user who created
  the notebook can open the failed run and isolate the cell where the error occurred.

---
title: Scheduling workflows by scenario
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-workflow-scenarios.md
section: Snowsight UI
---

# Scheduling workflows by scenario

This topic provides detailed workflows for scheduling notebooks in two common scenarios:

* **Scenario A:** Development in a private workspace - Schedule notebooks directly from Snowsight
* **Scenario B:** Production (CI/CD) - Deploy notebooks from a Git repository using CI/CD pipelines

> **Note:**
>
> Scheduling notebooks is not currently supported in shared workspaces.

## Scenario A: Development in a private workspace

1. In the navigation menu, select Projects » Workspaces.
2. Select + Add new » Notebook to create a new notebook, or open an existing notebook to be scheduled.

   > **Note:**
   >
   > Ensure that you have specified the execution context (database and schema) in the notebook you are scheduling. For more information,
   > see [Set the execution context](notebooks-in-workspaces-edit-run.md).
3. At the top of the notebook editor, select Scheduled runs .

   * If this is the notebook’s first task, the  icon is a calendar.
   * If a schedule already exists, the  icon is a calendar with a clock.
4. Select Create Schedule.
5. In the Schedule a Notebook Task dialog, provide the following information:

   **Basic settings**

   * **Task name:** The unique name for the scheduled task. The default name is `{notebook-name}_task_#` but can be updated if necessary.
   * **Owner role:** The Snowflake role under which the task executes. Select a role with the required permissions to execute all operations performed by
     the scheduled notebook. This role must have permissions to:

     + Read/write the database objects the notebook uses.
     + Access warehouses, compute pools, and integrations.
     + Create/update the task and project objects.
   * **Location:** The database and schema where the task object and associated notebook project object is created. Choose a schema where your role
     has CREATE TASK and USAGE privileges. If your role has only USAGE privileges on the schema, ensure it also has the CREATE NOTEBOOK PROJECT privilege.
   * **Frequency:** How often the notebook should run. Choose from: Hourly, Daily, Weekly, Monthly, or Custom (Cron scheduling). All execution times use
     your local time zone.

   **Advanced settings (all fields are required unless otherwise specified)**

   * **Notebook project name:** A unique name for the notebook’s project container that Snowflake creates for task execution. If not edited, Snowflake provides a
     default name.
   * **Parameters (optional):** Key-value parameters are passed to the notebook at runtime and appear as command-line arguments (in `sys.argv`). Parameters
     are useful for passing dates, environment flags, thresholds, or model versions. Parameters can be passed in Snowsight as whitespace-separated values
     or in the [EXECUTE NOTEBOOK PROJECT](../../../sql-reference/sql/execute-notebook-project.md) command as `ARGUMENTS = 'env prod'`. For more information, see
     [Running notebooks with parameters](notebooks-in-workspaces-parameters.md).
   * **Runtime variant:** The runtime environment used for notebook execution. Choose from:

     + **CPU:** Uses a CPU Container Runtime environment and runs on a CPU compute pool (for example, the automatically provisioned `SYSTEM_COMPUTE_POOL_CPU`).
     + **GPU:** Uses a GPU Container Runtime environment that includes GPU-accelerated libraries and runs on a GPU compute pool (such as `SYSTEM_COMPUTE_POOL_GPU`).
     + **Python version:** The Python version used during task execution.
     + **Runtime version:** The base Container Runtime image. Choosing the correct runtime version ensures that your notebook runs consistently between
       development and scheduled execution.
   * **Compute pool:** The compute pool that executes the notebook task. Ensure that the compute pool has capacity (free nodes) at the time of
     the scheduled execution. To prevent scheduled runs from failing, we recommend that you use a dedicated compute pool to ensure no other SPCS services
     take up full capacity.
   * **Query warehouse:** The Snowflake warehouse used for all SQL queries inside the notebook.
   * **External access integrations (optional):** Defines which external access integrations (EAIs) the notebook may use. EAIs are required if
     your notebook requires external APIs, third-party services, or cloud storage outside of Snowflake’s internal stages. If no EAIs are listed, your
     selected role does not own or have privileges on any integrations.
   * **Requirements file (optional):** Pre-install Python dependencies for repeatable runs using the `REQUIREMENTS_FILE` parameter. For more
     information, see [Managing packages and runtime](notebooks-in-workspaces-packages-runtime.md).
6. Review the schedule preview, and select Create.

## Scenario B: Production (CI/CD)

For production environments, we recommend managing notebook code in a Git-based workspace (for details, see [Integrate workspaces with a Git repository](../workspaces-git.md))
or developing locally in your preferred IDE. You can use a CI/CD pipeline (such as GitHub Actions) to deploy files to a Snowflake internal or temporary stage.

For a hands-on walkthrough of this pattern, see the [Getting Started with Data Engineering using Snowflake Notebooks](https://www.snowflake.com/en/developers/guides/data-engineering-with-notebooks/)
quickstart and the accompanying [code repository](https://github.com/Snowflake-Labs/sfguide-data-engineering-with-notebooks) on GitHub.

After the files are on the stage, you can:

* Create a Notebook Project Object (NPO) sourced from that stage location.
* Schedule the NPO using a Snowflake Task for automated execution.

1. **Create a stage**

   Use [CREATE STAGE](../../../sql-reference/sql/create-stage.md) to create an internal or temporary stage:

   ```sqlexample
   -- Ensure the landing zone exists
   CREATE STAGE IF NOT EXISTS <database_name>.<schema_name>.<stage_name>;
   ```
2. **Load/deploy notebook file(s) to the internal or temporary stage**

   Your CI/CD pipeline should upload the `.ipynb` file(s) to a Snowflake stage. Use the [PUT](../../../sql-reference/sql/put.md) command to ensure that the notebook
   files are loaded into a stage readable by the Notebook Project.

   ```sqlsyntax
   PUT file://<absolute_path_to_file>/ @<database_name>.<schema_name>.<stage_name> AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```

   Example:

   ```sqlexample
   PUT file://notebooks/ml_model/train.ipynb @<database_name>.<schema_name>.<stage_name> AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```
3. **Create or update the Notebook Project Object (NPO)**

   Create (or update) the NPO to reference the internal or temporary stage that contains your deployed notebook files:

   ```sqlexample
   CREATE NOTEBOOK PROJECT IF NOT EXISTS <database_name>.<schema_name>.<project_name>
     FROM '@<database_name>.<schema_name>.<stage_name>';
   ```
4. **Alter the notebook project details**

   For subsequent code changes, your pipeline executes an ALTER command. This updates the project to the latest version of the code without
   having to drop and recreate the object:

   ```sqlexample
   -- Update the project with the latest code from the stage
   ALTER NOTEBOOK PROJECT <database_name>.<schema_name>.<project_name>
     ADD VERSION FROM '@<database_name>.<schema_name>.<stage_name>';
   ```
5. **Execute the notebook project (orchestrate with a task)**

   Create a task to schedule and execute the NPO. Use a Snowflake task to define the schedule and execution parameters for the NPO.

   > **Note:**
   >
   > Ensure that you specify your notebook execution context (use the database and schema of the notebook you want to schedule). For more
   > information, see [Set the execution context](notebooks-in-workspaces-edit-run.md).

   ```sqlexample
   -- Create or replace the task to orchestrate the notebook
   CREATE OR REPLACE TASK <database_name>.<schema_name>.<task_name>
     WAREHOUSE = '<warehouse_name>'
     SCHEDULE = 'USING CRON 0 9 * * * America/Los_Angeles'
   AS
     EXECUTE NOTEBOOK PROJECT <database_name>.<schema_name>.<project_name>
       MAIN_FILE = 'snow://workspace/<workspace_hash>/path/to/notebook.ipynb'
       COMPUTE_POOL = 'SYSTEM_COMPUTE_POOL_CPU'
       RUNTIME = 'V2.2-CPU-PY3.12'
       QUERY_WAREHOUSE = '<warehouse_name>'
       ARGUMENTS = '<db_name> <schema_name> <warehouse_name>';
   ```

   For information about passing parameters to scheduled notebooks, see [Running notebooks with parameters](notebooks-in-workspaces-parameters.md).
6. **View your notebook run or execution history**

   After the task runs, you can monitor its success or failure in Snowsight to ensure the CI/CD deployment is performing as expected.
   For detailed instructions on viewing run history, see [View scheduled notebook runs](notebooks-in-workspaces-schedule.md).

Snowsight supports non-interactive (headless) execution of notebooks. This allows you to trigger a programmatic run of a notebook without
opening Snowsight and without requiring a recurring schedule.

Headless execution is intended for tasks, scheduled tasks, or workflows orchestrated by tools such as Airflow, Prefect, Dagster, CI/CD pipelines, or
external systems that need to execute a notebook programmatically. For more information, see [CREATE NOTEBOOK PROJECT](../../../sql-reference/sql/create-notebook-project.md).

> **Note:**
>
> To run the SQL commands in this workflow (such as `CREATE NOTEBOOK PROJECT` and `CREATE TASK`), you must execute them from a SQL
> file or SQL worksheet in Workspaces, not from within a notebook cell.

---
title: Session context in Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-sessions.md
section: Snowsight UI
---

# Session context in Notebooks

The session context of a notebook is defined by the role, warehouse, database, and schema that you defined when you created the notebook.
When you run the notebook, it runs as that role, using the warehouse defined in the notebook, and in the context of the database and schema
that contain the notebook.

This topic describes how to access or change the session context of your notebook.

## Accessing the session context for a notebook

You can access the session context using both Python and SQL.

If you’re using the Snowpark Python library or Snowflake Python APIs, use the
[get_active_session()](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.context.get_active_session)
method to get the active session context.

```python
from snowflake.snowpark.context import get_active_session
session = get_active_session()
```

For SQL, you can use the [Context functions](../../sql-reference/functions-context.md) SQL functions.

```sqlexample
SELECT CURRENT_WAREHOUSE(), CURRENT_DATABASE(), CURRENT_SCHEMA();
```

## Changing the session context for a notebook

You can change the session context of the notebook to use a different role, database and schema, and/or warehouse:

* Specify a different role to use with the [USE ROLE](../../sql-reference/sql/use-role.md) SQL command.

  + You can check the role in use by the notebook by calling the [CURRENT_ROLE](../../sql-reference/functions/current_role.md) function.
  + If you change your role to one that does not have privileges to use the notebook warehouse, database, or schema,
    queries that require a warehouse or access to the notebook database or schema fail to run. However,
    you can still run queries that do not use the notebook warehouse, database, and schema.
  + Roles specified with the [USE ROLE](../../sql-reference/sql/use-role.md) SQL command do not persist across notebook sessions.
  + If you specify a database or schema that the currently active role does not have privileges to access, queries using that database
    and schema fail to run.
* If you run the SQL command [USE SECONDARY ROLES](../../sql-reference/sql/use-secondary-roles.md) to set secondary roles to ALL, the secondary roles associated
  with your user are used to generate the results of the notebook cells.
* Specify a different warehouse using the SQL command [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md).

  + You can check the warehouse in use for the notebook by calling the [CURRENT_WAREHOUSE](../../sql-reference/functions/current_warehouse.md) function.
* Specify a different database or schema using [USE DATABASE](../../sql-reference/sql/use-database.md) or
  [USE SCHEMA](../../sql-reference/sql/use-schema.md) SQL commands.

  + You can check the database in use for the notebook by calling the [CURRENT_DATABASE](../../sql-reference/functions/current_database.md) function.
  + If you reference objects in the notebook database or the database specified in an earlier notebook cell, you can simplify your
    SQL statements to include only the schema and object that you want to reference, instead of the fully qualified path to the object.

---
title: Set up external access for Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-external-access.md
section: Snowsight UI
---

# Set up external access for Snowflake Notebooks

When working with notebooks, you might need to call external services, which often require sensitive credentials such as API keys. To keep
sensitive information secure, you can use secrets managed within Snowflake instead of hardcoding credentials in your notebook.

[External access integrations (EAIs)](../../developer-guide/external-network-access/external-network-access-overview.md) are configured using
network rules and can optionally use Snowflake secrets for authentication.

By default, Snowflake restricts network traffic from external endpoints. To access external endpoints, follow these steps:

1. Create a network rule.
2. Create an [external network access integration](../../developer-guide/external-network-access/external-network-access-overview.md) that uses the rule.
3. Create a secret for authentication (if needed). Generic string secrets also require an EAI.
4. Associate the secret with the EAI.
5. Associate the EAI and secret with the notebook.

> **Note:**
>
> EAIs and network rules must be created by an organization administrator. For required privileges, see [Access control requirements](../../sql-reference/sql/create-external-access-integration.md).

## Configure a notebook with external access and secrets

This end-to-end example shows how to configure a notebook to access the OpenAI API using a generic string secret.

```sqlexample
-- Step 1: Create a secret
CREATE SECRET openai_key
  TYPE = GENERIC_STRING
  SECRET_STRING = '<your-api-key>';

-- Step 2: Create a network rule
CREATE OR REPLACE NETWORK RULE openai_rule
  MODE = EGRESS
  TYPE = HOST_PORT
  VALUE_LIST = ('api.openai.com');

-- Step 3: Create an external access integration that uses the network rule and secret
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION openai_integration
  ALLOWED_NETWORK_RULES = (openai_rule)
  ALLOWED_AUTHENTICATION_SECRETS = (openai_key)
  ENABLED = true;

-- Step 4: Associate the integration and secret with the notebook
ALTER NOTEBOOK my_notebook
  SET EXTERNAL_ACCESS_INTEGRATIONS = (openai_integration),
    SECRETS = ('openai_key' = openai_key);
```

> **Note:**
>
> Secrets must be associated with both the external access integration (EAI) and the notebook. If a secret is associated with only one, it will not be accessible from notebook code.

## Access the secret inside a notebook

* After associating the secret with the notebook, to access its value in notebook code, use the `st.secrets` object:

```python
import streamlit as st
api_key = st.secrets['openai_key']
```

## Additional EAI examples

These examples show how to set up external access for common data science and machine learning sites:

### EAI for PyPI

```sqlexample
CREATE OR REPLACE NETWORK RULE pypi_network_rule
MODE = EGRESS
TYPE = HOST_PORT
VALUE_LIST = ('pypi.org', 'pypi.python.org', 'pythonhosted.org', 'files.pythonhosted.org');

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION pypi_access_integration
ALLOWED_NETWORK_RULES = (pypi_network_rule)
ENABLED = true;
```

### EAI for Hugging Face

```sqlexample
CREATE OR REPLACE NETWORK RULE hf_network_rule
MODE = EGRESS
TYPE = HOST_PORT
VALUE_LIST = ('huggingface.co', 'cdn-lfs.huggingface.co');

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION hf_access_integration
ALLOWED_NETWORK_RULES = (hf_network_rule)
ENABLED = true;
```

## Grant USAGE privileges to use external access integrations

* After you create the EAIs, grant the USAGE privilege on the integration to roles that will use them:

  ```sqlexample
  GRANT USAGE ON INTEGRATION openai_integration TO ROLE my_notebook_role;
  ```

The role used to create the notebook must have USAGE on the EAI. Granting USAGE to the PUBLIC role will not work.

## Enable external access integrations in Snowsight

After you create and provision EAIs, restart the notebook session in order to see the access integrations you created in
the External Access pane.

To enable integrations using Snowsight:

1. In the navigation menu, select Projects » Notebooks.
2. Open your notebook.
3. Select the  icon at the top right of your notebook.
4. Select Notebook settings, and then select External access.
5. Toggle on the EAIs you want to enable for the notebook.

## Additional authentication examples

### OAuth access token

```sqlexample
CREATE OR REPLACE SECRET oauth_token
    TYPE = OAUTH2
    API_AUTHENTICATION = google_translate_oauth
    OAUTH_REFRESH_TOKEN = 'my-refresh-token';
```

```sqlexample
# Using the secret as part of an EAI
  ALTER NOTEBOOK google_translate_test
    SET EXTERNAL_ACCESS_INTEGRATIONS=(google_translate_integration)
      SECRETS = ('cred' = oauth_token);
```

### Secret type: GENERIC_STRING

Use a `GENERIC_STRING` secret to store a single value, such as an API key or token.

Create the secret:

```sqlexample
CREATE SECRET sf_openai_key
  TYPE = GENERIC_STRING
  SECRET_STRING = '<string_literal>';

-- SQL: Associate the secret and EAI with the notebook
ALTER NOTEBOOK openai_test
  SET EXTERNAL_ACCESS_INTEGRATIONS = (openai_access_int),
    SECRETS = ('openai_key' = sf_openai_key);
```

For GENERIC_STRING secrets, access them by dictionary or attribute style:

```python
import streamlit as st

# Access the string value directly
my_openai_key = st.secrets['openai_key']
# or using attribute access
my_openai_key = st.secrets.openai_key
```

### Secret type: PASSWORD (example: GitHub Basic Auth)

Use a `PASSWORD` secret to store a username and password pair. These are often required for basic authentication with external APIs.

In this example, the notebook accesses the GitHub REST API using a `PASSWORD` secret and an external access integration.

Create the secret:

```sqlexample
CREATE SECRET password_secret
  TYPE = PASSWORD
  USERNAME = 'my_user_name'
  PASSWORD = 'my_password';
```

Use the secret as part of an EAI:

```sqlexample
ALTER NOTEBOOK github_user_info
SET EXTERNAL_ACCESS_INTEGRATIONS = (github_access_int),
    SECRETS = ('cred' = password_secret);
```

Access the secret in your code:

```python
import streamlit as st
import requests
from requests.auth import HTTPBasicAuth

# Access credentials from the secret
username = st.secrets.cred.username
password = st.secrets.cred.password

# Make an authenticated request
response = requests.get(
    'https://api.github.com/user',
    auth=HTTPBasicAuth(username, password)
)

print(response.status_code)
print(response.json())
```

## Additional resources

* For detailed syntax, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).
* For details on using CREATE SECRET, see [Creating a secret to represent credentials](../../developer-guide/external-network-access/creating-using-external-network-access.md).
* For additional examples of EAIs, see [External network access examples](../../developer-guide/external-network-access/external-network-access-examples.md) or
  [Setting up External Access for Snowflake Notebooks on Github](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Access%20External%20Endpoints/Access%20External%20Endpoints.ipynb).

---
title: Set up Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-setup.md
section: Snowsight UI
---

# Set up Snowflake Notebooks

Snowflake Notebooks are first-class objects stored within a schema under a database. They can run on two compute architectures: warehouses and
containers. This topic provides steps to set up your account as an administrator and start using Snowflake Notebooks.

## Administrator setup

To set up your organization using Snowflake Notebooks, perform these steps:

1. Review account and deployment requirements.
2. Accept the Anaconda terms to import libraries.
3. Create resources and grant privileges to create notebooks.

### Review account and deployment requirements

Ensure that `*.snowflake.app` and `*.snowflake.com` are on the allowlist in your network (including content filtering systems), and
can connect to Snowflake. For Streamlit apps using container runtimes, also add `*.snowflakecomputing.app` to the allowlist.
When these domains are on the allowlist, your apps can communicate with Snowflake servers without any restrictions.
However, in some cases adding these domains may not be sufficient due to network policies blocking subpaths under them. If this occurs,
contact your network administrator.

In addition, to prevent any issues connecting to the Snowflake backend, ensure that WebSockets are not blocked in your network configuration.

## Using third-party packages from Anaconda

Snowflake provides access to a curated set of Python packages built by Anaconda. These packages integrate directly into Snowflake’s Python features at no extra cost.

### Licensing terms

* **In Snowflake:** Governed by your existing Snowflake customer agreement, including the Anaconda usage restrictions described in this documentation. No separate Anaconda terms apply for in-Snowflake use.
* **Local development:** From Snowflake’s [dedicated Anaconda repository](https://repo.anaconda.com/pkgs/snowflake/) : Subject to Anaconda’s Embedded End Customer Terms and Anaconda’s Terms of Service posted on the repository. Local use is limited to developing/testing workloads intended for deployment in Snowflake.

### Create resources and grant privileges

To create a notebook, a role needs privileges on the following resources:

* [CREATE NOTEBOOK](../../sql-reference/sql/create-notebook.md) privilege on a location
* USAGE privilege on compute resources
* (Optional) USAGE privilege on external access integrations (EAIs)

See Template for Notebooks setup for example scripts of creating and granting permissions on these resources.

#### Location

The location is where a notebook object is stored. The end user can query any database and schema their role has access to.

* To change the context to a different database or schema, use the [USE DATABASE](../../sql-reference/sql/use-database.md) or
  [USE SCHEMA](../../sql-reference/sql/use-schema.md) commands in a SQL cell.

In the Container Runtime, the role that is creating the notebook also requires the [CREATE SERVICE](../../sql-reference/sql/create-service.md) privilege on the schema.

| Privilege | Object |
| --- | --- |
| USAGE | Database |
| USAGE | Schema |
| CREATE NOTEBOOK | Schema |
| CREATE SERVICE | Schema |

Roles that own a schema automatically have the privilege to create notebooks within that schema, because owners can create any type of object,
including notebooks.

| Privilege | Object |
| --- | --- |
| USAGE | Database |
| OWNERSHIP | Schema |

### Compute resources

In the Warehouse Runtime, both a notebook’s engine and Python processes from the code authored in the notebook run on the notebook
warehouse, but SQL queries and Snowpark push down queries run on the Query warehouse. The owner role of the notebook requires the
USAGE privilege on both warehouses.

If a notebook runs on Container Runtime, the role needs the USAGE privilege on a compute pool instead of on the notebook warehouse. Compute
pools are CPU-based or GPU-based virtual machines managed by Snowflake. When creating a compute pool, set the MAX_NODES parameter to greater than
one because each notebook will require one full node to run. For information, see [Snowpark Container Services: Working with compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

| Privilege | Object |
| --- | --- |
| USAGE | Notebook warehouse or compute pool |
| USAGE | Query warehouse |

### External access integrations (optional)

If you allow certain roles to access an external network, use the ACCOUNTADMIN role to set up and grant the USAGE privilege on
external access integrations (EAIs). EAIs allow access to specific external endpoints so your teams can download data and models, send API
requests and responses, log in to other services, etc. For notebooks running on Container Runtime, EAIs also allow your teams to install
packages from repositories such as PyPi and Hugging Face.

For details on how to set up EAI for your notebook, see [Set up external access for Snowflake Notebooks](notebooks-external-access.md).

| Privilege | Object |
| --- | --- |
| USAGE | External access integration |

### Template for Notebooks setup

Because notebooks are objects with role-based creation and ownership privileges, you can configure access to the Notebooks feature to align
with your organization and team needs. Here are a few examples:

#### Allow everyone to create notebooks in a specific location

The following steps outline how to configure access for creating notebooks in a specific location by granting usage on a database and schema.

Replace <database> and <database.schema> with the specific database and schema where you want to create your notebooks:

```sqlexample
----------------------------------
--       Location Setup         --
----------------------------------
GRANT USAGE ON DATABASE <database> TO ROLE PUBLIC;
GRANT USAGE ON SCHEMA <database.schema> TO ROLE PUBLIC;
GRANT CREATE NOTEBOOK ON SCHEMA <database.schema> TO ROLE PUBLIC;

----------------------------------
--    Compute Resource Setup    --
----------------------------------
GRANT USAGE ON WAREHOUSE <warehouse> TO ROLE PUBLIC;

-------------------------------------
-- Optional: External Access --
-------------------------------------

-- Example EAI
CREATE OR REPLACE NETWORK RULE allow_all_rule
MODE = 'EGRESS'
TYPE = 'HOST_PORT'
VALUE_LIST = ('0.0.0.0:443','0.0.0.0:80');

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION allow_all_integration
ALLOWED_NETWORK_RULES = (allow_all_rule)
ENABLED = true;

GRANT USAGE ON INTEGRATION allow_all_integration TO ROLE PUBLIC;
```

#### Create a dedicated role

If you only want specific users to create notebooks (assuming they do not already OWN any schemas), you can create a dedicated role for
controlling access. For example:

```sqlexample
CREATE ROLE notebooks_rl;
```

Grant the ROLE notebook_rl to specific users. Then, use the above script to create resources and grant permissions to this role (replace
ROLE PUBLIC with ROLE notebook_rl).

#### Notebook engine

The notebook engine (“kernel”) and Python processes run on the Notebook warehouse. Snowflake recommends that you start with an X-Small
warehouse to minimize credit consumption.

While you are using the notebook (for example, editing code, running, reordering, or deleting cells), or if the notebook remains active
within its idle timeout setting, an [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query will run continuously to indicate that the notebook
engine is active and a notebook session is in use. You can check the status of this query in Query history. While
[EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) is running, the Notebook warehouse is also running. When
[EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) finishes, if there are no other queries or jobs running on the warehouse, it will shut down
according to its auto-suspend policy.

To end the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query (end the notebook session), follow these steps:

1. Select Active or select End session from the Active drop-down menu.
2. In Query history, find the corresponding [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query and select Cancel query.
3. Let the notebook time out due to inactivity based on its idle time setting.
   If the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) and [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) parameters on the Notebook
   warehouse are set to a small value, the notebook could shut down quickly or fail to start, regardless of user activity.

#### Queries

SQL and Snowpark queries (for example, session.sql) are pushed down to the Query warehouse, which is used on demand. When the SQL and
Snowpark queries finish running, the Query warehouse suspends if no other jobs are running on it outside the notebook. Select a warehouse
size that best fits your query performance needs. For example, you might want to run large SQL queries or perform compute-intensive
operations using Snowpark Python that require a larger warehouse. For operations that require high memory usage, consider using a
[Snowpark-optimized warehouse](../warehouses-snowpark-optimized.md).

You can change the Query warehouse in Notebook Settings. Alternatively, you can run the following command in any SQL cell in the notebook to
change the Query warehouse for all subsequent queries in the current notebook session:

```sqlexample
USE WAREHOUSE <warehouse_name>;
```

#### Idle time and reconnection

Idle time accumulates when the user is not performing any actions, such as editing code, running cells, reordering cells, or deleting cells. Each
time you resume activity, the idle time resets. Once the idle time reaches the timeout setting, the notebook session automatically shuts down.

By default, notebooks are suspended after a period of inactivity. The default idle timeout depends on the runtime:

* **Warehouse Runtime notebooks:** 30 minutes (1,800 seconds) of inactivity
* **Container Runtime notebooks:** 60 minutes (3,600 seconds) of inactivity

You can set the idle timeout to a maximum of 72 hours (259,200 seconds). To update the idle timeout setting, use either the CREATE NOTEBOOK
or ALTER NOTEBOOK commands to set the value of the IDLE_AUTO_SHUTDOWN_TIME_SECONDS property.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Open the notebook that you want to update.
4. Select the vertical ellipsis  menu at the top right of your notebook.
5. Select Notebook settings.
6. Select Owner.
7. Select the idle timeout setting from the dropdown.
8. Manually restart the session for the new idle time to take effect.

Before idle timeout, your notebook session will remain active until the idle timeout period is reached, even if you refresh the page, visit other parts
of Snowsight, or shut down or sleep your computer. When you reopen the same notebook, you reconnect to the same session, with all
session states and variables preserved, allowing you to continue working seamlessly. Note, however, that the state
of your Streamlit widgets will not be retained.

Each individual user running the same notebook has their own independent session. They do not interfere with one another.

#### Recommendations for optimizing cost

As an account administrator, consider the following recommendations to control the cost of running notebooks:

* Ask your teams to use the same warehouse (X-Small is recommended) as a dedicated “Notebook warehouse” for running the notebook sessions to increase
  concurrency. Note that this might lead to slower session starts (queued on warehouse) or out-of-memory errors if too many notebooks are
  to be executed simultaneously.
* Allow your teams to use a warehouse with a lower [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) value to run notebooks. This warehouse parameter controls how
  long any queries can last, including notebook sessions. For example, if the parameter is set to 10 minutes, the notebook session can run for a
  maximum of 10 minutes, regardless of whether the user is active in the notebook session during that time.
* Ask your teams to end their notebook sessions when they do not intend to actively work in the session.
* Ask your teams to minimize the idle timeout setting (for example, to 15 minutes) if they do not need the session to run for an extended
  period of time.
* Alternatively, raise a support ticket to set a default value for idle time that applies to your entire account. This value can still be
  overridden at the notebook level by the notebook owner.

## Get started using notebooks by adding data

Before you get started using Snowflake Notebooks, add data to Snowflake.

You can add data to Snowflake in several ways:

* Add data from a CSV file to a table using the web interface. See [Load data using Snowsight](../data-load-web-ui.md).
* Add data from external cloud storage:

  + To load data from Amazon S3, see [Bulk loading from Amazon S3](../data-load-s3.md).
  + To load data from Google Cloud Storage, see [Bulk loading from Google Cloud Storage](../data-load-gcs.md).
  + To load data from Microsoft Azure, see [Bulk loading from Microsoft Azure](../data-load-azure.md).
* Add data in bulk programmatically. See [Bulk loading from a local file system](../data-load-local-file-system.md).

You can also add data in other ways. See [Overview of data loading](../data-load-overview.md) for complete details.

---
title: Shared workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces-shared.md
section: Snowsight UI
---

# Shared workspaces

## Overview

A standard Snowflake workspace provides an environment for individual development and can be created as a private workspace or connected to
a Git repository.

You can also create a *shared workspace* to share with specific roles. Shared workspaces introduce a new model for team-based
collaboration directly within Snowflake. Instead of sharing individual files, users can create dedicated spaces where work is organized, versioned,
and shared with roles that represent teams or groups.

| Workspace type | Purpose | Storage location |
| --- | --- | --- |
| Private | Default mode for individual development. Ideal for ad-hoc exploratory data analysis (EDA), administration tasks, and private projects. | User’s Personal Database (PDB) |
| Git-synced | Private workspace connected to a Git repository. Ideal for production workloads and complex multi-file projects. | User’s PDB, synced to an external Git repository |
| Shared | Multi-user collaboration using wiki-style drafts and a publish model. Shared as RBAC schema objects in databases and schemas. | Standard database and schema |

## Shared workspace functionality

Shared workspaces are created within a specific database and schema, which grants access to multiple authenticated users. Users assigned
specific roles can then contribute, edit, and modify code and files simultaneously within the environment.

Users with access to a shared workspace can perform the following actions:

* View and edit the contents of the shared workspace.
* Run queries using their own access privileges.
* Collaborate on file edits with other authorized users.
* Move or copy files and folders from any of their private workspaces to the shared workspace. This capability allows users to integrate
  existing work into the team environment.

## Create a shared workspace

Shared workspaces are created within a specific database and schema that the user has access to. To create a shared workspace, the user must have one of the following privileges:

* **Option 1**: The CREATE WORKSPACE privilege on the destination schema and the USAGE privilege on the destination database.

  ```sqlexample
  GRANT USAGE ON DATABASE <database_name> TO ROLE <role_name>;
  GRANT CREATE WORKSPACE ON SCHEMA <database_name>.<schema_name> TO ROLE <role_name>;
  ```

  > **Note:**
  >
  > The USAGE privilege applies to the database itself (not to a schema). The CREATE WORKSPACE privilege applies to the schema within that database.
* **Option 2**: The OWNERSHIP privilege on the destination schema.

Shared workspaces can be shared with roles that have the USAGE privilege on the database where the shared workspace is located.

To create a shared workspace, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Workspaces menu, select Shared workspace in the Create section.
4. Specify a shared workspace name.
5. Select a shared database and schema for the workspace.
6. Specify the roles to share the workspace with.
7. Select Create after you have finished adding roles.

## Access and filter shared workspaces

You can navigate, filter, and search for workspaces using the Workspaces menu.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the Workspaces menu. The menu displays a list of all accessible workspaces.
4. Refine the list of workspaces using the filter buttons at the top of the list:
   :   * All - View all workspaces you have access to, including private and shared workspaces.
       * Private - Only display the workspaces that are private to you.
       * Shared - Only display the workspaces that have been shared with you.
5. To search for a workspace, start typing the workspace name in the Search field (indicated by a magnifying glass icon). The list
   dynamically filters to show only the workspaces matching your search query.
6. Select the name of the workspace to open. A checkmark appears next to the currently active workspace.

## Share files and folders in a workspace

There are two ways to share files and folders in a private workspace with other users:

* Move or Copy a file or folder from the workspace list into a shared workspace.
* Click Share to share a single file that is open in the Workspaces editor.

To move or copy files or folders from the workspace list:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the files or folders to move or copy in the workspace list.
4. Select the ellipsis  for the selected items.
5. Select Copy to or Move.
6. In the dialog that appears, select a shared workspace destination for the items.
7. Select Copy to destination or Move.

> > **Note:**
> >
> > You can also copy and move files to another private workspace.

To share the file currently open in the Workspaces editor:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. From the currently open file in the editor, click Share on the upper-right.
4. From the drop-down, you can:

   > * Move file to shared workspace: Select a destination and select Move. Only shared workspaces are displayed.
   > * Copy URL: Copy the file’s unique URL to your clipboard. This option is only available if the file is in a shared workspace. Any user
   >   with access to that shared workspace can use this URL to directly open the file and its containing workspace, making it efficient to share
   >   specific files. If the file is deleted or renamed, the URL will no longer work.
   > * Copy code: Copy the contents of the file to clipboard.
   > * Download: Download the file to your computer.

After a move or copy, the file or folder is published in the shared workspace and is immediately visible to all collaborators with access.

### Manage access to a shared workspace

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the ellipsis  next to the shared workspace you want to manage.
4. Select Configure workspace.
5. Select the Location & access tab from the Configure workspace dialog. From this tab, you can:

> * Remove a role that was granted to a user by selecting the trash icon.
> * Add a new role to access the shared workspace. To filter the list, start typing a role name.

## Collaborate in a shared workspace

Shared workspaces use a wiki-style collaboration model to manage changes:

| Concept | Description |
| --- | --- |
| Draft State | When you begin editing a file, your changes enter a draft state. The file does not automatically update with changes from other collaborators, and only you can see your edits. |
| Publishing | To make your changes visible to all other collaborators, you must publish the file. This is a per-file action that updates the shared version. |
| Publish history | For any file, you can view the history of published versions by selecting the Publish changes drop-down and selecting View publish history. |

When you access a shared workspace, you automatically see the latest, published versions of all files. The only exception is any file you
currently have in a draft state.

> **Note:**
>
> Certain actions on the file tree do not require a separate publishing action and are immediately visible to all collaborators. These
> actions include uploading, renaming, and deleting files and/or folders.

To collaborate within a shared workspace, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Open a workspace and make your updates.
4. Select Publish. Your changes are published, and the file is updated for all collaborators.

   > > **Note:**
   > >
   > > If a file with a draft (or one of its parent folders) is deleted by another user, you will be prompted to recreate it (and its folder path) when publishing.

When you are collaborating in a shared workspace, you can take the following actions on files in draft state using the Publish changes drop-down:

* **View publish history** - Select View publish history to see the history of published versions of the file.
* **Show changes** - Select Show changes to compare your current local draft against the latest published version in a side-by-side
  comparison view. Review all changes made between your draft and the latest published version. Select Hide changes to return to the editor.
* **Discard changes** - Select Discard changes to permanently erase your unpublished draft edits
  and revert the file to the last published version. You are prompted to confirm.

### Resolve conflicts

If another user publishes a version of the file while you’re working on a draft, you will be prompted to take action when you attempt to
publish:

* Select Overwrite to overwrite the version published by the other user, making your version the latest published version.
* Select Cancel to exit, and then select Discard. Your edits are discarded and the other user’s version is now the latest published version.
* Select Show differences to view a side-by-side view to resolve the conflict before publishing your changes.

### View publish history

After making updates to a file in a shared workspace, you can revert to a previous version of a file by viewing its publish history.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Open the file you want to check or restore from its publish history.
4. Select the Publish changes drop-down and select View publish history.
5. In the right-hand panel, browse through the different versions by clicking on the timestamps.
6. Filter the list of versions by selecting All (to view every version), By me (to view your own updates), or By others (to view changes made by collaborators).
7. Select a specific timestamp to preview a version in the left-hand panel.
8. When you find the version you want to revert to, select it and then select Restore this version.
9. Select Restore and publish to confirm. The file opens in the editor, and you can choose to publish this version or continue editing.

---
title: Snowflake Notebooks in Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md
section: Snowsight UI
---

# Snowflake Notebooks in Workspaces

> **Note:**
>
> Notebooks in Workspaces replaces the [Legacy Notebooks](../notebooks.md) experience. Starting in March 2026, Snowflake
> will roll out the ability to import legacy notebooks into Workspaces. For a comparison of the two experiences, see [Key differences between legacy and new notebooks](notebooks-in-workspaces-migrate.md).

## Overview

The new Snowflake Notebooks experience in Workspaces offers enhanced performance, improved developer productivity, and Jupyter compatibility.
The Workspaces environment supports easy file management, allowing you to iterate on individual notebooks and project files. Create folders,
upload files, and organize notebooks. Notebook files open in tabs in your workspace and are editable and executable.

This new offering includes:

* **Familiar Jupyter experience** - Supports a Jupyter notebook environment with direct access to governed Snowflake data.
* **Enhanced IDE features** - Editing tools, file management, and access to terminal for increased productivity.
* **Powerful for AI/ML** - Runs in a pre-built container environment optimized for scalable AI/ML development with fully-managed access to CPUs and GPUs.
* **Governed collaboration** - Allows multiple users to work in the same workspace with role-based access controls and version history through [Git-integrated workspaces](../workspaces-git.md) or [Shared workspaces](../workspaces-shared.md).
* **Schedule and orchestration** - Use the native scheduler or incorporate notebooks into orchestration scripts for production pipelines.

## Benefits for machine learning (ML) workflows

Notebooks in Workspaces provides two primary capabilities for ML workflows:

* **End-to-end workflow** - The platform enables users to consolidate their complete ML lifecycle, from source data access to model inference,
  within a single Jupyter notebook environment. This environment is integrated with the underlying data platform, allowing it to inherit existing
  governance and security controls for the data and code assets.
* **Scalable model development architecture** - The architecture supports the development of scalable models by providing open-source software
  (OSS) model development capabilities. Users can access distributed data loading and training across designated CPU or GPU compute pools. This
  design simplifies ML infrastructure management by abstracting the need for manual configuration of distributed compute resources.

For more information about Snowflake ML, see [Snowflake ML: End-to-End Machine Learning](../../../developer-guide/snowflake-ml/overview.md).

## Get started

> **Note:**
>
> These quickstarts are only shown as examples. Following along with the example may require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Snowflake does not guarantee the accuracy of these examples or
> cover them under any Service Level Agreement.

* Watch the [introduction video](https://www.youtube.com/watch?v=_kFhFIvnIrQ) for an overview of Notebooks in Workspaces.
* Follow the [quickstart](https://www.snowflake.com/en/developers/guides/accelerate-topic-modeling-with-gpus-in-snowflake-ml/) to learn how
  to accelerate topic modeling with scikit-learn and pandas in Snowflake ML.
* Explore the [First Machine Learning Project notebooks](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/tree/main/First_Machine_Learning_Project/Jupyter),
  a notebook series covering data preparation, exploratory data analysis, model training, and experiment tracking.
* Follow the [Build an End-to-End ML Workflow in Snowflake](https://www.snowflake.com/en/developers/guides/end-to-end-ml-workflow/)
  guide to walk through a complete machine learning workflow, from data preparation to model deployment.
* Follow the [Getting Started with Data Engineering using Snowflake Notebooks](https://www.snowflake.com/en/developers/guides/data-engineering-with-notebooks/)
  quickstart, with accompanying [code on GitHub](https://github.com/Snowflake-Labs/sfguide-data-engineering-with-notebooks), to learn how to build production data engineering pipelines using Notebooks in Workspaces.
* See an example of [Healthcare ML: Breast Cancer Classification with XGBoost](https://www.snowflake.com/en/developers/guides/healthcare-ml-breast-cancer-classification/)
  that demonstrates how to build a classification model in Snowflake.

---
title: Snowsight templates
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/snowsight-templates.md
section: Snowsight UI
---

# Snowsight templates

## Overview

[Snowsight templates](http://app.snowflake.com/templates) provide users with interactive walkthroughs for exploring Snowflake features and use cases. Templates are available
as executable worksheets, notebooks, or Streamlit apps, and come pre-configured with sample data and the required permissions.

Templates run in a dedicated `SNOWFLAKE_LEARNING` environment, which includes a pre-provisioned role (`SNOWFLAKE_LEARNING_ROLE`), an X-Small
compute warehouse (`SNOWFLAKE_LEARNING_WH`), and a database (`SNOWFLAKE_LEARNING_DB`). Costs associated with
the `SNOWFLAKE_LEARNING_WH` and `SNOWFLAKE_LEARNING_DB` are managed in the same way as any other object owned
by `ACCOUNTADMIN`. See [Monitor credit usage with budgets](../budgets.md) for details on monitoring and optimizing warehouse compute costs.

> **Note:**
>
> `SNOWFLAKE_LEARNING_WH` is owned by the `ACCOUNTADMIN` role. Standard usage costs apply.

Templates offer the following advantages:

* Safely try new features and use cases without impacting production data.
* Sample data is included to get up and running quickly.
* Concise, self-contained experiences that are typically completed in under five minutes.

Snowflake automatically provisions the `SNOWFLAKE_LEARNING` environment for both new and existing accounts as part of
[BCR-1992](../../release-notes/bcr-bundles/un-bundled/bcr-1992.md). No action is required to enable it.

If your organization prefers **not** to include this environment, an `ACCOUNTADMIN` can opt out by running:

```sqlexample
SELECT SYSTEM$DISABLE_SNOWFLAKE_LEARNING_ENVIRONMENT();
```

If [BCR-1992](../../release-notes/bcr-bundles/un-bundled/bcr-1992.md) is not enabled for your account, you can provision the
`SNOWFLAKE_LEARNING` environment manually using the following SQL:

```sqlexample
CREATE DATABASE SNOWFLAKE_LEARNING_DB;
CREATE ROLE SNOWFLAKE_LEARNING_ROLE;
GRANT ROLE SNOWFLAKE_LEARNING_ROLE TO ROLE PUBLIC;
CREATE WAREHOUSE SNOWFLAKE_LEARNING_WH
  COMMENT = 'Warehouse used for executing template and demo content'
  WAREHOUSE_SIZE = 'X-Small'
  AUTO_RESUME = true
  AUTO_SUSPEND = 300;
GRANT USAGE, MONITOR, OPERATE ON WAREHOUSE SNOWFLAKE_LEARNING_WH TO ROLE SNOWFLAKE_LEARNING_ROLE;
GRANT USAGE ON DATABASE SNOWFLAKE_LEARNING_DB TO ROLE SNOWFLAKE_LEARNING_ROLE;
GRANT CREATE SCHEMA ON DATABASE SNOWFLAKE_LEARNING_DB TO ROLE SNOWFLAKE_LEARNING_ROLE;
```

If the `SNOWFLAKE_LEARNING` environment has already been provisioned in your account, but you want to disable it and drop the objects,
a user with the `ACCOUNTADMIN` role can run the following script to disable and drop the learning environment:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$DISABLE_SNOWFLAKE_LEARNING_ENVIRONMENT();

-- DATABASE
SHOW DATABASES LIKE 'SNOWFLAKE_LEARNING_DB';
DROP DATABASE SNOWFLAKE_LEARNING_DB;

-- WAREHOUSE
SHOW WAREHOUSES LIKE 'SNOWFLAKE_LEARNING_WH';
DROP WAREHOUSE SNOWFLAKE_LEARNING_WH;

-- ROLE
SHOW ROLES LIKE 'SNOWFLAKE_LEARNING_ROLE';
DROP ROLE SNOWFLAKE_LEARNING_ROLE;
```

Get started with templates at <http://app.snowflake.com/templates>.

---
title: Sync notebooks with a Git repository
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-snowgit.md
section: Snowsight UI
---

# Sync notebooks with a Git repository

To use version control with your Snowflake Notebooks, you can sync your notebook development with a branch in a Git repository.

You must have already set up your Snowflake account to be connected to a Git repository and have created a branch in that repository to use
for your notebook development. See [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

## Create a notebook from a file in a Git repository

> **Note:**
>
> The file must be an `.ipynb` formatted file and it must use notebook format (nbformat) 4.0 or higher.

To create a Snowflake Notebook from a file in a Git repository, do the following:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Next to + Notebook, open the drop-down menu and select Create from repository.
4. For File location in repository, select the repository and branch in the repository that contain the notebook file, then select
   the specific `.ipynb` file. For details on connecting Snowflake to your Git repository, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).
5. For Notebook location, select a database and schema to contain the notebook. These cannot be changed after you create the notebook.
6. For Notebook warehouse, select a warehouse.
7. Select Create to create a Snowflake Notebook from the `.ipynb` file in your Git repository.

## Connect an existing notebook with a Git repository

To connect an existing Snowflake notebook to a Git repository, do the following:

> **Note:**
>
> You must use a role with the following privileges at a minimum:
>
> * OWNERSHIP or READ privilege on the Git repository.
> * USAGE privilege on the schema that contains the Git repository.

To learn how to connect to your Git repository, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

> For more details, see [Access control requirements](../../sql-reference/sql/show-git-repositories.md).

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks, and then open or create a notebook.
3. In the Files tab, next to the database object explorer, select Connect Git Repository.
4. For File location in repository, select the repository and branch in the repository with which you want to sync the notebook.
5. Select Select Folder.
6. When you are prompted to commit and push your notebook to the Git repository, complete the Push to Git steps outlined in
   Push changes to a branch in a Git repository.

   When your notebook is successfully pushed to the Git repository, a new folder is created for your notebook in the selected location in the
   Git repository branch, and all the files and folders in that location are synced back to your notebook. You can select the branch
   name and open the repository details in Snowflake or on Git.

## Push changes to a branch in a Git repository

If a Snowflake Notebook is connected to a branch in a Git repository, after you make changes to the notebook you can push
your changes to the branch.

You must use a role with the OWNERSHIP or WRITE privilege on the Git repository to push your changes.
For more details, see [Access control requirements](../../sql-reference/sql/alter-git-repository.md).

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks, and then open a notebook.
3. Make any relevant changes to the notebook.
4. Select Push to Git.
5. In the Push to Git dialog that appears, you can review the username and email address that are used to commit the changes
   to the specified branch and repository. If you need to update the username and email address, expand the Credentials section and
   update the Author name and Author email.
6. For Commit message, enter a message to include with your commit.
7. Expand the Credentials section to configure credentials. Enter your personal access token for the Git repository in the
   Personal access token field. This access token comes from the remote Git provider, such as GitHub.

   * This token is required to authenticate to the Git repository.
   * The token must have read and write access to the content of the repository for the commit to work.
   * Once entered, the token will be saved for future commits. You can update it during any future commits.
8. Select Push.

A confirmation message states that your changes were pushed successfully to your branch.

## Sync a notebook with a remote branch in a Git repository

After you connect your notebook to a branch in a Git repository, you can sync any changes in the remote branch with your Snowflake Notebook.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks, and then open or create a notebook.
3. In the database object explorer, in the Files tab, select Pull.

Snowflake fetches any changes present on the remote repository branch and merges the notebook contents with those changes.

### Merge conflicts

Snowflake attempts to resolve merge conflicts that occur during a sync. If there are merge conflicts that Snowflake isn’t able to
resolve, you will get a message to either discard your changes or commit them to a new branch. When they are committed to a new branch, use
your Git provider to manually merge your changes from the new branch to the original branch. Then you should pull the latest updates into
your Snowflake notebook.

---
title: Troubleshoot errors in Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-troubleshoot.md
section: Snowsight UI
---

# Troubleshoot errors in Snowflake Notebooks

The following scenarios can help you troubleshoot issues that can occur when using Snowflake Notebooks.

## Total number of notebooks exceeds the limit in Snowsight

The following error occurs when the total number of notebooks in your account exceeds 6,000 and you refresh the Notebooks list:

```output
Result size for streamlit list exceeded the limit. Streamlit list was truncated.
```

Users can still create new notebooks; however, Snowflake recommends that you remove notebooks that are no longer being used by the account.

## Notebooks (Warehouse Runtime) error when updating a package

Snowflake has deprecated the older `snowflake-ml` package, which is no longer supported. It has been removed from the package selector and is
not available in the Snowflake Anaconda channel. If you are using `snowflake-ml` and try to add, remove, or update packages in your
notebooks, those notebooks will fail because `snowflake-ml` is no longer accessible.

To avoid issues, switch to `snowflake-ml-python`, which is the correct package for Snowflake ML.

## Plotly error

```output
st.plotly_chart(fig, render_mode='svg')

WebGL is not supported by your browser - visit https://get.webgl.org for more info.
```

Plotly will switch to webgl if there are more than 1,000 datapoints.

## AttributeError: `NoneType`

The following error occurs when a cell is renamed to the same name as an existing variable in the notebook:

```output
AttributeError: ‘NoneType’ object has no attribute ‘sql’
```

For example, you have the following in a Python cell called `cell1`:

```python
session = get_active_session() #establishing a Snowpark session
```

If you then rename `cell2` to “session”, and reference “session” in `cell3`, Notebooks attempts to reference “session” (the cell
name) and not the Snowpark session, causing an error.

## Early disconnection

The notebook session runs as a stored procedure. The timeout is 30 minutes on Warehouse Runtime and 60 minutes on Container Runtime. If your
notebook unexpectedly disconnects before the timeout, your ACCOUNTADMIN or the warehouse owner might have set the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md)
parameter to a particular value (for example, 5 minutes), which limits how long all statements can run on the warehouse, including notebook sessions.
This parameter is set at the warehouse or account level, and when it is set for both a warehouse and a session, the lowest non-zero value is enforced.

To allow the notebook to run longer, you can use the default warehouse [SYSTEM$STREAMLIT_NOTEBOOK$WAREHOUSE](../warehouses-overview.md) or
change the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) parameter to a longer duration.

For details on the idle time setting, see [Idle time and reconnection](notebooks-setup.md).

## Fail to reconnect

If you do not have cookies enabled on your browser, you cannot automatically reconnect to the notebook session while it should still be
active (before timing out due to inactivity). When you reopen the notebook, an error message displays:

```output
Notebook connection lost and cannot reconnect. Restart or end session.
```

Restarting the session will end the current [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query and start a new session. Ending the session
will end the current [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query.

If you do not take either action, the current [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) query will continue running on the warehouse,
shown in Query History.

## Unable to connect due to firewall

The following popup occurs when you try to start your notebook:

```output
Something went wrong. Unable to connect. A firewall or ad blocker might be preventing you from connecting.
```

Ensure that `*.snowflake.app` and `*.snowflake.com` are on the allowlist in your network (including content filtering systems), and
can connect to Snowflake. For Streamlit apps using container runtimes, also add `*.snowflakecomputing.app` to the allowlist.
When these domains are on the allowlist, your apps can communicate with Snowflake servers without any restrictions.
However, in some cases adding these domains may not be sufficient due to network policies blocking subpaths under them. If this occurs,
contact your network administrator.

In addition, to prevent any issues connecting to the Snowflake backend, ensure that WebSockets are not blocked in your network configuration.

## Missing packages

The following message occurs in a cell output if you’re trying to use a package that is not installed in your notebook environment:

```output
ModuleNotFoundError: Line 2: Module Not Found: snowflake.core. To import packages from Anaconda, install them first using the package
selector at the top of the page.
```

Import the necessary package by following the instructions on the [Import Python packages to use in notebooks](notebooks-import-packages.md) page.

### Missing package from existing notebook

New versions of notebooks are continually being released and notebooks are auto-upgraded to the latest version.
Sometimes, when upgrading an old notebook, the packages in the notebook environment aren’t compatible with the upgrade.
This could possibly cause the notebook to fail to start.

The following is an example of an error message when the `Libpython` package is missing:

```output
SnowflakeInternalException{signature=std::vector<sf::RuntimePathLinkage> sf::{anonymous}::buildRuntimeFileSet(const sf::UdfRuntime&, std::string_view, const std::vector<sf::udf::ThirdPartyLibrariesInfo>&, bool):"libpython_missing", internalMsg=[XP_WORKER_FAILURE: Unexpected error signaled by function 'std::vector<sf::RuntimePathLinkage> sf::{anonymous}::buildRuntimeFileSet(const sf::UdfRuntime&, std::string_view, const std::vector<sf::udf::ThirdPartyLibrariesInfo>&, bool)'
Assert "libpython_missing"[{"function": "std::vector<sf::RuntimePathLinkage> sf::{anonymous}::buildRuntimeFileSet(const sf::UdfRuntime&, std::string_view, const std::vector<sf::udf::ThirdPartyLibrariesInfo>&, bool)", "line": 1307, "stack frame ptr": "0xf2ff65553120",  "libPythonOnHost": "/opt/sfc/deployments/prod1/ExecPlatform/cache/directory_cache/server_2921757878/v3/python_udf_libs/.data/4e8f2a35e2a60eb4cce3538d6f794bd7881d238d64b1b3e28c72c0f3d58843f0/lib/libpython3.9.so.1.0"}]], userMsg=Processing aborted due to error 300010:791225565; incident 9770775., reporter=unknown, dumpFile= file://, isAborting=true, isVerbose=false}
```

To resolve this error, try the following steps:

* Refresh the webpage and start the notebook again.
* If the issue persists, open the package selector and check whether all installed packages are valid. In the drop-down for each package, you
  can see the available versions. Selecting the latest version of the package usually clears the error.

## Missing notebook

Notebooks are schema-level objects, meaning they are stored within a specific schema in a database. If a schema is dropped (deleted), all
objects contained within it (including notebooks) are also dropped. If you do not see a notebook that previously existed, it’s possible that
the schema it belonged to has been dropped. In this case, the notebook is permanently deleted and cannot be recovered.

To help prevent accidental loss of notebooks:

* Review the objects contained in a schema before dropping it.
* Limit schema drop privileges to avoid accidental or unauthorized deletions.
* Consider exporting notebook contents in version-controlled scripts outside of the notebook.

If your notebook is missing and the schema still exists, ensure that your current role has the necessary privileges to view the schema and
its objects.

## Read-only file system issue

Some Python libraries download or cache data to a local user directory. However, the default user directory `/home/udf` is read-only.
To work around this, set the path as `/tmp` which is a writable location.
Note that the environment variable used to set the write directory may vary depending on which library you are using.
The following is a list of known libraries that present this issue:

* matplotlib
* HuggingFace
* catboost

### matplotlib example

You might see this warning when using matplotlib:

```output
Matplotlib created a temporary cache directory at /tmp/matplotlib-2fk8582w because the default path (/home/udf/.config/matplotlib) is
not a writable directory; it is highly recommended to set the MPLCONFIGDIR environment variable to a writable directory, in particular
to speed up the import of Matplotlib and to better support multiprocessing.
```

Resolve this warning using this code, which sets the `MPLCONFIGDIR` variable to `/tmp/`:

```python
import os
os.environ["MPLCONFIGDIR"] = '/tmp/'
import matplotlib.pyplot as plt
```

### Huggingface example

You might see this warning when using Huggingface:

```output
Readonly file system: `/home/udf/.cache`
```

The following code sets the `HF_HOME` and `SENTENCE_TRANSFORMERS_HOME` variables to `/tmp/` to get rid of this error:

```python
import os
os.environ['HF_HOME'] = '/tmp'
os.environ['SENTENCE_TRANSFORMERS_HOME'] = '/tmp'

from sentence_transformers import SentenceTransformer
model = SentenceTransformer("Snowflake/snowflake-arctic-embed-xs")
```

## Output message is too large when using `df.collect()`

The following message is displayed in the cell output when you run `df.collect()`:

```output
MessageSizeError: Data of size 522.0 MB exceeds the message size limit of 200.0 MB.
This is often caused by a large chart or dataframe. Please decrease the amount of data sent to the browser,
or increase the limit by setting the config option server.maxMessageSize.
Click here to learn more about config options.
Note that increasing the limit may lead to long loading times and large memory consumption of the client's browser and the Streamlit server.
```

Snowflake Notebooks automatically truncates results in the cell output for large datasets in following cases:

* All SQL cell results.
* Python cell results if it’s a `snowpark.Dataframe`.

The issue with the above cell is that `df.collect()` returns a `List` instead of `snowpark.Dataframe`. Lists are not automatically
truncated. To get around this issue, directly output the results of the DataFrame.

```python
df
```

## Notebook crashes when using `df.to_pandas()` on Snowpark DataFrames

When running `df.to_pandas()`, all the data is loaded into memory and may result in the Notebook session terminating if the data size
exceeds the associated Notebook warehouse’s memory limit.

### Example 1: Exporting a Snowpark table to pandas DataFrame

```python
data = session.table("BIG_TABLE")
df = data.to_pandas() # This may lead to memory error
```

#### Workaround for example 1

The following example shows how you can rewrite the code to read in the table with Snowpark pandas.

```python
# Import Snowpark pandas
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
# Create a Snowpark pandas DataFrame from BIG_TABLE
df = pd.read_snowflake("BIG_TABLE")
# Keep working with your data using the pandas API
df.dropna()
```

### Example 2: Referencing a SQL cell containing large results

If you have the following code in a SQL cell called `cell1`, the output result is 500M rows.

```SQL
SELECT * from BIG_TABLE
```

Then, when you fetch the results into a pandas DataFrame, the notebook crashes because the data is too large to fit in memory:

```SQL
df = cell1.to_pandas() # This may lead to memory error
```

In general, for large datasets, Snowflake recommends that you avoid using `df.to_pandas()`. Instead, to operate on your data with pandas, use
the Snowpark pandas API and a [Snowpark-optimized warehouse](../warehouses-snowpark-optimized.md). The
[Snowpark pandas API](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/index) lets you run your
pandas code directly on your data in Snowflake with the query performed in SQL. This allows you to run pandas code on data that does not
fit in the notebook’s memory.

#### Workaround for example 2

In the second cell referencing example above, you can convert your SQL cell result to a Snowpark DataFrame first. Then, you can convert it
into Snowpark pandas.

```SQL
SELECT * from BIG_TABLE
snowpark_df = cell1.to_df()
df = snowpark_df.to_snowpark_pandas()
# Keep working with your data using the Snowpark pandas API
```

For more details, see [pandas on Snowflake in notebooks](notebooks-use-with-snowflake.md).

## Unable to connect due to VPN split tunneling

If your VPN is configured to use split tunneling, you must add both `*.snowflake.com` and `*.snowflake.app` to your network
policy allowlist.

## Notebook does not exist error

The following message is displayed in the cell output when you attempt to run a notebook whose name contains special characters:

```output
Notebook <name> does not exist or not authorized
```

Notebook names are Snowflake identifiers. Any notebook name with special characters such as dots and spaces must be enclosed in double
quotes to meet identifier rules.

## Package version conflict

If you run a non-interactive notebook that loads a package version from the base image, and then attempt to install a new version, the new
version will not be loaded. Ensure that the package version matches the base image. In an interactive notebook, you’ll be prompted to restart the
notebook to use the new version.

## Scheduled SPCS notebook failing to run

Before running a scheduled notebook on Container Runtime, you must create an image repository. If an image repository is missing, the following error is displayed:

```output
Failed to retrieve image.
```

For details on how to create an image repository, see [CREATE IMAGE REPOSITORY](../../sql-reference/sql/create-image-repository.md).

## Log messages failing to appear in output

If log messages aren’t appearing in the notebook output, ensure that logs are directed to `stdout`. Here is an example of how to configure
logging correctly:

```python
import logging
import sys

# Create a logger
logger = logging.getLogger()

# Set the log level
logger.setLevel(logging.DEBUG)

# Create a stream handler that writes to sys.stdout
stdout_handler = logging.StreamHandler(sys.stdout)

# Set the log format (optional)
formatter = logging.Formatter('%(levelname)s: %(message)s')
stdout_handler.setFormatter(formatter)

# Add the handler to the logger
logger.addHandler(stdout_handler)

# Test the logging output
logger.warning("This is a warning message.")
```

To highlight errors, you can send them to `stderr` using a similar approach. Alternatively, if you’re working with Streamlit in Snowflake Notebooks, you can use its built-in functions for clearer formatting:

```python
import streamlit as st st.warning("This is a warning message.")
st.write("This is a normal message.")
st.error("This is an error message.")
```

## Unable to execute notebook

When you create a notebook using [CREATE NOTEBOOK](../../sql-reference/sql/create-notebook.md), the notebook does not automatically have a live version in the
version stage. If you attempt to run a notebook using [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) without setting a live version first, or if
you create a notebook and it gets dropped, the following error occurs:

```output
Live version is not found.
```

To resolve this error, use the following command to set the initial live version:

```sqlexample
ALTER NOTEBOOK <name> ADD LIVE VERSION FROM LAST;
```

---
title: Visualize data in Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-visualize-data.md
section: Snowsight UI
---

# Visualize data in Snowflake Notebooks

In Snowflake Notebooks, you can use your favorite Python visualization libraries, such as matplotlib and plotly, to develop your visualizations.

This topic shows how to visualize data in your notebooks using the following libraries:

* Altair
* Matplotlib
* Plotly
* Seaborn
* Streamlit

## The dataset

The examples in this topic use the following toy dataset that is based on the
[Palmer’s Penguin dataset](https://allisonhorst.github.io/palmerpenguins/articles/intro.html).

| species | measurement | value |
| --- | --- | --- |
| adeli | bill_length | 37.3 |
| adeli | flipper_length | 187.1 |
| adeli | bill_depth | 17.7 |
| chinstrap | bill_length | 46.6 |
| chinstrap | flipper_length | 191.7 |
| chinstrap | bill_depth | 17.6 |
| gentoo | bill_length | 45.5 |
| gentoo | flipper_length | 212.7 |
| gentoo | bill_depth | 14.2 |

You can create this dataset in your notebook with the following code:

```python
species = ["adelie"] * 3 + ["chinstrap"] * 3 + ["gentoo"] * 3
measurements = ["bill_length", "flipper_length", "bill_depth"] * 3
values = [37.3, 187.1, 17.7, 46.6, 191.7, 17.6, 45.5, 212.7, 14.2]
df = pd.DataFrame({"species": species,"measurement": measurements,"value": values})
df
```

### Visualize results with Altair

Altair is imported by default on Snowflake Notebooks as part of Streamlit. Snowflake Notebooks currently support Altair version 4.0. For details on available
visualization types when using Altair, see [Vega-Altair: Declarative Visualization in Python](https://altair-viz.github.io/index.html).

The following code plots a stacked bar chart of all the measurements in a dataframe named `df` that contains the toy dataset:

```python
import altair as alt
alt.Chart(df).mark_bar().encode(
    x= alt.X("measurement", axis = alt.Axis(labelAngle=0)),
    y="value",
    color="species"
)
```

After you run the cell, the following visualization appears:

### Visualize results with matplotlib

To use matplotlib, install the matplotlib library for your notebook:

1. From the notebook, select Packages.
2. Locate the matplotlib library and select the library to install it.

The following code plots the toy dataset, `df`, using matplotlib:

```python
import matplotlib.pyplot as plt

pivot_df = pd.pivot_table(data=df, index=['measurement'], columns=['species'], values='value')

import matplotlib.pyplot as plt
ax = pivot_df.plot.bar(stacked=True)
ax.set_xticklabels(list(pivot_df.index), rotation=0)
```

After you run the cell, the following visualization appears:

For more details on using the `st.pyplot` chart element, see
[st.pyplot](https://docs.streamlit.io/library/api-reference/charts/st.pyplot).

### Visualize results with plotly

To use plotly, install the plotly library for your notebook:

1. From the notebook, select Packages.
2. Locate the plotly library and select the library to install it.

The following code plots a bar chart of the penguin measurements from the toy dataset, `df`:

```python
import plotly.express as px
px.bar(df, x='measurement', y='value', color='species')
```

After you run the cell, the following visualization appears:

### Visualize results with seaborn

To use seaborn, you must install the seaborn library for your notebook:

1. From the notebook, select Packages.
2. Locate the seaborn library and select the library to install it.

The following code plots a bar chart of the penguin measurements from the toy dataset, `df`:

```python
import seaborn as sns

sns.barplot(
    data=df,
    x="measurement", hue="species", y="value",
)
```

After you run the cells, the following visualization appears:

For more examples of seaborn visualizations, see the seaborn [Example gallery](https://seaborn.pydata.org/examples/index.html).

### Visualize results using Streamlit

Streamlit is imported by default in Snowflake Notebooks. You can use chart elements supported by Streamlit version 1.39.0 to create a line
chart, bar chart, area chart, or a map with points on it. See [Chart elements](https://docs.streamlit.io/library/api-reference/charts) .

> **Note:**
>
> Some Streamlit chart elements are not supported in Snowflake or might be subject to additional terms. See [Streamlit support in notebooks](notebooks-use-with-snowflake.md).

To visualize the toy dataset, `df`, in a bar chart, you can use the following Python code:

```python
import streamlit as st

st.bar_chart(df, x='measurement', y='value', color='species')
```

After you run both cells, the following visualization appears:

To learn more about how you can build interactive data apps with Streamlit, see [Streamlit in notebooks](notebooks-use-with-snowflake.md).

---
title: Work with files in notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-work-with-files.md
section: Snowsight UI
---

# Work with files in notebooks

This topic describes how you can upload and access files from your Snowflake Notebooks.

## Files in notebook environments

When you create a new notebook, the main notebook file is created. By default, the notebook file is assigned the same name as the notebook.

Files are stored in an internal stage that represents your notebook environment, and they persist between sessions. You can view them in the
Files tab on the left side of the notebook. To display a preview of the contents of the file, select the file name.

## Temporary filesystem in a notebook environment

Your notebook has a temporary filesystem that is available during an active session. Any files created during the session are saved in
this temporary stage. Files on the temporary stage will not be available after you end the current notebook session.

The following code creates a file called `myfile.txt` and writes some text in it:

```python
with open("myfile.txt",'w') as f:
    f.write("abc")
f.close()
```

You can access this file during the same session it was created.

Use the `listdir()` method to list the files in the temporary stage:

```python
import os
os.listdir()
```

Now disconnect from your current session and reconnect. Try the `listdir()` method again and `myfile.txt` file will not be listed.

## Persist files across notebook sessions

To persist your files across notebook sessions:

* Store files in a Snowflake stage
* Use Snowsight to upload files into a notebook
* Sync with files from Git

### Store files in a Snowflake stage

If you want your files to persist between sessions and reference these files across different notebooks, use a Snowflake stage to store them.
You can upload files from your local computer onto the stage and use file operations from Snowpark API to access them from your notebook.

#### Example

This example shows how to create a stage and store and retrieve files from it from your notebook.

To create a stage called `permanent_stage`, run the follow code in a SQL cell:

```sqlexample
CREATE OR REPLACE STAGE permanent_stage;
```

Next, to create a file called `myfile.txt` with some text in it, run the following code in a Python cell:

```python
with open("myfile.txt",'w') as f:
  f.write("abc")
f.close()
```

Note that at this point, `myfile.txt` is stored in the notebook’s temporary filesystem. To move this to the stage, you can use Snowpark
API to upload the `myfile.txt` to your `permanent_stage`:

```python
from snowflake.snowpark.context import get_active_session
session = get_active_session()

put_result = session.file.put("myfile.txt","@PERMANENT_STAGE", auto_compress= False)
put_result[0].status
```

If you disconnect your session and reconnect, you can run the following code in a SQL cell to verify whether the file still appears:

```sqlexample
LS @permanent_stage;
```

### Use Snowsight to upload files into a notebook

You can upload files from your local computer to be used in your Snowflake notebook.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. In the Files tab, next to the database object explorer, select the  icon to select files to upload.
4. Browse and select or drag and drop files into the dialog.
5. Select Upload to upload your file.

> **Note:**
>
> If you’re uploading files to use as Python packages in your notebook, consider that only `.py` and `.zip` files are supported when running on
> Warehouse Runtime. For Container Runtime, `.whl` (wheel) files are also supported. If you’re importing a package as a `.zip` file, ensure there
> is an `__init__.py` file at the root to indicate that it’s a Python package.

Uploaded files are saved to the notebook’s internal stage and persisted between sessions. You can reference uploaded files using their
local paths from the notebook file. See Referencing files in notebooks.

> **Note:**
>
> Only internal stages are supported. External stages (for example, Amazon S3, Google Cloud Storage, or Azure Blob Storage) are not supported.

### Use other editing environments to upload or download files

In addition to using the file browser in Snowsight, you can also work with files in the Notebooks stage using a local Snowpark
Python session, Snowflake CLI, or SnowSQL.

#### Local Snowpark Python session

You can upload and download files from your local computer into a notebook stage using the `session.file.put` and `session.file.get`
methods in Snowpark Python. This requires starting a Snowpark session from your local editing environment (not within Snowsight). For example:

```python
# Upload a local file to the notebook stage
res = session.file.put("aaa.csv", """snow://notebook/DEMO_DB.PUBLIC."JSMITH dbapi test"/versions/live""", overwrite=True)
# Download a file from the notebook stage to your local computer
res = session.file.get("""snow://notebook/DEMO_DB.PUBLIC."JSMITH dbapi test"/versions/live/aaa.json""", "aaa.csv")
```

> **Note:**
>
> This method does not work from the SnowSQL CLI. You must run the method from a Python environment with an active Snowpark session.

#### SnowSQL commands

You can also upload or download files from the Notebooks stage using SnowSQL commands directly:

```sqlexample
-- Download a file from the notebook stage to your local computer
GET 'snow://notebook/SNOWPUBLIC.NOTEBOOKS."ADMIN_SPCS"/versions/live/ADMIN_SPCS.ipynb' 'file://download';

-- Upload a file from your local computer to the notebook stage
PUT 'file://test.json' 'snow://notebook/SNOWPUBLIC.NOTEBOOKS.ADMIN_SPCS/versions/live' overwrite = TRUE;
```

Before running these commands, make sure you have set the appropriate database, schema, and warehouse in your SnowSQL session.

### Sync with files from Git

If your notebook is connected to Git, then any files in the same Git folder as your notebook will be displayed in the Files tab.

For more information on working with files in Git, see [Sync notebooks with a Git repository](notebooks-snowgit.md).

## Referencing files in notebooks

Each file in the notebook environment has a stage path and a local path. You can use these paths to reference the file in the notebook.

### Referencing a local path with Python

In general, Python libraries use the local path to the file as reference to the file. For example, the following code accesses the `data.csv`
file that was uploaded to the same directory as the notebook that this code is running in:

```python
import pandas as pd
df = pd.read_csv("data.csv")
```

### Referencing the stage path with SQL

With SQL, Snowflake references files based on the stage path. The stage path for a file in your notebook is based on the following format:

```none
snow://notebook/<DATABASE>.<SCHEMA>.<NOTEBOOK_NAME>/versions/live/<file_name>
```

To find the stage path associated with the files in your notebook stage using the Copy path menu:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. In the Files tab, next to the database object explorer, select the  icon next to the file you
   want to get the path for.
4. Select Copy path. This copies the path of the file to your clipboard.

Then you can use the following SQL statement to list the stage file details:

```sqlexample
LIST 'snow://notebook/<DATABASE>.<SCHEMA>.<NOTEBOOK_NAME>/versions/live/data.csv'
```

## Limitations and considerations

* Load files before starting your notebook session. If you load files after a session has started, you have to restart your session to
  access the files.
* No restrictions on file types to upload.
* The size limit per file is 250 MB or less.
* Files that are written to a local path in the notebook are not displayed in the Files tab. However, you can still use the file in
  your notebook code.

  For example, if you create a file, `data.json`, you can access it as shown in the following code even though it won’t be visible
  in the Files UI:

  ```python
  # Generate sample JSON file
  with open("data.json", "w") as f:
      f.write('{"fruit":"apple", "size":3.4, "weight":1.4}\n{"fruit":"orange", "size":5.4, "weight":3.2}')
  # Read from local JSON file (File doesn't show in UI)
  df = pd.read_json("data.json", lines=True)
  df
  ```
* Opening another `.ipynb` file that is not the main notebook file is not supported.

## Additional resources

> * [How to work with files in Snowflake Notebooks](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Working%20with%20Files/Working%20with%20Files.ipynb)
> * [Navigating and Browsing Files in Snowflake Notebooks](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/Navigating%20and%20Browsing%20Files/Navigating%20and%20Browsing%20Files.ipynb)

---
title: Working with Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-working.md
section: Snowsight UI
---

# Working with Legacy Snowflake Notebooks

Snowflake Notebooks provide a powerful and flexible environment for developing and running code, exploring data, and integrating with external tools.
After creating your first notebook in Snowsight, you can do any of the following tasks to dive deeper:

* [Import packages](notebooks-import-packages.md) to leverage prebuilt libraries for data analysis, visualization, and machine learning.
* [Manage session context](notebooks-sessions.md) to streamline development, reduce errors, and help maintain security.
* [Work with files](notebooks-work-with-files.md).
* [Use rich visualization options](notebooks-visualize-data.md) to analyze your data more effectively.
* [Schedule notebook runs](notebooks-schedule.md).
* [Set up external access](notebooks-external-access.md) to connect with other systems.
* [Integrate with Git](notebooks-snowgit.md) for collaboration and version control.
* Explore options to [save and share](notebooks-save-share.md) your work.

If you run into issues, see [Troubleshoot errors in Snowflake Notebooks](notebooks-troubleshoot.md) to help resolve them and optimize your workflow.

---
title: Working with the file system
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-file-system.md
section: Snowsight UI
---

# Working with the file system

## The Workspaces file system

The files shown in the left-hand pane of the Workspaces environment represent the contents of your Workspace directory, which is the notebook’s
working directory.

Running `ls` in the Workspace directory lists all files and folders in the directory, including notebooks and any other project assets.

## Referencing files

You can reference files in the current Workspace directory by relative path. For example, you want to read in a notebook and a sample dataset (in CSV) that are in your workspace:

```text
ml-intent-prediction/
├── data/
│   └── sample_data.csv
├── notebooks/
│   └── analysis.ipynb
└── utilities.py
```

In a Python cell, run the following code:

```python
import pandas as pd

df = pd.read_csv("../data/sample_data.csv")
df.head()
```

## Limitations

Writing files to the Workspace directory from code or the terminal is not supported. While file writes may appear to work during a session,
they are not guaranteed to succeed and may fail in future releases.

File persistence in the Workspace directory has the following limitations:

* **Files are read-only:** Files under `/workspace/<workspace_hash>` are read-only and cannot be updated in code while executing the notebook.
* **File writes from code or terminal are not supported:** Do not write files to the Workspace directory programmatically. Use Snowflake stages
  instead for persisting files (see Persisting files).
* **Only files uploaded or created in Snowsight persist:** Only files that are uploaded or created through Snowsight persist across sessions.
* **Session-only visibility:** Any files created from code or the terminal during a session are removed when the notebook service is suspended.
  These files do not appear in the left-hand pane.

## The `/tmp` directory of the container

The `/tmp` directory is also read/write and is suitable for scratch work or temporary data that does not need to persist.

An example of writing a file to `/tmp`:

```python
file_path = "/tmp/sample.txt"

with open(file_path, "w") as f:
    f.write("Hello from Python!\\nThis is a sample file saved in /tmp.")

print(f"File written to {file_path}")
```

To list files in the `/tmp` directory, run the following:

```python
%%bash
cd /tmp
ls
```

## Persisting files

To store files for later use, write them to a Snowflake stage with write access using Snowpark file operation APIs.

To learn more about required stage privileges, see [write access](../../security-access-control-privileges.md). For Snowpark file operations, see
[Snowpark file operation APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.6.1/api/snowflake.snowpark.FileOperation#snowflake.snowpark.FileOperation).

---
title: Working with workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces-working.md
section: Snowsight UI
---

# Working with workspaces

> **Important:**
>
> Starting in September 2025, Snowflake is gradually upgrading accounts from Worksheets to Workspaces. Workspaces will become the default
> SQL editor. For more information, see [Defaulting accounts from Worksheets to Workspaces](../../release-notes/bcr-bundles/un-bundled/bcr-2117.md).

## Create and work with files and folders

In a workspace you can use a familiar IDE and source control conventions to author, organize, and run code.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. Select the + next to the appropriate folder. If you’re using Workspaces for the first time, select + Add New.
4. Select from the following options to create a new file or folder or to upload an existing file or folder:

   * SQL File: Creates a new, blank SQL file in the editor as a tab in the editor. By default, `.sql` is appended to unnamed files. The editor
     recognizes it as a SQL file and enables syntax highlighting and autocomplete.
   * File: Creates a new file. Name the file and its extension. If the extension is recognized by the editor (for example, Java, JavaScript, or
     Scala), code highlighting and autocomplete are enabled.
   * Folder: Creates a new, empty folder inside the workspace.
   * Upload Files: Upload one or more files to any location in your workspace. The editor uses the file extension and applies the appropriate icon,
     behavior, and syntax highlighting to the file when it’s opened. For example, `.sql` files show SQL-specific features.
   * Upload Folder: Select one or more files or folders to add to the selected workspace.

### Manage files

You can rename, delete, move, and organize your workspaces, files, and folders.

To rename or delete a workspace, file, or folder, follow these steps:

1. Hover over the target and select the vertical ellipsis  (more actions).
2. Select Rename or Delete. If you choose to delete, you are prompted to confirm.

* To create a folder in a workspace, select the + next to the workspace or an existing folder.
* To organize files and folders, drag any file or folder into a different location in the same workspace. You can also drag a worksheet into a workspace.

### Format SQL code

Workspaces include a built-in functionality to format and standardize SQL code for improved readability and maintenance.

1. In the Workspaces editor, select the horizontal ellipsis.
2. Select Format SQL or use the keyboard shortcut `command` + `shift` + `O`
   (Windows: `CTRL` + `Alt` + `O`).

### Organize sections of code

Use code folding to collapse and expand large blocks of code, allowing you to focus on specific sections and improve overall code navigation.

1. In the Workspaces editor, locate the code section to collapse.
2. Hover the mouse to the right of the line numbers. A code folding icon () appears at the fold line.
3. Toggle the icon to fold or unfold the section of code.

### View multiple files or results in one layout

Managing multiple files with tabs and split panes offers several advantages:

* Compare code or results side by side: Quickly reference one worksheet query while working on another.
* Multitask more efficiently: View different cells, outputs, or files at once with less switching.

To adjust your Workspaces layout, select the vertical ellipsis () in the Workspaces pane and choose the appropriate option:

1. Split right
2. Split down
3. Close others

## Exploring query results

When you run a query in Workspaces, you can use interactive features to filter, analyze, and explore your results without writing
additional SQL. These features help you quickly understand your data and identify patterns.

> **Note:**
>
> These interactive result features are available in Workspaces in different locations than in the legacy Worksheets interface.

### Use interactive column statistics

Each column in your query results includes interactive visual statistics (mini graphs or histograms) that help you understand data
distribution and quality. You can click these statistics to open a detailed panel and create filters.

**To view column statistics:**

1. Run a query in a Workspaces SQL file.
2. In the results table, show the column statistics by doing one of the following:

   * Click Show column stats in the top-left corner of the table (next to the column headers).
   * Click the ellipsis button in any column header and select Show column stats.

   Mini graphs (histograms or distribution charts) appear in each column header showing the data distribution.
3. Click a histogram to view sum and average values for that entire column in the bottom-right of the table.

   Alternatively, you can select a range of cells in the results table to view statistics in the bottom-right of the table. For numeric columns,
   sum and average values are displayed. For non-numeric columns, the count is displayed.

**To filter using column statistics:**

1. Click Show column stats in the top-left corner of the results table, or click the ellipsis button in any column header and select Show column stats.
2. Click the histogram for the column you want to filter by. A popover displays detailed statistics for that column, including:

   * **Sum and average values** for numeric columns
   * **Distribution charts** showing value frequency
   * **Data quality metrics** such as null and filled percentages
3. In the popover, select the values or ranges you want to filter by.
4. Select Apply to apply the filter to your results.

This interactive filtering helps you explore your data visually without writing WHERE clauses or other SQL filter logic.

### Inspect cell values

The cell inspector provides detailed information about individual cells or selections in your query results.

To inspect a single cell:

1. In the results table, double-click any cell to open the Inspector Panel.
2. Review the detailed value, including formatting and data type information.

To view aggregate statistics for multiple cells:

* In the results table, select multiple cells by clicking and dragging across rows and columns.

  A statistics bar appears at the bottom showing:

  > + **Sum** of numeric values
  > + **Average** of numeric values
  > + **Count** of selected cells
  > + **Min and max** values in the selection

This feature is useful for quick calculations and data exploration without creating new queries.

## Keyboard shortcuts

Worksheets provide keyboard shortcuts to help you quickly navigate, customize your view, and edit queries. The following table identifies
commonly used keyboard shortcuts:

| Task | MacOS shortcut | Windows shortcut |
| --- | --- | --- |
| Run selected | `command` + `return` | `CTRL` + `Enter` |
| Run all | `command` + `shift` + `return` | `CTRL` + `Shift` + `Enter` |
| Format SQL file | `command` + `shift` + `O` | `CTRL` + `Alt` + `O` |
| Split pane horizontally | `control` + `\` | `CTRL` + `\` |
| Split pane vertically | `control` + `shift` + `\` | `CTRL` + `Shift` + `\` |
| Close focused tab | `control` + `W` | `CTRL` + `Q` |
| Copy selected file | `command` + `C` | `CTRL` + `C` |
| Cut selected file | `command` + `X` | `CTRL` + `X` |
| Paste file in selected location | `command` + `V` | `CTRL` + `V` |
| Open query results pane | `control` + `option` + `↑` | `CTRL` + `Alt` + `↑` |
| Close query results pane | `control` + `option` + `↓` | `CTRL` + `Alt` + `↓` |
| Open inline Copilot | `command` + `I` | `CTRL` + `I` |
| Comment out code | `command` + `/` | `CTRL` + `/` |
| Go to top of file | `command` + `home` or `command` + `↑` | `CTRL` + `home` or `CTRL` + `↑` |
| Go to bottom of file | `command` + `end` or `command` + `↓` | `CTRL` + `end` or `CTRL` + `↓` |

### Recover a workspace from a dropped user

Even when a user is dropped, their personal database (PDB) and all files within their workspaces are retained. The PDB is then renamed to
`DROPPED_USER$<dropped_user_name>_<timestamp>`.

> **Note:**
>
> The recovery of a workspace is not limited to the individual who ran the DROP command. Any user with the same role can recover the
> workspace, as the PDB retains its ownership under the role that initiated the command.

To recover a workspace from a dropped user’s PDB, follow these steps:

1. Find the dropped user’s PDB. Use the [SHOW DATABASES](../../sql-reference/sql/show-databases.md) command with a LIKE function to locate the specific database:

   ```sqlexample
   SHOW DATABASES LIKE 'dropped_user%';
   ```
2. View the workspaces in the PDB. Use the SHOW WORKSPACES IN DATABASE command to list the available workspaces:

   ```sqlexample
   SHOW WORKSPACES IN DATABASE DROPPED_USER$dropped_user_1754344912;
   ```
3. Create a new workspace from the recovered one. Use the CREATE WORKSPACE … FROM command to create a new workspace from the recovered one.

   This copies the content to a new location, making it accessible.

   > **Note:**
   >
   > You must use the USER$ qualifier to put the workspace in your own personal database. Otherwise, an error occurs. The timestamp at the
   > end of the database name varies.

   ```sqlexample
   FROM 'snow://workspace/DROPPED_USER$dropped_user_1754344912.PUBLIC."to_be_recovered"/versions/head';
   ```

## Limitations

* Column statistics may take longer to generate as the number of columns increases.
* Snowflake Copilot is not available in Workspaces.
* [Query filters](../ui-snowsight-filters.md) are not supported. Any queries containing filters will fail.
* Workspaces files are not included in Universal Search results.
* Opening and editing the same worksheet in the new Workspaces UI and old Worksheets UI simultaneously can result in lost changes.
* For worksheets, execution context settings (role, warehouse, and namespace) are not synchronized across the new Workspaces UI and the old Worksheets UI.

---
title: Workspaces
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces.md
section: Snowsight UI
---

# Workspaces

> Get started with Workspaces
>
> [Try it in Snowsight](https://app.snowflake.com/_deeplink/#/workspaces?utm_source=docs&utm_medium=growth&utm_campaign=-us-en-all&utm_content=-app-user-guide-ui-snowsight-workspaces)

> **Important:**
>
> Legacy Worksheets will be removed from Snowsight on **June 22, 2026**.
> Workspaces is the replacement SQL editing experience. For the full deprecation timeline and migration guidance, see
> [Deprecation of Legacy Worksheets and Dashboards](../../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

## Overview

Workspaces provides a unified editor for creating, organizing, and managing code across multiple file types that you can use to analyze data,
develop models, and build pipelines.

A *workspace* is private to you and offers a development environment where you can build, experiment, and test your work. All content in Workspaces
is file-based, allowing you to work on more complex projects and easily integrate with Git for version control, collaboration, and alignment
with your existing workflows.

When a user accesses Workspaces for the first time, Snowflake automatically creates an internal, user-specific personal database. This database
is used to store workspaces and cannot contain standard objects such as tables or views. It does not grant the user any additional
capabilities or privileges beyond enabling Workspace functionality. For details on personal databases, see [Personal Databases](../personal-databases.md).

Administrators may notice that users appear to have OWNERSHIP, USAGE, and CREATE SCHEMA privileges on this database. These privileges are
required for interacting with Workspaces and do not affect access to other resources.

## The Workspaces environment

Workspaces is a new editor composed of six sections, or *panes*:

1. **Workspaces:** One area for all your files and folders. Drag files to move them between folders. Use nested folders to group related
   worksheets under logical categories so that you can quickly find specific worksheets without searching through a flat list. Each user has a
   default workspace named “My Workspace” that is automatically provisioned by Snowflake. You can also create a new workspace by selecting
   + Add New in the Workspaces menu. The default workspace cannot be deleted or renamed.
2. **Worksheets:** Open and edit worksheets you own or have any permissions on. Note that edits will not be saved if you only have read
   permissions on the worksheet. To convert a worksheet into a file in a workspace, drag it to a folder inside the workspace. You can only move
   worksheets individually; moving multiple worksheets at once is not supported. Workspace queries are run similarly to worksheets with a few
   small differences, including improved UI performance and the ability to run two queries simultaneously from the same SQL file.
3. **Database Explorer:** A hierarchical view of all databases in your account, the schemas for each database, and other objects, organized
   by type. Use the filter to search for objects. You can also filter out unusable objects to simplify your view by selecting Show databases I can query.
   The options available in the vertical ellipsis  (more actions) button vary by object type, but include features such as
   placing names in the editor, copying names, and viewing definitions. To open or close the Database Explorer or File Explorer, select the
   File Explorer icon  in the bottom toolbar of the Workspaces window.
4. **Editor:** Edit queries and split them side by side to view multiple files simultaneously. Use inline Copilot to get suggestions and
   completions directly within the editor workspace.
5. **Results:** Split results side-by-side or pin them for easy comparison.
6. **Query History:** View the history of all queries you have run. Current File shows historical queries from the file currently open
   and selected in the editor. Filter to the current file or across all files. All Files displays all historical queries you have run
   across all files. To open or close this view, select the Query History icon  in the bottom toolbar of the Workspaces window.

## Manage access and behavior

As an administrator, you can manage the transition to Workspaces through Snowsight or using SQL commands. You can set the default editor
for SQL queries, disable the Workspaces feature, and address potential conflicts with existing security policies.

### Set or revert the default editor

To set Workspaces as the account-wide default editor for all users from Snowsight, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md) as ACCOUNTADMIN.
2. In the lower-left corner, select your name » Settings.
3. Under Account, choose General.
4. Enable the Set Workspaces as default SQL editor for the account option.

   Administrators can revert to Worksheets as the default editor by disabling this option. If users want to revert to Worksheets, they can also
   select Go to Worksheets from the Workspaces UI:

   Or toggle the user setting in the Workspaces editor:

To set the account-wide default editor to be Workspaces for all users using SQL:

> ```sqlexample
> ALTER ACCOUNT SET USE_WORKSPACES_FOR_SQL = 'always';
> ```

To revert this setting and use the previous default editor, but respect any Snowflake-managed BCR that makes Workspaces the default, run this command:

> ```sqlexample
> ALTER ACCOUNT UNSET USE_WORKSPACES_FOR_SQL;
> ```

To revert to the previous editor and temporarily ignore any Snowflake-managed BCR that makes Workspaces the default, run this command:

> ```sqlexample
> ALTER ACCOUNT SET USE_WORKSPACES_FOR_SQL = 'never';
> ```

> **Note:**
>
> Worksheets will eventually become deprecated and the command above will no longer work. If you had previously set this parameter, it will
> be automatically cleared once Worksheets is deprecated. For more information, see [Deprecation of Legacy Worksheets and Dashboards](../../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

### Disable Workspaces

> **Warning:**
>
> Disabling Workspaces by setting `ENABLE_PERSONAL_DATABASE` to `FALSE` is deprecated. Starting **April 20, 2026**, this setting is
> ignored and Workspaces can no longer be disabled. For details, see [Deprecation of Legacy Worksheets and Dashboards](../../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

To disable Workspaces, set the ENABLE_PERSONAL_DATABASE account-level parameter to FALSE, run this command:

```sqlexample
ALTER ACCOUNT SET ENABLE_PERSONAL_DATABASE = FALSE;
```

This parameter requires ACCOUNTADMIN privileges. After you set it to `FALSE`, Workspaces will not be functional; however, Workspaces
will still be listed in the Snowsight navigation menu.

## Limitations

* [Query filters](../ui-snowsight-filters.md) are not supported. Any queries containing filters will fail.
* Workspaces files are not included in Universal Search results.
* Opening and editing the same worksheet in the new Workspaces UI and old Worksheets UI simultaneously can result in lost changes.
* For worksheets, execution context settings (role, warehouse, and namespace) are not synchronized across the new Workspaces UI and the old Worksheets UI.

---
title: Workspaces replication
source: https://docs.snowflake.com/en/user-guide/ui-snowsight/workspaces-replication.md
section: Snowsight UI
---

# Workspaces replication

> **Important:**
>
> * Workspaces owned by users require Business Critical (BC) or higher to support replication.
> * Failover and failback require Business Critical Edition or higher. To inquire about upgrading, contact [Snowflake Support](../contacting-support.md).

Replication helps ensure business continuity by making workspaces and other important objects
available across accounts, even during disasters, outages, or periods of unavailability. Administrators configure replication groups to copy
account objects and databases from a primary account to one or more secondary accounts on a defined schedule.

## How Workspaces replication works

Shared workspaces are replicated when they are included in a database that is part of a replication or failover group. Private workspaces are
replicated when their owning users are replicated. In secondary (target) accounts, replicated content is read-only; Workspace files are executable
but cannot be edited. To create and run new queries, use the original Worksheets interface in the secondary account.

Database replication can also be configured as a failover group to support high availability. When a secondary failover group is promoted to
primary, all contained objects, including workspaces, become writable in the new primary account.

For more information, see [Introduction to replication and failover across multiple accounts](../account-replication-intro.md).

### LOCAL workspaces

LOCAL workspaces do not use workspace replication. Workspace files remain within the current deployment and are not copied to or synchronized with other deployments.
LOCAL workspaces are stored in a schema called `LOCAL` and are always read-write, regardless of whether the account is a primary or secondary.

When workspace replication is first enabled, any workspaces that already exist in the secondary deployment are automatically migrated from
the `PUBLIC` schema to the `LOCAL` schema during the first refresh. This one-time migration ensures that users retain access to their existing
workspace data in the secondary deployment rather than losing it when replication is enabled.

After the one-time migration, standard Snowflake replication behavior applies:

* Workspaces in the secondary account (except those in the `LOCAL` schema) are updated to reflect the primary account during each refresh and are read-only.
* Workspaces in the `LOCAL` schema are not affected by replication refreshes and remain read-write.

## Set up Workspaces replication

To replicate Workspaces, you must complete the following setup tasks in order:

### Step 1: Enable replication for the account

A user with the ORGADMIN role must enable replication for each source and target account in the organization:

```sqlexample
USE ROLE ORGADMIN;
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER(
    '<organization_name>.<account_name>',
    'ENABLE_ACCOUNT_DATABASE_REPLICATION',
    'true');
```

For more information, see [Prerequisite: Enable replication for accounts in the organization](../account-replication-config.md).

### Step 2: Create a replication group

A replication group copies objects from a primary account to a secondary account on an optionally defined schedule.

To create a replication group, specify the account that contains the workspace in the replication group:

#### Primary account

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE REPLICATION GROUP my_replication_group
    OBJECT_TYPES = USERS
    ALLOWED_ACCOUNTS = org_name.secondary_account_name
    [ REPLICATION_SCHEDULE = '10 MINUTE' ]
```

In this example:

* `ALLOWED_ACCOUNTS` - The secondary account to replicate to.
* `REPLICATION_SCHEDULE` - How frequently replication occurs (for example, ‘10 MINUTE’ or ‘1 HOUR’).

#### Secondary account

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE REPLICATION GROUP my_replication_group
  AS REPLICA OF org_name.primary_account_name.my_replication_group;
```

### Set up failover for high availability

To enable [failover](../account-replication-intro.md) (promotion of a secondary account to primary) during an outage, you must use
a failover group instead of a replication group:

#### Primary account

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE FAILOVER GROUP my_failover_group
  OBJECT_TYPES = USERS
  ALLOWED_ACCOUNTS = org_name.secondary_account_name
  [ REPLICATION_SCHEDULE = '10 MINUTE' ]
```

#### Secondary account

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE FAILOVER GROUP my_failover_group
  AS REPLICA OF org_name.primary_account_name.my_failover_group;
```

#### Secondary takes over as primary fails

If you [promote the failover group to primary](../account-replication-failover-failback.md), the workspace becomes read-write.

#### Secondary account behavior

If you don’t have an available read-write workspace, you can also revert to using Worksheets in Snowsight which support read-write.

## Considerations

* Query results are not replicated - Query results are only stored in the account where the query was originally run.
* The selected role, warehouse, database, and schema context for any files are not replicated - You may replicate those account level objects
  separately, but those contexts will not remain selected on the files in the target account.

## Limitations

* Git integration is not currently supported after failover - If a secondary account with workspaces is promoted to primary, you must
  reconfigure the Git integration manually.
* Workspaces in the secondary account are read-only.

For more detailed information on replication behavior, see [Replication considerations](../account-replication-considerations.md).

## Snowflake Postgres

Managed Postgres instances running directly inside Snowflake.

---
title: Configuring S3 Storage for pg_lake
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-pg_lake.md
section: Snowflake Postgres
---

# Configuring S3 Storage for pg_lake

pg_lake is a PostgreSQL extension that enables efficient querying of data stored in object storage
formats like Parquet and ORC. When using pg_lake with Snowflake Postgres, you configure access to
an Amazon S3 bucket where your data is stored by using a Snowflake storage integration.

This topic explains how to configure S3 bucket permissions on AWS and create a storage integration
that allows Snowflake Postgres to access your data.

> **Note:**
>
> Currently, this S3 storage isn’t managed by Snowflake Postgres. You provide your own S3 bucket
> and configure access through a storage integration that you attach to your Postgres instance.

## Prerequisites

Before configuring S3 storage for pg_lake, ensure that you have:

* An active AWS account with permissions to create and manage [S3 buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/creating-buckets-s3.html) and [IAM roles](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles.html).
* An S3 bucket in the same AWS region as your Snowflake account. To determine your Snowflake account
  region, execute the following query in Snowflake (not on your Postgres instance):

  ```sqlexample
  SELECT CURRENT_REGION();
  ```
* Familiarity with [AWS IAM roles and policies](https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies.html).
* A [Snowflake Postgres instance](postgres-create-instance.md) with pg_lake support.
* Privileges to create storage integrations in Snowflake (requires ACCOUNTADMIN role or a role with the CREATE INTEGRATION privilege on the account).

## Step 1: Create an S3 bucket

If you don’t already have one, create an S3 bucket in the same AWS region as your Snowflake account.
For example, if your Snowflake account is in `us-west-2`, create the S3 bucket in the
`us-west-2` region.

Refer to the AWS documentation for instructions on [creating an S3 bucket](https://docs.aws.amazon.com/AmazonS3/latest/userguide/creating-buckets-s3.html).

## Step 2: Create an IAM policy for S3 access

Create an IAM policy that grants the necessary permissions for pg_lake to read from and write to your S3 bucket:

1. Sign in to the AWS Management Console and navigate to the IAM service.
2. From the left-hand navigation pane, select Account settings.
3. Under Security Token Service (STS) in the Endpoints list, find the Snowflake region where
   your account is located. If the STS status is inactive, move the toggle to Active.
   For more information, see [Activating and deactivating AWS STS in an AWS region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html).
4. From the left-hand navigation pane, select Policies, then choose Create policy.
5. For Policy editor, select JSON.
6. Add a policy document that allows Snowflake to access the S3 bucket and folder. Replace
   `bucket_name` and `prefix` with your actual bucket name and folder path prefix:

   ```json
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                   "s3:PutObject",
                   "s3:GetObject",
                   "s3:GetObjectVersion",
                   "s3:DeleteObject",
                   "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::bucket_name/prefix/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::bucket_name",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "prefix/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   This policy provides permissions to:

   * Read, write, and delete objects in the specified S3 path
   * List bucket contents and retrieve bucket location
   * Support pg_lake’s ability to create and manage Iceberg tables
7. Choose Next.
8. Enter a policy name (for example, `snowflake_pg_lake_access`) and an optional description.
9. Choose Create policy.

## Step 3: Create an IAM role

Create an IAM role that Snowflake will assume to access your S3 bucket.

> **Important:**
>
> When you create this role, you must set the Maximum session duration to `12 hours`.
> The storage integration won’t work with the default session duration. See the last step in
> this section.

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Select Another AWS account.
5. In the Account ID field, enter your own AWS account ID temporarily. You will modify the
   trust relationship in a later step to grant access to Snowflake.
6. Select the Require external ID option. Enter a placeholder external ID such as `0000`.
   You will update this with the actual external ID generated by Snowflake in a later step.

   > **Note:**
   >
   > An external ID is used to grant access to your AWS resources (such as S3 buckets) to a
   > third party like Snowflake. For more information, see
   > [How to use an external ID when granting access to your AWS resources to a third party](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html).
7. Select Next.
8. Search for and select the policy you created in Step 2: Create an IAM policy for S3 access.
9. Select Next.
10. Enter a name and description for the role (for example, `snowflake_pg_lake_role`), then select
    Create role.
11. On the role summary page, locate and record the Role ARN value. You will need this when
    creating the storage integration in Snowflake.
12. While on the role summary page, select Edit in the summary section and change the
    Maximum session duration to `12 hours`. Select Save changes. For more
    information, see [Modifying a role maximum session duration (AWS)](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_use.html#id_roles_use_view-role-max-session).

## Step 4: Create a storage integration in Snowflake

Create a storage integration object in Snowflake that references the IAM role you created.
For the full command syntax, see [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md).

```sqlexample
CREATE STORAGE INTEGRATION my_pg_lake_integration
  TYPE = POSTGRES_EXTERNAL_STORAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/snowflake_pg_lake_role'
  STORAGE_ALLOWED_LOCATIONS = ('s3://my-bucket/my-prefix/');
```

Where:

* `my_pg_lake_integration` is the name you choose for the storage integration.
* `TYPE = POSTGRES_EXTERNAL_STORAGE` specifies that this integration is for use with Snowflake Postgres.
* `STORAGE_AWS_ROLE_ARN` is the Role ARN you recorded in Step 3: Create an IAM role.
* `STORAGE_ALLOWED_LOCATIONS` specifies the S3 bucket and path prefix. Replace `my-bucket` and `my-prefix` with the bucket name and folder path you created in Step 1: Create an S3 bucket. Note that only one location is allowed for Postgres storage integrations.

> **Note:**
>
> Creating a storage integration requires the ACCOUNTADMIN role or a role with the CREATE INTEGRATION
> privilege on the account. For more information, see [Access control privileges](../security-access-control-privileges.md).

## Step 5: Retrieve the Snowflake IAM user ARN and external ID

After creating the storage integration, use the [DESCRIBE INTEGRATION](../../sql-reference/sql/desc-integration.md) command
to retrieve the AWS IAM user and external ID that Snowflake generated for this integration:

```sqlexample
DESCRIBE STORAGE INTEGRATION my_pg_lake_integration;
```

In the output, locate and record the following values:

* `STORAGE_AWS_IAM_USER_ARN`: The IAM user ARN that Snowflake will use to assume the role
* `STORAGE_AWS_EXTERNAL_ID`: The external ID to use in the trust policy

You will use these values in the next step to configure the IAM role trust policy.

## Step 6: Update the IAM role trust policy

Update the trust policy of the IAM role you created in Step 3: Create an IAM role to allow Snowflake to assume the role:

1. Sign in to the AWS Management Console and navigate to the IAM service.
2. From the left-hand navigation pane, select Roles.
3. Select the role you created in Step 3: Create an IAM role.
4. Select the Trust relationships tab.
5. Select Edit trust policy.
6. Replace the policy document with the following text:

   ```json
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Principal": {
                   "AWS": "<storage_aws_iam_user_arn>"
               },
               "Action": "sts:AssumeRole",
               "Condition": {
                   "StringEquals": {
                       "sts:ExternalId": "<storage_aws_external_id>"
                   }
               }
           }
       ]
   }
   ```

   Replace the placeholder values with the values you recorded in
   Step 5: Retrieve the Snowflake IAM user ARN and external ID:

   * Replace `storage_aws_iam_user_arn` with the `STORAGE_AWS_IAM_USER_ARN` value.
     This is a full ARN in the form `arn:aws:iam::<account_id>:user/snowflake-postgres-integration-management`,
     where the username is always the same and only the AWS account ID varies.
   * Replace `storage_aws_external_id` with the `STORAGE_AWS_EXTERNAL_ID` value.
7. Select Update policy to save the changes.

## Step 7: Attach the storage integration to your Postgres instance

Attach the storage integration to your Snowflake Postgres instance. When the storage integration is attached, the S3 credentials
are automatically synchronized to the Postgres control plane and made available to pg_lake:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres_instance
  SET STORAGE_INTEGRATION = my_pg_lake_integration;
```

You can also specify the storage integration when creating a new Postgres instance:

```sqlexample
CREATE POSTGRES INSTANCE my_postgres_instance
  ...
  STORAGE_INTEGRATION = my_pg_lake_integration;
```

To remove a storage integration from a Postgres instance:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres_instance
  UNSET STORAGE_INTEGRATION;
```

## Step 8: Configure and use pg_lake

After attaching the storage integration, connect to your Postgres instance and configure pg_lake.
For a list of available extensions, see [Snowflake Postgres Extensions](postgres-extensions.md).

1. Create the pg_lake extension:

   ```postgres
   CREATE EXTENSION pg_lake CASCADE;
   ```
2. Set the default storage location for Iceberg tables. This should match the location specified in
   your storage integration.

   The SET command only applies to the current session:

   ```postgres
   SET pg_lake_iceberg.default_location_prefix = 's3://my-bucket/my-prefix';
   ```

   To set the value for all current and future sessions, use the ALTER DATABASE command instead. If you use
   multiple Postgres databases, make sure to set the storage location for each database:

   ```postgres
   -- Substitute the name of your database
   ALTER DATABASE my_database SET pg_lake_iceberg.default_location_prefix = 's3://my-bucket/my-prefix';
   ```
3. Verify that the storage integration is configured correctly by listing the contents of your
   S3 bucket:

   ```postgres
   SELECT * FROM lake_file.list('s3://my-bucket/my-prefix/*');
   ```

   Replace `my-bucket` and `my-prefix` with your actual bucket name and path. If the
   configuration is correct, this query returns a list of files at that location. If the bucket
   is empty, the query returns an empty result set without an error.
4. Verify the end-to-end configuration by creating an Iceberg table, inserting data, and
   querying it back. If this succeeds, pg_lake can read from and write to your S3 bucket:

   ```postgres
   CREATE TABLE my_table (
       id INT,
       data TEXT
     ) USING iceberg;

   INSERT INTO my_table VALUES (1, 'hello iceberg');

   SELECT * FROM my_table;
   ```

## Security considerations

When configuring S3 access for pg_lake, keep these security best practices in mind:

* **Use IAM roles**: Snowflake Postgres uses IAM role assumption rather than static credentials,
  providing better security through temporary credentials and automatic credential rotation.
* **Limit IAM permissions**: Grant only the minimum necessary permissions to the S3 bucket paths
  that pg_lake needs to access. The IAM policy should restrict access to specific bucket prefixes.
* **Monitor external ID**: The external ID in the trust policy ensures that only your Snowflake
  account can assume the IAM role.
* **Review storage integration changes**: Any updates to the storage integration’s
  `STORAGE_AWS_ROLE_ARN` or `STORAGE_ALLOWED_LOCATIONS` are automatically synchronized to the
  Postgres instance.
* **Use bucket policies**: Consider using S3 bucket policies in addition to IAM policies for defense in depth.
* **Enable S3 access logging**: Enable access logging on your S3 bucket to monitor and audit access patterns.
* **Regional alignment**: Ensure your S3 bucket is in the same AWS region as your Snowflake account
  for optimal performance and to meet data residency requirements.

## Troubleshooting

### Storage integration creation errors

If you encounter errors when creating the storage integration:

* Verify that you have the ACCOUNTADMIN role or a role with the CREATE INTEGRATION privilege on the account.
* Ensure the IAM role ARN is correctly formatted and exists in your AWS account.
* Confirm that the S3 bucket location uses the correct format: `s3://bucket-name/prefix/`
* Note that only one storage location is allowed for `POSTGRES_EXTERNAL_STORAGE` integrations.

> **Tip:**
>
> Storage integration errors are logged in the Postgres server logs with a `Storage integration:`
> prefix. For example:
>
> `Storage integration: IAM role must have Maximum Session Duration set to 12 hours`
>
> For information about accessing Postgres logs, see
> [Snowflake Postgres logging](postgres-logging.md).

### Connection errors

If pg_lake cannot access S3 after attaching the storage integration:

* Verify that the storage integration is properly attached to your Postgres instance by querying
  the instance properties.
* Check that the IAM role trust policy has been updated with the correct Snowflake IAM user ARN
  and external ID from the DESCRIBE STORAGE INTEGRATION output.
* Ensure that the S3 bucket region matches your Snowflake account region.
* Verify that the STS endpoint for your region is active in AWS IAM Account settings.

### Permission denied errors

If you receive permission denied errors when accessing S3:

* Confirm that the IAM policy attached to the role includes all required permissions:
  `s3:PutObject`, `s3:GetObject`, `s3:GetObjectVersion`, `s3:DeleteObject`,
  `s3:DeleteObjectVersion`, `s3:ListBucket`, and `s3:GetBucketLocation`.
* Verify that the IAM role’s trust policy allows the Snowflake IAM user to assume the role.
* Check that the S3 bucket policy (if any) doesn’t deny access from the IAM role.
* Ensure that the S3 paths you’re accessing match the prefix specified in `STORAGE_ALLOWED_LOCATIONS`.

### Trust policy errors

If you encounter errors related to assuming the IAM role:

* Verify that the external ID in the trust policy exactly matches the `STORAGE_AWS_EXTERNAL_ID`
  from the storage integration.
* Confirm that the principal ARN in the trust policy matches the `STORAGE_AWS_IAM_USER_ARN` from
  the storage integration.
* Check that the maximum session duration for the IAM role is set to 12 hours.

## Related information

* [Option 1: Configure a Snowflake storage integration to access Amazon S3](../data-load-s3-config-storage-integration.md) — Similar S3 access workflow to the one described in this topic
* [Apache Iceberg™ tables](../tables-iceberg.md) — Overview of Iceberg table support in Snowflake
* [Create an Apache Iceberg™ table in Snowflake](../tables-iceberg-create.md) — Creating Iceberg tables from different catalog sources
* [Configure an external volume](../tables-iceberg-configure-external-volume.md) — Configuring an external volume for Iceberg tables
* [Configure a catalog integration for files in object storage](../tables-iceberg-configure-catalog-integration-object-storage.md) — Catalog integration setup for files in object storage
* [pg_lake extension documentation](https://github.com/Snowflake-Labs/pg_lake)

---
title: Connecting to Snowflake Postgres
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/connecting-to-snowflakepg.md
section: Snowflake Postgres
---

# Connecting to Snowflake Postgres

Once you create a Snowflake Postgres instance, you can connect to it with any PostgreSQL client, such as `psql` or DBeaver. To
establish a connection, you configure your client with:

* The **hostname** of the instance. This is the URL of the virtual machine host.
* A **username**. When you create an instance, the `snowflake_admin` user is created by default and designed for administrative
  access.
* The **Postgres database** that you want to connect to. This parameter is required to create Postgres connections. The default
  database is named `postgres`.
* A **password** for your user.

Here is an example of these connections details used with the `psql` command line client:

```bash
$ psql -h abcefg.snowflake.app  -U snowflake_admin -d postgres
```

(`psql` will prompt for a password.)

If you need to specify a port, use 5432:

```bash
$ psql -h abcefg.snowflake.app  -U snowflake_admin -p 5432 -d postgres
```

> **Important:**
>
> SSL is required to connect to Snowflake Postgres instances.

## About connection strings

When creating a Postgres instance via Snowsight, Snowflake Postgres provides a connection string in
[libpq URI format](https://www.postgresql.org/docs/current/libpq-connect.html#LIBPQ-CONNSTRING) to use to connect directly
via `psql` or to input into your application configuration.

> **Note:**
>
> A cluster’s connection string remains the same across cluster management operations, unless you explicitly reset access for
> a given role.

The connection string as a database URL contains the following parameters:

* protocol: `postgres://`
* username: See [Snowflake Postgres Roles](postgres-roles.md) for more details
* password
* hostname
* port: 5432
* database_name: Defaults to `postgres`

These are then used to build a URI connection string with this format:

```none
postgresql://<username>:<password>@hostname:<port>/<database_name>
```

If your client environment is not otherwise configured to enforce SSL connections, you can append `?sslmode=require`
to the URI:

```none
postgresql://<username>:<password>@hostname:<port>/<database_name>?sslmode=require
```

The [sslmode](https://www.postgresql.org/docs/current/libpq-connect.html#LIBPQ-CONNECT-SSLMODE) parameter will accept different values
indicating different levels of SSL encryption and certificate verification to be used. `sslmode=require` is the minimum level
required to enforce SSL encryption. For configuring your client to perform SSL certificate verification of your Snowflake Postgres server
certificates, see [Snowflake Postgres SSL certificates](postgres-ssl-certs.md).

You can specify several other client connection parameters in a connection URI in the same way as `sslmode` is
specified above. For a full list, see the PostgreSQL documentation’s [list of URI connection parameters](https://www.postgresql.org/docs/current/libpq-connect.html#LIBPQ-PARAMKEYWORDS).

You can also set many of these parameters via [environment variables recognized by libpq](https://www.postgresql.org/docs/current/libpq-envars.html). For example, the following ensures that the `psql` connection is made with
`?sslmode=require` set:

```bash
export PGSSLMODE=require
psql -h {hostname} -U {username} {dbname}
```

Setting client connection parameters via environment variables is useful when configuring connections for application frameworks that
do not otherwise provide configuration options for needed connection parameters.

> > **Note:**
> >
> > For applications that use non-`libpq`-based database drivers, consult the documentation for those other drivers for their
> > client configuration parameter options and specification format. For example, [PostgreSQL’s JDBC driver](https://jdbc.postgresql.org/documentation/use/) provides many parameters equivalent to those provided by `libpq`, but their
> > specification in URIs is slightly different.

---
title: Creating a Snowflake Postgres Instance
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-create-instance.md
section: Snowflake Postgres
---

# Creating a Snowflake Postgres Instance

## Overview

You can create Snowflake Postgres instances by using either Snowsight or by executing
Snowflake SQL statements. You can configure the size of the instance, the storage size, and the
Postgres major version when creating an instance. You can also apply network policies to instances
at creation time.

## Privileges

To create Snowflake Postgres instances, you must use a role that has been granted
the CREATE POSTGRES INSTANCE privilege on the account. By default, this
privilege is granted to the ACCOUNTADMIN role.

To grant this privilege to other roles, a user with the ACCOUNTADMIN role
can run the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command:

```sqlexample
GRANT CREATE POSTGRES INSTANCE ON ACCOUNT TO your_role;
```

## Creating a Postgres instance

SnowsightSQL

You can create a Postgres instance by using the Create menu or by using the Create button in the Postgres Instances page.

**Using the main Create menu:**

1. At the top of the navigation menu, select  (Create).
2. Select Postgres Instance.
3. Configure your instance.
4. Select Create.

**Using the Create button on the Postgres instances page:**

1. In the navigation menu, select Postgres.
2. In the Postgres Instances page, select the Create button at the top right.
3. Choose your instance configuration.
4. Select Create.

When you create an instance, the connection details are displayed, including the hostname and credentials needed to connect to
the instance. Save these credentials in a secure location; they will not be shown again. You can regenerate credentials later if
needed.

If you did not select a network policy, you will have the option to configure network settings from the instance details page.
See [Snowflake Postgres networking](postgres-network.md) for more details.

> Use the [CREATE POSTGRES INSTANCE](../../sql-reference/sql/create-postgres-instance.md) command to create a new Postgres instance. The syntax of this command is shown below:

```sqlsyntax
CREATE POSTGRES INSTANCE <name>
  COMPUTE_FAMILY = '<compute_family>'
  STORAGE_SIZE_GB = <storage_gb>
  AUTHENTICATION_AUTHORITY = POSTGRES
  [ POSTGRES_VERSION = { 16 | 17 | 18 } ]
  [ NETWORK_POLICY = '<network_policy>' ]
  [ HIGH_AVAILABILITY = { TRUE | FALSE } ]
  [ POSTGRES_SETTINGS = '<json_string>' ]
  [ COMMENT = '<string_literal>' ];
```

For the command parameters:

> `COMPUTE_FAMILY = compute_family`
> :   Specifies the name of an instance size from the [Snowflake Postgres Instance Sizes](postgres-instance-sizes.md) tables.
>
> `STORAGE_SIZE_GB = storage_gb`
> :   Specifies storage size in GB. Must be between 10 and 65,535.
>
> `AUTHENTICATION_AUTHORITY = POSTGRES`
> :   Determines how you authenticate to your instance. Currently, the only available option is `POSTGRES`, but other
>     authentication methods, including `SNOWFLAKE`, might be supported in the future.
>
> `POSTGRES_VERSION = { 16 | 17 | 18 }`
> :   Specifies the version of Postgres to use.
>
>     Default: The latest Postgres version.
>
> `NETWORK_POLICY = 'network_policy'`
> :   Specifies the [network policy](postgres-network.md) to use for the instance. To specify this parameter, you must have been granted the USAGE privilege on the NETWORK_POLICY object.
>
>     Default: No network policy is applied. A network policy will need to be configured before the instance can be reached. See [Snowflake Postgres networking](postgres-network.md) for more information.
>
> `HIGH_AVAILABILITY = { TRUE | FALSE }`
> :   Specifies whether to enable high availability for the instance.
>
>     Default: `FALSE`
>
> `POSTGRES_SETTINGS = 'json_string'`
> :   Allows you to optionally set Postgres configuration parameters on your instance in JSON format. See [Snowflake Postgres Server Settings](postgres-server-settings.md) for a list of available Postgres parameters.
>
>     ```none
>     '{"component:name" = "value", ...}'
>     ```
>
>     Default: No custom Postgres configuration parameters are set.
>
> `COMMENT = 'string_literal'`
> :   Specifies a comment for the Postgres instance.
>
>     Default: `NULL`

When you create the instance, one row with the following columns is returned:

* `status`
* `host`
* `access_roles`
* `default_database`

The `access_roles` column contains the user name and password for both the `snowflake_admin` and `application` roles. Save these details in a secure location because they cannot be retrieved later.

Creating a new instance takes some time to complete. The instance displays its current
state as it is building. See the list of [instance states](managing-instances.md) for
details about the states that you see while instances are being created.

---
title: Snowflake Postgres
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/about.md
section: Snowflake Postgres
---

# Snowflake Postgres

## About Snowflake Postgres

Snowflake Postgres lets you create, manage, and use Postgres instances directly from Snowflake. Each instance runs a Postgres
database server on a dedicated virtual machine managed by Snowflake. You connect directly to your instances using any Postgres
client. Snowflake Postgres brings the reliable and trusted transactional database capabilities of Postgres to the Snowflake data
platform.

## About Postgres

PostgreSQL (also referred to as Postgres) is a mature, open-source relational database management system that has been actively
developed for more than 30 years. As a general-purpose transactional database, Postgres is designed for operational applications that
require highly-concurrent read/write operations, and low-latency data processing. Postgres offers a wide array of data types, including
JSONB, and sophisticated indexing capabilities. Postgres is increasingly becoming the database of choice for a wide range of use cases,
and is supported by an ecosystem of community-sponsored developer tools and extensions that offer enhanced capabilities. With
its proven reliability and performance, and active developer community, Postgres is a great addition to the Snowflake AI Data Cloud
platform that supports an expanded set of customer workloads.

## Architecture

Postgres is a mature, battle-tested database known for its reliability and performance, but it follows a more traditional architectural
model than the rest of the Snowflake platform. To bring Postgres into Snowflake, we designed an approach that preserves its operational
strengths while integrating it with Snowflake’s security, management, and connectivity capabilities.

Snowflake Postgres provisions a dedicated Postgres instance with attached disks to deliver best-in-class transactional performance. Each
Postgres instance runs in a fully isolated private network and supports private connectivity via firewall rules or Private Link. Snowflake
Postgres also offers built-in connection pooling via PgBouncer to support high-concurrency application workloads.

Snowflake Postgres is fully compatible with existing Postgres tooling and workloads, enabling you to lift-and-shift applications to
Snowflake with no code changes, and use everything that works with your Postgres instances today, including ORMs and all supported SQL
clients.

## Regional availability

Snowflake Postgres is available for the Amazon Web Services (AWS) and Microsoft Azure cloud service
providers (CSPs). Google Cloud Platform (GCP) isn’t currently supported.

Snowflake Postgres is available in the following [regions](../intro-regions.md).

| Cloud region | Cloud region ID |
| --- | --- |
| **Amazon Web Services (AWS)** |  |
| US East (N. Virginia) | us-east-1 |
| US East (Ohio) | us-east-2 |
| US West (Oregon) | us-west-2 |
| Canada (Central) | ca-central-1 |
| South America (Sao Paulo) | sa-east-1 |
| EU (Ireland) | eu-west-1 |
| Europe (London) | eu-west-2 |
| EU (Paris) | eu-west-3 |
| EU (Frankfurt) | eu-central-1 |
| EU (Zurich) | eu-central-2 |
| EU (Stockholm) | eu-north-1 |
| Africa (Cape Town) | af-south-1 |
| Asia Pacific (Mumbai) | ap-south-1 |
| Asia Pacific (Singapore) | ap-southeast-1 |
| Asia Pacific (Jakarta) | ap-southeast-3 |
| Asia Pacific (Sydney) | ap-southeast-2 |
| Asia Pacific (Tokyo) | ap-northeast-1 |
| Asia Pacific (Seoul) | ap-northeast-2 |
| Asia Pacific (Osaka) | ap-northeast-3 |
| **Microsoft Azure** |  |
| East US 2 (Virginia) | eastus2 |
| Central US (Iowa) | centralus |
| South Central US (Texas) | southcentralus |
| West US 2 (Washington) | westus2 |
| Canada Central (Toronto) | canadacentral |
| North Europe (Ireland) | northeurope |
| UK South (London) | uksouth |
| West Europe (Netherlands) | westeurope |
| Switzerland North (Zurich) | switzerlandnorth |
| Sweden Central | swedencentral |
| Southeast Asia (Singapore) | southeastasia |
| Japan East (Tokyo) | japaneast |
| Australia East (Sydney) | australiaeast |
| Korea Central (Seoul) | koreacentral |
| Central India (Pune) | centralindia |
| UAE North (Dubai) | uaenorth |

## Postgres versions

Postgres major versions 16-18 are currently available. When you choose a major version for your new instance, Snowflake Postgres automatically uses the latest available minor version. The latest available minor versions for each major version are 16.13, 17.9,
and 18.3.

For details on upgrading the Postgres version of your existing Snowflake Postgres instances see [Snowflake Postgres version upgrades](postgres-upgrades.md).

## When to use Postgres

Choose Postgres when you need a high-throughput, high-concurrency operational database, you have a use case that can benefit from
specific Postgres capabilities, or have an existing Postgres application.

## Customer Configurable Security Controls

Customers are responsible for managing the following controls to ensure a level of security appropriate to the particular content of their Postgres instances:

* Securing, keeping confidential, and rotating Postgres instance credentials, including passwords and connection strings.
* Maintaining appropriate password uniqueness, length, complexity, and expiration.
* Using [Snowflake Token Authentication for Snowflake Postgres](postgres-token-auth.md) passwords for interactive user connections.
* Using restrictive [networking policies and rules](postgres-network.md).
* Configuring [SSL certificate verification](postgres-ssl-certs.md) for your client connections to Snowflake Postgres instances.
* Configuring user and role-based access controls, including scope and duration of user access.

---
title: Snowflake Postgres Connection Pooling
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-connection-pooling.md
section: Snowflake Postgres
---

# Snowflake Postgres Connection Pooling

A connection pool is a cache of database connections that can be reused. When a request comes in from a client, an available connection
from the pool is given for that request or transaction.

In contrast, without any connection pooling, the client has to reach out to the database to establish a connection. Opening new connections
can impact availability and performance — in PostgreSQL, the server “forks” or creates a new process, and could use up available resources
as well as prevent new connections from being established. Connection pooling helps mitigate these issues and ensure that your applications
can scale.

## Do I need connection pooling?

Connection pooling is especially helpful when you have a high number of connections from your application, often in a client-side pool or via
multiple threads/processes from your web server.

You can run the following query on your Snowflake Postgres instance to determine if you would benefit from connection pooling:

```postgres
SELECT count(*),
       state
FROM pg_stat_activity
GROUP BY 2;
```

```output
 count |             state
-------+-------------------------------
     7 | active
    69 | idle
    26 | idle in transaction
    11 | idle in transaction (aborted)
(4 rows)
```

If you see a high number of idle connections relative to active ones, then using connection pooling is strongly recommended.

## Connection Pooling with PgBouncer

Snowflake Postgres uses [pgBouncer](http://www.pgbouncer.org/) for connection pooling. PgBouncer is made available on all Snowflake
Postgres instances by default to ease connection management by multiplexing native Postgres connections across its own “virtual”
connections. By default, PgBouncer instances on Snowflake Postgres are run in transaction pooling mode.

However, in order to make use of the PgBouncer service, you must take one extra step on each database you want to use it on by installing
the `snowflake_pooler` extension.

### Activating PgBouncer with the `snowflake_pooler` extension

As the `snowflake_admin` Postgres user, run the following in the database to install the `snowflake_pooler` extension:

```postgres
CREATE EXTENSION snowflake_pooler;
```

### What is `snowflake_pooler`?

`snowflake_pooler` is a simple extension that creates a user called `snowflake_pooler`. This user has access to a single function
called `user_lookup` that allows PgBouncer to authenticate incoming connections. That way, when a client makes a connection to PgBouncer,
it can check whether the client’s credentials are valid by querying Postgres’s canonical user store.

> **Note:**
>
> The `snowflake_pooler` extension must be installed individually in each database where you want to connect through PgBouncer. If
> `snowflake_pooler` has not been installed, you may receive an error like:
>
> ```output
> failed: FATAL: bouncer config error
> ```
>
> To resolve the error, connect to the database and run: `CREATE EXTENSION snowflake_pooler;`.

### Connecting to PgBouncer

Clients will connect to PgBouncer using the same connection string they’d use for the main Postgres database, except on port 5431 instead
of the usual 5432:

```bash
psql postgres://my_application_user:my_application_password@p.43lmodgbqvdmlpbjirv22dfciu.db.postgresbridge.com:5431/mydb
```

Only roles *without* superuser or replication privileges will be able to connect through PgBouncer. You might choose to connect to
PgBouncer using the `application` role, an individual user role created for team members, or any custom user roles that you may have
created (for example, using the [CREATE ROLE](https://www.postgresql.org/docs/current/sql-createrole.html) Postgres command). However,
the `user_lookup` function created by `snowflake_pooler` will deny lookups on superusers and replication roles. See [Snowflake Postgres Roles](postgres-roles.md)
for more about Postgres users and roles on Snowflake Postgres.

> **Tip:**
>
> The terms “user” and “role” in Postgres are largely synonymous. One minor difference is that CREATE USER (versus CREATE ROLE) implies
> LOGIN attribute, e.g. `CREATE ROLE myuser LOGIN;`.

### Pooling modes

PgBouncer supports three different pooling modes: transaction, session, and statement. Each is detailed briefly below and further in the
[PgBouncer documentation](https://www.pgbouncer.org/features.html).

#### Transaction

Snowflake Postgres instances will run PgBouncer in transaction pooling mode by default, since that’s the mode we recommend most people use.

> **Note:**
>
> When PgBouncer is in transaction pooling mode, SQL-level prepared statements created with PREPARE and run with EXECUTE in different
> transactions will not work since they may run on different server connections. PgBouncer does, however, support protocol-level
> prepared transactions if the application’s Postgres driver supports them. For more details on how PgBouncer handles this see its
> [max_prepared_statements](https://www.pgbouncer.org/config.html) documentation.
>
> In order to use PgBouncer’s support for protocol-level prepared statements, the PgBouncer [max_prepared_statements setting](postgres-server-settings.md) must be set to a value greater than `0`. The default on Snowflake Postgres is `250`, but you can set
> it to a different value if desired.

#### Session

Session pooling mode is supported on Snowflake Postgres if you have a need for it. To use this pooling mode, set the [pool_mode setting](postgres-server-settings.md) to `session` on your cluster.

#### Statement

Statement pooling mode is also available. However, please note that multi-statement transactions will throw errors. To use this pooling mode,
set the [pool_mode setting](postgres-server-settings.md) to `statement` on your cluster.

### Disabling PgBouncer

Dropping the `snowflake_pooler` extension from a database will functionally disable PgBouncer since it will no longer be able to authenticate:

```postgres
DROP EXTENSION snowflake_pooler;
```

---
title: Snowflake Postgres Cost Evaluation
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-cost.md
section: Snowflake Postgres
---

# Snowflake Postgres Cost Evaluation

When you use Snowflake Postgres instances, your account is charged based on three modes of consumption.

* **Instance compute**: Compute charges are based on the [COMPUTE_FAMILY](postgres-instance-sizes.md) chosen for each Snowflake Postgres
  instance created in your account and are metered on a credits per hour basis.
* **Instance storage**: Cost for storage depends on the amount of storage allocated across all Snowflake Postgres instances in your
  account. Charges are based on a flat monthly rate per terabyte (TB) per month but are metered on a byte-month basis.
* **Data transfer**: Standard [Snowflake data transfer costs](../cost-understanding-data-transfer.md) apply for all data
  transfer in and out of Snowflake Postgres instances. This includes data replication between Snowflake Postgres primary instances and any
  read replicas they have.

Details on pricing specifics for each mode of consumption can be found in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Monitoring storage consumption for Snowflake Postgres instances

You can view storage usage consumption for Snowflake Postgres instances by querying the [POSTGRES_STORAGE_USAGE_HISTORY view](../../sql-reference/account-usage/postgres_storage_usage_history.md).

## Monitoring compute consumption for Snowflake Postgres instances

You can view the total compute usage for Snowflake Postgres instances by querying the following views:

* You can query the [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md) and specify `service_type IN ('POSTGRES_COMPUTE', 'POSTGRES_COMPUTE_HA')`
  in the WHERE clause to see the hourly credit usage across all Snowflake Postgres instances for an account within the last 365 days (1 year).
* You can query the [METERING_DAILY_HISTORY view](../../sql-reference/account-usage/metering_daily_history.md) and specify `service_type IN ('POSTGRES_COMPUTE', 'POSTGRES_COMPUTE_HA')`
  in the WHERE clause to see the daily credit usage across all Snowflake Postgres instances for an account within the last 365 days (1 year).

---
title: Snowflake Postgres Extensions
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-extensions.md
section: Snowflake Postgres
---

# Snowflake Postgres Extensions

Extensions allow for expanded functionality within Postgres, without requiring a new version of Postgres to be released.
Extensions can enable new functionality including data types and functions.

You can see a list of all available extensions by querying your database:

```postgres
SELECT * FROM pg_available_extensions
```

You can see all extensions that are already enabled by executing:

```postgres
SELECT * FROM pg_extension;
```

or `\dx` in psql.

Extensions are enabled by the admin user by running:

```postgres
CREATE EXTENSION extensionname;
```

## Procedural language - PL/PgSQL

While also a category of extension, procedural languages allow you to write custom functions to be executed within your database.
We currently support PL/PgSQL.

## Current catalog of extensions

| Extension | Type of extension | Summary | Command to create |
| --- | --- | --- | --- |
| Address Standardizer | Functions | Used to parse an address into constituent elements | `CREATE EXTENSION address_standardizer;` |
| Address Standardizer (US) | Functions | Data for standardizing US addresses | `CREATE EXTENSION address_standardizer_data_us;` |
| Amcheck | Functions | Functions for verifying relation integrity | `CREATE EXTENSION amcheck;` |
| Audit | Functions | Audit user actions | `CREATE EXTENSION pgaudit;` |
| Auto explain | Logging | Automatically log execution plans of slow statements | [See auto_explain](https://docs.crunchybridge.com/extensions-and-languages/auto_explain) |
| Auto Increment | Functions | Provides function for storing the next value of a sequence in an integer field | `CREATE EXTENSION autoinc;` |
| Bloom | Index types | Provides a bloom filter index type | `CREATE EXTENSION bloom;` |
| Btree GIN | Index types | Support for indexing common data types in GIN | `CREATE EXTENSION btree_gin;` |
| Btree GIST | Index types | Support for indexing common data types in GiST | `CREATE EXTENSION btree_gist;` |
| Buffer Cache | Views | Examine the shared buffer cache | `CREATE EXTENSION pg_buffercache;` |
| Case insensitive text | Data type | Case insensitive text data type | `CREATE EXTENSION citext;` |
| Cron | Functions | Create scheduled tasks | `CREATE EXTENSION pg_cron;` |
| Crypto | Functions | Functions for encrypting data inside columns | `CREATE EXTENSION pgcrypto;` |
| Cube | Data type | Data type for multi-dimensional cubes | `CREATE EXTENSION cube;` |
| DDL Extractor | Functions | DDL eXtractor functions | `CREATE EXTENSION ddlx;` |
| dict-int | Dictionaries | Full text search dictionary template for integers | `CREATE EXTENSION dict_int;` |
| dict-xsyn | Dictionaries | Full text search dictionary template for extended synonym processing | `CREATE EXTENSION dict_xsyn;` |
| Earth Distance | Functions | Functions that assist with computing the distance between points. | `CREATE EXTENSION earthdistance;` |
| Free Space Map | Functions | Examine the free space map (FSM) | `CREATE EXTENSION pg_freespacemap;` |
| Fuzzy String Match | Functions | Functions for comparing similarity between strings | `CREATE EXTENSION fuzzystrmatch;` |
| H3 | Functions | H3 bindings for Postgres | `CREATE EXTENSION h3;` |
| H3 PostGIS | Geospatial utilities | H3 bindings for PostGIS spatial types | `CREATE EXTENSION h3_postgis;` |
| Hint plan | Functions | Adjust PostgreSQL execution plans using “hints” in SQL comments ([more info](https://github.com/ossc-db/pg_hint_plan)) | `CREATE EXTENSION pg_hint_plan;` |
| HLL | Functions | HyperLogLog data structure for approximating distinct value counts | `CREATE EXTENSION hll;` |
| Hstore | Data type | Key value data type | `CREATE EXTENSION hstore;` |
| HTTP Client | Functions | HTTP client for PostgreSQL, allows web page retrieval inside the database. | `CREATE EXTENSION http;` |
| Hypopg | Functions | Hypothetical indexes | `CREATE EXTENSION hypopg;` |
| Incremental | Functions | Incremental batch processing | `CREATE EXTENSION pg_incremental;` |
| Insert Username | Functions | Will place the current Postgres username in a text field | `CREATE EXTENSION insert_username;` |
| Integer Aggregator | Functions | Integer aggregator and enumerator | `CREATE EXTENSION intagg;` |
| Integer Array | Functions | Sorting and manipulation of integer arrays | `CREATE EXTENSION intarray;` |
| ISN | Data type | Data type for product numbering (including UPC, ISBN, ISSN) | `CREATE EXTENSION isn;` |
| IVM | Functions | Incremental View Maintenance | `CREATE EXTENSION pg_ivm;` |
| Large Object | Data type | Specialized large object data type | `CREATE EXTENSION lo;` |
| Label Tree | Data type | Data type for tree-like structures | `CREATE EXTENSION ltree;` |
| Logical | Functions | Helper functions for PostgreSQL Logical Replication | `CREATE EXTENSION pglogical;` |
| Modification Time | Functions | Will place the current timestamp into a timestamp field | `CREATE EXTENSION moddatetime;` |
| Orafce | Functions | Emulate Oracle functions | `CREATE EXTENSION orafce;` |
| Page Inspect | Functions | Inspect the contents of database pages at a low level | `CREATE EXTENSION pageinspect;` |
| Row Locking | Functions | Show row-level locking information | `CREATE EXTENSION pgrowlocks;` |
| Partman | Functions | Create and manage both time-based and serial-based table partition sets | `CREATE EXTENSION pg_partman;` |
| pg_lake | Iceberg and data lake storage | Support for tables backed by object storage formats like Iceberg, Parquet, and ORC | See [Configuring S3 Storage for pg_lake](postgres-pg_lake.md) |
| PostGIS | Geospatial utilities | PostGIS geometry, geography, and raster spatial types and functions | [See PostGIS](https://docs.crunchybridge.com/extensions-and-languages/postgis) |
| PostGIS Raster | Geospatial utilities | PostGIS raster types and functions | `CREATE EXTENSION postgis_raster;` |
| PostGIS SFCGAL | Geospatial utilities | PostGIS SFCGAL functions | `CREATE EXTENSION postgis_sfcgal;` |
| PostGIS Topology | Geospatial utilities | PostGIS topology spatial types and functions | `CREATE EXTENSION postgis_topology;` |
| Postgres FDW | Foreign Data Wrapper | Foreign data wrapper for connecting to other Postgres databases | `CREATE EXTENSION postgres_fdw;` |
| Prewarm | Functions | Utilities to prewarm your cache, helpful for standby failover | `CREATE EXTENSION pg_prewarm;` |
| Proctab | Functions | Access operating system process tables from PostgreSQL | `CREATE EXTENSION pg_proctab;` |
| Refint | Functions | Functions for referential integrity | `CREATE EXTENSION refint;` |
| Repack | Functions | Remove bloat from tables and indexes (See also pg_squeeze) | `CREATE EXTENSION pg_repack;` |
| Routing | Geospatial utilities | Routing functionality | `CREATE EXTENSION pgrouting;` |
| Semver | Data type | Data type for the Semantic Version format with support for btree and hash indexing | `CREATE EXTENSION semver;` |
| Surgery | Functions | Corrective actions on corruption or damaged data | `CREATE EXTENSION pg_surgery;` |
| Seg | Data type | Data type for representing floating point intervals or segments | `CREATE EXTENSION seg;` |
| SSL Info | Functions | Ability to query SSL information based on the current connection | `CREATE EXTENSION sslinfo;` |
| Stat statements | Views | Track planning and execution statistics of all SQL statements executed | `CREATE EXTENSION pg_stat_statements;` |
| Stat Tuple | Functions | Show tuple-level statistics | `CREATE EXTENSION pgstattuple;` |
| Squeeze | Functions | Remove bloat from tables and indexes. A modern alternative to pg_repack. See [pg_squeeze docs](https://github.com/cybertec-postgresql/pg_squeeze). | `CREATE EXTENSION pg_squeeze;` |
| Table functions | Functions | Functions for cubing and rollups of tables | `CREATE EXTENSION tablefunc;` |
| Table sampling (system rows) | Functions | Functions to provide sampling of system tables | `CREATE EXTENSION tsm_system_rows;` |
| Table sampling (system time) | Functions | Functions to provide sampling of system time | `CREATE EXTENSION tsm_system_time;` |
| Trigger change notifications | Functions | Functions for listening to changes on tables | `CREATE EXTENSION tcn;` |
| Trigram | Functions | Matching and similarity of strings | `CREATE EXTENSION pg_trgm;` |
| Unaccent | Dictionaries | Text search dictionary that removes accents | `CREATE EXTENSION unaccent;` |
| Visibility | Functions | Examine the visibility map (VM) and page-level visibility info | `CREATE EXTENSION pg_visibility;` |
| Vector | Functions | Vector (pgvector) data type and ivfflat access method | `CREATE EXTENSION vector;` |
| ULID | Functions | Generate universally unique lexicographically sortable identifiers (ULIDs) | `CREATE EXTENSION pgx_ulid;` |
| uuid-ossp | Functions | Generate universally unique identifiers (UUIDs) | `CREATE EXTENSION uuid-ossp;` |
| uuidv7 | Functions | Generate version 7 universally unique identifiers (UUIDs) | `CREATE EXTENSION pg_uuidv7;` |
| WAL inspect | Functions | Inspect contents of WAL | `CREATE EXTENSION pg_walinspect;` |
| xml2 | Functions | XPath querying and XSLT | `CREATE EXTENSION xml2;` |

---
title: Snowflake Postgres High Availability
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/high-availability.md
section: Snowflake Postgres
---

# Snowflake Postgres High Availability

High Availability (HA) preserves the uptime of your instance by provisioning a secondary virtual machine in a separate
availability zone that receives the same writes as your primary. When HA is enabled, in the event your primary becomes
unavailable, we will automatically fail over to the secondary host by promoting the standby to replace the impacted host. You
don’t need to update your connection details. Once the promotion occurs, the original primary is destroyed and a new standby
host is created.

For instances that are sensitive to protracted downtime, we recommend using our HA feature. Without HA, if your instance becomes
unavailable, Snowflake attempts to provision a new host for your instance, and the control plane automatically restores your
instance using the most recent automated backup and all WAL (write-ahead-log) statements that have been generated since the
latest backup. For small, inactive clusters this could be a matter of minutes, but for larger or active clusters this could take
many hours.

To turn high availability on or off for your Snowflake Postgres instance, run the [ALTER POSTGRES INSTANCE](../../sql-reference/sql/alter-postgres-instance.md) command with the
SET HIGH_AVAILABILITY option. The following example shows how to turn high availability on or off:

```sqlexample
ALTER POSTGRES INSTANCE production_instance SET HIGH_AVAILABILITY = TRUE;
ALTER POSTGRES INSTANCE dev_test_instance SET HIGH_AVAILABILITY = FALSE;
```

---
title: Snowflake Postgres Insights
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/insights.md
section: Snowflake Postgres
---

# Snowflake Postgres Insights

The database insights available on each Snowflake Postgres instance’s Snowsight details page provide point in time insights into your database along with recommendations on actions you can take to improve performance.

To view an instance’s insights:

1. In the navigation menu, select Postgres
2. Select your instance from the list of instances shown to load its details page.
3. Choose the insight to view with the Insight select box shown just under the Details tab heading.

The available insights are:

* Cache and index hit rates
* Unused indexes
* Bloat
* Outlier queries
* Long running queries
* Vacuum statistics
* Table sizes
* Connections

## Cache hit

Postgres generally tries to keep the data you access most often in its shared buffers cache. The cache hit ratio measures how many content requests the buffer cache is able to handle compared to how many requests it receives. A cache hit is a request that is successfully handled and a miss is one that is not. A miss will go beyond the cache to the file system to fulfill the request.

So if you have 100 cache hits and 2 misses, you’ll have a cache hit ratio of 100/102 which equals 98%.

For normal operations of Postgres and performance, you’ll want to have your Postgres cache hit ratio about 99%.

If you see your cache hit ratio below that, you may need to look at moving to an instance with larger memory.

## Index hit

Adding indexes to your database is critical to query and application performance. Indexes are particularly valuable across large tables.

The index hit rate is measured as a ratio or percentage of the total number of queries or query executions that successfully utilize an index versus the total number of queries executed. A higher index hit rate suggests better index utilization and overall query performance.

In general, you are looking for 99%+ on tables larger than 10,000 rows. If you see a table larger than 10,000 with no or low index usage, that’s your best bet on where to start with adding an index.

## Unused indexes

Unused indexes in PostgreSQL refer to indexes that are created on tables but are not actively used. These indexes consume disk space, require maintenance, and can negatively affect the performance.

Here are a few reasons why you should care about unused indexes in Postgres:

* Storage and disk space: Unused indexes occupy disk space that could be better utilized for other purposes. This can result in increased storage costs and reduce the available space for other database objects.
* Performance impact: Indexes incur overhead during data modification operations, such as inserts, updates, and deletes. When there are many unused indexes, these operations take longer because the database must update multiple indexes in addition to the table.
* Slower query execution: Postgres’ query optimizer considers all available indexes when generating an execution plan for a query. If there are unused indexes, the optimizer may spend additional time considering these indexes, leading to suboptimal query plans and slower query execution.
* Maintenance overhead: Maintaining indexes requires resources, including CPU and disk I/O. If you have a large number of unused indexes, these resources are wasted on unnecessary index maintenance tasks.

> **Important:**
>
> Note that you might have indexes that are not used on a primary instance but are used on a replica.

## Bloat

Bloat refers to the accumulation of dead and unused rows in a database, resulting in disk space consumption and performance degradation. It primarily affects databases with high transaction workloads. Postgres’ MVCC system creates multiple versions of a row to handle concurrent transactions. When a row is updated or deleted, a new version is created, while the old version is marked as dead. These dead rows are not immediately removed from the table to preserve transactional integrity and ensure data consistency during concurrent operations.

To reclaim the disk space occupied by dead rows, Postgres periodically performs vacuuming. This process identifies and eliminates dead rows from the table, freeing up the disk space for reuse. Bloat occurs when high transactions generate a substantial number of dead rows between vacuum processes.

We provide a percentage of bloat to show the amount of space taken up by dead rows compared to the total size of the table or index. The bloat displayed is an estimate or approximation. If you need a more data on bloat in your tables, you can use the extension [pgstattuple](https://www.postgresql.org/docs/current/pgstattuple.html), though this can be a resource intensive operation.

**Low Bloat**: Bloat below 50% is generally considered acceptable and does not normally require action. It is still recommended to monitor bloat for further growth and check vacuum configurations and settings.

**High Bloat**: Bloat above 50% suggests a high level of bloat that can begin to severely impact performance and disk space utilization. You may need to consider action, such as performing a manual vacuum operation, or changing vacuum settings, if you notice slow queries or performance issues.

We do not display a bloat percentage for tables under 1GB or with a bloat percentage less than 10%.

## Outlier queries

These are the queries with the highest proportional execution time. This may
include very slow but relatively infrequent queries, as well as slightly slow
but extremely common queries. The queries with the highest proportional
execution time are the best starting point for database query tuning at the
application level or indexing.

## Long running queries

Long-running queries in PostgreSQL can have several negative implications for
your database and application. Here are some reasons why long-running queries
are generally considered undesirable:

* Performance impact: Long-running queries tie up database resources, including
  CPU, memory, and disk I/O, for an extended period.
* Increased contention: Long-running queries can lead to increased contention
  for shared resources, such as locks and concurrent access to database objects.
* Reduced throughput: When a query takes a long time to complete, it can limit
  the number of queries that can be executed within a given timeframe.
* Poor user experience: If your application relies on timely query execution,
  long-running queries can negatively impact user experience. Users may
  experience delays or unresponsiveness, leading to frustration and
  dissatisfaction with your application.
* Resource exhaustion: Long-running queries can consume excessive memory,
  leading to increased memory usage and potential out-of-memory errors. They can
  also generate large temporary files on disk, potentially causing disk space
  issues.

## Vacuum

The insights panel also includes vacuum statistics. You can check on the table names, the last vacuum and last autovacuum. You can also get insights on how many dead rows exist, when vacuum last cleaned up dead rows, and more.

Vacuum statistics include:

* Table name
* Last vacuum: last time a manual vacuum operation was run
* Last autovacuum: last time autovacuum ran
* Row count: total row count for the table
* Dead row count: number of un-vacuumed / dead rows in the table presently
* Scale factor: the current scale factor set in the autovacuum settings
* Threshold: the total number of rows, using the scale factor, that would require a vacuum operation
* Should vacuum: if you should manually vacuum the table

## Table sizes

Details about your Postgres table sizes is available under Table Sizes in instance insights. This shows table information like:

* table names
* approximate row counts
* total table size
* size of indexes on the table
* number of table bytes in TOAST tables
* raw row table size

## Connections

The connections insight displays all currently active and idle connections in the database instance. Active connections are in a session that is currently connected to the database and is executing a query or waiting to execute one.

Idle connections are common and they aren’t inherently a problem, but they can become an issue depending on your workload and configuration. Idle connections consume memory, so a large number of them can lead to excessive memory usage. High idle connections is typically an indication that the database would benefit from connection pooling.

Each running session has a `pid` which is the process id - a unique identifier assigned to each active backend connection.

To cancel a connection, query, or process but leave the session open use this statement:

```postgres
SELECT pg_cancel_backend(<pid>);
```

A more forceful action, which will close the connection and roll back any transactions, is:

```postgres
SELECT pg_terminate_backend(<pid>);
```

---
title: Snowflake Postgres instance management
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/managing-instances.md
section: Snowflake Postgres
---

# Snowflake Postgres instance management

Snowflake Postgres helps you manage your instances through a variety of instance management operations. These operations are forms of
maintenance that keep your instances operational and secure.

**A brief service interruption is required to perform instance management operations.** Please ensure that your applications are able to
automatically reconnect to the database.

> **Note:**
>
> An instance’s connection string will remain the same across instance management operations unless you explicitly rotate the credentials.

When required to ensure the health of your instance, we may schedule maintenance operations on your behalf (for example, to modify
instance storage size).

For a detailed description of how instance maintenance is carried out by our platform, see [Snowflake Postgres Maintenance](postgres-maintenance.md).

## Available operations

The following operations are available from the Manage dropdown menu on your instance details page in the dashboard:

* Fork - Create a new instance from an existing instance
* Modify - Change the instance size, storage size, or Postgres version of the instance
* Enable High Availability - Enable High Availability for the instance
* Create replica - Create a replica of the instance
* Instance Suspend and Resume - Spin down the Postgres server but retain data on disk
* Refresh - Replace instance, update to latest minor version, get the newest OS version, and enable the latest features
* Restarting services - Restart either PostgreSQL or the entire underlying server
* Regenerate credentials - Regenerate the credentials for the instance

### Fork

You can fork an instance to create a new instance from an existing instance, optionally choosing a point in time to fork from. By default
the new instance will be forked from the current state of the source instance. Read more about forking in [Snowflake Postgres point-in-time recovery](postgres-point-in-time-recovery.md).

### Modify

To make a change to an existing Snowflake Postgres instance, you must use a role that has been granted the OWNERSHIP or OPERATE privilege on that instance.

You can resize an instance in-place with minimal impact and no changes to your connection string. During an instance resize, you can:

* Change the [COMPUTE_FAMILY](postgres-instance-sizes.md) to a different size.
* Change the amount of storage. Both increases and decreases in storage size are supported.
* Upgrade the Postgres version to a newer major version.

Modifying your instance’s resource configuration or major version requires a failover maintenance. See
[Snowflake Postgres maintenance failover](postgres-maintenance.md) for more information.

To make a change:

SnowsightSQL

> 1. In the navigation menu, select Postgres.
> 2. Select your instance.
> 3. In the Manage menu at the top right, select Modify.
> 4. Select the new COMPUTE_FAMILY and/or storage size from the dropdown menus. See Postgres major version upgrades for more
>    information about changing the Postgres version.
> 5. Select the Save button to confirm the changes.

If you have a maintenance window set for your instance the upgrade maintenance failover will proceed during the next window after
the replacement instance is ready. If you do not have a maintenance window set for your instance the upgrade maintenance failover
will proceed as soon as the replacement instance is ready.

> Use the [ALTER POSTGRES INSTANCE](../../sql-reference/sql/alter-postgres-instance.md) command to make changes to the configuration of a Snowflake Postgres instance.
>
> **Modifying a Postgres instance examples**
>
> Change an existing instance’s COMPUTE_FAMILY to STANDARD_M and storage size to 100GB in a single operation:
>
> ```sqlexample
> ALTER POSTGRES INSTANCE my_instance
>   SET COMPUTE_FAMILY = 'STANDARD_M'
>       STORAGE_SIZE_GB = 100;
> ```
>
> If you have a maintenance window set for your instance, the required maintenance failover will proceed during the next maintenance window
> to occur after the replacement instance is ready. To instead have the maintenance proceed as soon as the replacement instance is ready
> use APPLY IMMEDIATELY:
>
> ```sqlexample
> ALTER POSTGRES INSTANCE my_instance
>   SET COMPUTE_FAMILY = 'STANDARD_M'
>       STORAGE_SIZE_GB = 100
>   APPLY IMMEDIATELY;
> ```

Alternatively, you can use an APPLY ON ‘<timestamp>’ clause to specify a future date or timestamp up to three days from the current
for the maintenance failover to proceed.

> **Note:**
>
> If your instance does not have a maintenance window set and you do not use an APPLY IMMEDIATELY or APPLY ON ‘timestamp’ clause, the
> maintenance failover will proceed as if APPLY IMMEDIATELY were used.

If you plan to decrease the storage size of your instance, please note that we currently allow the resize to be greater than or equal to 1.4x
the current disk usage to reduce alerting and immediate resizing up.

> **Important:**
>
> COMPUTE_FAMILY and STORAGE_SIZE_GB changes made to a primary instance are **not** also applied to any present read replicas. They require
> their own Modify operations.
>
> COMPUTE_FAMILY and STORAGE_SIZE_GB changes **are** also applied to HA standbys if HA is enabled for the given instance.
> HA standby instance replacements for these operations always happen as soon as their replacement instances are ready since that does
> not require a downtime for their primary servers.

> **Note:**
>
> For details on how to track the progress of an ongoing Modify operation see the [DESCRIBE POSTGRES INSTANCE usage notes](../../sql-reference/sql/desc-postgres-instance.md).

#### Postgres major version upgrades

Changes to an instance’s Postgres major version work via a [Snowflake Postgres maintenance failover](postgres-maintenance.md)
operation just as with other Modify operations, but there are some important differences where HA and read replica instances are concerned.

Postgres major version upgrade operations can only be applied to primary instances. When a primary instance undergoes a major version
upgrade, the same upgrade is applied to any present read replica and HA instances by rebuilding them from a fresh backup of the primary
instance taken **after** the primary’s upgrade is complete.

This means that during the time it takes to run a fresh, post-upgrade backup of the primary and build a new HA and/or read replica instances
from that backup:

* The primary will not have a valid HA instance present.
* While they will remain accessible, read replicas will have stale data since they will not replicate from the primary until their
  replacement instances are ready.

For more details on Postgres major version upgrade operations, see [Postgres major version upgrades](postgres-upgrades.md).

### Enable High Availability

When High Availability (HA) is enabled, your instance includes a standby host that replaces the primary if your primary
becomes unavailable. You can read more about this in [Snowflake Postgres High Availability](high-availability.md).

### Create replica

You can create a replica of your instance from the dashboard. A replica is a read-only copy of the source instance that is kept in sync
with the source instance. Find about more about creating and using replicas in [Snowflake Postgres Read Replicas](postgres-create-replica.md).

### Instance suspend and resume

#### Suspend

Suspending an instance deactivates the virtual machine that it’s running on while keeping its disk image in storage so that the instance can be resumed.
Normal billing for the instance is suspended, but storage costs will continue to accrue. The existing 10 days’ worth of backups are also retained.

If there were operations that were pending restart to be applied, they will be applied when the instance is resumed.

To suspend or resume a Snowflake Postgres instance, you must use a role that has been granted the OWNERSHIP or OPERATE privilege on the instance.

SnowsightSQL

Snowflake Postgres allows you to suspend your instance from the dashboard.

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right, select Suspend.
4. Click the Suspend button to confirm the action.

To suspend a Snowflake Postgres instance, run the [ALTER POSTGRES INSTANCE](../../sql-reference/sql/alter-postgres-instance.md) command with the SUSPEND option. For example:

```sqlexample
ALTER POSTGRES INSTANCE instance_that_definitely_exists SUSPEND;
ALTER POSTGRES INSTANCE IF EXISTS instance_that_might_exist SUSPEND;
```

* These operations are asynchronous. You can use the DESCRIBE POSTGRES INSTANCE command to track the status of these operations.

**Example: Suspend a Snowflake Postgres instance named my_instance**

```sqlexample
ALTER POSTGRES INSTANCE my_instance SUSPEND;
```

#### Resume

You can resume a suspended instance at any time. The time it takes to resume an instance depends on the instance and the size of the dataset.
When you resume an instance, normal billing and backups will also recommence.

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right, select Resume.
4. Click the Resume button to confirm the action.

To resume a Snowflake Postgres instance, run ALTER POSTGRES INSTANCE … RESUME:

```sqlsyntax
ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> RESUME
```

These operations are asynchronous. The DESCRIBE command may be used to track the status of these operations.

**Example: Resume a Snowflake Postgres instance named my_instance**

```sqlexample
ALTER POSTGRES INSTANCE my_instance RESUME;
```

### Refresh

Refresh is a instance [maintenance operation](postgres-maintenance.md) that will replace your instance without making any changes
to its configured resources. Use this to ensure your instance has up-to-date OS security patches, the latest Postgres minor version for its
given major version, and works properly with the latest Snowflake Postgres features.

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right, select Refresh.
4. If you want the Refresh maintenance failover to occur as soon as the replacement server is ready, select
   Bypass maintenance Window and apply immediately.
5. Click the Refresh button to confirm the action.

To run an instance Refresh via SQL use ALTER POSTGRES INSTANCE with the COMPUTE_FAMILY value matching its current value. For
example, if you have a STANDARD_M instance named `myinstance` use this to run a Refresh maintenance and have the maintenance’s
failover operation happen during the first maintenance window after the replacement server is ready:

```sqlexample
ALTER POSTGRES INSTANCE myinstance
  SET COMPUTE_FAMILY = STANDARD_M;
```

Use this to have the Refresh maintenance failover to occur as soon as the replacement server is ready instead of waiting for
its next maintenance window if it has one set:

```sqlexample
ALTER POSTGRES INSTANCE myinstance
  SET COMPUTE_FAMILY = STANDARD_M
  APPLY IMMEDIATELY;
```

> **Note:**
>
> For details on how to track the progress of an ongoing Refresh operation, see the [DESCRIBE POSTGRES INSTANCE usage notes](../../sql-reference/sql/desc-postgres-instance.md).

### Restarting services

You can restart either PostgreSQL or the underlying server that runs your Postgres instance if needed. This type of instance management
operation restarts the server in-place, without creating a replica or performing a fail-over. Read more about restarting services in
[Snowflake Postgres maintenance restart](postgres-maintenance.md).

### Regenerate credentials

Regenerating credentials will return a new connection string for your database instance, replacing the existing credentials. Read more about
this topic in [Snowflake Postgres Roles](postgres-roles.md).

## Custom configuration parameters

You can make changes to many of Postgres’s own server settings for your Snowflake Postgres instances. You can see the list of available
configuration parameters in [Snowflake Postgres Server Settings](postgres-server-settings.md).

To change the Postgres settings on a Snowflake Postgres instance, you must use a role that has been granted the OWNERSHIP or OPERATE privilege on that instance.

To make a change:

SnowsightSQL

1. In the navigation menu, select Postgres
2. Select your instance
3. On the right side of the page select the edit icon next to Custom parameters
4. Choose configuration parameters from the list, or use the search box to find specific parameters.
5. Enter the new value for the configuration parameter.
6. When you’ve finished add new values for parameters, click Continue to review, and then click Submit to confirm the changes.

To specify changes to the [Postgres settings](postgres-server-settings.md) for the instance,
run the [ALTER POSTGRES INSTANCE](../../sql-reference/sql/alter-postgres-instance.md) command with the SET POSTGRES_SETTINGS option.

With the POSTGRES_SETTINGS option, you specify a JSON-formatted string with the following structure:

```none
'{"component:name" = "value", ...}'
```

Changes to some of the Postgres settings may require an instance restart to take effect. These changes will not take effect
unless you specify APPLY IMMEDIATELY in the ALTER POSTGRES INSTANCE statement. For the list of settings that require a restart,
consult the table in [Postgres settings](postgres-server-settings.md).

**Example: Set the work_mem configuration parameter to 128MB for a Snowflake Postgres instance named my_instance**

```sqlexample
ALTER POSTGRES INSTANCE my_instance SET POSTGRES_SETTINGS = ( 'work_mem' = '128MB' );
```

## Instance states

Any instance management operation, whether it’s creating a new instance or modifying an existing
one, takes some time to complete. The exact duration depends on many factors, including your
data and schema sizes, and how busy your instance is. An instance’s state gives
you insight into the progress of an ongoing operation. It is shown in the dashboard, or you can
check it by running the `DESCRIBE POSTGRES INSTANCE` command.

Possible instance states are listed below. During an instance modification operation, the
replacement instance goes through all of the states listed in the first table. A new instance
being created goes through some but not all of the states listed. The following table lists
some additional states you might see during normal operations.

**States seen during create, modify, and fork:**

| State | What’s happening | Typical duration | Next state |
| --- | --- | --- | --- |
| **Creating** | A new underlying server is being created | 1-2 minutes | Restoring |
| **Restoring** | Latest base backup is being restored to the server | Variable | Starting |
| **Starting** | Postgres is being started on the instance and WAL that accumulated during base backup is being applied | Variable | Replaying |
| **Replaying** | Accumulated WAL since last base backup is being replayed | Variable | Finalizing |
| **Finalizing** | Instance configuration is being finalized and the server is being made available | 1-2 minutes | Ready |
| **Ready** | New instance matches source instance and is ready for the operation to proceed. If scheduled for an upcoming maintenance window, the instance is kept `Ready` until that time. If scheduled for now, the operation proceeds once it reaches `Ready`. Running instances normally show the `Ready` state. | N/A | N/A |

**Other instance states that you might see on the platform:**

| State | What’s happening | Typical duration | Next state |
| --- | --- | --- | --- |
| **Restarting** | Underlying server is being restarted | 1-2 minutes | Ready |
| **Resuming** | A new server is being built and a suspended instance is being resumed | 3-5 minutes | Ready |
| **Suspending** | Instance is being suspended | 3-5 minutes | Suspended |
| **Suspended** | Instance is currently suspended | Until resumed | Resuming |

---
title: Snowflake Postgres Instance Sizes
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-instance-sizes.md
section: Snowflake Postgres
---

# Snowflake Postgres Instance Sizes

Snowflake Postgres offers three tiers of instances — Burstable, Standard, and Memory — to cover a variety of use cases.

For credit costs for each instance size, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

In general:

* **Burstable** instances have a baseline CPU level but can temporarily burst above this baseline.
* **Standard** instances have a good balance of CPU and memory.
* **Memory-optimized** instances have a higher ratio of memory to CPU, which may improve performance for workloads with greater
  memory needs.

## Burstable

*Important notes*

* Burstable instances can be provisioned with a maximum of 100GB storage.
* Burstable instances have burstable vCPUs. Utilization in excess of the CPU baseline shown below will deplete available vCPU
  credits, leading to CPU rate limiting. This may appear as a sudden downgrade in performance with no other cause.
* Burstable instances do not support High Availability standbys.

| Name | Cores | Memory | IOPS | HA supported |
| --- | --- | --- | --- | --- |
| BURST_XS | 2 | 1GB | 11,800 | No |
| BURST_S | 2 | 2GB | 11,800 | No |
| BURST_M | 2 | 4GB | 11,800 | No |

## General purpose

| Name | Cores | Memory | IOPS | HA supported |
| --- | --- | --- | --- | --- |
| STANDARD_M | 1 | 4GB | 20,000 | Yes |
| STANDARD_L | 2 | 8GB | 40,000 | Yes |
| STANDARD_XL | 4 | 16GB | 40,000 | Yes |
| STANDARD_2XL | 8 | 32GB | 40,000 | Yes |
| STANDARD_4XL | 16 | 64GB | 40,000 | Yes |
| STANDARD_8XL | 32 | 128GB | 40,000 | Yes |
| STANDARD_12XL | 48 | 192GB | 60,000 | Yes |
| STANDARD_24XL | 96 | 384GB | 78,000 | Yes |

> **Note:**
>
> The STANDARD_M instance size is not available on Microsoft Azure.

## Memory optimized

| Name | Cores | Memory | IOPS | HA supported |
| --- | --- | --- | --- | --- |
| HIGHMEM_L | 2 | 16GB | 40,000 | Yes |
| HIGHMEM_XL | 4 | 32GB | 40,000 | Yes |
| HIGHMEM_2XL | 8 | 64GB | 40,000 | Yes |
| HIGHMEM_4XL | 16 | 128GB | 40,000 | Yes |
| HIGHMEM_8XL | 32 | 256GB | 40,000 | Yes |
| HIGHMEM_12XL | 48 | 384GB | 78,000 | Yes |
| HIGHMEM_16XL | 64 | 512GB | 78,000 | Yes |
| HIGHMEM_24XL | 96 | 768GB | 78,000 | Yes |
| HIGHMEM_32XL | 128 | 1TB | 78,000 | Yes |
| HIGHMEM_48XL | 192 | 1.5TB | 78,000 | Yes |

---
title: Snowflake Postgres logging
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-logging.md
section: Snowflake Postgres
---

# Snowflake Postgres logging

All Postgres servers on Snowflake Postgres instances log locally to syslog. The log data is collected
from there and sent to your account’s active
[event table](../../developer-guide/logging-tracing/event-table-setting-up.md). In addition to logs,
Snowflake Postgres automatically collects instance-level metrics (CPU, memory, disk, network, and
Postgres-specific counters). For details, see [Snowflake Postgres metrics](postgres-metrics.md).

## Retrieving Postgres log data

To view the Postgres logs for a given instance, query the event table for the TIMESTAMP field and the MESSAGE portion of the VALUE field of
the records with `record_type = 'LOG'` using the initial portion of the instance’s hostname, aka the instance ID (or `cluster_id`).
For example, this will pull the last 10 minutes of log entries for instance with id `oyrpb2cwtvbu5al5vtbyrsnkgy`:

```sqlexample
SELECT TIMESTAMP, VALUE:MESSAGE as log_line
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE resource_attributes['snowflake.o11y.logtype'] = 'postgres-otelcol-vm-agent'
  AND resource_attributes['instance.id'] = 'oyrpb2cwtvbu5al5vtbyrsnkgy'
  AND record_type = 'LOG'
  AND TIMESTAMP > CURRENT_TIMESTAMP() - INTERVAL '10 MINUTES'
LIMIT 100;
```

> **Note:**
>
> The above query uses the account default event table, SNOWFLAKE.TELEMETRY.EVENTS. If you have set up a custom event table, you should
> adjust the query appropriately.

Each row of the output will contain a single log-line entry that was logged by the Postgres server on the given Snowflake Postgres instance
with the timestamp when it was originally logged. Note that it can take up to a few minutes between the time Postgres makes a log entry
and it is available in the event table.

## Understanding Postgres log-line interleaving

Note that Postgres uses multi-line logging and since multiple Postgres server processes will be making log entries
concurrently, full log entries from different Postgres server processes will often be interleaved. For example, let’s consider these log line
entries:

Postgres log lines example

|  |  |
| --- | --- |
| timestamp | log_line |
| 2025-12-09 23:16:38.760 | “[14-1] [1592908][client backend][27/2][0] [user=snowflake_admin,db=postgres,app=psql] [34.214.158.144] ERROR: canceling statement due to user request” |
| 2025-12-09 23:16:38.760 | “[10-1] [1593992][not initialized][][0] [user=[unknown],db=[unknown],app=[unknown]] [34.214.158.144] LOG: connection received: host=34.214.158.144 port=46114” |
| 2025-12-09 23:16:38.760 | “[14-2] [1592908][client backend][27/2][0] [user=snowflake_admin,db=postgres,app=psql] [34.214.158.144] STATEMENT: select pg_sleep(10);” |
| 2025-12-09 23:16:43.007 | “[15-1] [1592908][client backend][27/3][0] [user=snowflake_admin,db=postgres,app=psql] [34.214.158.144] LOG: AUDIT: SESSION,2,1,MISC,SHOW,,,show log_min_duration_statement,<not logged>” |

In each log line entry:

* The first bracketed values are the command number for the session that ran the command and the log line for that command separated by a hyphen.
  For example, [1-1] and [1-2] would be two log lines from the first command run in a session.
* The second bracketed value is the process ID (pid) for the session that logged the line. Postgres uses a process-based (vs. thread-based) concurrency
  model so each session is run on its own server process.

In this example, you can see that:

* Command 14 was run by the session with pid `1592908` as the cancellation of a `select pg_sleep(10);` query.
* Logging of command 14 by pid `1592908` added two log lines, [14-1] and [14-2].
* A single log line from the 10th command run by the session with pid `1593992` ended up between the two lines from command 14 on
  pid `1592908`.
* The next command run by the session with pid `1592908` was a `show log_min_duration_statement` query and required only one log line,
  [15-1].

> **Tip:**
>
> The Postgres log line format is determined by its [log_line_prefix](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-LINE-PREFIX)
> server setting, which defaults to ‘[%p][%b][%v][%x] %q[user=%u,db=%d,app=%a] [%h]’ on Snowflake Postgres instances.

---
title: Snowflake Postgres Maintenance
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-maintenance.md
section: Snowflake Postgres
---

# Snowflake Postgres Maintenance

## Overview

Maintenance is the process by which a Postgres instance can be updated or have its configuration
changed. In some cases maintenance will be scheduled automatically by the platform, such as when
low disk space triggers a resize operation. Snowflake may also schedule maintenance for an instance
when needed to keep it secure. When maintenance is performed, a Postgres instance will always
receive the latest Postgres minor version, operating system updates, and new features and
functionality.

## How maintenance works

Some maintenance operations can be performed directly on a Postgres instance, such as a simple
restart of the service. Other maintenance operations require a failover to a new instance.

### Restarts

Restarting the Postgres service or the underlying server can be done directly on the Postgres
instance through the Manage menu.

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select your instance from the list to view its details page.
3. In the Manage menu at the top right, hover over Restart and then choose the type of restart needed.

To restart the Postgres service or the underlying server, run the ALTER POSTGRES INSTANCE command
with the RESTART option. For example:

```sqlexample
ALTER POSTGRES INSTANCE my_instance RESTART POSTGRES;
ALTER POSTGRES INSTANCE my_instance RESTART SERVER;
```

> **Tip:**
>
> Restarting the Postgres service is generally faster than restarting the entire instance.

### Failovers

[Modifying](managing-instances.md) the configuration of a Postgres instance requires a failover to apply the changes. You
can modify your instance type, size, storage, and/or upgrade to a newer Postgres major version.

> **Note:**
>
> When maintenance operations require a failover, the new instance will always receive the latest Postgres
> *minor* version, operating system updates, and new features and functionality.

When you initiate changes to your Postgres instance, a fresh instance is created in the background with the new
configuration. During this time, your original instance continues operating in its original state. As the new
instance comes online, it will be synchronized with the source instance. Failover will not happen until the new
instance is ready.

> **Note:**
>
> There is a brief service interruption when a failover occurs, typically lasting from seconds to a few minutes.

If a maintenance window has been set, the new instance will be kept in sync via replication until the maintenance
window arrives, and then the failover will happen. If no maintenance window was set, the platform will begin the
failover to the new instance as soon as it is ready.

> **Tip:**
>
> Failover can be delayed when clients are holding on to connections and performing writes on the source
> instance. The complete write-ahead log (WAL) must be written and archived before a failover can happen. For faster failovers,
> set your maintenance window to occur during a quiet period for your application.

Assuming the failover is successful, the original instance will be removed automatically since it is no
longer needed. If the failover does not succeed for some reason (which can occur, for example, during
a major version upgrade), the operation will be aborted and the original instance will remain
in place.

## Automatic maintenance

The platform will automatically run maintenance to increase the storage on your instance when the
available disk space becomes critically low. Maintenance may also be scheduled to run when a
Postgres major version has been deprecated and an instance has not been upgraded to a newer major
version by the published deadline.

### Automatic disk resizes

Overutilizing storage on a Postgres instance can be operationally dangerous because there might
not be enough disk space for the server to recover in case of an emergency. An instance will be
put into read-only mode when disk usage becomes critical to protect your data while the instance
is automatically resized.

An automatic resize operation will be initiated when the following conditions are met:

* 85% disk usage with less than 50GB remaining
* 90% disk usage

The new storage size is calculated based on the original size:

* 100GB disks will be increased by 50% (for example, 10 GB becomes 15 GB).
* 100GB to 999GB disks will be increased by 25% (for example, 100 GB becomes 125 GB).
* Disks larger than 1000 GB will be increased by 15% (for example, 1000 GB becomes 1150 GB).

> **Tip:**
>
> Ensure your application is set up to automatically reconnect to the database, given that there will be a
> brief service interruption when the failover occurs.

## Checking maintenance status

You can schedule maintenance for your instance by choosing Modify under the Manage menu. When there
is a maintenance operation pending, you can see a banner on the instance details page:

Click the View details button to view more information about the maintenance, such as the
old and new configurations.

---
title: Snowflake Postgres metrics
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-metrics.md
section: Snowflake Postgres
---

# Snowflake Postgres metrics

Snowflake Postgres automatically collects instance metrics and stores them in your account’s active
[event table](../../developer-guide/logging-tracing/event-table-setting-up.md). A monitoring agent
samples metrics approximately every 5 - 30 seconds depending on the metric type and writes them to
`SNOWFLAKE.TELEMETRY.EVENTS` with `RECORD_TYPE = 'METRIC'`.

You can query these metrics directly in Snowflake or forward them to an external observability
platform such as Grafana or Observe.

> **Note:**
>
> For information about querying Postgres *log* data from the event table, see
> [Snowflake Postgres logging](postgres-logging.md).

## Available metrics

### Postgres metrics

| Metric | Type | Description |
| --- | --- | --- |
| `postgres_connections` | gauge | Number of active backend connections |
| `postgres_databases_size_bytes` | gauge | Total size of all databases (bytes) |
| `postgres_wal_size_bytes` | gauge | WAL directory size (bytes) |
| `postgres_log_size_bytes` | gauge | Log directory size (bytes) |
| `postgres_tmp_size_bytes` | gauge | Temp file size (bytes) |
| `postgres_locking_transactions` | gauge | Number of granted locks |
| `postgres_locked_transactions` | gauge | Number of waiting/blocked locks |
| `server_version` | gauge | Postgres version as an integer (for example, 180003 = 18.0.3) |

### Postgres process metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `process.cpu.time` | sum | seconds | state (user, system, wait) process.command, process.executable.name, process.owner, process.pid, process.parent_pid |
| `process.memory.usage` | sum | bytes | process.command, process.executable.name, process.owner, process.pid, process.parent_pid |
| `process.memory.virtual` | sum | bytes | process.command, process.executable.name, process.owner, process.pid, process.parent_pid |

> **Note:**
>
> Each Postgres process will have one `process.cpu.time` row for each CPU state and one for each of `process.memory.usage` and
> `process.memory.virtual`.
>
> The `process.*` dimension attributes are found in each row’s `resource_attributes` column. As with the `state` values for other
> metrics, the `state` dimension attributes are in the `record_attributes` column.

### CPU metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `system.cpu.time` | sum | seconds | state: user, system, wait, idle, nice, interrupt, softirq, steal cpu: cpu# |
| `system.cpu.load_average.1m` | gauge | threads | -– |
| `system.cpu.load_average.5m` | gauge | threads | -– |
| `system.cpu.load_average.15m` | gauge | threads | -– |

> **Note:**
>
> Each cpu# (such as cpu0 and cpu2) will have one `system.cpu.time` row for each CPU state.
>
> `system.cpu.time` is a cumulative counter. To get a percentage, compute the delta between
> consecutive samples and divide by the elapsed interval.

### Memory metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `system.memory.usage` | sum | bytes | state: used, free, cached, buffered, slab_reclaimable, slab_unreclaimable |

> **Note:**
>
> One `system.memory.usage` row for each state.

### Disk metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `system.filesystem.usage` | sum | bytes | mountpoint, device, state (used, free), type, mode |

> **Note:**
>
> One `system.filesystem.usage` row for each state.

### Network metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `system.network.io` | sum | bytes | device, direction (transmit, receive) |

> **Note:**
>
> Each device (‘eth0’ and ‘lo’) will have one `system.network.io` row for each direction.

### Paging metrics

| Metric | Type | Unit | Dimensions |
| --- | --- | --- | --- |
| `system.paging.usage` | sum | bytes | device, state (used, free) |

> **Note:**
>
> One `system.paging.usage` row for each state.

## Resource attributes

Every metric row includes the following fields in `RESOURCE_ATTRIBUTES`:

| Attribute | Description | Example |
| --- | --- | --- |
| `instance_id` | Postgres instance identifier | `4jypgsndvzd5ta6ufaryx6owja` |
| `host_name` | Server host name | `df6m4y5m5fgfpb5idy2pj67xrm` |
| `host.id` | EC2 instance ID | `i-0f6724aef472706a3` |
| `host.type` | Instance family | `m8g.medium` |
| `cloud.region` | AWS region | `us-west-2` |
| `cloud.availability_zone` | Availability zone | `us-west-2b` |
| `application` | Always `postgres` | `postgres` |
| `os.type` | Always `linux` | `linux` |

## Querying metrics

A given Snowflake Postgres instance can have multiple servers running at any given time, such as a primary server and its HA server or
an upgrade replacement waiting for the instance’s maintenance window to be swapped into place. Since each of these servers will report
metrics for the instance’s given `instance_id` you also need the server `host_name` for the instance’s currently active server.

To find your Postgres instance’s `instance_id`, use [DESCRIBE POSTGRES INSTANCE](../../sql-reference/sql/desc-postgres-instance.md):

```sqlexample
DESCRIBE POSTGRES INSTANCE my_instance
  ->> SELECT "value"
      FROM $1
      WHERE "property" = 'host';
```

The instance’s `instance_id` is the first segment of the returned `host` value (everything before the first period).

> **Note:**
>
> You can use the `host` column of the SHOW POSTGRES INSTANCES command’s output to see the instance host values for all running Snowflake
> Postgres instances on your account.

To find the instance’s current server `host_name`, use a simple DNS CNAME lookup of the instance’s `host` value.

Let’s say the returned `host` value was ‘4jypgsndvzd5ta6ufaryx6owja.sfengineering-pgtest.preprod.us-west-2.aws.postgres.snowflake.app’
(so we know the instance’s `instance_id` is ‘4jypgsndvzd5ta6ufaryx6owja’).

Here is an example using the `dig` CLI utility to do the DNS CNAME lookup:

```bash
$ dig cname +short 4jypgsndvzd5ta6ufaryx6owja.sfengineering-pgtest.preprod.us-west-2.aws.postgres.snowflake.app
df6m4y5m5fgfpb5idy2pj67xrm.4jypgsndvzd5ta6ufaryx6owja.sfengineering-pgtest.preprod.us-west-2.aws.postgres.snowflake.app.
```

And here is an example using Python’s `dns.resolver` module:

```python
>>> import dns.resolver
>>> answer = dns.resolver.resolve('4jypgsndvzd5ta6ufaryx6owja.sfengineering-pgtest.preprod.us-west-2.aws.postgres.snowflake.app', 'CNAME')
>>> print(answer[0].target.to_text())
df6m4y5m5fgfpb5idy2pj67xrm.4jypgsndvzd5ta6ufaryx6owja.sfengineering-pgtest.preprod.us-west-2.aws.postgres.snowflake.app.
```

The `host_name` value is the first segment of that returned value, ‘df6m4y5m5fgfpb5idy2pj67xrm’ in the above examples.

The following query returns the most recent value for each metric collected in the last 5 minutes:

```sqlexample
SELECT TIMESTAMP as time,
  RECORD['metric']['name']::VARCHAR as metric,
  RESOURCE_ATTRIBUTES,
  RECORD_ATTRIBUTES,
  ROUND(VALUE::FLOAT, 2) AS value
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE RESOURCE_ATTRIBUTES['application'] = 'postgres'
  AND record_type = 'METRIC'
  AND RESOURCE_ATTRIBUTES['instance_id']::VARCHAR = '<your_instance_id>'
  AND RESOURCE_ATTRIBUTES['host_name']::VARCHAR = '<instance_current_host_name>'
  AND TIMESTAMP > CURRENT_TIMESTAMP() - INTERVAL '5 MINUTES'
QUALIFY ROW_NUMBER() OVER (PARTITION BY record, record_attributes ORDER BY timestamp desc, record, record_attributes) = 1
ORDER BY timestamp desc, metric, record_attributes;
```

> **Note:**
>
> The above query uses the account default event table, `SNOWFLAKE.TELEMETRY.EVENTS`. If you’ve
> set up a custom event table, adjust the query appropriately.

### Example metric queries

#### Active connections

```sqlexample
SELECT
    TIMESTAMP,
    VALUE::FLOAT AS connections
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE RECORD_TYPE = 'METRIC'
  AND RECORD['metric']['name']::VARCHAR = 'postgres_connections'
  AND RESOURCE_ATTRIBUTES['instance_id']::VARCHAR = '<your_instance_id>'
  AND RESOURCE_ATTRIBUTES['host_name']::VARCHAR = '<instance_current_host_name>'
  AND TIMESTAMP > CURRENT_TIMESTAMP() - INTERVAL '1 hour'
ORDER BY TIMESTAMP DESC;
```

#### Memory usage by state

```sqlexample
SELECT
    TIMESTAMP,
    RECORD_ATTRIBUTES['state']::VARCHAR AS state,
    ROUND(VALUE::FLOAT / (1024*1024*1024), 2) AS usage_gb
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE RECORD_TYPE = 'METRIC'
  AND RECORD['metric']['name']::VARCHAR = 'system.memory.usage'
  AND RECORD_ATTRIBUTES['state']::VARCHAR IN ('used', 'cached', 'buffered', 'free')
  AND RESOURCE_ATTRIBUTES['instance_id']::VARCHAR = '<your_instance_id>'
  AND RESOURCE_ATTRIBUTES['host_name']::VARCHAR = '<instance_current_host_name>'
  AND TIMESTAMP > CURRENT_TIMESTAMP() - INTERVAL '1 hour'
ORDER BY TIMESTAMP DESC;
```

#### CPU load averages

```sqlexample
SELECT
    TIMESTAMP,
    RECORD['metric']['name']::VARCHAR AS metric,
    VALUE::FLOAT AS load_avg
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE RECORD_TYPE = 'METRIC'
  AND RECORD['metric']['name']::VARCHAR IN (
      'system.cpu.load_average.1m',
      'system.cpu.load_average.5m',
      'system.cpu.load_average.15m'
  )
  AND RESOURCE_ATTRIBUTES['instance_id']::VARCHAR = '<your_instance_id>'
  AND RESOURCE_ATTRIBUTES['host_name']::VARCHAR = '<instance_current_host_name>'
  AND TIMESTAMP > CURRENT_TIMESTAMP() - INTERVAL '1 hour'
ORDER BY TIMESTAMP;
```

#### Database size

```sqlexample
SELECT
    TIMESTAMP,
    ROUND(VALUE::FLOAT / (1024*1024), 1) AS size_mb
FROM SNOWFLAKE.TELEMETRY.EVENTS
WHERE RECORD_TYPE = 'METRIC'
  AND RECORD['metric']['name']::VARCHAR = 'postgres_databases_size_bytes'
  AND RESOURCE_ATTRIBUTES['instance_id']::VARCHAR = '<your_instance_id>'
  AND RESOURCE_ATTRIBUTES['host_name']::VARCHAR = '<instance_current_host_name>'
  AND TIMESTAMP > DATEADD('hour', -1, CURRENT_TIMESTAMP())
ORDER BY TIMESTAMP DESC
LIMIT 1;
```

## Forwarding metrics to external tools

Because metrics are stored in a standard Snowflake table, you can forward them to any observability
platform that supports a Snowflake connection. For step-by-step setup with specific tools, see:

* [Monitor Snowflake Postgres with Grafana](https://www.snowflake.com/en/developers/guides/snowflake-postgres-monitoring-grafana/)
* [Monitor Snowflake Postgres with Observe](https://www.snowflake.com/en/developers/guides/snowflake-postgres-logs-to-observe/)

---
title: Snowflake Postgres networking
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-network.md
section: Snowflake Postgres
---

# Snowflake Postgres networking

By default, Snowflake Postgres will provision each new instance inside a new private network in the cloud region you have
selected. Each network is separate and private from other networks in the same cloud region.

By default, Snowflake Postgres instances do not allow incoming connections. Traffic to/from your Snowflake Postgres instances can be
enabled in either of these two ways:

* Attach a network policy containing Postgres ingress and/or egress network rules. This option is available for all accounts.
* Configure Private Link connections to/from cloud vendor private networks. This option is available for Business Critical edition or
  above accounts.

## Snowflake Postgres network policies and rules

[Network policies](../network-policies.md) and [network rules](../network-policies.md) for Snowflake Postgres
instances function much the same as they do for other Snowflake resources with a few key differences:

* Network policies do not need to be [activated](../network-policies.md) to be used with Snowflake Postgres instances
  in the same way they are for Snowflake accounts, users, and other security integrations. Network policies for Snowflake Postgres instances
  are instead attached to the instances directly at instance creation time. Existing instances can also have their network policies changed.
* Snowflake Postgres instances only use the ALLOWED_NETWORK_RULE_LIST and BLOCKED_NETWORK_RULE_LIST properties of network policies.
  The BLOCKED_IP_LIST and ALLOWED_IP_LIST properties are ignored.
* Network rules for Snowflake Postgres instances should use either the Postgres Ingress or Postgres Egress modes. Rules using these modes
  are currently limited to type IPv4.
* Network rules using other modes other than Postgres Ingress or Postgres Egress in a network policy are ignored by Snowflake Postgres
  instances that use them.

> **Warning:**
>
> Snowflake recommends making your network policies as restrictive as is practical. Applying a policy with a `0.0.0.0/0` networking rule will make the server open to connections from anywhere on the internet. For this reason, Snowflake recommends against using policies with `0.0.0.0/0` rules for your Snowflake Postgres instances.

### Privileges

* To create new network policies, Snowflake users must have the CREATE NETWORK POLICY privilege on the account.
* To create new network rules, Snowflake users must have the CREATE NETWORK RULE privilege on the schema in which they want to
  create the rules.
* To attach an existing network policy to a Snowflake instance, Snowflake users must own the network policy or the policy’s owner must
  [GRANT](../../sql-reference/sql/grant-privilege.md) usage on it.

### Snowflake Postgres network policy and rules example

Let’s say that:

* You want to allow incoming traffic to a new Postgres instance from your office, and your office network router’s public IP address is
  `23.206.171.35`.
* You also want to allow outgoing traffic from the new Postgres instance to your office Postgres server via a Postgres Foreign Data Wrapper
  connection.

For this we’ll create a new policy with both a Postgres Ingress network rule and a Postgres Egress network rule.

SnowsightSQL

1. [Create two new network rules](../network-policies.md). Use `23.206.171.35/32` as the sole network identifier for both, and use “Postgres Ingress” as the Mode for one and “Postgres Egress” for Mode of the other.
2. [Create a new network policy](../network-policies.md) with both new rules included in its Allowed list.
3. In the navigation menu, select Postgres.
4. Select + Create.
5. When selecting your desired instance configuration details make sure to select your new policy under Network policy select box. In the image below we have selected the policy that we named `OFFICE POLICY EXAMPLE`.

```sqlexample
-- Create the ingress rule
CREATE NETWORK RULE PG_INGRESS_FROM_OFFICE
  TYPE = IPV4
  VALUE_LIST = ('23.206.171.35/32')
  MODE = POSTGRES_INGRESS;

-- Create the egress rule
CREATE NETWORK RULE PG_EGRESS_TO_OFFICE
  TYPE = IPV4
  VALUE_LIST = ('23.206.171.35/32')
  MODE = POSTGRES_EGRESS;

-- Create a new policy using both rules in its allowed list
CREATE NETWORK POLICY "OFFICE POLICY EXAMPLE"
  ALLOWED_NETWORK_RULE_LIST = ('PG_INGRESS_FROM_OFFICE', 'PG_EGRESS_TO_OFFICE')
  COMMENT = 'Traffic to/from the office.';

-- Create a new Snowflake Postgres instance that uses the new policy
CREATE POSTGRES INSTANCE SNOWFLAKE_POSTGRES_DEMO
  COMPUTE_FAMILY = 'STANDARD_L'
  STORAGE_SIZE_GB = 50
  AUTHENTICATION_AUTHORITY = POSTGRES
  POSTGRES_VERSION = 17
  NETWORK_POLICY = '"OFFICE POLICY EXAMPLE"';
```

### Creating ingress rules at instance creation time

Instead of creating your network policy and rules before creating your Snowflake Postgres instance, you can create a
policy with Postgres ingress rules when creating Snowflake Postgres instances via Snowsight.

1. In the navigation menu, select Postgres.
2. In the Postgres Instances page, select the Create button at the top right.
3. Choose your instance configuration but leave the Network policy choice blank.
4. After you select the Create, a new dialog displays the `snowflake_admin` Postgres user’s
   [connection credentials](connecting-to-snowflakepg.md). After saving those credentials in a secure location,
   select Continue to network settings.
5. In the Network Settings dialog (shown below) enter the IP address and/or CIDR values you wish to create Postgres ingress
   rules for, pressing enter to add each one to the list.
6. Expand the Details section to edit your new network rule and/or policy names if needed.
7. Select Save to create your new Postgres ingress network policy and have it automatically attached to your instance once it is active.

## Snowflake Postgres Private Link

Private Link for Snowflake Postgres instances is available for Business Critical edition accounts and above.

To enable Private Link for a Snowflake Postgres instance, start by following the instructions to enable Private Link between your cloud
vendor account and your Snowflake account:

* [AWS PrivateLink and Snowflake](../admin-security-privatelink.md)
* [Azure Private Link and Snowflake](../privatelink-azure.md)
* [Google Cloud Private Service Connect and Snowflake](../private-service-connect-google.md)

### Privileges

To enable Private Link for Snowflake Postgres instances, Snowflake users must have the following privileges.

* MANAGE POSTGRES PRIVATE CONNECTIVITY ON ACCOUNT
* OWNERSHIP or MANAGE for each given Snowflake Postgres instance

### Setting up Private Link for Snowflake Postgres instances

Once you have Private Link enabled between your cloud vendor and Snowflake accounts and the required privileges, you can enable Private Link
for Snowflake Postgres instances on a per-instance basis as follows.

SnowsightSQL

If you do not intend to set up any network policy rules for your instance in addition to your Private Link connection, select
Private Link for the Network Security option in the New instance dialog. If you do want to set up or use a network
policy select Network policy instead and follow the previous instructions on network policies.

Once an instance is active you can enable Private Link for it:

1. In the navigation menu, select Postgres and select your instance.
2. In the instance’s Instance details pane, select the edit icon in the Private Link section.
3. A confirmation dialog is shown asking you to confirm setting up Private Link for your cloud service provider. Select Enable.
   Note that this step can take up to 10 minutes to complete.

Once Private Link is active for your Snowflake Postgres instance you can establish new Private Link connections for it:

1. In the navigation menu, select Postgres and select your instance to see its details page.
2. Select the edit icon in the Private Link section to the right to expand the Private Link pane (shown below).
3. Use the displayed Service address to make a Private Link connection request from the private network on your cloud
   vendor account.
4. Refresh your Snowflake Postgres instance’s details page. The Private Link pane will now have a new connection entry
   for your request with neither the check mark (accept) nor x mark (reject) selected. Select the check mark
   to accept.
5. You can now connect to your Snowflake Postgres instance from hosts in cloud service provider’s private network.

You can enable Private Link for an active instance with Snowflake SQL as follows:

```sqlexample
ALTER POSTGRES INSTANCE <name> ENABLE PRIVATELINK;
```

That asynchronous operation can take up to 10 minutes. To track its status check the value of the `privatelink_service_identifier`
returned by DESCRIBE POSTGRES INSTANCE:

```sqlexample
DESCRIBE POSTGRES INSTANCE <name>;
```

The same `privatelink_service_identifier` is shown for the instances entry in the output of SHOW POSTGRES INSTANCES:

```sqlexample
SHOW POSTGRES INSTANCES;
```

When that `privatelink_service_identifier` column shows a non-NULL value you can use that identifier to make a Private Link connection
request from the private network on your cloud service provider account you have enabled for Private Link connections to your Snowflake
account.

After making that connection request from your cloud vendor account’s private network find the request for the Snowflake Postgres
instance:

```sqlexample
SHOW PRIVATELINK CONNECTIONS IN POSTGRES INSTANCE <name>;
```

This command returns the following columns:

* `endpoint`
* `connection_id`
* `status`

Your connection request will be an entry with your cloud vendor private network’s Private Link `endpoint` value and a `status` value
of `pending`.

You can accept one or more pending Private Link connection requests by running an ALTER POSTGRES INSTANCE command:

```sqlexample
ALTER POSTGRES INSTANCE [IF EXISTS] <name> AUTHORIZE PRIVATELINK CONNECTIONS = ('<connection_id' [ , ... ]);
```

You can revoke one or more pending or previously approved Private Link connection requests by running this command:

```sqlexample
ALTER POSTGRES INSTANCE [IF EXISTS] <name> REVOKE PRIVATELINK CONNECTIONS = ('<connection_id' [ , ... ]);
```

### Connecting to Snowflake Postgres instances over Private Links

Instead of using the Snowflake Postgres instance’s `hostname`, connections to Snowflake Postgres instances via Private Link setups should
be made using the DNS hostname configured on your cloud service provider’s private network for the Private Link.

---
title: Snowflake Postgres point-in-time recovery
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-point-in-time-recovery.md
section: Snowflake Postgres
---

# Snowflake Postgres point-in-time recovery

## Overview

Snowflake Postgres supports creating **forks** of an instance using point-in-time
recovery (PITR). A fork is a new instance that reflects the state of an existing
instance at a specific time. A fork is similar to a [CLONE](../tables-storage-considerations.md)
operation in Snowflake. However, unlike the CLONE operation, a fork performs a full copy
of all of the origin data.

Because a fork is isolated from the origin instance, any changes you make to the
fork (schema or data) do not affect the origin instance.

Point-in-time recovery is useful when you need to:

* **Recover from accidental changes,** such as dropped tables or incorrect data
  updates.
* **Inspect the historical state of your data** for debugging or auditing.
* **Test application changes** against a realistic copy of production data
  without impacting the origin instance.

Forks are created from the most recent base backup of the origin instance that
exists before a specified time. Write-ahead log (WAL) records from the origin
instance are replayed up to the selected point in time so that the forked instance
is transactionally consistent with the origin instance at that moment in time.

## What is copied to the fork

When you create a fork, the following characteristics are copied from the
origin instance:

* The Postgres version. The version is copied for binary compatibility.
* The high availability setting (enabled or disabled).
* Credentials for accessing the instance.

You can customize some properties for the new instance during creation, such as
the **storage** and **instance size (plan)**. Pricing for the fork is based on
the configuration of the fork (plan, storage, and high availability), just like any
other instance.

### Creating a Fork

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select the instance you want to fork.
3. Under Manage on the Postgres Instance page, select the Fork item and enter the configuration options.
4. Select Fork to create the fork.

To create a Postgres instance as a fork of an origin instance, execute the CREATE POSTGRES INSTANCE command and specify the FORK clause.
The command creates the fork from the origin instance at the point in time specified by the AT or BEFORE clause. If you omit this
clause, the fork is based on the origin instance at the current point in time.

```sqlsyntax
CREATE POSTGRES INSTANCE <name>
  FORK <orig_name>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> } ) ]
  [ COMPUTE_FAMILY = <compute_family> ]
  [ STORAGE_SIZE_GB = <storage_gb> ]
  [ HIGH_AVAILABILITY = { TRUE | FALSE } ]
  [ POSTGRES_SETTINGS = '<json_string>' ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
```

For the command parameters:

`FORK orig_name`
:   Specifies the origin of the fork.

`{ AT | BEFORE } ( { TIMESTAMP => timestamp | OFFSET => time_difference } )`
:   Specifies the point in time to fork from. The timestamp or offset must fall within the 10
    day postgres data retention time.

    Default: Uses current time.

    The [AT | BEFORE](../../sql-reference/constructs/at-before.md) clause accepts one of the following parameters:

    > `TIMESTAMP => timestamp`
    > :   Specifies an exact date and time to use for Time Travel. The value must be explicitly cast to a TIMESTAMP,
    >     TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ data type.
    >
    >     If no explicit cast is specified, the timestamp in the AT clause is treated as a timestamp with the UTC time zone (equivalent to
    >     TIMESTAMP_NTZ). Using the TIMESTAMP data type for an explicit cast may also result in the value being treated as a TIMESTAMP_NTZ
    >     value. For details, see [Date & time data types](../../sql-reference/data-types-datetime.md).
    >
    > `OFFSET => time_difference`
    > :   Specifies the difference in seconds from the current time to use for Time Travel, in the form `-N` where `N`
    >     can be an integer or arithmetic expression (e.g. `-120` is 120 seconds, `-30*60` is 1800 seconds or 30 minutes).
    >
    > Default: Copied from the origin.

    `COMPUTE_FAMILY = compute_family`
    :   Specifies the name of an instance size from the [Snowflake Postgres Instance Sizes](postgres-instance-sizes.md) tables.

        Default: Copied from the origin.

    `STORAGE_SIZE_GB = storage_gb`
    :   Specifies storage size in GB. Must be between 10 and 65,535.

        Default: Copied from the origin.

    `HIGH_AVAILABILITY = { TRUE | FALSE }`
    :   Specifies the high availability setting to be used for the fork.

        Default: Copied from the origin.

    `POSTGRES_SETTINGS = 'json_string'`
    :   Allows you to optionally set Postgres configuration parameters on your instance in JSON format. See [Snowflake Postgres Server Settings](postgres-server-settings.md)
        for a list of available Postgres parameters.

        ```none
        '{"component:name" = "value", ...}'
        ```

        Default: Copied from the origin.

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the user.

        Default: `NULL`

    `TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
    :   Specifies the [tag](../object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../object-tagging/introduction.md).

    One row with the following columns will be returned:

    * `status`
    * `host`

**CREATE FORK SQL Examples**

> Create a fork `my_fork` from the origin instance `my_origin_instance` at the timestamp `2025-01-01 12:00:00`.
>
> ```sqlexample
> CREATE POSTGRES INSTANCE my_fork
>   FORK my_origin_instance
>   AT (TIMESTAMP => '2025-01-01 12:00:00');
> ```
>
> Create a fork `my_fork` from the origin instance `my_origin_instance` as it was `120` seconds ago.
>
> ```sqlexample
> CREATE POSTGRES INSTANCE my_fork
>   FORK my_origin_instance
>   AT (OFFSET => -120);
> ```
>
> Create a fork `my_fork` from the origin instance `my_origin_instance` as of the current time, using the `STANDARD_M` instance size
> and no high availability.
>
> ```sqlexample
> CREATE POSTGRES INSTANCE my_fork
>   FORK my_origin_instance
>   COMPUTE_FAMILY = STANDARD_M
>   HIGH_AVAILABILITY = FALSE;
> ```

When you create a fork, no credentials will be displayed. Credentials for the fork are the same as the origin instance. You can regenerate
credentials later if needed.

The time needed to create a fork is dependent on the size of the origin instance.

---
title: Snowflake Postgres Read Replicas
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-create-replica.md
section: Snowflake Postgres
---

# Snowflake Postgres Read Replicas

## Overview

Snowflake Postgres supports creating *replicas*. Replicas are read-only copies
of a *leader instance* that are continually kept synchronized with changes from that
instance. This synchronization is done automatically and transparently to the user.

Replicas are useful for read scaling and offloading certain workloads that could
impact production (such as reporting workloads). Replicas are required to have
the same storage size as their leader but can have a different compute size.

Replicas are provisioned in the same network as their leader instance and, as a result,
inherit all ingress and egress network rules from their leader instance.

Postgres credentials, along with all other data on replicas, are copied and kept
synchronized with the leader instance.

## Creating a Read Replica

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select the instance you want to create a replica of to load its detail page.
3. In the Manage menu at the top right of the detail page, select the `Create replica` option.
4. Make your choices for your new replica’s configuration options.
5. Select Save to create the replica.

To create a Postgres instance as a replica of an origin instance, specify the AS REPLICA OF clause in the CREATE POSTGRES INSTANCE command.
By default, the COMPUTE_FAMILY and POSTGRES_SETTINGS properties are copied from the original Postgres instance.
You can override those settings, and also specify COMMENT and TAG properties for the new instance.

One row with the following columns will be returned:

* `status`
* `host`

**CREATE REPLICA SQL Examples**

Create a replica `my_replica` of the instance `my_origin_instance`.

```sqlexample
CREATE POSTGRES INSTANCE my_replica
  AS REPLICA OF my_origin_instance;
```

Create a replica `my_replica` of the instance `my_origin_instance` with a different compute family.

```sqlexample
CREATE POSTGRES INSTANCE my_replica
  AS REPLICA OF my_origin_instance
  COMPUTE_FAMILY = STANDARD_M;
```

The time needed to create a replica depends on the size of its origin instance. The replica will
display its current state as it is building. See the list of
[instance states](managing-instances.md) for details about the states the replica will
pass through as it builds.

## Replica behavior and limitations

* Only **10 replicas** can stream changes from a leader instance by default. To allow additional replicas to stream, increase the Postgres `max_wal_senders` setting (see [Snowflake Postgres Server Settings](postgres-server-settings.md)).
* Leader Postgres instances **cannot be dropped while they have replicas**. All replicas must be removed before the leader can be dropped.
* Postgres server settings applied to a leader instance are copied to all replicas.

---
title: Snowflake Postgres Roles
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-roles.md
section: Snowflake Postgres
---

# Snowflake Postgres Roles

Postgres has its own role-based authentication for managing connections to databases and using
databases on a Postgres server. These roles are separate from Snowflake roles. Postgres
roles are used for accessing and managing databases, tables, and other objects within
Snowflake Postgres instances.

When you create an instance, Snowflake automatically creates two special managed roles for you
to use, which are described below.

For more information about managing Postgres roles, see the [Postgres documentation](https://www.postgresql.org/docs/current/user-manag.html).

> **Note:**
>
> Here and in many other places you will see the terms “role” and “user” used interchangeably in the context of Postgres user
> management. This is because a Postgres user is simply a role that has the Postgres role LOGIN attribute.

## Snowflake Postgres managed roles

Snowflake Postgres automatically creates two managed roles at the same time that it creates your instance.

### The `snowflake_admin` role

The `snowflake_admin` role is a high-privilege Postgres role used to administer your Snowflake Postgres instance. It is **not**
a full Postgres superuser; some operations remain restricted and are managed by Snowflake. However, it has elevated privileges
that include:

* Creating and managing Postgres roles.
* Creating and managing databases.
* Managing replication for your Snowflake Postgres instance.
* Bypassing row-level security (RLS) policies where applicable.

In addition, `snowflake_admin` is a member of several Postgres built-in roles that grant monitoring and operational capabilities,
including:

* `pg_signal_backend`
* `pg_use_reserved_connections`
* `pg_create_subscription`
* `pg_read_all_settings`
* `pg_read_all_stats`
* `pg_stat_scan_tables`
* `pg_monitor`
* `snowflake_admin_group`

### The `application` role

The `application` role is a non-superuser role that by default has permissions to create objects in the `postgres` database. New permissions
or ownership for this role should be granted by the `snowflake_admin` role.

## Postgres password security

### Regenerating credentials for Snowflake Postgres managed roles

Credentials for the `snowflake_admin` and `application` roles are generated when you create the instance and are displayed only once.
You can regenerate these credentials at any time, invalidating the existing credentials.

SnowsightSQL

From the dashboard you can regenerate the credentials for your instance’s `snowflake_admin` role.

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right select Regenerate credentials.
4. Click the Acknowledge & continue button to confirm the action.

> To regenerate credentials for the `snowflake_admin` or `application` role, you can use an ALTER POSTGRES INSTANCE command with the
> RESET ACCESS FOR parameter. The value that you specify is a quoted string, either `'snowflake_admin'` or `'application'`.
> For example:
>
> ```sqlexample
> ALTER POSTGRES INSTANCE my_instance_1 RESET ACCESS FOR 'snowflake_admin';
> ALTER POSTGRES INSTANCE my_instance_2 RESET ACCESS FOR 'application';
> ```
>
> * Requires **OWNERSHIP** privilege
>
> That command returns one row with the following column:
>
> > * `password`

**Rotate Credentials Example**

> > Reset the access for the `snowflake_admin` role for a Snowflake Postgres instance named `my_instance`:
>
> ```sqlexample
> ALTER POSTGRES INSTANCE my_instance RESET ACCESS FOR 'snowflake_admin';
> ```

### Setting passwords for other Postgres roles

Snowflake Postgres instances are configured for scram-sha-256 password authentication. When new
passwords are set, the server generates and stores a scram-sha-256 hash, but when the Postgres
[log_statement](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-STATEMENT) parameter
is set to any value other than `none`, then CREATE ROLE and ALTER ROLE DDL commands are fully
logged to the Postgres server log. Therefore, you should make sure that clear-text passwords are not
logged as part of those statements.

#### Disabling statement logging for CREATE ROLE and ALTER ROLE Postgres DDL commands

The simplest way to prevent clear-text passwords used in CREATE ROLE and ALTER ROLE DDL statements from appearing in the Postgres server
log is to disable the `log_statement` parameter for the transaction that you run them in. Do so by using SET LOCAL:

```postgres
BEGIN;
SET LOCAL log_statement = 'none';
CREATE USER mynewrole PASSWORD 'mynewpassword';
COMMIT;
```

#### Using the `psql` Postgres client’s `\password` command

The Postgres [psql](https://www.postgresql.org/docs/current/app-psql.html) client program has a
[\password](https://www.postgresql.org/docs/current/app-psql.html) meta-command that can be used
to change the password for existing users. The `\password` meta-command precomputes the entered
password’s scram-sha-256 hash and uses that in the ALTER ROLE command that is sent to the server. To
use this method, first create new users without a password, and then set each user’s password with
the psql `\password` meta-command.

```psql
postgres=# CREATE ROLE mynewrole LOGIN;
CREATE ROLE

postgres=# \password mynewrole
Enter new password for user "mynewrole":
Enter it again:
```

If `log_statement` is set to a value other than `'none'`, then the log entry for ALTER ROLE
command sent by `psql` for the above `\password` command has the calculated scram-sha-256
hash instead of the actual clear-text password. You can combine this method with disabling
`log_statement` completely, as described above, to prevent even that hash from appearing in the
Postgres log:

```psql
postgres=# CREATE ROLE mynewrole LOGIN;
CREATE ROLE

postgres=# BEGIN;
BEGIN

postgres=# SET LOCAL log_statement = 'none';
SET

postgres=# \password mynewrole
Enter new password for user "mynewrole":
Enter it again:

postgres=# COMMIT;
COMMIT
```

### Leaked Password Protection

Leaked password protection is provided for roles on Snowflake Postgres instances. Discovery and notification work as described in our
main [Leaked password protection](../leaked-password-protection.md). When Snowflake discovers a leaked password for one of your Snowflake Postgres roles:

* The role is added to the special `snowflake_nologin` Postgres group role to prevent future logins with it.
* All existing connections for the role are terminated.
* The email notification you receive will have “Urgent - Snowflake Postgres Role(s) Password Reset to Prevent Unauthorized Access” for
  its Subject.

Should you receive this email, you should immediately securely update the role’s password as described above. When regenerating credentials
for managed roles they are automatically removed from the `snowflake_nologin` Postgres role group. For non-managed roles, after updating
the role’s password they can be removed from the `snowflake_nologin` group role by running this Postgres SQL with the `snowflake_admin` role:

```postgres
REVOKE snowflake_nologin FROM {rolename};
```

## Role limitations

In Snowflake Postgres, certain operations are reserved for the service itself and can’t be
performed by any customer-managed role, including `snowflake_admin`.

Examples of operations that are restricted include:

* Logging in with superuser roles such as `postgres` or `snowflake_superuser`, or assuming such roles by
  using SET ROLE.
* Creating other superusers.
* Executing the ALTER SYSTEM command.
* Changing protected server-level configuration parameters that are managed by Snowflake.
* Modifying or disabling core Snowflake-managed components or extensions.
* Accessing or altering Snowflake-managed system databases or schemas used by the service.
* Accessing or altering the Snowflake Postgres instance filesystem.
* Directly modifying system catalog tables.
* Creating more than 64 roles in the instance.
* Creating more than 32 databases in the instance.
* Accessing the Postgres [generic file access functions](https://www.postgresql.org/docs/current/functions-admin.html#FUNCTIONS-ADMIN-GENFILE)
  that permit filesystem access.

The Snowflake Postgres extension may introduce further restrictions on what both `snowflake_admin`
and `application` can do within an instance. These extension-specific limitations may evolve over
time and will be documented with the corresponding extension behavior. If an operation is blocked,
you receive an error indicating that it isn’t permitted in Snowflake Postgres.

---
title: Snowflake Postgres Server Settings
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-server-settings.md
section: Snowflake Postgres
---

# Snowflake Postgres Server Settings

The table below details the parameters that can be set for the Postgres server component of Snowflake Postgres instances. Each setting’s name
is hyperlinked to its Postgres documentation.

Where “Postgres default” appears in the Default column, Snowflake Postgres instances use the default value from Postgres. This can vary by
major version.

See [Creating a Snowflake Postgres Instance](postgres-create-instance.md) for details on setting values for these Postgres server settings when creating Snowflake Postgres
instances.

> **Tip:**
>
> To see a parameter’s documentation for a specific major version change the word “current” in the hyperlink address to the
> target major version. For example, this hyperlink address for the `postgres:work_mem` setting:
>
> <https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-WORK-MEM>
>
> becomes this to visit its Postgres 17 documentation:
>
> <https://www.postgresql.org/docs/17/runtime-config-resource.html#GUC-WORK-MEM>

| Component | Name | Requires restart | Description | Default |
| --- | --- | --- | --- | --- |
| pgbouncer | [autodb_idle_timeout](https://www.pgbouncer.org/config.html) | FALSE | If the automatically created (via “\*”) database pools have been unused this many seconds, they are freed. | 3600 |
| pgbouncer | [default_pool_size](https://www.pgbouncer.org/config.html) | FALSE | How many server connections to allow per user/database pair. | 497 |
| pgbouncer | [ignore_startup_parameters](https://www.pgbouncer.org/config.html) | FALSE | Ignore parameters startup packets (e.g. options,extra_float_digits). | client_encoding,datestyle,timezone,standard_conforming_strings,extra_float_digits |
| pgbouncer | [max_prepared_statements](https://www.pgbouncer.org/config.html) | FALSE | Number of prepared statements kept active on a single server connection | 250 |
| pgbouncer | [pool_mode](https://www.pgbouncer.org/config.html) | FALSE | Specifies when a server connection can be reused by other clients. | transaction |
| pgbouncer | [server_idle_timeout](https://www.pgbouncer.org/config.html) | FALSE | If a server connection has been idle for more than this many seconds, it will be closed. | 60 |
| postgres | [auto_explain.log_analyze](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-ANALYZE) | FALSE | Causes EXPLAIN ANALYZE output, rather than just EXPLAIN output, to be printed when an execution plan is logged. | Postgres default |
| postgres | [auto_explain.log_buffers](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-BUFFERS) | FALSE | Controls whether buffer usage statistics are printed when an execution plan is logged. | Postgres default |
| postgres | [auto_explain.log_format](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-FORMAT) | FALSE | Selects the EXPLAIN output format to be used. | Postgres default |
| postgres | [auto_explain.log_min_duration](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-MIN-DURATION) | FALSE | The minimum statement execution time, in milliseconds, that will cause the statement’s plan to be logged. | Postgres default |
| postgres | [auto_explain.log_nested_statements](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-NESTED-STATEMENTS) | FALSE | Causes nested statements (statements executed inside a function) to be considered for logging. | Postgres default |
| postgres | [auto_explain.log_timing](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-TIMING) | FALSE | Controls whether per-node timing information is printed when an execution plan is logged. | Postgres default |
| postgres | [auto_explain.log_triggers](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-TRIGGERS) | FALSE | Causes trigger execution statistics to be included when an execution plan is logged. | Postgres default |
| postgres | [auto_explain.log_verbose](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-LOG-VERBOSE) | FALSE | Controls whether verbose details are printed when an execution plan is logged. | Postgres default |
| postgres | [auto_explain.sample_rate](https://www.postgresql.org/docs/current/auto-explain.html#AUTO-EXPLAIN-CONFIGURATION-PARAMETERS-SAMPLE-RATE) | FALSE | Causes auto_explain to only explain a fraction of the statements in each session. | Postgres default |
| postgres | [autovacuum_analyze_scale_factor](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-ANALYZE-SCALE-FACTOR) | FALSE | Specifies a fraction of the table size to add to autovacuum_analyze_threshold when deciding whether to trigger an ANALYZE. | Postgres default |
| postgres | [autovacuum_freeze_max_age](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-FREEZE-MAX-AGE) | TRUE | Specifies the maximum age (in transactions) that a table’s transaction ID can attain before a VACUUM operation is forced to prevent transaction ID wraparound within the table. | Postgres default |
| postgres | [autovacuum_vacuum_cost_delay](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-VACUUM-COST-DELAY) | FALSE | Specifies the cost delay value that will be used in automatic VACUUM operations. If -1 is specified, the regular vacuum_cost_delay value will be used. | Postgres default |
| postgres | [autovacuum_vacuum_cost_limit](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-VACUUM-COST-LIMIT) | FALSE | Specifies the cost limit value that will be used in automatic VACUUM operations. | Postgres default |
| postgres | [autovacuum_vacuum_insert_scale_factor](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-VACUUM-INSERT-SCALE-FACTOR) | FALSE | Specifies a fraction of the table size to add to autovacuum_vacuum_insert_threshold when deciding whether to trigger a VACUUM. | Postgres default |
| postgres | [autovacuum_vacuum_insert_threshold](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-VACUUM-INSERT-THRESHOLD) | FALSE | Specifies the number of inserted tuples needed to trigger a VACUUM in any one table. | Postgres default |
| postgres | [autovacuum_vacuum_scale_factor](https://www.postgresql.org/docs/17/runtime-config-autovacuum.html#GUC-AUTOVACUUM-VACUUM-SCALE-FACTOR) | FALSE | Specifies a fraction of the table size to add to autovacuum_vacuum_threshold when deciding whether to trigger a VACUUM. | Postgres default |
| postgres | [checkpoint_completion_target](hhttps://www.postgresql.org/docs/current/runtime-config-wal.md) | FALSE | Specifies the target of checkpoint completion, as a fraction of total time between checkpoints. | Postgres default |
| postgres | [checkpoint_timeout](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-CHECKPOINT-TIMEOUT) | FALSE | Maximum time between automatic WAL checkpoints. | Postgres default |
| postgres | [checkpoint_warning](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-CHECKPOINT-WARNING) | FALSE | Write a message to the server log if checkpoints caused by the filling of WAL segment files happen closer together than this amount of time. | Postgres default |
| postgres | [default_statistics_target](https://www.postgresql.org/docs/current/runtime-config-query.html#GUC-DEFAULT-STATISTICS-TARGET) | FALSE | Sets the default statistics target for table columns without a column-specific target set via ALTER TABLE SET STATISTICS. | Postgres default |
| postgres | [default_text_search_config](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-DEFAULT-TEXT-SEARCH-CONFIG) | FALSE | Selects the text search configuration that is used by those variants of the text search functions that do not have an explicit argument specifying the configuration. | Postgres default |
| postgres | [default_transaction_read_only](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-DEFAULT-TRANSACTION-READ-ONLY) | FALSE | A read-only SQL transaction cannot alter non-temporary tables. | off |
| postgres | [hot_standby_feedback](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-HOT-STANDBY-FEEDBACK) | FALSE | Specifies whether or not a hot standby will send feedback to the primary or upstream standby about queries currently executing on the standby. | on |
| postgres | [idle_in_transaction_session_timeout](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-IDLE-IN-TRANSACTION-SESSION-TIMEOUT) | FALSE | Terminate any session that has been idle within an open transaction for longer than the specified amount of time. | Postgres default |
| postgres | [intervalstyle](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-INTERVALSTYLE) | FALSE | Sets the display format for interval value. | Postgres default |
| postgres | [jit](https://www.postgresql.org/docs/current/runtime-config-query.html#GUC-JIT) | FALSE | Enable JIT support. | Postgres default |
| postgres | [lock_timeout](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-LOCK-TIMEOUT) | FALSE | Abort any statement that waits longer than the specified amount of time while attempting to acquire a lock. | Postgres default |
| postgres | [log_autovacuum_min_duration](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-AUTOVACUUM-MIN-DURATION) | FALSE | Causes each action executed by autovacuum to be logged if it ran for at least the specified amount of time. | Postgres default |
| postgres | [log_connections](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-CONNECTIONS) | FALSE | Outputs a line to the server logs detailing each successful connection. | Postgres default |
| postgres | [log_destination](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-DESTINATION) | FALSE | Sets the desired log destinations. | syslog,stderr |
| postgres | [log_disconnections](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-DISCONNECTIONS) | FALSE | Causes session terminations to be logged. The log output provides information similar to log_connections, plus the duration of the session. | Postgres default |
| postgres | [log_duration](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-DURATION) | FALSE | Causes the duration of every completed statement to be logged. | Postgres default |
| postgres | [log_line_prefix](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-LINE-PREFIX) | FALSE | Specifies a printf-style string that is output at the beginning of each log line. | [%p][%b][%v][%x] %q[user=%u,db=%d,app=%a] |
| postgres | [log_lock_waits](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-LOCK-WAITS) | FALSE | Controls whether a log message is produced when a session waits longer than deadlock_timeout to acquire a lock. | on |
| postgres | [log_min_duration_sample](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-MIN-DURATION-SAMPLE) | FALSE | Allows sampling the duration of completed statements that ran for at least the specified amount of time. | Postgres default |
| postgres | [log_min_duration_statement](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-MIN-DURATION-STATEMENT) | FALSE | Causes the duration of each completed statement to be logged if the statement ran for at least the specified amount of time. | 2s |
| postgres | [log_min_messages](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-MIN-MESSAGES) | FALSE | Controls which message levels are written to the server log. | notice |
| postgres | [log_rotation_size](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-ROTATION-SIZE) | FALSE | This determines the maximum size of an individual log file. | Postgres default |
| postgres | [log_statement](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-STATEMENT) | FALSE | Controls which SQL statements are logged. | ddl |
| postgres | [log_statement_sample_rate](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-STATEMENT-SAMPLE-RATE) | FALSE | Determines the fraction of statements with duration exceeding log_min_duration_sample that will be logged. | Postgres default |
| postgres | [log_temp_files](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-TEMP-FILES) | FALSE | Controls logging of temporary file names and sizes. | 10MB |
| postgres | [log_transaction_sample_rate](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-LOG-TRANSACTION-SAMPLE-RATE) | FALSE | Sets the fraction of transactions whose statements are all logged, in addition to statements logged for other reasons. | Postgres default |
| postgres | [logical_decoding_work_mem](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-LOGICAL-DECODING-WORK-MEM) | FALSE | Specifies the maximum amount of memory to be used by logical decoding. | Postgres default |
| postgres | [maintenance_work_mem](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-MAINTENANCE-WORK-MEM) | FALSE | Specifies the maximum amount of memory to be used by maintenance operations, such as VACUUM, CREATE INDEX, and ALTER TABLE ADD FOREIGN KEY. | TOTAL_MEMORY \* 0.4 |
| postgres | [max_connections](https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-MAX-CONNECTIONS) | TRUE | Determines the maximum number of concurrent connections to the database server. | 500 |
| postgres | [max_locks_per_transaction](https://www.postgresql.org/docs/current/runtime-config-locks.html#GUC-MAX-LOCKS-PER-TRANSACTION) | TRUE | Controls the average number of object locks allocated for each transaction. | Postgres default |
| postgres | [max_logical_replication_workers](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-LOGICAL-REPLICATION-WORKERS) | TRUE | Specifies maximum number of logical replication workers. | Postgres default |
| postgres | [max_parallel_maintenance_workers](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-MAX-PARALLEL-MAINTENANCE-WORKERS) | FALSE | Sets the maximum number of parallel workers that can be started by a single utility command. | Postgres default |
| postgres | [max_parallel_workers](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-MAX-PARALLEL-WORKERS) | FALSE | Sets the maximum number of workers that the cluster can support for parallel operations. | NUM_CPUS |
| postgres | [max_parallel_workers_per_gather](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-MAX-PARALLEL-WORKERS-PER-GATHER) | FALSE | Sets the maximum number of workers that can be started by a single Gather or Gather Merge node. | NUM_CPUS |
| postgres | [max_replication_slots](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-REPLICATION-SLOTS) | TRUE | Specifies the maximum number of replication slots that the server can support. | 10 |
| postgres | [max_slot_wal_keep_size](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-SLOT-WAL-KEEP-SIZE) | FALSE | Specifies the maximum size of WAL files that replication slots are allowed to retain in the `pg_wal` directory at checkpoint time. | STORAGE_GB \* 0.1 |
| postgres | [max_standby_archive_delay](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-STANDBY-ARCHIVE-DELAY) | FALSE | Determines how long the standby server should wait before canceling standby queries that conflict with about-to-be-applied WAL entries. | Postgres default |
| postgres | [max_standby_streaming_delay](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-STANDBY-STREAMING-DELAY) | FALSE | Determines how long the standby server should wait before canceling standby queries that conflict with about-to-be-applied WAL entries. | Postgres default |
| postgres | [max_wal_senders](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-MAX-WAL-SENDERS) | TRUE | Specifies the maximum number of concurrent connections from standby servers or streaming base backup clients. | 10 |
| postgres | [max_wal_size](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-MAX-WAL-SIZE) | FALSE | Maximum size to let the WAL grow during automatic checkpoints. | MIN(10GB, STORAGE_GB \* 0.1) |
| postgres | [max_worker_processes](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-MAX-WORKER-PROCESSES) | TRUE | Sets the maximum number of background processes that the cluster can support. | 100 |
| postgres | [pg_stat_statements.max](https://www.postgresql.org/docs/current/pgstatstatements.html#PGSTATSTATEMENTS-CONFIG-PARAMS) | TRUE | Maximum number of statements tracked. | Postgres default |
| postgres | [pg_stat_statements.track](https://www.postgresql.org/docs/current/pgstatstatements.html#PGSTATSTATEMENTS-CONFIG-PARAMS) | FALSE | Control which statements should be tracked. | Postgres default |
| postgres | [pg_stat_statements.track_utility](https://www.postgresql.org/docs/current/pgstatstatements.html#PGSTATSTATEMENTS-CONFIG-PARAMS) | FALSE | Should the utility commands be tracked. Utility commands are all those other than SELECT, INSERT, UPDATE, DELETE, and MERGE. | Postgres default |
| postgres | [random_page_cost](https://www.postgresql.org/docs/current/runtime-config-query.html) | FALSE | Sets the planner’s estimate of the cost of a non-sequentially-fetched disk page. | 1.1 |
| postgres | [session_preload_libraries](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-SESSION-PRELOAD-LIBRARIES) | FALSE | Specifies one or more shared libraries that are to be preloaded at connection start. | Postgres default |
| postgres | [statement_timeout](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-STATEMENT-TIMEOUT) | FALSE | Abort any statement that takes more than the specified amount of time. | Postgres default |
| postgres | [synchronous_commit](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-SYNCHRONOUS-COMMIT) | FALSE | Specifies how much WAL processing must complete before the database server returns a “success” indication to the client. | local |
| postgres | [syslog_split_messages](https://www.postgresql.org/docs/current/runtime-config-logging.html#GUC-SYSLOG-SPLIT-MESSAGES) | FALSE | Split messages sent to syslog by lines and to fit into 1024 bytes | Postgres default |
| postgres | [tcp_keepalives_count](https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-TCP-KEEPALIVES-COUNT) | FALSE | Specifies the number of TCP keepalive messages that can be lost before the server’s connection to the client is considered dead. | 4 |
| postgres | [tcp_keepalives_idle](https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-TCP-KEEPALIVES-IDLE) | FALSE | Specifies the amount of time with no network activity after which the operating system should send a TCP keepalive message to the client. | 2 |
| postgres | [temp_file_limit](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-TEMP-FILE-LIMIT) | FALSE | Specifies the maximum amount of disk space that a process can use for temporary files, such as sort and hash temporary files, or the storage file for a held cursor. | MIN(2000GB, STORAGE_GB \* 0.25) |
| postgres | [track_activity_query_size](https://www.postgresql.org/docs/current/runtime-config-statistics.html#GUC-TRACK-ACTIVITY-QUERY-SIZE) | TRUE | Memory reserved to store the text of the currently executing command for each active session, for the pg_stat_activity.query field. | Postgres default |
| postgres | [track_commit_timestamp](https://www.postgresql.org/docs/current/runtime-config-replication.html#GUC-TRACK-COMMIT-TIMESTAMP) | TRUE | Record commit time of transactions. | Postgres default |
| postgres | [wal_keep_size](https://postgresqlco.nf/doc/en/param/wal_keep_size/) | FALSE | Specifies the minimum size of past WAL files kept in the pg_wal directory, in case a standby server needs to fetch them for streaming replication. | Postgres default |
| postgres | [wal_sender_timeout](https://www.postgresql.org/docs/current/runtime-config-replication.html) | FALSE | Sets the maximum time to wait for WAL replication. | Postgres default |
| postgres | [work_mem](https://www.postgresql.org/docs/current/runtime-config-resource.html#GUC-WORK-MEM) | FALSE | Sets the base maximum amount of memory to be used by a query operation (such as a sort or hash table) before writing to temporary disk files. | (TOTAL_MEMORY \* 0.75)/ (NUM_CORES \* 8) |

---
title: Snowflake Postgres SSL certificates
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-ssl-certs.md
section: Snowflake Postgres
---

# Snowflake Postgres SSL certificates

Snowflake Postgres runs with secure SSL connections to clusters. When connecting clients and applications to a cluster, include
`SSLMODE=require` in your settings, or the equivalent setting for non-`libpq`-based drivers. For more details and connection
troubleshooting tips, see [Connecting to Snowflake Postgres](connecting-to-snowflakepg.md).

For an added layer of security, admins and instance owners can retrieve the public certificate portion of the root CA (Certificate
Authority) public certificate and private key pair that is used to sign each Snowflake Postgres server’s certificate. These root CA
certificates (and their non-shared private keys) are unique to each Snowflake account. Once retrieved, the root CA certificate can be used
for additional verification of the server certificates presented at connection time to protect against man-in-the-middle (MITM) attacks.

## Retrieving the SSL public root certificate

SnowsightSQL

1. In the navigation menu, select Postgres.
2. In the More Options [⋮] menu at the top right, select Download Certificate.
3. Select Download in the confirmation dialog.

You can retrieve the root CA certificate from the `certificate` field returned by the [DESCRIBE POSTGRES INSTANCE](../../sql-reference/sql/desc-postgres-instance.md)
command. This certificate is the same for all instances on a given account.

```sqlexample
DESCRIBE POSTGRES INSTANCE my_postgres
 ->> SELECT "property", "value"
     FROM $1
     WHERE "property" = 'certificate';
```

## Configuring Postgres clients for SSL certificate verification

1. Place the root CA certificate text, including the “—–BEGIN CERTIFICATE—–” and “—–END CERTIFICATE—–” lines, in a file
   in a secure location on your client host. If you already have a root CA store file with contents that you want to reuse, you can append
   your Snowflake Postgres root CA certificate text to it.
2. In your connection configuration:

   1. Specify the root CA public certificate location with the `sslrootcert=/path/to/root/certfile` in your connection parameters.
   2. Specify either `sslmode=verify-ca` or `sslmode=verify-full` (instead of `sslmode=require`)
      in your connection parameters.

> **Note:**
>
> `sslrootcert` has a default value of `$HOME/.postgresql/root.crt` for the client system user making the connection. If you place
> your root CA certificate at that location, you don’t need to specify the `sslrootcert` parameter for your connection.

Here is how these two `sslmode` values work:

* **verify-ca**: Verifies that the server is trustworthy by checking that it was signed by the root CA certificate pair
  using the present root CA public certificate.
* **verify-full**: Performs the `verify-ca` verification and additionally verifies that the server host name matches
  a name stored in the server certificate. Snowflake ensures that this will work for all signed server certificates signed with your
  account’s root CA.

The SSL connection fails if the server certificate can’t be verified per the specified `sslmode` parameter. Snowflake recommends
`verify-full` in most security-sensitive environments.

> **Warning:**
>
> If there is a root CA certificate present, then `sslmode=require` performs the same verification as `sslmode=verify-ca`. The presence
> of a root CA certificate at `$HOME/.postgresql/root.crt` for a server with a certificate signed by a different CA is a common source of
> SSL connection errors. If this happens, you can simply append your Snowflake root CA certificate’s text to that file, or place it somewhere
> else specified by the connection’s `sslrootcert` parameter.

> **Note:**
>
> For a full explanation of how these different `sslmode` setting levels prevent against MITM attacks, see the PostgreSQL chapter on
> [Protection provided in different sslmode settings](https://www.postgresql.org/docs/current/libpq-ssl.html#LIBPQ-SSL-PROTECTION).

---
title: Snowflake Postgres Tri-Secret Secure
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-tss.md
section: Snowflake Postgres
---

# Snowflake Postgres Tri-Secret Secure

[Tri-Secret Secure](../security-encryption-tss.md) is supported for Snowflake Postgres instance storage. Snowflake Postgres Tri-Secret Secure
instance storage uses a self-service registration process similar to that outlined in [Tri-Secret Secure self-service in Snowflake](../security-encryption-tss-self-serve.md)
with the following differences:

* Snowflake Postgres Tri-Secret Secure uses different Snowflake system functions for activation and CMK registration.
* Snowflake Postgres Tri-Secret Secure does not support private connectivity.
* Snowflake Postgres Tri-Secret Secure does not support self-registration with support activation.
* While Snowflake Postgres Tri-Secret Secure supports registering and activating new CMKs, it does not support rekeying of existing Snowflake Postgres
  instances with new CMKs.

> **Attention:**
>
> Before engaging with Snowflake to enable Snowflake Postgres Tri-Secret Secure for your account, you should carefully consider your responsibility for
> safeguarding your key as mentioned in [Customer-managed keys](../security-encryption-manage.md). If the customer managed key (CMK) in the composite master key hierarchy is revoked,
> your data can no longer be decrypted by Snowflake.
>
> If you have any questions or concerns, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> Snowflake also bears the same responsibility for the keys that we maintain. As with all security-related aspects of our service, we treat
> this responsibility with the utmost care and vigilance.
>
> All of our keys are maintained under strict policies that have enabled us to earn the highest security accreditations, including SOC 2
> Type II, PCI-DSS, HIPAA and [HITRUST CSF](../intro-cloud-platforms.md).

## Activate Snowflake Postgres Tri-Secret Secure

This procedure works on all cloud provider platforms that Snowflake supports. See your specific cloud provider documentation for any steps
taken on the cloud provider platform.

To create and register your CMK, and then activate Snowflake Postgres Tri-Secret Secure, complete the following steps:

1. On the cloud provider, create a CMK.

   Do this step in the key management service (KMS) on the cloud platform that hosts your Snowflake account.
2. In Snowflake, call the [SYSTEM$REGISTER_CMK_INFO_POSTGRES](../../sql-reference/functions/system_register_cmk_info_postgres.md) system function.

   * This system function registers your CMK with your Snowflake account for use with Snowflake Postgres Tri-Secret Secure.
   * Double-check the system function arguments to make sure they are correct for the cloud platform that hosts your Snowflake account.
3. In Snowflake, call the [SYSTEM$GET_CMK_INFO_POSTGRES](../../sql-reference/functions/system_get_cmk_info_postgres.md) system function.

   This system function returns the registration status and details for the CMK that you registered.
4. In Snowflake, call the [SYSTEM$GET_CMK_CONFIG_POSTGRES](../../sql-reference/functions/system_get_cmk_config_postgres.md) system function.

   This system function generates the information required for your cloud provider to allow Snowflake to access your CMK.

   > **Note:**
   >
   > If Microsoft Azure hosts your Snowflake account, you must pass the `tenant_id` value into the function.
5. In Snowflake, call the [SYSTEM$VERIFY_CMK_INFO_POSTGRES](../../sql-reference/functions/system_verify_cmk_info_postgres.md) system function.

   This system function confirms connectivity between your Snowflake account and your CMK.
6. In Snowflake, call the [SYSTEM$ACTIVATE_CMK_INFO_POSTGRES](../../sql-reference/functions/system_activate_cmk_info_postgres.md) system function.

   This system function activates Snowflake Postgres Tri-Secret Secure with your newly registered CMK.

   > **Important:**
   >
   > Snowflake Postgres Tri-Secret Secure does not support rekeying of existing Snowflake Postgres instances. This means that:
   >
   > * Snowflake Postgres instances that were created before any CMK was activated will not use Snowflake Postgres Tri-Secret Secure.
   > * Snowflake Postgres instances that were created while a prior CMK was active will continue to use that prior CMK.
   > * Only Snowflake Postgres primary instances that are created after a CMK is activated will use that CMK.
   > * Snowflake Postgres replicas and forks will always use the CMK in use by their primary instance.

### View the status of your CMK

You can call [SYSTEM$GET_CMK_INFO_POSTGRES](../../sql-reference/functions/system_get_cmk_info_postgres.md) at any time, to check the registration and activation status of your CMK.

For example, depending on when you call SYSTEM$GET_CMK_INFO_POSTGRES after the Snowflake Postgres Tri-Secret Secure activation process completes, the
function returns output that includes `...is activated...`. This means that your Snowflake account is using Snowflake Postgres Tri-Secret Secure with the
CMK that you registered.

### Change the CMK for Snowflake Postgres Tri-Secret Secure

Snowflake system functions support changing your customer-managed key (CMK), based on your security needs. Use the same steps to register a new CMK as the
steps that you followed to register your initial CMK. When you complete those steps again by using a new key, the output of the system functions
differs. Read the output from each system function that you call during self-service registration to confirm that you have changed your key.

### Deregister your current CMK

You can only register one CMK at a time with Snowflake Postgres Tri-Secret Secure. When you register your CMK, if the [SYSTEM$REGISTER_CMK_INFO_POSTGRES](../../sql-reference/functions/system_register_cmk_info_postgres.md)
function fails because a different CMK exists, call the [SYSTEM$DEREGISTER_CMK_INFO_POSTGRES](../../sql-reference/functions/system_deregister_cmk_info_postgres.md) system function, as prompted.

---
title: Snowflake Postgres version upgrades
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-upgrades.md
section: Snowflake Postgres
---

# Snowflake Postgres version upgrades

Postgres uses an X.Y versioning scheme, with X being the major version and Y being the minor version within that major version.

## Postgres major version upgrades

Snowflake Postgres allows you to schedule your major version upgrades via an instance [Modify](managing-instances.md) action, which requires
[failover maintenance](postgres-maintenance.md).

To initiate a major version upgrade, you must use a role that has been granted the OWNERSHIP or OPERATE privilege on the instance.

> **Note:**
>
> You can only upgrade to a newer major version. You can’t downgrade to a previous major version.
>
> You can combine a major version upgrade with an instance resize by selecting a new instance size,
> storage size, or both along with the new version number.

> **Tip:**
>
> Since upgrade maintenance failovers can take longer than other instance [Modify](managing-instances.md) maintenance failovers (see below) and
> you can’t downgrade an instance to a prior major version, Snowflake strongly recommends that you fully test major version upgrades
> with a [Fork](managing-instances.md) of your instance before proceeding with major version upgrades of active production instances.

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select your Snowflake Postgres instance.
3. In the Manage menu at the top right, select Modify.
4. If a newer version is available, you will be able to select it from the Postgres version dropdown menu.
5. Select the Save button to confirm the change.

You can initiate a major version upgrade with the [ALTER POSTGRES INSTANCE](../../sql-reference/sql/alter-postgres-instance.md) command by setting the POSTGRES_VERSION parameter to the desired version.

This will upgrade an instance named `my_instance` to Postgres 18:

```sqlexample
ALTER POSTGRES INSTANCE my_instance
  SET POSTGRES_VERSION = 18;
```

To have the upgrade proceed as soon as the upgrade replacement instance is ready, regardless of the currently set maintenance window:

```sqlexample
ALTER POSTGRES INSTANCE my_instance
  SET POSTGRES_VERSION = 18
  APPLY IMMEDIATELY;
```

This will both upgrade the instance to Postgres 18 and change its storage size to 100GB:

```sqlexample
ALTER POSTGRES INSTANCE my_instance
  SET POSTGRES_VERSION = 18
      STORAGE_SIZE_GB = 100;
```

Let’s say today’s date is March 18, 2026, and you want to have the upgrade maintenance failover happen tomorrow at 10pm:

```sqlexample
ALTER POSTGRES INSTANCE my_instance
  SET POSTGRES_VERSION = 18
  APPLY ON '2026-03-19 22:00:00';
```

> **Note:**
>
> If you have no maintenance window set, and have not specified a run time with `APPLY ON '<timestamp>'` when creating the upgrade action
> via SQL, the upgrade maintenance failover will proceed as soon as the new instance is populated and ready, just as when using
> APPLY IMMEDIATELY when creating the upgrade action via SQL.
>
> When using `APPLY ON '<timestamp>` to schedule the upgrade maintenance failover for a specified future time, that time can be at most
> three days from the current time.

### How major version upgrade maintenances work

Postgres major version upgrades work differently than other instance management operations. Once you initiate the process, Snowflake Postgres
will execute the following steps:

1. Just as with other [Modify](managing-instances.md) actions, a hidden replica is provisioned for the upgrade.
2. When the scheduled maintenance time arrives:

   * The current primary instance is locked to prevent writes.
   * The hidden replica is upgraded using [pg_upgrade](https://www.postgresql.org/docs/current/pgupgrade.html). The duration depends
     on the *number of objects* in your database, not data size.
3. Fail over to the newly upgraded instance once the upgrade is complete.

**Important Notes**:

* Major Version changes can affect application compatibility. We recommend testing your application against the new PostgreSQL version
  before upgrading.
* Read Replicas can’t have their major versions upgraded separately from their primary instances. Instead they are automatically
  upgraded when performing a major version upgrade on their primary, but only once their primary is upgraded and a fresh backup is taken. Until then, replicas will remain available but in a stale state.
* HA instances (if present) are also automatically upgraded after their primary is upgraded and a fresh backup is taken. Until then, the
  primary will not have a valid HA instance present.
* The service interruption from the maintenance failover will be longer than that required for other [Modify](managing-instances.md) actions, but
  should typically last no longer than a few minutes.
* If an upgrade fails, your instance will automatically revert back to the original instance.

## Postgres minor version upgrades

Snowflake will automatically upgrade your database with new minor versions of Postgres over time.

With each Postgres release we examine all security related issues and bugs. For any deemed critical, we will prioritize your upgrade to
ensure that your data is safe. If an emergency update is required, we’ll perform that update during your maintenance window.

For non-critical fixes we gradually update databases by one of the following:

* Updating your instance during Instance Management operations that require instance replacement, such as resource changes
* Updating your High Availability standby after an HA failover. If an HA failover occurs the newly build HA instance will receive
  the latest point release.

An instance [Refresh](managing-instances.md) will also ensure your instance and HA instance (if present) will be upgraded to the latest available minor
version.

---
title: Snowflake Token Authentication for Snowflake Postgres
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-token-auth.md
section: Snowflake Postgres
---

# Snowflake Token Authentication for Snowflake Postgres

Snowflake allows users to generate short-lived access tokens to use for logging into Snowflake Postgres instances. At the
instance level this is known as Snowflake authorization and is done with these three steps which are expanded upon below:

1. Enable Snowflake authorization for the Snowflake Postgres instance.
2. On the Snowflake Postgres instance, create mappings between Postgres users and Snowflake users.
3. Mapped Snowflake users then generate short-lived access tokens to use when logging into the Snowflake Postgres instance.

> **Note:**
>
> Snowflake Token Authentication for Snowflake Postgres is a separate feature from the
> [Snowflake OAuth](../oauth-snowflake-overview.md) and [Programmatic access tokens](../programmatic-access-tokens.md)
> Snowflake authentication methods.

## Enabling and disabling Snowflake Authorization on Snowflake Postgres instances

SnowsightSQL

To enable Snowflake authorization at instance creation time, enable the Snowflake auth option in the Snowflake Postgres
New instance dialogue when [Creating a new instance](postgres-create-instance.md).

To enable or disable Snowflake authorization for an existing instance:

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right, select the Enable Snowflake auth or Disable Snowflake auth
   option from the instance’s Manage dropdown menu on its details page in the dashboard.
4. Select Enable or Disable on the presented confirmation dialogue.

Enabling and disabling Snowflake authorization for an instance is done with its AUTHENTICATION_AUTHORITY attribute.

To enable Snowflake authorization at instance creation time:

```sqlexample
CREATE POSTGRES INSTANCE {instance_name}
 SET AUTHENTICATION_AUTHORITY = POSTGRES_OR_SNOWFLAKE
 <other_options>;
```

To enable Snowflake authorization for existing instances:

```sqlexample
ALTER POSTGRES INSTANCE {instance_name}
 SET AUTHENTICATION_AUTHORITY = POSTGRES_OR_SNOWFLAKE
 <other_options>;
```

To disable Snowflake authorization for existing instances:

```sqlexample
ALTER POSTGRES INSTANCE {instance_name}
 SET AUTHENTICATION_AUTHORITY = POSTGRES
 <other_options>;
```

> **Important:**
>
> Disabling Snowflake authorization on an instance only prevents Snowflake users from creating new short-lived access tokens.
> Users with valid tokens can still establish new connections until the tokens expire, and existing connections will persist.
>
> After disabling Snowflake Authorization, Postgres users mapped to Snowflake users will not be able to use standard Postgres
> authentication until their mappings have been removed as described in Creating mappings between Postgres users and Snowflake users below.

## Creating mappings between Postgres users and Snowflake users

To create a mapping between a Postgres user and a Snowflake user log into your Postgres instance with the `snowflake_admin` user and run:

```postgresql
ALTER USER {postgres_user} SET snowflake_user = '{snowflake_user}';
```

The supplied `{postgres_user}` and `{snowflake_user}` names in the above statement will read as case-insensitive. If case-sensitivity
is required place the names in double-quotes. For example, to map a Postgres user named Casey to a Snowflake user of the same name:

```postgresql
ALTER USER "Casey" SET snowflake_user = '"Casey"';
```

To remove a mapping between a Postgres user and a Snowflake user log into your Postgres instance with the `snowflake_admin` user and run:

```postgres
ALTER USER {postgres_user} RESET snowflake_user;
```

To view which existing mappings between Postgres users and Snowflake users log into your Postgres instance with the `snowflake_admin` user
and query the SNOWFLAKE_AUTH.IDENTITY_MAPPING Postgres view view.

> **Note:**
>
> Postgres users with Snowflake user mappings can only log in with generated short-lived access tokens. They cannot connect with a Postgres
> password, and their Postgres passwords cannot be changed. To re-enable standard password login functionality for a given Postgres user, you
> must remove its mapping to a Snowflake user.

## Creating short-lived access tokens for mapped Snowflake users

Snowflake Postgres instance owners and Snowflake users with the USAGE privilege granted on a given instance can create short-lived access tokens
for themselves on a per-instance basis for instances that have Snowflake authorization enabled per the instructions above in
Enabling and disabling Snowflake Authorization on Snowflake Postgres instances.

SnowsightSQL

1. In the navigation menu, select Postgres.
2. Select your instance.
3. In the Manage menu at the top right, select Regenerate token.
4. In the presented Regenerate token dialogue enter the name of a Postgres user that has been mapped to your Snowflake user and select
   Acknowledge & continue.
5. Copy the presented short-lived access token or Postgres URI to use for establishing new connections to the Snowflake Postgres instance within the next
   15 minutes.

Use the [GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER](../../sql-reference/functions/generate_postgres_access_token_for_user.md) function.

## SNOWFLAKE_AUTH.IDENTITY_MAPPING Postgres view

This Snowflake Postgres view can be used to query a list of all mappings between Postgres users and Snowflake users on the instance.

> **Note:**
>
> This view is available to query only inside Snowflake Postgres instances and can not be queried directly from Snowflake.

### Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| postgres_role | name | The name of the mapped Postgres user |
| snowflake_identity | text | The snowflake user identity in USER:# form, where # is the mapped Snowflake user’s `user_id` value seen in the [USERS view](../../sql-reference/account-usage/users.md) view. |

---
title: Using Cortex Code CLI with Snowflake Postgres
source: https://docs.snowflake.com/en/user-guide/snowflake-postgres/postgres-cortex-code.md
section: Snowflake Postgres
---

# Using Cortex Code CLI with Snowflake Postgres

The Cortex Code CLI Postgres skill lets you ask natural-language questions about a Postgres database and have
Cortex Code generate and run SQL for you. It is designed for debugging, schema exploration, and lightweight
analytics without needing to hand-write every query.

For installation, connection setup, and general Cortex Code CLI usage, see
[Cortex Code CLI](../cortex-code/cortex-code-cli.md).

This Postgres-specific skill:

* Helps create and manage Snowflake Postgres instances.
* Translates natural-language questions into Postgres SQL.
* Executes the generated SQL against a configured Postgres instance.
* Returns a short, readable summary plus optional raw results.
* Can set up `pg_lake` for object storage and data movement between Postgres and Snowflake via Snowflake stages or S3 buckets.

## Managing connections

The skill stores connections using PostgreSQL’s native `~/.pg_service.conf` and `~/.pgpass` files,
making them compatible with all standard PostgreSQL clients (`psql`, pgAdmin, DBeaver, etc.).
When you ask Cortex Code to create an instance or reset credentials, the connection is saved automatically
via `pg_connect.py`.

> **Warning:**
>
> Never display `.pgpass` contents in chat or logs. Use `pg_connect.py` for all credential operations.

## Running queries

Once a connection is saved, Cortex Code can run `psql` commands against your instances directly from chat.
Passwords are resolved automatically from `~/.pgpass`. You can use natural-language prompts:

* *“Show me all tables on my_instance”*
* *“Run a SELECT on the orders table to get the last 10 rows”*
* *“What indexes exist on the users table?”*

Cortex Code translates these into `psql` commands, checks that the instance is ready (auto-resuming
if suspended), executes the query, and presents the results.

```text
You:          How many orders were placed this month?
Cortex Code:  Running: psql "service=my_instance" -c \
         "SELECT count(*) FROM orders
          WHERE created >= date_trunc('month', current_date);"

         count
        -------
           142
```

Cortex Code does not execute write operations (`INSERT`, `UPDATE`, `DELETE`, `DROP`, `TRUNCATE`)
unless you explicitly ask. Write operations require confirmation before proceeding.

## Postgres health checks

`pg_doctor` is a read-only diagnostic tool that runs health checks against a Postgres
instance with a 30-second statement timeout.

| Check | Description | Thresholds |
| --- | --- | --- |
| `cache_hit` | Index and table cache hit rate | Pass: >= 99% / Warn: 95-99% / Fail: < 95% |
| `bloat` | Table and index bloat estimation | Pass: < 30% / Warn: 30-50% / Fail: > 50% |
| `vacuum_stats` | Dead rows and autovacuum status | Warn if tables need vacuum |
| `connections` | Connection counts per role | Informational |
| `locks` | Exclusive locks held | Warn if locks present |
| `blocking` | Blocked queries | Fail if queries are blocked |
| `long_running` | Queries running longer than 5 minutes | Warn if found |
| `outliers` | Top slow queries (requires `pg_stat_statements`) | Informational |
| `unused_indexes` | Indexes never scanned | Warn if any found |
| `table_sizes` | Table size breakdown (total, index, toast) | Informational |

After presenting results, Cortex Code explains flagged checks and offers to investigate further.
Any remediation actions (`VACUUM`, `REINDEX`, etc.) require explicit confirmation before execution.

## Setting up pg_lake

`pg_lake` is a PostgreSQL extension that enables object storage and S3 data movement on Snowflake
Postgres instances. For details on the extension itself, see [Configuring S3 Storage for pg_lake](postgres-pg_lake.md).

The Cortex Code skill assists the multi-system setup (Snowflake SQL, AWS IAM, Postgres SQL) for both
Snowflake stages and S3 buckets managed outside Snowflake. You can ask Cortex Code to walk you through
the setup interactively:

* *“Set up pg_lake on my_instance with s3://my-bucket/data/”*
* *“Configure pg_lake with a Snowflake managed stage on my_instance”*

## Approval gates

Cortex Code requires confirmation before executing operations that are billable, destructive, or
security-sensitive.

| Operation | Reason |
| --- | --- |
| Create / suspend instance | Billable resource or drops active connections |
| Network policy changes | Modifies access control |
| Create / modify storage integration | Cloud resources, requires `ACCOUNTADMIN` |
| Update AWS trust policy | Modifies IAM permissions |
| Drop / destructive operations | Permanent data loss |
| Write operations from diagnostics | `VACUUM`, `REINDEX`, `pg_terminate_backend`, etc. |

Read-only operations (`SHOW`, `DESCRIBE`, health checks, `SELECT` queries) do not require approval.

## User Guide

Virtual warehouses, databases, queries, data sharing, security, governance, and account management.

---
title: About Data Exchange
source: https://docs.snowflake.com/en/user-guide/data-exchange.md
section: User Guide
---

# About Data Exchange

Data Exchange provides a data hub for securely collaborating around data with a selected group of members that you invite. It lets you,
as a provider, publish data which can then be discovered by the consumers participating in your exchange.

With a Data Exchange, you can easily provide data to a specific group of consistent business partners taking part in the Data Exchange,
such as internal departments in your company or vendors, suppliers, and partners external to your company.
If you want to share data with a variety of consumers inside and outside your
organization, you can also use listings offered to specific consumers or publicly on the Snowflake Marketplace.

You can manage membership, access to data, and audit data usage, as well as apply security controls to the data shared in the Data Exchange.
see [Manage data listings](data-exchange-managing-data-listings.md).

To set up a data exchange, see [Request a new Data Exchange](data-exchange-requesting.md).

* To access a data exchange, see [Access a Data Exchange](data-exchange-accessing.md).
* To create and manage data exchange provider profiles, see [Manage provider profiles](data-exchange-becoming-a-provider.md).
* If you’re a consumer of a data exchange, see [Configure and use a Data Exchange](data-exchange-using.md).

## Data Exchange Admin responsibilities

The Snowflake account that hosts the Data Exchange is the Data Exchange Admin. The Data Exchange Admin is responsible for configuring the Data Exchange and managing members (data providers and data consumers).

A user with the ACCOUNTADMIN role in the account designated as the Data Exchange Admin can:

* Add or remove members
* Designate members as providers, or consumers, or both

A Data Exchange Admin can delegate these privileges to other roles. For more information, see [Granting administrator privileges in a Data Exchange](data-exchange-marketplace-privileges.md).

## Data Exchange membership

Members are Snowflake accounts that are added by the Data Exchange Admin and designated as providers, consumers, or both.

After joining the Data Exchange, providers can:

* Create a listing.
* Define listing access personalized or [free](../collaboration/collaboration-listings-about.md).
* Publish the listing.
* Grant access to personalized listings or datasets that reside in a different region from the consumer.

After joining the Data Exchange, consumers can:

* Discover by browsing the exchange listings.
* Switch between the Snowflake Marketplace and the Data Exchange.
* Consume datasets (instantly or by request).

---
title: About organizational listings
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-about.md
section: User Guide
---

# About organizational listings

Organizational listings in Snowflake allow you to share data products securely within your organization, making it easier for internal consumers to discover and use trusted resources in the Internal Marketplace. Providers can create and manage listings with the Snowflake API or Provider Studio in Snowsight.

## The Internal Marketplace

The Internal Marketplace in Snowflake is a curated, secure space for collaborative data sharing within
your organization. It centralizes internally-available data products, allowing teams to discover,
trust, and apply resources without needing to navigate external marketplaces. By offering an organized
way to discover data products, the Internal Marketplace supports collaboration and data-driven
decision-making across your company.

The Internal Marketplace is similar to the public Snowflake Marketplace, but it is exclusively for your organization.
It allows you to easily discover and use vetted data shared within your internal teams. Access can be managed
by account targeting and Role-Based Access Control (RBAC), ensuring that data remains secure and accessible
only to authorized users.

## Organizational listing providers

For those creating and sharing data products, the Internal Marketplace provides a secure platform to publish data products
internally. Providers can create and manage organizational listings using Provider Studio in Snowsight or
via the API. Publishing data products in the Internal Marketplace ensures teams access consistent datasets, reducing
redundancy and supporting unified, data-driven initiatives.

Centralizing data offerings in the Internal Marketplace helps you manage access to sensitive information, maintaining data security
and integrity while enabling the organization to innovate with trusted data.

Providers can create and manage organizational listings by using Provider Studio or the API.

## Organizational listing consumers

For team members and data consumers, organizational listings provide a way to discover and access internal-only data resources. The
Internal Marketplace lets users locate data products without having to browse through externally-shared listings
in Snowflake Marketplace. Each organizational listing can be curated to meet your organizational
standards, so consumers can use these data products confidently for their analytics and projects.

## Internal Marketplace listings in government regions

Providers in government regions can configure organizational listings in the Internal Marketplace and share those listings with consumers in commercial or Virtual Private Snowflake (VPS) accounts. Providers can also configure [Cross-Cloud Auto-Fulfillment](../../../../collaboration/provider-listings-auto-fulfillment.md) on these listings. When configured, auto-fulfillment will be triggered after the consumer accesses the listing.

Consumers in government regions can programmatically access listings created by providers in commercial or VPS accounts using the Uniform Listing Locator (ULL) or by mounting the shared database.

### Limitations when working with Internal Marketplace listings in government regions

* Consumers can’t search the Internal Marketplace in Snowsight for listings in government regions.

  + To access a listing in Snowsight, select the listing URL received from the provider.
  + To access and test a listing programmatically, run the following code:

    ```sqlexample
    SHOW AVAILABLE LISTINGS;
    SELECT * FROM <ull>.<schema>.<view>
    ```
* To trigger auto-fulfillment on a listing, consumers must get the listing through Snowsight. This functionality isn’t supported using SQL.
* Custom organization profiles aren’t supported.
* The [ACCESS_HISTORY](../../../../sql-reference/organization-usage/access_history.md) view for these listings in the organization account isn’t visible to consumers.

---
title: About privacy domains
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-privacy-domains.md
section: User Guide
---

# About privacy domains

Within [differential privacy in Snowflake](differential-privacy-overview.md), a *privacy domain* defines the possible
values in a column, similar to a mathematical domain. A privacy domain is either a range of values with a minimum and maximum or an
enumerated list of values.

The privacy domain is one factor that Snowflake uses to calculate the amount of [noise](differential-privacy-overview.md) that
must be added to preserve privacy. Because of this, most fields should have a finite privacy domain; otherwise, the amount of noise added
would need to be infinite. By default, fields without privacy domains are assumed to have an infinite domain.

## Which columns need a privacy domain?

With the exception of a COUNT function, a query cannot aggregate a column unless the column has a privacy domain. Similarly, a query cannot
use a column in a GROUP BY clause unless the column has a privacy domain. For example, in the following query, score needs to have a
privacy domain, but age does not:

```sqlexample
SELECT COUNT(age) AS count_age where age >= 20 and age <= 100 FROM t1 GROUP BY score
```

## Defining a privacy domain

While both administrators and the analysts who are running queries can define a privacy domain for a column, they do so in different ways:

* An administrator uses CREATE TABLE and ALTER TABLE commands to set a privacy domain for a column. An administrator for the data provider
  sets privacy domains before giving access to analysts. [In some circumstances](differential-privacy-analyst.md), an
  administrator for the analyst might also need to set privacy domains on tables being joined with the data provider’s protected tables. If
  you’re an administrator who needs to set privacy domains, see [Working with privacy domains as an administrator](differential-privacy-privacy-domains-admin.md).
* An analyst shapes a query to implicitly specify a privacy domain using query elements like filters and column transformations. These
  privacy domains can be specified for columns without a privacy domain or can narrow a privacy domain set by the data provider. If you’re
  an analyst who needs to specify or narrow a privacy domain, see [Working with privacy domains as an analyst](differential-privacy-privacy-domains-analyst.md).

## Interactions between privacy domains

Multiple privacy domains can be involved in a query. There can be an admin-specified privacy domain and an analyst-specified privacy
domain on the same column. Alternatively, a query might join two tables on a column that has a privacy domain in both tables.

Snowflake evaluates all privacy domains and calculates the privacy domain to use for the duration of the query. For information about
how this query-time privacy domain is determined, see:

* Interaction between admin-specified and analyst-specified privacy domains
* Privacy domains and joins

### Interaction between admin-specified and analyst-specified privacy domains

An analyst uses query elements to implicitly specify a privacy domain for a column. For example, filtering on a column defines a privacy
domain for it. This analyst-specified privacy domain exists only for the duration of the query; it doesn’t change the privacy domain that
an administrator set on the column.

An analyst-specified privacy domain can narrow an admin-specified privacy domain, but can never expand it. The query-time privacy domain
is the intersection between the privacy domain specified by the query and the privacy domain set by the administrator. For example, if the
data provider set the privacy domain as a range (5, 15) and the query uses filters to specify the privacy domain as a range (0, 10), then
the effective, query-time privacy domain is (5, 10).

Similarly, if the administrator set the privacy domain as a list ( ‘blue’, ‘yellow’ ) and the query uses filters to specify a
privacy domain of ( ‘orange’, ‘blue’) , the query-time privacy domain is ( ‘blue’ ).

### Privacy domains and joins

When an analyst joins two tables on a column that has a privacy domain in both tables, the type of join determines the
query-time privacy domain. During the duration of the query, the effective privacy domain can be the intersection of the two privacy
domains, the union of the two privacy domains, or just one of the privacy domains.

In the following table, `domainL` refers to the privacy domain on the join column in the left table and `domainR` refers to the privacy
domain on the join column in the right table.

| Join type | Query-time privacy domain |
| --- | --- |
| INNER | Intersection of `domainL` and `domainR` |
| OUTER | Union of `domainL` and `domainR` |
| LEFT | `domainL` |
| RIGHT | `domainR` |
| LEFT SEMI | Intersection of `domainL` and `domainR` |
| LEFT ANTI | `domainL` |

For example, suppose the `day` column in `t1` has a privacy domain of (1, 100) and the `day` column in `t2` has a privacy domain of
(0, 90). When an analyst joins `t1` and `t2` on `day`, the query-time privacy domain is (1, 90), which is the intersection of the two
privacy domains.

## Values outside a privacy domain

A privacy domain defines *possible* values in a column, not necessarily *actual* values. The following summarizes what happens to values
that are not included in the list or range of the privacy domain.

Strings
:   Values in a string column that fall outside the privacy domain are always treated as NULL for the duration of the query. This is true
    regardless of whether it is an admin-specified privacy domain, an analyst-specified privacy domain, or an intersection of privacy
    domains.

    For example, suppose the data provider set a privacy domain on a column `state` of (`'california'`, `'oregon'`) and the analyst
    wrote a query that filters the `state` column to (`'nevada'`, `'oregon'`). If the query uses the `state` column in a GROUP BY
    clause, then the result contains two groups: `OREGON` and `NULL`. The `NULL` group includes all records where the value of
    the `state` column is not `OREGON` along with records where the value of the `state` column is literally `NULL`.

Numeric, date, and time
:   Snowflake treats numeric, date, and time values that fall outside the range of a privacy domain differently depending on
    whether the privacy domain was defined by an administrator or an analyst.

    Admin-specified:
    :   When the data provider defines a range privacy domain that contains a subset of the column’s actual values, the values outside the
        privacy domain are *clamped*, meaning they are treated as if they are the nearest value in the domain (the minimum or maximum
        value). For example, if the privacy domain of a column consists of integers between 1-100, a record with an actual value of 105 is
        treated as if it has a value of 100 when calculating aggregations. Analysts cannot access values outside the privacy domain.

        When a join of two privacy-protected tables results in the intersection of privacy
        domains, values outside the query-time privacy domain are clamped.

    Analyst-specified:
    :   When an analyst specifies a privacy domain for a column that doesn’t have one or narrows an admin-specified privacy domain, the
        query itself determines what happens to values that fall outside the privacy domain.

        * If the query uses a filter ([WHERE clause](differential-privacy-privacy-domains-analyst.md)), values outside of
          the privacy domain are ignored when calculating aggregations.
        * If the query uses a [column transformation](differential-privacy-privacy-domains-analyst.md), values in
          the column that are outside of the privacy domain are clamped like an admin-specified privacy domain.

## How intermediary query elements affect privacy domains

How a query is written can affect whether the range of a privacy domain changes or even whether a privacy domain still exists on a column.
This section helps you understand how intermediary parts of a query, that is, parts of the query before the final aggregation, can affect
the privacy domain of a column.

Adding new columns
:   If a query adds a new column that is based on an existing column, specifying or narrowing a privacy domain on the original column has no
    effect on the new column.

    In the following example, assume the data provider defined the privacy domain on the `score` column as a range between 0 and 100. When
    the query specifies the privacy domain of `score` as a range between 1 and 2, it has no effect on the privacy domain of the column
    `score_derived`.

    ```sqlexample
    SELECT COUNT(score_derived)
      FROM (SELECT score, score_derived FROM t1 WHERE score <= 2);
    ```

    For example, the output might be:

    ```output
    ----------------------------
    |"count(""SCORE_DERIVED"")"|
    ----------------------------
    |31                        |
    ----------------------------
    ```

Using a GROUP BY clause in intermediary aggregations
:   For intermediary portions of a query, using a GROUP BY clause while aggregating a column removes the privacy domain from the column. As a
    result, you need to specify a new privacy domain on the column if it is used in the final aggregation of the query.

    In the following example, the initial aggregation removes any privacy domain that has been set on the `score` column. The query
    succeeds only because it sets a privacy domain on the alias of the column before the final aggregation.

    ```sqlexample
    SELECT COUNT(num_scores)
      FROM (SELECT COUNT(score) AS num_scores
        FROM t1
        GROUP BY age)
      WHERE num_scores >= 0 AND num_scores <= 100;
    ```

---
title: About Secure Data Sharing
source: https://docs.snowflake.com/en/user-guide/data-sharing-intro.md
section: User Guide
---

# About Secure Data Sharing

Secure Data Sharing lets you share selected objects in a database in your account with other Snowflake accounts. You can share the
following Snowflake objects:

* Databases
* Tables
* Dynamic tables
* External tables
* Externally managed and managed Apache Iceberg™ tables
* Externally managed Delta Lake tables (with Delta Direct and catalog-linked databases)
* Views

  + Regular views
  + Secure views
  + Secure materialized views
  + Semantic views
* Cortex Search services
* User-defined functions (UDFs) (secure and non-secure)
* Models of type USER_MODEL, CORTEX_FINETUNED, or DOC_AI

Snowflake enables the sharing of databases through *shares*, which are created by data providers and “imported” by data consumers.

> **Important:**
>
> All database objects shared between accounts are read-only (i.e. the objects cannot be modified or deleted, including adding or
> modifying table data).

## How does Secure Data Sharing work?

With Secure Data Sharing, no actual data is copied or transferred between accounts. All sharing uses Snowflake’s
services layer and metadata store. Shared data does not take up any storage in a consumer account and therefore does not contribute to the
consumer’s monthly data storage charges. The only charges to consumers are for the compute resources (i.e. virtual warehouses) used
to query the imported data.

Because no data is copied or exchanged, Secure Data Sharing setup is quick and easy for providers and access to the imported
data is near-instantaneous for consumers:

* The provider creates a share of a database in their account and grants access to specific objects in the database. The provider can also
  share data from multiple databases, as long as these databases belong to the same account. One or more accounts are then added to the
  share, which can include your own accounts (if you have multiple Snowflake accounts).

  For more details, refer to What is a share? (in this topic).
* On the consumer side, a read-only database is created from the share. Access to this database is configurable using the same,
  standard role-based access control that Snowflake provides for all objects in the system.

With this architecture, Snowflake enables a network of providers that can share data with multiple consumers (including within
their own organization) and consumers that can access imported data from multiple providers:

> **Note:**
>
> Any full Snowflake account can both provide and consume imported data. Snowflake also supports third-party accounts, a special type of
> account that consumes imported data from a single provider account. For more details, refer to Reader accounts for third-party access
> (in this topic).

## What is a share?

Shares are named Snowflake objects that encapsulate all of the information required to share a database.

Data providers add Snowflake objects (databases, schemas, tables, secure views, etc.) to a share using either or both of the
following options:

* **Option 1:** Grant privileges on objects to a share via a database role.
* **Option 2:** Grant privileges on objects directly to a share.

For more information on these options, refer to [How to share database objects](data-sharing-gs.md).

You choose which accounts can consume data from the share by adding the accounts to the share.

After a database is created (in a consumer account) from a share, all the imported objects are accessible to users in the consumer account:

Shares are secure, configurable, and controlled completely by the provider account:

* New objects added to a share become immediately available to all consumers, providing real-time access to imported data.
* Updates to existing objects in a share become immediately available to all consumers.
* Access to a share (or any of the objects in a share) can be revoked at any time.

## Options for sharing in Snowflake

You can share data in Snowflake using one of the following options:

* a Listing, in which you offer a share and additional metadata as a data product to one or more accounts,
* a Direct Share, in which you directly share specific database objects (a share) to another account in your region,
* a Data Exchange, in which you set up and manage a group of accounts and offer a share to that group,
* a clean room, in which you can share data and control which queries can be run against our data.

You can also convert a direct share to a listing. For instructions, see [Convert a direct share to a listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing#convert-a-direct-share-to-a-private-listing).

See [Data sharing and collaboration in Snowflake](../guides-overview-sharing.md) for more details.

## Overview of data providers and consumers

When sharing in Snowflake, the account that shares data is called a provider, and the account that is a recipient of the data is called a
consumer.

### About providers

A data provider is any Snowflake account that creates shares and makes them available to other Snowflake accounts to consume. As a
data provider, you share a database with one or more Snowflake accounts. For each database you share, Snowflake supports using grants
to provide granular access control to selected objects in the database (i.e., you grant access privileges for one or more specific
objects in the database).

You can create as many shares as you want, and add as many accounts to a share as you want. If you want to provide a share to many accounts,
you might want to use a [listing](../collaboration/collaboration-listings-about.md) or a [data exchange](data-exchange.md).

For a guide to sharing data as a provider, refer to [Share secure database objects](data-sharing-gs.md). For more detailed information, refer to
[Create and configure shares](data-sharing-provider.md).

### About consumers

A data consumer is any account that chooses to create a database from a share made available by a data provider. As a data consumer,
once you add an imported database to your account, you can access and query the objects in the database just as you would with any
other database in your account.

You can consume as many shares as you want from data providers, but you can only create one database per share.

For more details, refer to [Consume imported data](data-share-consumers.md).

## Usage metrics shared with providers

If you provide listings privately, using a data exchange, or on the Snowflake Marketplace, you have access to various metrics about consumer
usage of your listings, and metrics about the consumer accounts accessing your listings.

For details about usage data for listings, refer to
[Monitor listing use](../collaboration/provider-listings-monitor-studio.md). Usage data
for listings shared in a data exchange is only available in the views contained in the [Data Sharing Usage](../sql-reference/data-sharing-usage.md) schema of
the imported Snowflake database.

## Reader accounts for third-party access

Data sharing is only supported between Snowflake accounts. As a data provider, you might want to share data with a consumer who does
not already have a Snowflake account or is not ready to become a licensed Snowflake customer.

To facilitate sharing data with these consumers, you can create reader accounts. Reader accounts (formerly
known as “read-only accounts”) provide a quick, easy, and cost-effective way to share data without requiring the consumer to become
a Snowflake customer.

Each reader account belongs to the provider account that created it. As a provider, you use *shares* to share databases with reader
accounts; however, a reader account can only consume data from the provider account that created it. Refer to the following diagram:

Users in a reader account can query data that has been imported with the reader account, but cannot perform any of the DML tasks that are
allowed in a full account, such as data loading, insert, update, and similar data manipulation operations.

For more details, refer to [Manage reader accounts](data-sharing-reader-create.md).

---
title: Access a billing usage statement
source: https://docs.snowflake.com/en/user-guide/billing-usage-statement.md
section: User Guide
---

# Access a billing usage statement

Snowflake generates a monthly usage statement for customers who have at least one active contract, also known as the Snowflake Order Form.
This statement itemizes usage for the month as expressed in credits consumed and currency spent. It also contains a summary of usage during
the life of the contract.

Snowsight allows you to view and download monthly usage statements, starting with July 2023. To access these statements, either of
the following must be true:

* You have been granted the GLOBALORGADMIN, and you are in the [organization account](organization-accounts.md).
* You have been granted the ACCOUNTADMIN and ORGADMIN roles, and you are in an account that has the ORGADMIN role enabled.

Keep the following in mind when accessing usage statements:

* If your organization has multiple contracts, Snowflake generates a separate usage statement for each contract.
* If you renew a contract in the middle of the month, Snowflake generates two separate usage statements for the contract.
* Customers who signed a contract through a Snowflake reseller cannot view or download usage statements.
* You cannot access usage statements from an account in [US SnowGov Regions](intro-regions.md) on AWS
  GovCloud and Microsoft Azure Government.

To access a usage statement:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. On the Snowflake billing tab, view or download the usage statement.

---
title: Access a Data Exchange
source: https://docs.snowflake.com/en/user-guide/data-exchange-accessing.md
section: User Guide
---

# Access a Data Exchange

When logging in to the Data Exchange for administrative purposes (e.g. joining the exchange, configuring the exchange, configuring data listings), the member must have the ACCOUNTADMIN role.

When logging in to the Data Exchange as a consumer:

* All roles can browse data listings.
* All roles with the ACCOUNTADMIN role can request and get data.
* All roles with the [IMPORT SHARE](security-access-privileges-shares.md) and CREATE DATABASE privileges can get and request data.

> **Note:**
>
> If you are using private connectivity to the Snowflake service and wish to access the Snowflake Marketplace through the new Snowflake web interface, you must first create a CNAME record, as described in the Snowflake documentation:
>
> * [AWS PrivateLink and Snowflake](admin-security-privatelink.md)
> * [Azure Private Link and Snowflake](privatelink-azure.md)
> * [Google Cloud Private Service Connect and Snowflake](private-service-connect-google.md)

## Sign in to your Data Exchange as a Data Exchange Admin

To access your Data Exchange, sign in to [Snowsight](ui-snowsight-gs.md).

After your Data Exchange is provisioned by Snowflake, you can administer your exchange using Snowsight.
See [Configure and use a Data Exchange](data-exchange-using.md).

To add or remove provider profiles, use the Provider Profiles tab of the Manage Exchanges page.
See [Manage provider profiles](data-exchange-becoming-a-provider.md).

## Sign in to a Data Exchange as a member

> **Note:**
>
> To access a data exchange, your account must be added to the exchange by the Data Exchange Admin.

After you become a member of a data exchange:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared with you tab.

See [Configure and use a Data Exchange](data-exchange-using.md). If you are a data provider, you can also manage data in the exchange.

---
title: Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md
section: User Guide
---

# Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog

Access Snowflake-managed Apache Iceberg™ tables by using an external query engine through
Snowflake Horizon Catalog. To ensure this interoperability with external engines, [Apache Polaris™](https://github.com/apache/polaris)
is integrated into Horizon Catalog. In addition, Horizon Catalog exposes the Apache Iceberg™ REST API (Horizon Iceberg REST Catalog API). This
API lets you access the tables by using external query engines.

You can use Horizon Catalog, which is available in all your existing Snowflake accounts, to read and write to Snowflake-managed Iceberg
tables with external query engines. By using Horizon Catalog, you don’t need to sync Snowflake managed Iceberg tables through Snowflake Open
Catalog or create a separate Snowflake Open Catalog account to access Snowflake-managed Iceberg tables with external query engines.

## Query Iceberg tables

By connecting an external query engine to Iceberg tables through Horizon Catalog, you can perform the following tasks:

* Use any external query engine that supports the open Iceberg REST protocol to query these tables, such as Apache Spark™.
* Query any existing and new Snowflake-managed Iceberg tables in a new or existing Snowflake account by using a single Horizon Catalog endpoint.
* Query the tables by using your existing users, roles, policies, and authentication in Snowflake.
* Use vended credentials.

For more information about Snowflake Horizon Catalog, see [Snowflake Horizon Catalog](snowflake-horizon.md).

## Write to Iceberg tables

Writing to Iceberg tables by using an external query engine through Horizon Catalog is in public preview. To write to tables, follow the
workflow for accessing Iceberg tables by using an external query engine.
When you configure access control, ensure that you
configure write access to your tables.

Then write to Iceberg tables.

The following diagram shows external query engines reading and writing to Snowflake-managed Iceberg tables through Horizon Catalog and Snowflake reading and
writing to these tables:

## Billing

* The Horizon Iceberg REST Catalog API is available in all Snowflake editions.
* The API requests are billed as 0.5 credit per million calls and charged as Cloud Services.
* For cross-region data access, standard cross-region data egress charges as stated in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) are applicable.

> **Note:**
>
> Billing for this feature is scheduled to begin in mid-2026, subject to change.

## Supported external engines and catalogs

The following tables, although not exhaustive, show many external engines and catalogs that integrate with the Horizon Iceberg REST Catalog API.
This integration enables access to Snowflake managed Iceberg tables through external systems.

### Supported external engines

The following external query engines integrate with the Horizon Iceberg REST Catalog API:

| Product | Access Snowflake-managed Iceberg tables through Horizon Catalog |
| --- | --- |
| Apache Doris™ | ✔ |
| Apache Flink™ | ✔ |
| Apache Spark™ | ✔ |
| Dremio | ✔ |
| DuckDB | ✔ |
| PyIceberg | ✔ |
| StarRocks | ✔ |
| Trino | ✔ |

### Supported external catalogs

The following external catalogs integrate with the Horizon Iceberg REST Catalog API:

| Product | Access Snowflake-managed Iceberg tables through Horizon Catalog | Comment |
| --- | --- | --- |
| Apache Polaris™ | ✔ |  |
| AWS Glue | ✔ | For instructions on how to configure this integration, see [Access Snowflake Horizon Catalog data using catalog federation in the AWS Glue Data Catalog](https://aws.amazon.com/blogs/big-data/access-snowflake-horizon-catalog-data-using-catalog-federation-in-the-aws-glue-data-catalog/) in the AWS Big Data Blog. |
| Palantir Foundry | ✔ | For instructions on how to configure this integration, see [Iceberg tables (virtual tables only)](https://www.palantir.com/docs/foundry/available-connectors/snowflake#iceberg-tables-virtual-tables-only) in the Palantir documentation. |
| Databricks Unity Catalog | Not announced |  |
| Google BigLake Metastore | In development |  |
| Microsoft Fabric / Synapse | In development |  |

## Prerequisites

Retrieve the account identifier for your Snowflake account that contains the Iceberg tables that you want to access. For instructions,
see [Account identifiers](admin-account-identifier.md). You specify this identifier when you
connect an external query engine to your Iceberg tables.

> **Tip:**
>
> To get your account identifier by using SQL, you can run the following command:
>
> ```sqlexample
> SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();
> ```

## (Optional) Private connectivity

For secure connectivity, consider configuring [Inbound](private-connectivity-inbound.md) and
[Outbound](private-connectivity-outbound.md) private connectivity for your Snowflake account while you access the
Horizon Catalog endpoint.

> **Note:**
>
> Private connectivity is only supported for Snowflake-managed Iceberg tables stored on Amazon S3 or Azure Storage (ADLS).

## Workflow for accessing Iceberg tables by using an external query engine

To access Iceberg tables by using an external query engine, complete the following steps:

1. Create Iceberg tables
2. Configure access control
3. Obtain an access token for authentication
4. Verify access token permissions
5. (Optional) Configure data protection policies
6. Connect an external query engine to Iceberg tables through Horizon Catalog
7. Query Iceberg tables or
   write to Iceberg tables

## Step 1: Create Iceberg tables

> **Important:**
>
> If you already have Snowflake-managed Iceberg tables that you want to access, you can skip this step.

In this step, you create Snowflake-managed Iceberg tables that use Snowflake as the catalog, so you can access them with an external
query engine. For instructions, see the following topics:

* [Tutorial: Create your first Apache Iceberg™ table](tutorials/create-your-first-iceberg-table.md): A tutorial that shows how to create a database, create a Snowflake-managed Iceberg table, and load data into the table.
* [Create a Snowflake-managed Iceberg table](tables-iceberg-create.md): Example code for creating a Snowflake-managed Iceberg table.

## Step 2: Configure access control

> **Important:**
>
> If you already have roles that are configured with access to the Iceberg tables that you want to access, you can skip this step.

In this step, you configure access control for the Snowflake-managed Iceberg tables that you want to access with an external query engine.
For example, you can set up the following roles in Snowflake:

* data_engineer role, which has access to all schemas and all Snowflake-managed Iceberg tables in a database.
* data_analyst role, which has access to one schema in the database and only access to two Snowflake-managed Iceberg tables within that schema.

For more information, see the following sections:

* Configure read access to your Iceberg tables
* Configure write access to your Iceberg tables

### Configure read access to your Iceberg tables

To query Iceberg tables, the role used to perform the operation must have the SELECT privilege on the Iceberg table and the USAGE
privilege on the parent database and schema. For an example of granting these privileges to a role, see
Example: Set up a service account user.

> **Important:**
>
> The role that has the OWNERSHIP privilege on an Iceberg table must maintain the USAGE privilege on the external volume associated with
> the table. If the owner role doesn’t have USAGE on the external volume, any read or write table operation that asks for vended credentials
> will fail.

#### Example: Set up a service account user

The following example sets up a service account user in Snowflake with read-only access to an Iceberg table:

* Creates a `data_engineer` role.
* Grants the `data_engineer` role USAGE and MONITOR privileges on the `iceberg_test_db` database and its `public` schema.
* Grants SELECT privileges on the `test_table` Iceberg table.
* Creates a service user named `horizon_rest_srv_account_user` and assigns the `data_engineer` role to that user.

```sqlexample
CREATE OR REPLACE ROLE data_engineer;

GRANT USAGE ON DATABASE iceberg_test_db TO ROLE data_engineer;
GRANT USAGE ON SCHEMA iceberg_test_db.public TO ROLE data_engineer;

GRANT SELECT ON TABLE iceberg_test_db.public.test_table TO ROLE data_engineer;

CREATE OR REPLACE USER horizon_rest_srv_account_user TYPE=SERVICE DEFAULT_ROLE=data_engineer;

GRANT ROLE data_engineer TO USER horizon_rest_srv_account_user;
```

#### (Optional) Apply future grants on Iceberg tables

To ensure access to any new Iceberg tables created in a schema, use the
[GRANT … ON FUTURE ICEBERG TABLES](../sql-reference/sql/grant-privilege.md) syntax.

The following example grants the `data_engineer` role access to any Iceberg tables created under a schema named `my_schema`.

```sqlexample
GRANT SELECT ON FUTURE ICEBERG TABLES IN SCHEMA my_db.my_schema TO ROLE data_engineer;
```

For more information about access control in Snowflake, see the following topics:

* [Overview of Access Control](security-access-control-overview.md)
* [Configuring access control](security-access-control-configure.md)

### Configure write access to your Iceberg tables

The following table describes the privileges required for write operations on Iceberg tables:

| Operation | Necessary privileges |
| --- | --- |
| Data Manipulation Language (DML) operations | **Important:** A role used to execute the operation must have *all* of the following privileges:   * SELECT, UPDATE, TRUNCATE, INSERT, and DELETE privileges on the table * USAGE privilege for the parent schema where the table is nested under * USAGE privilege on the parent database or schema under which the table is nested |
| CREATE ICEBERG TABLE | A role used to execute the operation must have the following privileges:   * CREATE ICEBERG TABLE privilege on schema * USAGE privilege on the external volume |
| CREATE SCHEMA | A role used to execute the operation must have the CREATE SCHEMA privilege on the parent database. |
| Rename a table | A role used to execute the operation must have the OWNERSHIP privilege on the table.  **Important:** To move the table to a new schema, ensure that your role also has the CREATE ICEBERG TABLE privilege on the destination schema. |
| All other operations on a table | A role used to execute the operation must have the OWNERSHIP privilege on the table in addition to the privileges on the schema and database. For example, you must have these privileges to run the ALTER ICEBERG TABLE … ADD COLUMN or ALTER ICEBERG TABLE … DROP COLUMN operation. |

For more information about access control in Snowflake, see the following topics:

* [Overview of Access Control](security-access-control-overview.md)
* [Configuring access control](security-access-control-configure.md)

## Step 3: Obtain an access token for authentication

In this step, you obtain an access token, which you must have to authenticate to the Horizon Catalog endpoint for your Snowflake account. You
need to obtain an access token for each user — service or human — and role that is configured with access to Snowflake-managed Iceberg tables. For example, you need to
obtain one access token for a user with DATA_ENGINEER role and another user with a DATA_ANALYST role.

You specify this access token later when you
connect an external query engine to Iceberg tables through Horizon Catalog.

You can obtain an access token by using one of the following authentication options:

* External OAuth
* Key-pair authentication
* Programmatic access token (PAT)

### External OAuth

If you’re using External OAuth, generate an access token for your identity provider. For instructions, see [External OAuth overview](oauth-ext-overview.md).

> **Note:**
>
> For External OAuth, alternatively, you can configure your connection to the engine with automatic token refresh instead of specifying
> an access token.

### Key-pair authentication

If you use key-pair authentication, to obtain an access token, you sign a JSON web token (JWT) with your
private key.

The following steps cover how to generate an access token for key-pair authentication:

1. Configure key-pair authentication
2. Grant a role to the user
3. Generate a JSON Web Token (JWT)
4. Generate an access token

#### Step 1: Configure key-pair authentication

In this step, you perform the following tasks:

* Generate a private key
* Generate a public key
* Store the private and public keys securely
* Grant the privilege to assign a public key to a Snowflake user
* Assign the public key to a Snowflake user
* Verify the user’s public key fingerprint

For instructions, see [Configuring key-pair authentication](key-pair-auth.md).

#### Step 2: Grant a role to the user

To grant to the key-pair authentication user the Snowflake role that has privileges to the tables you want to access, run the [GRANT ROLE](../sql-reference/sql/grant-role.md) command.
For example, to grant the ENGINEER role to the `my_service_user` user, run
the following command:

```sqlexample
GRANT ROLE ENGINEER to user my_service_user;
```

#### Step 3: Generate a JSON Web Token (JWT)

In this step, you use SnowSQL to generate a JSON Web Token (JWT) for key-pair authentication.

> **Note:**
>
> * You must have [SnowSQL](https://www.snowflake.com/developers/downloads/snowsql/) installed on your machine.
> * Alternatively, you can use Python, Snowflake CLI, Java, or Node.js to generate a JWT. For an example, see the following sections:
>
>   + [Python example](../developer-guide/sql-api/authenticating.md)
>   + [Snowflake CLI example](../developer-guide/sql-api/authenticating.md)
>   + [Java example](../developer-guide/sql-api/authenticating.md)
>   + [Node.js example](../developer-guide/sql-api/authenticating.md)

Use SnowSQL to generate a JWT:

```bash
snowsql --private-key-path "<private_key_file>" \
  --generate-jwt \
  -h "<account_identifier>.snowflakecomputing.com" \
  -a "<account_locator>" \
  -u "<user_name>"
```

Where:

* `<private_key_file>` is the path to your private key file that corresponds to the public key assigned to your Snowflake user.
  For example: `/Users/jsmith/.ssh/rsa_key.p8`.
* `<account_identifier>` is the account identifier for your Snowflake account, in the format `<organization_name>-<account_name>`.
  To find the account identifier, see Supported external engines and catalogs.
  An example of an account identifier is `myorg-myaccount`.
* `<account_locator>` is the account locator for your Snowflake account.

  To find your account locator, see
  [Locate your Snowflake account information in Snowsight](ui-snowsight-gs.md) and view the *Account locator* in the Account Details dialog.
* `<user_name>` is the user name for a Snowflake user with the public key assigned to the user.

#### Step 4: Generate an access token

> **Important:**
>
> To generate an access token, you must first generate a JWT.
> You must first generate a JWT because you use the JWT to
> generate the access token.

Use a `curl` command to generate an access token:

```bash
curl -i --fail -X POST "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens" \
 --header 'Content-Type: application/x-www-form-urlencoded' \
 --data-urlencode 'grant_type=client_credentials' \
 --data-urlencode 'scope=session:role:<role>' \
 --data-urlencode 'client_secret=<JWT_token>'
```

Where:

* `<account_identifier>` is the account identifier for your Snowflake account, in the format `<organization_name>-<account_name>`.
  To find the account identifier, see Supported external engines and catalogs.
  An example of an account identifier is `myorg-myaccount`.
* `<role>` is the Snowflake role that is granted access to Iceberg tables, such as ENGINEER.
* `<JWT_token>` Is the JWT that you generated in the previous step.

### Programmatic access token (PAT)

If you use PATs, generate a PAT for authentication.

First, you generate a PAT, which you use to connect an external query engine to Iceberg tables.
Then, you generate an access token, which you only use to verify the permissions for your PAT.

#### Step 1: Generate a PAT

For instructions on how to configure and generate a PAT,
see [Using programmatic access tokens for authentication](programmatic-access-tokens.md).

The following example creates a programmatic access token (PAT) for the service account user that you created in the previous step by
using the [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-add-programmatic-access-token.md) command:

```sqlexample
ALTER USER IF EXISTS HORIZON_REST_SRV_ACCOUNT_USER
ADD PAT HORIZON_REST_SRV_ACCOUNT_USER_PAT
  DAYS_TO_EXPIRY = 7
  ROLE_RESTRICTION = 'DATA_ENGINEER'
  COMMENT = 'HORIZON REST API PAT FOR SERVICE ACCOUNT';
```

#### Step 2: Generate an access token for your PAT

In this step, you generate an access token for your PAT.

> **Attention:**
>
> You only specify the access token that you generate in this step when you
> verify the permissions
> for your PAT. When you
> connect an external query engine to Iceberg tables,
> you must specify your PAT that you generated in the previous step, not the access token that you generate in this step.

Use a `curl` command to generate an access token for your PAT:

```bash
curl -i --fail -X POST "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens" \
 --header 'Content-Type: application/x-www-form-urlencoded' \
 --data-urlencode 'grant_type=client_credentials' \
 --data-urlencode 'scope=session:role:<role>' \
 --data-urlencode 'client_secret=<PAT_token>'
```

Where:

* `<account_identifier>` is the account identifier for your Snowflake account, in the format `<organization_name>-<account_name>`.
  To find the account identifier, see Supported external engines and catalogs.
  An example of an account identifier is `myorg-myaccount`.
* `<role>` is the Snowflake role that is granted to your PAT and has access to the Iceberg tables that you want to query or write to, such as ENGINEER.
* `<PAT_token>` is the value for the PAT token that you generated in the previous step.

## Step 4: Verify access token permissions

In this step, you verify the permissions for the access token that you obtained in the previous step.

* Verify access to the Horizon IRC endpoint
* Retrieve the metadata for a table

### Verify access to the Horizon IRC endpoint

Use a `curl` command to verify that you have permission to access your Horizon IRC endpoint:

```bash
curl -i --fail -X GET "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog/v1/config?warehouse=<database_name>" \
-H "Authorization: Bearer <access_token>" \
-H "Content-Type: application/json"
```

Where:

* `<account_identifier>` is the account identifier for your Snowflake account, in the format `<organization_name>-<account_name>`.
  To find the account identifier, see Supported external engines and catalogs.
  An example of an account identifier is `myorg-myaccount`.
* `<access_token>` is your access token that you generated. If you’re using a PAT, this value is the access token you generated, not the
  *personal access token (PAT)* you generated.
* `<database_name>` is the name of the database that contains the Iceberg tables that you want to access.

  > **Important:**
  >
  > If your database was created without quotes around the name, you must specify the database name in *all capital letters*, even if it was created with lowercase letters.

Example return value:

```output
{
  "defaults": {
    "default-base-location": ""
  },
  "overrides": {
    "prefix": "MY-DATABASE"
  }
}
```

### Retrieve the metadata for a table

You can also make a GET request to retrieve the metadata for a table. Snowflake uses the
[loadTable](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L616)
operation to load table metadata from your REST catalog.

```bash
curl -i --fail -X GET "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog/v1/<database_name>/namespaces/<namespace_name>/tables/<table_name>" \
 -H "Authorization: Bearer <access_token>" \
 -H "Content-Type: application/json"
```

Where:

* `<account_identifier>` is the account identifier for your Snowflake account, in the format `<organization_name>-<account_name>`.
  To find the account identifier, see Supported external engines and catalogs.
  An example of an account identifier is `myorg-myaccount`.
* `<database_name>` is the database of the table whose metadata you want to retrieve.
* `<namespace_name>` is the namespace of the table whose metadata you want to retrieve.
* `<table_name>` is the table whose metadata you want to retrieve.
* `<access_token>` is your access token that you generated. If you’re using a PAT, this value is the
  access token you generated, not the
  *personal access token (PAT)* you generated.

> **Important:**
>
> If your database, namespace, or table was created without quotes around the name, you must specify the database, namespaces, or table name in *all capital letters*, even if the object was created with lowercase
> letters.

## (Optional) Step 5: Configure data protection policies

In this step, you configure data protection policies for Iceberg tables. If you don’t have tables that you need to
protect with Snowflake data policies, you can proceed to the next step.

> **Note:**
>
> Tables protected by data protection policies can be accessed over the Horizon Iceberg REST API and by using Apache Spark™.

For instructions on how to configure data protection policies, see [Configure data protection policies on Iceberg tables accessed over Horizon Iceberg REST API and using Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Step 6: Connect an external query engine to Iceberg tables through Horizon Catalog

In this step, you connect an external query engine to Iceberg tables through Horizon Catalog. With this connection, you can access the tables
by using the external query engine.

The external engines use the Apache Iceberg™ REST endpoint exposed by Snowflake. For your Snowflake account, this endpoint is
in the following format:

```none
https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog
```

The example code in this step shows how to set up a connection in Spark, and the example code is in PySpark. For more information,
see the following sections:

* Connect by using External OAuth or key pair authentication
* Connect by using a programmatic access token (PAT)

### Connect by using External OAuth or key pair authentication

Use one of the following configurations to connect:

* To access Iceberg tables that *don’t* have Snowflake data protection policies configured, connect an external query engine without enforcing data policies.
* To access Iceberg tables that have Snowflake row access and masking policies configured, connect an external query engine with data policies enforced.

#### Connect an external query engine without enforcing data policies

* To connect the external query engine to Iceberg tables by using External OAuth or key pair authentication. Use the following example code.

This code doesn’t enforce data protection policies:

```python
# Snowflake Horizon Catalog Configuration, change as per your environment

CATALOG_URI = "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog"
HORIZON_SESSION_ROLE = f"session:role:<role>"
CATALOG_NAME = "<database_name>" #provide in UPPER CASE

# Cloud Service Provider Region Configuration (where the Iceberg data is stored)
REGION = "eastus2"

# Paste the External Oauth Access token that you generated in Snowflake here
ACCESS_TOKEN = "<your_access_token>"

# Iceberg Version
ICEBERG_VERSION = "1.9.1"

def create_spark_session():
  """Create and configure Spark session for Snowflake Iceberg access."""
  spark = (
      SparkSession.builder
      .appName("SnowflakeIcebergReader")
      .master("local[*]")

# JAR Dependencies for Iceberg and Azure
      .config(
          "spark.jars.packages",
          f"org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{ICEBERG_VERSION},"
          f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION}"
          # for Azure storage, use the below package and comment above azure bundle
          # f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"
      )

      # Iceberg SQL Extensions
      .config("spark.sql.extensions", "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
      .config("spark.sql.defaultCatalog", CATALOG_NAME)

      # Horizon REST Catalog Configuration
      .config(f"spark.sql.catalog.{CATALOG_NAME}", "org.apache.iceberg.spark.SparkCatalog")
      .config(f"spark.sql.catalog.{CATALOG_NAME}.type", "rest")
      .config(f"spark.sql.catalog.{CATALOG_NAME}.uri", CATALOG_URI)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.warehouse", CATALOG_NAME)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.token", ACCESS_TOKEN)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.scope", HORIZON_SESSION_ROLE)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.client.region", REGION)

      # Required for vended credentials
      .config(f"spark.sql.catalog.{CATALOG_NAME}.header.X-Iceberg-Access-Delegation", "vended-credentials")
      .config("spark.sql.iceberg.vectorization.enabled", "false")
      .getOrCreate()
  )
  spark.sparkContext.setLogLevel("ERROR")
  return spark
```

Where:

* `<account_identifier>` is your Snowflake account identifier for the Snowflake account that contains the Iceberg tables that you
  want to access. To find this identifier, see Supported external engines and catalogs.
* `<your_access_token>` is your access token that you obtained. To obtain it, see Step 3: Obtain an access token for authentication.

  > **Note:**
  >
  > For External OAuth, alternatively, you can configure your connection to the engine with automatic token refresh instead of specifying
  > an access token.
* `<database_name>` is the name of the database in your Snowflake account that contains Snowflake-managed Iceberg tables that you want to access.

  > **Note:**
  >
  > The `.warehouse` property in Spark expects your Snowflake *database* name, not your Snowflake warehouse name.
* `<role>` is the role in Snowflake that is configured with access to the Iceberg tables that you want to access. For example: DATA_ENGINEER.

> **Important:**
>
> By default, the code example is set up for Apache Iceberg™ tables stored on Amazon S3. If your Iceberg tables are stored on Azure Storage (ADLS),
> perform the following steps:
>
> > 1. Comment out the following line: `f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION}"`
> > 2. Uncomment the following line: `# f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"`

#### Connect an external query engine with data policies enforced

* To connect with data protection policies enforced, see [Connect Spark to Iceberg tables](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

### Connect by using a programmatic access token (PAT)

Use one of the following configurations to connect:

* If you *don’t use* data protection policies with the Iceberg tables that you want to access, use the configuration Connect an external query engine without enforcing data policies.
* If you *use* data protection policies with the Iceberg tables that you want to access, use the configuration Connect an external query engine with data policies enforced.

#### Connect an external query engine without enforcing data policies

* To connect the external query engine to Iceberg tables by using a programmatic access token (PAT), use the following example code.

This code doesn’t enforce data protection policies:

```python
# Snowflake Horizon Catalog Configuration, change as per your environment

CATALOG_URI = "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog"
HORIZON_SESSION_ROLE = f"session:role:<role>"
CATALOG_NAME = "<database_name>" #provide in UPPER CASE

# Cloud Service Provider Region Configuration (where the Iceberg data is stored)
REGION = "eastus2"

# Paste the PAT you generated in Snowflake here
PAT_TOKEN = "<your_PAT_token>"

# Iceberg Version
ICEBERG_VERSION = "1.9.1"

def create_spark_session():
  """Create and configure Spark session for Snowflake Iceberg access."""
  spark = (
      SparkSession.builder
      .appName("SnowflakeIcebergReader")
      .master("local[*]")

# JAR Dependencies for Iceberg and Azure
      .config(
          "spark.jars.packages",
          f"org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{ICEBERG_VERSION},"
          f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION}"
          # for Azure storage, use the below package and comment above azure bundle
          # f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"
      )

      # Iceberg SQL Extensions
      .config("spark.sql.extensions", "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
      .config("spark.sql.defaultCatalog", CATALOG_NAME)

      # Horizon REST Catalog Configuration
      .config(f"spark.sql.catalog.{CATALOG_NAME}", "org.apache.iceberg.spark.SparkCatalog")
      .config(f"spark.sql.catalog.{CATALOG_NAME}.type", "rest")
      .config(f"spark.sql.catalog.{CATALOG_NAME}.uri", CATALOG_URI)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.warehouse", CATALOG_NAME)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.credential", PAT_TOKEN)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.scope", HORIZON_SESSION_ROLE)
      .config(f"spark.sql.catalog.{CATALOG_NAME}.client.region", REGION)

      # Required for vended credentials
      .config(f"spark.sql.catalog.{CATALOG_NAME}.header.X-Iceberg-Access-Delegation", "vended-credentials")
      .config("spark.sql.iceberg.vectorization.enabled", "false")
      .getOrCreate()
  )
  spark.sparkContext.setLogLevel("ERROR")
  return spark
```

Where:

* `<account_identifier>` is your Snowflake account identifier for the Snowflake account that contains the Iceberg tables that you want
  to access. To find this identifier, see Supported external engines and catalogs.
* `<your_PAT_token>` is your PAT that you obtained. To obtain it, see Step 3: Obtain an access token for authentication.
* `<role>` is the role in Snowflake that is configured with access to the Iceberg tables that you want to access. For example:
  DATA_ENGINEER.
* `<database_name>` is the name of the database in your Snowflake account that contains Snowflake-managed Iceberg tables that you
  want to access.

  > **Note:**
  >
  > The `.warehouse` property in Spark expects your Snowflake *database* name, not your Snowflake warehouse name.

> **Important:**
>
> By default, the code example is set up for Apache Iceberg™ tables stored on Amazon S3. If your Iceberg tables are stored on Azure Storage (ADLS),
> perform the following steps:
>
> > 1. Comment out the following line: `f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION}"`
> > 2. Uncomment the following line: `# f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"`

#### Connect an external query engine with data policies enforced

* To connect with data protection policies enforced, see [Connect Spark to Iceberg tables](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Step 7: Access Iceberg tables

This section includes code examples for using Apache Spark™ to query and write to Iceberg tables.

### Query Iceberg tables

This section provides the following code examples for using Apache Spark™ to query Iceberg tables:

* Show namespaces
* Use namespaces
* Show tables
* Query a table

#### Show namespaces

```python
spark.sql("show namespaces").show()
```

#### Use namespace

```python
spark.sql("use namespace <your_schema_name_in_snowflake>")
```

#### Show tables

```python
spark.sql("show tables").show()
```

#### Query a table

```python
spark.sql("use namespace spark_demo")
spark.sql("select * from <your_table_name_in_snowflake>").show()
```

### Write to Iceberg tables

This section provides the following code examples for using Apache Spark™ to write to Iceberg tables:

* CREATE TABLE
* INSERT INTO <table>
* ALTER TABLE … ADD COLUMN
* UPDATE TABLE … WHERE
* DELETE TABLE … WHERE
* TRUNCATE TABLE
* RENAME TABLE
* DROP TABLE

#### CREATE TABLE

```python
spark.sql("CREATE TABLE MY_TABLE (COLUMN1 INT) USING ICEBERG").show();
```

#### INSERT INTO <table>

```python
spark.sql("INSERT INTO MY_TABLE VALUES (600)").show()
```

#### ALTER TABLE … ADD COLUMN

```python
spark.sql("ALTER TABLE MY_TABLE ADD COLUMN COLUMN2 INT").show()
```

#### UPDATE TABLE … WHERE

```python
spark.sql("UPDATE MY_TABLE SET COLUMN2 = 10 WHERE COLUMN1 = 100").show()
```

#### DELETE TABLE … WHERE

```python
spark.sql("DELETE FROM MY_TABLE WHERE COLUMN2 = 10").show()
```

#### TRUNCATE TABLE

```python
spark.sql("TRUNCATE TABLE MY_TABLE").show()
```

#### RENAME TABLE

```python
spark.sql("ALTER TABLE MY_TABLE RENAME TO MY_NEW_TABLE")
```

#### DROP TABLE

```python
spark.sql("DROP TABLE MY_TABLE")
```

## Considerations for accessing Iceberg tables with an external query engine

This section lists the considerations for accessing, querying, and writing to Iceberg tables with an external query engine.

Consider the following items when you access Iceberg tables with an external query engine:

* Iceberg

  + For tables in Snowflake:

    - Only Snowflake-managed Iceberg tables are supported.
* Listings:

  + Iceberg tables that you share through [auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md) aren’t
    accessible through the consumer account’s Horizon Iceberg REST Catalog API.
* Network and private connectivity:

  + Using network policies that are set at the user level isn’t supported with this feature.
  + For [Snowflake-managed network rules](network-rules.md), egress IP addresses that are static aren’t supported.
  + Explicitly granting the Horizon Catalog endpoint access to your storage accounts isn’t supported. We recommend that you use private connectivity for
    secure connectivity from external engines to Horizon Catalog and from Horizon Catalog to your storage account.
* Clouds:

  + Commercial: This feature is only supported for Snowflake-managed Iceberg tables that are stored on Amazon S3, Google Cloud, or Microsoft Azure for
    all commercial cloud regions. S3-compatible non-AWS storage isn’t yet supported.
  + FedRAMP (Moderate): This feature is supported for Snowflake-managed Iceberg tables that are stored on FedRAMP (Moderate) deployments
    on AWS Commercial Gov (US) in the us-east-1 and us-west-2 regions.
  + For Iceberg tables stored on Amazon S3:

    - If you want to use SSE-KMS encryption, contact customer support or your account team for assistance with enabling access.

      > **Note:**
      >
      > Writing to KMS-encrypted external volumes is not supported.
  + For Iceberg tables stored on Azure:

    - Azure Virtual Network (VNet) isn’t supported.
* Authentication:

  + For key-pair authentication, key-pair rotation isn’t supported.
  + Workload identity federation isn’t supported with this feature.

Consider the following items when you query (read) Iceberg tables with an external query engine:

* Iceberg

  + Querying the following tables isn’t supported:

    - Remote tables
    - Snowflake native tables
    - Externally managed Iceberg tables including Delta-based Iceberg tables and
      Snowflake-managed Iceberg tables that you loaded with data from Iceberg-compatible Parquet data files by using the COPY INTO table command
  + Reading Iceberg v2 tables is supported.
  + Reading Iceberg V3 tables (public preview) is supported for the following capabilities:

    - Variant data type
    - Row lineage

    All other Iceberg V3 capabilities, including default values and the geography data type, aren’t supported.
* Access control:

  + Tables protected by the following fine-grained data policies can be accessed over Apache Spark™ through Snowflake Horizon Catalog:

    - Masking policies
    - Tag-based masking policies
    - Row access policies

    For more information, see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).
* Cloned and converted tables:

  + Reading cloned or converted tables is not supported with vended credentials. To read these tables, use direct access to object
    storage.

Consider the following items when you write to Iceberg tables with an external query engine:

* Table operations:

  + You can’t specify a base location with your CREATE TABLE statement.

    When you create a Snowflake-managed table without specifying a base location, Snowflake constructs the following path for your table:
    `STORAGE_BASE_URL/database/schema/table_name.randomId/[data | metadata]/`
  + CREATE TABLE AS SELECT (CTAS) from an external engine is not supported.
  + Equality deletes aren’t supported.
  + You can’t write to tables by using row-level deletes; only copy-on-write mode is supported.
  + Creating Iceberg tags and branches isn’t supported.
  + The external engine writes are supported only on Iceberg version 2; writing to Iceberg version 3 (v3)
    tables (public preview) is not currently supported.
  + Writing to KMS-encrypted external volumes is not supported.
  + Writing to dynamic tables in Snowflake isn’t supported.
  + Writing to shared Iceberg tables isn’t supported.
  + Registering Iceberg tables isn’t supported.
* Maintenance operations

  + You can’t roll back a table to a previous snapshot.
  + The snapshot expiration operation isn’t supported.
  + You can’t upgrade an Iceberg table from v2 to v3.
* Cloned and converted tables:

  + Writing to cloned or converted tables is not supported with vended credentials. To write to these tables, connect your external query
    engine directly to the object storage where your tables are stored.
  + You can’t write to an Iceberg table that was converted from externally managed to Snowflake managed.
* Streams:

  + On Iceberg V2 tables, copy-on-write operations cause standard streams to represent an updated or relocated row as a DELETE record followed
    by an INSERT record for the same row.
* Fine-grained access control policies:

  + Writing to tables that have fine-grained access control policies or tags isn’t supported.

---
title: Access billing invoices
source: https://docs.snowflake.com/en/user-guide/billing-invoices.md
section: User Guide
---

# Access billing invoices

Snowflake generates billing invoices for customers. An invoice lists the amount owed to Snowflake by an On Demand customer.

To access invoices in Snowsight, either of the following must be true:

* You have been granted the GLOBALORGADMIN role, and you are in the [organization account](organization-accounts.md).
* You have been granted the ACCOUNTADMIN and ORGADMIN roles, and you are in an account that has the ORGADMIN role enabled.

To access a billing invoice:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. On the Snowflake billing tab, select Invoices.
4. View or download the billing invoice.

---
title: Access control
source: https://docs.snowflake.com/en/user-guide/opencatalog/access-control.md
section: User Guide
---

# Access control

This section provides information about how access control works for Snowflake Open Catalog.

Open Catalog uses a role-based access control (RBAC) model in which the Open Catalog administrator assigns access privileges to catalog roles
and then grants access to resources to service principals by assigning catalog roles to principal roles.

These are the key concepts to understanding access control in Open Catalog:

* **Securable object**
* **Principal role**
* **Custom role**
* **Catalog role**
* **Privilege**

## Securable object

A securable object is an object to which access can be granted. Open Catalog
has the following securable objects:

* Catalog
* Namespace
* Iceberg table
* View

The RBAC model for Open Catalog is fine-grained; that is, you can grant privileges on the entire catalog or on a namespace, table, or view
within the catalog. If you’re granting privileges on a namespace, you can also secure the tables grouped under it and any
child namespaces or tables nested under it.

## Principal role

A principal role is a resource in Open Catalog that you can use to logically group Open Catalog service principals together and grant privileges on
securable objects.

Open Catalog supports a many-to-one relationship between service principals and principal roles. For example, to grant the same privileges to
multiple service principals, you can grant a single principal role to those service principals. A service principal can be granted only one
principal role. When registering a service connection, the Open Catalog administrator specifies the principal role that is granted to the
service principal.

You don’t grant privileges directly to a principal role. Instead, you configure object permissions at the catalog role level, and then grant
catalog roles to a principal role.

The following table shows examples of principal roles that you might configure in Open Catalog:

| Principal role name | Description |
| --- | --- |
| Data_engineer | A role that is granted to multiple service principals for running data engineering jobs. |
| Data_scientist | A role that is granted to multiple service principals for running data science or AI jobs. |

## Custom role

A custom role is similar to a principal role but with a few differences. A custom role is also a resource in Open Catalog. You can
use a custom role for the following purposes:

* Logically group Open Catalog service principals together that you create through the Snowflake CLI. Open Catalog supports a many-to-one
  relationship between these service principals and custom roles. You use these service principals to
  [connect to Open Catalog with External OAuth](external-oauth-overview.md).
* Logically group Open Catalog users together that you create through the Snowflake CLI for key pair authentication. Open Catalog
  supports a many-to-one relationship between these users and custom roles. You use these service principals to
  [connect to Open Catalog with key pair authentication](key-pair-auth-overview.md).

To create a custom role, you must use the Snowflake CLI.

As with principal roles, you don’t grant privileges directly to a custom role. Instead, you configure object permissions at the catalog role level, and
then grant catalog roles to a custom role.

You can’t grant a custom role to a service principal that is created through the Open Catalog UI. In other words, when you
[configure a service connection](configure-service-connection.md), you can’t grant a custom role to the service principal for the service
connection. However, when you grant a catalog role to a custom role, the custom role appears in the **ASSIGNED TO PRINCIPAL ROLES** column
in the Open Catalog UI.

## Catalog role

A catalog role belongs to a particular catalog resource in Open Catalog and specifies a set of permissions for actions on the catalog or objects
in the catalog, such as catalog namespaces or tables. You can create one or more catalog roles for a catalog.

You grant privileges to a catalog role and then grant the catalog role to a principal role to bestow the privileges to one or more service
principals.

**Note**

> If you update the privileges bestowed to a service principal, the updates won’t take effect for up to one hour. This means that if you
> revoke or grant some privileges for a catalog, the updated privileges won’t take effect on any service principal with access to that catalog
> for up to one hour.

Open Catalog also supports a many-to-many relationship between catalog roles and principal roles. You can grant the same catalog role to one or more
principal roles. Likewise, a principal role can be granted one or more catalog roles.

The following table displays examples of catalog roles that you might
configure in Open Catalog:

| Example Catalog role | Description |
| --- | --- |
| Catalog administrators | A role that has been granted multiple privileges to emulate full access to the catalog.  Principal roles that have been granted this role are permitted to create, alter, read, write, and drop tables in the catalog. |
| Catalog readers | A role that has been granted read-only privileges to tables in the catalog.  Principal roles that have been granted this role are allowed to read from tables in the catalog. |
| Catalog contributor | A role that has been granted read and write access privileges to all tables that belong to the catalog.  Principal roles that have been granted this role are allowed to perform read and write operations on tables in the catalog. |

## User roles

A user is someone who signs in to the Open Catalog web interface to manage the Open Catalog account. However, before a user can manage the
account, they must be granted at least one user role. The user roles available to grant to a user are service admin and catalog admin.

### Service admin role

The service admin role allows a user to manage the entire account, with a few exceptions.

#### Allowed permissions

A user with service admin privileges is a service admin. A service admin can perform the following tasks in the account:

* Create service connections.
* Use network policies.
* Manage users: Create and drop users, grant roles to users, and revoke roles from users.
* Create catalogs and manage the catalogs that they create.

When a service admin creates a catalog, they are also automatically granted the catalog admin role on the catalog. The service admin can do
the following to a catalog they that have catalog admin privileges on:

* Grant to another user the catalog admin role to the catalog. As a result, both users can access the catalog.
* Revoke their own catalog admin privileges to the catalog, thereby losing access to the catalog. However, before revoking their catalog
  admin privileges, they must first grant to another user catalog admin privileges to that catalog.

#### Disallowed permissions

A user with service admin privileges can’t perform the following tasks in the account:

* Access or manage a catalog that they didn’t create.
* Grant catalog admin privileges to a catalog that the service admin didn’t create.

### Catalog admin role

The catalog admin role allows a user to manage a catalog in the account. The catalog admin role must be granted to each individual catalog.

#### Allowed permissions

The catalog admin role grants to a user the permissions to access and manage a catalog, as follows:

* Access the catalog.
* Create namespaces in the catalog.
* Create a catalog role in the catalog and grant privileges to it.
* Grant a catalog role to a principal role, which bestows the service principal with the privileges that are granted to the catalog role.

#### Disallowed permissions

A user with only catalog admin privileges can’t perform the following tasks in the account:

* Create a service connection.
* Access or manage catalogs for which they haven’t been granted the catalog admin role.
* Create or remove a catalog.
* Use network policies.
* Manage users.

## RBAC model

The following diagram illustrates the RBAC model used by Open Catalog. For each catalog, the Open Catalog catalog admin assigns access
privileges to catalog roles and then grants service principals access to resources by assigning catalog roles to principal roles. Open Catalog
supports a many-to-one relationship between service principals and principal roles.

> **Note:**
>
> * For [External OAuth](external-oauth-overview.md) and [key pair authentication](key-pair-auth-overview.md), a custom role is used instead of a **principal role**.
> * For key pair authentication, a key pair authentication user is used instead of a **service principal**. This type of user connects to Open Catalog
>   programmatically through an access token. It’s not a human user that signs in to the Open Catalog UI by using a username and password.

You can also grant access control privileges at the table or namespace level. For example, the following diagram depicts granting privileges
on a namespace and table within the catalog.

## Access control privileges

This section describes the privileges that are available in the Open Catalog access control model. Privileges are granted to catalog roles, catalog roles are granted to principal roles, and principal roles are granted to service principals to specify the operations that service principals can perform on objects in Open Catalog.

To grant the full set of privileges (drop, list, read, write, etc.) on an object, you can use the *full privilege* option.

For more information, see:

* Catalog privileges
* Namespace privileges
* Table privileges

> **Note:**
>
> For privilege sets for common catalog roles, such as Table Reader, see Privilege sets for common catalog roles.

### Catalog privileges

| Privilege | Description |
| --- | --- |
| CATALOG_MANAGE_CONTENT | Enables full management of content for the catalog. This privilege encompasses the following privileges:  * CATALOG_MANAGE_METADATA * TABLE_FULL_METADATA * NAMESPACE_FULL_METADATA * VIEW_FULL_METADATA * TABLE_WRITE_DATA * TABLE_READ_DATA * CATALOG_READ_PROPERTIES * CATALOG_WRITE_PROPERTIES |
| CATALOG_MANAGE_METADATA | Enables full management of the catalog, catalog roles, namespaces, and tables. |
| CATALOG_READ_PROPERTIES | Enables listing catalogs and reading properties of the catalog. |
| CATALOG_WRITE_PROPERTIES | Enables configuring catalog properties. |
| NAMESPACE_CREATE | Enables creating a namespace in a catalog. |
| NAMESPACE_DROP | Enables dropping the namespace from the catalog. |
| NAMESPACE_FULL_METADATA | Grants all namespace privileges. |
| NAMESPACE_LIST | Enables listing any object in the namespace, including nested namespaces and tables. |
| NAMESPACE_READ_PROPERTIES | Enables reading all the namespace properties. |
| NAMESPACE_WRITE_PROPERTIES | Enables configuring namespace properties. |
| TABLE_CREATE | Enables registering a table with the catalog. |
| TABLE_DROP | Enables dropping a table from the catalog. |
| TABLE_FULL_METADATA | Grants all table privileges, except TABLE_READ_DATA and TABLE_WRITE_DATA, which need to be granted individually. |
| TABLE_LIST | Enables listing any tables in the catalog. |
| TABLE_READ_DATA | Enables reading data from the table by receiving short-lived read-only storage credentials from the catalog. |
| TABLE_READ_PROPERTIES | Enables reading [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) of the table. |
| TABLE_WRITE_DATA | Enables writing data to the table by receiving short-lived read+write storage credentials from the catalog. |
| TABLE_WRITE_PROPERTIES | Enables configuring [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) for the table. |
| VIEW_CREATE | Enables registering a view with the catalog. |
| VIEW_DROP | Enables dropping a view from the catalog. |
| VIEW_FULL_METADATA | Grants all view privileges. |
| VIEW_LIST | Enables listing any views in the catalog. |
| VIEW_READ_PROPERTIES | Enables reading all the view properties. |

### Namespace privileges

| Privilege | Description |
| --- | --- |
| CATALOG_MANAGE_CONTENT | Enables full management of content for the namespace, any tables grouped under it, and any namespaces and tables nested under the namespace, if applicable. The privilege is not granted to the entire catalog. This privilege encompasses the following privileges:  * CATALOG_MANAGE_METADATA * TABLE_FULL_METADATA * NAMESPACE_FULL_METADATA * VIEW_FULL_METADATA * TABLE_WRITE_DATA * TABLE_READ_DATA * CATALOG_READ_PROPERTIES * CATALOG_WRITE_PROPERTIES |
| CATALOG_MANAGE_METADATA | Enables full management of the namespace, catalog roles, and any tables grouped under it or any child namespaces or tables nested under it. |
| NAMESPACE_CREATE | Enables creating a child namespace off the namespace. |
| NAMESPACE_DROP | Enables dropping the namespace from the catalog. |
| NAMESPACE_FULL_METADATA | Grants all namespace privileges on the namespace. |
| NAMESPACE_LIST | Enables listing any object in the namespace, including nested namespaces and tables. |
| NAMESPACE_READ_PROPERTIES | Enables reading all the namespace properties. |
| NAMESPACE_WRITE_PROPERTIES | Enables configuring namespace properties. |
| TABLE_CREATE | Enables registering a table with the namespace. |
| TABLE_DROP | Enables dropping a table from the namespace. |
| TABLE_FULL_METADATA | Grants all table privileges for tables grouped on the namespace, except TABLE_READ_DATA and TABLE_WRITE_DATA, which need to be granted individually. |
| TABLE_LIST | Enables listing any tables in the namespace. |
| TABLE_READ_DATA | Enables reading data from any table grouped under the namespace by receiving short-lived read-only storage credentials from the catalog. |
| TABLE_READ_PROPERTIES | Enables reading [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) of any table grouped under the namespace. |
| TABLE_WRITE_DATA | Enables writing data to any table grouped on the namespace by receiving short-lived read+write storage credentials from the catalog. |
| TABLE_WRITE_PROPERTIES | Enables configuring [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) for any table grouped under the namespace. |
| VIEW_CREATE | Enables registering a view with the namespace. |
| VIEW_DROP | Enables dropping a view from the namespace. |
| VIEW_FULL_METADATA | Grants all view privileges for all views in the namespace. |
| VIEW_LIST | Enables listing any views in the namespace. |
| VIEW_READ_PROPERTIES | Enables reading all the view properties for all view in the namespace. |
| VIEW_WRITE_PROPERTIES | Enables configuring view properties for any view in the namespace. |

### Table privileges

| Privilege | Description |
| --- | --- |
| TABLE_DROP | Enables dropping the table from the catalog. |
| TABLE_FULL_METADATA | Grants all table privileges, except TABLE_READ_DATA and TABLE_WRITE_DATA, which need to be granted individually. |
| TABLE_LIST | Enables listing any tables in the catalog. |
| TABLE_READ_DATA | Enables reading data from the table by receiving short-lived read-only storage credentials from the catalog. |
| TABLE_READ_PROPERTIES | Enables reading [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) of the table. |
| TABLE_WRITE_DATA | Enables writing data to the table by receiving short-lived read+write storage credentials from the catalog. |
| TABLE_WRITE_PROPERTIES | Enables configuring [properties](https://iceberg.apache.org/docs/nightly/configuration/#table-properties) for the table. |
| VIEW_READ_PROPERTIES | Enables reading all the view properties. |

### View privileges

| Privilege | Description |
| --- | --- |
| VIEW_CREATE | Enables registering a view with the catalog. |
| VIEW_DROP | Enables dropping a view from the catalog. |
| VIEW_LIST | Enables listing any views in the catalog. |
| VIEW_READ_PROPERTIES | Enables reading all the view properties. |
| VIEW_WRITE_PROPERTIES | Enables configuring view properties. |
| VIEW_FULL_METADATA | Grants all view privileges. |

### Privilege sets for common catalog roles

| Catalog role | Description | Privileges |
| --- | --- | --- |
| table_reader | Read the table information and query the tables. | * TABLE_LIST * TABLE_READ_PROPERTIES * TABLE_READ_DATA * TABLE_FULL_METADATA |
| catalog_writer | Create and drop tables. | * TABLE_CREATE * TABLE_DROP * TABLE_LIST * TABLE_READ_PROPERTIES * TABLE_WRITER_PROPERTIES * TABLE_READ_METADATA * TABLE_WRITE_METADATA * TABLE_FULL_METADATA |
| catalog_metadata_reader | Read catalog metadata. | * NAMESPACE_LIST * NAMESPACE_READ_PROPERTIES * NAMESPACE_FULL_METADATA * CATALOG_READ_PROPERTIES |

## RBAC example

The following diagram illustrates how RBAC works in Open Catalog at the catalog level and includes the following users:

* **Alice:** A service admin who signs up for Open Catalog. Alice can create service principals. She can also create catalogs and namespaces and
  configure access control for Open Catalog resources.

  > **Note**

  > The service principal for Alice is not visible in the Open Catalog user interface.
* **Bob:** A data engineer who uses Snowpipe Streaming (in Snowflake) and Apache Spark™ connections to interact with Open Catalog.

  + Alice has created a service principal for Bob. It has been granted the Data_engineer principal role, which in turn has been granted
    the following catalog roles: Catalog contributor and Data administrator (for both the Silver and Gold zone catalogs in the following
    diagram).

    - The Catalog contributor role grants permission to create namespaces and tables in the Bronze zone catalog.
    - The Data administrator roles grant full administrative rights to the Silver zone catalog and Gold zone catalog.
* **Mark:** A data scientist who uses Snowflake AI services to interact with Open Catalog.

  + Alice has created a service principal for Mark. It has been granted the Data_scientist principal role, which in turn has been granted
    the catalog role named Catalog reader.

    - The Catalog reader role grants read-only access for a catalog named Gold zone catalog.

---
title: Access control best practices
source: https://docs.snowflake.com/en/user-guide/security-access-control-considerations.md
section: User Guide
---

# Access control best practices

This topic describes best practices and important considerations for managing secure access to your Snowflake account and data stored within
the account. Primarily, it provides general guidance for configuring role-based access control (RBAC), which limits access to objects based
on a user’s role. For specific considerations about user-based access control (UBAC), see Comparing and contrasting RBAC with UBAC.

## Using the ACCOUNTADMIN Role

The account administrator (users with the ACCOUNTADMIN system role) role is the most powerful role in the system. This role alone is
responsible for configuring parameters at the account level. Users with the ACCOUNTADMIN role can view and manage Snowflake billing
and credit data, and can stop any running SQL statements.

Note that ACCOUNTADMIN is not a superuser role. This role only allows viewing and managing objects in the account if this role, or a
role lower in a [role hierarchy](security-access-control-overview.md), has sufficient privileges on the objects.

In the system role hierarchy, the other administrator roles are children of this role:

* The user administrator (USERADMIN) role includes the privileges to create and manage users and roles (assuming ownership of those roles or
  users has not been transferred to another role).
* The security administrator (SECURITYADMIN system-defined) role includes the global MANAGE GRANTS privilege to grant or revoke privileges
  on objects in the account. The USERADMIN role is a child of this role in the default access control hierarchy. For more information about
  the children system-defined roles, see [System-defined roles](security-access-control-overview.md).
* The system administrator (SYSADMIN) role includes the privileges to create warehouses, databases, and all database objects (schemas,tables,
  and so on).

> **Attention:**
>
> By default, when your account is provisioned, the first user is assigned the ACCOUNTADMIN role. This user should then create one or more
> additional users who are assigned the USERADMIN role. All remaining users should be created by the user(s) with the USERADMIN role or
> another role that is granted the global CREATE USER privilege.

### Control the assignment of the ACCOUNTADMIN role to users

Snowflake strongly recommends the following precautions when assigning the ACCOUNTADMIN role to users:

* Assign this role only to a select/limited number of people in your organization.
* All users assigned the ACCOUNTADMIN role should also be required to use multi-factor authentication (MFA) for login (for details, see
  [Configuring access control](security-access-control-configure.md)).
* Assign this role to at least two users. We follow strict security procedures for resetting a forgotten or lost password for users with the
  ACCOUNTADMIN role. These procedures can take up to two business days. Assigning the ACCOUNTADMIN role to more than one user avoids having
  to go through these procedures because the users can reset each other’s passwords.

> **Tip:**
>
> Assigning email addresses for current employees to ACCOUNTADMIN users helps Snowflake Support know who to contact in an urgent situation.

### Avoid using the ACCOUNTADMIN role to create objects

The ACCOUNTADMIN role is intended for performing initial setup tasks in the system and managing account-level objects and tasks on a
day-to-day basis. As such, it should not be used to create objects in your account, unless you absolutely need these objects to have the
highest level of secure access. If you create objects with the ACCOUNTADMIN role and you want users to have access to these objects, you
must explicitly grant privileges on the objects to the roles for these users.

Instead, Snowflake recommends creating a hierarchy of roles aligned with business functions in your organization and ultimately assigning
these roles to the SYSADMIN role. For more information, see Aligning Object Access with Business Functions in this topic.

> **Tip:**
>
> To help prevent account administrators from inadvertently using the ACCOUNTADMIN role to create objects, assign these users additional
> roles and designate one of these roles as their default (do not make ACCOUNTADMIN the default role for any users in the system).
>
> This doesn’t prevent users from using the ACCOUNTADMIN role to create objects, but it forces them to explicitly change their role to
> ACCOUNTADMIN each time they log in. This can help raise awareness of the purpose/function of roles in the system and encourage users to
> change to the appropriate role for performing a given task, particularly account administrator tasks.

### Avoid using the ACCOUNTADMIN Role for automated scripts

Snowflake recommends using a role other than ACCOUNTADMIN for automated scripts. If, as recommended, you create a role hierarchy under the
SYSADMIN role, all warehouse and database object operations can be performed using the SYSADMIN role or lower roles in the hierarchy. The
only limitations you would encounter is creating or modifying users or roles. These operations must be performed by a user with the
SECURITYADMIN role or another role with sufficient object privileges.

## Accessing database objects

All securable database objects (such as TABLE, FUNCTION, FILE FORMAT, STAGE, SEQUENCE, etc.) are contained within a SCHEMA object within a
DATABASE. As a result, to access database objects, in addition to the privileges on the specific database objects, users must be granted the
USAGE privilege on the container database and schema.

For example, suppose `mytable` is created in `mydb.myschema`. In order to query `mytable`, a user must have the following
privileges at a minimum:

Database:
:   USAGE on `mydb`

Schema:
:   USAGE on `myschema`

Table:
:   SELECT on `mytable`

## Managing custom roles

When a custom role is first created, it exists in isolation. The role must be assigned to any users who will use the object privileges
associated with the role. The custom role must also be granted to any roles that will manage the objects created by the custom role.

> **Important:**
>
> By default, not even the ACCOUNTADMIN role can modify or drop objects created by a custom role. The custom role must be granted to the
> ACCOUNTADMIN role directly or, preferably, to another role in a hierarchy with the SYSADMIN role as the parent. The SYSADMIN role is
> managed by the ACCOUNTADMIN role.

For instructions to create a role hierarchy, see [Creating a role hierarchy](security-access-control-configure.md).

## Aligning object access with business functions

Consider taking advantage of role hierarchies to align access to database objects with business functions in your organization. In a role
hierarchy, roles are granted to other roles to form an inheritance relationship. Permissions granted to roles at a lower level are inherited
by roles at a higher level.

For optimal flexibility in controlling access to database objects, create a combination of object *access roles* with different permissions
on objects and assign them as appropriate to *functional roles*:

* Grant permissions on database objects or account objects (such as warehouses) to access roles.
* Grant access roles to functional roles to create a role hierarchy. These roles correspond to the business functions of your organization
  and serve as a catch-all for any access roles required for these functions.

  When appropriate, grant lower-level functional roles to higher-level functional roles in a parent-child relationship where the parent
  roles map to business functions that should subsume the permissions of the child roles.

  Following best practices for role hierarchies, grant the highest-level functional roles in a role hierarchy to the system administrator
  (SYSADMIN) role. System administrators can then grant privileges on database objects to any roles in this hierarchy:

> **Note:**
>
> There is no technical difference between an object access role and a functional role in Snowflake. The difference is in how they are used
> logically to assemble and assign sets of permissions to groups of users.

### Example

As a simple example, suppose two databases in an account, `fin` and `hr`, contain payroll and employee data, respectively. Accountants
and analysts in your organization require different permissions on the objects in these databases to perform their business functions.
Accountants should have read-write access to `fin` but might only require read-only access to `hr` because human resources personnel
maintain the data in this database. Analysts could require read-only access to both databases.

Permissions on existing database objects are granted via the following hierarchy of access roles and functional roles:

> **Note:**
>
> When new objects are added in each database, consider automatically granting privileges on the objects to roles based on object type
> (for example schemas, tables, or views). For information, see Simplifying Grant Management Using Future Grants (in this topic).

| Custom Role | Description | Privileges |
| --- | --- | --- |
| `db_hr_r` | Access role that permits read-only access to tables in the `hr` database. | USAGE on database `hr`.  USAGE on all schemas in database `hr`.  SELECT on all tables in database `hr`. |
| `db_fin_r` | Access role that permits read-only access to tables in the `fin` database. | USAGE on database `fin`.  USAGE on all schemas in database `fin`.  SELECT on all tables in database `fin`. |
| `db_fin_rw` | Access role that permits read-write access to tables in the `fin` database. | USAGE on database `fin`.  USAGE on all schemas in database `fin`.  SELECT, INSERT, UPDATE, DELETE on all tables in database `fin`. |
| `accountant` | Functional role for accountants in your organization. | N/A |
| `analyst` | Functional role for analysts in your organization. | N/A |

The following diagram shows the role hierarchy for this example:

To configure access control for this example:

1. As a user administrator (user with the USERADMIN role) or another role with the CREATE ROLE privilege on the account, create the access
   roles and functional roles in this example:

   ```sqlexample
   CREATE ROLE db_hr_r;
   CREATE ROLE db_fin_r;
   CREATE ROLE db_fin_rw;
   CREATE ROLE accountant;
   CREATE ROLE analyst;
   ```
2. As a security administrator (user with the SECURITYADMIN role) or another role with the MANAGE GRANTS privilege on the account, grant the
   required minimum permissions to each of the access roles:

   ```sqlexample
   -- Grant read-only permissions on database HR to db_hr_r role.
   GRANT USAGE ON DATABASE hr TO ROLE db_hr_r;
   GRANT USAGE ON ALL SCHEMAS IN DATABASE hr TO ROLE db_hr_r;
   GRANT SELECT ON ALL TABLES IN DATABASE hr TO ROLE db_hr_r;

   -- Grant read-only permissions on database FIN to db_fin_r role.
   GRANT USAGE ON DATABASE fin TO ROLE db_fin_r;
   GRANT USAGE ON ALL SCHEMAS IN DATABASE fin TO ROLE db_fin_r;
   GRANT SELECT ON ALL TABLES IN DATABASE fin TO ROLE db_fin_r;

   -- Grant read-write permissions on database FIN to db_fin_rw role.
   GRANT USAGE ON DATABASE fin TO ROLE db_fin_rw;
   GRANT USAGE ON ALL SCHEMAS IN DATABASE fin TO ROLE db_fin_rw;
   GRANT SELECT,INSERT,UPDATE,DELETE ON ALL TABLES IN DATABASE fin TO ROLE db_fin_rw;
   ```
3. As a security administrator (user with the SECURITYADMIN role) or another role with the MANAGE GRANTS privilege on the account, grant the
   `db_fin_rw` access role to the `accountant` functional role, and grant the `db_hr_r` `db_fin_r` access roles to the `analyst`
   functional role:

   ```sqlexample
   GRANT ROLE db_fin_rw TO ROLE accountant;
   GRANT ROLE db_hr_r TO ROLE analyst;
   GRANT ROLE db_fin_r TO ROLE analyst;
   ```
4. As a security administrator (user with the SECURITYADMIN role) or another role with the MANAGE GRANTS privilege on the account, grant
   both the `analyst` and `accountant` roles to the system administrator (SYSADMIN) role:

   ```sqlexample
   GRANT ROLE accountant,analyst TO ROLE sysadmin;
   ```
5. As a security administrator (user with the SECURITYADMIN role) or another role with the MANAGE GRANTS privilege on the account, grant the
   business functional roles to the users who perform those business functions in your organization. In this example, the `analyst`
   functional role is granted to user `user1`, and the `accountant` functional role is granted to user `user2`.

   ```sqlexample
   GRANT ROLE accountant TO USER user1;
   GRANT ROLE analyst TO USER user2;
   ```

## Managing database object access using database roles

Database roles are essentially the same as traditional [roles](security-access-control-overview.md) created at the account
level (custom *account roles*) except for their scope: To permit SQL actions on objects within a database, privileges can be granted
to a database role in the same database.

Database roles are intended to satisfy the following use cases:

Ease of management:
:   Database owners can independently manage access to securable objects within their own databases. Database owners can perform the
    following actions:

    * Create and manage database roles.
    * Grant privileges to database roles.

      Privileges on objects granted to the database roles must be scoped to objects contained in the database where the role exists.
      Privileges on objects in one database (such as tables or views) cannot be granted to database roles in another database.

      Any privilege, including OWNERSHIP, can be granted to database roles on objects in a database. Note that only an account role
      can hold the OWNERSHIP privilege on the database itself.
    * Create or extend [role hierarchies](security-access-control-overview.md). Grant database roles to other database
      roles within the same database, and then grant the highest-level database roles in a database to account roles. For more information,
      see [Role hierarchy and privilege inheritance](security-access-control-overview.md).

      Note that granting a database role to an account role implicitly grants the USAGE privilege on the database that contains the database
      role to that account role. Granting the USAGE privilege on the database explicitly is not required.

Data Sharing:
:   Data providers in Snowflake’s [Secure Data Sharing](data-sharing-intro.md) can segment the securable objects in a share
    by creating multiple database roles in a database to share and granting privileges on a subset of the objects in the database to each
    database role. After creating a database from a share that includes database roles, data consumers grant each shared database role to
    one or more account-level roles in their own account.

    Without database roles, account administrators in data consumer accounts grant a single privilege, IMPORTED PRIVILEGES, to roles to
    allow their users to access all databases and database objects (tables, secure views, etc.) in a share. There is no option to
    allow different groups of users in a data consumer account to access a subset of the shared objects. This all or nothing approach
    requires data providers to create multiple shares to grant access to different objects in the same databases.

Note that database roles cannot be [activated](security-access-control-overview.md) directly in a session. Grant database
roles to account roles, which can be activated in a session.

## Centralizing grant management using managed access schemas

With regular (non-managed) schemas in a database, object owners (roles with the OWNERSHIP privilege on one or more objects) can grant access
on those objects to other roles, with the option to further grant those roles the ability to manage object grants.

To further lock down object security, consider using managed access schemas. In a managed access schema, object owners lose the ability to
make grant decisions. Only the schema owner (the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege
can grant privileges on objects in the schema, including [future grants](security-access-control-configure.md), centralizing
privilege management.

Note that a role that holds the global MANAGE GRANTS privilege can grant additional privileges to the current (grantor) role.

For more information on managed access schemas, see [Creating managed access schemas](security-access-control-configure.md).

## Simplifying grant management using future grants

Future grants allow defining an initial set of privileges on objects of a certain type (for example tables or views) in a specified schema.
As new objects are created, the defined privileges are automatically granted to a role, simplifying grant management.

Consider the following scenario, in which a particular role is granted the SELECT privilege on all new tables created in schema. At a later
date, the decision is made to revoke the privilege from this role and instead grant it to a different role. Using the ON FUTURE keywords
for new tables and the ALL keyword for existing tables, few SQL statements are required to grant and revoke privileges on new and existing
tables. For example:

```sqlexample
-- Grant the SELECT privilege on all new (future) tables in a schema to role R1
GRANT SELECT ON FUTURE TABLES IN SCHEMA s1 TO ROLE r1;

-- / Create tables in the schema /

-- Grant the SELECT privilege on all new tables in a schema to role R2
GRANT SELECT ON FUTURE TABLES IN SCHEMA s1 TO ROLE r2;

-- Grant the SELECT privilege on all existing tables in a schema to role R2
GRANT SELECT ON ALL TABLES IN SCHEMA s1 TO ROLE r2;

-- Revoke the SELECT privilege on all new tables in a schema (future grant) from role R1
REVOKE SELECT ON FUTURE TABLES IN SCHEMA s1 FROM ROLE r1;

-- Revoke the SELECT privilege on all existing tables in a schema from role R1
REVOKE SELECT ON ALL TABLES IN SCHEMA s1 FROM ROLE r1;
```

For more information on future grants, see [Assigning future grants on objects](security-access-control-configure.md).

## Viewing query results

A user cannot view the result set from a query that another user executed. This behavior is intentional. For security reasons, only the user
who executed a query can access the query results.

> **Note:**
>
> This behavior is not connected to the Snowflake access control model for objects. Even a user with the ACCOUNTADMIN role cannot
> view the results for a query run by another user.

## Understanding cloned objects and granted privileges

Cloning a database, schema or table creates a copy of the source object. The cloned object includes a snapshot of data present in the source
object when the clone was created.

A cloned object is considered a new object in Snowflake. Any privileges granted on the source object do not transfer to the cloned object.
However, a cloned container object (a database or schema) retains any privileges granted on the objects contained in the source object. For
example, a cloned schema retains any privileges granted on the tables, views, UDFs, and other objects in the source
schema.

For more details about cloning, see [Cloning considerations](object-clone.md) and [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

## Comparing and contrasting RBAC with UBAC

Role-based access control (RBAC) is your foundation for access control in Snowflake. RBAC provides, by design, scalability and
centralized control. Using RBAC, you grant privileges to roles, and then assign users to those roles, simplifying administration, ensuring
consistency, and making audit access easier. RBAC is generally recommended for production environments and enterprise-level governance.
User-based access control (UBAC) is intended for use cases such as private development and collaboration.

You should consider using UBAC for collaborative scenarios, such as building Streamlit applications. During a collaborative development process, an asset owner may want to control access to the asset before sharing it with a wider audience. UBAC complements RBAC by providing flexibility to grant privileges directly to individual users. UBAC is particularly useful in scenarios that benefit from a more granular access control model.

UBAC does not provide object owners with new levels of privilege. If you currently trust object owners to manage access to their objects
using roles in RBAC, then using UBAC does not fundamentally change that level of trust. Object owners already possess the ability to grant
access to any role, including broadly accessible roles such as PUBLIC. UBAC allows object owners to grant access directly to specific
users. UBAC does not impact query performance.

## Avoiding grant proliferation when using UBAC

To prevent object owners from indiscriminately granting access to objects, use [managed access schemas](security-access-control-configure.md).
Managed access schemas remove the ability for object owners to grant access to other roles or users. Only schema owners or a role with
the MANAGE GRANTS privilege can grant privileges on objects in a managed access schema. Grant proliferation can occur while using either UBAC or RBAC.
Outside managed access schemas, object owners can grant access to any role in an account when using RBAC, just as they can grant privileges
to any user when using UBAC.

## Monitoring access control privileges in your account

You can monitor privileges granted to roles, users, and applications using the GRANTS_TO_ROLES view in ACCOUNT_USAGE. For more information,
see [GRANTS_TO_ROLES view](../sql-reference/account-usage/grants_to_roles.md).

You can also monitor access control privileges in your account in the following ways:

* Viewing direct grants to all users
* Showing direct grants to specific users
* Viewing the current set of privileges granted on an object
* Viewing the current permissions on a schema
* Viewing the privileges on a database schema
* Viewing the current set of privileges granted to a role or a user

For example, to view direct grants to all users, run the following query:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.GRANTS_TO_ROLES
  WHERE granted_to = 'USER';
```

For example, to show direct grants to specific users, use the [pipe operator](../sql-reference/operators-flow.md)
(`->>`) to filter the result of a SHOW GRANTS TO USER command to show only the privileges granted directly to the user,
not through roles:

```sqlexample
SHOW GRANTS TO USER <user_name>
  ->> SELECT * FROM $1 WHERE "role" IS NULL;
```

For example, to view the current set of privileges granted on an object, you can run the [SHOW GRANTS](../sql-reference/sql/show-grants.md) command.

> **Note:**
>
> Executing the SHOW GRANTS command on a specific object requires the same object privileges as running the SHOW command for that object
> type.
>
> For example, running the SHOW GRANTS command on a table requires the following privileges on the table and its database and schema:
>
> Database:
> :   USAGE
>
> Schema:
> :   USAGE
>
> Table:
> :   *any privilege*

For example, to view the current permissions on a schema, execute the following command:

```sqlexample
SHOW GRANTS ON SCHEMA <database_name>.<schema_name>;
```

For example, to view the privileges on `database_a.schema_1` that were granted in
[Creating custom roles](security-access-control-configure.md), execute the following command:

```sqlexample
SHOW GRANTS ON SCHEMA database_a.schema_1;
```

Snowflake returns the following results:

```output
+-------------------------------+-----------------------+------------+----------------------+------------+--------------------------+--------------+---------------+
| created_on                    | privilege             | granted_on | name                 | granted_to | grantee_name             | grant_option | granted_by    |
|-------------------------------+-----------------------+------------+----------------------+------------+--------------------------+--------------+---------------|
| 2022-03-07 09:04:23.635 -0800 | USAGE                 | SCHEMA     | database_a.schema_1  | ROLE       | R1                       | false        | SECURITYADMIN |
+-------------------------------+-----------------------+------------+----------------------+------------+--------------------------+--------------+---------------+
```

You can also run the SHOW GRANTS command to view the current set of privileges granted to:

* A role:

  ```sqlexample
  SHOW GRANTS TO ROLE <role_name>;
  ```

  For example, execute the following command to view the privileges granted on role `r1`, created as a custom role:

  ```sqlexample
  SHOW GRANTS TO ROLE r1;
  ```

  Snowflake returns the following results:

  ```output
  +-------------------------------+-----------+------------+----------------------+------------+--------------+--------------+---------------+
  | created_on                    | privilege | granted_on | name                 | granted_to | grantee_name | grant_option | granted_by    |
  |-------------------------------+-----------+------------+----------------------+------------+--------------+--------------+---------------|
  | 2022-03-07 09:08:43.773 -0800 | USAGE     | DATABASE   | D1                   | ROLE       | R1           | false        | SECURITYADMIN |
  | 2022-03-07 09:08:55.253 -0800 | USAGE     | SCHEMA     | D1.S1                | ROLE       | R1           | false        | SECURITYADMIN |
  | 2022-03-07 09:09:07.206 -0800 | SELECT    | TABLE      | D1.S1.T1             | ROLE       | R1           | false        | SECURITYADMIN |
  | 2022-03-07 09:08:34.838 -0800 | USAGE     | WAREHOUSE  | W1                   | ROLE       | R1           | false        | SECURITYADMIN |
  +-------------------------------+-----------+------------+----------------------+------------+--------------+--------------+---------------+
  ```
* A user:

  ```sqlexample
  SHOW GRANTS TO USER <user_name>;
  ```

  For example, execute the following command to view the privileges granted to user `user1`:

  ```sqlexample
  SHOW GRANTS TO USER user1;
  ```

  Snowflake returns the following results:

  ```output
  +-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+--------------+---------------+
  | created_on                    | privilege | granted_on | name                      |  role     | granted_to | grantee_name | grant_option | granted_by    |
  |-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+------------------------------|
  | 2025-05-07 09:08:43.773 -0800 | USAGE     | DATABASE   | test_db                   | null      | USER       | user1        | false        | SECURITYADMIN |
  | 2025-05-07 09:08:55.253 -0800 | USAGE     | SCHEMA     | test_db.test_sch          | null      | USER       | user1        | false        | SECURITYADMIN |
  | 2025-05-07 09:08:55.253 -0800 | SELECT    | TABLE      | test_db.test_sch.test_tbl | null      | USER       | user1        | false        | SECURITYADMIN |
  | 2025-05-07 09:08:34.838 -0800 | USAGE     | WAREHOUSE  | test_wh                   | null      | USER       | user1        | false        | SECURITYADMIN |
  +-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+--------------+---------------+
  ```

---
title: Access control for cost anomalies
source: https://docs.snowflake.com/en/user-guide/cost-anomalies-access-control.md
section: User Guide
---

# Access control for cost anomalies

A cost anomaly occurs when daily consumption is above or below the expected range of consumption for the day. The following sections
describe the access control requirements for viewing and configuring cost anomalies.

## Administrators with system roles

Administrators with the following system roles can perform all tasks related to identifying and investigating cost anomalies, both in
Snowsight and by using the ANOMALY_INSIGHTS class:

* ACCOUNTADMIN role in an ORGADMIN-enabled account or a regular account.
* GLOBALORGADMIN role in the organization account.

## Granting access to users

You can let users work with cost anomalies by granting application roles to them. The following application roles, which are within the
SNOWFLAKE application, let users work with cost anomalies.

| Application role | Description |
| --- | --- |
| APP_USAGE_VIEWER | Allows a user to view cost anomalies. |
| APP_USAGE_ADMIN | Allows a user to view cost anomalies and add email addresses where notifications are sent for [account-level cost anomalies](cost-anomalies.md). |
| ORGANIZATION_BILLING_VIEWER | When combined with the APP_USAGE_ADMIN or APP_USAGE_VIEWER role, allows a user in the organization account to see consumption with a currency as the unit of measure. Without this role, users see consumption in credits, not a currency.  Also required to add email addresses where notifications are sent for [organization-level cost anomalies](cost-anomalies.md). |
| APP_ORGANIZATION_BILLING_VIEWER | Provides the same access as ORGANIZATION_BILLING_VIEWER but in an ORGADMIN-enabled account instead of the organization account. |

The following sections provide more information about how you can use these application roles to provide access to cost anomalies.

### Grant the ability to view cost anomalies in a specific account

If you want users to be able to view account-level cost anomalies in a specific account, but not act as an administrator, grant them the
APP_USAGE_VIEWER application role.

For example, if you want user `joe` to be able to view cost anomalies for a specific account, sign in to the account, and then run the
following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE anomaly_viewer_role;
GRANT APPLICATION ROLE SNOWFLAKE.APP_USAGE_VIEWER TO ROLE anomaly_viewer_role;
GRANT ROLE anomaly_viewer_role TO USER joe;
```

### Grant the ability to view cost anomalies for all accounts

To allow a user to view account-level cost anomalies for all accounts in the organization and to view organization-level anomalies, grant
the APP_USAGE_VIEWER role and one of the following roles:

* If the user signs in to the organization account to view cost anomalies, also grant the ORGANIZATION_BILLING_VIEWER application role.
* If the user signs in to an ORGADMIN-enabled account to view cost anomalies, also grant the APP_ORGANIZATION_BILLING_VIEWER application
  role.

A user who is granted these roles can see consumption data with a currency as the unit of measure instead of credits.

For example, if the user `ralph` signs in to the organization account to view cost anomalies that are related to the entire organization,
run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE anomaly_viewer_role;
GRANT APPLICATION ROLE SNOWFLAKE.APP_USAGE_VIEWER TO ROLE anomaly_viewer_role;
GRANT APPLICATION ROLE SNOWFLAKE.ORGANIZATION_BILLING_VIEWER TO ROLE anomaly_viewer_role;
GRANT ROLE anomaly_viewer_role TO USER ralph;
```

### Grant the ability to configure cost anomalies in a specific account

If you want users to be able to view *and* configure account-level cost anomalies within a specific account, grant them the APP_USAGE_ADMIN
application role. A user with this role doesn’t need the APP_USAGE_VIEWER role to view the cost anomalies. Configuring cost anomalies
includes adding the email addresses where notifications are sent when there is an anomaly in the account.

For example, if you want user `judy` to be able to view and configure account-level cost anomalies for a specific account, sign in to the
account, and then run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE anomaly_admin_role;
GRANT APPLICATION ROLE SNOWFLAKE.APP_USAGE_ADMIN TO ROLE anomaly_admin_role;
GRANT ROLE anomaly_admin_role TO USER judy;
```

### Grant the ability to configure organization-level cost anomalies

To allow a user to configure organization-level cost anomalies, grant the APP_USAGE_ADMIN role and one of the following roles:

* If the user signs in to the organization account to configure and view cost anomalies, also grant the ORGANIZATION_BILLING_VIEWER
  application role.
* If the user signs in to an ORGADMIN-enabled account to configure and view cost anomalies, also grant the APP_ORGANIZATION_BILLING_VIEWER
  application role.

An administrator with one of these role combinations can perform the following tasks:

* Set and view the email addresses where notifications are sent for organization-level anomalies.
* View account-level cost anomalies in all accounts in the organization.
* View organization-level cost anomalies.
* View consumption data that uses a currency as the unit of measure.

For example, if the user `steven` signs in to the organization account to work with cost anomalies related to the entire organization,
run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE anomaly_admin_role;
GRANT APPLICATION ROLE SNOWFLAKE.APP_USAGE_ADMIN TO ROLE anomaly_admin_role;
GRANT APPLICATION ROLE SNOWFLAKE.ORGANIZATION_BILLING_VIEWER TO ROLE anomaly_admin_role;
GRANT ROLE anomaly_admin_role TO USER steven;
```

---
title: Access control for cost management
source: https://docs.snowflake.com/en/user-guide/cost-access-control.md
section: User Guide
---

# Access control for cost management

This topic describes system-defined roles (roles, application roles, and database roles) that control access to cost management
features. You can assign these roles to a user once to provide access to cost-related features.

A cost-related feature might have its own roles and privileges that provide access without granting access to other features. Refer to
the feature-specific documentation for a discussion of these privileges.

> **Note:**
>
> The access control privileges discussed in this topic do not provide access to resource monitors.

## Access to organization-level cost information

The type of account determines whether a user can view organization-level cost data. For example, the Organization overview tab is
only available from certain accounts.

You can view organization-level cost data from the following accounts:

* The [organization account](organization-accounts.md).
* A regular account with the [ORGADMIN role enabled](organization-administrators.md).

## Viewing a currency as the unit of measure

The unit of measure for cost information can be credits or a currency. You can view a currency as the unit of measure only in certain
circumstances. Being able to see a currency as the unit of measure also allows you to view related information like the remaining balance
of a contract.

To see a currency as the unit measure, you must:

* Access cost information from the organization account or a regular account with the ORGADMIN role enabled.
* Use one of the following roles:

  + ACCOUNTADMIN system-defined role
  + GLOBALORGADMIN system-defined role
  + ORGANIZATION_BILLING_VIEWER. This is a database role for some cost-related features but an application role for other features.

## Default access by administrators

By default, only users with system-defined administrator roles can use cost-related features.

* If you are signed in to the organization account, use the GLOBALORGADMIN role to view cost information.
* If you are signed in to a regular account, use the ACCOUNTADMIN role to view cost information.

  > **Tip:**
  >
  > Consider using an application role and database role to grant administrator rights to cost-related features, even if the user has the
  > ACCOUNTADMIN role. Some features might behave differently if a user with the ACCOUNTADMIN role does not also have the ORGADMIN role. For
  > information about granting administrator rights, see Granting access to other users.

## Granting access to other users

To simplify access control for cost management, Snowflake provides two levels of access:

* Users who can view cost information.
* Users who act as an administrator for cost-related features. These users can also view cost information.

There is an application role/database role combination that corresponds to each level of access. A user’s level of access is determined by
the application role and database role that they are granted. The following shows the application role and database role required
to view cost information or act as an administrator.

| Level of access | Application role | Database role |
| --- | --- | --- |
| Viewer | APP_USAGE_VIEWER | USAGE_VIEWER |
| Administrator | APP_USAGE_ADMIN | USAGE_ADMIN |

> **Note:**
>
> If you want a viewer or administrator to be able to see a currency as the unit of measure (instead of credits), you must also grant the
> ORGANIZATION_BILLING_VIEWER role. For more information, see Viewing a currency as the unit of measure.

You grant access to cost management features by granting the appropriate application role and database role. For example, if you want user
`joe` to be able to view cost information, but not act as an administrator, execute the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE USER joe;
CREATE ROLE cost_viewer_role;

USE DATABASE SNOWFLAKE;
USE SCHEMA ACCOUNT_USAGE;
GRANT APPLICATION ROLE APP_USAGE_VIEWER TO ROLE cost_viewer_role;
GRANT DATABASE ROLE USAGE_VIEWER TO ROLE cost_viewer_role;
GRANT ROLE cost_viewer_role TO USER joe;
```

### Administrator tasks

A user granted the APP_USAGE_ADMIN application role and USAGE_ADMIN database role can view all cost information. In addition, they can
perform the following administrative tasks:

**Budgets**

> * Activate a budget.
> * Deactivate a budget.
> * Modify a spending limit.
> * Modify notification email addresses.
> * Mute notifications for the account budget and for custom budgets.
> * Delete custom budgets.

**Cost anomalies**

> * Modify notification email addresses.

---
title: Access control for data quality
source: https://docs.snowflake.com/en/user-guide/data-quality-access-control.md
section: User Guide
---

# Access control for data quality

The following sections describe the access control requirements for actions related to data quality and data metric functions (DMFs).

## Common tasks

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Task | Required privileges/roles | Notes |
| --- | --- | --- |
| Associate a DMF with a table or view | EXECUTE DATA METRIC FUNCTION on the account | You can’t grant global privileges to database roles because database roles are scoped to the database in which they exist. If you have an object that is owned by a database role and you want to set a DMF on that object, you must transfer the OWNERSHIP privilege of the object to a custom role or system role. |
|  | USAGE privilege on the DMF | All users have USAGE on system DMFs [1]. For custom DMFs, see Granting privileges on a custom DMF. |
|  | One of the following:   * Role with the OWNERSHIP privilege on the table * Role that has the SELECT privilege on the table *and* is specified by the EXECUTE AS ROLE property. | For information about the EXECUTE AS ROLE property, see Required privilege on the table or view. |
| View associations between objects and DMFs | USAGE privilege on the DMF | All users have USAGE on system DMFs [1]. For custom DMFs, see Granting privileges on a custom DMF. |
|  | SELECT privilege on the table or view associated with the DMF |  |
| Set the DMF schedule for a table | One of the following:   * OWNERSHIP on the table * Any privilege on the table *and* EXECUTE DATA METRIC FUNCTION on the account |  |
| Create a custom DMF | CREATE DATA METRIC FUNCTION privilege on the schema |  |
| Call a DMF manually | USAGE privilege on the DMF | All users have USAGE on system DMFs [1]. For custom DMFs, see Granting privileges on a custom DMF. |
|  | SELECT privilege on table or view specified in the call |  |

[1]
(1,2,3)

If you want to revoke the USAGE privilege on system DMFs, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Viewing data quality results

| Viewing option | Required privileges/roles | Notes |
| --- | --- | --- |
| DATA_QUALITY_MONITORING_RESULTS_RAW event table | SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN application role |  |
| DATA_QUALITY_MONITORING_RESULTS view | One of the following:   * SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN application role * SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER application role |  |
| DATA_QUALITY_MONITORING_RESULTS function | One of the following:   * SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN application role * SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER application role * SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role | The PUBLIC role is granted the DATA_QUALITY_MONITORING_LOOKUP application role, which means a user can use any role to call the DATA_QUALITY_MONITORING_RESULTS function. |
|  | USAGE privilege on the DMF | All users have USAGE on system DMFs [2]. For custom DMFs, see Granting privileges on a custom DMF. |
|  | OWNERSHIP or SELECT privilege on the table associated with the DMF |  |
|  | If the EXECUTE AS ROLE property of the association specifies a role, then that role must be active in your session. |  |

[2]

If you want to revoke the USAGE privilege on system DMFs, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Required privilege on the table or view

One of the access control requirements for associating a DMF with a table or view is having the appropriate privilege on that table or view.
To associate a DMF with an object your role must have one of the following privileges:

* OWNERSHIP privilege on the table or view
* SELECT privilege on the table or view

If you want roles with the SELECT privilege on an object to be able to associate DMFs with the object, you must set the EXECUTE AS ROLE
property when defining the association. This property specifies the role that the DMF runs with. For example, suppose the role
`analyst_role` has the SELECT privilege on table `t1`. To associate the `positive_number_count` DMF with table `t1` so it runs with
the `analyst_role` role, run the following command:

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION governance.dfms.positive_number_count on (c1, c2, c3)
    EXECUTE AS ROLE analyst_role;
```

This command can be run by a user with the `analyst_role` role or by a user with a role that is higher in the role hierarchy (for example,
the ACCOUNTADMIN role).

If the EXECUTE AS ROLE property is not specified, the DMF runs with the role of the table owner. The role that the DMF runs with is important
because it can affect data governance policies that behave differently depending on the role of the current user.

### Benefits of the EXECUTE AS ROLE property

The EXECUTE AS ROLE property allows a non-owner to associate and run a DMF on a table or view. This makes it possible for a data governor to
create data quality checks without needing to own the table.

### Limitations

You cannot use the MODIFY DATA METRIC FUNCTION clause to change the role specified by the EXECUTE AS ROLE property. You must drop the
association, then re-create it with a new EXECUTE AS ROLE role.

## Granting privileges on a custom DMF

The GRANT and REVOKE commands require you to specify the arguments of the custom DMF that you create. For example:

```sqlexample
GRANT USAGE ON FUNCTION
  governance.dmfs.count_positive_numbers(TABLE(NUMBER, NUMBER, NUMBER))
  TO data_engineer;
```

---
title: Access control for dbt projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-access-control.md
section: User Guide
---

# Access control for dbt projects on Snowflake

The following commands demonstrate commonly granted privileges for dbt project objects.

* **To grant privileges to create a dbt project object, including deploying from within a workspace:**

  ```sqlexample
  GRANT CREATE DBT PROJECT ON SCHEMA my_database.my_schema TO ROLE my_role;
  ```
* **To grant privileges to alter or drop (delete) a dbt project object, including connecting a workspace to a dbt project object:**

  ```sqlexample
  GRANT OWNERSHIP ON DBT PROJECT my_dbt_project_object TO ROLE my_role;
  ```
* **To grant privileges to execute a dbt project object and to list or get files:**

  ```sqlexample
  GRANT USAGE ON DBT PROJECT my_dbt_project_object TO ROLE my_role;
  ```
* **To view a dbt project object in Snowsight, you must use a role that has the MONITOR privilege on that dbt project. Without this
  privilege, you can’t access the project details, run history, or monitoring information:**

  ```sqlexample
  GRANT MONITOR ON DBT PROJECT my_dbt_project_object TO ROLE my_role;
  ```

For more information, see [dbt project object privileges](../security-access-control-privileges.md).

## Roles and privileges for dbt project deployment

Deploying a dbt project from Snowsight initially uses the role selected in the deploy dialog (that is, the role you select from Connect » Deploy dbt project). During compilation, the dbt Project uses the role specified in the target profile in `profiles.yml` file, unless the object has the DEFAULT_TARGET attribute, which takes precedence.

Similarly, deploying a dbt project from SQL or CLI initially uses the role in the worksheet or `connection.toml`, respectively, then uses the role specified in the command. The actual compilation during deployment uses the role within the target profile in `profiles.yml`, unless the object has the DEFAULT_TARGET attribute, which takes precedence.

## Roles and privileges for dbt project execution

When you execute a dbt project, the roles that perform execution and that materialize output when you specify the dbt `run` or `build` commands depend on the method of execution.

### Execution from SQL or CLI

The dbt command specified in EXECUTE DBT PROJECT runs with the privileges of the `role` specified in the `outputs` block of the projects `profiles.yml` file. Operations are further restricted to only those privileges granted to the Snowflake user calling EXECUTE DBT PROJECT. Both the user and the role specified must have the required privileges to use the `warehouse`, perform operations on the `database` and `schema` specified in the project’s `profiles.yml` file, and perform operations on any other Snowflake objects that the dbt model specifies.

### Execution from within Workspaces

Choosing the dbt Run or Build command for a project from within a workspace materializes target output using the `role` defined in the project’s `profiles.yml` file. Both the user and the role specified must have the required privileges to use the `warehouse`, perform operations on the `database` and `schema` that are specified in the project’s `profiles.yml` file, and perform operations on any other Snowflake objects that the dbt model specifies.

### Scheduled execution from within Workspaces

Scheduling dbt project object execution from within Workspaces creates user-managed tasks. To create a task from within Workspaces, a user must have a role with privileges described under [Access control requirements](../../sql-reference/sql/create-task.md) in the CREATE TASK reference. Snowflake runs tasks with the privileges of the task owner, but task runs are not associated with the user. For more information, see [Tasks run by a system service](../tasks-intro.md).

---
title: Access control privileges
source: https://docs.snowflake.com/en/user-guide/security-access-control-privileges.md
section: User Guide
---

# Access control privileges

This topic describes the privileges that are available in the Snowflake access control model. Privileges are granted to roles, and roles are
granted to users, to specify the operations that the users can perform on objects in the system.

> **Tip:**
>
> To obtain a definitive list of all possible privileges for one or more objects, call the
> [EXPLAIN_GRANTABLE_PRIVILEGES](../sql-reference/functions/explain_grantable_privileges.md) function.

## All privileges (alphabetical)

The following privileges are available in the Snowflake access control model. The meaning of each privilege varies depending on the object type
to which it is applied, and not all objects support all privileges:

| Privilege | Object Type | Description |
| --- | --- | --- |
| ALL [ PRIVILEGES ] | All | Grants all the privileges for the specified object type. |
| APPLY | Policy, Tag | Grants the ability to assign a policy or tag to an object that can be tagged or protected by a policy. |
| APPLYBUDGET | Database, Schema, Table, event table, hybrid table, Apache Iceberg™ table, Warehouse, Task, Pipe, Materialized View | Grants the ability to add or remove an object to or from a [budget](budgets.md). |
| APPLY AGGREGATION POLICY | Global | Grants the ability to add and drop an aggregation policy on a table or view. |
| APPLY AUTHENTICATION POLICY | Global | Grants the ability to add or drop an authentication policy on the Snowflake account or a user in the Snowflake account. |
| APPLY BACKUP RETENTION LOCK | Global | Grants the ability to create and apply [backup](backups.md) policies with retention lock. This privilege is granted to the ACCOUNTADMIN role and can be delegated. |
| APPLY CONTACT | Global | Grants the ability to associate or detach a [contact](contacts-using.md) with an object. |
| APPLY FEATURE POLICY | Global | Grants the ability to apply a feature policy for an account or on a specific object. |
| APPLY JOIN POLICY | Global | Grants the ability to add and drop a join policy on a table or view. |
| APPLY LEGAL HOLD | Global | Grants the ability to add and remove legal holds from [WORM backups](backups.md) for Snowflake databases, schemas, and tables. |
| APPLY MASKING POLICY | Global | Grants the ability to set a Column-level Security masking policy on a table or view column and to set a masking policy on a tag. This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY PACKAGES POLICY | Global | Grants the ability to add or drop a packages policy on the Snowflake account. |
| APPLY PASSWORD POLICY | Global | Grants the ability to add or drop a password policy on the Snowflake account or a user in the Snowflake account. |
| APPLY PRIVACY POLICY | Global | Grants the ability to add and drop a privacy policy on a table or view. |
| APPLY PROJECTION POLICY | Global | Grants the ability to add and drop a projection policy on a table or view. |
| APPLY ROW ACCESS POLICY | Global | Grants the ability to add and drop a row access policy on a table or view. This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY SESSION POLICY | Global | Grants the ability to set or unset a session policy on an account or user. |
| APPLY SNAPSHOT RETENTION LOCK — *Deprecated* | Global | Grants the ability to create and apply [snapshot](backups.md) policies with retention lock. This privilege is granted to the ACCOUNTADMIN role and can be delegated. Deprecated: use APPLY BACKUP RETENTION LOCK instead. |
| APPLY STORAGE LIFECYCLE POLICY | Global | Grants the ability to add or drop a [storage lifecycle policy](storage-management/storage-lifecycle-policies.md) on a table. This global privilege also allows executing the DESCRIBE operation on all storage lifecycle policies. |
| APPLY TAG | Global | Grants the ability to add or drop a tag on a Snowflake object. |
| ATTACH POLICY | Global | Grants the ability to activate a network policy by associating it with your account. |
| AUDIT | Global | Grants the ability to set the [ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR](../sql-reference/parameters.md) and [ENABLE_UNREDACTED_SECURE_OBJECT_ERROR](../sql-reference/parameters.md) user parameters. |
| BIND SERVICE ENDPOINT | Global | Enables the ability to create a service that supports public endpoints. For more information about public endpoints, see [Ingress: Using a service from outside Snowflake](../developer-guide/snowpark-container-services/working-with-services.md). |
| CREATE *<object_type>* | Global, Database, Schema | Grants the ability to create an object of *<object_type>* (e.g. CREATE TABLE grants the ability to create a table within a schema). |
| DELETE | Table, event table, hybrid table, Iceberg table | Grants the ability to execute a [DELETE](../sql-reference/sql/delete.md) command on the table. |
| EVOLVE SCHEMA | Table | Grants the ability for [schema evolution](data-load-schema-evolution.md) to occur on a table when loading data. |
| EXECUTE ALERT | Global | Grants the ability to execute alerts owned by the role. For serverless alerts to run, the role that has the OWNERSHIP privilege on the alert must also have the global EXECUTE MANAGED ALERT privilege. |
| EXECUTE AUTO CLASSIFICATION | Global, Database, Schema | Grants the ability to set a classification profile on a database or schema to implement [sensitive data classification](classify-intro.md). |
| EXECUTE DATA METRIC FUNCTION | Global | Enables using serverless compute resources when calling a data metric function. |
| EXECUTE DBT PROJECT | dbt project object | Grants the ability to execute a dbt project object. |
| EXECUTE MANAGED ALERT | Global | Grants the ability to create alerts that rely on serverless compute resources. Only required to create serverless alerts. The role that has the OWNERSHIP privilege on a serverless alert must have both the EXECUTE MANAGED ALERT and the EXECUTE ALERT privilege for the alert to run. |
| EXECUTE MANAGED TASK | Global | Grants the ability to create tasks that rely on serverless compute resources. Only required to create serverless tasks. The role that has the OWNERSHIP privilege on a task must have both the EXECUTE MANAGED TASK and the EXECUTE TASK privilege for the task to run. |
| EXECUTE TASK | Global | Grants the ability to run tasks owned by the role. For serverless tasks to run, the role that has the OWNERSHIP privilege on the task must also have the global EXECUTE MANAGED TASK privilege. |
| FAILOVER | Failover Group, Connection | Grants the ability to promote a secondary failover group or secondary connection to serve as the primary. |
| IMPORT ORGANIZATION USER GROUPS | Global | Grants the ability to add an [organization user group](organization-users.md) to a regular account, which imports users into the account. |
| IMPORT SHARE | Global | Applies to data consumers. Grants the ability to view shares shared with your account. Also grants the ability to create databases from the shares; requires the global CREATE DATABASE privilege. |
| OVERRIDE SHARE RESTRICTIONS | Global | Grants the ability to set value for the SHARE_RESTRICTIONS parameter on a share. For more details, see [Override share restrictions](override_share_restrictions.md). |
| IMPERSONATE | User | Runs a task or dynamic table on behalf of a specified user account. |
| IMPORTED PRIVILEGES | Database, Data Exchange | Grants the ability to enable roles other than the owning role to access a shared database or manage a Snowflake Marketplace / Data Exchange. |
| INSERT | Table, hybrid table, Iceberg table | Grants the ability to execute an [INSERT](../sql-reference/sql/insert.md) command on the table. |
| MANAGE ACCOUNT SUPPORT CASES | Global | Grants the ability to view, comment on, and manage all Support cases for the current account in Snowsight. |
| MANAGE ACCOUNTS | Global | Grants the ability to manage the lifecycle of accounts in an organization. |
| MANAGE GRANTS | Global | Grants the ability to grant or revoke privileges on any object as if the invoking role were the owner of the object. |
| MANAGE LISTING AUTO FULFILLMENT | Global | Grants the ability to publish listings to remote regions using [Cross-Cloud Auto-Fulfillment](../collaboration/provider-listings-auto-fulfillment.md) and manage auto-fulfillment settings for listings. |
| MANAGE ORGANIZATION CONTACTS | Global | Grants the ability to manage the contacts for an organization. |
| MANAGE ORGANIZATION SUPPORT CASES | Global | Grants the ability to view, comment on, and manage all Support cases that were opened by the current user in Snowsight. |
| MANAGE ORGANIZATION TERMS | Global | Grants the ability to manage the legal terms for an organization. |
| MANAGE ORGANIZATION USERS | Global | Grants the ability to manage [organization users](organization-users.md). |
| MANAGE ORGANIZATION USER GROUPS | Global | Grant the ability to manage [organization user groups](organization-users.md). |
| MANAGE SHARE TARGET | Global | Grants the ability to manage (ALTER) share targets. |
| MANAGE USER SUPPORT CASES | Global | Grants the ability to view, comment on, and manage all Support cases for the current user in Snowsight. |
| MANAGE VISIBILITY | Global | Grants the ability to set the OBJECT_VISIBILITY property, which controls the [discoverability of the objects](ui-snowsight/object-visibility-universal-search.md) in the account. |
| MANAGE WAREHOUSES | Global | Grants the ability to perform operations that require the MODIFY, MONITOR, and OPERATE privileges on warehouses in the same account. |
| MODIFY | Resource Monitor, Warehouse, Data Exchange Listing, Database, Schema, Failover Group, Replication Group, Compute Pool | Grants the ability to change the settings or properties of an object (for example, on a virtual warehouse, provides the ability to change the size of a virtual warehouse). |
| MODIFY LOG LEVEL | Global | Enables setting the level of log messages captured for stored procedures and UDFs in the current account. For more information, see [LOG_LEVEL](../sql-reference/parameters.md). |
| MODIFY METRIC LEVEL | Global | Enables setting the level of metrics data captured for stored procedures and UDFs in the current account. For more information, see [METRIC_LEVEL](../sql-reference/parameters.md). |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | User | Grants the ability to create, modify, delete, rotate, and view information about the [programmatic access tokens](programmatic-access-tokens.md) and [key pairs](key-pair-auth.md) for the user. |
| MODIFY SESSION LOG LEVEL | Global | Enables setting the level of log messages captured for stored procedures and UDFs invoked in the current session. For more information, see [LOG_LEVEL](../sql-reference/parameters.md). |
| MODIFY SESSION METRIC LEVEL | Global | Enables setting the level of metrics data captured for stored procedures and UDFs invoked in the current session. For more information, see [METRIC_LEVEL](../sql-reference/parameters.md). |
| MODIFY SESSION TRACE LEVEL | Global | Enables setting the level of trace events captured for stored procedures and UDFs invoked in the current session. When tracing events, you must also set the LOG_LEVEL parameter to one of its supported values. For more information, see [TRACE_LEVEL](../sql-reference/parameters.md). |
| MODIFY TRACE LEVEL | Global | Enables setting the level of trace events captured for stored procedures and UDFs in the current account. When tracing events, you must also set the LOG_LEVEL parameter to one of its supported values. For more information, see [TRACE_LEVEL](../sql-reference/parameters.md). |
| MONITOR | User, Resource Monitor, Warehouse, Database, Schema, Task, Failover Group, Replication Group, Alert, Compute Pool, Service, Dynamic Table, Semantic View, Snowflake Native App, Agent, dbt Projects on Snowflake | Grants the ability to see details within an object (for example, queries and usage within a warehouse). . . For semantic views, the MONITOR privilege also allows you to view Cortex Analyst [monitoring and observability data](snowflake-cortex/cortex-analyst/admin-observability.md). |
| MONITOR EXECUTION | Global | Grants the ability to monitor pipes (Snowpipe) or tasks in the account. |
| MONITOR SECURITY | Global | Grants the ability to call system functions pertaining to [Customer-managed keys](security-encryption-manage.md). |
| MONITOR USAGE | Global | Grants the ability to monitor account-level usage and historical information for databases and warehouses; for more details, see [Enabling non-account administrators to monitor usage and billing history](security-access-control-configure.md). Additionally grants the ability to view managed accounts using [SHOW MANAGED ACCOUNTS](../sql-reference/sql/show-managed-accounts.md). |
| OPERATE | Warehouse, Task, Dynamic table, Alert, Compute Pool, Service | Grants the ability to start, stop, suspend, or resume a virtual warehouse. Grants the ability to suspend or resume a task. Grants the ability to suspend, resume, or refresh a dynamic table. Grants the ability to suspend or resume a compute pool. Grants the ability to suspend or resume a Snowpark Container Services service, upgrade service, set, and unset service properties. |
| OWNERSHIP | All | Grants the ability to drop, alter, and grant or revoke access to an object. Required to rename an object and create a temporary object with the same name as the object itself. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command to a different role by the owning role or any role with the MANAGE GRANTS privilege. |
| PURCHASE DATA EXCHANGE LISTING | Global | Grants the ability to purchase a paid listing. |
| READ | Stage (internal only), Compute Pool, Git Repository, Image Repository | Grants the ability to perform any operations that require reading from an internal stage ([GET](../sql-reference/sql/get.md), [LIST](../sql-reference/sql/list.md), [COPY INTO <table>](../sql-reference/sql/copy-into-table.md), etc.). Grants the ability to download an image from an image repository. READ privilege on stage and image repository is required to create a Snowpark Container Services service. For models, READ grants the ability to run inference methods along with read-only access to the model’s underlying artifacts and metadata. |
| READ SESSION | Global | Grants the ability to read session context. |
| READ UNREDACTED ERROR TABLE | Global | Grants the ability to read the unredacted data in an error table. Required when the error table is associated with a base table that has security policies, such as a [masking policy](security-column-intro.md). For more information about error tables, see [DML error logging](data-load-overview.md). |
| REFERENCES | Table, event table, hybrid table, Iceberg table, external table, interactive table, view, materialized view, semantic view | Grants the ability to view the structure of an object (but not the data). . . For tables, the privilege also grants the ability to reference the object as the unique/primary key table for a foreign key constraint. |
| REPLICATE | Global, Replication Group, Failover Group | At the account level, grants the ability to change the REPLICABLE_WITH_FAILOVER_GROUPS setting for databases and schemas. For replication groups and failover groups, grants the ability to refresh a secondary replication or failover group. |
| RESOLVE ALL | Global | Grants the ability to resolve all objects in the account, which outputs the object in the corresponding [SHOW <objects>](../sql-reference/sql/show.md) command. |
| SELECT | Table, hybrid table, Iceberg table, event table, external table, interactive table, view, materialized view, semantic view, stream | Grants the ability to execute a [SELECT](../sql-reference/sql/select.md) statement on the table/view. |
| SELECT ERROR TABLE | Table | Grants the ability to execute a [SELECT](../sql-reference/sql/select.md) statement on the error table associated with a base table. For more information, see [DML error logging](data-load-overview.md). |
| TRUNCATE | Table, hybrid table, event table, Iceberg table | Grants the ability to execute a [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command on the table. |
| UPDATE | Table, hybrid table, Iceberg table | Grants the ability to execute an [UPDATE](../sql-reference/sql/update.md) command on the table. |
| USE AI FUNCTIONS | Global | Grants the ability to use Snowflake Cortex AI Functions. Users need both the USE AI FUNCTIONS account privilege and the CORTEX_USER database role to use all Snowflake Cortex AI Functions. For more information, see [Snowflake Cortex AI Functions (including LLM functions)](snowflake-cortex/aisql.md). |
| USAGE | Warehouse, Dataset, Data Exchange Listing, Integration, Database, Schema, Stage (external only), File Format, Sequence, Stored Procedure, User-Defined Types, User-Defined Function, External Function, Compute Pool, Snapshot, Backup Policy, Backup Set, Model, dbt project object, Agent, MCP Server | Grants the ability to execute a [USE <object>](../sql-reference/sql/use.md) command on the object. Also grants the ability to execute a [SHOW <objects>](../sql-reference/sql/show.md) command on the object. Usage on a compute pool is required to create a Snowpark Container Services service. For models, USAGE grants the ability to run inference methods. It doesn’t grant access to the model’s underlying artifacts. For dbt Projects on Snowflake, grants the ability to SHOW, DESCRIBE, view execution history, and EXECUTE DBT PROJECT on the dbt project object. |
| VIEW LINEAGE | Global | Grants the [ability to view data lineage](ui-snowsight-lineage.md), including upstream and downstream lineage objects and dependencies. |
| WRITE | Stage (internal only), image repository, Git Repository | Grants the ability to perform any operations that require writing to an internal stage ([PUT](../sql-reference/sql/put.md), [REMOVE](../sql-reference/sql/remove.md), [COPY INTO <location>](../sql-reference/sql/copy-into-location.md), etc.). Grants the ability to upload an image to an image repository. |

The remaining sections in this topic describe the specific privileges available for each type of object and their usage.

## Global privileges (account privileges)

| Privilege | Usage | Notes |
| --- | --- | --- |
| APPLY AGGREGATION POLICY | Grants the ability to add and drop an aggregation policy on a table or view. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY AUTHENTICATION POLICY | Grants the ability to add or drop an authentication policy on the Snowflake account or a user in the Snowflake account. |  |
| APPLY BACKUP POLICY | Grants the ability to add [backup](backups.md) policies to backup sets that don’t already have a policy. This privilege is granted to the ACCOUNTADMIN role and can be delegated. |  |
| APPLY BACKUP RETENTION LOCK | Grants the ability to create and apply backup policies with retention lock. This privilege is granted to the ACCOUNTADMIN role and can be delegated. |  |
| APPLY CONTACT | Grants the ability to associate or detach a [contact](contacts-using.md) with an object. |  |
| APPLY FEATURE POLICY | Grants the ability to apply a feature policy for an account or on a specific object. |  |
| APPLY JOIN POLICY | Grants the ability to add and drop a join policy on a table or view. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY MASKING POLICY | Grants the ability to set a Column-level Security masking policy on a table or view column and to set a masking policy on a tag. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY ROW ACCESS POLICY | Grants the ability to add and drop a row access policy on a table or view. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY PACKAGES POLICY | Grants the ability to add or drop a packages policy on the Snowflake account. |  |
| APPLY PASSWORD POLICY | Grants the ability to add or drop a password policy on the Snowflake account or a user in the Snowflake account. |  |
| APPLY PRIVACY POLICY | Grants the ability to add and drop a privacy policy on a table or view. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY PROJECTION POLICY | Grants the ability to add and drop a projection policy on a table or view. | This global privilege also allows executing the DESCRIBE operation on tables and views. |
| APPLY SESSION POLICY | Grants the ability to set or unset a session policy on an account or user. |  |
| APPLY SNAPSHOT POLICY — *Deprecated* | Grants the ability to add [snapshot](backups.md) policies to snapshot sets that don’t already have a policy. This privilege is granted to the ACCOUNTADMIN role and can be delegated. . . Deprecated: use APPLY BACKUP POLICY instead. |  |
| APPLY SNAPSHOT RETENTION LOCK — *Deprecated* | Grants the ability to create and apply snapshot policies with retention lock. This privilege is granted to the ACCOUNTADMIN role and can be delegated. . . Deprecated: use APPLY BACKUP RETENTION LOCK instead. |  |
| APPLY STORAGE LIFECYCLE POLICY | Grants the ability to add or drop a [storage lifecycle policy](storage-management/storage-lifecycle-policies.md) on a table. This privilege also allows executing the DESCRIBE operation on all storage lifecycle policies. . . Global privileges aren’t required to use storage lifecycle policies. |  |
| APPLY TAG | Grants the ability to add or drop a tag on a Snowflake object. |  |
| ATTACH POLICY | Grants the ability to activate a network policy by associating it with your account. |  |
| AUDIT | Grants the ability to set the [ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR](../sql-reference/parameters.md) and [ENABLE_UNREDACTED_SECURE_OBJECT_ERROR](../sql-reference/parameters.md) user parameters. |  |
| BIND SERVICE ENDPOINT | Enables the ability to create a service that supports public endpoints. For more information about public endpoints, see [Ingress: Using a service from outside Snowflake](../developer-guide/snowpark-container-services/working-with-services.md) | Must be granted by the ACCOUNTADMIN role. |
| CREATE AGENT | Enables creating a new Cortex Agent. |  |
| CREATE ACCOUNT | Enables a data provider to create a new managed account (i.e. reader account). For more details, see [Manage reader accounts](data-sharing-reader-create.md). | Must be granted by the ACCOUNTADMIN role. |
| CREATE COMPUTE POOL | Enables creating a compute pool to run a Snowpark Container Services service. | Must be granted by the ACCOUNTADMIN role. |
| CREATE DATABASE | Enables creating a new [database](../guides-overview-db.md). | Must be granted by the ACCOUNTADMIN role. |
| CREATE EXTERNAL VOLUME | Enables creating a new external volume for [Apache Iceberg™ tables](tables-iceberg.md). |  |
| CREATE EXTERNAL ACCESS INTEGRATION | Grants a Snowflake Native App the ability to create an external access integration. |  |
| CREATE FEATURE POLICY | Enables creating a new feature policy. |  |
| CREATE FAILOVER GROUP | Enables creating a new [failover group](account-replication-intro.md). | Must be granted by the ACCOUNTADMIN role. |
| CREATE GATEWAY | Enables creating a new gateway. |  |
| CREATE REPLICATION GROUP | Enables creating a new [replication group](account-replication-intro.md). | Must be granted by the ACCOUNTADMIN role. |
| CREATE ROLE | Enables creating a new role. |  |
| CREATE USER | Enables creating a new user. |  |
| CREATE LISTING | Enables creating a new Data Exchange listing. | Must be granted by the ACCOUNTADMIN role. |
| CREATE INTEGRATION | Enables creating a new catalog, notification, security, or storage integration. | Must be granted by the ACCOUNTADMIN role. |
| CREATE NETWORK POLICY | Enables creating a new network policy. |  |
| CREATE ORGANIZATION LISTING | Enables creating a new organization listing. |  |
| CREATE ORGANIZATION PROFILE | Enables creating a new organization profile. |  |
| CREATE ORGANIZATION USER | Enables creating a new [organization user](organization-users.md). | Must be granted by the GLOBALORGADMIN role in the organization account. |
| CREATE ORGANIZATION USER GROUP | Enables creating a new [organization user group](organization-users.md). | Must be granted by the GLOBALORGADMIN role in the organization account. |
| CREATE SECURITY INTEGRATION | Grants a Snowflake Native App the ability to create a security integration. |  |
| CREATE SHARE | Enables a data provider to create a new share. For more details, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md). | Must be granted by the ACCOUNTADMIN role. |
| CREATE WAREHOUSE | Enables creating a new virtual warehouse. | Must be granted by the ACCOUNTADMIN role. |
| EXECUTE ALERT | Grants the ability to execute alerts owned by the role. For serverless alerts to run, the role that has the OWNERSHIP privilege on the alert must also have the global EXECUTE MANAGED ALERT privilege. | Must be granted by the ACCOUNTADMIN role. |
| EXECUTE AUTO CLASSIFICATION | Grants the ability to set a classification profile on a schema to implement [sensitive data classification](classify-intro.md). | Must be granted by the ACCOUNTADMIN role. |
| EXECUTE DATA METRIC FUNCTION | Enables using serverless compute resources when calling a data metric function. |  |
| EXECUTE MANAGED ALERT | Grants the ability to create alerts that rely on serverless compute resources. Only required to create serverless alerts. The role that has the OWNERSHIP privilege on a serverless alert must have both the EXECUTE MANAGED ALERT and the EXECUTE ALERT privilege for the alert to run. |  |
| EXECUTE MANAGED TASK | Grants the ability to create tasks that rely on serverless compute resources. Only required for serverless tasks. The role that has the OWNERSHIP privilege on a task must have both the EXECUTE MANAGED TASK and the EXECUTE TASK privilege for the task to run. | Must be granted by the ACCOUNTADMIN role. |
| EXECUTE TASK | Grants the ability to run tasks owned by the role. For serverless tasks to run, the role that has the OWNERSHIP privilege on the task must also have the global EXECUTE MANAGED TASK privilege. | Must be granted by the ACCOUNTADMIN role. |
| IMPORT SHARE | Enables a data consumer to view shares shared with their account. Also grants the ability to create databases from shares; requires the global CREATE DATABASE privilege. For more details, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md). | Must be granted by the ACCOUNTADMIN role. |
| IMPORT ORGANIZATION LISTING | Enables a provider to install a listing or to perform a query without installing the listing. |  |
| IMPORT ORGANIZATION USER GROUPS | Grants the ability to add an [organization user group](organization-users.md) to a regular account, which imports users into the account. | Must be granted by the ACCOUNTADMIN role. |
| MANAGE ACCOUNTS | Grants the ability to manage the lifecycle of accounts (for example, creating and deleting). | Must be granted by the GLOBALORGADMIN role in the [organization account](organization-accounts.md). |
| MANAGE ACCOUNT SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases for the current account in Snowsight. |  |
| MANAGE APPLICATION SPECIFICATIONS | Grants the ability to approve app specifications. |  |
| MANAGE GRANTS | Enables granting or revoking privileges on objects for which the role is not the owner. | Must be granted by the SECURITYADMIN role (or higher). |
| MANAGE LISTING AUTO FULFILLMENT | Grants the ability to publish listings to remote regions using [Cross-Cloud Auto-Fulfillment](../collaboration/provider-listings-auto-fulfillment.md) and manage auto-fulfillment settings for listings. | In the [organization account](organization-accounts.md), must be granted by the GLOBALORGADMIN role. In all other accounts, must be granted by the ACCOUNTADMIN role after that role has been [delegated privileges by the ORGADMIN role](../collaboration/provider-listings-auto-fulfillment-manage-privileges.md). |
| MANAGE ORGANIZATION CONTACTS | Grants the ability to manage the contacts of an organization. | Must be granted by the GLOBALORGADMIN role in the [organization account](organization-accounts.md). |
| MANAGE ORGANIZATION SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases that were opened by the current user in Snowsight. |  |
| MANAGE ORGANIZATION TERMS | Grants the ability to manage the legal terms for an organization. | Must be granted by the GLOBALORGADMIN role in the [organization account](organization-accounts.md). |
| MANAGE ORGANIZATION USERS | Grants the ability to manage [organization users](organization-users.md). | Must be granted by the GLOBALORGADMIN role in the organization account. |
| MANAGE ORGANIZATION USER GROUPS | Grants the ability to manage [organization user groups](organization-users.md). | Must be granted by the GLOBALORGADMIN role in the organization account. |
| MANAGE SHARE TARGET | Grants the ability to manage (ALTER) share targets. |  |
| MANAGE USER SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases for the current user in Snowsight. |  |
| MANAGE WAREHOUSES | Grants the ability to perform operations that require MODIFY, MONITOR, and OPERATE privileges on warehouses in the same account. | Must be granted by the ACCOUNTADMIN role. |
| MODIFY LOG LEVEL | Enables setting the level of log messages captured for stored procedures and UDFs in the current account. | For more information, see [LOG_LEVEL](../sql-reference/parameters.md). |
| MODIFY METRIC LEVEL | Enables setting the level of metrics data captured for stored procedures and UDFs in the current account. | For more information, see [METRIC_LEVEL](../sql-reference/parameters.md). |
| MODIFY SESSION LOG LEVEL | Enables setting the level of log messages captured for stored procedures and UDFs invoked in the current session. | For more information, see [LOG_LEVEL](../sql-reference/parameters.md). |
| MODIFY SESSION METRIC LEVEL | Enables setting the level of metrics data captured for stored procedures and UDFs invoked in the current session. | For more information, see [METRIC_LEVEL](../sql-reference/parameters.md). |
| MODIFY TRACE LEVEL | Enables setting the level of trace events captured for stored procedures and UDFs in the current account. | When tracing events, you must also set the LOG_LEVEL parameter to one of its supported values. For more information, see [TRACE_LEVEL](../sql-reference/parameters.md). |
| MODIFY SESSION TRACE LEVEL | Enables setting the level of trace events captured for stored procedures and UDFs invoked in the current session. | When tracing events, you must also set the LOG_LEVEL parameter to one of its supported values. For more information, see [TRACE_LEVEL](../sql-reference/parameters.md). |
| MONITOR EXECUTION | Grants the ability to monitor any pipes or tasks in the account. | Must be granted by the ACCOUNTADMIN role. The USAGE privilege is also required on each database and schema that stores these objects. |
| MONITOR | Grants the ability to describe connections, resolve any object and session, and show capacity groups, locks, login events, query history by warehouse, REST history events, task history, and transactions. |  |
| MONITOR SECURITY | Grants the ability to call system functions pertaining to [Customer-managed keys](security-encryption-manage.md). |  |
| MONITOR USAGE | Grants the ability to monitor account-level usage and historical information for databases and warehouses; for more details, see [Enabling non-account administrators to monitor usage and billing history](security-access-control-configure.md). Additionally grants the ability to view managed accounts using [SHOW MANAGED ACCOUNTS](../sql-reference/sql/show-managed-accounts.md). | Must be granted by the ACCOUNTADMIN role. |
| OVERRIDE SHARE RESTRICTIONS | Grants the ability to set value for the SHARE_RESTRICTIONS parameter on a share. | For more details, see [Override share restrictions](override_share_restrictions.md). |
| PURCHASE DATA EXCHANGE LISTING | Grants the ability to purchase a paid listing. | See [Pay for listings](../collaboration/consumer-listings-paying.md). |
| READ SESSION | Grants the ability to read session context. | Must be granted by the ACCOUNTADMIN role. |
| READ UNREDACTED ERROR TABLE | Grants the ability to read the unredacted data in an error table. Required when the error table is associated with a base table that has security policies, such as a [masking policy](security-column-intro.md). For more information about error tables, see [DML error logging](data-load-overview.md). | Must be granted by the ACCOUNTADMIN role. |
| REPLICATE | Grants the ability to change the REPLICABLE_WITH_FAILOVER_GROUPS setting for databases and schemas. |  |
| RESOLVE ALL | Grants the ability to resolve all objects in the account, which outputs the object in the corresponding [SHOW <objects>](../sql-reference/sql/show.md) command. |  |
| USE AI FUNCTIONS | Grants the ability to use Snowflake Cortex AI Functions. Users need both the USE AI FUNCTIONS account privilege and the CORTEX_USER database role to use all Snowflake Cortex AI Functions. | For more information, see [Snowflake Cortex AI Functions (including LLM functions)](snowflake-cortex/aisql.md). |
| VIEW LINEAGE | Grants the ability to view data lineage, including upstream and downstream lineage objects and dependencies. For more information, see [Data Lineage](ui-snowsight-lineage.md). |  |
| ALL [ PRIVILEGES ] | Grants all global privileges. |  |

## User privileges

| Privilege | Usage |
| --- | --- |
| IMPERSONATE | Runs a task or dynamic table on behalf of a specified user account. |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | Grants the ability to create, modify, delete, rotate, and view information about the [programmatic access tokens](programmatic-access-tokens.md) and [key pairs](key-pair-auth.md) for the user. |
| MONITOR | Grants the ability to view the login history for the user. |
| OWNERSHIP | Grants full control over a user/role. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the user. |

## Role privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over a role. Only a single role can hold this privilege on a specific object at a time. Note that the owner role does not inherit any permissions granted to the owned role. To inherit permissions from a role, that role must be granted to another role, creating a parent-child relationship in a role hierarchy. |

## Resource monitor privileges

| Privilege | Usage |
| --- | --- |
| MODIFY | Enables altering any properties of a resource monitor, such as changing the monthly credit quota. |
| MONITOR | Enables viewing a resource monitor. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the resource monitor. |

## Virtual warehouse privileges

| Privilege | Usage |
| --- | --- |
| APPLYBUDGET | Enables adding or removing a warehouse from a budget. |
| MODIFY | Enables altering any properties of a warehouse, including changing its size. . . Required to assign a warehouse to a resource monitor. Note that only the ACCOUNTADMIN role can assign warehouses to resource monitors. |
| MONITOR | Enables viewing current and past queries executed on a warehouse as well as usage statistics on that warehouse. |
| OPERATE | Enables changing the state of a warehouse (stop, start, suspend, resume). In addition, enables viewing current and past queries executed on a warehouse and aborting any executing queries. |
| USAGE | Enables using a virtual warehouse and, as a result, executing queries on the warehouse. If the warehouse is configured to auto-resume when a SQL statement (e.g. query) is submitted to it, the warehouse resumes automatically and executes the statement. |
| OWNERSHIP | Grants full control over a warehouse. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the warehouse. |

> **Tip:**
>
> The granting of the global MANAGE WAREHOUSES privilege is equivalent to granting the MODIFY, MONITOR, and OPERATE
> privileges on all warehouses in an account. You can grant this
> privilege to a role whose purpose includes managing a warehouse to simplify your Snowflake access control management.
>
> For details, refer to [Delegating warehouse management](warehouses-tasks.md).

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

## Connection privileges

| Privilege | Usage |
| --- | --- |
| FAILOVER | Grants the ability to promote a secondary connection to serve as the primary connection. |

## External volume privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables referencing the external volume when executing other commands that use the external volume, and grants the ability to view details for an external volume in a SHOW or DESCRIBE command. |
| OWNERSHIP | Grants full control over an external volume. Only a single role can hold this privilege on a specific object at a time. |

## Failover group privileges

| Privilege | Usage |
| --- | --- |
| MODIFY | Enables altering any properties of a failover group. |
| MONITOR | Enables viewing details of a failover group. |
| OWNERSHIP | Grants full control over a failover group. Only a single role can hold this privilege on a specific object at a time. |
| FAILOVER | Enables promoting a secondary failover group to serve as primary failover group. |
| REPLICATE | Enables refreshing a secondary failover group. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the failover group. |

## Replication group privileges

| Privilege | Usage |
| --- | --- |
| MODIFY | Enables altering any properties of a replication group. |
| MONITOR | Enables viewing details of a replication group. |
| OWNERSHIP | Grants full control over a replication group. Only a single role can hold this privilege on a specific object at a time. |
| REPLICATE | Enables refreshing a secondary replication group. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the replication group. |

## Integration privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables referencing the integration when executing other commands that use the integration. For more information, see access control requirements for [CREATE STAGE](../sql-reference/sql/create-stage.md) and [CREATE EXTERNAL ACCESS INTEGRATION](../sql-reference/sql/create-external-access-integration.md). |
| USE_ANY_ROLE | Allows the External OAuth client or user to switch roles only if this privilege is granted to the client or user. Configure the External OAuth security integration to use the `EXTERNAL_OAUTH_ANY_ROLE_MODE` parameter using [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md) or [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-external.md). |
| OWNERSHIP | Grants full control over an integration. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the integration. |

## Authentication Policy privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Transfers ownership of an authentication policy, which grants full control over the authentication policy. Required to alter most properties of an authentication policy. |

## Network Rule privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over the network rule. |

## Network policy privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over the network policy. Only a single role can hold this privilege on a specific object at a time. |
| USAGE | Grants the ability to apply a network policy. |

## Packages policy privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Transfers ownership of a packages policy, which grants full control over the packages policy. Required to alter most properties of a packages policy. |
| USAGE | Grants the ability to view the contents of a packages policy in a SHOW or DESCRIBE command. |

## Password policy privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Transfers ownership of a password policy, which grants full control over the password policy. Required to alter most properties of a password policy. |

## Provisioned Throughput privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over a provisioned throughput. Only one role at a time can hold this privilege on a specific object. |
| USE | Enables inference with a provisioned throughput. |
| MONITOR | Enables performing DESCRIBE and SHOW commands on a provisioned throughput. |

## Session policy privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Transfers ownership of a session policy, which grants full control over the session policy. Required to alter most properties of a session policy. |

## Data exchange privileges

| Privilege | Usage |
| --- | --- |
| IMPORTED PRIVILEGES | Enables roles other than the owning role to manage a Data Exchange. |

## Listing privileges

| Privilege | Usage |
| --- | --- |
| MODIFY | Enables roles other than the owning role to modify a listing. |
| USAGE | Enables viewing a listing. |
| OWNERSHIP | Grants full control over a listing. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a listing. |

## Organization profile privileges

| Privilege | Usage |
| --- | --- |
| MODIFY | Enables roles other than the owning role to modify an organization profile. |
| OWNERSHIP | Grants full control over an organization profile. Only a single role can hold this privilege on a specific object at a time. |

## Share privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over a share. Only a single role can hold this privilege on a specific object at a time. Cannot be transferred. |

## Database privileges

| Privilege | Usage |
| --- | --- |
| APPLYBUDGET | Enables adding or removing a database from a budget. |
| MODIFY | Enables altering any settings of a database. |
| MONITOR | Enables performing the DESCRIBE command on the database. |
| USAGE | Enables using a database, including returning the database details in the [SHOW DATABASES](../sql-reference/sql/show-databases.md) command output. Additional privileges are required to view or take actions on objects in a database. |
| REFERENCE_USAGE | Enables using an object (e.g. secure view in a share) when the object references another object in a different database. Grant the privilege on the other database to the share. You cannot grant this privilege on a database to any kind of role. For details, see [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) and [Share data from multiple databases](data-sharing-multiple-db.md). |
| CREATE DATABASE ROLE | Enables creating a new database role in a database. |
| CREATE SCHEMA | Enables creating a new schema in a database, including cloning a schema. |
| EXECUTE AUTO CLASSIFICATION | Grants the ability to set a classification profile on a database in order to implement [sensitive data classification](classify-intro.md). |
| IMPORTED PRIVILEGES | Enables roles other than the owning role to access a shared database; applies only to shared databases. |
| OWNERSHIP | Grants full control over the database. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a database. |

> **Note:**
>
> * Changing the properties of a database requires the OWNERSHIP privilege for the database.
>
>   Updating the COMMENT property only requires the MODIFY privilege for the database.
> * If any database privilege is granted to a role, that role can take SQL actions on objects in a schema using fully-qualified
>   names. The role must have the USAGE privilege on the schema as well as the required privilege or privileges on the object. To make a
>   database the active database in a user session, the USAGE privilege on the database is required.
> * An account-level role (i.e. `r1`) with the OWNERSHIP privilege on the database can grant the CREATE DATABASE ROLE privilege to a
>   different account-level role (i.e. `r2`). Similarly, `r1` can also revoke the CREATE DATABASE ROLE privilege from another
>   account-level role.
>
>   In this scenario, `r2` must have the USAGE privilege on the database to create a new database role in that database.
> * When you create a database role, the USAGE privilege on the database that contains the database role is automatically granted to the
>   database role.

## Schema privileges

| Privilege | Usage |
| --- | --- |
| APPLYBUDGET | Enables adding or removing a schema from a budget. |
| MODIFY | Enables altering any settings of a schema. |
| MONITOR | Enables performing the DESCRIBE command on the schema. |
| USAGE | Enables using a schema, including returning the schema details in the [SHOW SCHEMAS](../sql-reference/sql/show-schemas.md) command output. . . To execute [SHOW <objects>](../sql-reference/sql/show.md) commands for objects (tables, views, stages, file formats, sequences, pipes, types, or functions) in the schema, a role must have at least one privilege granted on the object. |
| CREATE AGENT | Enables creating a new [agent](snowflake-cortex/cortex-agents.md) in a schema. |
| CREATE AUTHENTICATION POLICY | Enables creating a new authentication policy in a schema. |
| CREATE BACKUP POLICY | Grants the ability to create a backup policy in a schema. The role granting this privilege must have the OWNERSHIP privilege on the schema. |
| CREATE BACKUP SET | Grants the ability to create a backup set in a schema. The role granting this privilege must have the OWNERSHIP privilege on the schema. |
| CREATE CONTACT | Enables creating a new [contact](contacts-using.md) in a schema. |
| CREATE DATASET | Enables creating a new [machine learning dataset](../developer-guide/snowflake-ml/dataset.md) in a schema. |
| CREATE DATA METRIC FUNCTION | Enables creating a new data metric function in a schema. |
| CREATE DBT PROJECT | Enables creating a new dbt project object in a schema. |
| CREATE EXPERIMENT | Enables creating a new [machine learning experiment](../developer-guide/snowflake-ml/experiments.md) in a schema. |
| CREATE TABLE | Enables creating a new table in a schema, including by cloning. . . This privilege applies to both standard tables and [hybrid tables](tables-hybrid.md). . . This privilege is not required to create temporary tables, which are scoped to the current user session and are automatically dropped when the session ends. |
| CREATE DYNAMIC TABLE | Enables creating a new [dynamic table](dynamic-tables-about.md) in a schema. |
| CREATE EVENT TABLE | Enables creating a new [event table](../developer-guide/logging-tracing/logging-tracing-overview.md) in a schema. |
| CREATE EXTERNAL TABLE | Enables creating a new external table in a schema. |
| CREATE GIT REPOSITORY | Enables creating a new [Git repository](../developer-guide/git/git-overview.md) stage in a schema. |
| CREATE ICEBERG TABLE | Enables creating a new [Iceberg table](tables-iceberg.md) in a schema. |
| CREATE INTERACTIVE TABLE | Enables creating a new [interactive table](interactive.md) in a schema. |
| CREATE VIEW | Enables creating a new view in a schema. |
| CREATE MASKING POLICY | Enables creating a new masking policy in a schema. |
| CREATE MATERIALIZED VIEW | Enables creating a new materialized view in a schema. |
| CREATE MCP SERVER | Enables creating a new [MCP server](snowflake-cortex/cortex-agents-mcp.md) in a schema. |
| CREATE NETWORK RULE | Enables creating a new network rule in a schema. |
| CREATE NOTEBOOK | Enables creating a new notebook in a schema. |
| CREATE ONLINE FEATURE TABLE | Enables creating a new [online feature table](../sql-reference/sql/create-online-feature-table.md) in a schema. |
| CREATE ROW ACCESS POLICY | Enables creating a new row access policy in a schema. |
| CREATE SECRET | Enables creating a new secret in the current/specified schema or replaces an existing secret. |
| CREATE SEMANTIC VIEW | Enables creating a new semantic view in a schema. |
| CREATE SESSION POLICY | Enables creating a new session policy in a schema. |
| CREATE SNAPSHOT POLICY — *Deprecated* | Grants the ability to create a snapshot policy in a schema. The role granting this privilege must have the OWNERSHIP privilege on the schema. Deprecated: use CREATE BACKUP POLICY instead. |
| CREATE SNAPSHOT SET — *Deprecated* | Grants the ability to create a snapshot set in a schema. The role granting this privilege must have the OWNERSHIP privilege on the schema. Deprecated: use CREATE BACKUP SET instead. |
| CREATE STAGE | Enables creating a new stage in a schema, including cloning a stage. |
| CREATE STORAGE LIFECYCLE POLICY | Enables creating a new [storage lifecycle policy](storage-management/storage-lifecycle-policies.md) in a schema. |
| CREATE STREAMLIT | Enables creating a new Streamlit app in a schema. |
| CREATE FILE FORMAT | Enables creating a new file format in a schema, including cloning a file format. |
| CREATE TYPE | Enables creating a new [user-defined type](../sql-reference/data-types-user-defined.md) in a schema. |
| CREATE SEQUENCE | Enables creating a new sequence in a schema, including cloning a sequence. |
| CREATE FUNCTION | Enables creating a new UDF or external function in a schema. |
| CREATE PACKAGES POLICY | Enables creating a new packages policy in a schema. |
| CREATE PASSWORD POLICY | Enables creating a new password policy in a schema. |
| CREATE PIPE | Enables creating a new pipe in a schema. |
| CREATE STREAM | Enables creating a new stream in a schema, including cloning a stream. |
| CREATE TAG | Enables creating a new [tag key](object-tagging/introduction.md) in a schema. |
| CREATE TASK | Enables creating a new task in a schema, including cloning a task. |
| CREATE PROCEDURE | Enables creating a new stored procedure in a schema. |
| CREATE ALERT | Enables creating a new alert in a schema. |
| CREATE CORTEX SEARCH SERVICE | Enables creating new [Cortex search services](snowflake-cortex/cortex-search/cortex-search-overview.md) on a schema. |
| CREATE SNOWFLAKE.CORE.BUDGET | Enables creating new [budget](budgets.md) on a schema. |
| CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE | Enables creating new [classification profile](classify-auto.md) instances on a schema to implement sensitive data classification. |
| CREATE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER | Enables creating new [custom classifier](classify-custom.md) instances on a schema. |
| CREATE SNOWFLAKE.ML.ANOMALY_DETECTION | Enables creating new [anomaly detection](ml-functions/anomaly-detection.md) model instances on a schema. |
| CREATE SNOWFLAKE.ML.CLASSIFICATION | Enables creating new [classification](ml-functions/classification.md) model instances on a schema. |
| CREATE SNOWFLAKE.ML.FORECAST | Enables creating new [forecast](ml-functions/forecasting.md) model instances on a schema. |
| CREATE SNOWFLAKE.ML.TOP_INSIGHTS | Enables creating new [Top Insights](ml-functions/top-insights.md) instances on a schema. |
| CREATE MODEL | Enables creating a [machine learning model](../developer-guide/snowflake-ml/model-registry/overview.md) on a schema. |
| CREATE MODEL MONITOR | Enables creating a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md) on a schema. |
| CREATE IMAGE REPOSITORY | Enables creating a Snowpark Container Services [image repository](../developer-guide/snowpark-container-services/working-with-registry-repository.md) on a schema. |
| CREATE SERVICE | Enables creating a Snowpark Container Services [service](../developer-guide/snowpark-container-services/working-with-services.md) on a schema. |
| CREATE SNAPSHOT | Enables creating a Snowpark Container Services [snapshot](../developer-guide/snowpark-container-services/block-storage-volume.md) on a schema. |
| CREATE WORKSPACE | Enables creating a new [Snowflake Workspace](ui-snowsight/workspaces.md) in a schema. |
| EXECUTE AUTO CLASSIFICATION | Grants the ability to set a classification profile on a schema in order to implement [sensitive data classification](classify-intro.md). Schema owner has this privilege by default. |
| ADD SEARCH OPTIMIZATION | Enables [adding search optimization](search-optimization-service.md) to a table in a schema. |
| OWNERSHIP | Grants full control over the schema. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a schema. |

> **Note:**
>
> * Changing the properties of a schema requires the OWNERSHIP privilege for the database.
> * Operating on a schema also requires at least one privilege on the parent database.

## Table privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a table and [classifying](classify-intro.md) a table. |
| SELECT ERROR TABLE | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on the error table associated with a base table. For more information, see [DML error logging](data-load-overview.md). |
| INSERT | Enables executing an [INSERT](../sql-reference/sql/insert.md) command on a table. Also enables using the [ALTER TABLE](../sql-reference/sql/alter-table.md) command with a `RECLUSTER` clause to manually recluster a table with a clustering key. |
| UPDATE | Enables executing an [UPDATE](../sql-reference/sql/update.md) command on a table. |
| TRUNCATE | Enables executing a [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command on a table. |
| DELETE | Enables executing a [DELETE](../sql-reference/sql/delete.md) command on a table. |
| EVOLVE SCHEMA | Enables [schema evolution](data-load-schema-evolution.md) to occur on a table when loading data. |
| REFERENCES | Enables referencing a table as the unique/primary key table for a foreign key constraint. Also enables viewing the structure of a table (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| APPLYBUDGET | Enables adding or removing a table from a budget. |
| OWNERSHIP | Grants full control over the table. Required to alter most properties of a table, with the exception of reclustering. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a table. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object
>   that already exists in the schema.

## Dynamic table privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a dynamic table. The SELECT privilege on a dynamic table allows you to view it in the output of the [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) command.  If you have the SELECT privilege but don’t have the MONITOR privilege, the following fields are hidden: `text`, `warehouse`, `scheduling_state`, `last_suspended_on`, and `suspend_reason_code` (only hidden in Snowsight). |
| OPERATE | Enables altering the properties of a dynamic table.  If you do not have this privilege on a dynamic table, you can’t use the ALTER DYNAMIC TABLE command, which enables you to:   * Suspend a dynamic table using [ALTER … SUSPEND](../sql-reference/sql/alter-dynamic-table.md). * Resume a dynamic table using [ALTER … RESUME](../sql-reference/sql/alter-dynamic-table.md). * Refresh a dynamic table using [ALTER … REFRESH](../sql-reference/sql/alter-dynamic-table.md). * Set or change the warehouse and/or target lag using [ALTER … SET](../sql-reference/sql/alter-dynamic-table.md).   Additionally, if you lack this privilege on a dynamic table, you cannot execute `CREATE DYNAMIC TABLE ... INITIALIZE = ON_CREATE` to create a new dynamic table that consumes from it. |
| MONITOR | Enables accessing the metadata for a dynamic table through Snowsight and SQL commands and functions.  While the OPERATE privilege grants this access, it also includes the capability to alter dynamic tables, making MONITOR the more suitable option for scenarios where a user does not need to alter a dynamic table. For example, roles held by data scientists.  If you have the MONITOR privilege on a dynamic table, you can do the following:   * Call the [DYNAMIC_TABLE_GRAPH_HISTORY](../sql-reference/functions/dynamic_table_graph_history.md) table function to view   graph history of that dynamic table. * Call the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) table function to view   refresh history for that dynamic table. * View that dynamic table in the output of the [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) command. * View that dynamic table’s metadata in the output of the [DESCRIBE DYNAMIC TABLE](../sql-reference/sql/desc-dynamic-table.md) command or the Snowsight dynamic tables   details page.    + If you have the SELECT privilege but don’t have the MONITOR privilege, the following fields are hidden:     `text`, `warehouse`, `scheduling_state`, `last_suspended_on`, and `suspend_reason_code` (only hidden in Snowsight). |
| OWNERSHIP | Grants full control over the dynamic table. Only a single role can hold this privilege on a specific object at a time.  Required to drop a dynamic table. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the dynamic table. |

## Event table privileges

Some privileges typically supported for tables are disallowed on event tables (and as a result aren’t listed here) because the
[event table structure](../developer-guide/logging-tracing/event-table-columns.md) is predefined and immutable.

| Privilege | Usage |
| --- | --- |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the event table. |
| APPLYBUDGET | Enables adding or removing an event table from a [budget](budgets.md). |
| DELETE | Enables executing a [DELETE](../sql-reference/sql/delete.md) command on an event table. |
| OWNERSHIP | Grants full control over the event table. Required to alter the event table. In conjunction with OWNERSHIP of the account, grants the ability to associate an account with an event table. |
| REFERENCES | Grants the ability to view the structure of an event table (but not the data). |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on an event table. |
| TRUNCATE | Enables executing a [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command on the event table. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## External table privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on an external table and [classifying](classify-intro.md) an external table. |
| REFERENCES | Enables viewing the structure of an external table (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| OWNERSHIP | Grants full control over the external table; required to refresh an external table. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on an external table. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Hybrid table privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a hybrid table. |
| INSERT | Enables executing an [INSERT](../sql-reference/sql/insert.md) command on a hybrid table. |
| UPDATE | Enables executing an [UPDATE](../sql-reference/sql/update.md) command on a hybrid table. |
| TRUNCATE | Enables executing a [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command on a hybrid table. |
| DELETE | Enables executing a [DELETE](../sql-reference/sql/delete.md) command on a hybrid table. |
| REFERENCES | Enables referencing a hybrid table as the unique/primary key table for a foreign key constraint. Also enables viewing the structure of a hybrid table (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| APPLYBUDGET | Enables adding or removing a hybrid table from a budget. |
| OWNERSHIP | Grants full control over the hybrid table. Required to alter most properties of a hybrid table. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a hybrid table. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * The following privileges have no effect when granted on a hybrid table that uses a catalog integration: INSERT, UPDATE, DELETE. Hybrid tables that
>   use a catalog integration are read-only.

## Iceberg table privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on an Iceberg table. |
| INSERT | Enables executing an [INSERT](../sql-reference/sql/insert.md) command on an Iceberg table. |
| UPDATE | Enables executing an [UPDATE](../sql-reference/sql/update.md) command on an Iceberg table. |
| TRUNCATE | Enables executing a [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command on an Iceberg table. |
| DELETE | Enables executing a [DELETE](../sql-reference/sql/delete.md) command on an Iceberg table. |
| REFERENCES | Enables referencing an Iceberg table as the unique/primary key table for a foreign key constraint. Also enables viewing the structure of an Iceberg table (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| APPLYBUDGET | Enables adding or removing an Iceberg table from a budget. |
| OWNERSHIP | Grants full control over the Iceberg table. Required to alter most properties of an Iceberg table. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on an Iceberg table. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * The following privileges have no effect when granted on an Iceberg table that uses an external catalog: INSERT, UPDATE, DELETE. Iceberg tables that
>   use an external catalog are read-only.

## Interactive table privileges

These privileges apply to [interactive tables](interactive.md).

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on an interactive table. |
| MONITOR | Grants the ability to access metadata for an interactive table through Snowsight and SQL. You can monitor interactive table refresh progress using the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) table function. |
| REFERENCES | Enables referencing an interactive table as the unique/primary key table for a foreign key constraint. Also enables viewing the structure of an interactive table (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| OWNERSHIP | Grants full control over the interactive table. Required to rename the interactive table. Only a single role can hold this privilege on a specific object at a time. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on an interactive table. |

> **Note:**
>
> * Operating on an interactive table also requires the USAGE privilege on the parent database and schema.
> * Interactive tables don’t support DML operations such as INSERT, UPDATE, and DELETE.
>   For information about ingesting data into interactive tables, see [Snowflake interactive tables and interactive warehouses](interactive.md).

## View privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a view and [classifying](classify-intro.md) a view. . . This privilege is sufficient to query a view; the SELECT privilege is not required on the objects from which the view is created. |
| REFERENCES | Enables viewing the structure of a view (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| OWNERSHIP | Grants full control over the view. Required to alter a view. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a view. |

> **Note:**
>
> * Table DML privileges such as INSERT, UPDATE, and DELETE can be granted on views; however, because views are read-only, these privileges
>   have no effect.
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object
>   that already exists in the schema.

## Materialized view privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a view and [classifying](classify-intro.md) a materialized view. . . Note that this privilege is sufficient to query a view. The SELECT privilege is not required on the underlying objects for a view. |
| REFERENCES | Enables viewing the structure of a view (but not the data) via the DESCRIBE or SHOW command or by querying the Information Schema. |
| APPLYBUDGET | Enables adding or removing a materialized view from a budget. |
| OWNERSHIP | Grants full control over the view. Required to alter a view. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a view. |

> **Note:**
>
> * Table DML privileges such as INSERT, UPDATE, and DELETE can be granted on views; however, because views are read-only, these privileges
>   have no effect.
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object
>   that already exists in the schema.

## Semantic view privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a semantic view. . . This privilege is sufficient to query a semantic view; the SELECT privilege is not required on the objects from which the semantic view is created. . . Also enables executing [DESCRIBE SEMANTIC VIEW](../sql-reference/sql/desc-semantic-view.md) for the semantic view. |
| REFERENCES | Enables viewing the structure of a semantic view (but not the data) by querying the Information Schema views that provide information about the semantic view or by executing a DESCRIBE or SHOW command. This includes the DESCRIBE and SHOW commands for the underlying entities, calculations and relationships. Also enables calling the [GET_DDL](../sql-reference/functions/get_ddl.md) function for the semantic view. |
| MONITOR | Grants the ability to view details about the semantic view (using SHOW commands, DESC commands, and INFORMATION_SCHEMA views) and Cortex Analyst [monitoring and observability data](snowflake-cortex/cortex-analyst/admin-observability.md). |
| OWNERSHIP | Grants full control over the semantic view. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. . . Required to replace a view. Your role must also be granted the CREATE SEMANTIC VIEW privilege on the schema containing the view. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on a semantic view. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Notebook privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over the notebook. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| USAGE | Grants the ability to reference and view the notebook in [SHOW <objects>](../sql-reference/sql/show.md) commands. |

## Online Feature Table privileges

| Privilege | Usage |
| --- | --- |
| MONITOR | Grants the ability to view details about the online feature table using [SHOW ONLINE FEATURE TABLES](../sql-reference/sql/show-online-feature-tables.md) and view refresh history using the [ONLINE_FEATURE_TABLE_REFRESH_HISTORY](../sql-reference/functions/online-feature-table-refresh-history.md) function. |
| SELECT | Grants the ability to query data from the online feature table. |
| OWNERSHIP | Grants full control over the online feature table. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all applicable privileges, except OWNERSHIP, on the online feature table. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Stage privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables using an external stage object in a SQL statement and includes the READ and WRITE privileges; not applicable to internal stages. |
| READ | Enables performing any operations that require reading from a stage (for example, [file staging commands](../sql-reference/commands-file.md) and [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)). |
| WRITE | Enables performing any operations that require writing to a stage (for example, [file staging commands](../sql-reference/commands-file.md) and [COPY INTO <location>](../sql-reference/sql/copy-into-location.md)). |
| OWNERSHIP | Grants full control over the stage. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all applicable privileges, except OWNERSHIP, on the stage (internal or external). |

> **Note:**
>
> * When granting both the READ and WRITE privileges for a stage, the READ privilege must be granted before or at the same time as
>   the WRITE privilege.
> * When revoking both the READ and WRITE privileges for a stage, the WRITE privilege must be revoked before or at the same time as
>   the READ privilege.
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * To run the following commands on an external stage that uses a storage integration,
>   you must use a role that has been granted or inherits the USAGE privilege on the storage integration (unless the stage-owning
>   role has this privilege):
>
>   + [LIST](../sql-reference/sql/list.md)
>   + [REMOVE](../sql-reference/sql/remove.md)
>   + [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)
>   + [COPY INTO <location>](../sql-reference/sql/copy-into-location.md)
> * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object
>   that already exists in the schema.

### Directory table privileges

The following table summarizes the stage [privileges](security-access-control-overview.md) that you need to execute common
SQL commands when you work with a [directory table](data-load-dirtables.md) on a stage.

| Operation | Object Type | Privilege Required |
| --- | --- | --- |
| Retrieve file URLs from a directory table using a SELECT FROM DIRECTORY statement. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the READ privilege on the stage. * External stage: An account role or database role with either the READ or USAGE privilege on the stage. |
| Upload data using the [PUT](../sql-reference/sql/put.md) command. | Stage (internal only) | An account role or database role with the WRITE privilege on the stage. |
| Remove files using the [REMOVE](../sql-reference/sql/remove.md) command. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the WRITE privilege on the stage. * External stage: An account role or database role with either the WRITE or USAGE privilege on the stage. |
| Refresh the metadata using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the WRITE privilege on the stage. * External stage: An account role or database role with either the WRITE or USAGE privilege on the stage. |

## Snowflake Git repository clone privileges

| Privilege | Usage |
| --- | --- |
| READ | Enables performing any operations that require reading from a Git repository clone. |
| WRITE | Enables performing operations that require writing to a Git repository clone, such as changing the object’s properties or performing a FETCH from the remote repository. |
| OWNERSHIP | Grants full control over the Git repository clone. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all applicable privileges, except OWNERSHIP, on the Git repository clone. |

## File format privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables using a file format in a SQL statement. |
| OWNERSHIP | Grants full control over the file format. Required to alter a file format. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the file format. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object
>   that already exists in the schema.

## Pipe privileges

Pipe objects are created and managed to load data using Snowpipe.

| Privilege | Usage |
| --- | --- |
| APPLYBUDGET | Enables adding or removing a pipe from a budget. |
| MONITOR | Enables viewing details for the pipe (using DESCRIBE PIPE or SHOW PIPES). |
| OPERATE | Enables viewing details for the pipe (using DESCRIBE PIPE or SHOW PIPES), pausing or resuming the pipe, and refreshing the pipe. |
| OWNERSHIP | Grants full control over the pipe. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the pipe. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Database role privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over a database role. Only a single role can hold this privilege on a specific object at a time. Note that the owner role does not inherit any permissions granted to the owned database role. To inherit permissions from a database role, that database role must be granted to another role, creating a parent-child relationship in a role hierarchy. |

## Stream privileges

| Privilege | Usage |
| --- | --- |
| SELECT | Enables executing a [SELECT](../sql-reference/sql/select.md) statement on a stream, which also allows you to view the stream in the output of the SHOW STREAMS command. To view the `table_name` and `base_tables` columns, you need at least one access privilege on the stream’s source object. |
| OWNERSHIP | Grants full control over the stream. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the stream. |

## Task privileges

| Privilege | Usage |
| --- | --- |
| APPLYBUDGET | Enables adding or removing a task from a budget. |
| MONITOR | Enables viewing details for the task (using DESCRIBE TASK or SHOW TASKS). |
| OPERATE | Enables viewing details for the task (using DESCRIBE TASK or SHOW TASKS) and resuming or suspending the task. |
| OWNERSHIP | Grants full control over the task. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the task. |

## dbt project object privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables executing a dbt project object, retrieving files from the dbt project object, viewing details (using DESCRIBE DBT PROJECT and SHOW DBT PROJECT), and viewing execution history. |
| MONITOR | Enables viewing a dbt project object in Snowsight. Without this privilege, you can’t access the project details, run history, or monitoring information. |
| OWNERSHIP | Grants full control over the dbt project object, including executing and monitoring. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the dbt project object. |

## Secret privileges

| Privilege | Usage |
| --- | --- |
| READ | Enables a UDF or stored procedure that uses a secret to access the credentials that are stored in the secret. For details, see [Creating a secret to represent credentials](../developer-guide/external-network-access/creating-using-external-network-access.md). |
| USAGE | Enables using a secret. |
| OWNERSHIP | Transfers ownership of a secret, which grants full control over the secret. Required to alter most properties of a secret or drop a secret from the system. |

## Aggregation policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for an aggregation policy on a table or view.  Note that granting the global APPLY AGGREGATION POLICY privilege (i.e. APPLY AGGREGATION POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views. |
| OWNERSHIP | Grants full control over the aggregation policy. Required to alter most properties of an aggregation policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Join policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a join policy on a table or view.  Note that granting the global APPLY JOIN POLICY privilege (i.e. APPLY JOIN POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views. |
| OWNERSHIP | Grants full control over the join policy. Required to alter most properties of a join policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Masking policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a [masking policy](security-column-intro.md) on a column.  Note that granting the global APPLY MASKING POLICY privilege (i.e. APPLY MASKING POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see [Masking policy privileges](security-column-intro.md). |
| OWNERSHIP | Grants full control over the masking policy. Required to alter most properties of a masking policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Privacy policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a privacy policy on a table or view.  Note that granting the global APPLY PRIVACY POLICY privilege (that is, APPLY PRIVACY POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views. |
| OWNERSHIP | Grants full control over the privacy policy. Required to alter most properties of a privacy policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Projection policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a projection policy on a column.  Note that granting the global APPLY PROJECTION POLICY privilege (i.e. APPLY PROJECTION POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views. |
| OWNERSHIP | Grants full control over the projection policy. Required to alter most properties of a projection policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Row access policy privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the add and drop operations for the [row access policy](security-row-intro.md) on a table or view.  Note that granting the global APPLY ROW ACCESS POLICY privilege (i.e. APPLY ROW ACCESS POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see [Summary of DDL commands, operations, and privileges](security-row-intro.md). |
| OWNERSHIP | Grants full control over the row access policy. Required to alter most properties of a row access policy. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Tag privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the add and drop operations for the tag on a Snowflake object. |
| READ | Enables a data sharing consumer to view shared tag assignments using a [SHOW TAGS](../sql-reference/sql/show-tags.md) command. The data sharing provider grants this privilege to a database role or directly to the share. |
| OWNERSHIP | Grants full control over the tag. Required to alter most properties of a tag. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Tags are stored at the schema level.
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Sequence privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables using a sequence in a SQL statement. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the sequence. |
| OWNERSHIP | Grants full control over the sequence; required to alter the sequence. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## User-defined types

| Privilege | Usage |
| --- | --- |
| USAGE | Enables *explicitly* using a user-defined type in a SQL statement. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the user-defined type. |
| OWNERSHIP | Grants full control over the user-defined type; required to alter the user-defined type. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Explicit use of a user-defined type means specifying the type by name in a SQL statement or code. The
following are examples of explicit use of a user-defined type:

* An explicit cast to the user-defined type.
* A type definition in a DML statement, stored procedure code, or user-defined function code that
  specifies the user-defined type.

USAGE privilege isn’t required on a user-defined type if it isn’t explicitly used. For example, if a table
column, stored procedure argument, or function argument is defined with a user-defined type, users
can query the table or call the stored procedure or function without USAGE privilege on the user-defined
type.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

## Stored procedure privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables calling a stored procedure. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the stored procedure. |
| OWNERSHIP | Grants full control over the stored procedure; required to alter the stored procedure. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * If a stored procedure runs with caller’s rights, the user who calls the stored procedure must have privileges on the database
>   objects (e.g. tables) accessed by the stored procedure. For details, see [Understanding caller’s rights and owner’s rights stored procedures](../developer-guide/stored-procedure/stored-procedures-rights.md).

## User-defined function (UDF) and external function privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables calling a UDF or external function. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the UDF or external function. |
| OWNERSHIP | Grants full control over the UDF or external function; required to alter the UDF or external function. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

> **Note:**
>
> * Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.
> * The owner of a UDF must have privileges on the objects accessed by the function; the user who calls a UDF does not need those
>   privileges. For details, see [Security/privilege requirements for SQL UDFs](../developer-guide/udf/sql/udf-sql-introduction.md).
> * The owner of an external function must have the USAGE privilege on the API integration object associated with the external
>   function. For details, see [Access control](../sql-reference/external-functions-security.md) in the documentation on external functions.

## Data metric function (DMF) privileges

| Privilege | Usage |
| --- | --- |
| USAGE | Enables calling the DMF. |
| OWNERSHIP | Transfers ownership of the data metric function, which grants full control over the data metric function. Required to alter most properties of the data metric function. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the DMF. |

## Alert privileges

| Privilege | Usage |
| --- | --- |
| MONITOR | Enables viewing details for the alert (using [DESCRIBE ALERT](../sql-reference/sql/desc-alert.md) or [SHOW ALERTS](../sql-reference/sql/show-alerts.md)). |
| OPERATE | Enables viewing details for the alert (using DESCRIBE ALERT or SHOW ALERTS) and resuming or suspending the alert (using [ALTER ALERT](../sql-reference/sql/alter-alert.md)). |
| OWNERSHIP | Grants full control over the alert. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the alert. |

## Compute Pool privileges

| Privilege | Usage |
| --- | --- |
| OPERATE | Enables suspending or resuming a compute pool. |
| MODIFY | Enables altering compute pool and setting properties. |
| USAGE | Enables running a service or a job. It enables communicating with the service (create a service function, use public endpoints, and connect from another service). |
| MONITOR | Enables viewing compute pool usage (number of services and jobs running), properties, and listing compute pool in the account for which the role has access privileges. |
| OWNERSHIP | Grants full control over the compute pool. Only a single role can hold this privilege on a specific compute pool object at a time. |

## Image Repository privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the image repository. The role with this privilege can also delete an image repository. |
| READ | Enable listing and downloading images from an image repository. |
| WRITE | Enables listing and downloading images from a repository. Also enables pushing images in the repository. |

## Service privileges

| Privilege | Usage |
| --- | --- |
| OPERATE | Enable suspending or resuming a service, upgrading service, and modifying service properties. |
| OWNERSHIP | Enables full control over the service. The role with this privilege can also remove a service from a schema. |
| MONITOR | Enable monitoring a service and getting runtime status. |

## Cortex Search Service privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the Cortex Search service. The role with this privilege can also remove a service from a schema. |
| OPERATE | Enables inspecting, suspending or resuming a Cortex Search service and modifying service properties. |
| USAGE | Enables invoking the service. |
| ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the service. |

## Snapshot privileges (for block storage volume snapshots)

These privileges apply to block storage volume snapshots.

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the snapshot. The role with this privilege can also remove a snapshot from a schema. |
| USAGE | Enables listing and describing snapshots. |

## Backup policy privileges

These privileges apply to [backup](backups.md) policies for Snowflake databases, schemas,
and tables.

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over backup policies. |
| USAGE | Enables listing and describing backup policies. |

## Backup set privileges

These privileges apply to [backup](backups.md) sets for Snowflake databases, schemas,
and tables.

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over backup sets. |
| USAGE | Enables listing and describing backup sets. |

## Snapshot policy privileges (for WORM snapshots) — *Deprecated*

> **Note:**
>
> These privileges are deprecated. Use backup policy privileges instead.

These privileges apply to [Write Once Read Many (WORM) snapshots](backups.md) for Snowflake databases, schemas,
and tables.

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over snapshot policies. |
| USAGE | Enables listing and describing snapshot policies. |

## Snapshot set privileges (for WORM snapshots) — *Deprecated*

> **Note:**
>
> These privileges are deprecated. Use backup set privileges instead.

These privileges apply to [Write Once Read Many (WORM) snapshots](backups.md) for Snowflake databases, schemas,
and tables.

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over snapshot sets. |
| USAGE | Enables listing and describing snapshot sets. |

## Storage lifecycle policy privileges

These privileges apply to [storage lifecycle policies](storage-management/storage-lifecycle-policies.md).

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control of the storage lifecycle policy. This privilege is required to alter the policy. Only one role can have this privilege per lifecycle policy object. |
| APPLY | Allows the grantee to add or drop the storage lifecycle policy on a table. To add the policy to a table, you must also have the OWNERSHIP privilege for the table or the global `APPLY STORAGE LIFECYCLE POLICY` privilege on the account. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Streamlit privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Grants full control over the Streamlit object; required to alter the Streamlit object. Only a single role can hold this privilege on a specific object at a time. |
| USAGE | Enables viewing and running a Streamlit app, as well as displaying information about the Streamlit object. This privilege does not allow users to see the Streamlit app code or the artifacts that define the Streamlit app. |

## Model privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the model. Only one role at a time can hold this privilege on a given model. |
| USAGE | Enables displaying information about a model and invoking its methods. It does not allow users to see model weights or the artifacts that define the model. This privilege is also supported `ON FUTURE MODELS`. |

## Application package privileges

| Privilege | Usage |
| --- | --- |
| ATTACH LISTING | Associates a listing with an application package or share. |

## Contact privileges

| Privilege | Usage |
| --- | --- |
| APPLY | Enables the ability to associate and detach a contact with a Snowflake object. |
| MODIFY | Enables the ability to modify a contact. |
| OWNERSHIP | Grants full control over the contact. Required to alter most properties of a contact. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

## Dataset privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the dataset. |
| USAGE | Enables displaying information about a dataset and invoking its methods. |

## Cortex Agent privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the Cortex Agent. The role with this privilege can also remove an agent from a schema. |
| MODIFY | Enables the ability to modify a Cortex Agent. |
| MONITOR | Enables the ability to view threads, logs, and traces of the Cortex Agent. |
| USAGE | Enables querying the Cortex Agent to generate responses. |

## Machine Learning Experiment privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the experiment. The role with this privilege can also remove an experiment from a schema. |
| MODIFY | Enables the ability to modify an experiment and its runs. |
| USAGE | Enables examining the run information contained within an experiment. |

## MCP Server privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the MCP Server. The role with this privilege can also remove an MCP Server from a schema. |
| MODIFY | Enables the ability to modify an MCP Server. |
| USAGE | Enables querying the MCP Server to discover tools and invoke them. |

## Gateway privileges

| Privilege | Usage |
| --- | --- |
| OWNERSHIP | Enables full control over the gateway. The role with this privilege can also remove a gateway from a schema. |
| MODIFY | Enables the ability to modify a gateway. |
| USAGE | Enables using the gateway. |

## Workspace privileges

| Privilege | Usage |
| --- | --- |
| READ | Grants read-only access to the workspace and its files. |
| WRITE | Grants the ability to create, edit, and delete files in the workspace. Granting WRITE also grants READ access. You do not need to grant READ separately. |
| OWNERSHIP | Grants full control over the workspace. Only a single role can hold this privilege on a specific object at a time. Note that in a [managed access schema](security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| ALL [ PRIVILEGES ] | Grants all applicable privileges, except OWNERSHIP, on the workspace. |

For information about creating and sharing workspaces, see [Shared workspaces](ui-snowsight/workspaces-shared.md).

---
title: Access History
source: https://docs.snowflake.com/en/user-guide/access-history.md
section: User Guide
---

# Access History

This topic provides concepts on the user access history in Snowflake.

## Overview

Access History in Snowflake refers to when the user query reads data and when the SQL statement performs a data write
operation, such as INSERT, UPDATE, and DELETE along with variations of the COPY command, from the source data object to the target data
object. The user access history can be found by querying the ACCESS_HISTORY view in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas. The
records in these views facilitate regulatory compliance auditing and provide insights on popular and frequently accessed tables and columns
because there is a direct link between the user (i.e. query operator), the query, the table or view, the column, and the data.

Each row in the ACCESS_HISTORY view contains a single record per SQL statement. The record contains the following kinds of information:

* The *source columns* the query accessed directly and indirectly, such as the underlying tables that the data for the query comes from.
* The *projected columns* the user sees in the query result, such as the columns specified in a SELECT statement.
* The columns that are used to determine the query result but are not projected, such as columns in a WHERE clause to filter the result.

For example:

```sqlexample
CREATE OR REPLACE VIEW v1 (vc1, vc2) AS
SELECT c1 as vc1,
       c2 as vc2
FROM t
WHERE t.c3 > 0
;
```

* Columns C1 and C2 are source columns that the view accesses directly, which are recorded in the `base_objects_accessed` column of
  the ACCESS_HISTORY view.
* Column C3 is used to filter the rows the view includes, which is recorded in the `base_objects_accessed` column of
  the ACCESS_HISTORY view.
* Columns VC1 and VC2 are projected columns the user sees when querying the view, `SELECT * FROM v1;`, which are recorded in the
  `direct_objects_accessed` column of the ACCESS_HISTORY view.

The same behavior applies to a key column in a WHERE clause. For example:

```sqlexample
CREATE OR REPLACE VIEW join_v (vc1, vc2, c1) AS
  SELECT
      bt.c1 AS vc1,
      bt.c2 AS vc2,
      jt.c1
  FROM bt, jt
  WHERE bt.c3 = jt.c1;
```

* Two different tables are required to create the view: `bt` (base table) and `jt` (join table.).
* Columns C1, C2, and C3 from the base table and column C1 from the join table are all recorded in the `base_objects_accessed` column
  of the ACCESS_HISTORY view.
* Columns VC1, VC2, and C1 are projected columns the user sees when querying the view, `SELECT * FROM join_v;`, which are
  recorded in the `direct_objects_accessed` column of the ACCESS_HISTORY view.

> **Note:**
>
> Records in the Account Usage [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view do not always get recorded in the
> ACCESS_HISTORY view. The structure of the SQL statement determines whether Snowflake records an entry in the ACCESS_HISTORY view.
>
> For details on the read and write operations Snowflake supports in the ACCESS_HISTORY view, refer to the view
> [Usage notes](../sql-reference/account-usage/access_history.md).

## Tracking read and write operations

The ACCESS_HISTORY view in both the ACCOUNT_USAGE and the ORGANIZATION_USAGE schemas includes the following columns:

```none
query_id | query_start_time | user_name | direct_objects_accessed | base_objects_accessed | objects_modified | object_modified_by_ddl | policies_referenced | parent_query_id | root_query_id
```

Read operations are tracked through the first five columns, while the last column,
`objects_modified`, specifies the data write information that involved Snowflake columns, tables, and stages.

The query in Snowflake and how the database objects were created determines the information Snowflake returns in the
`direct_objects_accessed`, `base_objects_accessed`, and `objects_modified` columns.

Similarly, if the query references an object protected by a row access policy or a column protected by a masking policy, Snowflake records
the policy information in the `policies_referenced` column.

The `object_modified_by_ddl` column records the DDL operation on a database, schema, table, view, and column. These operations also
include statements that specify a row access policy on a table or view, a masking policy on a column, and tag updates
(e.g. set a tag, change a tag value) on the object or column.

The `parent_query_id` and `root_query_id` columns record query IDs that correspond to:

* A query that performs a read or write operation on another object.
* A query that performs a read or write operation on an object that calls a stored procedure, including nested stored procedure calls. For
  details, see ancestor queries (in this topic).

For column details, see the [Columns](../sql-reference/account-usage/access_history.md) section in the ACCESS_HISTORY view.

### Read

Consider the following scenario to understand a read query and how the ACCESS_HISTORY view records this information:

* A series of objects: `base_table` » `view_1` » `view_2` » `view_3`.
* A read query on `view_2`, such as:

  ```sqlexample
  select * from view_2;
  ```

In this example, Snowflake returns:

* `view_2` in the `direct_objects_accessed` column because the query specifies `view_2`.
* `base_table` in the `base_objects_accessed` column because that is the original source of the data in `view_2`.

Note that `view_1` and `view_3` are not included in the `direct_objects_accessed` and `base_objects_accessed` columns
because neither of those views were included in the query and they are not the base object that serves as the source for the data in
`view_2`.

### Write

Consider the following scenario to understand a write operation and how the ACCESS_HISTORY view records this information:

* A data source: `base_table`
* Create a table from the data source (i.e. CTAS):

  ```sqlexample
  create table table_1 as select * from base_table;
  ```

In this example, Snowflake returns:

* `base_table` in the `base_objects_accessed` and `direct_objects_accessed` columns because the table was accessed directly
  and is the source of the data.
* `table_1` in the `objects_modified` column with the columns that were written to when creating the table.

### Supported operations

For a complete description of the read and write operations the ACCESS_HISTORY view supports, see the usage notes sections in the
[ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md).

### Multiple statements in a single request

Snowflake supports executing multiple statements simultaneously as a single request. How you track the request in the access history depends
on whether it was executed in Snowsight or programmatically.

* When you use Snowsight to execute multiple statements, it runs the queries one at a time and returns the
  `query_id` of the last executed query. You can find all executed statements and their return values in the ACCESS_HISTORY view.
* Features like the Snowflake Python connector or the Snowflake SQL API combine multiple SQL statements into a single request and return a
  single `query_id` for all of the statements. This number is actually a parent query id for all of the individual
  statements. To return the `query_id` of each statement that comprised the request, you must query the ACCESS_HISTORY view using the
  `parent_query_id`. For example, if the request returned `query_id = 6789`, then you can return the query ids of the individual
  statements by executing the following:

  ```sqlexample
  SELECT query_id, parent_query_id, direct_objects_accessed
  FROM snowflake.account_usage.access_history
  WHERE parent_query_id = 6789;
  ```

### Benefits

Access history in Snowflake provides the following benefits pertaining to read and write operations:

Data discovery:
:   Discover unused data to determine whether to archive or delete the data.

Track how sensitive data moves:
:   Track data movement from an external cloud storage location (e.g. Amazon S3 bucket) to the target Snowflake table, and vice versa.

    Track internal data movement from a Snowflake table to a different Snowflake table.

    After tracing the movement of sensitive data, apply policies ([masking](security-column-intro.md) and
    [row access](security-row-intro.md)) to protect data, update
    [access control settings](security-access-control-overview.md) to further regulate access to the stage and table, and set
    [tags](object-tagging/introduction.md) to ensure stages, tables, and columns with sensitive data can be tracked for compliance
    requirements.

Data validation:
:   The accuracy and integrity of reports, dashboards, and data visualization products such as charts and graphs are validated since the
    data can be traced to its original source.

    Data stewards can also notify users prior to dropping or altering a given table or view.

Compliance auditing:
:   Identify the Snowflake user who performed a write operation on a table or stage and when the write operation occurred to meet compliance
    regulations, such as [GDPR](https://gdpr-info.eu/) and [CCPA](https://oag.ca.gov/privacy/ccpa).

Enhance overall data governance:
:   The ACCESS_HISTORY view provides a unified picture of what data was accessed, when the data access took place, and how the accessed data
    moved from the data source object to the data target object.

## Column lineage

Column lineage (i.e. access history for columns) extends the Account Usage ACCESS_HISTORY view to specify how data flows from the source
column to the target column in a write operation. Snowflake tracks the data from the source columns through all subsequent table objects
that reference data from the source columns (e.g. INSERT, MERGE, CTAS) provided that objects in the lineage chain are not dropped.
Snowflake makes column lineage accessible by enhancing the `objects_modified` column in the ACCESS_HISTORY view.

Column lineage provides the following benefits:

Protect Derived Objects:
:   Data stewards can easily [tag](object-tagging/introduction.md) sensitive source columns without having to do additional work after
    creating derived objects (e.g. CTAS). Subsequently, the data steward can protect tables containing sensitive columns with a
    [row access policy](security-row-intro.md) or protect the sensitive columns themselves with either a
    [masking policy](security-column-intro.md) or a
    [tag-based masking policy](tag-based-masking-policies.md).

Sensitive Column Copy Frequency:
:   Data privacy officers can quickly determine the object count (e.g. 1 table, 2 views) of a column containing sensitive data. By knowing
    how many times a column with sensitive data appears in a table object, data privacy officers can prove how they satisfy regulatory
    compliance standards (e.g. to meet General Data Protection Regulation (GDPR) standards in the European Union).

Root Cause Analysis:
:   Column lineage provides a mechanism to trace the data to its source, which can help to pinpoint points of failure resulting from
    poor data quality and reduce the number of columns to analyze during the troubleshooting process.

For additional details about column lineage, see:

* Column lineage (in this topic)

## Masking and row access policy references

The POLICY_REFERENCED column specifies the object that has a row access policy set on a table or a masking policy set on a column,
including any intermediate objects that are protected by either a row access policy or a masking policy. Snowflake records the policy that
is enforced on the table or column.

Consider these objects:

`t1` » `v1` » `v2`

Where:

* `t1` is a base table.
* `v1` is a view built from the base table.
* `v2` is a view built from `v1`.

If the user queries `v2`, the `policies_referenced` column records either the row access policy that protects `v2`, each masking
policy that protects the columns in `v2`, or both kinds of policy as applicable. Additionally, this column records any masking or row
access policies that protect `t1` and `v1`.

These records can help data governors understand how their policy-protected objects are accessed.

The `policies_referenced` column provides additional benefits to the ACCESS_HISTORY view:

* Identify the policy-protected objects a user accesses in a given query.
* Simplify the policy audit process.

  Querying the ACCESS_HISTORY view eliminates the need for complex joins on other Account Usage views
  (e.g. [POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) and
  [QUERY_HISTORY](../sql-reference/account-usage/query_history.md)), to obtain information about the protected objects and protected
  columns a user accesses.

## Account-level vs. Organization-level access history

Administrators monitor access history at the account-level by querying the
[ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md) in the account’s ACCOUNT_USAGE schema. There
is no additional cost associated with the ACCOUNT_USAGE.ACCESS_HISTORY view.

The ACCESS_HISTORY view in the ORGANIZATION_USAGE schema gathers the access history of all of the accounts in an organization into a single
view to provide an organization-level access history. This ORGANIZATION_USAGE.ACCESS_HISTORY view is only found in the
[organization account](organization-accounts.md).

Organization-level access history in the ORGANIZATION_USAGE schema differs from access history in the ACCOUNT_USAGE schema in the
following ways:

Additional columns:
:   The ORGANIZATION_USAGE.ACCESS_HISTORY view in the organization account contains additional columns that provide insights related to
    [organizational listings](collaboration/listings/organizational/org-listing-about.md). These columns can be used to determine which of
    the data products attached to an organization listing were accessed by a consumer’s query, and whether those data products are protected
    by a policy such as a masking policy. For more information, see [Organizational listing governance](collaboration/listings/organizational/org-listing-governance.md).

Additional cost:
:   The ORGANIZATION_USAGE.ACCESS_HISTORY view in the organization account is a premium view that incurs the following costs:

    * Compute costs associated with the serverless tasks that populate the ACCESS_HISTORY view.
    * Storage costs associated with storing the data in the ACCESS_HISTORY view.

    For more information about these costs, see [Costs associated with premium views](organization-accounts-premium-views.md).

## Supported Objects

Use the following table to determine whether the ACCESS_HISTORY view contains a record when a SQL statement involves a specific type of object. SQL statements include the following:

* Data Manipulation Language (DML) statements. For example, statements used to insert data into a table.
* Data Query Language (DQL) statements. For example, statements that use a SELECT statement to project data.
* Data Definition Language (DDL) statements. For example, statements that create or alter a Snowflake object.

| Object | DML | DQL | DDL | Notes |
| --- | --- | --- | --- | --- |
| DATABASE | n/a | n/a | ✔ |  |
| DYNAMIC TABLE | Partial | ✔ | ✔ | Support for DML is only for the `ALTER DYNAMIC TABLE ... REFRESH` command. |
| EXTERNAL TABLE | ✔ | ✔ | ✔ |  |
| AGENT | n/a | n/a | ✔ | DDL for Cortex agent objects (for example CREATE AGENT, ALTER AGENT, and DROP AGENT). |
| FUNCTION | n/a | ✔ | ✔ | Support for DQL is limited to a function that appears in a SELECT statement. |
| ICEBERG TABLE | Partial | ✔ | ✔ | Full support (DML, DQL, DDL) for Snowflake-managed Apache Iceberg™ tables. Support for DQL and DDL only for externally managed Apache Iceberg™ tables. |
| LISTING | n/a | n/a | ✔ |  |
| MATERIALIZED VIEW | n/a | ✔ | ✔ |  |
| MCP SERVER | n/a | n/a | ✔ | DDL for MCP servers (for example CREATE MCP SERVER, ALTER MCP SERVER, and DROP MCP SERVER). |
| POLICY | n/a | ✔ | ✔ | Support for DDL shows when a policy is applied to an object and when policy metadata is queried via SHOW and DESCRIBE commands. Support for DQL shows the policies under enforcement when a query is run. |
| POSTGRES INSTANCE | n/a | n/a | ✔ | DDL for Postgres instances (for example CREATE POSTGRES INSTANCE, ALTER POSTGRES INSTANCE, and DROP POSTGRES INSTANCE). |
| PROCEDURE | n/a | ✔ | ✔ | A procedure can have multiple SQL statements with each statement generating a separate record. |
| ROLE | n/a | n/a | ✔ |  |
| SCHEMA | n/a | n/a | ✔ |  |
| SEQUENCE |  | n/a | ✔ | Non-support for DML is intentional. |
| SESSION | n/a | n/a | ✔ |  |
| SHARE | n/a | n/a | ✔ |  |
| STAGE | Partial |  | ✔ | Support for DML is limited to using the stage as the source for a table. For DQL, there is no support for queries against a stage. |
| STREAM | n/a | Partial | ✔ | Support for DQL is limited to using a stream as the source for a table. Support for DDL is limited to the create operation. |
| TABLE | ✔ | ✔ | ✔ |  |
| TAG | n/a | n/a | ✔ |  |
| VIEW | n/a | ✔ | ✔ |  |

## Querying the ACCESS_HISTORY View

The following sections provide example queries for the ACCESS_HISTORY view.

Note that some of the example queries filter on the `query_start_time` column to increase query performance. Another option to
increase performance is to query over narrower time ranges.

## Access history examples

### Read queries

The subsections below detail how to query the ACCESS_HISTORY view for read operations for the following use cases:

* Obtain the access history for a specific user.
* Facilitate compliance audits for sensitive data access in the last 30 days, based on `object_id` (e.g. a table id), to answer the
  following questions:

  + Who accessed the data?
  + When was the data accessed?
  + What columns were accessed?

#### Return the user access history

Return the user access history, ordered by user and query start time, starting from the most recent access.

> ```sqlexample
> SELECT user_name
>        , query_id
>        , query_start_time
>        , direct_objects_accessed
>        , base_objects_accessed
> FROM access_history
> ORDER BY 1, 3 desc
> ;
> ```

#### Facilitate compliance audits

The following examples help to facilitate compliance audits:

* Add the `object_id` value to determine who accessed a sensitive table in the last 30 days:

  ```sqlexample
  SELECT distinct user_name
  FROM access_history
       , lateral flatten(base_objects_accessed) f1
  WHERE f1.value:"objectId"::int=<fill_in_object_id>
  AND f1.value:"objectDomain"::string='Table'
  AND query_start_time >= dateadd('day', -30, current_timestamp())
  ;
  ```
* Using the `object_id` value of `32998411400350`, determine when the access occurred in the last 30 days:

  ```sqlexample
  SELECT query_id
         , query_start_time
  FROM access_history
       , lateral flatten(base_objects_accessed) f1
  WHERE f1.value:"objectId"::int=32998411400350
  AND f1.value:"objectDomain"::string='Table'
  AND query_start_time >= dateadd('day', -30, current_timestamp())
  ;
  ```
* Using the `object_id` value of `32998411400350`, determine which columns were accessed in the last 30 days:

  ```sqlexample
  SELECT distinct f4.value AS column_name
  FROM access_history
       , lateral flatten(base_objects_accessed) f1
       , lateral flatten(f1.value) f2
       , lateral flatten(f2.value) f3
       , lateral flatten(f3.value) f4
  WHERE f1.value:"objectId"::int=32998411400350
  AND f1.value:"objectDomain"::string='Table'
  AND f4.key='columnName'
  ;
  ```

### Write operations

The subsections below detail how to query the ACCESS_HISTORY view for write operations for the following use cases:

* Load data from a stage to a table.
* Unload data from a table to a stage.
* Use the PUT command to upload a local file to a stage.
* Use the GET command to retrieve data files from a stage to a local directory.
* Tracking sensitive stage data movement.

#### Load data from a stage to a table

Load a set of values from a data file in external cloud storage into columns in a target table.

> ```sqlexample
> copy into table1(col1, col2)
> from (select t.$1, t.$2 from @mystage1/data1.csv.gz);
> ```

The `direct_objects_accessed` and `base_objects_accessed` column specify that an external named stage was accessed:

> ```sqljson
> {
>   "objectDomain": STAGE
>   "objectName": "mystage1",
>   "objectId": 1,
>   "stageKind": "External Named"
> }
> ```

The `objects_modified` column specifies that data was written to two columns of the table:

> ```sqljson
> {
>   "columns": [
>      {
>        "columnName": "col1",
>        "columnId": 1
>      },
>      {
>        "columnName": "col2",
>        "columnId": 2
>      }
>   ],
>   "objectId": 1,
>   "objectName": "TEST_DB.TEST_SCHEMA.TABLE1",
>   "objectDomain": TABLE
> }
> ```

#### Unload data from a table to a stage

Unload a set of values from a Snowflake table into cloud storage.

> ```sqlexample
> copy into @mystage1/data1.csv
> from table1;
> ```

The `direct_objects_accessed` and `base_objects_accessed` columns specify the table columns that were
accessed:

> ```sqljson
> {
>   "objectDomain": TABLE
>   "objectName": "TEST_DB.TEST_SCHEMA.TABLE1",
>   "objectId": 123,
>   "columns": [
>      {
>        "columnName": "col1",
>        "columnId": 1
>      },
>      {
>        "columnName": "col2",
>        "columnId": 2
>      }
>   ]
> }
> ```

The `objects_modified` column specifies the stage to which the accessed data was written:

> ```sqljson
> {
>   "objectId": 1,
>   "objectName": "mystage1",
>   "objectDomain": STAGE,
>   "stageKind": "External Named"
> }
> ```

#### Use the PUT Command to upload a local file to a stage

Copy a data file to an internal (i.e. Snowflake) stage.

> ```sqlexample
> put file:///tmp/data/mydata.csv @my_int_stage;
> ```

The `direct_objects_accessed` and `base_objects_accessed` columns specify the local path to the file that was
accessed:

> ```sqljson
> {
>   "location": "file:///tmp/data/mydata.csv"
> }
> ```

The `objects_modified` column specifies the stage where the accessed data was written:

> ```sqljson
> {
>   "objectId": 1,
>   "objectName": "my_int_stage",
>   "objectDomain": STAGE,
>   "stageKind": "Internal Named"
> }
> ```

#### Use the GET command to retrieve data files from a stage to a local directory

Retrieve a data file from an internal stage to a directory on the local machine.

> ```sqlexample
> get @%mytable file:///tmp/data/;
> ```

The `direct_objects_accessed` and `base_objects_accessed` columns specify the stage and local directory that were
accessed:

> ```sqljson
> {
>   "objectDomain": Stage
>   "objectName": "mytable",
>   "objectId": 1,
>   "stageKind": "Table"
> }
> ```

The `objects_modified` column specifies the directory to which the accessed data was written:

> ```sqljson
> {
>   "location": "file:///tmp/data/"
> }
> ```

#### Tracking Sensitive stage data movement

Track sensitive stage data as it moves through a series of queries executed in chronological order.

Execute the following queries. Note that five of the statements access stage data. Therefore, when you query the ACCESS_HISTORY view for
stage access, the result set should include five rows.

> ```sqlexample
> use test_db.test_schema;
> create or replace table T1(content variant);
> insert into T1(content) select parse_json('{"name": "A", "id":1}');
>
> -- T1 -> T6
> insert into T6 select * from T1;
>
> -- S1 -> T1
> copy into T1 from @S1;
>
> -- T1 -> T2
> create table T2 as select content:"name" as name, content:"id" as id from T1;
>
> -- T1 -> S2
> copy into @S2 from T1;
>
> -- S1 -> T3
> create or replace table T3(customer_info variant);
> copy into T3 from @S1;
>
> -- T1 -> T4
> create or replace table T4(name string, id string, address string);
> insert into T4(name, id) select content:"name", content:"id" from T1;
>
> -- T6 -> T7
> create table T7 as select * from T6;
> ```
>
> Where:
>
> * `T1`, `T2` … `T7` specify the names of tables.
> * `S1` and `S2` specify the names of stages.

Query the access history to determine the access to stage `S1`.

> The data for the `direct_objects_accessed`, `base_objects_accessed`, and `objects_modified` columns are shown in the
> following table.
>
> | `direct_objects_accessed` | `base_objects_accessed` | `objects_modified` |
> | --- | --- | --- |
> | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68611,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66566,     "objectName": "TEST_DB.TEST_SCHEMA.T6"   } ] ``` |
> | ```sqljson [   {     "objectDomain": "Stage",     "objectId": 117,     "objectName": "TEST_DB.TEST_SCHEMA.S1",     "stageKind": "External Named"   } ] ``` | ```sqljson [   {     "objectDomain": "Stage",     "objectId": 117,     "objectName": "TEST_DB.TEST_SCHEMA.S1",     "stageKind": "External Named"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` |
> | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68613,         "columnName": "ID"       },       {         "columnId": 68612,         "columnName": "NAME"       }     ],     "objectDomain": "Table",     "objectId": 66568,     "objectName": "TEST_DB.TEST_SCHEMA.T2"   } ] ``` |
> | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "objectDomain": "Stage",     "objectId": 118,     "objectName": "TEST_DB.TEST_SCHEMA.S2",     "stageKind": "External Named"   } ] ``` |
> | ```sqljson [   {     "objectDomain": "Stage",     "objectId": 117,     "objectName": "TEST_DB.TEST_SCHEMA.S1",     "stageKind": "External Named"   } ] ``` | ```sqljson [   {     "objectDomain": "Stage",     "objectId": 117,     "objectName": "TEST_DB.TEST_SCHEMA.S1",     "stageKind": "External Named"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68614,         "columnName": "CUSTOMER_INFO"       }     ],     "objectDomain": "Table",     "objectId": 66570,     "objectName": "TEST_DB.TEST_SCHEMA.T3"   } ] ``` |
> | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "TEST_DB.TEST_SCHEMA.T1"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68615,         "columnName": "NAME"       },       {         "columnId": 68616,         "columnName": "ID"       }     ],     "objectDomain": "Table",     "objectId": 66572,     "objectName": "TEST_DB.TEST_SCHEMA.T4"   } ] ``` |
> | ```sqljson [   {     "columns": [       {         "columnId": 68611,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66566,     "objectName": "TEST_DB.TEST_SCHEMA.T6"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68611,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66566,     "objectName": "TEST_DB.TEST_SCHEMA.T6"   } ] ``` | ```sqljson [   {     "columns": [       {         "columnId": 68618,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66574,     "objectName": "TEST_DB.TEST_SCHEMA.T7"   } ] ``` |
>
> Note the following about the query example:
>
> * Uses a [recursive common table expression](queries-cte.md).
> * Uses a [JOIN](../sql-reference/constructs/join.md) construct rather than a
>   [USING clause](../sql-reference/account-usage/access_history.md).
>
>   ```sqlexample
>   with access_history_flatten as (
>       select
>           r.value:"objectId" as source_id,
>           r.value:"objectName" as source_name,
>           r.value:"objectDomain" as source_domain,
>           w.value:"objectId" as target_id,
>           w.value:"objectName" as target_name,
>           w.value:"objectDomain" as target_domain,
>           c.value:"columnName" as target_column,
>           t.query_start_time as query_start_time
>       from
>           (select * from TEST_DB.ACCOUNT_USAGE.ACCESS_HISTORY) t,
>           lateral flatten(input => t.BASE_OBJECTS_ACCESSED) r,
>           lateral flatten(input => t.OBJECTS_MODIFIED) w,
>           lateral flatten(input => w.value:"columns", outer => true) c
>           ),
>       sensitive_data_movements(path, target_id, target_name, target_domain, target_column, query_start_time)
>       as
>         -- Common Table Expression
>         (
>           -- Anchor Clause: Get the objects that access S1 directly
>           select
>               f.source_name || '-->' || f.target_name as path,
>               f.target_id,
>               f.target_name,
>               f.target_domain,
>               f.target_column,
>               f.query_start_time
>           from
>               access_history_flatten f
>           where
>           f.source_domain = 'Stage'
>           and f.source_name = 'TEST_DB.TEST_SCHEMA.S1'
>           and f.query_start_time >= dateadd(day, -30, date_trunc(day, current_date))
>           union all
>           -- Recursive Clause: Recursively get all the objects that access S1 indirectly
>           select sensitive_data_movements.path || '-->' || f.target_name as path, f.target_id, f.target_name, f.target_domain, f.target_column, f.query_start_time
>             from
>                access_history_flatten f
>               join sensitive_data_movements
>               on f.source_id = sensitive_data_movements.target_id
>                   and f.source_domain = sensitive_data_movements.target_domain
>                   and f.query_start_time >= sensitive_data_movements.query_start_time
>         )
>   select path, target_name, target_id, target_domain, array_agg(distinct target_column) as target_columns
>   from sensitive_data_movements
>   group by path, target_id, target_name, target_domain;
>   ```
>
> The query produces the following result set related to stage `S1` data movement:
>
> > | PATH | TARGET_NAME | TARGET_ID | TARGET_DOMAIN | TARGET_COLUMNS |
> > | --- | --- | --- | --- | --- |
> > | TEST_DB.TEST_SCHEMA.S1–>TEST_DB.TEST_SCHEMA.T1 | TEST_DB.TEST_SCHEMA.T1 | 66564 | Table | [“CONTENT”] |
> > | TEST_DB.TEST_SCHEMA.S1–>TEST_DB.TEST_SCHEMA.T1–>TEST_DB.TEST_SCHEMA.S2 | TEST_DB.TEST_SCHEMA.S2 | 118 | Stage | [] |
> > | TEST_DB.TEST_SCHEMA.S1–>TEST_DB.TEST_SCHEMA.T1–>TEST_DB.TEST_SCHEMA.T2 | TEST_DB.TEST_SCHEMA.T2 | 66568 | Table | [“NAME”,”ID”] |
> > | TEST_DB.TEST_SCHEMA.S1–>TEST_DB.TEST_SCHEMA.T1–>TEST_DB.TEST_SCHEMA.T4 | TEST_DB.TEST_SCHEMA.T4 | 66572 | Table | [“ID”,”NAME”] |
> > | TEST_DB.TEST_SCHEMA.S1–>TEST_DB.TEST_SCHEMA.T3 | TEST_DB.TEST_SCHEMA.T3 | 66570 | Table | [“CUSTOMER_INFO”] |

### Column lineage

The following example queries the ACCESS_HISTORY view and uses the [FLATTEN](../sql-reference/functions/flatten.md) function to flatten the
`objects_modified` column.

As a representative example, execute the following SQL query in your Snowflake account to produce the table below, where the numbered
comments indicate the following:

* `// 1`: Get the mapping between the `directSources` field and the target column.
* `// 2`: Get the mapping between the `baseSources` field and the target column.

```sqlexample
// 1

select
  directSources.value: "objectId" as source_object_id,
  directSources.value: "objectName" as source_object_name,
  directSources.value: "columnName" as source_column_name,
  'DIRECT' as source_column_type,
  om.value: "objectName" as target_object_name,
  columns_modified.value: "columnName" as target_column_name
from
  (
    select
      *
    from
      snowflake.account_usage.access_history
  ) t,
  lateral flatten(input => t.OBJECTS_MODIFIED) om,
  lateral flatten(input => om.value: "columns", outer => true) columns_modified,
  lateral flatten(
    input => columns_modified.value: "directSources",
    outer => true
  ) directSources

union

// 2

select
  baseSources.value: "objectId" as source_object_id,
  baseSources.value: "objectName" as source_object_name,
  baseSources.value: "columnName" as source_column_name,
  'BASE' as source_column_type,
  om.value: "objectName" as target_object_name,
  columns_modified.value: "columnName" as target_column_name
from
  (
    select
      *
    from
      snowflake.account_usage.access_history
  ) t,
  lateral flatten(input => t.OBJECTS_MODIFIED) om,
  lateral flatten(input => om.value: "columns", outer => true) columns_modified,
  lateral flatten(
    input => columns_modified.value: "baseSources",
    outer => true
  ) baseSources
;
```

Returns:

> | SOURCE_OBJECT_ID | SOURCE_OBJECT_NAME | SOURCE_COLUMN_NAME | SOURCE_COLUMN_TYPE | TARGET_OBJECT_NAME | TARGET_COLUMN_NAME |
> | --- | --- | --- | --- | --- | --- |
> | 1 | D.S.T0 | NAME | BASE | D.S.T1 | NAME |
> | 2 | D.S.V1 | NAME | DIRECT | D.S.T1 | NAME |

### Track row access policy references

Return a row for each instance when a row access policy is set on a table, view, or materialized view without duplicates:

> ```sqlexample
> use role accountadmin;
> select distinct
>     obj_policy.value:"policyName"::VARCHAR as policy_name
> from snowflake.account_usage.access_history as ah
>     , lateral flatten(ah.policies_referenced) as obj
>     , lateral flatten(obj.value:"policies") as obj_policy
> ;
> ```

### Track masking policy references

Return a row for each instance when a masking policy protects a column without duplicates. Note that additional flattening is necessary
because the `policies_referenced` column specifies the masking policy on a column one level deeper than the row access policy on a
table:

> ```sqlexample
> use role accountadmin;
> select distinct
>     policies.value:"policyName"::VARCHAR as policy_name
> from snowflake.account_usage.access_history as ah
>     , lateral flatten(ah.policies_referenced) as obj
>     , lateral flatten(obj.value:"columns") as columns
>     , lateral flatten(columns.value:"policies") as policies
> ;
> ```

### Track the enforced policy in a query

Return the time when the policy was updated (POLICY_CHANGED_TIME) and the policy conditions (POLICY_BODY) for a given query in a given time
frame.

Prior to using this query, update the WHERE clause input values:

```sqlexample
where query_start_time > '2023-07-07' and
   query_start_time < '2023-07-08' and
   query_id = '01ad7987-0606-6e2c-0001-dd20f12a9777')
```

Where:

`query_start_time > '2023-07-07'`
:   Specifies the beginning timestamp.

`query_start_time < '2023-07-08'`
:   Specifies the end timestamp.

`query_id = '01ad7987-0606-6e2c-0001-dd20f12a9777'`
:   Specifies the query identifier in the Account Usage ACCESS_HISTORY view.

Run the query:

```sqlexample
SELECT *
from(
  select j1.*,j2.QUERY_START_TIME as POLICY_CHANGED_TIME, POLICY_BODY
from
(
  select distinct t1.*,
      t4.value:"policyId"::number as PID
  from (select *
      from SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY
      where query_start_time > '2023-07-07' and
         query_start_time < '2023-07-08' and
         query_id = '01ad7987-0606-6e2c-0001-dd20f12a9777') as t1, //
  lateral flatten (input => t1.POLICIES_REFERENCED,OUTER => TRUE) t2,
  lateral flatten (input => t2.value:"columns", OUTER => TRUE) t3,
  lateral flatten (input => t3.value:"policies",OUTER => TRUE) t4
) as j1
left join
(
  select OBJECT_MODIFIED_BY_DDL:"objectId"::number as PID,
      QUERY_START_TIME,
      OBJECT_MODIFIED_BY_DDL:"properties"."policyBody"."value" as POLICY_BODY
      from SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY
      where OBJECT_MODIFIED_BY_DDL is not null and
      (OBJECT_MODIFIED_BY_DDL:"objectDomain" ilike '%masking%' or OBJECT_MODIFIED_BY_DDL:"objectDomain" ilike '%row%')
) as j2
On j1.POLICIES_REFERENCED is not null and j1.pid = j2.pid and j1.QUERY_START_TIME>j2.QUERY_START_TIME) as j3
QUALIFY ROW_NUMBER() OVER (PARTITION BY query_id,pid ORDER BY policy_changed_time DESC) = 1;
```

### UDFs

These UDF examples show how the Account Usage ACCESS_HISTORY view records:

* Calling a UDF named `get_product`.
* Insert the product of calling the `get_product` function into a table named
  `mydb.tables.t1`.
* Shared UDFs.

#### Call a UDF

Consider the following SQL UDF that calculates the product of two numbers and assume it is stored in the schema named `mydb.udfs`:

> ```sqlexample
> CREATE FUNCTION MYDB.UDFS.GET_PRODUCT(num1 number, num2 number)
> RETURNS number
> AS
> $$
>     NUM1 * NUM2
> $$
> ;
> ```

[Calling](../sql-reference/sql/call.md) `get_product` directly results in recording the UDF details in the
`direct_objects_accessed` column:

> ```sqljson
> [
>   {
>     "objectDomain": "FUNCTION",
>     "objectName": "MYDB.UDFS.GET_PRODUCT",
>     "objectId": "2",
>     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
>     "dataType": "NUMBER(38,0)"
>   }
> ]
> ```

This example is analogous to calling a stored procedure (in this topic).

#### UDF with INSERT DML

Consider the following [INSERT](../sql-reference/sql/insert.md) statement to update the columns named 1 and 2 in the table named `mydb.tables.t1`:

> ```sqlexample
> insert into t1(product)
> select get_product(c1, c2) from mydb.tables.t1;
> ```

The ACCESS_HISTORY view records the `get_product` function in the:

* `direct_objects_accessed` column because the function is explicitly named in the SQL statement, and
* `objects_modified` column in the `directSources` array because the function is the source of the values that are inserted into
  the columns.

Similarly, the table `t1` is recorded in these same columns:

> | `direct_objects_accessed` | `objects_modified` |
> | --- | --- |
> | ```sqljson [   {     "objectDomain": "FUNCTION",     "objectName": "MYDB.UDFS.GET_PRODUCT",     "objectId": "2",     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",     "dataType": "NUMBER(38,0)"   },   {     "objectDomain": "TABLE",     "objectName": "MYDB.TABLES.T1",     "objectId": 1,     "columns":     [       {         "columnName": "c1",         "columnId": 1       },       {         "columnName": "c2",         "columnId": 2       }     ]   } ] ``` | ```sqljson  [    {      "objectDomain": "TABLE",      "objectName": "MYDB.TABLES.T1",      "objectId": 2,      "columns":      [        {          "columnId": "product",          "columnName": "201",          "directSourceColumns":          [            {              "objectDomain": "Table",              "objectName": "MYDB.TABLES.T1",              "objectId": "1",              "columnName": "c1"            },            {              "objectDomain": "Table",              "objectName": "MYDB.TABLES.T1",              "objectId": "1",              "columnName": "c2"            },            {              "objectDomain": "FUNCTION",              "objectName": "MYDB.UDFS.GET_PRODUCT",              "objectId": "2",              "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",              "dataType": "NUMBER(38,0)"            }          ],          "baseSourceColumns":[]        }      ]    } ] ``` |

#### Shared UDFs

Shared UDFs can be referenced directly or indirectly:

* A direct reference is the same as calling the UDF explicitly (in this topic) but results
  in the UDF being recorded in both the `base_objects_accessed` and `direct_objects_accessed` columns.
* An example of an indirect reference is calling the UDF to create a view:

  > ```sqlexample
  > create view v as
  > select get_product(c1, c2) as vc from t;
  > ```

  The `base_objects_accessed` column records the UDF and the table.

  The `direct_objects_accessed` column records the view.

### Tracking objects modified by a DDL operation

#### Create a tag with ALLOWED_VALUES

Create the tag:

> ```sqlexample
> create tag governance.tags.pii allowed_values 'sensitive','public';
> ```

Column value:

> ```sqljson
> {
>   "objectDomain": "TAG",
>   "objectName": "governance.tags.pii",
>   "objectId": "1",
>   "operationType": "CREATE",
>   "properties": {
>     "allowedValues": {
>       "sensitive": {
>         "subOperationType": "ADD"
>       },
>       "public": {
>         "subOperationType": "ADD"
>       }
>     }
>   }
> }
> ```

> **Note:**
>
> If you do not specify allowed values when creating the tag, the `properties` field is an empty array (i.e. `{}`).

#### Create a table with a tag and masking policy

Create the table with a masking policy on the column, a tag on the column, and a tag on the table:

> ```sqlexample
> create or replace table hr.data.user_info(
>   email string
>     with masking policy governance.policies.email_mask
>     with tag (governance.tags.pii = 'sensitive')
>   )
> with tag (governance.tags.pii = 'sensitive');
> ```

Column value:

> ```sqljson
> {
>   "objectDomain": "TABLE",
>   "objectName": "hr.data.user_info",
>   "objectId": "1",
>   "operationType": "CREATE",
>   "properties": {
>     "tags": {
>       "governance.tags.pii": {
>         "subOperationType": "ADD",
>         "objectId": {
>           "value": "1"
>         },
>         "tagValue": {
>           "value": "sensitive"
>         }
>       }
>     },
>     "columns": {
>       "email": {
>         objectId: {
>           "value": 1
>         },
>         "subOperationType": "ADD",
>         "tags": {
>           "governance.tags.pii": {
>             "subOperationType": "ADD",
>             "objectId": {
>               "value": "1"
>             },
>             "tagValue": {
>               "value": "sensitive"
>             }
>           }
>         },
>         "maskingPolicies": {
>           "governance.policies.email_mask": {
>             "subOperationType": "ADD",
>             "objectId": {
>               "value": 2
>             }
>           }
>         }
>       }
>     }
>   }
> }
> ```

#### Set a masking policy on a tag

Set a masking policy on the tag (i.e. [tag-based masking](tag-based-masking-policies.md)):

> ```sqlexample
> alter tag governance.tags.pii set masking policy governance.policies.email_mask;
> ```

Column value:

> ```sqljson
> {
>   "objectDomain": "TAG",
>   "objectName": "governance.tags.pii",
>   "objectId": "1",
>   "operationType": "ALTER",
>   "properties": {
>     "maskingPolicies": {
>       "governance.policies.email_mask": {
>         "subOperationType": "ADD",
>         "objectId": {
>           "value": 2
>         }
>       }
>     }
>   }
> }
> ```

#### Swap a table

Swap the table named `t2` with the table named `t3`:

> ```sqlexample
> alter table governance.tables.t2 swap with governance.tables.t3;
> ```

Note the two different records in the view.

Record 1:

> ```sqljson
> {
>   "objectDomain": "Table",
>   "objectId": 0,
>   "objectName": "GOVERNANCE.TABLES.T2",
>   "operationType": "ALTER",
>   "properties": {
>     "swapTargetDomain": {
>       "value": "Table"
>     },
>     "swapTargetId": {
>       "value": 0
>     },
>     "swapTargetName": {
>       "value": "GOVERNANCE.TABLES.T3"
>     }
>   }
> }
> ```

Record 2:

> ```sqljson
> {
>   "objectDomain": "Table",
>   "objectId": 0,
>   "objectName": "GOVERNANCE.TABLES.T3",
>   "operationType": "ALTER",
>   "properties": {
>     "swapTargetDomain": {
>       "value": "Table"
>     },
>     "swapTargetId": {
>       "value": 0
>     },
>     "swapTargetName": {
>       "value": "GOVERNANCE.TABLES.T2"
>     }
>   }
> }
> ```

#### Drop a masking policy

Drop the masking policy:

> ```sqlexample
> drop masking policy governance.policies.email_mask;
> ```

Column value:

> ```sqljson
> {
>   "objectDomain" : "MASKING_POLICY",
>   "objectName": "governance.policies.email_mask",
>   "objectId" : "1",
>   "operationType": "DROP",
>   "properties" : {}
> }
> ```
>
> > **Note:**
> >
> > The column value is representative and applies to a DROP operation on a tag and row access policy.
> >
> > The `properties` field is an empty array and does not provide any information on the policy prior to the DROP operation.

#### Track tag references on a column

Query the `object_modified_by_ddl` column to monitor how a tag is set on a column.

As the table administrator, set a tag on a column, unset the tag, and update the tag with a different string value:

> ```sqlexample
> alter table hr.tables.empl_info
>   alter column email set tag governance.tags.test_tag = 'test';
>
> alter table hr.tables.empl_info
>   alter column email unset tag governance.tags.test_tag;
>
> alter table hr.tables.empl_info
>   alter column email set tag governance.tags.data_category = 'sensitive';
> ```

As the data engineer, change the tag value:

> ```sqlexample
> alter table hr.tables.empl_info
>   alter column email set tag governance.tags.data_category = 'public';
> ```

Query the ACCESS_HISTORY view to monitor the changes:

> ```sqlexample
> select
>   query_start_time,
>   user_name,
>   object_modified_by_ddl:"objectName"::string as table_name,
>   'EMAIL' as column_name,
>   tag_history.value:"subOperationType"::string as operation,
>   tag_history.key as tag_name,
>   nvl((tag_history.value:"tagValue"."value")::string, '') as value
> from
>   TEST_DB.ACCOUNT_USAGE.access_history ah,
>   lateral flatten(input => ah.OBJECT_MODIFIED_BY_DDL:"properties"."columns"."EMAIL"."tags") tag_history
> where true
>   and object_modified_by_ddl:"objectDomain" = 'Table'
>   and object_modified_by_ddl:"objectName" = 'TEST_DB.TEST_SH.T'
> order by query_start_time asc;
> ```

Returns:

> ```output
> +-----------------------------------+---------------+---------------------+-------------+-----------+-------------------------------+-----------+
> | QUERY_START_TIME                  | USER_NAME     | TABLE_NAME          | COLUMN_NAME | OPERATION | TAG_NAME                      | VALUE     |
> +-----------------------------------+---------------+---------------------+-------------+-----------+-------------------------------+-----------+
> | Mon, Feb. 14, 2023 12:01:01 -0600 | TABLE_ADMIN   | HR.TABLES.EMPL_INFO | EMAIL       | ADD       | GOVERNANCE.TAGS.TEST_TAG      | test      |
> | Mon, Feb. 14, 2023 12:02:01 -0600 | TABLE_ADMIN   | HR.TABLES.EMPL_INFO | EMAIL       | DROP      | GOVERNANCE.TAGS.TEST_TAG      |           |
> | Mon, Feb. 14, 2023 12:03:01 -0600 | TABLE_ADMIN   | HR.TABLES.EMPL_INFO | EMAIL       | ADD       | GOVERNANCE.TAGS.DATA_CATEGORY | sensitive |
> | Mon, Feb. 14, 2023 12:04:01 -0600 | DATA_ENGINEER | HR.TABLES.EMPL_INFO | EMAIL       | ADD       | GOVERNANCE.TAGS.DATA_CATEGORY | public    |
> +-----------------------------------+---------------+---------------------+-------------+-----------+-------------------------------+-----------+
> ```

### Call a stored procedure

Consider the following stored procedure and assume it is stored in the schema named `mydb.procedures`:

> ```sqlexample
> create or replace procedure get_id_value(name string)
> returns string not null
> language javascript
> as
> $$
>   var my_sql_command = "select id from A where name = '" + NAME + "'";
>   var statement = snowflake.createStatement( {sqlText: my_sql_command} );
>   var result = statement.execute();
>   result.next();
>   return result.getColumnValue(1);
> $$
> ;
> ```

[Calling](../sql-reference/sql/call.md) `my_procedure` directly results in recording the procedure details in both the
`direct_objects_accessed` and `base_objects_accessed` columns as follows:

> ```sqljson
> [
>   {
>     "objectDomain": "PROCEDURE",
>     "objectName": "MYDB.PROCEDURES.GET_ID_VALUE",
>     "argumentSignature": "(NAME STRING)",
>     "dataType": "STRING"
>   }
> ]
> ```

This example is analogous to calling a UDF (in this topic).

### Ancestor queries with stored procedures

You can use the `parent_query_id` and `root_query_id` columns to understand how stored procedure calls relate to each other.

Suppose that you have three different stored procedure statements and you run them in the following order:

> ```sqlexample
> CREATE OR REPLACE PROCEDURE myproc_child()
> RETURNS INTEGER
> LANGUAGE SQL
> AS
> $$
>   BEGIN
>   SELECT * FROM mydb.mysch.mytable;
>   RETURN 1;
>   END
> $$;
>
> CREATE OR REPLACE PROCEDURE myproc_parent()
> RETURNS INTEGER
> LANGUAGE SQL
> AS
> $$
>   BEGIN
>   CALL myproc_child();
>   RETURN 1;
>   END
> $$;
>
> CALL myproc_parent();
> ```

A query on the ACCESS_HISTORY view records the information as follows:

> ```sqlexample
> SELECT
>   query_id,
>   parent_query_id,
>   root_query_id,
>   direct_objects_accessed
> FROM
>   SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY;
> ```
>
> ```output
> +----------+-----------------+---------------+-----------------------------------+
> | QUERY_ID | PARENT_QUERY_ID | ROOT_QUERY_ID | DIRECT_OBJECTS_ACCESSED           |
> +----------+-----------------+---------------+-----------------------------------+
> |  1       | NULL            | NULL          | [{"objectName": "myproc_parent"}] |
> |  2       | 1               | 1             | [{"objectName": "myproc_child"}]  |
> |  3       | 2               | 1             | [{"objectName": "mytable"}]       |
> +----------+-----------------+---------------+-----------------------------------+
> ```

* The first row corresponds to calling the second procedure named `myproc_parent` as shown in the `direct_objects_accessed`
  column.

  The `parent_query_id` and `root_query_id` columns return NULL because you called this stored procedure directly.
* The second row corresponds to the query that calls the first procedure named `myproc_child` as shown in the
  `direct_objects_accessed column`.

  The `parent_query_id` and `root_query_id` columns return the same query ID because the query calling `myproc_child` was
  initiated by the query calling `myproc_parent`, which you called directly.
* The third row corresponds to the query that accessed the table named `mytable` in the `myproc_child` procedure as shown in
  the `direct_objects_accessed` column.

  The `parent_query_id` column returns the query ID of the query that accessed `mytable`, which corresponds to calling
  `myproc_child`. That stored procedure was initiated by the query calling `myproc_parent`, which is shown in the
  `root_query_id` column.

### Sequence

Consider the following SQL statement that creates a sequence:

```sqlexample
CREATE SEQUENCE SEQ
  START = 2
  INCREMENT = 7
  COMMENT = 'Comment on sequence';
```

Creating this sequence results in the following entry in the access history:

```JSON
{
  "objectDomain": "Sequence",
  "objectId": 1,
  "objectName": "TEST_DB.TEST_SCHEMA.SEQ",
  "operationType": "CREATE",
  "properties": {
    "start": {
      "value": "2"
    },
    "increment": {
        "value": "7"
    },
    "comment": {
          "value": "Comment on Sequence"
    }
  }
}
```

### Join

A join in a query shows up in the access history as a `joinObject` in the `direct_accessed_objects` column. The `joinObject` does
not appear in other columns because access history only tracks joins that are explicitly mentioned in the query.

For example, consider the following query that joins table `t1` with table `t2`:

```sqlexample
CREATE OR REPLACE VIEW v1 (vc1, vc2) AS
  SELECT
    t1.c1 AS vc1,
    t2.c2 AS vc2
  FROM t1 LEFT OUTER JOIN t2
    ON t1.c2 = t2.c1;
```

Executing this query results in the following appearing for the `t1` object in the `direct_accessed_objects` column:

```JSON
{
  "columns": [
    {
      "columnId": 0,
      "columnName": "C1"
    },
    {
      "columnId": 0,
      "columnName": "C2"
    }
  ],
  "joinObjects": [
    {
      "joinType": "LEFT_OUTER_JOIN",
      "node": {
        "objectDomain": "Table",
        "objectId": 0,
        "objectName": "DB1.SCH.T2"
      }
    }
  ],
  "objectDomain": "Table",
  "objectId": 0,
  "objectName": "DB1.SCH.T1"
}
```

> **Note:**
>
> In this example, access history wouldn’t contain a `joinObject` for the `t2` object because it would be redundant to the information
> provided by the `joinObject` for table `t1`.

---
title: Account identifiers
source: https://docs.snowflake.com/en/user-guide/admin-account-identifier.md
section: User Guide
---

# Account identifiers

An account identifier uniquely identifies a Snowflake account within your [organization](organizations.md), as well as
throughout the global network of Snowflake-supported [cloud platforms](intro-cloud-platforms.md) and
[cloud regions](intro-regions.md).

The preferred account identifier consists of the *name* of the account prefixed by its organization; for example, `myorg-account123`. You
can also use the Snowflake-assigned *locator* as the account identifier; however, the use of this legacy format is *not recommended*.

## Requirements for account identifiers

> > **Important:**
> >
> > To prevent DNS resolution failures, an account identifier should meet all the following requirements:
> >
> > * Must be unique within an organization, regardless of which Snowflake region the account is in.
> > * Must start with an alphabetic character and can’t contain spaces or special characters *except for* underscores (`_`).
> > * Shouldn’t end with `_`.
> > * If the account name includes `_`, features that don’t accept account names with `_`, such as Okta SSO or SCIM, can reference a version
> >   of the account identifier that substitutes a hyphen (`-`) for each `_` character.
> > * The account identifier string length, including the organization name, account name, and `-` characters, shouldn’t exceed 63 characters.
> > * Names should comply with the “Letter, Digit, Hyphen” (LDH) rule defined in [RFC 952](https://datatracker.ietf.org/doc/html/rfc952).

## Where are account identifiers used?

Account identifiers are required in Snowflake wherever you need to specify the account you are using, including:

* URLs for accessing any of the Snowflake web interfaces.
* Snowflake CLI, SnowSQL, and other clients (such as connectors and drivers) for connecting to Snowflake.
* Third-party applications and services that comprise the Snowflake ecosystem.
* Security features for protecting Snowflake internal operations and communication/interaction with external systems.
* Global features such as [Secure Data Sharing](data-sharing-intro.md) and [Replication and Failover/Failback](replication-intro.md).

For example, the URL for an account uses the following format:

`account_identifier.snowflakecomputing.com`

If your organization uses the [Client Redirect](client-redirect.md) feature, the name of a
[connection object](client-redirect.md) can be used in place of the account name in the account identifier
to connect to a Snowflake account using a Snowflake client. For more information, see [Using a connection URL](client-redirect.md).

For more information about using account identifiers and connections to connect to a Snowflake account, see
[Connecting to your accounts](organizations-connect.md).

## Format 1 (preferred): Account name in your organization

An [organization](organizations.md) is a Snowflake object that links the accounts owned by your business
entity. [Organization administrators](organization-administrators.md) view, create, and manage all of your
accounts across different cloud platforms and regions.

Account names must be unique within your organization, and can be changed, which allows more flexibility and leads to shorter and more
intuitive account names. You specify an account name when you create a new account
(see [Creating an account](organizations-manage-accounts-create.md)). To change a name
for an existing account, see [Renaming an account](organizations-manage-accounts-rename.md).

While an account name uniquely identifies an account within your organization, it is *not* a unique identifier of an account
across Snowflake organizations.

Account names with underscores also have a dashed version of the URL for features that don’t accept URLs with underscores, such as
Okta SSO/SCIM.

The next sections explain the format to use and how to find your account identifier:

* Finding the organization and account name for an account
* Understanding the format to use for the identifier
* Organization and account names

### Finding the organization and account name for an account

To find the organization and account name for an account, you can use Snowsight or SQL.

Snowsight:
:   1. Open the account selector and review the list of accounts that you previously signed in to.
    2. Select View account details.

       The Account Details dialog displays information about the account, including the account identifier and the account URL.

    The following table lists some examples of getting the different forms of the account identifier:

    | Use case | Instructions |
    | --- | --- |
    | Get the data sharing account identifier (for example, if a provider wants to share a private listing with you). | Copy the value in the Data Sharing Account Identifier field. |
    | Get the Snowflake account URL for configuring a third-party tool (such as Tableau or PowerBI) to connect to Snowflake. | Copy the value in the Account/Server URL field.  See [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md). |
    | Set up a configuration file for a client (such as [Snowflake CLI](../developer-guide/snowflake-cli/index.md) or [SnowSQL](snowsql.md)). | Select the Config File tab.  See [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md). |
    | Configure a driver (such as the [ODBC](../developer-guide/odbc/odbc.md) or [JDBC](../developer-guide/jdbc/jdbc.md) driver) or library. | Select the Connectors/Drivers tab.  See [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md). |

SQL:
:   In the Account Details dialog in Snowsight, you can select the
    SQL Commands tab to find and copy the SQL statement that returns the account identifier.

    If you want to construct the account identifier yourself:

    * To retrieve the organization of the current account, call the [CURRENT_ORGANIZATION_NAME](../sql-reference/functions/current_organization_name.md) function.
    * To retrieve the name of the current account, call the [CURRENT_ACCOUNT_NAME](../sql-reference/functions/current_account_name.md) function.

    For details on the format to use for the identifier, see Understanding the format to use for the identifier.

    For example, to get the account identifier for configuring a client, driver, or library to connect to Snowflake, run:

    ```sqlexample
    SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();
    ```

### Understanding the format to use for the identifier

The account identifier for an account in your organization takes one of the following forms, depending on where and how you use
the identifier:

* Specifying the account name when connecting to Snowflake
* Specifying the fully qualified account name in a SQL statement
* Providing your data sharing account identifier

#### Specifying the account name when connecting to Snowflake

The following table lists some of the commonly used forms of the account identifier, based on the use case:

| Use cases | Format to use |
| --- | --- |
| Using a URL to [sign in to Snowsight](ui-snowsight-gs.md). | `orgname-account_name.snowflakecomputing.com` |
| Specifying the Snowflake account URL when configuring a third-party tool (such as Tableau or PowerBI) to [connect to Snowflake](gen-conn-config.md). | `orgname-account_name.snowflakecomputing.com` |
| Specifying the Snowflake account when configuring a client, driver, or library to [connect to Snowflake](gen-conn-config.md):   * Specifying the account in a configuration file for a client (such as   [Snowflake CLI](../developer-guide/snowflake-cli/index.md) or [SnowSQL](snowsql.md)) to   [connect to Snowflake](gen-conn-config.md). * Specifying the when configuring a driver (such as the [ODBC](../developer-guide/odbc/odbc.md) or   [JDBC](../developer-guide/jdbc/jdbc.md) driver) or library to   [connect to Snowflake](gen-conn-config.md). | `orgname-account_name` |

Where:

* `orgname` is the name of your Snowflake organization.
* `account_name` is the unique name of your account within your organization.

To get the account identifier in the correct format for clients, drivers, libraries, and third-pary applications, you can use
Snowsight. For more information, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md).

> **Note:**
>
> For scenarios/features where underscores in an account name are not supported, use hyphens instead of underscores.

For example, in a [configuration file for Snowflake CLI](../developer-guide/snowflake-cli/connecting/configure-cli.md), if your
organization is `myorganization` and your account is `myaccount`, set `account` to:

```toml
[connections]
[connections.myconnection]
account = "myorganization-myaccount"
```

#### Specifying the fully qualified account name in a SQL statement

In a SQL statement, when specifying the fully qualified account name, use a period between the
organization name and account name:

> `orgname.account_name`

#### Providing your data sharing account identifier

When a [provider](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about) plans to share a [private listing](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about) with you, the provider will ask you for your account identifier. This is
referred to as the *data sharing account identifier*.

Specify your account identifier in the following format, using a period between the
organization name and account name:

> `orgname.account_name`

### Organization and account names

#### Organization name

For users who sign up for a Snowflake account using the self-service option, an organization is automatically created with a
system-generated name when the account is created. For entities who work directly with Snowflake personnel to set up accounts,
Snowflake can assign the organization a custom name. This custom name must be unique across all other organizations in Snowflake.
The name must start with a letter and can only contain letters (lowercase and uppercase) and numbers. The name can’t contain underscores
or other delimiters.

If you want to change the name of an organization, for example to change a system-generated name to a more user-friendly one,
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

As a best practice, review and change your organization name, if needed, before using the name in any account identifiers. Renaming
the organization name in the future will result in changing all the URLs for your Snowflake accounts to match the new name.

To view the name of your organization, see [Viewing the name of your organization and its accounts](organizations.md).

#### Account name

Each account name must be unique within your organization. You specify an account name when you create the account (see [Creating an account](organizations-manage-accounts-create.md)).

While an account name uniquely identifies an account within your organization, it is *not* a unique identifier of an account across
Snowflake organizations. To uniquely identify an account in Snowflake, you must prepend your organization name to the account name. For
example:

`orgname-account_name`

Consistent with SQL standards for identifiers, account names can include underscores as separators between words, such as `MARKETING_TEST_ACCOUNT`.

URLs that include underscores can sometimes cause issues for certain features, such as Okta SSO/SCIM. For this reason, Snowflake also
supports a version of the account name that substitutes the hyphen character (`-`) in place of the underscore character. For example
both of the following URLs are supported:

> URL with underscores: `https://acme-marketing_test_account.snowflakecomputing.com`
>
> URL with dashes: `https://acme-marketing-test-account.snowflakecomputing.com`

#### Existing accounts

If you have any accounts that existed before the Organizations feature was enabled, the Format 2: Account locator in a region is used as the
account name.

In addition, if you have existing accounts with the same name in different regions, the cloud and region names are appended to the
account name in the new URL format.

For example, if your organization name is `ACME`, and there are two accounts named `TEST`, one in the AWS `us-east-2` region
and the other in the Azure `west-us-2` region, the new URLs will use the following structure:

* First account:

  Original URL:
  :   `https://test.us-east-2.aws.snowflakecomputing.com`

  New URL:
  :   `https://acme-test_aws_us_east_2.snowflakecomputing.com`
* Second account:

  Original URL:
  :   `https://test.west-us-2.azure.snowflakecomputing.com`

  New URL:
  :   `https://acme-test_azure_west_us_2.snowflakecomputing.com`

These account names can be changed as long as the new names are unique. For instructions on how to change an account name, see [Renaming an account](organizations-manage-accounts-rename.md).

## Format 2: Account locator in a region

An account locator is an identifier assigned by Snowflake when the account is created:

* If the account is created by a Snowflake representative, you may be able to request a specific value for the locator, such as a
  company name, acronym, or other recognizable string.
* If the account is created through self-service or an automated/background process, the locator is a random string of unique characters
  and numbers, such as `xy12345`).

The locator for an account can’t be changed once the account is created.

> **Note:**
>
> Account locators continue to be supported for identifying accounts in Snowflake, but this is no longer the preferred method. The
> preferred method for identifying accounts is now the account name within your organization (as described earlier in this topic).

The next sections explain the format to use:

* Using an account locator as an identifier
* Finding the region and locator for an account
* Finding the account locator format for a VPS account
* Non-VPS account locator formats by cloud platform and region

### Using an account locator as an identifier

Each Snowflake account is hosted on a [cloud platform](intro-cloud-platforms.md) in a geographical
[region](intro-regions.md).

The region determines where the data in the account is stored and where the compute resources used by the account are provisioned.

When using an account locator to identify an account, the locator by itself is not always sufficient to identify the account. Depending
on the region and cloud platform for the account, *additional* segments may be required, in the form of:

`account_locator.cloud_region_id` *or*

`account_locator.cloud_region_id.cloud` *or*

`account_locator.gov_compliance.cloud_region_id.cloud`

Where:

* `cloud_region_id` is the identifier for the cloud region (dictated by the cloud platform).
* `cloud` is the identifier for the cloud platform (`aws`, `azure`, or `gcp`).
* `compliance` is for SnowGov regions only and specifies the level of U.S. government compliance supported by the region
  (`fhplus` or `dod`).

For example, if your account locator is `xy12345`:

* If the account is located in the AWS US West (Oregon) region, no additional segments are required and the URL would be
  `xy12345.snowflakecomputing.com`.
* If the account is located in the AWS US East (Ohio) region, additional segments are required and the URL would be
  `xy12345.us-east-2.aws.snowflakecomputing.com`.

For a complete list of regions and locator formats, see Non-VPS Account Locator Formats by Cloud Platform and Region (in this topic).

> **Note:**
>
> If your Snowflake Edition is [VPS](intro-editions.md), the account locator uses a different format. See
> Finding the account locator format for a VPS account (in this topic).

### Finding the region and locator for an account

To find the region and locator for an account, you can use Snowsight or SQL.

Snowsight:
:   1. Open the account selector and review the list of accounts that you previously signed in to.
    2. Select View account details.

       The Account Details dialog displays information about the account, including the account identifier and the account URL.

    You can copy the full account locator from the Full Account Locator field.

SQL:
:   If you can connect to your Snowflake account, call the following context functions to identify the region and account locator
    for the Snowflake account you are connected to:

    * Call [CURRENT_REGION](../sql-reference/functions/current_region.md) to retrieve the region in which your account is located.
    * Call [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) to retrieve the account locator.

If you are unable to connect to Snowflake, contact the Snowflake administrator for your account to retrieve this information.

### Finding the account locator format for a VPS account

If your Snowflake Edition is [VPS](intro-editions.md), the account locator format uses different
naming conventions than the accounts for other Snowflake Editions. This results in a different structure for the hostnames and URLs used to
access VPS accounts.

For details, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) or your Snowflake representative.

As an alternative, you can use the preferred format of `organization_name-account_name` as your account identifier. This
format works for accounts that use the VPS edition. For details, see Format 1 (preferred): Account name in your organization (in this topic).

### Non-VPS account locator formats by cloud platform and region

The following table lists the account locator formats across all the supported non-VPS regions, including whether the account locator
for a given region requires additional segments:

If your account locator is `xy12345`:

| Cloud Platform / Region | Account Identifier | Notes |
| --- | --- | --- |
| **Amazon Web Services (AWS)** |  |  |
| US West (Oregon) | `xy12345` | No additional segments required. |
| US West (Commercial Gov - Oregon) | `xy12345.us-west-2-gov.aws` |  |
| US Gov West 1 (FedRAMP High Plus) | `xy12345.fhplus.us-gov-west-1.aws` | Additional `fhplus` segment required after the account locator. |
| US Gov West 1 (DoD) | `xy12345.dod.us-gov-west-1.aws` | Additional `dod` segment required after the account locator. |
| US East (Ohio) | `xy12345.us-east-2.aws` |  |
| US East (N. Virginia) | `xy12345.us-east-1` | Cloud region ID is the only additional segment required. |
| US East (Commercial Gov - N. Virginia) | `xy12345.us-east-1-gov.aws` |  |
| US Gov East 1 (FedRAMP High Plus) | `xy12345.fhplus.us-gov-east-1.aws` | Additional `fhplus` segment required after the account locator. |
| Canada (Central) | `xy12345.ca-central-1.aws` |  |
| South America (Sao Paulo) | `xy12345.sa-east-1.aws` |  |
| Africa (Cape Town) | `xy12345.af-south-1.aws` |  |
| EU (Ireland) | `xy12345.eu-west-1` | Cloud region ID is the only additional segment required. |
| Europe (London) | `xy12345.eu-west-2.aws` |  |
| EU (Paris) | `xy12345.eu-west-3.aws` |  |
| EU (Frankfurt) | `xy12345.eu-central-1` | Cloud region ID is the only additional segment required. |
| EU (Zurich) | `xy12345.eu-central-2.aws` |  |
| EU (Stockholm) | `xy12345.eu-north-1.aws` |  |
| Middle East (UAE) | `xy12345.me-central-1.aws` |  |
| Asia Pacific (Tokyo) | `xy12345.ap-northeast-1.aws` |  |
| Asia Pacific (Osaka) | `xy12345.ap-northeast-3.aws` |  |
| Asia Pacific (Seoul) | `xy12345.ap-northeast-2.aws` |  |
| Asia Pacific (Mumbai) | `xy12345.ap-south-1.aws` |  |
| Asia Pacific (Singapore) | `xy12345.ap-southeast-1` | Cloud region ID is the only additional segment required. |
| Asia Pacific (Sydney) | `xy12345.ap-southeast-2` | Cloud region ID is the only additional segment required. |
| Asia Pacific (Jakarta) | `xy12345.ap-southeast-3.aws` |  |
| China (Ningxia) | `xy12345.cn-northwest-1.aws` | This region utilizes the `snowflakecomputing.cn` domain instead of the `snowflakecomputing.com` domain utilized by the other regions. |
| **Google Cloud Platform (GCP)** |  |  |
| US Central1 (Iowa) | `xy12345.us-central1.gcp` |  |
| US East4 (N. Virginia) | `xy12345.us-east4.gcp` |  |
| Europe West2 (London) | `xy12345.europe-west2.gcp` |  |
| Europe West3 (Frankfurt) | `xy12345.europe-west3.gcp` |  |
| Europe West4 (Netherlands) | `xy12345.europe-west4.gcp` |  |
| Middle East Central2 (Dammam) | `xy12345.me-central2.gcp` |  |
| Australia Southeast 2 (Melbourne) | `xy12345.australia-southeast2.gcp` |  |
| **Microsoft Azure** |  | Snowflake added hyphens to the Azure region IDs for consistency with AWS and GCP. |
| West US 2 (Washington) | `xy12345.west-us-2.azure` |  |
| Central US (Iowa) | `xy12345.central-us.azure` |  |
| South Central US (Texas) | `xy12345.south-central-us.azure` |  |
| East US (Virginia) | `xy12345.east-us.azure` |  |
| East US 2 (Virginia) | `xy12345.east-us-2.azure` |  |
| US Gov Virginia (FedRAMP High Plus) | `xy12345.fhplus.us-gov-virginia.azure` |  |
| US Gov Virginia | `xy12345.us-gov-virginia.azure` |  |
| Canada Central (Toronto) | `xy12345.canada-central.azure` |  |
| Mexico Central (Mexico City) | `xy12345.mexicocentral.azure` |  |
| UK South (London) | `xy12345.uk-south.azure` |  |
| North Europe (Ireland) | `xy12345.north-europe.azure` |  |
| Sweden Central (Gävle) | `xy12345.sweden-central.azure` |  |
| West Europe (Netherlands) | `xy12345.west-europe.azure` |  |
| Switzerland North (Zurich) | `xy12345.switzerland-north.azure` |  |
| UAE North (Dubai) | `xy12345.uae-north.azure` |  |
| Central India (Pune) | `xy12345.central-india.azure` |  |
| Japan East (Tokyo) | `xy12345.japan-east.azure` |  |
| Korea Central (Seoul) | `xy12345.korea-central.azure` |  |
| Southeast Asia (Singapore) | `xy12345.southeast-asia.azure` |  |
| Australia East (New South Wales) | `xy12345.australia-east.azure` |  |

## Account identifiers for private connectivity

If private connectivity to the Snowflake service is enabled for your account and you wish to use the feature to connect to Snowflake,
run the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function to determine the private connectivity URL to use.
You can use either the account name or account locator in the URL to connect to the Snowflake web interface.

If you want to connect to Snowsight using private connectivity, use the following instructions in the
[Signing in to Snowsight](ui-snowsight-gs.md).

## Account identifiers for replication and failover

The preferred method of identifying an account in replication and failover related SQL commands uses the organization name and account name
as the account identifier. If you decide to use the legacy account locator instead, it may need to contain additional segments in order to
uniquely identify the account. See the table below for reference:

> | Account Identifier | Location of the Remote Account |
> | --- | --- |
> | `organization_name.account_name` | Preferred account identifier that can be used regardless of the region or region group of the account that stores the primary database. |
> | `account_locator` | Same region but a different account from the account that stores the primary database. |
> | `snowflake_region.account_locator` | Same region group but a different region from the account that stores the primary database. |
> | `region_group.snowflake_region.account_locator` | Different region group from the account that stores the primary database. |

The values for `snowflake_region` and `region_group` can be found in the output of [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md).

## Snowflake region IDs and region groups

A Snowflake Region is a distinct region (deployed within an AWS, Azure, or GCP cloud region) that is isolated from other Snowflake
Regions. A Snowflake Region can be either multi-tenant (containing accounts for multiple organizations) or single-tenant
(aka Virtual Private Snowflake for a single organization).

Each Snowflake Region has an unique identifier and belongs to a region group, which enables global features such as data sharing and
replication.

### Region IDs

Because each cloud platform utilizes different conventions and formats for naming their regions, Snowflake assigns a canonical ID to
each Snowflake Region that uniquely identifies it across all the cloud platforms and their regions.

If the Organizations feature is enabled, specifying the Snowflake Region ID as part of an account identifier is required when you create
a new account, as well as when you configure replication and failover.

The following table displays the complete list of Snowflake Region IDs:

| Cloud Region | Cloud Region ID | Snowflake Region ID | Notes |
| --- | --- | --- | --- |
| **Amazon Web Services (AWS)** |  |  |  |
| US West (Oregon) | `us-west-2` | `aws_us_west_2` |  |
| US West (Commercial Gov - Oregon) | `us-west-2` | `aws_us_gov_west_2` | Available only for accounts on Business Critical (or higher); located in US West 2, not [AWS GovCloud (US)](https://aws.amazon.com/govcloud-us/). |
| US Gov West 1 (FedRAMP High Plus) | `us-gov-west-1` | `aws_us_gov_west_1_fhplus` | Available only for accounts on Business Critical (or higher); located in [AWS GovCloud (US)](https://aws.amazon.com/govcloud-us/). |
| US Gov West 1 (DoD) | `us-gov-west-1` | `aws_us_gov_west_1_dod` | Available only for accounts on Business Critical (or higher); located in [AWS GovCloud (US)](https://aws.amazon.com/govcloud-us/). |
| US East (Ohio) | `us-east-2` | `aws_us_east_2` |  |
| US East (N. Virginia) | `us-east-1` | `aws_us_east_1` |  |
| US East (Commercial Gov - N. Virginia) | `us-east-1` | `aws_us_gov_east_1` | Available only for accounts on Business Critical (or higher); located in US East 1, not [AWS GovCloud (US)](https://aws.amazon.com/govcloud-us/). |
| US Gov East 1 (FedRAMP High Plus) | `us-gov-east-1` | `aws_us_gov_east_1_fhplus` | Available only for accounts on Business Critical (or higher); located in [AWS GovCloud (US)](https://aws.amazon.com/govcloud-us/). |
| Canada (Central) | `ca-central-1` | `aws_ca_central_1` |  |
| South America (Sao Paulo) | `sa-east-1` | `aws_sa_east_1` |  |
| Africa (Cape Town) | `af-south-1` | `aws_af_south_1` |  |
| EU (Ireland) | `eu-west-1` | `aws_eu_west_1` |  |
| Europe (London) | `eu-west-2` | `aws_eu_west_2` |  |
| EU (Paris) | `eu-west-3` | `aws_eu_west_3` |  |
| EU (Frankfurt) | `eu-central-1` | `aws_eu_central_1` |  |
| EU (Zurich) | `eu-central-2` | `aws_eu_central_2` |  |
| EU (Stockholm) | `eu-north-1` | `aws_eu_north_1` |  |
| Middle East (UAE) | `me-central-1` | `aws-me-central-1` |  |
| Asia Pacific (Tokyo) | `ap-northeast-1` | `aws_ap_northeast_1` |  |
| Asia Pacific (Osaka) | `ap-northeast-3` | `aws_ap_northeast_3` |  |
| Asia Pacific (Seoul) | `ap-northeast-2` | `aws_ap_northeast_2` |  |
| Asia Pacific (Mumbai) | `ap-south-1` | `aws_ap_south_1` |  |
| Asia Pacific (Singapore) | `ap-southeast-1` | `aws_ap_southeast_1` |  |
| Asia Pacific (Sydney) | `ap-southeast-2` | `aws_ap_southeast_2` |  |
| Asia Pacific (Jakarta) | `ap-southeast-3` | `aws_ap_southeast_3` |  |
| China (Ningxia) | `cn-northwest-1` | `aws_cn_northwest_1` | Utilizes a different domain name (`snowflakecomputing.cn`) and is operated by Digital China Cloud Technology Limited (DCC), an authorized operating partner of Snowflake. |
| **Google Cloud Platform (GCP)** |  |  |  |
| US Central1 (Iowa) | `us-central1` | `gcp_us_central1` |  |
| US East4 (N. Virginia) | `us-east4` | `gcp_us_east4` |  |
| Europe West2 (London) | `europe-west2` | `gcp_europe_west2` |  |
| Europe West3 (Frankfurt) | `europe-west3` | `gcp_europe_west3` |  |
| Europe West4 (Netherlands) | `europe-west4` | `gcp_europe_west4` |  |
| Middle East Central2 (Dammam) | `me-central2` | `gcp_me_central2` |  |
| Australia Southeast 2 (Melbourne) | `australia-southeast2` | `gcp-australia-southeast2` |  |
| **Microsoft Azure** |  |  |  |
| West US 2 (Washington) | `westus2` | `azure_westus2` |  |
| Central US (Iowa) | `centralus` | `azure_centralus` |  |
| South Central US (Texas) | `southcentralus` | `azure_southcentralus` |  |
| East US (Virginia) | `eastus` | `azure_eastus` |  |
| East US 2 (Virginia) | `eastus2` | `azure_eastus2` |  |
| US Gov Virginia (FedRAMP High Plus) | `usgovvirginia` | `azure_usgovvirginia_fhplus` | Available only for accounts on Business Critical (or higher); located in [Microsoft Azure Government](https://docs.microsoft.com/en-us/azure/azure-government/). |
| US Gov Virginia | `usgovvirginia` | `azure_usgovvirginia` | Available only for accounts on Business Critical (or higher); located in [Microsoft Azure Government](https://docs.microsoft.com/en-us/azure/azure-government/). |
| Canada Central (Toronto) | `canadacentral` | `azure_canadacentral` |  |
| Mexico Central (Mexico City) | `mexicocentral` | `azure_mexicocentral` |  |
| UK South (London) | `uk-south` | `azure_uksouth` |  |
| North Europe (Ireland) | `northeurope` | `azure_northeurope` |  |
| Sweden Central (Gävle) | `swedencentral` | `azure_swedencentral` |  |
| West Europe (Netherlands) | `westeurope` | `azure_westeurope` |  |
| Switzerland North (Zurich) | `switzerlandnorth` | `azure_switzerlandnorth` |  |
| UAE North (Dubai) | `uaenorth` | `azure_uaenorth` |  |
| Central India (Pune) | `centralindia` | `azure_centralindia` |  |
| Japan East (Tokyo) | `japaneast` | `azure_japaneast` |  |
| Korea Central (Seoul) | `koreacentral` | `azure_koreacentral` |  |
| Southeast Asia (Singapore) | `southeastasia` | `azure_southeastasia` |  |
| Australia East (New South Wales) | `australiaeast` | `azure_australiaeast` |  |

### Region groups

A region group is a group of Snowflake Regions that offer similar security controls, isolation, and compliance. The region group to which
a Snowflake Region belongs differs depending on the region:

* All Snowflake multi-tenant commercial regions (across all the supported cloud platforms) are in the same shared/general `PUBLIC`
  group.
* Each Snowflake multi-tenant government region is in a separate group specific to the region.
* Each single-tenant Virtual Private Snowflake (VPS) is in a separate region group specific to the VPS. If your organization has more than
  one VPS, you can have one VPS per region group or multiple VPSs can share the same region group.

Specifying the region group as part of an account identifier is required when you want to create
accounts in different region groups. If you have questions about the region group of your account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Adaptive Compute
source: https://docs.snowflake.com/en/user-guide/warehouses-adaptive.md
section: User Guide
---

# Adaptive Compute

Adaptive Compute is a compute service focused on delivering strong performance with effortless
operations. It replaces the fixed compute engine with a workload-aware one that adapts to your
queries automatically. The system decides how to allocate resources for the best performance,
eliminating the need for infrastructure tuning.

By automatically scaling resources and intelligently routing queries, Adaptive Compute removes the
operational complexity that comes with traditional warehouse management: manual cluster sizing,
disruptive upgrades, and hands-on performance tuning. It also incorporates the latest hardware and
performance enhancements, so adaptive warehouses can run significantly more queries at a similar
cost to Gen2.

You access Adaptive Compute through adaptive warehouses. With an adaptive warehouse, you no longer
need to manage:

* Warehouse size (XSMALL, SMALL, MEDIUM, and so on).
* Multi-cluster warehouse settings.
* Query Acceleration Service settings.
* Suspend and resume semantics.

Snowflake handles all of this automatically, so your team can focus on working with data rather
than managing the infrastructure behind it.

All jobs across all adaptive warehouses in an account are routed to a shared pool of compute
resources. This pool is dedicated to your account: it isn’t shared with other accounts in your
organization and isn’t used by other warehouse types, such as standard, interactive, or
Snowpark-optimized. You can still have multiple adaptive warehouses per account
for grouping workloads with similar performance and cost characteristics,
reporting, and governance.

Adaptive warehouses use a query-based billing model, where the cost of each query
depends on factors like the amount of compute and software resources it uses. You
can still reason about costs at the warehouse level, because all queries running
in an adaptive warehouse add up to the total cost of that warehouse. Query-level
cost visibility isn’t available during Public Preview but is planned for general
availability.

The same cost management tools are available:

* [Budgets](budgets.md) and
  [resource monitors](resource-monitors.md) for cost governance.
* ACCOUNT_USAGE views for granular observability.

You can create new adaptive warehouses or convert existing standard warehouses to adaptive
without downtime. Converting existing warehouses allows you to retain your existing
chargeback and showback structures and workload segregation (analytics versus ETL, team-based
warehouses, and so on). For example, the finance team might use one adaptive warehouse and the
engineering team might use another.

## Limitations

Adaptive warehouses require Enterprise Edition (or higher).

During Public Preview, adaptive warehouses are available in the following regions:
US West 2 (Oregon), EU West 1 (Ireland), and AP Northeast 1 (Tokyo).

The following conversions are also **not** yet supported:

* Converting to or from an X5Large or X6Large warehouse.
* Converting to or from a Snowpark-optimized or interactive warehouse.

## Managing performance and throughput

Adaptive warehouses expose two primary properties to control performance and throughput:

* MAX_QUERY_PERFORMANCE_LEVEL
* QUERY_THROUGHPUT_MULTIPLIER

### MAX_QUERY_PERFORMANCE_LEVEL

MAX_QUERY_PERFORMANCE_LEVEL expresses the upper bound of performance for any individual query.
It’s set at the warehouse level and serves as the mechanism to tell the system to “speed up” or
“slow down” query execution.

The property is expressed in units of t-shirt sizes (XSMALL through X4LARGE). Each t-shirt size
conveys a similar or better level of performance than its commensurate classic warehouse size.

Type:
:   `{ XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE }`

Default:
:   `XLARGE`

**Semantics:**

* Larger values provide more compute headroom per statement, improving latency for large, complex
  queries, and increase potential instantaneous spend for a single statement.
* Smaller values constrain per-statement spend but might slow large queries while leaving more
  headroom for concurrency.
* This value doesn’t map to a specific underlying compute configuration. It expresses only a
  performance level: Snowflake determines the actual resources needed for each query.

**Behavior:**

Adaptive Compute determines the optimal compute needed for a query based on the
query plan. If the service determines that the compute needs for optimal
performance are greater than MAX_QUERY_PERFORMANCE_LEVEL, Snowflake caps it at
MAX_QUERY_PERFORMANCE_LEVEL. For smaller queries, Snowflake chooses compute for
optimal performance below MAX_QUERY_PERFORMANCE_LEVEL commensurate with what the
query needs.

**Guidance:**

Set MAX_QUERY_PERFORMANCE_LEVEL to the highest query performance you’re comfortable having for
your largest queries. Use [budgets](budgets.md) and
[resource monitors](resource-monitors.md) to govern total spend over time.

### QUERY_THROUGHPUT_MULTIPLIER

QUERY_THROUGHPUT_MULTIPLIER expresses the multiplier used to compute the maximum throughput at any
given time. Rather than specifying an absolute maximum throughput, you specify an integer scale
factor over the system-computed minimum.

To run `N` statements in parallel at the MAX_QUERY_PERFORMANCE_LEVEL, set the multiplier
to `N`. Because MAX_QUERY_PERFORMANCE_LEVEL represents the upper bound, this setting typically
supports more than `N` queries running in parallel, because many queries need less than the
maximum.

Type:
:   Non-negative integer

Default:
:   `2`

Setting this value to `0` means unlimited throughput: the warehouse can use as much burst
capacity as available with no cap.

**Semantics:**

When set to a positive value, the maximum throughput is computed as:

```text
MAX_THROUGHPUT = QUERY_THROUGHPUT_MULTIPLIER * MINIMUM
```

Where `MINIMUM` is a system-computed base capacity for the MAX_QUERY_PERFORMANCE_LEVEL set on
the warehouse.

* Acts as a scale factor on this system-computed base capacity.
* Higher values increase peak throughput (more concurrent work) and reduce queuing, at the cost of
  potentially higher instantaneous spend.
* Lower values constrain burst throughput and reduce the risk of sudden spikes in spend, but might
  lead to queuing.

**Behavior:**

Snowflake computes an internal base capacity rate for the warehouse based on
MAX_QUERY_PERFORMANCE_LEVEL, migration history (classic size, max cluster count, QAS scale
factor), and other system tuning parameters.

QUERY_THROUGHPUT_MULTIPLIER multiplies against this base capacity to determine
the total number of queries that can be executed concurrently. When the system
is below this target, it allows execution of the query. When it reaches the
target, it queues the query.

**Guidance:**

If you observe persistent queued-on-load time and want higher throughput, increase
QUERY_THROUGHPUT_MULTIPLIER. If you’re more concerned about capping instantaneous spend, reduce
QUERY_THROUGHPUT_MULTIPLIER and rely on budgets and resource monitors for absolute cost controls.

## Create an adaptive warehouse

You can create an adaptive warehouse using Snowsight, SQL, or
[Cortex Code](cortex-code/cortex-code.md).

SnowsightSQLCortex Code

To create an adaptive warehouse using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Compute » Warehouses.
3. Select +Warehouse.
4. In the Type dropdown, select Adaptive.
5. Optionally, select Advanced and configure:

   * Maximum query performance level (default: XLarge)
   * Query throughput multiplier (default: 2)

The warehouse is created and can be used normally.

As an alternative to using Snowsight, you can create an adaptive warehouse using
the CREATE ADAPTIVE WAREHOUSE command.

```sqlexample
CREATE ADAPTIVE WAREHOUSE my_adaptive_wh;
```

This creates an adaptive warehouse using defaults (MAX_QUERY_PERFORMANCE_LEVEL = XLARGE,
QUERY_THROUGHPUT_MULTIPLIER = 2). Snowflake uses conservative, safe defaults so you can
start without tuning.

You can also specify properties at creation time:

```sqlexample
CREATE ADAPTIVE WAREHOUSE my_adaptive_wh
  WITH MAX_QUERY_PERFORMANCE_LEVEL = XLARGE
       QUERY_THROUGHPUT_MULTIPLIER = 4;
```

For the full syntax, additional examples, and the list of optional properties,
see the SQL reference section.

You can ask [Cortex Code](cortex-code/cortex-code.md) to create an
adaptive warehouse using natural language. For example:

```text
Create an adaptive warehouse called my_adaptive_wh with
MAX_QUERY_PERFORMANCE_LEVEL set to XLARGE and
QUERY_THROUGHPUT_MULTIPLIER set to 4.
```

Cortex Code generates and runs the appropriate SQL on your behalf.

## Convert a standard warehouse to an adaptive warehouse

You can convert a standard warehouse to adaptive using Snowsight, SQL, or
[Cortex Code](cortex-code/cortex-code.md).

> **Note:**
>
> Converting a warehouse to or from an adaptive warehouse is an *online operation*,
> which means that it doesn’t involve any downtime. This conversion doesn’t make the
> warehouse unavailable or interrupt any running queries.
>
> When you convert a warehouse to an adaptive warehouse or back to a standard warehouse,
> existing queries that were running on that warehouse continue to run to completion using
> the existing compute resources. At the same time, the warehouse runs any new queries on
> the compute resources of the new warehouse type. While the existing queries are running,
> you’re charged for both sets of compute resources. If you’re converting the warehouse back
> to a standard one, the warehouse doesn’t automatically suspend during this period, whether
> or not any queries are using the new compute resources. When the existing queries complete,
> the workload shifts entirely to the new compute resources.

SnowsightSQLCortex Code

To convert a standard warehouse to an adaptive warehouse using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Compute » Warehouses » <warehouse_identifier>.
3. Select the more menu … (three dots) » Convert to Adaptive.
4. Confirm the operation.

To convert a standard warehouse to an adaptive warehouse, use the
ALTER WAREHOUSE command to set the WAREHOUSE_TYPE property to
`ADAPTIVE`. For example:

```sqlexample
ALTER WAREHOUSE my_warehouse SET WAREHOUSE_TYPE = 'ADAPTIVE';
```

To convert the adaptive warehouse back to a standard warehouse,
change the property to `STANDARD`. For example:

```sqlexample
ALTER WAREHOUSE my_warehouse SET WAREHOUSE_TYPE = 'STANDARD';
```

You can ask [Cortex Code](cortex-code/cortex-code.md) to convert
a warehouse using natural language. For example:

```text
Convert my_warehouse to an adaptive warehouse.
```

Cortex Code generates and runs the appropriate SQL on your behalf.

### Property behavior during conversion

When you convert a standard warehouse to an adaptive warehouse, the only property you must change
is WAREHOUSE_TYPE. Snowflake automatically computes appropriate values for
MAX_QUERY_PERFORMANCE_LEVEL and QUERY_THROUGHPUT_MULTIPLIER.

The system derives these from the existing configuration of the standard warehouse:

* Warehouse size.
* MAX_CLUSTER_COUNT (for multi-cluster warehouses).
* QAS scale factor.
* Warehouse generation (hardware/software generation).

The goal is to preserve or improve performance compared to the original standard warehouse,
provide enough burst capacity for typical load spikes, and avoid requiring manual tuning when
switching to adaptive.

After conversion, you can optionally override MAX_QUERY_PERFORMANCE_LEVEL and
QUERY_THROUGHPUT_MULTIPLIER using ALTER WAREHOUSE. Standard warehouse properties
such as WAREHOUSE_SIZE and MAX_CLUSTER_COUNT no longer apply after conversion to
adaptive, and adaptive properties no longer apply after conversion back to standard.

## Billing and pricing

Adaptive warehouses use a query-based billing model. The cost of each query
depends on factors like the amount of compute and software resources it uses,
including the cluster sizes and additional capacity used by features like
Query Acceleration Service (QAS). You aren’t charged for creating an adaptive
warehouse: charges start when the first query runs.

All queries running in an adaptive warehouse add up to the total cost of that
warehouse, so you can continue to use existing chargeback and showback
structures. Adaptive warehouse usage is reported as part of COMPUTE in usage
statements using virtual warehouse credits.

You control performance and spend primarily through:

* **MAX_QUERY_PERFORMANCE_LEVEL**: caps the per-statement performance level.
* **QUERY_THROUGHPUT_MULTIPLIER**: caps the overall burst capacity at any instant.
* **Budgets and resource monitors**: govern total spend over time at the account
  or warehouse level.

Typical configuration patterns:

| Workload type | Configuration |
| --- | --- |
| Latency-sensitive, critical workloads | Higher MAX_QUERY_PERFORMANCE_LEVEL (XLARGE or above). Higher QUERY_THROUGHPUT_MULTIPLIER. Resource monitors or budgets to keep aggregate spend within plan. |
| Cost-sensitive, high-throughput workloads | Moderate MAX_QUERY_PERFORMANCE_LEVEL (MEDIUM or LARGE). Medium QUERY_THROUGHPUT_MULTIPLIER to balance throughput against spend spikes. |
| Tightly budgeted workloads | Lower MAX_QUERY_PERFORMANCE_LEVEL. Lower QUERY_THROUGHPUT_MULTIPLIER. Strict budgets and resource monitors. |

You can use [ACCOUNT_USAGE](../sql-reference/account-usage.md) views to retrieve
granular data on credit consumption for a specific adaptive warehouse. Use
[WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md) to view credit
consumption for your warehouse. For a full list of relevant views, see
Account Usage views.

For more information about compute cost, see
[Understanding compute cost](cost-understanding-compute.md).

## SQL reference

### CREATE ADAPTIVE WAREHOUSE

Creates a new adaptive virtual warehouse.

```sqlsyntax
CREATE [ OR REPLACE ] ADAPTIVE WAREHOUSE [ IF NOT EXISTS ] <name>
  [ [ WITH ] adaptiveProperties ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
  [ objectParams ]

adaptiveProperties ::=
  COMMENT = '<string_literal>'
  MAX_QUERY_PERFORMANCE_LEVEL = { XSMALL | SMALL | MEDIUM | LARGE
                                | XLARGE | XXLARGE | XXXLARGE | X4LARGE }
  QUERY_THROUGHPUT_MULTIPLIER = <integer>

objectParams ::=
  STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
  STATEMENT_TIMEOUT_IN_SECONDS = <num>
```

You can also create an adaptive warehouse using the standard
[CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) syntax with
`WAREHOUSE_TYPE = 'ADAPTIVE'`:

```sqlsyntax
CREATE [ OR REPLACE ] WAREHOUSE [ IF NOT EXISTS ] <name>
  [ [ WITH ] WAREHOUSE_TYPE = 'ADAPTIVE'
    [ adaptiveProperties ]
  ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
  [ objectParams ]
```

> **Note:**
>
> Standard warehouse properties such as WAREHOUSE_SIZE, MIN_CLUSTER_COUNT,
> MAX_CLUSTER_COUNT, and SCALING_POLICY can’t be set on an adaptive warehouse.
> Similarly, adaptive warehouse properties such as MAX_QUERY_PERFORMANCE_LEVEL
> and QUERY_THROUGHPUT_MULTIPLIER can’t be set on a standard warehouse.

#### Required parameters

`name`
:   Identifier for the adaptive virtual warehouse. Must be unique for your account.
    Must start with an alphabetic character and can’t contain spaces or special
    characters unless enclosed in double quotes. See
    [Object identifiers](../sql-reference/identifiers.md) for details.

#### Optional properties

`MAX_QUERY_PERFORMANCE_LEVEL = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE }`
:   Upper bound on the performance level for a single statement, expressed
    as a t-shirt size. Default: `XLARGE`.

    Snowflake chooses a performance level up to this bound based on statement
    characteristics. Smaller statements might run at a lower performance level
    to reduce spend. Choose a value appropriate for your largest queries.

    For more details, see Managing performance and throughput.

`QUERY_THROUGHPUT_MULTIPLIER = <integer>`
:   Multiplier used to compute the maximum throughput at any given time, expressed
    as a non-negative integer scale factor over the system-computed minimum. Higher
    values increase peak throughput (more concurrent work) and reduce queuing, at
    the cost of potentially higher instantaneous spend. Lower values constrain
    burst throughput and reduce the risk of sudden spikes in spend, but might lead
    to queuing. A value of `0` means unlimited throughput.

    Default: `2`.

    For more details, see Managing performance and throughput.

`STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>`
:   Maximum time, in seconds, a SQL statement can remain queued on the warehouse
    before Snowflake cancels it. See
    [Parameters](../sql-reference/parameters.md) for details.

`STATEMENT_TIMEOUT_IN_SECONDS = <num>`
:   Maximum time, in seconds, a running SQL statement can run before Snowflake
    cancels it. See [Parameters](../sql-reference/parameters.md) for details.

#### Examples

Create an adaptive warehouse with defaults:

```sqlexample
CREATE ADAPTIVE WAREHOUSE my_adaptive_wh;
```

Create with a specific performance level:

```sqlexample
CREATE ADAPTIVE WAREHOUSE my_adaptive_wh
  WITH MAX_QUERY_PERFORMANCE_LEVEL = XXLARGE;
```

Create with both properties:

```sqlexample
CREATE ADAPTIVE WAREHOUSE my_adaptive_wh
  WITH MAX_QUERY_PERFORMANCE_LEVEL = MEDIUM
       QUERY_THROUGHPUT_MULTIPLIER = 6;
```

Create using the standard CREATE WAREHOUSE syntax:

```sqlexample
CREATE WAREHOUSE my_adaptive_wh
  WITH WAREHOUSE_TYPE = 'ADAPTIVE'
       MAX_QUERY_PERFORMANCE_LEVEL = LARGE
       QUERY_THROUGHPUT_MULTIPLIER = 3;
```

### ALTER WAREHOUSE (adaptive)

You can use [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) to convert
a standard warehouse to adaptive, modify adaptive warehouse properties, or
convert an adaptive warehouse back to standard.

Convert a standard warehouse to adaptive:

```sqlexample
ALTER WAREHOUSE my_warehouse SET WAREHOUSE_TYPE = 'ADAPTIVE';
```

Modify adaptive warehouse properties after creation or conversion:

```sqlexample
ALTER WAREHOUSE my_adaptive_wh SET
  MAX_QUERY_PERFORMANCE_LEVEL = XLARGE
  QUERY_THROUGHPUT_MULTIPLIER = 8;
```

Convert an adaptive warehouse back to standard:

```sqlexample
ALTER WAREHOUSE my_warehouse SET WAREHOUSE_TYPE = 'STANDARD';
```

## SHOW WAREHOUSES

The adaptive warehouse feature introduces new columns to the
[SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) command. Properties
that don’t apply to adaptive warehouses are shown as `NULL`.

Columns specific to adaptive warehouses include:

| Column name | Description |
| --- | --- |
| STATE | One of:   * ENABLED (active/running) * DISABLED (inactive) |
| MAX_QUERY_PERFORMANCE_LEVEL | Expressed as a t-shirt size. Upper bound on the per-statement performance level. |
| QUERY_THROUGHPUT_MULTIPLIER | Integer scale factor controlling how much burst capacity the warehouse can use at any instant. |
| DISABLED_REASONS | One or more reasons why the adaptive warehouse was disabled. |

## Account Usage views

The following ACCOUNT_USAGE views are available for adaptive warehouses:

* [WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md)
* [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md)
* [WAREHOUSE_LOAD_HISTORY view](../sql-reference/account-usage/warehouse_load_history.md)

> **Note:**
>
> For adaptive warehouses, QAS usage is included in compute credits and doesn’t
> appear as a separate credit column. Use
> [WAREHOUSE_LOAD_HISTORY view](../sql-reference/account-usage/warehouse_load_history.md) to monitor queuing
> behavior and understand whether to adjust MAX_QUERY_PERFORMANCE_LEVEL or
> QUERY_THROUGHPUT_MULTIPLIER.

The following sample query produces a time series of warehouse-level performance
data for any warehouse that ran at least one query in `ADAPTIVE` state within a
specified lookback period.

```sqlexample
WITH adaptive_whs AS (
  SELECT DISTINCT warehouse_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY q
  WHERE q.warehouse_size = 'ADAPTIVE'
    AND q.start_time >= DATEADD(day, -7, CURRENT_DATE())
)
SELECT
  q.end_time::DATE AS ds,
  q.warehouse_name,
  IFF(q.warehouse_size = 'ADAPTIVE', 'ADAPTIVE', 'STANDARD') AS warehouse_type,
  AVG(q.total_elapsed_time) AS avg_query_time,
  AVG(q.execution_time) AS avg_exec_time,
  AVG(q.queued_overload_time) AS avg_queued_overload_time,
  AVG(q.queued_provisioning_time) AS avg_queued_provisioning_time
FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY q
WHERE q.start_time >= DATEADD(day, -7, CURRENT_DATE())
  AND q.warehouse_name IN (SELECT warehouse_name FROM adaptive_whs)
GROUP BY ALL;
```

## Bulk migration of standard warehouses to adaptive

If you want to migrate many standard warehouses to adaptive simultaneously,
you can use the SYSTEM$BULK_UPDATE_WH function.

Parameters for SYSTEM$BULK_UPDATE_WH function

| Parameter | Description | Allowed values |
| --- | --- | --- |
| property_name | The warehouse property to update. | `'WAREHOUSE_TYPE'` |
| new_value | New value for the property. | `'ADAPTIVE'` or `'STANDARD'` |
| property_filter | JSON filter on warehouse properties (for example, name pattern, size). Warehouses matching all filters are considered for update. | `'{"name": "TEST.*"}'` |
| tag_filter | JSON filter on tags. Warehouses must match all specified tags to be selected. | `'{"cost-centre": "sales"}'` |
| execution_mode | Operation mode: perform the update or dry run. | `'ACTIVE'`, `'DRY_RUN'` |

Suggested usage:

1. First, do a dry run and review the results:

   ```sqlexample
   SELECT SYSTEM$BULK_UPDATE_WH(
     'WAREHOUSE_TYPE',
     'ADAPTIVE',
     '{"WAREHOUSE_TYPE": "STANDARD"}',
     'DRY_RUN'
   );
   ```
2. Review the output and adjust filters if necessary.
3. After verifying the dry run, call the function again using the active mode:

   ```sqlexample
   SELECT SYSTEM$BULK_UPDATE_WH(
     'WAREHOUSE_TYPE',
     'ADAPTIVE',
     '{"WAREHOUSE_TYPE": "STANDARD"}',
     'ACTIVE'
   );
   ```
4. Carefully review the results and any errors before repeating or broadening
   the migration scope.

---
title: Adjust privacy controls
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-admin-adjust.md
section: User Guide
---

# Adjust privacy controls

This topic describes techniques the data owner can use to adjust the privacy controls that Snowflake uses to introduce noise into results.
Snowflake recommends trying these options in the order in which they’re presented in this topic.

Snowflake provides parameters to adjust both the privacy budget’s limit on privacy loss and the maximum amount of privacy budget used per
aggregate (collectively known as the *epsilon* in differential privacy literature).

## Step 1: Adjust privacy domains

Before adjusting the privacy budget, you should consider adjusting the privacy domain set on the columns of the privacy-protected table.
Snowflake introduces enough noise to obscure all values in a column, so the wider the range of values, the more noise that must be introduced.
Follow these guidelines:

* If you want to increase the noise, broaden the range to include values that are greater or less than the actual values. Remember, the
  privacy domain defines all *possible* values, not actual values.
* If you want to decrease the noise, narrow the privacy domain to exclude or clamp values outside a useful range. For information about how
  values outside the privacy domain are treated, see [Values outside a privacy domain](differential-privacy-privacy-domains.md).

> **Note:**
>
> The analyst can also narrow a privacy domain to decrease noise. For more information, see
> [Narrowing a privacy domain to improve results](differential-privacy-privacy-domains-analyst.md)

## Step 2: Adjust MAX_BUDGET_PER_AGGREGATE parameter

If you’ve adjusted the privacy domain, but still need to fine-tune your privacy controls, you can start modifying settings that affect the
privacy budget. Adjusting the `MAX_BUDGET_PER_AGGREGATE` parameter in the body of a privacy policy controls how much of a privacy
budget can be spent on each aggregate in a query (that is, how much privacy loss an aggregate can incur). Adjusting this parameter changes
the amount of noise added to each aggregate query, as well as the number of aggregates that can be executed before the privacy budget
limit is reached.

The parameter sets the level for each aggregate, not each query. As an example, the query `SELECT COUNT(*), AVG(a) ...` has two
aggregates: `COUNT(*)` and `AVG(a)`.

To adjust the maximum privacy loss incurred by each aggregate in a query, use the [ALTER PRIVACY POLICY](../../sql-reference/sql/alter-privacy-policy.md) command to
set a new value for the `MAX_BUDGET_PER_AGGREGATE` parameter. For example:

```sqlexample
ALTER PRIVACY POLICY users_policy SET BODY ->
  PRIVACY_BUDGET(BUDGET_NAME=>'analysts', MAX_BUDGET_PER_AGGREGATE=>0.1);
```

## Step 3: Adjust limit of the privacy budget

If adjusting other privacy controls doesn’t give you the results you’re looking for, you can adjust the privacy budget’s limit on privacy
loss. While the other privacy controls affect the amount of noise in query results, adjusting the budget limit affects how many queries an
analyst can run.

Each time an analyst runs a query with aggregate functions against a privacy-protected table, the analyst’s cumulative privacy loss is
incremented, and the estimated number of remaining aggregates is decremented. When the cumulative privacy loss reaches the privacy budget’s
limit, the analysts cannot run another query. If you want to maximize the usefulness of your data to the analyst, you can base your budget
limit on how many queries you think analysts will run during each budget window.

> **Note:**
>
> Remember that cumulative privacy loss is reset to 0 on a fixed schedule, as defined by the [budget window](differential-privacy-admin-privacy-budgets.md). When the privacy budget is reset, the analyst can run a fresh set of
> queries even if they reached the budget limit during the previous budget window.

The [ESTIMATE_REMAINING_DP_AGGREGATES](../../sql-reference/functions/estimate_remaining_dp_aggregates.md) function helps estimate the number of queries remaining for a privacy
budget. In general, this number is based on the number of aggregates in each query and the value of the `MAX_BUDGET_PER_AGGREGATE`
parameter that you specified in the body of the privacy policy. For an extended example of using the ESTIMATE_REMAINING_DP_AGGREGATES
function to see the effects of queries on the privacy budget, see [Tracking privacy budget spending](differential-privacy-analyst.md).

After you have used the ESTIMATE_REMAINING_DP_AGGREGATES function to get an idea of how much privacy budget is spent on a series of queries,
you can adjust the `BUDGET_LIMIT` parameter in the body of the privacy policy to set a new privacy budget limit. For example:

```sqlexample
ALTER PRIVACY POLICY users_policy SET BODY ->
  PRIVACY_BUDGET(BUDGET_NAME=>'analysts',
  BUDGET_LIMIT=>300,
  MAX_BUDGET_PER_AGGREGATE=>0.1);
```

> **Important:**
>
> Note that this command includes the `MAX_BUDGET_PER_AGGREGATE` parameter that was set previously. If you don’t include a parameter
> in the ALTER PRIVACY POLICY statement, it resets to its default value.

---
title: Advanced Column-level Security topics
source: https://docs.snowflake.com/en/user-guide/security-column-advanced.md
section: User Guide
---

# Advanced Column-level Security topics

This topic provides an introduction to two advanced concepts related to Column-level Security masking policies:

1. Role hierarchy.
2. Using multiple [Context functions](../sql-reference/functions-context.md).

## Context functions and role hierarchy

Column-level Security supports using [Context functions](../sql-reference/functions-context.md) in the conditions of the masking policy body to enforce
whether a user has authorization to see data. To determine whether a user can see data in a given SQL statement, it is helpful to consider:

The current session:
:   Masking policy conditions using [CURRENT_ROLE](../sql-reference/functions/current_role.md) target the role in use for the current session.

Database and schema:
:   If you specify the [CURRENT_DATABASE](../sql-reference/functions/current_database.md) or [CURRENT_SCHEMA](../sql-reference/functions/current_schema.md) function in the
    body of a masking or row access policy, the function returns the database or schema that contains the protected table, not the database or
    schema in use for the session.

The executing role:
:   Masking policy conditions using [INVOKER_ROLE](../sql-reference/functions/invoker_role.md) target the executing role in a SQL statement.

Role hierarchy:
:   If role hierarchy is necessary in the policy conditions, use [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md).

    Determine if a specified role in a masking policy condition (e.g. `analyst` custom role) is a lower privilege role in the
    CURRENT_ROLE or INVOKER_ROLE role hierarchy. If so, then the role returned by the CURRENT_ROLE or INVOKER_ROLE functions inherits the
    privileges of the specified role. For more information about role hierarchy and privilege inheritance, see:

    * [Overview of Access Control](security-access-control-overview.md)
    * [Configuring access control](security-access-control-configure.md)

The following table shows common context functions in masking policies that target the session, the executing role, and role hierarchy.

| Context function | Description |
| --- | --- |
| [CURRENT_ROLE](../sql-reference/functions/current_role.md) | Returns the name of the role in use for the current session. |
| [CURRENT_DATABASE](../sql-reference/functions/current_database.md) | In a policy body, returns the database that contains the table that is protected by the masking policy. |
| [CURRENT_SCHEMA](../sql-reference/functions/current_schema.md) | In a policy body, returns the schema that contains the table that is protected by the masking policy. |
| [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) | Returns TRUE if the user’s current role in the session (i.e. the role returned by [CURRENT_ROLE](../sql-reference/functions/current_role.md)) inherits the privileges of the specified role. |
| [INVOKER_ROLE](../sql-reference/functions/invoker_role.md) | Returns the name of the executing role. |
| [IS_GRANTED_TO_INVOKER_ROLE](../sql-reference/functions/is_granted_to_invoker_role.md) | Returns TRUE if the role returned by the INVOKER_ROLE function inherits the privileges of the specified role in the argument based on the context in which the function is called. |
| [INVOKER_SHARE](../sql-reference/functions/invoker_share.md) | Returns the name of the share that directly accessed the table or view where the INVOKER_SHARE function is invoked. |

### Use CURRENT_ROLE and IS_ROLE_IN_SESSION

A masking policy condition using CURRENT_ROLE targets the current session and is not affected by the execution context of the SQL statement.

If role activation and role hierarchy is necessary in the policy conditions, use [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md).

Consider the following masking policy body:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY mask_string AS
> (val string) RETURNS string ->
> CASE
>   WHEN CURRENT_ROLE() IN ('ANALYST') THEN val
>   ELSE '********'
> END;
> ```

To determine whether a given user has authorization to see data in a column where this masking policy is set on that column, complete the
following steps:

1. Evaluate the masking policy conditions.
2. Determine if the specified role is in the CURRENT_ROLE hierarchy.
3. Run a test query to verify.

#### Step 1: Evaluate the masking policy conditions

The following table summarizes the consequences of the masking policy body conditions.

| Context | Sees unmasked data | Sees masked data |
| --- | --- | --- |
| CURRENT_ROLE = ANALYST custom role. | ✔ |  |
| CURRENT_ROLE is in the ANALYST custom role in hierarchy. | ✔ |  |
| CURRENT_ROLE is not in the ANALYST custom role hierarchy. |  | ✔ |

Next, evaluate the role hierarchy.

#### Step 2: Determine if the specified role is in the CURRENT_ROLE hierarchy

Assuming that the CURRENT_ROLE is not the ANALYST custom role, determine if the CURRENT_ROLE inherits the privileges granted to the ANALYST custom role.

Execute the following statement:

> ```sqlexample
> SELECT IS_ROLE_IN_SESSION('ANALYST');
> ```
>
> ```output
> +-------------------------------+
> | IS_ROLE_IN_SESSION('ANALYST') |
> +-------------------------------+
> | FALSE                         |
> +-------------------------------+
> ```

Since Snowflake returns FALSE, the CURRENT_ROLE does not inherit privileges granted to the ANALYST custom role. Therefore, based on the masking policy body in this example, the user should see a fixed mask value.

#### Step 3: Run a test query to verify

Execute a query on the column that has the masking policy in this example applied to that column to verify that the user sees a fixed masked value.

```sqlexample
USE ROLE analyst;

SELECT * FROM mydb.mysch.mytable;
```

### Use INVOKER_ROLE

A masking policy condition using INVOKER_ROLE targets the execution context of the SQL statement.

The following table summarizes the execution context and the value that INVOKER_ROLE returns in a masking policy condition:

| Context | Evaluated role |
| --- | --- |
| User | [CURRENT_ROLE](../sql-reference/functions/current_role.md) |
| Table | CURRENT_ROLE. |
| View | View owner role. |
| UDF | UDF owner role. |
| Stored procedure with caller’s right | CURRENT_ROLE. |
| Stored procedure with owner’s right | Stored procedure owner role. |
| Task | Task owner role. |
| Stream | The role that queries a given [stream](streams-intro.md). |

Consider the following masking policy body that is applied to a single view on a table:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY mask_string AS
> (val string) RETURNS string ->
> CASE
>   WHEN INVOKER_ROLE() IN ('ANALYST') THEN val
>   ELSE '********'
> END;
> ```

To determine whether a given user running a query on the column has authorization to see data, complete the following steps:

1. Evaluate the masking policy conditions.
2. Determine if the specified role owns the view.
3. Run a test query to verify.

#### Step 1: Evaluate the masking policy conditions

The following table summarizes the consequences of the masking policy body conditions applied to a view column.

| Context | Sees unmasked data | Sees masked data |
| --- | --- | --- |
| `analyst` custom role is the view owner role. | ✔ |  |
| `analyst` custom role is not the view owner role. |  | ✔ |

Next, determine if the ANALYST custom role owns the view.

#### Step 2: Determine if the ANALYST role owns the view

To determine if the ANALYST custom role owns the view, execute the following statement:

```sqlexample
SHOW GRANTS OF ROLE analyst;
```

If the `analyst` custom role owns the view, then a query on the view column should result in unmasked data.

If the `analyst` custom role does not own the view, masked data should be seen.

#### Step 3: Run a test query to verify

Execute a query on the view column to determine whether the ANALYST custom role sees masked or unmasked data.

```sqlexample
USE ROLE analyst;

SELECT * FROM mydb.mysch.myview;
```

### Use IS_GRANTED_TO_INVOKER_ROLE

The IS_GRANTED_TO_INVOKER_ROLE function can be passed into a masking policy body as part of a condition. When the function evaluates to TRUE, the role in the function argument is in the INVOKER_ROLE hierarchy.

Consider the following masking policy body that is applied to a view column of social security numbers (SSNs):

```sqlexample
CREATE OR REPLACE MASKING POLICY mask_string AS
(val string) RETURNS string ->
CASE
  WHEN IS_GRANTED_TO_INVOKER_ROLE('PAYROLL') THEN val
  WHEN IS_GRANTED_TO_INVOKER_ROLE('ANALYST') THEN REGEXP_REPLACE(val, '[0-9]', '*', 7)
  ELSE '*******'
END;
```

To determine whether a given user running a query on the view column has authorization to see data, complete the following steps:

1. Evaluate the masking policy conditions.
2. Determine if the specified role is in invoker role hierarchy. For example, if the policy is set on a view, the specified role must
   be in the view owner role hierarchy to return TRUE. For details, see the [usage notes](../sql-reference/functions/is_granted_to_invoker_role.md).
3. Run a test query to verify.

#### Step 1: Evaluate the masking policy conditions

The following table summarizes the consequences of the masking policy body conditions applied to a view column and viewing data in the view column.

| Context | Unmasked data | Partially masked data | Masked data |
| --- | --- | --- | --- |
| `payroll` custom role is in the view owner role hierarchy. | ✔ |  |  |
| `analyst` custom role is in the view owner role hierarchy. |  | ✔ |  |
| Neither the `payroll` nor `analyst` custom roles are in the view owner hierarchy. |  |  | ✔ |

#### Step 2: Determine if the specified role is in the view owner role hierarchy

If either the `payroll` or `analyst` custom roles are in the view owner hierarchy, then executing a
[SHOW GRANTS](../sql-reference/sql/show-grants.md) command on the view owner role can verify the role hierarchy. For example:

> ```sqlexample
> SHOW GRANTS TO ROLE view_owner_role;
> ```

The outputs of the SQL statement will state whether the view owner role has been granted either the `payroll` or `analyst` custom roles.

#### Step 3: Run a test query to verify

Execute a query on the column that has the masking policy in this example applied to that column to verify how the user sees data in the
view column.

```sqlexample
USE ROLE payroll;

SELECT * FROM mydb.mysch.myview;

USE ROLE analyst;

SELECT * FROM mydb.mysch.myview;
```

## Combine CURRENT_ROLE and INVOKER_ROLE in masking policies

Snowflake supports creating a single masking policy to differentiate the role in use for the session that executes a query
(i.e. [CURRENT_ROLE](../sql-reference/functions/current_role.md)) and the object owner executing a query
(e.g. view owner, [INVOKER_ROLE](../sql-reference/functions/invoker_role.md)). Uses cases of this type are typically more complicated than simply
determining a set of values to mask and a relatively small audience (e.g. users with the `analyst` custom role) that can see unmasked
values.

### Hashing, cryptographic, and encryption functions in masking policies

[Hashing](../sql-reference/functions-hash-scalar.md) and [cryptographic/checksum](../sql-reference/functions-string.md) can be used in masking policies to mask sensitive data.

Before implementing any of these functions in a [masking policy](security-column-intro.md), it is important to consider
whether your use case with these functions involve [JOIN](../sql-reference/constructs/join.md) operations. Under certain masking policy
implementations, creative JOIN operations that involve tables and views can lead to reverse engineering the masked value to its true value
based upon the following limitation:

* It is possible that collisions may occur because there may not be a 1:1 representation of the actual value (i.e. input) and the hashed,
  cryptographic, or checksum value based on the total number of values (i.e. output, the range of values) to transform.

A 1:1 representation is more likely to occur until the total number of input values reaches the square root of the output values to
transform.

For example, if the output values to hash is 144, then it is reasonable to expect that the first 12 values
(i.e. 144^(1/2) – the square root of 144) will be unique and that collisions might occur for the remaining 132 values. Since this
limitation and its consequence is possible, it is advisable to never use hashed, cryptographic, or checksum functions in masking policies
whose values may be used in JOIN operations.

> **Tip:**
>
> If the masking policy use case prioritizes collision avoidance for enhanced security, implement
> [External Tokenization](security-column-ext-token-intro.md). Tokenization does not result in collisions because there is
> always a 1:1 representation of the input and output values.
>
> If tokenization is not possible, one possible workaround is to implement a masking policy to differentiate between the session role
> executing a query (i.e. [CURRENT_ROLE](../sql-reference/functions/current_role.md)) and the object owner executing a query
> (i.e. [INVOKER_ROLE](../sql-reference/functions/invoker_role.md)).
>
> For example, the following masking policy assumes two different custom roles, CSR_EMPL_INFO and DBA_EMPL_INFO, to regulate access to
> employee information.
>
> > ```sqlexample
> > CREATE OR REPLACE MASKING POLICY mask_string AS
> > (val string) RETURNS string ->
> > CASE
> >     WHEN CURRENT_ROLE() IN ('CSR_EMPL_INFO') THEN HASH(val)
> >     WHEN INVOKER_ROLE() IN ('DBA_EMPL_INFO') THEN val
> >     ELSE null
> > END;
> > ```
>
> If the policy is applied to the table, then the policy will be inherited to any view created from the table. If the custom role
> `dba_empl_info` owns the view created from this table (i.e. has the OWNERSHIP privilege on the view), then only users with this custom
> role can see the actual values if querying the view. Users with the `csr_empl_info` custom role always see a hashed value whether query
> is made on the table or view. All other users see `NULL`.

---
title: Aggregation policies
source: https://docs.snowflake.com/en/user-guide/aggregation-policies.md
section: User Guide
---

# Aggregation policies

An aggregation policy is a schema-level object that controls what type of query can access data from a table or view. When an aggregation
policy is applied to a table, queries against that table must aggregate data into groups of a minimum size in order to return results,
thereby preventing a query from returning information from an individual record. A table or view with an aggregation policy assigned to it
is said to be *aggregation-constrained*.

Aggregation policies can be used with or without an entity key. When aggregation policies are used without an entity key, they protect the
privacy of individual rows in the data set (that is, row-level privacy). If you use an aggregation policy with an entity key, it protects
the privacy of an entity, even if information about that entity appears in multiple rows (that is, entity-level privacy).

For more information about combining aggregation policies with an entity key, see [Implementing entity-level privacy with aggregation policies](aggregation-policies-entity-privacy.md).

## Overview

A core feature of Snowflake is the ability to share data sets with other entities. Aggregation policies allow a provider (data owner) to
exercise control over what can be done with their data even after it is shared with a consumer. Specifically, the provider can require a
consumer of a table to aggregate the data rather than retrieve individual records.

When creating an aggregation policy, the provider’s policy administrator specifies a minimum group size (i.e. the number of rows that must
be aggregated together into a group). The larger the minimum group size, the less likely it is that a consumer could use the query results
to deduce the contents of a single record.
Once the aggregation policy is applied to a table or view, a query against it must conform to two requirements:

* The query must aggregate the data. If the query uses an aggregation function, it must be one of the
  allowed aggregation functions.
* Each group created by the query must include the aggregate of at least X records, where X is the minimum group size of the aggregation
  policy.

If the query returns a group that contains fewer records than the minimum group size of the policy, then Snowflake combines those groups
into a *remainder group*. Snowflake applies the aggregation function to the appropriate column to return a value for the remainder group.
However, because that value is calculated from rows that belong to more than one group, the value of the GROUP BY key column is NULL. For
example, if the query includes the clause `GROUP BY state`, then the value of `state` in the remainder group is NULL.

A query that does not return enough results to populate a remainder group still works, but returns a NULL value in every field of the
results.

### Limitations

* You cannot protect an external table with an aggregation policy.
* If the query uses an explicit grouping construct, it must be a [GROUP BY](../sql-reference/constructs/group-by.md) clause. The query cannot use
  related constructs like [GROUP BY ROLLUP](../sql-reference/constructs/group-by-rollup.md), [GROUP BY CUBE](../sql-reference/constructs/group-by-cube.md), or [GROUP BY GROUPING SETS](../sql-reference/constructs/group-by-grouping-sets.md).
* Most [set operators](../sql-reference/operators-query.md) are not allowed when one of the queries acts on an
  aggregation-constrained table. As an exception, UNION ALL is supported, but each result group must satisfy the minimum group size of the
  aggregation-constrained tables being queried (see Query requirements for details).
* If a column of an aggregation-constrained table is protected by a [projection policy](projection-policies.md), a query
  against that table cannot use the column as an argument of the COUNT function.
* [Recursive CTEs](queries-cte.md) are not allowed in queries against an aggregation-constrained table or
  view.
* [Window functions](../sql-reference/functions-window.md) are not allowed in queries against an aggregation-constrained table or view.
* A query against an aggregation-constrained table cannot use a [correlated subquery](querying-subqueries.md)
  or [lateral join](../sql-reference/constructs/join-lateral.md) when there are references to or from the portion of the query that meets
  the requirements of the aggregation policy. The following examples illustrate the types of queries that are prohibited.

  Example 1
  :   Assuming `protected_table` is aggregation-constrained, the following query is not allowed because the portion of the query that
      aggregates data references another part of the query outside of the subquery:

      ```sqlexample
      SELECT c1, c2
      FROM open_table
      WHERE c1 = (SELECT x FROM protected_table WHERE y = open_table.c2);
      ```

  Example 2
  :   Assuming `protected_table` is aggregation-constrained, the following query is not allowed because the subquery references the part of
      the query that aggregates data, which is outside of the subquery:

      ```sqlexample
      SELECT
        SUM(SELECT COUNT(*) FROM open_table ot WHERE pt.id = ot.id)
      FROM protected_table pt;
      ```

### Considerations

Consider the following when using aggregation policies to protect sensitive data:

* Aggregation policies protect data for an individual record, not an entity. If a data set contains multiple records belonging to the same
  entity, an aggregation policy only protects the privacy of a specific record pertaining to that entity, not the entire entity.
* While aggregation policies limit access to individual records, they do not guarantee a malicious actor could not use deliberate queries
  to obtain potentially sensitive data from an aggregation-constrained table. With enough query attempts, a malicious actor
  could potentially work around the aggregation requirements to ascertain a value from an individual row.
  Aggregation policies are best suited for use with partners and customers with whom you have an existing level of trust. In addition,
  providers should be vigilant about potential misuses of their data (for example, reviewing the access history for their
  listings).

## Create an aggregation policy

The syntax for creating an aggregation policy is:

> ```sqlsyntax
> CREATE [ OR REPLACE ] AGGREGATION POLICY <name>
>   AS () RETURNS AGGREGATION_CONSTRAINT -> <body>
>   [ COMMENT = '<string_literal>' ];
> ```

Where:

* `name` specifies the name of the policy.
* `AS () RETURNS AGGREGATION_CONSTRAINT` is the signature and return type of the policy. The signature does not accept any arguments
  and the return type is AGGREGATION_CONSTRAINT, which is an internal data type. All aggregation policies have the same signature and return
  type.
* `body` is a SQL expression that determines the restrictions of an aggregation policy.

### Calling functions from the body

The body of an aggregation policy uses two functions to define the constraints of the policy: NO_AGGREGATION_CONSTRAINT and AGGREGATION_CONSTRAINT. When the conditions of the body call one of these functions, the return value from the function determines how
queries against the aggregation-constrained table or view must be formulated to return results.

NO_AGGREGATION_CONSTRAINT
:   Use the body’s expression to call the NO_AGGREGATION_CONSTRAINT function when you want a query to have unrestricted access to the table
    or view to which the aggregation policy is assigned.

AGGREGATION_CONSTRAINT
:   Use the body’s expression to call the AGGREGATION_CONSTRAINT function to require that queries aggregate data in order to return results.
    Use the MIN_GROUP_SIZE argument to specify how many rows or [entities](aggregation-policies-entity-privacy.md) must be
    included in each aggregation group.

For the complete syntax for the NO_AGGREGATION_CONSTRAINT and AGGREGATION_CONSTRAINT functions, see
[CREATE AGGREGATION POLICY](../sql-reference/sql/create-aggregation-policy.md).

> **Note:**
>
> The body of an aggregation policy cannot reference a user-defined function, table, or view.

### Example policies

Fixed minimum group size
:   The simplest aggregation policy calls the AGGREGATION_CONSTRAINT function directly and defines a constant minimum group size that is
    applied to all queries against the table. For example, the following command creates an aggregation policy with a minimum group size of 5:

    > ```sqlexample
    > CREATE AGGREGATION POLICY my_agg_policy
    >   AS () RETURNS AGGREGATION_CONSTRAINT -> AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);
    > ```

Conditional policy
:   Policy administrators can define the SQL expression of an aggregation policy so different queries have different restrictions based on
    factors such as the role of the user executing the query. This strategy can allow one user to query a table without restriction while
    requiring others to aggregate results.

    For example, the following aggregation policy gives users with the role `ADMIN` unrestricted access to a table while requiring all
    other queries to aggregate data into groups of at least 5 rows or entities.

    ```sqlexample
    CREATE AGGREGATION POLICY my_agg_policy
      AS () RETURNS AGGREGATION_CONSTRAINT ->
        CASE
          WHEN CURRENT_ROLE() = 'ADMIN'
            THEN NO_AGGREGATION_CONSTRAINT()
          ELSE AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5)
        END;
    ```

    > **Tip:**
    >
    > You can use the following strategies when using context functions like [CURRENT_ROLE](../sql-reference/functions/current_role.md) in a conditional
    > policy:
    >
    > * Context functions return strings, so comparisons using them are case-sensitive. You can use
    >   [LOWER](../sql-reference/functions/lower.md) to convert strings to all lowercase if you’d like to do a case-insensitive comparison.
    > * The [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function helps you evaluate whether a policy body is returning the correct value
    >   when a context function returns a certain value. The POLICY_CONTEXT function simulates query results based upon a specified value of
    >   one or more context functions.

## Modify an aggregation policy

You can use the [ALTER AGGREGATION POLICY](../sql-reference/sql/alter-aggregation-policy.md) command to modify the SQL expression that determines the minimum group
size of the aggregation policy. You can also rename the policy or change its comment.

Before modifying an aggregation policy, you can execute the [DESCRIBE AGGREGATION POLICY](../sql-reference/sql/desc-aggregation-policy.md) command or
[GET_DDL](../sql-reference/functions/get_ddl.md) function to review the current SQL expression of the policy. The SQL expression that determines the
minimum group size appears in the `BODY` column.

As an example, you can execute the following command to change the SQL expression of the aggregation policy `my_policy` to require a
minimum group size of 2 rows in all circumstances:

> ```sqlexample
> ALTER AGGREGATION POLICY my_policy SET BODY -> AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE=>2);
> ```

## Assign an aggregation policy

Once created, an aggregation policy can be applied to one or more tables or views to make it aggregation-constrained. A table or view can
only have one aggregation policy attached.

Use the SET AGGREGATION POLICY clause of a [ALTER TABLE](../sql-reference/sql/alter-table.md) or [ALTER VIEW](../sql-reference/sql/alter-view.md) command to assign
an aggregation policy to an existing table or view:

> ```sqlsyntax
> ALTER { TABLE | VIEW } <name> SET AGGREGATION POLICY <policy_name> [ FORCE ]
> ```

Where:

* `name` specifies the name of the table or view.
* `policy_name` specifies the name of the aggregation policy.
* `FORCE` is an optional parameter that allows the command to assign the aggregation policy to a table or view that already has an
  aggregation policy assigned to it. The new aggregation policy atomically replaces the existing one.

For example, to assign the policy `my_agg_policy` to the table `t1`, execute:

> ```sqlexample
> ALTER TABLE t1 SET AGGREGATION POLICY my_agg_policy;
> ```

You can also use the WITH clause of the [CREATE TABLE](../sql-reference/sql/create-table.md) and [CREATE VIEW](../sql-reference/sql/create-view.md) commands to assign
an aggregation policy to a table or view at creation time. For example, to assign the policy `my_agg_policy` to a new table, execute:

> ```sqlexample
> CREATE TABLE t1 WITH AGGREGATION POLICY my_agg_policy;
> ```

### Replace an aggregation policy

The recommended method of replacing an aggregation policy is to use the `FORCE` parameter to detach the existing aggregation policy and
assign the new one in a single command. This allows you to atomically replace the old policy, leaving no gap in protection.

For example, to assign a new aggregation policy to a table that is already aggregation-constrained:

```sqlexample
ALTER TABLE privacy SET AGGREGATION POLICY agg_policy_2 FORCE;
```

You can also detach the aggregation policy from a table or view in one statement (… UNSET AGGREGATION POLICY) and then set a new policy
on the table or view in a different statement (… SET AGGREGATION POLICY <name>). If you choose this method, the table is not protected by an
aggregation policy in between detaching one policy and assigning another. A query could potentially access sensitive data during this time.

## Detach an aggregation policy

Use the UNSET AGGREGATION POLICY clause of an ALTER TABLE or ALTER VIEW command to detach an aggregation policy from a table or view in
order to remove the need to aggregate data. The name of the aggregation policy is not required because a table or view cannot have more than
one aggregation policy attached.

> ```sqlsyntax
> ALTER {TABLE | VIEW} <name> UNSET AGGREGATION POLICY
> ```

Where:

* `name` specifies the name of the table or view.

For example, to detach an aggregation policy from view `v1`, execute:

> ```sqlexample
> ALTER VIEW v1 UNSET AGGREGATION POLICY;
> ```

## View aggregation policies with Snowsight

To determine whether a table or view has an aggregation policy, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the table or view.
3. On the Table Details tab, find the Policies section and look for an aggregation policy.
4. To determine the minimum group size of the aggregation policy, find the Minimum Group Size field. If the body of the policy is
   complex and has a different minimum group size under different conditions, `Case dependent` displays instead of a number. For a complex
   body, you can hover over the name of the aggregation policy to view its body to help determine its minimum group size.
5. To determine the columns of the table that combine to make up an entity key, find the Entity Key Columns field. If there is more
   than one entity key for the table or view, the policy appears multiple times in the Policies section, once for each entity key.

## Monitor aggregation policies

It can be helpful to think of two general approaches to determine how to monitor aggregation policy usage.

* Discover aggregation policies
* Identify aggregation policy references

### Discover aggregation policies

You can use the [AGGREGATION_POLICIES](../sql-reference/account-usage/aggregation_policies.md) view in the Account Usage schema of the shared
SNOWFLAKE database. This view is a *catalog* for all aggregation policies in your Snowflake account. For example:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.AGGREGATION_POLICIES
> ORDER BY POLICY_NAME;
> ```

### Identify aggregation policy references

The [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function can identify aggregation policy references. There
are two different syntax options:

1. Return a row for each object (i.e. table or view) that has the specified aggregation policy set on it:

   ```sqlexample
   USE DATABASE my_db;
   USE SCHEMA information_schema;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(policy_name => 'my_db.my_schema.aggpolicy'));
   ```
2. Return a row for each policy assigned to the table named `my_table`:

   ```sqlexample
   USE DATABASE my_db;
   USE SCHEMA information_schema;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(ref_entity_name => 'my_db.my_schema.my_table', ref_entity_domain => 'table'));
   ```

## Query requirements

After an aggregation policy has been applied to a table or view, queries against that table or view must conform to certain requirements.
This section discusses what is and isn’t allowed in a query against an aggregation-constrained table or view.

> **Note:**
>
> Once part of the query properly aggregates data to satisfy the requirements of the aggregation policy, these query restrictions do not
> apply, and another part of the query can include things that are otherwise prohibited.
>
> For example, the following query can use a SELECT statement that does not aggregate results because another part of the query has already satisfied the aggregation requirements of the policy that is assigned to `protected_table`:
>
> ```sqlexample
> SELECT * FROM open_table ot WHERE ot.a > (SELECT SUM(id) FROM protected_table pt)
> ```

For additional restrictions on what can be included in a query, refer to Limitations.

Aggregation functions
:   The following aggregation functions are allowed in a query against an aggregation-constrained table:

    * [AVG](../sql-reference/functions/avg.md)
    * [COUNT [DISTINCT]](../sql-reference/functions/count.md)
    * [HLL](../sql-reference/functions/hll.md)
    * [SUM](../sql-reference/functions/sum.md)

    A query can contain more than one of these allowed aggregation functions. A query fails if it attempts to use an aggregation
    function that is not allowed.

    If you wish to use a preprocessing function within an aggregation function, only the following preprocessing functions are supported:

    |  |  |  |
    | --- | --- | --- |
    | * [CEIL](../sql-reference/functions/ceil.md) * [CONCAT , ||](../sql-reference/functions/concat.md) * [DATE_TRUNC](../sql-reference/functions/date_trunc.md) * [DECRYPT](../sql-reference/functions/decrypt.md) * [ENCRYPT](../sql-reference/functions/encrypt.md) | * [FLOOR](../sql-reference/functions/floor.md) * [GET](../sql-reference/functions/get.md) * [INITCAP](../sql-reference/functions/initcap.md) * [LOWER](../sql-reference/functions/lower.md) * [LTRIM](../sql-reference/functions/ltrim.md) | * [MD5 , MD5_HEX](../sql-reference/functions/md5.md) * [ROUND](../sql-reference/functions/round.md) * [RTRIM](../sql-reference/functions/rtrim.md) * [TRIM](../sql-reference/functions/trim.md) * [UPPER](../sql-reference/functions/upper.md) |

    So, for example:

    * `SELECT myfunc(C1), COUNT(C2) FROM t1 GROUP BY 1;` is valid, because any non-aggregation function is supported at the top level of
      an aggregation policy.
    * `SELECT C1, COUNT(myfunc(C2)) FROM t1 GROUP BY 1;` is invalid because only functions listed above are supported within an aggregation
      function in an aggregation policy.

Grouping statement
:   A query against an aggregation-constrained table must aggregate data into groups of a minimum size. It can use an explicit grouping
    statement (i.e. a GROUP BY clause) or a scalar aggregation function that aggregates the entire data set (for example, `COUNT(*)`).

Filters
:   In general, Snowflake does not restrict how a query uses WHERE and ON clauses to filter the aggregation-constrained table as long as it
    aggregates the rows selected by the filter.

Joins
:   A query can join an aggregation-constrained table with another table, including another aggregation-constrained table.

    Snowflake checks each aggregation group to make sure that the number of rows taken from an aggregation-constrained table meets or exceeds
    the minimum group size of that table. For example, if an aggregation-constrained table `table_a` with a minimum group size of 5 is
    joined with `table_b` with a minimum group size of 3, each group returned by the query must be created using at least 5 rows from
    `table_a` and 3 rows from `table_b`.

    Whether a query with a join meets the requirements of an aggregation-constrained table is determined by the number of rows taken from the
    table, not the size of a group. As a result, the size of a group created from the joined data could be greater than the minimum group
    size of the aggregation-constrained table, but still result in filtered data. For example, suppose:

    * `agg_t` is aggregation constrained with a minimum group size of 2. This table contains a single integer column `c` that has the
      following content: { `1`, `2`, `2` }.
    * `open_t` is unconstrained, and contains an integer column `c` with the following content: { `1`, `1`, `1`, `2` }.

    A user executes the following query that joins the two tables:

    ```sqlexample
    SELECT c, COUNT(*)
    FROM agg_t, open_t
    WHERE agg_t.c = open_t.c
    GROUP BY agg_t.c;
    ```

    The query will return:

    ```output
    +-----------------+
    |  c   | COUNT(*) |
    |------+----------|
    |  2   |  2       |
    |------+----------|
    | null |  3       |
    +-----------------+
    ```

    Even though the second group has 3 records, which is greater than the minimum group size, all of those records correspond to a single
    record in the aggregation-constrained table, so the value is filtered out.

UNION ALL
:   A query can use [UNION ALL](../sql-reference/operators-query.md) to combine results of two subqueries, even if one or more of the queried
    tables are aggregation-constrained. Similar to joins, each group in the results must satisfy the minimum group size of every
    aggregation-constrained table being queried. For example, suppose:

    * Table `protected_table1` has a minimum group size of 2.
    * Table `protected_table2` has a minimum group size of 5.

    If you run the query:

    ```sqlexample
    SELECT a, COUNT(*)
    FROM (
        SELECT a, b FROM protected_table1
        UNION ALL
        SELECT a, b FROM protected_table2
    )
    GROUP BY a;
    ```

    Each group formed by the key `a` must contain 2 records from `protected_table1` and 5 records from `protected_table2`, otherwise
    the records are placed in a remainder group.

External Functions
:   A query cannot call an [external function](../sql-reference/external-functions-introduction.md) unless another part of the query has
    properly aggregated results to meet the requirements of the aggregation-constrained table.

Logging & Metrics
:   A query cannot log a column of an aggregation-constrained table via UDF logging or metrics.

Data Type Conversions
:   A query that includes a data type conversion function in the SELECT statement must use the TRY version of the function. For example, the
    TRY_CAST function is allowed, but the CAST function is prohibited. The following data type conversion functions are allowed for numeric
    types:

    * [TRY_CAST](../sql-reference/functions/try_cast.md)
    * [TRY_TO_DECFLOAT](../sql-reference/functions/try_to_decfloat.md)
    * [TRY_TO_DECIMAL](../sql-reference/functions/try_to_decimal.md)
    * [TRY_TO_DOUBLE](../sql-reference/functions/try_to_double.md)
    * [TRY_TO_NUMBER](../sql-reference/functions/try_to_decimal.md)
    * [TRY_TO_NUMERIC](../sql-reference/functions/try_to_decimal.md)

PIVOT
:   A query cannot use the [PIVOT](../sql-reference/constructs/pivot.md) operator against a column in an aggregation-constrained table.

## Extended example

Creating an aggregation policy and assigning the aggregation policy to a table follows the same general procedure as creating and assigning
other policies, such as masking and projection policies:

1. If you are using a centralized management approach, create a custom role (e.g. `agg_policy_admin`) to manage the policy. Alternatively,
   you can use an existing role.
2. Grant this role the privileges to create and assign an aggregation policy.
3. Create the aggregation policy.
4. Assign the aggregation policy to a table.

Once the aggregation policy is assigned to a table, successful queries against the table must aggregate its data.

The following extended example provides insight into each step in this process, from the provider’s access control administrator creating a
custom role to a data consumer executing a query to return aggregated results.

Access Control Administrator Tasks
:   1. Create a custom role to manage the aggregation policy. You could also re-use an existing role.

       ```sqlexample
       USE ROLE USERADMIN;

       CREATE ROLE AGG_POLICY_ADMIN;
       ```
    2. Grant the `agg_policy_admin` custom role the privileges to create an aggregation policy in a schema and assign the aggregation policy
       to a table or view in the Snowflake account.

       This step assumes the aggregation policy will be stored in a database and schema named `privacy.agg_policies` and this database and
       schema already exist:

       ```sqlexample
       GRANT USAGE ON DATABASE privacy TO ROLE agg_policy_admin;
       GRANT USAGE ON SCHEMA privacy.agg_policies TO ROLE agg_policy_admin;

       GRANT CREATE AGGREGATION POLICY
         ON SCHEMA privacy.agg_policies TO ROLE agg_policy_admin;

       GRANT APPLY AGGREGATION POLICY ON ACCOUNT TO ROLE agg_policy_admin;
       ```

       The `agg_policy_admin` role can now be assigned to one or more users.

       For details about the privileges needed to work with aggregation policies, refer to Privileges and commands
       (in this topic).

Aggregation Policy Administrator Tasks
:   1. Create an aggregation policy to require aggregation and define a minimum group size of 3:

       > ```sqlexample
       > USE ROLE agg_policy_admin;
       > USE SCHEMA privacy.agg_policies;
       >
       > CREATE AGGREGATION POLICY my_policy
       >   AS () RETURNS AGGREGATION_CONSTRAINT -> AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 3);
       > ```
    2. Assign the aggregation policy to a table `t1`:

       > ```sqlexample
       > ALTER TABLE t1 SET AGGREGATION POLICY my_policy;
       > ```

Consumer Query
:   Once the provider shares the aggregation-constrained table, the data consumer can execute queries against it. For this example, assume
    the aggregation-constrained table `t1` contains the following rows:

    | peak | state | elevation |
    | --- | --- | --- |
    | washington | NH | 6288 |
    | cannon | NH | 4080 |
    | kearsarge | NH | 2937 |
    | mansfield | VT | 4395 |
    | killington | VT | 4229 |
    | wachusett | MA | 2006 |

    Now, assume that the consumer executes the following query against `t1`:

    > ```sqlexample
    > SELECT state, AVG(elevation) AS avg_elevation
    > FROM t1
    > GROUP BY state;
    > ```

    The results are:

    > ```output
    > +----------+-----------------+
    > |  STATE   |  AVG_ELEVATION  |
    > |----------+-----------------+
    > |  NH      |  4435           |
    > |  NULL    |  3543           |
    > +----------+-----------------+
    > ```

    Note that the value of `state` in the second group is `NULL` because it is a remainder group that averages the elevation of peaks in
    both `VT` and `MA`.

## Aggregation policies with Snowflake features

The following subsections briefly summarize how aggregation policies interact with various Snowflake features and services.

### Other policies

This section describes how an aggregation policy interacts with other policies, including
[masking policies](security-column-intro.md),
[row access policies](security-row-intro.md), and [projection policies](projection-policies.md).

You can attach other policies to an aggregation-constrained table. A successful query against the table must meet the requirements of all
policies.

If a row access policy is assigned to an aggregation-constrained table, a row excluded from the query results based on the row
access policy is not included when calculating the aggregated results.

The body of a masking policy, row access policy, or projection policy cannot reference an aggregation-constrained table, including its
columns. Similarly, the body of the other policy cannot include a UDF that references the aggregation-constrained table.

### Views and materialized views

You can assign an aggregation policy to both views and materialized views. When an aggregation policy is applied to a view, the underlying
table does not become aggregation-constrained. This base table can still be queried without restriction.

To avoid the possibility of exposing sensitive data, all aggregation-constrained views are treated as if they are
[secure views](views-secure.md) even if they are not.

Whether you can create a view from an aggregation-constrained table depends on the type of view:

> * You can create a regular view from one or more aggregation-constrained tables, however queries against that view must aggregate data in
>   a way that meets the restrictions of those base tables.
> * You cannot create a materialized view based on an aggregation-constrained table or view, nor can you assign an aggregation policy to a
>   table or view upon which a materialized view is based.

### Cloned objects

The following approach helps to safeguard data from users with the SELECT privilege on a cloned table or view that is stored in the cloned
database or schema:

* Cloning an individual aggregation policy object is not supported.
* Cloning a database results in the cloning of all aggregation policies within the database.
* Cloning a schema results in the cloning of all aggregation policies within the schema.
* A cloned table maps to the same aggregation policies as the source table.

  + When a table is cloned in the context of its parent schema cloning, if the source table has a reference to an aggregation policy in the
    same parent schema (i.e. a local reference), the cloned table will have a reference to the cloned aggregation policy.
  + If the source table refers to an aggregation policy in a different schema (i.e. a foreign reference), then the cloned table retains the
    foreign reference.

For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

### Replication

Aggregation policies and their assignments can be replicated using database replication and replication groups.

For [database replication](database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
  replication are on lower editions.
* A table or view contained in the primary database has a [dangling reference](database-replication-considerations.md) to an
  aggregation policy in another database.

The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
[replication group](account-replication-intro.md).

## Privileges and commands

The following subsections provide information to help manage aggregation policies.

### Aggregation policy privileges

Snowflake supports the following privileges on the aggregation policy object.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables the set and unset operations for an aggregation policy on a table. |
| OWNERSHIP | Transfers ownership of the aggregation policy, which grants full control over the aggregation policy. Required to alter most properties of an aggregation policy. |

For details, see Summary of DDL commands, operations, and privileges (in this topic).

### Aggregation policy DDL reference

Snowflake supports the following DDL to create and manage aggregation policies.

* [CREATE AGGREGATION POLICY](../sql-reference/sql/create-aggregation-policy.md)
* [ALTER AGGREGATION POLICY](../sql-reference/sql/alter-aggregation-policy.md)
* [DESCRIBE AGGREGATION POLICY](../sql-reference/sql/desc-aggregation-policy.md)
* [DROP AGGREGATION POLICY](../sql-reference/sql/drop-aggregation-policy.md)
* [SHOW AGGREGATION POLICIES](../sql-reference/sql/show-aggregation-policies.md)

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between aggregation policy privileges and DDL operations.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege required |
| --- | --- |
| Create aggregation policy. | A role with the CREATE AGGREGATION POLICY privilege in the same schema. |
| Alter aggregation policy. | The role with the OWNERSHIP privilege on the aggregation policy. |
| Describe aggregation policy | One of the following:   * A role with the global APPLY AGGREGATION POLICY privilege, or * A role with the OWNERSHIP privilege on the aggregation policy, or * A role with the APPLY privilege on the aggregation policy. |
| Drop aggregation policy. | A role with the OWNERSHIP privilege on the aggregation policy. |
| Show aggregation policies. | One of the following:   * A role with the USAGE privilege on the schema in which the aggregation policy exists, or * A role with the APPLY AGGREGATION POLICY on the account. |
| Set or unset an aggregation policy on a table. | One of the following:   * A role with the APPLY AGGREGATION POLICY privilege on the account, or * A role with the APPLY privilege on the aggregation policy and the OWNERSHIP privilege on the table or view. |

Snowflake supports different permissions to create and set an aggregation policy on an object.

1. For a centralized aggregation policy management approach in which the `aggregation_policy_admin` custom role creates and sets
   aggregation policies on all tables, the following permissions are necessary:

   ```sqlexample
   USE ROLE securityadmin;
   GRANT USAGE ON DATABASE mydb TO ROLE aggregation_policy_admin;
   GRANT USAGE ON SCHEMA mydb.schema TO ROLE aggregation_policy_admin;
   GRANT CREATE AGGREGATION POLICY ON SCHEMA mydb.schema TO ROLE aggregation_policy_admin;
   GRANT APPLY ON AGGREGATION POLICY ON ACCOUNT TO ROLE aggregation_policy_admin;
   ```
2. In a hybrid management approach, a single role has the CREATE AGGREGATION POLICY privilege to ensure aggregation policies are named
   consistently and individual teams or roles have the APPLY privilege for a specific aggregation policy.

   For example, the custom role `finance_role` role can be granted the permission to set the aggregation policy `cost_center` on tables
   and views the role owns (i.e. the role has the OWNERSHIP privilege on the table or view):

   ```sqlexample
   USE ROLE securityadmin;
   GRANT CREATE AGGREGATION POLICY ON SCHEMA mydb.schema TO ROLE aggregation_policy_admin;
   GRANT APPLY ON AGGREGATION POLICY cost_center TO ROLE finance_role;
   ```

---
title: All Partners & Technologies (Alphabetical)
source: https://docs.snowflake.com/en/user-guide/ecosystem-all.md
section: User Guide
---

# All Partners & Technologies (Alphabetical)

This table lists all known 3rd-party partners and technologies that have been certified to provide native connectivity to Snowflake.

If you need to connect to Snowflake using a tool or technology that is not listed here, we suggest attempting to connect through our
[JDBC](../developer-guide/jdbc/jdbc.md) or [ODBC](../developer-guide/odbc/odbc.md) drivers. These drivers provide general, multi-purpose connection
functionality for most tools and technologies.

Also, you are not limited to working with these solutions. Other solutions can be used with Snowflake; however, we do not guarantee that
all features provided by other solutions are supported and will operate without issue.

For more details about a particular solution, click the link in the “Category” column in the table:

| Solution [1] |  | Description | Category | Notes |
| --- | --- | --- | --- | --- |
| **A** |  |  |  |  |
|  |  | Ab Initio — enterprise application integration | [Data Integration](ecosystem-etl.md) |  |
|  |  | Acryl Data — DataHub platform for the enterprise | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Adobe Campaign — marketing campaign management, delivery, and analysis | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Agile Data Engine — data warehouse design, CI/CD automation, and (E)LT orchestration | [Data Integration](ecosystem-etl.md) , . [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | Airbyte — open source integration | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Aginity Pro / Aginity Team — reusable and sharable SQL editing and analysis | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | Alation — enterprise data catalog | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Alteryx — analytic and data science automation | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Alteryx Designer Cloud — cloud-native data profiling and prep | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | ALTR — data governance, security, and intelligence as a service | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Amplitude — self-serve behavioral analytics and modern growth best practices to help companies scale digital revenue | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Anomalo — autonomous data quality monitoring platform for detecting data issues and root causes | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Amazon Data Firehose — cloud analytics platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Amazon Data Firehose — cloud analytics platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Amazon SageMaker — cloud machine-learning platform | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Artie — real-time database replication | [Data Integration](ecosystem-etl.md) |  |
|  |  | Ascend.io — data pipeline automation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Astrato — cloud-based, no-code data analytics and visualization | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Atlan — data governance, security, and cataloging as a service | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | AtScale — data warehouse virtualization platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | AWS QuickSight — cloud-based business analytics service | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Azure Data Factory — serverless data integration and transformation | [Data Integration](ecosystem-etl.md) |  |
| **B** |  |  |  |  |
|  |  | Baffle — enterprise data security solution | [Security, Governance & Observability](ecosystem-security.md) | * Used for [External Tokenization](security-column-ext-token-intro.md). |
|  |  | Bigeye — data observability platform for data quality monitoring and anomaly detection | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | BigID — data discovery, classification, and protection | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Boomi — enterprise integration platform as a service | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | BoostKPI — granular anomaly detection and root cause analysis | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
| **C** |  |  |  |  |
|  |  | CARTO — cloud-native spatial analytics | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | CData Software — data connectivity, integration and automation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Celigo — cloud-native integration and automation platform | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Census — data activation platform for sales, marketing, and ads | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Chartio — cloud-based data analytics | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Coalesce — code-based, GUI-driven data transformation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Collibra — data privacy, governance, and catalog solutions | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Comforte — enterprise data protection and management | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | CyberRes Voltage — privacy and protection for structured and unstructured data | [Security, Governance & Observability](ecosystem-security.md) |  |
| **D** |  |  |  |  |
|  |  | data.world — data cataloging, metadata management, collaboration, and governance | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Datadog — cloud monitoring and incident handling as a service | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Dataguise — security intelligence, protection, and governance for sensitive data | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Dataiku — collaborative data science platform | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Datameer — data preparation | [Data Integration](ecosystem-etl.md) |  |
|  |  | DataOps.live — CI/CD orchestration, environment management, automated testing, & ELT | [SQL Development & Management](ecosystem-editors.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | DataRobot — automated machine learning | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | DataVirtuality — data analytics pipeline (self-service or enterprise) | [Data Integration](ecosystem-etl.md) |  |
|  |  | DBeaver — open source universal SQL client (enterprise edition also available) | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | dbt Labs — in-database integration and transformation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Denodo — data virtualization and federation platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Devart ODBC Driver for Snowflake — enterprise-level ODBC connectivity | [Data Integration](ecosystem-etl.md) |  |
|  |  | Devart Python Connector for Snowflake — enterprise-level Python connectivity | [Data Integration](ecosystem-etl.md) |  |
|  |  | Devart SSIS Data Flow Components for Snowflake — enterprise-level SSIS integration | [Data Integration](ecosystem-etl.md) |  |
|  |  | Diyotta — data integration and migration | [Data Integration](ecosystem-etl.md) | * Acquired by [ThoughtSpot](https://www.thoughtspot.com/) |
|  |  | dlt — Python library for moving data | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Domino — data science platform | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Domo — Business intelligence tools and data visualization | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Domo — Machine Learning and Data Science | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Domo — Connect, combine, and transform | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Domo – Trust and Governance | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | DvSum — data catalog and data intelligence platform | [Security, Governance & Observability](ecosystem-security.md) |  |
| **E** |  |  |  |  |
|  |  | erwin — enterprise data modeling and governance | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | Estuary — cloud data integration service | [Data Integration](ecosystem-etl.md) |  |
|  |  | Etleap — ETL and data pipelines | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Etlworks — cloud data integration service | [Data Integration](ecosystem-etl.md) |  |
| **F** |  |  |  |  |
|  |  | Fivetran — data replication service | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Fortanix — multicloud security for structured and unstructured data | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | The Fosfor Decision Cloud unifies the modern data ecosystem to deliver the long-sought promise of AI: enhanced business outcomes. | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  |  | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
| **G** |  |  |  |  |
|  |  | Go — open source programming language | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | Google Cloud Data Fusion — cloud-native data integration | [Data Integration](ecosystem-etl.md) |  |
|  |  | Google Cloud Dataflow — unified stream and batch data processing | [Data Integration](ecosystem-etl.md) |  |
|  |  | Google Data Studio — data visualization and reporting | [Business Intelligence (BI)](ecosystem-bi.md) |  |
| **H** |  |  |  |  |
|  |  | H2O.ai — enterprise machine learning platform | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Hackolade — visual schema design for big data analytics | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | HashiCorp Vault — protection for secrets and sensitive data | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Heap | [Data Integration](ecosystem-etl.md) |  |
|  |  | Hevo Data CDC for ETL — automated data pipelines for loading data from any source | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Hex — collaborative data science and analytics platform | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Hightouch — reverse ETL for customer data | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Hunters — AI-based autonomous threat hunting | [Security, Governance & Observability](ecosystem-security.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | HVR — enterprise data replication | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
| **I** |  |  |  |  |
|  |  | IBM Cognos Analytics — enterprise business intelligence platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | IBM DataStage — enterprise ETL platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Immuta — policy management and enforcement | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Informatica Cloud — cloud-based enterprise data integration and management | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Informatica Data Governance and Compliance — enterprise data privacy, curation, and analytics | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Informatica Data Loader — Free cloud-based data loader | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Integrate.io — True low-code data pipeline platform with 220+ transformations and 60-second CDC replication to Snowflake | [Data Integration](ecosystem-etl.md) |  |
| **J** |  |  |  |  |
|  |  | JDBC — Java database connectivity | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | jSonar — cloud-based and on-premise security and DCAP | [Security, Governance & Observability](ecosystem-security.md) |  |
| **K** |  |  |  |  |
|  |  | Kafka — Apache open source distributed streaming platform | [Data Integration](ecosystem-etl.md) | * Supported via a Snowflake native connector. |
|  |  | Keboola — cloud-based data integration and manipulation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | KNIME — visual workflows for data and AI work | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Knoema — data discovery, distribution, and management | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
| **L** |  |  |  |  |
|  |  | Lacework — configuration, detection, and compliance solutions | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Looker — data exploration and reporting service | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
| **M** |  |  |  |  |
|  |  | MachEye — AI-powered analytics and intelligent search | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Matillion Data Productivity Cloud — simple, fast cloud-based data loading and migration | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Matillion ETL — full-featured cloud-based enterprise ETL/data integration | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Metabase — open-source business intelligence | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Monte Carlo — ML-powered, end-to-end data observability platform to improve data quality | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Microsoft .NET — open source software framework for developing applications | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | Microsoft Power BI — business analytics suite (cloud and on-premises) | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Microstrategy — enterprise analytics platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Mode — SQL editing, Python development, and report building | [Business Intelligence (BI)](ecosystem-bi.md) |  |
| **N** |  |  |  |  |
|  |  | Nexla — data integration, transformation, monitoring, and APIs | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Node.js — open source JavaScript runtime environment | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | Normalyze — Data Security Posture Management (DSPM) | [Security, Governance & Observability](ecosystem-security.md) |  |
| **O** |  |  |  |  |
|  |  | ODBC — open database connectivity | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | Okera — enterprise data access management | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | OneTrust — data privacy, security, governance, and classification | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Oracle Analytics — cloud-based and desktop analytics | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | OvalEdge — a data catalog for end-to-end data governance | [Security, Governance & Observability](ecosystem-security.md) |  |
| **P** |  |  |  |  |
|  |  | Pentaho Business Analytics — data analytics and visualization platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Pentaho Data Integration — data loading and transformation platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | PHP PDO — interface for accessing databases in PHP | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native driver. |
|  |  | Precog — AI-driven, no-code ELT | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Privacera — cloud-based data access governance | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Protegrity — enterprise data security platform | [Security, Governance & Observability](ecosystem-security.md) | * Used for [External Tokenization](security-column-ext-token-intro.md). |
|  |  | Pyramid Analytics — enterprise business analytics platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | Python — open source general-purpose programming language | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native connector. |
| **Q** |  |  |  |  |
|  |  | Qlik AutoML — automated machine learning platform | [Machine Learning & Data Science](ecosystem-analytics.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) * Formerly Big Squid |
|  |  | Qlik Replicate — data integration and big data management | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Qlik Sense — data analytics platform | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Qlik Talend — open source data integration and management | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Qubole — cloud-based Big Data activation platform | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
| **R** |  |  |  |  |
|  |  | R — open source statistical computing and graphics environment/language | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Redpoint — CDP solution for data quality, identity resolution, and customer profile unification | [Data Integration](ecosystem-etl.md) |  |
|  |  | Rivery — cloud-based data integration and preparation | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
| **S** |  |  |  |  |
|  |  | SAP BusinessObjects — enterprise business intelligence | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | SAP Data Services — enterprise data management | [Data Integration](ecosystem-etl.md) |  |
|  |  | SAS — advanced statistical software suite | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Satori — DataSecOps platform for monitoring, classifying, and auditing data | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | SecuPi — data discovery, monitoring, behavior analytics, and protection | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | SeekWell — Google Sheets-based querying and reporting | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | Segment — web and mobile data collection platform | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Select Star — automated data lineage, catalog, and governance | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Skyflow — Data privacy vault APIs for security, governance, and compliance | [Security, Governance & Observability](ecosystem-security.md) | * Used for [External Tokenization](security-column-ext-token-intro.md). |
|  |  | Sigma Computing — analytic tools and visual interfaces for cloud data platforms | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Sisense — BI, analytics, data visualization, and SQL editing | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Skyvia — cloud-based ETL, ELT and workflow automation | [Data Integration](ecosystem-etl.md) |  |
|  |  | Sled — analytics metadata and metrics | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Snaplogic — enterprise integration platform | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Snowplow — event analytics platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Solace — event streaming and management platform | [Data Integration](ecosystem-etl.md) |  |
|  |  | Solita — integrated DataOps development and operations platform | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | Spark — Apache open source analytic cluster computing framework | [Machine Learning & Data Science](ecosystem-analytics.md) | * Supported via a Snowflake native connector. |
|  |  | Spring Labs — Store and share sensitive data using cryptographic and tokenization solutions | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | SQLAlchemy — open source Python SQL toolkit and object relational mapper | [Native Programmatic Interfaces](ecosystem-lang.md) | * Supported via a Snowflake native Python package. |
|  |  | SqlDBM — online database/data warehouse design and modeling | [SQL Development & Management](ecosystem-editors.md) |  |
|  |  | SQL Workbench — cross-platform SQL query tool | [SQL Development & Management](ecosystem-editors.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Statsig — enterprise experimentation platform | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Stitch — data integration service | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) |
|  |  | Streamkap — streaming ETL | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | StreamSets — continuous data integration platform | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Striim — real-time data integration and streaming analytics | [Data Integration](ecosystem-etl.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Supermetrics — ETL for marketing and analytics data | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
| **T** |  |  |  |  |
|  |  | Tableau Desktop/Server/Online — interactive data visualization and exploration | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | Tableau CRM — visualization and predictive analytics integrated into Salesforce | [Business Intelligence (BI)](ecosystem-bi.md) | * Formerly Salesforce Einstein Analytics |
|  |  | Tableau Prep — data preparation and integration | [Data Integration](ecosystem-etl.md) |  |
|  |  | Tellius — AI-driven analytics & decision intelligence | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  |  | Tamr — Machine learning-driven data clean-up and management | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | Thales — cloud-based data security platform | [Security, Governance & Observability](ecosystem-security.md) |  |
|  |  | ThoughtSpot — enterprise analytics platform | [Business Intelligence (BI)](ecosystem-bi.md) | * [Snowflake Partner Connect](ecosystem-partner-connect.md) * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | TIBCO Spotfire — data science and machine learning platform | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  |  | TIBCO ActiveMatrix BusinessWorks — enterprise integration offering | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | TMMData — data integration, preparation, and management | [Data Integration](ecosystem-etl.md) |  |
|  |  | Trifacta — cloud data preparation and management | [Data Integration](ecosystem-etl.md) |  |
|  |  | Trustlogix — cloud-native data security platform | [Security, Governance & Observability](ecosystem-security.md) |  |
| **W** |  |  |  |  |
|  |  | Wherescape — data warehouse automation | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
|  |  | windsor.ai — ELT connectors to sync data from 325+ sources to Snowflake | [Data Integration](ecosystem-etl.md) |  |
|  |  | Workato — application integration and automation | [Data Integration](ecosystem-etl.md) | * [Snowflake Ready Validated](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/) |
| **Z** |  |  |  |  |
|  |  | Zepl — enterprise data science and analytics platform | [Machine Learning & Data Science](ecosystem-analytics.md) |  |

[1]

**Disclaimer:** Partner’s names and logos are the property of their respective owner. Customers are responsible for determining
if solutions or integrations offered by such Partners meet their specific requirements, including around security.

---
title: Allow access to Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-allow.md
section: User Guide
---

# Allow access to Google Cloud Storage

If your Google Cloud organization enforces a
[domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains),
a Google Cloud administrator must allow the
Google Workspace customer ID in the domain restriction so that the Snowflake service account can access your storage.

> **Important:**
>
> If your Google Cloud organization was created on or after May 3, 2024, Google Cloud enforces a
> [domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains)
> in project organization policies. The default constraint lists your domain as the only allowed value.
>
> To allow the Snowflake service account access to your storage, you must
> update the domain restriction.

## Retrieve the Google Workspace customer ID

Before you can update an organization policy, you must retrieve the
Google Workspace customer ID associated with the Snowflake service account.

Call the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../sql-reference/functions/system_get_snowflake_platform_info.md) function:

```sqlexample
SELECT SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO();
```

The function returns the project ID and Google Workspace customer ID (`snowflake-customer-directory-id`) for the Snowflake service account.

Example output:

```output
{
  "snowflake-project-id":["preprod-deployment1-a12b"],
  "snowflake-customer-directory-id":["A01bcd2ef"]
}
```

## Update the allow list for a domain constraint

To update the allow list for your domain constraint, you must update your organization policy. Specifically,
you must add the Google Workspace customer ID for the Snowflake service account to the `allowed_values`
list in the constraint.

For instructions, see
[Setting the organization policy](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains#setting_the_organization_policy)
in the Google Cloud documentation.

---
title: Allow the VNet subnet IDs
source: https://docs.snowflake.com/en/user-guide/data-load-azure-allow.md
section: User Guide
---

# Allow the VNet subnet IDs

This topic provides guidance for explicitly granting Snowflake access to
your Microsoft Azure storage account (containers, the objects in those containers, and your storage queues).
The process involves allowing the Azure Virtual Network (VNet) subnet IDs for your Snowflake account.

Allowing VNet subnet IDs is required
only if [Azure storage firewall](https://docs.microsoft.com/en-us/azure/storage/common/storage-network-security) is configured
to block all unauthorized traffic to your Azure storage account.

> **Note:**
>
> This process must be completed by an Azure administrator in your organization.

To allow the Snowflake VNet subnet IDs:

1. Log in to your Snowflake account using [any supported client](../guides-overview-connecting.md).
2. Run [USE ROLE](../sql-reference/sql/use-role.md) to set ACCOUNTADMIN as the active role for the user session.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
3. Query the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../sql-reference/functions/system_get_snowflake_platform_info.md) function to retrieve the IDs of the VNet subnet
   in which your Snowflake account is located:

   ```sqlexample
   SELECT SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO();
   ```

   Record the VNet subnet IDs that the query returns.
4. Follow the instructions in
   [Managing virtual network rules](https://learn.microsoft.com/en-us/azure/storage/common/storage-network-security?tabs=azure-portal#managing-virtual-network-rules)
   to add a network rule for each Snowflake VNet subnet ID. You must add a network rule for each of the subnet IDs returned
   by the SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO function.

   > **Note:**
   >
   > Azure might return an error similar to the following:
   >
   > ```bash
   > Unable retrieve endpoint status for one or more subnets. Status 'insufficient permissions' indicates lack of subnet read permissions ('Microsoft.Network/virtualNetworks/subnets/read').
   > ```
   >
   > The error indicates that your Azure storage account may not initiate connections to Snowflake because those permissions are not granted. You can ignore this error. It will not block the allow feature.

For additional options for managing virtual network rules, see the [Azure documentation](https://docs.microsoft.com/en-us/azure/storage/common/storage-network-security).

For help with this configuration process or any of the other Azure configuration steps, contact the Azure administrator for your organization.

**Next:** [Configure an Azure container for loading data](data-load-azure-config.md)

---
title: Allowing Host names
source: https://docs.snowflake.com/en/user-guide/hostname-allowlist.md
section: User Guide
---

# Allowing Host names

All Snowflake clients, such as SnowSQL, JDBC driver, and ODBC driver, require permanent access to cloud storage (Amazon S3, Google Cloud Storage,
or Microsoft Azure), as well as other web-based hosts, to perform various runtime operations. To ensure access, particularly in a
[secure/private network](admin-security-privatelink.md), you must allow the host names for the required hosts.

The host names that need to be allowed depend on your AWS, Google Cloud, or Microsoft Azure cloud platform and the region where your Snowflake
account is located.

Use the [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) function for general accounts or
[SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md) function for accounts using private connectivity to the Snowflake service to
obtain the host names for your Snowflake account.

Use [SnowCD](snowcd.md) to ensure the provided endpoints are allowed.

---
title: Allowing the Virtual Private Cloud IDs
source: https://docs.snowflake.com/en/user-guide/data-load-s3-allow.md
section: User Guide
---

# Allowing the Virtual Private Cloud IDs

This topic describes how an AWS administrator in your organization can explicitly grant Snowflake access to your AWS S3 storage account (i.e. your buckets and the objects in those buckets). The process involves allowing the Amazon Virtual Private Cloud (Amazon VPC) IDs for your Snowflake account.

> **Important:**
>
> This security feature currently requires that your S3 bucket is located in the same AWS [region](intro-regions.md) as your Snowflake account.

To allow the Amazon VPC IDs for your Snowflake account:

1. Log into your Snowflake account using any supported client.
2. Execute [USE ROLE](../sql-reference/sql/use-role.md) to set ACCOUNTADMIN as the active role for the user session.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
3. Query the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../sql-reference/functions/system_get_snowflake_platform_info.md) function to retrieve the IDs of the AWS Virtual Network (VNet) in which your Snowflake account is located:

   ```sqlexample
   SELECT SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO();
   ```

   Record the VPC IDs returned by the query.
4. Allow the VPC IDs by creating an [Amazon S3 policy for a specific VPC](https://docs.aws.amazon.com/AmazonS3/latest/dev/example-bucket-policies-vpc-endpoint.html?shortFooter=true#example-bucket-policies-restrict-access-vpc).
5. Provide an AWS Identity and Access Management (IAM) role to Snowflake to access the allowed Amazon S3 bucket instead of the AWS key and secret.

For help with this configuration process or any of the other AWS configuration steps, please contact your organization’s AWS administrator.

**Next:** [Configuring secure access to Amazon S3](data-load-s3-config.md)

---
title: Alter existing dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-alter.md
section: User Guide
---

# Alter existing dynamic tables

This section describes making changes to existing dynamic tables using the [ALTER DYNAMIC TABLE](../sql-reference/sql/alter-dynamic-table.md) command:

* Change the warehouse or target lag of your dynamic tables
* Rename, swap, or add clustering keys to your dynamic tables

## Alter the warehouse or target lag for dynamic tables

Adjust your dynamic tables’ warehouse for cost efficiency or performance boost. For more information, see
[Compute costs](dynamic-tables-cost.md) and [Understand warehouse usage for dynamic tables](dynamic-tables-warehouses.md).

Adjust your dynamic table’s target lag to get fresher data in the following situations:

* **You need fresher data**: Reduce target lag to trigger more frequent refreshes.
* **You want to reduce cost**: Data that doesn’t need near real-time freshness can use a
  longer target lag. For example, a dynamic table that refreshes every 20 minutes but only
  needs to be within one hour of the source tables can use a one-hour target lag to reduce
  compute costs.
* **Your pipeline has misaligned schedules**: When your dynamic table depends on other tables
  with longer refresh intervals, align the target lag with those dependencies to avoid
  unnecessary refreshes.
* **You’re seeing skipped refreshes**: When refreshes take longer than your target lag,
  Snowflake skips some refreshes. Increase the target lag to match realistic refresh durations.

For more information, see [Understanding dynamic table target lag](dynamic-tables-target-lag.md).

To change the warehouse or target lag for a dynamic table, use the [ALTER DYNAMIC TABLE](../sql-reference/sql/alter-dynamic-table.md) command. For example:

```sqlexample
-- Change the warehouse for my_dynamic_table to my_other_wh:
ALTER DYNAMIC TABLE my_dynamic_table SET
  WAREHOUSE = my_other_wh;
```

```sqlexample
-- Specify the downstream target lag for a dynamic table:
ALTER DYNAMIC TABLE my_dynamic_table SET
  TARGET_LAG = DOWNSTREAM;
```

## Rename dynamic tables

Renaming a dynamic table can be useful in scenarios where you have scripts or applications that rely on a specific table name, and you want to
update the dynamic table without changing your existing script. For example, if you have a script that references a specific dynamic table
name, renaming the table allows you to swap out the underlying table while keeping the script unchanged. This ensures continuity and avoids
the hassle of updating multiple references across scripts or processes.

To rename a dynamic table, use the [ALTER DYNAMIC TABLE … RENAME TO](../sql-reference/sql/alter-dynamic-table.md) command. For example:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table RENAME TO my_new_dynamic_table;
```

## Swap dynamic tables

Swapping dynamic tables allows for a seamless transition between datasets or table versions without disrupting workflows or modifying dependent
scripts. For example, if you’re developing a new version of a table but want to keep the same name for ongoing processes, swapping lets you
replace the old table with the new one. This approach ensures continuity while enabling updates, testing, or upgrades with minimal downtime or
disruption.

To swap a dynamic table, use the [ALTER DYNAMIC TABLE … SWAP WITH](../sql-reference/sql/alter-dynamic-table.md) command. Note that you can
only swap a dynamic table with another dynamic table.

For example:

```sqlexample
-- Swap my_dynamic_table with the my_new_dynamic_table:
ALTER DYNAMIC TABLE my_dynamic_table SWAP WITH my_new_dynamic_table;
```

## Add clustering keys to dynamic tables

Adding clustering keys to dynamic tables can enhance performance by improving query efficiency and refresh operations:

* Query efficiency: Clustering keys can help speed up queries, just like with regular tables, by clustering on common join keys or filter
  columns.
* Refresh operations: Clustering keys can help speed up refreshes if the clustering keys align with frequent change patterns; for example,
  clustering by user ID can be effective when you have updates where a handful of users change.

Clustering keys can be specified for a dynamic table with incremental or full refresh mode. In full refresh, the clustering is performed
during the refresh and background reclustering isn’t needed.

To cluster a dynamic table, use the [ALTER DYNAMIC TABLE … CLUSTER BY](../sql-reference/sql/alter-dynamic-table.md) command:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table CLUSTER BY (date);
```

---
title: Analyze query profiles for hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-read-query-profiles.md
section: User Guide
---

# Analyze query profiles for hybrid tables

Unistore workloads pose some interesting questions about query execution that you can investigate by using the Snowsight Query Profile
feature or information gleaned from [EXPLAIN](../sql-reference/sql/explain.md) output. In addition to monitoring overall performance and throughput, you
may want to know if a table scan is being executed against the row store or object storage, or whether a specific secondary index is being used.

This section identifies Query Profile operators and attributes that pertain to hybrid table operations and presents some examples to help
you understand how to read query plans that access hybrid tables. See also [Monitor query activity with Query History](ui-snowsight-activity.md).

## Hybrid table scans and index scans

Table and index scan operators appear in query profiles to show access to hybrid tables. These operators typically appear at
the bottom of the tree, representing the first step in reading the data that is needed to execute a specific query. Queries against standard
tables always use table scans; they do not use index scans.

When a primary key index is used to scan a hybrid table, a TableScan operator appears in the query profile, not an
IndexScan operator. When any other index is used to scan a hybrid table, such as a secondary index, you will see an IndexScan operator.

Under Attributes for the IndexScan operator, you can see the fully qualified name of the index and
Access predicates. These are the predicates that are applied to the index during the scan. You can also see predicates for
filters that are applied during table scans.

When a predicate is “pushed” to an index, the predicate contains a placeholder, inside parentheses, for the constant that was used in the query.
For example: `SENSOR_DATA_DEVICE2.DEVICE_ID = (:SFAP_PRE_NR_1)`

### Scan mode

Hybrid table data is maintained in two formats to serve both operational and analytical workloads. A common question asked by administrators is
whether a given query will access the row store or the column store (object storage). A query may read from one or both types of storage,
depending on the tables in question, the specific requirements of the query, availability of indexes, and other factors.

The query profile for hybrid table queries includes a Scan Mode attribute for each table scan operator in the tree:

* ROW-BASED: The query reads from the table data in the row store, or uses indexes to compute query results.
* COLUMN-BASED: The query reads from an object storage copy of the same data that was loaded into the row store. Index scans can also access
  object storage, for [Time Travel](data-time-travel.md) queries.

Scan mode is specific to hybrid tables. If a table scan is run on a standard table, no Scan Mode attribute is displayed.

### Data read from the columnar warehouse cache

Where possible, table scans for hybrid tables read data from a columnar warehouse cache. This cache is an extension to the standard warehouse cache; see [Optimizing the warehouse cache](performance-query-warehouse-cache.md). The cache contains data that has been read from the hybrid table storage provider and is
accessible by read-only queries against hybrid tables.

To see cache usage in a given query profile, select the table scan operator and check the Percentage scanned from cache under Statistics.

Queries that select from hybrid tables do not benefit from the [query results cache](querying-persisted-results.md).

## Throttling for hybrid table requests

In the Profile Overview, you can see a Hybrid Table Requests Throttling percentage. To see this overview, do not select an operator in
the tree; the overview applies to the whole query plan.

For example, the following query recorded that 87.5% of its execution time was spent being
throttled by the hybrid table storage provider. A high throttling percentage is an indicator that too many hybrid table read and write requests are
being sent to the storage provider, relative to the quota for the database. For more information, see
[Quotas and throttling](tables-hybrid-limitations.md).

## Examples

The following Snowsight examples of query profiles show attributes specific to hybrid table
operations. To understand these examples, you do not need to create and load the tables that are queried and modified. However,
here is the CREATE TABLE statement for one of the tables for reference. Note the definition of the PRIMARY KEY constraint (on the
`timestamp` column) and a secondary index (on the `device_id` column):

```sqlexample
CREATE OR REPLACE HYBRID TABLE sensor_data_device1 (
  timestamp TIMESTAMP_NTZ PRIMARY KEY,
  device_id VARCHAR(10),
  temperature DECIMAL(6,4),
  vibration DECIMAL(6,4),
  motor_rpm INT,
  INDEX device_idx(device_id)
 );
```

Another similar hybrid table, `sensor_data_device2`, is also used in the examples.

### Query plan that accesses the primary key column

When your query filters the primary key of the table (`timestamp`), which is automatically indexed, the query profile uses a
TableScan operator. Also note that ROW_BASED scan mode is used for this query.

```sqlexample
SELECT * FROM sensor_data_device1 WHERE timestamp='2024-03-01 13:45:56.000';
```

### Query plan that accesses a secondary index

The query that generated this profile looks like this:

```sqlexample
SELECT COUNT(*) FROM sensor_data_device1 WHERE device_id='DEVICE2';
```

Only part of the profile is shown here, focusing on the IndexScan operator and its attributes.
The scan mode is ROW_BASED, and you can see the complete predicate by hovering over Access Predicates.
The fully qualified index name is also displayed.

See also [INCLUDE columns](tables-hybrid-index.md).

### Query plan for DML on a hybrid table

DML operations on hybrid tables typically modify single rows. For example:

```sqlexample
UPDATE sensor_data_device2 SET device_id='DEVICE3' WHERE timestamp = '2024-04-02 00:00:05.000';
```

The query profile for the TableScan operator shows that this UPDATE accesses the row store for the
hybrid table (scan mode is ROW_BASED):

### Recurring query that benefits from cached data

In this case, assume that the following query is run twice in quick succession on a hybrid table.

```sqlsyntax
SELECT device_id, AVG(temperature)
  FROM sensor_data_device2
  WHERE temperature>33
  GROUP BY device_id;
```

The first query reads all of the data from object storage. The second run of the query reads 100% of the data from the columnar cache.
Also note that the scan mode for this query is COLUMN_BASED.

### Query plan for a join (hybrid table to standard table)

When you join a hybrid table to a standard table, you will see a Scan Mode attribute for the scan on the hybrid table, but not on
the standard table. For example, the TableScan operator on the left side of this join plan used ROW_BASED scan mode. The `order_header`
table is a hybrid table with `order_id` as its primary key (the joining column in this example). The other table, `truck_history`, is a standard table.

---
title: Analyzing data with window functions
source: https://docs.snowflake.com/en/user-guide/functions-window-using.md
section: User Guide
---

# Analyzing data with window functions

This topic contains introductory conceptual information about window functions. If you are already familiar with the usage of
window functions, you might find the following reference information sufficient:

> * [Window functions](../sql-reference/functions-window.md), which contains a list of functions and links to individual function
>   descriptions.
> * [Window function syntax and usage](../sql-reference/functions-window-syntax.md), which describes general syntax rules for all window functions.

## Introduction

A window function is an analytic SQL function that operates on a group of related rows known as a *partition*. A partition is usually
a logical group of rows along some familiar dimension, such as product category, location, time period, or business unit. Function results are
computed over each partition, with respect to an implicit or explicit *window frame*. A window frame is a fixed or variable set of rows relative
to the *current row*. The current row is a single input row for which the function result is currently being computed. Function results are calculated
row by row within each partition, and each row in the window frame takes its turn as the current row.

The syntax that defines this behavior is the OVER clause for the function. In many cases, the OVER clause distinguishes a window function
from a regular SQL function with the same name (such as AVG or SUM). The OVER clause consists of three main components:

* A PARTITION BY clause
* An ORDER BY clause
* A window frame specification

Depending on the function or query in question, all of these components may be optional; a window function with an empty OVER clause is valid:
`OVER()`. However, in most analytic queries, window functions require one or more explicit OVER clause components. You can call a window
function in any context that supports other SQL functions. The following sections explain the concepts behind window functions in more detail
and present some introductory examples. For complete syntax information, see [Window function syntax and usage](../sql-reference/functions-window-syntax.md).

## Window functions versus aggregate functions

A good way to start learning about window functions is to compare regular aggregate functions with their window function counterparts. Several
standard [aggregate functions](../sql-reference/functions-aggregation.md), such as SUM, COUNT, and AVG, have corresponding window functions
with the same name. To distinguish the two, note that:

* For an aggregate function, the input is a group of rows, and the output is one row.
* For a window function, the input is each row within a partition, and the output is one row *per input row*.

For example, the SUM aggregate function returns a single total value for all of the input rows, whereas a window function returns multiple
totals: one for each row (the current row) relative to all the other rows in the partition.

To see how this works, first [create and load the menu_items table](../sql-reference/functions/stddev.md), which contains the cost of goods
sold and prices for foodtruck menu items. Use a regular AVG function to find the average cost of goods for menu items in different categories:

```sqlexample
SELECT menu_category,
    AVG(menu_cogs_usd) avg_cogs
  FROM menu_items
  GROUP BY 1
  ORDER BY menu_category;
```

```output
+---------------+------------+
| MENU_CATEGORY |   AVG_COGS |
|---------------+------------|
| Beverage      | 0.60000000 |
| Dessert       | 1.79166667 |
| Main          | 6.11046512 |
| Snack         | 3.10000000 |
+---------------+------------+
```

Note that the function returns one grouped result for `avg_cogs`.

Alternatively, you can specify an OVER clause and use AVG as a window function. (The result is limited to 15 rows from the 60-row table.)

```sqlexample
SELECT menu_category,
    AVG(menu_cogs_usd) OVER(PARTITION BY menu_category) avg_cogs
  FROM menu_items
  ORDER BY menu_category
  LIMIT 15;
```

```output
+---------------+----------+
| MENU_CATEGORY | AVG_COGS |
|---------------+----------|
| Beverage      |  0.60000 |
| Beverage      |  0.60000 |
| Beverage      |  0.60000 |
| Beverage      |  0.60000 |
| Dessert       |  1.79166 |
| Dessert       |  1.79166 |
| Dessert       |  1.79166 |
| Dessert       |  1.79166 |
| Dessert       |  1.79166 |
| Dessert       |  1.79166 |
| Main          |  6.11046 |
| Main          |  6.11046 |
| Main          |  6.11046 |
| Main          |  6.11046 |
| Main          |  6.11046 |
+---------------+----------+
```

Note that the function returns an average for each row in each partition and resets the calculation when the partitioning column value
changes. To make the value of the window function more apparent, add an ORDER BY clause and a window frame to the function definition.
Also return the raw `menu_cogs_usd` values, in addition to the averages, so you can see how the specific calculations work. This query is
a simple example of a “moving average,” a rolling calculation that depends on an explicit window frame. For more examples like this, see
[Analyzing time-series data](querying-time-series-data.md).

```sqlexample
SELECT menu_category, menu_price_usd, menu_cogs_usd,
    AVG(menu_cogs_usd) OVER(PARTITION BY menu_category
      ORDER BY menu_price_usd, menu_cogs_usd ROWS BETWEEN CURRENT ROW and 2 FOLLOWING) avg_cogs
  FROM menu_items
  ORDER BY menu_category, menu_price_usd, menu_cogs_usd
  LIMIT 15;
```

```output
+---------------+----------------+---------------+----------+
| MENU_CATEGORY | MENU_PRICE_USD | MENU_COGS_USD | AVG_COGS |
|---------------+----------------+---------------+----------|
| Beverage      |           2.00 |          0.50 |  0.58333 |
| Beverage      |           3.00 |          0.50 |  0.63333 |
| Beverage      |           3.00 |          0.75 |  0.70000 |
| Beverage      |           3.50 |          0.65 |  0.65000 |
| Dessert       |           3.00 |          0.50 |  0.91666 |
| Dessert       |           4.00 |          1.00 |  1.58333 |
| Dessert       |           5.00 |          1.25 |  2.08333 |
| Dessert       |           6.00 |          2.50 |  2.66666 |
| Dessert       |           6.00 |          2.50 |  2.75000 |
| Dessert       |           7.00 |          3.00 |  3.00000 |
| Main          |           5.00 |          1.50 |  2.03333 |
| Main          |           6.00 |          2.60 |  3.00000 |
| Main          |           6.00 |          2.00 |  2.33333 |
| Main          |           6.00 |          2.40 |  3.13333 |
| Main          |           8.00 |          4.00 |  3.66666 |
+---------------+----------------+---------------+----------+
```

The window frame adjusts the average calculations such that only the current row and the two rows that follow it (within the
partition) are considered. The last row in a partition has no following rows so the average for the last `Beverage` row, for example,
is the same as the corresponding `menu_cogs_usd` value (`0.65`). The output of the window function depends on the
individual row that is passed to the function and the values of the other rows that qualify for the window frame.

> **Note:**
>
> When using window functions with ORDER BY clauses, ensure that the ordering is deterministic. If multiple rows
> have the same value for the ORDER BY columns, add additional columns as tiebreakers to ensure consistent,
> predictable results across query executions. In this example, `menu_cogs_usd` is included as a tiebreaker because
> multiple rows can have the same `menu_price_usd` value.

## Ordering the rows for window functions

The previous AVG window function example uses an ORDER BY clause within the function definition to ensure that the window frame is
subject to data that is sorted (by `menu_price_usd` in this case).

Two types of window functions require an ORDER BY clause:

> * Window functions with explicit window frames, which perform rolling operations on subsets of the rows in each partition, such as
>   calculating running totals or moving averages. Without an ORDER BY clause, the window frame is meaningless; the set of “preceding” and
>   “following” rows must be deterministic.
> * Ranking window functions, such as CUME_DIST, RANK, and DENSE_RANK, which return information based on the “rank” of a row. For example,
>   if you rank stores in descending order by profit per month, the store with the highest profit will be ranked 1; the second-most profitable
>   store will be ranked 2, and so on.

The ORDER BY clause for a window function supports the same syntax as the main ORDER BY clause that sorts the final results of a query. These
two ORDER BY clauses are separate and distinct. An ORDER BY clause within an OVER clause controls only the order in which the window function
processes rows; it does not control the output of the entire query. In many cases, your window function queries will contain both types of
ORDER BY clauses.

> **Note:**
>
> The ORDER BY clause for window functions does not support the use of an ordinal position, such as `OVER (PARTITION BY 1 ORDER BY 2)`.
> In this context, `2` is interpreted as the constant `2`; it does not refer to the second column in the query.

The PARTITION BY and ORDER BY clauses within the OVER clause are also independent. You can use the ORDER BY clause without the PARTITION BY
clause and vice versa.

Check the syntax for individual window functions before writing queries. Syntax requirements for the ORDER BY clause vary by function:

* Some window functions require an ORDER BY clause.
* Some window functions use an ORDER BY clause if one is present, but do not require it.
* Some window functions do not allow an ORDER BY clause.
* Some window functions interpret an ORDER BY clause as an implied window frame.

> **Caution:**
>
> Generally speaking, SQL is an explicit language, with few implied clauses. However, for some window functions, an ORDER BY clause implies
> a window frame. For details, see [Usage notes for window frames](../sql-reference/functions-window-syntax.md).
>
> Because behavior that is implied rather than explicit can lead to results that are difficult to understand, Snowflake recommends declaring
> window frames explicitly.

## Using different types of window frames

Window frames are defined explicitly or implicitly. They depend on the presence of an ORDER BY clause within the OVER clause:

* For explicit frame syntax, see the `windowFrameClause` under [Syntax](../sql-reference/functions-window-syntax.md). You can define open-ended boundaries: from the beginning of the partition to the current row; from the current row to the end of the partition; or completely “unbounded” end to end. Alternatively, you can use explicit offsets (inclusive) that are relative to the current row in the partition.
* Implicit frames are used by default when the OVER clause does not include a `windowFrameClause`. The default frame depends on the function in question. See also [Usage notes for window frames](../sql-reference/functions-window-syntax.md).

### Range-based versus row-based window frames

Snowflake supports two main types of window frames:

Row-based:
:   An exact sequence of rows belongs to the frame, based on a *physical* offset from the current row. For example, `5 PRECEDING` means the five rows preceding the current row. The offset must be a number. ROWS mode is inclusive and is always relative to the current row. If the specified number of preceding or following rows extends beyond the limits of the partition, Snowflake treats the value as NULL.

    If the frame has open-ended rather than explicitly numbered boundaries, a similar physical offset applies. For example,
    ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW means that the frame consists of the whole set of rows (zero or more) that physically precede the current row and the current row itself.

Range-based:
:   A *logical* range of rows belongs to the frame, given an offset from the ORDER BY value for the current row. For example, `5 PRECEDING` means rows with ORDER BY values that have the ORDER BY value of the current row, plus or minus a maximum of 5 (plus for DESC order, minus for ASC order). The offset value may be a number or an interval.

    If the frame has open-ended rather than numbered boundaries, a similar logical offset applies. For example, RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW means that the frame consists of all the rows that physically precede the current row, the current row itself, *and* any adjacent rows that have the same ORDER BY value as the current row. For a RANGE window frame, CURRENT ROW does not mean physically the current row; it means all the rows that have the same ORDER BY value as the current physical row.

The distinctions in ROWS BETWEEN and RANGE BETWEEN window frames are important because window function queries may return very different results, depending on the ORDER BY expression, the data in the tables, and the exact definition of the frame. The following examples demonstrate the differences in behavior.

#### Comparing RANGE BETWEEN and ROWS BETWEEN with explicit offsets

A range-based window frame requires an ORDER BY column or expression and a RANGE BETWEEN specification. The logical boundary of the window frame depends on the ORDER BY value (a numeric constant or interval literal) for the current row.

For example, a time-series table named `heavy_weather` is defined as follows:

```sqlexample
CREATE OR REPLACE TABLE heavy_weather
  (start_time TIMESTAMP, precip NUMBER(3,2), city VARCHAR(20), county VARCHAR(20));
```

Sample rows in this table look like this:

```output
+-------------------------+--------+-------+-------------+
| START_TIME              | PRECIP | CITY  | COUNTY      |
|-------------------------+--------+-------+-------------|
| 2021-12-30 11:23:00.000 |   0.12 | Lebec | Los Angeles |
| 2021-12-30 11:43:00.000 |   0.98 | Lebec | Los Angeles |
| 2021-12-30 13:53:00.000 |   0.23 | Lebec | Los Angeles |
| 2021-12-30 14:53:00.000 |   0.13 | Lebec | Los Angeles |
| 2021-12-30 15:15:00.000 |   0.29 | Lebec | Los Angeles |
| 2021-12-30 17:53:00.000 |   0.10 | Lebec | Los Angeles |
| 2021-12-30 18:53:00.000 |   0.09 | Lebec | Los Angeles |
| 2021-12-30 19:53:00.000 |   0.07 | Lebec | Los Angeles |
| 2021-12-30 20:53:00.000 |   0.07 | Lebec | Los Angeles |
+-------------------------+--------+-------+-------------+
```

Assume that a query computes a 3-hour moving average (AVG) over the `precip` (precipitation) column, using a window frame ordered by `start_time`:

```sqlexample
AVG(precip)
  OVER(ORDER BY start_time
    RANGE BETWEEN CURRENT ROW AND INTERVAL '3 hours' FOLLOWING)
```

Given the sample rows above, when the current row is `2021-12-30 11:23:00.000` (the first sample row), only the next two rows fall inside the frame
(`2021-12-30 11:43:00.000` and `2021-12-30 13:53:00.000`). Subsequent timestamps are greater than 3 hours later.

However, if you change the window frame to a 1-day interval, all of the sample rows that follow the current row fall inside the frame because
they all have timestamps on the same date (`2021-12-30`):

```sqlexample
RANGE BETWEEN CURRENT ROW AND INTERVAL '1 day' FOLLOWING
```

If you were to change this syntax from RANGE BETWEEN to ROWS BETWEEN, the frame would have to specify fixed boundaries, which represent an exact number
of rows: the current row plus the following exact ordered number of rows, such as 1, 3, or 10 rows, regardless of the values returned by the ORDER BY
expression.

See also [RANGE BETWEEN example with explicit numeric offsets](../sql-reference/functions-window-syntax.md).

#### Comparing RANGE BETWEEN and ROWS BETWEEN with open-ended boundaries

The following example compares results when the following window frames are calculated against the same set of rows:

```sqlexample
ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
```

This example selects from a small table named `menu_items`. See [Create and load the menu_items table](../sql-reference/functions/stddev.md).

The SUM window function aggregates the `menu_price_usd` values for each `menu_category` partition. With the ROWS BETWEEN
syntax, it is easy to see how the running totals are cumulative within each partition.

```sqlexample
SELECT menu_category, menu_price_usd,
    SUM(menu_price_usd)
      OVER(PARTITION BY menu_category ORDER BY menu_price_usd
      ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) sum_price
  FROM menu_items
  WHERE menu_category IN('Beverage','Dessert','Snack')
  ORDER BY menu_category, menu_price_usd;
```

```output
+---------------+----------------+-----------+
| MENU_CATEGORY | MENU_PRICE_USD | SUM_PRICE |
|---------------+----------------+-----------|
| Beverage      |           2.00 |      2.00 |
| Beverage      |           3.00 |      5.00 |
| Beverage      |           3.00 |      8.00 |
| Beverage      |           3.50 |     11.50 |
| Dessert       |           3.00 |      3.00 |
| Dessert       |           4.00 |      7.00 |
| Dessert       |           5.00 |     12.00 |
| Dessert       |           6.00 |     18.00 |
| Dessert       |           6.00 |     24.00 |
| Dessert       |           7.00 |     31.00 |
| Snack         |           6.00 |      6.00 |
| Snack         |           6.00 |     12.00 |
| Snack         |           7.00 |     19.00 |
| Snack         |           9.00 |     28.00 |
| Snack         |          11.00 |     39.00 |
+---------------+----------------+-----------+
```

When the RANGE BETWEEN syntax is used with an otherwise identical query, the calculations are not so obvious at
first; they depend on a different interpretation of *current row*: the current row itself plus any adjacent rows
that have the same ORDER BY value as that row.

For example, the `sum_price` values for the second and third rows in the result are both `8.00` because the
ORDER BY value for those rows is the same. This behavior occurs in two other places in the result set,
where `sum_price` is calculated consecutively as `24.00` and `12.00`.

```sqlexample
SELECT menu_category, menu_price_usd,
    SUM(menu_price_usd)
      OVER(PARTITION BY menu_category ORDER BY menu_price_usd
      RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) sum_price
  FROM menu_items
  WHERE menu_category IN('Beverage','Dessert','Snack')
  ORDER BY menu_category, menu_price_usd;
```

```output
+---------------+----------------+-----------+
| MENU_CATEGORY | MENU_PRICE_USD | SUM_PRICE |
|---------------+----------------+-----------|
| Beverage      |           2.00 |      2.00 |
| Beverage      |           3.00 |      8.00 |
| Beverage      |           3.00 |      8.00 |
| Beverage      |           3.50 |     11.50 |
| Dessert       |           3.00 |      3.00 |
| Dessert       |           4.00 |      7.00 |
| Dessert       |           5.00 |     12.00 |
| Dessert       |           6.00 |     24.00 |
| Dessert       |           6.00 |     24.00 |
| Dessert       |           7.00 |     31.00 |
| Snack         |           6.00 |     12.00 |
| Snack         |           6.00 |     12.00 |
| Snack         |           7.00 |     19.00 |
| Snack         |           9.00 |     28.00 |
| Snack         |          11.00 |     39.00 |
+---------------+----------------+-----------+
```

### Window frames for cumulative and sliding calculations

Window frames are a very flexible mechanism for running different types of analytic queries, including both cumulative calculations and moving
calculations. To return cumulative sums, for example, you can specify a window frame that starts at a fixed point and moves row by row through the whole partition:

```sqlexample
OVER(PARTITION BY col1 ORDER BY col2 ROWS UNBOUNDED PRECEDING)
```

Another example of this type of frame might be:

```sqlexample
OVER(PARTITION BY col1 ORDER BY col2 ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
```

The number of rows that qualify for these frames is variable, but the start and end points of the frames are fixed, using named boundaries rather than
numeric or interval boundaries.

If you want the window function calculation to slide forward over a specific number (or range) of rows, you can use explicit offsets:

```sqlexample
OVER(PARTITION BY col1 ORDER BY col2 ROWS BETWEEN 3 PRECEDING AND 3 FOLLOWING)
```

In this case, the result is a sliding frame that consists of a maximum of seven rows (3 + current row + 3). Another example of this type of frame might be:

```sqlexample
OVER(PARTITION BY col1 ORDER BY col2 ROWS BETWEEN CURRENT ROW AND 3 FOLLOWING)
```

Window frames can contain a mix of named boundaries and explicit offsets.

### Sliding window frames

A sliding window frame is a fixed-width frame that “slides through” the rows in the partition, covering a different
slice of the partition each time. The number of rows in the frame remains the same, except at the beginning or end of a partition, where it may contain
fewer rows.

Sliding windows are often used to calculate moving averages, which are based on a fixed-size interval (such as a number of days).
The average is “moving” because although the size of the interval is constant, the actual values in the interval change over time (or over some other
dimension).

For example, stock market analysts often analyze stocks based in part on the 13-week moving average of
a stock’s price. The moving average price today is the average of price at the end of today and the price at the
end of each day during the most recent 13 weeks. If stocks are traded 5 days a week, and if there were no holidays
in the last 13 weeks, the moving average is the average price on each of the most recent 65 trading days (including today).

The following example shows what happens to a 13-week (91-day) moving average of a stock price on the last
day of June and the first few days of July:

* On June 30th, the function returns the average price for April 1 to June 30 (inclusive).
* On July 1st, the function returns the average price for April 2 to July 1 (inclusive).
* On July 2nd, the function returns the average price for April 3 to July 2 (inclusive).

The following example uses a small (3-day) sliding window over the first 7 days in the month. This example takes into account the fact
that at the beginning of the period, the partition might not be full:

As you can see in the corresponding mockup of a query result, the last column contains the sum of the three most recent days’ of sales
data. For example, the column value for day 4 is `36`, which is the sum of the sales for days 2, 3, and 4 (`11 + 12 + 13`):

> ```output
> +--------+-------+---------------+
> | Day of | Sales | Most Recent   |
> | Month  | Today | 3 Days' Sales |
> |--------+-------+---------------+
> |      1 |    10 |            10 |
> |      2 |    11 |            21 |
> |      3 |    12 |            33 |
> |      4 |    13 |            36 |
> |      5 |    14 |            39 |
> |    ... |   ... |           ... |
> +--------+-------+---------------+
> ```

## Ranking window functions

The syntax for a ranking window function is essentially the same as the syntax for other window functions. The exceptions include:

* Ranking window functions require the ORDER BY clause inside the OVER clause.
* For some ranking functions, such as [RANK](../sql-reference/functions/rank.md) itself, no input argument is required. For the RANK function, the value
  returned is based solely on numeric ranking, as determined by the ORDER BY clause inside the OVER clause. Therefore, passing a column name
  or expression to the function is unnecessary.

The simplest ranking function is named RANK. You can use this function to:

* Rank salespeople on revenue (sales), from highest to lowest.
* Rank countries based on their per-capita GDP (income per person), from highest to lowest.
* Rank countries on air pollution, from lowest to highest.

This function simply identifies the numeric ranking position of a row in an ordered set of rows. The first row has rank 1, the second has rank 2,
and so on. The following example shows the rank order of salespeople based on `Amount Sold`:

> ```output
> +-------------+-------------+------+
> | Salesperson | Amount Sold | Rank |
> |-------------+-------------+------|
> | Smith       |        2000 |    1 |
> | Jones       |        1500 |    2 |
> | Torkelson   |        1200 |    3 |
> | Dolenz      |        1100 |    4 |
> +-------------+-------------+------+
> ```

The rows must already be sorted before the rankings can be assigned. Therefore, you must use an ORDER BY clause within the OVER clause.

Consider the following example: you’d like to know where your store profit ranks among branches of the store chain (whether your store ranks first,
second, third, and so on). This example ranks each store by profitability within its city. The rows are put in descending order (highest profit first), so
the most profitable store is ranked 1:

> ```sqlexample
> SELECT city, branch_ID, net_profit,
>        RANK() OVER (PARTITION BY city ORDER BY net_profit DESC) AS rank
>     FROM store_sales
>     ORDER BY city, rank;
> +-----------+-----------+------------+------+
> | CITY      | BRANCH_ID | NET_PROFIT | RANK |
> |-----------+-----------+------------+------|
> | Montreal  |         3 |   10000.00 |    1 |
> | Montreal  |         4 |    9000.00 |    2 |
> | Vancouver |         2 |   15000.00 |    1 |
> | Vancouver |         1 |   10000.00 |    2 |
> +-----------+-----------+------------+------+
> ```

> **Note:**
>
> The `net_profit` column does *not* need to be passed as an argument to the RANK function. Instead, the input rows are sorted by `net_profit`.
> The RANK function merely needs to return the position of the row (1, 2, 3, and so on) within the partition.

The output of a ranking function depends on:

* The individual row passed to the function.
* The values of the other rows in the partition.
* The order of all the rows in the partition.

Snowflake provides several different ranking functions. For a list of these functions, and more details about their syntax, see
[Window functions](../sql-reference/functions-window.md).

To rank your store against all other stores in the chain, not just against other stores in your city,
use the query below:

```sqlexample
SELECT
    branch_ID,
    net_profit,
    RANK() OVER (ORDER BY net_profit DESC) AS sales_rank
  FROM store_sales
```

The following query uses the first ORDER BY clause to control processing by the window function and the second ORDER BY clause to
control the order of the entire query’s output:

```sqlexample
SELECT
    branch_ID,
    net_profit,
    RANK() OVER (ORDER BY net_profit DESC) AS sales_rank
  FROM store_sales
  ORDER BY branch_ID;
```

## Illustrated example

This example uses a sales scenario to illustrate many of the concepts described earlier in this topic.

Suppose that you need to generate a financial report that shows values based on sales over the last week:

* Daily sales
* Ranking within the week (that is, sales ranked highest to lowest for the week)
* Sales so far this week (that is, the “cumulative sum” for all days from the beginning of the week up through and including the
  current day)
* Total sales for the week
* Three-day moving average (that is, the average over the current day and the two previous days)

The report might look something like this:

> ```output
> +--------+-------+------+--------------+-------------+--------------+
> | Day of | Sales | Rank | Sales So Far | Total Sales | 3-Day Moving |
> | Week   | Today |      | This Week    | This Week   | Average      |
> |--------+-------+------+--------------+-------------|--------------+
> |      1 |    10 |    4 |           10 |          84 |         10.0 |
> |      2 |    14 |    3 |           24 |          84 |         12.0 |
> |      3 |     6 |    5 |           30 |          84 |         10.0 |
> |      4 |     6 |    5 |           36 |          84 |          9.0 |
> |      5 |    14 |    3 |           50 |          84 |         10.0 |
> |      6 |    16 |    2 |           66 |          84 |         11.0 |
> |      7 |    18 |    1 |           84 |          84 |         12.0 |
> +--------+-------+------+--------------+-------------+--------------+
> ```

The SQL for this query is somewhat complex. Rather than show the example as a single query, this discussion breaks down the SQL
for the individual columns.

In a real-world scenario, you would have years of data, so to calculate sums and averages for one specific week of data, you would
need to use a one-week window, or use a filter similar to:

> ```sqlexample
> ... WHERE date >= start_of_relevant_week and date <= end_of_relevant_week ...
> ```

However, for this example, assume that the table contains only the most recent week’s worth of data.

> ```sqlexample
> CREATE TABLE store_sales_2 (
>     day INTEGER,
>     sales_today INTEGER
>     );
> +-------------------------------------------+
> | status                                    |
> |-------------------------------------------|
> | Table STORE_SALES_2 successfully created. |
> +-------------------------------------------+
> INSERT INTO store_sales_2 (day, sales_today) VALUES
>     (1, 10),
>     (2, 14),
>     (3,  6),
>     (4,  6),
>     (5, 14),
>     (6, 16),
>     (7, 18);
> +-------------------------+
> | number of rows inserted |
> |-------------------------|
> |                       7 |
> +-------------------------+
> ```

### Calculating sales rank

The `Rank` column is calculated using the RANK function:

> ```sqlexample
> SELECT day,
>        sales_today,
>        RANK()
>            OVER (ORDER BY sales_today DESC) AS Rank
>     FROM store_sales_2
>     ORDER BY day;
> +-----+-------------+------+
> | DAY | SALES_TODAY | RANK |
> |-----+-------------+------|
> |   1 |          10 |    5 |
> |   2 |          14 |    3 |
> |   3 |           6 |    6 |
> |   4 |           6 |    6 |
> |   5 |          14 |    3 |
> |   6 |          16 |    2 |
> |   7 |          18 |    1 |
> +-----+-------------+------+
> ```

Although there are 7 days in the time period, there are only 5 different ranks (1, 2, 3, 5, 6). There were
two ties (for 3rd place and 6th place), so there are no rows with ranks 4 or 7.

### Calculating sales so far this week

The `Sales So Far This Week` column is calculated using [SUM](../sql-reference/functions/sum.md) as a window function
with a window frame:

> ```sqlexample
> SELECT day,
>        sales_today,
>        SUM(sales_today)
>            OVER (ORDER BY day
>                ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
>                AS "SALES SO FAR THIS WEEK"
>     FROM store_sales_2
>     ORDER BY day;
> +-----+-------------+------------------------+
> | DAY | SALES_TODAY | SALES SO FAR THIS WEEK |
> |-----+-------------+------------------------|
> |   1 |          10 |                     10 |
> |   2 |          14 |                     24 |
> |   3 |           6 |                     30 |
> |   4 |           6 |                     36 |
> |   5 |          14 |                     50 |
> |   6 |          16 |                     66 |
> |   7 |          18 |                     84 |
> +-----+-------------+------------------------+
> ```

This query orders the rows by date and then, for each date, calculates the sum of sales from the start of the window
up to the current date (inclusive).

### Calculating total sales this week

The `Total Sales This Week` column is calculated using [SUM](../sql-reference/functions/sum.md).

```sqlexample
SELECT day,
       sales_today,
       SUM(sales_today)
           OVER ()
               AS total_sales
    FROM store_sales_2
    ORDER BY day;
+-----+-------------+-------------+
| DAY | SALES_TODAY | TOTAL_SALES |
|-----+-------------+-------------|
|   1 |          10 |          84 |
|   2 |          14 |          84 |
|   3 |           6 |          84 |
|   4 |           6 |          84 |
|   5 |          14 |          84 |
|   6 |          16 |          84 |
|   7 |          18 |          84 |
+-----+-------------+-------------+
```

### Calculating a three-day moving average

The `3-Day Moving Average` column is calculated using [AVG](../sql-reference/functions/avg.md) as a window function with a
window frame:

```sqlexample
SELECT day,
       sales_today,
       AVG(sales_today)
           OVER (ORDER BY day ROWS BETWEEN 2 PRECEDING AND CURRENT ROW)
               AS "3-DAY MOVING AVERAGE"
    FROM store_sales_2
    ORDER BY day;
+-----+-------------+----------------------+
| DAY | SALES_TODAY | 3-DAY MOVING AVERAGE |
|-----+-------------+----------------------|
|   1 |          10 |               10.000 |
|   2 |          14 |               12.000 |
|   3 |           6 |               10.000 |
|   4 |           6 |                8.666 |
|   5 |          14 |                8.666 |
|   6 |          16 |               12.000 |
|   7 |          18 |               16.000 |
+-----+-------------+----------------------+
```

The difference between this window frame and the window frame described earlier is the starting point: a fixed boundary
versus an explicit offset.

### Putting it all together

Here’s the final version of the query, showing all of the columns:

```sqlexample
SELECT day,
       sales_today,
       RANK()
           OVER (ORDER BY sales_today DESC) AS Rank,
       SUM(sales_today)
           OVER (ORDER BY day
               ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
               AS "SALES SO FAR THIS WEEK",
       SUM(sales_today)
           OVER ()
               AS total_sales,
       AVG(sales_today)
           OVER (ORDER BY day ROWS BETWEEN 2 PRECEDING AND CURRENT ROW)
               AS "3-DAY MOVING AVERAGE"
    FROM store_sales_2
    ORDER BY day;
+-----+-------------+------+------------------------+-------------+----------------------+
| DAY | SALES_TODAY | RANK | SALES SO FAR THIS WEEK | TOTAL_SALES | 3-DAY MOVING AVERAGE |
|-----+-------------+------+------------------------+-------------+----------------------|
|   1 |          10 |    5 |                     10 |          84 |               10.000 |
|   2 |          14 |    3 |                     24 |          84 |               12.000 |
|   3 |           6 |    6 |                     30 |          84 |               10.000 |
|   4 |           6 |    6 |                     36 |          84 |                8.666 |
|   5 |          14 |    3 |                     50 |          84 |                8.666 |
|   6 |          16 |    2 |                     66 |          84 |               12.000 |
|   7 |          18 |    1 |                     84 |          84 |               16.000 |
+-----+-------------+------+------------------------+-------------+----------------------+
```

## Additional examples

This section provides more examples of window functions, and illustrates how the PARTITION BY and
ORDER BY clauses work together.

These examples use the following table and data:

```sqlexample
CREATE TABLE sales (sales_date DATE, quantity INTEGER);

INSERT INTO sales (sales_date, quantity) VALUES
    ('2018-01-01', 1),
    ('2018-01-02', 3),
    ('2018-01-03', 5),
    ('2018-02-01', 2)
    ;
```

### Window function with ORDER BY clause

The ORDER BY clause controls the order of the data within each window (and each partition if there is more than one partition).
This is useful if you want to show a “running sum” over time as new rows are added.

A running sum can be calculated either from the beginning of the window to the current row (inclusive) or from the current row to the end
of the window.

A query can use a “sliding” window, which is a fixed-width window that processes *n* specified rows relative to the current row
(for example, the 10 most recent rows, including the current row).

### Window frames with fixed boundaries

When the window frame has a fixed boundary, values can be computed from the beginning of the window to the current row (or from the current row to the
end of the window):

```sqlexample
SELECT MONTH(sales_date) AS MONTH_NUM,
       quantity,
       SUM(quantity) OVER (ORDER BY MONTH(sales_date)
                     ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW)
           AS CUMULATIVE_SUM_QUANTITY
    FROM sales
    ORDER BY sales_date;
```

The query result includes additional comments that show how the `CUMULATIVE_SUM_QUANTITY` column was calculated:

```sqlexample
+-----------+----------+-------------------------+
| MONTH_NUM | QUANTITY | CUMULATIVE_SUM_QUANTITY |
|-----------+----------+-------------------------|
|         1 |        1 |                       1 |  -- sum = 1
|         1 |        3 |                       4 |  -- sum = 1 + 3
|         1 |        5 |                       9 |  -- sum = 1 + 3 + 5
|         2 |        2 |                      11 |  -- sum = 1 + 3 + 5 + 2
+-----------+----------+-------------------------+
```

### Window frames with explicit offsets

In the financial world, analysts often study “moving averages.”

For example, you might have a graph in which the X axis is time, and the Y axis shows the average price of the stock over the last 13 weeks
(that is, a 13-week moving average). In a graph of a 13-week moving average of a stock price, the price shown for June 30th is not the price
of the stock on June 30th, but the *average* price of the stock for the 13 weeks up to and including June 30th (April 1st through June 30th).
The value on July 1st is the average price for April 2nd through July 1st; the value on July 2nd is the average price for April 3rd through
July 2nd, and so on. Each day, the window effectively adds the most recent day’s value to the moving average, and removes the oldest
day’s value. This smooths out day-to-day fluctuations and can make trends easier to recognize.

Moving averages can be calculated using a sliding window frame. The frame has a specific width in rows. In the stock price example above,
13 weeks is 91 days, so the sliding window would be 91 days. If the measurements are taken once per day (for example, at the end of the day),
the window would be 91 rows “wide.”

To define a window that is 91 rows wide:

```sqlexample
SELECT AVG(price) OVER(ORDER BY timestamp1 ROWS BETWEEN 90 PRECEDING AND CURRENT ROW)
  FROM sales;
```

> **Note:**
>
> The initial window frame might be less than 91 days wide. For example, suppose that you want the 13-week
> moving average price of a stock. If the stock was first created on April 1st, on April 3rd only 3 days’ of
> price information exists, so the window is only 3 rows wide.

The following example shows the result of summing over a sliding window frame that is wide enough to hold two samples:

```sqlexample
SELECT MONTH(sales_date) AS MONTH_NUM,
       quantity,
       SUM(quantity) OVER (ORDER BY sales_date
                           ROWS BETWEEN 1 PRECEDING AND CURRENT ROW)
           AS SLIDING_SUM_QUANTITY
  FROM sales
  ORDER BY sales_date;
```

The query result includes additional comments that show how the `SLIDING_SUM_QUANTITY` column was calculated:

```sqlexample
+-----------+----------+----------------------+
| MONTH_NUM | QUANTITY | SLIDING_SUM_QUANTITY |
|-----------+----------+----------------------+
|         1 |        1 |                   1  |  -- sum = 1
|         1 |        3 |                   4  |  -- sum = 1 + 3
|         1 |        5 |                   8  |  -- sum = 3 + 5 (1 is no longer in the window)
|         2 |        2 |                   7  |  -- sum = 5 + 2 (3 is no longer in the window)
+-----------+----------+----------------------+
```

Note that the “sliding window” functionality requires the ORDER BY clause; the function depends on the order
of rows that enter and exit the window frame.

### Running totals with PARTITION BY and ORDER BY clauses

You can combine PARTITION BY and ORDER BY clauses to get running sums within partitions. In this example, the partitions
are one month, and because the sums apply only within a partition, the sum is reset to `0` at the beginning of each new month:

```sqlexample
SELECT MONTH(sales_date) AS MONTH_NUM,
       SUM(quantity) OVER (PARTITION BY MONTH(sales_date) ORDER BY sales_date)
          AS MONTHLY_CUMULATIVE_SUM_QUANTITY
    FROM sales
    ORDER BY sales_date;
```

The query result includes additional comments showing how the `MONTHLY_CUMULATIVE_SUM_QUANTITY` column was calculated:

```sqlexample
+-----------+---------------------------------+
| MONTH_NUM | MONTHLY_CUMULATIVE_SUM_QUANTITY |
|-----------+---------------------------------+
|         1 |                               1 |  -- sum = 1
|         1 |                               4 |  -- sum = 1 + 3
|         1 |                               9 |  -- sum = 1 + 3 + 5
|         2 |                               2 |  -- sum = 0 + 2 (new month)
+-----------+---------------------------------+
```

You can combine partitions and sliding window frames. In the example below, the sliding window is usually two rows wide, but each time a new
partition (that is, a new month) is reached, the sliding window starts with only the first row in that partition:

```sqlexample
SELECT
       MONTH(sales_date) AS MONTH_NUM,
       quantity,
       SUM(quantity) OVER (PARTITION BY MONTH(sales_date)
                           ORDER BY sales_date
                           ROWS BETWEEN 1 PRECEDING AND CURRENT ROW)
         AS MONTHLY_SLIDING_SUM_QUANTITY
    FROM sales
    ORDER BY sales_date;
```

The query result includes additional comments showing how the `MONTHLY_SLIDING_SUM_QUANTITY` column was calculated:

```sqlexample
+-----------+----------+------------------------------+
| MONTH_NUM | QUANTITY | MONTHLY_SLIDING_SUM_QUANTITY |
|-----------+----------+------------------------------+
|         1 |        1 |                           1  |  -- sum = 1
|         1 |        3 |                           4  |  -- sum = 1 + 3
|         1 |        5 |                           8  |  -- sum = 3 + 5
|         2 |        2 |                           2  |  -- sum = 0 + 2 (new month)
+-----------+----------+------------------------------+
```

### Calculate the ratio of a value to a sum of values

You can use the RATIO_TO_REPORT function to calculate the ratio of a value to the sum of the values in a partition, then
return the ratio as a percentage of that sum. The function divides the value in the current row by the sum of the values in all of the rows in a partition.

```sqlexample
SELECT branch_ID,
       city,
       100 * RATIO_TO_REPORT(net_profit) OVER (PARTITION BY city)
    FROM store_sales AS s1
    ORDER BY city, branch_ID;
+-----------+-----------+------------------------------------------------------------+
| BRANCH_ID | CITY      | 100 * RATIO_TO_REPORT(NET_PROFIT) OVER (PARTITION BY CITY) |
|-----------+-----------+------------------------------------------------------------|
|         3 | Montreal  |                                                52.63157900 |
|         4 | Montreal  |                                                47.36842100 |
|         1 | Vancouver |                                                40.00000000 |
|         2 | Vancouver |                                                60.00000000 |
+-----------+-----------+------------------------------------------------------------+
```

The PARTITION BY clause defines partitions on the `city` column. If you want to see the profit percentage relative to the entire chain,
rather than just the stores within a specific city, omit the PARTITION BY clause:

```sqlexample
SELECT branch_ID,
       100 * RATIO_TO_REPORT(net_profit) OVER ()
    FROM store_sales AS s1
    ORDER BY branch_ID;
+-----------+-------------------------------------------+
| BRANCH_ID | 100 * RATIO_TO_REPORT(NET_PROFIT) OVER () |
|-----------+-------------------------------------------|
|         1 |                               22.72727300 |
|         2 |                               34.09090900 |
|         3 |                               22.72727300 |
|         4 |                               20.45454500 |
+-----------+-------------------------------------------+
```

---
title: Analyzing query workloads with Performance Explorer
source: https://docs.snowflake.com/en/user-guide/performance-explorer.md
section: User Guide
---

# Analyzing query workloads with Performance Explorer

You can use Performance Explorer in Snowsight to review interactive metrics for SQL workloads.
The metrics show the overall health of your Snowflake environment, query activity, changes to warehouses,
and changes to tables.

## Benefits of Performance Explorer

Performance Explorer can help you answer the following key questions about Snowflake activity:

* **Overall activity:** Are queries generally succeeding, and can Snowflake users get their
  work done?
* **Change over time:** If query activity or resources look different from what I expected, what has
  changed and when did the changes occur?
* **Hot spots:** When I look for opportunities to take action, where should I focus my attention?

## Common use cases for Performance Explorer

Performance Explorer can help with the following use cases:

* **Investigating problem reports about queries or workloads:** If a Snowflake workload has started to behave
  differently, determine what else might have changed recently, such as the resources that the workload depends
  on or neighboring workload activity.
* **Proactively identifying hotspots:** If a warehouse or table shows persistent errors or saturation, identify
  and address the hotspot before it affects critical workloads.
* **Identifying optimization opportunities:** Find warehouses and tables that might be mismatched to the query
  activity they support, and adjust workloads and resources to make them compatible.

## Required privileges

Performance Explorer shows account activity that is similar to data in Account Usage views (for example,
[query history](../sql-reference/account-usage/query_history.md) and
[access history](../sql-reference/account-usage/access_history.md)). What you can see in each part of the
dashboard depends on your privileges. Snowflake grants the `SNOWFLAKE.PERFORMANCE_EXPLORER_PUBLIC_USER`
application role to the `PUBLIC` role so that users can open Performance Explorer in Snowsight;
the following rules determine whether sections show full account data, filtered data, an empty state, or a
permission error.

> **Note:**
>
> For Performance Explorer, Snowflake evaluates privileges from **all roles granted to you**. This is
> equivalent to `USE SECONDARY ROLES ALL` on top of your active primary role for the session.

### Full access to Performance Explorer data in the account

You have full access to Performance Explorer data for your account when **any** role granted to you meets
one of the following conditions:

* Your role is the [ACCOUNTADMIN role](security-access-control-overview.md).
* Your role has been granted `IMPORTED PRIVILEGES` on the shared `SNOWFLAKE` database (see
  [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md)).
* Your role has been granted the `SNOWFLAKE.PERFORMANCE_EXPLORER_USER` application role.

For example, to give the user `jdoe` full Performance Explorer access by using a custom role, run:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE pe_viewer_role;
GRANT APPLICATION ROLE SNOWFLAKE.PERFORMANCE_EXPLORER_USER TO ROLE pe_viewer_role;
GRANT ROLE pe_viewer_role TO USER jdoe;
```

### Seeing query metrics without full account access

If you do **not** have full access as described above, you can still see query activity when **any** role
granted to you satisfies one of the following:

* Your role has been granted the `GOVERNANCE_VIEWER` [database role](../sql-reference/snowflake-db-roles.md)
  in the `SNOWFLAKE` database (account-wide query activity).
* Your role has the `MONITOR` privilege on the account, or the `MANAGE WAREHOUSES` privilege on the
  account, which effectively covers all warehouses (query activity that uses warehouses).
* Your role has `OWNERSHIP`, `MONITOR`, or `OPERATE` on at least one warehouse (query activity only for
  queries that ran on warehouses you can monitor or operate). For details, see
  [Warehouse privileges](security-access-control-privileges.md).

If none of the above apply and you cannot `MONITOR` or `OPERATE` any warehouse, Performance Explorer
shows a permission error for query activity. This is intentional: without at least one authorized warehouse,
no query metrics would be visible. In practice, many accounts include a default warehouse that all users
can monitor; see [Snowsight templates](ui-snowsight/snowsight-templates.md).

Warehouse filters list warehouses you are allowed to use in filters (for example, warehouses you can
`MONITOR` or `OPERATE`, and in some cases warehouses that had query activity in the retention window).
Warehouse-scoped visibility is similar in spirit to the rules for Query History in
[Monitor query activity with Query History](ui-snowsight-activity.md), but Performance Explorer uses **all roles granted to you** and
combines several privilege types, so the exact rules differ.

### Database filters and database-oriented breakdowns

To see all databases in database filters (and related aggregations by database), **any** role granted to you
must satisfy one of the following:

* Your role meets the conditions for full access to Performance Explorer data in the account (see the preceding
  section).
* Your role has been granted the `OBJECT_VIEWER` [database role](../sql-reference/snowflake-db-roles.md) in
  the `SNOWFLAKE` database.
* Your role has `RESOLVE ALL` on the account.
* Your role has `MONITOR` on the account.

Otherwise, you only see databases where **any** role granted to you has at least one privilege on
the database. Queries can still appear in other sections even if they touch a database you cannot list,
except where the UI explicitly names databases (for example, certain side-panel breakdowns).

### Warehouse events

To see warehouse events for all warehouses that appear in your authorized query activity, **any** role granted
to you must satisfy one of the following:

* Your role meets the conditions for full access to Performance Explorer data in the account (see the preceding
  section).
* Your role has been granted the `USAGE_VIEWER` [database role](../sql-reference/snowflake-db-roles.md) in
  the `SNOWFLAKE` database.
* Your role has the `MONITOR` privilege on the account.
* Your role has the `MANAGE WAREHOUSES` privilege on the account.

Otherwise, warehouse events are limited to warehouses where you have `OWNERSHIP`,
`MONITOR`, or `OPERATE`. If you have none of those warehouse privileges or usage-related database
roles, Performance Explorer shows a permission error for warehouse events.

### Top tables and table change events

The **Top tables** section and **table change events** require full access **or** the `GOVERNANCE_VIEWER`
database role in the `SNOWFLAKE` database. Snowflake does not offer a lower-privilege, per-table
alternative for these sections due to security and performance constraints. If you do not meet this bar,
those sections show a permission error.

### Empty charts, filtered results, and permission errors

For security reasons, an empty chart or table can mean either that there was no activity in the selected
period **or** that your roles cannot see that activity. Tile-level permission errors call out missing
privileges (for example, governance visibility for table metrics).

### Privilege changes and data freshness

Updates to grants and revocations can take **a few hours** to affect what Performance Explorer shows.

## Open Performance Explorer

To open Performance Explorer, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Performance Explorer.

Performance Explorer contains charts that show metrics related to your workloads and the general health of
your Snowflake environment.

To leave feedback about Performance Explorer, select Feedback.

## Understanding the Performance Explorer dashboard

You can review interactive metrics for SQL workloads by using charts on the Performance Explorer dashboard, and
you can apply filters to show metrics about only the query activity and resources that you’re interested in.
Charts are grouped under tabs (Queries, Warehouses, and Tables). The page URL updates when you
change tabs, and the same tab stays selected if you refresh the page.

### Performance Explorer filters

At the top of the Performance Explorer dashboard, you can apply the following filters:

* Preset - Choose None or a saved combination of period, warehouse, database, and role filters. From
  the menu you can save the current filters as a new preset, clear all filters, copy a link that encodes the
  current filters, and manage saved presets (for example set or change a default preset).
* Period - Select a time period, such as the last week, the last two weeks, or a custom range. The
  dashboard shows metrics for the specified period.

  Performance Explorer displays metrics for one week by default. It supports a period of up to one month,
  going back from the current date.

  Several Performance Explorer charts show the percentage of change compared to the previous period. The
  range of the previous period corresponds with the current period range. For example, if the current
  period is two weeks, then the previous period is the two weeks before the current period started.
* Warehouse - Select a warehouse to view metrics only for query activity that ran using that warehouse.
  To limit the warehouses in the list, use the search field. To clear the filter, select `X`.
* Database - Select a database to view metrics only for query activity that accessed that database.
  To limit the databases in the list, use the search field. To clear the filter, select `X`.
* Role - Select a role to view metrics only for query activity initiated by that role. To limit the
  roles in the list, use the search field. To clear the filter, select `X`.

### Performance Explorer charts

Performance Explorer displays metrics in different types of charts. It is important to understand the components
in each type of chart and how to interpret them.

On the Queries tab, line chart metrics use line charts that are similar to the following image:

The following table describes the callouts in the image:

| Callout | Description |
| --- | --- |
| **1** | Select View details > to open the side panel. View details > appears when you hover over a chart. |
| **2** | Shows the average or median in the period. |
| **3** | Shows the percentage increased or decreased compared to the previous period. |
| **4** | Represents the value for one hour. The values are shown for an amount of time at the start of the interval. For example, if the interval is one hour, the value shown at 9 AM is for the interval from 9 AM to 10 AM. |

Some charts include a large average or median value and the percentage of change for the period.
When there is more than one line, there is a key to the lines above the chart.

Some charts have an information icon next to the title. Hover over the icon for information about the metrics
in the chart.

You can hover over a point in the line chart to see the value for a specific hour:

The Top warehouses section on the Warehouses tab and the
Top tables section on the Tables tab have bar charts that are similar to
the following image:

The following table describes the callouts in the image:

| Callout | Description |
| --- | --- |
| **1** | Select View details > to open the side panel. View details > appears when you hover over a chart. |
| **2** | Select a tab to show the metrics on the tab. |
| **3** | Shows the value of this metric for the current period. |
| **4** | Shows the percentage increased or decreased compared to the previous period. |
| **5** | Indicates that there is no data from the previous period for comparison. |

On both line charts and bar charts, select View details > to open a side panel that displays more detailed information
about the metrics on the chart. The detailed information varies based on the metrics shown in the chart. Most side
panels present sortable tables that you can use to review metrics for specific warehouses, roles, databases, and queries in
the period.

Use the Search results field above the table to filter rows; search is case-insensitive and applies across the
side-panel aggregation tabs (for example, By warehouse and By role). Select the download control to export
the table as a CSV file. The downloaded file name reflects the chart and the active dashboard filters.

You can select a custom period of time in a side panel by clicking where the custom period starts
and dragging to where the custom period ends.

In a side panel, you can select one of the following tabs:

* By warehouse - Shows the activity by warehouses in the period.
* By database - Shows the activity by databases in the period.
* By role - Shows the activity by roles in the period.
* By grouped queries - Shows the queries that were run in the period. Some queries are redacted for security
  reasons. For information about how queries are grouped, see [Use the Grouped Query History view in Snowsight](ui-snowsight-activity.md).

If you select a custom period, these tabs refresh to show the metrics only for the selected custom period.

The Top warehouses and
Top tables sections also include events charts that are similar to the
following image:

An events chart shows a sortable table of events for the type of object. You can examine the data for unexpected events.
For more information about warehouse events, see [WAREHOUSE_EVENTS_HISTORY view](../sql-reference/organization-usage/warehouse_events_history.md).
For more information about table events, see [TABLES view](../sql-reference/organization-usage/tables.md).

## Reviewing metrics on the Queries tab

On the Queries tab, line charts cover reliability signals (failures, retries, overload, blocking) and runtime signals
(duration, throughput, wait time, and hourly failure counts). Use them to review trends over the selected period. Performance Explorer
summarizes historical windows for your account; it is not a live monitoring dashboard.

The following line-chart metrics are available on the Queries tab:

| Metric | Unit | Description | Notes | More information |
| --- | --- | --- | --- | --- |
| Query failures/1K | Failures per 1000 | The number of queries that failed for every 1,000 queries that ran, including the following metrics:   * The large number above the line graph is the average number of failures for every 1,000 queries in the period. * The percentage value is the percentage of change in the rate of failures since the last period. * The line chart shows the number of failures for every 1,000 queries for each hour in the period. | This metric should be low or zero. If queries are failing, review the query history and errors, and then modify your queries to resolve the issues. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
| Query retries/1K | Retries per 1000 | The number of queries that were retried for every 1,000 queries that ran, including the following metrics:   * The large number above the line graph is the average number of retries for every 1,000 queries in the period. * The percentage value is the percentage of change in the rate of retries since the last period. * The line chart shows the number of retries for every 1,000 queries for each hour in the period. | This metric should be low or zero. If queries are retrying, review the causes, and then take actions to prevent query retries. For example, if a query is retried because of an out-of-memory error, modifying warehouse settings might resolve the issue. | [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
| Query overload % | Percent | The percentage of total run time that queries waited in a queue for warehouse resources, including the following metrics:   * The large number above the line graph is the median percentage of time that queries waited in a queue for   warehouse resources in the period. * The percentage value is the percentage of change in the number of queries that waited since the last period. * The line chart shows the percentage of time that queries waited in a queue for warehouse resources for each   hour in the period. | This metric should be low or zero. If queries are waiting before running, warehouse resources might be exhausted, causing queries to be queued until resources become available. | [Reducing queues](performance-query-warehouse-queue.md) |
| Query blocked % | Percent | The percentage of total run time that queries spent blocked waiting for a transaction lock on a resource, including the following metrics:   * The large number above the line graph is the median percentage of time spent blocked waiting for a lock   in the period. * The percentage value is the percentage of change in the amount of time that queries spent blocked since the last   period. * The line chart shows the percentage of time queries spent blocked waiting for a lock for each hour in   the period. | This metric should be low or zero. If queries were blocked, review the query history and errors, and then modify your queries to resolve the issues. | [Resource locking](../sql-reference/transactions.md) . . [Best practices for transactions](../sql-reference/transactions.md) . . [LOCK_WAIT_HISTORY view](../sql-reference/organization-usage/lock_wait_history.md) . . [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
| Query duration | Seconds | The amount of time it took for queries to complete for each hour of the period. The line chart shows the median amount of time for all queries, the amount of time for queries in the ninetieth percentile, and the amount of time for queries in the ninety-ninth percentile. | This metric varies widely depending on your data and the types of queries you are running. Queries with durations that change over time might be candidates for investigation and optimization. | [Exploring execution times](performance-query-exploring.md) . . [Optimizing query performance](performance-query-options.md) |
| Query throughput | Queries | The number of queries that ran each hour. | This metric can reveal changes in query activity, which might indicate new trends or changes in your workloads. | [Optimizing warehouses for performance](performance-query-warehouse.md) |
| Query wait time | Seconds | The amount of time that queries waited for warehouse resources or because of a lock on a resource. For information about the states (Overload, Provisioning, Repair, and Blocked), see [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md). | This metric should be low or zero. If queries are waiting before running, warehouse resources might be exhausted, causing queries to be queued until resources become available. | [Reducing queues](performance-query-warehouse-queue.md) . . [Resource locking](../sql-reference/transactions.md) |
| Query failures | Failures | The number of queries that failed for each hour in the period. | This metric should be low or zero. If queries are failing, review the query history and errors, and then modify your queries to resolve the issues. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |

## Reviewing top warehouses

On the Warehouses tab, this section of Performance Explorer includes metrics about the warehouses in your Snowflake environment that
experienced the most changes in the period. You can review these metrics to see whether your warehouses
are functioning as expected to support query activity. The metrics can also show whether any warehouses are
associated with trends in query activity that are unusual when compared to other warehouses. You can also determine
whether the composition of the workloads that warehouses run have changed.

All metrics in this section show the metric value and the percentage of change since the last period. The percentage of
change can be positive or negative, with positive change represented by an up arrow and negative change represented
by a down arrow. For each metric, Performance Explorer shows the 10 warehouses with the most changes. To view metrics
for more warehouses, select View details > on a chart to open the side panel. If this metric has no value from the last period
for a warehouse, — is shown instead of the percentage of change. There might be no value because the warehouse is
new, or because the event being measured is infrequent.

This section includes the following metrics:

| Metric | Tab | Unit | Description | Notes | More information |
| --- | --- | --- | --- | --- | --- |
| Warehouses with errors | Query failures/1K | Failures per 1000 | For each warehouse, the number of queries that failed for every 1,000 queries that ran. | This metric should be low or zero. If queries are failing, review the query history and errors, and then modify your queries to resolve the issues. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
|  | Query OOM errors/1K | Errors per 1000 | For each warehouse, the number of queries that returned “out of memory” errors for every 1,000 queries that ran. | This metric should be low or zero. If queries are failing with “out of memory” errors, review the query history to determine which queries are failing for the warehouses, and then modify the warehouses that run the queries to avoid the errors. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) . . [Queries too large to fit in memory](performance-query-warehouse-memory.md) |
|  | Query retries/1K | Retries per 1000 | For each warehouse, the number of queries that were retried for every 1,000 queries that ran. | This metric should be low or zero. If queries are retrying because warehouses are running out of memory, review the query history to determine which queries are retrying for the warehouses, and then modify the warehouses that run the queries to avoid the errors. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) . . [Optimizing warehouses for performance](performance-query-warehouse.md) |
| Warehouses with spillage | % queries with bytes spilled | Percent | For each warehouse, the percentage of queries that spilled to local disk or remote cloud storage when they ran. | This metric should be low or zero. If queries are spilling to disk because warehouses are running out of memory, review the query history to determine which queries are spilling for the warehouses, and then modify the warehouses that run the queries to avoid the errors. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) . . [Queries too large to fit in memory](performance-query-warehouse-memory.md) |
|  | % bytes spilled of total | Percent | For each warehouse, the percentage of bytes that spilled to local disk or remote cloud storage when they ran compared with the number of bytes read. | This metric should be low or zero. If queries are spilling to disk because warehouses are running out of memory, review the query history to determine which queries are spilling for the warehouses, and then modify the warehouses that run the queries to avoid the errors. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) . . [Queries too large to fit in memory](performance-query-warehouse-memory.md) |
| Query wait time % | Overload % | Percent | For each warehouse, the proportion of total run time that queries waited because the warehouse was overloaded by the query workload. | This metric should be low or zero. If queries are waiting before running, warehouse resources might be exhausted, causing the warehouse to queue queries until resources become available. | [Reducing queues](performance-query-warehouse-queue.md) |
|  | Provisioning % | Percent | For each warehouse, the average proportion of total run time that queries waited for warehouse compute resources to provision, due to warehouse creation, resume, or resize. | This metric should be low or zero. If queries are waiting before running, warehouse resources might be exhausted, causing it to queue queries until resources become available. | [Reducing queues](performance-query-warehouse-queue.md) |
| Warehouse query performance | Median query duration | Seconds | For each warehouse, the median amount of time for queries to run. | This metric varies widely depending on your data and the types of queries you are running. If the median query duration shows unusual changes, the workload that this warehouse supports might have changed, or the warehouse configuration might have changed. | [Exploring execution times](performance-query-exploring.md) . . [Optimizing query performance](performance-query-options.md) |
|  | Query throughput | Queries | For each warehouse, the number of queries processed. | This metric can reveal changes in query activity, which might require modifications to the warehouses that run the queries. | [Optimizing warehouses for performance](performance-query-warehouse.md) |
| Warehouse events | **–** | None | A sortable table of warehouse events. | This metric shows which warehouses changed in the period. Examine the data for unexpected events. | [WAREHOUSE_EVENTS_HISTORY view](../sql-reference/organization-usage/warehouse_events_history.md) |

## Reviewing top tables

On the Tables tab, this section of Performance Explorer includes metrics about the tables in your Snowflake environment that
experienced the most changes in the period. You can review these metrics to see whether your tables
can support query activity and return data as expected. The metrics can also show whether any tables are
associated with trends in query activity that are unusual when compared to other tables. You can also determine
whether any tables have changed recently and how they have changed.

All metrics in this section show the metric value and the percentage of change since the last period. The percentage of
change can be positive or negative, with positive change represented by an up arrow and negative change represented
by a down arrow. For each metric, Performance Explorer shows the 10 tables with the most changes. To view metrics
for more tables, select View details > on a chart to open the side panel. If this metric has no value from the last period
for a table, — is shown instead of the percentage of change. There might be no value because the table is new or
because the event being measured is infrequent.

This section includes the following metrics:

| Metric | Tab | Unit | Description | Notes | More information |
| --- | --- | --- | --- | --- | --- |
| Table query failures/1K | **–** | Failures per 1000 | For each table, the number of queries that failed for every 1,000 queries that ran. | This metric should be low or zero. If queries are failing, review the query history and errors, and then modify your queries to resolve the issues. | [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
| Table queries blocked/1K | **–** | Blocked per 1000 | For each table, the number of queries that were blocked for every 1,000 queries that ran. | This metric should be low or zero. If queries were blocked, review the query history and errors, and then modify your queries to resolve the issues. | [Resource locking](../sql-reference/transactions.md) . . [Best practices for transactions](../sql-reference/transactions.md) . . [LOCK_WAIT_HISTORY view](../sql-reference/organization-usage/lock_wait_history.md) . . [Monitor query activity with Query History](ui-snowsight-activity.md) . . [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) |
| Table read performance | Median read query duration | Seconds | For each table, the median amount of time for queries to run. | This metric varies widely depending on your data and the types of queries you are running. Queries with durations that change over time might be candidates for investigation and optimization. | [Exploring execution times](performance-query-exploring.md) . . [Optimizing query performance](performance-query-options.md) |
|  | Read query throughput | Queries | For each table, the number of queries processed. | This metric can reveal changes in query activity for tables. If there is an increase in the number of queries for a table, you might want to modify the table to optimize query performance. For example, you might enable search optimization on the table. | [Table Design Considerations](table-considerations.md) . . [Optimizing query performance](performance-query-options.md) |
| Table write performance | Median write query duration | Seconds | For each table, the median amount of time for Data Manipulation Language (DML) operations to run. | This metric varies widely depending on your data and the types of DML operations you are running. DML operations with durations that change over time might be candidates for investigation and optimization. | [Exploring execution times](performance-query-exploring.md) . . [Optimizing query performance](performance-query-options.md) |
|  | Write query throughput | Queries | For each table, the number of DML operations processed. If there is an increase in the number of DML operations for a table, you might want to modify the table to optimize performance. | This metric can reveal changes in the number of DML operations. | [Table Design Considerations](table-considerations.md) |
| Table change events | **–** | None | A sortable table of table events. | This metric shows which tables changed in the period. Examine the data for unexpected events. | [TABLES view](../sql-reference/organization-usage/tables.md) |

---
title: Analyzing time-series data
source: https://docs.snowflake.com/en/user-guide/querying-time-series-data.md
section: User Guide
---

# Analyzing time-series data

You can analyze time-series data in Snowflake, using functionality designed specifically
for this purpose. Database administrators, data scientists, and application developers
have to make sure that the time series is stored and loaded efficiently, and in many cases
summarized into a form that is complete and consistent, before making the data available
to business analysts and other consumers.

## Introduction to time-series data

A *time series* consists of sequential observations that capture how systems, processes, and
behaviors change over a period of time. Time-series data is collected from a broad
range of devices across a broad range of industries. Common examples include stock-trading data
collected for financial applications, weather observations, temperature readings collected from
sensors in smart factories, and logs of user clicks in digital advertising.

A single record in a time series typically has the following components:

* A date, time, or timestamp that has a consistent level of granularity
  (milliseconds, seconds, minutes, hours, etc.).
* One or more measurements or metrics of some kind, usually numeric (facts that might reveal trends
  or anomalies in the data).
* Dimensions of interest that are associated with the measurement, such as a location for a
  temperature reading, or a stock symbol for a given trade.

For example, the following weather observation has start and end timestamps, a rainfall measurement (`0.32`),
and location information:

```output
EVENTID | TYPE | SEVERITY | START_TIME              | END_TIME                | PRECIP | TIME_ZONE   | CITY       | COUNTY    | STATE | ZIP
W100    | Rain | Moderate | 2020-12-20 16:35:00.000 | 2020-12-20 17:15:00.000 |   0.32 | US/Eastern  | Southport  | Brunswick | NC    | 28461
```

The following data collected from a factory device has a namespace (`IOT`), a tag ID or sensor ID (`3000`),
a timestamp for the temperature reading on the device, the temperature reading itself (`21.1673`), and a “broker timestamp,”
which is when the data subsequently arrived at the data broker. For example, the data broker might be a Kafka server that ingests data into a Snowflake table.

```output
DEVICE | LINE | DEVICE_TIMESTAMP        | TEMP     | BROKER_TIMESTAMP
IOT    | 3000 | 2023-01-01 00:01:00.000 | 21.1673  | 2023-01-01 00:01:32.000
```

A time series might reveal spikes when readings change dramatically for some reason. For example, the following image shows a sequence
of temperature readings taken at 15-second intervals, with values peaking over 40°C after being steadily in the 35°C range for the previous day.

The following sections show how to analyze and visualize large volumes of this kind of data with SQL functions and joins that provide fast, accurate results.

## How to store time-series data

The following [datetime data types](../sql-reference/data-types-datetime.md) are supported:

* DATE
* TIME
* TIMESTAMP (and variations, including TIMESTAMP_TZ)

For information about loading, managing, and querying data that uses these data types, see
[Working with date and time values](../sql-reference/date-time-examples.md).

A number of commonly used [SQL functions](../sql-reference/functions-date-time.md) are available
to help with both storing and querying time-series data. For example, you can use
[CONVERT_TIMEZONE](../sql-reference/functions/convert_timezone.md) to convert timestamps from one time zone to
another, and you can use functions such as [EXTRACT](../sql-reference/functions/extract.md) and
[TIMEADD](../sql-reference/functions/timeadd.md) to manipulate time-based data as needed.

> **Note:**
>
> For TIMESTAMP_TZ data, Snowflake stores the offset of a given time zone, not the actual time zone,
> at the moment of creation for a given value.

To optimize query performance, tables used for time-series analytics are often clustered by time (and
sometimes also by sensor ID or a similar dimension). See [Clustering Keys & Clustered Tables](tables-clustering-keys.md).

## Aggregating time-series data

Management of time-series data might require the aggregation of large volumes of fine-grained
records into a more summarized form (a process sometimes referred to as “downsampling”).
Given a large set of records with a specific time-based granularity (milliseconds, seconds, minutes, etc.),
you can roll up these records to a coarser granularity, effectively producing a
smaller sample.

Downsampling is valuable because it decreases the size of a data set and its storage requirements.
A coarser level of granularity also reduces compute resource requirements during query execution.
Another key reason for downsampling is that a large number of records
in a time series might be redundant from an analyst’s point of view. For example, if a sensor
emits a new value once every second, but this measurement rarely changes within each 60-second interval,
the data can be rolled up to the minute level for analysis.

Another case for downsampling occurs when two different data sets need to be analyzed as one, but
they have different time granularities. For example, Sensor A in a factory collects data every 15 seconds,
but Sensor B collects related data every 30 seconds. In this case, aggregating the records into 1-minute
buckets might be a good solution. IDs and dimensions in each data set are retained as they are, but numeric
measurements are summed or averaged by a common time interval.

### Downsampling examples

You can downsample a data set that is stored in a table by using the
[TIME_SLICE](../sql-reference/functions/time_slice.md) function.
This function calculates the start and end times of fixed-width “buckets” so that individual
records can be grouped and summarized, using standard aggregate functions, such as SUM and
AVG.

Similarly, the [DATE_TRUNC](../sql-reference/functions/date_trunc.md) function truncates part of a
series of date or timestamp values, reducing their granularity. The following sections show examples of each function.

#### Downsampling with TIME_SLICE

The following example downsamples a table named `sensor_data_ts`, which contains readings
from two factory sensors and contains 5.3 million rows. These readings were ingested per second, so 5.3 million rows
represents only one month of data, with just over 2.5 million rows per sensor. You can use the TIME_SLICE function to
aggregate up to a single row per minute, per hour, or per day, for example.

To run this example, first create and load the `sensor_data_ts` table; see Creating the sensor_data_ts table.
Here is a small sample of the data in the table:

```output
+-----------+-------------------------+-------------+-----------+-----------+
| DEVICE_ID | TIMESTAMP               | TEMPERATURE | VIBRATION | MOTOR_RPM |
|-----------+-------------------------+-------------+-----------+-----------|
| DEVICE1   | 2024-03-01 00:00:00.000 |     32.6908 |    0.3158 |      1492 |
| DEVICE2   | 2024-03-01 00:00:00.000 |     35.2086 |    0.3232 |      1461 |
| DEVICE1   | 2024-03-01 00:00:01.000 |     35.9578 |    0.3302 |      1452 |
| DEVICE2   | 2024-03-01 00:00:01.000 |     26.2468 |    0.3029 |      1455 |
+-----------+-------------------------+-------------+-----------+-----------+
```

The table contains 60 readings like these per minute for each device, as shown by this query:

```sqlexample
SELECT device_id, count(*) FROM sensor_data_ts
  WHERE TIMESTAMP >= ('2024-03-01 00:01:00')
    AND TIMESTAMP < ('2024-03-01 00:02:00')
  GROUP BY device_id;
```

```output
+-----------+----------+
| DEVICE_ID | COUNT(*) |
|-----------+----------|
| DEVICE2   |       60 |
| DEVICE1   |       60 |
+-----------+----------+
```

In this downsampling query, the TIME_SLICE function defines one-minute buckets and returns the start time of each bucket.
The AVG function calculates the average temperature for each bucket per device. The COUNT(\*) function
is included for reference, just to show how many rows land in each time bucket.

The `vibration` and `motor_rpm` columns are not included, but they could be aggregated in the same way
as the `temperature` column or by using different aggregate functions.

> **Important:**
>
> If you run this example yourself, your output will not match exactly because the `sensor_data_ts` table is loaded
> with randomly generated values.

```sqlexample
SELECT
    TIME_SLICE(TO_TIMESTAMP_NTZ(timestamp), 1, 'MINUTE') minute_slice,
    device_id,
    COUNT(*),
    AVG(temperature) avg_temp
  FROM sensor_data_ts
  WHERE TIMESTAMP >= ('2024-03-01 00:01:00')
    AND TIMESTAMP < ('2024-03-01 00:02:00')
  GROUP BY 1,2
  ORDER BY 1,2;
```

```output
+-------------------------+-----------+----------+---------------+
| MINUTE_SLICE            | DEVICE_ID | COUNT(*) |      AVG_TEMP |
|-------------------------+-----------+----------+---------------|
| 2024-03-01 00:01:00.000 | DEVICE1   |       60 | 32.4315466667 |
| 2024-03-01 00:01:00.000 | DEVICE2   |       60 | 30.4967783333 |
+-------------------------+-----------+----------+---------------+
```

By using the TIME_SLICE function, you can create smaller, aggregated tables for analysis
purposes, and you can apply the downsampling process at different levels (hour, day, week, and so on).

#### Downsampling with DATE_TRUNC

The following example selects data from a table named `order_header` in the `raw.pos`
schema of the
[Tasty Bytes sample database](https://quickstarts.snowflake.com/guide/tasty_bytes_introduction/index.html#0).
This table contains 248M rows.

The `order_header` table has a TIMESTAMP column named `order_ts`. The query creates an aggregated time series by
using this column as the second argument to the DATE_TRUNC function. The first argument specifies a `day` interval.
This means that the individual records, which have an hours/minutes/seconds granularity, are rolled up by day.

The query groups the records by two dimensions: `truck_id` and
`location_id`. The `avg_amount` column returns the average price per order, per food truck, per
location for each business day on record.

The query shown here limits the results to the first 25 rows for January 1, 2022. If you remove this date filter
and the LIMIT clause, the query downsamples the original 248M rows to about 500,000 rows.

```sqlexample
SELECT DATE_TRUNC('day', order_ts)::date sliced_ts, truck_id, location_id, AVG(order_amount)::NUMBER(4,2) as avg_amount
  FROM order_header
  WHERE EXTRACT(YEAR FROM order_ts)='2022'
  GROUP BY date_trunc('day', order_ts), truck_id, location_id
  ORDER BY 1, 2, 3 LIMIT 25;
```

```output
+------------+----------+-------------+------------+
| SLICED_TS  | TRUCK_ID | LOCATION_ID | AVG_AMOUNT |
|------------+----------+-------------+------------|
| 2022-01-01 |        1 |        3223 |      19.23 |
| 2022-01-01 |        1 |        3869 |      20.15 |
| 2022-01-01 |        2 |        2401 |      39.29 |
| 2022-01-01 |        2 |        4199 |      34.29 |
| 2022-01-01 |        3 |        2883 |      35.01 |
| 2022-01-01 |        3 |        2961 |      39.15 |
| 2022-01-01 |        4 |        2614 |      35.95 |
| 2022-01-01 |        4 |        2899 |      40.29 |
| 2022-01-01 |        6 |        1946 |      26.58 |
| 2022-01-01 |        6 |       14960 |      18.59 |
| 2022-01-01 |        7 |        1427 |      26.91 |
| 2022-01-01 |        7 |        3224 |      28.88 |
| 2022-01-01 |        9 |        1557 |      35.52 |
| 2022-01-01 |        9 |        2612 |      43.80 |
| 2022-01-01 |       10 |        2217 |      32.35 |
| 2022-01-01 |       10 |        2694 |      32.23 |
| 2022-01-01 |       11 |        2656 |      44.23 |
| 2022-01-01 |       11 |        3327 |      52.00 |
| 2022-01-01 |       12 |        3181 |      52.84 |
| 2022-01-01 |       12 |        3622 |      49.59 |
| 2022-01-01 |       13 |        2516 |      31.13 |
| 2022-01-01 |       13 |        3876 |      28.13 |
| 2022-01-01 |       14 |        1359 |      72.04 |
| 2022-01-01 |       14 |        2505 |      68.75 |
| 2022-01-01 |       15 |        2901 |      41.90 |
+------------+----------+-------------+------------+
```

### Using windowed aggregations for rolling calculations

By using windowed aggregate functions to observe how a metric changes over time, you can
analyze a time series for trends. Windowed aggregations are useful for analyzing data within defined
subsets (“windows”) of a larger data set. You can compute rolling calculations (such as moving averages and sums)
for each row in a data set, taking into account a group of rows before, after, or surrounding the current row.
This kind of analysis contrasts with regular aggregations, which summarize the entire data set.

By using range-based window frames with explicit offsets, you can apply a very flexible approach to computing
these rolling aggregations. The RANGE BETWEEN window frame, ordered by either timestamps or numbers, is not disrupted
by gaps that may occur in time-series data. For instance, in the following illustration, the fact that `Day 4`
data is missing in the series of records does not affect the computation of aggregate functions over a three-day moving
window. In particular, frames 3, 4, and 5 are computed correctly, taking into account that `Day 4` data is unknown.

The following example calculates a moving sum over weather data that records hourly precipitation readings in
different cities and counties. You can run this kind of query to evaluate trends in various time-series data sets,
such as sensors and other IoT devices, especially when those data sets are known or expected to have gaps.

The window function includes in its frame the current precipitation reading and *all the readings that fall within the
specified time interval before the current reading.* The rolling calculation is based on this flexible and
logical *range* of rows rather than an exact *number* of rows. The first row for each city has matching `precip` and
`moving_sum_precip` values. After that, the sum is recalculated for each subsequent row in the frame. The raw values
fluctuate significantly, but the moving sums have a strong smoothing effect.

To run this example, follow these instructions first: [Create and load the heavy_weather table](../sql-reference/functions-window-syntax.md).
This very small table contains sporadic hourly weather observations, with lots of gaps, including a missing day. The query
returns the moving sum of precipitation values ordered by the `start_time` column. The window frame defines a
range between 12 hours before the current row and the current row. Therefore, the frame consists of the current row plus only those
rows that have timestamps up to 12 hours earlier than the ORDER BY timestamp for the current row.

```sqlexample
SELECT city, start_time, precip,
    SUM(precip) OVER(
      PARTITION BY city
      ORDER BY start_time
      RANGE BETWEEN INTERVAL '12 hours' PRECEDING AND CURRENT ROW) moving_sum_precip
  FROM heavy_weather
  WHERE city IN('South Lake Tahoe','Big Bear City')
  GROUP BY city, precip, start_time
  ORDER BY city;
```

```output
+------------------+-------------------------+--------+-------------------+
| CITY             | START_TIME              | PRECIP | MOVING_SUM_PRECIP |
|------------------+-------------------------+--------+-------------------|
| Big Bear City    | 2021-12-24 05:35:00.000 |   0.42 |              0.42 |
| Big Bear City    | 2021-12-24 16:55:00.000 |   0.09 |              0.51 |
| Big Bear City    | 2021-12-26 09:55:00.000 |   0.07 |              0.07 |
| South Lake Tahoe | 2021-12-23 16:23:00.000 |   0.56 |              0.56 |
| South Lake Tahoe | 2021-12-23 17:24:00.000 |   0.38 |              0.94 |
| South Lake Tahoe | 2021-12-23 18:30:00.000 |   0.28 |              1.22 |
| South Lake Tahoe | 2021-12-23 19:36:00.000 |   0.80 |              2.02 |
| South Lake Tahoe | 2021-12-24 06:49:00.000 |   0.17 |              0.97 |
| South Lake Tahoe | 2021-12-24 15:53:00.000 |   0.07 |              0.24 |
| South Lake Tahoe | 2021-12-26 05:43:00.000 |   0.16 |              0.16 |
| South Lake Tahoe | 2021-12-27 14:53:00.000 |   0.07 |              0.07 |
| South Lake Tahoe | 2021-12-27 17:53:00.000 |   0.07 |              0.14 |
+------------------+-------------------------+--------+-------------------+
```

The three `moving_sum_precip` values for Big Bear City are calculated as follows:

* 0.42 = 0.42 (no preceding rows)
* 0.42 + 0.09 = 0.51 (the first two rows are within the 12-hour window)
* 0.07 = 0.07 (no preceding rows are within the 12-hour window)

The South Lake Tahoe rows include these calculations, for example:

* 0.56 + 0.38 + 0.28 + 0.80 = 2.02 (all four rows for 2024-12-23 are within 12 hours of each other)
* 0.80 + 0.17 = 0.97 (one preceding row is within the 12-hour window)

Other window functions, such as the
[LEAD](../sql-reference/functions/lead.md) and [LAG](../sql-reference/functions/lag.md) ranking functions,
are also commonly used in time-series analysis. Use the LEAD window function to find the next data point in the time series,
relative to the current data point, and the LAG function to find the previous data point.

### Visualizing query results in Snowsight

You can use Snowsight to visualize the results of aggregation queries, and get a
better sense of the smoothing effect of calculations with sliding window frames.
In the query worksheet, click the Chart button next to Results.

For example, the yellow line in the following bar chart shows a much smoother trend for average
temperature versus the blue line for the raw temperature. The query itself looks like this:

```sqlexample
SELECT device_id, timestamp, temperature, AVG(temperature)
  OVER (PARTITION BY device_id ORDER BY timestamp
    ROWS BETWEEN 6 PRECEDING AND CURRENT ROW) AS avg_temp
FROM sensor_data_ts
WHERE timestamp BETWEEN '2024-03-15 00:00:59.000' AND '2024-03-15 00:01:10.000'
ORDER BY 1, 2;
```

### Using the MIN_BY and MAX_BY aggregate functions

The ability to select one column based on the minimum or maximum value of another column in the
same row is a common requirement for SQL developers who are working with time-series data.
[MIN_BY](../sql-reference/functions/min_by.md) and [MAX_BY](../sql-reference/functions/max_by.md) are
convenience functions that return the starting and ending (or highest and lowest, or first and last)
values in a table when the data is sorted by some other column, such as a timestamp.

The first example simply finds the last (most recent) `precip` value in the whole table. The MAX_BY function sorts all the rows
by their `start_time` value, then returns the `precip` value for the “max” start time.

To create and load the table used in the following examples, see Creating the heavy_weather table.

```sqlexample
SELECT MAX_BY(precip, start_time) most_recent_precip
  FROM heavy_weather;
```

```output
+--------------------+
| MOST_RECENT_PRECIP |
|--------------------|
|               0.07 |
+--------------------+
```

You can verify this result (and get more information about it) by running this query:

```sqlexample
SELECT * FROM heavy_weather WHERE start_time=
  (SELECT MAX(start_time) FROM heavy_weather);
```

```output
+-------------------------+--------+-------+-------------+
| START_TIME              | PRECIP | CITY  | COUNTY      |
|-------------------------+--------+-------+-------------|
| 2021-12-30 20:53:00.000 |   0.07 | Lebec | Los Angeles |
+-------------------------+--------+-------+-------------+
```

You can add a GROUP BY clause to ask more interesting questions about this data. For example, the following query
finds the last precipitation value that was observed for each city in California, ordered by `precip` values
(high to low). The results are grouped by `city` to return the last `precip` value for each different city.

```sqlexample
SELECT city, MAX_BY(precip, start_time) most_recent_precip
  FROM heavy_weather
  GROUP BY city
  ORDER BY 2 DESC;
```

```output
+------------------+--------------------+
| CITY             | MOST_RECENT_PRECIP |
|------------------+--------------------|
| Alta             |               0.89 |
| Bishop           |               0.75 |
| Mammoth Lakes    |               0.37 |
| Alturas          |               0.23 |
| Mount Shasta     |               0.09 |
| South Lake Tahoe |               0.07 |
| Big Bear City    |               0.07 |
| Montague         |               0.07 |
| Lebec            |               0.07 |
+------------------+--------------------+
```

The last time an observation was taken for the city of Alta, the `precip` value was `0.89`,
and the last time an observation was taken for the cities of South Lake Tahoe, Big Bear City, Montague, and Lebec, the `precip`
value was `0.07` for all four locations. (Note that the query does not tell you when those observations were taken.)

You can return the “opposite” result set (oldest `precip` record versus most recent) by using the MIN_BY function.

```sqlexample
SELECT city, MIN_BY(precip, start_time) oldest_precip
  FROM heavy_weather
  GROUP BY city
  ORDER BY 2 DESC;
```

```output
+------------------+---------------+
| CITY             | OLDEST_PRECIP |
|------------------+---------------|
| South Lake Tahoe |          0.56 |
| Big Bear City    |          0.42 |
| Mammoth Lakes    |          0.37 |
| Alta             |          0.25 |
| Alturas          |          0.23 |
| Bishop           |          0.08 |
| Lebec            |          0.08 |
| Mount Shasta     |          0.08 |
| Montague         |          0.07 |
+------------------+---------------+
```

## Joining time-series data

You can use the [ASOF JOIN](../sql-reference/constructs/asof-join.md) construct to join tables that
contain time-series data. Although ASOF JOIN queries can be emulated through the use of complex SQL, other types
of joins, and window functions, these queries are easier to write (and are optimized) if you use the ASOF JOIN syntax.

A common use for ASOF joins is the analysis of financial trading data.
Transaction-cost analysis, for example, requires “slippage” calculations, which measure the difference
between the price quoted at the time of a decision to buy stocks and the price actually paid when the trade
was executed and recorded. The ASOF JOIN can expedite this type of analysis. Given that the key capability
of this join method is the analysis of one time series with respect to another, ASOF JOIN can be useful for
analyzing any data set that is historical in nature. In many of these use cases, ASOF JOIN can be used to
associate data when readings from different devices have timestamps that are not exactly the same.

The assumption is that the time-series data you need to analyze exists in two tables, and there is a timestamp
for each row in each table. This timestamp represents the precise “as of” date and time for a recorded event.
For each row in the first (or left) table, the join uses a “match condition” with a comparison operator that
you specify to find a single row in the second (or right) table where the timestamp value is one of the
following:

* Less than or equal to the timestamp value in the left table.
* Greater than or equal to the timestamp value in the left table.
* Less than the timestamp value in the left table.
* Greater than the timestamp value in the left table.

The qualifying row on the right side is the closest match, which could be equal in time, earlier in time, or
later in time, depending on the specified comparison operator.

The cardinality of the result of the ASOF JOIN is always equal to the cardinality of the left table.
If the left table contains 40 million rows, the ASOF JOIN returns 40 million rows. Therefore, the left table
can be thought of as the “preserving” table, and the right table as the “referenced” table.

### Joining two tables on the closest match (alignment)

For example, in a financial application, you might have a table named `quotes` and a table named `trades`.
One table records the history of bids to buy stock, and the other records the history of actual trades.
A bid to buy stocks happens before the trade (or possibly at the “same” time, depending on the granularity of
the recorded time). Both tables have timestamps, and both have other columns of interest that you might want to
compare. A simple ASOF JOIN query will return the closest quote (in time) before each trade. In other words,
the query asks: What was the price of a given stock at the time I made a trade?

Assume that the `trades` table contains three rows, and the `quotes` table contains seven rows.
The background color of the cells shows which three rows from `quotes` will qualify for the ASOF JOIN when the
rows are joined on matching stock symbols and their timestamp columns are compared.

**TRADES Table (Left or “Preserving” Table)**

**QUOTES Table (Right or “Referenced” Table)**

This conceptual example is easy to turn into a specific ASOF JOIN query:

```sqlexample
SELECT t.stock_symbol, t.trade_time, t.quantity, q.quote_time, q.price
  FROM trades t ASOF JOIN quotes q
    MATCH_CONDITION(t.trade_time >= quote_time)
    ON t.stock_symbol=q.stock_symbol
  ORDER BY t.stock_symbol;
```

```output
+--------------+-------------------------+----------+-------------------------+--------------+
| STOCK_SYMBOL | TRADE_TIME              | QUANTITY | QUOTE_TIME              |        PRICE |
|--------------+-------------------------+----------+-------------------------+--------------|
| AAPL         | 2023-10-01 09:00:05.000 |     2000 | 2023-10-01 09:00:03.000 | 139.00000000 |
| SNOW         | 2023-10-01 09:00:05.000 |     1000 | 2023-10-01 09:00:02.000 | 163.00000000 |
| SNOW         | 2023-10-01 09:00:10.000 |     1500 | 2023-10-01 09:00:08.000 | 165.00000000 |
+--------------+-------------------------+----------+-------------------------+--------------+
```

The ON condition groups the matched rows by their stock symbols.

To run this example, create and load the tables as follows:

```sqlexample
CREATE OR REPLACE TABLE trades (
  stock_symbol VARCHAR(4),
  trade_time TIMESTAMP_NTZ(9),
  quantity NUMBER(38,0)
  );

CREATE OR REPLACE TABLE quotes (
  stock_symbol VARCHAR(4),
  quote_time TIMESTAMP_NTZ(9),
  price NUMBER(12,8)
  );

INSERT INTO trades VALUES
  ('SNOW','2023-10-01 09:00:05.000', 1000),
  ('AAPL','2023-10-01 09:00:05.000', 2000),
  ('SNOW','2023-10-01 09:00:10.000', 1500);

INSERT INTO quotes VALUES
  ('SNOW','2023-10-01 09:00:01.000', 166.00),
  ('SNOW','2023-10-01 09:00:02.000', 163.00),
  ('SNOW','2023-10-01 09:00:07.000', 166.00),
  ('SNOW','2023-10-01 09:00:08.000', 165.00),
  ('AAPL','2023-10-01 09:00:03.000', 139.00),
  ('AAPL','2023-10-01 09:00:07.000', 142.00),
  ('AAPL','2023-10-01 09:00:11.000', 142.00);
```

For more examples of ASOF JOIN queries, see [Examples](../sql-reference/constructs/asof-join.md).

## Filling gaps in time-series data

Time-series analysis often requires data to have a consistent granularity with records for every interval, yet real-world data often
arrives at irregular intervals or contains gaps. For instance, you might have a predominantly hourly data set but need to generate
half-hour entries to align with downstream analytics, or you might already have a consistent resolution but discover gaps in the series.
Snowflake gap-filling functionality provides efficient ways to apply a uniform interval to time-series data and fill any gaps.

For example, consider the following eight records, which capture weather observations for two cities in California on March 15, 2025.

```output
+-------------------------+-------------+------------------+----------------+
| OBSERVED                | TEMPERATURE | CITY             | COUNTY         |
|-------------------------+-------------+------------------+----------------|
| 2025-03-15 09:49:00.000 |          48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |          44 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |          49 | Big Bear City    | San Bernardino |
| 2025-03-15 09:55:00.000 |          46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:10:00.000 |          51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:10:00.000 |          52 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:15:00.000 |          54 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:18:00.000 |          54 | Big Bear City    | San Bernardino |
+-------------------------+-------------+------------------+----------------+
```

Although these records have a somewhat consistent level of granularity (day, hour, minute), the intervals between the rows are
inconsistent, varying between 1 and 15 minutes. If the goal is to collect data at five-minute intervals, several rows are missing.

### Using the RESAMPLE clause

You can modify the granularity and improve the consistency of a set of rows by “upsampling” them to a specific time interval. To make
this kind of change, use the [RESAMPLE](../sql-reference/constructs/resample.md) clause, which you define within the FROM clause of a SELECT statement. The result of a
resampled data set is a *larger* data set that preserves all of the existing input rows and generates some number of new rows with values that
fill gaps in the time series. (Note that you can also use the RESAMPLE clause to “downsample” rows into a smaller, more coarse-grained result set.)

By definition, a time series always has a column that contains a sequence of dates, timestamps, or numeric values that represent dates or times.
Resampling operates on such a column in the source table, and the required granularity must be specified with an INTERVAL value, such as
`5 minutes`, `30 minutes`, or `1 hour`.

Typically, you also define partitions that create time-series rows over certain dimensions, rather than just generating one new timestamp per interval.

The structure of a RESAMPLE query looks like this:

```sqlexample
SELECT *
  FROM time_series_table
    RESAMPLE (
      USING time_series_column
      INCREMENT BY INTERVAL '5 minutes'
      PARTITION BY other_column_1, other_column_2)
  ORDER BY time_series_column;
```

Columns in the rows that are generated are set to NULL, except for the columns specified in the USING and PARTITION BY clauses. The specified date, time,
or numeric column and the partitioning columns have meaningful generated values.

> **Note:**
>
> If you plan to filter your resampled data by specific values (for example, a specific device ID or location), include those columns in the PARTITION BY clause.
> This ensures that generated rows have real values for those columns rather than NULL values. If you filter with a WHERE clause on columns that are not in the PARTITION BY clause, the WHERE clause filters out all generated rows for those columns because they contain NULL values.

To run a simple example that uses the eight records shown earlier, start by creating and loading the following table:

```sqlexample
CREATE OR REPLACE TABLE march_temps
 (observed TIMESTAMP, temperature INT, city VARCHAR(20), county VARCHAR(20));

INSERT INTO march_temps VALUES
  ('2025-03-15 09:50:00.000',44,'South Lake Tahoe','El Dorado'),
  ('2025-03-15 09:55:00.000',46,'South Lake Tahoe','El Dorado'),
  ('2025-03-15 10:10:00.000',52,'South Lake Tahoe','El Dorado'),
  ('2025-03-15 10:15:00.000',54,'South Lake Tahoe','El Dorado'),
  ('2025-03-15 09:49:00.000',48,'Big Bear City','San Bernardino'),
  ('2025-03-15 09:55:00.000',49,'Big Bear City','San Bernardino'),
  ('2025-03-15 10:10:00.000',51,'Big Bear City','San Bernardino'),
  ('2025-03-15 10:18:00.000',54,'Big Bear City','San Bernardino')
;
```

Now select upsampled rows from that table, using an interval of `5 minutes`:

```sqlexample
SELECT *
  FROM march_temps
    RESAMPLE (
      USING observed
      INCREMENT BY INTERVAL '5 minutes')
  ORDER BY observed;
```

```output
+-------------------------+-------------+------------------+----------------+
| OBSERVED                | TEMPERATURE | CITY             | COUNTY         |
|-------------------------+-------------+------------------+----------------|
| 2025-03-15 09:45:00.000 |        NULL | NULL             | NULL           |
| 2025-03-15 09:49:00.000 |          48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |          44 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |          46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |          49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:00:00.000 |        NULL | NULL             | NULL           |
| 2025-03-15 10:05:00.000 |        NULL | NULL             | NULL           |
| 2025-03-15 10:10:00.000 |          52 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:10:00.000 |          51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:15:00.000 |          54 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:18:00.000 |          54 | Big Bear City    | San Bernardino |
+-------------------------+-------------+------------------+----------------+
```

This query preserves the original eight rows and generates three new rows, filling gaps for three time intervals, at `09:45`, `10:00`, and `10:05`.
NULL values are inserted into the `temperature`, `city`, and `county` columns.

The starting point for the time series is `2025-03-15 09:45:00.000` because it is within 5 minutes of the earliest timestamp in the input data set
(`2025-03-15 09:49:00.000`).

If you want to remove rows that don’t occur at uniform intervals (`09:49` and `10:18` in this case), see [RESAMPLE example that uses BUCKET_START() to filter out non-uniform rows](../sql-reference/constructs/resample.md).

Now add a PARTITION BY clause to the query:

```sqlexample
SELECT *
  FROM march_temps
    RESAMPLE (
      USING observed
      INCREMENT BY INTERVAL '5 minutes'
      PARTITION BY city, county)
  ORDER BY city, county, observed;
```

```output
+-------------------------+-------------+------------------+----------------+
| OBSERVED                | TEMPERATURE | CITY             | COUNTY         |
|-------------------------+-------------+------------------+----------------|
| 2025-03-15 09:45:00.000 |        NULL | Big Bear City    | San Bernardino |
| 2025-03-15 09:49:00.000 |          48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |        NULL | Big Bear City    | San Bernardino |
| 2025-03-15 09:55:00.000 |          49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:00:00.000 |        NULL | Big Bear City    | San Bernardino |
| 2025-03-15 10:05:00.000 |        NULL | Big Bear City    | San Bernardino |
| 2025-03-15 10:10:00.000 |          51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:15:00.000 |        NULL | Big Bear City    | San Bernardino |
| 2025-03-15 10:18:00.000 |          54 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |          44 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |          46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:00:00.000 |        NULL | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:05:00.000 |        NULL | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:10:00.000 |          52 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:15:00.000 |          54 | South Lake Tahoe | El Dorado      |
+-------------------------+-------------+------------------+----------------+
```

The partitioned results are different in two ways:

* Seven rows are generated, for a total of 15 rows. A row now exists for every 5-minute interval for every partition.
* The partitioning columns have correctly generated `city` and `county` values. The only column that has NULL values in the generated rows is `temperature`.

You can also specify the METADATA_COLUMNS parameter in the RESAMPLE syntax to add the following columns to the result:

* The `is_generated` metadata column identifies the rows that were generated by the RESAMPLE operation and the rows that were already present.
* The `bucket_start` metadata column returns the value that marks the beginning of the current bucket or interval
  that the RESAMPLE operation produces. You can use this column to identify which interval a particular row belongs to after resampling, and you can
  use it to run aggregate queries on resampled data. See [RESAMPLE example that uses BUCKET_START() to aggregate resampled rows](../sql-reference/constructs/resample.md).

For the complete RESAMPLE syntax, see [RESAMPLE](../sql-reference/constructs/resample.md).

To store the results of a RESAMPLE query, use a [CTAS statement](../sql-reference/sql/create-table.md) that selects and inserts the data into a new table:

```sqlexample
CREATE OR REPLACE TABLE march_temps_every_five_mins AS
  SELECT *
    FROM march_temps
      RESAMPLE (
        USING observed
        INCREMENT BY INTERVAL '5 minutes'
        PARTITION BY city, county)
    ORDER BY city, county, observed;
```

### Interpolating or “gap-filling” values into a time series

Although you can use the RESAMPLE syntax and interpolation functions independently, they are most commonly used
together to gap-fill time-series data in the scope of a single query. Having resampled your data set, you can
call an interpolation function to update the other columns of interest in the newly generated rows. The interpolation process updates
columns that were previously NULL, such as numeric measurements, giving them meaningful values based on values found in the
preceding or following rows.

You can interpolate values by calling the INTERPOLATE_FFILL, INTERPOLATE_BFILL, and INTERPOLATE_LINEAR window functions.
For example, the INTERPOLATE_FFILL function finds the previous (last) value in the time series for the column in question:

```sqlexample
SELECT observed,
    INTERPOLATE_FFILL(temperature) OVER (PARTITION BY city, county ORDER BY observed) ffill_temp,
    city, county
  FROM march_temps_every_five_mins
  ORDER BY city, county, observed;
```

```output
+-------------------------+------------+------------------+----------------+
| OBSERVED                | FFILL_TEMP | CITY             | COUNTY         |
|-------------------------+------------+------------------+----------------|
| 2025-03-15 09:45:00.000 |       NULL | Big Bear City    | San Bernardino |
| 2025-03-15 09:49:00.000 |         48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |         48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:55:00.000 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:00:00.000 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:05:00.000 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:10:00.000 |         51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:15:00.000 |         51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:18:00.000 |         54 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |         44 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:00:00.000 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:05:00.000 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:10:00.000 |         52 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:15:00.000 |         54 | South Lake Tahoe | El Dorado      |
+-------------------------+------------+------------------+----------------+
```

The first row returns NULL for the `ffill_temp` column because there is no previous row for the INTERPOLATE_FFILL function to use.

For more information about these window functions, see [INTERPOLATE_BFILL, INTERPOLATE_FFILL, INTERPOLATE_LINEAR](../sql-reference/functions/interpolate_bfill.md).

### Upsampling, gap-filling, and storing the results in one operation

To simplify the whole process of gap-filling a data set, you can upsample data and interpolate values within a single query, and save
the results by using a CTAS operation. For example, the following CTAS statement creates a new table that interpolates measurements into a
an upsampled data set:

```sqlexample
CREATE OR REPLACE TABLE march_temps_every_five_mins_with_interpolations
  (observed TIMESTAMP, temperature INT, ffill INT, bfill INT, linear INT, city VARCHAR(20), county VARCHAR(20))
  AS
  SELECT observed, temperature,
    INTERPOLATE_FFILL(temperature) OVER (PARTITION BY city ORDER BY observed) ffill,
    INTERPOLATE_BFILL(temperature) OVER (PARTITION BY city ORDER BY observed) bfill,
    INTERPOLATE_LINEAR(temperature) OVER (PARTITION BY city ORDER BY observed) linear,
    city,
    county
  FROM march_temps
    RESAMPLE(
      USING observed
      INCREMENT BY INTERVAL '5 minutes'
      PARTITION BY city, county)
  ORDER BY observed;
```

> **Note:**
>
> When you use INTERPOLATE functions with resampling, the columns you specify in the OVER (PARTITION BY) clause for window functions typically match
> the columns in the RESAMPLE (PARTITION BY) clause. This approach ensures that interpolation happens within the same logical partitions that were created
> during resampling. In the previous example, resampling is partitioned by `city` and `county`, while the INTERPOLATE functions partition by `city` only.
> This example works because the interpolation is happening at a coarser granularity, but you should always make sure that the partitioning strategy aligns with your data requirements.

## Gap-filling with ASOF JOIN

> **Note:**
>
> To use the recommended approach for gap-filling and interpolation, see Filling gaps in time-series data. The RESAMPLE construct and
> INTERPOLATE functions are preview features, and the following ASOF JOIN approach to gap-filling is included only as a potential workaround.

In addition to aligning the data in two tables by finding non-exact matches on time-based columns, ASOF JOIN is useful for filling gaps in a time series when your raw data table is missing rows for particular dates or timestamps. For example, when rows are missing because faulty equipment, or a power failure, results in skipped sensor readings, you can use ASOF JOIN to interpolate values from a generated time series into the table. The missing rows are filled in with the last known value for the readings that are missing. This value is also known as the “last observation carried forward” (LOCF). The ASOF JOIN query returns a complete set of rows that are in chronological order and contiguous.

To use ASOF JOIN for interpolation, follow these steps:

1. Identify the gaps in your table by running a simple query.
2. Generate a complete time series, with the appropriate grain, for the period of time that you need to cover. For example, your time series might
   be a simple sequence of dates for a particular year, or a much more granular sequence of timestamps per second for some number of days. You can use
   SQL or a spreadsheet application to generate the list of values.

   The time series will also need a meaningful ID or dimension for each row that you will specify later in the ASOF JOIN ON condition.
3. Write an ASOF JOIN query that interpolates values into the missing rows. The generated time series will be the preserving table and the raw data table
   will be the referenced table.

The following example requires the `sensor_data_ts` table. If you haven’t already created and loaded it, see
Creating the sensor_data_ts table. To simulate the need for a gap-filling operation, delete some rows from the table as follows:

```sqlexample
DELETE FROM sensor_data_ts
  WHERE device_id='DEVICE2'
    AND TIMESTAMP > ('2024-03-07 00:01:15')
    AND TIMESTAMP <= ('2024-03-07 00:01:20');
```

The result is a table that is missing five rows for `DEVICE2` on March 7th (1:16 through 1:20).

```output
+------------------------+
| number of rows deleted |
|------------------------|
|                      5 |
+------------------------+
```

Now follow these steps to complete the gap-filling exercise.

> **Note:**
>
> If you run this example yourself, your output will not match exactly because the `sensor_data_ts` table is loaded
> with randomly generated values.

### Step 1: Verify that the table has gaps

Run the following query to identify the gaps:

```sqlexample
SELECT * FROM sensor_data_ts
  WHERE device_id='DEVICE2'
  AND TIMESTAMP >= ('2024-03-07 00:01:15')
  AND TIMESTAMP <= ('2024-03-07 00:01:21')
ORDER BY TIMESTAMP;
```

```output
+-----------+-------------------------+-------------+-----------+-----------+
| DEVICE_ID | TIMESTAMP               | TEMPERATURE | VIBRATION | MOTOR_RPM |
|-----------+-------------------------+-------------+-----------+-----------|
| DEVICE2   | 2024-03-07 00:01:15.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:21.000 |     28.0426 |    0.2944 |      1448 |
+-----------+-------------------------+-------------+-----------+-----------+
```

This query returns two rows for `DEVICE2`: the last row before the gap and the first row
after the gap.

### Step 2: Generate a complete time series to cover the known gaps

To generate a time series with a fine grain (one row per second) for the gap in the `sensor_data_ts`
table, create the following table, which contains generated timestamps:

```sqlexample
CREATE OR REPLACE TABLE continuous_timestamps AS
  SELECT 'DEVICE2' as DEVICE_ID,
    DATEADD('SECOND', ROW_NUMBER() OVER (ORDER BY SEQ8()), '2024-03-07 00:01:15')::TIMESTAMP_NTZ AS TS
  FROM TABLE(GENERATOR(ROWCOUNT => 5));
```

In this SQL statement, `5` is the number of seconds that you need to cover the gap. Note that the device ID value
(`DEVICE2`) is included in the generated rows.

The following query returns the five generated rows.

```sqlexample
SELECT * FROM continuous_timestamps ORDER BY ts;
```

```output
+-----------+-------------------------+
| DEVICE_ID | TS                      |
|-----------+-------------------------|
| DEVICE2   | 2024-03-07 00:01:16.000 |
| DEVICE2   | 2024-03-07 00:01:17.000 |
| DEVICE2   | 2024-03-07 00:01:18.000 |
| DEVICE2   | 2024-03-07 00:01:19.000 |
| DEVICE2   | 2024-03-07 00:01:20.000 |
+-----------+-------------------------+
```

### Step 3: Interpolate values by using ASOF JOIN

Now you can run an ASOF JOIN query that joins `continuous_timestamps` to `sensor_data_ts` and
interpolates values for missing rows for `DEVICE2`. The match condition finds the closest
row in time for each missing row, and the ON condition guarantees that interpolation occurs
on matching device IDs.

The closest row for the missing rows is the row with the `2024-03-07 00:01:16.000` timestamp,
assuming that `>=` is specified in the match condition, as shown in this example.

```sqlexample
INSERT INTO sensor_data_ts(device_id, timestamp, temperature, vibration, motor_rpm)
  SELECT t.device_id, t.ts, s.temperature, s.vibration, s.motor_rpm
    FROM continuous_timestamps t
      ASOF JOIN sensor_data_ts s
        MATCH_CONDITION(t.ts >= s.timestamp)
        ON t.device_id = s.device_id
    WHERE TIMESTAMP >= ('2024-03-07 00:01:15')
      AND TIMESTAMP < ('2024-03-07 00:01:21');
```

This INSERT statement selects five rows from the ASOF JOIN operation and inserts them into the
`sensor_data_ts` table.

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       5 |
+-------------------------+
```

To check the results of the interpolation, select those five rows, and the two rows that directly precede and
follow them, from the `sensor_data_ts` table. Note that the five interpolated rows have picked up the same values
for the `temperature`, `vibration`, and `motor_rpm` columns that were recorded in the `2024-03-07 00:01:15.000`
row. The interpolation was successful.

```sqlexample
SELECT * FROM sensor_data_ts
  WHERE device_id='DEVICE2'
    AND TIMESTAMP >= ('2024-03-07 00:01:15')
    AND TIMESTAMP <= ('2024-03-07 00:01:21')
  ORDER BY TIMESTAMP;
```

```output
+-----------+-------------------------+-------------+-----------+-----------+
| DEVICE_ID | TIMESTAMP               | TEMPERATURE | VIBRATION | MOTOR_RPM |
|-----------+-------------------------+-------------+-----------+-----------|
| DEVICE2   | 2024-03-07 00:01:15.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:16.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:17.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:18.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:19.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:20.000 |     30.1088 |    0.2960 |      1457 |
| DEVICE2   | 2024-03-07 00:01:21.000 |     28.0426 |    0.2944 |      1448 |
+-----------+-------------------------+-------------+-----------+-----------+
```

## Applying ML-based functions to time-series data

You can train a model with ML Functions to do predictive analysis on time-series data:

* [Time-Series Forecasting](ml-functions/forecasting.md)
* [Anomaly Detection](ml-functions/anomaly-detection.md)
* [Top Insights](ml-functions/top-insights.md)

Forecasting uses historical time-series data to make predictions about future data. Given a recorded time series
with actual observed values for dates and times in the past, the ML model forecasts what the observed values might be
for dates and times in the future.

Anomaly detection identifies outliers, which are data points that deviate from an expected range. In the context of
a time series, an outlier is a measurement that is much larger or smaller than other measurements in a
similar time interval. To find outliers, the ML function produces a forecast for the same time period that
is being checked for anomalies, then compares the forecast results to the actual data.

Top Insights finds the most important dimensions in a data set, builds segments from those dimensions, and detects which
of those segments influenced a metric.

> **Note:**
>
> For machine-learning purposes, the timestamps in your time series must represent fixed time intervals. If necessary,
> you can use the DATE_TRUNC or TIME_SLICE function on TIMESTAMP columns to remove irregularities when training the
> forecast model.

### An example of anomaly detection in a time series

The following example uses a view with only 30 rows to train an anomaly detection model. Start by generating data into a
table, then create a view on the table. The view is not required (you can use a table to train a model), but the view option
gives you some flexibility to train models iteratively, with different row counts, without updating the source data.

> **Note:**
>
> If you run this example yourself, your output will not match exactly because the `sensor_data_30_rows` table is loaded
> with randomly generated values.

```sqlexample
CREATE OR REPLACE TABLE sensor_data_30_rows (
  device_id VARCHAR(10),
  timestamp TIMESTAMP,
  temperature DECIMAL(6,4),
  vibration DECIMAL(6,4),
  motor_rpm INT);

INSERT INTO sensor_data_30_rows (device_id, timestamp, temperature, vibration, motor_rpm)
  SELECT 'DEVICE3', timestamp,
    UNIFORM(30.2345, 36.3456, RANDOM()), --
    UNIFORM(0.4000, 0.4718, RANDOM()), --
    UNIFORM(1510, 1625, RANDOM()) --
  FROM (
    SELECT DATEADD(SECOND, SEQ4(), '2024-03-01') AS timestamp
      FROM TABLE(GENERATOR(ROWCOUNT => 30))
  );

CREATE OR REPLACE VIEW sensor_data_view AS SELECT * FROM sensor_data_30_rows;
```

Now create the model:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION sensor_model(
  INPUT_DATA => SYSTEM$REFERENCE('VIEW', 'sensor_data_view'),
  TIMESTAMP_COLNAME => 'timestamp',
  TARGET_COLNAME => 'temperature',
  LABEL_COLNAME => '');
```

```output
+---------------------------------------------+
| status                                      |
|---------------------------------------------|
| Instance SENSOR_MODEL successfully created. |
+---------------------------------------------+
```

When the model has built successfully, call the [<model_name>!DETECT_ANOMALIES](../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md)
method to detect outliers in the specified test data set. The timestamps in the test data must chronologically follow the
timestamps in the training data, but there must not be too great a gap in time between the training data and the test data. For
example, if you have timestamps for every second, do not use test data that is millions of seconds ahead of the training data.

This example uses another table as the test data, with only three rows. These rows have timestamps that closely follow those
in the training data.

```sqlexample
CREATE OR REPLACE TABLE sensor_data_device3 (
  device_id VARCHAR(10),
  timestamp TIMESTAMP,
  temperature DECIMAL(6,4),
  vibration DECIMAL(6,4),
  motor_rpm INT);

INSERT INTO sensor_data_device3 VALUES
  ('DEVICE3','2024-03-01 00:00:30.000',36.0422,0.4226,1560),
  ('DEVICE3','2024-03-01 00:00:31.000',36.1519,0.4341,1515),
  ('DEVICE3','2024-03-01 00:00:32.000',36.1524,0.4321,1591);

CALL sensor_model!DETECT_ANOMALIES(
  INPUT_DATA => SYSTEM$REFERENCE('TABLE', 'sensor_data_device3'),
  TIMESTAMP_COLNAME => 'timestamp',
  TARGET_COLNAME => 'temperature'
);
```

When the anomaly detection call finishes, it returns output similar to the following:

```output
+-------------------------+---------+--------------+--------------+--------------+------------+--------------+-------------+
| TS                      |       Y |     FORECAST |  LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |   PERCENTILE |    DISTANCE |
|-------------------------+---------+--------------+--------------+--------------+------------+--------------+-------------|
| 2024-03-01 00:00:30.000 | 36.0422 | 30.809998241 | 25.583156942 | 36.036839539 | True       | 0.9950380683 | 2.578470982 |
| 2024-03-01 00:00:31.000 | 36.1519 | 32.559470456 | 27.332629158 | 37.786311755 | False      | 0.961667911  | 1.770378085 |
| 2024-03-01 00:00:32.000 | 36.1524 | 32.205610776 | 26.978769478 | 37.432452075 | False      | 0.9741130751 | 1.945009377 |
+-------------------------+---------+--------------+--------------+--------------+------------+--------------+-------------+
```

The `TS` and `Y` columns return the timestamps and temperature values from the test data. In this very small test case,
the function found an anomaly (`IS_ANOMALY=True`). For more information about the output columns, see the “Returns” section in the
[function description](../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md).

## Creating the sensor_data_ts table

If you want to test the examples in this section that query the `sensor_data_ts` table, you can create and load
a copy of this table by running the following SQL script. The script generates one month of synthetic data for sensor
readings by calling the UNIFORM, RANDOM, and GENERATOR functions; therefore, your copy of the table will not return identical
results. The readings will be in the same range but they will not be the same.

```sqlexample
 CREATE OR REPLACE TABLE sensor_data_device1 (
   device_id VARCHAR(10),
   timestamp TIMESTAMP,
   temperature DECIMAL(6,4),
   vibration DECIMAL(6,4),
   motor_rpm INT
 );

 INSERT INTO sensor_data_device1 (device_id, timestamp, temperature, vibration, motor_rpm)
   SELECT 'DEVICE1', timestamp,
     UNIFORM(25.1111, 40.2222, RANDOM()), -- Temperature range in °C
     UNIFORM(0.2985, 0.3412, RANDOM()), -- Vibration range in mm/s
     UNIFORM(1400, 1495, RANDOM()) -- Motor RPM range
   FROM (
     SELECT DATEADD(SECOND, SEQ4(), '2024-03-01') AS timestamp
       FROM TABLE(GENERATOR(ROWCOUNT => 2678400)) -- seconds in 31 days
 );

CREATE OR REPLACE TABLE sensor_data_device2 (
   device_id VARCHAR(10),
   timestamp TIMESTAMP,
   temperature DECIMAL(6,4),
   vibration DECIMAL(6,4),
   motor_rpm INT
 );

INSERT INTO sensor_data_device2 (device_id, timestamp, temperature, vibration, motor_rpm)
   SELECT 'DEVICE2', timestamp,
     UNIFORM(24.6642, 36.3107, RANDOM()), -- Temperature range in °C
     UNIFORM(0.2876, 0.3333, RANDOM()), -- Vibration range in mm/s
     UNIFORM(1425, 1505, RANDOM()) -- Motor RPM range
   FROM (
     SELECT DATEADD(SECOND, SEQ4(), '2024-03-01') AS timestamp
       FROM TABLE(GENERATOR(ROWCOUNT => 2678400)) -- seconds in 31 days
 );

 INSERT INTO sensor_data_device1 SELECT * FROM sensor_data_device2;

 DROP TABLE IF EXISTS sensor_data_ts;

 ALTER TABLE sensor_data_device1 rename to sensor_data_ts;

 DROP TABLE sensor_data_device2;

 SELECT COUNT(*) FROM sensor_data_ts; -- verify row count = 5356800
```

## Creating the heavy_weather table

The following script creates and loads the `heavy_weather` table, which is used in the examples
for the MAX_BY functions. The table contains 55 rows of snowfall precipitation records for
California cities during the last week of 2021.

```sqlexample
CREATE OR REPLACE TABLE heavy_weather
   (start_time TIMESTAMP, precip NUMBER(3,2), city VARCHAR(20), county VARCHAR(20));

INSERT INTO heavy_weather VALUES
  ('2021-12-23 06:56:00.000',0.08,'Mount Shasta','Siskiyou'),
  ('2021-12-23 07:51:00.000',0.09,'Mount Shasta','Siskiyou'),
  ('2021-12-23 16:23:00.000',0.56,'South Lake Tahoe','El Dorado'),
  ('2021-12-23 17:24:00.000',0.38,'South Lake Tahoe','El Dorado'),
  ('2021-12-23 18:30:00.000',0.28,'South Lake Tahoe','El Dorado'),
  ('2021-12-23 19:35:00.000',0.37,'Mammoth Lakes','Mono'),
  ('2021-12-23 19:36:00.000',0.80,'South Lake Tahoe','El Dorado'),
  ('2021-12-24 04:43:00.000',0.25,'Alta','Placer'),
  ('2021-12-24 05:26:00.000',0.34,'Alta','Placer'),
  ('2021-12-24 05:35:00.000',0.42,'Big Bear City','San Bernardino'),
  ('2021-12-24 06:49:00.000',0.17,'South Lake Tahoe','El Dorado'),
  ('2021-12-24 07:40:00.000',0.07,'Alta','Placer'),
  ('2021-12-24 08:36:00.000',0.07,'Alta','Placer'),
  ('2021-12-24 11:52:00.000',0.08,'Alta','Placer'),
  ('2021-12-24 12:52:00.000',0.38,'Alta','Placer'),
  ('2021-12-24 15:44:00.000',0.13,'Alta','Placer'),
  ('2021-12-24 15:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
  ('2021-12-24 16:55:00.000',0.09,'Big Bear City','San Bernardino'),
  ('2021-12-24 21:53:00.000',0.07,'Montague','Siskiyou'),
  ('2021-12-25 02:52:00.000',0.07,'Alta','Placer'),
  ('2021-12-25 07:52:00.000',0.07,'Alta','Placer'),
  ('2021-12-25 08:52:00.000',0.08,'Alta','Placer'),
  ('2021-12-25 09:48:00.000',0.18,'Alta','Placer'),
  ('2021-12-25 12:52:00.000',0.10,'Alta','Placer'),
  ('2021-12-25 17:21:00.000',0.23,'Alturas','Modoc'),
  ('2021-12-25 17:52:00.000',1.54,'Alta','Placer'),
  ('2021-12-26 01:52:00.000',0.61,'Alta','Placer'),
  ('2021-12-26 05:43:00.000',0.16,'South Lake Tahoe','El Dorado'),
  ('2021-12-26 05:56:00.000',0.08,'Bishop','Inyo'),
  ('2021-12-26 06:52:00.000',0.75,'Bishop','Inyo'),
  ('2021-12-26 06:53:00.000',0.08,'Lebec','Los Angeles'),
  ('2021-12-26 07:52:00.000',0.65,'Alta','Placer'),
  ('2021-12-26 09:52:00.000',2.78,'Alta','Placer'),
  ('2021-12-26 09:55:00.000',0.07,'Big Bear City','San Bernardino'),
  ('2021-12-26 14:22:00.000',0.32,'Alta','Placer'),
  ('2021-12-26 14:52:00.000',0.34,'Alta','Placer'),
  ('2021-12-26 15:43:00.000',0.35,'Alta','Placer'),
  ('2021-12-26 17:31:00.000',5.24,'Alta','Placer'),
  ('2021-12-26 22:52:00.000',0.07,'Alta','Placer'),
  ('2021-12-26 23:15:00.000',0.52,'Alta','Placer'),
  ('2021-12-27 02:52:00.000',0.08,'Alta','Placer'),
  ('2021-12-27 03:52:00.000',0.14,'Alta','Placer'),
  ('2021-12-27 04:52:00.000',1.52,'Alta','Placer'),
  ('2021-12-27 14:37:00.000',0.89,'Alta','Placer'),
  ('2021-12-27 14:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
  ('2021-12-27 17:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
  ('2021-12-30 11:23:00.000',0.12,'Lebec','Los Angeles'),
  ('2021-12-30 11:43:00.000',0.98,'Lebec','Los Angeles'),
  ('2021-12-30 13:53:00.000',0.23,'Lebec','Los Angeles'),
  ('2021-12-30 14:53:00.000',0.13,'Lebec','Los Angeles'),
  ('2021-12-30 15:15:00.000',0.29,'Lebec','Los Angeles'),
  ('2021-12-30 17:53:00.000',0.10,'Lebec','Los Angeles'),
  ('2021-12-30 18:53:00.000',0.09,'Lebec','Los Angeles'),
  ('2021-12-30 19:53:00.000',0.07,'Lebec','Los Angeles'),
  ('2021-12-30 20:53:00.000',0.07,'Lebec','Los Angeles')
  ;
```

---
title: Anomaly Detection (Snowflake ML Functions)
source: https://docs.snowflake.com/en/user-guide/ml-functions/anomaly-detection.md
section: User Guide
---

# Anomaly Detection (Snowflake ML Functions)

## Overview

Anomaly detection is the process of identifying outliers in data. The anomaly detection function lets you train a model
to detect outliers in your time-series data. Outliers, which are data points that deviate from the expected range, can
have an outsized impact on statistics and models derived from your data. Spotting and removing outliers can therefore
help improve the quality of your results.

> **Note:**
>
> Anomaly Detection is part of Snowflake’s suite of business analysis tools powered by machine learning.

Detecting outliers can also be useful in pinpointing the origin of problems or deviations in processes when there is no
obvious cause. For example:

* Determining when a problem started to occur with your logging pipeline.
* Identifying the days when your Snowflake compute costs are higher than expected.

Anomaly detection works with either single-series or multi-series data. Multi-series data represents multiple
independent threads of events. For example, if you have sales data for multiple stores, each store’s sales can be
checked separately by a single model based on the store identifier.

The data must include:

* A timestamp column.
* A target column representing some quantity of interest at each timestamp.

> **Note:**
>
> Ideally, the training data for an Anomaly Detection model has time steps at equally spaced intervals (for example,
> daily). However, model training can handle real-world data that has missing, duplicate, or misaligned time steps.
> For more information, see [Dealing with real-world data in Time-Series Forecasting](preprocessing.md).

To detect outliers in time-series data, use the Snowflake built-in class [ANOMALY_DETECTION (SNOWFLAKE.ML)](../../sql-reference/classes/anomaly_detection.md),
and follow these steps:

1. [Create an anomaly detection object](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md),
   passing in a reference to the training data.

   This object fits a model to the training data that you provide. The model is a schema-level object.
2. Using this anomaly detection model object, call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method to
   detect anomalies, passing in a reference to the data to analyze.

   The method uses the model to identify outliers in the data.

Anomaly detection is closely related to [Forecasting](forecasting.md). An anomaly detection model
produces a forecast for the same time period as the data you’re checking for anomalies, then compares the actual data to
the forecast to identify outliers.

## About the Algorithm for Anomaly Detection

The anomaly detection algorithm is powered by a [gradient boosting machine](https://en.wikipedia.org/wiki/Gradient_boosting)
(GBM). Like an [ARIMA](https://en.wikipedia.org/wiki/Autoregressive_integrated_moving_average) model, it uses a
differencing transformation to model data with a non-stationary trend and uses auto-regressive lags of the historical
target data as model variables.

Additionally, the algorithm uses rolling averages of historical target data to help predict trends, and automatically
produces cyclic calendar variables (such as day of week and week of year) from timestamp data.

You can fit models with only historical target and timestamp data, or you may include exogenous data (variables) that
might have influenced the target value. Exogenous variables can be numerical or categorical and may be NULL (rows
containing NULLs for exogenous variables are not dropped).

The algorithm does not rely on one-hot encoding when training on categorical variables, so you can use categorical data
with many dimensions (high cardinality).

If your model incorporates exogenous variables, you must provide values for those variables at timestamps in the future
when detecting anomalies. Appropriate exogenous variables could include weather data (temperature, rainfall),
company-specific information (historic and planned company holidays, advertisement campaigns, event schedules), or any
other external factors you believe may help predict your target variable.

Optionally, individual historical rows can be labeled as anomalous or non-anomalous by using a separate Boolean column.

A *prediction interval* is an estimated range of values within an upper bound and a lower bound in which a certain
percentage of data is likely to fall. For example, a 0.99 value means that 99% of the data likely appears within the
interval. The anomaly detection model identifies any data that falls outside of the prediction interval as an anomaly. You can
specify a prediction interval or use the default, which is 0.99. You may want to set this value to be very close
to 1.0; 0.9999 or even closer.

> **Important:**
>
> From time to time, Snowflake may refine the anomaly detection algorithm. Such improvements roll out
> through the regular Snowflake release process. You cannot revert to a previous version of the feature, but models you
> created with a previous version continue to use that version for anomaly detection.

### Limitations

* You cannot choose or adjust the anomaly detection algorithm. In particular, the algorithm does not provide parameters
  to override trend, seasonality, or seasonal amplitudes; these are inferred from the data.
* The minimum number of rows for the main anomaly detection algorithm is 12 per time series. For time series with
  between 2 and 11 observations, anomaly detection produces a “naive” result in which all predicted values are equal to
  the last observed target value. For the labeled anomaly detection case, the number of observations used is the number
  of rows where the label column is false.
* The minimum acceptable granularity of data is one second. (Timestamps must not be less than one second apart.)
* The minimum granularity of seasonal components is one minute. (The function cannot detect cyclic patterns at smaller
  time deltas.)
* The “season length” of autoregressive features is tied to the input frequency (24 for hourly data, 7 for daily data,
  and so on).
* Anomaly detection models, once trained, are immutable. You cannot update existing models with new data; you must train
  an entirely new model. Models do not support versioning. Generally, you should retrain models on a regular cadence,
  such as once a day, once a week, or once a month, depending on how frequently you receive new data, to help the model
  keep up with changing trends.
* This feature only detects anomalies in the test data; it cannot detect anomalies in the training data. Furthermore,
  timestamps in the test data must all be greater than timestamps in the training data. Ensure that the training data
  covers a typical period free of actual outliers, or label known outliers in a Boolean column.
* You cannot clone models or share models across roles or accounts. When cloning a schema or database, model objects are skipped.
* You cannot [replicate](../account-replication-intro.md) an instance of the ANOMALY_DETECTION
  class.

## Preparing for Anomaly Detection

Before you can use anomaly detection, you must:

* Select a virtual warehouse
  in which to train and run your models.
* Grant the privileges to create anomaly detection objects.

You might also want to [modify your search path](../../sql-reference/snowflake-db-classes.md) to include
SNOWFLAKE.ML.

### Selecting a Virtual Warehouse

A Snowflake [virtual warehouse](../warehouses.md) provides the compute resources for training and using your
machine learning models for this feature. This section provides general guidance on selecting the best size and type of
warehouse for this purpose, focusing on the training step (the most time-consuming and memory-intensive part of
the process).

#### Training on Single-Series Data

For models trained on single-series data, you should choose the warehouse type based on the size of your training data.
Standard warehouses are subject to a lower [Snowpark memory limit](../../developer-guide/udf/python/udf-python-troubleshooting.md),
and are more appropriate for training jobs with fewer rows or exogenous features.
If your training data does not contain any exogenous features, you can train on a standard warehouse if the dataset has 5 million rows or less.
If your training data uses 5 or more exogenous features, then the maximum row count is lower.
Otherwise, Snowflake suggests using a [Snowpark-optimized warehouse](../warehouses-snowpark-optimized.md) for larger training jobs.

In general, for single-series data, a larger warehouse size does not result in a faster training time or higher memory limits.
As a rough rule of thumb, training time is proportional to the number of rows in your time series. For example, on a XS
standard warehouse, with evaluation turned off (`CONFIG_OBJECT => {'evaluate': False}`), training on a
100,000-row dataset takes about 60 seconds, while training on a 1,000,000-row dataset takes about 125 seconds. With
evaluation turned on, training time increases roughly linearly by the number of splits used.

For best performance, Snowflake recommends using a dedicated warehouse without other concurrent workloads to train your model.

#### Training on Multi-Series Data

As with single-series data, choose the warehouse type based on the number of rows in your largest time series. If your
largest time series contains more than 5 million rows, the training job is likely to exceed memory limits on a standard
warehouse.

Unlike single-series data, multi-series data trains considerably faster on larger warehouse sizes.
The following data points can guide you in your selection. Once again, all these times are done with evaluation turned
off.

| Warehouse type and size | Number of time series | Number of rows per time series | Training time (seconds) |
| --- | --- | --- | --- |
| Standard XS | 1 | 100,000 | 60 seconds |
| Standard XS | 10 | 100,000 | 204 seconds |
| Standard XS | 100 | 100,000 | 720 seconds |
| Standard XL | 10 | 100,000 | 104 seconds |
| Standard XL | 100 | 100,000 | 211 seconds |
| Standard XL | 1000 | 100,000 | 840 seconds |
| Snowpark-optimized XL | 10 | 100,000 | 65 seconds |
| Snowpark-optimized XL | 100 | 100,000 | 293 seconds |
| Snowpark-optimized XL | 1000 | 100,000 | 831 seconds |

#### Detecting Anomalies

The inference step takes approximately 1 second to process 100 rows in the input dataset, regardless of warehouse size.

### Granting Privileges to Create Anomaly Detection Objects

Training an anomaly detection model results in a schema-level object. Therefore, the role you use to create models must
have the CREATE SNOWFLAKE.ML.ANOMALY_DETECTION privilege on the schema where the model is created, which allows the
model to be stored there. This privilege is similar to other schema privileges like CREATE TABLE or CREATE VIEW.

Snowflake recommends that you create a role named `analyst` to be used by people who need to detect anomalies.

In the following example, the `admin` role is the owner of the schema `admin_db.admin_schema`. The
`analyst` role needs to create models in this schema.

```sqlexample
USE ROLE admin;
GRANT USAGE ON DATABASE admin_db TO ROLE analyst;
GRANT USAGE ON SCHEMA admin_schema TO ROLE analyst;
GRANT CREATE SNOWFLAKE.ML.ANOMALY_DETECTION ON SCHEMA admin_db.admin_schema TO ROLE analyst;
```

To use this schema, a user assumes the role `analyst`:

```sqlexample
USE ROLE analyst;
USE SCHEMA admin_db.admin_schema;
```

If the `analyst` role has CREATE SCHEMA privileges in database `analyst_db`, the role can create a new schema
`analyst_db.analyst_schema` and create anomaly detection models in that schema:

```sqlexample
USE ROLE analyst;
CREATE SCHEMA analyst_db.analyst_schema;
USE SCHEMA analyst_db.analyst_schema;
```

To revoke a role’s model creation privilege on the schema, use [REVOKE <privileges> … FROM ROLE](../../sql-reference/sql/revoke-privilege.md):

```sqlexample
REVOKE CREATE SNOWFLAKE.ML.ANOMALY_DETECTION ON SCHEMA admin_db.admin_schema FROM ROLE analyst;
```

## Setting Up the Data for the Examples

The examples in the following sections use a sample dataset that contains daily sales for items in different stores along with
daily weather data (humidity and temperature). The dataset also contains a column that indicates whether the day is a holiday.

1. Execute the following statements to create a table named `historical_sales_data` that contains the training data for the model:

> ```sqlexample
> CREATE OR REPLACE TABLE historical_sales_data (
>   store_id NUMBER, item VARCHAR, date TIMESTAMP_NTZ, sales FLOAT, label BOOLEAN,
>   temperature NUMBER, humidity FLOAT, holiday VARCHAR);
>
> INSERT INTO historical_sales_data VALUES
>   (1, 'jacket', to_timestamp_ntz('2020-01-01'), 2.0, false, 50, 0.3, 'new year'),
>   (1, 'jacket', to_timestamp_ntz('2020-01-02'), 3.0, false, 52, 0.3, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-03'), 5.0, false, 54, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-04'), 30.0, true, 54, 0.3, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-05'), 8.0, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-06'), 6.0, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-07'), 4.6, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-08'), 2.7, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-09'), 8.6, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-10'), 9.2, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-11'), 4.6, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-12'), 7.0, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-13'), 3.6, false, 55, 0.2, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-14'), 8.0, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-01'), 3.4, false, 50, 0.3, 'new year'),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-02'), 5.0, false, 52, 0.3, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-03'), 4.0, false, 54, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-04'), 5.4, false, 54, 0.3, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-05'), 3.7, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-06'), 3.2, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-07'), 3.2, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-08'), 5.6, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-09'), 7.3, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-10'), 8.2, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-11'), 3.7, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-12'), 5.7, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-13'), 6.3, false, 55, 0.2, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-14'), 2.9, false, 55, 0.2, null);
> ```

1. Execute the following statements to create a table named `new_sales_data` that contains the data to analyze:

> ```sqlexample
> CREATE OR REPLACE TABLE new_sales_data (
>   store_id NUMBER, item VARCHAR, date TIMESTAMP_NTZ, sales FLOAT,
>   temperature NUMBER, humidity FLOAT, holiday VARCHAR);
>
> INSERT INTO new_sales_data VALUES
>   (1, 'jacket', to_timestamp_ntz('2020-01-16'), 6.0, 52, 0.3, null),
>   (1, 'jacket', to_timestamp_ntz('2020-01-17'), 20.0, 53, 0.3, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-16'), 3.0, 52, 0.3, null),
>   (2, 'umbrella', to_timestamp_ntz('2020-01-17'), 70.0, 53, 0.3, null);
> ```

## Training, Using, Viewing, Deleting, and Updating Models

Use [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) to create and train a model. The model is trained on the dataset you
provide.

```sqlexample
CREATE SNOWFLAKE.ML.ANOMALY_DETECTION mydetector(...);
```

See [ANOMALY_DETECTION (SNOWFLAKE.ML)](../../sql-reference/classes/anomaly_detection.md) for complete details about the SNOWFLAKE.ML.ANOMALY_DETECTION
constructor. For examples of creating a model, see Detecting Anomalies.

> **Note:**
>
> SNOWFLAKE.ML.ANOMALY_DETECTION runs using limited privileges, so by default it does not have access to your data. You must
> therefore pass tables and views as [references](../../developer-guide/stored-procedure/stored-procedures-calling-references.md), which pass along the
> caller’s privileges. You can also provide a [query reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md) instead of a
> reference to a table or a view.
>
> To create this reference, you can use the [TABLE keyword](../../sql-reference/snowflake-db-classes.md) with the table name, view name,
> or query, or you can call the [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) or
> [SYSTEM$QUERY_REFERENCE](../../sql-reference/functions/system_query_reference.md) function.

To detect anomalies, call the model’s [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method:

```sqlexample
CALL mydetector!DETECT_ANOMALIES(...);
```

To select columns from the tabular output of the method, you can
[call the method in the FROM clause](../../sql-reference/snowflake-db-classes.md):

```sqlexample
SELECT ts, forecast FROM TABLE(mydetector!DETECT_ANOMALIES(...));
```

To view a list of your models, use the [SHOW SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/show-anomaly-detection.md) command:

```sqlexample
SHOW SNOWFLAKE.ML.ANOMALY_DETECTION;
```

To remove a model, use the [DROP SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/drop-anomaly-detection.md) command:

```sqlexample
DROP SNOWFLAKE.ML.ANOMALY_DETECTION <name>;
```

To update a model, delete it and train a new one. Models are immutable and cannot be updated in place.

## Detecting Anomalies

The following sections demonstrate how to use anomaly detection to detect outliers. These sections provide examples of
detecting anomalies for a single time series, for multiple time series, with and without exogenous variables, with a
user-defined prediction interval, and with a supervised (labeled) approach.

* Detecting Anomalies for a Single Time Series (Unsupervised)
* Training an Anomaly Detection Model with Labeled Data
* Specifying the Prediction Interval For Anomaly Detection
* Including Additional Columns for Analysis
* Detecting Anomalies in Multiple Series

### Detecting Anomalies for a Single Time Series (Unsupervised)

To detect anomalies in your data:

1. Train an anomaly detection model using historical data.
2. Use the trained anomaly detection model to detect anomalies in historical or projected data. The timestamps in the test data
   must chronologically follow the timestamps in the training data. You need at least 2 data points to train a model, at least
   12 for non-naive results, and at least 60 for non-linear results.

See [ANOMALY_DETECTION (SNOWFLAKE.ML)](../../sql-reference/classes/anomaly_detection.md) for information on the parameters used in creating and using a model.

#### Training an Anomaly Detection Model

To create an anomaly detection model object, execute the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command.

For example, suppose that you want to analyze the sales for jackets in the store with the `store_id` of 1:

1. Create a view or design a query that returns the data for training the model for anomaly detection.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named `view_with_training_data`
   that contains the date and sales information:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_training_data
     AS SELECT date, sales FROM historical_sales_data
       WHERE store_id=1 AND item='jacket';
   ```
2. Create an anomaly detection object, and train its model on the data in that view.

   For this example, execute the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command to create an anomaly detection object named
   `basic_model`. Pass in the following arguments:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION basic_model(
     INPUT_DATA => TABLE(view_with_training_data),
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     LABEL_COLNAME => '');
   ```

   This example passes in a reference to a view as the INPUT_DATA argument. The example
   [uses the TABLE keyword to create the reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md). As an alternative, you can call
   [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) to create the reference.

   The purpose of the label column is to tell the model which rows are known anomalies. Because this example uses
   unsupervised training, you do not need to use the label column. Pass an empty string as the name of the label column.

   > **Tip:**
   >
   > If you don’t want to create a view for the INPUT_DATA argument, you can pass in a
   > [reference to a query](../../developer-guide/stored-procedure/stored-procedures-calling-references.md) that uses a SELECT statement that serves as an inline
   > view.
   >
   > You can use the TABLE keyword to create this query reference. For example:
   >
   > ```sqlexample
   > CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION basic_model(
   >   INPUT_DATA =>
   >     TABLE(SELECT date, sales FROM historical_sales_data WHERE store_id=1 AND item='jacket'),
   >   TIMESTAMP_COLNAME => 'date',
   >   TARGET_COLNAME => 'sales',
   >   LABEL_COLNAME => '');
   > ```
   >
   > Escape any single quotes and other special characters with a backslash.
   >
   > As an alternative to using the TABLE keyword, you can call [SYSTEM$QUERY_REFERENCE](../../sql-reference/functions/system_query_reference.md) to create
   > the query reference.

> If the command is executed successfully, a message indicates that your anomaly detection instance was created
> successfully:
>
> ```output
> +--------------------------------------------+
> |                 status                     |
> +--------------------------------------------+
> | Instance basic_model successfully created. |
> +--------------------------------------------+
> ```

#### Using an Anomaly Detection Model to Detect Anomalies

Creating the anomaly detection object trains the model and stores it in the schema. To use the anomaly detection object
to detect anomalies, call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method of the object. For example:

1. Create a view or design a query that returns the data for analysis.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_data_to_analyze` that contains the date and sales information:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_data_to_analyze
     AS SELECT date, sales FROM new_sales_data
       WHERE store_id=1 and item='jacket';
   ```
2. Using the object for the anomaly detection model (in this example, `basic_model`, which
   you created earlier),
   call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method:

   ```sqlexample
   CALL basic_model!DETECT_ANOMALIES(
     INPUT_DATA => TABLE(view_with_data_to_analyze),
     TIMESTAMP_COLNAME =>'date',
     TARGET_COLNAME => 'sales'
   );
   ```

   The method returns a table that includes rows for the data currently in the view `view_with_data_to_analyze` along with the
   prediction of the detector. For a description of the columns in this table, see [Returns](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md).

**Output**

The results have been rounded for readability.

```output
+--------+-------------------------+----+----------+--------------+--------------+------------+--------------+--------------+
| SERIES | TS                      |  Y | FORECAST |  LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |   PERCENTILE |     DISTANCE |
+--------|-------------------------+----+----------+--------------+--------------+------------+--------------+--------------|
| NULL   | 2020-01-16 00:00:00.000 |  6 |      4.6 | -7.185885251 | 16.385885251 | False      | 0.6201873452 | 0.3059728606 |
| NULL   | 2020-01-17 00:00:00.000 | 20 |      9   | -2.785885251 | 20.785885251 | False      | 0.9918932208 | 2.404072476  |
+--------+-------------------------+----+----------+--------------+--------------+------------+--------------+--------------|
```

To save your results directly to a table, use [CREATE TABLE … AS SELECT …](../../sql-reference/sql/create-table.md) and
[call the DETECT_ANOMALIES method in the FROM clause](../../sql-reference/snowflake-db-classes.md):

```sqlexample
CREATE TABLE my_anomalies AS
  SELECT * FROM TABLE(basic_model!DETECT_ANOMALIES(
    INPUT_DATA => TABLE(view_with_data_to_analyze),
    TIMESTAMP_COLNAME =>'date',
    TARGET_COLNAME => 'sales'
  ));
```

As shown in the example above, when calling the method, omit the [CALL](../../sql-reference/sql/call.md) command. Instead, put the call
in parentheses, preceded by the TABLE keyword.

### Training an Anomaly Detection Model with Labeled Data

In the previous example, the result of the model appears to be inaccurate. This is probably because:

* The anomaly detection model was trained on very little input data.
* A larger number of jackets (30) were sold on 2020-01-03. This skewed the predictions upward and increased the size of
  the prediction interval.

To improve the accuracy of the anomaly detection model, you can either include more training data or label the training data
(supervised training). Labeled training data has an additional Boolean column that indicates whether each row is a known
anomaly. Labeling can help the anomaly detection model to avoid overfitting to known anomalies in the training data.

To include labeled data in the training data, specify the column containing the label in the LABEL_COLNAME constructor argument
of the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command. For example:

1. Create a view or design a query that returns the labels with the training data.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_labeled_data` that contains the labels in a column named `label`:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_labeled_data_for_training
     AS SELECT date, sales, label FROM historical_sales_data
       WHERE store_id=1 and item='jacket';
   ```
2. Create an object for the anomaly detection model, and train the model on the data in that view.

   For this example, execute the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command to create an anomaly detection object named
   `model_trained_with_labeled_data`. The following statement creates the anomaly detection object:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION model_trained_with_labeled_data(
     INPUT_DATA => TABLE(view_with_labeled_data_for_training),
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     LABEL_COLNAME => 'label'
   );
   ```
3. Using this new anomaly detection model, call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method,
   passing in the same arguments that you used in Detecting Anomalies for a Single Time Series (Unsupervised):

   ```sqlexample
   CALL model_trained_with_labeled_data!DETECT_ANOMALIES(
     INPUT_DATA => TABLE(view_with_data_to_analyze),
     TIMESTAMP_COLNAME =>'date',
     TARGET_COLNAME => 'sales'
   );
   ```

   The method returns a table that includes rows for the data currently in the view `view_with_data_to_analyze` along with the
   prediction of the detector. For a description of the columns in this table, see [Returns](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md).

**Output**

The results have been rounded for readability.

> ```output
> +--------+-------------------------+----+----------+---------------+--------------+------------+--------------+------------+
> | SERIES | TS                      |  Y | FORECAST |   LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |   PERCENTILE |   DISTANCE |
> +--------|-------------------------+----+----------+---------------+--------------+------------+--------------+------------|
> | NULL   | 2020-01-16 00:00:00.000 |  6 |        6 |  0.82         | 11.18        | False      | 0.5          | 0          |
> | NULL   | 2020-01-17 00:00:00.000 | 20 |        6 | -0.39         | 12.33        | True       | 0.99         | 5.70       |
> +--------+-------------------------+----+----------+---------------+--------------+------------+--------------+------------+
> ```

### Specifying the Prediction Interval For Anomaly Detection

You can detect anomalies with varying levels of sensitivity. To specify the percentage of observations to classify as
anomalies, create an [OBJECT](../../sql-reference/data-types-semistructured.md) that contains configuration settings for [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md), and set the
`prediction_interval` key to the percentage of the observations that should be marked as anomalies.

To construct this object, you can use either an [object constant](../../sql-reference/data-types-semistructured.md) or the
[OBJECT_CONSTRUCT](../../sql-reference/functions/object_construct.md) function.

Then, when calling the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, pass in this object as the CONFIG_OBJECT argument.

By default, the value associated with the prediction_interval key is set to 0.99, which means that roughly 1% of the data is
marked as anomalies. You can specify a value between 0 and 1:

* To mark fewer observations as anomalies, specify a higher value for `prediction_interval`.
* To mark more observations as anomalies, reduce the `prediction_interval` value.

The following example configures anomaly detection to be more strict by setting the `prediction_interval` to 0.995. The example also
uses the model trained on labeled data (that you set up in Training an Anomaly Detection Model with Labeled Data) with the view
that contains the data to analyze (that you set up in Detecting Anomalies for a Single Time Series (Unsupervised)).

```sqlexample
CALL model_trained_with_labeled_data!DETECT_ANOMALIES(
  INPUT_DATA => TABLE(view_with_data_to_analyze),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  CONFIG_OBJECT => {'prediction_interval':0.995}
);
```

This statement produces a table that includes rows for the data currently in the view `view_with_data_to_analyze`. Each row
includes a column with the prediction of the detector. You can see that the result
of this model is more accurate than the unlabeled example.

**Output**

The results have been rounded for readability.

```output
+--------+-------------------------+----+----------+---------------+--------------+------------+--------------+------------+
| SERIES | TS                      |  Y | FORECAST |   LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |   PERCENTILE |   DISTANCE |
+--------|-------------------------+----+----------+---------------+--------------+------------+--------------+------------|
| NULL   | 2020-01-16 00:00:00.000 |  6 |        6 |  0.36         | 11.64        | False      | 0.5          | 0          |
| NULL   | 2020-01-17 00:00:00.000 | 20 |        6 | -0.90         | 12.90        | True       | 0.99         | 5.70       |
+--------+-------------------------+----+----------+---------------+--------------+------------+--------------+------------+
```

### Including Additional Columns for Analysis

You can include additional columns in the data (for example, `temperature`, `weather`, `is_black_friday`) in the data for training
and analysis, if these columns can help you improve the identification of true anomalies.

To include new columns for analysis:

1. For the training data, create a view or design a query that includes the new columns, and create a new anomaly detection object,
   passing in a reference to that view or query.
2. For the data to analyze, create a view or design a query that includes the new columns, and pass a reference to that
   view or query to the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method.

The anomaly detection model detects and uses the additional columns automatically.

> **Note:**
>
> You must provide a view or query with the same set of additional columns when executing the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md)
> command and when calling the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method. If there is a mismatch between the columns in the training data
> passed to the command and the columns in the data for analysis passed to the function, an error occurs.

For example, suppose that you want to add the columns `temperature`, `humidity`, and `holiday`:

1. Create a view or design a query that returns the training data with these additional columns.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_training_data_extra_columns`:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_training_data_extra_columns
     AS SELECT date, sales, label, temperature, humidity, holiday
       FROM historical_sales_data
       WHERE store_id=1 AND item='jacket';
   ```
2. Create an object for the anomaly detection model, and train the model on the data in that view.

   For this example, execute the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command to create an anomaly detection object named
   `model_with_additional_columns`, passing in a reference to the new view:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION model_with_additional_columns(
     INPUT_DATA => TABLE(view_with_training_data_extra_columns),
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     LABEL_COLNAME => 'label'
   );
   ```
3. Create a view or design a query that returns the data to analyze with these additional columns.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_data_for_analysis_extra_columns`:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_data_for_analysis_extra_columns
     AS SELECT date, sales, temperature, humidity, holiday
       FROM new_sales_data
       WHERE store_id=1 AND item='jacket';
   ```
4. Using this new anomaly detection object, call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, passing in the new view:

   ```sqlexample
   CALL model_with_additional_columns!DETECT_ANOMALIES(
     INPUT_DATA => TABLE(view_with_data_for_analysis_extra_columns),
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     CONFIG_OBJECT => {'prediction_interval':0.93}
   );
   ```

   This statement produces a table that includes rows for the data currently in the view
   `view_with_data_for_analysis_extra_columns` along with the prediction of the detector. The format of the output
   is the same as the format of the output shown for the commands that you ran earlier.

**Output**

The results have been rounded for readability.

> ```output
> +--------+-------------------------+----+----------+-------------+--------------+------------+--------------+------------+
> | SERIES | TS                      |  Y | FORECAST | LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |   PERCENTILE |   DISTANCE |
> +--------|-------------------------+----+----------+-------------+--------------+------------+--------------+------------|
> | NULL   | 2020-01-16 00:00:00.000 |  6 |        6 | 2.34        |  9.64        | False      | 0.5          | 0          |
> | NULL   | 2020-01-17 00:00:00.000 | 20 |        6 | 1.56        | 10.451       | True       | 0.99         | 5.70       |
> +--------+-------------------------+----+----------+-------------+--------------+------------+--------------+------------+
> ```

### Detecting Anomalies in Multiple Series

The previous sections provided examples of detecting anomalies for a single series. These examples flagged anomalies for
the sale of one type of item (jackets) in one store (store ID 1). To detect anomalies for multiple time series at the
same time (for example, for multiple combinations of items and stores):

1. For the training data, create a view or design a query that includes a column that identifies the series, and create
   a new anomaly detection object, passing in a reference to that view or query and specifying the name of the series
   column for the SERIES_COLNAME argument.
2. For the data to analyze, create a view or design a query that includes the column that identifies the series. Call
   the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, passing in a reference to that view or query and
   specifying the name of the series column for the SERIES_COLNAME argument.

For example, suppose that you want to use the combination of the `store_id` and `item` columns to identify the series:

1. Create a view or design a query that returns the training data with the column for the series.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_training_data_multiple_series` that contains a column named `store_item` that identifies the series as
   a combination of store ID and item:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_training_data_multiple_series
     AS SELECT
       [store_id, item] AS store_item,
       date,
       sales,
       label,
       temperature,
       humidity,
       holiday
     FROM historical_sales_data;
   ```
2. Create an object for the anomaly detection, and train the model on the data in that view.

   For this example, execute the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../../sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md) command to create an anomaly detection object named
   `model_for_multiple_series`, passing in a reference to the new view and specifying `store_item` for the SERIES_COLNAME
   argument:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION model_for_multiple_series(
     INPUT_DATA => TABLE(view_with_training_data_multiple_series),
     SERIES_COLNAME => 'store_item',
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     LABEL_COLNAME => 'label'
   );
   ```
3. Create a view or design a query that returns the data to analyze with the series column.

   For this example, execute the [CREATE VIEW](../../sql-reference/sql/create-view.md) command to create a view named
   `view_with_data_for_analysis_multiple_series` that contains a column named `store_item` for the series:

   ```sqlexample
   CREATE OR REPLACE VIEW view_with_data_for_analysis_multiple_series
     AS SELECT
       [store_id, item] AS store_item,
       date,
       sales,
       temperature,
       humidity,
       holiday
     FROM new_sales_data;
   ```
4. Using this new anomaly detection object, call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, passing in the new view and specifying
   `store_item` for the SERIES_COLNAME argument:

   ```sqlexample
   CALL model_for_multiple_series!DETECT_ANOMALIES(
     INPUT_DATA => TABLE(view_with_data_for_analysis_multiple_series),
     SERIES_COLNAME => 'store_item',
     TIMESTAMP_COLNAME => 'date',
     TARGET_COLNAME => 'sales',
     CONFIG_OBJECT => {'prediction_interval':0.995}
   );
   ```

   This statement produces a table that includes rows for the data currently in the view
   `view_with_data_for_analysis_multiple_series` along with the prediction of the detector. The output includes the column that
   identifies the series.

**Output**

The results have been rounded for readability.

> ```output
> +--------------+-------------------------+----+----------+---------------+--------------+------------+---------------+--------------+
> | SERIES       | TS                      |  Y | FORECAST |   LOWER_BOUND |  UPPER_BOUND | IS_ANOMALY |    PERCENTILE |     DISTANCE |
> |--------------+-------------------------+----+----------+---------------+--------------+------------+---------------+--------------|
> | [            | 2020-01-16 00:00:00.000 |  3 |      6.3 |  2.07         | 10.53        | False      | 0.01          | -2.19         |
> |   2,         |                         |    |          |               |              |            |               |              |
> |   "umbrella" |                         |    |          |               |              |            |               |              |
> | ]            |                         |    |          |               |              |            |               |              |
> | [            | 2020-01-17 00:00:00.000 | 70 |      2.9 | -1.33         |  7.13        | True       | 1             | 44.54         |
> |   2,         |                         |    |          |               |              |            |               |              |
> |   "umbrella" |                         |    |          |               |              |            |               |              |
> | ]            |                         |    |          |               |              |            |               |              |
> | [            | 2020-01-16 00:00:00.000 |  6 |      6   |  0.36         | 11.64        | False      | 0.5           |  0           |
> |   1,         |                         |    |          |               |              |            |               |              |
> |   "jacket"   |                         |    |          |               |              |            |               |              |
> | ]            |                         |    |          |               |              |            |               |              |
> | [            | 2020-01-17 00:00:00.000 | 20 |      6   | -0.90         | 12.90        | True       | 0.99          |  5.70         |
> |   1,         |                         |    |          |               |              |            |               |              |
> |   "jacket"   |                         |    |          |               |              |            |               |              |
> | ]            |                         |    |          |               |              |            |               |              |
> +--------------+-------------------------+----+----------+---------------+--------------+------------+---------------+--------------+
> ```

## Visualizing Anomalies and Interpreting the Results

Use [Snowsight](../ui-snowsight-gs.md) to review and visualize the results of anomaly detection. In Snowsight, when
you call the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, the results are displayed in a table under the worksheet.

To visualize the results, you can use the chart feature in Snowsight.

1. After calling the [<model_name>!DETECT_ANOMALIES](../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md) method, select Charts above the results table.
2. In the Data section on the right side of the chart:

   1. Select the Y column, and under Aggregation, select None.
   2. Select the TS column, and under Bucketing, select None.
3. Add the LOWER_BOUND and UPPER_BOUND columns, and under Aggregation, select None.
4. To display the initial visualization, select Chart.
5. Select Add Column on the right side of the page, and select the columns you want to visualize:

   * LOWER_BOUND
   * UPPER_BOUND
   * IS_ANOMALY

   Results:
6. Hover over the high spike to see that Y lies outside of the upper bound and is tagged with a 1 in the IS_ANOMALY field.

> **Tip:**
>
> To better understand your results, try [Top Insights](top-insights.md).

## Automate Anomaly Detection with Snowflake Tasks and Alerts

You can create an automated anomaly detection pipeline, both for retraining the model and for monitoring your data for anomalies, by using Anomaly Detection functions within Snowflake Tasks or Alerts.

* Recurring Training with a Snowflake Task
* Monitoring with a Snowflake Task
* Monitoring with a Snowflake Alert

### Recurring Training with a Snowflake Task

You can update your model to reflect the most up-to-date data using [Snowflake Tasks](../tasks-intro.md).

To create a task that refreshes the anomaly detection object every hour, run following statement, replacing `your_warehouse_name` with your warehouse name:

```sqlexample
CREATE OR REPLACE TASK ad_model_retrain_task
WAREHOUSE = <your_warehouse_name>
SCHEDULE = '60 MINUTE'
AS
EXECUTE IMMEDIATE
$$
BEGIN
  CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION model_trained_with_labeled_data(
    INPUT_DATA => TABLE(view_with_labeled_data_for_training),
    TIMESTAMP_COLNAME => 'date',
    TARGET_COLNAME => 'sales',
    LABEL_COLNAME => 'label'
  );
END;
$$;
```

By default, newly created tasks are suspended.

To resume the task, execute the [ALTER TASK … RESUME](../../sql-reference/sql/alter-task.md) command:

```sqlexample
ALTER TASK ad_model_retrain_task RESUME;
```

To pause the task, execute the [ALTER TASK … SUSPEND](../../sql-reference/sql/alter-task.md) command:

```sqlexample
ALTER TASK ad_model_retrain_task SUSPEND;
```

### Monitoring with a Snowflake Task

You can also use Snowflake Tasks to monitor your data at a given frequency.

First, create a table to hold the results
of anomaly detection:

```sqlexample
CREATE OR REPLACE TABLE anomaly_res_table (
  ts TIMESTAMP_NTZ, y FLOAT, forecast FLOAT, lower_bound FLOAT, upper_bound FLOAT,
  is_anomaly BOOLEAN, percentile FLOAT, distance FLOAT);
```

Create a task to store the results of a recurring anomaly detection operation in the table.
This example sets the `WAREHOUSE` parameter to `snowhouse`. You can replace that with
your own warehouse:

```sqlexample
CREATE OR REPLACE TASK ad_model_monitoring_task
WAREHOUSE = snowhouse
SCHEDULE = '1 minute'
AS
EXECUTE IMMEDIATE
$$
BEGIN
  INSERT INTO anomaly_res_table (ts, y, forecast, lower_bound, upper_bound, is_anomaly, percentile, distance)
    SELECT * FROM TABLE(
      model_trained_with_labeled_data!DETECT_ANOMALIES(
        INPUT_DATA => TABLE(view_with_data_to_analyze),
        TIMESTAMP_COLNAME => 'date',
        TARGET_COLNAME => 'sales',
        CONFIG_OBJECT => {'prediction_interval':0.99}
    )
  );
END;
$$;
```

To resume the task, execute the [ALTER TASK … RESUME](../../sql-reference/sql/alter-task.md) command:

```sqlexample
ALTER TASK ad_model_monitoring_task RESUME;
```

`anomaly_res_table` then contains all the results for each task run.

To pause the task, execute the [ALTER TASK … SUSPEND](../../sql-reference/sql/alter-task.md) command:

```sqlexample
ALTER TASK ad_model_monitoring_task SUSPEND;
```

### Monitoring with a Snowflake Alert

You can also use [Snowflake Alerts](../alerts.md) to monitor your data at a given frequency and send you
email with detected anomalies. The following statements create an alert that detects anomalies every minute. First you
define a [stored procedure](../../developer-guide/stored-procedure/stored-procedures-overview.md) to detect anomalies, then create an alert
that uses that stored procedure.

> **Note:**
>
> You must set up email integration to send mail from a stored procedure; see [Notifications in Snowflake](../notifications/about-notifications.md).

```sqlexample
CREATE OR REPLACE PROCEDURE extract_anomalies()
  RETURNS TABLE()
  LANGUAGE SQL
  AS
  $$
    BEGIN
      let res RESULTSET := (SELECT * FROM TABLE(
        model_trained_with_labeled_data!DETECT_ANOMALIES(
          INPUT_DATA => TABLE(view_with_data_to_analyze),
          TIMESTAMP_COLNAME => 'date',
          TARGET_COLNAME => 'sales',
          CONFIG_OBJECT => {'prediction_interval':0.99}
        ))
        WHERE is_anomaly = TRUE
      );
      RETURN TABLE(res);
    END;
  $$
  ;

CREATE OR REPLACE ALERT sample_sales_alert
WAREHOUSE = <your_warehouse_name>
SCHEDULE = '1 MINUTE'
IF (EXISTS (CALL extract_anomalies()))
THEN
CALL SYSTEM$SEND_EMAIL(
  'sales_email_alert',
  'your_email@snowflake.com',
  'Anomalous Sales Data Detected in data stream',
  CONCAT(
    'Anomalous Sales Data Detected in data stream \n',
    'Value outside of prediction interval detected in the most recent run at ',
    current_timestamp(1)
  ));
```

To start or resume the alert, execute the [ALTER ALERT … RESUME](../../sql-reference/sql/alter-alert.md) command:

```sqlexample
ALTER ALERT sample_sales_alert RESUME;
```

To pause the alert, execute the [ALTER ALERT … SUSPEND](../../sql-reference/sql/alter-alert.md) command:

```sqlexample
ALTER ALERT sample_sales_alert SUSPEND;
```

## Understanding Feature Importance

An anomaly detection model can explain the relative importance of all features used in your model, including any exogenous
variables that you choose, automatically generated time features (such as day of week or week of year), and
transformations of your target variable (such as rolling averages and auto-regressive lags). This information is useful
in understanding what factors are really influencing your data.

The [<model_name>!EXPLAIN_FEATURE_IMPORTANCE](../../sql-reference/classes/anomaly-detection/methods/explain_feature_importance.md) method counts the number of times the
model’s trees used each feature to make a decision. These feature importance scores are then normalized to values
between 0 and 1 so that their sum is 1. The resulting scores represent an approximate ranking of the features in your
trained model.

Features that are close in score have similar importance. For extremely simple series (for example, when the target
column has a constant value), all feature importance scores may be zero.

Using multiple features that are very similar to each other may result in reduced importance scores for those features.
For example, if one feature is *quantity of items sold* and another is *quantity of items in inventory*, the values may be
correlated because you can’t sell more than you have and because you try to manage inventory so you won’t have more in
stock than you will sell. If two features are identical, the model may treat them as interchangeable when making
decisions, resulting in feature importance scores that are half of what those scores would be if only one of the
features were included.

Feature importance also reports *lag features.* During training, the model infers the frequency (hourly, daily, or weekly)
of your training data. The feature `lagx` (e.g. `lag24`) is the value of the target variable *x* time units ago.
For example, if your data is inferred to be hourly, `lag24` represents your target variable 24 hours ago.

All other transformations of your target variable (rolling averages, etc.) are summarized as
`aggregated_endogenous_features` in the results table.

### Limitations

* You cannot choose the technique used to calculate feature importance.
* Feature importance scores can be helpful for gaining intuition about which features are important to your model’s
  accuracy, but the actual values should be considered estimates.

### Example

To understand the relative importance of your features to your model, train a model, and then call
[<model_name>!EXPLAIN_FEATURE_IMPORTANCE](../../sql-reference/classes/anomaly-detection/methods/explain_feature_importance.md). In this example, you first create random data with
two exogenous variables: one that is random and therefore unlikely to be very important to your model, and one that
is a copy of your target and therefore likely to be more important to your model.

Execute the following statements to generate the data, train a model on it, and get the importance of the features:

```sqlexample
CREATE OR REPLACE VIEW v_random_data AS SELECT
  DATEADD('minute', ROW_NUMBER() over (ORDER BY 1), '2023-12-01')::TIMESTAMP_NTZ ts,
  MOD(SEQ1(),10) y,
  UNIFORM(1, 100, RANDOM(0)) exog_a
FROM TABLE(GENERATOR(ROWCOUNT => 500));

CREATE OR REPLACE VIEW v_feature_importance_demo AS SELECT
  ts,
  y,
  exog_a
FROM v_random_data;

SELECT * FROM v_feature_importance_demo;

CREATE OR REPLACE SNOWFLAKE.ML.ANOMALY_DETECTION anomaly_model_feature_importance_demo(
  INPUT_DATA => TABLE(v_feature_importance_demo),
  TIMESTAMP_COLNAME => 'ts',
  TARGET_COLNAME => 'y',
  LABEL_COLNAME => ''
);

CALL anomaly_model_feature_importance_demo!EXPLAIN_FEATURE_IMPORTANCE();
```

**Output**

Because this example uses random data, do not expect your output to match this exactly.

```output
+--------+------+--------------------------------------+-------+-------------------------+
| SERIES | RANK | FEATURE_NAME                         | SCORE | FEATURE_TYPE            |
+--------+------+--------------------------------------+-------+-------------------------+
| NULL   |    1 | aggregated_endogenous_trend_features |  0.36 | derived_from_endogenous |
| NULL   |    2 | exog_a                               |  0.22 | user_provided           |
| NULL   |    3 | epoch_time                           |  0.15 | derived_from_timestamp  |
| NULL   |    4 | minute                               |  0.13 | derived_from_timestamp  |
| NULL   |    5 | lag60                                |  0.07 | derived_from_endogenous |
| NULL   |    6 | lag120                               |  0.06 | derived_from_endogenous |
| NULL   |    7 | hour                                 |  0.01 | derived_from_timestamp  |
+--------+------+--------------------------------------+-------+-------------------------+
```

## Inspecting Training Logs

When you train multiple series with `CONFIG_OBJECT => 'ON_ERROR': 'SKIP'`, individual time series models can
fail to train without the overall training process failing. To understand which time series failed and why, call
`<model_instance>!SHOW_TRAINING_LOGS`.

### Example

```sqlexample
CREATE TABLE t_error(date TIMESTAMP_NTZ, sales FLOAT, series VARCHAR);
INSERT INTO t_error VALUES
  (TO_TIMESTAMP_NTZ('2019-12-20'), 1.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-21'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-22'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-23'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-24'), 1.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-25'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-26'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-27'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-28'), 1.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-29'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-30'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-31'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-01'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-02'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-03'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-04'), 7.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 10.0, 'B'), -- the same timestamp used again and again
  (TO_TIMESTAMP_NTZ('2020-01-06'), 13.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 12.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 15.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 14.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 18.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 12.0, 'B');

CREATE SNOWFLAKE.ML.ANOMALY_DETECTION model(
  INPUT_DATA => TABLE(SELECT date, sales, series FROM t_error),
  SERIES_COLNAME => 'series',
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  LABEL_COLNAME => '',
  CONFIG_OBJECT => {'ON_ERROR': 'SKIP'}
);

CALL model!SHOW_TRAINING_LOGS();
```

**Output**

```output
+--------+--------------------------------------------------------------------------+
| SERIES | LOGS                                                                     |
+--------+--------------------------------------------------------------------------+
| "B"    | {   "Errors": [     "At least two unique timestamps are required."   ] } |
| "A"    | NULL                                                                     |
+--------+--------------------------------------------------------------------------+
```

## Cost Considerations

For details on costs for using ML functions, see [Cost Considerations](../../guides-overview-ml-functions.md) in the ML functions overview.

---
title: Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg.md
section: User Guide
---

# Apache Iceberg™ tables

Apache Iceberg™ tables for Snowflake combine the performance and query semantics of typical
Snowflake tables with external cloud storage that you manage. They are
ideal for existing data lakes that you cannot, or choose not to, store in Snowflake.

Iceberg tables use the [Apache Iceberg™](https://iceberg.apache.org/) open table
format specification, which provides an abstraction layer on data files stored in open formats and supports features such as:

* ACID (atomicity, consistency, isolation, durability) transactions
* Schema evolution
* Hidden partitioning
* Table snapshots

Snowflake supports Iceberg tables that use the [Apache Parquet™](https://parquet.apache.org/) file format.

## Getting started

To get started with Iceberg tables, see [Tutorial: Create your first Apache Iceberg™ table](tutorials/create-your-first-iceberg-table.md).

## How it works

This section provides information specific to working with Iceberg tables *in Snowflake*.
To learn more about the Iceberg table format specification,
see the official [Apache Iceberg documentation](https://iceberg.apache.org/docs/latest/) and the
[Iceberg Table Spec](https://iceberg.apache.org/spec/).

* Data storage
* Catalog
* Metadata and snapshots
* Cross-cloud/cross-region support
* Billing

### Data storage

Iceberg tables store their data and metadata files in an external cloud storage location
(Amazon S3, Google Cloud Storage, or Azure Storage). The external storage is not part of Snowflake. You are responsible
for all management of the external cloud storage location, including the configuration of data protection and recovery.
Snowflake does not provide [Fail-safe](data-failsafe.md) storage for Iceberg tables.

Snowflake connects to your storage location using an external volume, and
Iceberg tables incur no Snowflake storage costs. For more information, see Billing.

To learn more about storage for Iceberg tables, see [Storage for Apache Iceberg™ tables](tables-iceberg-storage.md).

#### External volume

An external volume is a named, account-level Snowflake object that you use to connect Snowflake to your
external cloud storage for Iceberg tables. An external volume stores an identity and access management (IAM) entity
for your storage location. Snowflake uses the IAM entity to securely connect to your storage for accessing
table data, Iceberg metadata, and manifest files that store the table schema, partitions, and other metadata.

A single external volume can support one or more Iceberg tables.

To set up an external volume for Iceberg tables, see [Configure an external volume](tables-iceberg-configure-external-volume.md).

### Catalog

An Iceberg catalog enables a compute engine to manage and load Iceberg tables.
The catalog forms the first architectural layer in the [Iceberg table specification](https://iceberg.apache.org/spec/#overview) and
must support:

* Storing the current metadata pointer for one or more Iceberg tables.
  A metadata pointer maps a table name to the location of that table’s current metadata file.
* Performing atomic operations so that you can update the current metadata pointer for a table.

To learn more about Iceberg catalogs, see the [Apache Iceberg documentation](https://iceberg.apache.org/terms/#catalog-implementations).

Snowflake supports different catalog options. For example, you can use Snowflake as the
Iceberg catalog, or use a catalog integration to connect Snowflake to
an external Iceberg catalog.

#### Catalog integration

A catalog integration is a named, account-level Snowflake object that stores information about how your table metadata is organized for the
following scenarios:

* When you don’t use Snowflake as the Iceberg catalog. For example, you need a
  catalog integration if your table is managed by AWS Glue.
* When you want to integrate with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) to:

  + Query an Iceberg table in Snowflake Open Catalog using Snowflake.
  + Sync a Snowflake-managed Iceberg table with Snowflake Open Catalog so that third-party compute engines can query the table.

A single catalog integration can support one or more Iceberg tables that use the same external catalog.

To set up a catalog integration, see [Configure a catalog integration](tables-iceberg-configure-catalog-integration.md).

### Metadata and snapshots

Iceberg uses a snapshot-based querying model, where data files are mapped using manifest and metadata files.
A snapshot represents the state of a table at a point in time and is used to access the complete set of data files in the table.

To learn about table metadata and Time Travel support, see [Metadata and retention for Apache Iceberg™ tables](tables-iceberg-metadata.md).

### Cross-cloud/cross-region support

Snowflake supports using an external volume storage location with a different cloud provider (in a different region)
from the one that hosts your Snowflake account.

| Table type | Cross-cloud/cross-region support | Notes |
| --- | --- | --- |
| Tables that use an external catalog with a catalog integration | ✔ | If your Snowflake account and external volume are in different regions, your external cloud storage account incurs egress costs when you query the table. |
| Tables that use Snowflake as the catalog | ✔ | If your Snowflake account and external volume are in different regions, your external cloud storage account incurs egress costs when you query the table.  These tables incur costs for cross-region data transfer usage. For more information, see Billing. |

### Billing

Snowflake bills your account for virtual warehouse (compute) usage and cloud services when you work with Iceberg tables.
Snowflake also bills your account if you use [automated refresh](tables-iceberg-auto-refresh.md) or an
[external query engine through Snowflake Horizon Catalog](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

If a Snowflake-managed Iceberg table is cross-cloud/cross-region, Snowflake bills your
cross-region data transfer usage under the TRANSFER_TYPE of DATA_LAKE. To learn more, see:

* [DATA_TRANSFER_HISTORY view](../sql-reference/organization-usage/data_transfer_history.md) in the ORGANIZATION_USAGE schema.
* [DATA_TRANSFER_HISTORY view](../sql-reference/account-usage/data_transfer_history.md) in the ACCOUNT_USAGE schema.

Snowflake does not bill your account for the following:

* Iceberg table storage costs when the table uses an external volume that you manage. Your cloud storage provider bills you
  directly for data storage usage. However, if the table uses
  [Snowflake Storage](tables-iceberg-internal-storage.md) (`EXTERNAL_VOLUME = SNOWFLAKE_MANAGED`),
  Snowflake charges for the storage. For more information, see
  [Snowflake storage for Apache Iceberg™ tables](tables-iceberg-internal-storage.md).
* Active bytes used by Iceberg tables. However,
  the [INFORMATION_SCHEMA.TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md) and
  [ACCOUNT_USAGE.TABLE_STORAGE_METRICS](../sql-reference/account-usage/table_storage_metrics.md) views display ACTIVE_BYTES for Iceberg tables
  to help you track how much storage a table occupies. To view an example, see [Retrieve storage metrics](tables-iceberg-manage.md).

> **Note:**
>
> If your Snowflake account and external volume are in different regions,
> your external cloud storage account incurs egress costs when you query the table.

## Catalog options

Snowflake supports the following Iceberg catalog options:

* Use Snowflake as the Iceberg catalog
* Use an external Iceberg catalog

The following table summarizes the differences between these catalog options.

|  | Use Snowflake as the catalog | Use an external catalog |
| --- | --- | --- |
| Read access | ✔ | ✔ |
| Write access | ✔ | ✔ |
| Catalog-vended credentials |  | ✔ |
| Write access across regions | ✔ | ✔ with [Write support for externally managed tables](tables-iceberg-externally-managed-writes.md) |
| Data and metadata storage | External volume (cloud storage) | External volume (cloud storage) |
| Snowflake platform support | ✔ |  |
| Integrates with Snowflake Open Catalog | ✔  You can sync a Snowflake-managed table with Open Catalog to query a table using other compute engines. | ✔  You can use Snowflake to query or write to Iceberg tables managed by Open Catalog. |
| Works with the [Snowflake Catalog SDK](tables-iceberg-catalog.md) | ✔ | ✔ |
| Replication for tables | ✔  See [Configure replication for Snowflake-managed Apache Iceberg™ tables](tables-iceberg-replication.md). |  |

### Use Snowflake as the catalog

An Iceberg table that uses Snowflake as the Iceberg catalog (Snowflake-managed Iceberg table) provides full Snowflake platform support with
read and write access. The table data and metadata are stored in external cloud storage, which Snowflake accesses using an
external volume. Snowflake
handles all life-cycle maintenance, such as compaction, for the table. However, you can [disable compaction for the table](tables-iceberg-manage.md)
, if needed.

### Use an external catalog

An Iceberg table that uses an external catalog provides limited Snowflake platform support.

With this table type, Snowflake uses a catalog integration
to retrieve information about your Iceberg metadata and schema.

You can use this option to create an Iceberg table for the following sources:

* [Remote Iceberg REST catalog](tables-iceberg-configure-catalog-integration-rest.md), including
  [AWS Glue](tables-iceberg-configure-catalog-integration-rest-glue.md) and [Snowflake Open Catalog](tables-iceberg-open-catalog.md).
  Snowflake supports writes to externally managed tables that use a remote Iceberg REST catalog.

  > **Tip:**
  >
  > To bring your external data from a remote Iceberg REST catalog into Snowflake, you can create a catalog-linked database.
  > The database automatically discovers
  > and stays in sync with the namespaces and tables in your remote catalog. You can use a catalog-linked database to read and
  > write to the tables in your remote catalog from Snowflake, while preserving full interoperability with your existing
  > Iceberg ecosystem. For more information, see the following topics:
  >
  > + [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md)
  > + If your external data is in Unity Catalog, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md)
  > + If your external data is in AWS Glue, see [Build Data Lakes using Apache Iceberg with Snowflake and AWS Glue](https://www.snowflake.com/en/developers/guides/data-lake-using-apache-iceberg-with-snowflake-and-aws-glue/)
* [Delta table files in object storage](tables-iceberg-configure-catalog-integration-object-storage.md)
* [Iceberg metadata files in object storage](tables-iceberg-configure-catalog-integration-object-storage.md)

Snowflake does not assume any life-cycle management on the table.

The table data and metadata are stored in external cloud storage, which Snowflake accesses using an
external volume.

> **Note:**
>
> If you want full Snowflake platform support for an Iceberg table that uses an external catalog, you can convert it to use Snowflake as
> the catalog. For more information, see [Convert an Apache Iceberg™ table to use Snowflake as the catalog](tables-iceberg-conversion.md).

The following diagram shows how an Iceberg table uses a catalog integration with an external
Iceberg catalog.

## Apache Iceberg™ V3 support (*Preview*)

[Preview Feature](../release-notes/preview-features.md) — Open

Available to all accounts.

Support for V3 of the Apache Iceberg™ table specification is now in public preview. For details, see
[Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](tables-iceberg-v3-specification-support.md).

## Considerations and limitations

The following considerations and limitations apply to Iceberg tables, and are subject to change:

**Clouds and regions**

> * Iceberg tables are available for all Snowflake accounts, on all cloud platforms and in all regions.
> * Cross-cloud/cross-region tables are supported. For more information, see Cross-cloud/cross-region support.

**Iceberg**

> * Versions 1 and 2 of the Apache Iceberg specification are supported, excluding the following [features](https://iceberg.apache.org/spec/):
>
>   + Row-level equality deletes. However, tables that use Snowflake as the catalog support Snowflake
>     [DELETE](../sql-reference/sql/delete.md) statements.
>   + Using the `history.expire.min-snapshots-to-keep`
>     [table property](https://iceberg.apache.org/docs/1.2.1/configuration/#table-behavior-properties)
>     to specify the default minimum number of snapshots to keep. For more information, see Metadata and snapshots.
> * Iceberg partitioning with the `bucket` transform function impacts performance for queries that use conditional clauses
>   to filter results.
> * For Iceberg tables that aren’t managed by Snowflake, be aware of the following:
>
>   + Time travel to any snapshot generated after table creation is supported
>     as long as you periodically refresh the table before the snapshot expires.
>   + Converting a table that has an un-materialized identity partition column isn’t supported.
>     An un-materialized identity partition column is created when a table defines an identity transform
>     using a source column that doesn’t exist in a Parquet file.
>   + For [row-level deletes](tables-iceberg-manage.md):
>
>     - Snowflake supports [position deletes](https://iceberg.apache.org/spec/#position-delete-files) only for v2 Iceberg tables, and
>       [deletion vectors](https://iceberg.apache.org/spec/#deletion-vectors) for v3 Iceberg tables.
>     - Snowflake only supports position deletes with externally managed Iceberg tables.
>     - For the best read performance when you use row-level deletes, perform regular compaction and table maintenance to remove old delete files. For
>       information, see [Maintain tables that use an external catalog](tables-iceberg-manage.md).
>     - Excessive position deletes, especially dangling position deletes, might prevent table creation and refresh operations.
>       To avoid this issue, perform table maintenance to remove extra position deletes.
>
>       The table maintenance method to use depends on your external Iceberg engine. For example, you can use the `rewrite_data_files` method
>       for Spark with the `delete-file-threshold` or `rewrite-all` options. For more information, see
>       [rewrite_data_files](https://iceberg.apache.org/docs/latest/spark-procedures/#rewrite_data_files) in the Apache Iceberg™ documentation.

**File formats**

> * Iceberg tables support Apache Parquet files.
> * Parquet files that use the unsigned integer logical type aren’t supported.
> * For Parquet files that use the `LIST` logical type, be aware of the following:
>
>   + The three-level annotation structure with the `element` keyword is supported. For more
>     information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#lists). If your
>     Parquet file uses an obsolete format with the `array` keyword, you must regenerate your data based on the supported format.

**External volumes**

> * You can’t access the cloud storage locations in external volumes using a storage integration.
> * You must configure a separate trust relationship for each external volume that you create.
> * You can use [outbound private connectivity](private-connectivity-outbound.md) to access Snowflake-managed Iceberg tables
>   and Iceberg tables that use a catalog integration for object storage, but cannot use it to access Iceberg tables that use other catalog
>   integrations.
> * After you create a Snowflake-managed table,
>   the path to its files in external storage does not change, even if you rename the table.
> * Snowflake can’t support external volumes with S3 bucket names that contain dots (for example, `my.s3.bucket`).
>   S3 doesn’t support SSL for virtual-hosted-style buckets with dots in the name, and
>   Snowflake uses virtual-host-style paths and HTTPS to access data in S3.

**Metadata files**

> * The metadata files don’t identify the most recent snapshot of an Iceberg table.
> * You can’t modify the location of the data files or snapshot using the ALTER ICEBERG TABLE command.
>   To modify either of these settings, you must recreate the table (using the CREATE OR REPLACE ICEBERG TABLE syntax).
> * For tables that use an external catalog:
>
>   > + Ensure that manifest files don’t contain duplicates.
>   >   If duplicate files are present in the *same* snapshot, Snowflake returns an error that includes the path of the duplicate file.
>   > + You can’t create a table if the Parquet metadata contains invalid UTF-8 characters. Ensure that your Parquet metadata is UTF-8 compliant.
> * Snowflake detects corruptions and inconsistencies in Parquet metadata produced outside of Snowflake,
>   and surfaces issues through error messages.
>
>   It’s possible to create, refresh, or query externally managed (or converted) tables, even if the table metadata is inconsistent.
>   When writing Iceberg data, ensure that the table’s metadata statistics (for example, `RowCount` or `NullCount`) match the data content.
> * For tables that use Snowflake as the catalog, Snowflake processes DDL statements individually and produces metadata in a way that might differ from other catalogs.
>   For more information, see [DDL statements](tables-iceberg-transactions.md).

**Clustering**

> [Clustering](tables-clustering-keys.md) support depends on the type of Iceberg table.
>
> | Table type | Notes |
> | --- | --- |
> | Tables that use Snowflake as the Iceberg catalog | Set a clustering key by using either the CREATE ICEBERG TABLE or the ALTER ICEBERG TABLE command. To set or manage a clustering key, see [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md) and [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md). |
> | Tables that use an external catalog | Clustering is not supported. |
> | Converted tables | Snowflake only clusters files if they were created after converting the table, or if the files have since been modified using a DML statement. |

**Delta**

> * Snowflake supports minReaderVersion 3 and can read all tables written by engines that use the latest version of Delta Lake,
>   which is 4.0.0. Delta Lake version 4.0.0 includes support for deletion vectors and liquid clustering.
> * Snowflake streams aren’t supported for Iceberg tables created from Delta table files with partition columns.
>   However, insert-only streams for tables created from Delta files *without* partition columns are supported.
> * Iceberg tables created from Delta files that were created before the [2024_04](../release-notes/bcr-bundles/2025_04_bundle.md) release bundle are not supported in dynamic tables.
> * Snowflake doesn’t support creating Iceberg tables from Delta table definitions in the AWS Glue Data Catalog.
>
> * Parquet files (data files for Delta tables) that use any of the following features or data types aren’t supported:
>
>   + Field IDs.
>   + The INTERVAL data type.
>   + The DECIMAL data type with precision higher than 38.
>   + LIST or MAP types with one-level or two-level representation.
>   + Unsigned integer types (INT(signed = false)).
>   + The FLOAT16 data type.
> * You can use the Parquet physical type `int96` for TIMESTAMP, but Snowflake doesn’t support `int96` for TIMESTAMP_NTZ.
>
> * For more information about Delta data types and Iceberg tables, see [Delta data types](tables-iceberg-data-types.md).
> * Snowflake processes a maximum of 1000 Delta commit files each time you refresh a table using CREATE/ALTER … REFRESH.
>   If your table has over 1000 commit files, you can do additional manual refreshes.
>   Each time, the refresh process continues from where the last one stopped.
>
>   > **Note:**
>   >
>   > Snowflake uses Delta checkpoint files when creating an Iceberg table.
>   > The 1,000 commit file limit only applies to commits after the latest checkpoint.
>   >
>   > When you refresh an existing table, Snowflake processes Delta commit files, but not checkpoint files. If table maintenance removes stale log and data files for the source
>   > Delta table, you should refresh Delta-based
>   > Iceberg tables in Snowflake more frequently than the retention period of Delta logs and data files.
> * The following Delta Lake features aren’t currently supported: Row Tracking, change data files, change metadata,
>   DataChange, CDC, protocol evolution.

**Automated refresh**

> * For catalog integrations created before Snowflake version 8.22 (or 9.2 for Delta-based tables), you must manually set the `REFRESH_INTERVAL_SECONDS` parameter
>   before you enable automated refresh on tables that depend on that catalog integration.
>   For instructions, see [ALTER CATALOG INTEGRATION … SET AUTO_REFRESH](../sql-reference/sql/alter-catalog-integration.md).
> * For [catalog integrations for object storage](tables-iceberg-configure-catalog-integration-object-storage.md), automated refresh is only supported
>   for integrations with `TABLE_FORMAT = DELTA`.
> * For tables with frequent updates, using a shorter polling interval (`REFRESH_INTERVAL_SECONDS`) can cause performance degradation.
> * Automated refresh synchronizes schema changes alongside [DML](../sql-reference/sql-dml.md) operations such as INSERT, UPDATE,
>   or DELETE. To apply schema changes made through DDL operations alone, perform a [manual refresh](tables-iceberg-manage.md).

**Catalog-linked databases and automatic table discovery**

> * Supported only when you use a catalog integration for Iceberg REST (for example, Snowflake Open Catalog).
> * To limit automatic table discovery to a specific set of namespaces, use the ALLOWED_NAMESPACES parameter. You can also use the
>   BLOCKED_NAMESPACES parameter to block a set of namespaces.
> * Snowflake doesn’t sync remote catalog access control for users or roles.
> * You can create schemas, externally managed Iceberg tables, or database roles in a catalog-linked database. Creating other Snowflake objects
>   isn’t currently supported.
> * When you create a catalog-linked database, you can’t specify the default Iceberg version or merge-on-read behavior to use for
>   Iceberg tables.
>
>   However, you can modify these properties for an existing database by using the [ALTER DATABASE (catalog-linked)](../sql-reference/sql/alter-database-catalog-linked.md)
>   command to set the following parameters:
>
>   + ICEBERG_VERSION_DEFAULT
>   + ENABLE_ICEBERG_MERGE_ON_READ
> * For Iceberg tables in a catalog-linked database:
>
>   + Snowflake doesn’t copy remote catalog table properties, such as retention policies or buffers, and doesn’t currently support altering table properties.
>   + [Automated refresh](tables-iceberg-auto-refresh.md) is enabled by default. If the `table-uuid` of an external table
>     and the catalog-linked database table don’t match, refresh fails and Snowflake drops the table from the catalog-linked database; Snowflake doesn’t change the remote table.
>   + If you drop a table from the remote catalog, Snowflake drops the table from the catalog-linked database.
>     This action is asynchronous, so you might not see the change in the remote catalog right away.
>   + If you rename a table in the remote catalog, Snowflake drops the existing table from the catalog-linked database and creates a table with the new name.
>   + Masking policies and tags are supported. Other Snowflake-specific features, including replication and cloning, aren’t supported.
>   + The character that you choose for the NAMESPACE_FLATTEN_DELIMITER parameter can’t appear in your remote namespaces. During the auto discovery process,
>     Snowflake skips any namespace that contains the delimiter, and doesn’t create a corresponding schema in your catalog-linked database.
>   + If you specify anything other than `_`, `$`, or numbers for the NAMESPACE_FLATTEN_DELIMITER parameter,
>     you must put the schema name in quotes when you query the table.
>   + For databases linked to AWS Glue, you must use lowercase letters and surround the schema, table, and column names in double quotes.
>     This is also required for other Iceberg REST catalogs that only support lowercase identifiers.
>
>     The following example shows a valid query:
>
>     ```sqlexample
>     CREATE SCHEMA "s1";
>     ```
>
>     The following statements aren’t valid, because they use uppercase letters or omit the double quotes:
>
>     ```sqlexample
>     CREATE SCHEMA s1;
>     CREATE SCHEMA "Schema1";
>     ```
>   + Using UNDROP ICEBERG TABLE isn’t supported.
>   + Sharing:
>
>     - Sharing with a listing isn’t currently supported
>     - Direct sharing is supported
> * For writing to tables in a catalog-linked database:
>
>   + Creating tables in nested namespaces isn’t currently supported.
>   + Writing to tables in nested namespaces isn’t currently supported.
>   + Position [row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) are supported for tables stored
>     on Amazon S3, Azure, or Google Cloud. Row-level deletes with equality delete files aren’t supported. For more information about row-level deletes,
>     see [Use row-level deletes](tables-iceberg-manage.md). To turn off position deletes, which enable
>     running the Data Manipulation Language (DML) operations in copy-on-write mode, set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or
>     database level.

**Externally managed write support**

> * Snowflake supports externally managed writes for Iceberg tables that use version 2 of the
>   [Iceberg table specification](https://iceberg.apache.org/spec/).
> * Snowflake provides Data Definition Language (DDL) and Data Manipulation Language (DML) commands for externally managed tables. However,
>   you configure metadata and data retention using your external catalog and the tools provided by your external storage provider.
>   For more information, see [Tables that use an external catalog](tables-iceberg-metadata.md).
>
>   For writes, Snowflake ensures that changes are committed to your remote catalog before updating the table in Snowflake.
> * If you use a catalog-linked database, you can use the CREATE ICEBERG TABLE syntax with column definitions to create a table in Snowflake
>   *and* in your remote catalog. If you use a standard Snowflake database (not linked to a catalog), you must first create a
>   table in your remote catalog. After that, you can use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) syntax to create
>   an Iceberg table in Snowflake and write to it.
> * For the AWS Glue Data Catalog: Dropping an externally managed table through Snowflake doesn’t delete
>   the underlying table files. This behavior is specific to the AWS Glue Data Catalog implementation.
> * You can’t drop an Amazon S3 Table through Snowflake. The Amazon S3 Tables service requires
>   the `purge` option to be specified with the DROP command, which Snowflake doesn’t currently support.
> * Position [row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) are supported for tables stored on
>   Amazon S3, Azure, or Google Cloud. Row-level deletes with equality delete files aren’t supported. For more information about row-level deletes,
>   see [Use row-level deletes](tables-iceberg-manage.md). To turn off position deletes, which enable
>   running the DML operations in copy-on-write mode, set the
>   `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or database level.
> * Writing to externally managed tables with the following Iceberg data types isn’t supported:
>
>   + `uuid`
>   + `fixed(L)`
> * The following features aren’t currently supported when you use Snowflake to write to externally managed Iceberg tables:
>
>   + Server-side encryption (SSE) for Azure external volumes.
>   + Multi-statement transactions. Snowflake supports autocommit transactions only.
>   + Conversion to Snowflake-managed tables.
>   + External Iceberg catalogs that don’t conform to the Iceberg REST protocol.
>   + Using the OR REPLACE option when creating a table.
>   + Using the CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT syntax if you use one of the following catalogs as your remote catalog:
>
>     - AWS Glue
>     - Databricks Unity Catalog
>
>     Alternatively, you can use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) syntax to create an empty Iceberg table and then use
>     an [INSERT INTO … SELECT](../sql-reference/sql/insert.md) statement to insert data into the empty table. However, this alternative
>     uses two separate transactions, so it doesn’t guarantee atomicity.
> * For creating schemas in a catalog-linked database, be aware of the following:
>
>   + The CREATE SCHEMA command creates a corresponding namespace in your remote catalog only when you use a catalog-linked database.
>   + The ALTER and CLONE options aren’t supported.
>   + Delimiters aren’t supported for schema names. Only alphanumeric schema names are supported.
>
> * You can set a target file size for a table’s Parquet files. For more information, see [Set a target file size](tables-iceberg-manage.md).
> * For Azure cloud storage services: Snowflake only supports externally managed writes for Iceberg tables that use the following services for external storage:
>
>   + Blob Storage
>   + Data Lake Storage Gen2
>
>     [Preview feature](../release-notes/preview-features.md) — Open
>
>     Available to all accounts.
>
>     Connecting Snowflake to Data Lake Storage Gen2 storage by using an external volume is in public preview. This configuration enables externally managed
>     writes to catalogs that
>     are only configured to use Data Lake Storage, such as Unity Catalog. For more information, see [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md)
>
>     > **Note:**
>     >
>     > Connecting Snowflake to Data Lake Storage Gen2 storage by using catalog-vended credentials isn’t supported.
>   + General-purpose v1
>   + General-purpose v2
>   + Microsoft Fabric OneLake
> * Sharing:
>
>   + Sharing with a listing isn’t currently supported.
>   + Direct sharing isn’t currently supported.

**Access by third-party clients to Iceberg data, metadata**

> * Third-party clients can’t append to, delete from, or upsert data to Iceberg tables that use Snowflake as the catalog.

**Table optimization**

* Snowflake doesn’t support orphan file deletion for Snowflake-managed Iceberg tables. If you see a mismatch between storage usage for your
  external cloud storage and Snowflake, you might have orphan files in your external cloud storage. To see your storage usage for Snowflake,
  you can use the [TABLE_STORAGE_METRICS view](../sql-reference/info-schema/table_storage_metrics.md) or [TABLE_STORAGE_METRICS view](../sql-reference/account-usage/table_storage_metrics.md).
  If you see a mismatch, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for assistance with determining whether you have orphan files and removing them.
* For Snowflake-managed Iceberg tables, if a DML operation fails unexpectedly and rolls back, some Parquet files might get written to your
  external cloud storage but won’t be tracked or referenced by your Iceberg table metadata. These Parquet files are orphan files.

**External query engines through Snowflake Horizon Catalog**

This section lists the considerations for accessing, querying, and writing to Iceberg tables with an external query engine.

Consider the following items when you access Iceberg tables with an external query engine:

* Iceberg

  + For tables in Snowflake:

    - Only Snowflake-managed Iceberg tables are supported.
* Listings:

  + Iceberg tables that you share through [auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md) aren’t
    accessible through the consumer account’s Horizon Iceberg REST Catalog API.
* Network and private connectivity:

  + Using network policies that are set at the user level isn’t supported with this feature.
  + For [Snowflake-managed network rules](network-rules.md), egress IP addresses that are static aren’t supported.
  + Explicitly granting the Horizon Catalog endpoint access to your storage accounts isn’t supported. We recommend that you use private connectivity for
    secure connectivity from external engines to Horizon Catalog and from Horizon Catalog to your storage account.
* Clouds:

  + Commercial: This feature is only supported for Snowflake-managed Iceberg tables that are stored on Amazon S3, Google Cloud, or Microsoft Azure for
    all commercial cloud regions. S3-compatible non-AWS storage isn’t yet supported.
  + FedRAMP (Moderate): This feature is supported for Snowflake-managed Iceberg tables that are stored on FedRAMP (Moderate) deployments
    on AWS Commercial Gov (US) in the us-east-1 and us-west-2 regions.
  + For Iceberg tables stored on Amazon S3:

    - If you want to use SSE-KMS encryption, contact customer support or your account team for assistance with enabling access.

      > **Note:**
      >
      > Writing to KMS-encrypted external volumes is not supported.
  + For Iceberg tables stored on Azure:

    - Azure Virtual Network (VNet) isn’t supported.
* Authentication:

  + For key-pair authentication, key-pair rotation isn’t supported.
  + Workload identity federation isn’t supported with this feature.

Consider the following items when you query (read) Iceberg tables with an external query engine:

* Iceberg

  + Querying the following tables isn’t supported:

    - Remote tables
    - Snowflake native tables
    - Externally managed Iceberg tables including Delta-based Iceberg tables and
      Snowflake-managed Iceberg tables that you loaded with data from Iceberg-compatible Parquet data files by using the COPY INTO table command
  + Reading Iceberg v2 tables is supported.
  + Reading Iceberg V3 tables (public preview) is supported for the following capabilities:

    - Variant data type
    - Row lineage

    All other Iceberg V3 capabilities, including default values and the geography data type, aren’t supported.
* Access control:

  + Tables protected by the following fine-grained data policies can be accessed over Apache Spark™ through Snowflake Horizon Catalog:

    - Masking policies
    - Tag-based masking policies
    - Row access policies

    For more information, see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).
* Cloned and converted tables:

  + Reading cloned or converted tables is not supported with vended credentials. To read these tables, use direct access to object
    storage.

Consider the following items when you write to Iceberg tables with an external query engine:

* Table operations:

  + You can’t specify a base location with your CREATE TABLE statement.

    When you create a Snowflake-managed table without specifying a base location, Snowflake constructs the following path for your table:
    `STORAGE_BASE_URL/database/schema/table_name.randomId/[data | metadata]/`
  + CREATE TABLE AS SELECT (CTAS) from an external engine is not supported.
  + Equality deletes aren’t supported.
  + You can’t write to tables by using row-level deletes; only copy-on-write mode is supported.
  + Creating Iceberg tags and branches isn’t supported.
  + The external engine writes are supported only on Iceberg version 2; writing to Iceberg version 3 (v3)
    tables (public preview) is not currently supported.
  + Writing to KMS-encrypted external volumes is not supported.
  + Writing to dynamic tables in Snowflake isn’t supported.
  + Writing to shared Iceberg tables isn’t supported.
  + Registering Iceberg tables isn’t supported.
* Maintenance operations

  + You can’t roll back a table to a previous snapshot.
  + The snapshot expiration operation isn’t supported.
  + You can’t upgrade an Iceberg table from v2 to v3.
* Cloned and converted tables:

  + Writing to cloned or converted tables is not supported with vended credentials. To write to these tables, connect your external query
    engine directly to the object storage where your tables are stored.
  + You can’t write to an Iceberg table that was converted from externally managed to Snowflake managed.
* Streams:

  + On Iceberg V2 tables, copy-on-write operations cause standard streams to represent an updated or relocated row as a DELETE record followed
    by an INSERT record for the same row.
* Fine-grained access control policies:

  + Writing to tables that have fine-grained access control policies or tags isn’t supported.

**Native App Framework**

> You can share Iceberg tables with consumers through the
> [Snowflake Native App Framework](../developer-guide/native-apps/native-apps-about.md).
> Be aware of the following restrictions:
>
> * Iceberg tables shared through a Native App are read-only for consumers.
> * Cross-Cloud Auto-Fulfillment is not supported for apps that share Iceberg tables.
> * Consumers must explicitly enable the `EXTERNAL_DATA` restricted feature to the app
>   before it can resolve Iceberg tables. For more information, see
>   [Request access to external and Apache Iceberg™ tables](../developer-guide/native-apps/requesting-external-tables.md).

**Unsupported features**

> The following Snowflake features aren’t currently supported for all Iceberg tables:
>
> * [Collation](../sql-reference/collation.md)
> * [Fail-safe](data-failsafe.md)
> * [Hybrid tables](tables-hybrid.md)
> * Snowflake encryption
> * [Snowflake schema evolution](data-load-schema-evolution.md)
> * [Tagging](object-tagging/introduction.md) using the
>   [ASSOCIATE_SEMANTIC_CATEGORY_TAGS](../sql-reference/stored-procedures/associate_semantic_category_tags.md) stored procedure
> * [Temporary and transient tables](tables-temp-transient.md)
>
> The following features aren’t supported for externally managed Iceberg tables:
>
> * [Cloning](tables-storage-considerations.md)
> * [Clustering](tables-clustering-micropartitions.md)
> * Standard and append-only [streams](streams-intro.md). Insert-only streams are supported.
> * [Replication](account-replication-intro.md) of Iceberg tables, external volumes, or catalog integrations

---
title: Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-v3-specification-support.md
section: User Guide
---

# Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (*Preview*)

This preview introduces support for v3 of the Apache Iceberg™ specification, but with some considerations and limitations. Unless
otherwise noted, both Snowflake-managed and externally managed Iceberg tables are supported in this preview.

## Supported Iceberg v3 features

This section lists the Iceberg v3 features that are supported in this preview.

### Data types

The following v3 data types are supported in the public preview:

* `geography`
* `geometry`
* `nanosecond`
* `variant`

For more information, see [Iceberg v3 data types](tables-iceberg-data-types.md).

### Default values

See [Default values](tables-iceberg-manage.md).

### Deletion vectors

See [Deletion vectors](tables-iceberg-manage.md).

### Row lineage

See [Row lineage](tables-iceberg-manage.md).

## Configure the default Iceberg version

Iceberg tables inherently have a format version that they conform to. For externally managed Iceberg tables in a standard Snowflake database,
Snowflake retrieves this version from the table’s metadata.

For the following Iceberg tables, the table owner must specify which Iceberg version the table should conform to:

* Snowflake-managed Iceberg tables
* Externally managed Iceberg tables that you create in a [catalog-linked database](tables-iceberg-catalog-linked-database.md)

The system default Iceberg format version in Snowflake is v2 but you can set it to v3, if needed. To set the Iceberg version to v3, perform one of the following actions:

* Use the ICEBERG_VERSION_DEFAULT parameter to set the Iceberg version to `3` at the account, database, or schema level. For more information,
  see [ICEBERG_VERSION_DEFAULT](../sql-reference/parameters.md).
* Specify `ICEBERG_VERSION = 3` in your CREATE ICEBERG TABLE statement.

  > **Note:**
  >
  > If you don’t specify an Iceberg version when you create an Iceberg table, the table defaults to the Iceberg version set for the
  > schema, database, or account. The schema takes precedence over the database, and the database takes precedence over the account.

> **Caution:**
>
> Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
> engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
> readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
> needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

### Usage notes

* To modify the ICEBERG_VERSION_DEFAULT parameter at the account level, you must be an account administrator; that is, you must be a user
  with the ACCOUNTADMIN role.
* To modify the ICEBERG_VERSION_DEFAULT parameter at the database or schema level, the role used to perform the operation must have the OWNERSHIP
  privilege on the respective database or schema.

### Examples

Specify that new Iceberg tables in the `my_db` database should be created using v3:

```sqlexample
ALTER DATABASE my_db SET ICEBERG_VERSION_DEFAULT=3;
```

Create a new externally managed Iceberg table with v3. The column definitions included with the command indicate that a new table
will be created, or an existing table will be replaced, in the remote catalog. The table is successfully created because this is a new
table that doesn’t have an existing version.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_v3_table (
    boolean_col boolean,
    int_col int,
    long_col long,
  )
  CATALOG='my_catalog_integration'
  ICEBERG_VERSION=3;
```

Create an externally managed Iceberg table with v3 from an existing table with Iceberg metadata. The lack of a column definitions
or format version in this example indicates that this table already exists and the column specification and format version will be inferred from
Iceberg metadata from the remote catalog. This example uses
[catalog-vended credentials](tables-iceberg-configure-catalog-integration-vended-credentials.md), so
the EXTERNAL_VOLUME parameter is excluded from the CREATE ICEBERG TABLE statement:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_v3_table
  CATALOG = 'my_catalog_integration'
  CATALOG_TABLE_NAME = 'my_table'
  AUTO_REFRESH = TRUE;
```

> **Note:**
>
> You can’t use the ALTER ICEBERG TABLE command to change the format version for an existing table.

## Get the format version for Iceberg tables

* The following example shows how to get the Iceberg version for a specific table:

  ```sqlexample
  SHOW PARAMETERS LIKE 'ICEBERG_VERSION' IN TABLE my_v3_iceberg_table;
  ```

  Output:

  ```output
  +-----------------+-------+---------+-------+---------------------------------------------------+--------+
  | key             | value | default | level | description                                       | type   |
  +-----------------+-------+---------+-------+---------------------------------------------------+--------+
  | ICEBERG_VERSION | 3     | 2       | TABLE | Specifies the Iceberg table format version to ... | NUMBER |
  +-----------------+-------+---------+-------+---------------------------------------------------+--------+
  ```
* The following example shows how to get the Iceberg version for a specific table by using the [GET_DDL](../sql-reference/functions/get_ddl.md) function
  to retrieve the Iceberg table definition:

  ```sqlexample
  SELECT GET_DDL('ICEBERG_TABLE', 'my_v3_iceberg_table');
  ```

  Output:

  ```output
   CREATE ICEBERG TABLE my_v3_iceberg_table (
    record VARIANT,
    event_timestamp TIMESTAMP_LTZ(6)
  )
    CATALOG = 'SNOWFLAKE'
    EXTERNAL_VOLUME = 'my_external_volume'
    BASE_LOCATION = 'my_iceberg_table'
    ICEBERG_VERSION = 3;
  ```

## Considerations and limitations for Iceberg v3 features

Consider the following information when you use Iceberg v3 features:

### Unsupported Snowflake features

The following Snowflake features aren’t supported in this preview for Iceberg v3:

* Append-only streams on externally managed Iceberg tables
* [dbt Projects on Snowflake](data-engineering/dbt-projects-on-snowflake.md)
* [Schema inference](../sql-reference/functions/infer_schema.md)
* [Snowpipe Streaming classic architecture](snowpipe-streaming/snowpipe-streaming-classic-overview.md)
* SnowGov Regions
* For tables that use an external catalog, you can’t create Iceberg v3 tables with structured type columns, which includes OBJECT, ARRAY,
  or MAP. For example, you can’t use CREATE ICEBERG TABLE … AS SELECT (CTAS) to create an externally managed Iceberg v3 table with
  structured type columns.

  You can create Snowflake-managed Iceberg v3 tables with structured type columns.
* An in-place upgrade of a Snowflake-managed Iceberg table from v2 to v3, which includes cloning a v2 table, and then upgrading the clone to v3

  > **Important:**
  >
  > If you use Apache Spark to upgrade an externally managed Iceberg table from v2 to v3, you must use a commit that creates a new
  > snapshot, such as DML operations. Otherwise, if the format-version is updated in table properties without a new snapshot, Snowflake’s
  > manual and automated refresh for the table will fail until a new snapshot is created.
  >
  > The following example uses Apache Spark to upgrade an externally managed Iceberg table from v2 to v3:
  >
  > ```sqlexample
  > ALTER TABLE table_name SET TBLPROPERTIES('format-version'='3');
  > ```

> **Note:**
>
> * The list of unsupported features isn’t finalized and is subject to change in the future. The list will be updated, as needed,
>   to reflect the latest unsupported features.
> * For considerations and limitations
>   specific to a v3 feature, see the feature topic for a feature.

### Supported Snowflake features

Features that aren’t listed in the Unsupported Snowflake features section are
supported. Supported features include those in the following list:

| Feature | Notes |
| --- | --- |
| [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md) |  |
| [Automated refresh](tables-iceberg-auto-refresh.md) |  |
| [Catalog integrations](tables-iceberg-configure-catalog-integration.md) |  |
| [Catalog-linked databases](tables-iceberg-catalog-linked-database.md) |  |
| [Cloning](../sql-reference/sql/create-clone.md) |  |
| Clustering | Snowflake-managed Iceberg v3 only. |
| [Converting externally managed v3 tables to Snowflake-managed](tables-iceberg-conversion.md) | Supported with the following considerations:   * Iceberg partitioning remains intact when you convert a v3 Iceberg table. * Before conversion, Snowflake never deletes any metadata, manifest lists, or manifests from your external storage. * During conversion, Snowflake doesn’t rewrite any metadata or Parquet data files. * After conversion, Snowflake is the catalog that is fully responsible for the lifecycle management of the table. Snowflake   deletes metadata, manifest lists, manifests, and data files, either created before or after conversion from your external storage   after they expire and pass the retention window. |
| [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) | LOAD_MODE = FULL_INGEST or ADD_FILES COPY are supported with the following considerations:   * To load the row lineage metadata columns in Parquet files   (`_row_id` and `_last_updated_sequence_number`), you must use the FULL_INGEST option. The other   LOAD_MODE methods aren’t supported. However, Parquet files containing row lineage are likely already part of an Iceberg v3 table.   Registering Parquet files by using ADD_FILES_COPY isn’t recommended if those files are already part of another Iceberg table. The best   practice for converting externally-managed Iceberg tables to Snowflake-managed Iceberg tables without rewriting files is to use the   [ALTER ICEBERG TABLE … CONVERT TO MANAGED](../sql-reference/sql/alter-iceberg-table-convert-to-managed.md) command. |
| [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) | Supported with the following limitations:   * VARIANT, GEOMETRY, and GEOGRAPHY are unloaded as JSON-encoded strings. * TIMESTAMP_NTZ(9) is unloaded as milliseconds, not nanoseconds. * TIMESTAMP_LTZ(9), ARRAY, OBJECT, and MAP must be casted to other data types. |
| [Data Clean Rooms](cleanrooms/overview.md) |  |
| [Data lineage](ui-snowsight-lineage.md) |  |
| Data protection policies | The following data protection policies are supported:   * Masking policies * Row access policies * Projection policies * Aggregation policies * Privacy policies * Join policies |
| [Data protection policy enforcement from Apache Spark](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md) |  |
| Data quality monitoring |  |
| [Dynamic tables](dynamic-tables-create-iceberg.md) | Write a v3 externally managed Iceberg table as the target of a dynamic table. |
| [Horizon Iceberg REST Catalog API](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md) |  |
| [LOB (Large Object)](../release-notes/bcr-bundles/2025_03/bcr-1942.md) |  |
| [Materialized Views](views-materialized.md) |  |
| [Object tagging](object-tagging/introduction.md) |  |
| Query acceleration |  |
| [Replication](account-replication-intro.md) |  |
| [Search optimization](search-optimization-service.md) |  |
| [Secure views](views-secure.md) |  |
| Sensitive Data Classification |  |
| [Target file size](tables-iceberg-manage.md) |  |
| [Single-argument Iceberg partitioning](tables-iceberg-metadata.md) | Partitioned tables can’t also write deletion vectors; only copy-on-write is supported for partitioned tables. |
| [Snowflake Connector for Kafka](kafka-connector.md) | Versions 4.0 or newer. |
| [Snowpark](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.saveAsTable) | 1.33.0 or newer. |
| Snowpark pandas API method [to_iceberg](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.45.0/modin/pandas_api/modin.pandas.to_iceberg) | Only supported for Iceberg v3 when ICEBERG_VERSION_DEFAULT is set on the account, database, or schema. If ICEBERG_VERSION = 3 is set at the table level, Snowpark pandas API method to_iceberg isn’t supported. |
| [Snowpark Connect for Apache Spark](../developer-guide/snowpark-connect/snowpark-connect-overview.md) | Writing dataframes to existing Iceberg v3 tables by using an append or overwrite method is supported. Creating a new Iceberg v3 table isn’t supported. |
| [Snowpipe](data-load-snowpipe-intro.md) |  |
| [Snowpipe Streaming key concepts](snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) |  |
| [Sharing](data-sharing-intro.md) |  |
| [Streams](streams-intro.md) | * Append-only streams and standard streams are supported on Snowflake-managed Iceberg v3 tables. * Insert-only streams and standard streams are supported on externally managed Iceberg v3 tables.    + To have standard streams produce the correct results, the external engine must write to Iceberg v3 tables with respect to the Iceberg v3     specification. Specifically, newly inserted rows should have `_row_id=NULL`. Rows that are copied during copy-on-write should maintain the `_row_id`.   + MAX_DATA_EXTENSION_TIME_IN_DAYS doesn’t work on externally managed Iceberg v3 tables. * When DMLs are committed over multi-statement transactions, append-only streams on Iceberg v3 tables have different semantics compared to Iceberg v2 tables:    + On Iceberg v2, for append-only streams, if a row is added and then deleted in a multi-statement transaction, this row is considered an     insertion.   + On Iceberg v3, for append-only streams, this row isn’t treated as an insertion. |
| [Table optimization](tables-iceberg-manage.md) |  |

### Unsupported Iceberg v3 features

The following features from the Iceberg v3 specification aren’t supported:

* Nested variant
* Multi-argument transforms for partitioning and sorting
* Table encryption keys
* UNKNOWN data type

## Examples: Support for v3 with existing Snowflake features

This section lists examples of the existing Snowflake features that are supported with v3. A feature listing includes an example for a
Snowflake-managed table and an externally managed table, when supported.

For the full list of Snowflake features that are supported in this preview for Iceberg v3,
see Supported Snowflake features.

### Create a v3 Iceberg table

The following example creates a Snowflake-managed Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ specification:

```sqlexample
CREATE ICEBERG TABLE my_v3_iceberg_table (
  record VARIANT,
  event_timestamp TIMESTAMP_LTZ(6)
)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table'
  ICEBERG_VERSION = 3;
```

The following example creates an Apache Iceberg™ table that uses a remote Iceberg REST catalog and conforms to v3 of the Apache Iceberg™ specification:

> **Note:**
>
> You don’t need to specify `ICEBERG_VERSION = 3` with the command because the format version is already defined in the
> external catalog’s metadata, so Snowflake retrieves this version from the metadata.

```sqlexample
CREATE ICEBERG TABLE my_v3_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'my_rest_catalog_integration'
  CATALOG_TABLE_NAME = 'my_remote_table'
  AUTO_REFRESH = TRUE;
```

The following example creates a writable Iceberg table in a
[catalog-linked database](tables-iceberg-catalog-linked-database.md)
with column definitions and conforms to v3 of the Apache Iceberg™ specification:

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA 'my_namespace';

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
  first_name string,
  last_name string,
  amount int,
  create_date date
)
  ICEBERG_VERSION = 3;
```

### Write to a v3 Iceberg table

DML commands INSERT, UPDATE, DELETE, MERGE, TRUNCATE TABLE, and COPY INTO are supported for writing to Snowflake-managed and
[externally managed](tables-iceberg-externally-managed-writes.md) Iceberg v3 tables:

The following example inserts a row into an Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ table specification:

```sqlexample
INSERT INTO my_v3_iceberg_table (id, payload) VALUES (1, PARSE_JSON('{"name": "Alice", "age": 30}'));
```

The following example loads files into an Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ table specification:

```sqlexample
COPY INTO my_v3_iceberg_table
  FROM @my_json_stage
  FILE_FORMAT = 'my_json_format'
  MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
```

### Load data by using Snowpipe

The following example loads data from files for Iceberg v3 tables, for both Snowflake-managed and externally managed tables:

```sqlexample
CREATE PIPE mypipe
  AUTO_INGEST = TRUE
  INTEGRATION = 'MYINT'
  AS
  COPY INTO snowpipe_db.public.my_v3_iceberg_table
  FROM @snowpipe_db.public.mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

> **Note:**
>
> Snowflake supports additional write features for Iceberg v3. For this list, see the
> considerations and limitations for Iceberg v3 features, and
> then see the supported Snowflake features list.

### Create a v3 dynamic Iceberg table

The following example writes a v3 Snowflake-managed Iceberg table as the output of a dynamic table:

```sqlexample
CREATE DYNAMIC ICEBERG TABLE my_dynamic_iceberg_v3_table (
    num_orders NUMBER(10,0),
    order_day
  )
  TARGET_LAG = '20 minutes'
  WAREHOUSE = my_warehouse
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_dynamic_iceberg_v3_table'
  ICEBERG_VERSION = 3
  AS
    SELECT
        COUNT(DISTINCT order_id)
        DATE_TRUNC('DAY', order_timestamp_ns) AS order_day
      FROM staging_v3_iceberg_table;
```

> **Note:**
>
> Writing either a v2 or v3 externally managed Iceberg table as the target of a dynamic table isn’t supported. The output of a dynamic
> Iceberg table can only be Snowflake-managed.

### Query a v3 Iceberg table

The following example queries a Snowflake-managed or externally managed Iceberg v3 table:

> ```sqlexample
> SELECT * FROM MY_DB.MY_SCHEMA.MY_ICEBERG_V3_TABLE;
> ```

---
title: Attributing cost
source: https://docs.snowflake.com/en/user-guide/cost-attributing.md
section: User Guide
---

# Attributing cost

An organization can apportion the cost of using Snowflake to logical units within the organization (for example, to different
departments, environments, or projects). This chargeback or showback model is useful for accounting purposes and pinpoints
areas of the organization that could benefit from controls and optimizations that can reduce costs.

To attribute costs to different groups like departments or projects, use the following recommended approach:

* Use [object tags](object-tagging/introduction.md) to associate resources and users with departments or projects.
* Use [query tags](../sql-reference/parameters.md) to associate individual queries with departments or projects when the queries are
  made by the same application on behalf of users belonging to multiple departments.

## Types of cost attribution scenarios

The following cost attribution scenarios are the most commonly encountered. In these scenarios, warehouses are used as an
example of a resource that incurs costs.

* **Resources used exclusively by a single cost center or department:** An example of this is using object tags to associate
  warehouses with a department. You can use these object tags to attribute the costs incurred by those warehouses to that
  department entirely.
* **Resources that are shared by users from multiple departments:** An example of this is a warehouse shared by users from
  different departments. In this case, you use object tags to associate each user with a department. The costs of queries are
  attributed to the users. Using the object tags assigned to users, you can break down the costs by department.
* **Applications or workflows shared by users from different departments:** An example of this is an application that issues
  queries on behalf of its users. In this case, each query executed by the application is assigned a query tag that identifies
  the team or cost center of the user on whose behalf the query is being made.

The next sections explain how to set up object tags in your accounts and provide the details for each of these cost attribution
scenarios.

## Setting up object tags for cost attribution

When you set up tags to represent the groupings that you want to use for cost attribution, you should determine if the
groupings apply to a single account or multiple accounts. This determines how you set up your tags.

For example, suppose that you want to attribute costs based on department.

* If the resources used by the department are located in a single account, you create the tags in a database in that account.
* If the resources used by the department span multiple accounts, you create the tags
  in a key account in your organization (for example, in your [organization account](organization-accounts.md)),
  and you make those tags available in other accounts through replication.

The next sections explain how to create the tags, replicate the tags, and apply the tags to resources.

* Creating the tags
* Replicating the tag database
* Tagging the resources and users

> **Note:**
>
> The examples in these sections use the custom role `tag_admin`, which is assumed to have been granted the privileges to
> create and manage tags. Within your organization, you can use more granular
> [privileges for object tagging](object-tagging/work.md) to develop a secure tagging strategy.

### Creating the tags

As part of designing the strategy, decide on the database and schema where you plan to create the tags.

* You can create a dedicated database and schema for the tags.
* If you want to tag resources in different accounts across your organization, you can create the tags in a key account in your
  organization (for example, in your [organization account](organization-accounts.md)).

The following example creates a database named `cost_management` and a schema named `tags` for the tags that you plan to use:

```sqlexample
USE ROLE tag_admin;

CREATE DATABASE cost_management;
CREATE SCHEMA tags;
```

With `cost_management` and `tags` selected as the current database and schema, create a tag named `cost_center` and set
the values allowed for the tag to the names of cost centers:

```sqlexample
CREATE TAG cost_center
  ALLOWED_VALUES 'finance', 'marketing', 'engineering', 'product';
```

### Replicating the tag database

If you have an organization with multiple accounts and you want to make the tags available in these other accounts,
[set up your accounts for replication](account-replication-config.md), and
[create a replication group](../sql-reference/sql/create-replication-group.md) in a main account (for example, in the
[organization account](organization-accounts.md)). Set up this replication group to replicate the database
containing the tags.

For example, to replicate the tags to the accounts named `my_org.my_account` and `my_org.my_account_2`, execute this
statement in your organization account:

```sqlexample
CREATE REPLICATION GROUP cost_management_repl_group
  OBJECT_TYPES = DATABASES
  ALLOWED_DATABASES = cost_management
  ALLOWED_ACCOUNTS = my_org.my_account_1, my_org.my_account_2
  REPLICATION_SCHEDULE = '10 MINUTE';
```

Then, in each account in which you want to make the tags available, create a secondary replication group, and refresh this
group from the primary group:

```sqlexample
CREATE REPLICATION GROUP cost_management_repl_group
  AS REPLICA OF my_org.my_org_account.cost_management_repl_group;

ALTER REPLICATION GROUP cost_management_repl_group REFRESH;
```

### Tagging the resources and users

After creating and replicating the tags, you can use these tags to identify the warehouses and users belonging to each
department. For example, because the sales department uses both `warehouse1` and `warehouse2`, you can set the
`cost_center` tag to `'SALES'` for both warehouses.

> **Tip:**
>
> Ideally, you should have workflows that automate the process of applying these tags when you create resources and users.

```sqlexample
USE ROLE tag_admin;

ALTER WAREHOUSE warehouse1 SET TAG cost_management.tags.cost_center='SALES';
ALTER WAREHOUSE warehouse2 SET TAG cost_management.tags.cost_center='SALES';
ALTER WAREHOUSE warehouse3 SET TAG cost_management.tags.cost_center='FINANCE';

ALTER USER finance_user SET TAG cost_management.tags.cost_center='FINANCE';
ALTER USER sales_user SET TAG cost_management.tags.cost_center='SALES';
```

## Viewing cost by tag in SQL

You can attribute costs within an account or across accounts in an organization:

* **Attributing costs within an account**

  You can attribute costs within an account by querying the following views in the
  [ACCOUNT_USAGE](../sql-reference/account-usage.md) schema:

  + [TAG_REFERENCES view](../sql-reference/account-usage/tag_references.md): Identifies objects (for example, warehouses and users) that have tags.
  + [WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md): Provides credit usage for warehouses.
  + [QUERY_ATTRIBUTION_HISTORY view](../sql-reference/account-usage/query_attribution_history.md): Provides the compute costs for queries. The cost per query is
    the warehouse credit usage for executing the query.

    For more information on using this view, see About the QUERY_ATTRIBUTION_HISTORY view.
* **Attributing costs across accounts in an organization**

  Within an organization, you can also attribute costs for resources that are used **exclusively by a single department** by
  querying views in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema from the
  [organization account](organization-accounts.md).

  > **Note:**
  > + In the ORGANIZATION_USAGE schema, the TAG_REFERENCES view is only available in the organization account.
  > + The QUERY_ATTRIBUTION_HISTORY view is only available in the ACCOUNT_USAGE schema for an account. There is no
  >   organization-wide equivalent of the view.

The next sections explain how to attribute costs for some of the
common cost-attribution scenarios:

* Resources not shared by departments
* Resources shared by users from different departments
* Resources used by applications that need to attribute costs to different departments

### Resources not shared by departments

Suppose that you want to attribute costs by department and that each department uses a set of dedicated warehouses.

If you tag warehouses with a `cost_center` tag to identify the department that owns the warehouse, you can join the
ACCOUNT_USAGE [TAG_REFERENCES view](../sql-reference/account-usage/tag_references.md) with the
[WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md) on the `object_id` and `warehouse_id` columns to get usage
information by warehouse, and you can use the `tag_value` column to identify the departments that own those warehouses.

The following SQL statement performs this join:

```sqlexample
SELECT
    TAG_REFERENCES.tag_name,
    COALESCE(TAG_REFERENCES.tag_value, 'untagged') AS tag_value,
    SUM(WAREHOUSE_METERING_HISTORY.credits_used_compute) AS total_credits
  FROM
    SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
      LEFT JOIN SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES
        ON WAREHOUSE_METERING_HISTORY.warehouse_id = TAG_REFERENCES.object_id
          AND TAG_REFERENCES.domain = 'WAREHOUSE'
  WHERE
    WAREHOUSE_METERING_HISTORY.start_time >= DATE_TRUNC('MONTH', DATEADD(MONTH, -1, CURRENT_DATE))
      AND WAREHOUSE_METERING_HISTORY.start_time < DATE_TRUNC('MONTH',  CURRENT_DATE)
  GROUP BY TAG_REFERENCES.tag_name, COALESCE(TAG_REFERENCES.tag_value, 'untagged')
  ORDER BY total_credits DESC;
```

```output
+-------------+-------------+-----------------+
| TAG_NAME    | TAG_VALUE   |   TOTAL_CREDITS |
|-------------+-------------+-----------------|
| NULL        | untagged    |    20.360277159 |
| COST_CENTER | Sales       |    17.173333333 |
| COST_CENTER | Finance     |      8.14444444 |
+-------------+-------------+-----------------+
```

You can run a similar query to perform the same attribution for all the accounts in your organization using views in the
ORGANIZATION_USAGE schema from the [organization account](organization-accounts.md). The rest of the query
does not change.

```sqlexample
SELECT
    TAG_REFERENCES.tag_name,
    COALESCE(TAG_REFERENCES.tag_value, 'untagged') AS tag_value,
    SUM(WAREHOUSE_METERING_HISTORY.credits_used_compute) AS total_credits
  FROM
    SNOWFLAKE.ORGANIZATION_USAGE.WAREHOUSE_METERING_HISTORY
      LEFT JOIN SNOWFLAKE.ORGANIZATION_USAGE.TAG_REFERENCES
        ON WAREHOUSE_METERING_HISTORY.warehouse_id = TAG_REFERENCES.object_id
          AND TAG_REFERENCES.domain = 'WAREHOUSE'
          AND tag_database = 'COST_MANAGEMENT' AND tag_schema = 'TAGS'
  WHERE
    WAREHOUSE_METERING_HISTORY.start_time >= DATE_TRUNC('MONTH', DATEADD(MONTH, -1, CURRENT_DATE))
      AND WAREHOUSE_METERING_HISTORY.start_time < DATE_TRUNC('MONTH',  CURRENT_DATE)
  GROUP BY TAG_REFERENCES.tag_name, COALESCE(TAG_REFERENCES.tag_value, 'untagged')
  ORDER BY total_credits DESC;
```

### Resources shared by users from different departments

Suppose that users in different departments share the same warehouses and you want to break down the credits used by each
department. You can tag the users with a `cost_center` tag to identify the department that they belong to, and you can join
the [TAG_REFERENCES view](../sql-reference/account-usage/tag_references.md) with the [QUERY_ATTRIBUTION_HISTORY view](../sql-reference/account-usage/query_attribution_history.md).

> **Note:**
>
> You can only get this data for a single account at a time. You cannot execute a query that retrieves this data across
> accounts in an organization.

The next sections provide examples of SQL statements for attributing costs for shared resources.

* Calculating the cost of user queries for the last month
* Calculating the cost of user queries by department without idle time
* Calculating the cost of queries by users without idle time
* Calculating the cost of queries by users without tags

#### Calculating the cost of user queries for the last month

This following SQL statement calculates the costs for the last month.

In this example, idle time is distributed among the users in proportion to their usage.

```sqlexample
WITH
  wh_bill AS (
    SELECT SUM(credits_used_compute) AS compute_credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
      WHERE start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
        AND start_time < CURRENT_DATE
  ),
  user_credits AS (
    SELECT user_name, SUM(credits_attributed_compute) AS credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
      WHERE start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
        AND start_time < CURRENT_DATE
      GROUP BY user_name
  ),
  total_credit AS (
    SELECT SUM(credits) AS sum_all_credits
    FROM user_credits
  )
SELECT
    u.user_name,
    u.credits / t.sum_all_credits * w.compute_credits AS attributed_credits
  FROM user_credits u, total_credit t, wh_bill w
  ORDER BY attributed_credits DESC;
```

```output
+-----------+--------------------+
| USER_NAME | ATTRIBUTED_CREDITS |
|-----------+--------------------+
| FINUSER   | 6.603575468        |
| SALESUSER | 4.321378049        |
| ENGUSER   | 0.6217131392       |
|-----------+--------------------+
```

#### Calculating the cost of user queries by department without idle time

The following example attributes the compute cost to each department through the queries executed by users in that department.
This query depends on the user objects having a tag that identifies their department.

```sqlexample
WITH joined_data AS (
  SELECT
      tr.tag_name,
      tr.tag_value,
      qah.credits_attributed_compute,
      qah.start_time
    FROM SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES tr
      JOIN SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY qah
        ON tr.domain = 'USER' AND tr.object_name = qah.user_name
)
SELECT
    tag_name,
    tag_value,
    SUM(credits_attributed_compute) AS total_credits
  FROM joined_data
  WHERE start_time >= DATEADD(MONTH, -1, CURRENT_DATE)
    AND start_time < CURRENT_DATE
  GROUP BY tag_name, tag_value
  ORDER BY tag_name, tag_value;
```

```output
+-------------+-------------+-----------------+
| TAG_NAME    | TAG_VALUE   |   TOTAL_CREDITS |
|-------------+-------------+-----------------|
| COST_CENTER | engineering |   0.02493688426 |
| COST_CENTER | finance     |    0.2281084988 |
| COST_CENTER | marketing   |    0.3686840545 |
|-------------+-------------+-----------------|
```

#### Calculating the cost of queries by users without idle time

This following SQL statement calculates the costs per user for the past month (excluding idle time).

```sqlexample
SELECT user_name, SUM(credits_attributed_compute) AS credits
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE
    start_time >= DATEADD(MONTH, -1, CURRENT_DATE)
    AND start_time < CURRENT_DATE
  GROUP BY user_name;
```

```output
+-----------+--------------------+
| USER_NAME | ATTRIBUTED_CREDITS |
|-----------+--------------------|
| JSMITH    |       17.173333333 |
| MJONES    |         8.14444444 |
| SYSTEM    |         5.33985393 |
+-----------+--------------------+
```

#### Calculating the cost of queries by users without tags

The following example calculates the cost of queries by users who are not tagged. You can use this to verify that tags are
being applied consistently to users.

```sqlexample
SELECT qah.user_name, SUM(qah.credits_attributed_compute) as total_credits
  FROM
    SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY qah
    LEFT JOIN snowflake.account_usage.tag_references tr
    ON qah.user_name = tr.object_name AND tr.DOMAIN = 'USER'
  WHERE
    start_time >= dateadd(month, -1, current_date)
    AND qah.user_name IS NULL OR tr.object_name IS NULL
  GROUP BY qah.user_name
  ORDER BY total_credits DESC;
```

```output
+------------+---------------+
| USER_NAME  | TOTAL_CREDITS |
|------------+---------------|
| RSMITH     |  0.1830555556 |
+------------+---------------+
```

### Resources used by applications that need to attribute costs to different departments

The examples in this section calculate the costs for one or more applications that are powered by Snowflake.

The examples assume that these applications set query tags that identify the application for all queries executed. To set the
query tag for queries in a session, execute the [ALTER SESSION](../sql-reference/sql/alter-session.md) command. For example:

```sqlexample
ALTER SESSION SET QUERY_TAG = 'COST_CENTER=finance';
```

This associates the `COST_CENTER=finance` tag with all subsequent queries executed during the session.

You can then use the query tag to trace back the cost incurred by these queries to the appropriate departments.

The next sections provide examples of using this approach.

* Calculating the cost of queries by department
* Calculating the cost of queries (excluding idle time) by query tag
* Calculating the cost of queries (including idle time) by query tag

#### Calculating the cost of queries by department

The following example calculates the compute credits and the credits used for the
[query acceleration service](query-acceleration-service.md) for the finance department. This depends on the
`COST_CENTER=finance` query tag being applied to the original queries that were executed.

Note that the costs exclude idle time.

```sqlexample
SELECT
    query_tag,
    SUM(credits_attributed_compute) AS compute_credits,
    SUM(credits_used_query_acceleration) AS qas
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE query_tag = 'COST_CENTER=finance'
  GROUP BY query_tag;
```

```output
+---------------------+-----------------+------+
| QUERY_TAG           | COMPUTE_CREDITS | QAS  |
|---------------------+-----------------|------|
| COST_CENTER=finance |      0.00576115 | null |
+---------------------+-----------------+------+
```

#### Calculating the cost of queries (excluding idle time) by query tag

The following example calculates the cost of queries by query tag and includes queries without tags (identified as “untagged”).

```sqlexample
SELECT
    COALESCE(NULLIF(query_tag, ''), 'untagged') AS tag,
    SUM(credits_attributed_compute) AS compute_credits,
    SUM(credits_used_query_acceleration) AS qas
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE start_time >= DATEADD(MONTH, -1, CURRENT_DATE)
  GROUP BY tag
  ORDER BY compute_credits DESC;
```

```output
+-------------------------+-----------------+------+
| TAG                     | COMPUTE_CREDITS | QAS  |
|-------------------------+-----------------+------+
| untagged                | 3.623173449     | null |
| COST_CENTER=engineering | 0.531431948     | null |
|-------------------------+-----------------+------+
```

#### Calculating the cost of queries (including idle time) by query tag

The following example distributes the idle time that is not captured in the per-query cost across departments in proportion
to their usage of the warehouse.

```sqlexample
WITH
  wh_bill AS (
    SELECT SUM(credits_used_compute) AS compute_credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
      WHERE start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
      AND start_time < CURRENT_DATE
  ),
  tag_credits AS (
    SELECT
        COALESCE(NULLIF(query_tag, ''), 'untagged') AS tag,
        SUM(credits_attributed_compute) AS credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
      WHERE start_time >= DATEADD(MONTH, -1, CURRENT_DATE)
      GROUP BY tag
  ),
  total_credit AS (
    SELECT SUM(credits) AS sum_all_credits
      FROM tag_credits
  )
SELECT
    tc.tag,
    tc.credits / t.sum_all_credits * w.compute_credits AS attributed_credits
  FROM tag_credits tc, total_credit t, wh_bill w
  ORDER BY attributed_credits DESC;
```

```output
+-------------------------+--------------------+
| TAG                     | ATTRIBUTED_CREDITS |
+-------------------------+--------------------|
| untagged                |        9.020031304 |
| COST_CENTER=finance     |        1.027742521 |
| COST_CENTER=engineering |        1.018755812 |
| COST_CENTER=marketing   |       0.4801370376 |
+-------------------------+--------------------+
```

## Viewing cost by tag in Snowsight

You can attribute costs by reporting on the use of resources that have the `cost_center` tag. You can access this data in
[Snowsight](ui-snowsight-gs.md).

1. Switch to a role that has [access to the ACCOUNT_USAGE schema](../sql-reference/account-usage.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Consumption.
4. From the Tags drop-down, select the `cost_center` tag.
5. To focus on a specific cost center, select a value from the list of the tag’s values.
6. Select Apply.

For more details about filtering in Snowsight, see [Filter by tag](cost-exploring-compute.md).

## About the QUERY_ATTRIBUTION_HISTORY view

You can use the [QUERY_ATTRIBUTION_HISTORY view](../sql-reference/account-usage/query_attribution_history.md) to attribute cost based on queries. The cost per
query is the warehouse credit usage for executing the query. This cost does not include any other credit usage that is incurred
as a result of query execution. For example, the following are not included in the query cost:

* Data transfer costs
* Storage costs
* Cloud services costs
* Costs for serverless features
* Costs for tokens processed by AI services

For queries that are executed concurrently, the cost of the warehouse is attributed to individual queries based on the weighted
average of their resource consumption during a given time interval.

The cost per query does not include warehouse *idle time*. Idle time is a period of time in which no queries are running in the
warehouse and can be measured at the warehouse level.

## Additional examples of queries

The next sections provide additional queries that you can use for cost attribution:

* Grouping similar queries
* Attributing costs of hierarchical queries

### Grouping similar queries

For recurrent or similar queries, use the `query_hash` or `query_parameterized_hash` to group costs
by query.

To find the most expensive recurrent queries for the current month, execute the following statement:

```sqlexample
SELECT query_parameterized_hash,
       COUNT(*) AS query_count,
       SUM(credits_attributed_compute) AS total_credits
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
  AND start_time < CURRENT_DATE
  GROUP BY query_parameterized_hash
  ORDER BY total_credits DESC
  LIMIT 20;
```

For an additional query based on query ID, see [Examples](../sql-reference/account-usage/query_attribution_history.md).

### Attributing costs of hierarchical queries

For stored procedures that issue multiple hierarchical queries, you can compute the attributed query costs for the
procedure by using the root query ID for the procedure.

1. To find the root query ID for a stored procedure, use the [ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md). For example,
   to find the root query ID for a stored procedure, set the `query_id` and execute the following statements:

   ```sqlexample
   SET query_id = '<query_id>';

   SELECT query_id,
          parent_query_id,
          root_query_id,
          direct_objects_accessed
     FROM SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY
     WHERE query_id = $query_id;
   ```

   For more information, see [Ancestor queries with stored procedures](access-history.md).
2. To sum the query cost for the entire procedure, replace `<root_query_id>` and execute the following statements:

   ```sqlexample
   SET query_id = '<root_query_id>';

   SELECT SUM(credits_attributed_compute) AS total_attributed_credits
     FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
     WHERE (root_query_id = $query_id OR query_id = $query_id);
   ```

---
title: Authentication policies
source: https://docs.snowflake.com/en/user-guide/authentication-policies.md
section: User Guide
---

# Authentication policies

Authentication policies provide you with control over how a client or user authenticates by allowing you to specify:

* Whether users must [enroll in multi-factor authentication (MFA)](ui-snowsight-profile.md).
* Which authentication methods require multi-factor authentication.
* The allowed authentication methods, such as [SAML](admin-security-fed-auth-overview.md), passwords,
  [OAuth](oauth-intro.md), [key pair authentication](key-pair-auth.md), and
  [programmatic access tokens](programmatic-access-tokens.md).
* The [SAML2 security integrations](admin-security-fed-auth-security-integration.md) that are available to users during the
  login experience. For example, if there are multiple security integrations, you can specify which identity provider (IdP) can be selected
  and used to authenticate.

  If you are using authentication policies to control which IdP a user can use to authenticate, you can further refine that control using
  the `ALLOWED_USER_DOMAINS` and `ALLOWED_EMAIL_PATTERNS` properties of the SAML2 security integrations associated with the
  IdPs. For more details, see [Using multiple identity providers for federated authentication](admin-security-fed-auth-security-integration-multiple.md).
* The clients that users can use to connect to Snowflake, such as [Snowsight](ui-snowsight-gs.md), [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [drivers](../developer-guide/drivers.md), or
  [SnowSQL (CLI client)](snowsql.md). The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs..

  By defining a “client policy” within an authentication policy, you can also set the minimum version that is allowed for specific client types.
* The [default and maximum expiration times](programmatic-access-tokens.md) and the
  [network policy requirements](programmatic-access-tokens.md) for programmatic access tokens.

You can set authentication policies on the account or users in the account. If you set an authentication policy on the account, then the
authentication policy applies to all users in the account. If you set an authentication policy on both an account and a user, then the
user-level authentication policy overrides the account-level authentication policy.

> **Note:**
>
> If you already have access to the identifier-first login flow, you need to migrate your account from the unsupported
> SAML_IDENTITY_PROVIDER account parameter using the [SYSTEM$MIGRATE_SAML_IDP_REGISTRATION](../sql-reference/functions/system_migrate_saml_idp_registration.md) function.

## Use cases

The following non-exhaustive list describes use cases for authentication policies:

* You want to control whether a user, all users in an account, or specific authentication methods require MFA.
* You want to control the user login flows when there are multiple login options.
* You want to control the authentication methods, specific client types, minimum versions of clients, and security integrations available
  to specific users or all users.
* You have customers building services on top of Snowflake using Snowflake drivers, but the customers do not want their users accessing
  Snowflake through Snowsight.
* You want to offer multiple identity providers as authentication options for specific users.

## Limitations

* The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs..

## Considerations

* Ensure authentication methods and security integrations listed in your authentication policies do not conflict. For example, if you add a
  SAML2 security integration in the list of allowed security integrations, and you only allow OAuth as an allowed authentication method,
  then you cannot create an authentication policy.
* Use an additional non-restrictive authentication policy for administrators in case users are locked out. For an example, see
  Preventing a lockout.

## Security policy precedence

When more than one type of security policy is activated, precedence between the policies occur. For example,
[network policies](network-policies.md) take precedence over authentication policies, so if the IP address of a request
matches an IP address in the blocked list of the network policy, then the authentication policy is not checked, and evaluation stops at the
network policy.

The following list describes the order in which security policies are evaluated:

1. [Network policies](../sql-reference/sql/create-network-policy.md): Allow or deny IP addresses, VPC IDs, and VPCE IDs.
2. Authentication policies - Allow or deny clients, authentication methods, and security integrations.
3. [Password policies](../sql-reference/sql/create-password-policy.md) (For local authentication only): Specify password requirements such
   as character length, characters, password age, retries, and lockout time.
4. [Session policies](../sql-reference/sql/create-session-policy.md): Require users to re-authenticate after a period of inactivity

If a policy is assigned to both the account and the user authenticating, the user-level policy is enforced.

## Combining identifier-first login with authentication policies

By default, Snowsight provides a generic login experience that provides several options for logging in,
regardless if the options are relevant to users. This means that authentication is attempted regardless of whether the login option is a
valid option for the user.

You can alter this behavior to enable a identifier-first login flow for Snowsight. In this flow, Snowflake prompts the user for an
email address or username before presenting authentication options. Snowflake uses the email address or username to identify the user, and
then only displays the login options that are relevant to the user, and are allowed by the authentication policy set on the account or user.

For instructions for enabling the identifier-first login flow, see [Identifier-first login](identifier-first-login.md).

The following table provides example configuration on how the identifier-first login and authentication policies can be combined to control
the login experience of the user.

| Configuration | Result |
| --- | --- |
| The authentication policy’s AUTHENTICATION_METHODS parameter only contains PASSWORD. | Snowflake prompts the user for an email address or username, and password. |
| The authentication policy’s AUTHENTICATION_METHODS parameter only contains SAML, and there is an active SAML2 security integration. | Snowflake redirects the user to the identity provider’s login page if the email address or username matches only one SAML2 security integration. |
| The authentication policy’s AUTHENTICATION_METHODS parameter contains both PASSWORD and SAML, and there is an active SAML2 security integration. | Snowflake displays a SAML SSO button if the email address or username matches only one SAML2 security integration, and the option to log in with an email address or username, and password. |
| The authentication policy’s AUTHENTICATION_METHODS parameter only contains SAML, and there are multiple active SAML2 security integrations. | Snowflake displays multiple SAML SSO buttons if the email address or username matches multiple SAML2 security integrations. |
| The authentication policy’s AUTHENTICATION_METHODS parameter contains both PASSWORD and SAML, and there are multiple active SAML2 security integrations. | Snowflake displays multiple SAML SSO buttons if the email address or username matches multiple SAML2 security integrations, and the option to log in with an email address or username, and password. |

## Creating an authentication policy

An administrator can use the [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) command to create a new authentication policy,
specifying which clients can connect to Snowflake, which authentication methods can be used, and which security integrations are
available to users. By default, all client types, authentication methods, and security integrations can be used
to connect to Snowflake. The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs..

For example, the following commands create a custom `policy_admin` role and an authentication policy that allows
authentication using Snowsight. The user must authenticate with SAML or a password.

> **Note:**
>
> To run this example, you must replace `<username>` in the GRANT ROLE command with your login username.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE OR REPLACE DATABASE my_database;
USE DATABASE my_database;

CREATE OR REPLACE SCHEMA my_schema;
USE SCHEMA my_schema;

CREATE ROLE policy_admin;

GRANT USAGE ON DATABASE my_database TO ROLE policy_admin;
GRANT USAGE ON SCHEMA my_database.my_schema TO ROLE policy_admin;
GRANT CREATE AUTHENTICATION POLICY ON SCHEMA my_database.my_schema TO ROLE policy_admin;
GRANT APPLY AUTHENTICATION POLICY ON ACCOUNT TO ROLE policy_admin;

GRANT ROLE policy_admin TO USER <username>;
USE ROLE policy_admin;

CREATE AUTHENTICATION POLICY my_example_authentication_policy
  CLIENT_TYPES = ('SNOWFLAKE_UI')
  AUTHENTICATION_METHODS = ('SAML', 'PASSWORD');
```

For detailed examples, see Example login configurations.

## Setting an authentication policy on an account or user

When you set an authentication policy on an account or user, the restrictions specified in the authentication policy apply to the account or
user. You can use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) or [ALTER USER](../sql-reference/sql/alter-user.md) commands to set an authentication
policy on an account or user.

In a Snowsight worksheet, use either of the following commands to set an authentication policy on an account or
user:

```sqlexample
ALTER ACCOUNT SET AUTHENTICATION POLICY my_example_authentication_policy;
```

```sqlexample
ALTER USER example_user SET AUTHENTICATION POLICY my_example_authentication_policy;
```

You can also set an authentication policy on all users of a [specific type](admin-user-management.md). For example, to set an
authentication policy on all users of type SERVICE within the account, but not on users of type PERSON, run the following command:

```sqlexample
ALTER ACCOUNT SET AUTHENTICATION POLICY my_example_authentication_policy
  FOR ALL SERVICE USERS;
```

Only a security administrator (a user with the SECURITYADMIN role) or users with a role that has the APPLY
AUTHENTICATION POLICY privilege can set authentication policies on accounts or users. To grant this privilege to a role so users can set
an authentication policy on an account or user, execute one of the following commands:

```sqlexample
GRANT APPLY AUTHENTICATION POLICY ON ACCOUNT TO ROLE my_policy_admin;
```

```sqlexample
GRANT APPLY AUTHENTICATION POLICY ON USER example_user TO ROLE my_policy_admin;
```

For detailed examples, see Example login configurations.

## Hardening user or account authentication using MFA

To improve the security of user logins, you can create an authentication policy that requires users to
[enroll in MFA](ui-snowsight-profile.md), and then apply the authentication policy to individual users or the account. After users
enroll in MFA, the authentication policy requires users to authenticate with MFA.

> **Note:**
>
> Snowflake is [deprecating single-factor password logins](security-mfa-rollout.md). When the rollout is complete, all users
> who authenticate with a password must enroll in MFA.

Run the following command if you want to create an authentication policy that requires **password users** to authenticate with MFA when using any Snowflake client, not just Snowsight. Single sign-on (SSO) users won’t be required to use MFA.

```sqlexample
CREATE AUTHENTICATION POLICY require_mfa_authentication_policy
  MFA_ENROLLMENT = 'REQUIRED'
  MFA_POLICY=  (
    ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION = 'NONE'
  );
```

Run the following command if you want to create an authentication policy that requires **password and single sign-on users** to authenticate with MFA.

```sqlexample
CREATE AUTHENTICATION POLICY require_mfa_authentication_policy
  MFA_ENROLLMENT = 'REQUIRED'
  MFA_POLICY=  (
    ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION = 'ALL'
  );
```

To set this authentication policy for all users in an account, execute the following SQL statement:

```sqlexample
ALTER ACCOUNT SET AUTHENTICATION POLICY require_mfa_authentication_policy;
```

> **Note:**
>
> If you set the `MFA_ENROLLMENT` parameter, then the `CLIENT_TYPES` parameter must include
> `SNOWFLAKE_UI`, because Snowsight is the only place users can
> [enroll in multi-factor authentication (MFA)](ui-snowsight-profile.md).

## Tracking authentication policy usage

Use the Information Schema table function [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) to return a row for each user that is assigned
to the specified authentication policy and a row for the authentication policy assigned to the Snowflake account.

The following syntax is supported for authentication policies:

```sqlexample
POLICY_REFERENCES( POLICY_NAME => '<authentication_policy_name>' )
```

```sqlexample
POLICY_REFERENCES( REF_ENTITY_DOMAIN => 'USER', REF_ENTITY_NAME => '<username>')
```

```sqlexample
POLICY_REFERENCES( REF_ENTITY_DOMAIN => 'ACCOUNT', REF_ENTITY_NAME => '<accountname>')
```

Where `authentication_policy_name` is the fully qualified name of the authentication policy.

For example, execute the following query to return a row for each user that is assigned the authentication policy named
`authentication_policy_prod_1`, which is stored in the database named `my_db` and the schema named `my_schema`:

```sqlexample
SELECT *
FROM TABLE(
  my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
  POLICY_NAME => 'my_db.my_schema.authentication_policy_prod_1'
  )
);
```

## Preventing a lockout

In situations where the authentication policy governing an account is strict, you can create a non-restrictive authentication policy for
an administrator to use as a recovery option in case of a lockout caused by a security integration. For example, you can include the
`PASSWORD` authentication method for the administrator only. The user-level authentication policy overrides the more restrictive
account-level policy.

```sqlexample
CREATE AUTHENTICATION POLICY admin_authentication_policy
  AUTHENTICATION_METHODS = ('SAML', 'PASSWORD')
  CLIENT_TYPES = ('SNOWFLAKE_UI', 'SNOWFLAKE_CLI', SNOWSQL', 'DRIVERS')
  SECURITY_INTEGRATIONS = ('EXAMPLE_OKTA_INTEGRATION');
```

You can then assign this policy to an administrator:

```sqlexample
ALTER USER <administrator_name> SET AUTHENTICATION POLICY admin_authentication_policy
```

## Replication of authentication policies

You can replicate authentication policies using failover and replication groups. For details, see
[Replication and security policies](account-replication-considerations.md).

## Example login configurations

This section provides examples of how you can use and combine authentication policies and SAML2 security integrations to control login flow
and security.

### Restricting user access to Snowflake by client type

The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs..

Create an authentication policy named `restrict_client_type_policy` that only allows access through Snowsight:

```sqlexample
CREATE AUTHENTICATION POLICY restrict_client_type_policy
  CLIENT_TYPES = ('SNOWFLAKE_UI')
  COMMENT = 'Only allows access through the web interface';
```

Set the authentication policy on a user:

```sqlexample
ALTER USER example_user SET AUTHENTICATION POLICY restrict_client_type_policy;
```

### Allow authentication from multiple identity providers on an account

Create a SAML2 security integration that allows users to log in through SAML using Okta as an IdP:

```sqlexample
CREATE SECURITY INTEGRATION example_okta_integration
  TYPE = SAML2
  SAML2_SSO_URL = 'https://okta.example.com';
  ...
```

Create a security integration that allows users to log in through SAML using Microsoft Entra ID as an IdP:

```sqlexample
CREATE SECURITY INTEGRATION example_entra_integration
  TYPE = SAML2
  SAML2_SSO_URL = 'https://entra-example_acme.com';
  ...
```

Create an authentication policy associated with the `example_okta_integration` and `example_entra_integration` integrations:

```sqlexample
CREATE AUTHENTICATION POLICY multiple_idps_authentication_policy
  AUTHENTICATION_METHODS = ('SAML')
  SECURITY_INTEGRATIONS = ('EXAMPLE_OKTA_INTEGRATION', 'EXAMPLE_ENTRA_INTEGRATION');
```

Set the authentication policy on an account:

```sqlexample
ALTER ACCOUNT SET AUTHENTICATION POLICY multiple_idps_authentication_policy;
```

## Privileges and commands

### Authentication Policy Privilege Reference

Snowflake supports the following authentication policy privileges to determine whether users can create, set, and own authentication
policies.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Object | Usage |
| --- | --- | --- |
| CREATE | Schema | Enables creating a new authentication policy in a schema. |
| APPLY AUTHENTICATION POLICY | Account | Enables applying an authentication policy at the account or user level. |
| OWNERSHIP | Authentication policy | Grants full control over the authentication policy. Required to alter most properties of an authentication policy. |

### Authentication policy DDL reference

For details about authentication policy privileges and commands, see the following reference documentation:

| Command | Privilege | Description |
| --- | --- | --- |
| [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) | CREATE AUTHENTICATION POLICY on SCHEMA | Creates a new authentication policy. |
| [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md) | OWNERSHIP on AUTHENTICATION POLICY | Modifies an existing authentication policy. |
| [DROP AUTHENTICATION POLICY](../sql-reference/sql/drop-authentication-policy.md) | OWNERSHIP on AUTHENTICATION POLICY | Removes an existing authentication policy from the system. |
| [DESCRIBE AUTHENTICATION POLICY](../sql-reference/sql/desc-authentication-policy.md) | OWNERSHIP on AUTHENTICATION POLICY | Describes the properties of an existing authentication policy. |
| [SHOW AUTHENTICATION POLICIES](../sql-reference/sql/show-authentication-policies.md) | OWNERSHIP on AUTHENTICATION POLICY or USAGE on SCHEMA | Lists all of the authentication policies in the system. |

---
title: Automate continuous data loading with cloud messaging
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto.md
section: User Guide
---

# Automate continuous data loading with cloud messaging

Automated data loads leverage event notifications for cloud storage to inform Snowpipe of the arrival of new data files to load. Snowpipe
copies the files into a queue, from which they are loaded into the target table in a continuous, serverless fashion based on parameters
defined in a specified pipe object.

> **Note:**
>
> * Automated Snowpipe uses event notifications to determine when new files arrive in monitored external stages in cloud storage and are ready to load. Notifications identify the cloud storage event and include a list of the file names. They do not include the actual data in the files.
> * When a pipe is paused, event messages received for the pipe enter a limited retention period. The period is 14 days by default. If a
>   pipe is paused for longer than 14 days, it is considered stale.
>
>   Event notifications received while a pipe is paused are retained for only a limited period of time (14 days). As each notification
>   reaches the end of this period, Snowflake schedules it to be dropped from the internal metadata. If the pipe is later resumed,
>   Snowpipe may process notifications older than 14 days on a best effort basis. Snowflake cannot guarantee that these older
>   notifications are processed.
>
>   For information about resuming stale pipes, see [Managing Snowpipe](data-load-snowpipe-manage.md).

The following table indicates which cloud storage services are supported for automatically loading data in external stages into your Snowflake account using cloud storage event notifications, based on the [cloud platform](intro-cloud-platforms.md) that hosts your account:

| Snowflake Account Host | Amazon S3 | Google Cloud Storage | Microsoft Azure Blob storage | Microsoft Data Lake Storage Gen2 | Microsoft Azure General-purpose v2 |
| --- | --- | --- | --- | --- | --- |
| Amazon Web Services | ✔ | ✔ | ✔ | ✔ | ✔ |
| Google Cloud | ✔ | ✔ | ✔ | ✔ | ✔ |
| Microsoft Azure | ✔ | ✔ | ✔ | ✔ | ✔ |

> **Important:**
>
> Snowflake recommends that you enable cloud event filtering for Snowpipe to reduce costs, event noise, and latency. For more information about configuring event filtering for each cloud provider, see the following pages:
>
> * [Configuring event notifications using object key name filtering - Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-how-to-filtering.html#notification-how-to-filtering-examples-invalid)
> * [Understand event filtering for Event Grid subscriptions - Azure](https://docs.microsoft.com/en-us/azure/event-grid/event-filtering)
> * [Filtering messages - Google Pub/Sub](https://cloud.google.com/pubsub/docs/filtering)

**Next Topics:**

* [Automating Snowpipe for Amazon S3](data-load-snowpipe-auto-s3.md)
* [Automating Snowpipe for Google Cloud Storage](data-load-snowpipe-auto-gcs.md)
* [Automating Snowpipe for Microsoft Azure Blob Storage](data-load-snowpipe-auto-azure.md)

---
title: Automated directory table metadata refreshes
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-auto.md
section: User Guide
---

# Automated directory table metadata refreshes

You can automatically refresh the metadata for a directory table on an internal or external stage.

The refresh operation synchronizes the metadata with the latest set of associated files in storage, and occurs
in response to the following types of changes:

> * New files in the path are added to the table metadata.
> * Files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

## Internal stages

Automatically refreshing the directory table on an internal stage
synchronizes the metadata with the latest set of associated files in the internal named stage and path when the following occur:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

### Create an internal named stage with a directory table enabled

Create an internal named stage with a directory table enabled by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads
your staged data files into the directory table metadata.

```sqlexample
CREATE STAGE my_int_stage
  DIRECTORY = (
    ENABLE = TRUE
    AUTO_REFRESH = TRUE
  );
```

## External stages

You can automatically refresh the metadata for a directory table by using the following event notification services:

* **Amazon S3:** [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/)
* **Google Cloud Storage:** [Google Cloud Pub/Sub](https://cloud.google.com/storage/docs/reporting-changes)
* **Microsoft Azure:** [Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/)

To set up automated refreshes, see the topic for the cloud storage service where your files are located:

* [Refresh directory tables automatically for Amazon S3](data-load-dirtables-auto-s3.md)
* [Refresh directory tables automatically for Google Cloud Storage](data-load-dirtables-auto-gcs.md)
* [Refresh directory tables automatically for Azure Blob Storage](data-load-dirtables-auto-azure.md)

### Cross-cloud support

Snowflake supports cross-cloud, cross-region automated directory table refreshes for external stages.

The following table shows the cross-cloud options that Snowflake supports for automated directory table refreshes,
based on the [cloud platform](intro-cloud-platforms.md) that hosts your Snowflake account.

|  | Amazon S3 | Google Cloud Storage | Microsoft Azure Blob storage | Microsoft Data Lake Storage Gen2 | Microsoft Azure General-purpose v2 |
| --- | --- | --- | --- | --- | --- |
| Accounts hosted on AWS | ✔ | ✔ | ✔ | ✔ | ✔ |
| Accounts hosted on GCP | ✔ | ✔ | ✔ | ✔ | ✔ |
| Accounts hosted on Azure | ✔ | ✔ | ✔ | ✔ | ✔ |

## Considerations

* Automated refreshes are event-based and provide better performance that manual refreshes for large or fast-growing stages.
* Automated refreshes for internal stages is currently available for accounts hosted on AWS. Snowflake
  doesn’t support refreshing the directory table metadata on an internal stage when your account is hosted on Google Cloud or Azure.

**Next Topics:**

* [Refresh directory tables automatically for Amazon S3](data-load-dirtables-auto-s3.md)
* [Refresh directory tables automatically for Google Cloud Storage](data-load-dirtables-auto-gcs.md)
* [Refresh directory tables automatically for Azure Blob Storage](data-load-dirtables-auto-azure.md)

---
title: Automatic Clustering
source: https://docs.snowflake.com/en/user-guide/tables-auto-reclustering.md
section: User Guide
---

# Automatic Clustering

Automatic Clustering is the Snowflake service that seamlessly and continually manages all reclustering, as needed, of clustered tables.

Note that, after a clustered table is defined, reclustering does not necessarily start immediately. Snowflake only reclusters a clustered table if it will benefit from the
operation.

> **Note:**
>
> If manual reclustering is still available in your account, Automatic Clustering might not be enabled yet for your account. For more details, see [Manual Reclustering — Deprecated](tables-clustering-manual.md).

## Benefits of Automatic Clustering

### Ease-of-maintenance

Automatic Clustering eliminates the need for performing any of the following tasks:

* Monitoring the state of clustered tables.

  Instead, as DML is performed on these tables, Snowflake monitors and evaluates the tables to determine whether they would benefit from reclustering, and automatically
  reclusters them, as needed.
* Designating warehouses in your account to use for reclustering.

  Snowflake performs automatic reclustering in the background, and you do not need to specify a warehouse to use.

All you need to do is define a clustering key for each table (if appropriate) and Snowflake manages all future maintenance.

### Full control

You can suspend and resume Automatic Clustering for a clustered table at any time using ALTER TABLE … SUSPEND / RESUME RECLUSTER. While Automatic Clustering is suspended
for a table, the table is never automatically reclustered, regardless of its clustering state and, therefore, does not incur any related credit charges.

You can also drop the clustering key on a clustered table at any time, which prevents all future reclustering on the table.

### Non-blocking DML

Automatic Clustering is transparent and does not block DML statements issued against tables while they are being reclustered.

### Optimal efficiency

With Automatic Clustering, Snowflake internally manages the state of clustered tables, as well as the resources (servers, memory, etc.) used for all automated clustering
operations. This allows Snowflake to dynamically allocate resources as needed, resulting in the most efficient and effective reclustering.

Also, Automatic Clustering does not perform any unnecessary reclustering. Reclustering is triggered only if/when the table would benefit from the operation.

## Enabling Automatic Clustering for a table

In most cases, no tasks are required to enable Automatic Clustering for a table. You simply define a
[clustering key](tables-clustering-keys.md) for the table.

However, the rule does not apply to tables created by cloning ([CREATE TABLE … CLONE …](../sql-reference/sql/create-clone.md))
from a source table that has clustering keys. The new table starts with Automatic Clustering suspended – even if Automatic
Clustering for the source table is not suspended. (This is true whether the `CLONE` command cloned the table, the schema
containing the table, or the database containing the table.)

> **Tip:**
>
> Before you define a clustering key for a table, consider the following conditions, which may cause reclustering activity (and corresponding credit charges):
>
> * The table is not optimally-clustered. For more details, see [Micro-partitions & Data Clustering](tables-clustering-micropartitions.md).
> * The clustering key on the table has changed.
>
> As such, we recommend starting with one or two selected tables and assessing the impact of Automatic Clustering on these tables. Once you are comfortable/familiar with how
> Automatic Clustering performs reclustering, you can then define clustering keys for your other tables.

For information about choosing optimal clustering keys, see [Strategies for Selecting Clustering Keys](tables-clustering-keys.md).

To add clustering to a table, you must also have USAGE or OWNERSHIP privileges on the schema and database that
contain the table.

## Viewing the Automatic Clustering status for a table

You can use SQL to view whether Automatic Clustering is enabled for a table:

* [SHOW TABLES](../sql-reference/sql/show-tables.md) command.
* [TABLES](../sql-reference/info-schema/tables.md) view (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
* [TABLES](../sql-reference/account-usage/tables.md) view (in the [Account Usage](../sql-reference/account-usage.md) shared database).

The `AUTO_CLUSTERING_ON` column in the output displays the Automatic Clustering status for each table, which can be used to determine whether to suspend or resume Automatic
Clustering for a given table.

In addition, the `CLUSTER_BY` column (SHOW TABLES) or `CLUSTERING_KEY` column (TABLES view) displays the column(s) defined as the clustering key(s) for each table.

## Suspending Automatic Clustering for a table

To suspend Automatic Clustering for a table, use the [ALTER TABLE](../sql-reference/sql/alter-table.md) command with a `SUSPEND RECLUSTER` clause. For example:

> ```sqlexample
> ALTER TABLE t1 SUSPEND RECLUSTER;
>
> SHOW TABLES LIKE 't1';
>
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> |           created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |  owner   | retention_time | automatic_clustering |
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> | Thu, 12 Apr 2018 13:29:01 -0700 | T1   | TESTDB        | MY_SCHEMA   | TABLE |         | LINEAR(C1) | 0    | 0     | SYSADMIN | 1              | OFF                  |
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> ```

> **Note:**
>
> Changing the clustering key of a table resumes automatic clustering, which can result in credit consumption by serverless resources.
> Including the word LINEAR in the ALTER TABLE … CLUSTER BY statement is considered a change to the clustering key even if the column
> doesn’t change.

## Resuming Automatic Clustering for a table

To resume Automatic Clustering for a clustered table, use the [ALTER TABLE](../sql-reference/sql/alter-table.md) command with a `RESUME RECLUSTER` clause. For example:

> ```sqlexample
> ALTER TABLE t1 RESUME RECLUSTER;
>
> SHOW TABLES LIKE 't1';
>
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> |           created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |  owner   | retention_time | automatic_clustering |
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> | Thu, 12 Apr 2018 13:29:01 -0700 | T1   | TESTDB        | MY_SCHEMA   | TABLE |         | LINEAR(C1) | 0    | 0     | SYSADMIN | 1              | ON                   |
> +---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> ```

> **Tip:**
>
> Before you resume Automatic Clustering on a clustered table, consider the following conditions, which may cause reclustering activity (and corresponding credit charges):
>
> * The table is not optimally-clustered (e.g. significant DML has been performed on the table since it was last reclustered).
> * The clustering key on the table has changed.
>
> For more details, see [Micro-partitions & Data Clustering](tables-clustering-micropartitions.md) and [Clustering Keys & Clustered Tables](tables-clustering-keys.md).

## Automatic Clustering costs

The cost of enabling Automatic Clustering can be broken down into compute costs and storage costs.

Compute costs
:   Snowflake uses [serverless compute resources](cost-understanding-compute.md) to cluster a table for the first time. It also uses compute resources to maintain that table in a well-clustered state as new data is added to the table. The more changes to a table, the higher the
    maintenance costs.

Storage Costs
:   Because Automatic Clustering reorganizes existing data rather than creating additional storage, in many cases there are no additional
    storage costs. However, reclustering can incur additional storage costs if it increases the size of
    [Fail-safe](data-failsafe.md) storage. For more information, see [Credit and Storage Impact of Reclustering](tables-clustering-keys.md).

### Credit usage and warehouses for Automatic Clustering

Automatic Clustering consumes Snowflake credits, but does not require you to provide a virtual warehouse. Instead, Snowflake internally
manages and achieves efficient resource utilization for reclustering the tables.

Your account is billed only for the actual credits consumed by automatic clustering operations on your clustered tables.

> **Important:**
>
> After enabling or resuming Automatic Clustering on a clustered table, if it has been a while since the table was reclustered, you may
> experience reclustering activity (and corresponding credit charges) as Snowflake brings the table to an optimally-clustered state. Once
> the table is optimally-clustered, the reclustering activity will drop off.
>
> Likewise, defining a clustering key on an existing table or changing the clustering key on a clustered table may trigger reclustering and
> credit charges.
>
> To prevent any unexpected credit charges, we recommend starting with one or two selected tables and observing the credit charges
> associated with keeping the tables well-clustered as DML is performed. This will help you establish a baseline for the number of credits
> consumed by reclustering activity.

### Estimating Automatic Clustering cost

You can call the [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](../sql-reference/functions/system_estimate_automatic_clustering_costs.md) function to help estimate the compute cost of
enabling Automatic Clustering for a table and maintaining the table in a well-clustered state. You can also call the function to help predict
the compute cost of changing the cluster key of a table.

> **Important:**
>
> The cost estimates returned by the SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS function are best efforts. The actual realized costs can vary by up to 100% (or, in rare cases, several times) from the estimated costs.

### Viewing Automatic Clustering cost

Automatic clustering consumes credits as it uses [serverless compute resources](cost-understanding-compute.md) for the
automated background maintenance of each clustered table, including initial clustering and reclustering as needed. To learn how many
credits per compute-hour are consumed by automatic clustering, refer to the “Serverless Feature Credit Table” in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

Users with the proper privileges can view the cost of automatic clustering using [Snowsight](ui-snowsight-gs.md) or SQL:

> Snowsight:
> :   In the navigation menu, select Admin » Cost management.
>
> SQL:
> :   Query either of the following:
>
>     * [AUTOMATIC_CLUSTERING_HISTORY](../sql-reference/functions/automatic_clustering_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
>     * [AUTOMATIC_CLUSTERING_HISTORY view](../sql-reference/account-usage/automatic_clustering_history.md) (in [Account Usage](../sql-reference/account-usage.md)).
>
>       The following queries can be executed against the AUTOMATIC_CLUSTERING_HISTORY view:
>
>       **Query: Automatic Clustering cost history (by day, by object)**
>
>       This query provides a list of tables with Automatic Clustering and the volume of credits consumed via the service over the last 30 days,
>       broken out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional investigation.
>
>       ```sqlexample
>       SELECT TO_DATE(start_time) AS date,
>         database_name,
>         schema_name,
>         table_name,
>         SUM(credits_used) AS credits_used
>       FROM snowflake.account_usage.automatic_clustering_history
>       WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
>       GROUP BY 1,2,3,4
>       ORDER BY 5 DESC;
>       ```
>
>       **Query: Automatic Clustering History & m-day average**
>
>       This query shows the average daily credits consumed by Automatic Clustering grouped by week over the last year. It can help identify
>       anomalies in daily averages over the year so you can investigate spikes or unexpected changes in consumption.
>
>       ```sqlexample
>       WITH credits_by_day AS (
>         SELECT TO_DATE(start_time) AS date,
>           SUM(credits_used) AS credits_used
>         FROM snowflake.account_usage.automatic_clustering_history
>         WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
>         GROUP BY 1
>         ORDER BY 2 DESC
>       )
>
>       SELECT DATE_TRUNC('week',date),
>             AVG(credits_used) AS avg_daily_credits
>       FROM credits_by_day
>       GROUP BY 1
>       ORDER BY 1;
>       ```

> **Note:**
>
> [Resource monitors](resource-monitors.md) provide control over virtual warehouse credit usage; however, you cannot use
> them to control credit usage for the Snowflake-provided warehouses, including the  AUTOMATIC_CLUSTERING
> warehouse.

---
title: Automatic tag propagation with user-defined tags
source: https://docs.snowflake.com/en/user-guide/object-tagging/propagation.md
section: User Guide
---

# Automatic tag propagation with user-defined tags

Tag propagation automatically assigns an [object tag](introduction.md) to target objects if it is applied to the
source object. For example, you can define tags on a source object, such as a table and its columns, and these tags are
automatically propagated to a target object, such as a view or another table created from the source object.

The advantages of automatic tag propagation include the following:

* Streamlining tag management across objects, particularly when tags are applied to source objects or columns for ease of discovery and data
  protection.
* Ensuring that any policies associated with the tags are automatically applied to the target objects.

Only the tag owner with the account-level APPLY TAG privilege can implement automatic tag propagation.

## Types of propagation

You can choose to propagate a tag when there is an object dependency,
data movement, or both.

### Tag propagation for object dependencies

When tag propagation is configured for object dependencies, a tag is propagated from a source object to all of the target objects that are
based on it. For example, if you set up propagation for a tag `data_sensitivity` on a table `t1`, and then create two views based
on `t1`, the `data_sensitivity` tag is propagated to both views.

Creating a view, secure view, materialized view, or dynamic table from a source object is considered an object dependency.

#### Continuous propagation for object dependencies

When a tag is configured for object dependencies, Snowflake continuously updates the target objects when any of the following occurs:

* The tag is added to a source object or column.
* The value of a tag is updated.
* A tag is removed from a source object or column. In this case, Snowflake removes the tag from the target object or column.

For example, suppose the tag `data_sensitivity` was propagated from table `t1` to view `v2` after executing a CREATE VIEW statement.
When you change the value of `data_sensitivity` on `t1`, the value of the tag on `v2` is also updated.

Automatic tag propagation relies on the existence of the source object. If the source object with tags is dropped, the tags won’t be
propagated to the target object. Because a view depends on its sources, like a base table or other views, tags are propagated only if
the source object exists.

### Tag propagation for data movement

When tag propagation is configured for data movement, a tag is propagated when you move data from a source object to another
object by doing any of the following:

* Executing a [CREATE TABLE … AS SELECT (CTAS)](../../sql-reference/sql/create-table.md) statement to create a table.
* Executing a CREATE DYNAMIC TABLE statement.
* Executing a Data Manipulation Language (DML) command. Tag propagation occurs for the following DML commands:

  + INSERT
  + MERGE
  + UPDATE
  + COPY INTO

[CREATE TABLE … CLONE](../../sql-reference/sql/create-table.md) and [CREATE TABLE … LIKE](../../sql-reference/sql/create-table.md) do not rely
on the PROPAGATE tag property for tag propagation. When you execute these statements, tags from the source are always assigned to the
target object.

> **Note:**
>
> Unlike tag propagation for object dependencies, tags applied to target objects when there is data movement are *not* continuously updated
> as tags change on the source object.

## Setting up tag propagation

To enable automatic tag propagation, use the CREATE TAG or ALTER TAG command to set the PROPAGATE property. You can configure the
property so tags are propagated for object dependencies, data movement, or both.

For instructions on setting up tag propagation, see [Define a tag that will automatically propagate](work.md).

## Tag propagation conflicts

Conflicts can occur when a tag is propagated from different source objects to the same target object. If the tag has a different value in
each of the source objects, there is a conflict when that tag is propagated from the source objects to the target object.

> **Note:**
>
> If the target object has a tag that was manually applied, the existing tag value takes precedence over a propagated value so there is no
> conflict.
>
> If the target object inherits a value from an object higher in the Snowflake hierarchy of objects, the propagated value takes precedence
> and there is no conflict.

The ON_CONFLICT property of a tag determines what happens when there is a conflict. You have three options for handling tag propagation
conflicts:

* **Option 1:** Replace the value of the tag with the string `CONFLICT`. This is the default if you don’t set the ON_CONFLICT parameter
  of the tag.
* **Option 2:** Replace the value of the tag with a user-defined string. You set the ON_CONFLICT parameter to this string.

  For example, if you want the value of a tag to be `HIGHLY CONFIDENTIAL` when there is a conflict in values, use the following SQL to
  create the tag:

  ```sqlexample
  CREATE TAG data_sensitivity
    PROPAGATE = ON_DEPENDENCY_AND_DATA_MOVEMENT
    ON_CONFLICT = 'HIGHLY CONFIDENTIAL';
  ```
* **Option 3:** Use the order of the values in the tag’s ALLOWED_VALUES parameter to determine which value to use. Set
  `ON_CONFLICT = ALLOWED_VALUES_SEQUENCE` to implement this strategy.

  For example, suppose you created the tag with the following SQL statement:

  ```sqlexample
  CREATE TAG data_sensitivity
    ALLOWED_VALUES 'confidential', 'internal', 'public'
    PROPAGATE = ON_DEPENDENCY
    ON_CONFLICT = ALLOWED_VALUES_SEQUENCE;
  ```

  If there is a conflict for this tag between values `internal` and `public`, the value of the `data_sensitivity` tag will be
  `internal` because it comes before `public` in the list of allowed values.

  Be aware that if you choose to use `ON_CONFLICT = ALLOWED_VALUES_SEQUENCE`, changing the ALLOWED_VALUES parameter affects how conflicts
  are resolved. For example, if you change the order of the values in the allowed list, then future conflicts could result in a different
  value being assigned to the tag.

To track conflicts associated with tag propagation, see Using an event table to monitor tag propagation.

## Using an event table to monitor tag propagation

You can use an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) to collect telemetry data related to tag
propagation. After Snowflake starts collecting the data in the event table, you can query the table, create a stream to track changes, or
set alerts to send notifications when certain events occur.

If you want to collect telemetry data for tag propagation, you must enable the [ENABLE_TAG_PROPAGATION_EVENT_LOGGING](../../sql-reference/parameters.md) account
parameter. To start collecting data, run the following command:

```sqlexample
ALTER ACCOUNT SET ENABLE_TAG_PROPAGATION_EVENT_LOGGING = TRUE;
```

If you have an event table set for the tag’s database, then events are logged to that table. Otherwise, events are logged to the default
event table.

### Understanding the events

The following table describes the values in the event table that correspond to tag propagation so you can focus on the appropriate
events. For detailed information about the structure of an event table, see [Event table columns](../../developer-guide/logging-tracing/event-table-columns.md).

| Event table column | Column field | Field value | Description |
| --- | --- | --- | --- |
| `scope` | `name` | `snow.automatic_tag_propagation` | Indicates that the record relates to automatic tag propagation. |
| `record_attributes` | `tag_name` | `tag_name` | Name of the tag that had an event during propagation. |
| `record_attributes` | `event_type` | `CONFLICT` | Indicates that a conflict occurred when propagating a tag. |
| `record_attributes` | `event_type` | `TAG_PROPAGATION_LIMIT_EXCEEDED` | Indicates that Snowflake didn’t propagate a tag because there were more than 10,000 target objects. |
| `value` | `conflict_values` | [`tag_value`, `tag_value`] | Array of the tag values that were conflicting. |
| `value` | `resolution_type` | `DEFAULT`, `STRING_OVERRIDE`, or `ALLOWED_VALUES_OVERRIDE` | Indicates the action that Snowflake took when a conflict occurred. To understand why the conflict was resolved in a particular way, see Tag propagation conflicts. |
| `value` | `resolved_values` | `tag_value` | Final value of the tag after Snowflake resolved a conflict. |

Use the following examples to better understand how to identify tag propagation events in an event table.

Query: Find all events related to the propagation of tag `TAG1`
:   ```sqlexample
    SELECT
      TIMESTAMP as time,
      RECORD_ATTRIBUTES['event_type'] as event_type,
      VALUE as event_details
    FROM tagging_db.tagging_schema.my_event_table
    WHERE
      SCOPE['name'] = 'snow.automatic_tag_propagation'
      AND RECORD_ATTRIBUTES['tag_name'] = 'TAGGING_DB.TAGGING_SCHEMA.TAG1';
    ```

Query: Find all tags that had a conflict when propagated
:   ```sqlexample
    SELECT
      DISTINCT RECORD_ATTRIBUTES['tag_name'] as tags,
      VALUE['conflict_values'] as conflicting_tag_values,
      VALUE['resolution_type'] as resolution_type,
      VALUE['resolved_value'] as resolved_value,
    FROM tagging_db.tagging_schema.my_event_table
    WHERE
      SCOPE['name'] = 'snow.automatic_tag_propagation'
      AND RECORD_ATTRIBUTES['event_type'] = 'CONFLICT';
    ```

Query: Find entities that had conflicts when the tag `TAG1` was propagated
:   ```sqlexample
    SELECT
      TIMESTAMP as time,
      RECORD_ATTRIBUTES['entity_name'] as entity_name,
      RECORD_ATTRIBUTES['entity_domain'] as entity_domain,
    FROM tagging_db.tagging_schema.my_event_table
    WHERE
      SCOPE['name'] = 'snow.automatic_tag_propagation'
      AND RECORD_ATTRIBUTES['tag_name'] = 'TAGGING_DB.TAGGING_SCHEMA.TAG1'
      AND RECORD_ATTRIBUTES['event_type'] = 'CONFLICT';
    ```

### Severity of events

> Tag propagation events are logged only if the [LOG_EVENT_LEVEL parameter](../../sql-reference/parameters.md) governing the table is configured to show events
> of that severity level. Use the following table to determine the severity level of tag propagation events.

| Event type | Resolution type | Severity |
| --- | --- | --- |
| `CONFLICT` | `default` | WARN |
|  | `string_override` | INFO |
|  | `allowed_values_override` | INFO |
| `TAG_PROPAGATION_LIMIT_EXCEEDED` | n/a | ERROR |

## Supported objects

Tag propagation from source to target is supported for the following object types:

* Columns
* The following types of tables:

  + Tables
  + Dynamic tables - Creating a dynamic table is considered both an object dependency and data movement for the purposes of tag propagation.
  + External tables
  + Iceberg tables
  + Temp/transient tables
* The following types of views:

  + Views
  + Secure views
  + Materialized views

## Limitations and considerations

* System tags are not propagated.
* [Inherited tags](inheritance.md) are not propagated.
* Tags are not propagated from a share to local objects.
* The number of tags on an object cannot exceed the [standard limit](introduction.md).
* In a single transaction that triggers tag propagation, a tag can only be propagated to up to 10,000 downstream objects. If there are
  more than 10,000 objects in the dependency chain, for instance, a table with more than 10,000 views referencing it, then propagation
  fails. You can use the event table to find out if propagation failed for this reason.
* With tag propagation for object dependencies, a tag can be applied to both the source table and target views. If the tag is associated with
  a masking policy, there could be consequences associated with duplicate execution of the policy.

---
title: Automatically refresh Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-auto-refresh.md
section: User Guide
---

# Automatically refresh Apache Iceberg™ tables

Configure automated metadata refreshes for new or existing externally managed
[Apache Iceberg™ tables](tables-iceberg.md). With automated refreshes, Snowflake polls your external Iceberg catalog in
a continuous and serverless fashion to synchronize the metadata with the most recent remote changes.

Automated refresh for Iceberg tables works differently from automated refresh for directory tables or external tables because it
doesn’t rely on cloud provider notifications. Instead, you configure the feature according to the following steps:

1. Set a refresh interval on a catalog integration.
   Snowflake supports automated refresh for the following external Iceberg catalog options:

   * REST catalog that complies with the Apache Iceberg REST OpenAPI specification
   * Snowflake Open Catalog
   * Object storage (Delta Lake only)
   * AWS Glue
2. Create one or more Iceberg tables that use the catalog integration.
3. Control automated refresh for each table with the AUTO_REFRESH parameter.

This approach lets you centrally manage refresh settings through the catalog integration while you control individual tables as needed.

## Set the refresh interval on a catalog integration

When you run the [CREATE CATALOG INTEGRATION](../sql-reference/sql/create-catalog-integration.md) command,
you can specify a value for the `REFRESH_INTERVAL_SECONDS` parameter. Otherwise, the default refresh interval
is 30 seconds. Snowflake only polls the external catalog if there are Iceberg tables defined with the catalog integration.

The following example creates a catalog integration for AWS Glue, specifying a refresh interval of 60 seconds:

```sqlexample
CREATE CATALOG INTEGRATION auto_refresh_catalog_integration
  CATALOG_SOURCE = GLUE
  CATALOG_NAMESPACE = 'my_catalog_namespace'
  TABLE_FORMAT = ICEBERG
  GLUE_AWS_ROLE_ARN = 'arn:aws:iam::123456789123:role/my-catalog-role'
  GLUE_CATALOG_ID = '123456789123'
  ENABLED = TRUE
  REFRESH_INTERVAL_SECONDS = 60;
```

To update the refresh interval for a catalog integration, use the [ALTER CATALOG INTEGRATION](../sql-reference/sql/alter-catalog-integration.md) command.

For example:

```sqlexample
ALTER CATALOG INTEGRATION auto_refresh_catalog_integration SET REFRESH_INTERVAL_SECONDS = 120;
```

## Create an Iceberg table with automated refresh

Create an Iceberg table by using the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command. To specify that the
table should use automated metadata refreshes, set `AUTO_REFRESH = TRUE`.

The following example creates an Iceberg table that uses AWS Glue as the catalog, specifying
the catalog integration created previously (`auto_refresh_catalog_integration`)
and the [CATALOG_TABLE_NAME](../sql-reference/sql/create-iceberg-table-aws-glue.md) from AWS Glue.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE auto_refresh_iceberg_table
  CATALOG_TABLE_NAME = 'myGlueTable'
  CATALOG = 'auto_refresh_catalog_integration'
  AUTO_REFRESH = TRUE;
```

## Enable or turn off automated refresh

> **Note:**
>
> * If the table uses a catalog integration created before Snowflake version 8.22, you must use the
>   [ALTER CATALOG INTEGRATION](../sql-reference/sql/alter-catalog-integration.md) command to
>   set the `REFRESH_INTERVAL_SECONDS` parameter before you enable automated refresh on the table.
> * Frequently toggling automated refresh on and off for an Iceberg table can slow metadata refreshes for the table.

Use the [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) command to enable or turn off automated refresh for an existing Iceberg table.

For example:

```sqlsyntax
ALTER ICEBERG TABLE my_iceberg_table SET AUTO_REFRESH = FALSE;
```

## Monitoring automated refresh status

### SHOW ICEBERG TABLES

To get the automated refresh status for multiple tables, use the [SHOW ICEBERG TABLES](../sql-reference/sql/show-iceberg-tables.md) command.

```sqlexample
SHOW ICEBERG TABLES;
```

The command output includes a column named `auto_refresh_status`, which displays the same information as
the [SYSTEM$AUTO_REFRESH_STATUS](../sql-reference/functions/system_auto_refresh_status.md) function for each table that you have access privileges on.

### SYSTEM$AUTO_REFRESH_STATUS

To retrieve the automated refresh status for a specific table, call the [SYSTEM$AUTO_REFRESH_STATUS](../sql-reference/functions/system_auto_refresh_status.md) function.

```sqlexample
SELECT SYSTEM$AUTO_REFRESH_STATUS('my_iceberg_table');
```

The function returns details about the pipe that Snowflake uses to automate refreshes for the table, such as the execution state
and size of the snapshot queue.
An execution state of `RUNNING` indicates that automated refresh is running as expected.
For more information, see [SYSTEM$AUTO_REFRESH_STATUS](../sql-reference/functions/system_auto_refresh_status.md).

### ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY

To retrieve metadata and snapshot information about the most recent refresh history for a specific table,
use the [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](../sql-reference/functions/iceberg_table_snapshot_refresh_history.md) function.

```sqlexample
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY(
  TABLE_NAME => 'my_iceberg_table'
));
```

### Monitor automated refresh events

You can configure Snowflake to record an event that provides information about the status of automated refresh for an Iceberg table.
Snowflake records the event in the [event table for your account](../developer-guide/logging-tracing/event-table-setting-up.md).

> **Important:**
>
> To monitor Iceberg auto refresh events, you need an active account-level event table and you need to set [LOG_EVENT_LEVEL](../sql-reference/parameters.md) to DEBUG at either
> the table, database, or schema level. Snowflake records events to your active event table set at the account level, not the database level.

Monitoring automated refresh events can help you gain insight into the following areas:

* **Automated refresh progress**: Track how snapshots move through the automated refresh process.
* **Aggregated statistics**: Review summarized statistics for automated refresh operations.

You can also configure alerts for the following critical conditions:

* Refresh errors
* High refresh latencies

> **Note:**
>
> Logging events for automated refresh incurs costs. For more information, see [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

Snowflake records an event when automated refresh starts, completes, or results in error.

#### Set the severity level to capture events

To capture automated refresh events, you must set the [LOG_EVENT_LEVEL](../sql-reference/parameters.md) parameter at the Iceberg table level or account level.
`LOG_EVENT_LEVEL` determines which log events to capture based on the following values:

* **ERROR**: Events that signal a change requiring human intervention to resolve.
* **WARN**: Events that signal an issue that can be resolved without human intervention.
* **DEBUG**: High-volume events.

> **Note:**
>
> There is no default severity level. To capture events, you must set the severity level at either the
> account level or Iceberg table level.

For example, to capture DEBUG-level automated refresh events for a specific Iceberg table, use the following command:

```sqlexample
ALTER ICEBERG TABLE <my_table_name> SET LOG_EVENT_LEVEL = DEBUG;
```

For more information, see [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).

#### Query your event table for automated refresh events

Before you can query for automated refresh events, you must set up an event table and set the severity level for event capture.

The following example shows how to retrieve Iceberg automated refresh events that are generated during snapshot processing:

```sqlexample
SELECT record_type,
       record:"name" event_name,
       record:"severity_text" log_level,
       resource_attributes:"snow.database.name" database_name,
       resource_attributes:"snow.schema.name" schema_name,
       resource_attributes:"snow.table.name" table_name,
       resource_attributes:"snow.catalog.integration.name" catalog_integration_name,
       record_attributes:"snow.snapshot.id" snapshot_id,
       parse_json(value):metadata_file_location metadata_file_location,
       parse_json(value):snapshot_state snapshot_state
  FROM my_active_event_table
  WHERE record_type='EVENT' AND event_name='iceberg_auto_refresh_snapshot_lifecycle';
```

Output:

```output
+-------------+-----------------------------------------+-----------+---------------+-------------+------------+--------------------------+---------------+------------------------+----------------+
| RECORD_TYPE | EVENT_NAME                              | LOG_LEVEL | DATABASE_NAME | SCHEMA_NAME | TABLE_NAME | CATALOG_INTEGRATION_NAME | SNAPSHOT_ID   | METADATA_FILE_LOCATION | SNAPSHOT_STATE |
+-------------+-----------------------------------------+-----------+---------------+-------------+------------+--------------------------+---------------+------------------------+----------------+
| EVENT       | iceberg_auto_refresh_snapshot_lifecycle | DEBUG     | TESTDB        | TESTSH      | TESTTABLE  | glue_integration         | 4281775564368 | metadata.json          | started        |
| EVENT       | iceberg_auto_refresh_snapshot_lifecycle | DEBUG     | TESTDB        | TESTSH      | TESTTABLE  | glue_integration         | 4281775564368 | metadata.json          | completed      |
+-------------+-----------------------------------------+-----------+---------------+-------------+------------+--------------------------+---------------+------------------------+----------------+
```

#### Query your event table for stale automated refresh events

You can query your event table for tables whose last successful refresh is older than a threshold you define.

1. The following example defines a threshold of 20 minutes:

   ```sqlexample
   SET STALENESS_THRESHOLD_MINUTES = 20;
   ```
2. Query for tables whose last successful refresh is older than the threshold you defined:

   ```sqlexample
   WITH last_successful_refresh AS (
     -- Find the most recent 'completed' event for each table
     SELECT
       resource_attributes:"snow.table.name"::STRING AS table_name,
       MAX(timestamp) AS last_success_timestamp
     FROM
       <my_active_event_table>
     WHERE
       record:"name" = 'iceberg_auto_refresh_snapshot_lifecycle'
       AND parse_json(value):snapshot_state::STRING = 'completed'
     GROUP BY
       table_name
   )

   -- Select tables whose the last successful refresh was longer ago than our threshold
   SELECT
     table_name,
     last_success_timestamp
   FROM
     last_successful_refresh
   WHERE
     last_success_timestamp < DATEADD(minute, -$STALENESS_THRESHOLD_MINUTES, CURRENT_TIMESTAMP())
   ORDER BY
     last_success_timestamp ASC;
   ```

> * Where `my_active_event_table` is your active event table.
>
> Output:
>
> ```output
> +------------+-------------------------+
> | TABLE_NAME | LAST_SUCCESS_TIMESTAMP  |
> +------------+-------------------------+
> | my_table   | 2025-10-10 07:24:30.854 |
> +------------+-------------------------+
> ```

## Error recovery

When an error occurs during the automated refresh process,
Snowflake updates the execution state to one of the following values:

* `STALLED` means that Snowflake is attempting to recover from the error. If recovery succeeds, the automated refresh process
  continues running as expected and the execution state transitions back to the healthy `RUNNING` state.
* `STOPPED` means the automated refresh process encountered an unrecoverable error, and automated refreshes for the table have been
  stopped.

  An unrecoverable error might occur, for example, when Snowflake can’t establish a direct lineage between the target snapshot and the current snapshot.

  To recover from a `STOPPED` state, take the following actions:

  1. Turn off automated refresh on the table.
  2. Perform a manual metadata refresh. For instructions, see [Refresh the table metadata](tables-iceberg-manage.md).
  3. Re-enable automated refresh using an [ALTER ICEBERG TABLE … SET AUTO_REFRESH](../sql-reference/sql/alter-iceberg-table.md) statement.
  4. Verify that automated refresh is in the `RUNNING` state by calling the [SYSTEM$AUTO_REFRESH_STATUS](../sql-reference/functions/system_auto_refresh_status.md) function.
     You can also call the function multiple times to confirm that the number of queued snapshots (`pendingSnapshotCount`) gradually decreases.

## Billing

Snowflake uses Snowpipe to automate refreshes for Iceberg tables, so charges for automated refresh appear
in the same line item on your bill as Snowpipe charges.
Using events to monitor automated refresh
also incurs cost. For more information, see [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

There are no Snowpipe file charges for this feature.

You can estimate charges incurred by examining the Account Usage [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md), which displays
the Iceberg table name in the `pipe_name` column.

For Delta-based Iceberg tables, automated refresh pipes display a NULL pipe name.

For more information about Iceberg table charges, see [Iceberg table billing](tables-iceberg.md).

## Considerations and limitations

Consider the following when you work with Iceberg tables that use automated refresh:

* For catalog integrations created before Snowflake version 8.22 (or 9.2 for Delta-based tables), you must manually set the `REFRESH_INTERVAL_SECONDS` parameter
  before you enable automated refresh on tables that depend on that catalog integration.
  For instructions, see [ALTER CATALOG INTEGRATION … SET AUTO_REFRESH](../sql-reference/sql/alter-catalog-integration.md).
* For [catalog integrations for object storage](tables-iceberg-configure-catalog-integration-object-storage.md), automated refresh is only supported
  for integrations with `TABLE_FORMAT = DELTA`.
* For tables with frequent updates, using a shorter polling interval (`REFRESH_INTERVAL_SECONDS`) can cause performance degradation.
* Automated refresh synchronizes schema changes alongside [DML](../sql-reference/sql-dml.md) operations such as INSERT, UPDATE,
  or DELETE. To apply schema changes made through DDL operations alone, perform a [manual refresh](tables-iceberg-manage.md).

---
title: Automating Snowpipe for Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-s3.md
section: User Guide
---

# Automating Snowpipe for Amazon S3

This topic provides instructions for triggering Snowpipe data loads from external stages on S3 automatically using [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket.

Snowflake recommends that you only send supported events for Snowpipe to reduce costs, event noise, and latency.

## Cloud platform support

Triggering automated Snowpipe data loads using S3 event messages is supported by Snowflake accounts hosted on [all supported cloud platforms](intro-cloud-platforms.md).

## Network traffic

Note to [Virtual Private Snowflake (VPS)](intro-editions.md) and [AWS PrivateLink](admin-security-privatelink.md) customers:

Automating Snowpipe using Amazon SQS notifications works well. However, although AWS cloud storage within a VPC (including VPS) can communicate with its own messaging services (Amazon SQS, Amazon Simple Notification Service), this traffic flows between servers on Amazon’s secure network outside of the VPC; therefore, this traffic is not protected by the VPC.

## Configuring secure access to Cloud Storage

> **Note:**
>
> If you have already configured secure access to the S3 bucket that stores your data files, you can skip this section.

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Amazon S3 bucket referenced in an external (i.e. S3) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an AWS identity and access management (IAM) user ID. An administrator in your organization grants the integration IAM user permissions in the AWS account.

An integration can also list buckets (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires permissions in AWS to create and manage IAM policies and roles. If you are not an AWS administrator, ask your AWS administrator to perform these tasks.
> * Note that currently, accessing S3 storage in [government regions](intro-regions.md)
>   using a storage integration is limited to Snowflake accounts hosted on AWS in the same government
>   region. Accessing your S3 storage from an account hosted outside of the government region using
>   direct credentials is supported.

The following diagram shows the integration flow for a S3 stage:

1. An external (i.e. S3) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a S3 IAM user created for your account. Snowflake creates a single IAM user that is referenced by all S3 storage integrations in your Snowflake account.
3. An AWS administrator in your organization grants permissions to the IAM user to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same storage integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the IAM user on the bucket before allowing or denying access.

> **Note:**
>
> We highly recommend this option, which avoids the need to supply IAM credentials when accessing cloud storage. See [Configuring secure access to Amazon S3](data-load-s3-config.md) for additional storage access options.

### Step 1: Configure access permissions for the S3 bucket

#### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

As a best practice, Snowflake recommends creating an IAM policy for Snowflake access to the S3 bucket. You can then attach the policy to
the role and use the security credentials generated by AWS for the role to access files in the bucket.

#### Creating an IAM policy

The following step-by-step instructions describe how to configure access permissions for Snowflake in your AWS Management Console to access
your S3 bucket.

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy document that will allow Snowflake to access the S3 bucket and folder.

   The following policy (in JSON format) provides Snowflake with the required permissions to load or unload data using a single bucket and
   folder path.

   Copy and paste the text into the policy editor:

   > **Note:**
   > * Make sure to replace `bucket` and `prefix` with your actual bucket name and folder path prefix.
   > * The Amazon Resource Names (ARN) for buckets in
   >   [government regions](intro-regions.md) have a `arn:aws-us-gov:s3:::` prefix.

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:GetObject",
                 "s3:GetObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   >
   > Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the
   > specified bucket or path in the bucket, respectively.

   Note that AWS policies support a variety of different security use cases.
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Step 2: Create the IAM role in AWS

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Select Another AWS account
5. In the Account ID field, enter your own AWS account ID temporarily. Later, you modify the trust relationship and grant
   access to Snowflake.
6. Select the Require external ID option. An external ID is used to grant access to your AWS resources
   (such as S3 buckets) to a third party like Snowflake.

   Enter a placeholder ID such as `0000`.
   In a later step, you will modify the trust relationship for your IAM role and specify the external ID for your storage integration.
7. Select Next.
8. Select the policy you created in Step 1: Configure access permissions for the S3 bucket (in this topic).
9. Select Next.
10. Enter a name and description for the role, then select Create role.

    You have now created an IAM policy for a bucket, created an IAM role, and attached the policy to the role.
11. On the role summary page, locate and record the Role ARN value. In the next step, you will create a Snowflake integration that
    references this role.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and access data from the cloud storage location until the cache expires.

### Step 3: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake
object that stores a generated identity and access management (IAM) user for your S3 cloud storage, along with an optional set of allowed
or blocked storage locations (that is, buckets). Cloud provider administrators in your organization grant permissions on the storage locations
to the generated user. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, S3) stages. The URL in the stage definition must align with the S3
buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this
> SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = '<iam_role>'
  STORAGE_ALLOWED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `iam_role` is the Amazon Resource Name (ARN) of the role you created in Step 2: Create the IAM role in AWS (in this topic).
* `protocol` is one of the following:

  + `s3` refers to S3 storage in public AWS regions outside of China.
  + `s3china` refers to S3 storage in public AWS regions in China.
  + `s3gov` refers to S3 storage in [government regions](intro-regions.md).
* `bucket` is the name of a S3 bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS
  parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that
  reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that allows access to all buckets in the account but blocks access to the defined `sensitivedata` folders.

Additional external stages that also use this integration can reference the allowed buckets and paths:

```sqlexample
CREATE STORAGE INTEGRATION s3_int
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
  STORAGE_ALLOWED_LOCATIONS = ('*')
  STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket1/mypath1/sensitivedata/', 's3://mybucket2/mypath2/sensitivedata/');
```

> **Note:**
>
> Optionally, use the [STORAGE_AWS_EXTERNAL_ID](../sql-reference/sql/create-storage-integration.md) parameter to specify
> your own external ID. You might choose this option
> to use the same external ID across multiple external volumes and/or storage integrations.

### Step 4: Retrieve the AWS IAM user for your Snowflake account

1. To retrieve the ARN for the IAM user that was created automatically for your Snowflake account, use the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md).

   ```sqlsyntax
   DESC INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 3: Create a Cloud Storage Integration in Snowflake
     (in this topic).

   For example:

   ```sqlexample
   DESC INTEGRATION s3_int;
   ```

   ```output
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   | property                  | property_type | property_value                                                                 | property_default |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------|
   | ENABLED                   | Boolean       | true                                                                           | false            |
   | STORAGE_ALLOWED_LOCATIONS | List          | s3://mybucket1/mypath1/,s3://mybucket2/mypath2/                                | []               |
   | STORAGE_BLOCKED_LOCATIONS | List          | s3://mybucket1/mypath1/sensitivedata/,s3://mybucket2/mypath2/sensitivedata/    | []               |
   | STORAGE_AWS_IAM_USER_ARN  | String        | arn:aws:iam::123456789001:user/abc1-b-self1234                                 |                  |
   | STORAGE_AWS_ROLE_ARN      | String        | arn:aws:iam::001234567890:role/myrole                                          |                  |
   | STORAGE_AWS_EXTERNAL_ID   | String        | MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=                               |                  |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   ```
2. Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `STORAGE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account; for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All S3 storage integrations in your account use that IAM user. |
   | `STORAGE_AWS_EXTERNAL_ID` | The external ID that Snowflake uses to establish a trust relationship with AWS. If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created the storage integration, Snowflake generates an ID for you to use. |

   You provide these values in the next section.

### Step 5: Grant the IAM user permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your AWS Management Console so that you can use a S3 bucket to load and unload data:

1. Sign in to the AWS Management Console.
2. Select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the role you created in Step 2: Create the IAM role in AWS (in this topic).
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC STORAGE INTEGRATION output values you recorded in
   Step 4: Retrieve the AWS IAM user for your Snowflake account (in this topic):

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<snowflake_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   > * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   > * `snowflake_external_id` is the STORAGE_AWS_EXTERNAL_ID value you recorded.
   >
   >   In this example, the `snowflake_external_id` value is `MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=`.
   >
   >   > **Note:**
   >   >
   >   > For security reasons, if you create a new storage integration (or recreate an existing storage integration using the CREATE OR
   >   > REPLACE STORAGE INTEGRATION syntax) without specifying an external ID, the new integration has a *different* external ID and
   >   > can’t resolve the trust relationship unless you update the trust policy.
8. Select Update policy to save your changes.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Determining the correct option

Before proceeding, determine whether an S3 event notification exists for the target path (or “prefix,” in AWS terminology) in your S3 bucket where your data files are located. AWS rules prohibit creating conflicting notifications for the same path.

The following options for automating Snowpipe using Amazon SQS are supported:

* **Option 1. New S3 event notification:** Create an event notification for the target path in your S3 bucket. The event notification informs Snowpipe via an SQS queue when files are ready to load.

  > **Important:**
  >
  > If a conflicting event notification exists for your S3 bucket, use Option 2 instead.
* **Option 2. Existing event notification:** Configure [Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a broadcaster to share notifications for a given path with multiple endpoints (or “subscribers,” e.g. SQS queues or AWS Lambda workloads), including the Snowflake SQS queue for Snowpipe automation. An S3 event notification published by SNS informs Snowpipe via an SQS queue when files are ready to load.

  > **Note:**
  >
  > We recommend this option if you plan to use [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md). You can also migrate from option 1 to
  > option 2 after you create a replication or failover group. For more information, see [Migrate to Amazon Simple Notification Service (SNS)](account-replication-stages-pipes-load-history.md).
* **Option 3. Setting up Amazon EventBridge for automating Snowpipe:** Similar to option 2, you can also enable [Amazon EventBridge](https://aws.amazon.com/eventbridge/) for S3 buckets and create rules to send notifications to SNS topics.

## Option 1: Creating a new S3 event notification to automate Snowpipe

This section describes the most common option for triggering Snowpipe data loads automatically using [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps explain how to create an event notification for the target path (or “prefix,” in AWS terminology) in your S3 bucket where your data files are stored.

> > **Important:**
> >
> > If a conflicting event notification exists for your S3 bucket, use Option 2: Configuring Amazon SNS to Automate Snowpipe Using SQS Notifications (in this topic) instead. AWS rules prohibit creating conflicting notifications for the same target path.

The following diagram shows the Snowpipe auto-ingest process flow:

1. Data files are loaded in a stage.
2. An S3 event notification informs Snowpipe via an SQS queue that files are ready to load. Snowpipe copies the files into a queue.
3. A Snowflake-provided virtual warehouse loads data from the queued files into the target table based on parameters defined in the specified pipe.

> **Note:**
>
> The instructions in this topic assume a target table already exists in the Snowflake database where your data will be loaded.

### Step 1: Create a stage (if needed)

Create an external stage that references your S3 bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowpipe fetches your data files from the stage and temporarily queues them before loading them into your target table. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configuring Secure Access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `files`. The stage references a storage integration named `my_storage_int`:

> ```sqlexample
> USE SCHEMA snowpipe_db.public;
>
> CREATE STAGE mystage
>   URL = 's3://mybucket/load/files'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 2: Create a pipe with auto-ingest enabled

Create a pipe using the [CREATE PIPE](../sql-reference/sql/create-pipe.md) command. The pipe defines the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement used by Snowpipe to load data from the ingestion queue into the target table.

The following example creates a pipe named `mypipe` in the active schema for the user session. The pipe loads the data from files staged in the `mystage` stage into the `mytable` table:

> ```sqlexample
> CREATE PIPE snowpipe_db.public.mypipe
>   AUTO_INGEST = TRUE
>   AS
>     COPY INTO snowpipe_db.public.mytable
>       FROM @snowpipe_db.public.mystage
>       FILE_FORMAT = (type = 'JSON');
> ```

The `AUTO_INGEST = TRUE` parameter specifies to read event notifications sent from an S3 bucket to an SQS queue when new data is ready to load.

> **Important:**
>
> Compare the stage reference in the pipe definition with existing pipes. Verify that the directory paths for the same S3 bucket do not overlap; otherwise, multiple pipes could load the same set of data files multiple times, into one or more target tables. This can happen, for example, when multiple stages reference the same S3 bucket with different levels of granularity, such as `s3://mybucket/path1` and `s3://mybucket/path1/path2`. In this use case, if files are staged in `s3://mybucket/path1/path2`, the pipes for both stages would load a copy of the files.
>
> This is different from the manual Snowpipe setup (with auto-ingest *disabled*), which requires users to submit a named set of files to a REST API to queue the files for loading. With auto-ingest enabled, each pipe receives a generated file list from the S3 event notifications. Additional care is required to avoid data duplication.

### Step 3: Configure security

For each user who will execute continuous data loads using Snowpipe, grant sufficient access control privileges on the objects for the data load (i.e. the target database, schema, and table; the stage object, and the pipe).

> **Note:**
>
> To follow the general principle of “least privilege”, we recommend creating a separate user and role to use for ingesting files using a pipe. The user should be created with this role as its default role.

Using Snowpipe requires a role with the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Named pipe | OWNERSHIP |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created in Step 1: Create a Stage (If Needed) references a named file format. |
| Target database | USAGE |  |
| Target schema | USAGE |  |
| Target table | INSERT , SELECT |  |

Use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command to grant privileges to the role.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) or higher, or another role with both the CREATE ROLE privilege on the account and the global MANAGE GRANTS privilege, can create roles and grant privileges.

For example, create a role named `snowpipe_role` that can access a set of `snowpipe_db.public` database objects as well as a pipe named `mypipe`; then, grant the role to a user:

> ```sqlexample
> -- Create a role to contain the Snowpipe privileges
> USE ROLE SECURITYADMIN;
>
> CREATE OR REPLACE ROLE snowpipe_role;
>
> -- Grant the required privileges on the database objects
> GRANT USAGE ON DATABASE snowpipe_db TO ROLE snowpipe_role;
>
> GRANT USAGE ON SCHEMA snowpipe_db.public TO ROLE snowpipe_role;
>
> GRANT INSERT, SELECT ON snowpipe_db.public.mytable TO ROLE snowpipe_role;
>
> GRANT USAGE ON STAGE snowpipe_db.public.mystage TO ROLE snowpipe_role;
>
> -- Pause the pipe for OWNERSHIP transfer
> ALTER PIPE mypipe SET PIPE_EXECUTION_PAUSED = TRUE;
>
> -- Grant the OWNERSHIP privilege on the pipe object
> GRANT OWNERSHIP ON PIPE snowpipe_db.public.mypipe TO ROLE snowpipe_role;
>
> -- Grant the role to a user
> GRANT ROLE snowpipe_role TO USER jsmith;
>
> -- Set the role as the default role for the user
> ALTER USER jsmith SET DEFAULT_ROLE = snowpipe_role;
>
> -- Resume the pipe
> ALTER PIPE mypipe SET PIPE_EXECUTION_PAUSED = FALSE;
> ```

### Step 4: Configure event notifications

Configure event notifications for your S3 bucket to notify Snowpipe when new data is available to load. The auto-ingest feature relies on SQS queues to deliver event notifications from S3 to Snowpipe.

For ease of use, Snowpipe SQS queues are created and managed by Snowflake. The SHOW PIPES command output displays the Amazon Resource Name (ARN) of your SQS queue.

1. Execute the SHOW PIPES command:

   > ```sqlexample
   > SHOW PIPES;
   > ```

   Note the ARN of the SQS queue for the stage in the `notification_channel` column. Copy the ARN to a convenient location.

   > **Note:**
   >
   > Following AWS guidelines, Snowflake designates no more than one SQS queue per AWS S3 region. An SQS queue can be shared among multiple buckets in the same region from the same AWS account. The SQS queue coordinates notifications for all pipes connecting the external stages for the S3 buckets to the target tables. When a data file is uploaded into the bucket, all pipes that match the stage directory path perform a one-time load of the file into their corresponding target tables.
2. Log into the Amazon S3 console.
3. Configure an event notification for your S3 bucket using the instructions provided in the [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html). Complete the fields as follows:

   > * Name: Name of the event notification (e.g. `Auto-ingest Snowflake`).
   > * Events: Select the ObjectCreate (All) option.
   > * Send to: Select SQS Queue from the dropdown list.
   > * SQS: Select Add SQS queue ARN from the dropdown list.
   > * SQS queue ARN: Paste the SQS queue name from the SHOW PIPES output.

> **Note:**
>
> These instructions create a single event notification that monitors activity for the entire S3 bucket. This is the simplest approach. This notification handles all pipes configured at a more granular level in the S3 bucket directory. Snowpipe only loads data files as specified in pipe definitions. Note, however, that a high volume of notifications for activity outside a pipe definition could negatively impact the rate at which Snowpipe filters notifications and takes action.
>
> Alternatively, in the above steps, configure one or more paths and/or file extensions (or *prefixes* and *suffixes*, in AWS terminology) to filter event activity. For instructions, see the object key name filtering information in the relevant [AWS documentation topic](https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-how-to-filtering.html). Repeat these steps for each additional path or file extension you want the notification to monitor.
>
> Note that AWS limits the number of these notification *queue configurations* to a maximum of 100 per S3 bucket.
>
> Also note that AWS does not allow overlapping queue configurations (across event notifications) for the same S3 bucket. For example, if an existing notification is configured for `s3://mybucket/load/path1`, then you cannot create another notification at a higher level, such as `s3://mybucket/load`, or vice-versa.

Snowpipe with auto-ingest is now configured!

When new data files are added to the S3 bucket, the event notification informs Snowpipe to load them into the target table defined in the pipe.

### Step 5: Load historical files

To load any backlog of data files that existed in the external stage before SQS notifications were configured, see [Loading historic data](data-load-snowpipe-manage.md).

### Step 6: Delete staged files

Delete the staged files after you successfully load the data and no longer require the files. For instructions, see
[Deleting staged files after Snowpipe loads the data](data-load-snowpipe-manage.md).

## Option 2: Configuring Amazon SNS to automate Snowpipe using SQS notifications

This section describes how to trigger Snowpipe data loads automatically using [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps explain how to configure [Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a broadcaster to publish event notifications for your S3 bucket to multiple subscribers (e.g. SQS queues or AWS Lambda workloads), including the Snowflake SQS queue for Snowpipe automation.

> > **Note:**
> >
> > These instructions assume an event notification exists for the target path in your S3 bucket where your data files are located. If no event notification exists, either:
> >
> > * Follow Option 1: Creating a New S3 Event Notification to Automate Snowpipe (in this topic) instead.
> > * Create an event notification for your S3 bucket, then proceed with the instructions in this topic. For information, see the [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html).

The following diagram shows the process flow for Snowpipe auto-ingest with Amazon SNS:

1. Data files are loaded in a stage.
2. An S3 event notification published by SNS informs Snowpipe via an SQS queue that files are ready to load. Snowpipe copies the files into a queue.
3. A Snowflake-provided virtual warehouse loads data from the queued files into the target table based on parameters defined in the specified pipe.

> **Note:**
>
> The instructions assume a target table already exists in the Snowflake database where your data will be loaded.
>
> Snowpipe auto ingest supports AWS KMS-encrypted SNS topics. For more information, refer to [Encryption at rest](https://docs.aws.amazon.com/sns/latest/dg/sns-server-side-encryption.html).

### Prerequisite: Create an Amazon SNS Topic and Subscription

1. Create an SNS topic in your AWS account to handle all messages for the Snowflake stage location on your S3 bucket.
2. Subscribe your target destinations for the S3 event notifications (for example, other SQS queues or AWS Lambda workloads) to this topic. SNS publishes event notifications for your bucket to all subscribers to the topic.

For instructions, see the [SNS documentation](https://aws.amazon.com/documentation/sns/).

### Step 1: Subscribe the Snowflake SQS Queue to the SNS Topic

1. Sign in to the AWS Management Console.
2. From the home dashboard, choose Simple Notification Service (SNS).
3. Choose Topics from the left-hand navigation pane.
4. Locate the topic for your S3 bucket. Note the topic ARN.
5. Using a Snowflake client, query the [SYSTEM$GET_AWS_SNS_IAM_POLICY](../sql-reference/functions/system_get_aws_sns_iam_policy.md) system function with your SNS topic ARN:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('<sns_topic_arn>');
   > ```

   The function returns an IAM policy that grants a Snowflake SQS queue permission to subscribe to the SNS topic.

   For example:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('arn:aws:sns:us-west-2:001234567890:s3_mybucket');
   >
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | SYSTEM$GET_AWS_SNS_IAM_POLICY('ARN:AWS:SNS:US-WEST-2:001234567890:S3_MYBUCKET')                                                                                                                                                                   |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | {"Version":"2012-10-17","Statement":[{"Sid":"1","Effect":"Allow","Principal":{"AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"},"Action":["sns:Subscribe"],"Resource":["arn:aws:sns:us-west-2:001234567890:s3_mybucket"]}]}                 |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > ```
6. Return to the AWS Management Console. Choose Topics from the left-hand navigation pane.
7. Select the topic for your S3 bucket, and click the Edit button. The Edit page opens.
8. Click Access policy - Optional to expand this area of the page.
9. Merge the IAM policy addition from the SYSTEM$GET_AWS_SNS_IAM_POLICY function results into the JSON document.

   For example:

   **Original IAM policy (abbreviated):**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      }
   >    ]
   >  }
   > ```

   **Merged IAM policy:**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      },
   >      {
   >         "Sid":"1",
   >         "Effect":"Allow",
   >         "Principal":{
   >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
   >          },
   >          "Action":[
   >            "sns:Subscribe"
   >          ],
   >          "Resource":[
   >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
   >          ]
   >      }
   >    ]
   >  }
   > ```
10. Add an additional policy grant to allow S3 to publish event notifications for the bucket to the SNS topic.

    For example (using the SNS topic ARN and S3 bucket used throughout these instructions):

    > ```sqljson
    > {
    >     "Sid":"s3-event-notifier",
    >     "Effect":"Allow",
    >     "Principal":{
    >        "Service":"s3.amazonaws.com"
    >     },
    >     "Action":"SNS:Publish",
    >     "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >     "Condition":{
    >        "ArnLike":{
    >           "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >        }
    >     }
    >  }
    > ```

    **Merged IAM policy:**

    > ```sqljson
    > {
    >   "Version":"2008-10-17",
    >   "Id":"__default_policy_ID",
    >   "Statement":[
    >      {
    >         "Sid":"__default_statement_ID",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "AWS":"*"
    >         }
    >         ..
    >      },
    >      {
    >         "Sid":"1",
    >         "Effect":"Allow",
    >         "Principal":{
    >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
    >          },
    >          "Action":[
    >            "sns:Subscribe"
    >          ],
    >          "Resource":[
    >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
    >          ]
    >      },
    >      {
    >         "Sid":"s3-event-notifier",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "Service":"s3.amazonaws.com"
    >         },
    >         "Action":"SNS:Publish",
    >         "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >         "Condition":{
    >            "ArnLike":{
    >               "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >            }
    >         }
    >       }
    >    ]
    >  }
    > ```
11. Click Save changes.

### Step 2: Create a stage (if needed)

Create an external stage that references your S3 bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowpipe fetches your data files from the stage and temporarily queues them before loading them into your target table.

Alternatively, you can use an existing external stage.

> **Note:**
>
> To configure secure access to the cloud storage location, see Configuring Secure Access to Cloud Storage (in this topic).

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `files`. The stage references a storage integration named `my_storage_int`:

> ```sqlexample
> CREATE STAGE mystage
>   URL = 's3://mybucket/load/files'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 3: Create a pipe with auto-ingest enabled

Create a pipe using the [CREATE PIPE](../sql-reference/sql/create-pipe.md) command. The pipe defines the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement used by Snowpipe to load data from the ingestion queue into the target table. In the COPY statement, identify the SNS topic ARN from Prerequisite: Create an Amazon SNS Topic and Subscription.

The following example creates a pipe named `mypipe` in the active schema for the user session. The pipe loads the data from files staged in the `mystage` stage into the `mytable` table:

> ```sqlexample
> CREATE PIPE snowpipe_db.public.mypipe
>   AUTO_INGEST = TRUE
>   AWS_SNS_TOPIC='<sns_topic_arn>'
>   AS
>     COPY INTO snowpipe_db.public.mytable
>       FROM @snowpipe_db.public.mystage
>       FILE_FORMAT = (type = 'JSON');
> ```

Where:

`AUTO_INGEST = TRUE`
:   Specifies to read event notifications sent from an S3 bucket to an SQS queue when new data is ready to load.

`AWS_SNS_TOPIC = '<sns_topic_arn>'`
:   Specifies the ARN for the SNS topic for your S3 bucket, e.g. `arn:aws:sns:us-west-2:001234567890:s3_mybucket` in the current example. The CREATE PIPE statement subscribes the Snowflake SQS queue to the specified SNS topic. Note that the pipe will only copy files to the ingest queue triggered by event notifications via the SNS topic.

To remove either parameter from a pipe, it is currently necessary to recreate the pipe using the CREATE OR REPLACE PIPE syntax.

> **Important:**
>
> Verify that the storage location reference in the COPY INTO *<table>* statement does not overlap with the reference in existing pipes
> in the account. Otherwise, multiple pipes could load the same set of data files into the target tables. For example, this situation can
> occur when multiple pipe definitions reference the same storage location with different levels of granularity, such as
> `<storage_location>/path1/` and `<storage_location>/path1/path2/`. In this example, if files are staged in
> `<storage_location>/path1/path2/`, both pipes would load a copy of the files.
>
> View the COPY INTO *<table>* statements in the definitions of all pipes in the account by executing [SHOW PIPES](../sql-reference/sql/show-pipes.md)
> or by querying either the [PIPES](../sql-reference/account-usage/pipes.md) view in Account Usage or the
> [PIPES](../sql-reference/info-schema/pipes.md) view in the Information Schema.

### Step 4: Configure security

For each user who will execute continuous data loads using Snowpipe, grant sufficient access control privileges on the objects for the data load (i.e. the target database, schema, and table; the stage object, and the pipe).

> **Note:**
>
> To follow the general principle of “least privilege”, we recommend creating a separate user and role to use for ingesting files using a pipe. The user should be created with this role as its default role.

Using Snowpipe requires a role with the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Named pipe | OWNERSHIP |  |
| Named storage integration | USAGE | Needed if the stage you created in Step 2: Create a Stage (If Needed) references a storage integration. |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created in Step 2: Create a Stage (If Needed) references a named file format. |
| Target database | USAGE |  |
| Target schema | USAGE |  |
| Target table | INSERT , SELECT |  |

Use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command to grant privileges to the role.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) or higher can create roles.

For example, create a role named `snowpipe_role` that can access a set of `snowpipe_db.public` database objects as well as a pipe named `mypipe`; then, grant the role to a user:

> ```sqlexample
> -- Create a role to contain the Snowpipe privileges
> USE ROLE SECURITYADMIN;
>
> CREATE OR REPLACE ROLE snowpipe_role;
>
> -- Grant the required privileges on the database objects
> GRANT USAGE ON DATABASE snowpipe_db TO ROLE snowpipe_role;
>
> GRANT USAGE ON SCHEMA snowpipe_db.public TO ROLE snowpipe_role;
>
> GRANT INSERT, SELECT ON snowpipe_db.public.mytable TO ROLE snowpipe_role;
>
> GRANT USAGE, READ ON STAGE snowpipe_db.public.mystage TO ROLE snowpipe_role;
>
> -- Pause the pipe for OWNERSHIP transfer
> ALTER PIPE mypipe SET PIPE_EXECUTION_PAUSED = TRUE;
>
> -- Grant the OWNERSHIP privilege on the pipe object
> GRANT OWNERSHIP ON PIPE snowpipe_db.public.mypipe TO ROLE snowpipe_role;
>
> -- Grant the role to a user
> GRANT ROLE snowpipe_role TO USER jsmith;
>
> -- Set the role as the default role for the user
> ALTER USER jsmith SET DEFAULT_ROLE = snowpipe_role;
>
> -- Resume the pipe
> ALTER PIPE mypipe SET PIPE_EXECUTION_PAUSED = FALSE;
> ```

Snowpipe with auto-ingest is now configured!

When new data files are added to the S3 bucket, the event notification informs Snowpipe to load them into the target table defined in the pipe.

### Step 5: Load historical files

To load any backlog of data files that existed in the external stage before SQS notifications were configured, see [Loading historic data](data-load-snowpipe-manage.md).

### Step 6: Delete staged files

Delete the staged files after you successfully load the data and no longer require the files. For instructions, see
[Deleting staged files after Snowpipe loads the data](data-load-snowpipe-manage.md).

## Option 3: Setting up Amazon EventBridge to automate Snowpipe

Similar to Option 2, you can also set up Amazon EventBridge to automate Snowpipe.

### Step 1: Create an Amazon SNS topic

Follow Prerequisite: Create an Amazon SNS Topic and Subscription (in this topic).

### Step 2: Create an EventBridge rule to subscribe S3 buckets and send notifications to SNS topic

* [Enable Amazon EventBridge](https://docs.aws.amazon.com/AmazonS3/latest/userguide/enable-event-notifications-eventbridge.html) for S3 buckets.
* Create EventBridge rules to [send notifications](https://docs.aws.amazon.com/eventbridge/latest/userguide/eb-s3-object-created-tutorial.html) to the SNS topic created in step 1.

### Step 3: Configuring Amazon SNS to automate Snowpipe using SQS notifications

Follow Option 2: Configuring Amazon SNS to automate Snowpipe using SQS notifications (in this topic).

## SYSTEM$PIPE_STATUS output

The [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function retrieves a JSON representation of the current status of a pipe.

For pipes with AUTO_INGEST set to TRUE, the function returns a JSON object containing the following name/value pairs (if applicable to the current pipe status):

> {“executionState”:”<value>”,”oldestFileTimestamp”:<value>,”pendingFileCount”:<value>,”notificationChannelName”:”<value>”,”numOutstandingMessagesOnChannel”:<value>,”lastReceivedMessageTimestamp”:”<value>”,”lastForwardedMessageTimestamp”:”<value>”,”error”:<value>,”fault”:<value>}

For descriptions of the output values, see the reference topic for the SQL function.

---
title: Automating Snowpipe for Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-gcs.md
section: User Guide
---

# Automating Snowpipe for Google Cloud Storage

This topic provides instructions for triggering Snowpipe data loads from external stages on Google Cloud Storage automatically using [Google Cloud Pub/Sub](https://cloud.google.com/storage/docs/reporting-changes) messages for Google Cloud Storage (GCS) events.

Note that only `OBJECT_FINALIZE` events trigger Snowpipe to load files. Snowflake recommends that you only send supported events for Snowpipe to reduce costs, event noise, and latency.

## Cloud platform support

Triggering automated Snowpipe data loads using GCS Pub/Sub event messages is supported by Snowflake accounts hosted on [all supported cloud platforms](intro-cloud-platforms.md).

## Configuring secure access to Cloud Storage

> **Note:**
>
> If you have already configured secure access to the GCS bucket that stores your data files, you can skip this section.

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage to a Snowflake identity and access management (IAM) entity.

This section describes how to use storage integrations to allow Snowflake to read data from and write to a Google Cloud Storage bucket referenced in an external
(that is, Cloud Storage) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as
secret keys or access tokens; instead, integration objects reference a Cloud Storage service account. An administrator in your organization grants the service
account permissions in the Cloud Storage account.

Administrators can also restrict users to a specific set of Cloud Storage buckets (and optional paths) accessed by external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires access to your Cloud Storage project as a project editor. If you are not a project
>   editor, ask your Cloud Storage administrator to perform these tasks.
> * Confirm that Snowflake supports the Google Cloud Storage region that your storage is hosted in. For more information, see
>   [Supported cloud regions](intro-regions.md).

The following diagram shows the integration flow for a Cloud Storage stage:

1. An external (that is, Cloud Storage) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a Cloud Storage service account created for your account. Snowflake creates a single service account that is referenced by all GCS storage integrations in your Snowflake account.
3. A project editor for your Cloud Storage project grants permissions to the service account to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the service account on the bucket before allowing or denying access.

**In this Section:**

### Step 1: Create a Cloud Storage integration in Snowflake

Create an integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. An integration is a Snowflake object that delegates authentication responsibility for external cloud storage to a Snowflake-generated entity (that is, a Cloud Storage service account). For accessing Cloud Storage buckets, Snowflake creates a service account that can be granted permissions to access the bucket(s) that store your data files.

A single storage integration can support multiple external (that is, GCS) stages. The URL in the stage definition must align with the GCS buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'GCS'
  ENABLED = TRUE
  STORAGE_ALLOWED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `bucket` is the name of a Cloud Storage bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two buckets and paths. In a later step, we will create an external stage that references one of these buckets and paths.

Additional external stages that also use this integration can reference the allowed buckets and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/', 'gcs://mybucket2/path2/')
>   STORAGE_BLOCKED_LOCATIONS = ('gcs://mybucket1/path1/sensitivedata/', 'gcs://mybucket2/path2/sensitivedata/');
> ```

### Step 2: Retrieve the Cloud Storage service account for your Snowflake account

Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the ID for the Cloud Storage service account that was created automatically for your Snowflake account:

```sqlsyntax
DESC STORAGE INTEGRATION <integration_name>;
```

Where:

> * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage integration in Snowflake (in this topic).

For example:

> ```sqlexample
> DESC STORAGE INTEGRATION gcs_int;
>
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> | property                    | property_type | property_value                                                              | property_default |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------|
> | ENABLED                     | Boolean       | true                                                                        | false            |
> | STORAGE_ALLOWED_LOCATIONS   | List          | gcs://mybucket1/path1/,gcs://mybucket2/path2/                               | []               |
> | STORAGE_BLOCKED_LOCATIONS   | List          | gcs://mybucket1/path1/sensitivedata/,gcs://mybucket2/path2/sensitivedata/   | []               |
> | STORAGE_GCP_SERVICE_ACCOUNT | String        | service-account-id@project1-123456.iam.gserviceaccount.com                  |                  |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> ```

The STORAGE_GCP_SERVICE_ACCOUNT property in the output shows the Cloud Storage service account created for your Snowflake account (that is, `service-account-id@project1-123456.iam.gserviceaccount.com`). We provision a single Cloud Storage service account for your entire Snowflake account. All Cloud Storage integrations use that service account.

### Step 3: Grant the service account permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your Google Cloud console so that you can use a Cloud Storage bucket to load and unload data:

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. Filter the list of permissions, and add the following from the list:

   > | Action(s) | Required permissions |
   > | --- | --- |
   > | Data loading only | * `storage.buckets.get` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading with purge option, executing the REMOVE command on the stage | * `storage.buckets.get` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading and unloading | * `storage.buckets.get` (for calculating data transfer costs) * `storage.objects.create` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data unloading only | * `storage.buckets.get` * `storage.objects.create` * `storage.objects.delete` * `storage.objects.list` |
   > | Using [COPY FILES](../sql-reference/sql/copy-files.md) to copy files to an external stage | You must have the following additional permissions:  * `storage.multipartUploads.abort` * `storage.multipartUploads.create` * `storage.multipartUploads.list` * `storage.multipartUploads.listParts` |
7. Select Add.
8. Select Create.

#### Assign the custom role to the Cloud Storage Service Account

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created your storage integration.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name that you retrieved from the DESC STORAGE INTEGRATION command output.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

> **Important:**
>
> If your Google Cloud organization was created on or after May 3, 2024, Google Cloud enforces a
> [domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains)
> in project organization policies. The default constraint lists your domain as the only allowed value.
>
> To allow the Snowflake service account access to your storage, you must
> [update the domain restriction](data-load-gcs-allow.md).

#### Grant the Cloud Storage service account permissions on the Cloud Key Management Service cryptographic keys

> **Note:**
>
> This step is required only if your GCS bucket is encrypted using a key stored in the Google Cloud Key Management Service (Cloud KMS).

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, search for and select Security » Key Management.
3. Select the key ring that is assigned to your GCS bucket.
4. Click SHOW INFO PANEL in the upper-right corner. The information panel for the key ring slides out.
5. Click the ADD PRINCIPAL button.
6. In the New principals field, search for the service account name from the DESCRIBE INTEGRATION output in Step 2: Retrieve the Cloud Storage service account for your Snowflake account (in this topic).
7. From the Select a role dropdown, select the `Cloud KMS CrytoKey Encryptor/Decryptor` role.
8. Click the Save button. The service account name is added to the Cloud KMS CrytoKey Encryptor/Decryptor role dropdown in the information panel.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configuring Automation Using GCS Pub/Sub

### Prerequisites

The instructions in this topic assume the following items have been created and configured:

GCP account:
:   * Pub/Sub topic that receives event messages from the GCS bucket. For more information, see Creating the Pub/Sub Topic (in this topic).
    * Subscription that receives event messages from the Pub/Sub topic. For more information, see Creating the Pub/Sub Subscription (in this topic).

    For instructions, see the [Pub/Sub documentation](https://cloud.google.com/pubsub/docs).

Snowflake:
:   * Target table in the Snowflake database where you want to load data.

#### Creating the Pub/Sub Topic

Create a Pub/Sub topic using [Cloud Shell](https://cloud.google.com/shell) or [Cloud SDK](https://cloud.google.com/sdk).

Execute the following command to create the topic and enable it to listen for activity in the specified GCS bucket:

```bash
$ gsutil notification create -t <topic> -f json -e OBJECT_FINALIZE gs://<bucket-name>
```

Where:

* `<topic>` is the name for the topic.
* `<bucket-name>` is the name of your GCS bucket.

If the topic already exists, the command uses it; otherwise, the command creates a new topic.

For more information, see [Using Pub/Sub notifications for Cloud Storage](https://cloud.google.com/storage/docs/reporting-changes) in the Pub/Sub documentation.

#### Creating the Pub/Sub Subscription

Create a subscription with pull delivery to the Pub/Sub topic using the Cloud Console, `gcloud` command-line tool, or the Cloud Pub/Sub API. For instructions, see [Managing topics and subscriptions](https://cloud.google.com/pubsub/docs/admin) in the Pub/Sub documentation.

> **Note:**
>
> * Only Pub/Sub subscriptions that use the default pull delivery are supported with Snowflake. Push delivery is not supported.

#### Retrieving the Pub/Sub Subscription ID

The Pub/Sub topic subscription ID is used in these instructions to allow Snowflake access to event messages.

1. Log into the Google Cloud Platform Console as a project editor.
2. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
3. Copy the ID in the Subscription ID column for the topic subscription.

### Step 1: Create a Notification Integration in Snowflake

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-gcp.md) command.

The notification integration references your Pub/Sub subscription. Snowflake associates the notification integration with a GCS
service account created for your account. Snowflake creates a single service account that is referenced by all GCS notification
integrations in your Snowflake account.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The GCS service account for notification integrations is different from the service account created for storage integrations.
> * A single notification integration supports a single Google Cloud Pub/Sub subscription. Referencing the same Pub/Sub subscription in multiple notification integrations could result in missing data in target tables because event notifications are split between notification integrations. Therefore, pipe creation is blocked if a pipe references the same Pub/Sub subscription as an existing pipe.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = '<subscription_id>';
```

Where:

* `integration_name` is the name of the new integration.
* `subscription_id` is the subscription name you recorded in Retrieving the Pub/Sub Subscription ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = 'projects/project-1234/subscriptions/sub2';
```

### Step 2: Grant Snowflake Access to the Pub/Sub Subscription

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the Snowflake service account ID:

   ```sqlsyntax
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Notification Integration in Snowflake.

   For example:

   > ```sqlexample
   > DESC NOTIFICATION INTEGRATION my_notification_int;
   > ```
2. Record the service account name in the GCP_PUBSUB_SERVICE_ACCOUNT column, which has the following format:

   ```bash
   <service_account>@<project_id>.iam.gserviceaccount.com
   ```
3. Log into the Google Cloud Platform Console as a project editor.
4. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
5. Select the subscription to configure for access.
6. Click SHOW INFO PANEL in the upper-right corner. The information panel for the subscription slides out.
7. Click the ADD PRINCIPAL button.
8. In the New principals field, search for the service account name you recorded.
9. From the Select a role dropdown, select Pub/Sub Subscriber.
10. Click the Save button. The service account name is added to the Pub/Sub Subscriber role dropdown in the information panel.
11. Navigate to the Dashboard page in the Cloud Console, and select your project from the dropdown list.
12. Click the ADD PEOPLE TO THIS PROJECT button.
13. Add the service account name you recorded.
14. From the Select a role dropdown, select Monitoring Viewer.
15. Click the Save button. The service account name is added to the Monitoring Viewer role.

### Step 3: Create a stage (if needed)

Create an external stage that references your GCS bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads your
staged data files into the external table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configuring Secure Access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE mystage
>   URL='gcs://load/files/'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 4: Create a pipe with auto-ingest enabled

Create a pipe using the [CREATE PIPE](../sql-reference/sql/create-pipe.md) command. The pipe defines the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement used by Snowpipe to load data from the ingestion queue into the target table.

For example, create a pipe in the `snowpipe_db.public` schema that loads data from files staged in an external (GCS) stage named `mystage` into a destination table named `mytable`:

```sqlexample
CREATE PIPE snowpipe_db.public.mypipe
  AUTO_INGEST = true
  INTEGRATION = 'MY_NOTIFICATION_INT'
  AS
    COPY INTO snowpipe_db.public.mytable
      FROM @snowpipe_db.public.mystage/path2;
```

The INTEGRATION parameter references the `my_notification_int` notification integration you created in Step 1: Create a Notification Integration in Snowflake. The integration name must be provided in all uppercase.

> **Important:**
>
> Verify that the storage location reference in the COPY INTO *<table>* statement does not overlap with the reference in existing pipes
> in the account. Otherwise, multiple pipes could load the same set of data files into the target tables. For example, this situation can
> occur when multiple pipe definitions reference the same storage location with different levels of granularity, such as
> `<storage_location>/path1/` and `<storage_location>/path1/path2/`. In this example, if files are staged in
> `<storage_location>/path1/path2/`, both pipes would load a copy of the files.
>
> View the COPY INTO *<table>* statements in the definitions of all pipes in the account by executing [SHOW PIPES](../sql-reference/sql/show-pipes.md)
> or by querying either the [PIPES](../sql-reference/account-usage/pipes.md) view in Account Usage or the
> [PIPES](../sql-reference/info-schema/pipes.md) view in the Information Schema.

Snowpipe with auto-ingest is now configured!

When new data files are added to the GCS bucket, the event message informs Snowpipe to load them into the target table defined in the pipe.

### Step 5: Load historical files

To load any backlog of data files that existed in the external stage before Pub/Sub messages were configured, execute an [ALTER PIPE … REFRESH](../sql-reference/sql/alter-pipe.md) statement.

### Step 6: Delete staged files

Delete the staged files after you successfully load the data and no longer require the files. For instructions, see
[Deleting staged files after Snowpipe loads the data](data-load-snowpipe-manage.md).

## SYSTEM$PIPE_STATUS output

The [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function retrieves a JSON representation of the current status of a pipe.

For pipes with AUTO_INGEST set to TRUE, the function returns a JSON object containing the following name/value pairs (if applicable to the current pipe status):

```sqljson
{"executionState":"<value>","oldestFileTimestamp":<value>,"pendingFileCount":<value>,"notificationChannelName":"<value>","numOutstandingMessagesOnChannel":<value>,"lastReceivedMessageTimestamp":"<value>","lastForwardedMessageTimestamp":"<value>","error":<value>,"fault":<value>}
```

For descriptions of the output values, see the reference topic for the SQL function.

---
title: Automating Snowpipe for Microsoft Azure Blob Storage
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-azure.md
section: User Guide
---

# Automating Snowpipe for Microsoft Azure Blob Storage

This topic provides instructions for triggering Snowpipe data loads from external stages on Azure Blob Storage automatically using [Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/) messages for Blob storage events. The instructions explain how to create an event message for the target path in Blob storage where your data files are stored.

> **Note:**
>
> To harden your security posture, you can configure Snowpipe automation to use private connectivity rather than the public Internet for
> network traffic. For more information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](data-load-azure-private.md).

Snowflake supports the following types of blob storage accounts:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v2

> **Note:**
>
> * Automated Snowpipe isn’t supported for Microsoft Fabric OneLake.
> * Only `Microsoft.Storage.BlobCreated` events trigger Snowpipe to load files. Adding new objects to blob storage triggers these events. Renaming a directory or object does not trigger these events.

Snowflake supports the following `Microsoft.Storage.BlobCreated`
APIs:

* `CopyBlob`
* `PutBlob`
* `PutBlockList`
* `FlushWithClose`
* `SftpCommit`

Snowflake recommends that you only send supported events for Snowpipe to reduce costs, event noise, and latency.

For Data Lake Storage Gen2 storage accounts, `Microsoft.Storage.BlobCreated` events are triggered when clients use the `CreateFile`
and `FlushWithClose` operations. If the SSH File Transfer Protocol (SFTP) is used, `Microsoft.Storage.BlobCreated` events are triggered with `SftpCreate` and `SftpCommit` operations. The `CreateFile` or `SftpCreate` API alone does not indicate a commit of a file in the storage account. If the
`FlushWithClose` or `SftpCommit` message is not sent to the Snowflake queue, Snowpipe does not ingest the file.

> **Note:**
>
> Snowflake only supports the [Azure Event Grid event schema](https://learn.microsoft.com/en-us/azure/event-grid/event-schema); it doesn’t support the [CloudEvents schema with Azure Event Grid](https://learn.microsoft.com/en-us/azure/event-grid/cloud-event-schema).

## Cloud platform support

Triggering automated Snowpipe data loads using Azure Event Grid messages is supported by Snowflake accounts hosted on [all supported cloud platforms](intro-cloud-platforms.md).

## Process flow

[Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/) notifications for an Azure container trigger Snowpipe data loads automatically.

The following diagram shows the Snowpipe auto-ingest process flow:

1. Data files are loaded in a stage.
2. A blob storage event message informs Snowpipe via Event Grid that files are ready to load. Snowpipe copies the files into a queue.
3. A Snowflake-provided virtual warehouse loads data from the queued files into the target table based on parameters defined in the specified pipe.

For instructions, see Automating Snowpipe for Microsoft Azure Blob Storage.

## Configuring secure access to Cloud Storage

> **Note:**
>
> If you have already configured secure access to the Azure blob storage container that stores your data files, you can skip this section.

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage to a Snowflake identity and access management (IAM) entity.

> **Note:**
>
> We highly recommend this option, which avoids the need to supply IAM credentials when accessing cloud storage. See [Configure an Azure container for loading data](data-load-azure-config.md) for additional storage access options.

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Azure container referenced in an external (Azure) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an Azure identity and access management (IAM) user ID called the *app registration*. An administrator in your organization grants this app the necessary permissions in the Azure account.

An integration must also specify containers (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> Completing the instructions in this section requires permissions in Azure to manage storage accounts. If you are not an Azure administrator, ask your Azure administrator to perform these tasks.

**In this Section:**

### Step 1: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake object that stores a generated service principal for your Azure cloud storage, along with an optional set of allowed or blocked storage locations (that is, containers). Cloud provider administrators in your organization grant permissions on the storage locations to the generated service principal. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, Azure) stages. The URL in the stage definition must align with the Azure containers (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'AZURE'
  ENABLED = TRUE
  AZURE_TENANT_ID = '<tenant_id>'
  STORAGE_ALLOWED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `tenant_id` is the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can authenticate to only one tenant, so the allowed and blocked storage locations must refer to storage accounts that all belong this tenant.

  To find your tenant ID, sign in to the Azure portal and click Azure Active Directory » Properties. The tenant ID is displayed in the Tenant ID field.
* `container` is the name of an Azure container that stores your data files (for example, `mycontainer`). The STORAGE_ALLOWED_LOCATIONS and STORAGE_BLOCKED_LOCATIONS parameters allow or block access to these containers, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over logical directories in the container.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two containers and paths. In a later step, we will create an external stage that references one of these containers and paths. Multiple external stages that use this integration can reference the allowed containers and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>   STORAGE_ALLOWED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/')
>   STORAGE_BLOCKED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/sensitivedata/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/sensitivedata/');
> ```

### Step 2: Grant Snowflake Access to the Storage Locations

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC STORAGE INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage Integration in Snowflake.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed storage locations.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to be granted an access token on specified resources inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Services » Storage Accounts. Click the name of the storage account you are granting the Snowflake service principal access to.
6. Click Access Control (IAM) » Add role assignment.
7. Select the desired role to grant to the Snowflake service principal:

   * `Storage Blob Data Reader` grants read access only. This allows loading data from files staged in the storage account.
   * `Storage Blob Data Contributor` grants read and write access. This allows loading data from or unloading data to files staged in
     the storage account. The role also allows executing the [REMOVE](../sql-reference/sql/remove.md) command to remove files staged in the
     storage account.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC STORAGE INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the storage integration stops working.
9. Click the Review + assign button.

   > **Note:**
   > * According to the Microsoft Azure documentation, role assignments may take up to five minutes to propagate.
   > * Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configuring Automation With Azure Event Grid

### Step 1: Configuring the Event Grid Subscription

This section describes how to set up an Event Grid subscription for Azure Storage events using the Azure CLI. For more information about the steps described in this section, see the following articles in the Azure documentation:

* <https://docs.microsoft.com/en-us/azure/event-grid/custom-event-to-queue-storage>
* <https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blob-event-quickstart>

#### Create a Resource Group

An Event Grid *topic* provides an endpoint where the source (i.e. Azure Storage) sends events. A topic is used for a collection of related events. Event Grid topics are Azure resources, and must be placed in an Azure resource group.

Execute the following command to create a resource group:

```bash
az group create --name <resource_group_name> --location <location>
```

Where:

* `resource_group_name` is the name of the new resource group.
* `location` is the location, or *region* in Snowflake terminology, of your Azure Storage account.

#### Enable the Event Grid Resource Provider

Execute the following command to register the Event Grid resource provider. Note that this step is only required if you have not previously used Event Grid with your Azure account:

```bash
az provider register --namespace Microsoft.EventGrid
az provider show --namespace Microsoft.EventGrid --query "registrationState"
```

#### Create a Storage Account for Data Files

Execute the following command to create a storage account to store your data files. This account must be either a Blob storage (i.e. `BlobStorage` kind) or GPv2 (i.e. `StorageV2` kind) account, because only these two account types support event messages.

> **Note:**
>
> If you already have a Blob storage or GPv2 account, you can use that account instead.

For example, create a Blob storage account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind BlobStorage --access-tier Hot
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a Resource Group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a Storage Account for the Storage Queue

Execute the following command to create a storage account to host your storage queue. This account must be a GPv2 account, because only this kind of account supports event messages to a storage queue.

> **Note:**
>
> If you already have a GPv2 account, you can use that account to host both your data files and your storage queue.

For example, create a GPv2 account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind StorageV2
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a Resource Group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a Storage Queue

A single Azure Queue Storage queue can collect the event messages for many Event Grid subscriptions. For best performance, Snowflake recommends creating a single storage queue to accommodate all of your subscriptions related to Snowflake.

Execute the following command to create a storage queue. A storage queue stores a set of messages, in this case event messages from Event Grid:

```bash
az storage queue create --name <storage_queue_name> --account-name <storage_account_name>
```

Where:

* `storage_queue_name` is the name of the new storage queue.
* `storage_account_name` is the name of the storage account you created in Create a Storage Account for the Storage Queue.

#### Export the Storage Account and Queue IDs for Reference

Execute the following commands to set environment variables for the storage account and queue IDs that will be requested later in these instructions:

* Linux or macOS:

  ```bash
  export storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queueid="$queuestorageid/queueservices/default/queues/<storage_queue_name>"
  ```
* Windows:

  ```bash
  set storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queueid="%queuestorageid%/queueservices/default/queues/<storage_queue_name>"
  ```

Where:

* `data_storage_account_name` is the name of the storage account you created in Create a Storage Account for Data Files.
* `queue_storage_account_name` is the name of the storage account you created in Create a Storage Account for the Storage Queue.
* `resource_group_name` is the name of the resource group you created in Create a Resource Group.
* `storage_queue_name` is the name of the storage queue you created in Create a Storage Queue.

#### Install the Event Grid Extension

Execute the following command to install the Event Grid extension for Azure CLI:

```bash
az extension add --name eventgrid
```

#### Create the Event Grid Subscription

Execute the following command to create the Event Grid subscription. Subscribing to a topic informs Event Grid which events to track:

* Linux or macOS:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id $storageid \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint $queueid \
  --advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit
  ```
* Windows:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id %storageid% \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint %queueid% \
  -advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit
  ```

Where:

* `storageid` and `queueid` are the storage account and queue ID environment variables you set in Export the Storage Account and Queue IDs for Reference.
* `subscription_name` is the name of the new Event Grid subscription.

### Step 2: Creating the Notification Integration

A notification integration is a Snowflake object that provides an interface between Snowflake and a third-party cloud message queuing service such as Azure Event Grid.

> **Note:**
>
> A single notification integration supports a single Azure Storage queue. Referencing the same storage queue in multiple notification integrations could result in missing data in target tables because event notifications are split between notification integrations. Therefore, pipe creation is blocked if a pipe references the same storage queue as an existing pipe.

#### Retrieve the Storage Queue URL and Tenant ID

1. Log into the Microsoft Azure portal.
2. Navigate to Storage account » Queue service » Queues. Record the URL for the queue you created in Create a Storage Queue for reference later. The URL has the following format:

   ```bash
   https://<storage_account_name>.queue.core.windows.net/<storage_queue_name>
   ```
3. Navigate to Azure Active Directory » Properties. Record the Tenant ID value for reference later. The directory ID, or *tenant ID*, is needed to generate the consent URL that grants Snowflake access to the Event Grid subscription.

#### Create the Notification Integration

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-azure.md) command.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The Azure service principal for notification integrations is different from the service principal created for storage integrations.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = '<queue_URL>'
  AZURE_TENANT_ID = '<directory_ID>';
```

Where:

* `integration_name` is the name of the new integration.
* `queue_URL` and `directory_ID` are the queue URL and tenant ID you recorded in Retrieve the Storage Queue URL and Tenant ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = 'https://myqueue.queue.core.windows.net/mystoragequeue'
  AZURE_TENANT_ID = 'a123bcde-1234-5678-abc1-9abc12345678';
```

#### Grant Snowflake Access to the Storage Queue

Note that specific steps in this section require a local installation of the Azure CLI.

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Create the Notification Integration.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed topic.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to obtain an access
   token on any resource inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate
   permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Log into the Microsoft Azure portal.
5. Navigate to Azure Active Directory » Enterprise applications. Verify the Snowflake application identifier you
   recorded in Step 2 in this section is listed.

   > **Important:**
   >
   > If you delete the Snowflake application in Azure Active Directory at a later time, the notification integration stops working.
6. Navigate to Queues » `storage_queue_name`, where `storage_queue_name` is the name of the storage queue you created in Create a Storage Queue.
7. Click Access Control (IAM) » Add role assignment.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC NOTIFICATION
   INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in
   >   this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the notification integration stops working.
9. Grant the Snowflake app the following permissions:

   * Role: Storage Queue Data Contributor
   * Assign access to: Azure AD user, group, or service principal
   * Select: The `appDisplayName` value.

   The Snowflake application identifier should now be listed under Storage Queue Data Contributor (on the same dialog).

### Step 3: Create a stage (if needed)

Create an external stage that references your Azure container using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowpipe fetches your data files from the stage and temporarily queues them before loading them into your target table.

Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configuring Secure Access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `load/files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA snowpipe_db.public;
>
> CREATE STAGE mystage
>   URL = 'azure://myaccount.blob.core.windows.net/mycontainer/load/files/'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

> **Note:**
>
> Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.

### Step 4: Create a pipe with auto-ingest enabled

Create a pipe using the [CREATE PIPE](../sql-reference/sql/create-pipe.md) command. The pipe defines the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement used by Snowpipe to load data from the ingestion queue into the target table.

For example, create a pipe in the `snowpipe_db.public` schema that loads the data from files staged in the `mystage` stage into the `mytable` table:

> ```sqlexample
> CREATE PIPE snowpipe_db.public.mypipe
>   AUTO_INGEST = true
>   INTEGRATION = 'MY_NOTIFICATION_INT'
>   AS
>     COPY INTO snowpipe_db.public.mytable
>       FROM @snowpipe_db.public.mystage
>       FILE_FORMAT = (type = 'JSON');
> ```

Where:

* `MY_NOTIFICATION_INT` is the name of the notification integration you created in Step 2: Creating the Notification Integration.

> **Important:**
>
> * The integration name must be typed in all uppercase.
> * Verify that the storage location reference in the COPY INTO *<table>* statement does not overlap with the reference in existing pipes
>   in the account. Otherwise, multiple pipes could load the same set of data files into the target tables. For example, this situation can
>   occur when multiple pipe definitions reference the same storage location with different levels of granularity, such as
>   `<storage_location>/path1/` and `<storage_location>/path1/path2/`. In this example, if files are staged in
>   `<storage_location>/path1/path2/`, both pipes would load a copy of the files.
>
>   View the COPY INTO *<table>* statements in the definitions of all pipes in the account by executing [SHOW PIPES](../sql-reference/sql/show-pipes.md)
>   or by querying either the [PIPES](../sql-reference/account-usage/pipes.md) view in Account Usage or the
>   [PIPES](../sql-reference/info-schema/pipes.md) view in the Information Schema.

Snowpipe with auto-ingest is now configured!

When new data files are added to the Azure container, the event message informs Snowpipe to load them into the target table defined in the pipe.

### Step 5: Load historical files

To load any backlog of data files that existed in the external stage before Event Grid messages were configured, execute an [ALTER PIPE … REFRESH](../sql-reference/sql/alter-pipe.md) statement.

### Step 6: Delete staged files

Delete the staged files after you successfully load the data and no longer require the files. For instructions, see
[Deleting staged files after Snowpipe loads the data](data-load-snowpipe-manage.md).

## SYSTEM$PIPE_STATUS output

The [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function retrieves a JSON representation of the current status of a pipe.

For pipes with AUTO_INGEST set to TRUE, the function returns a JSON object containing the following name/value pairs (if applicable to the current pipe status):

> {“executionState”:”<value>”,”oldestFileTimestamp”:<value>,”pendingFileCount”:<value>,”notificationChannelName”:”<value>”,”numOutstandingMessagesOnChannel”:<value>,”lastReceivedMessageTimestamp”:”<value>”,”lastForwardedMessageTimestamp”:”<value>”,”error”:<value>,”fault”:<value>}

For descriptions of the output values, see the reference topic for the SQL function.

---
title: AWS data file encryption
source: https://docs.snowflake.com/en/user-guide/data-load-s3-encrypt.md
section: User Guide
---

# AWS data file encryption

Snowflake supports either client-side encryption (CSE) or server-side encryption (SSE). Either can be configured to decrypt files staged in S3 buckets.

* Client-side encryption:

  + AWS_CSE: Requires a MASTER_KEY value. The [master key](https://csrc.nist.gov/glossary/term/master_key) must be a 128-bit or 256-bit key in Base64-encoded form.

    For client-side encryption, Snowflake supports using a master key stored in Snowflake; using a master key stored in AWS Key Management Service (AWS KMS) is not supported.

    Snowflake supports AWS V1 encryption standards. (AWS V2 encryption standards are not supported.)

    For more information, see the AWS documentation for [client-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html).
* Server-side encryption (SSE):

  + AWS_SSE_S3: Requires no additional encryption settings.
  + AWS_SSE_KMS: Accepts an optional KMS_KEY_ID value.

  For more information, see the AWS documentation for [server-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html).

  Using AWS Key Management Service (KMS) to manage keys requires configuring an IAM policy. For information, see the [KMS documentation](https://aws.amazon.com/documentation/kms/).

**Next:** [Create an S3 stage](data-load-s3-create-stage.md)

---
title: AWS PrivateLink and Snowflake
source: https://docs.snowflake.com/en/user-guide/admin-security-privatelink.md
section: User Guide
---

# AWS PrivateLink and Snowflake

This topic describes how to configure AWS PrivateLink to directly connect your Snowflake account to one or more AWS Virtual Private Clouds (VPCs).

## AWS PrivateLink: Overview

[AWS PrivateLink](https://docs.aws.amazon.com/aws-technical-content/latest/aws-vpc-connectivity-options/aws-privatelink.html) is an AWS
service for creating private VPC endpoints that allow direct, secure connectivity between your AWS VPCs and the Snowflake VPC without
traversing the public internet. AWS PrivateLink connectivity supports VPC endpoint services and AWS VPCs that are located in the same or in
different AWS regions. Cross-region connectivity for AWS PrivateLink allows you to use a custom endpoint service to connect a Snowflake account
in a region that is different from your AWS VPC region. Cross-region connectivity isn’t currently supported for any platform as a service (PaaS)
services, such as Amazon Simple Storage Service (Amazon S3) or key management service (KMS).

For more information, see the AWS blog page, [Introducing Cross-Region Connectivity for AWS PrivateLink](https://aws.amazon.com/blogs/networking-and-content-delivery/introducing-cross-region-connectivity-for-aws-privatelink). For information about finding the region names for your account, see Find the cloud-provider’s name of the region for your account.

When [writing external functions](../sql-reference/external-functions.md), you can also use AWS PrivateLink with
[private endpoints](../sql-reference/external-functions-creating-aws-planning.md).

If you have an on-premises environment, such as a non-hosted data center, you can use [AWS Direct Connect](https://aws.amazon.com/directconnect/)
with AWS PrivateLink to connect all your virtual and physical environments in a single, private network.

> **Note:**
>
> AWS Direct Connect is a separate AWS service that must be implemented independently from AWS PrivateLink and is outside the scope of this
> topic. To inquire about implementing AWS Direct Connect, please contact Amazon.

## Enable AWS PrivateLink

> **Note:**
>
> The self-service enablement process in this section *doesn’t* currently support authorizing an AWS account identifier from a managed
> cloud service or a third-party vendor.
>
> To authorize an AWS account identifier for this use case, please retrieve the AWS account identifier from the vendor, and then contact
> [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

To enable AWS PrivateLink for your Snowflake account, complete the following steps:

1. Generate a federated token, and then save the output.

   1. To generate a token, run the [AWS CLI STS](https://docs.aws.amazon.com/cli/latest/reference/sts/get-federation-token.html) command
      on the command line.
      `get-federation-token` requires either an identity and access management user in AWS or the AWS account root
      user. For details, refer to the [AWS documentation](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_request.html#stsapi_comparison).

      > **Important:**
      >
      > The federated token expires after 12 hours. If you call any of the system functions to authorize, verify, or disable your
      > Snowflake account to use AWS PrivateLink and the token has expired, regenerate the token by running the AWS CLI STS command again.

      ```bash
      aws sts get-federation-token --name sam
      ```

      In a later step, you provide the output of this command as the `federated_token` argument for the SYSTEM$AUTHORIZE_PRIVATELINK function.
   2. From your generated token, extract the value of the `"FederatedUserId"` field.
      For example, if your token contains the following values:

      ```sqljson
      {
       ...
         "FederatedUser": {
           "FederatedUserId": "185...:sam",
           "Arn": "arn:aws:sts::185...:federated-user/sam"
         },
       "PackedPolicySize": 0
      }
      ```

      Extract `185...`. In the next step, you provide this 12-digit number as the `aws_id` argument for the SYSTEM$AUTHORIZE_PRIVATELINK function.
2. Using the ACCOUNTADMIN Snowflake system role, call the
   [SYSTEM$AUTHORIZE_PRIVATELINK](../sql-reference/functions/system_authorize_privatelink.md) function to *authorize* (enable) AWS PrivateLink for your
   Snowflake account:

   ```sqlsyntax
   SELECT SYSTEM$AUTHORIZE_PRIVATELINK ( '<aws_id>' , '<federated_token>' );
   ```

   Where:

   * `'aws_id'`

     The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.
   * `'federated_token'`

     The federated token value that contains access credentials for a federated user as a string.

   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$AUTHORIZE_PRIVATELINK (
     '185...',
       '{
         "Credentials": {
           "AccessKeyId": "ASI...",
           "SecretAccessKey": "enw...",
           "SessionToken": "Fwo...",
           "Expiration": "2021-01-07T19:06:23+00:00"
         },
         "FederatedUser": {
           "FederatedUserId": "185...:sam",
           "Arn": "arn:aws:sts::185...:federated-user/sam"
         },
         "PackedPolicySize": 0
        }'
     );
   ```

   To verify your configuration, call the [SYSTEM$GET_PRIVATELINK](../sql-reference/functions/system_get_privatelink.md) function in your Snowflake account on AWS.
   This function uses the same argument values for `'aws_id'` and `'federated_token'` that were used
   to authorize your Snowflake account.

   SYSTEM$GET_PRIVATELINK returns `Account is authorized for PrivateLink.` for a successful authorization.
3. Optional: If you need to *disable* AWS PrivateLink in your Snowflake account, call the [SYSTEM$REVOKE_PRIVATELINK](../sql-reference/functions/system_revoke_privatelink.md)
   function by using the same argument values for `'aws_id'` and `'federated_token'`.

To further harden your security posture, Snowflake recommends pinning private endpoints for your Snowflake account. For more information, see
[Pinning private connectivity endpoints for inbound traffic](pin-private-endpoints.md).

## Configure your AWS VPC environment

> **Attention:**
>
> This section covers only the Snowflake-specific details for configuring your VPC environment.
>
> Snowflake isn’t responsible for the actual configuration of the required AWS VPC endpoints, security group rules, and Domain Name
> System (DNS) records. If you encounter issues with any of these configuration tasks, please contact AWS Support.

### Create and configure your AWS VPC endpoint

To create and configure a VPC endpoint in your AWS VPC environment, complete the following steps:

1. In your Snowflake account, use the ACCOUNTADMIN system role to call the
   [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function, and then record the `privatelink-vpce-id` value.
2. In your AWS environment, create a VPC endpoint by using the `privatelink-vpce-id` value from the previous step.

   > **Note:**
   >
   > If the Snowflake region of your VPC endpoint is different from the region of your AWS VPC, you must make two selections that enable
   > cross-region connectivity. In the AWS VPC Console, select Enable Cross Region endpoint, and then select the primary region of
   > the service in Service Settings » Service Region.
   >
   > For complete instructions, see the step-by-step setup procedure for [configuring cross-region connectivity](https://aws.amazon.com/blogs/networking-and-content-delivery/introducing-cross-region-connectivity-for-aws-privatelink/) in the AWS documentation.
   >
   > For instructions that describe how to find the region name of your account, see Find the cloud-provider’s name of the region for your account.
3. In your AWS environment, authorize a security group of services that connect the Snowflake outgoing connection to port `443` and
   `80` of the VPCE CIDR (Classless Inter-Domain Routing).

For more information, see the following topics in the AWS documentation:

* [Working with VPCs and subnets](https://docs.aws.amazon.com/vpc/latest/userguide/working-with-vpcs.html)
* [VPC endpoints](https://docs.aws.amazon.com/vpc/latest/userguide/vpc-endpoints.html)
* [VPC endpoint services (AWS PrivateLink)](https://docs.aws.amazon.com/vpc/latest/userguide/endpoint-service.html)
* [Security groups for your VPC](https://docs.aws.amazon.com/vpc/latest/userguide/VPC_SecurityGroups.html)

### Find the cloud-provider’s name of the region for your account

Snowflake and the cloud provider that hosts your Snowflake account use similar, but different names for the region that hosts the Snowflake
service. You can use system functions to find region names that you use to establish connectivity across regions. To determine the cloud-provider’s
name of the region that hosts your Snowflake account, take the following steps:

1. Run the [CURRENT_REGION](../sql-reference/functions/current_region.md) and [SHOW REGIONS](../sql-reference/sql/show-regions.md) commands.
2. In the output returned by SHOW REGIONS, find a row that shows a value in the `snowflake_region column` that matches the output returned
   by SELECT CURRENT REGION.

   The value in this row’s `region` column is the cloud-provider’s name of the region that hosts your Snowflake account.

In the following example, `us-west-2` is the cloud-provider’s name of the region that hosts the Snowflake account named `AWS_US_WEST`.

```sqlexample
SELECT CURRENT_REGION();
```

Output:

```sqlexample
+------------------+
| CURRENT_REGION() |
|------------------|
| AWS_US_WEST_2    |
+------------------+
```

```sqlexample
SHOW REGIONS;
```

Output:

```sqlexample
+------------------+-------+-----------|-----------------+
| snowflake_region | cloud | region    | display_name    |
|------------------|-------|-----------|-----------------|
| AWS_US_WEST_2    | aws   | us-west-2 | US West (Oregon)|
+------------------+-------+-----------+-----------------+
```

### Configure your VPC network

To access Snowflake by using an AWS PrivateLink endpoint, you must create Canonical Name (CNAME) records in your DNS to resolve the appropriate
endpoint values from the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function to the DNS name of your VPC endpoint.

The values to obtain from the output of SYSTEM$GET_PRIVATELINK_CONFIG depend on which Snowflake features you access using private
connectivity. For a description of the possible values, see [Return values](../sql-reference/functions/system_get_privatelink_config.md).

Note that the values for `regionless-snowsight-privatelink-url` and `snowsight-privatelink-url` allow access to
Snowsight and the Snowflake Marketplace using private connectivity. However, there is additional configuration if you want to enable
URL redirects. For information, see [Snowsight & Private Connectivity](ui-snowsight-gs.md).

For additional help with DNS configuration, please contact your internal AWS administrator.

> **Important:**
>
> The structure of the Online Certificate Status Protocol (OCSP) cache server host name depends on the version of your installed clients,
> as described in Configure your Snowflake clients:
>
> * If you use the listed version or a later version, use the format shown in Configure your Snowflake clients, which enables better DNS
>   resolution when you have multiple Snowflake accounts — for example, dev, test, and production — in the same region. When updating
>   client drivers and using OCSP with PrivateLink, update the firewall rules to allow the OCSP host name.
> * If you use an earlier client version, then the OCSP cache server host name takes the form `ocsp.region_id.privatelink.snowflakecomputing.com`
>   without an [account identifier](admin-account-identifier.md).
> * Your DNS record must resolve to private IP addresses within your VPC. If it resolves to public IP addresses, the record isn’t configured
>   correctly.

### Create AWS VPC interface endpoints for Amazon S3

This step is required for Amazon S3 traffic from Snowflake clients to stay on the AWS backbone. The Snowflake clients
(such as Snowflake CLI, SnowSQL, JDBC driver, and so on) require access to Amazon S3 to perform various runtime operations.

If your AWS VPC network doesn’t allow access to the public internet, you can configure private connectivity to internal stages or more
gateway endpoints to the Amazon S3 host names required by the Snowflake clients.

There are three options to configure access to Amazon S3. The first two options avoid the public internet and the third option
uses the public internet:

* Configure an AWS VPC interface endpoint for [internal stages](private-internal-stages-aws.md) or for [Snowflake-managed storage volumes](private-managed-volumes-aws.md) if you use
  Apache Iceberg tables with Snowflake-managed storage. This option is recommended.
* Configure an Amazon S3 gateway endpoint. For more information, see the following Attention section.
* Don’t configure an interface endpoint or a gateway endpoint. This results in access that uses the public internet.

> **Attention:**
>
> To prevent communications between an Amazon S3 bucket and an AWS VPC with Snowflake from using the public internet, you can set up an
> Amazon S3 gateway endpoint in the same AWS region as the Amazon S3 bucket. This prevents communications on the public internet because
> AWS PrivateLink only allows communications between VPCs, and the Amazon S3 bucket isn’t included in the VPC.
>
> You can configure the Amazon S3 gateway endpoint to limit access to specific users, Amazon S3 resources, routes, and subnets; however,
> Snowflake doesn’t require this configuration. For more information, see
> [Gateway endpoints for Amazon S3](https://docs.aws.amazon.com/vpc/latest/userguide/vpc-endpoints-s3.html).
>
> To limit Amazon S3 gateways to use only Amazon S3 resources for Snowflake, choose one of the following options:
>
> * Use the specific Amazon S3 host name addresses that is used by your Snowflake account in your AWS endpoint policies. For the complete list of
>   host names that are used by your account, see [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md).
> * Use an Amazon S3 host name pattern that matches the Snowflake S3 host names in your AWS endpoint policies. With this option, there are
>   two possible types of connections to Snowflake: VPC-to-VPC or On-Premises-to-VPC.
>
>   Based on your connection type, complete the following instructions:
>
>   VPC-to-VPC:
>   :   Ensure that the Amazon S3 gateway endpoint exists. Optionally modify the Amazon S3 gateway endpoint policy to match the specific host
>       name patterns that are shown in the following Amazon S3 Hostnames table.
>
>   On-Premises-to-VPC:
>   :   Define a setup to include the Amazon S3 host name patterns in the firewall or proxy configuration *if* Amazon S3 traffic isn’t
>       permitted on the public gateway.

If you don’t require your gateway endpoints to explicitly match your account’s Snowflake-managed S3 buckets, you can use the Amazon S3 host
name patterns shown in the following table to create gateway endpoints:

> | Amazon S3 Hostnames | Notes |
> | --- | --- |
> | **All regions** |  |
> | `sfc-*-stage.s3.amazonaws.com:443` | None. |
> | **All regions other than US East** |  |
> | `sfc-*-stage.s3-<region_id>.amazonaws.com:443` | The pattern uses a hyphen (`-`) before the region ID. |
> | `sfc-*-stage.s3.<region_id>.amazonaws.com:443` | The pattern uses a period (`.`) before the region ID. |

For information about creating gateway endpoints, see [Gateway VPC endpoints](https://docs.aws.amazon.com/vpc/latest/userguide/vpce-gateway.html).

## Connect to Snowflake

Before you connect to Snowflake, you can *optionally* use the Snowflake Connectivity Diagnostic tool (SnowCD) to evaluate the
network connection with Snowflake and AWS PrivateLink.

For more information, see [SnowCD](snowcd.md) and [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md).

Otherwise, connect to Snowflake with your private connectivity account [URL](organizations-connect.md).

If you want to connect to Snowsight through AWS PrivateLink, follow the instructions in the
[Snowsight documentation](ui-snowsight-gs.md).

## Block public access — *Recommended*

After you test private connectivity to Snowflake by using AWS PrivateLink, you can optionally block public access to Snowflake. This
means that users can access Snowflake only if their connection request originates from an IP address within a particular CIDR block range
specified in a Snowflake network policy.

To block public access by using a network policy:

1. Create a new network policy or edit an existing network policy.
2. Add the CIDR block range for your organization.
3. Activate the network policy for your account.

For more information, see [Controlling network traffic with network policies](network-policies.md).

## Configure your Snowflake clients

The following sections describe how to configure Snowflake clients for specific use cases.

### Ensure Snowflake clients support OCSP cache server

The Snowflake OCSP cache server mitigates connectivity issues between Snowflake clients and the server. To enable your installed Snowflake
clients to use the OCSP server cache, ensure that you use the following client versions:

* Snowflake CLI 3.0.0 (or higher)
* SnowSQL 1.1.57 (or higher)
* Python Connector 1.8.2 (or higher)
* JDBC Driver 3.8.3 (or higher)
* ODBC Driver 2.19.3 (or higher)

> **Note:**
>
> The Snowflake OCSP cache server listens on port `80`, which is why you were instructed in Create and configure your AWS VPC endpoint
> to configure your AWS PrivateLink VPCE security group to accept both port `80` and port `443`, which is required for all other
> Snowflake traffic.

### Specify a host name for Snowflake clients

Each Snowflake client requires a host name to connect to your Snowflake account.

The host name is the same as the host name that you specified in the CNAME records in Configure your VPC network.

This step isn’t applicable to access the Snowflake Marketplace.

For example, for an account named `xy12345`:

* If the account is in US West, the host name is `xy12345.us-west-2.privatelink.snowflakecomputing.com`.
* If the account is in EU (Frankfurt), the host name is `xy12345.eu-central-1.privatelink.snowflakecomputing.com`.

> **Important:**
>
> The method for specifying the host name differs depending on the client:
>
> * For the Spark connector and the ODBC and JDBC drivers, specify the entire host name.
> * For all the other clients, *don’t* specify the entire host name.
>   Instead, specify the [account identifier](admin-account-identifier.md) with the `privatelink` segment, which is `<account_identifier>.privatelink`. Snowflake concatenates this name with `snowflakecomputing.com` to dynamically construct the host name.
>
> For more information about specifying the account name or host name for a Snowflake client, see the documentation for each client.

## Using SSO with AWS PrivateLink

Snowflake supports using SSO with AWS PrivateLink. For more information, see:

* [SSO with private connectivity](admin-security-fed-auth-overview.md)
* [Partner applications](oauth-snowflake-overview.md)

## Using Client Redirect with AWS PrivateLink

Snowflake supports using Client Redirect with AWS PrivateLink.

For more information, see [Redirecting client connections](client-redirect.md).

## Using replication and Tri-Secret Secure with private connectivity

Snowflake supports replicating your data from the source account to the target account, regardless of whether you enable
Tri-Secret Secure or this feature in the target account.

## Troubleshooting

To troubleshoot problems that you might come across with PrivateLink, see the following Snowflake Community articles:

* [How to retrieve a Federation Token from AWS for PrivateLink Self-Service](https://community.snowflake.com/s/article/How-to-retrieve-a-Federation-Token-from-AWS-for-PrivateLink-Self-Service)
* [FAQ: PrivateLink Self-Service with AWS](https://community.snowflake.com/s/article/PrivateLink-Self-Service-with-AWS)
* [Troubleshooting: Snowflake self-service functions for AWS PrivateLink](https://community.snowflake.com/s/article/Troubleshooting-Snowflake-self-service-functions-for-AWS-PrivateLink)

---
title: AWS PrivateLink and Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-inbound-configure-aws.md
section: User Guide
---

# AWS PrivateLink and Snowflake Open Catalog

This topic describes how to configure AWS PrivateLink to directly connect your Snowflake Open Catalog account to your query engine by
using inbound private connectivity.

## Prerequisites

* Your Snowflake Open Catalog account is hosted on AWS.
* You have the necessary permissions to configure your AWS DNS service with the private connectivity URL for your Open Catalog account.
  For guidance, see [How to configure the AWS DNS service (Route 53) to access Snowflake via a PrivateLink](https://community.snowflake.com/s/article/How-to-configure-the-AWS-DNS-service-Route-53-to-access-Snowflake-via-a-PrivateLink) in the Snowflake Community.

## Step 1: Enable AWS PrivateLink

In this procedure, you enable AWS PrivateLink for your Open Catalog account. This configuration allows the query engine to connect to
Open Catalog through private connectivity. You will need the 12-digit identifier for your Amazon Web Services (AWS) account and
the federated token value that contains access credentials for a federated user.

1. To obtain the federated token value, execute the following command by using the AWS CLI and copy the value into a text editor:

   ```bash
   aws sts get-federation-token --name sam
   ```
2. Sign in to Snowflake Open Catalog.
3. In the navigation menu, select **Settings**.
4. Select **Authorize**.
5. In the **Authorize Private Link** dialog, enable private connectivity for your account:

   1. In the **ID** field, enter the 12-digit identifier for your Amazon Web Services (AWS) account.
   2. For **Federated token**, enter the federated token value that you copied to a text editor.
   3. Select **Save**.

## Step 2: Verify that your account is authorized

To verify whether your Open Catalog account is authorized for private connectivity to the Snowflake Open Catalog service, follow this procedure:

1. Sign in to Snowflake Open Catalog.
2. In the navigation menu, select **Settings**.
3. Select **Get**.
4. In the Get Private Link authorization dialog, verify your account:

   1. In the **ID** field, enter the 12-digit identifier for your Amazon Web Services (AWS) account.
   2. In the **Federated token** field, enter the federated token value.
      You retrieved this value when you enabled AWS PrivateLink.
   3. Select **Save**. A message appears, which states whether your account is authorized.

## Step 3: Retrieve your Open Catalog account settings

Retrieve these settings, which you’ll need later to create and configure a VPC endpoint and your VPC network.

1. Sign in to Snowflake Open Catalog.
2. In the navigation menu, select **Settings**.
3. On the Settings page, copy the values for the following settings into a text editor:

   * PrivateLink Account URL
   * Regionless PrivateLink Account URL
   * PrivateLink OCSP URL
   * Regionless PrivateLink OCSP URL
   * VPCE Service ID

You paste these values when you create and configure a VPC endpoint (VPCE),
configure your VPC network, and connect to Open Catalog through AWS PrivateLink.

For descriptions of each setting, see
[Return values for the SYSTEM$GET_PRIVATELINK_CONFIG system function](https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_config#returns) in the Snowflake documentation. In this topic, the names of the account settings are in JSON format.

> **Note:**
>
> Remember that, where applicable, the description refers to a Snowflake account but your value is actually for your Snowflake Open
> Catalog account. For example, the `privatelink-account-url` is the URL for your Snowflake Open Catalog account.
>
> * Optional: To retrieve these values in JSON format, [Create a Snowflake CLI connection for Open Catalog](private-connectivity-outbound-manage-endpoints-aws.md),
>   and then call the SYSTEM$GET_PRIVATELINK_CONFIG system function.
> * In the Snowflake documentation, `privatelink-vpce-id` corresponds to the VPCE Service ID in Open Catalog.

## Step 4: Create and configure a VPC endpoint

In this procedure, you create and configure a corresponding VPC endpoint (VPCE) in your AWS VPC environment.

> **Note:**
>
> If you already created a VPC endpoint for your Snowflake account, and the account is in the same deployment as your Open Catalog account,
> creating a new VPC endpoint for your Open Catalog account isn’t required. You can optionally skip this step.

For instructions, see
[Create and configure a VPC endpoint (VPCE)](https://docs.snowflake.com/en/user-guide/admin-security-privatelink#create-and-configure-a-vpc-endpoint-vpce)
in the Snowflake documentation, starting with step 2.

## Step 5: Configure your VPC network

To configure your VPC network, create CNAME records in your DNS service to resolve the appropriate endpoint values from your
Open Catalog account settings for private connectivity to the DNS name of your VPC Endpoint.

For instructions, see [Configure your VPC network](https://docs.snowflake.com/en/user-guide/admin-security-privatelink#configure-your-vpc-network)
in the Snowflake documentation. Remember that these instructions are for Snowflake, so some of the features mentioned in them don’t apply
to Open Catalog. For example, `regionless-snowsight-privatelink-url` is for Snowsight, which isn’t supported in Open Catalog.

For additional help with DNS configuration, contact your internal AWS administrator.

## Step 6: Connect to Open Catalog through AWS PrivateLink

* To register a service connection and connect your query engine to Snowflake Open Catalog through AWS PrivateLink, use the code:

  ```python
  import pyspark
  from pyspark.sql import SparkSession

  spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_privatelink_account_url>/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','<client_id>:<client_secret>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
    .getOrCreate()
  ```

### Parameters

> **Note:**
>
> Ensure that you set up your DNS service to match the value you specify for `<open_catalog_account_identifier>`.

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<client_id>` | Specifies the client ID for the service principal to use.   Enter the **Client ID** that you copied when you configured a new service connection. |
| `<client_secret>` | Specifies the client secret for the service principal to use.   Enter the **Secret** that you copied when you configured a new service connection. |
| `<open_catalog_privatelink_account_url>` | Specifies the URL to connect to your Snowflake account using AWS PrivateLink or Azure Private Link.   Enter one of the following values, which you copied when you retrieved your Open Catalog account settings:  * **PrivateLink Account URL** * **Regionless PrivateLink Account URL**  For details on retrieving your Open Catalog account settings, see the instructions for the cloud platform where your Open Catalog account is hosted:    * AWS * [Azure](private-connectivity-inbound-configure-azure.md) |
| `<principal_role_name>` | Specifies the principal role that is granted to the service principal.  To view this principal role, in Open Catalog, select the **Connections** page, select your service connection, and in the **Principal Details** dialog, refer to **Principal Roles.** |

## Step 7 (Optional): Create a catalog integration for Snowflake

If you’re using Snowflake to query Open Catalog-managed tables, create a catalog for Snowflake that uses a private IP address. To create
this catalog integration, your Snowflake account must be in the same deployment as your Open Catalog account.

For an example, see [Example: Catalog integration that uses a private IP address](../tables-iceberg-open-catalog-query.md) in the Snowflake documentation.

> **Note:**
>
> You can also configure private connectivity for the Snowflake Open Catalog UI. This configuration, combined with configuring private
> connectivity for your Open Catalog account, allows you to access the Open Catalog UI through private connectivity instead of over the public
> internet.
>
> To configure this access, see
> [Configure private connectivity for the Snowflake Open Catalog UI](private-connectivity-ui-configure.md).

---
title: AWS VPC interface endpoints for internal stages
source: https://docs.snowflake.com/en/user-guide/private-internal-stages-aws.md
section: User Guide
---

# AWS VPC interface endpoints for internal stages

This topic provides concepts as well as detailed instructions for connecting to Snowflake internal stages through AWS VPC Interface
Endpoints.

## Overview

AWS [VPC interface endpoints](https://docs.aws.amazon.com/vpc/latest/privatelink/endpoint-services-overview.html) and
[AWS PrivateLink for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html) can be
combined to provide secure connectivity to Snowflake internal stages. This setup ensures that data loading and data unloading operations to
Snowflake internal stages use the AWS internal network and do not take place over the public Internet.

Prior to AWS supporting VPC interface endpoints for internal stage access, it was necessary to create a proxy farm within the AWS VPC to
facilitate secure access to Snowflake internal stages. With the added support of VPC interface endpoints for Snowflake internal stages,
users and client applications can now access Snowflake internal stages over the private AWS network. The following diagram summarizes this
new support:

Note the following regarding the numbers in the BEFORE diagram:

* Users have two options to connect to a Snowflake internal stage:

  + Option A allows an on-premises connection directly to the internal stage as shown by the number 1.
  + Option B allows a connection to the internal stage through a proxy farm as shown by the numbers 2 and 3.
* If using the proxy farm, users can also connect to Snowflake directly as denoted by the number 4.

Note the following regarding the numbers in the AFTER diagram:

* The updates in this feature remove the need to connect to Snowflake or a Snowflake internal stage through a proxy farm.
* An on-premises user can connect to Snowflake directly as shown in number 5.
* To connect to a Snowflake internal stage, on-premises users connect to an interface endpoint, number 6, and then use AWS PrivateLink
  for Amazon S3 to connect to the Snowflake internal stage as shown in number 7.

There is a single Amazon S3 bucket per internal stage deployment. A
[prefix](https://docs.aws.amazon.com/AmazonS3/latest/userguide/using-prefixes.html) in the internal stage Amazon S3 bucket is used to
organize the data in each Snowflake account. The Amazon S3 bucket endpoint URLs are different depending on whether the connection to the
bucket uses private connectivity (i.e. AWS PrivateLink for S3).

Public Amazon S3 Global Endpoint URL:
:   `<bucket_name>.s3.region.amazonaws.com/prefix`

Private Amazon S3 Endpoint URL:
:   `<bucket_name>.<vpceID>.s3.<region>.vpce.amazonaws.com/prefix`

### Benefits

Implementing VPC interface endpoints to access Snowflake internal stages provides the following advantages:

* Internal stage data does not traverse the public Internet.
* Client and SaaS applications, such as Microsoft PowerBI, that run outside of the AWS VPC can connect to Snowflake securely.
* Administrators are not required to modify firewall settings to access internal stage data.
* Administrators can implement consistent security and monitoring regarding how users connect to storage accounts.

### Limitations

AWS doesn’t support cross-region VPC interface endpoints for the Amazon S3 service. Therefore, your VPC interface endpoint must be located in the same region as your Snowflake account to provide inbound connectivity to your Snowflake account’s internal stage.

Cross-region support for AWS PrivateLink isn’t available in government regions or in the People’s Republic of China.

Customers that use a SnowGov region for Federal Information Processing Standard (FIPS) compliance should be aware that AWS Privatelink for
Amazon S3 doesn’t support FIPS endpoints.

For more information about the AWS regions in which FIPS is enforced, see [Supported cloud regions](intro-regions.md).

For information about finding the region names for your account, see [Find the cloud-provider’s name of the region for your account](admin-security-privatelink.md).

For more information about limitations of AWS PrivateLink, see the
[AWS documentation](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html#privatelink-limitations).

## Getting started

Before configuring AWS and Snowflake to allow requests to access a Snowflake internal stage via AWS PrivateLink, you must:

* Meet the prerequisites.
* Choose the implementation strategy that fits your environment.

### Prerequisites

* Set the [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) parameter to enable support for connecting to an internal stage over AWS
  PrivateLink. For both implementation strategies discussed in this topic, the account administrator must execute:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  ALTER ACCOUNT SET ENABLE_INTERNAL_STAGES_PRIVATELINK = true;
  ```
* [AWS PrivateLink for S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html).

  > > **Important:**
  > >
  > > AWS PrivateLink for S3 is an AWS service that must be enabled in your cloud environment.
  > >
  > > For help with configuring and implementing this service, contact your internal AWS administrator.
* Update the firewall allow-listing as follows:

  + If using an outbound firewall, ensure that it allows all the URLs required by Snowflake. For details, see [SnowCD (Connectivity Diagnostic Tool)](snowcd.md).
* For `us-east-1` customers only: If using one of the following Snowflake clients to connect to Snowflake, please upgrade to the
  client version as follows:

  + JDBC driver: 3.13.3 (or higher)
  + ODBC driver: 2.23.2 (or higher)
  + Python Connector for Snowflake: 2.5.1 (or higher)
  + SnowSQL: 1.2.17 (or higher)

    - Upgrade SnowSQL before using this feature. For more information, see [Installing SnowSQL](snowsql-install-config.md).
    - Starting with version 1.3.0, SnowSQL disables automatic upgrades by default to avoid potential issues that can affect production environments when an automatic upgrade occurs. To upgrade, you should download and install new versions manually, preferably in a non-production environment. Snowflake recommends you leave this setting disabled, but you can manually enable the auto-upgrade behavior by configuring the SnowSQL `noup` [option](snowsql-install-config.md) option.

### Choosing an implementation strategy

Choosing the right implementation strategy depends on whether your organization is using AWS PrivateLink to access a single internal stage
or multiple internal stages.

* If your organization is accessing the internal stage of a single account, see Accessing an internal stage with an interface endpoint.
* If your organization is accessing the internal stages of multiple accounts, see
  Accessing Internal stages with dedicated interface endpoints. This strategy uses multiple interface endpoints to connect, one for each
  internal stage.

## Accessing an internal stage with an interface endpoint

Snowflake recommends the following implementation strategy when your organization accesses the internal stage of a *single account*. If you
plan to access multiple internal stages from your VPC, see Accessing Internal stages with dedicated interface endpoints.

To configure a VPC interface endpoint to access a Snowflake internal stage, it is necessary to have support from the following three roles in
your organization:

1. The Snowflake account administrator (that is, a user with the Snowflake ACCOUNTADMIN system role).
2. The AWS administrator.
3. The network administrator.

Depending on the organization, it might be necessary to coordinate the configuration efforts with more than one person or team to implement
the following configuration steps.

### Procedure

Complete the following steps to configure and implement secure access to a Snowflake internal stage through a VPC endpoint:

1. As a Snowflake account administrator, execute the following statements in your Snowflake account and record the value defined by the
   `privatelink_internal_stage` key. Note that the Amazon S3 bucket name is defined in the first segment of the URL when read from left
   to right. For more information, see [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) and
   [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md).

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ALTER ACCOUNT SET ENABLE_INTERNAL_STAGES_PRIVATELINK = true;
   select key, value from table(flatten(input=>parse_json(system$get_privatelink_config())));
   ```
2. As the AWS administrator, create a VPC endpoint for AWS PrivateLink for S3 using the AWS Console. Record the VPCE DNS Name for use in
   the next step; do not record any VPCE DNS zonal names.

   The VPCE DNS Name can be found by
   [describing an interface endpoint](https://docs.aws.amazon.com/vpc/latest/privatelink/vpce-interface.html#describe-interface-endpoint)
   once the endpoint is created.

   In this example, a wildcard (i.e. `*`) is listed as the leading character in the VPCE DNS Name. Replace the leading wildcard with the
   Amazon S3 bucket name from the previous step. For example:

   Replace:
   :   `*.vpce-000000000000a12-abc00ef0.s3.us-west-2.vpce.amazonaws.com`

   With:
   :   `<bucket_name>.vpce-000000000000a12-abc00ef0.s3.us-west-2.vpce.amazonaws.com`
3. As the network administrator, update the DNS settings to resolve the following URL:

   `<bucket_name>.s3.<region>.amazonaws.com` to the VPCE DNS name after the leading wildcard is replaced with the Amazon S3 bucket name.

   In this example, resolve `<bucket_name>.s3.<region>.amazonaws.com` to
   `<bucket_name>.vpce-000000000000a12-abc00ef0.s3.us-west-2.vpce.amazonaws.com`.

   > **Tip:**
   > * Do not use wildcard characters (i.e. `*`) with DNS mapping because of the possible impact of accessing other Amazon S3 buckets outside
   >   of Snowflake.
   > * Use a separate Snowflake account for testing, and configure a private hosted DNS zone in a test VPC to test the feature so that the
   >   testing is isolated and does not impact your other workloads.
   > * If using a separate Snowflake account is not possible, use a test user to access Snowflake from a test VPC where the DNS changes are
   >   made.
   > * To test from on-premises applications, use DNS forwarding to forward requests to the AWS private hosted zone in the VPC where the DNS
   >   settings are made. If there are client applications in both the VPC and on-premises, use AWS Transit Gateway.
   > * Execute the following command from the client machine to verify that the IP address returned is the private IP
   >   address for the storage account:
   >
   >   ```bash
   >   dig <bucket_name>.s3.<region>.amazonaws.com
   >   ```
4. For Snowflake accounts in `us-east-1`, verify your Snowflake clients are on their latest versions.

## Accessing Internal stages with dedicated interface endpoints

Snowflake recommends the following implementation strategy when your organization accesses the internal stages of *multiple accounts*.

The [S3_STAGE_VPCE_DNS_NAME](../sql-reference/parameters.md) parameter allows users to associate a Snowflake account with the DNS name
of an Amazon S3 interface endpoint. This allows organizations with multiple Snowflake accounts in an AWS deployment to associate each
internal stage with a different interface endpoint. When each internal stage has its own interface endpoint, network traffic to a specific
internal stage is isolated from network traffic to other internal stages.

Before continuing, make sure you have met the prerequisites.

### Benefits

The strategy in which an internal stage within an AWS deployment has a dedicated Amazon S3 interface endpoint has the following benefits:

Security:
:   Each account can have a different security strategy because individual interface endpoints can have different security
    configurations.

Chargeback models:
:   Companies can isolate network traffic based on the type of account (for example, production vs. development), and attribute
    costs associated with data flowing through an endpoint to the correct account.

DNS Management:
:   The DNS name of an Amazon S3 interface endpoint is a globally unique name that locates the specific endpoint within a specific
    region. AWS automatically registers this DNS name in its public DNS service, meaning it is publicly resolvable. For these reasons, an
    administrator does not need to do any additional DNS configuration to route traffic through an Amazon S3 interface endpoint to an internal
    stage. For example, the administrator does not need to set up a private hosted zone (PHZ) when configuring the Amazon Route 53
    DNS service or register a DNS name to point to an endpoint.

### Configuration

The network isolation strategy consists of the following:

1. In AWS, an administrator creates a new Amazon S3 interface endpoint for every Snowflake account in the organization. For example, if an
   organization has two accounts in the Snowflake deployment, the administrator creates two interface endpoints.
2. In Snowflake, an administrator uses the S3_STAGE_VPCE_DNS_NAME parameter to associate each Snowflake account with the DNS name of its
   dedicated interface endpoint. All traffic to the account’s internal stage goes through this interface endpoint.

#### AWS configuration

In your VPC as an AWS administrator:

1. [Create a separate Amazon S3 interface endpoint](https://docs.aws.amazon.com/vpc/latest/privatelink/create-interface-endpoint.html#create-interface-endpoint-aws)
   for each of your Snowflake accounts.
2. For each of these endpoints, use the AWS VPC Management Console to:

   1. Open the endpoint to view its Details.
   2. Find the DNS Names field, and copy the region-scoped DNS name. The Snowflake S3_STAGE_VPCE_DNS_NAME parameter will be set to
      this value.

      The format of the region-scoped DNS name looks like `*.vpce-sd98fs0d9f8g.s3.us-west-2.vpce.amazonaws.com`. Though AWS also provides
      an availability zone DNS name, Snowflake recommends the region-scoped DNS name because it provides high availability with failover
      capabilities.

#### Snowflake configuration

After the AWS administrator creates the interface endpoint for a Snowflake account’s internal stage, the Snowflake administrator can
use the S3_STAGE_VPCE_DNS_NAME parameter to associate the DNS name of that endpoint with the account.

The S3_STAGE_VPCE_DNS_NAME parameter should be set to the region-scoped DNS Name of the interface endpoint associated with a specific
internal stage. The standard format begins with an asterisk (`*`) and ends with `vpce.amazonaws.com`
(for example, `*.vpce-sd98fs0d9f8g.s3.us-west-2.vpce.amazonaws.com`).

As an example, the account administrator can execute the following to associate an endpoint with the current account:

```sqlexample
ALTER ACCOUNT SET S3_STAGE_VPCE_DNS_NAME = '*.vpce-sd98fs0d9f8g.s3.us-west2.vpce.amazonaws.com';
```

#### User overrides of interface endpoints

Snowflake supports user-level overrides within accounts that use [gateway endpoints for Amazon S3](https://docs.aws.amazon.com/vpc/latest/privatelink/vpc-endpoints-s3.html).
If your organization has applications (typically, users with the `TYPE` property set to `SERVICE`) running in the cloud that use S3
gateway endpoints to access internal stages, you can preserve DNS isolation (use S3 interface endpoints) at the account level, while allowing
specific service users in the same account to use S3 gateway endpoints. After setting dedicated interface endpoints
for each account, set an override for each user that you want to use the default S3 gateway endpoint.

For example, to associate a service user session with an internal stage that is being accessed through an Amazon S3 gateway endpoint, run the
following ALTER USER command:

```sqlexample
ALTER USER service1 SET S3_STAGE_VPCE_DNS_NAME = 's3-gateway-vpce-default';
```

`s3-gateway-vpce-default` is a reserved token used by the runtime to override the internal stage access and route session traffic through
the S3 gateway endpoint.

### Final DNS value

The final DNS name associated with an account has the form: `<bucketname>.bucket.vpce-<vpceid>.s3.<region>.vpce.amazonaws.com`

Where:

* `<bucketname>` is the name of the internal stage’s Amazon S3 bucket.
* `<vpceid>` is the unique identifier of the Amazon S3 interface endpoint associated with the account.
* `<region>` is the [cloud region](intro-regions.md) that hosts your Snowflake account.

The final DNS name appears in logs for each driver that connects to the internal stage.

---
title: AWS VPC interface endpoints for Snowflake-managed storage volumes
source: https://docs.snowflake.com/en/user-guide/private-managed-volumes-aws.md
section: User Guide
---

# AWS VPC interface endpoints for Snowflake-managed storage volumes

This topic provides concepts and detailed instructions for connecting to Snowflake-managed storage volumes through AWS VPC
interface endpoints.

## Overview

When you use an external query engine such as Apache Spark to read from or write to an iceberg table that uses
Snowflake-managed storage, the query engine communicates directly with the native iceberg volume hosted on Amazon S3. By default,
this traffic can traverse the public internet.

[AWS PrivateLink for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html) can
be combined with VPC interface endpoints to provide secure connectivity to the managed storage volume. This setup ensures that
read and write operations from your external query engine to the native iceberg volume use the AWS internal network instead of the
public internet.

### Benefits

Implementing VPC interface endpoints to access Snowflake-managed storage volumes provides the following advantages:

* Data doesn’t traverse the public internet when external query engines read from or write to the Snowflake managed iceberg volume.
* Client and SaaS applications, such as Microsoft PowerBI, that run outside of the AWS VPC can connect to Snowflake securely.
* Administrators aren’t required to modify firewall settings to access volume data.
* Administrators can implement consistent security and monitoring for how query engines connect to storage.

### Limitations

AWS doesn’t support cross-region VPC interface endpoints for the Amazon S3 service. Therefore, your VPC interface endpoint must be
located in the same region as your Snowflake account to provide inbound connectivity to your Snowflake managed
storage volume.

Cross-region support for AWS PrivateLink isn’t available in government regions or in the People’s Republic of China.

Customers that use a SnowGov region for Federal Information Processing Standard (FIPS) compliance should be aware that AWS Privatelink for
Amazon S3 doesn’t support FIPS endpoints.

For more information about the AWS regions in which FIPS is enforced, see [Supported cloud regions](intro-regions.md).

For information about finding the region names for your account, see [Find the cloud-provider’s name of the region for your account](admin-security-privatelink.md).

For more information about limitations of AWS PrivateLink, see the
[AWS documentation](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html#privatelink-limitations).

## Getting started

Before configuring AWS and Snowflake to allow requests to access a Snowflake-managed storage volume through AWS PrivateLink, you
must meet the prerequisites.

### Prerequisites

* [AWS PrivateLink for S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html).

  > > **Important:**
  > >
  > > AWS PrivateLink for S3 is an AWS service that must be enabled in your cloud environment.
  > >
  > > For help with configuring and implementing this service, contact your internal AWS administrator.
* Update the firewall allow-listing as follows:

  + If using an outbound firewall, ensure that it allows all the URLs required by Snowflake. For details, see [SnowCD (Connectivity Diagnostic Tool)](snowcd.md).
* For `us-east-1` customers only: If using one of the following Snowflake clients to connect to Snowflake, please upgrade to the
  client version as follows:

  + JDBC driver: 3.13.3 (or higher)
  + ODBC driver: 2.23.2 (or higher)
  + Python Connector for Snowflake: 2.5.1 (or higher)
  + SnowSQL: 1.2.17 (or higher)

    - Upgrade SnowSQL before using this feature. For more information, see [Installing SnowSQL](snowsql-install-config.md).
    - Starting with version 1.3.0, SnowSQL disables automatic upgrades by default to avoid potential issues that can affect production environments when an automatic upgrade occurs. To upgrade, you should download and install new versions manually, preferably in a non-production environment. Snowflake recommends you leave this setting disabled, but you can manually enable the auto-upgrade behavior by configuring the SnowSQL `noup` [option](snowsql-install-config.md) option.

## Accessing a Snowflake-managed storage volume with an interface endpoint

To configure a VPC interface endpoint to access a Snowflake-managed storage volume, the following roles in your organization must
coordinate:

1. The Snowflake account administrator (that is, a user with the Snowflake ACCOUNTADMIN system role).
2. The AWS administrator.
3. The network administrator.

Depending on the organization, it might be necessary to coordinate the configuration efforts with more than one person or team to implement the following configuration steps.

### Procedure

Complete the following steps to configure and implement secure access to a Snowflake-managed storage volume through a VPC endpoint:

1. As the AWS administrator, create a VPC endpoint to S3 using the AWS Console. Record the VPCE DNS Name for
   use in the next step; do not record any VPCE DNS zonal names.

   The VPCE DNS Name can be found by
   [describing an interface endpoint](https://docs.aws.amazon.com/vpc/latest/privatelink/vpce-interface.html#describe-interface-endpoint)
   once the endpoint is created.

   Example VPCE DNS Name: `*.vpce-000000000000a12-abc00ef0.s3.us-west-2.vpce.amazonaws.com`
2. Configure your external query engine to use the VPCE DNS name directly. Replace the `*` in the VPCE DNS name with `bucket`. For example, in Apache Spark:

   ```text
   .config("spark.sql.catalog.<catalog_name>.s3.endpoint",
           "bucket.vpce-000000000000a12-abc00ef0.s3.us-west-2.vpce.amazonaws.com")
   ```

   > **Tip:**
   >
   > Use a separate Snowflake account for testing, and configure a private hosted DNS zone in a test VPC to test the feature so
   > that the testing is isolated and doesn’t impact your other workloads.

## Blocking public access

After you configure VPC interface endpoints to access the managed storage volume through AWS PrivateLink, you can optionally
restrict access to the volume by using network rules and network policies.

### Prerequisites

To use network rules to restrict access to a Snowflake-managed storage volume, the account administrator must enable the
[ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME](../sql-reference/parameters.md) parameter:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME = true;
```

### Creating a network rule

Create a network rule with `MODE = SNOWFLAKE_MANAGED_STORAGE_VOLUME` and `TYPE = AWSVPCEID` to restrict access to the managed
storage volume based on VPC endpoint identifiers:

```sqlexample
CREATE NETWORK RULE managed_volume_rule
  TYPE = AWSVPCEID
  VALUE_LIST = ('vpce-123abc3420c1931')
  MODE = SNOWFLAKE_MANAGED_STORAGE_VOLUME
  COMMENT = 'Allow access from Horizon and S3 VPC endpoints';
```

### Applying a network policy

Create a network policy that uses the network rule and apply it to the account:

```sqlexample
CREATE NETWORK POLICY managed_volume_policy
  ALLOWED_NETWORK_RULE_LIST = ('managed_volume_rule')
  COMMENT = 'Restrict Snowflake-managed storage volume access to specific VPC endpoints';

ALTER ACCOUNT SET NETWORK_POLICY = managed_volume_policy;
```

---
title: Azure private endpoints for internal stages
source: https://docs.snowflake.com/en/user-guide/private-internal-stages-azure.md
section: User Guide
---

# Azure private endpoints for internal stages

This topic provides concepts as well as detailed instructions for connecting to Snowflake internal stages through Microsoft Azure Private
Endpoints.

## Overview

[Azure private endpoints](https://docs.microsoft.com/en-us/azure/private-link/private-endpoint-overview) and
[Azure Private Link](https://docs.microsoft.com/en-us/azure/private-link/private-link-overview) can be combined to provide secure
connectivity to Snowflake internal stages. This setup ensures that data loading and data unloading operations to Snowflake internal stages
use the Azure internal network and do not take place over the public internet.

Before Microsoft supported private endpoints for internal stage access, it was necessary to create a proxy farm within the Azure VNet to
facilitate secure access to Snowflake internal stages. With the added support of private endpoints for Snowflake internal stages, users
and client applications can now access Snowflake internal stages over the private Azure network. The following diagram summarizes this new support:

Note the following regarding the numbers in the BEFORE diagram:

* Users have two options to connect to a Snowflake internal stage:

  + Option A allows an on-premises connection directly to the internal stage as shown by the number 1.
  + Option B allows a connection to the internal stage through a proxy farm as shown by the numbers 2 and 3.
* If using the proxy farm, users can also connect to Snowflake directly as denoted by the number 4.

Note the following regarding the numbers in the AFTER diagram:

* For clarity, the diagram shows a single private endpoint from one Azure VNet pointing to a single Snowflake internal stage (6 and 7).

  Note that it is possible to configure multiple private endpoints, each within a different VNet, that point to the same Snowflake internal
  stage.
* The updates in this feature remove the need to connect to Snowflake or a Snowflake internal stage through a proxy farm.
* An on-premises user can connect to Snowflake directly as shown in number 5.
* To connect to a Snowflake internal stage, on-premises user connects to a private endpoint, number 6, and then uses Azure Private Link
  to connect to the Snowflake internal stage as shown in number 7.

In Azure, each Snowflake account has a dedicated storage account to use as an internal stage. The storage account URIs are different
depending on whether the connection to the storage account uses private connectivity (that is, Azure Private Link). The private connectivity
URL includes a `privatelink` segment in the URL.

Public storage account URI:
:   `<storage_account_name>.blob.core.windows.net`

Private connectivity storage account URI:
:   `<storage_account_name>.privatelink.blob.core.windows.net`

After you configure a private endpoint connection for your account’s internal
stage, Microsoft Azure automatically creates a CNAME record in the public DNS service that points the storage account host to its Azure
Private Link counterpart. This counterpart is `.privatelink.blob.core.windows.net`.

## Benefits

Implementing private endpoints to access Snowflake internal stages provides the following advantages:

* Internal stage data does not traverse the public internet.
* Client and SaaS applications, such as Microsoft PowerBI, that run outside of the Azure VNet can connect to Snowflake securely.
* Administrators are not required to modify firewall settings to access internal stage data.
* Administrators can implement consistent security and monitoring regarding how users connect to storage accounts.

## Limitations

Microsoft Azure defines how a private endpoint can interact with Snowflake:

* A single private endpoint can communicate to a single Snowflake Service Endpoint. You can have multiple one-to-one configurations that
  connect to the same Snowflake internal stage.
* The maximum number of private endpoints in your storage account that can connect to a Snowflake internal stage is fixed. For details, see
  [Standard storage account limits](https://learn.microsoft.com/en-us/azure/azure-resource-manager/management/azure-subscription-service-limits#standard-storage-account-limits).

## Configuring private endpoints to access Snowflake internal stages

To configure private endpoints to access Snowflake internal stages, you must have support from the following three roles in your
organization:

1. The Snowflake account administrator (that is, a user with the Snowflake ACCOUNTADMIN system role).
2. The Microsoft Azure administrator.
3. The network administrator.

Depending on the organization, it may be necessary to coordinate the configuration efforts with more than one person or team to implement
the following configuration steps.

Complete the following steps to configure and implement secure access to Snowflake internal stages through Azure private endpoints:

1. Verify that your Azure subscription is registered with the Azure Storage resource manager. This step allows you to connect to the
   internal stage from a private endpoint.
2. As a Snowflake account administrator, run the following commands in your Snowflake account and record the `ResourceID` of the
   internal stage storage account defined by the `privatelink_internal_stage` key. For more information, see
   [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) and [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md).

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ALTER ACCOUNT SET ENABLE_INTERNAL_STAGES_PRIVATELINK = true;
   SELECT KEY, VALUE FROM TABLE(flatten(input=>parse_json(system$get_privatelink_config())));
   ```
3. As the Azure administrator, create a private endpoint through the Azure portal.

   View the private endpoint properties and record the resource ID value. You will provide this value as the `privateEndpointResourceID`
   function argument in the next step.

   Verify that the Target sub-resource value is set to `blob`.

   For more information, see the Microsoft Azure Private Link [documentation](https://docs.microsoft.com/en-us/azure/private-link/).

   > **Important:**
   >
   > Before you proceed with the next step to authorize the private endpoint, you should be aware of the Microsoft Azure DNS behavior when a private
   > endpoint is authorized on a storage location *for the very first time*.
   >
   > When the first private endpoint is connected and authorized, Azure automatically creates a CNAME record in its public DNS for
   > `storage-account-name.privatelink.blob.core.windows.net`.
   >
   > Under normal circumstances, this DNS update should not affect existing public connectivity to the storage account. However, if your
   > environment already has private DNS zones configured for `.privatelink.blob.core.windows.net`, this DNS update can lead to unintended
   > behavior. Specifically, existing storage clients attempting to access the public endpoint `storage-account-name.blob.core.windows.net`
   > may fail DNS resolution or be unable to reach the storage account using public IP.
   >
   > To avoid this issue, Microsoft recommends enabling the Fallback to Internet option in the private DNS zone configuration before
   > authorizing the first private endpoint. This guidance also appears as a cautionary note in the Microsoft Azure [DNS zone configuration documentation](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-dns#azure-services-dns-zone-configuration).
4. As the Snowflake administrator, call the [SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS](../sql-reference/functions/system_authorize_stage_privatelink_access.md) function using the
   `privateEndpointResourceID` value as the function argument. This step authorizes access to the Snowflake internal stage through the
   private endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS('<privateEndpointResourceID>');
   ```

   If necessary, complete these steps to revoke access to the internal
   stage.
5. Involve your network administrator to update the DNS settings in a private DNS zone. The settings must resolve the privatelink blob URL
   `<storage_account_name>.privatelink.blob.core.windows.net` to the private IP address(es) of the Azure private endpoint that connects
   to your storage account internal stage.

   For more information, see
   [Azure Private Endpoint DNS configuration](https://docs.microsoft.com/en-us/azure/private-link/private-endpoint-dns).

   > **Tip:**
   > * Use a separate Snowflake account for testing, and configure a private DNS zone in a test VNet to test the feature so that the testing
   >   is isolated and does not impact your other workloads.
   > * If using a separate Snowflake account is not possible, use a test user to access Snowflake from a test VPC where the DNS changes are
   >   made.
   > * To test from on-premises applications, use DNS forwarding to forward requests to the Azure private DNS in the VNet where the DNS
   >   settings are made. Run the following command from the client machine to verify that the IP address returned is the private IP
   >   address for the storage account:
   >
   >   ```bash
   >   dig <storage_account_name>.blob.core.windows.net
   >   ```

## Blocking public access — *Recommended*

After you configure private endpoints to access the internal stage using Azure Private Link, you can optionally block requests from
public IP addresses to the internal stage. After blocking public access, all traffic must be through the private endpoint.

Controlling public access to an Azure internal stage differs from controlling public access to the Snowflake service. You use the
[SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](../sql-reference/functions/system_block_internal_stages_public_access.md) function, not a network policy, to block requests to the internal
stage. Unlike network policies, this function can’t block some public IP addresses while allowing others. Calling the SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function blocks all public IP addresses.

> **Important:**
>
> Confirm that traffic using private connectivity is successfully reaching the internal stage before blocking public access. Blocking
> public access without configuring private connectivity can cause unintended disruptions, including interference with managed services like
> Azure Data Factory.

The SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function enforces its restrictions by altering the Networking settings of the Azure
storage account where the internal stage is located. These Azure settings are commonly referred to as the “storage account firewall
settings”. Calling this Snowflake system function does the following actions in Azure:

* Sets the Public network access field to Enabled from selected virtual networks and IP addresses.
* Adds Snowflake VNet subnet ids to the Virtual Networks section.
* Clears all IP addresses from the Firewall section.

To block all traffic from public IP addresses to the internal stage, call the following function:

```sqlexample
SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS();
```

The function can take a few minutes to complete.

### Blocking public access with IP allowlist exceptions

The [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION](../sql-reference/functions/system_block_internal_stages_public_access_with_exception.md) function extends the set of functions for
blocking public access to internal stages. While the SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function blocks all public IP addresses,
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION lets you block public access while maintaining an allowlist of IP addresses or CIDR blocks that are permitted to reach an internal stage location on Microsoft Azure.

> **Note:**
>
> This feature is not supported on Amazon Web Services or Google Cloud.

To block public access to internal stages on Microsoft Azure while allowing specific IP addresses or CIDR blocks, take the following steps:

1. Define IP allowlist exceptions
2. Verify function status
3. Test stage access with a pre-signed URL

#### Define IP allowlist exceptions

To create or modify an allowlist that defines which IP addresses can access an internal stage location on Microsoft Azure, call the
[SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION](../sql-reference/functions/system_block_internal_stages_public_access_with_exception.md) function and provide a comma-separated
list of IP addresses or CIDR ranges as function arguments. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION('1.2.3.4/24, 100.0.0.1, 101.0.0.0/31');
```

> **Note:**
>
> You can also call this function to replace an existing allowlist with a different one.

#### Verify function status

Check that the feature is active and view the IP allowlist by calling the
[SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](../sql-reference/functions/system_internal_stages_public_access_status.md) function:

```sqlexample
SELECT SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS();
```

#### Test stage access with a pre-signed URL

To confirm the allowlist is working correctly:

1. Ensure the [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) parameter is set to TRUE.
2. Create an internal stage and upload a sample file for testing.
3. Generate a pre-signed URL for that file and test access from different IP addresses. Only requests originating from allowlisted IPs
   should be allowed.

   ```sqlexample
   SELECT GET_PRESIGNED_URL(@my_stage, 'data/sample.csv');
   ```

#### Examples

Block public access while allowing specific IP addresses and CIDR ranges:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION('100.0.0.1', '1.2.3.0/24', '101.0.0.0/31');
```

```output
Public Access to internal stages is blocked. Private link is required to connect to internal stages of this account. Exceptions: 100.0.0.1, 1.2.3.0/24, 101.0.0.0/31
```

Replace the existing allowlist with a new set of exceptions:

```sqlexample
SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION('200.0.0.1', '10.0.0.0/16');
```

```output
Public Access to internal stages is blocked. Private link is required to connect to internal stages of this account. Exceptions: 200.0.0.1, 10.0.0.0/16
```

### Ensuring public access is blocked

To determine whether public IP addresses are able to access an internal stage, call the [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](../sql-reference/functions/system_internal_stages_public_access_status.md) function.

If the Azure settings are currently blocking all public traffic, the function returns `Public Access to internal stages is blocked`.
This verifies that the settings have not been changed since the SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function was called.

If at least some public IP addresses can access the internal stage, the function returns
`Public Access to internal stages is unblocked`.

### Unblocking public access

To allow public access to an internal stage that was previously blocked, call the [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](../sql-reference/functions/system_unblock_internal_stages_public_access.md) function.

Calling this function alters the Networking settings of the Azure storage account where the internal stage is located. It sets the
Azure Public network access field to Enabled from all networks.

## Revoking private endpoints to access Snowflake internal stages

To revoke access to Snowflake internal stages through Microsoft Azure private endpoints, complete the following steps:

1. As a Snowflake administrator, confirm that the [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) parameter is set to `TRUE`. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SHOW PARAMETERS LIKE 'enable_internal_stages_privatelink' IN ACCOUNT;
   ```
2. As a Snowflake administrator, call the [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](../sql-reference/functions/system_revoke_stage_privatelink_access.md) function to revoke access
   to the private endpoint, and use the same `privateEndpointResourceID` value that was used to originally authorize access to the private
   endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS('<privateEndpointResourceID>');
   ```
3. As an Azure administrator, delete the private endpoint through the Azure portal.
4. As a network administrator, remove the DNS and alias records that were used to resolve the storage account URLs.

At this point, the access to the private endpoint is revoked. The query result from calling the
[SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function shouldn’t return the `privatelink_internal_stage` key and its
value.

## Troubleshooting

Azure applications that access Snowflake stages over the public internet and also use a private DNS service to resolve service host names
cannot access Snowflake stages if a private endpoint connection is established to the stage as described in this topic.

If any application has configured a private DNS region for the same domain, then Microsoft Azure tries to resolve the storage account host
by querying the private DNS service. If the entry for the storage account is not found in the private DNS service, a connection error occurs.

To address this issue, use one of the following two options:

1. Remove or dissociate the private DNS region from the application.
2. Create a CNAME record for the storage account private hostname — that is, `<storage_account_name>.privatelink.blob.core.windows.net`
   — in the private DNS service and point it to the hostname specified by the output of this command:

   ```bash
   dig CNAME <storage_account_name>.privatelink.blob.core.windows.net
   ```

---
title: Azure private endpoints for Snowflake-managed storage volumes
source: https://docs.snowflake.com/en/user-guide/private-managed-volumes-azure.md
section: User Guide
---

# Azure private endpoints for Snowflake-managed storage volumes

This topic provides concepts and detailed instructions for connecting to Snowflake-managed storage volumes through Microsoft
Azure private endpoints. Snowflake-managed storage volumes are the storage locations for
[Apache Iceberg tables that use Snowflake as the catalog](tables-iceberg.md).

## Overview

When you use an external query engine such as Apache Spark or Databricks to read from or write to an iceberg table that uses
Snowflake-managed storage, the query engine communicates directly with the native iceberg volume hosted on Azure Storage. By
default, this traffic can traverse the public internet.

[Azure private endpoints](https://docs.microsoft.com/en-us/azure/private-link/private-endpoint-overview) and
[Azure Private Link](https://docs.microsoft.com/en-us/azure/private-link/private-link-overview) can be combined to provide
secure connectivity to Snowflake-managed storage volumes. This setup ensures that read and write operations from your external
query engine to the native iceberg volume use the Azure internal network instead of the public internet.

## Benefits

Implementing private endpoints to access Snowflake-managed storage volumes provides the following advantages:

* Data doesn’t traverse the public internet when external query engines read from or write to the native iceberg volume.
* Administrators can implement consistent security and monitoring for how query engines connect to storage accounts.
* Administrators aren’t required to modify firewall settings to access storage volume data.

## Limitations

Microsoft Azure defines how a private endpoint can interact with Snowflake:

* A single private endpoint can communicate to a single Snowflake Service Endpoint. You can have multiple one-to-one
  configurations that connect to the same managed storage volume.
* The maximum number of private endpoints in your storage account that can connect to a Snowflake-managed storage volume is fixed.
  For details, see
  [Standard storage account limits](https://learn.microsoft.com/en-us/azure/azure-resource-manager/management/azure-subscription-service-limits#standard-storage-account-limits).

## Configuring private endpoints to access Snowflake-managed storage volumes

To configure private endpoints to access Snowflake-managed storage volumes, you must have support from the following three roles in
your organization:

1. The Snowflake account administrator (that is, a user with the Snowflake ACCOUNTADMIN system role).
2. The Microsoft Azure administrator.
3. The network administrator.

Depending on the organization, it may be necessary to coordinate the configuration efforts with more than one person or team to
implement the following configuration steps.

Complete the following steps to configure and implement secure access to Snowflake-managed storage volumes through Azure private
endpoints:

1. Verify that your Azure subscription is registered with the Azure Storage resource manager. This step allows you to connect to
   the managed storage volume from a private endpoint.
2. As a Snowflake account administrator, run the following commands in your Snowflake account. Record the resource Id of your non-failsafe and failsafe
   Snowflake-managed storage volume’s storage account respectively defined by the `privatelink-snowflake-managed-storage-volume-nfs` and
   `privatelink-snowflake-managed-storage-volume-fs` keys. For more information, see
   [ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK](../sql-reference/parameters.md) and
   [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md).

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ALTER ACCOUNT SET ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK = true;
   SELECT KEY, VALUE FROM TABLE(FLATTEN(input=>PARSE_JSON(SYSTEM$GET_PRIVATELINK_CONFIG())));
   ```
3. As the Azure administrator, create a private endpoint through the Azure portal to each of your Snowflake-managed storage volumes.

   View the private endpoint properties and record the resource ID value. You provide this value as the
   `privateEndpointResourceID` function argument in the next step.

   For more information, see the Microsoft Azure Private Link [documentation](https://docs.microsoft.com/en-us/azure/private-link/).

   > **Important:**
   >
   > Before you proceed with the next step to authorize the private endpoint, you should be aware of the Microsoft Azure DNS behavior
   > when a private endpoint is authorized on a storage location *for the very first time*.
   >
   > When the first private endpoint is connected and authorized, Azure automatically creates a CNAME record in its public DNS.
   >
   > Under normal circumstances, this DNS update should not affect existing public connectivity to the storage account. However,
   > if your environment already has private DNS zones configured, this DNS update can lead to unintended behavior.
   >
   > To avoid this issue, Microsoft recommends enabling the Fallback to Internet option in the private DNS zone
   > configuration before authorizing the first private endpoint.
4. As the Snowflake administrator, call the
   [SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](../sql-reference/functions/system_authorize_snowflake_managed_storage_volume_privatelink_access.md) function using the
   `privateEndpointResourceID` value as the function argument. This step authorizes access to the Snowflake-managed storage
   volume through the private endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS('<privateEndpointResourceID>');
   ```

   If necessary, complete these steps to revoke access to the
   Snowflake-managed storage volume.
5. Involve your network administrator to update the DNS settings in a private DNS zone. The settings must resolve the privatelink
   URL to the private IP address(es) of the Azure private endpoint that connects to your Snowflake-managed storage volume’s storage account.

   For more information, see
   [Azure Private Endpoint DNS configuration](https://docs.microsoft.com/en-us/azure/private-link/private-endpoint-dns).

   > **Tip:**
   > * Use a separate Snowflake account for testing, and configure a private DNS zone in a test VNet to test the feature so that
   >   the testing is isolated and doesn’t impact your other workloads.
   > * If using a separate Snowflake account is not possible, use a test user to access Snowflake from a VNet where the DNS changes
   >   are made.
   > * To test from on-premises applications, use DNS forwarding to forward requests to the Azure private DNS in the VNet where the
   >   DNS settings are made.

## Blocking public access

After you configure private endpoints to access the managed storage volume using Azure Private Link, you can optionally block
requests from public IP addresses to the managed storage volume. After blocking public access, all traffic must be through the
private endpoint.

> **Important:**
>
> Confirm that traffic using private connectivity is successfully reaching the managed storage volume before blocking
> public access. Blocking public access without configuring private connectivity can cause unintended disruptions.

To block all traffic from public IP addresses to the managed storage volume, call the following function:

```sqlexample
SELECT SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS();
```

The function can take a few minutes to complete.

### Blocking public access with IP allowlist exceptions

The [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION](../sql-reference/functions/system_block_snowflake_managed_storage_volume_public_access_with_exception.md) function lets you
block public access while maintaining an allowlist of IP addresses or CIDR blocks that are permitted to reach the managed storage
volume.

To block public access while allowing specific IP addresses or CIDR blocks:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION('1.2.3.4/24, 100.0.0.1');
```

### Ensuring public access is blocked

To determine whether public IP addresses can access a Snowflake-managed storage volume, call the
[SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS](../sql-reference/functions/system_snowflake_managed_storage_volume_public_access_status.md) function.

```sqlexample
SELECT SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS();
```

### Unblocking public access

To allow public access to a Snowflake-managed storage volume that was previously blocked, call the
[SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](../sql-reference/functions/system_unblock_snowflake_managed_storage_volume_public_access.md) function.

```sqlexample
SELECT SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS();
```

## Revoking private endpoints to access Snowflake-managed storage volumes

To revoke access to Snowflake-managed storage volumes through Microsoft Azure private endpoints, complete the following steps:

1. As a Snowflake administrator, confirm that the
   [ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK](../sql-reference/parameters.md) parameter is set to `TRUE`. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SHOW PARAMETERS LIKE 'ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK' IN ACCOUNT;
   ```
2. As a Snowflake administrator, call the
   [SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](../sql-reference/functions/system_revoke_snowflake_managed_storage_volume_privatelink_access.md) function to revoke access to
   the private endpoint, using the same `privateEndpointResourceID` value that was used to originally authorize access.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS('<privateEndpointResourceID>');
   ```
3. As an Azure administrator, delete the private endpoint through the Azure portal.
4. As a network administrator, remove the DNS and alias records that were used to resolve the storage account URLs.

---
title: Azure Private Link and Snowflake
source: https://docs.snowflake.com/en/user-guide/privatelink-azure.md
section: User Guide
---

# Azure Private Link and Snowflake

This topic describes how to configure Azure Private Link to connect your Azure Virtual Network (VNet) to the Snowflake VNet in Azure.

Note that Azure Private Link is not a service provided by Snowflake. It is a Microsoft service that Snowflake enables for use with
your Snowflake account.

## Overview

[Azure Private Link](https://docs.microsoft.com/en-us/azure/private-link/private-link-overview) provides private connectivity to Snowflake
by ensuring that access to Snowflake is through a private IP address. Traffic can only occur from the customer virtual network (VNet) to the
Snowflake VNet using the Microsoft backbone and avoids the public Internet. This significantly simplifies the network configuration by
keeping access rules private while providing secure and private communication.

The following diagram summarizes the Azure Private Link architecture with respect to the customer VNet and the Snowflake VNet.

From either a virtual machine (1) or through peering (2), you can connect to the Azure Private Link endpoint (3) in your virtual network.
That endpoint then connects to the Private Link Service (4) and routes to Snowflake.

Here are the high-level steps to integrate Snowflake with Azure Private Link:

1. Create a Private Endpoint.
2. Generate and retrieve an access token from your Azure subscription.

   Note that if you plan to use Azure Private Link to connect to a Snowflake internal stage or Snowflake-managed storage volume on
   Azure, you must register your subscription with the Azure Storage resource provider before connecting from a private endpoint.
3. Enable your Snowflake account on Azure to use Azure Private Link.
4. Update your outbound firewall settings to allow the Snowflake account URL and OCSP URL.
5. Update your DNS server to resolve your account URL and OCSP URL to the Private Link IP address. You can add the DNS entry to your
   on-premises DNS server or private DNS on your VNet, and use DNS forwarding to direct queries for the entry from other locations where
   your users will access Snowflake.
6. After the Private Endpoint displays a CONNECTION STATE value of Approved, test your connection to Snowflake with
   [SnowCD (Connectivity Diagnostic Tool)](snowcd.md) and [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md).
7. Connect to Snowflake using your private connectivity account URL.

To further harden your security posture, Snowflake recommends [Pinning private connectivity endpoints for inbound traffic](pin-private-endpoints.md) for your Snowflake account.

### Requirements and limitations

Before attempting to configure Azure Private Link to connect your Azure VNet to the Snowflake VNet on Azure, note the following:

* In Azure at the subnet level, optionally
  [enable a network policy](https://learn.microsoft.com/en-us/azure/private-link/disable-private-endpoint-network-policy?tabs=network-policy-portal)
  for the Private Endpoint.

  Verify that the TCP ports 443 and 80 allow traffic to `0.0.0.0` in the network security group of the Private Endpoint network card.

  For help with the port configuration, contact your internal Azure administrator.
* Use [ARM VNets](https://docs.microsoft.com/en-us/azure/azure-resource-manager/).
* Use IPv4 TCP traffic only.
* Currently, the self-service enablement process described in this topic does not support authorizing a managed Private Endpoint from Azure
  Data Factory, Synapse, or other managed services.

  For details on how to configure a managed private endpoint for this use case, see this
  [article](https://community.snowflake.com/s/article/How-to-set-up-a-managed-private-endpoint-from-Azure-Data-Factory-or-Synapse-to-Snowflake)
  (in the Snowflake community).

For more information on the requirements and limitations of Microsoft Azure Private Link, see the Microsoft documentation on
[Private Endpoint Limitations](https://docs.microsoft.com/en-us/azure/private-link/private-endpoint-overview#limitations) and
[Private Link Service Limitations](https://docs.microsoft.com/en-us/azure/private-link/private-link-service-overview#limitations).

## Configure access to Snowflake with Azure Private Link

> **Attention:**
>
> This section only covers the Snowflake-specific details for configuring your VNet environment. Also, note that Snowflake is not
> responsible for the actual configuration of the required firewall updates and DNS records. If you encounter issues with any of these
> configuration tasks, please contact Microsoft Support directly.

This section describes how to configure your Azure VNet to connect to the Snowflake VNet on Azure using Azure Private Link. After initiating
the connection to Snowflake using Azure Private Link, you can determine the approval state of the connection in the Azure portal.

For installation help, see the Microsoft documentation for the
[Azure CLI](https://docs.microsoft.com/en-us/cli/azure/install-azure-cli?view=azure-cli-latest)
or [Azure PowerShell](https://docs.microsoft.com/en-us/powershell/azure/install-az-ps?view=azps-2.6.0).

Complete the configuration procedure to configure your Microsoft Azure VNet and initiate the Azure Private Link connection to Snowflake.

### Procedure

This procedure manually creates and initializes the necessary Azure Private Link resources to use Azure Private Link to connect to
Snowflake on Azure. Note that this procedure assumes that your use case does not involve Using SSO with Azure Private Link
(in this topic).

1. As a representative example using the Azure CLI, execute `az account list --output table`. Note the output values in the
   `Name`, `SubscriptionID` and `CloudName` columns.

   ```text
   Name     CloudName   SubscriptionId                        State    IsDefault
   -------  ----------  ------------------------------------  -------  ----------
   MyCloud  AzureCloud  13c...                                Enabled  True
   ```
2. Navigate to the Azure portal. Search for Private Link and click Private Link.
3. Click Private endpoints and then click Add.

#. In the Basics section, complete the Subscription, Resource group, Name, and Region fields
for your environment and then click Next: Resource.

1. In the Resource section, complete the Connection method and the Resource ID or alias Field fields.

   * For Connection Method, select the Connect to an Azure resource by resource ID or alias.
   * In Snowflake, execute [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) and input the value for `privatelink-pls-id`
     into the Resource ID or alias field. Note that the screenshot in this step uses the alias value for the `east-us-2`
     region as a representative example, and that Azure confirms a valid alias value with a green checkmark.
   * If you receive an error message regarding the alias value, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to receive the resource ID value and then
     repeat this step using the resource ID value.
2. Return to the Private endpoints section and allow a few minutes to wait. On approval, the Private Endpoint displays a
   CONNECTION STATE value of Pending. This value will update to Approved after completing the authorization in
   the next step.
3. Enable your Snowflake account on Azure to use Azure Private Link by completing the following steps:

   * In your command-line environment, record the private endpoint resource ID value using the following Azure CLI
     [network](https://docs.microsoft.com/en-us/cli/azure/network/private-endpoint?view=azure-cli-latest#az-network-private-endpoint-show)
     command:

     > ```bash
     > az network private-endpoint show
     > ```
     >
     > The private endpoint was created in the previous steps using the template files. The resource ID value takes the following form,
     > which has a truncated value:
     >
     > `/subscriptions/26d.../resourcegroups/sf-1/providers/microsoft.network/privateendpoints/test-self-service`
   * In your command-line environment, execute the following
     [Azure CLI account](https://docs.microsoft.com/en-us/cli/azure/account?view=azure-cli-latest#az_account_get_access_token) command
     and save the output. The output will be used as the value for the `federated_token` argument in the next step.

     > ```bash
     > az account get-access-token --subscription <SubscriptionID>
     > ```

     Extract the access token value from the command output. This value will be used as the `federated_token` value in the next
     step. In this example, the values are truncated and the access token value is `eyJ...`:

     > ```sqljson
     > {
     >    "accessToken": "eyJ...",
     >    "expiresOn": "2021-05-21 21:38:31.401332",
     >    "subscription": "0cc...",
     >    "tenant": "d47...",
     >    "tokenType": "Bearer"
     >  }
     > ```
     >
     > > **Important:**
     > >
     > > The user generating the Azure access Token must have Read permissions on the Subscription. The least privilege permission is
     > > [Microsoft.Subscription/subscriptions/acceptOwnershipStatus/read](https://docs.microsoft.com/en-us/azure/role-based-access-control/resource-provider-operations#microsoftsubscription).
     > > Alternatively, the default role `Reader` grants more coarse-grained permissions.
     > >
     > > The `accessToken` value is sensitive information and should be treated like a password value — do not share this
     > > value.
     > >
     > > If it is necessary to contact Snowflake Support, redact the access token from any commands and URLs before creating a support
     > > ticket.
   * In Snowflake, call the [SYSTEM$AUTHORIZE_PRIVATELINK](../sql-reference/functions/system_authorize_privatelink.md) function,
     using the `private-endpoint-resource-id` value and the `federated_token` value as arguments, which are truncated in
     this example:

     > ```sqlexample
     > USE ROLE ACCOUNTADMIN;
     >
     > SELECT SYSTEM$AUTHORIZE_PRIVATELINK (
     >   '/subscriptions/26d.../resourcegroups/sf-1/providers/microsoft.network/privateendpoints/test-self-service',
     >   'eyJ...'
     >   );
     > ```

   To verify your authorized configuration, call the [SYSTEM$GET_PRIVATELINK](../sql-reference/functions/system_get_privatelink.md) function in your
   Snowflake account on Azure. Snowflake returns `Account is authorized for PrivateLink.` for a successful authorization.

   If it is necessary to *disable* Azure Private Link in your Snowflake account, call the
   [SYSTEM$REVOKE_PRIVATELINK](../sql-reference/functions/system_revoke_privatelink.md) function, using the argument values for
   `private-endpoint-resource-id` and `federated_token`.
4. DNS Setup. All requests to Snowflake need to be routed via the Private Endpoint. Update your DNS to resolve the Snowflake account and
   OCSP URLs to the private IP address of your Private Endpoint.

   * To get the endpoint IP address, navigate to Azure portal search bar and enter the name of the endpoint
     (i.e. the NAME value from Step 5). Locate the Network Interface result and click it.
   * Copy the value for the Private IP address (i.e. `10.0.27.5`).
   * Configure your DNS to have the appropriate endpoint values from the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md)
     function resolve to the private IP address.

     The values to obtain from the output of SYSTEM$GET_PRIVATELINK_CONFIG depend on which Snowflake features you access using private
     connectivity. For a description of the possible values, see [Return values](../sql-reference/functions/system_get_privatelink_config.md).

     Note that the values for `regionless-snowsight-privatelink-url` and `snowsight-privatelink-url` allow access to
     Snowsight and the Snowflake Marketplace using private connectivity. However, there is additional configuration if you want to enable
     URL redirects. For information, see [Snowsight & Private Connectivity](ui-snowsight-gs.md).

     > **Note:**
     >
     > A full explanation of DNS configuration is beyond the scope of this procedure. For example, you can choose to integrate an
     > [Azure Private DNS zone](https://docs.microsoft.com/en-us/azure/dns/private-dns-privatednszone) into your environment. Please
     > consult your internal Azure and Cloud Infrastructure administrators to configure and resolve the URLs in DNS properly.
5. After verifying your outbound firewall settings and DNS records include your Azure Private Link account and OCSP URLs, test your
   connection to Snowflake with [SnowCD (Connectivity Diagnostic Tool)](snowcd.md) and [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md).
6. Connect to Snowflake with your private connectivity account [URL](organizations-connect.md).

   Note that if you want to connect to Snowsight via Azure Private Link, follow the instructions in the
   [Snowsight documentation](ui-snowsight-gs.md).

## Using SSO with Azure Private Link

Snowflake supports using SSO with Azure Private Link. For more information, see:

* [SSO with private connectivity](admin-security-fed-auth-overview.md)
* [Partner applications](oauth-snowflake-overview.md)

## Using Client Redirect with Azure Private Link

Snowflake supports using Client Redirect with Azure Private Link.

For more information, see [Redirecting client connections](client-redirect.md).

## Using replication and Tri-Secret Secure with private connectivity

Snowflake supports replicating your data from the source account to the target account, regardless of whether you enable
Tri-Secret Secure or this feature in the target account.

## Blocking public access — *Recommended*

After testing the Azure Private Link connectivity with Snowflake, you can optionally block public access to Snowflake using
[Controlling network traffic with network policies](network-policies.md).

Configure the CIDR block range to block public access to Snowflake using your organization’s IP address range. This range can be
from within your virtual network.

Once the CIDR Block ranges are set, only IP addresses within the CIDR block range can access Snowflake.

To block public access using a network policy:

1. Create a new network policy or edit an existing network policy. Add the CIDR block range for your organization.
2. Activate the network policy for your account.

---
title: Azure Private Link and Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-inbound-configure-azure.md
section: User Guide
---

# Azure Private Link and Snowflake Open Catalog

This topic describes how to configure Azure Private Link to directly connect your Snowflake Open Catalog account to your query engine by
using inbound private connectivity.

## Prerequisites

* Your Snowflake Open Catalog account is hosted on Azure.
* You have the necessary permissions to configure your DNS service with the private connectivity URL for your Open Catalog account.

## Step 1: Retrieve your Open Catalog account settings

Retrieve the following settings for configuring access to Open Catalog with Azure Private Link.

1. Sign in to Snowflake Open Catalog.
2. In the navigation menu, select **Settings**.
3. On the Settings page, copy the values for the following settings into a text editor:

   * PrivateLink Account URL
   * Regionless PrivateLink Account URL
   * PrivateLink OCSP URL
   * Regionless PrivateLink OCSP URL
   * Private Link Service ID

You paste these values when you Configure access to Open Catalog with Azure Private Link and
Connect to Open Catalog through Azure Private Link.

For descriptions of each setting, see
[Return values for the SYSTEM$GET_PRIVATELINK_CONFIG system function](https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_config#returns) in the Snowflake documentation. In this topic, the names of the account settings are in JSON format.

> **Note:**
>
> Remember that, where applicable, the description refers to a Snowflake account but your value is actually for your Snowflake Open
> Catalog account. For example, the `privatelink-account-url` is the URL for your Snowflake Open Catalog account.
>
> * Optional: To retrieve these values in JSON format, [Create a Snowflake CLI connection for Open Catalog](private-connectivity-outbound-manage-endpoints-aws.md),
>   and then call the SYSTEM$GET_PRIVATELINK_CONFIG system function.

## Step 2: Configure access to Open Catalog with Azure Private Link

> **Attention:**
>
> This section only covers the Open Catalog–specific details for configuring your VNet environment. Also, note that Snowflake is not
> responsible for the actual configuration of the required firewall updates and DNS records. If you have issues with any of these
> configuration tasks, contact Microsoft Support directly.

This section describes how to connect your VNet to the Open Catalog VNet using Azure Private Link.

To complete the instructions, you need to use the Azure CLI or Azure PowerShell. For installation help, see the Microsoft documentation
for the [Azure CLI](https://learn.microsoft.com/en-us/cli/azure/install-azure-cli?view=azure-cli-latest)
or [Azure PowerShell](https://learn.microsoft.com/en-us/powershell/azure/install-azure-powershell?view=azps-13.4.0&amp;viewFallbackFrom=azps-2.6.0).

After initiating the connection to Snowflake Open Catalog using Azure Private Link, you can determine the approval state of the connection
in the Azure portal.

### Create a private endpoint

> **Note:**
>
> If you already created a private endpoint for your Snowflake account, and the account is in the same deployment as your Open Catalog account,
> creating a new private endpoint for your Open Catalog account isn’t required. You can optionally skip this step.

1. Retrieve your Azure account details. The following example uses the Azure CLI’s `az account list` command.

   ```text
   Name     CloudName   SubscriptionId                        State    IsDefault
   -------  ----------  ------------------------------------  -------  ----------
   MyCloud  AzureCloud  13c...                                Enabled  True
   ```
2. In the Azure portal, search for **Private Link**, and then select **Private Link** in the results.
3. Click **Private endpoints**, and then click **Add**.
4. On the **Basics** tab, complete the **Subscription**, **Resource group**, **Name**, and **Region** fields for your
   environment and then click **Next: Resource**.
5. On the **Resource** tab, for **Connection Method**, select **Connect to an Azure resource by resource ID or alias**.
6. For **Resource ID or alias**, enter the value for `Private Link Service ID` that you obtained when you
   retrieved your Open Catalog account settings for private connectivity.

   If you receive an error message regarding the alias value, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support)
   for the resource ID value, and then repeat this step using that value.

When the private endpoint is approved, the **CONNECTION STATE** in the Private endpoints section on the Private Link Center page displays
the value *Pending*. This value changes to *Approved* when you complete the authorization in the next procedure.

### Enable inbound private connectivity

In this procedure, you enable Azure Private Link for your Open Catalog account. This configuration allows the query engine to connect to
Open Catalog through private connectivity. You will need your private endpoint resource ID, a subscription ID, and the federated token value
that contains access credentials for a federated user.

1. To obtain these values, execute the following commands in the Azure CLI:

   1. To obtain your private endpoint resource ID, execute the following command, and copy the value into a text editor:

      ```bash
      az network private-endpoint show
      ```
   2. To obtain the subscription ID, execute the following command, and note the value in the SubscriptionID column in the output:

      ```bash
      az account list --output table
      ```
   3. To obtain the federated token value, execute the following command, and copy the accessToken value into a text editor:

      ```bash
      az account get-access-token --subscription <SubscriptionID>
      ```

      * Where: `SubscriptionID` is the unique identifier you obtained in the previous step.
      > **Important:**
      >
      > The user generating the Azure access Token must have Read permissions on the Subscription. The least privilege permission is
      > [Microsoft.Subscription/subscriptions/acceptOwnershipStatus/read](https://docs.microsoft.com/en-us/azure/role-based-access-control/resource-provider-operations#microsoftsubscription).
      > Alternatively, the default role `Reader` grants more coarse-grained permissions.
      >
      > The `accessToken` value is sensitive information and should be treated like a password value — do *not* share this value.
      >
      > If it is necessary to contact Snowflake Support, redact the access token from any commands and URLs before creating a support ticket.
2. Sign in to Snowflake Open Catalog.
3. In the navigation menu, select **Settings**.
4. Select **Authorize**.
5. In the Authorize Private Link dialog, enable private connectivity for your account:

   1. For **ID**, enter the private endpoint resource ID that you copied to a text editor.
   2. For **Federated token**, enter the federated token value that you copied to a text editor.
   3. Select **Save**.

### Verify that your account is authorized

Follow these steps to verify whether your Open Catalog account is authorized for private connectivity to the Snowflake Open Catalog service.

1. Sign in to Snowflake Open Catalog.
2. In the navigation menu, select **Settings**.
3. Select **Get**.
4. In the Get Private Link authorization dialog, verify your account:

   1. In the **ID** field, enter your private endpoint resource ID. You retrieved this value when you
      enabled inbound private connectivity.
   2. In the **Federated token** field, enter the federated token value.
      You retrieved this value when you
      enabled inbound private connectivity.
   3. Select **Save**. A message appears, which states whether your account is authorized.

### Set up DNS

All requests to Open Catalog must be routed through the private endpoint. To resolve the Open Catalog account and OCSP URLs to the private IP address of your private endpoint, update your DNS.

1. To get the endpoint IP address, in the Azure portal search bar, enter the name of the private endpoint you created.
2. Select the Network Interface result.
3. Copy the value for the **Private IP address**.
4. Configure your DNS to have the appropriate endpoint values from your Open Catalog account settings for private connectivity
   resolve to the private IP address.

## Step 3: Connect to Open Catalog through Azure Private Link

* To register a service connection and connect your query engine to Open Catalog through Azure Private Link, use the following code:

  ```python
  import pyspark
  from pyspark.sql import SparkSession

  spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_privatelink_account_url>/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','<client_id>:<client_secret>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
    .getOrCreate()
  ```

### Parameters

> **Note:**
>
> Ensure that you set up your DNS service to match the value you specify for `<open_catalog_account_identifier>`.

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<client_id>` | Specifies the client ID for the service principal to use.   Enter the **Client ID** that you copied when you configured a new service connection. |
| `<client_secret>` | Specifies the client secret for the service principal to use.   Enter the **Secret** that you copied when you configured a new service connection. |
| `<open_catalog_privatelink_account_url>` | Specifies the URL to connect to your Snowflake account using AWS PrivateLink or Azure Private Link.   Enter one of the following values, which you copied when you retrieved your Open Catalog account settings:  * **PrivateLink Account URL** * **Regionless PrivateLink Account URL**  For details on retrieving your Open Catalog account settings, see the instructions for the cloud platform where your Open Catalog account is hosted:    * [AWS](private-connectivity-inbound-configure-aws.md) * Azure |
| `<principal_role_name>` | Specifies the principal role that is granted to the service principal.  To view this principal role, in Open Catalog, select the **Connections** page, select your service connection, and in the **Principal Details** dialog, refer to **Principal Roles.** |

## Step 4 (Optional): Create a catalog integration for Snowflake

If you’re using Snowflake to query Open Catalog-managed tables, create a catalog for Snowflake that uses a private IP address. To create
this catalog integration, your Snowflake account must be in the same deployment as your Open Catalog account.

For an example, see [Example: Catalog integration that uses a private IP address](../tables-iceberg-open-catalog-query.md) in the Snowflake documentation.

> **Note:**
>
> You can also configure private connectivity for the Snowflake Open Catalog UI. This configuration, combined with configuring private
> connectivity for your Open Catalog account, allows you to access the Open Catalog UI through private connectivity instead of over the public
> internet.
>
> To configure this access, see
> [Configure private connectivity for the Snowflake Open Catalog UI](private-connectivity-ui-configure.md).

---
title: Backups for disaster recovery and immutable storage
source: https://docs.snowflake.com/en/user-guide/backups.md
section: User Guide
---

# Backups for disaster recovery and immutable storage

Backups help organizations protect critical data against modification or deletion.

Backups represent discrete snapshots of Snowflake objects. You choose which objects to back up, how frequently to
back them up, how long to keep the backups, and whether to add a retention lock so that they can’t
be deleted prematurely.

## Use cases for Snowflake backups

The following use cases are typical applications of backups:

Regulatory compliance:
:   Backups with retention lock help organizations, financial institutions, and
    related industries address regulations that require records to be retained in an immutable format.

    > **Note:**
    >
    > Snowflake has engaged Cohasset Associates to perform an independent assessment of our Backups
    > feature for compliance with key regulatory recordkeeping requirements, including SEC 17a-4(f),
    > SEC 18a-6(e), FINRA Rule 4511(c), and CFTC Rule 1.31(c)-(d). This Cohasset assessment provides
    > independent, third-party verification that Snowflake’s immutable storage controls support the
    > creation, protection, and retention of data, and provides customers with confidence that
    > Snowflake meets critical industry standards for regulated data retention subject to the
    > evaluated regulations.
    >
    > For the full compliance report that applies to Snowflake backups with retention lock,
    > see [the Snowflake Compliance Center](https://trust.snowflake.com/resources?s=jv88d2ujxzj9kcnz43w31&name=snowflake-cohasset-assessment).

Recovery:
:   Backups help organizations create discrete snapshots to protect and recover business-critical data
    in case of accidental modifications or deletions.

Cyber resilience:
:   Backups with retention lock are part of an overall cyber-resilience strategy. They help organizations
    protect business-critical data during cyber attacks, especially ransomware attacks. The retention lock ensures that this data
    can’t be deleted by the attacker, even if they gain access to the account by using the ACCOUNTADMIN or ORGADMIN roles.

## Key concepts

This section provides an overview of the key concepts for backups in Snowflake.

### Backup

A *backup* represents a point-in-time snapshot of an object.

* The object can be a single table, a schema, or an entire database.
* A specific backup can be identified by a unique ID generated by Snowflake.
* A backup can’t be modified. It can, however, be deleted, and the backup expiration period can be modified
  (unless a retention lock is applied).

During day-to-day operations, you rarely interact with individual backups. Instead, you manage the *backup sets* that contain
them. For example, you get a list of backups by running the SHOW BACKUPS IN BACKUP SET command. You create a new backup by
running an ALTER BACKUP SET command.

### Backup set

A *backup set* is a schema-level object that contains a set of backups for a specific database, schema, or table.
Snowflake has SQL commands to CREATE, ALTER, DROP, SHOW, and DESCRIBE backup sets.

You can have multiple backup sets for the same object.

The life cycle of the backups within a set is determined by an optional *backup policy* that you can attach to the backup set.
You can also add or delete backups manually in a backup set. Your ability to delete backups is affected by
other factors, in particular *retention lock* and *legal hold*.

### Backup policy

A *backup policy* is a schema-level object that contains the settings that define the life cycle of the backups within a backup
set. These settings include schedule, expiration, and retention lock.

* The *schedule* determines when backups are created. The schedule can be defined as
  an interval in minutes, or as a cron expression.
  For example, if the schedule is set to one hour, a backup of the object is taken every 60 minutes.
* The *expiration period* is the length of time the backup is valid. After a backup expires,
  Snowflake deletes it automatically, unless a legal hold is applied to that particular backup.

  > **Tip:**
  >
  > If the backup set doesn’t have a retention lock and the particular backup doesn’t have a legal
  > hold applied, you can delete the backup manually before the end of the expiration period.
  > You can manually delete backups one at a time, always starting with the oldest backup that
  > doesn’t have a legal hold.

Each backup policy must have one or both of the schedule and expiration period properties. For example, you can
create a policy with a schedule and an expiration period, and let Snowflake handle all creation and removal
of the backups in all backup sets where that policy is applied. Alternatively, you might
create a policy with a schedule and no expiration period if you want to manage removing older backups yourself.
Or, you can create a policy with an expiration period but without a schedule, and then manage
backup creation yourself. You can’t create a policy with no schedule and no expiration period.

If you associate a backup policy with a backup set, you can do so when you create the backup set, or you can apply the policy
later. Or, you can have a backup set that doesn’t have an associated backup policy. In that case, you manually control when to take
new backups and expire old ones.

You can apply a backup policy to multiple backup sets. If you modify a backup policy, Snowflake applies the changes to all
backup sets that the policy is attached to.

### Retention lock

A *retention lock* protects a backup from deletion for the defined expiration period.
You can use a backup with a retention lock for backups for regulatory compliance and cyber resilience.
The following restrictions apply for a backup set with retention lock:

* Backups can’t be deleted by any role, including the ACCOUNTADMIN role.
* You can’t decrease the backup expiration period, although you can increase the expiration period.
* You can’t drop a backup set if there are any unexpired backups in the set.
* You can’t drop a schema that contains a backup set with any unexpired backups.
* You can’t drop a database that contains a backup set with any unexpired backups.
* You can’t drop an account that contains a database with a backup set that has any unexpired backups.

> **Important:**
>
> Applying a backup policy with a retention lock to a backup set is *irreversible*.
> Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a backup set,
> you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
> you set a retention lock on a backup set with a long expiration period, to avoid unexpected storage charges
> for undeletable backup sets, and the schemas and databases that contain them.
>
> If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
> Snowflake deletes all backups, including those with retention locks. Deleting a Snowflake organization
> requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

### Legal hold

The *legal hold* feature of Snowflake backups prevents backups from being overwritten or deleted.
That way, you can preserve Snowflake databases, schemas, or tables based on your own legal requirements.

Snowflake lets you place a legal hold on specific backups.
When a Snowflake backup is under legal hold, the following conditions apply:

* Nobody can modify the backup.
* Nobody can delete the backup. That’s true even if the backup has passed its EXPIRE_AFTER_DAYS period.
* Access to the backup is logged and auditable.
* The legal hold can be removed by a privileged user, unlike a retention lock.

> **Important:**
>
> If you replicate a backup set, make sure to perform a refresh immediately after placing a legal hold on a backup
> in that backup set. If you perform a failover before you replicate the backup set that contains the legal hold, the original
> backup set can be overwritten when you fail back to the original primary account, potentially erasing the legal hold.

### Overview of the backup lifecycle

The following diagram shows how the Snowflake objects, backups, backup sets, and backup policies relate to each other.
The diagram involves the simplest kind of backup: one for a single table.
Each backup operation produces a new backup. All the backups for that particular object are grouped together
in a backup set. The automatic addition and removal of backups in the backup set is governed by the backup policy.
To recover the information from a backup, you use a CREATE command to create a new object from a specific backup.

## How backups work

Backups are *zero-copy* duplicates of a Snowflake object similar to [clones](object-clone.md). Backups don’t make
copies of table data when they are created. The backup mechanism backs up table data without incurring the additional cost
or time of copying the data.

Snowflake stores data in files that are immutable, and maintains pointers from backups to the data files that underlie the table. As the
table evolves and is modified, Snowflake ensures that each data file is protected from deletion as long as there is an unexpired
backup that references that file.

## Restrictions for backups

Snowflake enforces the following restrictions for backups:

* You can’t modify the retention lock for a backup policy.
* When a policy has a retention lock, you can increase the expiration period, but you can’t decrease it.
* The minimum schedule interval for scheduled backups is one hour (60 minutes).

## Comparison of backups with other disaster recovery and business continuity features

Backups provide the following advantages that are different from other Snowflake business continuity and disaster recovery
features, such as replication and Time Travel:

* You can enable long-term retention for backups. Long-term retention helps with recovery, regulatory compliance,
  and cyber resilience against threats such as ransomware or insider attacks.
* Retention lock ensures that backups can’t be deleted by any user, including account administrators.
* You can schedule backups on a different timeframe than you use for other data transfer operations, such as
  replication refreshes.
* You can backup and restore individual table objects, or container objects such as entire schemas or databases.
* You can prevent the retention time for backups from being reduced after the backup is taken, by using a backup
  policy that includes a retention lock. That’s different from the Time Travel feature, where you can reduce the
  retention interval to zero.
* Unlike Time Travel and Fail-safe, backups preserve data from more types of objects than just tables and table data.
* The speed and storage efficiency of taking backups is similar to the zero-copy mechanism used for cloning.
* The way all backups for the same object are grouped into backup sets makes management simpler than if
  you used clones to implement your own backup mechanism. For example, you don’t have to manage large numbers of
  objects, devise a naming scheme to keep track of the cloned objects, or implement a scheduling mechanism to delete
  old clones. Also, unlike with cloned objects, backups can’t be modified after you create them.
* Each backup represents a single table, schema, or a database as of the specified point in time.
  backups don’t include account-level objects such as users or roles.
  Some kinds of tables and other database-level objects aren’t included in schema and database backups.
  For more information, see backup objects.
* Backup-related objects are stored in the same cloud service provider (CSP) region as the associated database, schema, or
  table. For business continuity and disaster recovery scenarios, you typically combine backups with Snowflake account
  replication. That way, all the backup sets and backup policies can be replicated to a different region or
  a different CSP and recovered even if there’s an outage affecting the original region or CSP.
* Backup sets and backup policies can’t be cloned. If you clone a schema or database that contains such objects,
  they aren’t included in the cloned schema or database.

## Backup objects

You can create backup sets for tables, schemas, and databases.

### References from tables to other objects

Objects, such as views or functions, can refer to objects outside the schema or database in the backup. To ensure that
such references continue functioning after you restore from a backup, use one of the following strategies:

* If the tables and the other objects that they refer to are all in the same schema or the same database, create a
  backup set for the entire schema or database. That way, Snowflake restores all the interconnected objects at once
  when you restore from the backup.
* If objects in a backup set refer to objects that aren’t included in the backup set, be aware that when a backup
  is restored, the references from the restored objects point to the original objects from the other database or schema.
  If you dropped those other objects or changed their properties after taking the backup, you might encounter errors
  when you access the restored objects.
* For account-level objects, any references from restored objects *always* point to the original account-level object.
  That’s because the account-level objects aren’t part of any backup. For example, a schema backup might contain
  a secret that refers to a security integration. The security integration is an account-level object and can’t be
  included in any backup.

### Types of objects in database and schema backups

The following table lists the objects that are included in a database or schema backup:

| Object | Included in backup | Notes |
| --- | --- | --- |
| Permanent tables | Yes | Time Travel information for tables isn’t stored as part of a backup. |
| Transient tables | Yes | Such tables continue to be transient tables after you restore them. Transient schemas and transient databases also retain the transient property after you restore them. |
| Temporary tables | No | Temporary tables are session scoped and aren’t included in backups. |
| Dynamic tables | Yes | When you restore a dynamic table from a backup, the table is restored in a suspended state. Snowflake [automatically initializes](dynamic-tables-refresh.md) the new table during its first refresh. |
| External tables | No |  |
| Hybrid tables | No |  |
| Apache Iceberg™ tables | No | Dynamic Iceberg tables are also not included in backups. |
| Table constraints | Yes |  |
| Event tables | No |  |
| Sequences | Yes |  |
| Views | Yes |  |
| Materialized views | No |  |
| Secure views | Yes |  |
| File formats | Yes |  |
| Internal stages | No |  |
| External stages | No |  |
| Temporary stages | No |  |
| Directory tables | No |  |
| Pipes | No |  |
| Stored procedures | Yes | SQL, Javascript, Python, Java, and Scala procedures are all supported. |
| User-defined functions (UDFs) | Yes | SQL, Javascript, Python, Java, and Scala functions are all supported. Both scalar UDFs and user-defined table functions (UDTFs) are included in the backup. Java UDFs in backups have the same requirements as in [Limitations on cloning](../developer-guide/udf/java/udf-java-limitations.md). |
| Streams | No |  |
| Tasks | Yes | Tasks are included in the backup. Tasks restored from a backup are suspended and must be resumed. |
| Data metric functions (DMFs) | No |  |
| Policies | Yes | The following kinds of policies are included in a schema or database backup:   * Column-level security (masking) * Row access policies * Tag-based masking policies   If any table included in the backup has any other kind of policy applied (for example an aggregation policy, a projection policy, or a storage lifecycle policy), backup creation fails. |
| Grants | Yes | If you drop a role, associated ownership grants are transferred to the role that performs the DROP ROLE command. Grants other than ownership are deleted in this case. Therefore, the grants on a restored object might differ from the grants that existed when the backup was created. |
| Database roles | No | If the backup includes a database role or any object with a non-ownership grant to a database role, backup creation fails. |
| Object tagging | Yes |  |
| Alerts | Yes |  |
| Network rules | Yes |  |
| Github repos | No |  |
| Models | No |  |
| Model monitors | No |  |
| Datasets | No |  |
| Notebooks | No |  |
| Contacts | No |  |
| Cortex search services | No |  |
| Dbt projects | No |  |
| Image repositories | No |  |
| Listings | No |  |
| Organization listings | No |  |
| Pipes | No |  |
| Policy (aggregation) | No |  |
| Policy (authentication) | No |  |
| Policy (feature) | No |  |
| Policy (join) | No |  |
| Policy (packages) | No |  |
| Policy (password) | No |  |
| Policy (privacy) | No |  |
| Policy (projection) | No |  |
| Policy (session) | No |  |
| Provisioned throughput | No |  |
| Semantic views | No |  |
| Services | No |  |
| Streamlits | No |  |

### How Snowflake associates objects with their backup sets

When you create a backup set for a database, schema, or table, Snowflake associates the backup set with the
internal ID of that database, schema, or table. If you delete the original object, you can’t add any more backups
to that backup set. This behavior applies even if you recreate an object with the same name, or replace it with an
object that was restored from a backup.

If you instead rename the original object, then you can continue making more backups of it by adding more backups to
the same backup set. In that case, the output of SHOW BACKUP SETS changes to reflect the OBJECT_NAME value of the
renamed object.

If you want to make backups of a table but you frequently drop and recreate that table, perhaps through CREATE OR REPLACE
statements, include it in a backup set for the schema or database that contains the table. That way, you can keep using
the same backup set regardless of changes to the table.

When you restore a table from a backup, the restored table starts with a different name than the original. Suppose that
you want to completely replace the contents of the original table with the backup data, and continue to use the same backup set
for more backups of that same table. In that case, use a TRUNCATE or DELETE statement to remove the contents of the original table,
and an INSERT … SELECT statement to copy the data from the restored table. Don’t drop the original table and rename the restored table
to the name of the original table.

### Backups and encryption

The data within backup sets is protected by the same end-to-end encryption as other Snowflake objects and table data.
For more information about Snowflake encryption, see [Understanding end-to-end encryption in Snowflake](security-encryption-end-to-end.md).

Key rotation also applies to the data within backups.

### Backups and data lineage

Snowflake doesn’t preserve [data lineage](ui-snowsight-lineage.md) metadata with database, schema, and table
backups. After you restore an object from a backup, you can’t use Snowsight to view lineage information for the
restored data.

## Cost for backups

The following table describes charges for backups.

For information about credit consumption, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

| Cost component | Description | Billed |
| --- | --- | --- |
| Backup compute | Snowflake-managed compute service generates scheduled backup creation and expiration. | Yes |
| Restore compute | Snowflake-managed warehouses are used to restore objects from backups. | Yes |
| Backup storage | Snowflake-managed cloud object storage to store backup data. | Billed for bytes retained for backups, similar to bytes retained for clones. |

You can monitor costs for backup storage in the [TABLE_STORAGE_METRICS](../sql-reference/account-usage/table_storage_metrics.md)
view using the `RETAINED_FOR_CLONE_BYTES` column, and in the
[BACKUP_STORAGE_USAGE](../sql-reference/account-usage/backup_storage_usage.md) view.

## Access control privileges

The following table lists privileges and the object type on which the privilege is granted for managing and using backups.

| Privilege | Object type | Description |
| --- | --- | --- |
| CREATE BACKUP POLICY | Schema | Grants the ability to create a backup policy in a schema. The role granting this privilege must also have the USAGE privilege on the schema. |
| CREATE BACKUP SET | Schema | Grants the ability to create a backup set in a schema. The role granting this privilege must also have the USAGE privilege on the schema. To actually create the backup set also requires the appropriate privilege on the object that’s the subject of the backup set: SELECT for a table backup, or USAGE for a schema backup or database backup. |
| APPLY | Backup policy | Grants the ability to apply a specific backup policy. Only a user with the ACCOUNTADMIN role can grant this privilege. |
| APPLY BACKUP RETENTION LOCK | Account | Grants the ability to create and apply backup policies with retention lock. This privilege is granted to the ACCOUNTADMIN role and can be delegated.  This privilege is required to enable a role to do the following:   * Create a backup policy with retention lock. * Apply a backup policy with retention lock on a backup set. * Create a backup, either manually by a user or automatically on a schedule, in a backup set protected by a policy   with retention lock. |
| APPLY LEGAL HOLD | Account | Grants the ability to add or remove a legal hold from a backup. By default, the ACCOUNTADMIN role has this privilege. |

The following privilege requirements apply when Snowflake automatically creates or expires backups in the background.
The owner of the backup set needs to have the following privileges:

* The appropriate privilege on the object that’s the subject of the backup set: SELECT for a table
  backup, or USAGE for a schema backup or database backup.
* Any privilege on the parent schema or database for the subject of the backup set.
* Any privilege on the parent schema and database of the backup set.

If any of those privileges are missing, the automatic backup creation or expiration fails. You can
monitor these background operations using the ACCOUNT_USAGE.BACKUP_OPERATION_HISTORY view.

### Grant privileges required to create backup policies and sets

> **Note:**
>
> * The role used to grant these privileges must have the OWNERSHIP privilege on the schema,
>   or it must have the CREATE BACKUP SET or CREATE BACKUP POLICY privilege WITH GRANT OPTION.
> * You can grant the following privileges to a custom account role or a database role.

To enable the role `myrole` to create a backup policy in schema `myschema`, execute the following statement:

```sqlexample
GRANT CREATE BACKUP POLICY ON SCHEMA policy_schema TO ROLE myrole;
```

To enable the role `myrole` to create a backup set in schema `myschema`, execute the following statement:

```sqlexample
GRANT CREATE BACKUP SET ON SCHEMA policy_schema TO ROLE myrole;
```

### Grant the APPLY privilege on a backup policy to a role

> **Note:**
>
> * Only a user with the ACCOUNTADMIN role can grant this privilege.
> * You can grant this privilege to a custom account role or a database role.

To enable the role `myrole` to apply the backup policy `hourly_backup_policy` to a backup set, execute the following statement:

```sqlexample
GRANT APPLY ON BACKUP POLICY hourly_backup_policy TO ROLE myrole;
```

### Grant the APPLY BACKUP RETENTION LOCK privilege to a role

You can grant a role the privilege to apply backup policies with retention lock on backup sets.

Only a user with the ACCOUNTADMIN role can grant this privilege.

> **Important:**
>
> Applying a backup policy with a retention lock to a backup set is *irreversible*.
> Due to the strong guarantees needed for regulatory compliance, once you put a retention lock on a backup set,
> you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock.
> Backups created with a retention lock can’t be deleted until the expiration period ends.
>
> If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
> Snowflake deletes all backups, including those with retention locks.

To enable the role `retention_lock_admin_role` to apply a backup policy with retention lock on a backup set, execute
the following statement:

```sqlexample
GRANT APPLY BACKUP RETENTION LOCK ON ACCOUNT TO ROLE retention_lock_admin_role;
```

## Create and configure backups

This section provides example workflows for creating and restoring backups.

1. Create a backup policy named `hourly_backup_policy`. Backups taken with this policy are created hourly
   and each backup expires after 90 days.

   ```sqlexample
   CREATE BACKUP POLICY hourly_backup_policy
     SCHEDULE = '60 MINUTE'
     EXPIRE_AFTER_DAYS = 90
     COMMENT = 'Hourly backups expire after 90 days';
   ```
2. Create a backup set for table `t1` with the backup policy `hourly_backup_policy`:

   ```sqlexample
   CREATE BACKUP SET t1_backups
     FOR TABLE t1
     WITH BACKUP POLICY hourly_backup_policy;
   ```
3. Create a backup set for schema `s1` with the backup policy `hourly_backup_policy`:

   ```sqlexample
   CREATE BACKUP SET s1_backups
     FOR SCHEMA s1
     WITH BACKUP POLICY hourly_backup_policy;
   ```
4. Create a backup set for database `d1` with the backup policy `hourly_backup_policy`:

   ```sqlexample
   CREATE BACKUP SET d1_backups
     FOR DATABASE d1
     WITH BACKUP POLICY hourly_backup_policy;
   ```

### Create scheduled backups with retention lock

Create a backup set that automatically creates backups with a retention lock on a schedule.
The retention lock prevents anyone, even privileged users, from deleting or modifying backups
in any backup set that the policy is attached to.

Only a role that has the APPLY BACKUP RETENTION LOCK privilege on the account can create a backup policy
with a retention lock.

> **Important:**
>
> Applying a backup policy with a retention lock to a backup set is *irreversible*.
> Due to the strong guarantees needed for regulatory compliance, once you put a retention lock on a backup set,
> you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock.
> Backups created with a retention lock can’t be deleted until the expiration period ends.
>
> If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
> Snowflake deletes all backups, including those with retention locks.

1. Create a policy with a retention lock that creates a daily backup with an expiration period of 90 days:

   ```sqlexample
   CREATE BACKUP POLICY daily_backup_policy_with_lock
     WITH RETENTION LOCK
     SCHEDULE = '1440 MINUTE'
     EXPIRE_AFTER_DAYS = 90
     COMMENT = 'regulatory backups: they have a retention lock and expire after 90 days';
   ```
2. Create a backup set for table `t2` with the backup policy `daily_backup_policy_with_lock`:

   ```sqlexample
   CREATE BACKUP SET t2_backups
     FOR TABLE t2
     WITH BACKUP POLICY daily_backup_policy_with_lock;
   ```
3. Create a backup set for schema `s2` with the backup policy `daily_backup_policy_with_lock`:

   ```sqlexample
   CREATE BACKUP SET s2_backups
     FOR SCHEMA s2
     WITH BACKUP POLICY daily_backup_policy_with_lock;
   ```
4. Create a backup set for database `d2` with the backup policy `daily_backup_policy_with_lock`:

   ```sqlexample
   CREATE BACKUP SET d2_backups
     FOR DATABASE d2
     WITH BACKUP POLICY daily_backup_policy_with_lock;
   ```

### Create backups manually

You can manually add a backup to a backup set at any time. Doing so makes a backup of the database, schema, or table that’s
associated with the backup set. You can create backups manually whether or not the backup set also has backups that are
scheduled by a backup policy. If there’s a backup policy associated with the backup set,
and the policy defines an expiration period, that expiration period also applies to the manual backup.

The following example creates a table backup set `t1_backups` and then adds the first backup to it:

```sqlexample
CREATE BACKUP SET t1_backups FOR TABLE t1;
ALTER BACKUP SET t1_backups ADD BACKUP;
```

The following example creates a backup policy with hourly backups, a table backup set `t2_backups` that uses the policy, and
then adds a manual backup to the backup set:

```sqlexample
CREATE BACKUP POLICY hourly_backup_policy
  SCHEDULE = '60 MINUTE'
  EXPIRE_AFTER_DAYS = 7;

CREATE BACKUP SET t2_backups FOR TABLE t2 WITH BACKUP POLICY hourly_backup_policy;
-- Wait several hours. Then the backup set already contains several scheduled backups.
-- You can manually add a backup at any time, in addition to the scheduled backups.
ALTER BACKUP SET t2_backups ADD BACKUP;
```

You can run similar commands to add a backup to a schema or database backup set.
Substitute the name of the schema or database backup set in the ALTER BACKUP SET command.

### Suspend a backup policy on a backup set

When you suspend a backup policy on a backup set, you prevent the backup policy from being used to create
new scheduled backups in that backup set. You also suspend the expiration of existing backups in that
backup set that use the backup policy. Other backup sets that use the same policy aren’t affected.

The following example suspends a backup policy on the backup set `t2_backups`:

> ```sqlexample
> ALTER BACKUP SET t2_backups SUSPEND BACKUP POLICY;
> ```

You can also selectively suspend just the creation or just the expiration processes of the backup set.
The following example suspends the creation of new backups in the backup set `t3_backups`, and
suspends expiration of old backups from the backup set `t4_backups`:

> ```sqlexample
> ALTER BACKUP SET t3_backups SUSPEND BACKUP CREATION POLICY;
> ALTER BACKUP SET t4_backups SUSPEND BACKUP EXPIRATION POLICY;
> ```

For more information about the ALTER BACKUP SET command, see [ALTER BACKUP SET](../sql-reference/sql/alter-backup-set.md).

### Resume a backup policy on a backup set

You can resume suspended backup policies. Doing so resumes the creation and expiration of backups according to the backup
policy. If any backups reached their expiration time while the policy was suspended, Snowflake deletes those backups as soon as
the policy is resumed.

The following example resumes a backup policy on the backup set `t1_backup`:

```sqlexample
ALTER BACKUP SET t1_backups
  RESUME BACKUP POLICY;
```

You can also selectively resume just the creation or just the expiration processes of the backup set.
The following example resumes the creation of new backups in the backup set `t3_backups`, and
resumes expiration of old backups from the backup set `t4_backups`:

```sqlexample
ALTER BACKUP SET t3_backups RESUME BACKUP CREATION POLICY;
ALTER BACKUP SET t4_backups RESUME BACKUP EXPIRATION POLICY;
```

For more information about the ALTER BACKUP SET command, see [ALTER BACKUP SET](../sql-reference/sql/alter-backup-set.md).

### Restore a backup

You can restore an object from a backup set by using the ID of the specific backup.
For example, to restore table `t1` from backup set `t1_backups` in the current schema,
execute the following statements:

1. Find the ID of the table backup to restore in the `backup_id` column:

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET t1_backups ->> SELECT "created_on", "backup_id", "expire_on" FROM $1;
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+------------------------------------------+---------------------------|
   | 2024-08-19 17:12:28.991 -0700 | 983e0b66-91eb-41cb-8a0b-037abfec1914 | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | b5624ef0-1f35-452f-b132-09d8f0592e52 | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | eca1a94a-fd40-46db-a2bc-4afba6a38c0a | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | 8ee2fd7e-1afe-42e1-acd7-79582765a910 | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | d38caf14-f8a5-4ba8-a248-8287e0cdcf40 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-----------+-------------------+
   ```
2. Find the ID of the schema backup to restore in the `backup_id` column:

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET s1_backups;
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+--------------------------------------+-------------------------------|
   | 2024-08-19 17:12:28.991 -0700 | 0a0382e1-d265-46e9-b152-4c3b2b859e65 | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | 8dbcf919-3393-4590-928f-5481d7f2502f | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | 8ee2fd7e-1afe-42e1-acd7-79582765a910 | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | bd729a79-01bc-444d-a550-adaaa31ab62f | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | 9a8802c5-5fbd-4200-a09d-43e046103939 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-------------------------------+
   ```
3. Find the ID of the database backup to restore in the `backup_id` column:

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET d1_backups;
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+--------------------------------------+-------------------------------|
   | 2024-08-19 17:12:28.991 -0700 | 42435925-4e77-4b01-ba89-8163ac03e12f | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | 29c2c1b9-6599-4f0b-87b8-d43377fd7c77 | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | a4283984-a063-4415-acc4-0e3c19259fad | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | ffe25397-64b9-4c5f-b061-23a1885dc2dc | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | 28e12b8a-aab8-40a8-ae39-9a5a5f654d66 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-------------------------------+
   ```
4. Restore the backup for table `t1` taken on 2024-08-19 18:12:33:

   ```sqlexample
   CREATE TABLE restored_t1 FROM BACKUP SET t1_backups IDENTIFIER 'b5624ef0-1f35-452f-b132-09d8f0592e52';
   ```
5. Restore the backup for schema `s1` taken on 2024-08-19 18:12:33:

   ```sqlexample
   CREATE SCHEMA restored_s1 FROM BACKUP SET s1_backups IDENTIFIER '8dbcf919-3393-4590-928f-5481d7f2502f';
   ```
6. Restore the backup for database `d1` taken on 2024-08-19 18:12:33:

   ```sqlexample
   CREATE DATABASE restored_d1 FROM BACKUP SET d1_backups IDENTIFIER '29c2c1b9-6599-4f0b-87b8-d43377fd7c77';
   ```

### Delete a backup from a backup set

For any backup set, you can only delete the oldest backup that doesn’t have a legal hold. You do so by specifying the backup
ID. You can find the backups that don’t have a legal hold by examining the `is_under_legal_hold` property. You can find the
oldest backup by examining the `created_on` property.

> > **Note:**
> >
> > You can’t delete any backup from a backup set if a backup policy with retention lock is attached to that backup set,
> > or if that particular backup has a legal hold applied.
> >
> > The backup that you delete from the backup set must be the earliest backup in the set.

1. Find the ID of the table backup to delete in the `backup_id` column in the following output.
   Sorting in ascending order by the `created_on` column puts the oldest backup first.
   You could add `LIMIT 1` to the SELECT command to return only the row with the details of the oldest backup.

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET t1_backups ->>
     SELECT "created_on", "backup_id", "expire_on" FROM $1
       WHERE "is_under_legal_hold" = 'N'
       ORDER BY "created_on";
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+--------------------------------------+-------------------------------|
   | 2024-08-19 17:12:28.991 -0700 | 983e0b66-91eb-41cb-8a0b-037abfec1914 | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | b5624ef0-1f35-452f-b132-09d8f0592e52 | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | eca1a94a-fd40-46db-a2bc-4afba6a38c0a | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | 8ee2fd7e-1afe-42e1-acd7-79582765a910 | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | d38caf14-f8a5-4ba8-a248-8287e0cdcf40 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-------------------------------+
   ```
2. Delete the `t1_backups` backup created on 2024-08-19 17:12:28 using the `backup_id`:

   ```sqlexample
   ALTER BACKUP SET t1_backups DELETE BACKUP IDENTIFIER '983e0b66-91eb-41cb-8a0b-037abfec1914';
   ```
3. Find the ID of the schema backup to delete in the `backup_id` column in the following output:

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET s1_backups ->>
     SELECT "created_on", "backup_id", "expire_on" FROM $1 ORDER BY "created_on";
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+--------------------------------------+-------------------------------|
   | 2024-08-19 17:12:28.991 -0700 | 28e12b8a-aab8-40a8-ae39-9a5a5f654d66 | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | 46a1e22a-8557-432f-a14c-1261a4ca2b34 | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | 3e42fef6-b895-4055-a59f-179744d015d3 | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | 7807d24e-285e-4741-b332-87c32bad5cb6 | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | e022e619-ee83-45a0-b2b7-9007e284bdb3 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-------------------------------+
   ```
4. Delete the `s1_backups` backup created on 2024-08-19 17:12:28 using the `backup_id`:

   ```sqlexample
   ALTER BACKUP SET s1_backups DELETE BACKUP IDENTIFIER '28e12b8a-aab8-40a8-ae39-9a5a5f654d66';
   ```
5. Find the ID of the database backup to delete in the `backup_id` column in the following output:

   ```sqlexample
   SHOW BACKUPS IN BACKUP SET d1_backups ->>
     SELECT "created_on", "backup_id", "expire_on" FROM $1 ORDER BY "created_on";
   ```

   ```output
   +-------------------------------+--------------------------------------+-------------------------------+
   | created_on                    | backup_id                            | expire_on                     |
   |-------------------------------+--------------------------------------+-------------------------------|
   | 2024-08-19 17:12:28.991 -0700 | d3a77432-c98d-4969-91a9-fffae5dd655c | 2024-08-20 17:12:28.991 -0700 |
   | 2024-08-19 18:12:33.824 -0700 | 0a0382e1-d265-46e9-b152-4c3b2b859e65 | 2024-08-20 18:12:33.824 -0700 |
   | 2024-08-19 19:12:43.830 -0700 | 25e01ee0-ea9d-4bb7-af7f-f3fe87f9409e | 2024-08-20 19:12:43.830 -0700 |
   | 2024-08-19 20:12:45.446 -0700 | a12294f5-fc63-49cf-84f1-c7b72f7664af | 2024-08-20 20:12:45.446 -0700 |
   | 2024-08-19 21:12:55.305 -0700 | 28e12b8a-aab8-40a8-ae39-9a5a5f654d66 | 2024-08-20 21:12:55.305 -0700 |
   +-------------------------------+--------------------------------------+-------------------------------+
   ```
6. Delete the `d1_backups` backup created on 2024-08-19 17:12:28 using the `backup_id`:

   ```sqlexample
   ALTER BACKUP SET d1_backups DELETE BACKUP IDENTIFIER 'd3a77432-c98d-4969-91a9-fffae5dd655c';
   ```
7. Attempt to delete a more recent `d1_backups` backup created on 2024-08-19 21:12:55. Notice how Snowflake
   prevents you from deleting a backup other than the oldest one in the backup set.

   ```sqlexample
   ALTER BACKUP SET d1_backups DELETE BACKUP IDENTIFIER '28e12b8a-aab8-40a8-ae39-9a5a5f654d66';
   ```

   ```output
   Backup '28e12b8a-aab8-40a8-ae39-9a5a5f654d66' cannot be deleted as it is not the oldest active backup in the backup set D1_BACKUPS.
   ```

### Delete a backup set

You can delete a backup set using the [DROP BACKUP SET](../sql-reference/sql/drop-backup-set.md) command.

> **Note:**
>
> You can’t delete a backup set that has a retention lock and contains unexpired backups.
> You also can’t delete a backup set if any of its backups has a legal hold.

Delete the `t1_backups` backup set:

```sqlexample
DROP BACKUP SET t1_backups;
```

Delete the `s1_backups` backup set:

```sqlexample
DROP BACKUP SET s1_backups;
```

Delete the `d1_backups` backup set:

```sqlexample
DROP BACKUP SET d1_backups;
```

### Find all the backup sets that contain backups of a specific table

The following example shows how to find all the backup sets that contain a specific table inside a specific schema and database.
The SHOW TABLES command uses a pipe operator to retrieve the names of the database, schema, and table and store them in variables.
The SHOW BACKUP SETS output is filtered to show the backup sets that back up the database containing the table, or the schema
containing the table, or that contain that single table.

The filtered output from SHOW BACKUP SETS shows that there are two database backup sets for the
database `my_big_important_database`, one schema backup set for the schema
`my_big_important_database.public`, and one table backup set for the table
`my_big_important_database.public.my_small_secondary_table`.

```sqlexample
SHOW TABLES IN SCHEMA public ->>
  SET (dname, sname, tname) =
    (SELECT "database_name", "schema_name", "name" FROM $1
      WHERE "name" = 'MY_SMALL_SECONDARY_TABLE' AND "kind" = 'TABLE');

SHOW BACKUP SETS ->> SELECT "object_kind", "name", "database_name", "schema_name", "object_name" FROM $1
  WHERE ("object_kind" = 'TABLE' AND "database_name" = $dname AND "schema_name" = $sname AND "object_name" = $tname)
    OR ("object_kind" = 'SCHEMA' AND "database_name" = $dname AND "object_name" = $sname)
    OR ("object_kind" = 'DATABASE' AND "object_name" = $dname);
```

```output
+-------------+------------------+---------------------------+-------------+---------------------------+
| object_kind | name             | database_name             | schema_name | object_name               |
|-------------+------------------+---------------------------+-------------+---------------------------|
| DATABASE    | DATABASE_BACKUP  | MY_BIG_IMPORTANT_DATABASE | PUBLIC      | MY_BIG_IMPORTANT_DATABASE |
| DATABASE    | DATABASE_BACKUP2 | MY_BIG_IMPORTANT_DATABASE | PUBLIC      | MY_BIG_IMPORTANT_DATABASE |
| SCHEMA      | SCHEMA_BACKUP3   | MY_BIG_IMPORTANT_DATABASE | PUBLIC      | PUBLIC                    |
| TABLE       | TABLE_BACKUP2    | MY_BIG_IMPORTANT_DATABASE | PUBLIC      | MY_SMALL_SECONDARY_TABLE  |
+-------------+------------------+---------------------------+-------------+---------------------------+
```

### Create a backup for a table with dependencies

The following examples show how you might create a table backup for a table
that refers to a sequence and a foreign key in a different schema. To prepare,
we create the schema `other_schema` containing a sequence and a table. Then we create the
main table in the `public` schema, referring to the sequence and the other table.

```sqlexample
USE DATABASE my_big_important_database;

CREATE SCHEMA other_schema;
USE SCHEMA other_schema;

CREATE SEQUENCE my_sequence;
CREATE TABLE my_dimension_table (id INT AUTOINCREMENT PRIMARY KEY);

USE SCHEMA public;
CREATE TABLE dependent_table
(
   id INT DEFAULT my_big_important_database.other_schema.my_sequence.NEXTVAL PRIMARY KEY,
   foreign_id INT,
   FOREIGN KEY (foreign_id) REFERENCES my_big_important_database.other_schema.my_dimension_table(id)
 );

SELECT GET_DDL('TABLE','dependent_table');
```

The GET_DDL() output shows the references that point to the other schema:

```output
+-------------------------------------------+
| GET_DDL('TABLE','DEPENDENT_TABLE')        |
|-------------------------------------------|
| create or replace TABLE DEPENDENT_TABLE ( |
|     ID NUMBER(38,0) NOT NULL DEFAULT MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.MY_SEQUENCE.NEXTVAL,
|     FOREIGN_ID NUMBER(38,0),                |
|     primary key (ID),                       |
|     foreign key (FOREIGN_ID) references MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.MY_DIMENSION_TABLE(ID)
| );                                        |
+-------------------------------------------+
```

Next, we create the backup set for the table and add a backup to it:

```sqlexample
CREATE BACKUP SET dependency_experiments FOR TABLE dependent_table;
ALTER BACKUP SET dependency_experiments ADD BACKUP;
SHOW BACKUPS IN BACKUP SET dependency_experiments;
```

The SHOW BACKUPS output contains the `backup_id` value to use for the restore operation:

```output
+-------------------------------+--------------------------------------+------------------------+---------------------------+--------------+-----------+
| created_on                    | backup_id                            | backup_set_name        | database_name             | schema_name  | expire_on |
|-------------------------------+--------------------------------------+------------------------+---------------------------+--------------+-----------|
| 2025-07-01 11:53:27.860 -0700 | 0fd44138-b571-449b-be0a-72779501f80e | DEPENDENCY_EXPERIMENTS | MY_BIG_IMPORTANT_DATABASE | OTHER_SCHEMA | NULL      |
+-------------------------------+--------------------------------------+------------------------+---------------------------+--------------+-----------+
```

We restore that table under a new name, and confirm that the restored table refers to
the objects in the other schema:

```sqlexample
CREATE TABLE restored_dependent_table FROM BACKUP SET dependency_experiments
  IDENTIFIER '0fd44138-b571-449b-be0a-72779501f80e';

SELECT GET_DDL('TABLE','restored_dependent_table');
```

```output
+----------------------------------------------------+
| GET_DDL('TABLE','RESTORED_DEPENDENT_TABLE')        |
|----------------------------------------------------|
| create or replace TABLE RESTORED_DEPENDENT_TABLE ( |
|     ID NUMBER(38,0) NOT NULL DEFAULT MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.MY_SEQUENCE.NEXTVAL,
|     FOREIGN_ID NUMBER(38,0),                         |
|     foreign key (FOREIGN_ID) references MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.MY_DIMENSION_TABLE(ID),
|     primary key (ID)                                 |
| );                                                 |
+----------------------------------------------------+
```

To illustrate what happens if the referred-to object no longer exists, we drop the sequence
and then restore the table again from the same backup:

```sqlexample
DROP SEQUENCE my_big_important_database.other_schema.my_sequence;
CREATE OR REPLACE TABLE restored_dependent_table FROM BACKUP SET dependency_experiments
  IDENTIFIER '0fd44138-b571-449b-be0a-72779501f80e';

SELECT * FROM restored_dependent_table;
```

Querying the table still works:

```output
+----+------------+
| ID | FOREIGN_ID |
|----+------------|
+----+------------+
0 Row(s) produced. Time Elapsed: 0.129s
```

However, operations such as GET_DDL(), DESCRIBE, and INSERT all fail because they
depend on a sequence that no longer exists:

```sqlexample
SELECT GET_DDL('TABLE','restored_dependent_table');
```

```output
002073 (02000): SQL compilation error:
Sequence used as a default value in table 'MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.RESTORED_DEPENDENT_TABLE'
  column 'ID' was not found or could not be accessed.
```

```sqlexample
DESC TABLE restored_dependent_table;
```

```output
+------------+--------------+--------+-------+----------------------------------------+-------------+------------+-------+------------+---------+-------------+----------------+
| name       | type         | kind   | null? | default                                | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------------+--------------+--------+-------+----------------------------------------+-------------+------------+-------+------------+---------+-------------+----------------|
| ID         | NUMBER(38,0) | COLUMN | N     | [sequence cannot be found or accessed] | Y           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| FOREIGN_ID | NUMBER(38,0) | COLUMN | Y     | NULL                                   | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------------+--------------+--------+-------+----------------------------------------+-------------+------------+-------+------------+---------+-------------+----------------+
```

```sqlexample
INSERT INTO restored_dependent_table (foreign_id) VALUES (2);
```

```output
002073 (02000): SQL compilation error:
Sequence used as a default value in table 'MY_BIG_IMPORTANT_DATABASE.OTHER_SCHEMA.RESTORED_DEPENDENT_TABLE'
  column 'ID' was not found or could not be accessed.
```

### Create a backup for a dynamic table

A dynamic table always involves a reference to some other table. For that reason, you might prefer
to use schema backups or database backups for dynamic tables, so that the original table and
the dynamic table can be included in the same backup.

If you make a table backup for a dynamic table, you include the keyword DYNAMIC in the CREATE BACKUP
SET command, and in the CREATE TABLE command when you restore from a backup. The following example
sets up the dynamic table, a table backup set for that table, and creates the first backup:

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = '1 minute'
  WAREHOUSE = my_wh
  AS SELECT * FROM my_base_table WHERE col1 IS NOT NULL;

CREATE BACKUP SET dynamic_table_backups
  FOR DYNAMIC TABLE my_dynamic_table;

ALTER BACKUP SET dynamic_table_backups ADD BACKUP;
```

The following example shows how to determine the backup IDs for backups created at various times.
In this case, the newest backup is the first row in the result set. Then you use the ID of the backup
in the CREATE DYNAMIC TABLE command.

```sqlexample
SHOW BACKUPS IN BACKUP SET dynamic_table_backups
  ->> SELECT "created_on", "backup_id" FROM $1
        ORDER BY "created_on" DESC;

CREATE DYNAMIC TABLE restored_dynamic_table
  FROM BACKUP SET dynamic_table_backups
    IDENTIFIER '<backup_id_from_SHOW_BACKUPS_output>';
```

> **Tip:**
>
> When you restore a dynamic table from a backup, Snowflake
> [automatically initializes](dynamic-tables-refresh.md)
> the new table during its first refresh.

### Add and remove legal holds

Before you work with legal holds for Snowflake backups, learn their purpose and requirements. For more information, see
Legal hold.

Suppose that your organization’s legal or compliance team sends a litigation hold request, specifying what types of data need to be
preserved. In that case, you might follow a process as follows:

* You work with the legal team to identify where the relevant data is stored, and which backup sets contain the associated objects.
* You put a legal hold on a backup from the applicable timeframe within a backup set. Doing so disables any automatic expiration for that backup.
  You can put a legal hold on a backup that Snowflake created automatically based on a schedule, or that you created manually. The
  legal hold applies whether or not the backup set has an associated backup policy, or an expiration period, or a retention lock.
* You perform refresh operations for any secondary accounts where the database containing the backup set is replicated.
  That way, the legal hold and associated backup are preserved across any failover and failback operations.
* You use the Snowflake access controls and logs to audit access to the data that’s under the legal hold.
* Once the legal case concludes and the legal team approves removing the legal hold, a user with the APPLY LEGAL HOLD privilege
  releases the legal hold. Then the normal automation for expiry resumes.

This example shows the sequence of SQL commands you might use during the lifecycle of a legal hold for
a backup within a particular backup set. You find the identifier of the relevant backup by using the
SHOW BACKUPS IN BACKUP SET command, and checking the `"is_under_legal_hold"` column to see if a legal hold is
already in place. Then you add or remove the legal hold from the specific backup.

```sqlexample
USE ROLE <role_name>; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW BACKUPS IN BACKUP SET <backup_set_name>
  ->> SELECT * FROM $1 WHERE "is_under_legal_hold" = 'N';
ALTER BACKUP SET <backup_set_name>
  MODIFY BACKUP IDENTIFIER '<backup_identifier>'
  ADD LEGAL HOLD;

USE ROLE <role_name>; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW BACKUPS IN BACKUP SET <backup_set_name>
  ->> SELECT * FROM $1 WHERE "is_under_legal_hold" = 'Y';
ALTER BACKUP SET <backup_set_name>
  MODIFY BACKUP IDENTIFIER '<backup_identifier>'
  REMOVE LEGAL HOLD;
```

> **Tip:**
>
> You can also check for the existence of legal holds by querying the `"is_under_legal_hold"` column in the
> INFORMATION_SCHEMA.BACKUPS or ACCOUNT_USAGE.BACKUPS views.

### Replicate backup-related objects

When you use replication in combination with database, schema, and table backups, you specify the databases that contain the
backup sets and backup policies in your replication groups and failover groups. You can control how your backup-related
objects are replicated by organizing your replication groups and failover groups, and choosing which databases and schemas
contain your backup sets and backup policies. For more information, see [Backup replication for database, schema, and table backups](account-replication-intro.md).

The backup sets and backup policies are database objects. Snowflake replicates the backup sets and backup
policies along with the databases and schemas that contain them.

Snowflake minimizes the time and storage usage for backups by using a mechanism similar to cloning, so that each backup doesn’t
require a complete new copy of all the table data. If the backup set is part of a different failover group than the database,
schema, or table that the backup set applies to, there’s a one-time full transfer of the data for the first refresh of that
replication group or failover group.

When a replication group or failover group includes a backup set, the increase in refresh latency is proportional to the number of
backups created since the last refresh.

If you define an expiry period for older backups, the automatic deletion happens on the primary account.
Those expired backups are removed from the secondary account when you perform a refresh operation.

> **Important:**
>
> If you replicate a backup set, make sure to perform a refresh immediately after placing a legal hold on a backup
> in that backup set. If you perform a failover before replicating the backup set that contains the legal hold, the original
> backup set can be overwritten when you fail back to the original primary account, potentially erasing the legal hold.

Therefore, you can fine-tune the replication for backup-related objects by following these practices:

* To minimize the chance of refresh failures, put the backup set and the optional
  backup policy within the same database and schema.
  If that’s not practical, put those things in the same replication group or failover group.
  That way, all these related objects are replicated at the same time.
* To maximize flexibility about which backup-related objects are replicated and the replication schedule,
  put the backup set and optional backup policy in a different database than the associated database,
  schema, or table.
  That way, you can specify whether the backup-related objects are replicated, and how often.
* If you apply a policy with an expiry period for older backups, make the expiry period longer than
  the interval between replication refresh operations. That way, every new backup is replicated to
  the secondary account at least once before it expires.
* Immediately after adding a legal hold, perform refresh operations on all secondary accounts where the
  database that contains the backup set is replicated. That way, the legal hold and associated backup
  are preserved across any failover and failback operations.
* Suppose that you put a backup policy and an associated backup set into databases that are part of different
  replication groups or failover groups. In that case, make sure to do the initial refresh of the group containing
  the backup policy first. Otherwise, the refresh operation fails because you can’t create a backup set that’s missing
  its associated backup policy in the secondary account.

  > **Note:**
  >
  > Putting the backup set into a different replication group or failover group than the object of the backup set does require a
  > full transfer of all the data, during the first refresh of the group that contains the backup set.
  >
  > If you restore a schema backup or database backup on a secondary account, references to objects within the restored schema
  > or database might not resolve properly unless the referenced objects are part of the same failover group as the backup.

### Monitor backups and backup operations

You can determine which backup-related objects exist, their properties, and how much storage they use
by querying the following views.

Information schema:

* [BACKUP_POLICIES view](../sql-reference/info-schema/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/info-schema/backup_sets.md)
* [BACKUPS view](../sql-reference/info-schema/backups.md)

Account usage:

* [BACKUP_OPERATION_HISTORY view](../sql-reference/account-usage/backup_operation_history.md)
* [BACKUP_POLICIES view](../sql-reference/account-usage/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/account-usage/backup_sets.md)
* [BACKUP_STORAGE_USAGE view](../sql-reference/account-usage/backup_storage_usage.md)
* [BACKUPS view](../sql-reference/account-usage/backups.md)

Organization usage:

* [BACKUP_OPERATION_HISTORY view](../sql-reference/organization-usage/backup_operation_history.md)
* [BACKUP_POLICIES view](../sql-reference/organization-usage/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/organization-usage/backup_sets.md)
* [BACKUPS view](../sql-reference/organization-usage/backups.md)

## SQL reference topics

### Backup policy

* [CREATE BACKUP POLICY](../sql-reference/sql/create-backup-policy.md)
* [ALTER BACKUP POLICY](../sql-reference/sql/alter-backup-policy.md)
* [DROP BACKUP POLICY](../sql-reference/sql/drop-backup-policy.md)
* [SHOW BACKUP POLICIES](../sql-reference/sql/show-backup-policies.md)

### Backup set

* [CREATE BACKUP SET](../sql-reference/sql/create-backup-set.md)
* [ALTER BACKUP SET](../sql-reference/sql/alter-backup-set.md)
* [DROP BACKUP SET](../sql-reference/sql/drop-backup-set.md)
* [SHOW BACKUP SETS](../sql-reference/sql/show-backup-sets.md)

### Backups

You don’t run an actual CREATE BACKUP command. To create a new backup, you run ALTER BACKUP SET … ADD BACKUP.
Or when you associate the backup set with a backup policy that has a schedule, Snowflake automatically creates
backups in the backup set based on the specified schedule. To delete an older backup, you run ALTER BACKUP SET … DELETE
BACKUP. Such operations require you to specify the identifier for a specific backup. You can find the backup identifiers,
along with other information such as when each backup was created, by using the following command.

* [SHOW BACKUPS IN BACKUP SET](../sql-reference/sql/show-backups-in-backup-set.md)

### Restoring objects from backups

You use the syntax CREATE `object_kind` FROM BACKUP SET to restore each kind of object
from the appropriate kind of backup set.

Further backups in the backup set use the original object, not the restored one. That’s true even
if you rename the restored object to the same name as the original object. If you want to continue using the
same backup set after doing a restore, you restore the object under a new name
and then transfer data back to the original object.

* [CREATE DATABASE FROM BACKUP SET](../sql-reference/sql/create-database.md)
* [CREATE SCHEMA FROM BACKUP SET](../sql-reference/sql/create-schema.md)
* [CREATE TABLE FROM BACKUP SET](../sql-reference/sql/create-table.md)

### Views

The following system views contain metadata related to backups, backup sets, and backup policies.

#### Information schema views

These views in the INFORMATION_SCHEMA schema contain information about backup-related objects
that currently exist:

* [BACKUP_POLICIES view](../sql-reference/info-schema/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/info-schema/backup_sets.md)
* [BACKUPS view](../sql-reference/info-schema/backups.md)

#### Account usage views

These views in the ACCOUNT_USAGE schema contain information at the account level about backup-related objects
that exist, or have been dropped, the operations that were performed on the backups, and the storage that they use:

* [BACKUP_OPERATION_HISTORY view](../sql-reference/account-usage/backup_operation_history.md)
* [BACKUP_POLICIES view](../sql-reference/account-usage/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/account-usage/backup_sets.md)
* [BACKUP_STORAGE_USAGE view](../sql-reference/account-usage/backup_storage_usage.md)
* [BACKUPS view](../sql-reference/account-usage/backups.md)

#### Organization usage views

These views in the ORGANIZATION_USAGE schema contain information at the organization level about backup-related objects
that exist, or have been dropped, the operations that were performed on the backups, and the storage that they use:

* [BACKUP_OPERATION_HISTORY view](../sql-reference/organization-usage/backup_operation_history.md)
* [BACKUP_POLICIES view](../sql-reference/organization-usage/backup_policies.md)
* [BACKUP_SETS view](../sql-reference/organization-usage/backup_sets.md)
* [BACKUPS view](../sql-reference/organization-usage/backups.md)

## Terminology change

The feature is now called **backups** instead of snapshots. All SQL commands, views, and privileges use
**BACKUP** terminology:

* CREATE BACKUP POLICY, CREATE BACKUP SET
* ALTER BACKUP POLICY, ALTER BACKUP SET
* DROP BACKUP POLICY, DROP BACKUP SET
* SHOW BACKUP POLICIES, SHOW BACKUP SETS, SHOW BACKUPS IN BACKUP SET
* BACKUPS, BACKUP_POLICIES, BACKUP_SETS views in Account Usage, Organization Usage, and Information Schema
* APPLY BACKUP POLICY, APPLY BACKUP RETENTION LOCK privileges

The former SNAPSHOT/SNAPSHOTS names are still present but deprecated in favor of their BACKUP/BACKUPS equivalents.
For example:

* CREATE SNAPSHOT POLICY is deprecated; use CREATE BACKUP POLICY instead.
* SNAPSHOTS view is deprecated; use BACKUPS view instead.
* APPLY SNAPSHOT POLICY privilege is deprecated; use APPLY BACKUP POLICY privilege instead.

The deprecated commands, views, and privileges continue to work, but Snowflake intends to remove them in a future release.

---
title: Before you begin
source: https://docs.snowflake.com/en/user-guide/setup.md
section: User Guide
---

# Before you begin

If your organization has a Snowflake account, check with your Snowflake account administrator to see
if a user was created for you. If your organization doesn’t have a Snowflake account, you can
sign up for a [free trial account](https://signup.snowflake.com/) or
[contact us directly](https://www.snowflake.com/free-trial-contact-sales/) to request
an account.

After you have access to a Snowflake account and can sign in as a user, you can access Snowflake
by using any of the following methods:

* Snowflake interfaces:

  + [Snowsight](ui-snowsight.md), Snowflake’s browser-based web interface
  + Command-line clients:

    - [Snowflake CLI](../developer-guide/snowflake-cli/index.md), for developer-centric workloads and SQL operations
    - [SnowSQL](snowsql.md), for SQL operations
  + Programmatic interfaces, such as Snowflake connectors, drivers, and the Snowpark API. For an overview of these
    and other programmatic interfaces, see [Develop Apps and Extensions](https://docs.snowflake.com/developer).
* Applications:

  + You can access Snowflake through an application. Snowflake provides several ways to create and use applications.
    Your organization might create an application and make it available to you, or you might use a third-party application
    available on the [Snowflake Marketplace](../collaboration/collaboration-marketplace-about.md) or in the
    [Snowflake ecosystem](ecosystem-all.md).

For information about these tools and interfaces, see [Applications and tools for connecting to Snowflake](../guides-overview-connecting.md).

For pricing and service information, see the [pricing page](https://www.snowflake.com/pricing/) on the Snowflake website.

For more information about free trial accounts, see [Trial accounts](admin-trial-account.md).

---
title: Best practices
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-best-practices.md
section: User Guide
---

# Best practices

This topic provides best practices for working with Apache Iceberg™ tables in Snowflake.

## Create enough external volumes for your use case

Each external volume is associated with a particular [Active storage location](tables-iceberg-storage.md),
and a single external volume can support multiple Iceberg tables. However, the number of external volumes you need depends on how you want to store,
organize, and secure your table data.

You can use a single external volume if you want the data and metadata
for *all* of your Snowflake-Iceberg tables in subdirectories under the same storage location (for example, in the same S3 bucket).
To configure these directories for Snowflake-managed tables, see [Data and metadata directories](tables-iceberg-storage.md).

Alternatively, you can create multiple external volumes to secure various storage locations differently. For example,
you might create the following external volumes:

* A read-only external volume for externally managed Iceberg tables.
* An external volume configured with read and write access for Snowflake-managed tables.

## Use the recommended file format options for data loading

For data loading with [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)
and [Snowpipe](data-load-snowpipe-intro.md),
use the following format options for your Parquet data files:

* `BINARY_AS_TEXT = FALSE`
* `USE_LOGICAL_TYPE = TRUE`
* `USE_VECTORIZED_SCANNER = TRUE`
* `REPLACE_INVALID_CHARACTERS = TRUE`

## Refresh externally managed tables often

To prevent long refresh times and get the most up-to-date table data quickly,
perform frequent refreshes on Iceberg tables that use an external catalog.

Snowflake attempts to optimize table refreshes when it expects the operation to take a long time.
However, refresh time ultimately depends on the number of snapshots associated with a table, and the number of data
files that belong to a table.

It’s also important to align your Snowflake refresh schedule with table maintenance operations such as snapshot expiration or compaction.
Refresh the metadata each time you perform a maintenance operation.

For instructions, see [Refresh the table metadata](tables-iceberg-manage.md).

## Write complete statistics

To optimize query runtime performance for tables that aren’t managed by Snowflake,
make sure your Parquet file statistics are as complete as possible.

Ensure that the Parquet file writer you use (for example, Spark or Trino) is configured to write statistics.
You might also need to update your writer to the latest version.

Missing statistics like the following degrade query performance:

* Minimum and maximum values.
* Number of distinct values (NDV). The number of distinct values is used to determine the join order in complex joins. Missing NDV statistics
  can lead to join explosion.
* Number of NULL counts.

## Increase warehouse size

When you create an Iceberg table that uses an external catalog, Snowflake attempts to read statistics from the table manifest files
to provide faster performance.

In some situations, such as when there are missing or incorrect statistics in the manifest files, Snowflake
scans the table data files for statistics. Scanning a large number of data files can slow down table creation.
To accelerate the table creation process, use a larger warehouse that can scan table files in parallel.

> **Note:**
>
> Snowflake doesn’t parallelize table column scanning. Switching to a larger warehouse doesn’t result in faster query runtime.

## Choose the right storage serialization policy for your use case

Choose an appropriate `STORAGE_SERIALIZATION_POLICY` for your use case.
When you create a Snowflake-managed table (or convert a table to use Snowflake as the catalog), you set a storage serialization policy for
that table. The serialization policy tells Snowflake what kind of encoding and compression to perform on the table data files.

An unsuitable policy might make a table incompatible with external engines or cause performance degradation in Snowflake.

For more information, see [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md).

## Use a catalog-linked database to access your remote Iceberg tables

To bring your external data from a remote Apache Iceberg™ REST catalog into Snowflake, use a catalog-linked database. A catalog-linked database
automatically discovers and stays in sync with the namespaces and tables in your remote catalog. You can use a catalog-linked database to
read and write to the tables in your remote catalog from Snowflake, while preserving full interoperability with your existing Iceberg ecosystem.

In addition, a catalog-linked database lets you to create new remote Iceberg tables from Snowflake.

For more information, see the following topics:

* [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md)
* If your external data is in Unity Catalog, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md)
* If your external data is in AWS Glue, see [Build Data Lakes using Apache Iceberg with Snowflake and AWS Glue](https://www.snowflake.com/en/developers/guides/data-lake-using-apache-iceberg-with-snowflake-and-aws-glue/)

---
title: Best practices for hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-best-practices.md
section: User Guide
---

# Best practices for hybrid tables

This topic describes best practices and important considerations when using hybrid tables.
To achieve optimal performance with hybrid tables, follow these best practices in
your deployment. This guide outlines specific configuration, design, and operational practices that maximize
performance for production workloads.

* General best practices:

  + Query performance in Snowsight versus driver-based access
  + Client drivers for hybrid tables
  + Client configuration and access methods
  + Index design and usage
* Best practices for optimizing performance:

  + Bulk loading data
  + Warehouse optimization
  + Troubleshooting performance issues
  + Stored procedures and hybrid tables
  + Serverless tasks and hybrid tables
  + Foreign keys for join queries
* Best practices for operating and monitoring hybrid tables:

  + Caching and warm-up
  + Performance monitoring
  + Monitoring quotas and throttling

## Query performance in Snowsight versus driver-based access

> **Attention:**
>
> Performance statistics reported in Snowsight are not indicative of query performance for driver-based workloads.

[Snowsight](ui-snowsight.md) provides rich access to query plans, data statistics, query history,
and other detailed information that is useful for interactive query prototyping, debugging, investigation, monitoring, and
other activities. Providing that rich interactive experience adds overhead to the Snowflake query engine. As such,
latency for short-running queries executed through Snowsight is not indicative of performance that can be achieved with
programmatic drivers. Queries executed via code-based or driver-based solutions execute with lower latency and variability than
queries executed via Snowsight.

> **Note:**
>
> Run a [simple performance test](tables-hybrid-test.md) to validate performance for
> your scenario.

## Client drivers for hybrid tables

In order to access hybrid tables, you will need to use one of the following driver versions:

> | Driver | Minimum Version |
> | --- | --- |
> | Go | 1.6.25 |
> | JDBC | 3.13.31 |
> | .Net | 2.1.2 |
> | Node.js | 1.9.0 |
> | ODBC | 3.0.2 |
> | PHP | 2.0.0 |
> | Python Connector | 3.1.0 |
> | Snowflake CLI | 3.10.0 |
> | SnowSQL | 1.2.28 |
>
> > **Note:**
> >
> > You may not be able to access hybrid tables using an earlier driver version.

For optimal performance with hybrid tables, be sure to use the latest version of your selected driver.

You can also access hybrid tables by using the [Snowflake SQL API](../developer-guide/sql-api/index.md);
however, this API is not recommended for use cases that require optimal latency.

## Client configuration and access methods

Connection management directly affects performance and scalability. When connecting to databases that contain hybrid tables, consider the
following best practices for achieving good performance.

* Use connection pooling with long-lived connections to eliminate the overhead of repeatedly establishing new connections. Most client
  frameworks that connect to Snowflake provide a connection-pooling mechanism to efficiently manage access.
* Network proximity significantly affects end-to-end latency; therefore, colocate your client software in the same cloud region as the
  Snowflake account.
* Use prepared statements with bound parameters so the query planner will reuse previously created query plans.
* Use the supported programmatic client drivers, not Snowsight, to achieve optimal latency.
  See Client drivers for hybrid tables.

## Index design and usage

Creating and using indexes is a key component to achieving optimal performance for hybrid tables. Consider the
following recommendations:

* Create secondary indexes for frequently used predicates.
* Design composite indexes to match complete query patterns.
* Avoid using multiple indexes with columns in the same ordinal position.
* Understand the cardinality of your data before creating indexes. Indexes built with a single, low-cardinality column have
  limited benefit. See [Estimating the Number of Distinct Values](querying-approximate-cardinality.md).
* Indexes add write overhead and storage requirements. Be careful to balance read versus write performance for
  applications that require low-latency write operations.

Properly designed indexes significantly improve query performance by providing efficient
data access paths. If possible, choose primary keys for optimal selectivity while minimizing complexity.
In some cases, adding columns with calculated or surrogate key values provides better performance
than complex composite indexes. Secondary indexes dramatically improve performance for frequently accessed columns.

For well-defined queries, using the INCLUDE keyword to add columns to an index when you create the table might further decrease
latency. See [INCLUDE columns](tables-hybrid-index.md).

> **Attention:**
>
> Be mindful of the indexes you create on a hybrid table; non-selective index scans result in
> sub-optimal performance, throttling, and higher cost.

### Queries that qualify for index use

Hybrid table indexes may be accessed when queries use one of the following conditions:

* `<column_reference> {=, >, >=, <, <=} <constant_value>`
* `<column_reference> IN <constant_in_list>`
* `<column_reference> BETWEEN <constant_value> AND <constant_value>`

Expressions can be chained together using [Logical operators](../sql-reference/operators-logical.md).

For example:

```sqlexample
CREATE OR REPLACE HYBRID TABLE icecream_orders (
  id NUMBER PRIMARY KEY AUTOINCREMENT START 1 INCREMENT 1 ORDER,
  store_id NUMBER NOT NULL,
  flavor VARCHAR(20) NOT NULL,
  order_ts TIMESTAMP_NTZ,
  num_scoops NUMERIC,
  INDEX idx_icecream_order_store (store_id, order_ts),
  INDEX idx_icecream_timestamp (order_ts)
  );

-- Generate sample data for testing

INSERT INTO icecream_orders (store_id, flavor, order_ts, num_scoops)
  SELECT
    UNIFORM(1, 10, RANDOM()),
    ARRAY_CONSTRUCT('CHOCOLATE', 'VANILLA', 'STRAWBERRY', 'LEMON')[UNIFORM(0, 3, RANDOM())],
    DATEADD(SECOND, UNIFORM(0, 86400, RANDOM()), DATEADD(DAY, UNIFORM(-90, 0, RANDOM()), CURRENT_DATE())),
    UNIFORM(1, 3, RANDOM())
  FROM TABLE(GENERATOR(ROWCOUNT => 10000))
  ;

-- Use idx_icecream_order_store (first column)

  SELECT *
    FROM icecream_orders
    WHERE store_id = 5;

-- Use idx_icecream_order_store (both columns)

  SELECT *
    FROM icecream_orders
    WHERE store_id IN (1,2,3) AND order_ts > DATEADD(DAY, -7, CURRENT_DATE());

-- Use idx_icecream_timestamp

  SELECT *
    FROM icecream_orders
    WHERE order_ts BETWEEN DATEADD(DAY, -2, CURRENT_DATE()) AND DATEADD(DAY, -2, CURRENT_DATE());
```

## Foreign keys for join queries

In general, queries that require joins benefit from the definition of FOREIGN KEY constraints. Although foreign keys aren’t required
for running hybrid table queries, they do assist the optimizer in building the most effective query plan. Foreign keys provide two important functions:

* They establish referential integrity between tables.
* They provide the query planner with metadata for optimization.

A FOREIGN KEY constraint informs the query optimizer that a particular record in a child table points to exactly one record in a parent table. This
behavior is one way in which query predicates are “pushed down” during a join, thereby optimizing storage I/O. The query is executed as a
“one-to-many” join. Joining hybrid tables without foreign keys means that they are executed as “many-to-many” joins, such that additional query predicates
might be necessary to optimize the query.

For more information, see the following topics:

* [REFERENTIAL_CONSTRAINTS view](../sql-reference/info-schema/referential_constraints.md)
* [CREATE | ALTER TABLE … CONSTRAINT](../sql-reference/sql/create-table-constraint.md)
* [Constraints](../sql-reference/constraints.md)

## Bulk loading data

You can use several optimizations and best practices for loading data into hybrid tables:

* Use [CREATE TABLE … AS SELECT (also referred to as CTAS)](../sql-reference/sql/create-table.md) for creating and immediately loading empty tables.
* Verify use of optimized bulk loading in query profiles.
* Prefer initial data loading as a single bulk transaction.

Hybrid tables provide an optimized bulk loading path that delivers up to 10x faster loading performance than standard
loading methods. This optimized bulk loading path is automatically applied when you load data into an empty table
using CTAS (CREATE TABLE AS SELECT), COPY INTO, or INSERT INTO SELECT commands. (An empty table is a table that
has never contained any data.)

You can verify that the optimization is being used by checking the statistics section of the query profile,
where rows will be reported as `Number of rows bulk loaded` rather than `Number of rows inserted`.

> **Note:**
>
> CTAS operations do not support FOREIGN KEY constraints. If your table requires foreign keys,
> you must use COPY or INSERT INTO SELECT instead.

For tables that already contain data, the optimized bulk loading path is not currently available.
In these cases, loading operations may achieve approximately 1 million records per minute, though this
varies based on record size, table structure, and number of indexes.

## Warehouse optimization

A warehouse of size X-Small is sufficient for many operational workloads.
In order to achieve higher concurrency and throughput on short-running
operational queries, increase the compute node count by using a
[multi-cluster warehouse](warehouses-multicluster.md) rather than
increasing compute resources with a larger warehouse.

If your workload has variable throughput patterns, you can enable autoscaling to
reduce consumption when demand is lower. Set the scaling policy to `Standard`
rather than `Economy` for the best performance and efficiency on workloads
that require high throughput or low latency. For more information, see
[Setting the scaling policy for a multi-cluster warehouse](warehouses-multicluster.md).

In some cases, isolating workloads in separate
warehouses might be beneficial to enable independent scaling. If you
have a mixed hybrid workload with operational and analytical components, it is
beneficial to separate the operational and analytical components into different
warehouses. If you cannot separate them and must execute them together on the
same warehouse, choose the warehouse size based on the analytical query
latency requirements and choose the multi-cluster node count based on what is
required to support your workload’s throughput.

## Caching and warm-up

The first hybrid table query issued to a newly started warehouse triggers activities such as query planning,
index selection, I/O to load data, caching decisions, and, of course query execution. The query engine continues
to optimize memory and storage for the query. This time is called the “warm-up” period. Query latency
drops until the engine converges on a steady-state latency.

* Use dedicated warehouses for hybrid table workloads to avoid cache interference.
* Understand that reaching steady-state latency takes from several seconds to 2-3 minutes as the cache warms up.
* Configure auto-suspend and auto-scaling to balance efficiency and cache warmth.

Hybrid tables utilize multiple caching approaches to optimize performance. The plan cache reduces compilation
overhead by storing frequently used query plans. The column store data cache maintains frequently accessed data in
memory, and the metadata cache provides rapid access to table and index information. Hybrid tables do not
use a result cache.

These caches require some time to optimize for your workload patterns. Using dedicated warehouses
for hybrid table workloads prevents cache interference from other workloads. Initial queries
after a cold start experience higher latency until caches are populated.
If your workload has variable throughput patterns, you can enable autoscaling and auto-suspend to
reduce consumption or suspend your warehouse when demand is lower. When your warehouse restarts or
auto-scales to add a new cluster, caches will need to rehydrate. Set the scaling policy to
`Standard` rather than `Economy` for the best performance. see [Multi-cluster warehouses](warehouses-multicluster.md).

## Stored procedures and hybrid tables

Stored procedures are supported for hybrid tables; however, executing
transactions with [AUTOCOMMIT](../sql-reference/parameters.md) enabled or multi-statement transactions
offers better performance and efficiency than calling a stored procedure.

## Serverless tasks and hybrid tables

While serverless tasks are supported, be aware that you may
not experience optimal performance or efficiency for workloads that use hybrid
tables.

## Performance monitoring

The recommended view to use for hybrid table performance monitoring is
the [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md). This view
contains query execution details aggregated over a short period of time.

For example, to retrieve the average default interval performance over the last 24 hours
for a warehouse serving hybrid table requests:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.AGGREGATE_QUERY_HISTORY
  WHERE warehouse_name = 'HYBRID_TABLES_WAREHOUSE'
  AND query_type = 'SELECT'
  AND interval_start_time >= DATEADD(hour, -24, CURRENT_TIMESTAMP());
```

See the [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md) for more examples.

## Monitoring quotas and throttling

Hybrid tables implement quota controls at the database level for both hybrid storage and hybrid table requests throughput.
These quotas ensure consistent performance across all users. The default quotas are sufficient for
most initial implementations, but may need adjustment as workloads grow.

* Monitor the hybrid table requests quota by using the [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md).
* Monitor the hybrid storage quota by using the [STORAGE_USAGE view](../sql-reference/account-usage/storage_usage.md).
* High throttling percentages in query profiles indicate you’re approaching throughput limits. When you consistently utilize more
  than 70% of either quota, proactively request an increase through Snowflake Support.

The performance of hybrid tables is subject to throttling even in a case where virtual warehouse compute usage is not high.
To monitor your usage and determine whether a hybrid table is being throttled, see the example in the
[AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md). You can also retrieve the number of throttled hybrid table requests from
the `HYBRID_TABLE_REQUESTS_THROTTLED_COUNT` column.

For more information, see [Quotas and throttling](tables-hybrid-limitations.md).

## Troubleshooting performance issues

If you’re not achieving expected performance after implementing these best practices,
Snowflake Support can help analyze and optimize your implementation. When creating a support case,
include the following information to enable rapid resolution:

* Query IDs (UUIDs) for representative queries showing suboptimal performance
* Workload characteristics:

  + Typical query patterns
  + Expected versus actual latency
  + Concurrency requirements
  + Data storage volumes
  + Query response row size
  + Column [cardinality estimates](querying-approximate-cardinality.md)
* Any recent changes to table schemas, indexes, or workload patterns
* Throttling metrics from query profiles
* Performance differences between cold and warm warehouses

Include both fast and slow examples of similar queries if possible to help identify optimization
opportunities. This comparison helps support teams quickly identify potential configuration or design improvements.

---
title: Best practices for semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/best-practices-dev.md
section: User Guide
---

# Best practices for semantic views

This section describes best practices for the development of data pipelines and data products that incorporate
[semantic views](overview.md). These recommendations are primarily intended for data engineering and data science
professionals who need assistance with the following development processes:

> **Note:**
>
> This section does not address best practices for *modeling* semantic views. The information in this section assumes
> that Snowflake semantic views are being iteratively designed and need to be managed as part of a data engineering
> pipeline or data product.

## Ownership and data access

Semantic views facilitate access to information that exists in multiple canonical data sources. The semantic
layer enables a shift from thinking about how to query a specific data source to focusing on use cases and
business questions supported by a unified view of the available data. With this overall goal in mind, data
engineering and business teams must work together closely. The business teams have expertise in the business cases,
while the data engineering teams understand how to access the data from tables and views. Both teams need to share
ownership of the semantic model.

To secure the semantic layer in a way that serves the needs of both teams, use role-based access control (RBAC)
to grant appropriate privileges to semantic views and their dependent objects. If you’re starting from scratch,
you can use the sequence of GRANT statements in the next section as a working template. However, if your team
members already have permissions set up in a certain way for development, test, and production environments,
you might need to make some changes or direct them to use different roles as needed.

### Grant privileges on semantic view objects

Four key types of objects require the appropriate grants:

* Semantic views themselves
* Tables used in semantic view definitions
* Views used in semantic view definitions
* Cortex Search Service objects (generally applied on categorical data within views and tables).

To simplify privileges for a given domain, Snowflake recommends creating objects within the same database schema. Then you can
use a specific custom role to grant access to end users on that universe of objects. For example, for a “Sales Analysis” Cortex
Agent, you might create a `sales_analysis` schema within the `sales` database and create a role specifically for granting access to
semantic views and other data necessary for the agent (for example, `snowflake_intelligence_sales_analysis_role`). With the schema
and role in place, you should grant privileges on future objects to this role.

The following commands demonstrate this approach:

```sqlexample
-- Set variables for the specified role, database, and schema
SET my_role = 'snowflake_intelligence_sales_analysis_role';
SET my_db = 'sales';
SET my_schema = 'sales_analysis';
SET my_full_schema = $my_db || '.' || $my_schema;

-- Grant usage on the database and schema that will contain the tables and views
GRANT USAGE ON DATABASE IDENTIFIER($my_db) TO ROLE IDENTIFIER($my_role);
GRANT USAGE ON SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);

-- Grant privileges on future objects within the schema
-- For tables and views, SELECT is the typical "usage" grant for read access
GRANT SELECT ON FUTURE TABLES IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
GRANT SELECT ON FUTURE VIEWS IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
GRANT SELECT ON FUTURE SEMANTIC VIEWS IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);

-- For other object types, USAGE is the correct privilege
GRANT USAGE ON FUTURE FUNCTIONS IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
GRANT USAGE ON FUTURE PROCEDURES IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
GRANT USAGE ON FUTURE STAGES IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
GRANT USAGE ON FUTURE CORTEX SEARCH SERVICES IN SCHEMA IDENTIFIER($my_full_schema) TO ROLE IDENTIFIER($my_role);
```

The example includes grants on future tables and views to support scenarios where users might need direct access to the underlying
data objects in addition to the semantic views. While querying a semantic view only requires SELECT privilege on the semantic view itself,
granting access to tables and views ensures flexibility for users who might need to query or analyze the base data directly, outside of the
semantic layer. If you want to restrict users strictly to semantic views, you can omit the grants on tables and views and only grant privileges
on the semantic view objects. However, note that Cortex Analyst and Cortex Agents that depend on Cortex Analyst require the role
that is executing queries to have SELECT privilege on both the semantic view and its underlying tables.

While you’re in the process of setting up grants, keep the following additional points in mind:

* If your end data is already correctly shared with end users, you can proceed as is. However, if your Snowflake data has
  generally been shared via service accounts or at the BI layer, you need to take extra steps to share the underlying data with end users.
* The semantic view is a new object type in Snowflake; therefore, most role types don’t have default or inherited read/write access privileges on these views.
  Regardless of your underlying data sharing, work with your core Snowflake admin team to provision access to this new object type.
* For the benefit of Snowflake Intelligence (and the potential of expanding the functionality of agents there), it’s worth granting the USAGE
  privilege on stages, procedures, and functions (as shown in the example). You can use these objects to create custom tools within
  Snowflake Intelligence.
* CREATE SEMANTIC VIEW is a required schema-level privilege for any user who creates a semantic view or edits a semantic view in
  Snowsight.

### Limit access with masking policies and row access policies

Semantic views use *owner’s rights*, meaning that a user with access to a semantic view does not require separate access to its underlying tables;
the view’s owner (role) controls access. As long as a user has SELECT privilege on the semantic view object itself, privileges to see the base data
are not required. This behavior is consistent with the privileges [required to query standard views](../views-introduction.md).

Depending on the underlying data in your semantic views and Cortex Agents, you might not want all end users to have unlimited access to all
of that data although they have been granted privileges through your custom role. You can use
[Dynamic Data Masking policies](../security-column-ddm-intro.md) and [row access policies](../security-row-intro.md)
to control access to the underlying data at the row level. These policies can’t be set directly on semantic view attributes, but
if they are set on underlying tables and columns, they propagate to semantic views and are enforced. This is a security benefit for
applications that work with sensitive data. However, note that sample values, which are stored as metadata, are not masked. See
Sample values are not masked.

For example, you can create a row access policy and a masking policy and apply them both to an `accounts` table that underlies a semantic view
named `account_semantic_view`. In this example, rows are only visible when the user querying the semantic view has an email that matches an
authorized account. Secondly, the sensitive column (`sensitive_col`) is dynamically masked for unauthorized roles, even via semantic views.

```sqlexample
-- Row access policy (restricts rows by user email)
CREATE OR REPLACE ROW ACCESS POLICY my_schema.account_row_policy AS (user_email STRING)
  RETURNS BOOLEAN ->
    EXISTS (
      SELECT 1
      FROM my_schema.account_access_list
      WHERE email = user_email()
    );

-- Masking policy (masks "sensitive_col" for users without a privileged role)
CREATE OR REPLACE MASKING POLICY my_schema.sensitive_col_masking_policy AS (val STRING)
RETURNS STRING ->
  CASE
    WHEN current_role() IN ('SENSITIVE_DATA_ACCESS_ROLE') THEN val
    ELSE 'MASKED'
  END;

-- Attach row access policy to the user_email column in the accounts table
ALTER TABLE my_schema.accounts
  ADD ROW ACCESS POLICY account_row_policy ON (user_email);

-- Attach masking policy to the sensitive_col column
ALTER TABLE my_schema.accounts
  MODIFY COLUMN sensitive_col
  SET MASKING POLICY sensitive_col_masking_policy;

-- Create the semantic view on the "accounts" table
CREATE OR REPLACE SEMANTIC VIEW my_schema.account_semantic_view
  TABLES (
    accounts AS my_schema.accounts
    PRIMARY KEY (account_id)
  )
  FACTS (
    account_id AS accounts.account_id,
    account_name AS accounts.account_name
  )
  DIMENSIONS (
    user_email AS accounts.user_email,
    sensitive_col AS accounts.sensitive_col
);
```

If you are using dbt, you can apply these policies in a
[post-hook](https://docs.getdbt.com/reference/resource-configs/pre-hook-post-hook). For example:

```sqlexample-jinja
models:
- name: accounts
  description: "Table of accounts for semantic analytics."
  columns:
    - name: account_id
      description: "Unique identifier for the account."
    - name: account_name
      description: "Name of the account."
    - name: user_email
      description: "Email address linked to each account row."
    - name: sensitive_col
      description: "Sensitive information to be masked for non-privileged users."
  post-hook:
    - >
      ALTER TABLE {{ this }}
        ADD ROW ACCESS POLICY account_row_policy ON (user_email);
  ...
```

The code `ALTER TABLE {{ this }}` uses the dbt runtime variable for the fully qualified table name. Every time dbt builds or updates
the `accounts` table, the policy is applied.

### Sample values are not masked

Although users who can query semantic views that have masking policies applied can’t see the actual data values in query results, sample
values that were defined in Snowsight with Cortex Analyst aren’t masked because the masking policy is not applied to metadata.
A user who runs the [GET_DDL](../../sql-reference/functions/get_ddl.md) function on a semantic view that has sample values defined for dimensions
will see those exact values. For example, look at the values in the WITH EXTENSION clause in the following DDL:

```sqlexample
SELECT GET_DDL('SEMANTIC_VIEW','TEST_SAMPLE_VALUES');
```

```output
create or replace semantic view TEST_SAMPLE_VALUES
tables (MARCH_TEMPS
  ...)
facts (MARCH_TEMPS.TEMPERATURE as TEMPERATURE
  ...)
dimensions (MARCH_TEMPS.CITY as CITY,
MARCH_TEMPS.COUNTY as COUNTY,
MARCH_TEMPS.OBSERVED as OBSERVED)
  ...
with extension (CA='{"tables":[{"name":"MARCH_TEMPS","dimensions":[{"name":"CITY","sample_values":["South Lake Tahoe","Big Bear City"]},{"name":"COUNTY","sample_values":["San Bernardino","El Dorado"]}],"facts":[{"name":"TEMPERATURE","sample_values":["44","46","52"]}],"time_dimensions":[{"name":"OBSERVED","sample_values":["2025-03-15T09:50:00.000+0000","2025-03-15T09:55:00.000+0000","2025-03-15T10:10:00.000+0000"]}]}
...);
```

If necessary, you can provide representative, non-sensitive sample values, rather than use real values, when you create the view. Cortex Analyst can use any value that’s representative of a real value to determine the contents of the column.

## Options for creating, updating, and querying semantic views

You can author semantic views in Snowflake by writing a YAML file, using Snowflake DDL syntax, or using
the UI in Snowsight. Snowflake provides convenient functions for both importing YAML models and exporting semantic views to YAML models.
For details, see Conversion of YAML semantic models to native semantic views.

Generally, it’s best to start by creating semantic views (rather than semantic models), which are Snowflake metadata objects that benefit from
RBAC, usage statistics, and direct integration with other Snowflake features, including Cortex Analyst and Snowflake Intelligence.

To create a semantic view, you have three main options:

* Create a semantic view in Snowsight:

  + You can use the wizard, or you can upload a YAML specification.
  + The wizard approach is recommended for initial setup, and includes automatic creation of synonyms, sample values, and column descriptions.
    For instructions, see [Using Snowsight to create and manage semantic views](ui.md).
* Create a semantic view via a SQL CREATE OR REPLACE SEMANTIC VIEW statement, using any interface that supports SQL. For instructions, see
  [Using SQL commands to create and manage semantic views](sql.md).

  Programmatic creation and querying is possible through interfaces such as JDBC and ODBC drivers or the
  [SQL API](../../developer-guide/sql-api/index.md). However, you can’t use the
  [Snowflake REST APIs](../../developer-guide/snowflake-rest-api/snowflake-rest-api.md).
* Create a semantic view from a YAML specification in SQL by calling the SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML stored procedure.
  See Conversion of YAML semantic models to native semantic views.

In addition, if you are using dbt, you can configure the creation of semantic views in Snowflake by installing the `dbt_semantic_view` package.
For more information, see Integration with dbt projects.

Keep in mind that the setup of roles and privileges for your team members might have an impact on their ability to create semantic views.
For example, if your production environment requires you to run as a SERVICE user, you can’t sign in to Snowsight in that
environment; you have to use SQL commands to create and manage semantic views.

When semantic views have been created in a Snowflake database, administrators can manage them by using standard SHOW
and DESCRIBE commands, and users can access them downstream via [SQL SELECT statements](querying.md)
and in the following ways:

* Directly through the [Cortex Analyst](../snowflake-cortex/cortex-analyst.md) user interface
* Through [Streamlit](../../developer-guide/streamlit/about-streamlit.md) or other custom applications that use
  the Cortex Analyst API and/or [generate SELECT FROM SEMANTIC_VIEW statements](querying.md)
* Through Cortex Agents via Cortex Analyst (semantic views must be added to a new or existing
  [agent](../snowflake-cortex/cortex-agents-manage.md))

Except for comments, you can’t add or alter tables, columns, or metadata within existing semantic views, so you must recreate them
(with [CREATE OR REPLACE](../../sql-reference/sql/create-semantic-view.md) commands) to incorporate any changes. Also note that updating a
semantic view via a SQL command overwrites any manual edits that you have made in an active Snowsight session. Preserving both sets of
changes is not supported.

## Conversion of YAML semantic models to native semantic views

You can use SQL system functions and stored procedures to create semantic views from YAML models and create YAML models from semantic views.

Currently, Snowflake does not support bulk conversion; you must convert YAML files to native semantic views one at a time.
You can use the [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) stored procedure for conversion. If you need bulk conversion or integration
into a CI/CD pipeline, you have to script the conversions in a series. Snowflake does not plan to support batch/bulk conversion in the near future.

To export a native semantic view back to YAML, you can use the [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_read_yaml_from_semantic_view.md) function.
This function enables automated post-processing, round-tripping, or serialization into version control.

The same practical guidelines regarding size apply to both native semantic views and YAML-based models. There is a practical guideline (not a hard limit)
that for best performance, semantic views should have no more than 50-100 columns in total across all tables. This guideline applies to both
native semantic views and YAML-based models, and is mainly due to context window limits in AI components such as Cortex Analyst. Exceeding this
recommendation might lead to latency or quality degradation, but it is not a technical boundary.

## Automated deployment of semantic views

Where possible, leverage CI/CD pipelines and programmatic interfaces to create, modify, and manage semantic views. Ideally,
set up your workflow so that semantic view updates are synchronized automatically with your Git repository. This approach reduces manual errors
that might be caused by copying and pasting or pushing changes to Git.

* Store the semantic view YAML (or SQL DDL) in a Git repository; this approach supports version control, peer review, history, and rollback.
* If you are using Snowsight, export or download the YAML model regularly and commit it to Git.
* Trigger CI/CD pipelines on changes to Git (to run tests and accuracy checks, then deploy only if these tests pass).
* If necessary, roll back by redeploying the previous known-good YAML or DDL from Git.

To promote models from dev to test or production environments, you can incorporate automated deployment scripts for that purpose, or you
can use [schema-level cloning](../../sql-reference/sql/create-clone.md). Semantic views are cloned when schemas that contain them are cloned.
Cloning is a good option for promoting semantic views across databases and environments that use the same Snowflake account. To promote
semantic views across accounts, you can use [account replication](../account-replication-intro.md).

Semantic views can be shared directly via the [Snowflake Marketplace](../../collaboration/collaboration-marketplace-about.md) and
[data sharing](../../guides-overview-sharing.md). You can create [secure views](../views-secure.md) based on semantic views, and sharing
these nested views is supported. However, some resharing scenarios have limitations (such as when a consumer of a share wishes to further share a
view built on a semantic view).

To support materializing and maintaining semantic views as part of a Snowflake data pipeline, you can use a dbt project;
see Integration with dbt projects. Support for a similar process using the [Snowflake Terraform provider](../terraform.md) is planned.

Ultimately, your goal should be to enable a workflow that is similar to the following dbt example:

* Work on dbt project changes in an IDE, such as VS Code.
* Add a new semantic view definition to the dbt code.
* Push the changes to Git.
* Set up triggers that do a `'dbt run'` operation as part of the data pipeline.

As a result, the semantic view would be materialized in the Snowflake account.

## Integration with dbt projects

You can integrate semantic views into your dbt workflow by installing the `dbt_semantic_view` package that is available
from Snowflake Labs: <https://hub.getdbt.com/Snowflake-Labs/dbt_semantic_view/latest/>.

This package works natively with [dbt Projects on Snowflake](../data-engineering/dbt-projects-on-snowflake.md) or any dbt installation that has
access to a Snowflake account. You can use this package to materialize semantic views via dbt and reference them from downstream models.

> **Note:**
>
> The code samples in Snowflake Labs are intended for reference, testing, and educational purposes. These code samples aren’t covered by any
> Service-Level Agreement.

The following instructions assume that you are familiar with dbt and already have dbt installed in an environment that can connect to Snowflake.

To install and use the `dbt_semantic_view` package:

1. Add the following code to your `packages.yml` file:

   ```sqlexample-jinja
   packages:
     - package: Snowflake-Labs/dbt_semantic_view
       version: 1.0.3
   ```

   Be sure to include the version number. The version number of the package might change; using the latest version is recommended.
2. Run the `dbt deps` command to install the package.
3. In the dbt `models` directory, create a model that uses the semantic view materialization code:

   ```sqlexample-jinja
   {{ config(materialized='semantic_view') }}

   TABLES(
   {{ source('<source_name>', '<table_name>') }},
   {{ ref('<another_model>') }}
   )
   [ RELATIONSHIPS ( relationshipDef [ , ... ] ) ]
   [ FACTS ( factExpression [ , ... ] ) ]
   [ DIMENSIONS ( dimensionExpression [ , ... ] ) ]
   [ METRICS ( metricExpression [ , ... ] ) ]
   [ COMMENT = '<comment>' ]
   [ COPY GRANTS ]
   ```

   For example, you can materialize a simple semantic view as follows:

   ```sqlexample-jinja
   {{ config(materialized='semantic_view') }}

   TABLES(t1 AS {{ ref('base_table') }}, t2 as {{ source('seed_sources', 'base_table2') }})
   DIMENSIONS(t1.count as value, t2.volume as value)
   METRICS(t1.total_rows AS SUM(t1.count), t2.max_volume as max(t2.volume))
   COMMENT='test semantic view'
   ```
4. Configure your connection to Snowflake in dbt, specifying the connection details in your dbt
   `profiles.yml` file. For more information, see the
   [dbt documentation](https://docs.getdbt.com/docs/core/connect-data-platform/profiles.yml). For example:

   ```yaml
   semantic_project:
     target: snowflake
     outputs:
       snowflake:
       type: "snowflake"
       account: "{{ env_var('SNOWFLAKE_ACCOUNT') }}"
       user: "{{ env_var('SNOWFLAKE_USER') }}"
       password: "{{ env_var('SNOWFLAKE_PASSWORD') }}"
       authenticator: "{{ env_var('SNOWFLAKE_AUTHENTICATOR') }}"
       role: "{{ env_var('SNOWFLAKE_ROLE') }}"
       database: "{{ env_var('SNOWFLAKE_DATABASE') }}"
       warehouse: "{{ env_var('SNOWFLAKE_WAREHOUSE') }}"
       schema: "{{ env_var('SNOWFLAKE_SCHEMA') }}"
       threads: 4
   ```
5. Given this profile, you could authenticate with the following environment variables:

   ```bash
   $ export SNOWFLAKE_ACCOUNT=snowflake_acct1
   $ export SNOWFLAKE_USER=sem_user1
   $ export SNOWFLAKE_PASSWORD=**************
   $ export SNOWFLAKE_AUTHENTICATOR=externalbrowser
   $ export SNOWFLAKE_ROLE=semantic_role
   $ export SNOWFLAKE_DATABASE=sem_db
   $ export SNOWFLAKE_WAREHOUSE=sem_wh
   $ export SNOWFLAKE_SCHEMA=sem_schema
   ```
6. Run the `dbt build` command to connect to your Snowflake account and create the model. The following
   example builds a specific model defined as `models/semantic_view_basic`. Note that another model,
   `table_refer_to_semantic_view`, depends on this model, so the command requires the `+` sign at the end.

   ```bash
   $ dbt build --target snowflake --select semantic_view_basic+
   23:43:16  Running with dbt=1.11.0-b3
   23:43:17  Registered adapter: snowflake=1.10.2
   23:43:17  Found 9 models, 8 data tests, 1 seed, 2 operations, 2 sources, 500 macros
   23:43:17
   23:43:17  Concurrency: 4 threads (target='snowflake')
   23:43:17
   23:43:32  1 of 2 START hook: dbt_semantic_view_integration_tests.on-run-start.0 .......... [RUN]
   23:43:32  1 of 2 OK hook: dbt_semantic_view_integration_tests.on-run-start.0 ............. [OK in 0.90s]
   23:43:33  2 of 2 START hook: dbt_semantic_view_integration_tests.on-run-start.1 .......... [RUN]
   23:43:33  2 of 2 OK hook: dbt_semantic_view_integration_tests.on-run-start.1 ............. [OK in 0.38s]
   23:43:33
   23:43:33  1 of 6 START sql semantic_view model sem_schema.semantic_view_basic ............ [RUN]
   23:43:33  1 of 6 OK created sql semantic_view model sem_schema.semantic_view_basic ....... [SUCCESS 1 in 0.26s]
   23:43:33  3 of 6 START test semantic_view_basic_has_no_copy_grants ....................... [RUN]
   23:43:33  2 of 6 START test semantic_view_basic_has_comment .............................. [RUN]
   23:43:33  4 of 6 START test semantic_view_sum_matches_base_table ......................... [RUN]
   23:43:33  2 of 6 PASS semantic_view_basic_has_comment .................................... [PASS in 0.23s]
   23:43:34  3 of 6 PASS semantic_view_basic_has_no_copy_grants ............................. [PASS in 0.75s]
   23:43:34  4 of 6 PASS semantic_view_sum_matches_base_table ............................... [PASS in 1.05s]
   23:43:34  5 of 6 START sql table model sem_schema.table_refer_to_semantic_view ........... [RUN]
   23:43:35  5 of 6 OK created sql table model sem_schema.table_refer_to_semantic_view ...... [SUCCESS 1 in 1.22s]
   23:43:35  6 of 6 START test table_refer_semantic_view_matches_semantic_view .............. [RUN]
   23:43:36  6 of 6 PASS table_refer_semantic_view_matches_semantic_view .................... [PASS in 0.26s]
   23:43:36
   23:43:36  Finished running 2 project hooks, 1 semantic view model, 1 table model, 4 data tests in 0 hours 0 minutes and 19.34 seconds (19.34s).
   23:43:36
   23:43:36  Completed successfully
   23:43:36
   23:43:36  Done. PASS=8 WARN=0 ERROR=0 SKIP=0 NO-OP=0 TOTAL=8
   ```

For more information about the `dbt_semantic_view` package, which includes pre-built models and tests that you can run,
see the `README.md` file. Go to <https://hub.getdbt.com/Snowflake-Labs/dbt_semantic_view/latest/> and select View on GitHub.

See also <https://www.snowflake.com/en/engineering-blog/dbt-semantic-view-package/>.

## Integration with BI tools

A number of BI tool vendors offer integrations with Snowflake semantic views. To learn more about these integrations, please contact
your BI tool account teams and follow these links:

* Sigma: <https://www.sigmacomputing.com/blog/snowflake-semantic-views-launch>
* Omni: <https://omni.co/snowflake>
* Honeydew: <https://honeydew.ai/blog/honeydew-and-snowflake-semantic-views/>
* Hex: <https://hex.tech/blog/introducing-snowflake-semantic-sync-aisql/>

---
title: Best practices for Snowpipe Streaming with classic architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-recommendation.md
section: User Guide
---

# Best practices for Snowpipe Streaming with classic architecture

## Cost optimization

As a best practice, we recommend calling the API with fewer Snowpipe Streaming clients that write more data per second. Use a Java or Scala application to aggregate data from multiple sources, such as IoT devices or sensors, and then use the Snowflake Ingest SDK to call the API to load data at higher flow rates. The API efficiently aggregates data across multiple target tables in an account.

A single Snowpipe Streaming client can open multiple channels to send data, but the client cost is only charged per active client. The number of channels does not affect the client cost. Therefore, we recommend using multiple channels per client for performance and cost optimization.

When you use the same tables for both batch and streaming ingestion, you can also reduce Snowpipe Streaming compute costs because of pre-empted file migration operations.

Snowpipe Streaming handles and bills all file-migration compute costs for tables with [Automatic Clustering](../tables-auto-reclustering.md) enabled, where Snowpipe Streaming is inserting data. This process optimizes and migrates data within the same transaction, incorporating costs previously associated with Automatic Clustering.

## Performance recommendations

For optimal performance in high-throughput deployments, we recommend the following actions:

* If you are loading multiple rows, using `insertRows` is more efficient and cost effective than calling `insertRow` multiple times because less time is spent on locks.

  + Keep the size of each row batch passed to `insertRows` below 16 MB compressed.
  + The optimal size of row batches is between 10-16 MB.
* Pass values for the TIME, DATE, and all TIMESTAMP columns as one of the [supported types](snowpipe-streaming-table-support.md) from the `java.time` package.
* When you create a channel using `OpenChannelRequest.builder`, set the `OnErrorOption` to `OnErrorOption.CONTINUE`, and manually check the return value from `insertRows` for potential ingestion errors. This approach currently leads to a better performance than relying on exceptions thrown when `OnErrorOption.ABORT` is used.
* When you set the default log level to DEBUG, make sure that the following loggers keep logging on INFO: their DEBUG output is very verbose, which can lead to a significant performance degradation.

  > + `net.snowflake.ingest.internal.apache.parquet`
  > + `org.apache.parquet`
* Channels should be long lived when a client is actively inserting data and should be reused because offset token information is retained. Don’t close channels after inserting data because data inside the channels is automatically flushed based on the time configured in `MAX_CLIENT_LAG`.

## Latency recommendations

When you use Snowpipe Streaming, latency refers to how quickly data written to a channel becomes available for querying in Snowflake. Snowpipe Streaming automatically flushes data within channels every one second, meaning you don’t need to explicitly close a channel for data to be flushed.

**Configuring latency with MAX_CLIENT_LAG**
With Snowflake Ingest SDK versions 2.0.4 and later, you can fine-tune data flush latency by using the `MAX_CLIENT_LAG` option:

* Standard Snowflake Tables (non-Iceberg): The default MAX_CLIENT_LAG is 1 second. You can override this to set your desired flush latency anywhere from 1 second up to a maximum of 10 minutes.
* Snowflake-managed Iceberg Tables: Supported by Snowflake Ingest SDK versions 3.0.0 and later, the default `MAX_CLIENT_LAG` is 30 seconds. This default helps ensure that optimized Parquet files are created, which is beneficial for query performance. While you can set a lower value, it’s generally not recommended unless you have exceptionally high throughput.

**Latency recommendations for optimal performance**
Setting `MAX_CLIENT_LAG` effectively can significantly impact query performance and the internal migration process (where Snowflake compacts small partitions).

For low-throughput scenarios, where you might only be sending a small amount of data (for example,e.g., 1 row or 1 KB) every second, frequent flushes can lead to numerous small partitions. This can increase query compilation time as Snowflake has to resolve many tiny partitions, especially if queries run before the migration process can compact them.

Therefore, you should set MAX_CLIENT_LAG as high as your target latency requirements allow. Buffering inserted rows for a longer duration allows Snowpipe Streaming to create better-sized partitions, which improves query performance and reduces migration overhead.
For example, if you have a task that runs every minute to merge or transform your streamed data, an optimal `MAX_CLIENT_LAG` might be between 50 and 55 seconds. This ensures data is flushed in larger chunks just before your downstream process runs.

**Kafka connector for Snowpipe Streaming**
It’s important to note that the Kafka connector for Snowpipe Streaming has its own internal buffer. Whenthe Kafka buffer flush time is reached, data is then sent to Snowflake with the standard one-second latency through Snowpipe Streaming. For more information, see [buffer.flush.time setting](snowpipe-streaming-classic-kafka.md)

## Exactly-once delivery best practices

Achieving exactly-once delivery can be challenging, and adherence to the following principles in your custom code is critical:

> * To ensure appropriate recovery from exceptions, failures, or crashes, you must always reopen the channel and restart ingestion using the latest committed offset token.
> * Although your application may maintain its own offset, it’s crucial to use the latest committed offset token provided by Snowflake as the source of truth and reset your own offset accordingly.
> * The only instance in which your own offset should be treated as the source of truth is when the offset token from Snowflake is set or reset to NULL. A NULL offset token usually means one of the following:
>
>   > + This is a new channel, so no offset token has been set.
>   > + The target table was dropped and recreated, so the channel is considered new.
>   > + There was no ingestion activity through the channel for 30 days, so the channel was automatically cleaned up, and the offset token information was lost.
> * If necessary, you can periodically purge the source data that has already been committed based on the latest committed offset token, and advance your own offset.
> * If the table schema is modified when Snowpipe Streaming channels are active, the channel must be reopened. The Snowflake Kafka connector handles this scenario automatically, but if you use Snowflake Ingest SDK directly, you must reopen the channel yourself.

For more information about how the Kafka connector with Snowpipe Streaming achieves exactly-once delivery, see
[Exactly-once semantics](snowpipe-streaming-classic-kafka.md).

---
title: Best practices for Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-best-practices.md
section: User Guide
---

# Best practices for Snowpipe Streaming with high-performance architecture

This guide outlines key best practices to design and implement robust data ingestion pipelines by using Snowpipe Streaming with high-performance architecture. By following these best practices, you ensure that your pipelines are durable, reliable, and have efficient error handling.

## Manage channels strategically

Apply the following channel-management strategies for performance and long-term stability:

* **Use long-lived channels**: To minimize overhead, open a channel once, and then keep it active for the duration of the ingestion task. Avoid repeatedly opening and closing channels.
* **Use deterministic channel names**: Apply a consistent, predictable naming convention — for example, `source-env-region-client-id` — to simplify troubleshooting and facilitate automated recovery processes.
* **Scale out with multiple channels**: To increase throughput, open multiple channels. These channels can point to a single target pipe or to multiple pipes, depending on service limits and your throughput requirements.
* **Monitor channel status**: Regularly use the `getChannelStatus` method to monitor the health of your ingestion channels.

  + Track the `last_committed_offset_token` to verify that data is being ingested successfully and that the pipeline is making progress.
  + Monitor the `row_error_count` to detect bad records or other ingestion issues early.

## Validate the schema consistently

Ensure that incoming data conforms to the expected table schema to prevent ingestion failures and maintain data integrity:

* **Client-side validation**: Implement schema validation on the client side to provide immediate feedback and reduce server-side errors. Although full row-by-row validation offers maximum safety, a method that performs better might involve selective validation; for example, at batch boundaries or by sampling rows.
* **Server-side validation**: The high-performance architecture can offload schema validation to the server. Errors and their counts are reported through `getChannelStatus` if schema mismatches occur during ingestion into the target pipe and table.

## Add client-side metadata columns

To enable robust error detection and recovery, you must carry ingestion metadata as part of the row payload. This requires planning your data shape and PIPE definition in advance.

Add the following columns to your row payload before ingestion:

* `CHANNEL_ID`; for example, a compact INTEGER
* `STREAM_OFFSET`; a BIGINT that is monotonically increasing per channel, such as a Kafka partition offset

Together, these columns uniquely identify records per channel and enable you to trace the data’s origin.

Optionally, add a `PIPE_ID` column if multiple pipes ingest into the same target table. With this column, you can trace rows back to their ingestion pipeline. You can store descriptive pipe names in a separate lookup table, mapping them to compact integers to reduce storage costs.

## Detect and recover from errors using metadata offsets

Combine channel monitoring with your metadata columns to detect and recover from issues:

* **Monitor status**: Regularly check `getChannelStatus`. An increasing `row_error_count` is a strong indicator of a potential problem.
* **Detect missing records**: If errors are detected, use a SQL query to identify missing or out-of-order records by checking for gaps in your `STREAM_OFFSET` sequence.

```sqlexample
SELECT
  PIPE_ID,
  CHANNEL_ID,
  STREAM_OFFSET,
  LAG(STREAM_OFFSET) OVER (
    PARTITION BY PIPE_ID, CHANNEL_ID
    ORDER BY STREAM_OFFSET
  ) AS previous_offset,
  (LAG(STREAM_OFFSET) OVER (
    PARTITION BY PIPE_ID, CHANNEL_ID
    ORDER BY STREAM_OFFSET
  ) + 1) AS expected_next
FROM my_table
QUALIFY STREAM_OFFSET != previous_offset + 1;
```

## Use compression for REST API requests

When you use the Snowpipe Streaming REST API, use compression to send more data per request and reduce network overhead.

Although the REST API has a physical limit of 4 MB per request, this limit applies to the observed transfer size. By using compression, you can fit a larger uncompressed data volume into each request, enabling higher throughput and reducing the number of API calls required.

Snowflake recommends using ZSTD as the high-performance compression algorithm, although Gzip is also supported.

## Optimize ingestion performance and cost with MATCH_BY_COLUMN_NAME

Configure your pipe to map the necessary columns from your source data instead of ingesting all data into a single VARIANT column. To do this, use `MATCH_BY_COLUMN_NAME = CASE_SENSITIVE` or apply transformations in your pipe definition. This best practice not only optimizes your ingestion costs but also enhances the overall performance of your streaming data pipeline.

This best practice has the following important advantages:

* By using `MATCH_BY_COLUMN_NAME = CASE_SENSITIVE`, you’re only billed for the data values that are ingested into your target table. In contrast, ingesting data into a single VARIANT column bills you for all JSON bytes, including both the keys and the values. For data with verbose or numerous JSON keys, this can lead to a significant and unnecessary increase in your ingestion costs.
* Snowflake’s processing engine is more computationally efficient. Instead of parsing the entire JSON object into a VARIANT, and then extracting the required columns, this method directly extracts the necessary values.

## Use native data types for semi-structured data

For optimal performance and data integrity, provide semi-structured data by using native language objects rather than serialized strings.

* **Performance**: With native objects, the SDK can handle data more efficiently without requiring additional parsing steps on the Snowflake server.
* **Type Safety**: The high-performance architecture treats string literals as literal text. By using native objects, you ensure that your data is stored as structured JSON rather than escaped string values.

**Java example**:

```java
// Preferred: SDK converts the List to a structured ARRAY
row.put("tags", Arrays.asList("electronics", "sale"));
```

**Python example**:

```python
# Preferred: SDK converts the dict to a structured VARIANT
row["payload"] = {"event_id": 101, "status": "active"}
```

## Get Prometheus metrics

To get performance metrics from the Snowpipe Streaming high-performance client, you must enable the built-in Prometheus metrics server and configure your Prometheus service to scrape the endpoint.

Enable the metrics server by setting the environment variable `SS_ENABLE_METRICS` to `true` before running your application.

Scrape the metrics endpoint on the host that is running your Snowpipe Streaming ingest process. The default path is `/metrics` on the host and port defined by `SS_METRICS_IP` and `SS_METRICS_PORT`.

### Example: Verifying the metrics endpoint (local process/dev box)

```bash
# Enable Prometheus metrics
export SS_ENABLE_METRICS=true
# Run your application (the server starts on 127.0.0.1:50000 by default)

# Curl the endpoint to verify the metrics are exposed
curl http://127.0.0.1:50000/metrics
```

### Example: Prometheus scrape configuration

Point your Prometheus service at the host running the Snowpipe Streaming client.

```yaml
scrape_configs:
  - job_name: snowpipe_streaming_hp
    metrics_path: /metrics   # default is /metrics
    static_configs:
      - targets: ['127.0.0.1:50000']
```

## Designing for resiliency

### Wrap ingestion in try-catch blocks

Don’t assume that `insertRows` always succeeds. Ensure that your ingestion loop can catch `SFException` and interpret the HTTP status codes, specifically 409 for invalidations and 429 for throttling.

### Implement exponential back-off

For retryable errors (429, 500, 503), don’t retry immediately. Use an exponential back-off strategy —– increasing the wait time between each retry —– to allow the system to recover.

### Verify progress with offset tokens

Periodically call `getLatestCommittedOffsetToken` to track which data was successfully persisted. If a 409 error occurs, use this token to identify the exact point from which to replay data after reopening the channel.

### Monitor channel status

Regularly check `getChannelStatus()`. If the status code is anything other than `SUCCESS`, trigger your error-handling logic to reset the channel or client connection.

---
title: Billing for storage lifecycle policies
source: https://docs.snowflake.com/en/user-guide/storage-management/storage-lifecycle-policies-billing.md
section: User Guide
---

# Billing for storage lifecycle policies

When you use storage lifecycle policies, you incur costs for policy execution, data storage, and data operations.
This topic explains the cost components associated with storage lifecycle policies and provides
guidance on how to monitor each component.

## Policy execution costs

Each time Snowflake runs a storage lifecycle policy, you incur serverless compute charges to identify and process rows that
meet your defined conditions. Policies run automatically, approximately once every 24-hour period.
For billing details, see table 5 in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Monitoring

To view credits that are consumed by policy execution, use the following metering history views.
Filter for the STORAGE_LIFECYCLE_POLICY_EXECUTION service type:

* [ACCOUNT_USAGE.METERING_DAILY_HISTORY](../../sql-reference/account-usage/metering_daily_history.md)
* [ACCOUNT_USAGE.METERING_HISTORY](../../sql-reference/account-usage/metering_history.md)
* [ORGANIZATION_USAGE.METERING_DAILY_HISTORY](../../sql-reference/organization-usage/metering_daily_history.md)

To view policy execution history and metadata, use the following views and function:

* [ACCOUNT_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/account-usage/storage_lifecycle_policy_history.md)
* [ORGANIZATION_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/organization-usage/storage_lifecycle_policy_history.md)
* [INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/functions/storage_lifecycle_policy_history.md) (table function)

> **Note:**
>
> Policy execution times can vary from execution to execution, even when processing similar amounts of data. To better understand
> the cost of policy executions, monitor the credits charged for each execution along with the amount of data expired or archived.

## Archive storage costs

When you archive data, you incur charges for moving data to archive storage,
storing data in archive storage, and
retrieving archived data. If you drop a table with archived data,
you might also incur minimum storage duration charges.

### Moving data to archive storage

When a policy archives data, you incur a one-time serverless compute charge to move data from regular storage to the
cool or cold archive storage tier. For billing details, see table 5 in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

#### Monitoring

To view the credits consumed to move data to archive storage, use the following metering history views.
Filter for the STORAGE_LIFECYCLE_POLICY_EXECUTION and ARCHIVE_STORAGE_WRITE service types:

* [ACCOUNT_USAGE.METERING_DAILY_HISTORY](../../sql-reference/account-usage/metering_daily_history.md)
* [ACCOUNT_USAGE.METERING_HISTORY](../../sql-reference/account-usage/metering_history.md)
* [ORGANIZATION_USAGE.METERING_DAILY_HISTORY](../../sql-reference/organization-usage/metering_daily_history.md)

### Data storage

After policy execution, you temporarily incur charges for *both* archive storage and [table storage](../cost-exploring-data-storage.md).
Snowflake immediately copies data into the specified archive storage tier when the policy runs. However, the data remains in
table storage for seven or more days, which is the 7-day [Fail-safe](../data-failsafe.md) period plus your
[Time Travel](../data-time-travel.md) retention period set by DATA_RETENTION_TIME_IN_DAYS.

After this period, data in archive storage incurs ongoing storage charges.
For billing details, see table 3(e) in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

#### Monitoring

To view the volume of archived data in bytes for a table, database, or for your account, use the following views:

**Account Usage views:**

* [ACCOUNT_USAGE.TABLE_STORAGE_METRICS](../../sql-reference/account-usage/table_storage_metrics.md)
* [ACCOUNT_USAGE.TABLES](../../sql-reference/account-usage/tables.md)
* [ACCOUNT_USAGE.DATABASE_STORAGE_USAGE_HISTORY](../../sql-reference/account-usage/database_storage_usage_history.md)
* [ACCOUNT_USAGE.STORAGE_USAGE](../../sql-reference/account-usage/storage_usage.md)

**Organization Usage views:**

* [ORGANIZATION_USAGE.TABLE_STORAGE_METRICS](../../sql-reference/organization-usage/table_storage_metrics.md)
* [ORGANIZATION_USAGE.TABLES](../../sql-reference/organization-usage/tables.md)
* [ORGANIZATION_USAGE.DATABASE_STORAGE_USAGE_HISTORY](../../sql-reference/organization-usage/database_storage_usage_history.md)

### Data retrieval

When you query [retrieve archived data](storage-lifecycle-policies-retrieving-archived-data.md),
you incur the following charges:

* **Retrieval cost**: One-time charge to retrieve archived data from the archive storage tier.
* **File processing**: Serverless compute charge to process the retrieved data.
* **Temporary storage** (COLD tier only): When you retrieve data from the COLD tier, Snowflake temporarily stores the
  retrieved data in normal storage. This incurs additional storage charges.

For billing details, see tables 3(e) and 5 in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Note:**
>
> To estimate retrieval cost, use [EXPLAIN](../../sql-reference/sql/explain.md) with
> [CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md). This shows the number of files that
> will be retrieved from archive storage. For an example, see [Retrieve archived data](storage-lifecycle-policies-retrieving-archived-data.md).

#### Monitoring

To view consumed credits and the cost related to retrieving archived data, use the following views:

* [ACCOUNT_USAGE.ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY](../../sql-reference/account-usage/archive_storage_data_retrieval_usage_history.md)

To view the credits consumed for file processing in order to retrieve archived data, use the following metering history views.
Filter for the ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING service type:

* [ACCOUNT_USAGE.METERING_DAILY_HISTORY](../../sql-reference/account-usage/metering_daily_history.md)
* [ACCOUNT_USAGE.METERING_HISTORY](../../sql-reference/account-usage/metering_history.md)
* [ORGANIZATION_USAGE.METERING_DAILY_HISTORY](../../sql-reference/organization-usage/metering_daily_history.md)

To view temporary storage that you use when you retreive data from the COLD storage tier,
use the ARCHIVE_STORAGE_RETRIEVAL_TEMP_BYTES column in the
[ACCOUNT_USAGE.STORAGE_USAGE](../../sql-reference/account-usage/storage_usage.md).

### Minimum storage duration charges

Cloud providers impose a minimum storage duration for archive storage tiers. When you drop a table, Snowflake
deletes the table data from storage. If the table data is in archive storage and hasn’t been there for
the minimum duration set by the cloud provider, Snowflake charges you for the minimum duration.

For example, if you drop a table with data that Snowflake moved to the AWS cold storage tier 15 days ago, you still
incur storage cost for the remaining 165 days of the minimum cold storage period, which is the 180-day minimum minus 15 days already stored.

For archive storage billing details, see table 3(e) in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

The minimum storage duration varies by cloud provider and storage tier:

* **COOL tier**: 90-day minimum
* **COLD tier**: 180-day minimum

#### Monitoring

To view the amount of data that is subject to minimum storage duration charges for a table, use the following columns
in the TABLE_STORAGE_METRICS view:

* ARCHIVE_STORAGE_COOL_EARLY_DELETION_PENALTY_BYTES
* ARCHIVE_STORAGE_COLD_EARLY_DELETION_PENALTY_BYTES

These columns are available in the following topics:

* [ACCOUNT_USAGE.TABLE_STORAGE_METRICS](../../sql-reference/account-usage/table_storage_metrics.md)
* [ORGANIZATION_USAGE.TABLE_STORAGE_METRICS](../../sql-reference/organization-usage/table_storage_metrics.md)

---
title: Browser test
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/browser-test.md
section: User Guide
---

# Browser test

For a successful connection to Snowflake, the URLs listed in the Snowflake allowlist should be accessible via the browser. To verify, you should try to access the URLs in your browser. The results of this browser test can help Snowflake understand the possible root causes of your issue.

* For the Snowflake URL, you should be able to access the login page in the browser.
* For a Stage URL, you should expect some sort of 403 error, but still succeed in connecting. See the examples below:

  + AWS stage successful connection example:

    ```xml
    <Error>
      <Code>AccessDenied</Code>
      <Message>Access Denied</Message>
      <RequestId>5CXSPXCBPY8DDQ0N</RequestId>
      <HostId>
        fQPUOjEOGM2lpG4TWXCwAGfOnuR01LHHzlm6+0rmzC3Zu7geOe4IEwNwIOLLl43Tk183XFJG5pw=
      </HostId>
    </Error>
    ```
  + Azure stage successful connection example:

    ```xml
    <Error>
      <Code>InvalidQueryParameterValue</Code>
      <Message>
        Value for one of the query parameters specified in the request URI is invalid. RequestId:1c0658d7-e01e-010c-5be8-8023af000000 Time:2022-06-15T18:44:55.1523344Z
      </Message>
      <QueryParameterName>comp</QueryParameterName>
      <QueryParameterValue/>
      <Reason/>
    </Error>
    ```
  + GCP stage successful connection example:

    ```xml
    <Error>
      <Code>AccessDenied</Code>
      <Message>Access denied.</Message>
      <Details>
        Anonymous caller does not have storage.objects.list access to the Google Cloud Storage bucket.
      </Details>
    </Error>
    ```
  + OCSP URL (`http://ocsp.snowflakecomputing.com/ocsp_response_cache.json`) successful connection example:

    You should see a progress bar for downloading the `ocsp_response_cache.json` file to the location specified in your browser.

    If the test is unsuccessful, you would see an error similar to the following:

    ```xml
    <Error>
      <Code>AccessDenied</Code>
      <Message>Access Denied</Message>
      <RequestId>YE1ZB5WN693FMJNP</RequestId>
      <HostId>hOZHtpAS4SU8/qsX5vZG/dOlWe33ttwYyCy9zrENWN7V/B38JTxdaCCyA+gePDoDUZ3VNf95Pn0=</HostId>
    </Error>
    ```

After completing these steps, continue with [follow-up actions](followup-actions.md).

---
title: Build a data processing pipeline using a directory table
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-pipeline.md
section: User Guide
---

# Build a data processing pipeline using a directory table

Build a data processing pipeline by combining a directory table,
which tracks and stores file-level metadata on a stage, with other Snowflake objects
such as streams and tasks.

A [stream](streams-intro.md) records data manipulation language (DML) changes made to a directory table,
table, external table, or the underlying tables in a view. A [task](tasks-intro.md) executes a single action,
which can be a SQL command or an extensive [user-defined function (UDF)](../developer-guide/udf/udf-overview.md).
You can schedule a task to run periodically, or run a task on demand.

## Example: Create a simple pipeline to process PDFs

This example builds a simple data processing pipeline that does the following:

1. Detects PDF files added to a stage.
2. Extracts data from the files.
3. Inserts the data into a Snowflake table.

The pipeline uses a stream to detect changes to a directory table on the stage,
and a task that executes a UDF to extract data from the files.

The following diagram summarizes how the example pipeline works:

### Step 1: Create a stage with a directory table enabled

Create an internal stage with a directory table enabled.
The example statement sets the `ENCRYPTION` type to `SNOWFLAKE_SSE` to
[enable unstructured data access on the stage](unstructured-intro.md).

```sqlexample
CREATE OR REPLACE STAGE my_pdf_stage
  ENCRYPTION = ( TYPE = 'SNOWFLAKE_SSE')
  DIRECTORY = ( ENABLE = TRUE);
```

### Step 2: Create a stream on the directory table

Create a stream on the directory table by specifying the stage that the directory table belongs to.
The stream will track changes to the directory table. In step 5 of this example, we use this stream to construct a task.

```sqlexample
CREATE STREAM my_pdf_stream ON STAGE my_pdf_stage;
```

### Step 3: Create a user-defined function to parse PDFs

Create a UDF that extracts data from PDF files. The task that you create in a later step will call this UDF to process
newly-added files on the stage.

The following example statement creates a Python UDF named `PDF_PARSE` that processes PDF files containing product review data.
The UDF extracts form field data using the [PyPDF2](https://pypi.org/project/PyPDF2/) library.
It returns a dictionary that contains the form names and values as key-value pairs.

> **Note:**
>
> The UDF reads dynamically-specified files using the `SnowflakeFile` class. To learn more about `SnowflakeFile`,
> see [Reading a dynamically-specified file with SnowflakeFile](../developer-guide/udf/python/udf-python-examples.md).

```sqlexample
CREATE OR REPLACE FUNCTION PDF_PARSE(file_path string)
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.8'
  HANDLER = 'parse_pdf_fields'
  PACKAGES=('typing-extensions','PyPDF2','snowflake-snowpark-python')
AS
$$
from pathlib import Path
import PyPDF2 as pypdf
from io import BytesIO
from snowflake.snowpark.files import SnowflakeFile

def parse_pdf_fields(file_path):
    with SnowflakeFile.open(file_path, 'rb') as f:
        buffer = BytesIO(f.readall())
    reader = pypdf.PdfFileReader(buffer)
    fields = reader.getFields()
    field_dict = {}
    for k, v in fields.items():
        if "/V" in v.keys():
            field_dict[v["/T"]] = v["/V"].replace("/", "") if v["/V"].startswith("/") else v["/V"]

    return field_dict
$$;
```

### Step 4: Create a table to store the file contents

Next, create a table where each row stores information about a file on the
stage in columns named `file_name` and `file_data`. The task that you create in a later step
will load data into this table.

```sqlexample
CREATE OR REPLACE TABLE prod_reviews (
  file_name varchar,
  file_data variant
);
```

### Step 5: Create a task

Create a scheduled task that checks the stream for new files on the stage and inserts the file data into the `prod_reviews` table.

The following statement creates a scheduled task using the stream created previously.
The task uses the [SYSTEM$STREAM_HAS_DATA](../sql-reference/functions/system_stream_has_data.md) function
to check whether the stream contains change data capture (CDC) records.

```sqlexample
CREATE OR REPLACE TASK load_new_file_data
  WAREHOUSE = 'MY_WAREHOUSE'
  SCHEDULE = '1 minute'
  COMMENT = 'Process new files on the stage and insert their data into the prod_reviews table.'
  WHEN
  SYSTEM$STREAM_HAS_DATA('my_pdf_stream')
  AS
  INSERT INTO prod_reviews (
    SELECT relative_path as file_name,
    PDF_PARSE(build_scoped_file_url('@my_pdf_stage', relative_path)) as file_data
    FROM my_pdf_stream
    WHERE METADATA$ACTION='INSERT'
  );
```

### Step 6: Run the task to test the pipeline

To check that the pipeline works, you can add files to the stage, manually execute the task, and then query the `product_reviews` table.

Start by adding some PDF files to the `my_pdf_stage` stage, and then refresh the stage.

> **Note:**
>
> This example uses [PUT](../sql-reference/sql/put.md) commands, which you can’t run from a worksheet in the Snowflake web interface.
> To upload files with Snowsight, see [Upload files onto a named internal stage](data-load-local-file-system-stage-ui.md).

```sqlexample
PUT file:///my/file/path/prod_review1.pdf @my_pdf_stage AUTO_COMPRESS = FALSE;
PUT file:///my/file/path/prod_review2.pdf @my_pdf_stage AUTO_COMPRESS = FALSE;

ALTER STAGE my_pdf_stage REFRESH;
```

You can query the stream to verify that it has recorded the two PDF files that we added to the stage.

```sqlexample
SELECT * FROM my_pdf_stream;
```

Now, execute the task to process the PDF files and update the `product_reviews` table.

```sqlexample
EXECUTE TASK load_new_file_data;
+----------------------------------------------------------+
| status                                                   |
|----------------------------------------------------------|
| Task LOAD_NEW_FILE_DATA is scheduled to run immediately. |
+----------------------------------------------------------+
1 Row(s) produced. Time Elapsed: 0.178s
```

Query the `product_reviews` table to see that the task has added a row for each PDF file.

```sqlexample
select * from prod_reviews;
+------------------+----------------------------------+
| FILE_NAME        | FILE_DATA                        |
|------------------+----------------------------------|
| prod_review1.pdf | {                                |
|                  |   "FirstName": "John",           |
|                  |   "LastName": "Johnson",         |
|                  |   "Middle Name": "Michael",      |
|                  |   "Product": "Tennis Shoes",     |
|                  |   "Purchase Date": "03/15/2022", |
|                  |   "Recommend": "Yes"             |
|                  | }                                |
| prod_review2.pdf | {                                |
|                  |   "FirstName": "Emily",          |
|                  |   "LastName": "Smith",           |
|                  |   "Middle Name": "Ann",          |
|                  |   "Product": "Red Skateboard",   |
|                  |   "Purchase Date": "01/10/2023", |
|                  |   "Recommend": "MayBe"           |
|                  | }                                |
+------------------+----------------------------------+
```

Finally, you can create a view that parses the objects in the `FILE_DATA` column into separate columns.
You can then query the view to analyze and work with the file contents.

```sqlexample
CREATE OR REPLACE VIEW prod_review_info_v
  AS
  WITH file_data
  AS (
      SELECT
        file_name
        , parse_json(file_data) AS file_data
      FROM prod_reviews
  )
  SELECT
      file_name
      , file_data:FirstName::varchar AS first_name
      , file_data:LastName::varchar AS last_name
      , file_data:"Middle Name"::varchar AS middle_name
      , file_data:Product::varchar AS product
      , file_data:"Purchase Date"::date AS purchase_date
      , file_data:Recommend::varchar AS recommended
      , build_scoped_file_url(@my_pdf_stage, file_name) AS scoped_review_url
  FROM file_data;

SELECT * FROM prod_review_info_v;

+------------------+------------+-----------+-------------+----------------+---------------+-------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| FILE_NAME        | FIRST_NAME | LAST_NAME | MIDDLE_NAME | PRODUCT        | PURCHASE_DATE | RECOMMENDED | SCOPED_REVIEW_URL                                                                                                                                                                                                                                                                                                                                                                                                              |
|------------------+------------+-----------+-------------+----------------+---------------+-------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| prod_review1.pdf | John       | Johnson   | Michael     | Tennis Shoes   | 2022-03-15    | Yes         | https://mydeployment.us-west-2.aws.privatelink.snowflakecomputing.com/api/files/01aefcdc-0000-6f92-0000-012900fdc73e/1275606224902/RZ4s%2bJLa6iHmLouHA79b94tg%2f3SDA%2bOQX01pAYo%2bl6gAxiLK8FGB%2bv8L2QSB51tWP%2fBemAbpFd%2btKfEgKibhCXN2QdMCNraOcC1uLdR7XV40JRIrB4gDYkpHxx3HpCSlKkqXeuBll%2fyZW9Dc6ZEtwF19GbnEBR9FwiUgyqWjqSf4KTmgWKv5gFCpxwqsQgofJs%2fqINOy%2bOaRPa%2b65gcnPpY2Dc1tGkJGC%2fT110Iw30cKuMGZ2HU%3d              |
| prod_review2.pdf | Emily      | Smith     | Ann         | Red Skateboard | 2023-01-10    | MayBe       | https://mydeployment.us-west-2.aws.privatelink.snowflakecomputing.com/api/files/01aefcdc-0000-6f92-0000-012900fdc73e/1275606224902/g3glgIbGik3VOmgcnltZxVNQed8%2fSBehlXbgdZBZqS1iAEsFPd8pkUNB1DSQEHoHfHcWLsaLblAdSpPIZm7wDwaHGvbeRbLit6nvE%2be2LHOsPR1UEJrNn83o%2fZyq4kVCIgKeSfMeGH2Gmrvi82JW%2fDOyZJITgCEZzpvWGC9Rmnr1A8vux47uZj9MYjdiN2Hho3uL9ExeFVo8FUtR%2fHkdCJKIzCRidD5oP55m9p2ml2yHOkDJW50%3d                            |
+------------------+------------+-----------+-------------+----------------+---------------+-------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: Bulk loading from a local file system
source: https://docs.snowflake.com/en/user-guide/data-load-local-file-system.md
section: User Guide
---

# Bulk loading from a local file system

This set of topics describes how to use the COPY command to bulk load data from a local file system into tables using an internal (i.e.
Snowflake-managed) stage. For instructions on loading data from a cloud storage location that you manage, refer to [Bulk loading from Amazon S3](data-load-s3.md), [Bulk loading from Google Cloud Storage](data-load-gcs.md), or [Bulk loading from Microsoft Azure](data-load-azure.md).

As illustrated in the diagram below, loading data from a local file system is performed in two, separate steps:

Step 1:
:   Upload (i.e. stage) one or more data files to a Snowflake stage (named internal stage or table/user stage) using the [PUT](../sql-reference/sql/put.md) command.

Step 2:
:   Use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load the contents of the staged file(s) into a Snowflake database table.

    Regardless of the stage you use, this step requires a running virtual warehouse that is also the current (i.e. in use) warehouse for the session. The warehouse provides the compute resources to
    perform the actual insertion of rows into the table.

> **Tip:**
>
> The instructions in this set of topics assume you have read [Preparing to load data](data-load-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data loading considerations](data-load-considerations.md) for best practices, tips, and other guidance.

**Next Topics:**

* **Configuration tasks (complete as needed):**

  + [Choosing an internal stage for local files](data-load-local-file-system-create-stage.md)
* **Data loading tasks (complete for each set of files you load):**

  + [Staging data files from a local file system](data-load-local-file-system-stage.md)
  + [Copy data from an internal stage](data-load-local-file-system-copy.md)

---
title: Bulk loading from Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-s3.md
section: User Guide
---

# Bulk loading from Amazon S3

If you already have an Amazon Web Services (AWS) account and use S3 buckets for storing and managing your data files, you can make use of your existing buckets and folder paths for bulk loading into
Snowflake. This set of topics describes how to use the COPY command to bulk load from an S3 bucket into tables.

As illustrated in the diagram below, loading data from an S3 bucket is performed in two steps:

Step 1:
:   Snowflake assumes the data files have already been staged in an S3 bucket. If they haven’t been staged yet, use the upload interfaces/utilities provided by AWS to stage the files.

Step 2:
:   Use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load the contents of the staged file(s) into a Snowflake database table. You can load directly from the bucket, but
    Snowflake recommends creating an external stage that references the bucket and using the external stage instead.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to perform the actual insertion of rows into the table.

> **Note:**
>
> Snowflake uses Amazon S3 gateway endpoints in each of its Amazon Virtual Private Clouds.
>
> As long as your Snowflake account is hosted on AWS, your network traffic does not traverse the public internet. This is true regardless
> of the region that your S3 bucket is in.

> **Tip:**
>
> The instructions in this set of topics assume you have read [Preparing to load data](data-load-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data loading considerations](data-load-considerations.md) for best practices, tips, and other guidance.

**Next Topics:**

* **Configuration tasks (complete as needed):**

  + [Allowing the Virtual Private Cloud IDs](data-load-s3-allow.md)
  + [Configuring secure access to Amazon S3](data-load-s3-config.md)
  + [AWS data file encryption](data-load-s3-encrypt.md)
  + [Create an S3 stage](data-load-s3-create-stage.md)
* **Data loading tasks (complete for each set of files you load):**

  + [Copying data from an S3 stage](data-load-s3-copy.md)

---
title: Bulk loading from Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-gcs.md
section: User Guide
---

# Bulk loading from Google Cloud Storage

If you already have a Google Cloud Storage account and use Cloud Storage buckets for storing and managing your data files, you can make use of your existing buckets and folder paths for bulk loading into Snowflake.

> **Note:**
>
> Snowflake supports Regional Storage and Multi-Regional Storage accounts only.

This set of topics describes how to use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load data from a Cloud Storage bucket into tables.

As illustrated in the diagram below, loading data from a Cloud Storage bucket is performed in two steps:

Step 1:
:   Snowflake assumes the data files have already been staged in a Cloud Storage bucket. If they haven’t been staged yet, use the upload interfaces/utilities provided by Google to stage the files.

Step 2:
:   Use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load the contents of the staged file(s) into a Snowflake database table. You can load directly from the bucket, but
    Snowflake recommends creating an external stage that references the bucket and using the external stage instead.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to perform the actual insertion of rows into the table.

> **Note:**
>
> As long as your Snowflake account is hosted on Google Cloud, your network traffic does not traverse the public internet.

> **Tip:**
>
> The instructions in this set of topics assume you have read [Preparing to load data](data-load-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data loading considerations](data-load-considerations.md) for best practices, tips, and other guidance.

**Next Topics:**

* **Configuration tasks (complete as needed):**

  + [Configure an integration for Google Cloud Storage](data-load-gcs-config.md)
  + [Google Cloud Storage data file encryption](data-load-gcs-encrypt.md)
* **Data loading tasks (complete for each set of files you load):**

  + [Copy data from a Google Cloud Storage stage](data-load-gcs-copy.md)
* **Troubleshooting:**

  + [Troubleshooting loads from Google Cloud Storage](data-load-gcs-ts.md)

---
title: Bulk loading from Microsoft Azure
source: https://docs.snowflake.com/en/user-guide/data-load-azure.md
section: User Guide
---

# Bulk loading from Microsoft Azure

If you already have a Microsoft Azure account and use Azure blob storage containers for storing and managing your data files, you can make
use of your existing containers and folder paths for bulk loading into Snowflake.

> **Note:**
>
> To harden your security posture, you can configure your bulk load to use private connectivity rather than the public Internet. For more
> information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](data-load-azure-private.md).

This set of topics describes how to use the COPY command to load data from an Azure container into tables.

Snowflake currently supports loading from blob storage only. Snowflake supports the following types of storage accounts:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v1
* General-purpose v2
* Microsoft Fabric OneLake

Snowflake *doesn’t* support Data Lake Storage Gen1.

> **Note:**
>
> * For Microsoft Fabric OneLake, Snowflake doesn’t support the following features:
>
>   + Automated Snowpipe
>   + Automatic refresh for external tables and directory tables
>   + Private connectivity
> * Loading from block, append, and page blobs is supported. Unloaded files are created as block blobs. For information about these blob
>   types, see the [Azure documentation on blob types](https://docs.microsoft.com/en-us/rest/api/storageservices/understanding-block-blobs--append-blobs--and-page-blobs).
> * If a hierarchical namespace is enabled on Data Lake Storage Gen2, Snowflake doesn’t support purging files with the COPY command. A
>   hierarchical namespace organizes data into directories and subdirectories. Azure only allows you to delete empty directories, which means
>   that you can’t delete directories recursively by using the PURGE option with the COPY command.

As illustrated in the following diagram, loading data from an Azure container is performed in two steps:

Step 1:
:   Snowflake assumes the data files have already been staged in an Azure container. If they haven’t been staged yet, use the upload interfaces/utilities provided by Microsoft to stage the files.

Step 2:
:   Use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load the contents of the staged file(s) into a Snowflake database table. You can load directly from the bucket, but
    Snowflake recommends creating an external stage that references the bucket and using the external stage instead.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to perform the actual insertion of rows into the table.

> **Note:**
>
> As long as your Snowflake account is hosted on Azure, your network traffic does not traverse the public internet.

> **Tip:**
>
> The instructions in this set of topics assume you have read [Preparing to load data](data-load-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data loading considerations](data-load-considerations.md) for best practices, tips, and other guidance.

**Next Topics:**

* **Configuration tasks (complete as needed):**

  + [Allow the VNet subnet IDs](data-load-azure-allow.md)
  + [Configure an Azure container for loading data](data-load-azure-config.md)
  + [Create an Azure stage](data-load-azure-create-stage.md)
* **Data loading tasks (complete for each set of files you load):**

  + [Copy data from an Azure stage](data-load-azure-copy.md)

---
title: Business Intelligence (BI)
source: https://docs.snowflake.com/en/user-guide/ecosystem-bi.md
section: User Guide
---

# Business Intelligence (BI)

Business intelligence (BI) tools enable analyzing, discovering, and reporting on data to help executives and managers make more informed
business decisions. A key component of any BI tool is the ability to deliver data visualization through dashboards, charts, and other
graphical output.

Business intelligence also sometimes overlaps with technologies such as [data integration/transformation](ecosystem-etl.md)
and [advanced analytics](ecosystem-analytics.md); however, we’ve chosen to list these technologies separately in their own
categories.

The following BI tools and technologies are known to provide native connectivity to Snowflake:

| Solution |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Adobe Campaign:** 20.1 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Additional resources:    + [Specific configurations by database type — Configure Access to Snowflake](http://docs.adobe.com/content/help/en/campaign-classic/using/getting-started/accessing-external-database/specific-configuration-database.html)     (Adobe Documentation: Help)   + [Big data management on Snowflake](http://docs.adobe.com/content/help/en/campaign-classic-learn/tutorials/administrating/fda/big-data-segmentation-on-snowflake.html)     (Adobe Documentation: Tutorials) |
|  |  | **Amplitude:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Amplitude & Snowflake](https://amplitude.com/partners/snowflake) (Amplitude website)   + [Data Warehouse Data Lake - Snowflake](https://amplitude.com/integrations/snowflake) (Amplitude website)   + [Amplitude Data Warehouse Demo](https://amplitude.wistia.com/medias/dn85ls9x43) (Amplitude videos)   + [Data > Warehouse-native Amplitude: Overview](https://amplitude.com/docs/data/warehouse-native/overview) (Amplitude Documentation)   + [Data > Warehouse-native Amplitude: Best Practices](https://amplitude.com/docs/data/warehouse-native/warehouse-native-amplitude-best-practices) (Amplitude Documentation) |
|  |  | **Astrato:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Create your Astrato account — free for 5 users](https://app.astrato.io/auth-login/sign-up) (Astrato website)   + [Connect to Snowflake](https://help.astrato.io/en/articles/5161726-connecting-to-snowflake) (Astrato Help Center) |
|  |  | **AtScale:** 7.4 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [Snowflake & AtScale, A Perfect Fit](https://www.atscale.com/blog/snowflake-and-atscale-a-perfect-fit) (AtScale Blog) |
|  |  | **AWS Quicksight:** No requirements  **Snowflake:** No requirements |  |
|  |  | **CARTO:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Spatial Extension for Snowflake](https://carto.com/snowflake/spatial-analytics) (CARTO website)   + [Data and Analysis > Analytics Toolbox for Snowflake](https://docs.carto.com/data-and-analysis/analytics-toolbox-for-snowflake) (CARTO Documentation) |
|  |  | **Chartio:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Connect and Explore Your Data in the Cloud with Snowflake and Chartio Analytics](https://chartio.com/product/data-sources/snowflake/)     (Chartio website) |
|  |  | **Domo:** No requirements  **Snowflake:** No requirements | Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md).   * Additional resources:    + [Snowflake Connector](https://knowledge.domo.com/Connect/Connecting_to_Data_with_Connectors/Configuring_Each_Connector/Database_Connectors/Snowflake_Connector)     (Domo Knowledge Base)   + [Simplify data management and get actionable intelligence](https://www.domo.com/partners/snowflake)   + Snowflake and Cloud Amplifier: [Customer Support Community](https://domo-support.domo.com/s/article/4402322966807?language=en_US) |
|  |  | **Fosfor**: no requirements  **Snowflake**: No requirements | Additional resources   * [Fosfor website](https://fosfor.com) |
|  |  | **Google Data Studio**: No requirements  **Snowflake**: No requirements | * Additional resources:    + [Snowflake Connector for Google Looker Studio](https://other-docs.snowflake.com/connectors/google-data-studio-connector.html) |
|  |  | **IBM Cognos Analytics:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Snowflake connections](https://www.ibm.com/docs/en/cognos-analytics/12.0.0?topic=details-snowflake-connections)     (Cognos Analytics Documentation)   + [How to set up Snowflake data source connection in Cognos Analytics](https://www.ibm.com/support/pages/how-set-snowflake-data-source-connection-cognos-analytics)     (IBM Support) |
|  |  | **Looker:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake](https://docs.looker.com/setup-and-management/database-config/snowflake) (Looker Documentation) |
|  |  | **MachEye:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Search-driven Augmented Analytics for Snowflake data](https://www.macheye.com/connect/snowflake/) (MachEye website) |
|  |  | **Metabase:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [metabase / modules / drivers / snowflake](https://github.com/metabase/metabase/tree/master/modules/drivers/snowflake) (Github) |
|  |  | **Microsoft Power BI Cloud Service:** [On-premises Data Gateway July Update](https://powerbi.microsoft.com/en-us/blog/power-bi-on-premises-data-gateway-july-update-is-now-available/) (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page |  |
|  |  | **Microsoft Power BI Desktop:**   * [Desktop July Update](https://powerbi.microsoft.com/en-us/blog/power-bi-desktop-july-feature-summary-2/) (or higher) * [Desktop September Update](https://powerbi.microsoft.com/en-us/blog/power-bi-desktop-september-feature-summary/) (or higher)   **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Additional resources:    + [Connect to Snowflake in Power BI Desktop](https://docs.microsoft.com/en-us/power-bi/desktop-connect-snowflake)     (Power BI Documentation) |
|  |  | **Microstrategy:** Secure Enterprise Platform 10.2 or 10.3 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Additional resources:    + [How to connect to Snowflake](https://www2.microstrategy.com/producthelp/Current/Gateway_Connections/WebHelp/Lang_1033/Content/snowflake.htm)     (Microstrategy Community)   + [How to connect to Snowflake in Secure Enterprise Platform 10.2](https://community.microstrategy.com/s/article/KB275528-How-to-connect-to-a-Snowflake-2-x-database-in)     (Microstrategy Community)   + [Configuring MicroStrategy to use Snowflake](https://community.snowflake.com/s/article/configuring-microstrategy-to-use-snowflake)     (Snowflake Community) |
|  |  | **Mode Analytics:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Explore & Visualize Your Snowflake data](https://about.modeanalytics.com/snowflake/) (Mode website) |
|  |  | **Oracle Analytics Cloud:** March 2019 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [How to Use the Oracle Analytics Cloud (OAC) 105.2 Snowflake Connector](https://www.performancearchitects.com/post/how-to-use-the-oracle-analytics-cloud-oac-105-2-snowflake-connector)     (Performance Architects website) |
|  |  | **Oracle Analytics Desktop:** November 2019 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [User’s Guide for Oracle Analytics Desktop > Connect to Snowflake Data Warehouse](https://docs.oracle.com/en/middleware/bi/analytics-desktop/bidvd/connect-snowflake-data-warehouse.html)     (Oracle Documentation)   + [Oracle Analytics Desktop Doesn’t Connect To Snowflake](https://blog.redpillanalytics.com/oracle-analytics-desktop-doesnt-connect-to-snowflake-aa82ef6783d5)     (Red Pill Analytics Blog) |
|  |  | **Pentaho Business Analytics:**   * Pentaho 8.3 (or higher): Snowflake plugin — download from the Pentaho Customer Portal (requires login) * Pentaho 8.2 (or lower): No Pentaho requirements, but some Snowflake requirements   **Snowflake:**   * Pentaho 8.3 (or higher): No requirements * Pentaho 8.2 (or lower):    + [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc)  or   + 3rd-party connector (PentahoSnowflakePlugin) — download from [GitHub](https://github.com/inquidia/PentahoSnowflakePlugin) | * Additional resources:    + [PDI and Snowflake](https://docs.pentaho.com/pdia-data-integration/advanced-topics-pentaho-data-integration-overview/pdi-and-snowflake-cp) (Pentaho Documentation)   + [Bulk load into Snowflake](https://docs.pentaho.com/pdia-data-integration/pdi-job-entries-reference-overview/bulk-load-into-snowflake) (Pentaho Documentation)   + [PentahoSnowflakePlugin Readme](https://github.com/inquidia/PentahoSnowflakePlugin/blob/master/README.md) (GitHub) |
|  |  | **Pyramid:** 2018 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [Snowflake + Pyramid](https://www.pyramidanalytics.com/landers/snowflake) (Pyramid Analytics website) |
|  |  | **Qlik Sense:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Start a Trial](https://www.qlik.com/us/trial/qlik-sense-business)   + [Qlik Sense user guide for Snowflake](https://res.cloudinary.com/talend/image/upload/v1711389468/qlik/docs/resource-library/whitepapers/resource-wp-qlik-sense-best-practices-guide-for-snowflake-en_uraivy.pdf)   + [Qlik Communicty Resources](https://community.qlik.com/) |
|  |  | **Sigma Computing:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). |
|  |  | **SAP BusinessObjects:** 4.2 SP08 (or higher)  **Snowflake:**   * [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page  or * [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Additional resources:    + [Snowflake for SAP BusinessObjects 4.2 SP08](https://blogs.sap.com/2020/03/12/snowflake-for-sap-businessobjects-4.2-sp08/)     (SAP Community Blogs) |
|  |  | **Sisense:** No requirements  **Snowflake:** No requirements | * Sisense for Cloud Data Teams (formerly Periscope Data) available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Sisense Connector for Snowflake](https://www.sisense.com/data-connectors/snowflake/) (Sisense website)   + [Connecting to Snowflake](https://documentation.sisense.com/latest/managing-data/connectors/snowflake-online.htm)     (Sisense Documentation) |
|  |  | **Tableau:** Desktop/Server/Online 9.3 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Introducing the Snowflake connector in Tableau 9.3](https://www.tableau.com/about/blog/2016/3/introducing-snowflake-connector-tableau-93-52456)     (Tableau Blog)   + [Supported Connectors: Snowflake](https://onlinehelp.tableau.com/current/pro/desktop/en-us/examples_snowflake.htm) (Tableau Help)   + [Best Practices for Using Tableau with Snowflake](https://resources.snowflake.com/ebooks/best-practices-for-using-tableau-with-snowflake)     (Snowflake Resource Library) |
|  |  | **Tableau CRM:** No requirements  **Snowflake:** No requirements | * Formerly Salesforce Einstein Analytics * Additional resources:    + [Snowflake Computing Connection](https://help.salesforce.com/articleView?id=bi_integrate_connectors_snowflake_settings.htm&type=0)     (Salesforce Community Help) |
|  |  | **ThoughtSpot:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Search and AI-Driven Analytics for Snowflake](https://www.thoughtspot.com/sites/default/files/pdf/ThoughtSpot-Snowflake-Data-Sheet.pdf)     (ThoughtSpot website) |
|  |  | **TIBCO Spotfire:** 10.3 LTS (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Additional resources:    + [Connector for Snowflake](https://docs.tibco.com/pub/spotfire/general/drivers/data_sources/snowflake.htm)     (TIBCO Documentation)   + [Accessing Data from Snowflake](https://docs.tibco.com/pub/sfire-analyst/10.4.0/doc/html/en-US/TIB_sfire-analyst_UsersGuide/connectors/snowflake/snowflake_accessing_data_from_snowflake.htm)     (TIBCO Documentation) |

---
title: C5 (Cloud Computing Compliance Controls Catalog)
source: https://docs.snowflake.com/en/user-guide/cert-c5.md
section: User Guide
---

# C5 (Cloud Computing Compliance Controls Catalog)

This topic describes how Snowflake supports customers with C5 compliance requirements.

## Understanding C5 compliance requirements

The Cloud Computing Compliance Controls Catalog (C5) is an audited standard establishing mandatory baselines for cloud security. The
framework was created by the German Federal Office for Information Security (Bundesamt für Sicherheit in der Informationstechnik, or BSI).
C5 was initially created for government agencies and organizations that work with the government to ensure that security baselines are met
by their cloud service providers (CSPs). The private sector has also adopted this framework for evaluation of the security of their CSPs.
The framework is based on ISO 27001, CSA, and BSI’s IT-Grundshutz catalogs. The certification can be obtained for either the Basic
requirements or Basic + Additional Criteria. Snowflake’s C5 scope currently includes the Basic requirements.

---
title: Canceling Statements
source: https://docs.snowflake.com/en/user-guide/querying-cancel-statements.md
section: User Guide
---

# Canceling Statements

The recommended way to cancel a statement is to use the interface of the application
in which the query is running (e.g. the Worksheet in the Snowflake web interface)
or the cancellation API provided by the Snowflake ODBC or JDBC driver. However,
in some cases, it is necessary to cancel a query using SQL.

Snowflake provides the following functions to support using SQL to cancel
running/active statements:

* [SYSTEM$CANCEL_ALL_QUERIES](../sql-reference/functions/system_cancel_all_queries.md)
* [SYSTEM$CANCEL_QUERY](../sql-reference/functions/system_cancel_query.md)

## Example

The following Java sample code uses [SYSTEM$CANCEL_ALL_QUERIES](../sql-reference/functions/system_cancel_all_queries.md)
and other Snowflake functions to cancel a running statement in the current session
after 5 seconds:

1. The sample code first issues a SQL command for [CURRENT_SESSION](../sql-reference/functions/current_session.md)
   to obtain the session identifier.
2. It then creates a task to be executed 5 seconds later. This task uses the
   session identifier as a parameter to SYSTEM$CANCEL_ALL_QUERIES.
3. Then a long running statement is executed using the [GENERATOR](../sql-reference/functions/generator.md)
   table function to generate rows for 120 seconds.

```java
public void testCancelQuery() throws IOException, SQLException
{
  Statement         statement         = null;
  ResultSet         resultSet         = null;
  ResultSetMetaData resultSetMetaData = null;
  final Connection  connection        = getConnection(true);
  try
  {
    // Get the current session identifier
    Statement getSessionIdStmt = connection.createStatement();
    resultSet                  = getSessionIdStmt.executeQuery("SELECT current_session()");
    resultSetMetaData          = resultSet.getMetaData();
    assertTrue(resultSet.next());
    final int sessionId = resultSet.getInt(1);

    // Use Timer to cancel all queries of session in 5 seconds
    Timer timer = new Timer();
    timer.schedule( new TimerTask()
    {
      @Override
      public void run()
      {
        try
        {
          // Cancel all queries on session
          PreparedStatement cancelAll;
          cancelAll = connection.prepareStatement(
                                    "call system$cancel_all_queries(?)");

          // bind the session identifier as first argument
          cancelAll.setInt(1, sessionId);
          cancelAll.executeQuery();
        }
        catch (SQLException ex)
        {
          logger.log(Level.SEVERE, "Cancel failed with exception {}", ex);
        }
      }
    }, 5000);

    // Use the internal row generator to execute a query for 120 seconds
    statement = connection.createStatement();
    resultSet = statement.executeQuery(
                   "SELECT count(*) FROM TABLE(generator(timeLimit => 120))");
    resultSetMetaData = resultSet.getMetaData();
    statement.close();
  }
  catch (SQLException ex)
  {
    // assert the sqlstate is what we expect (QUERY CANCELLED)
    assertEquals("sqlstate mismatch",
                 SqlState.QUERY_CANCELED, ex.getSQLState());
  }
  catch (Throwable ex)
  {
    logger.log(Level.SEVERE, "Test failed with exception: ", ex);
  }
  finally
  {
    if (resultSet != null)
      resultSet.close();
    if (statement != null)
      statement.close();
    // close connection
    if (connection != null)
      connection.close();
  }
}
```

---
title: CE+ (Cyber Essentials Plus)
source: https://docs.snowflake.com/en/user-guide/cert-cyber-essentials-plus.md
section: User Guide
---

# CE+ (Cyber Essentials Plus)

This topic describes how Snowflake supports customers with CE+ compliance requirements.

## Understanding CE+ compliance requirements

Cyber Essentials Plus (CE+) is a United Kingdom government supported framework that helps protect organizations, regardless of size,
against a wide range of the most common cyber attacks. CE+ certification is required for organizations that plan to bid for central
government contracts which involve handling sensitive and personal information or the provision of certain technical products and services.
CE+ certification is supported by industry, including the Federation of Small Businesses, the Confederation of British Industry and a
number of insurance organizations that offer incentives for CSPs holding this certification. CE+ provides the necessary technical controls
and a related assurance framework conducted via an annual external assessment conducted by an accredited assessor.

Achievement of the CE+ certification demonstrates Snowflake’s commitment to mitigate the risk from common Internet-based threats and cyber
security best practices.

For more information, please visit the [Cyber Essentials website](https://www.ncsc.gov.uk/cyberessentials/overview).

You can view the current status of Snowflake’s (Snowflake, Inc.) CE certifications on the
[IASME website](https://iasme.co.uk/certified-organisations/).

---
title: Channels and exactly-once delivery
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-channels.md
section: User Guide
---

# Channels and exactly-once delivery

This topic explains how Snowpipe Streaming ingests data through channels with ordering guarantees and how offset tokens enable exactly-once delivery.

## Streaming ingestion fundamentals

Snowpipe Streaming is built around several core streaming ingestion principles:

* **Continuous ingestion**: Data flows into Snowflake as it is produced, rather than being collected into batches and loaded periodically. Applications submit rows continuously through long-lived connections, and Snowflake commits the data automatically.
* **Exactly-once delivery**: Each record is ingested exactly once, even in the event of client failures or network interruptions. Snowpipe Streaming achieves this through offset token tracking, which lets clients resume from the last committed position without duplicating data.
* **Ordered ingestion**: Rows are committed in the order they are submitted within a channel. This preserves the sequence of events from the source system, which is critical for time-series data, CDC pipelines, and audit trails.
* **Low latency**: Data becomes available for query in as low as 5 seconds after ingestion. This enables near-real-time analytics without the delays of traditional batch loading.
* **Serverless**: Snowflake manages all compute resources for ingestion. Resources scale automatically based on throughput, with no infrastructure for the client to provision or manage.

## How data flows

A client application connects to Snowflake using a Snowpipe Streaming SDK (Java or Python) or the REST API. The client opens one or more channels against a pipe, then submits rows through those channels. Snowflake buffers and commits the data to the target table, making it available for query within seconds.

The end-to-end flow:

1. **Client application** submits rows using the SDK (`appendRows`) or the REST API (`Append Rows` endpoint).
2. **Channel** receives the rows in order and associates each batch with an offset token for progress tracking.
3. **Pipe** processes the data server-side: validates the schema, applies any configured transformations or pre-clustering, then commits to the target table.
4. **Target table** receives the committed data, which becomes immediately queryable.

## Channels

A channel is a logical, named streaming connection to Snowflake for loading data into a table. Channels provide two guarantees:

* **Ordered ingestion**: The ordering of rows and their corresponding offset tokens is preserved within a channel.
* **Exactly-once delivery**: Offset tokens enable clients to track committed progress and replay from the last committed position on recovery.

Ordering is preserved within a channel but not across channels that point to the same table.

Channels are opened against a pipe. The client SDK can open multiple channels to multiple pipes; however, the SDK can’t open channels across accounts. Channels are meant to be long lived when a client is actively inserting data and should be reused across client process restarts because offset token information is retained.

You can permanently drop channels by using the `DropChannelRequest` API when you no longer need the channel and the associated offset metadata. You can drop a channel in two ways:

* Dropping a channel at closing. Data inside the channel is automatically flushed before the channel is dropped.
* Dropping a channel blindly. We don’t recommend this approach because it discards any pending data.

You can run the SHOW CHANNELS command to list the channels for which you have access privileges. For more information, see [SHOW CHANNELS](../../sql-reference/sql/show-channels.md).

> **Note:**
>
> Inactive channels, along with their offset tokens, are deleted automatically after 30 days of inactivity.

## Offset tokens and exactly-once delivery

> **Tip:**
>
> **How exactly-once works in Snowpipe Streaming**: Your application submits rows with an offset token (for example, a Kafka partition offset). Snowflake persists the token when the data is committed. On recovery, your application calls `getLatestCommittedOffsetToken` to find where it left off, then replays from that position. No duplicate data is ingested, and no data is lost.

An *offset token* is a string that a client includes in row-submission requests to track ingestion progress on a per-channel basis. The specific methods used are `appendRow` or `appendRows` for the SDK and the `Append Rows` endpoint for the REST API.

The token is initialized to NULL on channel creation and is updated when the rows with a provided offset token are committed to Snowflake. Clients can periodically call `getLatestCommittedOffsetToken` to get the latest committed offset token for a channel and use that to reason about ingestion progress.

When a client re-opens a channel, the latest persisted offset token is returned. The client can reset its position in the data source by using the token to avoid sending the same data twice. When a channel re-open event occurs, any uncommitted data buffered in Snowflake is discarded to avoid committing it.

You can use the latest committed offset token to perform the following:

> * Track ingestion progress
> * Check whether a specific offset has been committed by comparing it with the latest committed offset token
> * Advance the source offset and purge the data that has already been committed
> * Enable de-duplication and ensure exactly-once delivery of data

**Example: Kafka connector crash recovery**

The Kafka connector reads an offset token from a topic such as `<partition>:<offset>`. Consider the following scenario:

1. The Kafka connector comes online and opens a channel corresponding to `Partition 1` in Kafka topic `T` with the channel name `T:P1`.
2. The connector begins reading records from the Kafka partition.
3. The connector calls the API, making an `appendRows` method request, with the offset associated with the record as the offset token.

   For example, the offset token could be `10`, referring to the tenth record in the Kafka partition.
4. The connector periodically makes `getLatestCommittedOffsetToken` method requests to determine the ingest progress.

If the Kafka connector crashes, the following procedure resumes reading records from the correct offset:

1. The Kafka connector comes back online and re-opens the channel, using the same name as earlier.
2. The connector calls `getLatestCommittedOffsetToken` to get the latest committed offset for the partition.

   For example, assume the latest persisted offset token is `20`.
3. The connector uses the Kafka read APIs to reset a cursor corresponding to the offset plus 1 (`21` in this example).
4. The connector resumes reading records. No duplicate data is retrieved after the read cursor is repositioned successfully.

**Example: Log file ingestion with crash recovery**

An application reads logs from a directory and uses the Snowpipe Streaming SDK to export those logs to Snowflake. The application does the following:

1. Lists files in the log directory.

   Assume that the logging framework generates log files that can be ordered lexicographically and that new log files are positioned at the end of this ordering.
2. Reads a log file line by line and calls the API, making `appendRows` method requests with an offset token corresponding to the log file name and the line count or byte position.

   For example, an offset token could be `messages_1.log:20`, where `messages_1.log` is the name of the log file, and `20` is the line number.

If the application crashes or needs to be restarted, it calls `getLatestCommittedOffsetToken` to retrieve the offset token that corresponds to the last exported log file and line. Continuing with the example, this could be `messages_1.log:20`. The application then opens `messages_1.log` and seeks line `21` to prevent the same log line from being ingested twice.

> **Note:**
>
> The offset token information can be lost. The offset token is linked to a channel object, and a channel is automatically cleared if no new ingestion is performed using the channel for a period of 30 days. To prevent the loss of the offset token, consider maintaining a separate offset and resetting the channel’s offset token if required.

## Roles of `offsetToken` and `continuationToken`

Both `offsetToken` and `continuationToken` are used to ensure exactly-once data delivery, but they serve different purposes and are managed by different subsystems. The primary distinction is who controls the token’s value and the scope of its use.

* `continuationToken` (only used by direct REST API users):

  This token is managed by Snowflake and is essential for maintaining the state of a single, continuous streaming session. When a client sends data using the `Append Rows` API, Snowflake returns a `continuationToken`. The client must pass back this token in its next AppendRows request to ensure the data is received by Snowflake in the correct order and without gaps. Snowflake uses the token to detect and prevent duplicate data or missing data in the event of an SDK retry.
* `offsetToken`:

  This token is a user-defined identifier that enables exactly-once delivery from an external source. Snowflake stores this value but doesn’t use it for its own internal operations or to prevent re-ingestion. It is the responsibility of the external system, such as a Kafka connector, to read the offsetToken from Snowflake and use it to track its own ingestion progress and avoid sending duplicate data if the external stream needs to be replayed.

---
title: Checking your REST catalog configuration
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-check-config.md
section: User Guide
---

# Checking your REST catalog configuration

You can use the following scenarios to check whether you’ve correctly configured authorization and access control with your Iceberg
REST catalog so that Snowflake can interact with your catalog server.

* Check a configuration for OAuth
* Check a configuration for a bearer token
* Check a configuration for SigV4

## Use SYSTEM$VERIFY_CATALOG_INTEGRATION

You can use the [SYSTEM$VERIFY_CATALOG_INTEGRATION](../sql-reference/functions/system_verify_catalog_integration.md) function to check your catalog integration configuration.

The following example demonstrates how the system function catches and reports issues with an improperly configured catalog integration.

The following example statement creates a REST catalog integration
using an invalid OAuth client secret (this runs without error):

```sqlexample
CREATE CATALOG INTEGRATION my_rest_cat_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'default'
  REST_CONFIG = (
    CATALOG_URI = 'https://abc123.us-west-2.aws.myapi.com/polaris/api/catalog'
    CATALOG_NAME = 'my_catalog_name'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = '123AbC ...'
    OAUTH_CLIENT_SECRET = '1365910abIncorrectSecret ...'
    OAUTH_ALLOWED_SCOPES = ('all-apis', 'sql')
  )
  ENABLED = TRUE;
```

Use the system function to verify the catalog integration, expecting failure:

```sqlexample
SELECT SYSTEM$VERIFY_CATALOG_INTEGRATION('my_rest_cat_int');
```

Output:

```output
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|                                                                                                              SYSTEM$VERIFY_CATALOG_INTEGRATION('MY_REST_CAT_INT')                                                                                                               |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| {                                                                                                                                                                                                                                                                               |
|  "success" : false,                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                    |
|   "errorCode" : "004155",                                                                                                                                                                                                                                                       |
|   "errorMessage" : "SQL Execution Error: Failed to perform OAuth client credential flow for the REST Catalog integration MY_REST_CAT_INT due to error: SQL execution error: OAuth2 Access token request failed with error 'unauthorized_client:The client is not authorized'.." |
| }                                                                                                                                                                                                                                                                               |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

## Check a configuration for OAuth

Follow these steps to check your configuration for OAuth with your remote REST catalog.

### Step 1: Retrieve an access token

Use a `curl` command to retrieve an access token from your catalog. The following example
requests an access token from Snowflake Open Catalog:

```bash
curl -X POST https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens \
    -H "Accepts: application/json" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    --data-urlencode "grant_type=client_credentials" \
    --data-urlencode "scope=PRINCIPAL_ROLE:ALL" \
    --data-urlencode "client_id=<my_client_id>" \
    --data-urlencode "client_secret=<my_client_secret>" | jq
```

Where:

* `https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens` is the endpoint for retrieving an OAuth token
  ([getToken](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L132)).
* `scope` is the same as the value that you specify for `OAUTH_ALLOWED_SCOPES` parameter when you create a catalog integration.
  For multiple scopes, use a space as a separator.
* `my_client_id` is the same client ID that you specify for the `OAUTH_CLIENT_ID` parameter when you create a catalog integration.
* `my_client_secret` is the same client secret that you specify for the `OAUTH_CLIENT_SECRET` parameter when you create a catalog integration.

Example return value:

```output
{
  "access_token": "xxxxxxxxxxxxxxxx",
  "token_type": "bearer",
  "issued_token_type": "urn:ietf:params:oauth:token-type:access_token",
  "expires_in": 3600
}
```

### Step 2: Verify the access token permissions

Using the access token that you retrieved in the previous step,
verify that you have permission to access your catalog server.

You can use a `curl` command to list the configuration settings for your catalog:

```bash
curl -X GET "https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/config?warehouse=<warehouse>" \
    -H "Accepts: application/json" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    -H "Authorization: Bearer ${ACCESS_TOKEN}" | jq
```

Where:

* `?warehouse=warehouse` optionally specifies the warehouse name to request from your catalog (if supported). For Snowflake Open Catalog, the
  warehouse name is your catalog name.
* `ACCESS_TOKEN` is a variable that contains the `access_token` that you retrieved in the previous step.

Example return value:

```output
{
  "defaults": {
    "default-base-location": "s3://my-bucket/polaris/"
  },
  "overrides": {
    "prefix": "my-catalog"
  }
}
```

### Step 3: Load a table from the catalog

You can also make a GET request to load a table. Snowflake uses the
[loadTable](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L616)
operation to load table data from your REST catalog.

```bash
curl -X GET "https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/<prefix>/namespaces/<namespace>/tables/<table>" \
    -H "Accepts: application/json" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    -H "Authorization: Bearer ${ACCESS_TOKEN}" | jq
```

Where:

* `prefix` optionally specifies the prefix obtained from the previous `getConfig` response.
* `namespace` is the namespace of the table you want to retrieve. If the namespace is nested, use the `%1F` separator;
  for example, `parentNamespace%1FchildNamespace`.
* `table` is the table name.

## Check a configuration for a bearer token

Follow these steps to check your configuration with your remote REST catalog for using a bearer token.

### Step 1: Verify the access token permissions

Use a `curl` command to verify that you have permission to access your catalog server:

```bash
curl -X GET "https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/config?warehouse=<warehouse>" \
    -H "Accepts: application/json" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    -H "Authorization: Bearer ${BEARER_TOKEN}" | jq
```

Where:

* `https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens` is the endpoint for retrieving an OAuth token
  ([getToken](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L132)).
* `?warehouse=warehouse` optionally specifies the warehouse name to request from your catalog (if supported).
* `BEARER_TOKEN` is a variable that contains the `access_token` that you retrieved in the previous step.

Example return value:

```output
{
  "defaults": {
    "default-base-location": "s3://my-bucket/polaris"
  },
  "overrides": {
    "prefix": "my-catalog"
  }
}
```

### Step 2: Load a table from the catalog

You can also make a GET request to load a table. Snowflake uses the
[loadTable](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L616)
operation to load table data from your REST catalog.

```bash
curl -X GET "https://xx123xx.us-west-2.aws.snowflakecomputing.com/polaris/api/catalog/v1/<prefix>/namespaces/<namespace>/tables/<table>" \
    -H "Accepts: application/json" \
    -H "Content-Type: application/x-www-form-urlencoded" \
    -H "Authorization: Bearer ${BEARER_TOKEN}" | jq
```

Where:

* `prefix` optionally specifies the prefix obtained from the previous `getConfig` response.
* `namespace` is the namespace of the table you want to retrieve. If the namespace is nested, use the `%1F` separator;
  for example, `parentNamespace%1FchildNamespace`.
* `table` is the table name.

## Check a configuration for SigV4

Follow these steps to check your configuration for SigV4 with AWS.

### Step 1: Add your user to the IAM role trust relationship

When you create a REST catalog integration for SigV4, Snowflake provisions an AWS IAM user for your Snowflake account.
You [add that Snowflake IAM user to the trust relationship](tables-iceberg-configure-catalog-integration-rest-api-gateway.md) for
an [IAM role](tables-iceberg-configure-catalog-integration-rest-api-gateway.md) with permission to access your API Gateway resources.

To test your configuration, *you* can assume the role as a user in your AWS account after you add your AWS
user to the role’s trust policy document. To retrieve your current IAM user ARN, use the
[sts get-caller-identity](https://awscli.amazonaws.com/v2/documentation/api/latest/reference/sts/get-caller-identity.html) command
for the [AWS Command Line Interface (CLI)](https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-welcome.html) :

```bash
aws sts get-caller-identity
```

Example output:

```output
{
  "UserId": "ABCDEFG1XXXXXXXXXXX",
  "Account": "123456789XXX",
  "Arn": "arn:aws:iam::123456789XXX:user/managed/my_user"
}
```

The updated trust policy document should include both the Snowflake user ARN and your user ARN as follows:

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "",
      "Effect": "Allow",
      "Principal": {
        "AWS": [
          "<snowflake_iam_user_arn>",
          "<my_iam_user_arn>"
        ]
      },
      "Action": "sts:AssumeRole",
      "Condition": {
        "StringEquals": {
          "sts:ExternalId": "my_external_id"
        }
      }
    }
  ]
}
```

For full instructions, see [Update a role trust policy](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_update-role-trust-policy.html)
in the AWS IAM documentation.

### Step 2: Assume your IAM role to get temporary credentials

To get temporary security credentials for AWS, use the
[sts assume-role](https://awscli.amazonaws.com/v2/documentation/api/latest/reference/sts/assume-role.html) command for the AWS CLI.

```bash
aws sts assume-role \
  --role-arn <my_role_arn> \
  --role-session-name <session_name>
```

Where:

* `my_role_arn` is the Amazon Resource Name (ARN) of the IAM role that you’ve configured for Snowflake.
* `session_name` is a string identifier of your choice for the assumed role session; for example, `my_rest_session`.

Example output:

```output
{
  "Credentials": {
      "AccessKeyId": "XXXXXXXXXXXXXXXXXXXXX",
      "SecretAccessKey": "XXXXXXXXXXXXXXXXXXXXX",
      "SessionToken": "XXXXXXXXXXXXXXXXXXXXX",
      "Expiration": "2024-10-09T08:13:15+00:00"
  },
  "AssumedRoleUser": {
      "AssumedRoleId": "{AccessKeyId}:my_rest_catalog_session",
      "Arn": "arn:aws:sts::123456789XXX:assumed-role/my_catalog_role/my_rest_catalog_session"
  }
}
```

> **Note:**
>
> If the `assume-role` command fails, it means that your current AWS user isn’t included in the role’s trust policy as
> an allowed principal.
>
> Similarly, if the Snowflake IAM user ARN isn’t included in your trust policy, Snowflake won’t
> be able to connect to your API Gateway resources.
> For more information, see [Configure the trust relationship in IAM](tables-iceberg-configure-catalog-integration-rest-api-gateway.md).

### Step 3: Verify that your IAM role has the right permissions

Using the temporary credentials that you retrieved in the previous step,
verify that your IAM role has permission to invoke your API Gateway APIs.

You can use a `curl` command to list the configuration settings for your catalog:

```bash
curl -v -X GET  "https://123xxxxxxx.execute-api.us-west-2.amazonaws.com/test_v2/v1/config?warehouse=<warehouse>" \
  --user "$AWS_ACCESS_KEY_ID":"$AWS_SECRET_ACCESS_KEY" \
  --aws-sigv4 "aws:amz:us-west-2:execute-api" \
  -H "x-amz-security-token: $AWS_SESSION_TOKEN"
```

Where:

* `123xxxxxxx.execute-api.us-west-2.amazonaws.com` is your API Gateway hostname.
* `test_v2` is the name of the stage that your API is deployed to.
* `v1/config` specifies the [getConfig](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L65) operation from the Iceberg catalog OpenAPI definition.
* `?warehouse=warehouse` optionally specifies the warehouse name to request from your catalog (if supported).
* `$AWS_ACCESS_KEY_ID` is a variable that contains the `AccessKeyId` that you retrieved using the `sts assume-role` command.
* `$AWS_SECRET_ACCESS_KEY` is a variable that contains the `SecretAccessKey` that you retrieved using the `sts assume-role` command.
* `aws:amz:us-west-2:execute-api` is the signing name of the SigV4 protocol. For AWS Glue, use `aws:amz:us-west-2:glue` instead.
* `$AWS_SESSION_TOKEN` is a variable that contains the `SessionToken` that you retrieved using the `sts assume-role` command.

Example return value:

```output
{
  "defaults": {},
  "overrides": {
    "prefix": "my-catalog"
  }
}
```

You can also make a GET request to load a table. Snowflake uses the
[loadTable](https://github.com/apache/iceberg/blob/apache-iceberg-1.6.1/open-api/rest-catalog-open-api.yaml#L616)
operation to load table data from your REST catalog.

```bash
curl -v -X GET "https://123xxxxxxx.execute-api.us-west-2.amazonaws.com/test_v2/v1/<prefix>/namespaces/<namespace>/tables/<table>" \
    --user "$AWS_ACCESS_KEY_ID":"$AWS_SECRET_ACCESS_KEY" \
    --aws-sigv4 "aws:amz:us-west-2:execute-api" \
    -H "x-amz-security-token: $AWS_SESSION_TOKEN"
```

Where:

* `prefix` optionally specifies the prefix obtained from the previous `getConfig` response.
* `namespace` is the namespace of the table you want to retrieve. If the namespace is nested, use the `%1F` separator;
  for example, `parentNamespace%1FchildNamespace`.
* `table` is the table name.

**Private API**

For a private API, you can specify your VPC endpoint and private Amazon API Gateway hostname in the same `curl` commands.

For example:

```bash
curl -v -X GET  "https://vpce-xxxxxxxxxxxxxxxxxxxxxxxxxx.execute-api.us-west-2.vpce.amazonaws.com/test_v2/v1/config?warehouse=<warehouse>" \
  --user "$AWS_ACCESS_KEY_ID":"$AWS_SECRET_ACCESS_KEY" \
  --aws-sigv4 "aws:amz:us-west-2:execute-api" \
  -H "x-amz-security-token: $AWS_SESSION_TOKEN"
  -H "Host: abc1defgh2.execute-api.us-west-2.amazonaws.com"
```

Where:

* `https://vpce-xxxxxxxxxxxxxxxxxxxxxxxxxx.execute-api.us-west-2.vpce.amazonaws.com/...` is the hostname of your VPC endpoint.
* `abc1defgh2.execute-api.us-west-2.amazonaws.com` is the hostname of your private API in Amazon API Gateway.

---
title: Choosing an internal stage for local files
source: https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage.md
section: User Guide
---

# Choosing an internal stage for local files

A stage specifies where data files are stored (that is, “staged”) so that the data in the files can be loaded into a table.

## Types of internal stages

Snowflake supports the following types of internal stages:

> * User
> * Table
> * Named

By default, each user and table in Snowflake is automatically allocated an internal stage for staging data files to be loaded. In addition, you can create named internal stages.

File staging information is required during both steps in the data loading process:

1. You must specify an internal stage in the [PUT](../sql-reference/sql/put.md) command when uploading files to Snowflake.
2. You must specify the same stage in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command when loading data into a table from the staged files.

Consider the best type of stage for specific data files. Each option provides benefits and potential drawbacks.

### User stages

Each user has a Snowflake stage allocated to them by default for storing files. This stage is a convenient option if your files will only be accessed by a single user, but need to be copied into multiple tables.

User stages have the following characteristics and limitations:

* User stages are referenced using `@~`; e.g. use `LIST @~` to list the files in a user stage.
* Unlike named stages, user stages cannot be altered or dropped.
* User stages do not support setting file format options. Instead, you must specify file format and copy options as part of the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command.

This option is not appropriate if:

* Multiple users require access to the files.
* The current user does not have INSERT privileges on the tables the data will be loaded into.

### Table stages

> **Note:**
>
> Apache Iceberg™ tables in Snowflake don’t support table stages.

By default, each table has a Snowflake stage allocated to it for storing files. This stage is called a *table* stage.

You might use a table stage if you only need to copy files into a single table, but want
to make the files accessible to multiple users.

Table stages have the following characteristics and limitations:

* A table stage has the same name as the table. For example, a table named `mytable` has a stage referenced as `@%mytable`.
* A table stage is an implicit stage tied to a table object. It’s not a separate database object. As a result,
  a table stage has no grantable privileges of its own. A table stage is also not appropriate if you need to copy file data into multiple tables.
* To stage files on a table stage, list the files, query the files, or drop them,
  you must be the table owner (have the role with the OWNERSHIP privilege on the table).
* Unlike a named stage, you can’t alter or drop a table stage.
* Table stages don’t support transforming data while loading it (using a query as the source for the COPY command).

### Named stages

Named stages are database objects that provide the greatest degree of flexibility for data loading:

* Users with the appropriate privileges on the stage can load data into any table.
* Because the stage is a database object, the security/access rules that apply to all objects apply. The privileges to use a stage can be
  granted or revoked from roles. In addition, ownership of the stage can be transferred to another role.

If you plan to stage data files that will be loaded only by you, or will be loaded only into a single table, then you may prefer to simply
use either your user stage or the stage for the table into which you will be loading data.

Named stages are optional but recommended when you plan regular data loads that could involve multiple users and/or tables. For
instructions on creating a named stage, see Creating a Named Stage below.

## Creating a named stage

You can create a named internal stage using SQL or the web interface.

> **Note:**
>
> To create a stage, you must use a role that is granted or inherits the necessary privileges.
> For more information, see [Access control requirements](../sql-reference/sql/create-stage.md) for [CREATE STAGE](../sql-reference/sql/create-stage.md).

### Create a named stage using SQL

Use the [CREATE STAGE](../sql-reference/sql/create-stage.md) command to create a named stage using SQL.

The following example creates an internal stage that uses server-side encryption:

> ```sqlexample
> CREATE STAGE my_int_stage
>   ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
> ```

### Create a named stage using Python

Use the [StageCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.stage.StageCollection)
method of the [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-overview.md) to create a named stage.
For more information, see [Creating a stage](../developer-guide/snowflake-python-api/snowflake-python-managing-data-loading.md).

The following example creates an internal stage that uses server-side encryption:

```python
from snowflake.core.stage import Stage, StageEncryption

my_stage = Stage(
  name="my_int_stage",
  encryption=StageEncryption(type="SNOWFLAKE_SSE")
)
root.databases["<database>"].schemas["<schema>"].stages.create(my_stage)
```

### Create a named stage using Snowsight

To use Snowsight to create a named internal stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Stage » Snowflake Managed.
3. In the Create Stage dialog, enter a Stage Name.
4. Select the database and schema where you want to create the stage.
5. Optionally deselect Directory table. Directory tables let you see files on the stage, but require a warehouse and thus incur a cost.
   You can choose to deselect this option for now and enable a directory table later.
6. Select the type of Encryption supported for all files on your stage. For details, see [encryption for internal stages](../sql-reference/sql/create-stage.md). You can’t change the encryption type after you create the stage.

   > > **Note:**
   > >
   > > To enable data access, use server-side encryption. Otherwise, staged files are client-side
   > > encrypted by default and unreadable when downloaded. For more information, see [Server-side encryption for unstructured data access](unstructured-intro.md).
7. Complete the fields to describe your stage. For more information, see [CREATE STAGE](../sql-reference/sql/create-stage.md).
8. Select Create.

**Next:** [Staging data files from a local file system](data-load-local-file-system-stage.md)

---
title: CI/CD integrations on dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-ci-cd.md
section: User Guide
---

# CI/CD integrations on dbt Projects on Snowflake

dbt project objects support using Snowflake CLI commands to integrate deployment and execution into your CI/CD workflows. For a detailed tutorial, see [Tutorial: Set up CI/CD integrations on dbt Projects on Snowflake](../tutorials/dbt-projects-on-snowflake-ci-cd-tutorial.md).

This topic explains how to use GitHub Actions to automatically test and deploy your dbt Projects on Snowflake whenever you open a pull request or merge to
main.

Continuous Integration (CI) runs your dbt project against a dev schema on each pull request. In other words, whenever someone opens or
updates a pull request in your code repository, you automatically run tests and builds on the new code. This helps catch problems early
before merging.

Continuous Deployment (CD) keeps a dbt project object in Snowflake up to date after your commits are merged. In other words, whenever code
gets merged into a branch, you automatically deploy the updated code to production. This ensures that your production environment stays
up-to-date, reliably and reproducibly.

CI/CD helps avoid manual, error-prone deployments, ensures changes are validated before being merged, and enables consistent, repeatable
deployments and versioning.

## Why use CI/CD for a dbt Project

dbt projects define all your data transformations in code, so frequent updates can easily introduce errors. CI catches these issues early by
testing every change in a separate dev environment before merging.

After changes are merged, CD automatically updates the official dbt project object in your Snowflake production environment. This removes
manual steps, reduces risk, keeps everything version-controlled, and supports a reliable, collaborative workflow.

## High-level prerequisites for using CI/CD on dbt Projects

* A dbt project stored in a Git repository (for example, GitHub).
* A Snowflake account and user with privileges as described in [Access control for dbt projects on Snowflake](dbt-projects-on-snowflake-access-control.md).
* Privileges to create and edit the following objects or access to an administrator who can create each of them on your behalf:

  + GitHub repository environment variables and secrets to hold Snowflake account, database and schema values, and workflow files (for example,
    `.github/workflows/…`) that define CI and CD jobs.
  + Snowflake service account to communicate with GitHub
* A separation between dev environment (for CI) and prod environment (for CD) in Snowflake (for example, separate databases or schemas for each
  environment).
* A way to permit your CI/CD runner (for example, GitHub Actions) to connect to Snowflake, such as OIDC or PAT. For more information, see
  [Safely configure the action in your CI/CD workflow](../../developer-guide/snowflake-cli/cicd/integrate-ci-cd.md).
* In your code repository, a `profiles.yml` file configured to point to dev and prod targets (for example, databases/schemas, warehouse).
* A network policy that allows inbound access from your Git provider into Snowflake.

## CI/CD workflow overview

The following steps outline the typical workflow with CI/CD.

1. Developer writes or modifies dbt code (models, tests, etc.) in a branch.
2. Developer opens a pull request.
3. CI kicks in: a tester instance of the dbt project object is deployed to the Snowflake dev environment, which runs the `dbt run`
   and `dbt test` commands.

   * If an operation fails, the pull request fails. The developer must fix and update, then rerun.
   * If all operations pass, the pull request is eligible for merge.
4. Pull request is merged to main.
5. CD kicks in: the production dbt project object in Snowflake is updated to reflect the latest code.
6. Optionally, automated scheduling (for example, via Snowflake tasks) can be deployed, so data pipelines run on a schedule without manual
   intervention.

---
title: CJIS
source: https://docs.snowflake.com/en/user-guide/cert-cjis.md
section: User Guide
---

# CJIS

This topic describes how Snowflake supports customers with CJIS compliance requirements.

## Understanding CJIS compliance requirements

Snowflake’s [U.S. regions supporting public sector workloads](intro-regions.md) are ready and able to support customer compliance with the
FBI’s CJIS Security Policy. The CJIS Security Policy provides federal and state agencies with a unified set of standards for
the protection and safeguarding of Criminal Justice Information (CJI) in the cloud. Snowflake recognizes the importance of protecting CJI
and works collaboratively with customers to satisfy CJIS requirements.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Classification (Snowflake ML Functions)
source: https://docs.snowflake.com/en/user-guide/ml-functions/classification.md
section: User Guide
---

# Classification (Snowflake ML Functions)

Classification uses machine learning algorithms to sort data into different classes using patterns detected in training
data. Binary classification (two classes) and multi-class classification (more than two classes) are supported. Common
use cases of classification include customer churn prediction, credit card fraud detection, and spam detection.

> **Note:**
>
> Classification is part of Snowflake’s suite of business analysis tools powered by machine learning.

Classification involves creating a classification model object, passing in a reference to the training data. The model
is fitted to the provided training data. You then use the resulting schema-level classification model object to classify
new data points and to understand the model’s accuracy through its evaluation APIs.

## About the Classification Model

The classification function is powered by a
[gradient boosting machine](https://en.wikipedia.org/wiki/Gradient_boosting).
For binary classification, the model is trained using an
[area-under-the-curve](https://en.wikipedia.org/wiki/Receiver_operating_characteristic#Area_under_the_curve)
loss function. For multi-class classification, the model is trained using a
[logistic loss](https://en.wikipedia.org/wiki/Loss_functions_for_classification#Logistic_loss) function.

Suitable training datasets for use with classification include a target column representing the labeled class of each
data point and at least one feature column.

The classification model supports numeric, Boolean, and string data types for features and labels.

* Numeric features are treated as continuous. To treat numeric features as categorical, cast them to strings.
* String features are treated as categorical. The classification function supports high-cardinality features (for
  example, job titles or fruits). It does not support full free text, like sentences or paragraphs.
* Boolean features are treated as categorical.
* Timestamps must be [TIMESTAMP_NTZ](../../sql-reference/data-types-datetime.md) type. The model creates additional
  time-based features (epoch, day, week, month), which are used in training and classification. These features appear in
  [SHOW_FEATURE_IMPORTANCE](../../sql-reference/classes/classification/methods/show_feature_importance.md) results as
  `derived_features`.
* The cardinality of the label (target) column must be greater than one and less than the number of rows in the dataset.

Inference data must have the same feature names and types as training data. It is not an error for a categorical feature
to have a value that is not present in the training dataset. Columns in the inference data that were not present in the
training dataset are ignored.

Classification models can be evaluated for quality of prediction. In the evaluation process, an additional model is
trained on the original data but with some data points withheld. The withheld data points are then used for inference,
and the predicted classes are compared to the actual classes.

### Current Limitations

* Training and inference data must be numeric, TIMESTAMP_NTZ, Boolean, or string. Other types must be cast to one of
  these types.
* You cannot choose or modify the classification algorithm.
* Model parameters cannot be manually specified or adjusted.
* In tests, training on a Medium Snowpark-optimized warehouse has succeeded with up to 1,000 columns and 600 million
  rows. It is possible, but unlikely, to run out of memory below this limit.
* Your target column must contain no more than 255 distinct classes.
* SNOWFLAKE.ML.CLASSIFICATION instances cannot be cloned. When you clone or replicate a database containing a
  classification model, the model is currently skipped.

## Preparing for Classification

Before you can use classification, you must:

* Select a virtual warehouse in which to train and run your models.
* Grant the privileges required to create classification models.

You might also want to [modify your search path](../../sql-reference/snowflake-db-classes.md) to include the SNOWFLAKE.ML schema.

### Selecting a Virtual Warehouse

A Snowflake [virtual warehouse](../warehouses.md) provides the compute resources for training and using
classification machine learning models. This section provides general guidance on selecting the best type and size of
warehouse for classification, focusing on the training step, the most time-consuming and memory-intensive part of the
process.

You should choose the warehouse type based on the size of your training data. Standard warehouses are subject to a lower
Snowpark memory limit, and are appropriate for prototyping with fewer rows or features. Memory limits of standard
warehouses do not increase with warehouse size.

As the number of rows or features increases, consider using a Snowpark-optimized warehouse to ensure training can
succeed. Memory limits of Snowpark-optimized warehouses do not increase above Medium.

For best performance, train your models using a dedicated warehouse without other concurrent workloads.

To minimize costs, we recommend using an X-Small standard warehouse for prototyping. For larger datasets and production
workloads, use a Medium Snowpark-optimized warehouse.

### Granting Privileges to Create Classification Models

Training a classification model results in a schema-level object. Therefore, the role you use to create models must
have the CREATE SNOWFLAKE.ML.CLASSIFICATION privilege on the schema where the model will be created, allowing the
model to be stored there. This privilege is similar to other schema privileges like CREATE TABLE or CREATE VIEW.

Snowflake recommends that you create a role named `analyst` to be used by people who need to create classification
models.

In the following example, the `admin` role is the owner of the schema `admin_db.admin_schema`. The
`analyst` role needs to create models in this schema.

```sqlexample
USE ROLE admin;
GRANT USAGE ON DATABASE admin_db TO ROLE analyst;
GRANT USAGE ON SCHEMA admin_schema TO ROLE analyst;
GRANT CREATE SNOWFLAKE.ML.CLASSIFICATION ON SCHEMA admin_db.admin_schema TO ROLE analyst;
```

To use this schema, a user assumes the role `analyst`:

```sqlexample
USE ROLE analyst;
USE SCHEMA admin_db.admin_schema;
```

If the `analyst` role has CREATE SCHEMA privileges in database `analyst_db`, the role can create a new schema
`analyst_db.analyst_schema` and create classification models in that schema:

```sqlexample
USE ROLE analyst;
CREATE SCHEMA analyst_db.analyst_schema;
USE SCHEMA analyst_db.analyst_schema;
```

To revoke a role’s model creation privilege on the schema, use [REVOKE <privileges> … FROM ROLE](../../sql-reference/sql/revoke-privilege.md):

```sqlexample
REVOKE CREATE SNOWFLAKE.ML.CLASSIFICATION ON SCHEMA admin_db.admin_schema FROM ROLE analyst;
```

## Training, Using, Viewing, Deleting, and Updating Models

> **Note:**
>
> SNOWFLAKE.ML.CLASSIFICATION runs using limited privileges, so by default, it does not have access to your data. You
> must therefore pass tables and views as [references](../../developer-guide/stored-procedure/stored-procedures-calling-references.md),
> which pass along the caller’s privileges. You can also provide a [query reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md)
> instead of a reference to a table or a view.

See the [CLASSIFICATION reference](../../sql-reference/classes/classification.md) for information about training, inference, and evaluation APIs.

Use CREATE SNOWFLAKE.ML.CLASSIFICATION to create and train a model.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.CLASSIFICATION <model_name>(...);
```

To run inference (prediction) on a dataset, use the model’s PREDICT method.

```sqlexample
SELECT <model_name>!PREDICT(...);
```

To evaluate a model, call the provided evaluation methods.

```sqlexample
CALL <model_name>!SHOW_EVALUATION_METRICS();
CALL <model_name>!SHOW_GLOBAL_EVALUATION_METRICS();
CALL <model_name>!SHOW_THRESHOLD_METRICS();
CALL <model_name>!SHOW_CONFUSION_MATRIX();
```

To show a model’s feature importance ranking, call its SHOW_FEATURE_IMPORTANCE method.

```sqlexample
CALL <model_name>!SHOW_FEATURE_IMPORTANCE();
```

To investigate logs generated during training, use the SHOW_TRAINING_LOGS method. If no training logs are available, this call returns NULL.

```sqlexample
CALL <model_name>!SHOW_TRAINING_LOGS();
```

> **Tip:**
>
> For examples of using these methods, see Examples.

To view all classification models, use the SHOW command.

```sqlexample
SHOW SNOWFLAKE.ML.CLASSIFICATION;
```

To delete a classification model, use the DROP command.

```sqlexample
DROP SNOWFLAKE.ML.CLASSIFICATION <model_name>;
```

Models are immutable and cannot be updated in place. To update a model, drop the existing model and train a new one.
The CREATE OR REPLACE variant of the CREATE command is useful for this purpose.

## Examples

### Setting Up the Data for the Examples

The examples in this topic uses two tables. The first table, `training_purchase_data`, has two feature columns: a
binary label column and a multi-class label column. The second table is called
`prediction_purchase_data` and has two feature columns. Use the SQL below to create these
tables.

```sqlexample
CREATE OR REPLACE TABLE training_purchase_data AS (
    SELECT
        CAST(UNIFORM(0, 4, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(0, 3, RANDOM()) AS user_rating,
        FALSE AS label,
        'not_interested' AS class
    FROM TABLE(GENERATOR(rowCount => 100))
    UNION ALL
    SELECT
        CAST(UNIFORM(4, 7, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(3, 7, RANDOM()) AS user_rating,
        FALSE AS label,
        'add_to_wishlist' AS class
    FROM TABLE(GENERATOR(rowCount => 100))
    UNION ALL
    SELECT
        CAST(UNIFORM(7, 10, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(7, 10, RANDOM()) AS user_rating,
        TRUE AS label,
        'purchase' AS class
    FROM TABLE(GENERATOR(rowCount => 100))
);

CREATE OR REPLACE table prediction_purchase_data AS (
    SELECT
        CAST(UNIFORM(0, 4, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(0, 3, RANDOM()) AS user_rating
    FROM TABLE(GENERATOR(rowCount => 100))
    UNION ALL
    SELECT
        CAST(UNIFORM(4, 7, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(3, 7, RANDOM()) AS user_rating
    FROM TABLE(GENERATOR(rowCount => 100))
    UNION ALL
    SELECT
        CAST(UNIFORM(7, 10, RANDOM()) AS VARCHAR) AS user_interest_score,
        UNIFORM(7, 10, RANDOM()) AS user_rating
    FROM TABLE(GENERATOR(rowCount => 100))
);
```

### Training and Using a Binary Classifier

First, create a view containing binary data for training.

```sqlexample
CREATE OR REPLACE view binary_classification_view AS
    SELECT user_interest_score, user_rating, label
FROM training_purchase_data;
SELECT * FROM binary_classification_view ORDER BY RANDOM(42) LIMIT 5;
```

The SELECT statement returns results in the following form.

```output
+---------------------+-------------+-------+
| USER_INTEREST_SCORE | USER_RATING | LABEL |
|---------------------+-------------+-------|
| 5                   |           4 | False |
| 8                   |           8 | True  |
| 6                   |           5 | False |
| 7                   |           7 | True  |
| 7                   |           4 | False |
+---------------------+-------------+-------+
```

Using this view, create and train a binary classification model.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.CLASSIFICATION model_binary(
    INPUT_DATA => SYSTEM$REFERENCE('view', 'binary_classification_view'),
    TARGET_COLNAME => 'label'
);
```

After you’ve created the model, use its PREDICT method to infer labels for the unlabeled purchase data. You can use
[wildcard expansion in an object literal](../../sql-reference/data-types-semistructured.md) to create key-value pairs of features
for the INPUT_DATA argument.

```sqlexample
SELECT model_binary!PREDICT(INPUT_DATA => {*})
    AS prediction FROM prediction_purchase_data;
```

The model returns output in the following format. The prediction object includes predicted probabilities for each class
and the predicted class based on the maximum predicted probability. The predictions are returned in the same order as
the original features were provided.

```output
+-------------------------------------+
| PREDICTION                          |
|-------------------------------------|
| {                                   |
|   "class": "True",                  |
|   "logs": null,                     |
|   "probability": {                  |
|     "False": 1.828038600000000e-03, |
|     "True": 9.981719614000000e-01   |
|   }                                 |
| }                                   |
| {                                   |
|   "class": "False",                 |
|   "logs": null,                     |
|   "probability": {                  |
|     "False": 9.992944771000000e-01, |
|     "True": 7.055229000000000e-04   |
|   }                                 |
| }                                   |
| {                                   |
|   "class": "True",                  |
|   "logs": null,                     |
|   "probability": {                  |
|     "False": 3.429796010000000e-02, |
|     "True": 9.657020399000000e-01   |
|   }                                 |
| }                                   |
| {                                   |
|   "class": "False",                 |
|   "logs": null,                     |
|   "probability": {                  |
|     "False": 9.992687686000000e-01, |
|     "True": 7.312314000000000e-04   |
|   }                                 |
| }                                   |
| {                                   |
|   "class": "False",                 |
|   "logs": null,                     |
|   "probability": {                  |
|     "False": 9.992951615000000e-01, |
|     "True": 7.048385000000000e-04   |
|   }                                 |
| }                                   |
+-------------------------------------+
```

To zip together features and predictions, use a query like the following.

```sqlexample
SELECT *, model_binary!PREDICT(INPUT_DATA => {*})
    AS predictions FROM prediction_purchase_data;
```

```output
+---------------------+-------------+-------------------------------------+
| USER_INTEREST_SCORE | USER_RATING | PREDICTIONS                         |
|---------------------+-------------+-------------------------------------|
| 9                   |           8 | {                                   |
|                     |             |   "class": "True",                  |
|                     |             |   "logs": null,                     |
|                     |             |   "probability": {                  |
|                     |             |     "False": 1.828038600000000e-03, |
|                     |             |     "True": 9.981719614000000e-01   |
|                     |             |   }                                 |
|                     |             | }                                   |
| 3                   |           0 | {                                   |
|                     |             |   "class": "False",                 |
|                     |             |   "logs": null,                     |
|                     |             |   "probability": {                  |
|                     |             |     "False": 9.992944771000000e-01, |
|                     |             |     "True": 7.055229000000000e-04   |
|                     |             |   }                                 |
|                     |             | }                                   |
| 10                  |           7 | {                                   |
|                     |             |   "class": "True",                  |
|                     |             |   "logs": null,                     |
|                     |             |   "probability": {                  |
|                     |             |     "False": 3.429796010000000e-02, |
|                     |             |     "True": 9.657020399000000e-01   |
|                     |             |   }                                 |
|                     |             | }                                   |
| 6                   |           6 | {                                   |
|                     |             |   "class": "False",                 |
|                     |             |   "logs": null,                     |
|                     |             |   "probability": {                  |
|                     |             |     "False": 9.992687686000000e-01, |
|                     |             |     "True": 7.312314000000000e-04   |
|                     |             |   }                                 |
|                     |             | }                                   |
| 1                   |           3 | {                                   |
|                     |             |   "class": "False",                 |
|                     |             |   "logs": null,                     |
|                     |             |   "probability": {                  |
|                     |             |     "False": 9.992951615000000e-01, |
|                     |             |     "True": 7.048385000000000e-04   |
|                     |             |   }                                 |
|                     |             | }                                   |
+---------------------+-------------+-------------------------------------+
```

### Training and Using a Multi-Class Classifier

Create a view containing binary data for training.

```sqlexample
CREATE OR REPLACE VIEW multiclass_classification_view AS
    SELECT user_interest_score, user_rating, class
FROM training_purchase_data;
SELECT * FROM multiclass_classification_view ORDER BY RANDOM(42) LIMIT 10;
```

This SELECT statement returns results in the following form.

```output
+---------------------+-------------+-----------------+
| USER_INTEREST_SCORE | USER_RATING | CLASS           |
|---------------------+-------------+-----------------|
| 5                   |           4 | add_to_wishlist |
| 8                   |           8 | purchase        |
| 6                   |           5 | add_to_wishlist |
| 7                   |           7 | purchase        |
| 7                   |           4 | add_to_wishlist |
| 1                   |           1 | not_interested  |
| 2                   |           1 | not_interested  |
| 7                   |           3 | add_to_wishlist |
| 2                   |           0 | not_interested  |
| 0                   |           1 | not_interested  |
+---------------------+-------------+-----------------+
```

Now create a multi-class classification model from this view.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.CLASSIFICATION model_multiclass(
    INPUT_DATA => SYSTEM$REFERENCE('view', 'multiclass_classification_view'),
    TARGET_COLNAME => 'class'
);
```

After you’ve created the model, use its PREDICT method to infer labels for the unlabeled purchase data. Use
[wildcard expansion in an object literal](../../sql-reference/data-types-semistructured.md) to automatically create key-value pairs
for the INPUT_DATA argument.

```sqlexample
SELECT *, model_multiclass!PREDICT(INPUT_DATA => {*})
    AS predictions FROM prediction_purchase_data;
```

The model returns output in the following format. The prediction object includes predicted probabilities for each class
and the predicted class based on the maximum predicted probability. The predictions are returned in the same order as the
original features provided and can be joined in the same query.

```output
+---------------------+-------------+-----------------------------------------------+
| USER_INTEREST_SCORE | USER_RATING | PREDICTIONS                                   |
|---------------------+-------------+-----------------------------------------------|
| 9                   |           8 | {                                             |
|                     |             |   "class": "purchase",                        |
|                     |             |   "logs": null,                               |
|                     |             |   "probability": {                            |
|                     |             |     "add_to_wishlist": 3.529288000000000e-04, |
|                     |             |     "not_interested": 2.259768000000000e-04,  |
|                     |             |     "purchase": 9.994210944000000e-01         |
|                     |             |   }                                           |
|                     |             | }                                             |
| 3                   |           0 | {                                             |
|                     |             |   "class": "not_interested",                  |
|                     |             |   "logs": null,                               |
|                     |             |   "probability": {                            |
|                     |             |     "add_to_wishlist": 3.201690000000000e-04, |
|                     |             |     "not_interested": 9.994749885000000e-01,  |
|                     |             |     "purchase": 2.048425000000000e-04         |
|                     |             |   }                                           |
|                     |             | }                                             |
| 10                  |           7 | {                                             |
|                     |             |   "class": "purchase",                        |
|                     |             |   "logs": null,                               |
|                     |             |   "probability": {                            |
|                     |             |     "add_to_wishlist": 1.271809310000000e-02, |
|                     |             |     "not_interested": 3.992673600000000e-03,  |
|                     |             |     "purchase": 9.832892333000000e-01         |
|                     |             |   }                                           |
|                     |             | }                                             |
| 6                   |           6 | {                                             |
|                     |             |   "class": "add_to_wishlist",                 |
|                     |             |   "logs": null,                               |
|                     |             |   "probability": {                            |
|                     |             |     "add_to_wishlist": 9.999112027000000e-01, |
|                     |             |     "not_interested": 4.612520000000000e-05,  |
|                     |             |     "purchase": 4.267210000000000e-05         |
|                     |             |   }                                           |
|                     |             | }                                             |
| 1                   |           3 | {                                             |
|                     |             |   "class": "not_interested",                  |
|                     |             |   "logs": null,                               |
|                     |             |   "probability": {                            |
|                     |             |     "add_to_wishlist": 2.049559150000000e-02, |
|                     |             |     "not_interested": 9.759854413000000e-01,  |
|                     |             |     "purchase": 3.518967300000000e-03         |
|                     |             |   }                                           |
|                     |             | }                                             |
+---------------------+-------------+-----------------------------------------------+
```

### Saving Results to a Table and Exploring Predictions

Results of the calls to models’ PREDICT method can be read directly into a query, but saving the results to a table
allows you to conveniently explore predictions.

```sqlexample
CREATE OR REPLACE TABLE my_predictions AS
SELECT *, model_multiclass!PREDICT(INPUT_DATA => {*}) AS predictions FROM prediction_purchase_data;

SELECT * FROM my_predictions;
```

The key and prediction columns can then be explored in further queries. The query below explores predictions.

```sqlexample
SELECT
    predictions:class AS predicted_class,
    ROUND(predictions:probability:not_interested,4) AS not_interested_class_probability,
    ROUND(predictions['probability']['purchase'],4) AS purchase_class_probability,
    ROUND(predictions['probability']['add_to_wishlist'],4) AS add_to_wishlist_class_probability
FROM my_predictions
LIMIT 5;
```

The query above returns results in the following form.

```output
+-------------------+----------------------------------+----------------------------+-----------------------------------+
| PREDICTED_CLASS   | NOT_INTERESTED_CLASS_PROBABILITY | PURCHASE_CLASS_PROBABILITY | ADD_TO_WISHLIST_CLASS_PROBABILITY |
|-------------------+----------------------------------+----------------------------+-----------------------------------|
| "purchase"        |                           0.0002 |                     0.9994 |                            0.0004 |
| "not_interested"  |                           0.9995 |                     0.0002 |                            0.0003 |
| "purchase"        |                           0.0002 |                     0.9994 |                            0.0004 |
| "purchase"        |                           0.0002 |                     0.9994 |                            0.0004 |
| "not_interested"  |                           0.9994 |                     0.0002 |                            0.0004 |
| "purchase"        |                           0.0002 |                     0.9994 |                            0.0004 |
| "add_to_wishlist" |                           0      |                     0      |                            0.9999 |
| "add_to_wishlist" |                           0.4561 |                     0.0029 |                            0.5409 |
| "purchase"        |                           0.0002 |                     0.9994 |                            0.0004 |
| "not_interested"  |                           0.9994 |                     0.0002 |                            0.0003 |
+-------------------+----------------------------------+----------------------------+-----------------------------------+
```

### Using Evaluation Functions

By default, evaluation is enabled on all instances. However, evaluation can be manually enabled or disabled using the
config object argument. If the key ‘evaluate’ is specified with the value FALSE, evaluation is not available.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.CLASSIFICATION model(
    INPUT_DATA => SYSTEM$REFERENCE('view', 'binary_classification_view'),
    TARGET_COLNAME => 'label',
    CONFIG_OBJECT => {'evaluate': TRUE}
);
```

When evaluation is enabled, evaluation metrics can be obtained using the evaluation APIs shown here.

```sqlexample
CALL model!SHOW_EVALUATION_METRICS();
CALL model!SHOW_GLOBAL_EVALUATION_METRICS();
CALL model!SHOW_THRESHOLD_METRICS();
CALL model!SHOW_CONFUSION_MATRIX();
```

See Understanding Evaluation Metrics for a description of the returned metrics.

The evaluation metrics of our multiclass model are as follows..

```sqlexample
CALL model_multiclass!SHOW_EVALUATION_METRICS();
```

```output
+--------------+-----------------+--------------+---------------+------+
| DATASET_TYPE | CLASS           | ERROR_METRIC |  METRIC_VALUE | LOGS |
|--------------+-----------------+--------------+---------------+------|
| EVAL         | add_to_wishlist | precision    |  0.8888888889 | NULL |
| EVAL         | add_to_wishlist | recall       |  1            | NULL |
| EVAL         | add_to_wishlist | f1           |  0.9411764706 | NULL |
| EVAL         | add_to_wishlist | support      | 16            | NULL |
| EVAL         | not_interested  | precision    |  1            | NULL |
| EVAL         | not_interested  | recall       |  0.9090909091 | NULL |
| EVAL         | not_interested  | f1           |  0.9523809524 | NULL |
| EVAL         | not_interested  | support      | 22            | NULL |
| EVAL         | purchase        | precision    |  1            | NULL |
| EVAL         | purchase        | recall       |  1            | NULL |
| EVAL         | purchase        | f1           |  1            | NULL |
| EVAL         | purchase        | support      | 22            | NULL |
+--------------+-----------------+--------------+---------------+------+
```

```sqlexample
CALL model_multiclass!SHOW_GLOBAL_EVALUATION_METRICS();
```

```output
+--------------+--------------+--------------+---------------+------+
| DATASET_TYPE | AVERAGE_TYPE | ERROR_METRIC |  METRIC_VALUE | LOGS |
|--------------+--------------+--------------+---------------+------|
| EVAL         | macro        | precision    | 0.962962963   | NULL |
| EVAL         | macro        | recall       | 0.9696969697  | NULL |
| EVAL         | macro        | f1           | 0.964519141   | NULL |
| EVAL         | macro        | auc          | 0.9991277911  | NULL |
| EVAL         | weighted     | precision    | 0.9703703704  | NULL |
| EVAL         | weighted     | recall       | 0.9666666667  | NULL |
| EVAL         | weighted     | f1           | 0.966853408   | NULL |
| EVAL         | weighted     | auc          | 0.9991826156  | NULL |
| EVAL         | NULL         | log_loss     | 0.06365200147 | NULL |
+--------------+--------------+--------------+---------------+------+
```

```sqlexample
CALL model_multiclass!SHOW_CONFUSION_MATRIX();
```

```output
+--------------+-----------------+-----------------+-------+------+
| DATASET_TYPE | ACTUAL_CLASS    | PREDICTED_CLASS | COUNT | LOGS |
|--------------+-----------------+-----------------+-------+------|
| EVAL         | add_to_wishlist | add_to_wishlist |    16 | NULL |
| EVAL         | add_to_wishlist | not_interested  |     0 | NULL |
| EVAL         | add_to_wishlist | purchase        |     0 | NULL |
| EVAL         | not_interested  | add_to_wishlist |     2 | NULL |
| EVAL         | not_interested  | not_interested  |    20 | NULL |
| EVAL         | not_interested  | purchase        |     0 | NULL |
| EVAL         | purchase        | add_to_wishlist |     0 | NULL |
| EVAL         | purchase        | not_interested  |     0 | NULL |
| EVAL         | purchase        | purchase        |    22 | NULL |
+--------------+-----------------+-----------------+-------+------+
```

> **Note:**
>
> For information on threshold metrics, see [SHOW_THRESHOLD_METRICS](../../sql-reference/classes/classification/methods/show_threshold_metrics.md).

We can also review feature importance.

```sqlexample
CALL model_multiclass!SHOW_FEATURE_IMPORTANCE();
```

```output
+------+---------------------+---------------+---------------+
| RANK | FEATURE             |         SCORE | FEATURE_TYPE  |
|------+---------------------+---------------+---------------|
|    1 | USER_RATING         | 0.9186571982  | user_provided |
|    2 | USER_INTEREST_SCORE | 0.08134280181 | user_provided |
+------+---------------------+---------------+---------------+
```

## Model Roles and Usage Privileges

Each classification model instance includes two model roles, `mladmin` and `mlconsumer`. These roles are scoped to the
model itself: `model!mladmin` and `model!mlconsumer`. The owner of the model object (initially, its creator) is
automatically granted the `model!mladmin` and `model!mlconsumer` roles, and can grant these roles to account roles
and database roles.

The `mladmin` role permits usage of all APIs invocable from the model object, including but not limited to prediction
methods and evaluation methods. The `mlconsumer` role permits usage only on prediction APIs, not other exploratory
APIs.

The following SQL example illustrates the grant of classification model roles to other roles. The role `r1` can create
a classification model, and grants the role `r2` the `mlconsumer` privilege so that the `r2` can call that model’s
PREDICT method. Then `r1` grants the `mladmin` role to another role, `r3`, so `r3` can call all methods of the
model.

First, role `r1` creates a model object, making `r1` the owner of the model `model`.

```sqlexample
USE ROLE r1;
CREATE OR REPLACE SNOWFLAKE.ML.CLASSIFICATION model(
    INPUT_DATA => SYSTEM$REFERENCE('TABLE', 'test_classification_dataset'),
    TARGET_COLNAME => 'LABEL'
);
```

You can see by executing the statements below that the role `r2` cannot call the model’s PREDICT method.

```sqlexample
USE ROLE r2;
SELECT model!PREDICT(1);    -- privilege error
```

Next, `r1` grants `r2` the `mlconsumer` instance role, after which `r2` can call the model’s PREDICT method.

```sqlexample
USE ROLE r1;
GRANT SNOWFLAKE.ML.CLASSIFICATION ROLE model!mlconsumer TO ROLE r2;

USE ROLE r2;
CALL model!PREDICT(
    INPUT_DATA => system$query_reference(
    'SELECT {*} FROM test_classification_dataset')
);
```

Similarly, role `r3` cannot see the model’s evaluation metrics without the `mladmin` instance role.

```sqlexample
USE ROLE r3;
CALL model!SHOW_EVALUATION_METRICS();   -- privilege error
```

Role `r1` grants the required role to `r3`, and `r3` can then call the model’s SHOW_EVALUATION_METRICS method.

```sqlexample
USE ROLE r1;
GRANT SNOWFLAKE.ML.CLASSIFICATION ROLE model!mladmin TO ROLE r3;

USE ROLE r3;
CALL model!SHOW_EVALUATION_METRICS();
```

You can revoke the privileges as follows.

```sqlexample
USE ROLE r1;
REVOKE SNOWFLAKE.ML.CLASSIFICATION ROLE model!mlconsumer FROM ROLE r2;
REVOKE SNOWFLAKE.ML.CLASSIFICATION ROLE model!mladmin FROM ROLE r3;
```

Use the following commands to see which account roles and database roles have been granted each of these instance roles.

```sqlexample
SHOW GRANTS TO SNOWFLAKE.ML.CLASSIFICATION ROLE <model_name>!mladmin;
SHOW GRANTS TO SNOWFLAKE.ML.CLASSIFICATION ROLE <model_name>!mlconsumer;
```

## Understanding Evaluation Metrics

Metrics measure how accurately a model predicts new data. The Snowflake classification currently evaluates models by selecting a
random sample from the entire dataset. A new model is trained without these rows, and then the rows are used as inference input.
The random sample portion can be configured using the `test_fraction` key in the EVALUATION_CONFIG object.

### Metrics in `show_evaluation_metrics`

`show_evaluation_metrics` calculates the following values for each class. See
[SHOW_EVALUATION_METRICS](../../sql-reference/classes/classification/methods/show_evaluation_metrics.md).

* *Positive Instances*: Instances of data (rows) that belong to the class of interest or the class being predicted.
* *Negative Instances*: Instances of data (rows) that do not belong to the class of interest or are the opposite of what is being predicted.
* *True Positives (TP)*: Correct predictions of positive instances.
* *True Negatives (TN)*: Correct predictions of negative instances,
* *False Positives (FP)*: Incorrect predictions of positive instances
* *False Negatives (FN)*: Incorrect predictions of negative instances.

Using the values above, the following metrics are reported for each class. For each metric, a higher value indicates a
more predictive model.

* *Precision*: The ratio of true positives to the total predicted positives. It measures how many of the predicted
  positive instances are actually positive.
* *Recall (Sensitivity)*: The ratio of true positives to the total actual positives. It measures how many of the actual
  positive instances were correctly predicted.
* *F1 Score*: The harmonic mean of precision and recall. It provides a balance between precision and recall, especially
  when there is an uneven class distribution.

### Metrics in `show_global_evaluation_metrics`

`show_global_evaluation_metrics` calculates overall (global) metrics for all classes predicted by the model by averaging the per-class metrics
calculated by `show_evaluation_metrics`. See
[SHOW_GLOBAL_EVALUATION_METRICS](../../sql-reference/classes/classification/methods/show_global_evaluation_metrics.md).

Currently, `macro` and `weighted` averaging is used for the metrics Precision, Recall, F1, AUC.

Logistic Loss (LogLoss) is calculated for the model as a whole. The objective of prediction is to minimize the loss
function.

### Metrics in `show_threshold_metrics`

`show_threshold_metrics` provides raw counts and metrics for a specific threshold for each class. This can be used to
plot ROC and PR curves or do threshold tuning if desired. The threshold varies from 0 to 1 for each specific class; a
predicted probability is assigned. See [SHOW_THRESHOLD_METRICS](../../sql-reference/classes/classification/methods/show_threshold_metrics.md).

The sample is classified as belonging to a class if the predicted probability of being in that class exceeds the
specified threshold. The true and false positives and negatives are computed considering the negative class as
every instance that does not belong to the class being considered. The following metrics are then computed.

* *True positive rate (TPR)*: The proportion of actual positive instances that the model correctly identifies (equivalent to Recall).
* *False positive rate (FPR)*: The proportion of actual negative instances that were incorrectly predicted as positive.
* *Accuracy*: The ratio of correct predictions (both true positives and true negatives) to the total number of
  predictions, an overall measure of how well the model is performing. This metric can be misleading in
  unbalanced cases.
* *Support*: The number of actual occurrences of a class in the specified dataset. Higher support values indicate a larger
  representation of a class in the dataset. Support is not itself a metric of the model but a characteristic of the dataset.

### Confusion Matrix in `show_confusion_matrix`

The confusion matrix is a table used to assess the performance of a model by comparing predicted and actual values and
evaluating its ability to correctly identify positive and negative instances. The objective is to maximize the number of
instances on the diagonal of the matrix while minimizing the number of off-diagonal instances. See
[SHOW_CONFUSION_MATRIX](../../sql-reference/classes/classification/methods/show_confusion_matrix.md).

You can visualize the confusion matrix in Snowsight as follows.

```sqlexample
CALL model_binary!SHOW_CONFUSION_MATRIX();
```

The results look like the following.

```output
+--------------+--------------+-----------------+-------+------+
| DATASET_TYPE | ACTUAL_CLASS | PREDICTED_CLASS | COUNT | LOGS |
|--------------+--------------+-----------------+-------+------|
| EVAL         | false        | false           |    37 | NULL |
| EVAL         | false        | true            |     1 | NULL |
| EVAL         | true         | false           |     0 | NULL |
| EVAL         | true         | true            |    22 | NULL |
+--------------+--------------+-----------------+-------+------+
```

To visualize the confusion matrix, click on Chart, then Chart Type, then Heatgrid. Under Data, for
Cell values select NONE, for Rows select PREDICTED_CLASS, and for Columns select ACTUAL_CLASS. The
result appears similar to the figure below.

## Understanding Feature Importance

A classification model can explain the relative importance of all features used in the model, This information is useful
in understanding what factors are really influencing your data.

The [SHOW_FEATURE_IMPORTANCE](../../sql-reference/classes/classification/methods/show_feature_importance.md) method counts
the number of times the model’s trees used each feature to make a decision. These feature importance scores are then
normalized to values between 0 and 1 so that their sum is 1. The resulting scores represent an approximate ranking of
the features in your trained model.

Features that are close in score have similar importance. Using multiple features that are very similar to each other
may result in reduced importance scores for those features.

### Limitations

* You cannot choose the technique used to calculate feature importance.
* Feature importance scores can be helpful for gaining intuition about which features are important to your model’s
  accuracy, but the actual values should be considered estimates.

### Example

```sqlexample
CALL model_binary!SHOW_FEATURE_IMPORTANCE();
```

```output
+------+---------------------+---------------+---------------+
| RANK | FEATURE             |         SCORE | FEATURE_TYPE  |
|------+---------------------+---------------+---------------|
|    1 | USER_RATING         | 0.9295302013  | user_provided |
|    2 | USER_INTEREST_SCORE | 0.07046979866 | user_provided |
+------+---------------------+---------------+---------------+
```

## Cost Considerations

Training and using classification models incurs compute and storage costs.

Using any APIs from the Classification feature (training a model, predicting with the model, retrieving metrics) all
require an active warehouse. The compute cost of using Classification functions is charged to the warehouse. See
[Understanding Compute Cost](../cost-understanding-compute.md) for general information on Snowflake compute
costs.

For details on costs for using ML functions in general, see [Cost Considerations](../../guides-overview-ml-functions.md) in the ML functions overview.

---
title: Clone databases that contain hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-clone.md
section: User Guide
---

# Clone databases that contain hybrid tables

You can clone databases that contain hybrid tables for two main purposes:

* To run point-in-time restore operations. Cloning works in combination with [Time Travel](data-time-travel.md),
  which by default creates implicit continuous backups. After [setting a data retention period](data-time-travel.md),
  you can clone a database at any point in its Time Travel history to restore the database to a healthy state (in the event that a
  corruption was introduced). You do not need to create a clone except when a restore is necessary.
* To hydrate other environments from a source environment, such as cloning a database from production to development or test.

Before you attempt to create any cloned databases that contain hybrid tables, be sure to read and understand the specific requirements
and limitations in the following sections.

## Cloning hybrid tables at the database level

Hybrid table clones must be created at the database level. For example:

```sqlexample
CREATE DATABASE clone_db1 CLONE db1;
```

You cannot clone hybrid tables at the schema level or the table level. If you try to create a new hybrid table by cloning a hybrid table or a standard table, the command fails with an error. For example:

```sqlexample
CREATE HYBRID TABLE clone_ht1 CLONE ht1;
```

```output
391411 (0A000): This feature is not supported for hybrid tables: 'CLONE'.
```

If you try to create a schema by cloning another schema, and the source schema has one or more hybrid tables, the command fails. However, you can clone the schema by using the [IGNORE HYBRID TABLES](../sql-reference/sql/create-clone.md) parameter to explicitly skip the hybrid tables in the schema. This parameter also works for creating database clones. For example:

```sqlexample
CREATE OR REPLACE SCHEMA clone_ht_schema CLONE ht_schema IGNORE HYBRID TABLES;
```

```output
+----------------------------------------------+
| status                                       |
|----------------------------------------------|
| Schema CLONE_HT_SCHEMA successfully created. |
+----------------------------------------------+
```

## Usage notes for cloning hybrid tables

* You cannot create clones that include hybrid tables by using the AT BEFORE, OFFSET, or STATEMENT (query UUID) parameters.
  You must specify either no parameters at all or AT TIMESTAMP with an explicitly cast TIMESTAMP value.
* Consistent with the behavior for standard tables, the history of a source table that is cloned is not retained by the clone itself. Cloned tables lose all the prior history of their source tables, which means that
  you cannot use Time Travel to see any past state after they have been cloned. Time Travel can be used to see the new history of tables that accrues after the cloning operation.
* Cloning hybrid tables is a size-of-data operation, while cloning standard tables is a metadata-only operation. This difference has an impact on compute cost, storage cost, and performance.

  + The database clone operation itself incurs compute cost when the database contains hybrid tables.
  + When hybrid tables are cloned, the data is physically copied into the row store; therefore, the cloning operation can take a long time for large tables, and the cost scales linearly with the size of the data.
  + Cloning performance is similar to that of optimized direct bulk loading with CREATE TABLE AS SELECT. See [Loading data](tables-hybrid-create.md).

The examples that follow highlight the main requirements for creating clones of databases that contain hybrid tables. For complete syntax information and usage notes, see [AT | BEFORE](../sql-reference/constructs/at-before.md) and [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

## Example: CREATE DATABASE … CLONE

You can clone a database that contains hybrid tables by using a CREATE DATABASE … CLONE command. The command specifies the name of the existing source database and the name of a new destination database. The cloned database is created as of the AT TIMESTAMP value you specify, or as of
now if you don’t specify a timestamp. The new database is a copy of the schemas and tables that existed in the source at that point in time (regardless of standard or hybrid table type).

The following example demonstrates the expected behavior when you clone a database that contains one or more hybrid tables. The first command shows the two tables that exist in the `testdata` schema of the `testdb` database. The `ht1` table is a hybrid table, and the `st1` table is a standard table.

```sqlexample
SHOW TERSE TABLES;
```

```output
+-------------------------------+------+-------+---------------+-------------+
| created_on                    | name | kind  | database_name | schema_name |
|-------------------------------+------+-------+---------------+-------------|
| 2024-11-14 15:59:32.683 -0800 | HT1  | TABLE | TESTDB        | TESTDATA    |
| 2024-11-14 16:00:01.360 -0800 | ST1  | TABLE | TESTDB        | TESTDATA    |
+-------------------------------+------+-------+---------------+-------------+
```

The following command clones this database, as of 16:01 on November 14, shortly after the tables were created:

```sqlexample
CREATE OR REPLACE DATABASE clone_testdb
  CLONE testdb AT(TIMESTAMP => '2024-11-14 16:01:00'::TIMESTAMP_LTZ);
```

```output
+---------------------------------------------+
| status                                      |
|---------------------------------------------|
| Database CLONE_TESTDB successfully created. |
+---------------------------------------------+
```

To see the cloned tables, use the `testdata` schema in the `clone_testdb` database:

```sqlexample
USE DATABASE clone_testdb;
USE SCHEMA testdata;
```

Use a SHOW TABLES command to check that the tables were successfully cloned:

```sqlexample
SHOW TERSE TABLES;
```

```output
+-------------------------------+------+-------+---------------+-------------+
| created_on                    | name | kind  | database_name | schema_name |
|-------------------------------+------+-------+---------------+-------------|
| 2024-11-14 16:05:14.102 -0800 | HT1  | TABLE | CLONE_TESTDB  | TESTDATA    |
| 2024-11-14 16:05:14.102 -0800 | ST1  | TABLE | CLONE_TESTDB  | TESTDATA    |
+-------------------------------+------+-------+---------------+-------------+
```

## Example: Create a clone that restores a dropped hybrid table

Using the same `testdb` database as the previous example, assume that a user creates and loads another hybrid table named `ht2`.
However, a few minutes later, another user drops the `ht2` table by mistake.

```sqlexample
SHOW TERSE TABLES;
```

```output
+-------------------------------+------+-------+---------------+-------------+
| created_on                    | name | kind  | database_name | schema_name |
|-------------------------------+------+-------+---------------+-------------|
| 2024-11-14 15:59:32.683 -0800 | HT1  | TABLE | TESTDB        | TESTDATA    |
| 2024-11-14 17:37:24.304 -0800 | HT2  | TABLE | TESTDB        | TESTDATA    |
| 2024-11-14 16:00:01.360 -0800 | ST1  | TABLE | TESTDB        | TESTDATA    |
+-------------------------------+------+-------+---------------+-------------+
```

```sqlexample
DROP TABLE HT2;
```

```output
+---------------------------+
| status                    |
|---------------------------|
| HT2 successfully dropped. |
+---------------------------+
```

```sqlexample
SHOW TERSE TABLES;
```

```output
+-------------------------------+------+-------+---------------+-------------+
| created_on                    | name | kind  | database_name | schema_name |
|-------------------------------+------+-------+---------------+-------------|
| 2024-11-14 15:59:32.683 -0800 | HT1  | TABLE | TESTDB        | TESTDATA    |
| 2024-11-14 16:00:01.360 -0800 | ST1  | TABLE | TESTDB        | TESTDATA    |
+-------------------------------+------+-------+---------------+-------------+
```

You can restore the database to its “healthy” state, when it contained three tables, by creating a clone of `testdb` (named
`restore_testdb` in this case) with an appropriate timestamp. The timestamp specified here is very close to the point in time when the
table was created (and before it was dropped). In practice, you would have to choose the timestamp carefully, based on when data was
loaded into the table or other updates were applied. The main goal in this example is to capture the state of the table just before it was
dropped.

```sqlexample
CREATE OR REPLACE DATABASE restore_testdb
  CLONE testdb AT(TIMESTAMP => '2024-11-14 17:38'::TIMESTAMP_LTZ);
```

```output
+-----------------------------------------------+
| status                                        |
|-----------------------------------------------|
| Database RESTORE_TESTDB successfully created. |
+-----------------------------------------------+
```

Now you can check the contents of the new clone and verify that table `ht2` is there:

```sqlexample
USE DATABASE restore_testdb;
USE SCHEMA testdata;
SHOW TERSE TABLES;
```

```output
+-------------------------------+------+-------+----------------+-------------+
| created_on                    | name | kind  | database_name  | schema_name |
|-------------------------------+------+-------+----------------+-------------|
| 2024-11-14 17:47:58.984 -0800 | HT1  | TABLE | RESTORE_TESTDB | TESTDATA    |
| 2024-11-14 17:47:58.984 -0800 | HT2  | TABLE | RESTORE_TESTDB | TESTDATA    |
| 2024-11-14 17:47:58.984 -0800 | ST1  | TABLE | RESTORE_TESTDB | TESTDATA    |
+-------------------------------+------+-------+----------------+-------------+
```

## Example: Restore a database to a point in time before an incorrect DML operation

A database named `ht_sensors` has a schema `ht_schema` that contains a table named `sensor_data_device2`.
Assume that a series of DELETE operations were run on this table on November 25th. In Snowsight, in the navigation menu, select Monitoring » Query History
to see information about these DELETE operations. (In this example, the SQL Text filter is set to `DELETE` to isolate them.)

If the second DELETE operation in the list was run by mistake (rows with `motor_rpm` values greater than 1504 were deleted),
you can clone the database to restore it to its
state directly before that operation was committed. (For the sake of simplicity in this example, let’s assume that
no other changes, such as updates or inserts, were applied to that table or any other table in the database
during this time frame.)

Before cloning the database, you can check Time Travel results with a simple query. In this way, you can verify that the clone
captures the expected data before running the more costly restore operation.

For example, compare the results of the following two Time Travel queries, which are one minute apart:

```sqlexample
SELECT COUNT(*) FROM sensor_data_service2
  AT(TIMESTAMP => 'Mon, 25 Nov 2024 14:09:00'::TIMESTAMP_LTZ) WHERE MOTOR_RPM>1504;
```

```output
+----------+
| COUNT(*) |
|----------|
|     1855 |
+----------+
```

```sqlexample
SELECT COUNT(*) FROM sensor_data_service2
  AT(TIMESTAMP => 'Mon, 25 Nov 2024 14:10:00'::TIMESTAMP_LTZ) WHERE MOTOR_RPM>1504;
```

```output
+----------+
| COUNT(*) |
|----------|
|        0 |
+----------+
```

The results confirm the expected difference. Now you can clone the database, using the same timestamp as the first query:

```sqlexample
USE DATABASE ht_sensors;
USE SCHEMA ht_schema;

CREATE OR REPLACE DATABASE restore_ht_sensors
  CLONE ht_sensors AT(TIMESTAMP => 'Mon, 25 Nov 2024 14:09:00'::TIMESTAMP_LTZ);
```

```output
+---------------------------------------------------+
| status                                            |
|---------------------------------------------------|
| Database RESTORE_HT_SENSORS successfully created. |
+---------------------------------------------------+
```

Now check the state of the cloned database. Keep in mind that the cloned version of table `sensor_data_device2` does not have any Time Travel data.

```sqlexample
USE DATABASE restore_ht_sensors;
USE SCHEMA ht_schema;
SELECT COUNT(*) FROM SENSOR_DATA_DEVICE2 WHERE motor_rpm>1504;
```

```output
+----------+
| COUNT(*) |
|----------|
|     1855 |
+----------+
```

The following Time Travel query against the new table fails as expected:

```sqlexample
SELECT COUNT(*) FROM SENSOR_DATA_DEVICE2 AT(TIMESTAMP => 'Mon, 25 Nov 2024 14:09:00'::TIMESTAMP_LTZ) WHERE MOTOR_RPM>1504;
```

```output
000707 (02000): Time travel data is not available for table SENSOR_DATA_DEVICE2. The requested time is either
beyond the allowed time travel period or before the object creation time.
```

Finally, note that the most recent DELETE operation in the query history might need to be reapplied because the cloned table retained
the rows where the `timestamp` column was greater than `2024-04-03 07:30:00.000`.

---
title: Clone dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-clone.md
section: User Guide
---

# Clone dynamic tables

Cloning creates a new dynamic table with the same column definitions and contains all the existing data from the source dynamic table, without
actually copying the data.

You can clone a dynamic table to a new dynamic table or regular table.

## Clone a dynamic table to a new dynamic table

Cloned dynamic tables, whether cloned directly or as part of a cloned database or schema, are suspended by default.

In [DYNAMIC_TABLE_GRAPH_HISTORY](../sql-reference/functions/dynamic_table_graph_history.md), this appears as CLONED_AUTO_SUSPENDED in the SCHEDULING_STATE column. Any
downstream dynamic tables are also suspended, shown as UPSTREAM_CLONED_AUTO_SUSPENDED. For more information, see
[Automatic dynamic table suspension](dynamic-tables-suspend-resume.md).

```sqlsyntax
-- Clone a dynamic table to a new dynamic table
CREATE [ OR REPLACE ] [ TRANSIENT ] DYNAMIC TABLE <name>
  CLONE <source_dynamic_table>
        [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
  [
    COPY GRANTS
    TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
    WAREHOUSE = <warehouse_name>
  ]
```

You can also clone a dynamic table as it existed at a specific point in the past:

```sqlexample
CREATE DYNAMIC TABLE my_cloned_dynamic_table CLONE my_dynamic_table AT (TIMESTAMP => TO_TIMESTAMP_TZ('04/05/2013 01:02:03', 'mm/dd/yyyy hh24:mi:ss'));
```

For more information, see [Cloning using Time Travel (databases, schemas, tables, dynamic tables, event tables, and streams only)](object-clone.md).

## Clone a dynamic table to a new table

Cloned tables inherit the same column definitions and data of the source dynamic table but lack dynamic table-specific properties. They retain
row access and masking policies, tags, clustering keys, and comments.

```sqlsyntax
-- Clone a dynamic table to a new table
CREATE [ OR REPLACE ] TABLE [ IF NOT EXISTS ] <name>
CLONE <source_dynamic_table_name>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
```

Cloning a dynamic table to a regular table follows the same considerations as [cloning a regular table](../sql-reference/sql/create-clone.md),
with the following exceptions:

* The source dynamic table has to be [initialized](dynamic-tables-refresh.md) in order to be cloned as a regular table.
* You can’t clone dynamic Apache Iceberg™ tables.

## Best practice for cloning pipelines of dynamic tables

Clone all elements of the dynamic table pipeline in the same clone command to avoid reinitializations of your pipeline. You can do this by
consolidating all elements of the pipeline (e.g. base tables, view, and dynamic tables) in the same schema or database.

---
title: Cloning considerations
source: https://docs.snowflake.com/en/user-guide/object-clone.md
section: User Guide
---

# Cloning considerations

This topic provides important considerations when cloning objects in Snowflake, particularly databases, schemas, and non-temporary tables. Factors
such as DDL and DML transactions (on the source object), Time Travel, and data retention periods can affect the object clone.

## Access control privileges for cloned objects

If the source object is a database or schema, the clone inherits all granted privileges on the clones of all child objects
contained in the source object:

* For databases, contained objects include schemas, tables, views, etc.
* For schemas, contained objects include tables, views, etc.

> **Note:**
>
> * The clone of the container itself (database or schema) doesn’t inherit the privileges granted on the source container.
> * For pipes, the role that creates the clone has the OWNERSHIP privilege on the pipes.

[CREATE <object> … CLONE](../sql-reference/sql/create-clone.md) statements for most objects do not copy grants on the source object to the object clone.
However, [CREATE <object>](../sql-reference/sql/create.md) commands that support the COPY GRANTS clause (for example, CREATE TABLE, CREATE VIEW) enable you to
optionally copy grants to object clones. For example, the [CREATE TABLE](../sql-reference/sql/create-table.md) … CLONE command syntax supports the
COPY GRANTS parameter. When the COPY GRANTS parameter is specified in a CREATE TABLE statement, the create operation copies all privileges,
except OWNERSHIP, from the source table to the new table. The same behavior is true for other CREATE commands that support the COPY GRANTS
clause.

In all other cases, you must grant any required privileges to the newly-created clone (using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)).

## Cloning and Snowflake objects

This section describes special cloning considerations with regard to specific Snowflake objects.

### Cloning and managed access schemas

If you clone a schema and specify the WITH MANAGED ACCESS clause, the required privileges depends on whether the source schema is a managed
or unmanaged schema. For details, see [CREATE SCHEMA privileges](../sql-reference/sql/create-schema.md).

### Cloning and object parameters

Cloned objects inherit any object parameters that were set on the source object when that object was cloned. If an object parameter can be
set on object containers (that is, account, database, schema) and isn’t explicitly set on the source object, an object clone inherits the
default parameter value or the value overridden at the lowest level. For more information about object parameters, see
[Parameters](../sql-reference/parameters.md).

### Cloning and default sequences

In a table, a column can reference a [sequence](querying-sequences.md) that generates default values. When a table is cloned,
the cloned table references the source or cloned sequence:

* If the database or schema containing both the table and sequence is cloned, the cloned table references the cloned sequence.
* Otherwise, the cloned table references the source sequence.

  For example, if the sequence is defined in a different database or schema, the cloned table references the source sequence. Or if you
  clone just the table itself, the cloned table references the source sequence.

  If you don’t want the new table to continue using the source sequence, run the following command:

  ```sqlsyntax
  ALTER TABLE <table_name> ALTER COLUMN <column_name> SET DEFAULT <new_sequence>.nextval;
  ```

### Cloning and foreign key constraints

A table can have a foreign key constraint that references a table that includes the primary key. When a table with a foreign key constraint
is cloned, the cloned table references the source or cloned table that includes the primary key:

* If the database or schema containing both tables is cloned, the cloned table with the foreign key references the primary key in the other
  cloned table.
* If the tables are in separate databases or schemas, the cloned table references the primary key in the source table.

### Cloning and clustering keys

A table can have a subset of columns designated as a [clustering key](tables-clustering-keys.md) to co-locate similar rows in the
same micro-partition. When a table with a clustering key is cloned, the new table is created with a clustering key. By default,
[Automatic Clustering](tables-auto-reclustering.md) is suspended for the new table. To resume automatic clustering for the new table, run the
following command:

```sqlsyntax
ALTER TABLE <name> RESUME RECLUSTER
```

### Cloning and stages

You can clone external named stages individually. An external stage references a bucket or container in external cloud storage; cloning an external stage has
no impact on the referenced cloud storage.

You can optionally clone internal named stages when you clone a database or schema.

When cloning a database or schema:

* External named stages that were present in the source when the cloning operation started are cloned.
* Tables are cloned, which means the internal stage associated with each table is also cloned. Any data files that were present in a table stage in the
  source database or schema aren’t copied to the clone (that is, the cloned table stages are empty).
* Internal named stages are cloned if you use the INCLUDE INTERNAL STAGES clause. For more information,
  see the [internal stage cloning usage notes](../sql-reference/sql/create-clone.md).

### Cloning and Apache Iceberg™ tables

#### Storage

Storage for cloned Iceberg tables works the same as storage for other cloned Snowflake objects;
clones share the same underlying storage as the source table.

For information about how storage works for cloned objects, see [Cloned table, schema, and database storage](tables-storage-considerations.md).

For information about Iceberg table storage, see [Storage for Apache Iceberg™ tables](tables-iceberg-storage.md).

#### Data manipulation language (DML) commands

You can use DML commands on cloned Iceberg tables just as you do on regular Snowflake-managed tables. For instructions and examples, see
[Use DML commands](tables-iceberg-manage.md).

For DML operations on cloned tables, Snowflake generates new data files and stores them in the base location of the source table.
The diverging data files don’t affect the source table; DML operations on the source table are reflected only in the source table’s data files.

#### Iceberg metadata

For cloned tables, Snowflake generates Iceberg metadata files that are distinct from those of the source table.
For example, a cloned Iceberg table has its own `metadata.json` file with a unique `table-uuid`, `last-sequence-number`, and other properties.
Cloned table backups don’t include any backup information from the source table.

#### Apache Iceberg™ tables with Snowflake storage

[Preview Feature](../release-notes/preview-features.md) — Open

Available to all accounts.

For Iceberg tables that use Snowflake-provided storage (`EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'`), CREATE ICEBERG TABLE … CLONE
succeeds only when the source table and the new table are **both** transient or **both** permanent. If one is transient and the other
is permanent, the statement fails.

| Source table | Clone | Result |
| --- | --- | --- |
| Transient | Transient | Supported |
| Permanent | Permanent | Supported |
| Transient | Permanent | Not supported |
| Permanent | Transient | Not supported |

For more information, see [Snowflake storage for Apache Iceberg™ tables](tables-iceberg-internal-storage.md).

### Cloning and event tables

When cloning an event table, you can clone to and from only event tables. In other words, you can not clone from a regular table to an
event table, nor from an event table to a regular table.

### Cloning and pipes

When a database or schema is cloned, any pipes in the source container that reference an
internal (that is, Snowflake) stage are not cloned.

However, any pipes that reference an external stage are cloned. This includes any pipe objects
where the INTEGRATION parameter is set. This parameter points to a notification integration to
enable auto-ingest Snowpipe when loading data from files in Google Cloud Storage or Microsoft
Azure blob storage.

When you clone a database or schema that contains any pipes through a CREATE .. CLONE
command, the role that creates the clone takes ownership of the cloned pipe. To copy the grants,
especially the ownership of the pipe, you can add the COPY GRANTS option when cloning databases or
schemas that contain pipe objects.

When a data file is created in a stage location (for example, blob storage container), a copy of the notification is sent to every pipe that matches the stage location. This results in the following behavior:

> * If a table is fully qualified in the COPY statement in the pipe definition (in the form of
>   `db_name.schema_name.table_name` or `schema_name.table_name`), then Snowpipe loads duplicate data into the
>   source table (that is, the `database.schema.table` in the COPY statement) for each pipe.
> * If a table is not fully qualified in the pipe definition, then Snowpipe loads the data into the table (for example, `mytable`) in
>   the source and cloned databases/schemas.

The default state of a pipe clone is as follows:

> * When `AUTO_INGEST = FALSE`, a cloned pipe is paused by default.
> * When `AUTO_INGEST = TRUE`, a cloned pipe is set to the `STOPPED_CLONED` state. In this state, pipes don’t accumulate event
>   notifications as a result of newly staged files. When a pipe is explicitly resumed, it only processes data files triggered as a result
>   of new event notifications.

A pipe clone in either state can be resumed by executing an [ALTER PIPE](../sql-reference/sql/alter-pipe.md) … RESUME statement.

### Cloning and search optimization

You can clone tables that have the [Search optimization service](search-optimization-service.md) enabled. When you do, the corresponding search
access path is a [zero-copy clone](tables-storage-considerations.md). However, if the cloned search access path isn’t up-to-date,
it might incur maintenance costs, even if the cloned table doesn’t change, because the search access path must catch up with the
current state of the cloned table. For more information about cloning and search optimization, see
[Cloning the table, schema, or database](search-optimization/working-with-tables.md).

### Cloning and streams

Currently, when a database or schema that contains source tables and streams is cloned, any unconsumed records in the streams (in the
clone) are inaccessible. This behavior is consistent with [Time Travel](data-time-travel.md) for tables. If a table is
cloned, historical data for the table clone begins at the time/point when the clone was created.

### Cloning and tasks

When a database or schema that contains tasks is cloned, the tasks in the clone are suspended by default. The tasks can be resumed
individually (using [ALTER TASK](../sql-reference/sql/alter-task.md) … RESUME).

### Cloning and alerts

When a database or schema that contains alerts is cloned, the alerts in the clone are
[suspended](alerts.md) by default.

To resume a suspended alert, you can use the [ALTER ALERT](../sql-reference/sql/alter-alert.md) … RESUME command.

### Cloning and governance objects

Masking & row access policies:

> The following approach helps to safeguard data from users with the SELECT privilege on the table or view when accessing a cloned object:
>
> * Cloning an individual policy object is not supported.
> * Cloning a schema results in the cloning of all policies within the schema.
> * A cloned table maps to the same policies as the source table. In other words, if a policy is set on the base table or its columns, the
>   policy is attached to the cloned table or its columns.
>
>   + If a table or view exists in the source schema/database and has references to policies in the same schema/database, the cloned table or
>     view is mapped to the corresponding cloned policy (in the target schema/database) instead of the policy in the source schema/database.
>   + If the source table refers to a policy in a different schema (i.e. a foreign reference), then the cloned table retains the
>     foreign reference.
>
> For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).
>
> Also see:
>
> * Cloning external tables and [masking policies](security-column-intro.md).
> * Cloning external tables and [row access policies](security-row-intro.md).

Tags:

> * Tag associations in the source object (e.g. table) are maintained in the cloned objects.
> * For a database or a schema:
>
>   When a database or schema is cloned, tags that reside in that schema or database are also cloned.
>
>   If a table or view exists in the source schema/database and has references to tags in the same schema or database, the cloned table or view is mapped to the corresponding cloned tag (in the target schema/database) instead of the tag in the source schema or database.

Tag-based masking policies:

> For a tag-based masking policy where the tag is stored in a different schema than the masking policy and table, cloning the schema
> containing the masking policy and table results in the cloned table being protected by the masking policy in the source schema not the
> cloned schema.
>
> However, for a tag-based masking policy where the tag, masking policy, and table all exist in the schema, cloning the schema results in the
> table being protected by the masking policy in the cloned schema, not the source schema.
>
> If the table is cloned or moved to a different schema or database and was originally protected by a tag-based masking policy set on the
> schema or database, the table is not protected by the tag-based masking policy set on the source schema or database. The table is
> protected by the tag-based masking policy set on the target schema or database, if there is a tag-based masking policy set on the target
> schema or database.

### Cloning and differential privacy

Cloning a table or view that is protected by [differential privacy](diff-privacy/differential-privacy-overview.md) results
in the following behavior.

#### Privacy policies

When you clone a privacy-protected table or view, the object is also privacy-protected. Whether the privacy policy is cloned depends on
what you are cloning:

* If you clone the privacy-protected table only, the privacy policy isn’t cloned.
* If you clone a schema that contains both the table and the privacy policy, the privacy policy is cloned.
* If you clone a database that contains a schema that contains both the table and the privacy policy, the privacy policy is cloned.

If the privacy policy and the table are in different schemas, cloning the database or schema of the table doesn’t clone the privacy policy.
In this case, the privacy policy is automatically associated with the cloned objects.

#### Privacy domains

When you clone a privacy-protected table or view, the privacy domains set on the columns are also cloned.

Keep the following in mind when cloning a privacy-protected table or view with a REFERENCE privacy domain:

* If you clone a privacy-protected table but not the referenced table, the new table continues to reference the same table.
* If you clone both the privacy-protected table and the referenced table, the new privacy-protected table references the new cloned version
  of the referenced table.
* If the REFERENCE privacy domain references itself, the newly cloned table references itself, not the original table.

### Cloning and database roles

You can clone a database role using the CREATE DATABASE ROLE … CLONE command if the database role doesn’t already exist in the target
database. For details, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

### Cloning and Java UDFs

A Java UDF can be cloned when the database or schema containing the Java UDF is cloned. To be cloned, the Java UDF must meet certain
conditions. For more information, see [Limitations on cloning](../developer-guide/udf/java/udf-java-limitations.md).

### Cloning and instances of Snowflake classes

An instance of the [CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier.md) is cloned when the schema that contains the instance
is cloned. Cloning of instances of other Snowflake [classes](../sql-reference-classes.md) is *not* supported.

### Cloning and WORM backups

The backup set and backup policy objects that are used in [Write Once Read Many (WORM) backups](backups.md)
can’t be cloned. If you clone a schema or database that contains such objects, they aren’t included in the cloned schema or
database.

## Impact of DDL on cloning

Cloning is fast, but not instantaneous, particularly for large objects (for example, tables). As such, if DDL statements are executed on source objects
(for example, renaming tables in a schema) while the cloning operation is in progress, the changes may not be represented in the clone. This is because
DDL statements are atomic and not part of multi-statement transactions.

Furthermore, Snowflake doesn’t record which object names were present when the cloning operation started and which names changed. As such, DDL
statements that rename (or drop and recreate) source child objects compete with any in-progress cloning operations and can cause name conflicts.

In the following example, the `t_sales` table is dropped and another table is altered and given the same name as the dropped table while
the parent database is being cloned, producing an error:

> ```sqlexample
> CREATE OR REPLACE DATABASE staging_sales CLONE sales;
>
> DROP TABLE sales.public.t_sales;
>
> ALTER TABLE sales.public.t_sales_20170522 RENAME TO sales.public.t_sales;
> ```
>
> ```output
> 002002 (42710): None: SQL compilation error: Object 'T_SALES' already exists.
> ```

> **Tip:**
>
> To avoid conflicts in name resolution during a cloning operation, we suggest refraining from renaming objects to a name previously used by
> a dropped object until cloning is completed.

## Impact of DML and data retention on cloning

The [data retention period](data-time-travel.md) specifies the number of days for which Snowflake retains historical
data for performing
Time Travel actions on an object. Because the data retained for Time Travel incurs storage costs at the table-level, some users set this parameter
to `0` for some tables, effectively disabling data retention for these tables (that is, when the value is set to `0`, Time Travel data
retained for DML transactions is purged, incurring negligible additional storage costs).

Cloning operations require time to complete, particularly for large tables. During this period, DML transactions can alter the data in a source
table. Subsequently, Snowflake attempts to clone the table data as it existed when the operation began. However, if data is purged for DML
transactions that occur during cloning (because the retention time for the table is `0`), the data is unavailable to complete the operation,
producing an error similar to the following:

> ```output
> ProgrammingError occurred: "000707 (02000): None: Data is not available." with query id None
> ```

> **Tip:**
>
> As a workaround, we recommend either of the following best practices when cloning an object:
>
> * Refrain, if possible, from executing DML transactions on the source object (or any of its children) until after the cloning operation
>   completes.
> * If this isn’t possible, prior to starting cloning, set `DATA_RETENTION_TIME_IN_DAYS=1` for all tables in the schema (or database if
>   you are cloning an entire database). Once the operation completes, remember to reset the parameter value back to `0` for those tables
>   in the source, if desired.
>
>   You might also want to set the value to `0` for the cloned tables (if you plan to make DML changes to the cloned tables and don’t wish
>   to incur additional storage costs for Time Travel on the tables).

## Cloning using Time Travel (databases, schemas, tables, dynamic tables, event tables, and streams only)

This section provides information to consider when using [Time Travel](data-time-travel.md) to clone objects at a specific time/point in the
past.

### Cloning of historical objects

If the source object didn’t exist at the time/point set in the [AT | BEFORE](../sql-reference/constructs/at-before.md) clause, an error is returned.

In the following example, a CREATE TABLE … CLONE statement attempts to clone the source table at a point in the past (30 minutes prior) when
it didn’t exist:

> ```sqlexample
> CREATE TABLE t_sales (numeric integer) data_retention_time_in_days=1;
>
> CREATE OR REPLACE TABLE sales.public.t_sales_20170522 CLONE sales.public.t_sales at(offset => -60*30);
> ```
>
> ```output
> 002003 (02000): SQL compilation error:
> Object 'SALES.PUBLIC.T_SALES' does not exist.
> ```

Any child object in a cloned database or schema that didn’t exist at the specified time/point isn’t cloned.

The cloning operation fails in the following scenarios:

> * If the specified Time Travel time is beyond the retention time of any current child of the cloned database or schema.
>
>   As a workaround for child objects that have been purged from Time Travel, use the
>   [IGNORE TABLES WITH INSUFFICIENT DATA RETENTION](../sql-reference/sql/create-clone.md) parameter of the
>   CREATE <object> … CLONE command. For more information, see Child objects and data retention time.
> * If a pipe object with `AUTO_INGEST = TRUE` set was recreated (using the CREATE OR REPLACE PIPE syntax) or dropped since the point
>   in time specified in the AT | BEFORE clause. This limitation doesn’t apply to pipe objects created for manual Snowpipe ingest using the
>   REST API (that is, with `AUTO_INGEST = FALSE`).
> * If the [IGNORE HYBRID TABLES parameter](../sql-reference/sql/create-clone.md) isn’t specified and any hybrid tables exist in the
>   specified database or schema.

#### Child objects and data retention time

If a child object (for example, a table) has a shorter [data retention period](data-time-travel.md) than
the data retention period for its parent object (for example, a database or schema), the child object’s historical data is moved out
of Time Travel before the historical data of its parent object is moved out of Time Travel.

For example, the [data retention period](data-time-travel.md) for database `db1` is seven days and the
data retention period for table `t1` in `db1` is one day. If you clone `db1` using Time Travel at a point 12 hours in the past,
the cloning operation successfully creates a clone of `db1` and it contains the cloned table `t1`.

However, if you try to clone `db1` at a point two days in the past, the historical data for table `t1` at that point is no
longer available in Time Travel and the cloning operation fails.

As a workaround, use the [IGNORE TABLES WITH INSUFFICIENT DATA RETENTION](../sql-reference/sql/create-clone.md)
parameter of the [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md) command to clone a database or schema. The parameter skips tables that no
longer have historical data available in Time Travel at the time specified for the cloning operation.

#### Cloning of historical object metadata

An object clone inherits the name and structure of the source object current at the time the [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md) statement
is executed or at a specified time/point in the past using [Time Travel](data-time-travel.md). An object clone inherits any other metadata,
such as comments or table clustering keys, that is current in the source object at the time the statement is executed, regardless of whether Time Travel is
used.

> **Note:**
>
> To ensure consistent behavior in long cloning operations, when an AT or BEFORE clause isn’t specified for a CREATE *<object>* … CLONE statement, the
> cloning operation internally sets the AT clause value as the timestamp when the statement was initiated.

## Cloning and replication

For more information, see [Replication and cloning](account-replication-considerations.md).

---
title: Clustering Keys & Clustered Tables
source: https://docs.snowflake.com/en/user-guide/tables-clustering-keys.md
section: User Guide
---

# Clustering Keys & Clustered Tables

In general, Snowflake produces well-clustered data in tables; however, over time, particularly as DML occurs on very large tables (as defined by the amount of data in the table,
not the number of rows), the data in some table rows might no longer cluster optimally on desired dimensions.

To improve the clustering of the underlying table micro-partitions, you can always manually sort rows on key table columns and re-insert them into the table; however, performing
these tasks could be cumbersome and expensive.

Instead, Snowflake supports automating these tasks by designating one or more table columns/expressions as a *clustering key* for the table. A table with a clustering key defined
is considered to be *clustered*.

You can cluster [materialized views](views-materialized.md), as well as tables. The rules for
clustering tables and materialized views are generally the same. For a few additional tips specific to materialized
views, see [Materialized Views and Clustering](views-materialized.md) and
[Best Practices for Materialized Views](views-materialized.md).

> **Attention:**
>
> Clustering keys are not intended for all tables due to the costs of initially clustering the data and
> maintaining the clustering. Clustering is optimal when either:
>
> * You require the fastest possible response times, regardless of cost.
> * Your improved query performance offsets the credits required to cluster and maintain the table.
>
> For more information about choosing which tables to cluster, see: Considerations for Choosing Clustering for a Table.

## What is a Clustering Key?

A clustering key is a subset of columns in a table (or expressions on a table) that are explicitly designated to co-locate the data in the table in the same
[micro-partitions](tables-clustering-micropartitions.md). This is useful for very large tables where the ordering was not ideal (at the time the data was inserted/loaded) or
extensive DML has caused the table’s natural clustering to degrade.

Some general indicators that can help determine whether to define a clustering key for a table include:

* Queries on the table are running slower than expected or have noticeably degraded over time.
* The [clustering depth](tables-clustering-micropartitions.md) for the table is large.

A clustering key can be defined at table creation (using the [CREATE TABLE](../sql-reference/sql/create-table.md) command) or afterward (using the [ALTER TABLE](../sql-reference/sql/alter-table.md) command).
The clustering key for a table can also be altered or dropped at any time.

> **Attention:**
>
> Clustering keys cannot be defined for [hybrid tables](tables-hybrid.md). In hybrid tables, data is always ordered by primary key.

## Benefits of Defining Clustering Keys (for Very Large Tables)

Using a clustering key to co-locate similar rows in the same micro-partitions enables several benefits for very large tables, including:

* Improved scan efficiency in queries by skipping data that does not match filtering predicates.
* Better column compression than in tables with no clustering. This is especially true when other columns are strongly correlated with the columns that comprise the clustering key.
* After a key has been defined on a table, no additional administration is required, unless you chose to drop or modify the key. All future maintenance on the rows in the table
  (to ensure optimal clustering) is performed automatically by Snowflake.

Although clustering can substantially improve the performance and reduce the cost of some queries, the compute resources used to perform clustering consume credits. As such, you
should cluster only when queries will benefit substantially from the clustering.

Typically, queries benefit from clustering when the queries filter or sort on the clustering key for the table. Sorting is commonly done for `ORDER BY` operations,
for `GROUP BY` operations, and for some joins. For example, the following join would likely cause Snowflake to perform a sort operation:

> ```sqlexample
> SELECT ...
>     FROM my_table INNER JOIN my_materialized_view
>         ON my_materialized_view.col1 = my_table.col1
>     ...
> ```

In this pseudo-example, Snowflake is likely to sort the values in either `my_materialized_view.col1` or `my_table.col1`. For example, if the values in `my_table.col1` are
sorted, then as the materialized view is being scanned, Snowflake can quickly find the corresponding row in `my_table`.

The more frequently a table is queried, the more benefit clustering provides. However, the more frequently a table changes, the more expensive it will be to keep it
clustered. Therefore, clustering is generally most cost-effective for tables that are queried frequently and do not change frequently.

> **Note:**
>
> After you define a clustering key for a table, the rows are not necessarily updated immediately. Snowflake only performs automated maintenance if the table will benefit from
> the operation. For more details, see Reclustering (in this topic) and [Automatic Clustering](tables-auto-reclustering.md).

## Considerations for Choosing Clustering for a Table

Whether you want faster response times or lower overall costs, clustering is best for a table that meets all of
the following criteria:

* The table contains a large number of [micro-partitions](tables-clustering-micropartitions.md). Typically, this means that
  the table contains multiple terabytes (TB) of data.
* The queries can take advantage of clustering. Typically, this means that one or both of the following are true:

  + The queries are selective. In other words, the queries need to read only a small percentage of rows (and thus usually a small
    percentage of micro-partitions) in the table.
  + The queries sort the data. (For example, the query contains an ORDER BY clause on the table.)
* A high percentage of the queries can benefit from the same clustering key(s). In other words, many/most queries select on,
  or sort on, the same few column(s).

If your goal is primarily to reduce overall costs, then each clustered table should have a high ratio of queries to DML operations
(INSERT/UPDATE/DELETE). This typically means that the table is queried frequently and updated infrequently. If you want to
cluster a table that experiences a lot of DML, then consider grouping DML statements in large, infrequent batches.

Also, before choosing to cluster a table, Snowflake strongly recommends that you test a representative set of queries on
the table to establish some performance baselines.

## Strategies for Selecting Clustering Keys

A single clustering key can contain one or more columns or expressions. For most tables, Snowflake recommends a
maximum of 3 or 4 columns (or expressions) per key. Adding more than 3-4 columns tends to increase costs more than
benefits.

Selecting the right columns/expressions for a clustering key can dramatically impact query performance. Analysis of
your workload will usually yield good clustering key candidates.

Snowflake recommends prioritizing keys in the order below:

1. Cluster columns that are most actively used in selective filters. For many fact tables involved in date-based
   queries (for example “WHERE invoice_date > x AND invoice date <= y”), choosing the date column is a good idea.
   For event tables, event type might be a good choice, if there are a large number of different event types. (If your
   table has only a small number of different event types, then see the comments on cardinality below before choosing
   an event column as a clustering key.)
2. If there is room for additional cluster keys, then consider columns frequently used in join predicates, for example
   “FROM table1 JOIN table2 ON table2.column_A = table1.column_B”.

If you typically filter queries by two dimensions (e.g. `application_id` and `user_status` columns), then
clustering on both columns can improve performance.

The number of distinct values (i.e. cardinality) in a column/expression is a critical aspect of selecting it as a clustering key. It is important to choose a clustering key that has:

* A large enough number of distinct values to enable effective pruning on the table.
* A small enough number of distinct values to allow Snowflake to effectively group rows in the same micro-partitions.

A column with very low cardinality might yield only minimal pruning, such as a column named `IS_NEW_CUSTOMER`
that contains only Boolean values.
At the other extreme, a column with very high cardinality is also typically not a good candidate to use as a clustering key directly.
For example, a column that contains nanosecond timestamp values would not make a good clustering key.

> **Tip:**
>
> In general, if a column (or expression) has higher cardinality, then maintaining clustering on that column is
> more expensive.
>
> The cost of clustering on a unique key might be more than the benefit of clustering on that key,
> especially if point lookups are not the primary use case for that table.
>
> If you want to use a column with very high cardinality as a clustering key, Snowflake recommends defining the key as an
> expression on the column, rather than on the column directly, to reduce the number of distinct values. The
> expression should preserve the original ordering of the column so that the minimum and maximum values in each
> partition still enable pruning.
>
> For example, if a fact table has a TIMESTAMP column `c_timestamp` containing many discrete values (many more than
> the number of micro-partitions in the table), then a clustering key could be defined on the column by casting the
> values to dates instead of timestamps (e.g. `to_date(c_timestamp)`). This would reduce the cardinality to the
> total number of days, which typically produces much better pruning results.
>
> As another example, you can truncate a number to fewer significant digits by using the `TRUNC` functions and a
> negative value for the scale (e.g. `TRUNC(123456789, -5)`).

> **Tip:**
>
> If you are defining a multi-column clustering key for a table, the order in which the columns are specified in
> the `CLUSTER BY` clause is important. As a general rule, Snowflake recommends ordering the columns from
> lowest cardinality to highest cardinality. Putting a higher cardinality column before a lower
> cardinality column will generally reduce the effectiveness of clustering on the latter column.

> **Tip:**
>
> When clustering on a text field, the cluster key metadata tracks only the first several bytes (typically 5 or 6 bytes).
> Note that for multi-byte character sets, this can be fewer than 5 characters.

In some cases, clustering on columns used in `GROUP BY` or `ORDER BY` clauses can be helpful. However, clustering
on these columns is usually less helpful than clustering on columns that are heavily used in filter or `JOIN`
operations. If you have some columns that are heavily used in filter/join operations and different columns that are
used in `ORDER BY` or `GROUP BY` operations, then favor the columns used in the filter and join operations.

## Reclustering

As DML operations (INSERT, UPDATE, DELETE, MERGE, COPY) are performed on a clustered table, the data in the table might become less clustered. Periodic/regular reclustering of the table is required to
maintain optimal clustering.

During reclustering, Snowflake uses the clustering key for a clustered table to reorganize the column data, so that related records are relocated to the same micro-partition. This DML operation deletes the
affected records and re-inserts them, grouped according to the clustering key.

> **Note:**
>
> Reclustering in Snowflake is automatic; no maintenance is needed. For more details, see [Automatic Clustering](tables-auto-reclustering.md).
>
> However, for certain accounts, manual reclustering has been deprecated, but is still allowed. For more details see [Manual Reclustering](tables-clustering-manual.md).

### Credit and Storage Impact of Reclustering

Similar to all DML operations in Snowflake, reclustering consumes credits. The number of credits consumed depends on the size of the table and the amount of data that needs to be reclustered.

Reclustering also results in storage costs. Each time data is reclustered, the rows are physically grouped based on the clustering key for the table, which results in Snowflake generating new
micro-partitions for the table. Adding even a small number of rows to a table can cause all micro-partitions that contain those values to be recreated.

This process can create significant data turnover because the original micro-partitions are marked as deleted, but retained in the system to enable Time Travel and Fail-safe. The original micro-partitions
are purged only after both the Time Travel retention period and the subsequent Fail-safe period have passed (i.e. minimum of 8 days and up to 97 days for extended Time Travel, if you are using Snowflake
Enterprise Edition (or higher)). This typically results in increased storage costs. For more information, see [Snowflake Time Travel & Fail-safe](data-availability.md).

> **Important:**
>
> Before defining a clustering key for a table, you should consider the associated credit and storage costs.

### Reclustering Example

Building on the [clustering diagram](tables-clustering-micropartitions.md) from the previous topic, this diagram illustrates how reclustering a table can help reduce scanning of micro-partitions to improve
query performance:

* To start, table `t1` is naturally clustered by `date` across micro-partitions 1-4.
* The query (in the diagram) requires scanning micro-partitions 1, 2, and 3.
* `date` and `type` are defined as the clustering key. When the table is reclustered, new micro-partitions (5-8) are created.
* After reclustering, the same query only scans micro-partition 5.

In addition, after reclustering:

* Micro-partition 5 has reached a *constant state* (i.e. it cannot be improved by reclustering) and is therefore excluded when computing depth and overlap for future maintenance. In a well-clustered
  large table, most micro-partitions will fall into this category.
* The original micro-partitions (1-4) are marked as deleted, but are not purged from the system; they are retained for [Time Travel and Fail-safe](data-availability.md).

> **Note:**
>
> This example illustrates the impact of reclustering on an extremely small scale. Extrapolated to a very large table (i.e. consisting of millions of micro-partitions or more), reclustering can have a
> significant impact on scanning and, therefore, query performance.

## Defining Clustered Tables

### Calculating the Clustering Information for a Table

Use the system function, [SYSTEM$CLUSTERING_INFORMATION](../sql-reference/functions/system_clustering_information.md), to calculate clustering details, including clustering depth, for a given table. This function can be run on
any columns on any table, regardless of whether the table has an explicit clustering key:

* If a table has an explicit clustering key, the function doesn’t require any input arguments other than the name of the table.
* If a table doesn’t have an explicit clustering key (or a table has a clustering key, but you want to calculate the ratio on other columns in the table), the function takes the desired column(s) as an
  additional input argument.

### Defining a Clustering Key for a Table

A clustering key can be defined when a table is created by appending a `CLUSTER BY` clause to [CREATE TABLE](../sql-reference/sql/create-table.md):

```sqlsyntax
CREATE TABLE <name> ... CLUSTER BY ( <expr1> [ , <expr2> ... ] )
```

Where each clustering key consists of one or more table columns/expressions, which can be of any data type, except
GEOGRAPHY, VARIANT, OBJECT, or ARRAY. A clustering key can contain any of the following:

* Base columns.
* Expressions on base columns.
* Expressions on paths in VARIANT columns.

For example:

> ```sqlexample
> -- cluster by base columns
> CREATE OR REPLACE TABLE t1 (c1 DATE, c2 STRING, c3 NUMBER) CLUSTER BY (c1, c2);
>
> SHOW TABLES LIKE 't1';
>
> +-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by     | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 12:06:07.517 -0700 | T1   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(C1, C2) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------+
>
> -- cluster by expressions
> CREATE OR REPLACE TABLE t2 (c1 timestamp, c2 STRING, c3 NUMBER) CLUSTER BY (TO_DATE(C1), substring(c2, 0, 10));
>
> SHOW TABLES LIKE 't2';
>
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by                                     | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 12:07:51.307 -0700 | T2   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(CAST(C1 AS DATE), SUBSTRING(C2, 0, 10)) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------+
>
> -- cluster by paths in variant columns
> CREATE OR REPLACE TABLE T3 (t timestamp, v variant) cluster by (v:"Data":id::number);
>
> SHOW TABLES LIKE 'T3';
>
> +-------------------------------+------+---------------+-------------+-------+---------+-------------------------------------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by                                | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+-------------------------------------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 16:30:11.330 -0700 | T3   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(TO_NUMBER(GET_PATH(V, 'Data.id'))) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+-------------------------------------------+------+-------+----------+----------------+----------------------+
> ```

#### Important Usage Notes

* For each VARCHAR column, the current implementation of clustering uses only the first 5 bytes.

  If the first N characters are the same for every row, or do not provide sufficient cardinality, then consider clustering on a
  substring that starts after the characters that are identical, and that has optimal cardinality. (For more information about
  optimal cardinality, see Strategies for Selecting Clustering Keys.) For example:

  > ```sqlexample
  > create or replace table t3 (vc varchar) cluster by (SUBSTRING(vc, 5, 5));
  > ```
* If you define two or more columns/expressions as the clustering key for a table, the order has an impact on how the data is clustered in micro-partitions.

  For more details, see Strategies for Selecting Clustering Keys (in this topic).
* An existing clustering key is copied when a table is created using CREATE TABLE … CLONE. However, Automatic Clustering is
  [suspended for the cloned table](object-clone.md) and must be resumed.
* An existing clustering key is not supported when a table is created using CREATE TABLE … AS SELECT; however, you can define a clustering key after the table is created.
* Defining a clustering key directly on top of VARIANT columns is not supported; however, you can specify a VARIANT column in a clustering key if you provide an expression consisting of
  the path and the target type.

### Changing the Clustering Key for a Table

At any time, you can add a clustering key to an existing table or change the existing clustering key for a table using [ALTER TABLE](../sql-reference/sql/alter-table.md):

```sqlsyntax
ALTER TABLE <name> CLUSTER BY ( <expr1> [ , <expr2> ... ] )
```

For example:

> ```sqlexample
> -- cluster by base columns
> ALTER TABLE t1 CLUSTER BY (c1, c3);
>
> SHOW TABLES LIKE 't1';
>
> +-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by     | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 12:06:07.517 -0700 | T1   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(C1, C3) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+----------------+------+-------+----------+----------------+----------------------+
>
> -- cluster by expressions
> ALTER TABLE T2 CLUSTER BY (SUBSTRING(C2, 5, 15), TO_DATE(C1));
>
> SHOW TABLES LIKE 't2';
>
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by                                     | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 12:07:51.307 -0700 | T2   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(SUBSTRING(C2, 5, 15), CAST(C1 AS DATE)) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------+------+-------+----------+----------------+----------------------+
>
> -- cluster by paths in variant columns
> ALTER TABLE T3 CLUSTER BY (v:"Data":name::string, v:"Data":id::number);
>
> SHOW TABLES LIKE 'T3';
>
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------------------------------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by                                                                   | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------------------------------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 16:30:11.330 -0700 | T3   | TESTDB        | PUBLIC      | TABLE |         | LINEAR(TO_CHAR(GET_PATH(V, 'Data.name')), TO_NUMBER(GET_PATH(V, 'Data.id'))) |    0 |     0 | SYSADMIN | 1              | ON                   |
> +-------------------------------+------+---------------+-------------+-------+---------+------------------------------------------------------------------------------+------+-------+----------+----------------+----------------------+
> ```

#### Important Usage Notes

* When adding a clustering key to a table already populated with data, not all expressions are allowed to be specified in the key. You can check whether a specific function is supported using
  [SHOW FUNCTIONS](../sql-reference/sql/show-functions.md):

  > `show functions like 'function_name';`

  The output includes a column, `valid_for_clustering`, at the end of the output. This column displays whether the function can be used in a clustering key for a populated table.
* Changing the clustering key for a table does not affect existing records in the table until the table has been reclustered by Snowflake.

### Dropping the Clustering Keys for a Table

At any time, you can drop the clustering key for a table using [ALTER TABLE](../sql-reference/sql/alter-table.md):

```sqlsyntax
ALTER TABLE <name> DROP CLUSTERING KEY
```

For example:

> ```sqlexample
> ALTER TABLE t1 DROP CLUSTERING KEY;
>
> SHOW TABLES LIKE 't1';
>
> +-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> | created_on                    | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner    | retention_time | automatic_clustering |
> |-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------|
> | 2019-06-20 12:06:07.517 -0700 | T1   | TESTDB        | PUBLIC      | TABLE |         |            |    0 |     0 | SYSADMIN | 1              | OFF                  |
> +-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+----------+----------------+----------------------+
> ```

---
title: Code examples: Apache Spark™
source: https://docs.snowflake.com/en/user-guide/opencatalog/spark-code-examples.md
section: User Guide
---

# Code examples: Apache Spark™

This section provides code examples for using Apache Spark™ to do the following tasks in Snowflake Open Catalog:

* Configure a service connection
* Use a catalog
* List catalogs
* List namespaces
* Create a namespace
* Use a namespace
* Drop a namespace
* Create a table
* Query a table
* Show table properties
* List tables
* Drop a table

## Required privileges

To perform the commands included in the code examples, the following privileges must be bestowed to the service principal you use to connect
Spark to Open Catalog:

| Command | Required privilege |
| --- | --- |
| Show Namespaces | NAMESPACE_LIST |
| Create namespace | NAMESPACE_CREATE |
| Use namespace | NAMESPACE_READ_PROPERTIES |
| Show tables | TABLE_LIST |
| Create or replace table | * TABLE_WRITE_DATA * TABLE_CREATE |
| Drop namespace | NAMESPACE_DROP |
| Drop table | TABLE_DROP |
| Insert into table | TABLE_WRITE_DATA |
| Select from table | TABLE_READ_DATA |

## Configure a service connection

See [examples of configuring a service connection in Spark](register-service-connection.md).

## Use catalog

Use the catalog `catalog1`:

```python
spark.sql("use catalog1").show()
```

## List catalogs

List the catalogs you’re connected to:

```python
spark.sql("show catalogs").show()
```

## List namespaces

List the namespaces for the catalog you’re connected to:

```python
spark.sql("show namespaces").show()
```

## Create a namespace

Create the namespace `namespace1`:

```python
spark.sql("CREATE NAMESPACE namespace1")
```

## Use a namespace

Use the namespace `namespace1`:

```python
spark.sql("use namespace1").show()
```

## Drop a namespace

Drop the namespace `namespace1` from the catalog:

```python
spark.sql("DROP NAMESPACE namespace1")
```

## Create a table

Create a `customers` table under the parent namespace `namespace1`:

```python
spark.sql ("use namespace1");
spark.sql("CREATE OR REPLACE TABLE customers (id int, custnum int) using iceberg")
```

## Query a table

Query the `customers` table:

```python
spark.sql ("use namespace1");
spark.sql("SELECT * FROM customers").show()
```

## Show table properties

Show the table properties for the `customers` table:

```python
spark.sql("SHOW TBLPROPERTIES customers").show(50, False)
```

## List tables

List the tables for the catalog you’re connected to:

```python
spark.sql("show tables").show()
```

## Drop a table

Drop the `customers` table under parent namespace `namespace1`:

```python
spark.sql ("use namespace1");
spark.sql("DROP TABLE customers")
```

---
title: Common connectivity issues and resolutions
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/common-issues.md
section: User Guide
---

# Common connectivity issues and resolutions

This topic outlines the steps for troubleshooting connectivity issues that are likely to be the root cause of the [common error messages](error-messages.md).

## Firewall or proxy SSL inspection issues

Snowflake does not support the alteration or modification of the TLS/SSL certificates for its services. Please work with your network administrator to ensure that all service endpoints returned by the [allowing the URLs](../hostname-allowlist.md) function have full passthrough access through your network.

If you have unique requirements specific to your network environment security that require further discussion on this topic, contact your account team.

## OCSP and port 80 issues

All communications with Snowflake use port 443. However, OCSP certification checks are transmitted over port 80. If port 80 is not open in your network, you might experience OCSP-related issues, which can be accompanied by an error mentioning OCSP (such as [JDCB Error 5](error-messages.md)). In these scenarios, your organization’s system or network administrator needs to open the firewall to traffic on ports 443 and 80 and to ensure that all URLs in the [Snowflake allowlist](../hostname-allowlist.md) are allowed.

> **Note:**
>
> No customer data is transferred over unencrypted HTTP; it is strictly data related to the OCSP operations. Also, note that Snowflake does not provide or maintain the OCSP Responders. The OCSP Cache Server is an exception, which is provided and operated by Snowflake.

If the issue persists after enabling port 80, try deleting all OCSP-related temporal cache files and retry connecting based on your operating system:

* Windows: `$HOME/AppData/Local/Snowflake/Caches`
* MacOS: `$HOME/Library/Caches/Snowflake`
* Linux: `$HOME/.cache/snowflake`

## Fetching large query result sets failures

At times, your client can fetch small query results but struggles with large ones because retrieving large results (over 100KB) requires clients to have full network access with certificate passthrough to all STAGE endpoints. You can frequently resolve these issues by [allowing the URLs](../hostname-allowlist.md) in the Snowflake allowlist in your proxy or firewall.

## DNS configuration issues

In Private Connectivity scenarios, DNS-related settings can be misconfigured on the host or the remote DNS server. These issues are usually accompanied by error messages like “Name or service not known” or “nodename nor servname provided, or not known” (such as [JDCB Error 2](error-messages.md)). If you configure [AWS PrivateLink](../admin-security-privatelink.md), [Azure Private Link](../privatelink-azure.md), or [Google Cloud Private Service Connect](../private-service-connect-google.md), your network administrator must [create and manage a DNS record for your connection URL](../client-redirect.md). Ensure that you performed all the configuration steps associated with your provider correctly.

## Transient network issues

Sometimes your issue might be transient, which can result from the temporary unavailability of the OCSP servers, remote DNS servers, Snowflake servers, or the client temporarily being unable to reach them.

## Further troubleshooting

If your issue is not transient, or the steps above do not resolve your issue, please follow the steps in [Troubleshooting steps](troubleshooting-steps.md).

---
title: Comparison between Snowpipe Streaming high-performance and classic SDKs
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-comparison.md
section: User Guide
---

# Comparison between Snowpipe Streaming high-performance and classic SDKs

This section summarizes the main differences between the classic and high-performance SDKs.

**Client and channel management**

* **OpenClient**: The high-performance SDK requires you to specify the `DB`, `SCHEMA`, and `PIPE`. In the classic SDK, you only need to specify a client `NAME`.
* **OpenChannel**: The high-performance SDK simplifies this by only requiring the channel name. The classic SDK requires you to specify the `DB`, `SCHEMA`, `TABLE`, and an `ERROR_OPTION`. The new SDK also returns an `OpenChannelResult` that contains the channel entity and status, removing the need for a separate RPC call to get the last committed offset token.
* **Support for offsetToken**: The new `openChannel` method now has an optional `offsetToken` parameter, allowing you to open a channel at a specific position. `openChannel(String channelName, (optional) String offsetToken)`.

**Data ingestion**

* **InsertRows renamed**: The `InsertRows` method is now called `AppendRows` in the high-performance SDK.
* **AppendResult removed**: The `appendRow` and `appendRows` methods no longer return an `AppendResult`. Their signatures have changed to `void appendRow(Map<String, Object> row, String offsetToken)` and `void appendRows(Iterable<Map<String, Object>> row, String startOffsetToken, String endOffsetToken)`.

**New asynchronous and utility methods**

* **GetChannelStatus**: This is a new API available on the `Channel` object.
* **waitForFlush**: New `waitForFlush` methods have been added to both the client and channel objects.

  + Client: `void close(boolean waitForFlush, Duration timeoutDuration)`
  + Channel and Client: `void waitForFlush((optional) Duration timeoutDuration)`
* **waitForCommit**: A new method, `CompletableFuture<Boolean> waitForCommit(Predicate<String> tokenChecker, Duration timeoutDuration)`, lets you wait for a commit to be confirmed.
* **initiateFlush**: This new method `void initiateFlush()` asynchronously calls a flush on a channel or client. The method lets you flush data without waiting for the timeout or size limits.

**Data type and parsing**

The high-performance architecture requires native objects for ARRAY and VARIANT columns and doesn’t auto-parse string literals.

| Column Type | Classic | High-performance |
| --- | --- | --- |
| OBJECT | Automatically parses JSON strings. | No change. Automatically parses JSON strings. |
| ARRAY | Implicitly parses strings. For example, “[1,2]” becomes [1,2]. | Type-strict. Treats strings as literals. For example, “[1,2]” becomes [“[1,2]”]. |
| VARIANT | Implicitly parses strings. For example, “true” becomes true. | Type-strict. Treats strings as literals. For example, “true” becomes “true”. |

To ensure semi-structured data is stored correctly in the high-performance architecture, pass native language objects — for example, Java List/Map or Python list/dict — instead of serialized JSON strings.

**Other changes**

* **GetLatestCommittedOffsetTokens**: This API is improved. In the high-performance SDK, it can now fetch offset tokens for channels not opened by the client and allows for partial failures.
* **isValid removed**: The `isValid` method is removed from the high-performance SDK.
* **Schema evolution support**: The high-performance SDK supports [schema evolution](../data-load-schema-evolution.md), a key capability for handling changing data schemas automatically.

The following tables show the API changes from the classic SDK to the high-performance SDK:

## SnowflakeStreamingIngestClientFactory and SnowflakeStreamingIngestClientFactory.Builder

| Classic | High-performance | Notes |
| --- | --- | --- |
| `builder(String name)` | `builder(String clientName, String dbName, String schemaName, String pipeName)` | `name` in the classic version = `clientName` in the high-performance version. |
| N/A | `setExecutorService(ExecutorService executorService)` | A new method. Allows you to specify the `ExecutorService` the SDK will use for its background tasks. |

## SnowflakeStreamingIngestClient

> | Classic | High-performance | Notes |
> | --- | --- | --- |
> | `String getName()` | `String getClientName()` | API name change only; the same information is returned. |
> | N/A | `String getDBName()` | New API. |
> | N/A | `String getPipeName()` | New API. |
> | N/A | `String getSchemaName()` | New API. |
> | `SnowflakeStreamingIngestChannel` `openChannel(OpenChannelRequest request)` | `OpenChannelResult` `openChannel(String channelName, (optional) String offsetToken)` | Different request args and return values. |
> | `Map<String,String> getLatestCommittedOffsetTokens` `(List<SnowflakeStreamingIngestChannel> channels)` | `Map<String, String> getLatestCommittedOffsetTokens` `(List<String> channelNames)` | Different request args. High-performance SDK enables the API to fetch the channel’s status that is opened by other clients and potentially don’t belong to the client. |
> | N/A | `ChannelStatusBatch getChannelStatus(List<String> channelNames)` | New API. |
> | `Void dropChannel(DropChannelRequest request)` | `Void dropChannel(String channelName)` | Different request argument. |
> | `Void setRefreshToken(String refreshToken)` | N/A | Removed. |
> | N/A | `CompletableFuture<Void> close(boolean waitForFlush, Duration timeoutDuration)` | A new client `close` method that has more control over the shutdown process. `waitForFlush`: A Boolean parameter to indicate whether the client should wait for all channels to flush before shutting down. `timeoutDuration`: A `Duration` to specify how long the client should wait for the flush to complete before a forced shutdown. |
> | N/A | `CompletableFuture<Void> waitForFlush((optional) Duration timeoutDuration)` | A new method to wait for the flush to complete. `timeoutDuration`: Specifies how long the client should wait before timing out. |
> | N/A | `void initiateFlush()` | A new method for clients to asynchronously trigger a flush and return immediately. |

## **SnowflakeStreamingIngestChannel**

> | Classic | High-performance | Notes |
> | --- | --- | --- |
> | `getLatestCommittedOffsetToken` | `getLatestCommittedOffsetToken` | This API has been improved. In the high-performance SDK, it can now fetch offset tokens for channels not opened by the client and allows for partial failures. |
> | `isValid` | N/A | Removed. |
> | N/A | `String getDBName()` | New API. |
> | N/A | `String getSchemaName()` | New API. |
> | N/A | `String getPipeName()` | New API. |
> | N/A | `String getFullyQualifiedPipeName()` | New API. |
> | `InsertValidationResponse insertRow(Map<String, Object> row, String offsetToken)` | `void appendRow(Map<String, Object> row, @Nullable String offsetToken)` | API name changed. Response type changed because there is no more validation on the client. |
> | `InsertValidationResponse insertRow(Iterable<Map<String, Object>> row, @Nullable String startOffsetToken, @Nullable String endOffsetToken)` | `void appendRows(Iterable<Map<String, Object>> row, String startOffsetToken, String endOffsetToken)` | API name changed. Response type changed because there is no more validation on the client. |
> | `InsertValidationResponse insertRow(Iterable<Map<String, Object>> row, String offsetToken)` | N/A | Removed. |
> | `String getTableName()` | N/A | Removed. |
> | `String getFullyQualifiedTableName()` | N/A | Removed. |
> | N/A | `String getPipeName()` | New API. |
> | N/A | `String getFullyQualifiedPipeName()` | New API. |
> | `String getName()` | `String getChannelName()` | API name change. |
> | `String getFullyQualifiedName()` | `String getFullyQualifiedChannelName()` | API name change. |
> | `Map<String, ColumnProperties> getTableSchema()` | N/A | Removed. |
> | N/A | `ChannelStatus getChannelStatus()` | New API. |
> | `CompletableFuture<Void> close()` | `Void close()` | The return type is changed, but the behavior is the same. |
> | `CompletableFuture<Void> close(boolean drop)` | `Void close(boolean waitForFlush, Duration timeoutDuration)` | API name is changed, but the behavior is the same. |
> | `Boolean isValid()` | N/A | Removed. |
> | N/A | `CompletableFuture<Void> waitForFlush((optional)Duration timeoutDuration)` | A new method to wait for the flush to complete. `timeoutDuration`: Specifies how long the channel should wait before timing out. |
> | N/A | `void waitForCommit(Predicate<String> tokenChecker, Duration timeoutDuration)` | A new method that asynchronously triggers and waits for the flush of all buffered data within this specific channel to the Snowflake server. This method ensures that all pending data is successfully written and the flush operation is complete before proceeding. |
> | N/A | `void initiateFlush()` | A new method for channels to asynchronously trigger a flush. |

---
title: Computing the Number of Distinct Values
source: https://docs.snowflake.com/en/user-guide/querying-distinct-counts.md
section: User Guide
---

# Computing the Number of Distinct Values

To compute the number of rows that have distinct values, you can use one of the following approaches:

* Call the SQL [COUNT](../sql-reference/functions/count.md) function with the `DISTINCT` keyword.
* If you just need an approximate count of distinct values, you can use the HyperLogLog functions
  (e.g. `APPROX_COUNT_DISTINCT`). For details, see [Estimating the Number of Distinct Values](querying-approximate-cardinality.md).
* If you are counting distinct values for hierarchical aggregations (e.g. multiple grouping sets, rollups, or cubes), you can
  improve performance by using one of the following approaches (rather than using `COUNT(DISTINCT <expr>)`):

  + [Use bitmaps to identify the number of distinct values](querying-bitmaps-for-distinct-counts.md).

    With this approach, you use the bitmap functions to produce bitmaps that identify the distinct integer values in a column.
    Because a bitmap can represent at most 32,768 distinct values, this approach requires “bucketizing” (using multiple bitmaps)
    if the number of distinct values exceeds 32,768.

    For details, see [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](querying-bitmaps-for-distinct-counts.md).
  + [Produce arrays that contain the distinct values](querying-arrays-for-distinct-counts.md).

    With this approach, you use the aggregate functions that produce arrays containing the unique values in a column. You can then
    call [ARRAY_SIZE](../sql-reference/functions/array_size.md) to get the count of values.

    This approach works for values of any data type (e.g. [VARIANT](../sql-reference/data-types-semistructured.md)) and does not require
    “bucketizing”, unless the size of the data in the ARRAY exceeds the maximum size of an ARRAY.

    For details, see [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](querying-arrays-for-distinct-counts.md).

**Next Topics:**

* [Estimating the Number of Distinct Values](querying-approximate-cardinality.md)
* [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](querying-bitmaps-for-distinct-counts.md)
* [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](querying-arrays-for-distinct-counts.md)

---
title: Configurations and examples for Snowpipe Streaming classic architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-configuration.md
section: User Guide
---

# Configurations and examples for Snowpipe Streaming classic architecture

## Snowpipe Streaming properties: Classic architecture

Configure the API connection settings in a `profile.json` file. The properties are described in this topic.

As shown in the [Java example](https://github.com/snowflakedb/snowflake-ingest-java/blob/master/src/main/java/net/snowflake/ingest/streaming/example/SnowflakeStreamingIngestExample.java) (GitHub), you can load the settings from `profile.json` by specifying the file path as the input to the variable `PROFILE_PATH`.

### Required properties

`authorization_type`
:   Configure the authentication and authorization method for the user. You can use one of the following methods:

    * `JWT` : key pair authentication with JSON Web Token (JWT). This is the default method. If `authorization_type` is not configured, the default method `JWT` is used. Configure the following `private_key` for key pair authentication:

      + `private_key`
        Private key to authenticate the user. Include only the key, not the header or footer. If the key is split across multiple lines, remove the line breaks.

        You can provide an unencrypted key, or you can provide an encrypted key and provide the `snowflake.private.key.passphrase` parameter to enable Snowflake to decrypt the key. Use this parameter *if and only if* the `snowflake.private.key` parameter value is encrypted.
    * `OAuth` : Snowflake OAuth. This option is only available with Snowflake Ingest SDK versions 2.0.3 and later. Configure the following parameters for Snowflake OAuth in the `profile.json` file:

      + `oauth_client_id` : The client ID of the OAuth integration.
      + `oauth_client_secret` : The client secret of the OAuth integration.
      + `oauth_refresh_token` : A valid refresh token of the OAuth integration.

      To support token refresh on Snowflake/OKTA OAuth, you must configure three parameters: `oauth_client_id`, `oauth_client_secret`, and `oauth_refresh_token`. However, if you use a customized API endpoint for OAuth that doesn’t require these values in the token refresh request, you can fill in the fields for these parameters with any placeholders.

`url`
:   URL for accessing your Snowflake account. This URL must include your [account identifier](../admin-account-identifier.md). The protocol (`https://`) and port number are optional.

    `url` is not required if you are already using the Snowflake Ingest SDK and have set the `host`, `scheme`, and `port` properties in the `profile.json` file.

`user`
:   User login name for the Snowflake account.

### Optional properties

`enable_iceberg_streaming`
:   Set the property to `true` to enable Snowpipe Streaming with the Snowflake-managed Apache Iceberg™ table. For more information, see [Snowpipe Streaming Classic with Apache Iceberg™ tables](snowpipe-streaming-classic-iceberg.md).

`max_client_lag`
:   Use this property to configure the data flush latency. By default, Snowpipe Streaming flushes data every 1 second for standard Snowflake tables (non-Apache Iceberg). The max_client_lag configuration lets you override that and set it to your desired flush latency from 1 second to 10 minutes. For more information, see [Snowpipe Streaming latency recommendations](snowpipe-streaming-classic-recommendation.md).

`snowflake.private.key.passphrase`
:   Passphrase to decrypt the private key when the key is encrypted. For information, see Using key pair authentication and key rotation (in this topic).

`role`
:   Access control role to use for the session after connecting to Snowflake.

    The `role` property is optional for Snowflake Ingest SDK versions 2.0.3 and later. It is required for earlier Ingest SDK versions.

## Authentication and authorization

### Using Snowflake OAuth

With Snowflake Ingest SDK versions 2.0.3 and later, or Snowflake Connector for Kafka versions 2.1.2 and later, you can use Snowflake OAuth as an authorization method.

Follow [the workflow](../oauth-custom.md) to create a Snowflake OAuth integration and to call OAuth endpoints to request authorization codes and refresh access tokens. The response of token requests contains `oauth_refresh_token`. After a Snowflake OAuth integration is created, run the [SYSTEM$SHOW_OAUTH_CLIENT_SECRETS](../../sql-reference/functions/system_show_oauth_client_secrets.md) function to obtain `oauth_client_id` and `oauth_client_secret`.

To enable Snowflake OAuth, in the `profile.json` file, set `authorization_type` as `OAuth`, and complete the fields `oauth_refresh_token`, `oauth_client_id`, and `oauth_client_secret` with the parameters obtained above.

### Using key pair authentication and key rotation

API calls rely on key pair authentication with JSON Web Token (JWT). JWTs are signed using a public/private key pair with RSA encryption.
This authentication method requires a 2048-bit (minimum) RSA key pair. Generate the public-private key pair using OpenSSL. The public key
is assigned to the Snowflake user defined in the properties file.

Complete the key pair authentication instructions described in [key pair rotation](../key-pair-auth.md). Copy and paste the
entire private key into the `snowflake.private.key` field in the properties file. Save the file.

See [Java Example](../../developer-guide/sql-api/authenticating.md) for an example of creating a fingerprint and generating a JWT token.

Next, evaluate the recommendation for Externalizing secrets (in this topic).

### Externalizing secrets

Snowflake strongly recommends externalizing secrets such as the private key and storing them in an encrypted form or in a key management service such as AWS Key Management Service (KMS), Microsoft Azure Key Vault,
or HashiCorp Vault.

For more information, see the Confluent description of this [service](https://docs.confluent.io/current/connect/security.html#externalizing-secrets).

## Examples

* For a simple example that shows how the client SDK could be used to build a Snowpipe Streaming application, see [this Java file](https://github.com/snowflakedb/snowflake-ingest-java/blob/master/src/main/java/net/snowflake/ingest/streaming/example/SnowflakeStreamingIngestExample.java) (GitHub).
* Quick start examples:

  + [Streaming Data Integration with Snowflake](https://quickstarts.snowflake.com/guide/data_engineering_streaming_integration/index.html)
  + [Getting Started with Snowpipe Streaming and Amazon MSK](https://quickstarts.snowflake.com/guide/getting_started_with_snowpipe_streaming_aws_msk/index.html)
  + [Snowpipe Streaming and Dynamic Tables for Real-Time Ingestion (CDC Use Case)](https://quickstarts.snowflake.com/guide/CDC_SnowpipeStreaming_DynamicTables)

---
title: Configurations for Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-configurations.md
section: User Guide
---

# Configurations for Snowpipe Streaming with high-performance architecture

This guide describes the configuration settings for the high-performance Snowpipe Streaming client that are available in Java and Python SDKs. There are two distinct kinds of configuration:

* Process-wide environment variables: Variables that control logging and metrics for the entire running application and must be set before the client is initialized.
* Client-side properties: Properties that define the secure connection and ingestion target — such as, `url`, `user`, and `private_key` — and are configured for a specific client object, typically through an inline map or a `profile.json` file.

A single application can run multiple client objects. Each object has its own client-side properties, but they all share the same process-wide environment variable settings for logging and metrics.

The high-performance architecture requires the client to be explicitly bound to a specific PIPE object, which manages the schema, transformations, and ingestion into the target table.

## Environment variables

These configuration settings control process-wide behavior like logging and metrics collection and must be configured as environment variables before the client object is initialized. The following table shows the environmental variables that apply to all Snowpipe Streaming client objects within the same process:

| Variable | Description | Default value |
| --- | --- | --- |
| `SS_ENABLE_METRICS` | Set to TRUE to enable the built-in Prometheus metrics server. | FALSE |
| `SS_METRICS_PORT` | The port used for exposing metrics. | 50000 |
| `SS_METRICS_IP` | The IP address where the metrics server is hosted. | 127.0.0.1 |
| `SS_LOG_LEVEL` | The minimum logging level to output. | `info` (Options: `info`, `warn`, `error`) |

## Required properties

The high-performance SDK mandates several properties to establish both the secure connection and the specific ingestion target (the PIPE). The following table shows the required connection and user authentication properties:

| Property | Description |
| --- | --- |
| `url` | URL for accessing your Snowflake account, including your account identifier. The protocol (<https://>) and port number are optional. |
| `user` | User sign-in name for the Snowflake account. |
| `account` | Snowflake account identifier; for example, xy12345. |

If `authorization_type` is set to `JWT`, which is the default, you must provide either the key content or the key file path, as shown in the following table:

| Property | Description |
| --- | --- |
| `private_key` | Private key content that is used to authenticate the user. Include only the key content; no header, footer, or line breaks. |
| `private_key_file` | File path to the private key; for example, rsa_key.p8. This is an alternative to providing the key content directly. |

## Optional properties

The following table shows the high-performance SDK optional properties:

| Property | Description |
| --- | --- |
| `role` | Access control role to use for the session after connecting to Snowflake. |
| `authorization_type` | Property that configures the authentication method. Options are: JWT (key pair authentication, default). |

## Externalizing secrets

Snowflake strongly recommends that you externalize secrets, such as the `private_key` and OAuth credentials, and store them in a key management service; for example, AWS KMS.

## Configuration examples

The following examples show client-side and environment variable configurations.

### Client-side configuration through a profile.json file

The following example shows how to define client-side properties:

```json
// profile.json
{
  "authorization_type": "JWT",
  "url": "https://<account_identifier>.snowflakecomputing.com",
  "user": "MY_SNOWFLAKE_USER",
  "account": "XY12345",
  "private_key_file": "/path/to/rsa_key.p8",
  "role": "MY_INGEST_ROLE"
}
```

### Client-side configuration provided inline

The following examples show how to define client-side properties directly in code:

#### Python example

```python
config = {
    "authorization_type": "JWT",
    "url": "https://<account_identifier>.snowflakecomputing.com",
    "user": "MY_SNOWFLAKE_USER",
    "account": "XY12345",
    "private_key": "-----BEGIN PRIVATE KEY-----\n...\n-----END PRIVATE KEY-----",
}
# ... code to initialize client with 'config'
```

#### Java example

```java
Map<String, Object> config = new HashMap<>();
config.put("authorization_type", "JWT");
config.put("url", "https://<account_identifier>.snowflakecomputing.com");
config.put("user", "MY_SNOWFLAKE_USER");
config.put("account", "XY12345");
config.put("private_key_file", "/path/to/rsa_key.p8");
config.put("role", "MY_INGEST_ROLE");
// ... code to initialize client with 'config'
```

### Environment variable configuration

The following examples show how to define process-wide environment variables in the shell before you run the application:

#### Linux or macOS (Bash or Zsh)

```bash
# Set the log level for the entire application process to 'warn'
export SS_LOG_LEVEL=warn

# Change the IP for metrics to a specific loopback address
export SS_METRICS_IP=127.0.0.5

# Now run your application
```

#### Windows (command prompt)

```batch
# Set the log level for the entire application process to 'warn'
set SS_LOG_LEVEL=warn

# Change the metrics port
set SS_METRICS_PORT=55000

# Now run your application
```

---
title: Configure a catalog integration
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration.md
section: User Guide
---

# Configure a catalog integration

A catalog integration is a named, account-level Snowflake object that stores information about how your table metadata is organized for the
following scenarios:

* When you don’t use [Snowflake as the Iceberg catalog](tables-iceberg.md). For example, you need a
  catalog integration if your table is managed by AWS Glue.
* When you want to integrate with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) to:

  + Query an Iceberg table in Snowflake Open Catalog using Snowflake.
  + Sync a Snowflake-managed Iceberg table with Snowflake Open Catalog so that third-party compute engines can query the table.

A single catalog integration can support one or more Iceberg tables that use the same external catalog.

You must specify a catalog integration to create an Apache Iceberg™ table in Snowflake for the following scenarios:

* Use an external Iceberg catalog.
* Create a table from files in object storage.
* Integrate with Snowflake Open Catalog.
* Use an Iceberg REST catalog.

  > **Tip:**
  >
  > If you use an Iceberg REST catalog, you can use a Apache Iceberg™ REST catalog integration with a catalog-linked database to bring your external data
  > from a remote Iceberg REST catalog into Snowflake.
  >
  > A catalog-linked database automatically discovers and stays in sync with the namespaces and tables in your remote catalog. You can use the
  > catalog-linked database to read and write to the tables in your remote catalog from Snowflake, while preserving full interoperability
  > with your existing Iceberg ecosystem. For more information, see the following topics:
  >
  > + [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md)
  > + If your external data is in Unity Catalog, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md)
  > + If your external data is in AWS Glue, see [Build Data Lakes using Apache Iceberg with Snowflake and AWS Glue](https://www.snowflake.com/en/developers/guides/data-lake-using-apache-iceberg-with-snowflake-and-aws-glue/)

## Create a catalog integration

You can create and configure a catalog integration to use with one or more Iceberg tables.

For specific instructions, see the following topics:

* [Configure a catalog integration for files in object storage](tables-iceberg-configure-catalog-integration-object-storage.md)
* [Configure a catalog integration for Snowflake Open Catalog](tables-iceberg-configure-catalog-integration-open-catalog.md)
* [Configure a catalog integration for Apache Iceberg™ REST catalogs](tables-iceberg-configure-catalog-integration-rest.md)

## Set a default catalog at the account, database, or schema level

To define which catalog to use as the default for Iceberg tables,
you can set the [CATALOG](../sql-reference/parameters.md) parameter at the following levels:

Account:
:   Account administrators can use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the parameter for the account.
    If the value is set for the account, all Iceberg tables created in the account that use an external catalog use this
    catalog integration by default.

Object:
:   Users can execute the appropriate [CREATE <object>](../sql-reference/sql/create.md) or [ALTER <object>](../sql-reference/sql/alter.md) command
    to override the [CATALOG](../sql-reference/parameters.md) parameter value at the database or schema level.
    The lowest-scoped declaration is used: schema > database > account.

    In addition to the minimum privileges required to modify an object using the appropriate ALTER *<object_type>* command,
    a role must have the USAGE privilege on the catalog integration.

> **Note:**
>
> Changes to the CATALOG parameter only apply to tables created *after* the change. Existing tables continue to use the
> catalog integration specified when they were created.

### Example

The following statement sets a catalog integration (`shared_catalog_integration`) for a database named `my_database_1`:

```sqlexample
ALTER DATABASE my_database_1
  SET CATALOG = 'shared_catalog_integration';
```

After setting a catalog integration at the database level, you can create an Iceberg table in that database
without specifying a catalog integration. The following statement creates an Iceberg table from metadata in object storage in `my_database_1`
that uses the default catalog integration (`shared_catalog_integration`) set for the database.

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table
   EXTERNAL_VOLUME='my_external_volume'
   METADATA_FILE_PATH='path/to/metadata/v1.metadata.json';
```

---
title: Configure a catalog integration for Amazon API Gateway
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-api-gateway.md
section: User Guide
---

# Configure a catalog integration for Amazon API Gateway

The following diagram shows how Snowflake interacts with your REST catalog server using API Gateway and SigV4 authentication.

Follow the steps in this topic to use a REST API in
[Amazon API Gateway](https://docs.aws.amazon.com/apigateway/latest/developerguide/welcome.html)
and [Signature Version 4 (SigV4)](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_aws-signing.html) authentication
to securely connect Snowflake to an Iceberg REST catalog that isn’t publicly accessible.

1. Create a REST API in Amazon API Gateway
2. Create an IAM policy and attach it to a role
3. Attach an API Gateway resource policy (private APIs only)
4. Select IAM-based authorization for your API
5. Retrieve the endpoint URL
6. Create a catalog integration for SigV4
7. Configure the trust relationship in IAM

## Create a REST API in Amazon API Gateway

To connect Snowflake to your Iceberg REST catalog, you need a
[REST API resource](https://docs.aws.amazon.com/apigateway/latest/developerguide/apigateway-rest-api.html)
in Amazon API Gateway.

If you don’t already have a REST API resource in Amazon API Gateway for your Iceberg catalog,
you can create a simple REST API by modifying and importing an Iceberg catalog OpenAPI definition file or manually adding endpoints.

> **Note:**
>
> To import the Iceberg catalog OpenAPI definition, you must modify the YAML file. Amazon API Gateway does not support all components
> of the OpenAPI 2.0 or 3.0 specifications. For more information, see
> [Amazon API Gateway important notes for REST APIs](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-known-issues.html#api-gateway-known-issues-rest-apis).

1. In the AWS Management Console, search for and select API Gateway.
2. Select Create API.
3. Select Build under REST API. To create a *private* REST API, select Build under REST API Private.
4. Select one of the following options:

   * To create an API by manually adding endpoints, select New API.
   * To create an API using an OpenAPI definition file, select Import API, then upload the file or paste
     the definition in the code editor.
5. Enter an API name and optional Description.

   > **Note:**
   >
   > You don’t need to enter a VPC endpoint ID when you create a private REST API.
6. Select Create API.

For more information about creating and developing a REST API in API Gateway, see
the [Amazon API Gateway Developer Guide](https://docs.aws.amazon.com/apigateway/latest/developerguide/rest-api-develop.html).

## Create an IAM policy and attach it to a role

In this step, you create an AWS IAM role that Snowflake can use to connect to API Gateway.
You attach a policy to the role that grants permission to call your API.

1. In the AWS Management Console, search for and select IAM.
2. From the left-hand navigation pane, select Policies.
3. Select Create policy and then select JSON for the Policy editor.
4. Replace the empty policy with a policy that has permission to invoke your API methods.
   For example, the following general policy allows the invoke action for all API Gateway resources in an AWS account.

   ```json
   {
     "Version": "2012-10-17",
     "Statement": [
       {
           "Effect": "Allow",
           "Action": [
               "execute-api:Invoke"
           ],
           "Resource": "arn:aws:execute-api:*:<aws_account_id>:*"
       }
     ]
   }
   ```

   > **Important:**
   >
   > As a best practice, use a policy that grants the minimum required privileges for your use case. For additional guidance and example policies,
   > see [Control access to an API with IAM permissions](https://docs.aws.amazon.com/apigateway/latest/developerguide/permissions.html).
5. Select Next.
6. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
7. Select Create policy.
8. From the left-hand navigation pane in the IAM dashboard, select Roles.
9. Select a role to attach the policy to. When you create a catalog integration, you specify this role. If you don’t have a role, [create a
   new role](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user.html).
10. On the role Summary page in the Permissions tab, select Add permissions » Attach policies.
11. Search for and check the box next to the policy that you created for API Gateway, then select Add permissions.
12. On the role Summary page, copy the role ARN. You specify this ARN when you create a catalog integration.

## Attach an API Gateway resource policy (private APIs only)

If your REST API is private, you must attach an Amazon API Gateway resource policy to your API. The resource
policy allows Snowflake to call your API from the Amazon Virtual Private Cloud (VPC) in which your Snowflake account is located.

1. In Snowflake, call the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../sql-reference/functions/system_get_snowflake_platform_info.md)
   function to retrieve the IDs for the VPC in which your Snowflake account is located.
   From the function output, for each property identified with “purpose”: “generic”, record the corresponding VPC ID(s).

   ```sqlexample
   SELECT SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO();
   ```

   Output:

   ```output
   {
     "snowflake-vpc-id": ["vpc-c1c234a5"],
     "snowflake-egress-vpc-ids": [
       ...
       {
         "id": "vpc-c1c234a5",
         "expires": "2025-03-01T00:00:00",
         "purpose": "generic"
       },
       ...
     ]
   }
   ```
2. Follow the instructions in [Attaching API Gateway resource policies](https://docs.aws.amazon.com/apigateway/latest/developerguide/apigateway-resource-policies-create-attach.html#apigateway-resource-policies-create-attach-console)
   to attach a resource policy to your REST API.

   Paste and modify the following example policy.

   ```json
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Effect": "Deny",
         "Principal": "*",
         "Action": "execute-api:Invoke",
         "Resource": "<api_gateway_arn>",
         "Condition": {
           "StringNotEquals": {
             "aws:sourceVpc": "<snowflake_vpc_id>"
           }
         }
       },
       {
         "Effect": "Allow",
         "Principal": {
           "AWS": "arn:aws:sts::123456789XXX:assumed-role/<my_api_permissions_role_name>/snowflake"
         },
         "Action": "execute-api:Invoke",
         "Resource": "<api_gateway_arn>/*/*/*",
         "Condition": {
           "StringEquals": {
             "aws:sourceVpc": "<snowflake_vpc_id>"
           }
         }
       }
     ]
   }
   ```

The first statement in the policy denies all requests that don’t originate from the Snowflake VPC. The second statement allows the invoke
action (for all methods) from requests originating from the Snowflake VPC that use the
[assumed-role session principal](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_elements_principal.html#principal-role-session).

To learn more about API Gateway resource policies, see:

* [Controlling access to an API with API Gateway resource policies](https://docs.aws.amazon.com/apigateway/latest/developerguide/apigateway-resource-policies.html)
* [API Gateway resource policy examples](https://docs.aws.amazon.com/apigateway/latest/developerguide/apigateway-resource-policies-examples.html)

## Select IAM-based authorization for your API

Select IAM-based authorization for each method that you want to provide access to in your REST API.
With IAM-based authorization, Snowflake can use the IAM role that you configured to make
calls to the API.

1. In the Amazon API Gateway console, select your REST API.
2. For each method:

   1. Under Resources, select a method from the list.
   2. Under Method request settings, select Edit.
   3. For Authorization, select AWS IAM.
   4. Select Save.
3. To apply the authorization changes, select Deploy API. For more information, see
   [Deploying a REST API from the API Gateway console](https://docs.aws.amazon.com/apigateway/latest/developerguide/how-to-deploy-api-with-console.html).

## Retrieve the endpoint URL

Retrieve your REST API endpoint URL (or *invoke* URL). Your API must be deployed to a stage before you can
retrieve the endpoint URL.

1. In the Amazon API Gateway console, select your REST API.
2. In the left-hand navigation pane, select Stages.
3. Under Stage details, copy the Invoke URL.

You specify the endpoint URL when you create a catalog integration.

## Create a catalog integration for SigV4

After you have a REST API in Amazon API Gateway and have completed the initial steps to
control access to your API using IAM permissions, you can create a catalog integration
in Snowflake.

To view the command syntax and parameter descriptions,
see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md).

**Public REST API**

To create a catalog integration for a public REST API, specify `ICEBERG_REST`
as the `CATALOG_SOURCE` and use `SIGV4` authentication.

Include details such as your API endpoint URL and IAM role ARN.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_rest_catalog_integration
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'my_namespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://asdlkfjwoalk-execute-api.us-west-2-amazonaws.com/MyApiStage'
    CATALOG_API_TYPE = AWS_API_GATEWAY
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789XXX:role/my_api_permissions_role'
    SIGV4_EXTERNAL_ID = 'my_iceberg_external_id'
  )
  ENABLED = TRUE;
```

**Private REST API**

To create a catalog integration for a private REST API, you must set the `CATALOG_API_TYPE`
parameter to `AWS_PRIVATE_API_GATEWAY`.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_rest_catalog_integration
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'my_namespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://asdlkfjwoalk-execute-api.us-west-2-amazonaws.com/MyApiStage'
    CATALOG_API_TYPE = AWS_PRIVATE_API_GATEWAY
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789XXX:role/my_api_permissions_role'
    SIGV4_EXTERNAL_ID = 'my_iceberg_external_id'
  )
  ENABLED = TRUE;
```

> **Note:**
>
> Both examples specify an
> [external ID](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html)
> (`SIGV4_EXTERNAL_ID = 'my_iceberg_external_id'`) that you can use in the trust relationship for your IAM role (in the next step).
>
> Specifying an external ID lets you use the same IAM role across multiple catalog integrations without updating the
> IAM role trust policy. Doing so is particularly useful in testing scenarios if you need to create or replace a catalog integration many times.

## Configure the trust relationship in IAM

Retrieve information about the AWS IAM user that was created for your
Snowflake account when you created the catalog integration, and configure the trust
relationship for your IAM role.

1. In Snowflake, call the [DESCRIBE CATALOG INTEGRATION](../sql-reference/sql/desc-catalog-integration.md) command:

   ```sqlexample
   DESCRIBE CATALOG INTEGRATION my_rest_catalog_integration;
   ```

   Record the following values:

   > | Value | Description |
   > | --- | --- |
   > | `API_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account, for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. |
   > | `API_AWS_EXTERNAL_ID` | The external ID that’s needed to establish a trust relationship. If you didn’t specify an external ID (`SIGV4_EXTERNAL_ID`) when you created the catalog integration, Snowflake generates an ID for you to use. Record the value so that you can update your IAM role trust policy with the generated external ID. |
2. In the AWS Management Console, search for and select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the IAM role that you created for your catalog integration.
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the values that you recorded.

   ```json
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<api_aws_iam_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<api_aws_external_id>"
           }
         }
       }
     ]
   }
   ```
8. Select Update policy to save your changes.

---
title: Configure a catalog integration for Apache Iceberg™ REST catalogs
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest.md
section: User Guide
---

# Configure a catalog integration for Apache Iceberg™ REST catalogs

An Apache Iceberg™ REST [catalog integration](tables-iceberg.md) lets Snowflake access
[Apache Iceberg™ tables](tables-iceberg.md) managed in a remote catalog that complies with the
open source [Apache Iceberg REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml).

Snowflake supports the following additional features when you use an Iceberg REST catalog integration:

* [Catalog-linked databases and automatic table discovery](tables-iceberg-catalog-linked-database.md)
* [Write support for externally managed Iceberg tables](tables-iceberg-externally-managed-writes.md)

## Authentication methods

Snowflake supports the following authentication methods for Iceberg REST catalogs:

* OAuth
* Bearer token or personal access token (PAT)
* Signature Version 4 (SigV4)

Supported authentication methods vary by catalog source.

### Credential rotation

To rotate the credentials for a catalog integration, you can use the [ALTER CATALOG INTEGRATION](../sql-reference/sql/alter-catalog-integration.md)
command to update the credentials that Snowflake uses to authenticate with your remote catalog.

For example:

```sqlexample
ALTER CATALOG INTEGRATION my_cat_int SET
  REST_AUTHENTICATION (
    OAUTH_CLIENT_SECRET = 'myNewSecret'
  );
```

## Connection options

This section describes the connection options for Iceberg REST catalogs.

### Vended credentials

In addition to [External volumes](tables-iceberg-configure-external-volume.md),
Snowflake supports the following connection options for Iceberg REST catalogs:

* [Vended credentials](tables-iceberg-configure-catalog-integration-vended-credentials.md)

Supported connection options vary by catalog source.

### Private connectivity

Snowflake supports connecting to Iceberg REST catalogs through [private connectivity](tables-iceberg-configure-catalog-integration-rest-private.md).

However, when you connect to the catalog through private connectivity, you must use an external volume to connect to the catalog data.

Supported connection options vary by catalog source.

## Catalog sources

Snowflake supports any external catalog server that complies with the Iceberg REST specification.

The following topics provide examples for commonly used REST catalogs:

* [Snowflake Open Catalog](tables-iceberg-configure-catalog-integration-open-catalog.md). These instructions also apply to
  Apache Polaris™.
* [AWS Glue](tables-iceberg-configure-catalog-integration-rest-glue.md)
* [Amazon API Gateway](tables-iceberg-configure-catalog-integration-rest-api-gateway.md)
* [Tabular](tables-iceberg-configure-catalog-integration-rest-tabular.md)
* [Unity Catalog](tables-iceberg-configure-catalog-integration-rest-unity.md)
* [OneLake](tables-iceberg-configure-catalog-integration-rest-onelake.md)

## Browsing a remote catalog

After you create a catalog integration for Iceberg REST, you can use the following
Snowflake system functions to browse namespaces and tables in the catalog:

* [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](../sql-reference/functions/system_list_iceberg_tables_from_catalog.md)
* [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](../sql-reference/functions/system_list_namespaces_from_catalog.md)

## Migrate a table to a Iceberg REST catalog integration

After you create a catalog integration for Iceberg REST, if needed, you can
replace the catalog integration associated with an externally managed Iceberg table in a standard Snowflake database with the catalog
integration you created. For instructions, see [SYSTEM$SET_CATALOG_INTEGRATION](../sql-reference/functions/system_set_catalog_integration.md).

## Create a catalog-linked database

After you create a catalog integration for Iceberg REST, you can create a catalog-linked database to bring the data from your remote Iceberg REST catalog into Snowflake.
When you create the catalog-linked database, specify the name of the catalog integration you created as the catalog.

A catalog-linked database automatically discovers
and stays in sync with the namespaces and tables in your remote catalog. You can use a catalog-linked database to read and
write to the tables in your remote catalog from Snowflake, while preserving full interoperability with your existing
Iceberg ecosystem. For more information, see the following topics:

* [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md)
* If your external data is in Unity Catalog, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md)
* If your external data is in AWS Glue, see [Build Data Lakes using Apache Iceberg with Snowflake and AWS Glue](https://www.snowflake.com/en/developers/guides/data-lake-using-apache-iceberg-with-snowflake-and-aws-glue/)

---
title: Configure a catalog integration for AWS Glue
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-glue.md
section: User Guide
---

# Configure a catalog integration for AWS Glue

> **Important:**
>
> To integrate with AWS Glue, we recommend that you instead configure a catalog integration for the AWS Glue Iceberg REST endpoint, which supports
> additional Iceberg table features such as catalog-vended credentials.
>
> For instructions, see [Configure a catalog integration for AWS Glue Iceberg REST](tables-iceberg-configure-catalog-integration-rest-glue.md).

Create a catalog integration for AWS Glue and grant Snowflake restricted access
to the AWS Glue Data Catalog.

> **Note:**
>
> * To complete the instructions in this section, you must have permissions in Amazon Web Services (AWS)
>   to create and manage IAM policies and roles.
>   If you are not an AWS administrator, ask your AWS administrator to perform these tasks.
> * To migrate an Iceberg table in a standard Snowflake database from an AWS Glue catalog integration to an AWS Glue Iceberg REST catalog integration,
>   see [SYSTEM$SET_CATALOG_INTEGRATION](../sql-reference/functions/system_set_catalog_integration.md).

## Step 1: Configure access permissions for the AWS Glue Data Catalog

As a best practice, create a new IAM policy for Snowflake to access the AWS Glue Data Catalog.
You can then attach the policy to an IAM role and use the security credentials that AWS generates
for that role to access files in the catalog. For instructions, see
[Creating IAM policies](https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies_create-console.html) and
[Modifying a role permissions policy](https://docs.aws.amazon.com/IAM/latest/UserGuide/roles-managingrole-editing-console.html#roles-modify_permissions-policy)
in the AWS Identity and Access Management User Guide.

At a minimum, Snowflake requires the following permissions on the AWS Glue Data Catalog to access information about tables.

* `glue:GetTable`
* `glue:GetTables`

The following example policy (in JSON format) provides the required permissions
to access all of the tables in a specified database.

```sqljson
{
   "Version": "2012-10-17",
   "Statement": [
      {
         "Sid": "AllowGlueCatalogTableAccess",
         "Effect": "Allow",
         "Action": [
            "glue:GetTable",
            "glue:GetTables"
         ],
         "Resource": [
            "arn:aws:glue:*:<accountid>:table/*/*",
            "arn:aws:glue:*:<accountid>:catalog",
            "arn:aws:glue:*:<accountid>:database/<database-name>"
         ]
      }
   ]
}
```

> **Note:**
>
> * You can modify the `Resource` element of this policy to further restrict the allowed resources
>   (for example, catalog, databases, or tables). For more information, see
>   [Resource types defined by AWS Glue](https://docs.aws.amazon.com/service-authorization/latest/reference/list_awsglue.html#awsglue-resources-for-iam-policies).
> * If you use encryption for AWS Glue, you must modify the policy to add AWS Key Management Service (AWS KMS) permissions.
>   For more information, see [Setting up encryption in AWS Glue](https://docs.aws.amazon.com/glue/latest/dg/set-up-encryption.html).

## Step 2: Create a catalog integration in Snowflake

Create a catalog integration for the AWS Glue Data Catalog using the [CREATE CATALOG INTEGRATION (AWS Glue)](../sql-reference/sql/create-catalog-integration-glue.md) command.

The following example creates a catalog integration that uses an AWS Glue Data Catalog source.
The example specifies a value for the optional `GLUE_REGION` parameter.

```sqlexample
CREATE CATALOG INTEGRATION glueCatalogInt
  CATALOG_SOURCE = GLUE
  CATALOG_NAMESPACE = 'my.catalogdb'
  TABLE_FORMAT = ICEBERG
  GLUE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myGlueRole'
  GLUE_CATALOG_ID = '123456789012'
  GLUE_REGION = 'us-east-2'
  ENABLED = TRUE;
```

## Step 3: Retrieve the AWS IAM user and external ID for your Snowflake account

To retrieve information about the AWS IAM user and the external ID
that were created for your Snowflake account when you created the catalog integration, execute the [DESCRIBE CATALOG INTEGRATION](../sql-reference/sql/desc-catalog-integration.md) command.
You provide this information to AWS in the next section to establish a trust relationship.

The following example command describes the catalog integration created in the previous step:

```sqlexample
DESCRIBE CATALOG INTEGRATION glueCatalogInt;
```

Record the following values:

> | Value | Description |
> | --- | --- |
> | `GLUE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account, for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All Glue catalog integrations in your account use that IAM user. |
> | `GLUE_AWS_EXTERNAL_ID` | The external ID that is needed to establish a trust relationship. |

You will provide these values in the next section.

## Step 4: Grant the IAM user permissions to access the AWS Glue Data Catalog

Update the trust policy for the same IAM role that you specified with the ARN when you created the
catalog integration (`GLUE_AWS_ROLE_ARN`). Add the values that you recorded in
Step 3: Retrieve the AWS IAM user and external ID for your Snowflake account to the trust policy.

For instructions, see [Modifying a trust policy](https://docs.aws.amazon.com/IAM/latest/UserGuide/roles-managingrole-editing-console.html#roles-managingrole_edit-trust-policy).

The following example trust policy demonstrates where to specify the `GLUE_AWS_IAM_USER_ARN` and `GLUE_AWS_EXTERNAL_ID` values:

```sqljson
{
   "Version": "2012-10-17",
   "Statement": [
      {
      "Sid": "",
      "Effect": "Allow",
      "Principal": {
         "AWS": "<glue_iam_user_arn>"
      },
      "Action": "sts:AssumeRole",
      "Condition": {
         "StringEquals": {
            "sts:ExternalId": "<glue_aws_external_id>"
         }
      }
      }
   ]
}
```

Where:

> * `glue_iam_user_arn` is the `GLUE_IAM_USER_ARN` value that you recorded.
> * `glue_aws_external_id` is the `GLUE_AWS_EXTERNAL_ID` value that you recorded.

> **Note:**
>
> * For security reasons, if you create a new catalog integration (or recreate an existing catalog integration using the CREATE OR
>   REPLACE CATALOG INTEGRATION syntax), the new catalog integration has a different external ID and cannot resolve the trust
>   relationship unless you modify the trust policy with the new external ID.
> * To verify that your permissions are configured correctly, [create an Iceberg table](tables-iceberg-create.md)
>   using this catalog integration. Snowflake doesn’t verify that your permissions are set correctly until you create
>   an Iceberg table that references this catalog integration.

## Next steps

After you configure a catalog integration for AWS Glue, you can create an Iceberg table.

To update the table and keep it in sync with changes in AWS Glue, use an
[ALTER ICEBERG TABLE … REFRESH](../sql-reference/sql/alter-iceberg-table-refresh.md) statement. For more information, see
[Refresh the metadata for a table](tables-iceberg-manage.md).

---
title: Configure a catalog integration for AWS Glue Iceberg REST
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-glue.md
section: User Guide
---

# Configure a catalog integration for AWS Glue Iceberg REST

Follow the steps in this topic to create a catalog integration for the
[AWS Glue Iceberg REST endpoint](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html)
with [Signature Version 4 (SigV4)](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_aws-signing.html) authentication.

> **Note:**
>
> To configure a catalog integration for connecting to AWS Glue Data Catalog through a private IP address instead of over the public internet,
> see [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](tables-iceberg-configure-catalog-integration-rest-private.md).

## Step 1: Configure access permissions for the AWS Glue Data Catalog

Create an IAM policy for Snowflake to access the AWS Glue Data Catalog.
Attach the policy to an IAM role, which you specify when you create a catalog integration. For instructions, see
[Creating IAM policies](https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies_create-console.html) and
[Modifying a role permissions policy](https://docs.aws.amazon.com/IAM/latest/UserGuide/roles-managingrole-editing-console.html#roles-modify_permissions-policy)
in the AWS Identity and Access Management User Guide.

### Read-only example policy

At a minimum, Snowflake requires the following permissions on the AWS Glue Data Catalog to access information using the Glue Iceberg REST catalog.

* `glue:GetCatalog`
* `glue:GetDatabase`
* `glue:GetDatabases`
* `glue:GetTable`
* `glue:GetTables`

The following example policy (in JSON format) provides the required permissions
to access all of the tables in a specified database.

```json
{
   "Version": "2012-10-17",
   "Statement": [
      {
         "Sid": "AllowGlueCatalogTableAccess",
         "Effect": "Allow",
         "Action": [
           "glue:GetCatalog",
           "glue:GetDatabase",
           "glue:GetDatabases",
           "glue:GetTable",
           "glue:GetTables"
         ],
         "Resource": [
            "arn:aws:glue:*:<accountid>:table/*/*",
            "arn:aws:glue:*:<accountid>:catalog",
            "arn:aws:glue:*:<accountid>:database/<database-name>"
         ]
      }
   ]
}
```

> **Note:**
>
> * You can modify the `Resource` element of this policy to further restrict the allowed resources
>   (for example, catalog, databases, or tables). For more information, see
>   [Resource types defined by AWS Glue](https://docs.aws.amazon.com/service-authorization/latest/reference/list_awsglue.html#awsglue-resources-for-iam-policies).
> * If you use encryption for AWS Glue, you must modify the policy to add AWS Key Management Service (AWS KMS) permissions.
>   For more information, see [Setting up encryption in AWS Glue](https://docs.aws.amazon.com/glue/latest/dg/set-up-encryption.html).

### Read and write example policy

The following example policy (in JSON format) provides the required permissions
for read and write access to all of the tables in all databases.
To configure [write access for externally managed tables](tables-iceberg-externally-managed-writes.md),
use this policy as an example.

```json
{
  "Version": "2012-10-17",
  "Statement": [
    {
      "Sid": "AllowGlueCatalogTableAccess",
      "Effect": "Allow",
      "Action": [
        "s3:GetObject",
        "s3:PutObject",
        "glue:GetCatalog",
        "glue:GetDatabase",
        "glue:GetDatabases",
        "glue:CreateDatabase",
        "glue:DeleteDatabase",
        "glue:GetTable",
        "glue:GetTables",
        "glue:CreateTable",
        "glue:UpdateTable",
        "glue:DeleteTable"
      ],
      "Resource": [
        "arn:aws:glue:*:<accountid>:table/*/*",
        "arn:aws:glue:*:<accountid>:catalog",
        "arn:aws:glue:*:<accountid>:database/*",
        "arn:aws:s3:<external_volume_path>"
      ]
    }
  ]
}
```

> **Note:**
>
> * The policy must provide access to your storage location in order for AWS Glue catalog to write metadata to the table location.
> * The `"arn:aws:glue:*:<accountid>:database/*"` line in the `Resource` element of this policy specifies all databases. This is required
>   if you want to create a new database in Glue from Snowflake with the [CREATE SCHEMA](tables-iceberg-externally-managed-writes.md)
>   command. To limit access to a single database, you can specify the database by name. For more information about defining resources, see
>   [Resource types defined by AWS Glue](https://docs.aws.amazon.com/service-authorization/latest/reference/list_awsglue.html#awsglue-resources-for-iam-policies).
> * If you use encryption for AWS Glue, you must modify the policy to add AWS Key Management Service (AWS KMS) permissions.
>   For more information, see [Setting up encryption in AWS Glue](https://docs.aws.amazon.com/glue/latest/dg/set-up-encryption.html).

### (Optional) Configure Lake Formation access control

If you use AWS Lake Formation for fine-grained access control, ensure that your Lake Formation configuration
allows Snowflake to access your catalog objects and underlying data.

The IAM role that you created in the previous step — the role that you specify in Snowflake when you create a catalog integration — must
have the `lakeformation:GetDataAccess` IAM permission. This permission grants read and write access to underlying data:

```json
{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": "lakeformation:GetDataAccess",
            "Resource": "*"
        }
    ]
}
```

For more information, see [Underlying data access control](https://docs.aws.amazon.com/lake-formation/latest/dg/access-control-underlying-data.html)
in the Lake Formation documentation.

You must also grant data permissions to the IAM role. The method that you use to grant data permissions depends on your Lake Formation setup.
For example, you might use the named resources method to grant permissios to AWS Glue objects, or you might use tag-based access control. For more information
and instructions, see the [AWS Lake Formation documentation](https://docs.aws.amazon.com/lake-formation/latest/dg/granting-catalog-permissions.html).

## Step 2: Create a catalog integration in Snowflake

Create a catalog integration for the
[AWS Glue Iceberg REST endpoint](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html)
using the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command.
Specify the IAM role that you configured. For `CATALOG_NAME`, use your AWS account ID.

```sqlexample
CREATE CATALOG INTEGRATION glue_rest_catalog_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'rest_catalog_integration'
  REST_CONFIG = (
    CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'
    CATALOG_API_TYPE = AWS_GLUE
    CATALOG_NAME = '123456789012'
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789012:role/my-role'
    SIGV4_SIGNING_REGION = 'us-west-2'
  )
  ENABLED = TRUE;
```

Where:

* `CATALOG_URI` is the service endpoint for the AWS Glue Iceberg REST catalog.
* `CATALOG_NAME` is the ID of your AWS account.

For more information, see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md),
which includes instructions for configuring a catalog integration for AWS Glue.

## Step 3: Retrieve the AWS IAM user and external ID for your Snowflake account

To retrieve information about the AWS IAM user and the external ID for your Snowflake account,
run the [DESCRIBE CATALOG INTEGRATION](../sql-reference/sql/desc-catalog-integration.md) command.
You provide this information to AWS in the next step to establish a trust relationship.

```sqlexample
DESCRIBE CATALOG INTEGRATION glue_rest_catalog_int;
```

Record the following values:

> | Value | Description |
> | --- | --- |
> | `GLUE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account, for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All Glue catalog integrations in your account use that IAM user. |
> | `GLUE_AWS_EXTERNAL_ID` | An external ID for establishing a trust relationship. |

## Step 4: Grant the IAM user access to the AWS Glue Data Catalog

Update the trust policy for the same IAM role that you specified with the ARN when you created the
catalog integration (`GLUE_AWS_ROLE_ARN`). Add the values that you recorded in the
previous step to the trust policy.

For instructions, see [Modifying a trust policy](https://docs.aws.amazon.com/IAM/latest/UserGuide/roles-managingrole-editing-console.html#roles-managingrole_edit-trust-policy).

The following example policy shows where to specify the `GLUE_AWS_IAM_USER_ARN` and `GLUE_AWS_EXTERNAL_ID` values:

```sqljson
{
   "Version": "2012-10-17",
   "Statement": [
      {
      "Sid": "",
      "Effect": "Allow",
      "Principal": {
         "AWS": "<glue_iam_user_arn>"
      },
      "Action": "sts:AssumeRole",
      "Condition": {
         "StringEquals": {
            "sts:ExternalId": "<glue_aws_external_id>"
         }
      }
      }
   ]
}
```

Where:

> * `glue_iam_user_arn` is the `GLUE_IAM_USER_ARN` value that you recorded.
> * `glue_aws_external_id` is the `GLUE_AWS_EXTERNAL_ID` value that you recorded.

> **Note:**
>
> * For security reasons, if you create a new catalog integration (or recreate an existing catalog integration by using the CREATE OR
>   REPLACE CATALOG INTEGRATION syntax), the new catalog integration has a different external ID and can’t resolve the trust
>   relationship unless you modify the trust policy with the new external ID.
> * To verify that your permissions are configured correctly, [create an Iceberg table](tables-iceberg-create.md)
>   that uses this catalog integration. Snowflake doesn’t verify that your permissions are set correctly until you create
>   an Iceberg table that references this catalog integration.

## Next steps

After you configure a catalog integration for AWS Glue Iceberg REST, you can [create a catalog-linked database](tables-iceberg-catalog-linked-database.md).
Specify the name of your catalog integration as the catalog when you create your catalog-linked database.

A catalog-linked database brings your external data from a remote Iceberg REST catalog into Snowflake by automatically discovering and
staying in sync with the namespaces and tables in your remote catalog.

---
title: Configure a catalog integration for files in object storage
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-object-storage.md
section: User Guide
---

# Configure a catalog integration for files in object storage

Create a catalog integration for Apache Iceberg™ table files or Delta table files in object storage.

After you create a catalog integration, you can [create an Iceberg table](tables-iceberg-create.md).

## Iceberg files

Create a catalog integration for Iceberg metadata that’s in an external cloud storage location
by setting `OBJECT_STORE` as the `CATALOG_SOURCE` value and `ICEBERG` as the `TABLE_FORMAT`.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION icebergCatalogInt
  CATALOG_SOURCE = OBJECT_STORE
  TABLE_FORMAT = ICEBERG
  ENABLED = TRUE;
```

## Delta table files

Create a catalog integration for Iceberg tables based on
Delta table files by setting `OBJECT_STORE` as the `CATALOG_SOURCE` value and `DELTA` as the `TABLE_FORMAT`.

* `CATALOG_SOURCE = OBJECT_STORE`
* `TABLE_FORMAT = DELTA`

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION delta_catalog_integration
  CATALOG_SOURCE = OBJECT_STORE
  TABLE_FORMAT = DELTA
  ENABLED = TRUE;
```

> **Note:**
>
> Snowflake doesn’t support creating Iceberg tables from Delta table definitions in the AWS Glue Data Catalog.

---
title: Configure a catalog integration for OneLake REST
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-onelake.md
section: User Guide
---

# Configure a catalog integration for OneLake REST

Follow the steps in this topic to create a catalog integration for the OneLake REST API endpoint, which is an endpoint for OneLake
table APIs that you can use to interact with tables in Microsoft Fabric. For more information about this endpoint,
see [Getting started with OneLake table APIs for Iceberg](https://learn.microsoft.com/en-us/fabric/onelake/table-apis/iceberg-table-apis-get-started)
in the Microsoft Fabric documentation.

With this catalog integration, you can use Snowflake to read OneLake tables that have Iceberg metadata.

## Prerequisites

* Before you begin, you must find your workspace ID for your workspace in Fabric and the data item ID for your lakehouse in Fabric.
  You specify your workspace ID and data item ID later when you create a catalog integration for OneLake REST.

  > + To find your workspace ID (`<workspaceID>`), refer to the URL of the Fabric site for an item in a workspace. For more information, see
  >   [Identify your workspace ID](https://learn.microsoft.com/en-us/fabric/admin/portal-workspace#identify-your-workspace-id) in the
  >   Microsoft Fabric documentation. Copy your workspace ID into a text editor.
  > + To find your data item ID (`<dataItemID>`), open your lakehouse, and then refer to the value after `lakehouses` in the URL. For more information,
  >   see [Lakehouse source configuration](https://learn.microsoft.com/en-us/fabric/data-factory/connector-lakehouse-copy-activity#source)
  >   in the Microsoft Fabric documentation and see the Connection bullet point. Copy your data item ID into a text editor.
* In your Fabric workspace, make sure you have Iceberg tables in any data item, such as in a lakehouse.

## Step 1: Configure access permissions for OneLake

To configure access permissions for OneLake, you create an application registration in Microsoft Azure, add the user_impersonation
permission to your application registration, and create a new client secret for your application registration.

1. In Azure, create an application registration.

   For details, see [Register an application in Microsoft Entra ID](https://learn.microsoft.com/en-us/entra/identity-platform/quickstart-register-app)
   in the Microsoft Entra documentation.
2. In your application registration, add the user_impersonation permission.

   To get started, follow the first four steps in [Use the Microsoft Entra admin center to find the APIs your organization uses](https://learn.microsoft.com/en-us/graph/migrate-azure-ad-graph-configure-permissions?tabs=http&pivots=entra-portal-api-permissions#use-the-microsoft-entra-admin-center-to-find-the-apis-your-organization-uses)
   in the Microsoft Graph documentation.

   > **Important:**
   >
   > Don’t switch to the APIs my organization uses tab as described in the steps. Instead, switch to the Microsoft APIs tab,
   > select Azure Storage, and then add the user_impersonation permission.
3. Create a new client secret for your application registration, and then copy the secret into a text editor.

   For instructions, see
   [Create a new client secret](https://learn.microsoft.com/en-us/entra/identity-platform/howto-create-service-principal-portal#option-3-create-a-new-client-secret)
   in the Microsoft Entra documentation. You specify this secret when you create a catalog integration.

   > **Important:**
   >
   > Remember to copy your secret to a text editor, because you can’t retrieve it later.
4. From the Overview page of your application registration, copy the Display name, Application (client) ID, and
   Directory (tenant) ID into a text editor.

   You specify these values when you create a catalog integration and external volume.

## Step 2: Grant your application registration access to your Fabric workspace

In this step, you give your application registration access to your workspace in Fabric.

1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/), and then sign in.
2. Open your Microsoft Fabric workspace.
3. Select Manage access.
4. Select + Add people or groups.
5. In the Enter name or email field, paste the name of your application registration.

   This name is the Display name that
   you copied when you configured access permissions for OneLake.
6. From the drop-down menu, select Contributor access or higher to allow the app to create the necessary Fabric item.
7. Select Add.

## Step 3: Create a catalog integration in Snowflake

Create a catalog integration for the REST API endpoint by using the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command.

For example:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_onelake_catalog_int
   CATALOG_SOURCE = ICEBERG_REST
   TABLE_FORMAT = ICEBERG
   REST_CONFIG = (
      CATALOG_URI = 'https://onelake.table.fabric.microsoft.com/iceberg'
      CATALOG_NAME = '<fabric_data_item_scope>'
   )
   REST_AUTHENTICATION = (
      TYPE = OAUTH
      OAUTH_TOKEN_URI = '<azure_active_directory_token_endpoint>'
      OAUTH_CLIENT_ID = '<entra_application_client_id>'
      OAUTH_CLIENT_SECRET = '<entra_application_client_secret>'
      OAUTH_ALLOWED_SCOPES = ('https://storage.azure.com/.default')
   )
   ENABLED = TRUE;
```

Where:

* `https://onelake.table.fabric.microsoft.com/iceberg` is the base URL at the OneLake table endpoint.
* `<fabric_data_item_scope>` is the Fabric data item scope, in the form `<workspaceID>`/`<dataItemID>`, such as
  `12345678-abcd-1abc-1a11-111111ab1111/11111111-abcd-1111-1ab1-1111a1a1ab91`. To find your `<workspaceID>` and `<dataItemID>`, see Prerequisites.
* `<azure_active_directory_token_endpoint_>` is your Azure Active Directory OAuth 2.0 token endpoint URL, in the form of `https://login.microsoftonline.com/<entra_tenant_id>/oauth2/v2.0/token`.
  For `<entra_tenant_id>` you specify your Entra tenant ID, which you copied when you configured access permissions for OneLake.
* `<entra_application_client_id>` is your Entra application client ID, which you copied when you configured access permissions for OneLake, such as `11111111-aabb-1a11-abc1-ab11111a11a1`.
* `<entra_application_client_secret>` is your application client secret, which you copied when you configured access permissions for OneLake.
* `https://storage.azure.com/.default` is the storage token audience.

## Step 4: Configure an external volume

In this step, you configure an external volume for Azure with your Azure OneLake URL and your Entra tenant ID.

1. Create an external volume using the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.

   For example:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL VOLUME my_onelake_extvol
      STORAGE_LOCATIONS =
      (
         (
               NAME = 'my_onelake_extvol'
               STORAGE_PROVIDER = 'AZURE'
               STORAGE_BASE_URL = '<azure_onelake_url>'
               AZURE_TENANT_ID='<entra_tenant_id>'
         )
      )
      ALLOW_WRITES = FALSE;
   ```

   Where:

   * `<azure_onelake_url>` is your Azure OneLake URL, in the form of `azure://onelake.dfs.fabric.microsoft.com/<workspaceID>/<dataItemID>`, such as `azure://onelake.dfs.fabric.microsoft.com/12345678-abcd-1abc-1a11-111111ab1111/11111111-abcd-1111-1ab1-1111a1a1ab91`.
     To find your `<workspaceID>` and `<dataItemID>`, see Prerequisites.
   * `<entra_tenant_id>` is your Entra tenant ID, such as, `11111111-aabb-1a11-abc1-ab11111a11a1`. You copied your Entra tenant ID when you configured access permissions for OneLake.
2. To retrieve a URL to the Microsoft permissions request page, use the [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command.
   Specify the name of the external volume that you created previously.

   ```sqlexample
   DESC EXTERNAL VOLUME my_onelake_extvol;
   ```

   Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `AZURE_CONSENT_URL` | URL to the Microsoft permissions request page. |
   | `AZURE_MULTI_TENANT_APP_NAME` | Name of the Snowflake client application created for your account. In a later step in this section, you grant this application permission to obtain an access token on your allowed storage location. |

   You use these values in the following steps.
3. In a web browser, navigate to the Microsoft permissions request page (the `AZURE_CONSENT_URL`).
4. Select Accept. This action allows the Azure service principal created for your Snowflake account to obtain an
   access token on a specified resource inside your tenant. Obtaining an access token succeeds only if you grant the service principal the
   appropriate permissions on the storage account level (see the next step).
5. Give the multi-tenant application permission to obtain an access token on your allowed storage location in Fabric.

   1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/), and then sign in.
   2. Open your Microsoft Fabric workspace.
   3. Select Manage access.
   4. Select + Add people or groups.
   5. In the Enter name or email field, paste the value you recorded for AZURE_MULTI_TENANT_APP_NAME.
   6. From the drop-down menu, select Contributor access or higher to allow the app to create the necessary Fabric item.
   7. Select Add.

For more information, see [Example Snowflake catalog integration and external volume code for the REST endpoint in Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/onelake/table-apis/iceberg-table-apis-get-started#snowflake)
in the Microsoft Fabric documentation.

## Next steps

After you configure a catalog integration for OneLake REST and an external volume, you can use the [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md)
command to create a catalog-linked database, and then read your tables from OneLake in Snowflake.

When you create your catalog-linked database, you specify the catalog integration and external volume that you created.

For example:

```sqlexample
CREATE OR REPLACE DATABASE my_linked_db
   LINKED_CATALOG = (
      CATALOG = 'my_onelake_catalog_int'
   )
   EXTERNAL_VOLUME = 'my_onelake_extvol';

SELECT SYSTEM$CATALOG_LINK_STATUS('IRC_CATALOG_LINKED');

SELECT * FROM my_linked_db."dbo"."sentiment";
```

> **Note:**
>
> Snowflake only supports read operations for tables in OneLake.

---
title: Configure a catalog integration for Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-open-catalog.md
section: User Guide
---

# Configure a catalog integration for Snowflake Open Catalog

> **Note:**
>
> These instructions also apply to configuring a catalog integration for Apache Polaris™.

Create a catalog integration for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview), which you can use to query a
table in Snowflake Open Catalog using Snowflake or sync a Snowflake-managed table with Open Catalog.
For more information, see [Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake](tables-iceberg-open-catalog.md).

A catalog integration for Open Catalog is associated with a specific catalog and service connection in your Open Catalog account.

For more information about creating a catalog integration to connect Open Catalog to Snowflake, see the following topics:

* [Query a table in Snowflake Open Catalog using Snowflake](tables-iceberg-open-catalog-query.md)
* [Sync a Snowflake-managed table with Snowflake Open Catalog](tables-iceberg-open-catalog-sync.md)

## Example: Create a catalog integration for Open Catalog

To create a catalog integration for Open Catalog, use the [CREATE CATALOG INTEGRATION](../sql-reference/sql/create-catalog-integration-open-catalog.md) command.

For example:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_open_catalog_int
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'myOpenCatalogCatalogNamespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://ABCDEFG-ACCOUNT1.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = 'myOpenCatalogExternalCatalogName'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'myClientId'
    OAUTH_CLIENT_SECRET = 'myClientSecret'
    OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:ALL')
  )
  ENABLED = TRUE;
```

* The value for CATALOG_URI is your Open Catalog account URL. For more information, see the
  [CATALOG_URI](../sql-reference/sql/create-catalog-integration-open-catalog.md) parameter description.
* If you’re [syncing a Snowflake-managed table with Open Catalog](tables-iceberg-open-catalog-sync.md), the
  CATALOG_NAMESPACE parameter isn’t required and doesn’t affect how you sync the table with Open Catalog. Snowflake syncs
  the table to the external catalog in Open Catalog that you specify in the catalog integration, along with its parent namespace
  from Snowflake.

  For example, if you have a `db1.public.table1` Iceberg table registered in Snowflake and you specify `catalog1`
  in the catalog integration, Snowflake syncs the table with Open Catalog with the following fully qualified name: `catalog1.db1.public.table1`.

> **Note:**
>
> To check your authentication configuration, see [Check a configuration for OAuth](tables-iceberg-configure-catalog-integration-rest-check-config.md).

---
title: Configure a catalog integration for Tabular
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-tabular.md
section: User Guide
---

# Configure a catalog integration for Tabular

Use the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command to create a REST catalog integration for Tabular.

The following example creates a REST catalog integration that uses OAuth:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION tabular_catalog_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'default'
  REST_CONFIG = (
    CATALOG_URI = 'https://api.tabular.io/ws'
    CATALOG_NAME = '<tabular_warehouse_name>'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_TOKEN_URI = 'https://api.tabular.io/ws/v1/oauth/tokens'
    OAUTH_CLIENT_ID = '<oauth_client_id>'
    OAUTH_CLIENT_SECRET = '<oauth_secret>'
    OAUTH_ALLOWED_SCOPES = ('catalog')
  )
  ENABLED = TRUE;
```

---
title: Configure a catalog integration for Unity Catalog
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-unity.md
section: User Guide
---

# Configure a catalog integration for Unity Catalog

Use the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command to create a REST catalog integration that uses
[vended credentials](tables-iceberg-configure-catalog-integration-vended-credentials.md)
or an [external volume](tables-iceberg.md) to connect to Databricks Unity Catalog.

> **Note:**
>
> * To configure a catalog integration for connecting to Databricks Unity Catalog through a private IP address instead of over the public internet,
>   see [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](tables-iceberg-configure-catalog-integration-rest-private.md).
> * For a tutorial that covers how to connect Snowflake to a catalog in Databricks Unity Catalog by using a writable
>   catalog-linked database with catalog-vended credentials, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md).

You can create a catalog integration for Unity Catalog where the Databricks workspace is hosted on one of the following cloud providers:

* AWS
* Azure
* Google Cloud

You can configure a catalog integration for Unity Catalog that uses OAuth or bearer authentication:

* Configure an OAuth catalog integration
* Configure a bearer token catalog integration

## Configure an OAuth catalog integration

### Step 1: Find your Databricks workspace URL

Your Databricks workspace URL is the URL that you use to access your Databricks workspace. You need to find this URL because you specify
it later when you create a catalog integration.

1. Find your Databricks workspace URL.

   For instructions on how to find this URL, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Workspace instance names, URLs, and IDs](https://docs.databricks.com/aws/workspace/workspace-details#workspace-instance-names-urls-and-ids)
   * **Azure Databricks**: [Azure Databricks: Determine per-workspace URL](https://learn.microsoft.com/azure/databricks/workspace/workspace-details#determine-per-workspace-url)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Workspace instance names, URLs, and IDs](https://docs.databricks.com/gcp/workspace/workspace-details#workspace-instance-names-urls-and-ids)
2. Copy your Databricks workspace URL into a text editor.

### Step 2: Add a service principal in Databricks

1. To add a service principal, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Add service principals to your account](https://docs.databricks.com/aws/admin/users-groups/manage-service-principals?language=Account%C2%A0console#-add-service-principals-to-your-account)
   * **Azure Databricks**: [Azure Databricks: Add service principals to your account](https://learn.microsoft.com/azure/databricks/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Add service principals to your account](https://docs.databricks.com/gcp/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
2. Copy the *Application ID* value for your service principal into a text editor and store it securely. You specify this value later
   when you create a catalog integration in Snowflake.

### Step 3: Create an OAuth secret for your service principal

1. To create an OAuth secret for your service principal, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Create an OAuth secret](https://docs.databricks.com/aws/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
   * **Azure Databricks**: [Azure Databricks: Create an OAuth secret](https://learn.microsoft.com/azure/databricks/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Create an OAuth secret](https://docs.databricks.com/gcp/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
2. Copy the *Secret* value that you generated into a text editor and store it securely. You specify this value later when you create a
   catalog integration in Snowflake.

   > **Important:**
   >
   > The client secret is only displayed once. Make sure to copy it before closing the dialog.

### Step 4: Enable Snowflake access to your catalog in Unity Catalog

In this step, you use Databricks to enable Snowflake access to your catalog in Unity Catalog.

To enable Snowflake access to your catalog in Unity Catalog through vended credentials, first, at the metastore level, you must enable
external data access on the metastore. Next, you need to grant your service principal Unity Catalog privileges to your catalog.

#### Enable external data access on the metastore (vended credentials only)

If you’re creating a catalog integration that uses vended credentials, you must enable external data access on the metastore in Databricks.
If you’re creating a catalog integration that uses an external volume, you can skip this step.

For instructions on how to enable external data access on the metastore, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Enable external data access on the metastore](https://docs.databricks.com/aws/en/external-access/admin#enable-external-data-access-on-the-metastore)
* **Azure Databricks**: [Azure Databricks: Enable external data access on the metastore](https://learn.microsoft.com/en-us/azure/databricks/external-access/admin#enable-external-data-access-on-the-metastore)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Enable external data access on the metastore](https://docs.databricks.com/gcp/en/external-access/admin#enable-external-data-access-on-the-metastore)

#### Assign your service principal to a workspace

Next, you need to assign your service principal to your Databricks workspace.

For instructions, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Assign a service principal to a workspace](https://docs.databricks.com/aws/en/admin/users-groups/manage-service-principals?language=Account%C2%A0console#assign-a-service-principal-to-a-workspace)
* **Azure Databricks**: [Azure Databricks: Assign a service principal to a workspace](https://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/manage-service-principals#assign-a-service-principal-to-a-workspace)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Assign a service principal to a workspace](https://docs.databricks.com/gcp/en/admin/users-groups/manage-service-principals#assign-a-service-principal-to-a-workspace)

#### Grant your service principal access to your catalog

Next, you must grant your service principal Unity Catalog privileges. You need to grant these privileges to your service principal to allow
Snowflake to access the catalog based on the privileges that you specify.

##### Privileges for full functionality

To enable full functionality in Snowflake, you must grant the following privileges:

> **Note:**
>
> If you want to restrict Snowflake access, see [Unity Catalog privileges and securable objects](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges/privileges)
> in the Databricks documentation.

| Privilege | Description |
| --- | --- |
| `EXTERNAL USE SCHEMA` | Allows Unity Catalog to generate and provide temporary, scoped credentials to Snowflake for accessing table data in cloud storage.  **Note:** This privilege is only required when you create a catalog integration that uses vended credentials; it’s not required when you create a catalog integration that uses an external volume, so if you’re using an external volume, remove it from the example code block. |
| `MODIFY` | Allows Snowflake to add, update, and delete data in a table. |
| `SELECT` | Allows Snowflake to query tables and access table metadata. Required for all operations in Snowflake, including reading data and discovering tables in the catalog-linked database. |
| `USE CATALOG` | Allows Snowflake to access the catalog. Required to connect to and interact with any objects in the Unity Catalog. |
| `USE SCHEMA` | Allows Snowflake access to schemas (namespaces) within the catalog. Required to view and work with tables in specific schemas. |

##### Grant privileges

You can grant privileges by using Catalog Explorer or SQL.

Catalog ExplorerSQL

To grant permissions by using the Databricks Catalog Explorer, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Grant permissions on an object](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Azure Databricks**: [Azure Databricks: Grant permissions on an object](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Grant permissions on an object](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)

> **Important:**
>
> In the Principals field, you must enter the name of your service principal, not the email address for a user or the
> name of a group.

To grant your service principal Unity Catalog privileges, you must specify the Application ID for your service principal.

For example, the following statement grants `example_sales_catalog` catalog privileges to a service principal with an
Application ID of `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`.

```sqlexample
GRANT EXTERNAL USE SCHEMA ON CATALOG example_sales_catalog TO `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`;
GRANT MODIFY ON CATALOG example_sales_catalog TO `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`;
GRANT SELECT ON CATALOG example_sales_catalog TO `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`;
GRANT USE CATALOG ON CATALOG example_sales_catalog TO `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`;
GRANT USE SCHEMA ON CATALOG example_sales_catalog TO `1aaa1a1a-11a1-1111-1111-1a11111aaa1a`;
```

For more information on how to grant your service principal Unity Catalog privileges, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Grant a principal Unity Catalog privileges](https://docs.databricks.com/aws/en/external-access/admin#grant-a-principal-unity-catalog-privileges) and [Databricks on AWS: Grant permissions on an object by using SQL](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges?language=SQL#-grant-permissions-on-an-object)
* **Azure Databricks**: [Azure Databricks: Grant a principal Unity Catalog privileges](https://learn.microsoft.com/en-us/azure/databricks/external-access/admin#external-schema) and [Azure Databricks: Grant permissions on an object by using SQL](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/manage-privileges/#sql-2)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Grant a principal Unity Catalog privileges](https://docs.databricks.com/gcp/en/external-access/admin#grant-a-principal-unity-catalog-privileges) and [Databricks on Google Cloud: Grant permissions on an object by using SQL](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/manage-privileges?language=SQL#-grant-permissions-on-an-object)

### Step 5: Create a catalog integration

The following example creates a REST catalog integration that uses OAuth:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_oauth_vended_credentials
CATALOG_SOURCE = ICEBERG_REST
TABLE_FORMAT = ICEBERG
REST_CONFIG = (
  CATALOG_URI = '<databricks_workspace_url>/api/2.1/unity-catalog/iceberg-rest'
  CATALOG_NAME = '<catalog_name>'
  ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
)
REST_AUTHENTICATION = (
  TYPE = OAUTH
  OAUTH_TOKEN_URI = '<databricks_workspace_url>/oidc/v1/token'
  OAUTH_CLIENT_ID = '<client_id>'
  OAUTH_CLIENT_SECRET = '<oauth_secret>'
  OAUTH_ALLOWED_SCOPES = ('all-apis')
)
ENABLED = TRUE;
```

Where:

* `<databricks_workspace_url>` specifies the URL for your Databricks workspace. To find this URL, see
  Step 1: Find your Databricks workspace URL.

  Here is an example of a Databricks workspace URL for each cloud platform:

  + **Databricks on AWS**: `https://dbc-a1a1a1a1-a1a1.cloud.databricks.com`
  + **Azure Databricks**: `https://adb-1111111111111111.1.azuredatabricks.net`
  + **Databricks on Google Cloud**: `https://1111111111111111.1.gcp.databricks.com`
* `<catalog_name>` specifies the name of your catalog in Unity Catalog that you want to connect Snowflake to. You can find this name in the Databricks workspace
  under Data > Catalogs.
* `ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS` configures the catalog integration to use vended credentials from Unity Catalog.

  > **Note:**
  >
  > If you’re creating a catalog integration that uses an external volume, you must exclude the `ACCESS_DELEGATION_MODE` parameter.
* `<client_id>` specifies the OAuth client ID for your Databricks service principal. You copied this value when you
  added a service principal in Databricks.

  > **Note:**
  >
  > In Databricks, this value is called the *Application ID*, not the Client ID.
* `<oauth_secret>` specifies the OAuth secret for your Databricks service principal. You copied this value when you
  created an OAuth secret for your service principal.

### Step 6: Verify your catalog integration

* To verify the configuration for your catalog integration, call the SYSTEM$VERIFY_CATALOG_INTEGRATION function.

  For more information, including an example, see [Use SYSTEM$VERIFY_CATALOG_INTEGRATION to check your catalog integration configuration](tables-iceberg-configure-catalog-integration-rest-check-config.md).

### Next steps

After you configure a catalog integration for your catalog in Unity Catalog, use the [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md)
command to create a catalog-linked database by specifying the catalog integration that you created. Snowflake then automatically syncs
with your catalog in Unity Catalog to detect schemas and
Iceberg tables, and registers the remote tables to the catalog-linked database.

When you create your catalog-linked database, you specify the catalog integration.

For example:

```sqlexample
CREATE OR REPLACE DATABASE my_linked_db
   LINKED_CATALOG = (
      CATALOG = 'unity_catalog_int_oauth_vended_credentials'
   );
```

By default, catalog-linked databases support both read and write operations. You can use Snowflake to insert data into, update, and
create Iceberg tables in your catalog in Unity Catalog. For more information, see
[Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md).

> **Note:**
>
> * If you’re using an external volume, you must include the `EXTERNAL_VOLUME` parameter with your CREATE DATABASE statement. For more
>   information, see [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md).
> * For more information on working with catalog-linked databases, see [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md).

## Configure a bearer token catalog integration

### Step 1: Find your Databricks workspace URL

Your Databricks workspace URL is the URL that you use to access your Databricks workspace. You need to find this URL because you specify
it later when you create a catalog integration.

1. Find your Databricks workspace URL.

   For instructions on how to find this URL, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Workspace instance names, URLs, and IDs](https://docs.databricks.com/aws/workspace/workspace-details#workspace-instance-names-urls-and-ids)
   * **Azure Databricks**: [Azure Databricks: Determine per-workspace URL](https://learn.microsoft.com/azure/databricks/workspace/workspace-details#determine-per-workspace-url)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Workspace instance names, URLs, and IDs](https://docs.databricks.com/gcp/workspace/workspace-details#workspace-instance-names-urls-and-ids)
2. Copy your Databricks workspace URL into a text editor.

### Step 2: Add a personal access token (PAT) in Databricks

You need to add a personal access token (PAT) because you must specify it when you create a catalog integration that uses a bearer token
for authentication.

1. To add a PAT in Databricks, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Authenticate with Databricks personal access tokens (legacy)](https://docs.databricks.com/aws/en/dev-tools/auth/pat)
   * **Azure Databricks**: [Azure Databricks: Authenticate with Azure Databricks personal access tokens (legacy)](https://learn.microsoft.com/en-us/azure/databricks/dev-tools/auth/pat)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Authenticate with Databricks personal access tokens (legacy)](https://docs.databricks.com/gcp/en/dev-tools/auth/pat)
2. Copy the value for your PAT into a text editor and store it securely. You specify this value later when you create a catalog integration in Snowflake.

### Step 3: Enable Snowflake access to your catalog in Unity Catalog

In this step, you use Databricks to enable Snowflake access to your catalog in Unity Catalog.

To enable Snowflake access to your catalog in Unity Catalog through vended credentials, first, at the metastore level, you must enable
external data access on the metastore. Next, you need to grant your Databricks user Unity Catalog privileges to your catalog, which your PAT
inherits.

#### Enable external data access on the metastore (vended credentials only)

If you’re creating a catalog integration that uses vended credentials, you must enable external data access on the metastore in Databricks.
If you’re creating a catalog integration that uses an external volume, you can skip this step.

For instructions on how to enable external data access on the metastore, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Enable external data access on the metastore](https://docs.databricks.com/aws/en/external-access/admin#enable-external-data-access-on-the-metastore)
* **Azure Databricks**: [Azure Databricks: Enable external data access on the metastore](https://learn.microsoft.com/en-us/azure/databricks/external-access/admin#enable-external-data-access-on-the-metastore)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Enable external data access on the metastore](https://docs.databricks.com/gcp/en/external-access/admin#enable-external-data-access-on-the-metastore)

#### Grant your Databricks user access to your catalog

Next, you must grant your Databricks user Unity Catalog privileges. You need to grant these privileges to your Databricks user to allow
Snowflake access to the catalog based on the privileges that you specify. When you use a PAT for authentication, it inherits all the
privileges granted on the Databricks user who created the PAT.

##### Privileges for full functionality

To enable full functionality in Snowflake, you must grant the following privileges:

> **Note:**
>
> If you want to restrict Snowflake access, see [Unity Catalog privileges and securable objects](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges/privileges)
> in the Databricks documentation.

| Privilege | Description |
| --- | --- |
| `EXTERNAL USE SCHEMA` | Allows Unity Catalog to generate and provide temporary, scoped credentials to Snowflake for accessing table data in cloud storage.  **Note:** This privilege is only required when you create a catalog integration that uses vended credentials; it’s not required when you create a catalog integration that uses an external volume, so if you’re using an external volume, remove it from the example code block. |
| `MODIFY` | Allows Snowflake to add, update, and delete data in a table. |
| `SELECT` | Allows Snowflake to query tables and access table metadata. Required for all operations in Snowflake, including reading data and discovering tables in the catalog-linked database. |
| `USE CATALOG` | Allows Snowflake to access the catalog. Required to connect to and interact with any objects in the Unity Catalog. |
| `USE SCHEMA` | Allows Snowflake access to schemas (namespaces) within the catalog. Required to view and work with tables in specific schemas. |

##### Grant privileges

You can grant privileges by using Catalog Explorer or SQL.

Catalog ExplorerSQL

To grant permissions by using the Databricks Catalog Explorer, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Grant permissions on an object](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Azure Databricks**: [Azure Databricks: Grant permissions on an object](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Grant permissions on an object](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)

To grant your Databricks user Unity Catalog privileges, you must specify the user ID for that Databricks user.

For example, the following statement grants `example_sales_catalog` catalog privileges to the `j.smith@example.com` Databricks user.

```sqlexample
GRANT EXTERNAL USE SCHEMA ON CATALOG example_sales_catalog TO `j.smith@example.com`;
GRANT MODIFY ON CATALOG example_sales_catalog TO `j.smith@example.com`;
GRANT SELECT ON CATALOG example_sales_catalog TO `j.smith@example.com`;
GRANT USE CATALOG ON CATALOG example_sales_catalog TO `j.smith@example.com`;
GRANT USE SCHEMA ON CATALOG example_sales_catalog TO `j.smith@example.com`;
```

For more information on how to grant your Databricks user Unity Catalog privileges, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Grant a principal Unity Catalog privileges](https://docs.databricks.com/aws/en/external-access/admin#grant-a-principal-unity-catalog-privileges) and [Databricks on AWS: Grant permissions on an object by using SQL](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges?language=SQL#-grant-permissions-on-an-object)
* **Azure Databricks**: [Azure Databricks: Grant a principal Unity Catalog privileges](https://learn.microsoft.com/en-us/azure/databricks/external-access/admin#external-schema) and [Azure Databricks: Grant permissions on an object by using SQL](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/manage-privileges/#sql-2)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Grant a principal Unity Catalog privileges](https://docs.databricks.com/gcp/en/external-access/admin#grant-a-principal-unity-catalog-privileges) and [Databricks on Google Cloud: Grant permissions on an object by using SQL](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/manage-privileges?language=SQL#-grant-permissions-on-an-object)

### Step 4: Create a catalog integration

The following example creates a REST catalog integration that uses a bearer token with vended credentials:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_bearer_vended_credentials
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  REST_CONFIG = (
    CATALOG_URI = '<databricks_workspace_url>/api/2.1/unity-catalog/iceberg-rest'
    CATALOG_NAME = '<catalog_name>'
    ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
  )
  REST_AUTHENTICATION = (
    TYPE = BEARER
    BEARER_TOKEN = '<personal_access_token>'
  )
  ENABLED = TRUE;
```

Where:

* `<databricks_workspace_url>` specifies the URL for your Databricks workspace. To find this URL, see
  Step 1: Find your Databricks workspace URL.

  Here is an example of a Databricks workspace URL for each cloud platform:

  + **Databricks on AWS**: `https://dbc-a1a1a1a1-a1a1.cloud.databricks.com`
  + **Azure Databricks**: `https://adb-1111111111111111.1.azuredatabricks.net`
  + **Databricks on Google Cloud**: `https://1111111111111111.1.gcp.databricks.com`
* `<catalog_name>` specifies the name of your catalog in Unity Catalog that you want to connect Snowflake to. You can find this name in the Databricks workspace
  under Data > Catalogs.
* `ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS` configures the catalog integration to use vended credentials from Unity Catalog.

  > **Note:**
  >
  > If you’re creating a catalog integration that uses an external volume, you must exclude the `ACCESS_DELEGATION_MODE` parameter.
* `<personal_access_token>` specifies your Databricks personal access token (PAT). An example of a PAT is `aaaa111aaaa111a1a1a1a111111a111a1111`.

### Step 5: Verify your catalog integration

* To verify the configuration for your catalog integration, call the SYSTEM$VERIFY_CATALOG_INTEGRATION function.

  For more information, see [Use SYSTEM$VERIFY_CATALOG_INTEGRATION to check your catalog integration configuration](tables-iceberg-configure-catalog-integration-rest-check-config.md).

### Next steps

After you configure a catalog integration for your catalog in Unity Catalog, use the [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md)
command to create a catalog-linked database by specifying the catalog integration that you created. Snowflake then automatically syncs
with your catalog in Unity Catalog to detect schemas and
Iceberg tables, and registers the remote tables to the catalog-linked database.

When you create your catalog-linked database, you specify the catalog integration.

For example:

```sqlexample
CREATE OR REPLACE DATABASE my_linked_db
   LINKED_CATALOG = (
      CATALOG = 'unity_catalog_int_bearer_vended_credentials'
   );
```

By default, catalog-linked databases support both read and write operations. You can use Snowflake to insert data into, update, and
create Iceberg tables in your catalog in Unity Catalog. For more information, see
[Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md).

> **Note:**
>
> * If you’re using an external volume, you must include the `EXTERNAL_VOLUME` parameter with your CREATE DATABASE statement. For more
>   information, see [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md).
> * For more information on working with catalog-linked databases, see [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md).

---
title: Configure a reader account
source: https://docs.snowflake.com/en/user-guide/data-sharing-reader-config.md
section: User Guide
---

# Configure a reader account

A newly-created reader account contains only a single user, who serves as the administrator for the entire account.

To “bootstrap” (i.e. configure) the account, the account administrator must create a minimum set of additional objects in the account, including users, custom roles (if desired), virtual warehouses, and
one or more shared databases (for the data shared by the provider account).

This topic provides an overview of all these configuration tasks, both required and optional.

> **Note:**
>
> Tasks 2 to 4 must be completed as the account administrator. All remaining tasks can be delegated to other users.
>
> Also, all of these tasks must be performed in the reader account, as opposed to the provider account.

## Task 1: Log into the reader account as the account administrator

Log into the reader account using any of the supported interfaces (such as Snowflake CLI, SnowSQL, or the web interface).

The instructions in this topic assume you are using SQL to perform these tasks, either in Snowflake CLI, SnowSQL, or using a worksheet (in the web interface ). However, the tasks can be performed in any supported
Snowflake interface.

> **Tip:**
>
> Remember to set ACCOUNTADMIN as the role to use. You can set this role either during login or afterwards in the active session.
>
> If you are using a worksheet (in the web interface) to perform these tasks, set the role in the context for the worksheet.

## Task 2: Create custom roles (optional)

Roles enable fine-grained control over the tasks that users in the reader account can perform. You can use roles to:

* Specify the users who can query the data shared with the account.
* Grant control over virtual warehouses to selected users.
* Delegate some administrator tasks and responsibilities to selected users (if desired).

Each reader account comes with the standard, system-defined roles (SYSADMIN, SECURITYADMIN, PUBLIC). If these roles do not meet the access requirements for the users you will create in the account, you
can create additional custom roles.

For more details, see [Overview of Access Control](security-access-control-overview.md).

## Task 3: Create users

Create the users who will log into the reader account and query data shared with the account, as well as perform any other tasks you choose to allow.

As part of the user creation process, remember to grant roles, system-defined or custom (if you created any), to the users. The roles you assign to the users determine what they can do in the account.

For more details, see [CREATE USER](../sql-reference/sql/create-user.md) and [GRANT ROLE](../sql-reference/sql/grant-role.md).

> **Tip:**
>
> All remaining tasks in this topic can be completed by the account administrator or can be delegated (through privileges and roles) to other users in the account.
>
> At a minimum, we recommend:
>
> * Grant the SECURITYADMIN role to at least one other user so that they can help create and manage other users and object access in the account.
> * Grant the SYSADMIN role to at least one other user so that they can help create and manage other objects in the account (e.g. virtual warehouses).

## Task 4: Create resource monitors (optional)

Virtual warehouses are required for querying data shared with the reader account. When running, virtual warehouses consume credits, which will be charged to your provider account.

If you wish to control the amount of credits consumed monthly by the virtual warehouses in the reader account, create one or more resource monitors and specify whether they control:

* All warehouses in the account.
* Individual warehouses.

For more details, see [CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md).

> **Attention:**
>
> If you choose to skip this task, the warehouses in the reader account can consume an unlimited number of credits each month, which will be charged to your provider account.

## Task 5: Create virtual warehouses

To enable querying the objects in the shared database, you must create at least one virtual warehouse. You can create as many warehouses as you like or need; however, remember that your provider account
is responsible for all credits consumed by the warehouses in the reader account and consider the following:

* Set the warehouse size appropriately, weighing desired query performance against desired credit consumption.
* Ensure the warehouse is set to auto-suspend when not in use.

For more details, see [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md).

## Task 6: Create a database from each share shared with the account

A reader account does not contain any data by default. To consume data shared from your provider account, you must use the [CREATE DATABASE](../sql-reference/sql/create-database.md) command to create a database from
each share shared with the account. When you create the database(s), you specify the name that other users in the reader account will reference when querying the shared data.

For example, if your provider account is named `ab12345` and you shared two shares named `share1` and `share2` with this reader account:

> ```sqlexample
> CREATE DATABASE shared_db1 FROM SHARE ab12345.share1;
>
> CREATE DATABASE shared_db2 FROM SHARE ab12345.share2;
> ```

## Task 7: Grant privileges on virtual warehouses and databases to roles

Data providers can choose to add objects to a share by either granting privileges on the objects to a share via a database role,
and then granting the database role to a share (Option 1) or granting privileges on the objects directly to the share (Option 2).
The instructions in this section differ depending on the option a database provider chose:

Option 1:
:   To enable querying data shared with the reader account, grant a database role in the share that aligns with a business function in your
    account with the appropriate role in your account. For example, suppose the share includes a database role named `shared_db1.dr1`
    that you want to share with every user in your account. In this case, you would grant the database role to the PUBLIC system role:

    ```sqlexample
    GRANT DATABASE ROLE shared_db1.dr1 TO ROLE PUBLIC;
    ```

Option 2:
:   To enable querying data shared with the reader account, grant the following privileges to the other roles, system-defined or custom
    (if any), in the account:

    * IMPORTED PRIVILEGES on each database created from the share(s) in Task 6: Create a database from each share shared with the account (in this topic).

    For example, the following commands grant the necessary privileges for two databases named `shared_db1` and `shared_db2`,
    and a warehouse named `testing_vw`, to the PUBLIC role. Because all users in the account automatically have the PUBLIC role, this
    enables any user in the account to use the warehouse and query the databases:

    ```sqlexample
    GRANT IMPORTED PRIVILEGES ON DATABASE shared_db1 TO ROLE PUBLIC;

    GRANT IMPORTED PRIVILEGES ON DATABASE shared_db2 TO ROLE PUBLIC;
    ```

In addition, grant the USAGE privilege on a virtual warehouse you created for executing queries:

```sqlexample
GRANT USAGE ON WAREHOUSE testing_vw TO ROLE PUBLIC;
```

You can grant additional privileges if desired; however, the privileges listed above are the minimum privileges required to query
the shared databases in the account.

In addition, you could grant full privileges on the `testing_vw` warehouse to the SYSADMIN role, enabling users with the role to
start, stop, and resize the warehouse:

```sqlexample
GRANT ALL ON WAREHOUSE testing_vs TO ROLE SYSADMIN;
```

For more details, see [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md).

## Task 8: Invite users to log in and reset their passwords

As the last configuration task, notify all the users you created that the account is available to use.

The fastest/easiest way to do this is to use the [ALTER USER](../sql-reference/sql/alter-user.md) command to reset the password for each user. This generates a unique URL for each user, which you then send/give
to them. They use the URL to change their password and log into the account.

For example:

> ```sqlexample
> ALTER USER ra_user1 RESET PASSWORD;
>
> ALTER USER ra_user2 RESET PASSWORD;
> ```

> **Important:**
>
> Each URL can be used only once and expires after 4 hours. However, you can reset the password for a user as often as needed.
>
> For more details, see [ALTER USER](../sql-reference/sql/alter-user.md).

---
title: Configure a task to send error notifications
source: https://docs.snowflake.com/en/user-guide/tasks-errors-integrate.md
section: User Guide
---

# Configure a task to send error notifications

To enable a task to send error notifications, you must associate the task with a notification integration.
You can do this when running the [CREATE TASK](../sql-reference/sql/create-task.md) command to create a new task or
the [ALTER TASK](../sql-reference/sql/alter-task.md) command to modify an existing task.
When running these commands, set ERROR_INTEGRATION to the name of the notification integration.

You only specify the error notification integrations on a root task of a task graph. Any failed child task sends error notifications to
the root task’s specified integration.

Tasks with `TASK_AUTO_RETRY_ATTEMPTS` set to a value greater than `0` send error notifications for each failed task run.

> **Note:**
>
> Creating or modifying a task that references a notification integration requires a role that has the USAGE privilege on the notification
> integration. In addition, the role must have either the CREATE TASK privilege on the schema or the OWNERSHIP privilege on the task.

## Create a new task that sends error notifications

Create a new task using [CREATE TASK](../sql-reference/sql/create-task.md). For descriptions of all available task parameters, see the SQL command
topic:

```sqlsyntax
CREATE TASK <name>
  [...]
  ERROR_INTEGRATION = <integration_name>
  AS <sql>
```

Where:

`ERROR_INTEGRATION = integration_name`
:   Specifies the name of a notification integration created using [CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration.md). For more information, see
    [AWS SNS](notifications/creating-notification-integration-amazon-sns.md), [Google Pub/Sub](notifications/creating-notification-integration-google-pubsub.md), or [Azure Event Grid](notifications/creating-notification-integration-azure-event-grid.md).

The following example creates a serverless task that supports error notifications. The task inserts the current timestamp into a table
column every 5 minutes:

```sqlexample
CREATE TASK mytask
  SCHEDULE = '5 MINUTE'
  ERROR_INTEGRATION = my_notification_int
  AS
  INSERT INTO mytable(ts) VALUES(CURRENT_TIMESTAMP);
```

## Update an existing task to send error notifications

Modify an existing task using [ALTER TASK](../sql-reference/sql/alter-task.md):

```sqlsyntax
ALTER TASK <name> SET ERROR_INTEGRATION = <integration_name>;
```

Where `integration_name` is the name of the notification integration created in one of
[AWS SNS](notifications/creating-notification-integration-amazon-sns.md), [Google Pub/Sub](notifications/creating-notification-integration-google-pubsub.md), or [Azure Event Grid](notifications/creating-notification-integration-azure-event-grid.md) platform level notifications.

For example:

```sqlexample
ALTER TASK mytask SET ERROR_INTEGRATION = my_notification_int;
```

## Task error notification message payload

The body of error messages identifies the task and the errors encountered during a task run.

The following is a sample message payload describing a task error. The payload can include one or more error messages.

```bash
{\"version\":\"1.0\",\"messageId\":\"3ff1eff0-7ad7-493c-9552-c0307087e0c6\",\"messageType\":\"USER_TASK_FAILED\",\"timestamp\":\"2021-11-11T19:46:39.648Z\",\"accountName\":\"AWS_UTEN_DPO_ACC\",\"taskName\":\"AWS_UTEN_DPO_DB.AWS_UTEN_SC.UTEN_AWS_TK1\",\"taskId\":\"01a03962-2b57-889e-0000-000000000001\",\"rootTaskName\":\"AWS_UTEN_DPO_DB.AWS_UTEN_SC.UTEN_AWS_TK1\",\"rootTaskId\":\"01a03962-2b57-889e-0000-000000000001\",\"messages\":[{\"runId\":\"2021-11-11T19:46:23.826Z\",\"scheduledTime\":\"2021-11-11T19:46:23.826Z\",\"queryStartTime\":\"2021-11-11T19:46:24.879Z\",\"completedTime\":\"null\",\"queryId\":\"01a03962-0300-0002-0000-0000000034d8\",\"errorCode\":\"000630\",\"errorMessage\":\"Statement reached its statement or warehouse timeout of 10 second(s) and was canceled.\"}]}
```

Note that you must parse the string into a JSON object to process values in the payload.

---
title: Configure a task to send success notifications
source: https://docs.snowflake.com/en/user-guide/tasks-success-integrate.md
section: User Guide
---

# Configure a task to send success notifications

Snowflake can push success notifications to a cloud messaging service when a task graph completes successfully. This topic provides instructions for configuring success notification support for tasks using cloud messaging.

Success notification integration is only specified on a root task of a task graph. Snowflake only sends success notifications when the entire task graph is successfully executed and will not send notifications for any successfully executed standalone task, which is different from [error notification integration](tasks-errors-integrate.md).

> **Note:**
>
> The task success notification feature is supported for both serverless tasks and user-managed tasks (that is, tasks that rely on a virtual warehouse to provide the compute resources).

To enable a task to send success notifications, you must associate the task with a message notification integration. Follow the task documentation to create a notification integration with [Amazon Web Services Simple Notification Service (AWS SNS)](notifications/creating-notification-integration-amazon-sns.md), [Microsoft Azure Event Grid](notifications/creating-notification-integration-azure-event-grid.md), or [Google Pub/Sub](notifications/creating-notification-integration-google-pubsub.md).

## Create a new task or modifying an existing task to send success notifications

You can associate the task with a notification integration when running the [CREATE TASK](../sql-reference/sql/create-task.md) command to create a new task, or running the [ALTER TASK](../sql-reference/sql/alter-task.md) command to modify an existing task.

> **Note:**
>
> Creating or modifying a task that references a notification integration requires a role that has the USAGE privilege on the notification integration. In addition, the role must have either the CREATE TASK privilege on the schema or the OWNERSHIP privilege on the task, respectively.

```sqlsyntax
CREATE [ OR REPLACE ] TASK [ IF NOT EXISTS ] <name>
    WAREHOUSE = <string>
    [...]
    SUCCESS_INTEGRATION = <integration_name>
```

```sqlsyntax
ALTER TASK <name> SET SUCCESS_INTEGRATION = <integration_name>;
```

Where:

`SUCCESS_INTEGRATION = integration_name`

Name of the notification integration created in one of [AWS SNS](notifications/creating-notification-integration-amazon-sns.md), [Microsoft Azure Event Grid](notifications/creating-notification-integration-azure-event-grid.md), or [Google Pub/Sub](notifications/creating-notification-integration-google-pubsub.md) platform level notifications.

## Display success notifications

You can run [SHOW TASKS](../sql-reference/sql/show-tasks.md) or [DESCRIBE TASK](../sql-reference/sql/desc-task.md) to see task success notifications. Snowflake adds a new column, success_integration, to the output of SHOW TASKS and DESCRIBE TASK. This field displays null for all child tasks. This field displays the name of the graph-level success integration if the notification integration is specified on a root task, and null otherwise.

## Payload

The body of success messages includes information that identifies the task graph, such as rootTaskName, rootTaskID, queryID, and attemptNumber. The following is a sample message payload for a task graph success notification.

```bash
{"version":"1.0",
 "messageId":"3ff1eff0-7ad7-493c-9552-c0307087e0c6",
 "messageType":"GRAPH_SUCCEEDED",
 "timestamp":"2021-11-11T19:46:39.648Z",
 "accountName":"XY12345",
 "rootTaskName":"AWS_UTEN_DPO_DB.AWS_UTEN_SC.UTEN_AWS_TK1",
 "rootTaskId":"01a03962-2b57-889e-0000-000000000001",
 "messages": [{
              "runId":"2021-11-11T19:46:23.826Z",
              "scheduledTime":"2021-11-11T19:46:23.826Z",
              "queryStartTime":"2021-11-11T19:46:24.879Z",
              "graphCompletedTime":"2021-11-11T19:54:24.5591",
              "queryId":"01a03962-0300-0002-0000-0000000034d8",
              "attemptNumber":5
}]}
```

Note that you must parse the string into a JSON object to process values in the payload.

---
title: Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-rest-private.md
section: User Guide
---

# Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity

This topic explains how to configure a [catalog integration](tables-iceberg.md)
for [Apache Iceberg™ tables](tables-iceberg.md) managed in a remote catalog that complies with the
open source [Apache Iceberg™ REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml).

With this configuration, you can use the catalog integration to connect to a remote Iceberg REST catalog through a private IP address
instead of over the public internet.

The following diagram shows how an Iceberg table uses a catalog integration with an external Iceberg catalog.

For general information about outbound private connectivity in Snowflake, including
[outbound private connectivity costs](private-connectivity-outbound.md), see
[Private connectivity for outbound network traffic](private-connectivity-outbound.md).

This topic covers the configuration steps for the following catalog types:

* Generic Iceberg REST catalogs
* AWS Glue Data Catalog
* Databricks Unity Catalog

> **Note:**
>
> * Private connectivity is only supported for catalog integrations on AWS that use AWS PrivateLink and Azure that use Azure Private Link.
> * Private connectivity is only available within the same cloud provider; the catalog and the Snowflake deployment must be running in the same cloud provider.
> * Catalog-vended credentials aren’t supported when you configure a catalog integration with outbound private connectivity.

## Step 1: Gather private connectivity information for your catalog

You must gather private connectivity information to specify it later when you provision a corresponding private connectivity endpoint in the Snowflake VPC or VNet.
When you provision a corresponding private connectivity endpoint, you create an AWS PrivateLink endpoint in Snowflake when your Snowflake
account is hosted in AWS or you create an Azure private endpoint when your Snowflake account is hosted on Azure.

Generic Iceberg REST catalogAWS Glue Data CatalogDatabricks Unity Catalog

* To gather private connectivity information for your catalog, see the documentation for the remote REST Iceberg catalog.

  The following example is an AWS VPC Endpoint Service ID in AWS: `com.amazonaws.vpce.us-west-2.vpce-svc-0123456789abcdef`.

You must find the provider service name and host name for your AWS Glue Data Catalog:

1. To obtain your *provider service name* (`<provider_service_name>`), copy `com.amazonaws.<region>.glue` into your text editor
   where `<region>` is the AWS region where your Iceberg tables are stored.

   An example of a provider service name is `com.amazonaws.us-west-2.glue`. For more information, see [Creating an interface VPC endpoint for AWS Glue](https://docs.aws.amazon.com/glue/latest/dg/vpc-interface-endpoints.html#vpc-endpoint-create)
   in the AWS documentation.
2. To obtain your *host name* (`<host_name>`), copy `glue.<region>.amazonaws.com` into your text editor where `<region>` is the AWS
   region where your Iceberg tables are stored.

   An example of a host name is `glue.us-west-2.amazonaws.com`. For more information, see [Connecting to the Data Catalog using AWS Glue Iceberg REST endpoint](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html)
   in the AWS documentation.

> **Note:**
>
> Alternatively, to retrieve these values, you can use the describe-vpc-endpoint-services subcommand from the AWS command line. For
> more information, see [Provision private connectivity endpoints](private-manage-endpoints-aws.md).

AWSAzure

You must find the PrivateLink VPC endpoint service ID for your Databricks Unity
Catalog and your Databricks workspace host name:

1. To find your *PrivateLink VPC endpoint service ID* (`<vpc_endpoint_service_id>`), see [PrivateLink VPC endpoint services](https://docs.databricks.com/aws/en/resources/ip-domain-region#privatelink) in the Databricks documentation.

   This topic contains the list of
   the VPC endpoint service IDs for each AWS region.
2. Copy the endpoint service ID for the region where your tables are hosted,
   which is the value for *Workspace (including REST API)*, into a text editor.

   An example of a VPC endpoint service ID is `com.amazonaws.vpce.us-west-2.vpce-svc-0129f463fcfbc46c5`.

   For more information about
   PrivateLink at Databricks, see [Configure Front-end PrivateLink](https://docs.databricks.com/aws/security/network/front-end/front-end-private-connect) in the Databricks documentation.
3. To find your *Databricks workspace host name* (`<databricks_workspace_host_name>`), follow these steps:

   1. Retrieve your Databricks workspace URL.

      For instructions, see
      [Get identifiers for workspace objects](https://docs.databricks.com/aws/en/workspace/workspace-details) in the
      Databricks documentation.

      This topic includes an example Databricks workspace URL.
   2. Copy your Databricks workspace URL into a text editor.
   3. Remove `https://` from your Databricks workspace URL.

      The resulting value is your Databricks workspace host name.

      For example, if your Databricks per-workspace URL is `https://dbc-a1a11111-1a11.cloud.databricks.com`, your
      Databricks workspace host name is `dbc-a1a11111-1a11.cloud.databricks.com`.

You must find the resource ID for your Databricks workspace in the Azure portal and your Databricks workspace host name:

1. To find the *resource ID for your Databricks workspace in the Azure portal* (`<databricks_workspace_resource_id>`), follow these steps:

   1. In the Azure portal, navigate to your Databricks workspace.
   2. On the **Overview** page, in the **Essentials** section, select the **JSON View** link.

      The resource ID for your Databricks workspace is displayed in the **Resource ID** field. An example of this resource ID is
      `/subscriptions/1111-22-333-4444-55555/resourceGroups/my-rg/providers/Microsoft.Databricks/workspaces/my-databricks-workspace`.
   3. Copy the resource ID into a text editor.
2. To find your *Databricks workspace host name* (`<databricks_workspace_host_name>`), follow these steps:

   1. Retrieve your Databricks per-workspace URL.

      For instructions, see
      [Determine per-workspace URL](https://learn.microsoft.com/en-us/azure/databricks/workspace/workspace-details#determine-per-workspace-url)
      in the Azure Databricks documentation.
   2. Copy your Databricks per-workspace URL into a text editor.
   3. Remove `https://` from your Databricks per-workspace URL.

      The resulting value is your Databricks workspace host name.

      For example, if your Databricks per-workspace URL is `https://adb-1234567890123456.12.azuredatabricks.net`, your
      Databricks workspace host name is `adb-1234567890123456.12.azuredatabricks.net`.

## Step 2: Provision a private connectivity endpoint

In this step, you provision a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to the remote
Iceberg REST catalog by using private connectivity.

Generic Iceberg REST catalogAWS Glue Data CatalogDatabricks Unity Catalog

* To provision a private connectivity endpoint, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function.

  For instructions on specifying the arguments for this system function, see the documentation for the remote REST Iceberg catalog
  that you want to connect to through private connectivity.

  The following code block shows an example of provisioning an AWS PrivateLink endpoint:

  ```sqlexample
  SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
    'com.amazonaws.vpce.us-west-2.vpce-svc-0123456789abcdef',
    'my.catalog.com'
    );
  ```

* To provision a private connectivity endpoint, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
    '<provider_service_name>',
    '<host_name>'
  );
  ```

  Where:

  + `<provider_service_name>` is the provider service name that you copied when you gathered private connectivity information for your catalog.
  + `<host_name>` is the host name that you copied when you gathered private connectivity information for your catalog.

  For example:

  ```sqlexample
  SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
    'com.amazonaws.<region>.glue',
    'glue.<region>.amazonaws.com'
    );
  ```

  > **Note:**
  >
  > You only need to provision one private connectivity endpoint in the Snowflake VPC. This is because, with AWS Glue, you can use one Glue private connectivity endpoint to
  > access everything managed by the AWS Glue Data Catalog in the same region. For more information, see [Creating an interface VPC endpoint for AWS Glue](https://docs.aws.amazon.com/glue/latest/dg/vpc-interface-endpoints.html#vpc-endpoint-create)
  > in the AWS documentation.

You only need to provision one private connectivity endpoint. Unity requires just one private connectivity endpoint to access everything managed by the Unity Data Catalog in the same region.

AWSAzure

* To provision a private connectivity endpoint, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function:

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  >
  > SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  >   '<vpc_endpoint_service_id>',
  >   '<databricks_workspace_host_name>'
  > );
  > ```
  >
  > Where:
  >
  > + `<vpc_endpoint_service_id>` is the PrivateLink VPC endpoint service ID that you copied when you
  >   gathered private connectivity information for your catalog
  > + `<databricks_workspace_host_name>` is the Databricks workspace host name that you retrieved when you
  >   gathered private connectivity information for your catalog
  >
  >   > **Note:**
  >   >
  >   > If you have multiple Databricks workspaces in the same AWS region, you can use a wildcard with your Databricks workspace URL.
  >
  > For example:
  >
  > ```sqlexample
  > SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  >   'com.amazonaws.vpce.us-west-2.vpce-svc-0129f463fcfbc46c5',
  >   'dbc-a1a11111-1a11.cloud.databricks.com'
  > );
  > ```

* To provision a private connectivity endpoint, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function:

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  >
  > SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  >   '<databricks_workspace_resource_id>',
  >   '<databricks_workspace_host_name>',
  >   'databricks_ui_api'
  > );
  > ```
  >
  > Where:
  >
  > + `<<databricks_workspace_resource_id>>` is the resource ID for your Databricks workspace in the Azure portal that
  >   you copied when you
  >   gathered private connectivity information for your catalog.
  > + `<databricks_workspace_host_name>` is the Databricks workspace host name that you retrieved when you
  >   gathered private connectivity information for your catalog.
  > + `databricks_ui_ap` is the sub-resource value for an Azure Databricks workspace.
  >
  > For example:
  >
  > ```sqlexample
  > SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  >   '/subscriptions/1111-22-333-4444-55555/resourceGroups/my-rg/providers/Microsoft.Databricks/workspaces/my-databricks-workspace',
  >   'adb-1234567890123456.12.azuredatabricks.net',
  >   'databricks_ui_api'
  > );
  > ```

## Step 3: Verify the endpoint status

In this step, you verify the endpoint status of the private connectivity endpoint in the Snowflake VPC or VNet that you provisioned in the
previous step.

* To verify the endpoint status, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) system function:

  ```sqlexample
  SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
  ```

  The endpoint is ready to use when the `status` changes from `pending` to `available`.

## Step 4: Additional catalog-specific configuration

Complete the additional configuration steps for your catalog type.

> **Note:**
>
> For some catalogs or some types of private connectivity endpoints, you also need to approve the connection or allowlist the private
> connectivity endpoints on the catalog server side.

Generic Iceberg REST catalogAWS Glue Data CatalogDatabricks Unity Catalog

* To complete the additional configuration steps, see the documentation for the remote REST Iceberg catalog, and then proceed
  to the next step.

No additional configuration is required. Proceed to the next step.

In this step, you register the Snowflake endpoint in Databricks to accept the traffic coming from the VPC endpoint.

AWSAzure

**Complete configuration steps in Databricks**

Before you register the Snowflake VPC endpoint, ensure that you complete the following configurations in Databricks:

* Your workspace must be located in a customer-managed VPC.
* Your Databricks account must be in the enterprise subscription.
* You must set up a private access configuration.

For more information, see [Azure Databricks: Configure Front-end PrivateLink](https://docs.databricks.com/aws/en/security/network/front-end/front-end-private-connect) in the Databricks documentation.

**Register the Snowflake VPC endpoint**

To register the VPC endpoint, complete the following steps:

1. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) system function, and then copy the value for `snowflake_endpoint_name` in the response:

   ```sqlexample
   SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
   ```

   For example, the output to copy looks like `vpce-11111aaaa11aaaa11`. This value is the VPC endpoint ID in your Snowflake account.
2. In Databricks, register the Snowflake VPC endpoint ID by specifying the VPC endpoint ID value that you copied in the previous step.

   For instructions, see [Manage VPC endpoint registrations](https://docs.databricks.com/aws/en/security/network/classic/vpc-endpoints) in the
   Databricks documentation.
3. In Databricks, add a private access setting, and then specify the VPC endpoint that you registered in the previous step.

   For instructions, see
   [Manage private access settings](https://docs.databricks.com/aws/en/security/network/classic/private-access-settings) in the Databricks
   documentation.

**Complete configuration steps in Databricks**

Before you register the Snowflake VPC endpoint, ensure that you complete the required configurations in Databricks, which
includes deploying Azure Databricks in your Azure virtual network. For all these required configurations, see
[Requirements for configuring Front-end Private Link](https://learn.microsoft.com/en-us/azure/databricks/security/network/front-end/front-end-private-connect#requirements)
in the Azure Databricks documentation.

**Approve the private connectivity from Snowflake**

To approve the private connectivity from Snowflake, complete the following steps:

1. In the Azure portal, navigate to your Azure Databricks workspace.
2. In the sidebar, click Networking.
3. Click Private endpoint connections.
4. From the list of private endpoint connections, click the checkbox next to the private endpoint connection that you want to approve.
5. Above the list, click the Approve button.

## Step 5: Create a catalog integration

In this step, to enable private connectivity, you configure a catalog integration for the catalog REST endpoint.

Generic Iceberg REST catalogAWS Glue Data CatalogDatabricks Unity Catalog

* To configure this catalog integration, run the [CREATE CATALOG INTEGRATION](../sql-reference/sql/create-catalog-integration-rest.md) command.

  For example:

  ```sqlexample
  CREATE OR REPLACE CATALOG INTEGRATION iceberg_rest_catalog_cat_int_private
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_URI = '<rest_api_endpoint_url>'
      CATALOG_API_TYPE = PRIVATE
      CATALOG_NAME = '<catalog_name>'
    )
    REST_AUTHENTICATION = (
      TYPE = OAUTH
      OAUTH_TOKEN_URI = '<token_server_uri>'
      OAUTH_CLIENT_ID = '<oauth_client_id>'
      OAUTH_CLIENT_SECRET = '<oauth_client_secret>'
      OAUTH_ALLOWED_SCOPES = ('all-apis', 'sql')
  )
  ENABLED = true;
  ```

  > **Important:**
  >
  > To use outbound private connectivity, you must specify `CATALOG_API_TYPE=PRIVATE` when you create the integration.

  For more information, including the supported authentication methods, see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md).

* To configure this catalog integration, follow the steps in [Configure a catalog integration for AWS Glue Iceberg REST](tables-iceberg-configure-catalog-integration-rest-glue.md).

  > **Important:**
  >
  > To use outbound private connectivity, you must specify `CATALOG_API_TYPE = AWS_PRIVATE_GLUE` when you create the integration
  > instead of `CATALOG_API_TYPE = AWS_GLUE`.

  For example:

  ```sqlexample
  CREATE CATALOG INTEGRATION glue_rest_catalog_int
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'
      CATALOG_API_TYPE = AWS_PRIVATE_GLUE
      CATALOG_NAME = '123456789012'
    )
    REST_AUTHENTICATION = (
      TYPE = SIGV4
      SIGV4_IAM_ROLE = 'arn:aws:iam::123456789012:role/my-role'
      SIGV4_SIGNING_REGION = 'us-west-2'
    )
    ENABLED = TRUE;
  ```

AWSAzure

* To create a REST catalog integration to connect to Databricks Unity Catalog, use the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command.

  > **Important:**
  > + To use outbound private connectivity, you must specify `CATALOG_API_TYPE = PRIVATE` as one of the `REST_CONFIG`
  >   parameters when you create the integration.
  > + For `CATALOG_URI` and `OAUTH_TOKEN_URI`, you must use the standard public hostname, which is your Databricks workspace URL,
  >   *not* the name of the private endpoint. Snowflake automatically routes traffic through the provisioned private endpoint when
  >   `CATALOG_API_TYPE` is set to `PRIVATE`. To find your Databricks workspace URL, see
  >   [Get identifiers for workspace objects](https://docs.databricks.com/aws/en/workspace/workspace-details)
  >   in the Databricks documentation.

  **Example: Bearer token authentication**

  To create a bearer token, which is called a personal access token (PAT) in Databricks, see [Databricks on AWS: Create personal access tokens for workspace users](https://docs.databricks.com/aws/en/dev-tools/auth/pat#create-personal-access-tokens-for-workspace-users)
  in the Databricks documentation.

  ```sqlexample
  CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_private_pat
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_URI = 'https://dbc-a1a11111-1a11.cloud.databricks.com/api/2.1/unity-catalog/iceberg-rest'
      CATALOG_NAME = '<catalog_name>'
      CATALOG_API_TYPE = PRIVATE
    )
    REST_AUTHENTICATION = (
      TYPE = BEARER
      BEARER_TOKEN = 'eyAbCD...eyDeF...'
    )
    ENABLED = TRUE;
  ```

  **Example: OAuth authentication with service principal**

  The following example uses OAuth authentication with a Databricks service principal. You must have a service principal
  configured in Databricks with the necessary credentials, which are the `client_id` and `client_secret`. For instructions on adding a
  service principal, see [Databricks on AWS: Add service principals to your account](https://docs.databricks.com/aws/en/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
  in the Databricks documentation.

  ```sqlexample
  USE ROLE ACCOUNTADMIN;

  CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_private_oauth
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_API_TYPE = PRIVATE
      CATALOG_URI = '<databricks_workspace_url>/api/2.1/unity-catalog/iceberg-rest'
      CATALOG_NAME = '<catalog_name>'
    )
    REST_AUTHENTICATION = (
      TYPE = OAUTH
      OAUTH_TOKEN_URI = '<databricks_workspace_url>/oidc/v1/token'
      OAUTH_CLIENT_ID = '<your_databricks_client_id>''
      OAUTH_CLIENT_SECRET = '<your_databricks_client_secret>'
      OAUTH_ALLOWED_SCOPES = ('all-apis', 'sql')
    )
    ENABLED = TRUE;
  ```

  Where:

  + `<databricks_workspace_url>` is your Databricks workspace URL, which you retrieved when you
    gathered private connectivity information for your catalog.
    For example, `https://dbc-a1a11111-1a11.cloud.databricks.com` is a Databricks workspace URL.

* To create a REST catalog integration to connect to Databricks Unity Catalog, use the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) command.

  > **Important:**
  > + To use outbound private connectivity, you must specify `CATALOG_API_TYPE = PRIVATE` as one of the `REST_CONFIG`
  >   parameters when you create the integration.
  > + For `CATALOG_URI` and `OAUTH_TOKEN_URI`, you must use the standard public hostname, which is your Databricks workspace URL,
  >   *not* the name of the private endpoint. Snowflake automatically routes traffic through the provisioned private endpoint when
  >   `CATALOG_API_TYPE` is set to `PRIVATE`. To find your Databricks workspace URL, see
  >   [Determine per-workspace URL](https://learn.microsoft.com/en-us/azure/databricks/workspace/workspace-details#determine-per-workspace-url)
  >   in the Azure Databricks documentation.

  **Example: Bearer token authentication**

  To create a bearer token, which is called a personal access token (PAT) in Databricks, see [Azure Databricks: Create personal access tokens for workspace users](https://learn.microsoft.com/en-us/azure/databricks/dev-tools/auth/pat#create-personal-access-tokens-for-workspace-users)
  in the Azure Databricks documentation.

  ```sqlexample
  CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_private_pat
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_URI = 'https://my-workspace.azuredatabricks.net/api/2.1/unity-catalog/iceberg-rest'
      CATALOG_NAME = '<catalog_name>'
      CATALOG_API_TYPE = PRIVATE
    )
    REST_AUTHENTICATION = (
      TYPE = BEARER
      BEARER_TOKEN = 'eyAbCD...eyDeF...'
    )
    ENABLED = TRUE;
  ```

  **Example: OAuth authentication with service principal**

  The following example uses OAuth authentication with a Databricks service principal. You must have a service principal
  configured in Databricks with the necessary credentials, which are the `client_id` and `client_secret`. For instructions on adding a
  service principal, see [Azure Databricks: Add service principals to your account](https://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
  in the Databricks documentation.

  ```sqlexample
  USE ROLE ACCOUNTADMIN;

  CREATE OR REPLACE CATALOG INTEGRATION unity_catalog_int_private_oauth
    CATALOG_SOURCE = ICEBERG_REST
    TABLE_FORMAT = ICEBERG
    REST_CONFIG = (
      CATALOG_API_TYPE = PRIVATE
      CATALOG_URI = '<databricks_per_workspace_url>/api/2.1/unity-catalog/iceberg-rest'
      CATALOG_NAME = '<catalog_name>'
    )
    REST_AUTHENTICATION = (
      TYPE = OAUTH
      OAUTH_TOKEN_URI = '<databricks_per_workspace_url>/oidc/v1/token'
      OAUTH_CLIENT_ID = '<your_databricks_client_id>'
      OAUTH_CLIENT_SECRET = '<your_databricks_client_secret>'
      OAUTH_ALLOWED_SCOPES = ('all-apis', 'sql')
    )
    ENABLED = TRUE;
  ```

  Where:

  + `<databricks_per_workspace_url>` is your Databricks per-workspace URL, which you retrieved when you
    gathered private connectivity information for your catalog.
    For example, `https://adb-1234567890123456.12.azuredatabricks.net` is a Databricks per-workspace URL.

## Step 6: Verify your catalog integration

* To verify your catalog integration configuration, call the SYSTEM$VERIFY_CATALOG_INTEGRATION function.

  For more information, including an example, see [Use SYSTEM$VERIFY_CATALOG_INTEGRATION to check your catalog integration configuration](tables-iceberg-configure-catalog-integration-rest-check-config.md).

## (Optional) Step 7: Update your catalog configuration

We recommend that you update the configuration for your remote catalog so that it’s only accessible through private connectivity.

Generic Iceberg REST catalogAWS Glue Data CatalogDatabricks Unity Catalog

* To disable public access to your catalog, see the documentation for the remote catalog that you want to connect to through private connectivity.

AWS Glue Data Catalog doesn’t support restricting access to only allowlisted VPC endpoints.

AWSAzure

* To disable public access to your catalog, see [Databricks on AWS: Configure Front-end PrivateLink](https://docs.databricks.com/aws/en/security/network/front-end/front-end-private-connect)
  in the Databricks documentation.

* To disable public access to your catalog, see [Configure Front-end Private Link](https://learn.microsoft.com/en-us/azure/databricks/security/network/front-end/front-end-private-connect)
  in the Azure Databricks documentation.

## Next steps

This section contains some tasks that you can perform after you configure your catalog integration:

* Monitor your private connectivity endpoints
* Configure an external volume with outbound private connectivity
* Create a catalog-linked database
* Write to your remote catalog

### Monitor your private connectivity endpoints

* To monitor your private connectivity endpoints, see [OUTBOUND_PRIVATELINK_ENDPOINTS view](../sql-reference/account-usage/outbound_privatelink_endpoints.md)
  in the ACCOUNT_USAGE schema.
* To explore the cost of your private connectivity endpoints, see [Outbound private connectivity costs](private-connectivity-outbound.md).

### Configure an external volume with outbound private connectivity

> * To enable private connectivity between Snowflake and your storage buckets, configure an [external volume](tables-iceberg.md)
>   with outbound private connectivity.
>
>   For more information about external volumes, see [Configure an external volume](tables-iceberg-configure-external-volume.md).
>
>   > **Note:**
>   >
>   > Catalog-vended credentials aren’t supported when you configure a catalog integration with outbound private connectivity.

* To configure an external volume with outbound private connectivity, follow the instructions for your cloud provider:

  + **AWS**: [Private connectivity to external volumes for Amazon Web Services](tables-iceberg-configure-external-volume-s3-private.md)
  + **Azure**: [Private connectivity to external volumes for Microsoft Azure](tables-iceberg-configure-external-volume-azure-private.md)

### Create a catalog-linked database

* To create a Snowflake database that is connected to your external Iceberg REST catalog, create a catalog-linked database.

  For more information, see [Create a catalog-linked database](tables-iceberg-catalog-linked-database.md).

  > **Note:**
  >
  > When you create the catalog-linked database, specify a catalog integration that is configured with outbound private connectivity.

### Write to your remote catalog

After you configure a catalog integration for Apache Iceberg™ REST and create a catalog-linked database, you can write to your remote catalog.

* To write to your remote catalog, see [Write to your remote catalog](tables-iceberg-catalog-linked-database.md).

---
title: Configure an Azure container for loading data
source: https://docs.snowflake.com/en/user-guide/data-load-azure-config.md
section: User Guide
---

# Configure an Azure container for loading data

Configure secure access to data files stored in a Microsoft Azure container.

Snowflake supports the following options:

Option 1:
:   Configure a storage integration object to delegate authentication responsibility for external cloud storage to an Azure service principal. A service principal is an identity created for use with services such as Snowflake to access Azure resources.

    This option makes it easier to manage access for multiple users
    to different resources in Azure storage. A storage integration stores your secrets
    so that you don’t need to supply a SAS token every time you create an external stage.

    > **Note:**
    >
    > * Accessing Azure blob storage in [government regions](intro-regions.md) using a storage integration is limited to Snowflake
    >   accounts hosted on Azure in the same government region. Accessing your blob storage from an account hosted outside of the government
    >   region using direct credentials is supported.
    > * Confirm that Snowflake supports the Azure region that your storage is hosted in. For more information, see
    >   [Supported cloud regions](intro-regions.md).

Option 2:
:   Generate a shared access signature (SAS) token to grant Snowflake limited access to objects in your storage account. You can then access an external (Azure) stage that references the container using the SAS token.

> **Note:**
>
> * For the OneLake URL format, see [CREATE STAGE](../sql-reference/sql/create-stage.md).
> * Completing the instructions in this topic requires Azure administrative access. If you’re not an Azure administrator,
>   ask your Azure administrator to perform these tasks.
> * To improve query performance for an Azure external stage, configure your network routing to use
>   [Microsoft network routing](https://learn.microsoft.com/en-us/azure/storage/common/network-routing-preference#microsoft-global-network-versus-internet-routing).
>   For instructions, see the [Azure documentation](https://learn.microsoft.com/en-us/azure/storage/common/configure-network-routing-preference?tabs=azure-portal).

## Option 1: Configure a Snowflake storage integration

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Azure container referenced in an external (Azure) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an Azure identity and access management (IAM) user ID called the *app registration*. An administrator in your organization grants this app the necessary permissions in the Azure account.

An integration must also specify containers (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> Completing the instructions in this section requires permissions in Azure to manage storage accounts. If you are not an Azure administrator, ask your Azure administrator to perform these tasks.

**In this Section:**

### Step 1: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake object that stores a generated service principal for your Azure cloud storage, along with an optional set of allowed or blocked storage locations (that is, containers). Cloud provider administrators in your organization grant permissions on the storage locations to the generated service principal. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, Azure) stages. The URL in the stage definition must align with the Azure containers (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'AZURE'
  ENABLED = TRUE
  AZURE_TENANT_ID = '<tenant_id>'
  STORAGE_ALLOWED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `tenant_id` is the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can authenticate to only one tenant, so the allowed and blocked storage locations must refer to storage accounts that all belong this tenant.

  To find your tenant ID, sign in to the Azure portal and click Azure Active Directory » Properties. The tenant ID is displayed in the Tenant ID field.
* `container` is the name of an Azure container that stores your data files (for example, `mycontainer`). The STORAGE_ALLOWED_LOCATIONS and STORAGE_BLOCKED_LOCATIONS parameters allow or block access to these containers, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over logical directories in the container.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two containers and paths. In a later step, we will create an external stage that references one of these containers and paths. Multiple external stages that use this integration can reference the allowed containers and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>   STORAGE_ALLOWED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/')
>   STORAGE_BLOCKED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/sensitivedata/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/sensitivedata/');
> ```

### Step 2: Grant Snowflake Access to the Storage Locations

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC STORAGE INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage Integration in Snowflake.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed storage locations.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to be granted an access token on specified resources inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Services » Storage Accounts. Click the name of the storage account you are granting the Snowflake service principal access to.
6. Click Access Control (IAM) » Add role assignment.
7. Select the desired role to grant to the Snowflake service principal:

   * `Storage Blob Data Reader` grants read access only. This allows loading data from files staged in the storage account.
   * `Storage Blob Data Contributor` grants read and write access. This allows loading data from or unloading data to files staged in
     the storage account. The role also allows executing the [REMOVE](../sql-reference/sql/remove.md) command to remove files staged in the
     storage account.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC STORAGE INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the storage integration stops working.
9. Click the Review + assign button.

   > **Note:**
   > * According to the Microsoft Azure documentation, role assignments may take up to five minutes to propagate.
   > * Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

### Step 3: Create an external stage

Create an external (Azure) stage that references the storage integration you created in Step 1: Create a Cloud Storage Integration in Snowflake (in this topic).

> **Note:**
>
> * Creating a stage that uses a storage integration requires a role that has the CREATE STAGE privilege for the schema as well as the USAGE privilege on the integration. For example:
>
>   ```sqlexample
>   GRANT CREATE STAGE ON SCHEMA public TO ROLE myrole;
>
>   GRANT USAGE ON INTEGRATION azure_int TO ROLE myrole;
>   ```
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.
> * Append a forward slash (`/`) to the URL value to filter to the specified folder path. If the forward slash is omitted, all files and
>   folders starting with the prefix for the specified path are included.
>
>   Note that the forward slash is required to access and retrieve unstructured data files in the stage.

Create the stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.

For example, set `mydb.public` as the current database and schema for the user session, and then create a stage named `my_azure_stage`. In this example, the stage references the Azure container and path `mycontainer1/path1`, which are supported by the integration. The stage also references a named file format object called `my_csv_format`:

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE my_azure_stage
>   STORAGE_INTEGRATION = azure_int
>   URL = 'azure://myaccount.blob.core.windows.net/container1/path1'
>   FILE_FORMAT = my_csv_format;
> ```

> **Note:**
>
> * The stage owner (i.e. the role with the OWNERSHIP privilege on the stage) must have the USAGE privilege on the storage integration.
> * To load or unload data from or to a stage that uses an integration, a role must have the USAGE privilege on the stage. It is not necessary to also have the USAGE privilege on the storage integration.
> * Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
> * The STORAGE_INTEGRATION parameter is handled separately from other stage parameters, such as FILE_FORMAT. Support for these other parameters is the same regardless of the integration used to access your Azure container.

## Option 2: Generate a SAS token

### Step 1: Generate the SAS token

The following step-by-step instructions describe how to generate an SAS token to grant Snowflake limited access to objects in your storage account:

1. Log into the Azure portal.
2. From the home dashboard, choose Storage Accounts » *<storage_account>*. Under Security + networking, choose Shared access signature.
3. Select the following **Allowed services**:

   * `Blob`
4. Select the following **Allowed resource types**:

   * `Container` (required to list objects in the storage account)
   * `Object` (required to read/write objects from/to the storage account)
5. Select the following allowed permissions to load data files from Azure resources:

   * Read
   * List

   The `Write`, `Add`, and `Create` permissions are also required if you plan to unload files to a container. In addition, to use the `PURGE = TRUE` option, the `Permanent Delete` permission is required.
6. Specify start and expiry dates/times for the SAS token. As part of a general security plan, you could generate a different SAS token periodically.
7. Leave the **Allowed IP addresses** field blank, and specify either HTTPS only or HTTPS and HTTP under Allowed protocols.
8. Click the Generate SAS and connection string button. Record the full value in the SAS token field, starting with and including the `?`. This is your SAS token. You will specify this token when you create an external stage.

### Step 2: Create an external stage

Create an external (Azure) stage that references the SAS token you generated in Step 1: Generate the SAS Token (in this topic).

The following example uses SQL to create an external stage named `my_azure_stage` that includes Azure credentials and a
[master encryption key](https://csrc.nist.gov/glossary/term/master_key). The stage URL references the Azure `myaccount` account. The
data files are stored in the `mycontainer` container and `/load/files` path. The stage references a named file format object called
`my_csv_format`. Note that the example truncates the `MASTER_KEY` value:

> ```sqlexample
> CREATE OR REPLACE STAGE my_azure_stage
>   URL='azure://myaccount.blob.core.windows.net/mycontainer/load/files'
>   CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=bgqQwoXwxzuD2GJfagRg7VOS8hzNr3QLT7rhS8OFRLQ%3D')
>   ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'kPx...')
>   FILE_FORMAT = my_csv_format;
> ```

Note that the AZURE_SAS_TOKEN and MASTER_KEY values used in this example are for illustration purposes only.

> **Note:**
>
> By specifying a named file format object (or individual file format options) for the stage, it is not necessary to later specify the same file format options in the COPY command used to load data from
> the stage. For more information about file format objects and options, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).

## Data file encryption

Enable Azure Storage Service Encryption (SSE) for Data at Rest on your storage account directly, and Snowflake will handle it correctly. For more information, see the [Azure documentation on SSE](https://docs.microsoft.com/en-us/azure/storage/storage-service-encryption).

In addition, Snowflake supports client-side encryption to decrypt files staged in Azure containers.

* Client-side encryption:

  > + AZURE_CSE: Requires a MASTER_KEY value. For information, see the [Client-side encryption information](https://docs.microsoft.com/en-us/azure/storage/common/storage-client-side-encryption) in the Microsoft Azure documentation.
  >
  >   > **Note:**
  >   >
  >   > *Block blobs* and *append blobs* support client-side encryption but *page blobs* do not.

**Next:** [Create an Azure stage](data-load-azure-create-stage.md)

---
title: Configure an external volume
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume.md
section: User Guide
---

# Configure an external volume

An external volume is a named, account-level Snowflake object that you use to connect Snowflake to your
external cloud storage for Iceberg tables. An external volume stores an identity and access management (IAM) entity
for your storage location. Snowflake uses the IAM entity to securely connect to your storage for accessing
table data, Iceberg metadata, and manifest files that store the table schema, partitions, and other metadata.

A single external volume can support one or more Iceberg tables.

You must create an external volume before you can create an Apache Iceberg™ table in Snowflake.

## Create an external volume

The steps to create an external volume depend on your cloud storage provider.

For instructions, see the following topics:

* [Amazon S3](tables-iceberg-configure-external-volume-s3.md)
* [Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
* [Azure Storage](tables-iceberg-configure-external-volume-azure.md)
* [S3-compatible storage](tables-iceberg-s3-compatible.md)

Each external volume is associated with a particular [Active storage location](tables-iceberg-storage.md),
and a single external volume can support multiple Iceberg tables. However, the number of external volumes you need depends on how you want to store,
organize, and secure your table data.

You can use a single external volume if you want the data and metadata
for *all* of your Snowflake-Iceberg tables in subdirectories under the same storage location (for example, in the same S3 bucket).
To configure these directories for Snowflake-managed tables, see [Data and metadata directories](tables-iceberg-storage.md).

Alternatively, you can create multiple external volumes to secure various storage locations differently. For example,
you might create the following external volumes:

* A read-only external volume for externally managed Iceberg tables.
* An external volume configured with read and write access for Snowflake-managed tables.

## Verify an external volume

Verify an external volume to check that Snowflake can successfully authenticate to your storage provider using an external volume that
you’ve configured. You can verify an external volume by using SQL or Snowsight.

### Use SQL

To verify an external volume by using SQL,
call the [SYSTEM$VERIFY_EXTERNAL_VOLUME](../sql-reference/functions/system_verify_external_volume.md)
function.

Specify the name of the external volume that you want to verify.

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_s3_external_volume');
```

### Use Snowsight

To verify an external volume by using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » External data.
3. Select the External volumes tab.
4. Select the external volume whose connection you want to verify.
5. Select … » Verify connection.

## Set a default external volume at the account, database, or schema level

You can either set an existing external volume as the default or
set a new external volume as the default when you create it.

### Set an existing external volume as the default

To set an existing external volume as the default to use for Iceberg tables,
you can set the [EXTERNAL_VOLUME](../sql-reference/parameters.md) parameter at the following levels:

Account:
:   Account administrators can use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the parameter for the account.
    If the value is set for the account, all Iceberg tables created in the account read from and write to this external volume by default.

Object:
:   Users can execute the appropriate [CREATE <object>](../sql-reference/sql/create.md) or [ALTER <object>](../sql-reference/sql/alter.md) command
    to override the [EXTERNAL_VOLUME](../sql-reference/parameters.md) parameter value at the database or schema level.
    The lowest-scoped declaration is used: schema > database > account.

    In addition to the minimum privileges required to modify an object using the appropriate ALTER *<object_type>* command,
    a role must have the USAGE privilege on the external volume.

> **Note:**
>
> * Changes to the EXTERNAL_VOLUME parameter only apply to tables created *after* the change. Existing tables continue to use the external
>   volume specified when they were created.
> * Alternatively, you can set a default external volume at the account, database, or schema level when you create the external volume by
>   using Snowsight.

#### Example

The following statement sets an external volume (`my_s3_vol`) for a database named `my_database_1`:

```sqlexample
ALTER DATABASE my_database_1
  SET EXTERNAL_VOLUME = 'my_s3_vol';
```

After setting an external volume at the database level, you can create an Iceberg table in that database
without specifying an external volume. The following statement creates an Iceberg table in `my_database_1`
that uses Snowflake as the catalog and uses the default external volume (`my_s3_vol`) set for the database.

```sqlexample
CREATE ICEBERG TABLE iceberg_reviews_table (
  id STRING,
  product_name STRING,
  product_id STRING,
  reviewer_name STRING,
  review_date DATE,
  review STRING
)
CATALOG = 'SNOWFLAKE'
BASE_LOCATION = 'my/product_reviews/';
```

### Set a new external volume as the default

To set a new external volume as the default to use for Iceberg tables, when you create the external volume in Snowsight,
use the Scope
field in the configuration settings to set the external volume as the default at the account, database, or schema level.

For instructions on how to create an external volume in Snowsight, see the following sections:

* [Amazon S3](tables-iceberg-configure-external-volume-s3.md)
* [Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
* [Azure Storage](tables-iceberg-configure-external-volume-azure.md)
* [S3-compatible storage](tables-iceberg-s3-compatible.md)

## Grant USAGE privileges to an external volume by using Snowsight

The USAGE privilege grants the ability to reference the external volume and view details for the external volume. For more information,
see [External volume privileges](security-access-control-privileges.md).

To grant USAGE privileges to an external volume by using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role that has OWNERSHIP privileges on the external volume that you want to grant USAGE privileges to.

   For instructions on switching a role, see [Switch your primary role](ui-snowsight-gs.md). For more information on the OWNERSHIP privilege,
   see [External volume privileges](security-access-control-privileges.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select the external volume that you want to grant USAGE privileges on.
6. Select + Privilege.
7. From the Roles field, select the role that you want to grant the USAGE privilege for the external volume.
8. From the Privileges field, select USAGE.
9. Select Grant privileges.

## Add a storage location by using Snowsight

> **Note:**
>
> To add a storage location to an external volume by using SQL, use the ADD STORAGE_LOCATION parameter of the [ALTER EXTERNAL VOLUME](../sql-reference/sql/alter-external-volume.md) command.

To add a named storage location to an external volume by using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role that has OWNERSHIP privilege on the external volume for which you want to add a storage location.

   For instructions, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select the external volume that you want to add a storage location to.
6. Select … » Add storage location
7. Select your cloud storage provider and specify the configuration for the storage location that you’re adding:

   Amazon S3Microsoft AzureGoogle CloudS3 Compatible

   1. Select the Amazon S3 tab.
   2. Specify the configuration for the storage location that you’re adding:

      | Field | Description |
      | --- | --- |
      | Location name | Enter a name for your additional storage location. |
      | Region type | Specifies the cloud storage provider that stores your data files.  * Standard (default): S3 storage in public AWS regions outside of China. * Government (GovCloud): S3 storage in AWS [government regions](intro-regions.md). |
      | S3 role ARN | Specifies the case-sensitive Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket containing your data files.  You recorded this value when you [created an IAM role](tables-iceberg-configure-external-volume-s3.md). |
      | Encryption (optional) | Specifies the encryption type used. Possible values are:  * None (default): No encryption. * SSE-S3: Server-side encryption using S3-managed encryption keys. For more information, see [Using server-side   encryption with Amazon S3-managed encryption keys (SSE-S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingServerSideEncryption.html). * SSE-KMS (enter key): Server-side encryption using keys stored in KMS. For more information, see [Using server-side   encryption with AWS Key Management Service (SSE-KMS)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingKMSEncryption.html). |
      | Connectivity | Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see [Private connectivity to external volumes for Amazon Web Services](tables-iceberg-configure-external-volume-s3-private.md). Possible values are:  * Public (default): Use the public internet. * Private (AWS PrivateLink): Use outbound private connectivity. |
      | Storage base URL | Specifies the base URL for your additional storage location. |

   1. Select the Microsoft Azure tab.
   2. Specify the configuration for the storage location that you’re adding:

      | Field | Description |
      | --- | --- |
      | Location name | Enter a name for your additional storage location. |
      | Storage base URL | Specifies the base URL for your additional storage location. |
      | Azure tenant ID | specify your Azure tenant ID.  To find your Azure tenant ID, see [How to find your Microsoft Entra tenant ID](https://learn.microsoft.com/en-us/entra/fundamentals/how-to-find-tenant) in the Microsoft Entra documentation. |
      | Use PrivateLink endpoint | Specifies whether to use outbound private connectivity to harden your security posture. For information about using outbound private connectivity, see [Private connectivity to external volumes for Microsoft Azure](tables-iceberg-configure-external-volume-azure-private.md). |

   1. Select Google Cloud tab.
   2. Specify the configuration for the storage location that you’re adding:

   | Field | Description |
   | --- | --- |
   | Location name | Enter a name for your additional storage location. |
   | Storage base URL | Specifies the base URL for your additional storage location. |
   | Encryption (optional) | Specifies the encryption type used. Possible values are:  * None (default): No encryption. * SSE-KMS (enter key): Server-side encryption using keys stored in KMS. For more information,   see [customer-managed encryption keys](https://cloud.google.com/storage/docs/encryption/customer-managed-keys). |

   1. Select the S3 Compatible tab.
   2. Specify the configuration for the storage location that you’re adding:

      | Field | Description |
      | --- | --- |
      | Location name | Enter a name for your additional storage location. |
      | Storage base URL | Specifies the base URL for your cloud storage location. |
      | AWS key ID | Specifies the AWS key ID for connecting to and accessing your S3-compatible storage location. |
      | AWS secret key | Specifies the AWS secret key for connecting to and accessing your S3-compatible storage location. |
      | Storage endpoint | Specifies a fully qualified domain that points to your S3-compatible API endpoint.  **Note:** The storage endpoint should not include a bucket name; for example, specify `example.com` instead of `my_bucket.example.com`. |
8. Select Add storage location.

---
title: Configure an external volume for Amazon S3
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-s3.md
section: User Guide
---

# Configure an external volume for Amazon S3

Grant Snowflake restricted access to your Amazon S3 bucket using an external volume for
Apache Iceberg™ tables in Snowflake. To configure an external volume for Amazon S3, you can
use SQL or use Snowsight.

As a best practice, create a designated IAM policy that grants Snowflake access to your S3 location.
You can then attach the policy to a role, and use the security credentials generated by AWS for that role to access the files.

> **Note:**
>
> To harden your security posture, you can configure an external volume to use private connectivity rather than the public Internet for
> network traffic. For more information, see [Private connectivity to external volumes for Amazon Web Services](tables-iceberg-configure-external-volume-s3-private.md).

## Prerequisites

Before you configure an external volume, you need the following:

* An S3 storage bucket.

  + To use the external volume for externally managed Iceberg tables, all of your table data and metadata files must
    be located in a bucket that hosts your Snowflake account.
  + Snowflake can’t support external volumes with S3 bucket names that contain dots (for example, `my.s3.bucket`).
    S3 doesn’t support SSL for virtual-hosted-style buckets with dots in the name, and
    Snowflake uses virtual-host-style paths and HTTPS to access data in S3.
  + To support data recovery, [enable versioning for your external cloud storage location](tables-iceberg-storage.md).
* Permissions in AWS to create and manage IAM policies and roles. If you aren’t an AWS administrator, ask your AWS administrator to perform these tasks.

## Configure an external volume by using SQL

### Step 1: Create an IAM policy that grants access to your S3 location

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. Log in to the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy to provide Snowflake with the required permissions to read and write data to your S3 location.

   The following example policy grants access to all locations in the specified bucket.

   > **Note:**
   > * Replace `my_bucket` with your actual bucket name. You can also specify a path in the bucket; for example, `my_bucket/path`.
   > * Setting the `"s3:prefix":` condition to `["*"]` grants access to all prefixes in the
   >   specified bucket; setting it to `["path/*"]` grants access to a specified path in the bucket.
   > * For buckets in [government regions](intro-regions.md), the bucket ARNs use the `arn:aws-us-gov:s3:::` prefix.
   > * If you’re using an S3 access point, specify the access point ARN instead of a bucket ARN. For more information, see
   >   [Configuring IAM policies for using access points](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-policies.html).

   ```sqljson
   {
      "Version": "2012-10-17",
      "Statement": [
            {
               "Effect": "Allow",
               "Action": [
                  "s3:PutObject",
                  "s3:GetObject",
                  "s3:GetObjectVersion",
                  "s3:DeleteObject",
                  "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>/*"
            },
            {
               "Effect": "Allow",
               "Action": [
                  "s3:ListBucket",
                  "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>",
               "Condition": {
                  "StringLike": {
                        "s3:prefix": [
                           "*"
                        ]
                  }
               }
            }
      ]
   }
   ```
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Step 2: Create an IAM role

Create an AWS IAM role to grant privileges on the S3 bucket containing your data files.

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. For the trusted entity type, select AWS account.
4. Under An AWS account, select This account. In a later step,
   you modify the trust relationship and grant access to Snowflake.
5. Select the Require external ID option. Enter an
   [external ID](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html) of your choice.
   For example, `iceberg_table_external_id`.

   An external ID is used to grant access to your AWS resources (such as S3 buckets) to a third party like Snowflake.
6. Select Next.
7. Select the policy that you created for the external volume, then select Next.
8. Enter a Role name and description for the role, then select Create role.

   You have now created an IAM policy for an S3 location, created an IAM role, and attached the policy to the role.
9. Select View role to view the role summary page. Locate and record the ARN (Amazon Resource Name) value for the role.

### Step 3: Grant privileges required for SSE-KMS encryption to the IAM role (optional)

If you want to upload an object encrypted with an AWS KMS key to Amazon S3,
the IAM role that you created for your external volume needs `kms:GenerateDataKey` permissions on the key.
To download an object encrypted with an AWS KMS key, the IAM role needs `kms:Decrypt` permissions on the key.

If you want to use a KMS key for your server-side encryption, follow these steps to create a key and reference it.

1. In the AWS Management Console, go to the Key Management Service (KMS). From the left navigation, select Customer managed keys, and then select Create key.
   You must create a key in the same region as your bucket.
2. Create a Symmetric key type. For the key usage, select Encrypt and decrypt. Select Next.
3. For Alias, enter a name for the key and select Next.
4. If needed, select an administrator for the key and select Next.
5. For Define key usage permissions, select your IAM role, then select Next.
6. Review the key configuration details, then select Finish to create the key.
7. Find the key in the list of customer managed keys, select it, and record its ARN.
   The following is an example of an ARN for a key: `arn:aws:kms:us-west-2:111111122222:key/1a1a11aa-aa1a-aaa1a-a1a1-000000000000`.

   When you create your external volume, set the `KMS_KEY_ID` value to the ARN of your key.

### Step 4: Create an external volume in Snowflake

Create an external volume using the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.
The following example creates an external volume named `iceberg_external_volume`
that defines a single Amazon S3 storage location with encryption.

```sqlexample
CREATE OR REPLACE EXTERNAL VOLUME iceberg_external_volume
   STORAGE_LOCATIONS =
      (
         (
            NAME = 'my-s3-us-west-2'
            STORAGE_PROVIDER = 'S3'
            STORAGE_BASE_URL = 's3://<my_bucket>/'
            STORAGE_AWS_ROLE_ARN = '<arn:aws:iam::123456789012:role/myrole>'
            STORAGE_AWS_EXTERNAL_ID = 'iceberg_table_external_id'
         )
      )
      ALLOW_WRITES = TRUE;
```

The example specifies the
external ID (`iceberg_table_external_id`) associated with the IAM role that you created for the external volume.
Specifying an external ID lets you use the same IAM role (and external ID) across multiple external volumes.

> **Note:**
>
> Specify ARNs exactly as provided by AWS. ARNs are case-sensitive.

### Step 5: Retrieve the AWS IAM user for your Snowflake account

1. Retrieve the ARN for the AWS IAM user that was created automatically
   for your Snowflake account using the [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command.
   Specify the name of your external volume.

   The following example describes an external volume named `iceberg_external_volume`.

   ```sqlexample
   DESC EXTERNAL VOLUME iceberg_external_volume;
   ```
2. Record the value for the `STORAGE_AWS_IAM_USER_ARN` property, which is the AWS IAM user created for your Snowflake account;
   for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`.

   Snowflake provisions a single IAM user for your entire Snowflake account. All S3 external volumes in your account use that IAM user.

   > **Note:**
   >
   > If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created an external volume,
   > Snowflake generates an ID for you to use. Record the value so that you can update your IAM role trust policy with the generated external ID.

### Step 6: Grant the IAM user permissions to access bucket objects

In this step, you configure permissions that allow the IAM user for your Snowflake account to access objects in your S3 bucket.

1. Log in to the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the IAM role that you created for your external volume.
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC EXTERNAL VOLUME output values that you recorded.

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<iceberg_table_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   * `iceberg_table_external_id` is your external ID. If you *already* specified an external ID when you created the role, and used the same
     ID to create your external volume, leave the value as-is. Otherwise, update `sts:ExternalId` with the value that you recorded.
   > **Note:**
   >
   > You must update this policy document if you create a new external volume (or recreate an existing external volume using the CREATE OR
   > REPLACE EXTERNAL VOLUME syntax) and don’t provide your own external ID.
   > For security reasons, a new or recreated external volume has a different external ID and cannot
   > resolve the trust relationship unless you update this trust policy.
8. Select Update policy to save your changes.

### Step 7: Verify storage access

To check that Snowflake can successfully authenticate to your storage provider, call the [SYSTEM$VERIFY_EXTERNAL_VOLUME](../sql-reference/functions/system_verify_external_volume.md)
function.

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_external_volume');
```

> **Note:**
>
> If you receive the following error, your account administrator must activate AWS STS in the Snowflake deployment region.
> For instructions, see
> [Manage AWS STS in an AWS Region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html)
> in the AWS documentation.
>
> ```output
> Error assuming AWS_ROLE:
> STS is not activated in this region for account:<external volume id>. Your account administrator can activate STS in this region using the IAM Console.
> ```

## Configure an external volume in Snowsight

### Step 1: Create an IAM policy that grants access to your S3 location

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. Log in to the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy to provide Snowflake with the required permissions to read and write data to your S3 location.

   The following example policy grants access to all locations in the specified bucket.

   > **Note:**
   > * Replace `my_bucket` with your actual bucket name. You can also specify a path in the bucket; for example, `my_bucket/path`.
   > * Setting the `"s3:prefix":` condition to `["*"]` grants access to all prefixes in the
   >   specified bucket; setting it to `["path/*"]` grants access to a specified path in the bucket.
   > * For buckets in [government regions](intro-regions.md), the bucket ARNs use the `arn:aws-us-gov:s3:::` prefix.
   > * If you’re using an S3 access point, specify the access point ARN instead of a bucket ARN. For more information, see
   >   [Configuring IAM policies for using access points](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-policies.html).

   ```sqljson
   {
      "Version": "2012-10-17",
      "Statement": [
            {
               "Effect": "Allow",
               "Action": [
                  "s3:PutObject",
                  "s3:GetObject",
                  "s3:GetObjectVersion",
                  "s3:DeleteObject",
                  "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>/*"
            },
            {
               "Effect": "Allow",
               "Action": [
                  "s3:ListBucket",
                  "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>",
               "Condition": {
                  "StringLike": {
                        "s3:prefix": [
                           "*"
                        ]
                  }
               }
            }
      ]
   }
   ```
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Step 2: Create an IAM role

Create an AWS IAM role to grant privileges on the S3 bucket containing your data files.

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. For the trusted entity type, select AWS account.
4. Under An AWS account, select This account. In a later step,
   you modify the trust relationship and grant access to Snowflake.
5. Select the Require external ID option. Enter an
   [external ID](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html) of your choice.
   For example, `iceberg_table_external_id`.

   An external ID is used to grant access to your AWS resources (such as S3 buckets) to a third party like Snowflake.
6. Select Next.
7. Select the policy that you created for the external volume, then select Next.
8. Enter a Role name and description for the role, then select Create role.

   You have now created an IAM policy for an S3 location, created an IAM role, and attached the policy to the role.
9. Select View role to view the role summary page. Locate and record the ARN (Amazon Resource Name) value for the role.

### Step 3: Create an external volume in Snowsight

To create an external volume in Snowflake by using Snowsight, follow these steps:

> 1. Sign in to [Snowsight](ui-snowsight-gs.md).
> 2. In the lower-left corner, select your name » Switch role, and then select ACCOUNTADMIN or a role that has the CREATE EXTERNAL VOLUME privilege.
>
>    For more information, see [Switch your primary role](ui-snowsight-gs.md).
> 3. In the navigation menu, select Catalog » External data.
> 4. Select the External volumes tab.
> 5. Select + Create.
> 6. Select AWS S3 and then select Next.
>
>    > **Note:**
>    >
>    > You already configured your cloud provider earlier when you created an IAM policy that grants access to your S3 location
>    > and created an IAM role.
> 7. In the Grant storage access page, from the Trust policy field, copy the trust policy into a text editor.
>
>    In the next step, you paste this trust policy into AWS.
> 8. To grant storage access, follow these steps:
>
>    1. In AWS, log in to the AWS Management Console.
>    2. From the home dashboard, search for and select IAM.
>    3. From the left-hand navigation pane, select Roles.
>    4. Select the IAM role that you created for your external volume.
>    5. Select the Trust relationships tab.
>    6. Select Edit trust policy.
>    7. Replace the trust policy for your IAM role with the policy that you copied in Snowsight.
>    8. Select Update policy to save your changes.
>    9. In Snowsight, select Next.
> 9. In Snowsight, select Next.
> 10. To configure your external volume, from the Configure external volume page, complete the fields:
>
>     | Field | Description |
>     | --- | --- |
>     | External volume name | Enter a name for your external volume. |
>     | Region type | Specifies the cloud storage provider that stores your data files.  * Standard (default): S3 storage in public AWS regions outside of China. * Government (GovCloud): S3 storage in AWS [government regions](intro-regions.md). |
>     | S3 role ARN | Specifies the case-sensitive Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket containing your data files.  You recorded this value when you created an IAM role. |
>     | Encryption (optional) | Specifies the encryption type used. Possible values are:  * None (default): No encryption. * SSE-S3: Server-side encryption using S3-managed encryption keys. For more information, see [Using server-side   encryption with Amazon S3-managed encryption keys (SSE-S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingServerSideEncryption.html). * SSE-KMS (enter key): Server-side encryption using keys stored in KMS. For more information, see [Using server-side   encryption with AWS Key Management Service (SSE-KMS)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingKMSEncryption.html). |
>     | Scope | Choose where this external volume should become the default location for future Iceberg tables. Possible values are:  * Do not set a default: Don’t set the external volume as a default anywhere. * Account: Set the external volume as the default for Iceberg tables that are created under the entire account. * Specific database: Set the external volume as the default for Iceberg tables that are created under the database you   specify. To specify this database, use the Database drop-down that appears when you select Specific database. * Specific schema: Set the external volume as the default for Iceberg tables that are created under the schema you specify.   To specify this schema, use the Database drop-down that appears to first select   the parent database of the schema and then select the schema. |
>     | Comment (optional) | Specifies a comment for the external volume. |
>     | Connectivity | Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see [Private connectivity to external volumes for Amazon Web Services](tables-iceberg-configure-external-volume-s3-private.md). Possible values are:  * Public (default): Use the public internet. * Private (AWS PrivateLink): Use outbound private connectivity. |
>     | Storage base URL | Specifies the base URL for your cloud storage location. |
>     | Access scope | Specifies whether write operations are allowed for the external volume; must be set to Allow writes for the following tables:  * Iceberg tables that use Snowflake as the catalog. * Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them   through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE. For Iceberg tables created from Delta table files, setting this parameter to Allow writes enables Snowflake to write Iceberg metadata to your external storage. For more information, see [Delta-based tables](tables-iceberg-metadata.md).  The value of this field must also match the permissions that you set on the cloud storage account for each specified storage location.  **Note:** If you plan to use the external volume for reading externally managed Iceberg tables, you can set this field to Off. Snowflake doesn’t write data or Iceberg metadata files to your cloud storage when you read tables in an external Iceberg catalog. |
> 11. Select Next.
>
>     On the Verify connection & create volume page, Snowflake verifies your connection to AWS and then displays
>     a “Successfully connected” message.
>
>     > **Note:**
>     >
>     > If Snowflake is unable to verify your connection, check your permission or external volume configuration and then select
>     > Verify again.
> 12. Select Create.

## Next steps

After you configure an external volume, you can create an Iceberg table.

* To create a read-only Iceberg table that uses an external catalog, see
  [Configure a catalog integration](tables-iceberg-configure-catalog-integration.md).
* To create an Iceberg table with full Snowflake platform support,
  see [Create a Snowflake-managed table](tables-iceberg-create.md).

---
title: Configure an external volume for Azure
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-azure.md
section: User Guide
---

# Configure an external volume for Azure

Grant Snowflake restricted access to your own Microsoft Azure container using an external
volume. Snowflake supports the following Azure cloud storage services for external volumes:

* Blob storage
* Data Lake Storage Gen2

  [Preview feature](../release-notes/preview-features.md) — Open

  Available to all accounts. Configuring an external volume that is connected to Data Lake Storage Gen2 (Data Lake Storage) is in public preview.
* General-purpose v1
* General-purpose v2
* Microsoft Fabric OneLake

> **Important:**
>
> To enable interoperability between Snowflake and remote catalogs that are only configured to use Data Lake Storage, you must configure your external
> volume to connect to Data Lake Storage. For more information, see
> Enable interoperability with remote catalogs that use Data Lake Storage.

> **Note:**
>
> To harden your security posture, you can configure an external volume to use private connectivity rather than the public Internet for
> network traffic. For more information, see [Private connectivity to external volumes for Microsoft Azure](tables-iceberg-configure-external-volume-azure-private.md).

To configure an external volume for Azure,
you can use SQL or use Snowsight.

## Prerequisites

Before you configure an external volume, you need the following:

* An Azure storage container.

  + To use the external volume for externally managed Iceberg tables, all of your table data and metadata files must
    be located in the container.
  + To support data recovery, [enable versioning for your external cloud storage location](tables-iceberg-storage.md).
* Permissions in Azure to create and manage IAM policies and roles. If you aren’t an Azure administrator, ask your Azure administrator to perform these tasks.

If you use an Azure storage firewall to block unauthorized traffic to your storage account, follow the instructions in [Allow the VNet subnet IDs](data-load-azure-allow.md)
to explicitly grant Snowflake access to your Azure storage account.

## Enable interoperability with remote catalogs that use Data Lake Storage

This section describes how to configure Snowflake so that Iceberg tables you write to by using Snowflake are interoperable with remote
catalogs that are only configured to use Data Lake Storage. For example, Unity Catalog is configured to only use Data Lake Storage.

To enable interoperability with these catalogs, Snowflake must write data files for Iceberg tables to Data Lake Storage. To enable Snowflake to
write data files to Data Lake Storage, you must configure an external volume that connects Snowflake to Data Lake Storage,
which uses the `dfs.core.windows.net` endpoint.

When you use Snowflake to write to Iceberg tables that are interoperable with a remote catalog that uses Data Lake
Storage, the following scenarios are supported:

* Use Snowflake to create Snowflake-managed Iceberg tables that the query engine for the remote catalog can read and write to.

  > **Note:**
  >
  > To enable interoperability with your existing Snowflake-managed Iceberg tables that are stored in Blob Storage, migrate them
  > to Data Lake Storage. For instructions, see
  > [Migrate an Iceberg table to Azure Data Lake Storage](tables-iceberg-manage.md).
* Use Snowflake to read and write to remote tables in the remote catalog.

### Configure an external volume that connects Snowflake to Data Lake Storage

[Preview feature](../release-notes/preview-features.md) — Open

Available to all accounts.

To configure an external volume that connects Snowflake to Data Lake Storage, when you create an external volume in Snowflake,
you must specify a STORAGE_BASE_URL that points to an `dfs.core.windows.net` endpoint.

The following example creates an external volume named `exvoldfs` that is configured with a STORAGE_BASE_URL that points to a
`dfs.core.windows.net` endpoint.

> ```sqlexample
> CREATE EXTERNAL VOLUME exvoldfs
>   STORAGE_LOCATIONS =
>     (
>       (
>         NAME = 'my-azure-northeurope'
>         STORAGE_PROVIDER = 'AZURE'
>         STORAGE_BASE_URL = 'azure://exampleacct.dfs.core.windows.net/my_container_northeurope/'
>         AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>       )
>     );
> ```

## Configure an external volume by using SQL

### Step 1: Create an external volume in Snowflake

Create an external volume using the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.

The following example creates an external volume that defines an Azure storage location with encryption:

```sqlexample
CREATE EXTERNAL VOLUME exvol
  STORAGE_LOCATIONS =
    (
      (
        NAME = 'my-azure-northeurope'
        STORAGE_PROVIDER = 'AZURE'
        STORAGE_BASE_URL = 'azure://exampleacct.blob.core.windows.net/my_container_northeurope/'
        AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
      )
    );
```

> **Note:**
>
> * Use the `azure://` prefix and not `https://` when specifying a value for STORAGE_BASE_URL.
> * For information about specifying a OneLake location (preview feature), see the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) reference page.
> * If you use a regional endpoint for a Microsoft Fabric OneLake storage location,
>   use the same region as your Microsoft Fabric capacity. This must also be the same region that hosts your Snowflake account.

### Step 2: Grant Snowflake access to the storage location

1. To retrieve a URL to the Microsoft permissions request page, use the [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command.
   Specify the name of the external volume that you created previously.

   ```sqlexample
   DESC EXTERNAL VOLUME exvol;
   ```

   Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `AZURE_CONSENT_URL` | URL to the Microsoft permissions request page. |
   | `AZURE_MULTI_TENANT_APP_NAME` | Name of the Snowflake client application created for your account. In a later step in this section, you grant this application permission to obtain an access token on your allowed storage location. |

   You use these values in the following steps.
2. In a web browser, navigate to the Microsoft permissions request page (the `AZURE_CONSENT_URL`).
3. Select Accept. This action allows the Azure service principal created for your Snowflake account to obtain an
   access token on a specified resource inside your tenant. Obtaining an access token succeeds only if you grant the service principal the
   appropriate permissions on the storage account level (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Log in to the Microsoft Azure portal.
5. Go to Azure Services » Storage Accounts. Select the name of the storage account that the Snowflake service principal
   needs to access.

   > > **Note:**
   > >
   > > You must set IAM permissions for an external volume at the storage account level, not the container level.
6. Select Access Control (IAM) » Add role assignment.
7. Select the `Storage Blob Data Contributor` role to grant read and write access to the Snowflake service principal.

   > > **Note:**
   > >
   > > The `Storage Blob Data Contributor` role grants write access to the external volume location.
   > > To completely configure write access, set the `ALLOW_WRITES` parameter of the external volume
   > > to `TRUE` (the default value).

> 1. Select + Select members.

1. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the
   DESC EXTERNAL VOLUME output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft
   >   request page in this section. If the service principal is not available immediately, wait an hour
   >   or two and then search again.
   > * If you delete the service principal, the external volume stops working.
2. Select Review + assign.

   > **Note:**
   >
   > It can take up to 10 minutes for changes to take effect when you assign a role. For more information, see
   > [Symptom - Role assignment changes are not being detected](https://learn.microsoft.com/en-us/azure/role-based-access-control/troubleshooting?tabs=bicep#symptom---role-assignment-changes-are-not-being-detected)
   > in the Microsoft Azure documentation.

### Step 3: Verify storage access

To check that Snowflake can successfully authenticate to your storage provider, call the [SYSTEM$VERIFY_EXTERNAL_VOLUME](../sql-reference/functions/system_verify_external_volume.md)
function.

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_external_volume');
```

> **Note:**
>
> If you receive the following error, your account administrator must activate AWS STS in the Snowflake deployment region.
> For instructions, see
> [Manage AWS STS in an AWS Region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html)
> in the AWS documentation.
>
> ```output
> Error assuming AWS_ROLE:
> STS is not activated in this region for account:<external volume id>. Your account administrator can activate STS in this region using the IAM Console.
> ```

## Configure an external volume in Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role, and then select ACCOUNTADMIN or a role that has the CREATE EXTERNAL VOLUME privilege.

   For more information, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select + Create.
6. Select Microsoft Azure & OneLake and then select Next.
7. From the Prerequisites page, for Azure tenant ID, specify your Azure tenant ID.

   To find your Azure tenant ID, see [How to find your Microsoft Entra tenant ID](https://learn.microsoft.com/en-us/entra/fundamentals/how-to-find-tenant)
   in the Microsoft Entra documentation.
8. Select Next.
9. From the Grant storage access page, to grant Snowflake access to the storage location, follow these steps:

   1. To provide consent for Snowflake to connect to your Azure storage or Microsoft OneLake,
      select Provide consent.

      The Microsoft permissions request page opens in a new browser tab.
   2. From the Microsoft permissions request page, select Accept. This action allows the Azure service principal created for your
      Snowflake account to obtain an access token on a specified resource inside your tenant. Obtaining an access token succeeds only if you
      grant the service principal the appropriate permissions on the storage account level (see the next step).

      The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
   3. In Snowflake, from the Multi-tenant app name field, copy the name of the Snowflake client application created for your account
      into a text editor. In the next step, you grant this application permission to obtain an access token on your allowed storage location.
10. To grant your application permission to obtain an access token on your allowed storage location, follow these steps:

    1. Log in to the Microsoft Azure portal.
    2. Go to Azure Services » Storage Accounts. Select the name of the storage account that the Snowflake service principal
       needs to access.

       > > **Note:**
       > >
       > > You must set IAM permissions for an external volume at the storage account level, not the container level.
    3. Select Access Control (IAM) » Add role assignment.
    4. Select the `Storage Blob Data Contributor` role to grant read and write access to the Snowflake service principal.

       > > **Note:**
       > >
       > > The `Storage Blob Data Contributor` role grants write access to the external volume location.
       > > To completely configure write access, set the `ALLOW_WRITES` parameter of the external volume
       > > to `TRUE` (the default value).
    5. Select + Select members.
    6. Search for the Snowflake service principal.

       This is the *Multi-tenant app name* that you copied from Snowflake in the previous step.

       > **Important:**
       > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft
       >   request page in this section. If the service principal is not available immediately, wait an hour
       >   or two and then search again.
       > * If you delete the service principal, the external volume stops working.
    7. Select Review + assign.

       > **Note:**
       >
       > It can take up to 10 minutes for changes to take effect when you assign a role. For more information, see
       > [Symptom - Role assignment changes are not being detected](https://learn.microsoft.com/en-us/azure/role-based-access-control/troubleshooting?tabs=bicep#symptom---role-assignment-changes-are-not-being-detected)
       > in the Microsoft Azure documentation.
11. In Snowflake, select Next.
12. In Snowflake, to configure your external volume, from the Configure external volume page, complete the fields:

    | Field | Description |
    | --- | --- |
    | External volume name | Enter a name for your external volume. |
    | Storage base URL | Specifies the base URL for your cloud storage location. |
    | Access scope | Specifies whether write operations are allowed for the external volume; must be set to Allow writes for the following tables:  * Iceberg tables that use Snowflake as the catalog. * Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them   through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE. For Iceberg tables created from Delta table files, setting this parameter to Allow writes enables Snowflake to write Iceberg metadata to your external storage. For more information, see [Delta-based tables](tables-iceberg-metadata.md).  The value of this parameter must also match the permissions that you set on the cloud storage account for each specified storage location.  **Note:** If you plan to use the external volume for reading externally managed Iceberg tables, you can set this field to Off. Snowflake doesn’t write data or Iceberg metadata files to your cloud storage when you read tables in an external Iceberg catalog. |
    | Scope | Choose where this external volume should become the default location for future Iceberg tables. Possible values are:  * Do not set a default: Don’t set the external volume as a default anywhere. * Account: Set the external volume as the default for Iceberg tables that are created under the entire account. * Specific database: Set the external volume as the default for Iceberg tables that are created under the database you   specify. To specify this database, use the Database drop-down that appears when you select Specific database. * Specific schema: Set the external volume as the default for Iceberg tables that are created under the schema you specify.   To specify this schema, use the Database drop-down that appears to first select   the parent database of the schema and then select the schema. |
    | Comment (optional) | Specifies a comment for the external volume. |
    | Connectivity | Specifies whether to use outbound private connectivity to harden your security posture. For information about using outbound private connectivity, see [Private connectivity to external volumes for Microsoft Azure](tables-iceberg-configure-external-volume-azure-private.md). Possible values are:  * Public (default): Use the public internet. * Private (Azure Private Endpoint): Use outbound private connectivity. |
13. Select Next.

    On the Verify connection & create volume page, Snowflake verifies your connection to Azure and then displays
    a “Successfully connected” message.

    > **Note:**
    >
    > If Snowflake is unable to verify your connection, check your permission or external volume configuration and then select
    > Verify again.
14. Select Create.

## Next steps

After you configure an external volume, you can create an Iceberg table.

* To create a read-only Iceberg table that uses an external catalog, see
  [Configure a catalog integration](tables-iceberg-configure-catalog-integration.md).
* To create an Iceberg table with full Snowflake platform support,
  see [Create a Snowflake-managed table](tables-iceberg-create.md).

---
title: Configure an external volume for Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-gcs.md
section: User Guide
---

# Configure an external volume for Google Cloud Storage

Grant Snowflake restricted access to a Google Cloud Storage (GCS) bucket
using an external volume. To configure an external volume for Google Cloud Storage, you can use SQL or
use Snowsight.

## Prerequisites

Before you configure an external volume, you need the following:

* A Google Cloud Storage bucket.

  + To use the external volume for externally managed Iceberg tables, all of your table data and metadata files must
    be located in the bucket.
  + To support data recovery, [enable versioning for your external cloud storage location](tables-iceberg-storage.md).
* Permissions in Google Cloud to create and manage IAM policies and roles. If you aren’t a Google Cloud administrator,
  ask your Google Cloud administrator to perform these tasks.

To configure an external volume, you can use SQL or Snowsight:

* Configure an external volume by using SQL
* Configure an external volume in Snowsight

## Configure an external volume by using SQL

### Step 1: Create an external volume in Snowflake

Create an external volume using the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.

The following example creates an external volume that defines a single GCS storage location with encryption:

```sqlexample
CREATE EXTERNAL VOLUME my_gcs_external_volume
  STORAGE_LOCATIONS =
    (
      (
        NAME = 'my-us-west-2'
        STORAGE_PROVIDER = 'GCS'
        STORAGE_BASE_URL = 'gcs://mybucket1/path1/'
        ENCRYPTION=(TYPE='GCS_SSE_KMS' KMS_KEY_ID = '1234abcd-12ab-34cd-56ef-1234567890ab')
      )
    );
```

### Step 2: Retrieve the GCS service account for your Snowflake account

To retrieve the ID for the GCS service account that was created automatically
for your Snowflake account, use the [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command.
Specify the name of the external volume that you created previously.

For example:

> ```sqlexample
> DESC EXTERNAL VOLUME my_gcs_external_volume;
> ```

Record the value of the `STORAGE_GCP_SERVICE_ACCOUNT` property in the output
(for example, `service-account-id@project1-123456.iam.gserviceaccount.com`).

Snowflake provisions a single GCS service account for your entire Snowflake account.
All GCS external volumes use that service account.

### Step 3: Grant the service account permissions to access bucket objects

In this step, you configure IAM access permissions for Snowflake in your Google Cloud console.

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Log in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. In Filter, select Service and then select storage.
7. Filter the list of permissions, and add the following from the list:

   > * `storage.buckets.get`
   > * `storage.objects.create`
   > * `storage.objects.delete`
   > * `storage.objects.get`
   > * `storage.objects.list`
8. Select Add.
9. Select Create.

#### Assign the custom role to the GCS service account

1. Log in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created an external volume.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name from the
   output in Step 2: Retrieve the GCS service account for your Snowflake account.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

#### Grant the GCS service account permissions on the Google Cloud Key Management Service keys

> **Note:**
>
> This step is required only if your GCS bucket is encrypted using a key stored in the
> Google Cloud Key Management Service (Cloud KMS).

1. Log in to the Google Cloud console as a project editor.
2. From the home dashboard, search for and select Security » Key Management.
3. Select the key ring that is assigned to your GCS bucket.
4. In the upper-right corner, select SHOW INFO PANEL. The information panel for the key ring appears.
5. In the Add members field, search for the service account name from the DESCRIBE EXTERNAL VOLUME output
   in Step 2: Retrieve the GCS service account for your Snowflake account.
6. From the Select a role dropdown, select the Cloud KMS CryptoKey Encrypter/Decrypter role.
7. Select Add. The service account name is added to the Cloud KMS CryptoKey Encrypter/Decrypter
   role drop-down in the information panel.

### Step 4: Verify storage access

To check that Snowflake can successfully authenticate to your storage provider, call the [SYSTEM$VERIFY_EXTERNAL_VOLUME](../sql-reference/functions/system_verify_external_volume.md)
function.

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_external_volume');
```

> **Note:**
>
> If you receive the following error, your account administrator must activate AWS STS in the Snowflake deployment region.
> For instructions, see
> [Manage AWS STS in an AWS Region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html)
> in the AWS documentation.
>
> ```output
> Error assuming AWS_ROLE:
> STS is not activated in this region for account:<external volume id>. Your account administrator can activate STS in this region using the IAM Console.
> ```

## Configure an external volume in Snowsight

### Step 1: Retrieve the GCS service account for your Snowflake account

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role, and then select ACCOUNTADMIN or a role that has the CREATE EXTERNAL VOLUME privilege.

   For more information, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select + Create.
6. Select Google Cloud Storage and then select Next.
7. From the Grant storage access page, copy the value of the GCS service account into a text editor.

   Snowflake provisions a single GCS service account for your entire Snowflake account. All GCS external volumes use that service account.

### Step 2: Grant the service account permissions to access bucket objects

In this step, you configure IAM access permissions for Snowflake in your Google Cloud console.

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Log in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. In Filter, select Service and then select storage.
7. Filter the list of permissions, and add the following from the list:

   > * `storage.buckets.get`
   > * `storage.objects.create`
   > * `storage.objects.delete`
   > * `storage.objects.get`
   > * `storage.objects.list`
8. Select Add.
9. Select Create.

#### Assign the custom role to the GCS service account

1. Log in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created an external volume.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name from the
   output in Step 1: Retrieve the GCS service account for your Snowflake account.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

### Step 3: Create an external volume

To create an external volume in Snowflake by using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role, and then select ACCOUNTADMIN or a role that has the CREATE EXTERNAL VOLUME privilege.

   For instructions, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select + Create.
6. Select Google Cloud Storage and then select Next.
7. Select Next.

   > **Note:**
   >
   > You already granted storage access earlier when you retrieved the GCS service account for your Snowflake account and
   > assigned the custom role to the GCS service account.
8. To configure your external volume, from the Configure external volume page, complete the fields:

   | Field | Description |
   | --- | --- |
   | External volume name | Enter a name for your external volume. |
   | Storage base URL | Specifies the base URL for your cloud storage location. |
   | Encryption (optional) | Specifies the encryption type used. Possible values are:  * None (default): No encryption. * SSE-KMS (enter key): Server-side encryption using keys stored in KMS. For more information,   see [customer-managed encryption keys](https://cloud.google.com/storage/docs/encryption/customer-managed-keys). |
   | Access scope | Specifies whether write operations are allowed for the external volume; must be set to Allow writes for the following tables:  * Iceberg tables that use Snowflake as the catalog. * Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them   through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE. For Iceberg tables created from Delta table files, setting this parameter to Allow writes enables Snowflake to write Iceberg metadata to your external storage. For more information, see [Delta-based tables](tables-iceberg-metadata.md).  The value of this field must also match the permissions that you set on the cloud storage account for each specified storage location.  **Note:** If you plan to use the external volume for reading externally managed Iceberg tables, you can set this field to Off. Snowflake doesn’t write data or Iceberg metadata files to your cloud storage when you read tables in an external Iceberg catalog. |
   | Scope | Choose where this external volume should become the default location for future Iceberg tables. Possible values are:  * Do not set a default: Don’t set the external volume as a default anywhere. * Account: Set the external volume as the default for Iceberg tables that are created under the entire account. * Specific database: Set the external volume as the default for Iceberg tables that are created under the database you   specify. To specify this database, use the Database drop-down that appears when you select Specific database. * Specific schema: Set the external volume as the default for Iceberg tables that are created under the schema you specify.   To specify this schema, use the Database drop-down that appears to first select   the parent database of the schema and then select the schema. |
   | Comment (optional) | Specifies a comment for the external volume. |
   | Connectivity | Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see [Private connectivity to external volumes for Google Cloud](tables-iceberg-configure-external-volume-gcs-private.md). Possible values are:  * Public (default): Use the public internet. * Private (Private Service Connect): Use outbound private connectivity. |
9. Select Next.

   On the Verify connection & create volume page, Snowflake verifies your connection to Google Cloud Storage and then displays
   a “Successfully connected” message.

   > **Note:**
   >
   > If Snowflake is unable to verify your connection, check your permission or external volume configuration and then select
   > Verify again.
10. Select Create.

## Next steps

After you configure an external volume, you can create an Iceberg table.

* To create a read-only Iceberg table that uses an external catalog, see
  [Configure a catalog integration](tables-iceberg-configure-catalog-integration.md).
* To create an Iceberg table with full Snowflake platform support,
  see [Create a Snowflake-managed table](tables-iceberg-create.md).

---
title: Configure an external volume for S3-compatible storage
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-s3-compatible.md
section: User Guide
---

# Configure an external volume for S3-compatible storage

For [externally managed](tables-iceberg.md) or [Snowflake-managed](tables-iceberg.md)
Apache Iceberg™ tables with data and metadata in S3-compatible storage,
you can configure an external volume associated with an [Amazon S3-compatible storage](data-load-s3-compatible-storage.md) location.

To create an external volume for S3-compatible storage, you can Use SQL or
Use Snowsight.

## Prerequisites

To use S3-compatible storage for Iceberg tables, you must have an S3-compatible API endpoint for Snowflake.
For more information, see [Requirements for S3-compatible storage](data-load-s3-compatible-storage.md).

## Create an external volume for S3-compatible storage by using SQL

Create an external volume that specifies an S3-compatible storage location.
For information about the S3-compatible parameters in the CREATE EXTERNAL VOLUME command, see the
[command syntax](../sql-reference/sql/create-external-volume.md).

```sqlexample
CREATE OR REPLACE EXTERNAL VOLUME ext_vol_s3_compat
  STORAGE_LOCATIONS = (
    (
      NAME = 'my_s3_compat_storage_location'
      STORAGE_PROVIDER = 'S3COMPAT'
      STORAGE_BASE_URL = 's3compat://mybucket/unload/mys3compatdata'
      CREDENTIALS = (
        AWS_KEY_ID = '1a2b3c...'
        AWS_SECRET_KEY = '4x5y6z...'
      )
      STORAGE_ENDPOINT = 'mystorage.com'
    )
  );
```

## Create an external volume for S3-compatible storage by using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role, and then select ACCOUNTADMIN or a role that has the CREATE EXTERNAL VOLUME privilege.

   For more information, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select + Create.
6. Select S3 compatible storage.
7. Select Next.
8. In your S3-compatible storage provider, create access key credentials that have the permissions required to access your bucket and get
   objects.

   For more information, see [Requirements for S3-compatible storage](data-load-s3-compatible-storage.md).
9. Select Next.
10. To configure your external volume, from the Configure external volume page, complete the fields:

    | Field | Description |
    | --- | --- |
    | External volume name | Enter a name for your external volume. |
    | Storage endpoint | Specifies a fully qualified domain that points to your S3-compatible API endpoint.  **Note:** The storage endpoint should not include a bucket name; for example, specify `example.com` instead of `my_bucket.example.com`. |
    | AWS key ID | Specifies the AWS key ID for connecting to and accessing your S3-compatible storage location. |
    | AWS secret key | Specifies the AWS secret key for connecting to and accessing your S3-compatible storage location. |
    | Access scope | Specifies whether write operations are allowed for the external volume; must be set to Allow writes for the following tables:  * Iceberg tables that use Snowflake as the catalog. * Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them   through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE. For Iceberg tables created from Delta table files, setting this parameter to Allow writes enables Snowflake to write Iceberg metadata to your external storage. For more information, see [Delta-based tables](tables-iceberg-metadata.md).  The value of this field must also match the permissions that you set on the cloud storage account for each specified storage location.  **Note:** If you plan to use the external volume for reading externally managed Iceberg tables, you can set this field to Off. Snowflake doesn’t write data or Iceberg metadata files to your cloud storage when you read tables in an external Iceberg catalog. |
    | Scope | Choose where this external volume should become the default location for future Iceberg tables. Possible values are:  * Do not set a default: Don’t set the external volume as a default anywhere. * Account: Set the external volume as the default for Iceberg tables that are created under the entire account. * Specific database: Set the external volume as the default for Iceberg tables that are created under the database you   specify. To specify this database, use the Database drop-down that appears when you select Specific database. * Specific schema: Set the external volume as the default for Iceberg tables that are created under the schema you specify.   To specify this schema, use the Database drop-down that appears to first select   the parent database of the schema and then select the schema. |
    | Comment | Specifies a comment for the external volume. |
    | Storage base URL | Specifies the base URL for your cloud storage location. |
11. Select Next.

    On the Verify connection & create volume page, Snowflake verifies your connection to your S3 compatible storage and then displays
    a “Successfully connected” message.

    > **Note:**
    >
    > If Snowflake is unable to verify your connection, check your permission or external volume configuration and then select
    > Verify again.
12. Select Create.

## Update your external volume credentials

To change or update the credentials for the external volume, you can use the
[ALTER EXTERNAL VOLUME … UPDATE](../sql-reference/sql/alter-external-volume.md) command.
Specify the name of the storage location that you want to change the credentials for.

```sqlexample
ALTER EXTERNAL VOLUME ext_vol_s3_compat UPDATE
  STORAGE_LOCATION = 'my_s3_compat_storage_location'
  CREDENTIALS = (
    AWS_KEY_ID = '4d5e6f...'
    AWS_SECRET_KEY = '7g8h9i...'
  );
```

---
title: Configure an identity provider (IdP) for Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/sso-configure-idp.md
section: User Guide
---

# Configure an identity provider (IdP) for Snowflake Open Catalog

This topic shows you how to configure Okta or configure Auth0 as the IdP for your Snowflake Open Catalog account.

## Before you begin

To set up an IdP for Open Catalog SSO, you use your full Open Catalog account identifier, which includes your Snowflake
organization name and your Open Catalog account name; for example: `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

## Configure Okta

> **Note:**
>
> To create an Okta account for your company or
> organization, see <https://www.okta.com/>.

To set up Okta as the IdP for your Open Catalog account, follow these steps:

### Create an application in Okta for your Snowflake Open Catalog account

1. Sign in to the Okta Admin Portal.
2. In the left pane, select **Applications** > **Applications**, and then select **Browse App Catalog**.
3. In the search bar, search for and select the **Snowflake** application.
4. Select **Add Integration**.
5. In the **General settings** tab, enter the following values:

   * For **Application label**, enter Snowflake Open Catalog.
   * For **Subdomain**, enter your Snowflake organization name and Snowflake Open Catalog account name, using the format `<orgname>-<my-snowflake-open-catalog-account-name>`.

     For example: `ABCDEFG-MYACCOUNT1`.
     To find these names, see Before you begin.
6. Under **Sign-On Options - Required**, select **SAML 2.0**.
7. Under **Credentials Details**, for **Application username format**, select **Okta username**.

   This is the NameID value passed to Snowflake from Okta, which must
   match the LOGIN_NAME value of each user in Snowflake Open Catalog.
8. Select **Done**.
9. Select **View Setup Instructions**.

   This opens a new browser tab that contains information for configuring your Snowflake Open Catalog account to use SSO.
10. From the setup instructions, copy the following values, and paste them into a text editor for later use:

    * Entity ID (sometimes referred to as Issuer URL)
    * IDP SSO URL (sometimes referred to as Login URL)
    * Authentication Certificate

### Create a user (person)

Okta uses the term *person* to mean *user*. These are the users who will have access to your Open Catalog account.

To create users in Okta, follow these steps:

1. In the Okta Admin Portal, in the left pane, select **Directory** > **People**.
2. Select **Add Person**.
3. Enter the user’s details:

   | Field | Value |
   | --- | --- |
   | **User type** | Select **User**. |
   | **First name** | The user’s first name. |
   | **Last name** | The user’s last name. |
   | **Username** | The user’s email address.   Note: The **Username** that you enter here must match the LOGIN_NAME used to [create the user in Open Catalog](sso-configure-open-catalog.md). |
   | **Primary email** | This field is automatically populated with the **Username** that you enter. |
   | **Activation** | Select **Activate now**. |
   | **I will set password** | Select this option, and then enter a password for the user. |
4. Select **Save**.

### Assign the Snowflake Open Catalog application to users

Assigning the Open Catalog application to a user allows you to grant them access to your Open Catalog account. When you
[create the user in Open Catalog](sso-configure-open-catalog.md), you grant them access to the
account.

To assign the Snowflake Open Catalog application to users in Okta, follow these steps:

1. In the Okta Admin Portal, navigate to the Snowflake Open Catalog application that you previously created.
2. Select the **Assignments** tab.
3. Assign the application to users through individual user assignment (**Assign to People**) or group assignment (**Assign to Groups**).

## Configure Auth0

> **Note:**
>
> To create an Auth0 account for your company or organization, see <https://auth0.com/>.

To set up Auth0 as the IdP for your Open Catalog account, follow these steps:

### Create a Snowflake Open Catalog application

1. Sign in to the Auth0 console.
2. Select **Applications** > **Applications** > **+ Create Application**.
3. Create an application for Snowflake Open Catalog:

   1. Select **Native**.
   2. Enter a name for the application: **Snowflake Open Catalog**
   3. Select **Create**.
4. On the **Settings** tab, under **Application URIs**, provide the following details:

   | Field | Value |
   | --- | --- |
   | **Application Login URI** | `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com`    For example: `https://ABCDEFG-MYACCOUNT1.snowflakecomputing.com`    To find these names, see Before you begin. |
   | **Allowed Callback URLs** | `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com`   `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login`    For example: `https://ABCDEFG-MYACCOUNT1.snowflakecomputing.com <br /><br /> https://ABCDEFG-MYACCOUNT1.snowflakecomputing.com/fed/login`    To find these names, see Before you begin. |
   | **Allowed Logout URLs** | `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/logout`    For example: `https://ABCDEFG-MYACCOUNT1.snowflakecomputing.com/fed/logout`    To find these names, see Before you begin. |
5. Under **Advanced settings**, select the **Grant Types** tab.
6. Select the **Password** checkbox. Accept the default values for the other settings.
7. At the top of the page, select the **Addons** tab.
8. Select the **SAML2 WEB APP**.
9. In the window that opens, select the **Settings** tab.
10. For **Application Callback URL**, enter:
    `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login`

     For example: `https://ABCDEFG-MYACCOUNT1.snowflakecomputing.com/fed/login`

     To find these names, see Before you begin.|
11. For **Settings**, replace the contents with the following code:

    ```sqljson
      {
             "audience": "https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com",
             "recipient": "https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login",
             "signatureAlgorithm": "rsa-sha256",
             "digestAlgorithm": "sha256",
             "destination": "https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login",
             "nameIdentifierProbes": [
                  "http://schemas.xmlsoap.org/ws/2005/05/identity/claims/emailaddress""
              ],
             "logout": {
                "callback": "https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/logout"
              },
             "binding": "urn:oasis:names:tc:SAML:2.0:bindings:HTTP-POST"
        }
    ```

    Where:

    `<orgname>` is the name of your Snowflake organization, and`<my-snowflake-open-catalog-account-name>` is the name of your Snowflake Open Catalog account. To find these names, see Before you begin.
12. Scroll down and select **Enable**.

    This button changes to **Save**.
13. to save your settings, select **Save**.

### Create users

These are the users who will have access to your Open Catalog account.

To create users in Auth0, follow these steps:

1. In the Auth0 console, in the left pane, select **User Management** > **Users**.
2. Select **+ Create User**.
3. In the **Create user** dialog, enter these values:

   * For **Connection**, select **Username-Password-Authentication**.
   * For **Email**, enter an email address for the user.

     > **Note:**
     >
     > The email address that you enter here must match the LOGIN_NAME used to [create the user in Open Catalog](sso-configure-open-catalog.md).
   1. For **Password** and **Repeat Password**, enter the same password for the user twice.
   2. Select **Create**.

---
title: Configure an integration for Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-config.md
section: User Guide
---

# Configure an integration for Google Cloud Storage

This topic describes how to configure secure access to data files stored in a Google Cloud Storage bucket.

## Configure a Snowflake storage integration

This section describes how to use storage integrations to allow Snowflake to read data from and write to a Google Cloud Storage bucket referenced in an external
(that is, Cloud Storage) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as
secret keys or access tokens; instead, integration objects reference a Cloud Storage service account. An administrator in your organization grants the service
account permissions in the Cloud Storage account.

Administrators can also restrict users to a specific set of Cloud Storage buckets (and optional paths) accessed by external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires access to your Cloud Storage project as a project editor. If you are not a project
>   editor, ask your Cloud Storage administrator to perform these tasks.
> * Confirm that Snowflake supports the Google Cloud Storage region that your storage is hosted in. For more information, see
>   [Supported cloud regions](intro-regions.md).

The following diagram shows the integration flow for a Cloud Storage stage:

1. An external (that is, Cloud Storage) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a Cloud Storage service account created for your account. Snowflake creates a single service account that is referenced by all GCS storage integrations in your Snowflake account.
3. A project editor for your Cloud Storage project grants permissions to the service account to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the service account on the bucket before allowing or denying access.

**In this Section:**

### Step 1: Create a Cloud Storage integration in Snowflake

Create an integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. An integration is a Snowflake object that delegates authentication responsibility for external cloud storage to a Snowflake-generated entity (that is, a Cloud Storage service account). For accessing Cloud Storage buckets, Snowflake creates a service account that can be granted permissions to access the bucket(s) that store your data files.

A single storage integration can support multiple external (that is, GCS) stages. The URL in the stage definition must align with the GCS buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'GCS'
  ENABLED = TRUE
  STORAGE_ALLOWED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `bucket` is the name of a Cloud Storage bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two buckets and paths. In a later step, we will create an external stage that references one of these buckets and paths.

Additional external stages that also use this integration can reference the allowed buckets and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/', 'gcs://mybucket2/path2/')
>   STORAGE_BLOCKED_LOCATIONS = ('gcs://mybucket1/path1/sensitivedata/', 'gcs://mybucket2/path2/sensitivedata/');
> ```

### Step 2: Retrieve the Cloud Storage service account for your Snowflake account

Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the ID for the Cloud Storage service account that was created automatically for your Snowflake account:

```sqlsyntax
DESC STORAGE INTEGRATION <integration_name>;
```

Where:

> * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage integration in Snowflake (in this topic).

For example:

> ```sqlexample
> DESC STORAGE INTEGRATION gcs_int;
>
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> | property                    | property_type | property_value                                                              | property_default |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------|
> | ENABLED                     | Boolean       | true                                                                        | false            |
> | STORAGE_ALLOWED_LOCATIONS   | List          | gcs://mybucket1/path1/,gcs://mybucket2/path2/                               | []               |
> | STORAGE_BLOCKED_LOCATIONS   | List          | gcs://mybucket1/path1/sensitivedata/,gcs://mybucket2/path2/sensitivedata/   | []               |
> | STORAGE_GCP_SERVICE_ACCOUNT | String        | service-account-id@project1-123456.iam.gserviceaccount.com                  |                  |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> ```

The STORAGE_GCP_SERVICE_ACCOUNT property in the output shows the Cloud Storage service account created for your Snowflake account (that is, `service-account-id@project1-123456.iam.gserviceaccount.com`). We provision a single Cloud Storage service account for your entire Snowflake account. All Cloud Storage integrations use that service account.

### Step 3: Grant the service account permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your Google Cloud console so that you can use a Cloud Storage bucket to load and unload data:

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. Filter the list of permissions, and add the following from the list:

   > | Action(s) | Required permissions |
   > | --- | --- |
   > | Data loading only | * `storage.buckets.get` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading with purge option, executing the REMOVE command on the stage | * `storage.buckets.get` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading and unloading | * `storage.buckets.get` (for calculating data transfer costs) * `storage.objects.create` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data unloading only | * `storage.buckets.get` * `storage.objects.create` * `storage.objects.delete` * `storage.objects.list` |
   > | Using [COPY FILES](../sql-reference/sql/copy-files.md) to copy files to an external stage | You must have the following additional permissions:  * `storage.multipartUploads.abort` * `storage.multipartUploads.create` * `storage.multipartUploads.list` * `storage.multipartUploads.listParts` |
7. Select Add.
8. Select Create.

#### Assign the custom role to the Cloud Storage Service Account

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created your storage integration.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name that you retrieved from the DESC STORAGE INTEGRATION command output.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

> **Important:**
>
> If your Google Cloud organization was created on or after May 3, 2024, Google Cloud enforces a
> [domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains)
> in project organization policies. The default constraint lists your domain as the only allowed value.
>
> To allow the Snowflake service account access to your storage, you must
> [update the domain restriction](data-load-gcs-allow.md).

#### Grant the Cloud Storage service account permissions on the Cloud Key Management Service cryptographic keys

> **Note:**
>
> This step is required only if your GCS bucket is encrypted using a key stored in the Google Cloud Key Management Service (Cloud KMS).

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, search for and select Security » Key Management.
3. Select the key ring that is assigned to your GCS bucket.
4. Click SHOW INFO PANEL in the upper-right corner. The information panel for the key ring slides out.
5. Click the ADD PRINCIPAL button.
6. In the New principals field, search for the service account name from the DESCRIBE INTEGRATION output in Step 2: Retrieve the Cloud Storage service account for your Snowflake account (in this topic).
7. From the Select a role dropdown, select the `Cloud KMS CrytoKey Encryptor/Decryptor` role.
8. Click the Save button. The service account name is added to the Cloud KMS CrytoKey Encryptor/Decryptor role dropdown in the information panel.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

### Step 4: Create an external stage

Create an external stage that references the integration you created.

> **Note:**
>
> * You must use a role that is granted or inherits the USAGE privilege on the database and schema and the CREATE STAGE privilege on the schema. The stage owner (that is, the role with the OWNERSHIP privilege on the stage) must also have the USAGE privilege on the storage integration.
>
>   Refer to [Access control requirements](../sql-reference/sql/create-stage.md) for [CREATE STAGE](../sql-reference/sql/create-stage.md).
> * To load data to or unload data from a stage that uses an integration, a role must have the USAGE privilege on the stage. It isn’t necessary to also have the USAGE privilege on the storage integration.
> * Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
>   This process might leave incomplete uploads in the storage location for your external stage.
>
>   To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
>   For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
>   or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.

#### Create an external stage by using SQL

Ensure that the role in use is granted or inherits the necessary privileges to create a stage that uses a storage integration. For example:

```sqlexample
GRANT USAGE ON DATABASE mydb TO ROLE myrole;
GRANT USAGE ON SCHEMA mydb.stages TO ROLE myrole;
GRANT CREATE STAGE ON SCHEMA mydb.stages TO ROLE myrole;
GRANT USAGE ON INTEGRATION gcs_int TO ROLE myrole;
```

You can create an external stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.

Example 1:

In this example, we set `mydb.stages` as the current database and schema for the user session, and then create a stage named `my_gcs_stage`. In this example, the stage references the Cloud Storage bucket and path `mybucket1/path1`, which are supported by the integration. The stage also references a named file format object called `my_csv_format`:

```sqlexample
USE SCHEMA mydb.stages;

CREATE STAGE my_gcs_stage
  URL = 'gcs://mybucket1/path1'
  STORAGE_INTEGRATION = gcs_int
  FILE_FORMAT = my_csv_format;
```

Example 2:

In this example, we connect to Google Cloud Storage using a customer-managed key (CMK):

```sqlexample
USE SCHEMA mydb.stages;

CREATE STAGE my_ext_stage2
  URL='gcs://load/encrypted_files/'
  STORAGE_INTEGRATION = gcs_int
  ENCRYPTION=(TYPE = 'GCS_SSE_KMS' KMS_KEY_ID = '{a1b2c3});
  FILE_FORMAT = my_csv_format;
```

> **Note:**
>
> * Append a forward slash (`/`) to the URL value to filter to the specified folder path. If the forward slash is omitted, all files and
>   folders starting with the prefix for the specified path are included.
>
>   Note that the forward slash is required to access and retrieve unstructured data files on the stage.
> * The STORAGE_INTEGRATION parameter is handled separately from other stage parameters, such as FILE_FORMAT. Support for these other parameters is the same regardless of the integration used to access your GCS bucket.

#### Create an external stage using Python

Use the [StageCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.stage.StageCollection)
method of the [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-overview.md) to create an external stage.

Similar to the preceding SQL example, the following Python example creates an external stage named `my_gcs_stage` in the `mydb` database
and the `stages` schema:

```python
from snowflake.core.stage import Stage

my_stage = Stage(
  name="my_gcs_stage",
    storage_integration="gcs_int",
    url="gcs://mybucket1/path1"
)
root.databases["mydb"].schemas["stages"].stages.create(my_stage)
```

> **Note:**
>
> The Python API currently does not support the FILE_FORMAT parameter of the [CREATE STAGE](../sql-reference/sql/create-stage.md) SQL command.

#### Create an external stage using Snowsight

To use Snowsight to create a named external stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select the database and schema where you want to create a stage.
4. Select Create » Stage.
5. Select Google Cloud Platform.
6. Enter a Stage Name.
7. Enter the URL of your Google Cloud Storage bucket.
8. Note that Enable Directory Table is selected by default. This lets you see the files on the stage, but requires a warehouse and thus incurs a cost. You can choose to deselect this option for now and enable a directory table later.
9. Enable Authentication.
10. Select your storage integration from the menu.
11. Optionally expand the SQL Preview to view a generated SQL statement. To specify additional options for your stage such as AUTO_REFRESH, you can open this SQL preview in a worksheet.
12. Select Create.

## Edit existing stages to use storage integrations

You can edit an existing external stage configuration to use a storage integration using SQL or the web interface.

> **Note:**
>
> * You cannot disable authentication or encryption settings for a stage.
> * You can update a stage to use a storage integration for authentication. However, you cannot change the authentication type to credentials if the stage already uses a storage integration. To change the authentication type, you can drop and re-create the stage.

### Edit a stage using SQL

Use [ALTER STAGE](../sql-reference/sql/alter-stage.md) to modify the stage. For example:

```sqlexample
ALTER STAGE my_gcs_stage
  SET STORAGE_INTEGRATION = gcs_int;
```

### Edit a stage using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select the stage that you want to edit.
4. Select  » Edit.
5. Make your desired changes to the stage.
6. Select Save.

---
title: Configure and remove a service connection
source: https://docs.snowflake.com/en/user-guide/opencatalog/configure-service-connection.md
section: User Guide
---

# Configure and remove a service connection

As a Snowflake Open Catalog administrator, you configure a new service connection in Snowflake Open Catalog. You can then register it, which
connects the query engine that uses the connection to a catalog in Open Catalog. You can use the same service connection
for one or multiple query engines. For more information about service connections, see [Service connection](overview.md).

When you configure a new service connection, you specify the following items:

* A [principal role](access-control.md) to grant to the service principal. You can use a principal role to logically
  group Open Catalog service principals together. For more information, including examples of principal roles, see [Principal role](access-control.md).
* The query engine that users will use with the connection, such as Apache Spark.

When you configure a service connection, the service credentials for its service principal are created. You specify these service credentials
when you register the service connection.

## Configure a service connection

1. Sign in to Open Catalog.
2. In the menu on the left, select **Connections**.
3. Select **+ Connection**.
4. In the Configure Service Connection dialog, complete the fields:

   1. For **Query Engine**, select the query engine for the service connection.
   2. For **Name**, enter a service principal name.

      You can enter a user-friendly name so the connection is easier to identify and
      use in tools. For more information, including examples, see [Service principal](overview.md).
   3. To grant a principal role to the service principal, do one of the following:

      * To grant an existing principal role, select a role in the **Principal Role** drop-down.

        You can select an existing principal role to grant the same privileges to multiple service principals, such as a principal role named DATA_ENGINEERS.
      * To grant a new principal role, select **Create new principal role**. For
        **Principal Role**, enter a name for the new role.
5. Select **Create**.

   The Client ID and Client Secret service credentials for the service principal are created.
6. In the **Configure Service Connection** dialog, save the service credentials:

   1. To copy the Client ID, select **Copy client id** inside the **Client ID** field, and paste it in a file.
   2. To copy the Client Secret, select **Copy secret** inside the **Client Secret** field, and paste it in a file.
   3. To copy both the Client ID and Client Secret and in the format that they need to be specified when you register the service
      connection, select **Copy** inside the **As <CLIENT ID>:<SECRET>** field.

      **Important**

      You must save the service credentials before you close the Configure Service Connection window, because you can’t retrieve them later.
7. Select **Close**.

## Remove a service connection

If you no longer need to use a service connection, remove it.

To remove a service connection, do the following:

1. Sign in to Open Catalog.
2. In the menu on the left, select **Connections**.
3. In the list of connections, locate the service connection you want to remove.
4. Under the **MORE** column, select **…** for the connection you want to remove.
5. Select **Delete**.

---
title: Configure and use a Data Exchange
source: https://docs.snowflake.com/en/user-guide/data-exchange-using.md
section: User Guide
---

# Configure and use a Data Exchange

Use the information provided here to perform Data Exchange administrative and user tasks.

After you sign in to your Data Exchange as an Data Exchange Admin, you can perform the following tasks:

* Set up your Data Exchange.
* Create, update, or delete provider profiles. New profiles must be approved by a data exchange administrator. See [Manage provider profiles](data-exchange-becoming-a-provider.md).
* Update contact email.
* Manage profile editors.
* Manage membership in the data exchange.
* Assign roles to members of the data exchange. See [Grant privileges to other roles](data-exchange-marketplace-privileges.md).

> **Note:**
>
> When logging in to a data exchange for administrative purposes such as joining the exchange, configuring the exchange, or configuring data listings, the member must have the ACCOUNTADMIN role.

## Set up your Data Exchange

> **Note:**
>
> To create a Data Exchange, contact your Snowflake representative or
> [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Invite members and assign roles

After the Data Exchange is set up, you can start inviting accounts as members and designating them as data providers, data consumers,
or both. You invite members using their Snowflake account name or account URL.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Navigate to the Manage exchanges tab.
4. Select the exchange you want to manage.
5. Select the Members tab.
6. Select Add Member to add a new member. To manage an existing member, select their member row.
7. Select the role for the member, Provider or Consumer, by selecting the appropriate checkbox.
8. Save your changes.

### Manage member listings

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Manage exchanges tab.
4. Select the exchange you want to manage.
5. Select the Member Listings tab.
6. Select Any, Pending, or Reviewed to manage listings in different states.
7. Open a listing by selecting its row.
8. View the listing, or select Review to review the listing and approve or deny it for your Data Exchange.

### Manage member profiles

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Manage exchanges tab.
4. Select the exchange you want to manage.
5. Select the Member Profiles tab. On the tab, you can do the following:

   * Select Pending or Reviewed to view profiles in different states.
   * You can view already reviewed profiles, or select Review to approve or deny a member profile.

### Access consumer listings

All users can browse listings in the Data Exchange, but only users with the ACCOUNTADMIN role or the [IMPORT SHARE](security-access-privileges-shares.md) privilege can get or request data.

If you do not have sufficient privileges, you can do one of the following:

* Request your ACCOUNTADMIN to grant you the IMPORT SHARE privilege.
* Request your ACCOUNTADMIN to get data, and grant you IMPORTED PRIVILEGES on the database created from the share.
  For more information, see [Granting privileges on an imported database](data-share-consumers.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared with you tab.

### Browse data listings

After you sign in to a Data Exchange, review the Listings section of the Shared With You tab to view available listings.

In a Data Exchange, the following types of listings are available to you:

* [Free listings](../collaboration/collaboration-listings-about.md), which you can
  access by selecting Get to create a database out of the shared data inside of your Snowflake account.
* Personalized listings, which you can access by selecting Request to request access to the data. An email notification is sent to
  the data provider with your request.

### View listing requests

> **Note:**
>
> To see requests from listings on the Snowflake Marketplace, such as those for personalized listings or free listings in another region,
> use Provider Studio.
> See [Managing Listing Requests as a Provider](../collaboration/provider-listings-managing.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Requests tab.
4. Select Inbound.

   When a request is denied, a comment is provided next to the request, explaining the reason for denial. In such cases, you can make the necessary adjustments and resubmit your request.

### Access shared data

1. When your request for a listing in the Data Exchange is approved, sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared with you tab.
4. Locate the listing you requested and select Get Data for the listing.
5. Enter the name for the database to create in your account from the share.
6. Select roles that you want to have access to the database created from the share.
7. Accept Snowflake’s consumer terms and the provider’s terms of use. You only need to accept the listing terms when you create a database from a share for the first time.

   > **Note:**
   >
   > Accepting terms using SQL is not supported.
8. Select Create Database.

   After you create the database from share, the Get Data button is replaced with the View Database button.

   See also: [Usage metrics shared with providers](data-sharing-intro.md)

---
title: Configure custom authorization servers for External OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-ext-custom.md
section: User Guide
---

# Configure custom authorization servers for External OAuth

This topic describes how to create an External OAuth security integration in Snowflake, so clients can access Snowflake data by
authenticating with a custom authorization server.

If your authorization server is a [supported identity provider (IdP)](oauth-ext-overview.md) rather than a custom one, refer to the
topic focused on configuring that specific IdP.

## External OAuth token payload requirements

The access token that custom authentication servers send to Snowflake must contain the following payload information. For more information
about the Claims column, see [JWT Claims](https://tools.ietf.org/html/rfc7519#section-4).

| Claims | Description |
| --- | --- |
| scp | Scopes. A list of scopes in the access token. |
| scope | Scopes.  A comma-separated string of scopes in the access token.  Snowflake supports specifying any single character for the delimiter, such as a space (i.e. `' '`), by setting the `EXTERNAL_OAUTH_SCOPE_DELIMITER` property when [creating](../sql-reference/sql/create-security-integration-oauth-external.md) or [modifying](../sql-reference/sql/alter-security-integration-oauth-external.md) the External OAuth security integration for custom authorization servers.  Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable this property in your Snowflake account. |
| aud | Audience. Identifies the recipients that the access token is intended for as a string URI. |
| exp | Expiration time. Identifies the expiration time on or after which the access token must not be accepted for processing. |
| iss | Issuer. Identifies the principal that issued the access token as a string URI. |
| iat | Issued at. Required. Identifies the time at which the JWT was issued. |

> **Note:**
>
> Snowflake supports the `nbf` (not before) claim, which identifies the time before which the access token must not be
> accepted for processing.
>
> If your custom authorization server supports the `nbf` (not before) claim, you can optionally include the `nbf` claim in the
> access token.

To verify your token contains the required information, you can test the token on this [JSON Web Tokens](https://jwt.ms) site.

As a representative example, the PAYLOAD: DATA interface displays the token payload as follows.

```sqljson
{
  "aud": "<audience_url>",
  "iat": 1576705500,
  "exp": 1576709100,
  "iss": "<issuer_url>",
  "scp": [
    "session:role:analyst"
  ]
}
```

## Configuration procedure

The following steps assume that your custom authorization server and environment can be configured to obtain the necessary values to create
the Snowflake Security Integration.

> **Important:**
>
> The steps in this topic are a representative example on how to configure custom authorization servers.
>
> You can configure your environment to any desired state and use any desired OAuth flow provided that you can obtain the necessary
> information for the External OAuth security integration.
>
> Note that the following steps serve as a guide to obtain the necessary information to create the External OAuth security integration in
> Snowflake.
>
> Consult your internal security policies before configuring a custom authorization server to ensure your organization meets all regulations
> and compliance requirements.

### Obtain key environment values to use External OAuth

When you configure your IdP and authorization server, you must collect the following values to define an External OAuth security
integration:

Issuer URL:
:   Include this URL with the `external_oauth_issuer` parameter.

RSA Public Key:
:   Include this value with the `external_oauth_rsa_public_key` parameter.

Audience URLs:
:   If more than one Audience URL is necessary, separate each URL with a comma in the `external_oauth_audience_list` parameter.

Scope attribute:
:   You can set this value to `scp` or `scope`. By default, this value is `scp`.

    You can set the value of the `external_oauth_scope_mapping_attribute` parameter to this value.

    If you do not use the default value, `scp`, then set value of the `external_oauth_scope_mapping_attribute` parameter to
    `scope`.

    For more information, refer to External OAuth token payload requirements.

User Attribute:
:   This attribute refers to attribute to identify users in your IdP. Include this attribute value in the
    `external_oauth_user_mapping_claim` parameter.

Snowflake User Attribute:
:   The attribute in Snowflake to identify users. Include this value in the `external_oauth_snowflake_user_mapping_attribute` parameter.

### Create an External OAuth security integration in Snowflake

This step creates an External OAuth security integration in Snowflake. The External OAuth security integration ensures that Snowflake can
communicate securely with and validate access tokens from your custom authorization server, and provide users access to Snowflake data based
on their user role associated with the access token. For more information, see
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md).

> **Important:**
>
> Only account administrators or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
>
> The External OAuth security integration parameter values are case-sensitive, and the values you put into the External OAuth security
> integration must match the values in your environment. If the case of a value does not match, the access token will not be validated,
> resulting in a failed authentication attempt.

**Create an External OAuth security integration in Snowflake**

> ```sqlexample
> create security integration external_oauth_custom
>     type = external_oauth
>     enabled = true
>     external_oauth_type = custom
>     external_oauth_issuer = '<authorization_server_url>'
>     external_oauth_rsa_public_key = '<public_key_value>'
>     external_oauth_audience_list = ('<audience_url_1>', '<audience_url_2>')
>     external_oauth_token_user_mapping_claim = 'upn'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```

### Modifying Your External OAuth Security Integration

You can update your External OAuth security integration by executing an ALTER statement on the security integration.

For more information, see [ALTER SECURITY INTEGRATION (External OAuth)](../sql-reference/sql/alter-security-integration-oauth-external.md).

### Using ANY role with External OAuth

In the configuration step to create a security integration in Snowflake, the OAuth access token includes the scope definition. Therefore, at runtime, using the External OAuth security integration allows neither the OAuth client nor the user to use an undefined role in the OAuth access token.

After validating the access token and creating a session, the ANY role can allow the OAuth client and user to decide its role. If necessary, the client or the user can switch to a role that is different that the role defined in the OAuth access token.

To configure ANY role, define the scope as `SESSION:ROLE-ANY` and configure the security integration with the `external_oauth_any_role_mode` parameter. This parameter can have three possible string values:

* `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `use role <role>;`). Default.
* `ENABLE` allows the OAuth client or user to switch roles.
* `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the `USE_ANY_ROLE` privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

  ```sqlexample
  grant USE_ANY_ROLE on integration external_oauth_1 to role1;
  ```

  ```sqlexample
  revoke USE_ANY_ROLE on integration external_oauth_1 from role1;
  ```

Define the security integration as follows:

```sqlexample
create security integration external_oauth_1
    type = external_oauth
    enabled = true
    external_oauth_any_role_mode = 'ENABLE'
    ...
```

### Using secondary roles with External OAuth

The desired scope for the primary role is passed in the external token: either the default role for the user (`session:role-any`) or
a specific role that was granted to the user (`session:role:<role_name>`).

By default, Snowflake does not activate the default [secondary roles](security-access-control-overview.md) for a user (i.e.
the DEFAULT_SECONDARY_ROLES) user in the session.

To activate the default secondary roles for a user in a session and allow executing the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command while using External OAuth, complete the following steps:

1. Configure the security integration for the connection. Set the EXTERNAL_OAUTH_ANY_ROLE_MODE parameter value to either ENABLE or
   ENABLE_FOR_PRIVILEGE when you create the security integration (using CREATE SECURITY INTEGRATION) or later (using ALTER SECURITY
   INTEGRATION).
2. Configure the authorization server to pass the static value of `session:role-any` in the scope attribute of the token. For more
   information about the scope parameter, see [External OAuth overview](oauth-ext-overview.md).

### Using Client Redirect with External OAuth

Snowflake supports using Client Redirect with External OAuth, including using Client Redirect and External OAuth with supported Snowflake
Clients.

For more information, see [Redirecting client connections](client-redirect.md).

### Using network policies with External OAuth

Currently, network policies cannot be added to your External OAuth security integration. However, you can still implement network policies that apply broadly to the entire Snowflake account.

If your use case requires a network policy that is specific to the OAuth security integration, use [Snowflake OAuth](oauth-intro.md). This approach allows the Snowflake OAuth network policy to be distinct from other network policies that may apply to the Snowflake account.

For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Using replication with External OAuth

Snowflake supports replication and failover/failback of the External OAuth security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Testing procedure

To test the configuration of a custom authorization server:

1. Verify that the test user exists in your IdP and has a password.
2. Verify that the test user exists in Snowflake with their `login_name` attribute value set to the
   `<external_oauth_token_user_mapping_claim>`.
3. Register an OAuth 2.0 client
4. Allow the OAuth 2.0 client to make a POST request to the custom token endpoint as follows:

   * Grant type set to Resource Owner
   * HTTP Basic Authorization header containing the clientID and secret
   * FORM data containing the username & password
   * Include scopes

The sample command requests the `ANALYST` custom role and that assumes the `session:role:analyst` has been defined in the custom
authorization server.

Here is an example for getting an access token using cURL.

```bash
curl -X POST -H "Content-Type: application/x-www-form-urlencoded;charset=UTF-8" \
  --user <OAUTH_CLIENT_ID>:<OAUTH_CLIENT_SECRET> \
  --data-urlencode "username=<IdP_USER_USERNAME>" \
  --data-urlencode "password=<IdP_USER_PASSWORD>" \
  --data-urlencode "grant_type=password" \
  --data-urlencode "scope=session:role:analyst" \
  <IdP_TOKEN_ENDPOINT>
```

## Connecting to Snowflake with External OAuth

After configuring your security integration and obtaining your access token, you can connect to Snowflake using one of the following:

* [SnowSQL](snowsql-start.md)
* [Python Connector](../developer-guide/python-connector/python-connector-connect.md)
* [Go Driver](https://godoc.org/github.com/snowflakedb/gosnowflake#hdr-Connection_Parameters)
* [JDBC Driver](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC Driver](../developer-guide/odbc/odbc-parameters.md)
* [Spark Connector](spark-connector-use.md)
* [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md#create-a-connection)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

Note the following:

* It is necessary to set the `authenticator` parameter to `oauth` and the `token` parameter to the `external_oauth_access_token`.
* When passing the `token` value as a URL query parameter, it is necessary to URL-encode the `token` value.
* When passing the `token` value to a Properties object (e.g. JDBC Driver), no modifications are necessary.

For example, if using the Python Connector, set the connection string as shown below.

```python
ctx = snowflake.connector.connect(
   user="<username>",
   host="<hostname>",
   account="<account_identifier>",
   authenticator="oauth",
   token="<external_oauth_access_token>",
   warehouse="test_warehouse",
   database="test_db",
   schema="test_schema"
)
```

You can now use External OAuth to connect to Snowflake securely.

---
title: Configure External OAuth in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/external-oauth-configure.md
section: User Guide
---

# Configure External OAuth in Snowflake Open Catalog

This topic describes how to configure external servers that use OAuth for accessing Snowflake Open Catalog.

> **Note:**
>
> This topic shows you how to use Auth0 to configure External OAuth for Open Catalog. However, the steps for configuring it in Okta or
> Microsoft Entra ID are similar.

## Prerequisites

* Create one or more catalogs in your Open Catalog account.
* Ensure that you have an account with an identity provider (IdP). For demonstration purposes, this topic uses Auth0 as the IdP. To create an Auth0
  account for your company or organization, see <https://auth0.com/>. However, the process is similar for Okta and Microsoft Entra ID.
* You must have [Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation) installed on your machine.
* To configure External OAuth, you must have the service admin role in Open Catalog. For more information, see [User roles](https://other-docs.snowflake.com/en/opencatalog/access-control#user-roles). In Snowflake CLI, this role is printed as POLARIS_ACCOUNT_ADMIN.

## Before you begin

To configure External OAuth, you need to create a Snowflake CLI connection for Open Catalog.

In order to create this connection, you need your full Open Catalog account identifier, which includes your Snowflake organization name and
your Open Catalog account name; for example: `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

## Create a Snowflake CLI connection

Create a Snowflake CLI connection for your Open Catalog account so you can use it to configure external OAuth for the account.

### Step 1: Add a Snowflake CLI connection for Snowflake Open Catalog

* [Add a connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md)
  with the following values. For all other parameters, press `Enter` to skip specifying a value for the parameter.

  | Connection configuration parameters | Value |
  | --- | --- |
  | **Name for this connection** | Specify a name for the connection; for example, `myopencatalogconnection`. |
  | **Account name** | Specify your Snowflake organization name, followed by your Open Catalog account name, in this format:  `<orgname>-<my-snowflake-open-catalog-account-name>`.  For example, `ABCDEFG-MYACCOUNT1`.  To find these names, see Before you begin. |
  | **Username** | Specify your username for Open Catalog; for example, `jsmith`. |
  | **Password [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  Enter your password for Open Catalog; for example, `MyPassword123456789`. |
  | **Role for the connection [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  You must enter `POLARIS_ACCOUNT_ADMIN` |

### Step 2: Test the Snowflake CLI connection

* To test your CLI connection, follow this example, which tests the connection for `myopencatalogconnection`:

  ```snowcli
  snow connection test -c myopencatalogconnection
  ```

  The response should look like this:

  ```snowcli
  +------------------------------------------------------------------------------+
  | key              | value                                                     |
  |----------------------------+-------------------------------------------------|
  | Connection name  | myopencatalogconnection                                   |
  | Status           | OK                                                        |
  | Host             | ABCDEFG-MYACCOUNT1.snowflakecomputing.com                 |
  | Account          | ABCDEFG-MYACCOUNT1                                        |
  | User             | jsmith                                                    |
  | Role             | POLARIS_ACCOUNT_ADMIN                                     |
  | Database         | not set                                                   |
  | Warehouse        | not set                                                   |
  +------------------------------------------------------------------------------+
  ```

### Step 3: Set a default Snowflake CLI connection

To ensure that the connection you’re using always has the required POLARIS_ACCOUNT_ADMIN role granted to it, you can set the Snowflake CLI
connection you created for Open Catalog as the default connection. For more information about the default connection, see
[Set the default connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. Follow this example, which sets the `myopencatalogconnection` connection as the default:

   ```snowcli
   snow connection set-default myopencatalogconnection
   ```
2. To confirm that you’re using the correct user and role, run the following:

   ```snowcli
   snow sql -q "Select current_user(); select current_role();"
   ```

   The response should return your Open Catalog username and the CURRENT
   ROLE should be POLARIS_ACCOUNT_ADMIN.

   ```snowcli
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | JSMITH        |
   +----------------+
   select current_role();
   +-----------------------+
   | CURRENT_ROLE()        |
   |-----------------------|
   | POLARIS_ACCOUNT_ADMIN |
   +-----------------------+
   ```

## Step 1: Set up Auth0 as an External OAuth authorization server

In this section, you set up Auth0 as an External OAuth authorization server. However, the steps for setting up Microsoft Entra ID or Okta as
an External OAuth authorization server are similar.

> **Note:**
>
> To create an Auth0 account for your company or organization, see <https://auth0.com/>.

### Define an API

Define an API so you can assign permissions to it.

1. Sign in to the Auth0 console.
2. In the left pane, select **Applications** > **APIs**.
3. Select **+ Create API**.
4. Enter a **Name** and **Identifier**, and accept the default settings.
5. Select **Create**.

### Add permissions to the API

Add permissions to the API so you can grant them to the client.

This client will be the principal in Open Catalog and the permission you assign to it will be a principal role in Open Catalog.

1. Sign in to the Auth0 console.
2. In the left pane, select **Applications** > **APIs**.
3. Select your API.
4. On the **Permissions** tab, add permissions:

   1. Use the **Permission** and **Description** fields to add permissions for your API. Use the
      format `SESSION:ROLE:<custom_role_name>`. For example: `SESSION:ROLE:ENGINEER`.

      > **Important:**
      >
      > To generate your service admin access token, you must add the
      > `SESSION:ROLE:POLARIS_ACCOUNT_ADMIN` permission, which allows you to assign the service admin role in Open Catalog with
      > permissions to the API.
   2. Select **+ Add**.

### Create an application

Create applications so that you can grant them permissions to the API you created.

1. Sign in to the Auth0 console.
2. In the left pane, select **Applications** > **Applications**.
3. Select **+ Create Application** and create an application. Repeat this step toe create each application you need for configuring
   External OAuth.

   > **Important:**
   >
   > Make sure you create an application for generating the service admin access token. Later, you’ll
   > grant the `SESSION:ROLE:POLARIS_ACCOUNT_ADMIN` permission to it.

### Assign permissions to the API

Follow these steps to select the permissions that you want to grant to the client:

1. Sign in to the Auth0 console.
2. In the left pane, select **Applications** > **Applications**.
3. Select the application you created.
4. Select the **APIs** tab.
5. If needed, select the **Authorized** toggle for your API to On.
6. Select the **Expand** icon for your API.
7. In the **Permissions** field, select the check box for each permission you want to assign to the API.
8. Select **Update**.

   Repeat these steps for each application you created.

> **Important:**
>
> Make sure you select the application you created for generating the service admin access token and assign the `SESSION:ROLE:POLARIS_ACCOUNT_ADMIN`
> permission to the API. Otherwise, you can’t generate the service admin access token.

## Step 2: Retrieve your organization and account name

You need your organization name and Snowflake Open Catalog account name, separated by a hyphen, for tasks such as creating a security
integration.

1. To retrieve your organization name and Snowflake Open Catalog account name in this format, in Snowflake CLI, run the following command:

   ```snowcli
   snow sql -q "SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();"
   ```
2. In the response, copy the returned values and paste them into a text editor for later use. For example: `ABCDEFG-MYACCOUNT1`.

## Step 3: Create a security integration

To create a security integration, run the CREATE SECURITY INTEGRATION command by using a Snowflake CLI connection.

```snowcli
 snow sql -q "create or replace security integration external_oauth_auth0
    type = external_oauth
    enabled = true
    external_oauth_type = custom
    external_oauth_issuer = 'https://<Auth0_domain>/'
    external_oauth_jws_keys_url = 'https://<Auth0_domain>/.well-known/jwks.json'
    external_oauth_audience_list = ('https://<your_org_name>-<your_open_catalog_account_name>.snowflakecomputing.com')
    external_oauth_token_user_mapping_claim = 'sub'
    external_oauth_snowflake_user_mapping_attribute = 'login_name'
    EXTERNAL_OAUTH_SCOPE_DELIMITER = ' '
    EXTERNAL_OAUTH_SCOPE_MAPPING_ATTRIBUTE = 'scope';"
```

Where:

* `<Auth0_domain>` is your Auth0 domain. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Domain** field.
* `<your_org_name>-<your_open_catalog_account_name>` is your organization name and Snowflake Open Catalog account name, separated by
  a hyphen.

  For example: `ABCDEFG-MYACCOUNT1`.

  To retrieve these values in this format, see Retrieve your organization and account name.

## Step 4: Generate your service admin access token

To configure External OAuth programmatically, you need a service admin access token. However, you have the option to use the Open Catalog UI
to perform some tasks for configuring External OAuth.

If you already generated a service admin access token for yourself and it’s still active, you can skip this step.

To generate your service admin access token, in Snowflake CLI, execute the following command and copy the value into a text editor:

```bash
ACCESS_TOKEN=$(curl -X POST https://<Auth0_domain>/oauth/token --header 'content-type: application/x-www-form-urlencoded' --data grant_type=client_credentials --data client_id=<client_id> --data client_secret=<client_secret> --data-urlencode "audience=https://<your_org_name>-<your_open_catalog_account_name>.snowflakecomputing.com" --data "scope=SESSION:ROLE:POLARIS_ACCOUNT_ADMIN" | jq -r '.access_token')
```

Where:

* `<Auth0_domain>` is your Auth0 domain. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Domain** field.
* `<client_id>`is the client ID for your application in Auth0 that you grant access to the POLARIS_ACCOUNT_ADMIN privilege. To find this
  value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Client ID** field.
* `<client_secret>` is the client secret for your application in Auth0 that you grant access to the POLARIS_ACCOUNT_ADMIN privilege. To
  find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Client Secret** field.
* `<audience>` is the identifier for your API. To find this value, in Auth0, navigate to Applications > APIs > select your API > Settings >
  **Identifier** field.
* `<your_org_name>-<your_open_catalog_account_name>` is your organization name and Snowflake Open Catalog account name, separated by
  a hyphen.

  For example: `ABCDEFG-MYACCOUNT1`.

  To retrieve these values in this format, see Retrieve your organization and account name.
* `POLARIS_ACCOUNT_ADMIN` is the name of the built-in role in Open Catalog that allows you to perform administrative tasks in Open
  Catalog, which includes configuring External OAuth. In the Open Catalog UI, this role is referred to as the service admin role.

## Step 5: Create a custom role

In this step, you use your Snowflake CLI connection to create a custom role.

Create a custom role so that later, you can grant catalog roles to it and grant the custom role to a service principal, which bestows the
service principal with privileges. For more information about custom roles, see [Custom role](access-control.md). For more
information on the RBAC model in Open Catalog, see [RBAC model](access-control.md).

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

The following example creates an `OPEN_CATALOG_ADMIN` custom role:

```snowcli
snow sql -q "create role OPEN_CATALOG_ADMIN;"
```

## Step 6: Grant a catalog role to a custom role

In this section, you grant a catalog role to the custom role you created.

After you grant a catalog role to the custom role, you then grant the custom role to a service principal to bestow the service principal
with any privileges granted to catalog roles that are granted to the principal role. For more information on the RBAC model in Open Catalog,
see [RBAC model](access-control.md).

You can grant the custom role with a catalog role that has catalog admin privileges for the catalog or has a set of
privileges for the catalog that you specify:

* If you want to grant a service principal with catalog admin privileges to a catalog,
  grant a catalog role with catalog admin privileges to the custom role. For information on what privileges a catalog admin has, see [Catalog admin role](access-control.md).
* If you want to grant the service principal with a set of privileges you specify,
  grant a catalog role with a set of privileges you specify to the custom role.
  For example, choose this option if you want to grant a cataLog_reader, catalog_writer, or catalog_metadata_reader catalog role to the
  custom role.

### Grant a catalog role with catalog admin privileges to the custom role

In this section, you grant a catalog role with catalog admin privileges to the custom role. The workflow is as follows:

1. Create a catalog role.
2. Grant catalog admin privileges to the catalog role.
3. Grant the catalog role to a custom role.

#### Create a catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command to create a catalog role in the catalog you specify:

> ```bash
>  curl -X POST \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{
>    "name": "<catalog_role_name>"
>    }'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of the catalog in Open Catalog where you want to create a catalog role.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * For `<catalog_role_name>` specify a name for the catalog role. For example: CatalogAdmin.

See [Catalog](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#catalog-1) in Snowflake Labs.

See See [Create a catalog role](create-catalog-role.md).

#### Grant catalog admin privileges to the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "catalog", "privilege": "CATALOG_MANAGE_CONTENT"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `CATALOG_MANAGE_CONTENT` is the name of the privilege in Open Catalog with catalog admin privileges.

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.
As described in the instructions, make sure you grant the `CATALOG_MANAGE_CONTENT` privilege to the catalog role.

See [Grant catalog privileges on a catalog role](secure-catalogs.md) and select the
`CATALOG_MANAGE_CONTENT` privilege.

#### Grant the catalog role to a custom role

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/principal-roles/{customRoleName}/catalog-roles/{catalogName}" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"catalogRole": {"name": "<catalog_role_name>", "properties": {}, "entityVersion": 1}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{customRoleName}` is the name of the custom role you want to grant with the catalog role that has catalog admin privileges.
>       For example, OPEN_CATALOG_ADMIN.
>     * `{catalogName}` is the name of a catalog in Open Catalog that you want to grant catalog admin privileges to.
>     * `<catalog_role_name>` is the name of the catalog role for a catalog, which has catalog admin privileges granted to it.
>       For exampple: CatalogAdmin.
>     * `<service_admin_access_token>` is the service admin access token you generated.

See [Grants](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#grants) in Snowflake Labs.

See [Secure catalogs](secure-catalogs.md). These
instructions describe how to grant a catalog role to a principal role but the process is the same. Instead of selecting a principal
role from the list, select your custom role that you want to grant with the catalog role that has the `CATALOG_MANAGE_CONTENT` privilege.

If needed, repeat this step to grant the custom role with catalog admin privileges for other catalogs.

### Grant a catalog role with a set of privileges you specify to the custom role

The workflow is as follows:

1. Create a catalog role.
2. Grant privileges to the catalog role. These privileges allow the service principal to perform actions in Open Catalog.
3. Grant the catalog role to a custom role.

#### Create a catalog role

In this section, you create a catalog role. For more information about catalog roles, see [Catalog role](access-control.md).

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command to create a catalog role in the catalog you specify:

> ```bash
>  curl -X POST \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{
>    "name": "<catalog_role_name>"
>    }'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of the catalog in Open Catalog where you want to create a catalog role.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * For `<catalog_role_name>` specify a name for the catalog role. For example: TableReader.

See [Catalog](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#catalog-1) in Snowflake Labs.

See [Create a catalog role](create-catalog-role.md).

#### Grant privileges to the catalog role

You can grant privileges to the entire catalog or to a namespace or table in the catalog:

* Grant catalog privileges on the catalog role
* Grant namespace privileges on the catalog role
* Grant table privileges on the catalog role

##### Grant catalog privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "catalog", "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<privilege_name>` is the name of the privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant catalog privileges on a catalog role](secure-catalogs.md).

##### Grant namespace privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "namespace", "namespace": ["<namespace_name>"], "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<namespace_name>` is the name of the namespace you want to grant privileges to. To grant privileges to a
>       nested namespace, specify it, along with each parent namespace, separated by a comma. For example: `"ns1","ns1a"`.
>     * `<privilege_name>` is the name of the namespace privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant namespace privileges on a catalog role](secure-catalogs.md).

##### Grant table privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "table", "namespace": ["<namespace_name>"], "tableName": "<table_name>", "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<namespace_name>` is the name of the namespace whose table you want to grant privileges to. To grant privileges on a
>       table located under a nested namespace, specify the nested namespace, along with each parent namespace, separated by a comma. For example: `"ns1","ns1a"`.
>     * `<table_name>` is the name of the table you want to grant privileges to.
>     * `<privilege_name>` is the name of the privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant table privileges on a catalog role](secure-catalogs.md).

#### Grant the catalog role to a custom role

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/principal-roles/{customRoleName}/catalog-roles/{catalogName}" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"catalogRole": {"name": "<catalog_role_name>", "properties": {}, "entityVersion": 1}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{customRoleName}` is the name of the custom role you that you want to grant the catalog role to.
>     * `{catalogName}` is the name of a catalog in Open Catalog where you created the catalog role that you want to grant privileges on.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<catalog_role_name>` is the name of the catalog role that you want to grant to the custom role.

See [Grants](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#grants) in Snowflake Labs.

See [Secure catalogs](secure-catalogs.md). These
instructions describe how to grant a catalog role to a principal role but the process is the same. Instead of selecting a principal
role from the list, you select your custom role.

If needed, repeat this workflow to grant additional catalog roles that you create to the custom role.

## Step 7: Create a service principal

In the step, you use your Snowflake CLI connection to create a service principal.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

To connect to Open Catalog through External OAuth, we need a service principal. In Open Catalog, service principals are users with the `TYPE` parameter set to `service`. So we will use the CREATE USER command to create a user of `TYPE=service`.

To create a service principal, run the following command:

```snowcli
snow sql -q "CREATE USER <user_name> LOGIN_NAME='<client_id>@clients' TYPE='service';"
```

Where:

* For `<user_name>`, specify a name for the service principal.
* For `<client_id>`, specify the client ID for your application. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Client ID** field.

## Step 8: Grant a custom role to the service principal

In this section, you use your Snowflake CLI connection to grant the custom role to one or more service principals. As a result, you bestow the service principal with the privileges granted to any catalog role(s) that are granted to
the custom role. For more information on the RBAC model in Open Catalog, see [RBAC model](access-control.md).

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

The following example grants the `ENGINEER` custom role to the `service_principal1` service principal.

```snowcli
snow sql -q "GRANT ROLE ENGINEER to user service_principal1;"
```

To validate that the role is granted to the service principal, run:

```snowcli
snow sql -q "show grants to user service_principal1;"
```

In the response, check that the custom role you created (**role** column) is assigned to the service principal you created (**grantee_name** column).

## (Optional) Step 9: Generate the access token

In this section, you can generate an access token, which you can use to connect to Open Catalog with External OAuth. However, if you connect by
using this method, you must manually refresh the access token.

Alternatively, you can skip this step and later, [connect with Open Catalog by using an automatic refresh token](external-oauth-connect.md), which is the preferred method.

Use the following curl command to generate an access token:

```bash
ACCESS_TOKEN=$(curl -X POST https://<Auth0_domain>/oauth/token --header 'content-type: application/x-www-form-urlencoded' --data grant_type=client_credentials --data client_id=<client_id> --data client_secret=<client_secret> --data-urlencode "audience=https://<your_org_name>-<your_open_catalog_account_name>.snowflakecomputing.com" --data "scope=SESSION:ROLE:<custom_role_name>" | jq -r '.access_token')
```

Where:

* `<Auth0_domain>` is your Auth0 domain. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Domain** field.
* `<client_id>`is the client ID for your application in Auth0. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Client ID** field.
* `<client_secret>` is the client secret for your application in Auth0. To find this value, in Auth0, navigate to Applications > Applications >
  [Name of your application] > Settings > **Client Secret** field.
* `<audience>` is the identifier for your API. To find this value, in Auth0, navigate to Applications > APIs > select your API > Settings

  > **Identifier** field.
* `<your_org_name>-<your_open_catalog_account_name>` is your organization name and Snowflake Open Catalog account name, separated by
  a hyphen.

  For example: `ABCDEFG-MYACCOUNT1`.

  To retrieve these values in this format, see Retrieve your organization and account name.
* `<custom_role_name>` is the name of a custom role you granted with catalog roles, such as `ENGINEER`.

## Step 10: Connect to Open Catalog with External OAuth

In this section, you connect the service principal to Open Catalog through External OAuth. For instructions, see [Connect to Snowflake Open Catalog with External OAuth](external-oauth-connect.md), which includes instructions for connecting by using an access token or automatic token refresh.

---
title: Configure key pair authentication in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/key-pair-auth-configure.md
section: User Guide
---

# Configure key pair authentication in Snowflake Open Catalog

This topic describes how to configure key pair authentication in Snowflake Open Catalog. This configuration allows a key pair authentication
user to connect to Open Catalog programmatically through an access token. For simplicity, unless otherwise specified, the rest of this topic
uses the term user to refer to a key pair authentication user.

With key pair authentication, you can allow a user programmatic access to Open Catalog for various custom roles with permissions on the appropriate catalogs.
For example:

* ANALYST custom role: Can only access catalogA.
* ENGINEER custom role: Can only access catalogB.

## Prerequisites

* [Create a catalog](https://other-docs.snowflake.com/en/opencatalog/create-catalog) in your Open Catalog account.
* You must have [Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation) installed on your
  machine. In addition, you must create a Snowflake CLI connection for Open Catalog. To create this connection, see
  Create a Snowflake CLI connection for Open Catalog below.
* To configure key pair authentication, you must have the service admin role in Open Catalog. For more information, see [User roles](https://other-docs.snowflake.com/en/opencatalog/access-control#user-roles). In Snowflake CLI, this role is printed as POLARIS_ACCOUNT_ADMIN.
* You must have [SnowSQL](https://www.snowflake.com/en/developers/downloads/snowsql/) installed on your machine.
* You need a service admin access token. You need this token to configure key pair authentication programmatically and it’s required to grant
  a user with catalog admin privileges. To generate this token, see Generate your service admin access token below.

## Before you begin

To configure key pair authentication, you need a Snowflake CLI connection for Open Catalog.

To create this connection, you need your full Open Catalog account identifier, which includes your Snowflake organization name and your
Open Catalog account name; for example: `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

## Create a Snowflake CLI connection for Open Catalog

Create a Snowflake CLI connection for your Open Catalog account so that you can use it to configure key pair authentication for the account.

### Step 1: Add a Snowflake CLI connection for Snowflake Open Catalog

Add a connection for the Snowflake Open Catalog account where you want to configure key pair authentication.

* [Add a connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md)
  with the following values. For all other parameters, press `Enter` to skip specifying a value for the parameter.

  | Connection configuration parameters | Value |
  | --- | --- |
  | **Name for this connection** | Specify a name for the connection; for example, `myopencatalogconnection`. |
  | **Account name** | Specify your Snowflake organization name, followed by your Open Catalog account name, in this format:  `<orgname>-<my-snowflake-open-catalog-account-name>`.  For example, `ABCDEFG-MYACCOUNT1`.  To find these names, see Before you begin. |
  | **Username** | Specify your username for Open Catalog; for example, `jsmith`. |
  | **Password [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  Enter your password for Open Catalog; for example, `MyPassword123456789`. |
  | **Role for the connection [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  You must enter `POLARIS_ACCOUNT_ADMIN` |

### Step 2: Test the Snowflake CLI connection

* To test your CLI connection, follow this example, which tests the connection for `myopencatalogconnection`:

  ```snowcli
  snow connection test -c myopencatalogconnection
  ```

  The response should look like this:

  ```snowcli
  +------------------------------------------------------------------------------+
  | key              | value                                                     |
  |----------------------------+-------------------------------------------------|
  | Connection name  | myopencatalogconnection                                   |
  | Status           | OK                                                        |
  | Host             | ABCDEFG-MYACCOUNT1.snowflakecomputing.com                 |
  | Account          | ABCDEFG-MYACCOUNT1                                        |
  | User             | jsmith                                                    |
  | Role             | POLARIS_ACCOUNT_ADMIN                                     |
  | Database         | not set                                                   |
  | Warehouse        | not set                                                   |
  +------------------------------------------------------------------------------+
  ```

### Step 3: Set a default Snowflake CLI connection

To ensure that the connection you’re using always has the required POLARIS_ACCOUNT_ADMIN role granted to it, you can set the Snowflake CLI
connection you created for Open Catalog as the default connection. For more information about the default connection, see
[Set the default connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. Follow this example, which sets the `myopencatalogconnection` connection as the default:

   ```snowcli
   snow connection set-default myopencatalogconnection
   ```
2. To confirm that you’re using the correct user and role, run the following:

   ```snowcli
   snow sql -q "Select current_user(); select current_role();"
   ```

   The response should return your Open Catalog username and the CURRENT
   ROLE should be POLARIS_ACCOUNT_ADMIN.

   ```snowcli
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | JSMITH        |
   +----------------+
   select current_role();
   +-----------------------+
   | CURRENT_ROLE()        |
   |-----------------------|
   | POLARIS_ACCOUNT_ADMIN |
   +-----------------------+
   ```

## Generate your service admin access token

To configure key pair authentication programmatically, you need a service admin access token. However, you have the option to use the Open Catalog UI
to perform some tasks for configuring key pair authentication.

If you already generated a service admin access token for yourself and it’s still active, you can skip this step.

The steps for generating a service admin access token are as follows:

1. Generate a private and public key
2. Assign the public key to yourself
3. Generate a JWT for yourself
4. Generate a service admin access token

### Generate a private and public key

This section describes how to generate a private and public key.

To generate a private key, use the following command:

```bash
openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
```

To generate a public key, use the following command:

```bash
openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
```

### Assign the public key to yourself

Use your Snowflake CLI connection to assign the public key to yourself.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

1. To assign the public key to yourself, run:

   ```snowcli
   snow sql -q "alter user <your_username> set RSA_PUBLIC_KEY='<your_public_key>';"
   ```

   Where:

   * `<your_username>` is your Open Catalog username, which you use to sign in to the Open Catalog UI.
   > **Note:**
   >
   > If you need to retrieve your public key, run: `cat rsa_key.pub`.
2. Validate that the user has the public key (`RSA_PUBLIC_KEY`) set and the fingerprint of user’s RSA public key (`RSA_PUBLIC_KEY_FP`) set:

   ```snowcli
   snow sql -q "desc user <your_username>;"
   ```

   Where:

   * `<your_username>` is your Open Catalog username, which you use to sign in to the Open Catalog UI.

### Generate a JSON Web Token

In this section, you generate a JSON Web Token (JWT), which you need in order to generate an access token.

1. To use SnowSQL to generate a JWT, run:

   ```bash
   snowsql --private-key-path rsa_key.p8 --generate-jwt -h <your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com -a <account-identifier> -u <your_user_name>
   ```

   Where:

   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by
     a hyphen.

     For example: `ABCDEFG-MYACCOUNT1`.

     To retrieve these values in this format, see Retrieve your organization and account name.
   * `<account-identifier>` is the account identifier for your Snowflake Open Catalog account. To retrieve it, refer to your Open Catalog
     account URL. For example, `abc12345`in `https://app.snowflake.com/us-west-2/abc12345/#/`.
   * `<your_user_name>` is your Open Catalog username.
2. If you encrypted it, enter the passkey or else select Enter to continue. It may take a few seconds for you to receive your JWT.

### Generate a service admin access token

In this section, you generate a service admin access token, which you use to configure key pair authentication programmatically.

1. To generate your service admin access token, execute the following command and copy the value into a text editor:

   ```bash
   curl -i -X POST \
   -H "Content-Type: application/x-www-form-urlencoded" \
   -H "Accept: application/json" \
   --data-urlencode "scope=session:role:POLARIS_ACCOUNT_ADMIN" \
   --data-urlencode "grant_type=client_credentials" \
   --data-urlencode "client_secret=<your_JWT>" \
   "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens"
   ```

   Where:

   * `POLARIS_ACCOUNT_ADMIN` is the name of the built-in role in Open Catalog that allows you to perform administrative tasks in Open
     Catalog, which includes configuring key pair authentication. In the Open Catalog UI, this role is referred to as the service admin
     role.
   * `<your_JWT>` is the JWT you generated in the previous step.
   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by
     a hyphen.

     For example: `ABCDEFG-MYACCOUNT1`.

     To retrieve these values in this format, see Retrieve your organization and account name.

## Step 1: Create a custom role

In this step, you use your Snowflake CLI connection to create a custom role.

Create a custom role so that later, you can grant catalog roles to it and grant the custom role to a user, which bestows the user
with privileges. For more information about custom roles, see [Custom role](access-control.md). For more information on the
RBAC model in Open Catalog, see [RBAC model](access-control.md).

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

The following example creates an `OPEN_CATALOG_ADMIN` custom role:

```snowcli
snow sql -q "create role OPEN_CATALOG_ADMIN;"
```

## Step 2: Retrieve your organization and account name

You need your organization name and Snowflake Open Catalog account name, separated by a hyphen, for tasks such as generating a JWT or an access token.

1. To retrieve your organization name and Snowflake Open Catalog account name in this format, run the following command:

   ```snowcli
   snow sql -q "SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();"
   ```
2. In the response, copy the returned values and paste them into a text editor for later use. For example: `ABCDEFG-MYACCOUNT1`.

## Step 3: Grant a catalog role to a custom role

In this section, you grant a catalog role to the custom role you created.

After you grant a catalog role to the custom role, you then grant the custom role to a user to bestow the user
with any privileges granted to catalog roles that are granted to the principal role. For more information on the RBAC model in Open Catalog,
see [RBAC model](access-control.md).

You can grant the custom role with a catalog role that has catalog admin privileges for the catalog or has a set of
privileges for the catalog that you specify:

* If you want to grant a user with catalog admin privileges to a catalog,
  grant a catalog role with catalog admin privileges to the custom role. For information on what privileges a catalog admin has, see [Catalog admin role](access-control.md).
* If you want to grant the user with a set of privileges you specify,
  grant a catalog role with a set of privileges you specify to the custom role.
  For example, choose this option if you want to grant a cataLog_reader, catalog_writer, or catalog_metadata_reader catalog role to the
  custom role.

### Grant a catalog role with catalog admin privileges to the custom role

In this section, you grant a catalog role with catalog admin privileges to the custom role. The workflow is as follows:

1. Create a catalog role.
2. Grant catalog admin privileges to the catalog role.
3. Grant the catalog role to a custom role.

#### Create a catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command to create a catalog role in the catalog you specify:

> ```bash
>  curl -X POST \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{
>    "name": "<catalog_role_name>"
>    }'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of the catalog in Open Catalog where you want to create a catalog role.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * For `<catalog_role_name>` specify a name for the catalog role. For example: CatalogAdmin.

See [Catalog](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#catalog-1) in Snowflake Labs.

See See [Create a catalog role](create-catalog-role.md).

#### Grant catalog admin privileges to the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "catalog", "privilege": "CATALOG_MANAGE_CONTENT"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `CATALOG_MANAGE_CONTENT` is the name of the privilege in Open Catalog with catalog admin privileges.

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.
As described in the instructions, make sure you grant the `CATALOG_MANAGE_CONTENT` privilege to the catalog role.

See [Grant catalog privileges on a catalog role](secure-catalogs.md) and select the
`CATALOG_MANAGE_CONTENT` privilege.

#### Grant the catalog role to a custom role

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/principal-roles/{customRoleName}/catalog-roles/{catalogName}" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"catalogRole": {"name": "<catalog_role_name>", "properties": {}, "entityVersion": 1}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{customRoleName}` is the name of the custom role you want to grant with the catalog role that has catalog admin privileges.
>       For example, OPEN_CATALOG_ADMIN.
>     * `{catalogName}` is the name of a catalog in Open Catalog that you want to grant catalog admin privileges to.
>     * `<catalog_role_name>` is the name of the catalog role for a catalog, which has catalog admin privileges granted to it.
>       For exampple: CatalogAdmin.
>     * `<service_admin_access_token>` is the service admin access token you generated.

See [Grants](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#grants) in Snowflake Labs.

See [Secure catalogs](secure-catalogs.md). These
instructions describe how to grant a catalog role to a principal role but the process is the same. Instead of selecting a principal
role from the list, select your custom role that you want to grant with the catalog role that has the `CATALOG_MANAGE_CONTENT` privilege.

If needed, repeat this step to grant the custom role with catalog admin privileges for other catalogs.

### Grant a catalog role with a set of privileges you specify to the custom role

The workflow is as follows:

1. Create a catalog role.
2. Grant privileges to the catalog role. These privileges allow the user to perform actions in Open Catalog.
3. Grant the catalog role to a custom role.

#### Create a catalog role

In this section, you create a catalog role. For more information about catalog roles, see [Catalog role](access-control.md).

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command to create a catalog role in the catalog you specify:

> ```bash
>  curl -X POST \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{
>    "name": "<catalog_role_name>"
>    }'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of the catalog in Open Catalog where you want to create a catalog role.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * For `<catalog_role_name>` specify a name for the catalog role. For example: TableReader.

See [Catalog](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#catalog-1) in Snowflake Labs.

See See [Create a catalog role](create-catalog-role.md).

#### Grant privileges to the catalog role

You can grant privileges to the entire catalog or to a namespace or table in the catalog:

* Grant catalog privileges on the catalog role
* Grant namespace privileges on the catalog role
* Grant table privileges on the catalog role

##### Grant catalog privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "catalog", "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<privilege_name>` is the name of the privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant catalog privileges on a catalog role](secure-catalogs.md).

##### Grant namespace privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "namespace", "namespace": ["<namespace_name>"], "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<namespace_name>` is the name of the namespace you want to grant privileges to. To grant privileges to a
>       nested namespace, specify it, along with each parent namespace, separated by a comma. For example: `"ns1","ns1a"`.
>     * `<privilege_name>` is the name of the namespace privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant namespace privileges on a catalog role](secure-catalogs.md).

##### Grant table privileges on the catalog role

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/catalogs/{catalogName}/catalog-roles/{catalogRoleName}/grants" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"grant": {"type": "table", "namespace": ["<namespace_name>"], "tableName": "<table_name>", "privilege": "<privilege_name>"}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{Catalogname}` is the name of a catalog in Open Catalog that you want to grant privileges to.
>     * `{catalogRoleName}` is the name of the catalog role you want to grant privileges on. For example, TableReader.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<namespace_name>` is the name of the namespace whose table you want to grant privileges to. To grant privileges on a
>       table located under a nested namespace, specify the nested namespace, along with each parent namespace, separated by a comma. For example: `"ns1","ns1a"`.
>     * `<table_name>` is the name of the table you want to grant privileges to.
>     * `<privilege_name>` is the name of the privilege you want to grant to the catalog role. For the list of available privileges,
>       see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges).

See [Privileges](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#privileges) in Snowflake Labs.

> **Note:**
>
> The example in Snowflake Labs grants the `CATALOG_MANAGE_CONTENT` privilege, which grants catalog admin privileges for the
> catalog. However, for the list of other available privileges, see [Access control privileges](https://other-docs.snowflake.com/en/opencatalog/access-control#access-control-privileges) in
> the Open Catalog documentation.

See [Grant table privileges on a catalog role](secure-catalogs.md).

#### Grant the catalog role to a custom role

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

curlApache Polaris (Incubating) CLIOpen Catalog UI

Run the following command:

> ```bash
> curl -X PUT \
> "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/management/v1/principal-roles/{customRoleName}/catalog-roles/{catalogName}" \
> -H "Authorization: Bearer <service_admin_access_token>" \
> -H "Content-Type: application/json" \
> -H "Accept: application/json" \
> -d '{"catalogRole": {"name": "<catalog_role_name>", "properties": {}, "entityVersion": 1}}'
> ```
>
> Where:
> :   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by a hyphen.
>
>       For example: `ABCDEFG-MYACCOUNT1`.
>
>       To retrieve these values in this format, see Step 2: Retrieve your organization and account name.
>     * `{customRoleName}` is the name of the custom role you that you want to grant the catalog role to.
>     * `{catalogName}` is the name of a catalog in Open Catalog where you created the catalog role that you want to grant privileges on.
>     * `<service_admin_access_token>` is the service admin access token you generated.
>     * `<catalog_role_name>` is the name of the catalog role that you want to grant to the custom role.

See [Grants](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo?tab=readme-ov-file#grants) in Snowflake Labs.

See [Secure catalogs](secure-catalogs.md). These
instructions describe how to grant a catalog role to a principal role but the process is the same. Instead of selecting a principal
role from the list, you select your custom role.

If needed, repeat this workflow to grant additional catalog roles that you create to the custom role.

## Step 4: Create a user

Use your Snowflake CLI connection to create a key pair authentication user in Open Catalog. A human can’t use this user’s credentials to
sign in to the Open Catalog UI.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

1. To create a user, run the following command:

   ```snowcli
   snow sql -q "create user <username> login_name='<username>';"
   ```

   Where:

   * `<username>` is the user name you want to assign to the key pair authentication user.

## Step 5: Grant a custom role to the user

In this section, you use your Snowflake CLI connection to grant the custom role to one or more users. As a result, you bestow the user with the privileges granted to any catalog role(s) that are granted to
the custom role. For more information on the RBAC model in Open Catalog, see [RBAC model](access-control.md).

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

> **Important:**
>
> Custom roles are case-sensitive. You should specify a custom role with *all* uppercase letters, even if you create it with lowercase
> letters or lowercase and uppercase letters.

The following example grants the `ENGINEER` custom role to the `keypairuser1` user.

```snowcli
snow sql -q "GRANT ROLE ENGINEER to user keypairuser1;"
```

To validate that the role is granted to the user, run:

```snowcli
snow sql -q "show grants to user keypairuser1;"
```

In the response, check that the custom role you created (**role** column) is assigned to the user you created (**grantee_name** column).

## Step 6: Generate an access token for the user

In this section, you generate an access token for the user, which you use later to connect the user to Open Catalog through key pair
authentication.

The steps are as follows:

1. Generate a private and public key
2. Assign the public key to the user
3. Generate a JWT
4. Generate an access token

### Generate a private and public key

This section describes how to generate a private and public key.

To generate a private key, use the following command:

```bash
openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
```

To generate a public key, use the following command:

```bash
openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
```

### Assign the public key to the user

Use your Snowflake CLI connection to assign the public key to the user you created.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

1. To assign the public key to the user you created, run:

   ```snowcli
   snow sql -q "alter user <username> set RSA_PUBLIC_KEY='<your_public_key>';"
   ```

   > **Note:**
   >
   > If you need to retrieve your public key, run: `cat rsa_key.pub`.
2. Validate that the user has the public key (`RSA_PUBLIC_KEY`) set and the fingerprint of user’s RSA public key (`RSA_PUBLIC_KEY_FP`) set:

   ```snowcli
   snow sql -q "desc user keypairuser1;"
   ```

### Generate a JWT for a user

In this step, you generate a JWT, which you need in order to generate an access token.

1. To use SnowSQL to generate a JWT, run:

   ```bash
   snowsql --private-key-path rsa_key.p8 --generate-jwt -h <your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com -a <account-identifier> -u <user_name>
   ```

   Where:

   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by
     a hyphen.

     For example: `ABCDEFG-MYACCOUNT1`.

     To retrieve these values in this format, see Retrieve your organization and account name.
   * `<account-identifier>` is the account identifier for your Snowflake Open Catalog account. To retrieve it, refer to your Open Catalog
     account URL. For example, `abc12345`in `https://app.snowflake.com/us-west-2/abc12345/#/`.
   * `<user_name>` is the user name for an Open Catalog user with the public key assigned to the user.
2. If you encrypted it, enter the passkey or else select Enter to continue. It may take a few seconds for you to receive your JWT.

### Generate an access token for the user

1. Use the JWT to retrieve an access token for a custom or built-in role:

   ```bash
   curl -i -X POST \
   -H "Content-Type: application/x-www-form-urlencoded" \
   -H "Accept: application/json" \
   --data-urlencode "scope=session:role:<custom_role_name>" \
   --data-urlencode "grant_type=client_credentials" \
   --data-urlencode "client_secret=<your_JWT>" \
   "https://<your_org_name>-<your_open_catalogaccount_name>.snowflakecomputing.com/polaris/api/catalog/v1/oauth/tokens"
   ```

   Where

   * `<custom_role_name>` is the name of a custom role you created, such as `ENGINEER`.
   * `<your_JWT>` is the JWT you generated in the previous step.
   * `<your_org_name>-<your_open_catalogaccount_name>` is your organization name and Snowflake Open Catalog account name, separated by
     a hyphen.

     For example: `ABCDEFG-MYACCOUNT1`.

     To retrieve these values in this format, see Retrieve your organization and account name.
2. Store the access token in a variable (`$ACCESS_TOKEN`).

## Step 7: Connect to Open Catalog with key pair authentication

In this section, you connect the user to Open Catalog through key pair authentication. For instructions,
see [Connect to Snowflake Open Catalog with key pair authentication](key-pair-auth-connect.md).

## Configure key-pair rotation

Open Catalog supports multiple active keys to allow for uninterrupted rotation. The steps for configuring key-pair rotation in Open Catalog
are the same as configuring it in Snowflake. For instructions, see [Configuring key-pair rotation](https://docs.snowflake.com/en/user-guide/key-pair-auth#configuring-key-pair-rotation)
in the Snowflake documentation.

## Use the Apache Polaris™ (Incubating) CLI to manage catalogs

After you configure key pair authentication, you can generate a service admin access token to use the Apache Polaris™ (Incubating) CLI
to set up and manage catalogs in Open Catalog. For instructions, see the [Open Catalog with Polaris guide](https://github.com/Snowflake-Labs/polaris-cli-opencatalog-demo)
in Snowflake Labs.

---
title: Configure Microsoft Entra ID for External OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-azure.md
section: User Guide
---

# Configure Microsoft Entra ID for External OAuth

This topic describes how to configure Snowflake as an OAuth Resource and Microsoft Entra ID as an External OAuth Authorization Server to
facilitate secure, programmatic access to Snowflake data.

## Configuration procedure

The following four steps assume that your environment does not have anything configured relating to Microsoft Entra ID OAuth authorization
servers, OAuth clients, scopes, and necessary metadata.

The information from Steps 1-3 will be used to create a security integration in Snowflake.

If you already have an Microsoft Entra ID OAuth authorization server and client configured, it is not necessary to complete all of the steps
below. Rather, skim the following three steps and verify that you can obtain the desired information, create scopes, assign scopes to one or
more policies, and access the metadata.

If you do not have an Microsoft Entra ID OAuth authorization server and client configured, complete all of the following four steps.

> **Important:**
>
> The steps in this topic are a representative example on how to configure Microsoft Entra ID for External OAuth.
>
> You can configure Microsoft Entra ID to any desired state and use any desired OAuth flow provided that you can obtain the necessary
> information for the security integration (in this topic).
>
> Note that the following steps serve as a guide to obtain the necessary information to create the security integration in Snowflake.
>
> Steps 1-3 are derived from the Microsoft Entra ID documentation on OAuth 2.0 and authentication. For more information on how Microsoft
> defines its terms, its user interface, and options relating to OAuth 2.0 and authentication consult the following Microsoft Entra ID guides:
>
> * [Microsoft identity platform (v2.0) overview](https://docs.microsoft.com/en-us/azure/active-directory/develop/v2-overview)
> * [Authentication protocol (and related topics)](https://docs.microsoft.com/en-us/azure/active-directory/develop/v2-app-types)
> * [Application configuration (and related topics)](https://docs.microsoft.com/en-us/azure/active-directory/develop/app-objects-and-service-principals)
> * [How-to guides for authentication & application configuration (and related topics)](https://docs.microsoft.com/en-us/azure/active-directory/develop/active-directory-enterprise-app-role-management)

### Determine the OAuth flow in Microsoft Entra ID

Microsoft Entra ID supports two different OAuth flows in which an OAuth Client can get an access token.

1. The authorization server can grant the OAuth client an access token on behalf of the user.
2. The authorization server can grant the OAuth client an access token for the OAuth client itself.

In the first flow, the identity in the access token references the user. In the second flow, the identity in the access token references
the OAuth client.

Microsoft Entra ID does not allow the same role format for each of these two OAuth flows. The role format to use depends on the OAuth
flow in use. After determining which OAuth flow to use:

* Complete sub-step 10 or 11 in Configure the OAuth resource in Microsoft Entra ID
* Complete sub-step 13 or 14 in Create an OAuth client in Microsoft Entra ID

### Configure the OAuth resource in Microsoft Entra ID

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com) and authenticate.
2. Navigate to Microsoft Entra ID.
3. Click on App Registrations.
4. Click on New Registration.
5. Enter `Snowflake OAuth Resource`, or similar value as the Name.
6. Verify the Supported account types is set to Single Tenant.
7. Click Register.
8. Click on Expose an API.
9. Click on the Set link next to Application ID URI to set the `Application ID URI`.

   > > **Important:**
   > >
   > > The `Application ID URI` must be unique within your organization’s directory, such as
   > > `https://your.example.com/4d2a8c2b-a5f4-4b86-93ca-294185f45f2e`. This value will be referred to as the
   > > `<SNOWFLAKE_APPLICATION_ID_URI>` in the subsequent configuration steps.
   > >
   > > For help obtaining your Application ID URI, please contact your internal Microsoft Entra ID administrator.
   > >
   > > If the Application ID URI is not used, then it is necessary to create a security integration with audiences using the Snowflake
   > > Account URL (i.e. `<account_identifier>.snowflakecomputing.com`). For more information, see:
   > >
   > > * The audience integration in Create a security integration in Snowflake.
   > > * [Account identifiers](admin-account-identifier.md).
10. To add a Snowflake Role as an OAuth scope for OAuth flows where the programmatic client acts on behalf of a user, click on
    Add a scope to add a scope representing the Snowflake role.

    * Enter the scope by having the name of the Snowflake role with the `session:scope:` prefix. For example, for the Snowflake
      Analyst role, enter `session:scope:analyst`.
    * Select who can consent.
    * Enter a display name for the scope (e.g.: Account Admin).
    * Enter a description for the scope (e.g.: Can administer the Snowflake account).
    * Click Add Scope.
11. To add a Snowflake Role as a Role for OAuth flows where the programmatic client requests an access token for itself:

    * Click on Manifest.
    * Locate the `appRoles` element.
    * Enter an App Role with the following settings.

      | Setting | Description |
      | --- | --- |
      | allowedMemberTypes | Application. |
      | description | A description of the role. |
      | displayName | A friendly name for users to view. |
      | id | A unique ID. You can use the `[System.Guid]::NewGuid()` function from PowerShell to generate a unique ID if needed. |
      | isEnabled | Set to `true`. |
      | lang | The language. Set to `null`. |
      | origin | Set to `Application`. |
      | value | Set to the name of the Snowflake role with the `session:role:` prefix. . For the Analyst role, enter `session:role:analyst`. |

      The App Role manifests as follows.

      ```sqljson
      "appRoles":[
          {
              "allowedMemberTypes": [ "Application" ],
              "description": "Account Administrator.",
              "displayName": "Account Admin",
              "id": "3ea51f40-2ad7-4e79-aa18-12c45156dc6a",
              "isEnabled": true,
              "lang": null,
              "origin": "Application",
              "value": "session:role:analyst"
          }
      ]
      ```
12. Click Save.

### Create an OAuth client in Microsoft Entra ID

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com) and authenticate.
2. Navigate to Azure Active Directory.
3. Click on App Registrations.
4. Click on New Registration.
5. Enter a name for the client such as `Snowflake OAuth Client`.
6. Verify the Supported account types is set to Single Tenant.
7. Click Register.
8. In the Overview section, copy the `ClientID` from the Application (client) ID field. This will be known as the
   `<OAUTH_CLIENT_ID>` in the following steps.
9. Click on Certificates & secrets and then New client secret.
10. Add a description of the secret.
11. Select never expire. For testing purposes, select secrets that never expire.
12. Click Add. Copy the secret. This will be known as the `<OAUTH_CLIENT_SECRET>` in the following steps.
13. For programmatic clients that will request an Access Token on behalf of a user, configure Delegated permissions for Applications as
    follows.

    * Click on API Permissions.
    * Click on Add Permission.
    * Click on the Microsoft Entra ID setting that corresponds to the available APIs (for example, My APIs or APIs my organization uses).
    * Click on the Snowflake OAuth Resource that you created in Configure the OAuth resource in Microsoft Entra ID.
    * Click on the Delegated Permissions box.
    * Check on the Permission related to the Scopes defined in the Application that you wish to grant to this client.
    * Click Add Permissions.
    * Click on the Grant Admin Consent button to grant the permissions to the client. Note that for testing purposes, permissions
      are configured this way. However, in a production environment, granting permissions in this manner is not advisable.
    * Click Yes.
14. For programmatic clients that will request an Access Token for themselves, configure API permissions for Applications as
    follows.

    * Click on API Permissions.
    * Click on Add Permission.
    * Click on My APIs.
    * Click on the Snowflake OAuth Resource that you created in Configure the OAuth resource in Microsoft Entra ID.
    * Click on the Application Permissions.
    * Check on the Permission related to the Roles manually defined in the `Manifest` of the Application that you wish to
      grant to this client.
    * Click Add Permissions.
    * Click on the Grant Admin Consent button to grant the permissions to the client.Note that for testing purposes, permissions
      are configured this way. However, in a production environment, granting permissions in this manner is not advisable.
    * Click Yes.

### Collect Azure AD information for Snowflake

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com) and authenticate.
2. Navigate to Azure Active Directory.
3. Click on App Registrations.
4. Click on the Snowflake OAuth Resource that you created in Configure the OAuth resource in Microsoft Entra ID.
5. Click on Endpoints in the Overview interface.
6. On the right-hand side, copy the OAuth 2.0 token endpoint (v2) and note the URLs for OpenID Connect metadata and
   Federation Connect metadata.

   * The OAuth 2.0 token endpoint (v2) will be known as the `<AZURE_AD_OAUTH_TOKEN_ENDPOINT>` in the following configuration
     steps. The endpoint should be similar to
     `https://login.microsoftonline.com/90288a9b-97df-4c6d-b025-95713f21cef9/oauth2/v2.0/token`.
   * For the OpenID Connect metadata, open in a new browser window.

     + Locate the `"jwks_uri"` parameter and copy its value.
     + This parameter value will be known as the `<AZURE_AD_JWS_KEY_ENDPOINT>` in the following configuration steps. The endpoint
       should be similar to `https://login.microsoftonline.com/90288a9b-97df-4c6d-b025-95713f21cef9/discovery/v2.0/keys`.
   * For the Federation metadata document, open the URL in a new browser window.

     + Locate the `"entityID"` parameter in the `XML Root Element` and copy its value.
     + This parameter value will be known as the `<AZURE_AD_ISSUER>` in the following configuration steps. The entityID value should
       be similar to `https://sts.windows.net/90288a9b-97df-4c6d-b025-95713f21cef9/`.

### Create a security integration in Snowflake

This step involves creating a security integration in Snowflake to ensure that Snowflake can communicate with Microsoft Entra ID securely,
validate the tokens from Microsoft Entra ID, and provide the appropriate Snowflake data access to users based on the user role associated with
the OAuth token.

Choose the security integration that best addresses your use case and configuration needs. If your integration is only based on the
preceding configuration, use the first security integration. For more information, see
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md).

> **Important:**
>
> If you are trying to create a security integration for Microsoft Power BI, follow the setup instructions in [Power BI SSO to Snowflake](oauth-powerbi.md).
>
> Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute
> this SQL command.
>
> The security integration parameter values are case-sensitive, and the values you put into the security integration must match those
> values in your environment. If the case does not match, it is possible that the access token will not be validated, resulting in a failed
> authentication attempt.
>
> Verify all values are an exact match. For example, if the Issuer value does not end with a backslash and the security integration is
> created with a backslash character at the end of the URL, an error message will occur. It would then be necessary to drop the security
> integration object using [DROP INTEGRATION](../sql-reference/sql/drop-integration.md) and then create the object again with the correct Issuer value
> using CREATE SECURITY INTEGRATION.

**Create a security integration for Microsoft Entra ID**

> ```sqlexample
> create security integration external_oauth_azure_1
>     type = external_oauth
>     enabled = true
>     external_oauth_type = azure
>     external_oauth_issuer = '<AZURE_AD_ISSUER>'
>     external_oauth_jws_keys_url = '<AZURE_AD_JWS_KEY_ENDPOINT>'
>     external_oauth_token_user_mapping_claim = 'upn'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```

**Create a security integration with audiences**

> The `external_oauth_audience_list` parameter of the security integration must match the Application ID URI that you specified
> while configuring Microsoft Entra ID.
>
> ```sqlexample
> create security integration external_oauth_azure_2
>     type = external_oauth
>     enabled = true
>     external_oauth_type = azure
>     external_oauth_issuer = '<AZURE_AD_ISSUER>'
>     external_oauth_jws_keys_url = '<AZURE_AD_JWS_KEY_ENDPOINT>'
>     external_oauth_audience_list = ('<SNOWFLAKE_APPLICATION_ID_URI>')
>     external_oauth_token_user_mapping_claim = 'upn'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```

### Modifying Your External OAuth Security Integration

You can update your External OAuth security integration by executing an ALTER statement on the security integration.

For more information, see [ALTER SECURITY INTEGRATION (External OAuth)](../sql-reference/sql/alter-security-integration-oauth-external.md).

### Using ANY role with External OAuth

In the configuration step to create a security integration in Snowflake, the OAuth access token includes the scope definition. Therefore, at runtime, using the External OAuth security integration allows neither the OAuth client nor the user to use an undefined role in the OAuth access token.

After validating the access token and creating a session, the ANY role can allow the OAuth client and user to decide its role. If necessary, the client or the user can switch to a role that is different that the role defined in the OAuth access token.

To configure ANY role, define the scope as `SESSION:ROLE-ANY` and configure the security integration with the `external_oauth_any_role_mode` parameter. This parameter can have three possible string values:

* `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `use role <role>;`). Default.
* `ENABLE` allows the OAuth client or user to switch roles.
* `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the `USE_ANY_ROLE` privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

  ```sqlexample
  grant USE_ANY_ROLE on integration external_oauth_1 to role1;
  ```

  ```sqlexample
  revoke USE_ANY_ROLE on integration external_oauth_1 from role1;
  ```

Define the security integration as follows:

```sqlexample
create security integration external_oauth_1
    type = external_oauth
    enabled = true
    external_oauth_any_role_mode = 'ENABLE'
    ...
```

### Using secondary roles with External OAuth

The desired scope for the primary role is passed in the external token: either the default role for the user (`session:role-any`) or
a specific role that was granted to the user (`session:role:<role_name>`).

By default, Snowflake does not activate the default [secondary roles](security-access-control-overview.md) for a user (i.e.
the DEFAULT_SECONDARY_ROLES) user in the session.

To activate the default secondary roles for a user in a session and allow executing the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command while using External OAuth, complete the following steps:

1. Configure the security integration for the connection. Set the EXTERNAL_OAUTH_ANY_ROLE_MODE parameter value to either ENABLE or
   ENABLE_FOR_PRIVILEGE when you create the security integration (using CREATE SECURITY INTEGRATION) or later (using ALTER SECURITY
   INTEGRATION).
2. Configure the authorization server to pass the static value of `session:role-any` in the scope attribute of the token. For more
   information about the scope parameter, see [External OAuth overview](oauth-ext-overview.md).

### Using Client Redirect with External OAuth

Snowflake supports using Client Redirect with External OAuth, including using Client Redirect and OAuth with supported Snowflake Clients.

For more information, see [Redirecting client connections](client-redirect.md).

### Using network policies with External OAuth

Currently, network policies cannot be added to your External OAuth security integration. However, you can still implement network policies that apply broadly to the entire Snowflake account.

If your use case requires a network policy that is specific to the OAuth security integration, use [Snowflake OAuth](oauth-intro.md). This approach allows the Snowflake OAuth network policy to be distinct from other network policies that may apply to the Snowflake account.

For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Using replication with External OAuth

Snowflake supports replication and failover/failback of the External OAuth security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Testing procedure

In the context of testing OAuth while using Microsoft Entra ID as an authorization server, you must:

1. Verify that the test user exists in Microsoft Entra ID and has a password.
2. Verify that the test user exists in Snowflake with their `login_name` attribute value set to the `<AZURE_AD_USER_USERNAME>`
3. Grant the SYSADMIN role to this user.
4. Register an OAuth Client.
5. Allow the OAuth Client to make a POST request to the Microsoft Entra ID Token endpoint as follows:

   * Grant type set to Resource Owner
   * HTTP Basic Authorization header containing the clientID and secret
   * FORM data containing the user’s username & password
   * Include scopes

Here is an example for getting an access token using cURL. Note that the scope must be fully qualified, including the Microsoft Entra ID App
URI (e.g. `scope=https://example.com/wergheroifvj25/session:role-any`).

```bash
curl -X POST -H "Content-Type: application/x-www-form-urlencoded;charset=UTF-8" \
  --data-urlencode "client_id=<OAUTH_CLIENT_ID>" \
  --data-urlencode "client_secret=<OAUTH_CLIENT_SECRET>" \
  --data-urlencode "username=<AZURE_AD_USER>" \
  --data-urlencode "password=<AZURE_AD_USER_PASSWORD>" \
  --data-urlencode "grant_type=password" \
  --data-urlencode "scope=<AZURE_APP_URI+AZURE_APP_SCOPE>" \
  '<AZURE_AD_OAUTH_TOKEN_ENDPOINT>'
```

## Connecting to Snowflake with External OAuth

After configuring your security integration and obtaining your access token, you can connect to Snowflake using one of the following:

* [SnowSQL](snowsql-start.md)
* [Python Connector](../developer-guide/python-connector/python-connector-connect.md)
* [Go Driver](https://godoc.org/github.com/snowflakedb/gosnowflake#hdr-Connection_Parameters)
* [JDBC Driver](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC Driver](../developer-guide/odbc/odbc-parameters.md)
* [Spark Connector](spark-connector-use.md)
* [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md#create-a-connection)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

Note the following:

* It is necessary to set the `authenticator` parameter to `oauth` and the `token` parameter to the `external_oauth_access_token`.
* When passing the `token` value as a URL query parameter, it is necessary to URL-encode the `token` value.
* When passing the `token` value to a Properties object (e.g. JDBC Driver), no modifications are necessary.

For example, if using the Python Connector, set the connection string as shown below.

```python
ctx = snowflake.connector.connect(
   user="<username>",
   host="<hostname>",
   account="<account_identifier>",
   authenticator="oauth",
   token="<external_oauth_access_token>",
   warehouse="test_warehouse",
   database="test_db",
   schema="test_schema"
)
```

You can now use External OAuth to connect to Snowflake securely.

---
title: Configure Okta for External OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-okta.md
section: User Guide
---

# Configure Okta for External OAuth

This topic describes how to configure Snowflake as an OAuth Resource and Okta as an External OAuth authorization server to facilitate
secure, programmatic access to Snowflake data.

## Configuration procedure

The following five steps assume that your environment does not have anything configured relating to Okta OAuth authorization servers,
OAuth clients, scopes, and necessary metadata.

The information from Steps 1-3 will be used to create a security integration in Snowflake.

If you already have an Okta authorization server and client configured, it is not necessary to complete all of the steps below. Rather,
skim the following four steps and verify that you can obtain the desired information, create scopes, assign scopes to one or more policies,
and access the metadata.

If you do not have and Okta OAuth authorization server and client configured, complete all of the following five steps.

> **Important:**
>
> The steps in this topic are a representative example on how to configure Okta for External OAuth.
>
> You can configure Okta to any desired state and use any desired OAuth flow provided that you can obtain the necessary information for the
> security integration (in this topic).
>
> Note that the following steps serve as a guide to obtain the necessary information to create the security integration in Snowflake.
>
> Be sure to consult your internal security policies with regard to configuring an authorization server to ensure your organization meets
> all necessary regulations and compliance requirements.
>
> Steps 1-3 are derived from the Okta documentation on Authorization Servers. For more information on how Okta defines its terms, its user
> interface, and options relating to Authorization Servers, consult the following Okta guides:
>
> * [Create an Authorization Server](https://developer.okta.com/docs/guides/customize-authz-server/overview/)
> * [Implement the Authorization Code Flow](https://developer.okta.com/docs/guides/implement-auth-code/overview/)
> * [Implement the Authorization Code Flow with PKCE](https://developer.okta.com/docs/guides/implement-auth-code-pkce/overview/)
> * [Implement the Client Credentials Flow](https://developer.okta.com/docs/guides/implement-client-creds/overview/)
> * [Implement the Resource Owner Password Flow](https://developer.okta.com/docs/guides/implement-password/overview/)
> * [Refresh Access Tokens](https://developer.okta.com/docs/guides/refresh-tokens/overview/)

### Create an OAuth compatible client to use with Snowflake

1. Navigate to the Okta Admin Console.
2. Click Applications.
3. Click Add Application.
4. Click Create New App.

   * For Platform, select Native App.
5. Click Create.
6. Enter a name for the application.
7. In the Login redirect URIs box, add the full Snowflake account URL
   (i.e. `https://<account_identifier>.snowflakecomputing.com`). For a list of possible URL formats, see
   [Connecting with a URL](organizations-connect.md).
8. Click Save.
9. From New Applications in the General interface, click Edit.
10. Check Refresh Token and Resource Owner Password.
11. Click Save.
12. Click the Edit button next to Client Credentials.
13. Select the Use Client Authentication option.
14. Click Save.
15. In the Client Credentials container, save the ClientID and Secret. These two values will be known as the
    `<OAUTH_CLIENT_ID>` and `<OAUTH_CLIENT_SECRET>`, respectively in the following steps.

### Create an OAuth authorization server

1. Navigate to the Okta Admin Console.
2. In the Security menu, click API.
3. Click Authorization Servers.
4. Click Add Authorization Server.
5. Enter a name.
6. Enter the Snowflake account URL as the Audience value. For a list of possible URL formats, see [Connecting with a URL](organizations-connect.md).
7. Click Save.

Complete the following steps for the newly added Authorization Server.

1. Copy the Issuer value. Its format should resemble `https://dev-390798.oktapreview.com/oauth2/auslh9j9vf9ej7NfT0h7`. This
   value will be known as the `<OKTA_ISSUER>` in the following steps.
2. Click on Scopes.
3. Click on Add Scope.
4. To add a Snowflake Role as a scope, enter the scope by having the name of the Snowflake role with the `session:role:` prefix
   (e.g.: for the Snowflake Analyst role, enter `session:role:analyst`).
5. Click Create.
6. Click on Access Policies.
7. Click Add Policy.
8. Enter a name and a description for the policy. Assign it to the client created earlier and click Create.
9. In the newly added Access Policy, click Add Rule.
10. Enter a rule name.
11. Select the authorized Grant Types. You should select Resource Owner Password and Client Credentials along
    with any others that match your organization’s policies.
12. For scopes, you can select any of the scopes or select the desired scopes created earlier that clients assigned to this policy will be
    able to request (including offline_access for refresh tokens if needed). Configure any additional settings as needed.
13. Click Create Rule.

### Collect Okta information

1. Go to the Okta Admin Console.
2. In the Security menu, click API.
3. Click Authorization Servers.
4. Click on the Authorization Server for the Snowflake Resource.
5. In the Settings tab, copy the Issuer value. This value will be known as the `<OKTA_ISSUER>` in the following
   steps. Its format should resemble `https://dev-111111.oktapreview.com/oauth2/auslh9j9vf9ej7NfT0h7`.

In the Metadata document:

1. Copy the Metadata URI value, open a browser tab, and paste the URL in the address bar.
2. You should see JSON text in the browser. You can work with this text in a text editor or in the browser itself.
3. Locate the `"jwks_uri"` parameter and copy its value. Its format should resemble
   `https://dev-111111.oktapreview.com/oauth2/auslh9j9vf9ej7NfT0h7/v1/keys`. This endpoint will be known as the
   `<OKTA_JWS_KEY_ENDPOINT>` in the following steps.
4. Locate the `"token_endpoint"` parameter and copy its value. Its format should resemble
   `https://dev-111111.oktapreview.com/oauth2/auslh9j9vf9ej7NfT0h7/v1/token`. This endpoint will be known as the `<OKTA_OAUTH_TOKEN_ENDPOINT>` in the following steps.

### Create a security integration for Okta

This step creates a security integration in Snowflake. The security integration ensures that Snowflake can communicate with Okta securely,
validates the tokens from Okta, and provides the appropriate Snowflake data access to users based on the user role associated with the
OAuth token.

For more information, see [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md).

> **Important:**
>
> Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute
> this SQL command.
>
> The security integration parameter values are case-sensitive and the values you put into the security integration must match those values
> in your environment. If the case does not match, it is possible that the access token will not be validated resulting in a failed
> authentication attempt.

**Create a security integration with audiences**

> The `external_oauth_audience_list` parameter of the security integration must match the Audience that you specified
> while configuring Okta.
>
> ```sqlexample
> create security integration external_oauth_okta_2
>     type = external_oauth
>     enabled = true
>     external_oauth_type = okta
>     external_oauth_issuer = '<OKTA_ISSUER>'
>     external_oauth_jws_keys_url = '<OKTA_JWS_KEY_ENDPOINT>'
>     external_oauth_audience_list = ('<snowflake_account_url')
>     external_oauth_token_user_mapping_claim = 'sub'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```

### Modifying Your External OAuth Security Integration

You can update your External OAuth security integration by executing an ALTER statement on the security integration.

For more information, see [ALTER SECURITY INTEGRATION (External OAuth)](../sql-reference/sql/alter-security-integration-oauth-external.md).

### Using ANY role with External OAuth

In the configuration step to create a security integration in Snowflake, the OAuth access token includes the scope definition. Therefore, at runtime, using the External OAuth security integration allows neither the OAuth client nor the user to use an undefined role in the OAuth access token.

After validating the access token and creating a session, the ANY role can allow the OAuth client and user to decide its role. If necessary, the client or the user can switch to a role that is different that the role defined in the OAuth access token.

To configure ANY role, define the scope as `SESSION:ROLE-ANY` and configure the security integration with the `external_oauth_any_role_mode` parameter. This parameter can have three possible string values:

* `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `use role <role>;`). Default.
* `ENABLE` allows the OAuth client or user to switch roles.
* `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the `USE_ANY_ROLE` privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

  ```sqlexample
  grant USE_ANY_ROLE on integration external_oauth_1 to role1;
  ```

  ```sqlexample
  revoke USE_ANY_ROLE on integration external_oauth_1 from role1;
  ```

Define the security integration as follows:

```sqlexample
create security integration external_oauth_1
    type = external_oauth
    enabled = true
    external_oauth_any_role_mode = 'ENABLE'
    ...
```

### Using secondary roles with External OAuth

The desired scope for the primary role is passed in the external token: either the default role for the user (`session:role-any`) or
a specific role that was granted to the user (`session:role:<role_name>`).

By default, Snowflake does not activate the default [secondary roles](security-access-control-overview.md) for a user (i.e.
the DEFAULT_SECONDARY_ROLES) user in the session.

To activate the default secondary roles for a user in a session and allow executing the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command while using External OAuth, complete the following steps:

1. Configure the security integration for the connection. Set the EXTERNAL_OAUTH_ANY_ROLE_MODE parameter value to either ENABLE or
   ENABLE_FOR_PRIVILEGE when you create the security integration (using CREATE SECURITY INTEGRATION) or later (using ALTER SECURITY
   INTEGRATION).
2. Configure the authorization server to pass the static value of `session:role-any` in the scope attribute of the token. For more
   information about the scope parameter, see [External OAuth overview](oauth-ext-overview.md).

### Using Client Redirect with External OAuth

Snowflake supports using Client Redirect with External OAuth, including using Client Redirect and OAuth with supported Snowflake Clients.

For more information, see [Redirecting client connections](client-redirect.md).

### Using network policies with External OAuth

Currently, network policies cannot be added to your External OAuth security integration. However, you can still implement network policies that apply broadly to the entire Snowflake account.

If your use case requires a network policy that is specific to the OAuth security integration, use [Snowflake OAuth](oauth-intro.md). This approach allows the Snowflake OAuth network policy to be distinct from other network policies that may apply to the Snowflake account.

For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Using replication with External OAuth

Snowflake supports replication and failover/failback of the External OAuth security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Testing procedure

In the context of testing OAuth while using Okta as an authorization server, you must:

1. Verify that the test user exists in Okta and has a password.
2. Verify that the test user exists in Snowflake with their `login_name` attribute value set to the `<OKTA_USER_USERNAME>`
3. Register an OAuth Client.
4. Allow the OAuth Client to make a POST request to the Okta Token endpoint as follows:

   * Grant type set to Resource Owner
   * HTTP Basic Authorization header containing the clientID and secret
   * FORM data containing the user’s username & password
   * Include scopes

The sample command requests the Analyst and that assumes the `session:role:analyst` have been defined in
Okta > OAuth App Resource.

Here is an example for getting an access token using cURL.

```bash
curl -X POST -H "Content-Type: application/x-www-form-urlencoded;charset=UTF-8" \
  --user <OAUTH_CLIENT_ID>:<OAUTH_CLIENT_SECRET> \
  --data-urlencode "username=<OKTA_USER_USERNAME>" \
  --data-urlencode "password=<OKTA_USER_PASSWORD>" \
  --data-urlencode "grant_type=password" \
  --data-urlencode "scope=session:role:analyst" \
  <OKTA_OAUTH_TOKEN_ENDPOINT>
```

## Connecting to Snowflake with External OAuth

After configuring your security integration and obtaining your access token, you can connect to Snowflake using one of the following:

* [SnowSQL](snowsql-start.md)
* [Python Connector](../developer-guide/python-connector/python-connector-connect.md)
* [Go Driver](https://godoc.org/github.com/snowflakedb/gosnowflake#hdr-Connection_Parameters)
* [JDBC Driver](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC Driver](../developer-guide/odbc/odbc-parameters.md)
* [Spark Connector](spark-connector-use.md)
* [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md#create-a-connection)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

Note the following:

* It is necessary to set the `authenticator` parameter to `oauth` and the `token` parameter to the `external_oauth_access_token`.
* When passing the `token` value as a URL query parameter, it is necessary to URL-encode the `token` value.
* When passing the `token` value to a Properties object (e.g. JDBC Driver), no modifications are necessary.

For example, if using the Python Connector, set the connection string as shown below.

```python
ctx = snowflake.connector.connect(
   user="<username>",
   host="<hostname>",
   account="<account_identifier>",
   authenticator="oauth",
   token="<external_oauth_access_token>",
   warehouse="test_warehouse",
   database="test_db",
   schema="test_schema"
)
```

You can now use External OAuth to connect to Snowflake securely.

---
title: Configure organizational listings
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-configure.md
section: User Guide
---

# Configure organizational listings

This page introduces configurations for organizational listings in Snowflake. You’ll find details on targeting accounts, adding roles, access regions, and auto-fulfillment settings.

## Set the Uniform Listing Locator or listing name

The Uniform Listing Locator (ULL) is a unique identifier that represents the listing and its data product, treating them as one.
The listing name is different from the title of the listing: multiple listings can have the same title, but each listing must have
a unique listing name or ULL. The complete ULL is formed by three elements delimited by the symbol ‘$’.
The first element is the provider’s organization name, the second element is the provider profile
`INTERNAL`, and the third element is the listing name. The ULL cannot be changed after the listing is published.
Although it has three parts, the ULL is treated as a single name in queries. For example, you can query a table in a listing like this:

```sqlexample
SELECT * FROM "ORGDATACLOUD$INTERNAL$MY_LISTING_NAME_123".PUBLIC.TABLE_FROM_LISTING;
```

When creating a listing, give it a clear, descriptive name. Consumers can find listings faster by name rather than title,
and a descriptive name is easier to use in queries.

## Set who can discover and access an organizational listing

The target audience of your organizational listings is always your internal marketplace.

Despite the restrictions of an internal listing, you can still control who can discover and access the listing.
You can mark each listing discoverable and accessible individually. That is, you can configure a listing so that it is discoverable but not accessible.

In general you can specify access or discovery at the following levels:

* Everyone in your account
* Specific accounts
* Specific accounts, but limited by specific roles

For example the `access` element defines who can access a listing.
Likewise, the `discovery` element defines who can discover a listing.

Allow all accounts to access the listing.

```yaml
organization_targets:
   access:
   - all_accounts : true
```

Allow specific accounts to access the listing.

```yaml
organization_targets:
   access:
   - account: 'Account1'
   - account: 'Account2'
```

Allow specific accounts to access the listing, but only for the given roles.

```yaml
organization_targets:
   access:
      - account: 'Account1'
         roles: [<role1>, <role2>, <role3>]
```

Allow all accounts to discover the listing.

```yaml
organization_targets:
   discovery:
   - all_accounts : true
```

Allow specific accounts to discover the listing.

```yaml
organization_targets:
   discovery:
   - account: 'Account1'
   - account: 'Account2'
```

Allow specific accounts to discover the listing, but only for the given roles.

```yaml
organization_targets:
   discovery:
      - account: 'Account1'
         roles: [<role1>, <role2>, <role3>]
```

In a similar way, regions are set up the access regions_attribute:

```yaml
locations:
  access_regions:
     - name: "ALL"
```

```yaml
locations:
   access_regions:
     - name: "AWS_US_WEST_2"
     - name: "AZURE_CENTRALINDIAUS-EAST"
```

## Specify approver and support contact

Optionally, you can specify an email address or link to internal ticketing system for both approver and support contact.

```yaml
support_contact: "support@somedomain.com"
approver_contact: "approver@somedomain.com"
```

## Set auto-fulfillment options for an organizational listing

Organizational listings that have attached data shares and apps both use auto-fulfillment, however they each
use different methods. For this reason, the refresh schedules for each are different. For shares, the refresh
schedule is set on the database level. For apps, it’s set on the account level.

If you need to use auto-fulfillment, you can set it when your run `CREATE ORGANIZATIONAL LISTING` OR
`ALTER LISTING` by changing the auto_fulfillment attribute in the [listing manifest
fields](https://other-docs.snowflake.com/en/sql-reference/sql/create-listing#parameters).

```yaml
auto_fulfillment:
   refresh_type: SUB_DATABASE
   refresh_schedule: '10 MINUTE'
```

---
title: Configure organizational listings for auto-fulfillment
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-auto-fulfillment.md
section: User Guide
---

# Configure organizational listings for auto-fulfillment

Auto-fulfillment ensures that data products in organizational listings are propagated across regions
automatically, eliminating the need for manual replication. This mechanism provides seamless regional
availability for data consumers, enhancing consistency and reducing administrative overhead in multi-region
data environments.

Before you begin, make sure you have the [necessary privileges](../../../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md) to manage auto-fulfillment settings for organizational listings.

If your organization spans multiple regions, you can enable auto-fulfillment for your organizational listings
to ensure that data products are available in all regions where your organization has a presence.
Auto-fulfillment happens automatically if it’s enabled for your organization.

To find your account name (`account_name`), run this command:

```sqlexample
SHOW ACCOUNTS;
```

To check if global data sharing is enabled for your organization account, run this command:

```sqlexample
SELECT SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT('<account_name>');
```

To enable global data sharing for an organization account, run this command:

```sqlexample
CALL SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('<account_name>');
```

To disable global data sharing for an organization account, run this command:

```sqlexample
CALL SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('<account_name>');
```

---
title: Configure PingFederate for External OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-pingfed.md
section: User Guide
---

# Configure PingFederate for External OAuth

This topic describes how to configure Snowflake as an OAuth Resource and Ping Identity PingFederate as an External OAuth authorization
server to facilitate secure, programmatic access to Snowflake data.

## Configuration procedure

The following two steps assume that your environment does not have anything configured relating to PingFederate OAuth authorization
servers, OAuth clients, scopes, and necessary metadata. These steps are also a representative example on how to configure PingFederate.

The information from the first step will be used to create a security integration in Snowflake.

If you already have a PingFederate authorization server and client configured, it is not necessary to complete all of the steps below.
Rather, skim the first step and verify that you can obtain the desired information, create scopes, assign scopes to one or more policies,
and access the metadata.

If you do not have a PingFederate OAuth authorization server and client configured, complete both steps.

> **Important:**
>
> The steps in this topic are a representative example on how to configure PingFederate for External OAuth.
>
> You can configure PingFederate to any desired state and use any desired OAuth flow provided that you can obtain the necessary information
> for the security integration (in this topic).
>
> Note that the following steps serve as a guide to obtain the necessary information to create the security integration in Snowflake.
>
> Be sure to consult your internal security policies with regard to configuring an authorization server to ensure your organization meets
> all necessary regulations and compliance requirements.
>
> Steps 1 is derived from the PingIdentity documentation on OAuth 2.0. For more information on how PingIdentity defines its terms, its user
> interface, and options relating to Authorization Servers consult the following PingIdentity guide:
>
> * [OAuth 2.0 Developer’s Guide](https://www.pingidentity.com/content/developer/en/resources/oauth-2-0-developers-guide.html)

### Configure PingFederate

1. Navigate to the PingFederate Server downloads page and either download or upgrade your PingFederate instance based on your
   [operating system](https://www.pingidentity.com/en/resources/downloads/pingfederate.html).
2. Use the PingFederate installation guide for your operating system. After installation, access PingFederate.
3. Create the OAuth Scopes by navigating to the Exclusive Scopes interface in the OAuth Server panel.
4. To add a Snowflake Role as a scope, add the role to the Scope Value. The Snowflake role must have the `session:role:`
   prefix (e.g. for the Snowflake Analyst role, enter `session:role:analyst`).
5. Enter a description for the scope in the Scope Description box and click Add.
6. Navigate to the OAuth Server tab and create a new client. Verify the following values.

   > | Field | Value |
   > | --- | --- |
   > | NAME | A friendly name for the PingFederate OAuth Authorization server |
   > | DESCRIPTION | A friendly description for the PingFederate OAuth Authorization Server |
   > | CLIENT AUTHENTICATION | CLIENT SECRET |
   > | EXCLUSIVE SCOPES | Select the Scopes (i.e. Snowflake Roles) |
   > | ALLOWED GRANT TYPES | Choose Refresh Token and Resource Owner Password Credentials |
   > | DEFAULT ACCESS TOKEN MANAGER | JSON Web Tokens |
7. Navigate to the Security tab and export the certificate. Extract the public key from the certificate for use in the following
   steps.
8. Navigate to the Instance Configuration tab under the OAuth Server tab and
   Access Token Management | Create Access Token Management Instance, and then:

   * Update the ISSUER CLAIM VALUE to the unique identifier referencing this OAuth Authorization Server.
   * Update the AUDIENCE CLAIM VALUE to your Snowflake account URL
     (e.g. `https://<account_identifier>.snowflakecomputing.com`). For a list of possible URL formats, see
     [Connecting with a URL](organizations-connect.md).
9. Download the PingFederate OAuth Playground add on from the
   [Developer Tools](https://www.pingidentity.com/en/resources/downloads/pingfederate.html) section. This client performs API requests.
10. Install the
    [OAuth Playground](https://www.pingidentity.com/content/dam/developer/downloads/Software/OAuth%20Grant%20Types%20using%20the%20OAuth%20PlayGround.pdf).

### Create a security integration in Snowflake

This step creates a security integration in Snowflake to ensure that Snowflake can communicate with PingIdentity
securely, validate the tokens from PingIdentity, and provide the appropriate Snowflake data access to users based on the user role
associated with the OAuth token.

Execute the following statement in the Snowflake web interface, Snowflake CLI, or SnowSQL.

Note that the value for `external_oauth_issuer` must be the unique identifier set in Step 1.8. For example, if the unique identifier
value is `27f10cde-a964-4499-a88c-0c598883e5ad`, replace `<unique_id>` with `'27f10cde-a964-4499-a88c-0c598883e5ad'`. The
unique identifier must be in single (i.e. vertical) quotes.

Choose the security integration that best addresses your use case and configuration needs. For more information, see
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md).

> **Important:**
>
> Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute
> this SQL command.
>
> The security integration parameter values are case-sensitive and the values you put into the security integration must match those values
> in your environment. If the case does not match, it is possible that the access token will not be validated resulting in a failed
> authentication attempt.

**Create a security integration for PingFederate**

> ```sqlexample
> create or replace security integration external_oauth_pf_1
>     type = external_oauth
>     enabled = true
>     external_oauth_type = ping_federate
>     external_oauth_rsa_public_key = '<BASE64_PUBLIC_KEY>'
>     external_oauth_issuer = '<unique_id>'
>     external_oauth_token_user_mapping_claim = 'username'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```
>
> This security integration uses the `external_oauth_rsa_public_key` parameter. Snowflake uses the public key value to verify the
> Signature on JWT Access Token.

**Create a security integration with audiences**

> > The `external_oauth_audience_list` parameter of the security integration must match the Audience Claim Value that you
> > specified while configuring PingFederate.
>
> ```sqlexample
> create security integration external_oauth_pf_2
>     type = external_oauth
>     enabled=true
>     external_oauth_type = ping_federate
>     external_oauth_issuer = '<ISSUER>'
>     external_oauth_rsa_public_key = '<BASE64_PUBLIC_KEY>'
>     external_oauth_audience_list = ('<snowflake_account_url>')
>     external_oauth_token_user_mapping_claim = 'username'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```
>
> This security integration uses the `external_oauth_rsa_public_key` parameter. Snowflake uses the public key value to verify the
> Signature on JWT Access Token.

### Modifying Your External OAuth Security Integration

You can update your External OAuth security integration by executing an ALTER statement on the security integration.

For more information, see [ALTER SECURITY INTEGRATION (External OAuth)](../sql-reference/sql/alter-security-integration-oauth-external.md).

### Using ANY role with External OAuth

In the configuration step to create a security integration in Snowflake, the OAuth access token includes the scope definition. Therefore, at runtime, using the External OAuth security integration allows neither the OAuth client nor the user to use an undefined role in the OAuth access token.

After validating the access token and creating a session, the ANY role can allow the OAuth client and user to decide its role. If necessary, the client or the user can switch to a role that is different that the role defined in the OAuth access token.

To configure ANY role, define the scope as `SESSION:ROLE-ANY` and configure the security integration with the `external_oauth_any_role_mode` parameter. This parameter can have three possible string values:

* `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `use role <role>;`). Default.
* `ENABLE` allows the OAuth client or user to switch roles.
* `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the `USE_ANY_ROLE` privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

  ```sqlexample
  grant USE_ANY_ROLE on integration external_oauth_1 to role1;
  ```

  ```sqlexample
  revoke USE_ANY_ROLE on integration external_oauth_1 from role1;
  ```

Define the security integration as follows:

```sqlexample
create security integration external_oauth_1
    type = external_oauth
    enabled = true
    external_oauth_any_role_mode = 'ENABLE'
    ...
```

### Using secondary roles with External OAuth

The desired scope for the primary role is passed in the external token: either the default role for the user (`session:role-any`) or
a specific role that was granted to the user (`session:role:<role_name>`).

By default, Snowflake does not activate the default [secondary roles](security-access-control-overview.md) for a user (i.e.
the DEFAULT_SECONDARY_ROLES) user in the session.

To activate the default secondary roles for a user in a session and allow executing the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command while using External OAuth, complete the following steps:

1. Configure the security integration for the connection. Set the EXTERNAL_OAUTH_ANY_ROLE_MODE parameter value to either ENABLE or
   ENABLE_FOR_PRIVILEGE when you create the security integration (using CREATE SECURITY INTEGRATION) or later (using ALTER SECURITY
   INTEGRATION).
2. Configure the authorization server to pass the static value of `session:role-any` in the scope attribute of the token. For more
   information about the scope parameter, see [External OAuth overview](oauth-ext-overview.md).

### Using network policies with External OAuth

Currently, network policies cannot be added to your External OAuth security integration. However, you can still implement network policies that apply broadly to the entire Snowflake account.

If your use case requires a network policy that is specific to the OAuth security integration, use [Snowflake OAuth](oauth-intro.md). This approach allows the Snowflake OAuth network policy to be distinct from other network policies that may apply to the Snowflake account.

For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Using replication with External OAuth

Snowflake supports replication and failover/failback of the External OAuth security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Testing procedure

In the context of testing OAuth while using PingFederate as an authorization server, you must:

1. Verify that the test user exists in PingIdentity and has a password.
2. Verify that the test user exists in Snowflake with their `login_name` attribute value set to the `<PING_USER_USERNAME>`
3. Grant the Analyst role to this user.
4. Register an OAuth Client.
5. Allow the OAuth Client to make a POST request to the PingFederate Token endpoint as follows:

   * Grant type set to Resource Owner.
   * HTTP Basic Authorization header containing the clientID and secret.
   * FORM data containing the user’s username and password.
   * Include any necessary scopes.

The sample command requests the Analyst and that assumes the `session:role:analyst` is defined in
PingFederate > OAuth Server > Exclusive Scopes.

Use the following command to obtain an access token from Ping.

```bash
curl -k 'https://10.211.55.4:9031/as/token.oauth2' \
  --data-urlencode 'client_id=<CLIENT_ID>&grant_type=password&username=<USERNAME>&password=<PASSWORD>&client_secret=<CLIENT_SECRET>&scope=session:role:analyst'
```

## Connecting to Snowflake with External OAuth

After configuring your security integration and obtaining your access token, you can connect to Snowflake using one of the following:

* [SnowSQL](snowsql-start.md)
* [Python Connector](../developer-guide/python-connector/python-connector-connect.md)
* [Go Driver](https://godoc.org/github.com/snowflakedb/gosnowflake#hdr-Connection_Parameters)
* [JDBC Driver](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC Driver](../developer-guide/odbc/odbc-parameters.md)
* [Spark Connector](spark-connector-use.md)
* [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md#create-a-connection)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

Note the following:

* It is necessary to set the `authenticator` parameter to `oauth` and the `token` parameter to the `external_oauth_access_token`.
* When passing the `token` value as a URL query parameter, it is necessary to URL-encode the `token` value.
* When passing the `token` value to a Properties object (e.g. JDBC Driver), no modifications are necessary.

For example, if using the Python Connector, set the connection string as shown below.

```python
ctx = snowflake.connector.connect(
   user="<username>",
   host="<hostname>",
   account="<account_identifier>",
   authenticator="oauth",
   token="<external_oauth_access_token>",
   warehouse="test_warehouse",
   database="test_db",
   schema="test_schema"
)
```

You can now use External OAuth to connect to Snowflake securely.

---
title: Configure private connectivity for the Snowflake Open Catalog UI
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-ui-configure.md
section: User Guide
---

# Configure private connectivity for the Snowflake Open Catalog UI

This topic describes how to configure private connectivity for the Snowflake Open Catalog UI. This configuration, combined with configuring
private connectivity for your Open Catalog account, allows you to access the Open Catalog UI through private connectivity instead of over
the public internet. For more information, see the prerequisites for configuring private connectivity for the Snowflake Open Catalog UI.

Configuring private connectivity for the UI is similar to configuring it for your Open Catalog account. However, when you configure it for
the UI, you need to configure additional DNS entries because the UI is hosted under a different domain, compared to your account.

## Prerequisites

* Before you configure private connectivity for the Open Catalog UI, you must set up inbound private connectivity for your Open Catalog
  account. If you don’t, you won’t be able to use the UI.

  To set up inbound private connectivity for your Open Catalog account, follow the guide specific to the cloud platform that hosts your Open Catalog account:

  + [AWS](private-connectivity-inbound-configure-aws.md)
  + [Azure](private-connectivity-inbound-configure-azure.md)

## Step 1: Retrieve your PrivateLink URLs

In this procedure, you retrieve PrivateLink URLs, which you use to configure private connectivity.

1. Sign in to Snowflake Open Catalog.
2. In the navigation menu, select **Settings**.
3. On the Settings page, copy the values for the following settings into a text editor:

   * PrivateLink Account URL
   * Regionless PrivateLink Account URL
   * Regionless Snowsight PrivateLink URL
   * Snowsight PrivateLink URL

   For descriptions of each setting, see
   [Return values for the SYSTEM$GET_PRIVATELINK_CONFIG system function](https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_config#returns)
   in the Snowflake documentation. In this topic, the names of the account settings are in JSON format.

   > **Note:**
   > * Remember that the description refers to a Snowflake account, but your value is actually for your Snowflake Open
   >   Catalog account. For example, the `privatelink-account-url` is the URL for your Snowflake Open Catalog account.
   > * You need the **Regionless Snowsight PrivateLink URL** and **Snowsight PrivateLink URL** settings because if an Open Catalog user navigates to the Regionless Account URL, the user is redirected to Snowsight.
   > * Optional: To retrieve these values in JSON format, [Create a Snowflake CLI connection for Open Catalog](private-connectivity-outbound-manage-endpoints-aws.md),
   >   and then call the SYSTEM$GET_PRIVATELINK_CONFIG system function.

## Step 2: Confirm your settings

To use private connectivity with the Open Catalog UI, configure your DNS, and ensure that firewalls allow access to the relevant values:

1. Confirm that your DNS settings can resolve the values.
2. To confirm that you can connect to Open Catalog from your browser, use each of the PrivateLink URLs.
3. Optional: To use the account name URL (the value for **Regionless PrivateLink Account URL**) as your primary URL to access Open Catalog,
   contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support), and request that all URL redirects point to the URL
   specified by **Regionless PrivateLink Account URL**.

---
title: Configure replication for Snowflake-managed Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-replication.md
section: User Guide
---

# Configure replication for Snowflake-managed Apache Iceberg™ tables

With this feature, you can replicate [Snowflake-managed Apache Iceberg™ tables](tables-iceberg.md), from a source account to one or more target accounts in the same organization.

Replication for Iceberg tables works similarly to replication for regular Snowflake tables. Snowflake replicates an Iceberg table
when you add its parent database to a failover or replication group.

However, Snowflake-managed Iceberg tables rely on external volumes, which are account-level objects that require extra configuration to connect
to your external cloud storage. Before you can replicate an Iceberg table, you must configure replication for external volumes.

## Opt in to the public preview for replication for Snowflake-managed Iceberg tables

To opt in to this public preview, you must opt in both the source and target account.

1. To opt in your source account, after you [enable preview features](../release-notes/preview-features.md) for your account, use the
   [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to enable the following parameters at the
   account level:

   > * ENABLE_ICEBERG_MANAGED_TABLE_REPLICATION
   >
   >   > **Note:**
   >   >
   >   > You can also enable this parameter at the failover group level.
   > * ENABLE_SELECTIVE_EXTERNAL_VOLUME_REPLICATION_PUPR
   >
   > For example:
   >
   > ```sqlexample
   > ALTER ACCOUNT SET
   >   ENABLE_ICEBERG_MANAGED_TABLE_REPLICATION = TRUE
   >   ENABLE_SELECTIVE_EXTERNAL_VOLUME_REPLICATION_PUPR = TRUE;
   > ```
2. Repeat the previous step for your target account.

## Enable replication

A user with the ORGADMIN role must enable replication for each source and target account in the organization:

```sqlexample
USE ROLE ORGADMIN;
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER(
    '<organization_name>.<account_name>',
    'ENABLE_ACCOUNT_DATABASE_REPLICATION',
    'true');
```

For more information, see [Prerequisite: Enable replication for accounts in the organization](account-replication-config.md).

For more information about replication, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

## Replicate an external volume by using a failover group

These steps provide a sample workflow for replicating an external volume and the Iceberg tables that depend on it
to a target account by using a failover group.

> **Note:**
>
> If you don’t already have an external volume, you can create one with the storage locations that you want, including a
> location *in the same region* as your target account. After configuring storage access for each location,
> you can create and replicate an Iceberg table that references the external volume.
>
> To create an external volume, see [Configure an external volume](tables-iceberg-configure-external-volume.md).

1. In the source account, update your external volume to add a storage location *in the same region* as your target account.

   For example:

   ```sqlexample
   ALTER EXTERNAL VOLUME exvol1
    ADD STORAGE_LOCATION =
    (
      NAME = 'my-s3-us-central-2'
      STORAGE_PROVIDER = 'S3'
      STORAGE_BASE_URL = 's3://my_bucket_us_central-2/'
      STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myrole'
      STORAGE_AWS_EXTERNAL_ID = 'iceberg_table_external_id'
    );
   ```

   > **Important:**
   >
   > If you don’t specify your own `STORAGE_AWS_EXTERNAL_ID` for S3 storage, you must call [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) after you
   > add the new storage location to retrieve the Snowflake-generated external ID.
   > You need the external ID to configure access to S3 in the next step.

   Snowflake sets this new location as the [active storage location](tables-iceberg-storage.md) for the secondary
   external volume.
2. In the source account, create a Snowflake-managed Iceberg table that uses the external volume that you updated with the additional storage
   location.

   For example:

   ```sqlexample
   CREATE ICEBERG TABLE my_iceberg_table (amount int)
     CATALOG = 'SNOWFLAKE'
     EXTERNAL_VOLUME = 'exvol1'
     BASE_LOCATION = 'my_iceberg_table';
   ```
3. In the source account, retrieve information about the [Snowflake service principal](tables-iceberg-storage.md) for
   your *target account* by following these steps:

   1. Retrieve the name (`account_name`) of your target account by using the [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md) command.

      ```sqlexample
      SHOW REPLICATION ACCOUNTS LIKE 'my_target_account%';
      ```
   2. Call the [SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY](../sql-reference/functions/system_desc_iceberg_access_identity.md) system function.
      Specify the cloud provider for the target storage location and the name of your target account *exactly* as
      it appears in the `account_name` column of the SHOW REPLICATION ACCOUNTS output.

      For example:

      ```sqlexample
      SELECT SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY('S3', 'MY_TARGET_ACCOUNT_1');
      ```
4. Configure Snowflake access to the storage location associated with your target account.
   Follow the instructions for your cloud provider, using the information you retrieved for the service principal in the target account:

   * [Configure an external volume for Amazon S3](tables-iceberg-configure-external-volume-s3.md). Use the external ID associated with the storage location for your target account.
   * [Configure an external volume for Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
   * [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md). In the `AZURE_CONSENT_URL TEMPLATE` returned by
     SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY, replace `your_tenant_id` with the ID for your
     tenant that the storage location belongs to.
5. In the source account, use the [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) command to create a failover group.
   Specify `EXTERNAL VOLUMES` in the `OBJECT_TYPES` list. In the
   `ALLOWED_DATABASES` list, include the database with the Iceberg tables that you want to replicate. In the
   `ALLOWED_EXTERNAL_VOLUMES` list, include the external volumes that provide access to the Iceberg tables that you want to replicate.

   ```sqlexample
   CREATE FAILOVER GROUP my_iceberg_fg
     OBJECT_TYPES = DATABASES, EXTERNAL VOLUMES
     ALLOWED_DATABASES = my_iceberg_database
     ALLOWED_EXTERNAL_VOLUMES = my_external_volume
     ALLOWED_ACCOUNTS = myorg.my_account_1;
   ```

   > **Note:**
   >
   > If you receive a SQL parser error, your list of allowed external volumes might be too long. If you receive this error, shorten this
   > list in your CREATE FAILOVER GROUP statement, and then use the [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command to add
   > additional allowed external volumes to the failover group.

   To update an existing group, use the [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command to add `EXTERNAL VOLUMES` to the
   `OBJECT_TYPES` list.
   Include any other existing objects in the `OBJECT_TYPES` list to avoid dropping those objects in the target account.

   For example, add `EXTERNAL VOLUMES` to a failover group that already includes `DATABASES`:

   ```sqlexample
   ALTER FAILOVER GROUP my_iceberg_rg SET
     OBJECT_TYPES = DATABASES, EXTERNAL VOLUMES
     ALLOWED_EXTERNAL_VOLUMES = my_external_volume;
   ```
6. In the target account, create a failover group as a replica of the group in the source account (`my_source_account`):

   ```sqlexample
   CREATE FAILOVER GROUP my_iceberg_fg
     AS REPLICA OF myorg.my_source_account.my_iceberg_fg;
   ```

   Skip this step if you already have a secondary group that replicates the group in the source account.
7. In the target account, run a refresh command.

   ```sqlexample
   ALTER FAILOVER GROUP my_iceberg_fg REFRESH;
   ```

   As long as you replicate the database that contains your Snowflake-managed Iceberg table and you’ve
   configured access to your cloud storage for the target account, Snowflake replicates the table in the target account.

   > **Note:**
   >
   > The refresh operation fails if Snowflake can’t access the storage location configured for the target account.
   > If this happens, double-check your access control settings, or try [Verifying storage access](tables-iceberg-storage.md).

## Replicate an external volume by using a replication group

These steps provide a sample workflow for replicating an external volume and the Iceberg tables that depend on it
to a target account by using a replication group.

> **Note:**
>
> If you don’t already have an external volume, you can create one with the storage locations that you want, including a
> location *in the same region* as your target account. After configuring storage access for each location,
> you can create and replicate an Iceberg table that references the external volume.
>
> To create an external volume, see [Configure an external volume](tables-iceberg-configure-external-volume.md).

1. In the source account, update your external volume to add a storage location *in the same region* as your target account.

   For example:

   ```sqlexample
   ALTER EXTERNAL VOLUME exvol1
    ADD STORAGE_LOCATION =
    (
      NAME = 'my-s3-us-central-2'
      STORAGE_PROVIDER = 'S3'
      STORAGE_BASE_URL = 's3://my_bucket_us_central-2/'
      STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myrole'
      STORAGE_AWS_EXTERNAL_ID = 'iceberg_table_external_id'
    );
   ```

   > **Important:**
   >
   > If you don’t specify your own `STORAGE_AWS_EXTERNAL_ID` for S3 storage, you must call [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) after you
   > add the new storage location to retrieve the Snowflake-generated external ID.
   > You need the external ID to configure access to S3 in the next step.

   Snowflake sets this new location as the [active storage location](tables-iceberg-storage.md) for the secondary
   external volume.
2. In the source account, create a Snowflake-managed Iceberg table that uses the external volume that you updated with the additional storage
   location.

   For example:

   ```sqlexample
   CREATE ICEBERG TABLE my_iceberg_table (amount int)
     CATALOG = 'SNOWFLAKE'
     EXTERNAL_VOLUME = 'exvol1'
     BASE_LOCATION = 'my_iceberg_table';
   ```
3. In the source account, retrieve information about the [Snowflake service principal](tables-iceberg-storage.md) for
   your *target account* by following these steps:

   1. Retrieve the name (`account_name`) of your target account by using the [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md) command.

      ```sqlexample
      SHOW REPLICATION ACCOUNTS LIKE 'my_target_account%';
      ```
   2. Call the [SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY](../sql-reference/functions/system_desc_iceberg_access_identity.md) system function.
      Specify the cloud provider for the target storage location and the name of your target account *exactly* as
      it appears in the `account_name` column of the SHOW REPLICATION ACCOUNTS output.

      For example:

      ```sqlexample
      SELECT SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY('S3', 'MY_TARGET_ACCOUNT_1');
      ```
4. Configure Snowflake access to the storage location associated with your target account.
   Follow the instructions for your cloud provider, using the information you retrieved for the service principal in the target account:

   * [Configure an external volume for Amazon S3](tables-iceberg-configure-external-volume-s3.md). Use the external ID associated with the storage location for your target account.
   * [Configure an external volume for Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
   * [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md). In the `AZURE_CONSENT_URL TEMPLATE` returned by
     SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY, replace `your_tenant_id` with the ID for your
     tenant that the storage location belongs to.
5. In the source account, use the [CREATE REPLICATION GROUP](../sql-reference/sql/create-replication-group.md) command to create a replication group.
   Specify `EXTERNAL VOLUMES` in the `OBJECT_TYPES` list. In the
   `ALLOWED_DATABASES` list, include the database with the Iceberg table(s) you want to replicate. In the
   `ALLOWED_EXTERNAL_VOLUMES` list, include the external volumes that provide access to the Iceberg table(s) you want to replicate.

   ```sqlexample
   CREATE REPLICATION GROUP my_iceberg_rg
     OBJECT_TYPES = DATABASES, EXTERNAL VOLUMES
     ALLOWED_DATABASES = my_iceberg_database
     ALLOWED_EXTERNAL_VOLUMES = my_external_volume
     ALLOWED_ACCOUNTS = myorg.my_account_1;
   ```

   > **Note:**
   >
   > If you receive a SQL parser error, your list of allowed external volumes might be too long. If you receive this error, shorten this
   > list in your CREATE REPLICATION GROUP statement, and then use the [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) command to add
   > additional allowed external volumes to the replication group.

   To update an existing group, use the [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) command to add `EXTERNAL VOLUMES` to the
   `OBJECT_TYPES` list.
   Include any other existing objects in the `OBJECT_TYPES` list to avoid dropping those objects in the target account.

   For example, add `EXTERNAL VOLUMES` to a replication group that already includes `DATABASES`:

   ```sqlexample
   ALTER REPLICATION GROUP my_iceberg_rg SET
     OBJECT_TYPES = DATABASES, EXTERNAL VOLUMES
     ALLOWED_EXTERNAL_VOLUMES = my_external_volume;
   ```
6. In the target account, create a replication group as a replica of the group in the source account (`my_source_account`):

   ```sqlexample
   CREATE REPLICATION GROUP my_iceberg_rg
     AS REPLICA OF myorg.my_source_account.my_iceberg_rg;
   ```

   Skip this step if you already have a secondary group that replicates the group in the source account.
7. In the target account, run a refresh command.

   ```sqlexample
   ALTER REPLICATION GROUP my_iceberg_rg REFRESH;
   ```

   As long as you replicate the database that contains your Snowflake-managed Iceberg table and you’ve
   configured access to your cloud storage for the target account, Snowflake replicates the table in the target account.

   > **Note:**
   >
   > The refresh operation fails if Snowflake can’t access the storage location configured for the target account.
   > If this happens, double-check your access control settings, or try [Verifying storage access](tables-iceberg-storage.md).

## Considerations and limitations

Consider the following points when you use replication for Iceberg tables:

* Snowflake currently supports replication of Snowflake-managed tables only.
* Replicating converted Iceberg tables isn’t supported. Snowflake skips converted tables during refresh operations.
* For replicated tables, you must configure access to a storage location in the *same region* as the target account.
* If you drop or alter a storage location that is used for replication on the primary external volume, refresh operations might fail.
* Secondary tables in the target account are read-only until you promote the target account to serve as the source account.
* Snowflake maintains the [directory hierarchy](tables-iceberg-storage.md)
  of the primary Iceberg table for the secondary table.
* Replication costs apply for this feature. For more information, see [Understanding replication cost](account-replication-cost.md).
* For considerations about the account objects for replication and failover groups, see [Account objects](account-replication-considerations.md).
* Replicating dynamic Iceberg tables isn’t supported. Snowflake skips converted tables during refresh operations.

---
title: Configure Snowflake OAuth for custom clients
source: https://docs.snowflake.com/en/user-guide/oauth-custom.md
section: User Guide
---

# Configure Snowflake OAuth for custom clients

This topic describes how to configure OAuth support for custom clients.

## Workflow

The following high-level steps are required to configure OAuth for custom clients:

1. Register your client with Snowflake. To register your client, create an integration. An integration is a Snowflake object that provides
   an interface between Snowflake and third-party services, such as a client that supports OAuth.

   The registration process defines a client ID and client secrets.
2. Configure calls to the Snowflake OAuth endpoints to request authorization codes from the Snowflake authorization server and to request
   and refresh access tokens.

   The optional “scope” parameters in the initial authorization request limit the role permitted by the access token and can additionally
   be used to configure the refresh token behavior.

> **Note:**
>
> In-session role switching to secondary roles is not supported with Snowflake OAuth.
>
> If this behavior is necessary with your OAuth workflow, use External OAuth instead.
>
> For more information, see [Using secondary roles with External OAuth](oauth-ext-overview.md).

## Create a Snowflake OAuth integration

Create a Snowflake OAuth integration using the
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-snowflake.md) command. Be sure to
specify `OAUTH_CLIENT = CUSTOM` when creating the integration.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this
> SQL command.

### Blocking specific roles from using the integration

The optional BLOCKED_ROLES_LIST parameter allows you to list Snowflake roles that a user cannot explicitly consent to using with
the integration.

By default, the ACCOUNTADMIN, SECURITYADMIN, GLOBALORGADMIN, and ORGADMIN roles are included in this list and cannot be removed. If you have a business
need to allow users to use Snowflake OAuth with these roles, and your security team allows it, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request that these roles be allowed for your account.

### Using Client Redirect with Snowflake OAuth custom clients

Snowflake supports using Client Redirect with Snowflake OAuth Custom Clients, including using Client Redirect and OAuth with supported
Snowflake Clients.

For more information, see [Redirecting client connections](client-redirect.md).

### Managing network policies

Snowflake supports network policies for OAuth. For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Integration example

The following example creates an OAuth integration that uses key pair authentication. The integration allows refresh tokens, which expire
after 1 day (86400 seconds). The integration blocks users from starting a session with SYSADMIN as the active role:

```sqlexample
CREATE SECURITY INTEGRATION oauth_kp_int
  TYPE = OAUTH
  ENABLED = TRUE
  OAUTH_CLIENT = CUSTOM
  OAUTH_CLIENT_TYPE = 'CONFIDENTIAL'
  OAUTH_REDIRECT_URI = 'https://localhost.com'
  OAUTH_ISSUE_REFRESH_TOKENS = TRUE
  OAUTH_REFRESH_TOKEN_VALIDITY = 86400
  BLOCKED_ROLES_LIST = ('SYSADMIN')
  OAUTH_CLIENT_RSA_PUBLIC_KEY ='
  MIIBI
  ...
  ';
```

## Call the OAuth endpoints

OAuth endpoints are the URLs that clients call to request authorization codes and to request and refresh access tokens. These endpoints
refer to specific OAuth 2.0 policies that execute when the endpoint is called.

Snowflake provides the following OAuth endpoints:

Authorization:
:   `<snowflake_account_url>/oauth/authorize`

Token requests:
:   `<snowflake_account_url>/oauth/token-request`

Where `<snowflake_account_url>` is a valid Snowflake account URL. For example, you might use the endpoints
`https://myorg-account_xyz.snowflakecomputing.com/oauth/authorize` and
`https://myorg-account_xyz.snowflakecomputing.com/oauth/token-request`. For a list of supported formats for the Snowflake account URL,
see [Connecting with a URL](organizations-connect.md).

To see a list of valid OAuth endpoints for a security integration, execute [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md),
and then view the values in the `OAUTH_ALLOWED_AUTHORIZATION_ENDPOINTS` and `OAUTH_ALLOWED_TOKEN_ENDPOINTS` properties.

### Authorization endpoint

The authorization endpoint is used to obtain an authorization grant after a user successfully authorizes a client with Snowflake.

> > **Important:**
> >
> > The authorization endpoint must be opened in a browser that the user can interact with. Do not use cURL with this endpoint.

The authorization endpoint is as follows:

```bash
<snowflake_account_url>/oauth/authorize
```

Where:

> `snowflake_account_url`
> :   Specifies a valid [Snowflake account URL](organizations-connect.md). For example,
>     `https://myorg-account_xyz.snowflakecomputing.com/oauth/authorize`.

#### HTTP method

`GET`

#### Query parameters

> **Note:**
>
> The following parameters should be URL encoded.

| Parameter | Data Type | Required? | Description |
| --- | --- | --- | --- |
| `client_id` | String | Yes | Client ID (provided by Snowflake when the client is registered) |
| `response_type` | String | Yes | Response type created. Currently supports `code` value, because Snowflake only issues authorization codes. |
| `redirect_uri` | String | Yes | URI where the user is redirected to after successfully authorizing. In general, this should match the value of the OAUTH_REDIRECT_URI parameter of the security integration.  However, if the `redirect_uri` includes query parameters, do not include those query parameters when defining the OAUTH_REDIRECT_URI parameter of the security integration. For example, if the value of the `redirect_uri` query parameter in the request to the authorization endpoint is `https://www.example.com/connect?authType=snowflake`, make sure the OAUTH_REDIRECT_URI parameter in the security integration is set to `https://www.example.com/connect`. |
| `state` | String | No | String of no more than 2048 ASCII characters that is returned with the response from the Snowflake authorization server. Typically used to prevent cross-site request forgery attacks. |
| `scope` | String | No | Space-delimited string that is used to limit the scope of the access request. For more information, refer to Scope (in this topic). |
| `code_challenge` | String | No | Challenge for Proof Key for Code Exchange (PKCE). String generated via a secret and a code challenge method. For more information, refer to Proof key for code exchange (in this topic). |
| `code_challenge_method` | String | No | String indicating the method used to derive the code challenge for PKCE. For more information, refer to Proof key for code exchange (in this topic). |

When a user authorizes the client, a redirect is made to the `redirect_uri` that contains the following in a GET request:

> | Query Parameter | Description |
> | --- | --- |
> | `code` | Short-lived authorization code, which can be exchanged at the token endpoint for an access token. |
> | `state` | `state` value provided in the original request, unmodified. |
> | `scope` | Scope of the access request; currently the same as the `scope` value in the initial authorization request, but might differ in the future. For more information, see Scope (in this topic). |

##### Scope

The `scope` query parameter in the initial authorization request optionally limits the operations and role permitted by the access token.

Scope is validated immediately when making an authorization request with respect to semantics, but not necessarily validity. That is, any
invalid scopes (e.g. “bogus_scope”) are rejected before the user authenticates, but a scope the user does not have access to (a
particular role, etc.) does not result in an error until after the user authenticates.

The following are the possible values of the `scope` query parameter:

| Scope Value | Required? | Description |
| --- | --- | --- |
| `refresh_token` | No | If included in the authorization URL, Snowflake presents the user with the option to consent to offline access. In this context, offline access refers to allowing the client to refresh access tokens when the user is not present. With user consent, the authorization server returns a refresh token in addition to an access token when redeeming the authorization code. |
| `session:role:role_name` | No | Used to limit the access token to a single role that the user can consent to for the session. Only one session role scope can be specified. If this scope is omitted, then the default role for the user is used instead. When a user authorizes consent, Snowflake always displays the role for the session regardless if this scope is included in the authorization URL.  Note that `role_name` is case-sensitive and must be input in all uppercase unless the role name was enclosed in quotes when it was created using [CREATE ROLE](../sql-reference/sql/create-role.md). To verify the case, execute [SHOW ROLES](../sql-reference/sql/show-roles.md) in Snowflake and see the role name in the output.  If the role name contains characters that are reserved in a query parameter URL, you must use a `session:role-encoded:role_name` syntax, where `role_name` is a URL-encoded string. For example, if the role name is `AUTH SNOWFLAKE` (with a space), then the value of the `scope` query parameter must be `session:role-encoded:AUTH%20SNOWFLAKE`. |

The following example limits authorization to the custom R1 role:

> ```bash
> scope=session:role:R1
> ```

The following example indicates that access/refresh tokens should use the default role for the user and requests a refresh token so that
offline access can occur:

> ```bash
> scope=refresh_token
> ```

The following example limits authorization to the custom R1 role and requests a refresh token so that offline access can occur:

> ```bash
> scope=refresh_token session:role:R1
> ```

### Token endpoint

This endpoint returns access tokens or refresh tokens depending on the request parameters. The token endpoint is as follows:

```bash
<snowflake_account_url>/oauth/token-request
```

Where:

> `snowflake_account_url`
> :   Specifies a valid [Snowflake account URL](organizations-connect.md). For example,
>     `https://myorg-account_xyz.snowflakecomputing.com/oauth/token-request`.

#### HTTP method

`POST`

Ensure that the content-type header in the POST request is set as follows:

```bash
Content-type: application/x-www-form-urlencoded
```

#### Request header

The client ID and client secret must be included in the authorization header. Currently, Snowflake only supports the
[Basic Authentication Scheme](https://tools.ietf.org/html/rfc2617), which means that the value expected is in the following form:

`Basic Base64(client_id:client_secret)`

Where:

| Header Value | Data Type | Required | Description |
| --- | --- | --- | --- |
| `client_id` | String | Yes | Client ID of the integration. |
| `client_secret` | String | Yes | Client secret for the integration. |

Both the client ID and client secret can be retrieved using the [SYSTEM$SHOW_OAUTH_CLIENT_SECRETS](../sql-reference/functions/system_show_oauth_client_secrets.md) function.

Note the `:` character between `client_id` and `client_secret`.

#### Request body

| Parameter | Data Type | Required | Description |
| --- | --- | --- | --- |
| `grant_type` | String | Yes | Type of grant requested: . `authorization_code` indicates that an authorization code should be exchanged for an access token. . `refresh_token` indicates a request to refresh an access token. |
| `code` | String | Yes | Authorization code returned from the token endpoint. Used and required when `grant_type` is set to `authorization_code`. |
| `refresh_token` | String | Yes | Refresh token returned from an earlier request to the token endpoint when redeeming the authorization code. Used and required when `grant_type` is set to `refresh_token`. |
| `redirect_uri` | String | Yes | Redirect URI as used in the authorization URL when requesting an authorization code. Used and required when `grant_type` is set to `authorization_code`. |
| `code_verifier` | String | No | Required only if the authorization request was sent to the Authorization Endpoint with a `code_challenge` parameter value. Code verifier for PKCE. For more information, see Proof key for code exchange (in this topic). |

#### Response

A JSON object is returned with the following fields:

| Field | Data Type | Description |
| --- | --- | --- |
| `access_token` | String | Access token used to establish a Snowflake session |
| `refresh_token` | String | Refresh token. Not issued if the client is configured to not issue refresh tokens or if the user did not consent to the `refresh_token` scope. |
| `expires_in` | Integer | Number of seconds remaining until the token expires |
| `token_type` | String | Access token type. Currently, always `Bearer`. |
| `username` | String | Username that the access token belongs to. Currently only returned when exchanging an authorization code for an access token. |

##### Successful response example

The following example shows a successful response when exchanging an authorization code for an access and refresh token:

```sqljson
{
  "access_token":  "ACCESS_TOKEN",
  "expires_in": 600,
  "refresh_token": "REFRESH_TOKEN",
  "token_type": "Bearer",
  "username": "user1",
}
```

##### Unsuccessful response example

The following example shows an unsuccessful response:

```sqljson
{
  "data" : null,
  "message" : "This is an invalid client.",
  "code" : null,
  "success" : false,
  "error" : "invalid_client"
}
```

The `message` string value is a description of the error and `error` is the error type. For more information on the types of
errors returned, see [OAuth Error Codes](oauth-snowflake-overview.md).

### Token exchange

This endpoint returns an OAuth access token in exchange for a JSON Web Token (JWT). For an example, see [Tutorial 1 (step 5)](../developer-guide/snowpark-container-services/tutorials/tutorial-1.md). In the tutorial you send a request to this endpoint to exchange a JWT token for an OAuth token and use the OAuth token to access a public endpoint exposed by a Snowpark Container Services service.

The token endpoint is as follows:

```bash
<snowflake_account_url>/oauth/token
```

Where:

> `snowflake_account_url`
> :   Specifies a valid [Snowflake account URL](organizations-connect.md). For example,
>     `https://myorg-account_xyz.snowflakecomputing.com/oauth/token`.

#### HTTP method

`POST`

Ensure that the content-type header in the POST request is set as follows:

```bash
Content-type: application/x-www-form-urlencoded
```

#### Request body

| Parameter | Data Type | Required | Description |
| --- | --- | --- | --- |
| `grant_type` | String | Yes | Pass this as string `urn:ietf:params:oauth:grant-type:jwt-bearer` . |
| `scope` | String | Yes | Pass this as string `session:role:role_name <ingress-endpoint-url>`. Note that the `role_name` is case sensitive. Use the [SHOW ENDPOINTS IN SERVICE](../sql-reference/sql/show-endpoints.md) command to find the ingress endpoint URL. . |
| `assertion` | String | Yes | Pass the JWT token. |

For example,

```sqljson
{
    'grant_type': 'urn:ietf:params:oauth:grant-type:jwt-bearer',
    'scope': 'session:role:TEST_ROLE ab12-orgname-acctname.snowflakecomputing.app',
    'assertion': '<token>'
}
```

When specifying `scope`, the `session:role:role_name` is optional. If not provided, the default role of the user is used.

```sqljson
{
    'grant_type': 'urn:ietf:params:oauth:grant-type:jwt-bearer',
    'scope': 'ab12-orgname-acctname.snowflakecomputing.app',
    'assertion': '<token>'
}
```

#### Response

An OAuth access token is returned

## Proof key for code exchange

Snowflake supports Proof Key for Code Exchange (PKCE) for obtaining access tokens using the `authorization_code` grant type as
described in [RFC 7636](https://tools.ietf.org/html/rfc7636). PKCE can be used to lessen the possibility of an authorization code
interception attack, and is suitable for clients that might not be able to fully keep the client secret secure.

By default, PKCE is optional and is enforced only if the `code_challenge` and `code_challenge_method` parameters are both
included in the authorization endpoint URL. However, Snowflake highly recommends that your client require PKCE for all authorizations to
make the OAuth flow more secure.

The following describes how PKCE for Snowflake works:

1. The client creates a secret called the *code verifier* and performs a transformation on it to generate the *code challenge*. The client
   holds onto the secret.

   > **Important:**
   >
   > Generate the *code verifier* from the allowed ASCII characters according to
   > [Section 4.1 of RFC 7636](https://tools.ietf.org/html/rfc7636#section-4.1).
2. The client directing the user to the Authorization URL appends the following two query parameters:

   `code_challenge`
   :   Specifies the code challenge generated in Step 1.

   `code_challenge_method`
   :   Specifies the transformations used on the code verifier in Step 1 to generate the code challenge. Currently, Snowflake only supports
       SHA256, so this value must be set to `S256`. The transformation algorithm for SHA256 is
       `BASE64URL-ENCODE(SHA256(ASCII(code_verifier)))`.
3. After the user consents to the requested scopes or Snowflake determines that consent is present for that user, the authorization code
   is issued.
4. The client receives the authorization code from the Snowflake authorization server, which it then submits along with the
   `code_verifier` in the request to the token endpoint.
5. Snowflake transforms the `code_verifier` value and verifies that the transformed value matches the `code_challenge` value
   used when generating authorizations. If these values match, then the authorization server issues the access and refresh tokens.

## Using key-pair authentication

Snowflake supports using key pair authentication rather than the typical username/password authentication when calling the OAuth token
endpoint. This authentication method requires a 2048-bit (minimum) RSA key pair. Generate the PEM (Privacy Enhanced Mail) public-private
key pair using OpenSSL. The public key is assigned to the Snowflake user who uses the Snowflake client.

To configure the public/private key pair:

1. From the command line in a terminal window, generate an encrypted private key:

   > ```bash
   > $ openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 des3 -inform PEM -out rsa_key.p8
   > ```

   OpenSSL prompts for a passphrase used to encrypt the private key file. Snowflake recommends using a strong passphrase to protect the private
   key. Record this passphrase. You must input it when connecting to Snowflake. Note that the passphrase is only used for protecting
   the private key and is never sent to Snowflake.

   **Sample PEM private key**

   > ```bash
   > -----BEGIN ENCRYPTED PRIVATE KEY-----
   > MIIE6TAbBgkqhkiG9w0BBQMwDgQILYPyCppzOwECAggABIIEyLiGSpeeGSe3xHP1
   > wHLjfCYycUPennlX2bd8yX8xOxGSGfvB+99+PmSlex0FmY9ov1J8H1H9Y3lMWXbL
   > ...
   > -----END ENCRYPTED PRIVATE KEY-----
   > ```
2. From the command line, generate the public key by referencing the private key:

   > ```bash
   > $ openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
   > ```
   >
   > **Sample PEM public key**
   >
   > > ```bash
   > > -----BEGIN PUBLIC KEY-----
   > > MIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAy+Fw2qv4Roud3l6tjPH4
   > > zxybHjmZ5rhtCz9jppCV8UTWvEXxa88IGRIHbJ/PwKW/mR8LXdfI7l/9vCMXX4mk
   > > ...
   > > -----END PUBLIC KEY-----
   > > ```
3. Copy the public and private key files to a local directory for storage. Record the path to the files.

   Note that the private key is stored using the PKCS#8 (Public Key Cryptography Standards) format and is encrypted using the passphrase
   you specified in the previous step; however, the file should still be protected from unauthorized access using the file permission
   mechanism provided by your operating system. It is your responsibility to secure the file when it is not being used.
4. Assign the public key to the integration object using [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-snowflake.md). For example:

   > ```sqlexample
   > ALTER SECURITY INTEGRATION myint SET OAUTH_CLIENT_RSA_PUBLIC_KEY='MIIBIjANBgkqh...';
   > ```

   > **Note:**
   > * Only account administrators can execute the ALTER SECURITY INTEGRATION command.
   > * Exclude the public key header and footer in the command.

   Verify the public key fingerprint using [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md):

   ```sqlexample
   DESC SECURITY INTEGRATION myint;

   +----------------------------------+---------------+----------------------------------------------------------------------+------------------+
   | property                         | property_type | property_value                                                       | property_default |
   |----------------------------------+---------------+----------------------------------------------------------------------+------------------|
   ...
   | OAUTH_CLIENT_RSA_PUBLIC_KEY_FP   | String        | SHA256:MRItnbO/123abc/abcdefghijklmn12345678901234=                  |                  |
   | OAUTH_CLIENT_RSA_PUBLIC_KEY_2_FP | String        |                                                                      |                  |
   ...
   +----------------------------------+---------------+----------------------------------------------------------------------+------------------+
   ```

   > **Note:**
   >
   > The `OAUTH_CLIENT_RSA_PUBLIC_KEY_2_FP` property is described in Key Rotation (in this topic).
5. Modify and execute the sample code below. The code uses the private key to encode a JWT and then passes that token to the Snowflake
   authorization server:

   * Update the security parameters:

     + `<private_key>`: Contents of the decrypted `rsa_key.p8` (including BEGIN and END), which you can obtain by executing `openssl rsa -in rsa_key.p8 -text`.
   * Update the session parameters:

     + `<account_locator>`: Your account locator code, for example `CIB07125`.
       You cannot use the account name (for example, `myorg-account_xyz`).
   * Update the token-request endpoint in the `_make_request()` function:

     + `<account_name>`: This is the account name format of your account’s identifier (for example, `myorg-account_xyz`).
   * Update the public key fingerprint:

     + `<public_key_fp>`: Retrieved by executing the DESC SECURITY INTEGRATION command.
       There can be 2 public keys so ensure you’re referencing the correct key.
   * Update the redirect URI:

     + `<redirect_uri>`: The redirect URI your integration is configured with. Execute the DESC SECURITY INTEGRATION command to obtain it.
   * Obtain an OAuth authorization code:

     + `<oauth_az_code>`: Obtained after authenticating with your `/authorize` endpoint.
       Note: this code needs to be refreshed periodically.
   * Update the JSON Web Token (JWT) fields:

     post body
     :   A JSON object with the following standard fields (“claims”):

     | Attribute | Data Type | Required | Description |
     | --- | --- | --- | --- |
     | `iss` | String | Yes | Specifies the principal that issued the JWT in the format `client_id.public_key_fp` where `client_id` is the client ID of the OAuth client integration and `public_key_fp` is the fingerprint of the public key that is used during verification. |
     | `sub` | String | Yes | Subject of the JWT in the format `account_locator.client_id` where |
     | `account_locator` is your Snowflake account locator and `client_id` is the client ID of the OAuth client integration. Depending on the cloud platform (AWS or Azure) and region where your account is hosted, the full account name might require additional segments. For more information, see the `account` variable description under Token endpoint. |  |  |  |
     | `iat` | Timestamp | No | Time when the token was issued. |
     | `exp` | Timestamp | Yes | Time when the token should expire. This period should be relatively short (e.g. a few minutes). |

   **Sample code**

   > Note that the `private_key` value (decrypted) includes the `-----BEGIN` header and the `-----END` footer.
   >
   > ```python
   > import datetime
   > import json
   > import urllib
   >
   > import jwt
   > import requests
   >
   > private_key = """
   > <private_key>
   > """
   >
   > public_key_fp = "<public_key_fp>" # SHA256:MR...
   >
   >
   > def _make_request(payload, encoded_jwt_token):
   >     token_url = "https://<account_name>.snowflakecomputing.com/oauth/token-request"
   >     headers = {
   >             u'Authorization': "Bearer %s" % (encoded_jwt_token),
   >             u'content-type': u'application/x-www-form-urlencoded'
   >     }
   >     r = requests.post(
   >             token_url,
   >             headers=headers,
   >             data=urllib.urlencode(payload))
   >     return r.json()
   >
   >
   > def make_request_for_access_token(oauth_az_code, encoded_jwt_token):
   >     """ Given an Authorization Code, make a request for an Access Token
   >     and a Refresh Token."""
   >     payload = {
   >         'grant_type': 'authorization_code',
   >         'code': oauth_az_code,
   >         'redirect_uri': <redirect_uri>
   >     }
   >     return _make_request(payload, encoded_jwt_token)
   >
   >
   > def make_request_for_refresh_token(refresh_token, encoded_jwt_token):
   >     """ Given a Refresh Token, make a request for another Access Token."""
   >     payload = {
   >         'grant_type': 'refresh_token',
   >         'refresh_token': refresh_token
   >     }
   >     return _make_request(payload, encoded_jwt_token)
   >
   >
   > def main():
   >     account_locator = "<account_locator>"
   >     client_id = "1234"  # found by running DESC SECURITY INTEGRATION
   >     issuer = "{}.{}".format(client_id, public_key_fp)
   >     subject = "{}.{}".format(account_locator, client_id)
   >     payload = {
   >         'iss': issuer,
   >         'sub': subject,
   >         'iat': datetime.datetime.utcnow(),
   >         'exp': datetime.datetime.utcnow() + datetime.timedelta(seconds=30)
   >     }
   >     encoded_jwt_token = jwt.encode(
   >             payload,
   >             private_key,
   >             algorithm='RS256')
   >
   >     data = make_request_for_access_token(<oauth_az_code>, encoded_jwt_token)
   >     refresh_token = data['refresh_token']
   >     data = make_request_for_refresh_token(refresh_token, encoded_jwt_token)
   >     access_token = data['access_token']
   >
   >
   > if __name__ == '__main__':
   >     main()
   > ```

   After the token is created, submit it in requests to the token endpoint. Requests require the Bearer authorization format as the
   authorization header instead of the basic authorization format normally used for the client ID and client secret, as follows:

   ```bash
   "Authorization: Bearer JWT_TOKEN"
   ```

### Key rotation

Snowflake supports multiple active keys to allow for uninterrupted rotation. Rotate and replace your public and private keys based on the
expiration schedule you follow internally.

Currently, you can use the `OAUTH_CLIENT_RSA_PUBLIC_KEY` and `OAUTH_CLIENT_RSA_PUBLIC_KEY_2` parameters for
[ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-snowflake.md) to associate up to 2 public keys with a single user.

To rotate your keys:

1. Complete the steps in Using key-pair authentication (in this topic):

   * Generate a new private and public key set.
   * Assign the public key to the integration. Set the public key value to either `OAUTH_CLIENT_RSA_PUBLIC_KEY` or
     `OAUTH_CLIENT_RSA_PUBLIC_KEY_2` (whichever key value is not currently in use). For example:

     > ```sqlexample
     > alter integration myint set oauth_client_rsa_public_key_2='JERUEHtcve...';
     > ```
2. Update the code to connect to Snowflake. Specify the new private key.

   Snowflake verifies the correct active public key for authentication based on the submitted private key.
3. Remove the old public key from the integration. For example:

   ```sqlexample
   alter integration myint unset oauth_client_rsa_public_key;
   ```

## Error codes

See the [Error codes](oauth-snowflake-overview.md) for a list of error codes associated with OAuth, as well as errors that are returned in the JSON
blob, during the authorization flow, token request or exchange, or when creating a Snowflake session after completing the OAuth flow.

## Pre-authorizing user consent for a role

Security administrators (i.e. users with the SECURITYADMIN role) or higher can pre-authorize consent for a client to initiate a session for
a user using a specified role and integration. This consent is granted using [ALTER USER](../sql-reference/sql/alter-user.md) with the ADD DELEGATED
AUTHORIZATION keywords. Without this delegated authorization, a user must authorize consent for the role after authentication. This
delegated authorization can also be revoked.

For more information, see [Managing user consent for OAuth](oauth-consent.md).

---
title: Configure Snowflake OAuth for partner applications
source: https://docs.snowflake.com/en/user-guide/oauth-partner.md
section: User Guide
---

# Configure Snowflake OAuth for partner applications

This topic explains how to configure Snowflake OAuth access to Snowflake for supported Snowflake partner applications. This process
requires creating an integration, a first-class Snowflake object that defines the interface between Snowflake and a third-party application
or service.

> **Important:**
>
> When connecting to Snowflake using any third-party application, Snowflake recommends that you verify that the integration flow used by
> the application meets your internal security requirements. You can contact the partner directly for details on their end-to-end flow used
> for this feature.

> **Note:**
>
> In-session role switching to secondary roles is not supported with Snowflake OAuth.
>
> If this behavior is necessary with your OAuth workflow, use External OAuth instead.
>
> For more information, see [Using secondary roles with External OAuth](oauth-ext-overview.md).

Currently, Snowflake OAuth supports the following applications:

| Client | Required Client Version | Client Type |
| --- | --- | --- |
| [Tableau Desktop / Cloud](https://www.tableau.com/) [1] | 2019.1 or higher | Public |
| [Looker](https://looker.com) [2] | 6.20 or higher |  |
| [Alation](https://www.alation.com/) | See the Alation documentation |  |
| [ThoughtSpot](https://thoughtspot.com) | See the ThoughtSpot documentation |  |
| [Collibra](https://www.collibra.com) | See the Collibra documentation |  |

[1]

If Tableau Cloud (version 2024.2 or higher) is connecting to Snowflake using private connectivity to the Snowflake service, you need
to use a custom security integration rather than the integration designed for a partner application. The redirect URL of the custom
integration must use the following URL form: `https://<your_server_url>/auth/add_oauth_token`. For instructions, see
[Configure Snowflake OAuth for custom clients](oauth-custom.md). Tableau Desktop, Tableau Online, and Tableau Server (version 2024.2 or lower) continue to use the partner
application integration regardless of whether it uses a private URL.

[2]

Looker supports OAuth only when Looker-hosted instances can access the public Internet. Note that this limitation does not
affect customer-hosted Looker implementations (i.e. on-premises implementations). Customers using
private connectivity to the Snowflake service might experience issues if attempting to use OAuth and Looker with Snowflake. Please contact Looker for questions or more details.

## Configuring a Snowflake OAuth integration

Create an integration using the [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-snowflake.md) command. An integration is a Snowflake object that
provides an interface between Snowflake and third-party services, such as a client that supports Snowflake OAuth.

> **Note:**
>
> Only account administrators (i.e users with the ACCOUNTADMIN system role) or a role with the global CREATE INTEGRATION privilege can
> execute this SQL command.

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [ IF NOT EXISTS ]
  <name>
  TYPE = OAUTH
  ENABLED = { TRUE | FALSE }
  OAUTH_CLIENT = <partner_application>
  oauthClientParams
  [ COMMENT = '<string_literal>' ]
```

Where:

> **oauthClientParams**
>
> > ```sqlsyntax
> > oauthClientParams ::=
> >   [ OAUTH_ISSUE_REFRESH_TOKENS = TRUE | FALSE ]
> >   [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
> >   [ BLOCKED_ROLES_LIST = ('<role_name>', '<role_name>') ]
> > ```

### Blocking specific roles from using the integration

The optional BLOCKED_ROLES_LIST parameter allows you to list Snowflake roles that a user cannot explicitly consent to using with
the integration.

By default, the ACCOUNTADMIN, SECURITYADMIN, GLOBALORGADMIN, and ORGADMIN roles are included in this list and cannot be removed. If you have a business
need to allow users to use Snowflake OAuth with these roles, and your security team allows it, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request that these roles be allowed for your account.

### Controlling the login frequency

When a user has authenticated successfully, the partner application can use the issued refresh token to request new, short-lived access
tokens, and not prompt the user to repeat the login process until the refresh token expires. The optional OAUTH_REFRESH_TOKEN_VALIDITY
parameter specifies the length of time a refresh token is valid (in seconds). This setting can be used to expire the refresh token
periodically, forcing the user to repeat the login process.

The supported minimum, maximum, and default values for the OAUTH_REFRESH_TOKEN_VALIDITY parameter are as follows:

| Application | Minimum | Maximum | Default |
| --- | --- | --- | --- |
| Tableau Desktop | `60` (1 minute) | `36000` (10 hours) | `36000` (10 hours) |
| Tableau Cloud | `60` (1 minute) | `7776000` (90 days) | `7776000` (90 days) |

If you have a business need to lower the minimum value or raise the maximum value, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request the change for
your account.

### Using Client Redirect with Snowflake OAuth for partner applications

Snowflake supports using Client Redirect with Snowflake OAuth for Partner Applications, including using Client Redirect and Snowflake OAuth
with supported Snowflake Clients.

For more information, see [Redirecting client connections](client-redirect.md).

### Managing network policies

Snowflake supports network policies for Looker, but not other partner applications.
For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Examples

**Tableau Desktop**

> The following example creates a Snowflake OAuth integration with the default settings:
>
> ```sqlexample
> CREATE SECURITY INTEGRATION td_oauth_int1
>   TYPE = OAUTH
>   ENABLED = TRUE
>   OAUTH_CLIENT = TABLEAU_DESKTOP;
> ```
>
> View the integration settings using [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md):
>
> ```sqlexample
> DESC SECURITY INTEGRATION td_oauth_int1;
> ```
>
> The following example creates a Snowflake OAuth integration with refresh tokens that expire after 10 hours (36000 seconds). The
> integration blocks users from starting a session with SYSADMIN as the active role:
>
> ```sqlexample
> CREATE SECURITY INTEGRATION td_oauth_int2
>   TYPE = OAUTH
>   ENABLED = TRUE
>   OAUTH_REFRESH_TOKEN_VALIDITY = 36000
>   BLOCKED_ROLES_LIST = ('SYSADMIN')
>   OAUTH_CLIENT = TABLEAU_DESKTOP;
> ```

**Tableau Cloud**

> The following example creates a Snowflake OAuth integration with the default settings:
>
> ```sqlexample
> CREATE SECURITY INTEGRATION ts_oauth_int1
>   TYPE = OAUTH
>   ENABLED = TRUE
>   OAUTH_CLIENT = TABLEAU_SERVER;
> ```
>
> View the integration settings using [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md):
>
> ```sqlexample
> DESC SECURITY INTEGRATION ts_oauth_int1;
> ```
>
> The following example creates a Snowflake OAuth integration with refresh tokens that expire after 1 day (86400 seconds). The integration
> blocks users from starting a session with SYSADMIN as the active role:
>
> ```sqlexample
> CREATE SECURITY INTEGRATION ts_oauth_int2
>   TYPE = OAUTH
>   ENABLED = TRUE
>   OAUTH_CLIENT = TABLEAU_SERVER
>   OAUTH_REFRESH_TOKEN_VALIDITY = 86400
>   BLOCKED_ROLES_LIST = ('SYSADMIN');
> ```

## Logging in to Snowflake from a partner application

### Tableau

Follow the [instructions](https://onlinehelp.tableau.com/current/pro/desktop/en-us/examples_snowflake.htm) provided by Tableau to connect
to Snowflake using Snowflake OAuth.

### Looker

Follow the [steps](https://docs.looker.com/setup-and-management/database-config/snowflake#oauth) provided by Looker to connect to
Snowflake using Snowflake OAuth.

### Alation

Access the [Alation Community](https://community.alation.com/home) and follow the instructions provided by Alation to connect to
Snowflake using Snowflake OAuth.

### ThoughtSpot

Access the [ThoughtSpot documentation](https://docs.thoughtspot.com/software/latest/connections-snowflake) and follow
the instructions to create a connection to Snowflake, which includes a step on how to configure
[Snowflake OAuth](https://docs.thoughtspot.com/software/latest/connections-snowflake-oauth.html).

### Collibra

Access the [Collibra Documentation](https://productresources.collibra.com/docs/collibra/latest/Content/Edge/JDBCConnections/ta_create-jdbc-connection.htm?catalog-connector-details=snowflake-native) and follow the instructions provided by Collibra to connect to Snowflake using Snowflake OAuth.

## Managing user consent

This section describes how to manage delegated authorizations, i.e. user consent given to one or more clients associated with Snowflake
integrations.

### Display Snowflake OAuth consents

List the active delegated authorizations for which you have access privileges, using
[SHOW DELEGATED AUTHORIZATIONS](../sql-reference/sql/show-delegated-authorizations.md):

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS;

+-------------------------------+-----------+-----------+-------------------+--------------------+
| created_on                    | user_name | role_name | integration_name  | integration_status |
|-------------------------------+-----------+-----------+-------------------+--------------------|
| 2018-11-27 07:43:10.914 -0800 | JSMITH    | PUBLIC    | MY_OAUTH_INT      | ENABLED            |
+-------------------------------+-----------+-----------+-------------------+--------------------+
```

List the active delegated authorizations for a specified user. Users can list their own delegated authorizations; otherwise, this command
variant requires the OWNERSHIP privilege on the user.

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS
    BY USER <username>;
```

List the active delegated authorizations for a specified integration. This command variant requires the OWNERSHIP privilege on the
integration (i.e. the ACCOUNTADMIN role):

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS
    TO SECURITY INTEGRATION <integration_name>;
```

### Revoke consent

A user can revoke consent from a specified integration. This has the effect of revoking any access token associated with the integration.

To revoke user consent for a given integration, execute the [ALTER USER](../sql-reference/sql/alter-user.md) … REMOVE DELEGATED AUTHORIZATIONS command.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) or higher can execute this SQL command.

```sqlsyntax
ALTER USER <username> REMOVE DELEGATED AUTHORIZATIONS
    FROM SECURITY INTEGRATION <integration_name>
```

Where:

`username`
:   Specifies the user whose consent you are revoking.

`integration_name`
:   Specifies the integration associated with the access tokens for a specific client.

To revoke user consent associated with a specific role, include the `OF ROLE role_name` parameter in the statement:

```sqlsyntax
ALTER USER <username> REMOVE DELEGATED AUTHORIZATION
    OF ROLE <role_name>
    FROM SECURITY INTEGRATION <integration_name>
```

Where:

`role_name`
:   Specifies the role associated with the access token.

Any access tokens associated with the role are revoked.

## Error codes

See the [Error codes](oauth-snowflake-overview.md) for a list of error codes associated with OAuth, as well as errors that are returned in the JSON
blob, during the authorization flow, token request or exchange, or when creating a Snowflake session after completing the OAuth flow.

---
title: Configure Snowflake Open Catalog to use SSO
source: https://docs.snowflake.com/en/user-guide/opencatalog/sso-configure-open-catalog.md
section: User Guide
---

# Configure Snowflake Open Catalog to use SSO

This topic shows you how to configure Snowflake Open Catalog to use SAML-based SSO.

Before you configure Snowflake Open Catalog to use SSO, you must configure your IdP for Open Catalog. For instructions, see the following
topics:

* [Configure Okta as the IdP for Open Catalog](sso-configure-idp.md)
* [Configure Auth0 as the Idp for Open Catalog](sso-configure-idp.md)

## Before you begin

To set up Snowflake Open Catalog for SSO, you need your full Open Catalog account identifier, which includes your Snowflake
organization name and your Open Catalog account name; for example: `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

## Create a Snowflake CLI connection for Open Catalog

To configure Snowflake Open Catalog to use SSO, you need a Snowflake CLI connection for Open Catalog. Follow these steps to create this
connection. If you don’t already have Snowflake CLI installed, see [Installing Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation).

> **Important:**
>
> To create this connection, you must be an Open Catalog user with service
> admin privileges. For information about service admin privileges, see [Service admin role](access-control.md).

### Add a Snowflake CLI connection for Snowflake Open Catalog

Add a connection for the Snowflake Open Catalog account where you want to enable SSO.

* [Add a connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md)
  with the following values. For all other parameters, press `Enter` to skip specifying a value for the parameter.

  | Connection configuration parameters | Value |
  | --- | --- |
  | **Name for this connection** | Specify a name for the connection; for example, `myopencatalogconnection`. |
  | **Account name** | Specify your Snowflake organization name, followed by your Open Catalog account name, in this format:  `<orgname>-<my-snowflake-open-catalog-account-name>`.  For example, `ABCDEFG-MYACCOUNT1`.  To find these names, see Before you begin. |
  | **Username** | Specify your username for Open Catalog; for example, `jsmith`. |
  | **Password [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  Enter your password for Open Catalog; for example, `MyPassword123456789`. |
  | **Role for the connection [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  You must enter `POLARIS_ACCOUNT_ADMIN` |

### Test the Snowflake CLI connection

* To test your CLI connection, follow this example, which tests the connection for `myopencatalogconnection`:

  ```snowcli
  snow connection test -c myopencatalogconnection
  ```

  The response should look like this:

  ```snowcli
  +------------------------------------------------------------------------------+
  | key              | value                                                     |
  |----------------------------+-------------------------------------------------|
  | Connection name  | myopencatalogconnection                                   |
  | Status           | OK                                                        |
  | Host             | ABCDEFG-MYACCOUNT1.snowflakecomputing.com                 |
  | Account          | ABCDEFG-MYACCOUNT1                                        |
  | User             | jsmith                                                    |
  | Role             | POLARIS_ACCOUNT_ADMIN                                     |
  | Database         | not set                                                   |
  | Warehouse        | not set                                                   |
  +------------------------------------------------------------------------------+
  ```

### Set your Snowflake CLI connection for Snowflake Open Catalog as the default

To ensure that the connection you’re using always has the required POLARIS_ACCOUNT_ADMIN role granted to it, you can set the Snowflake CLI
connection you created for Open Catalog as the default connection. For more information about the default connection, see
[Set the default connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. Follow this example, which sets the `myopencatalogconnection` connection as the default:

   ```snowcli
   snow connection set-default myopencatalogconnection
   ```
2. To confirm that you’re using the correct user and role, run the following:

   ```snowcli
   snow sql -q "Select current_user(); select current_role();"
   ```

   The response should return your Open Catalog username and the CURRENT
   ROLE should be POLARIS_ACCOUNT_ADMIN.

   ```snowcli
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | JSMITH        |
   +----------------+
   select current_role();
   +-----------------------+
   | CURRENT_ROLE()        |
   |-----------------------|
   | POLARIS_ACCOUNT_ADMIN |
   +-----------------------+
   ```

## Create a security integration

To create a security integration, run the CREATE SECURITY INTEGRATION command by using a Snowflake CLI connection. You can create an
Auth0 security integration or an Okta security integration.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

### Auth0 security integration

* To create a SAML security integration for Auth0, run the following command in Snowflake CLI:

  ```snowcli
  snow sql -q “create security integration <Name>
      type = saml2
      enabled = true
      saml2_issuer = 'urn:<Auth0 Domain>'
      saml2_sso_url = '<SAML Protocol URL>'
      saml2_provider = 'Custom'
      saml2_x509_cert='<Certificate from Auth0>'
      saml2_sp_initiated_login_page_label = 'Auth0'
      saml2_enable_sp_initiated = true
      saml2_snowflake_acs_url = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login'
      saml2_snowflake_issuer_url = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com'
      saml2_requested_nameid_format = 'urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress';”
  ```

  Where:

  + `<Name>` specifies the identifier for the security integration; must be unique for your account.
  + `<Auth0 Domain>` is copied in the Auth0 console. To find this value, in Auth0, navigate to Applications > Applications > Snowflake Open Catalog application > Settings >
    Basic Information: **Domain** field.
  + `<SAML Protocol URL>` is copied in the Auth0 console. To find this value, in Auth0, navigate to Applications > Applications > Snowflake Open Catalog application
    > Settings > Advanced settings > Endpoints tab: **SAML Protocol URL** field.
  + `<Certificate from Auth0>` is copied in the Auth0 console. To find this value, in Auth0, navigate to: Applications > Applications > Snowflake Open Catalog application
    > Settings > Advanced Settings > Certificate tab: **Signing Certificates** field. Copy the value between <BEGIN CERTIFICATE> and <END CERTIFICATE>.
  + `<orgname>` is the name of your Snowflake organization. To find this name, see Before you begin.
  + `<my-snowflake-open-catalog-account-name>` is the name of your Snowflake Open Catalog account. To find this name, see
    Before you begin.

### Okta security integration

* To create a SAML security integration for Okta, run the following
  command in Snowflake CLI:

  ```snowcli
  snow sql -q “CREATE SECURITY INTEGRATION <Name>
      TYPE = SAML2
      ENABLED = TRUE
      SAML2_ISSUER = '<ENTITY ID>'
      SAML2_SSO_URL = '<IDP SSO URL>'
      SAML2_PROVIDER = 'OKTA'
      SAML2_X509_CERT='<Authentication Certificate>'
      SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'OKTA SSO'
      SAML2_ENABLE_SP_INITIATED = TRUE
      SAML2_SNOWFLAKE_ACS_URL = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/fed/login'
      SAML2_SNOWFLAKE_ISSUER_URL = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com';”
  ```

  Where:

  + `<Name>` specifies the identifier for the security integration; must be unique for your account.
  + `<ENTITY ID>` is the Entity ID value you copied when you [created an application in Okta](sso-configure-idp.md).
  + `<IDP SSO URL>` is the IDP SSO URL value you copied when you created an application in Okta.
  + `<Authentication Certificate>` is the IDP Authentication Certificate value you copied when you created an application in Okta.
  + `<orgname>` is the name of your Snowflake organization. To find this name, see Before you begin.
  + `<my-snowflake-open-catalog-account-name>` is the name of your Snowflake Open Catalog account. To find this name, see
    Before you begin.

## Verify the security integration

You can only use one security integration at a time, and the integration you want to use must be enabled.

> **Note:**
>
> If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
> following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN`.

1. To verify that the security integration you want to use is enabled, run the following command:

   ```snowcli
   snow sql -q "desc security integration <saml2-security-integration-name>;"
   ```

   If the response contains SAML2_ENABLE_SP_INITIATED=true, the SAML2 security integration is enabled.
2. Optional: If the response contains SAML2_ENABLE_SP_INITIATED=false, to enable it, run the following command:

   ```snowcli
   snow sql -q “ALTER SECURITY INTEGRATION <saml-security-integration-name> SET ENABLED = TRUE;”
   ```

## Create a user in the Open Catalog account

For SSO to work for a user, you must create an Open Catalog user that corresponds to the user you created in your IdP.

> **Important:**
>
> To create a user, you must use Snowflake CLI.
>
> If you create a user by using the Open Catalog UI, you must specify a password,
> which would allow the user to sign in through SSO or by using Open Catalog credentials.

* To create a user, run the following command:

  ```snowcli
  snow sql -q "CREATE USER \"<login-name>\" EMAIL='<email>';"
  ```

  Where:

  + `<login-name>` must match one of the following:

    - The **Email** that you specified for the user in Auth0.
    - The **Username** that you specified for the user in Okta.
  + `<email>` is the user’s email address. If you’re using Auth0, this value will match <login-name>.

  For example:

  ```snowcli
  snow sql -q "CREATE USER \"testuser123@example.com\" EMAIL='testuser123@example.com';"
  ```
* To confirm that you set up the users correctly, run the following command:

  ```snowcli
  snow sql -q "show users;"
  ```

  In the response, the value in the LOGIN_NAME column must match the **Email** in Auth0 or **Username** in Okta.

---
title: Configuring a client, driver, library, or third-party application to connect to Snowflake
source: https://docs.snowflake.com/en/user-guide/gen-conn-config.md
section: User Guide
---

# Configuring a client, driver, library, or third-party application to connect to Snowflake

To configure a client, driver, library, or third-party application to connect to Snowflake, you must specify your Snowflake
account identifier. In addition, you might need to specify the warehouse, database, schema, and role that should be used.

You can find this information in Snowsight or by executing SQL commands:

* Using Snowsight to get connection settings
* Using SQL commands to get connection settings

## Using Snowsight to get connection settings

To get the settings that you can use to configure a client, driver, library, or third-party application:

1. [Sign in](ui-snowsight-gs.md) to Snowsight.
2. Open the user menu by selecting your user name.
3. From the user menu, select Connect a tool to Snowflake to display the Account Details dialog.

   > **Tip:**
   >
   > You can also display the account details from the [account selector](ui-snowsight-gs.md).
4. Select one of the following tabs:

   * If your client, driver, library, or third-party application supports using a TOML configuration file (for example,
     [Snowflake CLI](../developer-guide/snowflake-cli/connecting/configure-cli.md),
     [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-connecting-snowflake.md), or the
     [Snowflake Connector for Python](../developer-guide/python-connector/python-connector-connect.md):

     1. Select the Config file tab.
     2. To specify a warehouse in the configuration file, select the warehouse from the Warehouse menu.
     3. To specify a database and schema in the configuration file, use the Database menu to select the database
        and schema.
     4. From the Connection Method menu, select the method that you plan to use to authenticate:

        + To use [browser-based single sign-on (SSO)](admin-security-fed-auth-use.md), select Web Browser.
        + To use a password, select Password.
        > **Note:**
        >
        > Clients, drivers, libraries, and third-party applications support additional authentication methods not listed in
        > the menu. For information, see [Securing Snowflake](../guides-overview-secure.md).
     5. Select the copy icon () to copy the content for the configuration file.
     > **Note:**
     >
     > For the [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-connecting-snowflake.md), underscores are not supported in the
     > `account` setting. If the account identifer includes underscores, replace them with dashes.
   * If your client, driver, library, or third-party application supports specifying a connection string (for example,
     the [ODBC Driver](../developer-guide/odbc/odbc-parameters.md), [JDBC Driver](../developer-guide/jdbc/jdbc-configure.md),
     [Go Snowflake Driver](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Connection_String), or
     [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/Connecting.md)):

     1. Select the Connectors/Drivers tab.
     2. From the Select Connector or Driver menu, select the driver that you want to use.
     3. To specify a warehouse in the connection string, select the warehouse from the Warehouse menu. (Note that this menu
        is not present for ODBC and .NET.)
     4. To specify a database and schema in the connection string, use the Database menu to select the database
        and schema.
     5. From the Connection Method menu, select the method that you plan to use to authenticate:

        + To use [browser-based single sign-on (SSO)](admin-security-fed-auth-use.md), select Web Browser.
        + To use a password, select Password.
        > **Note:**
        >
        > Clients, drivers, libraries, and third-party applications support additional authentication methods not listed in
        > the menu. For information, see [Securing Snowflake](../guides-overview-secure.md).
     6. Select the copy icon () to copy the resulting connection string.
   * To execute SQL commands to get the configuration information:

     1. Select the SQL Commands tab.
     2. Select the copy icon () next to the command that provides the information that you need, paste the
        command into a worksheet, and execute the command.

## Using SQL commands to get connection settings

You can execute SQL commands to get the following information needed to configure your client, driver, library, or application:

| Setting | SQL command |
| --- | --- |
| Account identifier for the current account | * To get the `organization_name-account_name` form of your account identifier:  ```sqlexample   SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();   ``` * To get the [account locator](admin-account-identifier.md) form of your account identifier:  ```sqlexample   SELECT CURRENT_ACCOUNT();   ``` |
| Current user name | ```sqlexample SELECT CURRENT_USER(); ``` |
| Current role | ```sqlexample SELECT CURRENT_ROLE(); ``` |
| Current region | ```sqlexample SELECT CURRENT_REGION(); ``` |
| Current warehouse | ```sqlexample SELECT CURRENT_WAREHOUSE(); ``` |
| Current database | ```sqlexample SELECT CURRENT_DATABASE(); ``` |
| Current schema | ```sqlexample SELECT CURRENT_SCHEMA(); ``` |

## Account formats used by clients and drivers

For different clients and drivers, you use different syntaxes for specifying your account.

In general, you should use the variation that includes the organization name (`orgname`) and account name
(`account_name`).

One exception to this rule is when you’re using the [Client Redirect](client-redirect.md) feature. If you’re
using this feature, replace the name of the account (`account_name`) with the name of the connection
(`connection_name`). For examples of this syntax, see [Using a connection URL](client-redirect.md).

To configure a private connection to the Snowflake service, add `.privatelink` to either the account name or the account
locator syntax. To determine which value you should use to connect to Snowflake when using private connectivity, call the
[SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function in your Snowflake account.

If you need to use the account locator, you might also need to specify the cloud region ID, the cloud, and the level of government
compliance as additional segments after the account locator. For the format to use, see [Format 2: Account locator in a region](admin-account-identifier.md). In the
examples below, `account_locator_with_additional_segments` represents the account location with any additional segments
that are required.

Snowflake CLI:
:   * Account name: `snow sql --account orgname-account_name`
    * Account locator: `snow sql --account account_locator_with_additional_segments`

    You can also specify this information in the `account` parameter for the connection in the Snowflake CLI `config.toml` configuration file.

    For additional information, see [Configuring Snowflake CLI and connecting to Snowflake](../developer-guide/snowflake-cli/connecting/connect.md).

SnowSQL:
:   * Account name: `snowsql -a orgname-account_name`
    * Account locator: `snowsql -a account_locator_with_additional_segments`

    For additional information, see [Connection syntax](snowsql-start.md).

JDBC:
:   * Account name: `jdbc:snowflake://orgname>-<account_name.snowflakecomputing.com/?connection_paramsr`
    * Account locator: `jdbc:snowflake://account_locator_with_additional_segments.snowflakecomputing.com/?connection_params`

    For additional information, see [JDBC Driver connection string](../developer-guide/jdbc/jdbc-configure.md).

ODBC:
:   * Account name:

      + Server: `orgname-account_name.snowflakecomputing.com`
    * Account locator:

      + Server: `account_locator_with_additional_segments.snowflakecomputing.com}`

    For additional information, see [ODBC configuration and connection parameters](../developer-guide/odbc/odbc-parameters.md).

Python:
:   * Account name:

      + Set the `ACCOUNT` parameter value as `orgname-account_name`.
    * Account locator:

      + Set the `ACCOUNT` parameter value as `account_locator_with_additional_segments`.

    For additional information, see [Connecting to Snowflake with the Python Connector](../developer-guide/python-connector/python-connector-connect.md).

.Net:
:   * Account name:

      + Set the `ACCOUNT` parameter value as `orgname-account_name`.
      + Set the `HOST` parameter value as the default (`.snowflakecomputing.com`).
    * Account locator:

      + Set the `ACCOUNT` parameter value as `account_locator_with_additional_segments`.
      + Set the `HOST` parameter value as the default `.snowflakecomputing.com`. Specify if your Snowflake account is not
        in the `us-west` region.

    For additional information, see
    [Connecting](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/Connecting.md).

Golang:
:   * Account name: `db, err := sql.Open("snowflake", "jsmith:mypassword@orgname-account_name/mydb/testschema?warehouse=mywh")`
    * Account locator: `sql.Open("snowflake", "jsmith:mypassword@account_locator_with_additional_segments/mydb/testschema?warehouse=mywh")`

    For additional information, see
    [Connection String](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Connection_String).

node.js:
:   * Account name: Set the `ACCOUNT` parameter value as `orgname-account_name`.
    * Account locator: Set the `ACCOUNT` parameter value as `account_locator_with_additional_segments`.

    For additional information, see [Managing connections](../developer-guide/node-js/nodejs-driver-connect.md).

Spark (connector):
:   * Account name: Same as JDBC
    * Account locator: Same as JDBC

    For additional information, see [Setting Configuration Options for the Connector](spark-connector-use.md).

Spark (Databricks):
:   * Account name: `Account URL for Snowflake account`
    * Account locator: `Account Locator URL for Snowflake account`

    For additional information, see [Configuring Snowflake for Spark in Databricks](spark-connector-databricks.md).

Spark (Qubole):
:   * Account name: Set the Host Address field value as `orgname-account_name.snowflakecomputing.com`.
    * Account locator: Set the Host Address field value as
      `account_locator_with_additional_segments.snowflakecomputing.com`.

    For additional information, see [Configuring Snowflake for Spark in Qubole](spark-connector-qubole.md).

PHP:
:   * Account name:

      + Set the `ACCOUNT` parameter value as `orgname-account_name`.
      + Leave the `REGION` parameter value blank for all regions.
    * Account locator:

      + Set the `ACCOUNT` parameter value as `account_locator`.
      + Set the `REGION` parameter value if your Snowflake account is not in the `us-west` region.

    For additional information, see
    [Connecting to the Snowflake database](https://github.com/snowflakedb/pdo_snowflake/blob/master/README.rst#connecting-to-the-snowflake-database).

SQLAlchemy:
:   * Account name: `snowflake://user_login_name:password@orgname-account_name`
    * Account locator: `snowflake://user_login_name:password@account_locator_with_additional_segments`

    For additional information, see [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../developer-guide/python-connector/sqlalchemy.md).

## Additional configuration steps

The next topics cover specific areas of configuring a connection:

* [Allowing Host names](hostname-allowlist.md)
* [OCSP Configuration](ocsp.md)

---
title: Configuring a second factor of authentication
source: https://docs.snowflake.com/en/user-guide/security-mfa-second-factor.md
section: User Guide
---

# Configuring a second factor of authentication

When a password user is enrolled in [Multi-factor authentication (MFA)](security-mfa.md), they must use a second factor of
authentication when signing in to Snowflake. These users enter their password, then use the second factor.

Snowflake provides the following possible second factors:

* Authenticating with a passkey that can be stored and accessed in a variety of ways.
* Authenticating with your preferred authenticator app.
* Authenticating with Duo.

Your administrator controls which factors are available to you. For more information, see [Restricting which MFA methods are available](security-mfa.md).

## Get started

When an administrator requires a user to enroll in MFA, the user is prompted to add a second factor of authentication the next time they
sign in to Snowsight.

If you are already signed in to Snowsight and want to set up a second factor of authentication, do the following:

1. In the left-hand navigation, select your name. The user menu opens.
2. Select Settings.
3. Select Authentication.
4. In the Multi-factor authentication section, select Add new authentication method.
5. Follow the prompts to configure your second factor of authentication.

## Using passkey authentication

A passkey is a form of authentication based on the [WebAuthn standard](https://www.w3.org/TR/webauthn-3/), which uses public/private key
cryptography. When you successfully configure Snowflake to authenticate with a passkey, the private key is securely stored in a personal
location, whether it’s on your machine, a hardware security key (for example, a Yubikey), or a password manager.

To set up a passkey as your second factor of authentication, complete the following tasks:

1. When prompted, select Passkey.
2. Complete the steps to store your passkey as you would with any other website or application. For example, you can use a hardware security
   key or configure your machine so you must use a fingerprint to access the passkey when authenticating.
3. Specify a name for the authentication method so that you can identify it when signing in to Snowflake.

After you enter your password, you’ll be prompted to provide your passkey, using the method you configured.

## Using an authenticator app

Snowflake allows you to use your preferred authenticator app to use a time-based one-time passcode (TOTP) as your second factor of
authentication. Common authenticator apps include Google Authenticator, Microsoft Authenticator, and Authy.

To set up an authenticator app as your second factor, complete the following tasks:

1. When prompted, select Authenticator.
2. Complete the steps with your authenticator app as you would with any other website or application.
3. Specify a name for the authentication method so that you can identify it when signing in to Snowflake.

After you enter your password, you’ll be prompted to enter the TOTP from your authenticator app.

## Using Duo

To set up Duo as your second factor, complete the following tasks:

1. When prompted, select DUO.
2. Complete the steps with Duo as you would with any other website or application.

> **Note:**
>
> Your administrator must configure your organization’s firewall before you can use Duo as a second factor of authentication. For more
> information, see [Prerequisite](security-mfa-duo.md).

## View your authentication methods

You can use Snowsight or SQL to view your second factors of authentication.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the left-hand navigation, select your name. The user menu opens.
3. Select Settings.
4. Select Authentication.
5. Use the Multi-factor authentication section to view your MFA methods.

Execute the [SHOW MFA METHODS](../sql-reference/sql/show-mfa-methods.md) command.

```sqlexample
SHOW MFA METHODS;
```

> **Note:**
>
> If you’re an administrator who wants to view the authentication method of another user, see [SHOW MFA METHODS](../sql-reference/sql/show-mfa-methods.md).
>
> For information about the passkeys and TOTPs for all users in the account, query the
> [CREDENTIALS view](../sql-reference/account-usage/credentials.md). Note that this view does not include information about
> [Duo authenticators](security-mfa-duo.md) (Duo push and passcodes).

## Set a default authentication method

If you configured more than one MFA method as a second factor of authentication, you can choose which one you’ll use to authenticate after
you enter your password. To set the default second factor, do the following:

1. In the left-hand navigation, select your name. The user menu opens.
2. Select Settings.
3. Select Authentication.
4. In the Multi-factor authentication section, select an MFA method from the Default sign-in method drop-down.

## Identifying the login sessions in which a second-factor credential was used

To determine when a second-factor credential was used for authentication (for example, a specific passkey or time-based one-time
passcode), you can join the [LOGIN_HISTORY](../sql-reference/account-usage/login_history.md) and
[CREDENTIALS](../sql-reference/account-usage/credentials.md) views in the ACCOUNT_USAGE schema on the column containing the
credential ID:

* The LOGIN_HISTORY view contains the credential ID in the `second_authentication_factor_id` column, if the
  `second_authentication_factor` column contains `PASSKEY` or `TOTP`.
* The CREDENTIALS view contains the credential ID in the `credential_id` column.

For example:

```sqlexample
SELECT
    login.event_timestamp,
    login.user_name,
    cred.name
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY login
    JOIN SNOWFLAKE.ACCOUNT_USAGE.CREDENTIALS cred
    ON login.second_authentication_factor_id = cred.credential_id
  WHERE login.second_authentication_factor IN ('PASSKEY', 'TOTP');
```

```output
+-------------------------------+-----------+--------------+
| EVENT_TIMESTAMP               | USER_NAME | NAME         |
|-------------------------------+-----------+--------------|
| 2025-08-05 17:10:00.941 -0700 | USER_A    | PASSKEY_RALU |
| 2025-07-28 13:04:27.201 -0700 | USER_B    | TOTP_D406    |
| 2025-07-21 09:09:47.701 -0700 | USER_C    | PASSKEY_GN1N |
+-------------------------------+-----------+--------------+
```

To get information about the queries that were run during this login session, you can join the LOGIN_HISTORY view with the
[SESSIONS](../sql-reference/account-usage/sessions.md) view on the `login_event_id` column to get the session ID, and then
use that to join the [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view.

---
title: Configuring access control
source: https://docs.snowflake.com/en/user-guide/security-access-control-configure.md
section: User Guide
---

# Configuring access control

This topic describes how to configure access control security for [securable objects](security-access-control-overview.md) in your
account.

## Account administration

### Designating additional users as account administrators

By default, each account has one user who has been designated as an account administrator (that is, user granted the system-defined ACCOUNTADMIN
role). We recommend designating at least one other user as an account administrator. This helps ensure that your account always has at least
one user who can perform account-level tasks, particularly if one of your account administrators is unable to log in.

For these additional account administrators, you can choose to create new users or designate existing users, but make sure to specify the
following:

* Grant the ACCOUNTADMIN role to the user(s), but do not set this role as their default. Instead, designate a lower-level
  administrative role (for example, SYSADMIN) or custom role as their default. This helps prevent account administrators from inadvertently using
  the ACCOUNTADMIN role to create objects.
* Ensure an email address is specified for each user (required for multi-factor authentication).

For example, grant the ACCOUNTADMIN and SYSADMIN roles to an existing user named `user2` and specify SYSADMIN as the default role:

> ```sqlexample
> GRANT ROLE ACCOUNTADMIN, SYSADMIN TO USER user2;
>
> ALTER USER user2 SET EMAIL='user2@domain.com', DEFAULT_ROLE=SYSADMIN;
> ```

### Enabling MFA for each account administrator

To ensure the highest level of security for your Snowflake account, we strongly recommend that any user who can modify or view
sensitive data be required to use multi-factor authentication (MFA) for login.

This recommendation applies particularly to users with the ACCOUNTADMIN role, but can also be expanded to include users with the
SECURITYADMIN and SYSADMIN roles.

For more details, see [Access control best practices](security-access-control-considerations.md) and [Multi-factor authentication (MFA)](security-mfa.md).

## Creating custom roles

To follow the general principle of “least privilege”, we recommend creating custom roles that
[align with the business functions](security-access-control-considerations.md) in your organization to permit SQL actions
on a narrow set of securable objects.

You can create custom roles using Snowsight or SQL.

The workflow is as follows:

1. Create a custom role.
2. Grant a set of privileges to the role.
3. Grant the role to one or more users who require the privileges granted to the role to perform SQL actions for their business needs.
4. Grant the role to another role to create or add to a role hierarchy. While not required, this step is highly recommended. For more
   information, see Creating a role hierarchy.

This section provides instructions for creating a role named `r1` and granting the following privileges to the role. The privileges allow
a user who activates the role in a session to query a single table, `d1.s1.t1`:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Warehouse `w1`  Database `d1`  Schema `s1` | To query an object (for example, a table or view), a role must have the USAGE privilege on a warehouse. The warehouse provides the compute resources to execute the query.  To operate on any object in a schema, a role must have the USAGE privilege on the container database and schema. |
| SELECT | Table `t1` |  |

After a role is created, additional privileges can be granted to it to allow users with the role to perform additional SQL actions on the
same or additional objects.

### Create a role

Only user administrators (that is, users with the USERADMIN system role or higher), or another role with the CREATE ROLE privilege on the
account, can create roles.

SQL:
:   1. Create the `r1` role using a [CREATE ROLE](../sql-reference/sql/create-role.md) statement:

       > ```sqlexample
       > CREATE ROLE r1
       >    COMMENT = 'This role has all privileges on schema_1';
       > ```

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. Switch to a role with privileges to create roles in the account.
    3. In the navigation menu, select Governance & security » Users & roles, and then select Roles.
    4. Select + Role.

       A New Role dialog appears.
    5. For Name, enter the name of the role. For example, `r1`.
    6. For Grant to role, optionally choose to grant the new role to an existing role and inherit the privileges of the existing role.
    7. Optionally add a comment.
    8. Select Create Role.

### Grant privileges to a role

You can use the SECURITYADMIN role to grant privileges on objects to roles. For more information, see
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md).

SQL:
:   1. Grant to the `r1` role the privileges defined in the table earlier in this section.

       ```sqlexample
       GRANT USAGE ON WAREHOUSE w1 TO ROLE r1;

       GRANT USAGE ON DATABASE d1 TO ROLE r1;

       GRANT USAGE ON SCHEMA d1.s1 TO ROLE r1;

       GRANT SELECT ON TABLE d1.s1.t1 TO ROLE r1;
       ```

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer.
    3. For a specific database and schema, select a database object to which you want to grant privileges. For example, a database
       named `d1`.
    4. In the Object Details, locate the Privileges section.
    5. Select + Privilege.
    6. Select the role or user to which you want to grant privileges. For example, `r1` or `u1`.

    > **Tip:**
    >
    > You can search for users by username, email, or first/last name.

    1. Select the privilege you want to grant to the role or user. For example, `USAGE`.
    2. If you want the role to be able to grant the privilege to other roles or users, select the checkbox for Grant option.
    3. Select Grant Privileges.

    For this example, repeat the steps to grant USAGE on the schema `s1`, SELECT on the table `t1`.

    To grant USAGE on the warehouse `w1`, complete the following steps:

    1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Compute » Warehouses.
    3. Locate and select the warehouse to which you want to grant privileges. For example, `w1`.
    4. In the Privileges section, select + Privilege.
    5. Select the role or user to which you want to grant privileges. For example, `r1` or `u1`.
    6. For Privileges, select the privilege to grant. For example, USAGE.
    7. If you want the role to be able to grant the privilege to other roles or users, select the checkbox for Grant option.
    8. Select Grant Privileges.

### Grant the role to users

You can use the SECURITYADMIN role to grant roles to users. For additional options, see [GRANT ROLE](../sql-reference/sql/grant-role.md).

SQL:
:   1. Assign the `r1` role to user `smith` using a [GRANT ROLE](../sql-reference/sql/grant-role.md) statement:

       > ```sqlexample
       > GRANT ROLE r1
       >    TO USER smith;
       > ```
    2. Optionally set the new custom role as the default role for the user. The next time the user logs into Snowflake, the default role is
       automatically active in the session.

       Only the role with the OWNERSHIP privilege on the user, or a higher role, can execute this command.

       The following command sets the default role for user `smith`:

       > ```sqlexample
       > ALTER USER smith
       >    SET DEFAULT_ROLE = r1;
       > ```

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. Switch to a role with privileges to grant privileges to roles in the account.
    3. In the navigation menu, select Governance & security » Users & roles, and then select Roles.
    4. Select Table and locate and select the role that you created.
    5. In the section 0 users have been granted R1, select Grant to User.
    6. For User to receive grant, select a user to grant the role to. For example, smith.
    7. Select Grant.

### Grant global privileges to a role

You can also grant a global privilege to a role. See [Access control privileges](security-access-control-privileges.md) for the list of global privileges
available to grant to a role.

SQL:
:   Use the GRANT PRIVILEGE command. See [Privilege management](../sql-reference/commands-user-role.md) for details.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. Switch to a role with privileges to grant privileges to roles in the account.
    3. In the navigation menu, select Governance & security » Users & roles, and then select Roles.
    4. Select Table and locate and select the role that you created.
    5. In the role details page, select  » Manage global privileges.
    6. For Global privilege to grant, select the privilege that you want to grant to the role.
    7. If you want the role to be able to grant the privilege to other roles, select the checkbox for Grant option.
    8. Select Update Privileges.

## Creating custom read-only roles

Suppose you need a role that is limited to querying all tables in a specific schema (for example, `d1.s1`). Users who execute
commands using this role cannot update the table data, create additional database objects, or drop tables. The role is limited to querying
table data.

To create a read-only role, complete the basic steps described in Creating custom roles. In the
Grant privileges to a role section, grant the read-only role (named `read_only` in these instructions) the
following object privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Warehouse | To query an object (for example, a table or view), a role must have the USAGE privilege on a warehouse. The warehouse provides the compute resources to execute the query. |
| SELECT | Table | To operate on any object in a schema, a role must have the USAGE privilege on the container database and schema. |

The GRANT *<privilege>* statements are as follows:

```sqlexample
GRANT USAGE
  ON DATABASE d1
  TO ROLE read_only;

GRANT USAGE
  ON SCHEMA d1.s1
  TO ROLE read_only;

GRANT SELECT
  ON ALL TABLES IN SCHEMA d1.s1
  TO ROLE read_only;

GRANT USAGE
  ON WAREHOUSE w1
  TO ROLE read_only;
```

> **Note:**
>
> The `GRANT SELECT ON ALL TABLES IN SCHEMA <schema>` statement grants the SELECT privilege on all existing tables only. To
> grant the SELECT privilege on all future tables to the role, execute the following
> statement:
>
> > ```sqlexample
> > GRANT SELECT ON FUTURE TABLES IN SCHEMA d1.s1 TO ROLE read_only;
> > ```

## Creating a role hierarchy

When creating custom roles, consider creating a role hierarchy ultimately assigned to a high-level administrator role. In general, the
SYSADMIN role works well as the role all other roles are assigned to in a hierarchy, although it is important to note that any role with
sufficient privileges could serve this function. The SYSADMIN role is a system-defined role that has privileges to create warehouses,
databases, and database objects in an account and grant those privileges to other roles. In the default system hierarchy, the top-level
ACCOUNTADMIN role manages the system administrator role.

Create a role hierarchy by granting a role to a second role. You can then grant that second role to a third role. The privileges associated
with a role are inherited by any roles above that role in the hierarchy (that is, the parent role).

The following diagram shows an example role hierarchy and the privileges granted to each role:

### Grant a role to another role

Assign the role to a higher-level role in a role hierarchy. In this example, we are assigning the `r1` role created in
Creating custom roles to the SYSADMIN role. The SYSADMIN role inherits any object privileges granted to the
`r1` role:

SQL:
:   ```sqlexample
    GRANT ROLE r1
       TO ROLE sysadmin;
    ```

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Users & roles, and then select Roles.
    3. Select Table and locate the role that you want to grant to another role. For example, `r1`.
    4. In the section 0 roles have been granted R1, select Grant to Role.
    5. For Role to receive grant, select SYSADMIN.
    6. Select Grant.

> **Note:**
>
> In a more complex example, you could assign the `custom` role to another child role of SYSADMIN (or another administrator role,
> such as a custom role with sufficient privileges to create databases). The SYSADMIN role would inherit the combined privileges assigned
> to the `custom` role and its parent role. If the role above `custom` in the hierarchy owned any objects, then the role hierarchy
> would ensure that members of the SYSADMIN role also owned those objects (indirectly) and could manage them as expected.

### Explore role hierarchies in Snowsight

Snowsight includes a roles graph that displays the hierarchy of roles in your account. The graph is organized in descending order
of hierarchy, with paths representing inheritance from parent to child roles. In accounts with lots of roles, the graph can take some time
to load.

> **Note:**
>
> Database roles are not displayed in the roles graph.

To open the roles graph, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.

When viewing the roles graph, you can select an individual role or user which then becomes the focus of the roles graph. To explore,
you can zoom in and out, and center the graph on the focused role or user.

Select a role to view details about the role, such as when the role was created, the owner role, the number of roles granted to the role,
the number of roles that the role has been granted to, the number of users to which the role is granted, and the ability to manage grants
to the role.

When viewing the details for a role, you can select the  to center the graph on the selected role or open the role detail
page.

## Granting privileges to a user

A user with MANAGE GRANTS privileges on objects can grant privileges directly to users. For more information, see
[GRANT <privileges> … TO USER](../sql-reference/sql/grant-privilege-user.md).

For example, to grant the USAGE privilege on a Streamlit application `streamlitApp1` to
`user1`, execute the following commands:

```sqlexample
GRANT USAGE ON WAREHOUSE w1 TO USER user1;

GRANT USAGE ON DATABASE d1 TO USER user1;

GRANT USAGE ON SCHEMA d1.s1 TO USER user1;

GRANT USAGE ON STREAMLIT `streamlitApp1` TO USER user1;
```

> **Note:**
>
> Privileges assigned directly to users are only effective when the user has all secondary roles enabled.

For more specific information about granting privileges to users, see [Usage notes](../sql-reference/sql/grant-privilege-user.md) for
GRANT *<privileges>* … TO USER.

### Disabling UBAC

We understand that this new access control model might affect your governance practices. If you need to disable UBAC in your account *after*
Bundle 2025_02 becomes enabled by default, use the ALTER ACCOUNT command to set the account parameter
`DISABLE_USER_PRIVILEGE_GRANTS = TRUE`. For example:

```sqlexample
ALTER ACCOUNT SET DISABLE_USER_PRIVILEGE_GRANTS = TRUE;
```

For more information about using the ALTER ACCOUNT command to set account parameters, see [ALTER ACCOUNT](../sql-reference/sql/alter-account.md). For more
information about the DISABLE_USER_PRIVILEGE_GRANTS parameter, see [DISABLE_USER_PRIVILEGE_GRANTS](../sql-reference/parameters.md).

## Assigning future grants on objects

To simplify grant management, *future grants* allow defining an initial set of privileges to
grant on new (that is, future) objects of a certain type in a database or a schema. As new
objects are created in the database or schema, the defined privileges are automatically granted
to a specified role.

Future grants only define the initial set of privileges granted on new objects of a specified
type. After an individual object is created, administrators can explicitly grant additional privileges
or revoke privileges on the object. This allows fine-grained access control over all objects in the
schema or database.

### Considerations when using future grants

* When future grants are defined on the same object type for a database and a schema in the
  same database, the schema-level grants take precedence over the database level grants, and the
  database level grants are ignored. This behavior applies to privileges on future objects granted
  to one role or different roles.

  For example, the following statements grant different privileges on objects of the same type
  at the database and schema levels.

  Grant the SELECT privilege on all future tables in database `d1` to role `r1`:

  ```sqlexample
  GRANT SELECT ON FUTURE TABLES IN DATABASE d1 TO ROLE r1;
  ```

  Grant the INSERT and DELETE privileges on all future tables in schema `d1.s1` to role `r2`.

  ```sqlexample
  GRANT INSERT,DELETE ON FUTURE TABLES IN SCHEMA d1.s1 TO ROLE r2;
  ```

  The future grants assigned to the `r1` role on object types in schema `d1.s1` are ignored completely. When new tables are created
  in schema `d1.s1`, only the future privileges defined on tables for the `r2` role are granted.
* Database level future grants apply to both regular and
  managed access schemas.

To manage future grants in Snowsight, run SQL statements in a worksheet.

### Defining future grants on database or schema objects

Grant privileges on future objects of a specified type using the
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command with the ON FUTURE keywords.

### Revoking future grants on database or schema objects

Revoke grants on future objects using the [REVOKE <privileges> … FROM ROLE](../sql-reference/sql/revoke-privilege.md) command with
the ON FUTURE keywords.

### Object cloning and future grants

* When a database or schema is cloned, future grants are copied to its clone. This behavior maintains
  consistency with regular object grants. For example, when you clone a source object such as a database,
  grants of privileges on the database are not copied to its clones. Privilege grants on all child objects,
  such as tables created in the database, are copied to the clones.
* When an object in a schema is cloned, any future grants defined for this object type in the schema
  are applied to the cloned object unless the COPY GRANTS option is specified in the CREATE *<object>*
  statement for the clone operation. In that case, the new object retains the access permissions of the
  original object and does not inherit any future grants for objects of that type.

## Creating managed access schemas

Managed access schemas improve security by locking down privilege management on objects.

In regular (that is, non-managed) schemas, object owners (that is, a role with the OWNERSHIP privilege on an object) can grant access on
their objects to other roles, with the option to further grant those roles the ability to manage object grants.

With managed access schemas, object owners lose the ability to make grant decisions. Only the schema owner (that is, the role with the
OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant privileges on objects in the schema, including
future grants, centralizing privilege management.

You can create a managed access schema in Snowsight using a SQL command. For example, run the
[CREATE SCHEMA](../sql-reference/sql/create-schema.md) command with the `WITH MANAGED ACCESS` keywords.

```sqlexample
CREATE SCHEMA myschema WITH MANAGED ACCESS;
```

You can change a managed access schema to a regular one in Snowsight using a SQL command. For example, run the
[ALTER SCHEMA](../sql-reference/sql/alter-schema.md) command with the `DISABLE MANAGED ACCESS` keywords.

```sqlexample
ALTER SCHEMA myschema DISABLE MANAGED ACCESS;
```

The following table indicates which roles can manage object privileges in a regular or managed access schema:

| Role | Can grant object privileges in a regular schema | Can grant object privileges in a managed access schema |
| --- | --- | --- |
| SYSADMIN | No | No |
| SECURITYADMIN or higher | Yes | Yes |
| Database owner | No | No |
| Schema owner | No | Yes |
| Object owner | Yes | No |
| Any role with the MANAGE GRANTS privilege | Yes | Yes |

## Manage object privileges with Snowsight

You can use Snowsight to manage grants of database object privileges to roles.
To manage these grants, use a role with either the OWNERSHIP privilege on the object or with the global MANAGE GRANTS privilege.

When you use Snowsight to manage grants, it is equivalent to running a [GRANT PRIVILEGE](../sql-reference/sql/grant-privilege.md)
or [REVOKE PRIVILEGE](../sql-reference/sql/revoke-privilege.md) command in SQL. For example, you can use Snowsight to grant
the USAGE privilege on a view to the ACCOUNTADMIN role.

### Grant privileges on objects

To grant database object privileges to a role, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. For a specific database and schema, select a database object to which you want to grant privileges.
4. In the Object Details, locate the Privileges section.
5. Select + Privilege.
6. Select the role to which you want to grant privileges.
7. Select the privilege you want to grant to the role.
8. If you want the role to be able to grant the privilege to other roles, select Grant option.
9. Repeat the steps for each object privilege you want to grant to the role.
10. Select Grant Privileges.

### Revoke privileges on objects

To revoke database object privileges from a role, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. For a specific database and schema, select a database object from which you want to revoke privileges.
4. In the Object Details, locate the Privileges section.
5. For a specific role listed, select the Edit Role pencil icon that appears when you hover over the row.
6. In the dialog that appears, select the x to revoke a privilege from a specific role.
7. Select Update Privileges.

### Identify privileges granted to roles

To show the privileges granted on a specific role, you can run the [SHOW GRANTS](../sql-reference/sql/show-grants.md) command or
do the following in Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles » Roles.
3. Select Table and locate the role for which you want to view granted privileges.
4. Select the role for which you want to view granted privileges to view the details.
5. Review the Privileges section for the role.

## Enabling non-account administrators to monitor usage and billing history

Snowflake provides extensive account usage and billing information about data storage/transfer and warehouse usage/load:

Snowsight:
:   In the navigation menu, select Admin » Cost management.

SQL:
:   Query any of the following:

    * Table functions (in the [Snowflake Information Schema](../sql-reference/info-schema.md)):

      + [DATABASE_STORAGE_USAGE_HISTORY](../sql-reference/functions/database_storage_usage_history.md)
      + [STAGE_STORAGE_USAGE_HISTORY](../sql-reference/functions/stage_storage_usage_history.md)
      + [WAREHOUSE_LOAD_HISTORY](../sql-reference/functions/warehouse_load_history.md)
      + [WAREHOUSE_METERING_HISTORY](../sql-reference/functions/warehouse_metering_history.md)
    * Views (in [Account Usage](../sql-reference/account-usage.md)):

      + [DATABASE_STORAGE_USAGE_HISTORY view](../sql-reference/account-usage/database_storage_usage_history.md)
      + [STAGE_STORAGE_USAGE_HISTORY view](../sql-reference/account-usage/stage_storage_usage_history.md)
      + [WAREHOUSE_LOAD_HISTORY view](../sql-reference/account-usage/warehouse_load_history.md)
      + [WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md)

By default, this information can be accessed/viewed only by account administrators.

> **Note:**
>
> Currently, [Snowsight](ui-snowsight-gs.md) only displays usage and billing information to account administrators. It is not possible to grant other
> roles the ability to view this information.

To enable users who are not account administrators to access/view this information, grant the following privileges to a system-defined or
custom role. Granting the privileges to a role allows all users who are granted the role to access this historical/usage information:

> | Privilege | Object | Description |
> | --- | --- | --- |
> | MONITOR USAGE | Account (that is, global privilege) | Allows users who have been granted the role to view usage and billing information in the web interface and query the corresponding table functions in the Information Schema.  In addition, with this privilege, the [SHOW DATABASES](../sql-reference/sql/show-databases.md) and [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) commands return the lists of all databases and warehouses in the account, respectively, regardless of other privilege grants. |
> | IMPORTED PRIVILEGES | `snowflake` database | Allows users who have been granted the role to query all of the ACCOUNT USAGE views, including the views containing usage and billing information.  For more information, see [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md). |

For example, to grant these permissions to the `custom` role:

```sqlexample
GRANT MONITOR USAGE ON ACCOUNT TO ROLE custom;

GRANT IMPORTED PRIVILEGES ON DATABASE snowflake TO ROLE custom;
```

---
title: Configuring an identity provider (IdP) for Snowflake
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-configure-idp.md
section: User Guide
---

# Configuring an identity provider (IdP) for Snowflake

The tasks for configuring an IdP are different depending on whether you choose Okta, AD FS, or another (i.e. custom) SAML 2.0-compliant service/application to
provide federated authentication for your Snowflake users.

> **Important:**
>
> Prior to configuring your IdP, consider how to manage federated authentication after it is fully configured and how users will access Snowflake through
> federated authentication.
>
> For example, decide whether users will access Snowflake through a public URL or through a URL associated with
> private connectivity to the Snowflake service. To learn more, see [Managing/Using federated authentication](admin-security-fed-auth-use.md).

## Okta setup

To use Okta as your IdP for federated authentication, you must perform the following tasks in Okta:

1. Create an Okta account for your company or organization.
2. Log into your Okta account as a user with administrator privileges and create a user for each person who will need access to Snowflake. When creating
   users, make sure to include an email address for each user. Email addresses are required to map the users in Okta with the corresponding users in Snowflake.

   > **Note:**
   >
   > Remember to ensure the email address you enter in Okta maps to the `login_name` value of the user object in Snowflake and
   > the SAML `NameID` attribute.
3. Create a Snowflake application in Okta:

   * In the Label field for the application, you can specify any name.
   * In the SubDomain field for the application, enter the [account identifier](admin-account-identifier.md) of
     your Snowflake account. If you are using private connectivity, append `privatelink` to the account identifier. For example, if the
     URL used to access the Snowflake account is `https://myorg-myaccount.privatelink.snowflakecomputing.com`, then
     enter `myorg-myaccount.privatelink`.

     If the Snowflake account name contains an underscore and you are using the account name format of the identifier, you need to convert
     the underscore to a hyphen because Okta does not support underscores in URLs (e.g. `myorg-myaccount-name`).
4. Assign the Okta users you created to the Snowflake application in Okta.

### Obtain IdP information

Snowflake as the service provider needs information about the IdP to establish a relationship between the two. You’ll need this information
when you configure Snowflake, as described in [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

The preferred method of providing information about the IdP is to obtain its metadata URL, which Snowflake can use to dynamically obtain
all of the information it needs. You can also manually define this information in multiple parameters in Snowflake, but that process is
error prone and the parameters must be manually updated if IdP configuration settings change, including when certificates are rotated.

#### Obtain the metadata URL (Preferred)

1. Navigate to the integration that you created for Snowflake and select the Sign On tab.
2. In the SAML 2.0 tile of the Sign on methods section, copy the Metadata URL.

#### Obtain the SSO URL and certificate

1. Navigate to the integration that you created for Snowflake and select the Sign On tab.
2. Select View Setup Instructions.
3. Gather the required information from the setup instructions:

   * SSO URL (IdP URL endpoint to which Snowflake will send SAML requests)
   * Certificate (used to verify communication between the IdP and Snowflake)

## AD FS setup

To use AD FS as your IdP for federated authentication, you must perform the following tasks in AD FS.

### Prerequisites

* Verify that AD FS 3.0 is installed and working on Windows Server 2012 R2.
* Ensure that a user exists in AD FS for each person who will need access to Snowflake. When creating users, make sure to include an email address for each
  user. Email addresses are required to connect the users in AD FS with their corresponding users in Snowflake.

> **Note:**
>
> Other versions of AD FS and Windows Server can be used; however, the configuration instructions may be different.

### Add a relying party trust for Snowflake

In the AD FS Management console, use the Add Relying Party Trust Wizard to add a new relying party trust to the AD FS configuration database:

1. When prompted, select the Enter data about the relying party manually radio button.
2. In the next screen, enter a display name (e.g. “Snowflake”) for the relying party.
3. In the next screen, select the AD FS profile radio button.
4. Skip the next screen (for specifying an optional token encryption certificate).
5. In the next screen:

   * Select the Enable support for the SAML 2.0 WebSSO protocol checkbox.
   * In the Relying party SAML 2.0 SSO service URL field, enter the SSO URL for your Snowflake account appended with `/fed/login`. For example, to use the Account Name URL with private connectivity, enter: `https://<orgname>-<account_name>.privatelink.snowflakecomputing.com/fed/login`. For a list of possible URL formats, see [Connecting with a URL](organizations-connect.md). When you [create the security integration](admin-security-fed-auth-security-integration.md) for federated authentication, make sure its URL parameters match the format used in this field.
6. In the next screen, in the Relying party trust identifier field, enter the URL for your Snowflake account as specified in the
   previous step.
7. In the next screen, select the
   I do not want to configure multi-factor authentication settings for this relying party trust at this time radio button.
8. In the next screen, select the Permit all users to access this relying party radio button.
9. In the next screen, review your configuration for the relying party trust. Also ensure that in the Advanced tab,
   SHA-256 is selected as the secure hash algorithm.
10. In the next screen, select Open the Edit Claim Rules dialog for this relying party trust when the wizard closes and click
    Close to finish the wizard configuration.

### Define claim rules for the Snowflake relying party trust

The Edit Claim Rules for `snowflake_trust_name` window opens automatically after closing the wizard. You can also open this window from the
AD FS Management console by clicking on:

> AD FS » Trust Relationships » Relying Party Trusts » `snowflake_trust_name` » Edit Claim Rules…

In the window:

1. Create a rule for sending LDAP attributes as claims:

   1. Click Add Rules and select Send LDAP Attributes as Claim.
   2. In the Edit Rule dialog:

      * Enter a name (e.g. “Get Attributes”) for the rule.

        + Set Attribute store to: Active Directory.
        + Add two LDAP attributes for the rule:

          - E-Mail-Addresses with E-Mail Address as the Outgoing Claim Type.
          - Display-Name with Name as the Outgoing Claim Type.
   3. Click the OK button to create the rule.
2. Create a rule for transforming incoming claims:

   1. Click Add Rules and select Transform an Incoming Claim.
   2. In the Add Transform Claim Rule Wizard dialog:

      * Enter a name (e.g. “Name ID Transform”) for the claim rule.
      * Set Incoming claim type to: E-Mail Address.
      * Set Outgoing claim type to: Name ID.
      * Set Outgoing name ID format to: Email.
      * Select the Pass through all claim values radio button.
   3. Click the Finish button to create the rule.
3. Click the OK button to finish adding claim rules for the Snowflake relying party trust.

> **Important:**
>
> Ensure that you enter the values for the rules exactly as described above.
>
> Also, ensure that the rules you created are listed in the following order:
>
> 1. LDAP Attributes
> 2. Incoming Claim Transform
>
> The rules will not work correctly if there are any typos in the rules or the rules are not listed in the correct order.

### Enable global logout — Optional

To enable global logout for Snowflake in AD FS, in the AD FS Management console, click on:

> AD FS » Trust Relationships » Relying Party Trusts » *<snowflake_trust_name>* » Properties

In the Properties dialog:

1. Go to the Endpoints tab and click the Add SAML… button.
2. In the Edit Endpoint dialog:

   * Set Endpoint type to: SAML Logout.
   * Set Binding to: POST or REDIRECT.
   * Set Trusted URL to the value specified in step 1.
   * Leave Response URL blank.
   * Click the OK button to save your changes.

### Obtain IdP information

Snowflake as the service provider needs information about the IdP to establish a relationship between the two. You’ll need this information
when you configure Snowflake, as described in [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

The preferred method of providing information about the IdP is to obtain its metadata URL, which Snowflake can use to dynamically obtain
all of the information it needs. You can also manually define this information in multiple parameters in Snowflake, but that process is
error prone and the parameters must be manually updated if IdP configuration settings change, including when certificates are rotated.

#### Obtain the metadata URL (Preferred)

1. Navigate to the integration that you created for Snowflake and select the Sign On tab.
2. Select the Endpoints tab.
3. Find the Federation metadata document field and copy the URL. This is your metadata URL.

#### Obtain the SSO URL and certificate

To complete the AD FS setup, obtain the SSO URL and certificate from AD FS. You will use these two values in the next step:
[Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

* SSO URL
  :   The AD FS URL endpoint to which Snowflake will send SAML requests. This is typically the Login URL for AD FS, which is usually the IP address or the
      fully-qualified domain name (i.e. FQDN) of your AD FS server with `/adfs/ls` appended to the end.
* Certificate
  :   Used to verify communication between AD FS and Snowflake. You download it from the AD FS Management console:

  1. In the console, click on:

     > AD FS » Service » Certificates
  2. In the Certificates page, right-click the Token-signing entry and click View Certificate….
  3. In the Certificate dialog, select the Details tab.
  4. Click Copy to File… to open the Certificate Export Wizard.
  5. For the export file format, select Base-64 encoded X.509 (.CER) and click Next.
  6. Save the file to a directory on your local environment.
  7. Open the file and copy the certificate, which consists of a single line located between the following lines:

     ```text
     -----BEGIN CERTIFICATE-----
     <certificate>
     -----END CERTIFICATE-----
     ```

## Custom IdP setup

To use a SAML 2.0-compliant service or application as your IdP for federated authentication, you must perform the following tasks:

1. In the service/application interface, define a custom SHA-256 application for Snowflake. The instructions for defining a custom application are specific to
   the service/application that is serving as the IdP.
2. In the interface, create a user for each person who will need access to Snowflake. When creating users, make sure to include an email address for each
   user. Email addresses are required to connect the users in the IdP with their corresponding users in Snowflake.
3. Obtain the SSO URL and certificate from your Custom IdP. You will need the SSO URL value and Certificate in the next step,
   [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

   * SSO URL (IdP URL endpoint to which Snowflake will send SAML requests)
   * Certificate (used to verify communication between the IdP and Snowflake)

> **Important:**
>
> When configuring custom identity providers, field values are often case-sensitive. If error messages or error codes appear, double-check the casing for any
> values you may have entered in the configuration process.

## Next steps

After you have completed the steps above, you must
[configure Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md) to complete your custom IdP setup.

---
title: Configuring secure access to Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-s3-config.md
section: User Guide
---

# Configuring secure access to Amazon S3

To read data from and write to an S3 bucket, the security and access management policies on the bucket must allow Snowflake to access the bucket.

The following options for configuring secure access to a private S3 bucket are supported:

Option 1:
:   Configure a storage integration object to delegate authentication responsibility for external cloud storage to a Snowflake identity and access management (IAM) entity.

    > **Note:**
    >
    > We highly recommend this option, which avoids the need to supply AWS IAM credentials when creating stages or loading data.

Option 2:
:   Configure an AWS IAM role with the required policies and permissions to access your external S3 bucket. This approach allows individual users to avoid providing and managing security credentials and access keys.

    Note that implementing this feature requires a named external stage. Support for accessing an S3 bucket URL directly in a COPY statement is not supported.

    > **Important:**
    >
    > The ability to use an AWS IAM role to access a private S3 bucket to load or unload data is now deprecated (i.e. support will be removed in a future release, TBD). We highly recommend modifying any existing S3 stages that use this feature to instead reference storage integration objects (**Option 1** in this topic).

Option 3:
:   Configure an AWS IAM user with the required permissions to access your S3 bucket. This one-time setup involves establishing access permissions on a bucket and associating the required permissions with an IAM user. You can then access an external (i.e. S3) stage that points to the bucket with the AWS key and secret key.

This topic describes how to perform the required tasks in S3.

> **Note:**
>
> Completing the instructions in this topic requires administrative access to AWS. If you are not an AWS administrator, ask your AWS administrator to perform these tasks.

**Next Topics:**

* [Option 1: Configure a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md)
* [Option 2: Configure an AWS IAM role to access Amazon S3 — *Deprecated*](data-load-s3-config-aws-iam-role.md)
* [Option 3: Configure AWS IAM user credentials to access Amazon S3](data-load-s3-config-aws-iam-user.md)

---
title: Configuring sfsql — Obsoleted
source: https://docs.snowflake.com/en/user-guide/sfsql-install-config.md
section: User Guide
---

# Configuring sfsql — *Obsoleted*

This topic describes how to configure `sfsql`. Note that the `sfsql` installer is no longer available for download.

## Prerequisites

`sfsql` uses the Snowflake JDBC driver to connect to Snowflake. The driver
does not need to be downloaded and installed before installing `sfsql`
because the driver is automatically installed along with `sfsql`; however,
the JDBC driver requires the 64-bit version of Java 1.7 (or higher).

If the required version of Java is not installed on the client machine where
`sfsql` will be installed, it must be installed. For more information, see
[Java requirements for the JDBC Driver](../developer-guide/jdbc/java-install.md).

## Configure Client Login

`sfsql` provides various parameters for configuring client connection and login.
The `client/login.defaults` file can be used to define default connection
parameters which can be overridden on the command line when starting the client.

When you download the client from Snowflake, the following parameters are preset
in `login.defaults`:

> ```bash
> ACCOUNT=<account_name>
> GSIP=<account_name>.snowflakecomputing.com
> PORT=443
> ```
>
> where `<account_name>` is the name assigned to your account by Snowflake.

To set additional defaults, add the corresponding parameters to the file, using the
same structure/format described above. For a complete list of the defaults you can set
in the file, see [Starting and Stopping sfsql — Obsoleted](sfsql-start-stop.md).

---
title: Configuring Snowflake for Spark in Databricks
source: https://docs.snowflake.com/en/user-guide/spark-connector-databricks.md
section: User Guide
---

# Configuring Snowflake for Spark in Databricks

The Databricks version 4.2 native Snowflake Connector allows your Databricks account
to read data from and write data to Snowflake without importing any libraries.
Older versions of Databricks required importing the libraries for the Spark connector into your Databricks clusters.

The connector automatically distributes processing across Spark and Snowflake,
without requiring the user to specify the parts of the processing that should be
done on each system. Queries also benefit from Snowflake’s automatic query
pushdown optimization.

## Prerequisites

* You must have a Databricks account, and you must be using the Databricks Runtime version 4.2 or later. In addition:

  + You should have already set your Snowflake user login name and password in your Databricks secret manager; you will read the login and password back by calling `dbutils.secrets.get(...)`. For more details about the Databricks secret manager, see <https://docs.databricks.com/user-guide/secrets/index.html>
* You must have a Snowflake account. To read or write from this account, you need the following information:

  + URL for your Snowflake account.
  + Login name and password for the user who connects to the account.
  + Default database and schema to use for the session after connecting.
  + Default virtual warehouse to use for the session after connecting.
* The role used in the connection needs USAGE and CREATE STAGE privileges
  on the schema that contains the table that you will read from or write to
  via Databricks.

## Accessing Databricks Snowflake Connector Documentation

The primary documentation for the Databricks Snowflake Connector is available on the Databricks web site. That documentation includes examples showing the commands
a Scala or Python notebook uses to send data from Spark to Snowflake or vice versa.

For more details, see [Data Sources — Snowflake](https://docs.databricks.com/spark/latest/data-sources/snowflake.html).

## Preparing an External Location for Long-running Queries

If some of your jobs exceed 36 hours in length, consider preparing an
external location to use to exchange data between Snowflake and Spark.
For more information, see [Preparing an External Location For Files](spark-connector-install.md).

## Query Pushdown in Databricks

Spark queries benefit from Snowflake’s automatic query pushdown optimization, which improves performance. By default, Snowflake query pushdown is enabled in Databricks.

For more details about query pushdown, see [Pushing Spark Query Processing to Snowflake](https://www.snowflake.com/snowflake-spark-part-2-pushing-query-processing/) (Snowflake Blog).

---
title: Configuring Snowflake for Spark in Qubole
source: https://docs.snowflake.com/en/user-guide/spark-connector-qubole.md
section: User Guide
---

# Configuring Snowflake for Spark in Qubole

To configure Snowflake for Spark in Qubole, you simply add Snowflake as a Qubole data store. This topic provides step-by-step instructions for performing this task using the Qubole Data Service (QDS) UI.

> **Note:**
>
> You can also use the QDS REST API to add Snowflake as a data store. For step-by-step instructions, see
> [Adding a Snowflake Data Warehouse as a Data Store](http://docs.qubole.com/en/latest/partner-integration/snowflake-integration/add-a-snowflake-data-warehouse.html) (in the Qubole Documentation).

## Prerequisites

* You must be a QDS system administrator to add a data store.
* You must have a Qubole Enterprise edition account.
* The role used in the connection needs USAGE and CREATE STAGE privileges
  on the schema that contains the table that you will read from or write to
  via Qubole.

## Preparing an External Location for Long-running Queries

If some of your jobs exceed 36 hours in length, consider preparing an
external location to use to exchange data between Snowflake and Spark.
For more information, see [Preparing an External Location For Files](spark-connector-install.md).

## Adding Snowflake as a Data Store in the QDS UI

1. From the Home menu, click Explore.
2. In the dropdown list on the Explore page, select + Add Data Store.
3. Enter the required information in the following fields:

   * Data Store Name: Enter the name of the data store to be created.
   * Database Type: Select ‘Snowflake’.
   * Catalog Name: Enter the name of the Snowflake catalog.
   * Database Name: Enter the name of the database in Snowflake where the data is stored.
   * Warehouse Name: Enter the name of the Snowflake virtual warehouse to use for queries.
   * Host Address: Enter the base URL of your Snowflake account (e.g.
     `myorganization-myaccount.snowflakecomputing.com`). See [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md) for details on
     specifying your account identifier in this URL.
   * Username: Enter the login name for your Snowflake user (used to connect to the host).
   * Password: Enter the password for your Snowflake user (used to connect to the host).

   Note that all the values are case-sensitive, except for Host Address.
4. Click Save to create the data store.

Repeat these steps for each Snowflake database that you want to add as a data store. Or you can edit the data store to change the Snowflake database or any other properties for the data store (e.g.
change the virtual warehouse used for queries).

> **Note:**
>
> After adding a Snowflake data store, restart the Spark cluster (if you are using an already-running Spark cluster). Restarting the Spark cluster installs the `.jar` files for the Snowflake
> Connector for Spark and the Snowflake JDBC Driver.

## Verifying the Snowflake Data Store in Qubole

To verify that the Snowflake data store was created and has been activated, click on the dropdown list in the upper-left of the Explore page. A green dot indicates that the data store has
been activated.

You should also verify that the table explorer widget in the left pane of the Explore page displays all of the tables in the Snowflake database specified in the data store.

## Query Pushdown in Qubole

Spark queries benefit from Snowflake’s automatic query pushdown optimization, which improves performance. By default, Snowflake query pushdown is enabled in Qubole.

For more details about query pushdown, see [Pushing Spark Query Processing to Snowflake](https://www.snowflake.com/snowflake-spark-part-2-pushing-query-processing/) (Snowflake Blog).

---
title: Configuring Snowflake to use federated authentication
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-security-integration.md
section: User Guide
---

# Configuring Snowflake to use federated authentication

This topic describes how to configure Snowflake for federated authentication using a SAML2 security integration. This topic assumes you have
already [configured your IdP to work with Snowflake](admin-security-fed-auth-configure-idp.md).

> **Note:**
>
> A SAML2 security integration replaces the deprecated [SAML_IDENTITY_PROVIDER](../sql-reference/parameters.md) account parameter.
>
> If you have an existing SSO implementation that uses this deprecated account parameter, you should [migrate to a SAML security
> integration](admin-security-fed-auth-configure-snowflake.md) before continuing to configure Snowflake for federated
> authentication.
>
> Snowflake will continue to support the deprecated account parameter as long as there are implementations that use it.

## Create a SAML2 security integration

Snowflake uses a SAML2 security integration to integrate with the IdP you are using to implement federated authentication. Use the
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-saml2.md) command to start configuring Snowflake for SSO.

> **Before you begin:**
>
> * When you [configured an IdP](admin-security-fed-auth-configure-idp.md) for SSO, you provided a URL for the Snowflake
>   account. The [format](organizations-connect.md) of this URL must match the URLs in the `SAML2_SNOWFLAKE_ISSUER_URL` and
>   `SAML2_SNOWFLAKE_ACS_URL` properties of the security integration.
>
>   If you do not define these properties when creating the security integration, then they default to the
>   [legacy URL](admin-account-identifier.md) of the account.
> * Note that `/fed/login` is appended to the URL for the `SAML2_SNOWFLAKE_ACS_URL` property.
> * The preferred method of providing information about the IdP to Snowflake is to use the integration’s `METADATA_URL` parameter to
>   specify the IdP’s metadata URL. Snowflake uses the metadata URL to dynamically obtain the IdP’s configuration settings, including its
>   certificate.
>
>   If you define the IdP’s metadata URL, then you can run an ALTER SECURITY INTEGRATION REFRESH METADATA_URL command to refresh the IdP’s
>   configuration settings without having to change any of the integration’s parameters. This simplifies the rotation of certificates.

For example, to create a security integration that uses an account name URL with private connectivity, run the following SQL command:

```sqlexample
CREATE SECURITY INTEGRATION my_idp
  TYPE = saml2
  ENABLED = true
  METADATA_URL = 'https://integrator-26580.okta.com/app/ex2kbcS30N697/sso/saml/metadata'
  SAML2_SNOWFLAKE_ISSUER_URL = 'https://<orgname>-<account_name>.privatelink.snowflakecomputing.com'
  SAML2_SNOWFLAKE_ACS_URL = 'https://<orgname>-<account_name>.privatelink.snowflakecomputing.com/fed/login';
```

After configuring a SAML2 security integration, you can use the security integration to do the following tasks:

* Encrypt SAML Assertions
* Send Signed SAML Requests
* Specify the SAML NameID Format
* Export the SAML2 Security Integration Metadata
* Force Re-authentication to Snowflake Procedure

> **Note:**
>
> You can [use a SAML2 security integration with Client Redirect](account-replication-security-integrations.md) if your
> account is a [Business Critical Edition or higher](replication-intro.md).
>
> For more information, see [Redirecting client connections](client-redirect.md).

## Configure SSO login for users

After you have created a SAML2 security integration, you can configure whether the user starts their SSO login from the IdP or from
Snowflake.

An IdP-initiated SSO does not require configuration in Snowflake. You only need to inform your users about how to access Snowflake (e.g.
using an internal portal).

The `SAML2_ENABLE_SP_INITIATED` property enables Snowflake-initiated SSO. The `SAML2_SP_INITIATED_LOGIN_PAGE_LABEL` property
defines a string that identifies the IdP. This string appears on the Snowflake login page so users can access the IdP.

Use the `ALTER SECURITY INTEGRATION` command to set these properties:

```sqlexample
ALTER SECURITY INTEGRATION my_idp SET SAML2_ENABLE_SP_INITIATED = true;
ALTER SECURITY INTEGRATION my_idp SET SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'My IdP';
```

For information about how clients connect to Snowflake after you have configured SSO login for users, see
[Using SSO with client applications that connect to Snowflake](admin-security-fed-auth-use.md)

## Encrypt SAML assertions

The `SAML2_SNOWFLAKE_X509_CERT` property ensures that SAML2 assertions are encrypted using Snowflake’s public certificate, securing
traffic when users access Snowflake through federated authentication.

After receiving the encrypted assertions from the customer IdP, Snowflake decrypts the encrypted assertions with its private key. Snowflake
never exports or makes its private key available.

To encrypt SAML assertions, see the sections below.

### Export the public certificate from Snowflake

After you have created a SAML2 security integration, follow the steps below:

1. Execute the following SQL statement on the SAML2 integration.

   ```sqlexample
   DESC SECURITY INTEGRATION my_idp;
   ```
2. Find the `SAML2_SNOWFLAKE_X509_CERT` value in row 7, which is the public certificate in PEM format.
3. Save the value, ensuring you include the `BEGIN CERTIFICATE` and `END CERTIFICATE` delimiters. For example, the codeblock
   below contains a truncated certificate in PEM format:

   ```text
   -----BEGIN CERTIFICATE-----
   MIICr...
   -----END CERTIFICATE-----
   ```

### Create a certificate signing request (CSR) — *Optional*

By default, a SAML2 security integration in Snowflake uses a self-signed certificate for the SAML IdP to encrypt SAML assertions. If your
organization requires using a certificate issued from a Certificate Authority (CA), then complete the steps below:

1. Generate a certificate signing request (CSR) from Snowflake using the system function
   [SYSTEM$GENERATE_SAML_CSR](../sql-reference/functions/system_generate_saml_csr.md).
2. Provide the CSR to the CA of your choice so that the certificate can be issued.
3. Upload the Base64-encoded certificate into the SAML integration using the following ALTER statement, without the
   `BEGIN CERTIFICATE` and `END CERTIFICATE` delimiters.

   ```sqlexample
   ALTER SECURITY INTEGRATION my_idp SET SAML2_SNOWFLAKE_X509_CERT = 'AX2bv...';
   ```
4. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to view the updated security integration:

   > ```sqlexample
   > DESC SECURITY INTEGRATION my_idp;
   > ```

You can then upload the certificate for your private key using the CSR generated by the function into Snowflake.

### Configure Your SAML IdP

1. Upload the saved certificate in PEM format to your organization’s IdP as the SAML Encryption certificate.
2. Configure your IdP to encrypt SAML Assertions for the Snowflake service provider (SP).

## Send signed SAML requests

You can send a signed SAML request from Snowflake to the IdP to verify Snowflake as an authentic service provider. To verify Snowflake, you
can configure your IdP to use the certificate stored in the SAML2 security integration to ensure the SAML request originates from Snowflake,
not a third-party that is impersonating Snowflake.

### Set the SAML2_SIGN_REQUEST property

If you are creating a SAML2 security integration for the first time, ensure you set the
[SAML2_SIGN_REQUEST](../sql-reference/sql/create-security-integration-saml2.md) property.

If you created a SAML2 security integration without setting the `SAML2_SIGN_REQUEST` property, follow the steps below:

1. Execute the [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md) command as a user with an
   ACCOUNTADMIN role to update the security integration:

   > ```sqlexample
   > ALTER SECURITY INTEGRATION my_idp SET SAML2_SIGN_REQUEST = true;
   > ```
2. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to view the updated security integration:

   > ```sqlexample
   > DESC SECURITY INTEGRATION my_idp;
   > ```

### Configure your IdP to accept signed requests

Configure your IdP to accept signed requests from Snowflake. During the configuration, your IdP needs to have the certificate stored in the
[SAML2_SNOWFLAKE_X509_CERT](../sql-reference/sql/create-security-integration-saml2.md) property. Your IdP uses this certificate to verify
that the SAML request originates from Snowflake.

> **Note:**
>
> Snowflake is not responsible for configuring your IdP. For help with configuring your IdP, please consult your internal security
> administrator.

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command:

   > ```sqlexample
   > DESC SECURITY INTEGRATION my_idp;
   > ```
2. Save the value of the [SAML2_SNOWFLAKE_X509_CERT](../sql-reference/sql/create-security-integration-saml2.md) property in row 7 to use in
   your IdP settings.

## Specify the SAML `NameID` format

Snowflake supports allowing the administrator (i.e. user with the ACCOUNTADMIN role) to specify the SAML `NameID` format that will be
requested in the outgoing SAML authentication request sent from Snowflake to the IdP.

Specifying the SAML `NameID` format allows Snowflake to set an expectation of the identifying attribute of the user (i.e. SAML
Subject) in the SAML assertion from the IdP to ensure a valid authentication to Snowflake.

The SAML `NameID` format can be integrated into the SAML2 security integration. You can specify the SAML `NameID` in the
security integration using one of the following values:

* `urn:oasis:names:tc:SAML:1.1:nameid-format:unspecified`
* `urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress`
* `urn:oasis:names:tc:SAML:1.1:nameid-format:X509SubjectName`
* `urn:oasis:names:tc:SAML:1.1:nameid-format:WindowsDomainQualifiedName`
* `urn:oasis:names:tc:SAML:2.0:nameid-format:kerberos`
* `urn:oasis:names:tc:SAML:2.0:nameid-format:persistent`
* `urn:oasis:names:tc:SAML:2.0:nameid-format:transient`

If the SAML `NameID` format is not specified, Snowflake uses the following value:

`urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress`

### Set the SAML2_REQUESTED_NAMEID_FORMAT property

If you are creating a SAML2 security integration for the first time, ensure you set the
[SAML2_REQUESTED_NAMEID_FORMAT](../sql-reference/sql/create-security-integration-saml2.md) property.

If you created a SAML2 security integration without setting the `SAML2_REQUESTED_NAMEID_FORMAT` property, follow the steps below:

1. Execute the [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md) command as a user with an
   ACCOUNTADMIN role to specify the SAML `NameId` format:

   > ```sqlexample
   > ALTER SECURITY INTEGRATION my_idp SET SAML2_REQUESTED_NAMEID_FORMAT = '<string_literal>';
   > ```
2. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to view the updated security integration:

   > ```sqlexample
   > DESC SECURITY INTEGRATION my_idp;
   > ```

### Configure your IdP to specify the `NameID`

Configure your IdP to specify the SAML `NameID` format in SAML assertions.

> **Note:**
>
> Snowflake is not responsible for configuring your IdP. For help with configuring your IdP, please consult your internal security
> administrator.

## Export the SAML2 security integration metadata

Snowflake provides SAML 2.0 metadata for the SAML2 security integration to facilitate configuring the Snowflake service provider in your
IdP.

The SAML 2.0 metadata is contained in the `SAML2_SNOWFLAKE_METADATA` property and can be obtained by executing a
[DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command on the SAML2 security integration. For example:

```sqlexample
DESC SECURITY INTEGRATION my_idp;
```

```output
------------------------------------+---------------+-----------------------------------------------------------------------------+------------------+
              property              | property_type |                               property_value                                | property_default |
------------------------------------+---------------+-----------------------------------------------------------------------------+------------------+
SAML2_X509_CERT                     | String        | MIICr...                                                                    |                  |
SAML2_PROVIDER                      | String        | OKTA                                                                        |                  |
SAML2_ENABLE_SP_INITIATED           | Boolean       | false                                                                       | false            |
SAML2_SP_INITIATED_LOGIN_PAGE_LABEL | String        | my_idp                                                                      |                  |
SAML2_SSO_URL                       | String        | https://okta.com/sso                                                        |                  |
SAML2_ISSUER                        | String        | https://okta.com                                                            |                  |
SAML2_SNOWFLAKE_X509_CERT           | String        | MIICr...                                                                    |                  |
SAML2_REQUESTED_NAMEID_FORMAT       | String        | urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress                      |                  |
SAML2_SNOWFLAKE_ACS_URL             | String        | https://example.snowflakecomputing.com/fed/login                            |                  |
SAML2_SNOWFLAKE_ISSUER_URL          | String        | https://example.snowflakecomputing.com                                      |                  |
SAML2_SNOWFLAKE_METADATA            | String        | <md:EntityDescriptor entityID="https://example.snowflakecomputing.com"> ... |                  |
SAML2_DIGEST_METHODS_USED           | String        | http://www.w3.org/2001/04/xmlenc#sha256                                     |                  |
SAML2_SIGNATURE_METHODS_USED        | String        | http://www.w3.org/2001/04/xmldsig-more#rsa-sha256                           |                  |
------------------------------------+---------------+-----------------------------------------------------------------------------+------------------+
```

As a representative example, the formatted SAML 2.0 XML metadata from the `SAML2_SNOWFLAKE_METADATA` property is shown below. Note
that the `X509certificate` values for `signing` and `encryption` are truncated.

```xml
<md:EntityDescriptor xmlns:dsig="http://www.w3.org/2000/09/xmldsig#" xmlns:md="urn:oasis:names:tc:SAML:2.0:metadata" xmlns:xenc="http://www.w3.org/2001/04/xmlenc#" xmlns:saml="urn:oasis:names:tc:SAML:2.0:assertion" entityID="https://example.snowflakecomputing.com">
 <md:SPSSODescriptor AuthnRequestsSigned="false" protocolSupportEnumeration="urn:oasis:names:tc:SAML:2.0:protocol">
  <md:KeyDescriptor use="signing">
    <dsig:KeyInfo>
      <dsig:X509Data>
        <dsig:X509Certificate>MIICr...</dsig:X509Certificate>
      </dsig:X509Data>
    </dsig:KeyInfo>
  </md:KeyDescriptor>
  <md:KeyDescriptor use="encryption">
    <dsig:KeyInfo>
      <dsig:X509Data>
        <dsig:X509Certificate>MIICr...</dsig:X509Certificate>
      </dsig:X509Data>
    </dsig:KeyInfo>
  </md:KeyDescriptor>
  <md:AssertionConsumerService index="0" isDefault="true" Binding="urn:oasis:names:tc:SAML:2.0:bindings:HTTP-POST" Location="https://example.snowflakecomputing.com/fed/login" />
 </md:SPSSODescriptor>
</md:EntityDescriptor>
```

For reference, the following table maps the XML metadata tags to the Snowflake SAML2 security integration properties.

| XML Output | SAML2 Security Integration Property |
| --- | --- |
| entityID | SAML2_SNOWFLAKE_ISSUER_URL |
| AuthnRequestsSigned | SAML2_SIGN_REQUEST |
| Signing Certificate | SAML2_SNOWFLAKE_X509_CERT |
| Encryption Certificate | SAML2_SNOWFLAKE_X509_CERT |
| Assertion Consumer Service URL | SAML2_SNOWFLAKE_ACS_URL |

## Force re-authentication to Snowflake

Snowflake supports configuring your SAML2 security integration to require the authenticating user to re-authenticate to access Snowflake
during the initial authentication SSO flow or when a current Snowflake session expires.

When enabling this feature in the Snowflake SAML2 security integration, Snowflake sets the SAML specification `ForceAuthn` property
to `True` in the outgoing SAML request from Snowflake to the IdP. Once the IdP receives the request with `ForceAuthn` property
set to `True`, the IdP sends a request to Snowflake which results in users being prompted to re-enter their authentication credentials
(e.g. username, password) to access Snowflake.

This feature provides enhanced security through re-authentication. In addition, the re-authentication prompt allows users to input a
different set of credentials than those used to initiate SSO to access Snowflake.

> **Important:**
>
> Before implementing this feature, verify that your IdP supports switching identities during an SSO authentication flow.
>
> If this feature is implemented in Snowflake and your IdP does not support switching identities during the initial SSO authentication flow,
> users might not be able to access Snowflake using the different set of credentials provided in the re-authentication prompt.

If you are creating a SAML2 security integration for the first time, ensure you set the
[SAML2_FORCE_AUTHN](../sql-reference/sql/create-security-integration-saml2.md) property.

To update an existing SAML2 security integration to support forced re-authentication to access Snowflake, follow the steps below:

1. Execute the [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md) command to update the security
   integration:

   > ```sqlexample
   > ALTER SECURITY INTEGRATION my_idp SET SAML2_FORCE_AUTHN = true;
   > ```
2. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to view the updated security integration:

   > ```sqlexample
   > DESC SECURITY INTEGRATION my_idp;
   > ```

Where:

> `SAML2_FORCE_AUTHN = TRUE | FALSE`
> :   The Boolean indicating whether users, during the initial authentication flow, are forced to authenticate again to access Snowflake. When
>     set to `TRUE`, Snowflake sets the `ForceAuthn` SAML property to `TRUE` in the outgoing request from Snowflake to the
>     identity provider.
>
>     * `TRUE` forces users to authenticate again to access Snowflake, even if a valid session with the identity provider exists.
>     * `FALSE` does not force users to authenticate again to access Snowflake.
>
>     Default: `FALSE`.

## Custom logout endpoint

Snowflake supports defining a custom endpoint URL to redirect users to after logging out of Snowflake. The endpoint is set through the
`SAML2_POST_LOGOUT_REDIRECT_URL` property in the SAML2 security integration.

Once enabled for users who access Snowflake through SAML SSO, clicking the Log Out button in Snowsight results in
Snowflake terminating the Snowflake session and redirecting users to the specified endpoint.

> **Important:**
>
> This behavior does not apply to [Snowsight](ui-snowsight-gs.md).

Defining a logout endpoint provides administrators the option to control where users are redirected after logging out of Snowflake. For
example, a custom endpoint could trigger a script to simultaneously terminate the IdP session. The advantage of this implementation is that
both the Snowflake and IdP sessions are terminated, which forces users to re-authenticate against the IdP to access Snowflake.

If you are creating a SAML2 security integration for the first time, ensure you set the
[SAML2_POST_LOGOUT_REDIRECT_URL](../sql-reference/sql/create-security-integration-saml2.md) property.

If you created a SAML2 security integration without setting the `SAML2_POST_LOGOUT_REDIRECT_URL` property, execute the
[ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md) command to configure the custom logout endpoint:

```sqlexample
ALTER SECURITY INTEGRATION my_idp SET SAML2_POST_LOGOUT_REDIRECT_URL = 'https://logout.example.com';
```

## Manage Your SAML2 security integration

You can use an [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md) command to manage the SAML2 security
integration. For example:

* Update the X.509 certificate as a string into an existing SAML2 security integration.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp SET SAML2_X509_CERT = 'AX2bv...';
  ```
* If you are a customer who configures your IdP to verify SAML request signatures or encrypt SAML responses, then you can overwrite your
  existing private key and self-signed certificate, and generate a new private key and self-signed certificate:

  1. Generate a new private key:

     > **Caution:**
     >
     > After running the command below, SAML authentication stops working because your IdP still uses your old
     > `SAML2_SNOWFLAKE_X509_CERT` certificate. To minimize disruptions, you should run the command below when users are not as
     > active.

     ```sqlexample
     ALTER SECURITY INTEGRATION my_idp REFRESH SAML2_SNOWFLAKE_PRIVATE_KEY;
     ```
  2. Retrieve the value of the `SAML2_SNOWFLAKE_X509_CERT` property in your security integration:

     ```sqlexample
     DESCRIBE SECURITY INTEGRATION my_idp;
     SELECT "property_value" FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
       WHERE "property" = 'SAML2_SNOWFLAKE_X509_CERT';
     ```
  3. Upload the retrieved value to your IdP to replace your old certificate with your new certificate in your IdP.
* Enable signed requests.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp SET SAML2_SIGN_REQUEST = true;
  ```
* Specify the `NameID` format.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp SET SAML2_REQUESTED_NAMEID_FORMAT = 'urn:oasis:names:tc:SAML:1.1:nameid-format:unspecified';
  ```
* Update an existing security integration to enable forced re-authentication.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp SET SAML2_FORCE_AUTHN = true;
  ```
* Update an existing security integration to disable forced re-authentication.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp UNSET SAML2_FORCE_AUTHN;
  ```
* Update the custom logout endpoint.

  ```sqlexample
  ALTER SECURITY INTEGRATION my_idp SET SAML2_POST_LOGOUT_REDIRECT_URL = 'https://logout.example.com';
  ```

For more information, see [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-saml2.md).

## Replicate the SSO Configuration

Snowflake supports replication and failover/failback of the
SAML2 security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

---
title: Configuring SnowSQL
source: https://docs.snowflake.com/en/user-guide/snowsql-config.md
section: User Guide
---

# Configuring SnowSQL

This topic describes how to configure SnowSQL using connection parameters, configuration options, and variables.

## About the SnowSQL `config` file

SnowSQL supports multiple configuration files that allow organizations to define base values for connection parameters,
default settings, and variables while allowing individual users to customize their personal settings in their
own `<HOME_DIR>/.snowsql/config` files. When SnowSQL starts, it loads configuration parameter values from the following
configuration file locations in order, overriding values from files loaded previously:

* `/etc/snowsql.cnf`
* `/etc/snowflake/snowsql.cnf`
* `/usr/local/etc/snowsql.cnf`
* `<HOME_DIR>/.snowsql.cnf` (supported only for backward compatibility)
* `<HOME_DIR>/.snowsql/config`

For example, if the `/etc/snowsql.cnf` configuration file sets the `log_level` parameter to `info`, you can override this by setting the parameter to `debug` in your file `<HOME_DIR>/.snowsql/config` file.

The `snowsql` command generates a configuration file similar to the following the first time you execute the command.

```ini
[connections]
# *WARNING* *WARNING* *WARNING* *WARNING* *WARNING* *WARNING*
#
# The Snowflake user password is stored in plain text in this file.
# Pay special attention to the management of this file.
# Thank you.
#
# *WARNING* *WARNING* *WARNING* *WARNING* *WARNING* *WARNING*

#If a connection doesn't specify a value, it will default to these
#
#accountname = defaultaccount
#region = defaultregion
#username = defaultuser
#password = defaultpassword
#dbname = defaultdbname
#schemaname = defaultschema
#warehousename = defaultwarehouse
#rolename = defaultrolename
#proxy_host = defaultproxyhost
#proxy_port = defaultproxyport

[connections.example]
#Can be used in SnowSql as #connect example

accountname = my_organization-my_account
username = username
password = password1234

[variables]
# SnowSQL defines the variables in this section on startup.
# You can use these variables in SQL statements. For details, see
# https://docs.snowflake.com/en/user-guide/snowsql-use.html#using-variables

# example_variable=27

[options]
# If set to false auto-completion will not occur interactive mode.
auto_completion = True

# main log file location. The file includes the log from SnowSQL main
# executable.
log_file = ~/.snowsql/log

# bootstrap log file location. The file includes the log from SnowSQL bootstrap
# executable.
# log_bootstrap_file = ~/.snowsql/log_bootstrap

# Default log level. Possible values: "CRITICAL", "ERROR", "WARNING", "INFO"
# and "DEBUG".
log_level = INFO

# Timing of sql statements and table rendering.
timing = True

# Table format. Possible values: psql, plain, simple, grid, fancy_grid, pipe,
# orgtbl, rst, mediawiki, html, latex, latex_booktabs, tsv.
# Recommended: psql, fancy_grid and grid.
output_format = psql

# Keybindings: Possible values: emacs, vi.
# Emacs mode: Ctrl-A is home, Ctrl-E is end. All emacs keybindings are available in the REPL.
# When Vi mode is enabled you can use modal editing features offered by Vi in the REPL.
key_bindings = emacs

# OCSP Fail Open Mode.
# The only OCSP scenario which will lead to connection failure would be OCSP response with a
# revoked status. Any other errors or in the OCSP module will not raise an error.
# ocsp_fail_open = True

# Enable temporary credential file for Linux users
# For Linux users, since there are no OS-key-store, an unsecure temporary credential for SSO can be enabled by this option. The default value for this option is False.
# client_store_temporary_credential = True

# Select statement split method (default is to use the sql_split method in snowsql, which does not support 'sql_delimiter')
# sql_split = snowflake.connector.util_text # to use connector's statement_split which has legacy support to 'sql_delimiter'.

# Force the result data to be decoded in utf-8. By default the value is set to false for compatibility with legacy data. It is recommended to set the value to true.
# json_result_force_utf8_decoding = False
```

You should create these configuration files using UTF-8 encoding.

## Modify the SnowSQL configuration file

To modify the configuration file:

1. Open the SnowSQL configuration file (named `config`) in a text editor. The default location of the file is:

   Linux/macOS:
   :   `~/.snowsql/`

   Windows:
   :   `%USERPROFILE%\.snowsql\`

   > **Note:**
   >
   > You can change the default location by specifying the `--config path` command-line flag when starting SnowSQL.
2. Modify settings in the following sections:

   * Connection Parameters Section
   * Configuration Options Section
   * Variables Section

> **Attention:**
>
> * The password is stored in plain text in the `config` file. You must explicitly secure the file to restrict access. For example, in Linux or macOS, you can set the read permissions to you
>   alone by running `chmod`:
>
>   > ```bash
>   > $ chmod 700 ~/.snowsql/config
>   > ```

> **Note:**
>
> * If a value contains special characters (other than single or double quotes), you must enclose it in either single quotes or double quotes, e.g.:
>
>   ```bash
>   password = "my$^pwd"
>   ```
> * If a value contains quote characters *in addition to* other special characters, escape these quotes using the backslash (`\`) character, e.g.:
>
>   ```bash
>   password = "my$^\"\'pwd"
>   ```
>
>   Note that escaping quote characters is optional in unenclosed values (i.e. values that do not contain other special characters).
> * Multi-line values must be enclosed in three single or double quotes (`'''` or `"""`), e.g.:
>
>   ```bash
>   prompt_format='''[#FFFF00][user]@[account]
>   [#00FF00]> '''
>   ```

### Connection parameters section

In the `[connections]` section of the `config` file, optionally set the default connection parameters for SnowSQL,
e.g. account identifier, user login credentials, and the default database and warehouse.
You can also define *named* connections to make multiple simultaneous connections to Snowflake or store different sets of connection configurations.

For more information, see [Connecting through SnowSQL](snowsql-start.md).

### Configuration options section

Configure the behavior of SnowSQL by adding settings in the `[options]` section of the `config` file:

> ```bash
> [options]
> <option_name> = <option_value>
> ```

Where:

* `<option_name>` is the name of the option (case-insensitive). If an invalid name is specified, SnowSQL displays an error.
* `<option_value>` specifies a supported value (case-insensitive) for the option, as described below.

```none
+----------------------------+---------------------+------------------------------------------------------------------------------------+
| Name                       | Value               | Help                                                                               |
+----------------------------+---------------------+------------------------------------------------------------------------------------+
| auto_completion            | True                | Displays auto-completion suggestions for commands and Snowflake objects.           |
| client_session_keep_alive  | False               | Keeps the session active indefinitely, even if there is no activity from the user. |
| echo                       | False               | Outputs the SQL command to the terminal when it is executed.                       |
| editor                     | vim                 | Changes the editor to use for the !edit command.                                   |
| empty_for_null_in_tsv      | False               | Outputs an empty string for NULL values in TSV format.                             |
| environment_variables      | ['PATH']            | Specifies the environment variables that can be referenced as SnowSQL variables.   |
|                            |                     | The variable names should be comma separated.                                      |
| execution_only             | False               | Executes queries only.                                                             |
| exit_on_error              | False               | Quits when SnowSQL encounters an error.                                            |
| fix_parameter_precedence   | True                | Controls the precedence of the environment variable and the config file entries    |
|                            |                     | for password, proxy password, and private key phrase.                              |
| force_put_overwrite        | False               | Force PUT command to stage data files without checking whether already exists.     |
| friendly                   | True                | Shows the splash text and goodbye messages.                                        |
| header                     | True                | Outputs the header in query results.                                               |
| insecure_mode              | False               | Turns off OCSP certificate checks.                                                 |
| key_bindings               | emacs               | Changes keybindings for navigating the prompt to emacs or vi.                      |
| log_bootstrap_file         | ~/.snowsql/log_...  | SnowSQL bootstrap log file location.                                               |
| log_file                   | ~/.snowsql/log      | SnowSQL main log file location.                                                    |
| log_level                  | CRITICAL            | Changes the log level (critical, debug, info, error, warning).                     |
| login_timeout              | 120                 | Login timeout in seconds.                                                          |
| noup                       | False               | Turns off auto upgrading SnowSQL.                                                  |
| output_file                | None                | Writes output to the specified file in addition to the terminal.                   |
| output_format              | psql                | Sets the output format for query results.                                          |
| paging                     | False               | Enables paging to pause output per screen height.                                  |
| progress_bar               | True                | Shows progress bar while transferring data.                                        |
| prompt_format              | [user]#[warehou...] | Sets the prompt format. Experimental feature, currently not documented.            |
| sfqid                      | False               | Turns on/off Snowflake query id in the summary.                                    |
| sfqid_in_error             | False               | Turns on/off Snowflake query id in the error message.                              |
| quiet                      | False               | Hides all output.                                                                  |
| remove_comments            | False               | Removes comments before sending query to Snowflake.                                |
| remove_trailing_semicolons | True                | Removes trailing semicolons from SQL text before sending queries to Snowflake.     |
| results                    | True                | If set to off, queries will be sent asynchronously, but no results will be fetched.|
|                            |                     | Use !queries to check the status.                                                  |
| rowset_size                | 1000                | Sets the size of rowsets to fetch from the server.                                 |
|                            |                     | Set the option low for smooth output, high for fast output.                        |
| stop_on_error              | False               | Stops all queries yet to run when SnowSQL encounters an error.                     |
| syntax_style               | default             | Sets the colors for the text of SnowSQL.                                           |
| timing                     | True                | Turns on/off timing for each query.                                                |
| timing_in_output_file      | False               | Includes timing in the output file.                                                |
| variable_substitution      | False               | Substitutes variables (starting with '&') with values.                             |
| version                    | 1.1.70              | Returns SnowSQL version.                                                           |
| wrap                       | True                | Truncates lines at the width of the terminal screen.                               |
+----------------------------+---------------------+------------------------------------------------------------------------------------+
```

See SnowSQL configuration options reference (in this topic) for descriptions of all valid options.

> **Note:**
>
> In addition to setting the configuration options in the `config` file, you can set the options using either of the following methods:
>
> * While connecting to Snowflake, you can use the `-o` or `--option` connection parameter to set these options. For more information, see [Connection parameters reference](snowsql-start.md).
> * After connecting to Snowflake, you can use the `!set` command to set these options for the session. For more information, see [Commands reference](snowsql-use.md).

### Variables section

In the `[variables]` section of the `config` file, you can store values as variables for reuse. This feature enables you to use user-defined and database values in queries.

For more information, see [Using variables](snowsql-use.md).

## SnowSQL configuration options reference

Options modify the default SnowSQL behavior. You can set these options using any of the following methods:

* In the configuration file (as described in this topic).
* Using the `-o` or `--option` [parameter](snowsql-start.md) when connecting to Snowflake.
* Using the `!set` [command](snowsql-use.md) once connected to Snowflake.

> **Note:**
>
> The option names and values are case-insensitive.

### `auto_completion`

> Type:
> :   Boolean
>
> Description:
> :   Enables context-sensitive auto-completion. If enabled, functions, table names, and variables stored in SnowSQL are auto-completed in interactive mode.
>
> Default:
> :   `auto_completion=True`

### `client_session_keep_alive`

> Type:
> :   Boolean
>
> Description:
> :   Indicates whether to force a user to log in again after a period of inactivity in a JDBC or ODBC session. When set to `True`, Snowflake keeps the session active indefinitely, even if there is no activity from the user. When set to `False`, the user must log in again after four hours of inactivity.
>
> Default:
> :   `client_session_keep_alive=False`

### `echo`

> Type:
> :   Boolean
>
> Description:
> :   Echoes local input. When set to `True`, echoes to both `stdout` and the output file.
>
> Default:
> :   `echo=False`

### `editor`

> Type:
> :   String (constant)
>
> Description:
> :   Specifies the editor to invoke when the `!edit` command is issued in SnowSQL. Supported values:
>
>     * `emacs`
>     * `vi`
>     * `vim`
>
> Default:
> :   `editor=vim`

### `empty_for_null_in_tsv`

> Type:
> :   Boolean
>
> Description:
> :   If enabled, when `output_format` is set to `TSV`, SnowSQL outputs an empty string for each NULL value.
>
> Example:
> :   `empty_for_null_in_tsv=True`

### `environment_variables`

> Type:
> :   List
>
> Description:
> :   Specifies the environment variables to be set in the SnowSQL variables.
>
> Example:
> :   `environment_variables=PATH,USER,AWS_ACCESS_KEY_ID,AWS_SECRET_ACCESS_KEY`

### `execution_only`

> Type:
> :   Boolean
>
> Description:
> :   If enabled, SnowSQL executes queries without fetching data. This option is useful when you only want to measure execution times. Note that returned values include any network latency and
>     are not pure server-side execution times.
>
> Example:
> :   `execution_only=True`

### `exit_on_error`

> Type:
> :   Boolean
>
> Description:
> :   If enabled, SnowSQL exits when an error occurs. This behavior is useful to stop running queries when an error is encountered.
>
> Example:
> :   `exit_on_error=True`

### `fix_parameter_precedence`

> Type:
> :   Boolean
>
> Description:
> :   Controls the precedence among the possible sources of the password, proxy password, and private key
>     phrase parameters.
>
>     If the value is True, the precedence (from highest to lowest) is:
>
>     * The environment variable or the SnowSQL command-line parameter.
>     * The connection-specific connection parameters, which are the parameters in the config file’s named connection
>       section, e.g. the section `[connections.myconnection]`.
>     * The default connection parameters, which are the parameters in the `[connections]` section of the config file.
>
>     If the value is False, the precedence (from highest to lowest) is:
>
>     * The connection-specific connection parameters, which are the parameters in the config file’s named connection
>       section, e.g. the section `[connections.myconnection]`.
>     * The environment variable or the SnowSQL command-line parameter.
>     * The default connection parameters, which are the parameters in the `[connections]` section of the config file.
>
> Default:
> :   True

### `force_put_overwrite`

> Type:
> :   Boolean
>
> Description:
> :   If enabled, SnowSQL forces the PUT command to upload (i.e. stage) data files from a local directory/folder on a client machine to the specified internal (i.e. Snowflake) stage without checking whether the files already exist in the stage. If the files are already present in the destination stage, the PUT command overwrites the existing files.
>
> Default:
> :   `force_put_overwrite=False`

### `friendly`

> Type:
> :   Boolean
>
> Description:
> :   If disabled, SnowSQL suppresses the startup and exit messages.
>
> Default:
> :   `friendly=True`

### `header`

> Type:
> :   Boolean
>
> Description:
> :   Displays the header in the results table rendered by SnowSQL. Disabling this option is useful when you want to retrieve data-only in the results.
>
>     Can be used with `output_format` and `timing` to produce data-only output.
>
> Default:
> :   `header=True`

### `insecure_mode`

> Type:
> :   Boolean
>
> Description:
> :   Skips the certificate revocation checks using the Online Certificate Status Protocol (OCSP). This option could be used in an emergency situation in which no OCSP service is accessible.
>     Snowflake strongly recommends that you do not enable this option unless directed by Snowflake Support.
>
> Default:
> :   `insecure_mode=False`

### `key_bindings`

> Type:
> :   String (constant)
>
> Description:
> :   Key bindings to use. Possible values:
>
>     * `emacs`: `CTRL` + `a` is home, `CTRL` + `e` is end. All Emacs key bindings for the REPL
>       environment are available.
>     * `vi`: You can use all modal editing features offered by vi in the REPL environment.
>
> Note:
> :   The value cannot be changed by `!set` command during the SnowSQL session. Instead, set the value in the configuration file or on the command line when connecting to Snowflake.
>
> Default:
> :   `key_bindings=vi`

### `log_bootstrap_file`

> Type:
> :   String (path)
>
> Description:
> :   Bootstrap log file location. If not specified, `log_file` is used as the base name followed by `_bootstrap`. For example, by default, the log file name is `log_bootstrap`.
>
> Default:
> :   `log_bootstrap_file=~/.snowsql/bootlog`

### `log_file`

> Type:
> :   String (path)
>
> Description:
> :   log_file location.
>
> > **Note:**
> >
> > You must have permissions to write to the log file’s parent directory or to modify the location of the log file.
>
> Default:
> :   `log_file=~/.snowsql/log`

### `log_level`

> Type:
> :   String (constant)
>
> Description:
> :   Default log level. Possible values: `CRITICAL`, `ERROR`, `WARNING`, `INFO`, `DEBUG`.

### `login_timeout`

> Type:
> :   Number
>
> Description:
> :   Login timeout in seconds.
>
> Default:
> :   `login_timeout=120`

### `noup`

> Type:
> :   Boolean
>
> Description:
> :   Prevents SnowSQL from downloading and installing a new version if `True`. By default, SnowSQL auto-upgrades to the latest version if no version is specified.
>
> Default:
> :   `noup=True`

### `output_file`

> Type:
> :   String (path and file name)
>
> Description:
> :   Writes output to the specified file in addition to the terminal output.
>
> Default:
> :   None

### `output_format`

> Type:
> :   String (constant)
>
> Description:
> :   Specifies the format of the results displayed in the terminal. Possible values:
>
>     * `csv`
>     * `expanded`
>     * `fancy_grid`
>     * `grid`
>     * `html`
>     * `json`
>     * `latex`
>     * `latex_booktabs`
>     * `mediawiki`
>     * `orgtbl`
>     * `pipe`
>     * `plain`
>     * `psql`
>     * `rst`
>     * `simple`
>     * `tsv`
>
>     Recommended values for tabular results: `psql` , `grid`, or `fancy_grid`
>
>     Recommended values for data-only results (used in combination with `header`, `timing`, and `friendly` set to `False`): `plain` , `csv`, or `tsv`
>
> Default:
> :   `output_format=psql`

### `paging`

> Type:
> :   Boolean
>
> Description:
> :   When enabled, pauses output per screen height. This feature is useful for browsing large result sets. To scroll down, press the **[ENTER]/[RETURN]** key.
>
> Default:
> :   `paging=False`

### `progress_bar`

> Type:
> :   Boolean
>
> Description:
> :   Shows progress bar while transferring data.
>
> Default:
> :   `progress_bar=True`

### `prompt_format`

> Type:
> :   string
>
> Description:
> :   Changes the SnowSQL prompt format.
>
>     The SnowSQL prompt dynamically displays the current user, warehouse, database, and schema by default. Dynamic tokens are written as [<token>], e.g. [user] or [warehouse]. You can change the Snowflake object order, delimiter,
>     and color. Change the object color by defining a pygments token in brackets.
>
>     For example, change the object order to user, database and schema, then warehouse. Change the delimiter to a period. Change the [user] object name to red, the [database] and [schema] names to green, and the [warehouse] name to blue:
>
>     > ```bash
>     > prompt_format="[#FF0000][user]@[#00FF00][database][schema][#0000FF][warehouse]"
>     > ```
>
>     Put quotes around the value to prevent “#” characters from being interpreted as the start of a comment.
>
> Default:
> :   `None`

### `quiet`

> Type:
> :   Boolean
>
> Description:
> :   Removes all output data from the terminal, but continues to display error messages and diagnostic data.
>
> Default:
> :   `quiet=True`

### `remove_comments`

> Type:
> :   Boolean
>
> Description:
> :   Removes comments from the output.
>
> Default:
> :   `remove_comments=False`

### `remove_trailing_semicolons`

> Type:
> :   Boolean
>
> Description:
> :   Removes trailing semicolons from SQL text before sending queries to Snowflake. Note that removing the semicolons can prevent Snowflake from using cached results from different clients when
>     the [USE_CACHED_RESULT](../sql-reference/parameters.md) session parameter is enabled.
>
> Default:
> :   `remove_trailing_semicolons=True`

### `results`

> Type:
> :   Boolean
>
> Description:
> :   Returns the query results. If `False`,the query is executed asynchronously, no result including any error messages is returned.
>
> Default:
> :   `results=True`

### `rowset_size`

> Type:
> :   Number
>
> Description:
> :   Number of rows to fetch at once in interactive mode. Results are then fetched for output one rowset at a time.
>
> Default:
> :   `rowset_size=1000`

### `sfqid`

> Type:
> :   Boolean
>
> Description:
> :   Includes the Snowflake query ID in the result summary.
>
>     **Note**: You must also set `timing_in_output_file=True` to add `sqfid` to the spool file.
>
> Default:
> :   `sfqid=False`

### `sfqid_in_error`

> Type:
> :   Boolean
>
> Description:
> :   Includes the Snowflake query ID in error messages.
>
> Default:
> :   `sfqid_in_error=False`

### `stop_on_error`

> Type:
> :   Boolean
>
> Description:
> :   When an error is encountered, stops query execution, but does not exit.
>
> Default:
> :   `stop_on_error=False`

### `syntax_style`

> Type:
> :   String (constant)
>
> Description:
> :   Sets the text colors for SnowSQL. Currently, the only supported value is `default`.
>
> Default:
> :   `syntax_style=default`

### `timing`

> Type:
> :   Boolean
>
> Description:
> :   Specifies whether to display the number of rows produced and elapsed time for SQL statements that have executed. This information is displayed as a line of text under the results table rendered by SnowSQL. If set to `False`, the line of text under the results table is not displayed.
>
>     Can be used in conjunction with `header` and `output_format` to produce data-only output.
>
> Default:
> :   `timing=True`

### `timing_in_output_file`

> Type:
> :   Boolean
>
> Description:
> :   Specifies whether to include the execution time details in the output file, if the `output_file` option is configured. Requires also setting the `timing` option to `True`.
>
>     If set to `False`, the line of text under the results table is not included in the output file.
>
> Default:
> :   `timing_in_output_file=False`

### `variable_substitution`

> Type:
> :   Boolean
>
> Description:
> :   Substitutes variables with the values. See [Using variables](snowsql-use.md).
>
> Default:
> :   `variable_substitution=False`

### `wrap`

> Type:
> :   Boolean
>
> Description:
> :   Wraps the output by the terminal width. If `False`, the outputs are truncated.
>
> Default:
> :   `wrap=True`

---
title: Connect to Snowflake Open Catalog with External OAuth
source: https://docs.snowflake.com/en/user-guide/opencatalog/external-oauth-connect.md
section: User Guide
---

# Connect to Snowflake Open Catalog with External OAuth

This topic describes how to connect to Snowflake Open Catalog with External OAuth using a client application.

The example code in this topic shows how to connect using Apache Spark™, and the example code is in PySpark.

> **Note:**
>
> If you’re using Snowflake to query Open Catalog-managed tables, you can create a catalog integration for Snowflake that uses External OAuth.
> For more information, see [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-open-catalog)
> in the Snowflake documentation.

## Prerequisites

Before you can connect to Open Catalog with External OAuth, you need to configure External OAuth in Open Catalog. For
instructions, see [Configure External OAuth in Snowflake Open Catalog](external-oauth-configure.md).

## Connect with Open Catalog by using automatic refresh token (Preferred method)

Use this method to connect by using an automatic refresh token so you don’t have to manually refresh the token.

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.rest.auth.type','oauth2') \
    .config('spark.sql.catalog.opencatalog.oauth2-server-uri','<oauth2_server_uri>') \
    .config('spark.sql.catalog.opencatalog.credential','<oauth_client_id>:<oauth_client_secret>') \
    .config('spark.sql.catalog.opencatalog.scope','SESSION:ROLE:<custom_role>') \
    .config('spark.sql.catalog.opencatalog.audience','https://<open_catalog_account_identifier>.snowflakecomputing.com') \
    .getOrCreate()
```

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account.   Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<oauth2_server_uri>` | Your OAuth2 server URI. |
| `<oauth_client_id>` | Your OAuth2 client ID. |
| `<oauth_client_secret>` | Your OAuth2 client secret. |
| `<custom_role>` | The name of the custom role in Open Catalog whose privileges you want to grant to the service principal. |

## Connect with Open Catalog by using an access token

If needed, you can connect with Open Catalog by using an access token. However, the access token will expire and you’ll need to manually
refresh it. Alternatively, you can connect by using an automatic refresh token.

The following example code is for connecting with Open Catalog by using Spark.

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<access_token>` | Specifies the access token for the client application to use.   Enter the [access token that you generated when you configured External OAuth in Open Catalog](external-oauth-configure.md). |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account.   Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','<access_token>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .getOrCreate()
```

## Connect with a cross-region connection (Amazon S3 only)

The following example code is for connecting to Open Catalog when the following is true:

* Your Open Catalog account is hosted on Amazon S3.
* Your external storage provider is Amazon S3.
* Your Open Catalog account is hosted in an S3 region that is different from the S3 region where the storage bucket containing your Apache Iceberg™ tables is located.

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','<access_token>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.client.region','<target_s3_region>') \
    .getOrCreate()
```

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<access_token>` | Specifies the access token for the client application to use.   Enter the [access token that you generated when you configured External OAuth in Open Catalog](external-oauth-configure.md). |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account. Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<target_s3_region>` | Specifies the region code where the S3 bucket containing your Apache Iceberg tables is located. For the region codes, see [AWS service endpoints](https://docs.aws.amazon.com/general/latest/gr/s3.html#s3_region) and refer to the Region column in the table. |

## Examples

This section contains examples of connecting to Open Catalog using Spark:

* Example 1: Connect (S3)
* Example 2: Connect (Cloud Storage from Google)
* Example 3: Connect (Azure)

### Example 1: Connect (S3)

See:

* Connect by using automatic refresh (S3)
* Connect by using access token (S3)

#### Connect by using automatic refresh (S3)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.rest.auth.type','oauth2') \
    .config('spark.sql.catalog.opencatalog.oauth2-server-uri','your-tenant.region.auth0.com') \
    .config('spark.sql.catalog.opencatalog.credential','11111111111111111111111111111111:222222222222222222222222222222222222222222222222222222222222222222') \
    .config('spark.sql.catalog.opencatalog.scope','SESSION:ROLE:DATA_ENG') \
    .config('spark.sql.catalog.opencatalog.audience','https://ab12345.snowflakecomputing.com') \
    .getOrCreate()
```

#### Connect by using access token (S3)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

### Example 2: Connect (Cloud Storage from Google)

See:

* Connect by using automatic refresh (Cloud Storage from Google)
* Connect by using access token (Cloud Storage from Google)

#### Connect by using automatic refresh (Cloud Storage from Google)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-gcp-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.rest.auth.type','oauth2') \
    .config('spark.sql.catalog.opencatalog.oauth2-server-uri','your-tenant.region.auth0.com') \
    .config('spark.sql.catalog.opencatalog.credential','11111111111111111111111111111111:222222222222222222222222222222222222222222222222222222222222222222') \
    .config('spark.sql.catalog.opencatalog.scope','SESSION:ROLE:DATA_ENG') \
    .config('spark.sql.catalog.opencatalog.audience','https://ab12345.snowflakecomputing.com') \
    .getOrCreate()
```

#### Connect by using access token (Cloud Storage from Google)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-gcp-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

### Example 3: Connect (Azure)

See:

* Connect by using automatic refresh (Azure)
* Connect by using access token (Azure)

#### Connect by using automatic refresh (Azure)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-azure-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.rest.auth.type','oauth2') \
    .config('spark.sql.catalog.opencatalog.oauth2-server-uri','your-tenant.region.auth0.com') \
    .config('spark.sql.catalog.opencatalog.credential','11111111111111111111111111111111:222222222222222222222222222222222222222222222222222222222222222222') \
    .config('spark.sql.catalog.opencatalog.scope','SESSION:ROLE:DATA_ENG') \
    .config('spark.sql.catalog.opencatalog.audience','https://ab12345.snowflakecomputing.com') \
    .getOrCreate()
```

#### Connect by using access token (Azure)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-azure-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

## Verify the connection to Open Catalog

To verify that Spark is connected to Open Catalog, list the namespaces for the catalog. For more information,
see [List namespaces](spark-code-examples.md).

---
title: Connect to Snowflake Open Catalog with key pair authentication
source: https://docs.snowflake.com/en/user-guide/opencatalog/key-pair-auth-connect.md
section: User Guide
---

# Connect to Snowflake Open Catalog with key pair authentication

This topic describes how to connect to Snowflake Open Catalog with key pair authentication using a client application.

The example code in this topic shows how to connect using Apache Spark™, and the example code is in PySpark.

## Prerequisites

Before you can connect to Open Catalog with key pair authentication, you need to configure key pair authentication in Open Catalog. For
instructions, see [Configure key pair authentication in Snowflake Open Catalog](key-pair-auth-configure.md).

## Connect with Open Catalog

The following example code is for connecting with Open Catalog by using Spark.

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<access_token>` | Specifies the access token for the client application to use.   Enter the [access token that you generated when you configured key pair authentication in Open Catalog](key-pair-auth-configure.md). |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account.   Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','<access_token>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .getOrCreate()
```

## Connect with a cross-region connection (Amazon S3 only)

The following example code is for connecting to Open Catalog when the following is true:

* Your Open Catalog account is hosted on Amazon S3.
* Your external storage provider is Amazon S3.
* Your Open Catalog account is hosted in an S3 region that is different from the S3 region where the storage bucket containing your Apache Iceberg™ tables is located.

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','<access_token>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.client.region','<target_s3_region>') \
    .getOrCreate()
```

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<access_token>` | Specifies the access token for the client application to use.   Enter the [access token that you generated when you configured key pair authentication in Open Catalog](key-pair-auth-configure.md). |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account. Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<target_s3_region>` | Specifies the region code where the S3 bucket containing your Apache Iceberg tables is located. For the region codes, see [AWS service endpoints](https://docs.aws.amazon.com/general/latest/gr/s3.html#s3_region) and refer to the Region column in the table. |

## Examples

This section contains examples of connecting to Open Catalog using Spark:

* Example 1: Connect (S3)
* Example 2: Connect (Cloud Storage from Google)
* Example 3: Connect (Azure)

### Example 1: Connect (S3)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

### Example 2: Connect (Cloud Storage from Google)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-gcp-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

### Example 3: Connect (Azure)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-azure-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.token','0000000000000000000000000001111111111111111111111111111111111111111111') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .getOrCreate()
```

## Verify the connection to Open Catalog

To verify that Spark is connected to Open Catalog, list the namespaces for the catalog. For more information,
see [List namespaces](spark-code-examples.md).

---
title: Connecting through SnowSQL
source: https://docs.snowflake.com/en/user-guide/snowsql-start.md
section: User Guide
---

# Connecting through SnowSQL

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

This topic describes how to connect to Snowflake by entering connection parameters manually. The topic then explains how to configure a default connection for ease of use, as well as one or more
*named connections* to use alternative connection settings or create multiple concurrent sessions.

> **Note:**
>
> Snowflake does not support running multiple instances of SnowSQL simultaneously on the same machine. For example, you cannot open two MacOS terminals or Linux shell applications and run `snowsql` in both at the same time.

## Connection syntax

```bash
$ snowsql <connection_parameters>
```

Where `<connection_parameters>` are one or more of the following. For detailed descriptions of each parameter, see Connection parameters reference (in this topic).

| Parameter | Description |
| --- | --- |
| `-a, --accountname TEXT` | Your [account identifier](gen-conn-config.md). Honors $SNOWSQL_ACCOUNT. |
| `-u, --username TEXT` | Username to connect to Snowflake. Honors $SNOWSQL_USER. |
| `-d, --dbname TEXT` | Database to use. Honors $SNOWSQL_DATABASE. |
| `-s, --schemaname TEXT` | Schema in the database to use. Honors $SNOWSQL_SCHEMA. |
| `-r, --rolename TEXT` | Role name to use. Honors $SNOWSQL_ROLE. |
| `-w, --warehouse TEXT` | Warehouse to use. Honors $SNOWSQL_WAREHOUSE. |
| `-h, --host TEXT` | Host address for the connection. Honors $SNOWSQL_HOST. |
| `-p, --port INTEGER` | Port number for the connection. Honors $SNOWSQL_PORT. |
| `--region TEXT` | Region. Honors $SNOWSQL_REGION. (Deprecated; use -a or –accountname instead) |
| `-m, --mfa-passcode TEXT` | Token to use for multi-factor authentication (MFA) |
| `--mfa-passcode-in-password` | Appends the MFA passcode to the end of the password. |
| `--abort-detached-query` | Aborts a query if the connection between the client and server is lost. By default, it won’t abort even if the connection is lost. |
| `--probe-connection` | Test connectivity to Snowflake. This option is mainly used to print out the TLS (Transport Layer Security) certificate chain. |
| `--proxy-host TEXT` | (DEPRECATED. Use HTTPS_PROXY and HTTP_PROXY environment variables.) Proxy server hostname. Honors $SNOWSQL_PROXY_HOST. |
| `--proxy-port INTEGER` | (DEPRECATED. Use HTTPS_PROXY and HTTP_PROXY environment variables.) Proxy server port number. Honors $SNOWSQL_PROXY_PORT. |
| `--proxy-user TEXT` | (DEPRECATED. Use HTTPS_PROXY and HTTP_PROXY environment variables.) Proxy server username. Honors $SNOWSQL_PROXY_USER. Set $SNOWSQL_PROXY_PWD for the proxy server password. |
| `--authenticator TEXT` | Authenticator: ‘snowflake’, ‘externalbrowser’ (to use any IdP and a web browser), <https:/>/<okta_account_name>.okta.com (to use Okta natively), ‘workload_idenity’ or ‘oauth’ to authenticate using OAuth. |
| `-v, --version` | Shows the current SnowSQL version, or uses a specific version if provided as a value. |
| `--noup` | Disables auto-upgrade for this run. If no version is specified for -v, the latest version in ~/.snowsql/ is used. |
| `-D, --variable TEXT` | Sets a variable to be referred by &<var>. -D tablename=CENUSTRACKONE or –variable db_key=$DB_KEY |
| `-o, --option TEXT` | Set SnowSQL options. See the options reference in the Snowflake documentation. |
| `-f, --filename PATH` | File to execute. |
| `-q, --query TEXT` | Query to execute. |
| `--query_tags TEXT` | Tags to use when running queries. By default, `--query_tag` reads the value of the `SNOWSQL_QUERY_TAG` environment variable. |
| `--config PATH` | Path and name of the SnowSQL configuration file. By default, ~/.snowsql/config. |
| `-P, --prompt` | Forces an interactive password prompt to allow you to specify a password that differs from the one stored in the $SNOWSQL_PWD environment variable. |
| `-M, --mfa-prompt` | Forces a prompt for the second token for MFA. |
| `-c, --connection TEXT` | Named set of connection parameters to use. |
| `--single-transaction` | Connects with autocommit disabled. Wraps BEGIN/COMMIT around statements to execute them as a single transaction, ensuring all commands complete successfully or no change is applied. |
| `--private-key-path PATH` | Path to private key file. |
| `--oauth-client-id` | Value of client id provided by the identity provider for Snowflake integration. |
| `--oauth-redirect-uri` | URI to use for authorization code redirection. |
| `--oauth-authorization-url` | Identity provider endpoint supplying the authorization code to the driver. |
| `--oauth-token-request-url` | Identity provider endpoint supplying the access tokens to the driver. |
| `--oauth-scope` | Scope requested in the identity provider authorization request. |
| `--oauth-disable-pkce` | Disables Proof Key for Code Exchange (PKCE). Default: `False`. |
| `--oauth-enable-refresh-tokens` | Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`. |
| `--oauth-enable-single-use-refresh-tokens` | Whether to opt-in to single-use refresh token semantics. Default: `False`. |
| `--workload-identity-provider` | workload identity provider [AWS|AZURE|GCP|OIDC]. By default auto discovery is performed. |
| `--disable-request-pooling` | Disables connection pooling. |
| `-U, --upgrade` | Force upgrade of SnowSQL to the latest version. |
| `-K, --client-session-keep-alive` | Keep the session active indefinitely, even if there is no activity from the user. |
| `--include_connector_version` | Display the version of the Snowflake Connector for Python software that is packaged in the SnowSQL binary. |
| `-?, --help` | Show this message and exit. |

### Specifying passwords when connecting

Passwords cannot be passed through connection parameters. Passwords must be specified in one of the following ways:

* Entered via interactive prompt in SnowSQL (applies to passwords only).
* Defined in the SnowSQL configuration file using the `password` option. For details, see Configuring Default Connection Settings (in this topic).
* Specified using the `SNOWSQL_PWD` environment variables. For details, see Using Environment Variables (in this topic).

> **Note:**
>
> In Windows environments, the Cygwin terminal doesn’t prompt for your account identifier, username, or password. This is because
> SnowSQL cannot enable TTY mode in Cygwin terminals.

### Using environment variables

Currently, environment variables can only be used to pre-specify some command-line parameter values such as password, host, and database. Environment variables are not available to use in SnowSQL variable substitution unless they are explicitly specified on the command line when starting SnowSQL, using either the `-D` or `--variable` connection parameter. For example:

Linux/macOS:
:   ```bash
    $ snowsql ... -D tablename=CENUSTRACKONE --variable db_key=$DB_KEY
    ```

Windows:
:   ```bash
    $ snowsql ... -D tablename=CENUSTRACKONE --variable db_key=%DB_KEY%
    ```

In the above example, `--variable` sets a Snowflake variable named `db_key` to the `DB_KEY` environment variable.

## Configuring default connection settings

We recommend configuring your default connection parameters to simplify the connection process. Thereafter, when connecting to
Snowflake, you can omit your Snowflake account identifier, username, and any other parameters you have configured as your default
values.

To configure your default settings:

1. Open the [SnowSQL configuration file](snowsql-config.md) (named `config`) in a text editor. The default
   location of the file is:

   Linux/macOS:
   :   `~/.snowsql/`

   Windows:
   :   `%USERPROFILE%\.snowsql\`

   > **Note:**
   >
   > You can change the default location by specifying the `--config path` command-line flag when starting SnowSQL.

1. In the `[connections]` section, configure the default connection parameters by removing the comment symbol from any of
   the following parameters and specifying the correct values. For information on these settings, see
   [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md).

   > ```ini
   > [connections]
   > #accountname = <string>   # Account identifier to connect to Snowflake (for example, myorganization-myaccount).
   > #username = <string>      # User name in the account. Optional.
   > #password = <string>      # User password. Optional.
   > #dbname = <string>        # Default database. Optional.
   > #schemaname = <string>    # Default schema. Optional.
   > #warehousename = <string> # Default warehouse. Optional.
   > #rolename = <string>      # Default role. Optional.
   > #authenticator = <string> # Authenticator: 'snowflake', 'externalbrowser' (to use any IdP and a web browser),  https://<okta_account_name>.okta.com (to use Okta natively), 'oauth' to authenticate using OAuth.
   > ```

   > **Attention:**
   > * The password is stored in plain text in the `config` file. You must explicitly secure the file to restrict access. For example, in Linux or macOS, you can set the read permissions to you
   >   alone by running `chmod`:
   >
   >   > ```bash
   >   > $ chmod 700 ~/.snowsql/config
   >   > ```
   > * If your password includes special characters, you must enclose the password in either single quotes or double quotes.

## Verifying the network connection to Snowflake with SnowCD

After configuration, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

## Using named connections

To make multiple simultaneous connections to Snowflake, or to simply store different sets of connection configurations, you can define one or more *named* connections.

### Defining named connections in the configuration file

1. Open the `config` configuration file in a text editor. By default, the file is located in:

   Linux/macOS:
   :   `~/.snowsql/`

   Windows:
   :   `%USERPROFILE%\.snowsql\`
2. Add a separate `[connections]` section with a unique name for each named connection.

   For example, the following illustrates a connection named `my_example_connection` for a Snowflake account with the
   [account identifier](gen-conn-config.md) `myorganization-myaccount`:

   > ```ini
   > [connections.my_example_connection]
   > accountname = myorganization-myaccount
   > username = jsmith
   > password = xxxxxxxxxxxxxxxxxxxx
   > dbname = mydb
   > schemaname = public
   > warehousename = mywh
   > ```

### Connecting to Snowflake using a named connection

Use the `-c <string>` (or `--connection <string>`) connection parameter to specify a named connection, where `<string>` is the name of a connection defined in the
[configuration file](snowsql-config.md).

For example, connect using the `my_example_connection` connection you created in Defining Named Connections in the Configuration File (in this topic):

> ```bash
> $ snowsql -c my_example_connection
> ```

## Using key-pair authentication and key-pair rotation

SnowSQL supports key pair authentication and key rotation. You can use unencrypted or encrypted
key pairs.

> **Caution:**
>
> While unencrypted private keys are supported, Snowflake strongly recommends using encrypted private keys
> when connecting to Snowflake. Unencrypted private keys have no protection against unauthorized use if any
> unauthorized person gains access to them.

The following procedure presumes you use the recommended encrypted key pair authentication:

1. To start, follow the instructions to configure [Key-pair authentication and key-pair rotation](key-pair-auth.md).
2. Specify the path to the private key file either in the configuration file or on the command line:

> * In the configuration file:
>
>   + Add the `private_key_path` connection parameter to your connection settings and specify the local path to the private key file you created. The syntax is not OS-specific:
>
>     Supported OS:
>     :   ```bash
>         private_key_path = <path>/rsa_key.p8
>         ```
>   + Use the `SNOWSQL_PRIVATE_KEY_PASSPHRASE` environment variable to set the passphrase for decrypting the private key file.
>     Note that you do not enclose the passphrase in quotes for Linux or MacOS but must use single or double quotes for Windows:
>
>     Linux/macOS:
>     :   ```bash
>         export SNOWSQL_PRIVATE_KEY_PASSPHRASE=<passphrase>
>         ```
>
>     Windows:
>     :   ```bash
>         set SNOWSQL_PRIVATE_KEY_PASSPHRASE=<passphrase>
>         ```
> * On the command line:
>
>   Include the `private-key-path` connection parameter and specify the path to your encrypted private key file:
>
>   > ```bash
>   > $ snowsql -a <account_identifier> -u <user> --private-key-path <path>/rsa_key.p8
>   > ```
>
>   SnowSQL prompts you for the passphrase. Alternatively, use the `SNOWSQL_PRIVATE_KEY_PASSPHRASE` environment variable to set the passphrase for decrypting the private key file (as described above).

## Using the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials.

The following sample configuration file shows how to use this flow:

```toml
[connections.oauth]
authenticator = "OAUTH_AUTHORIZATION_CODE"
username = "user"
accountname = "account"
oauth_client_id = "client_id"
oauth_client_secret = "client_secret"
oauth_redirect_uri = "http://localhost:8001/snowflake/oauth-redirect"
oauth_scope = "session:role:PUBLIC"
```

## Using the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data.

The following sample configuration file shows how to use this flow:

```toml
[connections.oauth]
authenticator = "OAUTH_CLIENT_CREDENTIALS"
username = "user"
accountname = "account"
oauth_client_id = "client_id"
oauth_client_secret = "client_secret"
oauth_token_request_url = "http://identity.provider.com/token"
oauth_scope = "session:role:PUBLIC"
```

## Using a proxy server

To use a proxy server, configure the following environment variables:

* HTTP_PROXY
* HTTPS_PROXY
* NO_PROXY

For example:

Linux/macOS:
:   ```bash
    export HTTP_PROXY='http://username:password@proxyserver.example.com:80'
    export HTTPS_PROXY='http://username:password@proxyserver.example.com:80'
    ```

Windows:
:   ```bash
    set HTTP_PROXY=http://username:password@proxyserver.example.com:80
    set HTTPS_PROXY=http://username:password@proxyserver.example.com:80
    ```

> **Tip:**
>
> Snowflake does not support configurations involving intercepting HTTPS proxies that present a Transport Layer Security (TLS)
> certificate other than the one issued by Snowflake. Avoiding this configuration helps reduce potential security risks
> such as a MITM (Man In The Middle) attack through a compromised proxy.
>
> If you must use your TLS proxy, Snowflake strongly recommends that you update the server policy to pass through the
> Snowflake certificate such that no certificate is altered in the middle of communications.
>
> Optionally, `NO_PROXY` can be used to bypass the proxy for specific communications. For example, Amazon S3 access can be bypassed by specifying `NO_PROXY=".amazonaws.com"`.

## Using a web browser for federated authentication/SSO

To use [browser-based SSO authentication](admin-security-fed-auth-use.md) for SnowSQL, add `--authenticator externalbrowser` to your SnowSQL connection parameters:

For example:

> ```bash
> $ snowsql -a <account_identifier> -u <username> --authenticator externalbrowser
> ```

For more information about federated authentication/SSO, see [Managing/Using federated authentication](admin-security-fed-auth-use.md).

## Verifying the OCSP connector or driver version

Snowflake uses OCSP to evaluate the certificate chain when making a connection to Snowflake. The driver or connector version and its configuration both determine the OCSP behavior. For more information about the driver or connector version, their configuration, and OCSP behavior, see [OCSP Configuration](ocsp.md).

## OCSP response cache server

> **Note:**
>
> The OCSP response cache server is currently supported by SnowSQL 1.1.55 and higher.

Snowflake clients initiate every connection to a Snowflake service endpoint with a “handshake” that establishes a secure connection before actually transferring data. As part of the handshake, a
client authenticates the TLS certificate for the service endpoint. The revocation status of the certificate is checked by sending a client certificate request to one of the OCSP
(Online Certificate Status Protocol) servers for the CA (certificate authority).

A connection failure occurs when the response from the OCSP server is delayed beyond a reasonable time. The following caches persist the revocation status, helping alleviate these issues:

* Memory cache, which persists for the life of the process.
* File cache, which persists until the cache directory (e.g. `~/.cache/snowflake` or `~/.snowsql/ocsp_response_cache`) is purged.
* Snowflake OCSP response cache server, which fetches OCSP responses from the CA’s OCSP servers hourly and stores them for 24 hours. Clients can then request the validation status of a given Snowflake
  certificate from this server cache.

  > **Important:**
  >
  > If your server policy denies access to most or all external IP addresses and web sites, you must allowlist the cache server
  > address to allow normal service operation. The cache server hostname is `ocsp*.snowflakecomputing.com:80`.

  If you need to disable the cache server for any reason, set the `SF_OCSP_RESPONSE_CACHE_SERVER_ENABLED` environment variable to `false`. Note that the value is case-sensitive and must
  be in lowercase.

If none of the cache layers contain the OCSP response, the client then attempts to fetch the validation status directly from the OCSP server for the CA.

## Connection error handling

`Cannot open self /usr/bin/snowsql or archive /usr/bin/snowsql.pkg` (Linux Only)
:   Due to a limitation in `pyinstaller` (the program that packages SnowSQL into a stand-alone executable from Python source code), `prelink` mistakenly strips parts of the `snowsql`
    executable and causes this error.

    To avoid this issue, the SnowSQL installer attempts to update the `prelink` configuration file in `/etc/prelink.conf.d/snowsql.conf` for the `snowsql` executable such that
    `prelink` does not alter the file. Unfortunately, this configuration update cannot be made by the SnowSQL auto-upgrade process.

    Work with your system administrator to run the following command on your workstation:

    > ```bash
    > $ sudo bash -c "echo '-b snowsql' > /etc/prelink.conf.d/snowsql.conf"
    > ```

> **Note:**
>
> If you install `snowsql` in your user home directory, this issue is less likely to occur because `prelink` is configured, by default, to scan the shared binary directories (e.g.
> `/usr/bin` or `/bin`) and does not alter programs in your home directory.

## Connection parameters reference

### `-a` , `--accountname`

> Description:
> :   Required
>
>     Specifies your [account identifier](gen-conn-config.md). Specify the account identifier in this form:
>     `organization_name-account_name` (for example, `myorganization-myaccount`).
>
>     For instructions on finding the account identifier, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md).
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Value:
> :   String
>
>     Also, the value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_ACCOUNT`
>
>     Windows:
>     :   `%SNOWSQL_ACCOUNT%`
>
>     For example, in Linux or macOS:
>
>     > ```bash
>     > $ export SNOWSQL_ACCOUNT=myorganization-myaccount
>     >
>     > $ snowsql -a $SNOWSQL_ACCOUNT
>     > ```
>
> Default:
> :   None

### `-u` , `--username`

> Description:
> :   Specifies the login name of the user with whom you connect to the specified account.
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Value:
> :   String
>
>     The value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_USER`
>
>     Windows:
>     :   `%SNOWSQL_USER%`
>
>     For example, in Linux or macOS:
>
>     > ```bash
>     > $ export SNOWSQL_USER=jdoe
>     >
>     > $ snowsql -u $SNOWSQL_USER
>     > ```
>
> Default:
> :   None

### `-d` , `--dbname`

> Description:
> :   Specifies the database to use by default in the client session (can be changed after login).
>
> Value:
> :   String
>
>     The value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_DATABASE`
>
>     Windows:
>     :   `%SNOWSQL_DATABASE%`
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Default:
> :   None

### `-s` , `--schemaname`

> Description:
> :   Specifies the database schema to use by default in the client session (can be changed after login).
>
> Value:
> :   String
>
>     The value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_SCHEMA`
>
>     Windows:
>     :   `%SNOWSQL_SCHEMA%`
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Default:
> :   None

### `-r` , `--rolename`

> Description:
> :   Specifies the role to use by default for accessing Snowflake objects in the client session (can be changed after login).
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Value:
> :   String
>
>     The value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_ROLE`
>
>     Windows:
>     :   `%SNOWSQL_ROLE%`
>
> Default:
> :   None

### `-w` , `--warehouse`

> Description:
> :   Specifies the virtual warehouse to use by default for queries, loading, etc. in the client session (can be changed after login).
>
>     This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Value:
> :   String
>
>     The value can be an environment variable:
>
>     Linux/macOS:
>     :   `$SNOWSQL_WAREHOUSE`
>
>     Windows:
>     :   `%SNOWSQL_WAREHOUSE%`
>
> Default:
> :   None

### `-h` , `--host` — *Deprecated*

> Description:
> :   Provided for backward compatibility/internal use
>
>     Specifies the address of the host to which you connect in Snowflake.
>
>     This parameter is no longer used because the host address is determined automatically by concatenating the account identifier
>     you specified (using either `-a` or `--account`) and the Snowflake domain (`snowflakecomputing.com`).
>
> Value:
> :   String
>
> Default:
> :   None

### `-p` , `--port` — *Deprecated*

> Description:
> :   Provided for backward compatibility/internal use
>
>     Specifies the port number to use for connection.
>
>     This parameter is no longer used because the port number for Snowflake is always `443`.
>
> Value:
> :   String
>
> Default:
> :   None

### `--region` — *Deprecated*

> Description:
> :   Provided for backward compatibility/internal use
>
>     Specifies the ID for the [region](intro-regions.md) where your account is located.
>
>     This parameter is no longer used. For more details, see -a , --accountname (in this topic).
>
> Value:
> :   N/A
>
> Default:
> :   N/A

### `-m` , `--mfa-passcode`

> Description:
> :   Specifies the second token for MFA (multi-factor authentication) if you pass in the passcode in the command line.
>
> Value:
> :   String
>
> Default:
> :   None

### `--mfa-passcode-in-password`

> Description:
> :   Appends the MFA passcode to the end of the password.
>
>     You can force the password prompt and type the password followed by the MFA passcode. For example if the MFA token was `123456` and the password was `PASSWORD`:
>
>     > ```bash
>     > $ snowsql ... -P ...
>     >
>     > Password: PASSWORD123456
>     > ```
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `--abort-detached-query`

> Description:
> :   Aborts a query if the connection between the client and server is lost.
>
> Value:
> :   Boolean
>
> Default:
> :   False (i.e. an active query does not abort if the connection is lost)

### `--probe-connection`

> Description:
> :   Test connectivity to Snowflake and report the results. Note that this is an experimental option used mainly to print out the TLS certificate chain.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `--authenticator`

> Description:
> :   Specifies the authenticator to use for verifying user login credentials.
>
> Value:
> :   String (Constant):
>
>     > * `snowflake` uses the internal Snowflake authenticator.
>     > * `externalbrowser` [uses your web browser](admin-security-fed-auth-use.md) to authenticate with Okta, AD FS, or any other SAML 2.0-compliant identity provider (IdP) that has been defined for your account.
>     > * `https://<okta_account_name>.okta.com` (i.e. the URL endpoint for Okta) [authenticates through native Okta](admin-security-fed-auth-use.md) (only supported if your IdP is Okta).
>     > * `oauth` authenticates using OAuth. When OAuth is specified as the authenticator, you must also set the `--token` parameter to specify the OAuth token (see below).
>
>     For more information, see [Managing/Using federated authentication](admin-security-fed-auth-use.md) and [Clients, drivers, and connectors](oauth-intro.md).
>
> Default:
> :   `snowflake`
>
> > **Note:**
> >
> > The `externalbrowser` authenticator is only supported in terminal windows that have web browser access. For example, a terminal window on a remote machine accessed through a SSH (Secure Shell)
> > session may require additional setup to open a web browser.
> >
> > If you don’t have access to a web browser, but your IdP is Okta, you can use native Okta (i.e. set the authenticator to `https://<okta_account_name>.okta.com`).

### `--token`

> Description:
> :   Specifies the OAuth token to use for authentication.
>     This parameter is required only when you specify `--authenticator=oauth`.
>
> Value:
> :   String
>
> Default:
> :   None

### `-v` , `--version`

> Description:
> :   Use the specified SnowSQL version or, if no version is specified, display the latest SnowSQL version installed.
>
> Value:
> :   String
>
> Default:
> :   None

### `--versions`

> Description:
> :   Lists all available versions of SnowSQL that can be installed and run. To install an earlier SnowSQL version from the list, use the `-v` option and specify the version you want
>     to install.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `--noup`

> Description:
> :   Disables auto-upgrade for this run. If this option is not included and a newer version is available, SnowSQL automatically downloads and installs the new version. The next time you
>     run SnowSQL, the new version is used.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `-D` , `--variable`

> Description:
> :   Defines SnowSQL variables on the command line. This option can be used to set specific variables to use in Snowflake.
>
> Value:
> :   String
>
>     For example:
>
>     > ```bash
>     > $ snowsql ... -D tablename=CENUSTRACKONE --variable db_key=$DB_KEY ...
>     > ```
>
> Default:
> :   None

### `-o` , `--option`

> Description:
> :   Defines SnowSQL configuration options on the command line. These options override any options that have been set in the SnowSQL configuration file. For descriptions of the options you
>     can set/override, see [SnowSQL configuration options reference](snowsql-config.md).
>
> Value:
> :   String
>
> Default:
> :   None

### `-f` , `--filename`

> Description:
> :   Specifies a SQL file to execute in batch mode.
>
>     The value can be a file name (including the directory path, if needed) or a URL to the file.
>
> Value:
> :   String
>
> Default:
> :   None

### `-q` , `--query`

> Description:
> :   Specifies a SQL query to execute.
>
>     The value can be a single SQL query or a semicolon-separated list of queries to execute (e.g. `'select current_user(); select current_role()'`).
>
>     You can also specify multiple queries to run asynchronously by separating the queries with `;>`.
>     The following example starts SnowSQL and runs all four queries asynchronously:
>
>     `snowsql -o log_level=DEBUG -q "select * from SNOWSQLTABLE;> insert into table table1 values(2);> select 5;>select count(*) from testtable;"`
>
> Value:
> :   String
>
> Default:
> :   None

### `--query_tag`

> Description:
> :   Specifies the tags to use when running a query.
>
>     The value can be a single tag or a semicolon-separated list of tags.
>
> Value:
> :   String
>
> Default:
> :   Value of the `SNOWSQL_QUERY_TAG` environment variable.

### `--config`

> Description:
> :   Specifies the location (i.e. directory path) for the SnowSQL configuration file. Include this connector parameter if you want to move or copy the configuration file from the default
>     location.
>
> Value:
> :   String
>
> Default:
> :   OS-specific:
>
>     Linux/macOS:
>     :   `~/.snowsql/`
>
>     Windows:
>     :   `%USERPROFILE%\.snowsql\`

### `-P` , `--prompt`

> Description:
> :   Forces an interactive password prompt.
>
>     By default, SnowSQL uses the password stored in the $SNOWSQL_PWD environment variable. Using this option allows you to override the password defined in $SNOWSQL_PWD.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `-M`, `--mfa-prompt`

> Description:
> :   Forces a prompt for the second token for MFA. Alternatively use `--mfa-passcode <string>` if you want to pass in to the command line.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `-c` , `--connection`

> Description:
> :   Specifies a connection to use, where the specified string is the name of a connection defined in the SnowSQL configuration file. For more details, see
>     Using named connections (in this topic).
>
> Value:
> :   String
>
> Default:
> :   None

### `--single-transaction`

> Description:
> :   Combined with `--filename`, `--query`, or standard input commands, this option wraps BEGIN/COMMIT around the statements to ensure all commands complete successfully or no
>     change is applied.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A
>
> > **Note:**
> >
> > Note that if the input commands use BEGIN, COMMIT, or ROLLBACK, this option will not work correctly. Also, if any command cannot be executed inside a transaction block, this option will cause the command to fail.

### `--private-key-path`

> Description:
> :   Path to private key file.
>
> > **Caution:**
> > > While unencrypted private keys are supported, Snowflake strongly recommends using encrypted private keys
> > > when connecting to Snowflake.
> >
> > For more information, see Using Key Pair Authentication & Key Pair Rotation.
>
> This connection parameter can also be set in the [configuration file](snowsql-config.md).
>
> Value:
> :   String
>
> Default:
> :   None

### `--disable-request-pooling`

> Description:
> :   By default, snowsql uses connection pooling. Connection pooling usually reduces the lag
>     time to make a connection. However, it can slow down client failover to an alternative DNS when a
>     DNS problem occurs. This parameter allows you to turn off connection pooling.
>
>     This parameter applies only to customers who have [replication](account-replication-intro.md) enabled.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `-U` , `--upgrade`

> Description:
> :   Force upgrade of SnowSQL to the latest version if it is not downloaded in the local directory.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A
>
> > **Note:**
> >
> > Requires the bootstrap executable of SnowSQL 1.1.63 or newer version. Download it from the UI.

### `-K` , `--client-session-keep-alive`

> Description:
> :   Keep the session active indefinitely, even if there is no activity from the user.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `--include_connector_version`

> Description:
> :   Displays the version of the Snowflake Connector for Python software that is packaged in the SnowSQL binary.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

### `-?` , `--help`

> Description:
> :   Shows the command line quick usage guide.
>
> Value:
> :   N/A (parameter doesn’t take a value)
>
> Default:
> :   N/A

---
title: Connecting to your accounts
source: https://docs.snowflake.com/en/user-guide/organizations-connect.md
section: User Guide
---

# Connecting to your accounts

This topic provides the URL and [account identifier](admin-account-identifier.md) formats that you use to connect to the
Snowflake accounts in your organization.

> **Note:**
>
> If you are an organization administrator and want to delete old URLs for an account that has changed, see [Managing account URLs](organizations-manage-accounts-urls.md).

## Connecting to the Snowflake web interface

To connect to Snowsight using your web browser, see [Signing in to Snowsight](ui-snowsight-gs.md).

## Connecting with a URL

Snowflake supports multiple URL formats when connecting to a Snowflake account without a browser. For example, an identity provider
might use a direct URL to communicate with Snowflake.

* The **account name** format uses the name of the account and its [organization](organizations.md) to identify the account.
  To find the name of your organization and account, see [Finding the organization and account name for an account](admin-account-identifier.md).
* The **connection name** format, which replaces the account name with the name of a connection, is required when using the
  [Client Redirect](client-redirect.md) feature. To find the name of your connection, execute the
  [SHOW CONNECTIONS](../sql-reference/sql/show-connections.md) command.
* The legacy **account locator** format is currently supported, but its use is discouraged.

### Standard account URLs

The standard URL format can be used in most cases where a Snowflake account URL is required, including:

> * SSO connections (except Okta)
> * SCIM base URL (except Okta)
> * OAuth connections with third-party identity providers (except Okta)
> * OAuth base URL for a Snowflake Authorization Server

The standard URL formats are:

> * Account name: `https://<orgname>-<account_name>.snowflakecomputing.com`
> * Connection name: `https://<orgname>-<connectionname>.snowflakecomputing.com`
> * Account locator (legacy): `https://<accountlocator>.<region>.<cloud>.snowflakecomputing.com`

### Private connectivity URLs

When connecting to Snowflake using private connectivity to the Snowflake service (e.g. AWS PrivateLink), the string `privatelink` must be
appended to the [account identifier](admin-account-identifier.md) in the Snowflake account URL.

> * Account Name: `https://<orgname>-<account_name>.privatelink.snowflakecomputing.com`
> * Connection Name: `https://<orgname>-<connectionname>.privatelink.snowflakecomputing.com`
> * Account Locator (legacy): `https://<account_locator>.<region>.privatelink.snowflakecomputing.com`

Note that using private connectivity requires updating DNS records to include the private connectivity URL. For more information, see:

> * [AWS PrivateLink CNAME Records](admin-security-privatelink.md).
> * Azure Private Link DNS setup in the [configuration procedure](privatelink-azure.md).
> * Google Cloud Private Service Connect DNS setup in [Step 8](private-service-connect-google.md).

### Okta URLs

When using Okta for SSO, SCIM, or OAuth, you must use a special account name format if the account name contains an underscore. Because
Okta does not support underscores in URLs, the underscore in the account name must be converted to a hyphen.

> * Account name: `https://<orgname>-<account-name>.snowflakecomputing.com`
> * Connection name: Use the standard URL
> * Account locator (legacy): Use the standard URL

## Connecting from clients, connectors, and drivers

See [Configuring a client, driver, library, or third-party application to connect to Snowflake](gen-conn-config.md).

## Backwards compatibility

Using the legacy account locator in an account identifier or account URL is still supported, though discouraged.

---
title: Considerations for semi-structured data stored in VARIANT
source: https://docs.snowflake.com/en/user-guide/semistructured-considerations.md
section: User Guide
---

# Considerations for semi-structured data stored in VARIANT

This topic provides best practices, general guidelines, and important considerations for loading and working with
[VARIANT](../sql-reference/data-types-semistructured.md) values that contain semi-structured data. This can be explicitly-constructed
[hierarchical data](semistructured-intro.md) or data that you have loaded from semi-structured data formats such as
JSON, Avro, ORC, and Parquet. The information in this topic does not necessarily apply
to XML data.

## Data size limitations

A VARIANT value can have a maximum size of up to 128 MB of uncompressed data. However, in practice,
the maximum size is usually smaller because of internal overhead. The maximum size is also dependent
on the object being stored.

For more information, see [VARIANT](../sql-reference/data-types-semistructured.md).

## Storing semi-structured data in a VARIANT column vs. flattening the nested structure

If you are not sure yet what types of operations you want to perform on your semi-structured data, Snowflake recommends storing the
data in a VARIANT column for now.

For data that is mostly regular and uses only data types that are native to the semi-structured format you are using (e.g. strings
and integers for JSON format), the storage requirements and query performance for operations on relational data and data in
a VARIANT column is very similar.

For better pruning and less storage consumption, we recommend flattening your OBJECT and key data into separate relational columns
if your semi-structured data includes:

* Dates and timestamps, especially non-[ISO 8601](http://www.iso.org/iso/home/standards/iso8601.htm) dates and timestamps, as string values
* Numbers within strings
* Arrays

Non-native values (such as dates and timestamps in JSON) are stored as strings when loaded into a VARIANT column, so operations on
these values could be slower and also consume more space than when stored in a relational column with the corresponding data type.

If you know your use cases for the data, perform tests on a typical data set. Load the data set into a VARIANT column in a table.
Use the [FLATTEN](../sql-reference/functions/flatten.md) function to extract the OBJECTs and keys you plan to query into a separate table.
Run a typical set of queries against both tables to see which structure provides the best performance.

## NULL values

Snowflake supports two types of NULL values in semi-structured data:

* SQL NULL: SQL NULL means the same thing for semi-structured data types as it means for structured data types: the value is missing or
  unknown.
* JSON null (sometimes called “VARIANT NULL”): In a VARIANT column, JSON null values are stored as a string containing the word “null” to
  distinguish them from SQL NULL values.

The following example contrasts SQL NULL and JSON null:

```sqlexample
SELECT PARSE_JSON(NULL) AS "SQL NULL",
       PARSE_JSON('null') AS "JSON NULL",
       PARSE_JSON('[ null ]') AS "JSON NULL",
       PARSE_JSON('{ "a": null }'):a AS "JSON NULL",
       PARSE_JSON('{ "a": null }'):b AS "ABSENT VALUE";
```

```output
+----------+-----------+-----------+-----------+--------------+
| SQL NULL | JSON NULL | JSON NULL | JSON NULL | ABSENT VALUE |
|----------+-----------+-----------+-----------+--------------|
| NULL     | null      | [         | null      | NULL         |
|          |           |   null    |           |              |
|          |           | ]         |           |              |
+----------+-----------+-----------+-----------+--------------+
```

To convert a JSON null value to a SQL NULL value, cast it as a string. For example:

```sqlexample
SELECT PARSE_JSON('{ "a": null }'):a,
       TO_CHAR(PARSE_JSON('{ "a": null }'):a);
```

```output
+-------------------------------+----------------------------------------+
| PARSE_JSON('{ "A": NULL }'):A | TO_CHAR(PARSE_JSON('{ "A": NULL }'):A) |
|-------------------------------+----------------------------------------|
| null                          | NULL                                   |
+-------------------------------+----------------------------------------+
```

When you construct an [ARRAY](../sql-reference/data-types-semistructured.md) value, you can specify array elements that contain
SQL NULL values or JSON null values. The following example constructs an array with both types of
NULL values:

```sqlexample
SELECT ARRAY_CONSTRUCT(1, NULL, PARSE_JSON('null')) AS array_with_null_values;
```

```sqlexample
+------------------------+
| ARRAY_WITH_NULL_VALUES |
|------------------------|
| [                      |
|   1,                   |
|   undefined,           |
|   null                 |
| ]                      |
+------------------------+
```

The output shows that SQL NULL values are `undefined` elements in an array, while JSON null values are
`null` elements.

## Semi-structured data files and subcolumnarization

When semi-structured data is inserted into a VARIANT column, Snowflake uses certain rules to extract as much of the data as possible
to a columnar form. The rest of the data is stored as a single column in a parsed semi-structured structure.

By default, Snowflake extracts a maximum of 200 elements per partition, per table. To increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Elements that are not extracted

Elements with the following characteristics are not extracted into a column:

* Elements that contain even a single “null” value are not extracted into a column.
  This applies to elements with “null” values and not to elements with missing values, which are represented in columnar form.

  This rule ensures that no information is lost (that is, that the difference between VARIANT “null” values and SQL NULL values is not lost).
* Elements that contain multiple data types. For example:

  The `foo` element in one row contains a number:

  ```sqljson
  {"foo":1}
  ```

  The same element in another row contains a string:

  ```sqljson
  {"foo":"1"}
  ```

### How extraction impacts queries

When you query a semi-structured element, Snowflake’s execution engine behaves differently according to whether an element was extracted.

* If the element was extracted into a column, the engine scans only the extracted column.
* If the element was not extracted into a column, the engine must scan the entire JSON structure,
  and then for each row traverse the structure to output values. This impacts performance.

To avoid the performance impact for elements that were not extracted, do the following:

* Extract semi-structured data elements containing “null” values into relational columns before you load them.

  Alternatively, if the “null” values in your files indicate missing values and have no other special meaning,
  we recommend setting the [file format option](../sql-reference/sql/create-file-format.md) STRIP_NULL_VALUES to TRUE
  when you load the semi-structured data files. This option removes OBJECT elements or ARRAY elements containing “null” values.
* Ensure each unique element stores values of a single data type that is native to the format (for example, string or number for JSON).

### Parsing NULL values

To output a SQL NULL value from a VARIANT `"null"` key-value, use the [TO_CHAR , TO_VARCHAR](../sql-reference/functions/to_char.md) function to cast the value as a string, e.g.:

```sqlexample
SELECT column1
  , TO_VARCHAR(PARSE_JSON(column1):a)
FROM
  VALUES('{"a" : null}')
, ('{"b" : "hello"}')
, ('{"a" : "world"}');

+-----------------+-----------------------------------+
| COLUMN1         | TO_VARCHAR(PARSE_JSON(COLUMN1):A) |
|-----------------+-----------------------------------|
| {"a" : null}    | NULL                              |
| {"b" : "hello"} | NULL                              |
| {"a" : "world"} | world                             |
+-----------------+-----------------------------------+
```

---
title: Constructing SQL at runtime
source: https://docs.snowflake.com/en/user-guide/querying-construct-at-runtime.md
section: User Guide
---

# Constructing SQL at runtime

Snowflake supports several different techniques for constructing strings of SQL statements dynamically at runtime.
By using these techniques, you can specify more general and flexible SQL strings for use cases where the full text
of the SQL statements are unknown until runtime.

A stored procedure or application can accept user input and then use that input in a SQL statement. For example,
a table might store information about sales orders. An application or stored procedure might accept an order ID as
input and run a query that only returns the results for that specific order.

A developer can write stored procedure code or application code with SQL statements that contain placeholders, and
then bind variables to those placeholders in the code. These placeholders are called
[bind variables](../sql-reference/bind-variables.md). A developer can also write code that constructs SQL
statements from an input string (for example, by concatenating strings that contain a SQL command, parameters,
and values).

The following techniques are available for constructing SQL statements dynamically at runtime:

* The TO_QUERY function - This function takes
  a SQL string with optional parameters as input.
* Dynamic SQL - Code in a stored procedure or
  application takes input and constructs a dynamic SQL statement using this input. The code can be part of a
  [Snowflake Scripting](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md) or
  [Javascript](../developer-guide/stored-procedure/stored-procedures-javascript.md)
  stored procedure, or a Snowflake Scripting anonymous block. You can also use this technique in your
  application code that uses a [Snowflake driver](../developer-guide/drivers.md) or the
  [Snowflake SQL API](../developer-guide/sql-api/index.md).

> **Note:**
>
> When programs construct SQL statements with user input, there are potential security risks, such as
> SQL injection. If inputs to SQL statements come from external sources, make sure they are validated.
> For more information, see [SQL injection](../developer-guide/stored-procedure/stored-procedures-usage.md).

## Use the TO_QUERY function

You can use the [TO_QUERY](../sql-reference/functions/to_query.md) function in the code for stored procedures and applications
that construct SQL statements dynamically. This table function takes a SQL string as input. Optionally, the
SQL string can contain parameters, and you can specify the arguments to pass to the parameters as
bind variables.

The following is a simple example that calls the function:

```sqlexample
SELECT COUNT(*) FROM TABLE(TO_QUERY('SELECT 1'));
```

```output
+----------+
| COUNT(*) |
|----------|
|        1 |
+----------+
```

The following example uses the TO_QUERY function in a stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results_tq(query VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
DECLARE
  res RESULTSET DEFAULT (SELECT COUNT(*) FROM TABLE(TO_QUERY(:query)));
BEGIN
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results_tq(query VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET DEFAULT (SELECT COUNT(*) FROM TABLE(TO_QUERY(:query)));
BEGIN
  RETURN TABLE(res);
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL get_num_results_tq('SELECT 1');
```

```output
+----------+
| COUNT(*) |
|----------|
|        1 |
+----------+
```

## Use dynamic SQL in stored procedures and applications

To construct SQL statements that take user input, you can use dynamic SQL in a
[Snowflake Scripting](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md)
or [Javascript](../developer-guide/stored-procedure/stored-procedures-javascript.md) stored procedure, or in a Snowflake
Scripting anonymous block . You can also use dynamic SQL in your application code that uses a
[Snowflake driver](../developer-guide/drivers.md) or the [Snowflake SQL API](../developer-guide/sql-api/index.md).

This example creates a stored procedure with Snowflake Scripting. The stored procedure takes SQL text as input and constructs
a string containing a SQL statement by appending the text to it. The dynamic SQL is then executed using the
[EXECUTE IMMEDIATE](../sql-reference/sql/execute-immediate.md) command.

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results(query VARCHAR)
RETURNS INTEGER
LANGUAGE SQL
AS
DECLARE
  row_count INTEGER DEFAULT 0;
  stmt VARCHAR DEFAULT 'SELECT COUNT(*) FROM (' || query || ')';
  res RESULTSET DEFAULT (EXECUTE IMMEDIATE :stmt);
  cur CURSOR FOR res;
BEGIN
  OPEN cur;
  FETCH cur INTO row_count;
  RETURN row_count;
END;
```

Note: If you use [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results(query VARCHAR)
RETURNS INTEGER
LANGUAGE SQL
AS
$$
DECLARE
  row_count INTEGER DEFAULT 0;
  stmt VARCHAR DEFAULT 'SELECT COUNT(*) FROM (' || query || ')';
  res RESULTSET DEFAULT (EXECUTE IMMEDIATE :stmt);
  cur CURSOR FOR res;
BEGIN
  OPEN cur;
  FETCH cur INTO row_count;
  RETURN row_count;
END;
$$
;
```

The following example calls the procedure:

```sqlexample
CALL get_num_results('SELECT 1');
```

```output
+-----------------+
| GET_NUM_RESULTS |
|-----------------|
|               1 |
+-----------------+
```

Dynamic SQL supports bind variables. The following Snowflake Scripting example uses bind variables represented
by the `?` placeholders to construct SQL statements dynamically at runtime. This block selects data from the
following `invoices` table:

```sqlexample
CREATE OR REPLACE TABLE invoices (price NUMBER(12, 2));
INSERT INTO invoices (price) VALUES
  (11.11),
  (22.22);
```

Execute the anonymous block:

```sqlexample
DECLARE
  rs RESULTSET;
  query VARCHAR DEFAULT 'SELECT * FROM invoices WHERE price > ? AND price < ?';
  minimum_price NUMBER(12,2) DEFAULT 20.00;
  maximum_price NUMBER(12,2) DEFAULT 30.00;
BEGIN
  rs := (EXECUTE IMMEDIATE :query USING (minimum_price, maximum_price));
  RETURN TABLE(rs);
END;
```

Note: If you use [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  rs RESULTSET;
  query VARCHAR DEFAULT 'SELECT * FROM invoices WHERE price > ? AND price < ?';
  minimum_price NUMBER(12,2) DEFAULT 20.00;
  maximum_price NUMBER(12,2) DEFAULT 30.00;
BEGIN
  rs := (EXECUTE IMMEDIATE :query USING (minimum_price, maximum_price));
  RETURN TABLE(rs);
END;
$$
;
```

```output
+-------+
| PRICE |
|-------|
| 22.22 |
+-------+
```

## Comparison of the techniques for constructing SQL dynamically

The following table describes the advantages and disadvantages of the techniques for constructing
SQL dynamically.

| Technique | Advantages | Disadvantages |
| --- | --- | --- |
| TO_QUERY function | * Simple syntax * Built-in error handling * Specific semantics for the use case of constructing SQL dynamically * Automatically determined result set | * Queries cannot be described or explained before execution * Only valid in the FROM clause of a SELECT statement * Snowflake specific |
| Dynamic SQL | * More general and flexible than the TO_QUERY function * Queries can be described or explained before execution | * More complex than the TO_QUERY function * Manual error handling |

---
title: Consume imported data
source: https://docs.snowflake.com/en/user-guide/data-share-consumers.md
section: User Guide
---

# Consume imported data

This topic describes the tasks associated with creating databases from shares made available by data providers and then using the databases
for queries and other operations.

You must use the ACCOUNTADMIN role (or a role granted the IMPORT SHARE global privilege) to perform these tasks. For more details about the
IMPORT SHARE privilege, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

> **Note:**
>
> The tasks described in this topic do not apply to reader accounts. If you are using a reader account to consume imported data, you do not
> need to perform any of these tasks because they have already been completed by an administrator from the provider account.

## General limitations for imported databases

Imported databases have the following limitations for consumers:

* Imported databases are read-only. Users in a consumer account can view/query data, but cannot insert or update data, or create any objects
  in the database.
* The following actions are not supported:

  > + Creating a clone of an imported database or any schemas/tables in the database.
  > + Time Travel for an imported database or any schemas/tables in the database.
  > + Editing the comments for an imported database.
  > + Attaching [storage lifecycle policies](storage-management/storage-lifecycle-policies.md) to tables in an imported database.
* Imported databases and all the objects in the database cannot be re-shared with (imported by) other accounts.
* Imported databases cannot be replicated.

## Viewing available shares

You can view the shares that are available to consume in your account using either the web interface or SQL:

SnowsightSQL

To view shares that have been shared with you, In the navigation menu, select Data sharing » Internal sharing, then select Shared With You.

> From this page, you can view the following:
>
> * **Privately Shared Listings** that have been shared with you. You can also view **Data exchange** listings that you have access to.
> * **Direct shares** that have been shared with you. Depending on the share status, shares are grouped into two sections:
>
>   + Direct shares that are ready to get (i.e. a database has not been created from the share).
>   + Direct shares that have been imported into a database and are ready to query.

To view Snowflake Marketplace listings that have been imported to a database and are ready to query, do the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.

For more information, see [Explore listings](../collaboration/consumer-listings-exploring.md).

Execute a [SHOW SHARES](../sql-reference/sql/show-shares.md) or [DESCRIBE SHARE](../sql-reference/sql/desc-share.md) statement.

For example, using SQL:

```sqlexample
SHOW SHARES;
```

The output shows the following:

* Two shares, `sales_s` and `sales_s2` are available. `INBOUND` in the `kind` column specifies that a data provider made the
  share available to your account to consume.
* The `name` column displays the name of each share, in the form of `share_name` (e.g. `SALES_S`).
* The `owner_account` column displays the account name that provided each share, in the form of `orgname.account_name`.
* If the `database_name` column is empty, a database has not yet been created from the share in your account.

```output
+-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
| created_on                    | kind     | owner_account        | name          | database_name         | to               | owner        | comment                                | listing_global_name |                  |
|-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------|---------------------|
| 2017-07-09 19:18:09.821 -0700 | INBOUND  | SNOW.XY12345         | SALES_S2      | UPDATED_SALES_DB      |                  |              | Transformed and updated sales data     |                     |
| 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT | SALES_S       | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |                     |
+-------------------------------+----------+----------------------+---------------+-----------------------+----------------- -+--------------+----------------------------------------+---------------------+
```

### DESCRIBE SHARE example

The following example uses the [DESCRIBE SHARE](../sql-reference/sql/desc-share.md) command to show the objects (database, schemas, and tables) that are in
the `sales_s` share:

> ```sqlexample
> DESC SHARE xy12345.sales_s;
>
> +----------+------------------------------------+---------------------------------+
> | kind     | name                               | shared_on                       |
> |----------+------------------------------------+---------------------------------|
> | DATABASE | <DB>                               | Thu, 15 Jun 2017 17:03:16 -0700 |
> | SCHEMA   | <DB>.AGGREGATES_EULA               | Thu, 15 Jun 2017 17:03:16 -0700 |
> | TABLE    | <DB>.AGGREGATES_EULA.AGGREGATE_1   | Thu, 15 Jun 2017 17:03:16 -0700 |
> | VIEW     | <DB>.AGGREGATES_EULA.AGGREGATE_1_v | Thu, 15 Jun 2017 17:03:16 -0700 |
> +----------+------------------------------------+---------------------------------+
> ```
>
> The share consists of one schema, `aggregates_eula`, with one table, `aggregate_1`. Each object name, including the database
> itself, is prefixed with `<DB>`. This indicates a database has not been created yet (in your account) from the share.

## Creating a database from a share

You can create a database from a share using the web interface or SQL:

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared with you tab.
4. In the Ready to Get section, select the share that you want to create a database for.
5. Set a database name and the roles that are permitted to access the database.
6. Select Get Data.

Execute a [CREATE DATABASE](../sql-reference/sql/create-database.md) statement with the following data sharing-specific syntax:

```sqlsyntax
CREATE DATABASE <name> FROM SHARE <provider_account>.<share_name>
```

Where `provider_account` is the name of the account that provided the share and `share_name` is the name of the share
from which to create the database.

> **Note:**
>
> * A share can only be consumed once per account.
> * To see the objects that are being imported before creating a database, use the [DESCRIBE SHARE](../sql-reference/sql/desc-share.md) command.
> * When a database is created from a share, only the role used to create the database can access objects in the database by default.
>   For instructions on granting access to other roles, see Granting Privileges on an Imported Database (in this topic).

### SQL examples

The following example creates a new database named `snow_sales` in your account from the `sales_s` share:

> ```sqlexample
> CREATE DATABASE snow_sales FROM SHARE xy12345.sales_s;
> ```

List the new `snow_sales` database:

> ```sqlexample
> SHOW DATABASES LIKE 'snow%';
>
> +---------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------+
> | created_on                      | name                  | is_default | is_current | origin                  | owner        | comment | options | retention_time |
> |---------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------|
> | Sun, 10 Jul 2016 23:28:50 -0700 | SNOWFLAKE_SAMPLE_DATA | N          | N          | SFC_SAMPLES.SAMPLE_DATA | ACCOUNTADMIN |         |         | 1              |
> | Thu, 15 Jun 2017 18:30:08 -0700 | SNOW_SALES            | N          | Y          | xy12345.SALES_S         | ACCOUNTADMIN |         |         | 1              |
> +---------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------+
> ```
>
> In this example, the `origin` column indicates the fully-qualified name of the share from which the database was created.

Similarly, the output of SHOW SHARES and DESC SHARE includes the name of the database that was created from the share:

> ```sqlexample
> SHOW SHARES;
> ```
>
> ```output
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> | created_on                    | kind     | owner_account        | name          | database_name         | to               | owner        | comment                                | listing_global_name |
> |-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------|---------------------|
> | 2017-07-09 19:18:09.821 -0700 | INBOUND  | SNOW.XY12345         | SALES_S2      | UPDATED_SALES_DB      |                  |              | Transformed and updated sales data     |                     |
> | 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT | SALES_S       | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |                     |
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> ```
>
> ```sqlexample
> DESC SHARE xy12345.sales_s;
>
> +----------+------------------------------------------+---------------------------------+
> | kind     | name                                     | shared_on                       |
> |----------+------------------------------------------+---------------------------------|
> | DATABASE | SNOW_SALES                               | Thu, 15 Jun 2017 17:03:16 -0700 |
> | SCHEMA   | SNOW_SALES.AGGREGATES_EULA               | Thu, 15 Jun 2017 17:03:16 -0700 |
> | TABLE    | SNOW_SALES.AGGREGATES_EULA.AGGREGATE_1   | Thu, 15 Jun 2017 17:03:16 -0700 |
> | VIEW     | SNOW_SALES.AGGREGATES_EULA.AGGREGATE_1_v | Thu, 15 Jun 2017 17:03:16 -0700 |
> +----------+------------------------------------------+---------------------------------+
> ```

## Granting privileges on an imported database

The instructions to grant access to objects in a share differ depending on whether the provider segmented the objects in a share using
database roles. This option associates different objects in the share with different database roles.

Note that a single share can include both objects that are accessible via database roles and objects that are not associated with a
database role.

### Option 1: Objects in a share aren’t associated with a database role

Allow users to access objects in a share by granting the IMPORTED PRIVILEGES privilege on an imported database to one or more roles in your
account.

A role can grant IMPORTED PRIVILEGES on an imported database only when it either:

* Owns the imported database (i.e. has the OWNERSHIP privilege on the database).
* Was granted the MANAGE GRANTS global privilege.

#### Assigning IMPORTED PRIVILEGES to other roles

You can assign this role to other roles using either Snowsight or SQL:

SnowsightSQL

1. Select Catalog » Database Explorer.
2. Select the database that you want to grant privileges to.
3. In the Privileges section, select + Privileges.
4. Select a role and privilege to grant to that role.
5. Select Grant Privileges.

Execute a [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) statement.

To see the roles that have USAGE privileges on an imported database, use Snowsight or the [SHOW GRANTS](../sql-reference/sql/show-grants.md) command.

#### SQL examples

1. Role `r1` creates database `snow_sales` from share `xy12345.sales_s`:

   ```sqlexample
   use role r1;
   create database snow_sales from share xy12345.sales_s;
   ```
2. Role `r1` grants IMPORTED PRIVILEGES on database `snow_sales` to role `r2`:

   ```sqlexample
   grant imported privileges on database snow_sales to role r2;
   ```
3. Since `r2` does not have the OWNERSHIP privilege on the database, to be able to perform either of the following grant or revoke
   operations, role `r2` must hold the MANAGE GRANTS privilege on the account:

   ```sqlexample
   use role r2;
   grant imported privileges on database snow_sales to role r3;
   revoke imported privileges on database snow_sales from role r3;
   ```

### Option 2: Objects in a share associated with a database role

Allow users to access objects in a share by granting the appropriate database role in the imported database to one or more roles in your
account.

#### Step 1: Create a database from the share

Create a database from the share using [CREATE DATABASE … FROM SHARE](../sql-reference/sql/create-database.md).

Executing this command requires a role with the global CREATE DATABASE and IMPORT SHARE privileges.

For example, create databases `c1` from provider `provider1` and share `share1`:

```sqlexample
CREATE DATABASE c1 FROM SHARE provider1.share1;
```

#### Step 2: Grant database roles to your account-level roles

Grant database roles to roles in your account to allow users with those roles to access database objects in the share.

Use the role that you used to create the database from the share.

For example, see the database roles available, then grant database role `c1.r1` to the `analyst` role in your account:

```sqlexample
SHOW DATABASE ROLES in DATABASE c1;
GRANT DATABASE ROLE c1.r1 TO ROLE analyst;
```

## Creating streams on shared views or tables

Creating streams on shared objects (secure views or tables) enables you to track data manipulation language (DML) changes made in those
objects. This functionality is similar to creating and using streams on “local” objects (i.e. in the same account as the stream).

The role used to execute the SQL statements in this section must have the required grants on the shared table or secure view. For information,
see Granting privileges on an imported database (in this topic).

* To create streams on shared views:

  ```sqlsyntax
  CREATE STREAM <name> ON VIEW <shared_db>.<schema>.<view>;
  ```

  For example, create a stream on the shared `aggregate_1_v` view in the `snow_sales.aggregates_eula` database and schema:

  ```sqlexample
  CREATE STREAM aggregate_1_v_stream ON VIEW snow_sales.aggregates_eula.aggregate_1_v;
  ```
* To create streams on shared tables:

  ```sqlsyntax
  CREATE STREAM <name> ON TABLE <shared_db>.<schema>.<table>;
  ```

  For example, create a table stream on the shared `aggregate_1` table in the `snow_sales.aggregates_eula` database and schema:

  ```sqlexample
  CREATE STREAM aggregate_1_stream ON TABLE snow_sales.aggregates_eula.aggregate_1;
  ```

For more information on creating streams, see [CREATE STREAM](../sql-reference/sql/create-stream.md).

> **Note:**
>
> * The data provider must enable change tracking on views or tables before you can create streams on these objects. If you cannot
>   create streams on a desired shared object, contact the data provider to consider enabling change tracking on the object.
> * To avoid allowing a stream to become stale, consume the stream records within a transaction during the retention period for the table.
>   Contact the data provider to determine the data retention period for the table.
>
>   To determine whether a stream has become stale, execute the [DESCRIBE STREAM](../sql-reference/sql/desc-stream.md) or [SHOW STREAMS](../sql-reference/sql/show-streams.md)
>   command. In the command output, when the STALE column value is TRUE, the stream may be stale. In practice, reading from the stream may
>   succeed for some time after the expected STALE_AFTER. However, the stream may become stale at any time during this period.

## Querying an imported database

Querying an imported database is the same as querying any other database in your account.

For example:

> ```sqlexample
> USE ROLE r1;
>
> USE DATABASE snow_sales;
>
> SELECT * FROM aggregates_1;
> ```

---
title: Contacting Snowflake Support
source: https://docs.snowflake.com/en/user-guide/contacting-support.md
section: User Guide
---

# Contacting Snowflake Support

To submit a case to Snowflake Support, you can either:

* [Use the Support page in Snowsight](ui-support.md).
* [Use the Snowflake Support Portal](https://community.snowflake.com/s/article/How-To-Submit-a-Support-Case-in-Snowflake-Lodge).

---
title: Continuous data pipeline examples
source: https://docs.snowflake.com/en/user-guide/data-pipelines-examples.md
section: User Guide
---

# Continuous data pipeline examples

This topic provides practical examples of use cases for data pipelines.

## Prerequisites

The role used to execute the SQL statements in these examples requires the following access control privileges:

`EXECUTE TASK`
:   Global EXECUTE TASK privilege to run tasks

`USAGE`
:   USAGE privilege on the database and schema in which the SQL statements are executed, as well as on the warehouse that runs any tasks in these examples.

`CREATE object`
:   Various `CREATE object` privileges on the schema in which the SQL statements are executed, to create objects such as tables, streams, and tasks.

For more information about access control in Snowflake, see [Overview of Access Control](security-access-control-overview.md).

## Transform loaded JSON data on a schedule

The following example loads raw JSON data into a single landing table named `raw`. Two tasks query table streams created on the `raw` table and insert subsets of rows into multiple tables. Because each task consumes the change data capture records in a table stream, multiple streams are required.

```sqlexample
-- Create a landing table to store raw JSON data.
-- Snowpipe could load data into this table.
create or replace table raw (var variant);

-- Create a stream to capture inserts to the landing table.
-- A task will consume a set of columns from this stream.
create or replace stream rawstream1 on table raw;

-- Create a second stream to capture inserts to the landing table.
-- A second task will consume another set of columns from this stream.
create or replace stream rawstream2 on table raw;

-- Create a table that stores the names of office visitors identified in the raw data.
create or replace table names (id int, first_name string, last_name string);

-- Create a table that stores the visitation dates of office visitors identified in the raw data.
create or replace table visits (id int, dt date);

-- Create a task that inserts new name records from the rawstream1 stream into the names table
-- every minute when the stream contains records.
-- Replace the 'mywh' warehouse with a warehouse that your role has USAGE privilege on.
create or replace task raw_to_names
warehouse = mywh
schedule = '1 minute'
when
system$stream_has_data('rawstream1')
as
merge into names n
  using (select var:id id, var:fname fname, var:lname lname from rawstream1) r1 on n.id = to_number(r1.id)
  when matched then update set n.first_name = r1.fname, n.last_name = r1.lname
  when not matched then insert (id, first_name, last_name) values (r1.id, r1.fname, r1.lname)
;

-- Create another task that merges visitation records from the rawstream2 stream into the visits table
-- every minute when the stream contains records.
-- Records with new IDs are inserted into the visits table;
-- Records with IDs that exist in the visits table update the DT column in the table.
-- Replace the 'mywh' warehouse with a warehouse that your role has USAGE privilege on.
create or replace task raw_to_visits
warehouse = mywh
schedule = '1 minute'
when
system$stream_has_data('rawstream2')
as
merge into visits v
  using (select var:id id, var:visit_dt visit_dt from rawstream2) r2 on v.id = to_number(r2.id)
  when matched then update set v.dt = r2.visit_dt
  when not matched then insert (id, dt) values (r2.id, r2.visit_dt)
;

-- Resume both tasks.
alter task raw_to_names resume;
alter task raw_to_visits resume;

-- Insert a set of records into the landing table.
insert into raw
  select parse_json(column1)
  from values
  ('{"id": "123","fname": "Jane","lname": "Smith","visit_dt": "2019-09-17"}'),
  ('{"id": "456","fname": "Peter","lname": "Williams","visit_dt": "2019-09-17"}');

-- Query the change data capture record in the table streams
select * from rawstream1;
select * from rawstream2;

-- Wait for the tasks to run.
-- A tiny buffer is added to the wait time
-- because absolute precision in task scheduling is not guaranteed.
call system$wait(70);

-- Query the table streams again.
-- Records should be consumed and no longer visible in streams.

-- Verify the records were inserted into the target tables.
select * from names;
select * from visits;

-- Insert another set of records into the landing table.
-- The records include both new and existing IDs in the target tables.
insert into raw
  select parse_json(column1)
  from values
  ('{"id": "456","fname": "Peter","lname": "Williams","visit_dt": "2019-09-25"}'),
  ('{"id": "789","fname": "Ana","lname": "Glass","visit_dt": "2019-09-25"}');

-- Wait for the tasks to run.
call system$wait(70);

-- Records should be consumed and no longer visible in streams.
select * from rawstream1;
select * from rawstream2;

-- Verify the records were inserted into the target tables.
select * from names;
select * from visits;
```

## Unload data on a schedule

The following example unloads the change data capture records in a stream into an internal (i.e. Snowflake) stage.

```sqlexample
-- Use the landing table from the previous example.
-- Alternatively, create a landing table.
-- Snowpipe could load data into this table.
create or replace table raw (id int, type string);

-- Create a stream on the table.  We will use this stream to feed the unload command.
create or replace stream rawstream on table raw;

-- Create a task that executes the COPY statement every minute.
-- The COPY statement reads from the stream and loads into the table stage for the landing table.
-- Replace the 'mywh' warehouse with a warehouse that your role has USAGE privilege on.
create or replace task unloadtask
warehouse = mywh
schedule = '1 minute'
when
  system$stream_has_data('RAWSTREAM')
as
copy into @%raw/rawstream from rawstream overwrite=true;
;

-- Resume the task.
alter task unloadtask resume;

-- Insert raw data into the landing table.
insert into raw values (3,'processed');

-- Query the change data capture record in the table stream
select * from rawstream;

-- Wait for the tasks to run.
-- A tiny buffer is added to the wait time
-- because absolute precision in task scheduling is not guaranteed.
call system$wait(70);

-- Records should be consumed and no longer visible in the stream.
select * from rawstream;

-- Verify the COPY statement unloaded a data file into the table stage.
ls @%raw;
```

## Refresh external table metadata on a schedule

The following example refreshes the metadata for an external table named `mydb.myschema.exttable` (using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) … REFRESH) on a schedule.

> **Note:**
>
> When an external table is created, the AUTO_REFRESH parameter is set to `TRUE` by default. We recommend that you accept this default value for external tables that reference data files in either Amazon S3 or Microsoft Azure stages. However, the automatic refresh option is not available currently for external tables that reference Google Cloud Storage stages. For these external tables, manually refreshing the metadata on a schedule can be useful.

```sqlexample
-- Create a task that executes an ALTER EXTERNAL TABLE ... REFRESH statement every 5 minutes.
-- Replace the 'mywh' warehouse with a warehouse that your role has USAGE privilege on.
CREATE TASK exttable_refresh_task
WAREHOUSE=mywh
SCHEDULE='5 minutes'
  AS
ALTER EXTERNAL TABLE mydb.myschema.exttable REFRESH;
```

---
title: Continuous data protection
source: https://docs.snowflake.com/en/user-guide/data-cdp.md
section: User Guide
---

# Continuous data protection

Continuous Data Protection (CDP) encompasses a comprehensive set of features that help protect data stored in Snowflake against human
error, malicious acts, and software failure. At every stage within the data lifecycle, Snowflake enables your data to be
accessible and recoverable in the event of accidental or intentional modification, removal, or corruption.

The features include:

| Feature | Additional Reading |
| --- | --- |
| Network policies for granting or restricting users access to the site based on their IP address (i.e. IP allow lists). | [Controlling network traffic with network policies](network-policies.md) |
| Verification/authentication required for any users accessing your account (includes support for MFA and SSO). | [Multi-factor authentication (MFA)](security-mfa.md) — enabled per user . [Federated authentication](admin-security-fed-auth-overview.md) |
| Security roles for controlling user access to all objects in the system. | [Overview of Access Control](security-access-control-overview.md) |
| All files stored on internal stages for data loading and unloading operations are automatically encrypted using AES-256 strong encryption on the server side. By default, Snowflake provides additional client-side encryption with a 128-bit key (with the option to configure a 256-bit key). | [Understanding end-to-end encryption in Snowflake](security-encryption-end-to-end.md) |
| Maintenance of historical data (i.e. data that has been changed or deleted) through Snowflake Time Travel (for querying and restoring data) and Fail-safe (for disaster recovery; can only be performed by Snowflake). | [Snowflake Time Travel & Fail-safe](data-availability.md) |

Most Continuous Data Protection features are included standard for all [Snowflake editions](intro-editions.md) (i.e. no additional licensing is
required); however, some features are available only for Snowflake Enterprise Edition (or higher).

In addition, both Time Travel and Fail-safe require additional data storage, which has associated fees. For more details, see
[Data storage considerations](tables-storage-considerations.md).

---
title: Control network traffic to Snowflake Open Catalog with network policies
source: https://docs.snowflake.com/en/user-guide/opencatalog/network-policies.md
section: User Guide
---

# Control network traffic to Snowflake Open Catalog with network policies

Create network policies to control network traffic to a Snowflake Open Catalog account. When creating a network policy, you specify the following
lists for the network policy:

* The list of IPv4 addresses that are permitted to access the Open Catalog account (the *allowed list* for the policy)
* If you need to explicitly block IPv4 addresses, the list of IP addresses that are restricted from accessing the Open Catalog account (the *blocked list* for the policy)

When you add IP addresses to the allowed list of a network policy, you don’t have to use the blocked list to explicitly block other IP
addresses of the same type; only the allowed IP addresses have access. Typically, you use the blocked list to restrict IP addresses included
in a CIDR block range that you add to the allowed list.

For example, if you add a single IPv4 address to the allowed list, all other IPv4 addresses are blocked. There is no need to use the blocked
list to restrict access from other IP addresses.

You can create multiple network policies, if needed. However, you can only activate one network policy at a time.

If a network policy has the same IP address values in both the allowed list and blocked list, Open Catalog applies the
values in the blocked list first. For example, if `192.168.1.99` is added to the allowed list through a CIDR block range such as
`192.168.1.0/24`, but `192.168.1.99` is specified in the blocked list, `192.168.1.99` is ultimately added to the blocked list.

## Step 1: Create a network policy

**Caution**

> Ensure that the network policy you create grants the IP address for your computer access to the Open Catalog account. Otherwise, when you
> activate the network policy, you’ll be locked out of the account. If you are using private connectivity in the Open Catalog service, do
> the following:

> 1. Configure the external service, such as AWS PrivateLink, to generate private IP addresses.
> 2. Use CIDR notation to add the private IP addresses to the allowed list for your network policy.

To create a network policy, follow these steps:

1. Sign in to Open Catalog.
2. From the menu on the left, select **Security**.
3. Select **+ Network Policy**.
4. Enter a name for the network policy.

   **Note**

   > * A network policy name can’t contain spaces or special characters other than underscores.
   > * Network policy names are treated as case insensitive and are saved with uppercase letters.
5. To add IPv4 addresses to the allowed list, follow these steps:

   1. In the **Allowed IPs** field, add an entry. CIDR notation is supported. For an example, see
      Use CIDR notation to specify allowed IP addresses.
   2. Press **Enter**.
   3. If needed, repeat the previous steps to add another entry.
6. Optional: To add IPv4 addresses to the blocked list, follow these steps:

   1. In the **Blocked IPs** field, add an entry.
   2. Press **Enter**.
   3. If needed, repeat the previous steps to add another entry.
7. Select **Create**.

## Step 2: Activate a network policy

After you create a network policy, you need to activate it for its policy to take effect and restrict network traffic. If you created
multiple network policies, you can only activate one network policy at a time.

**Caution**

> Before you activate a network policy, ensure that it grants the IP address for your computer access to the Open Catalog account.
> Otherwise, you’ll be locked out of the account.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Security**.
3. From the list of network policies, locate the network policy you want to activate.
4. Under the **MORE** column, select **…** for the network policy you want to activate.
5. Select **Activate**.

   **Note**

   > If another network policy is currently activated, it’s automatically deactivated when you activate the network policy.

## Deactivate a network policy

1. Sign in to Open Catalog.
2. From the menu on the left, select **Security**.
3. From the list of network policies, locate the network policy you want to deactivate.
4. Under the **MORE** column, select **…** for the network policy you want to deactivate.
5. Select **Deactivate**.

## Delete a network policy

**Note**

> If the network policy you want to delete is activated, first deactivate it. You can’t delete a network
> policy that is activated.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Security**.
3. From the list of network policies, locate the network policy you want to delete.
4. Under the **MORE** column, select **…** for the network policy you want to delete.
   Select **Delete**.

## Examples

### Use CIDR notation to specify allowed IP addresses

The following network policy allows requests from all IP addresses in the range of `192.168.1.0` to `192.168.1.255`, except `192.168.1.99`.
IP addresses outside the range are also blocked.

The entry for the allowed list of the policy uses CIDR notation.

**Allowed IPs** = `192.168.1.0/24`

**Blocked IPs** = `192.168.1.99`

---
title: Controlling cost
source: https://docs.snowflake.com/en/user-guide/cost-controlling.md
section: User Guide
---

# Controlling cost

You can use budgets to control credit usage for compute costs, including those incurred by serverless features. If you are only concerned
with controlling the costs of warehouses, you can also use resource monitors to monitor and suspend warehouses. In addition, Snowflake
provides cost controls you can configure to help avoid unexpected costs.

## Use budgets to control credit usage

A *budget* allows you to set a monthly spending limit and monitor the credit usage of all
[supported objects](budgets/custom-budget.md) and serverless features in your account. In addition to your account budget,
you can create custom budgets to monitor credit usage of groups of specified objects and the serverless features used by those objects.
For example, you can create a custom budget for each department in your organization. Each budget sends a notification if the
credit usage is expected to exceed its spending limit for the month. You can configure the budget to send this notification to a
list of email addresses, a queue provided by a cloud service (Amazon SNS, Azure Event Grid, or Google Cloud PubSub), or a webhook
for a third-party system (for example, Slack, Microsoft Teams, or PagerDuty).

For information about budgets, see [Monitor credit usage with budgets](budgets.md).

## Use resource monitors to control credit usage

A *resource monitor* lets you monitor credit usage by user-managed virtual warehouses. You can set a spending limit that resets on a
monthly basis or on a custom schedule. A resource monitor can
send an email notification when your credit usage reaches a percentage (threshold) of the spending limit. You can customize up to five
notification thresholds. To help avoid unexpected credit usage, you can optionally suspend a warehouse when its credit usage reaches
a threshold.

For background information about how virtual warehouses incur costs, see [Understanding compute cost](cost-understanding-compute.md).

For information about resource monitors, see [Working with resource monitors](resource-monitors.md).

## Cost controls for warehouses

For a set of best practices that act as cost controls for virtual warehouses, see [Cost controls for warehouses](cost-controlling-controls.md).

---
title: Controlling network traffic with network policies
source: https://docs.snowflake.com/en/user-guide/network-policies.md
section: User Guide
---

# Controlling network traffic with network policies

You can use network policies to control *inbound* access to the Snowflake service and internal stage.

If you want to control *outbound* traffic from Snowflake to an external network destination, see
[External network access overview](../developer-guide/external-network-access/external-network-access-overview.md).

> **Note:**
>
> Network policies that existed before the introduction of network rules can no longer be modified in Snowsight. Use the
> [ALTER NETWORK POLICY](../sql-reference/sql/alter-network-policy.md) command instead.

## About network policies

By default, Snowflake allows users to connect to the service and internal stage from any computer or device. A security administrator
(or higher) can use a network policy to allow or deny access to a request based on its origin. The *allowed list* of the network policy
controls which requests are allowed to access the Snowflake service or internal stage, while the *blocked list* controls which
requests should be explicitly blocked.

A network policy does not directly specify the network identifiers in its allowed list or blocked list. Rather, a network policy adds
*network rules* to its allowed and blocked lists. These network rules group related identifiers into logical units that are added to the
allowed list and blocked list of a network policy.

> **Important:**
>
> Network policies that existed before the introduction of network rules still work. However, all new network policies should use network
> rules, not the `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST` parameters, to control access from IP addresses. Best practice is to
> avoid using both ways to restrict access in the same network policy.

### Workflow

The following list shows the general workflow of using network policies to control inbound network traffic:

1. Create network rules based on their purpose and type of network identifier.
2. Create one or more network policies that include the network rules that contain the
   identifiers to be allowed or blocked.
3. Activate the network policy for an account, user, or security integration. A network policy
   doesn’t restrict network traffic until it is activated.

### Interaction between allowed lists and blocked lists

When you add a network rule to the allowed list of a network policy, you do not have to use the blocked list to explicitly block other
identifiers of the same type; only the allowed identifiers have access. For example, if you add an IPv4 network rule with a single IP
address to the allowed list, all other IPv4 addresses are blocked. There is no need to use the blocked list to restrict access from other
IP addresses.

If a network policy has the same IP address values in both the `ALLOWED_IP_LIST` and the `BLOCKED_IP_LIST` parameters, Snowflake
applies the values in the `BLOCKED_IP_LIST` parameter first. This behavior also applies to the `ALLOWED_NETWORK_RULE_LIST` and
the `BLOCKED_NETWORK_RULE_LIST` parameters.

[Private connectivity](private-connectivity-inbound.md) network rules — that is, network rules of type AWSVPCEID or
AZURELINKID — take precedence over IPV4 network rules. If an incoming request uses private connectivity and there is a network rule of
type AWSVPCEID or AZURELINKID in the `ALLOWED_NETWORK_RULE_LIST` property, then all IPV4 network rules that contain public or private
IP ranges are ignored.

A network rule that uses private endpoint identifiers such as Azure LinkIDs or AWS VPCE IDs to restrict access has no effect
on requests coming from the public network. If you want to restrict access based on private endpoint identifiers, and then completely
block requests from public IPv4 addresses, you must create two separate network rules, one for the allowed list and another for the blocked
list.

The following network rules could be combined in a network policy to allow a VPCE ID while blocking public network traffic.

```sqlexample
CREATE NETWORK RULE block_public_access
  MODE = INGRESS
  TYPE = IPV4
  VALUE_LIST = ('0.0.0.0/0');

CREATE NETWORK RULE allow_vpceid_access
  MODE = INGRESS
  TYPE = AWSVPCEID
  VALUE_LIST = ('vpce-0fa383eb170331202');

CREATE NETWORK POLICY allow_vpceid_block_public_policy
  ALLOWED_NETWORK_RULE_LIST = ('allow_vpceid_access')
  BLOCKED_NETWORK_RULE_LIST=('block_public_access');
```

#### IP ranges

If you want to allow a range of IP addresses with the exception of a single IP address, you can create two network rules, one for the
allowed list and another for the blocked list.

For example, the following would allow requests from all IP addresses in the range of `192.168.1.0` to `192.168.1.255`, except
`192.168.1.99`. IP addresses outside the range are also blocked.

```sqlexample
CREATE NETWORK RULE allow_access_rule
  MODE = INGRESS
  TYPE = IPV4
  VALUE_LIST = ('192.168.1.0/24');

CREATE NETWORK RULE block_access_rule
  MODE = INGRESS
  TYPE = IPV4
  VALUE_LIST = ('192.168.1.99');

CREATE NETWORK POLICY public_network_policy
  ALLOWED_NETWORK_RULE_LIST = ('allow_access_rule')
  BLOCKED_NETWORK_RULE_LIST=('block_access_rule');
```

### Network policy precedence

You can apply a network policy to an account, a security integration, or a user. If there are network policies applied to more than one
of these, the most specific network policy overrides more general network policies. The following summarizes the order of precedence:

Account:
:   Network policies applied to an account are the most general network policies. They are overridden by network policies applied to a
    security integration or user.

User:
:   Network policies applied to a user override network policies applied to the account, but are overridden by a network
    policy applied to a security integration.

Security Integration:
:   Network policies applied to a security are the most specific network policies. They override both accounts and users.

    > **Note:**
    >
    > A network policy attached to a Snowflake OAuth integration takes precedence when there is network traffic between the client and
    > Snowflake, but doesn’t take precedence over the user-level policy when there is an interaction between the user and Snowflake as the
    > authorization server. For more information, see [Restricting network traffic for Snowflake OAuth](oauth-snowflake-overview.md).

### Bypassing a network policy

It is possible to temporarily bypass a network policy for a set number of minutes by configuring the user object property
`MINS_TO_BYPASS_NETWORK_POLICY`, which can be viewed by executing [DESCRIBE USER](../sql-reference/sql/desc-user.md). Only Snowflake can set the
value for this object property. Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to set a value for this property.

## About network rules

While restrictions on incoming requests to Snowflake are ultimately applied to an account, user, or security integration with a network
policy, the administrator can organize these restrictions using [network rules](network-rules.md), which are schema-level
objects.

Each network rule groups together the identifiers for a particular type of request origin. For example, one network rule
might include all of the IPv4 addresses that should be allowed to access Snowflake while another groups together all of the private
endpoints that should be blocked.

A network rule, however, does not specify whether it is allowing or blocking the origin of a request. It simply organizes
related origins into a logical unit. Administrators specify whether that unit should be allowed or blocked when they create
or modify a network policy.

If you already understand the strategies for using network rules with network policies, see Working with network rules.

### Best practices

* **Limit the scope**. Network rules are designed to group together small units of related network identifiers. Previously, network
  policies often contained a large, monolithic list of IP addresses that should be allowed or blocked. The introduction of network rules
  changes this strategy. For example, you could break up network identifiers by:

  + Creating a network rule to contain client IP addresses for the North American region, and a different rule for the Europe and Middle
    Eastern region.
  + Creating a network rule whose purpose is to allow access for a special population, such as highly privileged users and service account
    users. This network rule can be added to a network policy that is applied to individual users.
  + Creating a network rule that is scoped to one or more data apps.

  With the introduction of network rules, Snowflake recommends that you also limit the scope of network policies. Whenever possible,
  narrowly scope a network policy to a group of users or a security integration rather than an entire account.
* **Add comments**. When creating a network rule, use the `COMMENT` property to keep track of what the rule is supposed to do.
  Comments are important because Snowflake encourages a large number of small targeted rules over fewer monolithic ones.

  You can use the SHOW NETWORK RULES command to list all of the network rules, including their comments.

### Supported identifiers

Each network rule contains a list of one or more network identifiers of the same type (e.g. an IPv4 address rule or a private endpoint rule).

A network rule’s `TYPE` property identifies what type of identifiers the network rule contains.

For a complete list of the types of identifiers that can be restricted using network rules, see [Supported network identifiers](network-rules.md).

### Protecting the Snowflake service

This section discusses how to use network rules to restrict access to the Snowflake service only. If you want to restrict access to both the
service and the internal stage of an account on AWS, see Protecting internal stages on AWS.

To restrict access to the Snowflake service, set the `MODE` property of the network rule to `INGRESS`.

You can then use the `TYPE` property to specify the [identifiers](network-rules.md) that should be allowed or
blocked.

### Protecting internal stages on AWS

This section discusses how to use network rules to restrict access to internal stages on AWS, including how to simultaneously restrict
access to the Snowflake service and internal stage. It includes:

* Limitations
* Prerequisite: Enabling internal stage restrictions
* Guidelines for internal stages
* Strategy for protecting the internal stage only
* Strategies for protecting both service and internal stage

> **Note:**
>
> You cannot use a network rule to restrict access to an internal stage on Microsoft Azure. However, you can block all public access to an
> internal stage on Azure if you are using
> [Azure Private Link](https://learn.microsoft.com/en-us/azure/private-link/private-link-overview). For details, see [Blocking public access — Recommended](private-internal-stages-azure.md).

#### Limitations

* A network policy that is activated for a security integration does not restrict access to an internal stage.

#### Prerequisite: Enabling internal stage restrictions

To use network rules to restrict access to the internal stage of an account, the account administrator must enable the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../sql-reference/parameters.md) parameter . Network rules do not protect an internal stage until this parameter is
enabled, regardless of the rule’s mode.

To allow network rules to restrict access to internal stages, execute:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES = true;
```

#### Guidelines for internal stages

We recommend that you follow these guidelines when creating network policies and network rules to restrict access to internal stages.

* **Limit the number of identifiers**. Network policies used to protect an internal stage cannot contain an unlimited number of network
  identifiers. The limits vary depending on your Snowflake edition.

  > **Note:**
  >
  > If a network policy has more than one network rule, the combined number of identifiers from all network rules cannot exceed the limit
  > for the network policy.

  + **Standard and Enterprise editions**:

    - Maximum number of IPv4 address ranges is 10 per network rule.
    - Maximum number of VPCE IDs is 7 per network policy.
  + **Business Critical edition and higher**:

    - Maximum number of IPv4 address ranges is approximately 250 per network policy.
    - Maximum number of VPCE IDs is approximately 200 per network policy.
    - Maximum number of network policies is 50. If you need to increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* **Use same rule to protect both service and internal stage**. When a rule contains IPv4 addresses and the mode of a network rule is
  `INGRESS`, a single rule can protect both the Snowflake service and the internal stage of the account. Snowflake recommends using a
  single rule even when the IP addresses accessing the service are different from the IP addresses accessing the internal stage. This
  approach improves organization, manageability, and auditing.
* **Test Network Policies**. Snowflake recommends testing network rules using user-level network policies. If you encounter
  `PolicySizeExceeded` exceptions when fetching the scoped credentials from AWS STS, break up the network identifiers into smaller network
  rules.

#### Strategy for protecting the internal stage only

To restrict access to an AWS internal stage without affecting how network traffic accesses the Snowflake service, create a network rule with
the following settings:

* Set the `MODE` parameter to `INTERNAL_STAGE`.
* Set the `TYPE` parameter to `AWSVPCEID`.

> **Note:**
>
> You cannot restrict access to the internal stage based on the IP address of the request without also restricting access to the Snowflake
> service.

#### Strategies for protecting both service and internal stage

When restricting access to both the Snowflake service and internal stage, the implementation strategy varies based on whether network
traffic is traversing the public internet or AWS Private Link.

In the following comparison, “Public” indicates that traffic to the service or internal stage is traversing the public internet while
“Private” indicates traffic is using AWS Private Link. Find the combination that matches your environment, and then choose the
implementation strategy accordingly.

| Service Connection | Internal Stage Connection | Implementation Strategy |
| --- | --- | --- |
| Public | Public | Create a single network rule with `TYPE=IPV4` and `MODE=INGRESS`. Include all IP addresses that access the service and internal stage. |
| Private | Private | Strategy depends on whether you want to restrict access using private IP addresses or the VPCE ID of the VPC endpoints:   * **(Recommended)** If using VPCE IDs, you must create two network rules, even if the same VPC endpoint is connecting to both the   service and the internal stage.    + For the service, create a network rule with `TYPE=AWSVPCEID` and `MODE=INGRESS`.   + For the internal stage, create a network rule with `TYPE=AWSVPCEID` and `MODE=INTERNAL_STAGE`. * If using private IP addresses, create a network rule with `TYPE=IPV4` and `MODE=INGRESS`. Include all private IP addresses that   access the service and internal stage. |
| Public [1] | Private | Strategy depends on whether you want to restrict access to the internal stage using private IP addresses or VPCE ID of the VPC endpoints:   * **(Recommended)** If using VPCE IDs, create two network rules, one for the service and one for the internal stage.    + For the service, create a network with `TYPE=IPV4` and `MODE=INGRESS`.   + For the internal stage, create a network rule with `TYPE=AWSVPCEID` and `MODE=INTERNAL_STAGE`. * If using private IP addresses, create a single network rule with `TYPE=IPV4` and `MODE=INGRESS`. Include all IP addresses that   access the service and internal stage. |
| Private | Public [1] | You must use private IPs for the service (cannot use VPCE IDs). Create a single network rule with `TYPE=IPV4` and `MODE=INGRESS`. Include all IP addresses that access the service and internal stage. |

[1]
(1,2)

If you have implemented private connectivity to either the service or the internal stage, Snowflake recommends implementing it for both.

### Protecting Snowflake-managed storage volumes on AWS

This section discusses how to use network rules to restrict access to Snowflake-managed storage volumes on AWS, including how to
simultaneously restrict access to the Snowflake service and Snowflake-managed storage volume. It includes:

* Limitations
* Prerequisite: Enabling Snowflake-managed storage volume restrictions
* Guidelines for Snowflake-managed storage volumes
* Strategy for protecting the Snowflake-managed storage volume only

> **Note:**
>
> You can’t use a network rule to restrict access to a Snowflake-managed storage volume on Microsoft Azure. However, you can block all
> public access to a Snowflake-managed storage volume on Azure if you are using
> [Azure Private Link](https://learn.microsoft.com/en-us/azure/private-link/private-link-overview). For details, see
> [Blocking public access](private-managed-volumes-azure.md).

#### Limitations

* A network policy that is activated for a security integration does not restrict access to a Snowflake-managed storage volume.
* Network policies that protect Snowflake-managed storage volumes can only be applied at the account level

#### Prerequisite: Enabling Snowflake-managed storage volume restrictions

To use network rules to restrict access to the Snowflake-managed storage volume of an account, the account administrator must enable the
[ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME](../sql-reference/parameters.md) parameter. Network rules don’t protect a Snowflake-managed storage
volume until this parameter is enabled, regardless of the rule’s mode.

To allow network rules to restrict access to Snowflake-managed storage volumes, execute:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME = true;
```

#### Guidelines for Snowflake-managed storage volumes

We recommend that you follow these guidelines when creating network policies and network rules to restrict access to managed
storage volumes.

* **Limit the number of identifiers**. Network policies used to protect a Snowflake-managed storage volume can’t contain an unlimited
  number of network identifiers.

  > **Note:**
  >
  > If a network policy has more than one network rule, the combined number of identifiers from all network rules can’t exceed
  > the limit for the network policy.

  + **All Snowflake editions**:

    - Maximum number of IPv4 address ranges is 10 per network rule.
    - Maximum number of VPCE IDs is 7 per network policy.

#### Strategy for protecting the Snowflake-managed storage volume only

To restrict access to an AWS Snowflake-managed storage volume without affecting how network traffic accesses the Snowflake service,
create a network rule with the following settings:

* Set the `MODE` parameter to `SNOWFLAKE_MANAGED_STORAGE_VOLUME`.
* Set the `TYPE` parameter to `AWSVPCEID`.

For example:

```sqlexample
CREATE NETWORK RULE managed_volume_rule
  TYPE = AWSVPCEID
  VALUE_LIST = ('vpce-123abc3420c1931')
  MODE = SNOWFLAKE_MANAGED_STORAGE_VOLUME;

CREATE NETWORK POLICY managed_volume_policy
  ALLOWED_NETWORK_RULE_LIST = ('managed_volume_rule');

ALTER ACCOUNT SET NETWORK_POLICY = managed_volume_policy;
```

## Working with network rules

You can use Snowsight or SQL to manage the lifecycle of a network rule.

### Create a network rule

You need the CREATE NETWORK RULE privilege on the schema to create a network rule. By default, only the ACCOUNTADMIN and SECURITYADMIN
roles, along with the schema owner, have this privilege.

The mode of a network rule that will be used by a network policy must be `INGRESS` or `INTERNAL STAGE`.

To gain a better understand of best practices and strategies for creating network rules, see About network rules.

You can create a network rule using Snowsight or by executing a SQL command:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Rules tab.
    3. Select + Network Rule.
    4. Enter the name of the network rule.
    5. Select the schema of the network rule. Network rule are schema-level objects.
    6. Optionally, add a descriptive comment for the network rule to help organize and maintain network rules in the schema.
    7. In the Type drop-down, select the [type of identifier](network-rules.md) being defined in the network
       rule. The Host Port type is not a valid option for network rules being used with network policies.
    8. In the Mode drop-down, select Ingress or Internal Stage. The Egress mode is not a valid option for network
       rules being used with network policies.
    9. Enter a comma-separated list of the identifiers that will be allowed or blocked when the network rule is added to a network policy. The
       identifiers in this list must all be of the type specified in the Type drop-down.
    10. Select Create Network Rule.

SQL:
:   An administrator can execute the [CREATE NETWORK RULE](../sql-reference/sql/create-network-rule.md) command to create a new network rule, specifying a list of
    network identifiers along with the type of those identifiers.

    For example, to use a custom role to create a network rule that can be used to allow or block traffic from a range of IP addresses:

    ```sqlexample
    GRANT USAGE ON DATABASE securitydb TO ROLE network_admin;
    GRANT USAGE ON SCHEMA securitydb.myrules TO ROLE network_admin;
    GRANT CREATE NETWORK RULE ON SCHEMA securitydb.myrules TO ROLE network_admin;
    USE ROLE network_admin;

    CREATE NETWORK RULE cloud_network TYPE = IPV4 VALUE_LIST = ('47.88.25.32/27');
    ```

### Modify a network rule

You can modify the network rule using Snowsight or SQL.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Rules tab.
    3. Find the network rule, select the … button, and then select Edit.
    4. Modify the network rule as needed.
    5. Select Update Network Rule.

SQL:
:   Execute an [ALTER NETWORK RULE](../sql-reference/sql/alter-network-rule.md) statement.

## Working with network policies

Once you have grouped network identifiers into network rules, you are ready to add those network rules to the allowed list and blocked list
of a new or existing network policy. There is no limit on how many network rules can be added to a network policy.

For general information about how network policies control inbound access to the Snowflake service and internal stage, see
About network policies.

### Create a network policy

Only security administrators (i.e. users with the SECURITYADMIN role) or higher or a role with the global CREATE NETWORK POLICY
privilege can create network policies. Ownership of a network policy can be transferred to another role.

> **Caution:**
>
> `0.0.0.0/0` refers to all public and private IPv4 address ranges. Use a network rule to block public access and add the
> network rule to the `BLOCKED_NETWORK_RULE_LIST` property of the network policy.
>
> The network policy evaluation considers any network rule properties before the `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST`
> network policy properties:
>
> * The network rule `TYPE` property for `AWSVPCEID` and `AZURELINKID` takes precedence over any `TYPE = IPV4`
>   value.
> * If there are no network rules, the network policy evaluation considers the `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST`
>   network policy properties and their values.
>
> Before you block all public access with a network rule, ensure that you have a network rule added to a network policy to allow access to
> Snowflake. If you are using private connectivity to the Snowflake service, such as AWS PrivateLink, configure this service and update the
> network rule and network policy accordingly.
>
> If you try to create an empty network policy, no IPv4 addresses are allowed to access your Snowflake account.

> **Caution:**
>
> When defining the network policy for a Snowflake Open Catalog account, ensure the allowed list of the network policy includes at least one IP
> address that you intend to use to access the account. Otherwise, you may get locked out of the account.

You can create a network policy using [Snowsight](ui-snowsight-gs.md) or SQL:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Policies tab.
    3. Select + Network Policy.
    4. Enter the name of the network policy.
    5. Optionally, enter a descriptive comment.
    6. To add a network rule to the allowed list, select Allowed, and then select Select rule. You can add multiple network rules
       to the allowed list by re-selecting Select rule.
    7. To add a network rule to the blocked list, select Blocked, and then select Select rule. You can add multiple network rules
       to the blocked list by re-selecting Select rule.
    8. Select Create Network Policy.

SQL:
:   Execute a [CREATE NETWORK POLICY](../sql-reference/sql/create-network-policy.md) statement.

### Identify network policies in your account

You can identify the network policies in your account using Snowsight or SQL.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Policies tab.

SQL:
:   Do one of the following:

    * Call the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function.
    * Query the [POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) or
      [NETWORK_POLICIES](../sql-reference/account-usage/network_policies.md) Account Usage view.
    * Run the [SHOW PARAMETERS](../sql-reference/sql/show-parameters.md) command as follows:

      ```sqlexample
      SHOW PARAMETERS LIKE 'network_policy' IN ACCOUNT;
      ```

### Modify a network policy

You can add or remove network rules from the allowed list and blocked list of an existing network policy using Snowsight or SQL. If
you are editing a network policy that uses the `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST` parameters instead of a network rule, you
must use SQL to modify the network policy.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Policies tab.
    3. Find the network policy, select the … button, and then select Edit.
    4. To add a network rule to the allowed list, select Allowed, and then select Select rule. You can add multiple network rules
       to the allowed list by re-selecting Select rule.
    5. To add a network rule to the blocked list, select Blocked, and then select Select rule. You can add multiple network rules
       to the blocked list by re-selecting Select rule.
    6. To remove a network rule from the allowed list or blocked list of the network policy:

       1. Select Allowed or Blocked.
       2. Find the network rule in the list and select X to remove.

SQL:
:   Use the [ALTER NETWORK POLICY](../sql-reference/sql/alter-network-policy.md) command to add or remove network rules from an existing network policy.

    When adding a network rule to the allowed list or blocked list, you can either replace all existing network rules in the list or add the
    new rule while keeping the existing list. The following examples show each of these options:

    * Use the SET clause to replace network rules in the blocked list with a new network rule named `other_network`:

      > ```sqlexample
      > ALTER NETWORK POLICY my_policy SET BLOCKED_NETWORK_RULE_LIST = ( 'other_network' );
      > ```
    * Use the ADD clause to add a single network rule to the allowed list of an existing network policy. Network rules that were previously
      added to the policy’s allowed list remain in effect.

      > ```sqlexample
      > ALTER NETWORK POLICY my_policy ADD ALLOWED_NETWORK_RULE_LIST = ( 'new_rule' );
      > ```

    You can also remove a network rule from an existing list without replacing the entire list. For example, to remove a network rule from
    the network policy’s blocked list:

    ```sqlexample
    ALTER NETWORK POLICY my_policy REMOVE BLOCKED_NETWORK_RULE_LIST = ( 'other_network' );
    ```

## Activating a network policy

A network rule does not restrict inbound network traffic until it has been activated for an account, user, or security integration. For
instructions on how to activate at each level, see:

* Activate a network policy for your account
* Activate network policies for individual users
* Activate network policies for security integrations

If you are activating multiple network policies at different levels (for example, both account- and user-level network policies), see
Network policy precedence.

### Activate a network policy for your account

Activating a network policy for an account enforces the policy for all users in the account.

Only security administrators (i.e. users with the SECURITYADMIN role) or higher or a role with the global ATTACH POLICY privilege
can activate a network policy for an account.

Once the policy is associated with your account, Snowflake restricts access to your account based on the allowed list and blocked
list. Any user who attempts to log in from an network origin restricted by the rules is denied access. In addition, when a network
policy is associated with your account, any restricted users who are already logged into Snowflake are prevented from executing further
queries.

You can create multiple network policies, however only one network policy can be associated with an account
at any one time. Associating a network policy with your account automatically removes the currently-associated network policy (if any).

Note that your current IP address or private endpoint identifier must be included in the allowed list in the policy. Otherwise, when you
activate the policy, Snowflake returns an error. In addition, your current identifier cannot be included in the blocked list.

If you want to determine whether there is already an account-level network policy before activating a new one, see
Identify an activated network policy.

You can activate a network policy for your account using Snowsight or SQL:

Snowsight:
:   1. In the navigation menu, select Governance & security » Network policies, and then select the Network Policies tab.
    2. Find the network policy, select the … button, and then select Activate.
    3. Select Activate policy.

SQL:
:   Execute the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) statement to set the [NETWORK_POLICY](../sql-reference/parameters.md)
    parameter for the account. For example:

    ```sqlexample
    ALTER ACCOUNT SET NETWORK_POLICY = my_policy;
    ```

### Activate network policies for individual users

To enforce a network policy for a specific user in your Snowflake account, activate the network policy for the user. Only a single network
policy can be activated for each user at a time. The ability to activate different network policies for different users allows for granular
control. Associating a network policy with a user automatically removes the currently-associated network policy (if any).

> **Note:**
>
> A role with the OWNERSHIP privilege on the user and USAGE privilege on the network policy, or a higher role, can activate a network policy
> for an individual user.

Once the policy is associated with the user, Snowflake restricts access to the user based on the allowed list and blocked
list. If the user with an activated user-level network policy attempts to log in from a network location restricted by the rules, the user
is denied access to Snowflake.

In addition, when a user-level network policy is associated with the user and the user is already logged into Snowflake, if the user’s
network location does not match the user-level network policy rules, Snowflake prevents the user from executing further queries.

If you want to determine whether there is already a user-level network policy before activating a new one, see
Identify an activated network policy.

To activate a network policy for an individual user, execute the [ALTER USER](../sql-reference/sql/alter-user.md) command to set the
[NETWORK_POLICY](../sql-reference/parameters.md) parameter for the user. For example, execute:

```sqlexample
ALTER USER joe SET NETWORK_POLICY = my_policy;
```

### Activate network policies for security integrations

Some security integrations support activating a network policy to control network traffic that is governed by that integration. These
security integrations have a NETWORK_POLICY parameter that activates the network policy for the integration. Currently,
SCIM, Snowflake OAuth, and External OAuth support integration-level network policies.

> **Note:**
>
> A network policy that is activated for a security integration does not restrict access to an internal stage.

For example, you could activate a network policy when creating a new Snowflake OAuth security integration. The network policy would restrict
the access of requests trying to authenticate.

> ```sqlexample
> CREATE SECURITY INTEGRATION oauth_kp_int
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = custom
>   OAUTH_CLIENT_TYPE = 'CONFIDENTIAL'
>   OAUTH_REDIRECT_URI = 'https://example.com'
>   NETWORK_POLICY = mypolicy;
> ```

You can execute the ALTER SECURITY INTEGRATION … SET NETWORK_POLICY statement to activate a network policy for an existing security
integration.

### Identify an activated network policy

You can identify which network policy is activated at the account, user, or integration level.

Account:
:   1. In the navigation menu, select Governance & security » Network policies, and then select the Network Policies tab.
    2. Sort the Status column to view the network policies.

       The Status column shows active and inactive network policies. Select the column value to view more details about the network
       policy, edit the policy, and delete the network policy. You can activate and deactivate a network policy that is set on your account.

    Alternatively, you can call the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) function and specify a network policy. The values in
    the `ref_entity_name` and `ref_entity_domain` columns for an individual row indicate the object on which the network policy
    is set.

## Using replication with network policies and network rules

Snowflake supports replication and failover/failback for network policies and network rules, including the assignment of the network policy.

For details, refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

---
title: Convert an Apache Iceberg™ table to use Snowflake as the catalog
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-conversion.md
section: User Guide
---

# Convert an Apache Iceberg™ table to use Snowflake as the catalog

Convert an [Apache Iceberg™ table](tables-iceberg.md) that Snowflake doesn’t manage into a
table that uses Snowflake as the Iceberg catalog.

You might choose to convert a table when you want full Snowflake platform support,
including support for the [Snowflake Catalog SDK](tables-iceberg-catalog.md).

To learn about the differences between Iceberg table types, see [Catalog options](tables-iceberg.md).

## Before and after table conversion

When you convert an Iceberg table to use Snowflake as the catalog, the table becomes writable and Snowflake assumes life-cycle management
for it.

The following table compares Iceberg tables before and after conversion:

|  | Before conversion | After conversion |
| --- | --- | --- |
| Iceberg catalog | An external catalog (such as AWS Glue), or no catalog at all. Requires a catalog integration. | Snowflake. Snowflake registers changes to the source data and registers the changes in the Snowflake catalog. Snowflake then updates the table metadata on your external volume.  Does not require a catalog integration. |
| Snowflake read operations | ✔ | ✔ |
| Snowflake write operations | ❌ | ✔ |
| Storage location for table data and metadata | External volume (external cloud storage). | External volume (external cloud storage) under a base location that you specify. |
| Data and metadata cleanup | Managed by you or your external catalog. | Managed by Snowflake. Snowflake never deletes any metadata, manifest lists, or manifests created before conversion from your external storage. Snowflake doesn’t rewrite any Parquet data files during conversion. *After* you convert a table, Snowflake might rewrite some of the data files as part of regular table maintenance. |
| Accessible from the Snowflake Catalog SDK | ❌ | ✔ |

> **Important:**
>
> When you convert an Iceberg table, Snowflake doesn’t lock down or assume sole access to your external storage.
> To prevent table corruption, ensure that you monitor or stop any non-Snowflake writes
> (such as automated maintenance jobs) to your external storage location.

> **Note:**
>
> Iceberg partitioning is removed when you convert a table.

## Requirements

Before you convert an Iceberg table, ensure that Snowflake can write to your external volume.

For Snowflake to write to your external volume, the following conditions must be met:

> * Use the ALTER ICEBERG TABLE … REFRESH command to manually refresh the table before you convert it.
> * The `ALLOW_WRITES` property for your external volume is set to `TRUE`. To update the value of this property for an existing
>   external volume, use the [ALTER EXTERNAL VOLUME](../sql-reference/sql/alter-external-volume.md) command.
>   For example: `ALTER EXTERNAL VOLUME my_ext_vol SET ALLOW_WRITES=TRUE`.
> * The access control permissions that you set on the cloud storage account must allow write access. For example,
>   if you use an external volume configured for Amazon S3, your IAM role must have the `s3:PutObject` permission for your S3 location.

> **Note:**
>
> Converting a table that has an un-materialized identity partition column isn’t supported.
> An un-materialized identity partition column is created when a table defines an identity transform
> using a source column that doesn’t exist in a Parquet file.

## Example: Convert a table

> **Important:**
>
> When you convert an Iceberg table, Snowflake doesn’t lock down or assume sole access to your external storage.
> To prevent table corruption, ensure that you monitor or stop any non-Snowflake writes
> (such as automated maintenance jobs) to your external storage location.

This example starts by creating an Iceberg table from Iceberg files in object storage.
Snowflake uses the `METADATA_FILE_PATH` value to look for the table metadata in the following location for column definitions:
`<ext-vol-storage-base-url>/path/to/metadata/v1.metadata.json`.

```sqlexample
CREATE ICEBERG TABLE myIcebergTable
  EXTERNAL_VOLUME='icebergMetadataVolume'
  CATALOG='icebergCatalogInt'
  METADATA_FILE_PATH='path/to/metadata/v1.metadata.json';
```

Next, use the ALTER ICEBERG TABLE … REFRESH command to synchronize the table metadata with the latest metadata file.
The following example command refreshes the table by specifying a metadata file path.

```sqlexample
ALTER ICEBERG TABLE myIcebergTable REFRESH 'metadata/v2.metadata.json';
```

Finally, convert the table to use Snowflake as the Iceberg catalog by using the
[ALTER ICEBERG TABLE … CONVERT TO MANAGED](../sql-reference/sql/alter-iceberg-table-convert-to-managed.md) command.

```sqlexample
ALTER ICEBERG TABLE myIcebergTable CONVERT TO MANAGED
  BASE_LOCATION = 'my/relative/path/from/external_volume';
```

> **Note:**
>
> In this example, the ALTER statement must specify a `BASE_LOCATION` because the table was created
> from Iceberg files in object storage and `BASE_LOCATION` was not part of the original CREATE ICEBERG TABLE statement.
> The `BASE_LOCATION` defines the relative path from your external
> volume to a directory where Snowflake writes table data and metadata for the converted table.
>
> Otherwise, if `BASE_LOCATION` was specified in the original CREATE ICEBERG TABLE statement, you don’t need to include it in your
> ALTER ICEBERG TABLE … CONVERT TO MANAGED command.

For example, Snowflake writes table data to
`<ext-vol-storage-base-url>/myBaseLocation/data/`.

Snowflake writes metadata for the converted table to `<ext-vol-storage-base-url>/myBaseLocation/metadata/`.

## Conversion and data types

> **Note:**
>
> You can’t convert a table the uses the following Iceberg data types:
>
> * `uuid`
> * `fixed(L)`

Snowflake uses Snowflake data types to process and return values,
but writes the original Iceberg types to table data files.

For data types such as `int` and `long`, the Snowflake data type supports a larger range of values than the Iceberg data type.
To stay consistent with the source data type, Snowflake does not allow inserting values outside the range that the
source data type supports. For more information, see [Approximate types](tables-iceberg-data-types.md).

---
title: Cookies in the Snowflake web interface
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-cookies.md
section: User Guide
---

# Cookies in the Snowflake web interface

Snowflake uses cookies and similar technologies to recognize you when you visit Snowflake web applications and product websites.
This documentation explains what these technologies are, why Snowflake uses them, and your rights to control Snowflake’s
use of these technologies.

Cookies set by the website owner (in this case, Snowflake) are called *first-party cookies*. Cookies set by parties other than the
website owner are called *third-party cookies*. Third-party cookies enable third party features or functionality to be provided on or
through the website, such as interactive video content for listings on the Snowflake Marketplace.

Snowflake uses different types of first-party and third-party cookies for different reasons. Cookies are categorized as necessary, performance, functional, or targeting:

Necessary Cookies:
:   Strictly necessary to provide you with the services available through Snowflake sites and to use some of its features.

Performance Cookies:
:   Allow Snowflake to count visits and traffic sources to measure and improve the performance of Snowsight. All information
    collected by these cookies is aggregated.

Functional Cookies:
:   Enable Snowsight to provide enhanced functionality and personalization. These cookies might be set by Snowflake, or by
    third-party providers whose services Snowflake has added to Snowsight pages. If you do not allow these cookies, some or all of
    these services may not function properly.

Targeting Cookies:
:   Used to track online activity for a more personalized experience, such as by offering relevant content or advertisements.

## Cookies used by Snowflake

This table details the specific cookies used by Snowflake. Cookies labeled Necessary are required for technical reasons, while
other cookies are used to track users and enhance the product experience. Third parties serve cookies through the web interface for
analytics, improving user experience, and other purposes.

| Cookie Name | Domain | Type | Purpose |
| --- | --- | --- | --- |
| `user-<encoded_string>` | `apps-api.<string>.<string>.<string>.snowflake.com` | Necessary | Used for user authentication in Snowsight. |
| `csrf-<token>` | `apps-api.<string>.<string>.<string>.snowflake.com` | Necessary | Used to carry the cross-site request forgery (CSRF) security token between the server and the client in Snowsight. |
| `oauth-nonce-<8-bit-value>` | `app.snowflake.com` | Necessary | Used to prevent OAuth CSRF attacks. |
| `snowflakeContext` | `app.snowflake.com` | Necessary | Used to determine the active customer account. |
| `__dd_s` | `app.snowflake.com` | Necessary | Used for support and SLA management. |
| `S8_SESSION_<username>__<accountUrl>` | `apps-api.<string>.<string>.<string>.snowflake.com` | Necessary | Used for user authentication. |
| `snowflake_deployment` | `app.snowflake.com` | Necessary | Used for randomized deployment of asset retrieval to support content delivery network (CDN) functionality. |
| `__stripe_mid` | `app.snowflake.com` | Necessary | Used for Stripe fraud prevention if using payment processing functionality. |
| `docai_version` | `app.snowflake.com` | Necessary | Used for blue/green deployment of updates to applications. |
| `PREF*`, `VSC*`, `VISITOR_INFO1_LIVE*`, `NID`, `remote_sid*` | `app.snowflake.com` | Necessary | Used for YouTube’s privacy-enhanced mode if you select or play a YouTube video, such as on the Snowflake Marketplace. This mode does not store personally-identifiable cookie information for playbacks of embedded videos. |

## Manage cookies used by Snowflake

You can control and manage cookies by setting or amending your web browser controls to accept or refuse all cookies. Follow the guidance
provided by your web browser for more details.

Snowflake interprets the use of a Do Not Track signal as an objection to the placement of Targeting cookies.

> **Note:**
>
> Due to a lack of standardization in Do Not Track settings, this setting might not work with Snowflake.

Snowflake does not currently provide functionality for end-user cookie consent management, but Snowflake is compatible with many existing
third-party solutions that provide this functionality. You can work with your internal IT teams or consult with your Snowflake implementation
partners to identify the right cookie solution for your organization’s needs.

If you choose to reject cookies, you can still use Snowsight. Your access to some functionality and
areas of the web interface might be restricted.

## More information about Snowflake, cookies, and tracking technology

If you have questions about Snowflake’s use of cookies or other technologies, email `privacy@snowflake.com`

---
title: Copy data from a Google Cloud Storage stage
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-copy.md
section: User Guide
---

# Copy data from a Google Cloud Storage stage

Load data from your staged files into the target table.

## Load your data

Execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) to load your data into the target table.

> **Note:**
>
> Loading data requires a [warehouse](warehouses.md). If you are using a warehouse that is
> not configured to auto resume, execute [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) to resume the warehouse. Note
> that starting the warehouse could take up to five minutes.
>
> > ```sqlexample
> > ALTER WAREHOUSE mywarehouse RESUME;
> > ```

> **Important:**
>
> The list of objects returned for an external stage may include one or more “directory blobs”; essentially, paths that end in a forward slash character (`/`), e.g.:
>
> ```sqlexample
> LIST @my_gcs_stage;
>
> +---------------------------------------+------+----------------------------------+-------------------------------+
> | name                                  | size | md5                              | last_modified                 |
> |---------------------------------------+------+----------------------------------+-------------------------------|
> | my_gcs_stage/load/                    |  12  | 12348f18bcb35e7b6b628ca12345678c | Mon, 11 Sep 2019 16:57:43 GMT |
> | my_gcs_stage/load/data_0_0_0.csv.gz   |  147 | 9765daba007a643bdff4eae10d43218y | Mon, 11 Sep 2019 18:13:07 GMT |
> +---------------------------------------+------+----------------------------------+-------------------------------+
> ```
>
> These blobs are listed when directories are created in the Google Cloud console rather than using any other tool provided by Google.
>
> COPY statements that reference a stage can fail when the object list includes directory blobs. To avoid errors, we recommend using file pattern matching to identify the files for inclusion (i.e. the PATTERN clause) when the file list for a stage includes directory blobs. For an example, see Load data using pattern matching (in this topic). Alternatively, set ON_ERROR = SKIP_FILE in the COPY statement.

### Load data using pattern matching

The following example loads data from files in the named `my_gcs_stage` stage created in [Configure an integration for Google Cloud Storage](data-load-gcs-config.md). Using pattern matching, the statement only loads files whose names start with the string `sales`:

> ```sqlexample
> COPY INTO mytable
>   FROM @my_gcs_stage
>   PATTERN='.*sales.*.csv';
> ```

Note that file format options are not specified because a named file format was included in the stage definition.

### Load data using a path / prefix

The following example loads all files with the `data/files` path (i.e. prefix) in your Cloud Storage bucket using the named `my_csv_format` file format created in [Preparing to load data](data-load-prepare.md). Note that a path can be combined with pattern matching:

> ```sqlexample
> COPY INTO mytable
>   FROM @my_gcs_stage/mybucket/data/files
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

### Load data using ad hoc file format options

The following ad hoc example loads data from all files in the Cloud Storage bucket. The COPY command
specifies file format options instead of referencing a named file format. This example loads CSV files
with a pipe (`|`) field delimiter. The COPY command skips the first line in the data files.

Note that the storage integration reference is required in ad hoc data loads; that is, when the
COPY statement does not reference a stage:

```sqlexample
COPY INTO mytable
  FROM 'gcs://mybucket/data/files'
  STORAGE_INTEGRATION = myint
  FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|' SKIP_HEADER = 1);
```

## Validate your data

Before loading your data, you can validate that the data in the uploaded files will load correctly.

To validate data in an uploaded file, execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) in validation mode using
the VALIDATION_MODE parameter. The VALIDATION_MODE parameter returns errors that it encounters in the file. You
can then modify the data in the file to ensure it loads without error.

In addition, [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) provides the ON_ERROR copy option to specify an action
to perform if errors are encountered in a file during loading.

## Monitor data loads

Snowflake retains historical data for COPY INTO commands executed within the previous 14 days. The metadata can be used to monitor and
manage the loading process, including deleting files after upload completes:

* Monitor the status of each [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command on the Query History page of Snowsight.
* Use the [LOAD_HISTORY](../sql-reference/info-schema/load_history.md) Information Schema view to retrieve the history of data loaded into tables
  using the COPY INTO command.

## Copy files from one stage to another

Use the [COPY FILES](../sql-reference/sql/copy-files.md) command to organize data into a single location
by copying files from one named stage to another.

The following example copies all of the files from a source stage (`src_stage`) to a target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage;
```

You can also specify a list of file names to copy, or copy files by using pattern matching.
For information, see the [COPY FILES examples](../sql-reference/sql/copy-files.md).

---
title: Copy data from an Azure stage
source: https://docs.snowflake.com/en/user-guide/data-load-azure-copy.md
section: User Guide
---

# Copy data from an Azure stage

Load data from your staged files into the target table.

## Load your data

Execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) to load your data into the target table.

> **Note:**
>
> Loading data requires a [warehouse](warehouses.md). If you’re using a warehouse that is not configured to auto resume, execute [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) to resume the warehouse. Note that starting the warehouse could take up to five minutes.
>
> > ```sqlexample
> > ALTER WAREHOUSE mywarehouse RESUME;
> > ```

The following example loads data from files in the named `my_azure_stage` stage created in [Create an Azure stage](data-load-azure-create-stage.md). Using pattern matching, the statement only loads files whose names start with the string `sales`:

> ```sqlexample
> COPY INTO mytable
>   FROM @my_azure_stage
>   PATTERN='.*sales.*.csv';
> ```

Note that file format options are not specified because a named file format was included in the stage definition.

The following example loads all files prefixed with `data/files` in your Azure container using the named `my_csv_format` file format created in [Preparing to load data](data-load-prepare.md):

```sqlexample
COPY INTO mytable
  FROM 'azure://myaccount.blob.core.windows.net/mycontainer/data/files'
  CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=abcDEFGHIjklmNOPqrsTUVwxyZ123456789%3D')
  ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'aBcDeFGHI0jklMnoP0QrsTUVWXyz1234567891abcDEFG=')
  FILE_FORMAT = (FORMAT_NAME = my_csv_format);
```

The following ad hoc example loads data from all files in the Azure container. The COPY command specifies file format options instead of referencing a named file format. This example loads CSV files with a pipe (`|`) field delimiter. The COPY command skips the first line in the data files:

```sqlexample
COPY INTO mytable
  FROM 'azure://myaccount.blob.core.windows.net/mycontainer/data/files'
  CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=abcDEFGHIjklmNOPqrsTUVwxyZ123456789%3D')
  ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'aBcDeFGHI0jklMnoP0QrsTUVWXyz1234567891abcDEFG=')
  FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|' SKIP_HEADER = 1);
```

## Validate your data

Before loading your data, you can validate that the data in the uploaded files will load correctly.

To validate data in an uploaded file, execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) in validation mode using the VALIDATION_MODE parameter. The VALIDATION_MODE parameter returns errors that it encounters in the file. You can then modify the data in the file to ensure it loads without error.

In addition, [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) provides the ON_ERROR copy option to specify an action to perform if errors are encountered in a file during loading.

> **Note:**
>
> While Azure storage accounts support the MD5 File Validation feature, Azure doesn’t calculate MD5 hash values for files larger than 100 MB. For more information, see [MD5 hash calculation for large files](https://learn.microsoft.com/answers/questions/282572/md5-hash-calculation-for-large-files).

## Monitor data loads

Snowflake retains historical data for COPY INTO commands executed within the previous 14 days. The metadata can be used to monitor and
manage the loading process, including deleting files after upload completes:

* Monitor the status of each [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command on the Query History page of Snowsight.
* Use the [LOAD_HISTORY](../sql-reference/info-schema/load_history.md) Information Schema view to retrieve the history of data loaded into tables
  using the COPY INTO command.

## Copy files from one stage to another

Use the [COPY FILES](../sql-reference/sql/copy-files.md) command to organize data into a single location
by copying files from one named stage to another.

The following example copies all of the files from a source stage (`src_stage`) to a target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage;
```

You can also specify a list of file names to copy, or copy files by using pattern matching.
For information, see the [COPY FILES examples](../sql-reference/sql/copy-files.md).

---
title: Copy data from an internal stage
source: https://docs.snowflake.com/en/user-guide/data-load-local-file-system-copy.md
section: User Guide
---

# Copy data from an internal stage

Load data from your staged files into the target table.

## Load your data

Execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) to load your staged data into the target table.

> **Note:**
>
> Loading data requires a [warehouse](warehouses.md). If you are using a warehouse that is
> not configured to auto resume, execute [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) to resume the warehouse. Note
> that starting the warehouse could take up to five minutes.
>
> > ```sqlexample
> > ALTER WAREHOUSE mywarehouse RESUME;
> > ```

### User stage

The following example loads data from all files prefixed with `staged` in your user stage using the named `my_csv_format` file format created in [Preparing to load data](data-load-prepare.md):

```sqlexample
COPY INTO mytable from @~/staged FILE_FORMAT = (FORMAT_NAME = 'my_csv_format');
```

### Table stage

The following ad hoc example loads data from all files in the stage for the `mytable` table. The COPY command specifies file format options instead of referencing a named file format. This example
loads CSV files with a pipe (`|`) field delimiter. The COPY command skips the first line in the data files:

```sqlexample
COPY INTO mytable FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|' SKIP_HEADER = 1);
```

Note that when copying data from files in a table stage, the FROM clause can be omitted because Snowflake automatically checks for files in the table stage.

### Named stage

The following example loads data from all files from the `my_stage` named stage, which was created in [Choosing an internal stage for local files](data-load-local-file-system-create-stage.md):

```sqlexample
COPY INTO mytable from @my_stage;
```

Note that a file format does not need to be specified because it is included in the stage definition.

## Validate your data

Before loading your data, you can validate that the data in the uploaded files will load correctly.

To validate data in an uploaded file, execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) in validation mode using
the VALIDATION_MODE parameter. The VALIDATION_MODE parameter returns any errors that it encounters in a file. You
can then modify the data in the file to ensure it loads without error.

In addition, the ON_ERROR copy option for the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command indicates what action
to perform if errors are encountered in a file during loading.

## Monitor files staged internally

Snowflake maintains detailed metadata for each file uploaded into internal stage (for users, tables, and stages), including:

* File name
* File size (compressed, if compression was specified during upload)
* LAST_MODIFIED date, i.e. the timestamp when the data file was initially staged or when it was last modified, whichever is later

In addition, Snowflake retains historical data for COPY INTO commands executed within the previous 14 days. The metadata can be used to monitor and
manage the loading process, including deleting files after upload completes:

* Use the [LIST](../sql-reference/sql/list.md) command to view the status of data files that have been staged.
* Monitor the status of each [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command on the Query History page of Snowsight.
* Use the [VALIDATE](../sql-reference/functions/validate.md) function to validate the data files you’ve loaded and retrieve any errors encountered during the load.
* Use the [LOAD_HISTORY](../sql-reference/info-schema/load_history.md) Information Schema view to retrieve the history of data loaded into tables
  using the COPY INTO command.

## Manage data files

Staged files can be deleted from a Snowflake stage (user stage, table stage, or named stage) using the following methods:

* Files that were loaded successfully can be deleted from the stage during a load by specifying the PURGE copy option in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command.
* After the load completes, use the [REMOVE](../sql-reference/sql/remove.md) command to remove the files in the stage.

Removing files ensures they aren’t inadvertently loaded again. It also improves load performance, because it reduces the number of files that
COPY commands must scan to verify whether existing files in a stage were loaded already.

### Copy files from one stage to another

Use the [COPY FILES](../sql-reference/sql/copy-files.md) command to organize data into a single location
by copying files from one named stage to another.

The following example copies all of the files from a source stage (`src_stage`) to a target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage;
```

You can also specify a list of file names to copy, or copy files by using pattern matching.
For information, see the [COPY FILES examples](../sql-reference/sql/copy-files.md).

---
title: Copying data from an S3 stage
source: https://docs.snowflake.com/en/user-guide/data-load-s3-copy.md
section: User Guide
---

# Copying data from an S3 stage

Load data from your staged files into the target table.

## Load your data

Execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) to load your data into the target table.

> **Note:**
>
> Loading data requires a [warehouse](warehouses.md). If you are using a warehouse that is
> not configured to auto resume, execute [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) to resume the warehouse. Note
> that starting the warehouse could take up to five minutes.
>
> > ```sqlexample
> > ALTER WAREHOUSE mywarehouse RESUME;
> > ```

The following example loads data from files in the named `my_ext_stage` stage created in [Create an S3 stage](data-load-s3-create-stage.md). Using pattern matching, the statement only loads files whose names start with the string `sales`:

> ```sqlexample
> COPY INTO mytable
>   FROM @my_ext_stage
>   PATTERN='.*sales.*.csv';
> ```

Note that file format options are not specified because a named file format was included in the stage definition.

The following example loads all files prefixed with `data/files` in your S3 bucket using the named `my_csv_format` file format created in [Preparing to load data](data-load-prepare.md):

> ```sqlexample
> COPY INTO mytable
>   FROM s3://mybucket/data/files credentials=(AWS_KEY_ID='$AWS_ACCESS_KEY_ID' AWS_SECRET_KEY='$AWS_SECRET_ACCESS_KEY')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

The following ad hoc example loads data from all files in the S3 bucket. The COPY command specifies file format options instead of referencing a named file format. This example loads CSV files with a pipe (`|`) field delimiter. The COPY command skips the first line in the data files:

```sqlexample
COPY INTO mytable
  FROM s3://mybucket credentials=(AWS_KEY_ID='$AWS_ACCESS_KEY_ID' AWS_SECRET_KEY='$AWS_SECRET_ACCESS_KEY')
  FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|' SKIP_HEADER = 1);
```

## Validate your data

Before loading your data, you can validate that the data in the uploaded files will load correctly.

To validate data in an uploaded file, execute [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) in validation mode using
the VALIDATION_MODE parameter. The VALIDATION_MODE parameter returns errors that it encounters in the file. You
can then modify the data in the file to ensure it loads without error.

In addition, [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) provides the ON_ERROR copy option to specify an action
to perform if errors are encountered in a file during loading.

## Monitor data loads

Snowflake retains historical data for COPY INTO commands executed within the previous 14 days. The metadata can be used to monitor and
manage the loading process, including deleting files after upload completes:

* Monitor the status of each [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command on the Query History page of Snowsight.
* Use the [LOAD_HISTORY](../sql-reference/info-schema/load_history.md) Information Schema view to retrieve the history of data loaded into tables
  using the COPY INTO command.

## Copy files from one stage to another

Use the [COPY FILES](../sql-reference/sql/copy-files.md) command to organize data into a single location
by copying files from one named stage to another.

The following example copies all of the files from a source stage (`src_stage`) to a target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage;
```

You can also specify a list of file names to copy, or copy files by using pattern matching.
For information, see the [COPY FILES examples](../sql-reference/sql/copy-files.md).

---
title: Cost controls for warehouses
source: https://docs.snowflake.com/en/user-guide/cost-controlling-controls.md
section: User Guide
---

# Cost controls for warehouses

This topic discusses *controls* that you can use to limit how much is spent on [virtual warehouse](warehouses-overview.md)
usage. These controls help ensure that the actual cost of using virtual warehouses does not exceed expected cost.

These controls do not apply to [cloud services](cost-understanding-compute.md) and
[serverless features](cost-understanding-compute.md).

## Control access to warehouses

Carefully defining who can work with warehouses and what they can do with those warehouses helps control cost by limiting compute resource
usage to known warehouses that have cost-effective configurations. Snowflake’s granular
[access control](security-access-control-overview.md) allows you to grant the following privileges for warehouses:

* **CREATE WAREHOUSE** — Global privilege (i.e. granted on the account) that restricts which roles can create a new warehouse, allowing
  you to force individuals to use existing warehouses that have cost controls in place.
* **MODIFY** — Privilege on a specific warehouse that allows changing the settings that affect cost, including resizing a warehouse and
  disabling the auto-suspend setting. Commonly, users increase the size of a warehouse for a
  particular workload and then forget to change it back to its original size, which can have a significant effect on cost.
* **USAGE** — Privilege on a specific warehouse that allows activating the warehouse to provide compute resources for queries and other
  SQL actions. Carefully assigning this privilege ensures that users can only use warehouses with the appropriate size and configuration
  for their workloads.

Centralizing the responsibility of creating and scaling warehouses to just a few members of your team is considered a best practice. You can
create a dedicated role with permissions to create and modify all warehouses, and then grant that role to a limited number of users. This
allows you to control your warehouse policies and prevent accidental cost overruns resulting from warehouses being created or upsized
unexpectedly.

> **Tip:**
>
> If you want the ability to scale a warehouse to handle more demanding workloads, but do not want to give users the ability to increase
> the size of a warehouse because they might forget to resize it later, consider using a
> [multi-cluster warehouse](warehouses-multicluster.md). A multi-cluster warehouse scales automatically as workloads
> fluctuate.

For a list of all the privileges that can be set for a warehouse, see [Virtual warehouse privileges](security-access-control-privileges.md).

## Limit query times

Hung queries consume excessive credits because they run longer than expected. To avoid the excess cost associated with a
runaway query, you can set the `STATEMENT_TIMEOUT_IN_SECONDS` parameter to define the maximum amount of time a SQL statement can run
before it is cancelled.

The `STATEMENT_TIMEOUT_IN_SECONDS` parameter can be set for an entire account, a user, a session, or a specific warehouse so that you can
carefully set time limits that match the expected run times for various workloads. This parameter is set at the account level by default.
When the parameter is set for a warehouse in addition to the session, the lowest non-zero value is enforced.

Use the following commands to view the current query time limits:

```sqlexample
SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' IN ACCOUNT;
SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' IN USER <username>;
SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' IN SESSION;
SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' IN WAREHOUSE <warehouse_name>;
```

If you need to adjust the time limits, use one of the following commands:

```sqlexample
ALTER ACCOUNT SET STATEMENT_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER USER <username> SET STATEMENT_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER SESSION SET STATEMENT_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER WAREHOUSE <warehouse_name> SET STATEMENT_TIMEOUT_IN_SECONDS = <number_of_seconds>;
```

## Limit statement queue times

SQL statements that are in a queue to use a warehouse do not consume credits. However, if a query stays in the queue too long, it might no
longer be relevant by the time it executes. Running a query that is no longer relevant wastes credits, so you can implement a
cost control by setting a maximum amount of time that a SQL statement can be queued before it is cance led.

The parameter that controls the amount of time that a SQL statement stays in the queue is `STATEMENT_QUEUED_TIMEOUT_IN_SECONDS`. This
parameter can be set for an entire account, a user, a session, or a specific warehouse. This parameter is set at the account level by
default. When the parameter is set for a warehouse in addition to the session, the lowest non-zero value is enforced.

Use the following commands to view the current queue time limits:

```sqlexample
SHOW PARAMETERS LIKE 'STATEMENT_QUEUED_TIMEOUT_IN_SECONDS' IN ACCOUNT;
SHOW PARAMETERS LIKE 'STATEMENT_QUEUED_TIMEOUT_IN_SECONDS' IN USER <username>;
SHOW PARAMETERS LIKE 'STATEMENT_QUEUED_TIMEOUT_IN_SECONDS' IN SESSION;
SHOW PARAMETERS LIKE 'STATEMENT_QUEUED_TIMEOUT_IN_SECONDS' IN WAREHOUSE <warehouse_name>;
```

If you need to adjust the time limits, use one of the following commands:

```sqlexample
ALTER ACCOUNT SET STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER USER <username> SET STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER SESSION SET STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <number_of_seconds>;
ALTER WAREHOUSE <warehouse_name> SET STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <number_of_seconds>;
```

## Use auto-suspension

By default, all warehouses have the auto-suspend setting enabled, which means a warehouse automatically shuts down when it is inactive
for a defined period of time. A suspended warehouse does not consume credits, so the warehouse only incurs cost when it is processing a
workload.

Restricting users from disabling the auto-suspend setting helps to prevent an unused warehouse from wasting credits. You can use
access control to allow someone to use a warehouse but also prevent them from modifying its Auto
Suspend setting.

**Query: Find warehouses without auto-suspend**

Use the following query to periodically check whether the auto-suspend setting was disabled for any warehouses.

```sqlexample
SHOW WAREHOUSES
  ->> SELECT "name" AS WAREHOUSE_NAME,
             "size" AS WAREHOUSE_SIZE
        FROM $1
        WHERE IFNULL("auto_suspend", 0) = 0;
```

To enable auto-suspend for the warehouses that have it turned off, sign in to [Snowsight](ui-snowsight-gs.md). In the navigation menu, select Compute » Warehouses. You can also use the `AUTO_SUSPEND` parameter of the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.

### Using auto-resume with auto-suspend

In general, every warehouse that has auto-suspend enabled should also have auto-resume enabled. The combination of these two settings
stops and starts a warehouse automatically as the warehouse’s workload fluctuates.

**Query: Find warehouses without Auto Resume**

The following query lists the warehouses that do not have auto-resume enabled, letting you know which ones need to be modified.

```sqlexample
SHOW WAREHOUSES
  ->> SELECT "name" AS WAREHOUSE_NAME,
             "size" AS WAREHOUSE_SIZE
        FROM $1
        WHERE "auto_resume" = 'false';
```

To enable auto-resume for the warehouses that have it turned off, sign in to [Snowsight](ui-snowsight-gs.md). In the navigation menu, select Compute » Warehouses. You can also use the `AUTO_RESUME` parameter of the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.

## Enforce spending limits

*Resource monitors* provide the ability to set limits on credits consumed by a warehouse during a specific time interval or date range.
This can help prevent warehouses from unintentionally consuming more credits than typically expected.

Sometimes a resource monitor simply notifies an administrator when a credit limit is reached, but you can also *enforce* a limit by
configuring a resource monitor to suspend a warehouse as soon as the limit is reached. There are two options when enforcing a limit: suspend
the warehouse after pending statements are executed or suspend immediately without waiting for statements to complete.

Because a single resource monitor can be set for multiple warehouses or an entire account, you can effectively suspend multiple warehouses
when an overall spending limit is reached. A warehouse can be assigned to its own resource monitor and an account-specific resource monitor
at the same time; the warehouse is suspended when either of the credit limits is reached.

For more information about suspending warehouses when spending limits are reached, see [Working with resource monitors](resource-monitors.md).

**Query: Find warehouses without resource monitors**

The following query lists the warehouses that aren’t assigned to a warehouse-specific resource monitor, which makes them vulnerable
to runaway costs. The query doesn’t check for account-level resource monitors; warehouses in the list that belong to an account
that has an account-level resource monitor are still subject to credit limits.

```sqlexample
SHOW WAREHOUSES
  ->> SELECT "name" AS WAREHOUSE_NAME,
             "size" AS WAREHOUSE_SIZE
        FROM $1
        WHERE "resource_monitor" = 'null';
```

> **Note:**
>
> The cloud services layer of the Snowflake architecture can still incur a small cost if queries are run against a warehouse that was
> suspended by a resource monitor.

---
title: Costs for Snowpipe Streaming Classic
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-billing.md
section: User Guide
---

# Costs for Snowpipe Streaming Classic

With Snowpipe Streaming’s serverless compute model, users can stream any data volume without managing a virtual warehouse. Instead, Snowflake provides and manages the compute resources, automatically growing or shrinking capacity based on the current Snowpipe Streaming load.

For Snowpipe Streaming Classic, accounts are charged based on the per-second time that serverless compute and active client streaming ingestion uses. Be aware of the following:

* File migration occurs asynchronously from streaming ingestion.
* File migration might be pre-empted by clustering or other DML operations.
* File migration might not always occur, and therefore compute costs might be reduced.
* For Snowflake-managed Apache Iceberg™ tables, file migration operates similarly to Iceberg table maintenance to create new compacted Parquet files, if necessary.

For more information, see the “Serverless Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Estimating Snowpipe Streaming charges

Given the number of factors that can differentiate Snowpipe Streaming loads, it is very difficult for Snowflake to provide sample costs. Size of records, number of records, data types, etc. can affect the compute resource consumption for file migration. Client charges are dictated only by how many clients are actively writing data to Snowflake on a per-second basis.

We suggest that you experiment by performing a typical streaming ingestion load to estimate future charges.
To see a sample streaming ingestion experiment with estimated costs, see [this blog post](https://www.snowflake.com/blog/data-ingestion-best-practices-part-three/).

## Temporary file storage and billing

Although the Snowpipe Streaming API is designed to write rows directly to Snowflake tables without requiring users to explicitly stage files, in Snowpipe Streaming Classic, Snowflake’s internal processes use a transparent internal stage for temporary buffering of data. The Snowpipe Streaming with classic architecture SDK generates and uploads intermediate files to this internal stage before they are transformed into Snowflake’s native file format.

Snowflake bills you for the storage that is consumed by these temporary files in the internal stage. This storage cost is separate from the Snowpipe Streaming serverless compute cost and appears under the general “storage cost” on your Snowflake bill.

The retention period for these temporary files in the internal stage is directly associated with the data retention time for the target table (or the account-level retention if no specific table retention is set). Snowflake automatically deletes these files after they fall outside of the defined Time Travel window. Typically, this deletion occurs within one day of the data exiting the retention period. Users don’t have direct access to, or visibility into, these internal stage files.

## Cloning tables with Snowpipe Streaming

When users clone a table that is actively receiving data through Snowpipe Streaming with classic architecture, users might observe higher storage costs. This additional cost isn’t because of duplication of the underlying data files. Snowflake performs zero-copy cloning. Instead, it’s because data in flight – data that was processed by the Snowpipe Streaming with classic architecture SDK and is temporarily stored in the internal stage but not yet fully committed to the target table – requires a file migration for both the original table and the clone. This double processing of temporary files increases file migration consumption and leads to greater storage usage. This additional cost is typically very small, reflecting a maximum of about 5 minutes of temporary files, but could be larger with very high throughput if the system is experiencing delays in these migrations. This duplication contributes to increased storage consumption.

In contrast, Snowpipe Streaming with a high-performance architecture offers true zero-copy cloning for tables that actively receive streaming data. With the high-performance architecture, cloning operations behave like standard Snowflake table clones. This means that only new data written after the clone operation consumes additional storage. Data in flight at the time of cloning isn’t subject to this dual migration. As a result, you benefit from cost-efficient cloning for streaming tables.

## Viewing the data load history for your account

Account administrators (users with the ACCOUNTADMIN role) or users with a role granted the MONITOR USAGE global privilege can use SQL commands to view the credits billed to your Snowflake account within a specified date range. You can use the following views to query the history of data migrated into Snowflake tables, the amount of time spent loading data into Snowflake tables using Snowpipe Streaming, and the credits consumed.

To view the total Snowpipe Streaming costs, including both compute and client costs, query the metering history when the `SERVICE_TYPE` is set to `SNOWPIPE_STREAMING`.

> * [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md) (in [Account Usage](../../sql-reference/account-usage.md)).

For more information about querying the total Snowpipe Streaming costs, see [a SQL example](../cost-exploring-compute.md).

To see the detailed breakdowns of client ingestion and migration compute, you can query the following views:

> * [SNOWPIPE_STREAMING_CLIENT_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_client_history.md) (in [Account Usage](../../sql-reference/account-usage.md)).
> * [SNOWPIPE_STREAMING_FILE_MIGRATION_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_file_migration_history.md) (in [Account Usage](../../sql-reference/account-usage.md)).

---
title: Create a catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/create-catalog.md
section: User Guide
---

# Create a catalog

The Snowflake Open Catalog service admin creates a catalog.

The steps to create a catalog depend on your cloud storage provider.

When you create a catalog, you supply information about your external cloud storage, and Snowflake Open Catalog uses that information to create a storage
configuration. This configuration stores an identity and access management (IAM) entity for your storage. Open Catalog uses the IAM entity to
securely connect to your storage locations in order to access table data, Apache Iceberg™ metadata, and manifest files.

For instructions, see the following sections:

* Create a catalog using Amazon Simple Storage Service (Amazon S3)
* Create a catalog using Cloud Storage from Google
* Create a catalog using Azure

## Create a catalog using Amazon Simple Storage Service (Amazon S3)

**Prerequisites**

* An S3 storage bucket in the same region that hosts your Snowflake account

  + Open Catalog can’t support bucket names that contain dots (for example, *my.s3.bucket*). Open Catalog uses virtual-hosted-style paths and HTTPS to
    access data in S3. However, S3 does not support SSL for virtual-hosted-style buckets with dots in the name.
  + For data recovery features, see your storage provider.
* Permissions in AWS to create and manage IAM policies and roles

  If you aren’t an AWS administrator, ask your AWS administrator to perform
  these tasks.

### Step 1: Create an IAM policy that grants access to your S3 location

To configure access permissions for Open Catalog in the AWS Management Console, follow this procedure:

1. Sign in to the AWS Management Console.
2. On the home dashboard, select **IAM**.
3. In the navigation pane, select **Account settings**.
4. Under **Security Token Service (STS)**, in the **Endpoints** list, find the Open Catalog region where your account is located, and if the
   **STS status** is inactive, set the toggle to **Active**.
5. In the navigation pane, select **Policies**.
6. Select **Create Policy**.
7. For **Policy editor**, select **JSON**.
8. Add a policy to provide Open Catalog with the required permissions to read and write data to your S3 location.

   > **Note:**
   > * Replace `*my_bucket*` with your actual bucket name. You can also specify a path in the bucket; for example, `*my_bucket*/*path*`.
   > * Setting the `"s3:prefix":` condition to `["*"]` grants access to all prefixes in the specified bucket; setting it to `["*path*/*"]`
   >   grants access to a specified path in the bucket.
   > * For buckets in government regions, the bucket ARNs use the `arn:aws-us-gov:s3:::` prefix.

   The following example policy grants access to all locations in the specified bucket:

   ```sqljson
      {
         "Version": "2012-10-17",
         "Statement": [
               {
                  "Effect": "Allow",
                  "Action": [
                     "s3:PutObject",
                     "s3:GetObject",
                     "s3:GetObjectVersion",
                     "s3:DeleteObject",
                     "s3:DeleteObjectVersion"
                  ],
                  "Resource": "arn:aws:s3:::<my_bucket>/*"
               },
               {
                  "Effect": "Allow",
                  "Action": [
                     "s3:ListBucket",
                     "s3:GetBucketLocation"
                  ],
                  "Resource": "arn:aws:s3:::<my_bucket>",
                  "Condition": {
                     "StringLike": {
                           "s3:prefix": [
                              "*"
                           ]
                     }
                  }
               }
         ]
      }
   ```
9. Select **Next**.
10. For **Policy name**, enter a policy name (for example, `open_catalog_access`).
11. Optional: For **Description**, enter a description.
12. Select **Create policy**.

### Step 2: Create an IAM role to grant privileges on your S3 bucket

1. From the AWS Management Console, on the Identity and Access Management (IAM) Dashboard, in the navigation pane, select **Roles**.
2. Select **Create role**.
3. For the trusted entity type, select **AWS account**.
4. Under **An AWS account**, select **This account**.

   In a later step, you modify the trusted relationship and grant access to Open
   Catalog.
5. Optional: To create an external ID, select the **Require external ID** checkbox, and enter an [external ID](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html); for example, `open_catalog_external_id`.

   > **Note:**
   >
   > If you don’t create an external ID when you create a catalog, Open Catalog generates an external ID for you to use. An external
   > ID is used to grant access to your AWS resources (such as S3 buckets) to a third party like Open Catalog.
6. Select **Next**.
7. Select the policy that you created in the previous step, and then select **Next**.
8. Enter a role name and description for the role, and then select **Create role**.

   You have now created an IAM policy for an S3 location, created an
   IAM role, and attached the policy to the role.
9. To view the role summary page, select **View role**.
10. Locate and record the **ARN** (Amazon Resource Name) value for the role.

### Step 3: Create a catalog in Open Catalog

1. Sign in to Open Catalog.
2. On the Open Catalog home page, in the **Catalogs** area, select **+ Create**.
3. In the **Create Catalog** dialog, complete the fields:

   1. For **Name**, enter a name for the catalog.

      Catalog names are case-sensitive.
   2. Optional: To create an external catalog, set the **External** toggle to **On**.

      For information about external catalogs, see
      [Catalog types](overview.md).
   3. For **Storage Provider**, select **S3**.
   4. Optional: To enable [outbound private connectivity](private-connectivity-outbound.md) for the catalog, set the **Private Link**
      toggle to **Enabled**.
   5. For **Default base location**, enter the default base location for your AWS S3 storage bucket.
   6. Optional: If the catalog will contain objects stored in more than one location, list each additional location (separated by a comma) in **Additional locations (optional)**.
   7. For **S3 role ARN**, enter the ARN of the IAM role that you created for Open Catalog.
   8. Optional: If you created an external ID when you created an IAM role, for **External ID**, enter the external ID.
   9. Select **Create**.

      For external catalogs, credential vending is disabled by default. However, you can enable it for the catalog.
      For details, see [Enable credential vending for an external catalog](enable-credential-vending-external-catalog.md).

### Step 4: Retrieve the AWS IAM user for your Open Catalog account

1. On the Open Catalog home page, in the **Catalogs** area, select the catalog that you created.
2. Under **Storage Details**, copy the **IAM user arn**; for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`.

   Open Catalog provisions
   a single IAM user for your entire Open Catalog account. All S3 storage configurations in your account use that IAM user.

   **Note**
   If you didn’t specify an external ID when you created your IAM role, Open Catalog generates an external ID for you to use.
   Record the value so that you can update your IAM role trust policy with the generated external ID.

### Step 5: Grant the IAM user permissions to access bucket objects

1. Sign in to the AWS Management Console.
2. On the home dashboard, search for and select **IAM**.
3. In the navigation pane, select **Roles**.
4. Select the IAM role that you created for your storage configuration.
5. Select the **Trust relationships** tab.
6. Select **Edit trust policy**.
7. Modify the policy document with the catalog storage details that you recorded:

   ```sqljson
      {
        "Version": "2012-10-17",
        "Statement": [
          {
            "Sid": "",
            "Effect": "Allow",
            "Principal": {
              "AWS": "<open_catalog_user_arn>"
            },
            "Action": "sts:AssumeRole",
            "Condition": {
              "StringEquals": {
                "sts:ExternalId": "<open_catalog_external_id>"
              }
            }
          }
        ]
      }
   ```

   Where:

   * `open_catalog_user_arn` is the IAM user ARN that you recorded.
   * `open_catalog_external_id` is your external ID. If you specified an external ID when you created the role and used the same ID
     to create your storage configuration, leave the value as-is. Otherwise, update `sts:ExternalId` with the value that you recorded.

     > **Note:**
     >
     > You must update this policy document if you create a new storage configuration and don’t provide your own external ID. For security reasons,
     > a new or recreated storage configuration has a different external ID and cannot resolve the trust relationship unless you update this trust
     > policy.

   **Example policy document for IAM role**
8. To save your changes, select **Update policy**.

## Create a catalog using Cloud Storage from Google

This section covers how to create a catalog and grant Open Catalog restricted access to a Cloud Storage bucket using a storage
configuration.

An administrator in your organization grants the IAM user permissions in your Google Cloud account.

> **Note:**
>
> * To complete the instructions in this topic, you must have permissions in Google Cloud to create and manage IAM policies and roles. If you
>   are not a Google Cloud administrator, ask your Google Cloud administrator to perform these tasks.
> * For data recovery features, see your storage provider.

### Step 1: Create a catalog

1. Sign in to Open Catalog.
2. On the Open Catalog home page, in the **Catalogs** area, select **+ Create**.
3. In the **Create Catalog** dialog, complete the fields:

   1. For **Name**, enter a name for the catalog.

      Catalog names are case-sensitive.
   2. Optional: To create an external catalog, set the **External** toggle to **On**.

      For information about external catalogs, see [Catalog types](overview.md).
   3. For **Storage Provider**, select **GCS**.
   4. For **Default base location**, enter the default base location for your Cloud Storage bucket.
   5. Optional: If the catalog will contain objects stored in more than one location, for **Additional locations (optional)**, list each additional storage location, separated by a comma.
   6. Select **Create**.

      For external catalogs, credential vending is disabled by default. However, you can enable it for the catalog.
      For details, see [Enable credential vending for an external catalog](enable-credential-vending-external-catalog.md).

### Step 2: Retrieve the Google Cloud service account for your Open Catalog account

1. From the Open Catalog home page, in the **Catalogs** area, select the catalog that you created.
2. Under **Storage Details**, copy the **GCP_SERVICE_ACCOUNT** ID; for example, `service-account-id@project1-123456.iam.gserviceaccount.com`.

   Open Catalog provisions a single Google Cloud service account for your entire Open Catalog account and uses that service account
   when accessing storage on Google Cloud.

### Step 3: Grant the service account permissions to access bucket objects

In this step, you configure IAM access permissions for Open Catalog in your Google Cloud console.

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Sign in to the Google Cloud console as a project editor.
2. On the home dashboard, in the navigation menu, select **IAM & Admin** > **Roles**.
3. Select **Create Role**.
4. For **Title**, enter a title for the custom role.
5. Optional: For **Description**, enter a description for the custom role.
6. Select **Add Permissions**.
7. In **Filter**, select **Service**, and then select **storage**.
8. Filter the list of permissions, and select the following from the list:

   * `storage.buckets.get`
   * `storage.objects.create`
   * `storage.objects.delete`
   * `storage.objects.get`
   * `storage.objects.list`
9. Select **Add**.
10. Select **Create**.

#### Assign the custom role to the Google Cloud service account

Remain in the Google Cloud console for this procedure.

1. On the home dashboard, in the navigation menu, select **Cloud Storage** > **Buckets**.
2. Filter the list of buckets, and select the bucket that you specified in your Open Catalog storage configuration.
3. Select **Permissions** > **View by principals**, and then select **Grant access**.
4. Under **Add principals**, paste the service account ID that you copied earlier.
5. Under **Assign roles**, select the custom IAM role that you created earlier, and then select **Save**.

## Create a catalog using Azure storage

This section covers how to grant Open Catalog restricted access to your own Microsoft Azure container using a storage configuration. Open
Catalog supports the following Azure cloud storage services for storage configurations:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v1
* General-purpose v2

An administrator in your organization grants the IAM user permissions in your Azure account.

> **Note:**
>
> * Completing the instructions in this topic requires permissions in Azure to create and manage IAM policies and roles. If you are not an
>   Azure administrator, ask your Azure administrator to perform these tasks.
> * For data recovery features, see your storage provider.

### Step 1: Create a catalog

1. Sign in to Open Catalog.
2. On the Open Catalog home page, in the **Catalogs** area, select **+ Create**.
3. In the **Create Catalog** dialog, complete the fields:

   1. For **Name**, enter a name for the catalog.

      Catalog names are case-sensitive.
   2. Optional: To create an external catalog, set the **External** toggle to **On**.

      For information about external catalogs, see [Catalog types](overview.md).
   3. For **Storage Provider**, select **AZURE**.
   4. Optional: To enable [outbound private connectivity](private-connectivity-outbound.md) for the catalog, set the **Private Link**
      toggle to **Enabled**.
   5. For **Default base location**, enter the default base location for your Azure storage container by applying from this list the
      applicable format to the path to the primary endpoint for your container:

      | Endpoint type | Format | Default base location example |
      | --- | --- | --- |
      | Blob | `abfss://<container_name>@<storage_account_name>.blob.core.windows.net/<directory_name>/` | `abfss://my_container1@my_storageaccount1.blob.core.windows.net/my_directory1/` |
      | Azure Data Lake Storage (ADLS) | `abfss://<container_name>@<storage_account_name>.dfs.core.windows.net/<directory_name>/` | `abfss://my_container2@my_storageaccount2.dfs.core.windows.net/my_directory2/` |

      > **Note:**
      > * You copied this path and the name of your container when you created a Microsoft Azure container.
      > * In the path to the primary endpoint for your container, the name of your storage account is the text between `https://` and the first period in the path.
      > * Use the `abfss://` prefix, not `https://`.
   6. Optional: If the catalog will contain objects stored in more than one location, in the **Additional locations (optional)** field, list each
      additional storage location, separated by a comma.
   7. For **Tenant ID**, enter the Azure Tenant ID.
   8. Select **Create**.

      For external catalogs, credential vending is disabled by default. However, you can enable it for the catalog.
      For details, see [Enable credential vending for an external catalog](enable-credential-vending-external-catalog.md).

### Step 2: Copy the values for the storage location

1. On the Open Catalog home page, in the **Catalogs** area, select the catalog that you created.
2. Under **Storage Details**, copy the following values:

   | Property | Description |
   | --- | --- |
   | `AZURE_CONSENT_URL` | URL to the Microsoft permissions request page. |
   | `AZURE_MULTI_TENANT_APP_NAME` | Name of the Snowflake client application created for your account. In a later step in this section, you grant this application permission to obtain an access token on your allowed storage location. |

   You use these values in the following steps.

### Step 3. Grant the Azure service principal permissions to an access token

1. In a web browser, navigate to the Microsoft permissions request page (the Azure consent URL).
2. Select **Accept**.

   This action allows the Azure service principal created for your Open Catalog account to obtain an access token
   on specified resources inside your tenant. Obtaining an access token succeeds only if you grant to the service principal the appropriate permissions
   on the container. The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
3. Sign in to the Microsoft Azure portal.
4. Open **Azure Services** > **Storage Accounts**.
5. Select the name of the storage account that the Open Catalog service principal needs
   to access.
6. Select **Access Control (IAM)** > **Add role assignment**.
7. Select the desired role to grant to the Open Catalog service principal, such as the Storage Blob Data Contributor role.

   The Storage
   Blob Data Contributor role grants read and write access to the Open Catalog service principal and grants write access to the storage location.

   > **Note:**
   >
   > Open Catalog issues a user delegation shared access signature (SAS) token. The SAS token for accessing the storage blobs is scoped at the level of container instead of blob or directory. The role you select should have permission to create the user delegation key. For a list of these roles, see [Assign permissions with RBAC](https://learn.microsoft.com/en-us/rest/api/storageservices/create-user-delegation-sas).
8. Select **Next**.
9. Select **+ Select members**.
10. After an hour, search for and select the Open Catalog service principal, which is the Azure multi-tenant app name property. Search for the string ***before***
    the underscore in the property value.

    > **Important:**
    > * It can take an hour or longer for Azure to create the Open Catalog service principal requested through the Microsoft request page
    >   in this section. If the service principal is not available immediately, wait an hour or two and then search again.
    > * If you delete the service principal, the catalog will stop working due to authentication failure.
11. Select **Select**.
12. Select **Review + assign**.

> **Note:**
>
> It can take up to 10 minutes for changes to take effect when you assign a role. For more information, see
> [Symptom - Role assignment changes are not being detected](https://learn.microsoft.com/en-us/azure/role-based-access-control/troubleshooting?tabs=bicep#symptom---role-assignment-changes-are-not-being-detected)
> in the Microsoft Azure documentation.

---
title: Create a catalog role
source: https://docs.snowflake.com/en/user-guide/opencatalog/create-catalog-role.md
section: User Guide
---

# Create a catalog role

Create a catalog role to grant privileges on it when you [secure a catalog](secure-catalogs.md). For more information about catalog roles, see [Catalog role](access-control.md).

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog for which you want to create a catalog role.
4. Select the **Roles** tab.
5. Select **+ Catalog role**.
6. Enter a name for the Catalog role, and then select **Create**.

   Catalog role names are case-sensitive.

When you secure a catalog, you can now select this catalog role to secure the entire catalog or a namespace, table, or view within it.

---
title: Create a principal role
source: https://docs.snowflake.com/en/user-guide/opencatalog/create-principal-role.md
section: User Guide
---

# Create a principal role

You can create principal roles to logically group service principals together. For more information about principal roles, including examples, see [Principal role](access-control.md).

1. Sign in to Open Catalog.
2. In the menu on the left, select **Connections**.
3. Select the **Roles** tab.
4. Select **+ Principal role**.
5. Enter a name for the principal role, and then select **Create**.

---
title: Create a sequence of tasks with a task graph
source: https://docs.snowflake.com/en/user-guide/tasks-graphs.md
section: User Guide
---

# Create a sequence of tasks with a task graph

In Snowflake, you can manage multiple tasks with a *task graph*, also known as a directed acyclic graph (DAG). A task graph is composed of a root task and dependent child tasks. The dependencies must run in a start-to-finish direction, with no loops. An optional final task, called a *finalizer*, can perform cleanup operations after all other tasks are complete.

Build task graphs that have dynamic behavior by specifying logic-based operations in the task body using runtime values, graph level configuration, and return values of parent tasks.

You can create tasks and task graphs using [supported languages and tools](../developer-guide/stored-procedure/stored-procedures-overview.md) like SQL, JavaScript, Python,
Java, Scala, or Snowflake Scripting. This topic provides SQL examples. For Python examples, see [Managing Snowflake tasks and task graphs with Python](../developer-guide/snowflake-python-api/snowflake-python-managing-tasks.md).

You can also use Snowsight to manage and view your task graphs. For more information, see [View tasks and task graphs in Snowsight](ui-snowsight-tasks.md).

## Create a task graph

Create a root task using [CREATE TASK](../sql-reference/sql/create-task.md), then create child tasks using CREATE TASK .. AFTER to select the parent tasks.

The root task defines when the task graph runs. Child tasks are executed in the order defined by the task graph.

When multiple child tasks have the same parent, the child tasks run
in parallel.

When a task has multiple parents, the task waits for all
preceding tasks to successfully complete before starting.
(The task may also run when some parent tasks are skipped. For
more information, see Skip or suspend a child task).

The following example creates a serverless task graph that starts
with a root task that is scheduled to run every minute. The root
task has two child tasks that run in parallel. (The diagram
shows an example where one of these tasks runs longer than the other.)
After both of those tasks complete, a third child task runs. The
finalizer task runs after all other tasks complete or fail to complete:

```sqlexample
CREATE TASK task_root
  SCHEDULE = '1 MINUTE'
  AS SELECT 1;

CREATE TASK task_a
  AFTER task_root
  AS SELECT 1;

CREATE TASK task_b
  AFTER task_root
  AS SELECT 1;

CREATE TASK task_c
  AFTER task_a, task_b
  AS SELECT 1;
```

Considerations:

* A task graph is limited to a maximum of 1000 tasks.
* A single task can have a maximum of 100 parent tasks and 100 child tasks.
* When tasks run in parallel on the same user-managed warehouse, the [compute resources](tasks-intro.md) must be sized to handle the concurrent task runs.

### Finalizer task

You can add an optional finalizer task to run after all other tasks in
the task graph complete (or fail to complete). Use this to do the following:

* Perform cleanup operations, for example, cleaning up intermediate data that is no longer needed.
* Send notifications about task success or failure.

To create a finalizer task, use [CREATE TASK … FINALIZE …](../sql-reference/sql/create-task.md) on the root task. Example:

```sqlexample
CREATE TASK task_finalizer
  FINALIZE = task_root
  AS SELECT 1;
```

Considerations:

* A finalizer task is always associated with a root task. Each root task can have only one finalizer task, and a finalizer task can be
  associated with only one root task.
* When the root task of a task graph is skipped (for example, because of overlap task graph runs), the finalizer task won’t be started.
* A finalizer task cannot have any child tasks.
* A finalizer task is scheduled only when no other tasks are running or queued in the current task graph run.

For more examples, see Finalizer task example: Send email notification and Finalizer task example: Correct for errors.

## Manage task graph ownership

All tasks in a task graph must have the same task owner and be stored in the same database and schema.

You can transfer ownership of all tasks in a task graph using one of the following actions:

* Drop the owner of all tasks in the task graph using [DROP ROLE](../sql-reference/sql/drop-role.md). Snowflake transfers ownership to the
  role that runs the DROP ROLE command.
* Transfer ownership of all tasks in the task graph using [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) on all tasks in a schema.

When you transfer ownership of the tasks in a task graph using these methods, the tasks in the task graph retain their relationships to
each other.

Transferring ownership of a single task removes the dependency between the task and any parent and child tasks. For more information, see
Unlink parent and child tasks (in this topic).

> **Note:**
>
> Database replication does not work for task graphs if the graph is owned by a different role than the role that performs replication.

## Run or schedule tasks in a task graph

### Run a task graph manually

You can run a single instance of a task graph. This is useful for testing new or modified task graphs before enabling the task graph in production, or for one-time runs as needed.

Before starting the task graph, use [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md) on each child task (including the optional finalizer task) that you want to include in the run.

To run a single instance of a task graph, use [EXECUTE TASK](../sql-reference/sql/execute-task.md) on the root task. When you run the root task, all resumed child tasks in the task graph are executed in the order defined by the task graph.

### Run a task on a schedule or as a triggered task

In the root task, define when the task graph runs. Task graphs can run on a recurring schedule, or they can be triggered by an event. For more information, see the following topics:

* [Scheduled tasks](tasks-intro.md)
* [Triggered tasks](tasks-triggered.md)

To start the task graph, you can do either of the following:

* Resume each individual child task (including the finalizer) that you want to include in the run, and then resume the root task, using [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md).
* Resume all of the tasks in a task graph at once using [SYSTEM$TASK_DEPENDENTS_ENABLE](../sql-reference/functions/system_task_dependents_enable.md) ( <root_task_name> ) on the root task.

## View dependent tasks in a task graph

To view the child tasks for a root task, call the [TASK_DEPENDENTS](../sql-reference/functions/task_dependents.md) table function. To retrieve all tasks in a task graph, input the root task when calling the function.

You can also use Snowsight to manage and view your task graphs. For more information, see [View tasks and task graphs in Snowsight](ui-snowsight-tasks.md).

## Modify, suspend, or retry tasks

### Modify a task in a task graph

To modify a task in a scheduled task graph, suspend the root task using [ALTER TASK … SUSPEND](../sql-reference/sql/alter-task.md). If a run of the task graph is in process, it completes the current run. All future scheduled runs of the root task are canceled.

When the root task is suspended, child tasks including the finalizer task retain their state (suspended, running, or completed). The child tasks don’t need to be individually suspended.

After you suspend the root task, you can modify any task in the task graph.

To resume the task graph, you can do either of the following:

* Resume the root task using [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md). Individual child tasks that were running before do not need to be resumed.
* Resume all of the tasks in a task graph at once by calling [SYSTEM$TASK_DEPENDENTS_ENABLE](../sql-reference/functions/system_task_dependents_enable.md) and passing in the name of the root task.

### Skip or suspend a child task

To skip a child task in a task graph, suspend the child task
using [ALTER TASK … SUSPEND](../sql-reference/sql/alter-task.md).

When you suspend a child task, the task graph continues to run as though
the child task had succeeded. A child task with multiple predecessors
runs as long as at least one of the predecessors is in a
resumed state, and all resumed predecessors run successfully to completion.

### Retry a failed task

Use [EXECUTE TASK … RETRY LAST](../sql-reference/sql/execute-task.md) to attempt to run the task graph from the last failed task. If the task succeeds, all child tasks will continue to run as their preceding tasks complete.

### Automatic retries

By default, if a child task fails, the entire task graph is considered to have failed.

Rather than waiting until the next scheduled task graph run, you can instruct the task graph to retry immediately by setting the `TASK_AUTO_RETRY_ATTEMPTS` parameter on the root task. When a child task fails, the entire task graph is immediately retried, up to the number of times specified. If the task graph still doesn’t complete, the task graph is considered to have failed.

### Suspend task graphs after failed task graph runs

By default, a task graph is suspended after 10 consecutive failures. You can change this value by setting `SUSPEND_TASK_AFTER_NUM_FAILURES` on the root task.

In the following example, whenever a child task fails, the task graph immediately retries twice before the entire task graph is considered failed. If the task graph fails three times in a row, the task graph is then suspended.

```sqlexample
CREATE OR REPLACE TASK task_root
  SCHEDULE = '1 MINUTE'
  TASK_AUTO_RETRY_ATTEMPTS = 2   --  Failed task graph retries up to 2 times
  SUSPEND_TASK_AFTER_NUM_FAILURES = 3   --  Task graph suspends after 3 consecutive failures
  AS SELECT 1;
```

## Unlink parent and child tasks

Dependencies between tasks in a task graph can be severed as a result of the following actions:

* ALTER TASK … REMOVE AFTER and ALTER TASK … UNSET FINALIZE remove the link between the target task and the specified
  parent tasks or finalized root task.
* DROP TASK and GRANT OWNERSHIP sever all the target task’s links. For example, root task A has child task B, and task B has child task C. If you drop task B, the link between task A and B is severed and so is the link between task B and C.

If any combination of the above actions severs the relationship between the child task and all parent tasks, the
child task becomes either a standalone task or a root task.

> **Note:**
>
> If you grant the ownership of a task to its current owner, dependency links might not be severed.

## Overlap task graph runs

By default, Snowflake ensures that only one instance of a particular task graph is allowed to run at a time. The next run of a root task
is scheduled only after all tasks in the task graph have finished running. This means that if the cumulative time required to run all tasks
in the task graph exceeds the explicit scheduled time set in the definition of the root task, at least one run of the task graph is
skipped.

To control task graph parallelism, use [CREATE TASK](../sql-reference/sql/create-task.md) or [ALTER TASK](../sql-reference/sql/alter-task.md)
on the root task to set the OVERLAP_POLICY parameter:

* `OVERLAP_POLICY = NO_OVERLAP` (default): Executes tasks serially with no parallelism.
  Snowflake schedules the next run of a root task only after all child tasks finish running.
* `OVERLAP_POLICY = ALLOW_CHILD_OVERLAP`: Allows child task parallelism.
  When the next scheduled run time for the root task occurs while any child task is still running,
  Snowflake starts a new instance of the task graph. Root tasks never overlap with this policy.
* `OVERLAP_POLICY = ALLOW_ALL_OVERLAP`: Allows unlimited true parallelism.
  Snowflake can run multiple instances of the entire task graph, including the root task, concurrently.

Overlapping runs may be tolerated (or even desirable) when read/write SQL operations executed by overlapping runs of a task graph do not
produce incorrect or duplicate data. However, for other task graphs, task owners (the role with the OWNERSHIP privilege on all tasks in the
task graph) should set an appropriate schedule on the root task and choose an appropriate warehouse size (or use serverless compute
resources) to ensure an instance of the task graph finishes to completion before the root task is next scheduled to run.

To better align a task graph with the schedule defined in the root task:

1. If feasible, increase the scheduling time between runs of the root task.
2. Consider modifying compute-heavy tasks to use serverless compute resources. If the task relies on user-managed compute resources, increase the size of the warehouse that runs large or complex SQL statements or stored procedures in the task graph.
3. Analyze the SQL statements or stored procedure executed by each task. Determine if code can be rewritten to leverage parallel processing.

If none of the above solutions help, consider whether it is necessary to allow concurrent runs of the task graph by setting
OVERLAP_POLICY = ALLOW_CHILD_OVERLAP or OVERLAP_POLICY = ALLOW_ALL_OVERLAP on the root task.
You can set this parameter when you create a task (using CREATE TASK) or later
(using ALTER TASK or in Snowsight).

### Versioning

When the root task in a task graph is resumed or manually executed, Snowflake sets a version of the entire task graph, including all properties for all tasks in the task graph. After a task is suspended and modified, Snowflake set a new version when the root task is resumed or manually executed.

To modify or recreate any task in a task graph, the root task must first be suspended. When the root task is suspended, all future
scheduled runs of the root task are cancelled; however, if any tasks are currently running, these tasks and any descendant tasks continue
to run using the current version.

> **Note:**
>
> If the definition of a stored procedure called by a task changes while the task graph is executing, the new programming could be
> executed when the stored procedure is called by the task in the current run.

For example, suppose the root task in a task graph is suspended, but a scheduled run of this task has already started. The owner of all
tasks in the task graph modifies the SQL code called by a child task while the root task is still running. The child task runs and executes
the SQL code in its definition using the version of the task graph that was current when the root task started its run. When the root task
is resumed or is manually executed, a new version of the task graph is set. This new version includes the modifications to the child task.

To retrieve the history of task versions, query [TASK_VERSIONS](../sql-reference/account-usage/task_versions.md) [Account Usage view](../sql-reference/account-usage.md) (in the SNOWFLAKE shared database).

## Task graph duration

Task graph duration includes the time from when the root task is scheduled to start to when the last child task completes. To calculate the duration of a task graph, query [COMPLETE_TASK_GRAPHS view](../sql-reference/account-usage/complete_task_graphs.md), and compare SCHEDULED_TIME with COMPLETED_TIME.

For example, the following diagram shows a task graph that is scheduled to run every minute. The root task and its two child tasks each queue for 5 seconds and run for 10 seconds, requiring a total of 45 seconds to complete.

### Task graph timeouts

When [USER_TASK_TIMEOUT_MS](../sql-reference/parameters.md) is set in the root task, the timeout applies to the entire task graph.

When [USER_TASK_TIMEOUT_MS](../sql-reference/parameters.md) in set in a child task or finalizer task, the timeout applies to only that task.

When [USER_TASK_TIMEOUT_MS](../sql-reference/parameters.md) is set in both the root task and a child task, the child task timeout overrides the root task timeout for that child task.

### Considerations

* For serverless tasks, Snowflake automatically scales resources to make sure tasks complete within a target completion interval, including queueing time.
* For user-managed tasks, longer queueing periods are common when tasks are scheduled to run on a shared or busy warehouse.
* For task graphs, the total time might include additional queueing time for child tasks waiting for their predecessors complete.

## Create a task graph with logic (runtime info, configuration, and return values)

Tasks in a task graph can use return values from parent tasks to perform logic-based operations in their function body.

Considerations:

* Some logic-based commands, like [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](../sql-reference/functions/system_get_predecessor_return_value.md), are case sensitive. However, tasks created using CREATE TASK without quotes are [stored and resolved in uppercase](../sql-reference/identifiers-syntax.md). To manage this, you can do any of the following:

  + Create task names using only uppercase letters.
  + Use quotes when naming and calling tasks.
  + For task names defined with lowercase characters, call the task using uppercase characters. For example: a task defined by “CREATE TASK task_c…” can be called as SELECT SYSTEM$GET_PREDECESSOR_RETURN_VALUE(‘TASK_C’).

### Pass configuration information to the task graph

You can pass configuration information by using a JSON object that can be read by other tasks in a task graph.
Use the syntax [CREATE TASK](../sql-reference/sql/create-task.md) or [ALTER TASK](../sql-reference/sql/alter-task.md) with
the CONFIG parameter to set, unset, or modify the configuration information in the root task.
Use the function [SYSTEM$GET_TASK_GRAPH_CONFIG](../sql-reference/functions/system_get_task_graph_config.md) to retrieve the configuration information.

Example:

```sqlexample
CREATE OR REPLACE TASK task_root
  SCHEDULE = '1 MINUTE'
  USER_TASK_TIMEOUT_MS = 60000
  CONFIG='{"environment": "production", "path": "/prod_directory/"}'
  AS SELECT 1;

CREATE OR REPLACE TASK task_a
  USER_TASK_TIMEOUT_MS = 600000
  AFTER task_root
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$GET_TASK_GRAPH_CONFIG('path'));
      CREATE TABLE IF NOT EXISTS demo_table(NAME VARCHAR, VALUE VARCHAR);
      INSERT INTO demo_table VALUES('task c path',:value);
    END;
```

> **Note:**
>
> You can dynamically override the configuration for a single task execution with the
> [EXECUTE TASK … USING CONFIG](../sql-reference/sql/execute-task.md) command.
> With this command, you can test different configurations or run ad-hoc executions with modified settings without changing the task definition.

### Pass return values between tasks

You can pass return values between tasks in a task graph.
Use the function [SYSTEM$SET_RETURN_VALUE](../sql-reference/functions/system_set_return_value.md)
to add a return value from a task, and use the function
[SYSTEM$GET_PREDECESSOR_RETURN_VALUE](../sql-reference/functions/system_get_predecessor_return_value.md) to retrieve it.

When a task has multiple predecessors, you must specify which task has the return value that you want.
In the following example, we create a root task in a task graph that adds configuration information.

```sqlexample
CREATE OR REPLACE TASK task_c
  SCHEDULE = '1 MINUTE'
  USER_TASK_TIMEOUT_MS = 60000
  AS
    BEGIN
      CALL SYSTEM$SET_RETURN_VALUE('task_c successful');
    END;

CREATE OR REPLACE TASK task_d
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_c
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$GET_PREDECESSOR_RETURN_VALUE('task_c'));
      CREATE TABLE IF NOT EXISTS demo_table(NAME VARCHAR, VALUE VARCHAR);
      INSERT INTO demo_table VALUES('Value from predecessor task_c', :value);
    END;
```

### Get and use runtime information

Use the function [SYSTEM$TASK_RUNTIME_INFO](../sql-reference/functions/system_task_runtime_info.md) to report information about the current task run. This function has several options specific to task graphs. For example, use CURRENT_ROOT_TASK_NAME to get the name of the root task in the current task graph.
The following examples shows how to add a date stamp to a table based on when the root task of the task graph started.

```sqlexample
-- Updates the date/time table after the root task completes.
CREATE OR REPLACE TASK task_date_time_table
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_root
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$TASK_RUNTIME_INFO('CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP'));
      INSERT INTO date_time_table VALUES('order_date',:value);
    END;
```

## Examples

### Example: Start multiple tasks and report status

In the following example, the root task starts tasks to update three different tables. After those three tables are updated, a task combines the information from the other three tables into an aggregate sales table.

```sqlexample
-- Create a notebook in the public schema
-- USE DATABASE <database name>;
-- USE SCHEMA <schema name>;

-- task_a: Root task. Starts the task graph and sets basic configurations.
CREATE OR REPLACE TASK task_a
  SCHEDULE = '1 MINUTE'
  TASK_AUTO_RETRY_ATTEMPTS = 2
  SUSPEND_TASK_AFTER_NUM_FAILURES = 3
  USER_TASK_TIMEOUT_MS = 60000
  CONFIG='{"environment": "production", "path": "/prod_directory/"}'
  AS
    BEGIN
      CALL SYSTEM$SET_RETURN_VALUE('task_a successful');
    END;
;

-- task_customer_table: Updates the customer table.
--   Runs after the root task completes.
CREATE OR REPLACE TASK task_customer_table
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_a
  AS
    BEGIN
      LET VALUE := (SELECT customer_id FROM ref_cust_table
        WHERE cust_name = "Jane Doe";);
      INSERT INTO customer_table VALUES('customer_id',:value);
    END;
;

-- task_product_table: Updates the product table.
--   Runs after the root task completes.
CREATE OR REPLACE TASK task_product_table
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_a
  AS
    BEGIN
      LET VALUE := (SELECT product_id FROM ref_item_table
        WHERE PRODUCT_NAME = "widget";);
      INSERT INTO product_table VALUES('product_id',:value);
    END;
;

-- task_date_time_table: Updates the date/time table.
--   Runs after the root task completes.
CREATE OR REPLACE TASK task_date_time_table
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_a
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$TASK_RUNTIME_INFO('CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP'));
      INSERT INTO "date_time_table" VALUES('order_date',:value);
    END;
;

-- task_sales_table: Aggregates changes from other tables.
--   Runs only after updates are complete to all three other tables.
CREATE OR REPLACE TASK task_sales_table
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_customer_table, task_product_table, task_date_time_table
  AS
    BEGIN
      LET VALUE := (SELECT sales_order_id FROM ORDERS);
      JOIN CUSTOMER_TABLE ON orders.customer_id=customer_table.customer_id;
      INSERT INTO sales_table VALUES('sales_order_id',:value);
    END;
;
```

### Finalizer task example: Send email notification

This example demonstrates how to use a finalizer task to send an email summary of a task graph run.
The finalizer task calls two external functions: one aggregates the completion status of each task,
and the other formats the information into an email for a remote messaging service.

This example uses an example root task named `task_root` and an example finalizer task named `notify_finalizer`.

```sqlexample
CREATE OR REPLACE TASK notify_finalizer
  USER_TASK_TIMEOUT_MS = 60000
  FINALIZE = task_root
AS
  DECLARE
    my_root_task_id STRING;
    my_start_time TIMESTAMP_LTZ;
    summary_json STRING;
    summary_html STRING;
  BEGIN
    --- Get root task ID
    my_root_task_id := (SELECT SYSTEM$TASK_RUNTIME_INFO('CURRENT_ROOT_TASK_UUID'));
    --- Get root task scheduled time
    my_start_time := (SELECT SYSTEM$TASK_RUNTIME_INFO('CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP')::timestamp_ltz);
    --- Combine all task run info into one JSON string
    summary_json := (SELECT get_task_graph_run_summary(:my_root_task_id, :my_start_time));
    --- Convert JSON into HTML table
    summary_html := (SELECT HTML_FROM_JSON_TASK_RUNS(:summary_json));

    --- Send HTML to email
    CALL SYSTEM$SEND_EMAIL(
        'email_notification',
        'admin@snowflake.com',
        'notification task run summary',
        :summary_html,
        'text/html');
    --- Set return value for finalizer
    CALL SYSTEM$SET_RETURN_VALUE('✅ Graph run summary sent.');
  END

CREATE OR REPLACE FUNCTION get_task_graph_run_summary(my_root_task_id STRING, my_start_time TIMESTAMP_LTZ)
  RETURNS STRING
AS
$$
  (SELECT
    ARRAY_AGG(OBJECT_CONSTRUCT(
      'task_name', name,
      'run_status', state,
      'return_value', return_value,
      'started', query_start_time,
      'duration', duration,
      'error_message', error_message
      )
    ) AS GRAPH_RUN_SUMMARY
  FROM
    (SELECT
      NAME,
      CASE
        WHEN STATE = 'SUCCEED' then '🟢 Succeeded'
        WHEN STATE = 'FAILED' then '🔴 Failed'
        WHEN STATE = 'SKIPPED' then '🔵 Skipped'
        WHEN STATE = 'CANCELLED' then '🔘 Cancelled'
      END AS STATE,
      RETURN_VALUE,
      TO_VARCHAR(QUERY_START_TIME, 'YYYY-MM-DD HH24:MI:SS') AS QUERY_START_TIME,
      CONCAT(TIMESTAMPDIFF('seconds', query_start_time, completed_time),
        ' s') AS DURATION,
      ERROR_MESSAGE
    FROM
      TABLE(my-database.information_schema.task_history(
        ROOT_TASK_ID => my_root_task_id ::STRING,
        SCHEDULED_TIME_RANGE_START => my_start_time,
        SCHEDULED_TIME_RANGE_END => current_timestamp()
      ))
    ORDER BY
      SCHEDULED_TIME)
  )::STRING
$$
;

CREATE OR REPLACE FUNCTION HTML_FROM_JSON_TASK_RUNS(JSON_DATA STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.9'
  HANDLER = 'GENERATE_HTML_TABLE'
AS
$$
import json

def GENERATE_HTML_TABLE(JSON_DATA):
    column_widths = ["320px", "120px", "400px", "160px", "80px", "480px"]

    DATA = json.loads(JSON_DATA)
    HTML = f"""
    <img src="https://docs.snowflake.com/images/logo-sample.png"
      alt="Sample organization logo" height="72">
    <p><strong>Task Graph Run Summary</strong>
      <br>Sign in to Snowsight to see more details.</p>
    <table border="1" style="border-color:#DEE3EA"
      cellpadding="5" cellspacing="0">
      <thead>
        <tr>
    """
    headers = ["Task name", "Run status", "Return value", "Started", "Duration", "Error message"]
    for i, header in enumerate(headers):
        HTML += f'<th scope="col" style="text-align:left; width: {column_widths[i]}">{header.capitalize()}</th>'

    HTML += """
        </tr>
      </thead>
      <tbody>
    """
    for ROW_DATA in DATA:
        HTML += "<tr>"
        for header in headers:
            key = header.replace(" ", "_").upper()
            CELL_DATA = ROW_DATA.get(key, "")
            HTML += f'<td style="text-align:left; width: {column_widths[headers.index(header)]}">{CELL_DATA}</td>'
        HTML += "</tr>"
    HTML += """
      </tbody>
    </table>
    """
    return HTML
$$
;
```

### Finalizer task example: Correct for errors

This example demonstrates how a finalizer task can correct for errors.

For demonstration purposes, the tasks are designed to fail during their first run. The finalizer tasks corrects the issue and restarts the tasks, which succeed on following runs:

```sqlexample
-- Configuration
-- By default, the notebook creates the objects in the public schema.
-- USE DATABASE <database name>;
-- USE SCHEMA <schema name>;

-- 1. Set the default configurations.
--    Creates a root task ("task_a"), and sets the default configurations
--    used throughout the task graph.
--    Configurations include:
--    * Each task runs after one minute, with a 60-second timeout.
--    * If a task fails, retry it twice. if it fails twice,
--      the entire task graph is considered as failed.
--    * If the task graph fails consecutively three times, suspend the task.
--    * Other environment values are set.

CREATE OR REPLACE TASK task_a
  SCHEDULE = '1 MINUTE'
  USER_TASK_TIMEOUT_MS = 60000
  TASK_AUTO_RETRY_ATTEMPTS = 2
  SUSPEND_TASK_AFTER_NUM_FAILURES = 3
  AS
    BEGIN
      CALL SYSTEM$SET_RETURN_VALUE('task a successful');
    END;
;

-- 2. Use a runtime reflection variable.
--    Creates a child task ("task_b").
--    By design, this example fails the first time it runs, because
--    it writes to a table ("demo_table") that doesn’t exist.
CREATE OR REPLACE TASK task_b
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_a
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$TASK_RUNTIME_INFO('current_task_name'));
      INSERT INTO demo_table VALUES('task b name',:VALUE);
    END;
;

-- 3. Get a task graph configuration value.
--    Creates the child task ("task_c").
--    By design, this example fails the first time it runs, because
--    the predecessor task ("task_b") fails.
CREATE OR REPLACE TASK task_c
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_b
  AS
    BEGIN
      CALL SYSTEM$GET_TASK_GRAPH_CONFIG('path');
      LET VALUE := (SELECT SYSTEM$GET_TASK_GRAPH_CONFIG('path'));
      INSERT INTO demo_table VALUES('task c path',:value);
    END;
;

-- 4. Get a value from a predecessor.
--    Creates the child task ("task_d").
--    By design, this example fails the first time it runs, because
--    the predecessor task ("task_c") fails.
CREATE OR REPLACE TASK task_d
  USER_TASK_TIMEOUT_MS = 60000
  AFTER task_c
  AS
    BEGIN
      LET VALUE := (SELECT SYSTEM$GET_PREDECESSOR_RETURN_VALUE('TASK_A'));
      INSERT INTO demo_table VALUES('task d: predecessor return value', :value);
    END;
;

-- 5. Create the finalizer task ("task_f"), which creates the missing demo table.
--    After the finalizer completes, the task should automatically retry
--    (see task_a: task_auto_retry_attempts).
--    On retry, task_b, task_c, and task_d should complete successfully.
CREATE OR REPLACE TASK task_f
  USER_TASK_TIMEOUT_MS = 60000
  FINALIZE = task_a
  AS
    BEGIN
      CREATE TABLE IF NOT EXISTS demo_table(NAME VARCHAR, VALUE VARCHAR);
    END;
;

-- 6. Resume the finalizer. Upon creation, tasks start in a suspended state.
--    Use this command to resume the finalizer.
ALTER TASK task_f RESUME;
SELECT SYSTEM$TASK_DEPENDENTS_ENABLE('task_a');

-- 7. Query the task history
SELECT
    name, state, attempt_number, scheduled_from
  FROM
    TABLE(information_schema.task_history(task_name=> 'task_b'))
  LIMIT 5;
;

-- 8. Suspend the task graph to stop incurring costs
--    Note: To stop the task graph, you only need to suspend the root task
--    (task_a). Child tasks don’t run unless the root task is run.
--    If any child tasks are running, they have a limited duration
--    and will end soon.
ALTER TASK task_a SUSPEND;
DROP TABLE demo_table;

-- 9. Check tasks during execution (optional)
--    Run this command to query the demo table during execution
--    to check which tasks have run.
SELECT * FROM demo_table;

-- 10. Demo reset (optional)
--     Run this command to remove the demo table.
--     This causes task_b to fail during its first run.
--     After the task graph retries, task_b will succeed.
DROP TABLE demo_table;
```

---
title: Create a Snowflake Open Catalog account
source: https://docs.snowflake.com/en/user-guide/opencatalog/create-open-catalog-account.md
section: User Guide
---

# Create a Snowflake Open Catalog account

If you’re an existing Snowflake customer, you can sign up for Snowflake Open Catalog by using Snowsight or the CREATE ACCOUNT Snowflake SQL
command.

You typically only need one Open Catalog account for your organization. However, you can create multiple accounts, if needed.

**Note**

> To create an Open Catalog account, you must be a user with the organization administrator (ORGADMIN) role.

## Create an account by using Snowsight

To create an Open Catalog account by using Snowsight, do the following:

1. Sign in to Snowsight.
2. Select **Admin** > **Accounts**.
3. In the **+ Account** drop-down, select **Create Snowflake Open Catalog Account**.
4. In the **Create Snowflake Open Catalog Account** dialog, complete the fields:

   * **Cloud:** The cloud provider where you want to store Apache Iceberg™
     tables.
   * **Region:** The region where you want to store Iceberg tables.
   * **Edition:** The edition for your Open Catalog account.
5. Select **Next**.
6. In the **Create New Account** dialog, complete the **Account Name**, **User Name**, **Password**, and **Email** fields.
7. Select **Create Account**.

   Your new Open Catalog account is created and a confirmation box appears.
8. In the confirmation box, select the **Account Locator URL** to open
   the Account Locator URL in your web browser.
9. Bookmark the Account Locator URL. When signing in to Open
   Catalog, you must specify the Account Locator URL.

## Create an account by using Snowflake SQL

To create an Open Catalog account by using Snowflake SQL, run the following
CREATE ACCOUNT SQL command:

```sqlsyntax
CREATE ACCOUNT <account_name>
ADMIN_NAME = <admin_user_name>
ADMIN_PASSWORD = '<admin_user_password>'
MUST_CHANGE_PASSWORD = { TRUE | FALSE }
EMAIL = '<admin_user_email>'
EDITION = standard
REGION = <cloud_region>
POLARIS = TRUE;
```

For more information, see [CREATE ACCOUNT](https://docs.snowflake.com/en/sql-reference/sql/create-account).

**Important**

> After you run the CREATE ACCOUNT SQL command, copy the accountLocatorUrl in the command output and save it for signing in to Open Catalog.

---
title: Create an Apache Iceberg™ table in Snowflake
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-create.md
section: User Guide
---

# Create an Apache Iceberg™ table in Snowflake

Create [Apache Iceberg™ tables](tables-iceberg.md) in Snowflake for different [Catalog options](tables-iceberg.md).
You can create an Iceberg table by using the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command.

> **Note:**
>
> * To create an Iceberg table, you must have a running warehouse that is specified as the current warehouse for your session.
>   Errors might occur if no running warehouse is specified when you create an Iceberg table.
>   For more information, see [Working with Warehouses](warehouses-tasks.md).
> * To create an Iceberg table that works with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview), see [Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake](tables-iceberg-open-catalog.md).

## Snowflake-managed

To create an Iceberg table with Snowflake as the catalog, you specify an
[external volume](tables-iceberg.md) and a base location (directory on the external volume)
where Snowflake can write table data and metadata.

You can use one of the following storage options:

* **Your cloud storage**: Create an [external volume](../sql-reference/sql/create-external-volume.md) and reference it from the table.
  For instructions, see [Configure an external volume](tables-iceberg-configure-external-volume.md).
* **Snowflake-provided storage**: Set `EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'` (or rely on defaults when the catalog is Snowflake).
  You don’t create a separate external volume object for that path. For more information, see [Snowflake storage for Apache Iceberg™ tables](tables-iceberg-internal-storage.md).

To define table columns, you can use Iceberg data types. For more information, see [Data types for Apache Iceberg™ tables](tables-iceberg-data-types.md).

The following example creates an Iceberg table with Snowflake as the Iceberg catalog, and uses the value of the column named `int_col`
to [partition the table](tables-iceberg-metadata.md):

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    boolean_col boolean,
    int_col int,
    long_col long,
    float_col float,
    double_col double,
    decimal_col decimal(10,5),
    string_col string,
    fixed_col fixed(10),
    binary_col binary,
    date_col date,
    time_col time,
    timestamp_ntz_col timestamp_ntz(6),
    timestamp_ltz_col timestamp_ltz(6)
  )
  PARTITION BY (int_col)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_ext_vol'
  BASE_LOCATION = 'my/relative/path/from/extvol';
```

> **Note:**
>
> Alternatively, use variant syntax.
> For more information, see [CREATE TABLE … AS SELECT](../sql-reference/sql/create-iceberg-table-snowflake.md) and
> [CREATE ICEBERG TABLE … LIKE](../sql-reference/sql/create-iceberg-table-snowflake.md).

After you create a table that uses Snowflake as the catalog, you can take actions such as:

* [Generating snapshots](tables-iceberg-manage.md)
* [Querying the table](tables-iceberg-manage.md)
* [Updating the table](tables-iceberg-manage.md)

For more information, see [Manage Apache Iceberg™ tables](tables-iceberg-manage.md).

## External catalog

To create an Iceberg table that uses an external catalog, or no catalog at all, you must specify an
[external volume](tables-iceberg.md) and a [catalog integration](tables-iceberg.md).
If you use an external Iceberg catalog, you might also need to specify additional parameters. For example, when you use AWS Glue as the catalog,
you must specify a catalog table name.

When you create an Iceberg table that uses an external catalog, Snowflake performs an initial metadata refresh.
You can also manually refresh the table metadata using the [ALTER ICEBERG TABLE … REFRESH](../sql-reference/sql/alter-iceberg-table-refresh.md) command to
synchronize the metadata with the most recent table changes. For more information, see [Refresh the table metadata](tables-iceberg-manage.md).

> **Note:**
>
> The CREATE ICEBERG TABLE command supports different options for different external catalogs. The examples in this section specify only
> some of the available options. To view the full syntax, see the following pages:
>
> > * [CREATE ICEBERG TABLE (AWS Glue as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-aws-glue.md)
> > * [CREATE ICEBERG TABLE (Iceberg files in object storage)](../sql-reference/sql/create-iceberg-table-iceberg-files.md)
> > * [CREATE ICEBERG TABLE (Delta files in object storage)](../sql-reference/sql/create-iceberg-table-delta.md)
> > * [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md)
>
> You can also configure data governance (for example, masking or row access policies)
> for externally managed tables by using [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md).

### Iceberg files in object storage

The following example creates an Iceberg table from Iceberg metadata in external cloud storage,
specifying a relative path to the table metadata on the external volume (`METADATA_FILE_PATH`).

```sqlexample
CREATE ICEBERG TABLE myIcebergTable
  EXTERNAL_VOLUME='icebergMetadataVolume'
  CATALOG='icebergCatalogInt'
  METADATA_FILE_PATH='path/to/metadata/v1.metadata.json';
```

### Delta files in object storage

The following example command creates an Iceberg table from Delta table files in object storage with
[automated refresh](tables-iceberg-auto-refresh.md).

The example specifies an external volume associated with the cloud location of the Delta table files,
a [catalog integration configured for Delta](tables-iceberg-configure-catalog-integration-object-storage.md),
and a value for the required `BASE_LOCATION` parameter.

```sqlexample
CREATE ICEBERG TABLE my_delta_iceberg_table
  CATALOG = delta_catalog_integration
  EXTERNAL_VOLUME = delta_external_volume
  BASE_LOCATION = 'relative/path/from/ext/vol/'
  AUTO_REFRESH = TRUE;
```

If the Delta table uses a partitioning scheme, Snowflake automatically interprets the scheme from the Delta log.

### Apache Iceberg™ REST catalog

The following example creates a table that uses a remote
[Iceberg REST catalog](tables-iceberg-configure-catalog-integration-rest.md).

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'my_rest_catalog_integration'
  CATALOG_TABLE_NAME = 'my_remote_table'
  AUTO_REFRESH = TRUE;
```

For more examples by use case, see the following topics:

* [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md)
* [Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md)
* [Use catalog-vended credentials for Apache Iceberg™ tables](tables-iceberg-configure-catalog-integration-vended-credentials.md)
* [Query a table in Snowflake Open Catalog using Snowflake](tables-iceberg-open-catalog-query.md)

---
title: Create an Azure stage
source: https://docs.snowflake.com/en/user-guide/data-load-azure-create-stage.md
section: User Guide
---

# Create an Azure stage

A stage specifies where data files are stored (that is, “staged”) so that the data in the files can be loaded into a table.

Data can be loaded directly from files in a specified Azure container or in an Azure “folder” path (i.e. key value prefix). If the path ends with `/`, all of the objects in the corresponding Azure folder are loaded.

## External stages

In addition to loading directly from files in Azure containers, Snowflake supports creating named external stages, which encapsulate all of the required information for staging files, including:

* The Azure container where the files are staged.
* The named storage integration object or Azure credentials for the container (if it is protected).
* An encryption key (if the files in the container have been encrypted).

Named external stages are optional, but recommended when you plan to load data regularly from the same location. For instructions for creating an external stage, see Create an external stage below.

> **Note:**
>
> To improve query performance for an Azure external stage, configure your network routing to use
> [Microsoft network routing](https://learn.microsoft.com/en-us/azure/storage/common/network-routing-preference#microsoft-global-network-versus-internet-routing).
> For instructions, see the [Azure documentation](https://learn.microsoft.com/en-us/azure/storage/common/configure-network-routing-preference?tabs=azure-portal).

## Create an external stage

You can create a named external stage using SQL or the web interface.

> **Note:**
>
> To create a stage, you must use a role that is granted or inherits the necessary privileges.
> For more information, see [Access control requirements](../sql-reference/sql/create-stage.md) for [CREATE STAGE](../sql-reference/sql/create-stage.md).

### Create an external stage using SQL

Use the [CREATE STAGE](../sql-reference/sql/create-stage.md) command to create an external stage.

The following example creates an external stage named `my_azure_stage`. The CREATE statement includes the `azure_int` storage
integration that was created in [Configure an Azure container for loading data](data-load-azure-config.md) to access the Azure container `container1` in the `myaccount`
account.

The data files are stored in the `load/files/` path. The stage references a named file format object named `my_csv_format`, which
describes the data in the files stored in the path:

```sqlexample
CREATE STAGE my_azure_stage
  STORAGE_INTEGRATION = azure_int
  URL = 'azure://myaccount.blob.core.windows.net/mycontainer/load/files/'
  FILE_FORMAT = my_csv_format;
```

> **Note:**
>
> Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.

> **Note:**
>
> By specifying a named file format object (or individual file format options) for the stage, it is not necessary to later specify the same file format options in the COPY command used to load data from
> the stage. For more information about file format objects and options, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).

### Create an external stage using Python

Use the [StageCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.stage.StageCollection)
method of the [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-overview.md) to create an external stage.

Similar to the preceding SQL example, the following Python example creates an external stage named `my_azure_stage`:

```python
from snowflake.core.stage import Stage

my_stage = Stage(
  name="my_azure_stage",
  storage_integration="azure_int",
  url="azure://myaccount.blob.core.windows.net/mycontainer/load/files/"
)
root.databases["<database>"].schemas["<schema>"].stages.create(my_stage)
```

> **Note:**
>
> The Python API currently does not support the FILE_FORMAT parameter of the [CREATE STAGE](../sql-reference/sql/create-stage.md) SQL command.

### Create an external stage using Snowsight

To use Snowsight to create a named external stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Stage » External Stage.
3. Select your external cloud storage provider: Amazon S3, Microsoft Azure, or Google Cloud Platform.
4. In the Create Stage dialog, enter a Stage Name.
5. Select the database and schema where you want to create the stage.
6. Enter the URL of your external cloud storage location.
7. If your external storage isn’t public, enable Authentication and enter your details. For more information,
   see [CREATE STAGE](../sql-reference/sql/create-stage.md).
8. Optionally deselect Directory table. Directory tables let you see files on the stage,
   but require a warehouse and thus incur a cost. You can choose to deselect this option for now and enable a directory table later.

   > If you enable Directory table, optionally select Enable auto-refresh, and then select your event notification or
   > notification integration to automatically refresh the directory table when files are added or removed.
   > For more information, see [Automated directory table metadata refreshes](data-load-dirtables-auto.md).
9. If your files are encrypted, enable Encryption, and then enter your details.
10. (Optional) To view a generated SQL statement, expand the SQL Preview.
    To specify additional options for your stage, such as AUTO_REFRESH, you can open this SQL preview in a worksheet.
11. Select Create.

**Next:** [Copy data from an Azure stage](data-load-azure-copy.md)

---
title: Create an organizational listing
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-create.md
section: User Guide
---

# Create an organizational listing

Create an organizational listing to share data products securely within your organization. Before you create an organizational listing, review the prerequisites, known limitations, and considerations.

## Prerequisites

* Your organization has an ORGADMIN role. ([Organization accounts](../../../organization-accounts.md) are optional.)

## Known limitations

* Support for organizational listings in government and special regions is currently in preview in the following regions:

  + FRM: US-EAST-1
  + FRM: US-WEST-2
  + KSA: GCPMECENTRAL2

  These deployments are subject to the following limitations:

  + Creating custom organization profiles in government regions isn’t supported.
  + The [ACCESS_HISTORY view](../../../../sql-reference/organization-usage/access_history.md) in the organization account isn’t available.
  + Organizational listings created from commercial or Virtual Private Snowflake (VPS) accounts don’t show up when searching, filtering, or browsing listings.
* You must use the API to target specific regions.
* Data products supported: Snowflake Native App Framework and shares.
* Organizational listings that contain a Snowflake Native App do not support target roles for access or discovery.
* The following features are not supported when using organizational listings:

  + Provider studio analytics.
  + Reader accounts.
* You cannot specify specific regions in organizational listings using Snowsight.

  Instead, you can specify the region in the [manifest YAML](org-listing-manifest-reference.md)
  file when [creating](../../../../sql-reference/sql/create-organization-listing.md) or [altering](../../../../sql-reference/sql/alter-listing.md) the listing programatically.
* When assigning organizational listing privileges, Snowflake loads all database roles granted to a share. This allows consumers to see unmasked data when running a query on a mountless listing. To avoid this behavior, mount the listing and don’t use database roles with ULLs.

## Considerations

* Before you target an entire organization, check for external tenants. Adjust the target accounts for your data
  products before adding them to an organizational listing unless you intend to share them with external tenants.
* Each share can be attached to one listing.
* Each Native App can be attached to one or more listings.
* For organization changes (such as mergers) with accounts containing organizational listings, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Access control requirements

Use the information provided here to determine the specific roles and privileges that you must have to execute organizational listing SQL commands.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../security-access-control-overview.md), see [Overview of Access Control](../../../security-access-control-overview.md).

### Assign organizational listing privileges

To create an organizational listing, a role must have the necessary privileges to create a share, as shown in Share creation and management, as well as necessary privileges to create an organizational listing from it, as shown in Privileges to create an organizational listing using the share.

#### Share creation and management

To create a share and to create and manage objects inside a share, a role must have the necessary privileges on the data objects, schemas, and the CREATE SHARE command.

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SHARE | ACCOUNT | To `CREATE` a share |
| OWNERSHIP or USAGE with grants option | DATABASE | To see and `USE` the specified database. |
| OWNERSHIP or USAGE with grants option | SCHEMA | To see the specified schema. |
| SELECT | TABLE | To query specified tables in the specified schema. |

The `USAGE` privilege on the parent database and schema is required to perform operations on any object in a schema.

#### Privileges to create an organizational listing using the share

One of the following privileges is required to create an organizational listing, in addition to the share-related privileges listed above.

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION LISTING | ACCOUNT | To create and alter organizational listings. |
| CREATE LISTING | ACCOUNT | To create and alter organizational listings and private listings. |

#### Privileges to alter an organizational listing

One of the following privileges is required to alter a listing.

| Role | Notes |
| --- | --- |
| OWNERSHIP | Can `ALTER` a share without additional grant options. |
| MODIFY with grants option | Can `ALTER` a share after granting modify privileges to a role. This can be done using:  ```sqlexample grant modify on data exchange listing <listing_name> to role <role_name> ``` |

## Consume or query an organizational listing

To directly consume an organizational listing, you can reference the [Uniform Listing Locator (ULL)](org-listing-configure.md) without any additional privileges. If you require mounting the listing, then the following privileges are required:

| Privilege | Object | Notes |
| --- | --- | --- |
| IMPORT ORGANIZATION LISTING | ACCOUNT | To import an organizational listing. |
| CREATE database | ACCOUNT | To create a database and mount the listing objects. |

### Manage listing auto-fulfillment settings

Before managing auto-fulfillment settings for your organization listing, ensure that you have the necessary roles to manage auto-fulfilling the listing. See the auto-fulfillment [required privileges](../../../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md) for more information.

A [role](../../../security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE LISTING AUTO FULFILLMENT | ACCOUNT | To configure the auto-fulfillment settings. |

## Create an organizational listing in Snowsight or SQL

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. Select + Create Listing.
4. Select a data product such as a table, view, or other data product to add to the listing.

   1. Review the generated share identifier, then select Generate listing.
5. Enter a name for your listing and review the generated Universal Listing Locator, then select Save.
6. To specify who can access the listing (the target accounts, roles, and regions), select + Access Control. The Access and discovery dialog displays.

   1. Complete the Grant access section:

      | Field | Description |
      | --- | --- |
      | Who can access this data product? | Select one of the following:  * Entire organization: Anyone in the organization can access the listing.  If Entire organization is selected and [cross-cloud auto-fulfillment](http://other-docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment) is enabled on your account, then you’ll be prompted to review the auto-fulfillment refresh settings for the listing. * Selected accounts and roles: Only selected accounts and roles can access. * No accounts or roles are pre-approved: (Default) Data product will only be available by request. |
      | Accounts | If Selected accounts and roles is selected, select one or more accounts.  Optional. Select + Add another account to add second and subsequent accounts.  By default, all roles in the selected accounts can access the listing. Select Selected roles to grant access only to specific roles each selected account. |
   2. Complete the Allow discovery section:

      | Field | Description |
      | --- | --- |
      | Who else can discover the listing and request access? | Select one of:  * Entire organization: (Default) Anyone in the organization can discover listing and request access. This field is selected and disabled if Entire organization is specified for in the Grant access section. * Selected accounts and roles: Only selected accounts and roles can discover listing and request access. * Not discoverable by users without access: Only users with access can discover this listing. |
      | Accounts | If Select accounts and roles is selected, select one or more accounts.  Optional. Select + Add another account to add second and subsequent accounts and grant access to specific roles. |
      | Selected user roles | If Selected roles is selected, enter one or more roles to grant access. |
   3. If Allow discovery is Selected accounts and roles, then select Set up request approval flow.

      * In the Set up request approval flow dialog, select one of the following options in the How should the request approval happen? list:

        + Manage requests in Snowflake: Enter the email address of the request approver and optionally specify additional roles that can approve requests.
        + Manage requests outside of Snowflake: Enter an email address for the request approver or enter a URL that points to an internal ticketing system.
        > **Note:**
        >
        > The Set up request approval flow button isn’t available if the data product is accessible by the entire organization or if the data product is not discoverable by users without access.

7. Complete the listing.

   Enter addition information about listing page to guide consumers,
   such as the following. For more information about these fields, see [Configure listings](../../../../collaboration/provider-listings-reference.md).

   * Description
   * Data dictionary
   * Quick start examples
   * Details, including the support contact
   * Documentation links
   * Terms of service
   * Attributes
8. Select Publish to make the listing available in the Internal Marketplace.

   If you exit without publishing, the listing is saved as a draft that’s ready for review or for the addition of descriptive metadata.

Create an organizational listing from the share with the required attributes included in YAML (entered in $$ delimiters).

This part of the manifest yaml specifies the accounts that will be able to use the organizational listing:

```yaml
organization_targets:
  access:
```

This example creates a listing using the required settings in the manifest YAML. It targets one role in
one account in one region and includes support and approver contacts:

> **Note:**
>
> `support_contact` is required.
> `approver_contact` is required if a `discovery` target is provided.

```sqlexample
USE ROLE <organizational_listing_role>;

CREATE ORGANIZATION LISTING <organization_listing_name>
SHARE <share_name> AS
$$
title: "My title"
description: "One region, all accounts"
organization_profile: "INTERNAL"
organization_targets:
  discovery:
  - account: "<account_name>"
    roles:
    - "<role>"

  access:
  - account: "<account_name>"
    roles:
    - "<role>"

support_contact: "support@somedomain.com"
approver_contact: "approver@somedomain.com"
locations:
  access_regions:
  - name: "PUBLIC.<snowflake_region>"
$$;
```

For a complete list of all fields and values for an organizational listing see [Organization listing manifest reference](org-listing-manifest-reference.md).
For additional examples, see [Set who can discover and access an organizational listing](org-listing-configure.md).

---
title: Create an S3 stage
source: https://docs.snowflake.com/en/user-guide/data-load-s3-create-stage.md
section: User Guide
---

# Create an S3 stage

An external (that is, Amazon S3) stage specifies where data files are stored so that the data in the files can be loaded into a table.

Data can be loaded directly from files in a specified S3 bucket, with or without a folder path (or prefix, in S3 terminology). If the path ends with `/`, all of the objects in the corresponding S3 folder are loaded.

> **Note:**
>
> In the [previous step](data-load-s3-config.md), if you followed the instructions to configure an AWS IAM role with the required policies and permissions
> to access your external S3 bucket, you have already created an S3 stage. You can skip this step and continue to [Copying data from an S3 stage](data-load-s3-copy.md).

## External stages

In addition to loading directly from files in S3 buckets, Snowflake supports creating named external stages, which encapsulate all of the required information for staging files, including:

* The S3 bucket where the files are staged.
* The named storage integration object or S3 credentials for the bucket (if it is protected).
* An encryption key (if the files in the bucket have been encrypted).

Named external stages are optional, but recommended when you plan to load data regularly from the same location.

> **Note:**
>
> Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
> This process might leave incomplete uploads in the storage location for your external stage.
>
> To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
> For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
> or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.

## Create an external stage

You can create a named external stage using SQL or the web interface.

> **Note:**
>
> To create a stage, you must use a role that is granted or inherits the necessary privileges.
> For more information, see [Access control requirements](../sql-reference/sql/create-stage.md) for [CREATE STAGE](../sql-reference/sql/create-stage.md).

### Create an external stage using SQL

Use the [CREATE STAGE](../sql-reference/sql/create-stage.md) command to create an external stage using SQL.

The following example uses SQL to create an external stage named `my_s3_stage` that references a private/protected S3 bucket
named `mybucket` with a folder path named `encrypted_files/`. The CREATE statement includes the `s3_int` storage integration
that was created in [Option 1: Configure a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md) to access the S3 bucket. The stage references a named file
format object named `my_csv_format`, which describes the data in the files stored in the bucket path:

> ```sqlexample
> CREATE STAGE my_s3_stage
>   STORAGE_INTEGRATION = s3_int
>   URL = 's3://mybucket/encrypted_files/'
>   FILE_FORMAT = my_csv_format;
> ```

> **Note:**
>
> By specifying a named file format object (or individual file format options) for the stage, it is not necessary to later specify the same file format options in the COPY command used to load data from the stage.

### Create an external stage using Python

Use the [StageCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.stage.StageCollection)
method of the [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-overview.md) to create an external stage.

Similar to the preceding SQL example, the following Python example creates an external stage named `my_s3_stage` that references an S3
bucket named `mybucket` with a folder path named `encrypted_files/`:

```python
from snowflake.core.stage import Stage

my_stage = Stage(
  name="my_s3_stage",
  storage_integration="s3_int",
  url="s3://mybucket/encrypted_files/"
)
root.databases["<database>"].schemas["<schema>"].stages.create(my_stage)
```

> **Note:**
>
> The Python API currently does not support the FILE_FORMAT parameter of the [CREATE STAGE](../sql-reference/sql/create-stage.md) SQL command.

### Create an external stage using Snowsight

To use Snowsight to create a named external stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Stage » External Stage.
3. Select your external cloud storage provider: Amazon S3, Microsoft Azure, or Google Cloud Platform.
4. In the Create Stage dialog, enter a Stage Name.
5. Select the database and schema where you want to create the stage.
6. Enter the URL of your external cloud storage location.
7. If your external storage isn’t public, enable Authentication and enter your details. For more information,
   see [CREATE STAGE](../sql-reference/sql/create-stage.md).
8. Optionally deselect Directory table. Directory tables let you see files on the stage,
   but require a warehouse and thus incur a cost. You can choose to deselect this option for now and enable a directory table later.

   > If you enable Directory table, optionally select Enable auto-refresh, and then select your event notification or
   > notification integration to automatically refresh the directory table when files are added or removed.
   > For more information, see [Automated directory table metadata refreshes](data-load-dirtables-auto.md).
9. If your files are encrypted, enable Encryption, and then enter your details.
10. (Optional) To view a generated SQL statement, expand the SQL Preview.
    To specify additional options for your stage, such as AUTO_REFRESH, you can open this SQL preview in a worksheet.
11. Select Create.

**Next:** [Copying data from an S3 stage](data-load-s3-copy.md)

---
title: Create and configure shares
source: https://docs.snowflake.com/en/user-guide/data-sharing-provider.md
section: User Guide
---

# Create and configure shares

This topic describes the tasks associated with a data provider account creating and configuring shares, sharing the shares with other
consumer accounts, and performing ongoing maintenance of the shares.

> **Attention:**
>
> Snowflake is not responsible for ensuring that HIPAA (and HITRUST) accounts who engage in data sharing have a signed BAA with each other;
> this is at the discretion of the accounts that are sharing data. Failure to have a signed BAA might impact the HIPAA (and HITRUST)
> compliance of both accounts, particularly the provider account.
>
> If you have a Business Critical account, consider the following to maintain the expected level of data protection before requesting
> Snowflake to enable Secure Data Sharing with non-Business Critical accounts:
>
> * Do not share sensitive data with non-Business Critical accounts.
> * Consider creating a non-Business Critical account to store less sensitive data and then sharing this data with non-Business
>   Critical accounts.
>
> * If you are using [Tri-Secret Secure](security-encryption-tss.md) with your Business Critical account and you share data with other accounts, Snowflake
>   treats the data access from these accounts as if the access occurred from within your own account. Specifically, granting access to
>   the consumer account may require Snowflake to access the key management service in the cloud platform that hosts your Snowflake account.
>
> These are only recommendations and are not enforced by Snowflake. The decision to share data is always at the discretion of the data
> provider. Snowflake does not assume any responsibility for data that is improperly shared.

## General data sharing considerations and usage

Note the following important usage details for creating and maintaining shares:

* You can share data across regions and cloud platforms. For more information,
  see [Share data securely across regions and cloud platforms](secure-data-sharing-across-regions-platforms.md).
* A share can include data from multiple databases. For more information, see [Share data from multiple databases](data-sharing-multiple-db.md).
* A share is available immediately to a consumer when you add that consumer’s account to the share.
* New and modified rows are available immediately to consumers who have created a database from the share.
  This only happens when the consumer already has access.
* A new object created or recreated in a database granted to a share is not automatically available to consumers. For example,
  if you drop and then recreate an object, it is still considered a new object, even if the name is the same.
  To make a new object available to consumers, you must use the [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) command to
  explicitly add the object to the share.
* For data security and privacy reasons, only [secure views](views-secure.md) are supported in shares at this time.
  If a standard view is added to a share, Snowflake returns an error.
* Creating secure views on streams in your database and then sharing those views with consumers is not recommended.
  This scenario requires the ability to modify a stream in another account, which is not a supported operation and is therefore
  an anti-pattern. Instead, allow consumers to create their own streams on the tables and secure views that you share.
  For more information, see Streams on shared objects (in this topic).
* [Storage lifecycle policies](storage-management/storage-lifecycle-policies.md) aren’t supported on shared tables. If you need to manage data retention
  for shared data, consider implementing retention logic in your application or using other data management strategies before sharing.

## Using SQL with data shares

Preparing objects to share can be performed using any role. Other data sharing tasks, such as creating a share or adding consumer
accounts to the share, requires the ACCOUNTADMIN role or a role granted the global CREATE SHARE privilege.
For more details about the CREATE SHARE privilege, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

If you want to use DDL to create and manage database roles, use the commands listed here:

* [CREATE DATABASE ROLE](../sql-reference/sql/create-database-role.md)
* [ALTER DATABASE ROLE](../sql-reference/sql/alter-database-role.md)
* [DROP DATABASE ROLE](../sql-reference/sql/drop-database-role.md)
* [SHOW DATABASE ROLES](../sql-reference/sql/show-database-roles.md)
* A shared database role does not support future grants on objects. For details, see [GRANT DATABASE ROLE … TO SHARE](../sql-reference/sql/grant-database-role-share.md).

If you want to use DDL to view, grant, or revoke access to database objects in a share, use the commands listed here:

* [GRANT DATABASE ROLE … TO SHARE](../sql-reference/sql/grant-database-role-share.md)
* [REVOKE DATABASE ROLE … FROM SHARE](../sql-reference/sql/revoke-database-role-share.md)
* [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md)
* [REVOKE <privilege> … FROM SHARE](../sql-reference/sql/revoke-privilege-share.md)
* [SHOW GRANTS TO SHARE …](../sql-reference/sql/show-grants.md) — lists all object privileges that have been granted to a share
* [SHOW GRANTS OF SHARE …](../sql-reference/sql/show-grants.md) — lists all accounts for the share and indicates the accounts
  that are using the share

## Preparing to create a share

Before creating a share, Snowflake recommends identifying the Snowflake objects you plan to share:

* Databases
* Tables
* Dynamic tables
* External tables
* Externally managed and managed Apache Iceberg™ tables
* Externally managed Delta Lake tables (with Delta Direct and catalog-linked databases)
* Views

  + Regular views
  + Secure views
  + Secure materialized views
  + Semantic views
* Cortex Search services
* User-defined functions (UDFs) (secure and non-secure)
* Models of type USER_MODEL, CORTEX_FINETUNED, or DOC_AI

This might require some additional planning and administrative tasks, particularly if you decide to share only a subset of data in
any of your tables.

### Database and tables

If you plan to share a database, little or no preparation is required.

If you plan to share entire tables, no preparation is required.

However, if you decide to filter the data in a table (or set of tables), either based on certain conditions, or by consumer account,
you must create one or more secure views on the table(s).

### Secure objects (views, materialized views and UDFs)

To provide strict control of access to data in a shared database, you must
use [secure views](views-secure.md), [secure materialized views](views-materialized.md)
and/or [secure UDFs](../developer-guide/secure-udf-procedure.md). For example,
you can choose to filter data by date or some other condition, or you can decide to use a single share
to partition shared data for different consumer accounts. Secure objects enable you to dictate
the level of granularity you wish to apply to your data while ensuring that the base tables and
business logic are protected from exposure.

Secure objects are defined similar to standard objects, using either the corresponding [CREATE <object>](../sql-reference/sql/create.md)
or [ALTER <object>](../sql-reference/sql/alter.md) commands. However, note the following important
usage information:

* Secure objects that reference tables by their fully-qualified names (i.e. `<db_name>.<schema_name>.<table_name>`)
  can be included in a share; however, you must ensure that the referenced database
  name matches the database for the share.
* Do not include secure objects that use the [CURRENT_USER](../sql-reference/functions/current_user.md)
  or [CURRENT_ROLE](../sql-reference/functions/current_role.md) functions in their definition. The contextual values
  returned by these functions have no relevance in a consumer’s account and will cause the object to fail when queried/used.
* When defining a secure object to share with consumer accounts, a key/vital additional step to perform
  is validating that the object is configured correctly to display only the data you wish to display.
  This is particularly important if you wish to limit data access based on the account the data is shared with. To facilitate performing
  this validation, Snowflake provides the [SIMULATED_DATA_SHARING_CONSUMER](../sql-reference/parameters.md) session parameter.
  The SIMULATED_DATA_SHARING_CONSUMER session parameter only supports secure views and
  secure materialized views, but does not support secure UDFs. Setting this parameter in a session enables you to
  simulate querying a secure view as a user in any of the consumer account(s) you plan to share the view with.

  For example, for consumer account `xy12345`:

  > ```sqlexample
  > ALTER SESSION SET SIMULATED_DATA_SHARING_CONSUMER = xy12345;
  > ```

For a detailed example, see [Use secure objects to control data access](data-sharing-secure-views.md).

### Streams on shared objects

Data consumers can create streams in their own databases that record data manipulation language (DML) changes made to the source tables or
views.

> **Note:**
>
> The operations listed here are not supported:
>
> * Creating append-only streams on shares of secondary source objects is not supported.
> * Modifying a stream in another account is not supported.

You can allow consumers to create streams on shared tables or secure views. Before you do this, you need to extend the data
retention period for the tables, and you also need to enable change tracking on the shared tables or the
underlying tables for a shared view. You set the CHANGE_TRACKING and DATA_RETENTION_TIME_IN_DAYS parameters when
creating or altering a table, using
[CREATE TABLE](../sql-reference/sql/create-table.md) or [ALTER TABLE](../sql-reference/sql/alter-table.md).

Enable change tracking:
:   Currently, when the first stream for a local table is created, a pair of hidden columns are automatically added to the table and begin
    storing change tracking metadata. This change is not possible for shared tables, because a consumer of a share cannot modify the source
    database. Instead, to enable change tracking for tables intended for sharing, execute
    [ALTER TABLE](../sql-reference/sql/alter-table.md) … CHANGE_TRACKING = TRUE on each of the tables.

Extend the data retention period for the table:
:   When a stream on a local table is not consumed regularly, Snowflake temporarily extends the data retention period for the source table
    to help avoid staleness.

    A stream on a shared table does not extend the data retention period for the table. Likewise, a stream on a shared view does not
    extend the data retention period for the underlying tables. To manually specify a longer data retention period
    for any shared table, or any underlying table for a shared view, set the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter for
    the table.

### Shared tag references

A data sharing provider can set a tag on an object and share both the tag and the tagged object with the data sharing consumer.
Additionally, the tag references of the shared object are available to the consumer. Sharing the tag references allows the provider to
share additional context regarding the shared object, such as the data sensitivity of a table or column based on the tag string value.

The consumer can use SQL to view the tag assignments on shared objects and determine the tag references of the shared objects. By viewing
the tag assignments and references of shared objects, data stewards in the consumer account can provide a more comprehensive assessment of
where data originates from and how the data is being used. These new insights can facilitate regulatory compliance requirements.

The provider must create the tag in the same database as the tagged objects and share this database. After sharing the database, if the
provider unsets a tag from a shared object, the change in tag assignment also occurs in the consumer account. The consumer cannot track the
shared object using the tag once that tag is unset. By unsetting the tag, the provider can maintain data discretion in cases when an object
was tagged inadvertently.

[Tag inheritance](object-tagging/inheritance.md) applies to tagged objects in the shared database. For example, if a provider sets a tag
on a schema in the shared database, the objects and columns in that schema are also tagged. However, the consumer cannot use the
Information Schema [TAG_REFERENCES](../sql-reference/functions/tag_references.md) table function to determine where the provider initially set the tag.
Snowflake hides the values in the LEVEL column in the table function output to protect the data provider by not revealing where
the tag was initially set.

> **Important:**
>
> Shared tags are read only. The consumer cannot set a shared tag on an object in their account.

Provider options
:   To share a tag, the provider has these options:

    * Use SQL to allow the share to access the tag and allow the consumer to view the assignments of the shared tag on the shared objects.

      The provider must grant The READ privilege on each tag to make the tag available to a consumer.

      ```sqlexample
      GRANT READ ON TAG mydb.tags.tag1 TO SHARE my_share;

      GRANT USAGE ON DATABASE mydb TO SHARE my_share;
      GRANT USAGE ON SCHEMA mydb.tags TO SHARE my_share;
      ```
    * [Create a database role](../sql-reference/sql/create-database-role.md), grant the READ privilege on the tag to the database role, and
      [grant the database role to the share](../sql-reference/sql/grant-database-role-share.md). The database role also needs the USAGE
      privilege on the schema that stores the tag.

      ```sqlexample
      GRANT READ ON TAG mydb.tags.tag1 TO DATABASE ROLE my_db_role;
      GRANT USAGE ON SCHEMA mydb.tags TO DATABASE ROLE my_db_role;
      GRANT DATABASE ROLE my_db_role TO SHARE my_share;
      ```

Consumer options
:   To view shared tags in the consumer account, the consumer has these options:

    * Use the ACCOUNTADMIN role. Consumer account administrators can view the shared tags the provider makes available.
    * Use a role with IMPORTED PRIVILEGES. An account role that is granted or inherits a role with IMPORTED PRIVILEGES on the database
      created from the share can view the shared tag the provider makes available.

      ```sqlexample
      GRANT IMPORTED PRIVILEGES ON DATABASE db_share TO ROLE db_share_role;
      ```
    * Use a shared database role. If the provider grants the READ privilege on a tag to the database role and shares the database role, the
      consumer can grant the shared database role to an account role in their account.

      ```sqlexample
      GRANT DATABASE ROLE my_db_role TO ROLE consumer_analyst_role;
      ```

    In the consumer account, you can use SQL to view tags, tag references, and tagged objects that the provider shares:

    * Command: [SHOW TAGS](../sql-reference/sql/show-tags.md)
    * Functions:

      + [SYSTEM$GET_TAG](../sql-reference/functions/system_get_tag.md)
      + [TAG_REFERENCES](../sql-reference/functions/tag_references.md)
      + [TAG_REFERENCES_ALL_COLUMNS](../sql-reference/functions/tag_references_all_columns.md)

    Currently, you cannot use the following options in the consumer account to view tags, tag references, and tagged objects the provider
    shares:

    * Snowsight.
    * The Account Usage [TAG_REFERENCES](../sql-reference/account-usage/tag_references.md) view.
    * The Account Usage [TAG_REFERENCES_WITH_LINEAGE](../sql-reference/functions/tag_references_with_lineage.md) table function.

## Creating a share

You must use the ACCOUNTADMIN role or a role that has been granted the CREATE SHARE global privilege to create shares.

### Using Snowsight to create a share

There are several ways to share data in Snowsight:

> * Provide a listing to specific consumers or publicly on the Snowflake Marketplace using Provider Studio.
>   See [Create and publish a listing](../collaboration/provider-listings-creating-publishing.md).
> * Publish a listing in a [data exchange](data-exchange-managing-data-listings.md).
> * Create a direct share to share data with consumer accounts in your region.

If you are creating a share where you need to add a secure view that references objects in other databases,
you must create your share using SQL. For more information, see [Share data from multiple databases](data-sharing-multiple-db.md).

To create a direct share:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Select Share » Create a Direct Share.
5. In the Share Data dialog, select + Select Data and then:

   1. Select a source database.
   2. Select a target object or objects from the source database.
   3. Optionally, update the Secure Share Identifier created for your share.
   4. Optionally, enter a Description.
   5. In the remaining text box, enter an account locator. Entering a partial account locator lists all accounts that match the entered text.
      Repeat as required to add additional accounts. You can only add accounts within the same region to the share.
   6. Select Create Share.

If you want to convert a direct share with active consumers to a listing, see [Convert a direct share to a listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing#convert-a-direct-share-to-a-private-listing).

### Using SQL to create a share

To create a share using SQL:

1. Use the [CREATE SHARE](../sql-reference/sql/create-share.md) command to create an empty share.
2. Use the [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) command to add a database to the share and then selectively grant access to
   specific database objects (schemas, tables and secure views) to the share.
3. Use the [ALTER SHARE](../sql-reference/sql/alter-share.md) command to add one or more accounts access to the share.

> **Note:**
>
> The following steps assume a provider account, `prvdr1`, is sharing data with two consumer accounts, `xy12345` and `yz23456`.

### Using DDL to create and manage shares

To create and manage shares, you use the DDL commands listed here:

* [CREATE SHARE](../sql-reference/sql/create-share.md)
* [ALTER SHARE](../sql-reference/sql/alter-share.md)
* [DROP SHARE](../sql-reference/sql/drop-share.md)
* [DESCRIBE SHARE](../sql-reference/sql/desc-share.md) — describes all the objects in a share
* [SHOW SHARES](../sql-reference/sql/show-shares.md) — lists all shares, as well as the consumer accounts specified for each share

#### Step 1: Create the empty share

The following example creates an empty share named `sales_s`:

> ```sqlexample
> CREATE SHARE sales_s;
> ```

#### Step 2: Grant privileges for a database and objects to the share

Add objects (database, schema, tables, secure views, etc.) to the share. You can choose to either add privileges on these objects
to a share via a database role, or grant privileges on the objects directly to the share. For more information on these options, see
[How to share database objects](data-sharing-gs.md).

Option 1:
:   The following example illustrates creating a database role, granting privileges on the following objects to the database role, and then
    granting the database role to the `sales_s` share created in the previous step:

    > * `sales_db` (database)
    > * `aggregates_eula` (schema)
    > * `aggregate_1` (table)
    >
    > ```sqlexample
    > CREATE DATABASE ROLE sales_db.dr1;
    >
    > GRANT USAGE ON DATABASE sales_db TO DATABASE ROLE sales_db.dr1;
    >
    > GRANT USAGE ON SCHEMA sales_db.aggregates_eula TO DATABASE ROLE sales_db.dr1;
    >
    > GRANT SELECT ON TABLE sales_db.aggregates_eula.aggregate_1 TO DATABASE ROLE sales_db.dr1;
    >
    > GRANT USAGE ON DATABASE sales_db TO SHARE sales_s;
    >
    > GRANT DATABASE ROLE sales_db.dr1 TO SHARE sales_s;
    > ```

Option 2:
:   To include objects in the share, grant privileges on each object. When granting privileges, first grant usage on any container
    objects before granting usage on the objects in the container. For example, grant usage on a database before granting usage on any
    schemas contained in the database.

    > **Note:**
    >
    > Perform this task before adding accounts to the share. Attempting to add an account before granting usage on a
    > database results in an error.

    The following example illustrates granting privileges on the following objects to the `sales_s` share created in the previous step:

    > * `sales_db` (database)
    > * `aggregates_eula` (schema)
    > * `aggregate_1` (table)
    >
    > ```sqlexample
    > GRANT USAGE ON DATABASE sales_db TO SHARE sales_s;
    >
    > GRANT USAGE ON SCHEMA sales_db.aggregates_eula TO SHARE sales_s;
    >
    > GRANT SELECT ON TABLE sales_db.aggregates_eula.aggregate_1 TO SHARE sales_s;
    > ```

To confirm the contents of the share:

> ```sqlexample
> SHOW GRANTS TO SHARE sales_s;
>
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> | created_on                    | privilege | granted_on | name                                 | granted_to | grantee_name   | grant_option | granted_by   |
> |-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------|
> | 2017-06-15 16:45:07.307 -0700 | USAGE     | DATABASE   | SALES_DB                             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:10.310 -0700 | USAGE     | SCHEMA     | SALES_DB.AGGREGATES_EULA             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:12.312 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> ```

This ensures that the share is correctly configured before making it available to other accounts to consume.

#### Step 3: Add accounts to the share

> **Attention:**
>
> If you have a Business Critical account and are sharing data with consumer accounts:
>
> * Snowflake supports sharing sensitive data with non-Business Critical accounts (disabled by default), but does not
>   encourage doing so.
> * To ensure compliance with HIPAA and HITRUST requirements, Snowflake does not allow HIPAA accounts to share data
>   with non-HIPAA accounts.
>
> * If you are using Tri-Secret Secure, Snowflake treats data access from consumer accounts as if the access occurred from within your
>   own account.

The following example adds two accounts to the `sales_s` share:

> ```sqlexample
> ALTER SHARE sales_s ADD ACCOUNTS=xy12345, yz23456;
> ```

Accounts `xy12345` and `yz23456` are now able to see the share and create a database from it.

> > **Note:**
> >
> > When adding accounts to a share, if the accounts do not exist, the command completes successfully,
> > but no updates are made to the share. To ensure the share is properly updated, verify that the accounts
> > exist and you’ve entered the names correctly.

Use [SHOW SHARES](../sql-reference/sql/show-shares.md) to confirm the share. The output of the command lists the `sales_s` share.
The `kind` column indicates that the share is OUTBOUND, meaning this share is sharing a database with other
Snowflake accounts. The `to` column lists all accounts to which the share has been made available:

> ```sqlexample
> SHOW SHARES;
> ```
>
> ```output
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> | created_on                    | kind     | owner_account        | name          | database_name         | to               | owner        | comment                                | listing_global_name |
> |-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------|---------------------|
> | 2017-07-09 19:18:09.821 -0700 | INBOUND  | SNOW.XY12345         | SALES_S2      | UPDATED_SALES_DB      |                  |              | Transformed and updated sales data     |                     |
> | 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT | SALES_S       | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |                     |
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> ```

## Maintaining shares

You must use a role with the OWNERSHIP privilege on a share and the CREATE SHARE global privilege to manage shares.

### Adding objects to a share

You can add objects to an existing share at any time. Objects that you add to a share are instantly available to the consumer
accounts that have created databases from the share. For example, if you add a table to a share, users in consumer accounts can query the
data in the table as soon as the table is added to the share.

> **Important:**
>
> If you plan to securely share data with data consumers across different [regions](intro-regions.md) or
> [cloud platforms](intro-cloud-platforms.md), note that replicating a primary database is blocked if the database
> contains some types of objects. For a full list of objects that cause refresh operations to fail, see
> [Current limitations of replication](account-replication-intro.md).

#### Using Snowsight to add objects to a share

To modify the data associated with a share using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Locate and select the share you want to modify.
5. In the Data section, select Edit.
6. Select the data that you want to add.
7. Select Done.

> **Note:**
>
> The web interface does not currently support adding or removing external tables, secure materialized views, or secure UDFs to/from
> shares. All management of these objects in shares must be performed using SQL.
>
> You cannot add a secure view that references objects in other databases to a share using the web interface. You must
> create your share using SQL. See [Share data from multiple databases](data-sharing-multiple-db.md).

#### Using SQL to add objects to a share

Use the [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) command.

> **Note:**
>
> * If the schema for the object is already in the share, you only need to add the object.
> * If the schema for the object is not already in the share, you need to first add the schema and then the object.

The following example adds a secure view named `agg_secure` in the `aggregates_eula` schema to the `sales_s` share:

> ```sqlexample
> SHOW GRANTS TO SHARE sales_s;
>
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> | created_on                    | privilege | granted_on | name                                 | granted_to | grantee_name   | grant_option | granted_by   |
> |-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------|
> | 2017-06-15 16:45:07.307 -0700 | USAGE     | DATABASE   | SALES_DB                             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:10.310 -0700 | USAGE     | SCHEMA     | SALES_DB.AGGREGATES_EULA             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:12.312 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
>
> GRANT SELECT ON VIEW sales_db.aggregates_eula.agg_secure TO SHARE sales_s;
>
> SHOW GRANTS TO SHARE sales_s;
>
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> | created_on                    | privilege | granted_on | name                                 | granted_to | grantee_name   | grant_option | granted_by   |
> |-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------|
> | 2017-06-15 16:45:07.307 -0700 | USAGE     | DATABASE   | SALES_DB                             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:10.310 -0700 | USAGE     | SCHEMA     | SALES_DB.AGGREGATES_EULA             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:12.312 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-17 12:33:15.310 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGG_SECURE  | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> ```

### Removing objects from a share

You can remove objects from an existing share at any time.
Any objects that you remove from a share are instantly unavailable to the consumers accounts who have created databases from the share.

For example, if you remove a table from a share, users in consumer accounts can no longer query the data in the table as soon as the table
is removed from the share.

> **Note:**
>
> The web interface does not currently support adding or removing external tables, secure materialized views, or
> secure UDFs to/from shares. All management of these objects in shares must be performed using SQL.

#### Using Snowsight to remove objects from a share

To remove the data associated with a share using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Locate and select the share you want to modify.
5. In the Data section, select Edit.
6. Select the data in the share and deselect the checkboxes for the data that you want to remove from the share.
7. Select Done.

#### Using SQL to remove objects from a share

Remove objects from an existing share at any time using the [REVOKE <privilege> … FROM SHARE](../sql-reference/sql/revoke-privilege-share.md) command.

The following example removes the secure view named `agg_secure` in the `aggregates_eula` schema from the `sales_s` share:

> ```sqlexample
> SHOW GRANTS TO SHARE sales_s;
>
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> | created_on                    | privilege | granted_on | name                                 | granted_to | grantee_name   | grant_option | granted_by   |
> |-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------|
> | 2017-06-15 16:45:07.307 -0700 | USAGE     | DATABASE   | SALES_DB                             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:10.310 -0700 | USAGE     | SCHEMA     | SALES_DB.AGGREGATES_EULA             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:12.312 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-17 12:33:15.310 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGG_SECURE  | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
>
> REVOKE SELECT ON VIEW sales_db.aggregates_eula.agg_secure FROM SHARE sales_s;
>
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> | created_on                    | privilege | granted_on | name                                 | granted_to | grantee_name   | grant_option | granted_by   |
> |-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------|
> | 2017-06-15 16:45:07.307 -0700 | USAGE     | DATABASE   | SALES_DB                             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:10.310 -0700 | USAGE     | SCHEMA     | SALES_DB.AGGREGATES_EULA             | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> | 2017-06-15 16:45:12.312 -0700 | SELECT    | TABLE      | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | SHARE      | PRVDR1.SALES_S | false        | ACCOUNTADMIN |
> +-------------------------------+-----------+------------+--------------------------------------+------------+----------------+--------------+--------------+
> ```

### Adding accounts to a share

You can add accounts to an existing share at any time. After an account is added to the share, the share is immediately “visible”
to the account and the account can create a database from the share and start querying the Snowflake objects in the database.

#### Using Snowsight to add accounts to a share

To add consumers to an existing share using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Locate the share you want to modify.
5. In the Shared With section, select Add Consumers.
6. For Share With Snowflake Accounts, enter one or more account locators. Entering part of an account locator lists all accounts that match.
7. Select Add.

#### Using SQL to add accounts to a share

To add consumers to an existing share using SQL, use the [ALTER SHARE](../sql-reference/sql/alter-share.md) command.

### Removing accounts from a share

You can remove accounts from an existing share at any time. Removing an account from a share instantly invalidates the database they
created from the share. All queries and other operations that users in the account perform on the database will no longer work.

After removing an account from a share, you can add it back again to the share; however, this does not restore the database
they created earlier from the share. They must create a new database from the share.

> **Note:**
>
> Before removing an account from a share, consider the downstream impact it will have on the account.
> Because the database is instantly invalidated, all queries and other operations that users (in the account)
> perform on the database will stop working, which could have a significant impact on the business operations of the account.

#### Using Snowsight to remove accounts from a share

To remove consumers from an existing share using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Locate and select the share you want to modify.
5. In the Shared With section, select … » Remove.
6. In the confirmation dialog, select Remove.

#### Using SQL to remove accounts from a share

Remove accounts from an existing share using the [ALTER SHARE](../sql-reference/sql/alter-share.md) command.

You remove an account from a share by setting a new list of accounts for the share and leaving the desired account off the list.

### Dropping a share

You can drop (remove) a share at any time. Dropping a share instantly invalidates all databases created from the share by consumer accounts.
All queries and other operations performed on these databases no longer work.

After dropping a share, you can recreate it with the same name; however, this does not restore any of the databases created from the share
by consumer accounts. The recreated share is treated as a new share and all consumer accounts must create a new database from the new share.

> **Note:**
>
> Before dropping a share, consider the downstream impact it will have on all consumer accounts using the share.
>
> Instead, you might want to consider removing individual objects from the share. Removed objects can be added back to a share without
> requiring any additional tasks on the part of the consumer accounts.

#### Using Snowsight to drop a share

To drop a share using Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.
4. Locate and select the share you want to drop.
5. Select … » Drop.
6. In the confirmation dialog, select Drop.

#### Using SQL to drop a share

Drop a share using the [DROP SHARE](../sql-reference/sql/drop-share.md) command.

## Viewing consumers who have created databases from shares

To see the accounts that have created databases from a share, use the SHOW GRANTS OF SHARE command. The output from this command is
different from the list of accounts returned by SHOW SHARES in the following ways:

* SHOW SHARES lists all shares that are available to accounts, as well as the accounts that are able to access each share.
* SHOW GRANTS OF SHARE lists all accounts that have created a database from the share. If no accounts have created a database from the
  share, the results are empty.

For example, the following example shows:

> * Two shares, `sales_s` and `sales_s2` have been made available to accounts `xy12345` and `yz23456`
>   by the owner account `SNOW.PRVDR1`.
> * Account `xy12345` has created a database from the `prvdr1.sales_s` share.
> * No accounts have created databases from the `sales_s2` share.
>
> ```sqlexample
> SHOW SHARES;
> ```
>
> ```output
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> | created_on                    | kind     | owner_account        | name          | database_name         | to               | owner        | comment                                | listing_global_name |
> |-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------|---------------------|
> | 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.PRVDR1          | SALES_S       | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |
> | 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.PRVDR1          | SALES_S2      | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |                     |
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> ```
>
> ```sqlexample
> SHOW GRANTS OF SHARE sales_s;
> ```
>
> ```output
> +-------------------------------+----------------+------------+----------+
> | created_on                    | share          | granted_to | account  |
> |-------------------------------+----------------+------------+----------|
> | 2017-06-15 18:00:03.803 -0700 | PRVDR1.SALES_S | ACCOUNT    | XY12345  |
> +-------------------------------+----------------+------------+----------+
> ```
>
> ```sqlexample
> SHOW GRANTS OF SHARE sales_s2;
> ```
>
> ```output
> +------------+-------+------------+---------+
> | created_on | share | granted_to | account |
> |------------+-------+------------+---------|
> +------------+-------+------------+---------+
> ```

## Viewing shares and data

Using Snowsight, you can view data that was shared by your account using a listing, a direct share, or as part of a data exchange.

To view the data shared by your account, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Shared by your account tab.

On this page, you can do the following:

* View the shares that you have created or have access privileges to. This includes information such as the database for the share,
  the consumer accounts, if any, added to the share, creation date of the share, and the shared objects.
* Explore shares associated with listings offered specifically to certain consumers or available to any consumer on the Snowflake Marketplace.
* Access shares that are shared within private data exchanges.

You can use the following filters to selectively display shared data:

* Filter by type with the All Types drop-down list. Choose to display only secure shares or listings shared within a data exchange.
  Some secure shares are shares associated with listings.
* Filter by consumer account or data exchange with the Shared With drop-down list. Select one or more specific consumers or data
  exchanges to see all shares or listings associated with your selection or selections.

## Managing shares and data

Select a share to manage the share, revoke access for individual consumer accounts, or add a description to the share.
To manage secure shares that are offered as listings, or to manage your listings on the Snowflake Marketplace, use Provider Studio.

---
title: Create and manage offers
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/providers-create-manage-offers.md
section: User Guide
---

# Create and manage offers

Providers can create both standard and private offers.

* Standard offers are displayed in the pricing section of a listing. With standard offers, providers can allow self-serve purchases or send consumers to their sales team using a Contact sales option.
* Private offers are only visible to targeted consumers. With private offers, providers can offer custom discounts and terms.

## Prerequisites

* A provider profile. For more information, see [Set up a provider profile](../../../../collaboration/provider-becoming.md).
* A published listing. For more information, see [Create a new listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).
* An account that allows payment for listings. For more information, see [Set up Stripe to get paid for listings](../../../../collaboration/provider-becoming.md).

## Required privileges

* You must use the ACCOUNTADMIN role or a role that has been granted the provider privileges. For more information, see [Privileges required for working with listings](../../../../collaboration/provider-becoming.md).

## Working with offers

The Offers tab shows a list of both standard and private offers that are available for a listing. The table includes the following information about each offer:

The following details are available for Standard offers:

> * Offer name
> * Status (Draft, Active, Retired)
> * Type (Self-serve or Sales-led)
> * Last updated date
> * Display order

The following details are available for Private offers:

* Offer name
* Status (Draft, Active, Withdrawn, Expired)
* Expiration date
* Target consumer
* Pricing planTerms
* Last updated date

Each row in an Offers table includes an action button  that you can select to view additional options for managing the offer.

Standard offers provide the following actions:

* View details
* Edit offer
* Display order (not available for Retired or Draft offers)
* Retire offer (not available for Retired or Draft offers)

Private offers provide the following actions:

* Copy offer URL
* View details
* Edit offer (not available for Active offers)
* Withdraw offer (not available for Expired offers)

## Limitations

When providers include a discount in an offer, the discount isn’t automatically applied to the [SYSTEM$CREATE_BILLING_EVENT](../../../../sql-reference/functions/system_create_billing_event.md) charges.

1. To apply the discount, store the discounted price in your app.

   You can also run the [SHOW OFFERS](../../../../sql-reference/sql/show-offers.md) command to retrieve the discount amount that’s included in an offer.
2. After retrieving the discount, emit the final dollar amount in the [SYSTEM$CREATE_BILLING_EVENT](../../../../sql-reference/functions/system_create_billing_event.md) call.

   Do not send the undiscounted price.

For more information about creating billing events, see [Billable event examples](../../../../developer-guide/native-apps/adding-custom-event-billing.md).

## Create a standard offer

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. On the Offers tab, Standard offers is selected by default. Click + Create offer.
6. In the Offer details dialog, specify details for the offer.

   1. Select Standard offer.
   2. Select the Purchase type:

      * Select Self-serve to allow consumers to see the price and purchase the listing directly
      * Select Sales-led to require consumers to contact you to purchase the listing.
   3. Specify a name for the offer.
   4. Select Next.
7. In the Billing and payments dialog, select a pricing plan to attach to this offer.
8. (Optional) Specify whether to require consumers to include a credit card on file to purchase the listing.
9. Select Next.
10. In the Description dialog, enter information about the offer that users will see.

    1. Specify an offer name to display to consumers.
    2. Specify the price to display to consumers.
    3. (Optional) Specify a tagline to display to consumers.
    4. Specify the text for the button that consumers click to purchase the listing.
    5. (Optional) Specify any value propositions for the offer.
11. Select Next.
12. Review the offer summary, then click Create offer.

1. Create an [offer manifest reference](offer-manifest-reference.md) named PRICING_PLAN_1_DEFAULT_OFFER.

   > **Note:**
   >
   > The offer name must be uppercase.

   ```yaml
   access_start_date_preference: SPECIFIC_DATE
   comment: An internal note
   contract_value: 120.12
   contract_type: LIMITED_TIME
   contract_duration_months: 12
   discount: 0.0
   invoice_start_date_preference: SPECIFIC_DATE
   invoice_start_time: 1731102884579
   is_default: false
   display_name: Display name of the offer
   expiration_time: 1762638884579
   payment_terms:
     payment_type: FULL
   pricing_plan_name: PRICING_PLAN_1
   access_end_time: 1762638884579
   access_start_time: 1731102884579
   state: PUBLISHED
   terms_of_service:
     type: DEFAULT
   ```
2. Create a [listing manifest reference](../../../../progaccess/listing-manifest-reference.md) that includes the offer.

   ```yaml
   title: my_listing
   subtitle: Subtitle for my_listing
   description: Description for my_listing
   listing_terms:
     type: OFFLINE
   targets:
     regions: PUBLIC.AWS_US_EAST_1
   usage_examples:
     - title: this is a test sql
       description: Simple example
       query: select *
   offers:
     - name: PRICING_PLAN_1_DEFAULT_OFFER
       type: FILE
       path: offers/PRICING_PLAN_1_DEFAULT_OFFER.yaml
   ```
3. Stage the offer and listing manifest reference files.

   ```sqlexample
   PUT file:///local/path/to/PRICING_PLAN_1_DEFAULT_OFFER.yaml @DB.SCHEMA.STAGE/offers/PRICING_PLAN_1_DEFAULT_OFFER
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;

   PUT file:///local/path/to/manifest.yaml @DB.SCHEMA.STAGE/listings/my_manifest
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```
4. Create a listing that uses the manifest files uploaded to the stage.

   ```sqlexample
   CREATE EXTERNAL LISTING my_listing
     FROM @DB.SCHEMA.STAGE/listings/my_manifest
     REVIEW = TRUE
     PUBLISH = FALSE;
   ```

## Create a private offer based on a pricing plan

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. On the Offers tab, select Private offers, then click + Create offer.
6. In the Create private offer pane, perform the following steps to create a private offer:

   1. On the Offer details page, enter the following information:

      1. Select Private offer as the offer type.
      2. Specify the [data sharing account identifier](../../../admin-account-identifier.md) for the consumer that will receive this offer.
      3. Specify a name and expiration date for the offer.
      4. Select Next.
   2. On the Billings and payments page, select an existing pricing plan` and enter the following information.

      1. Review the negotiated price details in the Plan components table.

         Optional: Hover on a row in the Plan components table and select the Edit icon to modify the component details.

         * For usage-based plans, you can edit the monthly access fee or apply a discount. You can also edit the price per query and the monthly limit.
         * For flat-fee plans, you can edit the access fee price or apply a discount percentage.
      2. Specify the contract type:

         * Limited-time: This grants access for a fixed period of time, such as 30 days. Consumers can be charged upfront or in installments
         * Recurring (Subscription): This grants continuous access. Consumers are billed at the chosen frequency for the contract duration, and the subscription auto-renews until the consumer [cancels](consumers-manage-offers.md) the purchase.
         > **Note:**
         >
         > You can’t specify a contract type for usage-based pricing plans.
      > 1. Enter a contract duration to indicate the length of time that the offer is valid for.
      >
      >    For flat-fee plans, specifying a contract duration will auto-fill the total contract value based on the pricing plan details.
      > 2. Specify payment options for the offer:
      >
      >    * Require full payment upfront: The consumer pays the total contract value (TCV) at the start of the contract.
      >    * Accept installments: Allow the consumer to pay in equal monthly installments or specify custom installment amounts.
      >
      >      If you select the Accept installments option, you can specify the number of installments and the installment amount.
      >    > **Note:**
      >    >
      >    > You can’t specify payment options for usage-based pricing plans.
      > 3. Specify the first invoice date.
      >
      >    The first invoice date is the date when the consumer will be billed for the first time.
      > 4. Select whether to require a credit card to be on file.
      > 5. Select Next.
   3. On the Access and terms page, specify the access start date and the terms of service for the offer.

      The access start date is the date when the consumer can start using the product. You can set this to When offer accepted to allow the consumer to start using the product immediately after accepting the offer, or you can configure a specific start date.
   4. Select Next.
   5. On the Summary page, review the offer details, and then select Done.

Upon completion, the offer appears on the Private offers tab. The initial status will show Active, indicating that the offer is ready to be accepted by the consumer. The consumer can either accept or reject the offer, and the status will be updated accordingly.

1. Create an [offer manifest reference](offer-manifest-reference.md) named PRIVATE_OFFER_PRICING_PLAN.

   > **Note:**
   >
   > The offer name must be uppercase.

   ```yaml
   version: V2
   access_start_date_preference: SPECIFIC_DATE
   comment: Private offer for specific consumer
   contract_type: LIMITED_TIME
   contract_duration_months: 12
   discount: 10.0
   invoice_start_date_preference: SPECIFIC_DATE
   invoice_start_time: 1731102884579
   is_default: false
   display_name: Private Offer Display Name
   expiration_time: 1762638884579
   payment_terms:
     payment_type: FULL
   pricing_plan_details:
     type: DEFAULT
     name: PRICING_PLAN_1
   access_end_time: 1762638884579
   access_start_time: 1731102884579
   state: PUBLISHED
   target_consumer: ORGANIZATION_NAME.ACCOUNT_NAME
   terms_of_service:
     type: DEFAULT
   ```
2. Create a [listing manifest reference](../../../../progaccess/listing-manifest-reference.md) that includes the private offer.

   ```yaml
   title: my_listing
   subtitle: Subtitle for my_listing
   description: Description for my_listing
   listing_terms:
     type: OFFLINE
   targets:
     regions: PUBLIC.AWS_US_EAST_1
   usage_examples:
     - title: this is a test sql
       description: Simple example
       query: select *
   offers:
     - name: PRIVATE_OFFER_PRICING_PLAN
       type: FILE
       path: offers/PRIVATE_OFFER_PRICING_PLAN.yaml
   ```
3. Stage the offer and listing manifest reference files.

   ```sqlexample
   PUT file:///local/path/to/PRIVATE_OFFER_PRICING_PLAN.yaml @DB.SCHEMA.STAGE/offers/PRIVATE_OFFER_PRICING_PLAN
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;

   PUT file:///local/path/to/manifest.yaml @DB.SCHEMA.STAGE/listings/my_manifest
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```
4. Create a listing that uses the manifest files uploaded to the stage.

   ```sqlexample
   CREATE EXTERNAL LISTING my_listing
     FROM @DB.SCHEMA.STAGE/listings/my_manifest
     REVIEW = TRUE
     PUBLISH = FALSE;
   ```

## Create a one-time pricing offer

One-time pricing plans allow providers to create a private offer that isn’t tied to a pricing plan. When you extend a one-time pricing offer to a consumer, the consumer is charged a single, upfront fee for access to the data product for a specified duration.

The steps below describe how to create a one-time pricing offer.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. On the Offers tab, select Private offers, then click + Create offer.
6. In the Create private offer pane, run the following steps to create a private offer:

   1. On the Offer details page, enter the following information:

      1. Select Private offer as the offer type.
      2. Specify the [data sharing account identifier](../../../admin-account-identifier.md) for the consumer that will receive this offer.
      3. Specify a name and expiration date for the offer.
      4. Select Next.
   2. On the Billings and payments page, select Apple one-time pricing and enter the following information.

      1. Specify the total contract value.
      2. Specify the contract type:

         * Limited-time: This grants access for a fixed period of time, such as 30 days. Consumers can be charged upfront or in installments
         * Recurring (Subscription): This grants continuous access. Consumers are billed at the chosen frequency for the contract duration, and the subscription auto-renews until the consumer [cancels](consumers-manage-offers.md) the purchase.
      3. Enter a contract duration to indicate the length of time that the offer is valid for.
      4. Specify payment options for the offer:

         * Require full payment upfront: The consumer pays the total contract value (TCV) at the start of the contract.
         * Accept installments: Allow the consumer to pay in equal monthly installments or specify custom installment amounts.

           If you select the Accept installments option, you can specify the number of installments and the installment amount.
      5. Specify the first invoice date.

         The first invoice date is the date when the consumer will be billed for the first time.
      6. Select whether to require a credit card to be on file.
      7. Select Next.
   3. On the Access and terms page, specify the access start date and the terms of service for the offer.

      The access start date is the date when the consumer can start using the product. You can set this to When offer accepted to allow the consumer to start using the product immediately after accepting the offer, or you can configure a specific start date.
   4. Select Next.
   5. On the Summary page, review the offer details, and then select Done.

1. Create an [offer manifest reference](offer-manifest-reference.md) named ONE_TIME_PRICING_OFFER.

   > **Note:**
   >
   > The offer name must be uppercase.

   ```yaml
   version: V2
   access_start_date_preference: SPECIFIC_DATE
   comment: One-time pricing offer for specific consumer
   contract_type: LIMITED_TIME
   contract_duration_months: 12
   contract_value: 5000.00
   invoice_start_date_preference: SPECIFIC_DATE
   invoice_start_time: 1731102884579
   is_default: false
   display_name: One-Time Pricing Offer
   expiration_time: 1762638884579
   payment_terms:
     payment_type: FULL
   access_end_time: 1762638884579
   access_start_time: 1731102884579
   state: PUBLISHED
   target_consumer: ORGANIZATION_NAME.ACCOUNT_NAME
   terms_of_service:
     type: DEFAULT
   ```
2. Create a [listing manifest reference](../../../../progaccess/listing-manifest-reference.md) that includes the one-time pricing offer.

   ```yaml
   title: my_listing
   subtitle: Subtitle for my_listing
   description: Description for my_listing
   listing_terms:
     type: OFFLINE
   targets:
     regions: PUBLIC.AWS_US_EAST_1
   usage_examples:
     - title: this is a test sql
       description: Simple example
       query: select *
   offers:
     - name: ONE_TIME_PRICING_OFFER
       type: FILE
       path: offers/ONE_TIME_PRICING_OFFER.yaml
   ```
3. Stage the offer and listing manifest reference files.

   ```sqlexample
   PUT file:///local/path/to/ONE_TIME_PRICING_OFFER.yaml @DB.SCHEMA.STAGE/offers/ONE_TIME_PRICING_OFFER
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;

   PUT file:///local/path/to/manifest.yaml @DB.SCHEMA.STAGE/listings/my_manifest
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```
4. Create a listing that uses the manifest files uploaded to the stage.

   ```sqlexample
   CREATE EXTERNAL LISTING my_listing
     FROM @DB.SCHEMA.STAGE/listings/my_manifest
     REVIEW = TRUE
     PUBLISH = FALSE;
   ```

## Edit a standard offer

The steps below describe how to edit a standard offer.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. On the Offers tab, select the  button for the standard offer you want to edit, and then select Edit
   offer.

1. Create a live version of your listing and download the offer manifest reference.

   ```sqlexample
   ALTER LISTING my_listing ADD LIVE VERSION FROM LAST;
   GET snow://listing/my_listing/versions/live/offers/STANDARD_OFFER.yml file:///Users/my_username/
   ```
2. Edit the offer manifest reference.
3. Upload the offer and listing manifest reference files and commit the change.

   ```sqlexample
   PUT file:///Users/my_username/STANDARD_OFFER.yaml snow://listing/my_listing/versions/live/offers AUTO_COMPRESS = false;

   ALTER LISTING my_listing COMMIT;
   ```

## Edit a private offer

The steps below describe how to edit a private offers. Only private offers that have the following status can be edited:

* DRAFT
* EXPIRED
* WITHDRAWN

> **Note:**
>
> It can take up to 10 minutes for the edits to private offers to be visible to consumers.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. On the Offers tab, select Private offers, then select the  button for the private offer you want to edit, and then select Edit.
6. Edit the private offer, and then click Edit offer.

1. Create a live version of your listing and download the offer manifest reference.
2. Edit the offer manifest reference.
3. Upload the offer and listing manifest reference files and commit the change.

   ```sqlexample
   PUT file:///Users/my_username/PRIVATE_OFFER.yaml snow://listing/my_listing/versions/live/offers AUTO_COMPRESS = false;

   ALTER LISTING my_listing COMMIT;
   ```

## View offer details

You can view details of both standard and private offers.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. Click the Private Offers tab.
6. Click the  button for the offer you want to view and then select View details.
7. Review the offer and click Close to return to the offers list.

To see details of an offer in your listing, run the [SHOW OFFERS](../../../../sql-reference/sql/show-offers.md) command.

> ```sqlexample
> SHOW OFFERS IN LISTING my_listing;
> ```

## Retire a standard offer

Retiring an active standard offer makes it unavailable for new purchases. Existing consumers who have already purchased the offer can continue to use it until their contract expires.

> **Note:**
>
> This action can’t be undone.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. Click the  button for the standard offer you want to retire and then select Retire offer.
6. A confirmation dialog appears. Click Retire offer to confirm.

## Copy a private offer URL

Copy a private offer URL and provide it to consumers so they can review and accept or decline it.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. Click the Private Offers tab.
6. Click the  button for the private offer you want to view and then select Copy URL.

## Withdraw a private offer

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select a paid listing in the list.
5. Click the Private Offers tab.
6. Click the  button for private offer you want to withdraw and then select Withdraw.
7. Click Withdraw offer.

---
title: Create and manage organization profiles
source: https://docs.snowflake.com/en/user-guide/collaboration/organization-profiles/org-profiles-create-manage.md
section: User Guide
---

# Create and manage organization profiles

Organization profiles allow providers to organize their Internal Marketplace listings by department.
For example, individual organization profiles can be created for sales, marketing, and human resources.
This allows providers to identify and brand organizational listings that are specific
to their organization’s business unit, and associate all organizational listings
created within their business unit with the same organization profile.

Organization profiles provide consumers with a reliable method to confirm that
the organizational listings they use come from trusted sources within their organization.
Organization profiles also allow consumers to filter and locate organizational
listings that are specific to their business unit or use case.

> **Note:**
>
> Organization profiles cannot be used outside an organization’s Internal Marketplace,
> and they are unique within an organizational data cloud. Organization profiles
> can be created and modified programmatically or via Snowsight and then assigned to
> an organizational listing.
>
> An organization account is required to create and manage organization profiles.
> To learn more about organization accounts, see [Organization accounts](../../organization-accounts.md).

## Organization profile format

An organization profile forms part of the Uniform Listing Locator (ULL). The
format of an organization profile is `ORGDATACLOUD${org_profile_name}${organizational_listing_name}`.
The ULL identifies the organization profile and its associated organizational listing.
The ULL can be used in programmatic queries similar to this example:

```sqlexample
SELECT * FROM "ORGDATACLOUD$<ProfileName>$<ListingName>.<SchemaName>.<TableName>;
```

## Access control requirements

A [role](../../security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION PROFILE | Account | Organization profiles can only be created from the organization account in an organization. The GLOBALORGADMIN role has been granted the CREATE ORGANIZATION PROFILE privilege. |

## Create an organization profile

To create an organization profile, you can use Snowsight or SQL commands.

SnowsightSQL

Create a new organization profile.

> 1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
> 2. In the navigation menu, select Data sharing » Internal sharing.
> 3. In the right pane, select the Profiles tab.
> 4. Select + Create profile.
> 5. On the Basic information page, specify the following:
>
>    * Profile title: The title for this profile.
>
>      Specifying a title generates a ULL reference name.
>    * ULL reference name: (Optional) Edit the ULL reference name.
>    * Description: Enter a description for the profile.
> 6. Select Next.
> 7. On the Access page, specify who in the organization can use the profile to publish internal listings.
>
>    * Entire organization: Anyone in the organization can use the profile.
>    * Selected accounts and roles: Only specific accounts and roles can use the profile.
>
>      1. Select one or more accounts.
>
>         By default, all roles in the selected accounts can use the profile.
>      2. (Optional) To grant access to specific roles in each account, select the All roles drop-down, then select Selected roles.
>
>         + Select one or more roles in the account that can use the profile.
> 8. Select Next.
> 9. On the Contact information page, specify email addresses for the owner of the profile and for the approver of profile access requests.
> 10. Select Next.
> 11. On the Appearance page, select an icon to use as the profile avatar and select the avatar background color.
> 12. Upon completion, select one of the following options:
>
>     * Publish: Publish the profile and make it Live on the Profiles page.
>     * Save as draft: Save the profile without publishing.
>     * Cancel: Discard the profile without saving or publishing.
>     * Previous: Return to a prior page to make changes.

To create an organization profile use the [CREATE ORGANIZATION PROFILE](../../../sql-reference/sql/create-organization-profile.md)
and execute a statement similar to:

```sqlexample
USE ROLE GLOBALORGADMIN;

CREATE ORGANIZATION PROFILE MyOrgPROFILE
AS
$$
title: "My Org Profile"
description: "An appropriate desc"
contact: "contact@test.com"
approver_contact: "approver@test.com"
allowed_publishers:
  access:
    - all_internal_accounts: true
$$ publish=True;
```

For details of organization profile manifest fields, see [Organization profile manifest reference](org-profile-manifest-reference.md).

## Assign an organization profile to a organizational listing

To assign an organization profile to a new or existing organizational listing, you can use the Snowsight or SQL commands.

SnowsightSQL

Assign an organization profile to a new listing.

> 1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
> 2. In the navigation menu, select Data sharing » Internal sharing.
> 3. Select Create Listing.
> 4. Select a data product such as a table, view, or other data product to add to the listing.
>
>    1. Review the generated share identifier, then select Generate listing.
> 5. Enter a name for your listing.
> 6. Select the Select Profile drop-down.
> 7. Select an organization profile in the Profile list.
> 8. Complete the organizational listing setup. See [Create an organizational listing](../listings/organizational/org-listing-create.md).

Assign an organization profile to an existing draft listing.

> **Note:**
>
> You can only assign an organization profile to a listing that is in draft status.
> If the organizational listing has been published, an organization profile cannot be assigned or changed.

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. On the Listings tab, select the listing that you want to edit.
4. Select the Select profile drop-down, and select a profile for the listing.

> You can update an existing listing to use a different organization profile using the
> [ALTER ORGANIZATION PROFILE](../../../sql-reference/sql/alter-organization-profile.md) command and executing a command similar to:
>
> Note the value of the `organization_profile` field in the manifest YAML
> which specifies the organization profile associated with the listing.

```sqlexample
USE ROLE GLOBALORGADMIN;

ALTER LISTING MyLISTING
AS $$
title: "my listings title"
description: "Listing updated for new org profile"
auto_fulfillment:
   refresh_type: "SUB_DATABASE"
   refresh_schedule: "10 MINUTE"
organization_profile: "MyOrgPROFILE"
organization_targets:
access:
   - all_internal_accounts: true
locations:
access_regions:
    - name: "ALL"
$$;
```

For details of organization profile manifest fields, see [Organization profile manifest reference](org-profile-manifest-reference.md).

## Modify an existing organizational listing profile

By default, the contact support email defined in the organization profile appears on the organizational listing landing page.
You can specify a custom support email address or URL when the original email address changes.

To assign an organization profile to a new or existing organizational listing, you can use the Snowsight or SQL commands.

SnowsightSQL

To modify listing support contact email address:

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. In the right pane, select the Listings tab.
4. Select an organizational listing in the list.
5. In the Details section, select Edit next to the support contact email address.
6. Select Use custom email or URL in the Profile list.
7. Enter an email address or a URL.
8. Select Save.

To alter an existing organization profile use the [ALTER ORGANIZATION PROFILE](../../../sql-reference/sql/alter-organization-profile.md) and execute a statement similar to:

Unlike Snowsight, SQL commands can be used to alter many of the fields in an organization profile, including the contact email address.

```sqlexample
USE ROLE GLOBALORGADMIN;

ALTER ORGANIZATION PROFILE MyOrgPROFILE
AS
$$
title: "New Title"
description: "New desc"
contact: "contact@test.com"
approver_contact: "approver@test.com"
allowed_publishers:
  access:
   - all_internal_accounts: true
logo: "urn:emoji:smile"
$$
```

For details of organization profile manifest fields, see [Organization profile manifest reference](org-profile-manifest-reference.md).

## View organization profiles

SnowsightSQL

1. Sign in to [Snowsight](../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Browse the available profiles or use the search bar to find a specific organization listing and examine its profile.

Use [SHOW AVAILABLE ORGANIZATION PROFILES](../../../sql-reference/sql/show-available-organization-profiles.md) to find organization profiles which are available to
you.

```sqlexample
SHOW AVAILABLE ORGANIZATION PROFILES;
```

---
title: Create and manage pricing plans
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/providers-create-manage-pricing-plans.md
section: User Guide
---

# Create and manage pricing plans

## Prerequisites

* A provider profile. See [Set up a provider profile](../../../../collaboration/provider-becoming.md).
* A published listing. See [Create a new listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).
* An account that allows payment for listings. See [Set up Stripe to get paid for listings](../../../../collaboration/provider-becoming.md).

## Required privileges

* You must use the ACCOUNTADMIN role or a role that has been granted the provider privileges. See [Privileges required for working with listings](https://other-docs.snowflake.com/collaboration/provider-becoming#label-permissions-required-for-working-with-listings-and-shares).

## Create a pricing plan

Follow the steps below to create a new listing with a pricing plan.

SnowsightSQL

1. Follow the steps for [sharing data on the Snowflake Marketplace](../../../../collaboration/provider-listings-creating-publishing.md).
2. After you add a data product to your share, in the Access type dropdown, select Paid listing.

   A Pricing section and a Trial (optional) section are added to the listing page.
3. In the Pricing section, select Add pricing plans.

   The Create pricing plan page opens.
4. On the Settings page, specify a name for the plan, then select Next.

   You can optionally specify a product SKU for the pricing plan.
5. On the Pricing details page, select a pricing model for the plan:

   * If you select Flat-fee, specify the access fee price and the billing frequency (monthly or annually) for the plan.
   * If you select Usage-based, specify the monthly access fee, the price per query, and the maximum monthly charge.
6. Select Next.
7. Review the pricing plan summary, and then select Done.
8. Optional: To add another pricing plan, select Add pricing plan, and then repeat the previous steps.
9. Select Submit for approval » Publish once approved to publish the listing. Only published listings can be offered to consumers.

If you want to create additional pricing plans for a specific listing, select the listing, select the Pricing plans tab, and then select + Create pricing plan.

1. Create a [pricing plan manifest reference](pricing-plan-manifest-reference.md) named PRICING_PLAN_1.

   > **Note:**
   >
   > The pricing plan name must be uppercase.

   ```yaml
   display_name: Default pricing plan display name
   currency: USD
   pricing_model: FLAT_FEE
   base_fee: 100.0
   billing_duration_months: 1
   sales_motion: SELF_SERVE
   comment: Comment for the pricing plan
   metadata:
     description: Pricing plan description
     price: $100 / unit
     button_text: Buy Now
     value_propositions:
       - val 1
       - val 2
     visibility: VISIBLE
     contract_type: LIMITED_TIME
     contract_duration_months: 12
     state: PUBLISHED
   ```
2. Create a [listing manifest reference](../../../../progaccess/listing-manifest-reference.md) that includes the pricing plan.

   ```yaml
   title: my_listing
   subtitle: Subtitle for my_listing
   description: Description for my_listing
   listing_terms:
     type: OFFLINE
   targets:
     regions: PUBLIC.AWS_US_EAST_1
   usage_examples:
     - title: this is a test sql
       description: Simple example
       query: select *
   pricing_plans:
     - name: PRICING_PLAN_1
       type: FILE
       path: pricingPlans/PRICING_PLAN_1.yaml
   ```
3. Stage the pricing plan and listing manifest reference files.

   ```sqlexample
   PUT file:///local/path/to/PRICING_PLAN_1.yaml @DB.SCHEMA.STAGE/pricingPlans/PRICING_PLAN_1
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;

   PUT file:///local/path/to/manifest.yaml @DB.SCHEMA.STAGE/listings/my_manifest
     SOURCE_COMPRESSION=NONE AUTO_COMPRESS=FALSE OVERWRITE=TRUE;
   ```
4. Create a listing that uses the manifest files uploaded to the stage.

   ```sqlexample
   CREATE EXTERNAL LISTING my_listing
     FROM @DB.SCHEMA.STAGE/listings/my_manifest
     REVIEW = TRUE
     PUBLISH = FALSE;
   ```

## Add a pricing plan to a paid listing

The steps below add a pricing plan to an existing listing.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, click the Listings tab.
4. Select the paid listing that you want to attach a pricing plan to.

   The Pricing plans tab for the listing opens.
5. Click + Create pricing plan.
6. On the Settings page, specify a name for the plan, then select Next.

   You can optionally specify a product SKU for the pricing plan.
7. Click Next.
8. On the Pricing details page, select a pricing model for the plan:

   * If you select Flat-fee, specify the access fee price and the billing frequency (monthly or annually) for the plan.
   * If you select Usage-based, specify the monthly access fee, the price per query, and the maximum monthly charge.
9. Select Next.
10. Review the pricing plan summary, and then select Done.
11. Optional: To add another pricing plan, select Add pricing plan, and then repeat the previous steps.

1. Create a pricing plan manifest reference file and save it as PRICING_PLAN_1.yaml.

   ```yaml
   display_name: Default pricing plan display name
   currency: USD
   pricing_model: FLAT_FEE
   base_fee: 100.0
   billing_duration_months: 1
   sales_motion: SELF_SERVE
   comment: Comment for the pricing plan
   metadata:
     description: Pricing plan description
     price: $100 / unit
     button_text: Buy Now
     value_propositions:
       - val 1
       - val 2
     visibility: VISIBLE
     contract_type: LIMITED_TIME
     contract_duration_months: 12
     state: PUBLISHED
   ```
2. Create a live version of your listing and download the listing manifest reference.

   ```sqlexample
   ALTER LISTING my_listing ADD LIVE VERSION FROM LAST;
   GET snow://listing/my_listing/versions/live/manifest.yml file:///Users/my_username/
   ```
3. Add the pricing plan to the listing manifest reference.

   > **Note:**
   >
   > The pricing plan name must be uppercase.

   ```yaml
   pricing_plans:
     - name: PRICING_PLAN_1
       type: FILE
       path: pricingPlans/PRICING_PLAN_1.yaml
   ```
4. Upload the pricing plan and listing manifest reference files and commit the change.

   ```sqlexample
   PUT file:///Users/my_username/PRICING_PLAN_1.yaml snow://listing/my_listing/versions/live/pricingPlans AUTO_COMPRESS = false;

   PUT file:///Users/my_username/manifest.yml snow://listing/my_listing/versions/live AUTO_COMPRESS = false;

   ALTER LISTING my_listing COMMIT;
   ```
5. To see the pricing plan in your listing, run the [SHOW PRICING PLANS](../../../../sql-reference/sql/show-pricing-plans.md) command.

   ```sqlexample
   SHOW PRICING PLANS IN LISTING my_listing;
   ```

## Edit a pricing plan

To edit an existing pricing plan, follow the steps below:

> **Note:**
>
> It can take up to 10 minutes for pricing plan edits to be visible to consumers.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, select the Listings tab.
4. On the Listings page, select a paid listing.
5. Select the Pricing plans tab.
6. Select the  button for the pricing plan you want to edit, and then select Edit plan.
7. Edit the pricing plan, and then click Done.

1. Create a live version of your listing and download the pricing plan manifest reference.

   ```sqlexample
   ALTER LISTING my_listing ADD LIVE VERSION FROM LAST;
   GET snow://listing/my_listing/versions/live/pricingPlans/PRICING_PLAN_1.yml file:///Users/my_username/
   ```
2. Edit the pricing plan manifest reference.
3. Upload the pricing plan and listing manifest reference files and commit the change.

   ```sqlexample
   PUT file:///Users/my_username/PRICING_PLAN_1.yaml snow://listing/my_listing/versions/live/pricingPlans AUTO_COMPRESS = false;

   ALTER LISTING my_listing COMMIT;
   ```

## View pricing plan details

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, select the Listings tab.
4. On the Listings page, select a paid listing.
5. Select the Pricing tab.
6. Select Options for the pricing plan you want to view, and then select See details.
7. Review the pricing plan and click Edit to edit the pricing plan, or click Close.

To see details of the pricing plan in your listing, run the [SHOW PRICING PLANS](../../../../sql-reference/sql/show-pricing-plans.md) command.

```sqlexample
SHOW PRICING PLANS IN LISTING my_listing;
```

---
title: Create and manage storage lifecycle policies
source: https://docs.snowflake.com/en/user-guide/storage-management/storage-lifecycle-policies-create-manage.md
section: User Guide
---

# Create and manage storage lifecycle policies

The following sections explain how to create, recreate, and manage storage lifecycle policies on your tables.

## Create a storage lifecycle policy

To create a storage lifecycle policy, use the [CREATE STORAGE LIFECYCLE POLICY](../../sql-reference/sql/create-storage-lifecycle-policy.md) command.

When you create a storage lifecycle policy, you can choose an [archive tier](storage-lifecycle-policies.md)
and optionally set an archival period in days.
If you set an archival period, Snowflake moves table rows that match the policy expression into a lower-cost storage tier
for the specified number of days before expiring the rows.
Snowflake also enables change tracking on any tables that you attach the policy to.

For example:

```sqlexample
CREATE STORAGE LIFECYCLE POLICY my_slp
  AS (event_ts TIMESTAMP, account_id NUMBER)
  RETURNS BOOLEAN ->
    event_ts < DATEADD(DAY, -60, CURRENT_TIMESTAMP())
    AND EXISTS (
      SELECT 1 FROM closed_accounts
      WHERE id = account_id
    )
  ARCHIVE_TIER = COOL
  ARCHIVE_FOR_DAYS = 90;
```

> **Note:**
>
> For considerations when you work with tables that have archival storage policies, see [Archival storage policies](storage-lifecycle-policies.md).

### Best practice: Use date conversions for time-based expressions

To improve performance and ensure consistent policy execution, convert timestamps to dates in your policy expressions
when you compare time values.

For example, consider this policy expression:

```sqlexample
event_time < DATEADD(DAY, -400, CURRENT_TIMESTAMP())
```

This comparison includes the time component of the timestamp, which can cause inconsistent behavior. When data gets inserted
in chronological order by `event_time`, the policy’s execution time affects how many rows get deleted from each file.

To avoid this inconsistent behavior, convert timestamps to dates in your expression:

```sqlexample
event_time < TO_DATE(DATEADD(DAY, -400, CURRENT_TIMESTAMP()))
```

This method provides consistent policy execution regardless of the time of day.

## Recreate a storage lifecycle policy

This feature extends the [GET_DDL](../../sql-reference/functions/get_ddl.md) command to recreate a
specified storage lifecycle policy. You might do this if you want to change the archival tier for a policy.

To recreate a storage lifecycle policy named `my_slp`, return the DDL, as shown in the following example:

```sqlexample
SELECT GET_DDL('policy','my_slp');
```

Output:

```output
---------------------------------------------------------------------+
                      GET_DDL('POLICY','SLP')                        |
---------------------------------------------------------------------+
create or replace storage lifecycle policy SLP as                    |
  (event_ts timestamp, account_id number)
    returns boolean ->
    event_ts < dateadd(day, -60, current_timestamp())
    and exists (
      select 1 from closed_accounts
      where id = account_id
  )
  ARCHIVE_FOR_DAYS = 365                                             |
;                                                                    |
---------------------------------------------------------------------+
```

## Manage storage lifecycle policies on tables

Use the following options to manage storage lifecycle policy attachments.

### Attach a policy to a table

You can manage multiple tables with one storage lifecycle policy. Attach the policy when you create or alter the table.

To create a table and attach the policy to a new table by
using the specified columns, use [CREATE TABLE](../../sql-reference/sql/create-table.md), as shown in the following example.

> **Note:**
>
> * You must have the necessary privileges to apply the policy. For information about required privileges, see
>   [Storage lifecycle policy privileges](../security-access-control-privileges.md).
> * A table can have only one attached storage lifecycle policy.
> * The number of columns must match the argument count in the policy function signature, and the column data must be compatible with the argument types.
> * Associated policies aren’t affected if you rename table columns. Snowflake associates policies to tables by using the column IDs.
> * In order to evaluate and apply storage lifecycle policy expressions, Snowflake internally and temporarily bypasses any governance policies on a table.

```sqlexample
CREATE TABLE my_table
  ...
  WITH STORAGE LIFECYCLE POLICY my_slp ON (col1);
```

To attach the policy to an existing table by using the specified columns, use [ALTER TABLE](../../sql-reference/sql/alter-table.md), as shown in the following example:

```sqlexample
ALTER TABLE my_table ADD STORAGE LIFECYCLE POLICY my_slp
  ON (col1);
```

### Apply a policy as a one-time operation

If you only need to expire or archive historical data once, as a one-time operation, we recommend the following procedure:

1. Create, and then attach a storage lifecycle policy to your table.
2. Wait for the policy to execute, and then archive or expire the data.

   Monitor the [INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/functions/storage_lifecycle_policy_history.md) table
   function to confirm the process is complete.
3. To prevent recurring charges, remove the storage lifecycle policy
   from the table.

   Storage lifecycle policies incur cost *per execution*.

This method ensures that you only pay for a single execution instead of ongoing daily charges for a
policy that has already processed all eligible data. For more information about cost,
see [Billing for storage lifecycle policies](storage-lifecycle-policies-billing.md).

### Remove a policy from a table

To remove a storage lifecycle policy from a table, use [ALTER TABLE](../../sql-reference/sql/alter-table.md), as shown in the following example:

```sqlexample
ALTER TABLE my_table DROP STORAGE LIFECYCLE POLICY;
```

* This command removes all future policy executions for this table.
* Running policy executions might complete before they are dropped from the table.
* To drop a storage lifecycle policy, you must have the OWNERSHIP privilege on the table the policy is attached to.

---
title: Create custom categories for sensitive data
source: https://docs.snowflake.com/en/user-guide/classify-custom.md
section: User Guide
---

# Create custom categories for sensitive data

If there isn’t a [native semantic category](classify-native.md) that detects your domain-specific sensitive data, you can
create a custom category for your sensitive data.

Implement custom semantic categories by defining a custom classifier. A custom classifier has the following attributes:

* Custom semantic categories that identify types of data; for example, `medical_code` and `employee_id`.
* Regular expressions that are used by Snowflake’s algorithm to detect your sensitive data.
* One of the pre-defined privacy categories.

## How it works

Snowflake provides the CUSTOM_CLASSIFIER [class](../sql-reference/classes/custom_classifier.md) in the SNOWFLAKE.DATA_PRIVACY schema to
enable data engineers to extend their data classification capabilities based on their own knowledge of their data. After you create an
instance of the class, you can call a method on the instance to define your custom semantic category, specify the privacy category, and
specify regular expressions to match column value patterns while optionally matching the column name.

> **Important:**
>
> Sensitive data classification stores the definition of a custom classifier, not a reference. If you change the custom classifier, you must use
> the SET_CUSTOM_CLASSIFIERS method to update the classification profile with the new definition.

For an example of using the CUSTOM_CLASSIFIER class to create and use a custom classifier, see [Example](classify-custom-using.md).

## Considerations

Choose a warehouse that matches the size of the data you are classifying:

* No concern for processing time: x-small warehouse.
* Up to 100 columns in a table: small warehouse.
* 101 to 300 columns in a table: medium warehouse.
* More than 300 in a table: large warehouse.

## Threshold for custom categories

The algorithm used to classify custom categories uses a *scoring rule* to evaluate the regular expression of your custom classifier to
determine which semantic category to recommend.

The scoring rule uses a default threshold value of 0.8, which equates to high confidence in terms of what the recommended category should
be. Eighty percent of the data in the sample must match the regular expressions that you add to the instance. The algorithm compares the
score for a column against the threshold value and recommends a category that corresponds to one of the following:

* Non-international system tag
* International system tag
* Custom classifier tag

You can specify the threshold value for a custom classification instance by calling the
[custom_classifier!ADD_REGEX](../sql-reference/classes/custom_classifier/methods/add_regex.md) method on the instance.

> **Note:**
>
> It is possible for two custom classifiers to have the same score. In this case, a tie is resolved by evaluating the following:
>
> * Match percentage between respective custom categories.
> * Alphabetical order between the names of the custom categories.
>
> In such a case, the winning category will be the recommended category and rest is contained in the alternates.

The following table summarizes the scoring algorithm and the recommended tag:

| Name Matcher Provided | Value matches >= threshold | Name matches | Recommendation |
| --- | --- | --- | --- |
| True | True | True | Custom category |
|  | False | True | Snowflake category |
|  | True | False | Snowflake category |
|  | False | False | Snowflake category |
| False | True | Not applicable | Custom category |
|  | False | Not applicable | Snowflake category |

## Replication and cloning

* Instances of the CUSTOM_CLASSIFIER class are replicated when you replicate a database.
* Instances of the CUSTOM_CLASSIFIER class are cloned when you clone the schema that contains the instances.

---
title: Create dynamic Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-create-iceberg.md
section: User Guide
---

# Create dynamic Apache Iceberg™ tables

This topic explains how to create dynamic Iceberg tables, which store query results in
the Iceberg table format.

## Create dynamic Iceberg tables

Dynamic Iceberg tables combine the benefits of dynamic tables and Snowflake-managed
Iceberg tables, offering features like external cloud storage management, automated data
transformation, and performance optimization.

Dynamic Iceberg tables integrate with data lakes, which let you store data in external
cloud storage such as AWS S3 or Azure Blob Storage while being managed by Snowflake.
These tables support ACID transactions, schema evolution, hidden partitioning, and
table snapshots.

Automated data transformation with dynamic Iceberg tables uses declarative SQL to define
the desired end state without managing intermediary steps. Snowflake handles
orchestration, scheduling, and refreshing data transformations based on your specified
data freshness targets.

Performance is optimized through incremental processing, which processes only changed
data to improve performance and reduce costs compared to full data refreshes.
Additionally, you can transition between batch processing and streaming data with a
simple command, providing flexibility in data processing workflows.

Example use cases for dynamic Iceberg tables include the following:

* **Data lake integration:** You can store large datasets cost-effectively while
  performing transformations and analytics within Snowflake, leveraging the Iceberg
  format for efficient querying and management.
* **Defining continuous data transformation pipelines:** By using dynamic tables, you
  can ensure data is always up to date without manual intervention and handle
  high-velocity data streams efficiently with incremental processing.

To create a dynamic Iceberg table, execute the [CREATE DYNAMIC ICEBERG TABLE](../sql-reference/sql/create-dynamic-table.md) SQL
statement. For example, to create a dynamic Iceberg table that reads from `my_iceberg_table`,
use the following syntax:

```sqlexample
CREATE DYNAMIC ICEBERG TABLE my_dynamic_iceberg_table (product_id NUMBER(10,0), product_name STRING, order_time TIMESTAMP_NTZ)
  TARGET_LAG = '20 minutes'
  WAREHOUSE = my_warehouse
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_iceberg_table'
  AS
    SELECT product_id, product_name, order_time FROM staging_table;
```

## Configure partitioning, file size, and path layout

Dynamic Iceberg tables support the `PARTITION_BY`, `TARGET_FILE_SIZE`, and
`PATH_LAYOUT` table properties. These properties work the same way as they do for
regular Snowflake-managed Iceberg tables.

* **PARTITION_BY**: Specifies one or more Iceberg partition expressions that control how
  Snowflake partitions data files in the table. For example, to partition a dynamic
  Iceberg table by the year of an `order_time` column:

  ```sqlexample
  CREATE DYNAMIC ICEBERG TABLE my_dynamic_iceberg_table (
      product_id NUMBER(10,0),
      product_name STRING,
      order_time TIMESTAMP_NTZ
    )
    TARGET_LAG = '20 minutes'
    WAREHOUSE = my_warehouse
    EXTERNAL_VOLUME = 'my_external_volume'
    CATALOG = 'SNOWFLAKE'
    BASE_LOCATION = 'my_iceberg_table'
    PARTITION BY (YEAR(order_time))
    PATH_LAYOUT = HIERARCHICAL
    AS
      SELECT product_id, product_name, order_time FROM staging_table;
  ```

  For supported partition transforms, see
  [Partition expression parameters (partitionExpression)](../sql-reference/sql/create-iceberg-table-snowflake.md).
* **TARGET_FILE_SIZE**: Specifies the target Parquet file size for the table. Defaults
  to `AUTO`, which lets Snowflake choose the file size based on table characteristics.
  For more information, see [Set a target file size](tables-iceberg-manage.md).
* **PATH_LAYOUT**: Specifies whether Snowflake writes Parquet data files using a flat or
  hierarchical path layout. Defaults to `FLAT`. Use `HIERARCHICAL` together with
  `PARTITION_BY` to enable Hive-style partitioned paths under the `data/` directory.

For full parameter details, see
[CREATE DYNAMIC ICEBERG TABLE](../sql-reference/sql/create-dynamic-table.md).

## Future grants on dynamic Iceberg tables

To ensure access to any new dynamic Iceberg tables created in the schema, use the
[GRANT … ON FUTURE ICEBERG TABLES](../sql-reference/sql/grant-privilege.md)
syntax without the `DYNAMIC` keyword. For example:

```sqlsyntax
GRANT <privilege> ON FUTURE ICEBERG TABLES IN SCHEMA my_schema TO ROLE my_role;
```

If you use the `DYNAMIC` keyword, the grant doesn’t provide access to new dynamic
Iceberg tables created in the schema. For instance, the following command doesn’t apply
for dynamic Iceberg tables:

```sqlsyntax
GRANT <privilege> ON FUTURE DYNAMIC TABLES IN SCHEMA my_schema TO ROLE my_role;
```

## Considerations and limitations

* Dynamic Iceberg tables support the same data types as regular Iceberg tables in
  Snowflake. For more information, see [Supported data types](tables-iceberg-data-types.md).
* The [Catalog](tables-iceberg.md) is an account, schema, or database parameter
  that you can configure to be implicit, just like regular Snowflake managed Iceberg tables.
* Dynamic Iceberg tables don’t currently support the `IF NOT EXISTS` clause. Using the
  `IF NOT EXISTS` clause throws an error if the target table already exists.
* Dynamic Iceberg tables are currently only supported for `CREATE` statements.
  Specifying `DYNAMIC ICEBERG` in any other command (for example,
  `ALTER DYNAMIC ICEBERG TABLE <name>`) results in an error.
* You can’t clone dynamic Iceberg tables. Additionally, cloning a database or schema
  containing a dynamic Iceberg table doesn’t clone the table to the new location.

---
title: Create dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-create.md
section: User Guide
---

# Create dynamic tables

This topic outlines the key concepts for creating dynamic tables.

Before you begin, ensure you have the [privileges for creating dynamic tables](dynamic-tables-privileges.md), and all objects used
by the dynamic table query have change tracking enabled.

Some limitations apply to creating dynamic tables. For a complete list, see [Dynamic table limitations](dynamic-tables-limitations.md).

> **Note:**
>
> For guidance on writing queries that work efficiently with incremental refresh, see
> [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

## Enable change tracking

When creating a dynamic table with incremental refresh mode, if change tracking is not already enabled on the tables that it queries, Snowflake
automatically attempts to enable change tracking on them. In order to support incremental refreshes, change tracking must be enabled with
[non-zero time travel retention](../sql-reference/parameters.md) on all underlying objects used by a dynamic table.

As base objects change, so does the dynamic table. If you recreate a base object, you must re-enable change tracking.

> **Note:**
>
> Snowflake doesn’t automatically attempt to enable change tracking on dynamic tables created with full refresh mode.

To enable change tracking on a specific database object, use [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER VIEW](../sql-reference/sql/alter-view.md), and
similar commands on that object. The user creating the dynamic table must have the OWNERSHIP privilege to enable change tracking on all
underlying objects.

To check if change tracking is enabled, use [SHOW VIEWS](../sql-reference/sql/show-views.md), [SHOW TABLES](../sql-reference/sql/show-tables.md), and similar commands
on the underlying objects, and inspect the `change_tracking` column.

## Supported base objects

Dynamic tables support the following base objects:

* Tables
* Snowflake-managed Apache Iceberg™ tables
* Externally managed Apache Iceberg™ tables

## Example: Create a simple dynamic table

Suppose that you want to create a dynamic table that contains the `product_id` and `product_name` columns from a table named
`staging_table`, and you decide:

* You want the data in your dynamic table to be at most 20 minutes behind the data in `staging_table`.
* You want to use the warehouse `mywh` for the compute resources needed for the [refresh](dynamic-tables-refresh.md).
* You want the refresh mode to be automatically chosen.

  + Snowflake recommends using the automatic refresh mode only during development. For more information,
    see [Choose a refresh mode](dynamic-tables-performance-optimize.md).
* You want the dynamic table to refresh synchronously at creation.
* You want the refresh mode to be automatically chosen, and you want the dynamic table to refresh synchronously at creation.

To create this dynamic table, you would execute the following [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md) SQL
statement:

```sqlexample
CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = '20 minutes'
  WAREHOUSE = mywh
  REFRESH_MODE = auto
  INITIALIZE = on_create
  AS
    SELECT product_id, product_name FROM staging_table;
```

For a complete list of parameters and variant syntax, see the [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md) reference.

## Create dynamic tables that read from Snowflake-managed or externally managed Apache Iceberg™ tables

Creating a dynamic table from an Iceberg table is similar to creating one from a regular table. Execute the [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md)
SQL statement as you would for a regular table, using either a Snowflake-managed table or a table managed by an external catalog as the base
object.

Dynamic tables that read from a Snowflake-managed Iceberg table as the base table are useful if you want your pipelines to operate on data in
a Snowflake-managed Iceberg table or if you want your pipelines to operate on Iceberg tables written by other engines. Note that external
engines cannot write to Snowflake-managed Iceberg tables; they are read-write for Snowflake and read-only for external engines.

Dynamic tables that read from Iceberg tables managed by [external (non-Snowflake) catalogs](tables-iceberg.md),
such as AWS Glue and written by engines like Apache Spark, are useful for processing data from external data lakes. You can create dynamic
tables on top of externally managed data, continuously processing it in Snowflake without duplicating or ingesting the data.

### Limitations and considerations for using Iceberg tables

All limitations for [regular dynamic tables](dynamic-tables-limitations.md) and
[dynamic Iceberg tables](dynamic-tables-create-iceberg.md) still apply.

Additionally:

* All limitations for Iceberg base tables apply. For more information, see [Considerations and limitations](tables-iceberg.md).
* You can create a dynamic table that reads from Snowflake native tables, Snowflake-managed Iceberg tables, and externally managed Iceberg
  tables.
* Dynamic tables track changes at the file level for externally managed Iceberg base tables, unlike other base tables that track changes at
  the row level. Frequent copy-on-write operations (for example, updates or deletes) on externally managed Iceberg tables may impact the
  performance of incremental refreshes.

## Create dynamic tables with immutability and backfill

[Immutability constraints](dynamic-tables-immutability-constraints.md) let you mark portions of a dynamic table as static.
When you define an `IMMUTABLE WHERE` clause, Snowflake skips those rows during refresh,
which improves performance for tables with large amounts of historical data.

Backfill extends immutability constraints by letting you copy existing data into a dynamic table without computing it.
This operation makes historical data immediately available while you define a custom refresh query for future updates.

For more information and examples, see [Use immutability constraints](dynamic-tables-performance-optimize-immutability.md).

## Best practices for creating dynamic tables

### Chain together pipelines of dynamic tables

When defining a new dynamic table, rather than defining a large dynamic table with many nested statements, use small dynamic tables with
pipelines instead.

You can set up a dynamic table to query other dynamic tables. For instance, imagine a scenario where your data pipeline extracts data from a
staging table to update various dimension tables (e.g., `customer`, `product`, `date` and `time`). Additionally, your
pipeline updates an aggregate `sales` table based on the information from these dimension tables. By configuring the dimension tables to
query the staging table and the aggregate `sales` table to query the dimension tables, you create a cascade effect similar to a task
graph.

In this setup, the refresh for the aggregate `sales` table executes only after the refreshes for the dimension tables have successfully
completed. This ensures data consistency and meets lag targets. Through an automated refresh process, any changes in the source tables trigger
refreshes in all dependent tables at the appropriate times.

### Use a “controller” dynamic table for complex task graphs

When you have a complex graph of dynamic tables with many roots and leaves and you want to perform operations (e.g. changing lag, manual
refresh, suspension) on the full task graph with a single command, do the following:

1. Set the value for the `TARGET_LAG` of all of your dynamic tables to `DOWNSTREAM`.
2. Create a “controller” dynamic table that reads from all of the leaves in your task graph.

   * A leaf dynamic table is a node in your task graph with no downstream dependencies. No other dynamic tables read from it, so it is not a
     dependency of any other dynamic table.
   * Replace `<leaf1>`, `<leaf2>`, …, `<leafN>` with actual leaf dynamic table names.
   * To ensure this controller doesn’t consume resources, create a cartesian join with `LIMIT 0`.
   > ```sqlexample
   > CREATE DYNAMIC TABLE controller
   >   TARGET_LAG = <target_lag>
   >   WAREHOUSE = <warehouse>
   >   AS
   >     SELECT 1 A FROM <leaf1>, …, <leafN> LIMIT 0;
   > ```
3. Use the controller to control the whole graph. For example:

> * Set a new target lag for the task graph.
>
>   ```sqlexample
>   ALTER DYNAMIC TABLE controller SET
>     TARGET_LAG = <new_target_lag>;
>   ```
> * Manually refresh the task graph.
>
>   ```sqlexample
>   ALTER DYNAMIC TABLE controller REFRESH;
>   ```

### Use transient dynamic tables to reduce storage cost

[Transient](tables-temp-transient.md) dynamic tables maintain data reliably over time and support Time Travel within the data
retention period, but don’t retain data beyond the fail-safe period. By default, dynamic table data is retained for seven days in
[fail-safe](data-failsafe.md) storage.

For dynamic tables with high refresh throughput, this can significantly increase storage consumption. Therefore, you should make a dynamic
table transient only if its data doesn’t need the same level of data protection and recovery provided by permanent tables.

You can create a transient dynamic table or clone existing dynamic tables to transient dynamic tables using
the [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md) statement.

## Troubleshoot dynamic table creation

When you create a dynamic table, the initial refresh happens either on a schedule (`ON_SCHEDULE`) or immediately at creation
(`ON_CREATE`). The initial data population, or [initialization](dynamic-tables-refresh.md), depends on when this initial
refresh occurs. For example, for `ON_CREATE`, initialization might take longer if it triggers refreshes of upstream dynamic tables.

Initialization can take some time, depending on how much data is scanned. To view progress, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History.
3. In the Filters dropdown, enter CREATE DYNAMIC TABLE in the SQL Text filter and enter your warehouse name in the Warehouse
   filter.
4. Select the query with your dynamic table under SQL text and use the Query Details and Query Profile tabs to track progress.

---
title: Create external cloud storage for a catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/create-external-cloud-storage.md
section: User Guide
---

# Create external cloud storage for a catalog

This article describes how to create external cloud storage for Apache Iceberg™ tables for the following cloud storage providers:

* Amazon S3
* Cloud Storage from Google
* Microsoft Azure container

Before you can create an internal catalog in your Snowflake Open Catalog account, you must
first create and configure external cloud storage for it.

## Create an Amazon S3 bucket

1. Sign in to the AWS Management Console.
2. On the home dashboard, search for and select **S3**.
3. Select **Create bucket**.
4. For **Bucket name**, enter a name for the bucket.
5. Configure the settings for your storage bucket or use the default settings.
6. Select **Create bucket**.
7. Search for and select the storage bucket you created.
8. To create a folder, select **Create folder**.

   **Note**

   We recommend creating this folder as a best practice.
9. For **Folder name**, enter the name of the folder where you want to store Apache Iceberg™ tables, and then select
   **Create folder**.
10. Select the folder you created.
11. Select **Copy S3 URI**, and then store the URI for later use.

    **Note**

    When creating a catalog in Open Catalog, you enter the S3 URI in the **Default base location** field.

## Create a Cloud Storage bucket

1. Sign in to the Google Cloud console as a project editor.
2. In the navigation menu, select **Solutions > All products**.
3. Under **Storage**, select **Cloud Storage**.
4. Select **Create**.
5. Under **Get Started**, enter a name for your Cloud Storage bucket.
6. Optional: Configure the settings for your storage bucket.
7. Select **Create**.
8. On the **Bucket details** page, select **CREATE FOLDER**.
9. Enter a folder name where you want to store Apache Iceberg™ tables, and then select **Create**.
10. On the **Bucket details** page, next to the name of the folder you created, select **Copy**, and store the path for later use.

    **Note**

    > When creating a catalog in Open Catalog, you enter the path to the folder you created in the **Default base location** field.

## Create a Microsoft Azure container

To create a Microsoft Azure container for your Apache Iceberg tables, use one of the following Azure cloud storage services:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v1
* General-purpose v2

These services are the Azure cloud storage services that Open Catalog supports for storage integrations. A storage integration is an Open
Catalog object that stores a generated identity and access management (IAM) entity for your external cloud storage and is created when you
create a catalog.

### Step 1: Create a storage account

1. Sign in to Azure.
2. On the home dashboard, search for and select **Storage account**.
3. Select **+ Create**.
4. For **Resource group**, select a resource group for your storage account or select **Create new** to create a new resource group.
5. For **Storage account name**, enter a name for your storage account.
6. Optional: Enable hierarchical namespace to use the storage account for Azure Data Lake Storage Gen2 workloads. For more information,
   see [Create a storage account](https://learn.microsoft.com/en-us/azure/storage/common/storage-account-create?tabs=azure-portal#create-a-storage-account).
7. Optional: Configure the settings for your storage account.
8. Select **Review + create**.
9. Select **Create**.

### Step 2: Create a container in your storage account

1. In Azure, navigate to the storage account you created.
2. From the menu on the left, Select **Data storage**.
3. Under Data storage, select **Containers**.
4. Select **+ Container**.
5. Enter a name for your container, and then select **Create**.
6. Copy and save the name of your container. You need to specify this name when you create a catalog in Open Catalog.
7. Optional: If you’re using a hierarchical namespace and need to add a directory:

   a. Select the container you created.

   b. Select **+ Add Directory**.

   c. Enter a name for the directory, and then select **Save**.

   d. Copy and save the name of this directory. You need to specify this name when you create a catalog in Open Catalog.

### Step 3: Copy the endpoint path to your container

1. In Azure, navigate to the storage account you created.
2. From the menu on the left, select **Settings**.
3. Under Settings, select **Endpoints**.
4. Copy and store the path to the primary endpoint for your container:

   * If you’re using blob storage, under Blob service, select the **Copy to clipboard** icon for the **Primary endpoint: Blob service** field.
   * If you’re using Azure Data Lake Storage, under Data Lake Storage, select the **Copy to clipboard** icon for the **Primary endpoint: Data Lake Storage**
     field.

   **Note**

   > When creating a catalog in Open Catalog, you enter the path to the primary endpoint for your container in the **Default base location** field.
   > The steps for creating a catalog in Open Catalog include instructions for how to format this path into the required format for
   > the **Default base location** field.

---
title: Create hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-create.md
section: User Guide
---

# Create hybrid tables

This topic provides an overview on creating [hybrid tables](tables-hybrid.md) in Snowflake.

> **Note:**
>
> To create a hybrid table, you must have a running warehouse that is specified as the current warehouse for your session.
> Errors may occur if no running warehouse is specified when you create a hybrid table.
> For more information, see [Working with Warehouses](warehouses-tasks.md).

## CREATE HYBRID TABLE options

You can create a hybrid table by using one of the following methods.

* [CREATE HYBRID TABLE](../sql-reference/sql/create-hybrid-table.md). The following example
  creates a hybrid table with a required PRIMARY KEY constraint, inserts some rows, deletes a row, and queries the table:

  ```sqlexample
  CREATE OR REPLACE HYBRID TABLE application_log (
    id NUMBER PRIMARY KEY AUTOINCREMENT,
    col1 VARCHAR(20),
    col2 VARCHAR(20) NOT NULL
    );

  INSERT INTO application_log (col1, col2) VALUES ('A1', 'B1');
  INSERT INTO application_log (col1, col2) VALUES ('A2', 'B2');
  INSERT INTO application_log (col1, col2) VALUES ('A3', 'B3');
  INSERT INTO application_log (col1, col2) VALUES ('A4', 'B4');

  SELECT * FROM application_log;

  UPDATE application_log SET col2 = 'B3-updated' WHERE id = 3;

  DELETE FROM application_log WHERE id = 4;

  SELECT * FROM application_log;
  ```
* [CREATE HYBRID TABLE … AS SELECT (CTAS)](../sql-reference/sql/create-hybrid-table.md) or [CREATE HYBRID TABLE … LIKE](../sql-reference/sql/create-hybrid-table.md). For example:

  ```sqlexample
  CREATE OR REPLACE HYBRID TABLE dept_employees (
    employee_id INT PRIMARY KEY,
    department_id VARCHAR(200)
    )
  AS SELECT employee_id, department_id FROM company_employees;
  ```

## Loading data

> **Note:**
>
> Because the primary storage for hybrid tables is a row store, hybrid tables typically
> have a larger storage footprint than standard tables.
> The main reason for the difference is that columnar data for standard tables often
> achieves higher rates of compression. For details about storage costs, see
> [Evaluate cost for hybrid tables](tables-hybrid-cost.md).

### Optimized bulk loads

You can bulk load data into hybrid tables by copying either from a data stage or
from other tables (using
[CTAS](../sql-reference/sql/create-table.md), [COPY INTO <table>](../sql-reference/sql/copy-into-table.md), or
[INSERT INTO … SELECT](../sql-reference/sql/insert.md)).

The optimization of bulk loads depends on whether the table is freshly created, without ever having
any records loaded, or is created using a CTAS query.

When a hybrid table is empty, all three load methods (CTAS, COPY, and INSERT INTO … SELECT) use optimized
bulk loading to speed up the load process. After the table is loaded, normal INSERT performance applies.
You can still run incremental batch loads with
COPY and INSERT INTO … SELECT operations, but they will typically be less
efficient. Bulk load speeds of approximately 1 million records per minute are common but can widely vary
based on the structure of the table (for example, larger records are slower to load). Optimized bulk
loading will be extended to support incremental batch loads in a future release.

You can check the Statistics information in Snowsight query profiles to see whether the bulk-load
fast path was used. Number of rows inserted is referred to as the Number of rows bulk loaded when the fast
path is used. For example, this CTAS operation bulk loaded 200000 rows into a new table:

A subsequent incremental batch load into the same table would not use optimized bulk loading.

For more information about query profiles, see [Analyze query profiles for hybrid tables](tables-hybrid-read-query-profiles.md) and
[Monitor query activity with Query History](ui-snowsight-activity.md).

> **Attention:**
>
> CTAS commands do not support FOREIGN KEY constraints. If your hybrid table requires FOREIGN KEY constraints,
> use COPY or INSERT INTO … SELECT to load the table.

> **Note:**
>
> Other methods of loading data into Snowflake tables (for example, Snowpipe) are not currently supported.

### Index-building errors during loads

Index sizes are limited in width. When building indexes on columns in a hybrid table, especially
indexes on a large number of columns, any command that loads the table
(including CTAS, COPY, or INSERT INTO … SELECT) might return the following error. In this case, the table
contains an index named `IDX_HT100_COLS`:

```output
The value is too long for index "IDX_HT100_COLS".
```

This error occurs because row-based storage imposes a limit on the size of the data (and metadata) that can
be stored per record. To reduce the record size, try creating the table without specifying larger columns,
such as wide VARCHAR columns, as indexed columns. You can also try creating indexes on fewer columns.

You can also try using INCLUDE columns on secondary indexes when you create a hybrid table or an index on a
hybrid table. For more information, see [INCLUDE columns](tables-hybrid-index.md).

---
title: Create users and grant roles
source: https://docs.snowflake.com/en/user-guide/tutorials/users-and-roles-tutorial.md
section: User Guide
---

Snowflake

Getting Started

# Create users and grant roles

## Introduction

This tutorial shows you how to create a user and grant a role to it by using SQL commands.
You can access a pre-loaded [Snowsight template](../ui-snowsight/snowsight-templates.md)
worksheet to complete these tasks.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you will learn

In this tutorial you will learn how to:

* Use a role that has the privileges to create and use the Snowflake objects required by this tutorial.
* Create a user.
* Grant a role to the user and grant access to a warehouse.
* Explore the users and roles in your account.
* Drop the user you created.

## Prerequisites

This tutorial assumes the following:

* You have a [supported browser](../ui-snowsight-gs.md).
* You have access to a Snowflake account and can log in as a user who has been granted
  the ACCOUNTADMIN, USERADMIN, and SECURITYADMIN
  [system-defined roles](../security-access-control-overview.md).

  If you don’t have an account, you can sign up for a [free trial](https://signup.snowflake.com/)
  and choose any [Snowflake Cloud Region](../intro-regions.md).

## Step 1. Sign in using Snowsight

To access Snowsight over the public Internet, do the following:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](../admin-account-identifier.md) or account URL.
   If you’ve previously signed in to Snowsight, you might see an account name that you can select.
3. Sign in using your Snowflake account credentials.

## Step 2. Open the SQL worksheet for adding a user and granting roles

You can use worksheets to write and run SQL commands on your database.
You can access a pre-loaded template worksheet for this tutorial.
The worksheet contains the SQL commands that you will run to set the role context,
create a user, and grant role privileges. Because it is a template worksheet, you
will be invited to enter your own values for certain SQL parameters.

For more information about worksheets, see [Getting started with worksheets](../ui-snowsight-worksheets-gs.md).

To open the pre-loaded template worksheet, follow these steps:

1. In the navigation menu, select Projects » Templates.
2. Find and open Create users in a SQL worksheet.

   The beginning of your worksheet looks similar to the following image:

## Step 3. Set the role to use

The role you use determines the privileges you have. In this tutorial, use the
USERADMIN system role so that you can create and manage users and roles in your
account. For more information, see [Overview of Access Control](../security-access-control-overview.md).

To set the role to use, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line.

   ```sqlexample
   USE ROLE USERADMIN;
   ```
2. At the top of the worksheet, select Run.

   > **Note:**
   >
   > In this tutorial, run SQL statements one at a time. Don’t select Run all.

## Step 4. Create a user

A Snowflake user has login credentials. When a user is granted a role, the user can
perform all the operations that the role allows, through the privileges that were
granted to the role. For more information, see [User management](../admin-user-management.md).

In this step of the tutorial, you create a user with a name, a password, and some
other properties.

In the open worksheet, place your cursor in the [CREATE USER](../../sql-reference/sql/create-user.md) line,
insert a username and other parameter values of your choice (an example is shown below), and
select Run.

For MUST_CHANGE_PASSWORD, set the value to `true`, which ensures that a password
reset is requested on first login. For DEFAULT_WAREHOUSE, use `COMPUTE_WH`.

```sqlexample
CREATE OR REPLACE USER snowman
PASSWORD = 'sn0wf@ll'
LOGIN_NAME = 'snowstorm'
FIRST_NAME = 'Snow'
LAST_NAME = 'Storm'
EMAIL = 'snow.storm@snowflake.com'
MUST_CHANGE_PASSWORD = true
DEFAULT_WAREHOUSE = COMPUTE_WH;
```

This command returns the following output:

> ```output
> User SNOWMAN successfully created.
> ```

If you were creating a real user in a real Snowflake account, you would now send the
following information in a secure manner to the person who would need to access
this new account:

* Snowflake Account URL: The Snowflake account link where the user will log in.
  You can find this link at the top of your browser
  (for example: <https://app.snowflake.com/myorg/myaccount/>,
  where `myorg` is the Snowflake organization ID, and `myaccount` is the account ID).
* LOGIN_NAME, as specified in the CREATE USER command.
* PASSWORD, as specified in the CREATE USER command.

## Step 5. Grant a system role and warehouse access to the user

Now that you have created a user, you can use the SECURITYADMIN role to grant the
SYSADMIN role to the user, and grant USAGE on the COMPUTE_WH warehouse.

Granting a role to another role creates a parent-child relationship between the roles
(also referred to as a role hierarchy). Granting a role to a user enables the user to perform
all operations allowed by the role (through the access privileges granted to the role).

The SYSADMIN role has privileges to create warehouses, databases, and database objects
in an account and grant those privileges to other roles. Only grant this role to users who should
have these privileges. For more information about other system-defined roles, see
[Overview of Access Control](../security-access-control-overview.md).

To grant the user access to a role and a warehouse, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line,
   then select Run.

   ```sqlexample
   USE ROLE SECURITYADMIN;
   ```
2. Place your cursor in the [GRANT ROLE](../../sql-reference/sql/grant-role.md) line, enter the name of the user you created,
   then select Run.

   ```sqlexample
   GRANT ROLE SYSADMIN TO USER snowman;
   ```
3. Place your cursor in the GRANT USAGE line, then select Run.

   ```sqlexample
   GRANT USAGE ON WAREHOUSE COMPUTE_WH TO ROLE SYSADMIN;
   ```

## Step 6. Explore the users and roles in your account

Now you can explore all the users and roles in your account by using the ACCOUNTADMIN role.

To explore users and roles, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line,
   then select Run.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
2. Place your cursor in the [SHOW USERS](../../sql-reference/sql/show-users.md) line, then select Run.

   ```sqlexample
   SHOW USERS;
   ```

   Your output looks similar to the following image:
3. Place your cursor in the [SHOW ROLES](../../sql-reference/sql/show-roles.md) line, then select Run.

   ```sqlexample
   SHOW ROLES;
   ```

   Your output looks similar to the following image:

## Step 7. Drop the user and review key points

Congratulations! You have successfully completed this tutorial.
Take a few minutes to review the key points that were covered.
Learn more by reviewing other topics in the Snowflake Documentation.

### Drop the user

Assuming that it is no longer needed, you can now drop the user you created.

In the open worksheet, place your cursor in the [DROP USER](../../sql-reference/sql/drop-user.md) line,
enter the name of the user you created, then select Run.

```sqlexample
DROP USER snowman;
```

### Review key points

In summary, you used a pre-loaded worksheet in Snowsight to complete the following steps:

1. Set the role to use.
2. Create a new user.
3. Grant the user role privileges and access to a warehouse.
4. Explore the users and roles in the account.
5. Drop the user you created.

Here are some key points to remember about users and roles:

* You need the required permissions to create and manage objects in your account. In this tutorial,
  you used the USERADMIN, SECURITYADMIN, SYSADMIN, and ACCOUNTADMIN system roles for different purposes.
* The ACCOUNTADMIN role isn’t normally used to create objects. Instead, we recommend creating a
  hierarchy of roles aligned with business functions in your organization. For more information, see
  [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).
* A warehouse provides the compute resources that you need to execute DML operations, load data,
  and run queries. This tutorial uses the `compute_wh` warehouse that is included with your account.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, and the SQL commands used to
  create users and grant role privileges:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [User, role, & privilege commands](../../sql-reference/commands-user-role.md)
* Try the Tasty Bytes Quickstarts provided by Snowflake:

  + [Tasty Bytes Quickstarts](https://www.snowflake.com/en/developers/guides/?searchTerm=tasty+bytes)

---
title: Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic
source: https://docs.snowflake.com/en/user-guide/notifications/creating-notification-integration-google-pubsub.md
section: User Guide
---

# Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic

To send notifications to a Google Cloud Pub/Sub topic, you must create a notification integration for that topic. To do this:

1. Create a Pub/Sub topic.
2. Create a Pub/Sub subscription.
3. Create a notification integration.
4. Grant Snowflake access to the Pub/Sub subscription.

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on Google Cloud (GC).

## Create the Pub/Sub topic

Create a Pub/Sub topic that can receive error notification messages from Snowflake, or reuse an existing topic. You can create
the topic using [Cloud Shell](https://cloud.google.com/shell) or [Cloud SDK](https://cloud.google.com/sdk). For more
information, see [Create and use topics](https://cloud.google.com/pubsub/docs/admin) in the Pub/Sub documentation.

For example, execute the following command to create an empty topic:

```bash
gsutil notification create -t <topic>
```

If the topic already exists, the command uses it; otherwise, a new topic is created.

## Create the Pub/Sub subscription

Optionally, create a subscription to the Pub/Sub topic to retrieve notifications. You can create a subscription with pull
delivery using the Cloud Console, `gcloud` command-line tool, or the Cloud Pub/Sub API. For instructions, see
[Managing topics and subscriptions](https://cloud.google.com/pubsub/docs/admin) in the Pub/Sub documentation.

## Create a notification integration in Snowflake

Run the [CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-queue-outbound-gcp.md) command to
create a notification integration. An integration is a Snowflake object that references the Pub/Sub topic you created.

Snowflake associates the notification integration with a Google Cloud (GC) service account created for your account.
Snowflake creates a single service account that is referenced by all GCP notification integrations in your Snowflake account.
The GCP service account for notification integrations is different from the service account created for storage integrations.

When running the command, set GCP_PUBSUB_TOPIC_NAME to the name of the
topic that you created earlier.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = TRUE
  DIRECTION = OUTBOUND
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  GCP_PUBSUB_TOPIC_NAME = 'projects/sdm-prod/topics/mytopic';
```

## Grant Snowflake access to the Pub/Sub subscription

1. Execute the [DESCRIBE NOTIFICATION INTEGRATION](../../sql-reference/sql/desc-notification-integration.md) command to display the properties of the notification
   that you just created.

   For example, to display the properties of the notification integration named `my_notification_int`:

   ```sqlexample
   DESC NOTIFICATION INTEGRATION my_notification_int;
   ```
2. Record the value of the GCP_PUBSUB_SERVICE_ACCOUNT property (the service account name), which has the following format:

   ```none
   <service_account>@<project_id>.iam.gserviceaccount.com
   ```
3. Log into the Google Cloud console as a project editor.
4. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
5. Select the subscription to configure for access.
6. Select SHOW INFO PANEL in the upper-right corner. The information panel for the subscription slides out.
7. In the Add members field, search for the service account name you recorded.
8. From the Select a role dropdown, select Pub/Sub Publisher.
9. Select Add.

   The service account name is added to the Pub/Sub Publisher role dropdown in the information panel.

---
title: Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic
source: https://docs.snowflake.com/en/user-guide/notifications/creating-notification-integration-azure-event-grid.md
section: User Guide
---

# Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic

To send notifications to a Microsoft Azure Event Grid topic, you must create a notification integration for that topic. To do
this:

1. Create a custom Event Grid topic.
2. Create a notification integration.

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on Microsoft Azure.

## Create a custom Event Grid topic

An Event Grid topic provides an endpoint where the source sends event notifications. Create a dedicated topic to receive
notifications published by Snowflake.

> **Note:**
>
> If you plan to use the topic for notifications about errors in [tasks](../tasks-errors.md) or
> [pipes](../data-load-snowpipe-errors.md), you can use a single topic for error notifications for all tasks or pipes.

For instructions on creating Event Grid topics, see the
[Event Grid documentation](https://docs.microsoft.com/en-us/azure/event-grid/custom-event-quickstart).

Record the Event Grid topic endpoint, which you will need later in these instructions.

Optionally, subscribe to the topic to inform Event Grid which events you want to track and where to send those events.

## Create notification integration in Snowflake

### Retrieve the tenant ID

Retrieve your Azure tenant ID, which you will need later in these instructions.

1. Log into the Microsoft Azure portal.
2. Navigate to Azure Active Directory » Properties. Record the Tenant ID value for reference later.
   The directory ID, or *tenant ID*, is needed to generate the consent URL that grants Snowflake access to the Event Grid topic.

### Create the notification integration

Run the [CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-queue-outbound-azure.md) command
to create a notification integration. An integration is a Snowflake object that references the Event Grid topic you created.

When running the command, set these parameters to the following values:

* Set AZURE_EVENT_GRID_TOPIC_ENDPOINT to the
  Event Grid topic endpoint, which you recorded earlier.
* Set AZURE_TENANT_ID to your Azure tenant ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = TRUE
  DIRECTION = OUTBOUND
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_EVENT_GRID
  AZURE_EVENT_GRID_TOPIC_ENDPOINT = 'https://myaccount.region-1.eventgrid.azure.net/api/events'
  AZURE_TENANT_ID = 'mytenantid';
```

### Grant Snowflake access to the topic

1. Execute the [DESCRIBE NOTIFICATION INTEGRATION](../../sql-reference/sql/desc-notification-integration.md) command to display the properties of the notification
   that you just created.

   For example, to display the properties of the notification integration named `my_notification_int`:
2. Record the values of the following properties:

   * AZURE_CONSENT_URL

     URL to the Microsoft permissions request page.
   * AZURE_MULTI_TENANT_APP_NAME

     Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant
     this application the permissions necessary to obtain an access token on your allowed topic.
3. In a web browser, navigate to the URL specified by the AZURE_CONSENT_URL property. The page displays a Microsoft permissions
   request page.
4. Select Accept. This action allows the Azure service principal created for your Snowflake account to be granted an
   access token on specified resources inside your tenant. Obtaining an access token succeeds only if you grant the service
   principal the appropriate permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
5. Log into the Microsoft Azure portal.
6. Navigate to Azure Active Directory » Enterprise applications. Verify that the Snowflake application
   identifier you recorded earlier (the value of the AZURE_MULTI_TENANT_APP_NAME property) is listed.

   > **Important:**
   >
   > If you delete the Snowflake application in Azure Active Directory at a later time, the notification integration stops
   > working.
7. Navigate to Event Grid Topics » `topic_name`, where `topic_name` is the name of the topic you
   created to receive event notifications.
8. Select Access Control (IAM) » Add role assignment.
9. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property that you recorded
   earlier. Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft
   >   request page in this section. If the service principal is not available immediately, we recommend waiting an hour or two
   >   and then searching again.
   > * If you delete the service principal, the notification integration stops working.
10. Grant the Snowflake application the
    [EventGrid Data Sender](https://docs.microsoft.com/en-us/azure/role-based-access-control/built-in-roles#eventgrid-data-sender)
    permission.

---
title: Creating a notification integration to send notifications to an Amazon SNS topic
source: https://docs.snowflake.com/en/user-guide/notifications/creating-notification-integration-amazon-sns.md
section: User Guide
---

# Creating a notification integration to send notifications to an Amazon SNS topic

To send notifications to an Amazon SNS topic, you must create a notification integration for that topic. To do this:

1. Create an Amazon SNS topic
2. Create the IAM policy that grants permission to publish to this topic.
3. Create the IAM role that you attach to this policy.
4. Create a notification integration.
5. Grant Snowflake access to the topic.

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on AWS.

## Create an Amazon SNS topic

Create an SNS topic in your AWS account to handle the notifications. Record the Amazon Resource Name (ARN) for the SNS topic.

> **Note:**
>
> Only standard SNS topics are supported. Do not create SNS FIFO (first in, first out) topics for use with error notifications.
> Currently, error notifications sent to FIFO topics fail silently.

To reduce latency and avoid [data egress](../cost-understanding-data-transfer.md) charges for sending notifications
across [regions](../intro-regions.md), we recommend creating the SNS topic in the same region as your Snowflake
account.

For instructions, see the [Creating an Amazon SNS topic](https://docs.aws.amazon.com/sns/latest/dg/sns-create-topic.html) in
the SNS documentation.

## Create the IAM policy

Create an AWS Identity and Access Management (IAM) policy that grants permissions to publish to the SNS topic. The policy defines the following actions:

* `sns:publish`: Publish to the SNS topic.

1. Log into the AWS Management Console.
2. From the home dashboard, choose Identity & Access Management (IAM).
3. Choose Account settings from the left-hand navigation pane.
4. Expand the Security Token Service Regions list, find the AWS region corresponding to the
   [region](../intro-regions.md) where your account is located, and choose Activate if the status is
   Inactive.
5. Choose Policies from the left-hand navigation pane.
6. Select Create Policy.
7. Select the JSON tab.
8. Add a policy document that defines actions that can be taken on your SNS topic.

   Copy and paste the following text into the policy editor:

   ```json
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Effect": "Allow",
         "Action": [
           "sns:Publish"
         ],
         "Resource": "<sns_topic_arn>"
       }
     ]
   }
   ```

   Replace `sns_topic_arn` with the ARN of the
   SNS topic that you created earlier.
9. Select Review policy.
10. Enter the policy name (e.g. `snowflake_sns_topic`) and an optional description, and select Create policy.

## Create the AWS IAM role

Create an AWS IAM role on which to assign privileges on the SNS topic.

1. Log into the AWS Management Console.
2. From the home dashboard, choose Identity & Access Management (IAM):
3. Choose Roles from the left-hand navigation pane.
4. Select Create role.
5. Select Another AWS account as the trusted entity type.
6. In the Account ID field, enter your own AWS account ID temporarily.
7. Select the Require external ID option. This option enables you to grant permissions on your Amazon account resources
   (i.e. SNS) to a third party (i.e. Snowflake).

   For now, enter a dummy ID such as `0000`. Later, you will modify the trust relationship and replace the dummy ID with the
   external ID for the Snowflake IAM user generated for your account. A condition in the trust policy for your IAM role allows
   your Snowflake users to assume the role using the notification integration object you will create later.
8. Select Next.
9. Locate the policy that you created earlier, and select this policy.
10. Select Next.
11. Enter a name and description for the role, and select Create role.
12. Record the Role ARN value located on the role summary page. You will specify this value in one or more later steps.

## Create the notification integration

Run the [CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-queue-outbound-aws.md) command to
create a notification integration. An integration is a Snowflake object that references the SNS topic you created.

> **Note:**
>
> If you plan to use the integration for notifications about errors in [tasks](../tasks-errors.md) or
> [pipes](../data-load-snowpipe-errors.md), a single notification integration can support multiple tasks or pipes.

When running the command, set these parameters to the following values:

* Set AWS_SNS_TOPIC_ARN to the SNS topic ARN you recorded earlier.
* Set AWS_SNS_ROLE_ARN to the IAM role ARN you recorded earlier.

  > **Note:**
  >
  > The value of AWS_SNS_ROLE_ARN is case-sensitive. Use the exact value that is specified in your AWS account.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = TRUE
  DIRECTION = OUTBOUND
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AWS_SNS
  AWS_SNS_TOPIC_ARN = 'arn:aws:sns:us-east-2:111122223333:sns_topic'
  AWS_SNS_ROLE_ARN = 'arn:aws:iam::111122223333:role/error_sns_role';
```

## Grant Snowflake access to the SNS topic

### Retrieve the IAM user ARN and SNS topic external ID

1. Execute the [DESCRIBE NOTIFICATION INTEGRATION](../../sql-reference/sql/desc-notification-integration.md) command to display the properties of the notification
   integration that you just created.

   For example, to display the properties of the notification integration named `my_notification_int`:

   ```sqlexample
   DESC NOTIFICATION INTEGRATION my_notification_int;
   ```

   ```output
   +---------------------------+-------------------+------------------------------------------------------+----------------------+
   |   property                |   property_type   |   property_value                                     |   property_default   |
   +---------------------------+-------------------+------------------------------------------------------+----------------------+
   |   ENABLED                 |   Boolean         |   true                                               |   false              |
   |   NOTIFICATION_PROVIDER   |   String          |   AWS_SNS                                            |                      |
   |   DIRECTION               |   String          |   OUTBOUND                                           |   INBOUND            |
   |   AWS_SNS_TOPIC_ARN       |   String          |   arn:aws:sns:us-east-2:111122223333:myaccount       |                      |
   |   AWS_SNS_ROLE_ARN        |   String          |   arn:aws:iam::111122223333:role/myrole              |                      |
   |   SF_AWS_IAM_USER_ARN     |   String          |   arn:aws:iam::123456789001:user/c_myaccount         |                      |
   |   SF_AWS_EXTERNAL_ID      |   String          |   MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=   |                      |
   +---------------------------+-------------------+------------------------------------------------------+----------------------+
   ```
2. Record the values of the following properties:

   * SF_AWS_IAM_USER_ARN

     ARN for the Snowflake IAM user created for your account. Users in your Snowflake account will assume the
     IAM role you created earlier by submitting the external ID for this
     user using your notification integration.
   * SF_AWS_EXTERNAL_ID

     External ID for the Snowflake IAM user created for your account.

   In the next step, you will update the trust relationship for the IAM role with these values.

Note the DIRECTION property, which indicates the direction of the cloud messaging with respect to Snowflake.

### Modify the trust relationship in the IAM role

1. Log into the AWS Management Console.
2. From the home dashboard, choose Identity & Access Management (IAM):
3. Choose Roles from the left-hand navigation pane.
4. Select the role you created earlier.
5. Select the Trust relationships tab.
6. Select Edit trust relationship.
7. Modify the policy document to use the
   values of the notification integration properties that you recorded earlier.

   **Policy document for IAM role**

   ```json
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<sf_aws_iam_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<sf_aws_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   * `sf_aws_iam_user_arn` is the SF_AWS_IAM_USER_ARN value you recorded.
   * `sf_aws_external_id` is the SF_AWS_EXTERNAL_ID value you recorded.
8. Select Update Trust Policy. The changes are saved.

---
title: Creating an account
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-create.md
section: User Guide
---

# Creating an account

[As the organization administrator](organization-administrators.md), you can create an account through the web interface or
using SQL:

> [Snowsight](ui-snowsight-gs.md):
> :   In the navigation menu, select Admin » Accounts, and then select + Account.
>
> SQL:
> :   Execute a [CREATE ACCOUNT](../sql-reference/sql/create-account.md) command.
>
> > **Note:**
> >
> > For instructions on how to create a Snowflake Open Catalog account, see [Create a Snowflake Open Catalog account](https://other-docs.snowflake.com/en/opencatalog/create-open-catalog-account)

When creating an account, you can specify a [cloud platform](intro-cloud-platforms.md), a
[region](intro-regions.md), and a [Snowflake edition](intro-editions.md). You can optionally specify a region
group if you have, or want to have, accounts in multiple region groups. For more details see [Region groups](admin-account-identifier.md).

If you are having trouble creating or accessing a new account, consider:

* By default, the maximum number of On Demand accounts in an organization is 25. If the organization has a capacity contract,
  the default maximum number of accounts is 100. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to have these limits raised.
* You can only create an account in a region that is enabled for your organization. For a list of available regions,
  see [View a list of regions available for an organization](intro-regions.md). To request access to additional regions, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* It takes about 30 seconds for the DNS changes to propagate before you can access a newly created account. If the account is not
  accessible immediately, wait for approximately 30 seconds and try again.

Each account in your organization can have its own set of users, roles, databases, and warehouses.

You will be billed for usage in all of your accounts on a single bill. To monitor usage for your organization accounts, see [Organization Usage](../sql-reference/organization-usage.md) views.

## Initial user of an account

When you create a new account, you specify the first user of the account, who is assigned the ACCOUNTADMIN role. It is important to specify
whether this initial user is a human user or a service user because this determines whether the user must enroll in
MFA (multi-factor authentication).

### Enforce MFA enrollment on a human ACCOUNTADMIN

If a human directly uses the ACCOUNTADMIN role on your account, you can secure your account by forcing this account administrator to enroll
in MFA during account creation.

Execute the following SQL statement during account creation to specify that a human uses the ACCOUNTADMIN role, and is required to enroll in
MFA:

```sqlsyntax
CREATE ACCOUNT my_admin ADMIN_USER_TYPE = PERSON;
```

### Prevent MFA from being enforced on a non-human ACCOUNTADMIN

If a human does not use the ACCOUNTADMIN role on your account, you must prevent MFA enrollment from being enforced to allow the service that
is using the ACCOUNTADMIN role to run successfully. A service-type ACCOUNTADMIN cannot use passwords to authenticate, and must specify an
[ADMIN_RSA_PUBLIC_KEY](../sql-reference/sql/create-account.md) during account creation.

Execute the following SQL statement during account creation to specify that a service uses the ACCOUNTADMIN role, uses an RSA key to
authenticate, and is not required to enroll in MFA:

```sqlsyntax
CREATE ACCOUNT my_admin
  ADMIN_USER_TYPE = SERVICE
  ADMIN_RSA_PUBLIC_KEY = 'MIIBIj...';
```

---
title: CSA Star Level 1
source: https://docs.snowflake.com/en/user-guide/cert-csa-star-level-1.md
section: User Guide
---

# CSA Star Level 1

This topic describes how Snowflake supports customers with CSA Star Level 1 compliance requirements.

## Understanding CSA Star Level 1 compliance requirements

Cloud Security Alliance (CSA) is a not-for-profit organization with a mission to “promote the use of best practices for providing security
assurance within Cloud Computing, and to provide education on the uses of Cloud Computing to help secure all other forms of computing.”
Snowflake participates in the voluntary CSA Security, Trust & Assurance Registry (STAR) Self-Assessment to document our compliance with
CSA-published best practices. The completed [CSA Consensus Assessments Initiative Questionnaire (CAIQ)](https://cloudsecurityalliance.org/star/registry/snowflake/services/snowflake/) is found on the Cloud Security Alliance website.

---
title: Custom actions for budgets
source: https://docs.snowflake.com/en/user-guide/budgets/custom-actions.md
section: User Guide
---

# Custom actions for budgets

You can configure a budget to automatically call a stored procedure when a spending threshold is reached. This lets you take automated
actions in response to credit consumption, such as suspending warehouses, sending custom alerts, or logging spending events to a table.
Custom actions don’t replace the [notifications](notifications.md) that Snowflake sends when consumption is expected
to exceed your budget limit.

When you define a custom action, you specify whether it calls the stored procedure based on *projected* credit consumption or *actual*
credit consumption, and then set the threshold. When projected or actual consumption reaches the threshold, the stored procedure executes.

## Stored procedure requirements

The stored procedure that is called by a custom action must meet the following requirements:

* The procedure must run with owner’s rights, not caller’s rights. For more information, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* The procedure can’t take more than 30 minutes to complete.
* The procedure can’t have an OUTPUT argument.
* Snowflake retries failed actions once, so design your procedure to handle being called multiple times without causing duplicate or
  unintended effects.
* The procedure’s required arguments must be one of the following data types:

  + [Numeric data types](../../sql-reference/data-types-numeric.md)
  + [String & binary data types](../../sql-reference/data-types-text.md)
  + [Logical data types](../../sql-reference/data-types-logical.md)
  + [Date & time data types](../../sql-reference/data-types-datetime.md)

After you’ve created a stored procedure that meets these requirements, you must grant to the SNOWFLAKE application the USAGE privilege on
the procedure and its parent database/schema. For example, if the fully qualified name of your stored procedure is
`code_db.sch1.alert_team`, run the following commands:

```sqlexample
GRANT USAGE ON DATABASE code_db TO APPLICATION SNOWFLAKE;
GRANT USAGE ON SCHEMA code_db.sch1 TO APPLICATION SNOWFLAKE;
GRANT USAGE ON PROCEDURE code_db.sch1.alert_team(STRING, NUMBER) TO APPLICATION SNOWFLAKE;
```

> **Note:**
>
> If you update the stored procedure after adding it to a custom action, you must re-grant the USAGE privilege on the procedure to the
> SNOWFLAKE application.

## Add a custom action to a budget

You can add multiple custom actions to the account budget or to a custom budget, but you can’t add more than 10 custom actions to the same
budget. A custom action consists of the following components:

* Stored procedure: A reference to the procedure to be called.
* Arguments: An array of arguments to pass to the stored procedure.
* Threshold: The percentage of the budget limit that triggers the custom action (for example, 75%).
* Trigger type: Whether the custom action is triggered based on projected consumption or actual consumption.

To add a custom action to a budget, call the [ADD_CUSTOM_ACTION](../../sql-reference/classes/budget/methods/add_custom_action.md) method on
the budget instance. For example, the following code adds a custom action that calls the `send_email_notification` stored procedure
when spending is forecast to exceed 75% of the budget limit:

```sqlexample
CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, string, string)'),
  ARRAY_CONSTRUCT('admin@example.com', 'Budget Alert', 'Spending at 75% of budget limit'),
  'PROJECTED',
  75);
```

For an end-to-end example that includes creating the stored procedure that is called by the custom action, see Extended example.

## Remove a custom action from a budget

To remove a custom action from a budget, call the [REMOVE_CUSTOM_ACTIONS](../../sql-reference/classes/budget/methods/remove_custom_actions.md)
method on the budget instance. You can use this method to do the following:

* **Remove all custom actions from a budget**. For example:

  ```sqlexample
  CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS();
  ```
* **Remove all custom actions that have a specified threshold**. For example, to remove all custom actions that are triggered when
  consumption reaches 75%, run the following command:

  ```sqlexample
  CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS(75);
  ```
* **Remove a specified custom action from a budget**. For example, to remove the custom action that calls the `my_sp` stored procedure
  when consumption reaches 75%, run the following command:

  ```sqlexample
  CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS(75, 'code_db.sch1.my_sp');
  ```

  > **Tip:**
  >
  > If you are removing a specific action, use the fully qualified procedure name that is returned by the
  > [GET_CUSTOM_ACTIONS](../../sql-reference/classes/budget/methods/get_custom_actions.md) method.

## Extended example

The following example demonstrates how to write a stored procedure called by a custom action, grant the necessary privileges on the
procedure, and then add the custom action to the budget.

1. Create a stored procedure that conforms to all the requirements:

   ```sqlexample-javascript
   CREATE OR REPLACE PROCEDURE code_db.sch1.alert_team(
       integration_name string,
       email_list string,
       email_subject string,
       email_content string)
   RETURNS STRING
   LANGUAGE JAVASCRIPT
   EXECUTE AS OWNER
   AS
   $$
       var sql_command = "CALL SYSTEM$SEND_EMAIL('" + INTEGRATION_NAME + "', " +
                                               "'" +  EMAIL_LIST + "', " +
                                               "'" + EMAIL_SUBJECT + "'," +
                                               "'" + EMAIL_CONTENT + "'" + ");";
       var statement1 = snowflake.createStatement({sqlText: sql_command});
       statement1.execute();
       return "alert sent";
   $$;
   ```
2. Grant privileges on the stored procedure to the SNOWFLAKE application:

   ```sqlexample
   GRANT USAGE ON DATABASE code_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA code_db.sch1 TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE code_db.sch1.alert_team(STRING, STRING, STRING, STRING)
     TO APPLICATION SNOWFLAKE;
   ```
3. Add the custom action to the budget so that it is triggered when consumption reaches 90% of the budget’s spending limit:

   ```sqlexample
   CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
     SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, string, string, string)'),
     ARRAY_CONSTRUCT('my_int', 'admin@example.com', 'Budget Alert', 'Spending at 90% of budget limit'),
     'ACTUAL',
     90);
   ```

## Troubleshooting custom actions

If a custom action is not working as expected, use the following methods to diagnose the issue.

### Monitor custom action execution

Snowflake uses tasks to execute custom actions. These tasks follow the naming convention `BUDGET_CUSTOM_ACTION_TRIGGER_AT_%`. To check the
execution status of all custom action tasks for a budget instance, run the following query:

```sqlexample
SELECT th.*, ci.name AS budget_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY th
  JOIN SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES ci
    ON th.instance_id = ci.id
  WHERE ci.class_name = 'BUDGET'
    AND th.name ILIKE 'BUDGET_CUSTOM_ACTION_TRIGGER_AT_%'
    AND ci.name = '<budget_name>'
  ORDER BY th.completed_time DESC
  LIMIT 10;
```

### View action trigger history

To see which custom actions have been triggered from a specific budget over a time period, run the following query:

```sqlexample
SELECT th.*, ci.name as budget_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY th
  JOIN SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES ci
    ON th.instance_id = ci.id
  WHERE ci.class_name = 'BUDGET'
    AND th.name ILIKE 'BUDGET_CUSTOM_ACTION_TRIGGER_AT_%'
    AND ci.name = '<budget_name>'
    AND th.COMPLETED_TIME >= DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY th.completed_time DESC;
```

To check the trigger history for a specific custom action, first get the action ID by calling the
[GET_CUSTOM_ACTIONS](../../sql-reference/classes/budget/methods/get_custom_actions.md) method:

```sqlexample
CALL <budget_name>!GET_CUSTOM_ACTIONS();
```

Then use the action ID in the following query:

```sqlexample
SELECT th.*, ci.name AS budget_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY th
  JOIN SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES ci
    ON th.instance_id = ci.id
  WHERE ci.class_name = 'BUDGET'
    AND th.name ILIKE 'BUDGET_CUSTOM_ACTION_TRIGGER_AT_%'
    AND th.query_text ILIKE '%<action_id>%'
    AND ci.name = '<budget_name>'
    AND th.COMPLETED_TIME >= DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY th.completed_time DESC;
```

### Troubleshoot actions that aren’t triggering

If a custom action is not triggering when expected, check for the following common issues. Assume your custom budget is
`budget_db.sch1.my_budget`.

**Stored procedure or privileges changed**

Verify that the stored procedure called by the custom action is still valid and that the SNOWFLAKE application still has necessary
privileges. Call the [CONFIRM_CUSTOM_ACTIONS_ACCESS](../../sql-reference/classes/budget/methods/confirm_custom_actions_access.md) method to
validate the stored procedure and access control privileges:

```sqlexample
CALL budget_db.sch1.my_budget!CONFIRM_CUSTOM_ACTIONS_ACCESS();
```

**Budget is not activated**

For account budgets only, verify that the budget is activated by calling the
[GET_CONFIG](../../sql-reference/classes/budget/methods/get_config.md) method and checking the `is_active` field.

```sqlexample
CALL budget_db.sch1.my_budget!GET_CONFIG();
```

**Budget has no spending limit**

Custom actions won’t trigger if the budget doesn’t have a spending limit configured. Check the spending limit:

```sqlexample
CALL budget_db.sch1.my_budget!GET_SPENDING_LIMIT();
```

**Budget is not tracking any resources**

Verify that the budget is tracking resources by checking the spending history:

```sqlexample
CALL budget_db.sch1.my_budget!GET_SPENDING_HISTORY();
```

**Custom action has recently triggered**

To prevent excessive triggering, Snowflake limits how frequently a custom action can execute:

* If the custom action runs when credit consumption is projected to reach a spending threshold, the stored procedure won’t be called more
  than once a day.
* If the custom action runs when credit consumption reaches a limit on actual spending, the stored procedure won’t be called more than once
  a month.

Check the `LAST_TRIGGER_ATTEMPT_TIME` field by calling the
[GET_CUSTOM_ACTIONS](../../sql-reference/classes/budget/methods/get_custom_actions.md) method.

---
title: Custom budgets
source: https://docs.snowflake.com/en/user-guide/budgets/custom-budget.md
section: User Guide
---

# Custom budgets

Custom budgets let you monitor compute costs for a custom group of objects. You can specify which objects you want to monitor in two ways:

* Add a tag to the budget. All objects that have the specified tag/value pair are monitored by the budget.
* Add each object to the budget individually.

The same budget can track objects added individually and added using tags. If an object is included in the budget for more than one reason (for
example, it was added individually and has the specified tag/value pair), its credit usage counts only once against the budget’s
spending limit.

When you add an object to a custom budget, the budget monitors all compute costs for the object, including background
maintenance operations and serverless features. For example, if you add a table to a custom budget, and the table has automatic
clustering enabled, the budget monitors credit usage for the background maintenance for automatic clustering.

## Using tags to monitor objects

[Tags](../object-tagging/introduction.md) can be applied to budgets to monitor credit usage by objects that belong to a logical
unit within the account. Suppose you use the `cost_center` tag to track costs incurred by cost centers within the organization. You
might tag all objects attributed to the sales team with the tag/value pair `cost_center = 'sales'`. Rather than individually add
each object used by the sales team to a budget, you could simply add the tag/value pair `cost_center = 'sales'`, and the budget
will automatically monitor credit usage of all objects that have been assigned that tag/value pair.

### Tag inheritance

Adding a tag to a budget tracks all objects with that tag, including objects that have inherited the tag from a parent object. For example,
if a database has a tag, then tables within the database inherit the tag and will be tracked by a budget. Because a budget tracks usage
based on a tag/value pair, if you override the value of the tag at the table-level, it might change whether the budget tracks usage
associated with the table. For example, suppose you have a budget that tracks objects with tag `team = 'eng'`. If the database has the
tag `team = 'eng'`, but a table within the database has tag `team = 'IT'`, the budget won’t monitor costs associated with that table.

In the context of budgets, tags are not inherited from an account because the account budget is intended to fulfill that use case.

For more information, including how tag values are overridden, see [Tag inheritance](../object-tagging/inheritance.md).

### Tracking an object with multiple budgets

Multiple budgets can add the same tag/value pair, which means more than budget can track credit usage of the same object. For
example, suppose you add the tag `cost_center = 'eng'` to both `budget_1` and `budget_2`. As a warehouse with tag
`cost_center = 'eng'` consumes credits, it will count toward the credit limit of both `budget_1` and `budget_2`.

An object can also be tracked by more than one budget if the object has multiple tags. For example, suppose a warehouse
has two tags: `cost_center = 'finance'` and `stage = 'dev'`. You could create one budget that tracks `cost_center = 'finance'` and
another that tracks `stage = 'dev'`. Credits consumed by the warehouse would count toward the credit limit of both budgets.

### Limitations and considerations

When using tags to monitor objects, keep the following in mind:

* When you change a tag on an object, it can take up to six hours for the change to be reflected in budgets that use tags.
* Currently, alerts cannot be monitored with tags. You must add them individually.
* Changes to tags within the first two days of the month are reflected in the prior month’s usage.

## Supported objects for custom budgets

You can create a custom budget to monitor the following types of Snowflake objects:

| Object | Monitored costs |
| --- | --- |
| Alerts | Serverless alerts are monitored by the account budget. To monitor the credit usage for an alert that executes using a user-managed warehouse, you must add the warehouse to the budget. For more information about the costs of alerts, see [Understanding the costs of alerts](../alerts.md). |
| Apps . (Snowflake Native Apps) | The behavior of budgets for objects that are created and owned by an Snowflake Native App depends on whether you add the app directly or by adding a tag.   * When you add a Snowflake Native App to a budget using tags, only warehouses that have the matching tag/value combination are tracked   automatically, regardless of whether they are shared. * When you add a Snowflake Native App to a budget directly, all objects that consume credits and are created and owned by the app are   added to the budget automatically. This includes warehouses and Snowpark Container Services compute pools that are owned by   the app. Warehouses and compute pools that are shared are not tracked by the budget automatically, although you can   add these manually.  You cannot add objects created and owned by an app to a separate budget. You can add warehouses and compute pools   that are shared to a separate budget.  To determine if a warehouse or compute pool is owned by an app, check the following:    + For warehouses, run the [SHOW WAREHOUSES](../../sql-reference/sql/show-warehouses.md) command. If the value in the `owner_role_type` column     is `APPLICATION`, the warehouse is owned by a Snowflake Native App.   + For compute pools, run the [SHOW COMPUTE POOLS](../../sql-reference/sql/show-compute-pools.md) command. If the value in the `application`     column is not NULL, the compute pool is owned by a Snowflake Native App. |
| Compute pool | Compute pool usage for Snowpark Container Services. For more information, see [Compute pool cost](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). |
| Databases | When you add a database to a budget, all supported objects that the database contains are also automatically added. The budget monitors credit usage for the following objects and serverless features:   * Supported schema objects as described above. * Replication for secondary (replica) databases.  **Note:** Replication costs for secondary databases that are replicated in a replication or failover group can only be   monitored by the account budget. |
| Materialized views | Background maintenance for the materialized view. For more information, see [Materialized Views Cost](../views-materialized.md). |
| Schemas | When you add a schema to a budget, all supported objects that the schema contains are also automatically added. The budget monitors the credit usage for schema objects as described above. |
| Pipes | Resource consumption for loading data using Snowpipe. For more information, see [Snowpipe costs](../data-load-snowpipe-billing.md). |
| Tables | Background maintenance operations for [automatic clustering](../tables-auto-reclustering.md) and [search optimization](../search-optimization-service.md) if they are enabled on the table. |
| Tasks | Serverless tasks are monitored by a custom budget. To monitor the credit usage for a task that executes using a user-managed warehouse, you must add the warehouse to the budget. For more information, see [Task costs](../tasks-intro.md). |
| Warehouses | Compute resources for query execution, web interface, and other features (see [Virtual warehouse credit usage](../cost-understanding-compute.md)), serverless tasks, and [cloud services compute](../cost-understanding-compute.md). |

For more information, see Add or remove tags from a custom budget.

## Create a custom budget

The next sections explain how to create a custom budget:

* Create a custom role to create budgets
* Use Snowsight to create a custom budget
* Use SQL commands to create a custom budget

You can create a custom budget using Snowsight or by executing SQL statements.

### Create a custom role to create budgets

You can use a custom role to create budgets in your account. For a full list of privileges and roles that must be granted to a
role to create a custom budget, see [Budgets roles and privileges](../budgets.md).

The following example creates a role named `budget_owner` role and grants the required role and privileges to create custom
budgets in the schema `budgets_db.budgets_schema`. The example must be executed using the ACCOUNTADMIN role:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE budget_owner;

GRANT USAGE ON DATABASE budgets_db TO ROLE budget_owner;
GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO ROLE budget_owner;

GRANT DATABASE ROLE SNOWFLAKE.BUDGET_CREATOR TO ROLE budget_owner;

GRANT CREATE SNOWFLAKE.CORE.BUDGET ON SCHEMA budgets_db.budgets_schema
  TO ROLE budget_owner;
```

If you want to enable a role other than the budget owner to modify a custom budget’s settings, you can create a custom role with
modify privileges. For more information, see Create a custom role to manage a custom budget.

### Use Snowsight to create a custom budget

> **Note:**
>
> If the account budget is not [activated](account-budget.md) or has
> been deactivated, you can’t use Snowsight to create custom budgets. However, you
> can create custom budgets using SQL.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select the Budgets tab.
4. Select + Budget.
5. On the Basic Information page, complete the following steps:

   1. From the Location to store drop-down, select the name of the database and schema where you want to create the budget.
   2. In the Name field, specify the name of the custom budget.
   3. In the Budget (credits per month) field, specify the spending limit of the budget.
   4. To decrease the [budget refresh interval](../budgets.md) so you can watch spending more closely, select
      Enable low latency budget.
   5. In the Threshold field, specify a percentage of the budget limit. Notifications are sent when Snowflake determines that
      spending will exceed this percentage of the budget limit.
   6. In the Notify field, enter email addresses to receive notification emails.

      > **Note:**
      >
      > Each email address added for budget notifications must be [verified](../notifications/email-notifications.md). The
      > notification email setup fails if any email address in the list is *not* verified.
   7. Select Next.
6. On the Budget scope page, add the objects that you want to add to the custom budget.

   * If you are using tags to track consumption by objects, compete the following steps:

     1. Select the Tags on resources drop-down list.
     2. Find the appropriate tag, then expand it and select one or more values.
     3. Select Done.
   * If you are adding individual objects to the budget, complete the following steps:

     1. Select the Resources drop-down list.
     2. Select one or more objects.
     3. Select Done.

        > **Note:**
        >
        > If you are directly adding individual objects, you can only add an object to one custom budget. In this case, if an object is currently
        > included in one custom budget and you add that object to a second custom budget, Budgets removes the object from the first custom budget
        > without issuing a warning.
        >
        > This behavior does not apply to using tags to add objects to budgets; an object with one or more tags can be
        > included in multiple custom budgets if you are using tags to add the object to the budgets.
7. Select Create.

After you create and set up a custom budget, you can create a custom role to enable non-account administrators to monitor budget resources
and usage. For more information, see [Create a custom role to monitor a custom budget](monitor.md).

### Use SQL commands to create a custom budget

Create a custom budget and then set the spending limit and notification email addresses.

> **Note:**
>
> * To create a custom budget, you must use a role with the
>   required privileges to create a budget.
> * To modify a custom budget, you must use a role with the
>   required privileges to modify a budget.

1. Review the existing budgets in your account:

   > **Note:**
   >
   > The following statement returns the budgets for which you have access privileges. Only a user with the ACCOUNTADMIN role
   > can see all the budgets in the account.

   ```sqlexample
   SELECT SYSTEM$SHOW_BUDGETS_IN_ACCOUNT();
   ```
2. Create budget `my_budget` in `budgets_db.budgets_schema` using the
   [CREATE BUDGET](../../sql-reference/classes/budget/commands/create-budget.md) command:

   ```sqlexample
   USE SCHEMA budgets_db.budgets_schema;

   CREATE SNOWFLAKE.CORE.BUDGET my_budget();
   ```
3. Set the monthly spending limit. For example, set the spending limit to 500 credits per month:

   ```sqlexample
   CALL my_budget!SET_SPENDING_LIMIT(500);
   ```
4. Set up notifications for the budget so that you receive notifications when your credit usage is expected to exceed your
   spending limits.

   See [Notifications for budgets](notifications.md).

After you create and set up a custom budget, you can create a custom role to enable non-account administrators to monitor budget
resources and usage. For more information, see [Create a custom role to monitor a custom budget](monitor.md).

To add objects to your new budget, see Add or remove objects from a custom budget.

## Create a custom role to manage a custom budget

To monitor and modify a custom budget, you can grant privileges and instance roles to a custom role. For a full list of privileges
and roles that must be granted to a role to modify a custom budget, see [Budgets roles and privileges](../budgets.md).

### Custom role example

Grant the custom role `budget_admin` the ability to monitor and modify the budget `my_budget` in schema
`budgets_db.budgets_schema`:

> **Note:**
>
> You need OWNERSHIP privilege on the custom budget to execute the following examples.

* Grant the required privileges and instance role to custom role `budget_admin` for budget `my_budget` in schema
  `budgets_db.budgets_schema`:

  > ```sqlexample
  > GRANT USAGE ON DATABASE budgets_db TO ROLE budget_admin;
  >
  > GRANT USAGE ON SCHEMA budget_db.budgets_schema TO ROLE budget_admin;
  >
  > GRANT SNOWFLAKE.CORE.BUDGET ROLE budgets_db.budgets_schema.my_budget!ADMIN
  >    TO ROLE budget_admin;
  >
  > GRANT DATABASE ROLE SNOWFLAKE.USAGE_VIEWER TO ROLE budget_admin;
  > ```
* Grant the APPLYBUDGET privilege on objects and tags to be added to or removed from a custom budget. This step is required for each object
  or tag to be added or removed.

  For example, to enable the role `budget_admin` to add database `db1` to custom budget `my_budget`,
  execute the following statements:

  ```sqlexample
  GRANT USAGE ON DATABASE db1 TO ROLE budget_admin;

  GRANT APPLYBUDGET ON DATABASE db1 TO ROLE budget_admin;
  ```

## Add or remove tags from a custom budget

You can add or remove tags from a custom budget using Snowsight or SQL. Each tag added to the budget includes one or more values
for the tag.

> **Note:**
>
> To add or remove tags from a custom budget, you must use a role with the required privileges on the budget and the tag. For more
> information, see Create a custom role to manage a custom budget.

### Use Snowsight to add or remove tags from a custom budget

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. Select the budget to edit.
5. In the dashboard, select  (edit icon).
6. Select + Tags & resources.
7. Expand Tags and navigate to the tag you want to add.
8. Do one of the following:

   * If the tag has a [list of allowed values](../object-tagging/work.md), select one or more of the values.
   * If the tag can be set to any value, specify the value.
9. Select Done.

> **Note:**
>
> When adding tags to the budget in Snowsight, keep the following in mind:
>
> * A tag must be applied to at least one object before you can add it to a budget.
> * It can take up to two hours for a tag to appear after adding it to an object.

### Use SQL commands to add or remove tags from a custom budget

The role used to add or remove an tag from a budget must have the APPLYBUDGET privilege on the tag. For more information, see the examples
in the Create a custom role to manage a custom budget section.

To review the list of tags already in the custom budget, call the budget’s
[<budget_name>!GET_RESOURCE_TAGS](../../sql-reference/classes/budget/methods/get_resource_tags.md) method. For example, to see the list of tags in the budget
`my_budget` in the `budgets_db.budgets_schema` schema, execute the following statement:

```sqlexample
CALL budgets_db.budgets_schema.my_budget!GET_RESOURCE_TAGS();
```

Tags must be added to or removed from a budget by [reference](../../sql-reference/references.md).

1. You can add tag `cost_mgmt_db.tags.cost_center` to budget `my_budget` by using the following steps:

   1. Grant the APPLYBUDGET privilege on the tag to the role `budget_admin` by executing the following statement:

      ```sqlexample
      GRANT APPLYBUDGET ON TAG cost_center TO ROLE budget_admin;
      ```
   2. Pass a reference for tag `cost_center` to the [ADD_RESOURCE_TAG](../../sql-reference/classes/budget/methods/add_resource_tag.md) instance
      method by executing the following statement. The value of the tag is set to `finance`.

      ```sqlexample
      CALL budgets_db.budgets_schema.my_budget!ADD_RESOURCE_TAG(
         SELECT SYSTEM$REFERENCE('TAG',
            'cost_mgmt_db.tags.cost_center',
            'SESSION',
            'applybudget'),
            'finance');
      ```

      The [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) function creates a reference for the tag `cost_center`, with the
      APPLYBUDGET privilege granted on the tag. This enables the budget to monitor the objects in your account that have the specified
      tag/value pair in your account. The third parameter to the function specifies the scope for the reference; in this
      case, ‘SESSION’ creates a reference with session scope. References passed to the ADD_RESOURCE_TAG method for a budget can be created
      with any transient reference scope (that is, the third parameter can be either ‘SESSION’ or ‘CALL’).
2. You can remove the tag `cost_center` from the budget `my_budget` by using the following steps:

   1. Grant the APPLYBUDGET privilege on the database to the role `budget_admin` by executing the following statement:

      ```sqlexample
      GRANT APPLYBUDGET ON TAG cost_center TO ROLE budget_admin;
      ```
   2. Remove the tag by passing a reference to the [REMOVE_RESOURCE_TAG](../../sql-reference/classes/budget/methods/remove_resource_tag.md)
      instance method:

      ```sqlexample
      CALL budgets_db.budgets_schema.my_budget!REMOVE_RESOURCE_TAG(
         SELECT SYSTEM$REFERENCE('TAG',
            'cost_mgmt_db.tags.cost_center',
            'SESSION',
            'applybudget'),
            'finance');
      ```

## Add or remove objects from a custom budget

You can add or remove objects from a custom budget using Snowsight or SQL.

> **Note:**
>
> To add or remove objects from a custom budget, you must use a role with the required privileges on the budget and the object. For more
> information, see Create a custom role to manage a custom budget.

### Use Snowsight to add or remove objects from a custom budget

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. Select the budget to edit.
5. In the dashboard, select  (edit icon).
6. Select + Tags & resources, then select the objects you want to add to the custom budget.

   > **Note:**
   > * When you select a database or schema, all supported objects (for example, tables)
   >   contained within the database or schema are also added to the budget.
   > * If you are directly adding individual objects, you can only add an object to one custom budget. In this case, if an object is currently
   >   included in one custom budget and you add that object to a second custom budget, Budgets removes the object from the first custom budget
   >   without issuing a warning.
   >
   >   This behavior does not apply to using tags to add objects to budgets; an object with one or more tags can be
   >   included in multiple custom budgets if you are using tags to add the object to the budgets.
7. Select Done.

### Use SQL commands to add or remove objects from a custom budget

The role used to add or remove an object from a budget must have the APPLYBUDGET privilege on the object. For more information, see
the examples in the Create a custom role to manage a custom budget section.

To review the list of objects already in the custom budget, call the budget’s
[<budget_name>!GET_LINKED_RESOURCES](../../sql-reference/classes/budget/methods/get_linked_resources.md) method. For example, to see the list of objects in the budget
`my_budget` in the `budgets_db.budgets_schema` schema, execute the following statement:

```sqlexample
CALL budgets_db.budgets_schema.my_budget!GET_LINKED_RESOURCES();
```

The statement returns the following output:

```output
+-------------+-----------------+-----------+-------------+---------------+
| RESOURCE_ID | NAME            | DOMAIN    | SCHEMA_NAME | DATABASE_NAME |
|-------------+-----------------+-----------+-------------+---------------|
|         326 | DB1             | DATABASE  | NULL        | NULL          |
|         157 | MY_WH           | WAREHOUSE | NULL        | NULL          |
+-------------+-----------------+-----------+-------------+---------------+
```

> **Note:**
>
> The list does not include:
>
> * Objects that were added automatically (for example, compute pools and warehouses created and owned by a Snowflake Native App).
> * Objects that were added when a tag was added to the budget.

Objects must be added to or removed from a budget by [reference](../../sql-reference/references.md).

1. You can add table `t1` to budget `my_budget` by using the following steps:

   1. Grant the APPLYBUDGET privilege on the table to the role `budget_admin` by executing the following statement:

      ```sqlexample
      GRANT APPLYBUDGET ON TABLE t1 TO ROLE budget_admin;
      ```
   2. Pass a reference for table `t1` to the [ADD_RESOURCE](../../sql-reference/classes/budget/methods/add_resource.md) instance
      method by executing the following statement:

      ```sqlexample
      CALL budgets_db.budgets_schema.my_budget!ADD_RESOURCE(
         SELECT SYSTEM$REFERENCE('TABLE', 't1', 'SESSION', 'applybudget'));
      ```

      The [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) function creates a reference for a TABLE object, `t1`, with the
      APPLYBUDGET privilege granted on the table. This enables the budget to monitor the specified object in your account. The
      third parameter to the function specifies the scope for the reference; in this
      case, ‘SESSION’ creates a reference with session scope. References passed to the ADD_RESOURCE method for a budget can be created
      with any transient reference scope (that is, the third parameter can be either ‘SESSION’ or ‘CALL’).

      > **Note:**
      >
      > If you want to add a Snowflake Native App to a budget, when you call SYSTEM$REFERENCE, specify `'DATABASE'` (not `'APPLICATION'`)
      > for the `object_type` argument.

      For a full list of objects and privileges, see [Supported object types and privileges for references](../../sql-reference/references.md).

      > **Note:**
      >
      > If you are directly adding individual objects, you can only add an object to one custom budget. In this case, if an object is currently
      > included in one custom budget and you add that object to a second custom budget, Budgets removes the object from the first custom budget
      > without issuing a warning.
      >
      > This behavior does not apply to using tags to add objects to budgets; an object with one or more tags can be
      > included in multiple custom budgets if you are using tags to add the object to the budgets.
2. You can remove the database `db1` from the budget `my_budget` by using the following steps:

   1. Grant the APPLYBUDGET privilege on the database to the role `budget_admin` by executing the following statement:

      ```sqlexample
      GRANT APPLYBUDGET ON DATABASE db1 TO ROLE budget_admin;
      ```
   2. Remove the database by passing a reference to the [REMOVE_RESOURCE](../../sql-reference/classes/budget/methods/remove_resource.md)
      instance method:

      ```sqlexample
      CALL budgets_db.budgets_schema.my_budget!REMOVE_RESOURCE(
         SELECT SYSTEM$REFERENCE('DATABASE', 'db1', 'SESSION', 'applybudget'));
      ```

---
title: Custom data metric functions
source: https://docs.snowflake.com/en/user-guide/data-quality-custom-dmfs.md
section: User Guide
---

# Custom data metric functions

If there isn’t a [system data quality metric function (DMF)](data-quality-system-dmfs.md) that can perform your data quality
checks, then you can use the [CREATE DATA METRIC FUNCTION](../sql-reference/sql/create-data-metric-function.md) command to create your own DMF.

## Create a custom DMF

The following examples demonstrate how to use the [CREATE DATA METRIC FUNCTION](../sql-reference/sql/create-data-metric-function.md) command to create a custom DMF.

Example: User-defined DMF with single table argument
:   Create a DMF that calls the [COUNT](../sql-reference/functions/count.md) function to return the total number of rows that
    have positive numbers in three columns of the table:

    ```sqlexample
    CREATE OR REPLACE DATA METRIC FUNCTION governance.dmfs.count_positive_numbers(
      arg_t TABLE(
        arg_c1 NUMBER,
        arg_c2 NUMBER,
        arg_c3 NUMBER
      )
    )
    RETURNS NUMBER
    AS
    $$
      SELECT
        COUNT(*)
      FROM arg_t
      WHERE
        arg_c1>0
        AND arg_c2>0
        AND arg_c3>0
    $$;
    ```

Example: Using multiple table arguments to perform referential checks
:   A user-defined DMF can have more than one argument that accepts a table. When you add the DMF to a table, that table is used as the first
    argument. If there is an additional argument that accepts a table, you must also specify the fully qualified name of the second table. This
    capability simplifies referential integrity, matching and comparison, or conditional checking across different datasets.

    Suppose you want to validate referential integrity as defined by a primary key/foreign key relationship. In this case, you can create a
    DMF to validate that all records in a source table have corresponding records in the referenced table. The following user-defined DMF
    returns the number of records where the value of a column in one table does not have a corresponding value in the column of another table:

    ```sqlexample
    CREATE OR REPLACE DATA METRIC FUNCTION governance.dmfs.referential_check(
      arg_t1 TABLE (arg_c1 INT), arg_t2 TABLE (arg_c2 INT))
    RETURNS NUMBER AS
     'SELECT COUNT(*) FROM arg_t1
      WHERE arg_c1 NOT IN (SELECT arg_c2 FROM arg_t2)';
    ```

    Now suppose you want to check whether every order, as identified by its `sp_id`, in the `salesorders` table maps back to an `sp_id`
    in the `salespeople` table. You can add the DMF to the `salesorders` table while specifying the `salespeople` table as the other
    table argument.

    ```sqlexample
    ALTER TABLE salesorders
      ADD DATA METRIC FUNCTION governance.dmfs.referential_check
        ON (sp_id, TABLE (my_db.sch1.salespeople(sp_id)));
    ```

    The output returns the number of rows in the `salesorders` table that have a value in the `sp_id` column that doesn’t appear in the
    `sp_id` column of the `salespeople` table. A value greater than 0 indicates that there are `sp_id` values in `salesorders` that
    don’t map to records in `salespeople`.

## Test a custom DMF

You can execute a custom DMF manually in order to test it before associating it with one or more tables. For more information, see
[Call a DMF manually](data-quality-working.md).

## Secure the custom DMF

You can use the ALTER FUNCTION command to make a DMF secure. For more information about what it means for a function to be secure, see
[Protecting Sensitive Information with Secure UDFs and Stored Procedures](../developer-guide/secure-udf-procedure.md).

```sqlexample
ALTER FUNCTION governance.dmfs.count_positive_numbers(
 TABLE(
   NUMBER,
   NUMBER,
   NUMBER
))
SET SECURE;
```

## View the properties of a DMF

Describe the DMF to view its properties:

```sqlexample
DESC FUNCTION governance.dmfs.count_positive_numbers(
  TABLE(
    NUMBER, NUMBER, NUMBER
  )
);
```

```output
+-----------+---------------------------------------------------------------------+
| property  | value                                                               |
+-----------+---------------------------------------------------------------------+
| signature | (ARG_T TABLE(ARG_C1 NUMBER, ARG_C2 NUMBER, ARG_C3 NUMBER))          |
| returns   | NUMBER(38,0)                                                        |
| language  | SQL                                                                 |
| body      | SELECT COUNT(*) FROM arg_t WHERE arg_c1>0 AND arg_c2>0 AND arg_c3>0 |
+-----------+---------------------------------------------------------------------+
```

## Set a tag on a custom DMF

Use the [ALTER FUNCTION](../sql-reference/sql/alter-function.md) command to set a tag on a DMF:

```sqlexample
ALTER FUNCTION governance.dmfs.count_positive_numbers(
  TABLE(NUMBER, NUMBER, NUMBER))
  SET TAG governance.tags.quality = 'counts';
```

## Drop a custom DMF

You can use the [DROP FUNCTION](../sql-reference/sql/drop-function.md) command to remove a custom data metric function from the system.

> **Note:**
>
> You cannot drop a custom DMF from the system while it is still associated with a table or view. Use the [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/functions/data_metric_function_references.md) function to identify the tables and views that have a data metric function set on them.
>
> For information about removing DMF associations from a table or view, see [Drop a DMF from an object](data-quality-working.md).

Drop a custom DMF from the system:

```sqlexample
DROP FUNCTION governance.dmfs.count_positive_numbers(
  TABLE(
    NUMBER, NUMBER, NUMBER
  )
);
```

---
title: Custom SCIM integration with Snowflake
source: https://docs.snowflake.com/en/user-guide/scim-custom.md
section: User Guide
---

# Custom SCIM integration with Snowflake

Custom SCIM integrations allow users to build their own applications to interface with their identity provider to provision, map, and manage users and roles to Snowflake.

Currently, Custom SCIM integrations are supported for identity providers that are neither Okta nor Microsoft Entra ID.

After creating your SCIM application, follow the procedure below to create a Snowflake Security Integration and generate a SCIM API
authorization token. Save the authorization token and include it in the SCIM API request header as described in
[SCIM API references](scim-api-references.md).

## Limitations

* Snowflake supports a maximum of 500 concurrent requests per account per SCIM endpoint (e.g. the `/Users` endpoint, the `/Groups` endpoint). After your account exceeds this threshold, Snowflake returns a `429` HTTP status code (i.e. too many requests). Note that this request limit usually only occurs during the initial provisioning when relatively large numbers of requests (i.e. more than 10 thousand) occur to provision users or groups.
* If your Snowflake [account URL](admin-account-identifier.md) was created with underscores, you can access your Snowflake
  account with the account URL having underscores or hyphens.

  If your SCIM provider reuses the same account URL for both SAML SSO and SCIM, then URLs with underscores are not supported. Therefore,
  use the hyphenated account URL to configure SCIM.

  Snowflake account URLs that do not contain underscores are not restricted by this limitation.
* A custom SCIM integration may or may not allow the provisioning and management of nested groups. Before attempting to use a custom SCIM integration to provision nested groups in Snowflake, please contact your identity provider to determine whether nested groups can be used with a SCIM integration.
* If you are using private connectivity to the Snowflake service to access Snowflake, ensure that you are not entering these URLs in the
  integration settings. Enter the public endpoint (i.e. without `.privatelink`), and ensure that the network policy allows access
  from the IdP IP address, otherwise you cannot use this integration.

  + Note that if your IdP and Snowflake account are both hosted in Microsoft Azure, it is necessary that the [network policy](network-policies.md) in Snowflake allows access from all of the Microsoft Azure IP addresses for the [Public Cloud](https://www.microsoft.com/en-us/download/details.aspx?id=56519) or the [US Government Cloud](https://www.microsoft.com/en-us/download/details.aspx?id=57063). Currently, all Microsoft Azure IP addresses are required in the network policy. For more information, see Managing SCIM Network Policies.
* Enabling or disabling password synchronization from a custom identity provider to Snowflake.

  Setting the `SYNC_PASSWORD` property in the Snowflake security integration is only supported for Okta SCIM integrations.

## Prerequisites

Before provisioning users or groups, ensure that the [network policy](network-policies.md) in Snowflake allows access from the IP ranges that correspond to your organization. For more information, see Managing SCIM Network Policies.

## Create a custom SCIM security integration and API token

The Snowflake configuration process creates a SCIM security integration to allow users and roles created in the identity provider to be owned by the GENERIC_SCIM_PROVISIONER SCIM role in Snowflake and creates an access token to use in SCIM API requests. The access token is valid for six months. Upon expiration, create a new access token manually using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md) as shown below.

> **Note:**
>
> To invalidate an existing access token for a SCIM integration, execute a [DROP INTEGRATION](../sql-reference/sql/drop-integration.md) statement.
>
> To continue using SCIM with Snowflake, recreate the SCIM integration with a [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md) statement and generate a new access token using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md).

Execute the following SQL statements in your preferred Snowflake client. Each of the following statements is explained below.

```sqlexample
use role accountadmin;
create role if not exists generic_scim_provisioner;
grant create user on account to role generic_scim_provisioner;
grant create role on account to role generic_scim_provisioner;
grant role generic_scim_provisioner to role accountadmin;
create or replace security integration generic_scim_provisioning
    type=scim
    scim_client='generic'
    run_as_role='GENERIC_SCIM_PROVISIONER';
select system$generate_scim_access_token('GENERIC_SCIM_PROVISIONING');
```

> **Important:**
>
> The example SQL statements use the ACCOUNTADMIN system role and the GENERIC_SCIM_PROVISIONER custom role is granted to the ACCOUNTADMIN role.
>
> It is possible not to use the ACCOUNTADMIN role in favor of a less-privileged role. Using a less-privileged role can help to address compliance concerns relating to least-privileged access, however, using a less-privileged role can result in unexpected errors during the SCIM configuration and management process.
>
> These errors could be the result of the less-privileged role not having sufficient rights to manage all of the roles through SCIM due to how the roles are created and the resultant role hierarchy. Therefore, in an effort to avoid errors in the configuration and management processes, choose one of the following options:
>
> 1. Use the ACCOUNTADMIN role as shown in the example SQL statements.
> 2. Use a role with the global MANAGE GRANTS privilege.
> 3. If neither of these first two options are desirable, use a custom role that has the OWNERSHIP privilege on all of the roles that will be managed using SCIM.

1. Use the ACCOUNTADMIN role.

   > ```sqlexample
   > use role accountadmin;
   > ```
2. Create the custom role GENERIC_SCIM_PROVISIONER. All users and roles in Snowflake created by the IdP will be owned by the scoped down GENERIC_SCIM_PROVISIONER role.

   > ```sqlexample
   > create role if not exists generic_scim_provisioner;
   > grant create user on account to role generic_scim_provisioner;
   > grant create role on account to role generic_scim_provisioner;
   > ```
3. Let the ACCOUNTADMIN role create the security integration using the GENERIC_SCIM_PROVISIONER custom role. For more information, see [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md).

   > ```sqlexample
   > grant role generic_scim_provisioner to role accountadmin;
   > create or replace security integration generic_scim_provisioning
   >     type = scim
   >     scim_client = 'generic'
   >     run_as_role = 'GENERIC_SCIM_PROVISIONER';
   > ```
4. Create and save the authorization token and store securely for later use. Use this token for each SCIM REST API request and place it in the request header. The access token expires after six months and a new access token can be generated with this statement.

   > ```sqlexample
   > select system$generate_scim_access_token('GENERIC_SCIM_PROVISIONING');
   > ```

## Enabling Snowflake-initiated SSO

The SCIM provisioning process does not automatically enable single sign-on (SSO).

To use SSO after the SCIM provisioning process is complete, enable
[Snowflake-initiated SSO](admin-security-fed-auth-security-integration.md).

## Managing SCIM network policies

Applying a network policy to a SCIM security integration allows the SCIM network policy to be distinct from network policies that apply to the entire Snowflake account.
It allows the SCIM provider to provision users and groups without adding IP addresses to a network policy that controls access for normal users.

A network policy applied to a SCIM integration overrides a network policy applied to the entire Snowflake account.

After creating the SCIM security integration, create the SCIM network policy using this command:

> ```sqlsyntax
> alter security integration generic_scim_provisioning set network_policy = <scim_network_policy>;
> ```

To unset the SCIM network policy, use this command:

> ```sqlexample
> alter security integration generic_scim_provisioning unset network_policy;
> ```

Where:

`generic_scim_provisioning`
:   Specifies the name of the Custom SCIM security integration.

`scim_network_policy`
:   Specifies the Custom SCIM network policy in Snowflake.

For more information, see [Controlling network traffic with network policies](network-policies.md) and [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-scim.md).

## Using secondary roles with SCIM

Snowflake supports setting the [user](../sql-reference/sql/create-user.md) property `DEFAULT_SECONDARY_ROLES` to `'ALL'` with
SCIM to allow users to use [secondary roles](security-access-control-overview.md) in a Snowflake session.

For a representative example, see [Update a user](scim-user-api-reference.md).

## Populating Snowflake tags with SCIM integrations

You can populate tags by using the `snowflakeTags` attribute when you ingest user information into the SCIM security integration. The exact request input can be found in [Create a user](scim-user-api-reference.md).

To enable support for this feature:

* Create the tag before you run the SCIM integration.
* Grant proper privileges on each tag and tag schema to the GENERIC_SCIM_PROVISIONER role.

Here is an example of creating a tag and assigning the proper role privileges:

```sqlexample
-- Create the tag.
CREATE TAG my_database_name.my_schema_name.my_tag_name;

-- Assign the proper privileges to the SCIM integration.
GRANT USAGE ON SCHEMA my_database_name.my_schema_name TO ROLE GENERIC_SCIM_PROVISIONER;
GRANT APPLY ON TAG my_database_name.my_schema_name.my_tag_name TO ROLE GENERIC_SCIM_PROVISIONER;
```

You must grant USAGE ON SCHEMA and APPLY ON TAG to all tags and tag schemas that you plan to assign through your SCIM security integration.

## Replicating the custom SCIM security integration

Snowflake supports replication and failover/failback with the SCIM security integration from the source account to the target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

**Next topics:**

* [SCIM API references](scim-api-references.md)

---
title: Cycle-start actions for budgets
source: https://docs.snowflake.com/en/user-guide/budgets/cycle-start-actions.md
section: User Guide
---

# Cycle-start actions for budgets

You can configure a budget to automatically call a stored procedure when the budget cycle restarts. The cycle restarts when spending is
reset to 0 at the beginning of the budget’s monthly period. This lets you run automated actions at the beginning of each budget period,
such as re-enabling warehouses or sending notifications about the new cycle.

Cycle-start actions are particularly useful for cleaning up or reversing actions that were triggered by
[custom actions](custom-actions.md) during the previous budget cycle.

When you define a cycle-start action, you specify the stored procedure to call and the arguments to pass to it. The stored
procedure executes automatically each time the budget cycle restarts.

## Stored procedure requirements

The stored procedure that is called by a cycle-start action must meet the following requirements:

* The procedure must run with owner’s rights, not caller’s rights. For more information, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* The procedure can’t take more than 30 minutes to complete.
* The procedure can’t have an OUTPUT argument.
* Design your procedure to handle being called multiple times without causing duplicate or
  unintended effects.
* The procedure’s required arguments must be one of the following data types:

  + [Numeric data types](../../sql-reference/data-types-numeric.md)
  + [String & binary data types](../../sql-reference/data-types-text.md)
  + [Logical data types](../../sql-reference/data-types-logical.md)
  + [Date & time data types](../../sql-reference/data-types-datetime.md)

After you’ve created a stored procedure that meets these requirements, you must grant to the SNOWFLAKE application the USAGE privilege on
the procedure and its parent database and schema. For example, if the fully qualified name of your stored procedure is
`code_db.sch1.reset_resources`, run the following commands:

```sqlexample
GRANT USAGE ON DATABASE code_db TO APPLICATION SNOWFLAKE;
GRANT USAGE ON SCHEMA code_db.sch1 TO APPLICATION SNOWFLAKE;
GRANT USAGE ON PROCEDURE code_db.sch1.reset_resources(STRING, STRING) TO APPLICATION SNOWFLAKE;
```

> **Note:**
>
> If you update the stored procedure after adding it as a cycle-start action, you must re-grant the USAGE privilege on the procedure to the
> SNOWFLAKE application.

## Set a cycle-start action for a budget

You can set one cycle-start action per budget (either the account budget or a custom budget). A cycle-start action consists of the
following components:

* Stored procedure: A reference to the procedure to be called when the budget cycle restarts.
* Arguments: An array of arguments to pass to the stored procedure.

To set a cycle-start action for a budget, call the [SET_CYCLE_START_ACTION](../../sql-reference/classes/budget/methods/set_cycle_start_action.md)
method on the budget instance. For example, the following code sets a cycle-start action that calls the `reset_resources` stored
procedure when the budget cycle restarts:

```sqlexample
CALL budget_db.sch1.my_budget!SET_CYCLE_START_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.reset_resources(STRING, STRING, STRING, STRING)'),
  ARRAY_CONSTRUCT('my_int', 'admin@example.com', 'Budget Alert', 'New budget cycle started'));
```

For an end-to-end example that includes creating the stored procedure that is called by the cycle-start action, see
Extended example.

## Remove a cycle-start action from a budget

To remove a cycle-start action from a budget, call the [REMOVE_CYCLE_START_ACTION](../../sql-reference/classes/budget/methods/remove_cycle_start_action.md)
method on the budget instance:

```sqlexample
CALL budget_db.sch1.my_budget!REMOVE_CYCLE_START_ACTION();
```

## Extended example

The following example demonstrates how to write a stored procedure called by a cycle-start action, grant the necessary privileges on the
procedure, and then set the cycle-start action for the budget.

1. Create a stored procedure that conforms to all the requirements:

   ```sqlexample-javascript
   CREATE OR REPLACE PROCEDURE code_db.sch1.reset_resources(
       integration_name STRING,
       email_list STRING,
       email_subject STRING,
       email_content STRING)
   RETURNS STRING
   LANGUAGE JAVASCRIPT
   EXECUTE AS OWNER
   AS
   $$
       // Re-enable warehouses or reset configurations here
       var enable_wh = "ALTER WAREHOUSE my_warehouse RESUME;";
       var statement1 = snowflake.createStatement({sqlText: enable_wh});
       statement1.execute();

       // Send notification about new cycle
       var sql_command = "CALL SYSTEM$SEND_EMAIL('" + INTEGRATION_NAME + "', " +
                                               "'" + EMAIL_LIST + "', " +
                                               "'" + EMAIL_SUBJECT + "'," +
                                               "'" + EMAIL_CONTENT + "'" + ");";
       var statement2 = snowflake.createStatement({sqlText: sql_command});
       statement2.execute();
       return "Resources reset for new budget cycle";
   $$;
   ```
2. Grant privileges on the stored procedure to the SNOWFLAKE application:

   ```sqlexample
   GRANT USAGE ON DATABASE code_db TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON SCHEMA code_db.sch1 TO APPLICATION SNOWFLAKE;
   GRANT USAGE ON PROCEDURE code_db.sch1.reset_resources(STRING, STRING, STRING, STRING)
     TO APPLICATION SNOWFLAKE;
   ```
3. Set the cycle-start action for the budget:

   ```sqlexample
   CALL budget_db.sch1.my_budget!SET_CYCLE_START_ACTION(
     SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.reset_resources(STRING, STRING, STRING, STRING)'),
     ARRAY_CONSTRUCT('my_int', 'admin@example.com', 'Budget Cycle Restarted', 'New budget cycle has begun'));
   ```

## Troubleshooting cycle-start actions

If a cycle-start action is not working as expected, use the following methods to diagnose the issue.

### Monitor cycle-start action execution

Snowflake uses a task to execute the cycle-start action. This tasks is named `_budget_cycle_start_task`. To check the
execution status of the cycle-start action task for a budget instance, run the following query. Replace `budget_name` with the name of
your budget.

```sqlexample
SELECT th.*, ci.name AS budget_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY th
  JOIN SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES ci
    ON th.instance_id = ci.id
  WHERE ci.class_name = 'BUDGET'
    AND th.name ILIKE '_budget_cycle_start_task'
    AND ci.name = '<budget_name>'
  ORDER BY th.completed_time DESC
  LIMIT 10;
```

### Troubleshoot actions that aren’t triggering

If a cycle-start action is not triggering when expected, check for the following common issues. Assume your custom budget is
`budget_db.sch1.my_budget`.

**Stored procedure or privileges changed**

Verify that the stored procedure called by the cycle-start action is still valid and that the SNOWFLAKE application still has necessary
privileges. You can verify the privileges by running:

```sqlexample
SHOW GRANTS ON PROCEDURE code_db.sch1.reset_resources(STRING, STRING, STRING, STRING);
```

**Budget is not activated**

For account budgets only, verify that the budget is activated by calling the
[GET_CONFIG](../../sql-reference/classes/budget/methods/get_config.md) method and checking the `is_active` field.

```sqlexample
CALL budget_db.sch1.my_budget!GET_CONFIG();
```

**No cycle-start action is configured**

Verify that a cycle-start action is configured for the budget:

```sqlexample
CALL budget_db.sch1.my_budget!GET_CYCLE_START_ACTION();
```

**Budget cycle has not restarted yet**

The cycle-start action triggers only when the budget cycle restarts. Check when the current cycle began and when it will end
to determine when the next trigger will occur.

---
title: Data governance skills for Cortex Code
source: https://docs.snowflake.com/en/user-guide/governance-skills.md
section: User Guide
---

# Data governance skills for Cortex Code

Cortex Code includes built-in data governance skills designed to help you understand, protect, and monitor the data in your
Snowflake account. These skills work directly in your Snowflake environment — describe what you need in plain English, and
Cortex Code generates and executes the necessary queries, classifications, and analyses for you. You don’t need to know which skill to run;
Cortex Code automatically selects the skill that it needs to answer your question.

## Getting started

1. [Install Cortex Code CLI and connect to your account](cortex-code/cortex-code-cli.md).
2. Ensure that you meet the access control requirements.
3. Start asking questions from the command line. You can use any of the example prompts below directly in Cortex Code. Cortex Code selects
   the appropriate skill automatically based on your question — no special commands are needed.

## General data governance

Data governance skills can answer questions about access control, audit trails, permissions, role hierarchies, and compliance monitoring
across your Snowflake account. Cortex Code uses skills to run SQL queries against Snowflake’s ACCOUNT_USAGE views using an embedded semantic
model with effective query patterns.

Cortex Code can help you do the following tasks:

* **Audit who accessed what data and when** — Understand user access patterns, track query history, and identify after-hours or unusual
  activity.
* **Analyze permissions and role hierarchies** — Review grants, role assignments, and privilege structures to ensure least-privilege access
  for users.
* **Monitor compliance posture** — Analyze masking policies, row access policies, aggregation policies, and tag usage across your account.
* **Investigate object dependencies** — Understand how databases, schemas, tables, and views relate to one another.
* **Track DDL changes** — See who created, altered, or dropped objects and when.

**Example prompts**

```none
"Who has accessed the SALES.NA.CUSTOMERS table in the last 30 days?"
"Show me all users with the ACCOUNTADMIN role"
"What tables were accessed outside of business hours last week?"
"List all grants to the ANALYST_ROLE"
"Which users have run DDL operations on the FINANCE database in the last 7 days?"
"Show me the role hierarchy for my account"
"What masking policies are applied across my account?"
"Which tables have no row access policies attached?"
"Show me all tag references in the ANALYTICS database"
```

## Sensitive data classification

The data governance skill for sensitive data classification can detect and classify personally identifiable information (PII) and other
sensitive data in your Snowflake tables. It uses Snowflake’s native [SYSTEM$CLASSIFY](../sql-reference/stored-procedures/system_classify.md)
function to scan tables and identify columns containing data like emails, phone numbers, social security numbers, and addresses. It can
also set up automated classification profiles for continuous monitoring.

Cortex Code can help you do the following tasks:

* **Discover PII in your tables** — Scan individual tables or entire schemas to find columns containing sensitive data such as emails, names,
  phone numbers, credit card numbers, and social security numbers.
* **Analyze existing classification results** — Query the [DATA_CLASSIFICATION_LATEST](../sql-reference/account-usage/data_classification_latest.md)
  view to see what PII has already been detected, which tables have the most sensitive columns, and what categories of sensitive data exist.
* **Set up automated classification** — Create classification profiles that continuously monitor databases for new sensitive data and, if
  desired, auto-tag columns.
* **Create custom classifiers** — Define regex-based classifiers for domain-specific sensitive data (employee IDs, internal codes, custom
  formats) that Snowflake’s built-in categories don’t cover.
* **Test and validate classification accuracy** — Run classifiers against representative tables to verify detection accuracy before deploying
  them to production.

**Example prompts**

```none
"Scan SALES.NA.ORDERS for PII"
"Does the CUSTOMERS table contain any sensitive data?"
"What PII exists across my ANALYTICS database?"
"Show me all columns classified as EMAIL or PHONE in my account"
"Which tables have the most sensitive columns?"
"Create a classification profile for the PROD_DB database"
"Set up auto-classification with auto-tagging enabled"
"Create a custom classifier for employee IDs that match the pattern EMP-XXXXX"
"Show me classification results for the NA.FINANCE schema"
"Which tables need re-classification (older than 90 days)?"
```

## Data protection policies

The data governance skill for data protection policies helps you create, audit, and manage Snowflake masking policies, row access policies,
and projection policies. It provides best practices, proven patterns (like Attribute-Based Access Control), and guided workflows for both
building new policies and auditing existing ones. It also includes compliance reference material for PCI-DSS, HIPAA, GDPR, CCPA, SOX, and
FERPA.

Cortex Code can help you do the following tasks:

* **Create masking policies** — Build column-level masking policies that dynamically redact sensitive data based on the querying user’s role,
  using best practices like `IS_ROLE_IN_SESSION()` and memoizable functions.
* **Create row access policies** — Restrict which rows a user can see based on role membership, attributes, or lookup tables.
* **Create projection policies** — Control whether a column can appear in query results at all.
* **Audit existing policies** — Inventory all policies in your account, evaluate them against a checklist of security best practices, and
  identify anti-patterns (for example, using the CURRENT_ROLE function instead of IS_ROLE_IN_SESSION).
* **Consolidate scattered policies** — Migrate from table-specific policies to generic, reusable policies centralized in a governance
  database.
* **Meet regulatory requirements** — Get policy templates and guidance tailored to specific compliance frameworks (HIPAA for healthcare,
  PCI-DSS for payment data, GDPR for EU personal data).

**Example prompts**

```none
"Create a masking policy for the EMAIL column in the SALES.NA.CUSTOMERS table"
"Help me set up row access policies for the FINANCE schema"
"Audit all masking policies in my account"
"Are there any anti-patterns in my existing data policies?"
"Create a HIPAA-compliant masking policy for PHI columns"
"Show me the best practice for role-based masking"
"I need a projection policy to prevent the SSN column from appearing in query results"
"Help me consolidate my scattered masking policies into reusable ones"
"What's the recommended pattern for Attribute-Based Access Control (ABAC)?"
"Generate a policy health report for my account"
```

## Data quality

The data governance skill for data quality monitors and analyzes data quality across your Snowflake schemas using Data Metric Functions
(DMFs). It provides health scoring, root cause analysis for failing metrics, regression detection, trend analysis, SLA alerting, table
comparison for migration validation, and dataset popularity analysis.

Cortex Code can help you do the following tasks:

* **Check schema health** — Get an overall data quality score for a schema, showing how many metrics are passing versus failing, and which tables
  are monitored.
* **Investigate quality failures** — Drill into failing metrics to understand which tables and columns have issues, what the issues are, and
  get fix recommendations.
* **Detect quality regressions** — Compare current quality against previous measurements to see if quality improved or degraded, and identify
  new failures.
* **Track quality trends** — View time-series quality scores to understand whether quality is improving, stable, or declining over time.
* **Set up SLA alerts** — Create automated Snowflake ALERT objects that notify you when data quality drops below a threshold.
* **Compare tables** — Validate data migrations, reconcile dev versus prod data, or find row-level differences between two table versions
  (added, removed, modified rows, schema diffs).
* **Analyze dataset popularity** — Identify the most and least used tables, find unused or stale data, and understand who is consuming which
  datasets.

**Example prompts**

```none
"What is the data quality score for ANALYTICS.REPORTING?"
"Why is the SALES.CUSTOMERS.ORDERS table failing quality checks?"
"Has data quality improved or gotten worse in the DB.FINANCE schema this month?"
"Show me quality trends for PROD_DB.SALES over the last 30 days"
"Set up an alert if data quality in ANALYTICS.CORE drops below 90%"
"Compare STAGING.ORDERS_V1 with STAGING.ORDERS_V2"
"Find the differences between dev and prod versions of the SALES.ORDERS.CUSTOMERS table"
"Which tables in my account are the most popular?"
"Are there any unused tables in the SANDBOX database?"
"Show me the root cause of quality failures in SALES.ORDERS"
```

## Lineage

The data governance skill for lineage traces data dependencies across your Snowflake account — both upstream (where data comes from) and
downstream (what depends on it). It supports table-level and column-level lineage, impact analysis with risk scoring, root cause analysis
with change detection, and data discovery with trust scoring.

Cortex Code can help you do the following tasks:

* **Assess the impact of changes** — Before modifying a table, see all downstream objects that depend on it, ranked by risk (CRITICAL,
  MODERATE, LOW), with usage frequency and affected user counts.
* **Debug data issues by tracing upstream** — When a report shows wrong numbers, trace the data back through its transformation layers to
  identify where the issue originated, including recent schema and data changes.
* **Discover and verify trusted datasets** — Find the best table to use for a given analysis, with trust scores based on schema tier
  (production, staging, raw, sandbox), usage patterns, and data freshness.
* **Trace column-level dependencies** — Understand which downstream columns consume a specific column, or trace a column back to its original
  source through transformation layers.
* **Detect recent changes in the lineage** — Identify schema changes, data modifications, and DDL operations across the lineage path to
  correlate with data quality issues.

**Example prompts**

```none
"What will break if I change RAW_DB.SALES.ORDERS?"
"What depends on the SALES.SCH1.CUSTOMERS table?"
"Where does ANALYTICS_DB.REPORTING.REVENUE come from?"
"Why is the REVENUE_SUMMARY table showing wrong numbers?"
"Which table should I use for customer revenue analysis?"
"Is STAGING_DB.TRANSFORM.ORDERS_ENRICHED trustworthy?"
"What uses the AMOUNT column in SALES.ORDERS?"
"Where does the TOTAL_SALES column in the REVENUE report come from?"
"Show me the full lineage for SUMMIT.DEMO.SHIPMENTS"
"Has the DISCOUNT_PCT column in ORDERS changed recently?"
```

## Access control requirements

To successfully invoke the data governance skills from Cortex Code, you must have the following:

* [Privileges and roles required by Cortex Code](cortex-code/cortex-code-cli.md).
* Access to the schemas, tables, and views that you’re interested in.

  + For sensitive data classification questions, you need the OWNERSHIP or USAGE privilege on the table or view.
  + For data protection policies, you need the CREATE MASKING POLICY or CREATE ROW ACCESS POLICY privilege on the schema that contains the
    table.
* Access to views in the ACCOUNT_USAGE schema. By default, only the ACCOUNTADMIN system role has privileges to access the views in the ACCOUNT_USAGE schema. To
  grant the ability to access these views to other people, you can do either of the following:

  + Grant the IMPORTED PRIVILEGES privilege on the SNOWFLAKE database to the user’s role. This is a broad grant of privileges that allows a
    user to view all ACCOUNT_USAGE views, but also grants access to views in the ORGANIZATION_USAGE schema.
  + Grant database roles needed to access views. To use Cortex Code for all governance-related topics, a user needs all of these database roles:
    OBJECT_VIEWER, USAGE_VIEWER, GOVERNANCE_VIEWER, and SECURITY_VIEWER. To restrict a user from learning about certain aspects of data
    governance, grant a subset of these roles. For a list of the views that each role can access, see [ACCOUNT_USAGE schema](../sql-reference/snowflake-db-roles.md).

## Tips for best results

* **Be specific with object names** — Use fully qualified names like `DATABASE.SCHEMA.TABLE` for the most accurate results.
* **Start broad, then drill in** — Begin with a health check or overview, then ask follow-up questions to investigate specific issues.

---
title: Data Integration
source: https://docs.snowflake.com/en/user-guide/ecosystem-etl.md
section: User Guide
---

# Data Integration

Commonly referred to as ETL, data integration encompasses the following three primary operations:

Extract:
:   Exporting data from specified data sources.

Transform:
:   Modifying the source data (as needed), using rules, merges, lookup tables or other conversion methods, to match the target.

Load:
:   Importing the resulting transformed data into a target database.

The more recent usage of the term is ELT, emphasizing that the transformation operation does not necessarily need to be performed before
loading, particularly in systems such as Snowflake that support transformation during or after loading.

In addition, the scope of data integration has expanded to include a wider range of operations, including:

* Data preparation.
* Data migration, movement, and management.
* Data warehouse automation.

The following data integration tools and technologies are known to provide native connectivity to Snowflake:

| Solution |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Ab Initio:** No requirements — contact Ab Initio for more details  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page |  |
|  |  | **Agile Data Engine:** No requirements  **Snowflake:** No requirements |  |
|  |  | **Airbyte:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Connectors > Connector Catalog > Destinations > Snowflake](https://docs.airbyte.com/integrations/destinations/snowflake) (Airbyte Documentation) |
|  |  | **Alteryx Designer Cloud:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). |
|  |  | **Amazon Data Firehose:** No requirements  **Snowflake:** No requirements | Additional resources:  * [Amazon Data Firehose documentation](https://docs.aws.amazon.com/firehose/latest/dev/create-destination.html#create-destination-snowflake) * [Getting Started with Snowflake and Amazon Data Firehose](https://quickstarts.snowflake.com/guide/getting_started_with_snowflake_and_aws_kdf/#0) |
|  |  | **AWS Glue:** No requirements  **Snowflake:** No requirements | Additional resources:  * [Connecting to Snowflake in AWS Glue Studio](https://docs.aws.amazon.com/glue/latest/dg/connecting-to-data-snowflake.html) * [Utilizing the new AWS Glue Studio Native Connector for Snowflake](https://medium.com/snowflake/utilizing-the-new-aws-glue-studio-native-connector-for-snowflake-12e99f288682) |
|  |  | **Artie:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Artie Transfer - Snowflake](https://docs.artie.so/real-time-destinations/snowflake) (Artie Documentation) |
|  |  | **Ascend.io:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Seamlessly Ingest, Transform, and Orchestrate Snowflake Workloads](https://www.ascend.io/snowflake/) (Ascend website)   + [Connections > Warehouse > Snowflake](https://developer.ascend.io/docs/snowflake) (Ascend Documentation) |
|  |  | **Azure Data Factory:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Copy activity in Azure Data Factory](https://docs.microsoft.com/en-us/azure/data-factory/copy-activity-overview)     (Azure Data Factory Documentation)   + [Copy data from and to Snowflake by using Azure Data Factory](https://docs.microsoft.com/en-us/azure/data-factory/connector-snowflake)     (Azure Data Factory Documentation) |
|  |  | **Boomi:** DCP 4.2 (or higher) or Integration July 2020 (or higher)  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). |
|  |  | **CData Software:** The [latest version](https://www.cdata.com/drivers/snowflake/download) of the driver/connector/application is always recommended, but legacy versions will continue to work if they are licensed.  Note that any changes to Snowflake since the release of a driver may not be available in the driver depending on how the changes are implemented.  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Snowflake - Driver Documentation](https://www.cdata.com/drivers/snowflake/docs/) (CData Software Documentation)   + [Snowflake Integration Guides and Tutorials](https://www.cdata.com/kb/tech/snowflake-article-list.rst)     (CData Software Knowledge Base) |
|  |  | **Celigo:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake Prebuilt Integrations](https://www.celigo.com/integrations/snowflake/) (Celigo website)   + [Set up a connection to Snowflake](https://docs.celigo.com/hc/en-us/articles/360048048792-Set-up-a-connection-to-Snowflake) (Celigo Help Center)   + Automate application and data integration use cases using [integrator.io](http://integrator.io/) — free trial available (Celigo integrator.io website, login required) |
|  |  | **Census:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Data Sources > Snowflake](https://docs.getcensus.com/sources/snowflake) (Census Documentation)   + [Destinations > Snowflake](https://docs.getcensus.com/destinations/snowflake) (Census Documentation)   + [Sync Snowflake to Salesforce](https://www.getcensus.com/sync/snowflake-to-salesforce) (Census website) |
|  |  | **Coalesce:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Snowflake Quickstart:    + [Accelerate Transformations with Coalesce and Snowflake](https://quickstarts.snowflake.com/guide/transform_your_data_with_coalesce/) * Additional resources:    + [Quick Start Guide](https://docs.coalesce.io/docs/quick-start) (Coalesce Documentation) |
|  |  | **Datameer:** v7  **Snowflake:** No requirements | * Snowflake Quickstart:    + [Getting Started with Datameer](https://quickstarts.snowflake.com/guide/getting_started_datameer/) * Additional resources:    + [User Guide > Getting Started](https://documentation.datameer.com/datameer/) (Datameer Documentation)   + [User Guide > Deploying Data to Snowflake](https://documentation.datameer.com/datameer/deploying_data_to_snowflake/)     (Datameer Documentation) |
|  |  | **DataVirtuality:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Data Warehouse Connector](https://documentation.datavirtuality.com/21/reference-guide/connecting-datasources/jdbc-connectors/snowflake-data-warehouse-connector) (DataVirtuality website) |
|  |  | **dbt:** v0.13 (or higher)  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Snowflake Quickstarts:    + [Data Engineering with Snowpark Python and dbt](https://quickstarts.snowflake.com/guide/data_engineering_with_snowpark_python_and_dbt/#0)   + [Accelerating Data Teams with dbt Core and Snowflake](https://quickstarts.snowflake.com/guide/data_teams_with_dbt_core/)   + [Accelerating Data Teams with Snowflake and dbt Cloud Hands On Lab](https://quickstarts.snowflake.com/guide/accelerating_data_teams_with_snowflake_and_dbt_cloud_hands_on_lab/)   + [Data Engineering with Apache Airflow, Snowflake and dbt](https://quickstarts.snowflake.com/guide/data_engineering_with_apache_airflow/) * Additional resources:    + [Getting Started > Supported Databases > Snowflake](https://docs.getdbt.com/docs/profile-snowflake) (dbt Documentation)   + [Building Models > Warehouse-Specific Configs > Snowflake](https://docs.getdbt.com/docs/snowflake-configs)     (dbt Documentation)   + [How we configure Snowflake](https://blog.fishtownanalytics.com/how-we-configure-snowflake-fc13f1eb36c4)     (Fishtown Analytics Blog) |
|  |  | **Denodo:** Denodo Platform 6.0 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [Denodo Platform 8.0 Datasheet](https://www.denodo.com/en/system/files/document-attachments/ds-denodoplatform8.0-01-web_0.pdf)     (Denodo website)   + [How to connect to Snowflake from Denodo](https://community.denodo.com/kb/en/view/document/How%20to%20connect%20to%20Snowflake%20from%20Denodo)     (Denodo Knowledge Base) |
|  |  | **Devart ODBC Driver for Snowflake:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Overview - ODBC Driver for Snowflake](https://docs.devart.com/odbc/snowflake/) (Devart Documentation) |
|  |  | **Devart Python Connector for Snowflake:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Overview - Python Connector for Snowflake](https://docs.devart.com/python/snowflake/) (Devart Documentation) |
|  |  | **Devart SSIS Data Flow Components for Snowflake:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Devart SSIS Data Flow Components > Overview](https://docs.devart.com/ssis/) (Devart Documentation) |
|  |  | **Diyotta:** No requirements  **Snowflake:** No requirements | * Acquired by [ThoughtSpot](https://www.thoughtspot.com/) * Additional resources:    + [Working with Snowflake Data Object](https://support.diyotta.com/docs/latest/user-guide/using-diyotta-studio/working-with-data-object/working-with-snowflake-data-object)     (Diyotta website) |
|  |  | **dlt:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Getting started](https://dlthub.com/docs/intro) (dlt Documentation)   + [Destinations > Snowflake](https://dlthub.com/docs/dlt-ecosystem/destinations/snowflake) (dlt Documentation)   + [Sources > 30+ SQL Databases](https://dlthub.com/docs/dlt-ecosystem/verified-sources/sql_database/#supported-databases)     (dlt Documentation) |
|  |  | **Domo:** No requirements  **Snowflake:** No requirements | Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md).   * Additional resources:    + [Connect everything, everywhere](https://www.domo.com/data-integration/connectors)   + [Combine and transform your data](https://www.domo.com/data-integration/etl)   + [Magic ETL on Snowflake](https://domo-support.domo.com/s/article/000005455?language=en_US)   + [Domo Data Connectors](https://domo-support.domo.com/s/topic/0TO5w000000ZanLGAS/connectors?language=en_US) |
|  |  | **Estuary:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Estuary and Snowflake](https://estuary.dev/solutions/technology/real-time-snowflake-streaming/) (Estuary website)   + [Snowflake materialization connector](https://docs.estuary.dev/reference/Connectors/materialization-connectors/Snowflake/) (Estuary Documentation)   + [Snowflake change data capture connector](https://docs.estuary.dev/reference/Connectors/capture-connectors/snowflake/) (Estuary Documentation) |
|  |  | **Etleap:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Etleap Launches Snowflake Integration](https://blog.etleap.com/2019/07/22/etleap-launches-snowflake-integration/)     (Etleap Blog) |
|  |  | **Etlworks:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Etlworks and Snowflake](https://etlworks.com/using-etlworks-to-load-data-in-snowflake.html) (Etlworks website)   + [Working with Snowflake](https://support.etlworks.com/hc/en-us/sections/360002758614-Working-with-Snowflake) (Etlworks Documentation) |
|  |  | **Fivetran:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Snowflake Quickstarts:    + [Automating Data Pipelines to Drive Marketing Analytics with Snowflake and Fivetran](https://quickstarts.snowflake.com/guide/vhol_fivetran/)   + [Fivetran - Automate Salesforce Insights: Source, Target, Transformations, Dashboard…NO CODE](https://quickstarts.snowflake.com/guide/modern_data_stack_with_fivetran_snowflake_salesforce/) * Additional resources:    + [Strategic Partner: Snowflake](https://www.fivetran.com/partners-snowflake) (Fivetran website)   + [Connectors > Destinations > Snowflake](https://fivetran.com/docs/destinations/snowflake) (Fivetran Documentation) |
|  |  | **Google Cloud Data Fusion:** Snowflake plugin  **Snowflake:** No requirements | * Additional resources:    + [Cloud Data Fusion Plugins](https://cloud.google.com/data-fusion/plugins) (Google Cloud website)   + [Cloud Storage to Snowflake Action](https://github.com/data-integrations/snowflake-plugins/blob/develop/docs/CloudStorageToSnowflake-action.md)     (GitHub) |
|  |  | **Google Cloud Dataflow:** Apache Beam  **Snowflake:** No requirements | * Additional resources:    + [Snowflake I/O](https://beam.apache.org/documentation/io/built-in/snowflake/) (Apache Beam Documentation)   + [Class SnowflakeIO](https://beam.apache.org/releases/javadoc/2.25.0/org/apache/beam/sdk/io/snowflake/SnowflakeIO.html)     (Apache Beam Javadoc) |
|  |  | **Heap:** Connect  **Snowflake:** No requirements | * Additional resources:    + [Heap Connect for Snowflake](https://docs.heap.io/docs/heap-connect-snowflake-integration) (Heap Documentation) |
|  |  | **Hevo Data CDC for ETL:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Destinations > Data Warehouses > Snowflake](https://docs.hevodata.com/destinations/data-warehouses/snowflake/)     (Hevo Documentation) |
|  |  | **Hightouch:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Snowflake Quickstart:    + [Suppress existing customers from a Youtube campaign with Hightouch and Snowflake](https://quickstarts.snowflake.com/guide/suppress_existing_customers_from_youtube_campaign_with_hightouch_and_snowflake/) * Additional resources:    + [Activate the Data Cloud](https://hightouch.com/snowflake/) (Hightouch website)   + [Sources > Snowflake](https://hightouch.com/docs/sources/snowflake/) (Hightouch Documentation) |
|  |  | **HVR:** No requirements  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Requirements for Snowflake](https://www.hvr-software.com/docs/5/location-class-requirements/requirements-for-snowflake)     (HVR Documentation)   + [Quick Start for HVR - Snowflake](https://www.hvr-software.com/docs/5/quick-start-guides/quick-start-for-hvr-snowflake)     (HVR Documentation) |
|  |  | **DataStage:** InfoSphere Information Server 11.7  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Snowflake Quickstart:    + [Use IBM DataStage to Load Enterprise Data into Snowflake](https://quickstarts.snowflake.com/guide/data_engineering_with_datastage/) * Additional resources:    + [Snowflake connector](https://www.ibm.com/docs/en/iis/11.7?topic=databases-snowflake)     (InfoSphere Information Server Documentation)   + [DataStage on Cloud Pak for Data free trial](https://dataplatform.cloud.ibm.com/registration/stepone?context=cpdaas&apps=cos%2Cdatastage&regions=us-south%2Ceu-de&S_PKG=ov80049&cm_mmca1=10000665&cm_mmca2=000000TF)     (IBM Cloud Pak for Data website)   + DataStage on Cloud Pak for Data as a Service:      - [DataStage on Cloud Pak for Data as a Service](https://dataplatform.cloud.ibm.com/docs/content/svc-welcome/datastage.html)       (Documentation)     - [Snowflake connection](https://dataplatform.cloud.ibm.com/docs/content/wsj/manage-data/conn-snowflake.html?audience=wdp)       (Documentation)   + DataStage on Cloud Pak for Data Software:      - [DataStage on Cloud Pak for Data](https://www.ibm.com/docs/en/cloud-paks/cp-data/4.6.x?topic=services-datastage)       (Documentation)     - [Snowflake connection](https://www.ibm.com/docs/en/cloud-paks/cp-data/4.6.x?topic=catalogs-snowflake-connection)       (Documentation)     - [ELT run mode in DataStage](https://www.ibm.com/docs/en/cloud-paks/cp-data/4.6.x?topic=data-elt-run-mode)       (Documentation) |
|  |  | **Informatica Cloud:**   * Cloud Connector for Snowflake — available directly in the Informatica Cloud interface or by download from the   [Informatica Marketplace](https://marketplace.informatica.com/solutions/snowflake_elastic_data_warehouse). * Secure Agent — download and install from the Informatica Cloud interface   **Snowflake:**   * For push-down optimization: [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page * For all other functionality: No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Snowflake Quickstarts:    + [Harness the Power of Snowflake with Informatica Intelligent Data Management Cloud](https://quickstarts.snowflake.com/guide/harness_the_power_of_snowflake_with_informatica_idmc/)   + [Accelerate Data Transformation with the Telecom Data Cloud and Informatica](https://quickstarts.snowflake.com/guide/Accelerate_Data_Transformation_with_the_Telecom_Data_Cloud/) * Additional resources:    + [FAQ: Informatica Cloud Connector for Snowflake](https://community.snowflake.com/s/article/Informatica-Cloud-Connector-for-Snowflake-FAQ)     (Snowflake Community)   + [Snowflake Cloud Data Warehouse Connector Related Queries](https://kb.informatica.com/faq/7/Pages/21/553165.aspx) (Informatica Network) |
|  |  | **Informatica Data Loader:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [How to Video - Getting Started with Informatica Data Loader for Snowflake](https://video.informatica.com/detail/video/6308017723112)     (Informatica Videos) |
|  |  | **Integrate.io:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Low-code Transform & Load to Snowflake](https://www.integrate.io/docs/etl/using-components-snowflake-destination/)   + [60-second CDC Replication to Snowflake](https://www.integrate.io/docs/cdc/snowflake/) |
|  |  | **Apache Kafka:** No requirements  **Kafka Connect:** API 2.0.0 to 2.2.0 (all other versions are not supported)  **Snowflake:** [Snowflake Connector for Kafka](kafka-connector.md) — download from [Maven](https://mvnrepository.com/artifact/com.snowflake) | * Additional requirements if using Avro format; for more details, see [Snowflake Connector for Kafka](kafka-connector.md). |
|  |  | **Keboola:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Snowflake Quickstart:    + [Getting Started with Keboola](https://quickstarts.snowflake.com/guide/getting_started_keboola/) * Additional resources:    + [Storage](https://help.keboola.com/storage/?_ga=2.36925536.1177541999.1508806853-909758245.1508806853) (Keboola Documentation)   + [Snowflake Transformation](https://help.keboola.com/manipulation/transformations/snowflake) (Keboola Documentation) |
|  |  | **Knoema:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). |
|  |  | **Matillion Data Productivity Cloud:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [How to Create a Data Pipeline Using Matillion Data Productivity Cloud](https://www.matillion.com/resources/blog/how-to-create-a-data-pipeline-using-matillion-data-loader/) (Matillion Blog) |
|  |  | **Matillion ETL:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Snowflake Quickstart:    + [Cloud Native Data Engineering with Matillion and Snowflake](https://quickstarts.snowflake.com/guide/cloud_native_data_engineering_with_matillion_and_snowflake/) * Additional resources:    + [Snowflake: How to Get Started](https://www.matillion.com/etl-for-snowflake/how-to-get-started/) (Matillion website)   + [Snowflake articles with videos](https://www.matillion.com/community/snowflake/) (Matillion Community) |
|  |  | **Nexla:** No requirements  **Snowflake:** No requirements (ODBC, JDBC, and API options) | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Nexla for Snowflake](https://www.nexla.com/partners/snowflake/) (Nexla website)   + [Send Data to Snowflake](https://nexla.zendesk.com/hc/en-us/articles/360023976353-Send-Data-to-Snowflake) (Nexla Help Center) |
|  |  | **Pentaho Data Integration (PDI):**   * Pentaho 8.3 (or higher): Snowflake plugin — download from the Pentaho Customer Portal (requires login) * Pentaho 8.2 (or lower): No Pentaho requirements, but some Snowflake requirements   **Snowflake:**   * Pentaho 8.3 (or higher): No requirements * Pentaho 8.2 (or lower):    + [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc)  or   + 3rd-party connector (PentahoSnowflakePlugin) — download from [GitHub](https://github.com/inquidia/PentahoSnowflakePlugin) | * Additional resources:    + [PDI and Snowflake](https://docs.pentaho.com/pdia-data-integration/advanced-topics-pentaho-data-integration-overview/pdi-and-snowflake-cp) (Pentaho Documentation)   + [Bulk load into Snowflake](https://docs.pentaho.com/pdia-data-integration/pdi-job-entries-reference-overview/bulk-load-into-snowflake) (Pentaho Documentation)   + [PentahoSnowflakePlugin Readme](https://github.com/inquidia/PentahoSnowflakePlugin/blob/master/README.md) (GitHub) |
|  |  | **Precog:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Request a [trial](https://precog.com/demo/). * Additional resources:    + [AI-driven, no-code ELT for Snowflake](https://precog.com/use-cases/no-code-elt-for-snowflake/) (Precog website)   + [Case study: How a Luxury Gelato Chain Uses Precog to Actually Get Realtime, Multi-Source Analytics](https://precog.com/how-a-luxury-gelato-chain-uses-precog-to-actually-get-realtime-multi-source-analytics/) (Precog website) |
|  |  | **Qlik Replicate:** No requirements  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Formerly Attunity Replicate * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Start a trial (business sence)](https://www.qlik.com/us/trial/qlik-sense-business)   + [Technology Partners > Snowflake](https://www.qlik.com/us/products/technology/snowflake/) (Qlik website) |
|  |  | **Qlik Talend:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake Components in Qlik Talend](https://help.qlik.com/talend/en-US/components/8.0/snowflake/snowflake-component)     (Qlik Documentation)   + [Centralizing Snowflake Metadata](https://help.qlik.com/talend/en-US/studio-user-guide/8.0-R2024-12/centralizing-snowflake-metadata)     (Qlik Documentation) |
|  |  | **Qlik Talend Cloud:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Start a trial (replicate)](https://www.qlik.com/us/trial/replicate)   + [Technology Partners > Snowflake](https://www.qlik.com/us/products/technology/snowflake/) (Qlik website) |
|  |  | **Qlik Talend Studio:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). |
|  |  | **Redpoint:**   * Redpoint Orchestration v7.3 (or higher) * Redpoint Data Management v9.4.9.3 (or higher)   **Snowflake:**   * [ODBC Driver](../developer-guide/odbc/odbc.md) (for Redpoint Orchestration) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page * [JDBC Driver](../developer-guide/jdbc/jdbc.md) (for Redpoint Data Management) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Additional resources:    + [Redpoint and Snowflake: No Data Replication and Complete CDP Functionality](https://www.redpointglobal.com/blog/redpoint-and-snowflake-no-data-replication-and-complete-cdp-functionality/) (Redpoint Blog)   + [Redpoint Orchestration > Snowflake ODBC driver configuration](https://docs.redpointglobal.com/rpi/admin-appendix-a-database-preparation#Admin:AppendixA-Databasepreparation-SnowflakeODBCdriverconfiguration) (Redpoint Documentation)   + [Redpoint Data Management > RDBMS providers: Snowflake](https://docs.redpointglobal.com/rpdm/rdbms-database-providers#RDBMSdatabaseproviders-RDBMS_providers_SnowflakeA6B0F944RDBMSproviders:Snowflake) (Redpoint Documentation) |
|  |  | **Rivery:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Now Compatible with Snowflake](https://rivery.io/now-compatible-snowflake/) (Rivery website)   + [Integrations: Snowflake](https://rivery.io/integration/snowflake/) (Rivery website)   + [Customer Spotlight: Roomjoom](https://rivery.io/customer-spotlight-roojoom/) (Rivery website) |
|  |  | **SAP Data Services:** 4.2 SP12 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Snowflake Quickstart:    + [SAP Accounts Receivable to Snowflake using ADF](https://quickstarts.snowflake.com/guide/sap_accounts_receivable_to_snowflake_using_adf/) * Additional resources:    + [Data Services — What’s New](https://help.sap.com/viewer/9f1b4472ec98409682d91953b9e68c92/4.2.12/en-US/578e89ed6d6d1014b3fc9283b0e91070.html)     (SAP Help Portal)   + [Data Services Supplement for Big Data](https://help.sap.com/viewer/af6d8e979d0f40c49175007e486257f0/4.2.12/en-US/bdfc176728dc466bbaf5012a9e3793bc.html)     (SAP Help Portal) |
|  |  | **Segment:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake Destination](https://segment.com/docs/destinations/snowflake/) (Segment Documentation) |
|  |  | **Skyvia:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Skyvia and Snowflake integration](https://skyvia.com/connectors/snowflake) (Skyvia website)   + [Working with Snowflake](https://docs.skyvia.com/connectors/databases/snowflake_connections.html) (Skyvia Documentation) |
|  |  | **Solace:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Solace and Snowflake integration](https://solace.com/integration-hub/pubsub-connector-for-snowflake/) (Solace website) |
|  |  | **Snaplogic:** 4.7.0 (or higher) with Snowflake Snap Pack  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Snowflake Snap Pack](https://docs-snaplogic.atlassian.net/wiki/display/SD/Snowflake+Snap+Pack) (SnapLogic Documentation) |
|  |  | **Snowplow:** Snowflake Loader — download from [GitHub](https://github.com/snowplow-incubator/snowplow-snowflake-loader)  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Loader](https://docs.snowplowanalytics.com/docs/pipeline-components-and-applications/loaders-storage-targets/snowplow-snowflake-loader/)     (Snowplow Documentation) |
|  |  | **Stitch:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Connecting a Snowflake Data Warehouse to Stitch](https://www.stitchdata.com/docs/destinations/snowflake/connecting-a-snowflake-data-warehouse-to-stitch#main/)     (Stitch Documentation) |
|  |  | **Streamkap:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Stream into Snowflake](https://streamkap.com/snowflake) (Free trial available)   + [Streamkap Snowflake documenation](https://docs.streamkap.com/docs/snowflake) |
|  |  | **StreamSets:** No requirements  **Snowflake:** No requirements | * Utilizes [Snowpipe](data-load-snowpipe-intro.md) for continuous loading. * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Snowflake Quickstarts:    + [StreamSets’ Transformer for Snowflake: Hands on Lab](https://quickstarts.snowflake.com/guide/streamsets_transformer_for_snowflake_hol)   + [Process Change Data Capture (CDC) data from Oracle to Snowflake Using StreamSets](https://quickstarts.snowflake.com/guide/cdc_data_from_oracle_to_snowflake_in_streamsets/)   + [A Dive Into Slowly Changing Dimensions with Snowpark and StreamSets](https://quickstarts.snowflake.com/guide/snowflake_transformer/) * Additional resources:    + [Tame Snowflake with StreamSets](https://streamsets.com/partners/snowflake) (StreamSets website) |
|  |  | **Striim:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Striim for Snowflake Data Warehouse](https://www.striim.com/partners/real-time-data-to-snowflake/) (Striim website) |
|  |  | **Supermetrics:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake Data Warehouse: Getting started](https://supermetrics.com/docs/product-dwh-snowflake-getting-started/)     (Supermetrics Documentation) |
|  |  | **Tableau:** Prep 2018.3 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Snowflake Quickstart:    + [Visual Analytics powered by Snowflake and Tableau](https://quickstarts.snowflake.com/guide/visual_analytics_powered_by_snowflake_and_tableau/) * Additional resources:    + [Snowflake for Tableau Prep](https://www.tableau.com/products/new-features/prep#feature-91953) (Tableau website) |
|  |  | **TIBCO ActiveMatrix BusinessWorks:**   * 6.x (or higher) * Plug-in for Snowflake 6.x (or higher)   **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [TIBCO ActiveMatrix BusinessWorks Plug-in for Snowflake User’s Guide](https://docs.tibco.com/pub/bwpluginsnowflake/6.0.1/doc/html/GUID-53E7AE6F-9F9B-4059-9136-CB0A9A1684EC.html)     (TIBCO Documentation) |
|  |  | **TMMData:** No requirements  **Snowflake:** No requirements |  |
|  |  | **Trifacta:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Trifacta for Snowflake](https://www.trifacta.com/solutions/snowflake/) (Trifacta website)   + [Trifacta for Snowflake: Data Prep for Your Cloud Data Warehouse or Data Lake — Part 1](https://www.trifacta.com/blog/trifacta-for-snowflake-part-1/)     (Trifacta blog) |
|  |  | **Wherescape:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Wherescape Automation for Snowflake](https://www.wherescape.com/products-services/wherescape-automation-for-snowflake/)     (Wherescape website) |
|  |  | **windsor.ai:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Utilizing Windsor.ai connector for Snowflake data integration](https://windsor.ai/destinations/snowflake/)   + [Documentation: How to integrate data into Snowflake with Windsor.ai](https://windsor.ai/documentation/how-to-integrate-data-into-snowflake/) |
|  |  | **Workato:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Workato is Snowflake Ready…](https://blog.workato.com/2020/08/workato-snowflake-ready/#.X5yFdlNKjuw) (Workato Blog)   + [Connectors > Snowflake](https://docs.workato.com/connectors/snowflake.html) (Workato Documentation) |

---
title: Data Lineage
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-lineage.md
section: User Guide
---

# Data Lineage

Snowflake tracks how data flows from source to target objects, for example from a table to a view, and lets you see
where the data in an object came from or where it goes. This information is called *data lineage*, and it helps you
understand the relationships between your Snowflake objects.

Data lineage captures two types of relationship:

* Data movement, such as when data is copied or materialized from one object to another. For example, CREATE TABLE AS
  SELECT (CTAS), INSERT, or MERGE operations on tables result in data movement.
* Object dependencies, when an object references a base object but does not materialize or copy data, such as when a
  view references a table.

Snowflake data lineage provides these benefits:

* Provides impact analysis by understanding the relationship between different objects.
* Enhances monitoring and troubleshooting by viewing data movement lineage and object dependencies.
* Facilitates compliance by tracking the flow of sensitive data.
* Helps you work with tags and masking policies on columns to protect sensitive data.
* Enhances trust in the data by understanding the source and target objects and columns.
* Allows administration for viewing lineage to be delegated. For more information, see Access control for lineage information.

## About upstream and downstream relationships

Data lineage helps you understand the relationships of an object in terms of source and target objects. In lineage terminology, the source
object is “upstream” of the target object, and the target object is “downstream” of the source object. Snowsight reveals objects
incrementally, one step at a time upstream or downstream from your selection.

For example, in this SQL statement:

```sqlexample
CREATE TABLE table2 AS SELECT col1 FROM table1;
```

`table2` is the target table, and is downstream of the source table, `table1`. Column `col1`, which originates
in table `table1`, is included in table `table2`; this is also a downstream lineage relationship.

If you view the details of table `table1` in Snowsight, the Lineage tab displays an arrow pointing from `table1` to
`table2` to indicate the downstream lineage relationship. If you instead start at table `table2`, an arrow points from
`table2` upstream to `table1`.

## Get started

To start using data lineage in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) with the necessary privileges.
2. In the navigation menu, select Catalog » Database Explorer, and then select a supported object such as a table or
   view.
3. Select the Lineage tab.

Basic actions on the Lineage tab include the following:

* **A.** Select an object to show additional details about it, including columns and tags on those columns.
* **B.** Select +/- to show or hide objects that are further upstream or downstream.
* **C.** Select the arrow that connects two objects to show information about how the downstream object was created (for
  example, the SQL statement that created an object). Your access control privileges determine what information appears.
* **D.** Opens a new lineage diagram that focuses on the lineage of the selected object.

To learn about using the Lineage tab to perform other actions, see the following:

* Column lineage
* Work with tags
* Identify masking policies

## Column lineage

You can use Snowsight to trace the relationship between columns in a source object and columns in a target object. For a given
column, you can determine all upstream and downstream columns that share lineage with the column.

To determine the lineage of a column:

1. Open the Lineage tab and select the object that contains the column you are interested in tracing. A side panel opens.
2. Hover over the column name in the side panel, and select View Lineage.
3. Select Upstream Lineage or Downstream Lineage to list the columns in upstream or downstream objects.

   You can use the Distance column to determine how far away a column is in the lineage. For example, if the downstream distance is 1,
   then the column is in an object that was created directly from the current object. If the downstream distance is 2, then the column
   exists in an object that was created from an object that was created from the current object.

## Work with tags

The Lineage tab provides an integrated governance experience that lets you view the lineage of columns, identify columns that should
have tags, and apply new tags all in the same workflow.

Whether you can see and apply tags depends on the access control privileges of the role you are using to view the Lineage tab. For
information about the privileges required to work with tags, see [Summary of DDL commands, operations, and privileges](object-tagging/work.md).

### Find tags on an object and its columns

1. Open the Lineage tab and select the object you’re interested in. A side panel opens.
2. To view the tags on the object itself, look under the Details section of the side panel.
3. To view the tags on a column of the object, find the column in the Columns section. If there is a tag, a tag symbol appears next to
   the column name. Hover over the symbol to see the tag name and value.

### Identify and remedy missing tags or incorrect tag values

If there’s a tag on one column, there’s a good chance the same tag should be applied to upstream columns and downstream columns that share
lineage with the column. Similarly, the value of a tag on upstream columns and downstream columns often needs to be the same.

The data lineage workflow identifies tags that are missing from upstream and downstream columns and tags that have a different value. It
then helps you apply the missing tags or change the tag value on those columns.

1. Open the Lineage tab and select the object that contains the column you’re interested in tracing. A side panel opens.
2. Hover over the column name in the side panel, and select View Lineage.
3. In the View Column Lineage dialog, select Downstream Lineage or Upstream Lineage.

   If there are missing tags or mismatched tag values on the downstream or upstream column, a banner appears. You can use the color coding
   in the Tags column to identify what is wrong with the tag:

   * If a tag has a dashed border, the column does not have the tag applied.
   * If a tag has a yellow border, the value of the tag doesn’t match.
4. To remedy these missing or mismatched tags, do the following:

   > 1. Select Review and Apply.
   > 2. After confirming you’d like to accept the proposed changes, select Apply.

## Identify masking policies

1. Open the Lineage tab, and select the object you are interested in. A side panel opens.
2. To view the masking policy on a column of the object, find the column in the Columns section. If the column is protected by a
   masking policy, a symbol appears next to the column name. Hover over the symbol to see the masking policy name and details.

   If there’s a problem with the masking policy, for example there are multiple masking policies assigned to the same column,
   Policy Error appears instead of the mask symbol. If you hover over Policy Error, an explanation of the error appears. For
   additional help identifying why the error might have occurred, see [Tag and policy discovery](tag-based-masking-policies.md) and
   [Troubleshoot tag-based masking policies](tag-based-masking-policies.md).

## Lineage created by a stored procedure or task

A stored procedure or task can result in lineage between an upstream object and a downstream object. You can select the arrow that connects
the objects to obtain more information about the stored procedure or task. You must have privileges to access the stored procedure or task
to view this information.

If the downstream object was created by a stored procedure, the Stored Procedures section contains the following
information:

* Direct — Displays the name of the stored procedure that, when executed, resulted in the downstream object.
* Root — If the direct stored procedure is nested within other stored procedures, this field displays the name of the stored
  procedure that is at the top of the hierarchy of nested procedures.

To view additional information about a stored procedure, select the Go to procedure icon next to its name.

Keep in mind the following:

* If you [call a stored procedure anonymously](../sql-reference/sql/call-with.md), details about the stored procedure do not appear in the
  lineage.
* Details about stored procedures and tasks are not backfilled. Lineage that occurred before the introduction of support for stored
  procedures and tasks doesn’t include details about the stored procedure or task.

## Retrieve lineage programmatically

You can use the [GET_LINEAGE (SNOWFLAKE.CORE)](../sql-reference/functions/get_lineage-snowflake-core.md) function to retrieve lineage information programmatically. This
function returns a subset of the information provided by the Lineage tab in Snowsight.

## Supported operations for data lineage

The following operations create upstream and downstream relationships between a source object and a target object:

* [COPY INTO](../sql-reference/sql/copy-into-table.md)
* [CREATE TABLE … AS SELECT](../sql-reference/sql/create-table.md) (CTAS)
* [CREATE TABLE … CLONE](../sql-reference/sql/create-table.md)
* [CREATE VIEW](../sql-reference/sql/create-view.md)
* [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md)
* [CREATE SEMANTIC VIEW](../sql-reference/sql/create-semantic-view.md)
* [INSERT … SELECT …](../sql-reference/sql/insert.md)
* [MERGE](../sql-reference/sql/merge.md)
* [UPDATE](../sql-reference/sql/update.md), for example:

  ```sqlexample
  UPDATE mydb.schema1.table1 FROM mydb.schema2.table2 SET table1.col1 = table2.col1;
  ```

## Supported objects

Data lineage supports data movement and dependency between [table-like objects](../guides-overview-db.md). A
“table-like” object is any object that can be queried like a table, including tables (nothing is more table-like than a
table). Table-like objects include:

* Tables
* Dynamic tables
* External tables
* Iceberg tables
* Views
* Materialized views
* Semantic views

Stages can also participate in data lineage relationships, as can the following machine learning objects.

* [Datasets](../developer-guide/snowflake-ml/dataset.md)
* [Feature Views](../developer-guide/snowflake-ml/feature-store/feature-views.md) (which are actually a dynamic tables or views inside Snowflake)
* [Models](../developer-guide/snowflake-ml/model-registry/overview.md)

Column lineage is supported between columns in any two table-like objects. You can, for example, select a column in a table
to view downstream column lineage, which shows the other table-like objects where that column appears.

> **Note:**
>
> Column lineage is not currently supported for semantic views.

Additionally, you can see tag and masking policy associations if you are using a role that has privileges for managing
tags and masking policies.

### Lineage for objects from external data sources

Snowflake can track data lineage for sources and destinations outside of Snowflake. This provides visibility into how data flows from
external ETL tools and databases into your Snowflake objects, creating a comprehensive view of your entire data pipeline.

For more information, see [External lineage](external-lineage.md).

### ML Lineage

[ML Lineage](../developer-guide/snowflake-ml/ml-lineage.md) specifically supports machine learning relationships, which
focus on how data is used and transformed in machine learning workflows, rather than on simpler movement or dependency
relationships. Relationships between the following types of objects are supported:

* [Datasets](../developer-guide/snowflake-ml/dataset.md)
* [Feature Views](../developer-guide/snowflake-ml/feature-store/feature-views.md) (which is actually a dynamic table or a view inside Snowflake)
* [Models](../developer-guide/snowflake-ml/model-registry/overview.md)

## Access control for lineage information

A role with the following privileges can access the Lineage tab and view an object’s upstream and downstream lineage objects and
dependencies:

* VIEW LINEAGE on the account.
* Any privilege on the objects for which you want to evaluate the lineage, such as SELECT on a table. If you want to let users view the
  lineage of an object without being able to access its data, you can grant the REFERENCES privilege on the object.
* USAGE on the database and schema that contains the object.

The VIEW LINEAGE privilege controls whether a user can view data lineage for their objects. By default, the PUBLIC role has this privilege,
which means everyone has the ability to view lineage. To narrow who can view lineage, you can revoke the VIEW LINEAGE privilege from the
PUBLIC role and grant it to custom roles instead.

You can configure a role to view the full lineage of all objects, even if the role doesn’t have privileges on the objects, database, or schema.
Simply grant the role the RESOLVE ALL privilege on the account, for example, `GRANT RESOLVE ALL ON ACCOUNT TO ROLE lineage_role;`. The
role still requires the VIEW LINEAGE privilege.

If a user does not have privileges on an upstream or downstream object in the lineage graph, the object appears gray with a
message stating that they have insufficient privileges to view the object. The gray object does not imply a terminal node in the lineage
graph; it merely indicates that the user cannot view lineage any further upstream or downstream from that point because they don’t have the
privileges to retrieve that object’s lineage. This behavior also applies to objects and columns protected by other access policies.

A user must have privileges to access the [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) to see the SQL statement that resulted in the
target object.

## Lineage history and retention

Lineage was introduced to Snowflake in November 2024. Lineage information is available as follows:

* Lineage for an object dependency (for example, a view based on a table) that occurred before this date *is* available.
* Lineage for data movement (for example, using a CTAS statement to create a table from another table) that occurred before this date
  *is not* available.

Historical information is retained as follows:

* Column lineage is retained for one year.
* Object lineage is retained for one year.

## Limitations and considerations

* Lineage is not available for the following kinds of objects:

  + Objects in a shared database.
  + Objects in the shared SNOWFLAKE database.
  + Objects in the INFORMATION_SCHEMA of a database.
  + Semantic views created before early February 2026.
* Dynamic tables appear in the lineage graph for other objects, but the Lineage tab does not appear for dynamic tables
  themselves.
* Deleted tables are not shown in the lineage graph, but renamed tables are shown.
* Temporary tables are not shown in the lineage graph.
* Lineage does not include a table that was used for filtering or joining when data did not move from the table to the downstream object. In
  the following example, table `t2` is not considered part of the lineage of table `target_table`:

  ```sqlexample
  CREATE TABLE target_table AS
    SELECT t1.c1, t1.c2
      FROM t1, t2
      WHERE t1.c3 = t2.c3;
  ```
* Lineage cannot track the movement of data that results from separate, disjointed queries. For example, the following set of queries does
  not result in lineage from table `sourceTable1` to table `target_table`.

  ```sqlexample
  SET read_output1 = (SELECT c1 FROM sourceTable1);

  INSERT INTO target_table(c1) VALUES ($read_output1);
  ```

  This limitation applies to anything that caused the data movement, including stored procedures.
* You cannot use the [GET_LINEAGE (SNOWFLAKE.CORE)](../sql-reference/functions/get_lineage-snowflake-core.md) function to obtain lineage information related to a stored
  procedure.

---
title: Data loading considerations
source: https://docs.snowflake.com/en/user-guide/data-load-considerations.md
section: User Guide
---

# Data loading considerations

This set of topics provides best practices, general guidelines, and important considerations for bulk data loading using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command. It is intended to help simplify and optimize importing of data from data files into Snowflake tables.

> **Note:**
>
> Most considerations in this set of topics pertain to bulk data loading only. However, the [Preparing your data files](data-load-considerations-prepare.md) topic applies to both bulk loading and continuous loading using [Snowpipe](data-load-snowpipe-intro.md).

**Next Topics:**

* [Preparing your data files](data-load-considerations-prepare.md)
* [Planning a data load](data-load-considerations-plan.md)
* [Staging data](data-load-considerations-stage.md)
* [Loading data](data-load-considerations-load.md)
* [Managing regular data loads](data-load-considerations-manage.md)

---
title: Data loading preparation using the Snowpipe REST API
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-rest-gs.md
section: User Guide
---

# Data loading preparation using the Snowpipe REST API

This topic describes how to get started with Snowpipe when calling the REST API, including instructions for installing the required client SDK, creating a stage (if needed) and pipe, and the one-time security setup for each Snowpipe user.

> **Note:**
>
> The instructions in this section assume you already have a target table in your Snowflake database where your data will be loaded.

## Client requirement (Java or Python SDK)

The Snowpipe service requires either the Java SDK or Python SDK. These SDKs are provided by Snowflake for your convenience.

> **Important:**
>
> The binaries are provided as Client Software under the terms of your master service agreement (MSA) with Snowflake.

### Install the Java SDK

1. Download the Java SDK installer from the Maven Central Repository:

   [Sonatype](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-ingest-sdk) (or <https://repo1.maven.org/maven2/net/snowflake/snowflake-ingest-sdk>)
2. Integrate the JAR file into an existing project.

> **Note:**
>
> The developer notes are hosted with the source code on [GitHub](https://github.com/snowflakedb/snowflake-ingest-java).

### Install the Python SDK

Note that the Python SDK requires Python 3.6 or higher.

To install the SDK, execute the following command:

> ```bash
> pip install snowflake-ingest
> ```

Alternatively, download the wheel file from [PyPI](https://pypi.org/project/snowflake-ingest/) and integrate it into an existing project.

> **Note:**
>
> The developer notes are hosted with the source code on [GitHub](https://github.com/snowflakedb/snowflake-ingest-python).

## Step 1: Create a stage (if needed)

Snowpipe supports loading from the following stage types:

* Named internal (Snowflake) or external (Amazon S3, Google Cloud Storage, or Microsoft Azure) stages
* Table stages

Create a named stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command, or you can choose to use an existing stage. You will stage your files temporarily before Snowpipe loads them into your target table.

## Step 2: Create a pipe

Create a new pipe in the system for defining the COPY INTO *<table>* statement used by Snowpipe to load data from an ingestion queue into tables. For information, see [CREATE PIPE](../sql-reference/sql/create-pipe.md).

> **Note:**
>
> Creating a pipe requires the CREATE PIPE access control privilege, as well as the USAGE privilege on the database, schema and stage.

For example, create a pipe in the `mydb.myschema` schema that loads all the data from files staged in the `mystage` stage into the `mytable` table:

> ```sqlexample
> create pipe mydb.myschema.mypipe if not exists as copy into mydb.myschema.mytable from @mydb.myschema.mystage;
> ```

## Step 3: Configure security (per user)

For each user who will execute continuous data loads using Snowpipe, generate a public-private key pair for making calls to the Snowpipe REST endpoints. In addition, grant sufficient privileges on the objects for
the data load (i.e. the target database, schema, and table), the stage object, and the pipe.

If you plan to restrict Snowpipe data loads to a single user, you only need to configure key pair authentication for the user once. After that, you only need to grant access control privileges on the database objects used for each data load.

> **Note:**
>
> To follow the general principle of least privilege, we recommend creating a separate user and role to use for ingesting files using a pipe. The user should be created with this role as its default role.

### Use key pair authentication & key rotation

The Snowpipe REST endpoints require key pair authentication with JSON Web Token (JWT). JWTs are signed using a public/private key pair with
RSA encryption.

As part of this process, you must:

1. Generate a public-private key pair. The generated private key should be in a file (e.g. named `rsa_key.p8`).
2. Assign the public key to your Snowflake user. After you assign the key to the user, run the [DESCRIBE USER](../sql-reference/sql/desc-user.md) command.
   In the output, the `RSA_PUBLIC_KEY_FP` property should be set to the fingerprint of the public key assigned to the user.

For instructions on how to generate the key pair and assign a key to a user, see [Key-pair authentication and key-pair rotation](key-pair-auth.md).

For language-specific examples of creating a fingerprint and generating a JWT token, see the following sections:

> * [Python](../developer-guide/sql-api/authenticating.md)
> * [Java](../developer-guide/sql-api/authenticating.md)
> * [Node.js](../developer-guide/sql-api/authenticating.md)

### Grant access privileges

Calling the Snowpipe REST endpoints requires a role with the following minimum privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Named pipe | OPERATE (`insertFiles` endpoint), MONITOR (`insertReport`, `loadHistoryScan` endpoints) |  |
| Named stage | USAGE (external stage) , READ (internal stage) |  |
| Named file format | USAGE | Optional; only needed if the either the stage (see Step 1: Create a Stage (If Needed)) or the pipe (see Step 2: Create a Pipe) references a named file format. |
| Target database | USAGE |  |
| Target schema | USAGE |  |
| Target table | INSERT , SELECT |  |

Use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command to grant these privileges to the role.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) or higher, or another role with both the CREATE ROLE privilege on the account and the global MANAGE GRANTS privilege, can create roles and grant privileges.

For example, create a role named `snowpipe1` that can load data via a pipe named `mypipe`. The pipe references an external stage:

```sqlexample
 -- Create a role for the Snowpipe privileges.
use role securityadmin;

create or replace role snowpipe1;

-- Grant the USAGE privilege on the database and schema that contain the pipe object.
grant usage on database mydb to role snowpipe1;
grant usage on schema mydb.myschema to role snowpipe1;

-- Grant the INSERT and SELECT privileges on the target table.
grant insert, select on mydb.myschema.mytable to role snowpipe1;

-- Grant the USAGE privilege on the external stage.
grant usage on stage mydb.myschema.mystage to role snowpipe1;

-- Grant the OPERATE and MONITOR privileges on the pipe object.
grant operate, monitor on pipe mydb.myschema.mypipe to role snowpipe1;

-- Grant the role to a user
grant role snowpipe1 to user jsmith;

-- Set the role as the default role for the user
alter user jsmith set default_role = snowpipe1;
```

## Step 4: Stage data files

Copy data files to the internal or external stage you created for loading files using Snowpipe.

* Copy files to an external stage using the tools provided by the cloud storage service.
* Copy files to an internal stage using the [PUT](../sql-reference/sql/put.md) command.

  > **Note:**
  >
  > If your Snowflake account is hosted on Amazon Web Services, we recommend always using the PUT … OVERWRITE = TRUE syntax.
  >
  > Amazon S3 provides read-after-write consistency for new objects created in a bucket. However, if a HEAD or GET request for an object is made before it is created, then S3 provides *eventual consistency* for the object. This means that an immediate request for a new object after it is created could return a `file not found` exception. Setting the OVERWRITE = TRUE parameter avoids the initiation of a HEAD request prior to the creation of the object in the S3 bucket.
  >
  > For more information about the S3 consistency model, see the [S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/dev/Introduction.html#ConsistencyModel).

**Next:** Learn how to call the public REST endpoints to load data and retrieve load history reports, in [Overview of the Snowpipe REST endpoints to load data](data-load-snowpipe-rest-overview.md).

---
title: Data sharing with dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-data-sharing.md
section: User Guide
---

# Data sharing with dynamic tables

Dynamic tables are shareable objects. To share a dynamic table, data sharing providers grant privileges on a dynamic table to a share, which
in turn can be used by data sharing consumers.

## How data is shared with dynamic tables

To share a dynamic table with other Snowflake accounts, you can add dynamic tables to a share or to an application package.

* To share a dynamic table with accounts in your region, you can use a Direct Share. For more information, see [Data sharing and collaboration in Snowflake](../guides-overview-sharing.md).
* To share a dynamic table with accounts in other regions, add the share or application package to a listing as a data product and set up Cross-Cloud Auto-Fulfillment. For more information, see [Create and publish a listing](../collaboration/provider-listings-creating-publishing.md).

A data sharing provider can choose to grant the SELECT privilege on a single dynamic table or grant the SELECT privilege on all dynamic
tables in a database, as shown in the following examples.

```sqlexample
GRANT SELECT ON ALL DYNAMIC TABLES IN SCHEMA mydb.public TO SHARE share1;

GRANT SELECT ON DYNAMIC TABLE mydb.public TO SHARE share1;
```

For more details, see [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md).

## Create a dynamic table to ingest shared data

When you use a dynamic table to ingest shared data, the query can’t select from a shared dynamic table or a shared secure view that references
an upstream dynamic table.

To create a dynamic table to ingest shared data, do the following:

1. Ensure that you have the [right privileges](../sql-reference/sql/create-database.md), and create a database from a share and grant
   privileges on it.

   ```sqlexample
   CREATE DATABASE my_shared_db FROM SHARE provider_account.share1;
   ```
2. [Grant privileges](data-share-consumers.md) to the shared database.
3. Create a shared dynamic table.

> ```sqlexample
> CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
>   TARGET_LAG = '1 day'
>   WAREHOUSE = mywh
>   AS
>     SELECT * FROM my_shared_db.public.mydb;
> ```
>
> > **Note:**
> >
> > Change tracking must be enabled on all underlying objects used by a dynamic table. To use a dynamic table to ingest shared data, the data
> > sharing provider needs to enable `change_tracking` on the shared object. For more information, see
> > [Enable change tracking](dynamic-tables-create.md).

---
title: Data storage considerations
source: https://docs.snowflake.com/en/user-guide/tables-storage-considerations.md
section: User Guide
---

# Data storage considerations

This topic provides guidelines and best practices for controlling data storage costs associated with Continuous Data Protection (CDP), particularly for tables.

CDP, which includes Time Travel and Fail-safe, is a standard set of features available to all Snowflake accounts at no additional cost. However, because your account is charged for all data stored in
tables, schemas, and databases created in the account, CDP does have an impact on storage costs, based on the total amount of data stored and the length of time the data is stored.

Storage is calculated and charged for data regardless of whether it is in the Active, Time Travel, or Fail-safe state. Because these life-cycle states are sequential, updated/deleted data protected by
CDP will continue to incur storage costs until the data leaves the Fail-safe state.

> **Note:**
>
> TIME_TRAVEL_BYTES and FAILSAFE_BYTES will incur charges when you load data using INSERT, COPY or SNOWPIPE. That’s because small micro-partition defragmentation deletes small micro-partitions and creates a new micro-partition that has the same data. The deleted micro-partitions contribute to TIME_TRAVEL_BYTES and FAILSAFE_BYTES.

## Monitoring data storage

### Storage for your account (account administrators only)

If you have been assigned the ACCOUNTADMIN role (i.e. you serve as the top-level administrator for your Snowflake account), you can use [Snowsight](ui-snowsight-gs.md) to view data storage across your entire
account:

Snowsight:
:   In the navigation menu, select Admin » Cost management, and then select Consumption.

This page displays the total average data storage for your account, as well as the total for all databases, internal and named stages, and data in Fail-safe.

For more information, see [Exploring storage cost](cost-exploring-data-storage.md).

### Individual table storage

Any user with the appropriate privileges can view data storage for individual tables. Snowflake provides the following methods for viewing table data storage:

Snowsight:
:   In the navigation menu, select Catalog » Database Explorer. Then select the *<db_name>* » Tables.

SQL:
:   Execute a [SHOW TABLES](../sql-reference/sql/show-tables.md) command.

    or

    Query either of the following:

    * [TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md) view (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
    * [TABLE_STORAGE_METRICS view](../sql-reference/account-usage/table_storage_metrics.md) view (in [Account Usage](../sql-reference/account-usage.md)).

Of the three methods, [TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md) provides the most detailed information because it includes a breakdown of the physical storage
(in bytes) for table data in the following three states of the CDP life-cycle:

* Active (ACTIVE_BYTES column)
* Time Travel (TIME_TRAVEL_BYTES column)
* Fail-safe (FAILSAFE_BYTES column)

The view also provides columns for distinguishing between owned storage and referenced storage that occurs when cloning tables (see section below).

### Staged file storage (for data loading)

To support bulk loading of data into tables, Snowflake utilizes stages where the files containing the data to be loaded are stored. Snowflake supports both internal stages and external stages.

Data files staged in Snowflake internal stages are not subject to the additional costs associated with Time Travel and Fail-safe, but they do incur standard data storage costs. As such, to help
manage your storage costs, Snowflake recommends that you monitor these files and remove them from the stages once the data has been loaded and the files are no longer needed. You can choose to
remove these files either during data loading (using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command) or afterwards (using the [REMOVE](../sql-reference/sql/remove.md) command).

For more information, see [Data loading considerations](data-load-considerations.md).

> **Tip:**
>
> Periodic purging of staged files can have other benefits, such as improved data loading performance.

## Cloned table, schema, and database storage

Snowflake’s zero-copy cloning feature provides a convenient way to quickly take a “snapshot” of any table, schema, or database and create a derived copy of that object which initially shares the
underlying storage. This can be extremely useful for creating instant backups that do not incur any additional costs (until changes are made to the cloned object).

However, cloning makes calculating total storage usage more complex because each clone has its own separate life-cycle. This means that changes can be made to the original object or the clone
independently of each other and these changes are protected through CDP.

For example, when a clone is created of a table, the clone utilizes no data storage because it shares all the existing micro-partitions of the original table at the time it was cloned; however,
rows can then be added, deleted, or updated in the clone independently from the original table. Each change to the clone results in new micro-partitions that are owned exclusively by the clone
and are protected through CDP.

In addition, clones can be cloned, with no limitations on the number or iterations of clones that can be created (e.g. you can create a clone of a clone of a clone, and so on), which results
in a n-level hierarchy of cloned objects, each with their own portion of shared and independent data storage.

> **Note:**
>
> This storage behavior applies to standard tables but not hybrid tables. When you clone a database that contains hybrid tables, additional storage costs are incurred. See [Clone databases that contain hybrid tables](tables-hybrid-clone.md).

### Table IDs

Every Snowflake table has an ID that uniquely identifies the table. In addition, every table is also associated with a CLONE_GROUP_ID. If a table has no clones, then the ID and CLONE_GROUP_ID are
identical. These IDs are displayed in the [TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md) view.

### Owned storage versus referenced storage

When a table is cloned, it is assigned a new ID and the CLONE_GROUP_ID for the original table. At the instant the clone is created, all micro-partitions in both tables are fully shared. The storage
associated with these micro-partitions is owned by the oldest table in the clone group and the clone references these micro-partitions.

After a clone is created, both tables within the clone group have separate life-cycles, such that any DML operations on either table create new micro-partitions that are owned by their respective
tables. Storage associated with these micro-partitions can be queried using the RETAINED_FOR_CLONE_BYTES column in the
[TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md) view.

Because every table within a clone group has an independent life-cycle, ownership of the storage within these tables sometimes needs to be transferred to a different table within the clone group.
For example, consider a clone group that consists of:

> |  |  |  |  |  |
> | --- | --- | --- | --- | --- |
> | Original table: |  | Cloned to: |  | Cloned to: |
> | **T1** | » | **T2** | » | **T3** |

If T2 and T3 share some micro-partitions and T2 is dropped, then ownership of that storage must be transferred before T2 enters Fail-safe. In Snowflake, this transfer occurs at the time the
micro-partitions exit the Time Travel state and would otherwise enter Fail-safe. In the case above, the micro-partitions that were previously owned by T2 are transferred to T3 when the Time Travel
retention period expires.

## Managing costs for short-lived tables

CDP is designed to provide long-term protection for your data. This data is typically stored in permanent tables. Unless otherwise specified at the time of their creation, tables in Snowflake
are created as permanent.

During an ETL or data modeling process, tables may be created that are short-lived. For these tables, it does not make sense to incur the storage costs of CDP. Snowflake provides two separate
mechanisms to support short-lived tables:

* Temporary tables
* Transient tables

### Temporary tables

Similar to other SQL databases, a temporary table exists only within a single user session and only within the duration of the session. Snowflake temporary tables have no Fail-safe and have a Time
Travel retention period of only 0 or 1 day; however, the Time Travel period ends when the table is dropped.

Thus, the maximum total CDP charges incurred for a temporary table are 1 day (or less if the table is explicitly dropped or dropped as a result of terminating the session). During this period, Time
Travel can be performed on the table.

> **Important:**
>
> A connection and a session are different concepts within Snowflake. When logged into Snowflake, one or more sessions may be created. A Snowflake session is only terminated if the user
> explicitly terminates the session or the session times out due to inactivity after 4 hours. Disconnecting from Snowflake does not terminate the active sessions. Thus, a Snowflake session may be
> very long-lived and any temporary tables created within that session will continue to exist until they are dropped or the session is terminated.
>
> To avoid unexpected storage costs for temporary tables, Snowflake recommends creating them as needed within a session and dropping them when they are no longer required.

### Transient tables

Transient tables are unique to Snowflake. They have characteristics of both permanent and temporary tables:

* In contrast to temporary tables, transient tables are not associated with a session; they are visible to all users who have permissions to access that table. Also, similar to permanent tables,
  they persist beyond the session in which they were created.
* In keeping with temporary tables, transient tables have no Fail-safe and have a Time Travel retention period of only 0 or 1 day.

Thus, the maximum total CDP charges incurred for a transient table are 1 day. During this period, Time Travel can be performed on the table.

## Managing costs for large, high-churn tables

In data platforms, tables are typically either *fact* or *dimension* tables, which have different usage patterns and, therefore, different storage considerations:

* Fact tables are typically very large in size and experience a low degree of churn (row updates or deletes). Most changes to fact tables are inserts of new data or, in some cases, deletions of
  older data. CDP is ideal for fact tables as it provides full data protection at a very low storage cost.
* Dimension tables have a different update pattern. Row updates and deletions are much more common in dimension tables. When one or more rows of a table are updated or deleted, the underlying
  micro-partitions that store this data begin the life-cycle transitions associated with CDP. For high-churn dimension tables, the resulting storage associated with Time Travel and Fail-safe data
  can be much larger than the active table storage.

For the vast majority of dimension tables, the CDP storage cost associated with these updates are reasonable. Dimension tables are usually small in size and even if frequently updated, the cost of
storage in Snowflake is inexpensive and the benefits of CDP far outweigh the costs.

For some larger, high-churn dimension tables, the storage costs associated with CDP can be significant. When multiple updates are made to a table, all of the impacted micro-partitions are re-created
and then they transition through the CDP storage life-cycle.

High-churn dimension tables can be identified by calculating the ratio of FAILSAFE_BYTES divided by ACTIVE_BYTES in the [TABLE_STORAGE_METRICS](../sql-reference/info-schema/table_storage_metrics.md)
view. Any table with a large ratio is considered to be a high-churn table. Because storage in Snowflake is inexpensive and most high-churn tables consume a modest amount of total storage, even if
the ratio is high, the preferred option is to create these tables as permanent and use CDP to protect the data.

In some cases, the cost of storage for high-churn dimension tables is excessive and you might prefer an alternative option to CDP. As an extreme example, consider a table with rows associated with
every micro-partition within the table (consisting of 200 GB of physical storage). If every row is updated 20 times a day, the table would consume the following storage:

> Active:
> :   200 GB
>
> Time Travel:
> :   4 TB
>
> Fail-safe:
> :   28 TB
>
> Total Storage:
> :   32.2 TB

For large, high-churn dimension tables that incur overly-excessive CDP costs, the solution is to create these tables as transient with zero Time Travel retention (i.e. DATA_RETENTION_TIME_IN_DAYS=0)
and then copy these tables on a periodic basis into a permanent table. This effectively creates a full backup of these tables. Because each backup is protected by CDP, when a new backup is created,
the old one can be deleted.

Using the example above, the storage costs associated with the same 200 GB, high-churn dimension table that was backed up once a day would be:

> Active:
> :   200 GB
>
> Time Travel:
> :   200 GB
>
> Fail-safe:
> :   1.4 TB
>
> Backup:
> :   200 GB
>
> Total Storage:
> :   2 TB

> **Tip:**
>
> The backups should be performed as often as necessary to ensure full recovery in the event of data loss. For these tables, Snowflake recommends backups be taken at least once a day.

---
title: Data types for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-data-types.md
section: User Guide
---

# Data types for Apache Iceberg™ tables

Snowflake supports most of the data types defined by the [Apache Iceberg™ specification](https://iceberg.apache.org/spec/),
and writes Iceberg data types to table files so that your Iceberg tables remain interoperable across different compute engines
when you use Snowflake as the catalog.

For an overview of the Iceberg data types that Snowflake supports,
see Supported data types.

## Approximate types

If your table uses an Iceberg data type that Snowflake doesn’t support an *exact match* for,
Snowflake uses an *approximate* Snowflake type. This type mapping affects column values for converted tables and Iceberg tables
that use Snowflake as the catalog.

For example, consider a table with a column of Iceberg type `int`. Snowflake processes the column values using the
Snowflake data type NUMBER(10,0).

NUMBER(10,0) has a range of (-9,999,999,999, +9,999,999,999),
but `int` has a more limited range of (-2,147,483,648, +2,147,483,647). If you try to insert a value of 3,000,000,000 into that column,
Snowflake returns an out-of-range error message.

For details about approximate types, see the notes in the Supported data types table.

## Supported data types

The tables in this section show the relationship between Iceberg data types and Snowflake data types. They use the following columns:

Iceberg type:
:   The data type defined in the Apache Iceberg specification.
    When you use Snowflake as the catalog, Snowflake writes the Iceberg type to your table data files so that your tables remain
    interoperable across different compute engines.

Snowflake type:
:   The Snowflake data type that is used to process and return table data. For example,
    if your schema specifies the Iceberg type `timestamp`, Snowflake processes and returns values using the Snowflake data type
    TIMESTAMP_NTZ(6) with microsecond precision.

Notes:
:   Additional usage notes, including notes for working with approximate types.

### Numeric types

#### Snowflake as the catalog

The following table shows how Iceberg numeric data types map to Snowflake numeric data types for tables that use Snowflake as the
Iceberg catalog (Snowflake-managed tables). When you create Snowflake-managed Iceberg table, you can use
[Iceberg data types](https://iceberg.apache.org/spec/#schemas-and-data-types) to define numeric columns.

| Iceberg data type | Snowflake data type | Notes |
| --- | --- | --- |
| `int` (32-bit signed integer) | [NUMBER(10,0)](../sql-reference/data-types-numeric.md) | Inserting a 10-digit number smaller than the minimum or larger than the maximum 32-bit signed integer value results in an out-of-range error. |
| `long` (64-bit signed integer) | [NUMBER(19,0)](../sql-reference/data-types-numeric.md) | Inserting a 19-digit number smaller than the minimum or larger than the maximum 64-bit signed integer value results in an out-of-range error. |
| `float` (single-precision 32-bit IEEE 754 floating point) | [FLOAT](../sql-reference/data-types-numeric.md) | Synonymous with the Snowflake DOUBLE data type. Snowflake treats all floating-point numbers as double-precision 64-bit floating-point numbers, but writes Iceberg floats as 32-bit floating-point numbers in table data files.  Narrowing conversions from 64 bits to 32 bits results in precision loss.  You can’t use `float` or `double` as primary keys (in accordance with the [Apache Iceberg spec](https://iceberg.apache.org/spec/#identifier-field-ids)). |
| `double` (double-precision 64-bit IEEE 754 floating point) | [FLOAT](../sql-reference/data-types-numeric.md) | Synonymous with the Snowflake DOUBLE data type. Snowflake treats all floating-point numbers as double-precision 64-bit floating-point numbers.  Narrowing conversions from 64 bits to 32 bits results in precision loss.  You can’t use `float` or `double` as primary keys (in accordance with the [Apache Iceberg spec](https://iceberg.apache.org/spec/#identifier-field-ids)). |
| `decimal(P,S)` | [NUMBER(P,S)](../sql-reference/data-types-numeric.md) | Specifying `decimal(10,0)` instead of `int` creates a decimal type in Iceberg. The same applies when you specify `decimal(19,0)`. |

#### External catalog

When you create an Iceberg table that uses an external Iceberg catalog, Iceberg numeric types are mapped to Snowflake numeric types according
to the following table.

| Iceberg data type | Snowflake data type |
| --- | --- |
| `int` (32-bit signed integer) | [NUMBER(10,0)](../sql-reference/data-types-numeric.md) |
| `long` (64-bit signed integer) | [NUMBER(19,0)](../sql-reference/data-types-numeric.md) |
| `float` (single-precision 32-bit IEEE 754 floating point) | [FLOAT](../sql-reference/data-types-numeric.md) |
| `double` (double-precision 64-bit IEEE 754 floating point) | [FLOAT](../sql-reference/data-types-numeric.md) |
| `decimal(P,S)` | [NUMBER(P,S)](../sql-reference/data-types-numeric.md) |

> **Note:**
>
> You can’t use `float` or `double` as primary keys (in accordance with the
> [Apache Iceberg spec](https://iceberg.apache.org/spec/#identifier-field-ids)).

### Other data types

> **Note:**
>
> For non-numeric data types, specify the Snowflake data type in your table DDL when you use Snowflake as the catalog (for example, use a
> structured ARRAY instead of the `list` type).
> Snowflake automatically maps each Snowflake type to the corresponding Iceberg data type in the
> table metadata for interoperability with external Iceberg tools.

| Iceberg data type | Snowflake data type | Notes |
| --- | --- | --- |
| `boolean` | [BOOLEAN](../sql-reference/data-types-logical.md) |  |
| `date` | [DATE](../sql-reference/data-types-datetime.md) |  |
| `time` | [TIME(6)](../sql-reference/data-types-datetime.md) | Microsecond precision per the Apache Iceberg table specification. |
| `timestamp` | [TIMESTAMP_NTZ(6)](../sql-reference/data-types-datetime.md) | Microsecond precision per the Apache Iceberg table specification.  You can also use the Parquet physical type `int96` for timestamps. Snowflake translates `timestamp` to microseconds (per the Apache Iceberg table specification). |
| `timestamptz` | [TIMESTAMP_LTZ(6)](../sql-reference/data-types-datetime.md) | Microsecond precision per the Apache Iceberg table specification.  You can also use the Parquet physical type `int96` for timestamps. Snowflake translates `timestamp` to microseconds (per the Apache Iceberg table specification). |
| `string` | [VARCHAR(134217728)](../sql-reference/data-types-text.md) | The default size is 128 MB, and the only size that you can specify explicitly is 134217728 (128 MB). |
| `uuid` | [UUID](../sql-reference/data-types-uuid.md) |  |
| `fixed(L)` | [BINARY(L)](../sql-reference/data-types-text.md) | You can create an Iceberg table that uses this type, but you can’t convert a table that has a column of type `fixed(L)`. Inserting a value that doesn’t exactly match L bytes in length results in an error.  **Important:** To use the `fixed(L)` data type, you must enable the 2026_02 behavior-change bundle in your account. For instructions on how to enable this bundle, see Use the Iceberg fixed(L) or binary primitive data types. |
| `binary` | [BINARY(67108864)](../sql-reference/data-types-text.md) | The default size is 64 MB, and the only size that you can specify explicitly is 67108864 (64 MB).  **Important:** To use the `binary` data type, you must enable the 2026_02 behavior-change bundle in your account. For instructions on how to enable this bundle, see Use the Iceberg fixed(L) or binary primitive data types. |
| `struct` | [Structured OBJECT](../sql-reference/data-types-structured.md) | Structured type columns support a maximum of 1000 sub-columns. |
| `list` | [Structured ARRAY](../sql-reference/data-types-structured.md) | Structured type columns support a maximum of 1000 sub-columns. |
| `map` | [MAP](../sql-reference/data-types-structured.md) | Structured type columns support a maximum of 1000 sub-columns. |

#### Use the Iceberg `fixed(L)` or `binary` primitive data types

To use the Iceberg `fixed(L)` or `binary` primitive data types, you must enable the 2026_02 behavior-change bundle in your account.

To [enable this bundle in your account](../release-notes/bcr-bundles/managing-behavior-change-releases.md), execute the following statement:

```sqlexample
SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2026_02');
```

### Iceberg v3 data types

> **Important:**
>
> To use Iceberg v3 data types, you must specify the version of the Apache Iceberg™ specification that your Iceberg table conforms to as
> `3`. For instructions on how to specify a version, see [Configure the default Iceberg version](tables-iceberg-v3-specification-support.md).

The following table shows the Apache Iceberg™ v3 data types that you can use with Iceberg tables:

| Iceberg data type | Snowflake data type | Notes |
| --- | --- | --- |
| `geography` | `GEOGRAPHY` | Snowflake supports the [GEOGRAPHY data type](../sql-reference/data-types-geospatial.md) in [Apache Iceberg™ tables](tables-iceberg.md). You can create a Snowflake-managed or externally managed Iceberg table with a GEOGRAPHY column. To create an Iceberg table with a GEOGRAPHY column, use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command.  **Caution:** Iceberg uses the WKB format to store geography data. This format can’t represent all data that can be contained in Snowflake GEOGRAPHY values. The `Feature` and `FeatureCollection` types aren’t supported in WKB format. When inserting GEOGRAPHY values with these types into an Iceberg table, Snowflake converts features to their underlying geography objects and drops all properties. The automatic conversion for Iceberg tables behaves identically to the [ST_ASWKB](../sql-reference/functions/st_aswkb.md) function.  For GEOGRAPHY objects, the SRID is always 4326. |
| `geometry` | `GEOMETRY` | Snowflake supports the [GEOMETRY data type](../sql-reference/data-types-geospatial.md) in [Apache Iceberg™ tables](tables-iceberg.md). You can create a Snowflake-managed or externally managed Iceberg table with a GEOMETRY column. To create an Iceberg table with a GEOMETRY column, use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command.  **Note:** All GEOMETRY objects in a single column must have the same SRID. |
| `timestamp_ns` | `TIMESTAMP_NTZ(9)` | Nanosecond precision per the Apache Iceberg table specification. No timezone semantics (wall-clock). TIMESTAMP(9) is first mapped to either the TIMESTAMP_NTZ(9) or the TIMESTAMP_LTZ(9) Snowflake type, depending on the value of the Snowflake parameter [TIMESTAMP_TYPE_MAPPING](../sql-reference/parameters.md). Then, it is mapped to the appropriate Iceberg data type. |
| `timestamptz_ns` | `TIMESTAMP_LTZ(9)` | Nanosecond precision per the Apache Iceberg table specification. Stored in UTC.  TIMESTAMP(9) is first mapped to either the TIMESTAMP_NTZ(9) or the TIMESTAMP_LTZ(9) Snowflake type, depending on the value of the Snowflake parameter [TIMESTAMP_TYPE_MAPPING](../sql-reference/parameters.md). Then, it is mapped to the appropriate Iceberg data type. |
| `variant` | [VARIANT](../sql-reference/data-types-semistructured.md) | Snowflake initially developed the [VARIANT](../sql-reference/data-types-semistructured.md) data type for standard Snowflake tables.  VARIANT provides efficient binary encoding for dynamic semi-structured data such as JSON, Avro, Protobuf, which makes it easier to work with and operate on data containing other nested data types. For more information, see [Semi-structured data types](../sql-reference/data-types-semistructured.md) and [Introduction to loading semi-structured data](semistructured-intro.md).  **Shredding**  Snowflake provides built-in shredding (also called [subcolumnarization](semistructured-considerations.md)) for the VARIANT data type. Shredding is the process of extracting fields from a VARIANT-type column into separate fields, and storing each field in columnar form (subcolumns) that you can traverse and query by using special notation.  Snowflake tracks metadata and statistics for shredded subcolumns, which enables pruning for faster, more efficient queries.  When you insert semi-structured data into a VARIANT column, Snowflake shreds as much of the data as possible.  For more information, see [Semi-structured data files and subcolumnarization](semistructured-considerations.md). |

#### Considerations for the Iceberg v3 data types

Consider the following as you use the Iceberg v3 data types:

**Nanosecond timestamps**

* Usage notes for the `nanosecond timestamps` data type:

  + Use TIMESTAMP_NTZ(9), TIMESTAMP_LTZ(9), or TIMESTAMP(9) in [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) and
    [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) statements. A scale of `9` specifies a new Iceberg nanosecond type.
    A scale of `6` continues to specify the legacy microsecond type.
  + When a scale is omitted, the session-level parameter `ICEBERG_TIMESTAMP_DEFAULT_SCALE` controls the precision.
    The default remains `6` for compatibility. If you want Iceberg timestamp columns to default to nanoseconds, set the
    parameter to `9`.
  + All standard Iceberg partition transforms (for example, identity, bucket, year, month, day, and hour) accept the new nanosecond
    types exactly as they do the microsecond variants.
  + Compatibility

    - **Read/write** - Read and write operations are supported for both Snowflake-managed and externally managed Iceberg tables.
    - **External tools** - No connector changes are required. Nanosecond values are used in read and write operations
      as standard Iceberg `timestamp_ns` and `timestamptz_ns` values.

> **VARIANT**
>
> * Consider the following considerations and limitations when you use the `VARIANT` data type with Iceberg tables:
>
>   + The regular consideration for Iceberg data types apply to the VARIANT data type. For more information, see
>     Considerations for working with data types for Iceberg tables.
>   + The keys for objects in VARIANT columns should be of type STRING.
>   + Using Snowpipe or COPY INTO to load data into Iceberg tables with Variant columns is supported. However, Snowpipe and COPY INTO cannot
>     be used to load data into OBJECT, ARRAY, or MAP columns that contain a nested Variant column.
>   + Nested variants aren’t supported.
>   + Also see [Considerations for semi-structured data stored in VARIANT](semistructured-considerations.md).

#### Examples

The following section contains examples for the Iceberg v3 data types.

##### GEOGRAPHY

To insert data into a GEOGRAPHY column, specify the input data. The following example inserts a geospatial
object that is defined as well-known text (WKT) into the `geog` column of the `geog_points` table that
was created in the previous example:

```sqlexample
INSERT INTO geog_points
  SELECT TO_GEOGRAPHY('POINT(-122.3861109 37.61637595)');
```

You can also insert geospatial data without explicitly constructing the GEOGRAPHY value:

```sqlexample
INSERT INTO geog_points
  SELECT 'POINT(-122.3861109 37.61637595)';
```

##### GEOMETRY

The following example creates an empty Iceberg table that contains a single GEOMETRY column named
`geom` with a default SRID of `4326`.

```sqlexample
CREATE ICEBERG TABLE geo_points (geom GEOMETRY)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'us_states'
  ICEBERG_VERSION = 3;
```

You can also set the SRID explicitly in the DDL statement. The following example sets the SRID to
`4269`:

```sqlexample
CREATE ICEBERG TABLE geo_points (geom GEOMETRY(4269))
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'us_states'
  ICEBERG_VERSION = 3;
```

To insert data into a GEOMETRY column, specify the input data. The following example inserts a geospatial
object defined as well-known text (WKT) into the `geom` column of the `geo_points` table that was created
in the previous example.

```sqlexample
INSERT INTO geo_points
  SELECT TO_GEOMETRY('POINT(-122.3861109 37.61637595)');
```

You can also insert geospatial data without explicitly constructing the GEOMETRY value:

```sqlexample
INSERT INTO geo_points
  SELECT 'POINT(-122.3861109 37.61637595)';
```

If the SRID isn’t available as part of the GEOMETRY object, you can set it explicitly using the constructor function:

```sqlexample
INSERT INTO geo_points
  SELECT TO_GEOMETRY('POINT(-122.3861109 37.61637595)', 4326);
```

##### nanosecond timestamps

The following example creates a managed Iceberg table with nanosecond timestamps:

```sqlexample
CREATE ICEBERG TABLE sensor_readings (
    reading_ntz TIMESTAMP_NTZ(9),
    reading_ltz TIMESTAMP_LTZ(9))
  ICEBERG_VERSION = 3;
```

For this statement, Snowflake performs the following data type mappings:

* The data type of the `reading_ntz` column is mapped to the `timestamp_ns` Iceberg v3 data type.
* The data type of the `reading_ltz` column is mapped to the `timestamptz_ns` Iceberg v3 data type.

##### VARIANT

You can create an Iceberg table with a VARIANT column by using the
[CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command.

The following example creates an empty Snowflake-managed Iceberg table that contains a single VARIANT column named `record`.

```sqlexample
CREATE ICEBERG TABLE car_sales (record VARIANT)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'car_sales'
  ICEBERG_VERSION = 3;
```

Similarly, the following example creates an empty externally managed Iceberg table in a catalog-linked database, to which Snowflake can write.

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA my_namespace;

CREATE ICEBERG TABLE car_sales (record VARIANT)
ICEBERG_VERSION = 3;
```

To insert data into a VARIANT column, you specify the input data format.
The following example uses the [PARSE_JSON](../sql-reference/functions/parse_json.md)
function to insert JSON-formatted data into the `record` column of the `car_sales` table (created previously).

```sqlexample
INSERT INTO car_sales SELECT
  PARSE_JSON(
    '{
        "date" : "2017-04-28",
        "dealership" : "Valley View Auto Sales",
        "salesperson" : {
          "id": "55",
          "name": "John Salesperson"
        },
        "customer" : [
          {"name": "Alice Doe", "phone": "14151234567", "address": "San Francisco, CA"},
          {"name": "Bob Doe", "phone": "14151234567", "address": "San Francisco, CA"}
        ],
        "vehicle" : [
          {"make": "Honda", "model": "Civic", "year": "2017", "price": "20275", "extras":["ext warranty", "paint protection"]}
        ]
      }'
    );
```

Running a SELECT \* FROM statement on the table returns the following output:

```output
+--------------------------------------------+
| RECORD                                     |
|--------------------------------------------|
| {                                          |
|    "customer": [                           |
|      {                                     |
|        "address": "San Francisco, CA",     |
|        "name": "Alice Doe",                |
|        "phone": "14151234567"              |
|      },                                    |
|      {                                     |
|        "address": "San Francisco, CA",     |
|        "name": "Bob Doe",                  |
|        "phone": "14151234567"              |
|      }                                     |
|    ],                                      |
|    "date": "2017-04-28",                   |
|    "dealership": "Valley View Auto Sales", |
|    "salesperson": {                        |
|      "id": "55",                           |
|      "name": "John Salesperson"            |
|    },                                      |
|    "vehicle": [                            |
|      {                                     |
|        "extras": [                         |
|          "ext warranty",                   |
|          "paint protection"                |
|        ],                                  |
|        "make": "Honda",                    |
|        "model": "Civic",                   |
|        "price": "20275",                   |
|        "year": "2017"                      |
|      }                                     |
|    ]                                       |
|  }                                         |
+--------------------------------------------+
```

To query the data in a VARIANT column, you can use dot or bracket notation to access elements nested in the data.

The following example uses dot notation to get the names of all salespeople who sold cars. Since there’s one row in the table,
the query produces a single result value.

```sqlexample
SELECT record:salesperson.name
  FROM car_sales
  ORDER BY 1;
```

Output:

```sqlexample
+-------------------------+
| RECORD:SALESPERSON.NAME |
|-------------------------|
| "John Salesperson"      |
+-------------------------+
```

For more information about querying semi-structured data, see [Querying Semi-structured Data](querying-semistructured.md).

> **Note:**
>
> * When using Apache Spark to read or write Iceberg tables with Variant columns, you must use Apache Spark 4.0 or later which includes
>   Variant support.
>
>   Variant columns in Snowflake-managed Iceberg tables can be read by engines that support Iceberg Variant, such as Apache Spark. Engines
>   can read Snowflake-managed Iceberg v3 tables through the [Horizon Iceberg REST Catalog API](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).
>
>   > ```sqlexample
>   > spark.sql("""
>   > SELECT
>   > variant_get(record, '$.customer[0].name', 'string') AS customer_1_name
>   > variant_get(record, '$.salesperson.name', 'string') AS name
>   > FROM CAR_SALES
>   > ORDER BY name
>   > """).show()
>   > ```
>
>   Similarly, Snowflake can read or write to externally managed Iceberg tables containing Variant columns.
> * Snowflake can write null values to a table, if needed.
>
>   > For example:
>   >
>   > ```sqlexample
>   > INSERT INTO my_table_new
>   >   SELECT ARRAY_CONSTRUCT(
>   >       OBJECT_CONSTRUCT_KEEP_NULL('field1', NULL, 'field2', 123)
>   >   )::ARRAY(OBJECT(field1 STRING, field2 INT));
>   > ```

## Delta data types

The following table shows how Delta data types map to Snowflake data types for
[Iceberg tables created from Delta table files](tables-iceberg-create.md).

| Delta type | Snowflake data type | Note |
| --- | --- | --- |
| BINARY | BINARY |  |
| BOOLEAN | BOOLEAN |  |
| BYTE | NUMBER(3,0) |  |
| DATE | DATE |  |
| DECIMAL(P,S) | NUMBER(P,S) |  |
| DOUBLE | REAL |  |
| FLOAT | REAL |  |
| INTEGER | NUMBER(10,0) |  |
| LONG | NUMBER(20,0) |  |
| SHORT | NUMBER(5,0) |  |
| STRING | TEXT |  |
| TIMESTAMP | TIMESTAMP_LTZ(6) | You can also use the Parquet physical type `int96` for TIMESTAMP, but Snowflake doesn’t support `int96` for TIMESTAMP_NTZ. |
| TIMESTAMP_NTZ | TIMESTAMP_NTZ(6) |  |

The following table shows how Delta nested data types map to Snowflake data types.

| Delta nested type | Snowflake data type |
| --- | --- |
| STRUCT | [Structured OBJECT](../sql-reference/data-types-structured.md) |
| ARRAY | [Structured ARRAY](../sql-reference/data-types-structured.md) |
| MAP | [MAP](../sql-reference/data-types-structured.md) |

## Considerations

Consider the following items when you work with data types for Iceberg tables:

* [Converting a table](tables-iceberg-conversion.md) with columns that use the following Iceberg data types is not supported:

  + `uuid`
  + `fixed(L)`
* For tables that use Snowflake as the catalog, creating a table that uses the Iceberg `uuid` data type is not supported.
* For tables that use an external catalog, you can’t create Iceberg v3 tables with structured type columns, which includes OBJECT, ARRAY,
  or MAP. For example, you can’t use CREATE ICEBERG TABLE … AS SELECT (CTAS) to create an externally managed Iceberg v3 table with
  structured type columns.

  You can create Snowflake-managed Iceberg v3 tables with structured type columns.
* For all Iceberg table types:

  + Structured type columns support a maximum of 1000 sub-columns.
  + Iceberg supports microsecond precision for time and timestamp types. As a result, you can’t create an Iceberg table in Snowflake
    that uses another precision like millisecond or nanosecond.
  + You can’t use `float` or `double` as primary keys (in accordance with the
    [Apache Iceberg spec](https://iceberg.apache.org/spec/#identifier-field-ids)).
  + For Parquet files that use the `LIST` logical type, be aware of the following:

    - The three-level annotation structure with the `element` keyword is supported. For more
      information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md#lists). If your
      Parquet file uses an obsolete format with the `array` keyword, you must regenerate your data based on the supported format.
* For tables created from Delta files, be aware of the following:

  + Parquet files (data files for Delta tables) that use any of the following features or data types aren’t supported:

    - Field IDs.
    - The INTERVAL data type.
    - The DECIMAL data type with precision higher than 38.
    - LIST or MAP types with one-level or two-level representation.
    - Unsigned integer types (INT(signed = false)).
    - The FLOAT16 data type.
  + You can use the Parquet physical type `int96` for TIMESTAMP, but Snowflake doesn’t support `int96` for TIMESTAMP_NTZ.

---
title: Data unloading considerations
source: https://docs.snowflake.com/en/user-guide/data-unload-considerations.md
section: User Guide
---

# Data unloading considerations

This topic provides best practices, general guidelines, and important considerations for unloading data from a table. It is intended to help simplify
exporting data from Snowflake tables into files in stages using the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command.

## Empty strings and NULL values

An empty string is a string with zero length or no characters, whereas NULL values represent an absence of data. In CSV files, a NULL value is typically represented by two successive delimiters (e.g. `,,`) to indicate that the field contains no data; however, you can use string values to denote NULL (e.g. `null`) or any unique string. An empty string is typically represented by a quoted empty string (e.g. `''`) to indicate that the string contains zero characters.

The following file format options enable you to differentiate between empty strings and NULL values when unloading or loading data. For more information about these file formats, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md):

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Use this option to enclose strings in the specified character: single quote (`'`), double quote (`"`), or NONE.

    Enclosing string values in quotes while unloading data is not required. The COPY INTO location command can unload empty string values without enclosing quotes, with the EMPTY_FIELD_AS_NULL option set to
    FALSE. If the EMPTY_FIELD_AS_NULL option is TRUE (which is prohibited), then empty strings and NULL values are indistinguishable in the output file.

    When a field contains this character, escape it using the same character. For example, if the value is the double quote character and a field contains the string `"A"`, escape the double quotes as
    follows: `""A""`.

    Default: `NONE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   * When unloading empty string data from tables, choose one of the following options:

      + Preferred: Enclose strings in quotes by setting the `FIELD_OPTIONALLY_ENCLOSED_BY` option, to distinguish empty strings from NULLs in output CSV files.
      + Leave string fields unenclosed by setting the `FIELD_OPTIONALLY_ENCLOSED_BY` option to `NONE` (default), and set the `EMPTY_FIELD_AS_NULL` value to `FALSE` to unload empty strings as empty fields.

        > **Important:**
        >
        > If you choose this option, make sure to specify a replacement string for NULL data using the `NULL_IF` option, to distinguish NULL values from empty strings in the output file. If you later choose
        > to load data from the output files, you will specify the same `NULL_IF` value to identify the NULL values in the data files.
    * When loading data into tables, use this option to specify whether to insert SQL NULL for empty fields in an input file. If set to FALSE, Snowflake attempts to cast an empty field to the corresponding
      column type. An empty string is inserted into columns of data type STRING. For other column types, the COPY command produces an error.

    Default: `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' ... ] )`
:   When unloading data from tables: Snowflake converts SQL NULL values to the first value in the list. Be careful to specify a value that you want interpreted as NULL. For example, if you are unloading data to a file that will get read by another system, make sure to specify a value that will be interpreted as NULL by that system.

    Default: `\\N` (i.e. NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\` (default))

### Example: Unload and load data with enclosing quotes

In the following example, a set of data is unloaded from the `null_empty1` table to the user’s stage. The output data file is then used to load data into the `null_empty2` table:

```sqlexample
-- Source table (null_empty1) contents
+---+------+--------------+
| i | V    | D            |
|---+------+--------------|
| 1 | NULL | NULL value   |
| 2 |      | Empty string |
+---+------+--------------+

-- Create a file format that describes the data and the guidelines for processing it
create or replace file format my_csv_format
  field_optionally_enclosed_by='0x27' null_if=('null');

-- Unload table data into a stage
copy into @mystage
  from null_empty1
  file_format = (format_name = 'my_csv_format');

-- Output the data file contents
1,'null','NULL value'
2,'','Empty string'

-- Load data from the staged file into the target table (null_empty2)
copy into null_empty2
    from @mystage/data_0_0_0.csv.gz
    file_format = (format_name = 'my_csv_format');

select * from null_empty2;

+---+------+--------------+
| i | V    | D            |
|---+------+--------------|
| 1 | NULL | NULL value   |
| 2 |      | Empty string |
+---+------+--------------+
```

### Example: Unload and load data without enclosing quotes

In the following example, a set of data is unloaded from the `null_empty1` table to the user’s stage. The output data file is then used to load data into the `null_empty2` table:

```sqlexample
-- Source table (null_empty1) contents
+---+------+--------------+
| i | V    | D            |
|---+------+--------------|
| 1 | NULL | NULL value   |
| 2 |      | Empty string |
+---+------+--------------+

-- Create a file format that describes the data and the guidelines for processing it
create or replace file format my_csv_format
  empty_field_as_null=false null_if=('null');

-- Unload table data into a stage
copy into @mystage
  from null_empty1
  file_format = (format_name = 'my_csv_format');

-- Output the data file contents
1,null,NULL value
2,,Empty string

-- Load data from the staged file into the target table (null_empty2)
copy into null_empty2
    from @mystage/data_0_0_0.csv.gz
    file_format = (format_name = 'my_csv_format');

select * from null_empty2;

+---+------+--------------+
| i | V    | D            |
|---+------+--------------|
| 1 | NULL | NULL value   |
| 2 |      | Empty string |
+---+------+--------------+
```

## Unload to a single file

By default, COPY INTO location statements separate table data into a set of output files to take advantage of parallel operations. The maximum size for each file is set using the `MAX_FILE_SIZE` copy option. The default value is `16777216` (16 MB) but can be increased to accommodate larger files. The maximum file size supported is 5 GB for Amazon S3, Google Cloud Storage, or Microsoft Azure stages.

To unload data to a single output file (at the potential cost of decreased performance), specify the `SINGLE = true` copy option in your statement. You can optionally specify a name for the file in the path.

> **Note:**
>
> If the `COMPRESSION` option is set to true, specify a filename with the appropriate file extension for the compression method so that the output file can be decompressed. For example, specify the GZ file extension if the `GZIP` compression method is specified.

For example, unload the `mytable` table data to a single file named `myfile.csv` in a named stage. Increase the `MAX_FILE_SIZE` limit to accommodate the large data set:

```sqlexample
copy into @mystage/myfile.csv.gz from mytable
file_format = (type=csv compression='gzip')
single=true
max_file_size=4900000000;
```

## Unload a relational table to JSON

You can use the OBJECT_CONSTRUCT function combined with the COPY command to convert the rows in a relational table to a single VARIANT column and unload the rows into a file.

For example:

```sqlexample
-- Create a table
CREATE OR REPLACE TABLE mytable (
 id number(8) NOT NULL,
 first_name varchar(255) default NULL,
 last_name varchar(255) default NULL,
 city varchar(255),
 state varchar(255)
);

-- Populate the table with data
INSERT INTO mytable (id,first_name,last_name,city,state)
 VALUES
 (1,'Ryan','Dalton','Salt Lake City','UT'),
 (2,'Upton','Conway','Birmingham','AL'),
 (3,'Kibo','Horton','Columbus','GA');

-- Unload the data to a file in a stage
COPY INTO @mystage
 FROM (SELECT OBJECT_CONSTRUCT('id', id, 'first_name', first_name, 'last_name', last_name, 'city', city, 'state', state) FROM mytable)
 FILE_FORMAT = (TYPE = JSON);

-- The COPY INTO location statement creates a file named data_0_0_0.json.gz in the stage.
-- The file contains the following data:

{"city":"Salt Lake City","first_name":"Ryan","id":1,"last_name":"Dalton","state":"UT"}
{"city":"Birmingham","first_name":"Upton","id":2,"last_name":"Conway","state":"AL"}
{"city":"Columbus","first_name":"Kibo","id":3,"last_name":"Horton","state":"GA"}
```

## Unload a relational table to Parquet with multiple columns

You can unload data in a relational table to a multi-column Parquet file by using a [SELECT](../sql-reference/sql/select.md) statement as input to the COPY statement. The SELECT statement specifies the column data in the relational table to include in the unloaded file. Use the `HEADER = TRUE` copy option to include the column headers in the output files.

For example, unload the rows from three columns (`id`, `name`, `start_date`) in the `mytable` table into one or more files that have the naming format `myfile.parquet`:

```sqlexample
COPY INTO @mystage/myfile.parquet FROM (SELECT id, name, start_date FROM mytable)
  FILE_FORMAT=(TYPE='parquet')
  HEADER = TRUE;
```

> **Note:**
>
> COPY INTO <location> is supported with the following limitations:
>
> * VARIANT, GEOMETRY, and GEOGRAPHY are unloaded as JSON-encoded strings.
> * TIMESTAMP_NTZ(9) is unloaded as milliseconds, not nanoseconds.
> * TIMESTAMP_LTZ(9), ARRAY, OBJECT, and MAP must be cast to other data types.

## Explicitly convert numeric columns to Parquet data types

By default, when table data is unloaded to Parquet files, [fixed-point number](../sql-reference/data-types-numeric.md) columns are
unloaded as DECIMAL columns, while [floating-point number](../sql-reference/data-types-numeric.md) columns are unloaded as
DOUBLE columns.

To choose the Parquet data types for sets of unloaded data, call the [CAST , ::](../sql-reference/functions/cast.md) function in the COPY INTO
*<location>* statement to convert specific table columns to explicit data types. A query in a COPY INTO *<location>* statement
enables selecting specific columns to unload and accepts conversion SQL functions to transform the column data.

Queries in COPY INTO *<location>* statements support the syntax and semantics of SELECT statements to query specific Snowflake table
columns to unload. Convert the data in numeric columns to specific data types using the [CAST , ::](../sql-reference/functions/cast.md) function.

The following table maps Snowflake numeric data types to Parquet physical and logical data types:

| Snowflake Logical Data Type | Parquet Physical Data Type | Parquet Logical Data Type |
| --- | --- | --- |
| TINYINT | INT32 | INT(8) |
| SMALLINT | INT32 | INT(16) |
| INT | INT32 | INT(32) |
| BIGINT | INT64 | INT(64) |
| FLOAT | FLOAT | N/A |
| DOUBLE | DOUBLE | N/A |

The following example shows a COPY INTO *<location>* statement that converts the numeric data in each unloaded column to a different data type to explicitly choose the data types in the Parquet files:

```sqlexample
COPY INTO @mystage
FROM (SELECT CAST(C1 AS TINYINT) ,
             CAST(C2 AS SMALLINT) ,
             CAST(C3 AS INT),
             CAST(C4 AS BIGINT) FROM mytable)
FILE_FORMAT=(TYPE=PARQUET);
```

## Floating-point numbers truncated

When [floating-point number](../sql-reference/data-types-numeric.md) columns are unloaded to CSV or JSON files, Snowflake
truncates the values to approximately (15,9).

The values are not truncated when unloading floating-point number columns to Parquet files.

---
title: Database replication considerations
source: https://docs.snowflake.com/en/user-guide/database-replication-considerations.md
section: User Guide
---

# Database replication considerations

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

This topic describes the behavior of certain Snowflake features in secondary databases when using database replication.
For additional guidance for working with replicated objects and data, refer to [Replication considerations](account-replication-considerations.md).

## Database replication and security objects

This section describes the database replication behavior of security policies and secrets.

Masking & Row Access Policies:
:   The replication operation fails if either of the following conditions is true:

    * The primary database is in an Enterprise (or higher) account and contains a policy/tag but one or more of the accounts approved for
      replication are on lower editions.
    * An object contained in the primary database has a dangling reference to a tag
      in a different database.

    The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
    [replication or failover group](account-replication-intro.md).

Tag-based masking policies:
:   The replication operation fails if either of the following conditions is true:

    * The primary database is in an Enterprise (or higher) account and contains a policy/tag but one or more of the accounts approved for
      replication are on lower editions.
    * An object contained in the primary database has a dangling reference to a tag
      in a different database.

    For more information about tag-based masking policies, refer to [Tag-based masking policies](tag-based-masking-policies.md).

Password, Session, & Authentication Policies:
:   The replication operation fails if either of the following conditions is true:

    * The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
      replication are on lower editions.
    * Either of these objects contained in the primary database is attached to a user in the same account. In this case, Snowflake fails
      the replication operation.

    To avoid the failed database replication operation due to a reference to a user, use a
    [replication or failover group](account-replication-intro.md) instead.

    For details, refer to [Replication and security policies](account-replication-considerations.md).

Secrets:
:   You cannot replicate a secret using database replication. Use a replication or failover group to replicate a secret. For details, see
    [Replication and secrets](account-replication-considerations.md).

## Dangling references

### References to objects in another database

Carefully analyze whether views or table constraints in a primary database reference objects in another database.
For database objects, you can view [object dependencies](object-dependencies.md) in the Account Usage
[OBJECT_DEPENDENCIES view](../sql-reference/account-usage/object_dependencies.md).

The following table describes the database replication behavior when an object (the referencing object) in a database references
an object (the referenced object) in another database:

| Referencing Object | Referenced Object | Replication Behavior |
| --- | --- | --- |
| Non-materialized view | Object | Succeeds |
| Materialized view | Object | Fails |
| Materialized view | Dropped object | Fails |
| Foreign key constraint | Primary key | Fails |
| Table | Sequence | Fails |
| Masking policy, row access policy, or tag | Object policy/tag is assigned to | Fails |
| [Stream](account-replication-considerations.md) | Object | Fails |

#### Non-materialized views

Non-materialized views that reference any object in another database (e.g. table columns, other views, UDFs, or stages) can be
replicated, because this type of reference is name based. Name-based references do not cause replication to fail; however, queries
on the view in secondary databases will fail if the other database(s) are not replicated in the same region.

For example, suppose view `v1` in database `d1` references tables `t1` and `t2` in databases `d1` and `d2`,
respectively. To successfully query view `v1` in the secondary database `d1`, secondary database `d2` must also exist in the
account (e.g. as another secondary database). In addition, for consistent query results with the primary databases, secondary
databases `d1` and `d2` must be refreshed at the same time.

#### Materialized views

Dangling references in materialized views can cause replication to fail with the following error message:

```bash
Dangling references in the snapshot. Correct the errors before refreshing again. The following references are missing (referred entity <- [referring entities])
```

These dangling references can occur if:

* A materialized view references any object in another database.

  Materialized views reference objects by ID rather than name. A database snapshot cannot resolve ID-based references to objects
  outside the database.

  To work around this limitation, replicate both databases together in the same
  [replication or failover group](account-replication-intro.md). Alternatively, you
  can store materialized views and the objects they reference in the same database.
* A materialized view is invalid (i.e. references a dropped object).

  To avoid a dangling reference error for invalid materialized views, identify and fix the problem with the materialized view. Refer
  to the [Troubleshooting](views-materialized.md) section in the materialized views topic.

#### Constraints

Currently, dangling foreign keys cause the replication to fail with the following error message:

```bash
Dangling references in the snapshot. Correct the errors before refreshing again. The following references are missing
(referredentity <- [referring entities])
```

This situation occurs when a foreign key in the primary database references a primary key in another database, or vice-versa. That
is because constraint references are ID-based. A database snapshot cannot resolve ID-based references to objects outside its own
database.

To view the foreign key references in your account, query the Information Schema [TABLE_CONSTRAINTS view](../sql-reference/info-schema/table_constraints.md)
or the Account Usage [TABLE_CONSTRAINTS view](../sql-reference/account-usage/table_constraints.md).

To work around this limitation, replicate both databases together in the same
[replication or failover group](account-replication-intro.md). Alternatively, you can
store linked tables in the same database.

#### Sequences

Currently, dangling sequences cause the replication to fail with the following error message:

```bash
Dangling references in the snapshot. Correct the errors before refreshing again. The following references are missing
(referred entity <- [referring entities])
```

This situation occurs when a table in a primary database references a sequence in another database. That is because sequence references
are ID-based. A database snapshot cannot resolve ID-based references to objects outside its own database.

To work around this limitation, replicate both databases together in the same
[replication or failover group](account-replication-intro.md). Alternatively, you can reference sequences in the
same database.

#### Masking & row access policies and tags

A dangling reference for a [masking policy](security-column-intro.md),
[row access policy](security-row-intro.md), or [tag](object-tagging/interaction.md) causes the replication to fail with
the following error message:

```bash
Dangling references in the snapshot. Correct the errors before refreshing again. The following references are missing
(referred entity <- [referring entities])
```

This situation occurs when the policy/tag and the object that has the policy/tag assigned to it exist in different databases. For
example, a table named `db1.s1.t1`, a row access policy named `db2.s1.rap1`, and the row access policy is assigned to the table.

To work around this limitation, replicate both databases together in the same
[replication or failover group](account-replication-intro.md).

### References to dropped objects

Dropping an object that is referenced by another object in the same, or another, database results in a dangling reference. When an object in the primary database references a dropped object, a replication operation fails with the following error message:

```bash
Dangling references in the snapshot. Correct the errors before refreshing again. The following references are missing
(referred entity <- [referring entities])
```

To work around this limitation, we recommend that you complete any one of the following steps:

* Undrop any referenced objects.
* Modify the referring objects (for example, modify a materialized view using [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md)). Either reference a different object or remove the reference to the dropped object.
* Drop any objects in the primary database that reference dropped objects.

## Replication of multiple databases

When multiple databases are replicated, point in time consistency across databases is not available. A snapshot of each primary database is
created independently and changes to the secondary database are committed independently. This can be problematic if you have views that join
across tables in different databases or depend on cross-database transactions. For example, a transaction that updates two primary databases
atomically might not be reflected in the secondary databases at the same time.

To replicate multiple databases with point in time consistency, use a
[replication or failover group](account-replication-intro.md).

## Dynamic tables and data replication

If a dynamic table references source objects outside database replication, it can still be replicated. However, name
resolution can become complex if the secondary database has a different name than the primary. After failover, this
can lead to unexpected refresh results depending on how the source object is referenced. To prevent this, avoid
renaming the database during replication setup or use failover group replication instead.

In the following diagram, the dynamic table `dt` references a source object `source_table` using a fully qualified
name. For example:

```sqlexample
CREATE DYNAMIC TABLE dt
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = my_wh
  AS
    SELECT * FROM db2.sch1.source_table
```

During replication, `DB1` is renamed to `DB2` in the secondary account. After failover, refreshing the
dynamic table `dt` in `DB2` in the secondary account resolves the source table within the same database, not the
original primary database. While this aligns with name resolution rules, it might lead to unexpected results.

In the following diagram, `dt` references `source_table` using a fully qualified name, and the replication renames
`DB1` to `DB2` in the secondary account. `dt` in the secondary account now references a source table that is
outside of the containing database.

---
title: Databases, Tables & Views
source: https://docs.snowflake.com/en/user-guide/databases.md
section: User Guide
---

# Databases, Tables & Views

All data in Snowflake is maintained in databases. Each database consists of one or more schemas, which are logical groupings of
database objects, such as tables and views. Snowflake does not place any hard limits on the number of databases, schemas
(within a database), or objects (within a schema) you can create.

**Next Topics:**

* [Understanding Snowflake Table Structures](tables-micro-partitions.md)
* [Working with Temporary and Transient Tables](tables-temp-transient.md)
* [Introduction to external tables](tables-external-intro.md)
* [Search optimization service](search-optimization-service.md)
* [Overview of Views](views-introduction.md)
* [Working with Secure Views](views-secure.md)
* [Working with Materialized Views](views-materialized.md)
* [Table Design Considerations](table-considerations.md)
* [Cloning considerations](object-clone.md)
* [Data storage considerations](tables-storage-considerations.md)

---
title: dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake.md
section: User Guide
---

# dbt Projects on Snowflake

[dbt Core](https://github.com/dbt-labs/dbt-core) is an open-source data transformation tool and framework that you can use to define, test, and deploy SQL transformations.

With dbt Projects on Snowflake, you can use familiar Snowflake features to create, edit, test, run, and manage your dbt Core projects, typically as follows:

1. **Start with a valid dbt project:** (With `dbt_project.yml`, `profile.yml`, `/models/...`.) This is stored either in a
   workspace in Snowsight or a Git repository that you’ve connected to Snowflake. Prepare a database, schema, and warehouse with a
   role that has the [necessary privileges](dbt-projects-on-snowflake-access-control.md).
2. **Install dependencies:** Execute the `dbt deps` command within a Snowflake workspace, local machine, or git orchestrator to
   populate the `dbt_packages` folder for your dbt Project.

   For more information, see [Understand dependencies for dbt Projects on Snowflake](dbt-projects-on-snowflake-dependencies.md).
3. **Deploy the DBT PROJECT object:** Create a schema-level DBT PROJECT object by copying your project files into a new version of that
   object. You can do this by using the CREATE OR REPLACE DBT PROJECT … FROM <source> command or the `snow dbt deploy` Snowflake CLI
   command.

   For more information, see [Deploy dbt project objects](dbt-projects-on-snowflake-deploy.md).
4. **Execute the dbt project in Snowflake:** Execute a dbt Core project within a dbt project object by using the EXECUTE DBT PROJECT command
   or the `snow dbt execute` Snowflake CLI command. Executing a dbt project involves invoking dbt Core commands to build or test models;
   this is what you schedule and orchestrate.

   For more information, see [EXECUTE DBT PROJECT](../../sql-reference/sql/execute-dbt-project.md).
5. **Schedule with Snowflake tasks:** Use Snowflake tasks to schedule and orchestrate dbt project runs.

   For more information, see [Schedule runs of dbt Projects on Snowflake](dbt-projects-on-snowflake-schedule-project-execution.md).
6. **Set up CI/CD integrations:** Use Snowflake CLI commands to integrate deployment and execution into your CI/CD workflows.

   dbt project objects support Snowflake CLI commands that you can use to create and manage dbt projects from the command line. This is
   useful for integrating dbt projects into your data engineering workflows and CI/CD pipelines. For more information, see
   [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [Integrating CI/CD with Snowflake CLI](../../developer-guide/snowflake-cli/cicd/integrate-ci-cd.md), and
   [snow dbt commands](../../developer-guide/snowflake-cli/command-reference/dbt-commands/overview.md).
7. **Monitor the dbt project:** Use Snowflake monitoring features to inspect, manage, and tune dbt project execution whether you execute a
   dbt project object manually or use tasks to execute dbt project objects on a schedule.

   For more information, see [Monitor dbt Projects on Snowflake](dbt-projects-on-snowflake-monitoring-observability.md).

## Key concepts

* **dbt project objects:** A *dbt project* is a directory that contains a `dbt_project.yml` file and a set of files that define dbt
  assets, such as models and sources. A DBT PROJECT is a schema-level object that contains versioned source files for your dbt project in
  Snowflake. You can connect a dbt project object to a workspace, or you can create and manage the object independent of a workspace. You
  can CREATE, ALTER, and DROP dbt project objects like other schema-level objects in Snowflake.

  A dbt project object is typically based on a dbt project directory that contains a `dbt_project.yml` file. This is the pattern that
  Snowflake uses when you deploy (create) a dbt project object from within a workspace.

  For more information, see [Understand dbt project objects](dbt-projects-on-snowflake-understanding-dbt-project-objects.md).
* **Schema customization:** dbt uses the default macro `generate_schema_name` to decide where a model is built. You can customize how
  dbt builds your models, seeds, snapshots, and test tables.

  For more information, see [Understand schema generation and customization](dbt-projects-on-snowflake-schema-customization.md).
* **Workspaces:** Workspaces in the Snowflake web interface are a Git-connected web IDE where you can visualize, test, run, and scaffold one
  or many dbt projects, link them to a Snowflake dbt project object to create/update it, and edit other Snowflake code in one place.

  For more information, see [Workspaces for dbt Projects on Snowflake](dbt-projects-on-snowflake-using-workspaces.md).
* **Versioning:** Every dbt project object is versioned; versions live under `snow://dbt/<db>.<schema>.<project>/versions/...`.

  For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).

---
title: DCM Projects files and templates
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-files.md
section: User Guide
---

# DCM Projects files and templates

A DCM project requires a manifest file and one or more SQL object definition files. These files are typically stored and managed in a
Git repository or your local workspace.

* The manifest file

  + Specifies which object definition files to include.
  + Defines configurations for different environments with template variables.
* The object definition files

  + Define a group of Snowflake objects that you want to manage together in the DCM project.

The high-level workflow to create DCM project files is:

1. Create a DCM project folder to store your definition files
2. Create a manifest file
3. Create object definition files

## Create a DCM project folder to store your definition files

To create a new DCM project, create a folder to store your manifest file (`manifest.yml`) and SQL object definition files.

Snowflake CLISnowsight

```snowcli
snow init <project_name> --template DCM_PROJECT
```

The `snow init` command with the `DCM_PROJECT` template creates example definition files in your project directory. You can open and
edit these files to define your DCM project.

1. In the navigation menu, select Projects » Workspaces.
2. In the Workspaces menu, select + Add New.
3. Select DCM Project.

DCM Projects follow the standardized folder structure:

* DCM Projects object definition files must be placed under `sources/definitions/`.
* The optional global macro files can be placed under `sources/macros/`.
* File naming and nesting inside these project directories are flexible.
* Saved output artifacts from DCM commands are always written to `out/`.
* If you have additional scripts or project files that you want to use for DCM commands, you can add them under `sources` (for
  example, dbt project files).
* If you have other custom scripts that you want to store within the project folder that should not be used by DCM Projects commands and not
  uploaded from local, add them in a folder outside of the `sources` folder.
* The CLI commands only upload files within the `sources` folder.

> **Note:**
>
> If you are using Git, add `out/` to your `.gitignore` file to avoid pushing local output files to Git.

An example of a DCM project folder structure is:

```none
my_dcm_project/
  ├── manifest.yml
  ├── sources/
  │   ├── definitions/
  │   │   ├── bronze.sql
  │   │   └── silver.sql
  │   ├── macros/
  │   │   └── global_macro.sql
  │   └── dbt/
  │       ├── my_dbt_project_1/
  │       └── my_dbt_project_2/
  ├── my_post_scripts/
  └── out/
      └── plan/
```

## Create a manifest file

Each DCM project requires a `manifest.yml` file. It holds the essential configuration details of the project and allows the project folder to be
identified as a DCM project.

You use the manifest file to control which DCM project objects and roles to use when deploying to different target environments and to
manage sets of templating values.

The manifest file is a YAML file that contains the following properties:

```yaml
manifest_version: 2
type: DCM_PROJECT
default_target:
targets:
templating:
```

| Property | Required | Description |
| --- | --- | --- |
| `manifest_version` | Required | Version of the manifest schema. The current version is 2. |
| `type` | Required | Type of the project. Set to `DCM_PROJECT`. |
| `default_target` | Optional | If you have more than one target, specify the default target. The Snowflake CLI and Workspaces use the default target if you do not specify a target using the `--target` flag. |
| `targets` | Required | The `targets` section maps each deployment target to a specific Snowflake account, DCM project object, owner role, and optionally a templating configuration. This mapping eliminates the need to pass fully qualified project names and configuration flags in every CLI command. See Project targets for more details. |
| `templating` | Optional | The `templating` section defines the templating configurations to use for the project. See Project templating configurations for more details. |

### Project targets

Each target in the manifest file contains the following properties:

```yaml
targets:
  <target_name>:
    account_identifier:
    project_name:
    project_owner:
    templating_config:
```

| Property | Description |
| --- | --- |
| `account_identifier` | The Snowflake account identifier for this target.  See [Finding the region and locator for an account](../admin-account-identifier.md). |
| `project_name` | The fully qualified name of the DCM project object, for example, `DCM_DEMO.PROJECTS.DCM_PROJECT_DEV`.  Use the [SHOW DCM PROJECTS](../../sql-reference/sql/show-dcm-projects.md) SQL command to find it. |
| `project_owner` | The role with OWNERSHIP on this project object.  Use the [SHOW DCM PROJECTS](../../sql-reference/sql/show-dcm-projects.md) or [DESCRIBE DCM PROJECT](../../sql-reference/sql/desc-dcm-project.md) SQL command to find it. |
| `templating_config` (optional) | The name of the templating configuration defined in the `templating` section, to use for this target. |

#### Map between project definitions and project objects

DCM Projects definition files aren’t strictly tied to a specific DCM project object. You can use the same set of definitions to deploy to multiple
projects, either on different Snowflake accounts or by referencing different configuration profiles. For example, the same definition files
on a repository branch can be deployed to both DEV and PROD accounts as shown in the following figure.

Similarly, you can execute a DCM Projects object by referencing definition files from different paths. For example, your CI/CD automation can deploy
definitions from your main branch, and you can manually run a PLAN from your local definition files against the same project to check
how your definitions diverge from the latest deployment. You can also use this approach for ad-hoc manual deployments from other branches or
local paths.

### Project templating configurations

Following is the high-level structure of the templating configuration in the manifest file. You can set only `defaults`, or only
`configurations`, or both.

```yaml
templating:
  defaults:
    <variable_name>: <value>
  configurations:
    <configuration_name>:
      <variable_name>: <value>
```

| Property | Description |
| --- | --- |
| `defaults` | The shared variable values, in key-value pairs, that apply across all configurations to avoid repetition. |
| `configurations` | The templating configurations to use for the project.  Individual configurations can override defaults with configuration-specific values. For details on how variables are resolved, see Configurations. |
| `<configuration_name>` | The name of the templating configuration.  Configuration names are case-insensitive. |
| `<variable_name>` | The name of the variable.  Variable names should follow [Python variable naming rules](https://www.w3schools.com/python/python_variables_names.asp).  All variables in project definitions must be declared either in defaults, the selected configuration, or at runtime.  If you want string variables to resolve empty, specify them as `""`. |
| `<value>` | The value of the variable.  Values can be strings, numbers, booleans, lists, or dictionaries.  Dictionaries can be defined in the manifest but cannot be overwritten at runtime. |

### Example: manifest.yml

This is an example of a DCM project manifest file (`manifest.yml`) that includes three configurations,
DEV, STAGE, and PROD, with template variables and their default values:

```yaml
manifest_version: 2

type: DCM_PROJECT

default_target: DCM_DEV

targets:
  DCM_DEV:
    account_identifier: MYORG-MYACCOUNT_DEV
    project_name: DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
    project_owner: DCM_DEVELOPER
    templating_config: DEV

  DCM_STAGE:
    account_identifier: MYORG-MYACCOUNT_STAGE
    project_name: DCM_DEMO.PROJECTS.DCM_PROJECT_STG
    project_owner: DCM_STAGE_DEPLOYER
    templating_config: STAGE

  DCM_PROD:
    account_identifier: MYORG-MYACCOUNT_PROD
    project_name: DCM_DEMO.PROJECTS.DCM_PROJECT_PROD
    project_owner: DCM_PROD_DEPLOYER
    templating_config: PROD

templating:
  defaults:
    user: "GITHUB_ACTIONS_SERVICE_USER"
    wh_size: "SMALL"

  configurations:
    DEV:
      env_suffix: "_DEV"
      user: "INSERT_YOUR_USER"
      wh_size: "X-SMALL"
      teams:
        - name: "DEV_TEAM"
          write_access: TRUE

    STAGE:
      env_suffix: "_STG"
      teams:
        - name: "TEST_TEAM_A"
          write_access: TRUE
        - name: "TEST_TEAM_B"
          write_access: FALSE

    PROD:
      env_suffix: ""
      teams:
        - name: "Marketing"
          write_access: FALSE
        - name: "Finance"
          write_access: FALSE
          wh_size: "LARGE"
        - name: "HR"
          write_access: FALSE
        - name: "IT"
          write_access: TRUE
```

## Create object definition files

A DCM project definition file is a template that resolves to valid SQL statements for managing Snowflake objects. Each DCM project
requires at least one definition file.

You can organize your object definitions and grants across multiple files and folders. Snowflake recommends choosing a structure that
represents the business logic of the project (for example, bronze, silver, and gold) rather than grouping by object type.

Definition files can only contain DEFINE, GRANT, or ATTACH statements. Other SQL commands are not supported.

To get started quickly with DCM Projects, you can convert your existing SQL deployment scripts by using the DEFINE keyword for your existing DDLs
(*for supported object types*).

The DEFINE statement works like the [CREATE OR ALTER <object>](../../sql-reference/sql/create-or-alter.md) command, but with the following key
differences:

* The order and location of DEFINE statements don’t matter. Snowflake collects and sorts all statements from all definition
  files during project execution.
* If you remove a DEFINE statement, Snowflake drops the corresponding object the next time you deploy the project.
* Only a subset of Snowflake objects is supported. For details, see [Supported object types in DCM Projects](dcm-projects-supported-entities.md).
* All objects must be defined with a fully qualified name in the format `database.schema.object_name`.

Definition files can contain various Jinja2 templating options and support advanced templating features, which allow you to
do the following:

* Customize file content at runtime using template variables.
* Use Jinja2 syntax for logic such as loops and conditionals.
* Make definition files reusable and adaptable for different scenarios.

### Object definition templating

DCM Projects support the Jinja2 framework for templating SQL statements. You can declare variables and assign values using the Jinja2 syntax either
from configuration profiles, in the [EXECUTE DCM PROJECT](../../sql-reference/sql/execute-dcm-project.md) command, or within Jinja. You can also construct
loops through lists of values, case statements, reusable functions, and more. For more information, see the [Jinja2 documentation](https://jinja.palletsprojects.com/en/stable/).

Supported Jinja2 functionality includes:

* String-replacements
* Lists
* Dictionaries and nested dictionaries
* Conditions (IF statements)
* Looping
* Global and in-file macros

  + Macros defined in the `sources/macros` folder can be used across all definition files.
  + Macros defined in a file can be used within the file.

Unsupported Jinja2 functionality includes:

* Using the following tags:

  + [import](https://jinja.palletsprojects.com/en/stable/templates/#import)
  + [extends](https://jinja.palletsprojects.com/en/stable/templates/#extends)
  + [include](https://jinja.palletsprojects.com/en/stable/templates/#include)

> **Note:**
>
> The `_snow` identifier is reserved for future use, and cannot be used as a variable or macro name

> **Important:**
>
> Do not use DCM Projects templating variables for object definitions that contain sensitive information or credentials.
> The rendered SQL definitions do not redact any values inserted by environment variables.
>
> Similarly, do not enter any personal data, sensitive data, export-controlled data, or other regulated data as metadata, for example, file
> names, configuration and variable names, when using the Snowflake service. For more information, see [Metadata fields in Snowflake](https://docs.snowflake.com/en/sql-reference/metadata).

The following is an example DCM project definition file that uses Jinja2 templating:

```sqlexample-jinja
DEFINE WAREHOUSE DCM_PROJECT_WH_{{db}}
  WITH
    warehouse_size = '{{wh_size}}'
    auto_suspend = 300;
```

The following is an example of a DCM project manifest file (`manifest.yml`) that defines two configurations: DEV and PROD.

```yaml
templating:
  configurations:
    DEV:
      db: "DEV_2"
      wh_size: "X-SMALL"
    PROD:
      db: "PROD"
      wh_size: "LARGE"
```

Rendering this warehouse definition with the DEV configuration (selected automatically through the target’s `templating_config` or at runtime)
resolves to:

```sqlexample
DEFINE WAREHOUSE DCM_PROJECT_WH_DEV_2
  WITH
    warehouse_size = "X-SMALL"
    auto_suspend = 300;
```

#### Macros

Macro files are any SQL files located in the `macros` folder and its sub-folders. They can only contain macros.

An example of a directory structure of a DCM project with macro files is:

```none
My_dcm_project
 |_ manifest.yml
 |_ sources
    |_ definitions
       |_ my_definitions.sql
    |_ macros
       |_ my_global_macros.sql
```

Similar to functions in regular programming languages, macros help organize often-used pieces of code into reusable functions, thereby
avoiding repetition and following the DRY (Don’t Repeat Yourself) principle. Macros in DCM Projects work in the same way as [Jinja2 macros](https://jinja.palletsprojects.com/en/stable/templates/#macros) with the following exceptions:

* Dedicated location for macro files in the `macros` folder.
* Macros defined in macro files are automatically visible in other source files.
  The [import](https://jinja.palletsprojects.com/en/stable/templates/#import) Jinja tag is not permitted.
* Duplicate definition of a macro with the same name is detected and rejected.

##### Automatic import of global macros

During the definition file rendering process, source files are scanned for potential macro calls. If a called macro is defined in a macro
file, the implicit [from […] import tag](https://jinja.palletsprojects.com/en/stable/templates/#import) is added automatically, so
no explicit import is needed.

Similar to [Jinja2 macros](https://jinja.palletsprojects.com/en/stable/templates/#macros), you can define a local macro
by prefixing it with an underscore. A local macro can be used only in the file where it’s declared and isn’t visible to
other files.

#### Template comments

In SQL commands, you can add `--` before your code to comment out the line. Jinja still processes variables within the SQL code but leaves the SQL
comments.

For example, the following Jinja code:

```sqlexample-jinja
-- hello {{ project_owner_role }}
```

Renders as:

```sqlexample
-- hello DCM_DEVELOPER
```

Commented out commands do not execute in SQL. You can use template comments to debug Jinja templating without affecting your SQL code.

To ignore Jinja code during rendering, add `#` inside opening and closing brackets as shown in the following example:

```sqlexample-jinja
{# This Jinja comment will not appear in the rendered output. #}
```

#### Configurations

When using templates in your object definitions, you have the following options:

* Assign values to variables at runtime.
* Define different configuration profiles under `templating: configurations:` in the `manifest.yml`. Each target can reference a
  configuration through `templating_config`. See Project templating configurations for more details.

  If configuration profiles are defined and a target references one through `templating_config`, the configuration is automatically applied
  when using that target. For examples, see [Plan a DCM project](dcm-projects-use.md).

  The primary use case for configuration profiles in DCM Projects is to target different environments. Configuration profiles allow you to do the following:

  + Deploy the same code to multiple environments.
  + Test production code on a non-prod environment at a reduced scale.
  + Maintain multiple isolated environments on the same account.

  Not all templating configurations have to be referenced by a target profile. You can keep unused configurations to switch the templating
  config for your target from one to another.
* Define shared default values under `templating: defaults:` to avoid repeating common variables across configurations. See
  Project templating configurations for more details.
* Overwrite specific variables with one-time values at runtime using the `--variable` flag in CLI.

Variables are resolved with a three-tier hierarchy: global defaults < configuration variables < runtime execution variables.

#### Dictionaries

DCM Projects templating supports dictionaries as variable values, enabling structured configuration for complex multi-tenant or multi-resource
deployments.

By grouping related configuration details into dictionaries, you get:

* Granular control: Apply specific settings, such as warehouse sizes, retention policies, and grants, to individual resources without writing
  unique logic for every variation.
* Cleaner code bases: Replace repetitive hard-coded scripts with dynamic loops that adapt based on the configuration.
* Scalability: Onboard new teams or resources by adding entries to your configuration, rather than refactoring deployment pipelines.

> **Note:**
>
> Dictionaries can be defined in the manifest but can’t be overwritten at runtime with the `--variable` flag or SQL
> `USING CONFIGURATION (...)` overrides. Only scalar values and lists can be overwritten at runtime.

##### Example use case for dictionaries: Multi-tenant environment provisioning

Consider a platform shared by multiple departments, such as Marketing, Finance, and HR, each with different compliance and compute
requirements. With dictionaries, you define a single configuration that captures each team’s needs.

Manifest example:

```yaml
templating:
  defaults:
    user: "GITHUB_ACTIONS_SERVICE_USER"
    wh_size: "X-SMALL"
  configurations:
    PROD:
      env_suffix: ""
      project_owner_role: "DCM_PROD_DEPLOYER"
      teams:
        - name: "Marketing"
          wh_size: "MEDIUM"
          data_retention_days: 14
          needs_sandbox_schema: true
        - name: "Finance"
          wh_size: "X-LARGE"
          data_retention_days: 90
          needs_sandbox_schema: false
        - name: "HR"
          data_retention_days: 30
          needs_sandbox_schema: false
```

Definition example:

Your SQL template loops through this dictionary. It automatically creates schemas, assigns the correct retention policy, and conditionally
creates extra resources only for the teams that request them.

```sqlexample-jinja
-- loop through team dictionaries
{% for team in teams %}
    {% set team_name = team.name | upper %}

    -- inject dictionary values directly into object properties
    define schema DCM_DEMO_1{{env_suffix}}.{{team_name}}
        comment = 'using JINJA dictionary values'
        data_retention_time_in_days = {{ team.data_retention_days }};

    -- pass the name to your macro
    {{ create_team_roles(team_name) }}

    define table DCM_DEMO_1{{env_suffix}}.{{team_name}}.PRODUCTS(
        ITEM_NAME varchar,
        ITEM_ID varchar,
        ITEM_CATEGORY array
    )
    data_metric_schedule = 'TRIGGER_ON_CHANGES';

    {% if team_name == 'HR' %}
        define table DCM_DEMO_1{{env_suffix}}.{{team_name}}.EMPLOYEES(
            NAME varchar,
            ID int
        )
        comment = 'This table is only created in HR';
    {% endif %}

    -- use dictionary booleans to deploy optional infrastructure
    {% if team.needs_sandbox_schema | default(false) %}
        define schema DCM_DEMO_1{{env_suffix}}.{{team_name}}_SANDBOX
            comment = 'Sandbox schema defined via dictionary flag'
            data_retention_time_in_days = 1;
    {% endif %}

{% endfor %}
```

---
title: DCM Projects for data pipelines
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-pipelines.md
section: User Guide
---

# DCM Projects for data pipelines

DCM Projects provide a full-lifecycle developer experience which includes capabilities tailored to managing data pipelines.

The pipeline-specific commands don’t apply to all object types. They extend the core commands for the following pipeline use cases:

* REFRESH command for dynamic tables managed by a DCM project.
* TEST command for data quality expectations attached to managed objects.
* PREVIEW command for checking sample output from a dynamic table, view, or table before deploying.

## REFRESH command for dynamic tables

After you deploy a pipeline definition change, you can refresh the dynamic tables inside the pipeline project before testing data quality
expectations, so that any new transformation logic is applied end to end.

You can refresh all dynamic tables managed by the DCM project and their required upstream dynamic tables with one command.
This command applies only to dynamic tables that are deployed and managed by the referenced project, independent of any definition files.
Other object types, such as tasks, are not affected.

See TEST command for data quality expectations for usage examples that combine REFRESH and TEST.

The command runs until all dynamic table refreshes are complete and returns a summary of the row changes or errors for
each dynamic table.

To run the REFRESH command:

SQLSnowflake CLI

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_STG
  REFRESH ALL;
```

```snowcli
snow dcm refresh --target STAGE --save-output
```

For the REFRESH ALL output format, including the JSON schema and examples, see the
[REFRESH ALL output](../../sql-reference/sql/execute-dcm-project.md) section of the [EXECUTE DCM PROJECT](../../sql-reference/sql/execute-dcm-project.md) command reference.

## TEST command for data quality expectations

You can set data quality expectations as quality gates on all stages of your data transformation:

* Attach expectations to raw data in your bronze layer landing tables to ensure your raw input meets expectations and does not cause errors
  during transformation.
* Attach expectations as quality gates to your silver layer to make it easier to debug data issues by having checkpoints at different
  transformation stages.
* Attach expectations to your gold layer to ensure the output quality of your data product.
* Attach expectations from downstream consumers of your data product to your gold layer so you can validate those expectations before deploying
  breaking changes.

See [Data metric function](dcm-projects-supported-entities.md) for how to attach expectations in DCM projects.

You can test all data quality expectations attached to tables, dynamic tables, or views that are managed by the DCM project with one
command.

Data metric functions that are attached without expectations are not checked.

You can use the CLI commands to set up automated testing as part of your CI/CD workflow. For example, if you have production-like data on a
QA, test, or staging environment, you can follow these steps:

1. PLAN against QA to verify the expected project definition changes.
2. DEPLOY to QA.
3. REFRESH ALL dynamic tables on QA to update data based on any new transformation logic and updated definitions, so that expectations are
   not tested against outdated data.
4. TEST ALL data quality expectations attached to table objects on the QA environment to verify that the newly deployed logic works as
   expected and has no negative side effects on the expected shape of your data output.
5. If all expectations are met on QA, continue with PLAN and DEPLOY to your production environment.

To run the TEST command:

SQLSnowflake CLI

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_STG
  TEST ALL;
```

```snowcli
snow dcm test --target STAGE --save-output
```

For the TEST ALL output format, including the JSON schema and examples, see the
[TEST ALL output](../../sql-reference/sql/execute-dcm-project.md) section of the [EXECUTE DCM PROJECT](../../sql-reference/sql/execute-dcm-project.md) command reference.

## PREVIEW command

When you write or alter the SELECT statement of a dynamic table or view, a sample output helps validate the shape of
the data. For complex lineage graphs with multiple transformation steps, you can check the output of a downstream view or
dynamic table when making changes further upstream.

To validate that the transformation in your code results in the expected data output before deploying, run the PREVIEW command.

The PREVIEW command runs PLAN to compile the current definitions, independent of any deployed state, and then returns a data sample for a
specified dynamic table, view, or regular table.

Keep the following requirements and considerations in mind:

* The PREVIEW command must always reference a fully qualified name of a table object, without Jinja variables.
* To see sample data in the output, you must ensure that data is already available in the source tables.
* PREVIEW queries all SELECT statements of referenced dynamic tables and views, but it does not run tasks or CREATE TABLE AS SELECT statements.

To run the PREVIEW command:

SQLSnowflake CLI

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  PREVIEW
    DCM_PROJECT_DEV.SERVE.V_DASHBOARD_KPI_SUMMARY
  USING CONFIGURATION DEV
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/DCM_Project_Quickstart_1'
  LIMIT 100;
```

```snowcli
snow dcm preview --object DCM_PROJECT_DEV.SERVE.V_DASHBOARD_KPI_SUMMARY --limit 100
```

---
title: Dealing with real-world data in Time-Series Forecasting
source: https://docs.snowflake.com/en/user-guide/ml-functions/preprocessing.md
section: User Guide
---

# Dealing with real-world data in Time-Series Forecasting

Time-series data from the real world is often imperfect, with missing, duplicate, or unaligned time steps. The Forecast
and Anomaly Detection ML Functions include these preprocessing features to help you use your real-world data to train a
model that makes useful predictions:

* You can specify an event frequency to override the frequency that the model automatically infers.
* The model can infer data at missing time steps and aggregate multiple values within a time step. You can specify how
  aggregation should be done for each feature or type of feature, or let the ML function do it for you automatically.

These capabilities let you train a useful model even when your training data has common consistency issues. Generally,
the more consistent your data is, the more accurate your forecasting model will be, but a relatively small number of
such adjustments does not noticeably affect the accuracy of the model.

## Specifying event frequency

Model training infers the frequency of the time steps in your training data using heuristics that, on rare occasions,
choose the wrong frequency. To avoid this risk, or to correct an incorrect inference, you can optionally specify the
desired frequency when initiating training using the CONFIG_OBJECT parameter `frequency`. This parameter
specifies a time period in a form similar to `'1 day'` or `'2 weeks'`.

* The interval specification is a string and therefore must be surrounded with single quotes.
* Supported intervals are seconds, minutes, hours, days, weeks, months, quarters, and years.
* Use the full interval names, not abbreviations. Plurals are acceptable. (“Second” or “seconds,” not “sec”).

The following example shows how to train a forecast model using a frequency of one day.

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model1(
  INPUT_DATA => TABLE(v1),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  CONFIG_OBJECT => {'frequency': '1 day'}
);
```

If you do not specify an event frequency, the training process infers the closest matching event frequency.

## Filling in values for missing time steps

A time stamp that does not have a target value uses:

* Zero if the target value aggregation behavior is SUM (see Handling multiple values in a time step)
* Linear interpolation from nearby values in all other cases

Missing feature values are not filled in, but rather replaced with NULL values. Model training ignores these.

## Handling multiple values in a time step

When there are multiple events in a time step, preprocessing can aggregate their values in various ways. For example,
if the frequency of events is hourly, then values that occur outside of the hourly cadence can be averaged to produce a
value for the nearest canonical hourly timestamp.

The following table summarizes the available aggregation behaviors.

| Kind of value | Available behaviors | Default behavior |
| --- | --- | --- |
| Numeric | * MEAN: average of values * MEDIAN: middle value * MODE: most frequent value * MIN: lowest value * MAX: highest value * SUM: total of values * FIRST: earliest value * LAST: latest value | MEAN |
| Categorical (string or Boolean) | * MODE: most frequent value * FIRST: earliest value * LAST: latest value | MODE |

> **Tip:**
>
> Use the SUM method for count data, such as the number of items sold. MEAN is appropriate for most other numeric values.

All behaviors ignore NULL values and apply over the time period being interpolated or aggregated. For example, the SUM
of values on an hourly cadence is the sum of the values within the hour centered on the canonical time stamp.

You can override the default behavior for a column in two ways:

* By kind of value (target, numeric, or categorical)
* By the exact column name

If you override behaviors in both ways, the column name override takes precedence.

### Overriding by kind of value

Set the following options in the function’s CONFIG_OBJECT parameter to override specific types of values: categorical,
numeric, and target. The behaviors are as previously defined.

| Option | Possible values |
| --- | --- |
| `aggregation_categorical` | MODE, FIRST, LAST |
| `aggregation_numeric` | MEAN, MEDIAN, MODE, MIN, MAX, SUM, FIRST, LAST |
| `aggregation_target` | MEAN, MEDIAN, MODE, MIN, MAX, SUM, FIRST, LAST |

> **Note:**
>
> If `aggregation_target` is not specified, target aggregation uses the behavior, if any, specified by `aggregate_numeric`.
> Otherwise, the default, MEAN, is used.

The following example shows how to set aggregation behaviors for categorical and numeric features.

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model1(
  INPUT_DATA => TABLE(v1),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  CONFIG_OBJECT => {
    'frequency': '1 day',
    'aggregation_categorical': 'MODE',
    'aggregation_numeric': 'MEDIAN'}
);
```

> **Tip:**
>
> Consider specifying all of these values even if you’re using the defaults. That way, you don’t need to know what the
> default behavior is to understand what the statement is doing, and if you want to change the behavior later, you
> won’t need to look up the parameter name.

### Overriding by column name

The `aggregation_column` option in CONFIG_OBJECT is an object that maps behaviors to column names. These behaviors
override any behaviors specified using the parameters described above.

> **Note:**
>
> The aggregation behavior for the target value cannot be specified by column name. Use the `aggregation_target`
> option instead.

For example, the following SQL statement specifies aggregation behaviors for two different columns using the
`aggregation_column` option.

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model1(
  INPUT_DATA => TABLE(v1),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  CONFIG_OBJECT => {
    'frequency': '1 day',
    'aggregation_target': 'MEDIAN',
    'aggregation_column': {
        'temperature': 'MEDIAN',
        'employee_id': 'FIRST'
    }
  }
);
```

---
title: Debugging dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-debug.md
section: User Guide
---

# Debugging dynamic tables

This topic addresses solutions for troubleshooting dynamic tables that don’t run as expected.

Some actions might be restricted due to limitations on using dynamic tables or if you don’t have the necessary privileges. For more
information, see [Dynamic table limitations](dynamic-tables-limitations.md) and [Dynamic table access control](dynamic-tables-privileges.md).

If you encounter an issue not listed here, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

| Issue | Solution |
| --- | --- |
| I can’t see the metadata for my dynamic table. | To view the metadata and Information Schema of a dynamic table, you must use a role that has the MONITOR privilege on that dynamic table. For more information, see [Privileges to view a dynamic table’s metadata](dynamic-tables-privileges.md). |
| My dynamic table is suspended. | A dynamic table might be suspended for several reasons:   * It was suspended directly using the [ALTER DYNAMIC TABLE … SUSPEND](../sql-reference/sql/alter-dynamic-table.md)   command. * It is downstream of a suspended dynamic table. * It failed to refresh five consecutive times (skips don’t contribute to this count). * It is a replicated dynamic table, either in a replication group or failover group.   See [Replication and dynamic tables](account-replication-considerations.md). * It was cloned from a dynamic table that has one or more base tables dropped at the time of cloning.   To see the reason why your dynamic table was suspended, do the following:   1. Sign in to [Snowsight](ui-snowsight-gs.md). 2. In the navigation menu, select Transformation » Dynamic tables. 3. Select your dynamic table and go to the Table Details tab. 4. Hover over Scheduling State under Details. A dialog detailing the reason and date of the suspension appears. |

---
title: Department of Defense (DOD) Impact Level 5 (IL5)
source: https://docs.snowflake.com/en/user-guide/cert-dodIL5.md
section: User Guide
---

# Department of Defense (DOD) Impact Level 5 (IL5)

This topic describes how Snowflake supports customers with DOD Cloud Computing SRG compliance requirements.

## Understanding DOD Cloud Computing SRG compliance requirements

The Department of Defense (DOD) Cloud Computing Security Requirements Guide (SRG) outlines the security model and controls
for the DOD’s use of cloud computing. The U.S. military creates, stores, and operationalizes massive amounts of sensitive data.
Protecting that data is a strategic priority and is the focus of the DOD Cloud Computing SRG framework. This framework is used to
categorize information systems and data and to indicate the security requirements that data is subject to. Snowflake has received
Provisional Authorization (PA) by the Defense Information Systems Agency (DISA) to operate at Information Impact Level 5 (IL5) on
AWS GovCloud. This IL5 authorization allows Snowflake to offer authorized solutions to organizations requiring the highest levels of
protection for Controlled Unclassified Information (CUI) within the DOD and related agencies.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

You can view the latest Snowflake DOD Cloud Computing SRG IL5 authorizations within the
[Current Authorized CSOs website](https://public.cyber.mil/dccs/cso/).

Agencies may download Snowflake’s DoD package from eMASS.

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Deploy and manage DCM Projects
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-use.md
section: User Guide
---

# Deploy and manage DCM Projects

This topic describes how to create and deploy DCM Projects to manage Snowflake environments, including as accounts.

Managing a DCM project involves the following steps:

1. Prepare your Snowflake account for a DCM project.
2. Define project configuration and objects in project files.
3. Create a DCM Projects object.
4. Plan to preview proposed changes before deployment.
5. Deploy the project.
6. Maintain the project by monitoring, updating, and repeating the process as needed.

You can continuously deploy incremental changes to your project as well as large-scale account infrastructure changes.

Snowflake recommends that you continuously deploy incremental changes and additions to your project, rather than going from 0 to 100 large-scale account
infrastructure changes in a single deployment.

## Prepare for a DCM project

To get started with DCM Projects, your Snowflake account must satisfy the following prerequisites:

* A database and schema where you can create your DCM Projects object
* A role with privileges to create a DCM Projects object and access to run queries on a warehouse
* For Snowflake CLI, a role with privileges to create a temporary stage

This section describes the tasks that you need to complete to prepare for DCM Projects:

* Install interfaces to use with DCM Projects if you want to use Snowflake CLI or Cortex CLI.
* Configure Git integration (recommended but not mandatory)

> **Note:**
>
> The [snowflake-labs DCM repository](https://github.com/Snowflake-Labs/snowflake-dcm-projects) is continuously
> updated with resources to help you get started.
>
> * **Quickstarts and demo projects**: Clone the repository into a Snowflake Workspace or local folder to try out DCM Projects
>   commands and explore DCM Projects capabilities.
> * **Reusable GitHub Actions**: Composite actions for parsing manifests, testing connections, planning, and deploying
>   DCM Projects in CI/CD pipelines. For more information, see GitHub Actions.
> * **Sample workflows**: Ready-to-use workflow files that compose the reusable actions into complete CI/CD pipelines.

### Interface tools

You have the following interface options available for DCM Projects.

| Interface tool | Best for |
| --- | --- |
| **Snowsight**  A workspace in Snowsight is a Snowflake native cloud IDE in your account. | * Easily create or upload DCM definition files via the UI. * Connect to a Git repository to pull and push changes. * Review, edit and debug definition files. * Execute DCM PLAN and DEPLOY commands using the workspace UI. * Browse the database catalog to see DCM project objects and their configuration, managed objects and deployment history. * Select a target profile to automatically use the linked DCM project and templating configuration. |
| **Local IDE** with **Snowflake CLI**  The most familiar and personalized interface for software engineers. | * Create and edit definition files locally. * Connect to a Git repository to pull and push changes. * Concise Snowflake CLI commands with directory context and optional flags. * Rich formatted output and an option to save as a `.json` file. * Option to leverage Cortex Code CLI for agentic or assisted development. * See Snowflake CLI for DCM Projects for information about installing and running Snowflake CLI in your local IDE. |
| **Cortex Code**  An agentic AI tool for Snowflake. See Cortex Code for DCM Projects for more information. | * AI assisted or agentic authoring of local definition files. * AI assisted or agentic code validation and debugging by running static analysis and DCM PLAN commands. |
| **SQL commands** | * Run SQL commands from the Snowflake CLI REPL, workspaces, notebooks, or worksheets. * Customize commands with additional arguments. * Same commands work across all Snowflake SQL interfaces. |

#### Cortex Code for DCM Projects

Cortex Code is an agentic AI tool for Snowflake. With the DCM skill enabled, Cortex Code can autonomously create, migrate, debug, and
deploy DCM Projects. It can also work alongside you step by step.

> **Note:**
>
> Cortex Code with the DCM skill is currently available via the Cortex Code CLI only. It is not available in Snowsight Workspaces.

The Cortex Code DCM skill enables the following:

* Scaffold a new DCM project from scratch, including the manifest file, the folder structure, and definition files.
* Author and edit DEFINE statements, Jinja templates, and macros.
* Run PLAN, DEPLOY, REFRESH, TEST, and PREVIEW commands.
* Interpret plan output, diagnose failures, and suggest fixes.
* Download and inspect deployment artifacts.
* Navigate and explain an existing DCM project.

To get started with the Cortex Code DCM skill, follow these steps:

1. Install Cortex Code CLI as described in [Installing Cortex Code](../cortex-code/cortex-code-cli.md).
2. Start Cortex Code in your terminal.
3. Use the `$dcm` skill reference or use the term `DCM` in your natural language prompt to interact with your DCM Projects conversationally.

For example:

* “Create a new DCM project for our analytics pipeline”
* “Plan my project against the PROD target”
* “Why did my last plan fail?”
* “Add a new dynamic table definition for customer spending”

#### Snowflake CLI for DCM Projects

Snowflake CLI is a command-line interface for Snowflake. It is a tool that you can use to interact with your Snowflake account from your local
IDE.

1. DCM Projects require Snowflake CLI version 3.16 or higher. Install or upgrade Snowflake CLI as described in [Installing Snowflake CLI](../../developer-guide/snowflake-cli/installation/installation.md).
2. Configure your connection to your Snowflake account, as described in [Configuring Snowflake CLI and connecting to Snowflake](../../developer-guide/snowflake-cli/connecting/connect.md). Confirm you have a working connection:

   ```bash
   snow connection test
   ```
3. Navigate to the local directory of your Git repository clone. For example:

   ```bash
   cd ./Quickstarts/DCM_Quickstart_1
   ```
4. See the Snowflake CLI DCM commands available to you:

   ```snowcli
   snow dcm --help
   ```

### Git integration

Connect to the Git repository where your DCM project definition files are stored.

SnowsightLocal IDE

1. [Create a new workspace from a Git repository](../ui-snowsight/workspaces-git.md).
2. Create or select a Git branch for your planned changes.

   Snowflake clones files from that branch into your workspace editor.
3. Navigate to the folder where you have your DCM project definition files or want to create them.

1. [Install Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation).

   The [Snowflake extension](https://docs.snowflake.com/en/user-guide/vscode-ext) for VSCode is not needed here, but can be helpful.
2. [Connect to Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/connect).
3. Connect to your Git repository.

   1. Connect your local IDE to your remote Git repository.
   2. Create or select a branch for your planned changes.
   3. Clone that branch to your local disk.
   4. Navigate to the folder where you have your DCM Projects definition files or want to create them.

## Create a DCM project

### Required roles and privileges

The role of the user who creates a DCM project object must have the following roles and privileges:

* The CREATE DCM PROJECT ON SCHEMA privilege:

  ```sqlexample
  GRANT CREATE DCM PROJECT ON SCHEMA <schema_name> TO ROLE <role_name>;
  ```

### Create a DCM project

Create a DCM project object by using one of the following options.

SQLSnowflake CLISnowsight

```sqlexample
CREATE [OR REPLACE] DCM PROJECT [IF NOT EXISTS] <my_project>
[COMMENT = 'my comment'];
```

```snowcli
snow dcm create <my_project> --if-not-exists
```

To create a project for a non-default target, use one of the following commands:

```snowcli
snow dcm create <my_project> --if-not-exists

snow dcm create # uses the name specified in the default target from the manifest

snow dcm create --target # uses the named target from the manifest
```

1. In the navigation menu, select Projects » Workspaces.
2. On the Workspaces page, select “+Add new” -> “DCM Project” to create a new DCM project folder
3. Select “Define default target environment” to select or create a new DCM project object for the default target in the manifest

   When running DCM PLAN against a target which has a DCM project object defined in the manifest, but does not yet exist, the UI will
   prompt you to confirm creation of that DCM project object based on the defined name and owner role before executing the plan.
4. The Target indicates whether the specified DCM project object already exists and can be used or not.

   * Green: The DCM project object exists and can be used to run PLAN or DEPLOY.
   * Red: The DCM project object does not exist and first needs to be created.

## Access control and role privileges

You can set role-based access control (RBAC) of the schema-level DCM project object to READ, MONITOR, or OWNERSHIP privileges.

These privileges are independent of the access control for definition files stored in a workspace, stage, or repository.

| Privilege | Description | Allowed operations |
| --- | --- | --- |
| READ | * Shows if the DCM project object exists. * Lists the objects and grants deployed by the DCM project, which are visible to the user’s role.  This means you need both READ on the DCM project and READ on the objects themselves. | * SHOW DCM PROJECTS LIKE ‘%project’ * DESCRIBE DCM PROJECT <project> * SHOW ENTITIES IN DCM PROJECT <project> |
| MONITOR | * Gives access to the complete deployment history, including all artifacts. * Gives the role the ability to analyze, debug, or audit production deployments without the ability to deploy changes directly. | * All READ privileges * DESCRIBE DCM PROJECT <project> (with source and deployment path of latest deployment) * INFORMATION_SCHEMA.DCM_DEPLOYMENT_HISTORY (project_name => ‘db.schema.project’) * SHOW DEPLOYMENTS IN DCM PROJECT <project> * LIST all files in the deployment * GET any access to files inside the DCM project |
| OWNERSHIP | * The role that is used to create the DCM project object is the owner of that project. * Gives the role the ability to deploy changes. * Gives the role the ability to transfer ownership of the project to another role when the project has not yet been deployed. | * All MONITOR privileges * EXECUTE DCM PROJECT <project> PLAN * EXECUTE DCM PROJECT <project> DEPLOY * EXECUTE DCM PROJECT <project> PREVIEW * EXECUTE DCM PROJECT <project> REFRESH * EXECUTE DCM PROJECT <project> TEST * DROP DCM PROJECT <project> * ALTER DCM PROJECT <project> * GRANT READ on DCM PROJECT <project> TO ROLE <role2> * GRANT MONITOR on DCM PROJECT <project> TO ROLE <role2> |

> **Note:**
>
> Like other Snowflake commands, `EXECUTE DCM PROJECT` respects when **privileges from secondary roles** are enabled for the user who
> runs the command. Run `USE SECONDARY ROLES NONE;` so that you are not leveraging privileges from other roles than the project owner
> role. This ensures that deployment behavior is consistent across different environments when executed by different service-users with the
> same primary role.

### Ownership on DCM-managed objects

The role that deploys a DCM project, by default, has the OWNERSHIP privilege of all deployed objects.

The project definitions can include GRANT OWNERSHIP statements to other roles. Snowflake recommends that the DCM project owner role only grant
ownership of DCM-managed objects to another lower-level role that it also holds. Then the project can continue to manage this object, as
the project owner role “inherits” the privileges of the new object owner role.

If the DCM project owner role grants ownership of DCM-managed objects to another role that it does not hold itself, the project can no
longer manage this deployed object because the project owner role no longer has ownership of it. Subsequent deployments will fail. The object
definition needs to be removed from the project or ownership needs to be granted back to the project owner role.

If you want to migrate existing objects to be managed by a DCM project, the role that owns the DCM project object also has to have ownership
privileges (direct or inherited through other roles) on the object to be managed by DCM project.

> **Note:**
>
> If a migrated object, we recommend adding the corresponding GRANT OWNERSHIP statement to the project definitions as well to ensure that
> the current state and DCM project definitions are in sync.

## Define a DCM project

A DCM project is based on a manifest file and one or more SQL object definition files. These files are typically stored and managed in a
Git repository or your local workspace.

* The manifest file

  + Specifies one or more target environments with corresponding account identifiers, DCM project objects, owner roles for these object and optional templating configurations
  + Optionally, specifies templating defaults and one or more configurations with values for [template variables](dcm-projects-files.md).
* The object definition files

  + Define a group of Snowflake objects, grants, and expectations that you want to manage together in this DCM project.

See [Create a DCM project folder to store your definition files](dcm-projects-files.md) for how to set up a DCM project folder and the definition files and how to use templates to define your
DCM project.

## Plan a DCM project

Planning a DCM project performs a dry run to preview changes before deployment. Snowflake compares your [project definition files](dcm-projects-files.md) to existing objects and shows which objects will be created, altered, or dropped. No changes are made
to your account.

Use planning to review and validate changes before deploying a DCM project.
You can specify options such as a [configuration](dcm-projects-files.md)
or an output path for plan results.

The PLAN mimics the DEPLOY command as much as possible, except it doesn’t actually execute any DDL statements.

> **Important:**
>
> Always run the PLAN command on your projects before deployment to help ensure there are no errors from syntax, templating, object
> dependency, access privileges, and so on. Review the plan output to debug any errors, preview the rendered Jinja with the provided
> variables, and preview the changes that will be made once you deploy.

The plan performs the following steps:

1. Renders all Jinja templating with the selected configuration profile or values provided at runtime.
2. Compares all definitions against the current state of entities that were defined as part of the last deployment.
3. Converts all defined statements into CREATE, ALTER, DROP, GRANT, and REVOKE statements.
4. Sorts all statements based on their interdependencies.
5. Compiles all statements.

> **Note:**
>
> Although PLAN catches almost all possible errors that can occur during deployment, it does not guarantee a successful deployment.

### Run the PLAN command

The PLAN command takes the following information as input:

* The path to the manifest file

  The CLI reads the target from the manifest (`default_target` or `--target` flag). For SQL commands, the path to the manifest file and
  the project name must be provided.
* Defined values for Jinja variables (optional).
* The target’s `templating_config` automatically selects the configuration profile. For SQL commands, use the USING CONFIGURATION clause to
  specify the profile.
* One or more values of the configuration profile to overwrite (optional).

The following are examples of how to run the PLAN command.

Snowflake CLISnowsightSQL

Run the `snow dcm plan` command in your local IDE terminal or as part of a Git workflow.

An example of a CLI command to plan a DCM project from a local directory is:

```snowcli
cd ./Quickstarts/DCM_Project_Quickstart_1/
snow dcm plan
```

An example of a CLI command to plan a DCM project from a Snowflake stage or Git repository clone is:

```snowcli
snow dcm plan --target PROD_US --save-output
```

An example of a CLI command to plan a DCM project with optional arguments is:

```snowcli
snow dcm plan
--variable "wh_size='MEDIUM'" --variable "teams = ['TEAM_A', 'TEAM_B']"
--save-output
```

Variables are required in double-quotes with additional single quotes for string-values. Lists of values require
square-brackets.

In the DCM control panel, at the top:

1. Select your project folder in your current workspace.
2. Select your target (if you have multiple targets).
3. (optional) Overwrite specific parameters.
4. Click Plan.
5. The Snowsight UI automatically uses the DCM project object defined in the manifest target. If the project object does not yet
   exist, you can create it from the UI.

When the PLAN is completed, the output opens in a new tab. If you don’t see it, click Plan again to open the tab.

If a Plan already exists, you can choose to re-plan if you have changed your definitions.

The plan output is always generated automatically under the project sub-folder `out/`.

You can execute a DCM PLAN in SQL from anywhere you can run SQL commands, inside Snowflake or connected to Snowflake. Use the
[EXECUTE DCM PROJECT](../../sql-reference/sql/execute-dcm-project.md) command with the `PLAN` mode.

An example of a SQL command to plan a DCM project from a Workspace path is:

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  PLAN
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_9_36';
```

An example of a SQL command to plan a DCM project when using Jinja with configuration profiles but overwriting `wh_size` and `teams` is:

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  PLAN
  USING CONFIGURATION DEV (wh_size => 'MEDIUM', teams => ['TEAM_A', 'TEAM_B'])
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_9_36'
```

An example of a SQL command to plan a DCM project when using Jinja templating without configuration profiles is:

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  PLAN
  USING (wh_size => 'MEDIUM')
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_9_36';
```

### Definition file path

You have the following options to reference the location of the manifest and definition files.

* From a Workspace path

  The Snowsight user interface automatically lists all DCM project definitions inside the current workspace. You can select one of these
  paths and workspaces will use it to run DCM commands.

  If you want to manually run SQL commands in workspaces you can also refer to that same path inside any of your workspaces.

  **Tip:** The 3-dot menu behind every file in your workspace lets you copy the full path to that file into your SQL code.

  An example of a SQL command to plan a DCM project from a workspace path is:

  ```sqlexample
  EXECUTE DCM PROJECT DCM_PROJECT_DEV
    PLAN
    USING CONFIGURATION DEV
  FROM
    'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_1'
  ```
* From a local Git repository clone on your disk

  Select the directory that contains your `manifest.yml` file before running the CLI command in your local IDE.
  Alternatively you can specify a different local directory that contains the manifest and definitions you want to use.

  An example of a CLI command to plan a DCM project from the current directory of a local Git repo:

  ```snowcli
  cd ./Quickstarts/DCM_Project_Quickstart_1/

  snow dcm plan

  snow dcm plan --target PROD
  ```

  An example of a CLI command to plan a DCM project from a different directory in a local Git repo clone:

  ```snowcli
  snow dcm plan DCM_PROJECT_DEV --configuration DEV --from ./Quickstarts/DCM_Project_Quickstart_2/
  ```
* From your remote repository in a workflow

  The same CLI syntax can be used when the DCM commands are executed in a CI/CD workflow. You can call the CLI directly
  or use the reusable GitHub Actions from the snowflake-labs DCM repository, which
  handle CLI setup, authentication, and DCM commands internally.

  An example using the reusable `dcm-plan` action:

  ```yaml
  steps:
    - uses: actions/checkout@v4
    - uses: Snowflake-Labs/snowflake-dcm-projects/actions/dcm-plan@v1
      with:
        target: PROD
        project-path: Quickstarts/DCM_Project_Quickstart_1/
        snowflake-user: ${{ env.SNOWFLAKE_USER }}
  ```
* From a Stage or Git repository clone in Snowflake

  In case you want to run a PROCEDURE or TASK inside Snowflake that runs DCM commands, this SQL command can reference an absolute
  path to a Snowflake stage or Git repository clone inside the account.

  For Git Repository clones consider first running ALTER GIT REPOSITORY FETCH to have the latest version.

  `'@...'` paths can only be used when executing DCM SQL commands.

  An example of a SQL command to plan a DCM project from a Stage or Git repository clone in Snowflake is:

  ```sqlexample
  EXECUTE DCM PROJECT DCM_PROJECT_DEV
    PLAN
    USING CONFIGURATION DEV
  FROM
    '@DCM_DEMO.DEPLOY.DCM_DEMO/branches/main/Quickstarts/DCM_Project_Quickstart_1/'
  ```

### Plan output

For the PLAN and DEPLOY output format, including the JSON schema and examples, see the
[PLAN and DEPLOY output](../../sql-reference/sql/execute-dcm-project.md) section of the [EXECUTE DCM PROJECT](../../sql-reference/sql/execute-dcm-project.md) command reference.

## Deploy a DCM project

When you deploy a DCM project, the following actions are performed:

* Objects that are defined but don’t exist yet are created.
* Objects that already exist but differ from the current definition are altered.
* Objects that already exist as defined are skipped.
* Objects that already exist but are no longer defined are dropped.

The same behavior applies to grants and attached data quality expectations defined in the project.

> **Important:**
>
> To avoid any unintended data loss, always run and **review your PLAN** output before running DEPLOY.

Each DCM project can only have one instance deployed at any time. Multiple configuration profiles can’t coexist. Deploying configuration B
with the same DCM project will drop any objects from other previous configurations that are not defined in B.

Create one DCM project for each target environment. The DCM project for each environment can then point to the same definition files, but
deploy independently with different values for each variable, like `suffix => 'DEV_JS'`, so that they can exist
independently side-by-side on the same Snowflake account.

You can overwrite values for selected variables at runtime if you want to use a pre-defined profile with a slight variation.

For example:

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  DEPLOY
  USING CONFIGURATION DEV (suffix=>'DEV_USER', user=>'JANEDOE')
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/DCM_Project_Quickstart_1';
```

```snowcli
snow dcm deploy DCM_PROJECT_DEV --configuration DEV --variable "suffix='DEV_USER'" --variable "user='JANEDOE'"
```

Each deployment attempt (successful, failed, or canceled) has a deployment number, for example `DEPLOYMENT$1`. Optionally you can
specify a unique string as a deployment *alias* to name individual deployments for better observability in the deployment history.
Think of the deployment *alias* like a commit message for your code change.

Each DEPLOY command first runs an internal PRE-PLAN as part of the deployment. If the PRE-PLAN succeeds the DEPLOY is executed
directly afterwards. There is no option to intercept or review this internal plan step. The PRE-PLAN is executed to further
reduce the risk of failure during the deployment.
If a DEPLOY fails, you can see in the error message if it failed during the PRE-PLAN or DEPLOY step.
Failure during the PRE-PLAN step is similar to PLAN - no DDL changes are executed.

> **Important:**
>
> Failure during the DEPLOY step can result in partial execution of the defined changes. This can potentially cause some of the
> managed objects to be in an undefined state. In most cases fixing the root cause and executing DEPLOY again restores the
> defined target state.

The target path for the DEPLOY output file can’t be customized. Deployment artifacts are always stored inside the DCM project.

### Run the DEPLOY command

To execute the DEPLOY command, provide the following inputs:

* The path to the manifest file.
* A configuration profile must be named if configuration profiles are defined in the manifest.
* Optionally, values for the configuration profile overriding the default values.
* Optionally, a deployment *alias*.

The following are examples of how to run the DEPLOY command.

SQLSnowflake CLISnowsight

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  DEPLOY
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_1';
```

An example of a SQL command to deploy a DCM project when using Jinja with configuration profiles but overwriting `wh_size` and `teams` is:

```sqlexample
EXECUTE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  DEPLOY AS "testing 2 teams"
  USING CONFIGURATION DEV (wh_size => 'MEDIUM', teams => ['TEAM_A', 'TEAM_B'])
FROM
  'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/live/Quickstarts/DCM_Project_Quickstart_1';
```

You can run `snow dcm deploy` either in your local IDE terminal or as part of a Git workflow.

An example of a CLI command to deploy a DCM project from a local directory is:

```snowcli
cd ./Quickstarts/DCM_Project_Quickstart_1/
snow dcm deploy DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
```

An example of a CLI command to deploy a DCM project targeting a non-default environment is:

```snowcli
snow dcm deploy --target PROD_US
```

An example of a CLI command to deploy a DCM project with optional arguments is:

```snowcli
snow dcm deploy DCM_DEMO.PROJECTS.DCM_PROJECT_DEV \
--target DCM_DEV \
--variable "wh_size='MEDIUM'" --variable "teams = ['TEAM_A', 'TEAM_B']" \
--alias 'testing 2 teams'
```

1. In the navigation menu, select Projects » Workspaces.
2. Select your project folder in the current workspace.
3. Select your target (if you have multiple targets).
4. Click Plan.

   The UI will automatically use the DCM project object defined in the manifest target. If the project object does not yet exist, you
   can create it from the UI.
5. Once the PLAN is completed, the output opens in a new tab. If you don’t see it, click Plan again to open the tab.

   If a Plan already exists you can choose to re-plan if you have changed your definitions.
6. Review your PLAN output to ensure it does not contain unintended changes.
7. Click Deploy to execute the deployment with the same target and values from PLAN.

See [PLAN and DEPLOY output](../../sql-reference/sql/execute-dcm-project.md) for the standard plan output structure.

## Manage a DCM project

### Show all objects managed by a DCM project

The [SHOW ENTITIES IN DCM PROJECT](../../sql-reference/sql/show-entities-in-dcm-project.md) command allows you to see a list of all Snowflake objects that are currently managed by a specific DCM project.
It provides a list of fully qualified names for all objects. To see the results, you need both READ privilege on the DCM project and privileges to see the managed object itself.

> **Note:**
>
> The result does not necessarily match the objects of the most recent deployment. Objects that were manually dropped or detached from the project are not listed in the result.

You can use `LIKE` to search by name or use a flow operator to further process or filter the result set.

Similarly you can SHOW GRANTS and SHOW FUTURE GRANTS that are defined and deployed with this project.

Examples to see all objects that are currently managed by a DCM project:

SQLSnowsight

```sqlexample
SHOW ENTITIES LIKE '%DASHBOARD%' IN DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV;

SHOW ENTITIES IN DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV
  ->> SELECT * FROM $1 WHERE "object_type" = 'DYNAMIC_TABLE';

SHOW [FUTURE] GRANTS IN DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV;
```

1. In the navigation menu, select Catalog » Database Explorer.
2. Navigate to the schema that contains the DCM project object.
3. Select the DCM project object to see its details.
4. Select the Objects tab to see a list of all Snowflake objects currently managed by this project object.
5. Click the name of an object to open that object’s details page in a new tab.

### Detach objects from a DCM project

Using the [ALTER <object>](../../sql-reference/sql/alter.md) command with the UNSET DCM PROJECT clause, you can detach an object that was deployed and
is now managed by a DCM project. The command removes the association between the object and the DCM project without dropping the object.
You can use this command when you want to start managing an object by a different DCM project.

Make sure to remove the corresponding DEFINE statement from your [project definition files](dcm-projects-files.md) before
you deploy it again. Otherwise, the object will be reintegrated into the DCM project.

An example of a SQL command to detach an object from a DCM project:

```sqlexample
ALTER TABLE MY_DB.MY_SCH.MY_TABLE
  UNSET DCM PROJECT;
```

You can not detach deployed grants or exectations from a DCM Project.

### Drop a DCM project

When a DCM project object is dropped, all managed entities, grants, and expectations remain in place as “unmanaged”.

> **Important:**
>
> Dropping or replacing a DCM project object causes you to lose all deployment history artifacts that the object contains.

SQLSnowflake CLISnowsight

```sqlexample
DROP DCM PROJECT [IF EXISTS] <my_project>;
```

```snowcli
snow dcm drop my_project
```

1. In the navigation menu, select Catalog » Database Explorer.
2. Navigate to the schema that contains the DCM project.
3. Select the DCM project to see its details page.
4. Click the 3-dot menu in the top right and select Drop.

## Automate a DCM project deployment

### CI/CD best practices

Follow these practices when automating deployments with CI/CD pipelines:

* A DCM project targeting a non-production environment should be owned by a different role than its production counterpart to avoid
  accidental deployments to production.
* A DCM project targeting a production environment should be owned by a dedicated role for production deployments with specifically tailored access
  privileges that are just enough to deploy all objects in the project.

  + Avoid using general administrator roles for DCM project ownership. Grant such roles only to service users, not to individual developers.
  + Grant the dedicated production deployment role only to service users, not to individual developers.
  + Restrict the ownership to the production deployment role to ensure immutability of critical infrastructure or data products.

    If the dedicated production deployment role grants ownership of production objects to other roles, users who are granted those roles can
    still modify or drop the production objects.

### GitHub Actions

The [snowflake-labs DCM repository](https://github.com/Snowflake-Labs/snowflake-dcm-projects) provides a set of reusable
composite GitHub Actions that automate DCM Projects pipelines. Each action handles one step of the lifecycle, and you can reference them
from your own workflows to build end-to-end CI/CD pipelines. Only the workflow syntax differs across platforms; the same
CI/CD concepts apply to Azure DevOps, GitLab CI/CD, Bitbucket Pipelines, and others.

> **Note:**
>
> The GitHub Actions in the snowflake-labs DCM repository are provided as-is for evaluation purposes. They aren’t officially
> supported by Snowflake. Use at your own risk.

The following reusable actions are available:

| Action | Description |
| --- | --- |
| `dcm-parse-manifest` | Parses `manifest.yml` and outputs target names as a JSON array for matrix strategies. |
| `dcm-connection-test` | Tests Snowflake connectivity, validates that the connection role matches the manifest `project_owner`, and checks whether the DCM project object already exists. |
| `dcm-plan` | Runs `snow dcm plan`, summarizes the changeset (CREATE, ALTER, DROP counts by object domain), and uploads plan artifacts. Optionally posts the plan summary as a comment on the associated pull request. |
| `dcm-deploy` | Deploys the DCM project with optional data drop detection, Dynamic Table refresh, expectation testing, and post-deployment SQL scripts. Optionally posts a deploy summary to the pull request. |

To use an action in your workflow, reference it with:

```yaml
- uses: Snowflake-Labs/snowflake-dcm-projects/actions/<action-name>@v1
```

For full documentation of each action’s inputs and outputs, see the
[actions README](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/actions/README.md).

#### Prerequisites

Before using the reusable GitHub Actions, complete the following setup steps:

* Store the DCM project files in a Git repository.
* Create a **GitHub Environment** for each manifest target (for example, `DCM_STAGE`, `DCM_PROD_US`). The environment name must
  match the target name in your `manifest.yml`.
* Set the `SNOWFLAKE_USER` and `DCM_PROJECT_PATH` variables in the workflow `env` block or as GitHub repository variables.
* Grant the workflow the required permissions:

  ```yaml
  permissions:
    id-token: write       # Required for OIDC authentication
    contents: read
    pull-requests: write  # Required only when using comment-on-pr
  ```

##### Authentication

All actions authenticate using the
[Snowflake CLI GitHub Action](https://github.com/snowflakedb/snowflake-cli-action). OIDC (OpenID Connect) is the recommended
approach because it uses GitHub’s built-in identity tokens so that no passwords or private keys need to be stored as secrets.

To configure OIDC authentication, create a Snowflake service user with a workload identity that trusts GitHub’s OIDC provider:

```sqlexample
CREATE USER SVC_GITHUB_ACTIONS
  TYPE = SERVICE
  DEFAULT_ROLE = 'PUBLIC'
  COMMENT = 'GitHub Actions service user for CI/CD via OIDC'
  WORKLOAD_IDENTITY = (
    TYPE = OIDC
    ISSUER = 'https://token.actions.githubusercontent.com'
    SUBJECT = 'repo:<owner>/<repo>:environment:<env_name>'
  );
```

Replace `<owner>/<repo>` with your GitHub repository and `<env_name>` with the GitHub Environment name (for example,
`DCM_STAGE`). If you have multiple environments, create a separate service user per environment or use
[subject claim customization](https://docs.github.com/en/actions/security-for-github-actions/security-hardening-your-deployments/about-security-hardening-with-openid-connect#customizing-the-subject-claims).
Then grant the service user the role specified as `project_owner` in your manifest.

If you can’t use OIDC, the actions also support password, PAT, and key-pair authentication. See the
[actions README authentication section](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/actions/README.md#authentication)
for setup instructions.

#### Sample workflows

The [GitHub_workflows](https://github.com/Snowflake-Labs/snowflake-dcm-projects/tree/main/GitHub_workflows) directory in the
snowflake-labs DCM repository contains ready-to-use workflow files that compose the reusable actions into complete CI/CD pipelines.
You can copy them into your repository’s `.github/workflows/` directory and customize them for your project. For full setup
instructions, see the
[sample workflows README](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/GitHub_workflows/README.md).

All sample workflows read the Snowflake `account_identifier` and `project_owner` role directly from the manifest targets,
so that environment-specific configuration lives in the version-controlled `manifest.yml` rather than in duplicated
GitHub secrets. Only the service user credentials are stored as secrets.

The sample workflows demonstrate the following patterns applicable to any DCM Projects CI/CD setup:

* **Manifest-driven configuration**: Each workflow reads `account_identifier`, `project_owner`, and `project_name` from the
  manifest targets, keeping environment configuration in one place.
* **Data drop protection**: The deploy workflow detects destructive DROP operations on data-bearing objects
  (databases, schemas, tables, and stages) and blocks the deployment if any are found.
* **Sequential stage-to-production promotion**: Production deployment starts only after staging deployment succeeds, Dynamic Tables
  are refreshed, and data quality tests pass.
* **Pull request comments**: Plan and deploy summaries are posted as comments on the originating pull request.

##### Sample workflow: Test connections

* Workflow configuration file: [DCM_1_Test_Connections.yml](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/GitHub_workflows/DCM_1_Test_Connections.yml)
* Trigger: Manual with the `workflow_dispatch` event

This workflow validates that the GitHub Actions service user can connect to every target environment defined in the manifest. Use it
when setting up a new repository, onboarding a new account, or debugging authentication issues. The workflow performs the following steps:

* Parses all target names from `manifest.yml` dynamically.
* Uses a GitHub Actions matrix strategy to test each target in parallel.
* For each target, verifies the Snowflake connection, reports the connected account, user, and role, and checks whether the connected
  role matches the DCM project owner.
* Reports whether the DCM project object already exists and whether the service user has deployment privileges.

##### Sample workflow: Test PR to main

* Workflow configuration file: [DCM_2_Test_PR_to_main.yml](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/GitHub_workflows/DCM_2_Test_PR_to_main.yml)
* Trigger: Pull request opened, synchronized, or reopened against the `main` branch

This workflow runs a PLAN against the production target as an integration test for every pull request. It provides reviewers with a
summary of the planned changes directly on the pull request. The workflow performs the following steps:

* Runs `snow dcm plan` against the PROD target.
* Parses `plan_result.json` to summarize CREATE, ALTER, and DROP operations grouped by object domain.
* Uploads plan artifacts for later inspection.
* Posts the plan summary as a comment on the pull request.
* Fails the check if the PLAN fails, blocking the merge.

##### Sample workflow: Deploy to Prod

* Workflow configuration file: [DCM_3_Deploy_to_Prod.yml](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/GitHub_workflows/DCM_3_Deploy_to_Prod.yml)
* Trigger: Push to the `main` branch (typically a merged pull request)

This workflow plans and deploys to a single production target. Use it when you don’t need a staging environment or when staging
is handled separately. The workflow performs the following steps:

1. Plan: Runs `snow dcm plan` and summarizes the changeset.
2. Data drop detection: Blocks the pipeline if the plan contains DROP operations for databases, schemas, tables, or stages.
3. Deploy: Runs `snow dcm deploy`.
4. Post scripts (optional): Runs SQL post-hook scripts with Jinja variable injection.
5. Refresh Dynamic Tables (optional): Runs `snow dcm refresh` to apply any new transformation logic.
6. Test expectations (optional): Runs `snow dcm test` to validate data quality expectations.

After deployment, the workflow optionally posts a status summary to the originating pull request.

##### Sample workflow: Deploy to Stage then Prod

* Workflow configuration file: [DCM_4_Deploy_to_Stage_then_Prod.yml](https://github.com/Snowflake-Labs/snowflake-dcm-projects/blob/main/GitHub_workflows/DCM_4_Deploy_to_Stage_then_Prod.yml)
* Trigger: Push to the `main` branch (typically a merged pull request)

This workflow implements a sequential promotion pipeline. Changes are first deployed to staging, validated end-to-end, and only then
promoted to production. If any step fails, the pipeline stops and production is not affected.

The deployment sequence for each target (STAGE, then PROD) includes:

1. Plan: Runs `snow dcm plan` and summarizes the changeset.
2. Data drop detection: Blocks the pipeline if the plan contains DROP operations for databases, schemas, tables, or stages.
3. Deploy: Runs `snow dcm deploy`.
4. Post scripts (optional): Runs SQL post-hook scripts with Jinja variable injection.
5. Refresh Dynamic Tables (optional): Runs `snow dcm refresh` to apply any new transformation logic.
6. Test expectations (optional): Runs `snow dcm test` to validate data quality expectations.

Production deployment starts only after all staging steps pass. After all jobs complete, the workflow optionally posts a final
status summary to the originating pull request.

## Frequently asked questions (FAQ)

How do I rename an existing object?
:   1. Run an ALTER command outside of the DCM project.
    2. Change the definition.
    3. Run PLAN to verify that the new definition matches the new state (no change in PLAN).
    4. Run DEPLOY to save the new state.

How do I deploy objects that are not yet supported by DEFINE statements?
:   You can run CREATE IF NOT EXISTS or CREATE OR REPLACE statements in a separate SQL script after executing your DCM project plan or
    deployment.

    Both options support Jinja2 templating and dry-run (dry-run renders the Jinja templating but does not verify successful SQL compilation).

    For example:

    SQLSnowflake CLI

    ```sqlexample
    EXECUTE DCM PROJECT my_project
      PLAN ...
    USING ...
    FROM ...

    EXECUTE IMMEDIATE
    FROM
      'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/head/DCM_Project_Quickstart_1/hooks/post_hook.sql'
      USING (db => 'DEV')
      dry_run = TRUE      -- shows the rendered Jinja but does not verify successful compilation
    ;
    ```

    ```snowcli
    snow dcm deploy --target DEV

    snow sql -f hooks/post_hook.sql --variable "db='DEV'" --enable-templating JINJA
    ```

---
title: Deploy dbt project objects
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-deploy.md
section: User Guide
---

# Deploy dbt project objects

In dbt Projects on Snowflake, deploying a dbt project object means copying your dbt Project code into Snowflake to create the object or update it with a new
version. You do this with Snowsight, CREATE DBT PROJECT or ALTER DBT PROJECT SQL commands, or the `snow dbt deploy` command in
the Snowflake CLI.

## Deploy a dbt project object using Snowsight

Deploying a dbt project object in Snowsight takes the dbt code in your workspace and creates a new or updates an existing dbt project.

To deploy a dbt project object in Snowsight, [run the dbt deps command](dbt-projects-on-snowflake-dependencies.md),
then complete the following steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Workspaces menu, select the workspace that contains your dbt project.
4. Confirm that your dbt files are in place.

   To verify that things work, run the `dbt compile`, `dbt run`, or dbt build command, as follows:

   1. Below the workspace editor, open the Output tab so that you can see stdout after you run dbt commands from the workspace.
   2. From the menu bar above the workspace editor, confirm that the correct Project and Profile are selected.
   3. From the command list, select dbt compile, `dbt run`, or dbt build, then select the execute button. This step parses
      your project.
5. From the top right of your workspace, select Connect then select one of the following:

   * Deploy dbt project to connect a new dbt project. On first deploy, this creates a schema-level dbt project object.
   * Existing dbt deployment to connect to an existing dbt project. Deploying adds a new version to the existing dbt project object
     (equivalent to `ALTER DBT PROJECT … ADD VERSION FROM 'snow://workspace/…/versions/last'`).
6. In the Deploy dbt project popup window, select the following:

   > * Under Select location, select your database and schema.
   > * Under Select or Create dbt project, select Create dbt project.
   > * Enter a name and description.
   > * Optionally, enter a default target to choose which profile will be used for compilation and subsequent runs (for example, prod). The
   >   target of a dbt project run can still be overridden with `--target` in `ARGS`.
   > * Optionally, select Run dbt deps, then select your external access integration to execute `dbt deps` automatically during
   >   deployment.
7. Select Deploy.

   The Output tab displays the command that runs on Snowflake, which is similar to the following example:

   ```sqlexample
   CREATE DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
     FROM 'snow://workspace/mydb.my_dbt_projects_schema.sales_model/versions/version$2'
     EXTERNAL_ACCESS_INTEGRATIONS = ();
   ```

   ```output
   my_dbt_project successfully created.
   ```

   The Connect menu now displays the name of the dbt project object that you created, with the following options:

   * Redeploy dbt project: Updates the dbt project object with the current workspace version of the project by using ALTER. This
     increments the version of the dbt project object by one. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).
   * Disconnect: Disconnects the workspace from the dbt project object, but doesn’t delete the dbt project object.
   * Edit project: Update the comment, default target, and external access integration for the dbt project object.
   * View project: Opens the dbt project object in the object explorer, where you can view the CREATE DBT PROJECT command for the dbt
     project object and run history for the project.
   * Create schedule: Provides options for you to create a task that runs the dbt project object on a schedule. For more information,
     see [Create a task to schedule dbt project execution](../tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md).
   * View schedules: Opens a list of schedules (tasks) that run the dbt project object, with the option to view task details in the
     object explorer.
8. Optionally, confirm your dbt project exists by running the SHOW DBT PROJECTS command in a worksheet, for example:

   ```sqlexample
   SHOW DBT PROJECTS IN DATABASE mydb;
   ```

## Deploy a dbt project object using SQL commands

The [CREATE DBT PROJECT](../../sql-reference/sql/create-dbt-project.md) and [ALTER DBT PROJECT](../../sql-reference/sql/alter-dbt-project.md) commands copy the files specified in the FROM
clause of the statement to create and add new versions to a dbt project object, respectively.

The CREATE DBT PROJECT command creates a new object with a single initial version (for example, `VERSION$1`), as shown below.

```sqlexample
CREATE DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  FROM '@sales_db.integrations_schema.sales_dbt_git_stage/branches/main'
  DEFAULT_TARGET = 'prod'
  EXTERNAL_ACCESS_INTEGRATIONS = my_dbt_ext_access
  COMMENT = 'Generates sales data models.';
```

The ALTER DBT PROJECT command creates a new version within the existing object with a unique, incremented version number (for example,
`VERSION$2`, `VERSION$3`, etc.).

```sqlexample
-- Update the Git repository object to fetch the latest code
ALTER GIT REPOSITORY sales_db.integrations_schema.sales_dbt_git_stage FETCH;

-- Add a new version to the dbt project object based on the updated Git repository object
ALTER DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  ADD VERSION
  FROM '@sales_db.integrations_schema.sales_dbt_git_stage/branches/main/sales_dbt_project';
```

## Deploy a dbt project object using Snowflake CLI

The [snow dbt deploy](../../developer-guide/snowflake-cli/command-reference/dbt-commands/deploy.md) command uploads local files to a temporary stage and creates a new dbt project object, updates it by
making a new version, or completely recreates it. A valid dbt project must contain two files:

* `dbt_project.yml`: A standard dbt configuration file that specifies the profile to use.
* `profiles.yml`: A dbt connection profile definition referenced in `dbt_project.yml`. `profiles.yaml` must define the database, role, schema, and type.

  + By default, dbt Projects on Snowflake uses your target schema (`target.schema`) specified from your dbt environment or profile. Unlike dbt Core behavior, the target schema specified in the `profiles.yml`
    file must exist before you create your dbt Project in order for it to compile or execute successfully.

  ```yaml
  <profile_name>:
  target: dev
  outputs:
    dev:
      database: <database_name>
      role: <role_name>
      schema: <schema_name>
      type: snowflake
  ```

The following examples illustrate how to use the `snow dbt deploy` command:

* Deploy a dbt project object named `jaffle_shop`:

  ```snowcli
  snow dbt deploy jaffle_shop
  ```
* Deploy a project named `jaffle_shop` from a specified directory and create or add a new version depending on whether the dbt project object already exists:

  ```snowcli
  snow dbt deploy jaffle_shop --source /path/to/dbt/directory --profiles-dir ~/.dbt/ --force
  ```
* Deploy a project named `jaffle_shop` from a specified directory using a custom profiles directory, a specific dbt version, and enabling [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md):

  ```snowcli
  snow dbt deploy jaffle_shop --source /path/to/dbt/directory
  --profiles-dir ~/.dbt/
  --default-target prod
  --dbt-version 1.10.15
  --external-access-integration dbthub-integration
  --external-access-integration github-integration
  --force
  ```
* Deploy a project named `jaffle_shop` and set a specific version for the dbt project object:

  ```snowcli
  snow dbt deploy jaffle_shop --dbt-version '1.10.15'
  ```

## Source file locations

The dbt project source files can be in any one of the following locations:

> * **A Git repository stage**, for example:
>
>   `'@my_db.my_schema.my_git_repository_stage/branches/my_branch/path/to/dbt_project_or_projects_parent'`
>
>   For more information about creating a Git repository object in Snowflake that connects a Git repository to a workspace for dbt Projects on Snowflake, see [Create a workspace connected to your Git repository](../tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md). For more information about creating and managing a Git repository object and stage without using a workspace, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md) and [CREATE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md).
> * **An existing dbt project stage**, for example:
>
>   `'snow://dbt/my_db.my_schema.my_existing_dbt_project_object/versions/last'`
>
>   The version specifier is required and can be `last` (as shown in the previous example), `first`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).
> * **An internal named stage**, for example:
>
>   `'@my_db.my_schema.my_internal_named_stage/path/to/dbt_projects_or_projects_parent'`
>
>   Internal user stages and table stages aren’t supported.
> * **A workspace for dbt on Snowflake**, for example:
>
>   `'snow://workspace/user$.public."my_workspace_name"/versions/live/path/to/dbt_projects_or_projects_parent'`
>
>   We recommend enclosing the workspace name in double quotes because workspace names are case-sensitive and can contain special characters.
>
>   The version specifier is required and can be `last`, `first`, `live`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).

---
title: Detecting anomalies in data quality
source: https://docs.snowflake.com/en/user-guide/data-quality-anomaly.md
section: User Guide
---

# Detecting anomalies in data quality

Returning a value from a data metric function (DMF) provides useful information, but it might be hard to know whether it indicates a data
quality issue. You can define an [expectation](data-quality-expectations.md) if you know what is an acceptable value, but it might be
difficult to define enough manual rules to identify all possible data quality issues.

As a solution, Snowflake provides an algorithm that can detect anomalies in the values returned by a DMF. Snowflake trains this algorithm
with historical data, then automatically identifies return values that are above or below a predicted range.

You can enable anomaly detection for the following system DMFs:

* [ROW_COUNT](../sql-reference/functions/dmf_row_count.md) — Use to identify anomalies in the volume of data in a table.
* [FRESHNESS](../sql-reference/functions/dmf_freshness.md) — Use to identify anomalies in the frequency with which a table is being
  updated.

The following example shows how to enable anomaly detection for the association between the ROW_COUNT DMF and table `t1`:

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.ROW_COUNT ON ()
    ANOMALY_DETECTION = TRUE;
```

Snowflake trains the algorithm and then automatically starts identifying anomalies in the volume of table `t1`.

## About the training period

When you enable anomaly detection, Snowflake trains the anomaly-detecting algorithm on historical data. The length of the training period
depends on how frequently the DMF runs.

* **For DMFs that run frequently**, Snowflake requires at least two weeks of DMF data to start detecting anomalies. This two-week period is
  essential for establishing weekly seasonality. If the DMF has been running for longer, Snowflake trains the algorithm on up to 60 days of
  data. This longer training period establishes monthly seasonality and increases accuracy. Snowflake recommends that you let the algorithm
  be trained on 60 days of data to detect anomalies with a high degree of confidence.
* **For DMFs that run infrequently or on a trigger-based schedule**, Snowflake must have at least two data points to train the algorithm.
  For example, if a DMF runs every month, then Snowflake looks back two months to train the algorithm.

You can identify whether Snowflake is still in the training period by running the
[DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/functions/data_metric_function_references.md) function. If anomaly detection was enabled but the algorithm is still
being trained, the `anomaly_detection_status` column of the output contains the value `TRAINING_IN_PROGRESS`.

## Enable anomaly detection

You can enable anomaly detection for a DMF association when you first associate the DMF with an object, or you can enable it later.

Example: Enable anomaly detection when associating DMF
:   To enable anomaly detection when associating the FRESHNESS DMF with view `v1`, run the following command:

    ```sqlexample
    ALTER VIEW v1
      ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.FRESHNESS ON (c_timestamp)
        ANOMALY_DETECTION = TRUE;
    ```

Example: Enable anomaly detection for an existing association
:   To enable anomaly detection for an existing association between the ROW_COUNT DMF and table `t1`, run the following command:

    ```sqlexample
    ALTER TABLE t1
      MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.ROW_COUNT ON ()
        SET ANOMALY_DETECTION = TRUE;
    ```

## Adjust the sensitivity level of anomaly detection

After you enable anomaly detection, you can track how many anomalies are occurring in your
account. If the number of anomalies seems too low or too high, you can adjust the sensitivity level of the anomaly detection
algorithm.

* If there are too many false positives (that is, values mistakenly identified as anomalies), you can change the sensitivity to LOW to find
  fewer anomalies.
* If there are too many false negatives (that is, values that weren’t identified as anomalies, but really are), you can change the
  sensitivity to HIGH to find more anomalies.

The default sensitivity level is MEDIUM.

For example, to increase the sensitivity for a DMF association that finds anomalies in the volume of table `t1`, run the following
command:

```sqlexample
ALTER TABLE t1
  MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.ROW_COUNT ON ()
    SET SENSITIVITY = 'HIGH';
```

## Disable anomaly detection

You can disable anomaly detection for a DMF association at any time by using an ALTER statement to modify the object.

For example, to disable anomaly detection for the association between the ROW_COUNT DMF and table `t1`, run the following command:

```sqlexample
ALTER TABLE t1
  MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.ROW_COUNT ON ()
    SET ANOMALY_DETECTION = FALSE;
```

## Identify anomalies

You can identify anomalies using the following:

* SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW — A dedicated event table that records raw data quality results.
* DATA_QUALITY_MONITORING_ANOMALY_DETECTION_STATUS view — View in the SNOWFLAKE.LOCAL schema that contains flattened results.

### SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW

Data quality results are recorded in the dedicated event table SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW.

If anomaly detection is enabled for a DMF association, two rows are added to the table every time Snowflake computes the result
of the DMF. The first row records information about the object the DMF is associated with, the DMF itself, and the result of the data
quality check. The second row records information related to anomaly detection.

The `snow.data_metric.record_type` field in the `record_attribute` column indicates whether a row corresponds to anomaly
detection. This field has two possible values:

* `ANOMALY_DETECTION_STATUS` - Indicates that the row corresponds to anomaly detection.
* `EVALUATION_RESULT` - Indicates that the row corresponds to the evaluation of the DMF.

#### Identifying whether there was an anomaly

After you have determined that a row in the event table corresponds to anomaly detection, you can check the
`snow.data_metric.evaluation_result` field in the `resource_attribute` column to determine if there was an anomaly.

This field contains a VARIANT that contains the value returned by the DMF and a BOOLEAN value indicating whether that value was an anomaly.
For example, if the value of the `snow.data_metric.evaluation_result` field is `5, TRUE`, then the returned value was `5` and
Snowflake identified it as an anomaly.

#### Additional fields

If the row in the event table corresponds to anomaly detection, the `resource_attribute` column also contains the following fields:

* `snow.data_metric.upper_bound`— Highest value that should be returned by the DMF based on the anomaly-detecting algorithm. If the
  value returned by the DMF is above this upper bound, it is an anomaly.
* `snow.data_metric.lower_bound` — Lowest value that should be returned by the DMF based on the anomaly-detecting algorithm. If the
  value returned by the DMF is below this lower bound, it is an anomaly.
* `snow.data_metric.forecast` — Value that the anomaly-detecting algorithm predicted would be returned by the DMF.

### DATA_QUALITY_MONITORING_ANOMALY_DETECTION_STATUS view

The [DATA_QUALITY_MONITORING_ANOMALY_DETECTION_STATUS view](../sql-reference/local/data_quality_monitoring_anomaly_detection_status.md), which exists in the SNOWFLAKE.LOCAL schema, flattens the
information in the event table to make it easier to access DMF results.

---
title: Determining the benefits of search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/view-benefits.md
section: User Guide
---

# Determining the benefits of search optimization

After you configure search optimization for your tables, you can assess the benefits of search optimization by querying the
SEARCH_OPTIMIZATION_BENEFITS view.

This view provides information about the number of partitions pruned due to search optimization. To determine the efficacy of
pruning, you can compare the number of partitions pruned in the `partitions_pruned_additional` column against the total number
of partitions pruned (the sum of the values in the `partitions_pruned_default` column and the `partitions_pruned_additional`
column).

For more information, see [SEARCH_OPTIMIZATION_BENEFITS view](../../sql-reference/account-usage/search_optimization_benefits.md).

---
title: Diagnosing common dynamic table refresh issues
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-troubleshooting.md
section: User Guide
---

# Diagnosing common dynamic table refresh issues

This topic addresses solutions for troubleshooting dynamic tables that don’t refresh as expected:

Some actions might be restricted due to limitations on using dynamic tables or if you don’t have the necessary privileges. For more
information, see [Dynamic table limitations](dynamic-tables-limitations.md) and [Dynamic table access control](dynamic-tables-privileges.md).

If you encounter an issue not listed here, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

| Issue | Solution |
| --- | --- |
| My dynamic table is using full refresh instead of incremental refresh. | A dynamic table’s actual [refresh mode](dynamic-tables-refresh.md) is determined at creation time and is immutable afterward. If not specified explicitly, the refresh mode defaults to `AUTO`, which selects a refresh mode based on various factors such as query complexity, or unsupported constructs, operators, or functions.  For consistent behavior across Snowflake releases, explicitly set the refresh mode on all dynamic tables. For example, if you want your dynamic tables to refresh only incrementally, you must explicitly set the refresh mode to `INCREMENTAL` when creating them, keeping in mind that there might be some [limitations on using incremental refresh](dynamic-tables-limitations.md). For more information, see [Choose a refresh mode](dynamic-tables-performance-optimize.md).  Using a role with the [necessary privileges](dynamic-tables-privileges.md), you can verify the refresh mode using one of the following methods:  * Using SQL: Run the SHOW DYNAMIC TABLES statement. In the output, the `text` column shows the   user-specified refresh mode, the `refresh_mode` column shows the actual refresh mode, and the   `refresh_mode_reason` shows why the actual refresh mode was chosen. * Using Snowsight: In the navigation menu, select Transformation » Dynamic tables, and then   select your dynamic table. You can view the refresh mode for the dynamic table in the Table Details tab. |
| My dynamic table’s incremental refresh is slow. | For detailed diagnostic guidance and common performance patterns, see [Diagnose slow refreshes](dynamic-tables-performance-monitor.md).  You can also use Refresh History to view variance or spot outliers:   1. Sign in to [Snowsight](ui-snowsight-gs.md). 2. In the navigation menu, select Transformation » Dynamic tables. 3. Select your dynamic table and go to the Refresh History tab. 4. Use your dynamic table’s refresh durations over the last 24 hours to troubleshoot. |
| My dynamic table is running an empty refresh but I am seeing a cost. | Refreshes that produce zero net new rows (that is, zero rows added, updated, or deleted) consume warehouse resources when they’re associated with changes in any of the upstream objects referenced by the dynamic table.  For example, if the associated virtual warehouse is suspended and no changes in base objects are identified, the suspended virtual warehouse doesn’t resume and no credits are consumed. This is referred to as a [NO_DATA](../sql-reference/functions/dynamic_table_refresh_history.md) refresh. Conversely, if changes are identified, the virtual warehouse is automatically resumed to process the updates, which consumes warehouse resources even if the net result is zero rows applied to the dynamic table.  If you’re seeing a cost but you haven’t made any changes to your dynamic table, it might be due to a change in your source table. You can use the Refresh History tab in Snowsight to check if virtual warehouse credits were consumed:   1. Sign in to [Snowsight](ui-snowsight-gs.md). 2. In the navigation menu, select Transformation » Dynamic tables. 3. Select your dynamic table and go to the Refresh History tab. 4. Check the Warehouse used only checkbox to view refreshes that used the warehouse to update.   For more information, see [Understanding costs for dynamic tables](dynamic-tables-cost.md). |
| My dynamic table is reinitializing. | Your dynamic table might be reinitializing due to one of the following reasons:   * One or more of the inputs of the dynamic table are replaced. For example, if your dynamic table is defined on a view,   and you replace the view, the dynamic table has to reinitialize. * If the schema of the inputs changed and your dynamic table relies on the changed columns. * [Data access policies](../guides-overview-govern.md) are added, removed, or changed on the dynamic table’s inputs. * [Cloned incremental dynamic tables](../sql-reference/sql/create-dynamic-table.md) might need to reinitialize on their first refresh   after being created. * [Replicated dynamic tables](account-replication-considerations.md) with incremental refresh reinitialize after   failover before they can resume incremental refresh.   For general information about initialization, see [Understanding dynamic table initialization](dynamic-tables-refresh.md). |

---
title: Differences between sfsql and SnowSQL
source: https://docs.snowflake.com/en/user-guide/snowsql-sfsql-diff.md
section: User Guide
---

# Differences between sfsql and SnowSQL

SnowSQL (`snowsql`) provides many improvements and enhancements over the `sfsql` command-line interface, including more intuitive option and command names. This topic lists differences in usage between the two
command-line clients.

## Command-line options

Many of the command-line options in SnowSQL are backward-compatible with the corresponding options in `sfsql`; however, there are key differences, as described in the following table:

| Option | `sfsql` | SnowSQL (`snowsql`) |
| --- | --- | --- |
| Account identifier | `-a` | `-a` , `--accountname` |
| User name | `-u` | `-u` , `--username` |
| Password | `-c` | N/A (use SNOWSQL_PWD environment variable) |
| Prompt for password | N/A | `-P` |
| Database | `-d` | `-d` , `--dbname` |
| Schema | `-s` | `-s` , `--schemaname` |
| Warehouse | `-w` | `-w` , `--warehouse` |
| Role | `-r` | `-r` , `--rolename` |
| Host name | `-g` | `-h` , `--host` |
| Port number | `-p` | `-p` , `--port` |
| MFA passcode | `-m` | `-m` , `--mfa-passcode` |
| MFA passcode in password | `-n` | `--mfa-passcode-in-password` |
| Explain a SQL | `-e` (not supported) | N/A |
| Explain a SQL in dot form | `-x` (not supported) | N/A |
| Run a SQL file | `-f` | `-f` , `--filename` |
| Stop on error | N/A | `-o stop_on_error=true` |
| Exit on error | `-k` | `-o exit_on_error=true` |
| Authenticator | `-b` | `--authenticator` |
| Use a user-defined connection | N/A | `-c` , `--connection` |
| Trace level | `-t` | `-o log_level=(INFO|DEBUG)` |
| Show CLI version | N/A | `-v` , `--version` |
| Use specified config | N/A | `--config` |
| Set options | N/A | `-o` , `--option` |
| Set variables | N/A | `-D` , `--variable` |
| Help | `-h` | `-?` , `--help` |

## Commands

For commands, the key difference is all commands in SnowSQL must be prefixed with an exclamation point (e.g. `!exit`). In addition, the names of some of the commands have changed.

| Command | `sfsql` | SnowSQL (`snowsql`) |
| --- | --- | --- |
| Load and run a SQL file | `load` , `@` | `!source` , `!load` |
| Print a message | `echo` | `!print` |
| Set an option | N/A | `!set` |
| Show all options | N/A | `!options` |
| Set a variable | `set-var` | `!define` |
| Unset a variable | `unset-var` | N/A |
| Show all variables | N/A | `!variables` |
| Connect and start a new session | `connect` | `!connect` |
| Exit the current session | N/A | `!exit` , `!disconnect` (see also `!quit`) |
| Spool output into a file | `spool` | `!spool` |
| Quit the CLI | `exit` , `quit` | `!quit` |
| Executes a system command | `system` | `!system` |
| Help | `help` | `!help` |

## Special characters

The following characters have special meaning in the two clients:

| Usage | `sfsql` | SnowSQL (`snowsql`) |
| --- | --- | --- |
| Prefix for variable names | `$` | `&&` |
| Setting off comments in code | `#` | `--` and `/* ... */` |

---
title: Differential privacy in Snowflake
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-overview.md
section: User Guide
---

# Differential privacy in Snowflake

Differential privacy is a widely recognized standard for data privacy that limits the risk that a user could leak sensitive information
from a sensitive dataset. It protects the identity and information of individual entities in the data, for example, people, corporations,
and locations. While each individual entity’s information is protected, differential privacy still lets data consumers learn
statistics, trends, and behaviors about groups of individuals.

Differential privacy provides strong protection against re-identification that is particularly effective against targeted privacy attacks.
This protection lets you share sensitive data across teams, outside of your organization, and across regulatory lines. Differential
privacy mitigates the increased re-identification risk associated with joining two sensitive datasets, adding new fields, unmasking existing
fields, or providing individual rows instead of pre-aggregated data.

Unlike other privacy methodologies, differential privacy does the following:

* Protects against targeted privacy attacks, for example differencing and amplification attacks.
* Quantifies and manages the trade-off between privacy and utility, that is, controls how much non-sensitive information data consumers
  can learn about the data.
* Removes the need for data providers to transform sensitive data to reduce re-identification risk (for example, masking,
  redaction, and bucketing).

## How does differential privacy protect sensitive information?

Under differential privacy, query results must not reveal information that could be used to identify an individual entity. Snowflake does
the following to enforce differential privacy:

* Returns noisy aggregates.
* Limits privacy loss.

### Noisy aggregates

Differentially private queries must aggregate data to return results; row-level queries like `SELECT *` are blocked. These aggregates
are noisy; they’re not the exact result of a computation. Noise (that is, variation or randomization) is introduced into the result to
obscure whether any particular row or entity was included in the aggregation.

The addition of noise protects against privacy attacks like thin-slicing and differencing. The amount of noise that’s added to the query
result depends on several factors that influence the sensitivity of the query, including the number of records queried, type of aggregate,
and types of data transformations. Snowflake calculates the sensitivity of a query based on rigorous mathematics, but it can be understood
loosely as the query’s potential to leak information about an individual entity. In general, less-sensitive queries have less noise,
potentially to the point that it’s statistically negligible. Very sensitive queries, for example a query that tries to single out an
individual entity, have a large amount of noise to prevent sensitive information from being leaked.

Snowflake does not introduce noise into intermediate aggregations that occur before the final aggregation of the query; noise is only
introduced once per query.

Snowflake considers the number of rows in the privacy-protected table to be public. For example, executing `SELECT COUNT(*) FROM t`, where
table `t` is protected by a privacy policy, returns an exact result without incurring any privacy loss.

For more information about how to understand the level of noise, see [Understanding query results](differential-privacy-analyst.md).

### Limiting privacy loss

Every query against a protected dataset can result in the exposure of private information associated with an individual, including the
noisy aggregate results that differentially private queries produce. In differential privacy, this disclosure of information is known as
*privacy loss*, and is a quantifiable unit of measure. The more private information that is revealed by a query, the higher the privacy
loss associated with that query. Because privacy loss is quantifiable, Snowflake can use differential privacy to protect sensitive data
across a history of queries up to a certain degree of statistical confidence.

Privacy loss accumulates as a user executes queries against the protected data. When the cumulative privacy loss reaches a certain
threshold, subsequently letting the user see more results would theoretically let them identify individuals with an unacceptable level of
confidence. A *privacy budget* sets a limit on how much privacy loss is acceptable. Snowflake tallies the privacy loss of the queries
executed by a user or group of users and makes sure that tally never exceeds the privacy budget associated with those users. When the
user’s privacy budget reaches the budget limit set by the privacy policy creator, queries submitted by that user fail until that user’s budget is
refreshed. Snowflake offers a customizable privacy budget with a default value that sets the privacy loss threshold and the
refresh period.

Snowflake uses a [privacy policy](differential-privacy-admin-privacy-policies.md), which is a schema-level
object, to associate a privacy budget with a user or group of users. When an administrator assigns that privacy policy to a table or view,
it becomes *privacy-protected*. When a user runs a query against a privacy-protected table, Snowflake uses the privacy policy to determine
which privacy budget is associated with the user and ensures that the privacy loss the query incurs will not exceed the budget’s limit.

## Differential privacy in theory vs. in practice

The standard of differential privacy comes from academic literature, and was formulated to have strong, mathematically proven privacy
guarantees, particularly against theoretical privacy attacks. In particular, privacy settings like privacy budget are set more conservatively
when discussed in academic settings. These settings favor strong protection against theoretical privacy attacks, at the expense of data
utility (analytical fidelity, accuracy, and availability). When considering the tradeoff between privacy and utility for your use case,
including for highly sensitive data like PII and PHI, consider the following:

* Practical privacy attacks aren’t as effective as theoretical privacy attacks described in academic literature that assume that attackers
  have unlimited compute resources and access to all datasets except the one they’re attacking.
* Data consumers typically don’t want to intentionally launch attacks because the data provider can revoke the consumer’s data access, and
  the analytical value of the data is too high for them to risk losing it.

Snowflake has selected default settings that reasonably balance privacy protection and utility in line with the goals of real-world use
cases, but you can always set different settings to meet your specific needs.

## Differential privacy workflow

The following workflow consists of tasks performed by the data provider who is protecting their data with differential privacy
and tasks for an analyst who is querying the data after it’s protected.

**Data provider:**

* If you want to implement [entity-level privacy](differential-privacy-admin.md), structure your data to meet requirements.
* [Create a privacy policy](differential-privacy-admin-privacy-policies.md) that associates privacy budgets with users based on factors like role or
  account.
* [Assign the privacy policy](differential-privacy-admin-privacy-policies.md) to a table or view so that queries must be differentially private.
* [Define a privacy domain](differential-privacy-privacy-domains-admin.md) for numerical and categorical columns in the
  privacy-protected table or view.
* Grant privileges to the analysts so that they can access the privacy-protected data.
* As analysts execute queries against the privacy-protected data, you can
  [manage the privacy budgets associated with the users](differential-privacy-admin-privacy-budgets.md).

**Analyst:**

* [View the privacy domains](differential-privacy-privacy-domains-admin.md) that the data provider defined for the columns in the
  privacy-protected table to better understand the contents of the column.
* If the data provider forgot to set a privacy domain for a column that you want to use in an aggregation or in a GROUP BY clause,
  [specify the privacy domain for the column](differential-privacy-privacy-domains-analyst.md).
* [Execute differentially private queries](differential-privacy-analyst.md) against privacy-protected tables and views.
* Use the noise interval to [help understand the results](differential-privacy-analyst.md) of an aggregation.
* If desired, [narrow the data provider’s privacy domain](differential-privacy-privacy-domains-analyst.md) to try to
  improve the results of the query.

## Limitations

* When a table is privacy-protected, analysts can only query the following data types:

  + [Numeric data types](../../sql-reference/data-types-numeric.md)
  + [String data types](../../sql-reference/data-types-text.md). Binary types are not supported.
  + [Logical data types](../../sql-reference/data-types-logical.md)
  + [Date & time data types](../../sql-reference/data-types-datetime.md). For timestamps, only TIMESTAMP_NTZ is supported.
* Some Snowflake features are currently not supported when using differential privacy. For details, see [Interactions with Snowflake features](differential-privacy-admin.md).
* Query functionality is limited in order to protect privacy. For a list of supported operators, query syntax, and functions,
  see [Differential privacy SQL reference](differential-privacy-sql-reference.md).
* When a query is run on a privacy-protected table, Snowflake first calculates statistics that influence how much noise will be added, then
  it runs the query. If the data changes in between these two steps, the amount of noise added may be incorrect. Snowflake recommends that
  data providers schedule data updates so that they don’t occur when analysts can run queries.

## Next steps

If you’re a data provider who is using differential privacy to protect your dataset, see [Implementing differential privacy](differential-privacy-admin.md).

If you’re an analyst who is querying a dataset that’s protected by differential privacy, see [Querying data protected by differential privacy](differential-privacy-analyst.md).

---
title: Differential privacy SQL reference
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-sql-reference.md
section: User Guide
---

# Differential privacy SQL reference

This topic provides the following information:

* A reference for the SQL functions that are unique to differential privacy.
* A list of the Snowflake data types, operators, query syntax, and functions that are supported by differential privacy.

## Differential privacy functions

The following functions are unique to differential privacy.

| Function | Description |
| --- | --- |
| [DP_INTERVAL_LOW](../../sql-reference/functions/dp_interval_low.md) | Returns the lower bound of the noise interval. |
| [DP_INTERVAL_HIGH](../../sql-reference/functions/dp_interval_high.md) | Returns the upper bound of the noise interval. |

## Data types

The following [data types](../../sql-reference-data-types.md) are supported.

| Data type | Notes |
| --- | --- |
| BOOLEAN |  |
| CHAR, CHARACTER |  |
| DATE |  |
| DATETIME |  |
| DECIMAL, NUMERIC |  |
| DOUBLE, DOUBLE PRECISION, REAL |  |
| FLOAT, FLOAT4, FLOAT8 |  |
| INT, INTEGER , BIGINT, SMALLINT, TINYINT, BYTEINT |  |
| NUMBER |  |
| STRING |  |
| TEXT |  |
| TIME |  |
| TIMESTAMP, TIMESTAMP_NTZ | Time data types with time zones are not supported. Use TIMESTAMP or TIMESTAMP_NTZ. |
| VARCHAR |  |

## Query syntax

The following elements of the Snowflake [query syntax](../../sql-reference/constructs.md) are supported.

| Syntax | Notes |
| --- | --- |
| SELECT |  |
| SELECT ALL |  |
| FROM |  |
| INNER JOIN ON | See [Supported joins](differential-privacy-analyst.md). |
| INNER JOIN USING | See [Supported joins](differential-privacy-analyst.md). |
| LEFT OUTER JOIN ON | See [Supported joins](differential-privacy-analyst.md). |
| LEFT OUTER JOIN USING | See [Supported joins](differential-privacy-analyst.md). |
| RIGHT OUTER JOIN ON | See [Supported joins](differential-privacy-analyst.md). |
| RIGHT OUTER JOIN USING | See [Supported joins](differential-privacy-analyst.md). |
| FULL OUTER JOIN ON | See [Supported joins](differential-privacy-analyst.md). |
| FULL OUTER JOIN USING | See [Supported joins](differential-privacy-analyst.md). |
| NATURAL JOIN | See [Supported joins](differential-privacy-analyst.md). |
| WHERE |  |
| GROUP BY | Aliases are not supported in the GROUP BY clause. For example, `GROUP BY col_a AS column_a` is not supported. |

Limitations on query syntax
:   Quoted identifiers (for example, column, table, schema and database names) are not supported.

## Operators

### Arithmetic operators

The following [arithmetic operators](../../sql-reference/operators-arithmetic.md) are supported.

| Operator | Notes |
| --- | --- |
| `-` (unary) |  |
| `-` |  |
| `+` (unary) | Does not work with strings. |
| `+` |  |
| `*` |  |
| `/` |  |
| `%` |  |

### Comparison operators

The following [comparison operators](../../sql-reference/operators-comparison.md) are supported.

| Operator | Notes |
| --- | --- |
| `=` |  |
| `!=` |  |
| `<` |  |
| `>` |  |
| `<=` |  |
| `>=` |  |

### Logical operators

The following [logical operators](../../sql-reference/operators-logical.md) are supported.

| Operator | Notes |
| --- | --- |
| AND |  |
| NOT |  |
| OR |  |

### Set operators

The following [set operators](../../sql-reference/operators-query.md) are supported.

| Operator | Notes |
| --- | --- |
| UNION [ ALL ] |  |

### Subquery operators

[Subquery operators](../../sql-reference/operators-subquery.md) are not supported.

## Functions

### Aggregate functions

The following [aggregate functions](../../sql-reference/functions-aggregation.md) are supported.

| Function | Notes |
| --- | --- |
| ANY_VALUE | Supported only as an aggregate for a subquery with a GROUP BY clause. |
| COUNT |  |
| COUNT DISTINCT |  |

### Bitwise expression functions

[Bitwise expression functions](../../sql-reference/expressions-byte-bit.md) are not supported.

### Conditional expression functions

The following [conditional expression functions](../../sql-reference/expressions-conditional.md) are supported.

| Function | Notes |
| --- | --- |
| [ NOT ] IN |  |
| CASE |  |
| COALESCE |  |
| DECODE |  |
| EQUAL_NULL |  |
| GREATEST |  |
| IFF |  |
| IS [NOT] NULL |  |
| LEAST |  |

### Context functions

[Context functions](../../sql-reference/functions-context.md) are not supported.

### Conversion functions

The following [conversion functions](../../sql-reference/functions-conversion.md) are supported.

| Function | Notes |
| --- | --- |
| CAST, `::` | Columns must be explicitly non-null to be casted. To do this, filter out nulls before casting.  Casting other data types to STRING is not supported. |
| TO_BOOLEAN |  |
| TO_CHAR , TO_VARCHAR |  |
| TO_DECIMAL , TO_NUMBER , TO_NUMERIC |  |
| TO_DOUBLE |  |
| TRY_CAST |  |
| TRY_TO_BOOLEAN |  |
| TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC |  |
| TRY_TO_DOUBLE |  |

### Data generation functions

[Data generation functions](../../sql-reference/functions-data-generation.md) are not supported.

### Data metric functions

[Data metric functions](../../sql-reference/functions-data-metric.md) are not supported. User-defined DMFs are also not supported.

### Date & time functions

The following [date & time functions](../../sql-reference/functions-date-time.md) are supported.

| Function | Notes |
| --- | --- |
| DATE_PART | The following date and time parts are not supported: `dayofweek`, `week`, `yearofweek`, `nanosecond`, `epoch_*`, and `timezone_*`. |
| DAYNAME |  |
| EXTRACT | The following date and time parts are not supported: `dayofweek`, `week`, `yearofweek`, `nanosecond`, `epoch_*`, and `timezone_*`. |
| HOUR |  |
| LAST_DAY |  |
| MINUTE |  |
| SECOND |  |
| TRUNC |  |
| YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER |  |

### Encryption functions

[Encryption functions](../../sql-reference/functions-encryption.md) are not supported.

### File functions

[File functions](../../sql-reference/functions-file.md) are not supported.

### Geospatial functions

[Geospatial functions](../../sql-reference/functions-geospatial.md) are not supported.

### Hash functions

[Hash functions](../../sql-reference/functions-hash-scalar.md) are not supported.

### Metadata functions

[Metadata functions](../../sql-reference/functions-metadata.md) are not supported.

### Numeric functions

The following [numeric functions](../../sql-reference/functions-numeric.md) are supported.

| Function | Notes |
| --- | --- |
| ABS |  |
| CEIL |  |
| FLOOR |  |
| MOD |  |
| SIGN |  |

### Regular expression functions

[Regular expression functions](../../sql-reference/functions-regexp.md) are not supported.

### Semi-structured and structured data functions

[Semi-structured and structured data functions](../../sql-reference/functions-semistructured.md) are not supported.

### String and binary functions

The following [string & binary functions](../../sql-reference/functions-string.md) are supported.

| Function | Notes |
| --- | --- |
| CONTAINS |  |
| LENGTH , LEN |  |
| LOWER |  |
| POSITION |  |
| UPPER |  |

### System functions

[System functions](../../sql-reference/functions-system.md) are not supported.

### Table functions

[Table functions](../../sql-reference/functions-table.md) are not supported.

---
title: Directory tables
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables.md
section: User Guide
---

# Directory tables

This topic introduces key concepts, provides ancillary information, and links to instructions for using directory tables.

## About directory tables

A directory table is an implicit object layered on a stage (not a separate database object) and is conceptually similar to an
external table because it stores file-level metadata about the data files in the stage. A directory table has no grantable privileges of its own.

Both external (external cloud storage) and internal (Snowflake) stages support directory tables. You can add a directory table
to a stage when you create a stage (using [CREATE STAGE](../sql-reference/sql/create-stage.md)) or later
(using [ALTER STAGE](../sql-reference/sql/alter-stage.md)).

In particular, you can use a directory table to accomplish the following unstructured data tasks:

* [Query a list of all the unstructured files on a stage](data-load-dirtables-query.md).
  You can query a directory table to retrieve a list of all the files on a stage. The query output contains information about each file,
  including the size, a timestamp of when it was last modified, and its [Snowflake file URL](unstructured-intro.md).
* [Create views of unstructured data](data-load-dirtables-query.md).
  You can join a directory table with a Snowflake table that contains additional
  data and metadata about unstructured files to see unstructured files and their related data in a single view.
* [Construct a file processing pipeline](data-load-dirtables-pipeline.md). You can use a directory table with
  the Snowpark API or external functions to create a file processing pipeline.

To register changes to files on a stage, you can [refresh the directory table metadata](data-load-dirtables-manage.md).

## Billing for directory tables

An overhead to manage event notifications for the automatic refreshing of directory table metadata is included in your charges. This overhead increases in
relation to the number of files added in cloud storage for your stages that include directory tables. This overhead charge appears as
Snowpipe charges in your billing statement because Snowpipe is used for event notifications for the automatic directory table refreshes.
You can estimate this charge by querying the [PIPE_USAGE_HISTORY](../sql-reference/functions/pipe_usage_history.md) function or examining the Account Usage [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md).

In addition, a small maintenance overhead is charged for manually refreshing the directory table metadata (using ALTER STAGE …
REFRESH). This overhead is charged in accordance with the standard [cloud services billing model](cost-understanding-compute.md),
like all similar activity in Snowflake. Manual refreshes of directory table metadata don’t appear in queries to the [PIPE_USAGE_HISTORY](../sql-reference/functions/pipe_usage_history.md) function or in the Account Usage [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md).

Users with the ACCOUNTADMIN role, or a role with the global MONITOR USAGE privilege, can query the
[AUTO_REFRESH_REGISTRATION_HISTORY](../sql-reference/functions/auto_refresh_registration_history.md) table function to retrieve the history of data files registered in the
metadata of specified objects and the credits billed for these operations.

## Access control requirements for directory tables

The following table summarizes the stage [privileges](security-access-control-overview.md) that you need to execute common
SQL commands when you work with directory tables.

| Operation | Object Type | Privilege Required |
| --- | --- | --- |
| Retrieve file URLs from a directory table using a SELECT FROM DIRECTORY statement. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the READ privilege on the stage. * External stage: An account role or database role with either the READ or USAGE privilege on the stage. |
| Upload data using the [PUT](../sql-reference/sql/put.md) command. | Stage (internal only) | An account role or database role with the WRITE privilege on the stage. |
| Remove files using the [REMOVE](../sql-reference/sql/remove.md) command. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the WRITE privilege on the stage. * External stage: An account role or database role with either the WRITE or USAGE privilege on the stage. |
| Refresh the metadata using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command. | Stage | One of the following, depending on the type of stage:   * Internal stage: An account role or database role with the WRITE privilege on the stage. * External stage: An account role or database role with either the WRITE or USAGE privilege on the stage. |

## Information Schema

The Snowflake [Snowflake Information Schema](../sql-reference/info-schema.md) includes table functions you can query to retrieve information about your directory
tables.

### Table functions

[AUTO_REFRESH_REGISTRATION_HISTORY](../sql-reference/functions/auto_refresh_registration_history.md)
:   Retrieve the history of data files registered in the metadata of specified objects and the credits billed for these operations.

[STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY](../sql-reference/functions/stage_directory_file_registration_history.md)
:   Retrieve information about the metadata history for a directory table, including any errors found when refreshing the metadata.

**Next Topics:**

* [Manage directory tables](data-load-dirtables-manage.md)
* [Query directory tables](data-load-dirtables-query.md)
* [Automated directory table metadata refreshes](data-load-dirtables-auto.md)
* [Build a data processing pipeline using a directory table](data-load-dirtables-pipeline.md)

---
title: Downloading Snowflake Clients, Connectors, Drivers, and Libraries
source: https://docs.snowflake.com/en/user-guide/snowflake-client-repository.md
section: User Guide
---

# Downloading Snowflake Clients, Connectors, Drivers, and Libraries

To download the installation package for a Snowflake client, connector, driver, or library, use the
download pages in the Snowflake Developer Center.

If you want to write a script to download clients over HTTP (e.g. using [curl](https://curl.se/)), you can download SnowSQL, the ODBC Driver,
the Snowpark Library, and SnowCD directly from the Snowflake Client Repository.

See [Drivers](../developer-guide/drivers.md) and [Using Snowflake with Kafka and Spark](connectors.md) for documentation for the drivers and
connectors, respectively. For other developer documentation,
see [Develop Apps and Extensions](https://docs.snowflake.com/developer).

## Snowflake Developer Center Download Pages

To download a Snowflake client, use the following download pages in the [Snowflake Developer Center](https://developers.snowflake.com/):

| Client / Connector / Driver / Library | Download Page |
| --- | --- |
| [Snowflake CLI](../developer-guide/snowflake-cli/index.md) | [Snowflake CLI Download](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html) |
| [ODBC Driver](../developer-guide/odbc/odbc.md) | [ODBC Download](https://developers.snowflake.com/odbc/) |
| [Snowpark API](../developer-guide/snowpark/index.md) | [Snowpark Client Download](https://developers.snowflake.com/snowpark/) |
| [Drivers](../developer-guide/drivers.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) |
| [Scala and Java connectors](connectors.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) |
| [SnowCD](snowcd.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) |
| [Snowpark ML](../developer-guide/snowflake-ml/overview.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) |

## Snowflake Client Repository

To download SnowSQL, the ODBC Driver, the Snowpark Library, or SnowCD over HTTP programmatically (e.g. using [curl](https://curl.se/)), use the
Snowflake Client Repository. The Snowflake Client Repository serves the packages for these clients through CDN (Content Delivery
Network) using the following endpoints:

> * <https://sfc-repo.azure.snowflakecomputing.com/index.html> (mirror on Azure Blob)

If the endpoint is not specified explicitly, the client upgrader (e.g., the SnowSQL auto-upgrader) uses the AWS endpoint. For instructions on specifying the endpoint, see the installation documentation for the client.

> **Note:**
>
> Users can download Snowflake clients from either endpoint regardless of which cloud provider hosts their Snowflake account.

---
title: Drop an external volume by using Snowsight
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-drop-external-volume.md
section: User Guide
---

# Drop an external volume by using Snowsight

Dropping an external volume removes the [external volume](tables-iceberg.md) from the account, but retains a version of the
external volume so that it can be recovered using [UNDROP EXTERNAL VOLUME](../sql-reference/sql/undrop-external-volume.md). For more information, see [Usage Notes for DROP EXTERNAL VOLUME](../sql-reference/sql/drop-external-volume.md).

> **Note:**
>
> To drop an external volume by using SQL, use the [DROP EXTERNAL VOLUME](../sql-reference/sql/drop-external-volume.md) command.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role that has OWNERSHIP privilege on the external volume you want to drop.

   For instructions, see [Switch your primary role](ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » External data.
4. Select the External volumes tab.
5. Select the external volume you want to drop.
6. Select … » Drop external volume.
7. Select Drop external volume again.

---
title: Drop or undrop dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-drop-undrop.md
section: User Guide
---

# Drop or undrop dynamic tables

This topic describes dropping existing dynamic tables and restoring them.

You might no longer need a dynamic table if it’s no longer relevant to your data pipeline. Dropping it helps clean up your environment and
reduces unnecessary storage and compute usage. Because dynamic tables consume resources, especially with frequent refreshes, dropping unused
tables can help manage costs by preventing further resource consumption.

You can undrop or, in other words, restore a dropped dynamic table using the UNDROP DYNAMIC TABLE command. This allows you to recover the
dynamic table and its data without needing to recreate it, whether it’s due to accidental deletion or if a previously dropped table becomes relevant again, such as with changing project priorities or data needs.

To drop or undrop a dynamic table, you must use a role that has the [OWNERSHIP](../sql-reference/sql/grant-ownership.md) privilege on that
dynamic table.

## Drop existing dynamic tables

To drop a dynamic table, you can use either the [DROP DYNAMIC TABLE](../sql-reference/sql/drop-dynamic-table.md) command or [Snowsight](ui-snowsight.md),
as long as you have the [OWNERSHIP](../sql-reference/sql/grant-ownership.md) privilege on that dynamic table.

SQLSnowsight

The following example uses the DROP DYNAMIC TABLE command to drop `my_dynamic_table`.

```sqlexample
DROP DYNAMIC TABLE my_dynamic_table;
```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Find your dynamic table in the list and then select  » Drop.
4. In the popup, confirm that you want to drop the dynamic table.

## Restore dropped dynamic tables

To undrop a dynamic table, you can use the [UNDROP DYNAMIC TABLE](../sql-reference/sql/undrop-dynamic-table.md) command, as long as you have the
[OWNERSHIP](../sql-reference/sql/grant-ownership.md) privilege on that dynamic table. Note that you can only undrop dynamic tables within
the retention period (default is 24 hours). If a dynamic table with the same name already exists, an error will be returned.

The following example uses the UNDROP DYNAMIC TABLE command to drop `my_dynamic_table`.

```sqlexample
UNDROP DYNAMIC TABLE my_dynamic_table;
```

---
title: Dropping an account
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-delete.md
section: User Guide
---

# Dropping an account

The organization administrator can drop an account to delete it from the system. A dropped account
is not deleted immediately, but rather enters a grace period during which the administrator can restore (“undrop”) the account. When the
grace period expires, Snowflake purges the dropped account from the system.

> **Tip:**
>
> Because Snowflake does not permanently delete an account when it is initially dropped, you cannot immediately create a new account
> with the same name as the one you just dropped. As a workaround,
> [rename the account](organizations-manage-accounts-rename.md) before dropping it.

If the organization administrator is using the ORGADMIN role to drop an account, they cannot drop the account while they are logged in to it;
they must log in to a different ORGADMIN-enabled account before executing the DROP ACCOUNT command. This means that the organization
administrator cannot drop the last account in the organization. If your organization consists of a single account that needs to be deleted,
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## About the grace period

When dropping the account, the organization administrator defines a grace period during which the account can be restored. Once an account
is dropped, it is locked to prevent activity during the grace period. An organization continues to pay for the cost of account storage
during the grace period.

The minimum grace period is 3 days and the maximum grace period is 90 days, not including the current date. For example, if the
organization administrator defines the grace period as 3 days when they drop the account on Monday at 11 a.m., then the grace period
expires on Thursday at 11 a.m.

If you want to change the grace period of a dropped account, restore the account, then drop it
again with the new grace period.

The grace period is not the same as the data retention period of [Time Travel](data-time-travel.md).

## Dropping an account that provides listings, reader accounts, and shares

You cannot drop an account that has active listings shared to specific consumers or listings published on the Snowflake Marketplace.
Before you can drop the account, you must do the following:

1. Delete any listings provided by the account. Listings subject to a retirement policy must complete the retirement
   flow before the account can be dropped.
   See [Remove listings as a provider](../collaboration/provider-listings-removing.md).
2. Drop the shares associated with the listings.

If the account provides shares or reader accounts to consumers, the organization administrator of the provider account should
contact those consumers to let them know that they will lose access to the shares and reader accounts provided by the to-be-dropped account.

As soon as the account is dropped, the following happens to the shared data and data products:

* Shares stop working. Consumers lose access to data shared by the account.
* Reader accounts are dropped and then deleted at the same time as the provider account.

## Dropping an account

[As the organization administrator](organization-administrators.md), you can drop an account using [Snowsight](ui-snowsight-gs.md) or
SQL.

Snowsight:
:   1. In the navigation menu, select Admin » Accounts.
    2. Find the active account, and select … » Drop Account.
    3. Enter a grace period during which the account can be restored.
    4. Select Drop Account.

SQL:
:   Execute the [DROP ACCOUNT](../sql-reference/sql/drop-account.md)
    command.

    For example, to drop an account `my_account` and allow a 14-day grace period for recovering the account, enter:

    ```sqlexample
    DROP ACCOUNT my_account GRACE_PERIOD_IN_DAYS = 14;
    ```

> **Note:**
>
> If you want to drop a reader account, execute the [DROP MANAGED ACCOUNT](../sql-reference/sql/drop-managed-account.md) command.

## Viewing dropped accounts

Organization administrators have multiple options for viewing dropped accounts that are still within their grace period. Some of these
options also show dropped accounts that have been permanently deleted from the system.

Snowsight:
:   [As the organization administrator](organization-administrators.md), you can use Snowsight to view all dropped
    accounts, including those that have been permanently deleted.

    1. In the navigation menu, select Admin » Accounts.
    2. Select the Dropped Accounts tab.

    Dropped accounts that are still within the grace period appear with a yellow indicator and have a Drop Date that is in the future.

    Permanently deleted accounts have a Drop Date that is on or before the current date.

SQL:
:   Executing the [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md) command with the optional HISTORY keyword shows dropped accounts
    that are still within their grace period. Permanently deleted accounts are not included in the output.

    When the [organization administrator](organization-administrators.md) executes the following command:

    ```sqlexample
    SHOW ACCOUNTS HISTORY;
    ```

    The output includes dropped accounts and the additional `dropped_on`, `scheduled_deletion_time`, and `restored_on`
    columns.

ACCOUNTS View:
:   Users with access to the [ORGANIZATION_USAGE schema](../sql-reference/organization-usage.md) can query the
    [ACCOUNTS view](../sql-reference/organization-usage/accounts.md) to see all dropped accounts, including those that have been
    permanently deleted.

## Restoring an account

An organization administrator can restore, or undrop, a dropped account within the grace period, which prevents it from being purged.
Undropping an account unlocks it, allowing users to access the account as if it had never been dropped.

[As the organization administrator](organization-administrators.md), you can undrop an account using Snowsight or
SQL.

Snowsight:
:   1. In the navigation menu, select Admin » Accounts.
    2. Select the Dropped Accounts tab.
    3. Find the account, and select … » Undrop Account.
    4. Select Undrop Account.

SQL:
:   Execute the [UNDROP ACCOUNT](../sql-reference/sql/undrop-account.md) command to restore an account. For example, the following command
    restores the dropped account `myaccount123`, which was still within the grace period:

    ```sqlexample
    UNDROP ACCOUNT myaccount123;
    ```

---
title: Dynamic table access control
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-privileges.md
section: User Guide
---

# Dynamic table access control

This topic discusses the privileges needed to perform operations with dynamic tables, such as creating, querying, altering, viewing, and dropping.

For more information about the Snowflake privilege model, see [Overview of Access Control](security-access-control-overview.md) and [Access control privileges](security-access-control-privileges.md).

## Transfer ownership

To provide a user full access to a dynamic table, you can do either of the following:

* [Grant OWNERSHIP on the dynamic table to a role](../sql-reference/sql/grant-ownership.md).
* [Grant all privileges, except OWNERSHIP, on the dynamic table to a role](../sql-reference/sql/grant-privilege.md).
* [Grant the OWNERSHIP privilege or ALL PRIVILEGES on future dynamic tables to a role](../sql-reference/sql/grant-privilege.md).

When assigning grants, ensure that you specify the object type as `DYNAMIC TABLE` because dynamic tables have a different set of privileges
than regular tables.

To grant the OWNERSHIP privilege on dynamic tables, ensure the receiving role has the USAGE privilege on the following. Otherwise, subsequent
scheduled refreshes fail.

* The database and schema that contains the dynamic table.
* The warehouse used to refresh the table.

To transfer ownership of a dynamic table, you can use either the [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command or
[Snowsight](ui-snowsight.md).

SQLSnowsight

The following example uses the [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command to grant ownership privileges on `my_dynamic_table`
to the `budget_admin` role.

```sqlexample
GRANT OWNERSHIP ON DYNAMIC TABLE my_dynamic_table TO ROLE budget_admin;
```

The following example uses the [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command to grant ownership privileges on all future dynamic
tables created in the `mydb.myschema` schema to the `budget_admin` role.

```sqlexample
GRANT OWNERSHIP ON FUTURE DYNAMIC TABLES IN SCHEMA mydb.myschema TO ROLE budget_admin;
```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Find your dynamic table in the list and then select  » Transfer Ownership.
4. Select the role to transfer ownership to.

To learn more about the Snowflake privilege model, see [Overview of Access Control](security-access-control-overview.md) and [Access control privileges](security-access-control-privileges.md).

## Refresh dynamic tables with specific user privileges and secondary roles

Dynamic tables can be configured to refresh with the privileges of a specific user, in addition to privileges of the owner role. Dynamic
tables that specify EXECUTE AS USER run on behalf of the named user, instead of the system user.

For example, you can grant a user a primary role that provides access to a table and a secondary role that provides access to a virtual
warehouse. The user can then create a dynamic table that operates with the combined privileges of both roles, simplifying privilege
management and enhancing the flexibility of your data operations.

While the EXECUTE AS USER option enables dynamic tables to refresh under the user’s role, all other operations on these dynamic tables adhere
to the standard privilege model.

### Key use cases

* **Manage multi-role privileges:** In situations where users have secondary roles, they can create and refresh a dynamic table using the
  combined privileges of their primary and secondary roles. This configuration ensures that the user who is refreshing the dynamic table has
  the necessary permissions to access all required resources, while maintaining consistency with existing role-based access controls.
* **Granular security and governance controls:** Users can configure optional security measures with additional options such as REQUIRE USER,
  where a dynamic table can’t run unless a user is specified.
* **Accountability for all operations:** All refreshes on an EXECUTE AS USER dynamic table are attributed to the configured user instead
  of the SYSTEM user. This attribution helps maintain a clear audit trail for all operations.

### Access control

The owner role of the dynamic table must be granted the IMPERSONATE privilege on the user specified by EXECUTE AS USER, and the specified user
must be granted the owner role of the dynamic table. If the IMPERSONATE privilege is revoked, the dynamic table refresh will fail and the
dynamic table might be [auto suspended](dynamic-tables-suspend-resume.md).

When the dynamic table refreshes, the primary role of the refresh session is the owner role of the dynamic table, and the user’s default
secondary roles are activated. Users can switch primary roles with the USE ROLE command and adjust the secondary roles in the refresh session
with the USE SECONDARY ROLES command.

### Cross-product considerations

* **Data masking and row access policies:** Policies—for example, those using CURRENT_USER()—evaluate based on the specified user and
  roles rather than the SYSTEM user.
* **Replication and failover:** The user name and role name are replicated to secondary deployments.

  If a user or role is not available on the secondary deployment, the user is marked as INVALID and refreshes will fail until fixed.

  Invalid secondary roles are skipped during execution if the remaining roles provide sufficient privileges.

### Examples

#### Configure a dynamic table to run refreshes as a user

The following example creates a dynamic table that executes refreshes as the specified user, with the primary role set to the owner role of
the dynamic table. Refreshes execute with any user-lineage parameters that the user has set.

If no option for secondary roles is explicitly specified, the refresh defaults to the user’s current session setting.

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table
  [ EXECUTE AS USER my_user_name
    [ USE SECONDARY ROLES { ALL | NONE | (<role1>, <role2>, ... ) } ]
  ]
```

#### Set a secondary role for an existing dynamic table

The following example configures a dynamic table to execute as the specified user. If no specific secondary roles are selected, the refresh
process defaults to the current session’s active secondary roles. If the dynamic table is already set to execute as a specific user, this
command will update the configuration to execute as the user executing the ALTER DYNAMIC TABLE command.

Executing this command requires the OWNERSHIP privilege on the dynamic table.

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET
  EXECUTE AS USER my_user_name
  [ USE SECONDARY ROLES { ALL | NONE | (<role1>, <role2>, ... ) } ]
```

#### Switch a dynamic table to execute as the SYSTEM user

The following example reverts a dynamic table to execute under the SYSTEM user using the owner role of the dynamic table.

Executing this command requires the OWNERSHIP privilege on the dynamic table.

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table UNSET EXECUTE AS USER;
```

## Privileges to create a dynamic table

To create a dynamic table, you must use a role that has the following privileges:

| Privilege | Object |
| --- | --- |
| CREATE DYNAMIC TABLE | Schema in which you plan to create the dynamic table. |
| SELECT | Existing tables and views that you plan to query for the new dynamic table. |
| USAGE | Database and schema that you plan to use for the new dynamic table.  Warehouse that you plan to use to refresh the table.  **Note:** Although you can execute `CREATE DYNAMIC TABLE ... INITIALIZE = ON_SCHEDULE` with a secondary role that has the USAGE privilege, the dynamic table won’t successfully refresh if the primary role lacks this privilege, and therefore it won’t be initialized. |

To create a dynamic table that depends on another dynamic table, you must use a role that has the following privileges:

| Privilege | Object |
| --- | --- |
| SELECT | Dynamic table you plan to query from to create the new dynamic table. |
| OPERATE | All upstream dynamic tables the new dynamic table depends on. Only required if you set the dynamic table to refresh synchronously at creation and the upstream dynamic table is referenced directly in the definition, not through [DYNAMIC_TABLE_REFRESH_BOUNDARY()](dynamic-tables-refresh-boundary.md). |

## Privileges to query a dynamic table

To query a dynamic table, you can use a role that has the privileges to create a dynamic table.
For scenarios where a user only needs to query a dynamic table - for example, a data analyst - use a role that has the following privileges:

| Privilege | Object |
| --- | --- |
| USAGE | Database and schema that contains the dynamic table.  Warehouse used to run the query. |
| SELECT | The dynamic table being queried. |

## Privileges to alter a dynamic table

To alter a dynamic table, you must use a role that has either the OWNERSHIP or OPERATE privilege on that dynamic table.

If you have the OPERATE privilege on a dynamic table, you can do the following with the ALTER DYNAMIC TABLE command:

* Suspend a dynamic table using [ALTER … SUSPEND](../sql-reference/sql/alter-dynamic-table.md).
* Resume a dynamic table using [ALTER … RESUME](../sql-reference/sql/alter-dynamic-table.md).
* Refresh a dynamic table using [ALTER … REFRESH](../sql-reference/sql/alter-dynamic-table.md).
* Set or change the warehouse and/or target lag using [ALTER … SET](../sql-reference/sql/alter-dynamic-table.md).

If you have the OWNERSHIP privilege on a dynamic table, you can do the following in addition to the operations listed above:

* Set or unset a comment using [ALTER … SET | UNSET COMMENT](../sql-reference/sql/alter-dynamic-table.md).
* Rename a dynamic table using [ALTER … RENAME TO](../sql-reference/sql/alter-dynamic-table.md).
* Swap a dynamic table with another using [ALTER … SWAP WITH](../sql-reference/sql/alter-dynamic-table.md)
* Set a new parameter using [ALTER … SET](../sql-reference/sql/alter-dynamic-table.md)
* Specify or drop clustering keys. See [Clustering actions (clusteringAction)](../sql-reference/sql/alter-dynamic-table.md).
* Change governance policies. See [Data Governance policy and tag actions (dataGovnPolicyTagAction)](../sql-reference/sql/alter-dynamic-table.md).
* Change search optimization. See [Search optimization actions (searchOptimizationAction)](../sql-reference/sql/alter-dynamic-table.md).

## Privileges to view a dynamic table’s metadata

To view metadata, you must use a role that has the MONITOR privilege on that dynamic table.

For scenarios where the user only needs to view the metadata and Information Schema of a dynamic table (for example, roles held by data
scientists), use a role that has the MONITOR privilege on that dynamic table. While the OPERATE privilege grants this access, it also includes
the capability to alter dynamic tables, making MONITOR the more suitable option for scenarios where a user does not need to alter a dynamic
table.

If you have the MONITOR privilege on a dynamic table, you can do the following:

* Use the [DESCRIBE DYNAMIC TABLE](../sql-reference/sql/desc-dynamic-table.md) command and Snowsight dynamic tables details page to view the specific details
  for a dynamic table. The following fields are hidden if you only have the SELECT privilege on a dynamic table: `text`, `warehouse`,
  `scheduling_state`, `last_suspended_on`, and `suspend_reason_code` (UI-only).
* Use the [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) command to view which dynamic tables you have access to.
* Call the [DYNAMIC_TABLE_GRAPH_HISTORY](../sql-reference/functions/dynamic_table_graph_history.md) table function to view graph history.
* Call the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) table function to view refresh history.

## Privileges to drop a dynamic table

To drop a dynamic table, you must use a role that has the [OWNERSHIP](../sql-reference/sql/grant-ownership.md) privilege on that dynamic
table.

## Privileges to use dual warehouses

All privilege requirements for using INITIALIZATION_WAREHOUSE are the same as WAREHOUSE.

| Operation | Privilege |
| --- | --- |
| CREATE DYNAMIC TABLE using INITIALIZATION_WAREHOUSE | CREATE DYNAMIC TABLE and USAGE on both warehouses, the WAREHOUSE and INITIALIZATION_WAREHOUSE. |
| ALTER DYNAMIC TABLE … SET / UNSET INITIALIZATION_WAREHOUSE | OWNERSHIP or OPERATE on the dynamic table and USAGE on the applicable warehouse. |
| ALTER DYNAMIC TABLE … REFRESH on a dynamic table that uses INITIALIZATION_WAREHOUSE | OPERATE on the dynamic table and USAGE on the applicable warehouse. |

For more information, see [Understand warehouse usage for dynamic tables](dynamic-tables-warehouses.md).

---
title: Dynamic table limitations
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-limitations.md
section: User Guide
---

# Dynamic table limitations

This topic describes general and cross-feature limitations on dynamic tables.

## General limitations

The following general limitations apply to using dynamic tables:

* A single account can hold a maximum of 50,000 dynamic tables.
* You can’t truncate data from a dynamic table.
* You can’t create a temporary dynamic table.
* When you use a dynamic table to [ingest shared data](dynamic-tables-data-sharing.md), the query can’t select from a shared
  dynamic table or a shared secure view that references an upstream dynamic table.
* You can’t use secondary roles with dynamic tables because dynamic table refreshes act as their owner role. For more information, see
  [Authorization through primary role and secondary roles](security-access-control-overview.md).
* You can’t use dynamic SQL (for example, session variables or unbound variables of anonymous blocks) in the dynamic table’s definition.
* In a dynamic table definition, SELECT blocks that read from user-defined table functions (UDTF) must explicitly specify columns and can’t
  use `*`.
* Dynamic tables can become stale if they are not refreshed within the [MAX_DATA_EXTENSION_TIME_IN_DAYS](../sql-reference/parameters.md) period of the input
  tables. Once stale, they must be recreated to resume refreshes.
* When creating a dynamic table that uses a warehouse named DEFAULT, you must use double quotes around the name, following the
  [double-quoted identifier requirements](../sql-reference/identifiers-syntax.md). For example, `CREATE DYNAMIC TABLE ... WAREHOUSE = "DEFAULT"`.
  For more information on creating dynamic tables, see [Create dynamic tables](dynamic-tables-create.md).
* Dynamic tables don’t support sources that include directory tables, external tables, streams, and materialized views.
* You can’t create dynamic tables that read from views that query other dynamic tables, unless the view is wrapped in
  [DYNAMIC_TABLE_REFRESH_BOUNDARY()](dynamic-tables-refresh-boundary.md).
* You can’t clone dynamic Iceberg tables. Additionally, cloning a database or schema containing a dynamic Iceberg table does not clone the
  table to the new location.
* You can’t set the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) object parameter to zero if your base table is a shared table.

## Immutability constraints

The following limitations apply when you work with [immutability constraints](dynamic-tables-immutability-constraints.md)
and backfilled data:

* Currently, only regular and dynamic tables can be used for backfilling.
* You can’t specify policies or tags in the new dynamic table because they are copied from the backfill table.
* Clustering keys in the new dynamic table and backfill table must be the same.

## Support for cross-feature interactions

The following cross-feature interactions are not supported:

* Using the query acceleration service (QAS) for dynamic table refreshes.
* Masking policies with database roles on shared tables.
* Aggregation and projection policies cannot be applied to the base tables of dynamic tables. If a base table has aggregation or projection
  policies associated with it, the dynamic table will fail to create.

## Support for incremental refresh

Dynamic tables support two refresh modes: incremental and full. You can either set the refresh mode to AUTO or set it explicitly. For more
information, see [Dynamic table refresh modes](dynamic-tables-refresh.md) and [Choose a refresh mode](dynamic-tables-performance-optimize.md).

### Incremental refresh on full-refresh dynamic tables

Dynamic tables in incremental refresh mode can’t consume an upstream dynamic table with full refresh mode unless the upstream full-refresh
dynamic table has a system-derived reliable unique key. When such a reliable unique key exists, Snowflake can compute row-level changes across full refreshes,
enabling downstream incremental processing.

To use this capability, set `REFRESH_MODE = INCREMENTAL` explicitly on the downstream dynamic table. `REFRESH_MODE = AUTO` doesn’t
resolve to incremental in this scenario.

For more information, see [Understanding primary keys in dynamic tables](dynamic-tables-primary-keys.md).

### Masking and row access policies

Masking or row access policies on a dynamic table don’t affect its refresh mode. However, policies applied on base tables might affect the
refresh mode:

> * Incremental refresh is supported if the policies on base tables use the [CURRENT_ROLE](../sql-reference/functions/current_role.md) or
>   [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) function.
> * Incremental refresh isn’t supported if the policies on base tables use any other functions, INFORMATION_SCHEMA views, or query a table
>   (for example, a [mapping table lookup](security-row-using.md)).
> * Changes to the policies on base objects of dynamic tables with incremental refresh trigger reinitialization.

### Replication

Replicated dynamic tables with incremental refresh reinitialize after failover before they can resume incremental refresh.

For more information, see [Replication and dynamic tables](account-replication-considerations.md).

### Cloning

[Cloned incremental dynamic tables](../sql-reference/sql/create-dynamic-table.md) might need to reinitialize during their first refresh after being
created.

If a dynamic table is cloned from another dynamic table with dropped base tables, the clone will be suspended and can’t be resumed or
refreshed.

---
title: Dynamic table performance and optimization
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance.md
section: User Guide
---

# Dynamic table performance and optimization

Learn how to optimize and monitor dynamic tables for speed and cost-efficiency. This section
provides foundational concepts and links to more detailed topics.

Dynamic table *performance* refers to how quickly and efficiently a
[dynamic table refresh](dynamic-tables-refresh.md) completes. A
well-performing dynamic table refreshes fast enough to meet its [target lag](dynamic-tables-target-lag.md) without
consuming excessive compute resources.

## Why performance matters

Data freshness
:   Dynamic tables refresh based on a [target lag](dynamic-tables-target-lag.md) that you specify,
    which is the maximum allowed delay between updates to source tables and the dynamic table’s content.
    When refreshes take too long, your pipeline might not meet your freshness requirements.

    For example, setting a target lag of five minutes when your refresh takes eight minutes means your
    pipeline can’t maintain the required freshness.

Cost efficiency
:   Dynamic tables require virtual warehouses for refreshes, which consume credits. Poorly optimized
    dynamic tables might scan more data than necessary, trigger full refreshes when incremental would
    suffice, or require larger warehouses to complete within target lag windows.

    For more information about costs, see [Understanding costs for dynamic tables](dynamic-tables-cost.md).

## Performance decisions

Changes that affect dynamic table performance fall into two categories based on *when* you
can make them:

|  | Design changes | Adjustments |
| --- | --- | --- |
| **When** | Before you create a pipeline. | After your pipeline is running. |
| **Impact** | High | Medium |
| **Flexibility** | Hard to change; requires recreating tables. | Easy to change; no need to recreate tables. |
| **Examples** | Query structure, refresh mode, pipeline design. | Warehouse size, clustering keys, target lag. |

For detailed guidance on both categories, see
[Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

## Get started

To get started with dynamic table performance optimization, try the hands-on tutorial:

[Tutorial: Optimize dynamic table performance for SCD Type 1 workloads](tutorials/optimize-dynamic-table-performance.md)
:   Learn how to identify and resolve performance bottlenecks in a dynamic table pipeline. This tutorial
    shows how different SQL patterns affect incremental refresh and how to use the `QUALIFY` clause
    to efficiently remove duplicate rows.

## Topics in this section

[Monitor dynamic table performance](dynamic-tables-performance-monitor.md)
:   How to monitor refresh performance, analyze query profiles, and track key metrics.

[Optimize dynamic table performance](dynamic-tables-performance-optimize.md)
:   Key concepts and optimization techniques: refresh modes, data locality, warehouse sizing,
    target lag, query patterns, and clustering.

[Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md)
:   Performance guide for how SQL operators affect incremental refresh speed.

[Use immutability constraints](dynamic-tables-performance-optimize-immutability.md)
:   How to use immutability constraints to mark historical data as unchanging and reduce refresh scope.

---
title: Dynamic table refresh boundary
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-refresh-boundary.md
section: User Guide
---

# Dynamic table refresh boundary

Use a dynamic table refresh boundary to decouple dynamic table pipelines while still reading upstream results.

When one dynamic table references another, the two are refreshed together as a single pipeline. While this works in the majority of
scenarios, there are certain use cases, such as sharing data across team boundaries, where data refresh needs can vary. You can declare
the two dynamic tables as independent (and thus belonging to different pipelines) by wrapping the upstream reference in
`DYNAMIC_TABLE_REFRESH_BOUNDARY()`. Snapshot isolation is only guaranteed within a single pipeline, so dynamic tables across a refresh
boundary do not provide snapshot isolation with each other.

## Overview

By default, when a dynamic table reads from another dynamic table, Snowflake:

* Refreshes both tables together as part of a single pipeline.
* Coordinates refreshes so that downstream dynamic tables see
  snapshot isolation across all upstream dynamic tables in the pipeline.
* Enforces pipeline-level rules such as target lag checks.

In some pipelines, you don’t want every relationship to cause both dynamic tables to be refreshed together. Common examples include:

* **Cross-team pipelines** where one team publishes a dynamic table that another team consumes, but the downstream dynamic table should not
  influence or inherit the upstream pipeline.
* **Incremental migrations** where you convert an upstream pipeline step to a dynamic table but don’t want downstream consumers to start
  coordinating refreshes with it.
* **Dynamic-table-on-view-on-dynamic-table** patterns, where a dynamic table reads from a view that queries another dynamic table.
  This pattern is unsupported unless the view is wrapped in `DYNAMIC_TABLE_REFRESH_BOUNDARY()`.

A refresh boundary makes this separation explicit: inputs inside the boundary are treated as belonging to a separate pipeline
and are read like regular tables.

## Syntax

```sqlsyntax
DYNAMIC_TABLE_REFRESH_BOUNDARY( <object_name> )
```

Where:

`object_name`
:   A table, view, dynamic table, or common table expression (CTE) name.

Use this keyword in the FROM / JOIN clause (including within CTEs and UNION branches) of a dynamic table definition.

## Examples

The following example reads from a view that queries another dynamic table. Without a refresh boundary, creating a dynamic table that reads
from a view on another dynamic table is unsupported. Wrapping the view in `DYNAMIC_TABLE_REFRESH_BOUNDARY()` makes this pattern possible:

```sqlexample
CREATE DYNAMIC TABLE analytics.click_analytics_dt
  WAREHOUSE = analytics_wh
  TARGET_LAG = '5 minutes'
AS
SELECT *
FROM DYNAMIC_TABLE_REFRESH_BOUNDARY(analytics.enriched_clicks_view);
```

The following example joins a directly referenced dynamic table with a view whose upstream dynamic table refreshes on a longer schedule.
Wrapping the view in `DYNAMIC_TABLE_REFRESH_BOUNDARY()` prevents the downstream dynamic table from triggering the expensive upstream
refresh every 5 minutes, while still allowing it to read the latest available version. Snapshot isolation is not guaranteed across the
refresh boundary:

```sqlexample
CREATE DYNAMIC TABLE data_eng.enriched_clicks_dt
  WAREHOUSE = de_wh
  TARGET_LAG = '5 minutes'
AS
SELECT
  c.*,
  p.product_name
FROM data_eng.clickstream_dt AS c
LEFT JOIN DYNAMIC_TABLE_REFRESH_BOUNDARY(product_db.active_products_view) AS p
  ON c.product_id = p.product_id;
```

## Behavior

### How refresh boundaries change dependencies

When you wrap an input in `DYNAMIC_TABLE_REFRESH_BOUNDARY()` inside a dynamic table definition:

* That input is treated as a refresh boundary input for this definition.
* Any dynamic tables reachable from that input are not included in the pipeline for this definition.
* On refresh, the dynamic table reads those objects at their current version, not at the data timestamp coordinated across the pipeline.

As a result:

**No cascading refresh across the boundary**
:   Refreshing the downstream dynamic table does not trigger refreshes of dynamic tables that are only reachable through a refresh boundary.

**Independent scheduling**
:   Target lag and refresh scheduling for the downstream dynamic table ignore dynamic tables that are only reachable through the boundary.

**No snapshot isolation across the boundary**
:   The downstream dynamic table reads whatever version of the upstream data is available at refresh time. The data across the boundary is not
    guaranteed to be aligned with the
    snapshot isolation that applies to other upstream dependencies.

### Snapshot isolation vs. refresh boundaries

Within a single pipeline (without a boundary), Snowflake guarantees
[snapshot isolation](dynamic-tables-refresh.md) across all upstream dynamic tables participating in that pipeline.

Refresh boundaries intentionally weaken this guarantee on the dependency that crosses the boundary:

* **Inside the boundary:** objects refresh and coordinate according to their own pipelines.
* **Outside the boundary:** the downstream dynamic table reads whatever version is available at its refresh time.

A single dynamic table definition can therefore reference both types of inputs:

* **Direct references** to upstream dynamic tables, which participate in snapshot isolation and coordinated refreshes within the pipeline.
* **Refresh boundary references**, which read the latest available version of the upstream data independently, without snapshot isolation.

Use refresh boundaries only on dependencies where you do not require snapshot isolation between the upstream and downstream dynamic tables.

## Use cases

### Decoupling cross-team pipelines

Different teams might own different parts of a logical pipeline:

* **Team A:** publishes a core dynamic table used across the organization.
* **Team B:** defines a downstream dynamic table that joins the core dynamic table with team-specific data.

Team B can wrap Team A’s output in a refresh boundary to:

* Avoid pulling Team A’s dynamic tables into their own pipeline.
* Keep their own refresh schedule independent.
* Treat Team A’s dynamic table similar to an external, periodically updated table.

### Enabling dynamic table on view on dynamic table

Without a refresh boundary, creating a dynamic table that reads from a view on another dynamic table is unsupported. With a refresh boundary,
you can explicitly mark the view dependency as a boundary:

```sqlexample
CREATE VIEW v_orders AS
SELECT *
FROM orders_dt;

CREATE DYNAMIC TABLE order_summary_dt
  WAREHOUSE = analytics_wh
  TARGET_LAG = '15 minutes'
AS
SELECT
  customer_id,
  COUNT(*) AS num_orders
FROM DYNAMIC_TABLE_REFRESH_BOUNDARY(v_orders)
GROUP BY customer_id;
```

Here, `order_summary_dt`:

* Reads from `orders_dt` through a refresh boundary.
* Does not belong to the same pipeline as `orders_dt`.
* Reads whatever version of `orders_dt` is available when it refreshes.

### Example: team-owned boundary view

A common pattern is for one team to own both a dynamic table and a view on top of it, and to apply the refresh boundary inside the view
definition. Other teams then consume that view without introducing new dependencies to the owning team’s dynamic table.

```sqlexample
-- Team A: owns product_catalog_dt and publishes a boundary view
CREATE DYNAMIC TABLE product.product_catalog_dt
  WAREHOUSE = product_wh
  TARGET_LAG = '1 hour'
AS
SELECT *
FROM product.raw_products;

CREATE VIEW product.active_products_public_v AS
SELECT * FROM DYNAMIC_TABLE_REFRESH_BOUNDARY(product.product_catalog_dt)
WHERE is_active = TRUE;

-- Team B: consumes Team A's view in their own dynamic table
CREATE DYNAMIC TABLE analytics.active_product_clicks_dt
  WAREHOUSE = analytics_wh
  TARGET_LAG = '5 minutes'
AS
SELECT
  c.*,
  p.product_name
FROM analytics.clickstream_dt AS c
JOIN product.active_products_public_v AS p
  ON c.product_id = p.product_id;
```

In this pattern:

* Team A controls the refresh boundary by wrapping `product_catalog_dt` inside `product.active_products_public_v`.
* Team B and other teams define their own dynamic tables that reference only the published view.
* Those downstream dynamic tables do not add `product_catalog_dt` to their own pipeline;
  `product_catalog_dt` remains outside their pipelines even though its data is visible through the view.

### Incremental migration to dynamic tables

If you migrate an existing pipeline step to a dynamic table, you might not want downstream consumers to:

* Start triggering refreshes of the new dynamic table.
* Inherit new target lag requirements.

Wrapping the new dynamic table (or a view on top of it) in a refresh boundary lets downstream dynamic tables consume it without being added
to the same pipeline.

## Target lag

Refresh boundaries also influence how target lag is enforced.

The target lag of an upstream dynamic table must be the same as or shorter than that of any downstream dynamic table within the same pipeline.
Dynamic tables referenced through `DYNAMIC_TABLE_REFRESH_BOUNDARY()` do not belong to the same pipeline, so this rule does not apply
across the boundary.

Upstream dynamic tables inside a refresh boundary keep their own target lag and scheduling behavior; they are not tightened or relaxed by
downstream choices across the boundary.

## Restrictions and limitations

Refresh boundaries are subject to a few important rules:

**Same dynamic table both inside and outside a refresh boundary is not allowed**

All references to the same upstream dynamic table within a single definition must be either directly in the definition or wrapped in
`DYNAMIC_TABLE_REFRESH_BOUNDARY()`. Mixing both would allow the same dynamic table to be read at different versions.
Snowflake blocks these definitions and returns a descriptive error.

**Unsupported boundary targets**

`DYNAMIC_TABLE_REFRESH_BOUNDARY()` must wrap a named object (table, view, dynamic table, or CTE). It cannot wrap:

* Inline subqueries.
* Table functions or UDTFs.
* Arbitrary `TABLE(...)` calls.

**Effect outside dynamic tables**

You can call `DYNAMIC_TABLE_REFRESH_BOUNDARY()` in regular SELECT queries, but outside of a dynamic table definition it is a no-op.

## Best practices

When using refresh boundaries in dynamic table pipelines:

**Use a refresh boundary when:**

* You want to consume another team’s dynamic table without joining its pipeline.
* You do not need snapshot isolation from a particular upstream dependency.
* A dynamic table depends on a view that references another dynamic table. This pattern is only supported when either the view or the upstream
  dynamic table is wrapped in `DYNAMIC_TABLE_REFRESH_BOUNDARY()`.

**Avoid a refresh boundary when:**

* You need snapshot isolation across that dependency.
* You want downstream refreshes to coordinate with upstream dynamic tables and, if needed, cascade refreshes.
* You rely on global target lag relationships across the entire pipeline.

---
title: Dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-about.md
section: User Guide
---

# Dynamic tables

Dynamic tables are tables that automatically refresh based on a defined query and target freshness, simplifying data transformation and
pipeline management without requiring manual updates or custom scheduling.

When you create a dynamic table, you define a query that specifies how data should be transformed from base objects. Snowflake handles the
refresh schedule of the dynamic table and updates the table automatically to reflect the changes made to the base objects based on the query.

## Key considerations and general best practices

**Immutability constraints**: Use immutability constraints to let you control dynamic table updates. The constraints keep specific rows static
while enabling incremental updates to the rest of the table. They prevent unwanted changes to marked data while they let normal refreshes occur
for other parts of the table. For more information, see [Understanding immutability constraints](dynamic-tables-immutability-constraints.md).

**Primary keys**: Snowflake uses reliable primary keys to track changes more efficiently in dynamic table pipelines. When a dynamic table has a
system-derived primary key, downstream tables can use incremental refresh even if the upstream table uses full refresh mode. For more information,
see [Understanding primary keys in dynamic tables](dynamic-tables-primary-keys.md).

**Performance considerations:** Dynamic tables use incremental processing for workloads that support it,
which can improve performance by processing only changed data instead of recomputing entire tables.
Performance depends on your query patterns and data organization. For guidance on optimizing dynamic table
performance, see [Dynamic table performance and optimization](dynamic-tables-performance.md).

**Break down complex dynamic tables:** Break your pipeline into smaller, focused dynamic tables to improve performance and simplify
troubleshooting. For more information, see [Best practices for creating dynamic tables](dynamic-tables-create.md).

## How dynamic tables work

Snowflake runs the definition query specified in your CREATE DYNAMIC TABLE statement and your dynamic tables are updated through an automated
refresh process.

The following diagram shows how this process computes the changes made to the base objects and merges them into the dynamic table by using
compute resources associated with the table.

### Target lag

Use *target lag* to set how fresh you want your data to be. Usually, the table data freshness won’t be more than that far behind the base table
data freshness. With target lag, you control how often the table refreshes and how up-to-date the data stays. Target lag affects
refresh frequency and compute costs.

For more information, see [Understanding dynamic table target lag](dynamic-tables-target-lag.md). For guidance on balancing data freshness with
performance, see [Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

### Dynamic table refresh

Dynamic tables aim to refresh within the target lag you specify. For example, a target lag of five minutes ensures that the data in the dynamic
table is no more than five minutes behind data updates to the base table. You set the refresh mode when you create the table and, afterward,
refreshes can happen on a schedule or manually.

For more information, see [Understanding dynamic table initialization and refresh](dynamic-tables-refresh.md) and [Manually refresh dynamic tables](dynamic-tables-manual-refresh.md).

## When to use dynamic tables

Dynamic tables are ideal for the following scenarios:

* You want to materialize query results without writing custom code.
* You want to avoid manually tracking data dependencies and managing refresh schedules. Dynamic tables enable you to define pipeline outcomes
  declaratively, without managing transformation steps manually.
* You want to chain together multiple tables for data transformations in a pipeline.
* You don’t need fine-grained control over refresh schedules, and you only need to specify a target freshness for the pipeline. Snowflake
  handles the orchestration of data refreshes, including scheduling and execution, based on your target freshness requirements.

### Example use cases

* **Slowly changing dimensions (SCDs):** Dynamic tables can be used to implement Type 1 and Type 2 SCDs by reading from a change stream and
  using window functions over per-record keys ordered by a change timestamp. This method handles insertions, deletions, and updates that occur
  out of order, simplifying the creation of SCDs. For more information, see
  [Slowly Changing Dimensions with Dynamic Tables](https://medium.com/snowflake/slowly-changing-dimensions-with-dynamic-tables-d0d76582ff31).
* **Joins and aggregations:** To enable fast queries, you can use dynamic tables to incrementally precompute slow joins and aggregations.
  For guidance on optimizing these operators for incremental refresh, see
  [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).
* **Batch to streaming transitions**: Dynamic tables support seamless transitions from batch to streaming with a single ALTER DYNAMIC TABLE
  command. You can control the refresh frequency in your pipeline to balance cost and data freshness.

---
title: Dynamic tables compared to streams and tasks, and materialized views
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-comparison.md
section: User Guide
---

# Dynamic tables compared to streams and tasks, and materialized views

Like streams and tasks, dynamic tables provide a way to transform data in your pipeline.

## Dynamic tables compared to streams and tasks

Although dynamic tables serve a similar purpose compared to streams and tasks, there are important differences.

### Create a stream on a dynamic table

You can use a dynamic table as the source of a stream, like streams on regular tables, with the following limitations:

* Refresh mode: Streams can be created only on dynamic tables that refresh [incrementally](dynamic-tables-refresh.md).
  Full refresh dynamic tables aren’t supported because they completely rewrite the table on every refresh.
* Stream type: Dynamic tables support only standard (that is, delta) streams. For more information, see
  [Types of streams](streams-intro.md).

The following example shows how to create a stream on a dynamic table:

```sqlexample
-- Create the dynamic table, for reference only
CREATE OR REPLACE DYNAMIC TABLE product ...;

-- Create the stream.
CREATE OR REPLACE STREAM deltaStream ON DYNAMIC TABLE product;
```

### Comparison between streams and tasks and dynamic tables

| Key Characteristics | Streams and Tasks | Dynamic Tables |
| --- | --- | --- |
| Data transformation | Tasks use an imperative approach: You write procedural code to transform data from base tables. | Dynamic tables use a declarative approach: You write a query that specifies the result you want, and data is retrieved and transformed from the base tables used in the query. Except for [Supported non-deterministic functions in incremental and full refresh modes](dynamic-tables-supported-queries.md), the query can’t contain non-deterministic functions. |
| Refresh timing | You define a schedule for executing the code that transforms the data. | An automated refresh process determines the schedule for performing refreshes. The process schedules these refreshes to meet the specified target level of freshness (lag). |
| Orchestration | The procedural code can contain calls to non-deterministic code, stored procedures, and other tasks. The procedural code can contain calls to UDFs and external functions. | Although the SELECT statement for a dynamic table can contain joins, aggregations, window functions, and other SQL functions and constructions, the statement cannot contain calls to stored procedures and tasks. Currently, the SELECT statement also cannot contain calls to external functions.  This limitation is due to the way in which dynamic tables are refreshed. To refresh the data, an automated process analyzes the SELECT statement for the dynamic table in order to determine the best approach to refresh the data. The automated process cannot determine this for certain types of queries.  For the complete list of restrictions on the SELECT statement, see [Supported queries in incremental and full refresh modes](dynamic-tables-supported-queries.md) and [General limitations](dynamic-tables-limitations.md). |
| Data freshness | Tasks can use streams to refresh data in target tables incrementally. You can schedule these tasks to run on a regular basis. | An automated refresh process performs incremental refreshes of dynamic tables on a regular basis. The process determines the schedule based on the target “freshness” of the data that you specify. |

### Example: Comparison of data transformation between streams and tasks and dynamic tables

The example in [Transform loaded JSON data on a schedule](data-pipelines-examples.md) uses streams and tasks to transform and insert new data into a target
table (`names`) as the data is streamed into a landing table (`raw`).

The following examples demonstrate how to perform the same transformation using dynamic tables. When creating a dynamic table,
you specify the query for the results that you want to see. For the incremental refresh of the data, you don’t need to create a
stream to track changes and write a task to examine those changes and apply the changes to the target table. The automated refresh
process does this for you based on the query that you specify.

| SQL Statements for Streams and Tasks | SQL Statements for Dynamic Tables |
| --- | --- |
| ```sqlexample -- Create a landing table to store -- raw JSON data. CREATE OR REPLACE TABLE raw   (var VARIANT);  -- Create a stream to capture inserts -- to the landing table. CREATE OR REPLACE STREAM rawstream1   ON TABLE raw;  -- Create a table that stores the -- names of office visitors from the -- raw data. CREATE OR REPLACE TABLE names   (id INT,    first_name STRING,    last_name STRING);  -- Create a task that inserts new name -- records from the rawstream1 stream -- into the names table. -- Execute the task every minute when -- the stream contains records. CREATE OR REPLACE TASK raw_to_names   WAREHOUSE = mywh   SCHEDULE = '1 minute'   WHEN     SYSTEM$STREAM_HAS_DATA('rawstream1')   AS     MERGE INTO names n       USING (         SELECT var:id id, var:fname fname, var:lname lname,                 metadata$action, metadata$isupdate                   FROM rawstream1       ) r1 ON n.id = TO_NUMBER(r1.id)       WHEN MATCHED AND metadata$action = 'DELETE'             AND NOT metadata$isupdate THEN           DELETE       WHEN MATCHED AND metadata$action = 'INSERT' THEN         UPDATE SET n.first_name = r1.fname, n.last_name = r1.lname       WHEN NOT MATCHED AND metadata$action = 'INSERT' THEN         INSERT (id, first_name, last_name)           VALUES (r1.id, r1.fname, r1.lname);  -- Start the task ALTER TASK raw_to_names RESUME; ``` | ```sqlexample -- Create a landing table to store -- raw JSON data. CREATE OR REPLACE TABLE raw   (var VARIANT);  -- Create a dynamic table containing the -- names of office visitors from -- the raw data. -- Try to keep the data up to date within -- 1 minute of real time. CREATE OR REPLACE DYNAMIC TABLE names   TARGET_LAG = '1 minute'   WAREHOUSE = mywh   AS     SELECT var:id::int id, var:fname::string first_name,     var:lname::string last_name FROM raw; ``` |

## Dynamic tables compared to materialized views

Dynamic tables have some similarities to materialized views in that both materialize the results of a query.
However, there are important differences:

| Key Characteristics | Materialized Views | Dynamic Tables |
| --- | --- | --- |
| Query performance | Materialized views are designed to improve query performance transparently.  For example, if you query the base table, the query optimizer in Snowflake can rewrite the query automatically to query the materialized view instead. | Dynamic tables are designed to build multi-level data pipelines.  Although dynamic tables can improve query performance, the query optimizer in Snowflake does not automatically rewrite queries to use dynamic tables. A dynamic table is used in a query only if you specify the dynamic table in the query. |
| Query complexity | A materialized view can only use a single base table. A materialized view cannot be based on a complex query (that is, a query with joins or nested views). | A dynamic table can be based on a complex query, including one with joins and unions. |
| Data freshness | Data accessed through materialized views is [always current](views-materialized.md). If a DML operation changes the data in the base table, Snowflake either updates the materialized view or uses the updated data from the base table. | The data is current up to the target lag time for the dynamic table.  Dynamic table maintenance and refresh is automatically managed by a separate compute service, including refresh logic, along with the compute for any updates, typically at additional cost. For more information, see [Understanding costs for dynamic tables](dynamic-tables-cost.md). |

---
title: Enable automatic table schema evolution
source: https://docs.snowflake.com/en/user-guide/data-load-schema-evolution.md
section: User Guide
---

# Enable automatic table schema evolution

Semi-structured data tends to evolve over time. Systems that generate data add new columns to accommodate additional information, which requires downstream tables to evolve accordingly.

The structure of tables in Snowflake can evolve automatically to support the structure of new data received from the data sources. Snowflake supports the following:

> * Automatically adding new columns.
> * Automatically dropping the NOT NULL constraint from columns that are missing in new data files.

To enable table schema evolution, do the following:

> * If you are creating a new table, set the `ENABLE_SCHEMA_EVOLUTION` parameter to TRUE when you use the [CREATE TABLE](../sql-reference/sql/create-table.md) command.
> * For an existing table, modify the table using the [ALTER TABLE](../sql-reference/sql/alter-table.md) command and set the `ENABLE_SCHEMA_EVOLUTION` parameter to TRUE.

Loading data from files evolves the table columns when all of the following are true:

> * The Snowflake table has the `ENABLE_SCHEMA_EVOLUTION` parameter set to TRUE.
> * The [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement uses the `MATCH_BY_COLUMN_NAME` option.
> * The role used to load the data has the EVOLVE SCHEMA or OWNERSHIP privilege on the table.

Additionally, for schema evolution with CSV, when used with `MATCH_BY_COLUMN_NAME` and `PARSE_HEADER`, `ERROR_ON_COLUMN_COUNT_MISMATCH` must be set to false.

Schema evolution is a standalone feature but can be used in conjunction with the [schema detection support for retrieving the column definitions](data-load-overview.md) from a set of files in cloud storage. In combination, these features enable continuous data pipelines to create new tables from a set of data files in cloud storage and then modify columns of the tables as the schema of new source data files evolves with column additions or deletions.

## Usage notes

> * This feature supports Apache Avro, Apache Parquet, CSV, JSON, and ORC files.
> * This feature is limited to [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statements and Snowpipe data loads. INSERT operations cannot evolve the target table schema automatically.
> * [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) data loads using the Snowflake Ingest SDK directly are not supported with schema evolution. [The Kafka connector with Snowpipe Streaming](snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md) supports schema detection and evolution.
> * By default, this feature is limited to adding a maximum of 100 columns or evolving no more than 1 schema per COPY operation. To request more than 100 added columns or 1 schema per COPY operation, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
> * There is no limit on dropping NOT NULL column constraints.
> * Schema evolution is tracked by the `SchemaEvolutionRecord` output in the following views and commands: [INFORMATION_SCHEMA COLUMNS View](../sql-reference/info-schema/columns.md), [ACCOUNT_USAGE COLUMNS View](../sql-reference/account-usage/columns.md), [DESCRIBE TABLE command](../sql-reference/sql/desc-table.md), and [SHOW COLUMNS command](../sql-reference/sql/show-columns.md).
>
>   However, for the Kafka connector with Snowpipe Streaming, schema evolution is not tracked by the `SchemaEvolutionRecord` output. The `SchemaEvolutionRecord` output always shows NULL.
> * When a column is manually renamed or modified after a schema evolution, the schema evolution record will be cleared.
> * Schema evolution isn’t supported by [tasks](tasks-intro.md).

## Schema evolution support: Ingestion method comparison

The specific metadata field `SchemaEvolutionRecord` is used to track schema evolution. You can view this field with the [INFORMATION_SCHEMA.COLUMNS View](../sql-reference/info-schema/columns.md), [DESCRIBE TABLE command](../sql-reference/sql/desc-table.md), and [SHOW COLUMNS command](../sql-reference/sql/show-columns.md).

The following table summarizes schema evolution support and the corresponding `SchemaEvolutionRecord` tracking behavior across different Snowflake ingestion methods:

| Ingestion method | Architecture or context | Schema evolution support status | SchemaEvolutionRecord tracking behavior |
| --- | --- | --- | --- |
| File-based (batch/micro-batch) | [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command | Fully supported | Visible in tracking views/commands. |
| File-based (batch/micro-batch) | [Snowpipe](data-load-snowpipe-auto.md), using automated loading | Fully supported | Visible in tracking views or commands. |
| Streaming at the row level | [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) (High-performance architecture) | Fully supported | Visible in tracking views or commands. |
| Streaming at the row level | Snowpipe Streaming with classic architecture; for example, Kafka connector | Only [the classic architecture with Kafka connector](snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md) is supported, and tracking is limited. | Always shows NULL in tracking views or commands. |

## Examples

The following example creates a table with column definitions derived from a set of Parquet data. With automatic table schema evolution enabled for the table, further data loads from Parquet files with additional name/value pairs automatically add columns to the table:

Note that the `mystage` stage and `my_parquet_format` file format referenced in the statement must already exist. A set of files must
already be staged in the cloud storage location referenced in the stage definition.

This example builds on an example in the [INFER_SCHEMA](../sql-reference/functions/infer_schema.md) topic:

> ```sqlexample
> -- Create table t1 in schema d1.s1, with the column definitions derived from the staged file1.parquet file.
> USE SCHEMA d1.s1;
>
> CREATE OR REPLACE TABLE t1
>   USING TEMPLATE (
>     SELECT ARRAY_AGG(object_construct(*))
>       FROM TABLE(
>         INFER_SCHEMA(
>           LOCATION=>'@mystage/file1.parquet',
>           FILE_FORMAT=>'my_parquet_format'
>         )
>       ));
>
> -- Row data in file1.parquet.
> +------+------+------+
> | COL1 | COL2 | COL3 |
> |------+------+------|
> | a    | b    | c    |
> +------+------+------+
>
> -- Describe the table.
> -- Note that column c2 is required in the Parquet file metadata. Therefore, the NOT NULL constraint is set for the column.
> DESCRIBE TABLE t1;
> +------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
> | name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
> |------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
> | COL1 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
> | COL2 | VARCHAR(16777216) | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
> | COL3 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
> +------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
>
> -- Use the SECURITYADMIN role or another role that has the global MANAGE GRANTS privilege.
> -- Grant the EVOLVE SCHEMA privilege to any other roles that could insert data and evolve table schema in addition to the table owner.
>
> GRANT EVOLVE SCHEMA ON TABLE d1.s1.t1 TO ROLE r1;
>
> -- Enable schema evolution on the table.
> -- Note that the ENABLE_SCHEMA_EVOLUTION property can also be set at table creation with CREATE OR REPLACE TABLE
> ALTER TABLE t1 SET ENABLE_SCHEMA_EVOLUTION = TRUE;
>
> -- Load a new set of data into the table.
> -- The new data drops the NOT NULL constraint on the col2 column.
> -- The new data adds the new column col4.
> COPY INTO t1
>   FROM @mystage/file2.parquet
>   FILE_FORMAT = (type=parquet)
>   MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE;
>
> -- Row data in file2.parquet.
> +------+------+------+
> | col1 | COL3 | COL4 |
> |------+------+------|
> | d    | e    | f    |
> +------+------+------+
>
> -- Describe the table.
> DESCRIBE TABLE t1;
> +------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | schema evolution record                                                                                                                                                                  |
> |------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> | COL1 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL                                                                                                                                                                                     |
> | COL2 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | {"evolutionType":"DROP_NOT_NULL","evolutionMode":"COPY","fileName":"file2.parquet","triggeringTime":"2024-03-15 23:52:59.514000000Z","queryId":"01b303b8-0808-c9ed-0000-0971491b5932"}   |
> | COL3 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL                                                                                                                                                                                     |
> | COL4 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | {"evolutionType":"ADD_COLUMN","evolutionMode":"COPY","fileName":"file2.parquet","triggeringTime":"2024-03-15 23:52:59.514000000Z","queryId":"01b303b8-0808-c9ed-0000-0971491b5932"}      |
> +------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> -- Note that since MATCH_BY_COLUMN_NAME is set as CASE_INSENSITIVE, all column names are retrieved as uppercase letters.
> ```

---
title: Enable credential vending for an external catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/enable-credential-vending-external-catalog.md
section: User Guide
---

# Enable credential vending for an external catalog

With credential vending, you can use Snowflake Open Catalog to configure and manage access control to a catalog and its underlying cloud
storage in a single place. For internal catalogs, credential vending is enabled by default. For external catalogs, you can enable credential
vending by using one of the following options:

* Using Open Catalog
* Using the Apache Polaris™ (Incubating) CLI

> **Important:**
>
> Before you enable credential vending for an external catalog, ensure that your tables in the catalog don’t have overlapping storage directory
> locations. Otherwise, a user could gain access to tables that they shouldn’t have permission to access. For more information, see
> [Credential vending for external catalogs](overview.md).

## Using Open Catalog

1. Sign in to Open Catalog.
2. In the menu on the left, select **Catalogs**.
3. In the list of catalogs, select the catalog for which you want to enable credential vending.
4. On the **Catalog Details** tab, under Storage details, under Credential Vending, select the **Edit** icon.
5. From the popup that appears, select **Enable**.

## Apache Polaris™ (Incubating) CLI

This section describes how to enable credential vending for an external catalog by using the Polaris CLI. The Apache Polaris (Incubating) CLI
is a command line interface for customers to programmatically update settings. For more information, see
[Apache Polaris (Incubating) CLI](https://polaris.apache.org/in-dev/unreleased/command-line-interface/).

To enable credential vending for an external catalog, use a service connection with the Polaris CLI.

### Step 1: Prepare a service connection with the necessary privileges

1. [Create a principal role](create-principal-role.md) to assign to the new service connection. Skip this step if you already have a
   principal role to assign to the service connection.
2. [Configure a service connection](configure-service-connection.md) and save the Client ID and
   Client Secret to use later with the Polaris CLI. Skip this step if you already have a service connection to use the Polaris CLI.
3. [Create a catalog role](create-catalog-role.md) in the target catalog(s) to grant it with the privileges needed to enable
   credential vending. Skip this step if you already have a catalog role to use for enabling credential vending.
4. [Secure the target catalog](secure-catalogs.md). When securing it, ensure the catalog role has at least one of the following privileges granted to it:

   * CATALOG_MANAGE_CONTENT
   * CATALOG_MANAGE_METADATA
   * CATALOG_WRITE_PROPERTIES

   To enable credential vending, the service principal used to perform this operation must have one of these privileges granted to it.

### Step 2: Run the CLI command

To run the CLI command, see the applicable steps for your environment:

* Run the CLI command as a Linux or Mac user
* Run the CLI command as a Windows user

#### Run the CLI command as a Linux or Mac user

##### Prerequisites

Before you run the CLI command, you should meet the following prerequisites for your environment:

* [Python 3.x](https://www.python.org/downloads/)
* [Git](https://git-scm.com/)

To run the CLI command in Linux or Mac, follow these steps:

1. Clone the [Apache Polaris™](https://github.com/apache/polaris) GitHub repository by running the following command:

   ```sql
   git clone https://github.com/apache/polaris.git
   ```

   For instructions on how to clone a GitHub repository, see [Cloning a repository](https://docs.github.com/en/repositories/creating-and-managing-repositories/cloning-a-repository).
2. Define the following environment variables for the CLI command:

   ```console
   export CLIENT_ID=<client-id>
   export CLIENT_SECRET=<client-secret>
   export sfAccountUrl=https://<open_catalog_account_identifier>.snowflakecomputing.com
   export catalogName=<my-catalog>
   ```

   Where:

   * `sfAccountUrl` is the following URL: `https://<open_catalog_account_identifier>.snowflakecomputing.com`. For `<open_catalog_account_identifier>`,
     specify the account identifier for your Open Catalog account. Depending on the region and cloud platform for the account, this identifier might
     be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see
     [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier).
   * `CLIENT_ID` is the client_id for your service connection that you saved.
   * `CLIENT_SECRET` is the client_secret for your service connection that you saved.
   * `catalogName` is the name of the external catalog you want to enable credential vending for.
3. From the directory that you cloned the Apache Polaris repo into, run the Polaris CLI:

   ```console
   ./polaris \
     --base-url "${sfAccountUrl}/polaris" \
     --client-id ${CLIENT_ID} \
     --client-secret ${CLIENT_SECRET} \
     catalogs \
     update "${catalogName}" \
     --set-property "enable.credential.vending"="true"
   ```

#### Run the CLI command as a Windows user

##### Prerequisites

To run the following code, you need Docker installed in the Windows machine.

1. Create a Dockerfile using the following code example:

   ```docker
   FROM python:3.11

   # install git
   RUN apt-get update && apt-get install -y git

   # get polaris
   RUN git clone https://github.com/apache/polaris.git

   WORKDIR /polaris

   RUN pip install --upgrade pip

   # install polaris cli
   RUN ./polaris --help
   ```
2. From the folder where the Dockerfile is located, build the docker image for Polaris CLI with the following command:

   ```console
   % docker build -t polaris-cli .                                                                                                                 0.0s
   ```
3. Run the docker container and bash terminal with the following command:

   ```shell
   % docker run --rm -it polaris-cli /bin/bash
   root@ae4c8353b45f:/polaris#
   ```
4. Run the following code to update the catalog to set the property `enable.credential.vending` to `true`:

   ```shell
   % docker run --rm -it polaris-cli /bin/bash
   root@ae4c8353b45f:/polaris# export CLIENT_ID=<client-id>
   export CLIENT_SECRET=<client-secret>
   export sfAccountUrl=https://<open_catalog_account_identifier>.snowflakecomputing.com
   export catalogName=<my-catalog>
   root@ae4c8353b45f:/polaris# ./polaris \
     --base-url "${sfAccountUrl}/polaris" \
     --client-id ${CLIENT_ID} \
     --client-secret ${CLIENT_SECRET} \
     catalogs \
     update "${catalogName}" \
     --set-property "enable.credential.vending"="true"
   ```
5. Run the following code to validate that the parameter `enable.credential.vending` was configured correctly:

   ```shell
   root@ae4c8353b45f:/polaris# ./polaris \
     --base-url "${sfAccountUrl}/polaris" \
     --client-id ${CLIENT_ID} \
     --client-secret ${CLIENT_SECRET} \
     catalogs \
     get "${catalogName}"
   {"type": "EXTERNAL", "name": "<my-catalog>", "properties": {"default-base-location": "s3://<bucket-name>/polaris/my-catalog-v2-storage/", "enable.credential.vending": "true"}, "createTimestamp": 1722547448827, "lastUpdateTimestamp": 1730906335286, "entityVersion": 3, "storageConfigInfo": {"storageType": "S3", "allowedLocations": ["s3://<bucket-name>/polaris/my-catalog-v2-storage/"], "roleArn": "arn:aws:iam::<aws-account-id>:role/<polaris-aws-role>"}}
   ```

---
title: Enable non-ACCOUNTADMIN roles to perform data sharing tasks
source: https://docs.snowflake.com/en/user-guide/security-access-privileges-shares.md
section: User Guide
---

# Enable non-ACCOUNTADMIN roles to perform data sharing tasks

This topic lists the minimum privileges required to perform SQL actions related to shares.

By default, the privileges required to create and manage shares are granted only to the ACCOUNTADMIN role, ensuring that only account
administrators can perform these tasks. However, the privileges can also be granted to other roles, enabling the tasks to be delegated to
other users in the account.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

> **Note:**
>
> If you grant sharing privileges to other users in the account, make sure that the user profiles for those other users includes a first
> name, last name, and an email address. To modify the user profile in Snowsight, see [Add user details to your user profile](ui-snowsight-profile.md).

## Data providers

Data providers can choose either of the following options to add objects to a share:

* **Option 1:** Create a database role in a database, grant privileges on objects to the database role, and then grant the database role to
  the share.
* **Option 2:** Grant privileges on the database and database objects directly to the share.

For more information on these options, see [How to share database objects](data-sharing-gs.md).

The minimum privileges required to create and manage shares in a data provider or data consumer account depend on which option was used.

Option 1:
:   | Action | Privilege | Object | Notes |
    | --- | --- | --- | --- |
    | Create shares. | CREATE SHARE | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
    | Create database roles in a database. | CREATE DATABASE ROLE | Database | Only the database owner role (i.e. the role that has the OWNERSHIP privilege on the database) has this privilege by default. The privilege can be granted to additional roles as needed. |

Option 2:
:   | Action | Privilege | Object | Notes |
    | --- | --- | --- | --- |
    | Create shares. | CREATE SHARE | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
    | Grant or revoke privileges on objects to or from a share. | OWNERSHIP | Share | This role must also have, at a minimum, the following privileges on the database objects with the grant option:   * USAGE on the database * USAGE on the schema * SELECT on any tables, external tables, secure views, or secure materialized views * USAGE on any secure UDFs |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

> **Attention:**
>
> Granting CREATE SHARE to other roles makes managing shares more flexible, but also allows users with these roles to expose any objects
> they own (or on which they have the necessary privileges) to other accounts. This is particularly important to note if you are sharing
> data from an account that contains sensitive or proprietary data.
>
> Take this into consideration before granting CREATE SHARE to other roles.

### Blocking access to objects in a share

Access to objects in a share can be blocked by either the role that owns share or the role that owns the objects:

* If your role owns the share, you can block access by revoking privileges on the objects from the share.
* If your role does not own the share, but owns the objects in the share, you can block access by revoking the USAGE or SELECT privileges
  with CASCADE on the objects from the share owner.

> **Note:**
>
> Ownership of a share, as well as the objects in the share, may be either through a direct grant to the role or inherited from a
> lower-level role in the role hierarchy. For more details, see
> [Role hierarchy and privilege inheritance](security-access-control-overview.md).
>
> It is possible for the same role to own a share and the objects in the share.

## Data consumers

In a consumer account, the global IMPORT SHARE privilege enables viewing the inbound shares shared with the account. The privilege also
permits creating databases from inbound shares if the role is also granted the global CREATE DATABASE privilege.

### IMPORT SHARE privilege

If the IMPORT SHARE privilege is granted to a role, any user with the role can perform the following tasks:

* View all INBOUND shares (shared by provider accounts).
* View all OUTBOUND shares owned by the role.
* Create databases from inbound shares if the role is also granted the global CREATE DATABASE privilege

### Granting the privilege to another role

To grant the global IMPORT SHARE privilege to a non-ACCOUNTADMIN role in a consumer account, use the ACCOUNTADMIN role and the
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command.

For example, to grant the privilege to the SYSADMIN role:

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT IMPORT SHARE ON ACCOUNT TO SYSADMIN;
```

---
title: Enabling and disabling search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/enabling.md
section: User Guide
---

# Enabling and disabling search optimization

To enable search optimization, use a role that has the necessary privileges, then enable it for an entire table or
specific columns using the [ALTER TABLE](../../sql-reference/sql/alter-table.md) …
[ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command.

## Required access control privileges

To add, configure, or remove search optimization for a table, you must:

* Have OWNERSHIP privilege on the table.
* Have ADD SEARCH OPTIMIZATION privilege on the schema that contains the table. To grant this privilege:

  ```sqlsyntax
  GRANT ADD SEARCH OPTIMIZATION ON SCHEMA <schema_name> TO ROLE <role>
  ```

To use the search optimization service for a query, you just need the SELECT privilege on the table.

You don’t need any additional privileges. Because SEARCH OPTIMIZATION is a table property, it is automatically
detected and used (if appropriate) when querying a table.

## Configuring search optimization

> **Note:**
>
> Adding search optimization to a large table (a table containing terabytes (TB) or more of data) might result in an immediate
> increase in credit consumption over a short period of time.
>
> When you add search optimization to a table, the maintenance service immediately starts building the search access paths for the
> table in the background. If the table is large, the maintenance service might massively parallelize this work, which can result
> in increased costs over a short period of time.
>
> Before you add search optimization to a large table,
> [get an estimate of these costs](cost-estimation.md) so that you know what to expect.

When you enable search optimization, you have a choice of enabling it for a whole table or for specific columns in the
table.

* Enabling search optimization for a whole table enables it for point-lookup queries on all eligible columns.

  To enable search optimization for a whole table, use the [ALTER TABLE](../../sql-reference/sql/alter-table.md) …
  [ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command *without* the ON clause.
* Enabling search optimization for specific columns avoids spending credits on creating search access paths for columns
  that you don’t often use in queries, and also allows you to select additional types of queries to be optimized for
  each column, potentially further increasing performance.

  To enable search optimization for specific columns, specifying the types of queries to be optimized, use the
  ON clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command.
  In the ON clause in ADD SEARCH OPTIMIZATION, you specify which columns should be enabled for search optimization. When enabling
  search optimization for a given column, you can also specify a search method (for example, EQUALITY for equality and IN searches,
  GEO for GEOGRAPHY searches, or SUBSTRING for substring searches). You can enable more than one search method on the same column.
* You can enable search optimization for a whole Apache Iceberg™ table or for specific columns in the table by using the
  [ALTER ICEBERG TABLE](../../sql-reference/sql/alter-iceberg-table.md) …
  [ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-iceberg-table.md) command.

In general, enabling search optimization only for specific columns is the best practice.

The following sections explain how to configure search optimization for a table:

* Enabling search optimization for specific columns
* Enabling search optimization for an entire table

After you have configured search optimization, you can inspect your configuration to make sure it is correct.

* Verifying that a table is configured for search optimization

You can remove search optimization from specific columns or whole tables when you have discovered that search
optimization does not provide enough benefit.

* Removing search optimization from specific columns or the entire table

## Enabling search optimization for specific columns

To configure search optimization for a specific column, use the
[ALTER TABLE](../../sql-reference/sql/alter-table.md) …
[ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command with the ON clause.

> **Note:**
>
> When running this command, use a role that has
> the privileges to add search optimization to the table.

The ON clause specifies that you want to configure search optimization for specific columns. For details on the syntax, see
[the section on ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md).

> **Note:**
>
> If you just want to apply search optimization for equality and IN predicates to all applicable columns in the table, see
> Enabling search optimization for an entire table.

After running this command, you can
verify that the columns have been configured for search optimization.

The next sections contain examples that demonstrate how to specify the configuration for search optimization:

* Example: Full-text search optimization on specific columns
* Example: Supporting equality and IN predicates for specific columns
* Example: Supporting equality and IN predicates for all applicable columns
* Example: Supporting different types of predicates
* Example: Supporting different predicates on the same column
* Example: Supporting equality and IN predicates for an element in a VARIANT
* Example: Supporting geospatial functions

### Example: Full-text search optimization on specific columns

You can perform text searches by using the [SEARCH](../../sql-reference/functions/search.md) and [SEARCH_IP](../../sql-reference/functions/search_ip.md)
functions. To improve query execution performance when these functions are used, enable FULL_TEXT search optimization. You can
enable FULL_TEXT search optimization on a table by using different subsets of the columns in the table and different text
analyzers. For information about the behavior of different analyzers, see [How search terms are tokenized](../../sql-reference/functions/search.md).

Enable FULL_TEXT search optimization on a set of columns in a table by using the following syntax.

```sqlsyntax
ALTER TABLE <name> ADD SEARCH OPTIMIZATION
  ON FULL_TEXT( { * | <col1> [ , <col2>, ... ] } [ , ANALYZER => '<analyzer_name>' ]);
```

The columns you specify must be VARCHAR, VARIANT, ARRAY, or OBJECT columns. Columns with other data types aren’t supported.
In addition, you can specify individual [paths](../querying-semistructured.md) to columns of type VARIANT,
ARRAY, or OBJECT.

You can specify the wildcard asterisk character (`*`) instead of a list of columns. In this case, the optimization is
automatically enabled on all the columns of supported types.

If specified, the [ANALYZER => 'analyzer_name'](../../sql-reference/functions/search.md) argument must be one of the choices that is documented for the
SEARCH function. If you don’t specify an analyzer, the DEFAULT_ANALYZER is used.

> **Note:**
>
> For query execution with the SEARCH function to be optimized, the analyzer specified for the search optimization
> in the ALTER TABLE command must be the same as the analyzer specified in the SEARCH function call. If the analyzers don’t
> match, the search access path won’t be selected.

This example enables FULL_TEXT search optimization on three VARCHAR columns that might be the targets of a
SEARCH query.

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(play, character, line);
```

To describe the search optimization configuration for this table, run the following command:

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+-----------+------------------+--------+
| expression_id | method                     | target    | target_data_type | active |
|---------------+----------------------------+-----------+------------------+--------|
|             1 | FULL_TEXT DEFAULT_ANALYZER | PLAY      | VARCHAR(50)      | true   |
|             2 | FULL_TEXT DEFAULT_ANALYZER | CHARACTER | VARCHAR(30)      | true   |
|             3 | FULL_TEXT DEFAULT_ANALYZER | LINE      | VARCHAR(2000)    | true   |
+---------------+----------------------------+-----------+------------------+--------+
```

For more information, see Displaying the search optimization configuration for a table.

This example enables FULL_TEXT search optimization on a VARCHAR column that might be the target of a
SEARCH_IP query.

```sqlexample
ALTER TABLE ipt ADD SEARCH OPTIMIZATION ON FULL_TEXT(ip1, ANALYZER => 'ENTITY_ANALYZER');
```

To remove the search optimization configuration, run one of the following commands:

```sqlexample
ALTER TABLE lines DROP SEARCH OPTIMIZATION
  ON FULL_TEXT(play, character, line);
```

```sqlexample
ALTER TABLE lines DROP SEARCH OPTIMIZATION
  ON play, character, line;
```

```sqlexample
ALTER TABLE lines DROP SEARCH OPTIMIZATION
  ON 1, 2, 3;
```

In the third ALTER TABLE … DROP SEARCH OPTIMIZATION command, `1, 2, 3` refers to the expression IDs
returned by the DESCRIBE command.

You can also modify a FULL_TEXT search optimization configuration by dropping a subset of the columns (by name or
expression ID). For more information, see Removing search optimization from specific columns or the entire table.

For more examples that enable and drop FULL_TEXT search optimization, see
[Examples of ADD (and DROP) FULL_TEXT search optimization](text-queries.md).

### Example: Supporting equality and IN predicates for specific columns

To optimize searches with equality predicates for the columns `c1`, `c2`, and `c3` in the table `t1`, execute the
following statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2, c3);
```

You can also specify the same search method more than once in the ON clause:

```sqlexample
-- This statement is equivalent to the previous statement.
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1), EQUALITY(c2, c3);
```

### Example: Supporting equality and IN predicates for all applicable columns

To optimize searches with equality predicates for all applicable columns in the table, execute the following statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(*);
```

Note the following:

* As explained in the
  [description of the syntax for the search method and target](../../sql-reference/sql/alter-table-event-table.md),
  for a given method, you cannot specify both an asterisk and specific columns.
* Although omitting the ON clause also configures search optimization for equality and IN predicates on all applicable
  columns in the table, there are differences between specifying and omitting the ON clause. See
  Enabling search optimization for an entire table.

### Example: Supporting different types of predicates

To optimize searches with equality predicates for the column `c1` and `c2` and substring searches for the column `c3`,
execute the following statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2), SUBSTRING(c3);
```

### Example: Supporting different predicates on the same column

To optimize searches for both equality predicates and substring predicates on the same column, `c1`, execute the following statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1), SUBSTRING(c1);
```

### Example: Supporting equality and IN predicates for an element in a VARIANT

To optimize searches with equality predicates on the VARIANT element `uuid` nested in the element `user` in the VARIANT column
`c4`, execute the following statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c4:user.uuid);
```

### Example: Supporting geospatial functions

To optimize searches with predicates that use geospatial functions with GEOGRAPHY objects in the `c1` column, execute the following
statement:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON GEO(c1);
```

## Enabling search optimization for an entire table

To specify EQUALITY for all columns of the supported data types (except for
[semi-structured](../../sql-reference/data-types-semistructured.md) and [GEOGRAPHY](../../sql-reference/data-types-geospatial.md)),
use the [ALTER TABLE](../../sql-reference/sql/alter-table.md) …
[ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command without the ON clause.

> **Note:**
>
> When running this command, use a role that has
> the privileges to add search optimization to the table.

For example:

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION;
```

For more information on the syntax, see
[the section on search optimization in ALTER TABLE](../../sql-reference/sql/alter-table.md).

After running this command, you can
verify that the columns have been configured for search optimization.

### Effect on subsequently added columns

After you run ALTER TABLE … ADD SEARCH OPTIMIZATION command without the ON clause, any columns that are subsequently added to the table
will also be configured for optimization on EQUALITY.

However, if you execute ALTER TABLE … { ADD | DROP } SEARCH OPTIMIZATION with the ON clause on the same table, any
columns that are subsequently added to the table won’t be configured for EQUALITY automatically. You must execute
ALTER TABLE … ADD SEARCH OPTIMIZATION ON … to configure these newly added columns for EQUALITY.

## Verifying that a table is configured for search optimization

To verify that the table and its columns have been configured for search optimization:

1. Display the search optimization configuration for the
   table and its columns.
2. Run the [SHOW TABLES](../../sql-reference/sql/show-tables.md) command to verify that search optimization has been added and to determine how
   much of the table has been optimized.

   For example:

   ```sqlexample
   SHOW TABLES LIKE '%test_table%';
   ```

   In the output from this command:

   * Verify that SEARCH_OPTIMIZATION is `ON`, which indicates that search optimization has been added.
   * Check the value of SEARCH_OPTIMIZATION_PROGRESS. This specifies the percentage of the table that has been optimized so far.

     When search optimization is first added to a table, the performance benefits don’t appear immediately.
     The search optimization service starts populating data in the background. The benefits appear increasingly as
     the maintenance catches up to the current state of the table.

     Before you run a query to verify that search optimization is working, wait until this shows that the table has been fully
     optimized.
3. Run a query to verify that search optimization is working.

   Note that the Snowflake optimizer automatically chooses when to use the search optimization service for a particular query.
   Users cannot control which queries search optimization is used for.

   Choose a query that the search optimization service is designed to optimize.
   See [Identifying queries that can benefit from search optimization](queries-that-benefit.md).
4. In the web UI, view the query plan for this query, and verify that the query node “Search Optimization Access” is part of the
   query plan.

## Displaying the search optimization configuration for a table

To display the search optimization configuration for a table, use the [DESCRIBE SEARCH OPTIMIZATION](../../sql-reference/sql/desc-search-optimization.md) command.

For example, suppose that you execute the following statement to configure search optimization for a column:

```sqlexample
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1);
```

Executing DESCRIBE SEARCH OPTIMIZATION produces the following output:

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON t1;
```

```output
+---------------+----------+--------+------------------+--------+
| expression_id |  method  | target | target_data_type | active |
+---------------+----------+--------+------------------+--------+
| 1             | EQUALITY | C1     | NUMBER(38,0)     | true   |
+---------------+----------+--------+------------------+--------+
```

## Removing search optimization from specific columns or the entire table

You can remove the search optimization configuration for specific columns, or you can remove the SEARCH OPTIMIZATION property
from the entire table.

* Dropping search optimization for specific columns
* Removing search optimization from the table

### Dropping search optimization for specific columns

To drop the search optimization configuration for specific columns, use the following command:
[ALTER TABLE](../../sql-reference/sql/alter-table.md) …
[DROP SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command with the ON clause.

For example, suppose that executing the DESCRIBE SEARCH OPTIMIZATION command prints the following expressions:

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON t1;
```

```output
+---------------+-----------+-----------+-------------------+--------+
| expression_id |  method   | target    | target_data_type  | active |
+---------------+-----------+-----------+-------------------+--------+
|             1 | EQUALITY  | C1        | NUMBER(38,0)      | true   |
|             2 | EQUALITY  | C2        | VARCHAR(16777216) | true   |
|             3 | EQUALITY  | C4        | NUMBER(38,0)      | true   |
|             4 | EQUALITY  | C5        | VARCHAR(16777216) | true   |
|             5 | EQUALITY  | V1        | VARIANT           | true   |
|             6 | SUBSTRING | C2        | VARCHAR(16777216) | true   |
|             7 | SUBSTRING | C5        | VARCHAR(16777216) | true   |
|             8 | GEO       | G1        | GEOGRAPHY         | true   |
|             9 | EQUALITY  | V1:"key1" | VARIANT           | true   |
|            10 | EQUALITY  | V1:"key2" | VARIANT           | true   |
+---------------+-----------+-----------+-------------------+--------+
```

To drop search optimization for substrings on the column `c2`, execute the following statement:

```sqlexample
ALTER TABLE t1 DROP SEARCH OPTIMIZATION ON SUBSTRING(c2);
```

To drop search optimization for all methods on the column `c5`, execute the following statement:

```sqlexample
ALTER TABLE t1 DROP SEARCH OPTIMIZATION ON c5;
```

Because the column `c5` is configured to optimize equality and substring searches, the statement above drops the configuration
for equality and substring searches for `c5`.

To drop search optimization for equality on the column `c1` and to drop the configuration specified by the expression IDs `6`
and `8`, execute the following statement:

```sqlexample
ALTER TABLE t1 DROP SEARCH OPTIMIZATION ON EQUALITY(c1), 6, 8;
```

For more information on the syntax, see
[the section on ALTER TABLE … DROP SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md).

### Removing search optimization from the table

To remove the SEARCH OPTIMIZATION property from a table:

1. Switch to a role that has the privileges to remove search optimization from the table.
2. Run the [ALTER TABLE](../../sql-reference/sql/alter-table.md) …
   [DROP SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command without the ON clause:

   ```sqlsyntax
   ALTER TABLE [IF EXISTS] <table_name> DROP SEARCH OPTIMIZATION;
   ```

   For example:

   ```sqlexample
   ALTER TABLE test_table DROP SEARCH OPTIMIZATION;
   ```

For more information, see
[the section on ALTER TABLE … DROP SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md).

---
title: Enabling Snowpipe error notifications for Amazon SNS
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-errors-sns.md
section: User Guide
---

# Enabling Snowpipe error notifications for Amazon SNS

This topic provides instructions for pushing Snowpipe error notifications to the [Amazon Simple Notification Service](https://docs.aws.amazon.com/sns/) (SNS) service. SNS is a publish/subscribe messaging service.

This feature can push error notifications for the following types of loads:

* Auto-ingest Snowpipe.
* Calls to the Snowpipe `insertFiles` REST API endpoint.
* Loads from Apache Kafka using the Snowflake Connector for Kafka with the Snowpipe ingestion method only.

## Cloud platform support

Currently, this feature is limited to Snowflake accounts hosted on Amazon Web Services (AWS). Snowpipe can load data from files in any supported cloud storage service; however, push notifications to SNS are only supported in Snowflake accounts hosted on AWS.

## Notes

* This feature is implemented using the notification integration object. A notification integration is a Snowflake object that provides an
  interface between Snowflake and third-party cloud message queuing services. A single notification integration can support multiple pipes.
* Snowflake guarantees at-least-once message delivery of error notifications (i.e. multiple attempts are made to deliver messages to ensure at least one attempt succeeds, which can result in duplicate messages).

## Enabling error notifications

### Creating the notification integration

See [Creating a notification integration to send notifications to an Amazon SNS topic](notifications/creating-notification-integration-amazon-sns.md).

### Enabling error notifications in pipes

A single notification integration can be shared by multiple pipes. The body of error messages identifies the pipe, external stage and path,
and file where the error originated, among other details.

To enable error notifications for a pipe, specify an ERROR_INTEGRATION parameter value.

> **Note:**
>
> Creating or modifying a pipe that references a notification integration requires a role that has the USAGE privilege on the notification
> integration. In addition, the role must have either the CREATE PIPE privilege on the schema or the OWNERSHIP privilege on the pipe,
> respectively.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

#### New pipe

Create a new pipe using [CREATE PIPE](../sql-reference/sql/create-pipe.md). Note that the configuring automated data loads (i.e. auto-ingest Snowpipe)
requires additional pipe parameters. For instructions, see [Automate continuous data loading with cloud messaging](data-load-snowpipe-auto.md).

```sqlsyntax
CREATE PIPE <name>
  [ AUTO_INGEST = TRUE | FALSE  ]
  ERROR_INTEGRATION = <integration_name>
  AS <copy_statement>
```

Where:

`ERROR_INTEGRATION = <integration_name>`
:   Name of the notification integration you created in [Create the notification integration](notifications/creating-notification-integration-amazon-sns.md).

The following example shows a CREATE PIPE statement that supports both error notifications and automated data loads:

```sqlexample
CREATE PIPE mypipe
  AUTO_INGEST = TRUE
  ERROR_INTEGRATION = my_notification_int
  AS
  COPY INTO mydb.public.mytable
  FROM @mydb.public.mystage;
```

#### Existing pipe

Modify an existing pipe using [ALTER PIPE](../sql-reference/sql/alter-pipe.md):

```sqlsyntax
ALTER PIPE <name> SET ERROR_INTEGRATION = <integration_name>;
```

Where `<integration_name>` is the name of the notification integration you created in
[Create the notification integration](notifications/creating-notification-integration-amazon-sns.md).

For example:

```sqlexample
ALTER PIPE mypipe SET ERROR_INTEGRATION = my_notification_int;
```

## Error notification message payload

The body of error messages identifies the pipe and the errors encountered during a load.

The following is a sample message payload describing a Snowpipe error. The payload can include one or more error messages.

```bash
{\"version\":\"1.0\",\"messageId\":\"a62e34bc-6141-4e95-92d8-f04fe43b43f5\",\"messageType\":\"INGEST_FAILED_FILE\",\"timestamp\":\"2021-10-22T19:15:29.471Z\",\"accountName\":\"MYACCOUNT\",\"pipeName\":\"MYDB.MYSCHEMA.MYPIPE\",\"tableName\":\"MYDB.MYSCHEMA.MYTABLE\",\"stageLocation\":\"s3://mybucket/mypath\",\"messages\":[{\"fileName\":\"/file1.csv_0_0_0.csv.gz\",\"firstError\":\"Numeric value 'abc' is not recognized\"}]}
```

Note that you must parse the string into a JSON object to process values in the payload.

---
title: Enabling Snowpipe error notifications for Google Pub/Sub
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-errors-gcs.md
section: User Guide
---

# Enabling Snowpipe error notifications for Google Pub/Sub

This topic provides instructions for pushing Snowpipe error notifications to the [Google Cloud Pub/Sub](https://cloud.google.com/storage/docs/reporting-changes) (Pub/Sub) service.

This feature can push error notifications for the following types of loads:

* Auto-ingest Snowpipe.
* Calls to the Snowpipe `insertFiles` REST API endpoint.
* Loads from Apache Kafka using the Snowflake Connector for Kafka with the Snowpipe ingestion method only.

## Cloud platform support

Currently, this feature is limited to Snowflake accounts hosted on Google Cloud (GC). Snowpipe can load data from files in any supported cloud storage service; however, push notifications to Pub/Sub are only supported in Snowflake accounts hosted on GC.

## Notes

* Snowflake guarantees at-least-once message delivery of error notifications (i.e. multiple attempts are made to deliver messages to ensure at least one attempt succeeds, which can result in duplicate messages).
* This feature is implemented using the notification integration object. A notification integration is a Snowflake object that provides an
  interface between Snowflake and third-party cloud message queuing services. A single notification integration can support multiple pipes.

## Enabling error notifications

### Creating the notification integration

See [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](notifications/creating-notification-integration-google-pubsub.md).

### Enabling error notifications in pipes

A single notification integration can be shared by multiple pipes. The body of error messages identifies the pipe, external stage and
path, and file where the error originated, among other details.

To enable error notifications for a pipe, specify an ERROR_INTEGRATION parameter value.

> **Note:**
>
> Creating or modifying a pipe that references a notification integration requires a role that has the USAGE privilege on the notification
> integration. In addition, the role must have either the CREATE PIPE privilege on the schema or the OWNERSHIP privilege on the pipe,
> respectively.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

#### New pipe

Create a new pipe using [CREATE PIPE](../sql-reference/sql/create-pipe.md):

```sqlsyntax
CREATE PIPE <name>
  AUTO_INGEST = TRUE
  [ INTEGRATION = '<string>' ]
  ERROR_INTEGRATION = <integration_name>
  AS <copy_statement>
```

Where:

`ERROR_INTEGRATION = <integration_name>`
:   Name of the notification integration you created in [Create a notification integration in Snowflake](notifications/creating-notification-integration-google-pubsub.md).

For example:

```sqlexample
CREATE PIPE mypipe
  AUTO_INGEST = TRUE
  INTEGRATION = 'my_storage_int'
  ERROR_INTEGRATION = my_notification_int
  AS
  COPY INTO mydb.public.mytable
  FROM @mydb.public.mystage;
```

#### Existing pipe

Modify an existing pipe using [ALTER PIPE](../sql-reference/sql/alter-pipe.md).

> **Note:**
>
> If a notification integration was specified when the pipe was created, it is necessary to first
> unset the ERROR_INTEGRATION parameter (using ALTER PIPE … UNSET ERROR_INTEGRATION) and then set the parameter.

```sqlsyntax
ALTER PIPE <name> SET ERROR_INTEGRATION = <integration_name>;
```

Where `<integration_name>` is the name of the notification integration you created in
[Create a notification integration in Snowflake](notifications/creating-notification-integration-google-pubsub.md).

For example:

```sqlexample
ALTER PIPE mypipe SET ERROR_INTEGRATION = my_notification_int;
```

## Error notification message payload

The body of error messages identifies the pipe and the errors encountered during a load.

The following is a sample message payload describing a Snowpipe error. The payload can include one or more error messages.

```bash
{\"version\":\"1.0\",\"messageId\":\"a62e34bc-6141-4e95-92d8-f04fe43b43f5\",\"messageType\":\"INGEST_FAILED_FILE\",\"timestamp\":\"2021-10-22T19:15:29.471Z\",\"accountName\":\"MYACCOUNT\",\"pipeName\":\"MYDB.MYSCHEMA.MYPIPE\",\"tableName\":\"MYDB.MYSCHEMA.MYTABLE\",\"stageLocation\":\"gcs://mybucket/mypath\",\"messages\":[{\"fileName\":\"/file1.csv_0_0_0.csv.gz\",\"firstError\":\"Numeric value 'abc' is not recognized\"}]}
```

Note that you must parse the string into a JSON object to process values in the payload.

---
title: Enabling Snowpipe error notifications for Microsoft Azure Event Grid
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-errors-azure.md
section: User Guide
---

# Enabling Snowpipe error notifications for Microsoft Azure Event Grid

This topic provides instructions for pushing Snowpipe error notifications to the [Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/) (Event Grid).

This feature can push error notifications for the following types of loads:

* Auto-ingest Snowpipe.
* Calls to the Snowpipe `insertFiles` REST API endpoint.
* Loads from Apache Kafka using the Snowflake Connector for Kafka with the Snowpipe ingestion method only.

## Cloud platform support

Currently, this feature is limited to Snowflake accounts hosted on Microsoft Azure. Snowpipe can load data from files in any supported cloud storage service; however, push notifications to Event Grid are only supported in Snowflake accounts hosted on Azure

## Notes

* Snowflake guarantees at-least-once message delivery of error notifications (i.e. multiple attempts are made to deliver messages to ensure at least one attempt succeeds, which can result in duplicate messages).
* This feature is implemented using the notification integration object. A notification integration is a Snowflake object that provides an
  interface between Snowflake and third-party cloud message queuing services. A single notification integration can support multiple pipes.

## Enabling error notifications

### Creating the notification integration

See [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](notifications/creating-notification-integration-azure-event-grid.md).

### Enabling error notifications in pipes

A single notification integration can be shared by multiple pipes. The body of error messages identifies the pipe, external stage and
path, and file where the error originated, among other details.

To enable error notifications for a pipe, specify an ERROR_INTEGRATION parameter value.

> **Note:**
>
> Creating or modifying a pipe that references a notification integration requires a role that has the USAGE privilege on the notification
> integration. In addition, the role must have either the CREATE PIPE privilege on the schema or the OWNERSHIP privilege on the pipe,
> respectively.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

#### New pipe

Create a new pipe using [CREATE PIPE](../sql-reference/sql/create-pipe.md):

```sqlsyntax
CREATE PIPE <name>
  AUTO_INGEST = TRUE
  [ INTEGRATION = '<string>' ]
  ERROR_INTEGRATION = <integration_name>
  AS <copy_statement>
```

Where:

`ERROR_INTEGRATION = <integration_name>`
:   Name of the notification integration you created in [Create notification integration in Snowflake](notifications/creating-notification-integration-azure-event-grid.md).

For example:

```sqlexample
CREATE PIPE mypipe
  AUTO_INGEST = TRUE
  INTEGRATION = 'my_storage_int'
  ERROR_INTEGRATION = my_notification_int
  AS
  COPY INTO mydb.public.mytable
  FROM @mydb.public.mystage;
```

#### Existing pipe

Modify an existing pipe using [ALTER PIPE](../sql-reference/sql/alter-pipe.md):

```sqlsyntax
ALTER PIPE <name> SET ERROR_INTEGRATION = <integration_name>;
```

Where `<integration_name>` is the name of the notification integration you created in
[Create notification integration in Snowflake](notifications/creating-notification-integration-azure-event-grid.md).

For example:

```sqlexample
ALTER PIPE mypipe SET ERROR_INTEGRATION = my_notification_int;
```

## Error notification message payload

The body of error messages identifies the pipe and the errors encountered during a load.

The following is a sample message payload describing a Snowpipe error. The payload can include one or more error messages.

```bash
{\"version\":\"1.0\",\"messageId\":\"a62e34bc-6141-4e95-92d8-f04fe43b43f5\",\"messageType\":\"INGEST_FAILED_FILE\",\"timestamp\":\"2021-10-22T19:15:29.471Z\",\"accountName\":\"MYACCOUNT\",\"pipeName\":\"MYDB.MYSCHEMA.MYPIPE\",\"tableName\":\"MYDB.MYSCHEMA.MYTABLE\",\"stageLocation\":\"azure://myaccount.blob.core.windows.net/mycontainer/mypath\",\"messages\":[{\"fileName\":\"/file1.csv_0_0_0.csv.gz\",\"firstError\":\"Numeric value 'abc' is not recognized\"}]}
```

Note that you must parse the string into a JSON object to process values in the payload.

---
title: Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md
section: User Guide
---

# Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™

This topic describes how to enforce data protection policies set on Apache Iceberg™ tables when accessed over Apache Spark™ through Snowflake
Horizon Catalog. To enforce data protection policies, you install the Snowflake Connector for Spark, *Spark connector*. For more information
about the Spark connector, see [Snowflake Connector for Spark](spark-connector.md).

The Spark connector supports querying tables that are protected by Snowflake policies by routing the query through Snowflake, which ensures efficient
use of compute and consistent enforcement. The Spark connector also supports performing write operations on tables that are protected by Snowflake policies by
routing the write through Snowflake.

> **Note:**
>
> The Spark connector also supports directly querying Apache Iceberg tables without fine-grained data protection policies by using Spark
> session compute through Snowflake Horizon Catalog.

## Workflow to enforce data protection policies when querying Iceberg tables from Spark

To enforce data protection policies when querying Iceberg tables from Spark, complete the following steps:

1. Configure data protection policies.
2. Connect Spark with Snowflake Spark Connector to Iceberg tables, which includes downloading the Snowflake Connector for Spark
   and connecting Spark to Iceberg tables through Snowflake Horizon Catalog.
3. Query Iceberg tables.

## Supported data protection policies

The following data protection policies are supported:

* [Masking policies](security-column-intro.md)
* [Tag-based masking policies](tag-based-masking-policies.md)
* [Row access policies](security-row-intro.md)

Queries on tables that are protected with any other data policy result in an error.

## Prerequisites

* Spark 3.5.3 or higher is required to use this feature.
* Retrieve the following information:

  + The username of the Snowflake user who will query the tables
  + The name of the Snowflake database that contains the tables that you want to query
  + The name of the virtual warehouse in Snowflake to use for policy evaluation
* Retrieve the account identifier for your Snowflake account that contains the Iceberg tables that you want to query. For instructions,
  see [Account identifiers](admin-account-identifier.md). You specify this identifier when you
  connect Spark to Iceberg tables with data access policies enforced.

  > **Tip:**
  >
  > To get your account identifier by using SQL, run the following command:
  >
  > ```sqlexample
  > SELECT CURRENT_ORGANIZATION_NAME() || '-' || CURRENT_ACCOUNT_NAME();
  > ```

## Step 1: Configure data protection policies

> **Important:**
>
> If you already set data protection policies on the Iceberg tables that you want to query, proceed to the next step.

In this step, you configure data protection policies.

* To configure data protection policies, set data access policies on the Iceberg tables that you want to query:

  + To assign masking policies, see [Understanding Dynamic Data Masking](security-column-ddm-intro.md).
  + To assign tag-based masking policies, see [Tag-based masking policies](tag-based-masking-policies.md).
  + To assign row access policies, see [Understanding row access policies](security-row-intro.md).

## Step 2: Connect Spark with Snowflake Connector for Spark to Iceberg tables

In this step, you connect Spark to Iceberg tables through Horizon Catalog. With this
connection, you can query the tables by using Spark with the data protection policies enforced on the tables.

To Connect Spark with the Snowflake Connector for Spark (Spark connector) to Iceberg tables, you first download the Spark connector, and
then you connect Spark to Iceberg tables.

### Download the Snowflake Connector for Spark

To download 3.1.6 or a later version of the Snowflake Connector for Spark, follow the instructions in [Installing and Configuring the Spark Connector](spark-connector-install.md).

### Connect Spark to Iceberg tables

In this step, you connect Spark to Iceberg tables through Horizon Catalog. This connection includes configurations for you to use
the Snowflake Connector for Spark with Horizon catalog to query the tables that are protected by Snowflake data protection policies.

> **Note:**
>
> If you’re using External OAuth or key-pair authentication, see Connect Spark to Iceberg tables by using External OAuth or key pair authentication.

* To connect Spark to Iceberg tables by using a programmatic access token (PAT), use the following example PySpark code:

  ```python
  from pyspark.sql import SparkSession

  # Snowflake Horizon Catalog Configuration, change as per your environment

  CATALOG_URI = "https://<account_identifier>.snowflakecomputing.com/polaris/api/catalog"
  ROLE = "<role>"
  HORIZON_SESSION_ROLE = f"session:role:{ROLE}"
  CATALOG_NAME = "<database_name>" #provide in UPPER CASE
  SF_URL= "<account_identifier>.snowflakecomputing.com"
  SF_USER = "<user_name>" #provide in UPPER CASE
  SF_PASSWORD = "<user_password>"
  SF_SCHEMA = "<schema_name>" #provide in UPPER CASE
  SF_WAREHOUSE = "<warehouse_name>" #provide in UPPER CASE

  # Cloud Service Provider Region Configuration (where the Iceberg data is stored)
  REGION = "<region_name>"

  # Paste the External Oauth Access token that you generated in Snowflake here
  ACCESS_TOKEN = "<your_access_token>"

  # Paste the PAT you generated in Snowflake here
  PAT_TOKEN = "<your_PAT_token>"

  # Iceberg Version
  ICEBERG_VERSION = "1.9.1"

  #Snowflake Connector for Spark
  DRIVER_VERSION = "3.24.0" # (or above)
  SNOWFLAKE_CONNECTOR_VERSION = "3.1.6"

  try:
      spark.stop()
  except:
      pass

    spark = (
        SparkSession.builder

        .master("local[*]")
  .config("spark.ui.port", "0")
        .config("spark.driver.bindAddress", "127.0.0.1")
        .config("spark.driver.host", "127.0.0.1")
        .config("spark.driver.port", "0")
        .config("spark.blockManager.port", "0")

  # JAR Dependencies for Iceberg, Azure and Snowflake Connector for Spark
        .config(
   "spark.jars.packages",
   f"org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:{ICEBERG_VERSION},"
   f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION},"

     # for Azure storage, use the below package and comment above azure bundle
            # f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"
  # for Snowflake Connector for Spark
   f"net.snowflake:snowflake-jdbc:{DRIVER_VERSION},"
   f"net.snowflake:spark-snowflake_2.12:{SNOWFLAKE_CONNECTOR_VERSION}"

  )
        # Iceberg SQL Extensions
        .config("spark.sql.extensions", "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions")
        .config("spark.sql.defaultCatalog", "horizoncatalog")
  .config("spark.sql.catalog.horizoncatalog", "org.apache.spark.sql.snowflake.catalog.SnowflakeFallbackCatalog")

    #Horizon REST Catalog Configuration
     .config(f"spark.sql.catalog.horizoncatalog.catalog-impl", "org.apache.iceberg.spark.SparkCatalog")
        .config(f"spark.sql.catalog.horizoncatalog.type", "rest")
        .config(f"spark.sql.catalog.horizoncatalog.uri", CATALOG_URI)
        .config(f"spark.sql.catalog.horizoncatalog.warehouse", CATALOG_NAME)
        .config(f"spark.sql.catalog.horizoncatalog.scope", HORIZON_SESSION_ROLE)
        .config(f"spark.sql.catalog.horizoncatalog.client.region", REGION)
        .config(f"spark.sql.catalog.horizoncatalog.credential", PAT_TOKEN)
  # for External Oauth use below and comment above configuration .token
  #.config(f"spark.sql.catalog.horizoncatalog.token", ACCESS_TOKEN)

  .config("spark.sql.catalog.horizoncatalog.io-impl","org.apache.iceberg.aws.s3.S3FileIO")
  # Enforcing policies using Snowflake Connector for Spark
  .config("spark.snowflake.sfURL", SF_URL)
  .config("spark.snowflake.sfUser", SF_USER)
  .config("spark.snowflake.sfPassword", SF_PASSWORD)
  # for External Oauth uncomment below and comment above configurations for user and password
  #.config("spark.snowflake.sfAuthenticator","oauth")
  #.config("spark.snowflake.sfToken",ACCESS_TOKEN)
  .config("spark.snowflake.sfDatabase", CATALOG_NAME)
  .config("spark.snowflake.sfSchema",SF_SCHEMA) # Optional
  .config("spark.snowflake.sfRole",ROLE)
  .config("spark.snowflake.sfWarehouse",SF_WAREHOUSE)

    # Required for vended credentials
   .config(f"spark.sql.catalog.horizoncatalog.header.X-Iceberg-Access-Delegation", "vended-credentials")
        .config("spark.sql.iceberg.vectorization.enabled", "false")
        .getOrCreate()
    )
    spark.sparkContext.setLogLevel("ERROR")
  ```

  Where:

  + `<account_identifier>` is your Snowflake account identifier for the Snowflake account that contains the Iceberg tables that you want
    to query. To find this identifier, see [Account identifiers](admin-account-identifier.md).
  + `<your_access_token>` is your access token that you obtained. To obtain an access token, see
    [Obtain access token for authentication](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

    > **Note:**
    >
    > For External OAuth, alternatively, you can configure your connection to the engine with automatic token refresh instead of specifying
    > an access token.
  + `<database_name>` is the name of the database in your Snowflake account that contains Snowflake-managed Iceberg tables that you want to query.

    > **Note:**
    >
    > The following properties in Spark expect your Snowflake *database* name, not your Snowflake warehouse name:
    >
    > - `.warehouse`
    > - `.sfDatabase`
  + `<role>` is the role in Snowflake that is configured with access to the Iceberg tables that you want to query. For example: DATA_ENGINEER.
  + `<user_name>` is the user name that is used to access tables in Snowflake.
  + `<user_password>` is the password for the user accessing the tables.

    > **Note:**
    >
    > This password can be the programmatic access token (PAT)
    > that you obtained for authentication, if applicable.
  + `<schema_name>` is the schema in Snowflake where the tables are stored. This is optional.
  + `<warehouse_name>` is the Snowflake warehouse (compute instance) name that you want to be used for evaluating policies.
  > **Important:**
  >
  > By default, the code example is set up for Apache Iceberg™ tables stored on Amazon S3. If your Iceberg tables are stored on Azure Storage (ADLS),
  > perform the following steps:
  >
  > > 1. Comment out the following line: `f"org.apache.iceberg:iceberg-aws-bundle:{ICEBERG_VERSION}"`
  > > 2. Uncomment the following line: `# f"org.apache.iceberg:iceberg-azure-bundle:{ICEBERG_VERSION}"`

#### Connect Spark to Iceberg tables by using External OAuth or key pair authentication

The previous code example shows the configuration for connecting by using a programmatic access token (PAT).

To connect
Spark to Iceberg tables by using External OAuth or key pair authentication, follow these steps to alter the previous code example:

1. For `<your_access_token>`, specify your access token for External OAuth or key-pair authentication.

   To obtain an access token, see [Step 3: Obtain an access token for authentication](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).
2. Comment out the following line: `.config(f"spark.sql.catalog.{CATALOG_NAME}.credential", PAT_TOKEN)`
3. Uncomment the following line: `#.config(f"spark.sql.catalog.{CATALOG_NAME}.token", ACCESS_TOKEN)`

## Step 3: Query Iceberg tables by using Spark

Use Spark to read Iceberg tables that are protected by Snowflake data protection policies. Spark can automatically route queries of tables that
are protected by Snowflake policies through Snowflake to ensure consistent enforcement.

### Query a table

```python
spark.sql("SHOW NAMESPACES").show(truncate=False)
spark.sql("USE horizoncatalog.<schema_name>")
spark.sql("SHOW TABLES").show(truncate=False)
spark.sql("Select * from <your_table_name_in_snowflake>").show(truncate=False)
```

## Monitor a query for policy evaluation

To monitor query activity in Snowflake for queries that are routed from Spark to Snowflake for policy evaluation,
you can monitor query activity in your Snowflake account.

* To monitor query history in Snowflake, follow the instructions in [Monitor query activity with Query History](ui-snowsight-activity.md).

## Considerations for configuring data protection policies

Consider the following items when you configure data protection policies:

* Enforcing data protection policies on Iceberg tables that you query by using Spark is only supported when the following data protection policies
  are set on the tables:

  + Masking policies
  + Tag-based masking policies
  + Row access policies

  Queries on tables that are protected by all other policies will result in an error.

---
title: Enforcement of privatelink-only access
source: https://docs.snowflake.com/en/user-guide/security-disable-public-access-privatelink.md
section: User Guide
---

# Enforcement of privatelink-only access

## Overview

Each Snowflake customer can access their Snowflake account using their customer-specific, dedicated account URLs and generic Snowflake UI
URLs. Enabling private connectivity establishes private URLs for your account. After establishing private connectivity, the private URLs
that you use to connect to Snowflake must include “privatelink”. For example, the host URL can have the following formats:

* Account Name: `https://<orgname>-<account_name>.privatelink.snowflakecomputing.com`
* Connection Name: `https://<orgname>-<connectionname>.privatelink.snowflakecomputing.com`
* Account Locator (legacy): `https://<account_locator>.<region>.privatelink.snowflakecomputing.com`

Accounts that use only privatelink for inbound connections to Snowflake are also known as “privatelink-only” accounts. For more information
about using URLs to connect to your Snowflake account, see [Connecting with a URL](organizations-connect.md).

You can harden your security posture by disabling public access to your privatelink-only accounts. For example, after you disable public
access to your privatelink-only accounts, anyone attempting to “guess” your Snowflake account URL by providing a public URL sees a static
web page that displays: `HTTP - 404 account not found`. Snowflake Core Service checks requests incoming from the public internet before
requesting authorization. Returning `HTTP - 404 account not found` provides no indication that the account exists. In this way, disabling
public access protects your privatelink-only accounts.

> **Important:**
>
> Connect to your account using private connectivity, then run the [SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY](../sql-reference/functions/system_enforce_privatelink_access_only.md)
> command. Any SaaS service that does not support private connectivity cannot connect to Snowflake after you have disabled public access to
> your privatelink-only accounts.

Disabling public access to your privatelink-only accounts:

* Disables **public** access to all Snowflake service endpoints only.
* Does not affect public access to internal stage buckets.

### Granular network access restrictions

You can define granular access to your account by creating network rules that restrict network access through specific private endpoint IDs.
You can also define network rules to limit or deny publicly-routed sessions. For more information, see [CREATE NETWORK RULE](../sql-reference/sql/create-network-rule.md).

To enforce the access definitions, you can create network policies that use your network rule definitions. For more information,
see [Controlling network traffic with network policies](network-policies.md).

> **Note:**
>
> Blocking access to private endpoints using network rules is not (yet) supported on Google Cloud.

## Disable public access to your privatelink-only accounts

To disable public access to all Snowflake service endpoints in your Snowflake account:

1. Verify or establish private connectivity to your account.
2. Call the [SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY](../sql-reference/functions/system_enforce_privatelink_access_only.md) function.

## Restore public access to your privatelink-only accounts

To restore public access to all Snowflake service endpoints in your Snowflake account, call the [SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY](../sql-reference/functions/system_disable_privatelink_access_only.md) function.

## Restrict access to the function that restores public access

Customers who want to restrict their account administrators from restoring public access for inbound network traffic must request that
Snowflake modify their account.

To restrict access to the SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY function:

1. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
2. Request that Snowflake restrict access to the SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY function for your account.

---
title: Enroll in multi-factor authentication (MFA)
source: https://docs.snowflake.com/en/user-guide/opencatalog/enroll-mfa.md
section: User Guide
---

# Enroll in multi-factor authentication (MFA)

MFA provides increased sign-in security for users connecting to Snowflake Open Catalog. MFA support is provided as an integrated Open Catalog feature
powered by the [Duo Security](https://duo.com/) service, which is managed completely by Snowflake.

**Important**

> We strongly recommend that you enroll in MFA. If you don’t, you’re putting your data at risk.

To enroll in MFA, you must enroll using Open Catalog. Enrollment
requires a smartphone with a valid phone number and the Duo Mobile application installed.

To enroll in MFA, follow these steps:

1. Sign in to Open Catalog.
2. Select the user menu, and then select **My profile**.
3. Select **Enroll**.
4. Select **Start setup**.
5. Follow the instructions provided in the dialog.

After you enroll, each time you attempt to sign in to Open Catalog, you are prompted to enter your required user credentials (user name and
password) and then prompted for a passcode generated by the Duo Mobile application.

---
title: Enterprise use cases for DCM Projects
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-enterprise.md
section: User Guide
---

# Enterprise use cases for DCM Projects

This topic covers how to use DCM Projects in enterprise environments, such as managing multiple projects, working with multiple environments, and
collaborating on projects.

## When to use multiple DCM projects

When deciding if and how to split a DCM project into multiple projects, consider ownership and templating.

### Ownership

Each project has one owner role that can deploy all defined objects. Grants allow granular access management for individual objects inside
the project. However, if different groups of users are responsible for deploying changes to a project, it generally makes sense to
split a DCM project accordingly.

The following is an example scenario:

* The platform administrator deploys a database and a warehouse, creates the team administrator role, and
  grants CREATE privileges to the team administrator for a defined set of object types inside that database, as well as access to a defined
  set of account-level integrations.
* The team administrator can now decide how to organize schemas and dynamic tables inside that database, fine-tune refresh frequencies, and grant more
  granular read access to individual team members.

The following is a solution:

* The platform administrator deploys the high-level infrastructure for the team and grants the team administrator the privilege to create DCM project projects
  inside their database.
* The team administrator can now also benefit from DCM Projects by creating one or more projects inside the team database to manage tables and grants to team members.

### Template variables

If a DCM project defines a range of objects that are and should remain mostly similar, it is generally more convenient to define them once
as parameterized template.

The following is an example scenario:

* The platform team deploys a database for each regional team in the organization.
* New regions are expected to be added over time.
* All regions require mostly the same setup of schema, landing tables, roles, and warehouse.
* Changes in this database template should be applied to all teams, for example, adding a read-only role.

The following is a solution:

* You can execute a single set of definitions in a loop for each regional team listed in the manifest profile.

When more elements of this template start to diverge and the number of templating conditions increases, it can become easier to read and
maintain separate DCM projects with their individual object definitions.

## Use DCM Projects with multiple environments

The following diagram shows a typical workflow for deploying a DCM project to multiple environments.

### Separate accounts vs. separate databases

Snowflake generally recommends setting up each environment as a separate Snowflake account. This ensures complete separation of production
infrastructure from any experimental development and and guarantees restricted developer access to production data.

However, with careful access management, you can successfully manage multiple environments on one Snowflake account. This is easier when the
databases are clearly separated and can become more challenging when account-level objects and integrations are involved.

The benefit of a single-account setup is the ability to easily clone production infrastructure and data for testing alterations before
deploying those changes to production. However, copying parts of production data and infrastructure to a different account, for example,
through org-internal data shares, can be more costly.

### Impact on DCM project templating

Distinct object names for each environment are a requirement for single-account setups, for example, to keep `EMEA_DB` and `EMEA_ADMIN`
separate from `EMEA_DB_DEV` and `EMEA_ADMIN_DEV`. Snowflake also recommends this practice for multi-account setups. Templated names
allow for multiple instances of entities like `EMEA_DB_DEV_JOHN` and `EMEA_DB_DEV_MARY` to coexist for independent development and to
quickly create and destroy sandbox environments to test different solutions.

This applies to all account-level objects, such as databases, roles, and warehouses. You then need to apply these templated names to all
fully qualified names of nested objects.

## Collaborate on DCM Projects

### Shared development environment

Multiple developers commonly share the same development account to build and iterate on data products in parallel. However, if multiple
users work on the same project in parallel, their PLAN and DEPLOY operations can cause conflicts if they don’t use templating to create
unique names.

The following is an example scenario:

* Users A and B are both testing changes to different parts of project `TASTYBYTES`, which already runs on production.
* Each user creates their own feature branch of `prod-main` and starts editing the entity definitions.
* Each user creates their own DCM project (`TASTYBYTES_DEV_A` and `TASTYBYTES_DEV_B`).
* If both users deploy with the same `DEV` templating configuration to the same Snowflake account, then:

  + User A deploys the new `_DEV` instance of all entities first including the `TB_WAREHOUSE_DEV`, so they are managed by their project `TASTYBYTES_DEV_A`.
  + Once user B tries to deploy one or more of the same object names (for example, `TB_WAREHOUSE_DEV`), the deployment for
    `TASTYBYTES_DEV_B` fails because the warehouse is already managed by `TASTYBYTES_DEV_A`.
* Alternatively, both users could own and deploy from the same project `TASTYBYTES_DEV`, each pointing at their different branch folders.
  This would lead to user B overwriting all deployed entity versions of user A and vice versa.

The following is a solution:

* When working on the same development environment in parallel, Snowflake recommends always using distinct entity names to avoid conflicting
  object names. You can achieve this by templating database, warehouse, and role names with unique suffixes. For example, `DEFINE DATABASE
  DCM_PROJECT_{{db}};`
* When using configuration profiles like the following example, multiple developers can all use the `DEV` configuration to set their warehouses
  to `X-SMALL`.
* To avoid conflicting database names, developers should overwrite the `db` variable with a unique string. This could be based on
  user names, feature names, ticket numbers, or branch names.

  For example, `snow dcm deploy --variable "db='DEV_JS'"` would resolve to a unique `DEFINE DATABASE DCM_PROJECT_DEV_JS;` operation.

  ```yaml
  templating:
    defaults:
      wh_size: "X-SMALL"

    configurations:
      DEV:
        db: "DEV"

      TEST:
        db: "TEST"

      PROD:
        db: "PROD"
        wh_size: "LARGE"
  ```
* You can apply the same templating solution when one developer works on multiple projects.
* The following is an example of a scalable project setup for teams.

  When you start a new Jira ticket, complete the following steps:

  > 1. `CREATE GIT BRANCH {{ticket_number}} FROM REPO`
  > 2. `CREATE DCM PROJECT {{ticket_number}}`
  > 3. `EXECUTE DCM PROJECT {{ticket_number}} PLAN USING CONFIGURATION "DEV" (db => '{{ticket_number}}') FROM @REPO/BRANCHES/{{ticket_number}}/DCM_PROJECT/`

---
title: Error handling in Snowpipe Streaming high-performance architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-error-handling.md
section: User Guide
---

# Error handling in Snowpipe Streaming high-performance architecture

This topic outlines the error handling mechanisms available in the high-performance edition of Snowpipe Streaming. This enhanced approach provides detailed error information and improves the overall error handling process for a more robust and informative experience.

## Key error handling features in the high-performance architecture

* Enhanced channel status endpoint: This edition extends the channel status endpoint to provide more comprehensive error information.
* Granular error details: The high-performance edition provides more detailed error information to help identify where it occurred and find the root causes of ingestion issues.
* Improved client experience: The high-performance edition simplifies error handling for clients, reducing the complexity of error reasoning and recovery.
* The channel history view: [SNOWPIPE_STREAMING_CHANNEL_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_channel_history.md) provides a historical record of channel activity to monitor and locate errors. This feature lets you track error trends and proactively address potential issues.

## Channel status endpoint details

The high-performance architecture includes a channel status endpoint to provide more detailed, point-in-time information about a channel.

In addition to the channel status information for the classic architecture, which is `statusCode`, `persistedOffsetToken`, the high-performance architecture includes the following information:

* `channel_status_code`: Represents the current operational status of the streaming channel. This code provides a high-level indication of the channel’s health and ability to ingest data. For more information about the channel status codes, see Client-side error handling and required actions.
* `last_commited_offset_token`: Indicates the offset token of the last row set that was successfully committed to the target table by Snowflake. This is crucial for tracking progress and ensuring data delivery.
* `created_on_ms`: The timestamp, in milliseconds, that indicates when the streaming channel was initially created within Snowflake.
* `database_name`: The name of the database to which the streaming channel is configured to ingest data.
* `schema_name`: The name of the schema within the specified database where the target table for the streaming channel resides.
* `pipe_name`: The name of the Snowpipe object that is configured to use this Snowpipe Streaming channel for data ingestion into a specific target table.
* `channel_name`: A user-created name for the specific Snowpipe Streaming channel instance.
* `rows_inserted`: A count of the total number of data rows that have been successfully inserted into the target table through this streaming channel since its creation.
* `rows_parsed`: A count of the total number of data rows that have been processed and parsed by the Snowpipe Streaming service for this channel. (but not necessarily inserted, for example, due to errors).
* `rows_error_count`: A count of the total number of data rows that encountered errors during processing and were therefore rejected by the Snowpipe Streaming service for this channel.
* `last_error_offset_upper_bound`: The upper bound of the offset token range of the last rowset that contained errors. This helps in identifying the approximate location of the most recent errors within the data stream.
* `last_error_message`: A human-readable message corresponding to the latest error code.
* `last_error_timestamp`: The timestamp indicating when the most recent error occurred on this streaming channel.
* `snowflake_avg_processing_latency_ms`: The average latency, in milliseconds, observed by the Snowflake service in processing rowsets received by this channel. This metric provides insight into the performance of the ingestion pipeline within Snowflake.

## Error-handling flow in the high-performance architecture

* Client sends data: The client application uses the Snowpipe Streaming SDK to send data to Snowflake through the `appendRow(s)` API.
* Server processing: The Snowflake service processes the data. This involves:

  > + Buffering the data.
  > + Parsing and validating the data.
  > + Committing the data to the table.
* Error detection: Errors can occur during any of the server-side processing stages.
* Error recording: Snowflake records detailed information about the last error that occurred, including the following information:

  > + The upper bound of the offset token range of the last rowset that contained errors. This helps in identifying the approximate location of the most recent errors within the data stream.
  > + An error message.
  > + A timestamp.
* Error reporting:

  > + The enhanced channel status endpoint provides access to the recorded error information.
  > + Clients can query this endpoint to retrieve details about the last error that occurred.
  > + [SNOWPIPE_STREAMING_CHANNEL_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_channel_history.md) provides a historical record of errors and their offsets.
* Client action: The client application uses the error information to perform the following actions:

  > + Identify the cause of the error.
  > + Implement appropriate error handling logic, such as the following actions:
  >
  >   > - Retrying the failed operation.
  >   > - Logging the error.
  >   > - Alerting an administrator.
  >   > - Moving the erroneous data to a dead-letter queue.
  >   > - Reopening channels.

## Client-side error handling and required actions

The Snowpipe Streaming SDK simplifies error handling by implementing internal retry logic for transient errors. However, for fatal channel errors and persistent authorization issues, you are required to take manual action.

### SDK retry logic for transient errors

The SDK automatically retries the request to send unflushed data in the channel to the server for the following HTTP status codes, as they typically indicate a temporary or transient service issue:

* 5XX (Server errors)
* 429 (Too many requests)
* 408 (Request timeout)

### Channel errors that require a manual reopen

The Snowpipe Streaming SDK doesn’t automatically reopen the channel. When a channel enters a state that isn’t valid, you must explicitly close and reopen the channel to continue ingestion.

A channel is considered invalid — and requires client action — if the `channel_status_code` in the channel status response is anything other than `SUCCESS`.

The following table shows persisted error codes that indicate a fatal channel state and require the channel to be reopened:

| Error code | Context | Required client action |
| --- | --- | --- |
| ERR_PIPE_DOES_NOT_EXIST_OR_NOT_AUTHORIZED | The target pipe is missing or inaccessible. | Fix the pipe issue. Reopen channel. |
| ERR_TABLE_DOES_NOT_EXIST_NOT_AUTHORIZED | The target table is missing or inaccessible. | Fix the table issue. Reopen channel. |
| ERR_CHANNEL_HAS_INVALID_ROW_SEQUENCER | Row sequencing state isn’t valid. | Reopen channel. |
| ERR_CHANNEL_HAS_INVALID_CLIENT_SEQUENCER | Channel sequencing state isn’t valid. | Reopen channel. |
| ERR_CHANNEL_MUST_BE_REOPENED | A general error indicating the channel is unusable. | Reopen channel. |
| ERR_CHANNEL_MUST_BE_REOPENED_DUE_TO_ROW_SEQ_GAP | A gap in the row sequence was detected. | Reopen channel. |

### Schema evolution failure and channel invalidation

When you use the Snowpipe Streaming high-performance architecture, it is important for you to understand a specific exception to the general `ON_ERROR=CONTINUE` behavior regarding schema evolution.

#### Channel invalidation on schema errors

Even if the `ON_ERROR=CONTINUE` option is configured for the load, the channel is invalidated if it encounters a schema evolution failure caused by user errors.

The following list includes common user errors that trigger channel invalidation:

* Submitting data with invalid column names that can’t be mapped.
* Attempting to add more columns than allowed by the configured column limit in a single batch. When columns are added across multiple batches, there is no limit.

This channel invalidation prevents the pipe from continuing to accept data that would cause persistent, non-recoverable schema issues. You can verify the invalidation status and reason for the channel failure by calling the `getChannelStatus()` method. For more information about the channel status fields, see Channel status endpoint details.

### Authorization errors that require a configuration fix

When an ingestion attempt results in an HTTP authorization error, you must correct the underlying permission or credential issue. Don’t reopen the channel for these errors because the new channel immediately encounters the same problem.

* 401 (Unauthorized)
* 403 (Forbidden)

For these errors, stop the ingestion, and then fix the client application’s security configuration — for example, pipe permissions, user role, authentication credentials — before you resume ingestion. After you fix the authorization issue, you can reopen the client to continue ingestion.

## Handling SDK exceptions and HTTP status codes

When you use the Java SDK, methods such as `insertRows`, `getLatestCommittedOffsetToken`, and `getChannelStatus` might throw an `SFException`. To ensure resilient ingestion, applications must catch these exceptions, and then inspect `getHttpStatusCode()` to determine the required recovery action.

The following table describes common HTTP status codes, their error conditions, and the required actions:

| Status code | Error condition | Required action |
| --- | --- | --- |
| 409 | Channel invalidated | The channel is no longer valid; for example, superseded by another client. Close the current channel, call `openChannel` to create a new instance, and then resume from the last committed offset. |
| 429 | Throttling | The client is sending data too quickly. Implement an exponential back-off and retry strategy. |
| 500 / 503 | Service error | Transient network or server-side issues. These errors are typically retryable after a short delay. |
| 401 | Unauthorized | Authentication failed. Verify credentials, JWT configuration, or role permissions. |

### Understanding invalidation levels

It is critical to distinguish between a channel invalidation and a client invalidation:

* **InvalidChannelException (HTTP 409)**: Only the specific channel is affected. Reopening the channel is sufficient.
* **InvalidClientException**: The entire `SnowflakeStreamingIngestClient` is compromised. You must close the existing client, initialize a new one using the factory, and then reopen all associated channels.

## Row-level error logging

For row-level error debugging, turn on **error logging** on your target table. When turned on, rows that fail
during server-side processing are automatically captured in a dedicated error table.

For more information, see [Error logging in Snowpipe Streaming with high-performance architecture](snowpipe-streaming-error-tables.md).

---
title: Error logging in Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-error-tables.md
section: User Guide
---

# Error logging in Snowpipe Streaming with high-performance architecture

Error logging for Snowpipe Streaming builds on Snowflake’s [DML error logging](../data-load-overview.md)
feature to provide a robust way to manage and recover from data ingestion errors. This feature prevents silent data loss
and increases visibility into faulty data rows. When error logging is turned on, error-free data continues to load into your
target table, while rows that fail processing are automatically routed to a dedicated error table for review and recovery.

> **Important:**
>
> The data stored in error tables is the original payload sent to the API or SDK before any pipe transformations
> are applied. Even if your pipe drops or transforms fields, the full original payload is persisted in the error table.

## Overview

When using the Snowpipe Streaming high-performance architecture, data processing happens server-side in Snowflake.
The high-performance architecture implicitly operates in `ON_ERROR = CONTINUE` mode, meaning valid rows are ingested while
problematic rows are skipped.

### Error handling options

You can monitor and handle ingestion errors in the following ways:

**Without error tables:**

* Use [getChannelStatus()](snowpipe-streaming-high-performance-error-handling.md)
  to monitor aggregated error counts, last error message, and last error timestamp.
* Query the [SNOWPIPE_STREAMING_CHANNEL_HISTORY](../../sql-reference/account-usage/snowpipe_streaming_channel_history.md)
  view for historical error trends and patterns.

These methods tell you *that* errors occurred and *how many*, but not *which rows* failed or their payloads.

**With error tables:**

* Rows that fail processing are automatically captured in a dedicated error table.
* Each error row includes the full original payload and detailed error metadata.
* You can query, analyze, and reprocess failed rows using standard SQL.

Error tables complete the picture by showing you exactly which rows failed and why, enabling full debugging and recovery.

## Turn on error logging

To turn on error logging for Snowpipe Streaming, set the `ERROR_LOGGING` property on the target table.
For complete details on turning on and configuring error logging, see
[Configure DML error logging for a table](../data-load-overview.md).

```sqlexample
-- For a new table:
CREATE TABLE my_streaming_table (...) ERROR_LOGGING = TRUE;

-- For an existing table:
ALTER TABLE my_streaming_table SET ERROR_LOGGING = TRUE;
```

When error logging is turned on, the same error table captures errors from both DML statements and Snowpipe Streaming
ingestion workloads.

## Query error tables

To query the error table for a base table, use the `ERROR_TABLE` table function. For complete details on
error table schema, access control, and supported operations, see
[Error logging and error tables](../data-load-overview.md).

```sqlexample
SELECT * FROM ERROR_TABLE(my_streaming_table) ORDER BY timestamp;
```

The result contains a row for every erroneous row in the ingestion stream.

## Snowpipe Streaming error fields

Snowpipe Streaming errors are stored in the same
[error table columns](../data-load-overview.md) as DML errors (`timestamp`,
`query_id`, `error_code`, `error_metadata`, `error_data`). The `error_metadata` and `error_data`
objects include additional fields for Snowpipe Streaming, described in the following sections.

### Identify Snowpipe Streaming errors

The `error_metadata:service` field is populated with `snowpipe_streaming` for errors from Snowpipe Streaming.
Use this field to filter errors by source:

```sqlexample
SELECT * FROM ERROR_TABLE(my_streaming_table)
WHERE error_metadata:service = 'snowpipe_streaming';
```

### Error metadata details

For Snowpipe Streaming errors, the `error_metadata:details` object contains the following additional fields:

| Field | Description |
| --- | --- |
| `pipe_name` | Name of the pipe used to ingest the erroneous row. |
| `channel_name` | Name of the channel used to ingest the erroneous row. |
| `offset_token_upper_bound` | Upper bound offset token containing the erroneous row. The row appears in the payload with this offset token or earlier. |
| `error_data_truncated` | Indicates whether the raw payload was truncated to fit into the error table (maximum 128 MB). |
| `error_data_content_type` | Indicates the type of content stored in the `error_data` column. See Error data content types. |

### Error data format

For Snowpipe Streaming errors, the `error_data:$1` field contains the raw payload representing the erroneous row.

If the payload contains invalid UTF-8 characters, the raw payload is stored as a base64-encoded binary string.

### Error data content types

The `error_data_content_type` field indicates the type of error encountered and suggests remediation steps.

#### json

The erroneous row is a syntactically valid JSON string, but a logical error occurred while ingesting the data
into the target table.

Common logical errors include:

* **Missing non-nullable columns**: A required column with a NOT NULL constraint was not provided in the payload.
* **Type conversion errors**: The JSON data type can’t be cast to the target column type. For example, a string
  value `"abc"` can’t be converted to a NUMBER column.
* **Transformation errors**: An error occurred while evaluating a pipe transformation expression, such as
  division by zero.

To resolve, inspect the error message in `error_metadata:error_message` and the column name in
`error_metadata:error_source` that caused the ingestion error. Parse the payload with
`PARSE_JSON(error_data:$1)`, correct the data, and reinsert it into the target table.

#### json-invalid

A syntactically invalid JSON object was ingested.

To resolve, inspect the error message in `error_metadata:error_message`, which contains details about
the syntax error. Correct the payload stored in `error_data:$1`, and reinsert it into the target table.

#### binary-base64

Invalid UTF-8 data was ingested. The error payload is stored in the error table as a base64-encoded binary string.

This error type typically indicates a format mismatch or encoding error in the upstream data source.

To resolve, examine the data source and the data formats and encodings it produces. Decode the payload stored
in `error_data:$1` with the [BASE64_DECODE_STRING](../../sql-reference/functions/base64_decode_string.md)
function to inspect the raw bytes and identify incorrect UTF-8 sequences.

## Error recovery workflow

The following example demonstrates how to query errors, analyze them, and reinsert corrected data.

### Query recent errors

```sqlexample
SELECT
    timestamp,
    error_code,
    error_metadata:error_message::STRING AS error_message,
    error_metadata:details:channel_name::STRING AS channel,
    error_metadata:details:pipe_name::STRING AS pipe,
    error_metadata:details:error_data_content_type::STRING AS content_type,
    error_data:"$1"::STRING AS raw_payload
FROM ERROR_TABLE(my_streaming_table)
WHERE error_metadata:service = 'snowpipe_streaming'
  AND timestamp >= DATEADD(hour, -1, CURRENT_TIMESTAMP())
ORDER BY timestamp DESC;
```

### Analyze error distribution

```sqlexample
SELECT
    error_code,
    error_metadata:error_message::STRING AS error_message,
    COUNT(*) AS error_count
FROM ERROR_TABLE(my_streaming_table)
WHERE error_metadata:service = 'snowpipe_streaming'
  AND timestamp >= DATEADD(hour, -24, CURRENT_TIMESTAMP())
GROUP BY 1, 2
ORDER BY error_count DESC;
```

### Fix and reinsert recoverable errors

For errors with valid JSON payloads, you can parse, correct, and reinsert the data:

```sqlexample
INSERT INTO my_streaming_table (col1, col2, col3)
SELECT
    TRY_CAST(PARSE_JSON(error_data:"$1"):col1 AS NUMBER),
    PARSE_JSON(error_data:"$1"):col2::STRING,
    TRY_CAST(PARSE_JSON(error_data:"$1"):col3 AS TIMESTAMP)
FROM ERROR_TABLE(my_streaming_table)
WHERE error_metadata:service = 'snowpipe_streaming'
  AND error_metadata:details:error_data_content_type = 'json'
  AND timestamp >= DATEADD(hour, -24, CURRENT_TIMESTAMP());
```

After successfully reprocessing errors, you can truncate the error table:

```sqlexample
TRUNCATE ERROR_TABLE(my_streaming_table);
```

## Billing

Snowpipe Streaming ingestion is billed at the standard Snowpipe Streaming rate. Turning on error logging
doesn’t change your ingestion costs. There are no additional ingestion charges for routing failed rows to the
error table.

Snowflake charges for data stored in the error table at the standard storage rate, the same as any other table.
The error table stores the raw payload and error metadata for each failed row.

For more information about Snowpipe Streaming costs, see
[Snowpipe Streaming high-performance architecture: Understand your costs](snowpipe-streaming-high-performance-cost.md).

## Limitations

* Error tables capture errors that occur during server-side data processing (parsing and transformation).
  Errors from other stages (SDK validation, API failures, and other server-side asynchronous errors) aren’t captured in error tables.
  Monitor server-side asynchronous errors using [getChannelStatus()](snowpipe-streaming-high-performance-error-handling.md).
* A high failure rate of incoming rows can increase processing latency due to overhead of storing error information.
* Payloads larger than 128 MB are truncated. The `error_data_truncated` field indicates when truncation occurred.
* Error tables are available only for the Snowpipe Streaming high-performance architecture. For the classic
  architecture, error handling is managed client-side through the SDK.

---
title: Error messages
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/error-messages.md
section: User Guide
---

# Error messages

Client connectivity error messages can signal various underlying causes located in the network path between a host and a Snowflake endpoint, including any possible proxies, security appliances, load balancers, DNS servers, and so on. You can find common error messages and their potential causes and resolutions for the following clients:

## JDBC errors

|  |  |
| --- | --- |
| JDBC error 1 | **Error(s)**  ```output Cannot connect: connection refused: Java::NetSnowflakeClientJdbc::SnowflakeSQLException: JDBC driver encountered communication error. Message: Exception encountered for HTTP request: Connection reset. ```  **Root cause**: This error has various underlying causes, located in the network path between the host you’re trying to connect from, and the Snowflake endpoint, including any possible proxies, security appliances, load balancers and such.  **Resolution scenario**: [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 2 | **Error(s)**  ```output JDBC driver encountered communication error. Message: Exception encountered for HTTP request:  sun.security.validator.ValidatorException: No trusted certificate found.  OR  javax.net.ssl.SSLHandshakeException: No trusted certificate found  OR  'SSL peer certificate or SSH remote key was not OK' ```  **Root cause**: The issue is likely caused by a proxy or security appliance performing an SSL inspection.  On rare occasions, usually with older installations of Java, the same symptom can also occur when there’s no SSL inspection but the cloud provider changed one of the intermediary certificate authorities to another (well-known) authority, which is not yet present in the truststore.  **Resolution scenario**: [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 3 | **Error(s)**  ```output JDBC driver encountered a communication error. Message: Exception encountered for an HTTP request: Network is unreachable (Connect Failed) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 4 | **Error(s)**  ```output JDBC driver encountered communication error. Message: Exception encountered for HTTP request: <SERVICE_ENDPOINT>: nodename nor servname provided, or not known. ```  **Root cause**: See [DNS configuration issues](common-issues.md).  **Resolution scenario**: [DNS configuration issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 5 | **Error(s)**  ```output WARNING!!! Using fail-open to connect. Driver is connecting to an HTTPS endpoint without OCSP based Certificate Revocation checking as it could not obtain a valid OCSP Response to use from the CA OCSP responder. Details: {"cacheEnabled":true,"ocspReqBase64":null,"ocspMode":"FAIL_OPEN","sfcPeerHost":"<SERVICE_ENDPOINT>","ocspResponderURL":null,"cacheHit":true,"eventType":"OCSPValidationError","certId":"<OBFUSCATED>"} ```  **Root cause**: See [OCSP and port 80 issues](common-issues.md).  **Resolution scenario**: [OCSP and port 80 issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 6 | **Error(s)**  ```output JDBC driver internal error: Max retry reached for the download of #chunk0 (Total chunks:<x>) retry=<y>, error=net.snowflake.client.jdbc.SnowflakeSQLException: JDBC driver encountered communication error. Message: Error encountered when downloading a result chunk: ```  **Root cause**: See [Fetching large query result sets failures](common-issues.md).  **Resolution scenario**: [Fetching large query result sets failures](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 7 | **Error(s)**  ```output JDBC driver encountered communication error. Message: Exception encountered for HTTP request: Failed to find the root CA ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 8 | **Error(s)**  ```output net.snowflake.client.jdbc.internal.apache.http.impl.execchain.RetryExec execute INFO: I/O exception (java.net.SocketException) caught when processing request to {s}->https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443: Broken pipe (Write failed) ```  **Root cause**:  The client driver tried to send data over a connection (pipe) which it believes to be up, which particular connection is already closed down on the remote end, just the client driver was not aware of this.   * a simple(r) scenario for this error is when there is an idle timeout configured on a proxy or security appliance between the client driver and Snowflake, which when expires, terminates the connection without notifying the parties * oftentimes, troubleshooting the true underlying root cause of what exactly, and why is tearing down the connections between the client driver and Snowflake can be a complex endeavor where details are out of scope for this documentation   **Resolution scenario**:  You can configure a TTL inside the JDBC driver which will gracefully close the connections from the client side sooner than they would be torn down by a remote idle timeout; preventing the issue. Setting is available from JDBC driver version 3.12.17; and from 3.13.30 there’s a default (1 minute) already configured.  For more information, see [I/O error: Connection reset](../../developer-guide/jdbc/jdbc-using.md). |

|  |  |
| --- | --- |
| JDBC error 9 | **Error(s)**  ```output JDBC driver encountered communication error. Message: Exception encountered for HTTP request: Remote host terminated the handshake ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 10 | **Error(s)**  ```output net.snowflake.client.jdbc.SnowflakeSQLLoggedException: JDBC driver encountered IO error. Message: Encountered exception during upload: null. ```  **Root cause**: The client driver has issues accessing the cloud storage associated with your Snowflake account, during an upload operation. This is caused by a misconfiguration on a proxy / security appliance sitting on the network path between the client driver and the cloud storage.  **Resolution scenario**: Although the direction of the traffic is the opposite, see [Fetching large query result sets failures](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 11 | **Error(s)**  ```output JDBC driver encountered communication error. Message: Exception encountered for HTTP request: Certificate for [<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>] doesn't match any of the subject alternative names: [*.us-west-2.snowflakecomputing.com, *.us-west-2.aws.snowflakecomputing.com, *.global.snowflakecomputing.com, *.snowflakecomputing.com, *.prod1.us-west-2.aws.snowflakecomputing.com, *.prod2.us-west-2.aws.snowflakecomputing.com]. ```  **Root cause**: What this error message means: The client driver is trying to connect to a Snowflake account (or cloud storage) located in AWS US WEST, which is also the default cloud region. The connection is not successful, because the certificate seen by the client driver is not a match for the hostname in the request.  Most likely causes include:   * If your Snowflake account is not in AWS US WEST: most common issue is a misconfiguration in the account part in the JDBC driver connection string. * If your Snowflake account is indeed in AWS US WEST: likely cause could be a proxy / security appliance performing SSL inspection.   **Resolution scenario**:   * For the first cause, please either use the regionless notation in the account field of the configuration, such as myorg-test, myorg-prod, etc. Alternatively, if you want to use the locator notation, make sure to use the correct one as indicated in the [Configuring a client, driver, library, or third-party application to connect to Snowflake](../gen-conn-config.md) documentation. For example, an account in AWS EU Frankfurt would be `xy12345.eu-central-1`. * For the second cause, see [Fetching large query result sets failures](common-issues.md). |

|  |  |
| --- | --- |
| JDBC error 12 | Error(s)  ```output I/O exception (net.snowflake.client.jdbc.internal.apache.http.NoHttpResponseException) caught when processing request to {s}->https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>].snowflakecomputing.com:443: The target server failed to respond ```  **Root cause**:  The client driver did not receive a timely response to the request sent to the remote endpoint.  Most likely causes include:   * If the issue is persistent, it is likely an actual connectivity problem. * If the issue is intermittent and specifically; if NoHttpResponseException happens very quickly (milliseconds) after the request has been sent, this error indicates that the TCP session between the client driver and the server is down, just the driver did not know about it. This latter often happens if an intermediary proxy/load balancer tears down the session between the client and the server, without letting any of the parties know.   **Resolution scenario**:   * For persistent errors and where the NoHttpResponseException happens after a longer wait following the request, please follow the [Troubleshooting steps](troubleshooting-steps.md). * For occasions where this exception is intermittent and is thrown very quickly after the client driver sends the request, between versions 3.12.17 and 3.13.30 you have a configuration option net.snowflake.jdbc.ttl to ensure idle connections are closed and thus prevent the intermediary node (such as `loadbalancer`) tearing it down unexpectedly, without telling the clients on the other ends. For more information, see [I/O error: Connection reset](../../developer-guide/jdbc/jdbc-using.md).   From JDBC driver version 3.13.30 and onwards; you still have this configuration option but usually it’s not necessary to change it, as it now has a default value of 1 minute idle timeout (60 seconds).  In both scenarios, the JDBC driver should automatically retry sending the failed request per its retry strategy, without needing any user intervention. |

## ODBC errors

|  |  |
| --- | --- |
| ODBC error 1 | **Error(s)**  ```output 'OLE DB or ODBC error: [DataSource.Error] ERROR [HY000] [Snowflake][Snowflake] (25) Result download worker error: Worker error: [Snowflake][Snowflake] (4) REST request for URL <>.... :  CURLerror (curl_easy_perform() failed) - code=60 msg='SSL peer certificate or SSH remote key was not OK' osCode=9 osMsg='Bad file descriptor'. . '.* ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 2 | **Error(s)**  ```output Error: nanodbc/nanodbc.cpp:1135: 01S00: [Snowflake][Snowflake] (4) REST request for URL *** failed: CURLerror (curl_easy_perform() failed) - code=60 msg='SSL peer certificate or SSH remote key was not OK'. ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 3 | **Error(s)**  ```output 'SSL peer certificate or SSH remote key was not OK' ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 4 | **Error(s)**  ```output SSL certificate problem: self signed certificate in certificate chain. Please check for SSL interception proxy in your network. ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 5 | **Error(s)**  ```output CURLerror (curl _easy_perform failed) - code=35 msg='SSL connect error' osCode=10054 osMsg='Unknown error'. ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 6 | **Error(s)**  ```output 'Empty reply from server' (CURLerror (curl_easy_perform() failed) - code=52 msg='Server returned nothing (no header..) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 7 | **Error(s)**  ```output ERROR 5052 Simba::ODBC::Connection::SQLDriverConnectW: [Snowflake][Snowflake] (4) REST request for URL https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443/session/v1/login-request?requestId=<OBFUSCATED>&request_guid=<OBFUSCATED>&databaseName=<OBFUSCATED>&schemaName=<OBFUSCATED>&warehouse=<OBFUSCATED>failed: CURLerror (curl_easy_perform() failed) - code=35 msg='SSL connect error'. ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 8 | **Error(s)**  ```output ERROR 710 Simba::ODBC::Statement::SQLFetchScroll: [Snowflake][Snowflake] (25) Result download worker error: Worker error: [Snowflake][Snowflake] (4) REST request for URL https://<STAGE>/<OBFUSCATED>/results/<OBFUSCATED>_0/main/data_0_0_1?x-amz-server-side-encryption-customer-algorithm=<OBFUSCATED>&response-content-encoding=gzip&AWSAccessKeyId=<OBFUSCATED>&Expires=<OBFUSCATED>&Signature=<OBFUSCATED> failed: CURLerror (curl_easy_perform() failed) - code=52 msg='Server returned nothing (no headers, no data)'. ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 9 | **Error(s)**  ```output [Snowflake][Snowflake] (6) Assertion failure: error_in_response_json ```  **Root cause**: There are multiple factors that can lead to this error.  **Resolution scenario**: Try [Common connectivity issues and resolutions](common-issues.md) and perform the [Troubleshooting steps](troubleshooting-steps.md). |

|  |  |
| --- | --- |
| ODBC error 10 | **Error(s)**  ```output WARN 9594 sf::RestRequest::httpPerform: Got CURL(0000015547C0CC10) error: Failed to connect to <PROXY_HOST> port 80: Timed out when fetching data from https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443/session/v1/login-request?requestId=<OBFUSCATED>&request_guid=<OBFUSCATED>. Status code: 11, curl error code: 28 ```  **Root cause**:  The client driver was unable to perform the login operation for the given user, due to the request timing out. (curl error code 28 = CURLE_OPERATION_TIMEDOUT).  This is likely due to misconfiguration on one or more devices (proxy / security appliance) on the network path between the client driver and Snowflake.  **Resolution scenario**:  Please follow the [Troubleshooting steps](troubleshooting-steps.md) and work with your sysadmin/network admin to ensure all Snowflake endpoints are reachable from the host you’re running the client driver from. |

|  |  |
| --- | --- |
| ODBC error 11 | **Error(s)**  ```output ERROR [HY000] [Microsoft][Snowflake] (4) REST request for URL https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443/session/v1/login-request?requestId=<OBFUSCATED>&request_guid=<OBFUSCATED> failed: CURLerror (curl_easy_perform() failed) - code=6 msg='Couldn't resolve host name'. ```  **Root cause**: See [DNS configuration issues](common-issues.md).  **Resolution scenario**: See [DNS configuration issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 12 | **Error(s)**  ```output ERROR [HY000] [Snowflake][Snowflake] (4) REST request for URL https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443/session/v1/login-request?requestId=<OBFUSCATED>&request_guid=<OBFUSCATED> failed: CURLerror (curl_easy_perform() failed) - code=5 msg='Couldn't resolve proxy name' osCode=9 osMsg='Bad file descriptor'. ```  **Root cause**: See [DNS configuration issues](common-issues.md).  **Resolution scenario**: See [DNS configuration issues](common-issues.md). |

|  |  |
| --- | --- |
| ODBC error 13 | **Error(s)**  ```output [Snowflake][Snowflake] (25) Result download worker error: Worker error: [Snowflake][Snowflake] (4) REST request for URL https://<STAGE>/results/<OBFUSCATED>_02Fmain2Fdata_0_0_8?sv=<OBFUSCATED>&spr=https&se=<OBFUSCATED>&sr=b&sp=r&sig=<OBFUSCATED>&rsce=gzip failed: CURLerror (curl_easy_perform() failed) - code=42 msg='Operation was aborted by an application callback'. ```  **Root cause**: See [Fetching large query result sets failures](common-issues.md).  **Resolution scenario**: See [Fetching large query result sets failures](common-issues.md). |

## Snowflake Connector for Python and SnowSQL errors

|  |  |
| --- | --- |
| Python error 1 | **Error(s)**  ```output SSL validation failed for https://<STAGE>/?accelerate [SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed (_ssl.c:852) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 2 | **Error(s)**  ```output SSLError: HTTPSConnectionPool(host='<STAGE>', port=443): Max retries exceeded with url: /<OBFUSCATED>/results/<OBFUSCATED>_0/main/data_0_0_1?x-amz-server-side-encryption-customer-algorithm=<OBFUSCATED>&response-content-encoding=gzip&AWSAccessKeyId=<OBFUSCATED>&Expires=<OBFUSCATED>&Signature=<OBFUSCATED> (Caused by SSLError(SSLError("bad handshake: Error([('SSL routines', 'tls_process_server_certificate', 'certificate verify failed')])"))) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 3 | **Error(s)**  ```output (Caused by ProxyError('Cannot connect to proxy.', OSError('Tunnel connection failed: 407 Request rejected by proxy'))) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 4 | **Error(s)**  ```output 250001 (n/a): Could not connect to Snowflake backend after 0 attempt(s).Aborting ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 5 | **Error(s)**  ```output snowflake.connector.network.RetryRequest: HTTP 403: Forbidden ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 6 | **Error(s)**  ```output 250003 (n/a): Failed to get the response. Hanging? method: post, url: https://[<SNOWFLAKE_DEPLOYMENT>|<SNOWFLAKE_DEPLOYMENT_REGIONLESS>|<CLIENT_FAILOVER>]:443/session/authenticator-request?request_guid=<OBFUSCATED> ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

|  |  |
| --- | --- |
| Python error 7 | **Error(s)**  ```output Retrying (Retry(total=0, connect=None, read=None, redirect=None)) after connection broken by 'ProtocolError('Connection aborted.', RemoteDisconnected ('Remote end closed connection without response'))' ```  **Root cause**:  What this error message means: The client driver was able to connect to the remote end and sent a HTTP request to it, but when attempting to read the response, no data was read from it, indicating that something on the remote end closed the connection.  The most likely cause is a persistent RemoteDisconnected error, which suggests misconfiguration on one or more proxy/security appliances between the client driver and the Snowflake endpoint.  **Resolution scenario**: Please follow the [Troubleshooting steps](troubleshooting-steps.md) and make sure all Snowflake endpoints are allowed on any intermediary proxy or security appliances you might have. |

|  |  |
| --- | --- |
| Python error 8 | **Error(s)**  ```output HTTPSConnectionPool(host='<STAGE>', port=443): Max retries exceeded with url: /<OBFUSCATED>/results/<OBFUSCATED>_0/main/data_0_0_1?x-amz-server-side-encryption-customer-algorithm=<OBFUSCATED>&response-content-encoding=gzip&X-Amz-Algorithm=<OBFUSCATED>&X-Amz-Date=<OBFUSCATED>&X-Amz-SignedHeaders=<OBFUSCATED>&X-Amz-Expires=<OBFUSCATED>&X-Amz-Credential=<OBFUSCATED>&X-Amz-Signature=<OBFUSCATED> (Caused by SSLError(SSLError("bad handshake: SysCallError(-1, 'Unexpected EOF')"))) ```  **Root cause**: See [Firewall or proxy SSL inspection issues](common-issues.md).  **Resolution scenario**: See [Firewall or proxy SSL inspection issues](common-issues.md). |

If the resolution steps do not resolve the issue, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for further assistance.

---
title: Error notifications for replication and failover groups
source: https://docs.snowflake.com/en/user-guide/account-replication-error-notifications.md
section: User Guide
---

# Error notifications for replication and failover groups

You can receive error notifications for refresh operation failures by setting a notification integration for a primary replication
or failover group.

## Error notifications for refresh operation failures

When you enable error notifications for a replication or failover group, a notification is sent via the designated email, cloud messaging
service, or the webhook when a refresh operation fails.

Notifications includes the following information:

* Source and target account names.
* Source and target regions (and region group, if applicable).
* Primary and secondary replication or failover group name.
* Timestamp when the error occurred.
* Error code and message.
* Source and target login URL.

## Error notifications and failover

Notifications are enabled on the primary replication or failover group and sent using a notification integration. The
notification integration is not required to be replicated to the target account. In the case of failover, if the notification
integration has been replicated, or there is an existing notification integration with the same name, in the newly promoted
source account, error notifications continue to be sent.

If the notification integration is not available, error notifications are not sent for refresh operation failures.

## Prerequisite: Notification integration for error notifications

A notification integration is required to send error notifications. The notification integration must be one of the
following types to send email notifications on refresh operation failures:

TYPE = EMAIL:
:   The email notification integration must have at least one verified email address in the DEFAULT_RECIPIENTS list.

    For more information about creating an email notification with a default list of recipients, see
    [Specify a default list of recipients and a default subject line](notifications/email-notifications.md).

TYPE = QUEUE:
:   You can use a notification integration that is configured to push notifications to a messaging service for any of the
    cloud providers supported by Snowflake. You must set the notification integration TYPE parameter to QUEUE and the DIRECTION
    parameter to OUTBOUND.

    For more information, see [Sending notifications to cloud provider queues (Amazon SNS, Google Cloud PubSub, and Azure Event Grid)](notifications/queue-notifications.md).

TYPE = WEBHOOK:
:   You can use a notification integration that is configured to push notifications to a webhook for any of the external systems
    supported by Snowflake. Set the notification integration TYPE parameter to WEBHOOK. You may also need to create a secret
    (if required by the external system).

    For more information, see [Sending webhook notifications](notifications/webhook-notifications.md).

### Create a notification integration (TYPE = EMAIL)

To create an email notification integration named `my_notification_int` with email address `first.last@example.com`, follow
these steps:

1. Ensure that the email address `first.last@example.com` [has been verified](notifications/email-notifications.md).
2. Create the notification integration by executing the [CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-email.md) command. For example:

   ```sqlexample
   CREATE NOTIFICATION INTEGRATION my_notification_int
     TYPE = EMAIL
     ENABLED = TRUE
     DEFAULT_RECIPIENTS = ('first.last@example.com');
   ```

### Create a notification integration (TYPE = QUEUE)

To create a notification integration for pushing notifications to a cloud provider queue, follow the instructions provided for the
currently supported cloud provider queues:

* [Creating a notification integration to send notifications to an Amazon SNS topic](notifications/creating-notification-integration-amazon-sns.md)
* [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](notifications/creating-notification-integration-azure-event-grid.md)
* [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](notifications/creating-notification-integration-google-pubsub.md)

### Create a notification integration (TYPE = WEBHOOK)

To create a notification integration for pushing notifications to an external system webhook, follow the instructions provided for
the currently supported external system webhooks:

* Creating a [Slack secret](notifications/webhook-notifications.md) and [Slack notification integration](notifications/webhook-notifications.md)
* Creating a [Microsoft Teams secret](notifications/webhook-notifications.md) and [Microsoft Teams notification integration](notifications/webhook-notifications.md)
* Creating a [PagerDuty secret](notifications/webhook-notifications.md) and [PagerDuty notification integration](notifications/webhook-notifications.md)

> **Important:**
>
> The webhook notification integration must specify the WEBHOOK_BODY_TEMPLATE parameter with `SNOWFLAKE_WEBHOOK_MESSAGE`
> as a placeholder value. When the notification is sent, the placeholder is replaced with the contents of the replication
> error notification, as described in Error notifications for refresh operation failures.
>
> The format for specifying WEBHOOK_BODY_TEMPLATE depends on the external system:
>
> * For Slack or Microsoft Teams, WEBHOOK_BODY_TEMPLATE utilizes the following single-value JSON object
>   format as its value:
>
>   ```sqljson
>   WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
>   ```
> * For PagerDuty, WEBHOOK_BODY_TEMPLATE utilizes a multi-value JSON object as its value, but with the following differences
>   from a standard PagerDuty notification integration:
>
>   + Within the `payload` key, the `summary` key is not used to specify `SNOWFLAKE_WEBHOOK_MESSAGE`.
>   + Instead, an additional `custom_details` key is used to specify `SNOWFLAKE_WEBHOOK_MESSAGE`.
>
>   For example:
>
>   ```sqljson
>   WEBHOOK_BODY_TEMPLATE='{
>     "routing_key": "SNOWFLAKE_WEBHOOK_SECRET",
>     "event_action": "trigger",
>     "payload": {
>       "summary": "Snowflake replication failure",
>       "source": "Snowflake monitoring",
>       "severity": "INFO",
>       "custom_details": {
>         "message": "SNOWFLAKE_WEBHOOK_MESSAGE"
>       }
>     }
>   }'
>   ```

## Add an error notification for a replication or failover group

To enable error notifications for an existing replication/failover group, use the [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md)
or [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command to set the ERROR_INTEGRATION parameter.

For example, add notification integration `my_notification_int` to failover group `my_fg`. The following statement must
be executed from the source account:

```sqlexample
ALTER FAILOVER GROUP my_fg SET
  ERROR_INTEGRATION = my_notification_int;
```

To create a replication/failover group and enable error notifications, use the [CREATE REPLICATION GROUP](../sql-reference/sql/create-replication-group.md)
or [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) command and set the ERROR_INTEGRATION parameter.

For example, to create failover group `my_fg` to enable replication and failover of databases `db1`, `db2` to accounts
`myaccount2` and `myaccount2` in organization `myorg`, execute the following statement in the source account to
create a primary failover group:

```sqlexample
CREATE FAILOVER GROUP my_fg
  OBJECT_TYPES = DATABASES
  ALLOWED_DATABASES = db1, db2
  ALLOWED_ACCOUNTS = myorg.myaccount2, myorg.myaccount3
  REPLICATION_SCHEDULE = '10 MINUTE'
  ERROR_INTEGRATION = my_notification_int;
```

> **Note:**
>
> If the replication schedule for a replication or failover group is set to a high frequency, for example one minute,
> error notifications for the same issue are sent for every scheduled refresh operation.

---
title: Estimating Frequent Values
source: https://docs.snowflake.com/en/user-guide/querying-approximate-frequent-values.md
section: User Guide
---

# Estimating Frequent Values

Snowflake uses the Space-Saving algorithm, a space and time efficient way of estimating approximate frequent values in data sets.

## Overview

Snowflake provides an implementation of the Space-Saving algorithm presented in [Efficient Computation of Frequent and Top-k Elements in Data Streams](https://www.cs.ucsb.edu/research/tech-reports/2005-23) by Metwally, Agrawal and Abbadi. It is implemented through the [APPROX_TOP_K](../sql-reference/functions/approx_top_k.md) family of functions.

Additionally, the [APPROX_TOP_K_COMBINE](../sql-reference/functions/approx_top_k_combine.md) function utilizes the [parallel Space-Saving algorithm](https://arxiv.org/abs/1401.0702) outlined by Cafaro, Pulimeno and Tempesta.

The percentage of error for the algorithm depends heavily on how skewed the data is, and the number of counters used in the algorithm. As data becomes more skewed, or more counters are used, the output
will be more accurate.

## SQL Functions

The following [Aggregate functions](../sql-reference/functions-aggregation.md) are provided for using Space-Saving to estimate frequent values:

* [APPROX_TOP_K](../sql-reference/functions/approx_top_k.md): Returns an approximation of frequent values in the input.
* [APPROX_TOP_K_ACCUMULATE](../sql-reference/functions/approx_top_k_accumulate.md): Skips the final estimation step and returns the Space-Saving state at the end of an aggregation.
* [APPROX_TOP_K_COMBINE](../sql-reference/functions/approx_top_k_combine.md): Combines (i.e. merges) input states into a single output state.
* [APPROX_TOP_K_ESTIMATE](../sql-reference/functions/approx_top_k_estimate.md): Computes a cardinality estimate of a Space-Saving state produced by APPROX_TOP_K_ACCUMULATE and APPROX_TOP_K_COMBINE.

## Implementation Details

Each counter in our implementation tracks an item and its frequency. Notably, our implementation does not track the epsilon values of counters, as they are only useful for giving guarantees about the
output of the algorithm, they are not used for the algorithm itself.

The maximum number of counters is set to 100 thousand. In this case, there are 100 thousand counters stored in memory, but only a fraction of these are stored in an exported state.

The maximum number of `k` is 100 thousand. This value is automatically reduced if all the values cannot fit in the output.

In most cases, the runtime of our implementation does not depend on the number of counters. Our implementation ensures the number of counters does not have a noticeable effect on the runtime of the
algorithm.

Each counter in each aggregation state uses a constant amount of memory overhead of around 100 bytes. Thus, if an aggregation uses `c` counters and there are `g` aggregation groups, the
aggregation will use `c * g * 100B` of memory, plus memory to store the values. If this memory exceeds the total memory budget, memory is spilled to disk. This is far less memory than the
exact version would use, especially when there a large number of unique values.

---
title: Estimating Percentile Values
source: https://docs.snowflake.com/en/user-guide/querying-approximate-percentile-values.md
section: User Guide
---

# Estimating Percentile Values

Snowflake uses an improved version of the t-Digest algorithm, a space and time efficient way of estimating approximate percentile
values in data sets.

## Overview

Snowflake provides an improved version of an implementation of the
[t-Digest algorithm papers](https://github.com/tdunning/t-digest/tree/master/docs/t-digest-paper) by Dunning and Ertl.
It has been implemented through the
[APPROX_PERCENTILE](../sql-reference/functions/approx_percentile.md) family of functions.

As documented, the algorithm has a constant relative error. Note that the algorithm has substantial empirical support, but no rigorous proof of any accuracy guarantees.

## SQL Functions

The following [Aggregate functions](../sql-reference/functions-aggregation.md) are provided for using t-Digest to approximate percentile values:

* [APPROX_PERCENTILE](../sql-reference/functions/approx_percentile.md): Returns an approximation of the desired percentile value.
* [APPROX_PERCENTILE_ACCUMULATE](../sql-reference/functions/approx_percentile_accumulate.md): Skips the final estimation step and, instead, returns the intermediate t-Digest state at the end of an aggregation.
* [APPROX_PERCENTILE_COMBINE](../sql-reference/functions/approx_percentile_combine.md): Combines (i.e. merges) multiple input states into a single output state.
* [APPROX_PERCENTILE_ESTIMATE](../sql-reference/functions/approx_percentile_estimate.md): Computes a percentile estimate of a t-Digest state produced by APPROX_PERCENTILE_ACCUMULATE or APPROX_PERCENTILE_COMBINE.

## Implementation Details

* The estimation uses a constant amount of space regardless of the size of the input.
* The t-Digest state is independent from the percentile value. This enables calculating the t-Digest state once, and then querying the state for multiple percentile values.

---
title: Estimating Similarity of Two or More Sets
source: https://docs.snowflake.com/en/user-guide/querying-approximate-similarity.md
section: User Guide
---

# Estimating Similarity of Two or More Sets

Snowflake uses MinHash for estimating the approximate similarity between two or more data sets. The MinHash scheme compares sets without computing the intersection or union of the sets, which enables
efficient and effective estimation.

## Overview

Typically, the Jaccard similarity coefficient (or index) is used to compare the similarity between two sets. For two sets, `A` and `B`, the Jaccard index is defined to be the ratio of the
size of their intersection and the size of their union:

> `J(A,B) = (A ∩ B) / (A ∪ B)`

However, this calculation can consume significant resources and time and, therefore, is not ideal for large data sets.

In contrast, the goal of the MinHash scheme is to estimate `J(A,B)` quickly, without computing the intersection or union.

## SQL Functions

The following [Aggregate functions](../sql-reference/functions-aggregation.md) are provided for estimating approximate similarity using MinHash:

* [MINHASH](../sql-reference/functions/minhash.md): Returns a MinHash state containing a MinHash array of length *k* (input argument).
* [MINHASH_COMBINE](../sql-reference/functions/minhash_combine.md): Combines two (or more) input MinHash states into a single output MinHash state.
* [APPROXIMATE_SIMILARITY](../sql-reference/functions/approximate_similarity.md) (or [APPROXIMATE_JACCARD_INDEX](../sql-reference/functions/approximate_jaccard_index.md)): Returns an estimation of the similarity (Jaccard index) of input sets based on
  their MinHash states.

## Implementation Details

As detailed in [MinHash](https://en.wikipedia.org/wiki/MinHash) (in Wikipedia):

> “Let `H` be a hash function that maps the members of `A` and `B` to distinct integer values and, for any set `S`, define `H_min(S)` to be the minimal member of `S`
> with respect to `H`, i.e. the member `s` of `S` with the minimum value of `H(s)`, as expressed in the following equation:
>
> > `H_min(S) = argmin_{s in S} (H(s))`
>
> If we apply `H_min` to both `A` and `B`, we will get the same value exactly when the element of the union `A ∪ B` with minimum hash value lies in the intersection `A ∩ B`. The probability of this being true is the above ratio, therefore:
>
> > `Pr[H_min(A) = H_min(B)] = J(A,B)`
>
> Namely, assuming randomly chosen sets `A` and `B`, the probability that `H_min(A) = H_min(B)` holds is equal to `J(A,B)`. In other words, if `X` is the random variable
> that is 1 when `H_min(A) = H_min(B)` and 0 otherwise, then `X` is an unbiased estimator of `J(A,B)`. Note that `X` has a too large variance to be a good estimator for the
> Jaccard index on its own (since it is always 0 or 1).
>
> The MinHash scheme reduces this variance by averaging together several variables constructed in the same way using `k` number of different hash functions.”

In order to achieve this, the [MINHASH](../sql-reference/functions/minhash.md) function initially creates `k` number of different hash functions and applies them to every element of each input set, retaining
the minimum of each one, to produce a MinHash array (also called a MinHash *state*) for each set. More specifically, for `i = 0 to k-1`, the entry `i` of the MinHash array for set
`A` (shown by `MinHash_A`) corresponds to the minimum value of hash function `H_i` applied to every element of set `A`.

Finally, an approximation for the similarity of the two sets `A` and `B` is calculated as:

> `J_apprx(A,B) = (# of entries MinHash_A and MinHash_B agree on) / k`

## Examples

In the following example, we show how this scheme and the corresponding functions can be used in order to approximate the similarity of two sets of elements.

First, create two sample tables and insert some sample data:

> ```sqlexample
> CREATE OR REPLACE TABLE mhtab1(c1 NUMBER,c2 DOUBLE,c3 TEXT,c4 DATE);
> CREATE OR REPLACE TABLE mhtab2(c1 NUMBER,c2 DOUBLE,c3 TEXT,c4 DATE);
>
> INSERT INTO mhtab1 VALUES
>     (1, 1.1, 'item 1', to_date('2016-11-30')),
>     (2, 2.31, 'item 2', to_date('2016-11-30')),
>     (3, 1.1, 'item 3', to_date('2016-11-29')),
>     (4, 44.4, 'item 4', to_date('2016-11-30'));
>
> INSERT INTO mhtab2 VALUES
>     (1, 1.1, 'item 1', to_date('2016-11-30')),
>     (2, 2.31, 'item 2', to_date('2016-11-30')),
>     (3, 1.1, 'item 3', to_date('2016-11-29')),
>     (4, 44.4, 'item 4', to_date('2016-11-30')),
>     (6, 34.23, 'item 6', to_date('2016-11-29'));
> ```

Then, approximate the similarity of the two sets (tables `mhtab1` and `mhtab2`) using their MinHash states:

> ```sqlexample
> SELECT APPROXIMATE_SIMILARITY(mh) FROM
>     ((SELECT MINHASH(100, *) AS mh FROM mhtab1)
>     UNION ALL
>     (SELECT MINHASH(100, *) AS mh FROM mhtab2));
>
> +----------------------------+
> | APPROXIMATE_SIMILARITY(MH) |
> |----------------------------|
> |                       0.79 |
> +----------------------------+
> ```

The similarity index of these two tables is approximated as 0.79, as opposed to the exact value 0.8 (i.e., 4/5).

---
title: Estimating the Number of Distinct Values
source: https://docs.snowflake.com/en/user-guide/querying-approximate-cardinality.md
section: User Guide
---

# Estimating the Number of Distinct Values

Snowflake uses HyperLogLog to estimate the approximate number of distinct values in a data set. HyperLogLog is a state-of-the-art cardinality estimation algorithm, capable of estimating distinct
cardinalities of trillions of rows with an average relative error of a few percent.

HyperLogLog can be used in place of [COUNT(DISTINCT …)](../sql-reference/functions/count.md) in situations where estimating cardinality is acceptable.

## Overview

Snowflake provides a bias-corrected implementation of the HyperLogLog algorithm presented in [HyperLogLog: the analysis of a near-optimal cardinality estimation algorithm](http://algo.inria.fr/flajolet/Publications/FlFuGaMe07.pdf) by Flajolet et al.

We recommend using HyperLogLog whenever the input is potentially large and an approximate result is acceptable. The average relative error of our HyperLogLog implementation is 1.62338% (i.e. the
average relative difference to the corresponding [COUNT(DISTINCT …)](../sql-reference/functions/count.md) result).

## SQL Functions

The following [Aggregate functions](../sql-reference/functions-aggregation.md) are provided for estimating cardinality using HyperLogLog:

* [HLL](../sql-reference/functions/hll.md): Returns an approximation of the distinct cardinality of the input.
* [HLL_ACCUMULATE](../sql-reference/functions/hll_accumulate.md): Skips the final estimation step and returns the HyperLogLog state at the end of an aggregation.
* [HLL_COMBINE](../sql-reference/functions/hll_combine.md): Combines (i.e. merges) input states into a single output state.
* [HLL_ESTIMATE](../sql-reference/functions/hll_estimate.md): Computes a cardinality estimate of a HyperLogLog state produced by HLL_ACCUMULATE and HLL_COMBINE.
* [HLL_EXPORT](../sql-reference/functions/hll_export.md): Converts HyperLogLog states from BINARY format to an OBJECT (which can then be printed and exported as JSON).
* [HLL_IMPORT](../sql-reference/functions/hll_import.md): Converts HyperLogLog states from OBJECT format to BINARY format.

## Implementation Details

Our implementation hashes input rows to 64-bit values, of which the upper 12 bits or “precision” (as referred to in the HyperLogLog algorithm paper; see above document link for details) are used to
partition input values into so-called sub-streams. This yields an average relative error of:

> `sqrt(3*ln(2)-1)/sqrt(2^precision) = 0.0162338 = 1.62338%`

In other words, for a query where [COUNT(DISTINCT …)](../sql-reference/functions/count.md) would return a result of `1,000,000`, HyperLogLog typically returns a result in the range of
`983,767` to `1,016,234`.

For each sub-stream, HyperLogLog maintains the maximum leading-zero count (between 0 and 52 for 64-bit values at precision = 12). The most straight-forward representation of this state is a simple byte
array, one byte for each of the `2^12 = 4096` sub-streams. Our implementation indeed requires at most 4096 Byte (`2^precision = 2^12 = 4096`) of memory per aggregation group. Technically, only
6 bits (rather than 8 bits) are required per sub-stream, but we trade some space efficiency for computational efficiency.

For small input cardinalities, most of the sub-streams will never be hit. So rather than allocating an entire block of 4096 Byte per aggregation group up-front, our implementation uses a space-optimized
“sparse” representation of this state whenever beneficial. Consequently, the memory cost of HyperLogLog can be substantially lower than 4096 Byte per aggregation group (down to about 32 Byte per
aggregation group). This allows cardinality estimation over many aggregation groups (millions or even billions, as determined by the GROUP BY or OVER clause of the query), using orders of magnitude less
memory and CPU time than a corresponding [COUNT(DISTINCT …)](../sql-reference/functions/count.md) query.

In the (rare) case where an extremely large input table and many aggregation groups cause HyperLogLog to exceed its total memory budget, Snowflake is still able to spill to temp space and perform
recursive aggregation, as with any other aggregation function.

## Exported State Format

The state of the HyperLogLog algorithm can be exported and imported (or reimported) using the [HLL_EXPORT](../sql-reference/functions/hll_export.md) and [HLL_IMPORT](../sql-reference/functions/hll_import.md) functions,
respectively. The exported state is of type OBJECT and contains the following fields.

### Dense Format

`version`:
:   Version number of the HyperLogLog implementation.

`precision`:
:   Number of hashed value bits to use to select sub-streams. Currently fixed to 12.

`dense`:
:   An array of integers, each containing the maximum leading-zero count + 1 for the corresponding sub-stream. 0 indicates that the corresponding sub-stream has not been hit yet. Legal values
    are in the range of 0 to 53. The corresponding sub-stream index is given by the element position in the array.

For example:

> ```sqljson
> {
>   "version" : 3,
>   "precision" : 12,
>   "dense" : [3,3,3,3,5,3,4,3,5,6,2,4,4,7,5,6,6,3,2,2,3,2,4,5,5,5,2,5,5,3,6,1,4,2,2,4,4,5,2,5,...,4,6,3]
> }
> ```

### Sparse Format

`version`:
:   Version number of the HyperLogLog implementation.

`precision`:
:   Number of hashed value bits to use to select sub-streams. Currently fixed to 12.

`sparse`:
:   `indices`: An array of integers, each containing a sub-stream index (base 0). Legal values are in the range of 0 to 4095.

    `maxLzCounts`: An array of integers, each containing the maximum leading-zero count + 1 for the corresponding sub-stream. 0 indicates that the corresponding sub-stream has not been hit yet.
    :   Legal values are in the range of 0 to 53. The sub-stream for a given leading-zero count is given by the corresponding element in the `indices` array.

    The `indices` and `maxLzCounts` arrays must have the same length. The [HLL_IMPORT](../sql-reference/functions/hll_import.md) function also checks that sub-stream indices are in the valid range, and
    that there are no duplicate sub-stream indices. The `indices` array need not be sorted. The leading-zero counts are not validated. Invalid values will not cause query failures, but will lead
    to undefined results for [HLL_ESTIMATE](../sql-reference/functions/hll_estimate.md).

For example:

> ```sqljson
> {
>   "version" : 3,
>   "precision" : 12,
>   "sparse" : {
>     "indices": [1131,1241,1256,1864,2579,2699,3730],
>     "maxLzCounts":[2,4,2,1,3,2,1]
>   }
> }
> ```

## Examples

**Environment set up:**

> ```sqlexample
> USE WAREHOUSE dontdrop;
> USE DATABASE stressdb;
> USE SCHEMA bdb_5nodes;
>
> SELECT COUNT(*) FROM uservisits;
>
> -----------+
>  COUNT(*)  |
> -----------+
>  751754869 |
> -----------+
> ```

**Step 1:**

Create a table that contains the calendar date (year/month/day) and the HLL structure. We use [HLL_EXPORT](../sql-reference/functions/hll_export.md) to store the binary structure as a text object:

> ```sqlexample
> CREATE OR REPLACE TABLE daily_uniques
> AS
> SELECT
>  visitdate,
>  hll_export(hll_accumulate(sourceip)) AS hll_sourceip
> FROM uservisits
> GROUP BY visitdate;
> ```

**Step 2:**

We can calculate the unique IPs by month by aggregating each day’s HLL structure from Step 1. We use [HLL_IMPORT](../sql-reference/functions/hll_import.md) to transform the text to the binary structure, then
[HLL_COMBINE](../sql-reference/functions/hll_combine.md) to combine multiple HLL structures into a single structure, then [HLL_ESTIMATE](../sql-reference/functions/hll_estimate.md) to compute the number of distinct values:

> ```sqlexample
> SELECT
>   EXTRACT(year FROM visitdate) AS visit_year,
>   EXTRACT(month FROM visitdate) AS visit_month,
>   hll_estimate(hll_combine(hll_import(hll_sourceip))) AS distinct_ips
> FROM daily_uniques
> WHERE visitdate BETWEEN '2000-01-01' AND '2000-12-31'
> GROUP BY 1,2
> ORDER BY 1,2;
>
> ------------+-------------+--------------+
>  VISIT_YEAR | VISIT_MONTH | DISTINCT_IPS |
> ------------+-------------+--------------+
>        2000 |           1 |      1515168 |
>        2000 |           2 |      1410289 |
>        2000 |           3 |      1491997 |
>        2000 |           4 |      1460837 |
>        2000 |           5 |      1546647 |
>        2000 |           6 |      1485599 |
>        2000 |           7 |      1522643 |
>        2000 |           8 |      1492831 |
>        2000 |           9 |      1488507 |
>        2000 |          10 |      1553201 |
>        2000 |          11 |      1461140 |
>        2000 |          12 |      1515772 |
> ------------+-------------+--------------+
>
> Elapsed 1.3s
> ```

**Compare:**

We compare the use of the aggregation using the HLL functions to HLL on the detail level data. In this case, `HLL()` is equivalent to the `HLL_ESTIMATE(HLL_COMBINE(HLL_IMPORT()))` from
Step 2:

> ```sqlexample
> SELECT
>   EXTRACT(year FROM visitdate) AS visit_year,
>   EXTRACT(month FROM visitdate) AS visit_month,
>   hll(sourceip) AS distinct_ips
> FROM uservisits
> WHERE visitdate BETWEEN '2000-01-01' AND '2000-12-31'
> GROUP BY 1,2
> ORDER BY 1,2;
>
> ------------+-------------+--------------+
>  VISIT_YEAR | VISIT_MONTH | DISTINCT_IPS |
> ------------+-------------+--------------+
>        2000 |           1 |      1515168 |
>        2000 |           2 |      1410289 |
>        2000 |           3 |      1491997 |
>        2000 |           4 |      1460837 |
>        2000 |           5 |      1546647 |
>        2000 |           6 |      1485599 |
>        2000 |           7 |      1522643 |
>        2000 |           8 |      1492831 |
>        2000 |           9 |      1488507 |
>        2000 |          10 |      1553201 |
>        2000 |          11 |      1461140 |
>        2000 |          12 |      1515772 |
> ------------+-------------+--------------+
>
> Elapsed 2m 29s
> ```

As you can see, aggregation of the HLL structures is significantly faster than aggregation over the base data, e.g. 1.3 seconds vs 149 seconds in this small example, which represents a 100x decrease
in query time.

---
title: Evaluate cost for hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-cost.md
section: User Guide
---

# Evaluate cost for hybrid tables

When you use hybrid tables, your account is charged based on two modes of consumption:

* **Hybrid table storage**: Cost for storage of hybrid tables depends on the
  amount of data that you are storing. Storage cost is based on a flat monthly rate per gigabyte (GB).
  See Table 3(b) in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf), which covers unit pricing for hybrid
  table storage.

  Note that hybrid table storage *for the row-store copy of the data* is more expensive than traditional
  Snowflake storage. The copy of the current data in the column store (object storage) is not billed.

  Historical time travel data is billed at standard storage prices.
* **Virtual warehouse compute**: Queries against hybrid tables are executed
  through virtual warehouses. The consumption rate of a warehouse is the same
  for querying hybrid tables as it is for standard tables.
  See [Virtual warehouse credit usage](cost-understanding-compute.md).

## Monitoring storage consumption for hybrid tables

You can view storage usage for hybrid tables and monitor consumption of hybrid table storage credits by querying the following views and functions:

* [STORAGE_USAGE view](../sql-reference/account-usage/storage_usage.md) (STORAGE_BYTES and HYBRID_TABLE_STORAGE_BYTES columns).
* DATABASE_STORAGE_USAGE_HISTORY (AVERAGE_HYBRID_TABLE_STORAGE_BYTES and AVERAGE_DATABASE_BYTES columns):

  + Account Usage [DATABASE_STORAGE_USAGE_HISTORY view](../sql-reference/account-usage/database_storage_usage_history.md)
  + Organization Usage [DATABASE_STORAGE_USAGE_HISTORY view](../sql-reference/organization-usage/database_storage_usage_history.md)
  + Information Schema [DATABASE_STORAGE_USAGE_HISTORY](../sql-reference/functions/database_storage_usage_history.md) function
* [HYBRID_TABLES view](../sql-reference/account-usage/hybrid_tables.md) (data per specific hybrid table in the BYTES column).
* [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md): Monitor virtual warehouse compute resources used during specific queries that are
  executed against hybrid tables. See [Monitor workloads](tables-hybrid-monitor-workload.md).

## Hybrid table storage for Time Travel data

Consumption for hybrid table storage takes into account the data that is retained by [Time Travel](data-time-travel.md).
Data retained by Time Travel is included in the following storage metrics:

* STORAGE_BYTES column in the [STORAGE_USAGE view](../sql-reference/account-usage/storage_usage.md)
* AVERAGE_DATABASE_BYTES column in DATABASE_STORAGE_USAGE_HISTORY:

  + Account Usage [DATABASE_STORAGE_USAGE_HISTORY view](../sql-reference/account-usage/database_storage_usage_history.md)
  + Organization Usage [DATABASE_STORAGE_USAGE_HISTORY view](../sql-reference/organization-usage/database_storage_usage_history.md)
  + Information Schema [DATABASE_STORAGE_USAGE_HISTORY](../sql-reference/functions/database_storage_usage_history.md) function

Data retained by Time Travel is stored in object storage, not the row store, and is charged at the standard table rate,
not the higher hybrid table rate.

---
title: Event table monitoring and alerts for dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-monitor-event-table-alerts.md
section: User Guide
---

# Event table monitoring and alerts for dynamic tables

This topic discusses how to query an event table that provides information about your refresh status and how to set up alerts on new data in
an event table.

## Query an event table to monitor refreshes

When a dynamic table is refreshed, you can configure Snowflake to record an event that provides information about the status of the refresh
operation. The event is recorded in the [active event table](../developer-guide/logging-tracing/event-table-setting-up.md) associated with
the dynamic table.

For example, suppose that you have [associated an event table with a database](../developer-guide/logging-tracing/event-table-setting-up.md). When a
dynamic table in that database is refreshed, Snowflake records an event to that event table.

You can query the events logged in this active event table to monitor your dynamic table refreshes.

For example, to get the timestamp, dynamic table name, query ID, and error message for errors with dynamic tables in the database `my_db`,
do the following:

```sqlexample
SELECT
    timestamp,
    resource_attributes:"snow.executable.name"::VARCHAR AS dt_name,
    resource_attributes:"snow.query.id"::VARCHAR AS query_id,
    value:message::VARCHAR AS error
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
    resource_attributes:"snow.database.name" = 'MY_DB' AND
    value:state = 'FAILED'
  ORDER BY timestamp DESC;
```

```output
+-------------------------+------------------+--------------------------------------+---------------------------------------------------------------------------------+
| TIMESTAMP               | DT_NAME          | QUERY_ID                             | ERROR                                                                           |
|-------------------------+------------------+--------------------------------------+---------------------------------------------------------------------------------|
| 2025-02-17 21:40:45.444 | MY_DYNAMIC_TABLE | 01ba7614-0107-e56c-0000-a995024f304a | SQL compilation error:                                                          |
|                         |                  |                                      | Failure during expansion of view 'MY_DYNAMIC_TABLE': SQL compilation error:     |
|                         |                  |                                      | Object 'MY_DB.MY_SCHEMA.MY_BASE_TABLE' does not exist or not authorized.        |
+-------------------------+------------------+--------------------------------------+---------------------------------------------------------------------------------+
```

The following example retrieves all columns for upstream errors with dynamic tables in the schema `my_schema`:

```sqlexample
SELECT *
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
    resource_attributes:"snow.schema.name" = 'MY_SCHEMA' AND
    value:state = 'UPSTREAM_FAILURE'
  ORDER BY timestamp DESC;
```

```output
+-------------------------+-----------------+-------------------------+-------+----------+-------------------------------------------------+-------+------------------+-------------+-----------------------------+-------------------+-------------------------------+-----------+
| TIMESTAMP               | START_TIMESTAMP | OBSERVED_TIMESTAMP      | TRACE | RESOURCE | RESOURCE_ATTRIBUTES                             | SCOPE | SCOPE_ATTRIBUTES | RECORD_TYPE | RECORD                      | RECORD_ATTRIBUTES | VALUE                         | EXEMPLARS |
|-------------------------+-----------------+-------------------------+-------+----------+-------------------------------------------------+-------+------------------+-------------+-----------------------------+-------------------+-------------------------------+-----------|
| 2025-02-17 21:40:45.486 | NULL            | 2025-02-17 21:40:45.486 | NULL  | NULL     | {                                               | NULL  | NULL             | EVENT       | {                           | NULL              | {                             | NULL      |
|                         |                 |                         |       |          |   "snow.database.id": 49,                       |       |                  |             |   "name": "refresh.status", |                   |   "state": "UPSTREAM_FAILURE" |           |
|                         |                 |                         |       |          |   "snow.database.name": "MY_DB",                |       |                  |             |   "severity_text": "WARN"   |                   | }                             |           |
|                         |                 |                         |       |          |   "snow.executable.id": 487426,                 |       |                  |             | }                           |                   |                               |           |
|                         |                 |                         |       |          |   "snow.executable.name": "MY_DYNAMIC_TABLE_2", |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.executable.type": "DYNAMIC_TABLE",      |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.owner.id": 2601,                        |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.owner.name": "DATA_ADMIN",              |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.owner.type": "ROLE",                    |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.schema.id": 411,                        |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          |   "snow.schema.name": "MY_SCHEMA"               |       |                  |             |                             |                   |                               |           |
|                         |                 |                         |       |          | }                                               |       |                  |             |                             |                   |                               |           |
+-------------------------+-----------------+-------------------------+-------+----------+-------------------------------------------------+-------+------------------+-------------+-----------------------------+-------------------+-------------------------------+-----------+
```

For information about the role that you need to use to query the event table and the conditions that you can use to filter the results, see
Set up an alert on new data.

## Set up alerts on new data to monitor refreshes

As mentioned earlier, when a dynamic table is refreshed, an event is logged in the
event table to indicate whether the refresh succeeded or failed. You can set up an [alert on new data](alerts.md) to
monitor the event table. You can configure the alert to [send a notification](notifications/about-notifications.md) when a
refresh fails.

The next sections explain how to set up the event logging to capture the events, how to set up the alert, and how to interpret
the events recorded in the event table:

* Set the severity level of the events to capture
* Set up an alert on new data
* Information logged for dynamic table events

> **Note:**
>
> Logging events for dynamic tables incurs costs. See [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

### Set the severity level of the events to capture

> **Note:**
>
> If you do not set the severity level, no events will be captured.

To set up dynamic table events to be recorded to the event table,
[set the severity level of events](../developer-guide/logging-tracing/telemetry-levels.md) that you want captured in the event
table. Events are captured at the following levels:

* `ERROR`: Refresh failure events.
* `WARN`: Failures to refresh upstream dynamic tables and refresh failure events.
* `INFO`: Successful refresh events, failures to refresh upstream dynamic tables, and refresh failure events.

To set the level, set the [LOG_EVENT_LEVEL](../sql-reference/parameters.md) parameter for the account or object. You can set the level for:

* All objects in the account.
* All objects in a database or schema.
* A specific dynamic table.

For example:

* To capture ERROR-level dynamic table events for all supported objects in the account, execute
  [ALTER ACCOUNT SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-account.md):

  ```sqlexample
  ALTER ACCOUNT SET LOG_EVENT_LEVEL = ERROR;
  ```

  Setting `LOG_EVENT_LEVEL` at the account level applies to log events (record type EVENT) for supported workloads in the account, including dynamic tables. It does not replace [LOG_LEVEL](../sql-reference/parameters.md) for log messages from logging APIs. For more information, see [Parameters](../sql-reference/parameters.md).
* To capture INFO-level events for all supported objects in the database `my_db`, execute
  [ALTER DATABASE … SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-database.md):

  ```sqlexample
  ALTER DATABASE my_db SET LOG_EVENT_LEVEL = INFO;
  ```

  Similar to the case of setting the level on the account, setting the level on the database affects log events for supported object types in the database.
* To capture WARN-level events for the dynamic table `my_dynamic_table`, execute
  [ALTER DYNAMIC TABLE … SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-dynamic-table.md):

  ```sqlexample
  ALTER DYNAMIC TABLE my_dynamic_table SET LOG_EVENT_LEVEL = WARN;
  ```

### Set up an alert on new data

After you set the severity level for logging events, you can set up an alert on new data to monitor the event table for new events
that indicate a failure in a dynamic table refresh. An alert on new data is triggered when new rows in the event table are
inserted and meet the condition specified in the alert.

> **Note:**
>
> To create the alert on new data, you must use a role that has been granted the required privileges to query the event table.
>
> * If the alert condition queries the default event table ([SNOWFLAKE.TELEMETRY.EVENTS](../developer-guide/logging-tracing/event-table-setting-up.md))
>   or the predefined view ([SNOWFLAKE.TELEMETRY.EVENTS_VIEW view](../sql-reference/telemetry/events_view.md)),
>   see [Roles for access to the default event table and EVENTS_VIEW](../developer-guide/logging-tracing/event-table-setting-up.md).
>
>   To manage access to the EVENTS_VIEW view, see [Manage access to the EVENTS_VIEW view](../developer-guide/logging-tracing/event-table-setting-up.md).
> * If the alert condition queries a custom event table, see [Access control privileges for event tables](../developer-guide/logging-tracing/event-table-operations.md).
>
>   To manage access to a custom event table, see [Managing access to event table data](../developer-guide/logging-tracing/event-table-operations.md).

In the alert condition, to query for dynamic table events, select rows where
`resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE'`. To narrow down the list of events, you can filter on the
following columns:

* To restrict the results to dynamic tables in a specific database, use `resource_attributes:"snow.database.name"`.
* To return events where the refresh failed due to an error with the dynamic table, use `value:state = 'FAILED'`.
* To return events where the refresh failed due to an error with an upstream dynamic table, use
  `value:state = 'UPSTREAM_FAILURE'`.

For information on the values logged for a dynamic table event, see
Information logged for dynamic table events.

> **Note:**
>
> The `timestamp` column in the event table stores values in UTC. If you use a scheduled alert with a timestamp filter
> (for example, `timestamp > DATEADD('minute', -5, CURRENT_TIMESTAMP())`), convert the current timestamp to UTC to ensure
> accurate comparisons:
>
> ```sqlexample
> timestamp > DATEADD('minute', -5, CONVERT_TIMEZONE('UTC', CURRENT_TIMESTAMP()))
> ```

For example, the following statement creates an alert on new data that performs an action when refreshes fail for dynamic tables
in the database `my_db`. The example assumes that:

* Your active event table is the [default event table](../developer-guide/logging-tracing/event-table-setting-up.md) (SNOWFLAKE.TELEMETRY.EVENTS).
* You have [set up a webhook notification integration](notifications/webhook-notifications.md) for that Slack
  channel.

```sqlexample
CREATE ALERT my_alert_on_dt_refreshes
  IF( EXISTS(
    SELECT * FROM SNOWFLAKE.TELEMETRY.EVENT_TABLE
      WHERE resource_attributes:"snow.executable.type" = 'dynamic_table'
        AND resource_attributes:"snow.database.name" = 'my_db'
        AND record:"name" = 'refresh.status'
        AND record:"severity_text" = 'ERROR'
        AND value:"state" = 'FAILED'))
  THEN
    BEGIN
      LET result_str VARCHAR;
      (SELECT ARRAY_TO_STRING(ARRAY_AGG(name)::ARRAY, ',') INTO :result_str
         FROM (
           SELECT resource_attributes:"snow.executable.name"::VARCHAR name
             FROM TABLE(RESULT_SCAN(SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID()))
             LIMIT 10
         )
      );
      CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
        SNOWFLAKE.NOTIFICATION.TEXT_PLAIN(:result_str),
        '{"my_slack_integration": {}}'
      );
    END;
```

### Information logged for dynamic table events

When a dynamic table refreshes, an event is logged to the event table. The following sections describe the event table row that
represents the event:

* Event table column values
* Key-value pairs in the resource_attributes column
* Key-value pairs in the record column

#### Event table column values

When a dynamic table refreshes, a row with the following values is inserted into the event table.

> **Note:**
>
> If a column is not listed below, the column value is NULL for the event.

| Column | Data type | Description |
| --- | --- | --- |
| `timestamp` | TIMESTAMP_NTZ | The UTC timestamp when an event was created. |
| `observed_timestamp` | TIMESTAMP_NTZ | A UTC time used for logs. Currently, this is the same value that is in the `timestamp` column. |
| `resource_attributes` | OBJECT | Attributes that identify the dynamic table that was refreshed. |
| `record_type` | STRING | The event type, which is `EVENT` for dynamic table refreshes. |
| `record` | OBJECT | Details about the status of the dynamic table refresh. |
| `value` | VARIANT | The status of the dynamic table refresh and, if the refresh failed, the error message for the failure. |

#### Key-value pairs in the `resource_attributes` column

The `resource_attributes` column contains an [OBJECT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Attribute name | Attribute type | Description | Example |
| --- | --- | --- | --- |
| `snow.database.id` | INTEGER | The internal/system-generated identifier of the database containing the dynamic table. | `12345` |
| `snow.database.name` | VARCHAR | The name of the database containing the dynamic table. | `MY_DATABASE` |
| `snow.executable.id` | INTEGER | The internal/system-generated identifier of the dynamic table that was refreshed. | `12345` |
| `snow.executable.name` | VARCHAR | The name of the dynamic table that was refreshed. | `MY_DYNAMIC_TABLE` |
| `snow.executable.type` | VARCHAR | The type of the object. The value is `DYNAMIC_TABLE` for dynamic table events. | `DYNAMIC_TABLE` |
| `snow.owner.id` | INTEGER | The internal/system-generated identifier of the role with the OWNERSHIP privilege on the dynamic table. | `12345` |
| `snow.owner.name` | VARCHAR | The name of the role with the OWNERSHIP privilege on the dynamic table. | `MY_ROLE` |
| `snow.owner.type` | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. | `ROLE` |
| `snow.query.id` | VARCHAR | ID of the query that refreshed the dynamic table. | `01ba7614-0107-e56c-0000-a995024f304a` |
| `snow.schema.id` | INTEGER | The internal/system-generated identifier of the schema containing the dynamic table. | `12345` |
| `snow.schema.name` | VARCHAR | The name of the schema containing the dynamic table. | `MY_SCHEMA` |
| `snow.warehouse.id` | INTEGER | The internal/system-generated identifier of the warehouse used to refresh the dynamic table. | `12345` |
| `snow.warehouse.name` | VARCHAR | The name of the warehouse used to refresh the dynamic table. | `MY_WAREHOUSE` |

#### Key-value pairs in the `record` column

The `record` column contains an [OBJECT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| `name` | VARCHAR | The name of the event. The value is `refresh.status` for dynamic table refreshes. | `refresh.status` |
| `severity_text` | VARCHAR | The severity level of the event, which is one of the following values:   * `INFO`: The refresh succeeded. * `ERROR`: The refresh failed. * `WARN`: The refresh of an upstream dynamic table failed. | `INFO` |

#### Key-value pairs in the `value` column

The `value` column contains an [VARIANT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| `state` | VARCHAR | The state of the refresh, which can be one of the following values:   * `SUCCEEDED`: The refresh succeeded. * `FAILED`: The refresh failed. * `UPSTREAM_FAILURE`: The refresh failed due to a failure to refresh a dynamic table that this dynamic table depends on. | `SUCCEEDED` |
| `message` | VARCHAR | If the value in `state` is `FAILED`, this column includes the error message. | `SQL compilation error:\nFailure during expansion of view 'MY_DYNAMIC_TABLE': SQL compilation error:\nObject 'MY_DB.MY_SCHEMA.MY_BASE_TABLE' does not exist or not authorized.` |

## Query pipeline spans to trace refreshes

In addition to events, Snowflake can record pipeline
spans for dynamic table refreshes. Events and spans are two separate observability mechanisms:

* **Events** (controlled by [LOG_LEVEL](../sql-reference/parameters.md)) provide logs per-dynamic-table refresh,
  indicating whether each refresh succeeded or failed.
* **Spans** (controlled by [TRACE_LEVEL](../sql-reference/parameters.md)) provide richer pipeline-level
  observability, including correlated trace IDs across a pipeline, skip reasons, and dependency topology.

Spans capture additional states for which events are not emitted, including `SKIPPED` refreshes due to upstream
skips or refresh cycles where the scheduler skipped refreshing to minimize the lag of the dynamic table and
its consumers.

> **Note:**
>
> Recording spans for dynamic tables incurs costs. See [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

### Enable pipeline spans

To enable pipeline spans for dynamic table refreshes, set the TRACE_LEVEL parameter to `ALWAYS` at the
database or schema level:

```sqlexample
ALTER SCHEMA my_db.my_schema SET TRACE_LEVEL = 'ALWAYS';
```

You can also set this at the database level to capture spans for all dynamic tables in the database:

```sqlexample
ALTER DATABASE my_db SET TRACE_LEVEL = 'ALWAYS';
```

### Query span data

To query pipeline spans for dynamic table refreshes, filter for rows where `record_type = 'SPAN'` and
`record:"name" = 'table_refresh'`:

```sqlexample
SELECT
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record_attributes:"snow.dynamic_table.state_reason"::STRING AS state_reason,
    record_attributes:"snow.dynamic_table.data_timestamp"::STRING AS data_timestamp,
    trace:"trace_id"::STRING AS trace_id,
    trace:"span_id"::STRING AS span_id,
    record:"status":"code"::STRING AS status_code
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
  ORDER BY start_timestamp ASC;
```

#### Span attributes (`record_attributes`)

Each span row includes the following attributes in the `record_attributes` column, specific to dynamic
table refreshes:

| Attribute name | Type | Description |
| --- | --- | --- |
| `snow.dynamic_table.state` | STRING | The state of the refresh: `SUCCEEDED`, `FAILED`, or `SKIPPED`. |
| `snow.dynamic_table.state_reason` | STRING | Why the dynamic table was skipped or failed. NULL on success. Possible values:   * `QUERY_FAILURE`: The refresh query failed. * `UPSTREAM_FAILURE`: An upstream dynamic table failed to refresh. * `UPSTREAM_SKIP`: An upstream dynamic table was skipped. * `NOT_EFFECTIVE_TICK_TO_REFRESH`: The pipeline is already running behind schedule, skipping this   refresh operation to minimize the lag of this dynamic table and its consumers. |
| `snow.dynamic_table.data_timestamp` | STRING | The transactional timestamp when the refresh was evaluated. (This might be slightly before the actual time of the refresh.) All data in base objects that arrived before this timestamp is included in the dynamic table. |

> **Note:**
>
> Spans cover `SKIPPED` states (with reasons `UPSTREAM_SKIP` and `NOT_EFFECTIVE_TICK_TO_REFRESH`)
> for which events are not emitted. If you need visibility into skipped refreshes, use spans instead of events.

### Pipeline correlation with trace IDs and span links

A unique capability of spans is pipeline-level correlation. When a refresh cycle includes refresh operations
for multiple dynamic tables, all the resulting spans share the same `trace:"trace_id"`. This lets you
reconstruct the full set of refresh operations that occurred in a single refresh cycle.

Each span also includes a `record:"links"` array that lists the `span_id` of each upstream dependency.
For example, if `DT_B` depends on `DT_A`, then `DT_A`’s `span_id` appears in `DT_B`’s
`record:"links"`.

The `record:"status":"code"` field is `STATUS_CODE_OK` for successes and skips, and
`STATUS_CODE_ERROR` for failures.

For example, to correlate all dynamic table refresh operations in a single refresh cycle, query for spans
with the same `trace_id`:

```sqlexample
SELECT
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record:"links" AS upstream_links
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
    AND trace:"trace_id" = '<trace_id>'
  ORDER BY start_timestamp;
```

### Trace a pipeline refresh

This section walks through how to use pipeline spans to trace a refresh cycle end to end: finding the
relevant spans, retrieving the full pipeline, and diagnosing failures or skips.

#### Example pipeline scenario

Consider a linear pipeline of four dynamic tables:

```sql
DT1 --> DT2 --> DT3 --> DT4
```

In this example, `DT1` and `DT2` refresh successfully, but `DT3` fails due to a query error. Because
`DT3` failed, `DT4` is automatically skipped with the reason `UPSTREAM_FAILURE`.

The following steps show how to retrieve and interpret the pipeline spans for this scenario.

#### Step 1: Find the span for a dynamic table

To investigate a specific dynamic table’s refresh, query the event table for its most recent span. Filter
by database, schema, and dynamic table name to ensure you match the correct object:

```sqlexample
SELECT
    trace:"span_id"::STRING AS span_id,
    trace:"trace_id"::STRING AS trace_id,
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.data_timestamp"::STRING AS data_timestamp,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record_attributes:"snow.dynamic_table.state_reason"::STRING AS state_reason,
    resource_attributes:"snow.query.id"::STRING AS query_id,
    start_timestamp,
    timestamp AS end_timestamp
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
    AND resource_attributes:"snow.database.name" = 'MY_DB'
    AND resource_attributes:"snow.schema.name" = 'MY_SCHEMA'
    AND resource_attributes:"snow.executable.name" = 'DT3'
  ORDER BY start_timestamp DESC
  LIMIT 5;
```

```output
+----------+------------------+---------+-------------------------+-----------+--------------+--------------------------------------+-------------------------+-------------------------+
| SPAN_ID  | TRACE_ID         | DT_NAME | DATA_TIMESTAMP          | STATE     | STATE_REASON | QUERY_ID                             | START_TIMESTAMP          | END_TIMESTAMP           |
|----------+------------------+---------+-------------------------+-----------+--------------+--------------------------------------+-------------------------+-------------------------|
| a1b2c3d4 | 4f3e2d1c0b9a8877 | DT3     | 2026-02-13T10:00:00.000 | FAILED    | QUERY_FAILURE| 01ba7614-0107-e56c-0000-a995024f304a | 2026-02-13 10:02:01.000 | 2026-02-13 10:02:20.000 |
| e5f6a7b8 | 7a8b9c0d1e2f3344 | DT3     | 2026-02-13T09:55:00.000 | SUCCEEDED | NULL         | 01ba7614-0107-e56c-0000-a995024f2f9b | 2026-02-13 09:57:01.000 | 2026-02-13 09:57:18.000 |
+----------+------------------+---------+-------------------------+-----------+--------------+--------------------------------------+-------------------------+-------------------------+
```

The `trace_id` value identifies the refresh cycle. All dynamic table spans within a single pipeline refresh
share the same `trace_id`. Use this value in the next step to retrieve
all spans from the same refresh cycle.

#### Step 2: Retrieve the full pipeline

Query all spans that share the same `trace_id` to see every dynamic table in the refresh cycle.
Include `record:"links"` to capture the dependency graph and `DATEDIFF` to compute the duration of each
refresh operation:

```sqlexample
SELECT
    trace:"span_id"::STRING AS span_id,
    trace:"trace_id"::STRING AS trace_id,
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record_attributes:"snow.dynamic_table.state_reason"::STRING AS state_reason,
    resource_attributes:"snow.query.id"::STRING AS query_id,
    start_timestamp,
    timestamp AS end_timestamp,
    DATEDIFF('second', start_timestamp, timestamp) AS duration_sec,
    record:"links" AS upstream_links
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
    AND trace:"trace_id" = '4f3e2d1c0b9a8877'
  ORDER BY start_timestamp ASC;
```

```output
+----------+------------------+---------+-----------+-----------------+--------------------------------------+-------------------------+-------------------------+--------------+---------------------------------------------+
| SPAN_ID  | TRACE_ID         | DT_NAME | STATE     | STATE_REASON    | QUERY_ID                             | START_TIMESTAMP          | END_TIMESTAMP           | DURATION_SEC | UPSTREAM_LINKS                              |
|----------+------------------+---------+-----------+-----------------+--------------------------------------+-------------------------+-------------------------+--------------+---------------------------------------------|
| f1e2d3c4 | 4f3e2d1c0b9a8877 | DT1     | SUCCEEDED | NULL            | 01ba7614-0107-e56c-0000-a995024f3001 | 2026-02-13 10:01:00.000 | 2026-02-13 10:01:30.000 |           30 | []                                          |
| b5a6c7d8 | 4f3e2d1c0b9a8877 | DT2     | SUCCEEDED | NULL            | 01ba7614-0107-e56c-0000-a995024f3002 | 2026-02-13 10:01:31.000 | 2026-02-13 10:02:00.000 |           29 | [{"span_id": "f1e2d3c4", ...}]              |
| a1b2c3d4 | 4f3e2d1c0b9a8877 | DT3     | FAILED    | QUERY_FAILURE   | 01ba7614-0107-e56c-0000-a995024f304a | 2026-02-13 10:02:01.000 | 2026-02-13 10:02:20.000 |           19 | [{"span_id": "b5a6c7d8", ...}]              |
| c9d0e1f2 | 4f3e2d1c0b9a8877 | DT4     | SKIPPED   | UPSTREAM_FAILURE| NULL                                 | 2026-02-13 10:02:20.000 | 2026-02-13 10:02:20.000 |            0 | [{"span_id": "a1b2c3d4", ...}]              |
+----------+------------------+---------+-----------+-----------------+--------------------------------------+-------------------------+-------------------------+--------------+---------------------------------------------+
```

From this result, you can see the full picture of the refresh cycle:

* `DT1` and `DT2` succeeded (30 and 29 seconds respectively).
* `DT3` failed after 19 seconds due to a query failure.
* `DT4` was skipped immediately (represented by a zero-duration span) because its upstream dependency failed.
* The `UPSTREAM_LINKS` column shows each dynamic table’s direct dependencies by `span_id`.

#### Step 3: Identify the root cause of a failure or skip

When a dynamic table is skipped or fails, you can trace its upstream dependencies through the span links to
find the root cause. This query resolves the span links for a specific dynamic table back to the other
spans in the pipeline:

```sqlexample
WITH pipeline AS (
  SELECT
    trace:"span_id"::STRING AS span_id,
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record_attributes:"snow.dynamic_table.state_reason"::STRING AS state_reason,
    resource_attributes:"snow.query.id"::STRING AS query_id,
    record:"links" AS upstream_links
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
    AND record_attributes:"snow.dynamic_table.data_timestamp" = '2026-02-13T10:00:00.000'
),
target_links AS (
  SELECT f.value:"span_id"::STRING AS upstream_span_id
  FROM pipeline,
  LATERAL FLATTEN(input => upstream_links) f
  WHERE dt_name = 'DT4'
)
SELECT
  p.dt_name AS upstream_dt,
  p.state AS upstream_state,
  p.state_reason AS upstream_reason,
  p.query_id AS upstream_query_id
FROM target_links tl
JOIN pipeline p ON tl.upstream_span_id = p.span_id;
```

```output
+-------------+----------------+-----------------+--------------------------------------+
| UPSTREAM_DT | UPSTREAM_STATE | UPSTREAM_REASON | UPSTREAM_QUERY_ID                    |
|-------------+----------------+-----------------+--------------------------------------|
| DT3         | FAILED         | QUERY_FAILURE   | 01ba7614-0107-e56c-0000-a995024f304a |
+-------------+----------------+-----------------+--------------------------------------+
```

In this example, `DT4` was skipped because its upstream dependency `DT3` failed with
`QUERY_FAILURE`. You can use the `query_id` to investigate the failed query further (for example,
by calling [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md) or
checking the [query history](../sql-reference/account-usage/query_history.md)).

For longer dependency chains, repeat the same pattern: replace the target dynamic table name to walk
further upstream until you reach a span with `state = 'FAILED'` and `state_reason = 'QUERY_FAILURE'`,
which is the root cause.

#### Find downstream impact of a failure

To find which dynamic tables were affected by a specific failure, reverse the span link lookup. This query
finds all dynamic tables whose `record:"links"` reference the failed dynamic table’s `span_id`:

```sqlexample
WITH pipeline AS (
  SELECT
    trace:"span_id"::STRING AS span_id,
    resource_attributes:"snow.executable.name"::STRING AS dt_name,
    record_attributes:"snow.dynamic_table.state"::STRING AS state,
    record_attributes:"snow.dynamic_table.state_reason"::STRING AS state_reason,
    record:"links" AS upstream_links
  FROM my_event_table
  WHERE record_type = 'SPAN'
    AND record:"name" = 'table_refresh'
    AND record_attributes:"snow.dynamic_table.data_timestamp" = '2026-02-13T10:00:00.000'
)
SELECT p.dt_name, p.state, p.state_reason
FROM pipeline p,
LATERAL FLATTEN(input => p.upstream_links) f
WHERE f.value:"span_id"::STRING = 'a1b2c3d4';
```

```output
+---------+---------+-----------------+
| DT_NAME | STATE   | STATE_REASON    |
|---------+---------+-----------------|
| DT4     | SKIPPED | UPSTREAM_FAILURE|
+---------+---------+-----------------+
```

This returns the direct dependents of the failed dynamic table. To find all transitively affected dynamic
tables, repeat the query with each dependent’s `span_id` to walk further downstream.

#### Use OpenTelemetry-compatible tools

Dynamic table pipeline spans follow the standard OpenTelemetry data model. Because all spans in a refresh
cycle share the same `trace:"trace_id"`, you can export them from the event table into
OpenTelemetry-compatible tools for visualization.

These tools can render the pipeline as a trace timeline, showing the duration and status of each dynamic
table’s refresh operation and the dependency relationships encoded in the span links.

---
title: Example of using SQL to create a semantic view
source: https://docs.snowflake.com/en/user-guide/views-semantic/example.md
section: User Guide
---

# Example of using SQL to create a semantic view

The following is a complete example of creating a [semantic view](overview.md).

The example uses the [TPC-H sample data](../sample-data-tpch.md) available in Snowflake. This dataset contains
tables that represent a simplified business scenario with customers, orders, and line items.

## Creating the semantic view

The following statements create the semantic view:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW tpch_analysis

  TABLES (
    region AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.REGION PRIMARY KEY (r_regionkey),
    nation AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.NATION PRIMARY KEY (n_nationkey),
    customer AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER PRIMARY KEY (c_custkey),
    orders AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS PRIMARY KEY (o_orderkey),
    lineitem AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.LINEITEM PRIMARY KEY (l_orderkey, l_linenumber),
    supplier AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.SUPPLIER PRIMARY KEY (s_suppkey)
  )

  RELATIONSHIPS (
    nation   (n_regionkey) REFERENCES region,
    customer (c_nationkey) REFERENCES nation,
    orders   (o_custkey)   REFERENCES customer,
    lineitem (l_orderkey)  REFERENCES orders,
    supplier (s_nationkey) REFERENCES nation
  )

  FACTS (
    region.r_name AS r_name,
    nation.n_name AS n_name,
    orders.o_orderkey AS o_orderkey,
    customer.c_customer_order_count AS COUNT(orders.o_orderkey),
    lineitem.line_item_id AS CONCAT(l_orderkey, '-', l_linenumber),
    orders.count_line_items AS COUNT(lineitem.line_item_id)
  )

  DIMENSIONS (
    nation.nation_name AS n_name,
    customer.customer_name AS c_name,
    customer.customer_region_name AS region.r_name,
    customer.customer_nation_name AS nation.n_name,
    customer.customer_market_segment AS c_mktsegment,
    customer.customer_country_code AS LEFT(c_phone, 2),
    orders.order_date AS orders.o_orderdate
  )

  METRICS (
    customer.customer_count AS COUNT(c_custkey),
    customer.customer_order_count AS SUM(c_customer_order_count),
    orders.order_count AS COUNT(o_orderkey),
    orders.order_average_value AS AVG(orders.o_totalprice),
    orders.average_line_items_per_order AS AVG(orders.count_line_items),
    supplier.supplier_count AS COUNT(s_suppkey)
  )
;
```

---
title: Excluding data from sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-auto-exclude.md
section: User Guide
---

# Excluding data from sensitive data classification

With sensitive data classification, Snowflake classifies data as sensitive at regular intervals without user intervention. You can
use settings and system tags to exclude certain data from this classification process.

For example, suppose a database `my_db` has three tables, `t1`, `t2`, and `t3`. By default, when you classify
`my_db`, all three tables are automatically classified. You can configure Snowflake to skip `t2` during classification so only
tables `t1` and `t3` are classified.

## Workflow

Excluding data from sensitive data classification is a two-step process:

1. Apply the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag to every object that you want
   excluded from sensitive data classification.
2. Enable the exclusion setting for tag-based sensitive data exclusion.

## Set tag on data objects

An [object tag](object-tagging/introduction.md) is an object that can be set on another object. Snowflake
provides a system-defined tag, SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION, that you can set on objects that you want excluded from
sensitive data classification. When the value of this tag is `TRUE`, then Snowflake skips the object during classification.

You can set the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag on a schema, table, or column to control which data is excluded from
sensitive data classification.

Exclude a schema
:   You can set the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag on a schema in the database to exclude the schema from the
    classification process. For example:

    ```sqlexample
    ALTER SCHEMA my_schema SET TAG SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION = 'TRUE';
    ```

Exclude a table
:   You can set the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag on a table in the database or schema to exclude the
    table from the classification process. For example:

    ```sqlexample
    ALTER TABLE my_table SET TAG SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION = 'TRUE';
    ```

Exclude a column
:   You can set the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag on a column so that Snowflake skips it when classifying the
    table. If you exclude a column, the classification result contains an empty value for the column, even if it contains sensitive data.

    For example, suppose you want to automatically classify all columns in a table except the column `employee_id`. You can run the
    [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) command to set the system-defined tag on the column:

    ```sqlexample
    ALTER TABLE my_table ALTER COLUMN employee_id
      SET TAG SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION = 'TRUE';
    ```

    When Snowflake automatically classifies data in the table, the `employee_id` field in the JSON result is empty.

For the access control requirements for setting the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag, see
Access control requirements.

## Enable the exclusion setting

Setting the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION tag on
objects has no effect until you enable the setting for tag-based sensitive data exclusion.

You can enable this setting using the Trust Center user interface or using SQL commands.

### Use the Trust Center to enable tag-based sensitive data exclusion

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](classify-ui-trust-center.md).
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Select the Settings tab.
5. Do one of the following:

   * If you are enabling the setting for an existing classification profile, find the profile and select  » Edit.
   * If you are setting up an advanced classification profile for the first time, select Create New.
6. Go through the classification settings until you get to the Define classification criteria page.
7. In the Exclusion criteria section, select Exclude SKIP_SENSITIVE_DATA_CLASSIFICATION tagged objects.

### Use SQL to enable tag-based sensitive data exclusion

A classification profile contains the settings that control how Snowflake automatically classifies data in a database. These
settings are specified using key-value pairs in an [OBJECT](../sql-reference/data-types-semistructured.md).

You must define the `enable_tag_based_sensitive_data_exclusion` key of the classification profile if you want data excluded from
sensitive data classification.

The following is an example of a classification profile that, when set on a database, excludes properly tagged objects from
sensitive data classification:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days': 0,
      'maximum_classification_validity_days': 30,
      'auto_tag': true,
      'enable_tag_based_sensitive_data_exclusion': true
    });
```

You can also execute the [SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION](../sql-reference/classes/classification_profile/methods/set_enable_tag_based_sensitive_data_exclusion.md) method to enable the setting for an existing classification profile.

## Access control requirements

By default, a user with the ability to enable or disable classification settings can set the
SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION system tag only on their own schemas and tables.

If you want a user to be able to set the system tag on all objects, not just the ones they own, run the following command:

```sqlexample
GRANT APPLY TAG ON ACCOUNT TO ROLE <classify_user>;
```

---
title: Explore and manage database objects in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-data.md
section: User Guide
---

# Explore and manage database objects in Snowsight

You can explore and manage your database objects in Snowsight using the *database object explorer*. The database object
explorer contains a hierarchical view of all databases in your account, the schemas for each database, and the objects contained
in each database and schema, organized by type.

To open the database object explorer:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Explore your database objects in the database object explorer.

You can only see objects on which your active role has been granted, at a minimum, the USAGE privilege.
For more information about object privileges, see [Access control privileges](security-access-control-privileges.md).

You can also explore database objects from the context of a worksheet. See [Refer to database object names in worksheets](ui-snowsight-query.md).

## Working with databases in Snowsight

When you select a database in the database object explorer, you can view details about the database.

You must have the relevant [database privileges](security-access-control-privileges.md) to access and manage the database in Snowsight.

After opening a database in Snowsight, you can do the following:

* Identify whether the database is a shared database.
* Review the source of the database, such as local, share, Snowflake Marketplace, a data exchange, or privately shared by a provider.
* Determine the owner role for the database.
* Identify when the database was created. You can hover over the time details to see the exact creation date and time.

### Manage a shared database in Snowsight

For a shared database, you can review the Source details to learn more about the sharing source:

* For a direct share, you can see the name of the share from which the database was created, and the provider account name.
* For a listing published on the Snowflake Marketplace or in a data exchange, you can see the name of the provider and the listing from which
  the database was created. Select the provider name to open the provider profile, or listing name to open the listing details on
  the Snowflake Marketplace or in the data exchange.
* For a private listing, you can see the name of the provider and the listing from which the database was created. To open the listing
  details, in the navigation menu, select Data sharing » Internal sharing » Shared with You, and then select the listing name.

You can perform the following basic management tasks for a shared database in Snowsight:

* To edit the database name or add a comment, select  » Edit.
* To drop the database, select  » Drop. This removes the database created from the share or listing.
* Review and manage privileges in the Privileges section of the Database Details tab.
  To manage privileges, see [Manage object privileges with Snowsight](security-access-control-configure.md).

### Manage a local database in Snowsight

You can perform the following basic management tasks for a database in Snowsight:

* To edit the database name or add a comment, select  » Edit.
* To drop the database, select  » Drop.
* To transfer ownership of the database to another role, select  » Transfer Ownership
* Review and manage privileges for the database in the Privileges section of the Database Details tab.
  To manage privileges, see [Manage object privileges with Snowsight](security-access-control-configure.md).
* To create a schema for the database, select + Schema. For more information, see [CREATE SCHEMA](../sql-reference/sql/create-schema.md).

For accounts using private connectivity, you can also select  » Enable Replication to enable
replication of the database to another account. For all other accounts, use a
[replication or failover group](account-replication-intro.md). For more information, see
[Create a replication or failover group using Snowsight](account-replication-config.md).

### Review the schemas in a database

To review the schemas in the database, select the Schemas tab. A table of schemas contained in the database appears.
On this tab, you can do the following:

* Search for a schema name.
* Review and sort by schema name, owner role, or date created.
* Manage the schema.
* Hover over the  to read a comment on the schema.

Select a schema in the table to open the Schema Details page. See Explore schema details in Snowsight.

## Explore schema details in Snowsight

To view a schema, in the navigation menu, select Catalog » Database Explorer, and then search for or browse to the database
schema. Select the schema to explore details about the schema, the objects contained in the schema,
and create objects in the schema.

You can work with schemas in SQL or in Snowsight.
For details about the available SQL commands for working with schemas, see [Database, schema, and share commands](../sql-reference/commands-database.md).

You must have the relevant [schema privileges](security-access-control-privileges.md)
to access and manage the database schema in Snowsight.

For each schema, you can view basic details for the objects contained in the schema. See Review and manage schema objects.

### Manage a schema in Snowsight

You can perform the following basic management tasks for a schema in Snowsight:

* To edit the schema name or add a comment, select  » Edit.
* To clone the schema, select  » Clone.
* To drop the schema, select  » Drop.
* To transfer ownership of the schema to another role, select  » Transfer Ownership.
* Review and manage privileges for the schema in the Privileges section of the Schema Details tab.
  To manage privileges, see [Manage object privileges with Snowsight](security-access-control-configure.md).

### Create schema objects in Snowsight

To create objects in a database schema using Snowsight, do the following:

> **Note:**
>
> You must use a role granted the relevant privileges to create objects in the schema.
> See [Schema privileges](security-access-control-privileges.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Locate and select the database schema in which you want to create an object.
4. On the schema details page, select Create and then select the object that you want to create.

   For most objects, a worksheet opens with template SQL to create the object you selected. Customize the SQL and create the object.

   * If you choose to create a table from a file, see [Create a new table using Snowsight](data-load-web-ui.md).
   * If you choose to create a stage, see [Staging files using Snowsight](data-load-local-file-system-stage-ui.md).

### Review and manage schema objects

For each type of database object contained in a database schema, you can select a tab and review, sort, and search a table of those objects.

* For Tables, review the name, type, classification, owner role, number of rows, bytes, and date created. You can also filter by
  the type of table.
* For Views, review the name, type, owner role, and date created. You can also filter by the type of view.
* For Semantic Views, review the name, type, owner role, and date created.
* For Stages, review the name, cloud and region for an external stage, storage integration associated with the stage, owner role,
  and date created.
* For File Formats, review the name, type, owner role, and date created.
* For Sequences, review the name, next value, interval, owner role, and date created.
* For Dynamic Tables, review the name, state, target lag, warehouse used, rows, owner role, and date created.
* For Streams, review the name, table name to which the stream is associated, owner role, and date created.
* For Tasks, review the name, state, schedule, condition, warehouse used, and owner role.
* For Pipes, review the name, notification channel, owner role, and date created.
* For Functions, review the name, arguments, and date created.
* For Procedures, review the name, arguments, and date created.

For any object, you can hover over the  to read the comment on the object.
If you have the relevant privileges for an object, you can also select  and manage the object.

To view details about an object, select the row for the object and open the object details page.

---
title: Exploring compute cost
source: https://docs.snowflake.com/en/user-guide/cost-exploring-compute.md
section: User Guide
---

# Exploring compute cost

Total compute cost consists of the overall use of:

* Virtual warehouses (user-managed compute resources)
* Serverless features such as Automatic Clustering and Snowpipe that use Snowflake-managed compute resources
* Cloud services layer of the Snowflake architecture
* vCPU usage for [Openflow BYOC cost and scaling considerations](data-integration/openflow/cost-byoc.md) and [Openflow Snowflake Deployment cost and scaling considerations](data-integration/openflow/cost-spcs.md).
  See [Openflow components](data-integration/openflow/about.md) for more information about Openflow components including runtimes.

This topic describes how to gain insight into historical compute costs using [Snowsight](ui-snowsight-gs.md), or by writing queries against views in
the [ACCOUNT_USAGE](../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schemas.
Snowsight allows you to quickly and easily obtain information about cost from a visual dashboard. Queries against the usage views
allow you to drill down into cost data and can help generate custom reports and dashboards.

If you need more information about how compute costs are incurred, refer to [Understanding compute cost](cost-understanding-compute.md).

> **Note:**
>
> The cloud services layer consumes credits, but not all of those credits are actually billed. Usage for cloud services is charged only if
> the daily consumption of cloud services exceeds 10% of the daily usage of virtual warehouses. Snowsight and a majority of views
> show the total number of credits consumed by warehouses, serverless features, and cloud services without accounting for this daily
> adjustment to cloud services.
>
> To determine how many credits were actually billed for compute costs, run queries against the
> [METERING_DAILY_HISTORY view](../sql-reference/account-usage/metering_daily_history.md).

## Viewing credit usage

All compute resources (virtual warehouses, serverless, cloud services) consume Snowflake credits. Users can use Snowsight to
view the overall cost of compute usage for any given day, week, or month.

To explore compute cost:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost and usage data](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an XS warehouse for this purpose.
5. Select Consumption.
6. Select Compute from the Usage Type drop-down.

For usage notes related to the Consumption page, see [Usage notes](cost-exploring-overall.md).

### Filter by tag

You can use tags to [attribute the cost](cost-attributing.md) of using resources to a logical
unit within your organization. A tag is a Snowflake object that can have one or more values associated with it. A user with the
appropriate privileges applies a tag/value pair to each resource that is used by a cost center or other logical unit (e.g. the Development
environment, a business unit, or business line). Once resources have been tagged, you can isolate costs based on a
specific tag/value pair, allowing you to attribute this cost to a specific logical unit.

To filter the Consumption dashboard to show costs associated with a specific tag/value combination:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost and usage data](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an XS warehouse for this purpose.
5. Select Consumption.
6. Select Compute from the Usage Type drop-down.
7. From the Tags drop-down, select the tag.
8. Select the value from the list of the tag’s values.
9. Select Apply.

For example, you can use the drop-down to select the `COST_CENTER` tag and the `SALES` value to show usage associated with resources
tagged with `COST_CENTER = SALES` while excluding all other usage from the dashboard.

You can also display all resources with a tag regardless of their tag value. Use the drop down to select a
tag, then choose All instead of a specific value.

### View consumption by type, service, or resource

When viewing the bar graph that displays compute history, you can filter the data By Type, By Service or By Resource.

> By Type:
> :   Separates resource consumption into compute (virtual warehouses and serverless resources) and cloud services. For the purpose
>     of this filter, cloud services is separated out from the other types of compute resources.
>
> By Service:
> :   Separates resource consumption into warehouse consumption and consumption by each serverless feature. For example,
>     WAREHOUSE_METERING represents credits consumed by warehouses while PIPE represents credits consumed by the serverless Snowpipe feature.
>     Cloud services compute is included in warehouse consumption.
>
> By Resource:
> :   Separates resource consumption by the Snowflake object that consumed credits. For example, each warehouse is represented,
>     as is every table that incurred serverless costs.

## Querying data for compute cost

Snowflake provides two schemas, [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and
[ACCOUNT_USAGE](../sql-reference/account-usage.md), that contain data related to usage and cost. The ORGANIZATION_USAGE schema provides
cost information for all of the accounts in the organization while the ACCOUNT_USAGE schema provides similar information for a single
account. Views in these schemas provide granular, analytics-ready usage data to build custom reports or dashboards.

Most views in the ORGANIZATION_USAGE and ACCOUNT_USAGE schemas contain the cost of compute resources in terms of
[credits](cost-understanding-compute.md) consumed. To explore compute cost in currency, rather than credits, write queries against the
[USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). This view converts credits consumed into cost in currency using the daily
price of a credit.

### General cost views

The following views contain information related to the compute costs of all Snowflake features. You can focus on a particular feature by filtering on the `service_type` column.

For additional views that focus on the cost of a specific feature, see Feature-specific cost views.

| View | Compute resource | Description | Schema |
| --- | --- | --- | --- |
| METERING_DAILY_HISTORY | Warehouses  Serverless  Cloud Services  Openflow runtimes | Credits consumed by all compute resources (warehouses, serverless, cloud services and Openflow) in a given day.  Can be used to determine whether cloud services compute costs were actually billed for a specific day (that is, cloud services credit consumption exceeded 10% of warehouse consumption). | [ORGANIZATION_USAGE](../sql-reference/organization-usage/metering_daily_history.md) [ACCOUNT_USAGE](../sql-reference/account-usage/metering_daily_history.md) |
| METERING_HISTORY | Warehouses  Serverless  Cloud Services  Openflow runtimes | Credits consumed by warehouses, cloud services, serverless, and Openflow features on an hourly basis. To see how many credits an individual warehouse is consuming, query the WAREHOUSE_METERING_HISTORY view. | [ACCOUNT_USAGE](../sql-reference/account-usage/metering_history.md) |
| [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) | Warehouses  Serverless  Cloud Services | Daily credit consumption by all compute resources along with the cost of that usage in the organization’s currency. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/usage_in_currency_daily.md) |

### Feature-specific cost views

The following views that are dedicated to the usage and cost information for a specific feature.

| View | Compute resource | Description |
| --- | --- | --- |
| APPLICATION_DAILY_USAGE_HISTORY | Warehouses  Serverless  Cloud Services | Daily credit usage for Snowflake Native Apps in an account within the last 365 days. |
| ARCHIVE_STORAGE_DATA_ RETRIEVAL_USAGE_HISTORY | Serverless | Bytes retrieved from archive storage for storage lifecycle policies. See [Billing for storage lifecycle policies](storage-management/storage-lifecycle-policies-billing.md) for more information. |
| AUTOMATIC_CLUSTERING_HISTORY | Serverless | Credits consumed by automatic clustering. |
| CATALOG_LINKED_DATABASE_ USAGE_HISTORY | Serverless | Credits consumed by catalog-linked databases. |
| CORTEX_AI_FUNCTIONS_USAGE_HISTORY | Serverless | Credits consumed by Cortex AI Functions. |
| CORTEX_AGENT_USAGE_HISTORY | Serverless | Credits consumed by Cortex Agents. |
| CORTEX_ANALYST_ USAGE_HISTORY | Serverless | Credits consumed by Cortex Analyst. |
| CORTEX_FINE_TUNING_ USAGE_HISTORY | Serverless | Credits consumed for Cortex Fine-tuning. |
| CORTEX_FUNCTIONS_ QUERY_USAGE_HISTORY | Serverless | Credits consumed to run queries that use Cortex LLM functions. |
| CORTEX_FUNCTIONS_ DOCUMENT_PROCESSING_USAGE_HISTORY | Serverless | Credits consumed to process documents with Document AI. |
| CORTEX_FUNCTIONS_ USAGE_HISTORY | Serverless | Credits consumed to call Cortex LLM functions. |
| CORTEX_REST_API_ USAGE_HISTORY | Serverless | Credits consumed by Cortex REST API calls. |
| CORTEX_SEARCH_DAILY_ USAGE_HISTORY | Serverless | Daily credits consumed for Cortex Search for serving and text embeddings |
| CORTEX_SEARCH_SERVING_ USAGE_HISTORY | Serverless | Credits consumed for Cortex Search serving |
| DATA_QUALITY_MONITORING_ USAGE_HISTORY | Serverless | Credits consumed to call scheduled DMFs and ingest results into an event table. |
| DATABASE_REPLICATION_USAGE_ HISTORY | Serverless | Credits consumed for database replication. |
| DOCUMENT_AI_ USAGE_HISTORY | Serverless | Credits consumed by Document AI. |
| HYBRID_TABLE_USAGE_HISTORY | Serverless | Credits consumed for Hybrid Table Requests resources. (As of March 1, 2026, hybrid table requests are no longer billed, and metering was disabled soon after this pricing change took effect. No new events are recorded in this view.) |
| LISTING_AUTO_FULFILLMENT_ REFRESH_DAILY | Warehouses | Credits used to refresh data fulfilled to other regions by Cross-Cloud Auto-Fulfillment. |
| LISTING_AUTO_FULFILLMENT_ USAGE_HISTORY | Warehouses | Estimated usage associated with fulfilling data products to other regions by using Cross-Cloud Auto-Fulfillment. Refer to the SERVICE_TYPE of REPLICATION. |
| MATERIALIZED_VIEW_REFRESH_ HISTORY | Serverless | Credits consumed the refreshing of materialized views. |
| OPENFLOW_USAGE_HISTORY | Openflow | Credits consumed by Openflow runtimes. This view is available in the ACCOUNT_USAGE schema only. |
| PIPE_USAGE_HISTORY | Serverless | Credits consumed by Snowpipe. |
| QUERY_ACCELERATION_HISTORY | Serverless | Credits consumed by the query acceleration service. |
| QUERY_ATTRIBUTION_HISTORY | Warehouses | Credits consumed [per query](cost-attributing.md) for warehouse usage. |
| REPLICATION_USAGE_HISTORY | Serverless | Credits consumed and number of bytes transferred during database replication. If possible, use the [DATABASE_REPLICATION_USAGE_HISTORY view](../sql-reference/account-usage/database_replication_usage_history.md) instead. |
| REPLICATION_GROUP_USAGE_ HISTORY | Serverless | Credits consumed and number of bytes transferred during replication for a specific replication group. |
| SEARCH_OPTIMIZATION_HISTORY | Serverless | Credits consumed by the search optimization service. |
| SERVERLESS_ALERT_HISTORY | Serverless | Credits consumed by serverless alerts. |
| SERVERLESS_TASK_HISTORY | Serverless | Credits consumed by serverless tasks. |
| SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY | Serverless | Credits consumed by Snowflake Intelligence. |
| SNOWPIPE_STREAMING_FILE_ MIGRATION_HISTORY | Serverless | Credits consumed by Snowpipe Streaming compute (does not include client costs). |
| WAREHOUSE_METERING_HISTORY | Warehouses  Cloud Services | Hourly credit usage of each warehouse, including the cloud services cost associated with using the warehouse. |

> **Note:**
>
> The views and table functions of the [Snowflake Information Schema](../sql-reference/info-schema.md) also provide usage data related to cost. Though
> the ACCOUNT_USAGE schema is preferred, the Information Schema can be faster in some circumstances.

### Example queries

The following queries drill-down into data in ACCOUNT_USAGE views to gain insight into compute costs.

> **Note:**
>
> Queries executed against views in the Account Usage schema can be modified to gain insight into cost for the entire organization by
> using the corresponding view in the Organization Usage schema. For example, both schemas include a WAREHOUSE_METERING_HISTORY view.

Click the name of a query below to see the full SQL example.

Compute for Warehouses:
:   * Query: Average hour-by-hour Snowflake spend (across all warehouses) over the past m days
    * Query: Credit consumption by warehouse over specific time period
    * Query: Warehouse usage over m-day average
    * [Query: Warehouse cost attribution by query tag](cost-attributing.md).
    * [Query: Warehouse cost attribution by user](cost-attributing.md).

Compute for Cloud Services:
:   * Query: Billed cloud services
    * Query: Total cloud services cost by type of query
    * Query: Cloud services cost for queries of a given type
    * Query: Warehouses with high cloud services usage
    * Query: Cloud services cost sorted by portion of query time

Compute for Automatic Clustering:
:   * Query: Automatic Clustering cost history (by day, by object)
    * Query: Automatic Clustering History & m-day average

Compute for Search Optimization:
:   * Query: Search Optimization cost history (by day, by object)
    * Query: Search Optimization History & m-day average

Compute for Materialized Views:
:   * Query: Materialized Views cost history (by day, by object)
    * Query: Materialized Views History & m-day average

Compute for Query Acceleration Service:
:   * Query: Query Acceleration Service cost by warehouse

Compute for Snowpipe:
:   * Query: Cumulative usage of data ingest (Snowpipe and “Copy”)
    * Query: Snowpipe cost history (by day, by object)
    * Query: Snowpipe History & m-day average

Compute and client costs for Snowpipe Streaming:
:   * Query: Snowpipe Streaming cost

Compute for Serverless Alerts:
:   * Query: Total serverless alert cost

Compute for Serverless Tasks:
:   * Query: Total serverless task cost

Compute for Replication:
:   * Query: Account replication cost
    * Query: Database replication cost history (by day, by object)
    * Query: Database replication History & m-day average

Compute for Partner Tools:
:   * Query: Credit consumption by partner tools

Compute for Hybrid Tables:
:   * Query: Credit consumption by hybrid tables

Compute for Cortex Agents:
:   * Query: Credit consumption by Cortex Agents

Compute for Cortex Analyst:
:   * Query: Credit consumption by Cortex Analyst

Compute for Cortex Fine-tuning:
:   * Query: Credit consumption by Cortex Fine-tuning

Compute for Cortex functions:
:   * Query: Credit consumption by Cortex functions
    * Query: Credit consumption by Cortex functions query

Compute for Cortex Search:
:   * Query: Daily credit consumption by Cortex Search
    * Query: Credit consumption by Cortex Search serving

Compute for Document AI:
:   * Query: Credit consumption by Document AI

Compute for Snowflake Intelligence:
:   * Query: Credit consumption by Snowflake Intelligence

Compute for Snowflake Notebooks:
:   * Query: Hourly credit consumption by notebooks
    * Query: Cost to run a specific notebook
    * Query: Total compute pool cost per notebook
    * Query: Identify users who ran a specific notebook

#### Compute for warehouses

Query: Average hour-by-hour Snowflake spend (across all warehouses) over the past m days
:   This query shows the total credit consumption on an hourly basis to help understand consumption trends (peaks, valleys) over the past m
    days. This helps identify times of day when there are spikes in consumption.

    ```sqlexample
    SELECT start_time,
      warehouse_name,
      credits_used_compute
    FROM snowflake.account_usage.warehouse_metering_history
    WHERE start_time >= DATEADD(day, -m, CURRENT_TIMESTAMP())
      AND warehouse_id > 0  -- Skip pseudo-VWs such as "CLOUD_SERVICES_ONLY"
    ORDER BY 1 DESC, 2;

    -- by hour
    SELECT DATE_PART('HOUR', start_time) AS start_hour,
      warehouse_name,
      AVG(credits_used_compute) AS credits_used_compute_avg
    FROM snowflake.account_usage.warehouse_metering_history
    WHERE start_time >= DATEADD(day, -m, CURRENT_TIMESTAMP())
      AND warehouse_id > 0  -- Skip pseudo-VWs such as "CLOUD_SERVICES_ONLY"
    GROUP BY 1, 2
    ORDER BY 1, 2;
    ```

Query: Credit consumption by warehouse over specific time period
:   This query shows the total credit consumption for each warehouse over a specific time period. This helps identify warehouses that are
    consuming more credits than others and specific warehouses that are consuming more credits than anticipated.

    ```sqlexample
    -- Credits used (all time = past year)
    SELECT warehouse_name,
      SUM(credits_used_compute) AS credits_used_compute_sum
    FROM snowflake.account_usage.warehouse_metering_history
    GROUP BY 1
    ORDER BY 2 DESC;

    -- Credits used (past N days/weeks/months)
    SELECT warehouse_name,
      SUM(credits_used_compute) AS credits_used_compute_sum
    FROM snowflake.account_usage.warehouse_metering_history
    WHERE start_time >= DATEADD(day, -m, CURRENT_TIMESTAMP())
    GROUP BY 1
    ORDER BY 2 DESC;
    ```

Query: Warehouse usage over m-day average
:   This query returns the daily average credit consumption grouped by week and warehouse. It can be used to identify anomalies in credit
    consumption for warehouses across weeks from the past year.

    ```sqlexample
    WITH cte_date_wh AS (
      SELECT TO_DATE(start_time) AS start_date,
        warehouse_name,
        SUM(credits_used) AS credits_used_date_wh
      FROM snowflake.account_usage.warehouse_metering_history
      GROUP BY start_date, warehouse_name
    )

    SELECT start_date,
      warehouse_name,
      credits_used_date_wh,
      AVG(credits_used_date_wh) OVER (PARTITION BY warehouse_name ORDER BY start_date ROWS m PRECEDING) AS credits_used_m_day_avg,
      100.0*((credits_used_date_wh / credits_used_m_day_avg) - 1) AS pct_over_to_m_day_average
    FROM cte_date_wh
      QUALIFY credits_used_date_wh > 100  -- Minimum N=100 credits
        AND pct_over_to_m_day_average >= 0.5  -- Minimum 50% increase over past m day average
    ORDER BY pct_over_to_m_day_average DESC;
    ```

#### Compute for cloud services

Query: Billed cloud services
:   [Usage for cloud services](cost-understanding-compute.md) is billed only if the daily consumption of cloud
    services exceeds 10% of the daily usage of virtual warehouses. This query returns how much of cloud services consumption was actually
    billed for a particular day, ordered by the highest billed amount.

    ```sqlexample
    SELECT
        usage_date,
        credits_used_cloud_services,
        credits_adjustment_cloud_services,
        credits_used_cloud_services + credits_adjustment_cloud_services AS billed_cloud_services
    FROM snowflake.account_usage.metering_daily_history
    WHERE usage_date >= DATEADD(month,-1,CURRENT_TIMESTAMP())
        AND credits_used_cloud_services > 0
    ORDER BY 4 DESC;
    ```

Query: Total cloud services cost by type of query
:   This query returns the total credits consumed for cloud services by a particular type of query.

    ```sqlexample
    SELECT query_type,
      SUM(credits_used_cloud_services) AS cs_credits,
      COUNT(1) num_queries
    FROM snowflake.account_usage.query_history
    WHERE true
      AND start_time >= TIMESTAMPADD(day, -1, CURRENT_TIMESTAMP)
    GROUP BY 1
    ORDER BY 2 DESC
    LIMIT 10;
    ```

Query: Cloud services cost for queries of a given type
:   This query returns the total credits consumed for cloud services by all queries of a specific type. Replace `'COPY'` if you want to focus on a different type of query and `day` if you want to explore a longer or shorter period of time.

    ```sqlexample
    SELECT *
    FROM snowflake.account_usage.query_history
    WHERE true
      AND start_time >= TIMESTAMPADD(day, -1, CURRENT_TIMESTAMP)
      AND query_type = 'COPY'
    ORDER BY credits_used_cloud_services DESC
    LIMIT 10;
    ```

Query: Warehouses with high cloud services usage
:   This query shows the warehouses that are not using enough warehouse time to cover the cloud services portion of compute. This provides a
    launching point for additional investigation by isolating warehouses with a high ratio of cloud service use (>10% of overall credits).
    Investigation candidates include issues around cloning, listing files in S3, partner tools, setting session parameters, etc.

    ```sqlexample
    SELECT
      warehouse_name,
      SUM(credits_used) AS credits_used,
      SUM(credits_used_cloud_services) AS credits_used_cloud_services,
      SUM(credits_used_cloud_services)/SUM(credits_used) AS percent_cloud_services
    FROM snowflake.account_usage.warehouse_metering_history
    WHERE TO_DATE(start_time) >= DATEADD(month,-1,CURRENT_TIMESTAMP())
        AND credits_used_cloud_services > 0
    GROUP BY 1
    ORDER BY 4 DESC;
    ```

Query: Cloud services usage sorted by portion of query time
:   This query returns all queries run within the last minute and sorts them by parts of total query execution time (e.g. compilation time vs. queue time).

    ```sqlexample
    SELECT *
    FROM snowflake.account_usage.query_history
    WHERE true
      AND start_time >= TIMESTAMPADD(minute, -60, CURRENT_TIMESTAMP)
    ORDER BY compilation_time DESC,
      execution_time DESC,
      list_external_files_time DESC,
      queued_overload_time DESC,
      credits_used_cloud_services DESC
    LIMIT 10;
    ```

#### Compute for Automatic Clustering

Query: Automatic Clustering cost history (by day, by object)
:   This query provides a list of tables with Automatic Clustering and the volume of credits consumed via the service over the last 30 days,
    broken out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional investigation.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date,
      database_name,
      schema_name,
      table_name,
      SUM(credits_used) AS credits_used
    FROM snowflake.account_usage.automatic_clustering_history
    WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2,3,4
    ORDER BY 5 DESC;
    ```

Query: Automatic Clustering History & m-day average
:   This query shows the average daily credits consumed by Automatic Clustering grouped by week over the last year. It can help identify
    anomalies in daily averages over the year so you can investigate spikes or unexpected changes in consumption.

    ```sqlexample
    WITH credits_by_day AS (
      SELECT TO_DATE(start_time) AS date,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.automatic_clustering_history
      WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
      GROUP BY 1
      ORDER BY 2 DESC
    )

    SELECT DATE_TRUNC('week',date),
          AVG(credits_used) AS avg_daily_credits
    FROM credits_by_day
    GROUP BY 1
    ORDER BY 1;
    ```

#### Compute for Search Optimization

Query: Search Optimization cost history (by day, by object)
:   This query provides a full list of tables with Search Optimization and the volume of credits consumed via the service over the last 30
    days, broken out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional
    investigation.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date,
      database_name,
      schema_name,
      table_name,
      SUM(credits_used) AS credits_used
    FROM snowflake.account_usage.search_optimization_history
    WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2,3,4
    ORDER BY 5 DESC;
    ```

Query: Search Optimization History & m-day average
:   This query shows the average daily credits consumed by Search Optimization grouped by week over the last year. It can help identify
    anomalies in daily averages over the year so you can investigate spikes or unexpected changes in
    consumption.

    ```sqlexample
    WITH credits_by_day AS (
      SELECT TO_DATE(start_time) AS date,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.search_optimization_history
      WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
      GROUP BY 1
      ORDER BY 2 DESC
    )

    SELECT DATE_TRUNC('week', date),
      AVG(credits_used) as avg_daily_credits
    FROM credits_by_day
    GROUP BY 1
    ORDER BY 1;
    ```

#### Compute for Materialized Views

Query: Materialized Views cost history (by day, by object)
:   This query provides a full list of materialized views and the volume of credits consumed via the service over the last 30 days, broken
    out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional investigation.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date,
      database_name,
      schema_name,
      table_name,
      SUM(credits_used) AS credits_used
    FROM snowflake.account_usage.materialized_view_refresh_history
    WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2,3,4
    ORDER BY 5 DESC;
    ```

Query: Materialized Views History & m-day average
:   This query shows the average daily credits consumed by materialized views grouped by week over the last year. It can help identify
    anomalies in daily averages over the year so you can investigate spikes or unexpected changes in
    consumption.

    ```sqlexample
    WITH credits_by_day AS (
      SELECT TO_DATE(start_time) AS date,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.materialized_view_refresh_history
      WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
      GROUP BY 1
      ORDER BY 2 DESC
    )

    SELECT DATE_TRUNC('week',date),
      AVG(credits_used) AS avg_daily_credits
    FROM credits_by_day
    GROUP BY 1
    ORDER BY 1;
    ```

#### Compute for Query Acceleration Service

Query: Query Acceleration Service cost by warehouse
:   This query returns the total number of credits used by each warehouse in your account for the query acceleration service
    (month-to-date):

    ```sqlexample
    SELECT warehouse_name,
           SUM(credits_used) AS total_credits_used
      FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
      WHERE start_time >= DATE_TRUNC(month, CURRENT_DATE)
      GROUP BY 1
      ORDER BY 2 DESC;
    ```

#### Compute for Snowpipe and Snowpipe Streaming

Query: Cumulative usage of data ingest (Snowpipe and “Copy”)
:   This query returns an aggregated daily summary of all loads for each table in Snowflake showing average file size, total rows, total
    volume and the ingest method (copy or Snowpipe). If file sizes are too small or big for optimal ingest, additional
    investigation/optimization may be required. By mapping the volume to credit consumption, it is possible to determine which tables are
    consuming more credits per TB loaded.

    ```sqlexample
    SELECT TO_DATE(last_load_time) AS load_date,
      status,
      table_catalog_name AS database_name,
      table_schema_name AS schema_name,
      table_name,
      CASE
        WHEN pipe_name IS NULL THEN 'COPY'
        ELSE 'SNOWPIPE'
      END AS ingest_method,
      SUM(row_count) AS row_count,
      SUM(row_parsed) AS rows_parsed,
      AVG(file_size) AS avg_file_size_bytes,
      SUM(file_size) AS total_file_size_bytes,
      SUM(file_size)/POWER(1024,1) AS total_file_size_kb,
      SUM(file_size)/POWER(1024,2) AS total_file_size_mb,
      SUM(file_size)/POWER(1024,3) AS total_file_size_gb,
      SUM(file_size)/POWER(1024,4) AS total_file_size_tb
    FROM snowflake.account_usage.copy_history
    GROUP BY 1,2,3,4,5,6
    ORDER BY 3,4,5,1,2;
    ```

Query: Snowpipe cost history (by day, by object)
:   This query provides a full list of pipes and the volume of credits consumed via the service over the last 30 days, broken out by day.
    Any irregularities in the credit consumption or consistently high consumption are flags for additional investigation.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date,
      pipe_name,
      SUM(credits_used) AS credits_used
    FROM snowflake.account_usage.pipe_usage_history
    WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2
    ORDER BY 3 DESC;
    ```

Query: Snowpipe History & m-day average
:   This query shows the average daily credits consumed by Snowpipe grouped by week over the last year. It can help identify anomalies in
    daily averages over the year so you can investigate spikes or unexpected changes in consumption.

    ```sqlexample
    WITH credits_by_day AS (
      SELECT TO_DATE(start_time) AS date,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.pipe_usage_history
      WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
      GROUP BY 1
      ORDER BY 2 DESC
    )

    SELECT DATE_TRUNC('week',date),
      AVG(credits_used) AS avg_daily_credits
    FROM credits_by_day
    GROUP BY 1
    ORDER BY 1;
    ```

Query: Total Snowpipe Streaming cost
:   This query lists the current credit usage for Snowpipe Streaming, including both Snowpipe Streaming compute and client costs.

    ```sqlexample
    SELECT start_time,
      end_time,
      SUM(credits_used) AS total_credits,
      name,
      IFF(CONTAINS(name,':'),'streaming client cost', 'streaming compute cost') AS streaming_cost_type
    FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_HISTORY
    WHERE service_type ='SNOWPIPE_STREAMING'
    GROUP BY ALL;
    ```

#### Compute for serverless alerts

Query: Total serverless alert cost
:   This query lists the current credit usage for all serverless alerts:

    ```sqlexample
    SELECT
        start_time,
        end_time,
        alert_id,
        alert_name,
        credits_used,
        schema_id,
        schema_name,
        database_id,
        database_name
      FROM SNOWFLAKE.ACCOUNT_USAGE.serverless_alert_history
      ORDER BY start_time, alert_id;
    ```

#### Compute for serverless tasks

Query: Total serverless task cost
:   This query lists the current credit usage for all serverless tasks:

    ```sqlexample
    SELECT start_time,
      end_time,
      task_id,
      task_name,
      credits_used,
      schema_id,
      schema_name,
      database_id,
      database_name
    FROM snowflake.account_usage.serverless_task_history
    ORDER BY start_time, task_id;
    ```

#### Compute for replication

Query: Account replication cost
:   This query lists the credits used by a replication or failover group for account replication in the current month:

    ```sqlexample
    SELECT start_time,
      end_time,
      replication_group_name,
      credits_used,
      bytes_transferred
    FROM snowflake.account_usage.replication_group_usage_history
    WHERE start_time >= DATE_TRUNC('month', CURRENT_DATE());
    ```

Query: Database replication cost history (by day, by object)
:   This query provides a full list of replicated databases and the volume of credits consumed via the replication service over the last 30
    days, broken out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional
    investigation.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date,
      database_name,
      SUM(credits_used) AS credits_used
    FROM snowflake.account_usage.database_replication_usage_history
    WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2
    ORDER BY 3 DESC;
    ```

Query: Database replication History & m-day average
:   This query shows the average daily credits consumed by Replication grouped by week over the last year. This helps identify any
    anomalies in the daily average so you can investigate any spikes or changes in consumption.

    ```sqlexample
    WITH credits_by_day AS (
      SELECT TO_DATE(start_time) AS date,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.database_replication_usage_history
      WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
      GROUP BY 1
      ORDER BY 2 DESC
    )

    SELECT DATE_TRUNC('week',date),
      AVG(credits_used) AS avg_daily_credits
    FROM credits_by_day
    GROUP BY 1
    ORDER BY 1;
    ```

#### Compute for partner tools

Query: Credit consumption by partner tools
:   This query identifies which of Snowflake’s partner tools/solutions (e.g. BI, ETL, etc.) are consuming the most credits. This can help
    identify partner solutions that are consuming more credits than anticipated, which can be a starting point for additional investigation.

    ```sqlexample
    -- This Is Approximate Credit Consumption By Client Application
    WITH
      client_hour_execution_cte AS (
        SELECT
          CASE
            WHEN client_application_id LIKE 'Go %' THEN 'Go'
            WHEN client_application_id LIKE 'Snowflake UI %' THEN 'Snowflake UI'
            WHEN client_application_id LIKE 'Snowflake CLI %' THEN 'Snowflake CLI'
            WHEN client_application_id LIKE 'SnowSQL %' THEN 'SnowSQL'
            WHEN client_application_id LIKE 'JDBC %' THEN 'JDBC'
            WHEN client_application_id LIKE 'PythonConnector %' THEN 'Python'
            WHEN client_application_id LIKE 'ODBC %' THEN 'ODBC'
            ELSE 'NOT YET MAPPED: ' || CLIENT_APPLICATION_ID
          END AS client_application_name,
          warehouse_name,
          DATE_TRUNC('hour',start_time) AS start_time_hour,
          SUM(execution_time)  AS client_hour_execution_time
        FROM snowflake.account_usage.query_history qh
          JOIN snowflake.account_usage.sessions se
            ON se.session_id = qh.session_id
        WHERE warehouse_name IS NOT NULL
          AND execution_time > 0
          AND start_time > DATEADD(month,-1,CURRENT_TIMESTAMP())
        GROUP BY 1,2,3
      ),
      hour_execution_cte AS (
        SELECT start_time_hour,
          warehouse_name,
          SUM(client_hour_execution_time) AS hour_execution_time
        FROM client_hour_execution_cte
        GROUP BY 1,2
      ),
      approximate_credits AS (
        SELECT A.client_application_name,
          C.warehouse_name,
          (A.client_hour_execution_time/B.hour_execution_time)*C.credits_used AS approximate_credits_used
        FROM client_hour_execution_cte A
          JOIN hour_execution_cte B
            ON A.start_time_hour = B.start_time_hour and B.warehouse_name = A.warehouse_name
          JOIN snowflake.account_usage.warehouse_metering_history C
            ON C.warehouse_name = A.warehouse_name AND C.start_time = A.start_time_hour
      )

    SELECT client_application_name,
      warehouse_name,
      SUM(approximate_credits_used) AS approximate_credits_used
    FROM approximate_credits
    GROUP BY 1,2
    ORDER BY 3 DESC;
    ```

#### Compute for hybrid tables

Query: Credit consumption for hybrid tables over a specific period of time
:   This query shows the total credit consumption for hybrid tables in your account over a
    specific period of time. This helps track hybrid table credit usage
    against expectations.

    > **Note:**
    >
    > As of March 1, 2026, Snowflake no longer bills customers for hybrid table requests,
    > and metering was disabled soon after this pricing change took effect. Any new data
    > in the view as of March 1, 2026, will not be billed to customers, and you can still
    > query the historical data in the view.
    >
    > For information about hybrid table storage costs, see
    > [Evaluate cost for hybrid tables](tables-hybrid-cost.md).

    ```sqlexample
    -- Credits used (all time = past year)

    SELECT SUM(credits_used) AS total_credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.HYBRID_TABLE_USAGE_HISTORY;

    -- Credits used (past N days/weeks/months)

    SELECT SUM(credits_used) AS total_credits
      FROM SNOWFLAKE.ACCOUNT_USAGE.HYBRID_TABLE_USAGE_HISTORY
      WHERE start_time >= DATEADD(day, -5, CURRENT_TIMESTAMP());
    ```

#### Compute for Cortex Agents

Query: Credit consumption by Cortex Agents.
:   This query shows the credit consumption for Cortex Agents.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AGENT_USAGE_HISTORY;
    ```

#### Compute for Cortex Analyst

Query: Credit consumption by Cortex Analyst.
:   This query shows the credit consumption for Cortex Analyst.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_ANALYST_USAGE_HISTORY;
    ```

#### Compute for Cortex Fine-tuning

Query: Credit consumption by Cortex Fine-tuning.
:   This query shows the training credit consumption for each Cortex Fine-tuning,
    aggregated in one hour increments.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FINE_TUNING_USAGE_HISTORY;
    ```

#### Compute for Cortex functions

Query: Credit consumption by Cortex functions.
:   This query shows the credit consumption for each Cortex function call, aggregated in one hour increments based on
    function and model.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY;
    ```

Query: Credit consumption by Cortex function called with the `mistral-large` model.
:   This query shows the credit consumption for each Cortex function called with the `mistral-large` model, aggregated in one
    hour increments based on function and model.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY
      WHERE model_name = 'mistral-large';
    ```

Query: Credit consumption by Cortex functions query.
:   This query shows the credit consumption for each Cortex functions query, aggregated in one hour increments based on
    function and model.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY
      WHERE query_id = 'query-id';
    ```

#### Compute for Cortex REST API

Query: Credit consumption by Cortex REST API.
:   This query shows the credit consumption for Cortex REST API calls, including the number of tokens processed
    and the model used for each request.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_REST_API_USAGE_HISTORY;
    ```

#### Compute for Cortex Search

Query: Daily credit consumption by Cortex Search.
:   This query shows the credit consumption for each Cortex Search Service,
    aggregated daily, including both serving and embed text token consumption.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_SEARCH_DAILY_USAGE_HISTORY;
    ```

Query: Credit consumption by Cortex Search serving.
:   This query shows the serving credit consumption for each Cortex Search Service,
    aggregated in one hour increments.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_SEARCH_SERVING_USAGE_HISTORY;
    ```

#### Compute for Document AI

Query: Credit consumption by Document AI.
:   This query shows the credit consumption for Document AI.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.DOCUMENT_AI_USAGE_HISTORY;
    ```

Query: Credit consumption per Document AI query.
:   This query retrieves records from the CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view where the CREDITS_USED is greater than 0.072.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY
      WHERE CREDITS_USED > 0.072
    ```

#### Compute for Snowflake Intelligence

Query: Credit consumption by Snowflake Intelligence.
:   This query shows the credit consumption for Snowflake Intelligence.

    ```sqlexample
    SELECT *
      FROM SNOWFLAKE.ACCOUNT_USAGE.SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY;
    ```

#### Compute for Snowflake Notebooks

Query: Hourly credit consumption by notebook
:   This query retrieves runtime history for a specific notebook, including credit usage and execution timestamps. Use this data to understand how
    often and how long a notebook runs, and to identify patterns or spikes in credit consumption by hour.

    ```sqlexample
    SELECT * FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>';
    ```

Query: Cost to run a specific notebook
:   This query shows the total credits consumed by a specific notebook. Use this to estimate a notebook’s cost and identify high-cost notebooks.

    ```sqlexample
    SELECT
      notebook_name,
      SUM(credits) AS total_credits
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>'
    GROUP BY notebook_name;
    ```

Query: Total compute pool cost per notebook
:   This query shows the total credits consumed by each notebook running on a specific compute pool. Use this to break down compute usage by
    notebook, which can help identify which notebooks contribute most to the compute pool’s overall cost.

    ```sqlexample
    SELECT
      notebook_name,
      SUM(credits) AS total_credits
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE compute_pool_name = '<example_cp_name>'
    GROUP BY notebook_name;
    ```

Query: Identify users who ran a specific notebook
:   This query returns a list of users who have executed a specific notebook. Use this to understand usage patterns, or identify collaborators
    and consumers of shared notebooks.

    ```sqlexample
    SELECT
      DISTINCT user_name
    FROM snowflake.account_usage.notebooks_container_runtime_history
    WHERE notebook_name = '<example_nb_name>';
    ```

---
title: Exploring data transfer cost
source: https://docs.snowflake.com/en/user-guide/cost-exploring-data-transfer.md
section: User Guide
---

# Exploring data transfer cost

Snowflake does not charge a data ingress fee to bring data into your account, but does charge a per-byte fee to transfer data from a
Snowflake account into another region on the same cloud platform or into a different cloud platform.

This topic describes how to gain insight into historical data transfer costs using [Snowsight](ui-snowsight-gs.md), or by writing queries against
views in the [ACCOUNT_USAGE](../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schemas.
Snowsight allows you to quickly and easily obtain information about cost from a visual dashboard. Queries against the usage views
allow you to drill down into cost data and can help generate custom reports and dashboards.

To gain a better understanding of how data transfer fees accrue, see [Understanding data transfer cost](cost-understanding-data-transfer.md).

## Viewing the data transfer history

Users can use Snowsight to view the amount of data transferred from your Snowflake account to
a different cloud provider or region within a specified date range. The unit of measure is bytes.

To explore data transfer costs:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an XS warehouse for this purpose.
5. Select Consumption.
6. Select Data Transfer from the Usage Type drop-down.

For usage notes related to the Consumption page, see [Usage notes](cost-exploring-overall.md).

## Querying data for data transfer cost

Snowflake provides two schemas, [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and
[ACCOUNT_USAGE](../sql-reference/account-usage.md), that contain data related to usage and cost. The ORGANIZATION_USAGE schema provides
cost information for all of the accounts in the organization while the ACCOUNT_USAGE schema provides similar information for a single
account. Views in these schemas provide granular, analytics-ready usage data to build custom reports or dashboards.

Most views in the ORGANIZATION_USAGE and ACCOUNT_USAGE schemas contain the cost of data transfers in terms of the volume of data
transferred. To view cost in currency rather than volume, write queries against the
[USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). This view converts the volume of data transferred into cost in currency
using the daily price of transferring a TB.

The following views provide usage and cost information related to transferring data from your Snowflake account to a different cloud
provider or region.

| View | Description | Schema |
| --- | --- | --- |
| DATA_TRANSFER_DAILY_HISTORY | Number of bytes transferred on a given day. For more detailed data, use the DATA_TRANSFER_HISTORY view instead. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/data_transfer_daily_history.md) |
| DATA_TRANSFER_HISTORY | Number of bytes transferred, include the source cloud and region, target cloud and region, and type of transfer. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/data_transfer_history.md) [ACCOUNT_USAGE](../sql-reference/account-usage/data_transfer_history.md) |
| DATABASE_REPLICATION_USAGE_HISTORY | Number of bytes transferred and credit consumed during database replication. | [ACCOUNT_USAGE](../sql-reference/account-usage/database_replication_usage_history.md) |
| LISTING_AUTO_FULFILLMENT_ USAGE_HISTORY | Estimated usage associated with fulfilling data products to other regions by using Cross-Cloud Auto-Fulfillment. Refer to the SERVICE_TYPE of DATA_TRANSFER. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md) |
| REPLICATION_USAGE_HISTORY | Number of bytes transferred and credits consumed during database replication. If possible, use the [DATABASE_REPLICATION_USAGE_HISTORY view](../sql-reference/account-usage/database_replication_usage_history.md) instead. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/replication_usage_history.md) [ACCOUNT_USAGE](../sql-reference/account-usage/replication_usage_history.md) |
| REPLICATION_GROUP_USAGE_HISTORY | Number of bytes transferred and credits consumed during replication for a specific replication group. | [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) [ACCOUNT_USAGE](../sql-reference/account-usage/replication_group_usage_history.md) |
| USAGE_IN_CURRENCY_DAILY | Daily data transfer in TB along with the cost of that usage in the organization’s currency. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/usage_in_currency_daily.md) |

> **Note:**
>
> The views and table functions of the [Snowflake Information Schema](../sql-reference/info-schema.md) also provide usage data related to cost. Though
> the ACCOUNT_USAGE schema is preferred, the Information Schema can be faster in some circumstances.

---
title: Exploring execution times
source: https://docs.snowflake.com/en/user-guide/performance-query-exploring.md
section: User Guide
---

# Exploring execution times

This topic explains how to examine the past performance of queries and [tasks](tasks-intro.md). This information helps
identify candidates for performance optimizations and allows you to see whether your optimization strategies are having the desired effect.

You can explore historical performance using Snowsight or by writing queries against views in the ACCOUNT_USAGE schema. A user
without access to the ACCOUNT_USAGE schema can query similar data using the Information Schema.

## View execution times and load

You can use Snowsight to gain visual insights into the performance of queries and tasks as well as the load of a warehouse.

Queries:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Monitoring » Query History.
    3. Use the Duration column to understand how long it took a query to execute. You can sort the column to find the queries that ran
       the longest.
    4. If you want to focus on a particular user’s queries, use the User drop-down to select the user.
    5. If you want to focus on the queries that ran on a particular warehouse, select Filters » Warehouse, and then select
       the warehouse.

Warehouses:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. Switch to a role that has privileges for the warehouse.
    3. In the navigation menu, select Compute » Warehouses.
    4. Select a warehouse.
    5. Use the Warehouse Activity chart to visualize the load of the warehouse, including whether queries were queued.

Tasks:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Transformation » Tasks to view how long it took to execute a task’s SQL code.

### Drill down into execution times

The [Query Profile](ui-snowsight-activity.md) allows you to examine which parts of a query are taking the longest to execute.
It includes a Most Expensive Nodes pane that identifies the operator nodes that are taking the longest to execute. You can drill
down even further by viewing what percentage of a node’s execution time was spent in a particular category of query processing.

To access the Query Profile for a query:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History.
3. Select the query ID of a query.
4. Select the Query Profile tab.

> **Tip:**
>
> You can programmatically access the performance statistics of the Query Profile by executing the
> [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md) function.

## Write queries to explore execution times

The [Account Usage](../sql-reference/account-usage.md) schema contains views related to the execution times of queries and tasks. It also contains a
view related to the load of a warehouse as it executes queries. You can write queries against these views to drill down into performance
data and create custom reports and dashboards.

By default, only the account administrator (i.e. user with the ACCOUNTADMIN role) can access views in the ACCOUNT_USAGE schema. To allow
other users to access these views, refer to [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md).

Users without access to the ACCOUNT_USAGE schema (e.g. a user who ran a query or a warehouse administrator) can still return recent
execution times and other query metadata using the [QUERY_HISTORY table functions](../sql-reference/functions/query_history.md) of the
Information Schema.

Be aware that the ACCOUNT_USAGE views are not updated immediately after running a query or task. If you want to check the execution time
of a query right after running it, use Snowsight to view its performance. The Information Schema
is also updated quicker than the ACCOUNT_USAGE views.

| ACCOUNT_USAGE View | Description | Latency |
| --- | --- | --- |
| [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) | Used to analyze the Snowflake query history by various dimensions (time range, execution time, session, user, warehouse, etc.) within the last 365 days (1 year). | Up to 45 minutes |
| [WAREHOUSE_LOAD_HISTORY](../sql-reference/account-usage/warehouse_load_history.md) | Used to analyze the workload on a warehouse within a specified date range. | Up to 3 hours |
| [TASK_HISTORY](../sql-reference/account-usage/task_history.md) | Used to retrieve the history of task usage within the last 365 days (1 year). | Up to 45 minutes |

### Example queries

The following queries against the ACCOUNT_USAGE schema provide insight into the past performance of queries, warehouses, and tasks.
Click the name of a query to see the full SQL example.

Query Performance:
:   * Query: Top n longest-running queries
    * Query: Queries organized by execution time over past month
    * Query: Find long running repeated queries
    * Query: Track the average performance of a query over time

Warehouse Load:
:   * Query: Total warehouse load

Task Performance:
:   * Query: Longest running tasks

#### Query performance

##### Query: Top n longest-running queries

This query provides a listing of the top n (50 in the example below) longest-running queries in the last day. You can adjust the
`DATEADD` function to focus on a shorter or longer period of time. Replace `my_warehouse` with the name of a warehouse.

```sqlexample
SELECT query_id,
  ROW_NUMBER() OVER(ORDER BY partitions_scanned DESC) AS query_id_int,
  query_text,
  total_elapsed_time/1000 AS query_execution_time_seconds,
  partitions_scanned,
  partitions_total,
FROM snowflake.account_usage.query_history Q
WHERE warehouse_name = 'my_warehouse' AND TO_DATE(Q.start_time) > DATEADD(day,-1,TO_DATE(CURRENT_TIMESTAMP()))
  AND total_elapsed_time > 0 --only get queries that actually used compute
  AND error_code IS NULL
  AND partitions_scanned IS NOT NULL
ORDER BY total_elapsed_time desc
LIMIT 50;
```

##### Query: Queries organized by execution time over past month

This query groups queries for a given warehouse by buckets for execution time over the last month. These trends in query completion time
can help inform decisions to resize warehouses or separate out some queries to another warehouse. Replace `MY_WAREHOUSE` with the name
of a warehouse.

```sqlexample
SELECT
  CASE
    WHEN Q.total_elapsed_time <= 60000 THEN 'Less than 60 seconds'
    WHEN Q.total_elapsed_time <= 300000 THEN '60 seconds to 5 minutes'
    WHEN Q.total_elapsed_time <= 1800000 THEN '5 minutes to 30 minutes'
    ELSE 'more than 30 minutes'
  END AS BUCKETS,
  COUNT(query_id) AS number_of_queries
FROM snowflake.account_usage.query_history Q
WHERE  TO_DATE(Q.START_TIME) >  DATEADD(month,-1,TO_DATE(CURRENT_TIMESTAMP()))
  AND total_elapsed_time > 0
  AND warehouse_name = 'my_warehouse'
GROUP BY 1;
```

##### Query: Find long running repeated queries

You can use the [query hash](query-hash.md) (the value of the `query_hash` column in the ACCOUNT_USAGE
QUERY_HISTORY view) to find patterns in query performance that might not be obvious. For example, although a query might not be
excessively expensive during any single execution, a frequently repeated query could lead to high costs, based on the number of
times the query runs.

You can use the query hash to identify the queries that you should focus on optimizing first. For example, the following query
uses the value in the `query_hash` column to identify the query IDs for the 100 longest-running queries:

```sqlexample
SELECT
    query_hash,
    COUNT(*),
    SUM(total_elapsed_time),
    ANY_VALUE(query_id)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE warehouse_name = 'MY_WAREHOUSE'
    AND DATE_TRUNC('day', start_time) >= CURRENT_DATE() - 7
  GROUP BY query_hash
  ORDER BY SUM(total_elapsed_time) DESC
  LIMIT 100;
```

##### Query: Track the average performance of a query over time

The following statement computes the daily average total elapsed time for all queries that have a specific parameterized query
hash (`cbd58379a88c37ed6cc0ecfebb053b03`).

```sqlexample
SELECT
    DATE_TRUNC('day', start_time),
    SUM(total_elapsed_time),
    ANY_VALUE(query_id)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_parameterized_hash = 'cbd58379a88c37ed6cc0ecfebb053b03'
    AND DATE_TRUNC('day', start_time) >= CURRENT_DATE() - 30
  GROUP BY DATE_TRUNC('day', start_time);
```

#### Warehouse load

##### Query: Total warehouse load

This query provides insight into the total load of a warehouse for executed and queued queries. These load values represent the ratio of the total execution time (in seconds) of all queries in a specific state in an interval by the total time (in seconds) for that interval.

For example, if 276 seconds was the total time for 4 queries in a 5 minute (300 second) interval, then the query load value is 276 / 300 = 0.92.

```sqlexample
 SELECT TO_DATE(start_time) AS date,
  warehouse_name,
  SUM(avg_running) AS sum_running,
  SUM(avg_queued_load) AS sum_queued
FROM snowflake.account_usage.warehouse_load_history
WHERE TO_DATE(start_time) >= DATEADD(month,-1,CURRENT_TIMESTAMP())
GROUP BY 1,2
HAVING SUM(avg_queued_load) >0;
```

#### Task performance

##### Query: Longest running tasks

This query lists the longest running tasks in the last day, which can indicate an opportunity to optimize the SQL being executed by the
task.

```sqlexample
SELECT DATEDIFF(seconds, query_start_time,completed_time) AS duration_seconds,*
FROM snowflake.account_usage.task_history
WHERE state = 'SUCCEEDED'
  AND query_start_time >= DATEADD (day, -1, CURRENT_TIMESTAMP())
ORDER BY duration_seconds DESC;
```

---
title: Exploring overall cost
source: https://docs.snowflake.com/en/user-guide/cost-exploring-overall.md
section: User Guide
---

# Exploring overall cost

You can explore historical cost using Snowsight, or by writing queries against views in the
[ACCOUNT_USAGE](../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schemas.
Snowsight allows you to quickly and easily obtain information about cost from a visual dashboard. Queries against the usage
views allow you to drill down into cost data and can help generate custom reports and dashboards.

If you need an introduction to how costs are incurred in Snowflake, refer to [Understanding overall cost](cost-understanding-overall.md).

To obtain a billing statement that contains information about historical usage, see [Access a billing usage statement](billing-usage-statement.md).

## Viewing costs using Snowsight

Snowsight provides multiple pages that allow you to explore the historical cost of using Snowflake. For details on using these
pages to view overall costs, see:

* Overview of organization-level costs
* Overview of account-level costs
* Drilling down into incurred costs

> **Note:**
>
> Keep the following in mind when viewing costs in Snowsight:
>
> * It can take up to 72 hours for cost information to become available in Snowsight.
> * Information is shown in the UTC time zone, not your local time zone.

### Overview of organization-level costs

The Organization Overview page provides insights into how your organization is spending the capacity commitment made in the current
contract. For example, it shows you the remaining balance of the contract, the accumulated cost of Snowflake usage since the start of the
contract, and the monthly spend for the organization.

It also gives you an overview of how much each account in the organization has spent.

> **Note:**
>
> The Organization Overview page is not available in the following cases:
>
> * The organization uses On Demand accounts rather than a capacity commitment with a contract.
> * The organization signed a contract through a Snowflake reseller.

To access an overview of incurred costs at the organization level:

1. Sign in to the [organization account](organization-accounts.md) or an [ORGADMIN-enabled account](organization-administrators.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an X-Small warehouse for this purpose.
5. Select Organization Overview.

The Account Spend Summary tile has a View All option to expand the contents of the tile to include all of the accounts in the
organization, not just the accounts that have spent the most. To display the SQL query used to populate this tile, select
View All » View query () .

### Overview of account-level costs

The Account Overview page provides high-level insights into the cost of using Snowflake and can be a starting off point for optimizing
your spend.

> **Note:**
>
> Account administrators cannot see the price of a credit or usage costs in currency unless they also have the ORGADMIN role.

To access an overview of incurred costs at the account level:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an X-Small warehouse for this purpose.
5. Select Account Overview.

Many tiles on the Account Overview page have a View All option to expand the contents of the tile to include more items. For
example, for the Top warehouses by cost tile, select View All to open a dialog that displays all warehouses in your account
sorted by cost.

To display the SQL query used to populate a tile, select View All » View query () . For example, if
you view the query for the Top warehouses by cost tile, you see that the data comes from querying the
[WAREHOUSE_METERING_HISTORY](../sql-reference/account-usage/warehouse_metering_history.md) view in the ACCOUNT_USAGE schema of the shared
SNOWFLAKE database.

> **Note:**
>
> Customers who signed a contract through a Snowflake reseller cannot see the price of a credit or usage in a currency.

### Drilling down into incurred costs

You can use the Consumption page to drill down into the overall cost of using Snowflake for
any given day, week, or month.

To use Snowsight to drill down into the overall cost:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an X-Small warehouse for this purpose.
5. Select Consumption.
6. Select All Usage Types from the drop-down list.

This totals the cost of compute, storage, and data transfer resources and displays them in a bar graph using the organization’s currency.
The total cost of these resources during the selected time period appears above the bar graph.

To isolate the cost of compute, storage, or data transfer, adjust your selection in the All Usage Types filter.

#### Usage notes

Keep the following in mind when accessing the Consumption page:

* It can take up to 72 hours for cost information to become available in Snowsight.
* To access all of the features on the Consumption page, the account administrator must also have
  the ORGADMIN role. For example, if a user has the ACCOUNTADMIN role, but does *not* have the ORGADMIN role, they can only view costs
  for the current account. The Account filter that would allow them to switch to a different account does not appear.
* If the usage details fail to load with a message indicating that The result set is too large to display, you
  must use the filters to select a shorter date range or otherwise filter the results.
* Compute costs do not include queries executed on a warehouse by the SYSTEM user as part of a user-defined
  [task](tasks-intro.md).

## Querying data for overall cost

Snowflake provides two schemas, [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and
[ACCOUNT_USAGE](../sql-reference/account-usage.md), that contain data related to usage and cost. The ORGANIZATION_USAGE schema provides
cost information for all of the accounts in the organization while the ACCOUNT_USAGE schema provides similar information for a single
account. Views in these schemas provide granular, analytics-ready usage data to build custom reports or dashboards.

The following query combines data from the USAGE_IN_CURRENCY view in the ORGANIZATION_USAGE schema in order to gain insight into the
overall cost of using Snowflake.

Query: Total usage costs in dollars for the organization, broken down by account
:   ```sqlexample
    SELECT account_name,
      ROUND(SUM(usage_in_currency), 2) as usage_in_currency
    FROM snowflake.organization_usage.usage_in_currency_daily
    WHERE usage_date > DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1
    ORDER BY 2 desc;
    ```

**Next Topics**

* [Exploring compute cost](cost-exploring-compute.md)
* [Exploring storage cost](cost-exploring-data-storage.md)
* [Exploring data transfer cost](cost-exploring-data-transfer.md)

---
title: Exploring storage cost
source: https://docs.snowflake.com/en/user-guide/cost-exploring-data-storage.md
section: User Guide
---

# Exploring storage cost

Total Storage cost is the sum of costs associated with:

* Staged file storage
* Database table storage
* Fail-safe and Time Travel storage

This topic describes how to gain insight into historical storage costs using [Snowsight](ui-snowsight-gs.md), or by writing queries against views in
the [ACCOUNT_USAGE](../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schemas.
Snowsight allows you to quickly and easily obtain information about cost from a visual dashboard. Queries against the usage views
allow you to drill down into cost data and can help generate custom reports and dashboards.

To gain a better understanding of how storage costs are incurred, see [Understanding storage cost](cost-understanding-data-storage.md).

## Viewing the storage history

Users can use Snowsight to view the amount of data that is stored in Snowflake.

To explore storage costs:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an XS warehouse for this purpose.
5. Select Consumption.
6. Select Storage from the Usage Type drop-down.

For usage notes related to the Consumption page, see [Usage notes](cost-exploring-overall.md).

### Filter by tag

To help [attribute cost](cost-attributing.md) to a logical unit within your organization, you can filter the Usage
dashboard to show storage associated with a specific tag/value combination. This ability to filter storage by tag is similar to filtering
credit consumption by tag. For details, refer to [Exploring Compute Costs](cost-exploring-compute.md).

### View storage by type or object

When viewing the bar graph that displays storage history, you can filter the data either By Type or By Object.

Filtering By Type shows the size of storage for each storage type: Database, Fail Safe, and Stage. Storage
associated with Time Travel is included in the Database category.

Filtering By Object graphs the size of storage for each object, for example the size of a particular database or stage.

## Viewing data usage for a table

Users with the appropriate access privileges can use Snowsight to view the size (in bytes) of individual tables in a
schema/database:

To view the size of a table:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Expand a database, then any schema in the database.
4. Click on any table to view the table statistics, including its size.

> **Important:**
>
> The size displayed for a table represents the number of *active* bytes. In most cases, this is the number of bytes that will be scanned
> if the entire table is scanned in a query. However, this number might be different from the number of physical bytes (i.e. bytes stored
> on-disk) for the table, specifically for cloned tables and tables with deleted data:
>
> * A cloned table does not utilize additional storage (until rows are added to the table or existing rows in the table are modified or
>   deleted). As a result, the table size displayed may be larger
>   than the actual physical bytes stored for the table, i.e. the table contributes less to the overall storage for the account
>   than the size indicates.
> * Data deleted from a table is not included in the displayed table size; however, the data is maintained in Snowflake until both the
>   Time Travel retention period (default is 1 day) and the Fail-safe period (7 days) for the data has passed. During these two periods,
>   the table size displayed is smaller than the actual physical bytes stored for the table, i.e. the table contributes more
>   to the overall storage for the account than the size indicates.
> * Dropping a column from a table does not immediately delete the data in the column. The physical bytes for the data in the dropped
>   column remain in storage. In this case, the table size displayed is larger than the number of bytes that is scanned if the
>   entire table is scanned in a query. For more information, see the [usage notes](../sql-reference/sql/alter-table.md) for
>   ALTER TABLE.
>
> For more information about storage for cloned tables and deleted data, see [Data storage considerations](tables-storage-considerations.md).

## Querying data for table size

You can write SQL queries to gain insights into tables, including their size, instead of using the web interface.

A user with the proper access privileges can list data about tables using the [SHOW TABLES](../sql-reference/sql/show-tables.md) command.

In addition, users with the ACCOUNTADMIN role can use SQL to view table size information by executing queries against the
[TABLE_STORAGE_METRICS](../sql-reference/account-usage/table_storage_metrics.md) view in the ACCOUNT_USAGE schema.

For important information about interpreting the table data retrieved by these SQL queries, see the note in
Viewing data usage for a table (in this topic).

## Querying data for storage cost

Snowflake provides two schemas, [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and
[ACCOUNT_USAGE](../sql-reference/account-usage.md), that contain data related to usage and cost. The ORGANIZATION_USAGE schema provides
cost information for all of the accounts in the organization while the ACCOUNT_USAGE schema provides similar information for a single
account. Views in these schemas provide granular, analytics-ready usage data to build custom reports or dashboards.

Most views in the ORGANIZATION_USAGE and ACCOUNT_USAGE schemas contain the cost of storage in terms of the size of storage. To view cost
in currency rather than size, write queries against the [USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). This view
converts the size of storage into cost in currency using the daily price of a TB.

The following views provide usage and cost information related to storage.

| View | Description | Schema |
| --- | --- | --- |
| APPLICATION_DAILY_USAGE_HISTORY | Daily storage usage consumption for Snowflake Native Apps in an account within the last 365 days. | [ACCOUNT_USAGE](../sql-reference/account-usage/application_daily_usage_history.md) |
| DATABASE_STORAGE_USAGE_HISTORY | Daily storage in bytes for databases (including data in Time Travel), Fail-safe, and hybrid tables in the account/organization. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/database_storage_usage_history.md) [ACCOUNT_USAGE](../sql-reference/account-usage/database_storage_usage_history.md) |
| HYBRID_TABLES | Data storage in bytes for each hybrid table row in the account. | [ACCOUNT_USAGE](../sql-reference/account-usage/hybrid_tables.md) |
| LISTING_AUTO_FULFILLMENT_ DATABASE_STORAGE_DAILY | Data storage in bytes for databases fulfilled to other regions by Cross-Cloud Auto-Fulfillment. | [DATA_SHARING_USAGE](../sql-reference/data-sharing-usage/listing-auto-fulfillment-database-storage-daily.md) |
| LISTING_AUTO_FULFILLMENT_ USAGE_HISTORY | Estimated usage associated with fulfilling data products to other regions by using Cross-Cloud Auto-Fulfillment. Refer to the SERVICE_TYPE of STORAGE. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md) |
| POSTGRES_STORAGE_USAGE_HISTORY | Data storage in bytes for Snowflake Postgres instances. | [ACCOUNT_USAGE](../sql-reference/account-usage/postgres_storage_usage_history.md) |
| STORAGE_DAILY_HISTORY | Average daily storage for storage in bytes. Combines database storage (DATABASE_STORAGE_USAGE_HISTORY) and stage storage (STAGE_STORAGE_USAGE_HISTORY). | [ORGANIZATION_USAGE](../sql-reference/organization-usage/storage_daily_history.md) |
| STAGE_STORAGE_USAGE_HISTORY | Average daily storage usage, in bytes, for all the Snowflake stages including named internal stages and default staging areas. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/stage_storage_usage_history.md) [ACCOUNT_USAGE](../sql-reference/account-usage/stage_storage_usage_history.md) |
| TABLE_STORAGE_METRICS | Storage in bytes for tables, including storage that is no longer active but continues to incur cost (e.g. deleted tables with the Time Travel retention period). | [ACCOUNT_USAGE](../sql-reference/account-usage/table_storage_metrics.md) |
| USAGE_IN_CURRENCY_DAILY | Daily average storage in bytes along with the cost of that usage in the organization’s currency. | [ORGANIZATION_USAGE](../sql-reference/organization-usage/usage_in_currency_daily.md) |

> **Note:**
>
> The views and table functions of the [Snowflake Information Schema](../sql-reference/info-schema.md) also provide usage data related to cost. Though
> the ACCOUNT_USAGE schema is preferred, the Information Schema can be faster in some circumstances.

---
title: Exploring the Snowsight user interface
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-homepage.md
section: User Guide
---

# Exploring the Snowsight user interface

The Snowsight interface is one of the many methods that can be used to access Snowflake.
Users wishing to access Snowflake to run queries, create schemas, databases, and related objects can do so using Snowsight.

The Snowsight home page is opened by selecting (Home) in the navigation menu, and is automatically displayed when
initially opening Snowsight.

The Snowsight user interface is composed of four sections or panes:

1. Navigation menu: Selections for creating and managing data products, notebooks, worksheets, databases and all related artifacts.
   See [Snowsight navigation menu](ui-snowsight-navigation.md) for additional details.
2. Search: Easily discover content for data products, database elements such as tables and views, functions and more.
3. Quick actions: Quickly and easily perform operations specific to your current role.
   For example: query data using a worksheet, upload files directly to tables, create worksheets to execute python code and more.
4. Recently viewed: Tabs with recent operations and associated content. For example, select the Worksheets tab to show all recently
   accessed worksheets. Tabs are also available to show projects, Snowflake Notebooks, Streamlits, and Dashboards.

> **Note:**
>
> Note that all features are not available in all regions and platforms. If a feature is not available in your region or platform, it will
> not be available in quick actions or otherwise.

---
title: External API authentication and secrets
source: https://docs.snowflake.com/en/user-guide/api-authentication.md
section: User Guide
---

# External API authentication and secrets

This topic provides concepts about external API authentication and secrets.

## Overview

External API authentication provides a pathway to authenticate to a service that is hosted outside of Snowflake. The API request to access
the service requires the API request to be authenticated. Snowflake supports the following methods of authentication while using External
API Authentication:

* Basic authentication.
* OAuth with code grant flow.
* OAuth with client credentials flow.

Snowflake supports basic authentication (i.e. username and password) in the API request header as specified in
[RFC 7617](https://datatracker.ietf.org/doc/html/rfc7617), where the authentication credentials are encoded using Base64. Similarly,
Snowflake supports OAuth 2.0 as specified in [RFC 6749](https://datatracker.ietf.org/doc/html/rfc6749). In Snowflake,
the authentication credentials are stored and accessed securely from an object called a secret. The secret is used with a connector to
access the service outside of Snowflake, such as the [Snowflake Connector for ServiceNow](https://other-docs.snowflake.com/connectors/servicenow/about.html). A security integration for
[external API authentication](../sql-reference/sql/create-security-integration-api-auth.md) enables Snowflake to connect to the service
hosted outside of Snowflake when using the OAuth flows.

A secret is a schema-level object that stores sensitive information, limits access to the sensitive information using
[RBAC](security-access-control-overview.md), and is encrypted using the Snowflake
[key encryption hierarchy](security-encryption-manage.md). Information present in the secret object is encrypted using a key in the key
hierarchy. After you create a secret, only dedicated Snowflake components such as integrations and external functions can read the
sensitive information.

For example, an external function needs to access and read the secret to pass the authentication credentials into an API authorization
header to make an API request to the service outside of Snowflake. The binding of the secret to the external function occurs during the
connector installation process. However, if a user runs a [DESCRIBE SECRET](../sql-reference/sql/desc-secret.md) operation on the secret, the password value
stored in the secret is never exposed.

Snowflake provides centralized management and access control of credentials used for API authentication in the secret. You can implement
separation of duties (i.e. SoD) for the management of the secret and the roles associated with managing the connector. The connector only
needs to use the secret name, and the users granted the connector roles do not need to view the sensitive information stored in the secret.

## Managing secrets

Snowflake provides the following commands to manage the secret object:

* [CREATE SECRET](../sql-reference/sql/create-secret.md)
* [ALTER SECRET](../sql-reference/sql/alter-secret.md)
* [DESCRIBE SECRET](../sql-reference/sql/desc-secret.md)
* [DROP SECRET](../sql-reference/sql/drop-secret.md)
* [SHOW SECRETS](../sql-reference/sql/show-secrets.md)

Snowflake supports the following privileges to determine whether users can create, use and own secrets.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| CREATE | Enables creating a new secret in a schema. |
| USAGE | Enables using a secret. |
| OWNERSHIP | Transfers ownership of the secret, which grants full control over the secret. Required to alter most properties of a secret. |

The following table summarizes the relationship between the secret command operations and their necessary privileges.

| Operation | Privilege |
| --- | --- |
| CREATE SECRET | A role with the USAGE privilege on the parent database and schema with the CREATE SECRET privilege in the same schema. |
| ALTER SECRET | A role with the OWNERSHIP privilege on the secret. |
| DROP SECRET | A role with the OWNERSHIP privilege on the secret. |
| DESCRIBE SECRET | A role with the USAGE privilege on the secret. |
| SHOW SECRETS | A role with the USAGE privilege on the secret. |
| USE SECRET | A role with the USAGE privilege on the secret.  This privilege is required for the role creating the external function and calling the external function at query runtime, if the secret is to be used with an external function. |

## Using external API authentication and secrets

For representative examples, see:

* [Snowflake Connectors](https://other-docs.snowflake.com/connectors.html)
* [Creating and using an external access integration](../developer-guide/external-network-access/creating-using-external-network-access.md)

Additionally, you can replicate secrets using [account replication](account-replication-intro.md). For details, see
the Replication and secrets section in [Replication considerations](account-replication-considerations.md).

---
title: External lineage
source: https://docs.snowflake.com/en/user-guide/external-lineage.md
section: User Guide
---

# External lineage

External lineage extends Snowflake’s [native lineage](ui-snowsight-lineage.md) to include external data sources and
destinations, providing you with visibility into data flows across your entire data ecosystem. It captures lineage from external ETL tools
and source databases to create a unified view of how data moves through your data pipeline.

[OpenLineage](https://openlineage.io) is an open standard for capturing and sharing data lineage information across diverse data
tools and platforms. Snowflake leverages this framework by accepting OpenLineage-compatible events through a REST endpoint. External tools
like dbt and Apache Airflow can use the endpoint to send lineage metadata to Snowflake, which then incorporates this information into the
native lineage graph displayed in Snowsight.

External lineage REST endpoint
:   ```none
    /api/v2/lineage/external-lineage
    ```

Snowflake base URL for REST endpoints
:   ```none
    https://<account_identifier>.snowflakecomputing.com
    ```

    Where `account-identifier` is the [account identifier](admin-account-identifier.md) of your Snowflake account. You
    can use either the account name format or the account locator format as your account identifier.

    For example, if your account identifier is `myorg-dev_account`, then the base URL of the external lineage
    endpoint is: `https://myorg-dev_account.snowflakecomputing.com`

## External lineage workflow

Implementing external lineage for a data tool consists of the following tasks:

1. Grant the necessary privileges to the user who is authenticating to the external lineage
   endpoint.
2. Configure your data tool to send OpenLineage events to the Snowflake REST endpoint.
3. Choose an authentication method that works for Snowflake REST APIs, and then configure your data
   tool to use it to authenticate its requests to the external lineage endpoint.
4. Use your data tool as usual. OpenLineage events are sent to Snowflake automatically and appear in the native lineage graph in
   Snowsight.

If you want to test the external lineage endpoint before you configure a data tool to emit OpenLineage events, see
Send manual requests to establish lineage.

## View your data lineage

To view data lineage in Snowsight, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md) with the [necessary privileges](ui-snowsight-lineage.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select a [supported object](ui-snowsight-lineage.md) such as a table or
   view.
3. Select the Lineage tab.

When a data tool sends lineage information to Snowflake, external objects appear in the Snowsight lineage graph and are
labeled as an external node. For example:

You can select an external object or the line connecting objects to obtain additional information just like you can with native lineage.

## Grant Snowflake privileges

After a REST request is authenticated, Snowflake checks whether the user associated with the request
is authorized to use external lineage. The user associated with the request must have a role that is granted the INGEST LINEAGE privilege
on the account.

For example, suppose you want requests sent by the service user `dbt_integration_user` to show up in Snowsight lineage. As an
administrator, run the following commands to create a dedicated role, grant it the necessary privilege, and then grant the role to the user:

```sqlexample
CREATE ROLE dbt_lineage_role;
GRANT INGEST LINEAGE ON ACCOUNT TO ROLE dbt_lineage_role;
GRANT ROLE dbt_lineage_role TO USER dbt_integration_user;
```

## Configure your data tool

> **Note:**
>
> Any data tool with an OpenLineage integration can be configured to send lineage data to Snowflake. For a full list of tools that have an
> integration, see [OpenLineage Integrations](https://github.com/OpenLineage/OpenLineage/tree/main/integration#openlineage-integrations).

The following sections provide basic instructions for using external lineage with dbt and Apache AirFlow.

* Configure dbt to send lineage data to Snowflake
* Configure Airflow to send lineage data to Snowflake

### Configure dbt to send lineage data to Snowflake

> **Note:**
>
> Configuring dbt to emit OpenLineage events isn’t unique to Snowflake; the only thing specific to Snowflake is the endpoint and base URL
> of external lineage.

The following steps provide the minimum configuration you need to set up your dbt environment. Consult the
[OpenLineage dbt documentation](https://openlineage.io/docs/integrations/dbt) and the [OpenLineage specification](https://openlineage.io/apidocs/openapi/) to configure your OpenLineage-dbt integration.

1. Install the [OpenLineage-dbt integration](https://pypi.org/project/openlineage-dbt/):

   ```bash
   pip3 install openlineage-dbt
   ```
2. Set your transport variables to specify the base URL,
   endpoint, and security token for external lineage.

   For example, if the account identifier of your account is `MYORG-DEV_ACCOUNT`, define the following code in your YAML configuration
   file:

   ```yaml
   transport:
      type: http
      url: https://MYORG-DEV_ACCOUNT.snowflakecomputing.com
      endpoint: /api/v2/lineage/external-lineage
      auth:
         type: api_key
         apiKey: eyJ0eXAiOiJKV1QiLsecuritytoken...
      compression: gzip
   ```
3. Replace `dbt` commands with `dbt-ol`. For example, change the `dbt run` command to `dbt-ol run`.

   These `dbt-ol` commands are required by the OpenLineage-dbt integration, and aren’t unique to Snowflake.

For more information about OpenLineage-dbt integrations, including other methods of setting variables, see the
[OpenLineage dbt documentation](https://openlineage.io/docs/integrations/dbt).

### Configure Airflow to send lineage data to Snowflake

> **Note:**
>
> Configuring Apache Airflow to emit OpenLineage events isn’t unique to Snowflake; the only thing specific to Snowflake is the endpoint
> and base URL of external lineage.

The following steps provide the minimum configuration you need to set up your Airflow environment for Airflow version 2.7+, which is the
preferred version for OpenLineage. Consult the [OpenLineage Airflow documentation](https://openlineage.io/docs/integrations/airflow) and the [OpenLineage specification](https://openlineage.io/apidocs/openapi/) to
configure your OpenLineage-Airflow integration.

1. Install the [OpenLineage Airflow integration](https://airflow.apache.org/docs/apache-airflow-providers-openlineage/stable/index.html#apache-airflow-providers-openlineage)
   for version 2.7+:

   > ```bash
   > pip install apache-airflow-providers-openlineage
   > ```

   If you use an older version of Airflow, install `openlineage-airflow` instead.
2. Set your transport variables to specify the base URL,
   endpoint, and security token for external lineage.

   For example, if the account identifier of your account is `MYORG-DEV_ACCOUNT`, define the following code in your YAML configuration
   file:

   ```yaml
   transport:
      type: http
      url: https://MYORG-DEV_ACCOUNT.snowflakecomputing.com
      endpoint: /api/v2/lineage/external-lineage
      auth:
         type: api_key
         apiKey: eyJ0eXAiOiJKV1QiLsecuritytoken...
      compression: gzip
   ```

For more information about OpenLineage-Airflow integrations, including other methods of setting variables, see the
[OpenLineage Airflow documentation](https://openlineage.io/docs/integrations/airflow).

## Choose an authentication method

Snowflake provides multiple ways to authenticate requests to a Snowflake REST endpoint like the one used by external lineage. For a
complete list of authentication methods, see [Authenticating Snowflake REST APIs with Snowflake](../developer-guide/snowflake-rest-api/authentication.md).

After you select your preferred authentication method, you must generate a security token for a specific user. The token is used to
associate a user with the REST request so that Snowflake can authenticate the user and verify that the user is
authorized to use external lineage.

After successfully associating a user with a security token in Snowflake, you need to configure your data tool to authenticate its requests
with this token. For example, if you use a YAML configuration file to set OpenLineage transport variables, use the following code to
specify the security token that is sent in the header of the request:

```yaml
transport:
   auth:
      type: api_key
      apiKey: eyJ0eXAiOiJKV1QiLsecuritytoken...
```

For other methods of specifying a security token, see the OpenLineage documentation for your data tool.

## Send manual requests to establish lineage

External lineage works by accepting JSON payloads that conform to the OpenLineage specification for COMPLETE events. When integrated with a
data tool, the tool emits these COMPLETE events. But you can also construct a COMPLETE event, then send it to the endpoint by using any tool
or language that can send POST requests to an endpoint.

A valid request consists of the following method, base URL, and endpoint:

```none
POST https://<account_identifier>.snowflakecomputing.com/api/v2/lineage/external-lineage
```

Where `account_identifier` is the [account identifier](admin-account-identifier.md) of your Snowflake account.

The following example shows how to use curl to send lineage information to external lineage:

```bash
curl -i -X POST \
 -H "Content-Type: application/json" \
 -H "Authorization: Bearer eyJ0eXAiOiJKV1QiLsecuritytoken..." \
 -H "Accept: application/json" \
 -H "User-Agent: myApplicationName/1.0" \
 -H "X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT" \
 -d "@request_body.json" \
 "https://MYORG-DEV_ACCOUNT.snowflakecomputing.com/api/v2/lineage/external-lineage"
```

Where `request_body.json` conforms to the OpenLineage specification for COMPLETE events. For more information about this JSON payload, see Payload requirements.

### Authentication and authorization of a manual request

The authentication and authorization of a manual request sent to the external lineage endpoint are the same as those in a request sent from
a data tool.

* The header of the request must include a security token from one of the
  [forms of authentication](../developer-guide/snowflake-rest-api/authentication.md) supported by Snowflake REST endpoints.
* The user associated with the security token must have the proper privileges.

### Payload requirements

When you send the JSON payload in a manual request to the external lineage endpoint, the payload must meet the following requirements:

* Must conform to the [OpenLineage specification](https://openlineage.io/apidocs/openapi/).
* Must be a COMPLETE event. That is, the `eventType` property must be `COMPLETE`. Other types of events are ignored.
* The `inputs` property and `outputs` property must be a mix of Snowflake and external objects. You cannot use external lineage
  to establish lineage between two external objects or between two Snowflake objects. If both properties specify the same type of object
  (Snowflake or external), then the request returns a 404 HTTP status code.
* Must contain the following properties:

  + `inputs`
  + `outputs`
  + `eventType`
  + `eventTime`
  + `job`

  You can optionally include the `run` property, which is useful in identifying the job. The payload can contain additional
  properties, but Snowflake ignores them.

#### Minimal payload example

The following example shows a minimal payload that you can send to the external lineage endpoint:

```json
{
   "eventType": "COMPLETE",
   "eventTime": "2025-03-12T06:51:12.000Z",
   "job": {"namespace": "exampleNamespace", "name": "exampleJob"},
   "run": {"runId": "123e4567-e89b-12d3-a456-426614174000"},
   "producer": "https://github.com/OpenLineage/OpenLineage/blob/v1-0-0/client",
   "schemaURL": "https://openlineage.io/spec/0-0-1/OpenLineage.json",
   "inputs": [{"namespace": "snowflake://AXORG-AX_TEST_PP8", "name": "OL_TEST.OL_TEST_SCH.TEST_DEMO"}],
   "outputs": [{"namespace": "postgres://localhost:5432", "name": "PDB.SCH.OUTPUT"}]
}
```

#### Specifying object types

Within the `outputs` array of the payload, you can use the `facets` field to specify the type of the object, which can be any
user-defined string. For example, the following snippet of the payload specifies that the object is of type VIEW:

```json
"outputs": [
    {
        "namespace": "postgres://db.company.com:5432",
        "name": "db.schema.view",
        "facets": {"datasetType": {"datasetType": "VIEW"}},
    },
],
```

If you don’t specify a `facets` field, the type of object defaults to `External Node`.

#### Specifying multiple inputs

If a payload includes more than one input, the resulting lineage shows the output as a downstream object of both inputs. For example, if a payload has input A and B along with an output C, then the lineage shows both A-C and B-C.

## Send requests to remove lineage

You can send a DELETE request to the external lineage endpoint to remove lineage that was established between a Snowflake object and an
external object.

* To break lineage between the source object and target object, use URL query parameters to specify details about the two objects.
* To break lineage between an object and all of its downstream objects, specify the source object without specifying a target object.
* To remove a target object from the lineage graph regardless of how many objects are upstream of it, specify the target object without
  specifying a source object.

A valid request to remove lineage consists of the following method, base URL, and endpoint:

```none
DELETE https://<account_identifier>.snowflakecomputing.com/api/v2/lineage/external-lineage
```

| Query parameter | Description |
| --- | --- |
| `sourceNamespace={namespace}` | Namespace of the source dataset. |
| `sourceName={FQN}` | Fully qualified name of the source dataset. |
| `sourceDatasetType={dataset type}` | Type of the source dataset (for example, TABLE, VIEW, DATASET). By default, the value should be External node. If you provided a value in the `facets` field of the payload when you sent a request to establish lineage, then specify the value that you sent in the payload, not External node. |
| `targetNamespace={namespace}` | Namespace of the target dataset. |
| `targetName={FQN}` | Fully qualified name of the target dataset. |
| `targetDatasetType={dataset type}` | Type of the target dataset (for example, TABLE, VIEW, DATASET). By default, the value should be External Node (`External%20Node`). If you provided a value in the `facets` field of the payload when you sent a request to establish lineage, then specify the value that you sent in the payload, not External node. |

> **Note:**
>
> The values of the query parameters are case sensitive.

### Access control for removing lineage

The user sending a request to remove lineage between objects must have the DELETE LINEAGE privilege on the account.

## Limitations and considerations

* A Snowflake object must be either the INPUT or the OUTPUT of a COMPLETE event. That is, external lineage doesn’t ingest lineage events
  when neither the input data nor the output data is a Snowflake object.
* Snowflake doesn’t support OpenLineage version 2.
* The retention period for external lineage events is one year.
* Snowflake only recognizes COMPLETE lineage events. All other events emitted by a data tool are ignored.
* Lineage from external sources doesn’t appear in the output of the GET_LINEAGE function.
* External lineage doesn’t support the lineage of columns.
* The fully qualified name of a dataset — that is, the input or output — can’t exceed 1000 characters.
* You can’t store more than 10,000 events in the same account. If you reach this limit, you’ll have to delete events before adding new ones.

---
title: External OAuth overview
source: https://docs.snowflake.com/en/user-guide/oauth-ext-overview.md
section: User Guide
---

# External OAuth overview

This topic teaches you how to configure External OAuth servers that use OAuth 2.0 for accessing Snowflake.

External OAuth integrates the customer’s OAuth 2.0 server to provide a seamless SSO experience, enabling external client access to
Snowflake.

Snowflake supports the following external authorization servers, custom clients, and partner applications:

* [Okta](oauth-okta.md)
* [Microsoft Entra ID](oauth-azure.md)
* [Ping Identity PingFederate](oauth-pingfed.md)
* [External OAuth Custom Clients](oauth-ext-custom.md)
* [Microsoft Power BI](oauth-powerbi.md)
* [Sigma](oauth-ext-partner.md)

After configuring your organization’s External OAuth server, which includes any necessary [OAuth 2.0 Scopes](https://oauth.net/2/scope/)
mapping to Snowflake roles, the user can connect to Snowflake securely and programmatically without having to enter any additional
authentication or authorization factors or methods. The user’s access to Snowflake data is dependent on both their role and the role being
integrated into the access token for the session. For more information, refer to Scopes (in this topic).

## Use cases and benefits

1. Snowflake delegates the token issuance to a dedicated authorization server to ensure that the OAuth Client and user properly
   authenticate. The result is centralized management of tokens issued to Snowflake.
2. Customers can integrate their policies for authentication (e.g. multi-factor, subnet, biometric) and authorization
   (e.g. no approval, manager approval required) into the authorization server. The result is greater security leading to more robust data
   protection by issuing challenges to the user. If the user doesn’t pass the policy challenge(s), the Snowflake session is not
   instantiated, and access to Snowflake data does not occur.
3. For programmatic clients that can access Snowflake and users that only initiate their Snowflake sessions through External OAuth, no
   additional authentication configuration (i.e. set a password) is necessary in Snowflake. The result is that service accounts or users
   used exclusively for programmatic access will only ever be able to use Snowflake data when going through the External OAuth configured
   service.
4. Clients can authenticate to Snowflake without browser access, allowing ease of integration with the External OAuth server.
5. Snowflake’s integration with External OAuth servers is cloud-agnostic.

   * It does not matter whether the authorization server exists in a cloud provider’s cloud or if the authorization server is on-premises.
     The result is that customers have many options in terms of configuring the authorization server to interact with Snowflake.

## General workflow

For each of the supported identity providers, the workflow for OAuth relating to External OAuth authorization servers can be summarized as
follows. Note that the first step only occurs once and the remaining steps occur with each attempt to access Snowflake data.

1. Configure your External OAuth authorization server in your environment and the security integration in Snowflake to establish a trust.
2. A user attempts to access Snowflake data through their business intelligence application, and the application attempts to verify the
   user.
3. On verification, the authorization server sends a JSON Web Token (i.e. OAuth token) to the client application.
4. The Snowflake driver passes a connection string to Snowflake with the OAuth token.
5. Snowflake validates the OAuth token.
6. Snowflake performs a user lookup.
7. On verification, Snowflake instantiates a session for the user to access data in Snowflake based on their role.

## Scopes

The scope parameter in the authorization server limits the operations and roles permitted by the access token and what the user can access
after instantiating a Snowflake session.

The ACCOUNTADMIN, GLOBALORGADMIN, ORGADMIN, and SECURITYADMIN roles are blocked by default. If it is necessary to use one or more of these roles,
use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the [EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../sql-reference/parameters.md) account
parameter to FALSE.

* For Okta, PingFederate, and Custom, use the role scope pattern in the following table.
* For Microsoft Entra ID, refer to [Determine the OAuth flow in Microsoft Entra ID](oauth-azure.md)
* If you do not want to manage Snowflake roles in your External OAuth server, pass the static value of SESSION:ROLE-ANY in the scope
  attribute of the token.

The following table summarizes External OAuth scopes. Note that
if you do not define a scope, the connection attempt to Snowflake will fail.

| Scope/Role Connection Parameter | Description |
| --- | --- |
| `session:role-any` | Maps to the ANY role in Snowflake.  Use this scope if the user’s default role in Snowflake is desirable.  The `external_oauth_any_role_mode` security integration parameter must be configured in order to enable ANY role for a given External OAuth Provider. For configuration details, refer to the ANY role section in [Okta](oauth-okta.md), [Microsoft Entra ID](oauth-azure.md), [PingFederate](oauth-pingfed.md), or [Custom](oauth-ext-custom.md).  Note that with a [Power BI to Snowflake integration](oauth-powerbi.md), a PowerBI user cannot switch roles using this scope. |
| `session:role:custom_role` | Maps to a custom Snowflake role. For example, if your custom role is ANALYST, your scope is `session:role:analyst`. |
| `session:role:public` | Maps to the PUBLIC Snowflake role. |

### Using secondary roles with External OAuth

Snowflake supports using [secondary roles](security-access-control-overview.md) with External OAuth.

Snowflake OAuth does not support in-session role switching to secondary roles.

For more information, refer to:

* [Secondary roles with Okta](oauth-okta.md)
* [Secondary roles with Microsoft Entra ID](oauth-azure.md)
* [Secondary roles with PingFederate](oauth-pingfed.md)
* [Secondary roles with Custom Clients](oauth-ext-custom.md)
* [Using secondary roles with Power BI SSO to Snowflake](oauth-powerbi.md)

## Configuring External OAuth support

Snowflake supports the use of partner applications and custom clients that support External OAuth.

Refer to the list below if you need to configure partner applications or custom clients:

* [Configuring partner applications](oauth-ext-partner.md).
* [Configuring custom clients configured by your organization](oauth-ext-custom.md).

## Restricting network traffic for External OAuth

You can associate a [network policy](network-policies.md) with the External OAuth security integration to restrict network traffic from the client to Snowflake as the resource server. This network policy governs login requests and queries against Snowflake.

When you associate a network policy with the security integration, it overrides network policies associated with the user or the account. For more information, see [Network policy precedence](network-policies.md).

To associate a network policy with the External OAuth security integration, set the NETWORK_POLICY parameter when creating or updating the integration. For example:

```sqlexample
CREATE SECURITY INTEGRATION external_oauth_azure_1
  TYPE = external_oauth
  ENABLED = true
  EXTERNAL_OAUTH_TYPE = azure
  EXTERNAL_OAUTH_ISSUER = '<AZURE_AD_ISSUER>'
  EXTERNAL_OAUTH_JWS_KEYS_URL = '<AZURE_AD_JWS_KEY_ENDPOINT>'
  EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = 'upn'
  EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'login_name'
  NETWORK_POLICY = 'allow_private_ip_only';
```

## Error codes

Refer to the table below for descriptions of error codes associated with External OAuth:

| Error Code | Error | Description |
| --- | --- | --- |
| 390318 | OAUTH_ACCESS_TOKEN_EXPIRED | OAuth access token expired. {0} |
| 390144 | JWT_TOKEN_INVALID | JWT token is invalid. |

## Troubleshooting

* Use the [SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN](../sql-reference/functions/system_verify_ext_oauth_token.md) function to determine whether your External OAuth access token is
  valid or needs to be regenerated.
* If you encounter an error message associated with a failed External OAuth login attempt, and the error message has a UUID, you can
  ask an
  administrator that has a MONITOR privilege assigned to their role to use the UUID from the error message to get a more detailed
  description of the error using the [SYSTEM$GET_LOGIN_FAILURE_DETAILS](../sql-reference/functions/system_get_login_failure_details.md)
  function.

---
title: External OAuth partner applications
source: https://docs.snowflake.com/en/user-guide/oauth-ext-partner.md
section: User Guide
---

# External OAuth partner applications

The following External OAuth Partner applications are available to access Snowflake:

* [Microsoft Power BI](oauth-powerbi.md)
* [Sigma](https://help.sigmacomputing.com/hc/en-us/articles/360053705993-OAuth-with-Snowflake)
* [ThoughtSpot](https://docs.thoughtspot.com/software/latest/connections-snowflake-azure-ad-oauth)

**Next Topics:**

* [Power BI SSO to Snowflake](oauth-powerbi.md)

---
title: Failing over account objects
source: https://docs.snowflake.com/en/user-guide/account-replication-failover-failback.md
section: User Guide
---

# Failing over account objects

This topic describes the steps necessary to fail over replicated account objects across multiple accounts in different
[regions](intro-regions.md) for disaster recovery.

For information about the purpose of the failover mechanism and when to use it, see [Introduction to business continuity & disaster recovery](replication-intro.md).

## Prerequisites

1. Enable replication in a set of accounts within the same organization, across multiple regions in one cloud service provider
   or across different cloud service providers.
2. Create a primary failover group that defines the kinds of objects to replicate, and specifies the target accounts
   to which to replicate. You can optionally divide the replicated objects across multiple failover groups, for example if
   some databases should be replicated more frequently than others.
3. Create at least one secondary failover group (replica) of each primary failover group in one or more secondary accounts.
4. Refresh (synchronize) each replica with the latest updates to the objects in the failover group. Perform an
   initial refresh, and set up a schedule to regularly bring the latest changes to each secondary account.

For instructions, see [Replicating account objects and databases](account-replication-config.md).

## Promote a target account to serve as the source account

You can promote a target account to serve as the source account (failover) using Snowsight or
[SQL](account-replication-config.md).

For more information about the kinds of objects you can specify in a failover group,
see [Replication groups and failover groups](account-replication-intro.md).

### Promote a target account to serve as the source account using Snowsight

> **Note:**
>
> Only account administrators can edit a replication or failover group using Snowsight (see
> [Limitations of using Snowsight for replication configuration](account-replication-config.md)).
>
> For the most consistent and reliable failover experience, select all the applicable failover groups and
> connections and promote them all at the same time. We refer to this operation as a *bulk failover*.

To promote a target account to serve as the source account using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md). **Make sure to sign in using the target account**.
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, then select Initiate failover. Doing so brings up a dialog where you make the remaining choices.
4. Select any failover groups to promote. After the failover, the objects specified in those
   failover groups become writable on the newly promoted primary account. Those objects become
   read-only on the account that formerly was the primary and is now a secondary account.
5. Select Next.
6. Select any connections to promote. After the failover, those connections connect to the account
   that you’re promoting to be the new primary account.
7. Select Next.
8. Select Fail over in the confirmation window.
9. If any refresh operations are in progress for the failover groups you selected, you can wait for those refreshes
   to complete, or choose an alternative approach if your failover is urgent and should take priority.

   The default action is to wait for the refreshes to complete. That way, the primary and secondary systems are all
   in a consistent state when the bulk failover runs. Snowflake uses your currently selected warehouse to poll the
   status of the ongoing refreshes. If you don’t have a selected warehouse, you select one now using the
   Select warehouse option.

   Or, you can proceed with the failover immediately by selecting Show advanced options.

   * To fail over only the failover groups that aren’t currently being refreshed, select Exit with current progress.
     In that case, you perform additional refreshes later for the groups that were skipped during the bulk failover.
   * To cancel the refresh operations and continue the failover, select Cancel refreshes and force failover.
     In that case, you might need to clean up any inconsistencies on the secondary system from the interrupted refreshes.

If the failover operation didn’t complete for all failover groups, you can perform another bulk failover. Or you can fail over
the remaining failover groups one at a time, using the procedure in Promote a single failover group to serve as the primary using Snowsight.

### Promote a single failover group to serve as the primary using Snowsight

> **Note:**
>
> Only account administrators can edit a replication or failover group using Snowsight (see
> [Limitations of using Snowsight for replication configuration](account-replication-config.md)).

To promote a single failover group to be the primary using Snowsight, follow these steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md). **Make sure to sign in using the target account**.
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, then select Groups.
4. Locate the failover group that you want to promote, and select the More menu (…) in the last column of the row.
5. Select Fail over, then select Fail over in the confirmation window.

> **Tip:**
>
> You typically use this procedure if you encounter a problem failing over one group, and you need to retry the
> failover only for that group. To promote an entire account to be the primary, select multiple failover groups and connections and
> perform a bulk failover. For more information, see Promote a target account to serve as the source account using Snowsight.

### Promote a target account to serve as the source account using SQL

To promote a target account to serve as the source account using SQL, you sign in to the target account
and execute the [ALTER FAILOVER GROUP … PRIMARY](../sql-reference/sql/alter-failover-group.md) command.

#### Promote a secondary failover group to primary failover group

> **Note:**
>
> The example in this section must be executed by a role with the FAILOVER privilege.

The following example promotes `myaccount2` in the current `myorg` organization to serve as the source account.

1. Sign in to target account `myaccount2`.
2. List failover groups in the account:

   ```sqlexample
   SHOW FAILOVER GROUPS;
   ```
3. Execute the following statement for each secondary failover group you want to promote to serve as the primary failover group:

   ```sqlexample
   ALTER FAILOVER GROUP myfg PRIMARY;
   ```

   > **Note:**
   >
   > During a partial outage in your source region, the replication service might continue to be available and might continue
   > to refresh the secondary failover groups in target regions.
   >
   > To ensure data integrity, Snowflake prevents failover if a refresh operation is in progress. This means you cannot
   > promote a secondary failover group to serve as the primary if it is being refreshed by a replication operation.
   > The ALTER FAILOVER GROUP … PRIMARY command returns an error in this scenario.

#### Resolving failover statement failure due to an in-progress refresh operation

If there is a refresh operation in progress for the secondary failover group you are trying to promote, the failover statement
results in the following error:

```output
Replication group "<GROUP_NAME>" cannot currently be set as primary because it is being
refreshed. Either wait for the refresh to finish or cancel the refresh and try again.
```

To successfully fail over, you must complete the following steps.

1. Select and complete one of the following options:

   > **Important:**
   >
   > Suspending a refresh operation in the SECONDARY_DOWNLOADING_METADATA or SECONDARY_DOWNLOADING_DATA phase
   > might result in an inconsistent state on the target account. For more information, see
   > View the current phase of an in-progress refresh operation.

   1. Suspend future refresh operations for the failover group. If there is an in-progress refresh operation, you must wait for
      it to complete before you can failover:

      ```sqlexample
      ALTER FAILOVER GROUP myfg SUSPEND;
      ```
   2. Suspend future refresh operations *and* cancel a scheduled refresh operation that is currently in progress (if there is one).

      If the in-progress refresh operation was manually triggered, see Cancel an in-progress refresh operation that wasn’t automatically scheduled.

      ```sqlexample
      ALTER FAILOVER GROUP myfg SUSPEND IMMEDIATE;
      ```

      > **Note:**
      >
      > You might experience a slight delay between the time that the statement returns and the time that the cancellation
      > of the refresh operation is finished.
2. Verify no refresh operations are in progress for the failover group `myfg`. The following query
   should return no results:

   ```sqlexample
   SELECT phase_name, start_time, job_uuid
     FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY('myfg'))
     WHERE phase_name <> 'COMPLETED' and phase_name <> 'CANCELED';
   ```

   To see canceled refresh operations for failover group `myfg`, you can execute the following statement:

   ```sqlexample
   SELECT phase_name, start_time, job_uuid
     FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY('myfg'))
     WHERE phase_name = 'CANCELED';
   ```
3. Now you can promote the secondary failover group `myfg` to primary failover group:

   ```sqlexample
   ALTER FAILOVER GROUP myfg PRIMARY;
   ```

#### Resume scheduled replication in target accounts

On failover, scheduled refreshes on all secondary failover groups are suspended.
[ALTER FAILOVER GROUP … RESUME](../sql-reference/sql/alter-failover-group.md) must be executed in each **target account** with a
secondary failover group to resume automatic refreshes.

```sqlexample
ALTER FAILOVER GROUP myfg RESUME;
```

## View the current phase of an in-progress refresh operation

A refresh operation can be safely canceled during most phases of the refresh operation. However, canceling a refresh operation
in the SECONDARY_DOWNLOADING_METADATA or SECONDARY_DOWNLOADING_DATA phase might result in an inconsistent state on the target
account. If the refresh operation has started one of these phases, it proceeds to completion regardless of the availability of
the source account. Allowing the phase to complete before you fail over ensures replicas are in a consistent state.
After the replicas are in a consistent state, you can resume or replay your ingest and transformation pipelines to update the
replicas to the current state.

To view the current phase of an in-progress refresh operation for a failover group, use the Information Schema
[REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL](../sql-reference/functions/replication_group_refresh_progress.md) table function.

For example, to view the current phase of an in-progress refresh operation for failover group `myfg`, execute
the following statement:

```sqlexample
SELECT phase_name, start_time, end_time
  FROM TABLE(
    INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS('myfg')
  );
```

For a list of refresh operations phases, see the [usage notes](../sql-reference/functions/replication_group_refresh_progress.md)
for the function.

## Cancel an in-progress refresh operation that wasn’t automatically scheduled

To cancel an in-progress refresh operation that was not triggered automatically by a replication schedule, you must use the
[SYSTEM$CANCEL_QUERY](../sql-reference/functions/system_cancel_query.md) function:

1. Find the query ID or JOB_UUID for running refresh operations using one of the following options:

   1. Find the query IDs for all running refresh operations:

      ```sqlexample
      SELECT query_id, query_text
        FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY())
        WHERE query_type = 'REFRESH REPLICATION GROUP'
        AND execution_status = 'RUNNING'
        ORDER BY start_time;
      ```

      Use the QUERY_TEXT column to identify the QUERY_ID for failover group refresh operations from the list.
   2. Find the JOB_UUID for an in-progress refresh operation for a specific failover group `myfg`:

      ```sqlexample
      SELECT phase_name, start_time, job_uuid
        FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY('myfg'))
        WHERE phase_name <> 'COMPLETED' and phase_name <> 'CANCELED';
      ```
2. Cancel the refresh operation using the SYSTEM$CANCEL_QUERY function and the QUERY_ID or JOB_UUID:

   ```sqlexample
   SELECT SYSTEM$CANCEL_QUERY('<QUERY_ID | JOB_UUID>');
   ```

   Returns the following output:

   ```output
   query [<QUERY_ID>] terminated.
   ```
3. After you cancel the in-progress refresh operation, continue to the
   next steps.

## Reopen active channels for Snowpipe Streaming in newly promoted source account

Tables in a primary database that are populated by [Snowpipe Streaming are replicated](account-replication-considerations.md)
to secondary databases. After failover, reopen active Snowpipe Streaming channels for tables and re-insert any missing data rows
for the channels:

1. Reopen active channels for the table by calling the [openChannel](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/net/snowflake/ingest/streaming/SnowflakeStreamingIngestClient.html) API.
2. Fetch offset tokens:

   1. Call the [getLatestCommittedOffsetToken](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/net/snowflake/ingest/streaming/SnowflakeStreamingIngestChannel.html#getLatestCommittedOffsetToken()) API or
   2. Execute the [SHOW CHANNELS](../sql-reference/sql/show-channels.md) command to retrieve a list of the active channels of the table.
3. Re-insert data rows for the channel from the fetched offset tokens.

> **Note:**
>
> These steps apply only to Snowpipe Streaming with the Snowflake Ingest SDK; it doesn’t apply to Snowpipe Streaming with the Kafka connector. Follow the steps below to restart the Kafka Connector after failover.

### Snowpipe Streaming and the Kafka connector

If you are using the Kafka connector and Snowpipe Streaming, follow these steps after failover:

1. Update the Kafka connector configuration to point to the newly promoted source account.
2. Execute the SHOW CHANNELS command to retrieve the list of active channels and the offset tokens. Each channel belongs to a
   single partition in the Kafka topic.
3. Manually reset offsets in the Kafka Topic for each of those partitions (channels).
4. Restart the Kafka Connector.

For more information, see:

* [Snowflake Connector for Kafka with Snowpipe Streaming classic](snowpipe-streaming/snowpipe-streaming-classic-kafka.md).
* [Replication and Snowpipe Streaming](account-replication-considerations.md).

---
title: Failing over databases across multiple accounts
source: https://docs.snowflake.com/en/user-guide/database-failover-config.md
section: User Guide
---

# Failing over databases across multiple accounts

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

This topic describes the steps necessary to fail over your replicated databases across multiple accounts in different [regions](intro-regions.md) for disaster recovery.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) can enable and manage failover for a database.
> * Snowflake recommends using the [account replication feature](account-replication-intro.md) to failover
>   databases. [Replication and failover groups](account-replication-intro.md) enable replication of
>   multiple databases and other account objects with point-in-time consistency. Failover groups additionally enable
>   failing over a collection of objects as a unit. For a full list of
>   [feature availability](account-replication-intro.md)
>   and [supported objects](account-replication-intro.md), refer to [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

## Use Snowsight for database replication and failover/failback

> **Attention:**
>
> Managing and monitoring replication and failover/failback in Snowsight are only available to accounts
> using private connectivity.
>
> For all other accounts, see [Use Snowsight to monitor replication](account-replication-monitor.md) and [Replicating account objects and databases](account-replication-config.md).

Account administrators (users with the ACCOUNTADMIN role) can manage replication and failover/failback actions in Snowsight.

See [Web interface for database replication and failover/failback](db-replication-config.md) for instructions on promoting a local database to serve as the primary database.

## Account identifier for replication and failover SQL commands

The example SQL statements in the instructions below use an [account identifier](admin-account-identifier.md) in the format,
`organization_name.account_name`. However, account identifiers in the format `snowflake_region.account_locator` are supported.

For more details, see [Account identifiers for replication and failover](admin-account-identifier.md).

## Prerequisite requirements

1. Enable replication for a primary database in a set of accounts.
2. Create at least one secondary database (i.e. replica) of the primary database in one or more of the accounts specified in Step 1, and regularly refresh (i.e. synchronize) the replica with the latest updates to the primary database.

For instructions, see [Replicating databases across multiple accounts](db-replication-config.md).

## Step 1: Viewing all accounts enabled for replication

Query [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md) to view the list of accounts in your organization in which replication has been enabled.

```sqlexample
SHOW REPLICATION ACCOUNTS;
```

```output
+------------------+---------------------------------+---------------+------------------+---------+-------------------+
| snowflake_region | created_on                      | account_name  | account_locator  | comment | organization_name |
|------------------+---------------------------------+---------------+------------------+---------+-------------------|
| AWS_US_WEST_2    | 2018-11-19 16:11:12.720 -0700   | ACCOUNT1      | MYACCOUNT1       |         | MYORG             |
| AWS_US_EAST_1    | 2019-06-02 14:12:23.192 -0700   | ACCOUNT2      | MYACCOUNT2       |         | MYORG             |
+------------------+---------------------------------+---------------+------------------+---------+-------------------+
```

See the complete list of [Region IDs](admin-account-identifier.md).

## Step 2: Enabling failover for a primary database

> **Note:**
>
> Skip this step if you enabled failover for this primary database in [Replicating databases across multiple accounts](db-replication-config.md).

Enable failover for a primary database to one or more accounts in your organization using an [ALTER DATABASE … ENABLE FAILOVER TO ACCOUNTS](../sql-reference/sql/alter-database.md) statement. The replica of this primary database in any one of these accounts (i.e. a secondary database) can be promoted to serve as the primary database.

Note that enabling failover for a primary database can be done either before or after a replica of the primary database has been created in a specified account.

### Example

Enable failover for primary database `mydb1` to accounts `myaccount2` and `myaccount3`. In this example, suppose the primary database
is stored in the `myaccount1` account and all three accounts belong to the organization, `myorg`. The ALTER DATABASE command must be
executed from `myaccount1`.

```sqlexample
ALTER DATABASE mydb1 ENABLE FAILOVER TO ACCOUNTS myorg.myaccount2, myorg.myaccount3;
```

## Step 3: Promoting a replica database to serve as the primary database

Any replica of a primary database can be promoted to serve as the primary database by executing an [ALTER DATABASE … PRIMARY](../sql-reference/sql/alter-database.md) statement. When promoted, the database becomes writeable. At the same time, the previous primary database becomes a read-only replica database.

Execute the `ALTER DATABASE` statement in the account containing the secondary database that you are promoting.

> **Note:**
>
> To promote a secondary database, the role used to perform the operation must have the OWNERSHIP privilege on the database.

### Example

Promote a secondary database to serve as the primary database.

```sqlexample
ALTER DATABASE mydb1 PRIMARY;
```

Verify that the former secondary database was promoted successfully.

```sqlexample
SHOW REPLICATION DATABASES;
```

---
title: Federated authentication and SSO troubleshooting
source: https://docs.snowflake.com/en/user-guide/errors-saml.md
section: User Guide
---

# Federated authentication and SSO troubleshooting

This topic provides information to help troubleshoot a federated authentication environment, including the error codes and messages that
are generated during an unsuccessful user login attempt.

## Password-related errors

A user with an expired Snowflake password cannot log in with SSO even though they are not using the password. This behavior is
intentional and prevents someone from logging in using expired credentials.

SSO logins are also rejected if an administrator set the `MUST_CHANGE_PASSWORD` parameter to TRUE when creating the user, but the user
has not changed the password yet.

## Error codes

Errors are generated for each failed login attempt. These errors can be obtained from the [Snowflake Information Schema](../sql-reference/info-schema.md) or the
[ACCOUNT_USAGE schema](../sql-reference/account-usage.md):

* The Snowflake Information Schema provides data from within the past 7 days and can be queried using
  the [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](../sql-reference/functions/login_history.md) table functions.
* The [LOGIN_HISTORY](../sql-reference/account-usage/login_history.md) view in the ACCOUNT_USAGE schema provides similar data from within the past year.

### Federated authentication error codes

The table below contains the error codes and messages related to federated authentication.

| Error Code | Error | Description |
| --- | --- | --- |
| 390136 | FED_REAUTH_PENDING | Authentication response is pending from IDP. |
| 390137 | FED_REAUTH | Federated authentication request URL is generated. |
| 390138 | FED_REAUTH_TIMEOUT | Timeout waiting for authentication response from IDP. |
| 390139 | AUTHENTICATOR_NOT_SUPPORTED | The specified authenticator is not accepted by your Snowflake account configuration. Please contact your local system administrator to get the correct URL to use. |
| 390140 | FED_PASSWORD_EXPIRED | Identity Provider (IdP) password has expired. Contact your IdP team. |
| 390191 | USERNAMES_MISMATCH | The user you were trying to authenticate as differs from the user currently logged in at the IDP. |

### SAML error codes

Troubleshooting a login failure differs depending on whether the error message has an UUID.

If you encounter an error message associated with a failed SAML SSO login attempt, and the error message does not have a UUID, then ensure
the user exists. If the user exists, then the SAML response is invalid and the number of login attempts is too high.

If you encounter an error message associated with a failed SAML SSO login attempt, and the error message has a UUID, you can ask an
administrator that has MONITOR privilege assigned to their role to get a more detailed description of the error by following the steps
below:

1. Find the UUID in the error message:

   > ```output
   > SAML response is invalid or matching user is not found. Contact your local system administrator. [eb55b777-50a4-4db5-b231-9ee457fb3981]
   > ```
2. Use the UUID as an argument to the SYSTEM$GET_LOGIN_FAILURE_DETAILS function, and extract the error using the
   [JSON_EXTRACT_PATH_TEXT](../sql-reference/functions/json_extract_path_text.md) function:

   > ```sqlexample
   > SELECT JSON_EXTRACT_PATH_TEXT(SYSTEM$GET_LOGIN_FAILURE_DETAILS('eb55b777-50a4-4db5-b231-9ee457fb3981'), 'errorCode');
   > ```
3. Find the error description in the table below:

   > | Error Code | Error | Description |
   > | --- | --- | --- |
   > | 390133 | SAML_RESPONSE_INVALID | The SAML response was invalid for an unspecified reason, although it is most likely malformed (this is also used if there is an error on parsing). |
   > | 390165 | SAML_RESPONSE_INVALID_SIGNATURE | The SAML response contains an invalid Signature. |
   > | 390166 | SAML_RESPONSE_INVALID_DIGEST_METHOD | The SAML response contains an invalid “DigestMethod” attribute or omits it entirely. |
   > | 390167 | SAML_RESPONSE_INVALID_SIGNATURE_METHOD | The SAML response contains an invalid “SignatureMethod” or omits it entirely. |
   > | 390168 | SAML_RESPONSE_INVALID_DESTINATION | The “Destination” attribute in the SAML response does not match a valid destination URL on the account. |
   > | 390169 | SAML_RESPONSE_INVALID_AUDIENCE | The SAML response does not contain exactly one audience or the audience URL does not match what we expect the audience URL to be. |
   > | 390170 | SAML_RESPONSE_INVALID_MISSING_INRESPONSETO | The “InResponseTo” attribute in the SAML assertion is missing. |
   > | 390171 | SAML_RESPONSE_INVALID_RECIPIENT_MISMATCH | The “Recipient” attribute does not match a valid destination URL. |
   > | 390172 | SAML_RESPONSE_INVALID_NOTONORAFTER_VALIDATION | This typically indicates that the time in which the SAML assertion is valid has expired. |
   > | 390173 | SAML_RESPONSE_INVALID_NOTBEFORE_VALIDATION | This typically indicates that the time in which the SAML assertion is valid has not yet come. |
   > | 390174 | SAML_RESPONSE_INVALID_USERNAMES_MISMATCH | The login names do not match during re-authentication. |
   > | 390175 | SAML_RESPONSE_INVALID_SESSIONID_MISSING | During re-authentication, we were unable to find a session corresponding to the user. |
   > | 390176 | SAML_RESPONSE_INVALID_ACCOUNTS_MISMATCH | During re-authentication, the names of the accounts were found to not match. |
   > | 390177 | SAML_RESPONSE_INVALID_BAD_CERT | The x.509 certificate contained in the SAML response is either malformed or does not match the expected certificate. |
   > | 390178 | SAML_RESPONSE_INVALID_PROOF_KEY_MISMATCH | The proof keys do not match with respect to the authentication request ID. |
   > | 390179 | SAML_RESPONSE_INVALID_INTEGRATION_MISCONFIGURATION | The SAML IdP configuration is invalid. |
   > | 390180 | SAML_RESPONSE_INVALID_REQUEST_PAYLOAD | During authentication, using an invalid payload or using an invalid federated OAuth connection string. |
   > | 390181 | SAML_RESPONSE_INVALID_MISSING_SUBJECT_CONFIRMATION_BEARER | The Subject confirmation with Bearer method is missing and cannot be validated. |
   > | 390182 | SAML_RESPONSE_INVALID_MISSING_SUBJECT_CONFIRMATION_DATA | The Subject confirmation data is missing in the assertion. |
   > | 390183 | SAML_RESPONSE_INVALID_CONDITIONS | The SAML assertion is not valid for a reason that is different than the preceding conditions in this table. |
   > | 390184 | SAML_RESPONSE_INVALID_ISSUER | The SAML Response contained an issuer/entityID value different from the one configured in the SAML IDP Configuration. |

---
title: FedRAMP (Moderate and High)
source: https://docs.snowflake.com/en/user-guide/cert-fedramp.md
section: User Guide
---

# FedRAMP (Moderate and High)

This topic describes how Snowflake supports customers with FedRAMP compliance requirements.

## Understanding FedRAMP compliance requirements

The Federal Risk and Authorization Management Program (FedRAMP) is a program established to provide an efficient and effective risk based
approach to use cloud services by the federal government. This program provides the ability for government agencies to leverage cloud
technologies while ensuring these technologies meet the stringent requirements and security necessary to protect federal information.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

You can view the latest of Snowflake’s FedRAMP authorizations within the
[FedRAMP Marketplace](https://marketplace.fedramp.gov/#!/products?sort=productName&productNameSearch=snowflake).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: File formats to unload data
source: https://docs.snowflake.com/en/user-guide/data-unload-prepare.md
section: User Guide
---

# File formats to unload data

This topic provides an overview of supported data file formats for unloading data.

## Supported file formats

The following file formats are supported:

> | Structured/Semi-structured | Type | Notes |
> | --- | --- | --- |
> | Structured | Delimited (CSV, TSV, etc.) | Any valid singlebyte delimiter is supported; default is comma (i.e. CSV). |
> | Semi-structured | JSON, Parquet |  |

File format options specify the type of data contained in a file, as well as other related characteristics about the format of the data. The file format options you can specify are different depending on the type of data you are unloading to. Snowflake provides a full set of file format option defaults.

### Semi-structured data

When unloading to JSON files, Snowflake outputs to the [NDJSON](https://github.com/ndjson/ndjson-spec) (newline delimited JSON) standard format.

## Specify file format options

Individual file format options can be specified in any of the following places:

* In the definition of a table.
* In the definition of a named stage. For more information, see [CREATE STAGE](../sql-reference/sql/create-stage.md).
* Directly in a [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command when unloading data.

In addition, to simplify data unloading, Snowflake supports creating named file formats, which are database objects that encapsulate all of the required
format information. Named file formats can then be used as input in all the same places where you can specify individual file format options, thereby
helping to streamline the data unloading process for similarly-formatted data.

Named file formats are optional, but are recommended when you plan to regularly unload similarly-formatted data.

### Create a named file format

You can create a file format using either the web interface or SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer. Then select the *<db_name>* » File Formats.
>
> SQL:
> :   [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md)

For detailed descriptions of all the file format options, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).

#### Examples

The following example creates a named CSV file format with a specified field delimiter:

> ```sqlexample
> CREATE OR REPLACE FILE FORMAT my_csv_unload_format
>   TYPE = 'CSV'
>   FIELD_DELIMITER = '|';
> ```

The following example creates a named JSON file format:

> ```sqlexample
> CREATE OR REPLACE FILE FORMAT my_json_unload_format
>   TYPE = 'JSON';
> ```

---
title: Filter query results in dashboards and worksheets
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-filters.md
section: User Guide
---

# Filter query results in dashboards and worksheets

You can filter your query results in dashboards and SQL worksheets using system filters, available to all roles in Snowflake,
or with custom filters created by administrators.

## Create custom filters

Custom filters let you change the results of a query without directly editing the query.

Filters are implemented as special keywords that resolve as a subquery or list of values, which are then used in the execution of a query.
As a result, there are some limitations when using a filter in a SQL query. See Specify a filter in a SQL query.

> **Note:**
>
> Anyone in your account can view and use a custom filter after it is created. A custom filter has an associated role,
> but that role does not limit filter visibility.

### Grant permission to create custom filters

To let a user create custom filters, a user with the ACCOUNTADMIN role must grant the relevant permissions to a role granted to that user.
You can only use Snowsight to grant roles the ability to create custom filters.

To grant a role permission to create custom filters for your account, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Dashboards or another project tool.
3. Select  and, if in a worksheet, select Manage Filters.
4. In the dialog that appears, select Edit Permission.
5. In the Filter Permissions dialog, select the roles you want to grant the ability to create filters to.
6. Select Save.

### Create a custom filter

You must use Snowsight to create a filter, and you must use a role with permissions to create custom filters.

To create a custom filter, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Dashboards or another project tool.
3. Select  and, if in a worksheet, select Manage Filters.
4. In the Filters dialog that appears, select + Filter.
5. To add a filter, complete the following:

   > 1. For Display Name, enter a name for the filter. This name appears on the filter when selecting the filter on a worksheet or dashboard.
   > 2. For SQL Keyword, enter a unique keyword to insert into queries. Use the format `:<string>`, without spaces.
   >    For example: `:page_path`.
   > 3. For Description, enter a description of the filter.
   > 4. For Role, select a role to associate with the filter and run the query used to populate filter values, if the filter is based
   >    on a query. Only roles with permissions to create custom filters appear in the drop-down list.
   >    See Manage ownership of custom filters for more details.
   > 5. For Warehouse, select a warehouse to use to refresh filter values, if the filter is based on a query.
   >    The owner role for the filter must have the USAGE privilege on the warehouse you select.
   >    If you want to run and validate your query as part of these steps, the warehouse must be running.
   > 6. For Options via, choose whether the filter values are populated by a query or a list:
   >
   >    > * If you select Query, select Write Query and see Write a query to populate a filter for guidance writing a
   >    >   filter query.
   >    > * If you select List, do the following:
   >    >
   >    >   1. Select Edit List.
   >    >   2. Optionally, for Name, enter a name for the list item. The name appears in the drop-down list for the filter.
   >    >      If you do not provide a name, the Value is used.
   >    >   3. For Value, enter the value of the column name to use in the filter.
   >    >   4. Continue adding name and value pairs until your list is complete, then select Save.
6. In the Add Filter dialog, for Value Type, choose whether the list items are Text or Number types of data.
7. If you want users to be able to select multiple items in the drop-down list of filter options,
   turn on the toggle for Multiple values can be selected.
8. If you want users to be able to see results for all items in the column, turn on the toggle for Include an “All” option, then select
   how you want the All option to function:

   * Select Any value to have the All in the filter mean that the column to which the filter applies can have any value in
     the results, whether or not the value exists in the filter list.
   * Select Any value in list of options to have All in the filter mean that the column to which the filter applies contains
     any item in the filter list.
9. If you want users to be able to see results for items not specified in the filter, turn on the toggle for Include an “Other” option.
10. Select Save.
11. Select Done to close the Filters dialog.

#### Write a query to populate a filter

To populate a list of filter options from a query, your query must follow certain guidelines:

* Must return the columns `name` and `value`.
* Can return the optional column `description`.
* Can return other columns, but those do not appear in the drop-down filter list.

A filter can only run one query at a time. You cannot run multiple queries to generate the list of filter options, for example by running
one query to return the `name` column and a second query to return the `value` column.

> **Note:**
>
> The query used to populate a list of filter options is run as the user that created (or last modified) the filter.
> Because anyone in your account can view and use a custom filter after it is created, make sure that the list of
> filter options produced by your query do not contain protected or sensitive data.

After you write your filter query and add it in the New filter dialog, do the following to finish setting up your query filter:

1. Select Done to save your filter query and return to the Add Filter dialog.
2. Optionally change the default refresh option from Refresh hourly to Never refresh or Refresh daily. For details
   and considerations for filter refresh options, see Manage refresh frequency for a custom filter.
3. Return to the steps for creating a custom filter to finish creating your filter. See Create a custom filter.

## Review and manage custom filters in an account

To review custom filters in your account, open a worksheet or dashboard and then select .

To make changes to any filters, such as changing the refresh frequency for the query used to populate a custom filter list,
you must have the ACCOUNTADMIN role or a role with permissions to manage filters.
See Manage refresh frequency for a custom filter.

### Manage ownership of custom filters

Each custom filter has an associated role. Anyone with that role can edit or delete the filter.
Users with the ACCOUNTADMIN role can view and edit every filter in the account.

If the role associated with a filter is dropped, the role dropping the filter role does not inherit ownership of the custom filter. Instead, a user with the ACCOUNTADMIN role can edit the filter and change the role associated with the filter.

### Manage refresh frequency for a custom filter

A custom filter that is populated by a SQL query also has a refresh frequency. The refresh frequency can be hourly, daily, or never.

The filter runs based on when it was saved and how long it took to run the query that refreshes the filter options.

For example, if you save a filter that has an hourly query refresh frequency at 10:07 AM, the first refresh query runs at or after 11:07 AM.
If a large number of filter refresh queries end up scheduled to run at the same time, the queries are queued to limit the number of
filter refresh queries running at the same time. The next filter refresh is based on when the last refresh completed. In this example, if
the query refresh at 11:07 AM takes 20 minutes to complete, the next refresh query would run at or after 12:27 PM.

Filter refreshes run as the user that created or last modified the filter, and are visible in Query History as
one of the types of Queries executed by user tasks.
See [Monitor query activity with Query History](ui-snowsight-activity.md) for details on using Query History.

To determine which filter is responsible for a filter query refresh, you must open the list of filters and open each filter to
view the details.

> **Note:**
>
> Setting a custom filter’s refresh frequency can lead to increased consumption on your virtual warehouse. The virtual warehouse will run
> the underlying query according to the configured schedule, even if no user has the filter open in their web browser. The cost incurred depends
> on the query complexity and the refresh schedule that you set.

#### Troubleshoot failed filter query refreshes

Refreshes of the filter query can fail for one of the following reasons:

* The user that created or last modified the filter has been dropped or disabled in Snowflake.
* The user is inactive because they have not signed in for 3 months.

It is not possible to see which users created or last modified a given filter. If you have filters that are failing to refresh,
you might see successful authentication attempts by the WORKSHEETS_APP_USER user followed by failed authentication
attempts from a user in the [LOGIN_HISTORY view](../sql-reference/account-usage/login_history.md) view of the ACCOUNT_USAGE schema in the
shared SNOWFLAKE database.

For example, you can use the following query to identify login activity that uses an OAuth access token from the previous two days:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY
WHERE
    FIRST_AUTHENTICATION_FACTOR = 'OAUTH_ACCESS_TOKEN'
    AND
    REPORTED_CLIENT_TYPE = 'SNOWFLAKE_UI'
    AND
    EVENT_TIMESTAMP > DATEADD('DAY', -2, CURRENT_DATE())
ORDER BY
    EVENT_TIMESTAMP DESC;
```

Failed authentication attempts associated with a failed query refresh frequency would happen at the same time each day or each hour,
depending on the custom filter refresh frequency.

## Specify a filter in a SQL query

You can use a system filter or a custom filter in a SQL query.
You cannot use a filter in a stored procedure or a user-defined function (UDF).

To add a filter to your SQL query, use one of the following formats:

* Specify the filter as part of a SELECT statement, like `SELECT :<filter_name>(<col_name>)`.
* Specify the filter using an equals sign as the comparator. For example:

  + `WHERE <col_name> = :<filter_name>`
  + `WHERE <:filter_name> = <col_name>`
  + `<value_a>:<value_b>::string = <:filter_name>`

You can only use an equals sign as the comparator for a filter, and as such, cannot use a filter with
[LIKE](../sql-reference/functions/like.md) or [CONTAINS](../sql-reference/functions/contains.md).

The column to which the filter applies must also match the value type expected by the filter:

* For a custom filter set to use a value type of text, the column must be a text string or cast to a text string in the query.
  See [Data types for text strings](../sql-reference/data-types-text.md).
* For a custom filter set to use a value type of number, the column must be a numeric data type. See [Numeric data types](../sql-reference/data-types-numeric.md).
* For a system filter, the column must be a TIMESTAMP data type. See [Date & time data types](../sql-reference/data-types-datetime.md).

When you add a filter to your SQL query and then use the drop-down list to choose a filter option, the SQL syntax of your query is
changed. For details about how the SQL syntax is changed when different options in the list are selected, refer to the following table:

Filter SQL reference

| Filter option selected | SQL used |
| --- | --- |
| List item | `<col> = <list_item>` |
| Multiple list items selected | `<col> IN (<list_item>, <list_item>)` |
| All, with Any value specified | `true` |
| All, with Any value in list of options specified | `<col> IN (<list_item>, <list_item>, ... )` |
| Other | `<col> NOT IN(<list_item>, <list_item>, ... )` |

### Applying and saving filters

When you change the options selected in a filter, the option to apply your changes appears.
When you select Apply, the worksheet or dashboard runs and updated filtered results appear,
letting you review the changes without saving.

After you apply changes to a filter on a dashboard, the option to save your changes appears. When you select Save, the
changes you made to the dashboard are saved and available to other users of the dashboard.

For example, you might select Apply to change a filter to see results from All Time, but you don’t want the dashboard to run
over such a large volume of data the next time someone opens the dashboard, so you do **not** select Save.
After you run your dashboard over all time, you change the date range filter to Last 7 days, select Apply to run the dashboard,
and then select Save to save that default filter value for dashboard users.

## Snowsight system filters

The following system filters are available to all roles:

* `:daterange`

  + Filters a column by a date range, such as Last day, Last 7 days, Last 28 days, Last 3 months,
    Last 6 months, Last 12 months, All time, or a custom date range.

    > **Note:**
    >
    > The date range filter always uses the UTC time zone and is not affected by the [TIMESTAMP_INPUT_FORMAT](../sql-reference/parameters.md) parameter.

    Defaults to Last day.
* `:datebucket`

  + Groups aggregate data by a period of time, such as Second, Minute, Hour, Day, Week,
    Month, Quarter in calendar months, or Year.

    Defaults to Day.

These filters cannot be edited or dropped.

### Example: Working with date filters

For example, given a table with order data, such as the ORDERS table in the SNOWFLAKE_SAMPLE_DATA database and TPCH_SF1 schema, you
might want to query the table and group the results by a specific time bucket, such as by day or by week, and specify a specific date range
for which to retrieve results.

To do so, you can write a query as follows:

```sqlexample
SELECT
    COUNT(O_ORDERDATE) as orders,
    :datebucket(O_ORDERDATE) as bucket
FROM
    SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
WHERE
    O_ORDERDATE = :daterange
GROUP BY
    :datebucket(O_ORDERDATE)
ORDER BY
    bucket;
```

In this example, you:

* Count the number of orders and retrieve details about the order date from the ORDERS table.
* Filter your results by a specific date range by including the `:daterange` system filter in your WHERE clause.
* Group your results by a specific period of time by including the `:datebucket` system filter in your GROUP BY clause.
* Sort the results from earliest to latest time period by including the ORDER BY clause.

When you add filters to your query, corresponding filter buttons appear at the top of your worksheet or dashboard:

To manipulate the results that you see from your query, use the filters to select specific values.

For this example, set the Group by filter, which corresponds to the date bucket filter, to group by `Day`. Set the other
filter, which corresponds to the date range filter, to `All time`.

When you select Apply and apply the filter to your results, the results are grouped by day and results like the following output
appear:

```output
+--------+------------+
| orders |  buckets   |
+--------+------------+
|    621 | 1992-01-01 |
|    612 | 1992-01-02 |
|    598 | 1992-01-03 |
|    670 | 1992-01-04 |
+--------+------------+
```

You can select a different date bucket to show a different grouping of data. For example, to view weekly order data, set the Group by
filter to `Week` and select Apply. Results like the following output appear:

```output
+--------+------------+
| orders |  buckets   |
+--------+------------+
|   3142 | 1991-12-30 |
|   4404 | 1992-01-06 |
|   4306 | 1992-01-13 |
|   4284 | 1992-01-20 |
+--------+------------+
```

---
title: Find the account name for a Snowflake Open Catalog account
source: https://docs.snowflake.com/en/user-guide/opencatalog/find-account-name.md
section: User Guide
---

# Find the account name for a Snowflake Open Catalog account

In addition to an account locator, a Snowflake Open Catalog account also has an account name. You might need the account name for tasks like
creating a catalog integration for Snowflake Open Catalog. You can use this catalog integration to query a table in Snowflake Open Catalog
using Snowflake or to sync a Snowflake-managed table with Open Catalog. For more information, see
[Configure a catalog integration for Snowflake Open Catalog](https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-open-catalog).

1. Sign in to Open Catalog.
2. To open the user menu, select your username on the bottom-left corner of the page. In the user menu, under Account, note the account name
   for your Open Catalog account.

---
title: Follow alternative troubleshooting steps
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/alternate-steps.md
section: User Guide
---

# Follow alternative troubleshooting steps

> **Note:**
>
> The following steps are not necessary if you can perform tests using Snowflake’s [connectivity troubleshooting tools](snowflake-tools.md). Follow these steps if using the troubleshooting tools is not an option.

For platform specific instructions, see:

* [MacOS and Linux troubleshooting steps](mac-linux.md)
* [Windows troubleshooting steps](windows.md)
* [Browser test](browser-test.md)

After completing these instructions, proceed to [follow-up actions](followup-actions.md).

---
title: Follow-up actions
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/followup-actions.md
section: User Guide
---

# Follow-up actions

After completing the troubleshooting steps using the Snowflake tools mentioned in [Use Snowflake troubleshooting tools](snowflake-tools.md) or the platform specific instructions mentioned in [Follow alternative troubleshooting steps](alternate-steps.md), you should perform the following steps based on whether the connection test succeeds or fails.

## If the connection test succeeds

* Ensure the certificate issuer matches a trusted issuer, such as a cloud-based provider. A discrepancy might indicate an SSL inspection or an intermediary modifying the traffic, which Snowflake does not support.
* If the issuer does not match, contact your network team to address potential SSL inspection issues. Provide them with the output and request allowlisting of necessary URLs in the [Snowflake allowlist](../../sql-reference/functions/system_allowlist.md).

## If the connection test fails

* If connectivity tests fail, Snowflake recommends working with your network team and asking them to double-check your network settings, firewall rules, and proxy configurations. Verify that you have followed Snowflake’s suggestions in the [Common connectivity issues and resolutions](common-issues.md) section.
* If the troubleshooting steps or your system or network do not resolve the issue, and your system or network administrators verified the respective proxies and appliances are configured correctly, open a case with Snowflake support. Provide all relevant details and test outputs to facilitate a quick resolution. Collaborate with your networking team to ensure all necessary URLs and ports are accessible as per Snowflake’s requirements.
* Please note that remediation of these issues in collaboration with [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) requires the engagement of teams responsible for managing the client network and network security, as Snowflake personnel does not have the authority to make changes outside its own managed networks.
* Snowflake Support might instruct you to collect the Snowflake driver and connector log files right after reproducing the issue and attach them to your Support ticket to Snowflake. You might also be instructed to collect the results of the [browser test](browser-test.md).

## Collect driver and connector log files

The following links explain how you can collect logs if requested by [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support):

* [.NET log files](https://github.com/snowflakedb/snowflake-connector-net?tab=readme-ov-file#logging)
* [Go log files](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Logging)
* [JDBC log files](../../developer-guide/jdbc/jdbc-configure.md)
* [ODBC log files](../../developer-guide/odbc/odbc-parameters.md)
* [Node.js log files](../../developer-guide/node-js/nodejs-driver-logs.md)
* [Snowflake Connector for Python log files](../../developer-guide/python-connector/python-connector-example.md)
* [Snowflake CLI](../../developer-guide/snowflake-cli/connecting/configure-cli.md)
* [SnowSQL log files](../../developer-guide/python-connector/python-connector-example.md)

---
title: Generate descriptions with Snowflake Cortex
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-cortex-descriptions.md
section: User Guide
---

# Generate descriptions with Snowflake Cortex

You can use Snowsight and the power of the
[Snowflake Cortex COMPLETE function](../sql-reference/functions/complete-snowflake-cortex.md) to automatically generate descriptions for a
column, table, or view. The Cortex Powered Object Descriptions feature leverages Snowflake-hosted large language models (LLMs) to evaluate
object metadata and, if desired, sample data to generate the description.

The generated description, once saved, is preserved in the COMMENT property of the column, table, or view. You can view the
description anywhere the COMMENT property is displayed, which includes the following:

* The Table Details and View Details tabs in Snowsight.
* The Columns tab for the table or view in Snowsight.
* The output of a [DESCRIBE TABLE](../sql-reference/sql/desc-table.md) command.
* The output of the Account Usage [TABLES](../sql-reference/account-usage/tables.md) view.

A user with *any privilege* on the table, view, or column can view the description after it is saved.

> **Note:**
>
> You can also call a stored procedure to programmatically generate object descriptions using Snowflake Cortex. For more information, see
> [Using SQL to automatically generate object descriptions](sql-cortex-descriptions.md).

## Cortex descriptions access control requirements

To use the Cortex Powered Object Descriptions feature, you must have:

* The [SNOWFLAKE.CORTEX_USER database role](../sql-reference/snowflake-db-roles.md).
* The USAGE privilege on a warehouse.

You must also set the [CORTEX_MODELS_ALLOWLIST](../sql-reference/parameters.md) parameter to allow access to the `mistral-7b` and
`llama3.1-8b` models. By default, this parameter is set to `'All'`, which allows access to all models. If the parameter has been
changed, ensure that these models are included. For more information about controlling model access with this parameter, see
[Account-level allowlist parameter](snowflake-cortex/aisql.md).

### LLM regional requirements

Your region must support the LLM used by Snowflake Cortex to generate the descriptions. If you have the required privileges, but do not see
this feature, check the [availability of the COMPLETE function](snowflake-cortex/aisql.md). If the COMPLETE function is not
supported in your region, you need to enable [cross-region inference](snowflake-cortex/cross-region-inference.md) to use the
feature.

## Supported objects

You can generate descriptions for the following objects:

* All table types
* Views
* Materialized views
* Columns that are in tables and views.

## Create, edit, and save descriptions with Snowflake Cortex

The steps to generate and edit Snowflake Cortex Powered Descriptions are in the following subsections.

### Generate and save descriptions

To generate and save a description for a table or view, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the OWNERSHIP privilege.
2. Navigate to the table or view for which you want to generate descriptions.
3. If prompted, select a warehouse.
4. On the Table Details tab or View Details tab, select Generate with Cortex.
5. If you want to edit the description, select the pencil icon and edit the description.
6. Select Save.

> **Note:**
>
> Users with the OWNERSHIP privilege can execute the following to let users with the role `my_role` generate descriptions. In this example, the user has an ACCOUNTADMIN role:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> GRANT USAGE ON WAREHOUSE ai_wh TO ROLE my_role;
> GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE my_role;
> ```

### Create descriptions for all columns at once

Snowsight lets you generate descriptions for multiple columns at once, with an limit of 50 columns at a time. To generate
descriptions for all columns in a table or view with a single action, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. Navigate to the table or view that contains the columns.
3. If prompted, select a warehouse.
4. Select the Columns tab.
5. Select Generate Descriptions in the toolbar.
6. If prompted, decide whether to use sample data.
7. If you want to edit a description, select the pencil icon.
8. Select the columns you want to save.
9. Select Save.
10. If your table or view has more than 50 columns and you want to generate descriptions for the remainder of the columns, repeat this
    process.

### Create descriptions for a single column

To generate a description for a single column, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. Navigate to the table or view that contains the columns.
3. If prompted, select a warehouse.
4. Select the Columns tab.
5. Find the column, hover over its row in the Description column, and then select Generate with Cortex.
6. If prompted, decide whether to use sample data.
7. If you want to edit the description, select the pencil icon.
8. Select Save.

### Overwrite existing descriptions

To replace a user-specified description with a generated description, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. Navigate to the table or view for which you want to edit descriptions.
3. Select a warehouse if one is not already in use.
4. Edit the descriptions for tables, views, and columns:

   * Tables and views: In the Table Details tab, select the pencil icon to edit the existing description, and select
     Generate with Cortex.
   * Columns: In the Columns tab, select the pencil icon for existing descriptions, and select Generate with Cortex.
5. Select Save.

## Generate descriptions without saving

To generate a description of a table or view, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the SELECT privilege.
2. Navigate to the table or view for which you want to generate descriptions.
3. If prompted, select a warehouse.
4. On the Table Details tab or View Details tab, select Describe Table.

> **Note:**
>
> The table owner can’t save the descriptions generated by selecting the Describe Table button. If you are a table owner who wants to
> edit and save descriptions, select Generate with Cortex in the Description section of the page.

## Sample data inputs

When generating a description for a column, you can rely only on metadata, or you can choose to use sample data to
improve the Snowflake Cortex Powered Description. Sample data refers to data within a particular column that is evaluated when you
use Snowflake Cortex to generate descriptions. If you choose to use sample data, Snowflake uses a portion of the sample data to generate the
description, which leads to more accurate descriptions. Sample data is not stored by Snowflake as Usage Data.

The decision to use sample data is specific to the individual user. The first time you generate a column description in a browser
session, you will be prompted to decide whether to use sample data. The pop-up box defaults to yes and allows you to choose to disable
sample data before proceeding. Your browser stores your response to this question for the duration of your
[Snowflake session](session-policies.md) and you won’t be asked again until your next session. You can also use your
[User Profile](ui-snowsight-profile.md) to set your preference for whether to use sample data.

> **Note:**
>
> Sample data can cross regional boundaries if the region supports Snowflake Cortex. For more information, see
> LLM regional requirements.

## Cost considerations

Generating descriptions incurs the following costs:

* Credits consumed by the warehouse in use.
* Credits charged for the use of Snowflake Cortex with smaller LLMs like Mistral-7b and Llama 3.1-8b. These charges appear on a bill as
  AI-Services, which includes all uses of Snowflake Cortex.

## Legal Notices

This feature relies on the COMPLETE function to generate a recommended object description, which the user may save (with or without
revision) or reject. When the user initiates the description generation, Usage Data may be collected through the COMPLETE function.

Until a description is explicitly saved by the user, it is not retained by Snowflake. If the user saves the description, an object comment
is created. The saved comment is stored as a [metadata field](../sql-reference/metadata.md).

For additional information about the use of AI, see [Snowflake AI and ML](../guides-overview-ai-features.md).

---
title: Getting started with differential privacy
source: https://docs.snowflake.com/en/user-guide/tutorials/diff-privacy.md
section: User Guide
---

Differential Privacy

Getting Started

# Getting started with differential privacy

## Introduction

This tutorial demonstrates how to protect sensitive data using a differential privacy policy so that you can share it safely with analysts.

### What you will learn

In this tutorial you will learn how to do the following:

* Create a differential privacy policy.
* Apply that privacy policy to a table to protect it with differential privacy.
* Define privacy domains for a table.
* Run a query on a table protected by differential privacy.
* Determine the amount of noise present in query results.

This tutorial does not fully explain the key concepts of differential privacy, such as [noise](../diff-privacy/differential-privacy-overview.md),
[privacy budgets](../diff-privacy/differential-privacy-admin-privacy-budgets.md), and
[privacy domains](../diff-privacy/differential-privacy-privacy-domains.md). This tutorial focuses on how to apply differential
privacy to your data.

### About admins and analysts

You’ll be assuming two personas in this tutorial:

* The admin, who has privileges to the raw data and manages differential privacy policies on a table.
* The analyst, who runs queries on this protected data.

In real-world use cases these might be two different people or groups of people, or they could be one person who wants to analyze and
share protected results safely with others.

While this tutorial shows how to run queries on protected data, it is intended primarily to show how to implement differential privacy
rather than how to consume it.

### Prerequisites

* You must be on an account with **Enterprise edition or above**.
* You must be able to **use the ACCOUNTADMIN role**.

> **Important:**
>
> In this tutorial you will perform all of the admin persona steps using the ACCOUNTADMIN role. In general practice, though, you should use roles with privileges specifically defined for the action you’re performing. The privileges required to create and apply privacy policies are [described here](../diff-privacy/differential-privacy-overview.md).

## Create roles, a warehouse, and data

In this section, you will perform the following setup steps:

* Create a role for the analyst.
* Create the warehouse used to execute the queries against the protected data.
* Create mock sensitive data that will be protected by the privacy policy.

None of these setup steps are specific to differential privacy policies. If there already exists a suitable role, warehouse, and/or dataset, you can use those instead.

### Create the analyst role

In a Snowsight worksheet or other environment that is connected to run Snowflake SQL on your Snowflake account, run the following commands
to create the analyst role and assign it to yourself:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE dp_tutorial_analyst;

-- You can find your own user name by running "SELECT CURRENT_USER();"
GRANT ROLE dp_tutorial_analyst TO USER <user_name>;
```

### Create a warehouse for the data

```sqlexample
CREATE OR REPLACE WAREHOUSE dp_tutorial_wh;
GRANT USAGE ON WAREHOUSE dp_tutorial_wh TO ROLE dp_tutorial_analyst;
```

### Create mock sensitive data

The following commands create a database, schema, and table, and fill it with data. The data simulates a simple diabetes study in which we want to protect patient identities. Later in the tutorial you’ll use differential privacy to protect the identity of individuals in the study.

```sqlexample
-- Create the table
CREATE OR REPLACE DATABASE dp_db;
CREATE OR REPLACE SCHEMA dp_db.dp_schema;
USE SCHEMA dp_db.dp_schema;
CREATE OR REPLACE TABLE dp_tutorial_diabetes_survey (
  patient_id TEXT,
  is_smoker BOOLEAN,
  has_difficulty_walking BOOLEAN,
  gender TEXT,
  age INT,
  has_diabetes BOOLEAN,
  income_code INT);

-- Populate the table
INSERT INTO dp_db.dp_schema.dp_tutorial_diabetes_survey
VALUES
('ID-23493', TRUE, FALSE, 'male', 39, TRUE, 2),
('ID-00923', FALSE, FALSE, 'female', 82, TRUE, 5),
('ID-24020', FALSE, FALSE, 'male', 69, FALSE, 8),
('ID-92848', TRUE, TRUE, 'other', 75, FALSE, 3),
('ID-62937', FALSE, FALSE, 'male', 46, TRUE, 5);
```

**Notes:**

Although it might seem that masking the patient ID would be better than using differential privacy, that would prevent joins against that
column. Additionally, if you added a table where each patient has multiple rows, such as a medications table or a visits table, simple
masking would prevent you from grouping results by person. This is a case where differential privacy can be much more powerful than simple
masking and row hiding; you can make more of your data available to analysts and allow more useful queries while still protecting entity
privacy.

## Define a privacy policy

Applying a [privacy policy](../diff-privacy/differential-privacy-admin-privacy-policies.md) to a table or view protects it with differential privacy and assigns a [privacy budget](../diff-privacy/differential-privacy-admin-privacy-budgets.md) to groups or users so that Snowflake can prevent multiple queries from revealing too much sensitive information.

You will create the privacy policy in its own database. This is a best practice for all types of policies in Snowflake. If you create the
policy in the same database, then cloning the database would create unsynchronized copies of the policy. Putting all policies in a single,
separate database, and applying them to multiple tables lets you manage and update a single copy of each policy.

You’ll name this new policy `patients_policy`.

```sqlexample
-- Define a privacy policy. Use default budget, budget window, max budget per aggregate.
CREATE OR REPLACE DATABASE policy_db;
CREATE OR REPLACE SCHEMA policy_db.diff_priv_policies;
CREATE OR REPLACE PRIVACY POLICY policy_db.diff_priv_policies.patients_policy AS () RETURNS privacy_budget ->
  CASE
    WHEN CURRENT_ROLE() = 'ACCOUNTADMIN' THEN no_privacy_policy()
    WHEN CURRENT_ROLE() IN ('DP_TUTORIAL_ANALYST')
      THEN privacy_budget(budget_name => 'clinical_analysts')
    ELSE privacy_budget(budget_name => 'default')
END;
```

**Notes:**

* The privacy policy applied depends on the role of the user, as specified in the CASE statement. Role names are given here in uppercase
  because CURRENT_ROLE() returns uppercase values.
* Creating separate privacy budgets per role allows you to separate the budget used for analysts and other users, and also to monitor
  usage by each group.
* If the privacy policy resolves to a valid privacy budget when evaluated, the user cannot run non-aggregated SELECT queries, noise is
  added to the results, and the number of queries is limited by the privacy budget for that policy.
* The account admin role has no privacy policy applied. This means that queries run as that role have no differential privacy applied.
  To indicate no privacy policy, you must return `no_privacy_policy()` rather than returning NULL.
* The DP_TUTORIAL_ANALYST role uses a privacy policy named “clinical_analysts” with default values for privacy budget, budget window, and
  maximum budget per aggregate.
* Any other user with SELECT access will get a privacy budget named “default,” also with default privacy policy values. If you want to
  prevent other users from running queries on this table, you should do so by limiting the SELECT privileges on the table. Table-level
  policies require an ELSE clause and cannot return NULL.

## Assign the privacy policy

Next you’ll assign the privacy policy you just created to the table to protect it with differential privacy.

```sqlexample
-- Assign the privacy policy to the table.
ALTER TABLE dp_db.dp_schema.dp_tutorial_diabetes_survey
ADD PRIVACY POLICY policy_db.diff_priv_policies.patients_policy ENTITY KEY (patient_id);
```

**Notes:**

The ENTITY KEY clause specifies a column that uniquely identifies the entity that should be protected by differential privacy. In this
tutorial, which has a single table where each entity is listed in one and only one row, defining the entity key is less important. But if
each patient could appear in multiple rows (for example, if it captured patient visits or patient medications), then defining the key would
be important. It’s still a good practice to define the key here in case a second such table is added to the database later. Learn more about
[entity-level privacy](../diff-privacy/differential-privacy-admin.md).

## Define a privacy domain

Next you’ll set [privacy domains](../diff-privacy/differential-privacy-privacy-domains.md) on select columns in the table.

A privacy domain tells the system the range of values that can be shown in the results for that column. The system uses this
information in two ways:

* Values outside this range will be omitted or pegged to the boundaries, depending on whether the column is a string or numeric/date value.
* The system uses this “valid range” as a way to determine the range of results in order to determine the noise applied to each
  measure value.

An analyst can further restrict a domain, for example by using a WHERE clause, to potentially reduce the amount of noise generated by
differential privacy (the smaller the domain, the less the noise). If you don’t set a privacy domain on a column, the analyst must add a
privacy domain with a WHERE clause to see values for that column (columns without a privacy domain cannot be shown or used in the query).

For the diabetes survey data, you will set privacy domains on three columns: `gender`, `age`, and `income_code`. You won’t set privacy
domains on any boolean columns (with only two possible values, a privacy domain doesn’t make sense and isn’t required), and you should not
set a privacy domain on the `patient_id` column because the user can see the values you set in the privacy domain, which would tell them
which patient IDs are in the data. If you need to specify a privacy domain for a limited number of string values, such as ZIP codes, you
should [pad the domain definition](../diff-privacy/differential-privacy-privacy-domains-admin.md) with additional, non-present values to obscure possible values.

```sqlexample
-- Define privacy domains.
ALTER TABLE dp_db.dp_schema.dp_tutorial_diabetes_survey ALTER (
COLUMN gender SET PRIVACY DOMAIN IN ('female', 'male', 'other'),
COLUMN age SET PRIVACY DOMAIN BETWEEN (0, 90),
COLUMN income_code SET PRIVACY DOMAIN BETWEEN (1, 8)
);
```

## Grant analyst access to the table

Grant access to the table only after you’ve assigned privacy policies to the data. Otherwise, users could see the data before you apply
privacy policies.

```sqlexample
GRANT USAGE ON DATABASE dp_db TO ROLE dp_tutorial_analyst;
GRANT USAGE ON SCHEMA dp_schema TO ROLE dp_tutorial_analyst;
GRANT SELECT
  ON TABLE dp_db.dp_schema.dp_tutorial_diabetes_survey
  TO ROLE dp_tutorial_analyst;
```

## Run some queries

Finally, you can start running queries against your data!

You will switch roles between admin and analyst to compare the behavior and output for each role.

### Check that differential privacy is working

Use the administrator role to run a query that returns individual rows. This query succeeds because the privacy policy resolves to no_privacy_policy() for the ACCOUNTADMIN role:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT * FROM dp_db.dp_schema.dp_tutorial_diabetes_survey;
```

Now run the same query using the analyst role. The query fails because differential privacy does not allow SELECT \* queries.

```sqlexample
USE ROLE dp_tutorial_analyst;
SELECT * FROM dp_db.dp_schema.dp_tutorial_diabetes_survey;
```

Try with a third role to ensure that the default result is the same. (Don’t forget to grant SELECT on the table to the person or role!)

### See what noise looks like

First, run a simple query as the administrator, without differential privacy applied. You will see the exact table values.

```sqlexample
-- Run a basic query without DP.
USE ROLE ACCOUNTADMIN;
SELECT COUNT(DISTINCT patient_id)
  FROM dp_db.dp_schema.dp_tutorial_diabetes_survey
  WHERE income_code = 5;
```

Now run the same query as an analyst, and you’ll see that noise has been applied to the results. Note that the query takes a little longer
because differential privacy is being applied.

```sqlexample
USE ROLE dp_tutorial_analyst;
SELECT COUNT(DISTINCT patient_id)
  FROM dp_db.dp_schema.dp_tutorial_diabetes_survey
  WHERE income_code = 5;
```

The results are typically different from the admin results because differential privacy has introduced noise into the results to obscure
the presence of an individual in the dataset. However, the results can sometimes be identical because in any given query the randomly
generated noise was small enough to round down to 0. But the analyst cannot know whether or not there is noise applied to any given query.
You can try running this query again to see if you get a different result.

### Analyze the amount of noise

Although analysts cannot see results without noise, they do need a way to understand how noisy the result is, in general, to
determine whether the data is usable for their needs. In order to provide this information, we expose the noise interval of each query
parameter to the analyst. The noise interval is retrieved using the functions [DP_INTERVAL_LOW](../../sql-reference/functions/dp_interval_low.md) and
[DP_INTERVAL_HIGH](../../sql-reference/functions/dp_interval_high.md).

```sqlexample
-- Retrieve noise interval for the previous query.
USE ROLE dp_tutorial_analyst;
SELECT COUNT(DISTINCT patient_id) as c,
  DP_INTERVAL_LOW(c) as LOW,
  DP_INTERVAL_HIGH(c) as HIGH
  FROM dp_db.dp_schema.dp_tutorial_diabetes_survey
  WHERE income_code = 5;
```

There is a minimum 95% confidence that the true value of the aggregation is between LOW and HIGH.

Note that the interval for this query on this data is wide compared to the magnitude of the result because of the artificially small
dataset. This wide noise interval essentially means that there are too few patients here for Snowflake to be able to give an accurate
answer while protecting their privacy.

### See your budget and estimated remaining queries

Users running queries on differential privacy protected tables can see their differential privacy budget used, and an estimate of the
number of remaining queries, by calling the [ESTIMATE_REMAINING_DP_AGGREGATES](../../sql-reference/functions/estimate_remaining_dp_aggregates.md) table function. Assume the role
for which you want to see the budget, then call the function as shown here:

```sqlexample
USE ROLE <role_name>;
SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES(dp_db.dp_schema.dp_tutorial_diabetes_survey));
```

## Clean up

Clean up your resources so that you, or someone else in your org, can run the tutorial again later.

```sqlexample
USE ROLE ACCOUNTADMIN;
DROP ROLE dp_tutorial_analyst;
DROP WAREHOUSE dp_tutorial_wh;
ALTER TABLE dp_tutorial_diabetes_survey
  DROP PRIVACY POLICY policy_db.diff_priv_policies.patients_policy;
DROP DATABASE dp_db;
DROP DATABASE policies_db;
```

---
title: Getting started with hybrid tables
source: https://docs.snowflake.com/en/user-guide/tutorials/getting-started-with-hybrid-tables-tutorial.md
section: User Guide
---

Snowflake

Getting Started

Hybrid Tables

Unistore

# Getting started with hybrid tables

## Introduction

A [hybrid table](../tables-hybrid.md) is a Snowflake table type that is optimized for
hybrid transactional and analytic workloads. These workloads require low latency and high throughput on
small but random reads and writes, which often access a single row in a table. Hybrid tables enforce unique
and referential integrity constraints, which are critical for transactional workloads.

You can use a hybrid table along with other Snowflake tables and features to power
[Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/),
which unite transactional and analytic data in a single platform.

Hybrid tables are integrated seamlessly into the existing Snowflake architecture. Customers connect to the
same Snowflake database service. Queries are compiled and optimized in the cloud services layer and
executed in the same query engine in virtual warehouses. This architecture provides several key benefits:

* Snowflake platform features, such as data governance, work with hybrid tables out of the box.
* You can run hybrid workloads that mix operational and analytic queries.
* You can join hybrid tables with other Snowflake tables, and the query executes natively and efficiently in the
  same query engine. No federation is required.
* You can execute an atomic transaction across hybrid tables and other Snowflake tables. There is no need to
  orchestrate your own two-phase commit.

Hybrid tables leverage a row store as the primary data store to provide excellent operational query performance.
When you write to a hybrid table, the data is written directly into the row store. Data is asynchronously copied
into object storage in order to provide better performance and workload isolation for large scans without affecting
ongoing operational workloads. Some data may also be cached in columnar format on your warehouse in order to provide
better performance on analytical queries. You simply execute SQL statements against the logical hybrid table, and the
query optimizer decides where to read data from to provide the best performance. You get one consistent view of your data
without worrying about the underlying infrastructure.

### What you will learn

In this tutorial you will learn how to:

* Create and bulk load hybrid tables.
* Create and check the enforcement of UNIQUE, PRIMARY KEY, and FOREIGN KEY constraints.
* Run concurrent updates that depend on row-level locks.
* Run a multi-statement operation in a consistent atomic transaction (across hybrid and standard tables).
* Query hybrid tables and join them to standard tables.
* Verify that security and governance principles apply to both hybrid and standard tables.

## Prerequisites

This tutorial assumes that you are:

* Familiar with the Snowsight interface
* Familiar with SQL
* Using a non-trial Snowflake account in [select AWS regions](../tables-hybrid-limitations.md)
* Able to run as a user who has been granted the ACCOUNTADMIN role
* Aware of [unsupported features and limitations on hybrid tables](../tables-hybrid-limitations.md)

## Step 1. Set up your account

To get started, set up your Snowflake account by creating a new worksheet, a role, database objects, and a virtual warehouse.
Then you will be able to create two hybrid tables and one standard table. Follow these steps:

1. Under Worksheets, click the + button in the top-right corner of Snowsight and select SQL Worksheet.
2. Rename the worksheet by selecting its auto-generated timestamp name and typing `Hybrid Tables - QuickStart`.
3. Complete the following steps by copying the block of SQL commands into your worksheet and running them all.

   1. Use the ACCOUNTADMIN role to create the `hybrid_quickstart_role` custom role, then grant this role to the current user.
   2. Create the `hybrid_quickstart_wh` warehouse and the `hybrid_quickstart_db` database. Grant ownership on these
      objects to the new role.
   3. Use the new role to create the `data` schema.
   4. Use the new warehouse. (The database and schema you created are already in use, by default.)

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE OR REPLACE ROLE hybrid_quickstart_role;
   SET my_user = CURRENT_USER();
   GRANT ROLE hybrid_quickstart_role TO USER IDENTIFIER($my_user);

   CREATE OR REPLACE WAREHOUSE hybrid_quickstart_wh WAREHOUSE_SIZE = XSMALL, AUTO_SUSPEND = 300, AUTO_RESUME = TRUE;
   GRANT OWNERSHIP ON WAREHOUSE hybrid_quickstart_wh TO ROLE hybrid_quickstart_role;
   CREATE OR REPLACE DATABASE hybrid_quickstart_db;
   GRANT OWNERSHIP ON DATABASE hybrid_quickstart_db TO ROLE hybrid_quickstart_role;

   USE ROLE hybrid_quickstart_role;
   CREATE OR REPLACE SCHEMA data;

   USE WAREHOUSE hybrid_quickstart_wh;
   ```

## Step 2. Create and bulk load three tables

This tutorial uses the Tasty Bytes Snowflake fictional food truck business to simulate a use case where you can
serve data to an application.

You will create three tables:

* `order_header` hybrid table - This table stores order metadata such as `truck_id`, `customer_id`,
  `order_amount`, and so on.
* `truck` hybrid table - This table stores truck metadata such as `truck_id`, `franchise_id`, `menu_type_id`,
  and so on.
* `truck_history` standard table - This table stores historical information about food trucks, enabling you to
  track changes over time.

You are creating hybrid and standard tables to demonstrate how well they work together. Nonetheless, hybrid tables
have some fundamental differences in their definition and behavior:

* Hybrid tables require a primary key on one or more columns (which implies the creation of a primary key index).
* Hybrid tables allow the creation of [secondary indexes](../tables-hybrid-index.md) on any column.
* PRIMARY KEY, FOREIGN KEY, and UNIQUE [constraints](../../sql-reference/constraints-overview.md) are all enforced on hybrid tables.
* Locks on hybrid tables are [row-level](../../sql-reference/transactions.md), not table-level.
* Hybrid table data resides in a row store, but is also copied to columnar object storage.

These differences result in:

* Support for referential integrity when table data is loaded, updated, or deleted.
* Faster DML operations (especially those that update single rows).
* Faster lookup queries.

You can bulk load data into hybrid tables by copying data from a stage or from other tables (that is, by using
[CTAS](../../sql-reference/sql/create-table.md), [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md), or
[INSERT INTO … SELECT](../../sql-reference/sql/insert.md)). This tutorial uses the CTAS command. For more
information about bulk loading hybrid tables, see [Loading data](../tables-hybrid-create.md).

Create a [file format](../../sql-reference/sql/create-file-format.md), which describes a staged data set
that you can access or load into Snowflake tables, and a [stage](../data-load-overview.md), which is a Snowflake object
that points to a cloud storage location that Snowflake can access to both ingest and query data. The
data is stored in a publicly accessible AWS S3 bucket that you reference when you create the stage.

```sqlexample
CREATE OR REPLACE FILE FORMAT csv_format TYPE = CSV FIELD_DELIMITER = ',' SKIP_HEADER = 1 NULL_IF = ('NULL', 'null') EMPTY_FIELD_AS_NULL = true;
CREATE OR REPLACE STAGE frostbyte_tasty_bytes_stage URL = 's3://sfquickstarts/hybrid_table_guide' FILE_FORMAT = csv_format;
```

Now use the [LIST](../../sql-reference/sql/list.md) statement to return all the files in the FROSTBYTE_TASTY_BYTES_STAGE:

```sqlexample
LIST @frostbyte_tasty_bytes_stage;
```

The statement should return two records: one for the `TRUCK.csv` file and one for the `ORDER_HEADER.csv` file.

After you have created the stage, which points to the location of the data in cloud storage, you can create and load the data into the
`truck` by using a [CTAS command](../../sql-reference/sql/create-hybrid-table.md) that selects data from the `TRUCK.csv` file.
Note the PRIMARY KEY constraint on the `truck_id` column.

The second DDL statement creates a standard table named `truck_history`, also by using a CTAS statement.

```sqlexample
SET CURRENT_TIMESTAMP = CURRENT_TIMESTAMP();

CREATE OR REPLACE HYBRID TABLE truck (
  truck_id NUMBER(38,0) NOT NULL,
  menu_type_id NUMBER(38,0),
  primary_city VARCHAR(16777216),
  region VARCHAR(16777216),
  iso_region VARCHAR(16777216),
  country VARCHAR(16777216),
  iso_country_code VARCHAR(16777216),
  franchise_flag NUMBER(38,0),
  year NUMBER(38,0),
  make VARCHAR(16777216),
  model VARCHAR(16777216),
  ev_flag NUMBER(38,0),
  franchise_id NUMBER(38,0),
  truck_opening_date DATE,
  truck_email VARCHAR NOT NULL UNIQUE,
  record_start_time TIMESTAMP,
  PRIMARY KEY (truck_id)
  )
  AS
  SELECT
      t.$1 AS truck_id,
      t.$2 AS menu_type_id,
      t.$3 AS primary_city,
      t.$4 AS region,
      t.$5 AS iso_region,
      t.$6 AS country,
      t.$7 AS iso_country_code,
      t.$8 AS franchise_flag,
      t.$9 AS year,
      t.$10 AS make,
      t.$11 AS model,
      t.$12 AS ev_flag,
      t.$13 AS franchise_id,
      t.$14 AS truck_opening_date,
      CONCAT(truck_id, '_truck@email.com') truck_email,
      $CURRENT_TIMESTAMP AS record_start_time
    FROM @FROSTBYTE_TASTY_BYTES_STAGE (PATTERN=>'.*TRUCK.csv') t;

CREATE OR REPLACE TABLE truck_history (
  truck_id NUMBER(38,0) NOT NULL,
  menu_type_id NUMBER(38,0),
  primary_city VARCHAR(16777216),
  region VARCHAR(16777216),
  iso_region VARCHAR(16777216),
  country VARCHAR(16777216),
  iso_country_code VARCHAR(16777216),
  franchise_flag NUMBER(38,0),
  year NUMBER(38,0),
  make VARCHAR(16777216),
  model VARCHAR(16777216),
  ev_flag NUMBER(38,0),
  franchise_id NUMBER(38,0),
  truck_opening_date DATE,
  truck_email VARCHAR NOT NULL UNIQUE,
  record_start_time TIMESTAMP,
  record_end_time TIMESTAMP,
  PRIMARY KEY (truck_id)
  )
  AS
  SELECT
      t.$1 AS truck_id,
      t.$2 AS menu_type_id,
      t.$3 AS primary_city,
      t.$4 AS region,
      t.$5 AS iso_region,
      t.$6 AS country,
      t.$7 AS iso_country_code,
      t.$8 AS franchise_flag,
      t.$9 AS year,
      t.$10 AS make,
      t.$11 AS model,
      t.$12 AS ev_flag,
      t.$13 AS franchise_id,
      t.$14 AS truck_opening_date,
      CONCAT(truck_id, '_truck@email.com') truck_email,
      $CURRENT_TIMESTAMP AS record_start_time,
      NULL AS record_end_time
   FROM @frostbyte_tasty_bytes_stage (PATTERN=>'.*TRUCK.csv') t;
```

The following DDL statement creates the structure for the `order_header` hybrid table.
Note the PRIMARY KEY constraint on the `order_id` column, the FOREIGN KEY constraint on the
`truck_id` column from the `truck` table, and the secondary index on the `order_ts` column.

```sqlexample
CREATE OR REPLACE HYBRID TABLE order_header (
  order_id NUMBER(38,0) NOT NULL,
  truck_id NUMBER(38,0),
  location_id NUMBER(19,0),
  customer_id NUMBER(38,0),
  discount_id FLOAT,
  shift_id NUMBER(38,0),
  shift_start_time TIME(9),
  shift_end_time TIME(9),
  order_channel VARCHAR(16777216),
  order_ts TIMESTAMP_NTZ(9),
  served_ts VARCHAR(16777216),
  order_currency VARCHAR(3),
  order_amount NUMBER(38,4),
  order_tax_amount VARCHAR(16777216),
  order_discount_amount VARCHAR(16777216),
  order_total NUMBER(38,4),
  order_status VARCHAR(16777216) DEFAULT 'INQUEUE',
  PRIMARY KEY (order_id),
  FOREIGN KEY (truck_id) REFERENCES TRUCK(truck_id),
  INDEX IDX01_ORDER_TS(order_ts)
);
```

The following DML statement inserts data into the `order_header` table, using an INSERT INTO … SELECT statement.

```sqlexample
INSERT INTO order_header (
  order_id,
  truck_id,
  location_id,
  customer_id,
  discount_id,
  shift_id,
  shift_start_time,
  shift_end_time,
  order_channel,
  order_ts,
  served_ts,
  order_currency,
  order_amount,
  order_tax_amount,
  order_discount_amount,
  order_total,
  order_status)
  SELECT
      t.$1 AS order_id,
      t.$2 AS truck_id,
      t.$3 AS location_id,
      t.$4 AS customer_id,
      t.$5 AS discount_id,
      t.$6 AS shift_id,
      t.$7 AS shift_start_time,
      t.$8 AS shift_end_time,
      t.$9 AS order_channel,
      t.$10 AS order_ts,
      t.$11 AS served_ts,
      t.$12 AS order_currency,
      t.$13 AS order_amount,
      t.$14 AS order_tax_amount,
      t.$15 AS order_discount_amount,
      t.$16 AS order_total,
      '' as order_status
    FROM @frostbyte_tasty_bytes_stage (PATTERN=>'.*ORDER_HEADER.csv') t;
```

## Step 3. Explore your data

Earlier you created the `hybrid_quickstart_role` role, `hybrid_quickstart_wh` warehouse, `hybrid_quickstart_db` database,
and `data` schema. Continue to use those objects.

You also created and loaded the `truck`, `truck_history`, and `order_header` tables. Now you can run a few queries and become familiar with both the data in these tables and their metadata.

Use the [SHOW TABLES](../../sql-reference/sql/show-tables.md) command to view properties and metadata for both standard tables and
hybrid tables. Use the [SHOW HYBRID TABLES](../../sql-reference/sql/show-hybrid-tables.md) command to view information about hybrid tables only.

```sqlexample
SHOW TABLES LIKE '%truck%';
```

```sqlexample
SHOW HYBRID TABLES LIKE '%order_header%';
```

Display information about the columns in the table by using [DESCRIBE <object>](../../sql-reference/sql/desc.md) commands. Note the columns with
PRIMARY KEY and UNIQUE constraints.

```sqlexample
DESCRIBE TABLE truck;
```

```sqlexample
DESCRIBE TABLE order_header;
```

List the [hybrid tables](../../sql-reference/sql/show-hybrid-tables.md) for which you have access privileges.

```sqlexample
SHOW HYBRID TABLES;
```

List all the [indexes](../../sql-reference/sql/show-indexes.md) for which you have access privileges. Note the value in the
`is_unique` column for each index.

```sqlexample
SHOW INDEXES;
```

Look at sample data from the tables by running these simple queries.

```sqlexample
SELECT * FROM truck LIMIT 10;
SELECT * FROM truck_history LIMIT 10;
SELECT * FROM order_header LIMIT 10;
```

The output for the first query looks similar to the following:

## Step 4. Test the behavior of UNIQUE and FOREIGN KEY constraints

In this step, you will test UNIQUE and FOREIGN KEY [constraints](../../sql-reference/constraints-overview.md).
These constraints are enforced when they are defined on hybrid tables.

UNIQUE constraints preserve data integrity by preventing duplicate values from being inserted into a
column. FOREIGN KEY constraints work in tandem with PRIMARY KEY constraints to preserve referential integrity. A value
cannot be inserted into a primary key column if no matching foreign key value exists in the referenced table.
For example, a sale of a product with ID `100` cannot be recorded in a sales fact table if no such product ID
already exists in a referenced product dimension table.

Both types of constraints support data accuracy and consistency for applications that rely heavily on reliable
but fast transaction processing.

### Step 4.1. Test a UNIQUE constraint

A UNIQUE constraint ensures that all values in a column are different. In the `truck` table, you
defined the `truck_email` column as NOT NULL and UNIQUE.

Given the UNIQUE constraint, if you attempt to insert two records with
the same email address, the statement will fail. To test this behavior, run the following commands.

Start by selecting an existing email address and setting a variable `truck_email` to that string. Then select the
maximum value of `truck_id` from the table and set another variable `max_truck_id` to that value. Next, set a third
variable, `new_truck_id` that increments `max_truck_id` by 1. This process ensures that you do not run into a
“Primary key already exists” error when you insert a new row.

Finally, insert the new row.

```sqlexample
SET truck_email = (SELECT truck_email FROM truck LIMIT 1);
SET max_truck_id = (SELECT MAX(truck_id) FROM truck);
SET new_truck_id = $max_truck_id+1;
INSERT INTO truck VALUES
  ($new_truck_id,2,'Stockholm','Stockholm län','Stockholm','Sweden','SE',1,2001,'Freightliner','MT45 Utilimaster',0,276,'2020-10-01',$truck_email,CURRENT_TIMESTAMP());
```

The INSERT statement fails and you receive the following error message:

```output
Duplicate key value violates unique constraint SYS_INDEX_TRUCK_UNIQUE_TRUCK_EMAIL
```

Now create a new unique email address and insert a new record into the `truck` table:

```sqlexample
SET new_unique_email = CONCAT($new_truck_id, '_truck@email.com');
INSERT INTO truck VALUES ($new_truck_id,2,'Stockholm','Stockholm län','Stockholm','Sweden','SE',1,2001,'Freightliner','MT45 Utilimaster',0,276,'2020-10-01',$new_unique_email,CURRENT_TIMESTAMP());
```

The INSERT statement should run successfully this time.

### Step 4.2. Test a FOREIGN KEY constraint

In this step you will test a FOREIGN KEY constraint.

First, show the DDL that you used to create the `order_header` table by executing the
[GET_DDL](../../sql-reference/functions/get_ddl.md) function. Note the FOREIGN KEY constraint for the `truck_id` column in the output.

```sqlexample
SELECT GET_DDL('table', 'order_header');
```

The output of this command looks similar to the following partial result:

Now try to insert a new record into the `order_header` table, using a non-existent truck ID.

```sqlexample
SET max_order_id = (SELECT MAX(order_id) FROM order_header);
SET new_order_id = ($max_order_id +1);
SET no_such_truck_id = -1;
INSERT INTO order_header VALUES
  ($new_order_id,$no_such_truck_id,6090,0,0,0,'16:00:00','23:00:00','','2022-02-18 21:38:46.000','','USD',17.0000,'','',17.0000,'');
```

The INSERT statement should fail because it violates the FOREIGN KEY constraint on the `truck` table. You should receive
the following error message:

```output
Foreign key constraint SYS_INDEX_ORDER_HEADER_FOREIGN_KEY_TRUCK_ID_TRUCK_TRUCK_ID was violated.
```

Now use the new `new_truck_id` variable that you used earlier and insert a new record into the `order_header` table:

```sqlexample
INSERT INTO order_header VALUES
  ($new_order_id,$new_truck_id,6090,0,0,0,'16:00:00','23:00:00','','2022-02-18 21:38:46.000','','USD',17.0000,'','',17.0000,'');
```

The INSERT statement should run successfully this time.

### Step 4.3. Attempt to truncate a table referenced by a FOREIGN KEY constraint

Next, you can verify that a table referenced by a FOREIGN KEY constraint cannot be truncated as long as the foreign-key relationship
exists. Run the following [TRUNCATE TABLE](../../sql-reference/sql/truncate-table.md) statement:

```sqlexample
TRUNCATE TABLE truck;
```

The statement should fail, and you should receive the following error message:

```output
91458 (0A000): Hybrid table 'TRUCK' cannot be truncated as it is involved in active foreign key constraints.
```

### Step 4.4. Delete a row referenced by a FOREIGN KEY constraint

Next, you can verify that a record referenced by a FOREIGN KEY constraint cannot be deleted as long as the foreign-key
relationship exists. Run the following [DELETE](../../sql-reference/sql/delete.md) statement.

```sqlexample
DELETE FROM truck WHERE truck_id = $new_truck_id;
```

The statement should fail, and you should receive the following error message:

```output
Foreign keys that reference key values still exist.
```

To delete a record referenced by a FOREIGN KEY constraint, you must first delete the corresponding record from the
`order_header` table. Then you can delete the referenced record from the `truck` table. Run the following DELETE
statements:

```sqlexample
DELETE FROM order_header WHERE order_id = $new_order_id;
DELETE FROM truck WHERE truck_id = $new_truck_id;
```

Both statements should run successfully.

## Step 5. Use row-level locking to run concurrent updates

Unlike standard tables, which use partition or table-level locking, hybrid tables employ
[row-level locking](../../sql-reference/transactions.md) for
update operations. Row-level locking allows concurrent updates on independent records so that transactions don’t
wait on full table locks. For applications that rely on heavy transactional workloads,
wait times for locks must be kept to a minimum, allowing concurrent operations to access the same table very frequently.

In this step, you can test concurrent updates to different records in the `order_header` hybrid table.

You will use the main `Hybrid Tables - QuickStart` worksheet that you created earlier, and you will create a new worksheet named `Hybrid Tables - QuickStart Session 2` to simulate a new session. From the `Hybrid Tables - QuickStart`
worksheet, you will start a new transaction by using the [BEGIN](../../sql-reference/sql/begin.md)
statement, then run an UPDATE statement (a DML operation). Before running the [COMMIT](../../sql-reference/sql/commit.md)
transaction statement, you will open the `Hybrid Tables - QuickStart Session 2` worksheet and run another UPDATE statement.
Finally, you will commit the open transaction.

### Step 5.1. Create a new worksheet

Under Worksheets, click the + button in the top-right corner of
Snowsight, then select SQL Worksheet.

Rename the worksheet by selecting its auto-generated timestamp name and typing `Hybrid Tables - QuickStart Session 2`.
This new worksheet will only be used in the current step.

### Step 5.2. Run concurrent updates

First, open the `Hybrid Tables - QuickStart` worksheet. Make sure you are using the right role, warehouse, database, and
schema, then set and select the `max_order_id` variable.

```sqlexample
USE ROLE hybrid_quickstart_role;
USE WAREHOUSE hybrid_quickstart_wh;
USE DATABASE hybrid_quickstart_db;
USE SCHEMA data;

SET max_order_id = (SELECT MAX(order_id) FROM order_header);
SELECT $max_order_id;
```

Note the value of the `max_order_id` variable.

Start a new transaction and run the first UPDATE statement.

```sqlexample
BEGIN;
UPDATE order_header
  SET order_status = 'COMPLETED'
  WHERE order_id = $max_order_id;
```

Note that you did not commit the transaction, so now there is an open lock on the row that matches this condition:

```sqlexample
WHERE order_id = $max_order_id
```

Run the [SHOW TRANSACTIONS](../../sql-reference/sql/show-transactions.md) command, which should return a single open transaction.

```sqlexample
SHOW TRANSACTIONS;
```

The output of this command looks similar to the following partial result:

Open the `Hybrid Tables - QuickStart Session 2` worksheet. Make sure you are using the right role, warehouse, database, and schema, then set and select the `min_order_id` variable.

```sqlexample
USE ROLE hybrid_quickstart_role;
USE WAREHOUSE hybrid_quickstart_wh;
USE DATABASE hybrid_quickstart_db;
USE SCHEMA data;
```

```sqlexample
SET min_order_id = (SELECT MIN(order_id) FROM order_header);
SELECT $min_order_id;
```

Note that the `min_order_id` value is different from the `max_order_id` value that you used in the first UPDATE statement.
Run the second UPDATE statement.

```sqlexample
UPDATE order_header
  SET order_status = 'COMPLETED'
  WHERE order_id = $min_order_id;
```

Because hybrid tables use row-level locking and the open transaction locks the row `WHERE order_id = $MAX_ORDER_ID`,
the UPDATE statement runs successfully.

Open the `Hybrid Tables - QuickStart` worksheet and commit the open transaction.

```sqlexample
COMMIT;
```

Run the following query to view the updated records:

```sqlexample
SELECT * FROM order_header WHERE order_status = 'COMPLETED';
```

The output of this command looks similar to the following partial result:

## Step 6. Demonstrate consistency

In this step, you will learn about a unique hybrid tables feature: the ability to run multi-statement
operations natively, easily, and effectively in one consistent atomic transaction, with access to both hybrid
tables and standard tables. Snowflake [transactions](../../sql-reference/transactions.md) guarantee the “ACID”
properties of atomicity, consistency, isolation, and durability. Any given transaction is treated as an atomic unit;
preserves a consistent database state when writes occur; is isolated from other concurrent transactions (as if they
were being run sequentially); and is committed durably (remains committed, once committed).

In this example, the company acquires a new truck of the same model as an existing truck. Consequently, you must
update the `year` column for the relevant record in the `truck` hybrid table to reflect the change.
After this update, you need to promptly update a row and insert a new row in the `truck_history` table. This
standard table will track and preserve all the changes to the truck fleet over time. You complete all of these steps
as part of one explicitly committed transaction.

### Step 6.1. Run a single transaction that contains multiple DML statements

Open the original `Hybrid Tables - QuickStart` worksheet.

Start a new transaction to ensure that a subsequent series of operations is treated as a single, atomic unit. Then
execute multiple DML statements:

* Update the relevant truck record in the `truck` hybrid table.
* Update the corresponding record in the `truck_history` table by setting the `record_end_time` to mark the end of
  its validity.
* Insert a new record in the `truck_history` table, capturing the updated information.

Finally, commit the transaction.

```sqlexample
BEGIN;
SET CURRENT_TIMESTAMP = CURRENT_TIMESTAMP();
UPDATE truck SET year = '2024', record_start_time=$CURRENT_TIMESTAMP WHERE truck_id = 1;
UPDATE truck_history SET record_end_time=$CURRENT_TIMESTAMP WHERE truck_id = 1 AND record_end_time IS NULL;
INSERT INTO truck_history SELECT *, NULL AS record_end_time FROM truck WHERE truck_id = 1;
COMMIT;
```

### Step 6.2. Check the results

Now run the following SELECT queries to review the results of the UPDATE and INSERT statements.

The first query should return two rows, and the second query should return one.

```sqlexample
SELECT * FROM truck_history WHERE truck_id = 1;
```

The output of this command looks similar to the following partial result:

```sqlexample
SELECT * FROM truck WHERE truck_id = 1;
```

The output of this command looks similar to the following partial result:

## Step 7. Join a hybrid table to a standard table

In this step, you run a [join](../../sql-reference/constructs/join.md) query that combines data from a hybrid table
(`order_header`) and a standard table (`truck_history`). This query demonstrates the interoperability of the two
table types.

### Step 7.1. Explore the data in the tables

Earlier you created and loaded the `order_header` table. Now you can run a few queries and review some
information to get familiar with the table. First, list the tables in the database with the SHOW TABLES command,
then select two columns from the output of that list.

```sqlexample
SHOW TABLES IN DATABASE hybrid_quickstart_db;
SELECT "name", "is_hybrid" FROM TABLE(RESULT_SCAN(last_query_id()));
```

The output of this command looks similar to the following partial result:

Now run two simple queries:

```sqlexample
SELECT * FROM truck_history LIMIT 10;
SELECT * FROM order_header LIMIT 10;
```

The output of the second query looks similar to the following partial result:

### Step 7.2. Join a hybrid table to a standard table

To join the `order_header` hybrid table to the `truck_history` standard table, run the following
SET statement and query. Joining hybrid tables to standard tables does not require any special syntax.

```sqlexample
SET order_id = (SELECT order_id FROM order_header LIMIT 1);

SELECT hy.*,st.*
  FROM order_header AS hy JOIN truck_history AS st ON hy.truck_id = st.truck_id
  WHERE hy.order_id = $order_id
    AND st.record_end_time IS NULL;
```

The join result looks similar to the following partial result:

## Step 8. Demonstrate security and governance

In this step, you will run two security-related examples to demonstrate that Snowflake
[security and governance](../ecosystem-security.md) functionality applies equally to standard tables
and hybrid tables.

Roles and grants of privileges to those roles are standard mechanisms for enforcing security when large numbers
of database users have access to the same system, whether the workload is transactional, analytic, or hybrid.

### Step 8.1. Set up hybrid table access control and user management

[Role-based access control (RBAC)](../security-access-control-overview.md)
works the same for hybrid tables and standard tables. You can manage access to hybrid table data in Snowflake by
granting privileges to some roles.

First, create a new `hybrid_quickstart_bi_user_role` role. Use the ACCOUNTADMIN role to create the new role.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE hybrid_quickstart_bi_user_role;
SET my_user = CURRENT_USER();
GRANT ROLE hybrid_quickstart_bi_user_role TO USER IDENTIFIER($my_user);
```

Now you can grant USAGE privileges for the `hybrid_quickstart_wh` warehouse, `hybrid_quickstart_db` database,
and all of its schemas to the new role. Use `hybrid_quickstart_role` to run the GRANT statements.

```sqlexample
USE ROLE hybrid_quickstart_role;
GRANT USAGE ON WAREHOUSE hybrid_quickstart_wh TO ROLE hybrid_quickstart_bi_user_role;
GRANT USAGE ON DATABASE hybrid_quickstart_db TO ROLE hybrid_quickstart_bi_user_role;
GRANT USAGE ON ALL SCHEMAS IN DATABASE hybrid_quickstart_db TO hybrid_quickstart_bi_user_role;
```

Using the new role (`hybrid_quickstart_bi_user_role`), try to select some data from the `order_header` table.

```sqlexample
USE ROLE hybrid_quickstart_bi_user_role;
USE DATABASE hybrid_quickstart_db;
USE SCHEMA data;

SELECT * FROM order_header LIMIT 10;
```

You cannot select any data because the role `hybrid_quickstart_bi_user_role` has not been granted the necessary
SELECT privilege on the tables. You receive the following error message:

```output
Object 'ORDER_HEADER' does not exist or not authorized.
```

To solve this problem, use the role `hybrid_quickstart_role` to grant SELECT privileges on all the tables in the
`data` schema to `hybrid_quickstart_bi_user_role`.

```sqlexample
USE ROLE hybrid_quickstart_role;
GRANT SELECT ON ALL TABLES IN SCHEMA DATA TO ROLE hybrid_quickstart_bi_user_role;
```

Try again to select data from the `order_header` hybrid table.

```sqlexample
USE ROLE hybrid_quickstart_bi_user_role;
SELECT * FROM order_header LIMIT 10;
```

This time the query succeeds because HYBRID_QUICKSTART_BI_USER_ROLE has the appropriate privileges at all
levels of the hierarchy. The output looks similar to the following partial result:

### Step 8.2. Create and implement a masking policy

In this step, you create a [masking policy](../security-column-intro.md) and apply it to the
`truck_email` column in the `truck` hybrid table by using an ALTER TABLE … ALTER COLUMN statement.
A masking policy is a standard way of controlling the column-level visibility of data to users with different roles
and privileges.

> **Note:**
>
> To create masking policies, you must use an Enterprise Edition account (or a higher-level account). If you are
> using a Standard Edition account, skip this step. For more information, see [Snowflake editions](../intro-editions.md).

Use the `hybrid_quickstart_role` role, then create the new masking policy, which is intended to mask entire column
values from unauthorized roles.

```sqlexample
USE ROLE hybrid_quickstart_role;

CREATE MASKING POLICY hide_column_values AS
  (col_value VARCHAR) RETURNS VARCHAR ->
    CASE WHEN CURRENT_ROLE() IN ('HYBRID_QUICKSTART_ROLE') THEN col_value
      ELSE '***MASKED***'
      END;
```

Now apply this policy to the hybrid table.

```sqlexample
ALTER TABLE truck MODIFY COLUMN truck_email
  SET MASKING POLICY hide_column_values USING (truck_email);
```

Because you are currently using the `hybrid_quickstart_role`, the `truck_email` column should *not* be masked.
Run the following query:

```sqlexample
SELECT * FROM truck LIMIT 10;
```

Switch to `HYBRID_QUICKSTART_BI_USER_ROLE` and run the query again. The `TRUCK_EMAIL` column should be
masked now.

```sqlexample
USE ROLE hybrid_quickstart_bi_user_role;
SELECT * FROM truck LIMIT 10;
```

## Step 9. Cleanup, conclusion, and further reading

### Cleanup

To clean up your Snowflake environment, run the following SQL statements:

```sqlexample
USE ROLE hybrid_quickstart_role;
USE WAREHOUSE hybrid_quickstart_wh;
USE DATABASE hybrid_quickstart_db;
USE SCHEMA data;
```

```sqlexample
DROP DATABASE hybrid_quickstart_db;
DROP WAREHOUSE hybrid_quickstart_wh;
USE ROLE ACCOUNTADMIN;
DROP ROLE hybrid_quickstart_role;
DROP ROLE hybrid_quickstart_bi_user_role;
```

Finally, manually delete the `Hybrid Tables - QuickStart` and `Hybrid Tables - QuickStart Session 2`
worksheets.

### What you learned

In this tutorial, you learned how to:

* Create and bulk load hybrid tables.
* Create and check the enforcement of UNIQUE, PRIMARY KEY, and FOREIGN KEY constraints.
* Run concurrent updates that depend on row-level locks.
* Run a multi-statement operation in a consistent atomic transaction (across hybrid and standard tables).
* Query hybrid tables and join them to standard tables.
* Verify that security and governance principles apply to both hybrid and standard tables.

### Related resources

* [Snowflake Unistore Landing
  Page](https://www.snowflake.com/en/data-cloud/workloads/unistore/)
* [Snowflake Documentation for Hybrid Tables](../tables-hybrid.md)
* [Blog: Simplify Application Development with Hybrid Tables](https://www.snowflake.com/blog/simplify-application-development-hybrid-tables)

---
title: Getting started with Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/tutorials/open-catalog-gs.md
section: User Guide
---

Snowflake Open Catalog

# Getting started with Snowflake Open Catalog

## Overview

Snowflake Open Catalog is an open catalog for Apache Iceberg™. Open Catalog is available as an SaaS service managed on Snowflake. It is also available as open source code that you can build and deploy yourself.
Open Catalog provides an implementation of the Apache Iceberg REST catalog with cross-engine security via role-based access control.

In this tutorial, you learn how to get started with Open Catalog managed on Snowflake.

### What you’ll learn

* How to create a new Open Catalog account.
* How to create a new Iceberg catalog in the Open Catalog account and secure it using RBAC.
* How to use Apache Spark™ to create tables in the catalog and run queries.
* How to use Snowflake to run queries on tables in the catalog.
* How to mirror or publish managed Iceberg tables in Snowflake to Open Catalog.

### What you’ll need

* ORGADMIN privileges in your Snowflake organization (to create a new Open Catalog account).
* ACCOUNTADMIN privileges in your Snowflake account (to connect to the Open Catalog account). This Snowflake account does not have to be the
  same as the Snowflake organization account.

### What you’ll do

You’ll complete two use cases:

* Use case 1: Create a catalog in Open Catalog, create a table using Apache Spark, and query the table using Apache Spark and Snowflake.
* Use case 2: Create an Apache Iceberg table in the Snowflake DB account using Snowflake, and publish it to Open Catalog so Apache Spark can run queries on it.

## Set up the environment

### Install Conda, Spark, and Jupyter on your laptop

In this tutorial, you can use Conda to easily create a development environment and download necessary packages. This is only needed if you
follow use case 2 for using Apache Spark™ to read Snowflake-managed Apache Iceberg™ tables. This is not required to create or use Iceberg tables
on Snowflake.

1. To install Conda, use the instructions specific to your OS:

   * [Mac](https://docs.conda.io/projects/conda/en/latest/user-guide/install/macos.html)
   * [Windows](https://docs.conda.io/projects/conda/en/latest/user-guide/install/windows.html)
   * [Linux](https://docs.conda.io/projects/conda/en/latest/user-guide/install/linux.html)
2. Create a file named `environment.yml` with the following contents:

   ```bash
   name: iceberg-lab
   channels:
     - conda-forge
   dependencies:
     - findspark=2.0.1
     - jupyter=1.0.0
     - pyspark=3.5.0
     - openjdk=11.0.13
   ```
3. To create the environment needed, run the following in your shell:

   ```bash
   conda env create -f environment.yml
   ```

### Create an Open Catalog account

An Open Catalog account can be created only by an ORGADMIN.

1. In Snowsight, in the navigation pane, select **Admin > Accounts**.
2. In the **+ Account** drop-down, select **Create Snowflake Open Catalog Account**.
3. Complete the **Create Snowflake Open Catalog Account** dialog:

   * **Cloud**: The cloud provider where you want to store Apache Iceberg™
     tables.
   * **Region**: The region where you want to store Iceberg tables.
   * **Edition**: The edition for your Open Catalog account.
4. Select **Next**.
5. From the Create New Account dialog, complete the Account Name, User
   Name, Password, and Email fields.
6. Select **Create Account**. Your new Open Catalog Account is
   created and a confirmation box appears.
7. In the confirmation box, select the **Account Locator URL** to open
   the Account Locator URL in your web browser.
8. Bookmark the Account Locator URL. When signing in to Open
   Catalog, you must specify the Account Locator URL.

### Sign in to the Open Catalog web interface

1. Click the account URL that you received via email after creating the account, OR
   go to <https://app.snowflake.com>.
2. Click **Sign into a different account** and sign in with the Open Catalog account created earlier.

## Use case 1: Create a table using Apache Spark™

### Create an IAM policy that grants access to your S3 location

If you don’t have one already, start by creating an IAM policy that grants access to your S3 location. For instructions on creating this policy, see [Create an IAM policy that grants access to your S3 location](../create-catalog.md).

### Create an IAM role

If you don’t have one already, create an AWS IAM role for Open Catalog to grant privileges on your S3 bucket. For instructions, see [Create an IAM role](../create-catalog.md). When the instructions prompt you to select a policy, select
the IAM policy that grants access to your S3 location.

### Create an internal catalog in Open Catalog

You can use an internal catalog in your Open Catalog account to create tables, query them, and run DML against the tables using
Apache Spark™ or other query engines.

1. Sign in to your new Open Catalog account.
2. To create a new catalog, in the pane on the left, select **Catalogs**.
3. Select **+Catalog** in the upper right.
4. In the **Create Catalog** dialog, enter the following details:

   * **Name**: Name the catalog **demo_catalog**.
   * **Default base location:** The location where the table data will be stored.
   * **Additional locations (optional):** A comma separated list of multiple storage locations. It is mainly used if you need to import tables from different locations in this catalog. You can leave it blank.
   * **S3 role ARN:** An AWS role that has read-write access to storage locations. Enter the ARN of the IAM role that you created for Open Catalog.
   * **External ID: (optional):** A secret that you want to provide while creating a trust relationship between catalog user and storage account.
     If you skip this, it will be auto-generated. Use a simple string like **abc123** for this tutorial.
5. Select **Create**. Your catalog is created and the following values are added to your catalog:

   * The **IAM user arn** for your Open Catalog account.
   * If you didn’t enter an External ID yourself, an **External ID** is auto-generated for your catalog.

   You’ll need this values in the next section when you create a trust relationship.

### Create a trust relationship

After creating a catalog, you need to set up a trust relationship so that the S3 role specified in the configuration above can read and write data in the storage location. Note that to complete this task, you will need the S3 IAM user arn and External ID for your catalog.

1. After the catalog is created, select your catalog in the list to display the S3 IAM user arn and External ID for your catalog.
2. To create the trust relationship, complete the instructions in [Step 5: Grant the IAM user permissions to access bucket objects](../create-catalog.md).

   In the JSON object shown in these instructions:

   * For `<open_catalog_user_arn>`, use the value under **IAM user arn** in the Open Catalog UI.
   * For `<open_catalog_external_id>`, use the value under **External ID** in the Open Catalog UI.

### Configure a new service connection for Apache Spark™

Create a new connection (client_id/client_secret pair) for Apache Spark to run queries against the catalog that you just created.

1. In Open Catalog, in the left pane, select the **Connections** tab, and then select **+ Connection** in the upper right.
2. In the **Configure Service Connection** dialog, create a new principal role or choose from one of the available roles.
3. Select **Create**.
4. From the **Configure Service Connection** dialog, to copy the Client ID and Client Secret to a text editor, select **Copy** inside the
   **As <CLIENT ID>:<SECRET>** field.

   **Important**

   > You won’t be able to retrieve these text strings from the Open Catalog service later, so you must copy them now. You use these text
   > strings when you configure Spark.

   **Note**

   > In this tutorial, you connect to Open Catalog with a service connection. If you need to connect to Open Catalog with External OAuth or key pair authentication, see:

   > * [Configure External OAuth in Snowflake Open Catalog](../external-oauth-configure.md). This topic includes instructions for setting up catalog privileges and setting up Spark that are specific to External OAuth.
   > * [Configure key pair authentication in Snowflake Open Catalog](../key-pair-auth-configure.md). This topic includes instructions for setting up catalog privileges and setting up Spark that are specific to key pair authentication.

### Set up catalog privileges for connection

Now you give privileges to the service connection so that it can access the catalog. Without access privileges, the
service connection can’t run any queries on the catalog.

1. In the navigation pane, select **Catalogs**, and then select your catalog in the
   list.
2. To create a new role, select the **Roles** tab.
3. Select **+ Catalog role**.
4. In the **Create Catalog Role** dialog, for **Name**, enter **spark_catalog_role**.
5. For **Privileges**, select **CATALOG_MANAGE_CONTENT**, and then select **Create**.

   This gives the role privileges to create, read, and write to tables.
6. Select **Grant to Principal Role**.
7. In the **Grant Catalog Role** dialog, for **Principal role to receive grant**, select **my_spark_admin_role**.
8. For **Catalog role to grant**, select **spark_catalog_role**, and then select **Grant**.

As a result of this procedure, the role spark_catalog_role is granted to my_spark_admin_role, which gives admin
privileges for the Spark connection that you created in the previous procedure.

### Set up Spark

From your terminal, run the following commands to activate the virtual environment you created in the setup, and open Jupyter Notebooks:

```python
conda activate iceberg-lab
jupyter notebook
```

### Configure Spark

* To register the service connection, run the following commands in a Jupyter notebook.

  ```python
  import os
  os.environ['SPARK_HOME'] = '/Users/<username>/opt/anaconda3/envs/iceberg-lab/lib/python3.12/site-packages/pyspark'

  import pyspark
  from pyspark.sql import SparkSession

  spark = SparkSession.builder.appName('iceberg_lab') \
  .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160,software.amazon.awssdk:url-connection-client:2.20.160') \
  .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
  .config('spark.sql.defaultCatalog', 'opencatalog') \
  .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
  .config('spark.sql.catalog.opencatalog.type', 'rest') \
  .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
  .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
  .config('spark.sql.catalog.opencatalog.credential','<client_id>:<client_secret>') \
  .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
  .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
  .getOrCreate()

  #Show namespaces
  spark.sql("show namespaces").show()

  #Create namespace
  spark.sql("create namespace spark_demo")

  #Use namespace
  spark.sql("use namespace spark_demo")

  #Show tables; this will show no tables since it is a new namespace
  spark.sql("show tables").show()

  #create a test table
  spark.sql("create table test_table (col1 int) using iceberg");

  #insert a record in the table
  spark.sql("insert into test_table values (1)");

  #query the table
  spark.sql("select * from test_table").show();
  ```

  For more information, see [Register a service connection in Spark](../register-service-connection.md).

#### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<client_id>` | Specifies the client ID for the service principal to use.   Enter the **Client ID** that you copied when you configured a new service connection. |
| `<client_secret>` | Specifies the client secret for the service principal to use.   Enter the **Secret** that you copied when you configured a new service connection. |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account.   Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<principal_role_name>` | Specifies the principal role that is granted to the service principal.  To view this principal role, in Open Catalog, select the **Connections** page, select your service connection, and in the **Principal Details** dialog, refer to **Principal Roles.** |

#### Optional: S3 cross region

When your Open Catalog account is hosted on Amazon S3 but is located in a different region compared to the region where your S3 storage bucket is located, you must provide an additional Spark configuration setting:

```python
.config('spark.sql.catalog.opencatalog.client.region','<target_s3_region>') \
```

`<target_s3_region>` specifies the region where your S3 storage bucket is located. For the list of region codes, see [Regional endpoints](https://docs.aws.amazon.com/general/latest/gr/rande.html#regional-endpoints) in the AWS documentation.

The following code example is modified to include the s3 region:

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
.config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160,software.amazon.awssdk:url-connection-client:2.20.160') \
.config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
.config('spark.sql.defaultCatalog', 'opencatalog') \
.config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
.config('spark.sql.catalog.opencatalog.type', 'rest') \
.config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
.config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
.config('spark.sql.catalog.opencatalog.credential','<client_id>:<secret>') \
.config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
.config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
.config('spark.sql.catalog.opencatalog.client.region','<target_s3_region>') \
.getOrCreate()
```

### Query the tables using Snowflake

You can create a catalog integration object in Snowflake and create an Apache Iceberg™ table in Snowflake that represents the table in
Open Catalog. In the following example, you create an Iceberg table in Snowflake that represents the Iceberg table just created by Spark in the
internal catalog in Open Catalog.

You can use the same Spark connection credentials, or you can create a new Snowflake connection. If you create a new connection, you
have to set up roles and privileges accordingly.

1. Create a catalog integration object:

   ```sqlsyntax
   CREATE OR REPLACE CATALOG INTEGRATION demo_open_catalog_int
     CATALOG_SOURCE = POLARIS
     TABLE_FORMAT = ICEBERG
     CATALOG_NAMESPACE = '<catalog_namespace>'
     REST_CONFIG = (
       CATALOG_URI = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/polaris/api/catalog'
       CATALOG_NAME = ‘<catalog_name>’
     )
       REST_AUTHENTICATION = (
       TYPE = OAUTH
       OAUTH_CLIENT_ID = '<client_id>'
       OAUTH_CLIENT_SECRET = '<secret>'
       OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:ALL')
     )
     ENABLED = TRUE;

   # the <catalog_namespace> created in previous step is spark_demo.
   # the <catalog_name> created in previous step is demo_catalog.
   ```
2. Create the table representation in Snowflake using the catalog integration created above, and query the table:

   ```sqlsyntax
   CREATE OR REPLACE ICEBERG TABLE test_table
     CATALOG = 'demo_open_catalog_int'
     EXTERNAL_VOLUME = '<external_volume>'
     CATALOG_TABLE_NAME = 'test_table';

   SELECT * FROM test_table;
   ```

## Use case 2: Sync Apache Iceberg™ tables from Snowflake to Open Catalog

If you have Iceberg tables in Snowflake, you can sync them to Open Catalog so other engines can query those tables.

### Create an external catalog in Open Catalog

The Iceberg tables from Snowflake can be synchronized in an external catalog in your Open Catalog account.

1. Sign in to your new Open Catalog account.
2. To create a new catalog, in the pane on the left, select **Catalogs**.
3. Select **+Catalog** in the upper right.
4. In the **Create Catalog** dialog, enter the following details:

   * **Name**: Name the catalog **demo_catalog_ext**.
   * Set the toggle for **External** to **On**.
   * **Default base location:** The location where the table data will be stored.

     **Note**

     > You must use a different storage location, compared to the internal catalog you created during Use case 1 of this tutorial. To ensure that the access privileges defined for a catalog are enforced correctly, two different catalogs can’t have
     > overlapping locations.
   * **Additional locations (optional):** A comma separated list of multiple storage locations. It is mainly used if you need to import tables from different locations in this catalog. You can leave it blank.
   * **S3 role ARN:** An AWS role that has read-write access to storage locations.
   * **External ID: (optional):** A secret that you want to provide while creating a trust relationship between catalog user and storage account.
     If you skip this, it will be auto-generated. Use a simple string like **abc123** for this tutorial.
5. Select **Create**. The following values are added to your catalog:

   * The **IAM user arn** for your Open Catalog account.
   * If you didn’t enter an External ID yourself, an **External ID** is auto-generated for your catalog.

### Configure a new service connection for Snowflake

1. In Open Catalog, in the left pane, select the **Connections** tab, and then select **+ Connection** in the upper right.
2. In the **Configure Service Connection** dialog, create a new principal role or choose from one of the available roles.
3. Select **Create**.
4. From the **Configure Service Connection** dialog, to copy the Client ID and Client Secret to a text editor, select **Copy** inside the
   **As <CLIENT ID>:<SECRET>** field.

   **Important**

   > You won’t be able to retrieve these text strings from the Open Catalog service later, so you must copy them now. You use these text
   > strings when you configure Spark.

   **Note**

   > In this tutorial, you connect to Open Catalog with a service connection. If you need to connect to Open Catalog with External OAuth or key pair authentication, see:

   > * [Configure External OAuth in Snowflake Open Catalog](../external-oauth-configure.md). This topic includes instructions for setting up catalog privileges and setting up Spark that are specific to External OAuth.
   > * [Configure key pair authentication in Snowflake Open Catalog](../key-pair-auth-configure.md). This topic includes instructions for setting up catalog privileges and setting up Spark that are specific to key pair authentication.

### Set up catalog privileges

To set up privileges on the external catalog so Snowflake connection has the right privileges for an external catalog, follow these steps:

1. In the navigation pane, select **Catalogs**, and then select your external catalog in the
   list.
2. To create a new role, select the **Roles** tab.
3. Select **+ Catalog role**.
4. In the **Create Catalog Role** dialog, for **Name**, enter **spark_catalog_role**.
5. For **Privileges**, select **CATALOG_MANAGE_CONTENT**, and then select **Create**.

   This gives the role privileges to create, read, and write to tables.
6. Select **Grant to Principal Role**.
7. In the **Grant Catalog Role** dialog, for **Principal role to receive grant**, select **my_spark_admin_role**.
8. For **Catalog role to grant**, select **spark_catalog_role**, and then select **Grant**.

### Create a catalog integration object in Snowflake

In Snowflake, create a catalog integration object by using the [CREATE CATALOG INTEGRATION (Snowflake Open Catalog) command](https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-open-catalog).
For CATALOG_NAME, specify the name of the external catalog that you configured in your Open Catalog account (demo_catalog_ext).

Snowflake syncs the table and its parent namespace to this external catalog in Open Catalog. For example, if you have an `open_catalog_demo.iceberg.test_table_managed`
Iceberg table registered in Snowflake and you specify `demo_catalog_ext` in the catalog integration, Snowflake syncs the table with Open Catalog with the following fully qualified name: `demo_catalog_ext.open_catalog_demo.iceberg.test_table_managed`.

```sqlsyntax
CREATE OR REPLACE CATALOG INTEGRATION demo_open_catalog_ext
  CATALOG_SOURCE=POLARIS
  TABLE_FORMAT=ICEBERG
  REST_CONFIG = (
    CATALOG_URI = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = '<catalog_name>'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = '<client_id>'
    OAUTH_CLIENT_SECRET = '<secret>'
    OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:ALL')
  )
  ENABLED=TRUE;

# the <catalog_name> created in previous step is demo_catalog_ext.
```

### Set up catalog sync

Before you can sync a Snowflake-managed Iceberg table to Open Catalog, you must specify the external catalog in Open Catalog that Snowflake
should sync the table to.

To set up catalog sync, use the [ALTER DATABASE](https://docs.snowflake.com/en/sql-reference/sql/alter-database) command with the CATALOG_SYNC
parameter. For the value of this parameter, specify the name of the catalog integration for Open Catalog. For example:

```sqlsyntax
ALTER DATABASE open_catalog_demo SET CATALOG_SYNC = 'demo_open_catalog_ext';
```

After running this code, Snowflake syncs all Snowflake-managed Iceberg tables in the `open_catalog_demo` database to the `<catalog_name>` external catalog
in Open Catalog that you specified in the `demo_open_catalog_ext` catalog integration.

### Create a Snowflake-managed Iceberg table

Create a Snowflake-managed Iceberg table and sync it from Snowflake to Open Catalog. For more information, see:

* [Configure an external volume for Amazon S3](https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-s3)
* [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-snowflake)

**Important**

> The `STORAGE_BASE_URL` for the external volume must match the **Default base location** for the external catalog you created in Open Catalog.

```sqlsyntax
use database open_catalog_demo;
use schema iceberg;

# Note that the storage location for this external volume will be different than the storage location for the external volume in use case 1

CREATE OR REPLACE EXTERNAL VOLUME snowflake_demo_ext
  STORAGE_LOCATIONS =
      (
        (
            NAME = '<storage_location_name>'
            STORAGE_PROVIDER = 'S3'
            STORAGE_BASE_URL = 's3://<s3_location>'
            STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::<aws_acct>:role/<rolename>'
            STORAGE_AWS_EXTERNAL_ID = '<external_id>'
        )
      );

CREATE OR REPLACE ICEBERG TABLE test_table_managed (col1 int)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'snowflake_demo_ext'
  BASE_LOCATION = 'test_table_managed'
```

When you modify the table in Snowflake, the changes are automatically synchronized with the external catalog in your Open Catalog account.
Other engines such as Apache Spark™ can query the table by connecting to Open Catalog.

**Note**

> If the table fails to sync to Open Catalog, run the SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG system function to diagnose the reason
> for the sync failure. For more information, see [SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG](https://docs.snowflake.com/en/sql-reference/functions/system_send_notifications_to_catalog).

## Conclusion

You can use an internal catalog in your Open Catalog account to create tables, query them, and run DML against the tables using Apache Spark™ or other query engines.

In Snowflake, you can create a catalog integration for Open Catalog to perform the following tasks:

* Run queries on Open Catalog managed tables.
* Sync Snowflake tables to an external catalog in your Open Catalog account.

### What you learned

* Create an Open Catalog account.
* Create an internal catalog in your Open Catalog account.
* Use Spark to create tables on the internal catalog.
* Use Snowflake to create a catalog integration for Open Catalog to run queries on a table created on an internal catalog in your Open
  Catalog account.
* Create an external catalog in your Open Catalog account.
* Create a managed Apache Iceberg™ table in Snowflake and sync it, along with two parent namespaces, to the external catalog in your Open Catalog account. In the
  tutorial, you learned how to set up catalog sync at the database level. However, you can also set it up at the account, schema, or table
  level, and sync it with one parent namespace. For more information, see the following topics:

  + For an example of setting up catalog sync at the schema level, see [Set up catalog sync at the schema level](https://docs.snowflake.com/en/user-guide/tables-iceberg-open-catalog-sync#set-up-catalog-sync-at-the-schema-level)
    in the Snowflake documentation.
  + For more information on setting up catalog sync, see [CATALOG_SYNC](https://docs.snowflake.com/en/sql-reference/parameters#catalog-sync)
    in the Snowflake documentation.
  + To sync the table with one parent namespace, set the CATALOG_SYNC_NAMESPACE_MODE property with the CREATE DATABASE command. To learn more, see [CREATE DATABASE](https://docs.snowflake.com/en/sql-reference/sql/create-database)
    in the Snowflake documentation.

    > **Note:**
    >
    > If your third-party query engine can only query tables located up to the second namespace level in a
    > catalog, you must sync the table with one parent namespace. Otherwise, Snowflake will sync the table to the
    > third namespace level in Open Catalog and you can’t query the table.

### Related resources

* [Snowflake Iceberg tables documentation](https://docs.snowflake.com/en/user-guide/tables-iceberg)
* [Apache Polaris™ (incubating) GitHub repository](https://github.com/apache/polaris)
* [Apache Iceberg documentation](https://iceberg.apache.org/)

---
title: Getting started with Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-gs.md
section: User Guide
---

# Getting started with Snowsight

This topic describes how to get started with Snowsight, the Snowflake web interface.

> **Note:**
>
> Some Snowsight features require a warehouse to run SQL queries for retrieving data, such as Task Run History or
> Data Preview for a table. An X-Small warehouse is recommended and generally sufficient for most of these queries. For information,
> see [Warehouse considerations](warehouses-considerations.md). Features that execute queries against a warehouse incur compute costs. For
> strategies to reduce usage credits, see [Optimizing cost](cost-optimize.md).

## Signing in to Snowsight

You can access Snowsight over the internet or through private connectivity to the Snowflake service:

* Using the internet
* Using private connectivity

After signing in to Snowsight, you see your recently updated worksheets.
See [Getting started with worksheets](ui-snowsight-worksheets-gs.md).

### Using the internet

To access Snowsight over the public internet, complete the following steps:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](admin-account-identifier.md) or account URL.
   If you previously signed in to Snowsight, you might see an account name that you can select.
3. Choose your authentication method, and then sign in.

### Using private connectivity

After completing the configuration to use private connectivity,
sign in to Snowsight with private connectivity directly:

1. In the browser location bar, enter either of the following URLs:

   * `https://app-orgname-account_name.privatelink.snowflakecomputing.com`
   * `https://app.cloud_region_id.privatelink.snowflakecomputing.com`

   Where:

   * `orgname` is the name of your Snowflake organization.
   * `account_name` is the unique name of your account within your organization.
   * `cloud_region_id` is the identifier for the cloud region, which is controlled by the cloud platform.

   After signing in, you can find these details in the account selector in Snowsight.

   For details, see Locate your Snowflake account information in Snowsight and [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).

   > **Note:**
   >
   > If you are unsure of the values to enter, contact your internal Snowflake administrator before contacting Snowflake
   > Support.
2. Choose your authentication method, and then sign in.

## Snowsight and MFA

Snowflake takes security seriously and strongly encourages all users to configure multi-factor authentication (MFA). Users signing in to
Snowsight who have not yet configured MFA will be prompted to do so. You can dismiss the request, however you will
be re-prompted every three days. You will also receive Trust Center notifications until your account is enrolled. For more information, see
[Enable notifications from Trust Center](ui-snowsight-profile.md).

To configure MFA:

1. Select your username, and then select My Profile.
2. In the Multi-factor authentication section, select Enroll.
3. Follow the prompts to configure MFA for your device type.

For more information see [Enroll in multi-factor authentication (MFA)](ui-snowsight-profile.md).

### Switch to a different Snowflake account

You can sign in to a different Snowflake account by following these steps:

1. While signed in to Snowsight, select your username at the bottom of the navigation bar.
2. Select an account that you have previously signed in to, or select Sign Into Another Account.

   You’re prompted to sign in to the selected account.

## Supported browsers for using Snowsight

Snowsight supports the latest three major versions of the following browsers:

* Apple Safari for macOS
* Google Chrome
* Microsoft Edge
* Mozilla Firefox

## Access Snowsight through a proxy or firewall

To access Snowsight through a proxy or firewall, you might need to add the fully qualified URL and port values to the proxy servers
or firewall configuration.

To determine the fully qualified URL and port for Snowsight, run the [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) function
and review the `SNOWSIGHT_DEPLOYMENT` entry in the return value.

## Locate your Snowflake account information in Snowsight

To locate account information, such as the account identifier or URL, for either your current account or one that you have previously
signed in to, follow these steps:

1. Open the account selector and review the list of accounts that you previously signed in to.
2. Select View account details.

   The Account Details dialog displays information about the account, including the account identifier and the account URL.

## Switch your primary role

While using Snowsight, you can change the primary role in your current session. Your primary role, along with any activated secondary roles, determines which pages in Snowsight you can access, as well as which databases, tables, and other objects you can see and the actions you can perform on them (excluding object creation, which is tied to your primary role).

To switch your primary role:

1. To open the user menu, in the navigation menu, select your username.
2. Select your current primary role. For example, PUBLIC.

   The role selector appears.
3. Select the role that you want to use as your new primary role. For example, ACCOUNTADMIN.

To learn more about roles and privileges, see [Overview of Access Control](security-access-control-overview.md).

## Configuring private connectivity for Snowsight

Before you can set up private connectivity for Snowsight, you must set up private connectivity for your Snowflake account.
Follow the guide specific to the cloud platform that hosts your Snowflake account:

* [AWS](admin-security-privatelink.md)
* [Azure](privatelink-azure.md)
* [Google Cloud Platform](private-service-connect-google.md)

To use private connectivity with Snowsight, configure your DNS and ensure firewalls allow access to the relevant values:

1. Using the ACCOUNTADMIN role, call the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function in your Snowflake account
   and identify the values for the following:

   > * `privatelink-account-url`
   > * `snowsight-privatelink-url`
   > * `regionless-snowsight-privatelink-url`
2. Confirm that your DNS settings can resolve the values.
3. Confirm that you can connect to Snowsight using each of these URLs from your browser.
4. By default, changing the value of `regionless-snowsight-privatelink-url` only updates the connection URL used internally by
   Snowsight to connect to your Snowflake account. It does **not** automatically update the primary Snowsight access URL
   or change URL redirects.

   If you want to use the account name URL (the value for `regionless-snowsight-privatelink-url`) as your primary URL to access
   Snowsight and have all URL redirects point to it, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and request this configuration change.

   When contacting Support, provide:

   * The exact URL you want to use as your primary Snowsight access URL.
   * A reference to [Private connectivity URLs](organizations-connect.md).

---
title: Getting started with the Trust Center
source: https://docs.snowflake.com/en/user-guide/trust-center/getting-started.md
section: User Guide
---

# Getting started with the Trust Center

You can use the Trust Center to check for common security risks in your Snowflake account, and get recommendations
on how to remediate those risks.

## Enable the CIS Benchmarks scanner package

Complete the following steps to enable the [CIS Benchmarks scanner package](overview.md):

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select a warehouse.
5. Select Scanner Packages.
6. Select CIS Benchmarks.
7. Select Enable and then Continue.

After you enable the scanner package, you can
[enable or disable individual scanners in the scanner package](using-the-trust-center.md). You can also
[change the schedule of individual scanners in the scanner package](using-the-trust-center.md).

## Enable the Threat Intelligence scanner package

Complete the following steps to enable the [Threat Intelligence scanner package](overview.md):

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select a warehouse.
5. Select Scanner Packages.
6. Select Threat Intelligence.
7. Select Enable and then Continue.

After you enable the scanner package, you can
[enable or disable individual scanners in the scanner package](using-the-trust-center.md). You can also
[change the schedule of individual scanners in the scanner package](using-the-trust-center.md).

## Ensure multi-factor authentication (MFA) is enforced for all human users using password-based authentication

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Ensure you have enabled the CIS Benchmarks scanner package.
5. Select Violations.
6. Above the list of violations, select Search.
7. In the Search box, enter `multi-factor authentication`.
8. Under the Violation column, select
   `Ensure multi-factor authentication (MFA) is turned on for all human users with password-based authentication`.

   A side panel opens.
9. In the side panel, select Remediation, and follow the guidance.

## Find over-privileged roles

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Ensure you have enabled the CIS Benchmarks scanner package.
5. Select Violations.
6. Above the list of violations, select Search.
7. In the Search box, enter `snowflake tasks`.
8. Under the Violation column, select
   `Ensure that Snowflake tasks do not run with the ACCOUNTADMIN or SECURITYADMIN role privileges`.

   A side panel opens.
9. In the side panel, select Remediation, and follow the guidance.

## Ensure the amount of users with the ACCOUNTADMIN and SECURITYADMIN system roles is limited

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Ensure you have enabled the CIS Benchmarks scanner package.
5. Select Violations.
6. Above the list of violations, select Search.
7. In the Search box, enter `limit the number of users`.
8. Under the Violation column, select
   `Limit the number of users with ACCOUNTADMIN and SECURITYADMIN`.

   A side panel opens.
9. In the side panel, select Remediation, and follow the guidance.

## Find users who have not logged in for 90 days

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Ensure you have enabled the CIS Benchmarks scanner package.
5. Select Violations.
6. Above the list of violations, select Search.
7. In the Search box, enter `did not log in`.
8. Under the Violation column, select
   `Ensure that users who did not log in for 90 days are disabled`.

   A side panel opens.
9. In the side panel, select Remediation, and follow the guidance.

## Find risky users and mitigate authentication risks

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Ensure you have enabled the Threat Intelligence scanner package.
5. Select Violations.
6. Above the list of violations, select Search.
7. In the Search box, enter `Ensure that every user is subject to an authentication policy`.
8. Under the Violation column, select
   `Ensure that every user is subject to an authentication policy that requires MFA enrollment`.

   A side panel opens.
9. In the side panel, select Remediation, and follow the guidance.

For more information, see the following resources:

* [How Organizations Can Use Snowflake To Move Beyond A Password-Only Sign-in Process (Whitepaper)](https://www.snowflake.com/en/resources/white-paper/best-practices-to-mitigate-the-risk-of-credential-compromise/)
* [Best Practices to Mitigate the Risk of Credential Compromise (Video)](https://youtu.be/XT16HYfaRzg?si=lojzoYbxpioxJcCF)

## Next steps

* [Using the Trust Center](using-the-trust-center.md)

---
title: Getting started with worksheets
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-worksheets-gs.md
section: User Guide
---

# Getting started with worksheets

> **Important:**
>
> Legacy Worksheets will be removed from Snowsight on **June 22, 2026**.
> [Workspaces](ui-snowsight/workspaces.md) is the replacement
> SQL editing experience. For the full deprecation timeline and migration guidance, see
> [Deprecation of Legacy Worksheets and Dashboards](../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

View and create worksheets in Snowsight.

SQL worksheets let you write and run SQL statements, explore and filter query results, and visualize the results.
See [Querying data using worksheets](ui-snowsight-query.md) and [Visualizing worksheet data](ui-snowsight-visualizations.md).
You can also write Snowpark Python in worksheets. See [Writing Snowpark Code in Python Worksheets](../developer-guide/snowpark/python/python-worksheets.md).

Manage your worksheets by organizing them into folders, share worksheets with colleagues that also use Snowflake, and
manage the version history for worksheets. For more details, see [Work with worksheets in Snowsight](ui-snowsight-worksheets.md).

## Viewing worksheets in Snowsight

After signing in to Snowsight, you see the worksheets in your account.

Using the options, you can view recent worksheets opened by you, worksheets that your colleagues have shared with you,
worksheets that you created and own, or folders you created or that your colleagues have shared with you.

For any worksheet or worksheet folder, you can review the title, roughly when the worksheet or folder was last viewed or updated,
and the role associated with the worksheet or folder. In each row, you can see the initials of the user that owns the worksheet or folder.
You can sort by any column in the table.

Use the Search option to search the titles and contents of worksheets and dashboards that you can access.

## Create worksheets in Snowsight

To create a worksheet in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Select + and select SQL Worksheet or Python Worksheet to create a worksheet.

   The worksheet opens in the same window with the date and time of creation as the default title.

You can then start writing in your worksheet. For a SQL worksheet, [start writing queries](ui-snowsight-query.md).
For a Python worksheet, [start writing code](../developer-guide/snowpark/python/python-worksheets.md).

### Create worksheets from a SQL file

To create a SQL worksheet from an existing SQL file, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Select the … more menu » Create Worksheet from SQL File.
4. Browse to the SQL file to upload.
5. A new worksheet opens with a title that matches the file name.

You can also add a SQL file to an existing SQL worksheet. Refer to [Append a SQL script to an existing worksheet](ui-snowsight-query.md).

## Opening worksheets in tabs

You can use tabs to refer to multiple active worksheets and explore the databases and schemas in Snowflake while writing SQL
statements or Python code in Snowsight. Your scroll position is preserved in each tab, making comparisons across worksheets easier
to perform. Worksheet tabs are preserved across sessions, so you can pick up your work where you left off.

To open your Snowsight worksheets in tabs, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Select an existing worksheet, or select + Worksheet to open a new worksheet.
4. Select a role to run the worksheet as, and select a warehouse to allocate the compute resources for your query.
5. In the Worksheets menu, select an existing worksheet or select + to open a new worksheet tab. By default, the new worksheet
   uses your default role and warehouse.
6. (Optional) Make changes to the role or warehouse used to run the new worksheet.

After you open a worksheet, you can [update the contents](ui-snowsight-worksheets.md),
[run SQL statements](ui-snowsight-query.md) or
[write Python code](../developer-guide/snowpark/python/python-worksheets.md), and manage the worksheet.

---
title: Git repository replication
source: https://docs.snowflake.com/en/user-guide/account-replication-git-repositories.md
section: User Guide
---

# Git repository replication

This topic provides information about Snowflake support for replicating Git repository objects.

Before you get started, we recommend that you be familiar with Snowflake support for Git repositories.
For more information, see [Using a Git repository in Snowflake](../developer-guide/git/git-overview.md).

## Considerations for replicating Git repository clones

To replicate any Git repository objects that you’ve integrated with Snowflake,
you specify the database or schema that contains the Git repository object
in a replication group or a failover group. You don’t have to perform any
separate step to enable replication for Git repository clones.

The secrets from the primary system are replicated to the secondary system.

On the secondary system, you can read from the repository. However, you can’t commit, fetch from, or push to
the remote `origin` server from the secondary system. After you promote the secondary system to be the primary
by failing over, you can perform these other operations on the Git repository.

Snowflake supports replication for Git repository clones up to 5 GB in size. Larger repositories currently aren’t supported.

---
title: Google Cloud Private Service Connect and Snowflake
source: https://docs.snowflake.com/en/user-guide/private-service-connect-google.md
section: User Guide
---

# Google Cloud Private Service Connect and Snowflake

This topic describes concepts and how to configure Google Cloud Private Service Connect to connect your Google Cloud Virtual
Private Cloud (VPC) network subnet to your Snowflake account hosted on Google Cloud without traversing the public Internet.

Note that Google Cloud Private Service Connect is not a service provided by Snowflake. It is a Google service that Snowflake
enables for use with your Snowflake account.

## Overview

Google Cloud [Private Service Connect](https://cloud.google.com/vpc/docs/private-service-connect) provides private connectivity to
Snowflake by ensuring that access to Snowflake is through a private IP address. Snowflake appears as a resource in your network (that is,
customer network), but the traffic flows one-way from your VPC to Snowflake VPC over the Google networking backbone. This setup
significantly simplifies the network configuration while providing secure and private communication.

The following diagram summarizes the Google Cloud Private Service Connect architecture with respect to the customer Google Cloud VPC and
the Snowflake service.

The Google Compute Engine (that is, a virtual machine) connects to a private, virtual IP address that routes to a forwarding rule (1). The
forwarding rule connects to the service attachment through a private connection (2). The connection is routed through a load balancer (3)
that redirects to Snowflake (4).

### Limitations

* Maximum 10 connections per project.
* Maximum 50 connections per account.
* Some Snowflake system functions for self-service management are not supported. For information, see
  [Current Limitations for Accounts on Google Cloud](intro-cloud-platforms.md).

  For details, see:

  + [Account identifiers](admin-account-identifier.md)
  + [Connecting to your accounts](organizations-connect.md)

## Authorize Private Service Connect for your account

This section describes how to authorize Snowflake to accept network traffic over Private Service Connect.

1. Sign in to the Google Cloud account that has access to the project that you plan to authorize. You can use your
   [Google Cloud CLI](https://cloud.google.com/sdk/gcloud) environment to execute the following:

   ```bash
   gcloud auth login
   ```

   If you want to check the current account, execute the following:

   ```bash
   gcloud auth list
   ```
2. Use the Google Cloud CLI to create an access token by executing the following command:

   ```bash
   gcloud auth print-access-token
   ```

   This command generates an access token for your Google Cloud account. By default, the token expires after 1 hour. If you need to
   authorize, verify, or revoke authorization for Private Service Connect after the token expires, you’ll need to repeat this step to
   generate a new token.

   If you have a service account to the Google Cloud project, you can generate a [short-lived access token](https://cloud.google.com/iam/docs/create-short-lived-credentials-direct#sa-credentials-oauth) instead, but be sure the lifetime of the token is long enough to finish
   these configuration steps.
3. As a Snowflake account administrator (that is, a user with the ACCOUNTADMIN system role), call the
   [SYSTEM$AUTHORIZE_PRIVATELINK](../sql-reference/functions/system_authorize_privatelink.md) function to authorize (that is, enable) Private Service Connect for your
   Snowflake account. The syntax of this function for Private Service Connect is:

   ```sqlsyntax
   SELECT SYSTEM$AUTHORIZE_PRIVATELINK ( '<gcp_project_id>' , '<access_token>' )
   ```

   Where:

   * `gcp_project_id` is the Google Cloud Project ID from which you plan to create endpoints and connect to Snowflake securely.
   * `access_token` is the access token that you generated in a previous step in this configuration procedure.

   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$AUTHORIZE_PRIVATELINK (
    'my-gcp-project-id',
    'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
   );
   ```
4. Call the [SYSTEM$GET_PRIVATELINK](../sql-reference/functions/system_get_privatelink.md) function to verify that Private Service Connect was successfully
   authorized for your Snowflake account. Pass in the same arguments that you used to authorize. For example:

   ```sqlexample
   SELECT SYSTEM$GET_PRIVATELINK(
    'my-gcp-project-id',
    'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
   );
   ```

   Snowflake returns `Account is authorized for PrivateLink` if the account is authorized for Private Service Connect.

## Configure your Google Cloud VPC environment

This section covers the Snowflake-specific details for configuring your Google Cloud VPC environment.

> **Important:**
>
> Snowflake is not responsible for the configuration of your Google Cloud environment. This procedure shows the basics of using the
> Google Cloud CLI, but is not a definitive guide. For example:
>
> * You could use the Google Cloud console to configure your Google Cloud environment instead of the Google Cloud CLI, which would change the
>   steps. For example, when using the Google Cloud console, you are creating an endpoint, not a forwarding rule.
> * It does not show you how to configure required firewall updates and DNS records.
> * It does not show you how to make an endpoint available in other regions (Private Service Connect endpoints are regional resources).
>   For more information about making an endpoint available in other regions, see the [Google documentation](https://cloud.google.com/vpc/docs/about-accessing-vpc-hosted-services-endpoints#global-access).
>
> For additional help, contact your internal Google Cloud administrator.

1. As a Snowflake account administrator (that is, a user with the ACCOUNTADMIN system role), open a worksheet and call the
   [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function. You need to save the output for subsequent steps.

   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT key, value FROM TABLE(flatten(input=>parse_json(system$get_privatelink_config())));
   ```
2. Use the Google Cloud CLI to update the
   [gcloud library](https://cloud.google.com/sdk/gcloud/reference/components/update) to the latest version:

   ```bash
   gcloud components update
   ```
3. [Authenticate](https://cloud.google.com/sdk/gcloud/reference/auth/login) to Google Cloud using the following command:

   ```bash
   gcloud auth login
   ```
4. In your Google Cloud VPC, set the project [ID](https://cloud.google.com/sdk/gcloud/reference/config/set) in which the forwarding
   rule should reside.

   ```bash
   gcloud config set project <project_id>
   ```

   To obtain a list of project IDs, execute the following command:

   > ```bash
   > gcloud projects list --sort-by=projectId
   > ```
5. In your Google Cloud VPC, [create](https://cloud.google.com/sdk/gcloud/reference/compute/addresses/create) a virtual IP address:

   ```bash
   gcloud compute addresses create <customer_vip_name> \
   --subnet=<subnet_name> \
   --addresses=<customer_vip_address>
   --region=<region>
   ```

   Where:

   * `customer_vip_name` specifies the name of the virtual IP rule (for example, `psc-vip-1`).
   * `subnet_name` specifies the name of the subnet.
   * `customer_vip_address` specifies an IP address to which all private connectivity URLs resolve. Specify an IP address from your
     network or use CIDR notation to specify a range of IP addresses.
   * `region` specifies the cloud region where your Snowflake account is located.

   For example:

   ```bash
   gcloud compute addresses create psc-vip-1 \
   --subnet=psc-subnet \
   --addresses=192.168.3.3 \
   --region=us-central1
   ```

   Output:

   ```output
   Created [https://www.googleapis.com/compute/v1/projects/docstest-123456/regions/us-central1/addresses/psc-vip-1].
   ```
6. Create a [forwarding rule](https://cloud.google.com/sdk/gcloud/reference/compute/forwarding-rules/create) to have your subnet route
   to the Private Service Connect endpoint, and then to the Snowflake service endpoint.

   ```bash
   gcloud compute forwarding-rules create <name> \
   --region=<region> \
   --network=<network_name> \
   --address=<customer_vip_name> \
   --target-service-attachment=<privatelink-gcp-service-attachment>
   ```

   Where:

   * `name` specifies the name of the forwarding rule.
   * `region` specifies the cloud region where your Snowflake account is located.
   * `network_name` specifies the name of the network for this forwarding rule.
   * `customer_vip_name` specifies the `<name>` value (that is, `psc-vip-1`) of the virtual IP address created in the previous
     step.
   * `privatelink-gcp-service-attachment` specifies the endpoint for the Snowflake service, which you obtained when you executed the
     [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function in an earlier step.

   For example:

   ```bash
   gcloud compute forwarding-rules create test-psc-rule \
   --region=us-central1 \
   --network=psc-vpc \
   --address=psc-vip-1 \
   --target-service-attachment=projects/us-central1-deployment1-c8cc/regions/us-central1/serviceAttachments/snowflake-us-central1-psc
   ```

   Output:

   ```output
   Created [https://www.googleapis.com/compute/projects/mdlearning-293607/regions/us-central1/forwardingRules/test-psc-rule].
   ```
7. Use the following command to verify the forwarding-rule was created
   [successfully](https://cloud.google.com/sdk/gcloud/reference/compute/forwarding-rules/list):

   ```bash
   gcloud compute forwarding-rules list --regions=<region>
   ```

   Where:

   * `region` is the cloud region where your Snowflake account is located. For example, if your Snowflake account is located in
     the `europe-west2` region, replace `<region>` with `europe-west2`.

   For a complete list of Google Cloud regions and their formatting, see [Viewing a list of available regions](https://cloud.google.com/compute/docs/regions-zones/viewing-regions-zones#viewing_a_list_of_available_regions).
8. Update your DNS settings.

   All requests to Snowflake need to be routed through the Private Service Connect endpoint so that the URLs returned by the
   [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function resolve to the VIP address that you created (`<customer_vip_address>`).

   The values to obtain from the output of SYSTEM$GET_PRIVATELINK_CONFIG depend on which Snowflake features you access using private
   connectivity. For a description of the possible values, see [Return values](../sql-reference/functions/system_get_privatelink_config.md).

   Note that the values for `regionless-snowsight-privatelink-url` and `snowsight-privatelink-url` allow access to
   Snowsight and the Snowflake Marketplace using private connectivity. However, there is additional configuration if you want to enable
   URL redirects. For information, see [Snowsight & Private Connectivity](ui-snowsight-gs.md).

   > **Note:**
   >
   > A full explanation of DNS configuration is beyond the scope of this procedure. For example, you can choose to integrate a private DNS
   > zone into your environment using [Cloud DNS](https://cloud.google.com/dns/docs/overview). Please consult your internal Google Cloud
   > and cloud infrastructure administrators to configure and resolve the URLs in DNS properly.

## Connect to Snowflake

Before connecting to Snowflake, you can optionally leverage SnowCD (Snowflake Connectivity Diagnostic tool) to evaluate the
network connection with Snowflake and Private Service Connect. For more information, see
[SnowCD](snowcd.md) and [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md).

To connect to Snowflake with your private connectivity account, see [Connecting with a URL](organizations-connect.md).

## Revoke authorization

If it’s necessary to disable Private Service Connect in your Snowflake account, call the
[SYSTEM$REVOKE_PRIVATELINK](../sql-reference/functions/system_revoke_privatelink.md) function, using the same argument values that you used to authorize the account.
For example:

```sqlexample
SELECT SYSTEM$REVOKE_PRIVATELINK(
 'my-gcp-project-id',
 'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
);
```

## Using SSO with Google Private Service Connect

Snowflake supports using SSO with Google Cloud Private Service Connect. For more information, see:

* [SSO with private connectivity](admin-security-fed-auth-overview.md)
* [Partner applications](oauth-snowflake-overview.md)

## Using Client Redirect with Google Cloud Private Service Connect

Snowflake supports using Client Redirect with Google Cloud Private Service Connect.

For more information, see [Redirecting client connections](client-redirect.md).

## Using Replication & Tri-Secret Secure with Private Connectivity

Snowflake supports replicating your data from the source account to the target account, regardless of whether you enable
Tri-Secret Secure or this feature in the target account.

## Blocking public access — *Recommended*

After testing the Google Cloud Private Service Connect connectivity with Snowflake, you can optionally block public access to
Snowflake using network policies. For more information, see [Controlling network traffic with network policies](network-policies.md).

Configure the CIDR block range to block public access to Snowflake using your organization’s IP address range. This range can be
from within your virtual network.

Once the CIDR Block ranges are set, only IP addresses within the CIDR block range can access Snowflake.

To block public access using a network policy:

1. Create an IPv4 network rule or edit an existing IPv4 network rule to add the CIDR block range for your organization.
2. Create or modify a network policy to use the IPv4 network rule.
3. Activate the network policy for your account.

---
title: Google Cloud Storage data file encryption
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-encrypt.md
section: User Guide
---

# Google Cloud Storage data file encryption

Cloud Storage always encrypts your data on the server side by default. Snowflake handles the encrypted files correctly.

In addition, Snowflake supports GCS buckets encrypted using a key stored in Cloud KMS on top of the default server-side encryption provided by GCS. The `GCS_SSE_KMS` encryption type accepts an optional `KMS_KEY_ID` value.

For more information, see the Google Cloud documentation:

* <https://cloud.google.com/storage/docs/encryption/customer-managed-keys>
* <https://cloud.google.com/storage/docs/encryption/using-customer-managed-keys>

**Next:** [Configure an integration for Google Cloud Storage](data-load-gcs-config.md)

---
title: Google Private Service Connect endpoints for internal stages
source: https://docs.snowflake.com/en/user-guide/private-internal-stages-gcp.md
section: User Guide
---

# Google Private Service Connect endpoints for internal stages

This topic provides concepts as well as detailed instructions for connecting to Snowflake internal stages using
[Google Private Service Connect endpoints](https://cloud.google.com/vpc/docs/private-service-connect#endpoints).

## Google Private Service Connect endpoints: Overview

You can configure Google Private Service Connect (PSC) endpoints to provide secure, private connectivity to Snowflake internal stages. This
setup ensures that data loading and unloading operations to Snowflake internal stages use the Google PSC network and not
the public internet. The following diagram summarizes this new support:

The following list provides information about the numbers in the diagram:

> * The diagram shows a single PSC endpoint from one Google VPC network that points to a single Snowflake internal stage
>   (2 and 3).

> **Note:**
>
> You can configure multiple private endpoints within the same VPC network that access the same Snowflake internal stage.

* An on-premises user can connect to Snowflake directly, as shown in number 1.
* To connect to a Snowflake internal stage, an on-premises user must route their request through the VPC Network, 2, and then through the
  Google PSC network, 3, to connect to the Snowflake internal stage.

### Benefits

Implementing private endpoints to access Snowflake internal stages provides the following benefits:

* Internal stage data doesn’t traverse the public internet.
* On-premises client and SaaS applications can securely access a Snowflake internal stage bucket by using the Google PSC network.
* Administrators aren’t required to modify firewall settings to access internal stage data.
* Administrators can implement consistent security and monitoring to restrict access to their internal stages.

### Limitations

* A maximum of 10 VPC networks can be allowlisted for a Snowflake account.

## Configure private endpoints to access Snowflake internal stages

To configure private endpoints to access Snowflake internal stages, you must use the following three roles:

* The Snowflake ACCOUNTADMIN system role.
* The Google Cloud administrator.
* The network administrator.

You might need to coordinate your configuration efforts with more than one person or team, depending on the role hierarchy in your organization.

To configure and implement secure access to Snowflake internal stages through Google PSC endpoints, complete the following steps:

1. As a Google Cloud administrator, use the Google Cloud console to get the fully qualified path value that Snowflake uses to limit
   network access.

   1. In <https://console.cloud.google.com>, go to Quick Access » VPC Network, and then select your project in
      » VPC Networks » Name.
   2. In VPC network details, select Equivalent REST.
   3. In Equivalent REST Response, copy the value of `"selfLink"`.

      This value should look something like `projects/vpc_network_name/global/networks/network_name`.

      You will provide this value as the `'google_cloud_vpc_network_name'` argument for the system function in the next step.
2. In Snowflake, use the ACCOUNTADMIN role to authorize access to the internal stage by calling the
   [SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS](../sql-reference/functions/system_authorize_stage_privatelink_access.md) function.
   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS('<google_cloud_vpc_network_name>');
   ```
3. Visit <https://console.cloud.google.com/> as a Google Cloud administrator, create a Google PSC endpoint, and then attach it to the VPC network
   that Snowflake will access:

   1. Create a new endpoint: in Network services » Private Service Connect, select Connect endpoint.
   2. In Target, select `All Google APIs` as the target, and then fill in the required fields.

      > **Note:**
      >
      > `All Google APIs` is appropriate for *global* endpoints. Currently, only global endpoints are supported.
   3. Select ADD ENDPOINT.
4. Record the newly created Google PSC endpoint IP address and the VPC Network ID to which the Google PSC endpoint connects.
5. As the network administrator, configure the DNS settings to resolve the URLs:

   1. In Network services, navigate to Cloud DNS.
   2. Create a new DNS zone with the following settings:

      * **Zone type:** `private`
      * **DNS Name:** `storage.googleapis.com`
      * **Options:** `Default (private)`
      * **Networks:** `prod`
   3. Select CREATE.
6. In the new, private DNS zone, create a new record with the following values:

   1. Use the bucket name for your internal stage.
   2. **Resource record type:** `A`
   3. **IPv4 address:** `10.10.80.55` — Use the IP address of the Google PSC endpoint that you created earlier.
   4. Select CREATE.
7. From a client in the same VPC, confirm that the internal stage URL resolves the IP address of the endpoint by using the `nslookup` or
   `dig` command.

   For example, use the following `dig` command to confirm the resolution:

   ```shell
   dig gcpeuropewest4-63osaw1-stage.storage.googleapis.com
   ```

   A properly configured global endpoint should return a result like the following:

   ```output
   DNS name: gcpeuropewest4-63osaw1-stage
   ```

## Block public access to the internal stage — *Recommended*

Snowflake recommends that you deny all access to your Google PSC endpoints except through the VPC Network that you authorize. This includes
denying public internet access to the internal stages.

To block public access to the internal stage, call the [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](../sql-reference/functions/system_block_internal_stages_public_access.md) function.

Controlling public access to a Google internal stage is different from controlling public access to the Snowflake service. You use the
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function, not a network policy, to block requests to an internal stage. Unlike network policies,
this function can’t block some public IP addresses while allowing others. This function blocks *all* public IP addresses. The
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function can take a few minutes to complete.

### Ensure that public access is blocked

Determine whether public IP addresses can access an internal stage by running the
[SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](../sql-reference/functions/system_internal_stages_public_access_status.md) function.

If the Google Cloud settings currently block all public traffic, this function returns `Public Access to internal stages is blocked`.
This message indicates that the settings weren’t changed after the SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS function was executed.

### Unblock public access

To allow public access to an internal stage that was previously blocked, you can execute the [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](../sql-reference/functions/system_unblock_internal_stages_public_access.md)
function.

Executing this function removes all restrictions from the internal stage.

## Revoke access to Snowflake internal stages

To revoke access to Snowflake internal stages through Google PSC private endpoints, complete the following steps:

1. As a Snowflake administrator, confirm that the [ENABLE_INTERNAL_STAGES_PRIVATELINK](../sql-reference/parameters.md) parameter is set to `TRUE`.
   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SHOW PARAMETERS LIKE 'enable_internal_stages_privatelink' IN ACCOUNT;
   ```
2. As a Snowflake administrator, revoke access to the private endpoint by calling the [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](../sql-reference/functions/system_revoke_stage_privatelink_access.md)
   function, and using the same `google_cloud_vpc_network_name` value that was used to originally authorize access to the private endpoint.
   For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS('<google_cloud_vpc_network_name>');
   ```
3. As a Google Cloud administrator, delete the private endpoint through the Google Cloud portal.
4. As a network administrator, remove the DNS and alias records that were used to resolve the storage account URLs.

Completing these steps revokes access to the VPC network.

---
title: GovRAMP (Moderate and High)
source: https://docs.snowflake.com/en/user-guide/cert-stateramp.md
section: User Guide
---

# GovRAMP (Moderate and High)

This topic describes how Snowflake supports customers with GovRAMP compliance requirements.

## Understanding GovRAMP compliance requirements

GovRAMP is a 501(c)6 nonprofit that standardizes security requirements for cloud offerings and verifies those cloud offerings for use by
state and local governments and public education institutions through independent audits and continuous monitoring. The Snowflake offerings
that are working towards or have achieved GovRAMP authorizations are included on the
[Authorized Product List](https://govramp.org/product-list/).

State and local governments, public education institutions, and special districts are invited to become members of GovRAMP. Government
membership provides access to shared services for managing supplier risk. Providers are also eligible for membership. Provider membership
benefits include: a public profile on the [Authorized Product List](https://govramp.org/product-list/), transferrable credentials,
committee eligibility, access to the complete membership directory, an opportunity to provide feedback on policies, and documentation, and
member education.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Grant privileges to other roles
source: https://docs.snowflake.com/en/user-guide/data-exchange-marketplace-privileges.md
section: User Guide
---

# Grant privileges to other roles

Snowflake provides a set of privileges for working with listings in the Snowflake Marketplace or a Data Exchange.

## Granting administrator privileges in a Data Exchange

By default, only an account administrator (a user with the ACCOUNTADMIN role) in the Data Exchange administrator account can manage a
Data Exchange, which includes the following tasks:

* Add or remove members.
* Approve or deny listing approval requests.
* Approve or deny provider profile approval requests.
* Show categories.

To support delegating these tasks to other users, the IMPORTED PRIVILEGES privilege can be granted on a Data Exchange to other roles.

### Granting the IMPORTED PRIVILEGES privilege to other roles

To grant the IMPORTED PRIVILEGES privilege on a Data Exchange to a role, use the ACCOUNTADMIN role and the
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command.

> **Note:**
>
> The WITH GRANT OPTION parameter does not support the IMPORTED PRIVILEGES privilege.

Syntax:

```sqlsyntax
GRANT IMPORTED PRIVILEGES ON DATA EXCHANGE <exchange_name> TO <role_name>;
```

Where:

* `exchange_name` is the name of a Data Exchange.
* `role_name` is the role to which the privilege is granted.

For example, grant imported privileges on the `mydataexchange` Data Exchange to a custom role called `myrole`:

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT IMPORTED PRIVILEGES ON DATA EXCHANGE mydataexchange TO myrole;
```

### Usage notes

* This privilege is granted at the Data Exchange level. Therefore, users with the role can only administer the Data Exchange for which the
  privilege has been granted.
* Only an account administrator in the Data Exchange administrator account can grant the privilege to another role.
* When a role has been granted IMPORTED PRIVILEGES on a database created from a share, subsequent calls to the
  [SHOW GRANTS](../sql-reference/sql/show-grants.md) command list the privilege as USAGE and **not** IMPORTED PRIVILEGES.
* This privilege can only be used for a Data Exchange. In the Snowflake Marketplace, only Snowflake administrators can perform administrative tasks.

## Granting provider privileges to other roles in the Snowflake Marketplace or a Data Exchange

Snowflake provides a set of privileges to allow providers to perform various tasks related to sharing data and apps with specific consumers,
on the Snowflake Marketplace, or data in a Data Exchange.

|  |  |  |  |
| --- | --- | --- | --- |
| Privilege | Object Type | Can be Granted by | Description |
| Global CREATE LISTING privilege | ACCOUNT | ACCOUNTADMIN | Grants the ability to create a listing or provider profile. |
| CREATE SHARE privilege | ACCOUNT | ACCOUNTADMIN | Grants the ability to create a share. |
| IMPORT SHARE privilege | ACCOUNT | ACCOUNTADMIN | Grants the ability to view an inbound share shared with the account and create a database from the share. |
| PURCHASE DATA EXCHANGE LISTING privilege | ACCOUNT | ACCOUNTADMIN | Grants the ability to purchase a paid listing. |
| MODIFY privilege on a listing | LISTING | Role with the OWNERSHIP privilege on the listing. | Grants the ability to modify listing properties. |
| USAGE privilege on a listing | LISTING | Role with the OWNERSHIP privilege on the listing. | Grants the ability to view a listing. |
| OWNERSHIP privilege on a listing | LISTING | Role with the OWNERSHIP privilege on the listing. | Transfer the OWNERSHIP privilege on the listing. |
| MODIFY privilege on a provider profile | PROVIDER PROFILE | Role with the OWNERSHIP privilege on the profile. | Grants the ability to modify properties for a provider profile. |
| OWNERSHIP privilege on a provider profile | PROVIDER PROFILE | Role with the OWNERSHIP privilege on the profile. | Transfer the OWNERSHIP privilege on the profile. |

### Account-level privileges

Snowflake provides the following privileges for working with shares, listings, and provider profiles at the account level in the
Snowflake Marketplace or a Data Exchange:

* Global CREATE LISTING privilege
* CREATE SHARE privilege
* [IMPORT SHARE privilege](security-access-privileges-shares.md)

#### Global CREATE LISTING privilege

If the global CREATE LISTING privilege is granted to a role, any user with the role can create a listing or provider profile.
As the creator and therefore owner of the listing, the role can be used to perform all tasks on the listing, including:

* Create listings.
* Modify listings properties.
* View listings.
* View incoming listing requests.
* Reject listing requests.
* Submit listings for approval.
* Publish a listings.
* Create and view provider profiles.
* View offers.
* View pricing plans.

If an account is a provider in more than one Data Exchange, a role with the global CREATE LISTING privilege can create listings
in each of those Data Exchanges.

> **Note:**
>
> * A role that creates a listing becomes the owner of the listing. The OWNERSHIP privilege can be transferred using
>   OWNERSHIP privilege on a listing to a different role by the owning role.
> * Only account administrators (users with the ACCOUNTADMIN role) can grant the global CREATE LISTING privilege to a role.

To grant the global CREATE LISTING privilege to a role in a Data Exchange, use the
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) [WITH GRANT OPTION] command.

For example, use the ACCOUNTADMIN role to grant the privilege:

```sqlexample
USE ROLE ACCOUNTADMIN;
```

Then grant the privilege to a custom role, `myrole`:

```sqlexample
GRANT CREATE LISTING ON ACCOUNT TO ROLE myrole;
```

Then grant the privilege to the role `myrole` with grant option:

```sqlexample
GRANT CREATE LISTING ON ACCOUNT TO ROLE myrole WITH GRANT OPTION;
```

#### CREATE SHARE privilege

If the CREATE SHARE privilege is granted to a role, any user with the role can create a share. As the creator and therefore owner of the
share, the role can also be used to perform all tasks on the share, including:

* Granting privileges on objects to or revoking privileges on objects from the share.
* Adding accounts to or removing consumer accounts from the share.

For more information, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

#### IMPORT SHARE privilege

If the IMPORT SHARE privilege is granted to a role, any user with the role can perform the following tasks:

* View all INBOUND shares (shared by provider accounts).
* View all OUTBOUND shares owned by the role.
* Create databases from inbound shares if the role is also granted the global CREATE DATABASE privilege.

For more information, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

#### PURCHASE DATA EXCHANGE LISTING privilege

If the PURCHASE DATA EXCHANGE LISTING privilege is granted to a role, any user with the role can purchase a listing shared privately or
on the Snowflake Marketplace.

For more information about purchasing listings, see
[Becoming a consumer of listings](../collaboration/consumer-becoming.md).

For more information about this privilege, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

### Listing-level privileges

Snowflake provides the following privileges for listings. You can only grant these privileges using the role granted the OWNERSHIP privilege
on the listing.

* MODIFY privilege on a listing
* USAGE privilege on a listing
* OWNERSHIP privilege on a listing

#### MODIFY privilege on a listing

If the MODIFY privilege on a listing is granted to a role, any user with the role can perform the following tasks for a listing:

* Modify listing properties.
* View a listing.
* View incoming listing access requests.
* Submit a listing for approval.
* Publish a listing.
* Reject listing requests.

Only the role with the OWNERSHIP privilege on the listing can grant this privilege.

To grant the MODIFY privilege on a listing shared with specific consumers or published on the Snowflake Marketplace:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

To grant the MODIFY privilege on a listing in a data exchange:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Shared by your account.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

#### USAGE privilege on a listing

If the USAGE privilege on a listing is granted to a role, any user with the role can view listings and incoming listing requests.
Only the role with the OWNERSHIP privilege on the listing can grant this privilege.

To grant the USAGE privilege on a listing shared with specific consumers or published on the Snowflake Marketplace:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

To grant the USAGE privilege on a listing in a data exchange:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Shared by your account.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

#### OWNERSHIP privilege on a listing

If the OWNERSHIP privilege on a listing is granted to a role, that role becomes the new OWNER of the listing. Only the OWNER of the listing
can grant this privilege. OWNERSHIP is a special type of privilege that can only be granted from one role to another role; it cannot be
revoked. For more details, see [Overview of Access Control](security-access-control-overview.md).

> **Important:**
>
> When listing ownership is transferred, all existing grants get revoked. All roles that have been granted privileges immediately lose access to this listing, and their privileges are revoked. The new listing owner must re-grant these privileges.

To grant the OWNERSHIP privilege on a listing shared with specific consumers or published on the Snowflake Marketplace:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

To grant the OWNERSHIP privilege on a listing in a data exchange:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Shared by your account.
4. Locate the listing that you want to modify and select the row to open the listing details.
5. In the listing details page, select Settings.
6. In the Privileges section, select the pencil icon next to the Modify Listing privilege.
7. Select Add Role and add required roles.
8. Save your changes.

### Provider profile level privileges

Snowflake provides the following privileges for provider profiles. Only the role with the OWNERSHIP privilege on the provider profile
can grant this privilege.

* MODIFY privilege on a provider profile
* OWNERSHIP privilege on a provider profile

> **Note:**
>
> * To create a profile, use the Global CREATE LISTING privilege global privilege.
> * Any role in the provider account can view all profiles. This task does not require granting a privilege.

#### MODIFY privilege on a provider profile

If the MODIFY privilege is granted to a role on a provider profile, any user with the role can view and modify provider profile properties.
Only the role with the OWNERSHIP privilege on the provider profile can grant this privilege.

The MODIFY privilege can be granted through the web interface or using SQL:

> [Snowsight](ui-snowsight-gs.md):
> :   In the navigation menu, select Data sharing » Internal sharing » Manage Exchanges » Select an Exchange » Select a Provider Profile » Manage » Manage Profile Editors.
>
> SQL:
> :   Execute the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) [WITH GRANT OPTION] command.
>
> For example, to grant the privilege to the custom role `myrole`:
>
> ```sqlexample
> GRANT MODIFY ON DATA EXCHANGE PROFILE "<provider_profile_name>" TO ROLE myrole;
> ```

#### OWNERSHIP privilege on a provider profile

If the OWNERSHIP privilege on a provider profile is granted to a role, that role becomes the new owner of the profile. Only the role with
the OWNERSHIP privilege on the provider profile can grant this privilege.

OWNERSHIP is a special type of privilege that can only be granted from one role to another role; it cannot be revoked.
For more details, see [Overview of Access Control](security-access-control-overview.md).

To grant the OWNERSHIP privilege on a provider profile to a role, use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) [WITH GRANT OPTION]
command. You cannot use Snowsight to grant this privilege.

For example to grant the privilege to the custom role `myrole`:

```sqlexample
GRANT OWNERSHIP ON DATA EXCHANGE PROFILE "<provider_profile_name>" TO ROLE myrole;
```

---
title: HITRUST CSF
source: https://docs.snowflake.com/en/user-guide/cert-hitrust.md
section: User Guide
---

# HITRUST CSF

This topic describes how Snowflake supports customers with HITRUST CSF compliance requirements.

## Understanding HITRUST CSF compliance requirements

The Health Information Trust Alliance Common Security Framework (HITRUST CSF) serves to unify security controls based on aspects of US
federal law (such as HIPAA and HITECH), certain state-specific laws and other industry-standard compliance frameworks into a single
comprehensive set of baseline security and privacy controls, built specifically for healthcare needs.

Snowflake participates in the HITRUST Shared Responsibility and Inheritance Program. With the Shared Responsibility Matrix (SRM), customers
can now inherit Snowflake’s HITRUST CSF certification provided that customers apply the controls detailed in the HITRUST Alliance website.
Customers should download the Snowflake Custom HITRUST Shared Responsibility Matrix to determine HITRUST controls that they are responsible
for implementing as part of the shared responsibility model. Customers should refer to the HITRUST webpage for guidance on how to initiate
an inheritance request.

For details, see:

* [HITRUST Alliance website](https://hitrustalliance.net/hitrust-srm-inheritance-program/).
* [Snowflake Custom HITRUST Shared Responsibility Matrix](https://hitrustalliance.net/shared-responsibility-matrices).
* [HITRUST user guide](https://help.mycsf.net/user-guide).

---
title: How conjunctions (AND) and disjunctions (OR) work with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/conjunctions-disjunctions.md
section: User Guide
---

# How conjunctions (AND) and disjunctions (OR) work with search optimization

Search optimization can accelerate queries using conjunctions (AND operator)
and disjunctions (OR operator) of supported predicates.

## Conjunctions of supported predicates (AND)

For queries that use conjunctions of predicates (i.e., AND), query performance can be improved by search optimization if
any of the predicates [would benefit](queries-that-benefit.md).

For example, suppose that a query has:

> `where condition_x and condition_y`

Search optimization can improve performance if either condition separately returns a few rows (i.e., `condition_x`
returns a few rows or `condition_y` returns a few rows).

If `condition_x` returns a few rows but `condition_y` returns many rows, the query performance can still
benefit from search optimization.

### Examples

If predicates are individually supported by the search optimization service, then they can be joined by the conjunction
`AND` and still be supported by the search optimization service:

```sqlexample
SELECT id, c1, c2, c3
  FROM test_table
  WHERE c1 = 1
    AND c3 = TO_DATE('2004-03-09')
  ORDER BY id;
```

DELETE and UPDATE (and MERGE) can also use the search optimization service:

```sqlexample
DELETE FROM test_table WHERE id = 3;
```

```sqlexample
UPDATE test_table SET c1 = 99 WHERE id = 4;
```

## Disjunctions of supported predicates (OR)

For queries that use disjunctions of predicates (i.e., OR), query performance can be improved by search optimization if
all predicates [would benefit](queries-that-benefit.md).

For example, suppose that a query has:

> `where condition_x or condition_y`

Search optimization can improve performance if each condition separately returns a few rows (i.e., `condition_x` returns
a few rows and `condition_y` returns a few rows).

If `condition_x` returns a few rows but `condition_y` returns many rows, the query performance does not
benefit from search optimization.

In the case of disjunctions, each predicate in isolation is not decisive in the query. All predicates must be evaluated
to determine whether search optimization can improve performance.

---
title: How Snowflake validates semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/validation-rules.md
section: User Guide
---

# How Snowflake validates semantic views

Snowflake verifies that a semantic view complies with a set of validation rules when you define it. These rules ensure your
semantic model is well-formed and will function correctly.

These rules are explained in the next sections:

* General validation rules
* Validation rules for relationships
* Expression validation rules

  + General rules about expressions
  + Rules for row-level expressions (dimensions and facts)
  + Rules for aggregate-level expressions (metrics)
  + Rules for window function metrics

## General validation rules

The following rules apply to semantic views in general:

* **Required elements:** A semantic view must define at least one dimension or metric.

  For example, your TPC-H semantic view needs at least one dimension (like `customer_name`) or a metric (like
  `order_average_value`).
* **Primary and foreign keys:** In the primary and foreign key definitions, you must use physical base table columns or
  expressions defined in logical tables that directly refer to a base table column (for example, `t1.fact AS t1.col`).

  For example, in the TPC-H schema, you can use `c_custkey` as the primary key for the `customer` table and `o_custkey` as
  the foreign key in the `orders` table. `c_custkey` and `o_custkey` are columns in the physical base tables.
* **Table alias references:** When referring to tables in relationships or expressions, you must use their defined aliases.

  For example, if you define the table alias `orders AS snowflake_sample_data.tpch.orders_table`, you must use the table
  alias `orders` (not `orders_table`) in the definitions of your metrics.

  If you don’t specify an alias for a logical table, you must use the logical table name in any expressions.

## Validation rules for relationships

The following rules apply to relationships in semantic views:

* **Many-to-one relationships and one-to-one relationships:** Relationships work like foreign key constraints.

  Suppose that the logical table `table_1` identifies `col_1` as a primary key:

  ```sqlexample
  TABLES (
    table_1 AS my_table_1 PRIMARY KEY (col_1)
    ...
  ```

  When you define a relationship as `table_2 (col_2) REFERENCES table_1 (col_1)`, `col_1` must be a primary key, and
  `col_2` must serve as a foreign key:

  + If multiple rows in `table_2` use the same value in `col_2`, you’re creating a many-to-one relationship from `table_2`
    to `table_1`.

    For example, `orders (o_custkey) REFERENCES customers (c_custkey)` creates a many-to-one relationship from `orders`
    to `customers` (many orders can belong to one customer with the key `c_custkey`).
  + If each row in `table_2` has a unique value in `col_2`, you’re creating a one-to-one relationship from `table_2` to
    `table_1`.

    For example, `customer_details_extended (e_custkey) REFERENCES customer_details (c_custkey)` creates a one-to-one
    relationship from `customer_details_extended` to `customer_details` (one row of extended details for a customer belongs
    to one row of customer details with the key `c_custkey`).
* **Validations performed on one-to-one relationships:**

  + Row-level expressions can refer to other row-level expressions
    at the same (or lower) granularity.

    For example, `customer_details` and `customer_details_extended` have a one-to-one relationship, where one row in
    `customer_details` is related to one row in `customer_details_extended`. A row-level expression on each of these tables
    refers to one specific customer. Each can refer directly to the other in row-level expressions because the row-level
    expressions are at the same granularity.

    As a corollary, a row-level expression on `customer_details` cannot reference a metric or aggregation of a
    row-level expression on `customer_details_extended` (and vice versa).
  + Aggregate-level expressions must refer to row-level expressions
    at the same granularity using a single aggregate.

    For example, aggregate-level expressions on `customer_details` or `customer_details_extended` must use a single aggregate
    when referencing the other entity. In addition, metrics on `customer_details` and `customer_details_extended` should
    refer to other metrics on the two entities directly, without any aggregation.

  These rules apply whether the relationship between the entities is defined as
  `customer_details REFERENCES customer_details_extended` or
  `customer_details_extended REFERENCES customer_details`.
* **Transitive relationships:** Snowflake automatically derives indirect relationships.

  For example, if you define a relationship between `line_items` and `orders` and another relationship between `orders` and
  `customer`, Snowflake understands there’s also a relationship between `line_items` and `customer`.

  Note that one-to-one relationships respect transitivity when interacting with other one-to-one and many-to-one relationships:

  + If logical tables `customers` and `customer_details` have a one-to-one relationship and logical tables
    `customer_details` and `customer_details_extended` have a one-to-one relationship, logical tables `customers` and
    `customer_details_extended` are automatically inferred to have a one-to-one relationship and are treated as such during
    validation.
  + If logical tables `customers` and `customer_details` have a one-to-one relationship and logical tables
    `customer_details` and `regions` have a many-to-one relationship, `customers` is inferred to be transitively
    many-to-one to `regions`, which gives `customers` a higher granularity than `regions` during expression validation.
* **No circular relationships:** You cannot define circular relationships, even through transitive paths.

  For example, you cannot define a relationship from `orders` to `customer` and another relationship from `customer` to
  `orders`.
* **No self-references:** Currently, a table cannot reference itself (like an employee manager hierarchy where employees can
  reference other employees as their manager).
* **Multi-path relationship restrictions:** You can define multiple relationships between two tables, but there are limitations.

  For example, if `line_items` is related to `orders` through both `order_key` and another column, those tables cannot
  refer to each other’s semantic expressions.

  > **Note:**
  >
  > If there are multiple paths that can be used to join two tables, you should define these relationships and specify which path
  > to use when defining a metric. For information, see [Specifying the relationship for a metric when multiple relationship paths exist](sql.md).

## Expression validation rules

The following rules apply to semantic expressions in facts, dimensions, and metrics:

* General rules about expressions
* Rules for row-level expressions (dimensions and facts)
* Rules for aggregate-level expressions (metrics)

### General rules about expressions

The following rules apply to semantic expressions in general:

* **Expression types:** Dimensions and facts are row-level expressions (unaggregated), while metrics are aggregate-level
  expressions.

  For example, `customer_name` is a dimension (row-level), while `order_average_value` is a metric (aggregate-level).
* **Table association:** Every semantic expression must be associated with a table.

  For example, `customer_name` must be defined as `customer.customer_name` and `order_average_value` as
  `orders.order_average_value`.
* **Same-table references:** Expressions can refer to base table columns or other expressions on the same logical table using
  either qualified or unqualified names.

  For example, in the `orders` table, you could define `orders.shipping_month` as

  + `MONTH(o_shipdate)` (using the unqualified column name)
  + `MONTH(orders.o_shipdate)` (using the qualified name)
* **Cross-table limitations:** Expressions cannot refer to base table columns from other tables or expressions from unrelated
  logical tables.

  For example, `customer.customer_name` cannot directly reference an expression from the `orders` table unless there’s a
  relationship between them. To work with data across tables, you must:

  1. Define relationships between logical tables (for example, between `customer` and `orders` through `c_custkey`).
  2. Define a fact on the source table (for example, `orders.total_value`).
  3. Refer to these expressions from a connected logical table (for example, `customer.order_value` can refer to
     `orders.total_value`).
* **Name resolution:** If both a semantic expression and a column have the same name, references to that name resolve to the
  semantic expression.

  For example, if you define a `region` dimension and there’s also a `region` column, `region` in expressions resolves to
  the dimension, not the column. An exception is when an expression refers to the same name in its definition (for example,
  `customer.c_name AS customers.c_name`). The reference resolves to the column, rather than to the defining expression itself.
* **Expression reference cycles:** You cannot create circular references between expressions.

  For example, you cannot define `customer.total_value` based on `orders.customer_value` and then define
  `orders.customer_value` based on `customer.total_value`.
* **Table reference cycles:** You cannot create circular references between logical tables in expression definitions.

  For example, you cannot define `customer.total_value` based on `orders.customer_value` and then define
  `orders.customer_count` based on `customer.c_custkey`.
* **Function usage:** You can use scalar functions like [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](../../sql-reference/functions/year.md) in dimensions, but table functions
  are not allowed.

### Rules for row-level expressions (dimensions and facts)

The following rules apply to row-level expressions in dimensions and facts:

* **Same-table references:** A row-level expression can directly refer to columns from its own table.

  For example, `customers.customer_name` can be defined as `customers.c_name` directly.
* **Equal or lower granularity:** A row-level expression can directly refer to other row-level expressions at the same or
  lower granularity.

  For example, `orders.order_details` can refer to `customer.customer_name` because `customer` is at a lower granularity
  than `orders` (one customer can have many orders).
* **Higher granularity references:** When referencing row-level expressions at higher granularity, a row-level expression must
  use aggregation.

  For example, `customer.total_orders` must use `COUNT(orders.o_orderkey)` because `orders` is at a higher granularity
  than `customer` (one customer can have many orders).
* **Aggregate references:** A dimension like `orders.order_type` cannot refer to a metric like `orders.order_average_value`,
  but `customer.customer_segment` can refer to `orders.order_average_value` because `customer` is at a lower granularity
  than orders.

### Rules for aggregate-level expressions (metrics)

The following rules apply to aggregate-level expressions in metrics:

* **Basic aggregation:** A metric that is not a derived metric must use an aggregate function.

  For example, `orders.order_average_value` must use `AVG(orders.o_totalprice)`.
* **Equal or lower granularity:** When referring to row-level expressions at equal or lower granularity, a metric must use a
  single aggregate.

  For example, `orders.total_value` can use `SUM(line_items.discounted_price)` because `line_items` is at lower
  granularity than orders.
* **Higher granularity references:** When referring to row-level expressions at higher granularity, a metric must use nested
  aggregation.

  For example, `customer.average_order_value` must use `AVG(SUM(orders.o_totalprice))` because `orders` is at higher
  granularity than `customer`.
* **Other aggregate references:** A metric can directly refer to other metrics at equal or lower granularity without aggregation.

  For example, `orders.profit_margin` can be defined as `orders.total_revenue / orders.total_cost` without additional
  aggregation. However, when referring to metrics at higher granularity, an aggregation is required.

### Rules for window function metrics

These rules apply to [window function metrics](querying.md):

* Window function metrics cannot be used by row-level calculations (facts and dimensions).
* Window function metrics cannot be used in the definitions of other metrics.

---
title: How tags interact with Snowflake features
source: https://docs.snowflake.com/en/user-guide/object-tagging/interaction.md
section: User Guide
---

# How tags interact with Snowflake features

## Replication

Tags and their assignments can be replicated from a source account to a target account.

Tag assignments cannot be modified in the target account after the initial replication from the source account. For example,
setting a tag on a secondary (i.e. replicated) database is not allowed. To modify tag assignments in the target account, modify
them in the source account and replicate them to the target account.

For [database replication](../database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a tag but one or more of the accounts approved for
  replication are on lower editions.
* An object contained in the primary database has a [dangling reference](../database-replication-considerations.md) to a tag in
  a different database.

To avoid a dangling reference error, replicate the database and account-level objects
using a [replication or failover group](../account-replication-intro.md). Ensure that the replication group includes:

* The database containing the tags in the `ALLOWED_DATABASES` property.
* Other account-level objects that have a tag in the `OBJECT_TYPES` property (e.g. `ROLES`, `WAREHOUSES`).

  For details, refer to [CREATE REPLICATION GROUP](../../sql-reference/sql/create-replication-group.md) and [CREATE FAILOVER GROUP](../../sql-reference/sql/create-failover-group.md).

> **Note:**
>
> When using replication and failover groups or database replication:
>
> * Failover/failback features are only available to Snowflake accounts that are Business Critical Edition (or higher).
>
>   For more information, refer to [Introduction to replication and failover across multiple accounts](../account-replication-intro.md).
> * If you specify the `IGNORE EDITION CHECK` clause for database replication in an
>   [ALTER DATABASE](../../sql-reference/sql/alter-database.md) statement or in a CREATE OR ALTER statement
>   for a replication or failover group, tag replication can occur when the target account is a lower edition than
>   [Business Critical](../intro-editions.md).
>
>   For details, refer to the clause description in these commands.

## Cloning

* Tag associations in the source object (e.g. table) are maintained in the cloned objects.
* For a database or a schema:

  When a database or schema is cloned, tags that reside in that schema or database are also cloned.

  If a table or view exists in the source schema/database and has references to tags in the same schema or database, the cloned table or view is mapped to the corresponding cloned tag (in the target schema/database) instead of the tag in the source schema or database.

## Data sharing

* When the shared view and tag exist in different databases, grant the REFERENCE_USAGE privilege on the database containing the tag to the
  share. For information, see [Share data from multiple databases](../data-sharing-multiple-db.md).
* In the data sharing consumer account:

  + Executing the [SHOW TAGS](../../sql-reference/sql/show-tags.md) command returns the shared tag, provided that the role executing the SHOW TAGS command
    has the USAGE privilege on the schema containing the shared tag.

    If the provider grants the READ privilege on the tag to the share or to a shared database role, the consumer can view the tag
    assignments for the shared tag. For information, see [shared tag references](../data-sharing-provider.md).
  + If a tag from the data sharing provider account is assigned to a shared table, the data sharing consumer cannot call the
    [SYSTEM$GET_TAG](../../sql-reference/functions/system_get_tag.md) function or the [TAG_REFERENCES](../../sql-reference/functions/tag_references.md) Information Schema table
    function to view the tag assignment.

---
title: Hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid.md
section: User Guide
---

# Hybrid tables

A hybrid table is a Snowflake table type that is optimized for low latency and high
throughput using index-based random reads and writes. Hybrid tables provide a row-based
storage engine that supports row locking for high concurrency. Hybrid tables also enforce
unique and referential integrity constraints, which are critical for
transactional workloads. You can use a hybrid table along with other Snowflake
tables and features to power
[Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/)
that bring transactional and analytical data together in a single platform.

Use cases that may benefit from hybrid tables include:

* Metadata for applications and workflows, such as maintaining state for an ingestion workflow that requires
  high-concurrency updates to a single table from thousands of parallel workers.
* Lower-latency serving of precomputed aggregates through an API or a user interface.
* Lightweight transactional applications with relational data models.

> **Tip:**
>
> Before creating and using hybrid tables, you should become familiar with some
> [unsupported features and limitations](tables-hybrid-limitations.md).

## Architecture

Hybrid tables integrate seamlessly into the existing Snowflake architecture.
Customers connect to the same Snowflake database service. Queries are compiled
and optimized in the cloud services layer and executed in the same query engine
and virtual warehouses as standard tables. This architecture has several key benefits:

* Snowflake platform features, such as data governance, work with hybrid tables out of the box.
* You can run hybrid workloads that mix operational and analytical queries.
* You can join hybrid tables with other Snowflake tables; queries executed natively and
  efficiently in the same query engine. No federation is required.
* You can execute an atomic transaction across hybrid tables and other Snowflake tables.
  There is no need to orchestrate your own two-phase commit.

Hybrid tables leverage a row store as the primary data store to provide excellent operational query performance.
When you write to a hybrid table, the data is written directly into the row store. Data is asynchronously copied
into object storage in order to provide better performance and workload isolation for large scans without affecting
your ongoing operational workloads. Some data may also be cached in columnar format on your warehouse in order
to provide better performance for analytical queries. You simply execute SQL statements against the logical hybrid table
and the Snowflake query optimizer decides where to read data from in order to provide the best performance.
You get one consistent view of your data without needing to worry about the underlying infrastructure.

> **Note:**
>
> Because the primary storage for hybrid tables is a row store, hybrid tables typically have a larger storage footprint than standard tables.
> The main reason for the difference is that columnar data for standard tables often achieves higher rates of compression. For details about
> storage costs, see [Evaluate cost for hybrid tables](tables-hybrid-cost.md).

## Features

Hybrid tables provide some additional features that are not supported by other Snowflake table types.

| Feature | Hybrid tables | Standard tables |
| --- | --- | --- |
| Primary data layout | Row-oriented, with secondary columnar storage | Columnar [micro-partitions](tables-clustering-micropartitions.md) |
| Locking | Row-level | Partition or table |
| PRIMARY KEY constraints | Required, enforced | Optional, not enforced |
| FOREIGN KEY constraints | Optional, enforced (referential integrity) | Optional, not enforced |
| UNIQUE constraints | Optional (except for PRIMARY KEY), enforced | Optional, not enforced |
| NOT NULL constraints | Optional (except for PRIMARY KEY), enforced | Optional, enforced |
| Indexes | Supported for performance; updated synchronously on writes | The search optimization service indexes columns for better point-lookup performance; batch updated/maintained asynchronously |

A constraint is *enforced* when it protects a column from being updated in certain ways. For example, a column that is declared NOT NULL
cannot contain a NULL value. An attempt to copy or insert a NULL value into a NOT NULL column always results in an error.

For hybrid tables, you cannot set the NOT ENFORCED property on PRIMARY KEY, FOREIGN KEY, and UNIQUE constraints. Setting this property results in an
“invalid constraint property” error. For more information about rules for constraints, see [Constraints for hybrid tables](../sql-reference/sql/create-hybrid-table.md).

A constraint is *required* when one or more columns in a table must have such a constraint, which is only true for
PRIMARY KEY constraints on hybrid tables.

## When to use a hybrid table

While you should expect Snowflake standard tables to offer better performance
on large analytical queries, hybrid tables allow for faster results on
short-running operational queries. Hybrid tables deliver high concurrency and low latency for many workloads.
The following types of queries are most likely to benefit from hybrid tables:

* Index-based random-point reads that retrieve a small number of records, such as customer objects
* High-concurrency random writes, including inserts, updates, and merges:

Applications commonly work with a mix of hybrid tables and standard tables, with different
data sets stored in each table type. For example, you might have some data that you frequently bulk load, scan,
and aggregate for analytics purposes, and other data that you access one row at a time, filtered on an ID column
at high concurrency. You can blend the use of standard tables and hybrid tables in a single database based on the needs of
your workload.

---
title: Hybrid Tables Dedicated Storage Mode for TSS
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-dedicated-storage-mode.md
section: User Guide
---

# Hybrid Tables Dedicated Storage Mode for TSS

This section explains how to start using [Tri-Secret Secure (TSS)](security-encryption-tss.md) in accounts that contain hybrid tables.

> **Note:**
>
> For information about billing and costs for this feature, consult your Snowflake account team.

## Introduction

In a standard storage configuration for hybrid tables, underlying multi-tenant storage is used for all hybrid table data. This means that different databases and data that belongs to different customers use a shared storage layer. This shared storage configuration does not work if you have enabled or plan to enable TSS because TSS protects your data with encryption keys owned by individual customers. Enabling TSS encryption for hybrid tables requires a storage configuration known as Hybrid Tables Dedicated Storage Mode. You can also use periodic rekeying for additional encryption support, but periodic rekeying does not require this Dedicated Storage Mode.

When an account has both Dedicated Storage Mode and TSS enabled, all of the data that is stored in a hybrid table is protected with your TSS composite master key, which combines a Snowflake-maintained key with a customer-managed key. This protection covers hybrid table data in the underlying operational row store, the copy of the data in object storage, data retained for Time Travel, and metadata. You can use hybrid tables with the same serverless experience as you would with a standard storage configuration, and no additional management or provisioning is required.

## Using Dedicated Storage Mode

You must enable Dedicated Storage Mode if you intend to create hybrid tables in your account and TSS is already enabled or will be enabled. Enabling Dedicated Storage Mode is a one-time action on the account. Before you take this action, you will not be able to create hybrid tables with TSS protection.

Note the following important considerations:

* To ensure that your data is fully TSS-protected, you can’t enter a state where a TSS-enabled account contains hybrid tables that are stored in a standard multi-tenant storage configuration. Only one storage mode can be active at any given time.
* Data that exists in hybrid tables before TSS is enabled can never be encrypted with TSS-compliant keys. TSS protection is guaranteed only for data written to hybrid tables after Dedicated Storage Mode and TSS are both enabled.
* You can’t enable TSS if your account already contains hybrid tables. You have to drop individual hybrid tables or any databases that contain hybrid tables, then request enablement of Dedicated Storage Mode and TSS.

  > **Note:**
  >
  > To ensure that all hybrid table data is fully removed from your account, Snowflake recommends the following steps:
  >
  > 1. Set the [data retention period](data-time-travel.md) to `0` for either individual hybrid tables or any databases that contain hybrid tables.
  > 2. Drop either individual hybrid tables or any databases that contain hybrid tables.
* For information about billing and costs for this feature, consult your Snowflake account team.

### Enabling Dedicated Storage Mode and TSS

To enable Dedicated Storage Mode on an account, follow these steps:

1. Contact your account team and request enablement of Hybrid Tables Dedicated Storage Mode with TSS support on your account.
   Assuming that no hybrid tables exist in your account, the team will enable Dedicated Storage Mode (and enable TSS if it’s not already enabled).
2. Create and use hybrid tables in your account, following the [standard documentation](tables-hybrid.md).
3. Repeat this process for any additional TSS-enabled Snowflake accounts in which you want to use hybrid tables.

### Disabling Dedicated Storage Mode

To ensure that your data is fully TSS-protected, disabling Dedicated Storage Mode in a TSS-enabled account requires the following steps:

1. Set the [data retention period](data-time-travel.md) to `0` for either individual hybrid tables or any databases that contain hybrid tables.
2. Drop either individual hybrid tables or any databases that contain hybrid tables.
   If you need to retain the data, you can copy it to standard tables in your account before dropping tables or databases.
3. Contact your account team and request that Dedicated Storage Mode be disabled on your account.
   The team will disable Dedicated Storage Mode, but if your account still contains hybrid tables, you will be asked to remove them first.

---
title: Identifier-first login
source: https://docs.snowflake.com/en/user-guide/identifier-first-login.md
section: User Guide
---

# Identifier-first login

Identifier-first login allows Snowflake to identify a user *before* presenting authentication options. In this flow, Snowflake prompts the
user for their email address or username only, then displays authentication options based on the identity of the user.

An identifier-first login reduces confusion and login issues by only showing users’ valid authentication options. For example, identifier-first
login can do the following:

* In an environment that uses [multiple identity providers](admin-security-fed-auth-security-integration-multiple.md), it
  can restrict single sign-options to include only those identity providers that are associated with the user.
* It can hide the password option for users without passwords, who instead need to be using an identity provider to authenticate.

For examples of how authentication policies and the identifier-first login can be combined to customize the login experience for users, see
[Combining identifier-first login with authentication policies](authentication-policies.md).

## Enable identifier-first login

A user with the ACCOUNTADMIN role can use the [ENABLE_IDENTIFIER_FIRST_LOGIN](../sql-reference/parameters.md) parameter to enable the identifier-first login
flow for an account. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET ENABLE_IDENTIFIER_FIRST_LOGIN = true;
```

---
title: Identifying queries that can benefit from search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/queries-that-benefit.md
section: User Guide
---

# Identifying queries that can benefit from search optimization

Search optimization can improve the performance of many queries. This topic describes characteristics of the kinds of
queries that search optimization helps the most with, and conversely, the kinds of queries that do not benefit.

## General query characteristics

Search optimization works best to improve the performance of queries with the following characteristics:

* The query involves a column or columns other than the primary cluster key.
* The query typically runs for a few seconds or longer (before applying search optimization). In most cases, search optimization will
  not substantially improve the performance of a query that has a sub-second execution time.
* At least one of the columns accessed by the query filter operation has on the order of 100,000 distinct values or more.

  To determine the number of distinct values, you can use either of the following:

  + Use `APPROX_COUNT_DISTINCT` to get the approximate number of distinct values:

    ```sqlexample
    SELECT APPROX_COUNT_DISTINCT(column1) FROM table1;
    ```
  + Use `COUNT(DISTINCT <col_name>)` to get the actual number of distinct values:

    ```sqlexample
    SELECT COUNT(DISTINCT c1), COUNT(DISTINCT c2) FROM test_table;
    ```

  Because you need only an approximation of the number of distinct values, consider using `APPROX_COUNT_DISTINCT`, which
  is generally faster and cheaper than `COUNT(DISTINCT <col_name>)`.

## Supported data types

The search optimization service currently supports the following data types:

* [Data types for fixed-point numbers](../../sql-reference/data-types-numeric.md) (for example, INTEGER and NUMERIC)
* [String & binary data types](../../sql-reference/data-types-text.md) (for example, VARCHAR and BINARY)
* [Date & time data types](../../sql-reference/data-types-datetime.md) (for example, DATE, TIME, and TIMESTAMP)
* [Semi-structured data types](../../sql-reference/data-types-semistructured.md) (for example, VARIANT, OBJECT, and ARRAY)
* [Structured data types](../../sql-reference/data-types-structured.md) (for example, structured ARRAY, OBJECT, and MAP)
* [GEOGRAPHY data type](../../sql-reference/data-types-geospatial.md)

Queries that involve other values of other data types (for example, FLOAT, DECFLOAT, or GEOMETRY) don’t benefit.

## Supported table types

The search optimization service currently supports the following types of tables:

* Standard Snowflake tables
* [Interactive tables](../interactive.md)
* Iceberg tables
* [Dynamic tables](../dynamic-tables-about.md)
* [Transient tables](../tables-temp-transient.md)

The search optimization service currently *doesn’t* support the following types of tables:

* [External tables](../tables-external-intro.md)
* [Hybrid tables](../tables-hybrid.md)
* [Temporary tables](../tables-temp-transient.md)

## Supported predicate types

Search optimization can improve the performance of queries using these kinds of predicates:

* [Point lookup queries using equality and IN](point-lookup-queries.md).
* [Join queries](join-queries.md).
* [Queries using scalar subqueries](scalar-subqueries.md).
* [Queries using scalar functions](scalar-functions.md).
* [Character data (text) queries using the SEARCH and SEARCH_IP functions](text-queries.md).
* [Substring queries using wildcards and regular expressions](substring-queries.md).
* [Searches in semi-structured data](semi-structured-queries.md).
* [Searches in structured data](structured-queries.md).
* [Geospatial queries](geospatial-queries.md).
* [Queries using conjunctions (AND) and disjunctions (OR)](conjunctions-disjunctions.md).

## Support for collation

Search optimization can improve the performance of queries on columns defined with a [COLLATE clause](../../sql-reference/collation.md),
depending on the search method:

* When search optimization is [enabled](enabling.md) on a column using the
  `EQUALITY` search method, any collation specification is supported.
* When search optimization is enabled on a column using the `FULL_TEXT` or `SUBSTRING` search method,
  the `'utf8'` or `'bin'` collation specifications are supported.

For more information about search methods, see [ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md).

Search optimization doesn’t support predicates that change the collation specification of a column using the
[COLLATE](../../sql-reference/functions/collate.md) function.

For example, create a table with columns that have collation specifications and insert a row:

```sqlexample
CREATE OR REPLACE TABLE search_optimization_collation_demo (
  en_ci_col VARCHAR COLLATE 'en-ci',
  utf_8_col VARCHAR COLLATE 'utf8');

INSERT INTO search_optimization_collation_demo VALUES (
  'test_collation_1',
  'test_collation_2');
```

Enable search optimization for equality predicates on both columns in the table:

```sqlexample
ALTER TABLE search_optimization_collation_demo
  ADD SEARCH OPTIMIZATION ON EQUALITY(en_ci_col, utf_8_col);
```

The following query can benefit from search optimization:

```sqlexample
SELECT *
  FROM search_optimization_collation_demo
  WHERE utf_8_col = 'test_collation_2';
```

The following query can’t benefit from search optimization because it changes the collation specification of the
`utf_8_col` column using the COLLATE function:

```sqlexample
SELECT *
  FROM search_optimization_collation_demo
  WHERE utf_8_col COLLATE 'de-ci' = 'test_collation_2';
```

The following query also can’t benefit from search optimization. Based on the
[collation rules of precedence](../../sql-reference/collation.md),
the query applies the `'de-ci'` collation specification to the `utf_8_col` column using the COLLATE
function.

```sqlexample
SELECT *
  FROM search_optimization_collation_demo
  WHERE utf_8_col = 'test_collation_2' COLLATE 'de-ci';
```

## Support for Apache Iceberg™ tables

Search optimization can improve the performance of queries on Apache Iceberg™ tables. For information
about configuring search optimization for Iceberg tables, see [ALTER ICEBERG TABLE](../../sql-reference/sql/alter-iceberg-table.md).

The following limitations apply to search optimization support for Iceberg tables:

* Search optimization can’t be added for columns with data types that Iceberg tables don’t support, which include
  [semi-structured](../../sql-reference/data-types-semistructured.md) and [geospatial](../../sql-reference/data-types-geospatial.md)
  data types. For more information, see [Data types for Apache Iceberg™ tables](../tables-iceberg-data-types.md).
* If Apache Parquet™ files are too large (for example, hundreds of megabytes compressed), then queries might not fully benefit from
  the search optimization service in some scenarios.

Other limitations that apply to search optimization for Snowflake tables also apply to Iceberg tables. For more information, see
Queries that do not benefit from search optimization.

## Potential improvements for views

The search optimization service can indirectly improve the performance of views (including secure views). If a base table for a
view has search optimization enabled, and the query uses a selective predicate for that table, the search optimization service
can improve performance when filtering rows. See Supported predicate types.

Not all tables in the view need to have search optimization enabled. Search optimization is performed on each table
independently.

## Queries that do not benefit from search optimization

Currently, the search optimization service doesn’t support floating point data types, GEOMETRY, or other data types not already discussed.
Snowflake might add support for more data types in the future.

Additionally, the search optimization service doesn’t support the following:

* Some table types.

  For more information, see Supported table types.
* Materialized views.
* Column concatenation.
* Analytical expressions.
* Casts on table columns (except for fixed-point numbers cast to strings).

  Although search optimization supports predicates with implicit and explicit casts on constant values, it doesn’t support
  predicates that cast values in the actual table column (except for casts from INTEGER and NUMBER to VARCHAR).

  For example, the following predicates are supported because they use implicit and explicit casts on constant values (not values
  in the table column):

  ```sqlexample
  -- Supported predicate
  -- (where the string '2020-01-01' is implicitly cast to a date)
  WHERE timestamp1 = '2020-01-01';

  -- Supported predicate
  -- (where the string '2020-01-01' is explicitly cast to a date)
  WHERE timestamp1 = '2020-01-01'::date;
  ```

  The following predicate is not supported because it uses a cast on values in the table column:

  ```sqlexample
  -- Unsupported predicate
  -- (where values in a VARCHAR column are cast to DATE)
  WHERE to_date(varchar_column) = '2020-01-01';
  ```

  The search optimization service considers the original column values, not the values after the cast. As a result,
  the search optimization service is not used for queries with these predicates.

As noted, the exception to this rule is casting NUMBER or INTEGER values to VARCHAR values in the table column. The
search optimization service does support this type of predicate:

> ```sqlexample
> -- Supported predicate
> -- (where values in a numeric column are cast to a string)
> WHERE cast(numeric_column as varchar) = '2'
> ```

Search optimization doesn’t improve performance of queries that use [Time Travel](../data-time-travel.md)
because search optimization works only on active data.

---
title: Identifying Sequences of Rows That Match a Pattern
source: https://docs.snowflake.com/en/user-guide/match-recognize-introduction.md
section: User Guide
---

# Identifying Sequences of Rows That Match a Pattern

## Introduction

In some cases, you might need to identify sequences of table rows that match a pattern. For example, you might need to:

* Determine which users followed a specific sequence of pages and actions on your website before opening a support ticket or
  making a purchase.
* Find the stocks with prices that followed a V-shaped or W-shaped recovery over a period of time.
* Look for patterns in sensor data that might indicate an upcoming system failure.

To identify sequences of rows that match a specific pattern, use the `MATCH_RECOGNIZE` subclause of the
[FROM](../sql-reference/constructs/from.md) clause.

> **Note:**
>
> You cannot use the MATCH_RECOGNIZE clause in a **recursive** [common table expression (CTE)](queries-cte.md).

## A Simple Example That Identifies a Sequence of Rows

As an example, suppose that a table contains data about stock prices. Each row contains the closing price of each stock symbol on
a specific day. The table contains the following columns:

| Column Name | Description |
| --- | --- |
| `price_date` | The date of the closing price. |
| `price` | The closing stock price on that date. |

Suppose that you want to detect a pattern in which the stock price decreases and then increases, producing a “V” shape in the
graph of the stock price.

(This example does not account for cases in which the stock price does not change from day to day.)

In this example, for a given stock symbol, you want to find sequences of rows where the value in the `price` column decreases
before increasing.

For each sequence of rows that matches this pattern, you want to return:

* A number that identifies the sequence (the first matching sequence, the second matching sequence, etc.).
* The day before the stock price decreased.
* The last day when the stock price increased.
* The number of days in the “V” pattern.
* The number of days when the stock price decreased.
* The number of days when the stock price increased.

```none
+---------+--------------+------------+------------+------------------+---------------+---------------+
| COMPANY | MATCH_NUMBER | START_DATE | END_DATE   | ROWS_IN_SEQUENCE | NUM_DECREASES | NUM_INCREASES |
|---------+--------------+------------+------------+------------------+---------------+---------------|
| ABCD    |            1 | 2020-10-01 | 2020-10-04 |                4 |             1 |             2 |
| ABCD    |            2 | 2020-10-04 | 2020-10-08 |                5 |             1 |             3 |
+---------+--------------+------------+------------+------------------+---------------+---------------+
```

The following figure illustrates the price decreases (`NUM_DECREASES`) and increases (`NUM_INCREASES`) within the “V” pattern
that the returned data captures. Note that `ROWS_IN_SEQUENCE` includes an initial row that is not counted in `NUM_DECREASES`
or `NUM_INCREASES`.

To produce this output, you can use the `MATCH_RECOGNIZE` clause shown below.

> ```sqlexample
> SELECT * FROM stock_price_history
>   MATCH_RECOGNIZE(
>     PARTITION BY company
>     ORDER BY price_date
>     MEASURES
>       MATCH_NUMBER() AS match_number,
>       FIRST(price_date) AS start_date,
>       LAST(price_date) AS end_date,
>       COUNT(*) AS rows_in_sequence,
>       COUNT(row_with_price_decrease.*) AS num_decreases,
>       COUNT(row_with_price_increase.*) AS num_increases
>     ONE ROW PER MATCH
>     AFTER MATCH SKIP TO LAST row_with_price_increase
>     PATTERN(row_before_decrease row_with_price_decrease+ row_with_price_increase+)
>     DEFINE
>       row_with_price_decrease AS price < LAG(price),
>       row_with_price_increase AS price > LAG(price)
>   )
> ORDER BY company, match_number;
> ```

As shown above, the `MATCH_RECOGNIZE` clause consists of many subclauses, each of which serves a different purpose (e.g.
specifying the pattern to match, specifying the data to return, etc.).

The next sections explain each of the subclauses in this example.

### Setting Up the Data For This Example

To set up the data used in this example, run the following SQL statements:

> ```sqlexample
> create table stock_price_history (company TEXT, price_date DATE, price INT);
> ```
>
> ```sqlexample
> insert into stock_price_history values
>     ('ABCD', '2020-10-01', 50),
>     ('XYZ' , '2020-10-01', 89),
>     ('ABCD', '2020-10-02', 36),
>     ('XYZ' , '2020-10-02', 24),
>     ('ABCD', '2020-10-03', 39),
>     ('XYZ' , '2020-10-03', 37),
>     ('ABCD', '2020-10-04', 42),
>     ('XYZ' , '2020-10-04', 63),
>     ('ABCD', '2020-10-05', 30),
>     ('XYZ' , '2020-10-05', 65),
>     ('ABCD', '2020-10-06', 47),
>     ('XYZ' , '2020-10-06', 56),
>     ('ABCD', '2020-10-07', 71),
>     ('XYZ' , '2020-10-07', 50),
>     ('ABCD', '2020-10-08', 80),
>     ('XYZ' , '2020-10-08', 54),
>     ('ABCD', '2020-10-09', 75),
>     ('XYZ' , '2020-10-09', 30),
>     ('ABCD', '2020-10-10', 63),
>     ('XYZ' , '2020-10-10', 32);
> ```

### Step 1: Specifying the Order and Grouping of Rows

The first step in identifying a sequence of rows is defining the grouping and sort order of the rows that you want to search. For
the example of finding a “V” pattern in the stock price for a company:

* The rows should be grouped by company, since you want to find a pattern in the price for a given company.
* Within each group of rows (the prices for a given company), the rows should be sorted by date in ascending order.

In a `MATCH_RECOGNIZE` clause, you use the `PARTITION BY` and `ORDER BY` subclauses to specify the grouping and
order of rows. For example:

> ```sqlexample
> MATCH_RECOGNIZE(
>   PARTITION BY company
>   ORDER BY price_date
>   ...
> )
> ```

### Step 2: Defining the Pattern to Match

Next, determine the pattern that matches the sequence of rows that you want to find.

To specify this pattern, you use something similar to a [regular expression](https://en.wikipedia.org/wiki/Regular_expression).
In regular expressions, you use a combination of literals and metacharacters to specify a pattern to match in a string.

For example, to find a sequence of characters that includes:

* any single character, followed by
* one or more uppercase letters, followed by
* one or more lowercase letters

you can use the following Perl-compatible regular expression:

```none
.[A-Z]+[a-z]+
```

where:

* `.` matches any single character.
* `[A-Z]+` matches one or more uppercase letters.
* `[a-z]+` matches one or more lowercase letters.

`+` is a [quantifier](https://en.wikipedia.org/wiki/Regular_expression#Basic_concepts) that specifies that one or more of the
preceding characters need to match.

For example, the regular expression above matches sequences of characters like:

* `1Stock`
* `@SFComputing`
* `%Fn`

In a `MATCH_RECOGNIZE` clause, you use a similar expression to specify the pattern of rows to match. In this case, finding
rows that match a “V” pattern involves finding a sequence of rows that includes:

* the row before the stock price decreases, followed by
* one or more rows where the stock price decreases, followed by
* one or more rows where the stock price increases

You can express this as the following row pattern:

```none
row_before_decrease row_with_price_decrease+ row_with_price_increase+
```

Row patterns consist of *pattern variables*, [quantifiers](../sql-reference/constructs/match_recognize.md) (which are similar to those
used in regular expressions), and [operators](../sql-reference/constructs/match_recognize.md). A pattern variable defines an expression
that is evaluated against a row.

In this row pattern:

* `row_before_decrease`, `row_with_price_decrease`, and `row_with_price_increase` are pattern variables. The expressions for
  these pattern variables should evaluate to:

  + any row (the row before the stock price decreases)
  + a row where the stock price decreases
  + a row where the stock price increases

  `row_before_decrease` is similar to `.` in a regular expression. In the following regular expression, `.` matches any
  single character that appears before the first uppercase letter in the pattern.

  ```none
  .[A-Z]+[a-z]+
  ```

  Similarly, in the row pattern, `row_before_decrease` matches any single row that appears before the first row with a price
  decrease.
* The `+` quantifiers after `row_with_price_decrease` and `row_with_price_increase` specify that one or more rows of each
  of these must match.

In a `MATCH_RECOGNIZE` clause, you use the `PATTERN` subclause to specify the row pattern to match:

```sqlexample
MATCH_RECOGNIZE(
  ...
  PATTERN(row_before_decrease row_with_price_decrease+ row_with_price_increase+)
  ...
)
```

To specify the expressions for the pattern variables, you use the `DEFINE` subclause:

> ```sqlexample
> MATCH_RECOGNIZE(
>   ...
>   DEFINE
>     row_with_price_decrease AS price < LAG(price)
>     row_with_price_increase AS price > LAG(price)
>   ...
> )
> ```

where:

* `row_before_decrease` does not need to be defined here because it should evaluate to any row.
* `row_with_price_decrease` is defined as an expression for a row with a price decrease.
* `row_with_price_increase` is defined as an expression for a row with a price increase.

To compare the prices in different rows, the definitions of these variables use the
[navigational function](../sql-reference/constructs/match_recognize.md) `LAG()` to specify price for the previous row.

The row pattern matches two sequences of rows, as illustrated below:

For the first matching sequence of rows:

* `row_before_decrease` matches the row with the stock price `50`.
* `row_with_price_decrease` matches the next row with the stock price `36`.
* `row_with_price_increase` matches the next two rows with the stock prices `39` and `42`.

For the second matching sequence of rows:

* `row_before_decrease` matches the row with the stock price `42`. (This is the same row that is at the end of the first
  matching sequence of rows.)
* `row_with_price_decrease` matches the next row with the stock price `30`.
* `row_with_price_increase` matches the next two rows with the stock prices `47`, `71`, and `80`.

### Step 3: Specifying the Rows to Return

`MATCH_RECOGNIZE` can either return:

* a single row that summarizes each matching sequence, or
* each row in each matching sequence

For this example, you want to return a summary of each matching sequence. Use the `ONE ROW PER MATCH` subclause to specify
that one row should be returned for each matching sequence.

```sqlexample
MATCH_RECOGNIZE(
  ...
  ONE ROW PER MATCH
  ...
)
```

### Step 4: Specifying the Measures to Select

When you use `ONE ROW PER MATCH`, `MATCH_RECOGNIZE` does not return any of the columns in the table (except for the
column specified by `PARTITION BY`), even when `MATCH_RECOGNIZE` is in a `SELECT *` statement. To specify the
data to be returned by this statement, you must define *measures*. Measures are additional columns of data that are calculated for
each matching sequence of rows (e.g. the starting date of the sequence, the ending date of the sequence, the number of days in the
sequence, etc.).

Use the `MEASURES` subclause to specify these additional columns to return in the output. The general format for defining a
measure is:

```sqlexample
<expression> AS <column_name>
```

where:

* `expression` specifies the information about the sequence that you want to return. For the expression, you can use
  functions with columns from the table and pattern variables that you defined earlier.
* `column_name` specifies the name of the column that will be returned in the output.

For this example, you can define the following measures:

* A number that identifies the sequence (the first matching sequence, the second matching sequence, etc.).

  For this measure, use the `MATCH_NUMBER()` function, which returns the number of the match. The numbers start with `1`
  for the first match for a partition of rows. If there are
  multiple partitions, the number starts with `1` for each partition.
* The day before the stock price decreased.

  For this measure, use the `FIRST()` function, which returns the value of the expression for the first row in the matching
  sequence. In this example, `FIRST(price_date)` returns the value of the `price_date` column in the first row in each
  matching sequence, which is the date before the stock price decreased.
* The last day when the stock price increased.

  For this measure, use the `LAST()` function, which returns the value of the expression for the last row in the matching
  sequence.
* The number of days in the “V” pattern.

  For this measure, use `COUNT(*)`. Because you are specifying `COUNT(*)` in the definition of a measure, the asterisk
  (`*`) specifies that you want to count all of the rows in a matching sequence (not all of the rows in the table).
* The number of days when the stock decreased.

  For this measure, use `COUNT(row_with_price_decrease.*)`. The period followed by an asterisk (`.*`) specifies that you
  want to count all of the rows in a matching sequence that match the pattern variable `row_with_price_decrease`.
* The number of days when the stock increased.

  For this measure, use `COUNT(row_with_price_increase.*)`.

The following is the `MEASURES` subclause that defines the measures above:

```sqlexample
MATCH_RECOGNIZE(
  ...
  MEASURES
    MATCH_NUMBER() AS match_number,
    FIRST(price_date) AS start_date,
    LAST(price_date) AS end_date,
    COUNT(*) AS num_matching_rows,
    COUNT(row_with_price_decrease.*) AS num_decreases,
    COUNT(row_with_price_increase.*) AS num_increases
  ...
)
```

The following shows an example of the output with the selected measures:

```none
+---------+--------------+------------+------------+-------------------+---------------+---------------+
| COMPANY | MATCH_NUMBER | START_DATE | END_DATE   | NUM_MATCHING_ROWS | NUM_DECREASES | NUM_INCREASES |
|---------+--------------+------------+------------+-------------------+---------------+---------------|
| ABCD    |            1 | 2020-10-01 | 2020-10-04 |                 4 |             1 |             2 |
| ABCD    |            2 | 2020-10-04 | 2020-10-08 |                 5 |             1 |             3 |
+---------+--------------+------------+------------+-------------------+---------------+---------------+
```

As mentioned earlier, the output includes the `company` column because the `PARTITION BY` clause specifies that column.

### Step 5: Specifying Where to Continue Finding the Next Match

After finding a matching sequence of rows, `MATCH_RECOGNIZE` continues to find the next matching sequence. You can specify
where `MATCH_RECOGNIZE` should start searching for the next matching sequence.

As shown in the illustration of matching sequences, a row can be part of
more than one matching sequence. In this example, the row for `2020-10-04` is part of two “V” patterns.

For this example, to find the next matching sequence, you can start from a row where the price increased. To specify this in the
`MATCH_RECOGNIZE` clause, use `AFTER MATCH SKIP`:

```sqlexample
MATCH_RECOGNIZE(
  ...
  AFTER MATCH SKIP TO LAST row_with_price_increase
  ...
)
```

where `TO LAST row_with_price_increase` specifies that you want to start searching at
the last row where the price increased.

## Partitioning and Sorting the Rows

The first step in identifying patterns across rows is putting the rows in an order that allows you to find your patterns. For
example, if you want to find a pattern of changes in stock prices over time for each company’s stock:

* Partition the rows by company, so that you can search across each company’s stock prices.
* Sort the rows within each partition by date, so that you can find changes to a company’s stock price over time.

To partition the data and specify the order of rows, use the [PARTITION BY](../sql-reference/constructs/match_recognize.md) and
[ORDER BY](../sql-reference/constructs/match_recognize.md) subclauses in `MATCH_RECOGNIZE`. For example:

```sqlexample
SELECT ...
    FROM stock_price_history
        MATCH_RECOGNIZE (
            PARTITION BY company
            ORDER BY price_date
            ...
        );
```

(The `PARTITION BY` clause for `MATCH_RECOGNIZE` works the same way as the `PARTITION BY` clause for
[window functions](../sql-reference/functions-window-syntax.md).)

An additional benefit of partitioning is that it can take advantage of parallel processing.

## Defining the Pattern of Rows to Match

With `MATCH_RECOGNIZE`, you can find a sequence of rows that match a pattern. You specify this pattern in terms of rows that
match specific conditions.

In the example of the table of daily stock prices for different companies, suppose that you want to find a sequence of three rows
in which:

* On a given day, the stock price for a company is less than 45.00.
* On the next day, the stock price decreases by at least 10%.
* On the following day, the stock price increases by at least 3%.

To find this sequence, you specify a pattern that matches three rows with the following conditions:

* In the first row in the sequence, the value of the `price` column must be less than 45.00.
* In the second row, the value of the `price` column must be less than or equal to 90% of the value of the previous row.
* In the third row, the value of the `price` column must be greater than or equal to 105% of the value of the previous row.

The second and third rows have conditions that require a comparison between column values in different rows. To compare the value
in one row against the value in the previous or next row, use the functions `LAG()` or `LEAD()`:

* `LAG(column)` returns the value of `column` in the previous row.
* `LEAD(column)` returns the value of `column` in the next row.

For this example, you can specify the conditions for the three rows as:

* The first row in the sequence must have `price < 45.00`.
* The second row must have `LAG(price) * 0.90 >= price`.
* The third row must have `LAG(price) * 1.05 <= price`.

When specifying the pattern for the sequence of these three rows, you use a pattern variable for each row that has a different
condition. Use the `DEFINE` subclause to define each pattern variable as a row that must meet a specified condition. The
following example defines three pattern variables for the three rows:

```sqlexample
define
    low_priced_stock as price < 45.00,
    decreased_10_percent as lag(price) * 0.90 >= price,
    increased_05_percent as lag(price) * 1.05 <= price
```

To define the pattern itself, use the `PATTERN` subclause. In this subclause, use a regular expression to specify the
pattern to match. For the building blocks of the expression, use the pattern variables that you defined. For example, the
following pattern finds the sequence of three rows:

```sqlexample
pattern ( low_priced_stock  decreased_10_percent  increased_05_percent )
```

The SQL statement below uses the `DEFINE` and `PATTERN` subclauses shown above:

> ```sqlexample
> SELECT company, price_date, price
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            ALL ROWS PER MATCH
>            PATTERN (LESS_THAN_45 DECREASED_10_PERCENT INCREASED_05_PERCENT)
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                DECREASED_10_PERCENT AS LAG(price) * 0.90 >= price,
>                INCREASED_05_PERCENT AS LAG(price) * 1.05 <= price
>            )
>     ORDER BY company, price_date;
> +---------+------------+-------+
> | COMPANY | PRICE_DATE | PRICE |
> |---------+------------+-------|
> | ABCD    | 2020-10-04 |    42 |
> | ABCD    | 2020-10-05 |    30 |
> | ABCD    | 2020-10-06 |    47 |
> +---------+------------+-------+
> ```

The next sections explain how to define patterns that match specific numbers of rows and rows that appear at the beginning or end
of a partition.

* Using Quantifiers With Pattern Variables
* Matching Patterns Relative to the Beginning or End of a Partition

> **Note:**
>
> MATCH_RECOGNIZE uses [backtracking](https://en.wikipedia.org/wiki/Backtracking) to match patterns. As is the case with other
> [regular expression engines that use backtracking](https://en.wikipedia.org/wiki/Regular_expression#Implementations_and_running_times),
> some combinations of patterns and data to match can take a long time to execute, which can result in high computation costs.
>
> To improve performance, define a pattern that is as specific as possible:
>
> * Make sure that each row matches only one symbol or a small number of symbols
> * Avoid using symbols that match every row (e.g. symbols not in the `DEFINE` clause or symbols that are defined as true)
> * Define an upper limit for quantifiers (e.g. `{,10}` instead of `*`).
>
> For example, the following pattern can result in increased costs if no rows match:
>
> ```sqlexample
> symbol1+ any_symbol* symbol2
> ```
>
> If there is an upper limit to the number of rows that you want to match, you can specify that limit in the quantifiers to
> improve performance. In addition, rather than specifying that you want to find `any_symbol` that follows `symbol1`, you can
> look for a row that is not `symbol1` (`not_symbol1`, in this example);
>
> ```sqlexample
> symbol1{1,limit} not_symbol1{,limit} symbol2
> ```
>
> In general, you should monitor the query execution time to verify that the query is not taking longer than expected.

### Using Quantifiers With Pattern Variables

In the `PATTERN` subclause, you use a regular expression to specify a pattern of rows to match. You use pattern variables
to identify rows in the sequence that meet specific conditions.

If you need to match multiple rows that meet a specific condition, you can use a
[quantifier](../sql-reference/constructs/match_recognize.md), as you would in a
[regular expression](https://en.wikipedia.org/wiki/Regular_expression#Basic_concepts).

For example, you can use the quantifier `+` to specify that pattern must include one or more rows in which the stock price
decreases by 10%, followed by one or more rows in which the stock price increases by 5%:

```sqlexample
pattern (decreased_10_percent+ increased_05_percent+)
define
    decreased_10_percent as lag(price) * 0.90 >= price,
    increased_05_percent as lag(price) * 1.05 <= price
```

### Matching Patterns Relative to the Beginning or End of a Partition

To find a sequence of rows relative to the beginning or end of a partition, you can use the metacharacters `^` and `$` in the
`PATTERN` subclause. These metacharacters in a row pattern have a similar purpose as
[the same metacharacters have in a regular expression](https://en.wikipedia.org/wiki/Regular_expression#POSIX_basic_and_extended):

* `^` represents the beginning of a partition.
* `$` represents the end of a partition.

The following pattern matches a stock with a price greater than 75.00 at the beginning of the partition:

```sqlexample
PATTERN (^ GT75)
DEFINE
    GT75 AS price > 75.00
```

Note that `^` and `$` specify positions and do not represent the rows at those positions (much like `^` and `$` in a
regular expression specify the position and not the characters at those positions). In `PATTERN (^ GT75)`, the first row
(not the second row) must have a price greater than 75.00. In `PATTERN (GT75 $)`, the last row (not the second-to-last row)
must be greater than 75.

Here is a complete example with `^`. Note that although the XYZ stock has a price higher than 60.00 in more than one
row in this partition, only the row at the start of the partition is considered a match.

> ```sqlexample
> SELECT *
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            MEASURES
>                MATCH_NUMBER() AS "Match #",
>                MATCH_SEQUENCE_NUMBER() AS "Match Sequence #"
>            ALL ROWS PER MATCH
>            PATTERN (^ GT60)
>            DEFINE
>                GT60 AS price > 60.00
>            )
>     ORDER BY "Match #", "Match Sequence #";
> +---------+------------+-------+---------+------------------+
> | COMPANY | PRICE_DATE | PRICE | Match # | Match Sequence # |
> |---------+------------+-------+---------+------------------|
> | XYZ     | 2020-10-01 |    89 |       1 |                1 |
> +---------+------------+-------+---------+------------------+
> ```

Here is a complete example with `$`. Note that although the ABCD stock has a price higher than 50.00 in more than
one row in this partition, only the row at the end of the partition is considered a match.

> ```sqlexample
> SELECT *
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            MEASURES
>                MATCH_NUMBER() AS "Match #",
>                MATCH_SEQUENCE_NUMBER() AS "Match Sequence #"
>            ALL ROWS PER MATCH
>            PATTERN (GT50 $)
>            DEFINE
>                GT50 AS price > 50.00
>            )
>     ORDER BY "Match #", "Match Sequence #";
> +---------+------------+-------+---------+------------------+
> | COMPANY | PRICE_DATE | PRICE | Match # | Match Sequence # |
> |---------+------------+-------+---------+------------------|
> | ABCD    | 2020-10-10 |    63 |       1 |                1 |
> +---------+------------+-------+---------+------------------+
> ```

## Specifying Output Rows

Statements that use `MATCH_RECOGNIZE` can choose which rows to output.

### Generating One Row for Each Match vs Generating All Rows for Each Match

When `MATCH_RECOGNIZE` finds a match, the output can be either one summary row for the entire match, or one row
for each data point in the pattern.

* `ALL ROWS PER MATCH` specifies that the output include all rows in the match.
* `ONE ROW PER MATCH` specifies that the output include only one row for each match in each partition.

  The projection clause of the SELECT statement can use only the output of the `MATCH_RECOGNIZE`.
  Effectively, this means that the SELECT statement can only use columns from the following subclauses of
  `MATCH_RECOGNIZE`:

  + The `PARTITION BY` subclause.

    All rows in a match are from the same partition, and therefore have the same value for the `PARTITION BY`
    subclause expressions.
  + The `MEASURES` clause.

    When you use `MATCH_RECOGNIZE ... ONE ROW PER MATCH`, the `MEASURES` subclause generates not only
    expressions that return the same value for all rows in the match (e.g. `MATCH_NUMBER()`), but also expressions
    that can return different values for different rows in the match (e.g. `MATCH_SEQUENCE_NUMBER()`). If you use
    expressions that can return different values for different rows in the match, the output is not deterministic.

  If you are familiar with aggregate functions and `GROUP BY`, the following analogy might be helpful in
  understanding `ONE ROW PER MATCH`:

  + The `PARTITION BY` clause in `MATCH_RECOGNIZE` groups data similarly to the way that `GROUP BY`
    groups data in a `SELECT`.
  + The `MEASURES` clause in a `MATCH_RECOGNIZE ... ONE ROW PER MATCH` allows aggregate functions, such as
    `COUNT()`, that return the same value for each row in the match, as `MATCH_NUMBER()` does.

  If you use only aggregate functions and expressions that return the same value for each row in the match,
  then `... ONE ROW PER MATCH` behaves similarly to `GROUP BY` and aggregate functions.

The default is `ONE ROW PER MATCH`.

The following examples show the difference in outputs between `ONE ROW PER MATCH` and `ALL ROWS PER MATCH`.
These two code examples are almost identical except for the `...ROW(S) PER MATCH` clause. (In typical usage, a SQL
statement with `ONE ROW PER MATCH` has different `MEASURES` subclauses than a SQL statement with
`ALL ROWS PER MATCH`.)

```sqlexample
SELECT *
    FROM stock_price_history
       MATCH_RECOGNIZE (
           PARTITION BY company
           ORDER BY price_date
           MEASURES
               MATCH_NUMBER() AS "Match #",
               MATCH_SEQUENCE_NUMBER() AS "Match Sequence #",
               COUNT(*) AS "Num Rows In Match"
           ALL ROWS PER MATCH
           PATTERN (LESS_THAN_45 UP UP)
           DEFINE
               LESS_THAN_45 AS price < 45.00,
               UP AS price > LAG(price)
           )
    WHERE company = 'ABCD'
    ORDER BY "Match #", "Match Sequence #";
+---------+------------+-------+---------+------------------+-------------------+
| COMPANY | PRICE_DATE | PRICE | Match # | Match Sequence # | Num Rows In Match |
|---------+------------+-------+---------+------------------+-------------------|
| ABCD    | 2020-10-02 |    36 |       1 |                1 |                 1 |
| ABCD    | 2020-10-03 |    39 |       1 |                2 |                 2 |
| ABCD    | 2020-10-04 |    42 |       1 |                3 |                 3 |
| ABCD    | 2020-10-05 |    30 |       2 |                1 |                 1 |
| ABCD    | 2020-10-06 |    47 |       2 |                2 |                 2 |
| ABCD    | 2020-10-07 |    71 |       2 |                3 |                 3 |
+---------+------------+-------+---------+------------------+-------------------+

-- As you can see, the MATCH_SEQUENCE_NUMBER isn't useful when using
-- "ONE ROW PER MATCH". But the COUNT(*), which wasn't very useful in
-- "ALL ROWS PER MATCH", is useful here.
SELECT *
    FROM stock_price_history
       MATCH_RECOGNIZE (
           PARTITION BY company
           ORDER BY price_date
           MEASURES
               MATCH_NUMBER() AS "Match #",
               MATCH_SEQUENCE_NUMBER() AS "Match Sequence #",
               COUNT(*) AS "Num Rows In Match"
           ONE ROW PER MATCH
           PATTERN (LESS_THAN_45 UP UP)
           DEFINE
               LESS_THAN_45 AS price < 45.00,
               UP AS price > LAG(price)
           )
    WHERE company = 'ABCD'
    ORDER BY "Match #", "Match Sequence #";
+---------+---------+------------------+-------------------+
| COMPANY | Match # | Match Sequence # | Num Rows In Match |
|---------+---------+------------------+-------------------|
| ABCD    |       1 |                3 |                 3 |
| ABCD    |       2 |                3 |                 3 |
+---------+---------+------------------+-------------------+
```

### Excluding Rows from the Output

For some queries, you might want to include only part of the pattern in the output. For example, you might want to
find patterns in which stocks rose many days in a row, but display only the peaks and some summary information
(for example, the number of days of price increases before each peak).

You can use [exclusion syntax](../sql-reference/constructs/match_recognize.md) in the pattern to tell `MATCH_RECOGNIZE` to
search for a particular pattern variable but not include it in the output. To include a pattern variable as part of
the pattern to search for, but not as part of the output, use the `{- <pattern_variable> -}` notation.

Here is a simple example that shows the difference between using exclusion syntax and not using it. This example
contains two queries, each of which searches for a stock price that started at less than $45,
then decreased, and then increased. The first query does not use exclusion syntax, and therefore shows all
the rows. The second query uses the exclusion syntax and does not show the day that the stock price fell.

> ```sqlexample
> SELECT company, price_date, price
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            ALL ROWS PER MATCH
>            PATTERN (LESS_THAN_45 DECREASED_10_PERCENT INCREASED_05_PERCENT)
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                DECREASED_10_PERCENT AS LAG(price) * 0.90 >= price,
>                INCREASED_05_PERCENT AS LAG(price) * 1.05 <= price
>            )
>     ORDER BY price_date;
> +---------+------------+-------+
> | COMPANY | PRICE_DATE | PRICE |
> |---------+------------+-------|
> | ABCD    | 2020-10-04 |    42 |
> | ABCD    | 2020-10-05 |    30 |
> | ABCD    | 2020-10-06 |    47 |
> +---------+------------+-------+
> ```
>
> ```sqlexample
> SELECT company, price_date, price
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            ALL ROWS PER MATCH
>            PATTERN (LESS_THAN_45 {- DECREASED_10_PERCENT -} INCREASED_05_PERCENT)
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                DECREASED_10_PERCENT AS LAG(price) * 0.90 >= price,
>                INCREASED_05_PERCENT AS LAG(price) * 1.05 <= price
>            )
>     ORDER BY price_date;
> +---------+------------+-------+
> | COMPANY | PRICE_DATE | PRICE |
> |---------+------------+-------|
> | ABCD    | 2020-10-04 |    42 |
> | ABCD    | 2020-10-06 |    47 |
> +---------+------------+-------+
> ```

The next example is more realistic. It searches for patterns in which a stock price rose one or more days in a row,
and then fell one or more days in a row. Because the output could be quite large, this uses exclusion to show only
the first day the stock rose (if it rose more than one day in a row) and only the first
day it dropped (if it dropped more than one day in a row). The pattern is shown below:

```sqlexample
PATTERN(LESS_THAN_45 UP {- UP* -} DOWN {- DOWN* -})
```

This pattern looks for the following events in order:

* A starting price less than 45.
* An UP, possibly followed immediately by others that are not included in the output.
* A DOWN, possibly followed immediately by others that are not included in the output.

Here are the code and output for versions of the preceding pattern without exclusion and with exclusion:

> ```sqlexample
> SELECT company, price_date, price
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            ALL ROWS PER MATCH
>            PATTERN ( LESS_THAN_45 UP UP* DOWN DOWN* )
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                UP   AS price > LAG(price),
>                DOWN AS price < LAG(price)
>            )
>     WHERE company = 'XYZ'
>     ORDER BY price_date;
> +---------+------------+-------+
> | COMPANY | PRICE_DATE | PRICE |
> |---------+------------+-------|
> | XYZ     | 2020-10-02 |    24 |
> | XYZ     | 2020-10-03 |    37 |
> | XYZ     | 2020-10-04 |    63 |
> | XYZ     | 2020-10-05 |    65 |
> | XYZ     | 2020-10-06 |    56 |
> | XYZ     | 2020-10-07 |    50 |
> +---------+------------+-------+
> ```
>
> ```sqlexample
> SELECT company, price_date, price
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            ALL ROWS PER MATCH
>            PATTERN ( {- LESS_THAN_45 -}  UP  {- UP* -}  DOWN  {- DOWN* -} )
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                UP   AS price > LAG(price),
>                DOWN AS price < LAG(price)
>            )
>     WHERE company = 'XYZ'
>     ORDER BY price_date;
> +---------+------------+-------+
> | COMPANY | PRICE_DATE | PRICE |
> |---------+------------+-------|
> | XYZ     | 2020-10-03 |    37 |
> +---------+------------+-------+
> ```

## Returning Information About the Match

### Basic Match Information

In many cases, you want your query to list not only information from the table that contains the data, but also
information about the patterns that were found. When you want information about the matches themselves, you specify
that information in the `MEASURES` clause.

The `MEASURES` clause can include the following functions, which are specific to `MATCH_RECOGNIZE`:

* `MATCH_NUMBER()`: Each time a match is found, it is assigned a sequential match number, starting from one. This
  function returns that match number.
* `MATCH_SEQUENCE_NUMBER()`: Because a pattern usually involves more than one data point, you might want to know which
  data point is associated with each value from the table. This function returns the sequential number of the data
  point within the match.
* `CLASSIFIER()`: The classifier is the name of the pattern variable that the row matched.

The query below includes a `MEASURES` clause with the match number, match sequence number, and classifier.

> ```sqlexample
> SELECT company, price_date, price,
>        "Match #", "Match Sequence #", "Symbol Matched"
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            MEASURES
>                MATCH_NUMBER() AS "Match #",
>                MATCH_SEQUENCE_NUMBER() AS "Match Sequence #",
>                CLASSIFIER AS "Symbol Matched"
>            ALL ROWS PER MATCH
>            PATTERN (LESS_THAN_45 DECREASED_10_PERCENT INCREASED_05_PERCENT)
>            DEFINE
>                LESS_THAN_45 AS price < 45.00,
>                DECREASED_10_PERCENT AS LAG(price) * 0.90 >= price,
>                INCREASED_05_PERCENT AS LAG(price) * 1.05 <= price
>            )
>     ORDER BY company, "Match #", "Match Sequence #";
> +---------+------------+-------+---------+------------------+----------------------+
> | COMPANY | PRICE_DATE | PRICE | Match # | Match Sequence # | Symbol Matched       |
> |---------+------------+-------+---------+------------------+----------------------|
> | ABCD    | 2020-10-04 |    42 |       1 |                1 | LESS_THAN_45         |
> | ABCD    | 2020-10-05 |    30 |       1 |                2 | DECREASED_10_PERCENT |
> | ABCD    | 2020-10-06 |    47 |       1 |                3 | INCREASED_05_PERCENT |
> +---------+------------+-------+---------+------------------+----------------------+
> ```

The `MEASURES` subclause can produce much more information than this. For more details, see
[the MATCH_RECOGNIZE reference documentation](../sql-reference/constructs/match_recognize.md).

### Windows, Window Frames, and Navigational Functions

The `MATCH_RECOGNIZE` clause operates on a “window” of rows. If the `MATCH_RECOGNIZE` contains a `PARTITION`
subclause, then each partition is one window. If there is
no `PARTITION` subclause, then the entire input is one window.

The `PATTERN` subclause of `MATCH_RECOGNIZE` specifies the symbols in order from left to right. For example:

```sqlexample
PATTERN (START DOWN UP)
```

If you picture the data as a sequence of rows in ascending order from left to right, you can think of
`MATCH_RECOGNIZE` as moving rightward (e.g. from the earliest date to the latest date in the stock price example),
searching for a pattern in the rows inside each window.

`MATCH_RECOGNIZE` starts with the first row in the window and checks whether that row and the subsequent rows
match the pattern.

In the simplest case, after determining whether there’s a pattern match starting at the first row in the window,
`MATCH_RECOGNIZE` moves rightward one row and repeats the process, checking whether the 2nd row is the beginning
of an occurrence of the pattern. `MATCH_RECOGNIZE` continues moving rightward until it reaches the end of the window.

(`MATCH_RECOGNIZE` can move rightward by more than one row. For example, you can tell `MATCH_RECOGNIZE` to start
searching for the next pattern after the end of the current pattern.)

You can picture this loosely as though there were a “frame” moving rightward inside the window. The left-hand edge
of that frame is at the first row in the set of rows currently being checked for a match. The right-hand edge
of the frame is not defined until a match is found; once a match is found, the right-hand edge of the frame is
the last row in the match. For example, if the search pattern were `pattern (start down up)` then the
row that matches the `up` is the last row before the right-hand edge of the frame.

(If no match is found, then the right-hand edge of the frame is never defined and is never referenced.)

In simple cases, you can picture a sliding window frame as illustrated below:

You have already seen [navigational functions](../sql-reference/constructs/match_recognize.md) such as `LAG()`
used in expressions in the `DEFINE` subclause (e.g. `DEFINE down_10_percent as LAG(price) * 0.9 >= price`).
The following query shows that navigational functions can also be used in the `MEASURES` subclause. In this example,
the navigational functions show the edges (and thus the size) of the window frame that contains the current match.

Each output row from this query includes the values of the `LAG()`, `LEAD()`, `FIRST()`, and `LAST()`
navigational functions for that row. The size of the window frame is the number of rows between `FIRST()` and
`LAST()`, including the first and last rows themselves.

The `DEFINE` and `PATTERN` clauses in the query below select groups of three rows
(October 1-3, October 2-4, October 3-5, etc.).

```sqlexample
SELECT company, price_date,
       "First(price_date)", "Lag(price_date)", "Lead(price_date)", "Last(price_date)",
       "Match#", "MatchSeq#", "Classifier"
    FROM stock_price_history
        MATCH_RECOGNIZE (
            PARTITION BY company
            ORDER BY price_date
            MEASURES
                -- Show the "edges" of the "window frame".
                FIRST(price_date) AS "First(price_date)",
                LAG(price_date) AS "Lag(price_date)",
                LEAD(price_date) AS "Lead(price_date)",
                LAST(price_date) AS "Last(price_date)",
                MATCH_NUMBER() AS "Match#",
                MATCH_SEQUENCE_NUMBER() AS "MatchSeq#",
                CLASSIFIER AS "Classifier"
            ALL ROWS PER MATCH
            AFTER MATCH SKIP TO NEXT ROW
            PATTERN (CURRENT_ROW T2 T3)
            DEFINE
                CURRENT_ROW AS TRUE,
                T2 AS TRUE,
                T3 AS TRUE
            )
    ORDER BY company, "Match#", "MatchSeq#"
    ;
+---------+------------+-------------------+-----------------+------------------+------------------+--------+-----------+-------------+
| COMPANY | PRICE_DATE | First(price_date) | Lag(price_date) | Lead(price_date) | Last(price_date) | Match# | MatchSeq# | Classifier  |
|---------+------------+-------------------+-----------------+------------------+------------------+--------+-----------+-------------|
| ABCD    | 2020-10-01 | 2020-10-01        | NULL            | 2020-10-02       | 2020-10-01       |      1 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-02 | 2020-10-01        | 2020-10-01      | 2020-10-03       | 2020-10-02       |      1 |         2 | T2          |
| ABCD    | 2020-10-03 | 2020-10-01        | 2020-10-02      | NULL             | 2020-10-03       |      1 |         3 | T3          |
| ABCD    | 2020-10-02 | 2020-10-02        | NULL            | 2020-10-03       | 2020-10-02       |      2 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-03 | 2020-10-02        | 2020-10-02      | 2020-10-04       | 2020-10-03       |      2 |         2 | T2          |
| ABCD    | 2020-10-04 | 2020-10-02        | 2020-10-03      | NULL             | 2020-10-04       |      2 |         3 | T3          |
| ABCD    | 2020-10-03 | 2020-10-03        | NULL            | 2020-10-04       | 2020-10-03       |      3 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-04 | 2020-10-03        | 2020-10-03      | 2020-10-05       | 2020-10-04       |      3 |         2 | T2          |
| ABCD    | 2020-10-05 | 2020-10-03        | 2020-10-04      | NULL             | 2020-10-05       |      3 |         3 | T3          |
| ABCD    | 2020-10-04 | 2020-10-04        | NULL            | 2020-10-05       | 2020-10-04       |      4 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-05 | 2020-10-04        | 2020-10-04      | 2020-10-06       | 2020-10-05       |      4 |         2 | T2          |
| ABCD    | 2020-10-06 | 2020-10-04        | 2020-10-05      | NULL             | 2020-10-06       |      4 |         3 | T3          |
| ABCD    | 2020-10-05 | 2020-10-05        | NULL            | 2020-10-06       | 2020-10-05       |      5 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-06 | 2020-10-05        | 2020-10-05      | 2020-10-07       | 2020-10-06       |      5 |         2 | T2          |
| ABCD    | 2020-10-07 | 2020-10-05        | 2020-10-06      | NULL             | 2020-10-07       |      5 |         3 | T3          |
| ABCD    | 2020-10-06 | 2020-10-06        | NULL            | 2020-10-07       | 2020-10-06       |      6 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-07 | 2020-10-06        | 2020-10-06      | 2020-10-08       | 2020-10-07       |      6 |         2 | T2          |
| ABCD    | 2020-10-08 | 2020-10-06        | 2020-10-07      | NULL             | 2020-10-08       |      6 |         3 | T3          |
| ABCD    | 2020-10-07 | 2020-10-07        | NULL            | 2020-10-08       | 2020-10-07       |      7 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-08 | 2020-10-07        | 2020-10-07      | 2020-10-09       | 2020-10-08       |      7 |         2 | T2          |
| ABCD    | 2020-10-09 | 2020-10-07        | 2020-10-08      | NULL             | 2020-10-09       |      7 |         3 | T3          |
| ABCD    | 2020-10-08 | 2020-10-08        | NULL            | 2020-10-09       | 2020-10-08       |      8 |         1 | CURRENT_ROW |
| ABCD    | 2020-10-09 | 2020-10-08        | 2020-10-08      | 2020-10-10       | 2020-10-09       |      8 |         2 | T2          |
| ABCD    | 2020-10-10 | 2020-10-08        | 2020-10-09      | NULL             | 2020-10-10       |      8 |         3 | T3          |
| XYZ     | 2020-10-01 | 2020-10-01        | NULL            | 2020-10-02       | 2020-10-01       |      1 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-02 | 2020-10-01        | 2020-10-01      | 2020-10-03       | 2020-10-02       |      1 |         2 | T2          |
| XYZ     | 2020-10-03 | 2020-10-01        | 2020-10-02      | NULL             | 2020-10-03       |      1 |         3 | T3          |
| XYZ     | 2020-10-02 | 2020-10-02        | NULL            | 2020-10-03       | 2020-10-02       |      2 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-03 | 2020-10-02        | 2020-10-02      | 2020-10-04       | 2020-10-03       |      2 |         2 | T2          |
| XYZ     | 2020-10-04 | 2020-10-02        | 2020-10-03      | NULL             | 2020-10-04       |      2 |         3 | T3          |
| XYZ     | 2020-10-03 | 2020-10-03        | NULL            | 2020-10-04       | 2020-10-03       |      3 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-04 | 2020-10-03        | 2020-10-03      | 2020-10-05       | 2020-10-04       |      3 |         2 | T2          |
| XYZ     | 2020-10-05 | 2020-10-03        | 2020-10-04      | NULL             | 2020-10-05       |      3 |         3 | T3          |
| XYZ     | 2020-10-04 | 2020-10-04        | NULL            | 2020-10-05       | 2020-10-04       |      4 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-05 | 2020-10-04        | 2020-10-04      | 2020-10-06       | 2020-10-05       |      4 |         2 | T2          |
| XYZ     | 2020-10-06 | 2020-10-04        | 2020-10-05      | NULL             | 2020-10-06       |      4 |         3 | T3          |
| XYZ     | 2020-10-05 | 2020-10-05        | NULL            | 2020-10-06       | 2020-10-05       |      5 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-06 | 2020-10-05        | 2020-10-05      | 2020-10-07       | 2020-10-06       |      5 |         2 | T2          |
| XYZ     | 2020-10-07 | 2020-10-05        | 2020-10-06      | NULL             | 2020-10-07       |      5 |         3 | T3          |
| XYZ     | 2020-10-06 | 2020-10-06        | NULL            | 2020-10-07       | 2020-10-06       |      6 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-07 | 2020-10-06        | 2020-10-06      | 2020-10-08       | 2020-10-07       |      6 |         2 | T2          |
| XYZ     | 2020-10-08 | 2020-10-06        | 2020-10-07      | NULL             | 2020-10-08       |      6 |         3 | T3          |
| XYZ     | 2020-10-07 | 2020-10-07        | NULL            | 2020-10-08       | 2020-10-07       |      7 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-08 | 2020-10-07        | 2020-10-07      | 2020-10-09       | 2020-10-08       |      7 |         2 | T2          |
| XYZ     | 2020-10-09 | 2020-10-07        | 2020-10-08      | NULL             | 2020-10-09       |      7 |         3 | T3          |
| XYZ     | 2020-10-08 | 2020-10-08        | NULL            | 2020-10-09       | 2020-10-08       |      8 |         1 | CURRENT_ROW |
| XYZ     | 2020-10-09 | 2020-10-08        | 2020-10-08      | 2020-10-10       | 2020-10-09       |      8 |         2 | T2          |
| XYZ     | 2020-10-10 | 2020-10-08        | 2020-10-09      | NULL             | 2020-10-10       |      8 |         3 | T3          |
+---------+------------+-------------------+-----------------+------------------+------------------+--------+-----------+-------------+
```

The output of this query also illustrates that `LAG()` and `LEAD()` functions return NULL for expressions that
attempt to reference rows outside the match group (i.e. outside the [window frame](../sql-reference/functions-window-syntax.md)).

The rules for navigational functions in `DEFINE` clauses are slightly different from the rules for
navigational functions in `MEASURES` clauses. For example, the `PREV()` function is available in the `MEASURES` clause
but currently not in the `DEFINE` clause. Instead, you can use `LAG()` in the `DEFINE` clause. The reference
documentation for [MATCH_RECOGNIZE](../sql-reference/constructs/match_recognize.md) lists the corresponding rule for each
[navigational function](../sql-reference/constructs/match_recognize.md).

The `MEASURES` subclause can also include the following:

* Aggregate functions. For example, if the pattern can match a varying number of rows (e.g. because it matches
  1 or more falling stock prices), then you might want to know the total number of rows in the match; you can
  show this by using `COUNT(*)`.
* General expressions that operate on values in each row in the match. These can be mathematical expressions,
  logical expressions, etc. For example, you could look at values in the row and print text descriptors such as
  “ABOVE AVERAGE”.

  Remember that if you group rows (`ONE ROW PER MATCH`), and if a column has different values for different rows
  in the group, the value selected for that column for that match is non-deterministic, and expressions based on that
  value are also likely to be non-deterministic.

For more information about the `MEASURES` subclause, see the
[reference documentation for MATCH_RECOGNIZE](../sql-reference/constructs/match_recognize.md).

## Specifying Where to Search for the Next Match

By default, after `MATCH_RECOGNIZE` finds a match, it starts looking for the next match immediately after the
end of the most recent match. For example, if `MATCH_RECOGNIZE` finds a match in rows 2, 3, and 4, then
`MATCH_RECOGNIZE` will start looking for the next match at row 5. This prevents overlapping matches.

However, you can choose alternative starting points.

Consider the following data:

```none
Month  | Price | Price Relative to Previous Day
=======|=======|===============================
     1 |   200 |
     2 |   100 | down
     3 |   200 | up
     4 |   100 | down
     5 |   200 | up
     6 |   100 | down
     7 |   200 | up
     8 |   100 | down
     9 |   200 | up
```

Suppose you search the data for a `W` pattern (down, up, down up). There are three `W` shapes:

1. Months: 1, 2, 3, 4, and 5.
2. Months: 3, 4, 5, 6, and 7.
3. Months: 5, 6, 7, 8, and 9.

You can use the `SKIP` clause to specify whether you want all patterns, or only non-overlapping patterns. The
`SKIP` clause supports other options, as well. The `SKIP` clause is documented in more detail in
[MATCH_RECOGNIZE](../sql-reference/constructs/match_recognize.md).

## Best Practices

* Include an ORDER BY clause in your `MATCH_RECOGNIZE` clause.

  + Remember that this ORDER BY applies only within the `MATCH_RECOGNIZE` clause. If you want the entire query to
    return results in a specific order, then use an additional `ORDER BY` clause at the outermost level of the query.
* Pattern variable names:

  + Use meaningful pattern variable names to make your patterns easier to understand and debug.
  + Check for typographical errors in pattern variable names in both the `PATTERN` and `DEFINE` clauses.
* Avoid using defaults for subclauses that have defaults. Make your choices explicit.
* Test your pattern with a small sample of data before scaling up to your full data set.
* The `MATCH_NUMBER()`, `MATCH_SEQUENCE_NUMBER()`, and `CLASSIFIER()` are very helpful in debugging.
* Consider using an `ORDER BY` clause in the outermost level of the query to force the output to be in order by
  using `MATCH_NUMBER()` and `MATCH_SEQUENCE_NUMBER()`. If the output data is in another order, then the output might
  not appear to match the pattern.

## Avoiding Analytic Errors

### Correlation vs Causality

Correlation does not guarantee causality. `MATCH_RECOGNIZE` can return “false positives” (cases where you see a
pattern, but it is just a coincidence).

Pattern matching can also result in “false negatives” (cases where there is a pattern in the real world, but the
pattern does not appear in the data sample).

In most cases, finding a match (for example, finding a pattern that suggests insurance fraud) is just the first step
in an analysis.

The following factors typically increase the number of false positives:

* Large data sets.
* Searching for a large number of patterns.
* Searching for short or simple patterns.

The following factors typically increase the number of false negatives.

* Small data sets.
* Not searching for all the possible relevant patterns.
* Searching for patterns that are more complex than necessary.

### Order-Insensitive Patterns

Although most pattern matching requires that the data be in order (for example, by time), there are exceptions.
For example, if a person commits insurance fraud both in an automobile accident and in a home burglary, it doesn’t
matter which order the frauds occur in.

If the pattern you’re looking for is not order-sensitive, then you can use operators such as
“alternative” (`|`) and `PERMUTE` to make your searches less order-sensitive.

## Examples

This section contains additional examples.

You can find still more examples in [MATCH_RECOGNIZE](../sql-reference/constructs/match_recognize.md).

### Find Multi-Day Price Increases

The following query finds all the patterns in which the price of company ABCD rose two days in a row:

> ```sqlexample
> SELECT *
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            MEASURES
>                MATCH_NUMBER() AS "Match #",
>                MATCH_SEQUENCE_NUMBER() AS "Match Sequence #"
>            ALL ROWS PER MATCH
>            PATTERN (MINIMUM_37 UP UP)
>            DEFINE
>                MINIMUM_37 AS price >= 37.00,
>                UP AS price > LAG(price)
>            )
>     ORDER BY company, "Match #", "Match Sequence #";
> +---------+------------+-------+---------+------------------+
> | COMPANY | PRICE_DATE | PRICE | Match # | Match Sequence # |
> |---------+------------+-------+---------+------------------|
> | ABCD    | 2020-10-06 |    47 |       1 |                1 |
> | ABCD    | 2020-10-07 |    71 |       1 |                2 |
> | ABCD    | 2020-10-08 |    80 |       1 |                3 |
> | XYZ     | 2020-10-03 |    37 |       1 |                1 |
> | XYZ     | 2020-10-04 |    63 |       1 |                2 |
> | XYZ     | 2020-10-05 |    65 |       1 |                3 |
> +---------+------------+-------+---------+------------------+
> ```

#### Demonstrate the PERMUTE Operator

This example demonstrates the `PERMUTE` operator in the pattern. Search for all upward and downward spikes in the
charts limiting the number of rising prices to two:

> ```sqlexample
> select * from stock_price_history match_recognize(
>         partition by company
>         order by price_date
>         measures
>             match_number() as "MATCH_NUMBER",
>             first(price_date) as "START",
>             last(price_date) as "END",
>             count(up.price) as ups,
>             count(*) as "PRICE_COUNT",
>             last(classifier()) = 'DOWN' up_spike
>         after match skip to next row
>         pattern(ANY_ROW PERMUTE(UP{2}, DOWN+))
>         define
>             ANY_ROW AS TRUE,
>             UP as price > lag(price),
>             DOWN as price < lag(price)
>     )
>     order by company, match_number;
> +---------+--------------+------------+------------+-----+-------------+----------+
> | COMPANY | MATCH_NUMBER | START      | END        | UPS | PRICE_COUNT | UP_SPIKE |
> |---------+--------------+------------+------------+-----+-------------+----------|
> | ABCD    |            1 | 2020-10-01 | 2020-10-04 |   2 |           4 | False    |
> | ABCD    |            2 | 2020-10-02 | 2020-10-05 |   2 |           4 | True     |
> | ABCD    |            3 | 2020-10-04 | 2020-10-07 |   2 |           4 | False    |
> | ABCD    |            4 | 2020-10-06 | 2020-10-10 |   2 |           5 | True     |
> | XYZ     |            1 | 2020-10-01 | 2020-10-04 |   2 |           4 | False    |
> | XYZ     |            2 | 2020-10-03 | 2020-10-07 |   2 |           5 | True     |
> +---------+--------------+------------+------------+-----+-------------+----------+
> ```

### Demonstrate the SKIP TO NEXT ROW Option

This example demonstrates the `SKIP TO NEXT ROW` option. This query searches for W-shaped curves in each company’s chart.
The matches can overlap.

> ```sqlexample
> select * from stock_price_history match_recognize(
>     partition by company
>     order by price_date
>     measures
>         match_number() as "MATCH_NUMBER",
>         first(price_date) as "START",
>         last(price_date) as "END",
>         count(*) as "PRICE_COUNT"
>     after match skip to next row
>     pattern(ANY_ROW DOWN+ UP+ DOWN+ UP+)
>     define
>         ANY_ROW AS TRUE,
>         UP as price > lag(price),
>         DOWN as price < lag(price)
> )
> order by company, match_number;
> +---------+--------------+------------+------------+-------------+
> | COMPANY | MATCH_NUMBER | START      | END        | PRICE_COUNT |
> |---------+--------------+------------+------------+-------------|
> | ABCD    |            1 | 2020-10-01 | 2020-10-08 |           8 |
> | XYZ     |            1 | 2020-10-01 | 2020-10-08 |           8 |
> | XYZ     |            2 | 2020-10-05 | 2020-10-10 |           6 |
> | XYZ     |            3 | 2020-10-06 | 2020-10-10 |           5 |
> +---------+--------------+------------+------------+-------------+
> ```

#### Exclusion Syntax

This example shows the exclusion syntax in the pattern. This pattern (like the previous pattern) searches for
`W` shapes, but this query’s output excludes falling prices. Note that in this query, matching continues past the
last row of a match:

> ```sqlexample
> select * from stock_price_history match_recognize(
>         partition by company
>         order by price_date
>         measures
>             match_number() as "MATCH_NUMBER",
>             classifier as cl,
>             count(*) as "PRICE_COUNT"
>         all rows per match
>         pattern(ANY_ROW {- DOWN+ -} UP+ {- DOWN+ -} UP+)
>         define
>             ANY_ROW AS TRUE,
>             UP as price > lag(price),
>             DOWN as price < lag(price)
>     )
>     order by company, price_date;
> +---------+------------+-------+--------------+---------+-------------+
> | COMPANY | PRICE_DATE | PRICE | MATCH_NUMBER | CL      | PRICE_COUNT |
> |---------+------------+-------+--------------+---------+-------------|
> | ABCD    | 2020-10-01 |    50 |            1 | ANY_ROW |           1 |
> | ABCD    | 2020-10-03 |    39 |            1 | UP      |           3 |
> | ABCD    | 2020-10-04 |    42 |            1 | UP      |           4 |
> | ABCD    | 2020-10-06 |    47 |            1 | UP      |           6 |
> | ABCD    | 2020-10-07 |    71 |            1 | UP      |           7 |
> | ABCD    | 2020-10-08 |    80 |            1 | UP      |           8 |
> | XYZ     | 2020-10-01 |    89 |            1 | ANY_ROW |           1 |
> | XYZ     | 2020-10-03 |    37 |            1 | UP      |           3 |
> | XYZ     | 2020-10-04 |    63 |            1 | UP      |           4 |
> | XYZ     | 2020-10-05 |    65 |            1 | UP      |           5 |
> | XYZ     | 2020-10-08 |    54 |            1 | UP      |           8 |
> +---------+------------+-------+--------------+---------+-------------+
> ```

### Search for Patterns in Non-Adjacent Rows

In some situations, you might want to look for patterns in non-contiguous rows. For example, if you are analyzing
log files, you might want to search for all patterns in which a fatal error was preceded by a particular sequence
of warnings. There might not be a natural way to partition and sort the rows so that all of the relevant messages
(rows) are in a single window and adjacent. In that situation, you might need a pattern that looks for
particular events, but doesn’t require that the events be contiguous in the data.

Below is an example of `DEFINE` and `PATTERN` clauses that recognizes either contiguous or non-contiguous
rows that fit the pattern. The symbol `ANY_ROW` is defined as TRUE (so it matches any row). The `*` after each
occurrence of `ANY_ROW` says to allow 0 or more of occurrences of `ANY_ROW` between the first warning and the
second warning, and between the second warning and the fatal error log message. Thus the entire pattern says to
search for `WARNING1`, followed by any number of rows, followed by `WARNING2`, followed by any number of rows,
followed by `FATAL_ERROR`. To omit the irrelevant rows from the output, the query uses
[exclusion](../sql-reference/constructs/match_recognize.md) syntax (`{-` and `-}`).

```sqlexample
MATCH_RECOGNIZE (
    ...
    ORDER BY log_message_timestamp
    ...
    ALL ROWS PER MATCH
    PATTERN ( WARNING1  {- ANY_ROW* -}  WARNING2  {- ANY_ROW* -}  FATAL_ERROR )
    DEFINE
        ANY_ROW AS TRUE,
        WARNING1 AS SUBSTR(log_message, 1, 42) = 'WARNING: Available memory is less than 10%',
        WARNING2 AS SUBSTR(log_message, 1, 41) = 'WARNING: Available memory is less than 5%',
        FATAL_ERROR AS SUBSTR(log_message, 1, 11) = 'FATAL ERROR'
    )
...
```

## Troubleshooting

### Errors When Using ONE ROW PER MATCH and Specifying Columns in the Select Clause

The `ONE ROW PER MATCH` clause acts similarly to an aggregate function. This limits the output columns you can
use. For example, if you use `ONE ROW PER MATCH` and each match contains three rows with different dates, then
you can’t specify the date column as an output column in the SELECT clause because no single date is correct for all
three rows.

### Unexpected Results

* Check for typographical errors in the `PATTERN` and `DEFINE` clauses.

  If a pattern variable name used in the `PATTERN` clause is not defined in the `DEFINE` clause (e.g. because the
  name is typed incorrectly in either the `PATTERN` or `DEFINE` clause), then no error is reported. Instead, the
  pattern variable name is simply assumed to be true for each row.
* Review the `SKIP` clause to make sure that it is appropriate, for example to include or exclude overlapping patterns.

---
title: Implementing differential privacy
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-admin.md
section: User Guide
---

# Implementing differential privacy

This topic contains information for the data provider who is implementing differential privacy for their account.

As you implement differential privacy for your dataset, your tasks involve three key concepts:

* [Privacy policies](differential-privacy-admin-privacy-policies.md). A table or view is not protected by
  differential privacy until you assign a privacy policy to it. A table or view with a privacy policy is considered to be
  *privacy-protected*.
* [Privacy budgets](differential-privacy-overview.md). As analysts query a privacy-protected table, you can
  [manage the privacy budgets](differential-privacy-admin-privacy-budgets.md) associated with those analysts.
* [Privacy domains](differential-privacy-privacy-domains.md). You should define a privacy domain for fact and
  dimension columns in a privacy-protected table or view.

**Limitations**

* You cannot assign a privacy policy and an aggregation policy or masking policy to the same table or view.
* Apart from querying the [noise interval](differential-privacy-analyst.md), analysts don’t know whether they’re querying a
  privacy-protected table, so the data provider should inform them that query results contain noise.
* A data provider cannot monitor the privacy loss incurred by analysts running queries in another account.
* Applying multiple privacy policies to one table is currently not supported. Because of this, protecting more than one entity with
  entity-level differential privacy in a single table is not possible.
* Queries on replicated or cloned tables that have a privacy policy associated with an entity key
  are currently blocked.

## About entity-level privacy

An *entity* refers to a class of data subject that should be protected, for example people, organizations, or locations. If each individual
entity appeared in only one row, row-level privacy would be enough to protect their identities. However, if data belonging to an individual
entity appears in multiple rows (for example, in transactional data), differential privacy must be configured for entity-level privacy to
correctly protect each entity.

To achieve entity-level privacy, Snowflake lets you specify which attribute can be used to identify an entity (an *entity key*). This
lets Snowflake identify all of the records that belong to a particular entity within a dataset. For example, if the entity key is defined
as the column `email`, then Snowflake can determine that all records where `email=joe.smith@example.com` belong to the same entity.

In most cases, entity-level privacy is preferred over row-level privacy, but row-level privacy might be a good fit for a table if the
following is true:

* No column in the table uniquely identifies entities. Entity-level privacy requires an identifying column.
* Each individual entity only appears once.
* The table will not be used in a join. Joins with tables protected by row-level privacy are possible, but have
  [limitations](differential-privacy-analyst.md).

You choose whether to implement entity-level or row-level privacy when assigning a privacy policy to a table or view. For more information,
see [Assign a privacy policy](differential-privacy-admin-privacy-policies.md). If you choose to implement entity-level privacy, the data must also meet
structural requirements to ensure that the entity identifier is used correctly.

> **Tip:**
>
> If you want to protect two separate tables with the same privacy policy, but they do not have the same entity key values, you can create
> a new table that maps the two identifying columns, create a view that joins two of the tables, and apply the privacy policy to the view.
> For example, you could use this strategy if the entity key in one table is `email` and in another table it is `user_id`, but
> both refer to the same entities.

### Structural requirements for entity-level privacy

The structure of data protected by entity-level differential privacy must conform to certain requirements. These requirements must be met so
that Snowflake can accurately track the privacy loss associated with entities.

You should structure your data to meet these requirements *before* applying privacy policies to implement differential privacy. Snowflake
cannot determine whether data conforms to these structural requirements because they concern the meaning of the data, not the differential
privacy implementation. For example, if the entity keys for two different tables are both set to the column `user_id`, but one of the
columns contains values for a numeric identifier while the other column contains email addresses, Snowflake cannot correctly link entity information
across the two tables.

To achieve entity-level privacy, your data must conform to the following requirements:

* **Each row belongs to only one individual within an entity** — As an example, suppose a table contains users and households. If the
  entity that needs to be protected is users, the table cannot be structured such that each row is a household and all the users in that
  household are captured in other columns. You would need to restructure the table so there is only one row per user, with a `household_id`
  column to indicate which household a user belongs to.
* **Consistent entity identifier across all tables** — You can create a privacy policy that represents the protection needed for a single
  entity, then apply that policy to multiple tables that contain information about the entity. When you assign the privacy policy to each
  table, you need to specify the column that uniquely identifies the entity (that is, the entity key). The value that uniquely identifies an
  entity within these entity key columns must be the same. For example, suppose the `email` column is the entity key for two tables that
  contain information about an entity. If the email address of an entity is `joe@example.com` in one table, then the email address in the
  other table must also be `joe@example.com`.
* **Entity identifier in all tables**: Although an entity identifier is not required to implement entity-level privacy, you can make it possible for analysts to
  minimize noise in query joins by including the entity identifier in all tables related to an entity. In some cases, you might need to
  denormalize the entity key column to meet this requirement. For example, suppose you had the following tables where the entity is
  customers:

  | Table | Description |
  | --- | --- |
  | `customer` | Customer directory, where each row is a customer and has a `customer_id`. |
  | `transactions` | Customer transactions, where each row is a transaction and has a `transaction_id`. Each customer can have multiple transactions. |
  | `transaction_lines` | Unique items that were purchased in a transaction. There can be multiple rows in a single transaction. |

  Under best practices for normalization, the `transaction_lines` table would have the `transaction_id` but not the `customer_id`.
  The `transaction_lines` table would link to the `transactions` table, which could then be linked to the `customers` table with
  `customer_id`.

  However for differential privacy, you probably want to optimize the data for the analyst by adding the `customer_id` identifier to the
  `transaction_lines` table. This allows the analyst to minimize noise by including `customer_id` in the join key when joining the
  `transaction_lines` table with another table.

## Interactions with Snowflake features

This section discusses how the following differential privacy objects interact with other Snowflake features. It discusses the effect on
privacy policies, privacy budgets, and privacy domains.

### Data sharing

Secure views and tables with a privacy policy are protected by differential privacy when added to a share. Unsecured views are not
protected by privacy policies if they are queried via a share.

### Replication

For considerations when replicating privacy policies and privacy-protected tables and views, see
[Privacy policies](../account-replication-considerations.md).

> **Note:**
>
> There is a current limitation when querying replicated tables that have a privacy policy associated with an entity key. Queries on those tables are blocked until the limitation is removed.

### Cross-cloud auto-fulfillment

Keep the following in mind when using cross-cloud auto-fulfillment to replicate a data product:

* Administrators in the account to which the data product was replicated cannot adjust the privacy budget.
* Administrators cannot use a single account to view the privacy loss incurred in all regions.

### Cloning

For the effects of cloning privacy-protected tables and views, see [Cloning and differential privacy](../object-clone.md).

> **Note:**
>
> There is a current limitation when querying cloned tables that have a privacy policy associated with an entity key. Queries on those tables are blocked until the limitation is removed.

### Views built on a privacy-protected base object

You can build a view on a privacy-protected table or view. However, the privacy domains of the base table or view are not inherited. As a
result, note the following:

* Privacy domains must be set on the columns of the new view.
* Adjusting the privacy domain of the base table does not affect the privacy domains of the view that is built on it.

### Materialized views

You can assign a privacy policy to a materialized view to make it privacy-protected.

Other interactions between privacy policies and materialized views include the following:

* You cannot create a materialized view based on a privacy-protected table or view.
* You cannot assign a privacy policy to a table if it is referenced as the base table of a materialized view.

### UDFs

Analysts cannot use a user-defined function to query a privacy-protected table.

### Streams

You cannot query a stream that is based on a privacy-protected table.

You cannot assign a privacy policy to a stream.

### Other policies

Privacy policies interact with other Snowflake policies in the following ways:

Masking policies
:   You cannot assign a privacy policy and a masking policy to the same table or view.

Row access policies
:   Row access policies take precedence over a privacy policy. If a row is blocked by the row access policy, it is not included in the results
    of the differentially private query.

Projection policies
:   Protecting a table with a privacy policy and any of its columns with a projection policy at the same time is currently not supported.
    While you’re able to assign the policies in this way, queries against the table will fail.

Aggregation policies
:   You cannot assign a privacy policy and an aggregation policy to the same table or view.

### Dynamic tables

You cannot create a dynamic table when the referenced source table is privacy-protected.

You can assign a privacy policy to a table that is referenced by an existing dynamic table; however, once the policy is assigned, the
dynamic table will no longer refresh.

### External tables

You can assign a privacy policy to an external table. If an analyst tries to aggregate on a VARIANT column, the query fails. However, if an
analyst tries to aggregate on a virtual column, it succeeds.

### Time travel

For time travel, when a previous version of a table is copied as a new table, the current version of a privacy domain is used for the table
because Snowflake does not store previous versions of the privacy domain in table metadata.

---
title: Implementing entity-level privacy with aggregation policies
source: https://docs.snowflake.com/en/user-guide/aggregation-policies-entity-privacy.md
section: User Guide
---

# Implementing entity-level privacy with aggregation policies

Entity-level privacy strengthens the privacy protections provided by aggregation policies. With entity-level privacy, Snowflake
can ensure that each group contains a minimum number of unique entities, not just a minimum number of rows.

The majority of tasks and considerations related to aggregation policies are the same regardless of whether you are implementing
entity-level privacy. For general information about working with aggregation policies, see [Aggregation policies](aggregation-policies.md).

## About entity-level privacy

An *entity* refers to a set of attributes that belong to a logical object (for example, a user profile or household information). These
attributes can be used to identify an entity within a dataset. Entity-level privacy is a feature of privacy-enhancing technologies (PET)
that protects the privacy of an entity that is stored in a shared dataset. It ensures that queries cannot expose sensitive attributes of an
entity, even if those attributes are found in multiple records.

To achieve entity-level privacy, Snowflake allows you to specify which columns identify an entity (an *entity key*). This
lets Snowflake identify all of the records that belong to a particular entity within a dataset. For example, if the entity key is defined
as the column `email`, then Snowflake can determine that all records where `email=joe.smith@example.com` belong to the same entity.

When you define multiple entities for a table, the aggregation policy is evaluated separately for each entity key.

The policy is applied to a query even if the key columns do not appear in the query. For example, given a policy that applies to entity key (user_id), the query `SELECT age FROM T1 GROUP BY age;` will still apply the `min_group_size` restriction for `user_id` in each group, although `user_id` does not appear in the query.

### Aggregation policies *without* entity-level privacy

By default, aggregation policies require analysts to run queries that aggregate data rather
than retrieving individual rows, thereby achieving *row-level privacy*. However, row-level privacy does not prevent a query from
exposing attributes of an entity when those attributes are found in multiple rows (for example, in a table containing transactional data).

For example, suppose a streaming service, ActonViz, has a transactional table that contains the email address (`user_id`) and household
(`household_id`) of each viewer as they watch shows.

| user_id | household_id | program_id | watch_time | start_time |
| --- | --- | --- | --- | --- |
| dave_sr@example.com | 12345 | 1 | 29 | 2023-09-12 09:00 |
| mary@bazco.com | 23485 | 1 | 30 | 2023-09-12 09:00 |
| dave_sr@example.com | 12345 | 6 | 18 | 2023-09-11 13:00 |
| joe@jupiterlink.com | 85456 | 6 | 25 | 2023-09-15 22:00 |
| junior@example.com | 12345 | 5 | 30 | 2023-09-13 11:00 |

ActonViz can use an aggregation policy to force the advertisers to aggregate data into groups that contain at least 2 records. This prevents
the advertisers from retrieving data from an individual record (row-level privacy). If each viewer and household only appeared once in
the table, that would be enough to protect their privacy.

However, an advertiser’s query could still learn information about both viewers and their households. A query could create a group that
consists entirely of records from household `12345` or, even worse, a group that consisted entirely of records for viewer `dave_sr`.
In both cases, the number of records in the group would meet the requirements set by ActonViz (minimum of 2 records per group).

### Aggregation policies *with* entity-level privacy

To achieve entity-level privacy, Snowflake allows you to specify one or more entity keys when assigning an aggregation policy to a table or
view. After the entity key is defined, the groups returned by a query against an aggregation-constrained table or view must contain
at least the specified number of *entities*, not a specified number of *rows*.

In the preceding example, suppose ActonViz defines `household_id` as the entity key because it uniquely identifies each household. The
privacy of each household is now enhanced. Before the change, a group could consist entirely of records where `household_id = 12345`,
but now it must contain at least two distinct values of `household_id`.

Note that the entity key is not always the same as the [primary key](../sql-reference/constraints-overview.md) of a table. In this example,
the table might use `user_id` as the primary key because it uniquely identifies a viewer. But in this case, ActonViz wants to protect
the privacy of an entire household, which consists of multiple viewers, so they chose `household_id` as the entity key.

## About minimum group sizes

Every aggregation policy specifies a minimum group size. Without entity-level privacy, the minimum group size defines the
number of records that must be included in an aggregation group. When an entity key is specified, the minimum group size defines the minimum
number of *unique* entities that must appear in the group to allow it to appear in final results. Remember that aggregation functions such as
SUM and AVG return one group, whereas GROUP BY columns return one group per unique value in the grouped columns.

The following column-level policies do not affect how Snowflake calculates whether there are enough entities in an aggregation group:

* Projection policies are enforced after aggregation policies.
* Masking policies are enforced before aggregation policies. Any aggregation functions or policies work on masked data.

In cases where name references are used several times (for example, in JOIN or UNION operators), Snowflake enforces the minimum group size
for each name reference of each dataset separately. This applies even when the reference points to the same dataset several times.

## Enforce entity-level privacy with aggregation policies

To enforce entity-level privacy with aggregation policies, do the following:

1. When executing the CREATE AGGREGATION POLICY command to create the aggregation policy, specify the number of entities that must be included in each aggregation group.
2. Define the entity key when assigning the aggregation policy to a table or view.

### Specify the minimum number of entities

The syntax for creating an aggregation policy with
[CREATE AGGREGATION POLICY](../sql-reference/sql/create-aggregation-policy.md) does not change if you are using an entity key to achieve entity-level privacy. You
still use the MIN_GROUP_SIZE argument of the AGGREGATION_CONSTRAINT function to specify a minimum group size. As soon as you
define an entity key, the minimum group size changes from a requirement on the number
of records in a group to the number of entities in a group.

For example, the following code creates an aggregation policy that has a minimum group size of 5. As long as you define an entity key when
assigning the policy to a table, each aggregation group must contain at least 5 entities.

```sqlexample
CREATE AGGREGATION POLICY my_agg_policy
  AS () RETURNS AGGREGATION_CONSTRAINT ->
  AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);
```

For complete details about creating aggregation policies, including an example of a conditional aggregation policy that enforces different
restrictions under different circumstances, see [Create an aggregation policy](aggregation-policies.md).

### Define an entity key

You define an entity key for a table when you assign the aggregation policy to the table or view. You can define the entity key when
creating a new table or view, or when
updating an existing table of view.

#### Define an entity key for existing tables and views

When executing the ALTER TABLE … SET AGGREGATION POLICY command or the ALTER VIEW … SET AGGREGATION POLICY command to assign the
aggregation policy, use the ENTITY KEY clause to specify which columns in the table or view contain the identifying attributes of an
entity (that is, the entity key).

For example, to create an entity key while assigning an aggregation policy `my_agg_policy` to a table `viewership_log`, execute:

```sqlexample
ALTER TABLE viewership_log
  SET AGGREGATION POLICY my_agg_policy
  ENTITY KEY (first_name,last_name);
```

Because columns `first_name` and `last_name` are the entity key, the aggregation policy can determine that all rows where
`first_name = joe` and `last_name = peterbilt` belong to the same entity.

##### Define multiple entity keys for existing tables and views

To define multiple entity keys for an existing table, you can either add new keys in multiple calls, or add multiple keys in a single call.
Defining a key on a table is additive; it does not overwrite or drop previously defined keys.

**Add two entity keys in two calls.** The first key comprises two columns.

```sqlexample
ALTER TABLE transactions ADD AGGREGATION POLICY ap ENTITY KEY (user_id, user_email);
ALTER TABLE transactions ADD AGGREGATION POLICY ap ENTITY KEY (vendor_id);
```

**Add two entity keys in one call**

```sqlexample
ALTER TABLE transactions ADD AGGREGATION POLICY ap ENTITY KEY (user_id) ENTITY KEY (vendor_id);
```

#### Define an entity key for new tables and views

When executing the CREATE TABLE … WITH AGGREGATION POLICY command or the CREATE VIEW … WITH AGGREGATION POLICY command to assign the
aggregation policy, use the ENTITY KEY clause to specify which columns in the table or view contain the identifying attributes of an entity.

For example, to create a new table `t1` while assigning an aggregation policy and defining an entity key, execute:

```sqlexample
CREATE TABLE t1
  WITH AGGREGATION POLICY my_agg_policy
  ENTITY KEY (first_name,last_name);
```

Because columns `first_name` and `last_name` are the entity key, the aggregation policy can determine that all rows where
`first_name = joe` and `last_name = peterbilt` belong to the same entity.

## Deferred aggregation policies

If a query has subqueries, Snowflake will attempt to enforce any entity aggregation policies on the innermost query. If that query has a
GROUP BY clause, and the GROUP BY columns match the entity key for an aggregation policy, that aggregation policy will not be applied to
that subquery but to the parent query of that subquery. This deferment continues up the chain until either a query is reached that doesn’t
have a set of GROUP BY columns that match the entity key of the policy, or until the top-level query is reached; in either case, the
aggregation policy will be applied to that query. An aggregation policy is applied only once in a query chain.

For example, suppose you have an aggregation policy `my_agg_policy` with entity key `(name, zipcode)`. In the following pseudo query, the inner query has a GROUP BY set that matches the entity key for `my_agg_policy`, and so the policy is deferred to its parent. The policy is applied at the parent because it is a top-level query, even though the GROUP BY columns also match the policy columns.

```sqlexample
SELECT age, name, zipcode FROM(                        -- Outermost query: my_agg_policy enforced.
  SELECT name, zipcode FROM T GROUP BY name, zipcode   -- Matches my_agg_policy entity key: my_agg_policy deferred
)
  GROUP BY age, name, zipcode;
```

Note that GROUP BY columns can be a superset of the entity key columns to trigger a deferment, and policies are deferred only when GROUP BY columns are matched; aggregation functions do not trigger deferment.

Each aggregation policy is applied separately to all query blocks in the query. A query comprised of multiple blocks through a [set operator](../sql-reference/operators-query.md) (such as UNION) will evaluate the aggregation policies separately for each query block.

Aggregation deferment has some useful effects, demonstrated in the following example.

### Deferment example

Imagine you want to aggregate users into two buckets, “low spenders” and “high spenders”, for entities defined as `(zipcode, email)`.
Deferment allows this to work as shown in the following example. Without deferment, the inner query would return NULL, because each group
would consist of one `(zipcode, email)` entity, which would be suppressed when `min_group_size` is set to any value greater than 1

```sqlexample
WITH bucketed AS (
  SELECT
    CASE
      WHEN SUM(transaction_amount) BETWEEN 0 AND 100 THEN 'low'
      WHEN SUM(transaction_amount) BETWEEN 101 AND 100000 THEN 'high'
    END AS transaction_bucket,
    zipcode,               -- zipcode and email need not appear in the select list, but this lets us compute entity_count below
    email
  FROM my_transactions
  GROUP BY zipcode, email  -- This would not work if it was only GROUP BY zipcode, since the entity key is (zipcode, email)
)
SELECT
  transaction_bucket,
  COUNT(DISTINCT zipcode, email) AS entity_count
FROM
  bucketed
GROUP BY transaction_bucket;
```

### Multiple policy deferment

If a table has multiple aggregation policies, each aggregation policy is evaluated, and possibly deferred, independently. If you have multiple aggregation policies on a table, design your queries carefully, as you can encounter unexpected results when different policies are applied at different query levels.

For example, here is a problem you might encounter if you try a nested query to bucket your users into high and low spender categories on a table with two separate aggregation policies:

**Table T:**

> ```output
> user_id, vendor_id, zipcode, email,         transaction_amount
>    1     1001       90000    a@example.com        100
>    1     1001       90000    a@example.com         50
>    2     2001       90001    b@example.com         12
>    2     2001       90001    b@example.com          5
>    3     3001       90002    c@example.com         40
> ```

**Aggregation policies:**

> * `user_policy`: `min_group_size` = 3, entity key = `(user_id)`
> * `vendor_policy`: `min_group_size` = 2, entity key = `(vendor_id)`

**Query to bucket users as high or low spenders:**

> ```sqlexample
> WITH amounts AS (
>   SELECT
>     user_id,
>     IFF(SUM(transaction_amount) > 50, 'high', 'low') AS bucket
>   FROM T
>   GROUP BY user_id -- user_policy is deferred, but vendor_policy is enforced
> )
> SELECT COUNT(*) FROM amounts GROUP BY bucket
> ```

**Unexpected results:**

In the inner query, `vendor_policy` is enforced. Each row is grouped by `user_id`, which has only one corresponding `vendor_id`, which violates the `vendor_policy` minimum group size, and the inner query will return NULL, even though three distinct customers belong in the “high” bucket.

## Removing entity key constraints

**To remove an aggregation policy for a single entity key:**

```sqlexample
-- Drop agg policy ap associated with entity key user_id
ALTER TABLE transactions DROP AGGREGATION POLICY ap ENTITY KEY (user_id)
```

**To remove an aggregation policy for multiple entity keys,** remove each policy separately:

```sqlexample
-- Drop the agg policies associated with two separate keys
ALTER TABLE transactions DROP AGGREGATION POLICY ap ENTITY KEY (user_id)
ALTER TABLE transactions DROP AGGREGATION POLICY ap ENTITY KEY (vendor_id)
```

**To remove an aggregation policy together with all its entities,** omit ENTITY KEY from the DROP statement:

```sqlexample
-- Drop agg policy ap from the table entirely
ALTER TABLE transactions DROP AGGREGATION POLICY ap
```

## Restrictions

The following restrictions apply when working with tables that have multiple entity keys or aggregation policies defined:

* An entity key may be associated with at most one policy. Attempting to assign another policy for an entity key that is already mapped to a policy will result in an error.
* A policy cannot be used for both row-level privacy and entity-level privacy.
* At most one policy may be used for row-level privacy. Attempting to assign another policy as the row-level aggregation policy will result in an error.

## Querying an aggregation-constrained table

The requirements for querying an aggregation-constrained table that has an entity key is the same as querying a table without one. For
information about what types of queries conform to these requirements, see [Query requirements](aggregation-policies.md).

---
title: Increasing warehouse size
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-size.md
section: User Guide
---

# Increasing warehouse size

This topic discusses how a warehouse owner or administrator can adjust the size of a warehouse to improve the performance of queries
running on it.

The larger a warehouse, the more compute resources are available to execute a query or set of queries. This makes increasing the size of a
warehouse a straightforward strategy for improving query performance; simply upsize the warehouse, re-run the query, and if the increased
performance does not justify the increased cost of running the query, return the warehouse to its original size.

Using a larger warehouse has the biggest impact on larger, more complex queries, and may not improve the performance of small, basic
queries.

> **Note:**
>
> You must have [access to the shared SNOWFLAKE database](../sql-reference/account-usage.md) to execute the diagnostic queries provided in this topic. By default, only the ACCOUNTADMIN role has the privileges needed to execute the queries.

## Determining the load of the warehouse

Examining the load of a warehouse can help determine whether increasing its size can help improve performance. If a warehouse is heavily
loaded, concurrent queries might be competing for its compute resources, in which case increasing the size of a warehouse might not
provide as big of a performance boost as expected. But if you can determine that the load is low, there is a good chance that increasing
the size of the warehouse will improve the performance of a complex query.

**Query: Warehouse Load**

This query provides insight into the total load of a warehouse for executed and queued queries. These load values represent the ratio of the total execution time (in seconds) of all queries in a specific state in an interval by the total time (in seconds) for that interval.

For example, if 276 seconds was the total time for 4 queries in a 5 minute (300 second) interval, then the query load value is 276 / 300 = 0.92.

```sqlexample
 SELECT TO_DATE(start_time) AS date,
  warehouse_name,
  SUM(avg_running) AS sum_running,
  SUM(avg_queued_load) AS sum_queued
FROM snowflake.account_usage.warehouse_load_history
WHERE TO_DATE(start_time) >= DATEADD(month,-1,CURRENT_TIMESTAMP())
GROUP BY 1,2
HAVING SUM(avg_queued_load) >0;
```

## Cost considerations

A larger warehouse consumes more credits for a given length of time:

| Warehouse size | Credits / hour (Gen1 warehouses) | Credits / second (Gen1 warehouses) | Notes |
| --- | --- | --- | --- |
| X-Small | 1 | 0.0003 | Default size for warehouses created in Snowsight and using [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md). |
| Small | 2 | 0.0006 |  |
| Medium | 4 | 0.0011 |  |
| Large | 8 | 0.0022 |  |
| X-Large | 16 | 0.0044 | Default size for warehouses created using Snowsight. |
| 2X-Large | 32 | 0.0089 |  |
| 3X-Large | 64 | 0.0178 |  |
| 4X-Large | 128 | 0.0356 |  |
| 5X-Large | 256 | 0.0711 | Generally available in Amazon Web Services (AWS) and Microsoft Azure regions, and in preview in US Government regions. |
| 6X-Large | 512 | 0.1422 | Generally available in Amazon Web Services (AWS) and Microsoft Azure regions, and in preview in US Government regions. |

The numbers in the preceding table refer to the first generation (Gen1) of Snowflake standard warehouses.
For usage information about the newer Gen2 warehouses, see [Snowflake generation 2 standard warehouses](warehouses-gen2.md).
For information about credit consumption for generation 2 standard warehouses,
see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
Gen2 warehouses aren’t yet available for all cloud service providers or for all regions, and currently are not the default
when you create a standard warehouse.

> **Tip:**
>
> For information about cost implications of changing the RESOURCE_CONSTRAINT property, see
> [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](warehouses-gen2.md).

Another way that you can scale the capacity of Snowflake warehouses without changing the warehouse size is by using
multi-cluster warehouses. For more information about that feature, see [Multi-cluster warehouses](warehouses-multicluster.md).

If a query takes less time to execute on a larger warehouse, the increased cost of running a large warehouse might be offset by the reduced
execution time. For example, if a query runs twice as fast on the next largest warehouse, the total cost of running the query remains the
same.

> **Tip:**
>
> Best practice is to limit who can adjust the size of a warehouse. Allowing users to increase the size of a warehouse to meet the needs
> of an individual query can result in unexpected costs if they forget to return the warehouse to its original size once the query has
> been executed.

## How to increase the warehouse size

To increase the size of a warehouse, do one of the following:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Compute » Warehouses.
    3. Find the warehouse, and select … » Edit.
    4. Use the Size drop-down to select the new warehouse size.
    5. Select Save Warehouse.

SQL:
:   Use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command to change the warehouse size. For example:

    ```sqlexample
    ALTER WAREHOUSE my_wh SET WAREHOUSE_SIZE = large;
    ```

---
title: Index hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-index.md
section: User Guide
---

# Index hybrid tables

This topic explains how to index [hybrid tables](tables-hybrid.md).

## Types of indexes

Hybrid tables support two types of indexes:

* Indexes that are created automatically when you declare constraints for hybrid table columns.

  + Indexes for PRIMARY KEY constraints
  + Indexes for FOREIGN KEY constraints
  + Indexes for UNIQUE constraints
* User-defined indexes, known as *secondary indexes*, that you can define on other columns as needed.
  A single index can cover one or more columns. You can use CREATE HYBRID TABLE or CREATE INDEX to
  define secondary indexes.

  When you create secondary indexes, you can “include” columns that are not part of the index key but are
  associated and stored with the index itself. See INCLUDE columns.

  > **Attention:**
  >
  > To add a secondary index, you must use a role that is granted the SELECT
  > privilege on the hybrid table. If you have access to a view of the
  > data in the hybrid table, but not the table itself, you can’t add a secondary index.

## Add secondary indexes

All hybrid tables require a unique primary key. The data in a hybrid table is ordered by this primary key.
You can create additional secondary indexes on non-primary key attributes to accelerate lookups on those
attributes. Indexes might be able to reduce the number of records that are scanned when
a query predicate uses one of the following conditions:

* `=`, `>`, `>=`, `<`, `<=` ([comparison operators](../sql-reference/operators-comparison.md))
* [[ NOT ] IN](../sql-reference/functions/in.md) conditions
* [[ NOT ] BETWEEN](../sql-reference/functions/between.md) conditions

If you have common, repeated queries with predicates on a specific attribute or a composite group of attributes,
consider adding an index to that attribute or group of attributes to improve performance. Be aware of the
following considerations when you use indexes:

* Increase in storage consumption when storing additional copies of the subset of data in the index.
* Additional overhead on DMLs because indexes are maintained synchronously.

You can add secondary indexes to a hybrid table when you create it, or you can add them later by using the
CREATE INDEX command. For example, the following CREATE HYBRID TABLE statement creates two indexes automatically (on the
PRIMARY KEY and UNIQUE columns, `col1` and `col2`) and one user-defined secondary index (on `col3`):

```sqlexample
CREATE OR REPLACE HYBRID TABLE target_hybrid_table (
    col1 VARCHAR(32) PRIMARY KEY,
    col2 NUMBER(38,0) UNIQUE,
    col3 NUMBER(38,0),
    INDEX index_col3 (col3)
    )
  AS SELECT col1, col2, col3 FROM source_table;
```

Alternatively, you can create a secondary index for an existing hybrid table by using the
[CREATE INDEX](../sql-reference/sql/create-index.md) command. Use this command to add an index to a hybrid table
that is actively being used for a workload and is serving queries, or has foreign keys. The CREATE INDEX
command builds indexes concurrently without locking the table during the operation.

> **Tip:**
>
> Check the index build status with the [SHOW INDEXES](../sql-reference/sql/show-indexes.md) command. Only one
> index build at a time is supported.

However, if your hybrid table application is in development or test mode, and some downtime for the
table is not an issue, it is more efficient to recreate the hybrid table and create the indexes by
running an optimized bulk load. This method is more efficient than online index building with the CREATE INDEX
command.

Optimized bulk loading is supported for CTAS, COPY, and INSERT INTO … SELECT,
but you can’t use CTAS if your table has a FOREIGN KEY constraint. The second table created in this
example, `fk_hybrid_table`, would have to be bulk-loaded with COPY or INSERT INTO … SELECT:

```sqlexample
CREATE OR REPLACE HYBRID TABLE ref_hybrid_table (
    col1 VARCHAR(32) PRIMARY KEY,
    col2 NUMBER(38,0) UNIQUE
);

CREATE OR REPLACE HYBRID TABLE fk_hybrid_table (
    col1 VARCHAR(32) PRIMARY KEY,
    col2 NUMBER(38,0),
    col3 NUMBER(38,0),
    FOREIGN KEY (col2) REFERENCES ref_hybrid_table(col2),
    INDEX index_col3 (col3)
);
```

### INCLUDE columns

Although they are not part of the secondary index key, INCLUDE columns are stored with the index records. Because of this
association between the actual indexed columns and the data in the included columns, certain queries can avoid table scans and
benefit from less costly scans that use the index. However, using included columns in indexes might cause an
increase in storage consumption because additional columns are stored with the indexed columns.

For example, consider the following table and index. The index in this case could be declared in either the CREATE TABLE
statement or the CREATE INDEX statement.

```sqlexample
CREATE OR REPLACE HYBRID TABLE sensor_data_device1 (
  device_id VARCHAR(10),
  timestamp TIMESTAMP PRIMARY KEY,
  temperature DECIMAL(6,4),
  vibration DECIMAL(6,4),
  motor_rpm INT
  );

CREATE INDEX sec_sensor_idx
  ON sensor_data_device1(temperature)
    INCLUDE (vibration, motor_rpm);
```

Because this secondary index covers one column directly (`temperature`) and two columns indirectly
(`vibration, motor_rpm`), the index can be used to optimize certain queries that constrain `temperature` and select
data from the included columns.

To test this behavior, first generate some rows for the table:

```sqlexample
INSERT INTO sensor_data_device1 (device_id, timestamp, temperature, vibration, motor_rpm)
  SELECT 'DEVICE1', timestamp,
    UNIFORM(25.1111, 40.2222, RANDOM()), -- Temperature range in °C
    UNIFORM(0.2985, 0.3412, RANDOM()), -- Vibration range in mm/s
    UNIFORM(1400, 1495, RANDOM()) -- Motor RPM range
  FROM (
    SELECT DATEADD(SECOND, SEQ4(), '2024-03-01') AS timestamp
      FROM TABLE(GENERATOR(ROWCOUNT => 2678400)) -- seconds in 31 days
  );
```

Now run the following query:

```sqlexample
SELECT temperature, vibration, motor_rpm
  FROM sensor_data_device1
  WHERE temperature = 25.6;
```

This query makes use of the secondary index named `sec_sensor_idx`. You can verify this behavior
by running the EXPLAIN command on the query or by reviewing the query profile in Snowsight.
You will see an index scan on the secondary index and no “probe scan” on the hybrid table itself.

The following queries, using other supported WHERE clause conditions, would also benefit from the
same secondary index:

```sqlexample
SELECT temperature, vibration, motor_rpm
  FROM sensor_data_device1
  WHERE temperature IN (25.6, 31.2, 35.8);

SELECT temperature, vibration, motor_rpm
  FROM sensor_data_device1
  WHERE temperature BETWEEN 25.0 AND 26.0;
```

Now modify the first query by adding the `device_id` column to the select list. This column isn’t covered by
the `sec_sensor_idx` index.

```sqlexample
SELECT device_id, temperature, vibration, motor_rpm
  FROM sensor_data_device1
  WHERE temperature = 25.6;
```

This query can’t depend on the secondary index entirely; a probe scan of the hybrid table is needed
to return the correct `device_id` values.

---
title: Installing and configuring the Kafka connector
source: https://docs.snowflake.com/en/user-guide/kafka-connector-install.md
section: User Guide
---

# Installing and configuring the Kafka connector

The Kafka connector is provided as a JAR (Java executable) file.

Snowflake provides two versions of the connector:

* A version for the [Confluent package version of Kafka](https://www.confluent.io/hub/snowflakeinc/snowflake-kafka-connector).
* A version for the [open source software (OSS) Apache Kafka package](https://mvnrepository.com/artifact/com.snowflake/snowflake-kafka-connector/).

The instructions in this topic specify which steps apply only to either version of the connector.

## Configuring access control for Snowflake objects

### Required privileges

Creating and managing Snowflake objects used by the Kafka connector requires a role with the following minimum privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE . CREATE TABLE . CREATE STAGE . CREATE PIPE | After the schema-level objects have been created, the CREATE `object` privileges can be revoked. |
| Table | OWNERSHIP | Only required when using the Kafka connector to ingest data into an existing table. . If the connector creates a new target table for records from the Kafka topic, the default role for the user specified in the Kafka configuration file becomes the table owner (i.e. has the OWNERSHIP privilege on the table). |
| Stage | READ . WRITE | Only required when using the Kafka connector to stage data files from Kafka to an existing internal stage (not recommended). . If the connector creates a new stage to temporarily store data files consumed from the Kafka topic, the default role for the user specified in the Kafka configuration file becomes the stage owner (i.e. has the OWNERSHIP privilege on the stage). |

Snowflake recommends that you create a separate user (using [CREATE USER](../sql-reference/sql/create-user.md)) and role (using [CREATE ROLE](../sql-reference/sql/create-role.md)) for each Kafka instance so that the access privileges can be individually revoked if needed. The role should be assigned as the default role for the user.

### Creating a role to use the Kafka connector

The following script creates a custom role for use by the Kafka connector (e.g. KAFKA_CONNECTOR_ROLE_1). Any role that can grant privileges (e.g. SECURITYADMIN or any role with the MANAGE GRANTS privilege) can grant this custom role to any user to allow the Kafka connector to create the required Snowflake objects and insert data into tables. The script references a specific existing database and schema (`kafka_db.kafka_schema`) and user (`kafka_connector_user_1`):

```sqlexample
-- Use a role that can create and manage roles and privileges.
USE ROLE securityadmin;

-- Create a Snowflake role with the privileges to work with the connector.
CREATE ROLE kafka_connector_role_1;

-- Grant privileges on the database.
GRANT USAGE ON DATABASE kafka_db TO ROLE kafka_connector_role_1;

-- Grant privileges on the schema.
GRANT USAGE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;
GRANT CREATE TABLE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;
GRANT CREATE STAGE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;
GRANT CREATE PIPE ON SCHEMA kafka_schema TO ROLE kafka_connector_role_1;

-- Only required if the Kafka connector will load data into an existing table.
GRANT OWNERSHIP ON TABLE existing_table1 TO ROLE kafka_connector_role_1;

-- Only required if the Kafka connector will stage data files in an existing internal stage: (not recommended).
GRANT READ, WRITE ON STAGE existing_stage1 TO ROLE kafka_connector_role_1;

-- Grant the custom role to an existing user.
GRANT ROLE kafka_connector_role_1 TO USER kafka_connector_user_1;

-- Set the custom role as the default role for the user.
-- If you encounter an 'Insufficient privileges' error, verify the role that has the OWNERSHIP privilege on the user.
ALTER USER kafka_connector_user_1 SET DEFAULT_ROLE = kafka_connector_role_1;
```

Note that any privileges must be granted directly to the role used by the connector. Grants cannot be inherited from role hierarchy.

For more information on creating custom roles and role hierarchies, see [Configuring access control](security-access-control-configure.md).

## Installation prerequisites

* The Kafka connector supports the following package versions:

  | Package | Snowflake Kafka Connector Version | Package Support (Tested by Snowflake) |
  | --- | --- | --- |
  | Apache Kafka | 2.0.0 (or higher) | Apache Kafka 2.8.2, 3.7.2 |
  | Confluent | 2.0.0 (or higher) | Confluent 6.2.15, 7.8.2 |
* The Kafka connector is built for use with Kafka Connect API 3.9.0. Any newer versions of Kafka Connect API have not been tested. Any versions older than 3.9.0 are compatible with the connector. For more information, see [Kafka Compatibility](https://kafka.apache.org/protocol.html#protocol_compatibility).
* When you have both the Kafka connector and the JDBC driver jar files in your environment, make sure your JDBC version matches the `snowflake-jdbc` version specified in the `pom.xml` file of your intended Kafka connector version. You can go to your preferred Kafka connector release version, for example, [v2.0.1](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v2.0.1). Then browse `pom.xml` file to find out the version of `snowflake-jdbc`.
* If you use Avro format for ingesting data:

  > + Use the Avro parser, version 1.8.2 (or higher), available from <https://mvnrepository.com/artifact/org.apache.avro>.
  > + If you use the schema registry feature with Avro, use version 5.0.0 (or higher) of the Kafka Connect Avro Converter available at <https://mvnrepository.com/artifact/io.confluent>.
  >
  >   Note that the schema registry feature is not available in the OSS Apache Kafka package.
* Configure Kafka with the desired data retention time and/or storage limit.
* Install and configure the Kafka Connect cluster.

  Each Kafka Connect cluster node should include enough RAM for the Kafka connector. The minimum recommended amount is 5 MB per Kafka partition. This is in addition to the RAM required for any other work that Kafka Connect is doing.
* We recommend using the same versions on Kafka Broker and Kafka Connect Runtime.
* We strongly recommend running your Kafka Connect instance in the same cloud provider [region](intro-regions.md) as your Snowflake account. This is not strictly required, but typically improves throughput.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../release-notes/requirements.md).

## Installing the connector

This section provides instructions for installing and configuring the Kafka connector for Confluent. Refer to the following table for Kafka connector versions:

| Release Series | Status | Notes |
| --- | --- | --- |
| 4.x.x | Public Preview | Early access. Currently the migration from 3.x and 2.x is not supported. |
| 3.x.x | Officially supported | Latest version and strongly recommended. |
| 2.x.x | Officially supported | Upgrade recommended. |
| 1.x.x | Not supported |  |

### Installing the connector for Confluent

#### Download the Kafka connector files

Download the Kafka connector JAR file from either of the following locations:

Confluent Hub:
:   <https://www.confluent.io/hub/>

    The package includes all dependencies required to use either an encrypted or unencrypted private key for key pair authentication. For more information, see Using Key Pair Authentication & Key Rotation (in this topic).

Maven Central Repository:
:   <https://mvnrepository.com/artifact/com.snowflake>

    The JAR file does not require any additional dependencies to use an unencrypted private key for key pair authentication. To use an encrypted private key, download the [Bouncy Castle](https://www.bouncycastle.org/) cryptography library (a JAR file). Snowflake uses Bouncy Castle to decrypt encrypted RSA private keys used to log in:

    > * <https://mvnrepository.com/artifact/org.bouncycastle/bc-fips/2.1.0>
    > * <https://mvnrepository.com/artifact/org.bouncycastle/bcpkix-fips/2.1.8>

    For the Kafka Connector versions prior to 3.1.1, use the following Bouncy Castle versions instead:

    > * <https://mvnrepository.com/artifact/org.bouncycastle/bc-fips/1.0.1>
    > * <https://mvnrepository.com/artifact/org.bouncycastle/bcpkix-fips/1.0.3>

    Download these files to the same local folder as the Kafka connector JAR file.

    The source code for the connector is available at <https://github.com/snowflakedb/snowflake-kafka-connector>.

#### Install the Kafka connector

Install the Kafka connector using the instructions provided for installing other connectors:

> <https://docs.confluent.io/current/connect/userguide.html>

### Installing the connector for open source Apache Kafka

This section provides instructions for installing and configuring the Kafka connector for open source Apache Kafka.

#### Install Apache Kafka

1. Download the Kafka package from its official website: <https://kafka.apache.org/downloads>.
2. In a terminal window, change to the directory where you downloaded the package file.
3. Execute the following command to decompress the `kafka_<scala_version>-<kafka_version>.tgz` file:

   ```none
   tar xzvf kafka_<scala_version>-<kafka_version>.tgz
   ```

#### Install the JDK

Install and configure the Java Development Kit (JDK). Snowflake tests with the Standard Edition (SE) of the JDK. The Enterprise Edition (EE) is expected to be compatible but has not been tested.

If you have already completed this step, you can skip this section.

1. Download the JDK from <https://www.oracle.com/technetwork/java/javase/downloads/index.html>.
2. Install or decompress the JDK.
3. Following the instructions for your operating system, set the environment variable JAVA_HOME to point to the directory containing the JDK.

#### Download the Kafka connector JAR files

1. Download the Kafka connector JAR file from the Maven Central Repository:

   <https://mvnrepository.com/artifact/com.snowflake>
2. The JAR file does not require any additional dependencies to use an unencrypted private key for key pair authentication. To use an encrypted private key, download the [Bouncy Castle](https://www.bouncycastle.org/) cryptography library (a JAR file). Snowflake uses Bouncy Castle to decrypt encrypted RSA private keys used to log in:

   * <https://mvnrepository.com/artifact/org.bouncycastle/bc-fips/1.0.1>
   * <https://mvnrepository.com/artifact/org.bouncycastle/bcpkix-fips/1.0.3>
3. If your Kafka data is streamed in [Apache Avro](https://avro.apache.org/) format, then download the Avro JAR file:

   <https://mvnrepository.com/artifact/org.apache.avro/avro>

The source code for the connector is available at <https://github.com/snowflakedb/snowflake-kafka-connector>.

#### Install the Kafka connector

Copy the JAR files you downloaded in Download the Kafka Connector JAR Files to the `<kafka_dir>/libs` folder.

## Configuring the Kafka connector

The connector is configured by creating a file that specifies parameters such as the Snowflake login credentials, topic name(s), Snowflake table name(s), etc.

> **Important:**
>
> The Kafka Connect framework broadcasts the configuration settings for the Kafka connector from the master node to worker nodes. The configuration settings include sensitive information (specifically, the Snowflake username and private key). Make sure to secure the communication channel between Kafka Connect nodes. For instructions, see the documentation for your Apache Kafka software.

Each configuration file specifies the topics and corresponding tables for one database and one schema in that database. Note that a connector can ingest messages from
any number of topics, but the corresponding tables must all be stored in a single database and schema.

This section provides instructions for both the distributed and standalone modes.

For descriptions of the configuration fields, see Kafka configuration properties.

> **Important:**
>
> Because the configuration file typically contains security related information, such as the private key, set read/write privileges appropriately on the file to limit access.
>
> In addition, consider storing the configuration file in a secure external location or a key management service. For more information, see Externalizing Secrets (in this topic).

### Distributed mode

Create the Kafka configuration file, e.g. `<path>/<config_file>.json`. Populate the file with all connector configuration
information. The file should be in JSON format.

**Sample configuration file**

```sqljson
{
  "name":"XYZCompanySensorData",
  "config":{
    "connector.class":"com.snowflake.kafka.connector.SnowflakeSinkConnector",
    "tasks.max":"8",
    "topics":"topic1,topic2",
    "snowflake.topic2table.map": "topic1:table1,topic2:table2",
    "buffer.count.records":"10000",
    "buffer.flush.time":"60",
    "buffer.size.bytes":"5000000",
    "snowflake.url.name":"myorganization-myaccount.snowflakecomputing.com:443",
    "snowflake.user.name":"jane.smith",
    "snowflake.private.key":"xyz123",
    "snowflake.private.key.passphrase":"jkladu098jfd089adsq4r",
    "snowflake.database.name":"mydb",
    "snowflake.schema.name":"myschema",
    "key.converter":"org.apache.kafka.connect.storage.StringConverter",
    "value.converter":"com.snowflake.kafka.connector.records.SnowflakeAvroConverter",
    "value.converter.schema.registry.url":"http://localhost:8081",
    "value.converter.basic.auth.credentials.source":"USER_INFO",
    "value.converter.basic.auth.user.info":"jane.smith:MyStrongPassword"
  }
}
```

### Standalone mode

Create a configuration file, e.g. `<kafka_dir>/config/SF_connect.properties`. Populate the file with all connector
configuration information.

**Sample configuration file**

```none
connector.class=com.snowflake.kafka.connector.SnowflakeSinkConnector
tasks.max=8
topics=topic1,topic2
snowflake.topic2table.map= topic1:table1,topic2:table2
buffer.count.records=10000
buffer.flush.time=60
buffer.size.bytes=5000000
snowflake.url.name=myorganization-myaccount.snowflakecomputing.com:443
snowflake.user.name=jane.smith
snowflake.private.key=xyz123
snowflake.private.key.passphrase=jkladu098jfd089adsq4r
snowflake.database.name=mydb
snowflake.schema.name=myschema
key.converter=org.apache.kafka.connect.storage.StringConverter
value.converter=com.snowflake.kafka.connector.records.SnowflakeAvroConverter
value.converter.schema.registry.url=http://localhost:8081
value.converter.basic.auth.credentials.source=USER_INFO
value.converter.basic.auth.user.info=jane.smith:MyStrongPassword
```

### Kafka configuration properties

The following properties can be set in the Kafka configuration file for either distributed mode or standalone mode:

#### Required properties

`name`
:   Application name. This must be unique across all Kafka connectors used by the customer. This name must be a valid Snowflake unquoted identifier. For information about valid identifiers, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

`connector.class`
:   `com.snowflake.kafka.connector.SnowflakeSinkConnector`

`topics`
:   Comma-separated list of topics. By default, Snowflake assumes that the table name is the same as the topic name. If the table name is not the same as the topic name, then use the optional `topic2table.map` parameter (below) to specify the mapping from topic name to table name. This table name must be a valid Snowflake unquoted identifier. For information about valid table names, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

    > **Note:**
    >
    > Either `topics` or `topics.regex` is required; not both.

`topics.regex`
:   This is a regular expression (“regex”) that specifies the topics that contain the messages to load into Snowflake tables. The connector loads data from any topic name that matches the regex. The regex must follow the rules for Java regular expressions (i.e. be compatible with java.util.regex.Pattern). The configuration file should contain either `topics` or `topics.regex`, not both.

`snowflake.url.name`
:   The URL for accessing your Snowflake account. This URL must include your [account identifier](admin-account-identifier.md). Note that the protocol (`https://`) and port number are optional.

`snowflake.user.name`
:   User login name for the Snowflake account.

`snowflake.private.key`
:   The private key to authenticate the user. Include only the key, not the header or footer. If the key is split across multiple lines, remove the line breaks. You can provide an unencrypted key, or you can provide an encrypted key and provide the `snowflake.private.key.passphrase` parameter to enable Snowflake to decrypt the key. Use this parameter if and only if the `snowflake.private.key` parameter value is encrypted. This decrypts private keys that were encrypted according to the instructions in Using Key Pair Authentication & Key Rotation (in this topic).

    > **Note:**
    >
    > Also see `snowflake.private.key.passphrase` in Optional Properties (in this topic).

`snowflake.database.name`
:   The name of the database that contains the table to insert rows into.

`snowflake.schema.name`
:   The name of the schema that contains the table to insert rows into.

`header.converter`
:   Required only if the records are formatted in Avro and include a header. The value is `"org.apache.kafka.connect.storage.StringConverter"`.

`key.converter`
:   This is the Kafka record’s key converter (e.g. `"org.apache.kafka.connect.storage.StringConverter"`). This is not used by the Kafka connector, but is required by the Kafka Connect Platform.

    See [Kafka connector limitations](kafka-connector-overview.md) for current limitations.

`value.converter`
:   If the records are formatted in JSON, this should be `"com.snowflake.kafka.connector.records.SnowflakeJsonConverter"`.

    > **Note:**
    >
    > `"com.snowflake.kafka.connector.records.SnowflakeJsonConverter"` deserializes the records as is. Every json field is considered to be a record field and no special treatment is applied to a schema or any other field containing metadata.

    If the records are formatted in Avro and use Kafka’s Schema Registry Service, this should be `"com.snowflake.kafka.connector.records.SnowflakeAvroConverter"`.

    If the records are formatted in Avro and contain the schema (and therefore do not need Kafka’s Schema Registry Service), this should be `"com.snowflake.kafka.connector.records.SnowflakeAvroConverterWithoutSchemaRegistry"`.

    If the records are formatted in plain text, this should be `"org.apache.kafka.connect.storage.StringConverter"`.

    See [Kafka connector limitations](kafka-connector-overview.md) for current limitations.

#### Optional properties

`snowflake.private.key.passphrase`
:   If the value of this parameter is not empty, the Kafka uses this phrase to try to decrypt the private key.

`tasks.max`
:   Number of tasks, usually the same as the number of CPU cores across the worker nodes in the Kafka Connect cluster. To achieve best performance, Snowflake recommends setting the number of tasks equal to the total number of Kafka partitions, but not exceeding the number of CPU cores. High number of tasks may result in an increased memory consumption and frequent rebalances.

`snowflake.topic2table.map`
:   This optional parameter lets a user specify which topics should be mapped to which tables. Each topic and its table name should be separated by a colon (see example below). This table name must be a valid Snowflake unquoted identifier. For information about valid table names, see [Identifier requirements](../sql-reference/identifiers-syntax.md). The topic configuration allows use of regular expressions to define topics, just as the use of `topics.regex` does. The regular expressions cannot be ambiguous — any matched topic must match only a single target table.

    > **Important:**
    >
    > If the `snowflake.topic2table.map` parameter is configured, Snowflake strongly recommends that you upgrade to version 3.1.0 of the connector.
    > For more information about the Snowflake Connector for Kafka releases, see [Snowflake Connector for Kafka release notes](../release-notes/clients-drivers/kafka-connector.md).

    Example:

    ```none
    topics="topic1,topic2,topic5,topic6"
    snowflake.topic2table.map="topic1:low_range,topic2:low_range,topic5:high_range,topic6:high_range"
    ```

    could be written as:

    ```none
    topics.regex="topic[0-9]"
    snowflake.topic2table.map="topic[0-4]:low_range,topic[5-9]:high_range"
    ```

`buffer.count.records`
:   Number of records buffered in memory per Kafka partition before ingesting to Snowflake. The default value is `10000` records.

`buffer.flush.time`
:   Number of seconds between buffer flushes, where the flush is from the Kafka’s memory cache to the internal stage. The default value is `120` seconds.

`buffer.size.bytes`
:   Cumulative size in bytes of records buffered in memory per the Kafka partition before they are ingested in Snowflake as data files. The default value for this is `5000000` (5 MB).

    The records are compressed when they are written to data files. As a result, the size of the records in the buffer may be larger than the size of the data files created from the records.

`value.converter.schema.registry.url`
:   If the format is Avro and you are using a Schema Registry Service, this should be the URL of the Schema Registry Service. Otherwise this field should be empty.

`value.converter.break.on.schema.registry.error`
:   If loading Avro data from the Schema Registry Service, this property determines if the Kafka connector should stop consuming records if it encounters an error while fetching the schema id. The default value is `false`. Set the value to `true` to enable this behavior.

    Supported by Kafka connector version 1.4.2 (and higher).

`jvm.proxy.host`
:   To enable the Snowflake Kafka Connector to access Snowflake through a proxy server, set this parameter to specify the host of that proxy server.

`jvm.proxy.port`
:   To enable the Snowflake Kafka Connector to access Snowflake through a proxy server, set this parameter to specify the port of that proxy server.

`jvm.proxy.username`
:   Username that authenticates with the proxy server.

    Supported by Kafka connector version 1.4.4 (and higher).

`jvm.proxy.password`
:   Password for the username that authenticates with the proxy server.

    Supported by Kafka connector version 1.4.4 (and higher).

`snowflake.jdbc.map`
:   Example: `"snowflake.jdbc.map": "networkTimeout:20,tracing:WARNING"`

    Additional JDBC properties (see [JDBC Driver connection parameter reference](../developer-guide/jdbc/jdbc-parameters.md)) are not validated. These additional properties
    are not validated, and must not override nor be used instead of required properties such as: `jvm.proxy.xxx`,
    `snowflake.user.name`, `snowflake.private.key`, `snowflake.schema.name` etc.

    Specifying either of the following combinations:
    :   * `tracing` property along with `JDBC_TRACE` env variable
        * `database` property along with `snowflake.database.name`

    Will result in an ambiguous behavior and the behavior will be determined by the JDBC Driver.

`value.converter.basic.auth.credentials.source`
:   If you are using the Avro data format and require secure access to the Kafka schema registry, set this parameter to the string “USER_INFO”, and set the `value.converter.basic.auth.user.info` parameter described below. Otherwise, omit this parameter.

`value.converter.basic.auth.user.info`
:   If you are using the Avro data format and require secure access to the Kafka schema registry, set this parameter to the string “<user_ID>:<password>”, and set the value.converter.basic.auth.credentials.source parameter described above. Otherwise, omit this parameter.

`snowflake.metadata.createtime`
:   If value is set to FALSE, the `CreateTime` property value is omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

    Supported by the Kafka connector 1.2.0 (and higher).

`snowflake.metadata.topic`
:   If value is set to FALSE, the `topic` property value is omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

    Supported by the Kafka connector 1.2.0 (and higher).

`snowflake.metadata.offset.and.partition`
:   If value is set to FALSE, the `Offset` and `Partition` property values are omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

    Supported by the Kafka connector 1.2.0 (and higher).

`snowflake.metadata.all`
:   If value is set to FALSE, the metadata in the RECORD_METADATA column is completely empty. The default value is TRUE.

    Supported by the Kafka connector 1.2.0 (and higher).

`transforms`
:   Specify to skip tombstone records encountered by the Kafka connector and not load them into the target table. A tombstone record is
    defined as a record where the entire value field is null.

    Set the property value to `"tombstoneHandlerExample"`.

    > **Note:**
    >
    > Use this property with the Kafka community converters (i.e. `value.converter` property value) only (e.g.
    > `org.apache.kafka.connect.json.JsonConverter` or `org.apache.kafka.connect.json.AvroConverter`). To manage tombstone record
    > handling with the Snowflake converters, use the `behavior.on.null.values` property instead.

`transforms.tombstoneHandlerExample.type`
:   Required when setting the `transforms` property.

    Set the property value to `"io.confluent.connect.transforms.TombstoneHandler"`

`behavior.on.null.values`
:   Specify how the Kafka connector should handle tombstone records. A tombstone record is defined as a record where the entire value field
    is null. For [Snowpipe](data-load-snowpipe-intro.md), this property is supported by the Kafka connector version 1.5.5 and later. For [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md), this property is supported by the Kafka connector version 2.1.0 and later.

    This property supports the following values:

    `DEFAULT`
    :   When the Kafka connector encounters a tombstone record, it inserts an empty JSON string in the content column.

    `IGNORE`
    :   The Kafka connector skips tombstone records and does not insert rows for these records.

    The default value is `DEFAULT`.

    > **Note:**
    >
    > Tombstone records ingestion varies by the ingestion methods:
    >
    > * For Snowpipe, the Kafka connector uses Snowflake converters only. To manage tombstone record handling with the Kafka community converters, use the `transform` and `transforms.tombstoneHandlerExample.type` properties instead.
    > * For Snowpipe Streaming, the Kafka connector uses community converters only.
    >
    > Records sent to Kafka brokers must not be NULL because these records will be dropped by the Kafka connector resulting in missing offsets. The missing offsets will break the Kafka connector in specific use cases. It is recommended that you use tombstone records instead of NULL records.

`snowflake.snowpipe.v2CleanerEnabled`
:   Specifies whether to run the improved version of stage file cleaner for Snowpipe ingestion method. The old cleaner had some limitations, which caused some of the files to be left on stage.

    This property is supported by the Kafka connector version 2.2.2 and later.

    Values:
    :   * `true`
        * `false`

    Default:
    :   `true` for versions 2.3.0 and later, `false` for version 2.2.2

`snowflake.snowpipe.v2CleanerIntervalSeconds`
:   Specifies how often the new file cleaner is run. For cost optimization purposes, Snowflake recommends that you increase the parameter value significantly, for example, to 30 minutes, if a small number of messages are being processed.

    This property is supported by the Kafka connector version 2.2.2 and later.

    Values:
    :   * Minimum: `1`
        * Maximum: No upper limit

    Default:
    :   `61` seconds

`snowflake.streaming.channel.name.include.connector.name`
:   When enabled, Snowflake Streaming channel names are prefixed with the connector name.
    This option enables or disables usage of channel names that were used in Kafka Connector versions 2.1.0 and 2.1.1 and are intended for users that previously used these versions and have not updated the connector.

    Supported by the Kafka connector 3.4.0 (and higher).

    > **Important:**
    >
    > Enabling this option when updating from versions other than 2.1.0 or 2.1.1 may result in data duplication.
    > Cannot be used together with `enable.streaming.channel.offset.migration=true`

    Values:
    :   * `true`
        * `false`

    Default:
    :   `false`

`enable.streaming.channel.offset.migration`
:   This option is used to enable or disable streaming channel offset migration logic.
    When `true`, offset tokens are migrated from V2 channel name format V2 to V1 channel name format.
    The V2 channel name format was used in Kafka Connector versions 2.1.0 and 2.1.1 only and is deprecated.
    V1 format name format is used unless V2 format is enabled using `snowflake.streaming.channel.name.include.connector.name = true`.
    Disabling this option might have side effects.
    Please consult Snowflake support before disabling this option.

    Channel name formats:
    :   * V1 - `[topic]_[partition]`, used in all versions except 2.1.0 and 2.1.1
        * V2 - `[connectorName]_[topic]_[partition]`, used in versions 2.1.0 and 2.1.1. Can be used in 3.4.0 and later — Please see `snowflake.streaming.channel.name.include.connector.name`.

    Values:
    :   * `true`
        * `false`

    Default:
    :   `true` for versions from 2.1.2 until 3.4.0, `false` for version 3.4.0 and later

### Using key pair authentication & key rotation

The Kafka connector relies on key pair authentication rather than basic authentication (i.e. username and password). This authentication method requires a 2048-bit (minimum) RSA key pair.
Generate the public-private key pair using OpenSSL. The public key is assigned to the Snowflake user defined in the configuration file.

After completing the key pair authentication instructions on this page and the instructions for [key pair rotation](key-pair-auth.md), evaluate the recommendation for Externalizing Secrets (in this topic).

To configure the public/private key pair:

1. From the command line in a terminal window, generate a private key.

   You can generate either an encrypted version or unencrypted version of the private key.

   > **Note:**
   >
   > The Kafka connector supports encryption algorithms that are validated to meet the Federal Information Processing Standard (140-2) (i.e. FIPS 140-2) requirements. For more information, see [FIPS 140-2](https://csrc.nist.gov/publications/detail/fips/140/2/final).

   To generate an unencrypted version, use the following command:

   > ```bash
   > $ openssl genrsa -out rsa_key.pem 2048
   > ```

   To generate an encrypted version, use the following command:

   > ```bash
   > $ openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 <algorithm> -inform PEM -out rsa_key.p8
   > ```
   >
   > Where `<algorithm>` is a FIPS 140-2 compliant encryption algorithm.
   >
   > For example, to specify AES 256 as the encryption algorithm:
   >
   > ```bash
   > $ openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 aes256 -inform PEM -out rsa_key.p8
   > ```
   >
   > If you generate an encrypted version of the private key, record the passphrase. Later, you will specify the passphrase in the `snowflake.private.key.passphrase` property in the Kafka configuration file.

   **Sample PEM private key**

   ```bash
   -----BEGIN ENCRYPTED PRIVATE KEY-----
   MIIE6TAbBgkqhkiG9w0BBQMwDgQILYPyCppzOwECAggABIIEyLiGSpeeGSe3xHP1
   wHLjfCYycUPennlX2bd8yX8xOxGSGfvB+99+PmSlex0FmY9ov1J8H1H9Y3lMWXbL
   ...
   -----END ENCRYPTED PRIVATE KEY-----
   ```
2. From the command line, generate the public key by referencing the private key:

   Assuming the private key is encrypted and contained in the file named `rsa_key.p8`, use the following command:

   ```bash
   $ openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
   ```

   **Sample PEM public key**

   ```bash
   -----BEGIN PUBLIC KEY-----
   MIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAy+Fw2qv4Roud3l6tjPH4
   zxybHjmZ5rhtCz9jppCV8UTWvEXxa88IGRIHbJ/PwKW/mR8LXdfI7l/9vCMXX4mk
   ...
   -----END PUBLIC KEY-----
   ```
3. Copy the public and private key files to a local directory for storage. Record the path to the files. Note that the private key is stored using the PKCS#8 (Public Key Cryptography Standards) format
   and is encrypted using the passphrase you specified in the previous step; however, the file should still be protected from unauthorized access using the file permission mechanism provided by your
   operating system. It is your responsibility to secure the file when it is not being used.
4. Log into Snowflake. Assign the public key to the Snowflake user using [ALTER USER](../sql-reference/sql/alter-user.md). For example:

   > ```sqlexample
   > ALTER USER jsmith SET RSA_PUBLIC_KEY='MIIBIjANBgkqh...';
   > ```

   > **Note:**
   > * Only security administrators (i.e. users with the SECURITYADMIN role) or higher can alter a user.
   > * Exclude the public key header and footer in the SQL statement.

   Verify the user’s public key fingerprint using [DESCRIBE USER](../sql-reference/sql/desc-user.md):

   ```sqlexample
   DESC USER jsmith;
   +-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------+
   | property                      | value                                               | default | description                                                                   |
   |-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------|
   | NAME                          | JSMITH                                              | null    | Name                                                                          |
   ...
   ...
   | RSA_PUBLIC_KEY_FP             | SHA256:nvnONUsfiuycCLMXIEWG4eTp4FjhVUZQUQbNpbSHXiA= | null    | Fingerprint of user's RSA public key.                                         |
   | RSA_PUBLIC_KEY_2_FP           | null                                                | null    | Fingerprint of user's second RSA public key.                                  |
   +-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------+
   ```

   > **Note:**
   >
   > The `RSA_PUBLIC_KEY_2_FP` property is described in [Configuring key-pair rotation](key-pair-auth.md).
5. Copy and paste the entire private key into the `snowflake.private.key` field in the configuration file. Save the file.

### Using external OAuth

The connector supports External OAuth, described in the section [External OAuth overview](oauth-ext-overview.md), but it works with [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) only.

To configure the connector, use these configuration properties. All of them are required.

`snowflake.authenticator`
:   Indicate you intend to use `oauth`. Use `oauth` value.

`snowflake.oauth.client.id`
:   The client ID associated with the OAuth app.

`snowflake.oauth.client.secret`
:   The client secret associated with the OAuth app.

`snowflake.oauth.refresh.token`
:   The refresh token used to exchange for an access token.

`snowflake.oauth.token.endpoint`
:   The endpoint used for exchanging the refresh token for the access token. You need to specify your OAuth server’s authorization endpoint.

#### Externalizing secrets

Snowflake strongly recommends externalizing secrets such as the private key and storing them in an encrypted form or in a key management service such as AWS Key Management Service (KMS), Microsoft Azure Key Vault,
or HashiCorp Vault. This can be accomplished by using a `ConfigProvider` implementation on your Kafka Connect cluster.

For more information, see the Confluent description of this [service](https://docs.confluent.io/current/connect/security.html#externalizing-secrets).

## Starting Kafka

Start Kafka using the instructions provided in the third-party Confluent or Apache Kafka documentation.

## Starting the Kafka connector

You can start the Kafka connector in either distributed mode or standalone mode. Instructions for each are shown below:

### Distributed mode

In a terminal window, execute the following command:

```none
curl -X POST -H "Content-Type: application/json" --data @<path>/<config_file>.json http://localhost:8083/connectors
```

### Standalone mode

In a terminal window, execute the following command:

```none
<kafka_dir>/bin/connect-standalone.sh <kafka_dir>/<path>/connect-standalone.properties <kafka_dir>/config/SF_connect.properties
```

(A default installation of Apache Kafka or Confluent Kafka should already have the file `connect-standalone.properties`.)

## Testing and using the Kafka connector

We recommend testing the Kafka connector with a small amount of data before using the connector in a production system. The process for testing is the same
as the process for using the connector normally:

1. Verify that Kafka and Kafka Connect are running.
2. Verify that you have created the appropriate Kafka topic.
3. Create (or use an existing) message publisher. Make sure that the messages published to the topic have the right format (JSON, Avro, or plain text).
4. Create a configuration file that specifies the topic to subscribe to and the Snowflake table to write to. For instructions, see Configuring the Kafka Connector (in this topic).
5. (Optional) Create a table into which to write data. This step is optional; if you do not create the table, the Kafka connector creates the table for you. If you do not plan to
   use the connector to add data to an existing, non-empty table, then we recommend that you let the connector create the table for you to minimize the possibility of a
   schema mismatch.
6. Grant the minimum privileges required on the Snowflake objects (database, schema, target table, etc.) to the role that will be used to ingest data.
7. Publish a sample set of data to the configured Kafka topic.
8. Wait a few minutes for data to propagate through the system, and then check the Snowflake table to verify that the records were inserted.

> **Tip:**
>
> Consider verifying your network connection to Snowflake using [SnowCD](snowcd.md) before loading data to Snowflake in your test and production environments.

---
title: Installing and Configuring the Spark Connector
source: https://docs.snowflake.com/en/user-guide/spark-connector-install.md
section: User Guide
---

# Installing and Configuring the Spark Connector

Multiple versions of the connector are supported; however, Snowflake strongly recommends using the most recent version of the connector. To view release information
about the latest version, see the *Spark Connector Release Notes* (link in the sidebar).

The instructions in this topic can be used to install and configure all supported versions of the connector.

## Supported Versions

Snowflake supports multiple versions of the connector:

|  |  |
| --- | --- |
| **Snowflake Spark Connector versions:** | **3.x**, **2.x** |
| **Supported Spark versions:** | **Connector version 3.x**: Spark 3.5, 3.4, 3.3, 3.2 . **Connector version 2.x**: Spark 3.4, 3.3, 3.2 |
| **Supported Scala versions:** | Scala 2.13 . Scala 2.12 |
| **Data source name:** | **Connector version 3.x and 2.x**: `net.snowflake.spark.snowflake` — v2.10.0 (or higher) of the connector allows `snowflake` as the data source name |
| **Package name (for imported classes):** | `net.snowflake.spark.snowflake` |
| **Package distribution:** | [Scala 2.13](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.13) in the Maven Central Repository . [Scala 2.12](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.12) in the Maven Central Repository |
| **Source code:** | [spark-snowflake (GitHub)](https://github.com/snowflakedb/spark-snowflake) . `master` (for the latest version), . `previous_spark_version` (for earlier versions) |

The developer notes for the different versions are hosted with the source code.

> **Note:**
>
> * **3.x**: A single version of the Snowflake Spark Connector version 3.0.0 and higher supports multiple versions of Spark.
> * **2.x**: The Snowflake Spark Connector for 2.x generally supports the three most recent versions of Spark. Download a version of the connector that is specific to your Spark version. For example, to use version 2.16.0 of the connector with Spark version 3.4, download the `2.16.0-spark_3.4` version of the connector.
> * To [enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md), you must install 3.1.6 or a later version.

## Requirements

To install and use Snowflake with Spark, you need the following:

* A supported operating system. For a list of supported operating systems, see
  [Operating system support](../release-notes/requirements.md).
* Snowflake Connector for Spark.
* Snowflake JDBC Driver (the version compatible with the version of the connector).
* Apache Spark environment, either self-hosted or hosted in any of the following:

  > + [Qubole Data Service](https://www.qubole.com/products/qubole-data-service/apache-spark-service/).
  > + [Databricks](http://www.databricks.com).
  > + [Amazon EMR](http://aws.amazon.com/elasticmapreduce).
* In addition, you can use a dedicated Amazon S3 bucket or Azure Blob storage container as a staging zone between the two systems; however, this is not required with version 2.2.0
  (and higher) of the connector, which uses a temporary Snowflake internal stage (by default) for all data exchange.
* The role used in the connection needs USAGE and CREATE STAGE privileges on the schema that contains the table that you will read from or write to.

> **Note:**
>
> If you are using Databricks or Qubole to host Spark, you do not need to download or install the Snowflake Connector for Spark (or any of the other requirements). Both Databricks
> and Qubole have integrated the connector to provide native connectivity.
>
> For more details, see:
>
> * [Configuring Snowflake for Spark in Databricks](spark-connector-databricks.md)
> * [Configuring Snowflake for Spark in Qubole](spark-connector-qubole.md)

## Verifying the OCSP Connector or Driver Version

Snowflake uses OCSP to evaluate the certificate chain when making a connection to Snowflake. The driver or connector version and its configuration both determine the OCSP behavior. For more information about the driver or connector version, their configuration, and OCSP behavior, see [OCSP Configuration](ocsp.md).

## Downloading and Installing the Connector

The instructions in this section pertain to version 2.x and higher of the Snowflake Connector for Spark.

> **Important:**
>
> Snowflake periodically releases new versions of the connector. The following installation tasks must be performed each time you install a new version. This also applies to the
> Snowflake JDBC driver, which is a prerequisite for the Spark connector.

### Step 1: Download the latest version of the Snowflake Connector for Spark

Snowflake provides multiple versions of the connector. Download the appropriate version, based on the following:

* The version of the Snowflake Connector for Spark that you want to use.
* The version of Spark that you are using.
* The version of Scala that you are using.

You can download the Snowflake Spark Connector from Maven. If you want to build the driver, you can access the source code from
GitHub.

#### Maven Central Repository

Snowflake provides separate package artifacts for each supported Scala version (2.12 and 2.13). For each of these Scala versions,
Snowflake provides different versions of the Spark connector as well as separate artifacts that support different versions of
Spark.

To download the Spark connector:

1. Search the Maven repository for your desired version of the Snowflake Spark Connector:

   * [Scala 2.13](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.13)
   * [Scala 2.12](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.12)

   The following screenshot provides an example of the search results page:
2. The Latest version label shows the most current version of the driver. If you want to download a prior version, click the
   View all link beside the latest version to see all available packages. The following screenshot shows an example of all available packages for spark-snowflake_2.12.

   The individual packages for Snowflake Spark Connector version 2.x use the following naming convention:

   ```none
   net.snowflake:spark-snowflake_C.C:N.N.N-spark_P.P
   ```

   where:

   * `C.C` is the Scala version (e.g. 2.12).
   * `N.N.N` is the Snowflake version (e.g. 2.16.0).
   * `P.P` is the Spark version (e.g. 3.4).

   For example:

   ```none
   net.snowflake:spark-snowflake_2.12:2.16.0-spark_3.4
   ```

   The individual packages for Snowflake Spark Connector version 3.x use the following naming convention:

   ```none
   net.snowflake:spark-snowflake_C.C:N.N.N
   ```

   where:

   * `C.C` is the Scala version (e.g. 2.13).
   * `N.N.N` is the Snowflake version (e.g. 3.1.1).

   For example:

   > ```none
   > net.snowflake:spark-snowflake_2.13:3.1.1
   > ```
3. Click the Browse link beside the version you want to download, then select and download the JAR file.
4. If you plan to verify the package signature, you need to download the signature
   file as well. Click the filename with the `.jar.asc` filename extension (for example, net.snowflake:spark-snowflake_2.13:3.1.1.jar.asc or net.snowflake:spark-snowflake_2.12:2.16.0-spark_3.4.jar.asc).

#### GitHub

The source code for the Spark Snowflake Connector is available on GitHub. However, the compiled packages are not available on GitHub.
You can download the compiled packages from Maven.

### Step 2: Download the Compatible Version of the Snowflake JDBC Driver

Next, you need to download the version of the Snowflake JDBC driver that is compatible with the version of the
Snowflake Spark Connector that you are using.

The Snowflake JDBC driver is provided as a standard Java package through the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc). You
can either download the package as a `.jar` file or you can directly reference the package. These instructions assume you are referencing the package.

To find the supported version of the Snowflake JDBC Driver for the version of the Snowflake Spark Connector that you are using,
see the [Snowflake Connector for Spark release notes](../release-notes/clients-drivers/spark-connector.md).

For more details on downloading and installing the Snowflake JDBC Driver, see [Downloading / integrating the JDBC Driver](../developer-guide/jdbc/jdbc-download.md).

### Step 3 (Optional): Verify the Snowflake Connector for Spark Package Signature

To verify the Snowflake Connector for Spark package signature:

1. From the public keyserver, download and import the Snowflake GPG public key for the version of the Snowflake Connector for
   Spark that you are using:

   * For version 3.1.2 and higher:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
     ```
   * For version 3.1.0 through 3.1.1:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 5A125630709DD64B
     ```
   * For version 2.11.1 through 3.0.0:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
     ```
   * For version 2.8.2 through 2.11.0:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 37C7086698CB005C
     ```
   * For version 2.4.13 through 2.8.1:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys EC218558EABB25A1
     ```
   * For version 2.4.12 and lower:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 93DB296A69BE019A
     ```
   > **Note:**
   >
   > If this command fails with the following error:
   >
   > > ```none
   > > gpg: keyserver receive failed: Server indicated a failure
   > > ```
   >
   > then specify that you want to use port 80 for the keyserver:
   >
   > > ```bash
   > > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
   > > ```
2. Run the `gpg --verify` command to verify the signature of the package.

   For the `--verify` command-line flag, specify the `.asc` file that you
   downloaded earlier as the signature file and the JAR file as the file containing
   the signed data.

   ```bash
   $ gpg --verify spark-snowflake_x.xx-N.N.N-spark_P.P.jar.asc spark-snowflake_x.xx-N.N.N-spark_P.P.jar
   gpg: Signature made Wed 22 Feb 2017 04:31:58 PM UTC using RSA key ID <gpg_key_id>
   gpg: Good signature from "Snowflake Computing <snowflake_gpg\ @snowflake.net>"
   ```

   where:

   * `x.xx` is the Scala version (e.g. 2.12).
   * `N.N.N` is the version of the Snowflake Connector for Spark (e.g. 2.16.0).
   * `P.P` is the Spark version (e.g. 3.4).
   > **Note:**
   >
   > Verifying the signature produces a warning similar to the following:
   >
   > > ```none
   > > gpg: Signature made Mon 24 Sep 2018 03:03:45 AM UTC using RSA key ID <gpg_key_id>
   > > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>" unknown
   > > gpg: WARNING: This key is not certified with a trusted signature!
   > > gpg: There is no indication that the signature belongs to the owner.
   > > ```
   >
   > To avoid the warning, you can grant the Snowflake GPG public key implicit trust.
3. Your local environment can contain multiple GPG keys; however, for security reasons, Snowflake periodically rotates the public
   GPG key. As a best practice, we recommend deleting the existing public key after confirming that the latest key works with the
   latest signed package. For example:

   ```bash
   $ gpg --delete-key "Snowflake Computing"
   ```

### Step 4: Configure the Local Spark Cluster or Amazon EMR-hosted Spark Environment

If you have a local Spark installation, or a [Spark installation in Amazon EMR](http://docs.aws.amazon.com/ElasticMapReduce/latest/DeveloperGuide/emr-spark-shell.html), you
need to configure the `spark-shell` program to include both the Snowflake JDBC driver and the Spark Connector:

* To include the Snowflake JDBC driver, use the `--package` option to reference the JDBC package from the
  [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc), providing
  the exact version of the driver you wish to use
  (e.g. `net.snowflake:snowflake-jdbc:3.13.30`).
* To include the Spark Connector, use the `--package` option to reference the appropriate package
  ([Scala 2.12](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.12) or [Scala 2.13](https://central.sonatype.com/search?q=g%3Anet.snowflake%20a%3Aspark-snowflake_2.13)) hosted in the Maven Central Repository, providing the exact version of the driver
  you want to use (e.g. `net.snowflake:spark-snowflake_2.12:2.16.0-spark_3.4`).

For example:

> ```bash
> spark-shell --packages net.snowflake:snowflake-jdbc:3.13.22,net.snowflake:spark-snowflake_2.12:2.16.0-spark_3.4
> ```

## Installing Additional Packages (If Needed)

Depending on your Spark installation, some packages required by the connector may be missing. You can add missing packages to your installation by using the appropriate
flag for `spark-shell`:

* `--packages`
* `--jars` (if the packages were downloaded as `.jar` files)

The required packages are listed below, with the syntax (including version number) for using the `--packages` flag to reference the packages:

* `org.apache.hadoop:hadoop-aws:2.7.1`
* `org.apache.httpcomponents:httpclient:4.3.6`
* `org.apache.httpcomponents:httpcore:4.3.3`
* `com.amazonaws:aws-java-sdk-core:1.10.27`
* `com.amazonaws:aws-java-sdk-s3:1.10.27`
* `com.amazonaws:aws-java-sdk-sts:1.10.27`

For example, if the Apache packages are missing, to add the packages by reference:

> ```bash
> spark-shell --packages org.apache.hadoop:hadoop-aws:2.7.1,org.apache.httpcomponents:httpclient:4.3.6,org.apache.httpcomponents:httpcore:4.3.3
> ```

## Preparing an External Location For Files

You might need to prepare an external location for files that you want to transfer between Snowflake and Spark.

This task is required if either of the following situations is true:

* You will run jobs that take longer than 36 hours, which is the maximum
  duration for the token used by the connector to access the internal stage
  for data exchange.
* The Snowflake Connector for Spark version is 2.1.x or lower (even if your
  jobs require less than 36 hours).

  > **Note:**
  >
  > If you are not currently using v2.2.0 (or higher) of the connector,
  > Snowflake strongly recommends upgrading to the latest version.

### Preparing an AWS External S3 Bucket

Prepare an external S3 bucket that the connector can use to exchange data between Snowflake and Spark. You then provide the location information, together with the
necessary AWS credentials for the location, to the connector. For more details, see [Authenticating S3 for Data Exchange](spark-connector-use.md) in the next topic.

> **Important:**
>
> If you use an external S3 bucket, the connector does not automatically remove any intermediate/temporary data from this location. As a result, it is best to use a
> specific bucket or path (prefix) and set a lifecycle policy on the bucket/path to clean up older files automatically. For more details on configuring a lifecycle policy,
> see the [Amazon S3 documentation](http://docs.aws.amazon.com/AmazonS3/latest/dev/object-lifecycle-mgmt.html).

### Preparing an Azure Blob Storage Container

Prepare an external Azure Blob storage container that the connector can use to exchange data between Snowflake and Spark. You then provide the location information, together
with the necessary Azure credentials for the location, to the connector. For more details, see [Authenticating Azure for Data Exchange](spark-connector-use.md) in the next topic.

---
title: Installing SnowSQL
source: https://docs.snowflake.com/en/user-guide/snowsql-install-config.md
section: User Guide
---

# Installing SnowSQL

This topic describes how to download and install SnowSQL on all supported platforms.

To download the SnowSQL installer, go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page.

> **Note:**
>
> The SnowSQL 1.3.0 release disabled automatic upgrades, so you must manually download and reinstall for each new version.

## Installing SnowSQL on Linux using the installer

This section describes how to download, verify, and run the installer package to install SnowSQL on Linux.

To upgrade SnowSQL manually (such as if your software installation policy prohibits upgrading automatically), use the
RPM package to install SnowSQL. The RPM package does not set up SnowSQL to upgrade automatically. For instructions, see
Installing SnowSQL on Linux using the RPM package (in this topic).

### Setting the download directory and configuration file location

By default, the SnowSQL installer downloads the binaries to the following directory:

`~/.snowsql`

Consequently, the [configuration file](snowsql-config.md) is located under the download directory:

`~/.snowsql/config`

To change both the download directory and location of the configuration file, set the `WORKSPACE` environment variable to
any user-writable directory. This approach is particularly useful if you have an isolated SnowSQL environment for each process.

In addition, you can separate the download directory from the configuration file by setting the `SNOWSQL_DOWNLOAD_DIR` environment variable so that
multiple SnowSQL processes can share the binaries. For example:

> ```bash
> $ SNOWSQL_DOWNLOAD_DIR=/var/shared snowsql -h
> ```

Note that `SNOWSQL_DOWNLOAD_DIR` is supported starting with the SnowSQL 1.1.70 bootstrap version. To check the version you are using, execute the
following command from the terminal window prompt:

> ```bash
> $ snowsql --bootstrap-version
> ```

### Downloading the SnowSQL installer

Go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page, find the version of the SnowSQL that you want to install, and download the files with the
following filename extensions:

* `.bash` (the installer script)
* `.bash.sig` (the signature that you can use to verify the downloaded package)

### Using curl to download the SnowSQL installer

If you want to download the installer from a script or a terminal window (such as using [curl](https://curl.se/), rather than your web browser),
you can download the installers directly from the [Snowflake Client Repository](snowflake-client-repository.md). For increased flexibility, Snowflake
provides both Amazon Web Services (AWS) and Azure endpoints for the repository. Accounts hosted on any supported cloud platform
can download the installer from either endpoint.

Run `curl` (or an equivalent command-line tool) to download the installer. The `curl` syntax is as follows:

AWS endpoint:
:   ```bash
    $ curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/linux_x86_64/snowsql-<version>-linux_x86_64.bash
    ```

Microsoft Azure endpoint:
:   ```bash
    $ curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/linux_x86_64/snowsql-<version>-linux_x86_64.bash
    ```

Where:

* `<version>` is the combined SnowSQL major, minor, and patch versions. For example, for version 1.5.0, the major version is 1, the
  minor version is 5, and the patch version is 0. So, the version is 1.5.0.
* `<bootstrap_version>` is the combined SnowSQL major and minor versions. For example, for version 1.5.0, the major version is
  1 and the minor version is 5, so the bootstrap version is 1.5.

For example, to download the SnowSQL installer where `<bootstrap_version>` is 1.5 and `<version>` is 1.5.0:

AWS endpoint:
:   ```
    $ curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/1.5/linux_x86_64/snowsql-1.5.0-linux_x86_64.bash
    ```

Microsoft Azure endpoint:
:   ```
    $ curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/1.5/linux_x86_64/snowsql-1.5.0-linux_x86_64.bash
    ```

For more information about SnowSQL versions, see Understanding SnowSQL Versioning (in this topic).

### Verifying the package signature

To verify the signature for the downloaded package:

1. Download and import the latest Snowflake GPG public key from the public keyserver by entering the following command, using the GPG key associated with the SnowSQL version:

   > * For SnowSQL 1.3.3 and higher:
   >
   >   ```
   >   $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
   >   ```
   > * For SnowSQL 1.2.24 through 1.3.2:
   >
   >   ```
   >   $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
   >   ```
   > > **Note:**
   > >
   > > If this command fails with the following error:
   > >
   > > > ```none
   > > > gpg: keyserver receive failed: Server indicated a failure
   > > > ```
   > >
   > > then specify that you want to use port 80 for the keyserver:
   > >
   > > > ```bash
   > > > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
   > > > ```
2. Download the GPG signature and verify the signature:

   ```
   # If you prefer to use curl to download the signature file, run this command:
   curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/1.5/linux_x86_64/snowsql-1.5.0-linux_x86_64.bash.sig

   # Verify the package signature.
   gpg --verify snowsql-1.5.0-linux_x86_64.bash.sig snowsql-1.5.0-linux_x86_64.bash
   ```

   or, if you are downloading the signature file from the Azure endpoint:

   ```
   # If you prefer to use curl to download the signature file, run this command:
   curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/1.5/linux_x86_64/snowsql-1.5.0-linux_x86_64.bash.sig

   # Verify the package signature.
   gpg --verify snowsql-1.5.0-linux_x86_64.bash.sig snowsql-1.5.0-linux_x86_64.bash
   ```

   > **Note:**
   >
   > Verifying the signature produces a warning similar to the following:
   >
   > > ```none
   > > gpg: Signature made Mon 24 Sep 2018 03:03:45 AM UTC using RSA key ID <gpg_key_id>
   > > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>" unknown
   > > gpg: WARNING: This key is not certified with a trusted signature!
   > > gpg: There is no indication that the signature belongs to the owner.
   > > ```
   >
   > To avoid the warning, you can grant the Snowflake GPG public key implicit trust.
3. Your local environment can contain multiple GPG keys; however, for security reasons, Snowflake periodically rotates the public GPG key.
   As a best practice, we recommend deleting the existing public key after confirming that the latest key works with the latest signed
   package. For example:

   ```bash
   gpg --delete-key "Snowflake Computing"
   ```

### Installing SnowSQL using the installer

1. Open a terminal window.
2. Run the Bash script installer from the download location:

   > ```bash
   > bash snowsql-linux_x86_64.bash
   > ```
3. Follow the instructions provided by the installer.

> **Note:**
>
> The installation can be automated by setting the following environment variables:
>
> * `SNOWSQL_DEST`: Target directory of the `snowsql` executable.
> * `SNOWSQL_LOGIN_SHELL`: The login shell initialization file, which includes the `PATH` environment update.
>
> ```bash
> SNOWSQL_DEST=~/bin SNOWSQL_LOGIN_SHELL=~/.profile bash snowsql-linux_x86_64.bash
> ```

When you install a new major or minor version, SnowSQL does not upgrade itself immediately. Rather, you must log into your [Snowflake account](gen-conn-config.md) using SnowSQL and remain connected for a sufficient period of time for the auto-upgrade feature to upgrade the client to the latest release. To verify the SnowSQL version that currently starts when you run the client, use the `-v` option without a value:

> ```bash
> snowsql -v
> ```
>
> ```output
> Version: 1.3.1
> ```

To force SnowSQL to install and use a specific version, use the `-v` option and specify the version you want to install. For example, execute the following command for version 1.3.0:

> ```bash
> snowsql -v 1.3.0
> ```

## Installing SnowSQL on Linux using the RPM package

To upgrade software manually, you can use the RPM package (rather than the
installer) to install SnowSQL. The RPM package does not support automatic upgrades.

### Downloading the SnowSQL RPM package

Go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page, find the version of the SnowSQL that you want to install, and download the file with the
`.rpm` filename extension.

### Installing the SnowSQL RPM package

The downloaded RPM file can be installed the way that any other RPM package is installed:

```bash
rpm -i <package_name>
```

## Installing SnowSQL on macOS using the installer

This section describes how to download and run the installer package to install SnowSQL on macOS.

### Setting the download directory and configuration file location

By default, the SnowSQL installer downloads the binaries to the following directory:

`~/.snowsql`

Consequently, the [configuration file](snowsql-config.md) is located under the download directory:

`~/.snowsql/config`

You can change both the download directory and location of the configuration file by setting the `WORKSPACE` environment variable to any user-writable
directory. This is particularly useful if you have an isolated SnowSQL environment for each process.

In addition, you can separate the download directory from the configuration file by setting the `SNOWSQL_DOWNLOAD_DIR` environment variable so that
multiple SnowSQL processes can share the binaries. For example:

> ```bash
> SNOWSQL_DOWNLOAD_DIR=/var/shared snowsql -h
> ```

Note that `SNOWSQL_DOWNLOAD_DIR` is supported starting with the SnowSQL 1.1.70 bootstrap version. To check the version you are using, execute the
following command from the terminal window prompt:

> ```bash
> snowsql --bootstrap-version
> ```

### Downloading the SnowSQL installer

To download the SnowSQL installer, go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page. This version of the SnowSQL installer enables auto-upgrade
for patches.

### Using curl to download the SnowSQL installer

If you want to download the installer from a script or a terminal window (such as using [curl](https://curl.se/), rather than your web browser),
you can download the installers directly from the [Snowflake Client Repository](snowflake-client-repository.md). For increased flexibility, Snowflake
provides both Amazon Web Services (AWS) and Azure endpoints for the repository. Accounts hosted on any supported cloud platform
can download the installer from either endpoint.

Run `curl` (or an equivalent command-line tool) to download the installer. The `curl` syntax is as follows:

AWS endpoint:
:   ```bash
    curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/darwin_x86_64/snowsql-<version>-darwin_x86_64.pkg
    ```

Microsoft Azure endpoint:
:   ```bash
    curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/darwin_x86_64/snowsql-<version>-darwin_x86_64.pkg
    ```

where:

* `<version>` is the combined SnowSQL major, minor, and patch versions. For example, for version 1.5.0, the major version is 1, the
  minor version is 5, and the patch version is 0. So, the version is 1.5.0.
* `<bootstrap_version>` is the combined SnowSQL major and minor versions. For example, for version 1.5.0, the major version is
  1 and the minor version is 5, so the bootstrap version is 1.5.

For example, to download the SnowSQL installer where `<bootstrap_version>` is 1.5 and `<version>` is 1.5.0:

AWS endpoint:
:   ```
    curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/1.5/darwin_x86_64/snowsql-1.5.0-darwin_x86_64.pkg
    ```

Microsoft Azure endpoint:
:   ```
    curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/1.5/darwin_x86_64/snowsql-1.5.0-darwin_x86_64.pkg
    ```

For more information about SnowSQL versions, see Understanding SnowSQL Versioning (in this topic).

The macOS operating system can verify the installer signature automatically, so GPG signature verification is not needed.

### Installing SnowSQL using the installer

1. Open `snowsql-darwin_x86_64.pkg` in the download location to run the installer PKG file.
2. Follow the instructions provided by the installer.

> **Note:**
>
> The installation can be automated by running the installer from the command line. The target directory can be set to either
> `CurrentUserHomeDirectory` (`~/Applications` directory) or `LocalSystem` (`/Applications` directory):
>
> ```bash
> installer -pkg snowsql-darwin_x86_64.pkg -target CurrentUserHomeDirectory
> ```

When you install a new major or minor version, SnowSQL does not upgrade itself immediately. Rather, you must log into your Snowflake account using SnowSQL and remain connected for a sufficient period of time for the auto-upgrade feature to upgrade the client to the latest release. To verify the SnowSQL version that currently starts when you run the client, use the `-v` option without a value:

> ```bash
> snowsql -v
> ```
>
> ```output
> Version: 1.3.0
> ```

To force SnowSQL to install and use a specific version, use the `-v` option and specify the version you want to install. For example, execute the following command for version 1.3.1:

> ```bash
> snowsql -v 1.3.1
> ```

#### Configuring the Z shell alias (macOS only)

If Z shell (also known as zsh) is your default terminal shell, set an alias to the SnowSQL executable so that you can run SnowSQL on the command line in Terminal. The SnowSQL installer installs the executable in `/Applications/SnowSQL.app/Contents/MacOS/snowsql` and appends this path to the PATH or alias entry in `~/.profile`. Because zsh does not normally read this file, add an alias to this path in `~/.zshrc`, which zsh does read.

To add an alias to the SnowSQL executable:

1. Open (or create, if missing) the `~/.zshrc` file.
2. Add the following line:

   ```bash
   alias snowsql=/Applications/SnowSQL.app/Contents/MacOS/snowsql
   ```
3. Save the file.

## Installing SnowSQL on macOS using homebrew cask

[Homebrew Cask](https://caskroom.github.io/) is a popular extension of [Homebrew](https://brew.sh/) used for package distribution, installation, and
maintenance. There is no separate SnowSQL installer to download. If Homebrew Cask is installed on your macOS platform, you can install Snowflake directly.

Run the `brew install` command, specifying `snowflake-snowsql` as the cask to install:

```bash
brew install --cask snowflake-snowsql
```

### Configuring the Z shell alias (macOS only)

If Z shell (also known as zsh) is your default terminal shell, set an alias to the SnowSQL executable so that you can run SnowSQL on the command line in Terminal. The SnowSQL installer installs the executable in `/Applications/SnowSQL.app/Contents/MacOS/snowsql` and appends this path to the PATH or alias entry in `~/.profile`. Because zsh does not normally read this file, add an alias to this path in `~/.zshrc`, which zsh does read.

To add an alias to the SnowSQL executable:

1. Open (or create, if missing) the `~/.zshrc` file.
2. Add the following line:

   ```bash
   alias snowsql=/Applications/SnowSQL.app/Contents/MacOS/snowsql
   ```
3. Save the file.

## Installing SnowSQL on Microsoft Windows using the installer

This section describes how to download and run the installer package to install SnowSQL on Microsoft Windows.

### Setting the download directory and configuration file location

By default, the SnowSQL installer downloads the binaries to the following directory:

`%USERPROFILE%\.snowsql`

Consequently, the [configuration file](snowsql-config.md) is located under the download directory:

`%USERPROFILE%\.snowsql\config`

You can change both the download directory and location of the configuration file by setting the `WORKSPACE` environment variable to any user-writable
directory. This is particularly useful if you have an isolated SnowSQL environment for each process.

In addition, you can separate the download directory from the configuration file by setting the `SNOWSQL_DOWNLOAD_DIR` environment variable so that
multiple SnowSQL processes can share the binaries. For example:

> ```bash
> SNOWSQL_DOWNLOAD_DIR=/var/shared snowsql -h
> ```

Note that `SNOWSQL_DOWNLOAD_DIR` is supported starting with the SnowSQL 1.1.70 bootstrap version. To check the version you are using, execute the
following command from the terminal window prompt:

> ```bash
> snowsql --bootstrap-version
> ```

### Downloading the SnowSQL installer

To download the SnowSQL installer, go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page. This version of the SnowSQL installer enables auto-upgrade
for patches.

### Using curl to download the SnowSQL installer

If you want to download the installer from a script or a terminal window (such as using [curl](https://curl.se/), rather than your web browser),
you can download the installers directly from the [Snowflake Client Repository](snowflake-client-repository.md). For increased flexibility, Snowflake
provides both Amazon Web Services (AWS) and Azure endpoints for the repository. Accounts hosted on any supported cloud platform
can download the installer from either endpoint.

Run `curl` (or an equivalent command-line tool) to download the installer. The `curl` syntax is as follows:

AWS endpoint:
:   ```bash
    curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/windows_x86_64/snowsql-<version>-windows_x86_64.msi
    ```

Microsoft Azure endpoint:
:   ```bash
    curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/<bootstrap_version>/windows_x86_64/snowsql-<version>-windows_x86_64.msi
    ```

Where:

* `<version>` is the combined SnowSQL major, minor, and patch versions. For example, for version 1.3.1, the major version is 1, the
  minor version is 3, and the patch version is 1. So, the version is 1.3.1.
* `<bootstrap_version>` is the combined SnowSQL major and minor versions. For example, for version 1.3.1, the major version is
  1 and the minor version is 3, so the bootstrap version is 1.3.

For example, to download the SnowSQL installer where `<bootstrap_version>` is 1.5 and `<version>` is 1.5.0:

AWS endpoint:
:   ```
    curl -O https://sfc-repo.snowflakecomputing.com/snowsql/bootstrap/1.5/windows_x86_64/snowsql-1.5.0-windows_x86_64.msi
    ```

Microsoft Azure endpoint:
:   ```
    curl -O https://sfc-repo.azure.snowflakecomputing.com/snowsql/bootstrap/1.5/windows_x86_64/snowsql-1.5.0-windows_x86_64.msi
    ```

For more information about SnowSQL versions, see Understanding SnowSQL Versioning (in this topic).

The Windows operating system can verify the installer signature automatically, so GPG signature verification is not needed.

### Installing SnowSQL using the installer

1. Open `snowsql-windows_x86_64.msi` in the download location to run the installer MSI file.
2. Follow the instructions provided by the installer.

> **Note:**
>
> The installation can be automated by running the MSI installer `msiexec` from the command line. The target directory cannot be changed from
> `%ProgramFiles%Snowflake SnowSQL`. For example:
>
> ```bat
> C:\Users\<username> msiexec /i snowsql-windows_x86_64.msi /q
> ```

When you install a new major or minor version, SnowSQL does not upgrade itself immediately. Rather, you must log into your Snowflake account using SnowSQL and remain connected for a sufficient period of time for the auto-upgrade feature to upgrade the client to the latest release. To verify the SnowSQL version that currently starts when you run the client, use the `-v` option without a value:

> ```bash
> snowsql -v
> ```
>
> ```output
> Version: 1.3.1
> ```

To force SnowSQL to install and use a specific version, use the `-v` option and specify the version you want to install. For example, execute the following command for version 1.3.0:

> ```bash
> snowsql -v 1.3.0
> ```

## Understanding SnowSQL versioning

SnowSQL version numbers consist of three digits: `<major version>.<minor version>.<patch version>`.

For example, version 1.3.1 indicates the major version is 1, the minor version is 3, the patch version is 1.

To determine the SnowSQL version that currently starts when you run the client, use the `-v` option without a value:

> ```bash
> snowsql -v
> ```
>
> ```output
> Version: 1.3.1
> ```

In general, the following guidelines apply to the different version types:

Major version:
:   A change in the major version indicates dramatic improvements in the underlying Snowflake service. A new major version breaks backward
    compatibility. You will need to download and install the latest SnowSQL version from the web interface.

Minor version:
:   A change in the minor version indicates improvements to support forward compatibility in either SnowSQL or the underlying Snowflake
    service. A new minor version does not break backward compatibility, but Snowflake strongly recommends that you download and install the latest SnowSQL version
    from the web interface.

Patch version:
:   A change in the patch version indicates small enhancements or bug fixes were applied.

    The auto-upgrade feature automatically installs
    all patch versions. For more information about the auto-upgrade feature, see What is Auto-upgrade? (in this topic).

    > **Note:**
    >
    > If Snowflake releases a new minor or patch version, the functionality in your current version should continue to work, but any newly-released bug fixes and features will
    > not be available via the auto-upgrade feature. Therefore, we strongly recommended that you download and install the latest SnowSQL version
    > when a new version is available.

### What is auto-upgrade?

> **Important:**
>
> Starting with version 1.3.0, SnowSQL disables automatic upgrades by default to avoid potential issues that can affect production environments when an automatic upgrade occurs. To upgrade, you should download and install new versions manually, preferably in a non-production environment. Snowflake strongly recommends you leave this setting disabled, but if want to install new versions automatically when they are released, you can disable the SnowSQL `--noup` option.

If you choose to enable automatic upgrades for SnowSQL, SnowSQL automatically downloads the new binary in a background process and executes the current version. The next time you
run SnowSQL, the new version starts.

To illustrate the process:

1. For a fresh installation, you download the SnowSQL installer (such as version 1.3.0) using the Snowflake web interface and install the client.
2. Each time you run SnowSQL, the client checks whether a newer version is available in the SnowSQL upgrade repository.
3. If a newer version (such as version 1.3.1) is available, SnowSQL downloads it as a background process while the current installed version.
4. The next time you run SnowSQL, the client executes version 1.3.1 while checking if a newer version is available.

### Enabling auto-upgrade

The `-o noup=<value>` option lets you override the SnowSQL default behavior of requiring manual installations for new versions, where:

* `True` enables the no-upgrade behavior (Default value for version 1.3.0 and higher). SnowSQL does not automatically check for upgrades, nor does it automatically upgrade itself.
* `False` disables the no-upgrade behavior (Default value for version 1.2.32 and lower). SnowSQL automatically checks for upgrades and automatically upgrades itself if any new upgrade is available within the same `major.minor` version

You can specify this option while logging into
Snowflake to enable auto-upgrade during that specific session.

For example:

> ```bash
> snowsql -o noup=False
> ```

Alternatively, add the `noup = False` option to the [configuration file](snowsql-config.md) to enable automatic upgrades for SnowSQL.

### Running a previous SnowSQL version

> **Note:**
>
> If you are running SnowSQL version 1.3.0 or newer, you cannot use this process to run a 1.2.x version. If you want to run a 1.2.x version, you must download and install the earlier version manually.

If you encounter an issue with the latest SnowSQL version, such as version 1.3.1, you can temporarily run another 1.3.x version.

To determine the SnowSQL version that currently starts when you run the client, use the `-v` option without a value:

> ```bash
> $ snowsql -v
>
>   Version: 1.3.1
> ```

To display a list of available SnowSQL versions, use the `--versions` option:

> ```bash
> $ snowsql --versions
>
>  1.3.1
>  1.3.0
> ```

To install an earlier SnowSQL version from the list, use the `-v` option and specify the version you want to install. For example, to install version 1.3.0 if you are running a newer version, such as 1.3.1:

> ```bash
> $ snowsql -v 1.3.0
>
>   Installing version: 1.3.0 [####################################]  100%
> ```

Use the same option to specify the version you want to run when you start SnowSQL:

> ```bash
> $ snowsql -v 1.3.0
> ```

## Changing the Snowflake client repository endpoint used by the SnowSQL auto-upgrade feature

By default, the SnowSQL auto-upgrade feature uses the AWS endpoint of the Snowflake Client Repository. To change the endpoint in the SnowSQL configuration file, complete the steps in this section.

### New users

To specify the Microsoft Azure endpoint of the Snowflake Client Repository as a new SnowSQL user, execute the following command:

```bash
snowsql -o repository_base_url=https://sfc-repo.azure.snowflakecomputing.com/snowsql
```

Verify the configuration file (i.e. `~/.snowsql/config` or `%USERPROFILE%\.snowsql\config`) includes the following line.

```bash
repository_base_url=https://sfc-repo.azure.snowflakecomputing.com/snowsql
```

### Existing users

To specify the Microsoft Azure endpoint of the Snowflake Client Repository as an existing SnowSQL user, add the following line to the configuration file (i.e. `~/.snowsql/config` or `%USERPROFILE%\.snowsql\config`):

```bash
repository_base_url=https://sfc-repo.azure.snowflakecomputing.com/snowsql
```

---
title: Integrate Apache Hive metastores with Snowflake
source: https://docs.snowflake.com/en/user-guide/tables-external-hive.md
section: User Guide
---

# Integrate Apache Hive metastores with Snowflake

You can use the Hive metastore connector for Snowflake to integrate [Apache Hive](https://hive.apache.org/)
metastores with Snowflake by using external tables. The connector detects metastore events and transmits the events
to Snowflake to keep the external tables synchronized with the Hive metastore.
With this capability, users can manage their schema in Hive while querying the metastore from Snowflake.

The Apache Hive metastore must be integrated with cloud storage on one of the following cloud platforms:

* Amazon Web Services
* Google Cloud
* Microsoft Azure

## Install and configure the Hive metastore connector

This section describes how to install and configure the Hive metastore connector for Snowflake.

### Prerequisites

The Hive connector for Snowflake has the following prerequisites:

Snowflake database and schemas:
:   Store the external tables that map to the Hive tables in the metastore.

Designated Snowflake user:
:   The connector is configured to execute operations on the external tables as this user.

Storage integration:
:   With storage integrations, you can configure secure access to external cloud storage without passing explicit cloud provider credentials, such as secret keys or access tokens. Create a storage integration to access cloud storage locations referenced in Hive tables using [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md).

    The STORAGE_ALLOWED_LOCATIONS parameter for the storage integration must list the same storage containers as the ones referenced in the `Location` parameter of the Hive tables in your metastore.

Role:
:   The role must be assigned to the designated Snowflake user and include the following object privileges on the other Snowflake objects identified in this section:

    | Object | Privileges |
    | --- | --- |
    | Database | USAGE |
    | Schema | USAGE , CREATE STAGE , CREATE EXTERNAL TABLE |
    | Storage integration | USAGE |

### Step 1: Install the connector

Complete the following steps to install the connector:

1. From the Maven Central Repository ([Sonatype](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-hive-metastore-connector) or <https://repo1.maven.org/maven2/net/snowflake/snowflake-hive-metastore-connector/>), download the connector JAR file and configuration XML file.
2. Copy the JAR file to the following directory:

   Amazon S3 or Google Cloud Storage:
   :   `lib` directory in the Hive classpath. The location can vary depending on the Hive installation. To determine the classpath, check the HIVE_AUX_JARS_PATH environment variable.

   Microsoft Azure HDInsight:
   :   `hive` directory in the user directory; for example, `/usr/hdp/<hdinsight_version>/atlas/hook/hive/`. The location can vary depending on the Azure HDInsight version and installation choices.

       > **Tip:**
       >
       > An example custom script is available in the `scripts` folder on the
       > [GitHub project page for Hive](https://github.com/snowflakedb/snowflake-hive-metastore-connector/). The script adds the JAR file and
       > configuration files to the correct directories.
3. Create a file named `snowflake-config.xml` in the following directory:

   Amazon S3 or Google Cloud Storage:
   :   `conf` directory in the Hive classpath.

   Microsoft Azure HDInsight:
   :   `conf/conf.server` directory in the Hive classpath.
4. In a text editor, open the `snowflake-config.xml` file, and then populate the file with the following `<name>` properties and corresponding `<values>`:

   > `snowflake.jdbc.username`
   > :   Specifies the sign-in name of the Snowflake user designated for refresh operations on the external tables.
   >
   > `snowflake.jdbc.password`
   > :   Specifies the password for the sign-in name.
   >
   >     > **Note:**
   >     > * You can set a placeholder for the password based on a system property or environment variable, depending on your Hadoop version.
   >     >   The configuration behaves like other Hadoop configurations. For more information, see the
   >     >   [Hadoop documentation](https://hadoop.apache.org/).
   >     > * `snowflake.jdbc.privateKey`
   >
   >     Alternatively, authenticate by using key-pair authentication. For instructions about how to generate the key pair and aassign the public key
   >     to a user, see [Key-pair authentication and key-pair rotation](key-pair-auth.md).
   >
   >     To pass the private key to Snowflake, add the `snowflake.jdbc.privateKey` property to the `snowflake-config.xml` file.
   >     Open the private key file (for example, `rsa_key.p8`) in a text editor. Copy the lines between `-----BEGIN RSA PRIVATE KEY-----` and
   >     `-----END RSA PRIVATE KEY-----` as the property or environment variable value.
   >
   > `snowflake.jdbc.account`
   > :   Specifies the name of your account (provided by Snowflake); for example, `xy12345`.
   >
   > `snowflake.jdbc.db`
   > :   Specifies an existing Snowflake database to use for the Hive metastore integration. For more information, see the Prerequisites section earlier in this topic.
   >
   > `snowflake.jdbc.schema`
   > :   Specifies an existing Snowflake schema in the specified database. For more information, see the Prerequisites section earlier in this topic.
   >
   >     To map multiple schemas in your Hive metastore to corresponding schemas in your Snowflake database, set the `snowflake.hive-metastore-listener.schemas` property in addition to the current property. Specify the default Snowflake schema in the `snowflake.jdbc.schema` property.
   >
   > `snowflake.jdbc.role`
   > :   Specifies the access-control role to use by the Hive connector. The role should be an existing role that was already assigned to the specified user.
   >
   >     If no role is specified here, then the Hive connector uses the default role for the specified user.
   >
   > `snowflake.jdbc.connection`
   > :   Specifies the connection string for your Snowflake account in the following format:
   >
   >     `jdbc:snowflake://<account_identifier>.snowflakecomputing.com`
   >
   >     Where:
   >
   >     > `<account_identifier>`
   >     > :   Unique identifier for your Snowflake account.
   >     >
   >     >     The following example shows the preferred format of the account identifier:
   >     >
   >     >     `organization_name-account_name`
   >     >     :   Names of your Snowflake organization and account. For information, see [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).
   >     >
   >     >     Alternatively, specify your *account locator* and the geographical [region](intro-regions.md), and possibly the [cloud platform](intro-cloud-platforms.md), where the account is hosted. For more information, see [Format 2: Account locator in a region](admin-account-identifier.md).
   >
   > `snowflake.hive-metastore-connector.integration`
   > :   Specifies the name of the storage integration object to use for secure access to the external storage locations referenced in Hive tables in the metastore. For more information, see the Prerequisites section earlier in this topic.
   >
   > `snowflake.hive-metastore-listener.schemas`
   > :   Specifies a comma-separated list of Snowflake schemas that exist in the Snowflake database specified in `snowflake.jdbc.db`.
   >
   >     When a table is created in the Hive metastore, the connector checks whether this property lists a Snowflake schema with the same name as the Hive schema or database that contains the new table:
   >
   >     * If a Snowflake schema with the same name is listed, the connector creates an external table in this schema.
   >     * If a Snowflake schema with the same name is not listed, the connector creates an external table in the default schema, which is defined in the `snowflake.jdbc.schema` property.
   >
   >     The external table has the same name as the new Hive table.
   >
   >     > **Note:**
   >     >
   >     > This property requires version 0.5.0 (or higher) of the Hive Connector.

   (Optional) Add the following property:

   > `snowflake.hive-metastore-listener.database-filter-regex`
   > :   Specifies the names of any databases in the Hive metastore to skip with the integration. With this property, you can control which databases to integrate with Snowflake. This option is especially useful when multiple tables have the same name across Hive databases. Currently, in this situation, the Hive connector creates the first table with the name in the Snowflake target database but skips additional tables with the same name.
   >
   >     For example, suppose databases `mydb1`, `mydb2`, and `mydb3` all contain a table named `table1`. You can omit all databases with the naming convention `mydb<number>` except for `mydb1` by adding the regular expression `mydb[^1]` as the property value.
   >
   >     **Example property node**
   >
   >     ```xml
   >     <configuration>
   >       ..
   >       <property>
   >         <name>snowflake.hive-metastore-listener.database-filter-regex</name>
   >         <value>mydb[^1]</value>
   >       </property>
   >     </configuration>
   >     ```
   >
   >     **Example snowflake-config.xml file**
   >
   >     ```xml
   >     <configuration>
   >       <property>
   >         <name>snowflake.jdbc.username</name>
   >         <value>jsmith</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.password</name>
   >         <value>mySecurePassword</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.role</name>
   >         <value>custom_role1</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.account</name>
   >         <value>myaccount</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.db</name>
   >         <value>mydb</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.schema</name>
   >         <value>myschema</value>
   >       </property>
   >       <property>
   >         <name>snowflake.jdbc.connection</name>
   >         <value>jdbc:snowflake://myaccount.snowflakecomputing.com</value>
   >       </property>
   >       <property>
   >         <name>snowflake.hive-metastore-listener.integration</name>
   >         <value>s3_int</value>
   >       </property>
   >       <property>
   >         <name>snowflake.hive-metastore-listener.schemas</name>
   >         <value>myschema1,myschema2</value>
   >       </property>
   >     </configuration>
   >     ```
5. Save the changes to the file.
6. Edit the existing Hive configuration file (`hive-site.xml`):

   Amazon S3 or Google Cloud Storage:
   :   Open the `hive-site.xml` file in a text editor. Add the connector to the configuration file as follows:

       ```bash
       <configuration>
        ...
        <property>
         <name>hive.metastore.event.listeners</name>
         <value>net.snowflake.hivemetastoreconnector.SnowflakeHiveListener</value>
        </property>
       </configuration>
       ```

   Microsoft Azure HDInsight:
   :   Complete the steps in the
       [Azure HDInsight documentation](https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-customize-cluster-bootstrap) to
       edit the `hive-site.xml` file. Add the following custom property to the cluster configuration:

       `hive.metastore.event.listeners=net.snowflake.hivemetastoreconnector.SnowflakeHiveListener`

       Alternatively, add the custom property in the HDInsight Cluster Management Portal:

       1. Click the Hive tab in the left-hand menu » Configs » Advanced.
       2. Scroll down to the Custom Hive Site tab.
       3. Add the custom property.

   > **Note:**
   >
   > If there are other connectors already configured in this file, add the Hive connector for Snowflake in a comma-separated list in the `<value>` node.
7. Save the changes to the file.
8. Restart the Hive metastore service.

### Step 2: Validate the installation

1. In Hive, create a new table.
2. In your Snowflake database and schema, query the list of external tables by using [SHOW EXTERNAL TABLES](../sql-reference/sql/show-external-tables.md):

   ```sqlsyntax
   SHOW EXTERNAL TABLES IN <database>.<schema>;
   ```

   Where `database` and `schema` are the database and schema you specified in the `snowflake-config.xml` file in Step 1: Install the Connector earlier in this topic.

   The results should show an external table with the same name as the new Hive table.

Connector records are written to the Hive metastore logs. You can view queries run by the connector in the Snowflake QUERY_HISTORY view/function output similar to other queries.

## Integrate existing Hive tables and partitions with Snowflake

To integrate existing Hive tables and partitions with Snowflake, run the following command in Hive for each table and partition:

```sqlexample
ALTER TABLE <table_name> TOUCH [PARTITION partition_spec];
```

For more information, see the [Hive documentation](https://cwiki.apache.org/confluence/display/Hive/Home).

Alternatively, Snowflake provides a script for synching existing Hive tables and partitions. For information, see the [GitHub project page](https://github.com/snowflakedb/snowflake-hive-metastore-connector/blob/master/scripts/sync_hive_to_snowflake.sh).

> **Important:**
>
> If an external table with the same name as the Hive table already exists in the corresponding Snowflake schema in the database specified
> in the `snowflake.jdbc.db` property, the ALTER TABLE … TOUCH command does not recreate the external table. If you need to
> recreate the external table, drop the external table (by using [DROP EXTERNAL TABLE](../sql-reference/sql/drop-external-table.md)) before you run the ALTER
> TABLE … TOUCH command in the Hive metastore.

## Supported and unsupported features

The following sections list supported and unsupported features of the Apache Hive metastores integration with the Hive metastore connector for Snowflake.

### Supported Hive operations and table types

#### Hive operations

The connector supports the following Hive operations:

* Create table
* Drop table
* Alter table add column
* Alter table drop column
* Alter (that is, *touch*) table
* Add partition
* Drop partition
* Alter (touch) partition

#### Hive table types

The connector supports the following types of Hive tables:

* External and managed tables
* Partitioned and unpartitioned tables

### Hive and Snowflake data types

The following table shows the mapping between Hive and Snowflake data types:

| Hive | Snowflake |
| --- | --- |
| BIGINT | BIGINT |
| BINARY | BINARY |
| BOOLEAN | BOOLEAN |
| CHAR | CHAR |
| DATE | DATE |
| DECIMAL | DECIMAL |
| DOUBLE | DOUBLE |
| DOUBLE PRECISION | DOUBLE |
| FLOAT | FLOAT |
| INT | INT |
| INTEGER | INT |
| NUMERIC | DECIMAL |
| SMALLINT | SMALLINT |
| STRING | STRING |
| TIMESTAMP | TIMESTAMP |
| TINYINT | SMALLINT |
| VARCHAR | VARCHAR |
| All other data types | VARIANT |

### Supported file formats and options

The following data file formats and Hive file format options are supported:

* CSV

  The following options are supported using the SerDe (Serializer/Deserializer) properties:

  + `field.delim` / `separatorChar`
  + `line.delim`
  + `escape.delim` / `escapeChar`
* JSON
* AVRO
* ORC
* PARQUET

  The following options are supported using the table properties:

  + `parquet.compression`.

### Unsupported Hive commands, features, and use cases

The connector does not support the following Hive commands, features, and use cases:

* Hive views
* ALTER statements other than TOUCH, ADD COLUMNS, and DROP COLUMNS
* Custom SerDe properties.
* Modifying an existing managed Hive table to become an external Hive table, or vice versa

## Refresh external table metadata to reflect Cloud Storage events

When any of the Hive operations listed in Supported Hive Operations and Table Types earlier in this topic are run on a table, the Hive connector listens to the Hive events and then refreshes the metadata for the corresponding external table in Snowflake.

However, the connector does not refresh the external table metadata based on events in cloud storage, such as adding or removing data files.

To refresh the metadata for an external table to reflect events in the cloud storage, run the respective ALTER TABLE … TOUCH command for your partitioned or unpartitioned Hive table. TOUCH reads the metadata and writes it back. For more information about the command, see the [Hive documentation](https://cwiki.apache.org/confluence/display/Hive/Home):

Partitioned Hive table:
:   Run the following command:

    ```sqlexample
    ALTER TABLE <table_name> TOUCH PARTITION <partition_spec>;
    ```

Unpartitioned Hive table:
:   Run the following command:

    ```sqlexample
    ALTER TABLE <table_name> TOUCH;
    ```

## Differences between Hive tables and Snowflake external tables

The following list describes the main differences between Hive tables and Snowflake external tables:

Partitions:
:   * Snowflake partitions are composed of subpaths of the storage location referenced by the table, while Hive partitions don’t have this constraint. If partitions are added in Hive tables that are not subpaths of the storage location, those partitions aren’t added to the corresponding external tables in Snowflake.

      For example, if the storage location associated with the Hive table (and corresponding Snowflake external table) is `s3://path/`, then all partition locations in the Hive table must also be prefixed by `s3://path/`.
    * Two Snowflake partitions in a single external table can’t point to the exact same storage location. For example, the following partitions conflict with each other:

      ```sqlexample
      ALTER EXTERNAL TABLE exttable ADD PARTITION(partcol='1') LOCATION 's3:///files/2019/05/12';

      ALTER EXTERNAL TABLE exttable ADD PARTITION(partcol='2') LOCATION 's3:///files/2019/05/12';
      ```

Column names:
:   Hive column names are case-insensitive, but Snowflake virtual columns derived from VALUES are case-sensitive. If Hive tables contain columns with mixed-case names, the data in those columns might be NULL in the corresponding columns in the Snowflake external tables.

---
title: Introduction to business continuity & disaster recovery
source: https://docs.snowflake.com/en/user-guide/replication-intro.md
section: User Guide
---

# Introduction to business continuity & disaster recovery

This topic describes the main use cases for replication and failover across regions and cloud platforms. The Snowflake replication
and failover/failback functionality is composed of the following features:

* Replication and Failover/Failback
* Client Redirect

Collectively, these individual features are designed to support a number of different fundamental business continuity scenarios,
including:

* **Planned failovers**: For disaster recovery drills to test preparedness, and measure recovery point and time.
* **Unplanned failovers**: In the case of an outage in a region or a cloud platform, promote secondary account objects and databases
  in another region or cloud platform to serve as read-write primary objects.
* **Migration**: Move your Snowflake account to a different region or cloud platform without disrupting your business. For example, to
  maintain business continuity during mergers and acquisitions, or facilitate a change in cloud strategy.
* **Multiple readable secondaries**: Account objects and databases can be replicated to multiple accounts in
  different regions and cloud platforms, mitigating the risk of multiple region or cloud platform outages.

In addition, [Snowflake Secure Data Sharing](secure-data-sharing-across-regions-platforms.md) and Database Replication enable sharing data securely across regions and cloud platforms.

## Account replication and failover/failback features

### Replication and failover/failback

[Replication](account-replication-intro.md) uses two Snowflake objects,
[replication group and failover group](account-replication-intro.md), to replicate a group of objects with point-in-time
consistency from a source account to one or more target accounts. A replication group allows customers to specify what to replicate, where
to replicate to, and how often. This means specifying which objects to replicate, to which regions or cloud platforms, at customizable
scheduled intervals. A failover group enables the replication and failover of the objects in the group.

Account objects can include warehouses, users, and roles, along with databases and shares (see [Replicated objects](account-replication-intro.md) for the full
list of objects that can be included in a replication or failover group). Account objects can be grouped in one or multiple groups.

In the case of failover, account replication enables the failover of your account to a different region or cloud platform.
Each replication and failover group has its own replication schedule, allowing you to set the frequency for replication at different
intervals for different groups of objects. In the case of failover groups, it also enables failover of groups individually. You can choose
to failover all failover groups, or only select failover groups.

### Client Redirect

[Client Redirect](client-redirect.md) provides a *connection URL* that can be used by Snowflake clients to connect to
Snowflake. The connection URL can redirect Snowflake clients to a different Snowflake account as needed.

## Business continuity and disaster recovery

In the event of a massive outage (due to a network issue, software bug, etc.) that disrupts the cloud services in a given region, access to
Snowflake will be unavailable until the source of the outage is resolved and services are restored. To ensure continued availability and
data durability in such a scenario, replicate your critical account objects to another Snowflake account in your organization
in a different region.

With asynchronous replication, secondary replicas typically lag behind the primary objects based on the replication schedule you
configure. Secondary replica objects are at most 2x the time interval between scheduled refreshes behind the primary objects. For
example, if you choose to replicate a primary replication or failover group every 30 minutes, the secondary objects in the group
are at most 60 minutes behind the primary objects during an outage.

Depending on your business needs you could choose to:

> * Recover reads first to let client applications read data that is 30 minutes stale.
> * Recover writes first to reconcile the last 30 minutes of data on the new primary before
>   opening up reads from client applications.
> * Recover both reads and writes simultaneously, that is, open up reads from client applications on data that is 30 minutes stale as
>   you reconcile the last 30 minutes of data on the new primary.

### Normal status: Region is operational

**Account Object Replication:** Replicate the failover group(s) with critical account objects to one or more Snowflake accounts in
regions different from that of the account that stores the primary (source) failover group(s). Refresh the failover group(s) frequently.

### Region outage

To prioritize reads, writes, or both at the same time, follow the steps in one of the following example scenarios.

#### Reads before writes

When an outage in a region results in full or partial loss of Snowflake availability, this path allows you to redirect Snowflake clients to read-only replicas of account objects in critical failover group(s) first for minimal downtime. Choosing to operate in read-only mode is often desirable during short-term outages.

A longer-term outage combined with the need for the latest data necessitates read-write mode.

1. **Client Redirect:** Point the connection URL used by clients to a Snowflake account that stores your read-only replica (secondary) failover group(s).
2. **Failover (When Needed):** In the event of a longer-term outage, promote the secondary failover group(s) in the Snowflake account where your connection URL is pointing to serve as read-write primary failover group(s).

#### Writes before reads

When an outage in a region results in full or partial loss of Snowflake availability, this path allows you to recover failover group(s) with critical account objects and continue to process data first. This option is preferable for account administrators who want to fail over their databases and ETL (Extract, Transform, Load) processes first, and then choose to redirect Snowflake clients only when the data is current.

1. **Failover:** Promote the secondary failover group(s) with critical account objects in a different region to serve as the primary
   failover group(s), which allows writing to the objects included in each failover group(s). Once the databases in the group(s)
   are writable, you can use your ETL processes to prioritize writes and reconcile data.

   If you use Snowflake data pipeline objects for ETL processes, you can replicate and fail over those objects. For more information,
   see [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md).

   Otherwise, configure separate connection URLs for your data ingestion pipeline and one for your clients (for example, a BI
   dashboard). After failing over the failover group, fail over the connection URL for data ingestion, and write data to the newly
   promoted primary objects. After data has been reconciled, fail over the connection URL for your clients to enable reads.
2. **Client Redirect (When Needed):** Point the connection URL used by clients to the Snowflake account that stores the new primary failover group(s).

#### Prioritize both reads and writes

To prioritize both reads and writes at the same time, fail over both the client connection and secondary failover group(s) without
waiting for the secondary objects to be up to date. This enables immediate access for clients to potentially stale data while the
newly promoted databases can start reingesting data from data pipelines.

1. **Client Redirect:** Point the connection URL used by clients to a Snowflake account that stores your read-only replica (secondary)
   failover group(s).
2. **Failover:** Promote the secondary failover group(s) with critical account objects in a different region to serve as the primary
   failover group(s), which enables writing to the objects included in each failover group(s).

### Normal status: Outage is resolved

1. **Replication:** Refresh the failover group(s) in the Snowflake account in the region where the outage occurred.
2. **Failback:** Promote the failover group(s) in the Snowflake account where the outage occurred to again serve as the primary failover
   group(s).
3. **Client Redirect:** Point the connection URL used by clients to the Snowflake account in the region where the outage occurred.

## Account migration

Account migration is the one-time process of migrating (or transferring) the Snowflake objects and your stored data to an account in
another region or on a different cloud platform. Typical reasons for migrating your account include a closer proximity to your user base
or a preference for a different cloud platform based on your corporate strategy or co-location with other cloud assets (e.g. a data lake).

Account object replication supports the replication of account objects such as warehouses, users, and roles, along with databases and
shares. See [Replicated objects](account-replication-intro.md) for the complete list of replicated objects.

> **Note:**
>
> Account object replication and failover/failback requires a Business Critical (or higher) edition of Snowflake. For account migrations
> Snowflake support can temporarily lift this restriction and enable this feature for your account without changing your Snowflake edition.
> This service is available on a one-time basis only.

---
title: Introduction to cost anomalies
source: https://docs.snowflake.com/en/user-guide/cost-anomalies.md
section: User Guide
---

# Introduction to cost anomalies

A cost anomaly occurs when daily consumption is above or below the expected range of consumption for the day. Snowflake uses an algorithm to
automatically detect these cost anomalies based on prior levels of consumption, which simplifies the process of identifying spikes or dips
in costs so you can find ways to optimize your spend. Snowflake also provides tools to investigate these cost anomalies to identify root
causes.

> **Note:**
>
> The algorithm that detects cost anomalies requires at least 30 days of consumption before it can identify anomalies. If your consumption
> in the last seven days was less than 10 credits, Snowflake does not identify changes as an anomaly.

## Account-level vs. organization-level cost anomalies

An account-level cost anomaly occurs when the consumption in a single account falls outside the expected range of consumption for that
account.

An organization-level cost anomaly occurs when the consumption in the entire organization falls outside the expected range of consumption
for the organization. It is based on the aggregate consumption of all accounts in the organization. For example, if there is a significant
consumption spike in one account, but a dip in another, the two might offset each other such that it is not flagged as an organization-level
anomaly. To help investigate organization-level anomalies, Snowflake provides tools to identify which accounts had the biggest increase or
decrease in consumption on a specific day.

To identify and investigate organization-level cost anomalies, you need to be signed in to the
[organization account](organization-accounts.md) or an [ORGADMIN-enabled account](organization-administrators.md).

## Get started

To identify and investigate cost anomalies using a user interface:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](cost-anomalies-access-control.md).
2. In the navigation menu, select Admin » Cost management, and then select Anomalies.

## Unit of measure for cost data

Cost data can be shown with credits as the unit of measure or with a currency as the unit of measure. The unit of measure is a currency in
the following situations:

* If you use the ACCOUNTADMIN or GLOBALORGADMIN system role to work with cost anomalies, cost data displays in a currency if you
  are signed in to the [organization account](organization-accounts.md) or one that has the
  [ORGADMIN role enabled](organization-administrators.md).
* If you are not a system administrator, cost data displays in a currency if you are granted the ORGANIZATION_BILLING_VIEWER
  application role or APP_ORGANIZATION_BILLING_VIEWER application role. For more information about these application roles, see
  [Access control for cost anomalies](cost-anomalies-access-control.md).

## Run queries against cost anomaly views

You can run queries against views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas to return historical data about account-level cost
anomalies. Each row in the view includes the consumption on a specific day, and whether that consumption was a cost anomaly.

Cost anomalies for current account
:   Execute queries against the [ANOMALIES_DAILY view](../sql-reference/account-usage/anomalies_daily.md) in the
    [ACCOUNT_USAGE schema](../sql-reference/account-usage.md) to gain insights into whether cost anomalies occurred in the current account.

    This view uses credits as the unit of measure for consumption.

Cost anomalies for all accounts in an organization
:   Execute queries against the [ANOMALIES_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/anomalies_in_currency_daily.md) in the
    [ORGANIZATION_USAGE schema](../sql-reference/organization-usage.md) to gain insights into whether cost anomalies occurred in accounts in
    the organization. Note that not all accounts have access to the ORGANIZATION_USAGE schema.

    Use this view to see currency as the unit of measure rather than credits.

## Learn more

For information about how to work with cost anomalies, see the following:

* [Use Snowsight to work with cost anomalies](cost-anomalies-ui.md)
* [Programmatically work with cost anomalies](cost-anomalies-class.md)

---
title: Introduction to data quality checks
source: https://docs.snowflake.com/en/user-guide/data-quality-intro.md
section: User Guide
---

# Introduction to data quality checks

Data quality checks in Snowflake continuously validate the health of your data. These checks help you comply with regulatory standards, meet
service-level agreements through accurate metrics, and build credibility in data-driven decisions by providing automated, consistent data
validation. Cortex Data Quality lets you leverage AI to agentically suggest data quality checks based on characteristics of your metadata
and usage patterns, eliminating the need to manually define checks and accelerating your setup process while keeping your data securely
inside Snowflake. Once configured, quality checks run automatically on your chosen schedule, reporting violations so you can take corrective
action.

## Get started

Snowflake provides a web interface to set up data quality checks and monitor the results of these checks.

To get started, do one of the following:

* To set up data quality checks for your data, see [Use Snowsight to set up data quality checks](data-quality-ui-setup.md).
* To monitor the results of your existing data quality checks, see [Monitoring data quality checks in Snowsight](data-quality-ui-monitor.md).

## Core concepts of data quality checks

Data metric function (DMF)
:   A DMF measures an attribute of your data such as how many NULL values exist in a column or how often a table is being updated. The DMF
    returns a value based on the current state of your data, but doesn’t define whether that value constitutes a data quality issue; a DMF is
    a building block of a data quality check.

    Snowflake provides *system DMFs* to measure common metrics without requiring configuration. For a list of the system DMFs that are
    available for various dimensions, see [System data metric functions](data-quality-system-dmfs.md).

    If there isn’t a system DMF for the metric that you want to monitor, you can define a *custom DMF*. To learn how to create a custom DMF,
    see [Custom data metric functions](data-quality-custom-dmfs.md).

Expectations
:   An expectation is combined with a DMF to create a data quality check. When a DMF returns a value, it’s compared to the expectation’s
    definition to determine whether data passed or failed the check. Return values that fail the check are reported as expectation violations
    so you can take appropriate action.

    If you [use Snowsight to create a data quality check](data-quality-ui-setup.md), you choose the DMF and define the expectation at the
    same time. You can also [use SQL to work with expectations directly](data-quality-expectations.md).

Anomaly detection
:   Anomaly detection uses historical data to automatically detect when a DMF return value is above or below a predicted range. Currently,
    Snowflake can automatically detect anomalies in the volume and freshness of your data. For more information, see
    [Detecting anomalies in data quality](data-quality-anomaly.md).

DMF schedule
:   The DMF schedule for a table or view determines how often a DMF runs. Because a DMF powers a data quality check, the DMF schedule
    determines how often the quality check is performed. By default, the DMF schedule runs a DMF once every hour. To adjust the schedule for
    a table or view, see [Adjust how often quality checks run](data-quality-ui-setup.md).

    The DMF schedule doesn’t affect how often Snowflake checks whether there is an anomaly.

## Supported table kinds

You can set a DMF on the following kinds of table objects:

* Dynamic table
* Event table
* External table
* Apache Iceberg™ table
* Materialized view
* Table (CREATE TABLE), including temporary and transient tables
* View

You cannot set a DMF on a hybrid table or a stream object.

## Cost considerations

The DMFs that power data quality checks use [serverless compute resources](cost-understanding-compute.md) that incur costs. For the
pricing of these costs, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

The credits consumed by the serverless compute resources are listed under the “Data Quality Monitoring” category on your monthly bill.
These credits include compute consumed by all system or user-defined data quality metrics that you use. You are not billed for creating a
DMF.

* Billing occurs only when a scheduled DMF is computed on an object. You are not billed for unscheduled data metric
  function usage, such as calling a DMF with a SELECT statement.
* The logging infrastructure consolidates metric outputs in the event table. Consumption incurred by the logging service shows up on
  your monthly bill as “Logging.”

> **Tip:**
>
> To track consumption related to quality checks, you can query the following views:
>
> > * [DATA_QUALITY_MONITORING_USAGE_HISTORY](../sql-reference/account-usage/data_quality_monitoring_usage_history.md) to
> >   track your credit consumption related to using DMFs in your account.
> > * [METERING_DAILY_HISTORY](../sql-reference/organization-usage/metering_daily_history.md) to track the daily credits consumed for an
> >   account in your organization. The `service_type` column specifies `DATA_QUALITY_MONITORING`.

## Replication

For information about replication and DMFs, see [Replication of data metric functions (DMFs)](account-replication-considerations.md).

## Limitations

Note the following limitations when using DMFs:

* You can only have 10,000 total associations of DMFs on objects per account. Each instance of setting a DMF on a table or view counts as
  one association.
* [Data sharing](data-sharing-intro.md): You can’t grant privileges on a DMF to a share or set a DMF on a shared table or view.
* Setting a DMF on an object tag is not supported.
* You can’t set a DMF on objects in a [reader account](data-sharing-intro.md).
* Trial accounts don’t support this feature.

---
title: Introduction to database replication across multiple accounts
source: https://docs.snowflake.com/en/user-guide/db-replication-intro.md
section: User Guide
---

# Introduction to database replication across multiple accounts

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

This feature enables replicating databases between Snowflake accounts (within the same organization) and keeping the database objects and stored data synchronized. Database replication is supported across [regions](intro-regions.md) and across [cloud platforms](intro-cloud-platforms.md).

## What is a primary database?

Replication can be enabled for any existing permanent or transient database. Enabling replication designates the database as a *primary database*. Any number of databases in an account can be designated a primary database. Likewise, a primary database can be replicated to any number of accounts in your organization. This involves creating a *secondary database* as a replica of a specified primary database in each of the target accounts. These accounts are typically located in other regions, on the same or a different cloud platform, or they can be in the same region as the source account.

All DML/DDL operations are executed on the primary database. Each read-only, secondary database can be refreshed periodically with a snapshot of the primary database, replicating all data as well as DDL operations on database objects (i.e. schemas, tables, views, etc.).

## Overview of database replication

For the full list of replicated database objects, see [Replicated database objects](account-replication-intro.md).

### Other objects in an account

Database replication is supported for databases only. Other types of objects in an account can be replicated with
[account replication](account-replication-intro.md). For the full list of supported objects for account
replication, see [Replicated objects](account-replication-intro.md).

### Access control

Privileges granted on database objects are not replicated to a secondary database.
This includes privilege grants on existing database objects as well as grants on future
objects (i.e. future grants).

Privilege grants can be replicated with [account replication](account-replication-intro.md).

### Parameters

Account parameters are not replicated with database replication. Account parameters can be replicated with [account replication](account-replication-intro.md).

Object parameters that are set at the schema or schema object level are replicated:

> | Parameter | Objects |
> | --- | --- |
> | [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) | schema, table |
> | [DEFAULT_DDL_COLLATION](../sql-reference/parameters.md) | schema, table |
> | [MAX_DATA_EXTENSION_TIME_IN_DAYS](../sql-reference/parameters.md) | schema, table |
> | [PIPE_EXECUTION_PAUSED](../sql-reference/parameters.md) [1] | schema, pipe |
> | [QUOTED_IDENTIFIERS_IGNORE_CASE](../sql-reference/parameters.md) | schema, table |

Parameter replication is only applicable to objects in the database (schema, table) and only if the parameter is explicitly set using CREATE
`<object>` `<parameter>` or ALTER `<object>` … SET `<parameter>`. Database level parameters are not replicated.

Parameters explicitly set on objects in the primary database overwrite parameters set on objects in the secondary database. For example, if
the primary database has a schema `s1` with DATA_RETENTION_TIME_IN_DAYS set to 10 and the secondary database has
DATA_RETENTION_TIME_IN_DAYS set to 1 at the database level, DATA_RETENTION_TIME_IN_DAYS for schema `s1` in the secondary database is set
to 10 after replication.

Parameters explicitly set at the database level on secondary databases are not overwritten. For example, if the secondary database parameter
DATA_RETENTION_TIME_IN_DAYS is explicitly set to 1 and the primary database parameter DATA_RETENTION_TIME_IN_DAYS is explicitly set to 10,
DATA_RETENTION_TIME_IN_DAYS for the secondary database remains set to 1 after replication.

[1] Note that PIPE objects are not replicated. If the PIPE_EXECUTION_PAUSED parameter is set at the schema level in the primary
database, it is replicated to the secondary database. When the secondary database is promoted to primary database in the case of a failover
and a pipe is created, the parameter setting will take effect.

## Database replication to accounts on lower editions

If either of the following conditions is true, Snowflake displays an error message when a local database is promoted to serve as a primary database:

* The primary database is in a Business Critical (or higher) account but one or more of the accounts approved for replication are on lower editions. Business Critical Edition is intended for Snowflake accounts with extremely sensitive data.
* The primary database is in a Business Critical (or higher) account and a signed business associate agreement is in place to store PHI data in the account per HIPAA and [HITRUST CSF](intro-cloud-platforms.md) regulations, but no such agreement is in place for one or more of the accounts approved for replication, regardless if they are Business Critical (or higher) accounts.

This behavior is implemented in an effort to help prevent account administrators for Business Critical (or higher) accounts from inadvertently replicating sensitive data to accounts on lower editions.

An account administrator can override this default behavior by including the IGNORE EDITION CHECK clause when executing the [ALTER DATABASE … ENABLE REPLICATION TO ACCOUNTS](../sql-reference/sql/alter-database.md) statement. If IGNORE EDITION CHECK is set, the primary database can be replicated to the specified accounts on any Snowflake edition.

## Current limitations of database replication

* Databases created from shares cannot be replicated.
* Refresh operations fail if the primary database includes a stream with an unsupported source object.
  The operation also fails if the source object for any stream has been dropped.
* Append-only streams are not supported on replicated source objects.

* The CREATE DATABASE … AS REPLICA command does not support the WITH TAG clause.

  This clause is not supported because the secondary database is read only. If your primary database specifies the WITH TAG clause, remove
  the clause prior to creating the secondary database. To verify whether your database has the WITH TAG clause, call the
  [GET_DDL](../sql-reference/functions/get_ddl.md) function in your Snowflake account and specify the primary database in the function argument. If
  a tag is set on the database, the function output will include an ALTER DATABASE … SET TAG statement.
* Stage and pipe replication are not supported. You can replicate stages and pipes using account replication. For more information, see
  [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md).
* [Secrets](../sql-reference/sql/create-secret.md) is not supported. You can replicate secrets using a replication or failover group.

---
title: Introduction to external tables
source: https://docs.snowflake.com/en/user-guide/tables-external-intro.md
section: User Guide
---

# Introduction to external tables

An *external table* is a Snowflake feature that you can use to query data stored in an [external stage](data-load-overview.md)
as if the data were inside a table in Snowflake. The external stage is not part of Snowflake, so Snowflake doesn’t store or manage the
stage. To harden your security posture, you can configure the external stage for [outbound private connectivity](private-connectivity-outbound.md)
to access the external table by using private connectivity.

External tables let you store (within Snowflake) certain file-level metadata, including filenames, version identifiers,
and related properties. External tables can access data stored in any format that the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command supports,
except XML.

External tables are read-only. You can’t perform data manipulation language (DML) operations on external tables.
However, you can use external tables for query and join operations. You can also create views against external tables.

Querying data in an external table might be slower than querying data that you store natively in a table within Snowflake. To improve query
performance, you can use a [materialized view](views-materialized.md) based on an external table.
For optimal query performance when you work with Parquet files, consider using [Apache Iceberg™ tables](tables-iceberg.md) instead.

> **Note:**
>
> If Snowflake encounters an error while scanning a file in cloud storage during a query operation,
> the file is skipped and scanning continues on the next file. A query can partially scan a file and return the rows scanned before the error
> was encountered.

## Planning the schema of an external table

The following sections describe the options available to you to plan your external tables.

### Schema on read

All external tables include the following columns:

VALUE:
:   A VARIANT type column that represents a single row in the external file.

METADATA$FILENAME:
:   A pseudocolumn that identifies the name of each staged data file that is included in the external table, including its path in the stage.

METADATA$FILE_ROW_NUMBER:
:   A pseudocolumn that shows the row number for each record in a staged data file.

To create external tables, you are only required to have some knowledge of the file format and record format of the source data files.
Knowing the schema of the data files isn’t required.

> **Note:**
>
> [SELECT](../sql-reference/sql/select.md) `*` always returns the VALUE column, in which all regular or semi-structured data is cast to variant rows.

### Virtual columns

If you’re familiar with the schema of the source data files, you can create additional virtual columns as expressions by using the VALUE
column and the METADATA$FILENAME or METADATA$FILE_ROW_NUMBER pseudocolumns. When the external data is scanned, the data types of any
specified fields or semi-structured data elements in the data file must match the data types of these additional columns in the external
table. This requirement enables strong type checking and schema validation over the external data.

### General file sizing recommendations

To optimize the number of parallel scanning operations when you query external tables, we recommend the following file or row group sizes
per format:

| Format | Recommended size range | Notes |
| --- | --- | --- |
| Parquet files | 256 - 512 MB |  |
| Parquet row groups | 16 - 256 MB | When Parquet files include multiple row groups, Snowflake can operate on each row group in a different server. For improved query performance, we recommend sizing Parquet files in the recommended range; or, if large file sizes are necessary, including multiple row groups in each file. |
| All other supported file formats | 16 - 256 MB |  |

For optimal performance when querying large data files, create and query
materialized views over external tables.

### Partitioned external tables

We strongly recommend partitioning your external tables, which requires that your underlying data is organized using logical paths that
include date, time, country, or similar dimensions in the path. Partitioning divides your external table data into multiple parts using
partition columns.

An external table definition can include multiple partition columns, which impose a multi-dimensional structure on the external data.
Partitions are stored in the external table metadata.

Partitioning improves query performance. Because the external data is partitioned into separate slices or parts, query
response time is faster when processing a small part of the data instead of scanning the entire data set.

Based on your individual use cases, you can do either of the following:

* Add new partitions automatically by refreshing an external table that defines an expression for each partition column.
* Add new partitions manually.

Partition columns are defined when an external table is created, using the CREATE EXTERNAL TABLE … PARTITION BY syntax. After an external
table is created, the method by which partitions are added can’t be changed.

The following sections explain the different options for adding partitions in greater detail. For examples, see
[CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md).

#### Partitions added automatically

An external table creator defines partition columns in a new external table as expressions that parse the path or filename information
stored in the METADATA$FILENAME pseudocolumn. A partition consists of all data files that match the path or filename in the expression
for the partition column.

The following CREATE EXTERNAL TABLE syntax adds partitions automatically based on expressions:

```sqlsyntax
CREATE EXTERNAL TABLE
  <table_name>
     ( <part_col_name> <col_type> AS <part_expr> )
     [ , ... ]
  [ PARTITION BY ( <part_col_name> [, <part_col_name> ... ] ) ]
  ..
```

Snowflake computes and adds partitions based on the defined partition column expressions when external table metadata is refreshed. By
default, the metadata is refreshed automatically when the object is created. In addition, the object owner can configure the metadata to
refresh automatically when new or updated data files are available in the external stage. The owner can alternatively refresh the metadata
manually by executing the ALTER EXTERNAL TABLE … REFRESH command.

#### Partitions added manually

An external table creator determines the partition type of a new external table as *user-defined* and specifies only the data types of
partition columns. Use this option when you prefer to add and remove partitions selectively rather than add partitions automatically
for all new files in an external storage location that match an expression.

You generally choose this option to synchronize external tables with other metastores (for example, AWS Glue or Apache Hive).

The following CREATE EXTERNAL TABLE syntax manually adds partitions:

```sqlsyntax
CREATE EXTERNAL TABLE
  <table_name>
     ( <part_col_name> <col_type> AS <part_expr> )
     [ , ... ]
  [ PARTITION BY ( <part_col_name> [, <part_col_name> ... ] ) ]
  PARTITION_TYPE = USER_SPECIFIED
  ..
```

Include the required `PARTITION_TYPE = USER_SPECIFIED` parameter.

The partition column definitions are expressions that parse the column metadata in the internal (hidden)
METADATA$EXTERNAL_TABLE_PARTITION column.

The object owner adds partitions to the external table metadata manually by running the ALTER EXTERNAL TABLE … ADD PARTITION command:

```sqlsyntax
ALTER EXTERNAL TABLE <name> ADD PARTITION ( <part_col_name> = '<string>' [ , <part_col_name> = '<string>' ] ) LOCATION '<path>'
```

Automatically refreshing an external table with user-defined partitions isn’t supported. Attempting to manually refresh this type of
external table produces a user error.

### Delta Lake support

> **Note:**
>
> This feature is still supported but will be deprecated in a future release.
>
> Consider using an [Apache Iceberg™ table](tables-iceberg.md) instead. Iceberg tables
> use an [external volume](tables-iceberg.md)
> to connect to Delta table files in your cloud storage.
>
> For more information, see [Iceberg tables](tables-iceberg.md) and [CREATE ICEBERG TABLE (Delta files in object storage)](../sql-reference/sql/create-iceberg-table-delta.md).
> You can also Migrate a Delta external table to Apache Iceberg™.

[Delta Lake](https://delta.io/) is a table format on your data lake that supports ACID (atomicity, consistency, isolation, durability)
transactions among other features. All data in Delta Lake is stored in Apache Parquet format. You can create external tables that reference your
cloud storage locations enhanced with Delta Lake.

To create an external table that references a Delta Lake, set the `TABLE_FORMAT = DELTA` parameter in the CREATE EXTERNAL TABLE
statement.

When you set this parameter, the external table scans for Delta Lake transaction log files in the `[ WITH ] LOCATION` location. Delta
log files have names like `_delta_log/00000000000000000000.json` or `_delta_log/00000000000000000010.checkpoint.parquet`.
When the metadata for an external table is refreshed, Snowflake parses the Delta Lake transaction logs and determines which Parquet files
are current. In the background, the refresh performs add and remove file operations to keep the external table metadata in sync.

For more information, including examples, see [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md).

> **Note:**
>
> * External tables that reference Delta Lake files don’t support deletion vectors.
> * Automated refreshes aren’t supported for this feature because
>   the ordering of event notifications triggered by Data Definition Language (DDL) operations in cloud storage isn’t guaranteed. To register any added or removed files,
>   periodically run an
>   [ALTER EXTERNAL TABLE … REFRESH](../sql-reference/sql/alter-external-table.md) statement.

#### Migrate a Delta external table to Apache Iceberg™

To migrate one or more external tables that reference a Delta Lake to [Apache Iceberg™ tables](tables-iceberg.md),
complete the following steps:

1. Use the [SHOW EXTERNAL TABLES](../sql-reference/sql/show-external-tables.md) command to retrieve the `location` (external stage and folder path)
   for the external table(s).

   For example, the following command returns information for external tables and filters on names like `my_delta_ext_table`:

   ```sqlexample
   SHOW EXTERNAL TABLES LIKE 'my_delta_ext_table';
   ```
2. [Create an external volume](tables-iceberg-configure-external-volume.md); specify the location that you retrieved in
   the previous step as the `STORAGE_BASE_URL`.

   To create a single external volume for multiple Delta tables under the same storage location,
   set the external volume’s active location (`STORAGE_BASE_URL`) as the common root directory.

   For example, consider the following locations for three Delta tables that branch from the same storage location:

   * `s3://my-bucket/delta-ext-table-1/`
   * `s3://my-bucket/delta-ext-table-2/`
   * `s3://my-bucket/delta-ext-table-3/`

   As shown in the following example, specify the bucket as the `STORAGE_BASE_URL` when you create the external volume. Later, you can specify the relative path
   to the table files (for example, `delta-ext-table-1/`) as the `BASE_LOCATION` when you create an Iceberg table:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL VOLUME delta_migration_ext_vol
   STORAGE_LOCATIONS = (
     (
       NAME = storage_location_1
       STORAGE_PROVIDER = 'S3'
       STORAGE_BASE_URL = 's3://my-bucket/'
       STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789123:role/my-storage-role' )
   );
   ```
3. [Create a catalog integration for Delta tables](tables-iceberg-configure-catalog-integration-object-storage.md).
4. Create an Iceberg table by using the [CREATE ICEBERG TABLE (Delta files in object storage)](../sql-reference/sql/create-iceberg-table-delta.md) command.
   The `BASE_LOCATION` of the external volume must point to the existing external table location.

   The following example creates an Iceberg table based on external table files located in `s3://my-bucket/delta-ext-table-1/`, and
   references the external volume created previously. To determine the full storage location for the table, Snowflake appends the `BASE_LOCATION`
   to the `STORAGE_BASE_URL` of the external volume:

   ```sqlexample
   CREATE ICEBERG TABLE my_delta_table_1
     BASE_LOCATION = 'delta-ext-table-1'
     EXTERNAL_VOLUME = 'delta_migration_ext_vol'
     CATALOG = 'delta_catalog_integration';
   ```
5. Drop the external table:

   ```sqlexample
   DROP EXTERNAL TABLE my_delta_ext_table_1;
   ```

### Add or drop columns

To alter an existing external table to add or remove columns, use the following ALTER TABLE syntax:

* Add columns: ALTER TABLE … ADD COLUMN.
* Remove columns: ALTER TABLE … DROP COLUMN.

> **Note:**
>
> The default VALUE column and METADATA$FILENAME and METADATA$FILE_ROW_NUMBER pseudocolumns can’t be dropped.

For more information, see the example in [ALTER TABLE](../sql-reference/sql/alter-table.md).

### Protection of external tables

You can protect an external table by using a masking policy and a row access policy. For more information, see the following topics:

* [Masking policies and external tables](security-column-intro.md).
* [Row access policies and external tables](security-row-intro.md).

## Materialized views over external tables

In many cases, [materialized views](views-materialized.md) over external
tables can provide faster performance than equivalent queries over the underlying
external table. When you run a query frequently or your query is sufficiently complex, materialized views can be significantly faster.

Refresh the file-level metadata in any queried external tables so that your
materialized views reflect the current set of files in the referenced cloud storage
location.

You can refresh the metadata for an external table
automatically by using the event notification
service for your cloud storage service or by manually using
[ALTER EXTERNAL TABLE … REFRESH](../sql-reference/sql/alter-external-table.md) statements.

## Automatically refreshing external table metadata

You can automatically refresh the metadata for an external table by using the event notification service for your cloud storage service.

The refresh operation synchronizes the metadata with the latest set of associated files in the external stage and path, that is:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

For more information, see [Refresh external tables automatically](tables-external-auto.md).

## Billing for external tables

Snowflake includes an overhead in your charges to manage event notifications for the automatic refreshing of external table metadata. This overhead increases in
relation to the number of files added in cloud storage for the external stages and paths specified for your external tables. This overhead
charge appears as Snowpipe charges in your billing statement because Snowpipe is used for event notifications for the automatic external
table refreshes. You can estimate this charge by querying the [PIPE_USAGE_HISTORY](../sql-reference/functions/pipe_usage_history.md) function or examining the Account Usage [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md).

In addition, Snowflake includes a small maintenance-overhead charge for manually refreshing the external table metadata (using ALTER EXTERNAL TABLE …
REFRESH). This overhead is charged in accordance with the standard [cloud services billing model](cost-understanding-compute.md),
like all similar activity in Snowflake. Manual refreshes of standard external tables are cloud services operations only; however, manual
refreshes of external tables enhanced with Delta Lake rely on user-managed compute resources (that is, a virtual warehouse).

Users with the ACCOUNTADMIN role, or a role with the global MONITOR USAGE privilege, can query the
[AUTO_REFRESH_REGISTRATION_HISTORY](../sql-reference/functions/auto_refresh_registration_history.md) table function to retrieve the history of data files registered in the
metadata of specified objects and the credits that are billed for these operations.

## Overview of setup and load workflows

> **Note:**
>
> External tables don’t support storage versioning (S3 versioning, Object Versioning in Google Cloud Storage, or versioning for Azure Storage).

### Amazon S3 workflow

The following steps provide a high-level overview of the setup and load workflow for external tables that reference Amazon S3 stages. For complete instructions, see [Refresh external tables automatically for Amazon S3](tables-external-s3.md):

1. Create a named stage object (by using [CREATE STAGE](../sql-reference/sql/create-stage.md)) that references the external location (that is, S3 bucket) where your data files are staged.
2. Create an external table (by using [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)) that references the named stage.
3. Manually refresh the external table metadata by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) … REFRESH to synchronize the metadata with the current list of files in the stage path. This step also verifies the settings in your external table definition.
4. Configure an event notification for the S3 bucket. Snowflake relies on event notifications to continually refresh the external table metadata to maintain consistency with the staged files.
5. Manually refresh the external table metadata one more time by using ALTER EXTERNAL TABLE … REFRESH to synchronize the metadata with any changes that occurred after Step 3. Thereafter, the S3 event notifications trigger the metadata refresh automatically.
6. Configure Snowflake access control privileges for any additional roles to grant them query access to the external table.

### Google Cloud Storage workflow

The following steps provide a high-level overview of the setup and load workflow for external tables that reference Google Cloud Storage (GCS)
stages:

1. Configure a Google Pub/Sub subscription for GCS events.
2. Create a notification integration in Snowflake. A notification integration is a Snowflake object that provides an interface between
   Snowflake and third-party cloud message queuing services such as Pub/Sub.
3. Create a named stage object (by using [CREATE STAGE](../sql-reference/sql/create-stage.md)) that references the external location (that is, GCS bucket) where
   your data files are staged.
4. Create an external table (by using [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)) that references the named stage and integration.
5. Manually refresh the external table metadata one time by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) … REFRESH to synchronize the
   metadata with any changes that occurred after Step 4. Thereafter, the Pub/Sub notifications trigger the metadata refresh automatically.
6. Configure Snowflake access control privileges for any additional roles to grant them query access to the external table.

### Microsoft Azure workflow

The following steps provide a high-level overview of the setup and load workflow for external tables that reference Azure stages. For complete instructions, see [Refresh external tables automatically for Azure Blob Storage](tables-external-azure.md):

1. Configure an Event Grid subscription for Azure Storage events.
2. Create a notification integration in Snowflake. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud message queuing services such as Microsoft Event Grid.
3. Create a named stage object (by using [CREATE STAGE](../sql-reference/sql/create-stage.md)) that references the external location (that is, Azure container) where your data files are staged.
4. Create an external table (by using [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)) that references the named stage and integration.
5. Manually refresh the external table metadata one time by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) … REFRESH to synchronize the metadata with any changes that occurred after Step 4. Thereafter, the Event Grid notifications trigger the metadata refresh automatically.
6. Configure Snowflake access control privileges for any additional roles to grant them query access to the external table.

## Querying external tables

[Query](../guides-overview-queries.md) external tables just as you would standard tables.

Snowflake omits query results for records that contain invalid UTF-8 data. After encountering invalid data,
Snowflake continues to scan the file without returning an error message.

To avoid missing records in your query results caused by invalid UTF-8 data, specify `REPLACE_INVALID_CHARACTERS = TRUE` for your
[file format](../sql-reference/sql/create-external-table.md).
Doing so replaces any invalid UTF-8 characters with the Unicode replacement character (`�`) when you query the table.

For Parquet files, you can also set `BINARY_AS_TEXT = FALSE` for your file format so that Snowflake interprets the columns
with no defined logical data type as binary data instead of as UTF-8 text.

### Filtering records in Parquet files

To use row group statistics to prune data in Parquet files, you can include either partition columns, regular
columns, or both in a WHERE clause. The following limitations apply:

* The clause can’t include any VARIANT columns.
* The clause can only include one or more of the following [comparison operators](../sql-reference/operators-comparison.md):

  + =
  + >
  + <
* The clause can only include one or more [logical/Boolean operators](../sql-reference/operators-logical.md), as well as the
  [STARTSWITH](../sql-reference/functions/startswith.md) SQL function.

In addition, queries in the form `"value:<path>::<data type>"` (or the [GET](../sql-reference/functions/get.md)/
[GET_PATH , :](../sql-reference/functions/get_path.md) function equivalent) use the vectorized scanner. Queries in the form
`"value"` or simply `"value:<path>"` are processed by using the non-vectorized scanner. Convert all time-zone data to a standard
time zone by using the [CONVERT_TIMEZONE](../sql-reference/functions/convert_timezone.md) function for queries that use the vectorized scanner.

You might get better pruning results when files are sorted by a key included in a query filter, and if there are multiple row groups in the files, .

The following table shows similar query structures that show the behaviors in this section, where `et` is an external table and `c1`, `c2`, and `c3` are virtual columns:

| Optimized | Not optimized |
| --- | --- |
| `SELECT c1, c2, c3 FROM et;` | `SELECT value:c1, c2, c3 FROM et;` |
| `SELECT c1, c2, c3  FROM et WHERE c1 = 'foo';`  `SELECT c1, c2, c3 FROM et WHERE value:c1::string = 'foo';` | `SELECT c1, c2, c3 FROM et WHERE value:c1 = 'foo';` |

### Persisted query results

Similar to tables, the query results for external tables [persist](querying-persisted-results.md) for 24 hours. Within this 24-hour period, the following operations invalidate and purge the query result cache for external tables:

* Any DDL operation that modifies the external table definition. This includes explicitly modifying the external table definition (by using ALTER EXTERNAL TABLE) or recreating the external table (by using [CREATE OR REPLACE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)).
* Changes in the set of files in cloud storage that are registered in the external table metadata. Either automatic refresh operations by using the event notification service for the storage location or manual refresh operations (by using ALTER EXTERNAL TABLE … REFRESH) invalidate the result cache.

> **Note:**
>
> Changes in the referenced files in cloud storage don’t invalidate the query results cache in the following circumstances, which lead to outdated query results:
>
> * The automated refresh operation is turned off (that is, AUTO_REFRESH = FALSE) or isn’t configured correctly.
> * The external table metadata isn’t refreshed manually.

## Example: Remove older staged files from external table metadata

The following steps provide an example of how you can use an [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) … REMOVE FILES statement to remove older staged files from the metadata in an external table . The stored procedure removes the files from the metadata based on their last modified date in the stage:

1. Create the stored procedure by using a [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) statement:

   ```sqlexample
   CREATE or replace PROCEDURE remove_old_files(external_table_name varchar, num_days float)
     RETURNS varchar
     LANGUAGE javascript
     EXECUTE AS CALLER
     AS
     $$
     // 1. Get the relative path of the external table
     // 2. Find all files registered before the specified time period
     // 3. Remove the files

     var resultSet1 = snowflake.execute({ sqlText:
       `call exttable_bucket_relative_path('` + EXTERNAL_TABLE_NAME + `');`
     });
     resultSet1.next();
     var relPath = resultSet1.getColumnValue(1);

     var resultSet2 = snowflake.execute({ sqlText:
       `select file_name
        from table(information_schema.EXTERNAL_TABLE_FILES (
            TABLE_NAME => '` + EXTERNAL_TABLE_NAME +`'))
        where last_modified < dateadd(day, -` + NUM_DAYS + `, current_timestamp());`
     });

     var fileNames = [];
     while (resultSet2.next())
     {
       fileNames.push(resultSet2.getColumnValue(1).substring(relPath.length));
     }

     if (fileNames.length == 0)
     {
       return 'nothing to do';
     }

     var alterCommand = `ALTER EXTERNAL TABLE ` + EXTERNAL_TABLE_NAME + ` REMOVE FILES ('` + fileNames.join(`', '`) + `');`;

     var resultSet3 = snowflake.execute({ sqlText: alterCommand });

     var results = [];
     while (resultSet3.next())
     {
       results.push(resultSet3.getColumnValue(1) + ' -> ' + resultSet3.getColumnValue(2));
     }

     return results.length + ' files: \n' + results.join('\n');

     $$;

     CREATE or replace PROCEDURE exttable_bucket_relative_path(external_table_name varchar)
     RETURNS varchar
     LANGUAGE javascript
     EXECUTE AS CALLER
     AS
     $$
     var resultSet = snowflake.execute({ sqlText:
       `show external tables like '` + EXTERNAL_TABLE_NAME + `';`
     });

     resultSet.next();
     var location = resultSet.getColumnValue(10);

     var relPath = location.split('/').slice(3).join('/');
     return relPath.endsWith("/") ? relPath : relPath + "/";

     $$;
   ```
2. Call the stored procedure:

   ```sqlexample
   -- Remove all files from the exttable external table metadata:
   call remove_old_files('exttable', 0);

   -- Remove files staged longer than 90 days ago from the exttable external table metadata:
   call remove_old_files('exttable', 90);
   ```

   Alternatively, you can create a task by using [CREATE TASK](../sql-reference/sql/create-task.md) to call the stored procedure periodically to remove older files from the external table metadata.

## Apache Hive metastore integration

Snowflake supports integrating [Apache Hive](https://hive.apache.org/) metastores with Snowflake by using external tables. The Hive connector detects metastore events and transmits the events to Snowflake to keep the external tables synchronized with the Hive metastore. With this capability, users can manage their data in Hive while querying it from Snowflake.

For instructions, see [Integrate Apache Hive metastores with Snowflake](tables-external-hive.md).

## External table DDL

To support creating and managing external tables, Snowflake provides the following set of special DDL commands:

* [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)
* [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md)
* [DROP EXTERNAL TABLE](../sql-reference/sql/drop-external-table.md)
* [DESCRIBE EXTERNAL TABLE](../sql-reference/sql/desc-external-table.md)
* [SHOW EXTERNAL TABLES](../sql-reference/sql/show-external-tables.md)

## Required access privileges

Creating and managing external tables requires a role with a minimum of the following role permissions:

| Object | Privilege |
| --- | --- |
| Database | USAGE |
| Schema | USAGE, CREATE STAGE (if creating a new stage), CREATE EXTERNAL TABLE |
| Stage (if using an existing stage) | USAGE |

## Information Schema

The [Snowflake Information Schema](../sql-reference/info-schema.md) includes views and table functions you can query to retrieve information about your external tables and their staged data files.

### View

[EXTERNAL_TABLES view](../sql-reference/info-schema/external_tables.md)
:   Displays information for external tables in the specified (or current) database.

### Table functions

[AUTO_REFRESH_REGISTRATION_HISTORY](../sql-reference/functions/auto_refresh_registration_history.md)
:   Retrieve the history of data files that are registered in the metadata of specified objects and the credits billed for these operations.

[EXTERNAL_TABLE_FILES](../sql-reference/functions/external_table_files.md)
:   Retrieve information about the staged data files that are included in the metadata for a specified external table.

[EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY](../sql-reference/functions/external_table_registration_history.md)
:   Retrieve information about the metadata history for an external table, including any errors found when refreshing the metadata.

---
title: Introduction to loading semi-structured data
source: https://docs.snowflake.com/en/user-guide/semistructured-intro.md
section: User Guide
---

# Introduction to loading semi-structured data

This topic describes semi-structured data and provides information about how to load and store it in Snowflake.

## About semi-structured data

Semi-structured data is data that does not conform to the standards of traditional structured data, but contains tags (labels)
or other types of mark-up that identify individual, distinct entities within the data.

Two of the key attributes that distinguish semi-structured data from structured data are nested data structures and the lack of a fixed schema:

* Structured data requires a fixed schema that is defined before the data can be loaded and queried in a relational database system. Semi-structured data does not require a prior definition of a schema and can constantly evolve (i.e. new attributes can be added at any time).

  In addition, entities within the same class may have different attributes even though they are grouped together, and the order of the attributes is not important.
* Unlike structured data, which represents data as a flat table, semi-structured data can contain N-level hierarchies of nested information.

## About hierarchical data

Semi-structured data is usually organized hierarchically.
Complex data structures can be built by nesting simpler data types, such as [arrays](../sql-reference/data-types-semistructured.md) and
[objects](../sql-reference/data-types-semistructured.md). (Note: a Snowflake OBJECT corresponds to a “dictionary” or a “map”. A Snowflake
object is not an “object” in the sense of “object-oriented programming”.)

For example, JSON data can contain an object that contains an array.
Each cell of that array might itself contain a nested object or array.

You can use Snowflake data types to construct a hierarchy to hold your semi-structured data by using the
following properties of data types:

* A [VARIANT](../sql-reference/data-types-semistructured.md) can hold a value of any other data type, including an ARRAY or an OBJECT.
* An ARRAY or OBJECT holds a value of type VARIANT.

For example, suppose that you want to store the dates on which different types of natural disasters occurred. You might create
an OBJECT that contains the keys ‘Hurricane’, ‘Earthquake’, ‘Flood’, etc. The value associated with each of those keys can
be an ARRAY that contains the dates on which each type of disaster occurred. Because the value in each key-value
pair must be a VARIANT, each array of dates would be stored as an ARRAY wrapped inside a VARIANT inside the corresponding OBJECT.
The top level of the hierarchy would look similar to the following (the curly braces indicate an OBJECT, which contains key-value
pairs):

```sqlexample
{
    "Flood": flood_date_array::VARIANT,
    "Earthquake": earthquake_date_array::VARIANT,
    ...
}
```

As another example, suppose that you want to store a single list of disasters in chronological order. In that case, your outer
data type might be ARRAY. Each cell of that ARRAY might contain an OBJECT (wrapped in a VARIANT) that contains
key-value pairs with information about the event. For example, each OBJECT that describes an earthquake might have keys
like ‘Timestamp’, ‘Location’, and ‘Magnitude’. Each OBJECT that describes a tornado might have keys like ‘Timestamp’ and
‘Maximum_wind_speed’.

```sqlexample
[
    {
        "Event_ID": 54::VARIANT,
        "Type": "Earthquake"::VARIANT,
        "Magnitude": 7.4::VARIANT,
        "Timestamp": "2018-06-09 12:32:15"::TIMESTAMP_LTZ::VARIANT
        ...
    }::VARIANT,
    {
        "Event_ID": 55::VARIANT,
        "Type": "Tornado"::VARIANT,
        "Maximum_wind_speed": 186::VARIANT,
        "Timestamp": "2018-07-01 09:42:55"::TIMESTAMP_LTZ::VARIANT
        ...
    }::VARIANT
]
```

You can create data hierarchies of almost any depth or breadth (up to the limit of storage for each data type). For example,
an OBJECT that contains information about a tornado might need information about the wind speed at different times during
the tornado, so your data structure might look like the following:

1. The top level is an ARRAY.
2. Each cell of that ARRAY contains one OBJECT that describes one tornado.
3. Each OBJECT contains an ARRAY of windspeed data.
4. Each cell of that inner ARRAY is an OBJECT that contains data with keys such as:

   * Timestamp of the windspeed.
   * Location of the windspeed.
   * The windspeed in KPH (kilometers per hour).

   In some cases, data might be incomplete. For example, if the windspeed at a particular location was estimated based on the
   damage visible after the tornado (rather than measured directly during the tornado), then the data might include location and
   windspeed, but not a timestamp.

## Load semi-structured data

Snowflake can import semi-structured data from JSON, Avro, ORC, Parquet, and XML formats and store it in
[Snowflake data types designed specifically to support semi-structured data](../sql-reference/data-types-semistructured.md).

Depending upon the structure of the data, the size of the data, and the way that the user chooses to import the data,
semi-structured data can be stored in a single column or split into multiple columns.

The steps for loading semi-structured data into tables are similar to those for loading structured data into tables.
However, when you load and store semi-structured data, you can also explicitly specify all, some, or none of the structure:

* If your data is a set of key-value pairs, you can load it into a column of type OBJECT.
* If your data is an array, you can load it into a column of type ARRAY.
* If you have hierarchical data, you may do either of the following:

  + Split the data across multiple columns. You may:

    - Explicitly [extract and transform](data-load-transform.md) columns from semi-structured data into separate
      columns in target tables.
    - Use Snowflake to automatically [detect and retrieve](data-load-overview.md) the
      column definitions from staged semi-structured data files. Create Snowflake tables, external tables, or views from the column
      definitions. To save time, create tables with the column definitions automatically retrieved from the staged files.
  + Store the data in a single column of type VARIANT. You may:

    - Specify the structure explicitly (e.g. specify a hierarchy of VARIANT, ARRAY, and OBJECT data types).
    - Load the data without explicitly specifying the structure. If you specify a data format that Snowflake recognizes and
      parses (JSON, Avro, Parquet, or ORC), the data is converted to an internal data format that uses Snowflake
      VARIANT, ARRAY, and OBJECT data types.

If the data is complex or an individual value requires more than about 128 MB of storage space, then you can use more than one of
the preceding techniques. For example, you can split the data into multiple columns, and some of those columns can contain
an explicitly specified hierarchy of data types.

You can load semi-structured data the following ways:

* Specify the input data format and the Snowflake data type while creating the table and loading the data. For example, in the code
  below, the VARIANT data type is specified in the CREATE TABLE statement, while the JSON input data format is specified in the
  `TYPE = <data_format>` clause of the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command:

  ```sqlexample
  CREATE TABLE my_table (my_variant_column VARIANT);
  COPY INTO my_table ... FILE FORMAT = (TYPE = 'JSON') ...
  ```
* Specify the input data format and the Snowflake data type by calling an appropriate function to convert the data. For example, to
  convert JSON-formatted data to a VARIANT value, call [PARSE_JSON](../sql-reference/functions/parse_json.md), as shown below:

  ```sqlexample
  INSERT INTO my_table (my_variant_column) SELECT PARSE_JSON('{...}');
  ```

When data is stored in ARRAY, OBJECT, or VARIANT data types, or a hierarchy of those types, you can
[query it](querying-semistructured.md).

## Store semi-structured data

Semi-structured data is typically stored in the following Snowflake data types:

* [ARRAY](../sql-reference/data-types-semistructured.md): similar to an array in other languages.
* [OBJECT](../sql-reference/data-types-semistructured.md): similar to a JSON object, also called a “dictionary”, “hash”, or “map” in many
  languages. This contains key-value pairs.
* [VARIANT](../sql-reference/data-types-semistructured.md): a data type that can hold a value of any other data type (including ARRAY and OBJECT).
  VARIANT is used to build and store hierarchical data.

(If imported data is split into multiple columns before it is stored, then some or all of those columns can be simple data types,
such as FLOAT, VARCHAR, etc.)

The ARRAY, OBJECT, and VARIANT data types can be used individually, or nested to build a hierarchy.

If the data is imported in JSON, Avro, ORC, or
Parquet format, then Snowflake can build the hierarchy for you and store it in a VARIANT. You can also create a hierarchy manually.

Regardless of how the hierarchy was constructed, Snowflake converts the data to an optimized internal storage format that uses
ARRAY, OBJECT, and VARIANT. This internal storage format supports fast and efficient SQL querying.

More information about [ARRAY](../sql-reference/data-types-semistructured.md), [OBJECT](../sql-reference/data-types-semistructured.md), and
[VARIANT](../sql-reference/data-types-semistructured.md) data types is in [Semi-structured data types](../sql-reference/data-types-semistructured.md).

## Query semi-structured data

Snowflake supports operators for:

* [Accessing an element in an array.](../sql-reference/data-types-semistructured.md)
* [Retrieving a specified value from a key-value pair in an OBJECT.](../sql-reference/data-types-semistructured.md)
* [Traversing the levels of a hierarchy stored in a VARIANT.](querying-semistructured.md)

More information about querying semi-structured data is in [Querying Semi-structured Data](querying-semistructured.md).

For information about querying XML by specifying XML tags, see the documentation of the [XMLGET](../sql-reference/functions/xmlget.md)
function.

---
title: Introduction to OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-intro.md
section: User Guide
---

# Introduction to OAuth

Snowflake enables OAuth for clients through integrations. An integration is a Snowflake object that provides an interface between Snowflake
and third-party services. Administrators configure OAuth using a
[Security integration](../sql-reference/sql/create-security-integration.md), which enables clients that support OAuth to redirect
users to an authorization page and generate access tokens (and optionally, refresh tokens) for accessing Snowflake.

Snowflake supports the [OAuth 2.0](https://oauth.net/2/) protocol for authentication and authorization using one of the options
below:

* [Snowflake OAuth](oauth-snowflake-overview.md)
* [External OAuth](oauth-ext-overview.md)

The following table compares Snowflake OAuth and External OAuth:

| Category | Snowflake OAuth | External OAuth |
| --- | --- | --- |
| Modify client application | Required | Required |
| Client application browser access | Required | Not required |
| Programmatic clients | Requires a browser | Best fit |
| Driver property | `authenticator = oauth` | `authenticator = oauth` |
| Security integration syntax | `create security integration type = oauth ...` | `create security integration type = external_oauth` |
| OAuth flow | OAuth 2.0 code grant flow | Any OAuth flow that the client can initiate with the External OAuth server |

## Auditing OAuth logins

To query login attempts by Snowflake users, Snowflake provides a login history:

* [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](../sql-reference/functions/login_history.md) (table function)
* [LOGIN_HISTORY view](../sql-reference/account-usage/login_history.md) (view)

When OAuth is used to authenticate (successfully or unsuccessfully), the FIRST_AUTHENTICATION_FACTOR column in the output has the value
OAUTH_ACCESS_TOKEN.

## Private connectivity

Snowflake supports External OAuth with private connectivity to the Snowflake service.

Snowflake OAuth and Tableau can be used with private connectivity to Snowflake as follows:

> Tableau Desktop:
> :   Starting with Tableau 2020.4, Tableau contains an embedded OAuth client that supports connecting to Snowflake with the account URL
>     for private connectivity to the Snowflake service.
>
>     After upgrading to Tableau 2020.4, no further configuration is needed; use the corresponding private connectivity URL for either AWS
>     or Azure to connect to Snowflake.
>
> Tableau Cloud:
> :   Starting with Tableau 2020.4, users can optionally configure Tableau Cloud to use the embedded OAuth Client to connect to Snowflake
>     with the account URL for private connectivity to the Snowflake service.
>
>     To use this feature, create a new [Custom Client](oauth-custom.md) security integration and follow the
>     [Tableau instructions](https://help.tableau.com/current/server/en-us/config_oauth_snowflake.htm).
>
> > **Important:**
> >
> > To determine the account URL to use with private connectivity to the Snowflake service, call the
> > [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function.
>
> Looker:
> :   Currently, combining Snowflake OAuth and Looker requires access to the public Internet. Therefore, you cannot use Snowflake OAuth and
>     Looker with private connectivity to the Snowflake service.

For more information, refer to:

* [SSO with private connectivity](admin-security-fed-auth-overview.md)
* [Configure Snowflake OAuth for partner applications](oauth-partner.md)

## Clients, drivers, and connectors

Supported clients, drivers, and connectors can use OAuth to verify user login credentials.

Note the following:

* It is necessary to set the `authenticator` parameter to `oauth` and the `token` parameter to the
  `oauth_access_token`.
* When passing the `token` value as a URL query parameter, it is necessary to URL-encode the `oauth_access_token` value.
* When passing the `token` value to a Properties object (e.g. JDBC Driver), no modifications are necessary.

For more information about connection parameters, refer to the reference
documentation for the following clients, drivers, or connectors:

* [Snowflake CLI](../developer-guide/snowflake-cli/connecting/configure-connections.md)
* [SnowSQL](snowsql-start.md)
* [Python](../developer-guide/python-connector/python-connector-connect.md)
* [Go](https://godoc.org/github.com/snowflakedb/gosnowflake#hdr-Connection_Parameters)
* [JDBC](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC](../developer-guide/odbc/odbc-parameters.md)
* [Spark Connector](spark-connector-use.md)
* [.NET Driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md#create-a-connection)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

## Client Redirect

Snowflake supports using Client Redirect with Snowflake OAuth and External OAuth, including using Client Redirect and OAuth with supported
Snowflake Clients.

For more information, refer to [Redirecting client connections](client-redirect.md).

## Replication

Snowflake supports replication and failover/failback with both the Snowflake OAuth and External OAuth security integrations from the source
account to the target account.

For details, refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

---
title: Introduction to object tagging
source: https://docs.snowflake.com/en/user-guide/object-tagging/introduction.md
section: User Guide
---

# Introduction to object tagging

## What is a tag?

A tag is a schema-level object that can be assigned to another Snowflake object. Users associate a tag with an arbitrary string value when
assigning the tag to a Snowflake object. Snowflake stores the tag and its string value as a key-value pair. The tag must be unique for
your schema, and the tag value is always a string.

The following are general characteristics of object tagging:

* An object can have multiple tags at the same time. For more information, see Tag quotas.
* A single tag can be assigned to different object types at the same time (for example, a warehouse and a table simultaneously).
* At the time of assignment, the tag string value can be duplicated or remain unique. For example, multiple tables can be assigned
  the `cost_center` tag and the tag can always have the string value `sales`. Alternatively, the string value could be different
  for each table (for example, `engineering`, `marketing`, and `finance`).

After defining the tags and assigning the tags to Snowflake objects, you can query them to monitor usage on objects and facilitate data
governance operations, such as auditing and reporting.

### Highlights

Ease of use:
:   Define a tag once and apply it to as many different objects as desirable.

Tag inheritance:
:   Because tags are inherited, applying the tag to objects higher in the securable objects hierarchy results in the tag being applied to all
    child objects. For example, if a tag is set on a table, the tag will be inherited by all columns in that table.

Automatic propagation:
:   Configure a tag so it automatically propagates to target objects from a source object.

Consistent assignment with replication:
:   Snowflake replicates tags and their assignments within the primary database to the secondary database.

Centralized or decentralized management:
:   Tags support different management approaches to facilitate compliance with internal and external regulatory requirements.

    In a centralized approach, you can create a `tag_admin` custom role that creates and applies tags to Snowflake objects.

    In a decentralized approach, individual teams apply tags to Snowflake objects and the `tag_admin` custom role creates tags to ensure
    consistent tag naming.

## Using tags for data protection

Because tags can be assigned to tables, views, and columns, setting a tag and then querying the tag enables the
discovery of a multitude of database objects and columns that contain sensitive information. Upon discovery, data stewards can determine
how best to make that data available, such as selective filtering using [row access policies](../security-row-intro.md), or
using [masking policies](../security-column-intro.md) to determine whether the data is tokenized, fully masked, partially
masked, or unmasked.

You can also combine object tagging and masking policies to simplify the governance of data. With this approach, you assign a masking
policy to a tag, then assign that tag to a table or column. When the data type of a column matches the data type in the masking policy
signature, the tagged column is automatically protected by the masking policy. For more information, see
[Tag-based masking policies](../tag-based-masking-policies.md).

## Using tags to monitor resource usage

Assigning tags to warehouses enables accurate resource usage monitoring. Querying tags on resources allows for easy resource
grouping by cost center or some other organization unit. Additionally, the tag can facilitate analyzing relatively short-term business
activities, such as projects, to provide a more granular insight into what, when, and how resources were used.

For an example of using tags to monitor resource usage, see [Setting up object tags for cost attribution](../cost-attributing.md).

## How a tag gets associated with an object

A tag can be associated with an object in the following ways:

* Someone manually set the tag on the object using a CREATE <object> or ALTER <object> command. See [Set a tag](work.md).
* The object inherited the tag from an object higher up in the Snowflake securable object hierarchy. For example, a warehouse in an account
  inherits tags set on the account. See [Tag inheritance](inheritance.md).
* The tag was automatically propagated from one object to another. Tags can be propagated when an object depends on another object (for
  example, a view based on a tagged table) or when data moves from a tagged object to another object (for example, using a CTAS statement
  to create a table). See [Automatic tag propagation with user-defined tags](propagation.md).
* The tag was automatically set on a column that was classified as containing sensitive data. To learn how sensitive data
  classification uses a tag map to set these tags, see [About classification tags](../classify-intro.md).
* Someone used the CREATE TABLE … LIKE or CREATE TABLE … CLONE command to create a table from an existing table with tags.

### Determine how a tag was associated with an object

The following views and functions include the `apply_method` column, which shows how a tag was associated with an object.

> * View: [ACCOUNT_USAGE.TAG_REFERENCES](../../sql-reference/account-usage/tag_references.md)
> * Functions:
>
>   + [INFORMATION_SCHEMA.TAG_REFERENCES](../../sql-reference/functions/tag_references.md)
>   + [INFORMATION_SCHEMA.TAG_REFERENCES_ALL_COLUMNS](../../sql-reference/functions/tag_references_all_columns.md)
>   + [ACCOUNT_USAGE.TAG_REFERENCES_WITH_LINEAGE](../../sql-reference/functions/tag_references_with_lineage.md)

For example, to find whether a tag was set manually on the object or was propagated, you could execute the following command and check the
value of the `apply_method` column.

> ```sqlexample
> SELECT tag_name, tag_value, apply_method, level, domain
>   FROM TABLE(my_db.INFORMATION_SCHEMA.TAG_REFERENCES('my_table', 'TABLE'));
> ```

## Tag quotas

You can set a maximum of 50 tags on a single object, including tables and views.

If you have reached the limit on tags and want to drop one, execute an ALTER <object> UNSET TAG statement.

### Separate quota for columns

You can set a maximum of 50 different tags on the columns of a single table. This is a limit on all of the columns combined.

The column limit is separate from the limit on the number of tags set on a table. For example, suppose you create the following table with
tags on both the table and its columns:

```sqlexample
CREATE TABLE t1 (
  COL1 INT WITH TAG (tag1='col1', tag2='col1'),
  COL2 INT WITH TAG (tag1='col2'),
  )
  WITH TAG (tag3='t1');
```

Snowflake allows you to do the following:

* Set 49 more tags on the table `t1`.
* Set 48 more tags on the columns of `t1`. The limit is on *different* tags, so `tag1` isn’t counted twice.

If you run a CREATE TABLE or ALTER TABLE statement to apply tags on the columns of a table, the maximum number of unique tag-entity
associations is 100, where an entity is the table or a column. For example, if you have a table with 1,000 columns and you want to associate
the same tag with every column, you need to run 10 ALTER statements.

## Capabilities that require Enterprise Edition

Creating and setting tags is available to all accounts. However, there are advanced capabilities that require Enterprise Edition or higher.
Your account must be Enterprise Edition or higher to use the following capabilities:

* [Tag propagation](propagation.md)
* [Tag-based masking policies](../tag-based-masking-policies.md)

## Supported objects

The following table lists the supported objects for tags, including columns, based on the Snowflake securable object hierarchy.

A tag can be set on an object with a [CREATE <object>](../../sql-reference/sql/create.md) statement or an [ALTER <object>](../../sql-reference/sql/alter.md) statement unless
specified otherwise in the table below.

A tag can be set on a column using a CREATE TABLE, CREATE VIEW, ALTER TABLE … MODIFY COLUMN, or ALTER VIEW statement.

| Object hierarchy | Supported objects | Notes |
| --- | --- | --- |
| Organization | Account | A tag can be [set](../../sql-reference/sql/alter-account.md) on your [current account](../../sql-reference/functions/current_account.md) by a role with the global APPLY TAG privilege. |
| Account | Application |  |
|  | Application package |  |
|  | Compute pool |  |
|  | Database |  |
|  | Failover group |  |
|  | Integration | All [types](../../sql-reference/sql/create-integration.md) are supported.  Use an [ALTER INTEGRATION](../../sql-reference/sql/alter-integration.md) command to set a tag on the integration. |
|  | Network policy | Use an [ALTER NETWORK POLICY](../../sql-reference/sql/alter-network-policy.md) command to set a tag on a network policy. |
|  | Replication group |  |
|  | Role |  |
|  | Share | Tags are set on the share by the data sharing provider. These tags are not visible to the data sharing consumer. Use an [ALTER SHARE](../../sql-reference/sql/alter-share.md) command to set a tag on the share. |
|  | User |  |
|  | Warehouse |  |
| Database | Database role | Use an [ALTER DATABASE ROLE](../../sql-reference/sql/alter-database-role.md) command to set a tag on a database role. |
|  | Schema |  |
| Schema | Aggregation policy |  |
|  | Alert |  |
|  | Backup set | For [WORM backups](../backups.md). Contains a set of backups for a specific database, schema, or table. |
|  | BUDGET instance | Use an [ALTER BUDGET](../../sql-reference/classes/budget/commands/alter-budget.md) command to set a tag on an instance of the SNOWFLAKE.CORE.BUDGET class. |
|  | CLASSIFICATION instance | Use an [ALTER SNOWFLAKE.ML.CLASSIFICATION](../../sql-reference/classes/classification/commands/alter-classification.md) command to set a tag on an instance of the SNOWFLAKE.ML.CLASSIFICATION class. |
|  | Dynamic table |  |
|  | Event table |  |
|  | External function and UDF | Use an [ALTER FUNCTION](../../sql-reference/sql/alter-function.md) command to set a tag on an external function or UDF. |
|  | External table | You can create an external table with a tag using a [CREATE EXTERNAL TABLE](../../sql-reference/sql/create-external-table.md) statement.  To manage tag assignments on an external table, use the [ALTER TABLE](../../sql-reference/sql/alter-table.md) command. |
|  | Git repository |  |
|  | Apache Iceberg™ table |  |
|  | Image repository |  |
|  | Interactive table |  |
|  | Join policy |  |
|  | Materialized view |  |
|  | Notebook |  |
|  | Password policy |  |
|  | Pipe | Set a tag on a pipe with an [ALTER PIPE](../../sql-reference/sql/alter-pipe.md) statement. |
|  | Policy | Set a tag on a [masking](../../sql-reference/sql/alter-masking-policy.md), [password](../../sql-reference/sql/alter-password-policy.md), [row access](../../sql-reference/sql/alter-row-access-policy.md), [session](../../sql-reference/sql/alter-session-policy.md), [aggregation](../../sql-reference/sql/alter-aggregation-policy.md), [join](../../sql-reference/sql/alter-join-policy.md), or [projection](../../sql-reference/sql/alter-projection-policy.md) policy with the corresponding ALTER *<policy>* statement. |
|  | Procedure | Set a tag on a stored procedure with an [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) statement. |
|  | Projection policy |  |
|  | Session policy |  |
|  | Snapshot | For [block storage volume snapshots](../../developer-guide/snowpark-container-services/block-storage-volume.md). |
|  | Stage | Set a tag on a stage with an [ALTER STAGE](../../sql-reference/sql/alter-stage.md) statement. |
|  | Stream |  |
|  | Streamlit |  |
|  | Table |  |
|  | Task | Set a tag on a task with an [ALTER TASK](../../sql-reference/sql/alter-task.md) statement. |
|  | View |  |
| Table or View | Column | Includes [event tables](../../sql-reference/sql/alter-table-event-table.md). |

## Limitations and considerations

Future grants:
:   [Future grants](../../sql-reference/sql/grant-privilege.md) of privileges on tags are not supported.

    As a workaround, grant the APPLY TAG privilege to a custom role to allow that role to apply tags to another object.

Snowflake Native App:
:   Use caution when creating the setup script when tags exist in a versioned schema. For information, see
    [version schema considerations](../../developer-guide/native-apps/creating-setup-script.md).

---
title: Introduction to organizations
source: https://docs.snowflake.com/en/user-guide/organizations.md
section: User Guide
---

# Introduction to organizations

An *organization* is a first-class Snowflake object that links the accounts owned by your business entity. Organizations simplify account
management and billing, [Replication and Failover/Failback](replication-intro.md), Snowflake Secure Data Sharing, and other
account administration tasks.

This feature allows organization administrators to view, create, and manage all of your accounts across different regions and cloud platforms.

## Types of accounts

An organization can consist of the following types of accounts:

* Organization account: Special account used by organization administrators to manage multi-account organizations and to access usage data
  from premium views in the ORGANIZATION_USAGE schema. For more information, see [Organization accounts](organization-accounts.md).
* Regular Snowflake account, including [trial accounts](admin-trial-account.md).
* Snowflake Open Catalog account: Special account used by service admins and catalog admins to manage catalogs defined in Snowflake Open
  Catalog. For more information, see [Snowflake Open Catalog overview](https://other-docs.snowflake.com/en/opencatalog/overview).

## Benefits

* A central view of all accounts within your organization. For more information, refer to [Viewing accounts in your organization](organizations-manage-accounts-view.md).
* Self-service account creation. For more information, refer to [Creating an account](organizations-manage-accounts-create.md).
* Data availability and durability by leveraging data replication and failover. For more information, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).
* Seamless data sharing with Snowflake consumers across regions. For more information, see [Share data securely across regions and cloud platforms](secure-data-sharing-across-regions-platforms.md).
* Ability to monitor and understand usage across all accounts in the organization. For more information, see [Organization Usage](../sql-reference/organization-usage.md) views.

## Organization creation

Snowflake customers never directly create an organization. For users who sign-up for a Snowflake account using the self-service option,
an organization is automatically created with a system-generated name when the account is created. For entities who work directly with
Snowflake personnel to set up accounts, Snowflake creates the organization to which the accounts belong using a custom name. In either
case, users can create additional accounts that belong to the organization after it is created with the initial account.

## Viewing the name of your organization and its accounts

If you are the [organization administrator](organization-administrators.md), you can view the name of your organization and
its accounts through the web interface or using SQL:

> SQL:
> :   Execute a [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md) command.
>
> [Snowsight](ui-snowsight-gs.md):
> :   In the navigation menu, select Admin » Accounts. The organization name is listed above the account names.

Users with any role can execute the [CURRENT_ORGANIZATION_NAME](../sql-reference/functions/current_organization_name.md)
function to return the organization of the current account.

Users with any role can also find the organization name and account name for a specific account that they have previously signed in to.
See [Finding the organization and account name for an account](admin-account-identifier.md).

## Changing the name of your organization

If you want to change the name of an organization, for example to change a system-generated name to a more user-friendly one,
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

When you contact Snowflake Support, you must decide whether users can temporarily access accounts in the
organization using the original [account URL](organizations-connect.md). If you keep the original account URL, it is automatically dropped
after 90 days, at which time users must use the new account URL to access the account. If you want to drop the account URL before the 90 days
expire, see [Deleting an organization URL](organizations-manage-accounts-urls.md).

## Deleting an organization

To delete your Snowflake organization:

1. [Delete all accounts in the organization](organizations-manage-accounts-delete.md), except the account being used for the
   deletion.
2. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to delete the last account and the organization.

---
title: Introduction to replication and failover across multiple accounts
source: https://docs.snowflake.com/en/user-guide/account-replication-intro.md
section: User Guide
---

# Introduction to replication and failover across multiple accounts

This feature enables the replication of objects from a *source* account to one or more *target* accounts in the same organization.
Replicated objects in each target account are referred to as *secondary* objects and are replicas of the *primary* objects in the source
account. Replication is supported across [regions](intro-regions.md) and across
[cloud platforms](intro-cloud-platforms.md).

## Region support for replication and failover/failback

All Snowflake regions across Amazon Web Services, Google Cloud Platform, and Microsoft Azure support replication.

Customers can replicate across all regions within a [region group](admin-account-identifier.md). To replicate between regions in
different region groups, (i.e. from a Snowflake commercial region to a Snowflake government or Virtual Private Snowflake region),
please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Replication groups and failover groups

A *replication group* is a defined collection of objects in a source account that are replicated as a unit to one or more target accounts. Replication groups provide read-only access for the replicated objects.

A *failover group* is a replication group that can also fail over. A secondary failover group in a target account provides read-only access for the replicated objects. When a secondary failover group is promoted to become the primary failover group, read-write access is available. You can promote any target account specified in the list of allowed accounts in a failover group to serve as the primary failover group.

Replication and failover groups provide point-in-time consistency for the objects on the target account. The objects that can be included in a replication or failover group are listed below in Replicated objects.

### Replication feature / edition matrix

Note that some replication features are only available for Business Critical Edition (or higher).
The following table lists the availability of replication features for each Snowflake edition:

| Feature | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| Database replication | ✔ | ✔ | ✔ | ✔ |
| Share replication | ✔ | ✔ | ✔ | ✔ |
| Replication Group | ✔ | ✔ | ✔ | ✔ |
| Account object (other than database and share) replication |  |  | ✔ | ✔ |
| Failover Group |  |  | ✔ | ✔ |
| Data protected with Tri-Secret Secure |  |  | ✔ | ✔ |
| Dataset replication |  |  | ✔ | ✔ |
| Cortex Search Service replication |  |  | ✔ | ✔ |

## Replicated objects

This feature supports replicating the objects listed below. Database replication and share replication are available on all editions.
Replication of all other objects is only available for Business Critical Edition (or higher). For details on feature availability,
see the Replication feature / edition matrix.

| Object | Type or Feature | Replicated | Notes |
| --- | --- | --- | --- |
| Databases |  | ✔ | Replication of some databases is not supported or might fail the refresh operation. For more information, see Current limitations of replication. |
| External volumes |  | ✔ | Failover group replication requires Business Critical Edition or higher later. Replication group replication is available to all accounts. |
| Integrations | Security, API, Notification, Storage, External Access | ✔ | For additional caveats and details on the supported types, see Integration replication.  Requires Business Critical Edition (or higher). |
| Listings |  | ✔ | Requires Business Critical Edition (or higher). |
| Network policies |  | ✔ | Requires Business Critical Edition (or higher). |
| Parameters (account level) |  | ✔ | Requires Business Critical Edition (or higher). |
| Profiles |  | ✔ | Requires Business Critical Edition (or higher). |
| Programmatic access tokens for users |  | ✔ | If users and roles are replicated, programmatic access tokens for users are replicated automatically. |
| Resource monitors |  | ✔ | Resource monitor notifications for non-administrator users are replicated if you include `users` in the group, however account administrator notification settings are not replicated. For more information, see Replication of resource monitor email notification settings.  Requires Business Critical Edition (or higher). |
| Roles |  | ✔ | * Includes [account and database roles](security-access-control-overview.md). * Includes privileges granted to roles, as well as roles granted to roles (i.e. hierarchies of roles). * If users and roles are replicated, roles granted to users are also replicated. * The REPLICATE and FAILOVER privileges are *not* replicated. * Requires Business Critical Edition (or higher). |
| Shares |  | ✔ | Replication of [inbound shares](data-share-consumers.md) (shares from providers) is *not* supported. |
| Users |  | ✔ | Requires Business Critical Edition (or higher). |
| Warehouses |  | ✔ | Requires Business Critical Edition (or higher). Includes [interactive warehouses](interactive.md). |
| Workspaces |  | ✔ | Requires Business Critical Edition (or higher). |

### Database replication

Snowflake account replication supports replicating databases. Replication for a database includes the objects contained in that
database. The refresh operation for a database includes changes to the objects and data since the previous refresh for that database.

If `roles` are replicated (in the same or different replication or failover group), the database refresh also synchronizes the
privilege grants on the secondary database and the objects in the database (schemas, tables, views, etc.) to roles in the account.
Refer to Grants for database objects for more details.

Replication of some databases is not supported or might fail the refresh operation. For more information, see
Current limitations of replication.

#### Replicated database objects

When a primary database is replicated, a snapshot of its database objects and data is transferred to the secondary database. However,
some database objects are not replicated. The following table indicates which database objects are replicated to a secondary database.

For specific usage information about these objects, see [Replication considerations](account-replication-considerations.md).

> **Note:**
>
> Objects that are *not* supported for replication are skipped during replication and won’t be available in the target account post failover.

| Object | Type or Feature | Replicated | Notes |
| --- | --- | --- | --- |
| Schemas |  | ✔ | By default, all schemas in replicated databases are replicated. If you use failover groups, you can choose which schemas within a database are replicated. For more information, see [Schema-level replication for failover groups](account-replication-config.md). |
| Tables | Permanent tables | ✔ |  |
|  | Transient tables | ✔ |  |
|  | Error tables | ✔ | For more information, see [DML error logging](data-load-overview.md). |
|  | Temporary tables |  |  |
|  | Automatic Clustering of clustered tables | ✔ |  |
|  | Dynamic tables | ✔ | For more information, see [Replication and dynamic tables](account-replication-considerations.md). |
|  | External tables |  |  |
|  | Hybrid tables |  |  |
|  | Apache Iceberg™ tables | ✔ | Only Snowflake-managed Iceberg tables are supported. Replication for Iceberg tables requires external volume replication. For more information, see [Configure replication for Snowflake-managed Apache Iceberg™ tables](tables-iceberg-replication.md). |
|  | Interactive tables | ✔ |  |
|  | Table constraints | ✔ | Except if a foreign key in the database references a primary/unique key in another database. . |
| Event tables |  |  |  |
| Sequences |  | ✔ |  |
| Views | Views | ✔ | If a view references any object in another database (e.g. table columns, other views, UDFs, or stages), . both databases must be replicated. |
|  | Materialized views | ✔ |  |
|  | Secure views | ✔ |  |
|  | Semantic views | ✔ | If a semantic view references any other objects (for example, tables, views, and Cortex Search Services), you must also replicate those objects. |
| User-defined types |  | ✔ |  |
| File formats |  | ✔ |  |
| Stages | Stages | ✔ | Supported for replication and failover groups only. Not supported for database replication. . For more information, see [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md). |
|  | Temporary stages |  |  |
| Pipes |  | ✔ | Supported for replication and failover groups only. Not supported for database replication. . For more information, see [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md). |
| Stored procedures |  | ✔ | For more information, see [Replication of stored procedures and user-defined functions (UDFs)](account-replication-considerations.md). |
| Streams |  | ✔ | For more information, see [Replication and streams](account-replication-considerations.md). |
| Tasks |  | ✔ | For more information, see [Replication and tasks](account-replication-considerations.md). |
| Data metric functions (DMFs) | Data Quality | ✔ | For more information, see [Replication of data metric functions (DMFs)](account-replication-considerations.md). |
| UDFs |  | ✔ | For more information, see [Replication of stored procedures and user-defined functions (UDFs)](account-replication-considerations.md). |
| Policies | Aggregation policies | ✔ |  |
|  | Authentication policies | ✔ |  |
|  | Column-level Security (masking) | ✔ | For masking, row access, and tag-based masking policies, see [policy replication considerations](database-replication-considerations.md). |
|  | Join policies | ✔ |  |
|  | Password policies | ✔ |  |
|  | Privacy policies | ✔ | For more information, see [Privacy policies](account-replication-considerations.md). |
|  | Projection policies | ✔ |  |
|  | Row access policies | ✔ |  |
|  | Session policies | ✔ | For session, password, and authentication policies, see [replication and security policies](account-replication-considerations.md). |
|  | Tag-based masking policies | ✔ |  |
|  | Backup policies | ✔ | * [Backups](backups.md) are available for all Snowflake editions. * Backups with retention lock and backups with legal holds are available for Business Critical Edition (or higher).   To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
|  | Storage lifecycle policies | ✔ | For information about replication of policies and archived data, see [Replication and storage lifecycle policies](account-replication-considerations.md). |
| Tags | Object Tagging | ✔ | For tags, see [Replication and tags](account-replication-considerations.md). |
| Alerts |  | ✔ |  |
| Secrets | Secrets for External API Authentication | ✔ | You can replicate secrets by using a replication group and failover group. For additional details, see [Replication and secrets](account-replication-considerations.md). |
| Network rules |  | ✔ | For replication of network policies that use network rules, see [Replicating network policies](account-replication-security-integrations.md). |
| Backups |  |  |  |
| Backup sets |  | ✔ | * [Backups](backups.md) are available for all Snowflake editions. * Backups with retention lock and backups with legal holds are available for Business Critical Edition (or higher).   To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| Class instances | CUSTOM_CLASSIFIER | ✔ | Replication is supported for instances of the [CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier.md) class. Instances of all other Snowflake [classes](../sql-reference/snowflake-db-classes.md) are *not* replicated. For the full list of Snowflake classes, see [Available classes](../sql-reference-classes.md). |
| Packages policies | Python UDF, UDTF, stored procedures | ✔ | If there is a [packages policy](../developer-guide/udf/python/packages-policy.md) set on the source account, in order to successfully replicate account objects, the database containing the packages policy *must* be replicated to the target account in the same or different replication or failover group. Otherwise, the refresh operation fails with a [dangling references error](account-replication-considerations.md). |
| Objects for machine learning workflows | Models | ✔ | For usage information, see [Snowflake Model Registry](../developer-guide/snowflake-ml/model-registry/overview.md). |
|  | Datasets | ✔ | For information about how replication works for Datasets, see Dataset replication. |
|  | Online feature tables |  | Online feature tables do not support replication or cloning. |
| Git repository clones |  | ✔ | For information about how replication works for Git repository clones, see [Git repository replication](account-replication-git-repositories.md). For usage information for Git repository clones, see [Using a Git repository in Snowflake](../developer-guide/git/git-overview.md). |
| Snowflake Notebooks |  | ✔ | For information about how replication works for Snowflake Notebooks, see [Notebook replication](ui-snowsight/notebooks-replication.md). |
| dbt projects |  | ✔ | For more information, see [Replication and dbt projects](account-replication-considerations.md). |
| Cortex Knowledge Extensions (CKEs) |  | ✔ | For information about how replication works for [CKEs](snowflake-cortex/cortex-knowledge-extensions/cke-overview.md), see [Replicate a Cortex Search Service](snowflake-cortex/cortex-search/cortex-search-replication.md). |

#### Database replication and encryption

Snowflake protects metadata and data sets at rest and in transit between the source and target accounts. The account
[master key](https://csrc.nist.gov/glossary/term/master_key) (AMK) encrypts the key hierarchy within the account as shown in the
[hierarchical key model](security-encryption-manage.md). Snowflake encrypts replicated data in the target account using the
account master key and the key hierarchy in the target account, regardless of whether you enable Tri-Secret Secure in the target account.

When you enable Tri-Secret Secure in the target account, Snowflake uses the composite master key and the corresponding key hierarchy in
the target account to encrypt the data. Note that target accounts do not have Tri-Secret Secure enabled by default; you must enable this
feature.

For more information about data encryption in Snowflake, see [Understanding end-to-end encryption in Snowflake](security-encryption-end-to-end.md).

### External volume replication

Iceberg tables rely on external volumes, which are
account-level objects that require extra configuration to connect to your external cloud storage. Before you can replicate an Iceberg table,
you must configure replication for external volumes. Account replication supports the replication of external volumes. For more
information about replicating external volumes and Snowflake-managed Iceberg tables, see [Configure replication for Snowflake-managed Apache Iceberg™ tables](tables-iceberg-replication.md).

For more information about external volumes, see [External volume](tables-iceberg.md).

### Integration replication

Account replication supports the replication of integrations for the following features:

* Security integrations of the following types:

  + Federated Authentication & SSO (i.e. SAML2)
  + SCIM
  + Snowflake OAuth
  + External OAuth

  For more information about security integrations, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).
* API integrations.

  After replicating API integrations to a target account, you must grant access to the remote service to the replicated
  external functions. For more information, see [Updating the remote service for API integrations](account-replication-config.md).
* Notification integrations of the following types:

  + TYPE = EMAIL
  + TYPE = QUEUE with DIRECTION = OUTBOUND
  + TYPE = WEBHOOK
* Storage integrations.

  When you replicate a storage integration, you must establish a new trust relationship for your cloud storage in the target
  accounts. To learn more, see [Configure cloud storage access for secondary storage integrations](account-replication-config.md).
* External access integrations.

  For more information about external access integrations, see
  [External network access overview](../developer-guide/external-network-access/external-network-access-overview.md).

### Listing replication

For listings that have auto-fulfillment enabled, this feature allows you to add the listings and (optionally) their shares to a failover
group for replication and failover.

For more information, see [Listing support in Business Continuity and Disaster Recovery](../collaboration/listings-bcdr.md).

### Network policy replication

The feature supports replicating network policies.

For more information, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

### Parameter replication

This feature supports replicating account-level parameters and object parameters. Object parameters are replicated when the object is
included in the replication group. For example, if `WAREHOUSES` are replicated, warehouse-specific parameters
(e.g. [STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md)) are replicated. For a full list, see [Object parameters](../sql-reference/parameters.md).

Account-level parameter replication includes all [Account parameters](../sql-reference/parameters.md) and
[parameters set on the account](admin-account-management.md).
Account-level parameters (e.g. [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md)) are replicated when `ACCOUNT PARAMETERS` is included in
the list of object types for a replication group.

### Profile

This feature supports adding profiles to a failover group. For more information about provider profiles, see [Manage your provider profile](../collaboration/provider-profiles-managing.md).

### Resource monitor replication

This feature supports replicating resource monitors and privileges granted on resource monitors to roles. A secondary resource monitor
follows the same quota reset schedule as its primary. For example, if the quota on the primary resource monitor resets on the first of the
month, and the secondary is first replicated on the 15th of the month, its quota will reset on the first of the next month along with the
primary.

#### Replication of resource monitor email notification settings

Email notification settings for resource monitors are not included with resource monitor replication. Email notifications for
non-administrator users can be replicated with resource monitors. However, account administrator notification settings are
currently not replicated:

* If `users` and `resource monitors` are included in the `object_types` list for the replication or failover group,
  notification settings for non-administrator users are replicated:

  + The `notify_users` list for a warehouse-level resource monitor is replicated to target accounts.
  + [Email notifications for non-administrator users](resource-monitors.md) are sent
    on the target account.
* If `resource monitors` is included in the `object_types` list for the replication or failover group, but `users`
  is not included, the `notify_users` list for a secondary warehouse-level resource monitor is empty.
* Account administrator notification settings are *not* replicated:

  + An account administrator must [enable email notifications](resource-monitors.md) in each account using the web interface.
  + Resource monitor notifications are sent to account administrators if they have enabled email notifications in the source and/or
    target accounts.

### Role replication

This feature supports replicating roles, including role hierarchies. Role objects must be replicated to replicate access privileges.
Replicated access privileges are listed in Replication of roles and grants below.

> **Note:**
>
> All roles are replicated.

### Share replication

This feature supports replication of share objects as well as access privileges granted to shares on database objects.

Replication of [inbound shares](data-share-consumers.md) (shares from providers) is not supported.

### Backup replication for database, schema, and table backups

The Snowflake [backups](backups.md) feature lets you encapsulate a series of backups for a specific database,
schema, or table inside an object known as a backup set. You can optionally control the schedule of automatic backups and
automatic deletion of backups after an expiry period by applying a backup policy to the backup set. Backup sets and
backup policies are database-level objects. Snowflake replicates those objects along with the databases and schemas that contain them.

For information about how Snowflake replicates backup sets and backup policies, see [Replicate backup-related objects](backups.md).

### User replication

This feature supports replicating users and their properties to target accounts, the following user authentication methods, and provisioning
users and groups with SCIM:

| Authentication Method | Works in Target Accounts | Notes |
| --- | --- | --- |
| Password | ✔ |  |
| Password with MFA (multi-factor authentication) | ✔ | Users who are enrolled in MFA in the source account must separately enroll in MFA when they log in to each target account. |
| [Multi-factor authentication (MFA)](security-mfa.md) | ✔ | Users who are enrolled in MFA in the source account must separately enroll in MFA when they log in to each target account. |
| [Key-pair authentication](key-pair-auth.md) | ✔ |  |
| [Programmatic access tokens](programmatic-access-tokens.md) | ✔ | Programmatic access tokens are replicated to the target account only if users and roles are replicated. |
| [Federated Authentication](admin-security-fed-auth-overview.md) | ✔ | Refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md) for details on replicating federated SSO (i.e. SAML2) security integrations. |
| [Snowflake OAuth](oauth-snowflake-overview.md) | ✔ | Refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md) for details on replicating OAuth security integrations. |
| [External OAuth](oauth-ext-overview.md) | ✔ | Refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md) for details on replicating OAuth security integrations. |
| [SCIM](scim-intro.md) | ✔ | Refer to [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md) for details on replicating SCIM security integrations. |

> **Note:**
>
> If `USERS` and `ROLES` objects are replicated to a target account, these object types are read-only in the target account
> and cannot be modified. Users and roles must be created in the source account, then replicated to each target account. Refer to
> [Replication and read-only secondary objects](account-replication-considerations.md).

### Warehouse replication

This feature supports replicating warehouses, including interactive warehouses. Snowflake also replaces privileges granted on
warehouses to roles (if `roles` are replicated).
The state of the primary warehouse is not replicated. Warehouses are replicated in the suspended state to each target account
and can be resumed in the target account.

### Workspaces replication

[Shared workspaces](ui-snowsight/workspaces-shared.md) are replicated when they are included in a database that is part of a
replication or failover group. Private workspaces are replicated when their owning users are replicated. In secondary (target) accounts,
replicated content is read-only; Workspace files (including SQL files, Notebook files, and so on) are executable but cannot be edited.

### Dataset replication

Account replication supports replicating Datasets. Datasets are materialized data objects that you use with Snowflake ML.
For usage information, see [Snowflake Datasets](../developer-guide/snowflake-ml/dataset.md).
Replication is supported for Datasets created starting with the General Availability of the Dataset replication feature.
For the release announcement, see [Mar 20, 2025: Snowflake Datasets (General availability)](../release-notes/2025/other/2025-03-20-snowflake-ml-datasets.md).

### Cortex Search Service replication

The feature supports replicating Cortex Search Services.

For more information, see [Replicate a Cortex Search Service](snowflake-cortex/cortex-search/cortex-search-replication.md).

### Replication of roles and grants

In order to replicate grants on objects to roles, roles must be replicated from the source account to the target account. To
replicate roles in a replication or failover group, you must include `roles` in the `object_types` list. Roles can be in a
separate replication or failover group from the data objects on which the privileges are granted.

When `roles` are replicated, grants on objects are only replicated to a target account if:

* The privilege was granted by the owner of the object or indirectly by a role that was granted the privilege with the
  [WITH GRANT OPTION](../sql-reference/sql/grant-privilege.md) parameter by the owner of the object.
* Both the grantee and grantor role for a privilege grant are located in the target account.
* The object is replicated (i.e. the object type is included in the `object_types` list).

Otherwise the grant on the object is not replicated.

For information about replicating secondary roles and session policies, see [Session policies with secondary roles](account-replication-considerations.md).

> **Note:**
>
> * If a role is dropped that has the OWNERSHIP privilege on an active pipe in the target account, the refresh operation
>   fails.
> * Privileges on replication groups and failover groups are not
>   replicated. If the REPLICATE or FAILOVER privilege has been granted on replication groups or failover groups, these
>   privileges need to be granted in both the source and target accounts. Refer to [Replication privileges](account-replication-considerations.md)
>   for details on these privileges.

#### Grants for database objects

If `roles` and `databases` are replicated to a target account (in the same or different replication or
failover group), refreshing a secondary database synchronizes the privilege grants on the database and the objects in the database
(schemas, tables, views, etc.) to existing roles in the target account (i.e. roles that have been replicated to the target account).
Note that only privilege grants on objects supported by database replication are synchronized. For the list of supported objects,
see Replicated database objects.

External tables are not currently supported for replication. As a result, privilege grants on external tables are
also not replicated.

#### Future grants for objects

If roles are replicated to the target account, [future grants](security-access-control-considerations.md) that are granted at the
database or schema level are replicated to the target account. This also includes future grants on non-replication supported objects. For
example, external table replication is not yet supported, however future grants on external tables are replicated.
When you create an external table in a target account, the privileges granted on future external tables materialize as intended.

#### Object creation and ownership

If new objects are created in a target account during a refresh from the source account, and roles are not replicated to the target
account, the OWNERSHIP privilege for the new objects is granted to the GLOBALORGADMIN role.

If roles are replicated to the target account, the OWNERSHIP privilege is granted to the same role on the target account as the
role with the OWNERSHIP privilege in the source account when roles are next replicated. The roles may be replicated at the same
time the new objects are created in the target account if the objects and roles are in the same replication (or failover) group.

#### Grants for shares

In order to enable secure data sharing, grants on objects to shares are replicated even if `roles` are not
replicated to target accounts. This section provides information on how grants on objects to shares are replicated.

If `roles` are replicated from the source account to the target account, grants to objects on shares are replicated if:

* The grantor role exists in the target account or
* The grantor role in the source account has the OWNERSHIP privilege on the primary object.

If `roles` are not replicated from the source account to the target account, then:

* Grants on objects to shares are replicated.
* The grantor role for grants on replicated objects to shares is the role with the OWNERSHIP privilege on the object.

### User who refreshes objects in a target account

A user who executes the [ALTER FAILOVER GROUP … REFRESH](../sql-reference/sql/alter-failover-group.md) command to refresh objects in a target account
from the source account must use a role with the REPLICATE privilege on the failover group. Snowflake protects this user in the target account
by failing in the following scenarios:

* If the user does not exist in the source account, the refresh operation fails.
* If the user exists in the source account, but a role with the REPLICATE privilege was not granted to the user, the refresh operation fails.

## Replication schedule

As a best practice, Snowflake recommends scheduling automatic refreshes using the REPLICATION_SCHEDULE parameter. The schedule can be
defined when creating a new replication or failover group with CREATE *<object>* or later (using ALTER *<object>*).

When you create a secondary replication or failover group, Snowflake automatically executes an initial refresh. The next refresh is
scheduled based on when the prior refresh started and the scheduling interval, or the next valid time based on the cron expression. For
example, if the refresh schedule interval is 10 minutes and the prior refresh operation (either a scheduled refresh or manually triggered
refresh) starts at 12:01, the next refresh is scheduled for 12:11.

Snowflake ensures only one refresh is executed at any given time. If a refresh is still executing when the next refresh is scheduled, the
next refresh is delayed to start when the currently executing refresh completes. For example, if a refresh is scheduled to execute 15
minutes after the hour, every hour, and the prior refresh completes at 12:16, the next refresh is scheduled to execute when the previously
executing refresh is completed.

> **Note:**
>
> Automatically scheduled refresh operations are executed using the role with the OWNERSHIP privilege on the replication
> or failover group. If a scheduled refresh operation fails due to insufficient privileges, grant the required privileges
> to the role with the OWNERSHIP privilege on the group.

### Suspend and resume scheduled replication

A secondary failover group cannot be promoted to the primary group while a refresh is executing. To fail over gracefully, suspend scheduled
replication in the target account. After the failover is completed, resume the scheduled replication. For more information,
see [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md).

## Replication to accounts on lower editions

If either of the following conditions is true, Snowflake displays an error message:

* A primary replication group with only database and/or share objects is in a Business Critical (or higher) account but one
  or more of the accounts approved for replication are on lower editions. Business Critical Edition is intended for Snowflake
  accounts with extremely sensitive data.
* A primary replication or failover group with any [object types](../sql-reference/sql/create-replication-group.md) is in a
  Business Critical (or higher) account and a signed business associate agreement is in place to store PHI data in the account per HIPAA
  and [HITRUST CSF](intro-cloud-platforms.md) regulations. However, no such agreement is in place for one or more of the accounts
  enabled for replication, regardless if they are Business Critical (or higher) accounts.

This behavior is implemented in an effort to help prevent account administrators for Business Critical (or higher) accounts from
inadvertently replicating sensitive data to accounts on lower editions.

An account administrator (a user with the ACCOUNTADMIN role) or a user with a role with the
CREATE REPLICATION GROUP/CREATE FAILOVER GROUP or OWNERSHIP privilege can override this default behavior by including the
IGNORE EDITION CHECK clause when executing the CREATE *<object>* or ALTER *<object>*
statement. If IGNORE EDITION CHECK is set, the primary replication or failover group may be replicated to the specified accounts on
lower Snowflake editions in these specific scenarios.

> **Note:**
>
> Failover groups can only be created in a Business Critical Edition (or higher) account. Therefore failover groups can only be
> replicated to an account that is a Business Critical Edition (or higher) account.

## Current limitations of replication

* Databases created from shares cannot be replicated.
* Refresh operations fail if the primary database includes a stream with an unsupported source object.
  The operation also fails if the source object for any stream has been dropped.
* Append-only streams are not supported on replicated source objects.

> **Note:**
>
> Database replication does not work for task graphs if the graph is owned by a different role than the role that performs replication.

---
title: Introduction to sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-intro.md
section: User Guide
---

# Introduction to sensitive data classification

It’s critical to know where your sensitive data resides and if it’s adequately protected. This isn’t just a best practice; it’s
a vital requirement across many industries to maintain compliance with regulations. Snowflake provides a solution that automatically
discovers sensitive data and makes it easy to apply governance controls like tags and masking policies.

Snowflake classifies sensitive data into [native categories](classify-native.md) like name and national identifier, but you
can also create your own [custom categories](classify-custom.md) to detect sensitive data that is specific to your
organization or domain.

## Get started

Snowflake provides a web interface to configure sensitive data classification and to view the governance posture of sensitive data.

To get started, do one of the following:

* To set up sensitive data classification, see [Use the Trust Center to set up sensitive data classification](classify-ui-trust-center.md).
* To view the results of sensitive data classification, see [Use the Trust Center to view classification results](classify-results.md).

## Core concepts of sensitive data classification

### About classification categories

With sensitive data classification, every column that is identified as containing sensitive data is assigned two categories: a semantic
category and a privacy category.

* A **semantic category** identifies the *type* of personal attribute. Snowflake provides
  [native categories](classify-native.md) for common attributes such as names and addresses. If your sensitive data doesn’t
  fit into a native category, you can create a [custom category](classify-custom.md) for it.
* A **privacy category** identifies the *sensitivity* of a personal attribute. It can be either IDENTIFIER, QUASI_IDENTIFIER, or SENSITIVE (a generic, non-identifier category for things such as medical/health data or salary).

### About classification tags

A [tag](object-tagging/introduction.md) is a Snowflake object that can be assigned to a column. Snowflake uses
the following system-defined tags to identify columns that it has classified as containing sensitive data.

* SNOWFLAKE.CORE.SEMANTIC_CATEGORY: Tag used to identify the native or custom category of the data in a column.
* SNOWFLAKE.CORE.PRIVACY_CATEGORY: Tag used to identify the privacy category of the data in a column.

You can map user-defined tags to system-defined classification tags. For example, you can set up a tag map so that every time the system tag
`SNOWFLAKE.CORE.SEMANTIC_CATEGORY = 'NAME'` is applied to a column, the user-defined tag `tag_db.sch.pii = 'Highly confidential'`
is also applied.

### About classification profiles

When you use the Trust Center web interface to specify classification settings, those settings are saved as a *classification profile*. This
classification profile can be edited later to change the settings that control how data is classified. In the web interface, the
classification profile also controls which databases are being classified with the profile’s settings.

You can also [use SQL commands](classify-auto.md) to create and modify a classification profile. If you are using SQL,
associating the classification profile with a database to start the classification process is a separate step.

## Protecting sensitive data

Snowflake provides the governance tools you need to track and protect your sensitive data.

* You can configure the classification process so Snowflake automatically assigns system and user-defined
  [tags](object-tagging/introduction.md) to data that it classifies as sensitive. You can then track the data within your
  data estate by tracking the tags.
* You can assign a [masking policy](security-column-ddm-intro.md) to columns that contain sensitive data to selectively mask
  the data at query time.
* You can combine tagging and masking policies to automatically mask data that is classified as sensitive. If you use
  [tag-based masking](tag-based-masking-policies.md) to associate a masking policy with a user-defined tag, the data will be
  automatically masked when Snowflake applies the tag as part of the classification process. As new data is added to a database, the
  tag-based masking policies are automatically assigned to the columns that contain sensitive data.

## Determine which databases are being classified

You can determine what data is being monitored for sensitive data classification by listing the databases that are
associated with a classification profile. If a database is associated with a classification
profile, all the tables and views in that database are being automatically classified according to the criteria defined in the profile.

To determine which databases are being classified:

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](classify-ui-trust-center.md).
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Select the Dashboard tab.
5. Find the Databases monitored by classification tile. To list the databases being classified, select Monitored or
   Partially monitored.

> **Note:**
>
> A database is partially monitored if someone used SQL to set a classification profile directly on a schema in the database rather
> than setting the profile at the database level.

Use the [SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES](../sql-reference/functions/system_show_sensitive_data_monitored_entities.md) function to list the databases that are
associated with a classification profile.

```sqlexample
SELECT SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES('DATABASE');
```

## Cost considerations

Sensitive data classification consumes credits as it uses [serverless compute resources](cost-understanding-compute.md) to
classify tables in the database. For more information about pricing for this consumption, see Table 5 in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Note:**
>
> Classifying views can cost more than classifying tables. The additional cost depends on the complexity of the query that created
> the view. Materialized views don’t incur these additional costs. By default, views are excluded from classification.

### View costs in Snowsight

To explore the cost of sensitive data classification:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost and usage data](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select a warehouse to use to view the usage data. Snowflake recommends using an XS warehouse for this purpose.
5. Select Consumption.
6. From the Usage Type drop-down, select Compute.
7. From the Service Type drop-down, select Sensitive Data Classification.

### Use SQL to query costs

You can query views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas to determine how much was spent on automatically classifying
sensitive data. To monitor credit consumption, query the following views:

METERING_HISTORY view (ACCOUNT_USAGE)
:   Lets you retrieve the hourly cost of automatic classification by focusing on `SENSITIVE_DATA_CLASSIFICATION` in the
    `SERVICE_TYPE` column. For example:

    ```sqlexample
    SELECT
      service_type,
      start_time,
      end_time,
      entity_id,
      name,
      credits_used_compute,
      credits_used_cloud_services,
      credits_used,
      budget_id
      FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_HISTORY
      WHERE service_type = 'SENSITIVE_DATA_CLASSIFICATION';
    ```

METERING_DAILY_HISTORY view (ACCOUNT_USAGE and ORGANIZATION_USAGE)
:   Lets you retrieve the daily cost of automatic classification by focusing on `SENSITIVE_DATA_CLASSIFICATION` in the
    `SERVICE_TYPE` column. For example:

    ```sqlexample
    SELECT
      service_type,
      usage_date,
      credits_used_compute,
      credits_used_cloud_services,
      credits_used
      FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_DAILY_HISTORY
      WHERE service_type = 'SENSITIVE_DATA_CLASSIFICATION';
    ```

USAGE_IN_CURRENCY_DAILY (ORGANIZATION_USAGE)
:   Lets you retrieve the daily cost of automatic classification by focusing on `SENSITIVE_DATA_CLASSIFICATION` in the
    `SERVICE_TYPE` column. Use this view to determine the cost in currency, not credits.

## Supported objects

Snowflake supports classifying data stored in the following types of tables and views:

Tables:

* [Snowflake tables](tables-micro-partitions.md)
* [External tables](tables-external-intro.md)
* [Managed and unmanaged Apache Iceberg™ tables](tables-iceberg.md)
* [Dynamic tables](dynamic-tables-about.md)
* [Event tables](../developer-guide/logging-tracing/event-table-setting-up.md)

Views:

* [Snowflake views](views-introduction.md)
* [Materialized views](views-materialized.md)
* [Secure views](views-secure.md)

> **Note:**
>
> Although views can be classified, classifying a view can cost significantly more than classifying the underlying
> tables directly, because of the complexity of the query that created the view. For more information, see Cost considerations.

Note that Snowflake does not support classification on [shared tables](data-sharing-intro.md) and shared schemas from the
consumer’s side. If a table is created by the provider and placed into the provider’s outbound share, the classification only works if it is
called from the provider’s side.

## Supported data types

You can classify table and view columns for all supported [data types](../sql-reference-data-types.md) except for the following
data types:

* BINARY
* DECFLOAT
* GEOGRAPHY
* UUID
* VECTOR

> **Note:**
>
> * Unstructured data like long text stored in columns is not supported.
> * JSON is the only semi-structured data that is supported.

## Limitations and considerations

* Classification profiles cannot be set on a reader account.
* A classification profile cannot be set on more than 1,000 databases.
* A classification profile cannot be *directly* set on more than 10,000 schemas.
* A maximum of 100 million tables can be classified in a schema.
* You cannot automatically classify a table if it has any of the following characteristics:

  + More than 10,000 columns.
  + A column with a name that has more than 255 characters.
  + A column with a name that includes the `$` character.

---
title: Introduction to streams
source: https://docs.snowflake.com/en/user-guide/streams-intro.md
section: User Guide
---

# Introduction to streams

A stream object records data manipulation language (DML) changes made to tables, including inserts (including COPY INTO), updates, and deletes,
as well as metadata about each change, so that actions can be taken using the changed data. This process is referred to as change data capture (CDC).
This topic introduces key concepts for change data capture using streams.

An individual table stream tracks the changes made to rows in a *source table*. A table stream (also referred to as simply a “stream”) makes
a “change table” available of what changed, at the row level, between two transactional points of time in a table. This allows querying and
consuming a sequence of change records in a transactional fashion.

Streams can be created to query change data on the following objects:

* Standard tables, including shared tables.
* Views, including secure views
* [Directory tables](data-load-dirtables.md)
* [Dynamic tables](dynamic-tables-about.md)
* [Apache Iceberg™ tables](tables-iceberg.md) with Limitations.
* [Event tables](../developer-guide/logging-tracing/event-table-setting-up.md)
* [External tables](tables-external-intro.md)

## Offset storage

When created, a stream logically takes an initial snapshot of every row in the source object (e.g. table, external table, or the underlying
tables for a view) by initializing a point in time (called an *offset*) as the current transactional version of the object. The change
tracking system utilized by the stream then records information about the DML changes after this snapshot was taken. Change records provide
the state of a row before and after the change. Change information mirrors the column structure of the tracked source object and includes
additional metadata columns that describe each change event.

Streams use the current table schema. However, since streams may read deleted data to track changes over time, any incompatible schema
changes between the offset and the advance can cause query failures.

Note that a stream itself does not contain any table data. A stream only stores an offset for the source object and returns CDC
records by leveraging the versioning history for the source object. When the first stream for a table is created, several hidden columns
are added to the source table and begin storing change tracking metadata. These columns consume a small amount of storage. The CDC records
returned when querying a stream rely on a combination of the *offset* stored in the stream and the *change tracking metadata* stored in the
table. Note that for streams on views, change tracking must be enabled explicitly for the view and underlying tables to add the hidden
columns to these tables.

It might be useful to think of a stream as a bookmark, which indicates a point in time in the pages of a book (i.e. the source object). A
bookmark can be thrown away and other bookmarks inserted in different places in a book. So too, a stream can be dropped and other streams
created at the same or different points of time (either by creating the streams consecutively at different times or by using [Time
Travel](data-time-travel.md)) to consume the change records for an object at the same or different offsets.

One example of a consumer of CDC records is a data pipeline, in which only the data in staging tables that has
changed since the last extraction is transformed and copied into other tables.

## Table versioning

A new table version is created whenever a transaction that includes one or more [DML](../sql-reference/sql-dml.md) statements is committed
to the table. This applies to the following table types:

* Standard tables
* Directory tables
* Dynamic tables
* External tables
* Apache Iceberg™ tables
* Underlying tables for a view

In the transaction history for a table, a stream offset is located between two table versions. Querying a stream returns the
changes caused by transactions committed after the offset and at or before the current time.

The following example shows a source table with 10 committed versions in the timeline. The offset for stream `s1` is currently between
table versions `v3` and `v4`. When the stream is queried (or consumed), the records returned include all transactions between table
version `v4`, the version immediately after the stream offset in the table timeline, and `v10`, the most recent committed table version
in the timeline, inclusive.

A stream provides the minimal set of changes from its current offset to the current version of the table.

Multiple queries can independently consume the same change data from a stream without changing the offset. A stream advances the offset
only when it is used in a DML transaction. This includes a Create Table As Select (CTAS) transaction or a COPY INTO location
transaction and this behavior applies to both explicit and *autocommit* transactions. (By default, when a
DML statement is executed, an autocommit transaction is implicitly started and the transaction is committed at the completion of the
statement. This behavior is controlled with the [AUTOCOMMIT](../sql-reference/parameters.md) parameter.) Querying a stream alone does not advance its offset,
even within an explicit transaction; the stream contents must be consumed in a DML statement.

> **Note:**
>
> To advance the offset of a stream to the current table version without consuming the change data in a DML operation, complete either of
> the following actions:
>
> * Recreate the stream (using the CREATE OR REPLACE STREAM syntax).
> * Insert the current change data into a temporary table. In the INSERT statement, query the stream but include a WHERE clause that
>   filters out all of the change data (e.g. `WHERE 0 = 1`).

When a SQL statement queries a stream within an explicit transaction, the stream is queried at the stream advance point (i.e. the timestamp)
when the transaction began rather than when the statement was run. This behavior pertains both to DML statements and CREATE TABLE … AS
SELECT (CTAS) statements that populate a new table with rows from an existing stream.

A DML statement that selects from a stream consumes all of the change data in the stream as long as the transaction commits successfully. To
ensure multiple statements access the same change records in the stream, surround them with an explicit transaction statement
([BEGIN](../sql-reference/sql/begin.md) .. [COMMIT](../sql-reference/sql/commit.md)). This locks the stream. DML updates to the source object in parallel
transactions are tracked by the change tracking system but do not update the stream until the explicit transaction statement is committed
and the existing change data is consumed.

## Repeatable read isolation

Streams support repeatable read isolation. In repeatable read mode, multiple SQL statements within a transaction see the same set of records
in a stream. This differs from the read committed mode supported for tables, in which statements see any changes made by previous statements
executed within the same transaction, even though those changes are not yet committed.

The delta records returned by streams in a transaction is the range from the current position of the stream until the transaction start
time. The stream position advances to the transaction start time if the transaction commits; otherwise it stays at the same position.

Consider the following example:

| Time | Transaction 1 | Transaction 2 |
| --- | --- | --- |
| 1 | Begin transaction. |  |
| 2 | Query stream `s1` on table `t1`. The stream returns the change data capture records . between the current position to the Transaction 1 start time. If the stream is used in a DML statement . the stream is then locked to avoid changes by concurrent transactions. |  |
| 3 | Update rows in table `t1`. |  |
| 4 | Query stream `s1`. Returns the same state of stream when it was used at **Time** `2`. |  |
| 5 | Commit transaction. If the stream was consumed in DML statements within the transaction, the stream position advances to the transaction start time. |  |
| 6 |  | Begin transaction. |
| 7 |  | Query stream `s1`. Results include table changes committed by Transaction 1. |

Within Transaction 1, all queries to stream `s1` see the same set of records. DML changes to table `t1` are recorded to the stream only
when the transaction is committed.

In Transaction 2, queries to the stream see the changes recorded to the table in Transaction 1. Note that if Transaction 2 had begun
before Transaction 1 was committed, queries to the stream would have returned a snapshot of the stream from the position of the
stream to the beginning time of Transaction 2 and would not see any changes committed by Transaction 1.

## Stream columns

A stream stores an offset for the source object and not any actual table columns or data. When queried, a stream accesses and returns the
historic data in the same shape as the source object (i.e. the same column names and ordering) with the following additional columns:

METADATA$ACTION:
:   Indicates the DML operation (INSERT, DELETE) recorded.

METADATA$ISUPDATE:
:   Indicates whether the operation was part of an UPDATE statement. Updates to rows in the source object are represented as a pair of DELETE
    and INSERT records in the stream with a metadata column METADATA$ISUPDATE values set to TRUE.

    Note that streams record the differences between two offsets. If a row is added and then updated in the current offset, the delta change
    is a new row. The METADATA$ISUPDATE row records a FALSE value.

METADATA$ROW_ID:
:   Specifies a unique, immutable row ID for tracking changes over time. If CHANGE_TRACKING is disabled and later re-enabled on the stream’s
    source object, the row ID could change.

Snowflake provides the following guarantees with respect to METADATA$ROW_ID:

1. The METADATA$ROW_ID depends on the stream’s source object.

   For instance, a stream `stream1` on table `table1` and stream `stream2` on table `table1` produce the same METADATA$ROW_IDs for the same
   rows, but a stream `stream_view` on view `view1` is not guaranteed to produce the same METADATA$ROW_IDs as `stream1`, even if `view` is
   defined using the statement `CREATE VIEW view AS SELECT * FROM table1`.
2. A stream on a source object and a stream on the source object’s clone produce the same METADATA$ROW_IDs for the rows that exist at the time of the
   cloning.
3. A stream on a source object and a stream on the source object’s replica produce the same METADATA$ROW_IDs for the rows that were replicated.

## Types of streams

The following stream types are available based on the metadata recorded by each:

Standard:
:   Supported for streams on standard tables, dynamic tables, Snowflake-managed Apache Iceberg™ tables, directory tables, or views. A standard (i.e. delta) stream tracks all DML
    changes to the source object, including inserts, updates, and deletes (including table truncates). This stream type performs a join on
    inserted and deleted rows in the change set to provide the row level delta. As a net effect, for example, a row that is inserted and
    then deleted between two transactional points of time in a table is removed in the delta (i.e. is not returned when the stream is queried).

    > **Note:**
    >
    > Standard streams cannot retrieve change data for geospatial data. We recommend creating append-only streams on objects that contain
    > geospatial data.

Append-only:
:   Supported for streams on standard tables, dynamic tables, Snowflake-managed Apache Iceberg™ tables, or views. An append-only stream exclusively tracks row
    inserts. Update, delete, and truncate operations are not captured by append-only streams. For instance, if 10 rows are
    initially inserted into a table, and then 5 of those rows are deleted before advancing the offset for an append-only stream, the
    stream would only record the 10 inserted rows.

    An append-only stream specifically returns the appended rows, making it notably more performant than a standard stream for
    extract, load, and transform (ELT), and similar scenarios reliant solely on row inserts. For example, a source table can be
    truncated immediately after the rows in an append-only stream are consumed, and the record deletions do not contribute to the
    overhead the next time the stream is queried or consumed.

    Creating an append-only streams in a target account using a secondary object as the source is not supported.

Insert-only:
:   Supported for streams on externally managed Apache Iceberg™ or external tables. An insert-only stream tracks row inserts only; they do not record delete
    operations that remove rows from an inserted set (i.e. no-ops). For example, in-between any two offsets, if `File1` is removed from the
    cloud storage location referenced by the external table, and `File2` is added, the stream returns records for the rows in `File2` only, regardless of whether
    `File1` was added before or within the requested change interval. Unlike when tracking CDC data for standard tables, access to the historical
    records for files in cloud storage is not governed by or guaranteed to Snowflake.

    Overwritten or appended files are essentially handled as new files: The old version of the file is removed from cloud storage, but the
    insert-only stream does not record the delete operation. The new version of the file is added to cloud storage, and the insert-only
    stream records the rows as inserts. The stream does not record the diff of the old and new file versions. Note that appends may not
    trigger an automatic refresh of the external table metadata, such as when using
    [Azure AppendBlobs](tables-external-azure.md).

## Data flow

The following diagram shows how the contents of a standard stream change as rows in the source table are updated. Whenever a DML
statement consumes the stream contents, the stream position advances to track the next set of DML changes to the table (i.e. the changes in
a table version):

## Data retention period and staleness

A stream becomes stale when its offset falls outside of the data retention period for
its source table (or underlying tables for a source view). In a stale state, historical
data and any unconsumed change records for the source table are no longer accessible. To
continue tracking new change records, you must recreate the stream using the
[CREATE STREAM](../sql-reference/sql/create-stream.md) command.

To prevent a stream from becoming stale, consume the stream records within a DML
statement during the table’s retention period and regularly consume its change data
before its STALE_AFTER timestamp (that is, within the extended data retention period
for the source object). Additionally, calling
[SYSTEM$STREAM_HAS_DATA](../sql-reference/functions/system_stream_has_data.md) on the stream prevents it from
becoming stale, provided the stream is empty and the SYSTEM$STREAM_HAS_DATA function
returns `FALSE`.

For more information on data retention periods, see [Understanding & using Time Travel](data-time-travel.md).

> **Note:**
>
> Streams on shared tables or views don’t extend the data retention period for the table
> or underlying tables, respectively. For more information, see
> [Streams on shared objects](data-sharing-provider.md).

If the data retention period for a table is less than 14 days and a stream hasn’t been
consumed, Snowflake temporarily extends this period to prevent the stream from going
stale. The retention period is extended to the stream’s offset, up to a maximum of 14 days
by default, regardless of your [Snowflake edition](intro-editions.md). The
maximum number of days for which Snowflake can extend the data retention period is
determined by the [MAX_DATA_EXTENSION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter value. Once the
stream is consumed, the extended data retention period reverts to the table’s default.

The following table shows examples of [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) and
MAX_DATA_EXTENSION_TIME_IN_DAYS values, indicating how often the stream contents should be
consumed to avoid staleness:

| DATA_RETENTION_TIME_IN_DAYS | MAX_DATA_EXTENSION_TIME_IN_DAYS | Consume Streams in X Days |
| --- | --- | --- |
| 14 | 0 | 14 |
| 1 | 14 | 14 |
| 0 | 90 | 90 |

To check the staleness status of a stream, use the [DESCRIBE STREAM](../sql-reference/sql/desc-stream.md)
or [SHOW STREAMS](../sql-reference/sql/show-streams.md) command. The STALE_AFTER column timestamp is
the extended data retention period for the source object. It shows when the stream is
predicted to become stale or when it became stale if the timestamp is in the past. This
timestamp is calculated by adding the greater value of the DATA_RETENTION_TIME_IN_DAYS
or MAX_DATA_EXTENSION_TIME_IN_DAYS parameters setting for the source object to the
last consumption time of the stream.

> **Note:**
>
> If the data retention period for the source table is set at the schema or database level, the current role must have access to the schema or database to calculate the STALE_AFTER value.

Consuming change data for a stream updates the STALE_AFTER timestamp. Although reading
from the stream might succeed for some time after the STALE_AFTER timestamp, the stream
can become stale at any moment. The STALE column indicates if the stream is expected to
be stale, though it might not be stale yet.

To prevent a stream from becoming stale, regularly consume its change data before its
STALE_AFTER timestamp (that is, within the extended data retention period for the
source object). Don’t rely on the results from a stream after the STALE_AFTER period has
elapsed because the STREAM_HAS_DATA function might return unexpected results.

After the STALE_AFTER timestamp has passed, the stream can become stale at any time,
even if it has no unconsumed records. Querying a stream might return 0 records even
if there is change data for the source object. For example, an append-only stream
tracks row inserts only, but updates and deletes also write change records to the
source object. Additionally, some table writes, like reclustering, don’t produce change
data. Consuming change data for a stream advances its offset to the present, regardless
of whether there is intervening change data.

> **Important:**
>
> * Recreating an object (using the CREATE OR REPLACE TABLE syntax) drops its history, which also makes any stream on the table or view
>   stale. In addition, recreating or dropping any of the underlying tables for a view makes any stream on the view stale.
> * Currently, when a database or schema that contains a stream and its source table (or the underlying tables for a source view) is
>   cloned, any unconsumed records in the stream clone are inaccessible. This behavior is consistent with
>   [Time Travel](data-time-travel.md) for tables. If a table is cloned, historical data for the table clone begins at the
>   time/point when the clone was created.
> * Renaming a source object does not break a stream or cause it to go stale. In addition, if a source object is dropped and a new object
>   is created with the same name, any streams linked to the original object are not linked to the new object.

## Multiple consumers of streams

We recommend that users create a separate stream for each consumer of change records for an object. “Consumer” refers to a task, script, or
other mechanism that consumes the change records for an object using a DML transaction. As stated earlier in this topic, a stream advances its
offset only when it is used in a DML transaction. This includes a Create Table As Select (CTAS) transaction or a COPY INTO location transaction.

Different consumers of change data in a single stream retrieve different deltas unless Time Travel is used. When the change data captured from
the latest offset in a stream is consumed using a DML transaction, the stream advances the offset. The change data is no longer available for
the next consumer. To consume the same change data for an object, create multiple streams for the object. A stream only stores an offset
for the source object and not any actual table column data; therefore, you can create any number of streams for an object without incurring significant cost.

## Streams on views

Streams on views support both local views and views shared using Snowflake Secure Data Sharing, including secure views. Currently, streams
cannot track changes in materialized views.

Streams are limited to views that satisfy the following requirements:

Underlying Tables:
:   * All of the underlying tables must be native tables.
    * The view can apply only the following operations:

      + Projections
      + Filters
      + Inner or cross joins
      + UNION ALL

    Nested views and subqueries in the FROM clause are supported as long as the fully expanded query satisfies the other requirements in this requirements table.

View Query:
:   General requirements:

    * The query can select any number of columns.
    * The query can contain any number of WHERE predicates.
    * Views with the following operations are not yet supported:

      + GROUP BY clauses
      + QUALIFY clauses
      + Subqueries not in the FROM clause
      + Correlated subqueries
      + LIMIT clauses
      + DISTINCT clauses

    Functions:

    * Functions in the select list must be system-defined, scalar functions.

Change Tracking:
:   Change tracking must be enabled in the underlying tables.

Before creating a stream on a view, you must enable change tracking on the underlying tables for the view. For instructions, see
[Enabling change tracking on views and underlying tables](streams-manage.md).

### Join results behavior

When examining the results of a stream that tracks changes to a view containing a join,
it’s important to understand what data is being joined.
Changes that have occurred on the left table since the stream offset are being joined with the right table,
changes on the right table since the stream offset are being joined with the left table,
and changes on both tables since the stream offset are being joined with each other.

Consider the following example:

Two tables are created:

```sqlexample
create or replace table orders (id int, order_name varchar);
create or replace table customers (id int, customer_name varchar);
```

A view is created to join the two tables on `id`. Each table has a single row that joins with the other:

```sqlexample
create or replace view ordersByCustomer as select * from orders natural join customers;
insert into orders values (1, 'order1');
insert into customers values (1, 'customer1');
```

A stream is created that tracks changes to the view:

```sqlexample
create or replace stream ordersByCustomerStream on view ordersBycustomer;
```

The view has one entry and the stream has none since there have been no changes to the tables since the stream’s current offset:

```sqlexample
select * from ordersByCustomer;
+----+------------+---------------+
| ID | ORDER_NAME | CUSTOMER_NAME |
|----+------------+---------------|
|  1 | order1     | customer1     |
+----+------------+---------------+

select * exclude metadata$row_id from ordersByCustomerStream;
+----+------------+---------------+-----------------+-------------------+
| ID | ORDER_NAME | CUSTOMER_NAME | METADATA$ACTION | METADATA$ISUPDATE |
|----+------------+---------------+-----------------+-------------------|
+----+------------+---------------+-----------------+-------------------+
```

Once updates are made to the underlying tables, selecting `ordersByCustomerStream` will produce records of `orders` x Δ `customers` + Δ `orders` x
`customers` + Δ `orders` x Δ `customers` where:

> * Δ `orders` and Δ `customers` are the changes that have occurred to each table since the stream offset.
> * orders and customers are the total contents of the tables at the current stream offset.

Note that due to optimizations in Snowflake the cost of computing this expression is not always linearly proportional to the size of the inputs.

If another joining row is inserted in `orders` then `ordersByCustomer` will have a new row:

```sqlexample
insert into orders values (1, 'order2');
select * from ordersByCustomer;
+----+------------+---------------+
| ID | ORDER_NAME | CUSTOMER_NAME |
|----+------------+---------------|
|  1 | order1     | customer1     |
|  1 | order2     | customer1     |
+----+------------+---------------+
```

Selecting from `ordersByCustomersStream` produces one row because Δ `orders` x `customers` contains the new insert and `orders` x Δ `customers` +
Δ `orders` x Δ `customers` is empty:

```sqlexample
select * exclude metadata$row_id from ordersByCustomerStream;
+----+------------+---------------+-----------------+-------------------+
| ID | ORDER_NAME | CUSTOMER_NAME | METADATA$ACTION | METADATA$ISUPDATE |
|----+------------+---------------+-----------------+-------------------|
|  1 | order2     | customer1     | INSERT          | False             |
+----+------------+---------------+-----------------+-------------------+
```

If another joining row is then inserted into `customers` then `ordersByCustomer` will have a total of three *new* rows:

```sqlexample
insert into customers values (1, 'customer2');
select * from ordersByCustomer;
+----+------------+---------------+
| ID | ORDER_NAME | CUSTOMER_NAME |
|----+------------+---------------|
|  1 | order1     | customer1     |
|  1 | order2     | customer1     |
|  1 | order1     | customer2     |
|  1 | order2     | customer2     |
+----+------------+---------------+
```

Selecting from `ordersByCustomersStream` produces three rows because
Δ `orders` x `customers`, `orders` x Δ `customers`, and Δ `orders` x Δ `customers` will each produce one row:

```sqlexample
select * exclude metadata$row_id from ordersByCustomerStream;
+----+------------+---------------+-----------------+-------------------+
| ID | ORDER_NAME | CUSTOMER_NAME | METADATA$ACTION | METADATA$ISUPDATE |
|----+------------+---------------+-----------------+-------------------|
|  1 | order1     | customer2     | INSERT          | False             |
|  1 | order2     | customer1     | INSERT          | False             |
|  1 | order2     | customer2     | INSERT          | False             |
+----+------------+---------------+-----------------+-------------------+
```

Note that for append-only streams, Δ `orders` and Δ `customers` will contain row inserts only,
while `orders` and `customers` will contain the complete contents of the tables including any updates that happened before the stream offset.

## CHANGES clause: Read-only alternative to streams

As an alternative to streams, Snowflake supports querying change tracking metadata for tables or views using the
[CHANGES](../sql-reference/constructs/changes.md) clause for SELECT statements. The CHANGES clause enables querying change tracking metadata between
two points in time without having to create a stream with an explicit transactional offset. Using the CHANGES clause does not
advance the offset (i.e. consume the records). Multiple queries can retrieve the change tracking metadata between different transactional
start and endpoints. This option requires specifying a transactional start point for the metadata using an
[AT | BEFORE](../sql-reference/constructs/at-before.md) clause; the end point for the change tracking interval can be set using the optional END clause.

A stream stores the current transactional table version and is the appropriate source of CDC
records in most scenarios. For infrequent scenarios that require managing the offset for arbitrary periods of time, the CHANGES clause is
available for your use.

Currently, the following must be true before change tracking metadata is recorded:

Tables:
:   Either enable change tracking on the table (using [ALTER TABLE](../sql-reference/sql/alter-table.md) … CHANGE_TRACKING = TRUE), or create a stream
    on the table (using [CREATE STREAM](../sql-reference/sql/create-stream.md)).

Views:
:   Enable change tracking on the view and its underlying tables. For instructions, see [Enabling change tracking on views and underlying tables](streams-manage.md).

Enabling change tracking adds several hidden columns to the table and begins storing change tracking metadata. The values in these
hidden CDC data columns provide the input for the stream metadata columns. The columns consume a
small amount of storage.

No change tracking metadata for the object is available for the period before one of these conditions is satisfied.

## Required access privileges

Querying a stream requires a role with a minimum of the following role permissions:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Stream | SELECT |  |
| Table | SELECT | Streams on tables only. |
| View | SELECT | Streams on views only. |
| External stage | USAGE | Streams on directory tables (on external stages) only |
| Internal stage | READ | Streams on directory tables (on internal stages) only |

## Billing for streams

As described in Data Retention Period and Staleness (in this topic), when a stream is not consumed regularly, Snowflake
temporarily extends the data retention period for the source table or the underlying tables in the source view. If the data
retention period for the table is less than 14 days, then behind the scenes, the period is extended to the smaller of the
stream transactional offset or 14 days (if the data retention period for the table is less than 14 days) regardless of the
[Snowflake edition](intro-editions.md) for your account.

Extending the data retention period requires additional storage which will be reflected in your monthly storage charges.

The main cost associated with a stream is the processing time used by a virtual warehouse to query the stream. These charges appear on your
bill as familiar Snowflake credits.

## Limitations

The following limitations apply for streams:

* You can’t use standard or append-only streams on Apache Iceberg™ tables that use an external catalog. (Insert-only streams are supported.)
* You can’t track changes on a view with GROUP BY clauses.
* After adding or modifying a column to be NOT NULL, queries on streams might fail if the stream outputs rows with impermissible NULL
  values. This happens because the stream’s schema enforces the current NOT NULL constraint, which doesn’t match the historical data
  returned by the stream.
* When a [task is triggered](tasks-triggered.md) by Streams on Views, then any changes to tables referenced by the Streams on Views query will also trigger the task, regardless of any joins, aggregations, or filters in the query.
* Streams are not supported on partitioned external tables or partitioned Apache Iceberg™ tables managed by an external catalog.

---
title: Introduction to streams and tasks
source: https://docs.snowflake.com/en/user-guide/data-pipelines-intro.md
section: User Guide
---

# Introduction to streams and tasks

Snowflake supports continuous data pipelines with Streams and Tasks:

Streams:
:   A *stream* object records the delta of change data capture (CDC) information for a table (such as a staging table), including inserts and other data manipulation language (DML) changes. A stream allows querying and consuming a set of changes to a table, at the row level, between two transactional points of time.

    In a continuous data pipeline, table streams record when staging tables and any downstream tables are populated with data from business applications using continuous data loading and are ready for further processing using SQL statements.

    For more information, see [Introduction to streams](streams-intro.md).

Tasks:
:   A *task* object runs a SQL statement, which can include calls to stored procedures. Tasks can run on a schedule or based on a trigger that you define, such as the arrival of data. You can use task graphs to chain tasks together, definining directed acyclic graphs (DAGs) to support more complex periodic processing. For more information, see [Introduction to tasks](tasks-intro.md) and [Create a sequence of tasks with a task graph](tasks-graphs.md).

    Combining tasks with table streams is a convenient and powerful way to continuously process new or changed data. A task can transform new or changed rows that a stream surfaces using [SYSTEM$STREAM_HAS_DATA](../sql-reference/functions/system_stream_has_data.md). Each time a task runs, it can either consume the change data or skip the current run if no change data exists.

For other continuous data pipeline features, see:

* Continuous data loading with [Snowpipe](data-load-snowpipe-intro.md), [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md), or [Snowflake Connector for Kafka](kafka-connector.md).
* Continuous data transformation with [Dynamic tables](dynamic-tables-about.md).

---
title: Introduction to tasks
source: https://docs.snowflake.com/en/user-guide/tasks-intro.md
section: User Guide
---

# Introduction to tasks

Tasks are a powerful way to automate data processing and to optimize business procedures on your data pipeline.

Tasks can run at scheduled times or can be triggered by events, such as when new data arrives in a [stream](streams-intro.md).

Tasks can run SQL commands and stored procedures that use [supported languages and tools](../developer-guide/stored-procedure/stored-procedures-overview.md),
including JavaScript, Python, Java, Scala, and [Snowflake scripting](../developer-guide/snowflake-scripting/index.md).

For complex workflows, you can create sequences of tasks called [task graphs](tasks-graphs.md).
Task graphs can use logic to perform dynamic behavior, running tasks in parallel or in series.

## Task creation workflow overview

1. Create a task administrator role that can run the commands in the following steps.
2. Define a new task using [CREATE TASK](../sql-reference/sql/create-task.md).

   * Define compute resources
   * Define schedules or triggers
   * Define what happens when a task fails
   * Define additional session parameters
3. Manually test tasks using EXECUTE TASK.
4. Allow the task to run continuously using [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md).
5. Monitor task costs
6. Refine the task as needed using [ALTER TASK](../sql-reference/sql/alter-task.md).

For information about running tasks, see:

> * Versioning of task runs
> * View the task history for your account
> * Task costs

## Define compute resources

Tasks require compute resources to run statements and procedures.
You can choose between the following two models:

* Serverless tasks: Snowflake predicts resources that are needed and assigns them automatically.
* User-managed virtual warehouse model: You manage the compute resources using a virtual warehouse.

### Serverless tasks

With this model, you set when you want the task to run, and Snowflake predicts and assigns compute resources needed to complete the task in that time.
The prediction is based on a dynamic analysis of the most recent runs of the same task.

#### Limitations

* The maximum compute size for a serverless task is equivalent to an XXLARGE [virtual warehouse](warehouses.md).

#### Create a task using the serverless compute model

Use [CREATE TASK](../sql-reference/sql/create-task.md) to define the task. Don’t include the WAREHOUSE parameter.

The role that runs the task must have the global EXECUTE MANAGED TASK privilege. For more information, see Task security.

The following example creates a task that runs every hour.

SQLPython

```sqlexample
CREATE TASK SCHEDULED_T1
  SCHEDULE='60 MINUTES'
  AS SELECT 1;
```

```python
from datetime import timedelta
from snowflake.core.task import Cron, Task

tasks = root.databases["TEST_DB"].schemas["TEST_SCHEMA"].tasks

task = tasks.create(
    Task(
        name="SCHEDULED_T1",
        definition="SELECT 1",
        schedule=timedelta(minutes=60),
        ),
    )
```

#### Cost and performance: Warehouse sizes

To make sure serverless tasks run efficiently, you can set the minimum and maximum [warehouse sizes](warehouses-overview.md) by setting the following parameters:

* SERVERLESS_TASK_MIN_STATEMENT_SIZE: the minimum warehouse size for predictable performance (default: XSMALL).
* SERVERLESS_TASK_MAX_STATEMENT_SIZE: the maximum warehouse size to prevent unexpected costs (default: XXLARGE).

After a task completes, Snowflake reviews the performance and adjusts compute resources for future runs within these limits.

The following example shows a task that runs every 30 seconds, with a minimum warehouse size of SMALL and a maximum warehouse size of LARGE.

SQLPython

```sqlexample
CREATE TASK SCHEDULED_T2
  SCHEDULE='30 SECONDS'
  SERVERLESS_TASK_MIN_STATEMENT_SIZE='SMALL'
  SERVERLESS_TASK_MAX_STATEMENT_SIZE='LARGE'
  AS SELECT 1;
```

```python
from datetime import timedelta
from snowflake.core.task import Cron, Task

tasks = root.databases["TEST_DB"].schemas["TEST_SCHEMA"].tasks

task = tasks.create(
    Task(
        name="SCHEDULED_T2",
        definition="SELECT 1",
        schedule=timedelta(seconds=30),
        serverless_task_min_statement_size="SMALL",
        serverless_task_max_statement_size="LARGE",
        ),
    )
```

#### Target completion interval

You can set an earlier target for a serverless task to complete.
A target completion interval is required for [serverless triggered tasks](tasks-triggered.md).

When set, Snowflake estimates and scales resources to complete within the target completion interval.
When a task is already at its maximum warehouse size and is running too long, the target completion interval is ignored.

In the following example, a task runs every day at midnight, with a target of completing by 2 a.m.
The start time and time zone are defined by [USING CRON](../sql-reference/sql/create-task.md).
If the task gets to the largest warehouse size, it may run as long as three hours before finally triggering a timeout.

SQLPython

```sqlexample
CREATE TASK SCHEDULED_T3
  SCHEDULE='USING CRON 0 * * * * America/Los_Angeles'
  TARGET_COMPLETION_INTERVAL='120 MINUTE'
  SERVERLESS_TASK_MAX_STATEMENT_SIZE='LARGE'
  USER_TASK_TIMEOUT_MS = 10800000         -- (3 hours)
  SUSPEND_TASK_AFTER_NUM_FAILURES = 3
  AS SELECT 1;
```

```python
from datetime import timedelta
from snowflake.core.task import Cron, Task

tasks = root.databases["TEST_DB"].schemas["TEST_SCHEMA"].tasks

task = tasks.create(
    Task(
        name="SCHEDULED_T3",
        definition="SELECT 1",
        schedule=Cron("0 * * * *", "America/Los_Angeles"),
        target_completion_interval=timedelta(minutes=120),
        serverless_task_max_statement_size="LARGE",
        user_task_timeout_ms=10800000,  # (3 hours)
        suspend_task_after_num_failures=3,
    ),
)
```

### User-managed virtual warehouse model

With this model, you have full control of the compute resources used for each workload.

#### Choose a warehouse

When choosing a warehouse, consider the following:

* Review the best practices in [Warehouse considerations](warehouses-considerations.md).
* Analyze average task run times using different warehouses based on warehouse size and clustering.
  For more information, see Task duration.
* If the warehouse is shared by multiple processes, consider the impact of the task on other workloads.

#### Create a task using the user-managed compute model

Use [CREATE TASK](../sql-reference/sql/create-task.md), and include the WAREHOUSE parameter.

The role that runs the task must have the global EXECUTE MANAGED TASK privilege.
For more information, see Task security.

The following example creates a task that runs every hour.

```sqlexample
CREATE TASK SCHEDULED_T1
  WAREHOUSE='COMPUTE_WH'
  SCHEDULE='60 MINUTES'
  AS SELECT 1;
```

### Recommendations for choosing a compute model

The following table describes various factors that can help you decide when to use serverless tasks versus user-managed tasks:

| Category | Serverless tasks | User-managed tasks | Notes |
| --- | --- | --- | --- |
| Number, duration, and predictability of concurrent task workloads | Recommended for under-utilized warehouses with too few tasks running concurrently, or completing quickly.  Tasks with relatively stable runs are good candidates for serverless tasks. | Recommended for fully utilized warehouses with multiple concurrent tasks.  Also recommended for unpredictable loads on compute resources. [Multi-cluster warehouses](warehouses-multicluster.md) with [auto-suspend and auto-resume](warehouses-overview.md) enabled could help moderate your credit consumption. | For serverless tasks, Snowflake bills your account based on the actual compute resource usage.  For user-managed tasks, billing for warehouses is based on warehouse size, with a 60-second minimum each time the warehouse is resumed. |
| Schedule interval | Recommended when adherence to the schedule interval is highly important.  If a run of a standalone task or scheduled task graph exceeds the interval, Snowflake increases the size of the compute resources. | Recommended when adherence to the schedule interval is less important. | *Schedule interval* refers to the interval of time between scheduled runs of a standalone task or the root task in a task graph.  Increasing the compute resources can reduce the runtime of some, but not all, SQL code. It doesn’t ensure a task run is completed within the batch window. |

The maximum size for a serverless task run is equivalent to an XXLARGE warehouse.
If a task workload requires a larger warehouse, create a user-managed task with a warehouse of the required size.

## Define schedules or triggers

A task can be set to run on a fixed schedule, or it can be triggered by an event, for example, when a stream has new data.

* Run a task on a fixed schedule
* Run a task whenever a stream has new data

When a task is created, it starts as suspended.
To allow a task to follow a schedule or detect events continuously, use [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md).
To run the task one time, use [EXECUTE TASK](../sql-reference/sql/execute-task.md).

### Run a task on a fixed schedule

To run tasks on a fixed schedule, define the schedule when creating or altering task using [CREATE TASK](../sql-reference/sql/create-task.md) or [ALTER TASK](../sql-reference/sql/alter-task.md),
or by editing the task in Snowsight, using the SCHEDULE parameter.

Snowflake ensures only one instance of a task with a schedule is run at a time.
If a task is still running when the next scheduled run time occurs, then that scheduled time is skipped.

The following example creates a task that runs every 10 seconds:

```sqlexample
CREATE TASK task_runs_every_10_seconds
  SCHEDULE='10 SECONDS'
  AS SELECT 1;
```

To define a schedule based on a specific time or day, use the SCHEDULE =’USING CRON…’ parameter.

The following example creates a task that runs every Sunday at 3 a.m., using the Americas/Los_Angeles time zone:

```sqlexample
CREATE TASK task_sunday_3_am_pacific_time_zone
  SCHEDULE='USING CRON 0 3 * * SUN America/Los_Angeles'
AS SELECT 1;
```

For more information, see [CREATE TASK … SCHEDULE](../sql-reference/sql/create-task.md).

### Run a task whenever a stream has new data

To run tasks whenever a defined [stream](streams-intro.md) has new data, use [Triggered tasks](tasks-triggered.md).
This approach is useful for Extract, Load, Transform (ELT) workflows, because it eliminates frequent polling of the source when new data arrival is unpredictable.
It also reduces latency by processing data immediately. For example:

```sqlexample
CREATE TASK triggered_task_stream
  WHEN SYSTEM$STREAM_HAS_DATA('orders_stream')
  AS
    INSERT INTO completed_promotions
    SELECT order_id, order_total, order_time, promotion_id
    FROM orders_stream;
```

For more information, see [Triggered tasks](tasks-triggered.md).

### Run on a schedule, but only if a stream has new data

You can combine a scheduled task with a triggered task.
For example, the following code creates a task that checks a stream for new data every hour:

```sqlexample
CREATE TASK triggered_task_stream
  SCHEDULE = '1 HOUR'
  WHEN SYSTEM$STREAM_HAS_DATA('orders_stream')
  AS SELECT 1;
```

## Define what happens when a task fails

### Automatically suspend tasks after failed runs

Optionally suspend tasks automatically after a specified number of consecutive runs that either fail or time out.
This feature can reduce costs by suspending tasks that consume Snowflake credits but fail to run to completion.

Set the `SUSPEND_TASK_AFTER_NUM_FAILURES = num` parameter on a task. When the parameter
is set to a value greater than `0`, tasks are automatically suspended after the specified number of consecutive task runs either fail or time out.

The parameter can be set when creating a task using [CREATE TASK](../sql-reference/sql/create-task.md) or later using
[ALTER TASK](../sql-reference/sql/alter-task.md). You can also change this value in Snowsight.

The [SUSPEND_TASK_AFTER_NUM_FAILURES](../sql-reference/parameters.md) parameter can also be set at the account, database, or schema level.
The setting applies to all tasks contained in the modified object.
Note that explicitly setting the parameter at a lower level overrides the parameter value set at a higher level.

### Automatically retry failed task runs

If any task completes in a FAILED state, Snowflake can automatically retry the task.
The automatic task retry is disabled by default.
To enable this feature, set TASK_AUTO_RETRY_ATTEMPTS to a value greater than 0.

Tasks that use error notifications send notifications for each failed retry attempt.
For more information, see [Configure a task to send error notifications](tasks-errors-integrate.md).

When you set the [TASK_AUTO_RETRY_ATTEMPTS](../sql-reference/parameters.md) parameter value at the account, database, or schema level, the change is applied to tasks contained in the modified object during their next scheduled run.

## Define additional session parameters

A task supports all session parameters. For the complete list, see [Parameters](../sql-reference/parameters.md).
Tasks don’t support account or user parameters.

To set session parameters for a task, add the parameter to the task definition with [CREATE TASK](../sql-reference/sql/create-task.md), or modify the task using [ALTER TASK … SET](../sql-reference/sql/alter-task.md). Examples:

```sqlexample
CREATE TASK my_task
  SCHEDULE = 'USING CRON 0 * * * * UTC'
  TIMESTAMP_INPUT_FORMAT = 'YYYY-MM-DD HH24'
  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL'
  AS
    INSERT INTO mytable(ts) VALUES(CURRENT_TIMESTAMP);
```

```sqlexample
ALTER TASK my_task
  SET USER_TASK_TIMEOUT_MS = 10000  -- Changes maximum runtime to 10 seconds
```

## Run tasks

This section describes the different ways that a task can be scheduled and run, and how the version of a task is determined.

* Run a task manually
* Versioning of task runs

### Run a task manually

After you have set up a new task and its parameters using [CREATE TASK](../sql-reference/sql/create-task.md) or [ALTER TASK](../sql-reference/sql/alter-task.md), you can start a single run of the task using [EXECUTE TASK](../sql-reference/sql/execute-task.md).
This command is useful for testing new or modified tasks.

> **Note:**
>
> * You can call this SQL command directly in scripts or in stored procedures.
> * This command supports integrating tasks in external data pipelines.
> * Any third-party service that can authenticate into your Snowflake account and authorize SQL actions can run tasks with the EXECUTE TASK command.

### Versioning of task runs

When a standalone task is first resumed or manually run, an initial version of the task is set. The standalone task runs using this version.
After a task is suspended and modified, a new version is set when the standalone task is resumed or manually run.

When the task is suspended, all future scheduled runs of the task are cancelled; however, currently running tasks continue to run using the current version.

For example, suppose the task is suspended, but a scheduled run of this task has already started.
The owner of the task modifies the SQL code called by the task while the task is still running.
The task runs the SQL code in its definition using the version of the task that was current when the task started its run.
When the task is resumed or is manually run, a new version of the task is set. This new version includes the modifications to the task.

To retrieve the history of task versions, query [TASK_VERSIONS](../sql-reference/account-usage/task_versions.md) [Account Usage view](../sql-reference/account-usage.md) (in the SNOWFLAKE shared database).

## View the task history for your account

To view task history, see either the [TASK_HISTORY](../sql-reference/functions/task_history.md) table function or the [Tasks page on Snowsight](ui-snowsight-tasks.md).

For information about required privileges, see Viewing task history.

To view the run history for a single task:

> SQL:
> :   Query the [TASK_HISTORY](../sql-reference/functions/task_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

To view details on a task graph run that is currently scheduled or is running:

> SQL:
> :   Query the [CURRENT_TASK_GRAPHS](../sql-reference/functions/current_task_graphs.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

To view the history for task graph runs that completed successfully, failed, or were cancelled in the past 60 minutes:

> SQL:
> :   Query the [COMPLETE_TASK_GRAPHS](../sql-reference/functions/complete_task_graphs.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
>
>     Query the [COMPLETE_TASK_GRAPHS view](../sql-reference/account-usage/complete_task_graphs.md) view (in [Account Usage](../sql-reference/account-usage.md)).

## Task costs

The costs associated with running a task to run SQL code differ depending on the source of the compute resources for the task:

User-managed warehouse
:   Snowflake bills your account for [credit usage](cost-understanding-compute.md) based on warehouse usage while a task is
    running, similar to the warehouse usage for running the same SQL statements in a client or the Snowflake web interface. Per-second
    credit billing and warehouse auto-suspend give you the flexibility to start with larger warehouse sizes and then adjust the size to match
    your task workloads.

Serverless compute model
:   Snowflake bills your account based on compute resource usage. Charges are calculated based on your total usage of the resources,
    including cloud service usage, measured in *compute-hours* credit usage. The compute-hours cost changes based on warehouse size and query
    runtime. For more information, see [Serverless credit usage](cost-understanding-compute.md) or [Query: Total serverless task cost](cost-exploring-compute.md).

    Snowflake analyzes task runs in the task history to dynamically determine the correct size and number of the serverless compute
    resources. As Snowflake automatically scales up and down resources to manage your task runs, the cost to run the task runs scales
    proportionally.

    To learn how many credits are consumed by tasks, refer to the “Serverless
    Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

    Consider the following best practices to optimize for cost when you create tasks:

    * Set the SCHEDULE to run less frequently.
    * Use the auto-suspend and auto-retry parameters to prevent resource waste on failing tasks.
    * Set up [Triggered tasks](tasks-triggered.md) for tasks that only need to run under certain conditions, such as when a data stream has new data.
    * Create a budget and alert on spend limits for serverless features. For more information, see [Monitor credit usage with budgets](budgets.md).

    To retrieve the current credit usage for a specific task, query the [SERVERLESS_TASK_HISTORY](../sql-reference/functions/serverless_task_history.md) table
    function. Execute the following statement as the task owner, where `<database_name>` is the database that contains the task and `<task_name>` is the name of the task:

    ```sqlexample
    SET num_credits = (SELECT SUM(credits_used)
      FROM TABLE(<database_name>.information_schema.serverless_task_history(
        date_range_start=>dateadd(D, -1, current_timestamp()),
        date_range_end=>dateadd(D, 1, current_timestamp()),
        task_name => '<task_name>')
        )
      );
    ```

    To retrieve the current credit usage for all serverless tasks, query the
    [SERVERLESS_TASK_HISTORY](../sql-reference/account-usage/serverless_task_history.md) view. Execute the following statement as an account administrator:

    ```sqlexample
    SELECT start_time,
      end_time,
      task_id,
      task_name,
      credits_used,
      schema_id,
      schema_name,
      database_id,
      database_name
    FROM snowflake.account_usage.serverless_task_history
    ORDER BY start_time, task_id;
    ```

## Monitor cost

Serverless tasks incur [compute cost](cost-understanding-compute.md) when in use.
You can use cost-related views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas to track the costs associated with serverless tasks.
When querying these views, filter on the `service_type` column to find `SERVERLESS_TASK` or `SERVERLESS_TASK_FLEX` values.

| View | Schema | `service_type` | Roles with required privileges |
| --- | --- | --- | --- |
| [METERING_HISTORY](../sql-reference/account-usage/metering_history.md) | ACCOUNT_USAGE | SERVERLESS_TASK | ACCOUNTADMIN role USAGE_VIEWER database role |
| [METERING_DAILY_HISTORY](../sql-reference/account-usage/metering_daily_history.md) | ACCOUNT_USAGE | SERVERLESS_TASK | ACCOUNTADMIN role USAGE_VIEWER database role |
| [METERING_DAILY_HISTORY](../sql-reference/organization-usage/metering_daily_history.md) | ORGANIZATION_USAGE | SERVERLESS_TASK | ACCOUNTADMIN role USAGE_VIEWER database role |
| [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) | ORGANIZATION_USAGE | SERVERLESS_TASK | ORGADMIN role GLOBALORGADMIN role ORGANIZATION_USAGE_VIEWER database role |

**Example:** View the total account cost that serverless tasks incurred across the organization.

Example: View the total account cost that serverless task incurred between December 1, 2024 and December 31, 2024.

```sqlexample
SELECT
 name,
 SUM(credits_used_compute) AS total_credits
FROM
  SNOWFLAKE.ACCOUNT_USAGE.METERING_HISTORY
WHERE
 service_type ILIKE '%SERVERLESS_TASK%'
 AND start_time >= '2024-12-01'
 AND end_time <= '2024-12-31'
GROUP BY
 name
ORDER BY
 name ASC;
```

**Example:** View the total account cost that serverless tasks incurred across the organization.

```sqlexample
SELECT
  usage_date AS date,
  account_name,
  SUM(usage) AS credits,
  currency,
  SUM(usage_in_currency) AS usage_in_currency
FROM
  SNOWFLAKE.ORGANIZATION_USAGE.USAGE_IN_CURRENCY_DAILY
WHERE
  USAGE_TYPE ILIKE '%SERVERLESS_TASK%'
GROUP BY
  usage_date, account_name, currency
ORDER BY
  USAGE_DATE DESC;
```

For information about how many credits are charged per Compute-Hour for the operation of the Trust Center, see Table 5 in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Task duration

Task duration includes the time from when a task is scheduled to start to when it completes. This duration includes both of the following:

* **Queuing time:** The time a task spends waiting for compute resources to become available before it begins. To calculate queueing time, query [TASK_HISTORY view](../sql-reference/account-usage/task_history.md) and compare SCHEDULED_TIME with QUERY_START_TIME.
* **Execution time:** The time taken by the task to run its SQL statements or other operations. To calculate run time, query [TASK_HISTORY view](../sql-reference/account-usage/task_history.md), and compare QUERY_START_TIME with COMPLETED_TIME.

For example, the following diagram shows a serverless task that is scheduled to run every 15 seconds. The total duration of this task run is 12 seconds, which includes 5 seconds of queuing time and 7 seconds of run time.

### Timeouts

If a task run exceeds the scheduled time or target completion interval, by default, the task continues to run until it is complete, it times out, or it fails.

When both [STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) and [USER_TASK_TIMEOUT_MS](../sql-reference/parameters.md) are set, the timeout is the lowest non-zero value of the two parameters.

When both [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) and USER_TASK_TIMEOUT_MS are set, the value of USER_TASK_TIMEOUT_MS takes precedence.

For information about timeouts with task graphs, see [Task graph timeouts](tasks-graphs.md).

### Considerations

* For serverless tasks, Snowflake automatically scales resources to make sure tasks complete within a target completion interval, including queueing time.
* For user-managed tasks, longer queueing periods are common when tasks are scheduled to run on a shared or busy warehouse.

## Task security

To run tasks, you must have the correct access privileges. This section describes how to manage access to tasks.

For information about task graph ownership, see [Manage task graph ownership](tasks-graphs.md).

### Access control privileges

#### Creating tasks

Creating tasks requires a role with a minimum of the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Account | EXECUTE MANAGED TASK | Required only for tasks that rely on serverless compute resources. |
| Database | USAGE |  |
| Schema | USAGE, CREATE TASK |  |
| Warehouse | USAGE | Required only for tasks that rely on user-managed warehouses. |

#### Running tasks

After a task is created, the task owner must have the following privileges for the task to run:

| Object | Privilege | Notes |
| --- | --- | --- |
| Account | EXECUTE TASK | Required to run any tasks the role owns. Revoking the EXECUTE TASK privilege on a role prevents all subsequent task runs from starting under that role. |
| Account | EXECUTE MANAGED TASK | Required only for tasks that rely on serverless compute resources. |
| Database | USAGE |  |
| Schema | USAGE |  |
| Task | USAGE |  |
| Warehouse | USAGE | Required only for tasks that rely on user-managed warehouses. |

In addition, the role must have the permissions required to run the SQL statement that the task runs.

> **Note:**
>
> By default, Snowflake runs tasks by using the system user with the privileges of the task owner role.
> To run a task as a specific user, configure the task with EXECUTE AS USER. For more information, see Run tasks with user privileges.

#### Viewing task history

To view tasks, you must have one or more of the following privileges:

* The ACCOUNTADMIN role
* The OWNERSHIP privilege on the task
* The global MONITOR EXECUTION privilege

#### Resuming or suspending tasks

In addition to the task owner, a role that has the OPERATE privilege on the task can suspend or resume the task. This role must have the
USAGE privilege on the database and schema that contain the task. No other privileges are required.

When a task is resumed, Snowflake verifies that the task owner role has the privileges listed in Running tasks.

### Create custom roles to manage task permissions

With custom roles you can easily manage permissions granted to each account or role in Snowflake. To make changes to permissions for all accounts or roles using the custom role, update the custom role. Or, revoke permissions by removing the custom role.

#### Create a custom role to create tasks

Snowflake requires different permissions to create serverless and user-managed tasks.

For example, to create user-managed tasks, create a custom role named `warehouse_task_creation`
and grant that role the CREATE TASK and USAGE privileges on the warehouse that the role can create tasks in.

SQLPython

```sqlexample
USE SYSADMIN;

CREATE ROLE warehouse_task_creation
  COMMENT = 'This role can create user-managed tasks.';
```

```python
from snowflake.core.role import Role

root.session.use_role("SYSADMIN")

my_role = Role(
    name="warehouse_task_creation",
    comment="This role can create user-managed tasks."
)
root.roles.create(my_role)
```

SQLPython

```sqlexample
USE ACCOUNTADMIN;

GRANT CREATE TASK
  ON SCHEMA schema1
  TO ROLE warehouse_task_creation;
```

```python
from snowflake.core.role import Securable

root.session.use_role("ACCOUNTADMIN")

root.roles['warehouse_task_creation'].grant_privileges(
    privileges=["CREATE TASK"], securable_type="schema", securable=Securable(name='schema1')
)
```

SQLPython

```sqlexample
GRANT USAGE
  ON WAREHOUSE warehouse1
  TO ROLE warehouse_task_creation;
```

```python
from snowflake.core.role import Securable

root.roles['warehouse_task_creation'].grant_privileges(
    privileges=["USAGE"], securable_type="warehouse", securable=Securable(name='warehouse1')
)
```

As an example of a role that can create serverless tasks; create a custom role named `serverless_task_creation` and grant the role the CREATE TASK privilege and the account level EXECUTE MANAGED TASK privilege.

SQLPython

```sqlexample
USE SYSADMIN;

CREATE ROLE serverless_task_creation
  COMMENT = 'This role can create serverless tasks.';
```

```python
from snowflake.core.role import Role

root.session.use_role("SYSADMIN")

my_role = Role(
    name="serverless_task_creation",
    comment="This role can create serverless tasks."
)
root.roles.create(my_role)
```

SQLPython

```sqlexample
USE ACCOUNTADMIN;

GRANT CREATE TASK
  ON SCHEMA schema1
  TO ROLE serverless_task_creation;
```

```python
from snowflake.core.role import Securable

root.session.use_role("ACCOUNTADMIN")

root.roles['serverless_task_creation'].grant_privileges(
    privileges=["CREATE TASK"], securable_type="schema", securable=Securable(name='schema1')
)
```

SQLPython

```sqlexample
GRANT EXECUTE MANAGED TASK ON ACCOUNT
  TO ROLE serverless_task_creation;
```

```python
root.roles['serverless_task_creation'].grant_privileges(
    privileges=["EXECUTE MANAGED TASK"], securable_type="account"
)
```

#### Create a custom role to administer tasks

Create a custom role, grant it the EXECUTE TASK privilege, and then grant this custom role to any task owner role to allow altering their own
tasks. To remove the ability for the task owner role to run the task, revoke this custom role from the task owner role.

For example, create a custom role name `taskadmin` and grant that role the EXECUTE TASK privilege. Assign the `taskadmin` role to a
task owner role named `myrole`:

SQLPython

```sqlexample
USE ROLE securityadmin;

CREATE ROLE taskadmin;
```

```python
from snowflake.core.role import Role

root.session.use_role("securityadmin")

root.roles.create(Role(name="taskadmin"))
```

Set the active role to ACCOUNTADMIN before granting the account-level privileges to the new role

SQLPython

```sqlexample
USE ROLE accountadmin;

GRANT EXECUTE TASK, EXECUTE MANAGED TASK ON ACCOUNT TO ROLE taskadmin;
```

```python
root.session.use_role("accountadmin")

root.roles['taskadmin'].grant_privileges(
    privileges=["EXECUTE TASK", "EXECUTE MANAGED TASK"], securable_type="account"
)
```

Set the active role to SECURITYADMIN to show that this role can grant a role to another role

SQLPython

```sqlexample
USE ROLE securityadmin;

GRANT ROLE taskadmin TO ROLE myrole;
```

```python
from snowflake.core.role import Securable

root.session.use_role("securityadmin")

root.roles['myrole'].grant_role(role_type="ROLE", role=Securable(name='taskadmin'))
```

For more information about how to create custom roles and role hierarchies, see [Configuring access control](security-access-control-configure.md).

#### Drop a task owner role

When you delete the owner role of a task, the task transfers ownership to the role that dropped the owner role. When a task transfers
ownership, it is automatically paused and new task runs aren’t scheduled until the new owner resumes the task.

If you drop the role while the task is running, the task run completes processing under the dropped role.

### Tasks run by a system service

By default, tasks run as a system service that is decoupled from a user.

The system service runs the task using the same privileges as the task owner.

This avoids complications associated with user management: for example, if a user is dropped, locked due to authentication issues, or has roles removed, the task continues to run without interruption.

The query history for task runs are associated with the system service. There are no user credentials for this service, and no individual can assume its identity. Activity for the system service is limited to your account. The same encryption protections and other security protocols are built into this service as are enforced for other operations.

### Run tasks with user privileges

Tasks can be configured to run with the privileges of a specific user,
in addition to privileges of the task owner role. Tasks that specify EXECUTE AS USER run on behalf of the named user, instead of the system service.

* **Manage multi-role privileges**: In situations where users have secondary roles, users can run a task using the combined privileges of their primary and secondary roles. This configuration ensures that the task has the necessary permissions to access all required resources.
* **Leverage user-based data masking and row access policies**: In situations where data governance policies consider the querying user, running a task as a user ensures the task is compatible with the applicable policies.
* **Provide accountability for all operations**: All instances of a task that are run with EXECUTE AS USER are attributed to the configured user instead of the SYSTEM user. This attribution helps maintain a clear audit trail for all operations.

#### Access control

The owner role of the task must be granted the IMPERSONATE privilege on the user specified by EXECUTE AS USER, and the specified user must be granted the owner role of the task.

When the task runs, the primary role of the task session will be the owner role of the task, and the user’s default secondary roles will be activated. Users will be able to switch primary roles with the [USE ROLE](../sql-reference/sql/use-role.md) command and adjust the secondary roles in the task session with the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md) command.

#### Share tasks by using a service user and role

For production environments, we recommend that you create a separate service user to represent your team or business process. In contrast to running as an existing service or person user, this best practice helps make the workflow more secure:

* When a task runs as a dedicated service user, it gains access only to the intended privileges. If instead, a user impersonates a different user, they gain access to all privileges associated with the other user, which might include unintended privileges, including user privileges granted after creating and resuming the task.
* A task running as a user might be interrupted if the person leaves the department or organization.

#### Examples: Set up the service user and team role

1. Using the admin role, set up a service user to be used for the task.

   The following example creates a service user named `task_user`:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE USER task_user;
   ```
2. Create a task role, and then grant it to the service user:

   ```sqlexample
   CREATE ROLE task_role;
   GRANT ROLE task_role to USER task_user;
   ```
3. Allow the task role to run queries on behalf of the team user role:

   ```sqlexample
   GRANT IMPERSONATE ON USER task_user TO ROLE task_role;
   ```
4. Grant appropriate privileges to the task role.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   -- Grant the team role the privileges to create tasks in a specific schema
   GRANT CREATE TASK
     ON SCHEMA schema1
     TO ROLE task_role;

   -- Grant the team role the privileges to use a specific warehouse
   GRANT USAGE
     ON WAREHOUSE warehouse1
     TO ROLE task_role;

   -- Grant the team role the privileges to run tasks on a serverless compute model
   GRANT EXECUTE MANAGED TASK ON ACCOUNT TO ROLE task_role;
   ```

#### Run a task on behalf of a service user

After the team role has ownership of the task, team members can modify the task, and run it on behalf of the service user.

**Example:**

```sqlexample
USE ROLE task_owner;

CREATE TASK team_task
  SCHEDULE='12 HOURS'
  EXECUTE AS USER task_user
  AS SELECT 1;
```

In the previous example, the resulting logs would show that `task_user` modified the task.

#### (For testing only) Allow a user to impersonate another user directly

When you test or prototype changes, you, as an administrator, can allow users to directly impersonate another user. This scenario, while supported, isn’t recommended in a production environment.

1. Set up a role for impersonation:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE janes_role;
   GRANT ROLE janes_role to USER jane;
   GRANT IMPERSONATE ON USER jane TO ROLE janes_role;
   ```
2. Create a task by using the new role:

   ```sqlexample
   USE ROLE janes_role;

   CREATE TASK janes_task
     SCHEDULE='60 M' AS SELECT 1;
   ```
3. Grant the role to another user.

   In the following example, the user Jane grants access to the user Billy:

   ```sqlexample
   --Logged in as Jane or account admin
   GRANT ROLE janes_role to USER billy;
   ```
4. The other user modifies the task.

   In the following example, the user Billy modifies the task:

   ```sqlexample
   -- Logged in as billy
   USE ROLE janes_role;

   ALTER TASK janes_task
     SET EXECUTE AS USER jane;
   ```
5. Review the logs.

   The [SHOW GRANTS TO ROLE](../sql-reference/sql/show-grants.md) command would show that Jane granted the role to Billy. The
   [QUERY_HISTORY](../sql-reference/functions/query_history.md) view would then show that Billy modified the task. Future task runs would still appear as run by Jane.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SHOW GRANTS TO ROLE janes_role;

   QUERY_HISTORY()
     WHERE QUERY_TEXT ILIKE '%janes_task%';
   ```

## Task Data Definition Language (DDL) operations

To support creating and managing tasks, Snowflake provides the following set of special DDL operations:

SQLPython

* [CREATE TASK](../sql-reference/sql/create-task.md)
* [ALTER TASK](../sql-reference/sql/alter-task.md)
* [DROP TASK](../sql-reference/sql/drop-task.md)
* [DESCRIBE TASK](../sql-reference/sql/desc-task.md)
* [SHOW TASKS](../sql-reference/sql/show-tasks.md)

* [TaskCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskCollection)
* [TaskResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskResource)
* [TaskResource.drop](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskResource)
* [TaskResource.fetch](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskResource)
* [TaskCollection.iter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskCollection)

In addition, providers can view, grant, or revoke access to the necessary database objects for ELT using the following standard access
control DDL:

SQLPython

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](../sql-reference/sql/revoke-privilege.md)
* [SHOW GRANTS](../sql-reference/sql/show-grants.md)

[DatabaseRoleResource](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.database_role.DatabaseRoleResource) methods:

* `grant_future_privileges`
* `grant_privileges`
* `grant_privileges_on_all`
* `grant_role`
* `iter_future_grants_to`
* `iter_grants_to`
* `revoke_future_privileges`
* `revoke_grant_option_for_future_privileges`
* `revoke_grant_option_for_privileges`
* `revoke_grant_option_for_privileges_on_all`
* `revoke_privileges`
* `revoke_privileges_on_all`
* `revoke_role`

[RoleResource](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.role.RoleResource) (account role) methods:

* `grant_future_privileges`
* `grant_privileges`
* `grant_privileges_on_all`
* `grant_role`
* `iter_future_grants_to`
* `iter_grants_of`
* `iter_grants_on`
* `iter_grants_to`
* `revoke_future_privileges`
* `revoke_grant_option_for_future_privileges`
* `revoke_grant_option_for_privileges`
* `revoke_grant_option_for_privileges_on_all`
* `revoke_privileges`
* `revoke_privileges_on_all`
* `revoke_role`

[UserResource](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource) methods:

* `grant_role`
* `iter_grants_to`
* `revoke_role`

## Task functions

To support retrieving information about tasks, Snowflake provides the following set of functions:

SQLPython

* [SYSTEM$CURRENT_USER_TASK_NAME](../sql-reference/functions/system_current_user_task_name.md)
* [SYSTEM$TASK_RUNTIME_INFO](../sql-reference/functions/system_task_runtime_info.md)
* [TASK_HISTORY](../sql-reference/functions/task_history.md)
* [TASK_DEPENDENTS](../sql-reference/functions/task_dependents.md)

* [TaskContext.get_current_task_name](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.context.TaskContext)
* [TaskContext.get_runtime_info](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.context.TaskContext)
* [TaskResource.fetch_task_dependents](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.TaskResource)

## More Python examples

For more Python examples, see [Managing Snowflake tasks and task graphs with Python](../developer-guide/snowflake-python-api/snowflake-python-managing-tasks.md).

---
title: Introduction to unstructured data
source: https://docs.snowflake.com/en/user-guide/unstructured-intro.md
section: User Guide
---

# Introduction to unstructured data

Unstructured data is information that does not fit into a predefined data model or schema. Typically text-heavy, such as form responses and
social media conversations, unstructured data also encompasses images, video, and audio. Industry-specific file types such as VCF
(genomics), KDF (semiconductors), or HDF5 (aeronautics) are included in this category.

Snowflake supports the following actions:

* Securely access data files located in cloud storage.
* Share file access URLs with collaborators and partners.
* Load file access URLs and other file metadata into Snowflake tables.
* Process unstructured data.

This topic introduces key concepts and provides instructions for accessing, sharing, and processing unstructured data files.

## Cloud Storage service support

Both external (external cloud storage) and internal (i.e. Snowflake) stages support unstructured data.

External stages:
:   Store files in external cloud storage: Amazon S3, Google Cloud Storage, or one of the supported Microsoft Azure cloud storage services:

    * Blob storage
    * Data Lake Storage Gen2
    * General-purpose v1
    * General-purpose v2

## Types of URLs available to access files

The following types of URLs are available to access files in cloud storage:

Scoped URL:
:   Encoded URL that permits temporary access to a staged file without granting privileges to the stage.

    The URL expires when the [persisted query result period](querying-persisted-results.md) ends (i.e. the results cache
    expires), which is currently 24 hours.

File URL:
:   URL that identifies the database, schema, stage, and file path to a set of files. A role that has sufficient privileges on the stage can
    access the files.

Pre-signed URL:
:   Simple HTTPS URL used to access a file via a web browser. A file is temporarily accessible to users via this URL using a pre-signed
    access token. The expiration time for the access token is configurable.

The following table describes key characteristics of these URL types:

|  | Scoped URL | File URL | Pre-signed URL |
| --- | --- | --- | --- |
| Use cases | Recommended for file administrators to give scoped access to data files to specific roles in the same account. Provide access to the files with a view that retrieves scoped URLs. Only roles that have privileges on the view can access the files. Snowflake records information in the query history about who uses a scoped URL to access a file, and when. Ideal for use in custom applications, for providing unstructured data to other accounts through a share, or for downloading and analysis of unstructured data in Snowsight. | Permanent URL to a file on a stage. To download or access a file, users send the file URL in a GET request to the REST API endpoint along with the authorization token. Ideal for custom applications that require access to unstructured data files. | Used to download or access files without authenticating into Snowflake or passing an authorization token. Pre-signed URLs are open; any user or application can directly access or download the files. Ideal for business intelligence applications or reporting tools that need to display the unstructured file contents. |
| How to generate | Query the [BUILD_SCOPED_FILE_URL](../sql-reference/functions/build_scoped_file_url.md) function. | Either Query the directory table for the stage that references the staged files or call the [BUILD_STAGE_FILE_URL](../sql-reference/functions/build_stage_file_url.md) function. | Query the [GET_PRESIGNED_URL](../sql-reference/functions/get_presigned_url.md) function. |
| Usage | The following options are available:   * In Snowsight, click on a scoped URL in the query results table. Snowsight retrieves the file only for the user who generated the   scoped URL. * Send a scoped URL in a GET request to the file support REST API endpoint. For information, see [REST API for unstructured data support](data-load-unstructured-rest-api.md). | The following options are available:   * In Snowsight, click on a file URL in the query results table. Snowsight retrieves the file only if the active role has sufficient   privileges. * Send a file URL in a GET request to the file support REST API endpoint. For information, see [REST API for unstructured data support](data-load-unstructured-rest-api.md). | The following options are available:   * In Snowsight, click on a pre-signed URL in the query results table. * Navigate to the pre-signed URL directly in a web browser. |
| [Snowflake Secure Data Sharing](../guides-overview-sharing.md) | Unstructured data files can be accessed by data consumers via column values of this type in secure views shared by data providers. | Unstructured data files cannot be accessed by data consumers via column values of this type in secure views shared by data providers. | Unstructured data files can be accessed by data consumers via column values of this type in secure views shared by data providers. |
| Authorization | Only the user who generates a scoped URL can use the URL to access the referenced file. | Role specified in the GET REST API call must have sufficient privileges on the stage: USAGE (external stage) or READ (internal stage). | Any person who has the pre-signed URL can access the referenced file for the life of the token. |
| Expiration | Expiration period for the query results cache (currently 24 hours). | Permanent. | Length of time specified in the `expiration_time` argument. |

## Server-side encryption for unstructured data access

To enable unstructured data access on an internal stage, you can consider using server-side encryption when you create the stage.
Otherwise, staged files will be client-side encrypted by default. The encryption keys are owned by Snowflake,
and client-side encrypted files are unreadable by users and external tools using pre-signed, file, or scoped URLs.

To configure server-side encryption for an internal stage, specify the `SNOWFLAKE_SSE` encryption type in the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.
See [Internal stage parameters (internalStageParams)](../sql-reference/sql/create-stage.md) for more information.

The following example creates an internal stage named `my_int_stage` with server-side encryption and a directory table.

```sqlexample
CREATE STAGE my_int_stage
  ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')
  DIRECTORY = ( ENABLE = true );
```

> **Important:**
>
> If you require Tri-Secret Secure for security compliance, use the `SNOWFLAKE_FULL` encryption type for internal stages.
> `SNOWFLAKE_SSE` does not support Tri-Secret Secure.

> **Note:**
>
> * You cannot change the encryption type for an internal stage after you create the stage.
> * Currently, creating internal stages with server-side encryption is limited to the following Snowflake client versions: JDBC Driver v3.12.11 (or higher)

## Directory tables

Directory tables store a catalog of staged files in cloud storage. Roles with sufficient privileges can query a directory table to retrieve
file URLs to access the staged files.

For details, see [Directory tables](data-load-dirtables.md).

## SQL functions

The following [File functions](../sql-reference/functions-file.md) are provided to access data files:

| SQL Function | Description |
| --- | --- |
| [GET_STAGE_LOCATION](../sql-reference/functions/get_stage_location.md) | Returns the URL for an external or internal named stage using the stage name as the input. |
| [GET_RELATIVE_PATH](../sql-reference/functions/get_relative_path.md) | Extracts the path of a staged file relative to its location in the stage using the stage name and absolute file path in cloud storage as inputs. |
| [GET_ABSOLUTE_PATH](../sql-reference/functions/get_absolute_path.md) | Returns the absolute path of a staged file using the stage name and path of the file relative to its location in the stage as inputs. |
| [GET_PRESIGNED_URL](../sql-reference/functions/get_presigned_url.md) | Generates the pre-signed URL to a staged file using the stage name and relative file path as inputs. Access files in an external stage using the function. |
| [BUILD_SCOPED_FILE_URL](../sql-reference/functions/build_scoped_file_url.md) | Generates a scoped Snowflake file URL to a staged file using the stage name and relative file path as inputs. |
| [BUILD_STAGE_FILE_URL](../sql-reference/functions/build_stage_file_url.md) | Generates a Snowflake file URL to a staged file using the stage name and relative file path as inputs. |
| [TO_FILE](../sql-reference/functions/to_file.md) | Returns a FILE object that represents a file stored in an internal or external stage. |
| [TRY_TO_FILE](../sql-reference/functions/try_to_file.md) | Returns a FILE object as with TO_FILE, but returns NULL if the file does not exist or is not accessible. |

## Download staged files in Snowsight

### Download a generated scoped, pre-signed, or file URL

Users can select a generated scoped, pre-signed, or file URL in the results table of a [Snowsight](ui-snowsight-gs.md)
worksheet and download the referenced file.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets » My Worksheets, or open a local worksheet by navigating to Recent
   or Folders » *<worksheet_name>*.
3. Return a scoped, pre-signed, or file URL in a query using any one of the supported methods.
4. Select the URL in the results table. Snowsight downloads the file referenced by the URL.

### Download from an internal stage

Users can download a file from the internal stage directly from Snowsight.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Navigate to your file on the internal stage. For more information about finding files, see [Viewing staged files](data-load-local-file-system-stage-ui.md).
3. Select the  button, and select Download.

## Features to process unstructured data

Snowflake supports the following features to help you process unstructured data.

External Functions
:   External functions are user-defined functions that you store and execute outside of Snowflake. With external functions, you can use libraries such as Amazon Textract, Document AI, or Azure Computer Vision that cannot be accessed from internal user-defined functions (UDFs).

    For more information, see [Writing external functions](../sql-reference/external-functions.md).

User-defined Functions and Stored Procedures
:   Snowflake supports multiple ways to read a file within Java or Python code so that you can process unstructured data or use your own machine learning models in user-defined functions (UDFs), user-defined table functions (UDTFs), or stored procedures.

    You can [extend the SQL that you use in Snowflake](../developer-guide/extensibility.md), or build an application using the [Snowpark API](../developer-guide/snowpark/index.md).

    See the following topics for more information and examples.

    > * [Process unstructured data with UDF and procedure handlers](unstructured-data-java.md)
    > * [Reading a File with a Python UDF Handler](../developer-guide/udf/python/udf-python-examples.md)
    > * [Reading files with a Python Stored Procedure Handler](../developer-guide/stored-procedure/python/procedure-python-read-files.md)
    > * [Reading Files with the Snowpark API for Java](../developer-guide/snowpark/java/creating-udfs.md)
    > * [Reading Files with the Snowpark API for Python](../developer-guide/snowpark/python/creating-udfs.md)

## FILE data type

The [FILE data type](../sql-reference/data-types-unstructured.md) represents a file stored in an internal or external stage. Some built-in
functions accept a FILE object in place of a stage name and file path. Use the [TO_FILE](../sql-reference/functions/to_file.md)
or [TRY_TO_FILE](../sql-reference/functions/try_to_file.md) function to convert a file’s location in a stage to a FILE object.

---
title: IRAP (Protected)
source: https://docs.snowflake.com/en/user-guide/cert-irap.md
section: User Guide
---

# IRAP (Protected)

This topic describes how Snowflake supports customers with IRAP compliance requirements.

> **Note:**
>
> Snowflake supports the IRAP certification in certain regions within each cloud platform.
>
> For details, refer to [Asia Pacific and China](intro-regions.md).

## Understanding IRAP compliance requirements

The Infosec Registered Assessors Program, or IRAP, is a program governed by the Australian Signals Directorate (ASD) of the Australian
Government which endorses suitably-qualified cyber security professionals to provide relevant services which aim to secure broader industry
and Australian Government systems and data.

IRAP provides a security framework and an assessment methodology that enables Australian Government agencies and their customers to
validate Snowflake’s security control implementations and compliance against those requirements defined within the Australian Government
Information Security Manual (ISM) developed by the Australian Signals Directorate (ASD). Snowflake employs IRAP assessors to validate the
effectiveness of Snowflake Australian systems against the Information Security Manual at the Protected level.

---
title: IRS Publication 1075
source: https://docs.snowflake.com/en/user-guide/cert-irspub1075.md
section: User Guide
---

# IRS Publication 1075

This topic describes how Snowflake supports customers with IRS Publication 1075 compliance requirements.

## Understanding IRS Publication 1075 compliance requirements

Snowflake’s [U.S. regions supporting public sector workloads](intro-regions.md) are ready and able to support customer compliance with the
Internal Revenue Service (IRS) Office of Safeguards Publication 1075. The IRS Publication 1075 provides guidance to
ensure that the policies, practices, controls, and safeguards employed by recipient agencies, agents, or contractors
adequately protect the confidentiality of Federal Tax Information (FTI). Snowflake recognizes the importance of protecting
FTI and works collaboratively with customers to satisfy IRS Publication 1075 requirements.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: ISO-27001
source: https://docs.snowflake.com/en/user-guide/cert-iso-27001.md
section: User Guide
---

# ISO-27001

This topic describes how Snowflake supports customers with ISO-27001 compliance requirements.

## Understanding ISO-27001 compliance requirements

The International Organization for Standardization provides requirements for establishing, implementing, maintaining, and continually
improving an information security management system. Snowflake’s ISO Certificate is available for download from the
[Snowflake Compliance Center](https://trust.snowflake.com). The statement of applicability also includes control objectives from the
ISO 27017:2015 & ISO 27018:2019 framework.

ISO/IEC 27001:2013 specifies the requirements for establishing, implementing, maintaining and continually improving an information
security management system within the context of the organization. It also includes requirements for the assessment and treatment of
information security risks tailored to the needs of the organization.

---
title: ISO-27017
source: https://docs.snowflake.com/en/user-guide/cert-iso-27017.md
section: User Guide
---

# ISO-27017

This topic describes how Snowflake supports customers with ISO-27017 compliance requirements.

## Understanding ISO-27017 compliance requirements

The International Organization for Standardization provides requirements for establishing, implementing, maintaining, and continually
improving an information security management system. Snowflake’s ISO Certificate is available for download from the
[Snowflake Compliance Center](https://trust.snowflake.com). The statement of applicability also includes control objectives from the
ISO 27017:2015 & ISO 27018:2019 framework.

ISO 27017:2015 provides guidance for information security controls applicable specifically to cloud service provisioning and usage.

---
title: ISO-27018
source: https://docs.snowflake.com/en/user-guide/cert-iso-27018.md
section: User Guide
---

# ISO-27018

This topic describes how Snowflake supports customers with ISO-27018 compliance requirements.

## Understanding ISO-27018 compliance requirements

The International Organization for Standardization provides requirements for establishing, implementing, maintaining, and continually
improving an information security management system. Snowflake’s ISO Certificate is available for download from the
[Snowflake Compliance Center](https://trust.snowflake.com). The statement of applicability also includes control objectives from the
ISO 27017:2015 & ISO 27018:2019 framework.

ISO/IEC 27018:2019 is a code of practice concerned with the protection of personally identifiable information (PII) in public clouds in
accordance with the privacy principles in ISO/IEC 29100.

---
title: ISO-9001:2015
source: https://docs.snowflake.com/en/user-guide/cert-iso-9001_2015.md
section: User Guide
---

# ISO-9001:2015

This topic describes how Snowflake supports customers with ISO-9001:2015 compliance requirements for a quality management system.

## Understanding ISO-9001 compliance requirements

The ISO 9001 standard sets forth universally acknowledged benchmarks for quality management, shaping the foundation for Snowflake to
enhance its operations. Implementing ISO 9001 not only improves overall performance but also enables Snowflake to surpass customer
expectations. By adhering to its stringent requirements, Snowflake showcases an unwavering commitment to quality. ISO 9001 guides the
establishment, implementation, maintenance, and continuous improvement of a robust Quality Management System (QMS) at Snowflake,
solidifying Snowflake’s commitment to delivering exceptional quality.

---
title: ITAR
source: https://docs.snowflake.com/en/user-guide/cert-itar.md
section: User Guide
---

# ITAR

This topic describes how Snowflake supports customers with ITAR compliance requirements.

> **Note:**
>
> Snowflake supports the ITAR certification in certain regions within each cloud platform.
>
> For details, refer to [SnowGov Regions](intro-regions.md).

## Understanding ITAR compliance requirements

The International Traffic in Arms Regulations
([ITAR](https://www.pmddtc.state.gov/ddtc_public?id=ddtc_kb_article_page&sys_id=24d528fddbfc930044f9ff621f961987)) is a United States
compliance standard that controls and restricts access and export of military and defense articles, services, and related technologies.

Companies that manufacture, provide, or distribute goods and services as specified on the United States Munitions List
([USML](https://www.ecfr.gov/cgi-bin/retrieveECFR?gp=&SID=70e390c181ea17f847fa696c47e3140a&mc=true&r=PART&n=pt22.1.121)) may be subject
to ITAR.

Snowflake supports customer ITAR compliance in the Snowflake [SnowGov regions](intro-regions.md) by limiting region access to
vetted Snowflake employees and contractors who are eligible to support ITAR workloads. For clarity, customers may not use the Snowflake
Service or the Snowflake Government Regions in connection with U.S. “classified information” (e.g. confidential, secret, or top secret).

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

## Export-controlled Data and Cross-region Features

Unlike commercial regions, where multiple regions belong to the same [region group](admin-account-identifier.md), each government region
belongs to its own region group to maintain similar security controls, isolation, and compliance across that group. This distinction is
important because, by default, certain features and functionality are limited to the boundaries of a region group.

As an example, [replication](account-replication-intro.md) is only possible to the boundary of a region group. Because a database cannot be replicated to a region outside
of its region group, this default restriction prevents an organization within a government region from sharing data in a commercial region
without first contacting Snowflake to connect the different region groups. Data sharing is available to customers who
belong within the same government region because it is not crossing the boundary of a region group.

If an organization within a government region needs to replicate a database or share data outside of a government region, it must
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to gain access to
other region groups before it can use these features.

## Considerations When Working with ITAR Workloads

Be aware of entering export-controlled data in any Snowflake [metadata fields](../sql-reference/metadata.md).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Join policies
source: https://docs.snowflake.com/en/user-guide/join-policies.md
section: User Guide
---

# Join policies

A join policy is a schema-level object that controls the requirement to join a table when it is queried. When a join
policy is applied to a table, queries against that table either require or do not require a join. When joins
are required, they can be restricted to certain joining columns.

You can use this kind of policy to enforce joins for queries on certain tables and views, as a means of finding common information across shared tables. A table or view with a join policy assigned to it is said to be *protected* or *join-constrained*.

## Overview

A core feature of Snowflake is the ability to share data sets with other entities. Join policies allow a provider (data owner) to exercise control
over what can be done with their data even after it is shared with a consumer. Specifically, the provider can require a consumer of a table to join
the data to another table rather than retrieve records from a single table. This requirement facilitates data sharing among semi-trusted partners
that have common data sets. The selection of data from a single table is not generally useful; meaningful data is available when one owner’s table is
joined with a similar table owned by a consumer or partner.

After the join policy is applied to a table or view, a query must:

* Join the table or view to another table or view.
* Specify a supported [join type](../sql-reference/constructs/join.md).
* Specify a join condition with allowed joining columns.

At least one table or view participating in the join must be unprotected. This restriction prevents attackers from circumventing the join policy by joining two protected tables that are shared by the same organization and have matching key values.

While join policies control access to joined tables, they do not guarantee that a malicious actor could not use carefully constructed queries to obtain
sensitive data from a join-constrained table. With enough query attempts, a malicious actor could potentially work around the join requirements. Join
policies are best suited for use with partners and customers with whom you have an existing level of trust. In addition, providers should be vigilant
about potential misuses of their data (for example, reviewing the access history for their listings).

## Creating and implementing join policies

To create and implement a join policy:

1. Create the policy itself.
2. Apply the policy to a new or existing table or view.
3. Run some queries to verify the expected behavior of the policy.

These steps are explained in the following sections.

You might also want to modify an existing policy and verify that the expected behavior changes.

At any time, you can see the join policies in your account by using the [SHOW JOIN POLICIES](../sql-reference/sql/show-join-policies.md) and [DESCRIBE JOIN POLICY](../sql-reference/sql/desc-join-policy.md) commands. To see the actual definition of a policy, query the [JOIN_POLICIES](../sql-reference/account-usage/join_policies.md) view. To see which tables and views are attached to policies, query the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function.

For information about managing policies, including setup of a custom policy administration role, see Managing join policies.

### Creating a join policy

The simplest form of join policy requires users to always join a table or view to some other table or view. In other words,
queries against a single table, without a join specification, are disallowed. For example, create a policy named `jp1`:

```sqlexample
CREATE JOIN POLICY jp1
  AS () RETURNS JOIN_CONSTRAINT -> JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE);
```

For the complete syntax of this command and its JOIN_CONSTRAINT function, see [CREATE JOIN POLICY](../sql-reference/sql/create-join-policy.md).

### Applying a join policy to a table or view

Having created a join policy, you implement it by assigning it to a specific table or view:

* Use an ALTER TABLE or ALTER VIEW command if the table or view already exists.
* Use a CREATE TABLE or CREATE VIEW command for a new table or view.

In either case, specify the JOIN POLICY parameter to identify the join policy itself. For example, the following
command assigns the policy `jp` to the table `join_table`:

```sqlexample
CREATE OR REPLACE TABLE join_table (
  col1 INT,
  col2 VARCHAR,
  col3 NUMBER )
  JOIN POLICY jp1;
```

Optionally, you can also specify the ALLOWED JOIN KEYS parameter if you want to restrict joins to use specific joining columns. See Restricting joins on specific columns.

A table or view can have only one join policy assigned to it at any given time. See Replacing a join policy.

### Testing the join policy by running some queries

The following queries demonstrate the expected behavior when the `jp1` policy is in effect for the table `join_table`. This table does not
need to be loaded; an empty table is sufficient to demonstrate the expected behavior.

A query without a join returns an expected error:

```sqlexample
SELECT * FROM join_table;
```

```output
506037 (23001): SQL compilation error: Join Policy violation, please contact the policy admin for details
```

A query with an explicit inner join on `col1` returns results:

```sqlexample
SELECT * FROM join_table jt1 INNER JOIN join_table_2 jt2 ON jt1.col1=jt2.col1;
```

```output
+------+------+------+------+------+------+
| COL1 | COL2 | COL3 | COL1 | COL2 | COL3 |
|------+------+------+------+------+------|
+------+------+------+------+------+------+
```

A query with an explicit inner join on `col2` also returns results:

```sqlexample
SELECT * FROM join_table jt1 INNER JOIN join_table_2 jt2 ON jt1.col2=jt2.col2;
```

```output
+------+------+------+------+------+------+
| COL1 | COL2 | COL3 | COL1 | COL2 | COL3 |
|------+------+------+------+------+------|
+------+------+------+------+------+------+
```

### Restricting joins on specific columns

To further test join policy behavior, detach (unset) the policy from the table, drop and recreate the join policy,
then recreate `join_table` with DDL that includes the ALLOWED JOIN KEYS parameter.

```sqlexample
ALTER TABLE join_table UNSET JOIN POLICY;

DROP JOIN POLICY jp1;

CREATE JOIN POLICY jp1
  AS () RETURNS JOIN_CONSTRAINT -> JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE);

CREATE OR REPLACE TABLE join_table (
  col1 INT,
  col2 VARCHAR,
  col3 NUMBER )
  JOIN POLICY jp1 ALLOWED JOIN KEYS (col1);
```

Now try one of the previous queries again, with `col2` as the joining column. The query fails
because `col2` is not an allowed join key.

```sqlexample
SELECT * FROM join_table jt1 INNER JOIN join_table_2 jt2 ON jt1.col2=jt2.col2;
```

```output
506038 (23001): SQL compilation error: Join Policy violation, invalid join condition with reason: Disallowed join key used.
```

The same query with `jt1.col1=jt2.col1` as the join condition would succeed. A natural join of these two tables would fail
because it would attempt to join over `col1` *and* `col2`.

### Showing and describing join policies

You can use [SHOW JOIN POLICIES](../sql-reference/sql/show-join-policies.md) and [DESCRIBE JOIN POLICY](../sql-reference/sql/desc-join-policy.md) commands to get basic information about existing join policies in your account. To return more detailed information about join policies, see Monitoring join policies.

```sqlexample
SHOW JOIN POLICIES;
```

```output
+-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------+
| created_on                    | name | database_name | schema_name    | kind        | owner        | comment | owner_role_type | options |
|-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------|
| 2024-12-04 15:15:49.591 -0800 | JP1  | POLICY1_DB    | POLICY1_SCHEMA | JOIN_POLICY | POLICY1_ROLE |         | ROLE            |         |
+-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------+
```

```sqlexample
DESCRIBE JOIN POLICY jp1;
```

```output
+------+-----------+-----------------+----------------------------------------+
| name | signature | return_type     | body                                   |
|------+-----------+-----------------+----------------------------------------|
| JP1  | ()        | JOIN_CONSTRAINT | JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE) |
+------+-----------+-----------------+----------------------------------------+
```

### Creating and applying conditional join policies

Policy administrators can define the SQL expression of a join policy so different queries have different restrictions based on
factors such as the role of the user executing the query. This strategy can allow one user to query a table without restriction while requiring others to use joins.

For example, the following join policy gives users with the roles `ACCOUNTADMIN`, `FINANCE_ROLE`, or `HR_ROLE` unrestricted access to a table while requiring all other users to specify a join.

> ```sqlexample
> CREATE JOIN POLICY my_join_policy
>   AS () RETURNS JOIN_CONSTRAINT ->
>     CASE
>       WHEN CURRENT_ROLE() = 'ACCOUNTADMIN'
>           OR CURRENT_ROLE() = 'FINANCE_ROLE'
>           OR CURRENT_ROLE() = 'HR_ROLE'
>         THEN JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE)
>       ELSE JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE)
>     END;
> ```
>
> > **Tip:**
> >
> > You can use the following strategies when using context functions such as [CURRENT_ROLE](../sql-reference/functions/current_role.md) in a conditional
> > policy:
> >
> > * Context functions return strings, so comparisons using them are case-sensitive. You can use
> >   [LOWER](../sql-reference/functions/lower.md) to convert strings to all lowercase if you’d like to do a case-insensitive comparison.
> > * The [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function helps you evaluate whether a policy body is returning the correct value
> >   when a context function returns a certain value. The POLICY_CONTEXT function simulates query results based upon a specified value of
> >   one or more context functions.

## Join query requirements

Queries against a join-constrained table or view must conform to the following requirements in order
for a join policy to take effect:

Supported [join](../sql-reference/constructs/join.md) types
:   The following explicit join types are supported for join-constrained tables:

    * INNER JOIN (with the optional INNER keyword; JOIN is required)
    * LEFT OUTER JOIN and RIGHT OUTER JOIN, where the join-constrained table is the “opposite” table. If the
      join-constrained table is the first or “left” table, the outer join must be a RIGHT OUTER JOIN. If the
      join-constrained table is the second or “right” table, the outer join must be a LEFT OUTER JOIN.
    * NATURAL JOIN (inner join over columns with common names)

    Inner and outer joins must be specified explicitly within the FROM clause, with ON or USING join conditions. These joins can’t be specified in the WHERE clause.

Unsupported join types
:   The following join types are not supported:

    * FULL OUTER JOIN
    * ASOF JOIN
    * LATERAL joins
    * Implicit joins in the WHERE clause
    * Cross-joins (Cartesian product)

Joins with multiple join-constrained tables
:   If both (or all) tables in a join query have been assigned a join policy, the query fails with an error. In a join of two tables, only one can be join-constrained.

[UNION, INTERSECT, and EXCEPT](../sql-reference/operators-query.md)

> * UNION and UNION ALL set operations are not supported in queries against join-constrained tables.
> * INTERSECT set operations are supported because they are semantically equivalent to inner joins.
> * MINUS and EXCEPT set operations are supported only when the join-constrained table is on the filtered side of the operator (that is, in the
>   second query block).

Data type conversions
:   A query that includes a data type conversion function in the SELECT statement must use the TRY version of the function. For example, the
    TRY_CAST function is allowed, but the CAST function is prohibited. The following data type conversion functions are allowed for numeric
    types:

    * [TRY_CAST](../sql-reference/functions/try_cast.md)
    * [TRY_TO_DECFLOAT](../sql-reference/functions/try_to_decfloat.md)
    * [TRY_TO_DECIMAL](../sql-reference/functions/try_to_decimal.md)
    * [TRY_TO_DOUBLE](../sql-reference/functions/try_to_double.md)
    * [TRY_TO_NUMBER](../sql-reference/functions/try_to_decimal.md)
    * [TRY_TO_NUMERIC](../sql-reference/functions/try_to_decimal.md)

## Expected errors for queries against join-constrained tables

The following examples show some of the cases where queries are expected to fail because a join policy is applied to a table or view. For background information, see Join query requirements. The tables in these queries do
not contain any rows, so queries return either an empty result (success) or an error (failure).

### Queries without joins fail

When a join policy is assigned to `join_table`, simple queries without joins fail. The following query returns an error.

```sqlexample
SELECT col1, col2 FROM join_table;
```

### WHERE clause joins are prohibited

If `join_table` (alias `jt1`) is a join-constrained table, the following WHERE clause join returns an error:

```sqlexample
SELECT *
  FROM join_table jt1, join_table_2 jt2
  WHERE jt1.col1=jt2.col1;
```

### Left and right outer joins depend on the order of tables

The following examples show the expected behavior with outer joins, where `join_table` (alias `jt1`) is the join-constrained table. The first query returns an error.

```sqlexample
SELECT * FROM join_table jt1
  LEFT OUTER JOIN join_table_2 jt2 ON jt1.col1=jt2.col1;
```

The second query returns results.

```sqlexample
SELECT * FROM join_table jt1
  RIGHT OUTER JOIN join_table_2 jt2 ON jt1.col1=jt2.col1;
```

### Joining two join-constrained tables is not supported

If `join_table` and `join_table_2` both have a join policy assigned, the following join query returns an error:

```sqlexample
SELECT * FROM join_table jt1 JOIN join_table_2 jt2 ON jt1.col1=jt2.col1;
```

### UNION set operations are disallowed, but INTERSECT operations are allowed

In these examples, `join_table` has a join policy but `join_table_3` does not. The UNION query fails, but a similar INTERSECT query succeeds.

```sqlexample
SELECT * FROM JOIN_TABLE
UNION
SELECT * FROM JOIN_TABLE_3;
```

```sqlexample
SELECT * FROM JOIN_TABLE
INTERSECT
SELECT * FROM JOIN_TABLE_3;
```

### EXCEPT behavior depends on the order of query blocks

The EXCEPT behavior depends on the order of query blocks. Note that the
first query selects from the join-constrained table first and returns an error.

```sqlexample
SELECT * FROM JOIN_TABLE
EXCEPT
SELECT * FROM JOIN_TABLE_3;
```

The second query succeeds.

```sqlexample
SELECT * FROM JOIN_TABLE_3
EXCEPT
SELECT * FROM JOIN_TABLE;
```

### A view on a join-constrained table is also protected

Assume that `join_table` has been assigned join policy `jp1`. Create a view on the table:

```sqlexample
CREATE VIEW join_table_view AS
  SELECT * FROM join_table;
```

Now query the view without specifying a join:

```sqlexample
SELECT * FROM join_table_view;
```

The query fails because it violates the policy on `join_table`. The query on the view must contain a join. For
more information about join policy behavior with views, see Views and materialized views.

## Managing join policies

You can modify, replace, detach, describe, and monitor join policies. The following sections cover these management
tasks.

### Modifying a join policy

You can use the [ALTER JOIN POLICY](../sql-reference/sql/alter-join-policy.md) command to modify join policy rules. You can also
rename a policy or change its comment.

Before modifying a join policy, run the [DESCRIBE JOIN POLICY](../sql-reference/sql/desc-join-policy.md) command or
[GET_DDL](../sql-reference/functions/get_ddl.md) function to review the policy’s current SQL expression.

For example, run the following command to update the SQL expression of the join policy `jp3` so that joins are not
required:

```sqlexample
ALTER JOIN POLICY jp3 SET BODY -> JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE);
```

### Replacing a join policy

The recommended method to replace a join policy is to use the `FORCE` parameter in an ALTER TABLE statement,
which detaches the existing policy and assigns a new one in a single command. This approach allows you to atomically replace the old policy, leaving no gap in protection.

For example, to assign a new join policy to a table that is already join-constrained:

```sqlexample
ALTER TABLE join_table SET JOIN POLICY jp2 FORCE;
```

You can also detach the policy from a table or view in one statement (using UNSET JOIN POLICY), then set a new policy
in a different statement (using SET JOIN POLICY). If you choose this method, the table is not protected by a join policy in the interim between the two operations. A query could potentially access sensitive data during this time.

### Detaching a join policy

Use the UNSET JOIN POLICY clause of an ALTER TABLE or ALTER VIEW command to detach a join policy from a table or view. The name of the policy is not required because an object can’t have more than one policy. For example:

```sqlexample
ALTER VIEW join_view UNSET JOIN POLICY;
```

### Monitoring join policies

You can monitor join policy usage in the following ways:

* Query the [JOIN_POLICIES](../sql-reference/account-usage/join_policies.md) view in the Account Usage schema of the
  shared SNOWFLAKE database. This view is a *catalog* for all join policies in your Snowflake account.
* Query the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function to identify join
  policy references and find out which tables and views currently have policies applied to them.

#### Getting information about join policies

To get information about existing join policies, query the [JOIN_POLICIES](../sql-reference/account-usage/join_policies.md) view in the Account Usage schema of the shared SNOWFLAKE database. This view is a *catalog* for all join policies in your Snowflake account. For example, you can return the policy body for a specific join policy:

```sqlexample
SELECT policy_name, policy_body, created
  FROM SNOWFLAKE.ACCOUNT_USAGE.JOIN_POLICIES
  WHERE policy_name='JP2' AND created LIKE '2024-11-26%';
```

```output
+-------------+----------------------------------------------------------+-------------------------------+
| POLICY_NAME | POLICY_BODY                                              | CREATED                       |
|-------------+----------------------------------------------------------+-------------------------------|
| JP2         | CASE                                                     | 2024-11-26 11:22:54.848 -0800 |
|             |           WHEN CURRENT_ROLE() = 'ACCOUNTADMIN'           |                               |
|             |             THEN JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE) |                               |
|             |           ELSE JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE)    |                               |
|             |         END                                              |                               |
+-------------+----------------------------------------------------------+-------------------------------+
```

#### Getting information about tables and views attached to join policies

The [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function returns information about tables
and views attached to existing join policies. You can use two different syntax options:

* Return a row for each object (table or view) that has the specified join policy set:

  ```sqlexample
  USE DATABASE my_db;
  USE SCHEMA INFORMATION_SCHEMA;
  SELECT
      policy_name,
      policy_kind,
      ref_entity_name,
      ref_entity_domain,
      ref_column_name,
      ref_arg_column_names,
      policy_status
    FROM TABLE(INFORMATION_SCHEMA.POLICY_REFERENCES(policy_name => 'my_db.my_schema.jp1'));
  ```

* Return information about the policy that is assigned to `join_table`:

  ```sqlexample
  USE DATABASE my_db;
  USE SCHEMA INFORMATION_SCHEMA;
  SELECT
      policy_name,
      policy_kind,
      ref_entity_name,
      ref_entity_domain,
      ref_column_name,
      ref_arg_column_names,
      policy_status
    FROM TABLE(INFORMATION_SCHEMA.POLICY_REFERENCES(ref_entity_name => 'my_db.my_schema.join_table', ref_entity_domain => 'table'));
  ```

  ```output
  +-------------+-------------+-----------------+-------------------+-----------------+----------------------+---------------+
  | POLICY_NAME | POLICY_KIND | REF_ENTITY_NAME | REF_ENTITY_DOMAIN | REF_COLUMN_NAME | REF_ARG_COLUMN_NAMES | POLICY_STATUS |
  |-------------+-------------+-----------------+-------------------+-----------------+----------------------+---------------|
  | JP1         | JOIN_POLICY | JOIN_TABLE      | TABLE             | NULL            | [ "COL1" ]           | ACTIVE        |
  +-------------+-------------+-----------------+-------------------+-----------------+----------------------+---------------+
  ```

### Best practices for policy administration

Creating a join policy and assigning the policy to a table requires the same general procedure as creating and assigning
other policies, such as masking, projection, and aggregation policies:

1. If you are using a centralized management approach, create a custom role (such as `join_policy_admin`) to manage the policy. Alternatively, you can use an existing role.
2. Grant this role the privileges to create and assign a join policy.
3. Create the join policy.
4. Create or alter a table to assign the policy to the table and to allow joining columns (ALLOWED JOIN KEYS).
5. Test some join and non-join queries on the table.

Successful queries against the table must join its data to another table or view and must join on the allowed columns.

Access control administrator tasks
:   1. Create a custom role to manage the join policy. You could also re-use an existing role.

       ```sqlexample
       USE ROLE USERADMIN;

       CREATE ROLE join_policy_admin;
       ```
    2. Grant the `join_policy_admin` custom role the privileges to create a join policy in a schema and assign the policy
       to a table or view in the Snowflake account.

       This step assumes the join policy will be stored in a database and schema named `privacy.join_policies` and that this database and
       schema already exist:

       ```sqlexample
       GRANT USAGE ON DATABASE privacy TO ROLE join_policy_admin;
       GRANT USAGE ON SCHEMA privacy.join_policies TO ROLE join_policy_admin;

       GRANT CREATE JOIN POLICY
         ON SCHEMA privacy.join_policies TO ROLE join_policy_admin;

       GRANT APPLY JOIN POLICY ON ACCOUNT TO ROLE join_policy_admin;
       ```

       The `join_policy_admin` role can now be assigned to one or more users.

       For information about the privileges needed to work with join policies, refer to Managing join policies
       (in this topic).

Join policy administrator tasks
:   * Create a join policy:

      > ```sqlexample
      > USE ROLE join_policy_admin;
      > USE SCHEMA privacy.join_policies;
      >
      > CREATE OR REPLACE JOIN POLICY jp1
      >   AS () RETURNS JOIN_CONSTRAINT -> JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE);
      > ```

## Interaction of join policies with other Snowflake features

The following sections summarize how join policies interact with other Snowflake features and services.

### Other policies

This section describes how a join policy interacts with other policies, including
[masking policies](security-column-intro.md),
[row access policies](security-row-intro.md), [aggregation policies](aggregation-policies.md),
and [projection policies](projection-policies.md).

You can attach other policies to a join-constrained table. A successful query against the table must meet the requirements of all
policies.

If a row access policy is assigned to a join-constrained table, a row excluded from the query results based on the row
access policy is not included when calculating the results of the join.

The body of a masking policy, row access policy, aggregation policy, or projection policy cannot reference a join-constrained table, including its
columns.

### Views and materialized views

You can assign a join policy to both views and materialized views. When a join policy is applied to a view, the underlying
table does not become join-constrained. This base table can still be queried without restriction.

Whether you can create a view from a join-constrained table depends on the type of view:

> * You can create a regular view from one or more join-constrained tables; however, queries against that view must join data in
>   a way that meets the restrictions of those base tables. You cannot circumvent a join policy on a protected table by creating a view on the table. The policy for the table is respected and enforced for queries against the view. For an example, see A view on a join-constrained table is also protected.
> * You cannot create a materialized view based on a join-constrained table or view, nor can you assign a join policy to a
>   table or view upon which a materialized view is based.

### Cloned objects

The following approach helps to safeguard data from users with the SELECT privilege on a cloned table or view that is stored in the cloned
database or schema:

* Cloning an individual join policy object is not supported.
* Cloning a database results in the cloning of all join policies within the database.
* Cloning a schema results in the cloning of all join policies within the schema.
* A cloned table maps to the same join policies as the source table.

  + When a table is cloned in the context of its parent schema being cloned, if the source table has a reference to a join policy in the
    same parent schema (that is, a local reference), the cloned table will have a reference to the cloned join policy.
  + If the source table refers to a join policy in a different schema (that is, a foreign reference), the cloned table retains the
    foreign reference.

For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

### Replication

Join policies and their assignments can be replicated using database replication and replication groups.

For [database replication](database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
  replication are on lower editions.
* A table or view contained in the primary database has a [dangling reference](database-replication-considerations.md) to a policy in another database.

The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
[replication group](account-replication-intro.md).

## Privileges and commands

The following subsections provide information to help manage join policies.

### Join policy privileges

Snowflake supports the following privileges on the join policy object.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables the set and unset operations for a join policy on a table. |
| OWNERSHIP | Transfers ownership of the join policy, which grants full control over the policy. Required to alter most properties of a join policy. |

For information, see Summary of DDL commands, operations, and privileges.

### Join policy DDL reference

Snowflake supports the following DDL commands to create and manage join policies.

* [CREATE JOIN POLICY](../sql-reference/sql/create-join-policy.md)
* [ALTER JOIN POLICY](../sql-reference/sql/alter-join-policy.md)
* [DESCRIBE JOIN POLICY](../sql-reference/sql/desc-join-policy.md)
* [DROP JOIN POLICY](../sql-reference/sql/drop-join-policy.md)
* [SHOW JOIN POLICIES](../sql-reference/sql/show-join-policies.md)

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between join policy privileges and DDL operations.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege required |
| --- | --- |
| Create join policy. | A role with the CREATE JOIN POLICY privilege in the same schema. |
| Alter join policy. | The role with the OWNERSHIP privilege on the join policy. |
| Describe join policy | One of the following:   * A role with the global APPLY JOIN POLICY privilege. * A role with the OWNERSHIP privilege on the join policy. * A role with the APPLY privilege on the join policy. |
| Drop join policy. | A role with the OWNERSHIP privilege on the join policy. |
| Show join policies. | One of the following:   * A role with the USAGE privilege on the schema in which the join policy exists. * A role with the APPLY JOIN POLICY on the account. |
| Set or unset a join policy on a table. | One of the following:   * A role with the APPLY JOIN POLICY privilege on the account. * A role with the APPLY privilege on the join policy and the OWNERSHIP privilege on the table or view. |

Snowflake supports different permissions to create and set a join policy on an object.

1. For a centralized policy management approach in which the `join_policy_admin` custom role creates and sets
   join policies on all tables, the following permissions are necessary:

   ```sqlexample
   USE ROLE securityadmin;
   GRANT USAGE ON DATABASE mydb TO ROLE join_policy_admin;
   GRANT USAGE ON SCHEMA mydb.schema TO ROLE join_policy_admin;
   GRANT CREATE JOIN POLICY ON SCHEMA mydb.schema TO ROLE join_policy_admin;
   GRANT APPLY ON JOIN POLICY ON ACCOUNT TO ROLE join_policy_admin;
   ```
2. In a hybrid management approach, a single role has the CREATE JOIN POLICY privilege to ensure join policies are named
   consistently and individual teams or roles have the APPLY privilege for a specific join policy.

   For example, the custom role `finance_role` can be granted the permission to set the join policy `cost_center` on tables
   and views the role owns (that is, the role has the OWNERSHIP privilege on the table or view):

   ```sqlexample
   USE ROLE securityadmin;
   GRANT CREATE JOIN POLICY ON SCHEMA mydb.schema TO ROLE join_policy_admin;
   GRANT APPLY ON JOIN POLICY cost_center TO ROLE finance_role;
   ```

---
title: K-FSI (Korean Financial Security Institute) with RSEFT
source: https://docs.snowflake.com/en/user-guide/cert-kfsi-rseft.md
section: User Guide
---

# K-FSI (Korean Financial Security Institute) with RSEFT

This topic describes how Snowflake supports customers with KSFI with RSEFT compliance requirements.

## Understanding K-FSI with RSEFT compliance requirements

The Korean Financial Security Institute (K-FSI) performs the CSP Safety Evaluation in order to evaluate cloud service provider compliance
with the Regulation on Supervision of Electronic Financial Transactions (RSEFT) regulation. They support the financial services industry in
security assessments and assist in various areas that help create a secure environment for financial institutions. Snowflake’s CSP Safety
Evaluation is scoped to SaaS service controls. If customers will store or process unique private information (UPI) or protected credit
information (PCI) on Snowflake or safety/reliability of electronic financial transactions are materially impacted by using Snowflake, then
customers must review the CSP Safety Evaluation results and perform an analysis of vendor risk and submit documentation to the Financial
Supervisory Service prior to utilizing Snowflake for these types of data.

---
title: Key Pair Authentication: Troubleshooting
source: https://docs.snowflake.com/en/user-guide/key-pair-auth-troubleshooting.md
section: User Guide
---

# Key Pair Authentication: Troubleshooting

This topic helps you troubleshoot errors that occur when connecting to Snowflake with
[key pair authentication](key-pair-auth.md). It focuses on errors that contain `JWT token is invalid`.

## Find the Error

Before troubleshooting, you need to determine that the issue is resulting in a `JWT token is invalid` error.

If your Snowflake client is a driver or connector that does not have an interactive interface, use logs to inspect connection errors:

1. Enable logging for the Snowflake connector or driver. For details, see
   [Generate log files for Snowflake Drivers & Connectors](https://community.snowflake.com/s/article/How-to-generate-log-file-on-Snowflake-connectors) (Snowflake Knowledge Base article).
2. Check for errors that contain the string `JWT token is invalid`.

   For example, a client using the Snowflake JDBC driver might get the following error when trying to use key pair authentication:

   > ```output
   > yyyy-mm-dd hh:mm:ss.nnn n.s.c.jdbc.SnowflakeSQLException FINE <init>:40 -
   > Snowflake exception: JWT token is invalid. [0ce9eb56-821d-4ca9-a774-04ae89a0cf5a],
   > sqlState:08001, vendorCode:390,144, queryId:
   > ```

## Retrieve additional details

Each `JWT token is invalid` error includes a UUID that appears in brackets immediately after the error (for example,
`JWT token is invalid. [0ce9eb56-821d-4ca9-a774-04ae89a0cf5a]`). You should provide the UUID of the error to a Snowflake administrator so
they can obtain more information about the error.

Administrators use the [SYSTEM$GET_LOGIN_FAILURE_DETAILS](../sql-reference/functions/system_get_login_failure_details.md) function to obtain additional details about the
error. For example, to obtain additional information about the error `JWT token is invalid. [0ce9eb56-821d-4ca9-a774-04ae89a0cf5a]`,
a user with the ACCOUNTADMIN role can execute:

```sqlexample
SELECT JSON_EXTRACT_PATH_TEXT(SYSTEM$GET_LOGIN_FAILURE_DETAILS('0ce9eb56-821d-4ca9-a774-04ae89a0cf5a'), 'errorCode');
```

The JSON_EXTRACT_PATH_TEXT function parses the JSON output of the SYSTEM$GET_LOGIN_FAILURE_DETAILS function to retrieve the error code and
error.

## List of Errors

The output of the SYSTEM$GET_LOGIN_FAILURE_DETAILS function is one of the following error code/error combinations.

| Error Code | Error | Description |
| --- | --- | --- |
| 394307 | JWT_TOKEN_ACCOUNT_MISMATCH | The Snowflake account obtained from the token is not the same as the account in the request’s URL. |
| 390144 | JWT_TOKEN_INVALID | There is a general issue with the JWT token. For possible solutions, see Common Errors and Solutions. |
| 394300 | JWT_TOKEN_INVALID_USER_IN_ISSUER | The user name specified in the issuer does not exist in the Snowflake account. For possible solutions, see Common Errors and Solutions. |
| 394301 | JWT_TOKEN_MISSING_ISSUE_OR_EXPIRATION_TIME | The JWT token does not contain an issue time or an expiration time. |
| 394302 | JWT_TOKEN_INVALID_ISSUE_TIME | The JWT token was received by Snowflake more than 60 seconds after the issue time. For possible solutions, see Common Errors and Solutions. |
| 394303 | JWT_TOKEN_INVALID_EXPIRATION_TIME | The JWT token is expired. |
| 394304 | JWT_TOKEN_INVALID_PUBLIC_KEY_FINGERPRINT_MISMATCH | There is a mismatch between the public key fingerprint specified in the issuer and the one stored for the user in Snowflake. For possible solutions, see Common Errors and Solutions. |
| 394305 | JWT_TOKEN_INVALID_ALGORITHM | The JWT token was not signed with the RS256 algorithm. |
| 394306 | JWT_TOKEN_INVALID_SIGNATURE | Snowflake could not verify the signature provided by the JWT token. It is possible that the JWT was signed with a private key that is not paired with the provided public key. It is also possible that the JWT signature is corrupt or has been modified. |

## Common Errors and Solutions

The most common errors associated with key pair authentication are:

* JWT_TOKEN_INVALID
* JWT_TOKEN_INVALID_PUBLIC_KEY_FINGERPRINT_MISMATCH
* JWT_TOKEN_INVALID_USER_IN_ISSUER
* JWT_TOKEN_INVALID_ISSUE_TIME

Use the following descriptions and solutions to troubleshoot these errors.

### JWT_TOKEN_INVALID

Description:
:   There is a general problem with the JWT token.

Solution #1:
:   The token itself might be malformed. Double-check that the application accessing Snowflake is generating valid JWT
    tokens.

Solution #2:
:   Double-check that the client (driver, connector, or request URL) is using the correct
    [account identifier](admin-account-identifier.md) to connect to the Snowflake account. You should also check that this value matches the
    account identifier in the `iss` claim.

Solution #3:
:   Double-check that the account identifier and user name in the `sub` claim match the corresponding values in the
    `iss` claim.

Solution #4:
:   Double-check that `iss` claim specifies `SHA256` as the signing algorithm.

### JWT_TOKEN_INVALID_PUBLIC_KEY_FINGERPRINT_MISMATCH

Description:
:   There is a mismatch between the public key fingerprint specified in the issuer and the one stored for the user in
    Snowflake.

Solution:
:   Obtain the public key fingerprint of the JWT token, then compare it with the fingerprint associated with the user in
    Snowflake.

    One method of obtaining the fingerprint of the JWT token is to [enable DEBUG logging for your driver](https://community.snowflake.com/s/article/How-to-generate-log-file-on-Snowflake-connectors) and attempt to login. Look for the pattern `SHA256:<hash>`, where `<hash>` is the
    public key fingerprint.

    To obtain the public key fingerprint associated with the user in Snowflake, execute the [DESCRIBE USER](../sql-reference/sql/desc-user.md) command. The
    public key fingerprint is located in either the RSA_PUBLIC_KEY_FP or the RSA_PUBLIC_KEY_2_FP property. If the user is missing a public key
    fingerprint, execute the ALTER USER command to set these properties.

### JWT_TOKEN_INVALID_USER_IN_ISSUER

Description:
:   The user name specified in the issuer does not exist in the Snowflake account.

Solution:
:   The user name configured in the Snowflake client must match the `LOGIN_NAME` of the Snowflake user, not its
    `NAME`. Sometimes these values are different.

    Execute the [DESCRIBE USER](../sql-reference/sql/desc-user.md) command and verify that the value of the `LOGIN_NAME` property matches the user name
    that the Snowflake client is using to connect.

### JWT_TOKEN_INVALID_ISSUE_TIME

Description:
:   The JWT token was received by Snowflake more than 60 seconds after the issue time.

Solution #1:
:   Check the host where the driver is running to ensure it is synchronized to NTP and doesn’t have clock skew. If the server
    clock is skewed, Snowflake might determine that the current time is more than 60 seconds after the issue time of the token, when it really
    isn’t. For example, if the client machine is 30 seconds behind and it took the token 45 seconds to reach Snowflake, then Snowflake
    determines that it has been 75 seconds since the JWT token was issued, not 45 seconds.

    You can check that the clock of the client machine is accurate by comparing it to an NTP server. For example, you can use
    [NIST Internet Time Servers](https://tf.nist.gov/tf-cgi/servers.cgi) to sync the client machine. For help with checking and
    synchronizing your host’s clock to an NTP server, refer to the administrator guide for your operating system or reach out to a system
    administrator.

Solution #2:
:   Use an OS-specific monitoring tool to determine if the error occurs during times of extreme CPU/disk usage.

Solution #3:
:   Check whether there is excessive network latency that is causing the JWT token to be processed by Snowflake more than
    60 seconds after the token’s issue time.

    When using a connectivity diagnostic tool to measure network latency, set the destination to
    `account_identifier.snowflakecomputing.com`. For example, if the Snowflake client is using `myorg-account1` as the account
    identifier, set the destination to `myorg-account1.snowflakecomputing.com`.

---
title: Key-pair authentication and key-pair rotation
source: https://docs.snowflake.com/en/user-guide/key-pair-auth.md
section: User Guide
---

# Key-pair authentication and key-pair rotation

This topic describes using key pair authentication and key pair rotation in Snowflake.

## Overview

Snowflake supports using key pair authentication for enhanced authentication security as an alternative to basic authentication, such as
username and password.

This authentication method requires, as a minimum, a 2048-bit RSA key pair. You can generate the Privacy Enhanced Mail (PEM)
private-public key pair using OpenSSL. Some of the Supported Snowflake Clients allow using encrypted private keys to connect to
Snowflake. The public key is assigned to the Snowflake user who uses the Snowflake client to connect and authenticate to Snowflake.

Snowflake also supports rotating public keys in an effort to allow compliance with more robust security and governance postures.

## Supported Snowflake clients

The following table summarizes support for key pair authentication among Snowflake Clients. A checkmark (✔) indicates full support.
A missing checkmark indicates key pair authentication is not supported.

| Client | Key Pair Authentication | Key Pair Rotation | Unencrypted Private Keys | Encrypted Private Keys |
| --- | --- | --- | --- | --- |
| [Snowflake CLI](../developer-guide/snowflake-cli/index.md) | ✔ | ✔ | ✔ | ✔ |
| [SnowSQL (CLI client)](snowsql.md) | ✔ | ✔ | ✔ | ✔ |
| [Snowflake Connector for Python](../developer-guide/python-connector/python-connector.md) | ✔ | ✔ | ✔ | ✔ |
| [Snowflake Connector for Spark](spark-connector.md) | ✔ | ✔ | ✔ |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | ✔ | ✔ | ✔ |  |
| [Snowflake Horizon Catalog endpoint](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md) | ✔ | ✔ | ✔ | ✔ |
| [Go driver](https://godoc.org/github.com/snowflakedb/gosnowflake) | ✔ | ✔ | ✔ |  |
| [JDBC Driver](../developer-guide/jdbc/jdbc.md) | ✔ | ✔ | ✔ | ✔ |
| [ODBC Driver](../developer-guide/odbc/odbc.md) | ✔ | ✔ | ✔ | ✔ |
| [Node.js Driver](../developer-guide/node-js/nodejs-driver.md) | ✔ | ✔ | ✔ | ✔ |
| [.NET Driver](../developer-guide/dotnet/dotnet-driver.md) | ✔ | ✔ | ✔ | ✔ |
| [PHP PDO Driver for Snowflake](../developer-guide/php-pdo/php-pdo-driver.md) | ✔ | ✔ | ✔ | ✔ |

## Configuring key-pair authentication

Complete the following steps to configure key pair authentication for all supported Snowflake clients.

### Generate the private keys

Depending on which one of the Supported Snowflake Clients you use to connect to Snowflake, you have the option to generate encrypted or
unencrypted private keys. Generally, it is safer to generate encrypted keys. Snowflake recommends communicating with your internal security
and governance officers to determine which key type to generate prior to completing this step.

Snowflake supports cryptographic keys generated using the following algorithms:

* RSA digital signature algorithms RS256, RS384, and RS512.
* Elliptic Curve Digital Signature Algorithms (ECDSA) algorithms ES256(P-256), ES384 (P-384), and ES512 (P-512).

These signatures use the SHA-256, SHA-384, and SHA-512 hash algorithms, respectively.

> **Tip:**
>
> The command to generate an encrypted key prompts for a passphrase to regulate access to the key. Snowflake recommends using a passphrase
> that complies with PCI DSS standards to protect the locally generated private key. Additionally, Snowflake recommends storing the
> passphrase in a secure location. If you are using an encrypted key to connect to Snowflake, enter the passphrase during the initial
> connection. The passphrase is only used for protecting the private key and will never be sent to Snowflake.
>
> To generate a long and complex passphrase based on PCI DSS standards:
>
> > 1. Access the [PCI Security Standards Document Library](https://www.pcisecuritystandards.org/document_library).
> > 2. For PCI DSS, select the most recent version and your desired language.
> > 3. Complete the form to access the document.
> > 4. Search for `Passwords/passphrases must meet the following:` and follow the recommendations for password/passphrase
> >    requirements, testing, and guidance. Depending on the document version, the phrase is likely located in a section called
> >    `Requirement 8: Identify and authenticate access to system components` or a similar name.

To start, open a terminal window and generate a private key.

You can generate either an encrypted version of the private key or an unencrypted version of the private key.

To generate an unencrypted version, use the following command:

```bash
openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
```

To generate an encrypted version, use the following command, which omits `-nocrypt`:

```bash
openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 des3 -inform PEM -out rsa_key.p8
```

The commands generate a private key in PEM format.

```bash
-----BEGIN ENCRYPTED PRIVATE KEY-----
MIIE6T...
-----END ENCRYPTED PRIVATE KEY-----
```

### Generate a public key

From the command line, generate the public key by referencing the private key. The following command assumes the private key is encrypted
and contained in the file named `rsa_key.p8`.

```bash
openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
```

The command generates the public key in PEM format.

```bash
-----BEGIN PUBLIC KEY-----
MIIBIj...
-----END PUBLIC KEY-----
```

### Store the private and public keys securely

Copy the public and private key files to a local directory for storage. Record the path to the files. Note that the private key is stored
using the PKCS#8 (Public Key Cryptography Standards) format and is encrypted using the passphrase you specified in the previous step.

However, the file should still be protected from unauthorized access using the file permission mechanism provided by your operating system.
It is your responsibility to secure the file when it is not being used.

### Grant the privilege to assign a public key to a Snowflake user

To assign a public key to a user, you must have one of the following
[roles or privileges](security-access-control-overview.md):

* MODIFY PROGRAMMATIC AUTHENTICATION METHODS privilege on the user.
* OWNERSHIP privilege on the user.

You can use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) or [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command to grant the
MODIFY PROGRAMMATIC AUTHENTICATION METHODS or OWNERSHIP privilege on the user to a role.

For example, suppose that you want to users with the `my_service_owner_role` custom role to assign the public key to the service
user `my_service_user`. The following statement grants the MODIFY PROGRAMMATIC AUTHENTICATION METHODS privilege on the
`my_service_user` user to the role `my_service_owner_role`:

```sqlexample
GRANT MODIFY PROGRAMMATIC AUTHENTICATION METHODS ON USER my_service_user
  TO ROLE my_service_owner_role;
```

### Assign the public key to a Snowflake user

To assign the public key to the user, execute an [ALTER USER](../sql-reference/sql/alter-user.md) command to set the RSA_PUBLIC_KEY property
of the user. For example:

```sqlexample
ALTER USER example_user SET RSA_PUBLIC_KEY='MIIBIjANBgkqh...';
```

> **Note:**
>
> * Exclude the public key delimiters in the SQL statement.

### Verify the user’s public key fingerprint

1. Execute the following command to retrieve the user’s public key fingerprint:

   ```sqlexample
   DESC USER example_user
     ->> SELECT SUBSTR(
           (SELECT "value" FROM $1
              WHERE "property" = 'RSA_PUBLIC_KEY_FP'),
           LEN('SHA256:') + 1) AS key;
   ```

   Output:

   ```output
   Azk1Pq...
   ```
2. Copy the output.
3. Run the following command on the command line:

   ```bash
   openssl rsa -pubin -in rsa_key.pub -outform DER | openssl dgst -sha256 -binary | openssl enc -base64
   ```

   Output:

   ```output
   writing RSA key
   Azk1Pq...
   ```
4. Compare both outputs. If both outputs match, the user correctly configured their public key.

### Configure the Snowflake client to use key-pair authentication

Update the client to use key pair authentication to connect to Snowflake.

* [Snowflake CLI](../developer-guide/snowflake-cli/connecting/configure-connections.md)
* [SnowSQL](snowsql-start.md)
* [Python connector](../developer-guide/python-connector/python-connector-connect.md)
* [Spark connector](spark-connector-use.md)
* [Kafka connector](kafka-connector-install.md)
* [Go driver](https://godoc.org/github.com/snowflakedb/gosnowflake)
* [JDBC driver](../developer-guide/jdbc/jdbc-configure.md)
* [ODBC driver](../developer-guide/odbc/odbc-parameters.md)
* [.NET driver](https://github.com/snowflakedb/snowflake-connector-net/blob/master/README.md)
* [Node.js Driver](../developer-guide/node-js/nodejs-driver-authenticate.md)

## Configuring key-pair rotation

Snowflake supports multiple active keys to allow for uninterrupted rotation. Rotate and replace your public and private keys based on the
expiration schedule you follow internally.

Currently, you can use the `RSA_PUBLIC_KEY` and `RSA_PUBLIC_KEY_2` parameters for [ALTER USER](../sql-reference/sql/alter-user.md) to
associate up to 2 public keys with a single user.

Complete the following steps to configure key pair rotation and rotate your keys.

1. Complete all steps in Configuring key-pair authentication with the following updates:

   * Generate a new private and public key set.
   * Assign the public key to the user. Set the public key value to either `RSA_PUBLIC_KEY` or `RSA_PUBLIC_KEY_2`, whichever key
     value is not currently in use. For example:

     ```sqlexample
     ALTER USER example_user SET RSA_PUBLIC_KEY_2='JERUEHtcve...';
     ```
2. Update the code to connect to Snowflake. Specify the new private key.

   Snowflake verifies the correct active public key for authentication based on the private key submitted with your connection information.
3. Remove the old public key from the user profile using an [ALTER USER](../sql-reference/sql/alter-user.md) command.

   ```sqlexample
   ALTER USER example_user UNSET RSA_PUBLIC_KEY;
   ```

---
title: Leaked password protection
source: https://docs.snowflake.com/en/user-guide/leaked-password-protection.md
section: User Guide
---

# Leaked password protection

Leaked password protection is a background service in Snowflake that monitors and disables passwords that have been leaked to help prevent
unauthorized access to Snowflake accounts. The leaked password protection service provides a notification system for administrators so they
are aware of leaked passwords when they are detected in external databases.

This topic provides the following information:

* Discovering leaked passwords
* Email notifications
* Types of users scanned
* Reset a user password
* Reset an admin password

## Discovering leaked passwords

If Snowflake discovers a leaked password, then Snowflake identifies which user the password is associated with,
and then securely verifies whether or not the user can still use the leaked password to authenticate.

If the user can authenticate with the leaked password, Snowflake disables the leaked password by unsetting the password for the user, and
rejecting the use of the leaked password by that user in the future. After disabling the leaked password, the user cannot authenticate using
their password, but the user can use other methods of authentication, such as SSO, if available.

Snowflake then notifies the administrator and user about the leaked password through email. Snowflake also provides a message to the
affected user during login time, telling them they must contact their account administrator to send them a reset link to change their
password. The user’s new password must meet the requirements of the password policies set on the account.

If Snowflake identifies that the leaked password is an administrator password, and no other administrators can reset the password, the
administrator must contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Leaked password protection is enabled by default, and you cannot disable leaked password protection.

> **Note:**
>
> Snowflake only processes passwords in-memory, and does not store passwords in cleartext. Snowflake employees cannot view passwords.

## Email notifications

When Snowflake discovers a leaked password, Snowflake attempts to notify email addresses in the following order:

1. If one or more [verified email addresses](notifications/email-notifications.md) for account-level security notifications is
   found, then Snowflake notifies the verified email addresses.
2. If one or more verified email addresses for account-level security notifications are not found, then Snowflake attempts to look for and
   notify email addresses at the organization level.
3. If email addresses at the organization level are not found, Snowflake contacts users with admin roles.

For more information about notification contacts for Snowflake, see [Set up and manage notification contacts for Snowflake](ui-snowsight-contacts.md).

## Types of users scanned

All [user types](admin-user-management.md) are scanned, and Snowflake can disable any user’s password if the user has a password.

## Reset a user password

If Snowflake blocks a user from authenticating with a password, then the user must ask their administrator to
[reset their password](password-authentication.md).

## Reset an admin password

If Snowflake blocks a user from authenticating with a password, then the user must ask their administrator to reset their password.

If no other administrator is available to reset the password of another administrator, then the administrator must contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Limitations and considerations for Snowpipe Streaming with classic architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-limitations.md
section: User Guide
---

# Limitations and considerations for Snowpipe Streaming with classic architecture

For Snowpipe Streaming classic, be aware of the following limitations:

* Snowpipe Streaming classic doesn’t support the increased maximum size limits for database objects – 128 MB for VARCHAR, VARIANT, ARRAY, and OBJECT, and 64 MB for BINARY, GEOGRAPHY, and GEOMETRY — that are part of the 2025_03 behavior change bundle.
* Fail-safe doesn’t support tables that contain data ingested by Snowpipe Streaming classic. For such tables, you can’t use fail-safe for recovery because fail-safe operations on these tables fail completely.
* Snowpipe Streaming only supports using 256-bit AES keys for data encryption.
* If [Automatic Clustering](../tables-auto-reclustering.md) is also enabled on the same table that Snowpipe Streaming is inserting into, compute costs for file migration might be reduced. For more information, see [Best practices for Snowpipe Streaming with classic architecture](snowpipe-streaming-classic-recommendation.md).
* The following objects or types aren’t supported or have limitations:

  + GEOGRAPHY and GEOMETRY data types
  + Columns with collations
  + TEMPORARY tables
  + Transient Iceberg tables that use [Snowflake storage](../tables-iceberg-internal-storage.md)
  + Structured data types (like OBJECT, MAP, ARRAY) are only supported for ingestion to iceberg tables.
* The total number of channels per table can’t exceed 10,000. We recommend reusing channels when you need them. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need to open more than 10,000 channels per table.

---
title: Limitations and considerations for Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-limitations.md
section: User Guide
---

# Limitations and considerations for Snowpipe Streaming with high-performance architecture

This document outlines the known limitations and key considerations for Snowpipe Streaming with high-performance architecture.

## General and service-level limitations

* The service is available in all Amazon Web Services (AWS), Microsoft Azure, and Google Cloud regions except for government-specific regions and regions in China.

## Table limits

* Maximum throughput: A table can achieve an aggregate throughput of 10 GBps uncompressed.

## Pipe limits

* Channels per pipe: By default, a single pipe can have up to 2,000 active channels. Contact Snowflake Support if you require more channels for your use case.
* Pipes for Snowpipe Streaming: The maximum number of PIPE objects configured for Snowpipe Streaming is limited to 1,000 per account and 10 per table. If you require more pipes, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Channel limits

Each channel has the following soft limits. If your application requires higher throughput per channel, contact Snowflake Support to discuss increasing these limits.

* SDK throughput: 12 MBps (uncompressed)
* REST endpoint throughput: 1 MBps (observed size)
* REST payload limit: 4 MB per request (observed size). To ingest more data per request, use compression (Gzip or ZSTD). This lets you fit a larger uncompressed data volume into the 4 MB limit.
* Request rate: 10 requests per second (RPS).

## Ingestion and data-specific limitations

* The ON_ERROR option in Snowpipe Streaming with high-performance architecture only supports CONTINUE. To capture
  failed rows for debugging and recovery, turn on error logging on your target table. For more information, see [Error logging in Snowpipe Streaming with high-performance architecture](snowpipe-streaming-error-tables.md).
* Sudden spikes in data throughput might experience brief increases in end-to-end latency because the service is elastically scaling to support the new throughput level.
* Partitioned Iceberg tables aren’t supported. Non-partitioned Snowflake-managed Iceberg tables are supported. For more information, see [Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables](snowpipe-streaming-high-performance-iceberg.md).
* MATCH_BY_COLUMN_NAME is not supported with default, auto-increment, or identity columns:

  The MATCH_BY_COLUMN_NAME option isn’t supported when you load data into tables that contain columns that are defined with the DEFAULT, AUTOINCREMENT, or IDENTITY properties. When you use this option, the streaming ingestion process explicitly inserts NULL values for these columns, overriding the intended default value or the auto-generation mechanism.

  Workaround: To use these column properties, you must omit MATCH_BY_COLUMN_NAME. Instead, you define the pipe by using a COPY INTO statement that explicitly lists only the columns for which the source data provides values. The columns with the auto-generation properties must be omitted from the target column list to ensure that the table engine applies the defined value generation logic.

## SDK and architectural limitations

* Supported architectures (Rust Core): ARM64 Mac, Windows, ARM64-Linux, and x86_64-Linux.
* Linux requirements: If you use the SDK on Linux, your system must have glibc version 2.26 or later.
* Timezone: The SDK automatically uses UTC, and this setting can’t be changed by the user.
* Authentication: RSA Key-Pair authentication is required. OAuth and Personal Access Tokens (PATs) aren’t supported.
* Snowpark Container Services (SPCS) isn’t supported.

---
title: Limitations and unsupported features for hybrid tables
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-limitations.md
section: User Guide
---

# Limitations and unsupported features for hybrid tables

The following guidance on limitations and
unsupported features applies to hybrid tables, and is subject
to change.

Be sure to read both sections.

> **Note:**
>
> Reach out to your account team if you have questions.

## Limitations

* Clouds and regions
* Collations
* Consistency
* Constraints
* COPY
* Data size
* Data types not supported in indexes
* DML commands
* Higher-order functions
* Native applications
* Optimized bulk loading
* Persisted query results
* Quotas and throttling
* Secondary indexes
* Throughput
* Time Travel and cloning
* Transactions
* Transient schemas and databases
* Tri-Secret Secure

Clouds and regions
:   Hybrid tables are generally available in all commercial Amazon Web Services (AWS) and
    Microsoft Azure [regions](intro-regions.md).

    Note the following restrictions:

    * Hybrid tables are not available in Google Cloud.
    * Hybrid tables are not available in [U.S. SnowGov Regions](intro-regions.md).
    * Hybrid tables are not supported in trial accounts.
    * If you are a Virtual Private Snowflake (VPS) customer, contact
      [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to inquire about enabling hybrid tables for your account.

Collations
:   Hybrid tables support collations only on character columns that are not indexed. PRIMARY KEY columns and other
    indexed columns don’t accept the COLLATE clause. If the [DEFAULT_DDL_COLLATION](../sql-reference/parameters.md) parameter is set for
    hybrid tables in an account, database, or schema, the parameter is ignored for indexed columns.

    For more information, see [Collations on hybrid table columns](../sql-reference/sql/create-hybrid-table.md) and [Collation control](../sql-reference/collation.md).

Consistency
:   By default, hybrid tables use a session-based consistency model where read operations in the session return
    the latest data from write operations in the same session. There might be some staleness (less than 100ms) for
    changes made outside of the session. To avoid staleness,
    set `READ_LATEST_WRITES = true` at the statement or session level. Note that this
    might incur some latency overhead of a few milliseconds.

Constraints
:   PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints are enforced for hybrid tables, but some limitations apply.
    For information, see [Constraints for hybrid tables](../sql-reference/sql/create-hybrid-table.md).

COPY
:   When you load a hybrid table with the COPY INTO command, `ABORT_STATEMENT` is the only option that is
    supported for `ON_ERROR`. Setting `ON_ERROR=SKIP_FILE` returns an error. For
    more information, see [Loading data](tables-hybrid-create.md).

Data size
:   You are limited to storing 2 TB of data in hybrid tables per Snowflake database.
    See Quotas and throttling for more information.

Data types not supported in indexes
:   Columns with [geospatial data types](../sql-reference/data-types-geospatial.md)
    (GEOGRAPHY and GEOMETRY), [semi-structured data types](../sql-reference/data-types-semistructured.md)
    (ARRAY, OBJECT, VARIANT), and [vector data types](../sql-reference/data-types-vector.md) (VECTOR) are not supported as either
    PRIMARY KEY columns (which are automatically indexed) or explicitly indexed columns. (Hybrid table columns support these
    data types as long as the columns are not indexed.)

    The [UUID](../sql-reference/data-types-uuid.md) data type isn’t supported for any column in a hybrid table.

    The [TIMESTAMP_TZ](../sql-reference/data-types-datetime.md) data type (or a [TIMESTAMP](../sql-reference/data-types-datetime.md) data
    type that resolves to TIMESTAMP_TZ) is not supported for columns that are indexed using UNIQUE, PRIMARY KEY, and FOREIGN KEY
    constraints. However, TIMESTAMP_TZ is supported for secondary indexes.

    See also Secondary indexes.

DML commands
:   When using DML commands to change a small number of rows, optimize performance
    by using INSERT, UPDATE, or DELETE statements instead of MERGE.

Higher-order functions
:   The [FILTER](../sql-reference/functions/filter.md), [REDUCE](../sql-reference/functions/reduce.md), and
    [TRANSFORM](../sql-reference/functions/transform.md) higher-order functions are not supported in queries
    against hybrid tables.

Native applications
:   You can include hybrid tables in a Snowflake Native App. However, hybrid tables
    cannot be shared from the provider to the consumer. Native Apps can create
    hybrid tables in the consumer account, and they can read from and write to
    those hybrid tables. You can also expose hybrid tables to application roles
    so that they can be queried directly by consumer users.

    You cannot create a hybrid table in a provider account, nor can you include
    that hybrid table in a view that is shared through the Native App.

Optimized bulk loading
:   When a hybrid table is empty, CTAS, COPY, and INSERT INTO … SELECT all use optimized
    bulk loading. When hybrid tables are not empty, optimized bulk loading is not used. For more
    information, see [Loading data](tables-hybrid-create.md).

Persisted query results
:   Queries against hybrid tables do not use the results cache, as defined with the
    [USE_CACHED_RESULT parameter](../sql-reference/parameters.md). See [Using Persisted Query Results](querying-persisted-results.md).

Quotas and throttling
:   Your usage of hybrid tables is restricted by quotas in order to ensure equitable availability of
    shared resources, ensure consistent quality of service, and reduce spikes in usage.

    | Quota | Default | Notes |
    | --- | --- | --- |
    | Hybrid storage | 2 TB per Snowflake database | This quota controls how much data you can store in hybrid tables. This limit applies only to active hybrid table data in the row store; it does not apply to object storage. If you exceed the storage quota, write operations that add data to any hybrid tables are temporarily blocked until you bring your hybrid storage consumption back under quota by removing tables or data.  You can reclaim space in a matter of seconds by [dropping](../sql-reference/sql/drop-table.md) or [truncating](../sql-reference/sql/truncate-table.md) unneeded hybrid tables. However, when you [delete](../sql-reference/sql/delete.md) data from tables, it takes some number of hours to recover space (because background compaction is required). |
    | Hybrid table requests | Approximately 16,000 operations per second, per Snowflake database | This quota controls the rate at which you can read from and write to hybrid tables. You should be able to achieve up to 16,000 operations per second against hybrid tables for a balanced workload consisting of 80% point reads and 20% point writes. To monitor throttling, see the example in [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md). |
    | Databases that contain hybrid tables | 200 total per Snowflake account, and no more than 100 databases added within a one-hour window | This quota controls how many databases within your Snowflake account may contain hybrid tables. If you exceed this quota, you will be unable to create a hybrid table in a new database without dropping all hybrid tables from an existing database. If necessary, you can request help from [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to increase the quota. |

    Throttling can be caused by a combination of factors that result in too many read and write requests being sent to the hybrid
    table storage provider:

    * Too many read requests can occur because of poorly optimized queries or because of a large, aggressive workload with very
      high query concurrency.
    * Too many write requests can occur because the bulk-load path wasn’t chosen when a table was loaded or because the workload
      consists of too many concurrent write operations.

    If you receive an error or throttling occurs because of a quota limit, contact your system administrator or DBA to look into the
    overall Unistore workload; possibly it can be modified to avoid exceeding the quota. DBAs can contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to
    evaluate query performance and quota usage. For some workloads, you might need to initiate a quota increase by requesting help from
    the support team.

Secondary indexes
:   The following secondary index features are *not* supported:

    > * Adding a column to an existing index.
    > * Altering an index on an existing hybrid table.

    Changes can be applied by dropping and re-creating the index.

    To use a secondary index on a hybrid table, you must use a role that is granted the SELECT privilege on the table.
    If you only have access to objects other than the hybrid table itself, you will not be able to use secondary indexes.

    TIMESTAMP_NTZ is a supported column type for secondary indexes; however, TIMESTAMP_TZ is *not* supported.
    [DATETIME](../sql-reference/data-types-datetime.md) is an alias for TIMESTAMP_NTZ and is therefore supported.
    [TIMESTAMP](../sql-reference/data-types-datetime.md) is supported when configured as an alias for TIMESTAMP_NTZ.

    For more information about secondary indexes, see [Add secondary indexes](tables-hybrid-index.md).

Throughput
:   You can execute up to approximately 16,000 operations per second against
    hybrid tables in each database in your account for a balanced 80%/20% read/write workload. If
    you exceed this limit, Snowflake might reduce your throughput.
    See Quotas and throttling for more information.

Time Travel and cloning
:   [Time Travel](data-time-travel.md) queries that select from hybrid tables are supported
    with the following limitations:

    * Only the TIMESTAMP parameter is supported in the AT clause.

      + The value of the TIMESTAMP parameter must be the same for all tables that belong to the same database.
      + If the tables belong to different databases, you can use different TIMESTAMP values.
    * The OFFSET, STATEMENT, and STREAM parameters are not supported.
    * The BEFORE clause is not supported.
    * The UNDROP TABLE command, which depends on Time Travel, is not supported.

    For information about cloning support for hybrid tables, see [Clone databases that contain hybrid tables](tables-hybrid-clone.md).

Transactions
:   For hybrid tables, the transaction scope is the database in which the hybrid table resides. All the hybrid tables
    referenced in a transaction must reside in the same database; standard Snowflake tables referenced in the same
    transaction may reside in different databases.

Transient schemas and databases
:   You cannot create hybrid tables that are [temporary or transient](tables-temp-transient.md).
    In turn, you cannot create hybrid tables within transient schemas or databases.

Tri-Secret Secure
:   You can use hybrid tables in a TSS-enabled account by enabling Dedicated Storage Mode. For information,
    see [Hybrid Tables Dedicated Storage Mode for TSS](tables-hybrid-dedicated-storage-mode.md).

## Unsupported features

At this time, hybrid tables do not support:

* [Clustering keys](tables-clustering-keys.md)

  Data in hybrid tables is ordered by the primary key.
* [Data sharing](../guides-overview-sharing.md)
* [Dynamic tables](dynamic-tables-about.md)
* [Fail-safe](data-failsafe.md)
* [Materialized views](views-materialized.md)
* [Query Acceleration Service](query-acceleration-service.md)
* [Replication](account-replication-intro.md)
* [Search Optimization Service](search-optimization-service.md)
* [Snowpipe](data-load-snowpipe-intro.md)
* [Snowpipe Streaming API](snowpipe-streaming/data-load-snowpipe-streaming-overview.md)
* [Streams](streams-intro.md)
* [UNDROP](../sql-reference/sql/undrop-table.md)

  + [UNDROP SCHEMA](../sql-reference/sql/undrop-schema.md) and
    [UNDROP DATABASE](../sql-reference/sql/undrop-database.md) commands succeed for entities that
    contain hybrid tables, but those hybrid tables and their associated
    constraints and indexes cannot be restored.
  + The DELETED column in [TABLES view](../sql-reference/account-usage/tables.md) displays
    the time of deletion as the UNDROP time of the parent entity.
  + The [ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md) contains an entry for DROP/UNDROP of the parent
    entity, but no entries for hybrid tables.

---
title: Limitations, requirements, and considerations for dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-limitations.md
section: User Guide
---

# Limitations, requirements, and considerations for dbt Projects on Snowflake

Before you use dbt Projects on Snowflake, review the requirements, considerations, and limitations:

* Limitations, requirements, and considerations for stored procedures
* [Limitations, requirements, and considerations for using workspaces with dbt projects](dbt-projects-on-snowflake-using-workspaces.md)

  + [Personal database requirement](dbt-projects-on-snowflake-using-workspaces.md)
  + [Git repositories](dbt-projects-on-snowflake-using-workspaces.md)
* [Limitations, requirements, and considerations for dbt dependencies](dbt-projects-on-snowflake-dependencies.md)
* Limitations, requirements, and considerations for telemetry, logging, and tracing
* [Replication and dbt projects](../account-replication-considerations.md)
* Limitations for the query history DAG

## Limitations, requirements, and considerations for dbt project configurations

The following requirements, considerations, and limitations apply to dbt project configurations that are supported by dbt Projects on Snowflake:

> * Only dbt Core projects are supported. dbt Cloud projects aren’t supported. When you migrate an existing dbt Core project to Snowflake, it
>   must be compatible with [supported Snowflake versions](dbt-projects-on-snowflake-versions.md).
> * Each dbt project folder in your Snowflake workspace must contain a `profiles.yml` file that specifies a target `warehouse`,
>   `database`, `schema`, and `role` in Snowflake for the project. The `type` must be set to `snowflake`. dbt
>   requires an `account` and `user`, but these can be left with an empty or arbitrary string because the dbt project runs in
>   Snowflake under the current account and user context.
> * A dbt project in a workspace can’t have more than 20,000 files in its folder structure. This limit includes all files in the dbt project
>   directory and subdirectories, including the `target/dbt_packages/logs` directories, which is where log files are saved when a dbt
>   project runs from within the workspace.
> * Environment variables (for example, `{{ env_var ('MY_ENV_VAR') }}`) aren’t supported when running a dbt project object. As an
>   alternative, use project variables (for example, `--vars`). For more information, see [Project variables](https://docs.getdbt.com/docs/build/project-variables).
> * Serverless tasks can’t be used to run dbt projects. When you create a task that executes the EXECUTE DBT PROJECT command, you must
>   specify a user-managed warehouse.
> * Running multiple EXECUTE DBT PROJECT commands concurrently against the same dbt project object isn’t supported, even when using model
>   selectors (for example, `EXECUTE DBT PROJECT foo ARGS='--select model1'` and `EXECUTE DBT PROJECT foo ARGS='--select model2'`).
>   Doing so can result in unexpected internal error messages. Run only one EXECUTE DBT PROJECT command against a given dbt project object at
>   a time. If you need to run multiple commands in parallel, create separate dbt project objects for each concurrent command.
>
>   Using a threading configuration within dbt (for example, `threads: 8`) is supported and encouraged. The concurrency limitation
>   applies only to running multiple EXECUTE DBT PROJECT calls at the same time on the same dbt project object.

## Limitations, requirements, and considerations for stored procedures

When you use a stored procedure to call EXECUTE DBT PROJECT, use a caller’s rights stored procedure. For more information, see
[CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) and [Creating a stored procedure](../../developer-guide/stored-procedure/stored-procedures-creating.md).

## Limitations, requirements, and considerations for telemetry, logging, and tracing

The following requirements, considerations, and limitations apply to telemetry, logging, and tracing for dbt on Snowflake:

> * Workspaces for dbt Projects on Snowflake don’t stream stdout dynamically, and stdout is only viewable upon command completion.
> * Viewing logs and tracing requires that you set the LOG_LEVEL and TRACE_LEVEL on the dbt project object. For more information, see [Access control for dbt projects on Snowflake](dbt-projects-on-snowflake-access-control.md) and [Monitor dbt Projects on Snowflake](dbt-projects-on-snowflake-monitoring-observability.md).
> * By default, Snowflake collects telemetry in the default SNOWFLAKE.TELEMETRY.EVENTS table. If you have a custom event table that is set as the event table for your account, telemetry data is collected there. If you use an Enterprise Edition account, you can create an event table to collect telemetry data and associate it with the database where the dbt project object is deployed. For more information, see [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md).

## Limitations for the query history DAG

The query history DAG requires both `manifest.json` and `run_results.json` artifacts to render the visualization. If a dbt
project object execution fails before `run_results.json` is generated, the DAG tab in Query Details shows “No data available”
instead.

Common causes of fast-failing executions that prevent `run_results.json` from being generated include:

> * Insufficient privileges to execute the dbt project object.
> * Invalid project configuration (for example, a missing or malformed `dbt_project.yml` file).
> * Missing dependencies that haven’t been installed with `dbt deps`.

To resolve this, check the Output tab in the run details pane for error messages, fix the underlying issue, and re-run
the dbt project object. For more information about monitoring dbt project object executions, see
[View the query history DAG](dbt-projects-on-snowflake-monitoring-observability.md).

---
title: Limiting concurrently running queries
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-max-concurrency.md
section: User Guide
---

# Limiting concurrently running queries

This topic discusses how a warehouse owner or administrator can reduce the number of queries that are running concurrently on a warehouse
in order to improve the performance of those queries.

Queries running concurrently in a warehouse must share the warehouse’s resources, meaning each query might be granted fewer
resources. You can use the [MAX_CONCURRENCY_LEVEL](../sql-reference/parameters.md) parameter to limit the number of concurrent queries
running in a warehouse. Because fewer queries are competing for the warehouse’s resources, a query can potentially be given more resources.

Lowering the concurrency level may boost performance for individual queries, especially large, complex, or multi-statement queries, but
these adjustments should be thoroughly tested to ensure they have the desired effect.

Be aware that lowering the MAX_CONCURRENCY_LEVEL for a warehouse can lead to more queries being placed in a queue, which has a performance
implication for those queries. Other strategies such as using a dedicated warehouse or using the
[Query Acceleration Service](performance-query-warehouse-qas.md) can boost the performance of a large or complex query without impacting
the rest of the workload.

For more information, see [MAX_CONCURRENCY_LEVEL](../sql-reference/parameters.md).

> **Note:**
> > Adjusting the [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) parameter can cancel queries rather than let them remain in the queue for an extended period of time.

## How to lower MAX_CONCURRENCY_LEVEL

The default maximum concurrency level is 8. To lower the level, use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command to specify a
lower number. For example:

> ```sqlexample
> ALTER WAREHOUSE my_wh SET MAX_CONCURRENCY_LEVEL = 4;
> ```

---
title: Limits on Query Text Size
source: https://docs.snowflake.com/en/user-guide/query-size-limits.md
section: User Guide
---

# Limits on Query Text Size

Snowflake recommends you limit the size of query text (i.e. SQL statements) submitted through Snowflake clients
to 1 MB per statement. Larger queries process normally, but you could not rerun or retry the larger queries, as
Snowflake truncates queries larger than 1MB per statement before persisting them to the metadata store.

This limit includes any literals, such as string literals or binary literals, that are part of the statement,
whether as part of a WHERE clause, SET clause (in an UPDATE statement), etc.

This limit also applies when binding values in client applications that use Snowflake connectors and drivers, such
as the JDBC Driver.

If multiple SQL statements are combined into a single string (separated by semicolons), the length limit applies to
the entire string, not to individual statements within the string.

Similarly, if data is batched, for example by using the JDBC `PreparedStatement.addBatch()` method, the entire batch
must fit within the limit.

> **Note:**
>
> Snowflake compresses data when sending it between client and server. The limit applies to the size
> after compression. However, since the compression ratio for data varies widely, it is safest to keep the
> uncompressed size within the limit.

To load data that exceeds the limit, load from data files as described in [Load data into Snowflake](../guides-overview-loading-data.md).

---
title: Load and query sample data using SQL
source: https://docs.snowflake.com/en/user-guide/tutorials/tasty-bytes-sql-load.md
section: User Guide
---

Snowflake

Getting Started

# Load and query sample data using SQL

## Introduction

This tutorial uses a fictitious food truck brand named Tasty Bytes to show you how to load
and query data in Snowflake using SQL. You can access a pre-loaded
[Snowsight template](../ui-snowsight/snowsight-templates.md) worksheet
to complete these tasks.

The following illustration provides an overview of Tasty Bytes.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you will learn

In this tutorial you will learn how to:

* Use a role to get access to functionality from granted privileges.
* Use a warehouse to access resources.
* Create a database and schema.
* Create a table.
* Load data into the table.
* Query the data in the table.

## Prerequisites

This tutorial assumes the following:

* You have a [supported browser](../ui-snowsight-gs.md).
* You have access to a Snowflake account and can log in as a user.

  If you don’t have an account, you can sign up for a [free trial](https://signup.snowflake.com/)
  and choose any [Snowflake Cloud Region](../intro-regions.md).

## Step 1. Sign in using Snowsight

To access Snowsight over the public Internet, do the following:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](../admin-account-identifier.md) or account URL.
   If you’ve previously signed in to Snowsight, you might see an account name that you can select.
3. Sign in using your Snowflake account credentials.

## Step 2. Open the SQL worksheet for loading and querying sample data

You can use worksheets to write and run SQL commands on your Snowflake database. You can access a
pre-loaded template worksheet for this tutorial. The worksheet has the SQL commands that
you will run to use a database, load data into it, and query the data. For more information
about worksheets, see [Getting started with worksheets](../ui-snowsight-worksheets-gs.md).

To open the pre-loaded template worksheet, follow these steps:

1. In the navigation menu, select Projects » Templates.
2. Find and open Load sample data from Amazon AWS S3 with SQL.

   The beginning of your worksheet looks similar to the following image:

## Step 3. Set the role and warehouse to use

The role you use determines the privileges you have. In this tutorial, use the
SNOWFLAKE_LEARNING_ROLE role so that you can view and manage objects in your account.
For more information, see [Snowsight templates](../ui-snowsight/snowsight-templates.md).

A warehouse provides the required resources to create and manage objects and run
SQL commands. These resources include CPU, memory, and temporary storage. You have
access to the `SNOWFLAKE_LEARNING_WH` virtual warehouse that you can use for this
tutorial. For more information, see [Virtual warehouses](../warehouses.md).

To set the role and warehouse to use, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line.

   ```sqlexample
   USE ROLE SNOWFLAKE_LEARNING_ROLE;
   ```
2. At the top of the worksheet, select Run.

   > **Note:**
   >
   > In this tutorial, run SQL statements one at a time. Don’t select Run All.
3. Place your cursor in the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) line, then select Run.

   ```sqlexample
   USE WAREHOUSE SNOWFLAKE_LEARNING_WH;
   ```

## Step 4. Use a database, schema, and table

A database stores data in tables that you can manage and query. A schema is a logical
grouping of database objects, such as tables and views. For example, a schema might
contain the database objects required for a specific application. For more information,
see [Databases, Tables and Views - Overview](../../guides-overview-db.md).

In this tutorial, you use the database `SNOWFLAKE_LEARNING_DB`, a
schema that concatenates your username with `_LOAD_SAMPLE_DATA_FROM_S3`, and a table
that you create named `menu`.

To use this database, schema, and table, do the following:

1. In the open worksheet, place your cursor in the [USE DATABASE](../../sql-reference/sql/use-database.md) line,
   then select Run.

   ```sqlexample
   USE DATABASE SNOWFLAKE_LEARNING_DB;
   ```
2. Place your cursor in the SET line, then select Run.

   ```sqlexample
   SET schema_name = CONCAT(current_user(), '_LOAD_SAMPLE_DATA_FROM_S3');
   ```
3. Place your cursor in the USE SCHEMA IDENTIFIER line, then select Run.

   ```sqlexample
   USE SCHEMA IDENTIFIER($schema_name);
   ```
4. Place your cursor in the [CREATE TABLE](../../sql-reference/sql/create-table.md) lines, then select Run.

   ```sqlexample
   CREATE OR REPLACE TABLE MENU
   (
      menu_id NUMBER(19,0),
      menu_type_id NUMBER(38,0),
      menu_type VARCHAR(16777216),
      truck_brand_name VARCHAR(16777216),
      menu_item_id NUMBER(38,0),
      menu_item_name VARCHAR(16777216),
      item_category VARCHAR(16777216),
      item_subcategory VARCHAR(16777216),
      cost_of_goods_usd NUMBER(38,4),
      sale_price_usd NUMBER(38,4),
      menu_item_health_metrics_obj VARIANT
   );
   ```
5. To confirm that the table was created successfully, place your cursor in the
   [SELECT](../../sql-reference/sql/select.md) line, then select Run.

   ```sqlexample
   SELECT * FROM menu;
   ```

   Your output shows the columns of the table you created. At this point in the tutorial, the
   table doesn’t have any rows.

## Step 5. Create a stage and load the data

A stage is a location that holds data files to load into a Snowflake database. This tutorial creates
a stage that loads data from an Amazon S3 bucket. This tutorial uses an existing bucket with
a CSV file that contains the data. You load the data from this CSV file into the table you created
previously. For more information, see [Bulk loading from Amazon S3](../data-load-s3.md).

To create a stage, do the following:

1. In the open worksheet, place your cursor in the [CREATE STAGE](../../sql-reference/sql/create-stage.md) lines,
   then select Run.

   ```sqlexample
   CREATE OR REPLACE STAGE blob_stage
   url = 's3://sfquickstarts/tastybytes/'
   file_format = (type = csv);
   ```
2. To confirm that the stage was created successfully, place your cursor in the
   [LIST](../../sql-reference/sql/list.md) line, then select Run.

   ```sqlexample
   LIST @blob_stage/raw_pos/menu/;
   ```

   Your output looks similar to the following image:
3. To load the data into the table, place your cursor in the [COPY INTO](../../sql-reference/sql/copy-into-table.md)
   lines, then select Run.

   ```sqlexample
   COPY INTO menu
   FROM @blob_stage/raw_pos/menu/;
   ```

## Step 6. Query the data

Now that the data is loaded, you can run queries on the `menu` table.

To run a query in the open worksheet, select the line or lines of the
[SELECT](../../sql-reference/sql/select.md) command, and then select Run.

For example, to return the number of rows in the table, run the following query:

```sqlexample
SELECT COUNT(*) AS row_count FROM menu;
```

Your output looks similar to the following image:

Run this query to return the top ten rows in the table:

```sqlexample
SELECT TOP 10 * FROM menu;
```

Your output looks similar to the following image:

For more information about running a query that returns the specified number of rows,
see [TOP <n>](../../sql-reference/constructs/top_n.md).

You can run other queries in the worksheet to explore the data in the `menu` table.

## Step 7. Clean up, summary, and additional resources

Congratulations! You have successfully completed this tutorial for trial accounts.

Take a few minutes to review a short summary and the key points covered in this tutorial.
Consider cleaning up by dropping any objects you created in this tutorial. Learn more by reviewing
other topics in the Snowflake Documentation.

If the objects you created in this tutorial are no longer needed,
you can remove them from the system with [DROP <object>](../../sql-reference/sql/drop.md) commands.
To remove the table you created, run the following command:

```sqlexample
DROP TABLE IF EXISTS menu;
```

### Summary and key points

In summary, you used a pre-loaded template worksheet in Snowsight to complete the following steps:

1. Set the role and warehouse context.
2. Use a database, schema, and table.
3. Create a stage and load the data from the stage into the database.
4. Query the data.

Here are some key points to remember about loading and querying data:

* You need the required permissions to create and manage objects in your account. In this tutorial,
  you use the SNOWFLAKE_LEARNING_ROLE role, which is provided with the template environment.
* You need a warehouse for the resources required to create and manage objects and run SQL commands.
  This tutorial uses the `SNOWFLAKE_LEARNING_WH` warehouse included with the template environment.
* You used a database to store the data and a schema to group the database objects logically.
* You created a stage to load data from a CSV file.
* After the data was loaded into your database, you queried it using SELECT statements.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, and the SQL commands used to perform queries
  and insert/update data:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [Query syntax](../../sql-reference/constructs.md)
  + [Data Manipulation Language (DML) commands](../../sql-reference/sql-dml.md)
* Try the Tasty Bytes Quickstarts provided by Snowflake:

  + [Tasty Bytes Quickstarts](https://www.snowflake.com/en/developers/guides/?searchTerm=tasty+bytes)

---
title: Load data from cloud storage: Amazon S3
source: https://docs.snowflake.com/en/user-guide/tutorials/load-from-cloud-tutorial.md
section: User Guide
---

Snowflake

Getting Started

# Load data from cloud storage: Amazon S3

## Introduction

This tutorial shows you how to load data from an Amazon S3 bucket into Snowflake
using SQL. You can access a pre-loaded [Snowsight template](../ui-snowsight/snowsight-templates.md)
worksheet to complete these tasks.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you will learn

In this tutorial you will learn how to:

* Use a role that has the privileges to create and use the Snowflake objects required by this tutorial.
* Use a warehouse to access resources.
* Select a database and schema to use for the session.
* Create a table.
* Create a storage integration for your cloud platform.
* Create a stage for your storage integration.
* Load data into the table from the stage.
* Query the data in the table.

## Prerequisites

This tutorial assumes the following:

* You have a [supported browser](../ui-snowsight-gs.md).
* You have access to a Snowflake account and can log in as a user who has been granted the
  ACCOUNTADMIN system role. For more information, see
  [system-defined roles](../security-access-control-overview.md).

  If you don’t have an account, you can sign up for a [free trial](https://signup.snowflake.com/)
  and choose any [Snowflake Cloud Region](../intro-regions.md).
* You have an AWS account that you can use to bulk load data from Amazon S3. See [Bulk loading from Amazon S3](../data-load-s3.md).

## Step 1. Sign in using Snowsight

To access Snowsight over the public Internet, do the following:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](../admin-account-identifier.md) or account URL.
   If you’ve previously signed in to Snowsight, you might see an account name that you can select.
3. Sign in using your Snowflake account credentials.

## Step 2. Open the SQL worksheet for loading data from Amazon S3

You can use worksheets to write and run SQL commands on your database. You can access a
pre-loaded template worksheet for this tutorial. The worksheet has the SQL
commands that you will run to create database objects, load data, and query the
data. Because it is a template worksheet, you will be invited to enter your own values
for certain SQL parameters. For more information about worksheets,
see [Getting started with worksheets](../ui-snowsight-worksheets-gs.md).

To open the pre-loaded template worksheet, follow these steps:

1. In the navigation menu, select Projects » Templates.
2. Find and open Load data from Amazon AWS.

   The beginning of your worksheet looks similar to the following image:

## Step 3. Set the role and warehouse to use

The role you use determines the privileges you have. In this tutorial, use the
ACCOUNTADMIN system role so that you can view and manage objects in your account.
For more information, see [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).

A warehouse provides the compute resources that you need to execute DML operations, load data,
and run queries. These resources include CPU, memory, and temporary storage. You can use the
`SNOWFLAKE_LEARNING_WH` warehouse for this tutorial. For more information,
see [Virtual warehouses](../warehouses.md).

To set the role and warehouse to use, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
2. At the top of the worksheet, select Run.

   > **Note:**
   >
   > In this tutorial, run SQL statements one at a time. Don’t select Run all.
3. Place your cursor in the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) line, then select Run.

   ```sqlexample
   USE WAREHOUSE SNOWFLAKE_LEARNING_WH;
   ```

## Step 4. Set up a table that you can load

A database is a repository for your data. The data is stored in tables that you can
manage and query. A schema is a logical grouping of database objects, such as tables
and views. For example, a schema might contain the database objects required for a
specific application. For more information, see [Databases, Tables and Views - Overview](../../guides-overview-db.md).

In this tutorial, you use the database `SNOWFLAKE_LEARNING_DB`, a
schema that concatenates your username with `_LOAD_SAMPLE_DATA_FROM_S3`, and a table
that you create named `calendar`.

To select this database and schema for use in the session and create the table, do the following:

1. In the open worksheet, place your cursor in the [USE DATABASE](../../sql-reference/sql/use-database.md) line,
   then select Run.

   ```sqlexample
   USE DATABASE SNOWFLAKE_LEARNING_DB;
   ```
2. Place your cursor in each SET line, then select Run.

   ```sqlexample
   SET user_name = current_user();
   SET schema_name = CONCAT($user_name, '_LOAD_DATA_FROM_AMAZON_AWS');
   ```
3. Place your cursor in the USE SCHEMA IDENTIFIER line, then select Run.

   ```sqlexample
   USE SCHEMA IDENTIFIER($schema_name);
   ```
4. Place your cursor in the [CREATE TABLE](../../sql-reference/sql/create-table.md) lines, complete the table
   definition, add an optional comment, and select Run. For example, the following
   table contains six columns:

   ```sqlexample
   CREATE OR REPLACE TABLE calendar
     (
     full_date DATE
     ,day_name VARCHAR(10)
     ,month_name VARCHAR(10)
     ,day_number VARCHAR(2)
     ,full_year VARCHAR(4)
     ,holiday BOOLEAN
     )
     COMMENT = 'Table to be loaded from S3 calendar data file';
   ```
5. To confirm that the table was created successfully, place your cursor in the
   [SELECT](../../sql-reference/sql/select.md) line, then select Run.

   ```sqlexample
   SELECT * FROM calendar;
   ```

   The output shows the columns of the table you created. Currently, the table doesn’t have any rows.

## Step 5. Create a storage integration

Before you can load data from cloud storage, you must configure a storage integration that is
specific to your cloud provider. The following example is specific to Amazon S3 storage.

Storage integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud
provider credentials such as secret keys or access tokens. Integration objects store an AWS identity
and access management (IAM) user ID.

To create a storage integration for Amazon S3, do the following:

1. Use the AWS Management Console to create an IAM policy and an IAM role. These resources provide
   secure access to your S3 bucket for loading data. You will need these resources
   to create a storage integration in Snowflake. After logging into the console, complete
   [Steps 1 and 2](../data-load-s3-config-storage-integration.md) under
   [Option 1: Configure a Snowflake storage integration to access Amazon S3](../data-load-s3-config-storage-integration.md).
2. In the open worksheet, place your cursor in the [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md)
   lines, define the required parameters, and select Run. For example:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION s3_data_integration
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = 'S3'
     STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/tutorial_role'
     ENABLED = TRUE
     STORAGE_ALLOWED_LOCATIONS = ('s3://snow-tutorial-bucket/s3data/');
   ```

   Set STORAGE_AWS_ROLE_ARN to the unique identifier for the IAM role that you created previously.
   You can find this value under IAM > Roles in the AWS Management Console.
3. Place your cursor in the [DESCRIBE INTEGRATION](../../sql-reference/sql/desc-integration.md) line, specify the name of the storage
   integration you created, and select Run.

   ```sqlexample
   DESCRIBE INTEGRATION s3_data_integration;
   ```

   This command retrieves the ARN and external ID for the AWS IAM user that was created
   automatically for your Snowflake account. You will use these values to configure permissions
   for Snowflake in the AWS Management Console.

   The output for this command looks similar to the following:
4. Place your cursor in the [SHOW INTEGRATIONS](../../sql-reference/sql/show-integrations.md) line and select Run. This command returns
   information about the storage integration you created.

   ```sqlexample
   SHOW INTEGRATIONS;
   ```

   The output for this command looks similar to the following:
5. Use the AWS Management Console to configure permissions for the IAM user to access storage buckets.
   Follow [Step 5](../data-load-s3-config-storage-integration.md) under
   [Option 1: Configure a Snowflake storage integration to access Amazon S3](../data-load-s3-config-storage-integration.md).

## Step 6. Create a stage

A stage is a location that holds data files to load into a Snowflake database. This tutorial creates
a stage that can load data from a specific type of cloud storage, such as an S3 bucket.

To create a stage, do the following:

1. In the open worksheet, place your cursor in the [CREATE STAGE](../../sql-reference/sql/create-stage.md) lines, specify a name,
   the storage integration you created, the bucket URL, and the correct file format, then select Run.
   For example:

   ```sqlexample
   CREATE OR REPLACE STAGE cloud_data_db.s3_data.s3data_stage
     STORAGE_INTEGRATION = s3_data_integration
     URL = 's3://snow-tutorial-bucket/s3data/'
     FILE_FORMAT = (TYPE = CSV);
   ```
2. Return information about the stage you created:

   ```sqlexample
   SHOW STAGES;
   ```

   The output for this command looks similar to the following:

## Step 7. Load data from the stage

Load the table from the stage you created by using the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md)
command. For more information about loading from S3 buckets, see
[Copying data from an S3 stage](../data-load-s3-copy.md).

To load the data into the table, place your cursor in the [COPY INTO](../../sql-reference/sql/copy-into-table.md)
lines, specify the table name, the stage you created, and name of the file (or files) you want to load, then
select Run. For example:

> ```sqlexample
> COPY INTO cloud_data_db.s3_data.calendar
>   FROM @cloud_data_db.s3_data.s3data_stage
>     FILES = ('calendar.txt');
> ```

Your output looks similar to the following image:

## Step 8. Query the table

Now that the data is loaded, you can run queries on the `calendar` table.

To run a query in the open worksheet, select the line or lines of the [SELECT](../../sql-reference/sql/select.md)
command, and then select Run. For example, run the following query:

```sqlexample
SELECT * FROM calendar;
```

Your output looks similar to the following image:

## Step 9. Cleanup, summary, and additional resources

Congratulations! You have successfully completed this tutorial.

Take a few minutes to review a short summary and the key points covered in the tutorial.
You might also want to consider cleaning up by dropping any objects you created in the tutorial.
For example, you might want to drop the table you created and loaded:

```sqlexample
DROP TABLE calendar;
```

As long as they are no longer needed, you can also drop the other objects you created, such as
the storage integration and stage. For details, see [Data Definition Language (DDL) commands](../../sql-reference/sql-ddl-summary.md).

### Summary and key points

In summary, you used a pre-loaded template worksheet in Snowsight to complete the following steps:

1. Set the role and warehouse to use.
2. Select a database and schema to use for the session.
3. Create a table.
4. Create a storage integration and configure permissions on cloud storage.
5. Create a stage and load the data from the stage into the table.
6. Query the data.

Here are some key points to remember about loading and querying data:

* You need the required permissions to create and manage objects in your account. In this tutorial,
  you use the ACCOUNTADMIN system role for these privileges.

  This role is not normally used to create objects. Instead, we recommend creating a hierarchy of
  roles aligned with business functions in your organization. For more information, see
  [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).
* You need a warehouse for the resources required to create and manage objects and run SQL commands.
  This tutorial uses the `SNOWFLAKE_LEARNING_WH` warehouse included with the template environment.
* You used a database to store the data and a schema to group the database objects logically.
* You created a storage integration and a stage to load data from a CSV file stored in an Amazon S3 bucket.
* After the data was loaded into your database, you queried it using SELECT statements.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, as well as the SQL commands used to
  load tables from cloud storage:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [Load data into Snowflake](../../guides-overview-loading-data.md)
  + [Data loading and unloading commands](../../sql-reference/commands-data-loading.md)
* Try the Tasty Bytes Quickstarts provided by Snowflake:

  + [Tasty Bytes Quickstarts](https://www.snowflake.com/en/developers/guides/?searchTerm=tasty+bytes)

---
title: Load data from cloud storage: Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/tutorials/load-from-cloud-tutorial-gcs.md
section: User Guide
---

Snowflake

Getting Started

# Load data from cloud storage: Google Cloud Storage

## Introduction

This tutorial shows you how to load data from Google Cloud Storage into Snowflake using SQL.
You can access a pre-loaded [Snowsight template](../ui-snowsight/snowsight-templates.md)
worksheet to complete these tasks.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you will learn

In this tutorial you will learn how to:

* Use a role that has the privileges to create and use the Snowflake objects required by this tutorial.
* Use a warehouse to access resources.
* Select a database and schema to use for the session.
* Create a table.
* Create a storage integration for your cloud platform.
* Create a stage for your storage integration.
* Load data into the table from the stage.
* Query the data in the table.

## Prerequisites

This tutorial assumes the following:

* You have a [supported browser](../ui-snowsight-gs.md).
* You have access to a Snowflake account and can log in as a user who has been granted the
  ACCOUNTADMIN system role. For more information, see
  [system-defined roles](../security-access-control-overview.md).

  If you don’t have an account, you can sign up for a [free trial](https://signup.snowflake.com/)
  and choose any [Snowflake Cloud Region](../intro-regions.md).
* You have a Google Cloud account that you can use to bulk load data from Google Cloud Storage. See
  [Bulk loading from Google Cloud Storage](../data-load-gcs.md).

## Step 1. Sign in using Snowsight

To access Snowsight over the public Internet, do the following:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](../admin-account-identifier.md) or account URL.
   If you’ve previously signed in to Snowsight, you might see an account name that you can select.
3. Sign in using your Snowflake account credentials.

## Step 2. Open the SQL worksheet for loading data from Google Cloud Storage

You can use worksheets to write and run SQL commands on your database. You can access a
pre-loaded template worksheet for this tutorial. The worksheet has the SQL
commands that you will run to create database objects, load data, and query the
data. Because it is a template worksheet, you will be invited to enter your own values
for certain SQL parameters. For more information about worksheets,
see [Getting started with worksheets](../ui-snowsight-worksheets-gs.md).

To open the pre-loaded template worksheet, follow these steps:

1. In the navigation menu, select Projects » Templates.
2. Find and open Load data from Google Cloud Storage.

   Your worksheet looks similar to the following image:

## Step 3. Set the role and warehouse to use

The role you use determines the privileges you have. In this tutorial, use the
ACCOUNTADMIN system role so that you can view and manage objects in your account.
For more information, see [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).

A warehouse provides the compute resources that you need to execute DML operations, load data,
and run queries. These resources include CPU, memory, and temporary storage. You can use the
`SNOWFLAKE_LEARNING_WH` warehouse for this tutorial. For more information,
see [Virtual warehouses](../warehouses.md).

To set the role and warehouse to use, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
2. At the top of the worksheet, select Run.

   > **Note:**
   >
   > In this tutorial, run SQL statements one at a time. Don’t select Run all.
3. Place your cursor in the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) line, then select Run.

   ```sqlexample
   USE WAREHOUSE SNOWFLAKE_LEARNING_WH;
   ```

## Step 4. Create a database, schema, and table

A database is a repository for your data. The data is stored in tables that you can
manage and query. A schema is a logical grouping of database objects, such as tables
and views. For example, a schema might contain the database objects required for a
specific application. For more information, see [Databases, Tables and Views - Overview](../../guides-overview-db.md).

In this tutorial, you use the database `SNOWFLAKE_LEARNING_DB`, a
schema that concatenates your username with `_LOAD_DATA_FROM_GOOGLE_CLOUD_STORAGE`, and a table
that you create named `calendar`.

To select this database and schema for use in the session and create the table, do the following:

1. In the open worksheet, place your cursor in the [USE DATABASE](../../sql-reference/sql/use-database.md) line,
   then select Run.

   ```sqlexample
   USE DATABASE SNOWFLAKE_LEARNING_DB;
   ```
2. Place your cursor in each SET line, then select Run.

   ```sqlexample
   SET user_name = current_user();
   SET schema_name = CONCAT($user_name, '_LOAD_DATA_FROM_GOOGLE_CLOUD_STORAGE');
   ```
3. Place your cursor in the USE SCHEMA IDENTIFIER line, then select Run.

   ```sqlexample
   USE SCHEMA IDENTIFIER($schema_name);
   ```
4. Place your cursor in the [CREATE TABLE](../../sql-reference/sql/create-table.md) lines, complete the table
   definition, add an optional comment, and select Run. For example, the following
   table contains six columns:

   ```sqlexample
   CREATE OR REPLACE TABLE calendar
     (
     full_date DATE
     ,day_name VARCHAR(10)
     ,month_name VARCHAR(10)
     ,day_number VARCHAR(2)
     ,full_year VARCHAR(4)
     ,holiday BOOLEAN
     )
     COMMENT = 'Table to be loaded from GCS calendar data file';
   ```
5. To confirm that the table was created successfully, place your cursor in the
   [SELECT](../../sql-reference/sql/select.md) line, then select Run.

   ```sqlexample
   SELECT * FROM calendar;
   ```

   The output shows the columns of the table you created. Currently, the table doesn’t have any rows.

## Step 5. Create a storage integration

Before you can load data from cloud storage, you must configure a storage integration that is
specific to your cloud provider. The following example is specific to Google Cloud Storage.

Storage integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud
provider credentials such as secret keys or access tokens; instead, integration objects reference
a Google Cloud Storage service account.

To create a storage integration for Google Cloud Storage, do the following:

1. In the open worksheet, place your cursor in the in the [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md)
   lines, define the required parameters, and select Run. For example:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION gcs_data_integration
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = 'GCS'
     ENABLED = TRUE
     STORAGE_ALLOWED_LOCATIONS = ('gcs://tutorial24bucket/gcsdata/');
   ```
2. Place your cursor in the [DESCRIBE INTEGRATION](../../sql-reference/sql/desc-integration.md) line, specify the name of the storage
   integration you created, and select Run. This command returns information about the
   storage integration you created, including the Service Account ID (`STORAGE_GCP_SERVICE_ACCOUNT`) that was
   created automatically for your Snowflake account. You will use this value to configure permissions for
   Snowflake in the Google Cloud Storage Console.

   ```sqlexample
   DESCRIBE INTEGRATION gcs_data_integration;
   ```

   The output for this command looks similar to the following:
3. Place your cursor in the [SHOW INTEGRATIONS](../../sql-reference/sql/show-integrations.md) line and select Run.

   ```sqlexample
   SHOW INTEGRATIONS;
   ```

   The output for this command looks similar to the following:
4. Use the Google Cloud Storage Console to configure permissions to access storage buckets from your
   Cloud Storage Service Account. Follow Step 3 under [Configure an integration for Google Cloud Storage](../data-load-gcs-config.md).

## Step 6. Create a stage

A stage is a location that holds data files to load into a Snowflake database. This tutorial creates
a stage that can load data from a specific type of cloud storage, such as a Google Cloud Storage bucket.

To create a stage, do the following:

1. In the open worksheet, place your cursor in the [CREATE STAGE](../../sql-reference/sql/create-stage.md) lines, specify a name,
   the storage integration you created, the bucket URL, and the correct file format, then select Run.
   For example:

   ```sqlexample
   CREATE OR REPLACE STAGE cloud_data_db.gcs_data.gcsdata_stage
     STORAGE_INTEGRATION = gcs_data_integration
     URL = 'gcs://tutorial24bucket/gcsdata/'
     FILE_FORMAT = (TYPE = CSV);
   ```
2. Return information about the stage you created:

   ```sqlexample
   SHOW STAGES;
   ```

   The output for this command looks similar to the following:

## Step 7. Load data from the stage

Load the table from the stage you created by using the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md)
command. For more information about loading from Google Cloud Storage buckets, see
[Copy data from a Google Cloud Storage stage](../data-load-gcs-copy.md).

To load the data into the table, place your cursor in the [COPY INTO](../../sql-reference/sql/copy-into-table.md)
lines, specify the table name, the stage you created, and name of the file (or files) you want to load, then
select Run. For example:

> ```sqlexample
> COPY INTO cloud_data_db.gcs_data.calendar
>   FROM @cloud_data_db.gcs_data.gcsdata_stage
>     FILES = ('calendar.txt');
> ```

Your output looks similar to the following image:

## Step 8. Query the table

Now that the data is loaded, you can run queries on the `calendar` table.

To run a query in the open worksheet, select the line or lines of the [SELECT](../../sql-reference/sql/select.md)
command, and then select Run. For example, run the following query:

```sqlexample
SELECT * FROM calendar;
```

Your output looks similar to the following image:

## Step 9. Cleanup, summary, and additional resources

Congratulations! You have successfully completed this tutorial for trial accounts.

Take a few minutes to review a short summary and the key points covered in the tutorial.
You might also want to consider cleaning up by dropping any objects you created in the tutorial.
For example, you might want to drop the table you created and loaded:

```sqlexample
DROP TABLE calendar;
```

As long as they are no longer needed, you can also drop the other objects you created, such as
the storage integration and stage. For details, see [Data Definition Language (DDL) commands](../../sql-reference/sql-ddl-summary.md).

### Summary and key points

In summary, you used a pre-loaded template worksheet in Snowsight to complete the following steps:

1. Set the role and warehouse to use.
2. Select a database and schema to use for the session.
3. Create a table.
4. Create a storage integration and configure permissions on cloud storage.
5. Create a stage and load the data from the stage into the table.
6. Query the data.

Here are some key points to remember about loading and querying data:

* You need the required permissions to create and manage objects in your account. In this tutorial,
  you use the ACCOUNTADMIN system role for these privileges.

  This role is not normally used to create objects. Instead, we recommend creating a hierarchy of
  roles aligned with business functions in your organization. For more information, see
  [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).
* You need a warehouse for the resources required to create and manage objects and run SQL commands.
  This tutorial uses the `SNOWFLAKE_LEARNING_WH` warehouse included with the template environment.
* You used a database to store the data and a schema to group the database objects logically.
* You created a storage integration and a stage to load data from a CSV file stored in a Google Cloud Storage bucket.
* After the data was loaded into your database, you queried it using SELECT statements.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, as well as the SQL commands used to
  load tables from cloud storage:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [Load data into Snowflake](../../guides-overview-loading-data.md)
  + [Data loading and unloading commands](../../sql-reference/commands-data-loading.md)
* Try the Tasty Bytes Quickstarts provided by Snowflake:

  + [Tasty Bytes Quickstarts](https://www.snowflake.com/en/developers/guides/?searchTerm=tasty+bytes)

---
title: Load data from cloud storage: Microsoft Azure
source: https://docs.snowflake.com/en/user-guide/tutorials/load-from-cloud-tutorial-azure.md
section: User Guide
---

Snowflake

Getting Started

# Load data from cloud storage: Microsoft Azure

## Introduction

This tutorial shows you how to load data from Microsoft Azure cloud storage into
Snowflake using SQL. You can access a pre-loaded [Snowsight template](../ui-snowsight/snowsight-templates.md)
worksheet to complete these tasks.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you will learn

In this tutorial you will learn how to:

* Use a role that has the privileges to create and use the Snowflake objects required by this tutorial.
* Use a warehouse to access resources.
* Select a database and schema to use for the session.
* Create a table.
* Create a storage integration for your cloud platform.
* Create a stage for your storage integration.
* Load data into the table from the stage.
* Query the data in the table.

## Prerequisites

This tutorial assumes the following:

* You have a [supported browser](../ui-snowsight-gs.md).
* You have access to a Snowflake account and can log in as a user who has been granted the
  ACCOUNTADMIN system role. For more information, see
  [system-defined roles](../security-access-control-overview.md).

  If you don’t have an account, you can sign up for a [free trial](https://signup.snowflake.com/)
  and choose any [Snowflake Cloud Region](../intro-regions.md).
* You have a Microsoft Azure account that you can use to bulk load data from Microsoft Azure. See
  [Bulk loading from Microsoft Azure](../data-load-azure.md).

## Step 1. Sign in using Snowsight

To access Snowsight over the public Internet, do the following:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](../admin-account-identifier.md) or account URL.
   If you’ve previously signed in to Snowsight, you might see an account name that you can select.
3. Sign in using your Snowflake account credentials.

## Step 2. Open the SQL worksheet for loading data from Microsoft Azure

You can use worksheets to write and run SQL commands on your database. You can access a
pre-loaded template worksheet for this tutorial. The worksheet has the SQL
commands that you will run to create database objects, load data, and query the
data. Because it is a template worksheet, you will be invited to enter your own values
for certain SQL parameters. For more information about worksheets,
see [Getting started with worksheets](../ui-snowsight-worksheets-gs.md).

The worksheet for this tutorial is not pre-loaded into the trial account. To open
the worksheet for this tutorial, follow these steps:

To open the pre-loaded template worksheet, follow these steps:

1. In the navigation menu, select Projects » Templates.
2. Find and open Load data from Microsoft Azure.

   The beginning of your worksheet looks similar to the following image:

## Step 3. Set the role and warehouse to use

The role you use determines the privileges you have. In this tutorial, use the
ACCOUNTADMIN system role so that you can view and manage objects in your account.
For more information, see [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).

A warehouse provides the compute resources that you need to execute DML operations, load data,
and run queries. These resources include CPU, memory, and temporary storage. You can use the
`SNOWFLAKE_LEARNING_WH` warehouse for this tutorial. For more information,
see [Virtual warehouses](../warehouses.md).

To set the role and warehouse to use, do the following:

1. In the open worksheet, place your cursor in the [USE ROLE](../../sql-reference/sql/use-role.md) line.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ```
2. At the top of the worksheet, select Run.

   > **Note:**
   >
   > In this tutorial, run SQL statements one at a time. Don’t select Run all.
3. Place your cursor in the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) line, then select Run.

   ```sqlexample
   USE WAREHOUSE SNOWFLAKE_LEARNING_WH;
   ```

## Step 4. Set up a table that you can load

A database is a repository for your data. The data is stored in tables that you can
manage and query. A schema is a logical grouping of database objects, such as tables
and views. For example, a schema might contain the database objects required for a
specific application. For more information, see [Databases, Tables and Views - Overview](../../guides-overview-db.md).

In this tutorial, you use the database `SNOWFLAKE_LEARNING_DB`, a
schema that concatenates your username with `_LOAD_DATA_FROM_MICROSOFT_AZURE`, and a table
that you create named `calendar`.

To select this database and schema for use in the session and create the table, do the following:

1. In the open worksheet, place your cursor in the [USE DATABASE](../../sql-reference/sql/use-database.md) line,
   then select Run.

   ```sqlexample
   USE DATABASE SNOWFLAKE_LEARNING_DB;
   ```
2. Place your cursor in each SET line, then select Run.

   ```sqlexample
   SET user_name = current_user();
   SET schema_name = CONCAT($user_name, '_LOAD_DATA_FROM_MICROSOFT_AZURE');
   ```
3. Place your cursor in the USE SCHEMA IDENTIFIER line, then select Run.

   ```sqlexample
   USE SCHEMA IDENTIFIER($schema_name);
   ```
4. Place your cursor in the [CREATE TABLE](../../sql-reference/sql/create-table.md) lines, complete the table
   definition, add an optional comment, and select Run. For example, the following
   table contains six columns:

   ```sqlexample
   CREATE OR REPLACE TABLE calendar
     (
     full_date DATE
     ,day_name VARCHAR(10)
     ,month_name VARCHAR(10)
     ,day_number VARCHAR(2)
     ,full_year VARCHAR(4)
     ,holiday BOOLEAN
     )
     COMMENT = 'Table to be loaded from Azure calendar data file';
   ```
5. To confirm that the table was created successfully, place your cursor in the
   [SELECT](../../sql-reference/sql/select.md) line, then select Run.

   ```sqlexample
   SELECT * FROM calendar;
   ```

   The output shows the columns of the table you created. Currently, the table doesn’t have any rows.

## Step 5. Create a storage integration

Before you can load data from cloud storage, you must configure a storage integration that is
specific to your cloud provider. The following example is specific to Microsoft Azure storage.

Storage integrations are named, first-class Snowflake objects that avoid the need for passing explicit
cloud provider credentials such as secret keys or access tokens. Integration objects store an Azure
identity and access management (IAM) user ID called the app registration.

To create a storage integration for Azure, do the following:

1. Use the Azure portal to configure an Azure container for loading data. For details, see
   [Configure an Azure container for loading data](../data-load-azure-config.md).
2. In the open worksheet, place your cursor in the in the [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md)
   lines, define the required parameters, and select Run. For example:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION azure_data_integration
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = 'AZURE'
     AZURE_TENANT_ID = '075f576e-6f9b-4955-8e99-4086736225d9'
     ENABLED = TRUE
     STORAGE_ALLOWED_LOCATIONS = ('azure://tutorial99.blob.core.windows.net/snow-tutorial-container/');
   ```

   Set AZURE_TENANT_ID to the Office 365 tenant ID for the storage account that contains the allowed storage
   locations that you want to use. You can find this ID in the Azure portal under
   Microsoft Entra ID > Properties > Tenant ID. (Microsoft Entra ID is the new name for Azure Active
   Directory.)

   Set STORAGE_ALLOWED_LOCATIONS to the path for the Azure container where your source data file is stored.
   Use the format shown in this example, where `tutorial99` is the storage account name and `snow-tutorial-container` is the container name.
3. Place your cursor in the [DESCRIBE INTEGRATION](../../sql-reference/sql/desc-integration.md) line, specify the name of the storage
   integration you created, and select Run.

   ```sqlexample
   DESCRIBE INTEGRATION azure_data_integration;
   ```

   This command retrieves the AZURE_CONSENT_URL and AZURE_MULTI_TENANT_APP_NAME for the client application
   that was created automatically for your Snowflake account. You will use these values to configure
   permissions for Snowflake in the Azure portal.

   The output for this command looks similar to the following:
4. Place your cursor in the [SHOW INTEGRATIONS](../../sql-reference/sql/show-integrations.md) line and select Run. This command returns
   information about the storage integration you created.

   ```sqlexample
   SHOW INTEGRATIONS;
   ```

   The output for this command looks similar to the following:
5. Use the Azure portal to configure permissions for the client application (which was created
   automatically for your trial account) to access storage containers. Follow
   [Step 2: Grant Snowflake Access to the Storage Locations](../data-load-azure-config.md)
   under [Configure an Azure container for loading data](../data-load-azure-config.md).

## Step 6. Create a stage

A stage is a location that holds data files to load into a Snowflake database. This tutorial creates
a stage that can load data from a specific type of cloud storage, such as an Azure container.

To create a stage, do the following:

1. In the open worksheet, place your cursor in the [CREATE STAGE](../../sql-reference/sql/create-stage.md) lines, specify a name,
   the storage integration you created, the bucket URL, and the correct file format, then select Run.
   For example:

   ```sqlexample
   CREATE OR REPLACE STAGE cloud_data_db.azure_data.azuredata_stage
     STORAGE_INTEGRATION = azure_data_integration
     URL = 'azure://tutorial99.blob.core.windows.net/snow-tutorial-container/'
     FILE_FORMAT = (TYPE = CSV);
   ```
2. Return information about the stage you created:

   ```sqlexample
   SHOW STAGES;
   ```

   The output for this command looks similar to the following:

## Step 7. Load data from the stage

Load the table from the stage you created by using the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md)
command. For more information about loading from Azure containers, see
[Copy data from an Azure stage](../data-load-azure-copy.md).

To load the data into the table, place your cursor in the [COPY INTO](../../sql-reference/sql/copy-into-table.md)
lines, specify the table name, the stage you created, and name of the file (or files) you want to load, then select
Run. For example:

> ```sqlexample
> COPY INTO cloud_data_db.azure_data.calendar
>   FROM @cloud_data_db.azure_data.azuredata_stage
>     FILES = ('calendar.txt');
> ```

Your output looks similar to the following image:

## Step 8. Query the table

Now that the data is loaded, you can run queries on the `calendar` table.

To run a query in the open worksheet, select the line or lines of the [SELECT](../../sql-reference/sql/select.md)
command, and then select Run. For example, run the following query:

```sqlexample
SELECT * FROM calendar;
```

Your output looks similar to the following image:

## Step 9. Cleanup, summary, and additional resources

Congratulations! You have successfully completed this tutorial for trial accounts.

Take a few minutes to review a short summary and the key points covered in the tutorial.
You might also want to consider cleaning up by dropping any objects you created in the tutorial.
For example, you might want to drop the table you created and loaded:

```sqlexample
DROP TABLE calendar;
```

As long as they are no longer needed, you can also drop the other objects you created, such as
the storage integration and stage. For details, see [Data Definition Language (DDL) commands](../../sql-reference/sql-ddl-summary.md).

### Summary and key points

In summary, you used a pre-loaded template worksheet in Snowsight to complete the following steps:

1. Set the role and warehouse to use.
2. Select a database and schema to use for the session.
3. Create a table.
4. Create a storage integration and configure permissions on cloud storage.
5. Create a stage and load the data from the stage into the table.
6. Query the data.

Here are some key points to remember about loading and querying data:

* You need the required permissions to create and manage objects in your account. In this tutorial,
  you use the ACCOUNTADMIN system role for these privileges.

  This role is not normally used to create objects. Instead, we recommend creating a hierarchy of
  roles aligned with business functions in your organization. For more information, see
  [Using the ACCOUNTADMIN Role](../security-access-control-considerations.md).
* You need a warehouse for the resources required to create and manage objects and run SQL commands.
  This tutorial uses the `SNOWFLAKE_LEARNING_WH` warehouse included with the template environment.
* You used a database to store the data and a schema to group the database objects logically.
* You created a storage integration and a stage to load data from a CSV file stored in an Azure container.
* After the data was loaded into your database, you queried it using SELECT statements.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, as well as the SQL commands used to
  load tables from cloud storage:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [Load data into Snowflake](../../guides-overview-loading-data.md)
  + [Data loading and unloading commands](../../sql-reference/commands-data-loading.md)
* Try the Tasty Bytes Quickstarts provided by Snowflake:

  + [Tasty Bytes Quickstarts](https://www.snowflake.com/en/developers/guides/?searchTerm=tasty+bytes)

---
title: Load data into Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-load.md
section: User Guide
---

# Load data into Apache Iceberg™ tables

Snowflake supports the following options for loading data into a Snowflake-managed Iceberg table:

* [INSERT](../sql-reference/sql/insert.md)
* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)
* [Snowpipe](data-load-snowpipe-intro.md)
* [Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables](snowpipe-streaming/snowpipe-streaming-high-performance-iceberg.md)
* [Snowpipe Streaming Classic with Apache Iceberg™ tables](snowpipe-streaming/snowpipe-streaming-classic-iceberg.md)
* [Using the Snowflake Connector for Kafka with Apache Iceberg™ tables](kafka-connector-iceberg.md)

## File formats

You can load data into an Iceberg table from files in any of the formats supported for loading into standard Snowflake tables.

For CSV, JSON, Avro, and ORC,
Snowflake converts the data from non-Parquet file formats into Iceberg Parquet files and stores the data in the base location of the Iceberg table. Only the default
`LOAD_MODE = FULL_INGEST` option is supported for these file format loading scenarios that require type conversion.

For Apache Parquet files, Snowflake loads the data directly into table columns and lets you choose from the following `LOAD_MODE` options:

* `FULL_INGEST`: Scans the files and rewrites the Parquet data under the base location of the Iceberg table.
* `ADD_FILES_COPY`: Binary copies the Iceberg-compatible Apache Parquet files that aren’t registered with an Iceberg catalog
  into the base location of the Iceberg table, then registers the files to the Iceberg table.

For more information, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

> **Important:**
>
> Registering Parquet files by using ADD_FILES_COPY isn’t recommended if those files are already part of another Iceberg table.
>
> The best practice for converting externally-managed Iceberg tables to Snowflake-managed Iceberg tables without rewriting files is to use
> the [ALTER ICEBERG TABLE … CONVERT TO MANAGED](../sql-reference/sql/alter-iceberg-table-convert-to-managed.md) command.

## Considerations and limitations when you load data into Iceberg tables

* To load the row lineage metadata columns for Parquet files, which are `_row_id` and `_last_updated_sequence_number`, you
  must use the FULL_INGEST option. The other LOAD_MODE
  options aren’t supported. However, Parquet files that contain row lineage are likely already part of an Iceberg v3 table. For the best
  practice on how to handle Parquet files that are already part of another Iceberg table, see the note above.

## Example: Load Iceberg-compatible Parquet files

This example shows how to create an Iceberg table, and then load data into it from
Iceberg-compatible Parquet data files on an external stage.

> **Important:**
>
> Registering Parquet files by using ADD_FILES_COPY isn’t recommended if those files are already part of another Iceberg table. The best
> practice for converting externally managed Iceberg tables to Snowflake-managed Iceberg tables without rewriting files is to use the
> [ALTER ICEBERG TABLE … CONVERT TO MANAGED](../sql-reference/sql/alter-iceberg-table-convert-to-managed.md) command.

For demonstration purposes, this example uses the following resources:

* An external volume named `iceberg_ingest_vol`. To create
  an external volume, see [Configure an external volume](tables-iceberg-configure-external-volume.md).
* An external stage named `my_parquet_stage` with Iceberg-compatible Parquet files on it. To create an external stage, see
  [CREATE STAGE](../sql-reference/sql/create-stage.md).

1. Create a file format object that describes the staged Parquet files, using the required configuration for copying
   Iceberg-compatible Parquet data (`TYPE = PARQUET USE_VECTORIZED_SCANNER = TRUE`):

   ```sqlexample
   CREATE OR REPLACE FILE FORMAT my_parquet_format
     TYPE = PARQUET
     USE_VECTORIZED_SCANNER = TRUE;
   ```
2. Create a Snowflake-managed Iceberg table, defining columns with data types that are compatible with the source Parquet file data types:

   This example uses case-sensitive column names. You must surround the column names in double quotes when you create the Iceberg table, and
   specify the column names exactly as they appear in your Parquet footer.

   ```sqlexample
   CREATE OR REPLACE ICEBERG TABLE customer_iceberg_ingest (
     "c_custkey" INTEGER,
     "c_name" STRING,
     "c_address" STRING,
     "c_nationkey" INTEGER,
     "c_phone" STRING,
     "c_acctbal" INTEGER,
     "c_mktsegment" STRING,
     "c_comment" STRING
   )
     CATALOG = 'SNOWFLAKE'
     EXTERNAL_VOLUME = 'iceberg_ingest_vol'
     BASE_LOCATION = 'customer_iceberg_ingest/';
   ```

   > **Note:**
   >
   > The example statement specifies Iceberg data types that map to Snowflake data types. For more information,
   > see [Data types for Apache Iceberg™ tables](tables-iceberg-data-types.md).
3. To load the data from the staged Parquet files, which are located directly under the stage URL path, into the Iceberg table, use a COPY INTO statement:

   In COPY INTO *<table>* statements with `LOAD_MODE = ADD_FILES_COPY`, only `MATCH_BY_COLUMN_NAME = CASE_SENSITIVE` is supported.

   ```sqlexample
   COPY INTO customer_iceberg_ingest
     FROM @my_parquet_stage
     FILE_FORMAT = 'my_parquet_format'
     LOAD_MODE = ADD_FILES_COPY
     PURGE = TRUE
     MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
   ```

   > **Note:**
   >
   > The example specifies `LOAD_MODE = ADD_FILES_COPY`, which tells Snowflake to copy the files into your external volume location,
   > and then register the files to the table.
   >
   > This option avoids file charges, because Snowflake doesn’t scan the source Parquet files and rewrite the data into new Parquet files.

   Output:

   ```output
   +---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   | file                                                          | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
   |---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_008.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_006.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_005.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_002.parquet | LOADED |           5 |           5 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_010.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   +---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   ```
4. Query the table:

   ```sqlexample
   SELECT
       c_custkey,
       c_name,
       c_mktsegment
     FROM customer_iceberg_ingest
     LIMIT 10;
   ```

   Output:

   ```output
   +-----------+--------------------+--------------+
   | C_CUSTKEY | C_NAME             | C_MKTSEGMENT |
   |-----------+--------------------+--------------|
   |     75001 | Customer#000075001 | FURNITURE    |
   |     75002 | Customer#000075002 | FURNITURE    |
   |     75003 | Customer#000075003 | MACHINERY    |
   |     75004 | Customer#000075004 | AUTOMOBILE   |
   |     75005 | Customer#000075005 | FURNITURE    |
   |         1 | Customer#000000001 | BUILDING     |
   |         2 | Customer#000000002 | AUTOMOBILE   |
   |         3 | Customer#000000003 | AUTOMOBILE   |
   |         4 | Customer#000000004 | MACHINERY    |
   |         5 | Customer#000000005 | HOUSEHOLD    |
   +-----------+--------------------+--------------+
   ```

## Example: Load Iceberg-compatible Parquet files into the table created with INFER_SCHEMA function

This example covers how to do the following:

1. Create an Apache Iceberg™ table by using the [INFER_SCHEMA](../sql-reference/functions/infer_schema.md) function.
2. Load data into it from Iceberg-compatible Parquet data files on an external stage.

For demonstration purposes, this example uses the following resources:

* An external volume named `iceberg_ingest_vol`. To create
  an external volume, see [Configure an external volume](tables-iceberg-configure-external-volume.md).
* An external stage named `my_parquet_stage` with Iceberg-compatible Parquet files on it. To create an external stage, see
  [CREATE STAGE](../sql-reference/sql/create-stage.md).

1. Create a file format object that describes the staged Parquet files, using the required configuration for copying
   Iceberg-compatible Parquet data (`TYPE = PARQUET USE_VECTORIZED_SCANNER = TRUE`):

   ```sqlexample
   CREATE OR REPLACE FILE FORMAT my_parquet_format
     TYPE = PARQUET
     USE_VECTORIZED_SCANNER = TRUE;
   ```
2. Retrieve the column definitions for Parquet files in the `my_parquet_stage` stage:

   ```sqlexample
   SELECT *
     FROM TABLE(
       INFER_SCHEMA(
         LOCATION=>'@my_parquet_stage/customer_iceberg/files-to-ingest/'
         , FILE_FORMAT=>'my_parquet_format'
         , KIND => 'ICEBERG'
         )
       );
   ```

   Output:

   ```output
   +-------------+---------+----------+---------------------+------------------------------------------------------+----------+
   | COLUMN_NAME | TYPE    | NULLABLE | EXPRESSION          | FILENAMES                                            | ORDER_ID |
   |-------------+---------+----------+---------------------+------------------------------------------------------|----------+
   | id          | INT     | False    | $1:id::INT          | customer_iceberg/files-to-ingest/customers.parquet   | 0        |
   | custnum     | INT     | False    | $1:custnum::INT     | customer_iceberg/files-to-ingest/customers.parquet   | 1        |
   +-------------+---------+----------+---------------------+------------------------------------------------------+----------+
   ```
3. Create an Iceberg table using the detected schema.

   ```sqlexample
   CREATE ICEBERG TABLE myicebergtable
     USING TEMPLATE (
       SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
       WITHIN GROUP (ORDER BY order_id)
         FROM TABLE(
           INFER_SCHEMA(
             LOCATION=>'@my_parquet_stage/customer_iceberg/files-to-ingest/',
             FILE_FORMAT=>'my_parquet_format',
             KIND => 'ICEBERG'
           )
         ))
    ... {rest of the ICEBERG options}
    ;
   ```

   > **Note:**
   >
   > Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger than 16MB. We
   > recommend avoiding the use of `*` for larger result sets, and only using the required columns, `COLUMN NAME`, `TYPE`, and
   > `NULLABLE`, for the query. Optional column `ORDER_ID` can be included when using `WITHIN GROUP (ORDER BY order_id)`.
4. Use a COPY INTO statement to load the data from the staged Parquet files into the Iceberg table:

   ```sqlexample
   COPY INTO myicebergtable
     FROM @my_parquet_stage/customer_iceberg/files-to-ingest/
     FILE_FORMAT = 'my_parquet_format'
     LOAD_MODE = ADD_FILES_COPY
     MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
   ```

   > **Note:**
   >
   > The example specifies `LOAD_MODE = ADD_FILES_COPY`, which tells Snowflake to copy the files into your external volume location
   > and then register the files to the table.
   >
   > This option avoids file charges, because Snowflake doesn’t scan the source Parquet files and rewrite the data into new Parquet files.

   Output:

   ```output
   +---------------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   | file                                                                | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
   |---------------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
   | my_parquet_stage/customer_iceberg/files-to-ingest/customers.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   +---------------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   ```
5. After loading the data, query the table:

   ```sqlexample
   SELECT
       id,
       custnum
     FROM myicebergtable
     LIMIT 10;
   ```

   Output:

   ```output
   +-----------+---------+
   | id        | custnum |
   |-----------+---------+
   |         1 |   75001 |
   |         2 |   75002 |
   |         3 |   75003 |
   |         4 |   75004 |
   |         5 |   75005 |
   |         6 |   75006 |
   |         7 |   75007 |
   |         8 |   75008 |
   |         9 |   75009 |
   |        10 |   75010 |
   +-----------+---------+
   ```

---
title: Load data using Snowsight
source: https://docs.snowflake.com/en/user-guide/data-load-web-ui.md
section: User Guide
---

# Load data using Snowsight

You can add data to tables through Snowsight.

From these interfaces, you can upload files that include structured data, including
CSV or TSV formats, or semi-structured data, including JSON, Avro, ORC, Parquet, or XML formats.

You can upload data from the following locations:

* Your local computer.
* An existing stage.

You can upload up to 250 files at a time. Each file can be up to 250 MB.
To load larger files, or a large number of files, use the [Snowflake CLI](../developer-guide/snowflake-cli/index.md) or [SnowSQL](snowsql.md) client.
For more information, see [Bulk loading from a local file system](data-load-local-file-system.md).

## Load data using Snowsight

When loading your data, you can either create a new table, or load data into an existing table.

For data loading sessions in Snowsight, Snowflake runs all SQL commands in an [explicit transaction](../sql-reference/transactions.md). These commands will be committed regardless of values you set for AUTOCOMMIT at ACCOUNT or USER levels.

### Create a new table using Snowsight

When loading data, you can often create and automatically configure a new table for the data at the same time.

> **Note:**
>
> Creating a new table from an XML file when loading data isn’t supported.
>
> Creating a new Apache Iceberg™ table when loading data isn’t supported.
>
> In these situations, create a new empty table, and then use the instructions to load data into an existing table.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role, and select a role that includes the following privileges:

   | Object | Privilege | Notes |
   | --- | --- | --- |
   | Database | USAGE |  |
   | Schema | CREATE TABLE |  |
   | Stage | USAGE |  |
   | Table | OWNERSHIP |  |
3. At the top of the navigation menu, select  (Create) » Table » From File.

   The Load Data into Table dialog appears.
4. Select or create a database and schema where you want the table to be created.
5. Select the files that contain the data using one of these methods:

   * Drag and drop to upload files directly from your local system.
   * Browse to files on your local system.
   * Add from stage.

     If you select Add from stage, the stage explorer appears.

     From the stage explorer, you can navigate into stages and subfolders and select specific folders and files from the stage.

     If you select Add without selecting any specific files on the stage, the root stage, which includes all the files and folders on the stage, will be added.

     The maximum number of files that can be shown in a stage folder is 250.
6. Enter a name for the new table and then select Next. The table schema dialog appears.

   Snowsight detects the metadata schema for the file and returns the file format and column definitions identified by the [INFER_SCHEMA](../sql-reference/functions/infer_schema.md) function.
7. Review the inferred file format, data type, column name, and a sample of column data. Ensure that all information is accurate and make updates if needed.
8. Select Load.

   Snowsight loads the file and creates a new table for the file.

### Load data into an existing table using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Open the [user menu](ui-snowsight-quick-tour.md), and select an account role that includes at least the following privileges:

   | Object | Privilege | Notes |
   | --- | --- | --- |
   | Database | USAGE |  |
   | Schema | USAGE |  |
   | Stage | USAGE | Required for loading a file from a stage. |
   | File format | USAGE | Required for using a [named file format](data-load-prepare.md). |
   | Table | INSERT |  |
3. In the navigation menu, select Ingestion » Add Data.
4. Select Load data into a Table. The Load Data into Table dialog appears.
5. Select the files that contain the data using one of these methods:

   * Drag and drop to upload files directly from your local system.
   * Browse to files on your local system.
   * Add from stage.

     If you select Add from stage, the stage explorer appears.

     From the stage explorer, you can navigate into stages and subfolders and select specific folders and files from the stage.

     If you select Add without selecting any specific files on the stage, the root stage, which includes all the files and folders on the stage, will be added.

     The maximum number of files that can be shown in a stage folder is 250.
6. Select the database, schema, and table where you want to load data.
7. Select Next. The Edit Schema page appears in the Load Data into Table dialog.
8. Make final customizations as needed:

   * Select a [file format](data-load-prepare.md) from the current database.
   * Select a file type to customize, and then select the relevant settings for your data file.

     > **Note:**
     >
     > To load Parquet data into a Snowflake-managed Iceberg table, deselect Load as a single variant column?. Snowflake loads
     > Parquet data directly into Iceberg table columns. Only the default LOAD_MODE = FULL_INGEST is supported when you use Snowsight to load Parquet files.
     > For more information, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).
   * (Optional) Select View options for [format type options](../sql-reference/sql/copy-into-table.md) (examples: specify date and time formats or replace invalid characters).
   * (Optional) Select what should happen if an error occurs during loading. By default, no data is loaded from the file.
   * Select one of the following options for Table loading methods. The default option is Append.

     > + Append: New data will be appended to the existing table during data loading.
     > + Replace: New data will replace the existing data in the table.
   * Select one of the Match by column names options to automatically match the source file and the target table. The default option is case insensitive.
9. Select the Edit Schema tab on the right side of the table schema dialog. If there are any discrepancies between the source file and the target table, make adjustments as needed.

   Select the correct column name from the dropdown list to match the source file with the target table. For example, in the following screenshot, the source file has a column named `building` and the target table has a column named `BUILDING_ID`.
10. Optional: Select the Table Preview tab to preview how the data of the incoming source file will look in the target table.
11. Select Load.

    Snowsight loads your file and displays the number of rows successfully inserted into the table.

### Select a role

Select a role that has the appropriate privileges. (In the lower-left corner, select your name » Switch role » ACCOUNTADMIN.)

* To load data, your role must have the USAGE privilege on the database and the schema that contain the table that you load data into.
* To create a stage when you load data, your role must have the CREATE STAGE privilege on the database schema.
* To create a file format when you load data, your role must have the CREATE FILE FORMAT privilege on the database schema.

### Select the table where you will load the data

1. Select Databases .
2. Select a specific database and schema.
3. Select the Tables tab.
4. Locate the table into which you want to load data.
5. Start loading data into a specific table by doing one of the following:

   * Select a table row, then select Load Data.
   * Select a table name to open the table details page, then select Load Table.

   The Load Data wizard opens.
6. Select a warehouse to use to load data into the table. The drop-down includes any warehouse on which you have the USAGE privilege.
7. Select Next.

### Select the data to load

Depending on where you choose to load data from, follow the relevant steps. If you want to load data from multiple locations,
use the Load Data wizard multiple times.

To load data from your computer:

1. Select the Load files from your computer option, and select Select Files to browse to the files that you want to load.
2. Select one or more local data files and select Open.
3. Select Next.

To load data from an existing stage:

1. Select the Load files from external stage option.
2. Select an existing stage from the Stage dropdown list.
3. (Optional) Specify a path to the files in the stage.
4. Select Next.

To create a stage, for example to load data from external cloud storage:

1. Select the Load files from external stage option.
2. Select the + next to the Stage dropdown list.
3. Select the supported cloud storage service where your files are located.
4. Select Next.
5. Complete the fields to describe your stage. For more information, refer to [CREATE STAGE](../sql-reference/sql/create-stage.md).
6. Select Finish.

   Your new stage is automatically selected from the Stage dropdown list.
7. (Optional) Specify a path to the files in the stage.
8. Select Next.

### Finish loading data

After you select the files to load, finish loading data into your table.

> **Note:**
>
> If your warehouse is not running when you finish loading data, you must wait for the warehouse to resume (up to 5 minutes)
> before data is loaded.

To finish loading data, do the following:

1. Select an existing named file format from the dropdown list, or create one.

   To create a file format:

   1. Select the + next to the dropdown list.
   2. Fill in the fields to match the format of your data files. For descriptions of the options, refer to [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).
   3. Select Finish.

   Your new named file format is automatically selected from the dropdown list.
2. Determine how you want to handle errors that occur when the data is loaded:

   * If you want data loading to stop if an error occurs, select Load.
   * If you want errors to be handled in a different way:

     1. Select Next.
     2. Select the option that describes how you want to handle errors. For details about the options,
        refer to the `ON_ERROR` section of [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).
     3. Select Load.

   Snowflake loads the data into your selected table using the warehouse you selected.
3. Select OK to close the Load Data wizard.

---
title: Loading data
source: https://docs.snowflake.com/en/user-guide/data-load-considerations-load.md
section: User Guide
---

# Loading data

This topic provides best practices, general guidelines, and important considerations for loading staged data.

## Options for selecting staged data files

The COPY command supports several options for loading data files from a stage:

* By path (internal stages) / prefix (Amazon S3 bucket). See [Organizing data by path](data-load-considerations-stage.md) for information.
* Specifying a list of specific files to load.
* Using pattern matching to identify specific files by pattern.

These options enable you to copy a fraction of the staged data into Snowflake with a single command. This allows you to execute concurrent COPY statements that match a subset of files, taking advantage of parallel operations.

### Lists of files

The [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command includes a FILES parameter to load files by specific name.

> **Tip:**
>
> Of the three options for identifying/specifying data files to load from a stage, providing a discrete list of files is
> generally the fastest; however, the FILES parameter supports a maximum of 1,000 files, meaning a COPY command executed with the FILES
> parameter can only load up to 1,000 files.

For example:

> ```sqlexample
> COPY INTO load1 FROM @%load1/data1/ FILES=('test1.csv', 'test2.csv', 'test3.csv')
> ```

File lists can be combined with paths for further control over data loading.

### Pattern matching

The [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command includes a PATTERN parameter to load files using a regular expression.

For example:

> ```sqlexample
> COPY INTO people_data FROM @%people_data/data1/
>    PATTERN='.*person_data[^0-9{1,3}$$].csv';
> ```

Pattern matching using a regular expression is generally the slowest of the three options for identifying/specifying
data files to load from a stage; however, this option works well if you exported your files in
named order from your external application and want to batch load the files in the same order.

Pattern matching can be combined with paths for further control over data loading.

> **Note:**
>
> The regular expression is applied differently to bulk data loads versus Snowpipe data loads.
>
> * Snowpipe trims any path segments in the stage definition from the storage location and applies the regular expression to any remaining
>   path segments and filenames. To view the stage definition, execute the [DESCRIBE STAGE](../sql-reference/sql/desc-stage.md) command for the stage.
>   The URL property consists of the bucket or container name and zero or more path segments. For example, if the FROM location in a COPY
>   INTO *<table>* statement is `@s/path1/path2/` and the URL value for stage `@s` is `s3://mybucket/path1/`, then Snowpipe trims
>   `s3://mybucket/path1/path2/` from the storage location in the FROM clause and applies the regular expression to the remaining filenames in the path.
> * Bulk data load operations apply the regular expression to the entire storage location in the FROM clause.
>
> Snowflake recommends that you enable cloud event filtering for Snowpipe to reduce costs, event noise, and latency. Only use the PATTERN option when your cloud provider’s event filtering feature is not sufficient. For more information about configuring event filtering for each cloud provider, see the following pages:
>
> * [Configuring event notifications using object key name filtering - Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-how-to-filtering.html)
> * [Understand event filtering for Event Grid subscriptions - Azure](https://docs.microsoft.com/en-us/azure/event-grid/event-filtering)
> * [Filtering messages - Google Pub/Sub](https://cloud.google.com/pubsub/docs/filtering)

## Executing parallel COPY statements that reference the same data files

When a COPY statement is executed, Snowflake sets a load status in the table metadata for the data files referenced in the statement. This prevents parallel COPY statements from loading the same files into the table, avoiding data duplication.

When processing of the COPY statement is completed, Snowflake adjusts the load status for the data files as appropriate. If one or more data files fail to load, Snowflake sets the load status for those files as load failed. These files are available for a subsequent COPY statement to load.

If your workload consists of highly concurrent COPY statements loading data into the same table, use Snowpipe, because the service is designed to handle concurrent COPY statements and can better take advantage of parallel operations including table metadata management. You might need to consider migrating existing COPY statement workloads to Snowpipe over time, possibly due to changes in data volume and the frequency of loads executed. In the meantime, you can space out your COPY statements to reduce concurrency, which might lead to better performance.

## Loading older files

This section describes how the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command prevents data duplication differently based on whether the load status for a file is known or unknown. If you partition your data in stages using logical, granular paths by date (as recommended in [Organizing data by path](data-load-considerations-stage.md)) and load data within a short period of time after staging it, this section largely does not apply to you. However, if the COPY command skips older files (i.e. historical data files) in a data load, this section describes how to bypass the default behavior.

### Load metadata

Snowflake maintains detailed metadata for each table into which data is loaded, including:

* Name of each file from which data was loaded
* File size
* ETag for the file
* Number of rows parsed in the file
* Timestamp of the last load for the file
* Information about any errors encountered in the file during loading

This load metadata expires after 64 days. If the LAST_MODIFIED date for a staged data file is less than or equal to 64 days, the COPY command can determine its load status for a given table and prevent reloading (and data duplication). The LAST_MODIFIED date is the timestamp when the file was initially staged or when it was last modified, whichever is later.

If the LAST_MODIFIED date is older than 64 days, the load status is still known if either of the following events occurred less than or equal to 64 days prior to the current date:

> * The file was loaded successfully.
> * The initial set of data for the table (i.e. the first batch after the table was created) was loaded.

However, the COPY command cannot definitively determine whether a file has been loaded already if the LAST_MODIFIED date is older than 64 days and the initial set of data was loaded into the table more than 64 days earlier (and if the file was loaded into the table, that also occurred more than 64 days earlier). In this case, to prevent accidental reload, the command skips the file by default.

### Workarounds

To load files whose metadata has expired, set the LOAD_UNCERTAIN_FILES copy option to true. The copy option references load metadata, if available, to avoid data duplication, but also attempts to load files with expired load metadata.

Alternatively, set the FORCE option to load all files, ignoring load metadata if it exists. Note that this option reloads files, potentially duplicating data in a table.

### Examples

In this example:

* A table is created on **March 1**, and the initial table load occurs on the same day.
* 64 days pass. On **May 4**, the load metadata expires.
* A file is staged and loaded into the table on **July 1** and **2**, respectively. Because the file was staged one day prior to being loaded, the LAST_MODIFIED date was within 64 days. The load status was known. There are no data or formatting issues with the file, and the COPY command loads it successfully.
* 64 days pass. On **September 3**, the LAST_MODIFIED date for the staged file exceeds 64 days. On **September 4**, the load metadata for the successful file load expires.
* An attempt is made to reload the file into the same table on **November 1**. Because the COPY command cannot determine whether the file has been loaded already, the file is skipped. The LOAD_UNCERTAIN_FILES copy option (or the FORCE copy option) is required to load the file.

In this example:

* A file is staged on **March 1**.
* 64 days pass. On **May 4**, the LAST_MODIFIED date for the staged file exceeds 64 days.
* A new table is created on **September 29**, and the staged file is loaded into the table. Because the initial table load occurred less than 64 days prior, the COPY command can determine that the file had not been loaded already. There are no data or formatting issues with the file, and the COPY command loads it successfully.

## JSON data: Removing “null” values

In a VARIANT column, NULL values are stored as a string containing the word “null,” not the SQL NULL value. If the “null” values in your JSON documents indicate missing values and have no other special meaning, we recommend setting the file format option STRIP_NULL_VALUES to TRUE for the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command when loading the JSON files. Retaining the “null” values often wastes storage and slows query processing.

## CSV data: Trimming leading spaces

If your external software exports fields enclosed in quotes but inserts a leading space before the opening quotation character for each field, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field. The quotation characters are interpreted as string data.

Use the TRIM_SPACE [file format](../sql-reference/sql/create-file-format.md) option to remove undesirable spaces during the data load.

For example, each of the following fields in an example CSV file includes a leading space:

```sqlexample
"value1", "value2", "value3"
```

The following COPY command trims the leading space and removes the quotation marks enclosing each field:

```sqlexample
COPY INTO mytable
FROM @%mytable
FILE_FORMAT = (TYPE = CSV TRIM_SPACE=true FIELD_OPTIONALLY_ENCLOSED_BY = '0x22');

SELECT * FROM mytable;

+--------+--------+--------+
| col1   | col2   | col3   |
+--------+--------+--------+
| value1 | value2 | value3 |
+--------+--------+--------+
```

---
title: Loading protobuf data using the Snowflake Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/kafka-connector-protobuf.md
section: User Guide
---

# Loading protobuf data using the Snowflake Connector for Kafka

This topic provides instructions for installing and configuring protocol buffers (protobuf) support in the Snowflake Connector for Kafka (“Kafka connector”). Support for protobuf requires Kafka connector 1.5.0 (or higher).

The Kafka connector supports the following versions of the protobuf converter:

Confluent version:
:   This version is supported by the Confluent package version of Kafka only.

Community version:
:   This version is supported by the open source software (OSS) Apache Kafka package. This version is also supported by the Confluent package version of Kafka; however, for ease of use, we suggest using the Confluent version instead.

Install only one of these protobuf converters.

## Prerequisite: Installing the Snowflake Connector for Kafka

Install the Kafka connector using the instructions in [Installing and configuring the Kafka connector](kafka-connector-install.md).

## Configuring the Confluent version of the protobuf converter

> **Note:**
>
> The Confluent version of the Protobuf converter is available with Confluent version 5.5.0 (or higher).

1. Open your Kafka configuration file (e.g. `<kafka_dir>/config/connect-distributed.properties`) in a text editor.
2. Configure the converter properties in the file. For information about the Kafka connector properties in general, see [Kafka configuration properties](kafka-connector-install.md).

   ```sqljson
   {
    "name":"XYZCompanySensorData",
      "config":{
        ..
        "key.converter":"io.confluent.connect.protobuf.ProtobufConverter",
        "key.converter.schema.registry.url":"CONFLUENT_SCHEMA_REGISTRY",
        "value.converter":"io.confluent.connect.protobuf.ProtobufConverter",
        "value.converter.schema.registry.url":"http://localhost:8081"
      }
    }
   ```

   For example:

   ```sqljson
   {
     "name":"XYZCompanySensorData",
     "config":{
       "connector.class":"com.snowflake.kafka.connector.SnowflakeSinkConnector",
       "tasks.max":"8",
       "topics":"topic1,topic2",
       "snowflake.topic2table.map": "topic1:table1,topic2:table2",
       "buffer.count.records":"10000",
       "buffer.flush.time":"60",
       "buffer.size.bytes":"5000000",
       "snowflake.url.name":"myorganization-myaccount.snowflakecomputing.com:443",
       "snowflake.user.name":"jane.smith",
       "snowflake.private.key":"xyz123",
       "snowflake.private.key.passphrase":"jkladu098jfd089adsq4r",
       "snowflake.database.name":"mydb",
       "snowflake.schema.name":"myschema",
       "key.converter":"io.confluent.connect.protobuf.ProtobufConverter",
       "key.converter.schema.registry.url":"CONFLUENT_SCHEMA_REGISTRY",
       "value.converter":"io.confluent.connect.protobuf.ProtobufConverter",
       "value.converter.schema.registry.url":"http://localhost:8081"
     }
   }
   ```
3. Save the file.

Produce protobuf data from Kafka using the Confluent console protobuf producer, the source protobuf producer, or the Python producer.

Example Python code located in [GitHub](https://github.com/snowflakedb/snowflake-kafka-connector/blob/3bb3e0491d932cdbc58fba3efc0f5c71fa341429/test/test_suit/test_confluent_protobuf_protobuf.py) demonstrates how to produce protobuf data from Kafka.

## Configuring the community version of the protobuf converter

This section provides instructions for installing and configuring the community version of the protobuf converter.

### Step 1: Installing the community protobuf converter

1. In a terminal window, change to the directory where you want to store a clone of the GitHub repository for the protobuf converter.
2. Execute the following command to clone the [GitHub repository](https://github.com/blueapron/kafka-connect-protobuf-converter):

   ```bash
   git clone https://github.com/blueapron/kafka-connect-protobuf-converter
   ```
3. Execute the following commands to build the 3.1.0 version of the converter using [Apache Maven](https://maven.apache.org/). Note that versions 2.3.0, 3.0.0, and 3.1.0 of the converter are supported by the Kafka connector:

   > **Note:**
   >
   > Maven must already be installed on your local machine.

   ```bash
   cd kafka-connect-protobuf-converter

   git checkout tags/v3.1.0

   mvn clean package
   ```

   Maven builds a file named `kafka-connect-protobuf-converter-<version>-jar-with-dependencies.jar` in the current folder. This is the converter JAR file.
4. Copy the compiled `kafka-connect-protobuf-converter-<version>-jar-with-dependencies.jar` file to the directory for your Kafka package version:

   Confluent:
   :   `<confluenct_dir>/share/java/kafka-serde-tools`

   Apache Kafka:
   :   `<apache_kafka_dir>/libs`

### Step 2: Compiling your .proto file

Compile the protobuf `.proto` file that defines your messages into a `java` file.

For example, suppose your messages are defined in a file named `sensor.proto`. In a terminal window, execute the following command to compile the protocol buffers file. Specify the source directory for the application source code, the destination directory (for the `.java` file), and the path to your `.proto` file:

```bash
protoc -I=$SRC_DIR --java_out=$DST_DIR $SRC_DIR/sensor.proto
```

A sample `.proto` file is available here: <https://github.com/snowflakedb/snowflake-kafka-connector/blob/master/test/test_data/sensor.proto>.

The command generates a file named `SensorReadingImpl.java` in the specified destination directory.

For more information, see the [Google developer documentation](https://developers.google.com/protocol-buffers/docs/javatutorial)

### Step 3: Compiling the SensorReadingImpl.Java file

Compile the generated `SensorReadingImpl.java` file from Step 2: Compiling Your .proto File along with the Project Object Model of the protobuf project structure.

1. Open your `.pom` file from Step 2: Compiling Your .proto File in a text editor.
2. Create an otherwise empty directory with a structure:

   ```bash
   protobuf_folder
   ├── pom.xml
   └── src
       └── main
           └── java
               └── com
                   └── ..
   ```

   Where the directory structure under `src` / `main` / `java` mirrors the package name in your `.proto` file (line 3).
3. Copy the generated `SensorReadingImpl.java` file from Step 2: Compiling Your .proto File to the bottom folder in the directory structure.
4. Create a file named `pom.xml` in the root of the `protobuf_folder` directory.
5. Open the empty `pom.xml` file in a text editor. Copy the following example project model into the file and modify it:

   ```bash
   <?xml version="1.0" encoding="UTF-8"?>
   <project xmlns="http://maven.apache.org/POM/4.0.0"
           xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
           xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
      <modelVersion>4.0.0</modelVersion>

      <groupId><group_id></groupId>
      <artifactId><artifact_id></artifactId>
      <version><version></version>

      <properties>
          <java.version><java_version></java.version>
          <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
      </properties>

      <dependencies>
          <dependency>
              <groupId>com.google.protobuf</groupId>
              <artifactId>protobuf-java</artifactId>
              <version>3.11.1</version>
          </dependency>
      </dependencies>

      <build>
          <plugins>
              <plugin>
                  <groupId>org.apache.maven.plugins</groupId>
                  <artifactId>maven-compiler-plugin</artifactId>
                  <version>3.3</version>
                  <configuration>
                      <source>${java.version}</source>
                      <target>${java.version}</target>
                  </configuration>
              </plugin>
              <plugin>
                  <artifactId>maven-assembly-plugin</artifactId>
                  <version>3.1.0</version>
                  <configuration>
                      <descriptorRefs>
                          <descriptorRef>jar-with-dependencies</descriptorRef>
                      </descriptorRefs>
                  </configuration>
                  <executions>
                      <execution>
                          <id>make-assembly</id>
                          <phase>package</phase>
                          <goals>
                              <goal>single</goal>
                          </goals>
                      </execution>
                  </executions>
              </plugin>
          </plugins>
      </build>
   </project>
   ```

   Where:

   `<group_id>`
   :   Group ID segments of the package name specified in your `.proto` file. For example, if the package name is `com.foo.bar.buz`, then the group ID is `com.foo`.

   `<artifact_id>`
   :   Artifact ID for the package that you choose. The artifact ID can be randomly picked.

   `<version>`
   :   Version of the package that you choose. The version can be randomly picked.

   `<java_version>`
   :   Version of the Java Runtime Environment (JRE) installed on your local machine.

   For example:

   ```bash
   <?xml version="1.0" encoding="UTF-8"?>
   <project xmlns="http://maven.apache.org/POM/4.0.0"
           xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
           xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd">
      <modelVersion>4.0.0</modelVersion>

      <groupId>com.snowflake</groupId>
      <artifactId>kafka-test-protobuf</artifactId>
      <version>1.0.0</version>

      <properties>
          <java.version>1.8</java.version>
          <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
      </properties>

      <dependencies>
          <dependency>
              <groupId>com.google.protobuf</groupId>
              <artifactId>protobuf-java</artifactId>
              <version>3.11.1</version>
          </dependency>
      </dependencies>

      <build>
          <plugins>
              <plugin>
                  <groupId>org.apache.maven.plugins</groupId>
                  <artifactId>maven-compiler-plugin</artifactId>
                  <version>3.3</version>
                  <configuration>
                      <source>${java.version}</source>
                      <target>${java.version}</target>
                  </configuration>
              </plugin>
              <plugin>
                  <artifactId>maven-assembly-plugin</artifactId>
                  <version>3.1.0</version>
                  <configuration>
                      <descriptorRefs>
                          <descriptorRef>jar-with-dependencies</descriptorRef>
                      </descriptorRefs>
                  </configuration>
                  <executions>
                      <execution>
                          <id>make-assembly</id>
                          <phase>package</phase>
                          <goals>
                              <goal>single</goal>
                          </goals>
                      </execution>
                  </executions>
              </plugin>
          </plugins>
      </build>
   </project>
   ```
6. In a terminal window, change to the root of the `protobuf_folder` directory. Execute the following command to compile the protobuf data JAR file from the files in the directory:

   ```bash
   mvn clean package
   ```

   Maven generates a file named `<artifact_id>-<version>-jar-with-dependencies.jar` in the `protobuf_folder/target` folder (e.g. `kafka-test-protobuf-1.0.0-jar-with-dependencies.jar`).
7. Copy the compiled `kafka-test-protobuf-1.0.0-jar-with-dependencies.jar` file to the directory for your Kafka package version:

   Confluent:
   :   `<confluenct_dir>/share/java/kafka-serde-tools`

   Apache Kafka:
   :   Copy the file to the directory in your `$CLASSPATH` environment variable.

### Step 4: Configuring the Kafka connector

1. Open your Kafka configuration file (e.g. `<kafka_dir>/config/connect-distributed.properties`) in a text editor.
2. Add the `value.converter.protoClassName` property to the file. This property specifies the protocol buffer class to use to deserialize messages (e.g. `com.google.protobuf.Int32Value`).

   > **Note:**
   >
   > Nested classes must be specified using the `$` notation (e.g. `com.blueapron.connect.protobuf.NestedTestProtoOuterClass$NestedTestProto`).

   For example:

   ```bash
   {
    "name":"XYZCompanySensorData",
      "config":{
        ..
        "value.converter.protoClassName":"com.snowflake.kafka.test.protobuf.SensorReadingImpl$SensorReading"
      }
    }
   ```

   For information about the Kafka connector properties in general, see [Kafka configuration properties](kafka-connector-install.md).

   For more information about protocol buffer classes, see the Google developer documentation referenced earlier in this topic.
3. Save the file.

---
title: Machine Learning & Data Science
source: https://docs.snowflake.com/en/user-guide/ecosystem-analytics.md
section: User Guide
---

# Machine Learning & Data Science

Also referred to as advanced analytics, artificial intelligence (AI), and “Big Data”, machine learning and data science cover a broad
category of vendors, tools, and technologies that provide advanced capabilities for statistical and predictive modeling.

These tools and technologies often share some overlapping features and functionality with [BI tools](ecosystem-bi.md);
however, they focus less on analyzing/reporting past data. Instead, they focus on examining large data sets to discover patterns and
uncover useful business information that can be used to predict future trends.

The following machine learning and data science platforms and technologies are known to provide native connectivity to Snowflake:

| Solution |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Alteryx:** Analytics 11.5 (or higher)  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake In-Database Functionality Now Available](https://community.alteryx.com/t5/Analytics-Blog/Snowflake-In-DB-Functionality-is-Now-Available-Making-11-0-Even/ba-p/77268)     (Alteryx Community Blog)   + [Supported Data Sources — Snowflake](https://help.alteryx.com/current/DataSources/Snowflake.htm) (Alteryx Documentation) |
|  |  | **Amazon SageMaker:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Developer Guide > Prepare and Analyze Datasets > Import data from Snowflake](https://docs.aws.amazon.com/sagemaker/latest/dg/data-wrangler-import.html#data-wrangler-snowflake) (AWS Documentation)   + [Preconfigured Amazon SageMaker Instance with Snowflake Connector](https://www.snowflake.com/blog/preconfigured-amazon-sagemaker-instance-with-snowflake-connector/) (Snowflake Blog) |
|  |  | **BoostKPI:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [BoostKPI Integration with Snowflake](https://boostkpi.com/partners/snowflake) (BoostKPI website) |
|  |  | **Dataiku:** DSS  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Connecting to Data > SQL Databases > Snowflake](https://doc.dataiku.com/dss/latest/connecting/sql/snowflake.html)     (Dataiku Documentation) |
|  |  | **DataRobot:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [The Power of DataRobot + Snowflake](https://blog.datarobot.com/the-power-of-datarobot-plus-snowflake) (DataRobot Blog) |
|  |  | **Domino:** 3.6 (or higher)  **Snowflake:** See the Domino documentation for requirements | * Additional resources:    + [Connecting to Snowflake from Domino](https://docs.dominodatalab.com/en/latest/reference/data/data_sources/Connecting_to_Snowflake_from_Domino.html)     (Domino Documentation) |
|  |  | **Domo:** No requirements  **Snowflake:** No requirements | Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md).   * Additional resources:    + [What’s next for your business with AI and ML](https://www.domo.com/data-science)   + [An executive’s guide to automated machine learning](https://www.domo.com/learn/article/an-executives-guide-to-automated-machine-learning)   + [What is automated machine learning?](https://www.domo.com/glossary/what-is-automated-machine-learning)   + Train and Deploy Models with AutoML: [Customer Support Community](https://domo-support.domo.com/s/article/360048127854?language=en_US) |
|  |  | **Fosfor**: no requirements  **Snowflake**: No requirements | Additional resources   * [Fosfor website](https://fosfor.com) |
|  |  | **H2O.ai:** Driverless AI 1.4.2 (or higher)  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Using Data Connectors with Native Installs - Snowflake](http://docs.h2o.ai/driverless-ai/latest-stable/docs/userguide/connectors-nd/snowflake.html)     (H2O.ai Documentation) |
|  |  | **Hex:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Query Snowflake data right from a powerful data notebook](https://hex.tech/integrations/snowflake) (Hex website)   + [Connect to Data](https://learn.hex.tech/docs/connect-to-data/data-connections/overview) (Hex Documentation) |
|  |  | **KNIME:**   * Analytics Platform 4.4.0 (or higher) * [Extension: KNIME Snowflake Integration](https://hub.knime.com/knime/extensions/org.knime.features.snowflake/latest)   **Snowflake:** None (JDBC Driver embedded in the KNIME extension); other Snowflake drivers also supported | * Additional resources:    + [Overview: KNIME for Snowflake Users](https://www.knime.com/knime-for-snowflake-users) (KNIME website)   + [Collection: KNIME for Snowflake Users](https://hub.knime.com/knime/collections/KNIME%20for%20Snowflake%20Users~1sIkhkwhAvlptfBj)     (KNIME Community Hub)   + [KNIME Snowflake Extension Guide](https://docs.knime.com/latest/snowflake_extension_guide/) (KNIME Documentation) |
|  |  | **Qlik AutoML:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Request a Qlik AutomML Demo](https://www.qlik.com/us/contact-us/demo-request-automl)   + [Qlik AutoML User Guide](https://res.cloudinary.com/talend/image/upload/v1708902939/qlik/docs/resource-library/ebooks/resource-eb-automating-machine-learning-in-analytics-a-practical-guide-en_rh26zv.pdf)   + [How do I create a Snowflake connection](https://support.bigsquid.com/hc/en-us/articles/360000232593-How-do-I-create-a-Snowflake-connection-) (Big Squid Support) |
|  |  | **Qubole:** Enterprise Edition  **Snowflake:** No requirements | * Integration implemented through the [Snowflake Connector for Spark](spark-connector-qubole.md) embedded in Qubole   Data Service (QDS) * Additional resources:    + [Qubole Quickstart Guide](http://docs.qubole.com/en/latest/quick-start-guide/index.html) (Qubole Documentation)   + [Qubole-Snowflake Integration Guide](http://docs.qubole.com/en/latest/partner-integration/snowflake-integration/index.html)     (Qubole Documentation) |
|  |  | **SAS:**   * Cloud Analytic Services 3.4 (or higher) * SAS/ACCESS 9.4 (or higher) for Relational Databases   **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page | * Additional resources:    + [Snowflake: Data Connector Specifics](https://documentation.sas.com/?docsetId=casref&docsetTarget=p183rli8obtde3n10y9bzbrpwnsh.htm&docsetVersion=3.4&locale=en)     (SAS Documentation)   + [SAS/ACCESS Interface to Snowflake](https://documentation.sas.com/?docsetId=acreldb&docsetTarget=p19i7uzcbso1szn1pczxn88co3g1.htm&docsetVersion=9.4&locale=en)     (SAS Documentation) |
|  |  | **Spark:** 3.0 (or higher)  **Scala:** 2.12 or 2.13  **Snowflake:**   * [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the   [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) * [Connector for Spark](spark-connector.md) — download from the   [Snowflake Connector for Spark page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20spark-snowflake&sort=published) | * Additional resources:    + [Configuring Snowflake to Communicate with Spark Running on EMR](https://community.snowflake.com/s/article/configuring-snowflake-to-communicate-with-apache-spark-running-on-amazon-emr-with-apache-zeppelin)     (Snowflake Community) |
|  |  | **Tellius:** None  **Snowflake:** None | * Additional resources:    + [Tellius + Snowflake for Instant Analytics & AI at Scale](https://www.tellius.com/snowflake/)     (Tellius website)   + [4 Things That Make Tellius and Snowflake Great Together](https://www.tellius.com/4-things-that-make-tellius-and-snowflake-great-together/)     (Tellius website)   + [Transform Data in Snowflake Data Cloud with Java UDFs in Tellius](https://www.tellius.com/transform-data-in-snowflake-data-cloud-with-java-udfs-in-tellius/)     (Tellius website) |
|  |  | **Zepl:** No requirements  **Snowflake:**   * [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the   [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) * [Connector for Spark](spark-connector.md) — download from the   [Snowflake Connector for Spark page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20spark-snowflake&sort=published) | * Additional resources:    + [Zepl and Snowflake Bring Data Science as a Service to Cloud Data Warehouses](https://www.zepl.com/blog/zepl-and-snowflake-bring-data-science-as-a-service-to-cloud-data-warehouses/) (Zepl Blog)   + [Getting Started with Zepl and Snowflake in Minutes](https://www.zepl.com/blog/get-started-with-zepl-and-snowflake-in-minutes/)     (Zepl Blog)   + [Zepl and Snowflake — Data Science as a Service for your Cloud Warehouse](https://www.zepl.com/wp-content/uploads/2019/06/Snowflake1s.pdf)     (Zepl Solution Brief) |

---
title: MacOS and Linux troubleshooting steps
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/mac-linux.md
section: User Guide
---

# MacOS and Linux troubleshooting steps

Follow these steps to identify and confirm that you have a proxy and to gather the proxy host and port numbers that you need for further troubleshooting.

1. Open a new `Terminal` window.
2. Execute the following command to retrieve proxy configuration details specific to your network. Replace `example.com` with the actual hostname you want to test.

   ```bash
   networksetup -getsecurewebproxy "$(networksetup -listnetworkserviceorder | grep $(route get example.com | grep interface | awk -F: '{print $2}') | awk -FPort: '{print $2}' | awk -F, '{print $1}' | sed 's/^ //g')"
   ```

   Sample output with a proxy configuration

   > ```output
   > Enabled: Yes
   > Server: 192.168.21.12
   > Port: 3128
   > Authenticated Proxy Enabled: 1
   > ```

   Sample output without a proxy configuration

   > ```output
   > Enabled: No
   > Server:
   > Port: 0
   > Authenticated Proxy Enabled: 0
   > ```
3. Additionally, you can test common environment variables used for proxy settings with the following command:

   ```bash
   env | grep -i proxy
   ```

   The command returns output similar to the following:

   ```output
   http_proxy=http://my.pro.xy:123
   HTTP_PROXY=http://my.pro.xy:123
   HTTPS_PROXY=http://my.pro.xy:123
   https_proxy=http://my.pro.xy:123
   NO_PROXY=localhost,.example.com,.amazonaws.com
   ```

   * **Proxy found**: Based on these environment variables settings, you can gather the proxy host and port that you will need for further testing.
   * **No proxy found**: If the output is empty, you likely have no environment variables set for a proxy configuration, which needs further testing.
   * The `NO_PROXY` defines the hosts that a client can use to connect directly without going through the proxy server.

## If you have a proxy

You can identify the specific URL that is experiencing connectivity issues. While it is beneficial to test all URLs listed in the Snowflake allowlist, you might want to focus on the URL that is directly causing issues in your setup.

```bash
export http_proxy=http://<PROXY_HOST:PROXY_PORT> && export HTTP_PROXY=$http_proxy && export HTTPS_PROXY=$http_proxy && export https_proxy=$http_proxy

curl -v https://URL 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"
```

Alternatively, you can pass the proxy settings directly into `curl` (without setting the environment variables first), as shown:

* Unauthenticated proxy

  ```bash
  curl --proxy “<PROTOCOL>://<HOST>:<PORT>” ..rest of the arguments..
  ```
* Authenticated proxy

  ```bash
  curl --proxy “<PROTOCOL>://<HOST>:<PORT>” --proxy-user user:pass ..rest of the arguments..
  ```

In the `Terminal`, run the following commands. Update the command with the URL that is causing issues. Replace `<URL>` with the problematic URL. Additionally, replace `<PROXY_URL>` with your proxy information.

```bash
export http_proxy=http://<PROXY_URL> && export HTTP_PROXY=$http_proxy && export HTTPS_PROXY=$http_proxy && export https_proxy=$http_proxy

curl -v https://<URL> 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"
```

These commands configure your environment to use the proxy for HTTP and HTTPS requests and attempt to connect to the specified Snowflake URL. It also outputs detailed information about the connection attempt, including any successful connections or errors encountered.

Successful connection example output:

```output
➜  curl -v https://<account>.snowflakecomputing.com 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"
  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0*   Trying <IP ADDRESS>...
* Connected to <IP ADDRESS> (<IP ADDRESS>) port <PORT> (#0)
* Establish HTTP proxy tunnel to <account>.snowflakecomputing.com:443
> CONNECT <account>.snowflakecomputing.com:443 HTTP/1.1
> User-Agent: curl/7.79.1
< HTTP/1.1 200 Connection established
* Proxy replied 200 to CONNECT request
* CONNECT phase completed!
*  subject: CN=*.us-east-1.snowflakecomputing.com
*  subjectAltName: host "<account>.snowflakecomputing.com" matched cert's "*.us-east-1.snowflakecomputing.com"
*  issuer: C=US; O=Amazon; OU=Server CA 1B; CN=Amazon
> GET / HTTP/1.1
> User-Agent: curl/7.79.1
< HTTP/1.1 302 Found
```

Output analysis:

* “Connected to…” indicates a successful connection to the proxy (<IP ADDRESS>) and the establishment of an HTTP tunnel to Snowflake.
* HTTP status codes like `HTTP/1.1 200 Connection established` followed by `HTTP/1.1 302 Found` suggests a successful to the login page.

After completing these steps, continue with [follow-up actions](followup-actions.md).

## If you don’t have a proxy

In the `Terminal`, run the following command, making sure to update the URL in the commands to match the Snowflake URL that you are testing.

```bash
curl -v https://<URL> 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"
```

Successful connection example output:

```output
➜  curl -v https://<account>.snowflakecomputing.com 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0*   Trying 52.22.29.117:443...
* Connected to <account>.snowflakecomputing.com (52.22.29.117) port 443 (#0)
*  subject: CN=*.us-east-1.snowflakecomputing.com
*  subjectAltName: host "<account>.snowflakecomputing.com" matched cert's "*.us-east-1.snowflakecomputing.com"
*  issuer: C=US; O=Amazon; OU=Server CA 1B; CN=Amazon
  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0> GET / HTTP/1.1
< HTTP/1.1 302 Found
```

This output demonstrates a successful connection, indicating that your system can reach and communicate with the Snowflake server.

Connection failure example:

```output
➜  curl -v https://<account>.snowflakecomputing.com 2>&1 | tee | grep "Trying\|Connected\|Establish\|CONNECT\|subject\|issuer\|HTTP\|curl"
  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0*   Trying 52.22.29.117:443...
*   Trying 3.222.247.13:443...
*   Trying 54.81.51.170:443...
curl: (7) Failed to connect to <account>.us-east-1.snowflakecomputing.com port 443 after 3139 ms: Connection refused
```

After completing these steps, continue with [follow-up actions](followup-actions.md).

---
title: Malicious IP Protection
source: https://docs.snowflake.com/en/user-guide/malicious-ip-protection.md
section: User Guide
---

# Malicious IP Protection

## Overview

The Malicious IP Protection service continuously detects network access attempts that originate from IP addresses that are maintained on a
curated list. The service protects the Snowflake instance by blocking network access attempts that originate from those IP addresses. The
service hardens both Snowflake’s and the customer’s security posture by reducing the risk of unauthorized access, data breaches, and
malicious activity.

Snowflake maintains and curates a list of IP addresses, based on data that is obtained from third-party cybersecurity data sources that provide
external threat intelligence. The IP addresses are from known bad actors. The following table lists and describes how Snowflake categorizes
IP addresses based on impact analysis:

| IP Category | Description |
| --- | --- |
| ANONYMOUS_VPN | IP addresses associated with anonymous VPN services. |
| ANONYMOUS_PROXIES | IP addresses associated with anonymous proxy servers. |
| MALICIOUS_BEHAVIOR | IP addresses associated with known malware and behavior such as automated brute force login attempts. |
| TOR_EXITS | IP addresses used as exit nodes for the Tor network. |

The Malicious IP Protection service blocks network access attempts that originate from IP addresses in all categories on this curated list
by default.

## View network login details

You can use the Account Usage [LOGIN_HISTORY view](../sql-reference/account-usage/login_history.md) to see details of network access attempts that the Malicious
IP Protection service has blocked. For example, to view login events for your account, run the following query:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY
  WHERE NOT is_success AND login_details IS NOT NULL
  ORDER BY event_timestamp DESC;
```

Next, examine the `is_success` and `login_details` columns of the LOGIN_HISTORY view output for your account.

`NO` appears in the `is_success` column for blocked network access attempts.

The following examples show output that appears in the `login_details` column for blocked IP addresses:

Example — blocked IP categorized as “LOW” risk:

```json
{
  "malicious_ip_protection_info":"{\"result\":\"BLOCKED\",\"riskClassification\":\"LOW\",\"categories\":[\"MALICIOUS_BEHAVIOR\"]}"
}
```

Example — blocked IP categorized as “HIGH” risk:

```json
{
  "malicious_ip_protection_info":"{\"result\":\"BLOCKED\",\"riskClassification\":\"HIGH\",\"categories\":[\"ANONYMOUS_VPN\",\"ANONYMOUS_PROXIES\"]}"
}
```

The IP address that corresponds to each result appears in the `ip_address` column.

If you notice that IP addresses that were categorized as low-risk were blocked, you might choose to opt out of blocking that category.

## Manage Malicious IP Protection for low-risk categories

You can manage Malicious IP Protection by opting out of blocking IP addresses that are categorized as low-risk. You can’t opt out of blocking
IP addresses that are categorized as high-risk.

To opt out of blocking a category, run the [SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY](../sql-reference/functions/system_opt_out_malicious_ip_protection_by_category.md) function and
provide a low-risk category name as an argument. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('MALICIOUS_BEHAVIOR');
```

To opt out of blocking for a another category, run the [SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY](../sql-reference/functions/system_opt_out_malicious_ip_protection_by_category.md)
function again and provide *both* low-risk category names as arguments. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('ANONYMOUS_VPN,MALICIOUS_BEHAVIOR');
```

To re-enable blocking IP addresses, run the SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY function and provide `''` as an argument.
For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('');
```

Optionally, run the function and provide a user name as the second argument to either opt out of, or re-enable, blocking of IP addresses for
only the user that you specify. For example, to disable Malicious IP Protection for IP addresses in the `ANONYMOUS_VPN` category for the
specific user `JSMITH`, run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('ANONYMOUS_VPN', 'JSMITH');
```

The following example shows Account Usage LOGIN_HISTORY view output in the `login_details` column. The IP address for this result was opted
out of blocking the MALICIOUS_BEHAVIOR category by running the SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY function:

Example — unblocked IP categorized as “LOW” risk:

```json
{
  "malicious_ip_protection_info":"{\"result\":\"OPTED_OUT\",\"riskClassification\":\"LOW\",\"categories\":[\"MALICIOUS_BEHAVIOR\"]}"
}
```

---
title: Manage Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-manage.md
section: User Guide
---

# Manage Apache Iceberg™ tables

Manage [Apache Iceberg™ tables](tables-iceberg.md) in Snowflake:

* Query a table
* Use DML commands
* Generate snapshots of DML changes
* Use row-level deletes
* Set a target file size
* Table optimization for Snowflake-managed Iceberg tables
* Maintain tables that use an external catalog
* Refresh the table metadata
* Retrieve storage metrics
* Set data compaction
* Use default values with Iceberg tables
* Use row lineage with Iceberg tables
* Migrate an Iceberg table to Azure Data Lake Storage

You can also convert an Iceberg table that uses an external catalog into a table that uses Snowflake as the Iceberg catalog.
To learn more, see [Convert an Apache Iceberg™ table to use Snowflake as the catalog](tables-iceberg-conversion.md).

## Query a table

To query an Iceberg table, a user must be granted or inherit the following privileges:

* The USAGE privilege on the database and schema that contain the table
* The SELECT privilege on the table

You can query an Iceberg table using a SELECT statement. For example:

```sqlexample
SELECT col1, col2 FROM my_iceberg_table;
```

> **Note:**
>
> Along with Snowflake, you can also use an external query engine to query Iceberg tables. For more information,
> see [Use an external query engine with Apache Iceberg™ tables](tables-iceberg-use-external-query-engine.md).

## Use DML commands

Iceberg tables that use Snowflake as the catalog support full [Data Manipulation Language (DML) commands](../sql-reference/sql-dml.md),
including the following:

* [INSERT](../sql-reference/sql/insert.md)
* [MERGE](../sql-reference/sql/merge.md)
* [UPDATE](../sql-reference/sql/update.md)
* [DELETE](../sql-reference/sql/delete.md)
* [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md)

Snowflake-managed tables also support efficient bulk loading using features such as [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)
and [Snowpipe](data-load-snowpipe-intro.md). For more information,
see [Load data into Apache Iceberg™ tables](tables-iceberg-load.md).

> **Note:**
>
> * Snowflake also supports writing to externally managed Iceberg tables. For more information, see [Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md)
>   and [Writing to externally managed Iceberg tables](tables-iceberg-externally-managed-writes.md).
> * For Snowflake-managed Iceberg tables, if a DML operation fails unexpectedly and rolls back, some Parquet files might get written to your
>   external cloud storage but won’t be tracked or referenced by your Iceberg table metadata. These Parquet files are orphan files.
>
>   If you see a mismatch between storage usage for your
>   external cloud storage and Snowflake, you might have orphan files in your external cloud storage. To see your storage usage for Snowflake,
>   you can use the [TABLE_STORAGE_METRICS view](../sql-reference/info-schema/table_storage_metrics.md) or [TABLE_STORAGE_METRICS view](../sql-reference/account-usage/table_storage_metrics.md).
>   If you see a mismatch, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for assistance with determining whether you have orphan files and removing them.

### Example: Update a table

You can use [INSERT](../sql-reference/sql/insert.md) and [UPDATE](../sql-reference/sql/update.md) statements to modify Snowflake-managed Iceberg tables.

The following example inserts a new value into an Iceberg table named `store_sales`,
then updates the `cola` column to 1 if the value is currently -99.

```sqlexample
INSERT INTO store_sales VALUES (-99);

UPDATE store_sales
  SET cola = 1
  WHERE cola = -99;
```

## Generate snapshots of DML changes

For tables that use Snowflake as the catalog, Snowflake automatically generates the Iceberg metadata. Snowflake writes
the metadata to a folder named `metadata` on your external volume. To find the `metadata` folder,
see [Data and metadata directories](tables-iceberg-storage.md).

Alternatively, you can call the [SYSTEM$GET_ICEBERG_TABLE_INFORMATION](../sql-reference/functions/system_get_iceberg_table_information.md) function to generate Iceberg metadata
for any new changes.

For tables that aren’t managed by Snowflake, the function returns information about the latest refreshed snapshot.

For example:

```sqlexample
SELECT SYSTEM$GET_ICEBERG_TABLE_INFORMATION('db1.schema1.it1');
```

Output:

```output
+-----------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_ICEBERG_TABLE_INFORMATION('DB1.SCHEMA1.IT1')                                                   |
|-----------------------------------------------------------------------------------------------------------|
| {"metadataLocation":"s3://mybucket/metadata/v1.metadata.json","status":"success"}                         |
+-----------------------------------------------------------------------------------------------------------+
```

## Use row-level deletes

Snowflake supports querying tables with row-level deletes and writing to tables by using row-level deletes.

### Query tables

Snowflake supports querying [externally managed Iceberg tables](tables-iceberg.md) when you’ve configured
[row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) for update, delete, and merge operations.

To configure row-level deletes, see
[Write properties](https://iceberg.apache.org/docs/latest/configuration/#write-properties) in the Apache Iceberg documentation.

### Write to tables by using positional delete files

> **Note:**
>
> * Supported for externally managed Iceberg tables only.
> * To use position row-level deletes, ensure that the Iceberg version for Iceberg tables is set to v2, which is the default. For
>   more information, see [ICEBERG_VERSION_DEFAULT](../sql-reference/parameters.md). If
>   the Iceberg version is set to v3, the merge-on-read behavior in Snowflake is to use deletion vectors.

Snowflake supports position row-level deletes for writing to externally managed Iceberg tables stored on Amazon S3, Azure, or
Google Cloud. To turn off position deletes, which enable running the DML operations in copy-on-write mode, set the
`ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or database level.

### Write to tables by using deletion vectors

To optimize row-level data
modifications, Snowflake supports deletion vectors for writing to externally managed and Snowflake-managed Iceberg tables stored on Amazon
S3, Azure, or
Google Cloud. With deletion vectors, Snowflake can perform “merge-on-read” (MOR) operations, which improve write performance
for the following DML statements:

* DELETE
* UPDATE
* MERGE

Snowflake achieves this performance by writing small vector files instead of rewriting large data files. For more information, see
[Deletion vectors](https://iceberg.apache.org/spec/#deletion-vectors) in the Apache Iceberg specification.

#### Enable deletion vectors

To enable deletion vectors, complete the following steps:

1. Set the default Iceberg version for Iceberg tables to v3 by following the instructions in [Configure the default Iceberg version](tables-iceberg-v3-specification-support.md).

   > **Note:**
   >
   > If the default Iceberg version for Iceberg tables is v2, Snowflake performs “merge-on-read” (MOR) operations by using
   > positional delete files.
2. Set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to `TRUE`, which is the default, by following the instructions in [ENABLE_ICEBERG_MERGE_ON_READ](../sql-reference/parameters.md).
3. To run DML operations in copy-on-write mode, set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE.

#### Usage notes for deletion vectors

* **Default behavior**

  + The system default for ENABLE_ICEBERG_MERGE_ON_READ is TRUE.
* **Write method heuristics**

  + When ENABLE_ICEBERG_MERGE_ON_READ is set to TRUE, Snowflake uses heuristics to decide per-file whether to use merge-on-read or
    copy-on-write:

    - **Row count:** Snowflake only writes a deletion vector if fewer than ~5% of rows in a data file are deleted. If ≥5% are deleted, Snowflake
      rewrites the file by using copy-on-write.
    - **File size:** For Snowflake to write deletion vectors, the data file must be larger than approximately 1.6 MB.
* **Compatibility**

  + If you use compute engines that don’t yet support Iceberg v3 deletion vectors, set ENABLE_ICEBERG_MERGE_ON_READ to FALSE to enforce
    copy-on-write for all writes.
* **Parameter precedence**

  + Snowflake only checks the ENABLE_ICEBERG_MERGE_ON_READ parameter to determine the write method. It doesn’t recognize the following Iceberg
    table properties:

    - write.delete.mode
    - write.update.mode
    - write.merge.mode

### Copy-on-write vs. merge-on-read

Iceberg provides two modes for configuring how compute engines handle row-level operations for externally
managed tables. Snowflake supports both of these modes.

The following table describes when you might want to use each mode:

| Mode | Description |
| --- | --- |
| Copy-on-write (default) | This mode prioritizes read time and affects write speed.  When you perform an update, delete, or merge operation, your compute engine rewrites the entire affected Parquet data file. This can result in slow writes, especially if you have large data files, but doesn’t impact read time.  This is the default mode. |
| Merge-on-read | This mode prioritizes write speed and slightly affects read time.  When you perform an update, delete, or merge operation, your compute engine creates a delete file that contains information about only the changed rows.  When you read from a table, your query engine merges delete files with data files. Merging can increase read time. However, you can optimize read performance by scheduling regular compaction and table maintenance. |

For more information about row-level changes for Iceberg, see [Row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) in the
Apache Iceberg documentation.

### Considerations and limitations

Consider the following information when you use row-level deletes with Iceberg tables:

* Snowflake supports [position deletes](https://iceberg.apache.org/spec/#position-delete-files) only for v2 Iceberg tables, and
  [deletion vectors](https://iceberg.apache.org/spec/#deletion-vectors) for v3 Iceberg tables.
* Snowflake only supports position deletes with externally managed Iceberg tables.
* For the best read performance when you use row-level deletes, perform regular compaction and table maintenance to remove old delete files. For
  information, see Maintain tables that use an external catalog.
* Excessive position deletes, especially dangling position deletes, might prevent table creation and refresh operations.
  To avoid this issue, perform table maintenance to remove extra position deletes.

  The table maintenance method to use depends on your external Iceberg engine. For example, you can use the `rewrite_data_files` method
  for Spark with the `delete-file-threshold` or `rewrite-all` options. For more information, see
  [rewrite_data_files](https://iceberg.apache.org/docs/latest/spark-procedures/#rewrite_data_files) in the Apache Iceberg™ documentation.

## Set a target file size

To improve query performance for external Iceberg engines such as
Spark or Trino, you can configure a target file size for both Snowflake-managed and
[externally managed Iceberg tables with write support](tables-iceberg-externally-managed-writes.md). You can either set a
specific size (16MB, 32MB, 64MB, or 128MB), or use the AUTO option. AUTO works differently, depending on the table type:

* Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics
  such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically
  adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake.
* Externally managed tables: AUTO specifies that Snowflake should aggressively scale to a larger file size.

You can set the target file size when you create an Iceberg table, or run the ALTER ICEBERG TABLE command
to change the target file size for an existing Iceberg table.
Snowflake attempts to maintain file sizes close to the target size when writing Parquet files for a table.

After you set a target file size, Snowflake immediately starts to create larger files for new Data Manipulation Language (DML) operations.
Snowflake’s table maintenance operations asynchronously change the existing table files according to the target file size.

The following example uses TARGET_FILE_SIZE to set a target file size of 128 MB for a Snowflake-managed table:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (col1 INT)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table'
  TARGET_FILE_SIZE = '128MB';
```

Alternatively, use [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) to set the TARGET_FILE_SIZE property for an existing table:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table
  SET TARGET_FILE_SIZE = '32MB';
```

To check the value of the TARGET_FILE_SIZE property for a table, use the [SHOW PARAMETERS](../sql-reference/sql/show-parameters.md) command:

```sqlexample
SHOW PARAMETERS LIKE 'target_file_size' FOR my_iceberg_table;
```

## Table optimization for Snowflake-managed Iceberg tables

Table optimization automatically performs maintenance to improve the performance and reduce the storage costs of your Snowflake-managed Iceberg tables.

> **Note:**
>
> * Snowflake doesn’t support orphan file deletion for Snowflake-managed Iceberg tables. If you see a mismatch between storage usage for your
>   external cloud storage and Snowflake, you might have orphan files in your external cloud storage. To see your storage usage for Snowflake,
>   you can use the [TABLE_STORAGE_METRICS view](../sql-reference/info-schema/table_storage_metrics.md) or [TABLE_STORAGE_METRICS view](../sql-reference/account-usage/table_storage_metrics.md).
>   If you see a mismatch, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for assistance with determining whether you have orphan files and removing them.
> * To improve query performance, you can also set a target file size. For more information, see Set a target file size.

Snowflake supports the Iceberg table optimization features summarized in the following table:

| Feature | Improves query performance | Reduces storage costs | Notes |
| --- | --- | --- | --- |
| Automatic Clustering [1] | ✔ | ✔ | * Billed. * Disabled by default. |
| Data compaction | ✔ | ✔ | * Billed. * Enabled by default. |
| Manifest compaction | ✔ | ✔ | * No cost. * Enabled automatically; you can’t disable it. |
| Snapshot expiry | ✔ | ✔ | * No cost. * Enabled automatically; you can’t disable it. |

[1] Unlike the other table optimization features, Automatic Clustering is billed separately as a standalone feature.

### Automatic Clustering

Automatic Clustering reorganizes data within files or partitions based on frequently queried columns. The file size for Iceberg tables is
based on your clustering configuration, unless you set a target file size. If you do, the file size is the
specific size you set. For more information, see Set a target file size.

To set Automatic Clustering,
specify the CLUSTER BY parameter when you create a Snowflake-managed Iceberg table or modify an existing table. For more information, see:

* [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md)
* [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md)

For more information about Automatic Clustering, see [Automatic Clustering](tables-auto-reclustering.md).

### Data compaction

Data compaction combines small files into larger, more efficient files to manage storage, maintain an optimal file size, and improve query
performance.

In most cases, data compaction doesn’t have a significant effect on compute costs, but if these costs are a concern, you
can disable compaction. For example, you might want to disable compaction on a table if you rarely query it. To disable or enable data
compaction, see Set data compaction.

> **Note:**
>
> * To query data compaction jobs for Iceberg tables, see [ICEBERG_STORAGE_OPTIMIZATION_HISTORY view](../sql-reference/account-usage/iceberg_storage_optimization_history.md).
>   This view includes the number of credits that are billed for data compaction.
> * If you have [Automatic Clustering](tables-auto-reclustering.md) enabled, clustering performs data compaction on the table. This is
>   true, regardless of whether data compaction is enabled or disabled on the table.
> * You also have the option to set a target file size. For more information, see
>   Set a target file size.

### Manifest compaction

Manifest compaction optimizes the metadata layer by reorganizing and combining smaller manifest files. This compaction reduces metadata
overhead and improves query performance.

This feature is enabled automatically and you can’t disable it.

### Snapshot expiry

Snapshot expiry systematically deletes old snapshots and their unique data and metadata files from the table’s history. This deletion is
based on predefined retention policies.

This feature is enabled automatically and you can’t disable it.

## Maintain tables that use an external catalog

Snowflake doesn’t perform maintenance operations on externally managed Iceberg tables. You must use your own
external Iceberg engine to perform maintenance operations such as:

* Expiring snapshots
* Removing old metadata files
* Compacting data files

> **Important:**
>
> To keep your Iceberg table in sync with external changes, it’s important to align your Snowflake refresh schedule with table maintenance.
> Refresh the table each time you perform a maintenance operation.

To learn about maintenance for Iceberg tables that aren’t managed by Snowflake,
see [Maintenance](https://iceberg.apache.org/docs/latest/maintenance/) in the Apache Iceberg documentation.

## Refresh the table metadata

When you use an external Iceberg catalog, you can refresh the table metadata using the [ALTER ICEBERG TABLE … REFRESH](../sql-reference/sql/alter-iceberg-table-refresh.md) command.
Refreshing the table metadata synchronizes the metadata with the most recent table changes.

> **Note:**
>
> We recommend setting up [automated refresh](tables-iceberg-auto-refresh.md) for supported externally managed tables.

### Refresh the metadata for a table

The following example manually refreshes the metadata for a table that uses an external catalog (for example, AWS Glue or Delta).
Refreshing the table keeps the table in sync with any changes that have occurred in the remote catalog.

With this type of Iceberg table, you don’t specify a metadata file path in the command.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table REFRESH;
```

To keep a table updated automatically, you can set up [automated refresh](tables-iceberg-auto-refresh.md).
Use the [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) command.

For example:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table SET AUTO_REFRESH = TRUE;
```

### Refresh the metadata for a table created from Iceberg files

The following example manually refreshes a table created from *Iceberg metadata files* in an external cloud storage location,
specifying the relative path to a metadata file without the leading forward slash (`/`).
The metadata file defines the data in the table after refreshing.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table REFRESH 'metadata/v1.metadata.json';
```

## Retrieve storage metrics

Snowflake doesn’t bill your account for Snowflake-managed Iceberg table storage costs. However, you can track how much storage a
Snowflake-managed Iceberg table occupies by querying the TABLE_STORAGE_METRICS and TABLES views in the
[Snowflake Information Schema](../sql-reference/info-schema.md) or [Account Usage](../sql-reference/account-usage.md) schema.

The following example query joins the ACCOUNT_USAGE.TABLE_STORAGE_METRICS view with the ACCOUNT_USAGE.TABLES view, filtering on
the TABLES.IS_ICEBERG column.

```sqlexample
SELECT metrics.* FROM
  snowflake.account_usage.table_storage_metrics metrics
  INNER JOIN snowflake.account_usage.tables tables
  ON (
    metrics.id = tables.table_id
    AND metrics.table_schema_id = tables.table_schema_id
    AND metrics.table_catalog_id = tables.table_catalog_id
  )
  WHERE tables.is_iceberg='YES';
```

## Set data compaction

You can set data compaction on Snowflake-managed Iceberg tables when you create a database, schema, or table, or run the ALTER command to
change the setting for an existing database, schema, or table. You can also set data compaction at the account level by using the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md)
command. For more information about data compaction, see Data compaction.

The following example uses ENABLE_DATA_COMPACTION to disable data compaction for a Snowflake-managed table:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (col1 INT)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table'
  ENABLE_DATA_COMPACTION = FALSE;
```

Alternatively, use [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) to disable it for an existing table.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table
  SET ENABLE_DATA_COMPACTION = FALSE;
```

For more information, see:

* [ENABLE_DATA_COMPACTION](../sql-reference/parameters.md)
* [ALTER ACCOUNT](../sql-reference/sql/alter-account.md)
* [CREATE DATABASE](../sql-reference/sql/create-database.md)
* [ALTER DATABASE](../sql-reference/sql/alter-database.md)
* [CREATE SCHEMA](../sql-reference/sql/create-schema.md)
* [ALTER SCHEMA](../sql-reference/sql/alter-schema.md)
* [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md)
* [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md)

## Use default values with Iceberg tables

> **Note:**
>
> For the other Iceberg v3 features that are supported in this preview, see
> [Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](tables-iceberg-v3-specification-support.md).

This preview introduces support for the default values feature for Apache Iceberg™ tables in accordance with the Iceberg v3 specification.

> **Important:**
>
> To use default values with Iceberg tables, the tables must conform to v3 of the Apache Iceberg™ table specification.
> For instructions on how to configure the Iceberg version for tables, see [Configure the default Iceberg version](tables-iceberg-v3-specification-support.md).

This feature lets you to set default values for
existing and new records without having to rewrite existing data files. You can set the following default values for table columns:

* An initial default, which provides a default value for *existing* records when a field is added.
* A write default, which provides a default value for *new* records if the field with the default value isn’t specified during writes.

With this feature, you can evolve schemas while presenting values for historical data and provide a fallback value for future writes.
For more information, see [Default values](https://iceberg.apache.org/spec/#default-values).

You can specify a default value when you create or modify a table:

* To create a table with a default value for a column, use the DEFAULT keyword with your column definition. The value you specify
  is set as both the initial default and write default for the column. You can’t change the initial default for the column.
* To add a column with a default value to a table, use the DEFAULT keyword with the column definition in your ALTER ICEBERG TABLE command.
  The value you specify
  is set as both the initial default and write default for the column. You can’t change the initial default for the column.
* To change the write default for a column, use the WRITE DEFAULT keywords with the ALTER ICEBERG TABLE command.

> **Important:**
>
> When you specify a default value for a column, you must specify a static value; you can’t specify an expression or
> function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default
> and write default.

The following sections include examples of how to specify default values and change the default write value.

### Example: Create a table with a default value

To create an Iceberg table with default values, use the
[CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command.

In the following example, you first set a default value for a column when you create a Snowflake-managed Iceberg table. Next, you insert a
record into the table without specifying a value for the column with the default value.

1. Create a `user_events` table, which includes an `event_version` column with a default value of `2`:

   ```sqlexample
   CREATE ICEBERG TABLE user_events (
       event_id INT,
       user_id INT,
       event_type STRING,
       event_time TIMESTAMP,
       event_version INT DEFAULT 2
     )
     CATALOG = 'SNOWFLAKE'
     EXTERNAL_VOLUME = 'my_external_volume'
     BASE_LOCATION = 'database/schema/user_event'
     ICEBERG_VERSION = 3;
   ```

   Setting a default value in the table definition sets an initial default and a write default. Because the column has a write default,
   the value `2` will be used for new records if the `event_version` isn’t specified during writes.
2. Add a login event with `event_version` specified:

   ```sqlexample
   INSERT INTO user_events VALUES
     (1, 101, 'login', '2025-11-01 10:00:00', 1);
   ```
3. Add a purchase event, but don’t specify an `event_version`:

   ```sqlexample
   INSERT INTO user_events VALUES
   (1, 101, 'purchase', '2025-11-01 10:01:00');
   ```

   As a result, Snowflake inputs the value for `event_version` into the table as `2`.
4. Query the table:

   ```sqlexample
   SELECT * FROM user_events;
   ```

   Output:

   ```output
   +-----------+----------+-------------+---------------------+----------------+
   | event_id  | user_id  | event_type  | event_time          | event_version  |
   +-----------+----------+-------------+---------------------+----------------+
   | 1         | 101      | login       | 2025-11-01 10:00:00 | 1              |
   | 1         | 101      | purchase    | 2025-11-01 10:01:00 | 2              |
   +-----------+----------+-------------+---------------------+----------------+
   ```

### Example: Add a column with a default value to an existing table

To add a new column with a default value to an Iceberg table, use the [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) command.

In the following example, you modify the `user_events` table by adding an `event_version` column, which has a default value of `2`:

```sqlexample
ALTER ICEBERG TABLE user_events ADD COLUMN event_version INT DEFAULT 2;
```

In addition to setting a write default, adding a column with a default value also sets an initial default for the column. As a
result, the default value for existing records for the `event_version` column is `2`.

### Example: Change the write default for a column

The following example changes the write default for the `event_version` column of the `user_events` table to `3`:

```sqlexample
ALTER ICEBERG TABLE user_events ALTER COLUMN event_version SET WRITE DEFAULT 3;
```

### View the default values defined for a table

To view the default value for a table column in a Snowflake-managed or externally managed Iceberg table,
run the [DESCRIBE ICEBERG TABLE](../sql-reference/sql/desc-iceberg-table.md) command, and then view the `DEFAULT` column and `WRITE DEFAULT` column in the output:

* The `DEFAULT` column maps to the `initial-default` value in the Apache Iceberg specification.
* The `WRITE DEFAULT` column maps to the `write-default` value in the Apache Iceberg specification.

These columns return in the output, regardless of whether the table is a v2 Iceberg table or a v3 Iceberg table.

The following example describes the columns for the `user_events` table. This table has an initial default and write default specified for
the `event_version` column:

```sqlexample
DESC ICEBERG TABLE user_events
  ->> SELECT
    "name",
    "kind",
    "default",
    "write default"
      FROM $1;
```

Output:

```output
+-----------------+---------+---------+---------------+
| name            | kind    | default | write default |
+-----------------+---------+-------------------------+
| EVENT_ID        | COLUMN  |         |               |
| USER_ID         | COLUMN  |         |               |
| EVENT_TYPE      | COLUMN  |         |               |
| EVENT_TIME      | COLUMN  |         |               |
| EVENT_VERSION   | COLUMN  | 2       | 3             |
+-----------------+---------+---------+---------------+
```

### Drop the write default

To drop the write default for a column, use the `DROP WRITE DEFAULT` keywords with the ALTER ICEBERG TABLE command.

The following example drops the default write value for the `event_version` column:

```sqlexample
ALTER ICEBERG TABLE user_events ALTER COLUMN event_version DROP WRITE DEFAULT;
```

### Considerations and limitations for default values

Consider the following items when you use default values with Snowflake-managed and externally managed Iceberg tables:

#### Snowflake-managed and externally managed Iceberg tables

* You can’t later add or change an initial default for a column after you create it. Therefore, you need to drop the column and add the
  column by using ALTER TABLE … DROP COLUMN and ALTER TABLE … ADD COLUMN commands.
* The maximum size for a default value is 128|~|MB.
* Default values can’t use data types that can’t be represented as constants, so you can’t use the following data types with a default value:

  + map
  + list
  + struct
  + variant

#### Snowflake-managed Iceberg tables

* The `write-default` value is always initialized to the `initial-default` value. To see the default for both of these values, run the
  DESCRIBE ICEBERG TABLE command, and then view the `WRITE DEFAULT` and `DEFAULT` columns in the output.
* You can’t specify a default value that uses the TIMESTAMP_NTZ(9) or TIMESTAMP_LTZ(9) data type.
* You can only set a default value to an expression, such as `DEFAULT pi()`, when you *create* a table; you can’t set a default value to an
  expression when you *modify* a table by using the ALTER ICEBERG TABLE command.
* Sequences aren’t supported.

  For example, the following CREATE ICEBERG TABLE command fails because it includes `LOG_ID NUMBER(38,0) NOT NULL autoincrement order`:

  > ```sqlexample
  > CREATE OR REPLACE ICEBERG TABLE CDC_RUN_LOG (
  >     LOG_ID NUMBER(38,0) NOT NULL autoincrement order,
  >     ENTITY_NAME VARCHAR(100),
  >     LAST_RUN TIMESTAMP_NTZ(9),
  >     DAG_NAME VARCHAR(100)
  >     )
  >     CATALOG = 'SNOWFLAKE'
  >     EXTERNAL_VOLUME = 'my_external_volume'
  >     BASE_LOCATION = 'my_iceberg_table';
  >     COMMENT='CDC table to manage log of runs'
  >     ICEBERG_VERSION = 3;
  > ```

#### Externally managed Iceberg tables

* You can’t specify a default value that uses the TIMESTAMP_NTZ(9) or TIMESTAMP_LTZ(9) data type.

These considerations and limitations apply to default values, which are features of Iceberg v3. For a list of considerations and limitations that apply
to all Iceberg v3 tables, see [Considerations and limitations for Iceberg v3 features](tables-iceberg-v3-specification-support.md).

## Use row lineage with Iceberg tables

> **Note:**
>
> For the other Iceberg v3 features that are supported in this preview, see
> [Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](tables-iceberg-v3-specification-support.md).

This preview introduces support for the row lineage feature for Apache Iceberg™ tables. With this feature,
the following columns are automatically written by Snowflake to an Iceberg table:

* `_row_id`
* `_last_updated_sequence_number`

This feature lets
query engines to reliably match the same row across snapshots and detect row-level changes. For more information,
see [Row lineage](https://iceberg.apache.org/spec/#row-lineage).

This feature is supported with both Snowflake-managed
and externally managed Iceberg tables.

> **Important:**
>
> To use row lineage with Iceberg tables, the tables must conform to v3 of the Apache Iceberg™ table specification.
> For instructions on how to configure the Iceberg version for tables, see [Configure the default Iceberg version](tables-iceberg-v3-specification-support.md).

### Considerations and limitations for row lineage

Row lineage is supported in streams with the following considerations:

* Append-only streams and standard streams are supported on Snowflake-managed Iceberg v3 tables.
* Insert-only streams and standard streams are supported on externally managed Iceberg v3 tables.

  + To have standard streams produce the correct results, the external engine must write to Iceberg v3 tables with respect to the Iceberg v3
    specification. Specifically, newly inserted rows should have `_row_id=NULL`. Rows that are copied during copy-on-write should maintain the `_row_id`.
  + MAX_DATA_EXTENSION_TIME_IN_DAYS doesn’t work on externally managed Iceberg v3 tables.
* When DMLs are committed over multi-statement transactions, append-only streams on Iceberg v3 tables have different semantics compared to Iceberg v2 tables:

  + On Iceberg v2, for append-only streams, if a row is added and then deleted in a multi-statement transaction, this row is considered an
    insertion.
  + On Iceberg v3, for append-only streams, this row isn’t treated as an insertion.

These considerations and limitations apply to row lineage, which is a feature from Iceberg v3. For a list of considerations and limitations
that apply to all Iceberg v3 tables, see [Considerations and limitations for Iceberg v3 features](tables-iceberg-v3-specification-support.md).

## Migrate an Iceberg table to Azure Data Lake Storage

This section shows you how to migrate an existing Iceberg table from Blob Storage to Data Lake Storage in Azure.

> **Note:**
>
> If you haven’t created your table yet, you can simply configure an external volume that uses Data Lake Storage and then create your table
> in Data Lake Storage.

You might want to perform
this migration so that the table is interoperable with remote catalogs that are only configured to use Data Lake Storage
in Azure. For more information, see [Enable interoperability with remote catalogs that use Data Lake Storage](tables-iceberg-configure-external-volume-azure.md).

To migrate an Iceberg table to Azure Data Lake Storage, follow these steps:

1. Configure a new external volume that is connected to Data Lake Storage.

   [Preview feature](../release-notes/preview-features.md) — Open

   Available to all accounts. Configuring an external volume that is connected to Data Lake Storage is in public preview.

   To configure this external volume, for the STORAGE_BASE_URL parameter, specify a URL that points to the `dfs.core.windows.net` endpoint. For more information, see
   [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md).

   ```sqlexample
   CREATE EXTERNAL VOLUME exvoldfs
     STORAGE_LOCATIONS =
       (
         (
           NAME = 'my-azure-northeurope'
           STORAGE_PROVIDER = 'AZURE'
           STORAGE_BASE_URL = 'azure://exampleacct.dfs.core.windows.net/my_container_northeurope/'
           AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
         )
       );
   ```
2. Create an external stage for loading data from your existing table in Blob Storage.

   To create this external stage, for the URL parameter, specify the base location for your table that is stored in Blob Storage, such as
   `azure://myaccount.blob.core.windows.net/container/my_iceberg_table.<randomId>`. For more information, see [CREATE STAGE](../sql-reference/sql/create-stage.md).
3. Recreate your table in Data Lake Storage by creating a new table in Data Lake Storage and loading this table with the data from your table
   in Blob Storage. For examples, see the following sections:

   * [Example: Load Iceberg-compatible Parquet files](tables-iceberg-load.md)
   * [Example: Load Iceberg-compatible Parquet files into the table created with INFER_SCHEMA function](tables-iceberg-load.md)
   > **Important:**
   > * When you create your table in Data Lake Storage, you must specify the name of your external volume that is connected to Data Lake Storage.
   >   To specify this external volume, use the EXTERNAL_VOLUME parameter of your CREATE ICEBERG
   >   TABLE statement.
   > * When you load data from your table in Blob Storage into your Iceberg table in Data lake Storage, you must
   >   specify the name of your external stage that references the data files for your table in Blob Storage. To specify this external
   >   stage, use the FROM … parameter of your COPY INTO statement.

---
title: Manage data listings
source: https://docs.snowflake.com/en/user-guide/data-exchange-managing-data-listings.md
section: User Guide
---

# Manage data listings

## Considerations for creating a listing

> **Note:**
>
> These considerations also apply for creating a listing in a remote region.

* Since the data is shared between different accounts, data consumers should be able to use shared data objects without using double-quoted identifiers (see [Identifier requirements](../sql-reference/identifiers-syntax.md)). As a result, object identifiers for tables, columns, and share names must be upper case and use only alphanumeric characters.
* To ensure that your sensitive data in a shared database is not exposed to users in consumer accounts, see [Use secure objects to control data access](data-sharing-secure-views.md).
* Shares that are currently shared with a consumer account (i.e. via a direct share) can be added to a listing. Consumers must accept the listing terms in a Data Exchange web interface before they can create a database from the share.
* Only the role that created the share can attach the share to a listing.
* A share can only be attached to one listing. If a share has already been attached to a listing, it cannot be attached to another listing, even if the listing has been deleted.
* Before a new or modified free listing can be published, all sample queries are auto-validated to ensure that referenced objects are added to the share and the queries can be run successfully.
* The data must be legally shareable (i.e. the provider must own the data or have the right to share it).

  > > **Note:**
  > >
  > > To the extent any data in your data listing or data set is governed by any laws or contractual obligations, you must ensure that you have the legal and contractual rights to share such data. For example, you can only share protected health information (PHI) through a personalized data share and, to do so, you must: (1) have signed a business associate agreement (BAA) with Snowflake and the Consumer receiving the PHI, and; (2) ensure that the Consumer has also signed a BAA with Snowflake. Also, while you can share personal data through a data share, to do so you must have the applicable legal and contractual rights if the data is not publicly available.

## Considerations for creating a listing in a remote region and replicating data

* When you publish a listing, consumers will see your listing in all selected regions.
* While listings are automatically replicated, the data is not.
* For free listings, you must replicate data to each of the selected regions before publishing the listing.
* For personalized listings, you can replicate data upon consumer’s request.
* Make sure to allocate time to set up replication and understand costs involved.
* To share data in a region, you must have an account in that region in order to replicate data. If you have more than one account, all accounts must belong to the same organization.
* When you publish a listing in a remote region, you can either allow all accounts in your organization to fulfill listing requests or explicitly add individual accounts as providers. Only the listing owner can specify who can fulfill listing requests.
* Cross region data sharing utilizes Snowflake data replication functionality, for more information, see [Share data securely across regions and cloud platforms](secure-data-sharing-across-regions-platforms.md).
* You do not need to replicate the data to each region until a consumer requests it.
* For free listings, you have an option to pre-associate a share with the listing in a remote region. This will allow consumers to get the share instantly without submitting a request.
* To see a list of shares attached to a listing in a remote region, you must log in to the remote account from which you attached the share to the listing.

## Data listing fields

The following table describes parameters required for creating and configuring a data listing in the Data Exchange.

| Section | Field Name | Description | Example |
| --- | --- | --- | --- |
| **Basic Information** | **Listing Type** | See [Types of Listings](../collaboration/collaboration-listings-about.md). | Available Values: Free, Personalized |
|  | **Profile** | The name of the provider profile that owns the share. You must create a provider profile before you can publish a listing. |  |
|  | **Title** | Title of the data listing. The title cannot exceed 110 characters. | Historical Weather by Postcode. |
|  | **Subtitle** | Subtitle of the data listing. The subtitle cannot exceed 110 characters. Title and subtitle should not be redundant. | Historical Weather Data by Location. |
|  | **Data Update Frequency** | How often the data is updated. | Available values: Near real-time, Daily, Weekly, Monthly, Quarterly, Annually, Never (Static Data). |
|  | **Category** | Data listings are categorized for easy discovery. |  |
|  | **Terms of Service** | A link to the listing terms hosted on the provider’s website. Consumers accept the terms before they can access the data. Listing terms are required for free listings, and are optional for personalized listings. | `https://www.example.com/en/legal` |
| **Details** | **Description** | Description of the shared dataset. The description must include: . (a) Scale of data . (b) Description of tables/views . (c) Whether the dataset is a sample . (d) Where to find data dictionaries. | ACME is the number one supplier of customized, pinpoint weather warnings to large enterprises, as well as a vital information source for worldwide weather forecasts, data and meteorological consulting services. This data is historical weather data for US zip codes that can be used to further enhance your existing data to provide deeper analytics. |
|  | **Link to Documentation** | A link to a page on provider’s website with more detailed documentation. Documentation must be clear and reference the right schema objects present in the Snowflake share. It cannot be just standard documentation. | `https://developer.example.com` |
| **Data** | **Database Objects or Secure Share** | Select data you wish to share. This section is only available for free data listings. |  |
| **Business Needs** | **Business Need** | Data listings are grouped by business needs for easy discovery. . - You can select up to six business needs for your listing. If you do not see a relevant business need in the drop-down list, you can create a custom one. . - Consumers can easily discover listings based on business needs available in the drop-down list. However, custom business needs you add are not included, and are only visible in your listing details. |  |
|  | **Description** | Description of how your data or data service addresses the business need. |  |
| **Sample SQL Query** | **Title** | Descriptive title for the query to help consumers understand the data. You can add more than one example. |  |
|  | **Description (Optional)** | Description of the example with additional instructions, e.g. name of the schema, sample tables, fields, use cases. |  |
|  | **SQL Query** | Test sample queries against the database you use to create the share. Snowflake auto-validates the queries to ensure that all referenced objects are added to the share and the queries run successfully. If the validation fails, an error message with a reason is displayed. You can see an exclamation sign next to each query that failed. |  |
| **Region Availability** | **All available regions** or **Specific Regions** | Regions where your listing will be visible. You will need to replicate the data to these regions. You can edit the list of available regions at any time without resubmitting it for administrator’s approval. If you remove a region that was previously available, consumers in that region will no longer be able to see the listing. |  |

## Create and publish a data listing

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select the Share Data drop-down list and select a data exchange.
4. In the New Listing dialog box, enter the listing title as it appears to the consumers and select the listing type. For more information about listing type, see [Types of Listings](../collaboration/collaboration-listings-about.md).
5. Complete each of the sections for the new listing. You can save the draft at any time to finish it later. For a description of each section and related fields, see [Configure listings](../collaboration/provider-listings-reference.md).

   For a free listing, to associate a share with the listing, when editing the Data section:

   > > **Note:**
   > >
   > > Until a listing is published, it can only be associated with a share in the local/primary account. After the listing is published, it can be associated with a share in additional regions that you have selected.

   1. Select Select Data.
   2. If a secure share exists, navigate to the share and select it. If a share does not exist, navigate to the desired database and select the database objects you wish to add to the share.

      > **Note:**
      >
      > If you do not see a share, it is either already attached to another listing, or has been previously shared with consumers.
   3. Select Done.
   4. (Optional) You can change the default name for the secure share.
   5. Select Save.
6. Once you complete all of the sections, select Publish to publish the listing to the selected regions.

   The Publish button is not activated if:

   * Any of the provided sample SQL queries fail validation. For more information, see Data listing fields.
   * You are not the share owner.

## View personalized listing requests

> > **Note:**
> >
> > Email notifications are sent to providers to notify them of data requests. You can change the request notification email for a specific listing in the Settings tab.

1. In the navigation menu, select Data sharing » External sharing.
2. Select the Requests tab. Use the filtering drop down list to view requests by status.

## Approve consumer requests for data listings in a remote region

> **Note:**
>
> * For **personalized** listings, data is not automatically available in remote regions. The provider is responsible for replicating their data to each of these regions.
> * For **free** listings, you have an option to pre-associate a share with the listing in a remote region. This allows consumers to get the share instantly without submitting a request. You can also replicate data and attach a share to a listing after receiving a request from the first consumer in a region. Once the listing is attached to the share, all consumers in that region can access the share instantly.
> * You can specify whether a listing can be fulfilled by a select provider account(s) or by any account in the organization.
> * If the consumer is in a different region, before attaching a share, you must set up replication of data to the account in each remote region. For more information, see [Share data securely across regions and cloud platforms](secure-data-sharing-across-regions-platforms.md).

1. In the navigation menu, select Data sharing » External sharing.
2. Select the Requests tab.
3. Select Review next to the listing name.
4. In the Associate Secure Share section, select an account where you wish to create the share.
5. Select the role that owns the share and the shared database objects (or has the necessary privileges on the database objects to be able to add them to a share).
6. select Select Data.
7. If a secure share exists, navigate to the share and select it. If a share does not exist, navigate to the desired database and select the database objects you wish to add to the share.

   > **Note:**
   >
   > If you do not see a share, it is either already attached to another listing, or has been previously shared with consumers.
8. Select Done.
9. (Optional) Change the default name for the secure share.
10. Select Fulfill Request.

    > > **Tip:**
    > >
    > > If you receive an error when fulfilling a request for a remote region, consider the following:
    > >
    > > * Has the remote account been added to the Marketplace as a provider?
    > > * Is the remote account part of the same organization as the account you published the listing from?
    > > * Did you create a new share using the ACCOUNTADMIN role?
    > > * Have you added other consumers to the share you are trying to attach?

## View fulfilled listing requests

Providers who fulfill free or personalized listing requests using [Snowsight](ui-snowsight-gs.md) can view records of consumers added to the
share. To view the records, in the navigation menu, select Data sharing » Internal sharing, and select the Shared by your account tab.

These records are also available in the [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md).

## Edit a data listing

When you publish a new version of the listing, it overwrites the previously published listing. If you remove a region that was previously available, consumers in that region will no longer have access to the shared dataset.

1. Sign in to [Snowsight](ui-snowsight-gs.md) as an ACCOUNTADMIN.
2. In the navigation menu, select Data sharing » External sharing » Shared by your account.
3. Click the name of the listing you wish to update.
4. Next to the listing title, click New Draft.
5. Click Edit for the section you wish to update.
6. Click Publish.

## Unpublish a data listing

When you unpublish a data listing, existing consumers can still access the data share unless you remove them from the share. New consumers cannot see it.

1. Sign in to [Snowsight](ui-snowsight-gs.md) as an ACCOUNTADMIN.
2. In the navigation menu, select Data sharing » External sharing » Shared by your account.
3. Select the name of the listing you wish to unpublish.
4. In the top-right corner, from the Live drop-down list, select Unpublish.

## Republish a data listing

1. Sign in to [Snowsight](ui-snowsight-gs.md) as an ACCOUNTADMIN.
2. In the navigation menu, select Data sharing » External sharing » Shared by your account.
3. Select the name of the listing you wish to republish.
4. In the top-right corner, from the drop-down list select Re-publish.
5. Select Re-publish to republish the listing.

## Remove a data listing

See [Removing listings as a provider](https://other-docs.snowflake.com/en/collaboration/provider-listings-removing).

## Update a data share

You can update a data share using Snowsight.
Keep in mind that each time you modify a data listing, you must notify the consumers to ensure that you do not break their processes.
Examples of breaking changes include:

* Adding/removing a column.
* Renaming objects.
* Removing objects.

---
title: Manage directory tables
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-manage.md
section: User Guide
---

# Manage directory tables

This topic provides instructions for creating and managing external or internal stages with directory tables.

## Create a stage with a directory table

This section provides instructions for creating stages (using [CREATE STAGE](../sql-reference/sql/create-stage.md)) that layer a directory table to store
metadata about the staged files.

Directory tables on internal stages require manual metadata refreshes.
You could also choose to include a directory table on external stages and refresh the metadata manually. For information about automated metadata refreshes, see
automated metadata refreshes.

The syntax for creating a stage with a directory table is nearly identical to creating a standard external or internal stage. Set the
optional DIRECTORY parameter to TRUE.

For the complete syntax and parameter descriptions, see [CREATE STAGE](../sql-reference/sql/create-stage.md). To add a directory table to an existing stage, use the [ALTER STAGE … SET DIRECTORY](../sql-reference/sql/alter-stage.md) command.

> **Note:**
>
> After you create a stage with a directory table, you must execute ALTER STAGE … REFRESH to manually refresh the directory table
> metadata.

### Examples

Create an internal stage named `mystage` that includes a directory table. The stage references a file format named `myformat`:

> ```sqlexample
> CREATE STAGE mystage
>   DIRECTORY = (ENABLE = TRUE)
>   FILE_FORMAT = myformat;
> ```

Create an external stage named `mystage` that includes a directory table. The stage references a bucket or container named `load`
with a path of `files`. Secure access to the cloud storage location is provided via the `my_storage_int` storage integration:

> **Note:**
>
> The storage location in the URL value must end in a forward slash (`/`).

**Amazon S3**

```sqlexample
CREATE STAGE mystage
  URL='s3://load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (ENABLE = TRUE);
```

**Google Cloud Storage**

```sqlexample
CREATE STAGE mystage
  URL='gcs://load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (ENABLE = TRUE);
```

**Microsoft Azure**

```sqlexample
CREATE STAGE mystage
  URL='azure://myaccount.blob.core.windows.net/load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (ENABLE = TRUE);
```

## Refresh directory table metadata

### Automated refresh

You can automatically refresh the metadata for a directory table by using the event messaging service for your cloud storage service.
To configure automated refreshes, see [Automated directory table metadata refreshes](data-load-dirtables-auto.md).

### Manual refresh

> **Note:**
>
> * Manual refreshes on an external stage block simultaneous automated refreshes.
>   Automated refreshes resume after the manual refresh completes.
> * Manual refreshes perform a list operation on a stage, and can be slow or expensive for large or fast-growing stages.
>   Instead, use event-based [automated refreshes](data-load-dirtables-auto.md).

To manually refresh the metadata in a directory table, use the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command.

For best performance, use a selective `SUBPATH` with [ALTER STAGE](../sql-reference/sql/alter-stage.md).
Doing so reduces the number of files that need to be listed and checked. To learn about organizing your data by path,
see [best practices for staging your data files](data-load-considerations-stage.md).

For example:

```sqlexample
ALTER STAGE my_stage REFRESH SUBPATH = '2024/01/31';
```

The command returns the following columns:

| Column | Description |
| --- | --- |
| `file` | Name of the staged source file and relative path to the file. |
| `status` | Status: REGISTERED_NEW, REGISTERED_UPDATE, REGISTER_SKIPPED, REGISTER_FAILED, UNREGISTERED, or UNREGISTER_FAILED. |
| `description` | Detailed description of the file registration status. |

---
title: Manage offers in Snowsight
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/consumers-manage-offers.md
section: User Guide
---

# Manage offers in Snowsight

## View and accept an offer

Consumers can view offers in the following ways:

* For offers on the Data sharing » External sharing page, the Shared with you tab shows offers that have been
  shared privately with you outside of Snowflake Marketplace.
* For offers on Snowflake Marketplace, you can access an offer by doing the following:

  + Copy and paste the URL that the provider shared with you into the search bar.
  + Search for or browse to the listing you want to access that includes the offer.

To accept an offer, follow these steps:

1. Navigate to the listing you want to access that includes the offer.
2. Select Accept offer.

   This opens the Review and accept offer page.
3. Review the offer details, including the pricing plan, contract duration, and terms of service.
4. Select Accept offer to confirm and accept the offer.

After you accept an offer start, that offer’s access start date controls when you can start using the product, and its first invoice date controls when you’re billed the first time.

## View and accept a private offer with flat-fee pricing

The following steps describe the checkout experience for consumers who accept a private, flat-fee offer.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. On the Shared with you tab, select an offer with flat-fee pricing.
4. To review the offer, select View offer details.
5. To begin the checkout process, select Accept offer.

   This opens the Complete your purchase page, which provides the following information:

   * Product details: Review the provider and listing information.
   * PO number: If desired, add a purchase order number that identifies this purchase for your records.
   * MCD balance: If you’re enrolled in the [Marketplace Capacity Drawdown (MCD)](../../../../collaboration/marketplace-capacity-drawdown.md) program, review whether you have MCD funds available. By default, any MCD funds will be applied to this purchase.

     > **Note:**
     >
     > There can be a delay of up to 2 hours for the MCD balance to update. If you purchase multiple listings in a short amount of time and use MCD funds, this value might not be the most up-to-date value.
   * Pricing details: Review the total contact value, including applicable taxes based on today’s date.

     > **Note:**
     >
     > Sales tax information is only available for consumers in the U.S. and Canada.
   * Payment schedule: Review the payment schedule, including the due date and any amount that is due now.

     The values shown in the Payment schedule don’t include sales tax.
   * Payment methods: Review and select a payment method.

     You must select a payment method to complete the purchase. You can select to pay by credit card or be invoiced for the amount due. To add a new payment method, select Add new payment method and add new credit card information.
   * Billing notifications: Review your primary billing contact email address. If you have the BILLING ADMIN role, you can add recipients from the Billing contacts page. For more information, see [Update billing contact information](../../../billing-contacts.md).
   * Terms of service: Select that you’ve read the terms of service, privacy policy, and privacy notice.

     You must select all checkboxes to complete the purchase.
   * Payment summary: Review the payment summary, which includes the amount due today along with any applicable sales taxes.

     > **Note:**
     >
     > Sales tax information is only available for consumers in the U.S. and Canada. For other locales, Snowflake doesn’t calculate tax, but you may still be liable for taxes.
6. Select Complete purchase to purchase the listing.

## Update a paid listing payment method

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. In the right pane, select a paid listing.
4. Select Manage Purchase.
5. Select Update payment method.
6. Select Add payment method, complete the mandatory fields, and then click Add.
7. Select Return to Snowflake Inc. (Snowflake Marketplace).

## View paid listing invoices

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. In the right pane, select a paid listing.
4. Select Manage Purchase.
5. Select View Invoices on Stripe.
6. Select an invoice in the INVOICE HISTORY list.
7. Select Return to Snowflake Inc. (Snowflake Marketplace).

## Add a purchase order to the next generated invoice

> **Note:**
>
> You can’t add a purchase order to a historical invoice.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. In the right pane, select a paid listing.
4. Select Manage Purchase.
5. Select Edit next to PO Number.
6. Enter the purchase order number and click Save list.
7. Select X (Close) to close the dialog.

## Cancel a paid listing purchase

When you cancel a paid listing purchase, auto-renewal stops, but the cancellation is not immediate, and you do not receive a refund. To request the cancellation of a paid listing, or request a refund, contact [Snowflake Marketplace Operations](https://snowforce.my.site.com/s/consumer-reporting).

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. In the right pane, select a paid listing.
4. Select Manage Purchase.
5. Select Cancel Purchase.
6. Select Cancel Purchase to confirm purchase cancellation.

---
title: Manage organizational listings
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-manage.md
section: User Guide
---

# Manage organizational listings

You can alter a listing to add, change, or remove the settings of the organizational listing,
such as the title, ULL, target accounts or roles, auto-fulfillment, and more.

## View available organizational listings

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Browse the available data products or use the search bar to find a specific listing.

Use [SHOW AVAILABLE LISTINGS](../../../../sql-reference/sql/show-available-listings.md) to find listings in your organization which are available to
you.

```sqlexample
SHOW AVAILABLE LISTINGS
  IS_ORGANIZATION = TRUE;
```

Use `SHOW LISTINGS` to find listings on which you are granted USAGE, MODIFY, or OWNERSHIP.

```sqlexample
SHOW LISTINGS;
```

## Edit an organizational listing

> **Note:**
>
> To avoid overwriting the existing settings of an organizational listing, you must include the existing manifest (`manifest_yaml`) when you make changes.
> Use [DESCRIBE LISTING](../../../../sql-reference/sql/desc-listing.md) to view the current settings.
>
> You can’t change the [Uniform Listing Locator(ULL)](org-listing-configure.md) or remove the data product after the listing has been published.

SnowsightSQL

1. Open the listing:

   1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
   2. In the navigation menu, select Data sharing » Internal sharing.
   3. On the Listings tab, select the listing you want to edit.

      * To refine your search, select Status and choose a status, such as Draft or Live.
      * You can sort the result set by any column.
2. Edit the listing:

   1. To edit the listing title, select the title. The Edit listing title dialog appears.
   2. To edit other metadata on the listing page, select the Edit button near the item you want to change.
   3. To edit the data product information, select the Data Product icon. You can change the description of the
      data product or change the table or view selections.

In the following example, the organization target and the locations of an organizational listing named `my-org-listing1` are changed.
The ALTER statement includes the existing listing manifest, captured with the [DESCRIBE LISTING](../../../../sql-reference/sql/desc-listing.md) command.

> **Note:**
>
> You must have the OWNERSHIP privilege or have been granted the MODIFY privilege on the listing to alter it. You can grant modify privileges to other roles using the following command:
>
> ```sqlexample
> grant modify on data exchange listing <listing_name> to role <role_name>
> ```

```sqlexample-yaml
USE ROLE <organizational_listing_role>;

ALTER LISTING my-org-listing1
AS
$$
title: "My title"
description: "One region, all accounts"
organization_profile: INTERNAL
organization_targets:
  access:
  - account: "<account_name>"
    roles:
    - "<role>"
locations:
  access_regions:
  - name: "PUBLIC.<snowflake_region>"
$$;
```

This example manifest targets all accounts in one Snowflake region.

```yaml
title: "My title"
description: "One region, all accounts"
organization_profile: INTERNAL
organization_targets:
  access:
  - account: "<account_name>"
    roles:
    - "<role>"
locations:
  access_regions:
  - name: "PUBLIC.<snowflake_region>"
```

This example manifest targets two accounts, with two roles each, in one Snowflake region.

```yaml
title: "My title"
description: "One region, two accounts, four roles"
organization_profile: INTERNAL
organization_targets:
  access:
  - account: "<account_name>"
    roles:
    - "<role>"
    - "<role>"
  - account: "<account_name>"
    roles:
    - "<role>"
    - "<role>"
locations:
  access_regions:
  - name: "PUBLIC.<snowflake_region>"
```

This example manifest targets all accounts in three Snowflake regions.

```yaml
title : 'My title'
description: "Three region, all accounts"
organization_profile: INTERNAL
organization_targets:
  access:
  - all_accounts : true
locations:
  access_regions:
  - names:
  "PUBLIC.<snowflake_region>"
  "PUBLIC.<snowflake_region>"
  "PUBLIC.<snowflake_region>"
auto_fulfillment:
  refresh_type: "SUB_DATABASE"
  refresh_schedule: "10 MINUTE"
```

This example manifest targets all accounts in all regions.

```yaml
title : "My title"
description: "Three region, all accounts"
organization_profile: INTERNAL
organization_targets:
  access:
  - all_accounts : true
locations:
  access_regions:
  - names: "ALL"
auto_fulfillment:
  refresh_type: "SUB_DATABASE"
  refresh_schedule: "10 MINUTE"
```

For a complete list of all fields and values for an Organization listing see [Organization listing manifest reference](org-listing-manifest-reference.md).

## Remove a listing from Internal Marketplace

To remove a listing from the Internal Marketplace, you must change its status.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. On the Listings tab, select the listing you want to remove from the Internal Marketplace.
4. Select the listing tile to open the listing page.
5. To unpublish the listing, select  » Unpublish.

```sqlexample
ALTER LISTING <organizational_listing_name> UNPUBLISH;
```

## Delete a listing

You must unpublish a listing before it can be deleted.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. On the Listings tab, select the unpublished listing you want to delete.
4. Select the listing tile to open the listing page.
5. To delete a listing, select the ⋮ icon. From the list that appears, select Delete.

To delete a listing, run the following command:

```sqlexample
DROP LISTING <organizational_listing_name>;
```

---
title: Manage private connectivity endpoints for Snowflake Open Catalog: AWS
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-outbound-manage-endpoints-aws.md
section: User Guide
---

# Manage private connectivity endpoints for Snowflake Open Catalog: AWS

When the data for your catalogs in your Snowflake Open Catalog account is stored in Amazon Simple Storage Service (Amazon S3) storage buckets,
follow these steps to set up private connectivity for outbound network traffic.

To enable private connectivity for your Open Catalog *account*, you typically only need to complete the
setup steps in this topic one time. After that, you enable outbound private
connectivity for each *catalog* in your Open Catalog account.

For example, if you completed the setup steps and then later create a new `catalog1` catalog, you typically only need to enable outbound private
connectivity for `catalog1`. For instructions on how to enable private connectivity for a catalog, see
Enable outbound private connectivity for a catalog.
However, if `catalog1` uses a storage bucket for which you haven’t updated
its bucket policy, you also need to update the bucket policy for the bucket.
When you update a bucket policy, you restrict network access for the bucket to a private connectivity endpoint.

## Prerequisites

* Your Open Catalog account and external cloud storage must both be hosted in the same AWS region.
* You need the IAM permissions in AWS that allow you to modify the bucket policy for your AWS storage bucket where your Iceberg tables
  are stored. For details, see [Bucket policies for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/bucket-policies.html)
  in the AWS documentation.
* Your third-party query engine or Snowflake engine must have access to your storage bucket through AWS PrivateLink or S3 Gateway
  Endpoint. For details, see [Configure an interface endpoint](https://docs.aws.amazon.com/vpc/latest/privatelink/interface-endpoints.html)
  in the AWS documentation. Otherwise, when you enable outbound private connectivity, the engine can’t read or write to tables stored in
  the bucket, but Open Catalog can read or write metadata to the bucket.

## Set up private connectivity for your account

Follow these steps to set up private connectivity for your Open Catalog account.

### Step 1: Create a Snowflake CLI connection for Open Catalog

To set up private connectivity in Open Catalog, you need a Snowflake CLI connection for Open Catalog. Follow these steps to create this
connection. If you don’t already have Snowflake CLI installed, see [Installing Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation).

#### Before you begin

To create a Snowflake CLI connection for Open Catalog, you need your full Open Catalog account identifier. The account identifier includes your Snowflake
organization name and your Open Catalog account name; for example, `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

> **Important:**
>
> To create this connection, you must be an Open Catalog user with service admin privileges. For information about service admin privileges, see
> [Service admin role](https://other-docs.snowflake.com/opencatalog/access-control.html#service-admin-role).

#### Add a Snowflake CLI connection for Snowflake Open Catalog

Add a connection for the Snowflake Open Catalog account where you want to enable private connectivity.

* [Add a connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md)
  with the following values. For all other parameters, press `Enter` to skip specifying a value for the parameter.

  | Connection configuration parameters | Value |
  | --- | --- |
  | **Name for this connection** | Specify a name for the connection; for example, `myopencatalogconnection`. |
  | **Account name** | Specify your Snowflake organization name, followed by your Open Catalog account name, in this format:  `<orgname>-<my-snowflake-open-catalog-account-name>`.  For example, `ABCDEFG-MYACCOUNT1`.  To find these names, see Before you begin. |
  | **Username** | Specify your username for Open Catalog; for example, `jsmith`. |
  | **Password [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  Enter your password for Open Catalog; for example, `MyPassword123456789`. |
  | **Role for the connection [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  You must enter `POLARIS_ACCOUNT_ADMIN` |

#### Test the Snowflake CLI connection

* To test your CLI connection, follow this example, which tests the connection for `myopencatalogconnection`:

  ```snowcli
  snow connection test -c myopencatalogconnection
  ```

  The response should look like this:

  ```snowcli
  +------------------------------------------------------------------------------+
  | key              | value                                                     |
  |----------------------------+-------------------------------------------------|
  | Connection name  | myopencatalogconnection                                   |
  | Status           | OK                                                        |
  | Host             | ABCDEFG-MYACCOUNT1.snowflakecomputing.com                 |
  | Account          | ABCDEFG-MYACCOUNT1                                        |
  | User             | jsmith                                                    |
  | Role             | POLARIS_ACCOUNT_ADMIN                                     |
  | Database         | not set                                                   |
  | Warehouse        | not set                                                   |
  +------------------------------------------------------------------------------+
  ```

#### Set your Snowflake CLI connection for Snowflake Open Catalog as the default

To ensure that the connection you’re using always has the required POLARIS_ACCOUNT_ADMIN role granted to it, you can set the Snowflake CLI
connection you created for Open Catalog as the default connection. For more information about the default connection, see
[Set the default connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. Follow this example, which sets the `myopencatalogconnection` connection as the default:

   ```snowcli
   snow connection set-default myopencatalogconnection
   ```
2. To confirm that you’re using the correct user and role, run the following:

   ```snowcli
   snow sql -q "Select current_user(); select current_role();"
   ```

   The response should return your Open Catalog username and the CURRENT
   ROLE should be POLARIS_ACCOUNT_ADMIN.

   ```snowcli
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | JSMITH        |
   +----------------+
   select current_role();
   +-----------------------+
   | CURRENT_ROLE()        |
   |-----------------------|
   | POLARIS_ACCOUNT_ADMIN |
   +-----------------------+
   ```

### Step 2: Provision a private connectivity endpoint

Use your Snowflake CLI connection for Open Catalog to call the following system functions:

1. To provision a private connectivity endpoint for your storage buckets, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](https://docs.snowflake.com/en/sql-reference/functions/system_provision_privatelink_endpoint) system function.
2. To confirm that the private connectivity endpoint is ready to use, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_endpoints_info) system function.

For instructions, see [Provision private connectivity endpoints](https://docs.snowflake.com/en/user-guide/private-manage-endpoints-aws#provision-private-connectivity-endpoints) in the Snowflake documentation. Remember that the instructions
refer to a Snowflake account instead of a Snowflake Open Catalog account, but the process is the same in Open Catalog.

> **Important:**
>
> * You must use the POLARIS_ACCOUNT_ADMIN role instead of the ACCOUNTADMIN role mentioned in the instructions.
> * If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
>   following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN;`
> * With your command, you must insert a forward slash immediately before `$` to escape it. For example, `snow sql -q "SELECT SYSTEM\$GET_PRIVATELINK_CONFIG();"`.

> **Note:**
>
> You can use this private connectivity endpoint that you provision to grant access to all storage buckets located in the same AWS region
> where your Open Catalog account is hosted; you can’t use it to grant access to a bucket located in a different region.

#### Example: Provision a private connectivity endpoint

The following example creates a PrivateLink with external access to Amazon S3 to configure an endpoint for the `us-west-2` region:

```sqlsyntax
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'com.amazonaws.us-west-2.s3',
  '*.s3.us-west-2.amazonaws.com'
);
```

### Step 3: Update your bucket policy

To restrict network access to your storage bucket to the private connectivity endpoint you created in the previous step, in AWS, update
the bucket policy for your storage bucket. For instructions, see [Restricting access to a specific VPCendpoint](https://docs.aws.amazon.com/AmazonS3/latest/userguide/example-bucket-policies-vpc-endpoint.html#example-bucket-policies-restrict-accesss-vpc-endpoint)
in the AWS documentation. For `<vpce-id>` in the bucket policy, specify the ID of the private connectivity endpoint you created in the previous
step.

If needed, repeat this step for any additional buckets you want to connect to Open Catalog.

> **Important:**
>
> Ensure your bucket policy includes privileges that allow you to access
> the bucket and bucket policy from the browser after you add it.
> Otherwise, after you update your bucket policy, you won’t be able to
> access the bucket or bucket policy from the browser.
>
> This example bucket policy allows you to access the bucket and bucket policy from the browser:
>
> ```sqljson
> {
>     "Version": "2012-10-17",
>     "Id": "Policy1234567890123",
>     "Statement": [
>         {
>             "Sid": "Deny public access",
>             "Effect": "Deny",
>             "Principal": "*",
>             "Action": [
>                 "s3:GetObject",
>                 "s3:GetObjectVersion",
>                 "s3:PutObject",
>                 "s3:DeleteObject",
>                 "s3:DeleteObjectVersion",
>             ],
>             "Resource": [
>                 "arn:aws:s3:::my-bucket",
>                 "arn:aws:s3:::my-bucket/*"
>             ],
>             "Condition": {
>                 "StringNotLike": {
>                     "aws:SourceVpc": "vpc-*"
>                 }
>             }
>         },
>         {
>             "Sid": "Access-to-specific-VPCE-only",
>             "Effect": "Deny",
>             "Principal": "*",
>             "Action": [
>                 "s3:GetObject",
>                 "s3:GetObjectVersion",
>                 "s3:PutObject",
>                 "s3:DeleteObject",
>                 "s3:DeleteObjectVersion",
>             ],
>             "Resource": [
>                 "arn:aws:s3:::my-bucket",
>                 "arn:aws:s3:::my-bucket/*"
>             ],
>             "Condition": {
>                 "StringNotEquals": {
>                     "aws:SourceVpce": "vpce-xxxxxxxxxxx"
>                 }
>             }
>         }
>     ]
> }
> ```

## Enable outbound private connectivity for a catalog

This section describes how to enable outbound private connectivity for a catalog in your Open Catalog account.

### Step 1: Enable private connectivity

You can enable private connectivity for a new or existing catalog:

* Enable private connectivity for a new catalog
* Enable private connectivity for an existing catalog

#### Enable private connectivity for a new catalog

Follow the instructions in [Create a catalog using Amazon Simple Storage Service (Amazon S3)](create-catalog.md).
Ensure that, for the catalog, the **Private Link** toggle is **Enabled**.

> **Note:**
>
> If you haven’t updated the bucket policy for the bucket where the catalog’s tables are stored, see Update your bucket policy. When you update a bucket policy, you restrict network access to your storage bucket to your private
> connectivity endpoint.

#### Enable private connectivity for an existing catalog

1. Sign in to Open Catalog.
2. In the navigation menu, select **Catalogs**.
3. In the list of catalogs, select the catalog for which you want to enable private connectivity.
4. On the **Catalog Details** tab, set the **PrivateLink** toggle to **Enabled**.

### Step 2: Create a table by using the query engine

To verify that your query engine is connected to your catalog through AWS PrivateLink, use your query engine to create a table and insert data
into it. If you can’t insert data into the table, you might not have configured AWS PrivateLink for the query engine.

## Troubleshooting

This section provides troubleshooting for issues with outbound private connectivity for network traffic.

### Can’t view the schema for a table in Open Catalog

**Symptom**

In Open Catalog, you select a table in your catalog (for example, `catalog1`) but receive the following error message: “No permissions to
access this resource.”

**Cause**

In AWS, you successfully updated your bucket policy to route network traffic through your VPC endpoint. However, in Open Catalog, you haven’t
enabled private connectivity for this catalog, so Open Catalog can’t access your bucket.

**Solution**

Enable private connectivity for the catalog (for example, `catalog1`). For details, see
Enable private connectivity for a catalog.

### ‘Business Critical’ error when running the SYSTEM$PROVISION_PRIVATELINK_ENDPOINT command

**Symptom**

In your Snowflake CLI connection, you run the `SYSTEM\$PROVISION_PRIVATELINK_ENDPOINT` command, but it fails with the following error message:
“Business Critical or higher edition is required for this operation. Please upgrade to the valid edition and then retry.”

**Cause**

The edition for your Open Catalog account isn’t Business Critical.

To enable private connectivity for outbound network traffic, which includes provisioning a private connectivity endpoint, the
[edition](https://docs.snowflake.com/en/user-guide/intro-editions) for your Snowflake Open Catalog account must be Business Critical.

**Solution**

Contact [Snowflake support](https://docs.snowflake.com/en/user-guide/contacting-support) for assistance with upgrading your Open Catalog account to Business Critical.

---
title: Manage private connectivity endpoints for Snowflake Open Catalog: Azure
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-outbound-manage-endpoints-azure.md
section: User Guide
---

# Manage private connectivity endpoints for Snowflake Open Catalog: Azure

Follow these steps to set up outbound private connectivity for outbound network traffic where the data for your catalogs is stored in Azure
cloud storage.

## Prerequisites

* Your Open Catalog account and external cloud storage must both be hosted on Azure.
* You need permissions to set the firewall rules for your Azure storage accounts to allow requests that are routed through specific
  private connectivity endpoints.
* Your third-party query engine or Snowflake engine must have access to your Azure storage through Azure Private Link. Here are options
  for granting this access:

  + Use private endpoints for Azure Storage. For the instructions,
    see [Use private endpoints for Azure Storage](https://learn.microsoft.com/en-us/azure/storage/common/storage-private-endpoints)
    in the Azure documentation.
  + Use an Azure Service endpoint.
  + Change the firewall settings to whitelist the IP address for the machine that the query engine runs on.

  Otherwise, when you enable outbound private connectivity, the engine
  can’t read or write to tables stored in the bucket, and Open Catalog
  can’t read or write metadata to the bucket.

## Step 1: Create a Snowflake CLI connection for Open Catalog

To set up private connectivity in Open Catalog, you need a Snowflake CLI connection for Open Catalog. Follow these steps to create this
connection. If you don’t already have Snowflake CLI installed, see [Installing Snowflake CLI](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation).

### Before you begin

To create a Snowflake CLI connection for Open Catalog, you need your full Open Catalog account identifier. The account identifier includes your Snowflake
organization name and your Open Catalog account name; for example, `<orgname>.<my-snowflake-open-catalog-account-name>`.

* To find your *Snowflake* organization name (`<orgname>`), see [Finding the organization and account name for an account](../admin-account-identifier.md).
* To find your *Snowflake Open Catalog* account name (`<my-snowflake-open-catalog-account-name>`), see
  [Find the account name for a Snowflake Open Catalog account](find-account-name.md).

> **Important:**
>
> To create this connection, you must be an Open Catalog user with service admin privileges. For information about service admin privileges, see
> [Service admin role](https://other-docs.snowflake.com/opencatalog/access-control.html#service-admin-role).

### Add a Snowflake CLI connection for Snowflake Open Catalog

Add a connection for the Snowflake Open Catalog account where you want to enable private connectivity.

* [Add a connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md)
  with the following values. For all other parameters, press `Enter` to skip specifying a value for the parameter.

  | Connection configuration parameters | Value |
  | --- | --- |
  | **Name for this connection** | Specify a name for the connection; for example, `myopencatalogconnection`. |
  | **Account name** | Specify your Snowflake organization name, followed by your Open Catalog account name, in this format:  `<orgname>-<my-snowflake-open-catalog-account-name>`.  For example, `ABCDEFG-MYACCOUNT1`.  To find these names, see Before you begin. |
  | **Username** | Specify your username for Open Catalog; for example, `jsmith`. |
  | **Password [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  Enter your password for Open Catalog; for example, `MyPassword123456789`. |
  | **Role for the connection [optional]** | This parameter is *not* optional when you create a connection for Open Catalog.  You must enter `POLARIS_ACCOUNT_ADMIN` |

### Test the Snowflake CLI connection

* To test your CLI connection, follow this example, which tests the connection for `myopencatalogconnection`:

  ```snowcli
  snow connection test -c myopencatalogconnection
  ```

  The response should look like this:

  ```snowcli
  +------------------------------------------------------------------------------+
  | key              | value                                                     |
  |----------------------------+-------------------------------------------------|
  | Connection name  | myopencatalogconnection                                   |
  | Status           | OK                                                        |
  | Host             | ABCDEFG-MYACCOUNT1.snowflakecomputing.com                 |
  | Account          | ABCDEFG-MYACCOUNT1                                        |
  | User             | jsmith                                                    |
  | Role             | POLARIS_ACCOUNT_ADMIN                                     |
  | Database         | not set                                                   |
  | Warehouse        | not set                                                   |
  +------------------------------------------------------------------------------+
  ```

### Set your Snowflake CLI connection for Snowflake Open Catalog as the default

To ensure that the connection you’re using always has the required POLARIS_ACCOUNT_ADMIN role granted to it, you can set the Snowflake CLI
connection you created for Open Catalog as the default connection. For more information about the default connection, see
[Set the default connection](../../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. Follow this example, which sets the `myopencatalogconnection` connection as the default:

   ```snowcli
   snow connection set-default myopencatalogconnection
   ```
2. To confirm that you’re using the correct user and role, run the following:

   ```snowcli
   snow sql -q "Select current_user(); select current_role();"
   ```

   The response should return your Open Catalog username and the CURRENT
   ROLE should be POLARIS_ACCOUNT_ADMIN.

   ```snowcli
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | JSMITH        |
   +----------------+
   select current_role();
   +-----------------------+
   | CURRENT_ROLE()        |
   |-----------------------|
   | POLARIS_ACCOUNT_ADMIN |
   +-----------------------+
   ```

## Step 2: Provision a private connectivity endpoint for a storage account

You must provision a private connectivity endpoint for each storage account that you want to use with your Open Catalog account.

> **Note:**
>
> If you’re provisioning a private connectivity endpoint for a Data Lake Storage storage account (not a blob storage account), you must provision
> two private connectivity endpoints. One of these endpoints is for the DFS endpoint, and the other is for the blob endpoint. For an example, see
> Provision private connectivity endpoints for a Data Lake Storage storage account.

Use your Snowflake CLI connection for Open Catalog to call the following system functions:

1. To provision a private connectivity endpoint for the storage account, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](https://docs.snowflake.com/en/sql-reference/functions/system_provision_privatelink_endpoint) system function.
2. To confirm that the private connectivity endpoint is ready to use, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_endpoints_info) system function.

For instructions, see [Manage private connectivity endpoints: Azure](https://docs.snowflake.com/en/user-guide/private-manage-endpoints-azure)
in the Snowflake documentation. Remember that the instructions refer to a Snowflake account instead of a Snowflake Open Catalog account,
but the process is the same in Open Catalog.

> **Important:**
>
> * You must use the POLARIS_ACCOUNT_ADMIN role instead of the ACCOUNTADMIN role mentioned in the instructions.
> * If the default Snowflake CLI connection that you set doesn’t have the POLARIS_ACCOUNT_ADMIN role granted to it, you must include the
>   following statement with your command: `USE ROLE POLARIS_ACCOUNT_ADMIN;`
> * With your command, you must insert a forward slash immediately before `$` to escape it. For example, `snow sql -q "SELECT SYSTEM\$GET_PRIVATELINK_CONFIG();"`.

### Example: Provision private connectivity endpoints for a Data Lake Storage storage account

If you’re using a Data Lake Storage storage account to store your Iceberg tables, you must provision two private connectivity endpoints
for the account. For more information, see
[Creating a private endpoint](https://learn.microsoft.com/en-us/azure/storage/common/storage-private-endpoints#creating-a-private-endpoint)
in the Azure documentation.

For example:

Provision a private endpoint for blob the endpoint

```sqlsyntax
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
'/subscriptions/mysubscriptionid/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
  'storagedemo.blob.core.windows.net',
  'blob'
);
```

Provision a private endpoint for the DFS endpoint

```sqlsyntax
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
'/subscriptions/mysubscriptionid/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
  'storagedemo.dfs.core.windows.net',
  'dfs'
);
```

## Step 3: Configure public network access to your storage account

In Azure, navigate to the networking settings in your storage account and configure the public network access to it. You can configure this
access as one of the following:

* Disable all public network access
* Disable all public network access except for the virtual networks and IP addresses that you specify

For more information, see [Configure Azure Storage firewalls and virtual networks](https://learn.microsoft.com/en-us/azure/storage/common/storage-network-security?tabs=azure-portal)
in the Azure documentation.

## Step 4: Enable private connectivity for a catalog

In this step, you enable private connectivity for a catalog in your Open Catalog account. You can enable private connectivity for a new
or existing catalog:

* Enable private connectivity for a new catalog
* Enable private connectivity for an existing catalog

### Enable private connectivity for a new catalog

Follow the instructions in [Create a catalog using Azure storage](create-catalog.md).
Ensure that, for the catalog, the **Private Link** toggle is **Enabled**.

### Enable private connectivity for an existing catalog

1. Sign in to Open Catalog.
2. In the navigation menu, select **Catalogs**.
3. In the list of catalogs, select the catalog for which you want to enable private connectivity.
4. On the Catalog Details tab, set the **PrivateLink** toggle to **Enabled**.

## Step 5: Approve a private endpoint connection to your storage account

To approve the connection, you must first either create or load a table in your catalog. Performing one of these actions generates a private
endpoint connection approval request in Azure.

1. Use your query engine to do one of the following:

   * If there aren’t any tables stored in your catalog, to create a table in your Open Catalog account , use your query engine and insert
     data into it.
   * If there is a table stored in your catalog, try to load the table.
   > **Note:**
   >
   > If you can’t insert data into the table, you might not have configured Azure Private Link for the query engine; if so, your query engine isn’t
   > connected to the catalog through Azure Private Link. To resolve this, configure Azure Private Link for the
   > query engine. For more information, see [Use private endpoints for Azure Storage](https://learn.microsoft.com/en-us/azure/storage/common/storage-private-endpoints)
   > and [Tutorial: Connect to a storage account using an Azure Private Endpoint](https://learn.microsoft.com/en-us/azure/private-link/tutorial-private-endpoint-storage-portal?tabs=dynamic-ip)
   > in the Azure documentation.
2. In Azure, follow these steps:

   1. Navigate to the networking settings for your storage account.
   2. Approve the connection request for the private endpoint connection. If you created a table, it is created in the Azure storage account
      when you approve the request.

## Troubleshooting

This section provides troubleshooting for issues with outbound private connectivity for network traffic.

### Can’t view the schema for a table in Open Catalog

**Symptom**

In Open Catalog, you select a table in your catalog (for example, `catalog1`) but receive the following error message: “No permissions to
access this resource.”

**Cause**

In Azure, you successfully updated the network settings for your storage account to route network traffic through your VPC endpoint. However, in
Open Catalog, you haven’t enabled private connectivity for this catalog, so Open Catalog can’t access your bucket.

**Solution**

Enable private connectivity for the catalog (for example, catalog1). For details, see
Enable private connectivity for a catalog.

### Received a “Failed to get subscoped credentials” error message in the query engine

**Symptom**

You attempt to read or write data to a table by using a query engine but receive the following error message: “Failed to get subscoped credentials.”

**Cause**

You locked down your storage account but haven’t provisioned the private connectivity endpoint or enabled private connectivity in your Open
Catalog account. As a result, Open Catalog can’t generate the subscoped credentials and return them to the query engine, so your query engine
can’t access the storage.

**Solution**

Follow these steps:

* If you haven’t provisioned the private connectivity endpoint yet, see
  Provision a private connectivity endpoint for the storage account.
* If you haven’t enabled private connectivity for the catalog yet, Enable private connectivity for the catalog.

### ‘Business Critical’ error when running the SYSTEM$PROVISION_PRIVATELINK_ENDPOINT command

**Symptom**

In your Snowflake CLI connection, you ran the `SYSTEM$PROVISION_PRIVATELINK_ENDPOINT` command but it fails with the following error message: “Business Critical or higher
edition is required for this operation. Please upgrade to the valid edition and then retry.”

**Cause**

The edition for your Open Catalog account isn’t Business Critical.

To enable private connectivity for outbound network traffic, which includes provisioning a private connectivity endpoint, the
[edition](https://docs.snowflake.com/en/user-guide/intro-editions) for your Snowflake Open Catalog account must be Business Critical.

**Solution**

Contact [Snowflake support](https://docs.snowflake.com/en/user-guide/contacting-support) for assistance with upgrading your Open Catalog account to Business Critical.

---
title: Manage private connectivity endpoints: AWS
source: https://docs.snowflake.com/en/user-guide/private-manage-endpoints-aws.md
section: User Guide
---

# Manage private connectivity endpoints: AWS

This topic provides information on how to manage private connectivity endpoints for use with outbound private connectivity to AWS.

## Provision private connectivity endpoints

> **Note:**
>
> AWS doesn’t support cross-region VPC interface endpoints for the Amazon S3 service. Therefore, cross-region PrivateLink isn’t supported
> for outbound connectivity to external stages and volumes that use the Amazon S3 service.
>
> Cross-region support for AWS PrivateLink isn’t available in government regions or in the People’s Republic of China.

You can use the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to create a private connectivity
endpoint by specifying the service or resource, and the host name. You must use the ACCOUNTADMIN role when you use this system function.

> **Note:**
>
> If you use private connectivity for an external stage or external volume, you must use the wildcard character `*` when you specify
> the host name. Using the wildcard doesn’t mean that all Amazon S3 buckets are accessed over a private connection. Only buckets referenced
> by a Snowflake object that is enabled for private connectivity — that is, the external stage or external volume — can be accessed
> through the VPC endpoint.

For example, to create a PrivateLink endpoint that connects to Amazon S3, execute the following SQL statement to configure an endpoint for
`us-west-2`:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'com.amazonaws.us-west-2.s3',
  '*.s3.us-west-2.amazonaws.com'
);
```

> **Note:**
>
> When you configure an endpoint for Amazon S3 or another platform as a service (PaaS), such as KMS, that service must be in the same region
> as your Snowflake account.

The SYSTEM$PROVISION_PRIVATELINK_ENDPOINT function accepts a provider service name and host name as its arguments. You can obtain these
values by using the `describe-vpc-endpoint-services` subcommand from the AWS command line. As described in the
[AWS documentation](https://awscli.amazonaws.com/v2/documentation/api/latest/reference/ec2/describe-vpc-endpoint-services.html), this AWS
subcommand returns a JSON object with a `ServiceName` field and a `PrivateDnsName` field. Use the following table to determine
which values to use for the SYSTEM$PROVISION_PRIVATELINK_ENDPOINT function:

| SYSTEM$PROVISION_PRIVATELINK_ENDPOINT argument | `describe-vpc-endpoint-services` output |
| --- | --- |
| `provider_service_name` | `ServiceName` |
| `host_name` | `PrivateDnsName`  If you use private connectivity for external stages or external volumes, you must use the value with a wildcard. |

You can create a private connectivity endpoint to a VPC endpoint service in an AWS region that is different from your Snowflake region.
If you do, ensure that the VPC endpoint service supports the Snowflake region. For information about finding the region names for your account,
see [Find the cloud-provider’s name of the region for your account](admin-security-privatelink.md).

> **Important:**
>
> *Before* you specify the `provider_service_name` as an argument for the SYSTEM$PROVISION_PRIVATELINK_ENDPOINT function, refer to the
> Cross-Region Connectivity Pricing section on the [AWS PrivateLink pricing](https://aws.amazon.com/privatelink/pricing) page to determine
> the appropriate region.

If the target service is a [VPC endpoint service](https://docs.aws.amazon.com/vpc/latest/privatelink/privatelink-share-your-services.html), the endpoint service must allow Snowflake
to connect to it. Before you create an endpoint, add the value of `privatelink-account-principal`
from the output of [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) as an [allowed principal](https://docs.aws.amazon.com/vpc/latest/privatelink/configure-endpoint-service.html#add-remove-permissions) of the VPC endpoint service.

The following SQL statement configures an endpoint to a VPC endpoint service:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'com.amazonaws.vpce.us-west-2.vpce-svc-012345678910f1234',
  'my.onprem.storage.com'
);
```

> **Note:**
>
> In this example, the service might be in different region from your Snowflake account.

After you create an endpoint, there is a delay before you can use the endpoint. For information about checking the status of a created
endpoint, see [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md).

## Set up connectivity to an endpoint that can’t be accessed directly

Not every service allows Snowflake to connect directly to specific instances through an interface endpoint. In these cases, you can
instead enable access to the service by
setting up a proxy and exposing the service as a VPC endpoint service.

For a walkthrough specific to Amazon RDS, see the blog post
[Connecting To Amazon RDS Using Private Connectivity from Snowflake](https://medium.com/snowflake/connecting-to-amazon-rds-using-private-connectivity-from-snowflake-c2b538d623ed).

### Discover whether a service is available for direct access

Snowflake can usually access an AWS service directly through private connectivity if one of the following is true:

* The DNS name of the service—its `PrivateDnsName` value from the output of AWS
  [DescribeVpcEndpointServices](https://docs.aws.amazon.com/AWSEC2/latest/APIReference/API_DescribeVpcEndpointServices.html)—is
  prefixed with a wildcard.

  If the service’s DNS name starts with a wildcard character `*`, it’s likely that AWS supports directly accessing individual resources on
  that service. The DNS name is usually in this form:

  ```none
  *.<service>.<region>.amazonaws.com
  ```
* The service is purely data-plane. [AWS Bedrock Runtime](https://docs.aws.amazon.com/bedrock/latest/APIReference/API_Operations_Amazon_Bedrock_Runtime.html)
  is an example.

  To discover this about a given service, see the AWS documentation and blog posts.

### Access a service when direct access is not available

When a service is not available through direct access via an interface endpoint, you can enable access to the service by setting up
a proxy and exposing the service as a VPC endpoint service.

Examples of such services include the following:

* Amazon EC2 instances at `ec2.us-west-2.amazonaws.com`
* Amazon Relational Database Service (RDS) servers at `rds.us-west-2.amazonaws.com`

#### Set up AWS for access through a proxy

To expose a service instance through a proxy, you set up a virtual private cloud (VPC) and load balancer on AWS, then create a Snowflake
private link endpoint using the service name and load balancer DNS name of the AWS endpoint service.

The following describes the basic steps:

1. On AWS, create a [virtual private cloud (VPC)](https://docs.aws.amazon.com/vpc/latest/userguide/create-vpc.html) with subnets
   spanning three different availability zones.

   Choose initial availability zones (for example, az1 and az2) for your resources; Snowflake might not support newer AZs in some regions. Ensure
   that endpoints and other resources are created in the same availability zones to avoid cross-zone traffic.
2. In network settings for the service instance you want to access, ensure that the instance is in the VPC you created.
3. Create a [target group](https://docs.aws.amazon.com/elasticloadbalancing/latest/network/create-target-group.html) that contains the
   service instance you want to access.
4. Create a [network load balancer](https://docs.aws.amazon.com/elasticloadbalancing/latest/network/create-network-load-balancer.html)
   that forwards traffic to the target group you created.
5. Create an [endpoint service](https://docs.aws.amazon.com/vpc/latest/privatelink/configure-endpoint-service.html) with the network
   load balancer you created.

   Record the endpoint service name—`endpoint_service_name`—for use when setting up Snowflake for access to the service.
6. In Snowflake, execute the following query to retrieve the Snowflake account principal to allow the creation of endpoints:

   ```sqlexample
   SELECT key, value FROM TABLE(FLATTEN(INPUT => PARSE_JSON(SYSTEM$GET_PRIVATELINK_CONFIG())));
   ```
7. From the results of the query, locate the `privatelink-account-principal` key and note its value.
8. On AWS, for the endpoint service you created, update the Allow principals section to add a principal whose ARN is the
   `privatelink-account-principal` key value from Snowflake.
9. In Snowflake, create a private endpoint to the AWS endpoint service
   you created.

   When you execute the SYSTEM$PROVISION_PRIVATELINK_ENDPOINT function, use the following values as arguments:

   | SYSTEM$PROVISION_PRIVATELINK_ENDPOINT argument | Value from AWS configuration |
   | --- | --- |
   | `provider_service_name` | AWS endpoint Service name—the `endpoint_service_name` value—from the details section of the endpoint service. |
   | `host_name` | DNS Name from the network load balancer you created. |
10. On AWS, approve the PrivateLink connection:

    1. Navigate to the endpoint connections for the endpoint service you created.
    2. Select the relevant endpoint connection in a pending state.
    3. Click Accept Endpoint Connection Request.
11. Verify the endpoint status by running the following query.

    Ensure that the endpoint status changed from `pendingAcceptance` to `available`.

    ```sqlexample
    SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
    ```

## Change the host name of a private connectivity endpoint

You can change only the host name of a previously provisioned, private connectivity endpoint without changing its network resource.
Changing the host name for an endpoint tells Snowflake that this endpoint now connects to the same service by using a different host name. To
change the host name, call the [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](../sql-reference/functions/system_set_privatelink_endpoint_hostname.md) system function.

## Remove a private connectivity endpoint to services

You can use the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function to remove a private connectivity
endpoint by specifying the service or resource.

After the endpoint is removed, the endpoint is put on a queue to be deleted after 7 days.

You need to use the ACCOUNTADMIN role when using this system function.

For example, to remove a PrivateLink with external access to Amazon S3, execute the following SQL statement:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT('com.amazonaws.us-west-2.s3');
```

## Restore a private connectivity endpoint to services

You can use the [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_restore_privatelink_endpoint.md) system function to restore a removed private connectivity
endpoint that is still on the deletion queue by specifying the service or resource. If the endpoint is not found on the deletion queue, then
you cannot restore the endpoint.

You need to use the ACCOUNTADMIN role when using this system function.

For example, to restore a PrivateLink with external access to Amazon S3, execute the following SQL statement:

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT('com.amazonaws.us-west-2.s3');
```

## List all private connectivity endpoints to services

You can use the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) system function to list all private connectivity
endpoints, and information about the endpoints, in your account.

You need to use the ACCOUNTADMIN role when using this system function.

For example, to list all AWS PrivateLink endpoints with AWS services, execute the following SQL statement:

SQLReturned value

```sqlexample
SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
```

```json
[
  {
    "provider_service_name": "com.amazonaws.us-west-2.s3",
    "snowflake_endpoint_name": "vpce-123456789012abcdea",
    "endpoint_state": "CREATED",
    "host": "*.s3.us-west-2.amazonaws.com",
    "status": "Available"
  },
  ...
]
```

For a description of the fields of the JSON object returned by the function, see [Returns](../sql-reference/functions/system_get_privatelink_endpoints_info.md).

> **Note:**
>
> You can also query the [OUTBOUND_PRIVATELINK_ENDPOINTS](../sql-reference/account-usage/outbound_privatelink_endpoints.md) view in the
> ACCOUNT_USAGE schema to list the private endpoints in your account.

---
title: Manage private connectivity endpoints: Azure
source: https://docs.snowflake.com/en/user-guide/private-manage-endpoints-azure.md
section: User Guide
---

# Manage private connectivity endpoints: Azure

This topic provides information on how to manage private connectivity endpoints for use with private connectivity to an external service. The examples are specific to Microsoft Azure.

## Provision private connectivity endpoints

You can create a private connectivity endpoint by calling the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system
function. For example, for your Snowflake account on Microsoft Azure:

Provision a private endpoint to allow Snowflake on Microsoft Azure to connect to the Microsoft Azure API Management service in your Microsoft Azure VNet:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
  'aztest1-external-function-api.azure.net',
  'Gateway'
  );
```

```output
Private endpoint with ID "/subscriptions/e48379a7-2fc4-473e-b071-f94858cc83f5/resourcegroups/test_rg/providers/microsoft.network/privateendpoints/32bd3122-bfbd-417d-8620-1a02fd68fcf8" to resource "/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api" has been provisioned successfully. Please note down the endpoint ID and approve the connection from it on the Azure portal.
```

Provision a private endpoint to allow Snowflake on Microsoft Azure to connect to an external service using external network access:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/leorg1/providers/Microsoft.Sql/servers/myserver',
  'testdb.database.windows.net',
  'sqlServer'
  );
```

```output
"Resource Endpoint with id "/subscriptions/f0abb333-1b05-47c6-8c31-dd36d2512fd1/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" provisioned successfully"
```

Provision a private endpoint to allow Snowflake to connect to an external stage for Microsoft Azure:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
  'storagedemo.blob.core.windows.net',
  'blob'
);
```

```output
"Resource Endpoint with id "/subscriptions/57faea9a-20c2-4d35-b283-9c0c1e9593d8/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" provisioned successfully"
```

Snowflake calls the APIs for the cloud platform that hosts your Snowflake account to create the endpoint and updates the related networking
configurations.

> **Note:**
>
> Private connectivity endpoints aren’t supported for Microsoft Fabric OneLake storage locations.

## Change the host name of a private connectivity endpoint

You can change only the host name of a previously provisioned, private connectivity endpoint without changing its network resource.
Changing the host name for an endpoint tells Snowflake that this endpoint now connects to the same service by using a different host name. To
change the host name, call the [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](../sql-reference/functions/system_set_privatelink_endpoint_hostname.md) system function.

## List private connectivity endpoints

You can list the private connectivity endpoints that you create by calling the
[SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) system function. For example, for your Snowflake account on Microsoft Azure:

SQLReturned value

```sqlexample
SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
```

```json
  [
     {
        "provider_resource_id": "/subscriptions/11111111-2222-3333-4444-5555555555/...",
        "subresource": "sqlServer",
        "snowflake_resource_id": "/subscriptions/fa57a1f0-b4e6-4847-9c00-95f39520f...",
        "host": "testdb.database.windows.net",
        "endpoint_state": "CREATED",
        "status": "Approved",
     }
  ]
```

> **Note:**
>
> You can also query the [OUTBOUND_PRIVATELINK_ENDPOINTS](../sql-reference/account-usage/outbound_privatelink_endpoints.md) view in the
> ACCOUNT_USAGE schema to list the private endpoints in your account.

## Deprovision private connectivity endpoints

You can delete an existing private connectivity endpoint by calling the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function. For example, for your Snowflake account on Microsoft Azure:

Deprovision a private endpoint to prevent Snowflake on Microsoft Azure from connecting to the Microsoft Azure API Management service in your
Microsoft Azure VNet:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
  'Gateway'
  );
```

```output
Private endpoint with id "/subscriptions/e48379a7-2fc4-473e-b071-f94858cc83f5/resourcegroups/test_rg/providers/microsoft.network/privateendpoints/5ef8fd34-07db-4583-b0dd-0e2360398ed3" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
```

Deprovision a private endpoint to prevent Snowflake on Microsoft Azure from connecting to an external service using external network access:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/leorg1/providers/Microsoft.Sql/servers/myserver/databases/testdb',
  'sqlServer'
  );
```

```output
"Resource Endpoint with id "/subscriptions/f0abb333-1b05-47c6-8c31-dd36d2512fd1/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" deprovisioned successfully"
```

Deprovision a private endpoint to prevent Snowflake from connecting to an external stage for Microsoft Azure:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
  '/subscriptions/cb72345g5-d347-4sdc-r3ee-70d234551a78/resourceGroups/rg-db-dev/providers/Microsoft.Storage/storageAccounts/dbasdfffext',
  'blob'
);
```

```output
"Resource Endpoint with id "/subscriptions/57faea9a-20c2-4d35-b283-9c0c1e9593d8/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" deprovisioned successfully"
```

## Restore a deprovisioned private connectivity endpoint

You can restore a private connectivity endpoint that you deprovisioned within 7 days of deprovisioning it by calling the
[SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_restore_privatelink_endpoint.md) system function. After 7 days, the endpoint cannot be restored and you
need to provision a new endpoint.

Restore a private endpoint to allow Snowflake on Microsoft Azure to connect to the Azure API Management service in your Azure VNet:

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
  '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/my_rg/providers/Microsoft.Sql/servers/my_db_server',
  'sqlServer'
);
```

```output
Private endpoint with id ''/subscriptions/66666666-7777-8888-9999-0000000000/resourcegroups/rg/providers/microsoft.network/privateendpoints/00000000-1111-2222-3333-4444444444'' restored successfully.
```

## Troubleshooting

### Microsoft Azure external services: You cannot access a specified subscription

|  |  |
| --- | --- |
| Error | ```output (LinkedAuthorizationFailed) The client has permission to perform action '<action_name>' on scope '<service_name>', however the current tenant '<tenant_id>' is not authorized to access linked subscription '<subscription_id'.  Code: LinkedAuthorizationFailed  Message: The client has permission to perform action '<action_name>' on scope '<service_name>', however the current tenant '<tenant_id>' is not authorized to access linked subscription '<subscription_id>'. ``` |
| Cause | The private endpoint that maps to the external service does not have the correct information to access the subscription. |
| Solution | 1. Call the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function to delete the endpoint for the    external service. 2. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to recreate the endpoint for the    external service. Be sure    to specify the correct subscription, hostname, and subresource values. 3. [Replace](../sql-reference/sql/create-network-rule.md) the network rule and be sure to specify the correct hostname value in the    `VALUE_LIST` property. |

---
title: Manage private connectivity endpoints: Google Cloud
source: https://docs.snowflake.com/en/user-guide/private-manage-endpoints-gcp.md
section: User Guide
---

# Manage private connectivity endpoints: Google Cloud

This topic provides information about how to manage private connectivity endpoints for use with private connectivity to an external service.
The examples are specific to Google Cloud.

## Provision private connectivity endpoints

You can create a private connectivity endpoint by calling the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system
function. For example, for your Snowflake account on Google Cloud:

Connect to a published service:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'projects/my-project/regions/us-west2/serviceAttachments/my-http-server',
  'my-http-server.com'
);
```

After creating the endpoint, the connection must be accepted on Google Cloud by the resource provider.

Provision a private endpoint to allow Snowflake on Google Cloud to connect to a service attachment in your Google Cloud VPC Network:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment',
  'my-service.com'
  );
```

```output
Private endpoint with ID "abcd0000000000000001" to resource "projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment"
was provisioned successfully. Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

Provision a private endpoint to allow Snowflake on Google Cloud to connect to the regional Cloud Key Management Service (Cloud KMS) endpoint:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'cloudkms.us-east4.rep.googleapis.com',
  'cloudkms.us-east4.rep.googleapis.com'
  );
```

```output
Private endpoint with ID "abcd0000000000000001" to resource "cloudkms.us-east4.rep.googleapis.com" was provisioned successfully.
Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

Provision a private endpoint to allow Snowflake to connect to an external stage for Google Cloud:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
  'storage.us-east4.rep.googleapis.com',
  'storage.us-east4.rep.googleapis.com'
);
```

```output
Private endpoint with ID "abcd0000000000000001" to resource "storage.us-east4.rep.googleapis.com" was provisioned successfully.
Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

Snowflake calls the APIs for the cloud platform that hosts your Snowflake account to create the endpoint. Snowflake also updates the related
networking configurations.

You can provision private connectivity endpoints to Google API [regional service endpoints](https://cloud.google.com/vpc/docs/regional-service-endpoints).
Connections to these Google-managed endpoints are automatically approved.

## Change the host name of a private connectivity endpoint

You can change only the host name of a previously provisioned, private connectivity endpoint without changing its network resource.
Changing the host name for an endpoint tells Snowflake that this endpoint now connects to the same service by using a different host name. To
change the host name, call the [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](../sql-reference/functions/system_set_privatelink_endpoint_hostname.md) system function.

## List private connectivity endpoints

You can list the private connectivity endpoints that you create by calling the
[SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) system function. For example, for your Snowflake account on Google Cloud:

SQLReturned value

```sqlexample
SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
```

```json
  [
     {
        "provider_resource_id": "projects/my-project/regions/us-east4/serviceAttachments/...",
        "snowflake_resource_id": "abcd0000000000000001",
        "host": "my-service.com",
        "endpoint_state": "CREATED",
        "status": "ACCEPTED",
     }
  ]
```

> **Note:**
>
> You can also query the [OUTBOUND_PRIVATELINK_ENDPOINTS](../sql-reference/account-usage/outbound_privatelink_endpoints.md) view in the
> ACCOUNT_USAGE schema to list the private endpoints in your account.

## Deprovision private connectivity endpoints

You can delete an existing private connectivity endpoint by calling the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function. For example, for your Snowflake account on Google Cloud:

Deprovision a private endpoint to prevent Snowflake on Google Cloud from connecting to the service attachment in your Google Cloud VPC
Network:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
  'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment'
  );
```

```output
Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
```

Deprovision a private endpoint to prevent Snowflake on Google Cloud from connecting to a regional Google service endpoint (CloudKMS):

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
 'cloudkms.us-east4.rep.googleapis.com'
 );
```

```output
Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
```

Deprovision a private endpoint to prevent Snowflake from connecting to an external stage for Google Cloud:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
 'storage.us-east4.rep.googleapis.com'
 );
```

```output
Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
```

## Restore a deprovisioned private connectivity endpoint

You can restore a private connectivity endpoint that you deprovisioned within 7 days of deprovisioning it by calling the
[SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_restore_privatelink_endpoint.md) system function. After 7 days, the endpoint can’t be restored and you
need to provision a new endpoint.

Restore a private endpoint to allow Snowflake on Google Cloud to connect to the Google API Management service in your Google Cloud VPC Network:

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
  'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment'
);
```

```output
Private endpoint with id ''abcd0000000000000001'' restored successfully.
```

## Usage notes

A Snowflake account that is used to provision private endpoints can only connect with services in the same region. For example, a Snowflake account
in `us-central1` can only provision private endpoints to service attachments and Google regional endpoints also located in `us-central1`.

## Limitations

Cross-regional connections aren’t supported.

---
title: Manage provider profiles
source: https://docs.snowflake.com/en/user-guide/data-exchange-becoming-a-provider.md
section: User Guide
---

# Manage provider profiles

The Data Exchange has the following requirements:

* A full Snowflake account to provide or consume data sets; reader accounts are not supported.
* By default, the ACCOUNTADMIN role can perform provider functions, such as creating a listing, creating a provider profile, reviewing listing requests, etc. These tasks can be delegated to other roles. For more information, see [Grant privileges to other roles](data-exchange-marketplace-privileges.md).

## Provider profile fields

The following table describes parameters required for creating and configuring your provider profile in the Data Exchange.

| Field Name | Description | Example |
| --- | --- | --- |
| **Logo** | A high-resolution image of your logo in the JPG or PNG format. The file size cannot exceed 2MB. Square image is recommended. | image.jpg |
| **Company Name** | Company name or brand name as it appears in the data listing. This is not the name of your Snowflake account. If the provider name includes special characters, these characters are parsed out in the suggested database name. The company name is the name of the provider profile. As a provider, you can have more than one provider profile (the provider nickname must be unique for each profile). When you publish a listing, you associate it with a provider profile. | ACME |
| **Description** | A short introduction (2-3 sentences) of the provider. | Acme, recognized and documented as the most accurate source of weather forecasts and warnings in the world, has saved tens of thousands of lives, prevented hundreds of thousands of injuries and tens of billions of dollars in property damage. With global headquarters in Palo Alto, CA and other offices around the world, Acme serves more than 1.5 billion people daily to help them plan their lives. |
| **Contact Email** | Email address for potential data consumers to contact you, typically a Sales contact. | `sales@example.com` |
| **Support Link** | Email for data consumers to contact the provider with questions. This is typically a Technical Support contact. | `support@example.com` |
| **Privacy Policy Link** | A link to a privacy policy on provider’s website. The link is required only for personalized shares. | `https://www.example.com/privacy` |
| **Snowflake General Contact Email** | An email address for Snowflake to contact the provider with questions about listings. |  |
| **Snowflake Technical Contact Email** | An email address for Snowflake to contact the provider about shared data. |  |

## Create a provider profile

When you join the Data Exchange as a provider, you must set up your provider profile. A provider profile is required for publishing a data listing.

If you are assigned the Data Exchange Admin role, or you have [Provider profile level privileges](data-exchange-marketplace-privileges.md), you can create and manage provider profiles for a Data Exchange in the Manage Exchanges tab of Snowsight.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Manage Exchanges and then select the Provider Profiles tab.
4. Select Add Profile.
5. Complete the required fields. For the description of the fields, see Delete a provider profile below.
6. Save your changes.

## Edit a provider profile

You can edit a provider profile at any time. The profile updates are reflected for all listings associated with the profile.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Manage Exchanges and select the Provider Profiles tab.
4. Select the profile you want to edit.
5. From the Manage drop-down list, select Update Profile.
6. Make changes to the profile.
7. Select Next to review the preview of the profile.
8. Save your changes.

## Delete a provider profile

You can delete a provider profile as long it is not associated with any listings, both published or unpublished.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. Select Manage Exchanges and select the Provider Profiles tab.
4. Select the profile that you want to delete.
5. From the Manage drop-down list, select Delete Profile.

   If the Delete Profile option is inactive, make sure no listings are associated with the profile.

---
title: Manage reader accounts
source: https://docs.snowflake.com/en/user-guide/data-sharing-reader-create.md
section: User Guide
---

# Manage reader accounts

Reader accounts (formerly known as “read-only accounts”) enable providers to share data with consumers who are not already Snowflake customers,
without requiring the consumers to become Snowflake customers.

> **Note:**
>
> All tasks described in this topic must be performed using the ACCOUNTADMIN role or a role granted the CREATE ACCOUNT global privilege.

## Overview

A reader account enables data consumers to access and query data shared by the provider of the account, with no setup or usage costs for
the consumer, and no requirements for the consumer to sign a licensing agreement with Snowflake.

The reader account is created, owned, and managed by the provider account, which assumes all responsibility for credit charges incurred by
users in the reader account. Similar to standard consumer accounts, the provider account uses *shares* to share databases with reader
accounts; however, a reader account can only consume data from the provider account that created it.

> **Note:**
>
> Warehouses in a reader account can consume an unlimited number of credits each month, which will be charged to your provider account.
> To limit usage, set up a [resource monitor for the warehouse](data-sharing-reader-config.md).

### What is restricted/allowed in a reader account?

A reader account is intended primarily for querying data shared by the provider of the account. You can work with data, for example,
by creating materialized views.

You cannot perform the following tasks in a reader account:

* Set a [data metric function](data-quality-intro.md) on objects in the reader account.
* Upload new data.
* Modify existing data.
* Unload data using a storage integration. However, you can use the
  [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command with your connection credentials to unload data into a cloud storage location.

Additionally, you cannot execute the following commands in a reader account:

* [INSERT](../sql-reference/sql/insert.md)
* [UPDATE](../sql-reference/sql/update.md)
* [DELETE](../sql-reference/sql/delete.md)
* [MERGE](../sql-reference/sql/merge.md)
* [CREATE IMAGE REPOSITORY](../sql-reference/sql/create-image-repository.md)
* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)
* [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md)
* [CREATE PIPE](../sql-reference/sql/create-pipe.md)
* [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md)
* [CREATE SERVICE](../sql-reference/sql/create-service.md)
* [CREATE SHARE](../sql-reference/sql/create-share.md)
* [CREATE STAGE](../sql-reference/sql/create-stage.md)
* [CREATE STREAMLIT](../sql-reference/sql/create-streamlit.md)
* [SHOW PROCEDURES](../sql-reference/sql/show-procedures.md)

All other operations are allowed.

### Who provides support for a reader account?

Because a reader account does not have a licensing agreement with Snowflake, support services are not available to the general users in the account. Instead, as the provider of the account, you
field questions and requests from users in the account and respond as appropriate.

If you are unable to directly answer their questions or resolve their requests/issues, you can open a Snowflake Support ticket through the normal channels (as outlined in your support agreement).
Once a response has been provided by Snowflake Support, you then communicate the information back to the appropriate users in the reader account.

## Managing and creating reader accounts using the web interface

If you have the ACCOUNTADMIN role (or have a role that has been granted the CREATE ACCOUNT privilege), you can use Snowsight to
perform most tasks related to creating and managing reader accounts.

### Using Snowsight

To create or manage reader accounts in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select the Accounts tab.
4. Select the Reader accounts sub-tab.

On this page, you can do the following:

* Create a reader account by selecting + New.
* Review existing reader accounts.
* Drop a reader account by selecting … » Drop.

> **Note:**
>
> By default, the total number of reader accounts a provider can create is 20. If you reach the limit and require creating additional
> accounts, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> If you dropped a reader account in order to create a new account without exceeding this limit, you cannot create the new reader account for
> 7 days, which is the retention period for deleted reader accounts.

## DDL for reader accounts

To enable creating and managing reader accounts, Snowflake provides a first-class object, MANAGED ACCOUNT, that supports the following DDL commands:

* [CREATE MANAGED ACCOUNT](../sql-reference/sql/create-managed-account.md)
* [DROP MANAGED ACCOUNT](../sql-reference/sql/drop-managed-account.md)
* [SHOW MANAGED ACCOUNTS](../sql-reference/sql/show-managed-accounts.md)

## Enabling other roles to create and manage reader accounts

By default, only users with the ACCOUNTADMIN role can create reader accounts and therefore, as the owner of the account, manage the accounts. To support delegating these tasks to other users, the
CREATE ACCOUNT global privilege can be granted to other roles (system-defined or custom). Then, users with the role can create reader accounts and perform all tasks associated with managing the
accounts created using the role.

For example, to grant the privilege to the SYSADMIN role:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> GRANT CREATE ACCOUNT ON ACCOUNT TO ROLE SYSADMIN;
> ```

## Creating and managing reader accounts using SQL

In addition to using the web interface to manage and create reader accounts, you can also use SQL.

### Creating a reader account

To create a reader account, use the ACCOUNTADMIN role (or a role granted the CREATE ACCOUNT global privilege) and the
[CREATE MANAGED ACCOUNT](../sql-reference/sql/create-managed-account.md) command.

In the command, specify the identifier for the account and the user who will serve as the administrator for the account. For example,
use the following syntax:

```sqlsyntax
USE ROLE ACCOUNTADMIN;

CREATE MANAGED ACCOUNT <account_name>
    ADMIN_NAME = <username> , ADMIN_PASSWORD = '<password>' ,
    TYPE = READER;
```

After running the command, you see the account name and login URL for the account:

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| status                                                                                                                                                                            |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| {"accountName":"READER_ACCT1","accountLocator":"IIB88126","url":"https://myorg-reader_acct1.snowflakecomputing.com","accountLocatorUrl":"https://iib88126.snowflakecomputing.com"}|
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

Note:

* By default, the total number of reader accounts a provider can create is 20. If you reach the limit and require creating additional
  accounts, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

  If you dropped a reader account in order to create a new account without exceeding this limit, you cannot create the new reader account for
  7 days, which is the retention period for deleted reader accounts.
* The `url` is the preferred [account URL](organizations-connect.md) for the new reader account. The account locator is a legacy identifier for the account.
* The reader account utilizes the same [Snowflake Edition](intro-editions.md) as the provider account and is created in the same [region](intro-regions.md).

> **Important:**
>
> After creating a reader account, wait for up to five minutes to ensure that the account is fully provisioned. Then, you must perform the following additional tasks before the account is ready to use:
>
> 1. [Add the account to one or more shares](data-sharing-provider.md) so that the Snowflake objects in the shares can be shared with the account.
> 2. [Configure the account](data-sharing-reader-config.md).

### Renaming a reader account

You must use SQL commands to rename a reader account. For instructions, see [Renaming an account](organizations-manage-accounts-rename.md).

### Dropping a reader account

To drop a reader account, use the [DROP MANAGED ACCOUNT](../sql-reference/sql/drop-managed-account.md) command. For example:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> DROP MANAGED ACCOUNT reader_acct1;
> ```

> **Attention:**
>
> Dropping a reader account drops all the objects created in the account and immediately restricts all access to the account. It also removes the account from your total number of reader accounts.
>
> This operation can not be undone. Before you drop a reader account, please take this into consideration.

### Viewing reader accounts

To view all the reader accounts that have been created for your account, use the [SHOW MANAGED ACCOUNTS](../sql-reference/sql/show-managed-accounts.md) command. For example:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SHOW MANAGED ACCOUNTS;
> ```

This command can be used to monitor the total number of reader accounts for your account. If the total number reaches the limit (20), you may need to drop some accounts or contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request the limit be increased.

In addition, you can use the views in the READER_ACCOUNT_USAGE schema (in the SNOWFLAKE shared database) to query information about the reader accounts created for your account. For more details, see
[Account Usage](../sql-reference/account-usage.md).

## Redirecting client connections in case of failover

In the event of an outage in a region, you can use [Client Redirect](client-redirect.md) to provide continued access to
data consumers using reader accounts. Create two reader accounts in different regions and designate one to act as the primary connection.
In the event of an outage in a region, you can redirect client connections to the reader account in another region. For more information,
see [Configuring Client Redirect and reader accounts](client-redirect.md).

---
title: Manage Snowflake Support cases
source: https://docs.snowflake.com/en/user-guide/ui-support.md
section: User Guide
---

# Manage Snowflake Support cases

If you have a verified email address and sufficient privileges to create, view, and manage Snowflake Support cases, you can do so
using the Support page in Snowsight.

## Verify your email address

Before you can access the Support system, you must verify your email:

In some cases, you automatically receive an email prompting you to Please Validate Your Email. If you didn’t, follow these
steps to verify your email address:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. In My Profile, configure your email address:

   * If you don’t have an email address listed, enter an email address in the Email field, and then select Save.
   * If you can’t enter an email address, an account administrator must either add an email address on your behalf or grant your user
     the role with the OWNERSHIP privilege on your user.
   * If you didn’t receive an email, select Resend verification email. Snowflake sends a verification email to the address listed.
4. Open your email, and then select the link in the email to validate your email address.

## Privileges required to access the Support system

By default, users with the organization administrator (ORGADMIN) or account administrator (ACCOUNTADMIN) roles can access the Support system.

If you want users with a custom role to access the Support system, a user with an administrator role must grant one or more global
privileges to that custom role.

The following table describes the available privileges and indicates the system role that has each privilege by default:

| Privilege | Description | Granted to system role |
| --- | --- | --- |
| MANAGE ORGANIZATION SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases for the organization. | ORGADMIN |
| MANAGE ACCOUNT SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases for the current account. | ACCOUNTADMIN |
| MANAGE USER SUPPORT CASES | Grants the ability to view, comment on, and manage all Support cases that were opened by the current user. | ACCOUNTADMIN |

Snowflake recommends that you grant the MANAGE ORGANIZATION SUPPORT CASES privilege to a role for users that require the broadest
historical view of support cases in your organization.

### Grant access to the Support system to individual users

Only organization administrators (users with the ORGADMIN role) can grant the MANAGE ORGANIZATION SUPPORT CASES privilege to roles.

Only account administrators (users with the ACCOUNTADMIN role) can grant either the MANAGE ACCOUNT SUPPORT CASES or MANAGE USER SUPPORT
CASES privilege to roles.

To grant one or more of the privileges to a custom role, run a
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) statement or use Snowsight.

For example, grant the MANAGE USER SUPPORT CASES privilege to the role `myrole`:

> ```sqlexample
> GRANT MANAGE USER SUPPORT CASES ON ACCOUNT TO ROLE myrole;
> ```

## Managing Support cases

When accessing the Support system for the first time, a user must select Enable Support.

### Create Support cases

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Support.
3. Select + Support Case.
4. Complete and submit the form. Provide as much useful information as possible to help Snowflake Support resolve your issue.
5. Optionally, add Snowflake users as *watchers* to your case to receive email notifications when the case is updated or comments are added.
   To add a user as a watcher, the user must have enabled the Support page, or have a registered user account in the Snowflake Community.
   If you add a watcher who has a role with sufficient privileges, they can also view, comment on,
   and modify the case.

> **Important:**
>
> It is your responsibility to ensure that no confidential information, export-controlled data, personal data, sensitive data, or other
> regulated data is entered into the form. Ensure that the information submitted is not “Customer Data” as defined in the Snowflake Terms
> of Service or any other agreement between you and Snowflake covering use of the Snowflake Service.

#### Create a Support case during an incident

During an incident, click Create Case on the incident banner in Snowsight to quickly create a related Support case. The summary, category, and sub-category fields are pre-filled to streamline the process.

* In Snowsight, select Support from the bottom navigation bar to view your Support cases. The Support Cases page displays new or ongoing incident summaries and their status.

### Attach files to Support cases

You can attach a maximum of 30 files to a Support case.

Each attachment must be no more than 2GB, and file names for attachments must be no more than 255 characters in length, including the
file extension.

File attachments must use one of the following file types:

* `gz`
* `gzip`
* `jpeg`
* `jpg`
* `log`
* `png`
* `txt`
* `zip`

### View and update a Support case

To review an open or closed Support case:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Support.
3. In the table of Support cases, select the row for the case that you want to view. The case details page opens.

   You can add comments to the case to answer questions or provide additional details.

### Escalate a Support case

If your case requires expedited resolution, [escalate](https://community.snowflake.com/s/article/Escalate-Button-FAQ) the case:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Support.
3. In the table of Support cases, select the row for the case that you want to escalate. The case details page opens.
4. Select Escalate Case.
5. Complete and submit the form.

### Resolve a Support case

When your business needs related to this case have been resolved, mark the case as resolved:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Support.
3. In the table of Support cases, select the row for the case that you want to resolve. The case details page opens.
4. Select Resolve Case.
5. Confirm that you want to resolve the case.

   The case is closed.

### Add watchers to a case

You can add users as watchers to an active case, or when you create the case. You can only add other
Support-enabled users as watchers. Your privileges determine the watchers available to you:

* Users with a role granted the MANAGE ACCOUNT SUPPORT CASES and
  MANAGE USER SUPPORT CASES privileges can add any support-enabled user in an account as a watcher.
* Users with a role granted the MANAGE ORGANIZATION SUPPORT CASES privilege can add any
  support-enabled users from any account in their organization as a watcher.

To add watchers to a case:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Support.
3. In the table of Support cases, select the row for the case that you want to view. The case details page opens.
4. On the right side of the case details page, select Watchers.
5. Select the users that you want to add as watchers.

> **Important:**
>
> In order to view Support cases as a watcher, a user added as a watcher must be granted a role with the MANAGE ACCOUNT SUPPORT CASES
> privilege. See Privileges required to access the Support system.

---
title: Manage Snowpipe in Snowsight
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-snowsight.md
section: User Guide
---

# Manage Snowpipe in Snowsight

You can use Snowsight to view [Snowpipe](data-load-snowpipe-intro.md) details and perform some pipe management tasks.

* Visualize the stages, pipes, and tables in a graph and understand the relationships and data lineage between these objects.
* View the complete information for any of your pipes about what was loaded (or partially loaded).
* Check if any of your pipes are failing, stalled, or stopped loading new data from files.
* Perform some pipe management tasks, such as pausing or resuming a pipe, dropping a pipe, transferring ownership of a pipe, and adding comments to a pipe.
* View the detailed status and copy history.

## Requirements

To view details about the pipe, you must use a role with the MONITOR or OWNERSHIP privilege on the pipe and the USAGE privilege on both the database and schema that contain the pipe. For more information, see [Pipe privileges](security-access-control-privileges.md).

## Pipe details

To view pipe details in Snowsight, take the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Locate the database and schema that contain the pipe.
4. Select the pipe to open the details.

The Pipe Details page includes information about the following:

* [Status](../sql-reference/functions/system_pipe_status.md). Examples: Running; Paused.
* The number of files pending in the pipe, if any.
* The date of last ingestion performed, if applicable.
* The warehouse. (Snowpipe always runs using Serverless compute resources.)
* The incoming [Notification channel](data-load-snowpipe-auto.md) to tell the Pipe when there are new files.
* Relationships between the stages, pipes, and tables in a graph.
* The latest copies performed.
* The SQL command used to create the pipe (Pipe Definition).
* The [Privileges](security-access-control-configure.md) granted on the pipe.

### Manage pipes

You can perform the following tasks from the Pipe Details page:

* To add a comment to the pipe, select  » Edit. To edit other properties of a pipe, you must use the
  [CREATE PIPE](../sql-reference/sql/create-pipe.md) SQL command to replace the pipe.
* To pause or resume the pipe, select  » Pause or Resume.
* To drop the pipe, select  » Drop.
* To transfer ownership of the pipe to another role, select  » Transfer Ownership.

For more information about managing pipes, see [Managing Snowpipe](data-load-snowpipe-manage.md).

## Copy history

To view the copy history for any of your pipes, go to the Pipe Details page, and select the Copy History tab.

The Copy History tab shows details including STATUS, DURATION, ROWS, SIZE, and FILE NAME.

The histogram displays up to 14 days of loading history and allows you to select from the following dimensions:

* Copies (default): the number of files loaded. Displays file counts grouped by status on a daily or hourly basis, helping to identify failed loads and monitor ingestion trends over time.
* Rows: the number of rows inserted. Aggregates row counts by day or hour, providing insights into data throughput trends.
* Duration: pipe ingestion duration. Shows the time taken for pipe ingestion (aggregated by day or hour), which represents the serverless compute time of your pipe and serves as a proxy for compute cost.

The pipe metrics section helps analyze health and throughput of your pipe with the following key metrics:

* Success rate: Percentage of files successfully loaded within the selected time range.
* Max ingestion gap: Highlights large gaps between ingestion cycles, making it easier to identify interruptions in continuous ingestion.
* Time since last ingestion: indicates the time elapsed since the most recent file was loaded.
* Min row count: identifies files with fewer rows than expected or empty files.
* Pending files: shows the number of detected files yet to be loaded into the table.

You can also choose to manually load files that haven’t been loaded by selecting the Manual Refresh option on the ellipsis drop-down menu on the top right corner of the page.

To search for individual files, use the search bar on the top right corner of the page. You can search by file name, status, or date.

---
title: Manage streams
source: https://docs.snowflake.com/en/user-guide/streams-manage.md
section: User Guide
---

# Manage streams

This topic describes the administrative tasks associated with managing streams.

## Enabling change tracking on views and underlying tables

In order for users to query change data on a view, change tracking must be enabled on the view and underlying tables.

Only the object owner (i.e. the role with the OWNERSHIP privilege) on a given view or underlying tables can enable change tracking.

The following options are available to enable change tracking:

1. Create a stream on the view using the view owner role. This action enables change tracking on the view.

   If the same role also owns the underlying tables, change tracking is also enabled on the tables. Otherwise, the table owner must explicitly
   enable change tracking on the tables. For these steps, see Explicitly Enable Change Tracking on the Underlying Tables (in this topic).
2. Explicitly enable change tracking on the view and tables. For instructions, see the remaining instructions in this section.

### Explicitly enable change tracking on views

Set the CHANGE_TRACKING parameter when creating a view (using CREATE VIEW) or later (using ALTER VIEW).

Note that change tracking must also be explicitly enabled on the underlying tables for a view. For instructions, see Explicitly Enable
Change Tracking on the Underlying Tables (in this topic).

For example, create a secure view in the current schema that selects a subset of rows from a table:

> ```sqlexample
> CREATE SECURE VIEW v CHANGE_TRACKING = TRUE AS SELECT col1, col2 FROM t;
> ```

For example, modify an existing view to enable change tracking:

> ```sqlexample
> ALTER VIEW v2 SET CHANGE_TRACKING = TRUE;
> ```

### Explicitly enable change tracking on the underlying tables

> **Important:**
>
> When either creating or altering a view to specify CHANGE_TRACKING, the associated dependent database objects are automatically
> updated to enable change tracking. During the operation, the underlying resources are locked, which can cause latency for DDL/DML operations.
> For more information, refer to [Resource locking](../sql-reference/transactions.md).
>
> If the user executing the statement has not specified a role with sufficient permissions (OWNERSHIP),
> the statement will fail, underlying database objects will not updated, and locks will be released.

Set the CHANGE_TRACKING parameter when creating a table (using CREATE TABLE) or later (using ALTER TABLE).

For example, to create a table in the current schema:

```sqlexample
CREATE TABLE t (col1 STRING, col2 NUMBER) CHANGE_TRACKING = TRUE;
```

For example,to modify an existing table to enable change tracking:

```sqlexample
ALTER TABLE t1 SET CHANGE_TRACKING = TRUE;
```

> **Important:**
>
> When either creating or altering a TABLE to specify CHANGE_TRACKING, the table is locked for the duration of the operation
> which can cause latency for DML operations. For more information, refer to [Resource locking](../sql-reference/transactions.md).

## Avoiding stream staleness

To prevent a stream from becoming stale, consume the stream records within a DML
statement during the table’s retention period and regularly consume its change data
before its STALE_AFTER timestamp (that is, within the extended data retention period
for the source object).. Additionally, calling
[SYSTEM$STREAM_HAS_DATA](../sql-reference/functions/system_stream_has_data.md) on the stream prevents it from
becoming stale, provided the stream is empty and the SYSTEM$STREAM_HAS_DATA function
returns `FALSE`.

> **Important:**
>
> When `SYSTEM$STREAM_HAS_DATA` returns `TRUE` for a stream, you should consume the stream in a DML operation, even if
> the result is a false positive. If you don’t consume the stream, `SYSTEM$STREAM_HAS_DATA`
> returns `TRUE`, and any tasks that use this function in their WHEN clause won’t skip execution. This results
> in unnecessary task runs and associated warehouse charges.
>
> To consume the stream efficiently when the result is a false positive — for example, querying the stream with
> `SELECT COUNT(*) FROM stream_name` returns no records — use a statement like the following example:
>
> ```sqlexample
> CREATE TEMPORARY TABLE _unused_table AS SELECT * FROM my_stream WHERE 1=0;
> ```
>
> This statement consumes the stream, because `CREATE TABLE AS SELECT` is a DML transaction. The `WHERE 1=0` clause
> filters out all data, so nothing gets processed. This advances the stream offset, and `SYSTEM$STREAM_HAS_DATA` returns
> `FALSE` until new changes occur.

For more information on data retention periods, see [Understanding & using Time Travel](data-time-travel.md).

To view the data retention period for a stream, execute the [DESCRIBE STREAM](../sql-reference/sql/desc-stream.md)
or [SHOW STREAMS](../sql-reference/sql/show-streams.md) command. The `stale_after` column timestamp indicates when
the stream is currently predicted to become stale (or when it became stale, if the timestamp is in
the past). This timestamp is calculated by adding the larger of the
[DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) or [MAX_DATA_EXTENSION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter
setting to the current timestamp. Note that if the timestamp is in the past, the stream might already
be stale. The `stale` column also indicates whether the stream is expected to be stale, though the
stream might not actually be stale yet.

Consuming the change data for a stream moves the STALE_AFTER timestamp forward.

For more information, see [Data retention period and staleness](streams-intro.md).

## View and manage streams in Snowsight

To view and manage a stream in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. For a specific database and schema, select Streams and select the stream you want to manage.

When viewing the stream in Snowsight, you can do the following:

* In the Details section, review the table name to which the stream applies, the type of stream, and whether or not the stream is stale.
* Review the SQL statement used to create the stream.
* Manage privileges on the stream. See [Manage object privileges with Snowsight](security-access-control-configure.md).

---
title: Manage the request approval workflow
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/request-approval-workflow.md
section: User Guide
---

# Manage the request approval workflow

The request approval workflow allows consumers to request access to Internal Marketplace organizational listings. This workflow reduces the time providers need to spend managing organizational listing access, and it provides consumers with quicker access to critical organizational listings.

When setting up the request approval workflow, providers can choose to manage organizational listing access requests within Snowsight, or they can provide an email or a URL that consumers can use to request access to an organizational listing. Allowing consumers to manage their organizational listing access requests within Snowsight simplifies the request process and makes sure organizational listing access requests are processed quickly.

All request approval workflow tasks are completed in Snowsight. As the functionality matures, programmatic options for managing the request approval workflow will become available.

The request approval workflow cannot be used to grant access to roles and users.

## Create a new organizational listing with a request approval workflow

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. Select + Create Listing.
4. Select a data product such as a table, view, or other data product to add to the listing.

   1. Review the generated share identifier, then select Generate listing.
5. Select + Access control.
6. Complete the Grant access section:

   > | Field | Description |
   > | --- | --- |
   > | Who can access this data product? | Select one of the following:  * Entire organization: Anyone in the organization can access the listing.  If Entire organization is selected and [cross-cloud auto-fulfillment](http://other-docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment) is enabled on your account, then you’ll be prompted to review the auto-fulfillment refresh settings for the listing. * Selected accounts and roles: Only selected accounts and roles can access. * No accounts or roles are pre-approved: (Default) Data product will only be available by request. |
   > | Accounts | If Selected accounts and roles is selected, select one or more accounts.  Optional. Select + Add another account to add second and subsequent accounts.  By default, all roles in the selected accounts can access the listing. Select Selected roles to grant access only to specific roles each selected account. |
7. Complete the Allow discovery section:

   > | Field | Description |
   > | --- | --- |
   > | Who else can discover the listing and request access? | Select one of the following:  * Entire organization: (Default) Anyone in the organization can discover the listing and request access. * Selected accounts and roles: Only selected accounts and roles can discover the listing and request access. * Not discoverable by users without access: Only users with access can discover the listing. |
   > | Accounts | If Selected accounts and roles is selected, select one or more accounts.  Optional. Select + Add another account to add additional accounts. |
   > | Selected user roles | If Selected roles is selected, enter one or more roles to grant access. |
8. Select Set up request approval flow and then select one of the following options in the How should the request approval happen list:

   * Manage requests in Snowflake: Consumers submit, review, and manage organizational listing access in Snowsight. Go to step 10.
   * Manage requests outside of Snowflake: Consumers request organizational listing access using the email address or URL you provide. Go to step 11.
9. If you selected Manage requests in Snowflake:

   1. In the Approver email for notifications field, enter the email address for organizational listing access submissions.
   2. Optional. To add additional organizational listing approvers, select Add Role and then select a role.
   3. Select Done.
10. If you selected Manage requests outside of Snowflake:

    1. In the Approver contact field, enter the email address or a URL for organizational listing access submissions.
    2. Select Done.
11. Select Save.
12. Add an organizational listing title:

    1. Select Untitled listing.
    2. In the Listing title field, enter a descriptive title for your organizational listing.
    3. Select Save.
13. Optional. Add supporting documentation, terms and conditions, and attributes.
14. Select Publish to make the listing available in the Internal Marketplace.

    If you exit without publishing, the listing is saved as a draft.

## Set up the request approval workflow in an existing organizational listing

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. On the Listings tab, select the listing you want to edit.
4. Select Edit in the Approver Contact area.
5. Select one of the following options in the How should the request approval happen list:

   * Manage requests in Snowflake: Consumers submit, review, and manage organizational listing access in Snowsight. Go to step 7.
   * Manage requests outside of Snowflake: Consumers request organizational listing access using the email address or URL you provide. Go to step 8.
6. If you selected Manage requests in Snowflake:

   1. In the Approver email for notifications field, enter the email address for organizational listing access submissions.
   2. Optional. Select Add Role to add additional organizational listing approvers.
   3. Select Done.
7. If you selected Manage requests outside of Snowflake:

   1. In the Approver contact field, enter the email address or a URL for organizational listing access submissions.
   2. Select Done.

## Respond to an organizational listing access request

As a provider, requests for organizational listing access are sent to the email address you specified when you set up the request approval workflow for an organizational listing.

> **Note:**
>
> To approve an organizational listing access request, you need access to the Snowflake account the request originated from, and a role that owns or can modify the organizational listing. If you don’t meet these requirements, the Review Request control within the request email is inoperative.

1. Open your email application, then locate and open the organizational listing access request.
2. Review the request details.
3. Select Review Request.

   The Internal Requests page in Snowsight opens.
4. Select the organizational listing request that matches the organizational listing the consumer requested in their email.
5. Review the details of the organizational listing access request.
6. Optional. To grant organizational listing access to a role different from what the consumer specified, select Give access to a different role from requested, and then select or enter a new role name in the Change requested role to field.

   The options available for the Change requested role to field are determined by the consumer account where the request originated.

   If the consumer’s organizational listing request and the organizational listing originate from the same account as the provider, a list of autopopulated roles is available. If the consumer’s organizational listing request and the organizational listing originate from a different account than the provider, the role name must be entered manually.

   Manually entered role names must be an exact match to the roles defined in Snowsight. Only a single role can be entered.

   Only roles with OWNERSHIP or MODIFY privileges on the organizational listing can approve organizational listing access requests. To increase the number of organizational listing access approvers, grant them the MODIFY privilege on the organizational listing.
7. Optional. Enter comments explaining your reasoning for granting or denying the organizational listing access request.
8. Select one of the following options:

   * Select Deny request to deny the organizational listing access request. An email is sent to the consumer indicating organizational listing access was denied.
   * Select Grant request to grant the organizational listing access request. An email is sent to the consumer indicating organizational listing access was granted.

## View the Snowsight Internal Requests page

As a provider, you can use the Internal Request page in Snowsight to grant, deny, and review previous organizational listing access requests.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Requests tab.
4. Optional. Select the Needs review tab, select an organizational listing access request, and then grant or deny the request.
5. Optional. Select the Resolved requests tab, select a previous organizational listing access request, and then review the request details. You can use the Status list to filter previous organizational listing requests by their status.

## Request access to an organizational listing

As a consumer, you can quickly request access to an organizational listing that you want to access in the Internal Marketplace.

> **Note:**
>
> To request access to an organizational listing, your Snowsight user profile must be complete and include a valid email address. See [Manage your user settings in Snowsight](../../../ui-snowsight-profile.md).

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Search for an organizational listing and then select it.
4. Select Request access.
5. Select the role you are using to access the organizational listing.
6. Enter the reason why you’re requesting access to the organizational listing in the Reason for access field.
7. Select Submit request.
8. Select Submit request to close the Request sent dialog.

## View the status of an organizational listing access request

As a consumer, you can check the status of an active organizational listing access request at any time. You can also review when and why a previous organizational listing access request was denied.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Search for the organizational listing you’re waiting to access and then select it.
4. Select View request or View previous request if previous access was denied.
5. Review the details of your organizational listing access request.
6. Select Close.

## Access an approved organizational listing

As a consumer, a notification that your organizational listing access request was approved or denied is sent to the email address specified in your Snowsight user profile.

1. Open your email application and then locate and open the organizational listing access request.
2. Review the request details.
3. Select Review Request.

   The landing page for the organizational listing opens in Snowsight.
4. Select Query in worksheet to access the organizational listing.
5. Optional. To request access to an approved organizational listing for a different role, select a different role, and then select Request access.

## Withdraw an organizational listing access request

As a consumer, you can cancel an organizational listing access when it’s no longer required, or you need to update the access request information.

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Search for the organizational listing access request you want to cancel and then select it.
4. Select Withdraw request.
5. Select Confirm.

## Specify the request approval type programmatically

You can specify the request approval type programmatically using the `request_approval_type` parameter.

`request_approval_type` (Optional)
:   You must specify one of the following with `request_approval_type` to define whether the request and approval will happen inside or outside of Snowflake:

    * `REQUEST_AND_APPROVE_IN_SNOWFLAKE`: Consumers submit, review, and manage organizational listing access in Snowsight.
    * `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE`: Consumers request organizational listing access using the email address or URL you provide.

    The following is an example of the format:

    ```yaml
    . . .
    request_approval_type: "REQUEST_AND_APPROVE_IN_SNOWFLAKE"
    . . .
    ```

---
title: Manage users in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/manage-users.md
section: User Guide
---

# Manage users in Snowflake Open Catalog

A service admin can perform the following actions to manage users through the Snowflake Open Catalog web interface:

* Create and drop users.
* Grant and revoke user roles. For more information about user roles, see [User roles](access-control.md).

If you created a Snowflake Open Catalog account, you are a service admin in the account.

## Create a user

1. Sign in to Snowflake Open Catalog.
2. From the menu on the left, select **Users**.
3. Select **+ User**.
4. For **User Name**, enter a unique identifier for the user.

   The user signs in to Snowflake Open Catalog with this identifier.
5. Optional: For **Email**, specify an email address for the user.
6. Optional: To grant the service admin role to the user, move the **Grant service admin** toggle to **On**.
7. For **Password** and **Verify Password**, enter the password for the user.
8. Select **Create**.

## Grant to a user the catalog admin role to a catalog

You can only grant the catalog admin role to a catalog that you created.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. Select the relevant catalog admin user.
4. Select **+ Grant catalog admin**.
5. Grant user access to a catalog:

   1. From the drop down menu, select the catalog you want to grant the catalog admin access to.
   2. Select **Add**.

   **Note**

   > You can only grant catalog admin access to catalogs that you’ve created, not to catalogs
   > created by other service admins.
6. Optional: Repeat the previous step to grant catalog admin access to additional catalogs.
7. Select **Close**.

## Revoke your own catalog admin role to a catalog

A service admin user can revoke their catalog admin privileges to a catalog that they created.

### Step 1: Revoke your service admin role

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. From the list of users, select the relevant service admin user.
4. Move the **Service admin** toggle to **Off**.
5. Select **Revoke**.
6. Select **Close**.

### Step 2: Revoke your catalog admin privileges from the catalog

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. From the list of users, select your user account.
4. In **Granted catalog admin on**, select **x** next to the relevant catalog.
5. Select **Revoke**.
6. Select **Close**.

## Revoke a user’s catalog admin role for a catalog

When you revoke a user’s catalog admin privileges to a catalog, the user can no longer manage or access the catalog.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. Select the user whose catalog privileges you want to revoke.
4. From the **Granted catalog admin on** field, select **x** next to the relevant catalog.
5. Select **Revoke**.
6. Select **Close**.

## Grant to a user the service admin role

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. Select the relevant user.
4. Move the **Grant service admin** toggle to **On**.
5. Select **Grant**.
6. Select **Close**.

## Revoke a user’s service admin role

**Note**

> If needed, you can revoke the service admin role granted to your own user account.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. From the list of users, select the relevant user.
4. Move the **Service admin** toggle to **Off**.
5. Select **Revoke**.
6. Select **Close**.

## Drop a user

Dropping a user removes the user credentials from Open Catalog.

> **Important:**
>
> Before you drop a user, determine whether that user created any catalogs that only they have catalog admin privileges to. If so, that user
> must first grant those privileges to another user. Otherwise, no other user can access the catalog. Snowflake customer support *cannot*
> provide access to the catalog.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Users**.
3. From the list of users, select the user you want to drop.
4. Select **Drop User**.
5. Select **Drop**.

---
title: Manage your user settings in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-profile.md
section: User Guide
---

# Manage your user settings in Snowsight

When you manage your user profile in Snowsight, you can add user details, change your password, select a default
language, configure notifications, enroll in multi-factor authentication (MFA), verify your email address, and more.

## Add user details to your user profile

To access your user profile:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the user menu, select your username.
3. From the user menu, select Settings.
4. In Profile, you can review and set the following user details:

   * Profile picture
   * Username (cannot be changed)
   * First name
   * Last name
   * Email

   When possible, ensure that your user profile includes a first name, last name, and email address. These details are required to complete
   some tasks in Snowflake, such as accepting the terms of service for the Snowflake Marketplace. If you cannot set these preferences for your
   user, contact an account administrator.

## Set Snowsight display preferences

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. In Preferences, you can review and set the following user details:

   * Theme: Select the display mode for Snowsight - Dark, Light, or System. The mode controls the appearance of text,
     background color, and other visual elements.
   * Language: Select the language to use for Snowsight.
   * Default role: Select a role to use by default when you use Snowsight.
   * Default warehouse: Select a warehouse to use by default when you use Snowsight. Snowflake uses the
     default warehouse to display pages that you view in Snowsight and, unless another warehouse is specified, run worksheets and
     dashboards.
   * Enhance Cortex-powered column descriptions with sample data: Select this option on to automatically generate descriptions for a column, table, or view using
     sample values from columns. This option applies to all Cortex-generated descriptions during your current session.

## Send a notification when a query completes

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. In Query history, specify whether to send a browser notification when a query finishes running in the background. When you set this
   preference for the first time, your browser prompts you to allow notifications from Snowflake.

   If your active role has access to set up resource monitor notifications, you can also select a checkbox to set up Email notifications from resource monitors.

## Enable notifications from Trust Center

To enable notifications about your account from Trust Center that display in Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. Select Notifications.
4. In the Trust center section, select either of the following options:

   * Strong authentication notification
   * Weekly digest

When the selection displays blue, the notification option is enabled.

For more information, see [Snowsight and MFA](ui-snowsight-gs.md).

## View notifications from Trust Center

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, next to your name and role, select .
3. In Notifications, select a notification listed on either Unread or All.
   The notification details open in Trust Center. For information about remediation options, see
   [Remediate security risks](trust-center/using-the-trust-center.md).

## Disable notifications from Trust Center

To disable notifications about your account from Trust Center that display in Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. Select Notifications.
4. In the Trust center section, select either of the following options:

   * Strong authentication notification
   * Weekly digest

When the selection displays grey, the notification option is disabled.

## Change your user password

To change your password:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. Select Authentication.
4. In the General section, select Change password.
5. Enter your current password.
6. Enter a new password, and confirm your new password.
7. Select Confirm.

Your new password must comply with the password policy. See [Snowflake-provided password policy](password-authentication.md).

## Enroll in multi-factor authentication (MFA)

[MFA](security-mfa.md) provides increased login security for users connecting to Snowflake.

> **Note:**
>
> If you have not previously enabled and configured MFA, Snowsight will automatically suggest you enable it.
> You can dismiss the request to configure MFA, however you will be re-prompted every three days.

To enroll in MFA:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. From the user menu, select Settings.
3. Select Authentication.
4. In the Multi-factor authentication section, select Add new authentication method.
5. Follow the prompts to configure your second factor of authentication. You are enrolled in MFA when you configure one of these MFA methods.

After you enroll, each time you attempt to sign in to Snowflake you are prompted to enter your required user credentials
(login name and password) and then prompted to use a second factor of authentication.

> **Important:**
>
> If you cannot sign in to Snowflake due to an MFA issue, for example, you don’t have access to your phone, contact one of the account
> administrators for your Snowflake account. They can temporarily disable MFA so that you can log in or reset the MFA methods that you use
> to authenticate. For more information, see [Recovering a user who is locked out](security-mfa.md).

### Disabling MFA

After you enroll in MFA, you cannot use Snowsight to disable MFA. Contact your account administrator.

## Generate a programmatic access token

Create a token to authenticate into Snowflake endpoints such as Snowflake REST APIs, Snowflake SQL API, the Snowflake Catalog SDK or Snowpark Container Services.
See [Generating a programmatic access token](programmatic-access-tokens.md).

## Verify your email address

To verify the email address associated with your Snowflake user account, follow these steps:

In some cases, you automatically receive an email prompting you to Please Validate Your Email. If you didn’t, follow these
steps to verify your email address:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. In My Profile, configure your email address:

   * If you don’t have an email address listed, enter an email address in the Email field, and then select Save.
   * If you can’t enter an email address, an account administrator must either add an email address on your behalf or grant your user
     the role with the OWNERSHIP privilege on your user.
   * If you didn’t receive an email, select Resend verification email. Snowflake sends a verification email to the address listed.
4. Open your email, and then select the link in the email to validate your email address.

You must verify your email address before you can receive email notifications for resource monitors.

## Specify appearance

Snowsight supports multiple appearance modes, including what is often referred to as dark mode.
Modes let you select how text, background color, and other aspects of how Snowsight is presented.

Snowsight supports the following modes:

| Mode | Description |
| --- | --- |
| Light | Display dark text on a light background. Light mode is typically used in normal sunlight conditions. |
| System | Set display settings based on the setting specified in the operating system. For example, in Apple OSX, match the appearance to the appearance system setting found in the Apple menu » System Settings » Appearance dialog. |
| Dark | Display light text on a dark background to reduce eye strain in low-light conditions. |

Note that the appearance setting is persistent at the user level.
For example, if you choose Light for your appearance setting, then it will still be Light the next time you log in.

To specify appearance:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the user menu, select your username.
3. Select Appearance and your preferred appearance setting: Light, System, or Dark.

> **Note:**
>
> When you first log in to Snowsight, because the appearance setting isn’t set by default, you are asked to select one of the three appearance modes.

---
title: Managing account URLs
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-urls.md
section: User Guide
---

# Managing account URLs

When an account is renamed or has its organization modified, it is assigned a new [account URL](organizations-connect.md) that is
used to connect to the account. Whether users can continue to use the original URL to access the account depends on how the account changed
and choices made when it changed:

Renamed account:
:   When the organization administrator renames an account, the original account URL is saved by default so users can continue to access the
    account with it. If saved, there is no limit on how long users can continue to use the original URL.

    During account renaming, the administrator can change the default to delete the original URL and prevent users from accessing the
    account with it.

    If the original URL was saved, but now you want to delete it, see Deleting the account URL for a renamed account.

Modified organization:
:   When Snowflake Support modifies the organization of an account, they can save or delete the original account URL. If saved, the
    original URL is referred to as the “old organization URL”. This URL can be used to access the account for 90 days, at which time it is
    deleted.

    If the original URL was saved but you want to delete it before the 90 days expires, see Deleting an organization URL.

## Deleting the account URL for a renamed account

[As the organization administrator](organization-administrators.md), you can use [Snowsight](ui-snowsight-gs.md) or SQL to delete an
old account URL that was saved when the account was renamed:

Snowsight:
:   1. In the navigation menu, select Admin » Accounts.
    2. Find the active account, and select … » Manage Urls.
    3. In the Previous Account URL section, select Delete URL.
    4. Select Delete.

SQL:
:   Execute the [ALTER ACCOUNT … DROP OLD URL](../sql-reference/sql/alter-account.md) command. For example, to drop the original
    account URL for an account that was renamed, execute:

    ```sqlexample
    ALTER ACCOUNT my_account1 DROP OLD URL;
    ```

## Deleting an organization URL

An “old organization URL” refers to the original account URL of an account whose organization has changed in one of the following ways:

* Organization was renamed.
* Organization was merged with another organization.
* Account was moved from one organization to another.

If the original account URL is saved during one of these events, users can continue to use the original URL for 90 days, at which time
it is deleted.

[As the organization administrator](organization-administrators.md), you can use Snowsight or SQL to delete the
account URL before the 90 days expire:

Snowsight:
:   1. In the navigation menu, select Admin » Accounts.
    2. Find the active account, and select … » Manage Urls.
    3. In the Previous Organization URL section, select Delete URL.
    4. Select Delete.

SQL:
:   Execute the [ALTER ACCOUNT … DROP OLD ORGANIZATION URL](../sql-reference/sql/alter-account.md) command for the account.

    For example, to drop the original account URL for an account that had its organization modified, execute:

    ```sqlexample
    ALTER ACCOUNT my_account1 DROP OLD ORGANIZATION URL;
    ```

---
title: Managing accounts in your organization
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts.md
section: User Guide
---

# Managing accounts in your organization

An organization administrator manages the lifecycle of every account that belongs to the organization, from creating a new account to
deleting it.

Within this lifecycle, the organization administrator can:

| Task | Description |
| --- | --- |
| [Create an account](organizations-manage-accounts-create.md) | Add an account to the organization. |
| [View a list of accounts](organizations-manage-accounts-view.md) | Obtain a list of the accounts that currently belong to the organization. |
| [Rename an account](organizations-manage-accounts-rename.md) | Change the name of an account and specify whether users can access the account using the original URL. |
| [Manage account URLs](organizations-manage-accounts-urls.md) | Understand what happens to an [account URL](organizations-connect.md) when an account is renamed or has its organization modified, and delete old account URLs when necessary. |
| [Work with the account edition](organizations-manage-accounts-editions.md) | View the current Snowflake edition of an account. |
| [Drop an account](organizations-manage-accounts-delete.md) | Removes an account from the organization. |

---
title: Managing cost in Snowflake
source: https://docs.snowflake.com/en/user-guide/cost-management-overview.md
section: User Guide
---

# Managing cost in Snowflake

Approaching Snowflake cost using the cost management framework described in this topic allows you to manage costs more effectively.
Each part of the framework offers powerful features that help minimize total cost of ownership while maximizing the
economic value that Snowflake provides.

## Cost management framework

Effective Snowflake cost management is divided into three parts: visibility, control, and optimization.

### Visibility

Visibility includes understanding the different sources of cost and the ability to explore that cost in detail. Visibility also includes
attributing cost to the right entities within your organization and monitoring costs as they accumulate so you can avoid unexpected costs.

Understand:
:   Gaining visibility into your Snowflake cost begins with understanding the basic concepts of Snowflake cost, including
    the different usage types that incur cost and the factors that determine the cost of using Snowflake resources.
    [Learn More](cost-understanding-overall.md)

Explore:
:   Once you have a good understanding of how costs accumulate in Snowflake, you are ready to explore your current Snowflake costs.
    Snowsight provides pre-built dashboards that help you visualize the cost of your Snowflake usage. If you would like to gather
    more details about your Snowflake cost, you can write custom queries against the Organization Usage and Account Usage schemas, which
    contain views dedicated to usage and cost. [Learn More](cost-exploring-overall.md)

Attribute:
:   The ability to chargeback cost to different entities within your organization clarifies who is incurring costs and for what
    purpose. This visibility can inform decisions on how to implement cost-saving measures. [Learn More](cost-attributing.md)

### Control

Snowflake provides features that let you monitor credit usage, which helps you control how much is spent during a given time period.
Budgets let you control costs for both serverless features and warehouses, while resource monitors focus solely on warehouses. Snowflake also
helps you set cost controls so you don’t spend more than expected. For example, by setting limits on how long a query can run
before it’s terminated, you can avoid unexpected costs associated with runaway queries. [Learn More](cost-controlling.md)

### Optimization

Snowflake provides tools that provide insights into how you might save and help identify significant changes in daily costs so you can
investigate in order to prevent cost spikes in the future. [Learn More](cost-optimize.md).

---
title: Managing external volumes in your Snowflake Account
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-managing-external-volumes.md
section: User Guide
---

# Managing external volumes in your Snowflake Account

These topics describe the tasks associated with managing external volumes in your Snowflake Account

* [Configure an external volume](tables-iceberg-configure-external-volume.md)

  > Instructions for configuring an external volume.
* [Drop an external volume by using Snowsight](tables-iceberg-drop-external-volume.md)

  > Instructions for dropping an external volume.

---
title: Managing integrations in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-integrations.md
section: User Guide
---

# Managing integrations in Snowsight

Snowflake supports a large variety of integrations that allow you to connect to external services and data sources.

This topic provides an overview of the different types of integrations and how to manage them using Snowsight.

Supported integrations include:

* [API integrations](../sql-reference/sql/create-api-integration.md) supporting integrating services reached via HTTPS API, including information about the cloud platform, types of services, access credentials, and more.
* [Catalog integrations](../sql-reference/sql/create-catalog-integration.md) supporting integrating [Apache Iceberg™ tables](tables-iceberg.md).
* [External integrations](../sql-reference/sql/create-external-access-integration.md) to enable access to specific external network locations, including network rules and credentials.
* [Notification integration](../sql-reference/sql/create-notification-integration.md) providing an interface between Snowflake and third-party messaging services.
* [Security integrations](../sql-reference/sql/create-security-integration.md) for external access to services that require authentication and authorization.
* [Storage integration](../sql-reference/sql/create-storage-integration.md) for storing generated identity and access management (IAM) entity for your external cloud storage.

To manage integrations in Snowsight:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Integrations. The list of integrations is displayed.

Then:

* Create a new integration:

  1. Click Create and select the integration type to create.
  2. Complete the associated template to create the integration.
* View, disable/enable, drop, or transfer ownership of an integration:

  1. Select an integration and click …
  2. Select one of Disable, Enable, Drop, or Transfer Ownership.
  3. Confirm the operation or cancel.
* Grant a privileges to select roles for a given integration:

  > **Note:**
  >
  > You must be the owner of an integration to grant privileges on that integration.

  1. Click anywhere in an integration row. The details of the selected integration are displayed.
  2. in the Privileges section, click + Privilege. The Grant new privilege on <integration> dialog box is displayed.
  3. From the ROLES drop down select a role.
  4. From the privileges drop down select either USAGE, USE_ANY_ROLE or both.
  5. Click Grant Privileges to grant the privilege to the selected role or Cancel to cancel the operation.

---
title: Managing regular data loads
source: https://docs.snowflake.com/en/user-guide/data-load-considerations-manage.md
section: User Guide
---

# Managing regular data loads

This topic provides best practices, general guidelines, and important considerations for managing regular data loads.

## Partitioning staged data files

When planning regular data loads such as ETL (Extract, Transform, Load) processes or regular imports of machine-generated data, it is important to partition the data in your internal (i.e. Snowflake)
stage or external locations (S3 buckets or Azure containers) using logical, granular paths. Create a partitioning structure that includes identifying details such as application or location, along
with the date when the data was written. You can then copy any fraction of the partitioned data into Snowflake with a single command. You can copy data into Snowflake by the hour, day, month, or even
year when you initially populate tables.

Some examples of partitioned S3 buckets using paths:

> `s3://bucket_name/application_one/2016/07/01/11/`
>
> `s3://bucket_name/application_two/location_one/2016/07/01/14/`

Where:

`application_one` , `application_two` , `location_one` , etc.
:   Identifying details for the source of all data in the path. The data can be organized by the date when it was written. An optional 24-hour directory reduces the amount of data in each directory.

    > **Note:**
    >
    > S3 transmits a directory list with each COPY statement used by Snowflake, so reducing the number of files in each directory improves the performance of your COPY statements. You may even consider
    > creating subfolders of 10-15 minute increments within the folders for each hour.

Similarly, you can also add a path when you stage files in an internal stage. For example:

> ```sqlexample
> PUT file:///tmp/file_20160701.11*.csv @my_stage/<application_one>/<location_one>/2016/07/01/11/;
> ```

## Loading staged data

Load organized data files into Snowflake tables by specifying the precise path to the staged files. For more information, see [Organizing data by path](data-load-considerations-stage.md).

## Removing loaded data files

When data from staged files is loaded successfully, consider removing the staged files to ensure the data is not inadvertently loaded again (duplicated).

> **Note:**
>
> Do not remove the staged files until the data has been loaded successfully. To check if the data has been loaded successfully, use the [COPY_HISTORY](../sql-reference/functions/copy_history.md) command. Check the `STATUS` column to determine if the data from the file has been loaded. Note that if the status is `Load in progress`, removing the staged file can result in partial loads and data loss.

Staged files can be deleted from a Snowflake stage (user stage, table stage, or named stage) using the following methods:

* Files that were loaded successfully can be deleted from the stage during a load by specifying the PURGE copy option in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command.
* After the load completes, use the [REMOVE](../sql-reference/sql/remove.md) command to remove the files in the stage.

Removing files ensures they aren’t inadvertently loaded again. It also improves load performance, because it reduces the number of files that COPY commands must scan to verify whether existing files in
a stage were loaded already.

---
title: Managing session policies
source: https://docs.snowflake.com/en/user-guide/session-policies-managing.md
section: User Guide
---

# Managing session policies

This topic describes Snowflake sessions and session policies and provides instructions for configuring session policies at the account or
user level.

## Session policy privileges

Snowflake supports the following session policy privileges to determine whether users can create, set, and own session policies.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| CREATE SESSION POLICY | Enables creating a new session policy in a schema. |
| APPLY SESSION POLICY | Enables applying any session policy at the account or user level. |
| OWNERSHIP | Grants full control over the session policy. Required to alter most properties of a session policy. |

## Summary of commands, operations, and privileges

The following table summarizes the relationship between the session policy DDL operations and their necessary privileges.

| Operation | Privilege required |
| --- | --- |
| Create session policy | A role with the CREATE SESSION POLICY privilege on the schema. |
| Alter session policy | A role with the OWNERSHIP privilege on the session policy. |
| Drop session policy | A role with the OWNERSHIP privilege on the session policy. |
| Describe session policy | A role with the OWNERSHIP privilege on the session policy or . the APPLY SESSION POLICY privilege on the account. |
| Show session policies | A role with the OWNERSHIP privilege on the session policy or . the APPLY SESSION POLICY privilege on the account. |
| Set & unset session policy | For accounts, a role with the APPLY SESSION POLICY privilege on the account and the OWNERSHIP privilege on the session policy, or a role with the APPLY SESSION POLICY privilege on the account and the APPLY privilege on a specific session policy.  For users, a role with the APPLY SESSION POLICY privilege on the user. |

## Session Policy DDL Reference

Snowflake provides the following DDL commands to manage session policy objects:

* [CREATE SESSION POLICY](../sql-reference/sql/create-session-policy.md)
* [ALTER SESSION POLICY](../sql-reference/sql/alter-session-policy.md)
* [DROP SESSION POLICY](../sql-reference/sql/drop-session-policy.md)
* [SHOW SESSION POLICIES](../sql-reference/sql/show-session-policies.md)
* [DESCRIBE SESSION POLICY](../sql-reference/sql/desc-session-policy.md)

To set or unset a session policy on the account, execute the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command as shown below.

> ```sqlexample
> ALTER ACCOUNT SET SESSION POLICY mydb.policies.session_policy_prod_1;
> ```
>
> ```sqlexample
> ALTER ACCOUNT UNSET SESSION POLICY;
> ```

To set or unset a user-level session policy, execute the [ALTER USER](../sql-reference/sql/alter-user.md) command as shown below.

> ```sqlexample
> ALTER USER jsmith SET SESSION POLICY mydb.policies.session_policy_prod_1_jsmith;
> ```
>
> ```sqlexample
> ALTER USER jsmith UNSET SESSION POLICY;
> ```

## Auditing session policies

* You can query the [SESSION_POLICIES view](../sql-reference/account-usage/session_policies.md) view to return a row for each session policy and its metadata in
  your Snowflake account.
* You can call the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) function to return a row for each user that is assigned to the
  specified session policy and a row for the session policy assigned to the Snowflake account.

  Currently, only the following syntax is supported for session policies:

  > ```sqlsyntax
  > POLICY_REFERENCES( POLICY_NAME => '<session_policy_name>' )
  > ```

  Where `session_policy_name` is the fully qualified name of the session policy.

  For example, execute the following query to return a row for each user that is assigned the session policy named
  `session_policy_prod_1`, which is stored in the database named `my_db` and the schema named `my_schema`:

  > ```sqlexample
  > SELECT *
  > FROM TABLE(
  >   my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
  >     POLICY_NAME => 'my_db.my_schema.session_policy_prod_1'
  >   ));
  > ```

## Troubleshooting session policies

* If a session policy is assigned to an account or a user and the database or schema that contains the session policy is dropped, and then
  a new session policy is assigned to the account or user, the user will not be held to the idle session timeout value(s) in the new
  session policy.

  The workaround is to unset the original session policy from the account using an ALTER ACCOUNT command or from the user using an ALTER
  USER command as shown in this topic.
* The following table summarizes some error messages that can occur with session policies.

  | Behavior | Error Message | Troubleshooting Action |
  | --- | --- | --- |
  | Cannot create a session policy. | Cannot perform CREATE SESSION POLICY. This session does not have a current database. Call ‘USE DATABASE’, or use a qualified name. | Specify a database prior to executing CREATE SESSION POLICY or use the fully qualified object name in the CREATE SESSION POLICY statement. |
  | Cannot create a session policy. | SQL access control error: Insufficient privileges to operate on schema ‘<schema_name>’ | Verify that the role executing the CREATE SESSION POLICY statement has the CREATE SESSION POLICY on SCHEMA privilege. |
  | Cannot create a session policy. | SQL compilation error: Database ‘<database_name>’ does not exist or not authorized. | Verify that the database exists and that the role executing the CREATE SESSION POLICY statement has the USAGE privilege on the schema in which the session policy should exist. |
  | Cannot execute a describe statement. | SQL compilation error: Schema ‘<schema_name>’ does not exist or not authorized. | Verify that the role executing the DESC SESSION POLICY statement has the OWNERSHIP privilege on the session policy or the APPLY privilege on the session policy. |
  | Cannot drop a session policy. | SQL compilation error: Session policy ‘<policy_name>’ does not exist or not authorized. | Verify that the role executing the DROP SESSION POLICY statement has the OWNERSHIP privilege on the session policy. |
  | Cannot drop a session policy. | Session policy <policy_name> cannot be dropped because it is attached to an account. | Unset the session policy from the account with an [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) statement and try the drop statement again. |
  | Cannot set a session policy on an account. | Session policy ‘<policy_name> is already attached to account <account_name>. | An account can only have one active session policy. Determine which session policy should be set for the account. . If necessary, unset the current session policy from the account with a [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command; then set the other session policy on the account with another ALTER ACCOUNT command. |
  | Cannot set a timeout value. | SQL compilation error: invalid value ‘<integer>’ for property ‘session_idle_timeout_mins’ | The session timeout value, in minutes, must be an integer between `5` and `240`, inclusive. . Choose a valid integer for the session timeout and execute the CREATE or ALTER SESSION POLICY statement again. |
  | Cannot update an existing session policy. | SQL compilation error: Session policy ‘<policy_name>’ does not exist or not authorized. | Verify the name of the session policy, the syntax of the ALTER SESSION POLICY command, and the privileges to operate on the session policy, database, and schema. |

---
title: Managing Snowpipe
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-manage.md
section: User Guide
---

# Managing Snowpipe

This topic describes the administrative tasks associated with managing Snowpipe.

## Deleting staged files after Snowpipe loads the data

Pipe objects do not support the PURGE copy option. Snowpipe cannot delete staged files automatically when the data is successfully loaded into tables.

To remove staged files that you no longer need, we recommend periodically executing the [REMOVE](../sql-reference/sql/remove.md) command to delete the files.

Alternatively, configure any lifecycle management features provided by your cloud storage service provider.

## Loading historic data

> **Note:**
>
> The information in this section pertains to automated data loads using event notifications. Calls to the Snowpipe REST API can load historic data without the need for additional steps.

An [ALTER PIPE … REFRESH](../sql-reference/sql/alter-pipe.md) statement copies a set of data files staged within the previous 7 days to the Snowpipe ingest queue for loading into the target table. If you want to load data from files staged earlier, we recommend the following steps:

1. Load the historic data into the target table by executing a [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement.
2. Configure automatic data loads using Snowpipe with event notifications. Files that are newly staged will trigger event notifications for ingestion into the target table. Because the historic data files do not trigger event notifications, they are not loaded twice.

   For instructions, see:

   Amazon S3:
   :   [Automating Snowpipe for Amazon S3](data-load-snowpipe-auto-s3.md)

   Google Cloud Storage:
   :   [Automating Snowpipe for Google Cloud Storage](data-load-snowpipe-auto-gcs.md)

   Microsoft Azure:
   :   [Automating Snowpipe for Microsoft Azure Blob Storage](data-load-snowpipe-auto-azure.md)
3. Execute an ALTER PIPE … REFRESH statement to queue any files staged in-between Steps 1 and 2. The statement checks the load history for both the target table and the pipe to ensure the same files are not loaded twice.

## Recreating pipes

Recreating a pipe (using a [CREATE OR REPLACE PIPE](../sql-reference/sql/create-pipe.md) statement) is necessary to modify most pipe
properties.

This section describes considerations and best practices to follow when recreating pipes.

### Recreating pipes for automated data loads

When recreating a pipe that automates data loads using event notifications, we recommend that you complete the following steps:

1. Pause the pipe (using [ALTER PIPE … SET PIPE_EXECUTION_PAUSED = true](../sql-reference/sql/alter-pipe.md)).
2. Query the [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function and verify that the pipe execution state is `PAUSED`.
3. Recreate the pipe (using CREATE OR REPLACE PIPE).
4. Pause the pipe again.
5. Review the configuration steps for your cloud messaging service to ensure the settings are still accurate:

   * [Automating Snowpipe for Amazon S3](data-load-snowpipe-auto-s3.md)
   * [Automating Snowpipe for Google Cloud Storage](data-load-snowpipe-auto-gcs.md)
   * [Automating Snowpipe for Microsoft Azure Blob Storage](data-load-snowpipe-auto-azure.md)
6. Resume the pipe (using ALTER PIPE … SET PIPE_EXECUTION_PAUSED = false).
7. Query the SYSTEM$PIPE_STATUS function again and verify that the pipe execution state is `RUNNING`.

### Load history

The load history for Snowpipe operations is stored in the metadata of the pipe object. When a pipe is recreated, the load history is
dropped. In general, this condition only affects users if they subsequently execute an
[ALTER PIPE … REFRESH](../sql-reference/sql/alter-pipe.md) statement on the pipe. Doing so could load duplicate data from staged files
in the storage location for the pipe if the data was already loaded successfully and the files were not deleted subsequently.

## Changing the cloud parameters of the referenced stage

The cloud parameters of an external stage include the following:

* `URL`
* `STORAGE_INTEGRATION`
* `ENCRYPTION`

After Snowpipe has been configured successfully, if you need to modify any of the cloud parameters of the referenced stage, you must recreate the pipe.

> **Warning:**
>
> Modifying the `URL` parameter of a stage can cause any reliant pipes that leverage cloud messaging to trigger data loads (i.e. where `AUTO_INGEST = TRUE`) to stop working.

## Transferring pipe ownership

Complete the following steps to transfer ownership of a pipe:

1. Set the [PIPE_EXECUTION_PAUSED](../sql-reference/parameters.md) parameter to TRUE.

   This parameter enables pausing or resuming a pipe. The parameter is supported at the following levels:

   * Account
   * Schema
   * Pipe

   At the pipe level, the object owner (or a parent role in a role hierarchy) can set the parameter to pause or resume an individual pipe.

   An account administrator (user with the ACCOUNTADMIN role) can set this parameter at the account level to pause or resume all pipes in the account. Likewise, a user with the MODIFY privilege on the schema can pause or resume pipes at the schema level. Note that this larger domain control only affects pipes for which the parameter was not already set at a lower level; e.g., by the owner at the object level.
2. Transfer ownership of the pipe using [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md).
3. Force the pipe to resume (using [SYSTEM$PIPE_FORCE_RESUME](../sql-reference/functions/system_pipe_force_resume.md)).

   This step allows the new owner to evaluate the pipe status and determine how many data files are waiting to be loaded using [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md). We recommend verifying that only files approved for loading into the target table are queued.

## Modifying the COPY statement in a pipe definition

Complete the following steps to modify the COPY statement in a pipe definition; for example, when columns are added to the target table.

To execute the commands in this section, the current role for the user must have the OWNERSHIP privilege on the pipe.

1. Pause the pipe (using [ALTER PIPE … SET PIPE_EXECUTION_PAUSED=true](../sql-reference/sql/alter-pipe.md)).
2. Query the [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function and verify that the pipe execution state is `PAUSED` and the pending file count is 0.
3. Recreate the pipe to change the COPY statement in the definition. Choose either of the following options:

   * Drop the pipe (using [DROP PIPE](../sql-reference/sql/drop-pipe.md)) and create it (using [CREATE PIPE](../sql-reference/sql/create-pipe.md)).
   * Recreate the pipe (using the [CREATE OR REPLACE PIPE](../sql-reference/sql/create-pipe.md) syntax). Internally, the pipe is dropped and created.
4. Pause the pipe again.
5. Review the configuration steps for your cloud messaging service to ensure the settings are still accurate:

   * [Automating Snowpipe for Amazon S3](data-load-snowpipe-auto-s3.md)
   * [Automating Snowpipe for Google Cloud Storage](data-load-snowpipe-auto-gcs.md)
   * [Automating Snowpipe for Microsoft Azure Blob Storage](data-load-snowpipe-auto-azure.md)
6. Resume the pipe (using ALTER PIPE … SET PIPE_EXECUTION_PAUSED = false).
7. Query the SYSTEM$PIPE_STATUS function again and verify that the pipe execution state is `RUNNING`.

> **Note:**
>
> The file loading metadata is associated with the pipe object rather than the table. Recreating the pipe removes the history of files loaded. Ensure that files already loaded by Snowpipe are not accidentally resubmitted to the pipe and loaded into the target table again. To view the query history for a table, query the [COPY_HISTORY](../sql-reference/functions/copy_history.md) function.

## Resuming a stale pipe

> **Note:**
>
> This section only pertains to pipe objects that leverage cloud messaging to trigger data loads (i.e. where `AUTO_INGEST = TRUE` in
> the pipe definition).

When a pipe is paused, event messages received for the pipe enter a limited retention period. The period is 14 days by default. If a pipe
is paused for longer than 14 days, it is considered stale.

To resume a stale pipe, a qualified role must call the [SYSTEM$PIPE_FORCE_RESUME](../sql-reference/functions/system_pipe_force_resume.md)
function and input the STALENESS_CHECK_OVERRIDE argument. This argument indicates an understanding that the role is resuming a stale pipe.

For example, resume the stale `stalepipe1` pipe in the `mydb.myschema` database and schema:

```sqlexample
SELECT SYSTEM$PIPE_FORCE_RESUME('mydb.myschema.stalepipe1','staleness_check_override');
```

While the stale pipe was paused, if ownership of the pipe was transferred to another role, then resuming the pipe requires the
additional OWNERSHIP_TRANSFER_CHECK_OVERRIDE argument. For example, resume the stale `stalepipe2` pipe in the `mydb.myschema` database
and schema, which transferred to a new role:

```sqlexample
SELECT SYSTEM$PIPE_FORCE_RESUME('mydb.myschema.stalepipe1','staleness_check_override, ownership_transfer_check_override');
```

As an event notification received while a pipe is paused reaches the end of the limited retention period, Snowflake schedules it to be
dropped from the internal metadata. If the pipe is later resumed, Snowpipe processes these older notifications on a best effort
basis. Snowflake cannot guarantee that they are processed.

For example, if a pipe is resumed 15 days after it was paused, Snowpipe generally skips any event notifications that were received on the
first day the pipe was paused (i.e. that are now more than 14 days old). If the pipe is resumed 16 days after it was paused, Snowpipe
generally skips any event notifications that were received on the first and second days after the pipe was paused. And so on.

---
title: Managing the Kafka connector
source: https://docs.snowflake.com/en/user-guide/kafka-connector-manage.md
section: User Guide
---

# Managing the Kafka connector

This topic describes the administrative tasks associated with managing the Kafka connector.

## Dropping Snowflake objects used by the Kafka connector

If you no longer plan to load data into Snowflake tables using the Kafka connector, you can shut down Kafka and drop the Snowflake objects used by the connector.

The connector uses Snowflake objects of the following types to ingest data:

* Named internal stages
* Pipes
* Tables

This section provides instructions for finding and dropping the Snowflake objects used by the Kafka connector.

### Dropping stages

The connector creates one named internal stage for each Kafka topic. The format of the stage name is:

> `SNOWFLAKE_KAFKA_CONNECTOR_connector_name_STAGE_table_name`

Note that each internal stage stores not only files to be loaded into tables, but also “state” information that is used to ensure delivery
of rows from Kafka to the table.

If a stage and its state information are preserved, then if the connector is stopped and restarted, the connector automatically tries to
resume at the point where it left off. However, if a stage is removed, the connector cannot resume where it left off.

To drop the stages used by the Kafka connector:

1. Find the names of the stages by executing [SHOW STAGES](../sql-reference/sql/show-stages.md) as the stages owner (i.e. the role with the OWNERSHIP privilege on the stages. This should be the default role of the user defined in the Kafka configuration file to run the Kafka connector).
2. Execute [DROP STAGE](../sql-reference/sql/drop-stage.md) to drop each stage you want to remove from the system.

### Dropping pipes

The connector creates one pipe for each partition in a Kafka topic. The format of the pipe name is:

> `SNOWFLAKE_KAFKA_CONNECTOR_connector_name_PIPE_table_name_partition_number`

To drop the pipes used by the Kafka connector:

1. Find the names of the pipes by executing [SHOW PIPES](../sql-reference/sql/show-pipes.md) as the pipes owner (i.e. the role with the
   OWNERSHIP privilege on the pipes. This should be the default role of the user defined in the Kafka configuration file to run
   the Kafka connector).
2. Execute [DROP PIPE](../sql-reference/sql/drop-pipe.md) to drop each pipe you want to remove from the system.

### Dropping tables

If the data loaded into your target tables is no longer needed, you can also drop these tables.

If you did not map Kafka topics to tables using the `snowflake.topic2table.map` parameter in the [Kafka configuration properties](kafka-connector-install.md), the Kafka connector created new tables using the topic names. The table name is in uppercase but is otherwise identical to the topic name, as long as the topic name does not violate Snowflake object naming rules. For example, Snowflake creates a table name `TEMPERATURE_DATA` for a Kafka topic named `temperature_data`.

To drop the tables used by the Kafka connector:

1. Find the names of the tables by executing [SHOW TABLES](../sql-reference/sql/show-tables.md) as the tables owner (i.e. the role with the OWNERSHIP privilege on the tables. This should be the default role of the user defined in the Kafka configuration file to run the Kafka connector).
2. Execute [DROP TABLE](../sql-reference/sql/drop-table.md) to drop each table you want to remove from the system.

---
title: Managing user consent for OAuth
source: https://docs.snowflake.com/en/user-guide/oauth-consent.md
section: User Guide
---

# Managing user consent for OAuth

This topic describes how to manage delegated authorizations for OAuth, that is, user consent given to one or more clients associated with
Snowflake integrations for a specified role.

## Adding delegated authorizations

Adding a delegated authorization to a user pre-authorizes consent to initiate a session using a specified role for a particular
integration. Without the delegated authorization, the user must authorize consent for the role after authentication. Note that a delegated
authorization only bypasses the authorization step for a given role; a user must always authenticate to request an authorization code.

The ability to add delegated authorizations is limited to custom clients. For public clients (that is, Tableau Cloud or Desktop), Snowflake
always displays the confirmation dialog for a given role.

Add user consent for a role using [ALTER USER](../sql-reference/sql/alter-user.md) with the ADD DELEGATED AUTHORIZATION keywords:

```sqlsyntax
ALTER USER <username> ADD DELEGATED AUTHORIZATION
    OF ROLE <role_name>
    TO SECURITY INTEGRATION <integration_name>;
```

Where:

`username`
:   Specifies the user whose consent you are adding.

`role_name`
:   Specifies the role associated with the access token.

`integration_name`
:   Specifies the integration associated with the access tokens for a specific client.

> **Note:**
>
> Only security administrators (that is, users with the SECURITYADMIN role) or higher can execute this SQL command.

For example, add user consent for the CUSTOM1 role to user JANE.SMITH for the MYINT integration:

```sqlexample
ALTER USER jane.smith ADD DELEGATED AUTHORIZATION
    OF ROLE custom1
    TO SECURITY INTEGRATION myint;
```

## Viewing delegated authorizations

List the active delegated authorizations for which you have access privileges, using
[SHOW DELEGATED AUTHORIZATIONS](../sql-reference/sql/show-delegated-authorizations.md):

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS;

+-------------------------------+-----------+-----------+-------------------+--------------------+
| created_on                    | user_name | role_name | integration_name  | integration_status |
+-------------------------------+-----------+-----------+-------------------+--------------------+
| 2018-11-27 07:43:10.914 -0800 | JSMITH    | PUBLIC    | MY_OAUTH_INT      | ENABLED            |
+-------------------------------+-----------+-----------+-------------------+--------------------+
```

List the active delegated authorizations for a specified user. Users can list their own delegated authorizations; otherwise, this command
variant requires the OWNERSHIP privilege on the user.

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS
    BY USER <username>;
```

List the active delegated authorizations for a specified integration. This command variant requires the OWNERSHIP privilege on the
integration (that is, the ACCOUNTADMIN role):

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS
    TO SECURITY INTEGRATION <integration_name>;
```

## Revoking delegated authorizations

A user can revoke consent from a specified integration. This has the effect of revoking any access token associated with the integration.

To revoke user consent for a given integration, execute the ALTER USER … REMOVE DELEGATED AUTHORIZATIONS command.

> **Note:**
>
> Only security administrators (that is, users with the SECURITYADMIN role) or higher can execute this SQL command.

```sqlsyntax
ALTER USER <username>
  REMOVE DELEGATED AUTHORIZATIONS
  FROM SECURITY INTEGRATION <integration_name>
```

To revoke user consent associated with a specific role, include the `OF ROLE role_name` parameter in the statement:

```sqlsyntax
ALTER USER <username>
  REMOVE DELEGATED AUTHORIZATION OF ROLE <role_name>
  FROM SECURITY INTEGRATION <integration_name>
```

Where:

`username`
:   Specifies the user whose consent you are revoking.

`role_name`
:   Specifies the role associated with the access token.

`integration_name`
:   Specifies the integration associated with the access tokens for a specific client.

For example, remove user consent for the CUSTOM1 role from user JANE.SMITH for the MYINT integration:

```sqlexample
ALTER USER jane.smith
  REMOVE DELEGATED AUTHORIZATION OF ROLE custom1
  FROM SECURITY INTEGRATION myint;
```

---
title: Managing/Using federated authentication
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-use.md
section: User Guide
---

# Managing/Using federated authentication

This topic describes how to manage and use federated authentication once it has been
[configured](admin-security-fed-auth-security-integration.md).

## Managing users with federated authentication enabled

### Managing Snowflake user passwords

With federated authentication enabled for your account, Snowflake still allows maintaining and using Snowflake user credentials (login name and password).
In other words:

* Account and security administrators can still create users with passwords maintained in Snowflake.
* Users can still log into Snowflake using their Snowflake credentials.

However, if federated authentication is enabled for your account, Snowflake does not recommend maintaining user passwords in Snowflake. Instead, user
passwords should be maintained solely in your IdP.

If you create a user with no password (or alter an existing user and remove their password), this effectively disables Snowflake authentication for the user.
Without a password in Snowflake, a user cannot log in using Snowflake authentication and must use federated authentication instead. Note that you cannot use
the Snowflake web interface to create users with no passwords or remove passwords from existing users. You must use [CREATE USER](../sql-reference/sql/create-user.md) or
[ALTER USER](../sql-reference/sql/alter-user.md).

Specifically, we recommend that you disable Snowflake authentication for all non-administrator users.

> **Important:**
>
> The MUST_CHANGE_PASSWORD user property does not apply for federated authentication and should not be used. In particular, if you choose to not maintain
> passwords in Snowflake for users, ensure this property is set to FALSE for these users.
>
> Also, you must maintain at least one Snowflake account administrator with a Snowflake password. This ensures that an account administrator can
> access Snowflake at all times to manage federated authentication and troubleshoot any issues that occur.

### Disabling and dropping users

As an account or security administrator in Snowflake, you may find it necessary to drop or, more likely, disable a user. Users who are dropped or disabled
in Snowflake are still able to log into their Okta accounts, but they will receive an error message when they attempt to connect to Snowflake. You must
recreate or enable the user before they can log in.

You can drop/create and disable/enable users using either the Snowflake web interface or the equivalent SQL commands.

## Using SSO with client applications that connect to Snowflake

With an [IdP configured for your account](admin-security-fed-auth-configure-idp.md), Snowflake supports using SSO to connect
and authenticate with the following Snowflake-provided clients:

Snowflake CLI:
:   v3.0.0 or higher

SnowSQL:
:   v1.1.43 or higher

Python Connector:
:   v1.4.8 or higher

JDBC Driver:
:   v3.2.7 or higher

ODBC Driver:
:   v2.13.11 or higher

.NET Driver:
:   v1.0.13 or higher

Node.js Driver:
:   v1.6.0 or higher (for browser-based SSO); v1.6.1 or higher (for native SSO authentication through Okta)

Go Driver:
:   v1.1.5 or higher

Snowflake supports two methods of authenticating:

* Browser-based SSO
* Programmatic SSO (only for Okta)

> **Important:**
>
> When using SSO with client applications that connect to Snowflake, users must enter their login credentials when prompted; however, for security reasons,
> these credentials are never processed through the client. Instead, the credentials are sent to the IdP for authentication and the IdP sends back a valid
> SAML response which enables the client to initiate a Snowflake session.

### Browser-based SSO

If users have the required version (or higher) of the Snowflake-provided clients installed,
they can use browser-based SSO to log into Snowflake.

#### How browser-based SSO works

When a client application is configured to use browser-based SSO, the application uses the following workflow for user authentication:

1. The application launches the default web browser in the user’s operating system or opens a new browser tab/window, displaying the authentication page for
   the IdP.
2. The user enters their IdP credentials (username and password).
3. If the user is enrolled in MFA (multi-factor authentication) in Snowflake, they are prompted to type the MFA passcode (sent from another device) or
   confirm the authentication (on the other device).
4. After the IdP has authenticated the user’s credentials, the browser displays a success message. The user can then close the browser tab/window (it does not
   need to be open after authentication), return to the application, and use the Snowflake session that has been initiated.

#### Requirements for using browser-based SSO

With browser-based SSO, the Snowflake-provided client (for example, the Snowflake JDBC driver) needs to be able to open the user’s web browser. For this
reason, the Snowflake-provided client and the client application that uses it need to be installed on the user’s machine. Browser-based SSO does not work if
the Snowflake-provided client is used by code that runs on a server.

#### Setting up browser-based SSO

To set up browser-based SSO for authentication, set the `authenticator` login parameter/option to
`externalbrowser` for the client.

| Client | Instructions |
| --- | --- |
| Snowflake CLI | Set the `authenticator` parameter for the connection in the `config.toml` file, or specify the command line flag `--authenticator externalbrowser` when starting the client. |
| SnowSQL | Specify the command line flag `--authenticator externalbrowser` when starting the client. |
| Python | Pass `authenticator='externalbrowser'` to the `snowflake.connector.connect()` function. |
| JDBC | Set `authenticator=externalbrowser` in the connection string for the driver. |
| ODBC (Linux/macOS) | Set `authenticator=externalbrowser` in the `odbc.ini` file. |
| ODBC (Windows) | In the ODBC Data Source Administrator tool, edit the DSN for Snowflake and set Authenticator to `externalbrowser`. |
| .NET | Set `authenticator=externalbrowser` in the connection string for the driver. |
| Node.js | Set the `authenticator=EXTERNALBROWSER` option when calling the `snowflake.createConnection` function. |
| Go | Set `authenticator=externalbrowser` in the connection string for the driver. |

#### Using connection caching to minimize the number of prompts for authentication — *Optional*

Whenever a client application establishes a new connection to Snowflake, the user is prompted for authentication. This can
result in multiple prompts for authentication if the client application establishes a connection multiple times.

To minimize the number of times that a user is prompted for authentication, the account administrator can enable connection
caching.

When connection caching is enabled, the client application stores a connection token for use in subsequent connections. For
security, the connection token is stored in the keystore for the operating system. Before you enable connection caching,
consult with your security team to determine if this complies with your security policies.

> **Tip:**
>
> Connection caching can be combined with MFA token caching.
>
> For more information on how to combine these two features, see [Using MFA token caching to minimize the number of prompts during authentication — optional](security-mfa.md).

Snowflake supports connection caching with the following drivers and connectors:

* .NET driver version 4.4.0 (or later)
* Go driver version 1.6.15 (or later)
* JDBC driver version 3.12.8 (or later)
* Node.js driver version 1.12.0 (or later)
* ODBC driver version 2.21.2 (or later)
* Snowflake Connector for Python 2.2.8 (or later)

To enable connection caching:

1. Set the account-level parameter [ALLOW_ID_TOKEN](../sql-reference/parameters.md) to `true`:

   ```sqlexample
   alter account set allow_id_token = true;
   ```

   > **Note:**
   >
   > You must be an account administrator (i.e. a user with the ACCOUNTADMIN role) to enable connection caching.
2. Add the package or libraries needed by the driver or connector:

   * If you are using the Snowflake Connector for Python, install the optional keyring package by running:

     > ```bash
     > pip install "snowflake-connector-python[secure-local-storage]"
     > ```
     >
     > You must enter the square brackets (`[` and `]`) as shown in the command. The square brackets specify the [extra part of the package](https://www.python.org/dev/peps/pep-0508/#extras) that should be installed.
     >
     > Use quotes around the name of the package as shown to prevent the square brackets from being interpreted as a wildcard.
     >
     > If you need to install other extras (for example, `pandas` for [using the Python Connector APIs for Pandas](../developer-guide/python-connector/python-connector-pandas.md)), use a comma between the extras:
     >
     > ```bash
     > pip install "snowflake-connector-python[secure-local-storage,pandas]"
     > ```
   * For the Snowflake JDBC Driver, see [Add the JNA classes to your classpath](../developer-guide/jdbc/jdbc-download.md).

### Native SSO — *Okta only*

If Okta is your IdP, Snowflake also supports authenticating natively through Okta. This authentication method is useful when you are using SSO with a client
that doesn’t have access to a web browser (e.g. connecting programmatically through the Python connector or either the JDBC or ODBC driver).

> **Note:**
>
> Please disable Okta MFA for the user who uses Native SSO authentication with client drivers.
> Please consult your Okta administrator for more information.

To enable native SSO through Okta, set the `authenticator` login parameter/option for the client to the Okta URL endpoint for your Okta account
(provided by Okta), typically in the form of `https://<okta_account_name>.okta.com`:

| Client | Instructions |
| --- | --- |
| Snowflake CLI | Set the `authenticator` parameter for the connection in the `config.toml` file, or specify the command line flag `--authenticator https://<okta_account_name>.okta.com` when starting the client. |
| SnowSQL | Specify the command line flag `--authenticator https://<okta_account_name>.okta.com` when starting the client. |
| Python | Pass `authenticator='https://<okta_account_name>.okta.com'` to the `snowflake.connector.connect()` function. |
| JDBC | Set `authenticator=https://<okta_account_name>.okta.com` in the connection string for the driver. |
| ODBC (Linux/macOS) | Set `authenticator=https://<okta_account_name>.okta.com` in the `odbc.ini` file. |
| ODBC (Windows) | In the ODBC Data Source Administrator tool, edit the DSN for Snowflake and set Authenticator to `https://<okta_account_name>.okta.com`. |
| .NET | Set `authenticator=https://<okta_account_name>.okta.com` in the connection string for the driver. |
| Node.js | Set the `authenticator` option to `https://<okta_account_name>.okta.com` when calling `snowflake.createConnection`. |

#### Upgrading to Okta Identity Engine

If you are upgrading from Okta Classic to Okta Identity Engine for native SSO, you need to update your Snowflake client drivers before the
upgrade.

If you encounter HTTP 429 errors after your upgrade, you have most likely hit the rate limit enforced by the authentication endpoint used
by the latest client drivers. For details, refer to HTTP 429 errors (in this topic).

#### HTTP 429 errors

The Okta Identity Engine requires communication through its authentication endpoint (`/api/v1/authn`), which currently has
a rate limit of 20 requests per user per 5 seconds. To support the Okta Identity Engine, the latest Snowflake client drivers use this
Authentication endpoint, and are therefore subject to the rate limit. If this limit is prohibitive, contact Okta Support to increase the
rate limit of the authentication endpoint.

Snowflake client drivers switched to the authentication endpoint in the following versions:

> * Go: 1.6.20
> * JDBC: 3.13.22
> * .NET: 2.0.20
> * Node.js: 1.6.21
> * ODBC: 2.25.5
> * Python: 2.7.12
> * Snowflake CLI: 3.0.0
> * SnowSQL: 1.2.24
> * SQLAlchemy: 1.4.6

## Using SSO with MFA

Snowflake supports using MFA in conjunction with SSO to provide additional levels of security:

* Individual users in Snowflake can enroll in MFA. If a Snowflake user is enrolled in MFA and uses SSO to connect, the MFA login workflow is initiated within
  the SSO workflow and is required to successfully complete the authentication. For more information about MFA in Snowflake, see
  [Multi-factor authentication (MFA)](security-mfa.md).

  > **Note:**
  >
  > To connect through Okta SSO with MFA, Snowflake requires using browser-based SSO. If you are using native SSO for Okta, MFA is not supported.
* In addition, your IdP may also support MFA, but this is separate from MFA in Snowflake and must be configured separately through your IdP. If MFA is enabled
  for your IdP, the IdP determines the workflow. To determine whether your IdP supports MFA and how it is implemented, see the documentation for your IdP.
* With certain Snowflake-provided clients, you can cache MFA tokens for up to four hours. For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](security-mfa.md).

## Using SSO with multiple audience values

Snowflake supports multiple audience values (i.e. Audience or Audience Restriction Fields) in the SAML 2.0 assertion from the identity provider to Snowflake.

This functionality supports the URLs to access Snowflake as audience values. The URLs for multiple Snowflake accounts are supported because
each account has a URL with a unique [account identifier](admin-account-identifier.md) to access Snowflake. Additionally,
Snowflake accepts the account domain names and the URLs to access Snowflake using private connectivity to the Snowflake service as audience
values.

For more details on SSO and avoiding the public Internet, see [SSO with private connectivity](admin-security-fed-auth-overview.md).

Currently, Snowflake supports and accepts up to four different audience values. No configuration is necessary in Snowflake. If it is
necessary to include more than four audience values, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

For help in configuring SAML 2.0 audience values, please contact your organization’s identity provider administrator.

## Using SSO with private connectivity

Snowflake supports SSO with private connectivity to the Snowflake service for Snowflake accounts on Amazon Web Services (AWS),
Microsoft Azure, and Google Cloud Platform (GCP).

For details, see [SSO with private connectivity](admin-security-fed-auth-overview.md).

---
title: Manual Reclustering — Deprecated
source: https://docs.snowflake.com/en/user-guide/tables-clustering-manual.md
section: User Guide
---

# Manual Reclustering — *Deprecated*

If manual reclustering is still available in your account, you can use the [ALTER TABLE](../sql-reference/sql/alter-table.md) command with a `RECLUSTER` clause to manually recluster
a clustered table at any time.

## What is Manual Reclustering?

The `RECLUSTER` clause instructs Snowflake to perform immediate reclustering of the specified table. Unlike Automatic Clustering, this DML operation requires a virtual
warehouse in your account and locks the table for the duration of the operation.

Also, after a period of significant/sustained DML activity on a clustered table that does not have Automatic Clustering enabled, manual reclustering may need to be performed
multiple times on the table to achieve the desired results.

For these reasons, as well as other benefits, we recommend using [Automatic Clustering](tables-auto-reclustering.md) instead of manual reclustering.

> **Tip:**
>
> As a general rule of thumb and best practice, we recommend manual reclustering after performing significant DML on a clustered table. You can use the
> [clustering information](../sql-reference/functions/system_clustering_information.md) for the table to measure whether clustering on the table has degraded due to DML.

## Performance Impact of Manual Reclustering

The grouping/sorting that Snowflake performs during manual reclustering can impact the performance of the virtual warehouse used to perform the reclustering.

Due to this impact, if you chose to perform manual reclustering, we recommend using a separate, dedicated warehouse, and ensuring that the warehouse is of sufficient size.

## Switching from Manual Reclustering to Automatic Clustering

If manual reclustering is still available in your account, [Automatic Clustering](tables-auto-reclustering.md) may not be enabled yet for your account.

You can request Automatic Clustering to be enabled for your account; however, it will only affect clustered tables that are defined from the time after the feature is
enabled.

For clustered tables that were defined before the feature is enabled, you must explicitly “resume” Automatic Clustering for each table. You can use SQL to determine whether
Automatic Clustering is enabled for a given table.

For more details, see:

* [Viewing the Automatic Clustering status for a table](tables-auto-reclustering.md).
* [Resuming Automatic Clustering for a table](tables-auto-reclustering.md).

## Manually Reclustering a Table

Use [ALTER TABLE](../sql-reference/sql/alter-table.md) with a `RECLUSTER` clause to manually recluster a table for which a clustering key have been defined. You can use a `WHERE`
clause to specify a condition or range on which to recluster data in the table.

For example:

* To recluster table `t1`:

  > ```sqlexample
  > ALTER TABLE t1 RECLUSTER;
  > ```
* To recluster data that was inserted into table `t1` in the first week of 2016:

  > ```sqlexample
  > ALTER TABLE t2 RECLUSTER WHERE CREATE_DATE BETWEEN ('2016-01-01') AND ('2016-01-07');
  > ```

These examples use the current warehouse (for the session) to recluster the table. The amount of resources allocated to manual reclustering is based on the size of the warehouse.
The larger the warehouse, the more resources are allocated to the recluster command, which results in more effective reclustering.

> **Note:**
>
> Manual reclustering can only be performed on clustered tables (i.e. tables that have a clustering key defined).

---
title: Manually refresh dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-manual-refresh.md
section: User Guide
---

# Manually refresh dynamic tables

## Manual refresh of dynamic tables with the scheduler enabled

You can manually refresh a dynamic table to include the latest data without waiting for the next [scheduled refresh](dynamic-tables-refresh.md).
This is useful for one-time updates or when a table has a large target lag and the next refresh occurs much later.

> **Tip:**
>
> Avoid frequent manual refreshes on dynamic tables with downstream dynamic tables that are expected to refresh according to target lag.
> These kinds of manual refreshes can cause scheduled refreshes to skip and prevent downstream tables from updating.

To manually refresh, use the ALTER DYNAMIC TABLE … REFRESH command or Snowsight as shown in the following steps:

SQLSnowsight

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table REFRESH
```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Select Monitoring » Dynamic Tables.
3. In the list, find your dynamic table, and then select  » Refresh Manually.

For situations that require precise refresh timing, such as aligning refreshes with external system schedules or batch processing windows, you
can use a task with a CRON expression to trigger the refresh.

For example:

```sqlexample
-- Create the task
CREATE TASK my_dt_refresh_task
  WAREHOUSE = my_wh
  SCHEDULE = 'USING CRON 0 0 * * * America/Los_Angeles' -- Example: daily at midnight PST
  COMMENT = 'Daily 5pm PT manual refresh of my_dynamic_table'
  AS
    ALTER DYNAMIC TABLE my_dynamic_table REFRESH;

-- Enable the task
ALTER TASK my_dt_refresh_task RESUME;

-- Show the task
SHOW TASKS LIKE 'my_dt_refresh_task';
```

```output
+------------+-----------------+-------------------------------------+---------------+-------------+--------------+-------------------------------------------------+-----------|-------------------------------------------+------------------+---------+----------------------------------------------+-----------+-----------------------------+-------------------+-------------------------------|-------------------+-----------------+--------+---------------------+-----------------------+---------------------+-----------------+----------------------------+-----------------+
| CREATED_ON | NAME            | ID                                  | DATABASE_NAME | SCHEMA_NAME | OWNER        | COMMENT                                         | WAREHOUSE | SCHEDULE                                  | [ ] PREDECESSORS | STATE   | DEFINITION                                   | CONDITION | ALLOW_OVERLAPPING_EXECUTION | ERROR_INTEGRATION | LAST_COMMITTED_ON             | LAST_SUSPENDED_ON | OWNER_ROLE_TYPE | CONFIG | TASK_RELATIONS      | LAST_SUSPENDED_REASON | SUCCESS_INTEGRATION | SCHEDULING_MODE | TARGET_COMPLETION_INTERVAL | EXECUTE_AS_USER |
|------------+-----------------+-------------------------------------+---------------+-------------+--------------+-------------------------------------------------+-----------+-------------------------------------------+------------------+---------+----------------------------------------------+-----------+-----------------------------+-------------------+-------------------------------+-------------------+-----------------+--------+---------------------+-----------------------+---------------------+-----------------+----------------------------+-----------------|
| 2025-10-02 | DT_REFRESH_TASK | 01bf6f0d-690f-f373-0000-000000025e3d| mydb          | my_schema   | ACCOUNTADMIN | Daily 5pm PT manual refresh of my_dynamic_table | mywh      | USING CRON 0 17 * * * America/Los_Angeles | []               | Started | ALTER DYNAMIC TABLE my_dynamic_table REFRESH | null      | false                       | null              | 2025-10-02 05:08:52.897 +0000 | null              | ROLE            | null   | {"Predecessors":[]} | null                  | null                | null            | null                       | null            |
+------------+-----------------+-------------------------------------+---------------+-------------+--------------+-------------------------------------------------+-----------|-------------------------------------------+------------------+---------+----------------------------------------------+-----------+-----------------------------+-------------------+-------------------------------|-------------------+-----------------+--------+---------------------+-----------------------+---------------------+-----------------+----------------------------+-----------------+
```

For most cases, Snowflake recommends using target lag, which optimizes refresh frequency and can reduce costs compared to fixed CRON schedules
that might run unnecessarily.

## Manual refresh of dynamic tables with the scheduler disabled

Dynamic tables with the `SCHEDULER` attribute set to `DISABLE` can only be refreshed manually.

This type of manual refresh refreshes only that dynamic table. It doesn’t cascade to any upstream
dynamic tables, regardless of their scheduler state.

In the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) output, the `REFRESH_TRIGGER` value for these refreshes is
`MANUAL`. No `SCHEDULED` entries are generated for dynamic tables with `SCHEDULER` set to `DISABLE`.

This behavior allows external orchestrators, such as dbt, to issue one manual refresh per dynamic table without
triggering upstream refreshes.

To disable the scheduler and then manually refresh a dynamic table, use the [ALTER DYNAMIC TABLE](../sql-reference/sql/alter-dynamic-table.md) command as shown in the following steps:

SQLSnowsight

1. Disable the scheduler:

   ```sqlexample
   ALTER DYNAMIC TABLE my_dynamic_table SET SCHEDULER = DISABLE
   ```
2. Manually refresh the dynamic table:

   ```sqlexample
   ALTER DYNAMIC TABLE my_dynamic_table REFRESH
   ```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Select Monitoring » Dynamic Tables.
3. In the list, find your dynamic table, and then select  » Refresh Manually.

---
title: Metadata and retention for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-metadata.md
section: User Guide
---

# Metadata and retention for Apache Iceberg™ tables

Snowflake handles metadata for Apache Iceberg™ tables according to the type of catalog you use (Snowflake or external).

> **Note:**
>
> Specifying the default minimum number of snapshots with the `history.expire.min-snapshots-to-keep`
> [table property](https://iceberg.apache.org/docs/1.2.1/configuration/#table-behavior-properties) is not supported
> for any type of Iceberg table.

## Tables that use Snowflake as the catalog

Snowflake manages the metadata life cycle for this table type,
and deletes old metadata, manifest lists, and manifest files based on the retention period for the table data and snapshots.

To set the retention period for table data and snapshots, set the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter
at the account, database, schema, or table level.

### Creation

Snowflake generates metadata for version 2 of the Apache Iceberg specification
on a periodic basis, and writes the metadata to files on your external volume.
Each new metadata file contains all DML or DDL changes since the last Snowflake-generated metadata file was created.

You can also create metadata on demand by using the [SYSTEM$GET_ICEBERG_TABLE_INFORMATION](../sql-reference/functions/system_get_iceberg_table_information.md) function.
For instructions, see [Generate snapshots of DML changes](tables-iceberg-manage.md).

For information about locating metadata files, see [Data and metadata directories](tables-iceberg-storage.md).

### Viewing metadata creation history

To access a full history of metadata generation attempts, view the query history for your account and filter the results. Search for the
[SYSTEM$GET_ICEBERG_TABLE_INFORMATION](../sql-reference/functions/system_get_iceberg_table_information.md) function name in the SQL text.

Snowflake internally uses the same SYSTEM$GET_ICEBERG_TABLE_INFORMATION function to generate table metadata. Attempts made by Snowflake
appear under the user called `SYSTEM` in the query history. The `STATUS` column in the query history
indicates whether metadata was successfully generated.

For viewing options, see [Monitor query activity with Query History](ui-snowsight-activity.md).

### Deletion

Snowflake deletes Iceberg metadata from your external cloud storage when the following events occur:

* After you drop a table.
* When the Iceberg metadata refers to snapshots or table data that has expired.

Deletion doesn’t occur immediately after the data retention period expires.
As a result, metadata storage might incur costs with your cloud storage provider for longer than a table’s lifetime.

> **Warning:**
>
> Snowflake does not support [Fail-safe](data-failsafe.md) for Snowflake-managed Iceberg tables,
> because the table data is in external cloud storage that you manage.
> To protect Iceberg table data, you need to configure data protection and recovery with your cloud provider.

#### After dropping a table

When you drop a table, you can use the [UNDROP ICEBERG TABLE](../sql-reference/sql/undrop-iceberg-table.md) command
to restore it within the data retention period.

When the retention period expires, Snowflake deletes table metadata and snapshots that it
has written from your external volume location. Deletion occurs asynchronously and can take a few
days to complete after the retention period has passed.

> **Note:**
>
> For [converted tables](tables-iceberg-conversion.md),
> Snowflake deletes only metadata that was generated *after* table conversion.

#### After snapshots expire

Snowflake deletes Iceberg metadata files related to expired snapshots after the data retention period passes.
Deletion usually occurs 7-14 days after a snapshot expires.

Only previous table snapshots can expire. Snowflake never deletes metadata files that represent the latest (current) state of a table from
your external cloud storage.

## Tables that use an external catalog

For tables that use an external catalog, Snowflake uses the value of the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md)
parameter to set a retention period for Snowflake Time Travel and undropping the table. When the retention period expires,
Snowflake does not delete the Iceberg metadata or snapshots from your external cloud storage.

Snowflake sets DATA_RETENTION_TIME_IN_DAYS at the table level to the smaller of
the following values:

* The `history.expire.max-snapshot-age-ms` value in the current metadata file. Snowflake converts the value to days (rounding down).
* The following value, depending on your [Snowflake account edition](intro-editions.md):

  + Standard Edition: 1 day.
  + Enterprise Edition or higher: 5 days.

You can’t manually change the value of DATA_RETENTION_TIME_IN_DAYS in Snowflake. To change the value, you must update
`history.expire.max-snapshot-age-ms` in your metadata file and then [refresh the table](tables-iceberg-manage.md).

You can use the following table functions to retrieve information about the files registered to an externally managed Iceberg table or
the most recent snapshot refresh history:

* [ICEBERG_TABLE_FILES](../sql-reference/functions/iceberg_table_files.md)
* [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](../sql-reference/functions/iceberg_table_snapshot_refresh_history.md)

### Delta-based tables

> **Note:**
>
> If you want to use metadata writes for Delta-based Iceberg tables, the
> [2025_01 behavior change bundle](../release-notes/bcr-bundles/2025_01_bundle.md) must not be disabled in your account.

For Iceberg tables created from Delta table files, Snowflake automatically writes Iceberg metadata to your external storage if you
configure your external volume with write access (see [ALLOW_WRITES](../sql-reference/sql/create-external-volume.md)).
For more information about the write location, see [Data and metadata directories](tables-iceberg-storage.md).

To prevent Snowflake from writing Iceberg metadata, you can set the ALLOW_WRITES parameter to FALSE on your
external volume as long as no Snowflake-managed Iceberg tables use the same external volume.

## Iceberg partitioning

This section describes Iceberg partitioning.

Snowflake supports the following partitioning use cases:

* Reading from and writing to partitioned Iceberg tables.
* Creating partitioned Iceberg tables
  that are Snowflake-managed or externally managed in a [catalog-linked database](tables-iceberg-catalog-linked-database.md)
  or [externally managed by an Iceberg REST catalog](tables-iceberg-externally-managed-writes.md).

  When you create a partitioned Iceberg table, you can enable hidden partitioning or
  partitioning with hierarchical paths, which is also called “Hive-style”
  partitioning.

### “Hidden” partitioning

[“Hidden” partitioning](https://iceberg.apache.org/docs/latest/partitioning/#icebergs-hidden-partitioning)
for Apache Iceberg™ is metadata-based and adaptable. Iceberg produces partition values
based on transforms that you define when you create a table. When they read from a partitioned table, Iceberg engines
use the partition values defined in your table metadata to efficiently identify relevant data.

This option is the default. With this option, Snowflake stores your Parquet data files by using a flat directory layout.

To create a partitioned Iceberg table that uses hidden partitioning, include the PARTITION BY clause with one or more [partition transforms](https://iceberg.apache.org/spec/#partition-transforms)
in your regular [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) statement.

> **Note:**
>
> To create a partitioned Iceberg table that uses hidden partitioning, the PATH_LAYOUT parameter must be set to FLAT, which is the
> default, so you don’t need to specify this parameter in your CREATE ICEBERG TABLE statement.

For an example, see [Create an Iceberg table in a catalog-linked database](tables-iceberg-externally-managed-writes.md).

### Partitioning with hierarchical paths

With this option, Snowflake writes data to partitioned Iceberg tables by using a hierarchical path layout
for Parquet data files. Partitioning information is included in the file paths and the values are based on transforms that you define
when you create a table. This layout is also called
“Hive-style” partitioning. You might use this option for interoperability between Snowflake and external engines that support partitioned
writes with hierarchical paths.

Here’s an example of a data file stored under a hierarchical path:

`s3://my-bucket/iceberg/db_sales/orders/data/country=US/year=2025/month=02/day=21/part-00023.parquet`

For more information on the layout of the data and metadata directories for tables that use hierarchical paths,
see [File management](tables-iceberg-storage.md).

#### Create a table with hierarchical paths

To create a partitioned Iceberg table with a hierarchical path layout, set the following properties in your regular [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) statement:

* Set PATH_LAYOUT = HIERARCHICAL.
* Include the PARTITION BY clause with one or more [partition transforms](https://iceberg.apache.org/spec/#partition-transforms).

For an example of creating a partitioned Iceberg table with a hierarchical path layout in a catalog-linked database,
see [Create an Iceberg table in a catalog-linked database with hierarchical path layout](tables-iceberg-externally-managed-writes.md).

### Partitioning support matrix

The following table shows which features and actions are supported for each type of partitioned Iceberg table, and indicates compliance with
version 2 of the Apache Iceberg specification. The table shows support for both hidden partitioning and partitioning with hierarchical paths.

> **Note:**
>
> * Support for version 3 of the Apache Iceberg™ specification is in public preview. This public preview includes support for using deletion
>   vectors with partitioned tables. For more information about this public preview, see [Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](tables-iceberg-v3-specification-support.md).
> * CLD stands for catalog-linked database.

|  | Snowflake managed | Externally managed (CLD) | Externally managed (non-CLD) | Iceberg spec V2 compatibility | Comment |
| --- | --- | --- | --- | --- | --- |
| COPY commands with the ON_ERROR = ABORT_STATEMENT option | ❌ | ❌ | ❌ | ❌ |  |
| COPY INTO <table> | Limited support | Limited support | Limited support | Limited support | See [Usage notes](../sql-reference/sql/copy-into-table.md). |
| CREATE ICEBERG TABLE … AS SELECT (CTAS) | ✔ | ✔ | ✔ | ✔ |  |
| Cloning | ✔ | ✔ | ✔ | ✔ | See usage notes:   * [Snowflake managed](../sql-reference/sql/create-iceberg-table-snowflake.md) * [Externally managed](../sql-reference/sql/create-iceberg-table-rest.md) |
| CREATE ICEBERG TABLE … LIKE | ✔ | ✔ | ✔ | ✔ | See usage notes:   * [Snowflake managed](../sql-reference/sql/create-iceberg-table-snowflake.md) * [Externally managed](../sql-reference/sql/create-iceberg-table-rest.md) |
| Deletion vectors | ✔ | ✔ | ✔ | N/A | Currently in *Public Preview*. |
| Clustering | ❌ | ❌ | ❌ | ❌ |  |
| Partition evolution | ❌ | Limited support | Limited support | Limited support | We support partition evolution if it is done with an external engine. |
| Partition transforms | ✔ | ✔ | ✔ | ✔ | For the supported partition transforms, see:   * [Snowflake managed](../sql-reference/sql/create-iceberg-table-snowflake.md) * [Externally managed](../sql-reference/sql/create-iceberg-table-rest.md) |
| Positional deletes | ✔ | ✔ | ✔ | ✔ |  |
| Snowpipe | Limited support | Limited support | Limited support | Limited support | * Currently in *Public Preview*. * See the [usage notes](../sql-reference/sql/copy-into-table.md) for COPY INTO <table>. |
| Snowpipe Streaming | ❌ | ❌ | ❌ | ❌ |  |
| Sorting within partitions | ❌ | ❌ | ❌ | ❌ |  |
| TARGET_FILE_SIZE | ✔ | ✔ | ✔ | ✔ |  |

### Partitioning considerations

Consider the following before you use partitioned writes for Iceberg tables:

* If you use an external engine to add, drop, or replace a partition field in an externally managed table,
  Snowflake writes data according to the latest partition specification.
* The [GET_DDL](../sql-reference/functions/get_ddl.md) function doesn’t include the PARTITION BY clause in its output.
* The sum of the sizes of the outputs for all partition transforms can’t exceed 1024 bytes for a single row.
* Because partition evolution isn’t supported for Snowflake-managed tables, you must drop the table and create a new one with partitioning.
* The DAY(), MONTH(), YEAR() partition transform parameters, which you specify within the PARTITION BY clause under table properties,
  are part of the Iceberg specification. For multiple days, months, or years, the partition expression parameter returns a partition for
  each calendar day, month, or year.
  For example, when the DAY() transform is used on a timestamp column that has 2 months of data, 61 partitions are created.

  In contrast, the [DAY(), MONTH(), YEAR() functions](../sql-reference/functions/year.md)
  in Snowflake are part of the SQL standard. For multiple days, months, or years, these functions extract the corresponding day, month, or
  year part from a date or timestamp. For example, when the DAY() function is used on a timestamp column that has multiple months of data,
  this function returns a day of the month ranging from 1 to 31.
* You can’t use the ALTER ICEBERG TABLE command to modify the PATH_LAYOUT property for an existing table.
* For partitioning with hierarchical paths:

  + For `float` values, Snowflake and external engines might behave differently.
  + Snowflake can’t guarantee that the paths Snowflake writes will match the paths that external query
    engines write.

    Snowflake can’t guarantee this because when a query engine writes a hierarchical path, the query engine must serialize values into a string
    and insert the resulting value into the path.
    The Apache Iceberg table specification doesn’t define a standard serialization method, so different engines might implement different
    methods.

    For example, Snowflake doesn’t encode the `~` character but Apache Spark™ encodes this character as `%7E`.
  + Snowflake always writes the hierarchical paths directly under the `/data` directory in your external cloud storage.

## Time travel

With [Snowflake Time Travel](data-time-travel.md),
you can use Snowflake to query historical data for a table.

You can also use a third-party compute engine to perform time travel
queries on Snowflake-managed tables when you [Sync a Snowflake-managed table with Snowflake Open Catalog](tables-iceberg-open-catalog-sync.md) or
use the [Snowflake Catalog SDK](tables-iceberg-catalog.md).

You can query any snapshots that were committed within the data retention period.
To specify the data retention period, set the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) object parameter.

When you delete table data or drop a table, Snowflake deletes objects after the table retention period expires.
This might incur costs with your cloud storage provider for longer than the table’s lifetime.

---
title: Micro-partitions & Data Clustering
source: https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.md
section: User Guide
---

# Micro-partitions & Data Clustering

Traditional data warehouses rely on static partitioning of large tables to achieve acceptable performance and enable better scaling. In these systems, a *partition* is a unit of management that is
manipulated independently using specialized DDL and syntax; however, static partitioning has a number of well-known limitations, such as maintenance overhead and data skew, which can result in
disproportionately-sized partitions.

In contrast to a data warehouse, the Snowflake Data Platform implements a powerful and unique form of partitioning, called *micro-partitioning*, that delivers all the advantages of static partitioning without the known limitations, as well as
providing additional significant benefits.

> **Attention:**
>
> [Hybrid tables](tables-hybrid.md) are based on an architecture that does not support some of the
> features that are available in standard Snowflake tables, such as clustering keys.

## What are Micro-partitions?

All data in Snowflake tables is automatically divided into micro-partitions, which are contiguous units of storage. Each micro-partition contains between 50 MB and 500 MB of uncompressed data (note that
the actual size in Snowflake is smaller because data is always stored compressed). Groups of rows in tables are mapped into individual micro-partitions, organized in a columnar fashion. This size and
structure allows for extremely granular pruning of very large tables, which can be comprised of millions, or even hundreds of millions, of micro-partitions.

Snowflake stores metadata about all rows stored in a micro-partition, including:

* The range of values for each of the columns in the micro-partition.
* The number of distinct values.
* Additional properties used for both optimization and efficient query processing.

> **Note:**
>
> Micro-partitioning is automatically performed on all Snowflake tables. Tables are transparently partitioned using the ordering of the data as it is inserted/loaded.

## Benefits of Micro-partitioning

The benefits of Snowflake’s approach to partitioning table data include:

* In contrast to traditional static partitioning, Snowflake micro-partitions are derived automatically; they don’t need to be explicitly defined up-front or maintained by users.
* As the name suggests, micro-partitions are small in size (50 to 500 MB, before compression), which enables extremely efficient DML and fine-grained pruning for faster queries.
* Micro-partitions can overlap in their range of values, which, combined with their uniformly small size, helps prevent skew.
* Columns are stored independently within micro-partitions, often referred to as *columnar storage*. This enables efficient scanning of individual columns; only the columns referenced by a query
  are scanned.
* Columns are also compressed individually within micro-partitions. Snowflake automatically determines the most efficient compression algorithm for the columns in each micro-partition.

You can enable clustering on specific tables by specifying a clustering key for each of those tables. For information about
specifying a clustering key, see:

* [CREATE TABLE](../sql-reference/sql/create-table.md)
* [ALTER TABLE](../sql-reference/sql/alter-table.md)

For additional information about clustering, including strategies for choosing which tables to cluster, see:

* [Automatic Clustering](tables-auto-reclustering.md)

## Impact of Micro-partitions

### DML

All DML operations (e.g. DELETE, UPDATE, MERGE) take advantage of the underlying micro-partition metadata to facilitate and simplify table maintenance. For example, some operations, such as deleting all
rows from a table, are metadata-only operations.

### Dropping a Column in a Table

When a column in a table is dropped, the micro-partitions that contain the data for the dropped column are not re-written when the drop
statement is executed. The data in the dropped column remains in storage. For more information, see the
[usage notes](../sql-reference/sql/alter-table.md) for ALTER TABLE.

### Query Pruning

The micro-partition metadata maintained by Snowflake enables precise pruning of columns in micro-partitions at query run-time, including columns containing semi-structured data. In other words, a query that
specifies a filter predicate on a range of values that accesses 10% of the values in the range should ideally only scan 10% of the micro-partitions.

For example, assume a large table contains one year of historical data with date and hour columns. Assuming uniform distribution of the data, a query targeting a particular hour would ideally scan 1/8760th
of the micro-partitions in the table and then only scan the portion of the micro-partitions that contain the data for the hour column; Snowflake uses columnar scanning of partitions so that
an entire partition is not scanned if a query only filters by one column.

In other words, the closer the ratio of scanned micro-partitions and columnar data is to the ratio of actual data selected, the more efficient is the pruning performed on the table.

For time-series data, this level of pruning enables potentially sub-second response times for queries within ranges (i.e. “slices”) as fine-grained as one hour or even less.

Not all predicate expressions can be used to prune. For example, Snowflake does not prune micro-partitions based on a predicate with a subquery, even if the subquery results in a constant.

## What is Data Clustering?

Typically, data stored in tables is sorted/ordered along natural dimensions (e.g. date and/or geographic regions). This “clustering” is a key factor in queries because table data that is not sorted or
is only partially sorted may impact query performance, particularly on very large tables.

In Snowflake, as data is inserted/loaded into a table, clustering metadata is collected and recorded for each micro-partition created during the process. Snowflake then leverages this clustering information
to avoid unnecessary scanning of micro-partitions during querying, significantly accelerating the performance of queries that reference these columns.

The following diagram illustrates a Snowflake table, `t1`, with four columns sorted by date:

The table consists of 24 rows stored across 4 micro-partitions, with the rows divided equally between each micro-partition. Within each micro-partition, the data is sorted and stored by column, which
enables Snowflake to perform the following actions for queries on the table:

1. First, prune micro-partitions that are not needed for the query.
2. Then, prune by column within the remaining micro-partitions.

Note that this diagram is intended only as a small-scale conceptual representation of the data clustering that Snowflake utilizes in micro-partitions. A typical Snowflake table may consist of thousands,
even millions, of micro-partitions.

## Clustering Information Maintained for Micro-partitions

Snowflake maintains clustering metadata for the micro-partitions in a table, including:

* The total number of micro-partitions that comprise the table.
* The number of micro-partitions containing values that overlap with each other (in a specified subset of table columns).
* The depth of the overlapping micro-partitions.

### Clustering Depth

The clustering depth for a populated table measures the average depth (`1` or greater) of the overlapping micro-partitions for specified columns in a table. The smaller the average depth, the better
clustered the table is with regards to the specified columns.

Clustering depth can be used for a variety of purposes, including:

* Monitoring the clustering “health” of a large table, particularly over time as DML is performed on the table.
* Determining whether a large table would benefit from explicitly defining a [clustering key](tables-clustering-keys.md).

A table with no micro-partitions (i.e. an unpopulated/empty table) has a clustering depth of `0`.

> **Note:**
>
> The clustering depth for a table is not an absolute or precise measure of whether the table is well-clustered. Ultimately, query performance is the best indicator of how well-clustered a table is:
>
> * If queries on a table are performing as needed or expected, the table is likely well-clustered.
> * If query performance degrades over time, the table is likely no longer well-clustered and may benefit from clustering.

### Clustering Depth Illustrated

The following diagram provides a conceptual example of a table consisting of five micro-partitions with values ranging from A to Z, and illustrates how overlap affects clustering depth:

As this diagram illustrates:

1. At the beginning, the range of values in all the micro-partitions overlap.
2. As the number of overlapping micro-partitions decreases, the overlap depth decreases.
3. When there is no overlap in the range of values across all micro-partitions, the micro-partitions are considered to be in a *constant state* (i.e. they cannot be improved by clustering).

The diagram is not intended to represent an actual table. In an actual table, with data contained in a large numbers of micro-partitions, reaching a constant state across all micro-partitions is neither
likely nor required to improve query performance.

## Monitoring Clustering Information for Tables

To view/monitor the clustering metadata for a table, Snowflake provides the following system functions:

* [SYSTEM$CLUSTERING_DEPTH](../sql-reference/functions/system_clustering_depth.md)
* [SYSTEM$CLUSTERING_INFORMATION](../sql-reference/functions/system_clustering_information.md) (including clustering depth)

For more details about how these functions use clustering metadata, see Clustering Depth Illustrated (in this topic).

---
title: Microsoft Entra ID SCIM integration with Snowflake
source: https://docs.snowflake.com/en/user-guide/scim-azure.md
section: User Guide
---

# Microsoft Entra ID SCIM integration with Snowflake

Snowflake supports Microsoft Entra ID as a SCIM identity provider.

This topic provides details about provisioning users and groups from Microsoft Entra ID to Snowflake.

## Features

* Automatic Microsoft Entra ID User Provisioning to Snowflake.
* Use the `allowedInterfaces` custom attribute to prevent a provisioned user from using certain interfaces to access Snowflake.
* Automatic Microsoft Entra ID Group Provisioning to Snowflake.
* Synchronizing Microsoft Entra ID users and groups to Snowflake.
* If Microsoft Entra ID is configured for
  [SAML SSO to Snowflake](https://docs.microsoft.com/en-us/azure/active-directory/saas-apps/snowflake-tutorial), Microsoft Entra ID users
  provisioned to Snowflake can access Snowflake using SAML SSO.

  > **Note:**
  >
  > By default, Microsoft Entra ID users provisioned to Snowflake using SCIM are not assigned a password in Snowflake. This means that if SAML SSO is configured in Microsoft Entra ID, users will authenticate to Snowflake using SSO.
  >
  > SAML SSO is not a requirement if using SCIM to provision users and groups from Microsoft Entra ID to Snowflake. For additional options, see
  > [Configure Entra ID single sign-on](https://docs.microsoft.com/en-us/azure/active-directory/saas-apps/snowflake-tutorial#configure-azure-ad-single-sign-on).

### Limitations

* Snowflake supports a maximum of 500 concurrent requests per account per SCIM endpoint (e.g. the `/Users` endpoint, the `/Groups` endpoint). After your account exceeds this threshold, Snowflake returns a `429` HTTP status code (i.e. too many requests). Note that this request limit usually only occurs during the initial provisioning when relatively large numbers of requests (i.e. more than 10 thousand) occur to provision users or groups.

### Not supported

* AWS PrivateLink and Google Cloud Private Service Connect. Customers wanting to provision users and groups to Snowflake from
  Microsoft Entra ID without traversing the public Internet need to have their Snowflake account in Microsoft Azure.
* If you are using Azure Private Link to access Snowflake, ensure that you are not using the Azure Private Link URL in the integration
  settings. Enter the public endpoint (i.e. without `.privatelink`), and ensure that the network policy allows access from the
  Azure IP addresses as shown in the Prerequisites section. Otherwise, you cannot use this integration.
* Transferring ownership of existing users and roles. Microsoft Entra ID is the authoritative source for its users and groups. Group membership
  can be updated in Microsoft Entra ID. However, existing users and groups in Snowflake cannot be transferred to Microsoft Entra ID.
* Microsoft Entra ID does not currently support reading or provisioning
  [nested groups](https://docs.microsoft.com/en-us/azure/active-directory/app-provisioning/how-provisioning-works#scoping). Therefore, you
  cannot use the Snowflake Microsoft Entra ID SCIM integration to provision or manage nested groups in Snowflake. Please contact Microsoft to
  request the support of nested groups.
* Enabling or disabling password synchronization from Microsoft Entra ID to Snowflake.

  Setting the `SYNC_PASSWORD` property in the Snowflake security integration will not synchronize user passwords from Microsoft Entra ID
  to Snowflake. This is a Microsoft Entra ID limitation. To request support, please contact Microsoft Entra ID.

## Prerequisites

Before using SCIM to provision Microsoft Entra ID users and groups to Snowflake, verify the following:

1. An existing Microsoft Entra ID tenant.
2. An existing Snowflake tenant.

   * During the configuration process in Microsoft, you will need to input the URL of the Snowflake SCIM endpoint (i.e. Tenant URL
     in the Microsoft Entra ID SCIM configuration guide). The Snowflake SCIM endpoint consists of the Snowflake account URL appended with
     `/scim/v2/`. For example, if you use the account name URL format, the SCIM endpoint is
     `https://myorg-myaccount.snowflakecomputing.com/scim/v2/`. For a list of supported formats for the Snowflake account URL, see
     [Connecting with a URL](organizations-connect.md).
3. At least one user in Snowflake with the ACCOUNTADMIN role
4. Before provisioning users or groups, as it pertains to your account, ensure that the [network policy](network-policies.md)
   in Snowflake allows access from all of the Azure IP addresses for the [Public Cloud](https://www.microsoft.com/en-us/download/details.aspx?id=56519) or the [US Government Cloud](https://www.microsoft.com/en-us/download/details.aspx?id=57063). Currently, all Azure IP addresses are required to create a
   Microsoft Entra ID SCIM network policy. For more information, see Managing SCIM Network Policies.

## Configuration

The Snowflake configuration process creates a SCIM security integration to allow users and roles created in Microsoft Entra ID to be owned by
the AAD_PROVISIONER SCIM role in Snowflake and creates an access token to use in SCIM API requests. The access token (i.e. the
Secret Token in the Microsoft Entra ID SCIM configuration guide) is valid for six months. Upon expiration, create a new access token
manually using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md) as shown below.

> **Note:**
>
> To invalidate an existing access token for a SCIM integration, execute a [DROP INTEGRATION](../sql-reference/sql/drop-integration.md) statement.
>
> To continue using SCIM with Snowflake, recreate the SCIM integration with a [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md) statement and generate a new access token using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md).

### Microsoft Entra ID configuration

To use Microsoft Entra ID as a SCIM identity provider, follow the instructions in the
[Microsoft documentation](https://docs.microsoft.com/en-us/azure/active-directory/saas-apps/snowflake-provisioning-tutorial). While
completing these steps, do not re-use an existing enterprise application in Microsoft Entra ID. Failure to create a new enterprise
application for provisioning can result in unexpected behavior.

> **Note:**
>
> If you are creating custom attributes and would like the `name` and `login_name` fields for the Snowflake user to have different
> values, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable separate mappings for your account before creating the custom attributes.

### Snowflake configuration

To facilitate the Snowflake configuration, you can copy the SQL below for use in this first [step](https://docs.microsoft.com/en-us/azure/active-directory/saas-apps/snowflake-provisioning-tutorial#setup-snowflake-for-provisioning). Each of the following statements is explained below.

```sqlexample
use role accountadmin;
create role if not exists aad_provisioner;
grant create user on account to role aad_provisioner;
grant create role on account to role aad_provisioner;
grant role aad_provisioner to role accountadmin;
create or replace security integration aad_provisioning
    type = scim
    scim_client = 'azure'
    run_as_role = 'AAD_PROVISIONER';
select system$generate_scim_access_token('AAD_PROVISIONING');
```

> **Important:**
>
> The example SQL statements use the ACCOUNTADMIN system role and the AAD_PROVISIONER custom role is granted to the ACCOUNTADMIN role.
>
> It is possible not to use the ACCOUNTADMIN role in favor of a less-privileged role. Using a less-privileged role can help to address compliance concerns relating to least-privileged access, however, using a less-privileged role can result in unexpected errors during the SCIM configuration and management process.
>
> These errors could be the result of the less-privileged role not having sufficient rights to manage all of the roles through SCIM due to how the roles are created and the resultant role hierarchy. Therefore, in an effort to avoid errors in the configuration and management processes, choose one of the following options:
>
> 1. Use the ACCOUNTADMIN role as shown in the example SQL statements.
> 2. Use a role with the global MANAGE GRANTS privilege.
> 3. If neither of these first two options are desirable, use a custom role that has the OWNERSHIP privilege on all of the roles that will be managed using SCIM.

1. Login to Snowflake as an administrator and execute the following from either the Snowflake worksheet interface, Snowflake CLI, or SnowSQL.
2. Use the ACCOUNTADMIN role.

   > ```sqlexample
   > use role accountadmin;
   > ```
3. Create the custom role AAD_PROVISIONER. All users and roles in Snowflake created by Microsoft Entra ID will be owned by the scoped down
   AAD_PROVISIONER role.

   > ```sqlexample
   > create role if not exists aad_provisioner;
   > grant create user on account to role aad_provisioner;
   > grant create role on account to role aad_provisioner;
   > ```
4. Let the ACCOUNTADMIN role create the security integration using the AAD_PROVISIONER custom role. For more information, see [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md).

   > ```sqlexample
   > grant role aad_provisioner to role accountadmin;
   > create or replace security integration aad_provisioning
   >     type=scim
   >     scim_client='azure'
   >     run_as_role='AAD_PROVISIONER';
   > ```
5. Create and copy the authorization token to the clipboard and store securely for later use. Use this token for each SCIM REST API request and place it in the request header. The access token expires after six months and a new access token can be generated with this statement.

   > ```sqlexample
   > select system$generate_scim_access_token('AAD_PROVISIONING');
   > ```

## Enabling Snowflake-initiated SSO

The SCIM provisioning process does not automatically enable single sign-on (SSO).

To use SSO after the SCIM provisioning process is complete, enable
[Snowflake-initiated SSO](admin-security-fed-auth-security-integration.md).

## Managing SCIM network policies

Applying a network policy to a SCIM security integration allows the SCIM network policy to be distinct from network policies that apply to the entire Snowflake account.
It allows the SCIM provider to provision users and groups without adding IP addresses to a network policy that controls access for normal users.

A network policy applied to a SCIM integration overrides a network policy applied to the entire Snowflake account.

After creating the SCIM security integration, create the SCIM network policy using this command:

> ```sqlsyntax
> alter security integration aad_provisioning set network_policy = <scim_network_policy>;
> ```

To unset the SCIM network policy, use this command:

> ```sqlexample
> alter security integration aad_provisioning unset network_policy;
> ```

Where:

`aad_provisioning`
:   Specifies the name of the Microsoft Entra ID SCIM security integration.

`scim_network_policy`
:   Specifies the Microsoft Entra ID SCIM network policy in Snowflake.

For more information, see [Controlling network traffic with network policies](network-policies.md) and [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-scim.md).

## Using secondary roles with SCIM

Snowflake supports setting the [user](../sql-reference/sql/create-user.md) property `DEFAULT_SECONDARY_ROLES` to `'ALL'` with
SCIM to allow users to use [secondary roles](security-access-control-overview.md) in a Snowflake session.

For a representative example, see [Update a user](scim-user-api-reference.md).

## Populating Snowflake tags with SCIM integrations

You can populate tags by using the `snowflakeTags` attribute when you ingest user information into the SCIM security integration. The exact request input can be found in [Create a user](scim-user-api-reference.md).

To enable support for this feature:

* Create the tag before you run the SCIM integration.
* Grant proper privileges on each tag and tag schema to the GENERIC_SCIM_PROVISIONER role.

Here is an example of creating a tag and assigning the proper role privileges:

```sqlexample
-- Create the tag.
CREATE TAG my_database_name.my_schema_name.my_tag_name;

-- Assign the proper privileges to the SCIM integration.
GRANT USAGE ON SCHEMA my_database_name.my_schema_name TO ROLE GENERIC_SCIM_PROVISIONER;
GRANT APPLY ON TAG my_database_name.my_schema_name.my_tag_name TO ROLE GENERIC_SCIM_PROVISIONER;
```

You must grant USAGE ON SCHEMA and APPLY ON TAG to all tags and tag schemas that you plan to assign through your SCIM security integration.

## Replicating the Microsoft Entra ID SCIM security integration

Snowflake supports replication and failover/failback with the SCIM security integration from the source account to the target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Troubleshooting

* To verify that Microsoft Entra ID is sending updates to Snowflake, check the log events in Microsoft Entra ID for the Snowflake application and
  the SCIM audit logs in Snowflake to ensure Snowflake is receiving updates from Microsoft Entra ID. Use the following SQL to query the
  Snowflake SCIM audit logs, where `demo_db` is the name of your database.

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  > USE SCHEMA snowflake.information_schema;
  >
  > SELECT * FROM TABLE(REST_EVENT_HISTORY('scim'));
  >
  > SELECT *
  >   FROM TABLE(REST_EVENT_HISTORY(
  >   'scim',
  >   DATEADD('MINUTES',-5,CURRENT_TIMESTAMP()),
  >   CURRENT_TIMESTAMP(),
  >   200))
  >   ORDER BY event_timestamp;
  > ```
* If the user update fails, check the ownership of the user in Snowflake. If it is not owned by the `aad_provisioner` role (or the role set in the `run_as_role` parameter when creating the security integration in Snowflake), then the update will fail. Transfer the ownership by running the following SQL statement in Snowflake, and try again.

  > ```sqlexample
  > grant ownership on user <username> to role AAD_PROVISIONER;
  > ```
* If there are changes to the `UPN`
  [attribute value](https://learn.microsoft.com/en-us/entra/identity/hybrid/connect/plan-connect-userprincipalname#what-is-userprincipalname)
  in Microsoft Entra ID after the initial SCIM provisioning, subsequent updates to the user will not work. A change to the `UPN`
  attribute value breaks the link between the Microsoft Entra ID user object and the Snowflake user object. If a change to the UPN attribute
  value occurs, reprovision the user with the correct `UPN` attribute value.

**Next topics:**

* [SCIM API references](scim-api-references.md)

---
title: Migrating from SnowSQL to Snowflake CLI
source: https://docs.snowflake.com/en/user-guide/snowsql-migrate.md
section: User Guide
---

# Migrating from SnowSQL to Snowflake CLI

> **Note:**
>
> Snowflake CLI migration support is still in development. In the meantime, Snowflake encourages you to migrate from SnowSQL using these instructions.

This guide provides instructions for migrating from SnowSQL to Snowflake CLI to help you seamlessly move your existing SnowSQL connections and environment variables.

* Migration steps
* Migrate your configurations
* Connect to Snowflake
* Executing SQL queries

## Migration steps

To migrate from SnowSQL to Snowflake CLI, follow these steps:

1. Install Snowflake CLI with your preferred method.
2. Import your connections.
3. Optionally check for suggested changes for your environment variables.
4. Optionally create an alias that maps the `snowsql` shell command to the `snow sql` shell command.

### Install the Snowflake CLI software

> **Tip:**
>
> Useful links:
>
> * [Installing Snowflake CLI](../developer-guide/snowflake-cli/installation/installation.md) documentation
> * [Snowflake CLI binaries repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html)

Similarly to SnowSQL, Snowflake CLI provides [binary installers](../developer-guide/snowflake-cli/installation/installation.md). Additionally, it also lets you install the software using [homebrew](../developer-guide/snowflake-cli/installation/installation.md) and [pip](../developer-guide/snowflake-cli/installation/installation.md).

Currently, Snowflake CLI supports following platforms:

* macOS (arm)

  + [Package installer](../developer-guide/snowflake-cli/installation/installation.md)
  + [Homebrew](../developer-guide/snowflake-cli/installation/installation.md)
  + [PyPi (pip)](../developer-guide/snowflake-cli/installation/installation.md)
* macOS (x86_64):

  + [Package installer](../developer-guide/snowflake-cli/installation/installation.md)
  + [Homebrew](../developer-guide/snowflake-cli/installation/installation.md)
  + [PyPi (pip)](../developer-guide/snowflake-cli/installation/installation.md)
* Linux (x86_64 & aarch64):

  + [deb package installer](../developer-guide/snowflake-cli/installation/installation.md)
  + [rpm package installer](../developer-guide/snowflake-cli/installation/installation.md)
  + [PyPi (pip)](../developer-guide/snowflake-cli/installation/installation.md)
* Windows (64bit):

  + [Installer](../developer-guide/snowflake-cli/installation/installation.md)
  + [PyPi (pip)](../developer-guide/snowflake-cli/installation/installation.md)

Currently, Snowflake CLI does not support the following platform:

* Linux bash installer

For more information about installing Snowflake CLI, see [Installing Snowflake CLI](../developer-guide/snowflake-cli/installation/installation.md).

### Migrate your SnowSQL connections and settings

Snowflake CLI provides a [snow helpers](../developer-guide/snowflake-cli/command-reference/helpers-commands/overview.md) command group to simplify the process of transitioning from SnowSQL to Snowflake CLI.
Use these commands to easily import your existing connections and your environment variables:

* The [snow helpers import-snowsql-connections](../developer-guide/snowflake-cli/command-reference/helpers-commands/import-snowsql-connections.md) command uses an interactive menu to let you choose which SnowSQL connections you want to import.
  For more information, see [Import connections from SnowSQL](../developer-guide/snowflake-cli/connecting/configure-connections.md).
* The [snow helpers check-snowsql-env-vars](../developer-guide/snowflake-cli/command-reference/helpers-commands/check-snowsql-env-vars.md) command helps you diagnose which environment variables are set in your SnowSQL environment and displays their corresponding Snowflake CLI equivalents.
  For more information, see [Use variables in SQL](../developer-guide/snowflake-cli/project-definitions/use-sql-variables.md).

If you use SnowSQL to execute inline SQL statements or execute files but do not want to edit all your scripts, consider creating an alias that maps `snowsql` to the `snow sql` command. For example, on Unix-like systems, use the following command:

```bash
alias snowsql='snow sql'
```

With this alias, you can use your existing scripts with Snowflake CLI.

Note that if you are a more advanced SnowSQL user, you might occasionally encounter incompatibility messages, which typically relate to options used for configuring SnowSQL.
Because Snowflake CLI doesn’t use all of the SnowSQL configuration options, you might need to make copies of your scripts and remove those incompatible options.

### Roll back to SnowSQL

Snowflake CLI uses its own configuration files, so you can continue to use SnowSQL.
You can install both SnowSQL and Snowflake CLI and run them independently.
If you set an alias, as described above, you must remove the alias to use the `snowsql` command for SnowSQL.

## Migrate your configurations

> **Tip:**
>
> Useful links:
>
> * [Configuring Snowflake CLI and connecting to Snowflake](../developer-guide/snowflake-cli/connecting/connect.md) documentation

### Differences in the configuration files

* SnowSQL

  SnowSQL is configured by its [configuration file](snowsql-config.md), which is a file in TOML format that contains connection configurations, various settings of the tool, and variables that can be used in SQL queries.
  Configurations can be split into several locations, which lets you define system-wide defaults and override them for different users.
  You can also specify configurations from custom locations by specifying the `--config` command-line option.
  For more information, see [Connection parameters reference](snowsql-start.md).
* Snowflake CLI

  Snowflake CLI also has its own TOML [configuration file](../developer-guide/snowflake-cli/connecting/configure-cli.md) that specifies connection configurations and settings of the tool.
  It does not allow you to define variables for later use in SQL queries. Variables in Snowflake CLI are defined at the project level in [project definition files](../developer-guide/snowflake-cli/project-definitions/use-sql-variables.md).
  Snowflake CLI uses only one configuration file that, by default, is located in the user’s home directory.
  You can also specify configurations from custom locations by specifying the `--config` command-line option.
  For more information, see the [snow](../developer-guide/snowflake-cli/command-reference/snow.md) command reference.

### Find the Snowflake CLI default configuration file

The location of the default Snowflake CLI configuration depends on your system and is determined by the order specified in [Location of the .toml configuration file](../developer-guide/snowflake-cli/connecting/configure-cli.md).

* To find the value of the `default_config_file_path` parameter for your Snowflake CLI installation, run the `snow --info` command as shown:

  ```snowcli
  snow --info
  ```

  ```output
  [
    ...

    {
        "key": "default_config_file_path",
        "value": "/<user_home>/.snowflake/config.toml"
    },

    ...
  ]
  ```

### Import connections from SnowSQL

> **Tip:**
>
> Useful links:
>
> * [Import connections from SnowSQL](../developer-guide/snowflake-cli/connecting/configure-connections.md) documentation

You can import all of your SnowSQL connections with the `snow helpers import-snowsql-connections` command.
For more information, see [Import connections from SnowSQL](../developer-guide/snowflake-cli/connecting/configure-connections.md) and the [snow helpers import-snowsql-connections](../developer-guide/snowflake-cli/command-reference/helpers-commands/import-snowsql-connections.md) command reference.

### Manually migrate the default connection configuration

If you choose not to import connections using the `snow helpers import-snowsql-connections` command, you can migrate the default connection manually.

Differences in specifying the default connection include the following:

* SnowSQL

  The default connection is configured in the SnowSQL [configuration file](snowsql-config.md), and connection settings are defined directly in the [[connections]](snowsql-start.md) section.
* Snowflake CLI

  The default connection is configured in the Snowflake CLI [configuration file](../developer-guide/snowflake-cli/connecting/configure-cli.md) as a named connection with the name `default_connection_name`, set at the top level of configuration (see [Set the default connection](../developer-guide/snowflake-cli/connecting/configure-connections.md)).
  You can change the default connection by using the `snow connection set-default` command.

By default both SnowSQL and Snowflake CLI use the default connection configuration to connect to Snowflake. If you have it configured in SnowSQL, you should migrate this configuration to the Snowflake CLI configuration file, as follows:

1. Open the SnowSQL configuration file and find the default connection parameters in the `[connections]` section. You need the values of the connection parameters when adding the connection to Snowflake CLI.
2. To add the connection to Snowflake CLI, use one of the following methods:

   * Manually edit the Snowflake CLI configuration file, as follows:

     1. Open the Snowflake CLI configuration file.
     2. Add a `[connections.your_connection_name]` section and copy/paste the default configuration details from the SnowSQL configuration file.
     3. Change the names of the following parameters, as shown:

        + `accountname` to `account`
        + `username` to `user`
        + `dbname` to `database`
        + `schemaname` to `schema`
        + `warehousename` to `warehouse`
        + `rolename` to `role`
     4. Add or set `default_connection_name = "your_connection_name"` setting at the top level of the configuration file (see [Set the default connection](../developer-guide/snowflake-cli/connecting/configure-connections.md)).
   * Use the `snow connection add` and `snow connection set-default` commands.
     For more information, see [Manage or add your connections to Snowflake with the snow connection commands](../developer-guide/snowflake-cli/connecting/configure-connections.md).

### Manually migrate your named connection configurations

If you don’t use the `snow helpers import-snowsql-connections` command to import your connections, you can migrate them manually.

Differences in specifying named connections include the following:

* SnowSQL

  Named connections are configured in the SnowSQL [configuration file](snowsql-start.md). Each named connection has its own `[connections.your_connection_name]` section.
* Snowflake CLI

  Snowflake CLI uses almost the same format to [configure named connections](../developer-guide/snowflake-cli/connecting/configure-connections.md). You can copy them from the SnowSQL configuration and rename the parameters as specified in the default connection.

By default both SnowSQL and Snowflake CLI let you use a named connection to connect to Snowflake. If you want to continue using those named connections in SnowSQL, you should migrate them to the Snowflake CLI configuration file:

1. Open the SnowSQL configuration file and locate the `[connections.your_connection_name]` sections. You need the values of the connection parameters when adding the connections to Snowflake CLI.
2. To add the connection to Snowflake CLI, use one of the following methods:

   * Manually edit the Snowflake CLI configuration file, as follows:

     1. Open the Snowflake CLI configuration file.
     2. Add a `[connections.your_connection_name]` section and copy/paste the default configuration details from the SnowSQL configuration file.
     3. Change the names of the following parameters, as shown:

        + `accountname` to `account`
        + `username` to `user`
        + `dbname` to `database`
        + `schemaname` to `schema`
        + `warehousename` to `warehouse`
        + `rolename` to `role`
   * Use the `snow connection add` command.
     For more information, see [Manage or add your connections to Snowflake with the snow connection commands](../developer-guide/snowflake-cli/connecting/configure-connections.md).

### Configure logs

> **Tip:**
>
> Useful links:
>
> * [Configure logging](../developer-guide/snowflake-cli/connecting/configure-cli.md) documentation

To manually configure logging for Snowflake CLI, see the [Configure logging](../developer-guide/snowflake-cli/connecting/configure-cli.md) documentation.

### Migrate your variables

> **Tip:**
>
> Useful links:
>
> * [About project definition files](../developer-guide/snowflake-cli/project-definitions/about.md) documentation
> * [Use variables in SQL](../developer-guide/snowflake-cli/project-definitions/use-sql-variables.md) documentation

Snowflake CLI doesn’t support specifying variables in its configuration file. Instead, it uses a more project-focused approach that associates variables with specific projects. Snowflake CLI lets you define variables in `snowflake.yml` [project definition files](../developer-guide/snowflake-cli/project-definitions/about.md). You can then use these variables in SQL queries as described in [About project definition files](../developer-guide/snowflake-cli/project-definitions/about.md).

* To define variables for your project, add an `env` section to the project’s `snowflake.yml` file and include any variables you want to use in your queries.

The following example defines two variables: `database` and `role`:

```yaml
definition_version: 2
env:
  database: "dev"
  role: "eng_rl"
```

### Manually migrate your environment variables

> **Tip:**
>
> Useful links:
>
> * [Use environment variables for Snowflake credentials](../developer-guide/snowflake-cli/connecting/configure-connections.md) documentation
> * [Use variables in SQL](../developer-guide/snowflake-cli/project-definitions/use-sql-variables.md) documentation

In SnowSQL, you can use environment variables (like `$SNOWSQL_ACCOUNT` and `$SNOWSQL_DATABASE`) instead of specifying command-line parameters when starting a connection.
This approach provides another way to specify default connection configurations. Snowflake CLI offers the same functionality but uses different names for these parameters and allows you to override many more configuration parameters via environment variables.
If you’re using environment variables to connect to Snowflake, for more information, see [connecting to Snowflake with environment variables](../developer-guide/snowflake-cli/connecting/configure-connections.md). Also, see information about possibilities for [configuring environment variables](../developer-guide/snowflake-cli/connecting/configure-cli.md) in the Snowflake CLI documentation.

## Connect to Snowflake

> **Tip:**
>
> Useful links:
>
> * [Managing Snowflake connections](../developer-guide/snowflake-cli/connecting/configure-connections.md) documentation
> * [snow sql](../developer-guide/snowflake-cli/command-reference/sql-commands/sql.md) documentation

Assuming that you have migrated your configuration, you can connect to Snowflake from Snowflake CLI using similar methods to these used by SnowSQL, including the following:

* Use the default connection.
* Use a connection with command-line options.
* Use a named configuration.
* Use only command-line options.
* Use environment variables.
* Use a mixture of connections, environment variables, and command-line options.

### Use the default connection

* To connect using the default configuration defined in your configuration file:

  + SnowSQL

    ```bash
    snowsql -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    snow sql -q "select 1"
    ```

### Use a connection with command-line options

* To connect using the default configuration defined in your configuration file and override parameters with command-line options:

  + SnowSQL

    ```bash
    snowsql --username myname -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    snow sql --username myname -q "select 1"
    ```

    For a list of possible command-line options, see [snow sql](../developer-guide/snowflake-cli/command-reference/sql-commands/sql.md). Note that some options have different names than in SnowSQL.

### Use a named configuration

* To connect using a named configuration defined in your configuration file:

  + SnowSQL

    ```bash
    snowsql -c dev -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    snow sql -c dev -q "select 1"
    ```

### Use only command-line options

* To connect using only command-line options instead a configured connection:

  + SnowSQL

    ```bash
    snowsql \
      --accountname myaccount \
      --username myuser \
      --authenticator SNOWFLAKE_JWT \
      --private-key-path "path_to_my_key" \
      -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    snow sql \
      --temporary-connection \
      --accountname myaccount \
      --username myuser \
      --authenticator SNOWFLAKE_JWT \
      --private-key-path "path_to_my_key" \
      -q "select 1"
    ```

    Note that Snowflake CLI requires the `--temporary-connection` option for this method.

### Use environment variables

* To connect using the default connection, passing some parameters as environment variables:

  + SnowSQL

    ```bash
    export SNOWSQL_USER=myuser
    snowsql -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    export SNOWFLAKE_USER=myuser
    snow sql -q "select 1"
    ```

    Note that the names of environment variables might differ. For more information, see [Use environment variables for Snowflake credentials](../developer-guide/snowflake-cli/connecting/configure-connections.md).

### Use a mixture of connections, environment variables, and command-line options

* To connect using a mixed approach with a named connection, environment variables, and command-line options:

  + SnowSQL

    ```bash
    export SNOWSQL_USER=myuser
    snowsql -c dev --accountname myaccount -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    export SNOWFLAKE_USER=myuser
    snow sql -c dev --accountname myaccount -q "select 1"
    ```

    You can use this method with both the default and named connections.

## Executing SQL queries

> **Tip:**
>
> Useful links:
>
> * [snow sql](../developer-guide/snowflake-cli/command-reference/sql-commands/sql.md) documentation
> * [Executing SQL statements](../developer-guide/snowflake-cli/sql/execute-sql.md) documentation

### Execute SQL queries from various inputs

Snowflake CLI lets you execute SQL queries using inputs similar to these handled by SnowSQL. The following examples execute SQL queries using various inputs.

* Execute queries using command-line parameters:

  + SnowSQL

    ```bash
    snowsql -q "select 1"
    ```
  + Snowflake CLI

    ```bash
    snow sql -q "select 1"
    ```
* Execute queries from a file:

  + SnowSQL

    ```bash
    snowsql -f test.sql
    ```
  + Snowflake CLI

    ```bash
    snow sql -f test.sql
    ```
* Execute queries from standard input:

  + SnowSQL

    ```bash
    cat test.sql | snowsql
    ```
  + Snowflake CLI

    ```bash
    cat test.sql | snow sql --stdin
    ```

### Save query results to a JSON file

Snowflake CLI currently does not support all of the [SnowSQL output formatting options](snowsql-use.md). Snowflake CLI does let you save query results as either a formatted table or as JSON. Although CSV and other formats are not yet available, you can use external tools, such as [jq](https://jqlang.org/), to covert data from JSON other formats.

* SnowSQL

  ```bash
  snowsql \
    -f test.sql \
    -o "output_format=json" \
    -o "output_file=result.json"
  ```
* Snowflake CLI

  ```bash
  snow sql -f test.sql --format json > result.json
  ```

### Execute queries using variables

Both SnowSQL and Snowflake CLI let you use variables in queries. SnowSQL lets you use variables from command-line options, from its configuration file, and using a few [built-in variables](snowsql-use.md). Although Snowflake CLI does not support variables in its configuration file or using built-in variables, it does support specifying parameters with command-line options and specifying variables in project definition files. For information about migrating your SnowSQL configuration file variables, see Migrate your variables.

After migrating your variables from SnowSQL’s configuration, you can run Snowflake CLI queries using variables from both command-line options and project definitions.

When using variables, note the following important differences between SnowSQL and Snowflake CLI:

* They use different syntaxes for variable substitutions. SnowSQL uses the `&variable` or `&{variable}` syntax while Snowflake CLI uses `<% variable %>`. The syntax from SnowSQL is currently supported, but has been deprecated.
* Snowflake CLI automatically enables variable substitution, so you do not need to explicitly enable it as with SnowSQL.
* Variable names in Snowflake CLI project definition files must be prefixed with `ctx.env`, as shown:

The following examples show the differences when executing SQL queries with variables:

* Execute a query using variables in command-line options, where `x` is the variable name:

  + SnowSQL

    ```bash
    snowsql \
      -o variable_substitution=true \
      -q "select &x" \
      -D x=1
    ```
  + Snowflake CLI

    ```bash
    snow sql \
      -q "select <% x %>" \
      -D x=1
    ```
  + Snowflake CLI (using deprecated syntax to facilitate quick migrations)

    ```bash
    snow sql \
      -q "select &x" \
      -D x=1
    ```
* Execute a query using variables in a SnowSQL configuration versus a Snowflake CLI project definition file:

  + SnowSQL

    ```bash
    # save variables to config
    echo "[variables]
    xyz=Hello World" > custom_config

    # execute query
    snowsql \
      --config custom_config \
      -o variable_substitution=true \
      -q "select '&{xyz}'"
    ```
  + Snowflake CLI

    ```bash
    # save variables to project definition
    echo "definition_version: 2
    env:
      xyz: Hello World" > snowflake.yml

    # execute query
    snow sql -q "select '<% ctx.env.xyz %>'"
    ```
  + Snowflake CLI (using deprecated syntax to facilitate quick migrations)

    ```bash
    # save variables to project definition
    echo "definition_version: 2
    env:
      xyz: Hello World" > snowflake.yml

    # execute query
    snow sql -q "select '&{ctx.env.xyz}'"
    ```

## SnowSQL and Snowflake CLI feature parity

The following table shows how SnowSQL features are integrated into Snowflake CLI.

SnowSQL and Snowflake CLI feature parity

| SnowSQL feature | Snowflake CLI implementation |
| --- | --- |
| Global configuration file (`~/.snowsql/config`) in a `.ini` format. | Configuration and connection files use a TOML format and are stored in the `~/.snowflake` directory (Linux) or in another subdirectory in the user’s HOME directory (other OS systems). For more information, see [Location of the .toml configuration file](../developer-guide/snowflake-cli/connecting/configure-cli.md). |
| Connection configuration through command-line options supports everything the Snowflake Connector for Python supports. | Snowflake CLI supports the command-line options as described in the [snow connection add](../developer-guide/snowflake-cli/command-reference/connection-commands/add-connection.md) command reference. |
| Connection testing via the `--probe-connection` command-line option. This option is mainly used to print out the TLS/SSL certificate chain. | Currently, the `snow connection test` command does the connection probe but does not print the TLS/SSL certificate chain. You can generate connection diagnostic data for Snowflake Support. |
| Ability to generate and display a JWT token based on the `user`, `account`, and `private-key-path` parameters. | Use the `snow connection generate-jwt` command. For more information, see [Use a private key file for authentication](../developer-guide/snowflake-cli/connecting/configure-connections.md). |
| Execute a query from a file using the `-f` or `--filename FILE` options. | Use the `snow sql [-f/--filename] file.sql` command. |
| Execute a query from command-line input using the `-q` or `--query TEXT` options. | Use the `snow sql [-q/--query] "<query-text>"` command; for example, `snow sql -q "select emp_id FROM employees"`. |
| Query templating with the option to provide variables using the `--variable` command-line option, such as `--variable db_key=$DB_KEY`. | Snowflake CLI supports SQL variables in SQL templates and in snowflake.yml project definition files. For more information, see [Using variables for SQL templates](../developer-guide/snowflake-cli/sql/execute-sql.md) and [Storing variables in the snowflake.yml project definition file](../developer-guide/snowflake-cli/sql/execute-sql.md). |
| [Interactive SQL shell mode](snowsql-use.md). | Use [interactive mode](../developer-guide/snowflake-cli/sql/execute-sql.md). Support for asynchronous queries will be added at a later date. |
| Include, or source, one or more SQL files from another SQL file:  ```sqlexample !source file1.sql; !source file2.sql; !source http://example.com/my.sql ``` | Snowflake CLI supports nesting SQL scripts with template support. For more information, see [Working with SQL query commands](../developer-guide/snowflake-cli/sql/execute-sql.md). |
| Display EXIT_ON_ERROR error codes. | Use the `--enhanced-exit-codes` command-line option, or set the `SNOWFLAKE_ENHANCED_EXIT_CODES` environment variable to `1` to send the enhanced return codes for all `snow sql` commands. For more information, see [Enhanced error codes](../developer-guide/snowflake-cli/command-reference/sql-commands/sql.md). |

---
title: Migrating to a SAML2 security integration
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-configure-snowflake.md
section: User Guide
---

# Migrating to a SAML2 security integration

> **Important:**
>
> The [SAML_IDENTITY_PROVIDER](../sql-reference/parameters.md) and [SSO_LOGIN_PAGE](../sql-reference/parameters.md) parameters used for SAML SSO configuration and management are
> deprecated. Snowflake configurations should use a
> [SAML2 security integration](admin-security-fed-auth-security-integration.md) instead of these parameters.
>
> Snowflake will continue to support these deprecated parameters as long as there are implementations that use them.

If you are implementing federated authentication for the first time, refer to
[Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

If you have an existing SSO implementation that uses the SAML_IDENTITY_PROVIDER account parameter, follow the steps below to migrate your
SSO implementation to a SAML2 security integration:

1. Run the [SYSTEM$MIGRATE_SAML_IDP_REGISTRATION](../sql-reference/functions/system_migrate_saml_idp_registration.md) function.
2. Confirm that a SAML2 security integration was created by running the following SQL statement:

   > ```sqlexample
   > desc security integration <integration_name>;
   > ```

If you want to configure your security integration, refer to [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

---
title: Monitor and troubleshoot DCM Projects
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-monitor.md
section: User Guide
---

# Monitor and troubleshoot DCM Projects

This topic describes how to monitor DCM deployments and troubleshoot failing DCM plans.

## Troubleshoot a DCM project

If you are unfamiliar with the DCM project, you might run into errors from misconfigurations or other common pitfalls. This section
describes those errors and how to resolve them.

### Common causes for errors

The following table lists common causes for errors in a DCM project execution:

| Error category | Common causes |
| --- | --- |
| Secondary roles | * Users encounter inconsistent behavior due to unknowingly leveraging secondary role privileges when running DCM commands. |
| Insufficient role privileges | * Insufficient role privileges to create defined object types * Insufficient role privileges to alter or drop existing objects that are now owned by another role * Insufficient role privileges to use system-DMFs * Insufficient role privileges to run a warehouse to refresh a dynamic table at creation |
| Jinja rendering issues | * Jinja rendering issues from Incorrect Jinja syntax * Jinja rendering issues from Value-type mismatches |
| Project issues | * Incorrect manifest path * Empty definition folders * Outdated definition files on the wrong repo branch * Objects that are already deployed by another DCM project * Mismatched project and object references |

### Recommended troubleshooting steps

Follow these steps to troubleshoot and debug a DCM project.

| Step | Details |
| --- | --- |
| Set secondary roles to none | * Ensures that the primary role holds all privileges needed for the DCM project and provides consistent behavior across users   and accounts. |
| Use error messages from PLAN | * See if the error message refers to a specific file and line number. * Review the code in the specified line of the source file. * Optionally, open the corresponding rendered output file from the `out/plan/sources/definitions` folder. |
| Narrow down | * Run PLAN on the project with only an empty SQL file in the definitions folder to confirm that project privileges and settings are correct. * Gradually add definition files back to the project. |
| Change the client | * If CI/CD workflow fails, try running the same CLI commands locally. * If the local CLI commands fail, try running them in Workspaces. * Run `snow connection test` to check the account, role, and user. |
| Use Cortex Code for AI-assisted debugging | * Start Cortex Code with the DCM skill and describe the error. * Cortex Code can read plan output, diagnose Jinja rendering issues, identify privilege gaps, and suggest fixes. * Especially useful for complex Jinja templating errors and dependency resolution issues. |

## Observe and audit DCM project deployments

DCM Projects are designed to provide full transparency and audit trails for all changes to your account infrastructure. This requires you to follow
a few software development best practices for setting up infrastructure deployment processes. For more information, see
[Automate a DCM project deployment](dcm-projects-use.md).

Use the following sources to review previous deployments:

* Deployment artifacts stored inside the DCM project
* Deployment history
* Event logs from a DCM project (depending on log-level settings)

### Deployment artifacts

For every executed deployment, an immutable snapshot of the deployment artifacts is stored inside the DCM project, with the following
information:

* The manifest file (`manifest.yml`)
* All object definition and macro files (`.sql` files) inside the `sources` folder
* The output of the PLAN operation (`plan_result.json`) and the DEPLOY operation (`deploy_result.json`), including:

  + The templating variables used for this deployment
  + Deployment metadata, including timestamp, object name, and query ID
  + The changeset

This complete set makes all deployment actions reproducible for debugging, auditing, or redeploying
the defined state.

The following commands are available for observing and auditing a DCM project:

* With the MONITOR privilege, you can:

  + List all deployments stored inside the DCM project.
  + List all files inside a specified deployment.
  + Read, copy, or download specific files inside that deployment.
* With the OWNERSHIP privilege, you can manually drop a deployment if it contains sensitive data.
* With the READ privilege, you can run the DESCRIBE command to see the most recent deployment name, alias, and timestamp for a selected DCM project.

Example commands:

SQLSnowflake CLI

```sqlexample
DESCRIBE DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV;

SHOW DEPLOYMENTS IN DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV;

LIST 'snow://project/DCM_DEMO.PROJECTS.DCM_PROJECT_DEV/deployments/DEPLOYMENT$1/';

ALTER DCM PROJECT DCM_DEMO.PROJECTS.DCM_PROJECT_DEV DROP DEPLOYMENT DEPLOYMENT$1;
```

```snowcli
snow dcm describe

snow dcm describe --target STAGE

snow dcm list-deployments

snow dcm drop-deployment 'DEPLOYMENT$1'
```

### Deployment history

The `DCM_DEPLOYMENT_HISTORY` Information Schema table function provides role-based access and low-latency ways to see successful and failed deployments for a selected DCM project.

For the full syntax, arguments, output columns, and examples, see the
[DCM_DEPLOYMENT_HISTORY](../../sql-reference/info-schema/dcm_deployment_history.md) reference.

SQLSnowsight

```sqlexample
SELECT *
FROM
  TABLE (DCM_DEMO.INFORMATION_SCHEMA.DCM_DEPLOYMENT_HISTORY(
    project_name => 'DCM_DEMO.PROJECTS.DCM_PROJECT_DEV',
    result_limit => 50
  ));
```

To see your deployment history in Snowsight:

1. In the navigation menu, select Catalog » Database Explorer.
2. Navigate to the schema that contains the DCM project.
3. Select the DCM project object to see its details.
4. Select the Deployment History tab to see a list of all deployments from this project object.
5. Select a deployment from the table to see more details about which objects were added, modified, or dropped.

### Event logs

You can set the preferred LOG_LEVEL on the DCM project object or inherit the defined LOG_LEVEL from the parent schema, database, or account.

If the LOG_LEVEL for the DCM project is set, failed PLAN and DEPLOY executions are logged with the corresponding error messages
as an event, and you can see them by querying the defined event table. For more information about setting up event tables and log levels,
see [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md).

For example:

SQLSnowflake CLISnowsight

```sqlexample
SELECT
  TIMESTAMP,
  RESOURCE_ATTRIBUTES:"snow.executable.name" ::STRING AS PROJECT_NAME,
  CASE
    WHEN RESOURCE_ATTRIBUTES:"snow.project.dcm.execution.command" ::STRING = 'plan' THEN 'PLAN'
    WHEN RESOURCE_ATTRIBUTES:"snow.project.dcm.execution.command" ::STRING = 'deploy' THEN 'DEPLOY'
    ELSE RESOURCE_ATTRIBUTES:"snow.project.dcm.execution.command" ::STRING
  END AS COMMAND,
  CASE
    WHEN VALUE:"state" ::STRING = 'SUCCEEDED' THEN 'SUCCEEDED'
    WHEN VALUE:"state" ::STRING = 'FAILED' THEN 'FAILED'
    ELSE VALUE:"state" ::STRING
  END AS STATUS,
  COALESCE(
    CONCAT('Error message: ',VALUE:"message"::STRING),
    VALUE:"operation"::STRING)
  AS OPERATIONS,
  RESOURCE_ATTRIBUTES:"snow.session.role.primary.name" ::STRING AS ROLE,
  RESOURCE_ATTRIBUTES:"db.user" ::STRING AS USER_NAME,
  RECORD:"severity_text" ::STRING AS SEVERITY
FROM
  SNOWFLAKE.TELEMETRY.EVENTS
WHERE
  RESOURCE_ATTRIBUTES:"snow.executable.type" ::STRING = 'DCM_PROJECT'
ORDER BY
  TIMESTAMP DESC
LIMIT
  250;
```

```snowcli
snow logs dcm_project DCM_DEMO.PROJECTS.DCM_PROJECT_DEV \
  --from SNOWFLAKE.TELEMETRY.EVENTS
```

1. In the navigation menu, select Monitoring » Traces & logs.
2. Select the Logs tab.
3. Select the appropriate event table.
4. Filter by the project’s parent database or schema.

---
title: Monitor budgets
source: https://docs.snowflake.com/en/user-guide/budgets/monitor.md
section: User Guide
---

# Monitor budgets

This topic describes how to monitor budget spending and identify the budget that tracks the credit usage of a specific resource.

## Creating a custom role to monitor budgets

You can delegate budget monitoring by creating a custom role that can be used by non-administrator users to monitor budgets.

### Create a custom role to monitor the account budget

You can create a custom role to enable non-account administrator users to monitor the account budget. For a full list of privileges
and roles that must be granted to a role to monitor the account budget, see [Budgets roles and privileges](../budgets.md).

#### Example

> **Note:**
>
> Only an account administrator can execute the statements in this example.

For example, create role `account_budget_monitor` and grant the role the ability to view credit usage for the
account budget:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE account_budget_monitor;

GRANT APPLICATION ROLE SNOWFLAKE.BUDGET_VIEWER TO ROLE account_budget_monitor;

GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE TO ROLE account_budget_monitor;
```

### Create a custom role to monitor a custom budget

You can create a custom role to enable non-account administrator users to monitor custom budgets. For a full list of privileges
and roles that must be granted to a role to monitor a custom budget, see [Budgets roles and privileges](../budgets.md).

#### Example

> **Note:**
>
> Only a budget owner (a role with the OWNERSHIP privilege) can execute the statements in this example.

Use the budget owner role to grant the custom role `budget_monitor` the ability to monitor the budget `my_budget` in schema
`budgets_db.budgets_schema`:

```sqlexample
USE ROLE custom_budget_owner;

GRANT USAGE ON DATABASE budgets_db TO ROLE budget_monitor;

GRANT USAGE ON SCHEMA budget_db.budgets_schema TO ROLE budget_monitor;

GRANT SNOWFLAKE.CORE.BUDGET ROLE budgets_db.budgets_schema.my_budget!VIEWER
  TO ROLE budget_monitor;

GRANT DATABASE ROLE SNOWFLAKE.USAGE_VIEWER TO ROLE budget_monitor;
```

## Monitoring budgets

You can monitor budgets using Snowsight or SQL.

### Use Snowsight to monitor budgets

You can view current and historical budget spending using the Budgets page in Snowsight.

> **Note:**
>
> Only a user with the ACCOUNTADMIN role or a role granted the required privileges and role
> can monitor budgets using Snowsight.
>
> * For more information about using a custom account role to monitor the account budget, see Create a custom role to monitor the account budget.
> * For more information about using a custom account role to monitor custom budgets, see Create a custom role to monitor a custom budget.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.

In the Current Month view for a budget, you can review the credit usage per day up to the current day. You can see whether
you might exceed your budget for the month. The bar graph continues to the end of the month with your projected credit usage based on your
actual credit usage for the month. The Spending limit line indicates the spending limit at which a budget notification is triggered.

Select  (months to display) to filter the view by Current Month or longer time periods.

You can compare the Spend (current credit usage) to Interval (time left in the current month) to see if
your spending is outpacing your monthly budget.

You can filter the view by selecting  Budgets or  Resources:

* You can select a custom budget in the Budgets view for details on a specific budget.

  > **Note:**
  >
  > The Service Type list for a custom budget includes an Unused Resources type. This service type is displayed
  > when an object in a budget has no credit usage data to display. This can happen if the object has no credit usage for
  > compute costs, or if you recently added an object to a budget and the [serverless background task](cost.md)
  > has not yet executed.
* In the Resources view, you can filter and sort by Service Type, object Name, and Credit Usage.

### Use SQL commands to monitor budgets

To monitor the account budget, you must have the required privileges. For more information, see Create a custom role to monitor the account budget.

Use the `account_budget_monitor` role to view the spending history for the account budget:

```sqlexample
USE ROLE account_budget_monitor;

CALL snowflake.local.account_root_budget!GET_SPENDING_HISTORY(
  TIME_LOWER_BOUND => DATEADD('days', -7, CURRENT_TIMESTAMP()),
  TIME_UPPER_BOUND => CURRENT_TIMESTAMP()
);
```

You can monitor the spending history by service type. To view the spending history for the search optimization serverless feature
for the account budget in an eight-month period, execute the following statements:

```sqlexample
USE ROLE account_budget_monitor;

SELECT *
   FROM table(snowflake.local.account_root_budget!GET_SERVICE_TYPE_USAGE_V2(
         '2025-05', '2025-12'))
   WHERE service_type = 'SEARCH_OPTIMIZATION';
```

To monitor a custom budget, you must have the required privileges. For more information, see Create a custom role to monitor a custom budget.

Use the `budget_monitor` role to view spending history for a custom budget. For example, to view the spending history for custom
budget `na_finance_budget` in schema `budgets_db.budgets_schema`, execute the following statements:

```sqlexample
USE ROLE budget_monitor;

CALL budgets_db.budgets_schema.na_finance_budget!GET_SPENDING_HISTORY(
  TIME_LOWER_BOUND => DATEADD('days', -7, CURRENT_TIMESTAMP()),
  TIME_UPPER_BOUND => CURRENT_TIMESTAMP()
);
```

You can monitor the spending history by service type. For example, to view the spending history in a one year period for the materialized
views included in the budget, execute the following statements:

```sqlexample
USE ROLE budget_monitor;

SELECT *
   FROM table(budgets_db.budgets_schema.na_finance_budget!GET_SERVICE_TYPE_USAGE_V2(
         '2025-05', '2025-12'))
   WHERE service_type = 'MATERIALIZED_VIEW';
```

For more information, see [Budget methods](../../sql-reference/classes/budget.md).

## Identifying the budgets that track a resource

If you want to determine which budgets track a resource, you can call the
[SYSTEM$SHOW_BUDGETS_FOR_RESOURCE](../../sql-reference/functions/system_show_budgets_for_resource.md) function.

For example:

```sqlexample
SELECT SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('TABLE', 'my_db.my_schema.my_table');
```

```output
+-----------------------------------------------------------------------+
| SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('TABLE', 'MY_DB.MY_SCHEMA.MY_TABLE') |
|-----------------------------------------------------------------------|
| [BUDGETS_DB.BUDGETS_SCHEMA.MY_BUDGET]                                 |
+-----------------------------------------------------------------------+
```

The function returns the budget that the resource has been added to. It includes budgets that include the resource because of any of the following reasons:

* The resource was added directly to the budget.
* The resource has the tag/value combination that was added to the budget.
* The resource belongs to an object (for example, a database) that was added to the budget.

---
title: Monitor data loading activity by using Copy History
source: https://docs.snowflake.com/en/user-guide/data-load-monitor.md
section: User Guide
---

# Monitor data loading activity by using Copy History

You can monitor data loading activity for all tables in your account, or for a specific table, by using Snowsight or SQL.

* Monitor data loading for your account by using Copy History.
* Monitor data loading for a table by using Copy History.

## Monitor data loading for your account by using Copy History

Review the data loading activity that has occurred over the last 365 days for all tables in your account by using the Copy History
page in Snowsight or the [COPY_HISTORY view](../sql-reference/account-usage/copy_history.md) in the ACCOUNT_USAGE schema of the SNOWFLAKE database.

The account-level data loading activity has a latency of up to 2 hours and includes bulk data loading performed using COPY INTO statements, continuous data loading using pipes, and files loaded through the web interface.

### Prerequisites

* You must use a role with access to the SNOWFLAKE database. See [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md).
* Viewing the Copy History page in Snowsight or querying the SNOWFLAKE database requires a warehouse.
  If you have a default warehouse for your user profile, Snowsight uses that warehouse. You can switch warehouses at any time.

### Review account-level Copy History

> **Note:**
>
> You must use a role with access to the SNOWFLAKE database. See [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Copy History.

The Copies Over Time graph provides a visualization of data loading over a given period of time.
By default, the graph shows a 7-day history with each bar on the graph representing one day.

Select a bar on the graph to filter the Copies table by that day.

For more details about data loading activity, you can review the Copies table. The table includes the following information:

* File Name displays the name of the file loaded.
* Loaded displays the timestamp in your local timezone for when the data was loaded.
* Status displays the status of the data load. You can hover over data loads with a status of Failed to review error details.
* Database displays the database into which data was loaded.
* Schema displays the schema into which data was loaded.
* Table displays the table into which data was loaded.
* Pipe displays the pipe used to load data, if applicable.
* Size displays the size of the data loaded, rounded to the nearest decimal point in KB, MB, GB, or TB. For example,
  if you load 45800 bytes, the size is listed as 45.8KB.
* Rows displays the number of rows loaded, rounded to the nearest decimal point in thousands, millions, and so on. For example,
  if you load 2000 rows of data, the rows are listed as 2K.
* Location displays a link to the location from which the data was loaded. For example, a link to an AWS S3 bucket added as an
  external stage, or an internal named stage. Hover over the link to see the stage name, or select the link to copy the path to the stage.

To more easily identify specific data loading activities, you can search and filter the Copy History page.

You can filter by the following:

* Time range, up to 365 days (1 year)
* Status of the data loading activity, such as All (default), In progress, Loaded, Failed, Partially loaded,
  and Skipped.
* The location of the data:

  + Database
  + Schema
  + Pipe

You can also search the column values in the Copies table for specific data loading activities.

Select  (Open underlying SQL query in worksheet) to open a worksheet that contains the SQL query used to populate
the table. The SQL query is based on the filters you select.

When you select a specific data load activity in the Copies table, Snowsight opens the table-level Copy History.
See Monitor data loading for a table by using Copy History. You might see newer results in that table due to reduced latency, but you can only review 14 days of
activity.

## Monitor data loading for a table by using Copy History

Review the data loading activity that has occurred over the last 14 days for a specific table in a database by using the Copy History
details for the table in Snowsight or the [COPY_HISTORY](../sql-reference/functions/copy_history.md) table function.

The table-level data loading activity has very low latency and includes bulk data loading performed using COPY INTO statements, continuous data
loading using pipes, and files loaded through the web interface.

### Prerequisites

You must use a role that has one of the following:

* The MONITOR privilege on your Snowflake account.
* The USAGE privilege on the database and schema that contain the table, and any privilege on the table.

If you use a role that does not have the MONITOR privilege on the pipe, pipe details are masked as NULL.

Viewing the Copy History details for a database in Snowsight or running the table function requires a warehouse.
If you have a default warehouse for your user profile, Snowsight uses that warehouse. You can switch warehouses at any time.

### Review table-level Copy History

To review the copy history for a table, locate and open the table for which you want to review activity:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Locate and select the database with the table for which you want to review activity.
4. Select the schema with the table for which you want to review activity.
5. Select Tables and select the table.
6. In the table details, select the Copy History tab.

The Copies Over Time graph provides a visualization of data loading over a given period of time.
By default, the graph shows a 7-day history with each bar on the graph representing one day.

Select a bar on the graph to filter the Copies table by that day.

You can filter by the following:

* Time range, up to 14 days.
* Status of the data loading activity, such as All (default), In progress, Loaded, Failed, Partially loaded,
  and Skipped.
* The pipe used to load the data.

You can also search the column values in the Copies table for specific data loading activities.

Select  (Open underlying SQL query in worksheet) to open a worksheet that contains the SQL query used to
populate the table. The SQL query is based on the filters you select.

---
title: Monitor dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md
section: User Guide
---

# Monitor dbt Projects on Snowflake

This topic explains the ways you can use monitoring features for dbt Projects on Snowflake to inspect dbt project executions—–manual or task-scheduled–—and how to view logs and artifacts.

| Section | Description |
| --- | --- |
| Enable monitoring features for dbt projects | Capture logging and tracing events for a dbt project object and for any scheduled task that runs it. To enable this feature, you must set logging, tracing, and metrics on the schema where the dbt project object and task are deployed. |
| Monitor scheduled executions of dbt project objects | In Snowsight, in the navigation menu, select Transformation » dbt Projects to view run history, task graphs, and query details for dbt project objects. When a workspace is connected to a dbt project object that runs according to a task schedule, you can open task-run history and task graphs from within the workspace. |
| Access dbt artifacts and logs programmatically | Use the DBT_PROJECT_EXECUTION_HISTORY table function and dbt system functions to access dbt artifacts and logs programmatically. |

## Enable monitoring features for dbt projects

To enable monitoring features for your dbt project object, set LOG_LEVEL, TRACE_LEVEL, and METRIC_LEVEL on the database and schema where your dbt project object is created, as shown in the following SQL example:

```sqlexample
ALTER SCHEMA my_db.my_dbt_project_schema SET LOG_LEVEL = 'INFO';
ALTER SCHEMA my_db.my_dbt_project_schema SET TRACE_LEVEL = 'ALWAYS';
ALTER SCHEMA my_db.my_dbt_project_schema SET METRIC_LEVEL = 'ALL';
```

## Monitor scheduled executions of dbt project objects

When you use a task to run a dbt project on a schedule and have a workspace connected to a dbt project object, you can use the workspace for dbt Projects on Snowflake to quickly access monitoring information for task-run history and a task graph, if applicable.

> **Note:**
>
> This feature is only available for workspaces that are connected to a dbt project object.

**To monitor scheduled execution of a dbt project object from a workspace:**

1. From the dbt project menu in the upper right of the workspace editor, under Scheduled runs, choose View schedules.
2. From the list, select the schedule (task) that you want to inspect, and then choose View details.

   The information pane for the task opens, where you can view Task details, the task Graph (if applicable), and Run history of this task. For more information, see [View tasks and task graphs in Snowsight](../ui-snowsight-tasks.md).
3. From the Run history for any scheduled dbt project run in the list, select the Open query history button on the far right to view query details, the query profile, and the query telemetry for the run. For more information, see [Review details and profile of a specific query](../ui-snowsight-activity.md).

## Monitor dbt projects in Snowsight

You can use Monitoring in Snowsight to view detailed monitoring information about dbt project executions (runs). You must use a role with the MONITOR privilege to view monitoring information for the dbt project object.
For more information, see [Access control for dbt projects on Snowflake](dbt-projects-on-snowflake-access-control.md).

1. In the navigation menu, select Transformation » dbt Projects. A histogram shows the frequency of dbt project runs and a list of projects that have run.

   The list of dbt projects includes columns with the following information. You can filter the list by date range, command, and run status.

   * PROJECT - The name of the dbt project object and the number of executions (runs) in the selected time period.
   * LAST COMMAND - The dbt command that executed during the last run.
   * LAST RUN STATUS - The result of the run: Succeeded, Executing, or Failed.
   * LAST RUN - The elapsed time since the last run. To reverse the sort order, select the column header. The most recent run is shown first by default.
   * PREVIOUS RUNS - The number of runs in the selected time period by status.
   * DATABASE and SCHEMA - The database and schema where the dbt project object is saved.
   * LAST RUN PARAMETERS - The dbt command-line arguments (ARGS) specified in the EXECUTE DBT PROJECT command for the last dbt project run.
2. To inspect individual project runs, select a dbt project object from the list.

   The dbt project details page in the database object explorer opens for that dbt project object.

   The Run history tab is selected by default, with the following information for each job run in the selected time period:

   * COMMAND - The dbt command that executed during the last run.
   * STATUS - The result of the run: Succeeded, Executing, or Failed.
   * RUN TIME - The elapsed time since the last run. To reverse the sort order, select the column header. The most recent run is shown first by default.
   * PARAMETERS The dbt command-line arguments (ARGS) specified in the EXECUTE DBT PROJECT command for the last dbt project run.
3. To see job details for a run, select it from the list.

   The dbt run details pane opens, which includes the following tabs:

   * The Job details tab is selected by default and displays the following information:

     + Status - The result of the run: Succeeded, Executing, or Failed..
     + Start time, End time, and Duration - The time that the run started, the time it ended, and how long it took to run.
     + Warehouse size - The size of the warehouse that was used to execute the run.
     + Query ID - The unique identifier for the query that executed the dbt project command. To view the query details in query history, select the query ID.
     + SQL text - The EXECUTE DBT PROJECT command that executed.
     + dbt <command> - For the dbt command that ran (for example, `run` or `build`), shows the dbt model, the time taken for the run to execute, and the status of that model run.
   * The Output tab shows the stdout generated by the dbt project during the run.
   * The Trace tab shows the trace information generated by the dbt project during the run. For more information about traces, see [Viewing trace data](../../developer-guide/logging-tracing/tracing-accessing-events.md).
4. To see more detailed query information, from the Job details tab, select the Query ID.

   The query history page for the job run query opens with tabs to view Query Details, the Query Profile, and Query Telemetry for the dbt run that you selected.

   For more information, see [Review details and profile of a specific query](../ui-snowsight-activity.md).

### View the query history DAG

The Query Details for a dbt project run includes a DAG tab that visualizes what ran during an
execution and the results for each model. This differs from the DAG on the project details page, which serves as a
documentation layer for your project, including models, tests, sources, and their dependencies.

The query history DAG is built from the `manifest.json` and `run_results.json` artifacts produced during an execution. Select a
node in the DAG to open a side panel with details for that specific query, including the query ID and any error messages if
the query failed.

To view the query history DAG:

1. In the navigation menu, select Transformations » dbt Projects.
2. From the list of dbt projects, select a project.
3. On the Run History tab, select a run to open the Query Details for that execution.
4. Select the DAG tab.

> **Note:**
>
> If the query history DAG shows “No data available,” the run likely failed before `run_results.json` could be generated.
> For more information, see [Limitations for the query history DAG](dbt-projects-on-snowflake-limitations.md).

## Access dbt artifacts and logs programmatically

Use the [DBT_PROJECT_EXECUTION_HISTORY](../../sql-reference/functions/dbt_project_execution_history.md) table function and the following system functions to access dbt artifacts and logs programmatically.

| Function | What it returns | Typical use | Notes |
| --- | --- | --- | --- |
| [SYSTEM$GET_DBT_LOG](../../sql-reference/functions/system_get_dbt_log.md) | Text log output (the run’s log tail) | Quick debugging in SQL. For example, see errors and warnings without downloading files. | Returns log content; nothing is created or moved. |
| [SYSTEM$LOCATE_DBT_ARTIFACTS](../../sql-reference/functions/system_locate_dbt_artifacts.md) | Folder path (for example, `snow://…/results/query_id_…/`) containing artifact files such as `manifest.json`, compiled SQL, logs. | Browse or copy specific files with LIST, GET, or COPY FILES. | Just a locator (a URL); you still run GET/COPY FILES to fetch. |
| [SYSTEM$LOCATE_DBT_ARCHIVE](../../sql-reference/functions/system_locate_dbt_archive.md) | Single ZIP file URL (for example, `…/dbt_artifacts.zip`). | Handy when you want to download one file (for example, with GET). | Use `GET '<url>' file:///local/dir` to download. |

### Get logs and download a ZIP file of the latest dbt project query

The following example queries Snowflake’s dbt execution history to show the most recent query ID for the dbt Project. It pulls the log output
for that execution and returns the location of the zipped dbt artifacts for that execution.

The Snowflake CLI example downloads the artifacts ZIP file or specific files (like `manifest.json`) to your local folder using GET.

To download the ZIP file from Snowsight, navigate to Monitoring » Query History. Select the query, navigate to Query Details,
and select Download Build Artifacts under dbt Output.

You must use a role with the OWNERSHIP, USAGE, or MONITOR privilege on your dbt Projects.

SQLSnowflake CLI

```sqlexample
--Look up the most recent dbt Project execution
SET latest_query_id = (SELECT query_id
   FROM TABLE(INFORMATION_SCHEMA.DBT_PROJECT_EXECUTION_HISTORY())
   WHERE OBJECT_NAME = 'MY_DBT_PROJECT'
   ORDER BY query_end_time DESC LIMIT 1);

--Get the dbt run logs for the most recent dbt Project execution
SELECT SYSTEM$GET_DBT_LOG($latest_query_id);
```

```output
============================== 15:14:53.100781 | 46d19186-61b8-4442-8339-53c771083f16 ==============================
[0m15:14:53.100781 [info ] [Dummy-1   ]: Running with dbt=1.9.4
...
[0m15:14:58.198545 [debug] [Dummy-1   ]: Command `cli run` succeeded at 15:14:58.198121 after 5.19 seconds
```

To view the stage path where Snowflake stored the dbt Project run’s artifacts (that is, the results folder for that execution), use the
SYSTEM$LOCATE_DBT_ARTIFACTS function. You can then use that path with `GET` or `COPY FILES` with the Snowflake CLI to download
things like `manifest.json`, compiled SQL, or logs.

```sqlexample
--Get the location of the dbt Project archive ZIP file (see all files)
SELECT SYSTEM$LOCATE_DBT_ARTIFACTS($latest_query_id);
```

```output
+-------------------------------------------------------------------------------------------------+
| SYSTEM$LOCATE_DBT_ARTIFACTS($LATEST_QUERY_ID)                                                   |
+-------------------------------------------------------------------------------------------------+
| snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01c01096-010c-0ccb-0000-a99506bd199e/ |
+-------------------------------------------------------------------------------------------------+
```

```sqlexample
--List all the files of a dbt run
ls 'snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf3f5a-010b-4d87-0000-53493abb7cce/';
```

You can also create a fresh internal stage, locate the Snowflake-managed path for the specified dbt Project run’s artifacts, and copy those
artifacts into your stage for retrieval, as shown in the following example:

```sqlexample
CREATE OR REPLACE STAGE my_dbt_stage ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');

SELECT SYSTEM$LOCATE_DBT_ARTIFACTS($latest_query_id);
```

```output
snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf51c1-010b-5676-0000-53493ae6db02/
```

```sqlexample
COPY FILES INTO @my_dbt_stage/results/ FROM 'snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf51c1-010b-5676-0000-53493ae6db02/';
```

```output
results/dbt_artifacts.zip
results/logs/dbt.log
results/target/manifest.json
results/target/semantic_manifest.json
```

```snowcli
snowsql -q "SELECT query_id
   FROM TABLE(INFORMATION_SCHEMA.DBT_PROJECT_EXECUTION_HISTORY())
   WHERE OBJECT_NAME = 'MY_DBT_PROJECT'
   ORDER BY query_end_time DESC LIMIT 1;"

snowsql -q "SELECT SYSTEM\$GET_DBT_LOG('01bf3f89-0300-0001-0000-0000000c1229')"
```

```output
| ============================== 11:17:39.152234 | 4df65841-7aa3-40e2-81cb-2007c09c2b81
| 11:17:39.152234 [info ] [Dummy-1   ]: Running with dbt=1.9.4
....
```

```snowcli
snowsql -q "SELECT SYSTEM\$LOCATE_DBT_ARCHIVE('01bf3f89-0300-0001-0000-0000000c1229')"
```

```output
snow://dbt_project/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf3f89-0300-0001-0000-0000000c1229/dbt_artifacts.zip
```

```snowcli
snowsql -q "GET 'snow://dbt_project/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf3f89-0300-0001-0000-0000000c1229/dbt_artifacts.zip' file:///Users/user_name/Code/temp"
```

```output
Type SQL statements or !help
+-----------------------------------------------------------------+--------+------------+-----
| file                                                            |   size | status    | ....
|-----------------------------------------------------------------+--------+------------+-----
| query_id_01bf3f89-0300-0001-0000-0000000c1229/dbt_artifacts.zip | 137351 | DOWNLOADED |...
+-----------------------------------------------------------------+--------+------------+-----
```

---
title: Monitor dynamic table performance
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance-monitor.md
section: User Guide
---

# Monitor dynamic table performance

Performance monitoring helps you with the following tasks:

* Identify slow or costly dynamic table refreshes.
* Diagnose bottlenecks.
* Measure the impact of optimizations.

This topic explains what to look for to monitor dynamic table performance and how to diagnose issues.
For information about monitoring tools, see [Monitor dynamic tables](dynamic-tables-monitor.md).

> **Tip:**
>
> For a hands-on example, see
> [Tutorial: Optimize dynamic table performance for SCD Type 1 workloads](tutorials/optimize-dynamic-table-performance.md).

## Key performance indicators

To monitor dynamic table performance, focus on the metrics described in this section.

### Refresh duration

Refresh duration measures how long each refresh takes to complete. To spot performance
degradation, track refresh duration over time.

Warning signs:

* **Duration increases over time**: Growing data volumes or degrading
  [data locality](dynamic-tables-performance-optimize.md) can cause refresh times to steadily increase.
* **Duration approaches target lag**: When refreshes take nearly as long as your target lag,
  you might not meet data freshness requirements.
* **High variance in duration**: Large swings in refresh time might indicate workload spikes or
  resource contention.

To view refresh duration, see [Monitor the refresh status for your dynamic tables](dynamic-tables-monitor.md).

### Lag metrics

Lag metrics show how well your dynamic table meets its freshness target. For information about
how target lag works, see [Understanding dynamic table target lag](dynamic-tables-target-lag.md).

Key metrics:

* **Actual lag**: The time between when source data changed and when the dynamic table
  reflected those changes.
* **Time within target lag ratio**: The percentage of time a table stayed within its target lag.
  A ratio below one indicates that the pipeline isn’t meeting its freshness goal.
* **Maximum lag**: The longest actual lag during a given period.

To view lag metrics, see [Monitor the refresh status for your dynamic tables](dynamic-tables-monitor.md).

### Partition statistics

For incremental refreshes, the number of partitions scanned should be proportional to the
data that changed, not the total table size. High partition scans indicate poor data locality.

Warning signs:

* Scanning a large percentage of total partitions during incremental refresh.
* Partition scans increasing over time without corresponding data growth.

To view partition statistics, see Analyze query profiles.

For guidance on improving data locality, see [Improve data locality](dynamic-tables-performance-optimize.md).

### Refresh mode

The refresh mode directly affects performance. Verify that your dynamic table uses the
expected mode.

To check refresh mode, use [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) and review the
`refresh_mode` and `refresh_mode_reason` columns. In Snowsight, view the
refresh mode in the object header.

For guidance on choosing the right refresh mode, see [Choose a refresh mode](dynamic-tables-performance-optimize.md).

## Diagnose slow refreshes

When refreshes take longer than expected, follow these steps to identify the cause:

1. Check the refresh history for trends in refresh duration, such as gradual increases or sudden spikes
   ([Monitor the refresh status for your dynamic tables](dynamic-tables-monitor.md)).
2. Review the query profile to identify bottlenecks (Analyze query profiles):

   * High partition scans suggest poor [data locality](dynamic-tables-performance-optimize.md).
   * Bytes spilled suggest that the warehouse is too small.
   * Specific operators taking a long time might indicate an opportunity to [optimize your dynamic table
     query](dynamic-tables-performance-optimize.md).
3. Check whether lag consistently exceeds your target, which indicates that refreshes might not keep up
   with your data volume ([Monitor the refresh status for your dynamic tables](dynamic-tables-monitor.md)).
4. Review upstream dependencies to check whether upstream tables cause delays or produce
   large volumes of changes.

   In the Graph view in Snowsight, look for the following conditions:

   * Upstream tables executing a refresh (shown with `executing` status).
   * Failed or suspended upstream tables.
   * Upstream tables taking longer than usual to refresh.

   To access the Graph view, see [View the graph of tables connected to your dynamic tables](dynamic-tables-monitor.md).
5. Check the volume of changes that the dynamic table processes, because large volumes of changes
   from upstream dependencies can slow down refreshes.

   Use the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md)
   function to see how many rows changed in recent refreshes:

   ```sqlexample
   SELECT
     name,
     data_timestamp,
     statistics:numInsertedRows::INT AS rows_inserted,
     statistics:numDeletedRows::INT AS rows_deleted,
     refresh_action
   FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY(
     NAME => 'my_dynamic_table'
   ))
   ORDER BY data_timestamp DESC
   LIMIT 10;
   ```

   When change volume is high relative to total table size (more than five percent of the table rows), consider
   using full refresh mode instead.

### Common patterns and recommended actions

* **Refresh duration is stable, but lag is high**: Your target lag is probably too aggressive for
  the current warehouse size and data volume. Refreshes finish successfully but can’t keep up with incoming
  changes. Check whether your [target lag](dynamic-tables-performance-optimize.md) and
  [warehouse resources](dynamic-tables-performance-optimize.md) match your data volume.
* **Refresh duration suddenly spikes and bytes spilled are high**: The warehouse doesn’t have enough memory
  to process the refresh, either because the warehouse is too small or because other queries are running at
  the same time. [Increase the warehouse size](dynamic-tables-performance-optimize.md) or move dynamic table
  refreshes to a dedicated warehouse.
* **Partition scans increase over time, but data volume stays the same**: Your data locality is poor, which
  forces Snowflake to scan more partitions than necessary. Check your
  [clustering keys and data locality](dynamic-tables-performance-optimize.md). Also check whether upstream changes
  affect many scattered partitions instead of a few contiguous ones.
* **Each refresh processes a large portion of the table (more than five percent of rows or partitions)**:
  Incremental refresh provides little benefit when most of the table changes frequently.
  [Switch to full refresh mode](dynamic-tables-performance-optimize.md) or redesign your pipeline to reduce the
  amount of data that changes with each refresh.

Based on your findings, apply appropriate fixes from
[Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

> **Note:**
>
> *Skipped or failed refreshes* are typically caused by configuration issues, not
> performance problems. See [Troubleshooting skipped or failed dynamic table refreshes](dynamic-tables-troubleshoot-refresh.md).

## Analyze query profiles

The [query profile](ui-snowsight-activity.md) shows detailed execution statistics for
each refresh. When a refresh is slow, the query profile helps you identify opportunities for optimization.

To access the query profile:

SnowsightSQL

1. Navigate to Transformation » Dynamic Tables.
2. Select the dynamic table and go to the Refresh History tab.
3. Select Show query profile next to the refresh you want to analyze.

First, get the query ID from refresh history:

```sqlexample
SELECT
  name,
  refresh_start_time,
  query_id
FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY(
  NAME => 'my_dynamic_table'
))
WHERE state = 'SUCCEEDED'
ORDER BY refresh_start_time DESC
LIMIT 5;
```

Then analyze the query profile with the [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md)
function:

```sqlexample
SELECT *
FROM TABLE(GET_QUERY_OPERATOR_STATS('<query_id>'));
```

### What to look for

* **Partitions scanned vs. pruned**: When partition scans are high relative to the total number of partitions,
  the cause is usually poor [data locality](dynamic-tables-performance-optimize.md) or missing clustering.
* **Time distribution**: Check which operators consume the most time. Operators that take
  disproportionately long might indicate an opportunity to optimize your query. See
  [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md) for operator-specific guidance.
* **Bytes spilled to local or remote storage**: High bytes spilled often indicate that the warehouse
  is too small for the refresh workload or that other queries running on the same warehouse
  reduce the memory available for refreshes. Consider [increasing the warehouse size](dynamic-tables-performance-optimize.md)
  or running dynamic table refreshes on a dedicated warehouse to reduce contention.

For more guidance on how to address issues found in the query profile, see
[Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

## Monitor warehouse usage

To check whether your warehouse can handle your dynamic table workload and
find ways to reduce costs, monitor warehouse usage.

### Key metrics to monitor

* **Bytes spilled**: Bytes spilled to local or remote storage means that the warehouse might be too small.
  Consider [increasing warehouse size](dynamic-tables-performance-optimize.md). For more details on identifying
  and troubleshooting bytes spilled, see [Finding queries that spill to storage](performance-query-warehouse-memory.md).
* **Warehouse utilization**: Check whether the warehouse has enough resources for refresh workloads.
  Low utilization means you might have an oversized warehouse. High queue times mean your warehouse
  is too small or runs too many concurrent queries.
* **Query queuing**: Queued queries delay refreshes. If refreshes frequently queue,
  [increase warehouse size](dynamic-tables-performance-optimize.md), use a dedicated warehouse for
  dynamic table refreshes, or consider a multi-cluster warehouse to handle variable workloads.
* **Credit usage**: Track credits to balance performance with costs. Monitor regularly to find
  opportunities to right-size warehouses or adjust refresh schedules.

To view warehouse usage and queue times, see [Reducing queues](performance-query-warehouse-queue.md).
Optimize warehouse configuration for dynamic tables with
[Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

## Monitor dependencies

Dependencies between dynamic tables can affect performance. Performance issues in upstream
tables cascade to downstream tables because a downstream table must wait for upstream tables to
complete their refreshes before it can start its own refresh.

To diagnose performance issues related to upstream dependencies, see
Diagnose slow refreshes.

To view the graph of dependencies, see [View the graph of tables connected to your dynamic tables](dynamic-tables-monitor.md).

## Set up alerts for performance issues

You can set up alerts to notify you when performance degrades. We recommend creating alerts
for the following conditions:

* Refresh duration exceeds a threshold.
* Lag consistently misses the target.

Alerts use event tables to track refresh events. For setup instructions, see
[Event table monitoring and alerts for dynamic tables](dynamic-tables-monitor-event-table-alerts.md).

---
title: Monitor dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-monitor.md
section: User Guide
---

# Monitor dynamic tables

This topic describes how to view and monitor the dynamic tables in your pipelines. For
guidance on what to look for when diagnosing performance issues, see
[Monitor dynamic table performance](dynamic-tables-performance-monitor.md).

| Section | Description |
| --- | --- |
| List dynamic tables or view information on specific columns | List the dynamic tables in a schema and view information about them. |
| View the graph of tables connected to your dynamic tables | See the graph of tables connected to your dynamic tables. |
| Monitor your dynamic tables using SQL table functions | Monitor your dynamic tables using SQL table functions. |
| Monitor the refresh status for your dynamic tables | View the refresh status for your dynamic tables. |

## List dynamic tables or view information on specific columns

To list the dynamic tables in a schema and view information about those dynamic tables, you can use either the following SQL commands or
[Snowsight](ui-snowsight-gs.md), as long as you use a role that has the MONITOR privilege on the dynamic tables.

For more information, see [Privileges to view a dynamic table’s metadata](dynamic-tables-privileges.md).

SQLSnowsight

To list the dynamic tables in the current database (or in the account, if no database is currently in use), use the
[SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) command.

For example, to list the dynamic tables with names that start with `product_` in the database `mydb` and schema `myschema`, execute
the following SQL statement:

```sqlexample
SHOW DYNAMIC TABLES LIKE 'product_%' IN SCHEMA mydb.myschema;
```

```output
+-------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | created_on               | name       | database_name | schema_name | cluster_by | rows | bytes  | owner    | target_lag | refresh_mode | refresh_mode_reason  | warehouse | comment | text                            | automatic_clustering | scheduling_state | last_suspended_on | is_clone  | is_replica  | is_iceberg | data_timestamp           | owner_role_type |
  |-------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
  |2025-01-01 16:32:28 +0000 | product_dt | my_db         | my_schema   |            | 2    | 2048   | ORGADMIN | DOWNSTREAM | INCREMENTAL  | null                 | mywh      |         | create or replace dynamic table | OFF                  | ACTIVE           | null              | false     | false       | false      |2025-01-01 16:32:28 +0000 | ROLE            |
                                                                                                                                                                                         |  product dt ...                 |                                                                                                                                                 |                                                                                                                                                                                                                                                                                                                                                                                                                       |
  +-------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

To output information about the columns in a dynamic table, use the [DESCRIBE DYNAMIC TABLE](../sql-reference/sql/desc-dynamic-table.md) command.

For example, to list the columns in `my_dynamic_table`, execute the following SQL statement:

```sqlexample
DESC DYNAMIC TABLE my_dynamic_table;
```

```output
+-------------------+--------------------------------------------------------------------------------------------------------------------------+
  | name   | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name  | privacy domain |
  |-------------------+------------------------------------------------------------------------------------------------------------------------|
  | AMOUNT | NUMBER(38,0) | COLUMN | Y     | null    | N           | N          | null  | null       | null    | null         | null           |                                                                                                                                                  |                                                                                                                                                                                                                                                                                                                                                                                                                       |
  +-------------------+------------------------------------------------------------------------------------------------------------------------+
```

Dynamic tables are also included in the results of the [TABLES view](../sql-reference/account-usage/tables.md).

To list the dynamic tables in a schema and view information about a specific dynamic table, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select a database and schema.
4. Select the Dynamic Tables tab or expand Dynamic Tables in the database object explorer.
5. To view information about a specific dynamic table, select the dynamic table from the list of dynamic tables in the Dynamic Tables
   tab or from the database object explorer.
6. The tabs in this page provide the following details about your selected dynamic table:

   * Table Details: Displays basic information about the dynamic table, including:
   > * The scheduling state of your dynamic table.
   > * The last refresh status of your dynamic table. For failed refreshes, you can see more information about the error if you hover over
   >   the status.
   > * The current and target lag for your dynamic table.
   > * Whether [incremental refreshes or full refreshes](dynamic-tables-refresh.md) are used to update the table.
   > * The definition of the dynamic table.
   > * The tags for the dynamic table.
   > * The privileges granted for working with the dynamic table.

> * Columns: Information about the columns in the dynamic table.
> * Data Preview: A preview of up to 100 rows of the data in the dynamic table.
> * Graph: Displays the [directed acyclic graph (DAG)](dynamic-tables-create.md) that includes this dynamic
>   table.
> * Refresh History: Displays the history of refreshes and the lag metrics.

## View the graph of tables connected to your dynamic tables

Viewing dependencies is particularly useful for troubleshooting dynamic table chains. In Snowsight, you can visualize which dynamic
tables a given dynamic table depends on using the lineage graph. For example, you can identify the following:

* Upstream dependencies where a dynamic table pulls data from.
* Downstream dependencies that might be impacted by changes to a dynamic table.

Dependencies can impact refresh performance. For example, suppose your dynamic table’s upstream table has a large data load added just before
its scheduled refresh. Your dynamic table will wait for it to finish the refresh, causing it to miss its target lag. In the lineage graph,
you’d see the input table marked as “executing,” indicating the delay.

To view the graph of a particular dynamic table, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Select your dynamic table. The Graph view is displayed by default. This displays the graph with the node for the dynamic table
   selected. The Details pane on the right displays information about its lag metrics and configuration.
4. To display the details of a different table in the graph, select that table.

To update the graph, select the refresh button in the bar above the graph.

If a refresh failed due to an UPSTREAM_FAILED error code, you can use the graph to visualize which upstream table caused the failure.

To view the full details of a table in the graph, see List dynamic tables or view information on specific columns.

## Monitor your dynamic tables using SQL table functions

Use the following INFORMATION_SCHEMA table functions to monitor your dynamic tables:

* [DYNAMIC_TABLES](../sql-reference/functions/dynamic_tables.md): Returns metadata about your dynamic tables, including aggregate lag metrics and the status
  of the most recent refreshes, within seven days of the current time.
* [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md): Returns information about each completed and running refresh of your dynamic
  tables, including refresh status and trigger, and the target lag.

  + [DYNAMIC_TABLE_REFRESH_HISTORY view](../sql-reference/account-usage/dynamic_table_refresh_history.md): This Account Usage view also displays information for dynamic table
    refresh history. It is useful for debugging issues that are for longer than the DYNAMIC_TABLE_REFRESH_HISTORY table function’s data
    retention time (seven days).
* [DYNAMIC_TABLE_GRAPH_HISTORY](../sql-reference/functions/dynamic_table_graph_history.md): Returns information that provides the history of each dynamic table, its
  properties, and its dependencies on other tables and dynamic tables.

  You can use this table function to get a snapshot of the dependency tree of dynamic tables at a given point in time.

  The output also reflects the changes made to the properties of a dynamic table over time. Each row represents a dynamic table
  and a specific set of properties. If you change a property of a dynamic table (for example, the target lag), the function returns the most
  up to date property.

## Monitor the refresh status for your dynamic tables

This section explains how to view the refresh status of all or specific dynamic tables.

* For guidance on what to look for when diagnosing slow refreshes, see
  [Monitor dynamic table performance](dynamic-tables-performance-monitor.md).
* For troubleshooting skipped or failed refreshes, see
  [Troubleshooting skipped or failed dynamic table refreshes](dynamic-tables-troubleshoot-refresh.md).

### Monitor the refreshes for all your dynamic tables

You can use Snowsight or the DYNAMIC_TABLES table function to view the refresh status for all your dynamic tables.

SnowsightSQL

Sign in to [Snowsight](ui-snowsight-gs.md). In the navigation menu, select Transformation » Dynamic tables.

You can view the state and last refresh status for all your dynamic tables on this page. You can also filter by database or schema to
narrow the results.

[DYNAMIC_TABLES](../sql-reference/functions/dynamic_tables.md) provides information about all of the dynamic tables in your account.

The following example retrieves the information about the state and target lag for all dynamic tables in the account and their associated
database and schema.

```sqlexample
SELECT
  name,
  database_name,
  schema_name,
  scheduling_state,
  target_lag_type,
  target_lag_sec,
FROM
  TABLE (
    INFORMATION_SCHEMA.DYNAMIC_TABLES ()
  )
ORDER BY
  name;
```

```output
+--------------------+------------------------------+--------------------------------------------------------------------------------------------------+-----------------+----------------+
| NAME               | DATABASE_NAME | SCHEMA_NAME | SCHEDULING_STATE                                                                                  | TARGET_LAG_TYPE | TARGET_LAG_SEC |
|--------------------+------------------------------+--------------------------------------------------------------------------------------------------|-----------------+----------------+
| MY_DYNAMIC_TABLE_1 | MY_DB_1       | MY_SCHEMA_1 | {                                                                                                 |                 |                |
|                    |               |             |    "reason_code": "UPSTREAM_SUSPENDED_DUE_TO_ERRORS",                                             |                 |                |
|                    |               |             |    "reason_message": "The DT was suspended because an input DT had 5 consecutive refresh errors", |                 |                |
|                    |               |             |    "state": "SUSPENDED",                                                                          |                 |                |
|                    |               |             |    "suspended_on": "2025-04-14 11:49:09.576 Z"                                                    | USER_DEFINED    | 60             |
|                    |               |             |  }                                                                                                |                 |                |
| MY_DYNAMIC_TABLE_2 | MY_DB_2       | MY_SCHEMA_2 | null                                                                                              |                 |                |
+--------------------+------------------------------+--------------------------------------------------------------------------------------------------+-----------------+----------------|
```

The following example retrieves the state and information about each state for refresh for all dynamic tables in the account.

```sqlexample
-- latest_data_timestamp is the refresh timestamp associated with last successful refresh.
SELECT
  name,
  last_completed_refresh_state,
  last_completed_refresh_state_code,
  last_completed_refresh_state_message,
  latest_data_timestamp,
  time_within_target_lag_ratio,
  maximum_lag_sec,
  executing_refresh_query_id
FROM
  TABLE (
    INFORMATION_SCHEMA.DYNAMIC_TABLES ()
  )
ORDER BY
  name;
```

```output
-- Both dynamic tables in the example below have a target lag of one minute.

+--------------------+------------------------------+-----------------------------------+-----------------------------------------------+-----------------------+------------------------------+-----------------+----------------------------+
| NAME               | LAST_COMPLETED_REFRESH_STATE | LAST_COMPLETED_REFRESH_STATE_CODE | LAST_COMPLETED_REFRESH_STATE_MESSAGE          | LATEST_DATA_TIMESTAMP | TIME_WITHIN_TARGET_LAG_RATIO | MAXIMUM_LAG_SEC | EXECUTING_REFRESH_QUERY_ID |
|--------------------+------------------------------+-----------------------------------+-----------------------------------------------|-----------------------+------------------------------+-----------------+----------------------------+
| MY_DYNAMIC_TABLE_1 | UPSTREAM_FAILED              | UPSTREAM_FAILURE                  | Skipped refreshing because an input DT failed | 2025-04-12 09:00:48   | null                         | null            | null                       |
| MY_DYNAMIC_TABLE_2 | SUCCEEDED                    | SUCCESS                           | null                                          | 2025-04-12 09:01:36   | 0.999                        | 125             | null                       |
+--------------------+------------------------------+-----------------------------------+-----------------------------------------------+-----------------------+------------------------------+-----------------+----------------------------+
```

### Monitor all the refreshes for a specific dynamic table

You can use Snowsight or the DYNAMIC_TABLES_REFRESH_HISTORY table function to view the refresh history for a given dynamic table.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Select your dynamic table and then go to the Refresh History tab.

   This page displays your dynamic table’s refresh history, which includes information about each refresh’s status, duration, and actual
   lag time, and the number of rows changed with each refresh.

   It also displays your dynamic table’s lag metrics, which includes the percentage of the time within the target lag and the longest
   actual lag time during the given interval.

To view the refresh history for a specific dynamic table, use the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) table
function.

For example, if you want to view the refresh history for all the dynamic tables in the `my_db` database and `my_schema` schema, execute
the following statement:

```sqlexample
SELECT
  name,
  data_timestamp,
  state,
  state_code,
  state_message
    FROM TABLE (INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY (NAME_PREFIX => 'MY_DB.MY_SCHEMA')) ORDER BY data_timestamp desc;
```

```output
+--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------+
| NAME               | DATA_TIMESTAMP      | STATE     | STATE_CODE                   | STATE_MESSAGE                                                  |
|--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------|
| MY_DYNAMIC_TABLE_1 | 2025-04-12 09:01:36 | SKIPPED   | SKIP_DUE_TO_UPSTREAM_FAILURE | Skipped refreshing because an input DT failed.                 |
| MY_DYNAMIC_TABLE_1 | 2025-04-12 09:00:48 | SUCCEEDED |                              |                                                                |
| MY_DYNAMIC_TABLE_1 | 2025-04-12 09:00:00 | FAILED    | 100038                       | Numeric value 'Good' is not recognized.                        |
| MY_DYNAMIC_TABLE_2 | 2025-04-12 09:01:36 | SUCCEEDED |                              |                                                                |
| MY_DYNAMIC_TABLE_2 | 2025-04-12 09:00:48 | FAILED    | 091930                       | SQL compilation error: Change tracking is not enabled or has   |
|                    |                     |           |                              | been missing for the time range requested on table 'MY_TABLE'. |
| MY_DYNAMIC_TABLE_2 | 2025-04-12 09:00:00 | CANCELLED | 002724                       | Dynamic Table refresh job cancelled.                           |
+--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------+
```

To filter for refreshes that had errors, pass in the argument `ERROR_ONLY => TRUE`. For example:

```sqlexample
SELECT
  name,
  data_timestamp,
  state,
  state_code,
  state_message
    FROM TABLE (INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY (NAME_PREFIX => 'MY_DB.MY_SCHEMA', ERROR_ONLY => TRUE));
```

```output
+--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------+
| NAME               | DATA_TIMESTAMP      | STATE     | STATE_CODE                   | STATE_MESSAGE                                                  |
|--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------|
| MY_DYNAMIC_TABLE_1 | 2025-04-12 09:00:00 | FAILED    | 100038                       | Numeric value 'Good' is not recognized.                        |
| MY_DYNAMIC_TABLE_2 | 2025-04-12 09:00:48 | FAILED    | 091930                       | SQL compilation error: Change tracking is not enabled or has   |
|                    |                     |           |                              | been missing for the time range requested on table 'MY_TABLE'. |
| MY_DYNAMIC_TABLE_2 | 2025-04-12 09:00:00 | CANCELLED | 002724                       | Dynamic Table refresh job cancelled.                           |
+--------------------+---------------------+-----------+------------------------------+----------------------------------------------------------------+
```

---
title: Monitor events for Snowpipe
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-monitor-events.md
section: User Guide
---

# Monitor events for Snowpipe

You can configure Snowflake to record events that provide detailed information about the status of your pipes. These events are captured in the active event table associated with the pipe.

By monitoring these events, you can gain insights into the following areas:

* Pipe status changes: Track the operational state of your Snowpipes.
* File processing progress: Understand the journey of files through the Snowpipe system.
* Periodic, aggregated, ingestion statistics digest: Get summarized statistics on data ingestion.

Additionally, you can configure alerts for the following critical conditions:

* High volume of incoming files
* High ingestion latencies
* Pipe errors
* File errors

The following sections explain how to enable event logging for Snowpipe, configure the severity level for log events, and interpret the events recorded in the event table:

* Snowpipe event types: Learn about the different categories of events and their details.
* Set the severity level of events to capture: Configure which events are recorded based on their importance.
* Query the event table for Snowpipe events: Discover how to retrieve and analyze event data.
* Information logged for Snowpipe events: Understand the structure and meaning of the data within the event table columns.

> **Caution:**
>
> Logging events for Snowpipe incurs costs. For more information, see [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

## Snowpipe event types

Snowpipe events are identified by the `name` attribute within the `RECORD` column of your event table.

### file_lifecycle

`file_lifecycle` events track a file’s journey through the Snowpipe system. The state of a file can be `RECEIVED`, `INGESTED`, or `ERRORED`.

`RECEIVED`: An event is emitted when Snowpipe receives a file request. The pipe might skip this file if it was previously processed; in such cases, the `skipped` attribute indicates this.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "file_lifecycle",
    "severity_text": "DEBUG"
  },
  "RECORD_ATTRIBUTES": {
    "snow.file.path": "<a/path/to/a/file>"
  },
  "VALUE": {
    "notification_channel": "<notification_channel>",
    "file_content_key": "<file_content_key>",
    "last_modified_time": "<last_modified_time>",
    "state": "<received_or_skipped>"
  }
}
```

`INGESTED`: An event is emitted after the file is successfully ingested by Snowpipe.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "file_lifecycle",
    "severity_text": "DEBUG"
  },
  "RECORD_ATTRIBUTES": {
    "snow.file.path": "<a/path/to/a/file>"
  },
  "VALUE": {
    "notification_channel": "<notification_channel>",
    "file_content_key": "<file_content_key>",
    "state": "ingested"
  }
}
```

`ERRORED`: An event is emitted if the file failed to be ingested by Snowpipe.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "file_lifecycle",
    "severity_text": "ERROR"
  },
  "RECORD_ATTRIBUTES": {
    "snow.file.path": "<a/path/to/a/file>"
  },
  "VALUE": {
    "notification_channel": "<notification_channel>",
    "file_content_key": "<file_content_key>",
    "first_error_message": "<first_error_message>",
    "first_error_line_number": "<some_number>",
    "first_error_character_pos": "<some_character_pos>",
    "error_count": "<error_count>",
    "error_limit": "<error_limit>",
    "file_state": "FAILED"
  }
}
```

### notification_received

This event is emitted when Snowflake receives a notification message.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "notification_channel_name": "<notification_channel_name>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "notification_received",
    "severity_text": "TRACE"
  },
  "VALUE": {
    "file_path": "<a/path/to/a/file>",
    "file_content_key": "<file_content_key>",
    "upstream_event_time": "<upstream_event_time>"
  }
}
```

### notification_channel_errored

This event is emitted when an error occurs while Snowflake is reading messages from a notification channel. This typically indicates a user configuration error, such as an authorization issue.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "notification_channel_name": "<notification_channel_name>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "notification_channel_errored",
    "severity_text": "ERROR"
  },
  "VALUE": {
    "first_error_message": "<error_message>"
  }
}
```

### pipe_lifecycle

This event is emitted when the [status of a pipe](../sql-reference/functions/system_pipe_status.md) changes. The new status can be `RUNNING`, `PAUSED`, `STOPPED`, or `STALLED`.

For RUNNING or PAUSED pipe statuses:

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "pipe_lifecycle",
    "severity_text": "INFO"
  },
  "VALUE": {
    "state": "<running_or_paused>"
  }
}
```

For STOPPED_\* or STALLED_\* pipe statuses: These statuses indicate that a pipe has unexpectedly stopped processing files.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "pipe_lifecycle",
    "severity_text": "<WARN_or_ERROR>"
  },
  "VALUE": {
    "state": "<pipe_status>",
    "error_message": "<error_message>"
  }
}
```

The `severity_text` for `STOPPED_*` or `STALLED_*` states depends on the specific reason:

`WARN` if the pipe stopped because of the following reasons:

* `STOPPED_BY_SNOWFLAKE_ADMIN`
* `STOPPED_CLONED`
* `STOPPED_FEATURE_DISABLED`

`ERROR` if the pipe stopped because of the following reasons:

* `STOPPED_STAGE_ALTERED`
* `STOPPED_STAGE_DROPPED`
* `STOPPED_FILE_FORMAT_DROPPED`
* `STOPPED_NOTIFICATION_INTEGRATION_DROPPED`
* `STOPPED_MISSING_PIPE`
* `STOPPED_MISSING_TABLE`
* `STALLED_COMPILATION_ERROR`
* `STALLED_INITIALIZATION_ERROR`
* `STALLED_EXECUTION_ERROR`
* `STALLED_INTERNAL_ERROR`
* `STALLED_STAGE_PERMISSION_ERROR`

### pipe_throttled

This event is emitted if a Snowpipe is throttled.

```json
{
  "TIMESTAMP": "<some_timestamp>",
  "RESOURCE_ATTRIBUTES": {
    "snow.database.name": "<MY_DB_NAME>",
    "snow.schema.name": "<MY_SCHEMA_NAME>",
    "snow.pipe.name": "<MY_PIPE_NAME>"
  },
  "RECORD_TYPE": "EVENT",
  "RECORD": {
    "name": "pipe_throttled",
    "severity_text": "WARN"
  },
  "VALUE": {
    "throttled_files": "<throttled_file_name_list>"
  }
}
```

## Set the severity level of events to capture

To enable Snowpipe events to be recorded in an event table, you must set the [LOG_EVENT_LEVEL](../sql-reference/parameters.md) parameter at either the pipe level or the account level. `LOG_EVENT_LEVEL` determines which log events are captured based on their severity. For more information, see [Parameters](../sql-reference/parameters.md) and [New LOG_EVENT_LEVEL parameter to control events](../release-notes/bcr-bundles/2026_02/bcr-2229.md).

* `ERROR`: Use for events that signal a change that requires human intervention to resolve.
* `WARN`: Use for events that signal an issue that can be resolved without human intervention.
* `INFO`: Use for user-initiated events that are generally useful and aren’t high- volume events.
* `DEBUG`: Use for high-volume events.
* `TRACE`: The lowest level of logging, that captures very detailed information.

> **Caution:**
>
> If the severity level isn’t set at the account level or pipe level, no events are captured.

**Examples:**

To capture ERROR-level events for all objects in an account, run the following code:

```sqlexample
ALTER ACCOUNT <my_account_name> SET LOG_EVENT_LEVEL = ERROR;
```

To capture INFO-level events for a specific pipe, run the following code:

```sqlexample
ALTER PIPE <my_pipe_name> SET LOG_EVENT_LEVEL = INFO;
```

## Severity level for each event type

The following table summarizes the default or recommended severity for each Snowpipe event type when you use `LOG_EVENT_LEVEL`:

| Event | Severity |
| --- | --- |
| file_lifecycle - RECEIVED, INGESTED | DEBUG |
| file_lifecycle - ERRORED | ERROR |
| notification_received | TRACE |
| notification_channel_errored | ERROR |
| pipe_lifecycle - RUNNING, PAUSED | INFO |
| pipe_lifecycle - STOPPED | WARN or ERROR (see the following section) |
| pipe_throttled | WARN |

**pipe_lifecycle - STOPPED severity details:**

`WARN` if the pipe stopped because of the following reasons:

* `STOPPED_BY_SNOWFLAKE_ADMIN`
* `STOPPED_CLONED`
* `STOPPED_FEATURE_DISABLED`

`ERROR` if the pipe stopped because of the following reasons:

* `STOPPED_STAGE_ALTERED`
* `STOPPED_STAGE_DROPPED`
* `STOPPED_FILE_FORMAT_DROPPED`
* `STOPPED_NOTIFICATION_INTEGRATION_DROPPED`
* `STOPPED_MISSING_PIPE`
* `STOPPED_MISSING_TABLE`
* `STALLED_COMPILATION_ERROR`
* `STALLED_INITIALIZATION_ERROR`
* `STALLED_EXECUTION_ERROR`
* `STALLED_INTERNAL_ERROR`
* `STALLED_STAGE_PERMISSION_ERROR`

## Query the event table for Snowpipe events

Before you query, ensure that you have set up an event table and configured the severity level you want for events to be captured.

The following example query shows how to retrieve Snowpipe events, such as those generated during file ingestion:

```sqlexample
SELECT
    record_type,
    record:"name" AS event_name,
    record:"severity_text" AS log_level,
    resource_attributes:"snow.database.name" AS database_name,
    resource_attributes:"snow.schema.name" AS schema_name,
    resource_attributes:"snow.pipe.name" AS pipe_name,
    record_attributes,
    PARSE_JSON(value):file_content_key AS file_content_key,
    PARSE_JSON(value):state AS state
FROM {my_event_table_name}
ORDER BY state;
```

**Example output (successful file ingestion):**

The following output shows both RECEIVED and INGESTED events for a file, which indicates successful processing:

| SCOPE | RECORD_TYPE | EVENT_NAME | LOG_LEVEL | DATABASE_NAME | SCHEMA_NAME | PIPE_NAME | RECORD_ATTRIBUTES | FILE_CONTENT_KEY | STATE |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| [NULL] | EVENT | “file_lifecycle” | “DEBUG” | “TESTDB” | “TESTSH” | “MYPIPE” | { “snow.file.path”: “data_0_0_0.event” } | “ba1f2511fc30423bdbb183fe33f3dd0f” | “INGESTED” |
| [NULL] | EVENT | “file_lifecycle” | “DEBUG” | “TESTDB” | “TESTSH” | “MYPIPE” | { “snow.file.path”: “data_0_0_0.event” } | “ba1f2511fc30423bdbb183fe33f3dd0f” | “RECEIVED” |

## Information logged for Snowpipe events

The following sections describe the key columns and their contents within the event table for Snowpipe events. If a column isn’t explicitly listed in the following sections, its value is NULL for Snowpipe events.

## Event table column values

| Column | Data Type | Description |
| --- | --- | --- |
| timestamp | TIMESTAMP_NTZ | The UTC timestamp when the event was created. |
| resource_attributes | OBJECT | Attributes that identify the Snowpipe event, such as database, schema, and pipe names. |
| record_type | STRING | The type of event recorded. For Snowpipe events, this value is always EVENT. |
| record | OBJECT | Contains detailed information about the status of running the Snowpipe event, including name and severity_text. |
| value | VARIANT | Additional information specific to the Snowpipe event. If the Snowpipe task failed, this includes the error message. |

## Key-value pairs in the resource_attributes column

The `resource_attributes` column contains an OBJECT value with the following key-value pairs:

| Attribute Name | Attribute Type | Description | Example |
| --- | --- | --- | --- |
| snow.database.name | VARCHAR | The name of the database associated with the pipe. | “MY_DATABASE” |
| snow.schema.name | VARCHAR | The name of the schema associated with the pipe. | “MY_SCHEMA_NAME” |
| snow.pipe.name | VARCHAR | The name of the pipe. | “MY_PIPE_NAME” |
| notification_channel_name | VARCHAR | The name of the notification channel that received the message or encountered an error. | “arn:aws:sqs:us-west-2:774383465531:sf-snowpipe-AIDA3” |

## Key-value pairs in the record column

The `record` column contains an OBJECT value with the following key-value pairs:

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| name | VARCHAR | The name of the event. | “pipe_lifecycle” |
| severity_text | VARCHAR | The severity level of the event. | “INFO” |

---
title: Monitor events for task executions
source: https://docs.snowflake.com/en/user-guide/tasks-events.md
section: User Guide
---

# Monitor events for task executions

You can configure Snowflake to record an event that provides information about the status of the task execution. The event is
recorded in the [active event table](../developer-guide/logging-tracing/event-table-setting-up.md) associated with the task.

For example, suppose that you have [associated an event table with a database](../developer-guide/logging-tracing/event-table-setting-up.md). When a
task in that database executes, Snowflake records an event to that event table.

You can set up an [alert on new data](alerts.md) to monitor the event table. You can configure the alert
to [send a notification](notifications/about-notifications.md) when a task execution fails.

The next sections explain how to set up the event logging to capture the events, how to set up the alert, and how to interpret
the events recorded in the event table:

* Set the severity level of the events to capture
* Set up an alert on new data for task completion events
* Query the event table for task completion events
* Information logged for task events

> **Note:**
>
> Logging events for tasks incurs costs. See [Costs of telemetry data collection](../developer-guide/logging-tracing/logging-tracing-billing.md).

## Limitations

* Task events aren’t supported for Snowflake Native Apps.

### Set the severity level of the events to capture

To set up task events to be recorded to the event table,
[set the severity level of events](../developer-guide/logging-tracing/telemetry-levels.md) that you want captured in the event
table:

* `ERROR`: Events for failed task runs.
* `INFO`: Events for successful and failed task runs.

To set the level, set the [LOG_EVENT_LEVEL](../sql-reference/parameters.md) parameter for the account or object. You can set the level for:

* All objects in the account,
* All objects in a database or schema.
* A specific task.

> **Note:**
>
> If the severity level is not set on the account or object, no events will be captured.

For example:

* To capture ERROR-level task events for all supported objects in the account, execute
  [ALTER ACCOUNT SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-account.md):

  ```sqlexample
  ALTER ACCOUNT SET LOG_EVENT_LEVEL = ERROR;
  ```

  Setting `LOG_EVENT_LEVEL` at the account level applies to log events (record type EVENT) for supported workloads in the account, including tasks. It does not replace [LOG_LEVEL](../sql-reference/parameters.md) for log messages from logging APIs. For more information, see [Parameters](../sql-reference/parameters.md).
* To capture INFO-level task events for all supported objects in the database `my_db`, execute
  [ALTER DATABASE … SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-database.md):

  ```sqlexample
  ALTER DATABASE my_db SET LOG_EVENT_LEVEL = INFO;
  ```

  Similar to the case of setting the level on the account, setting the level on the database affects log events for supported object types in the database.
* To capture ERROR-level events for the task `my_task`, execute
  [ALTER TASK … SET LOG_EVENT_LEVEL](../sql-reference/sql/alter-task.md):

  ```sqlexample
  ALTER TASK my_task SET LOG_EVENT_LEVEL = ERROR;
  ```

### Set up an alert on new data for task completion events

After you set the severity level for logging events, you can set up an alert on new data to monitor the event table for new events
that indicate a failure in a task completion. An alert on new data is triggered when new rows in the event table are inserted
and meet the condition specified in the alert.

> **Note:**
>
> To create the alert on new data, you must use a role that has been granted the required privileges to query the event table.
>
> * If the alert condition queries the default event table ([SNOWFLAKE.TELEMETRY.EVENTS](../developer-guide/logging-tracing/event-table-setting-up.md))
>   predefined view ([SNOWFLAKE.TELEMETRY.EVENTS_VIEW view](../sql-reference/telemetry/events_view.md)),
>   see [Roles for access to the default event table and EVENTS_VIEW](../developer-guide/logging-tracing/event-table-setting-up.md).
>
>   To manage access to the EVENTS_VIEW view, see [Manage access to the EVENTS_VIEW view](../developer-guide/logging-tracing/event-table-setting-up.md).
> * If the alert condition queries a custom event table, see [Access control privileges for event tables](../developer-guide/logging-tracing/event-table-operations.md).
>
>   To manage access to a custom event table, see [Managing access to event table data](../developer-guide/logging-tracing/event-table-operations.md).

In the alert condition, to query for task completion events, select rows where
`resource_attributes:"snow.executable.type" = 'TASK'`. To narrow down the list of events, you can filter on the following
columns:

* To restrict the results to tasks in a specific database, use `resource_attributes:"snow.database.name"`.
* To return events where the task execution failed, use `value:state = 'FAILED'`.

For information on the values logged for a task execution event, see
Information logged for task events.

For example, the following statement creates an alert on new data that performs an action when task completions fail for tasks
in the database `my_db`. The example assumes that:

* Your active event table is the [default event table](../developer-guide/logging-tracing/event-table-setting-up.md) (SNOWFLAKE.TELEMETRY.EVENTS).
* You have [set up a webhook notification integration](notifications/webhook-notifications.md) for that Slack
  channel.

```sqlexample
CREATE ALERT my_alert_on_task_failures
  IF( EXISTS(
    SELECT * FROM SNOWFLAKE.TELEMETRY.EVENT_TABLE
      WHERE resource_attributes:"snow.executable.type" = 'task'
        AND resource_attributes:"snow.database.name" = 'my_db'
        AND record:"severity_text" = 'ERROR'
        AND value:"state" = 'FAILED'))
  THEN
    BEGIN
      LET result_str VARCHAR;
      (SELECT ARRAY_TO_STRING(ARRAY_AGG(name)::ARRAY, ',') INTO :result_str
         FROM (
           SELECT resource_attributes:"snow.executable.name"::VARCHAR name
             FROM TABLE(RESULT_SCAN(SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID()))
             LIMIT 10
         )
      );
      CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
        SNOWFLAKE.NOTIFICATION.TEXT_PLAIN(:result_str),
        '{"my_slack_integration": {}}'
      );
    END;
```

### Query the event table for task completion events

You can also query the event table for events that indicate that a task completion failed.

For information on the role that you need to use to query the event table and the conditions that you can use to filter the
results, see Set up an alert on new data for task completion events.

For example, to get the timestamp, task name, query ID, and error message for errors with tasks in the database `my_db`:

```sqlexample
SELECT
    timestamp,
    resource_attributes:"snow.executable.name"::VARCHAR AS task_name,
    resource_attributes:"snow.query.id"::VARCHAR AS query_id,
    value:message::VARCHAR AS error
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'TASK' AND
    resource_attributes:"snow.database.name" = 'MY_DB' AND
    value:state = 'FAILED'
  ORDER BY timestamp DESC;
```

```output
+-------------------------+-----------+--------------------------------------+------------------------------------------------------+
| TIMESTAMP               | TASK_NAME | QUERY_ID                             | ERROR                                                |
|-------------------------+-----------+--------------------------------------+------------------------------------------------------|
| 2025-02-18 00:21:19.461 | T1        | 01ba76b5-0107-e56d-0000-a995024f4222 | 002003: SQL compilation error:                       |
|                         |           |                                      | Object 'MY_TABLE' does not exist or not authorized.  |
+-------------------------+-----------+--------------------------------------+------------------------------------------------------+
```

The following example retrieves all columns for errors with tasks in the schema `my_schema`:

```sqlexample
SELECT *
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'FAILED' AND
    resource_attributes:"snow.schema.name" = 'MY_SCHEMA' AND
    value:state = 'FAILED'
  ORDER BY timestamp DESC;
```

```output
+-------------------------+-----------------+-------------------------+-------+----------+------------------------------------------------------------+-------+------------------+-------------+-------------------------------+-------------------+------------------------------------------------------------------------------------------------------+-----------+
| TIMESTAMP               | START_TIMESTAMP | OBSERVED_TIMESTAMP      | TRACE | RESOURCE | RESOURCE_ATTRIBUTES                                        | SCOPE | SCOPE_ATTRIBUTES | RECORD_TYPE | RECORD                        | RECORD_ATTRIBUTES | VALUE                                                                                                | EXEMPLARS |
|-------------------------+-----------------+-------------------------+-------+----------+------------------------------------------------------------+-------+------------------+-------------+-------------------------------+-------------------+------------------------------------------------------------------------------------------------------+-----------|
| 2025-02-18 00:21:19.461 | NULL            | 2025-02-18 00:21:19.461 | NULL  | NULL     | {                                                          | NULL  | NULL             | EVENT       | {                             | NULL              | {                                                                                                    | NULL      |
|                         |                 |                         |       |          |   "snow.database.id": 49,                                  |       |                  |             |   "name": "execution.status", |                   |   "message": "002003: SQL compilation error:\nObject 'EMP_TABLE' does not exist or not authorized.", |           |
|                         |                 |                         |       |          |   "snow.database.name": "MY_DB",                        |       |                  |                |   "severity_text": "ERROR"    |                   |   "state": "FAILED"                                                                                  |           |
|                         |                 |                         |       |          |   "snow.executable.id": 518,                               |       |                  |             | }                             |                   | }                                                                                                    |           |
|                         |                 |                         |       |          |   "snow.executable.name": "T1",                            |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.executable.type": "TASK",                          |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.owner.id": 2601,                                   |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.owner.name": "DATA_ADMIN",                         |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.owner.type": "ROLE",                               |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.query.id": "01ba76b5-0107-e56d-0000-a995024f4222", |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.schema.id": 411,                                   |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.schema.name": "MY_SCHEMA",                      |       |                  |             |                               |                   |                                                                                                      |              |
|                         |                 |                         |       |          |   "snow.warehouse.id": 41,                                 |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          |   "snow.warehouse.name": "INTAKE_WAREHOUSE"                |       |                  |             |                               |                   |                                                                                                      |           |
|                         |                 |                         |       |          | }                                                          |       |                  |             |                               |                   |                                                                                                      |           |
+-------------------------+-----------------+-------------------------+-------+----------+------------------------------------------------------------+-------+------------------+-------------+-------------------------------+-------------------+------------------------------------------------------------------------------------------------------+-----------+
```

### Information logged for task events

When a task runs, an event is logged to the event table. The following sections describe the event table row that represents the
event:

* Event table column values
* Key-value pairs in the resource_attributes column
* Key-value pairs in the record column

## Event table column values

When a task completes or fails, a row with the following values is inserted into the event table.

> **Note:**
>
> If a column is not listed below, the column value is NULL for the event.

| Column | Data type | Description |
| --- | --- | --- |
| `timestamp` | TIMESTAMP_NTZ | The UTC timestamp when an event was created. |
| `observed_timestamp` | TIMESTAMP_NTZ | A UTC time used for logs. Currently, this is the same value that is in the `timestamp` column. |
| `resource_attributes` | OBJECT | Attributes that identify the task that was executed. |
| `record_type` | STRING | The event type, which is `EVENT` for task executions. |
| `record` | OBJECT | Details about the status of the task execution. |
| `value` | VARIANT | The status of the task execution and, if the execution failed, the error message for the failure. |

## Key-value pairs in the `resource_attributes` column

The `resource_attributes` column contains an [OBJECT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Attribute name | Attribute type | Description | Example |
| --- | --- | --- | --- |
| `snow.database.id` | INTEGER | The internal/system-generated identifier of the database containing the task. | `12345` |
| `snow.database.name` | VARCHAR | The name of the database containing the task. | `MY_DATABASE` |
| `snow.executable.id` | INTEGER | The internal/system-generated identifier of the task that executed. | `12345` |
| `snow.executable.name` | VARCHAR | The name of the task that executed. | `MY_TASK` |
| `snow.executable.type` | VARCHAR | The type of the object. The value is `TASK` for task events. | `TASK` |
| `snow.owner.id` | INTEGER | The internal/system-generated identifier of the role with the OWNERSHIP privilege on the task. | `12345` |
| `snow.owner.name` | VARCHAR | The name of the role with the OWNERSHIP privilege on the task. | `MY_ROLE` |
| `snow.owner.type` | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. | `ROLE` |
| `snow.query.id` | VARCHAR | ID of the query that executed the task. | `01ba7614-0107-e56c-0000-a995024f304a` |
| `snow.schema.id` | INTEGER | The internal/system-generated identifier of the schema containing the task. | `12345` |
| `snow.schema.name` | VARCHAR | The name of the schema containing the task. | `MY_SCHEMA` |
| `snow.warehouse.id` | INTEGER | The internal/system-generated identifier of the warehouse used to execute the task. | `12345` |
| `snow.warehouse.name` | VARCHAR | The name of the warehouse used to execute the task. | `MY_WAREHOUSE` |

## Key-value pairs in the `record` column

The `record` column contains an [OBJECT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| `name` | VARCHAR | The name of the event. The value is `execution.status` for task executions. | `execution.status` |
| `severity_text` | VARCHAR | The severity level of the event, which is one of the following values:   * `INFO`: The task execution succeeded. * `ERROR`: The task execution failed. | `INFO` |

## Key-value pairs in the `value` column

The `value` column contains an [VARIANT](../sql-reference/data-types-semistructured.md) value with the following key-value pairs:

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| `state` | VARCHAR | The state of the task execution, which can be one of the following values:   * `SUCCEEDED`: The task execution succeeded. * `ERROR`: The task execution failed. | `SUCCEEDED` |
| `message` | VARCHAR | If the value in `state` is `ERROR`, this column includes the error message. |  |

---
title: Monitor hybrid table workloads
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-monitor-workload.md
section: User Guide
---

# Monitor hybrid table workloads

Unistore workloads that leverage hybrid tables will be different from many
analytical workloads that you are running in Snowflake. For example, your
workloads might contain fewer unique queries that take less time to run and
execute at a higher frequency. You have several options to monitor your
workloads.

> * Monitor transactions
> * Monitor workloads
> * Monitor overall workload health
> * Identify and investigate repeated queries

## Monitor transactions

Hybrid tables support Snowflake transaction monitoring features, including [SHOW TRANSACTIONS](../sql-reference/sql/show-transactions.md),
[DESCRIBE TRANSACTION](../sql-reference/sql/desc-transaction.md), [SHOW LOCKS](../sql-reference/sql/show-locks.md), and
[LOCK WAIT HISTORY](../sql-reference/account-usage/lock_wait_history.md).

The behavior of these commands and views for hybrid tables is consistent with the behavior for standard
Snowflake tables, except for the following changes:

* A new `ROW` lock type is introduced in the [SHOW LOCKS](../sql-reference/sql/show-locks.md) command to
  represent row locks against hybrid tables. The locks are summarized to show one transaction holding
  (one or multiple) row locks and another transaction waiting for these locks.
* [LOCK WAIT HISTORY](../sql-reference/account-usage/lock_wait_history.md) does not show schema-related information.
* LOCK_WAIT_HISTORY does not summarize BLOCKER_QUERIES. If a query is blocked by multiple blockers,
  then they will appear as multiple records in the view rather than as multiple entries in the
  BLOCKER_QUERIES JSON array for the single waiter record.
* For the result of SHOW LOCKS, and the LOCK_WAIT_HISTORY view:

  > + As the row locks are summarized, the lock-holding transaction is assumed to acquire the lock when it starts.
  > + Due to the potential high volume of Unistore transactions, only locks that have blocked other transaction(s)
  >   for an extended period (approximately 5 seconds) are shown.
  > + The lock-waiting transaction might still appear to be waiting for the locks even if it has acquired them
  >   (for no more than 1 minute). The accuracy of lock reporting will improve in future releases.
  > + If a statement that blocked a waiting query has completed and was a short-running query against hybrid
  >   tables, the following information for the blocker query is not shown in the BLOCKER_QUERY field
  >   of the waiting query record:
  >
  >   - Query UUID of the blocker query
  >   - Session ID of the blocker query
  >   - User name of the blocker query
  >   - Database ID of the blocker query
  >   - Database name of the blocker query

## Monitor workloads

To monitor your operational workloads effectively, use the
[AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md). This view enables
you to monitor the health of your workload, diagnose issues, and identify avenues
for optimization. The AGGREGATE_QUERY_HISTORY view aggregates query execution statistics
for a repeated parameterized query over a time interval so that it is easier
and more efficient to identify patterns in your workloads and queries over time. Note
that all Snowflake workloads and queries will be combined in the output of this view.

The AGGREGATE_QUERY_HISTORY view helps you answer the following questions about your workloads:

> * How many operations per second are being executed in my virtual warehouse?
> * Which queries are consuming the most total time or resources in my workload?
> * Has the performance of a specific query changed substantially over time?

To help improve performance and efficiency in your workload, individual
executions of low latency operations (under one second) will not be
stored in [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) nor will they
generate a unique query profile. Instead, aggregate statistics for repeated
executions of that query will be returned in the AGGREGATE_QUERY_HISTORY view.
You will also be able to view a sampled query profile for the query over a selected
time interval. For more information about this behavior, see [Usage notes](../sql-reference/account-usage/query_history.md).

> **Tip:**
>
> You can use the [Grouped Query History view](ui-snowsight-activity.md)
> in Snowsight to visualize performance and statistics for typical hybrid table workloads.
> This view does not capture all hybrid table activity, but it provides a good alternative to
> monitoring performance for a large volume of individual queries that are somewhat repetitive and
> run extremely fast.

## Monitor overall workload health

Use the AGGREGATE_QUERY_HISTORY view to monitor your overall workload
throughput and concurrency, and to investigate unexpected spikes or drops in
your workloads. For example:

```sqlexample
SELECT
    interval_start_time
    , SUM(calls) as execution_count
    , SUM(calls) / 60 as queries_per_second
    , COUNT(DISTINCT session_id) as unique_sessions
    , COUNT(user_name) as unique_users
FROM snowflake.account_usage.aggregate_query_history
WHERE warehouse_name = '<MY_WAREHOUSE>'
  AND interval_start_time > $START_DATE
  AND interval_start_time < $END_DATE
GROUP BY ALL;
```

You can also use aggregate query history to monitor for potential problems
with errors, queueing, lock blocking, or throttling. For example:

```sqlexample
WITH time_issues AS
(
    SELECT
        interval_start_time
        , SUM(transaction_blocked_time:"SUM") as transaction_blocked_time
        , SUM(queued_provisioning_time:"SUM") as queued_provisioning_time
        , SUM(queued_repair_time:"SUM") as queued_repair_time
        , SUM(queued_overload_time:"SUM") as queued_overload_time
        , SUM(hybrid_table_requests_throttled_count) as hybrid_table_requests_throttled_count
    FROM snowflake.account_usage.aggregate_query_history
    WHERE WAREHOUSE_NAME = '<MY_WAREHOUSE>'
      AND interval_start_time > $START_DATE
      AND interval_start_time < $END_DATE
    GROUP BY ALL
),
errors AS
(
    SELECT
        interval_start_time
        , SUM(value:"count") as error_count
    FROM
    (
        SELECT
            a.interval_start_time
            ,e.*
        FROM
            snowflake.account_usage.aggregate_query_history a,
            TABLE(flatten(input => errors)) e
        WHERE interval_start_time > $START_DATE
          AND interval_start_time < $END_DATE
  )
  GROUP BY ALL
)
    SELECT
        ts.interval_start_time
        , error_count
        , transaction_blocked_time
        , queued_provisioning_time
        , queued_repair_time
        , queued_overload_time
        , hybrid_table_requests_throttled_count
    FROM time_issues ts
    FULL JOIN errors e ON e.interval_start_time = ts.interval_start_time
;
```

Ordinarily, such metrics should remain low. If you see an unexpected spike, it is recommended that you
investigate the cause.

## Identify and investigate repeated queries

You may opt to optimize or investigate the performance of common and often
executed queries to improve the efficiency of your workload. Use the
AGGREGATE_QUERY_HISTORY view to identify top queries for a workload by
execution count. For example:

```sqlexample
SELECT
    query_parameterized_hash
    , any_value(query_text)
    , SUM(calls) as execution_count
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
          AND warehouse_name = '<MY_WAREHOUSE>'
          AND interval_start_time > '2024-02-01'
          AND interval_start_time < '2024-02-08'
GROUP BY
          query_parameterized_hash
ORDER BY execution_count DESC
;
```

You can choose to view metrics for the slowest queries. For example:

```sqlexample
SELECT
    query_parameterized_hash
    , any_value(query_text)
    , SUM(total_elapsed_time:"sum"::NUMBER) / SUM (calls) as avg_latency
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
          AND warehouse_name = '<MY_WAREHOUSE>'
          AND interval_start_time > '2024-02-01'
          AND interval_start_time < '2024-02-08'
GROUP BY
          query_parameterized_hash
ORDER BY avg_latency DESC
;
```

You can analyze the performance of a particular query over time to gain insight
into trends in latency. For example:

```sqlexample
SELECT
    interval_start_time
    , total_elapsed_time:"avg"::number avg_elapsed_time
    , total_elapsed_time:"min"::number min_elapsed_time
    , total_elapsed_time:"p90"::number p90_elapsed_time
    , total_elapsed_time:"p99"::number p99_elapsed_time
    , total_elapsed_time:"max"::number max_elapsed_time
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
          AND query_parameterized_hash = '<123456>'
          AND interval_start_time > '2024-02-01'
          AND interval_start_time < '2024-02-08'
ORDER BY interval_start_time DESC
;
```

This query calculates total query time. You can also modify the query to return
more granular metrics on the different phases of a query (compilation, execution,
queuing, and lock waiting). Aggregate statistics will be returned for each phase.

---
title: Monitor object tags
source: https://docs.snowflake.com/en/user-guide/object-tagging/monitor.md
section: User Guide
---

# Monitor object tags

You can monitor tags and how they’ve been implemented using SQL or Snowsight.

## Monitor tags with SQL

You can monitor tags with SQL by using two different Account Usage views, two Information Schema table functions, an Account Usage table
function, and a system function.

It can be helpful to think of two general approaches to determine how to monitor tag usage.

* Discover Tags
* Identify Assignments

### Discover tags

Snowflake supports the following options to list tags and to identify the string value for a given tag key.

* Identify tags in your account:

  Use the [TAGS](../../sql-reference/account-usage/tags.md) view in the Account Usage schema of the shared SNOWFLAKE database. This view
  can be thought of as a *catalog* for all tags in your Snowflake account that provides information on current and deleted tags. For
  example:

  > ```sqlexample
  > SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TAGS
  >   ORDER BY tag_name;
  > ```
* Identify a value for a given tag:

  Use the [SYSTEM$GET_TAG](../../sql-reference/functions/system_get_tag.md) system function to return the tag value assigned to the specified tag, and
  the Snowflake object or column.

  > ```sqlexample
  > SELECT SYSTEM$GET_TAG('cost_center', 'my_table', 'table');
  > ```

### Identify assignments

Snowflake supports different options to identify tag assignments, depending on whether the query needs to target the account or a
specific database, and whether you want to track tag inheritance.

* Account-level query with lineage:

  Use the Account Usage table function [TAG_REFERENCES_WITH_LINEAGE](../../sql-reference/functions/tag_references_with_lineage.md) to determine all of the objects that
  have a given tag key and tag value, including objects that inherited tags:

  > ```sqlexample
  > SELECT *
  >   FROM TABLE(
  >     SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES_WITH_LINEAGE(
  >       'my_db.my_schema.cost_center'
  >     )
  >   );
  > ```
* Account-level query without lineage:

  Use the Account Usage [TAG_REFERENCES](../../sql-reference/account-usage/tag_references.md) view to determine all of the objects that
  have a given tag key and tag value, but does not include objects that inherited the tag:

  > ```sqlexample
  > SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES
  >   ORDER BY tag_name, domain, object_id;
  > ```
* Database-level query, with lineage:

  Every Snowflake database includes an [Snowflake Information Schema](../../sql-reference/info-schema.md). Use the Information Schema table function
  [TAG_REFERENCES](../../sql-reference/functions/tag_references.md) to determine all of the objects that have a given tag, including objects that inherited the
  tag, in a given database:

  > ```sqlexample
  > SELECT *
  >   FROM TABLE(
  >     my_db.INFORMATION_SCHEMA.TAG_REFERENCES(
  >       'my_table',
  >       'table'
  >     )
  >   );
  > ```
* Database-level query for all of the tags on every column in a table or view, with lineage:

  Use the Information Schema table function [TAG_REFERENCES_ALL_COLUMNS](../../sql-reference/functions/tag_references_all_columns.md) to obtain all of the tags that
  are set on every column in a given table or view.

  Note that the domain `TABLE` must be used for all objects that contain columns, even if the object name is a view
  (that is, view, materialized view).

  > ```sqlexample
  > SELECT *
  >   FROM TABLE(
  >     INFORMATION_SCHEMA.TAG_REFERENCES_ALL_COLUMNS(
  >       'my_table',
  >       'table'
  >     )
  >   );
  > ```

## Monitor tags with Snowsight

You can use the Snowsight Governance & security » Tags & policies area to monitor and report on the usage of
policies and tags with tables, views, and columns. There are two different interfaces: Dashboard and Tagged Objects.

When using the Dashboard and the Tagged Objects interface, note the following details.

* The Dashboard and Tagged Objects interfaces require a running warehouse.
* Snowsight updates the Dashboard every 12 hours.
* The Tagged Objects information latency can be up to two hours and returns up to 1000 objects.

### Accessing the Governance area in Snowsight

To access the Tags & policies area, your Snowflake account must be [Enterprise Edition or higher](../intro-editions.md).
Additionally, you must do either of the following:

* Use the ACCOUNTADMIN role.
* Use an account role that is directly granted the GOVERNANCE_VIEWER and OBJECT_VIEWER database roles.

  You must use an account role with these database role grants. Currently, Snowsight does not evaluate role hierarchies
  and user-defined database roles that have access to tables, views, data access policies, and tags.

  To determine if your account role is granted these two database roles, use a [SHOW GRANTS](../../sql-reference/sql/show-grants.md) command:

  > ```sqlexample
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```
  >
  > ```output
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | created_on                    | privilege | granted_on    | name                        | granted_to | grantee_name    | grant_option | granted_by |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | 2024-01-24 17:12:26.984 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.GOVERNANCE_VIEWER | ROLE       | DATA_ENGINEER   | false        |            |
  > | 2024-01-24 17:12:47.967 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.OBJECT_VIEWER     | ROLE       | DATA_ENGINEER   | false        |            |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > ```

  If your account role is not granted either or both of these database roles, use the [GRANT DATABASE ROLE](../../sql-reference/sql/grant-database-role.md) command
  and run the SHOW GRANTS command again to confirm the grants:

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  > GRANT DATABASE ROLE SNOWFLAKE.GOVERNANCE_VIEWER TO ROLE data_engineer;
  > GRANT DATABASE ROLE SNOWFLAKE.OBJECT_VIEWER TO ROLE data_engineer;
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```

  For details about these database roles, see [SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md).

### Dashboard

As a data administrator, you can use the Dashboard interface to monitor tag and policy usage in the following ways.

* Coverage: specifies the count and percentage based on whether a table, view, or column has a policy or tag.
* Prevalence: lists and counts the most frequently used policies and tags.

The coverage and prevalence provide a snapshot as to how well the data is protected and tagged.

When you select a count number, percentage, policy name, or tag name, the Tagged Objects interface opens. The Tagged Objects
interface updates the filters automatically based on your selection in the Dashboard.

The monitoring information is an alternative or complement to running complex and query-intensive operations on multiple Account
Usage views.

These views might include, but are not limited to, the [COLUMNS](../../sql-reference/account-usage/columns.md),
[POLICY_REFERENCES](../../sql-reference/account-usage/policy_references.md), [TABLES](../../sql-reference/account-usage/tables.md),
[TAG_REFERENCES](../../sql-reference/account-usage/tag_references.md), and [VIEWS](../../sql-reference/account-usage/views.md) views.

### Tagged Objects

As a data administrator, you can use this table to associate the coverage and prevalence in the Dashboard to a list of specific
tables, view, or columns quickly. You can also filter the table results manually as follows.

* Choose Tables or Columns.
* For tags, you can filter with tags, without tags, or by a specific tag.
* For policies, you can filter with policies, without policies, or by a specific policy.

When you select a row in the table, the Table Details or Columns tab in Catalog » Database Explorer opens. You can edit
the tag and policy assignments as needed.

---
title: Monitor query activity with Query History
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-activity.md
section: User Guide
---

# Monitor query activity with Query History

To monitor query activity in your account, you can use:

* The Query History and Grouped Query History pages in Snowsight.
* The [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) and [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md) in the ACCOUNT_USAGE schema
  of the SNOWFLAKE database.
* The [QUERY_HISTORY](../sql-reference/functions/query_history.md) family of table functions in
  [INFORMATION_SCHEMA](../sql-reference/info-schema.md).

With the Query History page in Snowsight, you can do the following:

* Monitor individual or grouped queries that are executed by users in your account.
* View details about queries, including performance data. In some cases,
  query details are unavailable.
* Explore each step of an executed query in the query profile.

The Query History page lets you explore queries executed in your Snowflake account over the last 14 days.

Within a worksheet, you can see the query history for queries that have been run in that worksheet.
See [View query history](ui-snowsight-query.md).

## Review Query History in Snowsight

To access the Query History page in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History.
3. Go to Individual Queries or Grouped Queries.
   For more information about grouped queries, see Use the Grouped Query History view in Snowsight.
4. For Individual Queries, filter your view to see the most relevant and accurate results.

   If a Load More button appears at the top of the list, it means that there are more available results to load. You can fetch the next set
   of results by either selecting Load More or scrolling to the bottom of the list.

### Privileges required to view Query History

You can always view history for queries that you have run.

To view history for other queries, your active role affects what else you can see in Query History:

* If your active role is the ACCOUNTADMIN role, you can view all query history for the account.
* If your active role has the MONITOR or OPERATE privilege granted on a warehouse, you can view queries run by other users that
  use that warehouse.
* If your active role is granted the GOVERNANCE_VIEWER database role for the SNOWFLAKE database, it is sufficient for querying the ACCOUNT_USAGE
  views directly with SQL, and it also allows you to view Grouped Queries in the Query History. However, this role alone does not grant the ability
  to see Individual Queries by other users in Snowsight. To view all user queries (both grouped and individual), your role must be ACCOUNTADMIN
  or be granted IMPORTED PRIVILEGES on the SNOWFLAKE database. Alternatively, the following two privileges can replace the IMPORTED PRIVILEGES:

  > + [MONITOR](security-access-control-privileges.md) on all or specific users.
  > + [MONITOR](security-access-control-privileges.md) on all or specific warehouses.
* If your active role is granted the READER_USAGE_VIEWER database role for the SNOWFLAKE database, you can view the query history for all
  users in reader accounts associated with your account. See [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md).

### Considerations for using Query History

When reviewing the Query History for your account, consider the following:

* Details for queries executed more than seven days ago do not include User information due to the data retention policy for
  [sessions](session-policies.md). You can use the user filter to retrieve queries run by individual users.
  See Filter Query History.
* For queries that failed due to syntax or parsing errors, you see `<redacted>` instead of the SQL statement that was executed.
  If you are granted a role with appropriate privileges, you can set the [ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR](../sql-reference/parameters.md) parameter to view
  the full query text.
* Filters and the Started and End Time columns use your current time zone. You can’t change this setting.
  Setting the [TIMEZONE](../sql-reference/parameters.md) parameter for the session doesn’t change the time zone used.

### Filter Query History

You can filter the Query History list by the following:

* Status of the query, for example to identify long-running queries, failed queries, and queued queries.
* User who performed the query, including:

  + All, to see all users for which you have access to view query history.
  + The user you are signed in as (default)
  + Individual Snowflake users in your account, if your role can view query history for other users.
* Time period during which the query was run, up to 14 days.
* Other filters, including the following:

  + SQL Text, for example, to view queries that use specific statements, such as GROUP BY.
  + Query ID, to view details for a specific query.
  + Warehouse, to view queries that were run using a specific warehouse.
  + Statement Type, to view queries that used a specific type of statement, such as DELETE, UPDATE, INSERT, or SELECT.
  + Duration, for example, to identify especially long-running queries.
  + Session ID, to view queries run during a specific Snowflake session.
  + Query Tag, to view queries with a specific query tag set through the [QUERY_TAG](../sql-reference/parameters.md) session parameter.
  + Parameterized Query Hash, to display queries grouped according to the parameterized query hash ID specified in the filter. For more
    information, see [Using the Hash of the Parameterized Query (query_parameterized_hash)](query-hash.md).
  + Client generated statements, to view internal queries run by a client, driver, or library, including the web interface.
    For example, whenever a user navigates to the Warehouses page in Snowsight, Snowflake executes a SHOW WAREHOUSES
    statement in the background. That statement would be visible when this filter is enabled. Your account is not billed for
    client-generated statements.
  + Queries executed by user tasks, to view SQL statements executed or stored procedures called by user tasks.
  + Show replication refresh history, to view queries used to perform [replication](account-replication-intro.md)
    refresh tasks to remote regions and accounts.

If you want to see near-real-time results, enable Auto Refresh. When Auto Refresh is enabled, the table refreshes every ten seconds.

You can see the following columns in the Queries table by default:

* SQL Text, the text of the executed statement (always shown).
* Query ID, the ID of the query (always shown).
* Status, the status of the executed statement (always shown).
* User, to see the username that executed a statement.
* Warehouse, to see the warehouse used to execute a statement.
* Duration, to see the length of time it took to execute a statement.
* Started, to see the time a statement started running.

If you have more results, you cannot sort the table. If you select Load More at the top of the list after sorting the table, the new
results will be appended to the end of the data and the sort order will no longer apply.

To view more specific information, you can select Columns to add or remove columns from the table, such as:

* All to display all columns.
* User to display the user who ran the statement.
* Warehouse to display the name of the warehouse used to run the statement.
* Warehouse Size to display the size of the warehouse used to run the statement.
* Duration to display the time it took for the statement to run.
* Started to display the start time of the statement.
* End Time to display the end time of the statement.
* Session ID to display the ID of the session that executed the statement.
* Client Driver to display the name and version of the client, driver, or library used to execute the statement.
  Statements run in Snowsight display `Go 1.1.5`.
* Bytes Scanned to display the number of bytes scanned during the processing of the query.
* Rows to display the number of rows returned by a statement.
* Query Tag to display the query tag set for a query.
* Parameterized Query Hash to display queries grouped according to the parameterized query hash ID specified in the filter. For more
  information, see [Using the Hash of the Parameterized Query (query_parameterized_hash)](query-hash.md).
* Incident to display details for statements with an execution status of incident, used for troubleshooting or debugging purposes.

To view additional details about a query, select a query in the table to open the Query Details.

## Use the Grouped Query History view in Snowsight

You can use the Grouped Query History view in Snowsight to monitor usage
and performance of critical and frequently run queries. This graphical view is based on information
that is recorded in the [AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md). Executed queries
are grouped by a [parameterized query hash ID](query-hash.md). You can monitor key statistics
over time and drill down into individual queries that belong to each group.

Although this view includes all queries against Snowflake, it is particularly useful for
monitoring and analyzing [Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/)
that execute a small number of distinct statements repeatedly at high throughput. For workloads that
involve [hybrid tables](tables-hybrid.md), it is challenging to monitor performance by
looking at individual queries.

For example, your workload might consist of thousands of very similar point-lookup queries
and inserts that vary only by user ID, run extremely fast, and are repeated at a rate that
makes them impossible to analyze individually. An aggregated view of these operations is
essential when you want to answer questions like these:

* Which grouped queries (or *parameterized queries*) are consuming the most total time or resources in my account or workload?
* Has the performance of a parameterized query changed substantially over time?
* What sorts of issues is a parameterized query running into? Locking? Queueing? Long compilation
  times?
* How often does a parameterized query succeed or fail? Less than one percent of the time,
  or more often than that?

### How to use the Grouped Query History

To access the Grouped Query History in Snowsight, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History » Grouped Queries.
   The page shows you queries grouped by their common
   [parameterized query hash ID](query-hash.md).

   > **Note:**
   >
   > Individual queries do not immediately appear in the Grouped Queries list. Latency
   > for updating the list from the records in the AGGREGATE_QUERY_HISTORY view might be up to
   > 180 minutes (3 hours), but the list is often populated much faster.
3. Select any grouped query to see performance statistics for that parameterized query hash.
   Snowsight displays the total number of queries executed, the number of queries that failed,
   latency (p50, p90, p99), and executions per minute. The bottom section of the page
   shows some sample individual queries run as part of that hash; you can select each query to see
   its specific details.

   For example, during the last day, about 540 queries ran in this group, with a total duration of
   about 3 hours:

In the Parameterized query hash view, you can filter queries by status, such as `Failed` or
`Successful`. In both the Grouped Queries view and the Parameterized query hash
view, you can filter by a date range, by user, and by warehouse. The date range is limited to a maximum
of the last 14 days of grouped query history. You can’t retrieve information about queries that were
executed more than 14 days ago.

Note that some warehouses are internal warehouses managed by Snowflake, and those warehouses don’t
appear in the Warehouse filter. Similarly, the SYSTEM user does not appear in the User filter.

As an alternative to using filters, you can run aggregate queries that return filtered results. See also the
[AGGREGATE_QUERY_HISTORY view](../sql-reference/account-usage/aggregate_query_history.md).

### Individual Queries list

Alternatively, you can look at Individual Queries. This view doesn’t reflect all of the queries run by
Unistore workloads, given the sheer volume of queries that can be created against hybrid tables. For more
information about this behavior, see the hybrid tables section of the
[Usage notes](../sql-reference/account-usage/query_history.md). Snowflake recommends that all users with Unistore workloads
start by monitoring the Grouped Queries view.

### Privileges required to view Grouped Query History

Users can always view the history of queries that they’ve run. A user’s active role affects which other
queries are visible. You can view both Grouped Queries and Individual Queries if one of the following
is true:

* Your active role is ACCOUNTADMIN.
* Your active role has been granted IMPORTED PRIVILEGES on the SNOWFLAKE database (see
  [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md)).
* You have the GOVERNANCE_VIEWER [database role](../sql-reference/snowflake-db-roles.md).

If you don’t have any of these roles or privileges, you can only see
Individual Queries. For more information about access privileges, see
Monitor query activity with Query History.

## Review details and profile of a specific query

When you select a query in Query History, you can review details and the profile of the query.

### Query Profile data redacted from a Snowflake Native App

The Snowflake Native App Framework redacts information from the query profile in the
following contexts:

* Queries that are run when the app is installed or upgraded.
* Queries that originate from a stored procedure owned by the app.
* Queries containing a non-secure view or function owned by the app.

For each of these types of queries, Snowsight collapses the query profile data into a single
empty node instead of displaying the full query profile tree.

### Review Query Details

To review the details of a specific query, and view the results of a successful query, open the Query Details for a query.

You can review the Details for information about the query execution, including:

* The status of the query.
* When the query started, in the user’s local timezone.
* When the query ended, in the user’s local timezone.
* The size of the warehouse used to run the query.
* The duration of the query.
* The query ID.
* The query tag for the query, if one exists.
* The driver status. For more details, see [View the Snowflake client version](snowflake-client-version-check.md).
* The name and version of the client, driver, or library used to submit the query.
  For example, `Go 1.1.5` for queries run using Snowsight.
* The session ID.

You can see the warehouse used to run the query and the user who ran the query listed above the Query Details tab.

Review the SQL Text section for the actual text of the query. You can hover over the SQL text to open the statement in a worksheet
or copy the statement. If the query failed, you can review the error details.

The Results section displays the results of the query. You can only view the first 10,000 rows of results, and
only the user who ran the query can view the results. Select Export Results to export the full set of results as a CSV-formatted file.

### Troubleshoot why query details might be unavailable

If a query doesn’t have query details, some possible causes include the following:

* The query is still running. When the query finishes running, you can view the query details and profile.
* Your role does not have privileges to view the query details.
* The query was run more than 14 days ago and query details and profile are no longer available.
* The query failed to run and therefore has no query profile.
* While the Snowflake platform is designed to preserve job details, the depth of job query detail and Query Profile metrics is on a best-effort basis and is not guaranteed for all queries.

### Review Query Profile

The Query Profile tab lets you explore the query execution plan and understand granular details about each step of execution.

The query profile is a powerful tool for understanding the mechanics of queries. It can be used whenever you need to know more about the
performance or behavior of a particular query. It is designed to help you spot typical mistakes in SQL query expressions to identify
potential performance bottlenecks and improvement opportunities.

This section provides a brief overview of how to navigate and use the query profile.

* Query execution plan
* Operator node
* Query profile navigation
* Information panes

| Interface | Description |
| --- | --- |
| Query execution plan | The query execution plan appears at the center of the query profile.  The query execution plan is composed of operator nodes, which represent rowset operators.  Arrows between operator nodes indicate the rowsets that flow out of one operator and into another. |
| Operator node | Each operator node includes the following:  * The operator type and ID number. * The time used to execute this operator, represented as a percentage of the query duration. * A preview of the operator details. For example, the name of a table or a list of expressions. |
| Query profile navigation | In the upper-left corner of the query profile, use the buttons to:  * Move between execution steps. * Fit the query execution plan in the window. * Zoom in and out of the query execution plan.  **Note:** Steps only appear if the query was executed in steps. |
| Information panes | The query profile provides various information panes. The panes appear in the query execution plan. The panes that appear depend on the focus of the query execution plan.  The query profile includes the following information panes:  * Profile Overview * Query Insights * Statistics * Most Expensive Nodes * Attributes  To learn more about the information provided by the panes, see Query Profile reference. |

### Query History data redacted from a Snowflake Native App

For queries related to a Snowflake Native App, the `query_text` and `error_message` fields are redacted
from the query history in the following contexts:

* Queries run when the app is installed or upgraded.
* Queries that originate from a child job of a stored procedure owned by the app.

In each of these situations, the cell of the query history in Snowsight appears blank.

## Query Profile reference

This section describes all items that can appear in each information pane. The exact content of the information panes depends on the context
of the query execution plan.

### Profile overview

The pane provides information about which processing tasks consumed query time. Execution time provides information about “where the time
was spent” during the processing of a query. Time spent can be broken down into the following categories:

* Processing — time spent on data processing by the CPU.
* Local Disk IO — time when the processing was blocked by local disk access.
* Remote Disk IO — time when the processing was blocked by remote disk access.
* Network Communication — time when the processing was waiting for the network data transfer.
* Synchronization — various synchronization activities between participating processes.
* Initialization — time spent setting up the query processing.
* Hybrid Table Requests Throttling — time spent
  [throttling requests](tables-hybrid-limitations.md) to read and write
  data that is stored in hybrid tables.

### Query insights

If there are conditions that affect the performance of the query execution, this pane provides insights about those conditions.
Each insight includes a message that explains how query performance might be affected and provides a general recommendation for
next steps.

For information, see [Using query insights to improve performance](query-insights.md).

> **Note:**
>
> You can also access these insights by querying the [QUERY_INSIGHTS view](../sql-reference/account-usage/query_insights.md).

### Statistics

A major source of information provided in the detail pane is the various statistics, grouped in the following sections:

* IO — information about the input-output operations performed during the query:

  + *Scan progress* — the percentage of data scanned for a given table so far.
  + *Bytes scanned* — the number of bytes scanned so far.
  + *Percentage scanned from cache* — the percentage of data scanned from the local disk cache.
  + *Bytes written* — bytes written (e.g. when loading into a table).
  + *Bytes written to result* — bytes written to the result object. For example, `select * from . . .` would produce a set of results in tabular format representing each field in the selection.
    In general, the results object represents whatever is produced as a result of the query, and *Bytes written to result* represents the size of the returned result.
  + *Bytes read from result* — bytes read from the result object.
  + *External bytes scanned* — bytes read from an external object, e.g. a stage.
* DML — statistics for Data Manipulation Language (DML) queries:

  + *Number of rows inserted* — number of rows inserted into a table (or tables).
  + *Number of rows updated* — number of rows updated in a table.
  + *Number of rows deleted* — number of rows deleted from a table.
  + *Number of rows unloaded* — number of rows unloaded during data export.
* Pruning — information on the effects of table pruning:

  + *Partitions scanned* — number of partitions scanned so far.
  + *Partitions total* — total number of partitions in a given table.
* Spilling — information about disk usage for operations where intermediate results do not fit in memory:

  + *Bytes spilled to local storage* — volume of data spilled to local disk.
  + *Bytes spilled to remote storage* — volume of data spilled to remote disk.
* Network — network communication:

  + *Bytes sent over the network* — amount of data sent over the network.
* External Functions — information about calls to external functions:

  The following statistics are shown for each external function called by the SQL statement. If the same function was
  called more than once from the same SQL statement, then the statistics are aggregated.

  + *Total invocations* — number of times that an external function was called. (This can be different from the number of external
    function calls in the text of the SQL statement due to the number of batches that rows are divided into, the number of retries (if
    there are transient network problems), etc.)
  + *Rows sent* — number of rows sent to external functions.
  + *Rows received* — number of rows received back from external functions.
  + *Bytes sent (x-region)* — number of bytes sent to external functions. If the label includes “(x-region)”, the data was sent
    across regions (which can impact billing).
  + *Bytes received (x-region)* — number of bytes received from external functions. If the label includes “(x-region)”, the data was
    sent across regions (which can impact billing).
  + *Retries due to transient errors* — number of retries due to transient errors.
  + *Average latency per call* — average amount of time per invocation (call) between the time Snowflake sent the data and
    received the returned data.
  + *HTTP 4xx errors* — total number of HTTP requests that returned a 4xx status code.
  + *HTTP 5xx errors* — total number of HTTP requests that returned a 5xx status code.
  + *Latency per successful call (avg)* — average latency for successful HTTP requests.
  + *Avg throttle latency overhead* — average overhead per successful request due to a slowdown caused by throttling (HTTP 429).
  + *Batches retried due to throttling* — number of batches that were retried due to HTTP 429 errors.
  + *Latency per successful call (P50)* — 50th percentile latency for successful HTTP requests. 50 percent of all successful requests
    took less than this time to complete.
  + *Latency per successful call (P90)* — 90th percentile latency for successful HTTP requests. 90 percent of all successful requests
    took less than this time to complete.
  + *Latency per successful call (P95)* — 95th percentile latency for successful HTTP requests. 95 percent of all successful requests
    took less than this time to complete.
  + *Latency per successful call (P99)* — 99th percentile latency for successful HTTP requests. 99 percent of all successful requests
    took less than this time to complete.
* Extension Functions — information about calls to extension functions:

  + *Java UDF handler load time* — amount of time for the Java UDF handler to load.
  + *Total Java UDF handler invocations* — number of times the Java UDF handler is invoked.
  + *Max Java UDF handler execution time* — maximum amount of time for the Java UDF handler to execute.
  + *Avg Java UDF handler execution time* — average amount of time to execute the Java UDF handler.
  + *Java UDTF process() invocations* — number of times the Java UDTF [process method](../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked.
  + *Java UDTF process() execution time* — amount of time to execute the Java UDTF process.
  + *Avg Java UDTF process() execution time* — average amount of time to execute the Java UDTF process.
  + *Java UDTF’s constructor invocations* — number of times the Java UDTF [constructor](../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked.
  + *Java UDTF’s constructor execution time* — amount of time to execute the Java UDTF constructor.
  + *Avg Java UDTF’s constructor execution time* — average amount of time to execute the Java UDTF constructor.
  + *Java UDTF endPartition() invocations* — number of times the Java UDTF [endPartition method](../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked.
  + *Java UDTF endPartition() execution time* — amount of time to execute the Java UDTF endPartition method.
  + *Avg Java UDTF endPartition() execution time* — average amount of time to execute the Java UDTF endPartition method.
  + *Max Java UDF dependency download time* — maximum amount of time to download the Java UDF dependencies.
  + *Max JVM memory usage* — peak memory usage as reported by the JVM.
  + *Java UDF inline code compile time in ms* — compile time for the Java UDF inline code.
  + *Total Python UDF handler invocations* — number of times the Python UDF handler was invoked.
  + *Total Python UDF handler execution time* — total execution time for Python UDF handler.
  + *Avg Python UDF handler execution time* — average amount of time to execute the Python UDF handler.
  + *Python sandbox max memory usage* — peak memory usage by the Python sandbox environment.
  + *Avg Python env creation time: Download and install packages* — average amount of time to create the Python environment, including downloading and installing packages.
  + *Conda solver time* — amount of time to run the Conda solver to solve Python packages.
  + *Conda env creation time* — amount of time to create the Python environment.
  + *Python UDF initialization time* — amount of time to initialize the Python UDF.
  + *Number of external file bytes read for UDFs* — number of external file bytes read for UDFs.
  + *Number of external files accessed for UDFs* — number of external files accessed for UDFs.

  If the value of a field, for example “Retries due to transient errors”, is zero, then the field is not displayed.

### Most expensive nodes

The pane lists all nodes that lasted for 1% or longer of the total execution time of the query (or the execution time for the displayed
query step, if the query was executed in multiple processing steps). The pane lists nodes by execution time in descending order, enabling
users to quickly locate the costliest operator nodes in terms of execution time.

### Attributes

The following sections provide a list of the most common operator types and their attributes:

#### Data access and generation operators

TableScan:
:   Represents access to a single table. Attributes:

    * *Full table name* — the fully qualified name of the scanned table
    * *Table alias* — used table alias, if present
    * *Columns* — list of scanned columns
    * *Extracted variant paths* — list of paths extracted from VARIANT columns
    * *Scan mode* — ROW_BASED or COLUMN_BASED (shown only for [scans of hybrid tables](tables-hybrid-read-query-profiles.md))
    * *Access predicates* — conditions from the query that are applied during the table scan

IndexScan:
:   Represents access to [secondary indexes](tables-hybrid-index.md) on hybrid tables. Attributes:

    * *Full table name* — the fully qualified name of the scanned table that contains the index
    * *Columns* — list of scanned index columns
    * *Scan mode* — ROW_BASED or COLUMN_BASED
    * *Access predicates* — conditions from the query that are applied during the index scan
    * *Full index name* — the fully qualified name of the scanned index

ValuesClause:
:   List of values provided with the VALUES clause. Attributes:

    * *Number of values* — the number of produced values.
    * *Values* — the list of produced values.

Generator:
:   Generates records using the `TABLE(GENERATOR(...))` construct. Attributes:

    * *rowCount* — provided rowCount parameter.
    * *timeLimit* — provided timeLimit parameter.

ExternalScan:
:   Represents access to data stored in stage objects. Can be a part of queries that scan data from stages directly, but also for data loading
    operations (i.e. COPY statements).

    Attributes:

    * *Stage name* — the name of the stage where the data is read from.
    * *Stage type* — the type of the stage (e.g. TABLE STAGE).

InternalObject:
:   Represents access to an internal data object (e.g. an Information Schema table or the result of a previous query). Attributes:

    * *Object Name* — the name or type of the accessed object.

#### Data processing operators

Filter:
:   Represents an operation that filters the records. Attributes:

    * *Filter condition* - the condition used to perform filtering.

Join:
:   Combines two inputs on a given condition. Attributes:

    * *Join Type* — Type of join (e.g. INNER, LEFT OUTER, etc.).
    * *Equality Join Condition* — for joins which use equality-based conditions, it lists the expressions used for joining elements.
    * *Additional Join Condition* — some joins use conditions containing non-equality based predicates. They are listed here.

    > **Note:**
    >
    > Non-equality join predicates might result in significantly slower processing speeds and should be avoided if possible.

Aggregate:
:   Groups input and computes aggregate functions. Can represent SQL constructs such as GROUP BY, as well as SELECT DISTINCT. Attributes:

    * *Grouping Keys* — if GROUP BY is used, this lists the expressions we group by.
    * *Aggregate Functions* — list of functions computed for each aggregate group, e.g. SUM.

GroupingSets:
:   Represents constructs such as GROUPING SETS, ROLLUP and CUBE. Attributes:

    * *Grouping Key Sets* — list of grouping sets
    * *Aggregate Functions* — list of functions computed for each group, e.g. SUM.

WindowFunction:
:   Computes window functions. Attributes:

    * *Window Functions* — list of window functions computed.

Sort:
:   Orders input on a given expression. Attributes:

    * *Sort keys* — expression defining the sorting order.

SortWithLimit:
:   Produces a part of the input sequence after sorting, typically a result of an `ORDER BY ... LIMIT ... OFFSET ...` construct in SQL.

    Attributes:

    * *Sort keys* — expression defining the sorting order.
    * *Number of rows* — number of rows produced.
    * *Offset* — position in the ordered sequence from which produced tuples are emitted.

Flatten:
:   Processes VARIANT records, possibly flattening them on a specified path. Attributes:

    * *input* — the input expression used to flatten the data.

JoinFilter:
:   Special filtering operation that removes tuples that can be identified as not possibly matching the condition of a Join further in the
    query plan. Attributes:

    * *Original join ID* — the join used to identify tuples that can be filtered out.

UnionAll:
:   Concatenates two inputs. Attributes: none.

ExternalFunction:
:   Represents processing by an external function.

#### DML operators

Insert:
:   Adds records to a table either through an INSERT or COPY operation. Attributes:

    * *Input expressions* — which expressions are inserted.
    * *Table names* — names of tables that records are added to.

Delete:
:   Removes records from a table. Attributes:

    * *Table name* — the name of the table that records are deleted from.

Update:
:   Updates records in a table. Attributes:

    * *Table name* — the name of the updated table.

Merge:
:   Performs a MERGE operation on a table. Attributes:

    * *Full table name* — the name of the updated table.

Unload:
:   Represents a COPY operation that exports data from a table into a file in a stage. Attributes:

    * *Location* - the name of the stage where the data is saved.

#### Metadata operators

Some queries include steps that are pure metadata/catalog operations rather than data-processing operations. These steps consist of a
single operator. Some examples include:

DDL and Transaction Commands:
:   Used for creating or modifying objects, session, transactions, etc. Typically, these queries are not processed by a virtual warehouse and
    result in a single-step profile that corresponds
    to the matching SQL statement. For example:

    > CREATE DATABASE | SCHEMA | …
    >
    > ALTER DATABASE | SCHEMA | TABLE | SESSION | …
    >
    > DROP DATABASE | SCHEMA | TABLE | …
    >
    > COMMIT

Table Creation Command:
:   DDL command for creating a table. For example:

    > CREATE TABLE

    Similar to other DDL commands, these queries result in a single-step profile; however, they can also be part of a multi-step profile,
    such as when used in a CTAS statement. For example:

    > CREATE TABLE … AS SELECT …

Query Result Reuse:
:   A query that reuses the result of a previous query.

Metadata-based Result:
:   A query whose result is computed based purely on metadata, without accessing any data. These queries are not processed by a virtual
    warehouse. For example:

    > SELECT COUNT(\*) FROM …
    >
    > SELECT CURRENT_DATABASE()

#### Miscellaneous operators

Result:
:   Returns the query result. Attributes:

    * *List of expressions* - the expressions produced.

## Common query problems identified by Query Profile

This section describes some of the problems you can identify and troubleshoot using Query Profile.

### “Exploding” joins

One of the common mistakes SQL users make is joining tables without providing a join condition (resulting in a “Cartesian product”), or
providing a condition where records from one table match multiple records from another table. For such queries, the Join operator
produces significantly (often by orders of magnitude) more tuples than it consumes.

This can be observed by looking at the number of records produced by a Join operator, and typically is also reflected in Join
operator consuming a lot of time.

### UNION without ALL

In SQL, it is possible to combine two sets of data with either UNION or UNION ALL constructs. The difference between them is that UNION ALL
simply concatenates inputs, while UNION does the same, but also performs duplicate elimination.

A common mistake is to use UNION when the UNION ALL semantics are sufficient. These queries show in Query Profile as a UnionAll
operator with an extra Aggregate operator on top (which performs duplicate elimination).

### Queries too large to fit in memory

For some operations (e.g. duplicate elimination for a huge data set), the amount of memory available for the servers used to execute the
operation might not be sufficient to hold intermediate results. As a result, the query processing engine will start *spilling* the data to
local disk. If the local disk space is not sufficient, the spilled data is then saved to remote disks.

This spilling can have a profound effect on query performance (especially if remote disk is used for spilling). To alleviate this, we
recommend:

* Using a larger warehouse (effectively increasing the available memory/local disk space for the operation), and/or
* Processing data in smaller batches.

### Inefficient pruning

Snowflake collects rich statistics on data allowing it not to read unnecessary parts of a table based on the query filters. However, for
this to have an effect, the data storage order needs to be correlated with the query filter attributes.

The efficiency of pruning can be observed by comparing *Partitions scanned* and *Partitions total* statistics in the TableScan
operators. If the former is a small fraction of the latter, pruning is efficient. If not, the pruning did not have an effect.

Of course, pruning can only help for queries that actually filter out a significant amount of data. If the pruning statistics do not show
data reduction, but there is a Filter operator above TableScan which filters out a number of records, this might signal that a
different data organization might be beneficial for this query.

For more information about pruning, see [Understanding Snowflake Table Structures](tables-micro-partitions.md).

---
title: Monitor storage lifecycle policies
source: https://docs.snowflake.com/en/user-guide/storage-management/storage-lifecycle-policies-monitoring.md
section: User Guide
---

# Monitor storage lifecycle policies

Identify which tables have storage lifecycle policies
attached, and monitor storage lifecycle policy runs by using Snowflake’s built-in functions.

> **Note:**
>
> For information about monitoring storage lifecycle policy costs,
> see [Billing for storage lifecycle policies](storage-lifecycle-policies-billing.md).

## Monitor policy assignments

To view storage lifecycle policy metadata, use the following views:

* [ACCOUNT_USAGE.STORAGE_LIFECYCLE_POLICIES](../../sql-reference/account-usage/storage_lifecycle_policies.md)
* [ORGANIZATION_USAGE.STORAGE_LIFECYCLE_POLICIES](../../sql-reference/organization-usage/storage_lifecycle_policies.md)
* [ACCOUNT_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/account-usage/storage_lifecycle_policy_history.md)
* [ORGANIZATION_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/organization-usage/storage_lifecycle_policy_history.md)

## See lifecycle policy attachments

To see which tables a particular lifecycle policy is attached to, call the [POLICY_REFERENCES](../../sql-reference/functions/policy_references.md) table function in
the [Snowflake Information Schema](../../sql-reference/info-schema.md). The function displays only the tables that you have the OWNERSHIP privilege on.

The function returns a row for each table in a database that has the specified policy attached to it.

### Example: List all tables associated with a policy

The following query retrieves a list of tables with a specified storage lifecycle policy attached:

```sqlexample
SELECT *
  FROM TABLE(
    my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
    POLICY_NAME => 'my_storage_lifecycle_policy'
  )
);
```

### Example: Find the policy assigned to a table

Retrieve the policy assigned to a specified table:

```sqlexample
SELECT *
  FROM TABLE(
    my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
      REF_ENTITY_NAME => 'my_db.my_schema.my_table',
      REF_ENTITY_DOMAIN => 'table'))
  WHERE POLICY_KIND = 'STORAGE_LIFECYCLE_POLICY';
```

## Monitor storage lifecycle policy runs

To monitor storage lifecycle policy executions over the last 14 days, use the [STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/functions/storage_lifecycle_policy_history.md) table function. For information about the function output, see the [STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/functions/storage_lifecycle_policy_history.md) page.

The following example retrieves the 100 most recent executions for a policy attached to a specified table,
scheduled within the last day:

```sqlexample
SELECT * FROM
  TABLE(
    INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY(
      REF_ENTITY_NAME => 'my_db.my_schema.my_source_table',
      REF_ENTITY_DOMAIN => 'table',
      TIME_RANGE_START => DATEADD('DAY', -1, CURRENT_TIMESTAMP()),
      RESULT_LIMIT => 100
    )
  );
```

Alternatively, to retrieve historical data for storage lifecycle policy runs, use the following views:

* [ACCOUNT_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/account-usage/storage_lifecycle_policy_history.md)
* [ORGANIZATION_USAGE.STORAGE_LIFECYCLE_POLICY_HISTORY](../../sql-reference/organization-usage/storage_lifecycle_policy_history.md)

---
title: Monitor task runs
source: https://docs.snowflake.com/en/user-guide/tasks-monitor.md
section: User Guide
---

# Monitor task runs

## Monitor task errors

You can configure Snowflake to push notifications when errors occur in task runs. You can also query the event table to
determine if tasks failed to run. For more information, see the following sections:

* [Set up error notifications for tasks](tasks-errors.md)
* [Monitor events for task executions](tasks-events.md)

## See task owners

To see who ran a task that is currently being run, see [SHOW TASKS](../sql-reference/sql/show-tasks.md) or [DESCRIBE TASK](../sql-reference/sql/desc-task.md).

* Check the OWNER column to see the role of the task owner.
* To see if the task has been run on behalf of the task owner, check the EXECUTE_AS_USER column. By default, this shows as NULL, but when the task is run using impersonated privileges, the user name of the user who modified the task is displayed.

To see who ran a task, use the [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view.

* If the task is not run as an actual user, the QUERY EXECUTED BY TASK column displays the user name as “SYSTEM”.
* If the task is running on behalf of another user, the QUERY EXECUTED BY TASK column displays the user name that the task is running on behalf of.

---
title: Monitoring data quality checks in Snowsight
source: https://docs.snowflake.com/en/user-guide/data-quality-ui-monitor.md
section: User Guide
---

# Monitoring data quality checks in Snowsight

You can use a Snowsight page to monitor the quality of data in a table or view. It provides an
interactive view of the data metric functions (DMFs) that are associated with an object, including insights about the results of those DMFs.

To gain a better understanding of data quality and DMFs, see [Introduction to data quality checks](data-quality-intro.md).

## Get started

To start gaining insights into the data quality of an object, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the object.
3. Select the Data Quality tab.
4. Select Monitoring.
5. Do one of the following:

   * If you haven’t associated any DMFs before, select Set up, which opens a populated Worksheet that helps you get started with
     setting a schedule, creating custom DMFs, and associating a DMF with the object.
   * If you already have DMFs associated with the object, start exploring! You can only see a DMF if you have the appropriate
     [access control privileges](data-quality-access-control.md).

## Understanding which DMFs are running

The DMFs associated with the object are listed under Quality Dimensions.

DMFs are grouped as follows:

* System DMFs are grouped based on their [category](data-quality-system-dmfs.md). For example, the NULL_COUNT and BLANK_COUNT DMFs are grouped
  into the Accuracy category. When there is only one system DMF in a category (for example, the ROW_COUNT DMF in the Volume
  category), the name of the DMF is omitted.
* All [custom DMFs](data-quality-custom-dmfs.md) associated with the object are grouped under Custom.

For each DMF, there is a row for every association between the DMF and the object. Remember that as long as the column arguments are
different, the same DMF can be associated with the same object multiple times. If there are multiple rows, select a specific column row to
see the results of running the DMF with that column as an argument.

For example, suppose the NULL_COUNT DMF was associated with table `t1` using the following SQL statement:

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT
    ON (c1);
```

The row containing the column `c1` shows the results of running this DMF.

The Run Schedule widget specifies how often the DMFs are running. This corresponds to the value that was set for the
DATA_METRIC_SCHEDULE parameter of the object. For more information, see [Adjust the schedule for DMFs](data-quality-working.md).

## Investigate failed quality checks

A data quality check consists of a DMF association that has an *expectation*. An expectation lets you define criteria for whether data
passes a data quality check performed by a DMF. When the DMF returns a value, that value is compared to the expectation’s criteria to
determine whether the data passed or failed the check. For more information about using expectations to set up data quality checks,
see [Use SQL to work with expectations](data-quality-expectations.md).

You can use the following process to investigate failed quality checks.

Step 1: Were there any failed quality checks?
:   The number of failed quality checks for all DMFs associated with the object displays at the top of the Monitoring page.

Step 2: Which DMF category had a failed quality check?
:   Use the Checks by dimension widget to check the status of each group of DMFs on the Monitoring page. Red indicates that at
    least one DMF in the group failed a quality check.

Step 3: Which DMF association had a failed quality check?
:   If there was at least one failed quality check in the category, expand the widget for the category, and then scan the Quality Checks
    column to find the row where not all of the checks passed.

Step 4: What is the quality check?
:   To better understand the quality check that you’re investigating:

    1. Select the DMF association that failed the data quality check. A side panel opens.
    2. In the Quality Checks section, check the Status column to determine which quality check failed. This corresponds to the
       [expectation](data-quality-expectations.md) that was violated.
    3. For each failed quality check, use the Expression column to determine the value that the quality check expected the DMF to return.
       This corresponds to the [expression of the expectation](data-quality-expectations.md).

Step 5: What assets are impacted by the quality issue?
:   With the side panel open, find the Impacted Assets section so you can determine what other objects might be affected by the
    quality issue. For information about interpreting the list of objects, see Impacted Assets section.

Step 6: Which records violated the quality check? ([Select system DMFs only](data-quality-fixing.md))
:   1. With the side panel open, select View Failed Records.
    2. Execute the prepopulated query to see the records that failed the quality check. This query calls the SYSTEM$DATA_METRIC_SCAN
       function.

       For information about using the SYSTEM$DATA_METRIC_SCAN function to remediate the data quality issues, see
       [Using SYSTEM$DATA_METRIC_SCAN to fix data](data-quality-fixing.md).

## Drill down into DMF results

Each row under Quality Dimensions shows the most current results of the DMF and a seven day trend of results. To drill down into these
results, select a row to open a side panel. The following describes the elements of this side panel.

View Lineage button
:   Select a DMF to view the [lineage](ui-snowsight-lineage.md) of the object associated with that DMF.

View failed records button ([Select system DMFs only](data-quality-fixing.md))
:   If the DMF returned a value greater than 0, you can determine which records were flagged as having quality issues. For example, if the
    NULL_COUNT DMF returned `5`, then you can determine which five records contain a NULL value.

    Selecting View failed records opens a worksheet that is prepopulated with a query that calls the SYSTEM$DATA_METRIC_SCAN function.
    Execute this query to return the records that were included in the result of the DMF.

    For more information about using the SYSTEM$DATA_METRIC_SCAN function, see [Remediation of data quality issues](data-quality-fixing.md).

Arguments section (Multi-argument DMFs only)
:   If a custom DMF takes multiple columns as arguments, these columns are listed. You can select a column to navigate to the Columns
    tab of the object that contains the column.

Quality Checks section
:   Lists the [expectations](data-quality-expectations.md) that were added to the association between the DMF and the
    object. Each expectation implements a data quality check. This section contains the following columns:

    * Name — Name of the expectation.
    * Expression — Expression of the expectation. For more information, see [Defining what meets the expectation](data-quality-expectations.md).
    * Status — Indicates whether the expectation was violated the last time the DMF ran.

Impacted Assets section
:   Displays the objects that are [downstream](ui-snowsight-lineage.md) in the lineage of the object with which the DMF is
    associated. If there is a data quality issue, you can determine what other objects are possibly affected. The contents of the section
    depends on whether the DMF accepts a single argument (like system DMFs) or whether it accepts multiple arguments.

    * If the DMF accepts one column as an argument, Snowflake checks whether the downstream object contains data from that column. For
      example, suppose the NULL_COUNT DMF identifies NULL values in the `name` column of table `t1`. A downstream view built from `t1`
      only appears in the list of impacted assets if it contains data from the `name` column.
    * If the DMF accepts multiple columns, all downstream objects appear, even if data from the columns doesn’t exist in the downstream object.

Run History section
:   Graphically displays the result of the DMF over time so you can determine trends.

---
title: Monitoring replication and failover
source: https://docs.snowflake.com/en/user-guide/account-replication-monitor.md
section: User Guide
---

# Monitoring replication and failover

This topic provides information on how to monitor account replication progress, history, and costs.

## Use Snowsight to monitor replication

To monitor the replication progress and status for [replication and failover groups](account-replication-intro.md) in an
organization, use the Replication page in Snowsight.

You can view the status and details of refresh operations, including:

* Current status of the most recent refresh operation.
* Replica lag time (time since the last refresh operation).
* Distribution of replica lag times across groups.
* Date and time of the next scheduled refresh operation.

> **Note:**
>
> * Snowsight lists the replication and failover groups for which your role has the MONITOR, OWNERSHIP, or REPLICATE privilege on.
> * Refresh operation details are only available to users with the ACCOUNTADMIN role or the OWNERSHIP privilege on the group.
> * You must be signed in to the source or target account to view refresh operation details. If you are not, you will be prompted to sign in.
>
>   Both the source account and the target account must use the same connection type (public internet). Otherwise, signing in to
>   the target account fails.
> * Currently, if your account uses private connectivity, you can’t use Snowsight to create or modify groups or connection
>   objects. However, you can use Snowsight to monitor groups that were created using SQL.

To view the replication status of each replication or failover group, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication and then select Groups.

The Groups page displays refresh operation details for all the groups for which your
role has a privilege to view. You can use the tiles to filter the view.

* For example, if the Status tile indicates there are failed refresh operations, you can select the tile to investigate the group(s)
  with failures.
* The lag time in the Longest Replication lag tile refers to the duration of time since the last refresh operation. This is the
  length of time that the secondary replication or failover group *lags* behind the primary group. The longest lag time is the
  length of time since the oldest secondary replication group was last refreshed.

  For example, if you have three failover groups, `fg_1`, `fg_2`, `fg_3`, with independent replication schedules of
  10 minutes, 2 hours, and 12 hours respectively, the longest lag time could be as long as 12 hours. If `fg_3`, however, was
  recently refreshed in the target account, its lag time resets to 0 and a different failover group could have a longer lag time.
* You can select an individual bar in the Group Lag Distribution tile to filter the results to an individual group.

You can also filter groups by using the search field or the dropdown menus:

* You can search by replication or failover group name using the  (search) box.
* Choose Type to filter the results by replication or failover group.
* Choose Replicating to filter by primary (select To) or secondary groups (select From).
* Choose the  (accounts) menu to filter the results by account name.
* Choose Status to filter results by refresh operation status:

  + Refresh Cancelled
  + Refresh Failed
  + Refresh In Progress
  + Refresh Successful

You can see the following details about your replication and failover groups:

| Column | Description |
| --- | --- |
| Name | Name of the replication or failover group. |
| Is Replicating | Indicates if the group is being replicated *to* a target account or *from* a source account.  If this column contains *destinations available*, there are no secondary replication or failover groups. The number of destinations available indicates the number of target accounts the primary group can be replicated to. |
| Status | Displays the status of the latest refresh operation.  You must be signed in to the source or target account in order to access replication details. If you are not signed in, select Sign in to view refresh operation status for the secondary group.  Both the source account and the target account must use the same connection type (public internet). Otherwise, signing in to the target account fails. |
| Replication Lag | The length of time since the last refresh operation. This is the length of time that the secondary replication group “lags” behind the primary replication group. |
| Next Refresh | The date and time of the next scheduled refresh operation. |

You can select a replication or failover group to view detailed information about each refresh operation. For more information, see
the section on replication history in Snowsight.

## Monitor the progress of refresh operations

This section provides information on how to monitor replication progress for a specific replication or failover group using either
Snowsight or SQL.

### Use Snowsight to monitor the progress of refresh operations

You can view the status of a refresh operation in progress and the details of historical refresh operations using Snowsight.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, select Groups.
4. Select the name of a replication or failover group.

> **Tip:**
>
> If your account uses private connectivity, you can still use Snowsight to monitor groups.
> Although creating or modifying groups or connection objects through Snowsight isn’t currently available
> with private connectivity, Snowsight can monitor the groups that you create using SQL.

For more information about the detailed view, see the section on replication history in Snowsight.

### Use SQL to monitor the progress of refresh operations

To monitor the progress of a replication or failover group refresh, query the
[REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL](../sql-reference/functions/replication_group_refresh_progress.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

#### Example

View the progress of the most recent refresh operation for the failover group `myfg`:

```sqlexample
SELECT phase_name, start_time, end_time, progress, details
  FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS('myfg'));
```

## View replication history

You can view replication history using Snowsight or using SQL.

> **Note:**
>
> You can view the replication history for the replication and failover groups for which your role has the MONITOR, OWNERSHIP, or
> REPLICATE privilege on.

### Use Snowsight to view replication history

You can view the replication history and details for each refresh operation for a specific replication or failover group in the details
page for the group.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, select Groups.
4. Select the name of a replication or failover group.

You can review the following information about the group:

* Group type (replication group or failover group).
* Replication schedule (for example, every 10 minutes).
* Duration of each refresh operation.
* Replica lag time (length of time since last refresh operation).
* Date and time of the next scheduled refresh operation.

> **Tip:**
>
> If your account uses private connectivity, you can still use Snowsight to monitor groups.
> Although creating or modifying groups or connection objects through Snowsight isn’t currently available
> with private connectivity, Snowsight can monitor the groups that you create using SQL.

You can filter the data on the page by status and time period:

* Choose Status to filter results by refresh operation status:

  + Refresh Cancelled
  + Refresh Failed
  + Refresh In Progress
  + Refresh Successful
* Choose Duration to show refresh operation details for:

  + Last hour
  + Last 24 hours
  + Last 7 days
  + All

  Selecting All displays the last 14 days of refresh operations.

The details for each refresh operation include the following columns:

| Column | Description |
| --- | --- |
| Query ID | Query ID of the refresh operation. |
| Status | Displays the status of the refresh operation. Valid values include `Successful`, `Failed`, `In Progress`. |
| Ended | Date and time the refresh operation ended. |
| Duration | The length of time the refresh operation took to complete.  The duration period is broken down and color coded by [replication phase](../sql-reference/functions/replication_group_refresh_progress.md). The width of each colored segment indicates the portion of the time spent in that phase.  The image below is for reference only. This graph is available when you select the refresh operation for additional details. |
| Transferred | The number of bytes replicated. |
| Objects | The number of objects replicated. |

Select a row to view additional details about a specific refresh operation including:

* Duration of each replication phase.
* Error message (for failed refresh operations).
* List of database objects replicated by type and number.
* Number of databases replicated and database names.

### Use SQL to view replication history

To view the replication history of a specific replication or failover group within a specified date range, query one of the following:

* [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../sql-reference/functions/replication_group_refresh_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
* [REPLICATION_GROUP_REFRESH_HISTORY view](../sql-reference/account-usage/replication_group_refresh_history.md) (in [Account Usage](../sql-reference/account-usage.md)).

#### Examples

Query the Information Schema REPLICATION_GROUP_REFRESH_HISTORY table function to view the account replication history of failover
group `myfg` in the last 7 days:

```sqlexample
SELECT PHASE_NAME, START_TIME, END_TIME, TOTAL_BYTES, OBJECT_COUNT
  FROM TABLE(information_schema.replication_group_refresh_history('myfg'))
  WHERE START_TIME >= CURRENT_DATE() - INTERVAL '7 days';
```

Query the Account Usage REPLICATION_GROUP_REFRESH_HISTORY view to view the account replication history in the current month:

```sqlexample
SELECT REPLICATION_GROUP_NAME, PHASE_NAME, START_TIME, END_TIME, TOTAL_BYTES, OBJECT_COUNT
  FROM snowflake.account_usage.replication_group_refresh_history
  WHERE START_TIME >= DATE_TRUNC('month', CURRENT_DATE());
```

## Monitor replication costs

To monitor credit usage for replication, query one of the following:

* [REPLICATION_GROUP_USAGE_HISTORY view](../sql-reference/account-usage/replication_group_usage_history.md) (in [Account Usage](../sql-reference/account-usage.md)).
* [REPLICATION_GROUP_USAGE_HISTORY](../sql-reference/functions/replication_group_usage_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

### Examples

Query the REPLICATION_GROUP_USAGE_HISTORY table function to view credits used for account replication in the last 7 days:

```sqlexample
SELECT start_time, end_time, replication_group_name, credits_used, bytes_transferred
  FROM TABLE(information_schema.replication_group_usage_history(date_range_start=>DATEADD('day', -7, CURRENT_DATE())));
```

Query the Account Usage REPLICATION_GROUP_USAGE_HISTORY view to view the credits used by replication or failover group for account
replication history in the current month:

```sqlexample
SELECT start_time,
  end_time,
  replication_group_name,
  credits_used,
  bytes_transferred
FROM snowflake.account_usage.replication_group_usage_history
WHERE start_time >= DATE_TRUNC('month', CURRENT_DATE());
```

## Monitor replication costs for databases

The cost for replication for an individual database included in a replication or failover group can be calculated by retrieving the
number of copied bytes for the database and associating it with the credits used.

### Examples

#### Query Account Usage views

The following examples calculate the costs for database replication in one replication group for the past 30 days.

1. Query the REPLICATION_GROUP_REFRESH_HISTORY Account Usage view and calculate the sum of the number of bytes replicated per database.

   For example, to calculate the sum of the number of bytes replicated for databases in the replication group `myrg` in the last
   30 days:

   ```sqlexample
   SELECT SUM(value:totalBytesToReplicate) as sum_database_bytes
     FROM snowflake.account_usage.replication_group_refresh_history rh,
       LATERAL FLATTEN(input => rh.total_bytes:databases)
     WHERE rh.replication_group_name = 'MYRG' AND
           rh.start_time >= CURRENT_DATE() - INTERVAL '30 days';
   ```

   Note the output of the sum of database bytes:

   ```output
   +--------------------+
   | SUM_DATABASE_BYTES |
   |--------------------|
   |              22016 |
   +--------------------+
   ```
2. Query the REPLICATION_GROUP_USAGE_HISTORY Account Usage view and calculate the sum of the number of credits used and the sum
   of the bytes transferred for replication.

   For example, to calculate the sum of the number of credits used and the sum of the bytes transferred for replication of the
   replication group `myrg` in the last 30 days:

   ```sqlexample
   SELECT SUM(credits_used) AS credits_used, SUM(bytes_transferred) AS bytes_transferred
     FROM snowflake.account_usage.replication_group_usage_history
     WHERE replication_group_name = 'MYRG' AND
           start_time >= CURRENT_DATE() - INTERVAL '30 days';
   ```

   Note the output of the sum of the credits used and the sum of bytes transferred:

   ```output
   +--------------+-------------------+
   | CREDITS_USED | BYTES_TRANSFERRED |
   |--------------+-------------------|
   |  1.357923604 |             22013 |
   +--------------+-------------------+
   ```
3. Calculate the replication costs for databases using the values of the bytes transferred for databases, sum of the credits used, and
   the sum of all bytes transferred for replication from the previous two steps:

   > `(<database_bytes_transferred> / <bytes_transferred>) * <credits_used>`

   For example:

   > `(22016 / 22013) * 1.357923604 = 1.35810866)`

#### Query Information Schema table functions

For refresh operations within the past 14 days, query the associated Information Schema table functions.

* [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../sql-reference/functions/replication_group_refresh_history.md)
* [REPLICATION_GROUP_USAGE_HISTORY](../sql-reference/functions/replication_group_usage_history.md)

1. Query the REPLICATION_GROUP_REFRESH_HISTORY table function to view the sum of the number of bytes copied for database replication
   for the replication group `myrg`:

   ```sqlexample
   SELECT SUM(value:totalBytesToReplicate)
     FROM TABLE(information_schema.replication_group_refresh_history('myrg')) AS rh,
     LATERAL FLATTEN(input => total_bytes:databases)
     WHERE rh.phase_name = 'COMPLETED' AND
           rh.start_time >= CURRENT_DATE() - INTERVAL '14 days';
   ```
2. Query the REPLICATION_GROUP_USAGE_HISTORY table function to view sum of the number of credits used and the sum of the bytes
   transferred for replication for the replication group `myrg`:

   ```sqlexample
   SELECT SUM(credits_used), SUM(bytes_transferred)
     FROM TABLE(information_schema.replication_group_usage_history(
       date_range_start => DATEADD('day', -14, CURRENT_DATE()),
       replication_group_name => 'myrg'));
   ```

---
title: Monitoring search optimization using Snowsight
source: https://docs.snowflake.com/en/user-guide/search-optimization/monitoring-search-optimization.md
section: User Guide
---

# Monitoring search optimization using Snowsight

After you enable the search optimization service, you can use Snowsight to monitor statistics about how queries
use it. You can also use Snowsight to determine why a query isn’t using the search optimization service.

## Monitoring search optimization usage for a query

When a query uses the search optimization service, the [query profile](../ui-snowsight-activity.md) includes
the following:

* Search Optimization Access node - A dedicated Search Optimization Access node is in the query plan.
  Select this node to access the table scan information, as well as information that is specific to search optimization.
* Attributes pane - This pane for the node contains the following:

  + Full table name - Identifies the table that was scanned for the query that used search optimization.
  + Search optimization usage information - This section lists the expression IDs that search optimization referenced during
    query execution. Each expression ID corresponds to a search method and column target defined for the table. Execute the
    following query to show the expression IDs and their corresponding methods and targets:

    ```sqlexample
    DESCRIBE SEARCH OPTIMIZATION ON <table_name>;
    ```

    For more information about this command, see [DESCRIBE SEARCH OPTIMIZATION](../../sql-reference/sql/desc-search-optimization.md).
* Statistics pane - This pane for the node contains the following metrics:

  + Bytes scanned - The total amount of data that was read during the execution of a table scan operation.
  + Partitions scanned - The number of micro-partitions that were actually scanned.
  + Partitions total - The total number of the micro-partitions for the table.
  + Partitions pruned by search optimization - The number of micro-partitions that search optimization effectively
    eliminated from the corresponding table scan.

The following image shows an example of the metrics on the Statistics pane:

## Determining the reason why search optimization wasn’t used

Even when search optimization is configured for a table, it might not always be used. If search optimization wasn’t used for a query,
examine the Table Scan node’s Search Optimization Usage Info section on the Attributes pane. The section
shows one of the following explanations:

* When there is a predicate mismatch, the following message is shown:

  ```output
  Search optimization service was not used because no
  match was found between used predicates and the
  search access paths added for the table.
  ```

  This message indicates that the predicate used in the query on this table isn’t compatible with the search methods defined for the table.
  You can review the optimization configuration for the table by executing the following command:

  ```sqlexample
  DESCRIBE SEARCH OPTIMIZATION ON <table_name>;
  ```

  For information about the predicates and data types supported by search optimization, see [Identifying queries that can benefit from search optimization](queries-that-benefit.md).
* When there is a cost-based decision not to use search optimization, the following message is shown:

  ```output
  The query optimizer estimated that the search optimization
  service would not be beneficial for this table scan.
  ```

  This message indicates that the predicates used in the query are compatible with the search methods defined for the table, but the query
  optimizer decided that query performance likely wouldn’t be improved by search optimization. Subsequent queries with different predicates or
  different data in the source table might use search optimization.
* When the predicate limit is exceeded, the following message is shown:

  ```output
  Search optimization service was not used because the
  predicate limit was exceeded.
  ```

  This message indicates that the predicate contains too many distinct predicates. The exact count of search optimization predicates depends on
  the types of the predicates and might not match exactly the number of predicates in the query.
  [Substring queries](substring-queries.md) and
  [full-text search queries](text-queries.md) that use the wildcard syntax are more likely to reach the
  predicate limit.

The following image shows an example of a predicate mismatch message:

---
title: Monitoring the Kafka connector using Java Management Extensions (JMX)
source: https://docs.snowflake.com/en/user-guide/kafka-connector-monitor.md
section: User Guide
---

# Monitoring the Kafka connector using Java Management Extensions (JMX)

This topic describes how to use Java Management Extensions (JMX) to monitor the Snowflake Connector for
Kafka. Kafka Connect provides pre-configured JMX metrics that provides information about the Kafka connector.
The Snowflake Connector for Kafka provides multiple Managed Beans (MBeans) that you can use to ingest metrics
about the Kafka environment. You can load this information into 3rd-party tools, including Prometheus and
Grafana.

The JMX feature is enabled in the connector by default. To disable JMX, set the `jmx` property to `false`.

> **Important:**
>
> [Snowpipe](data-load-snowpipe-intro.md) supports the Kafka connector version 1.6.0 and later.
>
> [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) supports the Kafka connector version 2.1.2 and later.

## Configuring JMX in the Kafka connector

JMX is enabled by default in the Snowflake Kafka connector. To enable JMX in Kafka, perform the following:

1. Enable JMX to connect to your Kafka installation:

   * To make JMX connections to a Kafka installation running on a remote server, set the `KAFKA_JMX_OPTS` environment variable in your Kafka Connect startup script:

     ```bash
     export KAFKA_JMX_OPTS="-Dcom.sun.management.jmxremote=true
         -Dcom.sun.management.jmxremote.authenticate=false
         -Dcom.sun.management.jmxremote.ssl=false
         -Djava.rmi.server.hostname=<ip_address>
         -Dcom.sun.management.jmxremote.port=<jmx_port>"
     ```

     Where:

     + `ip_address`: specifies the IP address of your Kafka Connect installation.
     + `jmx_port`: specifies the JMX port where Kafka Connect listens for JMX connections.
   * To make JMX connections to Kafka running on the same server, set the `JMX_PORT` environment variable in your Kafka startup script:

     ```bash
     export JMX_PORT=<port_number>
     ```

     Where `port_number` is the JMX port of your Kafka installation.
2. Restart the Kafka connector.

## Using the Snowflake Kafka connector managed beans (MBeans)

JMX uses MBeans to represent objects within Kafka that it can monitor (e.g. thread count, cpu load, etc.). The Snowflake Kafka connector provides MBeans for accessing objects managed by the connector. You can use these MBeans to create monitoring dashboards.

The general format of the Kafka Connector MBean object name is:

`snowflake.kafka.connector:connector=connector_name,pipe=pipe_name,category=category_name,name=metric_name`

Where:

* `connector=connector_name` specifies the name of the connector defined in the Kafka configuration file.
* `pipe=pipe_name` specifies the Snowpipe object used to ingest data. The Kafka connector defines Snowpipe objects for each partition.
* `category=category_name` specifies the category of the MBean. Each category contains a set of metrics.
* `name=metric_name` specifies the name of the metric.

The following sections list the names of the categories and metrics provided by the Snowflake Kafka connector.

### Category: `file-counts`

This category of metrics only applies to Snowpipe-based Kafka connector and does not apply to Snowpipe Streaming.

| Metric Name | Data Type | Description |
| --- | --- | --- |
| `file-count-on-stage` | long | The number of files currently on an internal stage. This value is decremented after the process of purging the files has started. This property provides an estimate of how many files are currently on an internal stage. |
| `file-count-on-ingestion` | long | The number of files in Snowpipe determined by calling the `insertFiles` REST API. There is currently a limitation of 5k for files that are being sent via a single REST API request. There is not a one to one relation between the number of files and the number of REST API calls. The number of calls to the `insertFiles` REST API can be larger than this value. The value of this property is `0` if there are no more files to be ingested. |
| `file-count-table-stage-ingestion-fail` | long | The number of files on the table stage that failed ingestion. |
| `file-count-table-stage-broken-record` | long | The number of files present on the table stage that corresponds to a broken offset. |
| `file-count-purged` | long | The number of files purged from the internal stage after the ingestion status was determined. |

### Category: `offsets`

The `offsetPersistedInSnowflake` and `latestConsumerOffset` metrics apply to Snowpipe Streaming-based Kafka connector. The rest of this category only applies to Snowpipe-based Kafka connector.

| Metric Name | Data Type | Description |
| --- | --- | --- |
| `processed-offset` | long | An offset referring to the most recent record sent to the in-memory buffer. |
| `flushed-offset` | long | An offset referring to a record that is being flushed on an internal stage after the buffer threshold was reached. The buffer can reach its threshold by time, number of records, or size. |
| `committed-offset` | long | An offset referring to a record that has had the precommit API called and has called the Snowpipe `insertFiles` REST API called. |
| `purged-offset` | long | An offset referring to a record that is being purged from the internal stage. This number is the value of the highest recent offset that was purged from the internal stage. |
| `offsetPersistedInSnowflake` | long | An offset that refers to a record that has the latest persisted data in Snowflake. The offset is determined by the `insertRows` API call. |
| `latestConsumerOffset` | long | An offset that refers to the most recent record sent to the in-memory buffer. It is only used to resend the offset when the channel offset token is `NULL`. |

### Category: `buffer`

This category of metrics is only available to Snowpipe-based Kafka connector.

| Metric Name | Data Type | Description |
| --- | --- | --- |
| `buffer-size-bytes` | long | Based on buffer thresholds, returns the buffer size (in bytes) before it is flushed to an internal stage. This value may not be same as the file size since files are compressed when being loaded to an internal stage. |
| `buffer-record-count` | long | Based on buffer thresholds, returns the number of Kafka records buffered into memory before the buffer is flushed to an internal stage. |

### Category: `latencies`

This category of metrics is only available to Snowpipe-based Kafka connector.

| Metric Name | Data Type | Description |
| --- | --- | --- |
| `kafka-lag` | long | The difference (in seconds) between the time the record is put into Kafka and the time the record is fetched into Kafka Connect. Note that this value can be null if the value was not set inside a record. |
| `commit-lag` | long | The difference (in seconds) between the time the file is uploaded to an internal stage and the time the `insertFiles` REST API is called. |
| `ingestion-lag` | long | The difference (in seconds) between the time a file is uploaded to an internal stage and the time the file ingestion status is reported through the `insertReport` or `loadHistoryScan` API. |

---
title: Monitoring warehouse load
source: https://docs.snowflake.com/en/user-guide/warehouses-load-monitoring.md
section: User Guide
---

# Monitoring warehouse load

The web interface provides a *query load* chart that depicts concurrent queries processed by a warehouse over a two-week period. Warehouse
query load measures the average number of queries that were running or queued within a specific interval.

You can customize the time period and time interval during which to evaluate warehouse performance by querying the Account Usage
[QUERY_HISTORY view](../sql-reference/account-usage/query_history.md).

## Viewing the load monitoring chart

> **Note:**
>
> To view the load monitoring chart, you must be using a role that has the MONITOR privilege on the warehouse.

To view the chart:

> In the navigation menu, select Compute » Warehouses » *<warehouse_name>*
>
> > The Warehouse Activity tile appears with a bar chart and lets you select a window of time to view in
> > the chart. By default, the chart displays the past two weeks in 1-day intervals.
> >
> > You can select a range from 1 hour (minimum) to 2 weeks (maximum). The chart displays the total query load in intervals of 1 minute
> > to 1 day, depending on the range you selected.

### Understanding the bar chart

Hover over a bar to view the average number of queries processed by the warehouse during the time period represented. The bar displays the
individual load for each query status that occurred within the interval:

| Query Status | Description |
| --- | --- |
| Running | Queries that were actively running during the interval. Note that they may have started running before and continued running after the interval. |
| Queued (Provisioning) | Queries that were waiting while the warehouse provisioned compute resources. Typically only occurs in the first few minutes after a warehouse resumes. |
| Blocked | Queries that were blocked during the interval due to a transaction lock. |
| Queued | Queries that were waiting to run due to warehouse overload (i.e. waiting for other queries to finish running and free compute resources). |

## How query load is calculated

Query load is calculated by dividing the execution time (in seconds) of all queries in an interval by the total time (in seconds) for the interval.

For example, the following table illustrates how query load is calculated based on 5 queries that contributed to the warehouse load during a 5-minute interval. The load from running queries was .92 and queued queries (due to warehouse overload) was .08.

| Query | Status | Execution Time / Interval (in Seconds) | Query Load |
| --- | --- | --- | --- |
| Query 1 | Running | 30 / 300 | 0.10 |
| Query 2 | Running | 201 / 300 | 0.67 |
| Query 3 | Running | 15 / 300 | 0.05 |
| Query 4 | Running | 30 / 300 | 0.10 |
|  |  | **Running Load** | **0.92** |
| Query 5 | Queued | 24 / 300 | 0.08 |
|  |  | **Queued Load** | **0.08** |
|  |  | **TOTAL WAREHOUSE LOAD** | **1.00** |

To determine the actual number of running queries (and the duration of each query) during a specific interval, consult the
History  page. On the page, filter the query history by warehouse, then scroll down to the interval you specified in
the load monitoring chart.

## Using the load monitoring chart to make decisions

The load monitoring chart can help you make decisions for managing your warehouses by showing current and historic usage patterns.

### Slow query performance

When you notice that a query is running slowly, check whether an overloaded warehouse is causing the query to compete for resources or get queued:

* If the running query load is high or there’s queuing, consider starting a separate warehouse and moving queued queries to that warehouse.
  Alternatively, if you are using [multi-cluster warehouses](warehouses-multicluster.md), you could change your multi-cluster
  settings to add additional clusters to handle higher concurrency going forward.

* If the running query load is low and query performance is slow, you could resize the warehouse to provide more compute resources. You would
  need to restart the query once all the new resources were fully provisioned to take advantage of the added resources.

### Peak query performance

Analyze the daily workload on the warehouse over the previous two weeks. If you see recurring usage spikes, consider moving some of the peak
workload to its own warehouse and potentially running the remaining workload on a smaller warehouse. Alternatively, you could change your
multi-cluster settings to add additional clusters to handle higher concurrency going forward.

If you notice that your current workload is considerably higher than normal, open the History  page to investigate which
queries are contributing to the higher load.

### Excessive credit usage

Analyze the daily workload on the warehouse over the previous two weeks. If the chart shows recurring time periods when the warehouse was
running and consuming credits, but the total query load was less than **1** for substantial periods of time, the warehouse use is inefficient.
You might consider any of the following actions:

* Decrease the warehouse size. Note that decreasing the warehouse size generally increases the query execution time.
* For a multi-cluster warehouse, decrease the **MIN_CLUSTER_COUNT** parameter value.

## Using Account Usage QUERY_HISTORY view to evaluate warehouse performance

You can query the QUERY_HISTORY view to calculate virtual warehouse performance metrics such as throughput and latency for specific
statement types. For more information, see [Examples: Warehouse performance](../sql-reference/account-usage.md).

---
title: Multi-cluster warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses-multicluster.md
section: User Guide
---

# Multi-cluster warehouses

Multi-cluster warehouses enable you to scale compute resources to manage your user and query concurrency needs as they change, such as during
peak and off hours.

## What is a multi-cluster warehouse?

By default, a virtual warehouse consists of a single cluster of compute resources available to the
warehouse for executing queries. As queries are submitted to a warehouse, the warehouse allocates resources to each query and begins
executing the queries. If sufficient resources are not available to execute all the queries submitted to the warehouse, Snowflake queues the
additional queries until the necessary resources become available.

With multi-cluster warehouses, Snowflake supports allocating, either statically or dynamically, additional clusters to make a larger pool
of compute resources available. A multi-cluster warehouse is defined by specifying the following properties:

* Maximum number of clusters, greater than 1. The highest value you can specify depends on the warehouse size.
  For the upper limit on the number of clusters for each warehouse size,
  see Upper limit on number of clusters for a multi-cluster warehouse (in this topic).
* Minimum number of clusters, equal to or less than the maximum.

Additionally, multi-cluster warehouses support all the same properties and actions as single-cluster warehouses, including:

* Specifying a warehouse size.
* Resizing a warehouse at any time.
* Auto-suspending a running warehouse due to inactivity; note that this does not apply to individual clusters, but rather the entire
  multi-cluster warehouse.
* Auto-resuming a suspended warehouse when new queries are submitted.

### Upper limit on number of clusters for a multi-cluster warehouse

The maximum number of clusters for a multi-cluster warehouse depends on the warehouse size. Larger warehouse sizes have lower limits on the number of clusters. By default, all warehouses are limited to a maximum of ten clusters. You can override that setting to allow more clusters, depending on your warehouse size. The following table shows the maximum number of clusters for each warehouse size:

| Warehouse size | Allowed maximum cluster count |
| --- | --- |
| XSMALL | 300 |
| SMALL | 300 |
| MEDIUM | 300 |
| LARGE | 160 |
| XLARGE | 80 |
| 2XLARGE | 40 |
| 3XLARGE | 20 |
| 4XLARGE | 10 |
| 5XLARGE | 10 |
| 6XLARGE | 10 |

### Maximized vs. auto-scale

You can choose to run a multi-cluster warehouse in either of the following modes:

Maximized:
:   This mode is enabled by specifying the same value for both maximum and minimum number of clusters (note that the
    specified value must be larger than 1). In this mode, when the warehouse is started, Snowflake starts all the clusters so
    that maximum resources are available while the warehouse is running.

    This mode is effective for statically controlling the available compute resources, particularly if you have large numbers of concurrent
    user sessions and/or queries and the numbers do not fluctuate significantly.

Auto-scale:
:   This mode is enabled by specifying different values for maximum and minimum number of clusters. In this mode,
    Snowflake starts and stops clusters as needed to dynamically manage the load on the warehouse:

    * As the number of concurrent user sessions and/or queries for the warehouse increases, and queries start to queue due to
      insufficient resources, Snowflake automatically starts additional clusters, up to the maximum number defined for the warehouse.
    * Similarly, as the load on the warehouse decreases, Snowflake automatically shuts down clusters to reduce the number of
      running clusters and, correspondingly, the number of credits used by the warehouse.

    To help control the usage of credits in Auto-scale mode, Snowflake provides a property, SCALING_POLICY, that determines the scaling policy
    to use when automatically starting or shutting down additional clusters. For more information, see Setting the scaling policy for a multi-cluster warehouse (in
    this topic).

To create a multi-cluster warehouse, see Creating a multi-cluster warehouse (in this topic).

* For auto-scale mode, the maximum number of clusters must be *greater* than the minimum number of clusters.
* For maximized mode, the maximum number of clusters must be *equal* to the minimum number of clusters.

> **Tip:**
>
> When determining the maximum and minimum number of clusters to use for a multi-cluster warehouse, start with Auto-scale mode and start
> small (for example, maximum = 2 or 3, minimum = 1). As you track how your warehouse load fluctuates over time, you can increase the maximum and
> minimum number of clusters until you determine the numbers that best support the upper and lower boundaries of your user/query concurrency.

### Multi-cluster size and credit usage

The amount of compute resources in each cluster is determined by the warehouse size:

* The total number of clusters for the multi-cluster warehouse is calculated by multiplying the warehouse size by the maximum number of
  clusters. This also indicates the maximum number of credits consumed by the warehouse per full hour of usage (i.e. if
  all clusters run during the hour).

  For example, the maximum number of credits consumed per hour for a Medium-size multi-cluster warehouse with 3 clusters is 12 credits.
* If a multi-cluster warehouse is resized, the new size applies to all the clusters for the warehouse, including
  clusters that are currently running and any clusters that are started after the multi-cluster warehouse is resized.

The actual number of credits consumed per hour depends on the number of clusters running during each hour that the warehouse
is running. For more details, see Examples of multi-cluster credit usage (in this topic).

> **Tip:**
>
> If you use Query Acceleration Service (QAS) for a multi-cluster warehouse, consider adjusting the QAS scale
> factor higher than for a single-cluster warehouse. That helps to apply the QAS optimizations across all the
> clusters of the warehouse.
> For more information, see [Adjusting the scale factor](query-acceleration-service.md).

### Benefits of multi-cluster warehouses

With a standard, single-cluster warehouse, if your user/query load increases to the point where you need more compute resources:

1. You must either increase the size of the warehouse or start additional warehouses and explicitly redirect the additional users/queries to
   these warehouses.
2. Then, when the resources are no longer needed, to conserve credits, you must manually downsize the larger warehouse or suspend the additional
   warehouses.

In contrast, a multi-cluster warehouse enables larger numbers of users to connect to the same size warehouse. In addition:

* In Auto-scale mode, a multi-cluster warehouse eliminates the need for resizing the warehouse or starting and stopping additional
  warehouses to handle fluctuating workloads. Snowflake automatically starts and stops additional clusters as needed.
* In Maximized mode, you can control the capacity of the multi-cluster warehouse by increasing or decreasing the number of clusters as
  needed.

> **Tip:**
>
> Multi-cluster warehouses are best utilized for scaling resources to improve concurrency for users/queries. They are not as beneficial for
> improving the performance of slow-running queries or data loading. For these types of operations, resizing the warehouse provides
> more benefits.

## Examples of multi-cluster credit usage

The following four examples illustrate credit usage for a multi-cluster warehouse. Refer to [Virtual warehouse credit usage](cost-understanding-compute.md) for
the number of credits billed per full hour by warehouse size.

> **Note:**
>
> For the sake of simplicity, all these examples depict credit usage in increments of 1 hour, 30 minutes, and 15 minutes. In a real-world
> scenario, with per-second billing, the actual credit usage would contain fractional amounts, based on the number of seconds that each
> cluster runs.

### Example 1: Maximized (2 Hours)

In this example, a Medium-size Standard warehouse with 3 clusters runs in Maximized mode for 2 hours:

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  | Cluster 1 | Cluster 2 | Cluster 3 | **Total Credits** |
| 1st Hour | 4 | 4 | 4 | **12** |
| 2nd Hour | 4 | 4 | 4 | **12** |
| **Total Credits** | **8** | **8** | **8** | **24** |

### Example 2: Auto-scale (2 Hours)

In this example, a Medium-size Standard warehouse with 3 clusters runs in Auto-scale mode for 2 hours:

* Cluster 1 runs continuously.
* Cluster 2 runs continuously for the 2nd hour only.
* Cluster 3 runs for 30 minutes during the 2nd hour.

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  | Cluster 1 | Cluster 2 | Cluster 3 | **Total Credits** |
| 1st Hour | 4 | 0 | 0 | **4** |
| 2nd Hour | 4 | 4 | 2 | **10** |
| **Total Credits** | **8** | **4** | **2** | **14** |

### Example 3: Auto-scale (3 Hours)

In this example, a Medium-size Standard warehouse with 3 clusters runs in Auto-scale mode for 3 hours:

* Cluster 1 runs continuously.
* Cluster 2 runs continuously for the entire 2nd hour and 30 minutes in the 3rd hour.
* Cluster 3 runs for 30 minutes in the 3rd hour.

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  | Cluster 1 | Cluster 2 | Cluster 3 | **Total Credits** |
| 1st Hour | 4 | 0 | 0 | **4** |
| 2nd Hour | 4 | 4 | 0 | **8** |
| 3rd Hour | 4 | 2 | 2 | **8** |
| **Total Credits** | **12** | **6** | **2** | **20** |

### Example 4: Auto-scale (3 Hours) with resize

In this example, the same warehouse from example 3 runs in Auto-scale mode for 3 hours with a resize from Medium to Large:

* Cluster 1 runs continuously.
* Cluster 2 runs continuously for the 2nd and 3rd hours.
* Warehouse is resized from Medium to Large at 1:30 hours.
* Cluster 3 runs for 15 minutes in the 3rd hour.

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
|  | Cluster 1 | Cluster 2 | Cluster 3 | **Total Credits** |
| 1st Hour | 4 | 0 | 0 | **4** |
| 2nd Hour | 4+2 | 4+2 | 0 | **12** |
| 3rd Hour | 8 | 8 | 2 | **18** |
| **Total Credits** | **18** | **14** | **2** | **34** |

## Creating a multi-cluster warehouse

You can create a multi-cluster warehouse in [Snowsight](ui-snowsight-gs.md) or by using SQL:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses » + Warehouse
>
>     1. Expand Advanced Options.
>     2. Select the Multi-cluster Warehouse checkbox.
>     3. In the Max Clusters field, select a value greater than 1.
>
>        > **Note:**
>        >
>        > Currently, the highest value you can choose in Snowsight is 10.
>        > The maximum sizes shown in Upper limit on number of clusters for a multi-cluster warehouse
>        > apply to the CREATE WAREHOUSE and ALTER WAREHOUSE commands in SQL only.
>     4. In the Min Clusters field, optionally select a value greater than 1.
>     5. Enter other information for the warehouse, as needed, and click Create Warehouse.
>
> SQL:
> :   Execute a [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) command with:
>
>     * `MAX_CLUSTER_COUNT` set to a value greater than `1`. For the highest value
>       you can specify depending on the warehouse size, see
>       Upper limit on number of clusters for a multi-cluster warehouse (in this topic).
>     * `MIN_CLUSTER_COUNT` (optionally) set to a value greater than `1`.

To view information about the multi-cluster warehouses you create:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses.
>
>     The Clusters column displays the minimum and maximum clusters for each warehouse, as well as the number of
>     clusters that are currently running if the warehouse is started. You can sort by the Clusters column in
>     descending order to list the multi-cluster warehouses at the top.
>
> SQL:
> :   Execute a [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) command.
>
>     The output includes three columns (`min_cluster_count`, `max_cluster_count`, `started_clusters`)
>     that display the same information provided in the Clusters column in the web interface.
>
>     > **Tip:**
>     >
>     > If the SHOW WAREHOUSES output is difficult to read because it includes so many columns, you can
>     > use the [pipe operator](../sql-reference/operators-flow.md) (`->>`) to show just the columns you want,
>     > along with any other clauses for filtering and sorting. Use a query that is similar to the following example,
>     > and adjust it to suit your needs. The column names are quoted because they’re case-sensitive in
>     > the SHOW WAREHOUSES output:
>     >
>     > ```sqlexample
>     > SHOW WAREHOUSES
>     >   ->> SELECT "name", "state", "size", "max_cluster_count", "started_clusters", "type"
>     >         FROM $1
>     >         WHERE "state" IN ('STARTED','SUSPENDED')
>     >         ORDER BY "type" DESC, "name";
>     > ```

All other tasks for multi-cluster warehouses (except for the remaining tasks described in this topic) are identical to single-cluster
[warehouse tasks](warehouses-tasks.md).

## Setting the scaling policy for a multi-cluster warehouse

To help control the credits consumed by a multi-cluster warehouse running in Auto-scale mode, Snowflake provides scaling policies.
Snowflake uses the scaling policies to determine how to adjust the capacity of your multi-cluster warehouse
by starting or shutting down individual clusters while the warehouse is running. You can specify a scaling policy
to make Snowflake prioritize responsiveness and throughput for the queries in that warehouse, or to minimize costs
for that warehouse.

The scaling policy for a multi-cluster warehouse only applies if it is running in Auto-scale mode.
In Maximized mode, all clusters run concurrently, so there is no need to start or shut down individual clusters.

Snowflake supports the following scaling policies:

| Policy | Description | A new cluster starts… | An idle or lightly loaded cluster shuts down… |
| --- | --- | --- | --- |
| Standard (default) | Prevents/minimizes queuing by favoring starting additional clusters over conserving credits. | When a query is queued, or if Snowflake estimates the currently running clusters don’t have enough resources to handle any additional queries, Snowflake increases the number of clusters in the warehouse.  For warehouses with a MAX_CLUSTER_COUNT of 10 or less, Snowflake starts one additional cluster.  For warehouses with a MAX_CLUSTER_COUNT greater than 10, Snowflake starts multiple clusters at once to accommodate rapid increases in workload. | After a sustained period of low load, Snowflake shuts down one or more of the least-loaded clusters when the queries running on them finish. When the cluster count is higher than 10, Snowflake might shut down multiple clusters at a time. When the cluster count is 10 or less, Snowflake shuts down the idle clusters one at a time. |
| Economy | Conserves credits by favoring keeping running clusters fully-loaded rather than starting additional clusters, which may result in queries being queued and taking longer to complete. | Only if the system estimates there’s enough query load to keep the cluster busy for at least 6 minutes. | Snowflake marks the least-loaded cluster for shutdown if it estimates the cluster has less than 6 minutes of work left to do. Snowflake shuts down the cluster after finishing any queries that are running on that cluster. When the cluster count is higher than 10, Snowflake might shut down multiple clusters at a time. When the cluster count is 10 or less, Snowflake shuts down the idle clusters one at a time. |

> **Note:**
>
> Interactive warehouses support standard scaling policy only. When you use auto-scale mode with interactive warehouses, the scaling is more proactive than it is with standard warehouses.
> This is because interactive warehouses are designed to be more responsive to user queries, and scaling out in advance of anticipated queuing is important for maintaining performance.

> **Note:**
>
> A third scaling policy, Legacy, was formerly provided for backward compatibility. Legacy has been removed.
> All warehouses that were using the Legacy policy now use the default Standard policy.

You can set the scaling policy for a multi-cluster warehouse when it is created or at any time afterwards,
either in Snowsight or using SQL:

> Snowsight:
> :   When you select Multi-cluster Warehouse under Advanced Options in the New Warehouse dialog,
>     you can select the scaling policy from the Scaling Policy drop-down list.
>
>     For an existing multi-cluster warehouse, in the navigation menu, select Compute » Warehouses. Then select Edit
>     under the More menu (…).
>
>     In the Scaling Policy field, select the desired value from the drop-down list.
>
>     > **Tip:**
>     >
>     > You only see the Scaling Policy drop-down list when the warehouse you selected is a multi-cluster warehouse,
>     > and the maximum clusters value is higher than the minimum clusters value.
>
> SQL:
> :   Execute a [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) or [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command with `SCALING_POLICY`
>     set to the desired value.

For example, in SQL:

> ```sqlexample
> CREATE WAREHOUSE mywh WITH MAX_CLUSTER_COUNT = 2, SCALING_POLICY = 'STANDARD';
> ALTER WAREHOUSE mywh SET SCALING_POLICY = 'ECONOMY';
> ```

## Increasing or decreasing clusters for a multi-cluster warehouse

You can increase or decrease the maximum and minimum number of clusters for a warehouse at any time,
even while it is running and executing statements. You can adjust the maximum and minimum clusters
for a warehouse in Snowsight or using SQL:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses.
>
>     Click on the warehouse name to view its properties and historical activity.
>     Select Edit from More menu (…).
>     You can also deselect the Multi-cluster Warehouse checkbox to reset the maximum and minimum
>     cluster settings to 1, changing the warehouse to a single-cluster one.
>
> SQL:
> :   Execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.

> **Note:**
>
> Currently, Snowsight supports updating MAX_CLUSTER_COUNT to a maximum of 10 clusters.
> To increase MAX_CLUSTER_COUNT beyond 10, use the ALTER WAREHOUSE command in SQL.

The effect of changing the maximum and minimum clusters for a running warehouse depends on whether it is running in
Maximized or Auto-scale mode:

* Maximized:

  ↑ max & min:
  :   Specified number of clusters start immediately.

  ↓ max & min:
  :   Specified number of clusters shut down when they finish executing statements and the auto-suspend period elapses.
* Auto-scale:

  ↑ max:
  :   If `new_max_clusters > running_clusters`, no changes until additional clusters are needed.

  ↓ max:
  :   If `new_max_clusters < running_clusters`, excess clusters shut down when they finish executing statements and the
      scaling policy conditions are met.

  ↑ min:
  :   If `new_min_clusters > running_clusters`, additional clusters immediately started to meet the minimum.

  ↓ min:
  :   If `new_min_clusters < running_clusters`, excess clusters shut down when they finish executing statements and the
      scaling policy conditions are met.

## Monitoring multi-cluster warehouses

You can monitor usage of multi-cluster warehouses through the web interface:

1. In the navigation menu, select Compute » Warehouses.
2. Select a warehouse name.

   > That way, you can monitor one warehouse in precise detail, such as viewing queries that are currently
   > running or queued.
   >
   > Alternatively, in the navigation menu, select Monitoring » Query History.
   > This page lets you view activity across multiple warehouses in your account.
   > To see the activity only for one warehouse, select Warehouse under the
   > Filters drop-down menu. Then choose a warehouse name from the list.

When you monitor a multi-cluster warehouse, you can see all the queries the warehouse processed.
For each query, you can see details such as how long it took, how many bytes
it scanned, and how many rows it returned. You can also see the cluster used to execute
each statement that the warehouse processed. To choose which details to view, select
the items such as Cluster Number, Duration, Rows, and so on
under the Columns drop-down menu.

---
title: Multi-factor authentication (MFA)
source: https://docs.snowflake.com/en/user-guide/security-mfa.md
section: User Guide
---

# Multi-factor authentication (MFA)

Multi-factor authentication (MFA) reduces the security risks associated with password authentication. When a password user is enrolled in
MFA, they must use a second factor of authentication when signing in to Snowflake. These users enter their password, and then use the second
factor. For information about how a user adds an MFA method that they can use as a second factor of authentication, see
[Configuring a second factor of authentication](security-mfa-second-factor.md).

MFA is intended for *human users* who authenticate with a password. *Service users* must use another form of authentication. For more
information about these user types, see [Types of users](admin-user-management.md).

> **Important:**
>
> To improve the security posture of all of its customers, Snowflake is rolling out changes to require MFA for all password sign-ins. For
> information about this rollout, see [Planning for the deprecation of single-factor password sign-ins](security-mfa-rollout.md).

## Requiring users to enroll in MFA

Currently, strategies for implementing MFA for your organization vary depending on whether or not an account existed when the
[2024_08 behavior change bundle](../release-notes/bcr-bundles/2024_08_bundle.md) was enabled:

* If an account existed before the 2024_08 bundle was enabled, then you must configure your account if you want to require all human users to
  use MFA. For information about implementing MFA to require all human users to enroll in MFA, see
  [Hardening user or account authentication using MFA](authentication-policies.md).
* If the account was created after the 2024_08 bundle was enabled, then all human users who authenticate with a password must enroll in MFA
  by default. This MFA requirement does not apply to service users.

  If you want to disable the requirement that all human users enroll in MFA, create a custom authentication policy with
  `MFA_ENROLLMENT=OPTIONAL`, and then set the authentication policy on the account. Password users who use Snowsight must still
  use MFA, but MFA isn’t required for other interfaces. For more information about creating and setting authentication policies, see
  [Authentication policies](authentication-policies.md).

  Be aware that the ability to opt out of mandatory MFA for human users is temporary; see [Planning for the deprecation of single-factor password sign-ins](security-mfa-rollout.md).

## Requiring MFA for single sign-on authentication

By default, Snowflake doesn’t require MFA for users who authenticate with single sign-on (SSO). Snowflake relies on the identity provider
(IdP) to enforce MFA or some other strong authentication method. If you want to harden authentication for SSO users, you use an
[authentication policy](authentication-policies.md) to require SSO users to use Snowflake MFA after authenticating with the
IdP.

The following authentication policy requires SSO users to enroll and use Snowflake MFA:

```sqlexample
CREATE AUTHENTICATION POLICY ACCOUNTADMIN_DOUBLE_MFA
  AUTHENTICATION_METHODS = ('PASSWORD', 'SAML')
  SECURITY_INTEGRATIONS = ('<SAML SECURITY INTEGRATIONS>')
  MFA_ENROLLMENT = 'REQUIRED'
  MFA_POLICY=(ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION='ALL');
```

## Restricting which MFA methods are available

When a user is enrolled in MFA, they are required to use an MFA method as a second factor of authentication. Snowflake allows the following
MFA methods:

* Authenticating with a passkey that can be stored and accessed in a variety of ways.
* Authenticating with an authenticator app that generates a time-based one-time passcode (TOTP).
* Authenticating with Duo.

> **Tip:**
>
> As you decide which MFA methods to allow, keep in mind the following:
>
> * Passkeys are recommended due to their security and usability.
> * Duo is not replicated like the other MFA methods.

As an administrator, you can use an [authentication policy](authentication-policies.md) to control which MFA methods can be
used as a second factor of authentication. For example, the following authentication policy allows users to use a passkey or authenticator
app as their second factor of authentication, but not Duo:

```sqlexample
CREATE AUTHENTICATION POLICY mfa_policy
  MFA_ENROLLMENT = REQUIRED
  MFA_POLICY = (ALLOWED_METHODS = ('PASSKEY', 'TOTP'));
```

If a user previously configured an MFA method that is now prohibited, the next time they sign in they’ll be prompted to authenticate
using the pre-existing method, then prompted to configure a new, allowed method.

For more information about the MFA_POLICY parameter, see [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md).

## Removing a user’s MFA methods

You can remove an MFA method that a user previously added so that they can no longer use it as their second factor of authentication.

1. Execute the [SHOW MFA METHODS](../sql-reference/sql/show-mfa-methods.md) command and find the value in the `name` column. For example, if you are
   removing an MFA method for a user `joe`, execute the following and copy the `name` of the MFA method from the output:

   ```sqlexample
   SHOW MFA METHODS FOR USER joe;
   ```

   ```output
   +---------------+-----------------+------------------------+-------------------------------+---------------------------------+---------------------+
   |   name        |      type       |    comment             |     last_used                 |        created_on               |  additional_info    |
   +---------------+-----------------+------------------------+-------------------------------+---------------------------------+---------------------+
   | TOTP-48A7     |    TOTP         | Authenticator App 48A7 | 2025-02-26 11:14:38.000 -0800 |  2025-02-26 11:13:19.000 -0800  | null                |
   +---------------+-----------------+------------------------+-------------------------------+---------------------------------+---------------------+
   ```
2. Execute an [ALTER USER … REMOVE MFA METHOD](../sql-reference/sql/alter-user.md) statement to remove the MFA method:

   ```sqlexample
   ALTER USER joe REMOVE MFA METHOD TOTP-48A7;
   ```

## Recovering a user who is locked out

If a password user is locked out of Snowflake because they don’t have access to a second factor of authentication, an administrator can help
them recover the ability to sign in by temporarily disabling MFA or by helping the user
set up a new MFA method.

### Prompt user to add a new MFA method

If a user loses access to the MFA method that they use as their second factor of authentication (for example, by losing the YubiKey that
stores their passkey), an administrator can help the user set up a new MFA method so that they can sign in to Snowflake.

When a user does not have access to their MFA method and needs to set up a new one, the administrator executes an
[ALTER USER … ENROLL MFA](../sql-reference/sql/alter-user.md) statement. For example, if user `joe` needs to establish a new MFA
method, the administrator can execute the following:

```sqlexample
ALTER USER joe ENROLL MFA;
```

* If the user has a [verified email](ui-snowsight-profile.md), Snowflake sends an email prompting them to add an MFA
  authentication method.
* If the user doesn’t have a verified email, Snowflake returns the URL of a page that prompts the user to add an MFA authentication method.
  Administrators can send this URL to the locked-out user.

### Temporarily disable MFA

If an administrator needs to temporarily disable MFA for a user, they can execute an
[ALTER USER … SET MINS_TO_BYPASS_MFA](../sql-reference/sql/alter-user.md) statement. For example, to temporarily disable MFA so that user
`joe` can authenticate with a single-factor password for 30 minutes, execute the following:

```sqlexample
ALTER USER joe SET MINS_TO_BYPASS_MFA = 30;
```

## Setting up administrators for break glass access

Break glass refers to the ability to log in using alternative authentication methods not typically available in the account. Administrators
need break glass access to Snowflake if regular authentication methods become unavailable; for example, if an organization’s identity
provider has an outage.

Organizations can provide break glass access by creating a dedicated Snowflake user, and then storing the user’s password credential in a
key vault. An administrator can generate one or more one-time passcodes (OTPs) that can be stored in the vault with the user’s password. To
access Snowflake, an administrator can retrieve the password and OTP from the vault, and then sign in. Using OTPs creates an additional
layer of protection and satisfies Snowflake multi-factor authentication requirements.

> **Important:**
>
> After an OTP is used to authenticate, it is invalidated and can’t be used to authenticate again.
>
> If there aren’t additional OTPs available and the user doesn’t have another MFA method available, the user might be locked out when their
> session expires. Always ensure a backup MFA method is available for the user to prevent accidental lockouts. For information about
> recovering a user who is locked out, see Recovering a user who is locked out.

### Generating one-time passcodes

To generate one or more OTPs for a user, run an [ALTER USER … ADD MFA METHOD OTP](../sql-reference/sql/alter-user.md) command. The
optional COUNT keyword determines how many OTPs are generated. For example, to generate 5 OTPs for the user `breakglass_user`, run the
following command:

```sqlexample
ALTER USER breakglass_user ADD MFA METHOD OTP COUNT = 5;
```

After the codes are generated, you can use them as your second factor of authentication when authenticating to Snowflake.

### Invalidating one-time passcodes

You have the following options if you want to invalidate a one-time passcode (OTP) so it can’t be used to authenticate.

**Invalidate all existing OTPs for a user**

* Use the ALTER USER … ADD MFA METHOD OTP command to generate new OTPs. Previously generated OTPs are invalidated.

**Invalidate a specific OTP for the current user**

* Use Snowsight to invalidate an OTP by taking the following steps:

  1. In the left-hand navigation, select your name.
  2. In the user menu, select Settings.
  3. Select Authentication.
  4. In the Multi-factor authentication section, find the OTP, and then select the More icon.
  5. Select Unenroll, and then confirm that you want to delete the OTP.

**Invalidate a specific OTP for a different user**

* Use the ALTER USER … REMOVE MFA METHOD command to invalidate a specific OTP for a different user. If you want to invalidate an OTP for
  yourself, use Snowsight. For example, to invalidate the `OTP_2` passcode for user `joe`, run the following command:

  ```sqlexample
  ALTER USER joe REMOVE MFA METHOD OTP_2;
  ```

### Replicating authentication for break glass users

You can’t replicate OTPs from a source account to a target account when you replicate the break glass user. This is to prevent a scenario where an OTP could be used twice, once in the source account and again in the target account. You have two options to implement authentication for the break glass user in the target account:

* Sign in to the target account and generate OTPs for the user in that account.
* Replace the use of OTPs with a time-based one-time passcode (TOTP) or passkey, which can be replicated.

## Connecting to Snowflake with MFA

MFA login is designed primarily for connecting to Snowflake through the web interface, but is also fully-supported by Snowflake CLI, SnowSQL, and the
Snowflake JDBC, Node.js, and ODBC drivers.

> **Note:**
>
> MFA configurations using landline or phone callbacks do not support connecting with drivers, such as ODBC and JDBC.

### Using MFA token caching to minimize the number of prompts during authentication — *optional*

MFA token caching can help to reduce the number of prompts that must be acknowledged while connecting and authenticating to Snowflake,
especially when multiple connection attempts are made within a relatively short time interval.

A cached MFA token is valid for up to four hours.

The cached MFA token is invalid if any of the following conditions are met:

1. The [ALLOW_CLIENT_MFA_CACHING](../sql-reference/parameters.md) parameter is set to FALSE for the account.
2. The method of authentication changes.
3. The authentication credentials change (i.e. username and/or password).
4. The authentication credentials are not valid.
5. The cached token expires or is not cryptographically valid.
6. The account name associated with the cached token changes.

The overall process Snowflake uses to cache MFA tokens is similar to that used to cache connection tokens for browser-based federated
[single sign-on](admin-security-fed-auth-use.md). The client application stores the MFA token in the keystore of the
client-side operating system. Users can delete the cached MFA token from the keystore at any time.

Snowflake supports MFA token caching with the following drivers, connectors, and tools:

* .NET driver version 4.3.0 (or later)
* ODBC driver version 2.23.0 (or later).
* JDBC driver version 3.12.16 (or later).
* Python Connector for Snowflake version 2.3.7 (or later).
* Snowflake CLI version 3.0 (or later)

Snowflake recommends consulting with internal security and compliance officers prior to enabling MFA token caching.

> **Tip:**
>
> MFA token caching can be combined with connection caching in federated [single sign-on](admin-security-fed-auth-use.md).
>
> To combine these two features, ensure that the [ALLOW_ID_TOKEN](../sql-reference/parameters.md) parameter is set to `true` in tandem with the ALLOW_CLIENT_MFA_CACHING parameter.

To enable MFA token caching, complete the following steps:

1. As an account administrator (i.e. a user with the ACCOUNTADMIN system role), set the ALLOW_CLIENT_MFA_CACHING parameter to `true`
   for an account using the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command.

   ```sqlexample
   ALTER ACCOUNT SET ALLOW_CLIENT_MFA_CACHING = TRUE;
   ```
2. In the client connection string, update the authenticator value to `authenticator = username_password_mfa`.
3. Add the package or libraries needed by the driver or connector:

   * If you are using the Snowflake Connector for Python, install the optional keyring package by running:

     > ```bash
     > pip install "snowflake-connector-python[secure-local-storage]"
     > ```
     >
     > You must enter the square brackets (`[` and `]`) as shown in the command. The square brackets specify the [extra part of the package](https://www.python.org/dev/peps/pep-0508/#extras) that should be installed.
     >
     > Use quotes around the name of the package as shown to prevent the square brackets from being interpreted as a wildcard.
     >
     > If you need to install other extras (for example, `pandas` for [using the Python Connector APIs for Pandas](../developer-guide/python-connector/python-connector-pandas.md)), use a comma between the extras:
     >
     > ```bash
     > pip install "snowflake-connector-python[secure-local-storage,pandas]"
     > ```
   * For the Snowflake JDBC Driver, see [Add the JNA classes to your classpath](../developer-guide/jdbc/jdbc-download.md).

To disable MFA token caching, unset the ALLOW_CLIENT_MFA_CACHING parameter:

```sqlexample
ALTER ACCOUNT UNSET ALLOW_CLIENT_MFA_CACHING;
```

To find all users who use MFA token caching as the second-factor authentication to log in, you can execute the following SQL statement as an
account administrator (a user with the ACCOUNTADMIN role):

```sqlexample
SELECT EVENT_TIMESTAMP,
       USER_NAME,
       IS_SUCCESS
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY
  WHERE SECOND_AUTHENTICATION_FACTOR = 'MFA_TOKEN';
```

### Using MFA with Snowflake CLI

MFA can be used for connecting to Snowflake through Snowflake CLI. By default, the Duo Push authentication mechanism is used when a user is
enrolled in MFA.

To use a Duo-generated passcode instead of the push mechanism, the login parameters must include one of the following connection options:

* Use the `--mfa-passcode <string>` option.
* Set `passcode_in_password=true` in the `config.toml` configuration file.

For more details, see [Use multi-factor authentication (MFA)](../developer-guide/snowflake-cli/connecting/configure-connections.md).

### Using MFA with SnowSQL

MFA can be used for connecting to Snowflake through SnowSQL. By default, the Duo Push authentication mechanism is used when a user is
enrolled in MFA.

To use a Duo-generated passcode instead of the push mechanism, the login parameters must include one of the following connection options:

> `--mfa-passcode <string>` OR `--mfa-passcode-in-password`

For more details, see [SnowSQL (CLI client)](snowsql.md).

### Using MFA with JDBC

MFA can be used for connecting to Snowflake via the Snowflake JDBC driver. By default, the Duo Push authentication mechanism is used when a
user is enrolled in MFA; no changes to the JDBC connection string are required.

To use a Duo-generated passcode instead of the push mechanism, one of the following parameters must be included in the JDBC connection
string:

> `passcode=<passcode_string>` OR `passcodeInPassword=on`

Where:

* `passcode_string` is a Duo-generated passcode for the user who is connecting. This can be a passcode generated by the Duo Mobile
  application or an SMS passcode.
* If `passcodeInPassword=on`, then the password and passcode are concatenated, in the form of
  `<password_string><passcode_string>`.

For more details, see [JDBC Driver](../developer-guide/jdbc/jdbc.md).

#### Examples of JDBC connection strings using Duo

JDBC connection string for user `demo` connecting to the `xy12345` account (in the US West region) using a Duo passcode:

> ```bash
> jdbc:snowflake://xy12345.snowflakecomputing.com/?user=demo&passcode=123456
> ```

JDBC connection string for user `demo` connecting to the `xy12345` account (in the US West region) using a Duo passcode that is
embedded in the password:

> ```bash
> jdbc:snowflake://xy12345.snowflakecomputing.com/?user=demo&passcodeInPassword=on
> ```

### Using MFA with Node.js

MFA can be used for connecting to Snowflake through the Snowflake Node.js driver. By default, the Duo Push authentication mechanism is used when a user is enrolled in MFA.

To use a Duo-generated passcode instead of the push mechanism, the login parameters must include one of the following connection options. Both examples use a password of `abc123` and MFA passcode of `987654` to demonstrate the configuration.

* Set the `passcodeInPassword` option to `true` and include the passcode as part of the password string, similar to the following:

  ```javascript
  authenticator: 'USERNAME_PASSWORD_MFA',
  password: "abc123987654", // passcode 987654 is part of the password
  passcodeInPassword: true  // because passcodeInPassword is true
  ```
* Set the `passcode` option to the value of the passcode to specify the password and the passcode separately, similar to the following:

  ```javascript
  authenticator: 'USERNAME_PASSWORD_MFA',
  password: "abc123", // password and MFA passcode are input separately
  passcode: "987654"
  ```

  To use this approach, ensure that the `passcodeInPassword` option is `false` (the default value). If both `passcodeInPassword` is set to `true` and `passcode` is also configured, the `passcodeInPassword` setting takes precedence and the driver assumes the `password` field contains both the password and the MFA passcode when authenticating.

For more details, see [Use an MFA passcode](../developer-guide/node-js/nodejs-driver-authenticate.md).

### Using MFA with ODBC

MFA can be used for connecting to Snowflake via the Snowflake ODBC driver. By default, the Duo Push authentication mechanism is used when a
user is enrolled in MFA; no changes to the ODBC settings are required.

To use a Duo-generated passcode instead of the push mechanism, one of the following parameters must be specified for the driver:

> `passcode=<passcode_string>` OR `passcodeInPassword=on`

Where:

* `passcode_string` is a Duo-generated passcode for the user who is connecting. This can be a passcode generated by the Duo Mobile
  application or an SMS passcode.
* If `passcodeInPassword=on`, then the password and passcode are concatenated, in the form of
  `<password_string><passcode_string>`.

For more details, see [ODBC Driver](../developer-guide/odbc/odbc.md).

### Using MFA with Python

MFA can be used for connecting to Snowflake via the Snowflake Python Connector. By default, the Duo Push authentication mechanism is used
when a user is enrolled in MFA; no changes to the Python API calls are required.

To use a Duo-generated passcode instead of the push mechanism, one of the following parameters must be specified for the driver in the
connect() method:

> `passcode=<passcode_string>` OR `passcode_in_password=True`

Where:

* `passcode_string` is a Duo-generated passcode for the user who is connecting. This can be a passcode generated by the Duo Mobile
  application or an SMS passcode.
* If `passcode_in_password=True`, then the password and passcode are concatenated, in the form of
  `<password_string><passcode_string>`.

For more details, see the description of the connect() method in the [Functions](../developer-guide/python-connector/python-connector-api.md) section of the Python
Connector API documentation.

---
title: Multi-Location Resilience for Data Pipelines
source: https://docs.snowflake.com/en/user-guide/multi-location-resilience-data-pipelines.md
section: User Guide
---

# Multi-Location Resilience for Data Pipelines

Multi-location resilience for data pipelines helps you safeguard your data
pipelines against potential region-wide cloud provider outages. This feature
ensures that upon failing over to a secondary location, your data pipelines
(specifically those using Snowpipe and COPY INTO) resume loading new data
without interruption or duplicate ingestion.

This feature works cross-cloud, allowing your primary and backup storage
locations to span entirely different cloud providers (for example, failing over
from AWS to Azure), as well as cross-region within the same cloud.

This feature relies on a shared-responsibility model:

* **Snowflake’s role:** Snowflake natively replicates your target tables and
  load history (ingestion state) to your secondary account. During a failover,
  Snowflake uses this state to prevent duplicates and only ingest files that
  were not processed in the primary location.
* **Your role:** In the event of an outage (or as part of a dual-write cloud
  setup), you must route new incoming files to your secondary cloud storage
  location. Snowflake does not replicate your external cloud storage files.

Pipeline resilience is powered by configuring up to two key resources:

* **Multi-Location Storage Integration (MLSI):** Securely connects Snowflake to
  multiple external cloud storage locations across regions or clouds. MLSI is
  needed when you want resilience for either COPY INTO from external stages
  alone or for your Snowpipe pipeline.
* **Multi-Queue Notification Integration (MQNI):** Connects Snowflake to
  multiple third-party cloud message queues, ensuring continuous receipt of new
  file notifications. MQNI is only needed if you want resilience for your
  Snowpipe pipeline, that is, for continuous data loading.

## Requirements and considerations

Before configuring this feature, review the following prerequisites and
considerations:

### Requirements

* **Edition:** Snowflake Business Critical Edition (or higher).
* **Supported ingestion methods:** This feature exclusively supports file-based
  data loading through Snowpipe (auto-ingest) and COPY INTO <table>. It does
  not support Openflow or Snowpipe Streaming.
* **Identical path structures:** To allow your pipelines to locate new files
  after failover, you must write them to the secondary storage location using
  the exact same hierarchy, folder structure, and relative path as your primary
  location.

### Considerations

* **Billing:** This feature incurs standard replication charges (data transfer
  and compute resources), billed to your target account.
* **Stage modification downtime:** Changing the RELATIVE_URL property on an
  existing stage will invalidate dependent objects and halt ingestion. We
  recommend creating new stages during setup to avoid downtime.
* **Multi-Queue Notification Integration (MQNI):** Using the same active queue
  in both source and target accounts is not supported. Doing so can result in
  notification loss. Snowflake does not check whether the same queue is in use
  across accounts.
* **Directory table:** Creating a directory table on a stage using MLSI is
  currently not supported.

### Replication behavior

* **Asynchronous replication:** Snowflake replicates your tables and your
  pipeline’s load history together in the exact same snapshot. Because they are
  synchronized, an outage will not result in duplicate data. If your secondary
  database is four hours behind, the table data is also four hours behind, and
  processing four hours of queued notifications simply brings the table up to
  date.
* **Dual-write data loss avoidance:** Your Recovery Point Objective (RPO) is
  dictated by your replication refresh interval. To prevent data loss during a
  failover, your secondary cloud message queue’s message retention period must
  be longer than your replication interval. If your queue drops messages before
  your scheduled replication catches up, those files will not be ingested upon
  failover.
* **Single-write data loss risk:** If you use single-write routing, any files
  processed in the primary location after your last successful replication are
  entirely unknown to the secondary location. Upon failover, this data will be
  temporarily missing in your target account.

> **Warning:**
>
> **Critical warning for single-write failback:** When you execute a refresh to
> fail back to your original primary account, the primary database is
> overwritten by the secondary database. If you do not manually reconcile and
> load those orphaned files into your secondary database before syncing back,
> they will be permanently erased from your primary database.

## Choosing the right architecture

Because Snowflake asynchronously replicates your target tables and your
pipeline’s load history together in the same snapshot, your pipelines are
protected against data duplication and partial loads. If an outage occurs
mid-ingestion, the transaction rolls back completely so that there are no
partially loaded files.

However, how you recover “in-flight” files during an outage depends entirely on
whether your external cloud storage routing is configured for dual-writes or
single-write routing.

### 1. Dual-writes - recommended

Your producer application writes files to both your primary and secondary cloud
storage buckets simultaneously. The secondary message queue accumulates
notifications, but Snowflake does not process them because the secondary
database is read-only.

* **What happens on failover:** The secondary database becomes writable. Snowpipe
  reads the secondary queue and uses the replicated load history to deduplicate
  files. If an outage prevented a file from finishing in the primary location,
  the secondary pipeline reads the notification from the secondary queue, sees
  the file is missing from the load history, and ingests it.
* **What happens on failback:** When the primary location recovers and you refresh
  the failover group and then fail back, Snowpipe automatically starts ingesting
  new files since the load history was synced from the secondary account during
  your failback preparation.
* **Result:** No missing data, no duplicates. Snowflake handles reconciliation
  automatically in both directions.
* **Action needed:** None, beyond the standard pre-failback replication sync
  (ALTER FAILOVER GROUP … REFRESH) to ensure the primary account has the
  latest load history.

### 2. Single-write routing

Your producer only writes to the primary cloud storage. Upon an outage, you
reroute the producer to start writing new files to the secondary cloud storage.

* **What happens on failover:** The secondary account immediately begins processing
  new files routed to the secondary bucket. However, any in-flight files
  trapped in the impacted primary location are left behind temporarily.
* **What happens on failback:** When the primary location recovers and you fail back
  to your primary Snowflake account, Snowpipe automatically processes any file
  notifications that successfully reached the queue before the outage.
* **Result:** No duplicates. However, any files where the cloud notification
  completely failed to generate because of the outage (or where the outage
  outlasted your queue’s message retention policy) require manual intervention.
* **Action needed:** After failing back, compare your primary storage bucket against
  the COPY_HISTORY view in Snowflake to identify any missing files. Run
  ALTER PIPE … REFRESH or a manual COPY INTO command to load those specific
  stranded files.

## Part 1: One-time configuration (setup)

The following steps are performed once to configure your resilient data
pipelines. Because you configure the active locations for both accounts during
setup, failing over during an actual outage is nearly instantaneous.

### Step 1: Create a Multi-Location Storage Integration (MLSI)

To configure a Multi-Location Storage Integration, you follow the standard
steps for configuring a storage integration with a few differences noted in
this section.

In your source account, create the MLSI by providing values for each location
in the STORAGE_LOCATIONS list. You can mix and match cloud providers for
cross-cloud setups.

```sqlexample
CREATE STORAGE INTEGRATION my_mlsi
  TYPE = EXTERNAL_STAGE
  STORAGE_LOCATIONS =
  (
    (
      NAME = 'my-s3-us-west-1'
      STORAGE_PROVIDER = 'S3'
      STORAGE_BASE_URL = 's3://myBucketWest'
      STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::12345:role/myrole'
      STORAGE_AWS_EXTERNAL_ID = 'mlsi-external-id'
      ENCRYPTION = ( TYPE = 'AWS_SSE_S3' )
    ),
    (
      NAME = 'my-s3-us-east-1'
      STORAGE_PROVIDER = 'S3'
      STORAGE_BASE_URL = 's3://myBucketEast'
      STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::67890:role/myrole'
      STORAGE_AWS_EXTERNAL_ID = 'mlsi-external-id'
      ENCRYPTION = ( TYPE = 'AWS_SSE_S3' )
    )
  )
  ENABLED = TRUE
  STORAGE_ALLOWED_LOCATIONS = ('*')
  ACTIVE = 'my-s3-us-west-1';
```

Where:

* **STORAGE_LOCATIONS:** Specifies a list of one or more storage locations (S3
  bucket, GCS bucket, or Azure container) for the storage integration. To view
  the parameters for each cloud provider, see
  [cloud provider parameters (cloudProviderParams)](../sql-reference/sql/create-storage-integration.md)
  on the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) reference page.
* **NAME:** String that specifies the identifier (name) for the storage location.
* **ENCRYPTION:** Specifies encryption for the storage location. You must specify
  encryption for the storage location at the storage integration level instead
  of at the stage level. Required only for loading from encrypted files; not
  required if the storage location and files are unencrypted. To view the
  encryption options for each cloud provider, see
  [ENCRYPTION](../sql-reference/sql/create-stage.md) on the
  [CREATE STAGE](../sql-reference/sql/create-stage.md) reference page.
* **ACTIVE:** Specifies the name of the storage location to set as the active
  location for the storage integration in the current account.

  For the active storage location in your source account, you must set up
  access control and grant Snowflake permission to access your storage. Use the
  instructions in the following topics:

  + [Configuring a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md)
  + [Configure an integration for Google Cloud Storage](data-load-gcs-config.md)
  + [Configure an Azure container for loading data](data-load-azure-config.md)

### Step 2: Associate MLSI with an external stage

We highly recommend creating a new stage rather than altering an existing one.

> **Warning:**
>
> **WARNING: Changing RELATIVE_URL causes downtime**
>
> If you use ALTER STAGE to change the RELATIVE_URL of an existing stage, any
> dependent directory tables are recreated, and any external tables or pipes
> using this stage are marked as invalid and will stop ingestion. Prepare for
> downtime if you choose to alter an existing stage.

Use the CREATE STAGE command to associate the multilocation storage integration
that you created with one or more external stages:

```sqlexample
CREATE STAGE my_ext_stage
  RELATIVE_URL = '/my_folder/my_sub_folder/'
  STORAGE_INTEGRATION = 'my_mlsi';
```

Where:

* **RELATIVE_URL:** The relative path to your external stage location from the
  storage location defined in your storage integration. To allow your pipelines
  to locate new files after failover, you must write them to the secondary
  storage location using the same hierarchy, folder structure, and relative
  path as your primary location.

> **Note:**
>
> This value must be a literal path. Specifying a pattern or wildcard isn’t
> supported. To specify access to all locations under the STORAGE_BASE_URL of
> your storage integration, use an empty string RELATIVE_URL = ‘’.

* **STORAGE_INTEGRATION:** The name of your Multi-Location Storage Integration.

> **Note:**
>
> Alternatively, you can alter an existing external stage by specifying the
> RELATIVE_URL parameter and your MLSI. The ALTER STAGE command also supports
> rolling back this change so that the external stage does not use a
> Multi-Location Storage Integration.

For example:

```sqlexample
ALTER STAGE my_ext_stage SET
  RELATIVE_URL = '/my_folder/my_sub_folder/'
  STORAGE_INTEGRATION = 'my_mlsi';
```

### Step 3: Configure a Multi-Queue Notification Integration (MQNI)

If you use automated data loading through cloud messaging and have configured a
Multi-Location Storage Integration for your external stage, you must also use a
Multi-Queue Notification Integration for seamless failover of your Snowpipe
pipelines.

For each queue that you define for the notification integration, you must
prepare your messaging service using the steps in the following topics:

* [Configuring Amazon SNS to Automate Snowpipe using SQS Notifications](data-load-snowpipe-auto-s3.md).
  Create an SNS topic for each AWS region in which your MLSI has storage
  locations.
* [Configuring Automation Using GCS Pub/Sub](data-load-snowpipe-auto-gcs.md)
* [Configuring Automation With Azure Event Grid](data-load-snowpipe-auto-azure.md)

> **Note:**
>
> If you do not want to use Amazon SNS with Snowpipe, you can avoid creating an
> MQNI but you must perform an additional step during failover. If you choose
> this option, associate your pipe with the stage and MLSI created above, and
> then proceed to Step 4.

#### Scenario A: Create a new Multi-Queue Notification Integration (MQNI)

To create a Multi-Queue Notification Integration, you follow the standard steps
for creating a notification integration with a few differences noted in this
section.

In your source account, create a multi-queue notification integration by
providing values for each queue in the QUEUES list:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_mqni
  ENABLED = TRUE
  TYPE = MULTI_QUEUE
  DIRECTION = INBOUND
  QUEUES = (
    (
      NAME = 'my-us-west-1'
      NOTIFICATION_PROVIDER = AWS_SNS
      AWS_SNS_TOPIC_ARN = 'arn:aws:sns:us-west-1:12345:my-snowpipe-mlsi-west'
    ),
    (
      NAME = 'my-us-east-1'
      NOTIFICATION_PROVIDER = AWS_SNS
      AWS_SNS_TOPIC_ARN = 'arn:aws:sns:us-west-1:12345:my-snowpipe-mlsi-east'
    )
  )
  ACTIVE = 'my-us-west-1';
```

Where:

* **TYPE = MULTI_QUEUE:** Specifies that this is a multi-queue integration between
  Snowflake and a third-party cloud message-queuing service.
* **DIRECTION = INBOUND:** Specifies that Snowflake receives notifications sent by
  the cloud messaging service.
* **QUEUES:** Specifies a list of one or more queues for the notification
  integration.
* **NAME:** String that specifies the identifier (name) for the queue.

To view the specific queue parameters for each cloud provider, see:

* **AWS:**

  + **NOTIFICATION_PROVIDER = AWS_SNS:** Specifies Amazon Simple Notification
    Service (SNS) as the third-party cloud message queueing service.
  + **AWS_SNS_TOPIC_ARN:** Amazon Resource Name (ARN) of the Amazon SNS topic to
    which notifications are pushed.
* **Google Cloud:**
  [CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](../sql-reference/sql/create-notification-integration-queue-inbound-gcp.md)
* **Azure:**
  [CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](../sql-reference/sql/create-notification-integration-queue-inbound-azure.md)
* **ACTIVE:** Specifies the name of the queue to set as the active queue for the
  notification integration in the current account.

  For the active queue in your source account, you must grant Snowflake
  permission to access your messaging service. Follow the instructions for your
  cloud provider:

  + For AWS, see Subscribe the Snowflake SQS Queue to the SNS Topic under
    [Prerequisite: Create an Amazon SNS Topic and Subscription](data-load-snowpipe-auto-s3.md).
  + For Google Cloud, see Grant Snowflake Access to the Pub/Sub Subscription
    under [Configuring Automation Using GCS Pub/Sub](data-load-snowpipe-auto-gcs.md).
  + For Azure, see Grant Snowflake Access to the Storage Queue under
    [Configuring Automation With Azure Event Grid](data-load-snowpipe-auto-azure.md).

After you create an MQNI, you can use it to create a new pipe with the CREATE
PIPE command. The following example creates a pipe to load data from Amazon S3
into a table using an external stage (my_ext_stage) that depends on a
Multi-Location Storage Integration:

```sqlexample
CREATE PIPE my_pipe
  AUTO_INGEST = TRUE
  INTEGRATION = my_mqni
  AS COPY INTO my_table FROM @my_ext_stage/my_pipe/;
```

#### Scenario B: Migrate an existing notification integration to MQNI

If you already have existing notification integrations that you want to convert
to MQNI rather than creating a new one from scratch, use the
SYSTEM$CONVERT_PIPES_TO_MULTI_QUEUE function.

The function creates a new multi-queue notification integration using the name
you specify, sets the active queue for your source account to the original
queue, and automatically migrates any pipes in the source account to use the
new MQNI.

Syntax:

```sqlsyntax
SYSTEM$CONVERT_PIPES_TO_MULTI_QUEUE(
  '<new_mqni_name>',
  '<original_sns_topic_arn_or_int_name>',
  '<new_sns_topic_arn_or_int_name>'
)
```

Where:

* **new_mqni_name:** String that specifies an identifier (name) to assign to the
  new multi-queue notification integration that the function creates.
* **original_sns_topic_arn_or_int_name:**

  + For AWS, the Amazon Resource Name (ARN) of the original SNS topic
    associated with one or more pipes.
  + For Google Cloud or Azure, a string that specifies the identifier of your
    original single-queue notification integration associated with one or more
    pipes.
* **new_sns_topic_arn_or_int_name:**

  + For AWS, the Amazon Resource Name (ARN) of a new SNS topic to add as a
    queue to the MQNI.
  + For Google Cloud or Azure, a string that specifies the identifier of your
    new single-queue notification integration to combine with the original
    notification integration.

Example 1: Add a new SNS topic queue

```sqlexample
SELECT SYSTEM$CONVERT_PIPES_TO_MULTI_QUEUE(
  'my_mqni',
  'arn:aws:sns:us-west-1:12345:my-snowpipe-mlsi-west',
  'arn:aws:sns:us-east-1:67890:my-snowpipe-mlsi-east'
);
```

This results in an MQNI named my_mqni with the following queues:

* MY_MQNI-queue1 (for the original, active SNS topic)
* MY_MQNI-queue2 (for the new SNS topic)

Example 2: Combine two notification integrations into MQNI

```sqlexample
SELECT SYSTEM$CONVERT_PIPES_TO_MULTI_QUEUE(
  'my_azure_mqni',
  'my_azure_ni_1',
  'my_azure_ni_2'
);
```

This results in an MQNI named my_azure_mqni with the following queues:

* my_azure_ni_1 (for the original, active queue)
* my_azure_ni_2 (for the new queue)

> **Note:**
>
> If you want to change the active queue in your source account, you can use
> an ALTER INTEGRATION … SET ACTIVE = ‘<my_queue>’ statement. You must pause
> any pipes that use the notification integration before updating the active
> queue.

### Step 4: Replicate your MLSI and MQNI to target account

> **Note:**
>
> A refresh operation drops any storage or notification integrations in the
> target account that are not replicas unless the objects have global IDs.
>
> For more information, see [Replication and objects in target accounts](account-replication-considerations.md).

1. To replicate your multi-location storage integration and multi-queue
notification integration, alter your existing replication or failover group to
include STORAGE INTEGRATIONS and NOTIFICATION INTEGRATIONS in the
ALLOWED_INTEGRATION_TYPES list.

For example, use the ALTER FAILOVER GROUP command:

```sqlexample
ALTER FAILOVER GROUP my_fg SET
  OBJECT_TYPES = DATABASES, ROLES, INTEGRATIONS
  ALLOWED_INTEGRATION_TYPES = API INTEGRATIONS, STORAGE INTEGRATIONS,
    NOTIFICATION INTEGRATIONS;
```

2. Then, in your target account, perform a refresh operation:

```sqlexample
ALTER FAILOVER GROUP my_fg REFRESH;
```

### Step 5: Configure active states in target account

After you perform a refresh operation, to ensure a seamless failover during an
actual outage, configure the active storage location and queue (if using a
notification integration) in your target account.

In your target account:

1. For the storage location that you want to set as the active location in your
   target account, use the instructions in the following topics to grant
   Snowflake access to your storage:

   * [Option 1: Configure a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md)
   * [Configure an integration for Google Cloud Storage](data-load-gcs-config.md)
   * [Configure an Azure container for loading data](data-load-azure-config.md)
2. Activate secondary storage: Set the MLSI to use your secondary backup
   storage location in the target account.

   ```sqlexample
   ALTER STORAGE INTEGRATION my_mlsi SET ACTIVE = 'my-s3-us-east-1';
   ```
3. If you are using a Multi-Queue Notification Integration, grant Snowflake
   permission to access your messaging service for the queue that you want to
   set as active in your target account. Follow the instructions for your cloud
   provider:

   * For AWS, see Subscribe the Snowflake SQS Queue to the SNS Topic under
     [Prerequisite: Create an Amazon SNS Topic and Subscription](data-load-snowpipe-auto-s3.md).
   * For Google Cloud, see Grant Snowflake Access to the Pub/Sub Subscription
     under [Configuring Automation Using GCS Pub/Sub](data-load-snowpipe-auto-gcs.md).
   * For Azure, see Grant Snowflake Access to the Storage Queue under
     [Configuring Automation With Azure Event Grid](data-load-snowpipe-auto-azure.md).
4. Activate secondary queue (if using MQNI): Set the active queue to your
   secondary location in the target account.

   ```sqlexample
   ALTER INTEGRATION my_mqni
     SET ACTIVE = 'MY_MQNI-queue2';
   ```

## Part 2: Failover steps

Execute these steps during an outage to redirect your data ingestion to your
secondary location. Because your active queues and storage were preconfigured
in setup, this process requires minimal commands.

1. Promote the target account: Log in to your target account and promote it to
   serve as the new primary account. Data loading automatically resumes from
   your secondary cloud infrastructure.

   ```sqlexample
   ALTER FAILOVER GROUP my_fg PRIMARY;
   ```
2. If not using Amazon SNS with Snowpipe: If you are not using SNS with
   Snowpipe and only relying on SQS, you do not need to create an MQNI.
   Instead, call the following system function to rebind your pipe during
   failover.

   ```sqlexample
   SELECT SYSTEM$INGEST_REBIND_PIPE('my_db.my_schema.my_pipe');
   ```

## Part 3: Failback steps

Once the outage is resolved and your primary location is healthy, execute these
steps to move your pipelines back to the primary location.

1. Sync data back: Before promoting your original account, you must pull all
   data and state changes that occurred during the outage back to your original
   account. Log in to your original primary account (currently acting as the
   secondary account) and initiate a manual refresh:

   ```sqlexample
   ALTER FAILOVER GROUP my_fg REFRESH;
   ```

   > **Important:**
   >
   > Wait for this refresh operation to finish completely before moving to the
   > next step. Failing over before sync completes can result in data loss.

   > **Warning:**
   >
   > **Critical warning for single-write failback:** If you use single-write
   > routing, any files processed in the primary location after your last
   > successful replication are unknown to the secondary location. Upon
   > failover, this data is temporarily missing in your target account. When
   > you execute a refresh to fail back to your original primary account, the
   > primary database is overwritten by the secondary database. If you do not
   > manually reconcile and load those orphaned files into your secondary
   > database before syncing back, they are permanently erased from your
   > primary database.
2. Promote the original account: Once refresh is complete, promote your
   original source account back to primary.

   ```sqlexample
   ALTER FAILOVER GROUP my_fg PRIMARY;
   ```
3. If not using Amazon SNS with Snowpipe: Call the system function to rebind
   your pipe back to the original source location.

   ```sqlexample
   SELECT SYSTEM$INGEST_REBIND_PIPE('my_db.my_schema.my_pipe');
   ```

## Part 4: Monitoring and validation

After initiating a failover or failback, use the following commands to verify
that your data pipelines successfully redirected and resumed ingestion.

### 1. Verify active integration states

Confirm that your integrations are pointing to the correct storage and queues
by checking their properties. Look for the ACTIVE property in the output:

```sqlexample
-- Check the active storage location
DESCRIBE STORAGE INTEGRATION my_mlsi;

-- Check the active message queue
DESCRIBE INTEGRATION my_mqni;
```

### 2. Check pipe status (Snowpipe only)

Use the SYSTEM$PIPE_STATUS function to ensure your pipe is running and to check
whether it is actively queueing new files from your secondary location.

```sqlexample
SELECT SYSTEM$PIPE_STATUS('my_pipe');
```

Look for “executionState”:”RUNNING” and check “pendingFileCount” to confirm it
is actively recognizing new files dropped into your secondary bucket.

### 3. Validate successful ingestion (load history)

To guarantee that data is loading without errors or duplicates, query the
COPY_HISTORY view. This shows exactly which files were ingested, their source
path, and when they were loaded.

```sqlexample
SELECT file_name, status, row_count, last_load_time
FROM TABLE(information_schema.copy_history(
  table_name => 'my_table',
  start_time => DATEADD(hours, -1, CURRENT_TIMESTAMP())
));
```

Verify that the file_name paths reflect your active storage location and that
status shows as LOADED.

---
title: Native Programmatic Interfaces
source: https://docs.snowflake.com/en/user-guide/ecosystem-lang.md
section: User Guide
---

# Native Programmatic Interfaces

Snowflake supports developing applications using many popular programming languages and development platforms. Using native clients
(connectors, drivers, etc.) provided by Snowflake, you can develop applications using any of the following programmatic interfaces:

| Interface |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Go Language:** 1.14 (or higher)  **Snowflake:** Go Snowflake Driver — available from the [Go](https://pkg.go.dev/github.com/snowflakedb/gosnowflake) site |  |
|  |  | **Java:** Java LTS (Long-Term Support) versions 1.8 or higher  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) |  |
|  |  | **Microsoft .NET:** Visual Studio 2017  **Snowflake:** .NET Driver — download from [GitHub](https://github.com/snowflakedb/snowflake-connector-net) |  |
|  |  | **Node.js:** 10.0 (or higher)  **Snowflake:** [Node.js Driver](../developer-guide/node-js/nodejs-driver.md) — install using [npm](https://www.npmjs.com/package/snowflakedb) |  |
|  |  | **C Language:** Requirements are OS-specific  **Snowflake:** [ODBC Driver](../developer-guide/odbc/odbc.md) — download from the [ODBC Download](https://developers.snowflake.com/odbc/) page |  |
|  |  | **PHP:** 7.2 (or higher)  **Snowflake:** PHP PDO Driver — build from [GitHub](https://github.com/snowflakedb/pdo_snowflake) |  |
|  |  | **Python:**  Version 3.7 and later  **Snowflake:** [Connector for Python](../developer-guide/python-connector/python-connector.md) — install using [pip](https://pypi.org/project/snowflake-connector-python/) |  |
|  |  | **Python:** 3.6, 3.7, or 3.8  **SQLAlchemy:** No requirements  **Snowflake:** [SQLAlchemy Toolkit](../developer-guide/python-connector/sqlalchemy.md) — install using [pip](https://pypi.org/project/snowflake-sqlalchemy/) |  |

---
title: Native semantic categories of sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-native.md
section: User Guide
---

# Native semantic categories of sensitive data classification

A semantic category is a label that describes the meaning or type of information in a data column, beyond the fundamental data type.
You can use semantic categories to add business context and improve data goverance. Snowflake provides the following semantic categories
that identify common types of sensitive attributes, such as names and addresses. These native semantic categories can be sectioned into the
following privacy categories:

* Identifiers
* Quasi-identifiers
* Sensitive information

> **Important:**
>
> Under various laws and regulations, multiple semantic categories can be considered “Sensitive Personal Data”, “Special Categories of
> Data”, or similar terms. These semantic categories might require additional protections or controls.

To classify attributes that are not supported natively, refer to [Create custom categories for sensitive data](classify-custom.md).

## About semantic subcategories

If Snowflake identifies that the type of sensitive data is specific to a country, it records a *semantic subcategory* in the classification details. For example, a social security number (SSN) is an identifier in the United States (US), and its semantic subcategory is `NATIONAL_IDENTIFIER`.

You can find the semantic subcategory in the `Details` field of the JSON object returned by the classification
process. For more information about viewing this response object, refer to [Use SQL to view classification results](classify-results.md).

If the type of sensitive data is not specific to a country and is globally applicable, it does not have a semantic subcategory. This type of
sensitive data is categorized as a global identifier.

## Identifiers

Identifier semantic categories represent personally identifiable information (PII) or sensitive data elements that can be used to
identify individuals or entities.

### Global identifiers

Global identifer categories are semantic categories that are not specific to a country and are globally applicable.

| Semantic category | Notes |
| --- | --- |
| BANK_ACCOUNT | For countries outside of Cananda, New Zealand, and the United States, the semantic subcategory is International Bank Account Number (IBAN). |
| EMAIL |  |
| IMEI | An International Mobile Equiment Identity (IMEI) is a unique number that identifies a phone’s model and serial number. |
| IP_ADDRESS |  |
| NAME |  |
| PAYMENT_CARD |  |
| URL | A Uniform Resource Locator (URL) is the unique address of a resource (such as a document or website) on the Internet. |
| VIN | The Vehicle Identification Number. |

### Country-specific identifiers

| Semantic category | Country | Semantic subcategory | Notes |
| --- | --- | --- | --- |
| BANK_ACCOUNT | Canada (CA) | CA_BANK_ACCOUNT |  |
|  | New Zealand (NZ) | NZ_BANK_ACCOUNT |  |
|  | United States (US) | US_BANK_ACCOUNT |  |
| DRIVERS_LICENSE | Austria (AT) | AT_DRIVERS_LICENSE |  |
|  | Australia (AU) | AU_DRIVERS_LICENSE |  |
|  | Belgium (BE) | BE_DRIVERS_LICENSE |  |
|  | Bulgaria (BG) | BG_DRIVERS_LICENSE |  |
|  | Canada (CA) | CA_DRIVERS_LICENSE |  |
|  | Croatia (HR) | HR_DRIVERS_LICENSE |  |
|  | Cyprus (CY) | CY_DRIVERS_LICENSE |  |
|  | Czechia (CZ) | CZ_DRIVERS_LICENSE |  |
|  | Denmark (DK) | DK_DRIVERS_LICENSE |  |
|  | Estonia (EE) | EE_DRIVERS_LICENSE |  |
|  | Finland (FI) | FI_DRIVERS_LICENSE |  |
|  | France (FR) | FR_DRIVERS_LICENSE |  |
|  | Germany (DE) | DE_DRIVERS_LICENSE |  |
|  | Greece (GR) | GR_DRIVERS_LICENSE |  |
|  | Hungary (HU) | HU_DRIVERS_LICENSE |  |
|  | India (IN) | IN_DRIVERS_LICENSE |  |
|  | Ireland (IE) | IE_DRIVERS_LICENSE |  |
|  | Italy (IT) | IT_DRIVERS_LICENSE |  |
|  | Latvia (LV) | LV_DRIVERS_LICENSE |  |
|  | Lithuania (LT) | LT_DRIVERS_LICENSE |  |
|  | Luxembourg (LU) | LU_DRIVERS_LICENSE |  |
|  | Malta (MT) | MT_DRIVERS_LICENSE |  |
|  | Netherlands (NL) | NL_DRIVERS_LICENSE |  |
|  | New Zealand (NZ) | NZ_DRIVERS_LICENSE |  |
|  | Poland (PL) | PL_DRIVERS_LICENSE |  |
|  | Portugal (PT) | PT_DRIVERS_LICENSE |  |
|  | Romania (RO) | RO_DRIVERS_LICENSE |  |
|  | Slovakia (SK) | SK_DRIVERS_LICENSE |  |
|  | Slovenia (SI) | SI_DRIVERS_LICENSE |  |
|  | Spain (ES) | ES_DRIVERS_LICENSE |  |
|  | Sweden (SE) | SE_DRIVERS_LICENSE |  |
|  | United States (US) | US_DRIVERS_LICENSE |  |
| MEDICARE_NUMBER | Australia (AU) | AU_MEDICARE_NUMBER |  |
|  | New Zealand (NZ) | NZ_NHI_NUMBER |  |
| NATIONAL_IDENTIFIER | Austria (AT) | AT_IDENTITY_CARD  AT_SSN |  |
|  | Belgium (BE) | BE_NATIONAL_NUMBER |  |
|  | Bulgaria (BG) | BG_UNIFORM_CIVIL_NUMBER |  |
|  | Canada (CA) | CA_SOCIAL_INSURANCE_NUMBER |  |
|  | Croatia (HR) | HR_PERSONAL_IDENTIFICATION_NUMBER |  |
|  | Cyprus (CY) | CY_IDENTITY_CARD |  |
|  | Czechia (CZ) | CZ_PERSONAL_IDENTITY_NUMBER |  |
|  | Denmark (DK) | DK_PERSONAL_IDENTIFICATION_NUMBER |  |
|  | Estonia (EE) | EE_PERSONAL_IDENTIFICATION_CODE |  |
|  | Finland (FI) | FI_NATIONAL_IDENTITY_CARD |  |
|  | France (FR) | FR_CNI  FR_SSN | The FR_SSN is also known as the INSEE number. |
|  | Germany (DE) | DE_IDENTITY_CARD |  |
|  | Greece (GR) | GR_NATIONAL_IDENTITY_CARD  GR_SSN | The GR_SSN is also known as the AMKA number. |
|  | Hungary (HU) | HU_PERSONAL_IDENTIFICATION_NUMBER  HU_SSN | The HU_SSN is also known as the TAJ number. |
|  | India (IN) | IN_PAN  IN_AADHAAR  IN_VOTER_ID |  |
|  | Ireland (IE) | IE_PERSONAL_PUBLIC_SERVICE_NUMBER |  |
|  | Latvia (LV) | LV_PERSONAL_CODE |  |
|  | Lithuania (LT) | LT_PERSONAL_CODE |  |
|  | Luxembourg (LU) | LU_NATIONAL_IDENTIFICATION_NUMBER_NATURAL_PERSONS  LU_NATIONAL_IDENTIFICATION_NUMBER_NON_NATURAL_PERSONS |  |
|  | Malta (MT) | MT_IDENTITY_CARD |  |
|  | Netherlands (NL) | NL_CITIZEN_SERVICE_NUMBER |  |
|  | New Zealand (NZ) | NZ_STUDENT_NUMBER |  |
|  | Poland (PL) | PL_NATIONAL_ID |  |
|  | Portugal (PT) | PT_CITIZEN_CARD_NUMBER |  |
|  | Romania (RO) | RO_PERSONAL_NUMERIC_CODE |  |
|  | Singapore (SG) | SG_NATIONAL_REGISTRATION_IDENTITY_CARD |  |
|  | Slovakia (SK) | SK_PERSONAL_NUMBER |  |
|  | Slovenia (SI) | SI_UNIQUE_MASTER_CITIZEN_NUMBER |  |
|  | Spain (ES) | ES_DNI  ES_SSN |  |
|  | Sweden (SE) | SE_NATIONAL_ID |  |
|  | United Kingdom (UK) | UK_NATIONAL_INSURANCE_NUMBER |  |
|  | United States (US) | US_SSN |  |
| ORGANIZATION_IDENTIFIER | Australia (AU) | AU_BUSINESS_NUMBER  AU_COMPANY_NUMBER |  |
|  | New Zealand (NZ) | NZ_BUSINESS_NUMBER |  |
|  | Singapore (SG) | SG_UNIQUE_ENTITY_NUMBER |  |
| PASSPORT | Australia (AU) | AU_PASSPORT |  |
|  | Austria (AT) | AT_PASSPORT |  |
|  | Belgium (BE) | BE_PASSPORT |  |
|  | Bulgaria (BG) | BG_PASSPORT |  |
|  | Canada (CA) | CA_PASSPORT |  |
|  | Croatia (HR) | HR_PASSPORT |  |
|  | Cyprus (CY) | CY_PASSPORT |  |
|  | Czechia (CZ) | CZ_PASSPORT |  |
|  | Denmark (DK) | DK_PASSPORT |  |
|  | Estonia (EE) | EE_PASSPORT |  |
|  | Finland (FI) | FI_PASSPORT |  |
|  | France (FR) | FR_PASSPORT |  |
|  | Germany (DE) | DE_PASSPORT |  |
|  | Greece (GR) | GR_PASSPORT |  |
|  | Hungary (HU) | HU_PASSPORT |  |
|  | Ireland (IE) | IE_PASSPORT |  |
|  | Italy (IT) | IT_PASSPORT |  |
|  | Latvia (LV) | LV_PASSPORT |  |
|  | Lithuania (LT) | LT_PASSPORT |  |
|  | Luxembourg (LU) | LU_PASSPORT |  |
|  | Malta (MT) | MT_PASSPORT |  |
|  | Netherlands (NL) | NL_PASSPORT |  |
|  | New Zealand (NZ) | NZ_PASSPORT |  |
|  | Poland (PL) | PL_PASSPORT |  |
|  | Portugal (PT) | PT_PASSPORT |  |
|  | Romania (RO) | RO_PASSPORT |  |
|  | Singapore (SG) | SG_PASSPORT |  |
|  | Slovakia (SK) | SK_PASSPORT |  |
|  | Slovenia (SI) | SI_PASSPORT |  |
|  | Spain (ES) | ES_PASSPORT |  |
|  | Sweden (SE) | SE_PASSPORT |  |
|  | United States (US) | US_PASSPORT |  |
| PHONE_NUMBER | Australia (AU) | AU_PHONE_NUMBER |  |
|  | Canada (CA) | CA_PHONE_NUMBER |  |
|  | Japan (JP) | JP_PHONE_NUMBER |  |
|  | United Kingdom (UK) | UK_PHONE_NUMBER |  |
|  | United States (US) | US_PHONE_NUMBER |  |
| STREET_ADDRESS | Canada (CA) | CA_STREET_ADDRESS |  |
|  | New Zealand (NZ) | NZ_STREET_ADDRESS |  |
|  | United States (US) | US_STREET_ADDRESS |  |
| TAX_IDENTIFIER | Australia (AU) | AU_TAX_NUMBER |  |
|  | Austria (AT) | AT_TAX_ID_NUMBER |  |
|  | Cyprus (CY) | CY_TAX_ID_NUMBER |  |
|  | France (FR) | FR_TAX_ID_NUMBER |  |
|  | Germany (DE) | DE_TAX_ID_NUMBER |  |
|  | Greece (GR) | GR_TAX_ID_NUMBER |  |
|  | Hungary (HU) | HU_TAX_ID_NUMBER |  |
|  | India (IN) | IN_GST_NUMBER |  |
|  | Italy (IT) | IT_FISCAL_CODE |  |
|  | Malta (MT) | MT_TAX_ID_NUMBER |  |
|  | Netherlands (NL) | NL_TAX_ID_NUMBER |  |
|  | New Zealand (NZ) | NZ_INLAND_REVENUE_NUMBER |  |
|  | Poland (PL) | PL_TAX_ID_NUMBER |  |
|  | Portugal (PT) | PT_TAX_ID_NUMBER |  |
|  | Slovenia (SI) | SI_TAX_ID_NUMBER |  |
|  | Spain (ES) | ES_TAX_ID_NUMBER |  |
|  | Sweden (SE) | SE_TAX_ID_NUMBER |  |
|  | United States (US) | US_TAX_IDENTIFIER | The semantic subcategory US_TAX_IDENTIFIER is an identifier because it is the ITIN of an individual. The EMPLOYER_IDENTIFICATION_NUMBER subcategory of the TAX_IDENTIFIER category is a quasi-identifier because it is the EIN of a company. |

## Quasi-identifiers

Quasi-identifiers are attributes that do not uniquely identify an individual on their own, but when combined with other data, could
be used to re-identify someone. Examples of quasi-identifiers include demographic information, geographic data, and administrative regions.

### Global quasi-identifiers

Global quasi-identifiers are quasi-identifier semantic categories that are not specific to a country and are globally applicable.

| Semantic category |
| --- |
| AGE |
| COUNTRY |
| DATE_OF_BIRTH |
| ETHNICITY |
| GENDER |
| LATITUDE |
| LAT_LONG |
| LONGITUDE |
| MARITAL_STATUS |
| MEDICAL_SPECIALTY |
| OCCUPATION |
| YEAR_OF_BIRTH |

### Country-specific quasi-identifiers

| Semantic category | Country | Semantic subcategory | Notes |
| --- | --- | --- | --- |
| ADMINISTRATIVE_AREA_1 | Canada (CA) | CA_PROVINCE_OR_TERRITORY |  |
|  | New Zealand (NZ) | NZ_REGION |  |
|  | United States (US) | US_STATE_OR_TERRITORY |  |
| ADMINISTRATIVE_AREA_2 | United States (US) | US_COUNTY |  |
| CITY | Canada (CA) | CA_CITY |  |
|  | New Zealand (NZ) | NZ_CITY |  |
|  | United States (US) | US_CITY |  |
| POSTAL_CODE | Australia (AU) | AU_POSTAL_CODE |  |
|  | Canada (CA) | CA_POSTAL_CODE |  |
|  | Japan (JP) | JP_POSTAL_CODE |  |
|  | New Zealand (NZ) | NZ_POSTAL_CODE |  |
|  | Switzerland (CH) | CH_POSTAL_CODE |  |
|  | United Kingdom (UK) | UK_POSTAL_CODE | Contains public sector information licensed under the [Open Government Licence v3.0](https://www.nationalarchives.gov.uk/doc/open-government-licence/version/3/). |
|  | United States (US) | US_POSTAL_CODE |  |
| TAX_IDENTIFIER | United States (US) | EMPLOYER_IDENTIFICATION_NUMBER | The semantic subcategory EMPLOYER_IDENTIFICATION_NUMBER is a quasi-identifier, not an identifier, because it is the EIN of a company. The US_TAX_IDENTIFIER subcategory of the TAX_IDENTIFIER category represents the ITIN of an individual, and is an identifier. |

## Sensitive information

Sensitive information includes data elements that contain confidential or private details. While such data does not directly identify an
individual, they require protection due to their sensitive nature.

### Global sensitive information

| Semantic category | Semantic subcategory | Notes |
| --- | --- | --- |
| MEDICAL_DATA | ICD_CODE | International Classification of Diseases, 10th Revision, codes. |
|  | LAB_TERM | This includes terms related to laboratory analysis of blood samples (for example, CBC, lipid panel) and general terms for non-blood laboratory analyses (for example, urine analysis, biopsy). |
|  | MEDICAL_CONDITION | This includes specific medical conditions, illnesses, or disorders, and loss or abnormality of psychological, physiological, or anatomical structure or function (for example, impairments). |
|  | MEDICAL_PROCEDURE | Interventions involving physical alteration of tissues or organs (for example, appendectomy). |
|  | MEDICATION | This includes classifications of drugs based on function or composition (for example, antibiotics, antihistamines), proprietary trademarked names of drugs (for example, Advil, Amoxil), and non-proprietary chemical names of drugs (for example, ibuprofen, amoxicillin). |
| SALARY | n/a | n/a |

---
title: Network policy advisor
source: https://docs.snowflake.com/en/user-guide/network-policy-advisor.md
section: User Guide
---

# Network policy advisor

## Overview

Snowflake network policies are a powerful security control, but can be difficult to design correctly, especially when no current policy
exists or when traffic patterns are complex.

The Network Policy Advisor is a step-wise procedure that guides a *security*
*administrator*, that is a user with the SECURITYADMIN role, to create a recommended candidate for an ingress network policy that is based on
historical ingress-access data. You, as the administrator, then evaluate the recommended policy using a what-if simulation before activating
the policy. You can recommend and evaluate a candidate network policy for a user or for all users in an account. The advisor procedure
involves calling two non-disruptive system stored procedures. These procedures generate human-readable SQL and evaluation results that you
can review, refine, and then apply manually.

## Considerations

The Snowflake Network Policy Advisor doesn’t automatically activate or modify existing network policies. It makes no determination about
whether an IP address is correct or safe for your network environment. The advisor provides recommendations and simulations only. Any final
network policy decisions — that is, any changes to existing network rules and policies — remain the responsibility of the customer.

## Key benefits

The Network Policy Advisor provides the following key benefits:

* Enables you to safely design a first network policy.
* Provides visibility into what traffic would be blocked before enforcement.
* Reduces trial-and-error when tightening security controls.
* Supports iterative refinement and validation workflows.

## Access control requirements

A user must have the SECURITYADMIN role at a minimum to run these stored procedures.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

## Generate and evaluate a candidate network policy

To generate and evaluate a candidate network policy for an account, log in to Snowsight, open a worksheet, and follow these steps:

1. Generate the SQL syntax for a candidate policy by calling the RECOMMEND_NETWORK_POLICY procedure.

   ```sqlexample
   USE ROLE SECURITYADMIN;

   CALL SNOWFLAKE.NETWORK_SECURITY.RECOMMEND_NETWORK_POLICY(
     LOOKBACK_DAYS => 30,
     );
   ```
2. Review the SQL syntax generated in the previous step.
3. Based on your review, create a candidate network rule and policy by running commands similar to the following
   examples.

   ```sqlexample
   USE ROLE SECURITYADMIN;

   -- Create a network rule
   CREATE OR REPLACE NETWORK RULE my_ingress_rule
     MODE = INGRESS
     TYPE = IPV4
     VALUE_LIST = ('203.0.113.0/24', ...);

   -- Create a network policy
   CREATE OR REPLACE NETWORK POLICY my_ingress_policy
     ALLOWED_NETWORK_RULE_LIST = ('my_ingress_rule');
   ```
4. Run the EVALUATE_CANDIDATE_NETWORK_POLICY procedure on the candidate policy to simulate which IP addresses
   it would allow or block.

   ```sqlexample
   USE ROLE SECURITYADMIN;

   CALL SNOWFLAKE.NETWORK_SECURITY.EVALUATE_CANDIDATE_NETWORK_POLICY(
     POLICY_NAME => 'my_ingress_policy'
     );
   ```
5. Analyze the output to confirm which IP addresses would be allowed or blocked by the recommended candidate policy.
6. Refine the candidate policy based on the evaluation results.

   For example, you could add rules to allow legitimate IPs that were blocked and remove rules for unauthorized IPs that were allowed.
7. If necessary, re-evaluate the candidate policy by re-running the EVALUATE_CANDIDATE_NETWORK_POLICY procedure and refining the
   candidate network policy until it returns an acceptable result.
8. (Optional) After you determine that the candidate policy performs successfully, activate it:

   ```sqlexample
   ALTER ACCOUNT SET NETWORK_POLICY = 'my_ingress_policy';
   ```
9. (Optional) Run a query like this to view the history of ingress traffic in your network:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT *
     FROM SNOWFLAKE.ACCOUNT_USAGE.INGRESS_NETWORK_ACCESS_HISTORY
     LIMIT 100;
   ```

---
title: Network rules
source: https://docs.snowflake.com/en/user-guide/network-rules.md
section: User Guide
---

# Network rules

Network rules are schema-level objects that group network identifiers into logical units.

Snowflake features that restrict network traffic can reference network rules rather than defining network identifiers directly in the
feature. A network rule does not define whether its identifiers should be allowed or blocked. The Snowflake feature that uses the network
rule specifies whether the identifiers in the rule are permitted or prohibited.

The following features use network rules to control network traffic:

* [Network policies](network-policies.md) use network rules to control inbound network traffic to the Snowflake service and
  internal stages.
* [External network access](../developer-guide/external-network-access/external-network-access-overview.md) uses network rules to restrict
  access to external network locations from a Snowflake UDF or procedure.

## Supported network identifiers

Administrators need to be able to restrict access based on the network identifier associated with the origin or destination of a request.
Network rules allow administrators to allow or block the following network identifiers:

Incoming requests:
:   * IPv4 addresses. Snowflake supports ranges of IP addresses using
      [Classless Inter-Domain Routing (CIDR) notation](https://tools.ietf.org/html/rfc4632). For example, `192.168.1.0/24` represents
      all IPv4 addresses in the range of `192.168.1.0` to `192.168.1.255`.
    * VPCE IDs of [AWS VPC endpoints](https://docs.aws.amazon.com/vpc/latest/privatelink/concepts.html#concepts-service-consumers) . VPC
      IDs are not supported.
    * LinkIDs of [Azure private endpoints](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview). Execute
      the [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](../sql-reference/functions/system_get_privatelink_authorized_endpoints.md) function to retrieve the LinkID associated with an
      account.
    * GCPPSCIDs (pscConnectionIDs) of [Google Cloud Private Service Connect (PSC) endpoints](https://docs.cloud.google.com/vpc/docs/private-service-connect#endpoints).
      Run the [gcloud compute forwarding-rules describe command](https://docs.cloud.google.com/memorystore/docs/cluster/multiple-vpcs-automatically-registered-psc-connection#get_the_connection_id_1) to get the pscConnectionID for each forwarding rule.

Outgoing requests:
:   Domains, including a port range.

    In most cases, the valid port range is 1-65535. If you do not specify a port, it defaults to 443. If an external network location supports dynamic ports, you need to specify all possible ports.

    To allow access to all ports, define the port as 0; for example, `example.com:0`.

Each network rule contains a list of one or more network identifiers of the same type. The network rule’s `TYPE` property indicates
the type of identifiers that are included in the rule. For example, if the `TYPE` property is `IPV4`, then the network rule’s
value list must contain valid IPv4 addresses or address ranges in CIDR notation.

## Incoming vs. outgoing requests

The mode of a network rule indicates whether the Snowflake feature that uses the rule restricts incoming or outgoing requests.

### Incoming requests

[Network policies](network-policies.md) protect the Snowflake service and internal stages from incoming traffic. When a
network rule is used with a network policy, the administrator can set the mode to one of the following:

`INGRESS`
:   The behavior of the `INGRESS` mode depends on the value of the network rule’s `TYPE` property.

    * If `TYPE=IPV4`, by default the network rule controls access to the Snowflake service only.

      If the account administrator enables the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../sql-reference/parameters.md) parameter, then `MODE=INGRESS` and `TYPE=IPV4` also protects an AWS internal stage.
    * If `TYPE=AWSVPCEID`, `TYPE=AZURELINKID`, or `TYPE=GCPPSCID`, then the network rule controls access to the Snowflake service only.

`INTERNAL_STAGE`
:   Controls access to an AWS internal stage without restricting access to the Snowflake service. Using this mode requires the following:

    * The account administrator must enable the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../sql-reference/parameters.md) parameter.
    * The `TYPE` property of the network rule must be `AWSVPCEID`.
    * If you want to restrict access to an internal stage based on the VPCE ID of an interface endpoint, you must create a separate network
      rule by using the `INTERNAL_STAGE` mode.

    For accounts on Microsoft Azure, you cannot use a network rule to restrict access to the internal stage. However, you can [block all public network traffic](private-internal-stages-azure.md) from accessing the internal stage.

`SNOWFLAKE_MANAGED_STORAGE_VOLUME`
:   Controls access to an AWS Snowflake-managed storage volume without restricting access to the Snowflake service. Using this
    mode requires the following:

    * The account administrator must enable the [ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME](../sql-reference/parameters.md) parameter.
    * The `TYPE` property of the network rule must be `AWSVPCEID`.

    For accounts on Microsoft Azure, you cannot use a network rule to restrict access to a Snowflake-managed storage volume. However, you can [block all public network traffic](private-managed-volumes-azure.md) from accessing the Snowflake-managed storage volume.

### Outgoing requests

Administrators can use network rules with features that control where requests can be sent. In these cases, the administrator defines the
network rule with the following mode:

`EGRESS`
:   Indicates that the network rule is used for traffic sent *from* Snowflake.

    Currently used with [external network access](../developer-guide/external-network-access/external-network-access-overview.md), which
    allows a UDF or procedure to send requests to an external network location.

## Creating a network rule

You need the CREATE NETWORK RULE privilege on the schema to create a network rule. By default, only the ACCOUNTADMIN and SECURITYADMIN
roles, along with the schema owner, have this privilege.

You can create a network rule using Snowsight or by executing a SQL command:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Rules tab.
    3. Select + Network Rule.
    4. Enter the name of the network rule.
    5. Select the schema of the network rule. Network rule are schema-level objects.
    6. Optionally, add a descriptive comment for the network rule to help organize and maintain network rules in the schema.
    7. In the Type drop-down, select the type of identifier being defined in the network
       rule.
    8. In the Mode drop-down, select the mode of the network rule. The `INGRESS` and `INTERNAL STAGE` modes indicate the
       network rule will be used with a network policy to restrict incoming requests and the `EGRESS` mode indicates the network rule
       will be used with an external access integration to restrict outgoing requests.
    9. Enter a comma-separated list of the identifiers that will be allowed or blocked when the network rule is added to a network policy. The
       identifiers in this list must all be of the type specified in the Type drop-down.
    10. Select Create Network Rule.

SQL:
:   An administrator can execute the [CREATE NETWORK RULE](../sql-reference/sql/create-network-rule.md) command to create a new network rule, specifying a list of
    network identifiers along with the type of those identifiers.

    For example, to use a custom role to create a network rule that can be used to allow or block traffic from a range of IP addresses:

    ```sqlexample
    GRANT USAGE ON DATABASE securitydb TO ROLE network_admin;
    GRANT USAGE ON SCHEMA securitydb.myrules TO ROLE network_admin;
    GRANT CREATE NETWORK RULE ON SCHEMA securitydb.myrules TO ROLE network_admin;
    USE ROLE network_admin;

    CREATE NETWORK RULE cloud_network TYPE = IPV4 MODE = INGRESS VALUE_LIST = ('47.88.25.32/27');
    ```

### IPv4 addresses

When specifying IP addresses for a network rule, Snowflake supports ranges of IP addresses using [Classless Inter-Domain Routing (CIDR) notation](https://tools.ietf.org/html/rfc4632).

For example, `192.168.1.0/24` represents all IPv4 addresses in the range of `192.168.1.0` to `192.168.1.255`.

## Identifying network rules in your account

You can identify the network rules in your account using Snowsight or SQL.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Rules tab.

SQL:
:   Call the [NETWORK_RULE_REFERENCES](../sql-reference/functions/network_rule_references.md) Information Schema table function, or query the
    [NETWORK_RULES](../sql-reference/account-usage/network_rules.md) or
    [NETWORK_RULE_REFERENCES](../sql-reference/account-usage/network_rule_references.md) Account Usage view.

## Modifying a network rule

You can modify the identifiers and comment of an existing network rule, but you cannot modify its type, mode, name, or schema.

To add or remove identifiers and comments from an existing network rule using Snowsight or SQL, do one of the following:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Governance & security » Network policies, and then select the Network Rules tab.
    3. Find the network rule, select the … button, and then select Edit.
    4. Modify the comma-delimited list of identifiers or the comment.
    5. Select Update Network Rule.

SQL:
:   Execute an [ALTER NETWORK RULE](../sql-reference/sql/alter-network-rule.md) statement.

## Replication of network rules

Network rules are schema-level objects and are replicated with the database in which they are contained.

For information about replicating the network policies that use network rules, see [Replicating network policies](account-replication-security-integrations.md).

## Privileges and commands

| Command | Privilege | Description |
| --- | --- | --- |
| [CREATE NETWORK RULE](../sql-reference/sql/create-network-rule.md) | CREATE NETWORK RULE on SCHEMA | Creates a new network rule. |
| [ALTER NETWORK RULE](../sql-reference/sql/alter-network-rule.md) | OWNERSHIP on NETWORK RULE | Modifies an existing network rule. |
| [DROP NETWORK RULE](../sql-reference/sql/drop-network-rule.md) | OWNERSHIP on NETWORK RULE | Removes an existing network rule from the system. |
| [DESCRIBE NETWORK RULE](../sql-reference/sql/desc-network-rule.md) | OWNERSHIP on NETWORK RULE | Describes the properties of an existing network rule. |
| [SHOW NETWORK RULES](../sql-reference/sql/show-network-rules.md) | OWNERSHIP on NETWORK RULE or USAGE on SCHEMA | Lists all of the network rules in the system. |

## Snowflake-managed network rules

Snowflake provides the SNOWFLAKE.NETWORK_SECURITY schema that contains a suite of *built-in* network rules. Built-in network rules are one
type of Snowflake-managed network rule. Snowflake can update the NETWORK_SECURITY schema with new, Snowflake-managed network rules. Built-in
network rules provide a secure, consistent, fast, and low-maintenance way to manage network security.

Snowflake-managed network rules align with easy-to-use Snowflake network policy and rule management features. Snowflake customers can add
Snowflake-managed rules to new or existing [network policies](network-policies.md). Snowflake continuously updates built-in
network rules without requiring regular maintenance by account administrators. For more information about adding a network rule to a network
policy, see [Modify a network policy](network-policies.md).

Built-in network rules define the set of allowed IP addresses that a frequently used, third-party partner application uses to
connect with Snowflake. Snowflake automatically updates these rules to capture any changes that third-party providers make to their egress IP
addresses. For example, Snowflake manages a rule that defines the IP addresses that a Microsoft Power BI application uses to connect with
Snowflake. If Microsoft updates these addresses, Snowflake rules automatically update to reflect this change.

The following table lists current partner applications for which Snowflake maintains built-in network rules and information about the current
egress IP addresses for each partner application:

| SaaS applications | Egress IP addresses |
| --- | --- |
| dbt platform | [dbt platform IP addresses](https://docs.getdbt.com/docs/cloud/about-cloud/access-regions-ip-addresses) |
| Microsoft Power BI | [Power BI IP ranges](https://www.microsoft.com/en-us/download/details.aspx?id=56519) |
| Qlik | [Qlik IP addresses](https://help.qlik.com/en-US/cloud-services/Subsystems/Hub/Content/Sense_Hub/Introduction/qlik-cloud-dns-ip.htm) |
| Tableau | [Tableau Cloud IP ranges](https://help.tableau.com/current/pro/desktop/en-us/publish_tableau_online_ip_authorization.htm) |
| GitHub Actions | [REST API endpoints for meta data](https://docs.github.com/en/rest/meta) |

### Working with Snowflake-managed network rules

To see the current list of Snowflake-managed network rules, run the [SHOW NETWORK RULES](../sql-reference/sql/show-network-rules.md) command to list the network rules in the SNOWFLAKE.NETWORK_SECURITY schema:

```sqlexample
SHOW NETWORK RULES IN SNOWFLAKE.NETWORK_SECURITY;
```

> **Note:**
>
> The SHOW command doesn’t explicitly expose IP addresses, only the number of IP addresses per rule.

To see your current Snowflake-managed rules, *including* IP addresses, query the [NETWORK_RULES view](../sql-reference/account-usage/network_rules.md) and filter on rows where the database is SNOWFLAKE and the schema is NETWORK_SECURITY:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.NETWORK_RULES
  WHERE DATABASE = 'SNOWFLAKE' AND SCHEMA = 'NETWORK_SECURITY';
```

The following example shows how to create or replace a network policy that references a built-in network rule:

```sqlexample
CREATE OR REPLACE NETWORK POLICY example_network_policy ALLOWED_NETWORK_RULE_LIST = (
  'SNOWFLAKE.NETWORK_SECURITY.DBT_APAC_AWS',
  'SNOWFLAKE.NETWORK_SECURITY.DBT_EMEA_AWS'
);
```

The following example shows how to add a built-in network rule to an existing network policy by using the ALTER NETWORK POLICY syntax:

```sqlexample
ALTER NETWORK POLICY example_network_policy ADD ALLOWED_NETWORK_RULE_LIST = (
  'SNOWFLAKE.NETWORK_SECURITY.DBT_APAC_AWS'
);
```

### Snowflake-managed egress network rules

Snowflake provides the following pre-defined, Snowflake-managed egress network rule:

`SNOWFLAKE.EXTERNAL_ACCESS.PYPI_RULE`

> You can use an EGRESS network rule with an external access integration to provide a connection from Snowflake to Python Package Index (PyPI).
> For example, you might want to use the network rule to allow Notebook users on Container Runtime to install `pip` packages by using
> the `pip install` command.
>
> For examples of how to use this network rule, see
> [Accessing PyPI to install packages in Snowpark Container](../developer-guide/external-network-access/external-network-access-examples.md)
> and [Set up a PyPI EAI for app developers](../developer-guide/streamlit/object-management/security.md).

Only users with the ACCOUNTADMIN role have access to the SNOWFLAKE.EXTERNAL_ACCESS.PYPI_RULE.

---
title: NIST SP 800-171
source: https://docs.snowflake.com/en/user-guide/cert-nist.md
section: User Guide
---

# NIST SP 800-171

This topic describes how Snowflake supports customers with NIST SP 800-171 compliance requirements.

## Understanding NIST SP 800-171 compliance requirements

The National Institute of Standards and Technology (NIST) Special Publication (SP) 800-171 provides recommended
security requirements for protecting the confidentiality of Controlled Unclassified Information (CUI) when the
information is resident in nonfederal systems and organizations.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Notice of planned future deprecation: Snowpipe Streaming classic architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-deprecation.md
section: User Guide
---

# Notice of planned future deprecation: Snowpipe Streaming classic architecture

## Important service update: Advance notice of planned deprecation

**Action required:** Begin reviewing migration resources.

Snowflake is providing advance notice regarding the planned deprecation of the Snowpipe Streaming classic architecture.
This strategic transition is necessary because the successor platform, the
[Snowpipe Streaming high-performance architecture](snowpipe-streaming-high-performance-overview.md),
fundamentally supersedes the classic version by providing greater performance, scalability, and stability.
Moving forward, all innovation and major enhancements are built exclusively on the high-performance architecture.

To ensure that every customer benefits from the most modern and capable platform, Snowflake will eventually retire the classic architecture.

## What this means for you today

**No immediate action is required.**

* Your existing pipelines are safe. The classic architecture isn’t deprecated today, and your current workloads continue to operate normally.
* Support continues uninterrupted. Your current service level agreements (SLAs) and support models remain unchanged.

## Expected timeline and migration window

To ensure you have a well-supported transition, Snowflake is finalizing the official deprecation timeline:

* Snowflake plans to issue a formal deprecation announcement in mid-2026.
* This forthcoming formal announcement will provide you with the full transition timeline, specific milestones,
  detailed migration guides, and the final end-of-life date.
* After the formal deprecation is announced, an 18-month sunset period begins. This period is provided to give your engineering
  teams time to plan, test, and safely migrate your workloads.

## Special note for Kafka Connect users

If you rely on the Snowflake Kafka Connector to stream data, Snowflake is working on a new version (4.0.x)
that natively supports the high-performance architecture:

* The updated Kafka Connector is in Public Preview.
* Snowflake intends to fully upgrade all Snowflake-supported connectors to the high-performance architecture before deprecation.
* The official deprecation timelines will be deliberately adjusted to ensure Kafka Connect users have the full
  transition window after the GA connector is widely available.

## Frequently asked questions (FAQs)

Do I need to change anything right now?
:   No immediate changes are required. This is an advance notice to encourage you to begin planning and conduct early
    validation when you’re ready.

Is there a replacement?
:   Yes. The [Snowpipe Streaming high-performance architecture](snowpipe-streaming-high-performance-overview.md) is
    the recommended path forward.

How should I plan my migration if I use a third-party connector?
:   Third-party connectors are encouraged to begin migrating or planning to migrate as part of this notice.

Where can I find more information?
:   Snowflake strongly recommends that you begin assessing your current pipelines now and prioritize upgrading to the
    high-performance architecture where possible. To get started, review the following resources:

    * [High-performance architecture overview](snowpipe-streaming-high-performance-overview.md)
    * [Migration guide](snowpipe-streaming-high-performance-migration.md)

## Contact us

For general questions regarding this strategic transition, reach out to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Notifications for budgets
source: https://docs.snowflake.com/en/user-guide/budgets/notifications.md
section: User Guide
---

# Notifications for budgets

To receive notifications when your credit usage is expected to exceed your spending limits, you must set up the budget so that
notifications can be sent to the destination of your choice. You can receive notifications through the following means:

* Email.
* Messages pushed to a queue provided by a cloud service (Amazon SNS, Azure Event Grid, or Google Cloud PubSub).
* Calls to a webhook for Slack, Microsoft Teams, or PagerDuty.

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

## Controlling when notifications are sent

By default, notifications begin when the projected spending is more than 10% above the spending limit of the budget.

You can override this default by defining a notification threshold, which is a percentage of the budget’s spending limit. Notifications are
sent when Snowflake predicts that spending will exceed the threshold.

For example, suppose you want notifications sent when projected spending exceeds 50% of the budget’s spending limit. To set this
notification threshold for the account budget, run the following command:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_NOTIFICATION_THRESHOLD(50);
```

You can also set a notification threshold for custom budgets.

If you want to reset the notification threshold to the default, call the
[<budget_name>!SET_NOTIFICATION_THRESHOLD](../../sql-reference/classes/budget/methods/set_notification_threshold.md) method with `110` as the argument.

## Setting up email notification

To set up email notification:

1. (Optional) If you want to use your own notification integration, create a notification integration or choose an existing
   notification integration that you want to use. A notification integration enables Snowflake to send notifications to a
   third-party system.

   1. Create a notification integration with TYPE = EMAIL and ALLOWED_RECIPIENTS set to the list of verified email addresses of
      the recipients. For information, see [Create an email notification integration](../notifications/email-notifications.md) and
      [Restrict the list of email addresses that can receive notifications](../notifications/email-notifications.md).

      > **Note:**
      >
      > Each email address added for budget notifications must be [verified](../notifications/email-notifications.md). The
      > email notification setup fails if any email address in the list is *not* verified.

      For example:

      ```sqlexample
      CREATE NOTIFICATION INTEGRATION budgets_notification_integration
        TYPE = EMAIL
        ENABLED = TRUE
        ALLOWED_RECIPIENTS = ('costadmin@example.com','budgetadmin@example.com');
      ```
   2. Verify that the notification integration works as expected by calling the
      [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure to send a test message.

      For example, you can send a test message in JSON format:

      ```sqlexample
      CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
        SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{"name": "value"}'),
        SNOWFLAKE.NOTIFICATION.INTEGRATION('budgets_notification_integration')
      );
      ```
   3. Grant the USAGE privilege on the notification integration to the SNOWFLAKE application. The USAGE privilege enables the
      budget to use the notification integration to send the notification. For example:

      ```sqlexample
      GRANT USAGE ON INTEGRATION budgets_notification_integration
        TO APPLICATION snowflake;
      ```
2. Specify the email addresses that should receive the notification. If you created or selected a notification integration to use,
   associate the notification integration with the budget.

   To do this, call the [<budget_name>!SET_EMAIL_NOTIFICATIONS](../../sql-reference/classes/budget/methods/set_email_notifications.md) method, and specify the following:

   * If you do not have a notification integration that you want to use, pass in a comma-delimited list of verified email
     addresses. For example, if you are configuring notifications for the account budget:

     ```sqlexample
     CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_EMAIL_NOTIFICATIONS(
       'costadmin@example.com, budgetadmin@example.com'
     );
     ```

     If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
     if you created a custom budget named `my_budget`:

     ```sqlexample
     CALL my_budget!SET_EMAIL_NOTIFICATIONS(
       'costadmin@example.com, budgetadmin@example.com'
     );
     ```
   * If you have a notification integration that you want to use, pass in the name of that integration and a comma-delimited list
     of verified email addresses. For example, if you are configuring notifications for the account budget:

     ```sqlexample
     CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_EMAIL_NOTIFICATIONS(
       'budgets_notification_integration',
       'costadmin@example.com, budgetadmin@example.com'
     );
     ```

     If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
     if you created a custom budget named `my_budget`:

     ```sqlexample
     CALL my_budget!SET_EMAIL_NOTIFICATIONS(
       'budgets_notification_integration',
       'costadmin@example.com, budgetadmin@example.com'
     );
     ```
3. If you associated a notification integration with the budget, you can verify that the budget is associated with your
   notification integration by calling the
   [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](../../sql-reference/classes/budget/methods/get_notification_integration_name.md) method. This method returns the name of the
   email notification integration associated with the budget.

   For example, if you are configuring notifications for the account budget:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_NOTIFICATION_INTEGRATION_NAME();
   ```

   If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
   if you created a custom budget named `my_budget`:

   ```sqlexample
   CALL my_budget!GET_NOTIFICATION_INTEGRATION_NAME();
   ```

## Setting up queue notification

To set up queue notification:

1. Create a notification integration or choose an existing notification integration that you want to use. A notification
   integration enables Snowflake to send notifications to a third-party system.

   Create a notification integration with TYPE=QUEUE, DIRECTION=OUTBOUND, and the additional properties required for the cloud
   provider. For information, see:

   * [Creating a notification integration to send notifications to an Amazon SNS topic](../notifications/creating-notification-integration-amazon-sns.md)
   * [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](../notifications/creating-notification-integration-azure-event-grid.md)
   * [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](../notifications/creating-notification-integration-google-pubsub.md)
   > **Note:**
   >
   > Your account must be on the same [cloud platform](../intro-cloud-platforms.md) as the cloud provider queue.

   For example:

   ```sqlexample
   CREATE OR REPLACE NOTIFICATION INTEGRATION budgets_notification_integration
     ENABLED = TRUE
     TYPE = QUEUE
     DIRECTION = OUTBOUND
     NOTIFICATION_PROVIDER = AWS_SNS
     AWS_SNS_TOPIC_ARN = '<ARN_for_my_SNS_topic>'
     AWS_SNS_ROLE_ARN = '<ARN_for_my_IAM_role>';
   ```

   > **Note:**
   >
   > For queue and webhook notifications, you can associate up to 10 notification integrations with a budget.
2. Verify that the notification integration works as expected by calling the
   [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure to send a test message.

   For example, you can send a test message in JSON format:

   ```sqlexample
   CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
     SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{"name": "value"}'),
     SNOWFLAKE.NOTIFICATION.INTEGRATION('budgets_notification_integration')
   );
   ```
3. Grant the USAGE privilege on the notification integration to the SNOWFLAKE application. The USAGE privilege enables the budget
   to use the notification integration to send the notification. For example:

   ```sqlexample
   GRANT USAGE ON INTEGRATION budgets_notification_integration
     TO APPLICATION snowflake;
   ```
4. Associate the notification integration with the budget. Call the
   [<budget_name>!ADD_NOTIFICATION_INTEGRATION](../../sql-reference/classes/budget/methods/add_notification_integration.md) method, passing in the name of the integration.

   For example, if you are configuring notifications for the account budget:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!ADD_NOTIFICATION_INTEGRATION(
     'budgets_notification_integration',
   );
   ```

   If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
   if you created a custom budget named `my_budget`:

   ```sqlexample
   CALL my_budget!ADD_NOTIFICATION_INTEGRATION(
     'budgets_notification_integration',
   );
   ```
5. Verify that the notification integration is associated with the budget.

   Call the [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](../../sql-reference/classes/budget/methods/get_notification_integrations.md) method to print out the list of
   notification integrations associated with the budget.

   For example, if you are configuring notifications for the account budget:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_NOTIFICATION_INTEGRATIONS();
   ```

   If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
   if you created a custom budget named `my_budget`:

   ```sqlexample
   CALL my_budget!GET_NOTIFICATION_INTEGRATIONS();
   ```

   The method prints out a table that lists the names of the integrations, the times that they were last used to send
   notifications, and the dates when they were added.

   ```output
   +----------------------------------+------------------------+------------+
   |  INTEGRATION_NAME                | LAST_NOTIFICATION_TIME | ADDED_DATE |
   +----------------------------------+------------------------+------------+
   | budgets_notification_integration | -1                     | 2024-09-23 |
   +----------------------------------+------------------------+------------+
   ```

## Setting up webhook notification

To set up webhook notification:

1. Create a notification integration or choose an existing notification integration that you want to use. A notification
   integration enables Snowflake to send notifications to a third-party system.

   Create a notification integration with TYPE=WEBHOOK and the additional properties required for the webhook. For information,
   see [Sending webhook notifications](../notifications/webhook-notifications.md).

   The notification message is in JSON format, so you should configure the notification integration to handle this. For example,
   the following statements create a secret and a notification integration for a Slack webhook:

   ```sqlexample
   CREATE OR REPLACE SECRET my_database.my_schema.slack_secret
     TYPE = GENERIC_STRING
     SECRET_STRING = '... secret in my Slack webhook URL ...';

   CREATE OR REPLACE NOTIFICATION INTEGRATION budgets_notification_integration
     ENABLED = TRUE
     TYPE = WEBHOOK
     WEBHOOK_URL = 'https://hooks.slack.com/services/SNOWFLAKE_WEBHOOK_SECRET'
     WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
     WEBHOOK_HEADERS=('Content-Type'='application/json')
     WEBHOOK_SECRET = slack_secret;
   ```

   > **Note:**
   >
   > For queue and webhook notifications, you can associate up to 10 notification integrations with a budget.
2. Verify that the notification integration works as expected by calling the
   [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure to send a test message.

   For example, you can send a test message in JSON format. Make sure to escape the double quotes in the JSON string and the
   backslashes:

   ```sqlexample
   CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
     SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{\\\"name\\\": \\\"value\\\"}'),
     SNOWFLAKE.NOTIFICATION.INTEGRATION('budgets_notification_integration')
   );
   ```
3. Grant the USAGE privilege on the notification integration to the SNOWFLAKE application. The USAGE privilege enables the budget
   to use the notification integration to send the notification. For example:

   ```sqlexample
   GRANT USAGE ON INTEGRATION budgets_notification_integration
     TO APPLICATION snowflake;
   ```
4. If you are using a webhook notification integration that relies on a secret, grant the following privileges to the
   SNOWFLAKE application.

   * The READ privilege on that secret.
   * The USAGE privilege on the schema containing that secret.
   * The USAGE privilege on the database containing that schema.

   For example:

   ```sqlexample
   GRANT READ ON SECRET slack_secret TO APPLICATION snowflake;
   GRANT USAGE ON SCHEMA my_schema TO APPLICATION snowflake;
   GRANT USAGE ON DATABASE my_database TO APPLICATION snowflake;
   ```
5. Associate the notification integration with the budget.

   Call the [<budget_name>!ADD_NOTIFICATION_INTEGRATION](../../sql-reference/classes/budget/methods/add_notification_integration.md) method, and pass in the name of the
   integration.

   For example, if you are configuring notifications for the account budget:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!ADD_NOTIFICATION_INTEGRATION(
     'budgets_notification_integration',
   );
   ```

   If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
   if you created a custom budget named `my_budget`:

   ```sqlexample
   CALL my_budget!ADD_NOTIFICATION_INTEGRATION(
     'budgets_notification_integration',
   );
   ```
6. Verify that the notification integration is associated with the budget.

   Call the [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](../../sql-reference/classes/budget/methods/get_notification_integrations.md) method, which prints out the list of
   notification integrations associated with the budget.

   For example, if you are configuring notifications for the account budget:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_NOTIFICATION_INTEGRATIONS();
   ```

   If you are configuring notifications for a custom budget, call the method on the object for the custom budget. For example,
   if you created a custom budget named `my_budget`:

   ```sqlexample
   CALL my_budget!GET_NOTIFICATION_INTEGRATIONS();
   ```

   The method prints out a table that lists the names of the integrations, the times that they were last used to send
   notifications, and the dates when they were added.

   ```output
   +----------------------------------+------------------------+------------+
   |  INTEGRATION_NAME                | LAST_NOTIFICATION_TIME | ADDED_DATE |
   +----------------------------------+------------------------+------------+
   | budgets_notification_integration | -1                     | 2024-09-23 |
   +----------------------------------+------------------------+------------+
   ```

## Interpreting the JSON notification message

When you configure a budget to send a notification to a cloud provider queue or a webhook, the notification message contains a
JSON object similar to the following:

```json
{
  "account_name": "MY_ACCOUNT",
  "budget_name": "MY_BUDGET_NAME",
  "type": "BUDGET_LIMIT_WARNING",
  "limit": "100",
  "spending": "67.42",
  "spending_percent": "67.42",
  "spending_trend_percent": "130.63",
  "time_percent":"51.61"
}
```

The JSON object contains the following key-value pairs:

| Key | Description |
| --- | --- |
| `account_name` | Name of your account. |
| `budget_name` | Name of your budget. For the account budget, the name is `ACCOUNT_ROOT_BUDGET`. |
| `type` | The type of the notification (for example, `BUDGET_LIMIT_WARNING`). |
| `limit` | The spending limit that you set for the budget. |
| `spending` | The amount of credit usage for this month. |
| `spending_percent` | The percentage of the spending limit that has already been spent (`spending / limit`). |
| `spending_trend_percent` | Expected percentage of the spending limit to be spent by the end of the month (`spending_percent / time_percent * 100`). |
| `time_percent` | Percentage of time that has passed for the month (for example, `50.00` if the month is half over). |

## Checking the history of notifications for a budget

To view the history of notifications for a budget, call the [NOTIFICATION_HISTORY](../../sql-reference/functions/notification_history.md) function and
filter on the integration name. For example:

```sqlexample
SELECT * FROM TABLE(
  INFORMATION_SCHEMA.NOTIFICATION_HISTORY(
    INTEGRATION_NAME=>'budgets_notification_integration'
  )
);
```

The `message_source` column contains `BUDGET` for rows representing budget notifications.

## Disabling notifications for a budget

To disable notifications for a budget, call the
[SET_NOTIFICATION_MUTE_FLAG](../../sql-reference/classes/budget/methods/set_notification_mute_flag.md) method, and pass in TRUE as
an argument. For example:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_NOTIFICATION_MUTE_FLAG(TRUE);
```

## Removing a notification integration from a budget

To remove a notification integration from a budget, call the
[<budget_name>!REMOVE_NOTIFICATION_INTEGRATION](../../sql-reference/classes/budget/methods/remove_notification_integration.md) method, passing in the name of the integration.
For example:

```sqlexample
CALL my_budget!REMOVE_NOTIFICATION_INTEGRATION(
  'budgets_notification_integration',
);
```

---
title: Notifications in Snowflake
source: https://docs.snowflake.com/en/user-guide/notifications/about-notifications.md
section: User Guide
---

# Notifications in Snowflake

You can configure Snowflake to send notifications to a queue provided by a Cloud service (Amazon SNS, Google Cloud PubSub, or
Azure Event Grid), an email address, or a webhook. For details, see the following sections:

* [Sending notifications to cloud provider queues (Amazon SNS, Google Cloud PubSub, and Azure Event Grid)](queue-notifications.md)
* [Sending email notifications](email-notifications.md)
* [Sending webhook notifications](webhook-notifications.md)

## Viewing the history of notifications

To view the history of notifications, call the Information Schema [NOTIFICATION_HISTORY](../../sql-reference/functions/notification_history.md) table
function.

---
title: Object Dependencies
source: https://docs.snowflake.com/en/user-guide/object-dependencies.md
section: User Guide
---

# Object Dependencies

This topic provides concepts on object dependencies and information related to the Account Usage view OBJECT_DEPENDENCIES.

## What is an object dependency?

An object dependency means that in order to operate on an object, the object that is being operated on must reference metadata for itself
or reference metadata for at least one other object. Snowflake tracks object dependencies in the Account Usage view
[OBJECT_DEPENDENCIES](../sql-reference/account-usage/object_dependencies.md).

Snowflake supports object dependencies in your local Snowflake account and certain dependencies related to data sharing, such as creating a
view in the consumer account from a table that is made available through a provider share. The dependencies for shared objects enable data
officers to ensure greater data integrity, comply with each regulatory standard more fully, and generate more detailed impact analysis.

Snowflake supports the following dependency types that can trigger a dependency: the object `name` value, the object ID value, and
the combination of the object `name` value with the object ID value.

BY_NAME:
:   A `BY_NAME` dependency occurs when the SQL statement specifies the `name` value of the object itself
    (e.g. a [CREATE](../sql-reference/sql/create.md) or [ALTER](../sql-reference/sql/alter.md) command), or when an object calls the
    `name` value of another object (e.g. using a [FROM](../sql-reference/constructs/from.md) clause) to complete a SQL operation.

    For example, consider the following statement:

    > ```sqlexample
    > create view myview as select * from mytable;
    > ```

    The table `name` value `mytable` is metadata for the table. The view named `myview` is dependent on the table named
    `mytable`; the table must exist to create the view.

    Snowflake refers to the view named `myview` as the referencing object and the table `mytable` as the
    referenced object.

BY_ID:
:   A `BY_ID` dependency occurs when an object stores the object ID value of another object. One example of an ID
    dependency is an external stage storing the OBJECT_ID value of a storage integration. Currently, the storage integration object ID value
    is only accessible to Snowflake and is not made visible through any customer-facing SQL operation.

    > ```sqlexample
    > create stage my_ext_stage
    >   url='s3://load/files/'
    >   storage_integration = myint;
    > ```

    Snowflake refers to the external stage named `my_ext_stage` as the referencing object and the storage integration named
    `myint` as the referenced object.

BY_NAME_AND_ID:
:   Some Snowflake objects (e.g. materialized views) are dependent on both the object `name` value and the object ID value. These
    objects are often the result of a CREATE OR REPLACE statement to replace an existing object or an ALTER statement to rename an object.

    For more information, see the [Usage notes](../sql-reference/account-usage/object_dependencies.md) section of the Account Usage OBJECT_DEPENDENCIES view.

### Supported object dependencies

Snowflake supports referencing objects and referenced objects as follows:

| Referencing Object | Referenced Object | Dependency Type |
| --- | --- | --- |
| View, Secure View, dynamic table, SQL UDF, SQL UDTF, and other objects referenced by name | View  Secure View  Materialized View  Dynamic table  UDF (all kinds)  UDTF  and other objects referenced by name | BY_NAME |
| External Stage  Stream | Storage Integration  Table, View, Secure View | BY_ID |
| External table | Stage | BY_ID |
| Materialized View | Table, External Table | BY_NAME_AND_ID |

Note that Snowflake supports only the following objects in the context of data sharing:

| Referencing object | Referenced object | Dependency type |
| --- | --- | --- |
| View, dynamic table, SQL UDF, SQL UDTF | Table  Secure view  Secure materialized view  Dynamic table  Secure UDF and secure UDTF | BY_NAME |
| Materialized view | Table | BY_NAME_AND_ID |

For more information, see the [Usage Notes](../sql-reference/account-usage/object_dependencies.md) section of the OBJECT_DEPENDENCIES view.

### Benefits

Identifying object dependencies can provide insight into data tracking use cases as follows:

Impact analysis:
:   Knowing the object dependency allows data stewards to identify the relationships between referencing objects and referenced objects to
    ensure that updates to referenced objects do not adversely impact users of the referencing object.

    For example, a table owner plans to add a column to a table. Querying the OBJECT_DEPENDENCIES view based on the table name returns all
    of the objects (e.g. views) that will be affected.

    The data steward can then coordinate a plan of action to ensure that the timing of table and view updates do not result in any broken
    queries that would adversely affect users querying the views created from the table.

Compliance:
:   The object dependency relationship helps the compliance officer identify the relationship between sensitive data sources
    (i.e. referenced object) and data targets (i.e. referencing object). The compliance officer can then decide how best to update the
    referenced object and referencing object based on the compliance requirements (e.g. GDPR).

Data integrity:
:   The object dependency relationship helps primary data professionals, such as analysts, scientists, compliance officers, and other
    business users, to have confidence that the data originates from a trustworthy source.

### Limitations

In addition to the view [usage notes](../sql-reference/account-usage/object_dependencies.md), note the following limitations when querying
the OBJECT_DEPENDENCIES view:

Session parameters:
:   Snowflake cannot accurately compute the dependencies of objects that include [session parameters](../sql-reference/parameters.md) in
    their definitions because session parameters can take on different values depending on the context.

    Snowflake recommends not using session variables in view and function definitions.

Snowflake implementations:
:   This view does not capture dependencies that are necessary for Snowflake implementations. For example, the view does not record the
    dependency necessary to create a new table from the clone of another table.

Object resolution:
:   If a view definition uses a function to call an object to create the view, or if an object is called inside another function or view,
    Snowflake does not record an object dependency. For example:

    > ```sqlexample
    > create or replace view v_on_stage_function
    > as
    > select *
    > from T1
    > where get_presigned_url(@stage1, 'data_0.csv.gz')
    > is not null;
    > ```

    In this example, the function `get_presigned_url` calls the stage `stage1`. Snowflake does not record that the view named
    `v_on_stage_function` depends on the stage named `stage1`.

Broken dependencies:
:   If the dependency type value is `BY_NAME_AND_ID` and an object dependency changes due to a CREATE OR REPLACE or ALTER operation on an
    object, Snowflake only records the object dependency prior to these operations.

    Snowflake does not record the object dependency in the view query result after these operations because the result is a broken reference.

### Object dependencies with snowflake features and services

External objects:
:   Snowflake tracks object dependencies for Snowflake objects only. For example, if a Snowflake object depends on an
    Amazon S3 bucket, this view does not record the dependency on the bucket because the bucket is an Amazon object, not a Snowflake object.

Replication:
:   While a secondary object depends on the primary object, this view does not record dependencies due to a replication operation.

Data sharing:
:   For provider accounts, this view does not allow a data sharing provider account to determine dependent objects in the data sharing
    consumer account. For example, a data sharing provider creates a view and shares the view. The data sharing provider cannot use this view
    to determine any object in the consumer account that was created from the shared view (e.g. new tables or views).

    For consumer accounts, this view does not allow a data sharing consumer account to determine dependent objects in the data sharing
    provider account. For example, if a data sharing consumer account uses a UDF made available by the data sharing provider account, the
    data sharing consumer cannot use this view to identify any objects the shared UDF depends on.

    For more information, refer to the [Usage notes](../sql-reference/account-usage/object_dependencies.md).

## Querying the OBJECT_DEPENDENCIES view

The following examples cover these use cases:

1. Show objects depending on an external table.
2. Impact analysis: find the objects referenced by a table.
3. GDPR: find the data source for a given view.
4. Data sharing.

### Show objects depending on an external table

Create a materialized view named `sales_view` from the external table named `sales_staging_table`:

> ```sqlexample
> CREATE OR REPLACE MATERIALIZED VIEW sales_view AS SELECT * FROM sales_staging_table;
> ```

Query the OBJECT_DEPENDENCIES view in the Account Usage schema of the shared SNOWFLAKE database. Note that the materialized view is the
`referencing_object_name` and the external table is the `referenced_object_domain`:

> ```sqlexample
> SELECT referencing_object_name, referencing_object_domain, referenced_object_name, referenced_object_domain
> FROM snowflake.account_usage.object_dependencies
> WHERE referenced_object_name = 'SALES_STAGING_TABLE' and referenced_object_domain = 'EXTERNAL TABLE';
> ```
>
> ```output
> +-------------------------+---------------------------+------------------------+--------------------------+
> | REFERENCING_OBJECT_NAME | REFERENCING_OBJECT_DOMAIN | REFERENCED_OBJECT_NAME | REFERENCED_OBJECT_DOMAIN |
> +-------------------------+---------------------------+------------------------+--------------------------+
> | SALES_VIEW              | MATERIALIZED VIEW         | SALES_STAGING_TABLE    | EXTERNAL TABLE           |
> +-------------------------+---------------------------+------------------------+--------------------------+
> ```

### Impact analysis: Find the Objects referenced by a table

Consider a base table named `SALES_NA`, where `NA` indicates North America, `US` indicates United States, and `CAL` indicates
California, with a series of nested views:

* (table) `SALES_NA` » (view) `NORTH_AMERICA_SALES` » (view) `US_SALES`
* (table) `SALES_NA` » (view) `NORTH_AMERICA_SALES` » (view) `CAL_SALES`

To create the table and nested views, execute the following commands:

> ```sqlexample
> CREATE TABLE sales_na(product string);
> CREATE OR REPLACE VIEW north_america_sales AS SELECT * FROM sales_na;
> CREATE VIEW us_sales AS SELECT * FROM north_america_sales;
> CREATE VIEW cal_sales AS SELECT * FROM north_america_sales;
> ```

Similarly, consider the relationship of the base table `SALES_NA` to its nested views, and consider the base table `SALES_UK`, where
`UK` indicates the United Kingdom, to its nested view.

Note that two different views serve as source objects to derive the view named `GLOBAL_SALES`:

* (table) `SALES_NA` » (view) `NORTH_AMERICA_SALES` » (view) `GLOBAL_SALES`
* (table) `SALES_UK` » (view) `GLOBAL_SALES`

To create these nested views, execute the following commands:

> ```sqlexample
> CREATE TABLE sales_uk (product string);
> CREATE VIEW global_sales AS SELECT * FROM sales_uk UNION ALL SELECT * FROM north_america_sales;
> ```

Query the OBJECT_DEPENDENCIES view in the Account Usage schema of the shared SNOWFLAKE database to determine the object references for the
table `SALES_NA`. Note the fourth row in the query result, which specifies the table `SALES_NA` but does not reference the table
`SALES_UK`:

> ```sqlexample
> WITH RECURSIVE referenced_cte
> (object_name_path, referenced_object_name, referenced_object_domain, referencing_object_domain, referencing_object_name, referenced_object_id, referencing_object_id)
>     AS
>       (
>         SELECT referenced_object_name || '-->' || referencing_object_name as object_name_path,
>                referenced_object_name, referenced_object_domain, referencing_object_domain, referencing_object_name, referenced_object_id, referencing_object_id
>           FROM snowflake.account_usage.object_dependencies referencing
>           WHERE true
>             AND referenced_object_name = 'SALES_NA' AND referenced_object_domain='TABLE'
>
>         UNION ALL
>
>         SELECT object_name_path || '-->' || referencing.referencing_object_name,
>               referencing.referenced_object_name, referencing.referenced_object_domain, referencing.referencing_object_domain, referencing.referencing_object_name,
>               referencing.referenced_object_id, referencing.referencing_object_id
>           FROM snowflake.account_usage.object_dependencies referencing JOIN referenced_cte
>             ON referencing.referenced_object_id = referenced_cte.referencing_object_id
>             AND referencing.referenced_object_domain = referenced_cte.referencing_object_domain
>       )
>
>   SELECT object_name_path, referenced_object_name, referenced_object_domain, referencing_object_name, referencing_object_domain
>     FROM referenced_cte
> ;
> ```
>
> ```output
> +-----------------------------------------------+------------------------+--------------------------+-------------------------+---------------------------+
> | OBJECT_NAME_PATH                              | REFERENCED_OBJECT_NAME | REFERENCED_OBJECT_DOMAIN | REFERENCING_OBJECT_NAME | REFERENCING_OBJECT_DOMAIN |
> +-----------------------------------------------+------------------------+--------------------------+-------------------------+---------------------------+
> | SALES_NA-->NORTH_AMERICA_SALES                | SALES_NA               | TABLE                    | NORTH_AMERICA_SALES     | VIEW                      |
> | SALES_NA-->NORTH_AMERICA_SALES-->CAL_SALES    | NORTH_AMERICA_SALES    | VIEW                     | CAL_SALES               | VIEW                      |
> | SALES_NA-->NORTH_AMERICA_SALES-->US_SALES     | NORTH_AMERICA_SALES    | VIEW                     | US_SALES                | VIEW                      |
> | SALES_NA-->NORTH_AMERICA_SALES-->GLOBAL_SALES | NORTH_AMERICA_SALES    | VIEW                     | GLOBAL_SALES            | VIEW                      |
> +-----------------------------------------------+------------------------+--------------------------+-------------------------+---------------------------+
> ```

### GDPR: Find the data source for a given view

Derived objects (e.g. views, CTAS) can be created from many different source objects to provide a custom view or dashboard. To meet
regulatory requirements such as GDPR, compliance officers and auditors need to be able to trace data from a given object to its original
data source.

For example, the view `GLOBAL_SALES` is derived from two different dependency paths that point to two different base tables:

* (table) `SALES_NA` » (view) `NORTH_AMERICA_SALES` » (view) `GLOBAL_SALES`
* (table) `SALES_UK` » (view) `GLOBAL_SALES`

To create these nested views, execute the following commands:

> ```sqlexample
> CREATE TABLE sales_na (product string);
> CREATE OR REPLACE VIEW north_america_sales AS SELECT * FROM sales_na;
> CREATE TABLE sales_uk (product string);
> CREATE VIEW global_sales AS SELECT * FROM sales_uk UNION ALL SELECT * FROM north_america_sales;
> ```

Query the OBJECT_DEPENDENCIES view in the Account Usage schema of the shared SNOWFLAKE database to find the data source(s) of the view
`GLOBAL_SALES`. Each row in the query result specifies a dependency path to a unique object.

> ```sqlexample
> WITH RECURSIVE referenced_cte
> (object_name_path, referenced_object_name, referenced_object_domain, referencing_object_domain, referencing_object_name, referenced_object_id, referencing_object_id)
>     AS
>       (
>         SELECT referenced_object_name || '<--' || referencing_object_name AS object_name_path,
>                referenced_object_name, referenced_object_domain, referencing_object_domain, referencing_object_name, referenced_object_id, referencing_object_id
>           from snowflake.account_usage.object_dependencies referencing
>           WHERE true
>             AND referencing_object_name = 'GLOBAL_SALES' and referencing_object_domain='VIEW'
>
>         UNION ALL
>
>         SELECT referencing.referenced_object_name || '<--' || object_name_path,
>               referencing.referenced_object_name, referencing.referenced_object_domain, referencing.referencing_object_domain, referencing.referencing_object_name,
>               referencing.referenced_object_id, referencing.referencing_object_id
>           FROM snowflake.account_usage.object_dependencies referencing JOIN referenced_cte
>             ON referencing.referencing_object_id = referenced_cte.referenced_object_id
>             AND referencing.referencing_object_domain = referenced_cte.referenced_object_domain
>       )
>
>   SELECT object_name_path, referencing_object_name, referencing_object_domain, referenced_object_name, referenced_object_domain
>     FROM referenced_cte
> ;
> ```
>
> ```output
> +-----------------------------------------------+-------------------------+---------------------------+------------------------+--------------------------+
> | OBJECT_NAME_PATH                              | REFERENCING_OBJECT_NAME | REFERENCING_OBJECT_DOMAIN | REFERENCED_OBJECT_NAME | REFERENCED_OBJECT_DOMAIN |
> +-----------------------------------------------+-------------------------+---------------------------+------------------------+--------------------------+
> | SALES_UK<--GLOBAL_SALES                       | GLOBAL_SALES            | VIEW                      | SALES_UK               | TABLE                    |
> | NORTH_AMERICA_SALES<--GLOBAL_SALES            | GLOBAL_SALES            | VIEW                      | NORTH_AMERICA_SALES    | VIEW                     |
> | SALES_NA<--NORTH_AMERICA_SALES<--GLOBAL_SALES | NORTH_AMERICA_SALES     | VIEW                      | SALES_NA               | TABLE                    |
> +-----------------------------------------------+-------------------------+---------------------------+------------------------+--------------------------+
> ```

### Data sharing

Consider the following table, which is an excerpt from the OBJECT_DEPENDENCIES view in the consumer account, where:

* `V1` specifies a view that the consumer creates from a shared object.
* `S_V1` specifies a view that the provider shares.
* `S_T1` specifies a table that the provider shares.

| Row | REFERENCING_OBJECT_NAME | REFERENCED_OBJECT_NAME | REFERENCED_OBJECT_DOMAIN | REFERENCED_OBJECT_ID |
| --- | --- | --- | --- | --- |
| 1 | V1 | S_V1 | TABLE | NULL |
| 2 | V1 | S_T1 | TABLE | NULL |

Given this table, note the following:

* If the provider [revokes](../sql-reference/sql/revoke-privilege-share.md) `S_T1` from the share, the consumer continues to see rows
  that specify `S_T1` (row 2) in their local view as long as `S_T1` was not renamed prior to the revocation.
* If the provider drops a table or view in their account, the table or view is no longer included in the share. The local consumer
  view preserves existing records for the dropped table or view because the table or view was shared prior to the drop operation
  in the provider account.

  The consumer cannot observe view changes in the provider account.

---
title: OCSP Configuration
source: https://docs.snowflake.com/en/user-guide/ocsp.md
section: User Guide
---

# OCSP Configuration

This topic provides an overview of OCSP, its use in Snowflake, and information to help diagnose OCSP issues.

## Overview

Snowflake uses Online Certificate Status Protocol (OCSP) to provide maximum security to determine whether a certificate is revoked when Snowflake clients attempt to connect to an endpoint through HTTPS.

Snowflake uses OCSP to evaluate each certificate in the chain of trust up to the intermediate certificate the root certificate authority (CA) issues. Ensuring that each certificate is not revoked helps Snowflake to establish secure connections with trusted actors during the identity verification process.

Depending on your client or driver version and the configuration described on this page, it is possible to turn off OCSP and to adjust the action that occurs when OCSP determines a certificate is revoked.

## Fail-Open or Fail-Close behavior

Currently, users can choose between either of two behaviors in terms of how Snowflake clients or drivers respond during an OCSP event.

1. Fail-open
2. Fail-close

### Fail-Open

Snowflake supports a fail-open approach by default in terms of evaluating the OCSP CA response. The fail-open approach has the following characteristics:

> * A response indicating a revoked certificate results in a failed connection.
> * A response with any other certificate errors or statuses allows the connection to occur, but denotes the message in the logs at the `WARNING` level with the relevant details in JSON format.

Users can monitor the logs for the specific driver or connector to determine the frequency of fail-open log events.

These event logs can be combined with the [Snowflake Status Page](https://status.snowflake.com) to determine the best course of action, such as temporarily restricting client access or pivoting to fail-close behavior.

Currently, the fail-open default approach applies to the following client and driver versions.

| Client / Driver | Version |
| --- | --- |
| SnowSQL | v1.1.79 or later |
| Python Connector | v1.8.0 or later |
| JDBC Driver | v3.8.0 or later |
| ODBC Driver | v2.19.0 or later |
| SQL Alchemy | Upgrade Python Connector to v1.8.0 or later |
| Spark | v2.4.14 or later if using Maven or SBT to build the Spark application. . JDBC v3.8.0 or later if attaching JAR files to Spark cluster. . Request Databricks to upgrade their Spark connector if using the Databricks built-in Spark connector. |
| Go Driver | v1.2.0 or later |
| Node.js | v1.2.0 or later |

> **Note:**
>
> Snowflake does not support OCSP checking for the .NET driver. Instead, .NET uses its own framework to check the validity of the HTTPS certificate.

### Fail-Close

The fail-close behavior is more restrictive to interpreting the OCSP CA response. If the client or driver does not receive a valid OCSP CA response *for any reason*, the connection fails.

Since this behavior is not default based on the versions listed in the fail-open section, fail-close must be configured manually within each driver or connector.

To preserve the fail-close behavior, set the corresponding `ocsp_fail_open` parameter to `false`.

| Client / Driver | Setting |
| --- | --- |
| SnowSQL | `snowsql -o ocsp_fail_open=false` |
| Python Connector | For details, see [Choosing fail-open or fail-close mode](../developer-guide/python-connector/python-connector-connect.md) in the Python Connector documentation. |
| JDBC Driver | For details, see [Choosing fail-open or fail-close mode](../developer-guide/jdbc/jdbc-configure.md) in the JDBC Driver documentation. |
| ODBC Driver | Choose one of the following: . Set the connection parameter to `OCSP_FAIL_OPEN=false` . Use the environment variable $SIMBAINI to locate the corresponding file. Then set `OCSPFailOpen=false` |
| SQL Alchemy | See JDBC Driver settings |
| Spark | The Spark Connector does not have an `ocsp_fail_open` parameter. . Fail-close can only be preserved with Spark if using the JDBC driver. |
| Go Driver | Do either of the following: . - Set the connection parameter `OCSPFailOpen` in Config to `ocspFailOpenTrue` or `ocspFailOpenFalse`, for example: . `import ( ... sf "github.com/snowflakedb/gosnowflake ... ")` . `config: &Config{ Account: "xy12345", ...,  OCSPFailOpen: sf.ocspFailOpenFalse, ... }` . - Set the `ocspFailOpen` connection parameter in the connect string to `true` or `false`, for example, . `user:pass@account/db/s?ocspFailOpen=false`. . Note the differences in case (uppercase / lowercase). . For more information on Go connection parameters, see the GoDoc [gosnowflake documentation](https://godoc.org/github.com/snowflakedb/gosnowflake). |
| Node.js | Set the global parameter `ocspFailOpen=false`. For details, see [Node.js options reference](../developer-guide/node-js/nodejs-driver-options.md). |

### Legacy client and driver versions

If your client or driver version is older than that listed in the fail-open section, the fail-open behavior is not an option. Therefore, the fail-close behavior is default.

Snowflake deployments using legacy client and driver versions with respect to OCSP have three options:

1. Upgrade their client or driver to its latest version (best option).
2. Continue using the fail-close behavior.
3. Turn off OCSP monitoring as described in this [Knowledge Base article](https://community.snowflake.com/s/article/How-to-turn-off-OCSP-checking-in-Snowflake-client-drivers) (in the Snowflake Community).

### Best practices

To mitigate risk, Snowflake recommends the following best practices to keep communications secure.

1. Use private connectivity to the Snowflake service and block public access to Snowflake.
2. Allow client drivers to run on managed desktops and servers only.
3. Send client driver logs to a management system or upload to Snowflake. Monitor the connections made without OCSP checking.

> **Note:**
>
> Support for private connectivity to the Snowflake service requires [Business Critical](intro-editions.md) (or higher).
> To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## CA site and OCSP responder hosts used by Snowflake

You can call the [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) or [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md) function
in your Snowflake account to get the hosts Snowflake uses for OCSP verification checks. The host values are unique to the cloud platform
and region where your Snowflake account exists. The reasons for the different host values are based on the CA that the cloud platform uses
and when the certificates are updated or renewed.

For example:

```sqlexample
SELECT t.VALUE:type::VARCHAR as type,
  t.VALUE:host::VARCHAR as host,
  t.VALUE:port as port
FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$ALLOWLIST_PRIVATELINK()))) AS t
WHERE type ILIKE ANY ('OCSP%');
```

```output
+-----------------------+---------------------------------------------------------------+------+
| TYPE                  | HOST                                                          | PORT |
|-----------------------+---------------------------------------------------------------+------|
| OCSP_CACHE            | ocsp.account1234.us-west-2.privatelink.snowflakecomputing.com | 80   |
| OCSP_CACHE_REGIONLESS | ocsp.my_org-my_account.privatelink.snowflakecomputing.com     | 80   |
+-----------------------+---------------------------------------------------------------+------+
```

## OCSP certification checks require Port 80

All communication with Snowflake happens using port 443. However, OCSP certification checks are transmitted over port 80. If your workstation is behind a firewall, make sure that
the network administrator for your organization has opened the firewall to traffic on ports 443 and 80.

---
title: Offer manifest reference
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/offer-manifest-reference.md
section: User Guide
---

# Offer manifest reference

Creating Snowflake offers programmatically requires a manifest, written in YAML (<https://yaml.org/spec/>). Use the information provided here to learn about the parameters available in the offer manifest.

## Offer manifest

```yaml
#
# Offer manifest
#
access_start_date_preference: <preferred_offer_start_date>
comment: <offer_comments>
contract_type: <contract_type>
contract_duration_months: <the_contract_duration_in_months>
invoice_end_time: <invoice_end_date_and_time>
invoice_start_date_preference: <preferred_invoice_start_date>
invoice_start_time: <invoice_start_time>
is_default: <is_a_default_offer_included_with_the_pricing_plane>
offer_display_name: <offer_display_name>
offer_expiration_time: <offer_expiration_time>
payment_terms:
  payment_type: <pricing_plan_payment_method>
  installment_schedule: <pricing_plan_installment_schedule>
  allowed_payment_methods: <allowed_payment_methods>
pricing_plan_name: <the_pricing_plan_name>
access_end_time: <listing_access_end_time>
access_start_time: <listing_access_start_time>
discount: <the_offer_discount>
target_consumer: <offer_target_consumer>
terms_of_service:
  type: <terms_of_service_type>
  custom_link: <link_to_custom_terms_of_service>
additional_information: <additional_offer_information>
```

## Offer parameters

The parameters within the offer manifest allow you to create offers that meet your specific business requirements. Required and optional parameters are identified.

access_start_date_preference
:   Required. String. The preferred offer start date. Accepted values are SPECIFIC_DATE or OFFER_ACCEPTED_DATE.

comment
:   Optional. String. Comments about the offer that are only visible to providers.

contract_type
:   Required. String. The contract type. Accepted values are SUBSCRIPTION or LIMITED_TIME.

contract_duration_months
:   Required. Long. The contract duration in months.

invoice_end_time
:   Required. Long. Invoice end date and time in milliseconds since Unix epoch.

invoice_start_date_preference
:   Required. String. The preferred invoice start date. Accepted values include the following:

    * `OFFER_ACCEPTED_DATE`: Use with flat-fee plans.
    * `SPECIFIC_DATE`: Use with flat-fee plans or (less commonly) with usage-based plans.
    * `FIRST_DAY_NEXT_MONTH`: Use with flat fee plans or with new usage-based plans.
    * `TWO_DAYS_AFTER_OFFER_ACCEPTED_DATE`: Use when allowing a consumer to accept a new usage-based plan that replaces an existing
      usage-based plan. In this case, it can take up to 2 days for the new usage-based pricing plan to take effect.

invoice_start_time
:   Required. Long. The time the invoice was created.

is_default
:   Required. Boolean. When TRUE, specifies that a default offer is included with the pricing plan. The default is FALSE.

offer_display_name
:   Optional. String. The offer name visible to consumers.

offer_expiration_time
:   Optional. Long. The offer expiration time.

payment_terms
:   Required. Provides additional pricing plan parameters. You can specify the following parameters.

    `payment_type``installment_schedule``allowed_payment_methods`

    String. The pricing plan payment types. Accepted values are INVOICE and CREDIT_CARD.

    String. The pricing plan installment schedule.

    List. The allowed pricing plan payment methods. Accepted values are INVOICE and CREDIT_CARD.

pricing_plan_name
:   Required. String. The pricing plan name.

access_end_time
:   Required. Long. The time the consumer loses access to a trial listing.

access_start_time
:   Required. Long. The time a consumer can access a listing.

discount
:   Optional. Double. The offer discount.

target_consumer
:   Optional. String. The target consumer for the offer. The format is `organization_name.account_name`.

terms_of_service
:   Required. Provides additional pricing plan terms of service. You can specify the following parameters.

    `type``custom_link`

    String. The terms of service type. Accepted values are CUSTOM, DEFAULT, and OFFLINE.

    String. A link to custom terms of service.

additional_information
:   Optional. Additional offer information.

## Examples

The following example defines a limited-time offer that’s tied to a PRICING_PLAN_V2 pricing plan.

```yaml
version: V2
contract_type: LIMITED_TIME
contract_duration_months: 12
display_name: OFFER_V2
is_default: true
payment_terms:
  payment_type: FULL
state: PUBLISHED
sales_motion: SELF_SERVE
pricing_plan_details:
  type: DEFAULT
  name: PRICING_PLAN_V2
metadata:
  description: sample-description
  price: 100
  button_text: button-text
  value_propositions:
    - val 1
    - val 2
```

---
title: Okta SCIM integration with Snowflake
source: https://docs.snowflake.com/en/user-guide/scim-okta.md
section: User Guide
---

# Okta SCIM integration with Snowflake

This guide provides the steps required to configure Provisioning in Okta for Snowflake, and includes the following sections:

## Features

User and Role Administration is supported for the Snowflake application.

This enables Okta to:

* Manage the user lifecycle (i.e. create, update, and delete) in Snowflake.
* Manage the role lifecycle (i.e. create, update, and delete) in Snowflake.
* Manage user to role assignments in Snowflake.

The following provisioning features are supported:

Push New Users:
:   New users created through OKTA are also created in Snowflake. You can use the `allowedInterfaces` custom attribute to prevent a provisioned user from using certain interfaces to access Snowflake.

Push Profile Updates:
:   Updates made to the user’s profile through OKTA will be pushed to Snowflake.

Push User Deactivation:
:   Deactivating the user or disabling the user’s access to Snowflake through OKTA will deactivate the user in Snowflake.

    > **Note:**
    >
    > For Snowflake, deactivating a user means setting the `DISABLED` property for the user to `TRUE`.

Reactivate Users:
:   User accounts can be reactivated Snowflake.

Sync Password:
:   User password can be pushed from Okta into Snowflake, if required.

    > **Tip:**
    >
    > The default setting is to create a random password for users giving the user an attribute setting of `has_Password=true`. Without a password, users must access Snowflake through Okta SSO. To prevent a password being generated for users, turn this setting off before provisioning users as follows:
    >
    > 1. Click Edit.
    > 2. Under Sync Password, uncheck the setting Generate a new random password whenever the user’s Okta password changes.
    > 3. Save the change.
    >
    > Enabling this setting in Okta creates a password for the user to access Snowflake. This could result in a pathway for users to access Snowflake without SSO.
    >
    > To disable password synchronization, unset this option in Okta and update the Snowflake Okta SCIM
    > [security integration](../sql-reference/sql/alter-security-integration-scim.md) to set the `SYNC_PASSWORD` property to
    > `False`.

Push Groups:
:   The Push Groups feature creates roles in Snowflake and facilitates role management. The roles created in Snowflake using Okta Push Groups have the same names in Okta and Snowflake. Always create roles in Okta first and use Push Groups to update Snowflake to ensure Okta and Snowflake can synchronize. Okta and the OKTA_PROVISIONER custom role in Snowflake cannot manage manually created roles in Snowflake. Push Groups do not create users in Snowflake.

    > **Tip:**
    >
    > Okta can create users in Snowflake if the Snowflake application in Okta is assigned to a user in Okta.
    >
    > For more information, see [Assign an application to a user](https://help.okta.com/en/prod/Content/Topics/Provisioning/lcm/lcm-assign-app-user.htm).

### Known issues

* Okta does not support URLs that contain underscores. If the name of the Snowflake account contains an underscore, then you need to use
  a special account URL that replaces the underscore with a hyphen. For example, if you are using the account name URL format, the special
  URL might be `https://myorg-account-name.snowflakecomputing.com`.
* Existing Snowflake roles cannot be brought under Okta’s management through transfer of ownership. Only new roles can be created through Okta.
* Existing Snowflake users can be brought under Okta’s management through a transfer of ownership. For more information, see Troubleshooting (in this topic).

### Limitations

* Snowflake supports a maximum of 500 concurrent requests per account per SCIM endpoint (e.g. the `/Users` endpoint, the `/Groups` endpoint). After your account exceeds this threshold, Snowflake returns a `429` HTTP status code (i.e. too many requests). Note that this request limit usually only occurs during the initial provisioning when relatively large numbers of requests (i.e. more than 10 thousand) occur to provision users or groups.

### Not supported

* Okta’s [Enhanced Group Push](https://help.okta.com/en/prod/Content/Topics/users-groups-profiles/usgp-enable-group-push.htm) and Push Now features.

  > **Note:**
  >
  > The `defaultRole`, `defaultSecondaryRoles`, and `defaultWarehouse` attributes are unmapped as they are optional. To map these attributes in Okta, use profiles, expressions, or set a default value for all users. For more information, see [Manage profiles (in Okta)](https://help.okta.com/en/prod/Content/Topics/users-groups-profiles/usgp-user-profiles-main.htm).
* If you are using private connectivity to the Snowflake service to access Snowflake, ensure that you are not entering these URLs in the
  integration settings. Enter the public endpoint (i.e. without `.privatelink`), and ensure that the network policy allows access
  from the Okta IP address listed [here](https://help.okta.com/en-us/Content/Topics/Security/ip-address-allow-listing.htm), otherwise you
  cannot use this integration.
* Okta does not currently support importing Active Directory [nested groups](https://help.okta.com/en/prod/Content/Topics/Directory/ad-agent-import-groups.htm). Therefore, if your Okta integration uses nested groups in AD, you cannot use the Snowflake Okta SCIM integration to provision or manage nested groups in Snowflake. Please contact Okta and Microsoft to request the support of nested groups.

## Prerequisites

1. Before provisioning users or groups, ensure that the [network policy](network-policies.md) in Snowflake allows access
   from Okta’s IP addresses documented [here](https://help.okta.com/en-us/Content/Topics/Security/ip-address-allow-listing.htm). For more
   information, see Managing SCIM Network Policies.
2. Before you configure provisioning for Snowflake, make sure you have configured the General Settings and any Sign-On Options for the Snowflake application in Okta.

Once the above steps are complete, click Next in Okta to take you back to the Provisioning tab.

## Configuration steps

The configuration process requires completing steps in Snowflake and in Okta.

### Snowflake configuration

The Snowflake configuration process creates a SCIM security integration to allow users and roles created in Okta to be owned by the OKTA_PROVISIONER SCIM role in Snowflake and creates an access token to use in SCIM API requests. The access token is valid for six months. Upon expiration, create a new access token manually using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md) as shown below.

> **Note:**
>
> To invalidate an existing access token for a SCIM integration, execute a [DROP INTEGRATION](../sql-reference/sql/drop-integration.md) statement.
>
> To continue using SCIM with Snowflake, recreate the SCIM integration with a [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md) statement and generate a new access token using [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md).

Execute the following SQL statements in your preferred Snowflake client. Each of the SQL statements is explained below.

```sqlexample
use role accountadmin;
create role if not exists okta_provisioner;
grant create user on account to role okta_provisioner;
grant create role on account to role okta_provisioner;
grant role okta_provisioner to role accountadmin;
create or replace security integration okta_provisioning
    type = scim
    scim_client = 'okta'
    run_as_role = 'OKTA_PROVISIONER';
select system$generate_scim_access_token('OKTA_PROVISIONING');
```

> **Important:**
>
> The example SQL statements use the ACCOUNTADMIN system role and the OKTA_PROVISIONER custom role is granted to the ACCOUNTADMIN role.
>
> It is possible not to use the ACCOUNTADMIN role in favor of a less-privileged role. Using a less-privileged role can help to address compliance concerns relating to least-privileged access, however, using a less-privileged role can result in unexpected errors during the SCIM configuration and management process.
>
> These errors could be the result of the less-privileged role not having sufficient rights to manage all of the roles through SCIM due to how the roles are created and the resultant role hierarchy. Therefore, in an effort to avoid errors in the configuration and management processes, choose one of the following options:
>
> 1. Use the ACCOUNTADMIN role as shown in the example SQL statements.
> 2. Use a role with the global MANAGE GRANTS privilege.
> 3. If neither of these first two options are desirable, use a custom role that has the OWNERSHIP privilege on all of the roles that will be managed using SCIM.

1. Use the ACCOUNTADMIN role.

   > ```sqlexample
   > use role accountadmin;
   > ```
2. Create the custom role OKTA_PROVISIONER. All users and roles in Snowflake created by Okta will be owned by the scoped down OKTA_PROVISIONER role.

   > ```sqlexample
   > create role if not exists okta_provisioner;
   > grant create user on account to role okta_provisioner;
   > grant create role on account to role okta_provisioner;
   > ```
3. Let the ACCOUNTADMIN role create the security integration using the OKTA_PROVISIONER custom role. For more information, see [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-scim.md).

   > ```sqlexample
   > grant role okta_provisioner to role accountadmin;
   > create or replace security integration okta_provisioning
   >     type = scim
   >     scim_client = 'okta'
   >     run_as_role = 'OKTA_PROVISIONER';
   > ```
4. Create and copy the authorization token to the clipboard and store securely for later use. Use this token for each SCIM REST API request and place it in the request header. The access token expires after six months and a new access token can be generated with this statement.

   > ```sqlexample
   > select system$generate_scim_access_token('OKTA_PROVISIONING');
   > ```
   >
   > > **Important:**
   > >
   > > All users and roles in Snowflake created by Okta will be owned by the scoped down `okta_provisioner` role.
   > >
   > > If you want to manage existing Snowflake users through Okta, complete the following steps:
   > >
   > > 1. Transfer ownership of existing users to the okta_provisioner role.
   > >
   > >    > ```sqlexample
   > >    > use role accountadmin;
   > >    > grant ownership on user <user_name> to role okta_provisioner;
   > >    > ```
   > > 2. Ensure `login_name` property is set for existing users which should already be set if these existing Snowflake users are using Okta SSO.
   > > 3. Be advised that the name for existing users brought under Okta’s management will be updated to match with Okta’s username. Inform your users about this change as they may be using the name to connect to Snowflake from other integration (i.e. Tableau).

### Okta configuration

This section discusses how to create and configure a Snowflake application in Okta.

> **Note:**
>
> When creating the Snowflake application in Okta, the SubDomain field for the application must contain the
> [account identifier](admin-account-identifier.md) of your Snowflake account. If the Snowflake account name contains an
> underscore and you are using the account name format of the identifier, you must convert the underscore to a hyphen because Okta does
> not support underscores in URLs (e.g. `myorg-account-name`).
>
> Do not include a `privatelink` segment in the SubDomain field because private connectivity is
> not supported and entering this segment causes the SCIM connection to fail.

To configure the Snowflake application in Okta, complete the following steps.

1. In Settings, select Integration from the left hand menu and then check the Enable API Integration box.
2. For API Token, enter the value generated above from the clipboard. Click Test API Credentials button, and, if successful, save the configuration.
3. Select To App from the left hand menu.
4. Select the Provisioning Features you want to enable.
5. Verify the Attribute Mappings. The `defaultRole`, `defaultSecondaryRoles`, and `defaultWarehouse` attributes are unmapped as they are optional. If there’s a need, you can map them using Okta profile or expression or set the same value for all users.

You can now assign users to the Snowflake application (if needed) and finish the application setup.

> **Note:**
>
> Okta supports an attribute called `snowflakeUserName` which maps to the `name` field of the Snowflake user.
>
> If you want the `name` and `login_name` fields for the Snowflake user to have different values, follow this procedure.
>
> 1. Contact Snowflake support to enable separate mapping for your account.
> 2. In Okta, access the Snowflake application and navigate to Provisioning > Attribute Mappings > Edit Mappings.
> 3. Search for the attribute `snowflakeUserName`.
> 4. If the attribute is not found, the Snowflake application was created prior to this attribute being available. Recreate the Snowflake application with the mappings shown below or add the attribute manually as follows:
>
>    * Click Add Attribute.
>    * Set the following values for each of the listed fields in the table.
>
>    | Field | Value |
>    | --- | --- |
>    | Data type | string |
>    | Display name | Snowflake Username |
>    | Variable name | `snowflakeUserName` |
>    | External name | `snowflakeUserName` |
>    | External namespace | `urn:ietf:params:scim:schemas:extension:enterprise:2.0:User` |
>    | Description | Maps to the `name` field of the user in Snowflake. |
>    | Scope | User personal |
> 5. Click Save.

## Enabling Snowflake-initiated SSO

The SCIM provisioning process does not automatically enable single sign-on (SSO).

To use SSO after the SCIM provisioning process is complete, enable
[Snowflake-initiated SSO](admin-security-fed-auth-security-integration.md).

## Managing SCIM network policies

Applying a network policy to a SCIM security integration allows the SCIM network policy to be distinct from network policies that apply to the entire Snowflake account.
It allows the SCIM provider to provision users and groups without adding IP addresses to a network policy that controls access for normal users.

A network policy applied to a SCIM integration overrides a network policy applied to the entire Snowflake account.

After creating the SCIM security integration, create the SCIM network policy using this command:

> ```sqlsyntax
> alter security integration okta_provisioning set network_policy = <scim_network_policy>;
> ```

To unset the SCIM network policy, use this command:

> ```sqlexample
> alter security integration okta_provisioning unset network_policy;
> ```

Where:

`okta_provisioning`
:   Specifies the name of the Okta SCIM security integration.

`scim_network_policy`
:   Specifies the Okta SCIM network policy in Snowflake.

For more information, see [Controlling network traffic with network policies](network-policies.md) and [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-scim.md).

## Using secondary roles with SCIM

Snowflake supports setting the [user](../sql-reference/sql/create-user.md) property `DEFAULT_SECONDARY_ROLES` to `'ALL'` with
SCIM to allow users to use [secondary roles](security-access-control-overview.md) in a Snowflake session.

For a representative example, see [Update a user](scim-user-api-reference.md).

## Populating Snowflake tags with SCIM integrations

You can populate tags by using the `snowflakeTags` attribute when you ingest user information into the SCIM security integration. The exact request input can be found in [Create a user](scim-user-api-reference.md).

For more information about adding custom attributes to an Okta user profile, see the [Okta documentation](https://help.okta.com/en-us/content/topics/users-groups-profiles/usgp-add-custom-user-attributes.htm)

To enable support for this feature:

* Create the tag before you run the SCIM integration.
* Grant proper privileges on each tag and tag schema to the OKTA_PROVISIONER role.

Here is an example of creating a tag and assigning the proper role privileges:

```sqlexample
-- Create the tag.
CREATE TAG my_database_name.my_schema_name.my_tag_name;

-- Assign the proper privileges to the SCIM integration.
GRANT USAGE ON SCHEMA my_database_name.my_schema_name TO ROLE OKTA_PROVISIONER;
GRANT APPLY ON TAG my_database_name.my_schema_name.my_tag_name TO ROLE OKTA_PROVISIONER;
```

You must grant USAGE ON SCHEMA and APPLY ON TAG to all tags and tag schemas that you plan to assign through your SCIM security integration.

## Replicating the Okta SCIM security integration

Snowflake supports replication and failover/failback with the SCIM security integration from the source account to the target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## Troubleshooting

* **Transferring ownership.** If the user update fails, check the ownership of the user in Snowflake. If it is not owned by the `okta_provisioner` role (or the role set in the `run_as_role` parameter when creating the security integration in Snowflake), then the update will fail. Transfer the ownership by running the following SQL statement in Snowflake and try again.

  > ```sqlexample
  > grant ownership on user <username> to role OKTA_PROVISIONER;
  > ```
* Ensure `login_name` property is set for existing users which should already be set if these existing Snowflake users are using Okta SSO.
* To verify that Okta is sending updates to Snowflake, check the log events in Okta for the Snowflake application and the SCIM audit logs in Snowflake to ensure Snowflake is receiving updates from Okta. Use the following to query the Snowflake SCIM audit logs.

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  > USE SCHEMA snowflake.information_schema;
  >
  > SELECT * FROM TABLE(REST_EVENT_HISTORY('scim'));
  >
  > SELECT *
  >   FROM TABLE(REST_EVENT_HISTORY(
  >   'scim',
  >   DATEADD('MINUTES',-5,CURRENT_TIMESTAMP()),
  >   CURRENT_TIMESTAMP(),
  >   200))
  >   ORDER BY event_timestamp;
  > ```
* It is possible that an authentication error may occur during the provisioning process. One possible error message is as follows:

  > ```none
  > Error authenticating: Forbidden. Errors reported by remote server: Invalid JSON: Unexpected character ('<' (code 60)): expected a valid value (number, String, array, object, 'true', 'false' or 'null') at [Source: java.io.StringReader@4c76ba04; line: 1, column: 2]
  > ```
  >
  > If this error message or other authentication error messages occur, try this troubleshooting procedure:
  >
  > 1. In Okta, remove the current Snowflake application and create a new Snowflake application.
  > 2. In Snowflake, create a new SCIM security integration and generate a new access token.
  > 3. Copy the new token by clicking Copy.
  > 4. In Okta, paste and verify the new access token as described in how to configure Okta as a SCIM identity provider.
  > 5. Provision users and roles from Okta to Snowflake using the new Snowflake application in Okta.

**Next topics:**

* [SCIM API references](scim-api-references.md)

---
title: Operations and reference
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-operations.md
section: User Guide
---

# Operations and reference

This topic covers monitoring, observability, and access privileges for Snowpipe Streaming with high-performance architecture.

## Monitoring and observability

You can monitor ingestion status through the [SNOWPIPE_STREAMING_CHANNEL_HISTORY](../../sql-reference/account-usage/snowpipe_streaming_channel_history.md) view in Snowsight and the `GET_CHANNEL_STATUS` API. These provide insight into channel state, offset progress, and ingestion health.

## Required access privileges

Calling the Snowpipe Streaming API requires a role with the following privileges:

| Object | Privilege |
| --- | --- |
| Table | OWNERSHIP or a minimum of INSERT and EVOLVE SCHEMA (only required when using schema evolution for Kafka connector with Snowpipe Streaming) |
| Database | USAGE |
| Schema | USAGE |
| Pipe | OPERATE |

---
title: Optimize dynamic table performance
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance-optimize.md
section: User Guide
---

# Optimize dynamic table performance

This topic covers techniques for optimizing dynamic table performance, organized into
design changes and adjustments.

Before you optimize a dynamic table, you might want to diagnose the cause of slow refreshes. See
[Diagnose slow refreshes](dynamic-tables-performance-monitor.md) for a step-by-step workflow.

For background on performance categories, see
[Performance decisions](dynamic-tables-performance.md).

## Design changes

Design changes require you to recreate a dynamic table, but have greater impact on performance.

> **Note:**
>
> We recommend that you group changes and recreate tables together instead of making incremental modifications.

### Choose a refresh mode

The refresh mode you choose has a significant impact on performance because it determines
how much data Snowflake processes during each refresh. For information about how each mode
works, see [Dynamic table refresh modes](dynamic-tables-refresh.md).

> **Important:**
>
> Dynamic tables with incremental refresh can’t be downstream from dynamic tables that use
> full refresh.

Use the following decision process to select a refresh mode:

1. Review your query against the list of [supported query constructs](dynamic-tables-supported-queries.md).
   Not all query operators support incremental refresh. For operators that *are* supported,
   see [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md) to understand how
   they affect performance.
2. Estimate your change volume, which is the percentage of your data that changes between
   refreshes. Incremental refresh, for example, works best when less than five percent of data changes.
3. Evaluate your data locality. Check whether your source tables are clustered by the keys
   that you plan to use in joins, GROUP BY, or PARTITION BY clauses in your dynamic table query.
   Poor locality reduces incremental refresh efficiency. To improve locality,
   see Improve data locality.
4. Choose a mode based on the following table:

   | Mode | When to use |
   | --- | --- |
   | **Incremental** | Your query uses supported operators, less than five percent of data changes between refreshes, and your source tables have good data locality.  **Note:** Incremental refresh can still scan source tables, not just the rows that changed. For example, a new row in one side of a join must match against all rows in the other table. Even a small number of changes can trigger significant work. |
   | **Full** | A large percentage of data changes, your query uses unsupported operators, or your data lacks locality. |
   | **Auto** | You’re prototyping or testing. Avoid AUTO in production because its behavior might change between Snowflake releases. |
5. When you create a dynamic table, specify the mode with `REFRESH_MODE = INCREMENTAL` or
   `REFRESH_MODE = FULL` in your CREATE DYNAMIC TABLE statement.

To check which refresh mode a dynamic table uses, see [Refresh mode](dynamic-tables-performance-monitor.md).

### Optimize your queries and pipeline

The structure of your dynamic table queries and pipeline directly affects refresh performance. Use the
following guidelines to reduce the work during each refresh.

#### Simplify individual queries

* Use inner joins instead of outer joins. Inner joins perform better with incremental
  refresh. Verify referential integrity in your source data so that you can avoid outer joins.
* Avoid unnecessary operations. Remove redundant DISTINCT clauses and unused columns.
  Exclude wide columns (like large JSON blobs) that aren’t frequently queried.
* Remove duplicates efficiently. Use ranking functions instead of DISTINCT where possible.

For detailed guidance on how specific SQL operators affect incremental refresh performance,
see [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

> **Note:**
>
> For a comprehensive example, see [Tutorial: Optimize dynamic table performance for SCD Type 1 workloads](tutorials/optimize-dynamic-table-performance.md).

#### Split transformations across dynamic tables

Breaking complex transformations into multiple dynamic tables makes it easier to identify
bottlenecks and improves debugging. With immutability constraints,
you can also use different refresh modes for different stages.

* Add filters early. Apply `WHERE` clauses in the dynamic tables closest to your source
  data so that downstream tables process fewer rows.
* To avoid repeated `DISTINCT` operations in downstream tables, remove duplicate rows earlier in your pipeline.
* Reduce the number of operations per table. Move joins, aggregations, or window functions
  into intermediate dynamic tables instead of combining them all in one query.
* Materialize compound expressions (like `DATE_TRUNC('minute', ts)`) in an intermediate
  table before grouping by them. For details, see [Optimize aggregations](dynamic-tables-performance-optimize-query.md).

> **Note:**
>
> Finding optimal split points requires trial and error.
>
> Consider splitting between operations
> that shuffle data on different keys, such as `GROUP BY`, `DISTINCT`, window functions
> with `PARTITION BY`, and joins. This lets each dynamic table maintain better data
> locality for its key operation. For operator-specific guidance, see
> [Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

The following example shows how to split a complex query into intermediate dynamic tables.

Initial complex query:

```sqlexample
CREATE DYNAMIC TABLE final_result
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT ...
  FROM large_table a
  JOIN dimension_table b ON ...
  JOIN another_table c ON ...
  GROUP BY ...;
```

Split the complex pipeline by adding an intermediate dynamic table:

```sqlexample
CREATE DYNAMIC TABLE intermediate_joined
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = my_warehouse
AS
  SELECT ...
  FROM large_table a
  JOIN dimension_table b ON ...;

CREATE DYNAMIC TABLE final_result
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT ...
  FROM intermediate_joined
  JOIN another_table c ON ...
  GROUP BY ...;
```

For detailed information and examples of how operators affect performance, see
[Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

### Mark historical data immutable

Use the `IMMUTABLE WHERE` clause to tell Snowflake that certain rows won’t change. This
reduces the scope of work during each refresh.

For syntax, examples, and detailed guidance, see
[Use immutability constraints](dynamic-tables-performance-optimize-immutability.md).

## Adjustments

Adjustments don’t require you to recreate dynamic tables. You can make adjustments while your pipeline is running.

### Adjust your warehouse configuration

The warehouse that you specify in your CREATE DYNAMIC TABLE statement runs all refreshes for that
table. Warehouse size and configuration directly affect refresh duration and cost.

For more information about warehouses and dynamic tables, see [Understand warehouse usage for dynamic tables](dynamic-tables-warehouses.md).
For general warehouse performance optimization strategies, see [Optimizing warehouses for performance](performance-query-warehouse.md).

#### Use a separate warehouse for initialization

Initial refreshes often process significantly more data than incremental refreshes. Use
INITIALIZATION_WAREHOUSE to run initializations on a larger warehouse. Reserve a
smaller, more cost-effective warehouse for regular refreshes:

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = 'DOWNSTREAM'
  WAREHOUSE = 'XS_WAREHOUSE'
  INITIALIZATION_WAREHOUSE = '4XL_WAREHOUSE'
  AS <query>;
```

To add or change the initialization warehouse for an existing dynamic table:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET INITIALIZATION_WAREHOUSE = '4XL_WAREHOUSE';
```

To remove the initialization warehouse and use the primary warehouse for all refreshes:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table UNSET INITIALIZATION_WAREHOUSE;
```

To view the warehouse configuration, use [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) or
check the [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md)
table function.

#### Resize when needed

To balance cost and performance, choose a warehouse size that prevents bytes from being spilled but doesn’t
exceed what your workload can use in parallel. When faster refreshes are critical, increase the
size slightly beyond the cost-optimal point.

Considerations for dynamic table refreshes:

* **Bytes spilled**: When query history shows bytes spilled to local or remote storage, the warehouse
  ran out of memory during refresh. A larger warehouse provides more memory to prevent spilling.
  For details, see [Queries too large to fit in memory](performance-query-warehouse-memory.md).
* **Slow initial refresh**: When the initial refresh is slow, consider setting INITIALIZATION_WAREHOUSE
  for the initial creation, or temporarily resize the warehouse and then resize it down after the table
  is created.
* **Saturated parallelism**: Beyond a certain point, additional parallelism provides diminishing
  returns. Doubling warehouse size might double cost without halving runtime. To check how your
  refresh uses parallelism, review the [query profile](dynamic-tables-performance-monitor.md).

To resize a warehouse, see [Increasing warehouse size](performance-query-warehouse-size.md).

For cost considerations, see [Virtual warehouse credit usage](cost-understanding-compute.md) and
[Working with warehouses](warehouses-tasks.md).

#### Handle concurrent refreshes with multi-cluster warehouses

If multiple dynamic tables share a warehouse and refreshes frequently queue, consider using a
[multi-cluster warehouse](warehouses-multicluster.md). Multi-cluster warehouses
automatically add clusters when queries queue and remove them when demand drops. This improves
refresh latency during peak periods without paying for unused capacity during quiet periods.

For guidance on identifying and reducing queues, see [Reducing queues](performance-query-warehouse-queue.md).

Multi-cluster warehouses require Enterprise Edition or higher. For cost considerations, see
[Setting the scaling policy for a multi-cluster warehouse](warehouses-multicluster.md).

### Identify the right target lag

Target lag controls how often your dynamic table refreshes. Shorter target lag means fresher
data but more frequent refreshes and higher compute cost. For more information about how target lag works,
see [Understanding dynamic table target lag](dynamic-tables-target-lag.md).

Use the following recommendations to optimize target lag for your workload:

* **Use DOWNSTREAM for intermediate tables** that don’t need independent freshness guarantees.
  These tables refresh only when downstream tables need them.
* **Check the refresh history to find the right lag**: Use
  [DYNAMIC_TABLE_REFRESH_HISTORY](../sql-reference/functions/dynamic_table_refresh_history.md) or [Snowsight](ui-snowsight-gs.md) to
  analyze refresh durations and skipped refreshes. Set the target lag slightly higher than your
  typical refresh duration.

#### Change target lag

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET TARGET_LAG = '1 hour';
```

To set a dynamic table to refresh based on downstream demand:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET TARGET_LAG = DOWNSTREAM;
```

### Improve data locality

*Locality* describes how closely Snowflake stores rows that share the same key values. When
rows with matching keys span fewer micro-partitions (good locality), incremental refreshes scan
less data. When matching keys span many micro-partitions (poor locality), incremental refresh
can take longer than full refresh.

For more information about how Snowflake stores data, see
[Micro-partitions & Data Clustering](tables-clustering-micropartitions.md).

#### Cluster source tables

The most effective way to improve locality is to cluster your source tables by the keys used in
your dynamic table query (JOIN, GROUP BY, or PARTITION BY keys):

```sqlexample
ALTER TABLE my_source_table CLUSTER BY (join_key_column);
```

When you join on multiple columns and can’t cluster by all of them:

* Prioritize clustering larger tables by the most selective keys.
* Consider creating separate copies of the same data clustered by different keys for use in
  different dynamic tables.

For more information, see [Clustering Keys & Clustered Tables](tables-clustering-keys.md). To enable automatic
reclustering, see [Automatic Clustering](tables-auto-reclustering.md).

#### Factors that affect locality

Beyond source table clustering, two other factors affect locality. These depend on your data
patterns and are harder to change directly:

* **How new data aligns with partition keys**: Incremental refresh is faster when new rows
  affect only a small portion of the table. This depends on your data ingestion patterns, not
  your query structure.

  For example, time-series data grouped by hour has good locality because
  new rows share recent timestamps. Data grouped by a column with values spread across the
  entire table has poor locality.
* **How changes align with dynamic table clustering**: When Snowflake applies updates or
  deletions to a dynamic table, it must locate the affected rows. This is faster when the changed rows are stored
  close together.

  For example, updates to recent rows perform well when the dynamic table is
  naturally ordered by time. Updates scattered across the entire table are slower. This factor
  depends on your workload patterns, including which rows change and how often.

When you experience poor locality because of these factors, consider whether you can adjust your data model or
ingestion patterns upstream.

---
title: Optimize queries for incremental refresh
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance-optimize-query.md
section: User Guide
---

# Optimize queries for incremental refresh

Use this page when you design a new dynamic table query or want to optimize an existing one
for incremental refresh. This guide shows which operators perform well, which need careful
handling, and how to restructure queries for better performance.

For a complete list of which query constructs are *supported* for incremental refresh, see
[Supported queries for dynamic tables](dynamic-tables-supported-queries.md).

## Performance expectations by operator

Before you optimize a dynamic table query, understand which operators benefit from incremental refresh and which can
cause problems.

> **Note:**
>
> Short queries (less than 10 seconds) might see smaller performance gains because of fixed
> overheads like query optimization and warehouse scheduling.

### Operators that perform consistently well

These operators work efficiently with incremental refresh:

* `SELECT`
* `WHERE`
* `FROM` <base table>
* `UNION ALL`
* `QUALIFY` [ `RANK` | `ROW_NUMBER` | `DENSE_RANK` ] … = 1

For details on how Snowflake processes each operator, see the operator reference table.

### Operators affected by data locality

For these operators, performance depends on [data locality](dynamic-tables-performance-optimize.md), which is
how you organize your data and where changes occur relative to your keys:

* `INNER JOIN`
* `OUTER JOIN`
* `GROUP BY`
* `DISTINCT`
* `OVER` (window functions)

When changes affect only a small portion of grouping or
partition keys, these operators perform well. Poor
data locality or changes spread across many keys can
make incremental refresh *slower* than full refresh.

For details on how Snowflake processes each operator, see the operator reference table.

## Common optimization patterns

The following sections show common patterns to optimize queries that use locality-sensitive operators.

### Optimize aggregations

When you use [GROUP BY](../sql-reference/constructs/group-by.md), Snowflake recomputes aggregates for every grouping key that contains
changes. Performance depends on the following factors:

* **Data clustering**: Source data clustered by grouping keys performs best.
* **Change distribution**: Aim for changes that affect fewer than five percent of grouping keys.
* **Key complexity**: Simple column references outperform compound expressions.

#### Problem: Compound expressions in grouping keys

This query performs poorly because the grouping key is an expression:

```sqlexample
CREATE DYNAMIC TABLE hourly_sums
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT DATE_TRUNC('minute', ts), SUM(amount)
  FROM transactions
  GROUP BY 1;
```

#### Solution: Materialize the expression

Split into two dynamic tables to expose a simple grouping key:

```sqlexample
CREATE DYNAMIC TABLE transactions_with_minute
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = my_warehouse
AS
  SELECT DATE_TRUNC('minute', ts) AS ts_minute, amount
  FROM transactions;

CREATE DYNAMIC TABLE hourly_sums
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT ts_minute, SUM(amount)
  FROM transactions_with_minute
  GROUP BY 1;
```

Now `GROUP BY` operates on a simple column, and the intermediate table benefits from
better [data locality](dynamic-tables-performance-optimize.md).

### Optimize joins

Join performance depends on which side changes and how you cluster data.

**INNER JOIN**: Snowflake joins changes from the left side with the right table, then joins
changes from the right side with the left table. Joins perform well when one side is small
or changes infrequently.

**OUTER JOIN**: Snowflake must also compute NULL values for non-matching rows. Which side
changes significantly affects performance.

#### Problem: Large table on both sides with poor clustering

Neither source table is clustered by join key:

```sqlexample
CREATE DYNAMIC TABLE order_details
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT o.order_id, o.customer_id, p.product_name, o.quantity
  FROM orders o
  JOIN products p ON o.product_id = p.product_id;
```

#### Solution: Cluster the table that changes less often

Cluster the dimension table by the join key. Then, the join benefits from better locality:

```sqlexample
ALTER TABLE products CLUSTER BY (product_id);

CREATE DYNAMIC TABLE order_details
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT o.order_id, o.customer_id, p.product_name, o.quantity
  FROM orders o
  JOIN products p ON o.product_id = p.product_id;
```

For OUTER JOINs:

* Put the table that changes more often on the LEFT side.
* Minimize changes on the side opposite the OUTER keyword.
* For FULL OUTER JOINs, good locality is critical on both sides.

### Optimize window functions

Snowflake recomputes [window functions](../sql-reference/functions-window.md) for every partition key that contains changes. Optimize
them similarly to `GROUP BY`.

Key requirements:

* Always include a PARTITION BY clause. Window functions without PARTITION BY result in a full
  recomputation.
* Cluster source data by partition keys.
* Keep changes to fewer than five percent of partitions.

#### Problem: Window function without partition clustering

The source table isn’t clustered by the partition key:

```sqlexample
CREATE DYNAMIC TABLE ranked_sales
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT
    region,
    salesperson,
    amount,
    RANK() OVER (PARTITION BY region ORDER BY amount DESC) as sales_rank
  FROM daily_sales;
```

#### Solution: Cluster by the partition key

Cluster the source table by the partition key so that the window function benefits from locality:

```sqlexample
ALTER TABLE daily_sales CLUSTER BY (region);

CREATE DYNAMIC TABLE ranked_sales
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT
    region,
    salesperson,
    amount,
    RANK() OVER (PARTITION BY region ORDER BY amount DESC) as sales_rank
  FROM daily_sales;
```

### Remove duplicates efficiently (DISTINCT vs QUALIFY)

Both [DISTINCT](../sql-reference/sql/select.md) and [QUALIFY](../sql-reference/constructs/qualify.md) can remove duplicates,
but they perform differently.

**DISTINCT**: Equivalent to `GROUP BY ALL`. Locality directly affects performance; poor
locality causes slow refreshes.

**QUALIFY with ROW_NUMBER = 1**: Snowflake optimizes the pattern `QUALIFY ROW_NUMBER() ... = 1`
when it’s in the top-level projection of the dynamic table. This pattern consistently performs
faster than full refresh.

The optimization works best when all PARTITION BY and ORDER BY columns in the OVER() clause
are queryable and persisted in the dynamic table that is included in the top-level SELECT projection.

#### Recommendation: Use QUALIFY instead of DISTINCT when possible

The following example uses DISTINCT:

```sqlexample
CREATE DYNAMIC TABLE unique_customers
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT DISTINCT customer_id, customer_name, email
  FROM customer_events;
```

The following example uses QUALIFY:

```sqlexample
CREATE DYNAMIC TABLE unique_customers
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
AS
  SELECT customer_id, customer_name, email, event_time
  FROM customer_events
  QUALIFY ROW_NUMBER() OVER (PARTITION BY customer_id ORDER BY event_time DESC) = 1;
```

The QUALIFY version is more explicit about which duplicate to keep (the most recent) and
performs consistently well.

#### Remove redundant DISTINCT operations

Each DISTINCT consumes resources on every refresh. When your data is already unique or you
eliminate duplicates upstream, remove unnecessary DISTINCT clauses.

## Operator reference

The following table explains how Snowflake processes each SQL operator during incremental
refresh:

| Operator | How Snowflake processes it | Performance notes |
| --- | --- | --- |
| SELECT | Applies expressions to changed rows only. | Performs well. No special considerations. |
| WHERE | Evaluates the predicate on changed rows only. | Performs well. Cost scales linearly with changes. Note: A highly selective WHERE might require warehouse uptime even when the output doesn’t change. |
| FROM <table> | Scans micro-partitions that Snowflake added or removed since the last refresh. | Cost scales with the volume of changed partitions. Limit changes to about five percent of the source table. |
| UNION ALL | Takes the union of changes from each side. | Performs well. No special considerations. |
| WITH (CTEs) | Computes changes for each Common Table Expression. | Performs well, but avoid overly complex single-table definitions. Consider splitting into multiple dynamic tables. |
| Scalar aggregates | Fully recomputes the aggregate when input changes. | Avoid in performance-critical tables. Consider grouping by a constant instead. |
| GROUP BY | Recomputes aggregates for changed grouping keys. | Cluster source by grouping keys. Avoid compound expressions in keys. See Optimize aggregations. |
| DISTINCT | Equivalent to GROUP BY ALL. | Locality-sensitive. Consider using QUALIFY instead. See Remove duplicates efficiently (DISTINCT vs QUALIFY). |
| Window functions | Recomputes for changed partition keys. | Always include PARTITION BY. Cluster source by partition keys. See Optimize window functions. |
| INNER JOIN | Joins changes from each side with the other table. | Performs well when one side is small. Cluster the less-frequently-changing side. See Optimize joins. |
| OUTER JOIN | Combines inner join with NOT EXISTS queries for NULL computation. | Most locality-sensitive operator. See Optimize joins. |
| LATERAL FLATTEN | Applies flatten to changed rows only. | Performs well. Cost scales linearly with changes. |
| QUALIFY with ranking | Uses an optimized path for ROW_NUMBER/RANK/DENSE_RANK … = 1. | Highly efficient. Place QUALIFY at the top-level projection of the dynamic table. |

---
title: Optimizing cloud services for cost
source: https://docs.snowflake.com/en/user-guide/cost-optimize-cloud-services.md
section: User Guide
---

# Optimizing cloud services for cost

If you find that your [cloud services usage](cost-understanding-compute.md) is higher than expected, check if your use of
Snowflake follows any of the following patterns. Each pattern includes a recommendation that might help you reduce costs associated with
cloud services.

* Pattern: Copy commands with poor selectivity
* Pattern: High-frequency DDL operations and cloning
* Pattern: High-frequency, simple queries
* Pattern: High-frequency INFORMATION_SCHEMA queries
* Pattern: High-frequency SHOW commands (by data applications and third-party tools)
* Pattern: Single-row inserts and fragmented schemas (by data applications)
* Pattern: Complex SQL queries

Pattern: Copy commands with poor selectivity
:   Executing copy commands involves listing files from Amazon Simple Storage Service (S3). Because listing files uses only cloud services
    compute, executing copy commands with poor selectivity can result in high cloud services usage.

    **Recommendation:** Consider changing the structure of your S3 bucket to include some kind of date prefix, so you list only the targeted
    files you need.

Pattern: High-frequency DDL operations and cloning
:   Data Definition Language (DDL) operations, particularly cloning, are entirely metadata operations, meaning they use only cloud services
    compute. Frequently creating or dropping large schemas or tables, or cloning databases for backup, can result in significant cloud
    services usage.

    **Recommendation:** Cloning uses only a fraction of the resources needed to do deep copies, so you should continue to clone. Review your
    cloning patterns to ensure they are as granular as possible, and aren’t being executed too frequently. For example, you might want to
    clone only individual tables rather than an entire schema.

Pattern: High-frequency, simple queries
:   The consumption of cloud services by a single simple query is negligible, but running queries such as `SELECT 1`,
    `SELECT sequence1.NEXTVAL`, or `SELECT CURRENT_SESSION()` at an extremely high frequency (tens of thousands per day) can result in
    significant cloud services usage.

    **Recommendation:** Review your query frequency and determine whether the frequency is appropriately set for your use case. If you
    observe a high frequency of `SELECT CURRENT_SESSION()` queries originating from partner tools using the JDBC driver, confirm that
    the partner has updated their code to use the `getSessionId()` method in the
    [SnowflakeConnection interface](../developer-guide/jdbc/jdbc-api.md). This takes advantage of caching and reduces cloud services usage.

Pattern: High-frequency INFORMATION_SCHEMA queries
:   Queries against the [Snowflake Information Schema](../sql-reference/info-schema.md) consume only cloud services resources. The consumption of cloud services by a single
    query against INFORMATION_SCHEMA views might be negligible, but running these queries at extremely high frequency (tens of thousands per
    day) can result in significant cloud services usage.

    **Recommendation:** Review your query frequency and determine whether the frequency is appropriately set for your use case.
    Alternatively, you can query a view in the [ACCOUNT_USAGE schema](../sql-reference/account-usage.md) instead of an INFORMATION_SCHEMA
    view. Querying the ACCOUNT_USAGE schema uses a virtual warehouse rather than cloud services.

Pattern: High-frequency SHOW commands (by data applications and third-party tools)
:   SHOW commands are entirely metadata operations, meaning they consume only cloud services resources. This pattern typically occurs when
    you have created an application built on top of Snowflake that executes SHOW commands at a high frequency. These commands might also be
    initiated by third-party tools.

    **Recommendation:**
    Review your query frequency and determine whether the frequency is appropriately set for your use case. In the case of partner tools,
    reach out to your partner to see if they have any plans to adjust their usage.

Pattern: Single-row inserts and fragmented schemas (by data applications)
:   Snowflake is not an OLTP system, so single-row inserts are suboptimal, and can consume significant cloud services resources.

    Building a data application that defines one schema per customer might result in several data loads in a given time period, which can
    result in high cloud services consumption.

    This pattern also results in a lot more metadata that Snowflake needs to maintain, and metadata operations consume cloud services
    resources. Each metadata operation individually consumes minimal resources, but consumption might be significant in aggregate.

    **Recommendation:** In general, do batch or bulk loads rather than single-row inserts.

    Using a shared schema is significantly more efficient, which saves costs. You’ll likely want to cluster all tables on `customer_ID` and
    use [secure views](views-secure.md).

Pattern: Complex SQL queries
:   Queries can consume significant cloud services compute if they include a lot of joins/Cartesian products, use the IN operator with large
    lists, or are very large queries. These types of queries all have high compilation times.

    **Recommendation:** Review your queries to confirm they are doing what you intend them to do. Snowflake supports these queries and will
    charge you only for the resources consumed.

---
title: Optimizing cost
source: https://docs.snowflake.com/en/user-guide/cost-optimize.md
section: User Guide
---

# Optimizing cost

This topic summarizes the features and strategies you can use to optimize Snowflake to reduce costs and maximize your spend.

[Using cost insights to save](cost-insights.md)
:   Learn how to use cost insights to optimize Snowflake for cost within a particular account.

[Optimizing cloud services for cost](cost-optimize-cloud-services.md)
:   Learn how to adjust your cloud services usage to reduce costs.

---
title: Optimizing query performance
source: https://docs.snowflake.com/en/user-guide/performance-query-options.md
section: User Guide
---

# Optimizing query performance

You can optimize Snowflake query performance in the following ways:

* Search optimization service
* Query acceleration
* Creating one or more materialized views (clustered or unclustered)
* Clustering a table

Each of these optimization methods has different advantages, as shown in the following table:

| Feature | Supported query types | Notes |
| --- | --- | --- |
| [Search optimization service](search-optimization-service.md) | * [Equality searches](search-optimization/point-lookup-queries.md). * [Substring and regular expression searches](search-optimization/substring-queries.md). * [Character data (text) and IP address searches](search-optimization/text-queries.md). * Searches of [elements in VARIANT](search-optimization/semi-structured-queries.md). * Searches of [elements in structured types](search-optimization/structured-queries.md). * Searches of [GEOGRAPHY columns using geospatial functions](search-optimization/geospatial-queries.md).   The search optimization service can improve the performance of these types of searches for the [supported data types](search-optimization/queries-that-benefit.md). |  |
| [Query acceleration service](query-acceleration-service.md) | Queries with filters or aggregation. If the query includes LIMIT, the query must also include ORDER BY.  The filters must be highly selective, and the ORDER BY clause must have a low cardinality.    Query acceleration works well with ad-hoc analytics, queries with unpredictable data volume,  and queries with large scans and selective filters. | Query acceleration and search optimization are complementary. Both can accelerate the same query. See Compatibility with query acceleration. |
| [Materialized views](views-materialized.md) | * Equality searches. * Range searches. * Sort operations. | You can also use materialized views to define different clustering keys on the same source table, or a subset of that table, or to store flattened JSON or VARIANT data so it only needs to be flattened once.  Materialized views improve performance only for the subset of rows and columns included in the materialized view. |
| [Clustering the table](tables-clustering-keys.md) | * Equality searches. * Range searches. | A table can be clustered only on a single key, which can contain one or more columns or expressions. |

The following table shows which of these optimizations have storage or compute costs:

| Optimization | Storage cost | Compute cost |
| --- | --- | --- |
| Search optimization service | ✔ | ✔ |
| Query acceleration service |  | ✔ |
| Materialized view | ✔ | ✔ |
| Clustering the table | ✔ [1] | ✔ |

[1]

The process of reclustering can increase the size of [fail-safe](data-failsafe.md) storage
because of the rewriting of existing partitions into new partitions. Reclustering doesn’t introduce any new rows.
For more information, see [Credit and Storage Impact of Reclustering](tables-clustering-keys.md).

## Compatibility with query acceleration

Search optimization and [query acceleration](query-acceleration-service.md) can work together to
optimize query performance. First, search optimization can prune the [micro-partitions](tables-clustering-micropartitions.md) that aren’t needed for a query. Then, for [eligible queries](query-acceleration-service.md), query acceleration can offload portions of the rest of the work to
shared compute resources that the service provides.

The performance of queries that are accelerated by both services varies depending on the workload and available resources.

---
title: Optimizing storage for performance
source: https://docs.snowflake.com/en/user-guide/performance-query-storage.md
section: User Guide
---

# Optimizing storage for performance

This topic discusses storage optimizations that can improve query performance, such as storing similar data together, creating optimized
data structures, and defining specialized data sets. Snowflake provides three of these storage strategies: automatic clustering, search
optimization, and materialized views.

In general, these storage strategies do not substantially improve the performance of queries that already execute in a second or faster.

The strategies discussed in this topic are just one way to boost the performance of queries. For strategies related to the computing
resources used to execute a query, refer to [Optimizing warehouses for performance](performance-query-warehouse.md).

## Introduction to storage strategies

### Automatic Clustering

Snowflake stores a table’s data in [micro-partitions](tables-clustering-micropartitions.md). Among these micro-partitions, Snowflake
organizes (i.e. clusters) data based on dimensions of the data. If a query filters, joins, or aggregates along those dimensions, fewer
micro-partitions must be scanned to return results, which speeds up the query considerably.

You can set a [cluster key](tables-clustering-keys.md) to change the default organization of the micro-partitions so data is clustered
around specific dimensions (i.e. columns). Choosing a cluster key improves the performance of queries that filter, join, or aggregate by
the columns defined in the cluster key.

Snowflake enables Automatic Clustering to maintain the clustering of the table as soon as you define a cluster key. Once enabled, Automatic
Clustering updates micro-partitions as new data is added to the table. [Learn More](tables-auto-reclustering.md)

### Search Optimization Service

The Search Optimization Service improves the performance of point lookup queries (i.e. “needle in a haystack searches”) that return a
small number of rows from a table using highly selective filters. The Search Optimization Service is ideal when it is critical to have
low-latency point lookup queries (e.g. investigative log searches, threat or anomaly detection, and critical dashboards with selective
filters).

The Search Optimization Service reduces the latency of point lookup queries by building a persistent data structure that is optimized for
a particular type of search.

You can enable the Search Optimization Service for an entire table or for specific columns. As long as they are selective enough,
[equality searches](search-optimization/point-lookup-queries.md),
[substring searches](search-optimization/substring-queries.md), and
[geo searches](search-optimization/geospatial-queries.md) against those columns can be sped up significantly.

The Search Optimization Service supports both structured and semi-structured data (see [supported data types](search-optimization/queries-that-benefit.md)).

The Search Optimization Service requires Snowflake Enterprise Edition or higher. [Learn More](search-optimization-service.md)

### Materialized views

A materialized view is a pre-computed data set derived from a SELECT statement that is stored for later use. Because the data is
pre-computed, querying a materialized view is faster than executing a query against the base table on which the view is defined. For
example, if you specify `SELECT SUM(column1)` when creating the materialized view, then a query that returns `SUM(column1)` from the
view executes faster because `column1` has already been aggregated.

Materialized views are designed to improve query performance for workloads composed of common, repeated query patterns that return a small
number of rows and/or columns relative to the base table.

A materialized view cannot be based on more than one table.

Materialized views require Snowflake Enterprise Edition or higher. [Learn More](views-materialized.md)

## Choosing an optimization strategy

Different types of queries benefit from different storage strategies. You can use the following sections to discover which strategy best
fits a workload.

Automatic Clustering is the broadest option that can benefit a range of queries that access the same columns of a table. An administrator
often picks the most important queries based on frequency and latency requirements, and then chooses a cluster key that maximizes the
performance of those queries. Automatic Clustering makes sense when many queries filter, join, or aggregate the same few columns.

The Search Optimization Service and materialized views have a narrower scope. When specific queries access a well-defined subset of a
table’s data, the administrator can use the characteristics of the query to decide whether using the Search Optimization Service or a
materialized view might improve performance. For example, administrators could identify important point lookup queries and implement the
Search Optimization Service for a table or column. Likewise, the administrator could optimize specific query patterns by creating a
materialized view.

You can implement more than one of these strategies for a table, and an individual query with multiple filters could potentially benefit
from both Automatic Clustering and the Search Optimization Service. However, enabling the Search Optimization Service or creating a
materialized view on a clustered table can be more expensive. To learn why this increases compute costs, refer
to Ongoing Costs (in this topic).

If more than one strategy could potentially improve the performance of a particular query, you might want to start with Automatic
Clustering or the Search Optimization Service because other queries with similar access patterns could also be improved.

### Differentiating considerations

The following is not an exhaustive comparison of the storage strategies, but rather provides the most important considerations when
differentiating between them.

Automatic Clustering:
:   * Biggest performance boost comes from a WHERE clause that filters on a column of the cluster key, but it can also improve the performance
      of other clauses and functions that act upon that same column (e.g. joins and aggregations).
    * Ideal for range queries or queries with an inequality filter. Also improves an equality filter, but the Search Optimization Service is
      usually faster for point lookup queries.
    * Available in Standard Edition of Snowflake.
    * There can be only one cluster key. [1] If different queries against a table act upon different columns, consider using the Search
      Optimization Service or a materialized view instead.

Search Optimization Service:
:   * Improves point lookup queries that return *a small number of rows*. If the query returns more than a few records, consider Automatic
      Clustering instead.
    * Includes support for point lookup queries that:

      + Match substrings or regular expressions using predicates such as LIKE and RLIKE.
      + Search for specific fields in VARIANT, ARRAY, or OBJECT columns.
      + Use geospatial functions with GEOGRAPHY values.

Materialized view:
:   * Improves intensive and frequent calculations such as aggregation and analyzing semi-structured data (not just filtering).
    * Usually focused on a specific query/subquery calculation.
    * Improves queries against [external tables](tables-external-intro.md).

[1] If there is an important reason to define multiple cluster keys, you could create multiple materialized views, each with its own
cluster key.

### Prototypical queries

The following examples are intended to highlight which type of query typically runs faster with a particular storage strategy.

Prototypical Query for Clustering
:   Automatic Clustering provides a performance boost for *range queries* with large table scans. For example, the following query will
    execute faster if the `shipdate` column is the table’s cluster key because the `WHERE` clause scans a lot of data.

    ```sqlexample
    SELECT
      SUM(quantity) AS sum_qty,
      SUM(extendedprice) AS sum_base_price,
      AVG(quantity) AS avg_qty,
      AVG(extendedprice) AS avg_price,
      COUNT(*) AS count_order
    FROM lineitem
    WHERE shipdate >= DATEADD(day, -90, to_date('2023-01-01));
    ```

    For an additional example of a query that might run faster if the table was clustered, refer to [Benefits of Defining Clustering Keys (for Very Large Tables)](tables-clustering-keys.md).

Prototypical Query for Search Optimization
:   The Search Optimization Service can provide a performance boost for *point lookup queries* that scan a large table to return a small
    subset of records. For example, the following query will execute faster with the Search Optimization Service if the `sender_ip` column
    has a large number of distinct values.

    ```sqlexample
    SELECT error_message, receiver_ip
    FROM logs
    WHERE sender_ip IN ('198.2.2.1', '198.2.2.2');
    ```

    To review other queries that might run faster with the Search Optimization Service, refer to the following examples:

    * [Equality operators](search-optimization-service.md)
    * [Geospatial functions](search-optimization/geospatial-queries.md)
    * [Substring and Regular Expressions](search-optimization/substring-queries.md)
    * [Fields in VARIANT Columns](search-optimization/semi-structured-queries.md)

Prototypical Query for Materialized View
:   A materialized view can provide a performance boost for queries that access a small subset of data using expensive operations like
    aggregation. As an example, suppose that an administrator aggregated the `totalprice` column when creating a materialized view
    `mv_view1`. The following query against the materialized view will execute faster than it would against the base table.

    ```sqlexample
    SELECT
      orderdate,
      SUM(totalprice)
    FROM mv_view1
    GROUP BY 1;
    ```

    For more use cases where materialized views can speed up queries, refer to [Examples of Use Cases For Materialized Views](views-materialized.md).

## Implementation and cost considerations

This section discusses cost considerations of using a storage strategy to improve query performance, along with implementation
considerations as you balance cost and performance.

### Initial investment

Implementing a storage strategy can require a bigger time commitment and upfront financial investment than other types of performance
optimizations (e.g. re-writing SQL statements or [optimizing the warehouse](performance-query-warehouse.md) running the query), but the
performance improvements can be significant.

Snowflake uses [serverless compute resources](cost-understanding-compute.md) to implement each storage strategy, which consumes
credits before you can test how well the optimization improves performance. In addition, it can take Snowflake a significant amount of
time to fully implement Automatic Clustering and the Search Optimization Service (e.g. a week for a very large table).

The Search Optimization Service and materialized views also require the Enterprise Edition or higher, which increases the price of a credit.

### Ongoing cost

Storage strategies incur both compute and storage costs.

Compute Costs
:   Snowflake uses serverless compute resources to maintain storage optimizations as new data is added to a table. The more changes to a
    table, the higher the maintenance costs. If a table is constantly updated, the cost of maintaining a storage optimization might be
    prohibitive.

    The cost of maintaining materialized views or the Search Optimization Service can be significant when Automatic Clustering is enabled
    for the underlying table. With Automatic Clustering, Snowflake is constantly reclustering its micro-partitions around the dimensions of
    the cluster key. Every time the base table is reclustered, Snowflake must use serverless compute resources to update the storage used by
    materialized views and the Search Optimization Service. As a result, Automatic Clustering activities on the base table can trigger
    maintenance costs for materialized views and the Search Optimization Service beyond the cost of the DML commands on the base table.

Storage Costs
:   Automatic Clustering
    :   Unlike the Search Optimization Service and materialized views, Automatic Clustering reorganizes existing data rather than creating
        additional storage. However, reclustering can incur additional storage costs if it increases the size of
        [Fail-safe](data-failsafe.md) storage. For details, refer to [Credit and Storage Impact of Reclustering](tables-clustering-keys.md).

    Search Optimization / Materialized Views
    :   Materialized views and the Search Optimization Service incur the cost of additional storage, which is billed at the standard rate.

### Estimating costs

Automatic Clustering
:   You can run the [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](../sql-reference/functions/system_estimate_automatic_clustering_costs.md) function to help estimate the cost of enabling Automatic Clustering for a table and maintaining the table in a well-clustered state. This estimate is based on the change history of the table. Actual costs can vary significantly, especially if DML patterns change after enabling Automatic Clustering.

Search Optimization Service
:   You can run the [SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS](../sql-reference/functions/system_estimate_search_optimization_costs.md) function to help estimate the cost of adding the
    Search Optimization Service to a column or entire table. The estimated costs are proportional to the number of columns that will be
    enabled and how much the table has recently changed.

### Implementation strategy

Because the compute costs and storage costs of a storage strategy can be significant, you might want to start small and carefully track the
initial and ongoing costs before committing to a more extensive implementation. For example, you might choose a cluster key for just one or
two tables, and then assess the cost before choosing a key for other tables.

When tracking the ongoing cost associated with a storage strategy, remember that virtual warehouses consume credits only during the time
they are running a query, so a faster query costs less to run. Snowflake recommends carefully reporting on the cost of running a query
before the storage optimization and comparing it to the cost of running the same query after the storage optimization so it can be factored
into the cost assessment.

---
title: Optimizing the warehouse cache
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-cache.md
section: User Guide
---

# Optimizing the warehouse cache

This topic discusses how a warehouse owner or administrator can optimize a warehouse’s cache in order to improve the performance of queries
running on the warehouse.

A running warehouse maintains a cache of table data that can be accessed by queries running on the same warehouse. This can improve the
performance of subsequent queries if they are able to read from the cache instead of from tables.

See also [Using Persisted Query Results](querying-persisted-results.md), which explains how the results of specific queries may be cached and reused.

> **Note:**
>
> You must have [access to the shared SNOWFLAKE database](../sql-reference/account-usage.md) to execute the diagnostic queries provided in this topic. By default, only the ACCOUNTADMIN role has the privileges needed to execute the queries.

## Finding data scanned from cache

The following query provides the percentage of data scanned from cache, aggregated across all queries and broken out by warehouse.

If you have queries that can benefit from scanning data from the cache (e.g. frequent, similar queries) and the percentage of data scanned
from cache is low, you might see a performance boost by optimizing the cache.

```sqlexample
SELECT warehouse_name
  ,COUNT(*) AS query_count
  ,SUM(bytes_scanned) AS bytes_scanned
  ,SUM(bytes_scanned*percentage_scanned_from_cache) AS bytes_scanned_from_cache
  ,SUM(bytes_scanned*percentage_scanned_from_cache) / SUM(bytes_scanned) AS percent_scanned_from_cache
FROM snowflake.account_usage.query_history
WHERE start_time >= dateadd(month,-1,current_timestamp())
  AND bytes_scanned > 0
GROUP BY 1
ORDER BY 5;
```

## About the cache and auto-suspension

The auto-suspend setting of the warehouse can have a direct impact on query performance because the cache is dropped when the warehouse
is suspended. If a warehouse is running frequent and similar queries, it might not make sense to suspend the warehouse in between queries
because the cache might be dropped before the next query is executed.

You can use the following general guidelines when setting the auto-suspension time limit:

* For [tasks](tasks-intro.md), Snowflake recommends immediate suspension.
* For DevOps, DataOps, and Data Science use cases, Snowflake recommends setting auto-suspension to approximately 5 minutes because the cache is
  not as important for ad-hoc and unique queries.
* For query warehouses, for example BI and SELECT use cases, Snowflake recommends setting auto-suspend to at least 10 minutes to
  maintain the cache for users.

## Cost considerations

Keep in mind that a running warehouse consumes credits even if it is not processing queries. Be sure that your auto-suspend setting
matches your workload. For example, if a warehouse executes a query every 30 minutes, it does not make sense to set the auto-suspend
setting to 10 minutes. The warehouse will consume credits while sitting idle without gaining the benefits of a cache because it will be
dropped before the next query executes.

## How to configure auto-suspension

To change how much time must elapse before a warehouse is suspended and its cache dropped:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Compute » Warehouses.
    3. Find the warehouse, and select … » Edit.
    4. Ensure that Auto Suspend is turned on.
    5. In the Suspend After (min) field, enter the number of minutes that must elapse before the warehouse is suspended.
    6. Select Save Warehouse.

SQL:
:   Use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command to change the auto-suspend time limit, which is specified in seconds, not
    minutes. For example:

    ```sqlexample
    ALTER WAREHOUSE my_wh SET AUTO_SUSPEND = 600;
    ```

---
title: Optimizing warehouses for performance
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse.md
section: User Guide
---

# Optimizing warehouses for performance

In the Snowflake architecture, virtual warehouses provide the computing power that is required to execute queries. Fine-tuning the compute
resources provided by a warehouse can improve the performance of a query or set of queries.

A warehouse owner or administrator can try the following warehouse-related strategies as they attempt to improve the performance of one or
more queries. As they adjust a warehouse based on one of these strategies, they can test the change by re-running the query and
[checking its execution time](performance-query-exploring.md).

Warehouse-related strategies are just one way to boost the performance of queries. For performance strategies involving how data
is stored, refer to [Optimizing storage for performance](performance-query-storage.md).

| Strategy | Description |
| --- | --- |
| [Reduce queues](performance-query-warehouse-queue.md) | Minimizing queuing can improve performance because the time between submitting a query and getting its results is longer when the query must wait in a queue before starting. |
| [Resolve memory spillage](performance-query-warehouse-memory.md) | Adjusting the available memory of a warehouse can improve performance because a query runs substantially slower when a warehouse runs out of memory, which results in bytes “spilling” onto storage. |
| [Increase warehouse size](performance-query-warehouse-size.md) | The larger a warehouse, the more compute resources are available to execute a query or set of queries. |
| [Try query acceleration](performance-query-warehouse-qas.md) | The query acceleration service offloads portions of query processing to serverless compute resources, which speeds up the processing of a query while reducing its demand on the warehouse’s compute resources. |
| [Optimize the warehouse cache](performance-query-warehouse-cache.md) | Query performance improves if a query can read from the warehouse’s cache instead of from tables. |
| [Limit concurrently running queries](performance-query-warehouse-max-concurrency.md) | Limiting the number of queries that are running concurrently in a warehouse can improve performance because there are fewer queries putting demands on the warehouse’s resources. |

> **Tip:**
>
> Optimizing a warehouse for query performance is more straightforward when the warehouse runs similar workloads. For example, if a
> warehouse runs significantly different queries, the cost of a performance enhancement might be wasted on a query that does not benefit
> from the optimization.
>
> For general guidelines about distributing workloads to your organization’s warehouses, see the Analyzing Your Workloads section of
> the [Managing Snowflake’s Compute Resources](https://www.snowflake.com/blog/managing-snowflakes-compute-resources/) (Snowflake blog).

---
title: Option 1: Configure a Snowflake storage integration to access Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-s3-config-storage-integration.md
section: User Guide
---

# Option 1: Configure a Snowflake storage integration to access Amazon S3

This topic describes how to use storage integrations to allow Snowflake to read data from and write data to an Amazon S3 bucket referenced in an external (i.e. S3) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an AWS identity and access management (IAM) user ID. An administrator in your organization grants the integration IAM user permissions in the AWS account.

An integration can also list buckets (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires permissions in AWS to create and manage IAM policies and roles. If you are not an
>   AWS administrator, ask your AWS administrator to perform these tasks.
> * Access to S3 storage in [government regions](intro-regions.md) using a storage integration is limited to Snowflake accounts
>   hosted on AWS in the same government region.
> * Confirm that Snowflake supports the AWS region that your storage is hosted in. For more information, see
>   [Supported cloud regions](intro-regions.md).

The following diagram shows the integration flow for a S3 stage:

1. An external (i.e. S3) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a S3 IAM user created for your account. Snowflake creates a single IAM user that is referenced by all S3 storage integrations in your Snowflake account.
3. An AWS administrator in your organization grants permissions to the IAM user to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same storage integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the IAM user on the bucket before allowing or denying access.

## Configure secure access to cloud storage

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage
to a Snowflake identity and access management (IAM) entity.

### Step 1: Configure access permissions for the S3 bucket

#### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

> **Note:**
>
> The following additional permissions are required to perform additional SQL actions:
>
> | Permission | SQL Action |
> | --- | --- |
> | `s3:PutObject` | Unload files to the bucket. |
> | `s3:DeleteObject` | Either automatically purge files from the stage after a successful load or execute [REMOVE](../sql-reference/sql/remove.md) statements to manually remove files. |

As a best practice, Snowflake recommends creating an IAM policy for Snowflake access to the S3 bucket. You can then attach the policy to the role and use the security credentials generated by AWS for the role to access files in the bucket.

#### Create an IAM policy

The following step-by-step instructions describe how to configure access permissions for Snowflake in your AWS Management Console so that you can use an
S3 bucket to load and unload data:

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy document that will allow Snowflake to access the S3 bucket and folder.

   The following policy (in JSON format) provides Snowflake with the required permissions to load or unload data using a single bucket and folder path. You can also purge data files using the PURGE copy option.

   Copy and paste the text into the policy editor:

   > **Note:**
   > * Make sure to replace `bucket` and `prefix` with your actual bucket name and folder path prefix.
   > * The Amazon Resource Names (ARN) for buckets in
   >   [government regions](intro-regions.md) have a `arn:aws-us-gov:s3:::` prefix.
   > * The ARN for buckets in public AWS regions in China have a `arn:aws-cn:s3:::` prefix.
   > * If you’re using an S3 access point, specify the access point ARN instead of a bucket ARN. For more information, see
   >   [Configuring IAM policies for using access points](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-policies.html).

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:PutObject",
                 "s3:GetObject",
                 "s3:GetObjectVersion",
                 "s3:DeleteObject",
                 "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   >
   > Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the
   > specified bucket or path in the bucket, respectively.

   Note that AWS policies support a variety of different security use cases.

   The following policy provides Snowflake with the required permissions to load data from a single read-only bucket and folder
   path. The policy includes the `s3:GetBucketLocation`, `s3:GetObject`, `s3:GetObjectVersion`, and
   `s3:ListBucket` permissions:

   **Alternative policy: Load from a read-only S3 bucket**

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:GetObject",
                 "s3:GetObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Step 2: Create the IAM role in AWS

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Select Another AWS account
5. In the Account ID field, enter your own AWS account ID temporarily. Later, you modify the trust relationship and grant
   access to Snowflake.
6. Select the Require external ID option. An external ID is used to grant access to your AWS resources
   (such as S3 buckets) to a third party like Snowflake.

   Enter a placeholder ID such as `0000`.
   In a later step, you will modify the trust relationship for your IAM role and specify the external ID for your storage integration.
7. Select Next.
8. Select the policy you created in Step 1: Configure access permissions for the S3 bucket (in this topic).
9. Select Next.
10. Enter a name and description for the role, then select Create role.

    You have now created an IAM policy for a bucket, created an IAM role, and attached the policy to the role.
11. On the role summary page, locate and record the Role ARN value. In the next step, you will create a Snowflake integration that
    references this role.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and access data from the cloud storage location until the cache expires.

### Step 3: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake
object that stores a generated identity and access management (IAM) user for your S3 cloud storage, along with an optional set of allowed
or blocked storage locations (that is, buckets). Cloud provider administrators in your organization grant permissions on the storage locations
to the generated user. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, S3) stages. The URL in the stage definition must align with the S3
buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this
> SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = '<iam_role>'
  STORAGE_ALLOWED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `iam_role` is the Amazon Resource Name (ARN) of the role you created in Step 2: Create the IAM role in AWS (in this topic).
* `protocol` is one of the following:

  + `s3` refers to S3 storage in public AWS regions outside of China.
  + `s3china` refers to S3 storage in public AWS regions in China.
  + `s3gov` refers to S3 storage in [government regions](intro-regions.md).
* `bucket` is the name of a S3 bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS
  parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that
  reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that allows access to all buckets in the account but blocks access to the defined `sensitivedata` folders.

Additional external stages that also use this integration can reference the allowed buckets and paths:

```sqlexample
CREATE STORAGE INTEGRATION s3_int
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
  STORAGE_ALLOWED_LOCATIONS = ('*')
  STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket1/mypath1/sensitivedata/', 's3://mybucket2/mypath2/sensitivedata/');
```

> **Note:**
>
> Optionally, use the [STORAGE_AWS_EXTERNAL_ID](../sql-reference/sql/create-storage-integration.md) parameter to specify
> your own external ID. You might choose this option
> to use the same external ID across multiple external volumes and/or storage integrations.

### Step 4: Retrieve the AWS IAM user for your Snowflake account

1. To retrieve the ARN for the IAM user that was created automatically for your Snowflake account, use the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md).

   ```sqlsyntax
   DESC INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 3: Create a Cloud Storage Integration in Snowflake
     (in this topic).

   For example:

   ```sqlexample
   DESC INTEGRATION s3_int;
   ```

   ```output
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   | property                  | property_type | property_value                                                                 | property_default |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------|
   | ENABLED                   | Boolean       | true                                                                           | false            |
   | STORAGE_ALLOWED_LOCATIONS | List          | s3://mybucket1/mypath1/,s3://mybucket2/mypath2/                                | []               |
   | STORAGE_BLOCKED_LOCATIONS | List          | s3://mybucket1/mypath1/sensitivedata/,s3://mybucket2/mypath2/sensitivedata/    | []               |
   | STORAGE_AWS_IAM_USER_ARN  | String        | arn:aws:iam::123456789001:user/abc1-b-self1234                                 |                  |
   | STORAGE_AWS_ROLE_ARN      | String        | arn:aws:iam::001234567890:role/myrole                                          |                  |
   | STORAGE_AWS_EXTERNAL_ID   | String        | MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=                               |                  |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   ```
2. Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `STORAGE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account; for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All S3 storage integrations in your account use that IAM user. |
   | `STORAGE_AWS_EXTERNAL_ID` | The external ID that Snowflake uses to establish a trust relationship with AWS. If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created the storage integration, Snowflake generates an ID for you to use. |

   You provide these values in the next section.

### Step 5: Grant the IAM user permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your AWS Management Console so that you can use a S3 bucket to load and unload data:

1. Sign in to the AWS Management Console.
2. Select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the role you created in Step 2: Create the IAM role in AWS (in this topic).
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC STORAGE INTEGRATION output values you recorded in
   Step 4: Retrieve the AWS IAM user for your Snowflake account (in this topic):

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<snowflake_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   > * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   > * `snowflake_external_id` is the STORAGE_AWS_EXTERNAL_ID value you recorded.
   >
   >   In this example, the `snowflake_external_id` value is `MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=`.
   >
   >   > **Note:**
   >   >
   >   > For security reasons, if you create a new storage integration (or recreate an existing storage integration using the CREATE OR
   >   > REPLACE STORAGE INTEGRATION syntax) without specifying an external ID, the new integration has a *different* external ID and
   >   > can’t resolve the trust relationship unless you update the trust policy.
8. Select Update policy to save your changes.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

### Step 6: Create an external stage

Create an external (i.e. S3) stage that references the storage integration you created in Step 3: Create a Cloud Storage Integration in Snowflake (in this topic).

> **Note:**
>
> Creating a stage that uses a storage integration requires a role that has the CREATE STAGE privilege for the schema as well as the USAGE privilege on the storage integration. For example:
>
> ```sqlexample
> GRANT CREATE STAGE ON SCHEMA public TO ROLE myrole;
>
> GRANT USAGE ON INTEGRATION s3_int TO ROLE myrole;
> ```

Create the stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.

For example, set `mydb.public` as the current database and schema for the user session, and then create a stage named `my_s3_stage`. In this example, the stage references the S3 bucket and path `mybucket1/path1`, which are supported by the integration. The stage also references a named file format object called `my_csv_format`:

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE my_s3_stage
>   STORAGE_INTEGRATION = s3_int
>   URL = 's3://bucket1/path1/'
>   FILE_FORMAT = my_csv_format;
> ```

> **Note:**
>
> * The stage owner (i.e. the role with the OWNERSHIP privilege on the stage) must have the USAGE privilege on the storage integration.
> * Append a forward slash (`/`) to the URL value to filter to the specified folder path. If the forward slash is omitted, all files and
>   folders starting with the prefix for the specified path are included.
>
>   Note that the forward slash is required to access and retrieve unstructured data files in the stage.
> * To load or unload data from or to a stage that uses an integration, a role must have the USAGE privilege on the stage. It is not necessary to also have the USAGE privilege on the storage integration.
> * The STORAGE_INTEGRATION parameter is handled separately from other stage parameters, such as FILE_FORMAT. Support for these other parameters is the same regardless of the integration used to access your S3 bucket.

**Next:** [AWS data file encryption](data-load-s3-encrypt.md)

---
title: Option 1: Load data with the Snowpipe REST API
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-rest-load.md
section: User Guide
---

# Option 1: Load data with the Snowpipe REST API

This topic describes how to call the public REST endpoints to load data and retrieve load history reports. The instructions assume you have completed the setup instructions in
[Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md).

## Load data

Loading takes place in two steps:

Step 1:
:   Stage your data files:

    * Internal stage: Use the [PUT](../sql-reference/sql/put.md) command to stage your files.
    * External stage: Use the client tools provided by the cloud provider to copy your files to the stage location (Amazon S3, Google Cloud Storage, or Microsoft Azure).

Step 2:
:   Submit a request to the [insertFiles](data-load-snowpipe-rest-apis.md) REST endpoint to load the staged data files.

    For your convenience, sample Java and Python programs that illustrate how to submit a REST endpoint are provided in this topic.

### Sample program for the Java SDK

```java
import net.snowflake.ingest.SimpleIngestManager;
import net.snowflake.ingest.connection.HistoryRangeResponse;
import net.snowflake.ingest.connection.HistoryResponse;
import org.bouncycastle.asn1.pkcs.PrivateKeyInfo;
import org.bouncycastle.jce.provider.BouncyCastleProvider;
import org.bouncycastle.openssl.PEMParser;
import org.bouncycastle.openssl.jcajce.JcaPEMKeyConverter;
import org.bouncycastle.openssl.jcajce.JceOpenSSLPKCS8DecryptorProviderBuilder;
import org.bouncycastle.operator.InputDecryptorProvider;
import org.bouncycastle.operator.OperatorCreationException;
import org.bouncycastle.pkcs.PKCS8EncryptedPrivateKeyInfo;
import org.bouncycastle.pkcs.PKCSException;
import java.io.FileReader;
import java.io.IOException;
import java.nio.file.Paths;
import java.security.PrivateKey;
import java.security.Security;
import java.time.Instant;
import java.util.Set;
import java.util.TreeSet;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.TimeUnit;

public class SDKTest
{
  // Path to the private key file that you generated earlier.
  private static final String PRIVATE_KEY_FILE = "/<path>/rsa_key.p8";

  public static class PrivateKeyReader
  {
    // If you generated an encrypted private key, implement this method to return
    // the passphrase for decrypting your private key.
    private static String getPrivateKeyPassphrase() {
      return "<private_key_passphrase>";
    }

    public static PrivateKey get(String filename)
            throws Exception
    {
      PrivateKeyInfo privateKeyInfo = null;
      Security.addProvider(new BouncyCastleProvider());
      // Read an object from the private key file.
      PEMParser pemParser = new PEMParser(new FileReader(Paths.get(filename).toFile()));
      Object pemObject = pemParser.readObject();
      if (pemObject instanceof PKCS8EncryptedPrivateKeyInfo) {
        // Handle the case where the private key is encrypted.
        PKCS8EncryptedPrivateKeyInfo encryptedPrivateKeyInfo = (PKCS8EncryptedPrivateKeyInfo) pemObject;
        String passphrase = getPrivateKeyPassphrase();
        InputDecryptorProvider pkcs8Prov = new JceOpenSSLPKCS8DecryptorProviderBuilder().build(passphrase.toCharArray());
        privateKeyInfo = encryptedPrivateKeyInfo.decryptPrivateKeyInfo(pkcs8Prov);
      } else if (pemObject instanceof PrivateKeyInfo) {
        // Handle the case where the private key is unencrypted.
        privateKeyInfo = (PrivateKeyInfo) pemObject;
      }
      pemParser.close();
      JcaPEMKeyConverter converter = new JcaPEMKeyConverter().setProvider(BouncyCastleProvider.PROVIDER_NAME);
      return converter.getPrivateKey(privateKeyInfo);
    }
  }

  private static HistoryResponse waitForFilesHistory(SimpleIngestManager manager,
                                                     Set<String> files)
          throws Exception
  {
    ExecutorService service = Executors.newSingleThreadExecutor();

    class GetHistory implements
            Callable<HistoryResponse>
    {
      private Set<String> filesWatchList;
      GetHistory(Set<String> files)
      {
        this.filesWatchList = files;
      }
      String beginMark = null;

      public HistoryResponse call()
              throws Exception
      {
        HistoryResponse filesHistory = null;
        while (true)
        {
          Thread.sleep(500);
          HistoryResponse response = manager.getHistory(null, null, beginMark);
          if (response.getNextBeginMark() != null)
          {
            beginMark = response.getNextBeginMark();
          }
          if (response != null && response.files != null)
          {
            for (HistoryResponse.FileEntry entry : response.files)
            {
              //if we have a complete file that we've
              // loaded with the same name..
              String filename = entry.getPath();
              if (entry.getPath() != null && entry.isComplete() &&
                      filesWatchList.contains(filename))
              {
                if (filesHistory == null)
                {
                  filesHistory = new HistoryResponse();
                  filesHistory.setPipe(response.getPipe());
                }
                filesHistory.files.add(entry);
                filesWatchList.remove(filename);
                //we can return true!
                if (filesWatchList.isEmpty()) {
                  return filesHistory;
                }
              }
            }
          }
        }
      }
    }

    GetHistory historyCaller = new GetHistory(files);
    //fork off waiting for a load to the service
    Future<HistoryResponse> result = service.submit(historyCaller);

    HistoryResponse response = result.get(2, TimeUnit.MINUTES);
    return response;
  }

  public static void main(String[] args)
  {
    final String host = "<account_identifier>.snowflakecomputing.com";
    final String user = "<user_login_name>";
    final String pipe = "<db_name>.<schema_name>.<pipe_name>";
    try
    {
      final long oneHourMillis = 1000 * 3600L;
      String startTime = Instant
              .ofEpochMilli(System.currentTimeMillis() - 4 * oneHourMillis).toString();
      final PrivateKey privateKey = PrivateKeyReader.get(PRIVATE_KEY_FILE);
      SimpleIngestManager manager = new SimpleIngestManager(host.split("\.")[0], user, pipe, privateKey, "https", host, 443);
      List<StagedFileWrapper> files = new ArrayList<>();
      // Add the paths and sizes the files that you want to load.
      // Use paths that are relative to the stage where the files are located
      // (the stage that is specified in the pipe definition)..
      files.add(new StagedFileWrapper("<path>/<filename>", <file_size_in_bytes> /* file size is optional but recommended, pass null when it is not available */));
      files.add(new StagedFileWrapper("<path>/<filename>", <file_size_in_bytes> /* file size is optional but recommended, pass null when it is not available */));
      ...
      manager.ingestFiles(files, null);
      HistoryResponse history = waitForFilesHistory(manager, files);
      System.out.println("Received history response: " + history.toString());
      String endTime = Instant
              .ofEpochMilli(System.currentTimeMillis()).toString();

      HistoryRangeResponse historyRangeResponse =
              manager.getHistoryRange(null,
                                      startTime,
                                      endTime);
      System.out.println("Received history range response: " +
                                 historyRangeResponse.toString());

    }
    catch (Exception e)
    {
      e.printStackTrace();
    }

  }
}
```

This example uses the [Bouncy Castle Crypto APIs](https://www.bouncycastle.org/java.html). In order to compile and run this
example, you must include the following JAR files in your classpath:

* the provider JAR file (`bcprov-jdkversions.jar`)
* the PKIX / CMS / EAC / PKCS / OCSP / TSP / OPENSSL JAR file (`bcpkix-jdkversions.jar`)

where `versions` specifies the versions of the JDK that the JAR file supports.

Before you compile the sample code, replace the following placeholder values:

> `PRIVATE_KEY_FILE = "/<path>/rsa_key.p8"`
> :   Specify the local path to the private key file you created in [Use key pair authentication & key rotation](data-load-snowpipe-rest-gs.md) (in [Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md)).
>
> `return "<private_key_passphrase>"` in `getPrivateKeyPassphrase()`
> :   If you generated an encrypted key, implement the `getPrivateKeyPassphrase()` method to return the passphrase for decrypting that key.
>
> `host = "<account_identifier>.snowflakecomputing.com"`
> :   Specify your host information in the form of a URL.
>
>     The preferred format of the account identifier is as follows:
>
>     `organization_name-account_name`
>     :   Names of your Snowflake organization and account. For details, see [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).
>
>     Alternatively, specify your *account locator*, along with the [region](intro-regions.md) and [cloud platform](intro-cloud-platforms.md) where the account is hosted, if required. For details, see [Format 2: Account locator in a region](admin-account-identifier.md).
>
> `user = "<user_login_name>"`
> :   Specify your Snowflake login name.
>
> `pipe = "<db_name>.<schema_name>.<pipe_name>"`
> :   Specify the fully-qualified name of the pipe to use to load the data.
>
> `files.add("<path>/<filename>", <file_size_in_bytes>)`
> :   Specify the path to your files to load in the file objects list.
>
>     Optionally specify the size of each file, in bytes, to avoid delays when Snowpipe calculates the operations required to load the data.
>
>     The path you specify must be relative to the stage where the files are located. Include the complete name for each file, including the file extension. For example, a CSV file that is gzip-compressed might have the extension `.csv.gz`.

### Sample program for the Python SDK

```python
from logging import getLogger
from snowflake.ingest import SimpleIngestManager
from snowflake.ingest import StagedFile
from snowflake.ingest.utils.uris import DEFAULT_SCHEME
from datetime import timedelta
from requests import HTTPError
from cryptography.hazmat.primitives import serialization
from cryptography.hazmat.primitives.serialization import load_pem_private_key
from cryptography.hazmat.backends import default_backend
from cryptography.hazmat.primitives.serialization import Encoding
from cryptography.hazmat.primitives.serialization import PrivateFormat
from cryptography.hazmat.primitives.serialization import NoEncryption
import time
import datetime
import os
import logging

logging.basicConfig(
        filename='/tmp/ingest.log',
        level=logging.DEBUG)
logger = getLogger(__name__)

# If you generated an encrypted private key, implement this method to return
# the passphrase for decrypting your private key.
def get_private_key_passphrase():
  return '<private_key_passphrase>'

with open("/<private_key_path>/rsa_key.p8", 'rb') as pem_in:
  pemlines = pem_in.read()
  private_key_obj = load_pem_private_key(pemlines,
  get_private_key_passphrase().encode(),
  default_backend())

private_key_text = private_key_obj.private_bytes(
  Encoding.PEM, PrivateFormat.PKCS8, NoEncryption()).decode('utf-8')
# Assume the public key has been registered in Snowflake:
# private key in PEM format

ingest_manager = SimpleIngestManager(account='<account_identifier>',
                                     host='<account_identifier>.snowflakecomputing.com',
                                     user='<user_login_name>',
                                     pipe='<db_name>.<schema_name>.<pipe_name>',
                                     private_key=private_key_text)
# List of files, but wrapped into a class
staged_file_list = [
  StagedFile('<path>/<filename>', <file_size_in_bytes>),  # file size is optional but recommended, pass None if not available
  StagedFile('<path>/<filename>', <file_size_in_bytes>),  # file size is optional but recommended, pass None if not available
  ...
  ]

try:
    resp = ingest_manager.ingest_files(staged_file_list)
except HTTPError as e:
    # HTTP error, may need to retry
    logger.error(e)
    exit(1)

# This means Snowflake has received file and will start loading
assert(resp['responseCode'] == 'SUCCESS')

# Needs to wait for a while to get result in history
while True:
    history_resp = ingest_manager.get_history()

    if len(history_resp['files']) > 0:
        print('Ingest Report:\n')
        print(history_resp)
        break
    else:
        # wait for 20 seconds
        time.sleep(20)

    hour = timedelta(hours=1)
    date = datetime.datetime.utcnow() - hour
    history_range_resp = ingest_manager.get_history_range(date.isoformat() + 'Z')

    print('\nHistory scan report: \n')
    print(history_range_resp)
```

Before you execute the sample code, replace the following placeholder values:

> `<private_key_path>`
> :   Specify the local path to the private key file you created in [Use key pair authentication & key rotation](data-load-snowpipe-rest-gs.md) (in [Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md)).
>
> `return "<private_key_passphrase>"` in `get_private_key_passphrase()`
> :   If you generated an encrypted key, implement the `get_private_key_passphrase()` function to return the passphrase for decrypting that key.
>
> `account='<account_identifier>'`
> :   Specify the unique identifier for your account (provided by Snowflake). See the `host` description.
>
> `host='<account_identifier>.snowflakecomputing.com'`
> :   Specify the unique hostname for your Snowflake account.
>
>     The preferred format of the account identifier is as follows:
>
>     `organization_name-account_name`
>     :   Names of your Snowflake organization and account. For details, see [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).
>
>     Alternatively, specify your *account locator*, along with the [region](intro-regions.md) and [cloud platform](intro-cloud-platforms.md) where the account is hosted, if required. For details, see [Format 2: Account locator in a region](admin-account-identifier.md).
>
> `user='<user_login_name>'`
> :   Specify your Snowflake login name.
>
> `pipe='<db_name>.<schema_name>.<pipe_name>'`
> :   Specify the fully-qualified name of the pipe to use to load the data.
>
> `file_list=['<path>/<filename>', '<path>/<filename>']` | `staged_file_list=[StagedFile('<path>/<filename>', <file_size_in_bytes>), StagedFile('<path>/<filename>', <file_size_in_bytes>)]`
> :   Specify the path to your files to load in the file objects list.
>
>     The path you specify must be relative to the stage where the files are located. Include the complete name for each file, including the file extension. For example, a CSV file that is gzip-compressed might have the extension `.csv.gz`.
>
>     Optionally specify the size of each file, in bytes, to avoid delays when Snowpipe calculates the operations required to load the data.

## View the load history

Snowflake provides [REST endpoints](data-load-snowpipe-rest-apis.md) and an [Snowflake Information Schema](../sql-reference/info-schema.md) table function for viewing your load history:

* REST endpoints:

  + [insertReport](data-load-snowpipe-rest-apis.md)
  + [loadHistoryScan](data-load-snowpipe-rest-apis.md)
* Information Schema table function:

  + [COPY_HISTORY](../sql-reference/functions/copy_history.md)
* Account Usage view:

  + [COPY_HISTORY](../sql-reference/account-usage/copy_history.md)

Note that querying either the Information Schema table function or Account Usage view, unlike calling the REST endpoints, requires a running warehouse.

## Delete staged files

Delete the staged files after you successfully load the data and no longer require the files. For instructions, see
[Deleting staged files after Snowpipe loads the data](data-load-snowpipe-manage.md).

---
title: Option 2: Automate Snowpipe with AWS Lambda
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-rest-lambda.md
section: User Guide
---

# Option 2: Automate Snowpipe with AWS Lambda

AWS Lambda is a compute service that runs when triggered by an event and executes code that has been loaded into the system. You can adapt the sample Python code provided in this topic and create a Lambda
function that calls the Snowpipe REST API to load data from your external stage (i.e. S3 bucket; Azure containers are not supported). The function is deployed to your AWS account, where it is hosted. Events
you define in Lambda (e.g. when files in your S3 bucket are updated) invoke the Lambda function and run the Python code.

This topic describes the steps necessary to configure a Lambda function to automatically load data in micro-batches continuously using Snowpipe.

> **Note:**
>
> This topic assumes you have configured Snowpipe using the instructions in [Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md).

## Step 1: Write Python code invoking the Snowpipe REST API

**Sample Python code**

```python
from __future__ import print_function
from snowflake.ingest import SimpleIngestManager
from snowflake.ingest import StagedFile
from requests import HTTPError
from cryptography.hazmat.primitives import serialization
from cryptography.hazmat.primitives.serialization import load_pem_private_key
from cryptography.hazmat.primitives.serialization import Encoding
from cryptography.hazmat.primitives.serialization import PrivateFormat
from cryptography.hazmat.primitives.serialization import NoEncryption
from cryptography.hazmat.backends import default_backend

import os

with open("./rsa_key.p8", 'rb') as pem_in:
  pemlines = pem_in.read()
  private_key_obj = load_pem_private_key(pemlines,
  os.environ['PRIVATE_KEY_PASSPHRASE'].encode(),
  default_backend())

private_key_text = private_key_obj.private_bytes(
  Encoding.PEM, PrivateFormat.PKCS8, NoEncryption()).decode('utf-8')
# Assume the public key has been registered in Snowflake:
# private key in PEM format

# List of files in the stage specified in the pipe definition
ingest_manager = SimpleIngestManager(account='<account_identifier>',
                   host='<account_identifier>.snowflakecomputing.com',
                   user='<user_login_name>',
                   pipe='<db_name>.<schema_name>.<pipe_name>',
                   private_key=private_key_text)

def handler(event, context):
  for record in event['Records']:
    bucket = record['s3']['bucket']['name']
    key = record['s3']['object']['key']

    print("Bucket: " + bucket + " Key: " + key)
    # List of files in the stage specified in the pipe definition
    # wrapped into a class
    staged_file_list = []
    staged_file_list.append(StagedFile(key, None))

    print('Pushing file list to ingest REST API')
    resp = ingest_manager.ingest_files(staged_file_list)
```

> **Note:**
>
> The sample code does not account for error handling. For example, it does not retry failed `ingest_manager` calls.

Before using the sample code, make the following changes:

1. Update the security parameter:

   > `private_key=""" / -----BEGIN RSA PRIVATE KEY----- / ... / -----END RSA PRIVATE KEY----- """`
   > :   Specifies the content of the private key file you created in [Use key pair authentication & key rotation](data-load-snowpipe-rest-gs.md) (in [Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md)).
   >
   > Specify the passphrase for decrypting the private key file using the `PRIVATE_KEY_PASSPHRASE` environment variable:
   >
   > > * Linux or macOS:
   > >
   > >   > ```bash
   > >   > export PRIVATE_KEY_PASSPHRASE='<passphrase>'
   > >   > ```
   > > * Windows:
   > >
   > >   > ```bash
   > >   > set PRIVATE_KEY_PASSPHRASE='<passphrase>'
   > >   > ```
2. Update the session parameters:

   > `account='<account_identifier>'`
   > :   Specify the unique identifier for your account (provided by Snowflake). See the `host` description.
   >
   > `host='<account_identifier>.snowflakecomputing.com'`
   > :   Specify the unique hostname for your Snowflake account.
   >
   >     The preferred format of the account identifier is as follows:
   >
   >     `organization_name-account_name`
   >     :   Names of your Snowflake organization and account. For details, see [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).
   >
   >     Alternatively, specify your *account locator*, along with the [region](intro-regions.md) and [cloud platform](intro-cloud-platforms.md) where the account is hosted, if required. For details, see [Format 2: Account locator in a region](admin-account-identifier.md).
   >
   > `user='<user_login_name>'`
   > :   Specifies the login name of the Snowflake user that will run the Snowpipe code.
   >
   > `pipe='<db_name>.<schema_name>.<pipe_name>'`
   > :   Specifies the fully-qualified name of the pipe to use to load the data, in the form of `<db_name>.<schema_name>.<pipe_name>`.
3. Specify the path to your files to import in the file objects list:

   > `staged_file_list = []`
   > :   The path you specify must be relative to the stage where the files are located. Include the complete name for each file, including the file extension. For example, a CSV file that is
   >     gzip-compressed might have the extension `.csv.gz`.
4. Save the file in a convenient location.

The remaining instructions in this topic assume the file name to be `SnowpipeLambdaCode.py`.

## Step 2: Create a Lambda function deployment package

Complete the following instructions to build a Python runtime environment for Lambda and add the Snowpipe code you adapted in Step 1: Write Python Code Invoking the Snowpipe REST API (in this topic).
For more information about these steps, see the [AWS Lambda deployment package documentation](http://docs.aws.amazon.com/lambda/latest/dg/with-s3-example-deployment-pkg.html) (see the instructions for
Python).

> **Important:**
>
> The scripts in the following steps are a representative example and assume that you are creating an AWS EC2 Linux instance based on an Amazon Machine Instance (AMI) that uses the YUM package manager, which depends on RPM. If you select a Debian-based Linux AMI, please update your scripts accordingly.

1. Create an AWS EC2 Linux instance by completing the [AWS EC2 instructions](http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/EC2_GetStarted.html#ec2-launch-instance). This instance will provide the
   compute resources to run the Snowpipe code.
2. Copy the Snowpipe code file to your new AWS EC2 instance using SCP (Secure Copy):

   > ```bash
   > scp -i key.pem /<path>/SnowpipeLambdaCode.py ec2-user@<machine>.<region_id>.compute.amazonaws.com:~/SnowpipeLambdaCode.py
   > ```

   Where:

   > * `<path>` is the path to your local `SnowpipeLambdaCode.py` file.
   > * `<machine>.<region_id>` is the DNS name of the EC2 instance (e.g. `ec2-54-244-54-199.us-west-2.compute.amazonaws.com`).
   >
   >   The DNS name is displayed on the Instances screen in the Amazon EC2 console.
3. Connect to the EC2 instance using SSH (Secure SHell):

   > ```bash
   > ssh -i key.pem ec2-user@<machine>.<region_id>.compute.amazonaws.com
   > ```
4. Install Python and related libraries on the EC2 instance:

   > ```bash
   > sudo yum install -y gcc zlib zlib-devel openssl openssl-devel
   >
   > wget https://www.python.org/ftp/python/3.6.1/Python-3.6.1.tgz
   >
   > tar -xzvf Python-3.6.1.tgz
   >
   > cd Python-3.6.1 && ./configure && make
   >
   > sudo make install
   >
   > sudo /usr/local/bin/pip3 install virtualenv
   >
   > /usr/local/bin/virtualenv ~/shrink_venv
   >
   > source ~/shrink_venv/bin/activate
   >
   > pip install Pillow
   >
   > pip install boto3
   >
   > pip install requests
   >
   > pip install snowflake-ingest
   > ```
5. Create the .zip deployment package (`Snowpipe.zip`):

   > ```bash
   > cd $VIRTUAL_ENV/lib/python3.6/site-packages
   >
   > zip -r9 ~/Snowpipe.zip .
   >
   > cd ~
   >
   > zip -g Snowpipe.zip SnowpipeLambdaCode.py
   > ```

## Step 3: Create an AWS IAM role for Lambda

Follow the [AWS Lambda documentation](http://docs.aws.amazon.com/lambda/latest/dg/with-s3-example-create-iam-role.html) to create an IAM role to execute the Lambda function.

Record the [IAM Amazon Resource Name (ARN)](http://docs.aws.amazon.com/IAM/latest/UserGuide/reference_identifiers.html#identifiers-arns) for the role. You will use it in the next step.

## Step 4: Create the Lambda function

Create the Lambda function by uploading the `.zip` deployment package you created in Step 2: Create a Lambda Function Deployment Package (in this topic):

> ```bash
> aws lambda create-function \
> --region us-west-2 \
> --function-name IngestFile \
> --zip-file fileb://~/Snowpipe.zip \
> --role arn:aws:iam::<aws_account_id>:role/lambda-s3-execution-role \
> --handler SnowpipeLambdaCode.handler \
> --runtime python3.6 \
> --profile adminuser \
> --timeout 10 \
> --memory-size 1024
> ```

For `--role`, specify the role ARN you recorded in Step 3: Create an AWS IAM Role for Lambda (in this topic).

Record the ARN for the new function from the output. You will use it in the next step.

## Step 5: Allow calls to the Lambda function

Grant S3 the permissions required to invoke your function.

For `--source-arn`, specify the function ARN you recorded in Step 4: Create the Lambda Function (in this topic).

> ```bash
> aws lambda add-permission \
> --function-name IngestFile \
> --region us-west-2 \
> --statement-id enable-ingest-calls \
> --action "lambda:InvokeFunction" \
> --principal s3.amazonaws.com \
> --source-arn arn:aws:s3:::<SourceBucket> \
> --source-account <aws_account_id> \
> --profile adminuser
> ```

## Step 6: Register the Lambda notification event

Register a Lambda notification event by completing the [Amazon S3 Event Notifications](http://docs.aws.amazon.com/AmazonS3/latest/dev/NotificationHowTo.html) instructions. In the input field, specify the
function ARN you recorded in Step 4: Create the Lambda Function (in this topic).

---
title: Option 2: Configure an AWS IAM role to access Amazon S3 — Deprecated
source: https://docs.snowflake.com/en/user-guide/data-load-s3-config-aws-iam-role.md
section: User Guide
---

# Option 2: Configure an AWS IAM role to access Amazon S3 — *Deprecated*

> **Note:**
>
> You may encounter an `assumeRole` error when using the deprecated authentication method.

This section describes how to configure an S3 bucket, IAM role, and policies for Snowflake to access an external stage in a secure manner on behalf of one or more individual users in your Snowflake account.

As a best practice, limit S3 bucket access to a specific IAM role with the minimum required permissions. The IAM role is created in your AWS account along with the permissions to access your S3 bucket and the trust policy to allow Snowflake to assume the IAM role.

1. An AWS IAM user created for your Snowflake account is associated with an IAM role you configure via a trust relationship.
2. The role is granted limited access to an S3 bucket through IAM policies you configure.

> **Note:**
>
> Completing the instructions in this topic requires administrative access to AWS. If you are not an AWS administrator, ask your AWS administrator
> to perform these tasks.

## Step 1: Configure S3 bucket access permissions

### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and any sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

> **Note:**
>
> The following additional permissions are required to perform additional SQL actions:
>
> | Permission | SQL Action |
> | --- | --- |
> | `s3:PutObject` | Unload files to the bucket. |
> | `s3:DeleteObject` | Either automatically purge files from the stage after a successful load or execute [REMOVE](../sql-reference/sql/remove.md) statements to manually remove files. |

As a best practice, Snowflake recommends creating an IAM policy for Snowflake access to the S3 bucket. You can then attach the policy to the role and use the security credentials generated by AWS for the role to access files in the bucket.

### Create an IAM policy

The following step-by-step instructions describe how to configure access permissions for Snowflake in your AWS Management Console so that you can use an
S3 bucket to load and unload data:

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. Choose Account settings from the left-hand navigation pane.
4. Expand the Security Token Service Regions list, find the AWS region corresponding to the [region](intro-regions.md) where your account is located, and choose Activate if the status is Inactive.
5. Choose Policies from the left-hand navigation pane.
6. Click Create Policy.
7. Click the JSON tab.
8. Add a policy document that will allow Snowflake to access the S3 bucket and folder.

   The following policy (in JSON format) provides Snowflake with the required permissions to load or unload data using a single bucket and folder path. You can also purge data files using the PURGE copy option.

   Copy and paste the text into the policy editor:

   > **Note:**
   >
   > Make sure to replace `bucket` and `prefix` with your actual bucket name and folder path prefix.

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:PutObject",
                 "s3:GetObject",
                 "s3:GetObjectVersion",
                 "s3:DeleteObject",
                 "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   >
   > Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the
   > specified bucket or path in the bucket, respectively.

   Note that AWS policies support a variety of different security use cases.

   The following policy provides Snowflake with the required permissions to load data from a single read-only bucket and folder
   path. The policy includes the `s3:GetBucketLocation`, `s3:GetObject`, `s3:GetObjectVersion`, and
   `s3:ListBucket` permissions:

   **Alternative policy: Load from a read-only S3 bucket**

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:GetObject",
                 "s3:GetObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```
9. Click Review policy.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

## Step 2: Create an AWS IAM role

In the AWS Management Console, create an AWS IAM role that grants privileges on the S3 bucket containing your data files.

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Under An AWS account, select This account. In a later step,
   you modify the trusted relationship and grant access to Snowflake.
5. Select the Require external ID option. Enter a dummy ID such as `0000`. Later, you will modify the trusted relationship and specify the external ID for your Snowflake stage.
6. Click the Next button.
7. Locate the policy you created in Step 1: Configure S3 Bucket Access Permissions (in this topic), and select this policy.
8. Click the Next button.
9. Enter a name and description for the role, and click the Create role button.

   You have now created an IAM policy for a bucket, created an IAM role, and attached the policy to the role.
10. Record the Role ARN value located on the role summary page. In the next step, you will create a Snowflake stage that references this role as the security credentials.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

## Step 3: Create an external stage

Create an external (i.e. S3) stage that references the AWS role you created.

1. Create an external stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command, or you can choose to alter an existing external stage and set the CREDENTIALS option.

   > **Note:**
   > * Credentials are handled separately from other stage parameters such as ENCRYPTION and FILE_FORMAT. Support for these other
   >   parameters is the same regardless of the credentials used to access your external S3 bucket.
   > * Append a forward slash (`/`) to the URL value to filter to the specified folder path. If the forward slash is omitted, all files and
   >   folders starting with the prefix for the specified path are included.
   >
   >   Note that the forward slash is required to access and retrieve unstructured data files in the stage.

   For example, set `mydb.public` as the current database and schema for the user session, and then create a stage named `my_S3_stage`. In this example, the stage references the S3 bucket and path `mybucket/load/files`. Files in the S3 bucket are encrypted with server-side encryption (AWS_SSE_KMS):

   > ```sqlexample
   > USE SCHEMA mydb.public;
   >
   > CREATE STAGE my_s3_stage
   >   URL='s3://mybucket/load/files'
   >   CREDENTIALS = (AWS_ROLE = 'arn:aws:iam::001234567890:role/mysnowflakerole')
   >   ENCRYPTION=(TYPE='AWS_SSE_KMS' KMS_KEY_ID = 'aws/key');
   > ```
2. Execute the [DESCRIBE STAGE](../sql-reference/sql/desc-stage.md) command to view the stage properties:

   > ```sqlexample
   > DESC STAGE my_S3_stage;
   >
   > +--------------------+--------------------------------+---------------+----------------------------------------------------------------+------------------+
   > | parent_property    | property                       | property_type | property_value                                                 | property_default |
   > |--------------------+--------------------------------+---------------+----------------------------------------------------------------+------------------|
   > ...
   > | STAGE_CREDENTIALS  | AWS_ROLE                       | String        | arn:aws:iam::001234567890:role/mysnowflakerole                 |                  |
   > | STAGE_CREDENTIALS  | AWS_EXTERNAL_ID                | String        | MYACCOUNT_SFCRole=2_jYfRf+gT0xSH7G2q0RAODp00Cqw=               |                  |
   > | STAGE_CREDENTIALS  | SNOWFLAKE_IAM_USER             | String        | arn:aws:iam::123456789001:user/vj4g-a-abcd1234                 |                  |
   > +--------------------+--------------------------------+---------------+----------------------------------------------------------------+------------------+
   > ```
3. Record the values for the SNOWFLAKE_IAM_USER and AWS_EXTERNAL_ID properties, where:

   SNOWFLAKE_IAM_USER:
   :   An AWS IAM user created for your Snowflake account. This user is the same for every external S3 stage created in your account.

   AWS_EXTERNAL_ID:
   :   A unique ID assigned to the specific stage. The ID has the following format:

       `snowflakeAccount_SFCRole=snowflakeRoleId_randomId`

   Note that the AWS_ROLE, AWS_EXTERNAL_ID, and SNOWFLAKE_IAM_USER values used in this example are for illustration purposes only.

   In the next step, you will configure your AWS IAM role to grant access to the Snowflake IAM user using the generated AWS external ID.

## Step 4: Configure the AWS IAM role to allow access to the stage

In the AWS Management Console, configure the IAM role using the stage properties you recorded in Step 3: Create an External Stage (in this topic):

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. Choose Roles from the left-hand navigation pane, and click on the role you created in Step 2: Create an AWS IAM Role (in this topic).
4. Click the Trust relationships tab, and click the Edit trust relationship button.
5. In the Policy Document field, update the policy with the property values for the stage:

   * **AWS:** Enter the ARN for the SNOWFLAKE_IAM_USER stage property, i.e. `arn:aws:iam::123456789001:user/vj4g-a-abcd1234` in this example.
   * **sts:ExternalId:** Enter the generated external ID, i.e. `MYACCOUNT_SFCRole=2_jYfRf+gT0xSH7G2q0RAODp00Cqw=` in this example.

     > ```sqljson
     > {
     >     "Version": "2012-10-17",
     >     "Statement": [
     >       {
     >           "Effect": "Allow",
     >           "Principal": {
     >               "AWS": [
     >                   "arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
     >               ]
     >           },
     >           "Action": "sts:AssumeRole",
     >           "Condition": {
     >               "StringEquals": {
     >                   "sts:ExternalId": "MYACCOUNT_SFCRole=2_jYfRf+gT0xSH7G2q0RAODp00Cqw="
     >               }
     >           }
     >       }
     >     ]
     > }
     > ```

     > **Note:**
     >
     > The above trust policy allows a single external stage in your Snowflake account to assume your IAM role. It is the most restrictive trust policy and is therefore the most secure.
     >
     > The permission to assume the IAM role is associated with the external ID. An external ID has the following format:
     >
     > `snowflake_account_SFCRole=snowflake_role_id_random_id`
     >
     > Where:
     >
     > > + `snowflake_account` is the name assigned to your Snowflake account.
     > > + `snowflake_role_id` is an ID assigned to the Snowflake role that created the stage in Step 3: Create an External Stage (in this topic).
     > >
     > >   In the current example, the `snowflake_role_id` value is `2`. This ID is associated with a single role in your Snowflake account. The purpose of this ID is limited to the trust policies for external stages; as such, a mapping of Snowflake roles to IDs is not available. The role ID for a given role is only exposed in the AWS_EXTERNAL_ID value in the DESCRIBE STAGE output. As a best practice, restrict the ability to create external S3 stages to a single Snowflake role.
     > >
     > >   Note that the role that creates a stage is not necessarily the same as the stage owner (i.e. the role that has the OWNERSHIP privilege on the stage). Ownership of the stage can be transferred to a different role later with no corresponding change required to the trust policy.
     >
     > For security reasons, if you create a new storage integration (or recreate an existing storage integration using the CREATE OR
     > REPLACE STORAGE INTEGRATION syntax), the resulting integration has a different external ID and so it cannot assume the IAM role
     > unless the trust policy is modified.
     >
     > If you require a trust policy with a less secure set of restrictions (i.e. a policy that supports all external stages in your account), replace `random_id` in the external ID with a wildcard character (`*`):
     >
     > > `snowflake_account_SFCRole=snowflake_role_id_*`, e.g. `MYACCOUNT_SFCRole=2_*` in the current example.
     >
     > This form of the external ID allows any external S3 stage created by a user in your account with the same Snowflake role (i.e. SYSADMIN) to assume the IAM role, and in turn any S3 bucket the IAM role has access to. Note that if you implement this less secure type of trust policy, you must change the `Condition` from `StringEquals` to `StringLike`.
6. Click the Update Trust Policy button.

You have now completed the one-time setup to access your S3 bucket using an AWS role.

**Next:** [AWS data file encryption](data-load-s3-encrypt.md)

---
title: Option 3: Configure AWS IAM user credentials to access Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-s3-config-aws-iam-user.md
section: User Guide
---

# Option 3: Configure AWS IAM user credentials to access Amazon S3

This section describes how to configure a security policy for an S3 bucket and access credentials for a specific IAM user to access an external stage in a secure manner.

## Step 1: Configure an S3 bucket access policy

### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and any sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

> **Note:**
>
> The following additional permissions are required to perform additional SQL actions:
>
> | Permission | SQL Action |
> | --- | --- |
> | `s3:PutObject` | Unload files to the bucket. |
> | `s3:DeleteObject` | Either automatically purge files from the stage after a successful load or execute [REMOVE](../sql-reference/sql/remove.md) statements to manually remove files. |

As a best practice, Snowflake recommends creating an IAM policy and user for Snowflake access to the S3 bucket. You can then attach the policy to the user and use the security credentials generated by AWS for the user to access files in the bucket.

### Create an IAM policy

The following step-by-step instructions describe how to configure access permissions for Snowflake in your AWS Management Console so that you can use an
S3 bucket to load and unload data:

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add the policy document that will allow Snowflake to access the S3 bucket and folder.

   The following policy (in JSON format) provides Snowflake with the required access permissions for the specified bucket and folder path. You can copy and paste the text into the policy editor:

   > **Note:**
   > * Make sure to replace `bucket` and `prefix` with your actual bucket name and folder path prefix.
   > * The Amazon Resource Names (ARN) for buckets in
   >   [government regions](intro-regions.md) have a `arn:aws-us-gov:s3:::` prefix.

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:PutObject",
                 "s3:GetObject",
                 "s3:GetObjectVersion",
                 "s3:DeleteObject",
                 "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket_name>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket_name>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   >
   > Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the
   > specified bucket or path in the bucket, respectively.
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

## Step 2: Create an AWS IAM user

1. Choose Users from the left-hand navigation pane, then click Add user.
2. On the Add user page, enter a new user name (e.g. `snowflake1`). Select Programmatic access as the access type, then click Next:
3. Click Attach existing policies directly, and select the policy you created earlier. Then click Next:
4. Review the user details, then click Create user.
5. Record the access credentials. The easiest way to record them is to click Download Credentials to write them to a file (e.g. `credentials.csv`)

   > > **Attention:**
   > >
   > > Once you leave this page, the Secret Access Key will no longer be available anywhere in the AWS console. If you lose the key, you must generate a new set of credentials for the user.

You have now:

* Created an IAM policy for a bucket.
* Created an IAM user and generated access credentials for the user.
* Attached the policy to the user.

With the AWS key and secret key for the S3 bucket, you have the credentials necessary to access your S3 bucket in Snowflake using an external stage.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

## Step 3: Create an external (i.e. S3) stage

Create an external stage that references the AWS credentials you created.

Create the stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command, or you can choose to alter an existing external stage and set the CREDENTIALS option.

> **Note:**
>
> Credentials are handled separately from other stage parameters such as ENCRYPTION and FILE_FORMAT. Support for these other parameters is the same regardless of the credentials used to access your external S3 bucket.

For example, set `mydb.public` as the current database and schema for the user session, and then create a stage named `my_S3_stage`. In this example, the stage references the S3 bucket and path `mybucket/load/files`. Files in the S3 bucket are encrypted with server-side encryption (AWS_SSE_KMS):

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE OR REPLACE STAGE my_S3_stage
>   URL='s3://mybucket/load/files/'
>   CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z')
>   ENCRYPTION=(TYPE='AWS_SSE_KMS' KMS_KEY_ID = 'aws/key');
> ```

**Next:** [AWS data file encryption](data-load-s3-encrypt.md)

---
title: Organization accounts
source: https://docs.snowflake.com/en/user-guide/organization-accounts.md
section: User Guide
---

# Organization accounts

The *organization account* is a special type of account that organization administrators use to perform tasks that affect the entire
organization. For example, administrators use the organization account to do the following:

* View organization-level data collected from all accounts in the organization, including the query history from each account.
* Manage organization-level objects — for example, organization users — for all accounts, or a subset of accounts, in an organization.
* Enable Snowflake Marketplace terms for the entire organization.
* Manage the lifecycle of accounts in an organization, including creating and deleting accounts.
* Enable replication for an account.

There is only one organization account for an organization.

## Features available in the organization account

This section describes features that are available from the organization account.

See the following sections for more information about each feature:

* Premium views
* Organization users and user groups

### Premium views

The ORGANIZATION_USAGE schema in the organization account contains views that are not available in the ORGANIZATION_USAGE schema of a
regular account. These additional views are called *premium views*, which are available by default when you create the organization account.
These premium views provide organization-level data that isn’t otherwise available in a single view. For example, you can query the
TAG_REFERENCES premium view to learn how tags are used throughout the organization, not just in a specific account.

For more information, including costs associated with premium views, see [Premium views in the organization account](organization-accounts-premium-views.md).

### Organization users and user groups

Organizations with more than one account sometimes need someone to manage a user or role in multiple accounts. If you don’t want to create
a separate user or role in each account, then you can create an organization user and organization user group in the organization account.

For more information, see [Organization users](organization-users.md).

## Compliance considerations for hybrid organizations

An organization can have accounts in both regulated regions and non-regulated regions. For example, an organization can have one account in
a [U.S. SnowGov Region](intro-regions.md) and another in a [commercial region](intro-regions.md).
These organizations are called *hybrid organizations*.

Features associated with an organization account might result in [metadata](../sql-reference/metadata.md) moving from one region to another.
For example, [premium views](organization-accounts-premium-views.md) might move associated metadata from a regular account’s
region to the region of the organization account. Metadata associated with an organization-level object — for example, an
[organization user](organization-users.md) — might move from the region of the organization account to the region of an
account that imports the object. For hybrid organizations, this means that metadata might move between a regulated region and a non-regulated
region.

If you have a hybrid organization, Snowflake recommends the following actions:

* Create your organization account in the regulated region.
* Don’t define an organization-level object with sensitive or regulated data.

Compliance standards, such as [FedRAMP](cert-fedramp.md), and support for different regulated workloads, such as
[ITAR](cert-itar.md), might be different or unavailable outside of your U.S. SnowGov Region. Consider your compliance
requirements before choosing to move or share data between Snowflake regions.

## About administrator roles and assignable privileges

Organization administrators use the GLOBALORGADMIN role in the organization account to perform all organization-level tasks, including
administration of the organization account itself.

> **Note:**
>
> Before the introduction of the organization account, organization administrators used the ORGADMIN role in an ORGADMIN-enabled account to
> perform organization-level tasks. Using the ORGADMIN role in an ORGADMIN-enabled account is being phased out. Use the GLOBALORGADMIN role
> in the organization account to perform organization-level tasks.
>
> Snowflake will send a notification email to customers at least three months prior to phasing out the ORGADMIN role.

The GLOBALORGADMIN role can assign privileges to other roles to let other users perform organization-level tasks. In the organization
account, the GLOBALORGADMIN role can assign the following privileges:

* APPLY TAG
* MANAGE ACCOUNTS
* MANAGE LISTING AUTO FULFILLMENT
* MANAGE ORGANIZATION CONTACTS
* MANAGE ORGANIZATION TERMS
* PURCHASE DATA EXCHANGE LISTING

These privileges are set on the account level. For example, to assign the MANAGE ACCOUNTS privilege to the role `custom_role`, execute the
following:

```sqlexample
USE ROLE GLOBALORGADMIN;

GRANT MANAGE ACCOUNTS ON ACCOUNT TO ROLE custom_role;
```

For more information about these privileges, see [Access control privileges](security-access-control-privileges.md).

## Create the organization account

Before create the organization account, consider the following details:

* If your organization includes an account in a [U.S. SnowGov Region](intro-regions.md), Snowflake recommends
  creating the organization account in this regulated region. For more information about the implication of having an organization with
  both regulated and non-regulated accounts, see Compliance considerations for hybrid organizations.
* Creating the organization account results in the ORGANIZATION_USAGE schema being populated with data, which
  [incurs additional costs](organization-accounts-premium-views.md) for your organization.
* You can’t convert an existing ORGADMIN enabled account to be the organization account.

To create the organization account:

1. Choose an existing account from which you will create the organization account. This existing account must have the
   [ORGADMIN role enabled](organization-administrators.md).
2. Sign in to the account you are using to create the organization account.
3. Switch to the ORGADMIN role. For example:

   ```sqlexample
   USE ROLE ORGADMIN;
   ```
4. Execute the [CREATE ORGANIZATION ACCOUNT](../sql-reference/sql/create-organization-account.md) command. For example:

   ```sqlexample
   CREATE ORGANIZATION ACCOUNT myorgaccount
       ADMIN_NAME = admin
       ADMIN_PASSWORD = 'TestPassword1'
       EMAIL = 'myemail@myorg.org'
       MUST_CHANGE_PASSWORD = true
       EDITION = enterprise;
   ```

> **Note:**
>
> Snowflake does not support custom account locators for organization accounts. For alternatives, contact your Snowflake representative.

## Delete the organization account

If you want to drop the organization account in your multi-account organization, then contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Note:**
>
> New functionality in Snowflake that includes organization-level administrative tasks will require an organization account. If you are
> concerned about the costs associated with [premium views](organization-accounts-premium-views.md), contact
> [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request that they be disabled instead of deleting the account.

## Move the organization account to a different region

You can move an organization account between regions as long as those regions are in either the PUBLIC region group or a VPS region group.

Snowflake uses replication groups to move objects from the organization account in the source region to the organization account in the new
region. As a result, only objects that can be replicated are moved with the organization account and there are replication costs associated
with the move. For a list of objects that can be moved with the organization account, see [Replicated objects](account-replication-intro.md).

> **Note:**
>
> [Organization profiles](collaboration/organization-profiles/org-profiles-create-manage.md) move with the organization
> account to the new region.

Moving the organization account to a different region is a two-step process:

1. Call the [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](../sql-reference/functions/system_initiate_move_organization_account.md) function from the organization account to start
   the process of moving it. Snowflake begins replicating objects to the new region.

   The function accepts a temporary account name, the new region, and a list of objects to move as its arguments. For example:

   ```sqlexample
   CALL SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT(
     'MY_TEMP_NAME',
     'aws_us_west_2',
     'ALL');
   ```
2. When you have verified that the data in the organization account has been successfully replicated in the new region, call the
   [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](../sql-reference/functions/system_commit_move_organization_account.md) function to finalize the move, specifying a grace period
   after which the original organization account is deleted.

   For example, the following call finalizes the move, and specifies that the original organization account in the source region will
   be deleted after 14 days.

   ```sqlexample
   CALL SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT(14);
   ```

At any point, you can view the status of an attempt to move an organization account by calling the
[SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](../sql-reference/functions/system_show_move_organization_account_status.md) function.

> **Note:**
>
> When an organization account is moved, the views in the ORGANIZATION_USAGE schema must be repopulated with data, a process that can take up
> to one week.

## Limitations

Replication isn’t fully supported for the organization account; some objects can’t be replicated to or from the organization account.

---
title: Organization administrators
source: https://docs.snowflake.com/en/user-guide/organization-administrators.md
section: User Guide
---

# Organization administrators

Organization administrators perform organization-level tasks such as managing accounts and viewing organization-level usage information.
Currently, there are two ways to perform organization-level tasks:

* Use the GLOBALORGADMIN role in the organization account
* Use the ORGADMIN role

## Using the GLOBALORGADMIN role

Multi-account organizations should use the GLOBALORGADMIN in the [organization account](organization-accounts.md) to perform
organization-level tasks. A user with the GLOBALORGADMIN role is also known as the global organization administrator.

To perform tasks as the global organization administrator, do the following:

1. Sign in to the organization account.
2. Do one of the following:

   * If you are performing tasks in a SQL worksheet, execute the following command:

     ```sqlexample
     USE ROLE GLOBALORGADMIN;
     ```
   * If you are performing other tasks in Snowsight, [switch your active role](ui-snowsight-gs.md) to
     GLOBALORGADMIN.

## Using the ORGADMIN role

> **Important:**
>
> Using the ORGADMIN role in an ORGADMIN-enabled account is being phased out for multi-account organizations. Strongly consider using the
> GLOBALORGADMIN role in the organization account to perform organization-level tasks.
>
> Snowflake will send a notification email to customers at least three months prior to phasing out the ORGADMIN role.

To perform tasks with the ORGADMIN role, do the following:

1. Sign in to an ORGADMIN-enabled account.
2. Do one of the following:

   * If you are performing tasks in a SQL worksheet, execute the following command:

     ```sqlexample
     USE ROLE ORGADMIN;
     ```
   * If you are performing other tasks in Snowsight, [switch your active role](ui-snowsight-gs.md) to ORGADMIN.

## Enabling the ORGADMIN role in an account

The first account in an organization has the ORGADMIN role enabled. You can use this account to enable the role in other accounts. For example, to
enable the ORGADMIN role for an account `my_account1`, the organization administrator can execute the following
command from an account that already has the ORGADMIN role enabled:

```sqlexample
USE ROLE ORGADMIN;

ALTER ACCOUNT my_account1 SET IS_ORG_ADMIN = TRUE;
```

Keep the following in mind when enabling the ORGADMIN role:

* The ALTER ACCOUNT syntax only accepts the [account name format](admin-account-identifier.md) of the account identifier. You cannot use the
  account locator to specify the account.
* By default, the ORGADMIN role can be enabled in a maximum of eight accounts. If your organization requires more accounts with the ORGADMIN
  role, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* The ORGADMIN role cannot be enabled for a [reader account](data-sharing-reader-create.md).

## Disable the ORGADMIN role

You can prevent regular, ORGADMIN-enabled accounts from being used to perform organization-level tasks.
To accomplish this, execute the ALTER ACCOUNT command to remove the ORGADMIN role from the account. For example, if you want to stop using
the `account_123` account to perform organization-level tasks, do the following:

1. Sign in to a **different** ORGADMIN-enabled account.
2. Assume the ORGADMIN role:

   ```sqlexample
   USE ROLE ORGADMIN;
   ```
3. Execute the following command:

   ```sqlexample
   ALTER ACCOUNT account_123 SET IS_ORG_ADMIN = FALSE;
   ```

The ALTER ACCOUNT syntax only accepts the [account name format](admin-account-identifier.md) of the account identifier. You cannot use the
account locator to specify the account.

> **Note:**
>
> Currently, you cannot disable the ORGADMIN role if it is the last account that has the role enabled. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Organization listing manifest reference
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-manifest-reference.md
section: User Guide
---

# Organization listing manifest reference

As a provider, you can use organizational listings to share data products securely within your organization.
A manifest, written in YAML (<https://yaml.org/spec/>) is required to create organization listings programmatically.
Use the information provided here to learn about the manifest format and its individual fields.

Organization listing fields are part of the larger [Listing manifest reference](../../../../progaccess/listing-manifest-reference.md).
To add or modify organizational listing fields programatically locate and modify the listing manifest using
[DESCRIBE LISTING](../../../../sql-reference/sql/desc-listing.md) and [ALTER LISTING](../../../../sql-reference/sql/alter-listing.md) commands.

## Organization listing manifest

> **Note:**
>
> Organizational listing fields can be one of the following:
>
> * Optional - Optional for organizational listings.
> * Required - Required for organizational listings.

The general format of a organization listing manifest is:

```yaml
#
# Organization listing manifest
#
title: <Required listing title>
description: <listing description>
resources: <optional listing resources>
listing_terms: <optional listing terms>
data_dictionary: <optional data dictionary>
usage_examples: <optional usage examples>
data_attributes: <optional data attributes>
organization_profile: <Optional custom organization profile. Default "INTERNAL">
organization_targets:
  - # Required
support_contact: "<support email address>"
  - # Required
approver_contact: "<approver email address"
  - # Required when the organization_targets includes the organization_targets.discover field
request_approval_type:
  - # Optional. Can be REQUEST_AND_APPROVE_IN_SNOWFLAKE or REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE.
locations:
  - # Optional list of regions to share into.
auto_fulfillment:
  - # Required when the target accounts are outside the provider's region, otherwise optional.
resharing:
  enabled: true # Optional. Controls whether the listing can be reshared by consumers.
```

## Organization listing fields

Organization listing manifests include a prefix, followed by a set of required and optional fields.

### Organization listing prefix

Each organization listing manifest starts with the following fields:

* `title` (String, required, maximum length 110): Listing title.
* `description` (String, optional, maximum length 7500): Listing description. Markdown syntax is supported.
* `resources` (String, optional): Resources for the listing.
* `listing_terms` (parent with child fields, optional): Terms for the listing.
* `organization_profile` (String, optional): Optional custom organization profile. Defaults to INTERNAL if not specified.

### `resources`

Resources for the listing.

The **optional** `resources` field contains the following name value pairs:

* `resources.documentation` (String, required ): A fully qualified link to a page on your website with more detailed documentation for the listing.
  Must start with `http` or `https`.
* `resources.media` (String, optional): A fully qualified link to an unlisted or public YouTube video for the listing.

For more information about the type of information you can include with this field, see [Details](../../../../collaboration/provider-listings-reference.md).

#### `resources` example

```yaml
. . .
resources:
  documentation: https://www.example.com/documentation/
  media: https://www.youtube.com/watch?v=MEFlT3dc3uc
. . .
```

### `listing_terms`

Defines the terms of service for the listing.

The **optional** `listing_terms` field contains the following name value pairs:

* `listing_terms.type`

  + `CUSTOM` - Only `CUSTOM` is supported. If `listing_terms.type` is specified, then you must also specify a value for `listing_terms.link`.
* `listing_terms.link`: A fully qualified link to the provider’s listing terms, which must start with `http` or `https`.

For more information, refer to **Terms of Service** in the table in [Basic information](../../../../collaboration/provider-listings-reference.md).

> **Note:**
>
> Consumers can accept listing terms programmatically. For more information contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

#### `listing_terms` example

```yaml
. . .
listing_terms:
  type: "CUSTOM"
  link: "http://example.com/my/listing/terms"
. . .
```

### `data_dictionary`

The **optional** `data_dictionary` field provides information on data preview and column types for the objects within the listing.

The `data_dictionary` field contains a list of up to five data dictionary entries:

* `data_dictionary.featured` (required when using `data_dictionary`): Must be ‘featured’.
* `data_dictionary.featured.database` (required when using `data_dictionary`): The database name.
* `data_dictionary.featured.objects` (required when using `data_dictionary`): A list of the following name value pairs:

  + `name` (string, required): The object name.
  + `schema` (string, required): The schema associated with the data dictionary.
  + `domain` (required):

    One of the following:

    - DATABASE
    - SCHEMA
    - TABLE
    - VIEW
    - EXTERNAL_TABLE
    - MATERIALIZED_VIEW
    - DIRECTORY_TABLE
    - FUNCTION
    - COLUMN

For more information see [Data product - data dictionary](../../../../collaboration/provider-listings-reference.md).

### `data_dictionary` example

```yaml
. . .
data_dictionary:
 featured:
    database: "WEATHERDATA"
    objects:
       - name: "GLOBAL_WEATHER"
         schema: "PUBLIC"
         domain: "TABLE"
       - name: "GLOBAL_WEATHER_REPORT"
         schema: "PUBLIC"
         domain: "TABLE"
. . .
```

### `usage_examples`

The **optional** `usage_examples` field contains a list of the following name value pairs:

* `usage.title` (String, required): The usage example title. The maximum length is 110 characters.
* `usage.description` (String, optional): A description of the usage example. The maximum length is 300 characters.
* `usage.query` (String, required): The query associated with the usage example. The maximum length is 30000 characters.

For more information, see [Sample SQL queries](../../../../collaboration/provider-listings-reference.md).

#### `usage_examples` example

```yaml
. . .
usage_examples:
  - title: "Return all weather for the US"
    description: "Example of how to select weather information for the United States"
    query: "select * from weather where country_code='USA'";
. . .
```

### `data_attributes`

Data attributes provide consumers with listing information.

The **optional** `data_attributes` field contains the following name value pairs:

* `data_attributes.refresh_rate` (required)

  One of the following: Specifies how often your data product is updated in Snowflake.

  + CONTINUOUSLY
  + HOURLY
  + DAILY
  + WEEKLY
  + MONTHLY
  + QUARTERLY
  + ANNUALLY
  + STATIC
* `data_attributes.geography` (required):

  Specifies geographic information for the data product.

  > + `granularity` (string, required)
  >
  >   Geographic coverage of the dataset.
  >
  >   One of the following:
  >
  >   - LATITUDE_LONGITUDE
  >   - ADDRESS
  >   - POSTAL_CODE
  >   - CITY
  >   - COUNTY
  >   - STATE
  >   - COUNTRY
  >   - REGION_CONTINENT
  > + `geo_option` (string, required)
  >
  >   One of the following:
  >
  >   - NOT_APPLICABLE
  >   - GLOBAL
  >   - COUNTRIES
  > + `coverage` (required based on selection of `geo_option`):
  >
  >   - `states` (list of states) containing any list of valid U.S. state names.
  >
  >   Or
  >
  >   - `continents` (list of continents):
  >
  >     Any of the following:
  >
  >     * ASIA
  >     * EUROPE
  >     * AFRICA
  >     * NORTH AMERICA
  >     * SOUTH AMERICA
  >     * OCEANIA
  >     * ANTARCTICA
  > + `time` (required):
  >
  >   Specifies the time period for the data product.
  >
  >   - `granularity` (required)
  >
  >   One of the following:
  >
  >   - EVENT_BASED
  >   - HOURLY
  >   - DAILY
  >   - WEEKLY
  >   - MONTHLY
  >   - YEARLY
  >   - `time_range` (required) containing the following name/value pairs:
  >
  >     * `time_frame` (required)
  >
  >       One of the following:
  >
  >       + NEXT
  >       + LAST
  >       + BETWEEN
  >     * `unit` (required)
  >
  >       > One of the following:
  >       >
  >       > + DAYS
  >       > + WEEKS
  >       > + MONTHS
  >       > + YEARS
  >     > * `value` (required when `time_frame` is NEXT/LAST, integer). The range is 1 to 100.
  >     > * `start_time` (required when `time_frame` is BETWEEN, String date). The start time for the data product. The format is MM-DD-YYYY.
  >     > * `end_time` (required when `time_frame` is BETWEEN, String date), format MM-DD-YYYY.

For additional information on data product attributes, see [Data product - attributes](../../../../collaboration/provider-listings-reference.md).

#### `data_attributes` example

```yaml
. . .
data_attributes:
  refresh_rate: DAILY
  geography:
    granularity:
      - REGION_CONTINENT
    geo_option: COUNTRIES
    coverage:
      continents:
        ASIA:
          - INDIA
          - CHINA
        NORTH AMERICA:
          - UNITED STATES
          - CANADA
        EUROPE:
          - UNITED KINGDOM
    time:
      granularity: MONTHLY
      time_range:
        time_frame: LAST
        unit: MONTHS
        value: 6
```

### `organization_targets`

The **required** `organization_targets` field defines who can discover and access the listing.

Contains the `discovery` and `access` fields, one of which must be specified.

`discovery`
:   **Required** when `access` isn’t specified, but otherwise **optional**.
    Defines who can discover the listing. When not present no accounts can discover the listing.

`access`
:   **Required** when `discovery` isn’t specified, but otherwise **optional**.
    Defines who can access the listing.

Both `discovery` and `access`, contain the same child fields.

Either:

`all_internal_accounts : {true | false}`
:   When `true`, all internal accounts can find or access the listing. When `false` no accounts can find or access the listing.

Or an array of accounts, followed by the optional `roles` array within the specified accounts.
:   `- account: "<account_name>"`

When `roles` is present, it specifies a list of roles within the account that can access or discover the listing. For example:

> …
> `roles: [ 'role1','role2']`
> …

### `organization_target` examples

The following examples show various combinations of the `discovery` and `access` fields.

#### All internal accounts in the organization can discover and access the listing

```yaml
. . .
organization_targets:
   discovery:
   - all_internal_accounts : true
   access:
   - all_internal_accounts : true
. . .
```

#### Discoverable but only accessible by limited accounts

All internal accounts within the organization can discover the listing, but only `finance` accounts can access the listing.

```yaml
. . .
organization_targets:
   discovery:
   - all_internal_accounts : true
   access:
   - account: 'finance'
. . .
```

#### Discoverable but accessible by only select accounts

All internal accounts within the organization can discover the listing, but only accounts in the `finance` or `credit` account can access the listing.

```yaml
. . .
organization_targets:
   discovery:
   - all_internal_accounts : true
   access:
   - account: 'finance'
   - account: 'credit'
. . .
```

#### Discoverable but only accessible by limited accounts and specific roles

All internal accounts within the organization can discover the listing,
but only accounts in the `finance` account which have the `accounting` or `debit` role can access the listing.

```yaml
. . .
organization_targets:
    discovery:
    - all_internal_accounts : true
    access:
    - account: 'finance'

      roles: [ 'accounting','debit']
. . .
```

### `support_contact`

The email address for support information associated with the listing.

**Required** when the `discovery` field is specified.

```yaml
. . .
support_contact: "support@exampledomain.com"
. . .
```

### `approver_contact`

The email address for the listing approver.

**Required** when the `discovery` field is specified.

> ```yaml
> . . .
>   approver_contact: "approver@exampledomain.com"
> . . .
> ```

### `request_approval_type`

Define whether approval requests and approvals will happen inside or outside of Snowflake. Specify one of the following values:

* `NULL`
* `REQUEST_AND_APPROVE_IN_SNOWFLAKE` indicates access requests are submitted and approved within the Snowflake environment.
* `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE` indicates the provider manages access request submissions and approvals independently.

The value for external listings is always `NULL`.

> ```yaml
> . . .
>   request_approval_type: "REQUEST_AND_APPROVE_IN_SNOWFLAKE"
> . . .
> ```

### `locations`

Specifies the **optional** `locations` which can discover or access the listing.

The `access_regions` field is **required** when `locations` is specified and it must include one of the following sub-fields:

* `ALL` - All regions can discover or access the listing.
* An array of regions names prefixed with `PUBLIC` which can discover or access the listing.
  For example `access_regions: - name: PUBLIC.AWS_US_WEST_2`.

  ```yaml
  . . .
  locations:
    access_regions:
    - name: "<names | ALL>"
  . . .
  ```

For a complete list of regions, see [SHOW REGIONS](../../../../sql-reference/sql/show-regions.md).

## `auto_fulfillment`

Cross-Cloud Auto-fulfillment allows the data product associated with a listing
to be automatically fulfilled to other Snowflake regions.
The `auto_fulfillment` field defines how that auto-fulfillment takes place.

For more information on Cross-Cloud Auto-fulfillment, see [Auto-fulfillment for listings](../../../../collaboration/provider-listings-auto-fulfillment.md).

Auto-fulfillment is only required if you’re sharing data to multiple regions.
Do not enable it if you are sharing to accounts in the same region.

If you share data across multiple regions, the `auto_fulfillment` is:

* Required if your data product is an application package.
* Required if your data product is shared through a private listing.
* Recommended if your data product is shared through a public listing.

Contains the following name value pairs:

* `auto_fulfillment.refresh_schedule`

  + `<num> MINUTE` - Number of minutes. Minimum 10 minutes, maximum 8 days, or 11520 minutes.

    If `refresh_type` is specified as `SUB_DATABASE_WITH_REFERENCE_USAGE`, do not include this setting.
    The refresh schedule for application packages must be defined at the account level and cannot specified at the listing level.

    For more information see [Set the account-level refresh interval](../../../../collaboration/provider-listings-auto-fulfillment-set-refresh-interval.md).
* `USING CRON <expression>` - Defines the data product auto-fulfillment refresh schedule.

  > The syntax for `USING CRON` and `REPLICATION SCHEDULE` are the same. See [Parameters](../../../../sql-reference/sql/create-replication-group.md).
* `auto_fulfillment.refresh_type` (required when using `auto_fulfillment`): Must be one of -

  + `SUB_DATABASE` - database replication (object level) - recommended.
  + `SUB_DATABASE_WITH_REFERENCE_USAGE` - application package.
  + `FULL_DATABASE` - database replication (for the entire database). (Deprecated.)
* `auto_fulfillment.refresh_schedule_override` (optional): Overrides the defined update refresh frequency for all listings that use the same database. When this value is `FALSE`, listing updates fail when multiple listings sharing the same database have different refresh frequencies.

  + `TRUE` - enables the refresh frequency override.
  + `FALSE` - (default) disables the refresh frequency override.
* `auto_fulfillment.warehouse` (optional): The name of the warehouse used to create and refresh hidden dynamic tables for
  cross-region resharing. This warehouse is used only for resharing maintenance operations. Required when the listing is a reshared
  listing. Can be omitted for non-reshared listings. For more information, see [Reshare incoming data as a resharer](../../../../collaboration/resharing-as-resharer.md).

See also [Auto-fulfillment for listings](../../../../collaboration/provider-listings-auto-fulfillment.md).

### `auto_fulfillment.refresh_schedule` examples

The following example refreshes the data product associated with a listing every 10 minutes:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: 10 MINUTE
  refresh_type: SUB_DATABASE
. . .
```

The following example refreshes the data product associated with a listing on specific days and times in specific regions:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: USING CRON  0 17 * * MON-FRI Europe/London
  refresh_type: SUB_DATABASE
. . .
```

The following example enables the refresh frequency override for listings that share the same database but have different refresh frequencies:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: 10 MINUTE
  refresh_type: SUB_DATABASE
  refresh_schedule_override: TRUE
. . .
```

### Snowflake Native App `auto_fulfillment` example

`SUB_DATABASE_WITH_REFERENCE_USAGE` can only be used with application packages
and cannot be combined with `auto_fulfillment.refresh_schedule`.

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_type: SUB_DATABASE_WITH_REFERENCE_USAGE
. . .
```

### Object level `auto_fulfillment` example

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_type: SUB_DATABASE
. . .
```

## `resharing`

The `resharing` field controls whether consumers of the listing can reshare the data product with other accounts.

For more information on resharing, see [Resharing listings](../../../../collaboration/reshare-listings.md).

Contains the following name value pairs:

* `resharing.enabled` (optional): Enables or disables resharing for the listing.

  + `true` - allows consumers to reshare the listing data.
  + `false` - (default) prevents consumers from resharing the listing data.

> **Note:**
>
> Only external listings support the `resharing` property. You can’t enable resharing on internal (organizational) listings.

### `resharing` example

The following example enables resharing on a listing:

```yaml
. . .
listing_terms: . . .
. . .
resharing:
  enabled: true
. . .
```

---
title: Organization profile manifest reference
source: https://docs.snowflake.com/en/user-guide/collaboration/organization-profiles/org-profile-manifest-reference.md
section: User Guide
---

# Organization profile manifest reference

Creating organization profiles programmatically requires a manifest, written in YAML (<https://yaml.org/spec/>). Use the information provided here to learn about the parameters available in an organization profile manifest.

## Organization profile manifest

```yaml
#
# Organization profile manifest
#
title: <organization_profile_title>
description: <organization_profile_description>
contact: <organization_profile_contact>
approver_contact: <organization_profile_approver_contacts>
allowed_publishers:
  access:
      - account: account_name1
        roles: [<roles_list>]
      - account: account_name2
logo: <organization_profile_logo_urn>
```

## Organization profile fields

The parameters within the organization manifest allow you to create organization profiles for specific organizational listings. Required and optional fields are identified.

`title` (Required)
:   String. The organization profile title. This field represents the Provider domain. It’s shown under the Organization Listing and as a filter option under Providers in an Internal Marketplace.

    > ```yaml
    > . . .
    > title: "Title"
    > . . .
    > ```

`description` (Required)
:   String. A description for the organization profile.

    > ```yaml
    > . . .
    > description: "Description"
    > . . .
    > ```

`contact` (Required)
:   String. The email address of the organization profile owner.

    ```yaml
    . . .
    contact: "contact@snowflake.com"
    . . .
    ```

`approver_contact` (Required)
:   String. The email address of the organization profile approver.

    > The following is an example of the format:
    >
    > ```yaml
    > . . .
    > approver_contact: "approver_contact@snowflake.com"
    > . . .
    > ```

`allowed_publishers` (Optional)
:   The accounts that are allowed to publish the listing associated with the organization profile. You must specify the following with `allowed_publishers`:

    * `access`: A list of accounts allowed to publish the listing associated with the organization profile. To allow all accounts to publish the listing associated with the organization profile, use `all_internal_accounts: "true"`. To specify a list of roles within the current account that can access a profile, use `roles`.

      > **Note:**
      >
      > You can only assign specific roles in the current account.

    The following is an example of the format:

    ```yaml
    . . .
    allowed_publishers:
      access:
        - account: "account_name1"
          roles: ['PUBLIC']
        - account: "account_name2"
    . . .
    ```

`logo` (Optional)
:   String. The URN for the organization profile icon or emoji. Use the following format to specify a logo: `logo: "urn:icon:<name>:<color>"`

    The following table lists the available icons:

    | Icon | Name | Icon | Name |
    | --- | --- | --- | --- |
    |  | ai |  | blocks |
    |  | book |  | calendar |
    |  | classification |  | code |
    |  | compute |  | dataengineering |
    |  | diamond |  | energy |
    |  | environment |  | icon_forecasting |
    |  | gear |  | government |
    |  | healthmedicine |  | healthscience |
    |  | language |  | legal |
    |  | loudspeaker |  | machinelearning |
    |  | marketplaceinternal |  | package |
    |  | personalinfo |  | pin |
    |  | pinbuilding |  | pindata |
    |  | pinglobe |  | pinmap |
    |  | public |  | scale |
    |  | shieldlock |  | sport |
    |  | team |  | transportation |
    |  | travel |  | weather |
    |  | writinghand |  |  |

    Available logo colors include:

    * Default (Grey)
    * Blue
    * Violet
    * Pink
    * Orange
    * Aqua

    The following is an example of the format:

    ```yaml
    . . .
    logo: "urn:icon:shieldlock:blue"
    . . .
    ```

---
title: Organization users
source: https://docs.snowflake.com/en/user-guide/organization-users.md
section: User Guide
---

# Organization users

Organizations with multiple accounts often need to have the same person be a user in more than one of those accounts. To avoid the
repetition of creating a user object for the person in each account separately, the organization administrator can create an
*organization user* in the [organization account](organization-accounts.md). Each organization user acts as a global user
entity that can be imported into regular accounts by account administrators, simplifying the process of having the same person have a
user object in multiple accounts.

Account administrators don’t add organization users directly to their regular account. Rather, they add *organization user groups*, which
are logical groupings of organization users. When the account administrator imports the organization user group, its organization users are
added to the account.

> **Note:**
>
> If you want to create organization users for people who already have a user object in one or more regular accounts, you’ll need to link
> the organization user with the existing user object after importing the organization user group. For more information, see
> Resolve conflicts after importing users.

## Get started

The basic workflow of getting organization users into one or more accounts is as follows:

1. As a global organization administrator in the organization account:

   1. Create an organization user for each person that
      you want to be a user in multiple regular accounts.
   2. Create an organization user group that is a logical grouping of the users.
   3. Add the organization users to the organization user group.
   4. Make the organization user group available to the account administrators in regular accounts.
2. As an administrator in a regular account:

   1. Import the organization user group into the account.
   2. Check for and resolve any conflicts.

For an end-to-end example of this workflow, see Extended example.

## Create an organization user

The organization administrator creates an organization user with the basic properties of a user object such as the login name and email.
Only an email is required, but these basic properties can’t be set in a regular account after the user is imported. For a list of these
basic properties, see [CREATE ORGANIZATION USER](../sql-reference/sql/create-organization-user.md).

As an example, the following command creates an organization user:

```sqlexample
USE ROLE GLOBALORGADMIN;

CREATE ORGANIZATION USER asmith
   EMAIL = 'asmith@example.com'
   LOGIN_NAME = 'asmith@example.com';
```

The USERADMIN role can also create an organization user.

## Organization user groups

*Organization user groups* are logical groupings of organization users. The organization administrator creates these organization
user groups, then adds the organization users that should belong in each group. When the account administrator imports an organization user
group into an account, all the organization users in the group become user objects in the regular account. An organization user can be a
member of multiple organization user groups.

When the account administrator imports an organization user group into a regular account, Snowflake creates an access control
[role](security-access-control-overview.md) of the same name. For example, if the organization user group is named `data_stewards`,
then importing the group to the regular account creates a role named `data_stewards`. Each user imported from the organization user group is
granted this role.

Administrators in the regular account can fine-tune access control by granting and revoking privileges to the role that has
been granted to each of the users that were imported from the organization user group. You can also grant account-specific roles to the new
role or grant the new role to account-specific roles.

You can import the same organization user group into multiple regular accounts to implement consistent roles across the organization. Each
regular account can assign account-specific privileges to the role, but the naming will be consistent. Alternatively, you could create a
separate organization user group for each account, then add the organization users that are needed in a particular account to the
appropriate organization user group.

If the administrator imports multiple organization user groups that contain the same organization user, only one local user is created, and
this user is granted the roles from all of the organization user groups.

The organization administrator task of preparing an organization user group for the account administrator of regular accounts is a
three-step process:

1. Create the organization user group.
2. Add organization users to the group.
3. Set the visibility of the group to specify which regular accounts can access it.

### Create an organization user group

The organization administrator executes the [CREATE ORGANIZATION USER GROUP](../sql-reference/sql/create-organization-user-group.md) command to create a new organization
user group in the organization account.

As an example, the following command creates an organization user group that represents a logical grouping of data engineers.

```sqlexample
USE ROLE GLOBALORGADMIN;

CREATE ORGANIZATION USER GROUP data_engineers_group
 IS_GRANTABLE = TRUE;
```

Because the administrator set `IS_GRANTABLE=TRUE`, the account administrator will be able to grant the role created from the
organization user group to a local, account-specific role. Without that parameter, the account administrator can’t grant the role imported
from the organization user group to another role in the regular account.

The USERADMIN role can also create an organization user group.

### Add organization users to an organization user group

After the organization administrator creates an organization user group, they can execute the
[ALTER ORGANIZATION USER GROUP](../sql-reference/sql/alter-organization-user-group.md) command to add organization users to the group as a comma-delimited list. For
example, to add two existing organization users to the organization user group `data_engineers_group`, execute:

```sqlexample
ALTER ORGANIZATION USER GROUP data_engineers_group
   ADD ORGANIZATION USERS asmith, sjohnson;
```

### Make organization user groups available to regular accounts

After you have created an organization group, you need to specify which regular accounts can view and import the group. Account administrators
cannot use the organization user group to import users until you use the [ALTER ORGANIZATION USER GROUP](../sql-reference/sql/alter-organization-user-group.md) command
to set the visibility of the group. You can specify that all regular accounts can import the organization user group or you can restrict access
to specific accounts.

The following command only allows the account `qa_env` to add the organization user group:

```sqlexample
ALTER ORGANIZATION USER GROUP data_engineers_group
   SET VISIBILITY = ACCOUNTS qa_env;
```

> **Note:**
>
> An organization administrator cannot unilaterally hide an organization user group from an
> account that previously had visibility. An administrator in the regular account must run the ALTER ACCOUNT REMOVE ORGANIZATION USER GROUP
> command to remove the organization user group from the account before the organization administrator can change the visibility.

## Import users in a regular account

After the organization administrator has created an organization user group, administrators in regular accounts can
import the organization users by executing the ALTER ACCOUNT command to add the organization user group. These administrators can
only import an organization user group if the organization administrator has
set the visibility of the group so the regular account can access it.

By default, only users with the ACCOUNTADMIN role can import organization user groups into the regular account. To allow other users to import an
organization group, grant them the IMPORT ORGANIZATION USER GROUPS privilege.

The syntax to import an organization user group to a regular account is as follows:

```sqlsyntax
ALTER ACCOUNT ADD ORGANIZATION USER GROUP <group_name>
```

For an example of importing an organization user group to add users, see Extended example.

## Resolve conflicts after importing users

The account administrator who imports organization users in their regular account must manually check for conflicts. These conflicts can
arise between the properties of users or the name of the organization user group.

### Conflict between organization user group and existing role

A conflict occurs when the name of the organization user group matches the name of an existing
[role](security-access-control-overview.md) in the regular account. The users in the group are not imported until you resolve the
conflict.

To check whether there is a conflict after importing an organization user group, do the following:

1. Execute the [SHOW ORGANIZATION USER GROUPS](../sql-reference/sql/show-organization-user-groups.md) command.
2. In the `is_imported` column, check if the value is TRUE. If the value is FALSE, the organization user group was not successfully
   imported, which might indicate that there is a conflict.

You can resolve the conflict between a role and an organization user group by linking the role with the group. Linking a role allows it to
be managed as an organization user group going forward. After you link the conflicting role, the organization user group is added to the
account without further action. Call the [SYSTEM$LINK_ORGANIZATION_USER_GROUP](../sql-reference/functions/system_link_organization_user_group.md) function to link a role with
an organization user group.

For example, suppose the role `marketing_team` existed in your account before importing the organization user group `marketing_team` to the
account. To link the role to the organization user group and complete the process of importing the group, execute the following:

```sqlexample
SELECT SYSTEM$LINK_ORGANIZATION_USER_GROUP('marketing_team');
```

### Conflict between organization user and existing user

A conflict occurs when any of the following is true:

* The `name` property of an organization user matches the `name` of an existing user in the regular account.
* The `login_name` property of an organization user matches the `login_name` of an existing user in the regular account.

To check whether there is a user conflict after importing an organization user group, do the following:

1. Execute the [SHOW ORGANIZATION USERS IN ORGANIZATION USER GROUP](../sql-reference/sql/show-organization-users.md) command.
2. In the `is_imported` column, find rows where the value is FALSE. At least one property of the user in that row conflicts with the
   properties of an existing user.

> **Tip:**
>
> You can use the [pipe operator](../sql-reference/operators-flow.md) (`->>`) to post-process the output of SHOW ORGANIZATION USERS and filter on
> the `is_imported` column. For example, to search for organization users that were not successfully imported from the
> `marketing_team` organization user group, run the following query:
>
> ```sqlexample
> SHOW ORGANIZATION USERS IN ORGANIZATION USER GROUP marketing_team
>   ->> SELECT * FROM $1 WHERE "is_imported" = 'false';
> ```

Use one of the following strategies to resolve a conflict between an organization user and an existing user:

* **Link the existing user**: If an existing user object corresponds to the same person as an organization user, and you want to manage the
  user as an organization user going forward, you can link the existing user with the organization user to resolve the conflict. Call the
  [SYSTEM$LINK_ORGANIZATION_USER](../sql-reference/functions/system_link_organization_user.md) function to link an existing user with an organization user. For example, to
  link the existing user `jloeb` with the organization user `jloebsmith`, call the function as follows:

  ```sqlexample
  SELECT SYSTEM$LINK_ORGANIZATION_USER('jloeb', 'jloebsmith');
  ```
* **Drop the existing user**: If you want the organization user to completely replace the local user, run a
  [DROP USER](../sql-reference/sql/drop-user.md) command to delete the local user. After the local object is dropped, Snowflake automatically adds the
  new user object that corresponds to the organization user.
* **Rename the existing user or its properties**: If you don’t want to link the existing local user with an organization user, but you
  want to preserve the existing user instead of dropping it, you can rename the user object or its properties in the
  regular account to resolve the conflict. After the local object is renamed, Snowflake automatically adds the new user object that
  corresponds to the organization user. For example, if the pre-existing user and the organization user both have the login name
  `JOE_LOGIN`, you could execute the following in the regular account to avoid the conflict:

  ```sqlexample
  USE ROLE ACCOUNTADMIN;
  ALTER USER joe SET LOGIN_NAME = joe_login_renamed;
  ```

## Modifying imported users

Administrators in a regular account can use the [ALTER USER](../sql-reference/sql/alter-user.md) command to modify a subset of the properties of a user
object after it has been imported. The administrator can modify all properties *except* the properties that can be set on the
organization user in the organization account. For a list of the properties that can only be set in the organization account, see
[CREATE ORGANIZATION USER](../sql-reference/sql/create-organization-user.md).

## Testing whether users and roles were imported

Administrators in a regular account can use the [SYS_CONTEXT](../sql-reference/functions/sys_context.md) function to determine whether local users and
roles were created when an organization user group was imported into the account.

To determine whether local user `joe` is linked to an organization user, run the following command:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_USER_IMPORTED', 'joe');
```

To determine whether the role `analysts` corresponds to an organization user group that was imported, run the following command:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_GROUP_IMPORTED', 'analysts');
```

## Removing organization users and groups

Organization users and organization user groups can be removed from a single account or removed from
all accounts by dropping them in the organization account.

### Removing users from a single regular account

An account administrator can execute an ALTER ACCOUNT command to remove an organization user group from the account. Removing the
organization user group drops all of the users that were imported and removes the role that was created when the organization user group
was imported. This command does not affect the organization users and organization user groups in other regular accounts, nor in the
organization account.

> **Note:**
>
> An organization user can be a member of multiple organization user groups. If a user was imported from more than one organization
> group, removing one of the groups from the regular account does not remove the user. The user isn’t removed until all of the organization
> user groups are removed.

For example, the following command drops all of the users imported from the `data_stewards` group and deletes the `data_stewards`
role:

```sqlexample
ALTER ACCOUNT REMOVE ORGANIZATION USER GROUP data_stewards;
```

### Removing users from all regular accounts

When an organization user is dropped in the organization account, the corresponding user object is dropped from every regular account that
imported the user. To drop an organization user, execute the [DROP ORGANIZATION USER](../sql-reference/sql/drop-organization-user.md) command in the organization
account.

When an organization user group is dropped in the organization account, the effect on organization users depends on whether the users in the
regular account belong to other organization user groups that were also imported into the account. If an organization user belongs to a different
organization user group that was imported, the user is not removed from the account. Otherwise, dropping the organization user group removes
all of the users imported from the group.

Dropping an organization user group also removes the role that was created when the group was imported.

To drop an organization user group, execute the [DROP ORGANIZATION USER GROUP](../sql-reference/sql/drop-organization-user-group.md) command in the organization account.

## Unlinking organization users and organization user groups

When organization users are successfully imported into a regular account, the local user object is linked to the organization user. If you
decide you want to keep the user object in an account, but no longer want it associated with the organization user, you can use the
[SYSTEM$UNLINK_ORGANIZATION_USER](../sql-reference/functions/system_unlink_organization_user.md) function to unlink the local user from the organization user. All of the
properties of the user are preserved and it can be managed as a local user going forward.

Similarly, you can use the [SYSTEM$UNLINK_ORGANIZATION_USER_GROUP](../sql-reference/functions/system_unlink_organization_user_group.md) function to unlink a role that was created
by adding an organization user group. This keeps everything about the role the same, but unlinks it from the organization user group. Local
user objects that were added when the organization user group was imported are also unlinked, and are managed as local users going forward.

## Extended example

Organization administrator workflow
:   1. As the organization administrator, sign in to the organization account.
    2. Create organization users for two people who are data stewards:

       ```sqlexample
       USE ROLE GLOBALORGADMIN;

       CREATE ORGANIZATION USER joe_kelley
       EMAIL = 'jkelley@example.com'
       LOGIN_NAME = 'jkelley@example.com';

       CREATE ORGANIZATION USER grace_vivian
       EMAIL = 'gvivian@example.com'
       LOGIN_NAME = 'gvivian@example.com';
       ```
    3. Create an organization user group that represents a logical grouping of data stewards.

       ```sqlexample
       CREATE ORGANIZATION USER GROUP data_stewards_group;
       ```
    4. Add the organization users to the new organization user group.

       ```sqlexample
       ALTER ORGANIZATION USER GROUP data_stewards_group
          ADD ORGANIZATION USERS joe_kelley, grace_vivian;
       ```
    5. Allow all regular accounts to import the organization user group.

       ```sqlexample
       ALTER ORGANIZATION USER GROUP data_stewards_group
          SET VISIBILITY = ALL;
       ```

Account administrator workflow
:   1. As the account administrator, sign in to the regular account where you want to import the organization users.
    2. List the organization user groups that can be imported into the account.

       ```sqlexample
       USE ROLE ACCOUNTADMIN;

       SHOW ORGANIZATION USER GROUPS;
       ```
    3. Import the organization user group into the account.

       ```sqlexample
       ALTER ACCOUNT
         ADD ORGANIZATION USER GROUP data_stewards_group;
       ```
    4. Check for conflicts between the organization user group and an existing role:

       ```sqlexample
       SHOW ORGANIZATION USER GROUPS;
       ```

       Make sure the value of the `is_imported` column is TRUE, which indicates there was no conflict.
    5. List the users that have been added to the account and check for conflicts:

       ```sqlexample
       SHOW ORGANIZATION USERS IN ORGANIZATION USER GROUP data_stewards_group;
       ```

       Make sure the value of the `is_imported` column is TRUE for all of the organization users, which indicates there were no
       conflicts.

## Related functions

For a list of functions that help you work with organization users and organization user groups, see
[Organization user and organization user group functions](../sql-reference/functions-organization-users.md).

## Limitations and considerations

After an organization user is added to a regular account, you’ll set up the user’s authentication methods the same as any other user.
You can’t set up authentication at the organization level.

---
title: Organizational listing governance
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-governance.md
section: User Guide
---

# Organizational listing governance

[Organization-level access history](../../../access-history.md) provides data governors with the information they need to track
when a consumer’s query reads from a data product made available by a provider through an organizational listing. The data governor can
determine which account provided the organizational listing and exactly which data object was accessed. They can also determine if the data
object provided by the organizational listing is protected by a policy (such as a masking policy or row access policy) in the provider’s
account.

You can gain these insights into the consumer queries by using the [organization account](../../../organization-accounts.md) to query
the ACCESS_HISTORY view of the ORGANIZATION_USAGE schema. This ACCESS_HISTORY view contains the
following columns related to the governance of organizational listings:

* `provider_base_objects_accessed` - Specifies the data objects in the provider’s account that were accessed by the consumer query.
* `provider_policies_referenced` - If a consumer query accessed base objects that are protected by a policy in the provider’s
  account, this column lists the policy.

For example, if an organization administrator wants to know all the intra-organization, cross-account queries that have accessed data
objects via organizational listings, they could execute the following query *from the organization account*:

```sqlexample
SELECT * FROM snowflake.organization_usage.access_history
  WHERE provider_base_objects_accessed IS NOT NULL;
```

---
title: Organize catalog content
source: https://docs.snowflake.com/en/user-guide/opencatalog/organize-catalog-content.md
section: User Guide
---

# Organize catalog content

This topic provides instructions for how to create namespaces and tables for an internal catalog in Snowflake Open Catalog.

> **Important:**
>
> If you drop a table in Snowflake Open Catalog without purging it, don’t create a new table with the same name and location as the dropped
> table. If you do, a user could gain access to the original table’s data when they shouldn’t have permission to access it. For example, if
> you drop but don’t purge `Table1` where its storage directory location is `/MyCatalog/Schema1/Table1`, don’t create a new `Table1` within
> the same `Table1` storage directory. When you drop a table without purging it, its data is retained in the external cloud storage.

> **Important:**
>
> To ensure that the access privileges defined for a catalog are enforced correctly, the following conditions must be met:
>
> * A directory only contains the data files that belong to a single table.
> * A directory hierarchy matches the namespace hierarchy for the catalog.
>
> For example, if a catalog includes the following items:
>
> * Top-level namespace `namespace1`
> * Nested namespace `namespace1a`
> * A `customers` table grouped under nested namespace `namespace1a`
> * An `orders` table grouped under nested namespace `namespace1a`
>
> The directory hierarchy for the catalog must be:
>
> * `/namespace1/namespace1a/customers/<files for the customers table *only*>`
> * `/namespace1/namespace1a/orders/<files for the orders table *only*>`
>
> These conditions apply to both internal and external catalogs, including external catalogs that contain
> [Snowflake-managed Apache Iceberg™ tables](https://docs.snowflake.com/en/user-guide/tables-iceberg). When you create a table in an
> internal catalog, Open Catalog prohibits you from creating the table within the directory or subdirectory for an existing table. When you
> create Snowflake-managed Iceberg tables in an external catalog, Open Catalog doesn’t prohibit overlapping directory locations. Therefore,
> when you create these tables, use the BASE_LOCATION parameter to specify a unique parent directory for each table. For more information, see
> [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-snowflake).
>
> For more information about internal and external catalogs, see [Catalog types](overview.md).

## Organizing catalog content

A catalog admin can use Open Catalog or a third-party query engine to organize catalog content as follows:

| Object | Use |
| --- | --- |
| Namespace | * Open Catalog * Third-party query engine |
| Table | Third-party query engine |

**Note**

> The tables and namespaces for an external catalog are read-only in Open Catalog. If you need to organize catalog content for an external
> catalog, you must use Snowflake. For more information, see [Snowflake-managed Apache Iceberg™ tables](https://docs.snowflake.com/en/user-guide/tables-iceberg).

The example code in this topic shows how to use Apache Spark to organize catalog content. The example code is in PySpark.

## Create a namespace

This section provides instructions for creating top-level or nested namespaces.

**Important**

> When you create a namespace, don’t use periods or spaces in the namespace name.

### Create a top-level namespace

To create a top-level namespace, you can use Apache Spark or Open Catalog.

#### Example: Create a top-level namespace by using Apache Spark

The following example code creates a top-level namespace named `namespace1` in the catalog `catalog1`:

```python
spark.sql("use catalog1").show()
spark.sql("CREATE NAMESPACE namespace1")
```

#### Create a top-level namespace by using Open Catalog

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog where you want to create a top-level namespace.
4. Select **+ Namespace**.
5. For **Name**, enter a name for the namespace, and then select **Submit**.

### Create a nested namespace

To create a nested namespace, you can use Apache Spark or Open Catalog.

#### Example: Create a nested namespace by using Apache Spark

The following example code creates a nested namespace named `namespace1a` in the catalog `catalog1`. This nested namespace is created under the
existing top-level namespace `namespace1`:

```python
spark.catalog.setCurrentCatalog("catalog1")
spark.sql("use catalog1").show()
spark.sql("CREATE NAMESPACE namespace1.namespace1a")
```

#### Create a nested namespace by using Open Catalog

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog where you want to create a nested namespace.
4. On the **Namespaces** tab, navigate to the parent namespace where you want to create the nested namespace.
5. Select **+ Namespace**.
6. For **Name**, enter a name for the nested namespace, and then select **Submit**.

## Create a table

This section provides examples for creating tables by using Apache Spark.

> **Important:**
>
> If you drop a table in Snowflake Open Catalog without purging it, don’t create a new table with the same name and location as the dropped
> table. If you do, a user could gain access to the original table’s data when they shouldn’t have permission to access it. For example, if
> you drop but don’t purge `Table1` where its storage directory location is `/MyCatalog/Schema1/Table1`, don’t create a new `Table1` within
> the same `Table1` storage directory. When you drop a table without purging it, its data is retained in the external cloud storage.

### Example: Create a table

The following example code creates a `customers` table under nested namespace `namespace1a` in the catalog `catalog1`. It is created with `id` and
`custnum` columns, and the data type for both columns is `integer`:

```python
spark.sql("use catalog1").show()
spark.sql ("use namespace1.namespace1a")
spark.sql("CREATE OR REPLACE TABLE customers (id int, custnum int) using iceberg")
```

### Example: Insert rows into a table

The following example code inserts a row into the `customers` table:

```python
spark.sql("use catalog1").show()
spark.sql ("use namespace1.namespace1a")
spark.sql("INSERT INTO customers VALUES (123,456)")
```

---
title: Override share restrictions
source: https://docs.snowflake.com/en/user-guide/override_share_restrictions.md
section: User Guide
---

# Override share restrictions

To allow sharing data from a Business Critical account to a non-Business Critical account, or from a HIPAA-compliant account to a non-HIPAA-
compliant account, a user with a role granted the OVERRIDE SHARE RESTRICTIONS global privilege can specify the SHARE_RESTRICTIONS parameter
for a specific share offered by their provider account.

## Grant the OVERRIDE SHARE RESTRICTIONS privilege to another role

The OVERRIDE SHARE RESTRICTIONS global privilege is granted to the ACCOUNTADMIN role by default, but it can be granted to other roles.
To grant OVERRIDE SHARE RESTRICTIONS to a role, use the ACCOUNTADMIN role and the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command.

Syntax:

`GRANT OVERRIDE SHARE RESTRICTIONS ON ACCOUNT TO ROLE <role_name>`

Where:

`<role_name>` is the role to which the privilege is granted.

For example, to grant the privilege to the SYSADMIN role:

```sqlexample
use role accountadmin;
grant override share restrictions on account to role sysadmin;
```

## Set the SHARE_RESTRICTIONS parameter on a share

As a provider in a Business Critical account, you can share data with a consumer in a non-Business Critical account using the
SHARE_RESTRICTIONS parameter on a direct share. This parameter also applies to HIPAA-compliant providers that share data with a non-HIPAA consumer.

You must use the ACCOUNTADMIN role, or use a custom role granted the following privileges:

* The OVERRIDE SHARE RESTRICTIONS global privilege.
* OWNERSHIP on the share or the CREATE SHARE global privilege.

Use the [ALTER SHARE](../sql-reference/sql/alter-share.md) command to set the SHARE_RESTRICTIONS parameter on a share:

For example, to update the share `my_share` to add a non-Business Critical or non-HIPAA consumer account `consumerorg.consumeraccount`,
run the following:

```sqlexample
use role sysadmin;
alter share my_share add accounts = consumerorg.consumeraccount SHARE_RESTRICTIONS=false;
```

See [ALTER SHARE](../sql-reference/sql/alter-share.md) for more details.

> **Attention:**
>
> Snowflake is not responsible for ensuring that HIPAA (and HITRUST) accounts who engage in data sharing have a signed BAA with each other;
> this is at the discretion of the accounts that are sharing data. Note that failure to have a signed BAA may impact the HIPAA (and HITRUST)
> compliance of both accounts, particularly the provider account.
>
> Also, if you have Business Critical account, to maintain the expected level of data protection provided by Business Critical, we
> strongly recommend considering the following before requesting Snowflake to enable Secure Data Sharing with non-Business Critical
> accounts:
>
> * Do not share sensitive data with non-Business Critical accounts.
> * Consider creating a second, non-Business Critical account where you store less sensitive data and share this data with non-Business
>   Critical accounts.
> * If you are using [Tri-Secret Secure](security-encryption-tss.md) with your Business Critical account and you share data with other accounts, Snowflake
>   treats the data access from these accounts as if the access occurred from within your own account. Specifically, granting access to
>   the consumer account might require Snowflake to access the key management service on the cloud platform that hosts your Snowflake
>   account.
>
> These are only recommendations and are not enforced by Snowflake. The decision to share data is always at the discretion of the data
> provider and Snowflake does not assume any responsibility for data that is improperly shared.

---
title: Overview of Access Control
source: https://docs.snowflake.com/en/user-guide/security-access-control-overview.md
section: User Guide
---

# Overview of Access Control

This topic provides information on the main access control topics in Snowflake.

## Access control framework

Snowflake’s approach to access control combines aspects from the following models:

* **Discretionary Access Control (DAC):** Each object has an owner, who can in turn grant access to that object.
* **Role-based Access Control (RBAC):** Access privileges are assigned to roles, which are in turn assigned to users.
* **User-based Access Control (UBAC):** Access privileges are assigned directly to users. Access control considers privileges assigned
  directly to users only when USE SECONDARY ROLE is set to ALL.

For more information about secondary roles, see [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md) and Authorization through primary role and secondary roles.

Several concepts key to understanding access control in Snowflake include:

* **Securable object:** An entity to which access can be granted. Unless allowed by a grant, access is denied.
* **Role:** An entity to which privileges can be granted.
* **Privilege:** A defined level of access to an object. Multiple distinct privileges may be used to control the granularity of access
  granted.
* **User:** A user identity recognized by Snowflake, whether associated with a person or service. A user is also an entity to which
  privileges can be granted.

In Snowflake, privileges assigned to roles or users allow access to securable objects. Roles can be assigned to users or
other roles. Granting a role to another role creates a role hierarchy, further described in
Role hierarchy and privilege inheritance. Usually, you use RBAC to manage access to securable objects in Snowflake.

The following diagram illustrates how DAC, RBAC, and UBAC support appropriate privilege assignment on different securable objects. In this
example, Role 1 has the OWNERSHIP privilege on both Object 1 and Object 2. In other words, Role 1 owns both objects. This illustrates DAC.

Privileges on Object 1 can be granted to Role 2, which can then be granted to User 1 and User 2. In other words, User 1 and User 2 have
access to Object 1, limited by these privileges, because both users are assigned Role 2. This part of the figure illustrates RBAC.

Privileges on Object 2 can be granted directly to User 3 and User 4. This part of the figure illustrates how you can use UBAC to
extend the Snowflake access control framework, providing a significant amount of both control and flexibility.

## Securable objects

Every securable object resides within a logical container in a hierarchy of containers. The top-most container is the customer
organization. Securable objects such as tables, views, functions, and stages are contained in a schema object, which are in turn
contained in a database. All databases for your Snowflake account are contained in the account object. This hierarchy of objects and
containers is illustrated below:

To *own* an object means that a role has the OWNERSHIP
privilege on the object. Each securable object is owned by a single role, which by
default is the role used to create the object. When this role is assigned to users, they effectively have shared control over the object.
The [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command lets you transfer the ownership of an object from one role to another role, including
to database roles. This command also specifies the securable objects in each container.

In a regular schema, the owner role has all privileges on the object by default, including the ability to grant or revoke privileges on the
object to other roles. In addition, ownership can be transferred from one role to another. However, in a
[managed access schema](security-access-control-configure.md), object owners lose the ability to make grant decisions. Only the schema owner
(the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant privileges on objects in
the schema.

The ability to perform SQL actions on objects is defined by the privileges granted to the
active role in a user session. For example, if the active role in your session has been
granted CREATE, USAGE, SELECT, and WRITE privileges in a specific Snowflake database schema, then you can create a warehouse, list tables
contained, and add data to a table in that schema.

## Roles

Roles are the entities to which privileges on securable objects can be granted and
revoked. Roles are assigned to users to allow them to perform actions required for business functions in their organization. A user can be
assigned multiple roles. This allows users to switch roles (that is, choose which role is active in the current Snowflake session) to perform
different actions using separate sets of privileges.

There are a small number of system-defined roles
in a Snowflake account. System-defined roles cannot be dropped. In addition, the privileges granted
to these roles by Snowflake cannot be revoked.

Users who have been granted a role with the necessary privileges can create custom roles
to meet specific business and security needs.

Roles can be also granted to other roles, creating a hierarchy of roles. The privileges associated with a role are inherited by any roles
above that role in the hierarchy. For more information about role hierarchies and privilege inheritance, see
Role Hierarchy and Privilege Inheritance.

> **Note:**
>
> A role owner (the role that has the OWNERSHIP privilege on the role) does
> not inherit the privileges of the owned role. Privilege inheritance is only
> possible within a role hierarchy.

Although additional privileges can be granted to the system-defined roles, it is not recommended. System-defined roles are created with
privileges related to account-management. As a best practice, it is not recommended to mix account-management privileges and
entity-specific privileges in the same role. If additional privileges are needed, Snowflake recommends granting the additional privileges
to a custom role and assigning the custom role to the system-defined role.

> **Tip:**
>
> You can use organization user groups to implement consistent roles across accounts within an organization. For more information, see
> [Organization users](organization-users.md).

### Types of roles

The following role types vary in their scope, which enable administrators to authorize and restrict access to objects in your account.

> **Note:**
>
> Except where noted in the product documentation, the term *role* refers to either type.

Account roles:
:   To permit SQL actions on any object in your account, grant privileges on the object to an account role.

Database roles:
:   To limit SQL actions to a single database, as well as any object in the database, grant privileges on the object to a database role
    in the same database.

    Note that database roles cannot be activated directly in a session. Grant database
    roles to account roles, which can be activated in a session.

    For more information about database roles, see:

    * Role hierarchy and privilege inheritance
    * Database roles and role hierarchies
    * [Managing database object access using database roles](security-access-control-considerations.md)
    * Database roles in the shared [SNOWFLAKE database](../sql-reference/snowflake-db-roles.md).
    * [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md)

Instance roles:
:   To permit access to an instance of a [class](../sql-reference/snowflake-db-classes.md), grant an instance role to
    an account role.

    A class may have one or more class roles with different privileges granted to each role. When an instance of a class is created,
    the instance role(s) can be granted to account roles to grant access to instance methods.

    Note that instance roles cannot be activated directly in a session. Grant instance
    roles to account roles, which can be activated in a session.

    For more information, see [Instance roles](../sql-reference/snowflake-db-classes.md).

Application roles:
:   To enable consumer access to objects in a Snowflake Native App, the provider creates the application role and grants privileges to the
    application role in the [set up script](../developer-guide/native-apps/creating-setup-script.md).

System application roles:
:   To support specific functionality for a particular feature, such as granting access to objects in which Snowflake is the owner,
    Snowflake can provide one or more *system application roles*. You can grant the system application roles to account roles at your
    discretion.

    System application roles are discussed in the context of a specific feature because that specific feature is the only place where you
    can use the system application role(s). For example:

    * Budgets: [Application roles to manage the account budget](budgets.md).
    * Data Quality and data metric functions (DMFs): [View results of a data metric function](data-quality-results.md).

Service roles:
:   To allow a role access to service endpoints, grant the service role to that role. You can grant a service role to an account role, an
    application role, or a database role. For more information, see [Managing service-related privileges](../developer-guide/snowpark-container-services/working-with-services.md).

### Active roles

*Active roles* serve as the source of authorization for any action taken by a user in a session. Both the
primary role and any secondary roles can be activated in a user session.

A role becomes an active role in either of the following ways:

* When a session is first established, the user’s default role and default secondary roles are activated as the session primary and
  secondary roles, respectively.

  Note that client connection properties used to establish the session could explicitly override the primary role or secondary roles to use.
* Executing a [USE ROLE](../sql-reference/sql/use-role.md) or [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md) statement activates a different primary
  role or secondary roles, respectively. These roles can change over the course of a session if either command is executed again.

### System-defined roles

GLOBALORGADMIN:
:   (aka Organization Administrator)

    Role that performs organization-level tasks such as managing the lifecycle of accounts and
    viewing organization-level usage information. The role exists only in the [organization account](organization-accounts.md).

ORGADMIN:
:   Role that uses a regular account to manage operations at the organization level. The ORGADMIN role will be phased out in a future
    release, so organization administrators are encouraged to use the GLOBALORGADMIN role instead.

ACCOUNTADMIN:
:   (aka Account Administrator)

    Role that encapsulates the SYSADMIN and SECURITYADMIN system-defined roles. It is the top-level role in the system and should be granted
    only to a limited/controlled number of users in your account.

SECURITYADMIN:
:   (aka Security Administrator)

    Role that can manage any object grant globally, as well as create, monitor, and manage users and roles. More specifically, this role:

    * Is granted the MANAGE GRANTS security privilege to be able to modify any grant, including revoking it.

      > **Note:**
      >
      > The MANAGE GRANTS privilege provides the ability to grant and revoke privileges. It does not give the SECURITYADMIN the ability to
      > perform other actions such as creating objects. To create an object, the SECURITYADMIN role must also be granted the privileges
      > needed to create the object. For example, to create a database role, the SECURITYADMIN must also be granted the CREATE DATABASE ROLE
      > privilege, as described in [CREATE DATABASE ROLE Access control requirements](../sql-reference/sql/create-database-role.md).
    * Inherits the privileges of the USERADMIN role via the system role hierarchy (that is, USERADMIN role is granted to SECURITYADMIN).

USERADMIN:
:   (aka User and Role Administrator)

    Role that is dedicated to user and role management only. More specifically, this role:

    * Is granted the CREATE USER and CREATE ROLE security privileges.
    * Can create users and roles in the account.

      This role can also manage users and roles that it owns. Only the role with the OWNERSHIP privilege on an object (that is, user or role), or
      a higher role, can modify the object properties.

SYSADMIN:
:   (aka System Administrator)

    Role that has privileges to create warehouses and databases (and other objects) in an account.

    If, as [recommended](security-access-control-considerations.md), you create a role hierarchy that ultimately assigns all
    custom roles to the SYSADMIN role, this role also has the ability to grant privileges on warehouses, databases, and other objects to other
    roles.

PUBLIC:
:   Pseudo-role that is automatically granted to every user and every role in your account. The PUBLIC role can own securable objects, just
    like any other role; however, the objects owned by the role are, by definition, available to every other user and role in your account.

    This role is typically used in cases where explicit access control is not needed and all users are viewed as equal with regard to their
    access rights.

### Custom roles

Custom account roles can be created using the USERADMIN role (or a higher role) as well as
by any role to which the CREATE ROLE privilege has been granted.

Custom database roles can be created by the database owner (that is, the role that has the OWNERSHIP privilege on the database).

By default, a newly-created role is not assigned to any user, nor granted to any other role.

When creating roles that will serve as the owners of securable objects in the system, Snowflake recommends creating a hierarchy of custom
roles, with the top-most custom role assigned to the system role SYSADMIN. This role structure allows system administrators to manage all
objects in the account, such as warehouses and database objects, while restricting management of users and roles to the USERADMIN role.

Conversely, if a custom role is not assigned to SYSADMIN through a role hierarchy, the system administrators cannot manage the
objects owned by the role. Only those roles granted the MANAGE GRANTS privilege (only the SECURITYADMIN role by default) can view the
objects and modify their access grants.

For instructions to create custom roles, see [Creating custom roles](security-access-control-configure.md).

## Privileges

Access control privileges determine who can access and perform operations on specific objects in Snowflake. For each securable object,
there is a set of privileges that can be granted on it. For existing objects, privileges must be granted on individual objects
(such as the SELECT privilege on the `mytable` table). To simplify grant management,
[future grants](security-access-control-considerations.md) allow defining an initial set of privileges on objects created in a schema
(for example, granting the SELECT privilege on all *new* tables created in the `myschema` schema to a specified role).

Privileges are managed using the following commands:

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](../sql-reference/sql/revoke-privilege.md)
* [GRANT <privileges> … TO USER](../sql-reference/sql/grant-privilege-user.md)
* [REVOKE <privileges> … FROM USER](../sql-reference/sql/revoke-privilege-user.md)

In regular (non-managed) schemas, use of these commands is restricted to the role that owns an object (has the OWNERSHIP privilege on the
object), any roles or users that have the MANAGE GRANTS global privilege for the object (only the SECURITYADMIN role by default).

In [managed access schemas](security-access-control-configure.md), object owners lose the ability to make grant decisions. Only the schema
owner or a role with the MANAGE GRANTS privilege can grant privileges on objects in the schema, including future grants, centralizing
privilege management.

Note that a role that holds the global MANAGE GRANTS privilege can grant additional privileges to the current (grantor) role.

For more details, see [Access control privileges](security-access-control-privileges.md).

## Role hierarchy and privilege inheritance

The following diagram illustrates the hierarchy for the system-defined roles, as well as the recommended structure for additional,
user-defined account roles and database roles. The highest-level database role in the example hierarchy is granted to a custom
(user-defined) account role. In turn, this role is granted to another custom role in a recommended structure that allows the system-defined
SYSADMIN role to inherit the privileges of custom account roles and database roles:

> **Note:**
>
> ORGADMIN is a separate system role that manages operations at the organization level. This role is not included in the hierarchy of
> system roles.

For a more specific example of role hierarchy and privilege inheritance, consider the following scenario:

> * Role 3 has been granted to Role 2.
> * Role 2 has been granted to Role 1.
> * Role 1 has been granted to User 1.

In this scenario:

> * Role 2 inherits Privilege C.
> * Role 1 inherits Privileges B and C.
> * User 1 has all three privileges.

For instructions on creating a role hierarchy, see [Creating a role hierarchy](security-access-control-configure.md).

### Database roles and role hierarchies

The following limitations currently apply to database roles:

* If a database role is granted to a [share](data-sharing-gs.md), then no other database roles can be granted to
  that database role. For example, if database role `d1.r1` is granted to a share, then attempting to grant database role `d1.r2` to
  `d1.r1` is blocked.

  In addition, if a database role is granted to another database role, the grantee database role cannot be granted to a share.

  Database roles that are granted to a share can be granted to other database roles, as well as account roles.
* Account roles cannot be granted to database roles in a role hierarchy.

## Authorization through primary role and secondary roles

Every active user session has a current role, also referred to as a *primary role*. When a session is initiated (for example, when a user
connects using JDBC/ODBC or logs in to the Snowflake web interface), the current role is determined based on the following criteria:

1. If a role was specified as part of the connection and that role is a role that has already been granted to the connecting user, the
   specified role becomes the current role.
2. If no role was specified and a default role has been set for the connecting user, that role becomes the current role.
3. If no role was specified and a default role has not been set for the connecting user, the system role PUBLIC is used.

To view the current role for a session, execute the [CURRENT_ROLE](../sql-reference/functions/current_role.md) function.

In addition, a set of *secondary* roles can be activated in a user session. A user can perform SQL actions on objects in a session using
the aggregate privileges granted to the primary and secondary roles. The roles must be granted to the user before they can be activated in
a session. Note that while a session must have exactly one active primary role at a time, a session can activate any number of secondary
roles at the same time.

> **Note:**
>
> A database role can be neither a primary nor a secondary role. To assume the privileges granted to a database role, grant the database
> role to an account role. Only account roles can be activated in a session.

Authorization to execute [CREATE <object>](../sql-reference/sql/create.md) statements comes from the primary role only. When an object is created, its
ownership is set to the currently active primary role. However, for any other SQL action, any permission granted to any active primary or
secondary role can be used to authorize the action. For example, if any role in a secondary role hierarchy owns an object (has the
OWNERSHIP privilege on the object), the secondary roles would authorize performing any DDL actions on the object. Both the primary role and
all secondary roles inherit privileges from any roles lower in their role hierarchies.

For organizations whose security model includes a large number of roles, each with a fine granularity of authorization defined by
permissions, using secondary roles simplifies role management. All roles that were granted to a user can be activated in a session.
Secondary roles are particularly useful for SQL operations such as cross-database joins that would otherwise require creating a parent
role of the roles that have permissions to access the objects in each database.

During the course of a session, you can execute the [USE ROLE](../sql-reference/sql/use-role.md) or [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command to change the current primary or secondary roles, respectively. You can use the [CURRENT_SECONDARY_ROLES](../sql-reference/functions/current_secondary_roles.md)
function to show all active secondary roles for the current session.

When you create an object that requires one or more privileges to use, only the primary role and those roles that it directly or
indirectly inherits are considered when searching for the grants of those privileges.

For any other statement that requires one or more privileges (such as querying a table requires the SELECT privilege on a table with the
USAGE privilege on the database and schema), the primary role, the secondary roles, and any other inherited roles are considered when
searching for the grants of those privileges.

> **Note:**
>
> There is no concept of a “super-user” or “super-role” in Snowflake that can bypass authorization checks. All access requires appropriate
> access privileges.

---
title: Overview of data loading
source: https://docs.snowflake.com/en/user-guide/data-load-overview.md
section: User Guide
---

# Overview of data loading

This topic provides an overview of the main options available to load data into Snowflake.

To easily and accurately measure the ingestion latency of your data pipelines, use row timestamps. For more information, see [Use row timestamps to measure latency in your pipelines](data-engineering/row-timestamps.md).

## Supported file locations

Snowflake refers to the location of data files in cloud storage as a *stage*.
The [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command used for both bulk and continuous data loads (Snowpipe) supports
cloud storage accounts managed by your business entity (*external stages*) as well as cloud storage contained in your Snowflake account
(*internal stages*).

### External stages

Loading data from any of the following cloud storage services is supported regardless of the [cloud platform](intro-cloud-platforms.md) that hosts your Snowflake account:

* Amazon S3
* Google Cloud Storage
* Microsoft Azure

You cannot access data held in archival cloud storage classes that requires restoration before it can be retrieved. These archival storage classes include, for example, the Amazon S3 Glacier Flexible Retrieval or Glacier Deep Archive storage class, or Microsoft Azure Archive Storage.

Upload (i.e. *stage*) files to your cloud storage account using the tools provided by the cloud storage service.

A named external stage is a database object created in a schema. This object stores the URL to files in cloud storage, the settings used to access the cloud storage account, and convenience settings such as the options that describe the format of staged files. Create stages using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.

> **Note:**
>
> Some data transfer billing charges may apply when loading data from files in a cloud storage service in a different region or cloud platform from your Snowflake account. For more information, see [Understanding data transfer cost](cost-understanding-data-transfer.md).

### Internal stages

Snowflake maintains the following stage types in your account:

User:
:   A user stage is allocated to each user for storing files. This stage type is designed to store files that are staged and managed by a single user but can be loaded into multiple tables. User stages cannot be altered or dropped.

Table:
:   A table stage is available for each table created in Snowflake. This stage type is designed to store files that are staged and managed by one or more users but only loaded into a single table. Table stages cannot be altered or dropped.

    Note that a table stage is not a separate database object; rather, it is an implicit stage tied to the table itself. A table stage has no grantable privileges of its own. To stage files to a table stage, list the files, query them on the stage, or drop them, you must be the table owner (have the role with the OWNERSHIP privilege on the table).

Named:
:   A named internal stage is a database object created in a schema. This stage type can store files that are staged and managed by one or more users and loaded into one or more tables. Because named stages are database objects, the ability to create, modify, use, or drop them can be controlled using security access control privileges. Create stages using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.

Upload files to any of the internal stage types from your local file system using the [PUT](../sql-reference/sql/put.md) command.

## Bulk vs continuous loading

Snowflake provides the following main solutions for data loading. The best solution may depend upon the volume of data to load and the frequency of loading.

### Bulk loading using the COPY command

This option enables loading batches of data from files already available in cloud storage, or copying (i.e. *staging*) data files from a local machine to an internal (i.e. Snowflake) cloud storage location before loading the data into tables using the COPY command.

#### Compute resources

Bulk loading relies on user-provided virtual warehouses, which are specified in the COPY statement. Users are required to size the warehouse appropriately to accommodate expected loads.

#### Simple transformations during a load

Snowflake supports transforming data while loading it into a table using the COPY command. Options include:

* Column reordering
* Column omission
* Casts
* Truncating text strings that exceed the target column length

There is no requirement for your data files to have the same number and ordering of columns as your target table.

### Continuous loading using Snowpipe

This option is designed to load small volumes of data (i.e. micro-batches) and incrementally make them available for analysis. Snowpipe loads data within minutes after files are added to a stage and submitted for ingestion. This ensures users have the latest results, as soon as the raw data is available.

#### Compute resources

Snowpipe uses compute resources provided by Snowflake (i.e. a serverless compute model). These Snowflake-provided resources are automatically resized and scaled up or down as required, and are charged and itemized using per-second billing. Data ingestion is charged based upon the actual workloads.

#### Simple transformations during a load

The COPY statement in a pipe definition supports the same COPY transformation options as when bulk loading data.

In addition, data pipelines can leverage Snowpipe to continuously load micro-batches of data into staging tables for transformation and optimization using automated tasks and the change data capture (CDC) information in streams.

### Continuous loading using Snowpipe Streaming

The Snowpipe Streaming API writes rows of data directly to Snowflake tables without the requirement of staging files. This architecture results in lower load latencies with corresponding lower costs for loading any volume of data, which makes it a powerful tool for handling near real-time data streams.

Snowpipe Streaming is also available for the Snowflake Connector for Kafka, which offers an easy upgrade path to take advantage of the lower latency and lower cost loads.

For more information, refer to [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

## Loading data from Apache Kafka topics

The [Snowflake Connector for Kafka](kafka-connector.md) enables users to connect to an [Apache Kafka](https://kafka.apache.org/) server, read data from one or more topics, and load that data into Snowflake tables.

## DML error logging

When you execute a set of DML statements and one of the statements fails with an error, the DML operation ends
and the changes made by the DML statement are rolled back. If you want to continue to execute the rest of the
DML statements and log the error that occurred, you can turn on DML error logging for the table. The table for
which DML error logging is turned on is called the *base table*. Errors are logged in an *error table* that is
associated with the base table.

DML error logging is turned on for a table only when *both* of the following conditions are met:

* The ERROR_LOGGING property is set to `TRUE` for the table.
* The [OPT_OUT_ERROR_LOGGING](../sql-reference/parameters.md) parameter is set to `FALSE` for the current session.

DML error logging is turned off for a table only when *either* of the following conditions are met:

* The ERROR_LOGGING property is set to `FALSE` for the table.
* The OPT_OUT_ERROR_LOGGING parameter is set to `TRUE` for the current session.

The following sections provide more information about DML error logging:

### Use cases for DML error logging

You might use DML error logging to avoid failures on errors for the following use cases:

* Migration of third-party data that relies on DML error logging, such as data from an Oracle database.
* Enforcement of some table constraints, such as NOT NULL constraints, during data ingestion.

### Configure DML error logging for a table

You can turn on or turn off DML error logging for a standard Snowflake table or a Snowflake-managed Iceberg
table when you create or alter the table.

To turn on or turn off error logging for a table, use the following SQL commands to set the ERROR_LOGGING
property for the table:

* [CREATE TABLE](../sql-reference/sql/create-table.md)
* [ALTER TABLE](../sql-reference/sql/alter-table.md)
* [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) (Snowflake-managed only)
* [ALTER ICEBERG TABLE](../sql-reference/sql/alter-iceberg-table.md) (Snowflake-managed only)

The following examples configure DML error logging for tables and show how errors are logged in
error tables:

* Log errors when inserting rows directly
* Log errors when inserting rows from one table into another table

The following examples configure DML error logging for tables and show how errors are logged in error tables:

#### Log errors when inserting rows directly

The following example logs errors when inserting rows directly into a table:

1. Create a table and turn on DML error logging for it:

   ```sqlexample
   CREATE TABLE test_dml_error_logging(
     n NUMBER(4, 0) NOT NULL,
     t VARCHAR(5)
     )
     ERROR_LOGGING = true;
   ```
2. Run an INSERT statement that tries to insert several rows, including both valid and invalid
   values:

   ```sqlexample
   INSERT INTO test_dml_error_logging
     VALUES
       ('invalid_cast', '1'),
       (10, 'valid'),
       (NULL, 'toolong');
   ```

   ```output
   +-------------------------+
   | number of rows inserted |
   |-------------------------|
   |                       1 |
   +-------------------------+
   ```
3. Query the table to confirm that one valid row was inserted:

   ```sqlexample
   SELECT * FROM test_dml_error_logging;
   ```

   ```output
   +----+-------+
   |  N | T     |
   |----+-------|
   | 10 | valid |
   +----+-------+
   ```
4. Query the error table for the `test_dml_error_logging` base table to view the errors that
   were logged:

   ```sqlexample
   SELECT * FROM ERROR_TABLE(test_dml_error_logging);
   ```

   ```output
   +-------------------------------+--------------------------------------+------------+----------------------------------------------------------------------+--------------------+
   | TIMESTAMP                     | QUERY_ID                             | ERROR_CODE | ERROR_METADATA                                                       | ERROR_DATA         |
   |-------------------------------+--------------------------------------+------------+----------------------------------------------------------------------+--------------------|
   | 2026-03-12 12:18:39.470 -0700 | 01c2fc06-000e-6668-0000-76b90170a28e |     100038 | {                                                                    | {                  |
   |                               |                                      |            |   "error_code": 100038,                                              |   "N": [           |
   |                               |                                      |            |   "error_message": "Numeric value 'invalid_cast' is not recognized", |     "invalid_cast" |
   |                               |                                      |            |   "error_source": "N",                                               |   ],               |
   |                               |                                      |            |   "sql_state": "22018"                                               |   "T": "1"         |
   |                               |                                      |            | }                                                                    | }                  |
   | 2026-03-12 12:18:39.470 -0700 | 01c2fc06-000e-6668-0000-76b90170a28e |     100072 | {                                                                    | {                  |
   |                               |                                      |            |   "error_code": 100072,                                              |   "N": [           |
   |                               |                                      |            |   "error_message": "NULL result in a non-nullable column",           |     null           |
   |                               |                                      |            |   "error_source": "N",                                               |   ],               |
   |                               |                                      |            |   "sql_state": "22000"                                               |   "T": [           |
   |                               |                                      |            | }                                                                    |     "toolong"      |
   |                               |                                      |            |                                                                      |   ]                |
   |                               |                                      |            |                                                                      | }                  |
   +-------------------------------+--------------------------------------+------------+----------------------------------------------------------------------+--------------------+
   ```
5. Turn off DML error logging for the `test_dml_error_logging` table:

   ```sqlexample
   ALTER TABLE test_dml_error_logging
     SET ERROR_LOGGING = false;
   ```
6. Attempt the same INSERT statement that you ran previously. An error is returned and no errors are logged in an
   error table:

   ```sqlexample
   INSERT INTO test_dml_error_logging
     VALUES
       ('invalid_cast', '1'),
       (10, 'valid'),
       (NULL, 'toolong');
   ```

   ```output
   100038 (22018): DML operation to table TEST_DML_ERROR_LOGGING failed on column N with error: Numeric value 'invalid_cast' is not recognized
   ```

#### Log errors when inserting rows from one table into another table

The following example logs errors when inserting rows from one table into another table:

1. Create a source table and insert values:

   ```sqlexample
   CREATE TABLE dml_error_logging_source(col1 INT);

   INSERT INTO dml_error_logging_source VALUES (1), (0), (-1);
   ```
2. Create a target table with the same definition as the source table:

   ```sqlexample
   CREATE TABLE dml_error_logging_target(col1 INT);
   ```
3. Turn on DML error logging on the `dml_error_logging_target` table:

   ```sqlexample
   ALTER TABLE dml_error_logging_target
     SET ERROR_LOGGING = true;
   ```
4. Insert values into the target table by querying the source table so that one of the inserts
   results in a division by zero error:

   ```sqlexample
   INSERT INTO dml_error_logging_target(col1)
     SELECT 1/col1 FROM dml_error_logging_source;
   ```

   ```output
   +-------------------------+
   | number of rows inserted |
   |-------------------------|
   |                       2 |
   +-------------------------+
   ```
5. Query the table to confirm that two valid rows were inserted:

   ```sqlexample
   SELECT * FROM dml_error_logging_target;
   ```

   ```output
   +------+
   | COL1 |
   |------|
   |    1 |
   |   -1 |
   +------+
   ```
6. Query the error table for the `dml_error_logging_target` base table to view the errors that
   were logged:

   ```sqlexample
   SELECT * FROM ERROR_TABLE(dml_error_logging_target);
   ```

   ```output
   +-------------------------------+--------------------------------------+------------+----------------------------------------+-------------+
   | TIMESTAMP                     | QUERY_ID                             | ERROR_CODE | ERROR_METADATA                         | ERROR_DATA  |
   |-------------------------------+--------------------------------------+------------+----------------------------------------+-------------|
   | 2026-03-12 12:25:56.297 -0700 | 01c2fc0d-000e-6696-0000-76b90170b64a |     100051 | {                                      | {           |
   |                               |                                      |            |   "error_code": 100051,                |   "COL1": [ |
   |                               |                                      |            |   "error_message": "Division by zero", |     1,      |
   |                               |                                      |            |   "error_source": "COL1",              |     0       |
   |                               |                                      |            |   "sql_state": "22012"                 |   ]         |
   |                               |                                      |            | }                                      | }           |
   +-------------------------------+--------------------------------------+------------+----------------------------------------+-------------+
   ```

### Error logging and error tables

When error logging is turned on for a table, Snowflake automatically creates an error table that is associated
with the base table. DML operations that encounter supported errors log the errors in the error table instead
of failing.

When DML error logging is turned on for a table, the following types of DML statements are logged:

* Single-table INSERT
* UPDATE
* MERGE

Error tables have a fixed definition and can only be accessed by the owner of the base table or a user with a role
that has been granted the SELECT ERROR TABLE privilege on the base table. The only supported direct operations on
an error table are SELECT and TRUNCATE statements. You can’t run other types of statements directly on error tables.
Error tables can’t be used indirectly in materialized views or dynamic tables.

You can copy the data out of the error table to other tables. You can remove the data in an error table
by running the TRUNCATE command.

The following sections provide more information about error logging and error tables:

#### Error table definition

Snowflake creates error tables with a standard definition that can’t be modified.

When you turn off DML error logging for a base table or drop a base table that has an error table,
the error table associated with the base table is dropped automatically.

An error table has the following columns:

| Name | Type | Description |
| --- | --- | --- |
| `timestamp` | TIMESTAMP | The timestamp of the statement that triggered the error. |
| `query_id` | VARCHAR | The unique ID of the statement that triggered the error. |
| `error_code` | NUMBER | The error code. When multiple columns in one row contain errors, this column only captures the first error that is encountered. |
| `error_metadata` | OBJECT | The error metadata.  The OBJECT values have the following structure:  ```output {   "error_code": <value>,   "error_message": "<value>",   "error_source": "<value>",   "sql_state": "<value>" } ```  The OBJECT values contain the following key-value pairs:   * `error_code`: The error code. * `error_message`: The error message. * `error_source`: The origin of the error, such as a column name. * `sql_state`: A five-character code that is modeled on the ANSI SQL standard   [SQLSTATE](https://en.wikipedia.org/wiki/SQLSTATE). Snowflake uses additional values beyond   those in the ANSI SQL standard.   When multiple columns in one row contain errors, this column only captures the first error that is encountered. |
| `error_data` | OBJECT | The data that caused the error.  The OBJECT values have the following structure:  ```output {   "<column_name>": [     <invalid_column_values>   ]   "<column_name>": <valid_column_values>   ... } ```  The OBJECT values contain the key-value pairs that represent each column in the base table. The key is the column name. For invalid column values that caused the DML operation to fail, the value in the key-value pair is an array that contains the values. Valid values are shown directly; that is, they aren’t shown in arrays.  If the data can’t be represented in an OBJECT value, the value is NULL. |

#### Interact with error tables

You can run SELECT statements and TRUNCATE statements on error tables by using the following
syntax:

```sqlsyntax
SELECT ... FROM ERROR_TABLE( <base_table_name> )

TRUNCATE [ TABLE ] [ IF EXISTS ] ERROR_TABLE( <base_table_name> )
```

Where:

`base_table_name`
:   The name of the table for which the error table was created.

For example, if the name of the base table is `my_table`, the following statement queries
the error table for this base table:

```sqlexample
SELECT * FROM ERROR_TABLE(my_table);
```

The following statement truncates the error table:

```sqlexample
TRUNCATE ERROR_TABLE(my_table);
```

#### Access control requirements for error tables

Any role that can insert into a base table can trigger inserts into its error table.
Regardless of the current role, direct inserts into an error table aren’t allowed.

The following users can run SELECT statements on an error table:

* The owner of the error table’s base table.
* Users who have been granted SELECT ERROR TABLE privilege privileges on the base table, either through a
  role or directly.

  To grant SELECT ERROR TABLE privilege on a base table, run a [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
  statement or a [GRANT <privileges> … TO USER](../sql-reference/sql/grant-privilege-user.md) statement.

  These statements use the following syntax:

  ```sqlsyntax
  GRANT SELECT ERROR TABLE ON TABLE <base_table_name> TO ROLE <role_name>

  GRANT SELECT ERROR TABLE ON TABLE <base_table_name> TO USER <user_name>
  ```

  For example, to grant SELECT ERROR TABLE privilege on a base table named `mybasetable` to a role named `myrole`,
  run the following statement:

  ```sqlexample
  GRANT SELECT ERROR TABLE ON TABLE mybasetable TO ROLE myrole;
  ```

Alternatively, to grant other roles access to an error table, the base table owner can also create a view
based on the error table and grant access to that view.

### Metadata for error logging

To determine whether error logging is turned on for a table, you can run the
[GET_DDL](../sql-reference/functions/get_ddl.md) function and pass in the name of the base table:

```sqlexample
SELECT GET_DDL('TABLE', '[<namespace>.]<base_table_name>');
```

For example, for a base table named `test_dml_error_logging` in the current schema, run the following statement:

```sqlexample
SELECT GET_DDL('TABLE', 'test_dml_error_logging');
```

```output
+--------------------------------------------------+
| GET_DDL('TABLE', 'TEST_DML_ERROR_LOGGING')       |
|--------------------------------------------------|
| create or replace TABLE TEST_DML_ERROR_LOGGING ( |
|     N NUMBER(4,0) NOT NULL,                      |
|     T VARCHAR(5)                                 |
| ) ERROR_LOGGING = true                           |
| ;                                                |
+--------------------------------------------------+
```

Metrics for error tables are recorded in the following views:

* [STORAGE_USAGE view](../sql-reference/account-usage/storage_usage.md)
* [TABLE_STORAGE_METRICS view](../sql-reference/account-usage/table_storage_metrics.md)

### Streams on error tables

[Streams](streams-intro.md) aren’t supported directly on error tables. To enable change
tracking on error tables, first create a view on the error table, and then create a stream on the view.

The following example shows you how to enable change tracking on error tables:

1. Run the [CREATE VIEW](../sql-reference/sql/create-view.md) command to create a view on the error table:

   ```sqlexample
   CREATE VIEW my_error_view AS
     SELECT timestamp,
            query_id,
            error_code,
            error_metadata,
            error_data
       FROM ERROR_TABLE(test_dml_error_logging);
   ```
2. Run the [CREATE STREAM](../sql-reference/sql/create-stream.md) command to create a stream on the view:

   ```sqlexample
   CREATE STREAM my_error_stream ON VIEW my_error_view;
   ```

### DML error logging usage notes

The following usage notes apply when error logging is turned on for a table:

* Only errors directly related to the base table are logged.
* The following types of errors are logged:

  + NOT NULL table constraint violations.
  + Type conversion errors that occur when attempting to convert a value from to the base table column.
  + Incompatible precision and scale values.
  + Incompatible length for string and binary types.
  + Some expression evaluation failures, such as division by zero or PARSE_JSON function failures.
* Multi-table INSERT and CREATE TABLE … AS SELECT (CTAS) statements run normally. They fail on DML
  errors and don’t log them.
* If you try to run a COPY INTO statement on a table with error logging enabled, the
  `Error logging is not supported in statement 'COPY INTO'` error is returned at compilation time.
* Errors that aren’t supported by DML error logging cause the DML operation to fail directly.
* If a SQL statement results in a compilation error, the operation ends and no errors are logged in the
  error table.
* Failures that occur in other ingestion paths, such as COPY and Snowpipe, aren’t logged in error tables.
  For Snowpipe Streaming high-performance error logging, see [Error logging in Snowpipe Streaming with high-performance architecture](snowpipe-streaming/snowpipe-streaming-error-tables.md).
* The following are considerations related to DML error logging and performance:

  + When DML error logging is enabled for a base table, and there are *no* errors in a DML statement that is run on the
    base table, no performance difference or very little performance difference is expected.
  + When DML error logging is enabled for a base table, and there *are* errors in a DML statement that is run on the
    base table, additional time is required to complete the DML statement because the error information is inserted into
    the error table.
* When a base table with an associated error table is cloned, the behavior is as follows:

  + The base table’s schema and content are cloned.
  + The error table’s content isn’t cloned
  + The cloned base table has the ERROR_LOGGING property turned on, which implicitly creates an empty error
    table for it.

## Schema detection of column definitions from staged semi-structured data files

Semi-structured data can include thousands of columns. Snowflake provides robust solutions for handling this data. Options include
referencing the data directly in cloud storage using external tables, loading the data into a single column of type VARIANT, or
transforming and loading the data into separate columns in a standard relational table. All of these options require some knowledge of the
column definitions in the data.

A different solution involves automatically detecting the schema in a set of staged semi-structured data files and retrieving the column
definitions. The column definitions include the names, data types, and ordering of columns in the files. Generate syntax in a format
suitable for creating Snowflake standard tables, external tables, or views.

> **Note:**
>
> This feature supports Apache Parquet, Apache Avro, ORC, JSON, and CSV files.

This support is implemented through the following SQL functions:

[INFER_SCHEMA](../sql-reference/functions/infer_schema.md)
:   Detects the column definitions in a set of staged data files and retrieves the metadata in a format suitable for creating Snowflake objects.

[GENERATE_COLUMN_DESCRIPTION](../sql-reference/functions/generate_column_description.md)
:   Generates a list of columns from a set of staged files using the INFER_SCHEMA function output.

These SQL functions support both internal and external stages.

Create tables or external tables with the column definitions derived from a set of staged files using the
[CREATE TABLE … USING TEMPLATE](../sql-reference/sql/create-table.md) or [CREATE EXTERNAL TABLE … USING TEMPLATE](../sql-reference/sql/create-external-table.md) syntax. The USING TEMPLATE clause accepts an expression that
calls the INFER_SCHEMA SQL function to detect the column definitions in the files. After the table is created, you can then use a COPY statement with the `MATCH_BY_COLUMN_NAME` option to load files directly into the structured table.

Schema detection can also be used in conjunction with [table schema evolution](data-load-schema-evolution.md), where the structure of tables evolves automatically to support the structure of new data received from the data sources.

## Alternatives to loading data

You can use the following option to query your data in cloud storage without loading it into Snowflake tables.

### External tables (data lake)

[External tables](tables-external-intro.md) enable querying existing data stored in external cloud storage for analysis without first loading it into Snowflake. The source of truth for the data remains in the external cloud storage. Data sets materialized in Snowflake via materialized views are read-only.

This solution is especially beneficial to accounts that have a large amount of data stored in external cloud storage and only want to query a portion of the data; for example, the most recent data. Users can create materialized views on subsets of this data for improved query performance.

### Working with Amazon S3-compatible storage

You can create external stages and tables in Snowflake to access storage in an application or device that is Amazon S3-compatible.
This feature lets you manage, govern, and analyze your data, regardless of where the data is stored.
For information, see [Work with Amazon S3-compatible storage](data-load-s3-compatible-storage.md).

---
title: Overview of data unloading
source: https://docs.snowflake.com/en/user-guide/data-unload-overview.md
section: User Guide
---

# Overview of data unloading

Similar to data loading, Snowflake supports bulk export (i.e. unload) of data
from a database table into flat, delimited text files.

## Bulk unloading process

The process for unloading data into files is the same as the loading process, except in reverse:

Step 1:
:   Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to copy the data from the Snowflake database table into one or more files in a Snowflake or external stage.

Step 2:
:   Download the file from the stage:

    * From a Snowflake stage, use the [GET](../sql-reference/sql/get.md) command to download the data file(s).
    * From S3, use the interfaces/tools provided by Amazon S3 to get the data file(s).
    * From Azure, use the interfaces/tools provided by Microsoft Azure to get the data file(s).

## Bulk unloading using queries

Snowflake supports specifying a SELECT statement instead of a table in the
[COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command. The results of the query
are written to one or more files as specified in the command and the
file(s) are stored in the specified location (internal or external).

SELECT queries in COPY statements support the full syntax and semantics of Snowflake SQL queries, including JOIN clauses,
which enables downloading data from multiple tables.

## Bulk unloading into single or multiple files

The COPY INTO *<location>* command provides a copy option
(SINGLE) for unloading data into a single file or multiple files. The default
is SINGLE = FALSE (i.e. unload into multiple files).

Snowflake assigns each file a unique name. The location path specified for
the command can contain a filename prefix that is assigned to all the data files
generated. If a prefix is not specified, Snowflake prefixes the generated
filenames with `data_`.

Snowflake appends a suffix that ensures each file name is unique across
parallel execution threads; e.g. `data_stats_0_1_0`.

When unloading data into multiple files, use the MAX_FILE_SIZE copy option to
specify the maximum size of each file created.

## Partitioned data unloading

The COPY INTO *<location>* command includes a PARTITION BY copy option for partitioned unloading of data to stages.

The ability to partition data during the unload operation enables a variety of use cases, such as using Snowflake to transform data for
output to a data lake. In addition, partitioning unloaded data into a directory structure in cloud storage can increase the efficiency with
which third-party tools consume the data.

The PARTITION BY copy option accepts an expression by which the unload operation partitions table rows into separate files unloaded to the
specified stage.

## Tasks for unloading data using the COPY command

For more information about the tasks associated with unloading data, see:

* [Unload into a Snowflake stage](data-unload-snowflake.md)
* [Unload into Amazon S3](data-unload-s3.md)
* [Unload into Google Cloud Storage](data-unload-gcs.md)
* [Unload into Microsoft Azure](data-unload-azure.md)

---
title: Overview of External OAuth in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/external-oauth-overview.md
section: User Guide
---

# Overview of External OAuth in Snowflake Open Catalog

This topic teaches you how to configure External OAuth servers that use OAuth 2.0 for accessing Snowflake Open Catalog.

External OAuth integrates the customer’s OAuth 2.0 server to provide a seamless SSO experience, which enables query engines access to
Open Catalog.

Open Catalog supports the following external authorization servers:

* Auth0
* Microsoft Entra ID
* Okta

## Use cases and benefits

1. Open Catalog delegates the token issuance to a dedicated authorization server to ensure that the OAuth Client and user properly
   authenticate. The result is centralized management of tokens issued to Open Catalog.
2. Clients can authenticate to Snowflake without browser access, allowing ease of integration with the External OAuth server.

## General workflow

For each of the supported identity providers, the workflow for OAuth relating to External OAuth authorization servers can be summarized as
follows. Note that the first step only occurs once and the remaining steps occur with each attempt to access Open Catalog data.

1. Configure your External OAuth authorization server in your environment and the security integration in Open Catalog to establish a trust.
2. A service principal attempts to access Open Catalog data through the client application, and the application attempts to verify the service
   principal.
3. On verification, the authorization server sends a JSON Web Token (i.e. OAuth token) to the query engine.
4. The Open Catalog driver passes a connection string to Open Catalog with the OAuth token.
5. Open Catalog validates the OAuth token.
6. Open Catalog performs a service principal lookup.
7. On verification, Open Catalog instantiates a session for the service principal to access data in Open Catalog based on its role.

---
title: Overview of federated authentication and SSO
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-overview.md
section: User Guide
---

# Overview of federated authentication and SSO

This topic describes the components that comprise a federated environment for authenticating users, and the SSO (single sign-on) workflows supported by
Snowflake.

## What is a federated environment?

In a federated environment, user authentication is separated from user access through the use of one or more external entities that provide independent
authentication of user credentials. The authentication is then passed to one or more services, enabling users to access the services through SSO. A federated
environment consists of the following components:

* Service provider (SP):
  :   In a Snowflake federated environment, Snowflake serves as the SP.
* Identity provider (IdP):
  :   The external, independent entity responsible for providing the following services to the SP:

      + Creating and maintaining user credentials and other profile information.
      + Authenticating users for SSO access to the SP.

Snowflake supports most SAML 2.0-compliant vendors as an IdP; however, certain vendors include native support for Snowflake (see below for details).

## Supported identity providers

The following vendors provide native Snowflake support for federated authentication and SSO:

* Okta
* Microsoft Entra ID

In addition to the native Snowflake support provided by Okta and Entra ID, Snowflake supports using most SAML 2.0-compliant vendors as an IdP, including:

* [Google G Suite](https://gsuite.google.com/)
* [Microsoft Entra ID](https://www.microsoft.com/en-us/security/business/identity-access/microsoft-entra-id)
* [OneLogin](https://www.onelogin.com/product/sso)
* [Ping Identity PingOne](https://www.pingidentity.com/en/products/pingone.html)

> **Note:**
>
> To use an IdP other than Okta or Entra ID, you must define a custom application for Snowflake in the IdP.

For details about configuring Okta, Entra ID, or another SAML 2.0-compliant vendor as the IdP for Snowflake, see [Configuring an identity provider (IdP) for Snowflake](admin-security-fed-auth-configure-idp.md).

## Using multiple identity providers

You can configure Snowflake so different users authenticate using different identity providers.

Once you have [configured all of the identity providers](admin-security-fed-auth-configure-idp.md), follow the guidance in
[Using multiple identity providers for federated authentication](admin-security-fed-auth-security-integration-multiple.md).

> **Note:**
>
> Currently, only a subset of Snowflake drivers support the use of multiple identity providers. These drivers include JDBC, ODBC, and Python.

## Supported SSO workflows

Federated authentication enables the following SSO workflows:

* Logging into Snowflake.
* Logging out of Snowflake.
* System timeout due to inactivity.

The behavior for each workflow is determined by whether the action is initiated within Snowflake or your IdP.

### Login workflow

When a user logs in, the behavior of the system is determined by whether the login is initiated through Snowflake or the IdP:

* Snowflake-initiated login:
  :   To log in through Snowflake:

      1. User goes to the Snowflake web interface.

         > **Note:**
         >
         > You can configure Snowflake so that a user accessing Snowflake with a URL is redirected to the IdP to authenticate without seeing
         > the Snowflake sign in page. For more information, see [ALTER ACCOUNT](../sql-reference/sql/alter-account.md).
      2. User chooses to log in using the IdP configured for your account (Okta, Entra ID, or a custom IdP).
      3. User authenticates with the IdP using their IdP credentials (e.g. email address and password).
      4. If authentication is successful, the IdP sends a SAML response to Snowflake to initiate a session and displays the Snowflake web
         interface.
* IdP-initiated login:
  :   To log in through the IdP for your account:

      1. User goes to the IdP site/application and authenticates using their IdP credentials (e.g. email address and password).
      2. In the IdP, user selects the Snowflake application (if using Okta or Entra ID) or the custom application that has been defined in the IdP (if using
         another IdP).
      3. The IdP sends a SAML response to Snowflake to initiate a session and then displays the Snowflake web interface.

### Logout workflow

When a user logs out, the available options are dictated by whether the IdP supports *global* logout or only *standard* logout:

> Standard:
> :   Requires users to explicitly log out of both the IdP and Snowflake to completely disconnect. All IdPs support standard logout.
>
> Global:
> :   Enables a user to log out of the IdP and subsequently all their Snowflake sessions. Support for global logout is IdP-dependent.

In addition, the behavior of the system is determined by whether the logout is initiated through Snowflake or the IdP:

* Snowflake-initiated logout:
  :   Global logout is not supported from within Snowflake, regardless of whether the IdP supports it. When a user logs out of a Snowflake session, they are
      logged out of that session only. All their other current Snowflake sessions stay open, as does their IdP session. As a result, they can continue working
      in their other sessions or they can initiate additional sessions without having to re-authenticate through the IdP.

      To completely disconnect, users must explicitly log out of both Snowflake and the IdP.
* IdP-initiated logout:
  :   When a user logs out through an IdP, the behavior depends on whether the IdP supports standard logout only or also global logout:

      + Entra ID supports both standard and global logout. If global logout is enabled, the Entra ID IdP login page
        provides an option for signing out from all sites
        that the user has accessed. Selecting this option logs the user out of Entra ID and all
        their Snowflake sessions. To access Snowflake again, they must re-authenticate using Entra ID.
      + Okta supports standard logout only. When a user logs out of Okta, they are not automatically logged out of any of their active
        Snowflake sessions and they can continue working. However, to initiate any new Snowflake sessions, they must authenticate again through
        Okta.
      + All custom providers support standard logout; support for global logout varies by provider.
      > **Note:**
      >
      > For a web-based IdP (for example, Okta), closing the browser tab/window does not necessarily end the IdP session. If a user’s IdP session is
      > still active, they can still access Snowflake until the IdP session times out.

### Timeout workflow

When a user’s session times out, the behavior is determined by whether it is their Snowflake session or IdP session that timed out:

* Snowflake timeout:
  :   If a users logs into Snowflake using SSO and their Snowflake session expires due to inactivity, the Snowflake web interface is disabled and the prompt for
      IdP authentication is displayed:

      + To continue using their expired Snowflake session, the user must authenticate again through the IdP.
      + The user can exit the session by selecting the Cancel button.
      + The user can also go to the IdP site/application directly and relaunch Snowflake, but this initiates a new Snowflake session.
* IdP timeout:
  :   After a specified period of time (defined by the IdP), a user’s session in the IdP automatically times out, but this does not affect their Snowflake
      sessions. Any Snowflake sessions that are active at the time remain open and do not require re-authentication. However, to initiate any new Snowflake
      sessions, the user must log into the IdP again.

## SSO with private connectivity

Snowflake supports SSO with private connectivity to the Snowflake service for Snowflake accounts on Amazon Web Services (AWS),
Microsoft Azure, and Google Cloud Platform (GCP).

Currently, for any given Snowflake account, SSO works with only one account URL at a time: either the public account URL or the URL
associated with the private connectivity service on AWS, Microsoft Azure, or Google Cloud Platform.

Snowflake supports using SSO with [organizations](organizations.md), and you can use the corresponding URL in the SAML2
security integration. For more information, see [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md).

To use SSO with private connectivity to Snowflake, configure private connectivity before configuring SSO:

* If your Snowflake account is on AWS or Azure, follow the self-service instructions as listed in
  [AWS PrivateLink and Snowflake](admin-security-privatelink.md) and [Azure Private Link and Snowflake](privatelink-azure.md).
* If your Snowflake account is on GCP, you must contact
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and provide the
  Snowflake account URL to use with [Google Cloud Private Service Connect and Snowflake](private-service-connect-google.md).

  To determine the correct URL to use, call the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function in your Snowflake
  account on GCP.

## Replicate the SSO Configuration

Snowflake supports replication and failover/failback of the
[SAML2 security integration](admin-security-fed-auth-security-integration.md) from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

---
title: Overview of key features
source: https://docs.snowflake.com/en/user-guide/intro-supported-features.md
section: User Guide
---

# Overview of key features

This topic lists the notable and significant features supported in the current release. It *doesn’t* list every feature provided
by Snowflake.

## Security, governance, and data protection

* Choose the geographical location where your data is stored, based on your [region](intro-regions.md).
* [User authentication](admin-user-management.md) through standard user/password credentials.
* Enhanced authentication:

  + [Multi-factor authentication (MFA)](security-mfa.md).
  + [Federated authentication and single sign-on (SSO)](admin-security-fed-auth-overview.md).
  + [Snowflake OAuth](oauth-snowflake-overview.md).
  + [External OAuth](oauth-ext-overview.md).
  + [Key-pair authentication](key-pair-auth.md).
  + [Authentication through programmatic access tokens](programmatic-access-tokens.md).
* All communication between clients, including all Snowflake connectors and drivers, and the server is protected through TLS.
* Deployment inside a cloud platform VPC (AWS or GCP) or VNet (Azure).
* Isolation of data (for loading and unloading) using:

  + [Amazon S3 policy controls](data-load-s3-config.md).
  + [Azure storage access controls](data-load-azure-config.md).
  + [Google Cloud Storage access permissions](data-load-gcs-config.md).
* Support for PHI data (in compliance with HIPAA and [HITRUST CSF](intro-cloud-platforms.md) regulations) — requires Business Critical
  Edition (or higher).
* Automatic [data encryption](security-encryption-end-to-end.md) by Snowflake using Snowflake-managed keys.
* [Object-level access control](security-access-control-overview.md).
* [Snowflake Time Travel](data-time-travel.md) (1 day standard for all accounts; additional days, up to 90, allowed with
  Snowflake Enterprise) for:

  + Querying historical data in tables.
  + Restoring and cloning historical data in databases, schemas, and tables.
* [Snowflake Fail-safe](data-failsafe.md) (7 days standard for all accounts) for disaster recovery of historical data.
* [Column-level Security](security-column-intro.md) to apply masking policies to columns in tables or views — requires Enterprise
  Edition (or higher).
* [Row-level Security](security-row-intro.md) to apply row access policies to tables and views — requires Enterprise Edition (or
  higher).
* [Introduction to object tagging](object-tagging/introduction.md) to apply tags to Snowflake objects to facilitate tracking sensitive data and resource usage
  — requires Enterprise Edition (or higher).
* [Differential privacy](diff-privacy/differential-privacy-overview.md) to protect data against targeted privacy attacks.
  — requires Enterprise Edition (or higher).

## Standard and extended SQL support

* Most DDL defined in SQL:1999, including:

  + [Databases, schemas, tables, and related objects](../sql-reference/sql-ddl-summary.md).
  + [Core data types](../sql-reference-data-types.md).
  + [SET operations](../sql-reference/constructs.md).
  + [CAST functions](../sql-reference/functions-conversion.md).
* [Standard DML](../sql-reference/sql-dml.md) such as UPDATE, DELETE, and INSERT, as well as more advanced DML:

  + [Multi-table INSERT, MERGE, and multi-merge](../sql-reference/sql-dml.md).
  + [DML for bulk data loading/unloading](../sql-reference/sql-dml.md).
* [Iceberg tables](tables-iceberg.md).
* [Transactions](../sql-reference/transactions.md).
* [Temporary and transient tables](../sql-reference/sql/create-table.md) for transitory data.
* [Lateral views](../sql-reference/constructs/from.md).
* [Materialized views](views-materialized.md).
* [Statistical aggregate functions](../sql-reference/functions-aggregation.md).
* [Analytical aggregates (Group by cube, rollup, and grouping sets)](../sql-reference/constructs/group-by.md).
* Parts of the SQL:2003 analytic extensions:

  + [Window functions](../sql-reference/functions-window.md).
  + [Grouping sets](../sql-reference/constructs/group-by.md).
* Scalar and tabular [user-defined functions (UDFs)](../developer-guide/udf/udf-overview.md), with support for Java, JavaScript,
  Python, Scala, and SQL.
* [Stored procedures](../developer-guide/stored-procedure/stored-procedures-overview.md) and procedural language support
  ([Snowflake Scripting](../developer-guide/snowflake-scripting/index.md))
* [Snowflake Information Schema](../sql-reference/info-schema.md) for querying object and account metadata, as well as query and warehouse usage history data.
* Recursive queries, including:

  + [CONNECT BY](../sql-reference/constructs/connect-by.md).
  + [Recursive CTE (common table expressions)](../sql-reference/constructs/with.md).
* [Collation support](../sql-reference/collation.md).
* [Geospatial data support](../sql-reference/data-types-geospatial.md).
* [User-defined types support](../sql-reference/data-types-user-defined.md).

## Tools and interfaces

* [Snowsight](ui-snowsight-quick-tour.md) for account and general management, monitoring of resources and system usage, and
  querying data.
* [Snowflake CLI (open source command-line client)](../developer-guide/snowflake-cli/index.md).
* [SnowSQL (Python-based command line client)](snowsql.md).
* Virtual warehouse management from the GUI or command line, including
  [creating, resizing (with zero downtime), suspending, and dropping](warehouses.md) warehouses.
* [Snowflake Extension for Visual Studio Code](vscode-ext.md) - Detailed instructions for installing, configuring and using the Snowflake Extension for Visual Studio Code.

## Apps and extensibility

* [APIs for Java, Python, and Scala](../developer-guide/snowpark/index.md) with which you can build applications that process data in
  Snowflake without moving data to the system where your application code runs.
* A [framework for creating applications](../developer-guide/native-apps/native-apps-about.md)
  to share data content and application logic with other Snowflake accounts.
* A [RESTful API](../developer-guide/sql-api/index.md) for accessing and updating data.
* Support for running [Streamlit apps natively in Snowflake](../developer-guide/streamlit/about-streamlit.md)
  to create and share custom web apps for machine learning and data science.
* Support for [developing procedures and user-defined functions (UDFs)](../developer-guide/extensibility.md) with a handler in one of
  several programming languages.
* Extensive set of client connectors and drivers provided by Snowflake:

  + [Python connector](../developer-guide/python-connector/python-connector.md)
  + [Spark connector](spark-connector.md)
  + [Node.js driver](../developer-guide/node-js/nodejs-driver.md)
  + [Go Snowflake driver](../developer-guide/golang/go-driver.md)
  + [.NET driver](../developer-guide/dotnet/dotnet-driver.md)
  + [JDBC client driver](../developer-guide/jdbc/jdbc.md)
  + [ODBC client driver](../developer-guide/odbc/odbc.md)
  + [PHP PDO driver](../developer-guide/php-pdo/php-pdo-driver.md)
* [Snowpark Container Services](../developer-guide/snowpark-container-services/overview.md) is a fully managed container offering that helps you easily deploy, manage, and scale containerized applications.

## Connectivity

* Broad [ecosystem](ecosystem.md) of supported 3rd-party partners and technologies.
* Support for using free trials to [connect to selected partners](ecosystem-partner-connect.md).

## Data import and export

* Support for bulk [loading](../guides-overview-loading-data.md) and [unloading](data-unload-overview.md) data into/out of tables, including:

  + Load any data that uses a supported character encoding.
  + Load data from compressed files.
  + Load most flat, delimited data files (CSV, TSV, etc.).
  + Load data files in JSON, Avro, ORC, Parquet, and XML format.
  + Load from files in cloud storage or local files using the Snowflake web interface or command-line client.
* Support for continuous data loading from files:

  + Use [Snowpipe](data-load-snowpipe-intro.md) to load data in micro-batches from internal (i.e. Snowflake) stages or external
    (Amazon S3, Google Cloud Storage, or Microsoft Azure) stages.
* Support for accessing data in [S3-compatible storage](data-load-s3-compatible-storage.md).

## Data sharing

* Support for both [sharing data in secured objects](../guides-overview-sharing.md) and [sharing data in non-secure views](../guides-overview-sharing.md) with other Snowflake accounts:

  + Provide data to other accounts to consume.
  + Consume data provided by other accounts.
* Support for collaborators using [Snowflake Data Clean Rooms](cleanrooms/overview.md) to share data in a privacy-preserving environment.

## Replication and failover

* Support for [replication and failover](account-replication-intro.md) across multiple Snowflake accounts in
  different [regions](intro-regions.md) and [cloud platforms](intro-cloud-platforms.md):

  + Replicate objects between Snowflake accounts (within the same organization) and keep the objects and stored data synchronized.
  + Configure failover to one or more Snowflake accounts for business continuity and disaster recovery.

---
title: Overview of key pair authentication in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/key-pair-auth-overview.md
section: User Guide
---

# Overview of key pair authentication in Snowflake Open Catalog

This topic describes using key pair authentication and key pair rotation in Snowflake Open Catalog.

Open Catalog supports using key pair authentication for enhanced authentication security as an alternative to
using a client ID and secret.

This authentication method requires, as a minimum, a 2048-bit RSA key pair. You can generate the Privacy Enhanced Mail (PEM) private-public
key pair using OpenSSL. The public key is assigned to the Open Catalog user who uses a client application to connect and authenticate
to Snowflake.

---
title: Overview of semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/overview.md
section: User Guide
---

# Overview of semantic views

You can store semantic business concepts directly in the database in a *Semantic View*, which is a schema-level object. You
can define business metrics and model business entities and their relationships. By adding business meaning to physical data,
the semantic view enhances data-driven decisions and provides consistent business definitions across enterprise applications.

You can use Semantic Views in [Cortex Analyst](../snowflake-cortex/cortex-analyst.md) and
[query these views](querying.md) in a SELECT statement. You can also [share Semantic Views](sharing-semantic-views.md) in [private listings](../../collaboration/provider-listings-creating-publishing.md), in public listings on the [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace), and in [organizational listings](../collaboration/listings/organizational/org-listing-about.md).

To create and manage Semantic Views, you can use SQL commands (such as
[CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md)) or a wizard in Snowsight that guides you through the process of creating
a semantic view.

> **Note:**
>
> Semantic views are considered [metadata](../../sql-reference/metadata.md).

## Why use Semantic Views?

Semantic views address the mismatch between how business users describe data and how it’s stored in database schemas. For
example, for a critical business concept like *gross revenue*, the data might be stored in a table column named
`amt_ttl_pre_dsc` in the database, making it difficult for business users to find and interpret.

Additionally, if *net revenue* within a company always means gross revenue after discounts, the semantic view can define it
consistently as a metric with the correct aggregation: `SUM(gross_revenue * (1 - discount))`. This ensures a single
authoritative definition with proper aggregation behavior. For example, when a user asks for “Net Revenue by Region,” the semantic view knows to aggregate at the appropriate level. Without a semantic view, dozens of inconsistent calculations might exist throughout different reports and applications, often with incorrect aggregation methods leading to erroneous results.

This business-focused abstraction layer solves several common problems:

* **For AI applications:** Semantic views improve accuracy by combining LLM reasoning with rule-based definitions. Currently,
  [Cortex Analyst](../snowflake-cortex/cortex-analyst.md) reads the information captured in the semantic view
  definition and generates the SQL against the physical tables directly.
* **For business intelligence (BI):** Business users benefit from consistent metrics and dimensions across all tools. They can
  easily combine these predefined business concepts in their familiar BI interfaces to explore data and gain insights.
* **For technical analysts:** The centralized location for business logic reduces duplication of metric definitions across
  queries and simplifies complex schema relationships, making it easier to build and maintain data models.

## Understanding Semantic Views

> **Note:**
>
> Throughout this topic, database-related artifacts (such as database tables) are referred to as *physical objects*, and
> artifacts related to the semantic view are referred to as *logical objects*.

Within a semantic view, you define logical tables that typically correspond to business entities, such as customers, orders, or
suppliers. You can define relationships between logical tables through joins on shared keys, enabling you to analyze data
across entities (as you would when joining database tables).

Using logical tables, you can define:

* **Facts:** Facts are row-level attributes in your data model that represent specific business events or transactions. While
  facts can be defined using aggregates from more detailed levels of data (such as `SUM(t.x)` where `t` represents data
  at a more detailed level), they are always presented as attributes at the individual row level of the logical table. Facts
  capture “how much” or “how many” at the most granular level, such as individual sales amounts, quantities purchased, or costs.
  It’s important to note that facts typically function as “helper” concepts within the semantic view to help construct
  dimensions and metrics.
* **Metrics:** Metrics are quantifiable measures of business performance calculated by aggregating facts or other columns from
  the same table (using functions like [SUM](../../sql-reference/functions/sum.md), [AVG](../../sql-reference/functions/avg.md), and
  [COUNT](../../sql-reference/functions/count.md)) across multiple rows. They transform raw data into meaningful business indicators,
  often combining multiple calculations in complex formulas. Examples include *Total Revenue* or *Profit Margin Percentage*.
  Metrics represent the KPIs in reports and dashboards that drive business decision-making.
* **Dimensions:** Dimensions represent categorical attributes. They provide the contextual framework that gives meaning to
  metrics by grouping data into meaningful categories. They answer “who,” “what,” “where,” and “when” questions, such as
  purchase date, customer details, product category, or location. Typically text-based or hierarchical, dimensions enable
  users to filter, group, and analyze data from multiple perspectives.

In a semantic view, these three elements have distinct roles, but metrics and dimensions are the primary elements that you
interact with when analyzing data through the semantic view. Facts provide the underlying row-level numerical data, metrics
transform data into actionable insights through aggregation and calculation, and dimensions determine viewing perspectives.

For more information about these concepts, see the [YAML Specification for Semantic Views](semantic-view-yaml-spec.md).

## Interfaces for working with Semantic Views

You can use the following interfaces to create, manage, and use Semantic Views:

* **SQL commands**: You can use SQL commands to create and manage Semantic Views directly. For information, see [Using SQL commands to create and manage semantic views](sql.md).

  You can also execute a SELECT statement to [query a semantic view](querying.md).
* **Snowsight**: You can use a wizard in Snowsight that guides you through the process of creating a semantic view.
  You can also upload a [YAML specification](semantic-view-yaml-spec.md) that defines your semantic view. For information,
  see [Using Snowsight to create and manage semantic views](ui.md).
* **Cortex Analyst REST API**: To use a semantic view with Cortex Analyst, you specify the view in the REST API request. For
  information, see [Cortex Analyst REST API](../snowflake-cortex/cortex-analyst/rest-api.md).

## Getting started

To get started with Semantic Views:

1. Design your business data model.

   * What business entities exist in your data (for example, customers, products, orders, and so on)?
   * How do these entities relate to each other?
   * What metrics are important to your business?
   * What dimensions do you use to analyze these metrics?
2. Map your business concepts to your physical data.

   * Which tables contain the data you need? We recommend starting with a simple star schema.
   * How will you join these tables?
   * What calculations are needed to derive your metrics?
3. Create a semantic view.

   You can use one of these interfaces to create a semantic view.
4. Use the semantic view in the following ways:

   * Use Cortex Analyst for natural language queries of your semantic view.

     You can use the [Cortex Analyst REST API](../snowflake-cortex/cortex-analyst/rest-api.md) to perform a natural
     language query that uses your semantic view.

     If you need to monitor the REST API requests that use your semantic view, see
     [Cortex Analyst administrator monitoring](../snowflake-cortex/cortex-analyst/admin-observability.md).
   * Query the semantic view in a SELECT statement. For information, see [Querying semantic views](querying.md).

## Additional information about Semantic Views

For additional information about Semantic Views, see the following topics:

* [Using Snowsight to create and manage semantic views](ui.md)
* [Using SQL commands to create and manage semantic views](sql.md)
* [How Snowflake validates semantic views](validation-rules.md)
* [Example of using SQL to create a semantic view](example.md)
* [Querying semantic views](querying.md)

For information about the privileges required to work with Semantic Views, see the following sections:

* [Privileges required to create or replace a semantic view](sql.md)
* [Privileges required to query a semantic view](querying.md)
* [Granting privileges on semantic views](sql.md)

For reference information about the SQL commands and views for Semantic Views, see the following topics:

* Documentation on SQL commands:

  + [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md)
  + [ALTER SEMANTIC VIEW](../../sql-reference/sql/alter-semantic-view.md)
  + [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md)
  + [DROP SEMANTIC VIEW](../../sql-reference/sql/drop-semantic-view.md)
  + [SHOW SEMANTIC VIEWS](../../sql-reference/sql/show-semantic-views.md)
  + [SHOW SEMANTIC DIMENSIONS](../../sql-reference/sql/show-semantic-dimensions.md)
  + [SHOW SEMANTIC METRICS](../../sql-reference/sql/show-semantic-metrics.md)
  + [SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md)
* Documentation on ACCOUNT_USAGE views:

  + [SEMANTIC_VIEWS view](../../sql-reference/account-usage/semantic_views.md)
  + [SEMANTIC_TABLES view](../../sql-reference/account-usage/semantic_tables.md)
  + [SEMANTIC_RELATIONSHIPS view](../../sql-reference/account-usage/semantic_relationships.md)
  + [SEMANTIC_FACTS view](../../sql-reference/account-usage/semantic_facts.md)
  + [SEMANTIC_DIMENSIONS view](../../sql-reference/account-usage/semantic_dimensions.md)
  + [SEMANTIC_METRICS view](../../sql-reference/account-usage/semantic_metrics.md)
* Documentation on INFORMATION_SCHEMA views:

  + [SEMANTIC_VIEWS view](../../sql-reference/info-schema/semantic_views.md)
  + [SEMANTIC_TABLES view](../../sql-reference/info-schema/semantic_tables.md)
  + [SEMANTIC_RELATIONSHIPS view](../../sql-reference/info-schema/semantic_relationships.md)
  + [SEMANTIC_FACTS view](../../sql-reference/info-schema/semantic_facts.md)
  + [SEMANTIC_DIMENSIONS view](../../sql-reference/info-schema/semantic_dimensions.md)
  + [SEMANTIC_METRICS view](../../sql-reference/info-schema/semantic_metrics.md)

---
title: Overview of Snowflake authentication
source: https://docs.snowflake.com/en/user-guide/security-authentication-overview.md
section: User Guide
---

# Overview of Snowflake authentication

The following sections describe the authentication methods that users and applications can use to access Snowflake. They also provide key
considerations to help you select the best authentication method for your use case.

## Choosing authentication for Snowsight

Snowsight is the user interface for Snowflake. This section provides an overview of the authentication methods that users can use
to sign in to Snowsight, followed by a comparison of the methods.

> **Note:**
>
> When you create a Snowflake user object for a person authenticating to Snowsight, specify `TYPE = PERSON`. For more information
> about user types, see [Types of users](admin-user-management.md).

Single sign-on (SSO)
:   With SSO for Snowsight, users authenticate with a third-party identity provider (IdP) rather than authenticating
    with Snowflake directly. When a user accesses Snowsight, the sign-in page includes an option to authenticate with the IdP instead
    of a Snowflake-managed password. The IdP confirms the user’s identity, and then sends a Security Assertion Markup Language (SAML)
    assertion to Snowflake. Because Snowflake and the IdP have a previously established relationship of trust, Snowflake accepts the assertion
    as proof of the user’s identity, and allows the user to access Snowsight.

    Some organizations use the same IdP to provide an SSO experience for all of the organization’s applications. These organizations can
    simply add Snowflake as a new service provider (SP) to allow its employees to use the IdP to access Snowsight.

Username and password with multi-factor authentication (MFA)
:   Password authentication lets users access Snowsight by entering a string of characters that conform to the requirements enforced
    by a password policy. To strengthen the security of this authentication method, Snowflake requires MFA for all
    password users. With MFA, the user enters a password, and then uses a second factor of authentication to confirm their identity. For
    example, a user might use a passkey stored on their computer as the second factor of authentication.

The following table compares authentication methods that users can use to sign in to Snowsight:

| Method | Advantages | Challenges |
| --- | --- | --- |
| Single sign-on . Preferred option | Lets an organization centrally manage authentication. A user authenticates with the same IdP for all of the organization’s applications, not just Snowflake.  Ideal for organizations that already use an IdP to provide SSO for applications. | Requires configuration of a third-party IdP. |
| Password with MFA | Simple implementation. | If passwords are managed by Snowflake, an organization must repeat authentication setup for all of its applications. |

## Overview of authentication methods for applications

In this topic, *application* refers to anything that accesses Snowflake data programmatically rather than
through the Snowsight user interface. This definition includes custom web applications, third-party multi-tenant applications,
desktop applications, local scripts, and workloads in the cloud.

When discussing available authentication methods, this topic distinguishes between two types of applications:

> * An *interactive application* that interacts with a person and authenticates to Snowflake on behalf of that person; for example, a
>   business intelligence (BI) tool that interacts with analysts.
> * A *service-to-service application* that doesn’t interact with a person and has a dedicated authentication method for the service; for
>   example, a CI/CD pipeline.

Workload identity federation (WIF)
:   Workload identity federation is a form of secretless authentication, and is highly secure because it leverages short-lived credentials
    that are already available to cloud workloads. It eliminates the need to manage and rotate secrets.

    When a workload is running on a cloud provider like AWS EC2, Microsoft Azure VMs, or Google Cloud VMs, workload identity federation lets
    the workload authenticate to Snowflake by using the cloud provider’s native identity mechanism. For example, a workload running on AWS
    EC2 can obtain an attestation — that is, proof of its identity — from an AWS Identity and Access Management (IAM) role that is
    associated with the workload. The workload’s driver obtains the attestation from the native identity mechanism and then sends it to
    Snowflake to authenticate the workload.

    Workload identity federation also allows third-party workloads like GitHub Actions and workloads running in Kubernetes to authenticate
    with an OpenID Connect-compliant identity provider (IdP), in a process known as *OIDC federation*. Snowflake accepts ID tokens generated
    by the IdP as proof of the workload’s identity.

    **Suitable for**:

    * Service-to-service applications

OAuth using Snowflake as the authorization server (Snowflake OAuth)
:   Snowflake OAuth provides the security of the [OAuth 2.0 Authorization Framework](https://datatracker.ietf.org/doc/html/rfc6749). With
    Snowflake OAuth, Snowflake is both the authorization server that authenticates a Snowflake user and the resource server that accepts an
    access token from the client to access that user’s data. Snowflake OAuth lets the client use the authorization code grant type.

    Because Snowflake is the authorization server, the user who is interacting with the application uses the Snowflake user interface to
    authenticate. You can configure Snowflake to authenticate the user with single sign-on (SSO) or a password. For information about the
    advantages and challenges of SSO and password authentication, see Choosing authentication for Snowsight.

    **Suitable for**:

    * Interactive applications

OAuth using a third-party authorization server (External OAuth)
:   External OAuth also provides the security of OAuth 2.0, but a third-party IdP, not Snowflake, acts as the authorization
    server. An application obtains an access token from the third-party IdP, then uses the token to access Snowflake as the resource.

    A service-to-service application could use the client credentials grant type to access its own Snowflake data. An interactive application
    could use the authorization code grant type to access the Snowflake data of a person who is using the application.

    **Suitable for**:

    * Interactive applications
    * Service-to-service applications

Key-pair authentication
:   Key-pair authentication relies on a *cryptographic key pair*: a private key and a public key. The private key is a secret kept by the
    application, while the public key is associated with a Snowflake user object. During authentication, the application sends proof that it
    has the private key, and Snowflake responds by verifying that the private key corresponds to the public key associated with the Snowflake
    user. This authentication method eliminates the need to transmit or store passwords, reducing the risk of credential theft.

    **Suitable for**:

    * Interactive applications
    * Service-to-service applications

Programmatic access tokens (PATs)
:   A PAT is a time-limited credential that allows applications to authenticate without a password. A PAT can be
    used as a drop-in replacement for a single-factor password in scenarios where MFA or more secure methods of
    authentication won’t work. A PAT is stronger than a password because it is a short-lived credential, requires that you implement additional
    security measures, and can be scoped to a specific access control role.

    **Suitable for**:

    * Interactive applications
    * Service-to-service applications

## Choosing authentication for interactive applications

An interactive application is one that interacts with a person and authenticates to Snowflake on behalf of that person. The following table
provides the advantages and challenges associated with authentication methods that you can use for interactive applications. For an
overview of these authentication methods, see Overview of authentication methods for applications.

> **Note:**
>
> When you create Snowflake user objects for the people who are using an interactive app, specify `TYPE = PERSON`. For more information
> about user types, see [Types of users](admin-user-management.md).

| Method | Advantages | Challenges |
| --- | --- | --- |
| Snowflake OAuth . Strong option | * Can be simpler to implement than External OAuth. * Local applications, such as a script running in VS Code, can use a built-in implementation of Snowflake OAuth, which provides the   security of OAuth without administrative setup. [Learn more](oauth-local-applications.md) * Avoids driver limitations. * User can authorize access with single sign-on (SSO), which allows them to use the secure authentication methods of a third-party   IdP. | None. |
| External OAuth . Strong option | * If the IdP supports it, the person using the application can use a secretless form of authentication. * Ideal for organizations that already use a third-party IdP as an authorization server for their applications. | Requires expertise in configuring a third-party IdP as an authorization server. |
| Programmatic access token (PAT) | * Easy replacement for single-factor passwords. * Snowflake-generated credential, so it can’t be reused outside of Snowflake. * Can be scoped to a specific access control role to limit damage if it is compromised. * Snowflake *requires* that you implement additional security measures to mitigate the risks of using long-lived secrets. * [GitHub secret scanner program](https://docs.github.com/en/code-security/secret-scanning/secret-scanning-partnership-program/secret-scanning-partner-program)   automatically detects leaked Snowflake PATs in public repositories, disables them, and notifies Snowflake administrators. | * Unlike key-pair, must be secured on both client and server. * Unlike key-pair, the secret must be sent in the request to Snowflake, increasing exposure. * If compromised, anyone with possession can impersonate the application. * Security risks associated with long-lived credentials must be mitigated with other security measures like a robust storage and   rotation strategy. * Because network policies are required, if you have a multi-tenant cloud application, you must provide your customers with your   IP addresses so they can create a network policy that allows those address ranges. * Input field that accepts PATs must be at least 256 characters. |
| Key-pair | * Flexible authentication method. * Passwordless credential that isn’t exposed in a request. | * Not usually used for interactive applications. * Security risks associated with long-lived credentials must be mitigated with other security measures like network policies and a   robust storage and rotation strategy. Unlike programmatic access tokens, key-pair doesn’t *require* additional measures, which can   result in less secure authentication. |

## Choosing authentication for service-to-service applications

A service-to-service application doesn’t interact with a person and has a dedicated authentication method for the service. The following
table provides the advantages and challenges associated with authentication methods that you can use for service-to-service applications.
For an overview of these authentication methods, see Overview of authentication methods for applications.

> **Note:**
>
> When you create a Snowflake user object for a service-to-service application, specify `TYPE = SERVICE`. For more information about user
> types, see [Types of users](admin-user-management.md).

| Method | Advantages | Challenges |
| --- | --- | --- |
| Workload identity federation . Preferred option | * Secretless authentication. * Administrators don’t have to continuously secure and rotate client IDs and secrets. | None. |
| External OAuth . Strong option | * If the IdP supports it, the application can use a secretless form of authentication. * Ideal for organizations that already use a third-party IdP as an authorization server for their applications. | Requires expertise in configuring a third-party IdP as an authorization server. |
| Key-pair | * Flexible authentication method. * Passwordless credential that isn’t exposed in a request. | Security risks associated with long-lived credentials must be mitigated with other security measures like network policies and a robust storage and rotation strategy. Unlike programmatic access tokens, key-pair doesn’t *require* additional measures, which can result in less secure authentication. |
| Programmatic access token (PAT) | * Easy replacement for single-factor passwords. * Snowflake-generated credential, so it can’t be reused outside of Snowflake. * Can be scoped to a specific access control role to limit damage if it is compromised. * Snowflake *requires* that you implement additional security measures to mitigate the risks of using long-lived secrets. * [GitHub secret scanner program](https://docs.github.com/en/code-security/secret-scanning/secret-scanning-partnership-program/secret-scanning-partner-program)   automatically detects leaked Snowflake PATs in public repositories, disables them, and notifies Snowflake administrators. | * Unlike key-pair, must be secured on both client and server. * Unlike key-pair, the secret must be sent in the request to Snowflake, increasing exposure. * If compromised, anyone with possession can impersonate the application. * Security risks associated with long-lived credentials must be mitigated with other security measures like a robust storage and   rotation strategy. |

---
title: Overview of SSO in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/sso-overview.md
section: User Guide
---

# Overview of SSO in Snowflake Open Catalog

This topic describes single sign-on (SSO) for Snowflake Open Catalog. SSO for Open Catalog lets you integrate Open Catalog with a third-party
identity provider. This integration lets users sign in to the Open Catalog web application by using existing credentials managed by an identity provider
(IdP), so you don’t have to manage separate usernames and passwords in Open Catalog.

SSO for Open Catalog supports signing in and out of the Open Catalog UI and timing out the system due to inactivity.

## Supported identity providers

Open Catalog supports SSO for the following identity providers (IdPs):

* Auth0
* Okta
* Any other SAML-based IdP

The steps for configuring any other SAML-based IdP are similar to [configuring Auth0](sso-configure-idp.md) or
[configuring Okta](sso-configure-idp.md).

## Security integration support

Only SAML is supported.

A Snowflake security integration is an account-level object. You use the security integration to integrate with the IdP you are using to
implement SSO. For more information, see [CREATE SECURITY INTEGRATION (SAML2)](https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-saml2).

You can only use one SAML integration at a time. To see which one is enabled, see [Verify the security integration](sso-configure-open-catalog.md).

## Configuring SSO for Open Catalog

A service admin in Open Catalog can set up SSO by following these steps:

1. [Configure an identity provider (IdP) for Snowflake Open Catalog](sso-configure-idp.md). During this process, you generate values that
   you need to configure SSO in Open Catalog.
2. [Configure Snowflake Open Catalog to use SSO](sso-configure-open-catalog.md).

---
title: Overview of the data lifecycle
source: https://docs.snowflake.com/en/user-guide/data-lifecycle.md
section: User Guide
---

# Overview of the data lifecycle

Snowflake provides support for all standard SELECT, DDL, and DML operations across the lifecycle of data in the system, from organizing and
storing data to querying and working with data, as well as removing data from the system.

## Lifecycle diagram

All user data in Snowflake is logically represented as tables that you can query and modify through standard SQL interfaces. Each table
belongs to a schema which in turn belongs to a database.

## Organize data

You can organize your data into databases, schemas, and tables. Snowflake doesn’t limit the number of databases you can create or the
number of schemas you can create within a database. Snowflake also doesn’t limit the number of tables you can create in a schema.

For more information, see the following topics:

* [CREATE DATABASE](../sql-reference/sql/create-database.md)
* [ALTER DATABASE](../sql-reference/sql/alter-database.md)
* [CREATE SCHEMA](../sql-reference/sql/create-schema.md)
* [ALTER SCHEMA](../sql-reference/sql/alter-schema.md)
* [CREATE TABLE](../sql-reference/sql/create-table.md)
* [ALTER TABLE](../sql-reference/sql/alter-table.md)

## Store data

You can insert data directly into tables. In addition, Snowflake provides DML for loading data into Snowflake tables from external,
formatted files.

For more information, see the following topics:

* [INSERT](../sql-reference/sql/insert.md)
* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md)

## Query data

After data is stored in a table, you can issue SELECT statements to query the data.

For more information, see [SELECT](../sql-reference/sql/select.md).

## Work with data

After data is stored in a table, you can perform all standard DML operations on the data. In addition, Snowflake supports DDL actions,
such as cloning entire databases, schemas, and tables.

For more information, see the following topics:

* [UPDATE](../sql-reference/sql/update.md)
* [MERGE](../sql-reference/sql/merge.md)
* [DELETE](../sql-reference/sql/delete.md)
* [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md)

## Remove data

In addition to using the DML command, [DELETE](../sql-reference/sql/delete.md), to remove data from a table, you can truncate or drop an entire
table. You can also drop entire schemas and databases.

For more information, see the following topics:

* [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md)
* [DROP TABLE](../sql-reference/sql/drop-table.md)
* [DROP SCHEMA](../sql-reference/sql/drop-schema.md)
* [DROP DATABASE](../sql-reference/sql/drop-database.md)

---
title: Overview of the Kafka connector
source: https://docs.snowflake.com/en/user-guide/kafka-connector-overview.md
section: User Guide
---

# Overview of the Kafka connector

This topic provides an overview of the Apache Kafka and the Snowflake Connector for Kafka.

> **Note:**
>
> The Kafka connector is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms).

## Introduction to Apache Kafka

Apache Kafka software uses a publish and subscribe model to write and read streams of records, similar to a message queue or enterprise messaging system. Kafka allows processes to read and write messages asynchronously. A subscriber does not need to be connected directly to a publisher; a publisher can queue a message in Kafka for the subscriber to receive later.

An application publishes messages to a *topic*, and an application subscribes to a topic to receive those messages. Kafka can process, as well as transmit, messages; however, that is outside the scope of this document. Topics can be divided into *partitions* to increase scalability.

Kafka Connect is a framework for connecting Kafka with external systems, including databases. A Kafka Connect cluster is a separate cluster from the Kafka cluster. The Kafka Connect cluster supports running and scaling out connectors (components that support reading and/or writing between external systems).

The Kafka connector is designed to run in a Kafka Connect cluster to read data from Kafka topics and write the data into Snowflake tables.

Snowflake provides two versions of the connector:

* A version for the [Confluent package version of Kafka](https://www.confluent.io/hub/snowflakeinc/snowflake-kafka-connector).

  For more information about Kafka Connect, see <https://docs.confluent.io/current/connect/>.

  > **Note:**
  >
  > A hosted version of the Kafka connector is available in Confluent Cloud. For information, see <https://docs.confluent.io/current/cloud/connectors/cc-snowflake-sink.html>.
* A version for the [open source software (OSS) Apache Kafka package](https://mvnrepository.com/artifact/com.snowflake/snowflake-kafka-connector/).

  For more information about Apache Kafka, see <https://kafka.apache.org/>.

From the perspective of Snowflake, a Kafka topic produces a stream of rows to be inserted into a Snowflake table. In general, each Kafka message contains one row.

Kafka, like many message publish/subscribe platforms, allows a many-to-many relationship between publishers and subscribers. A single application can publish to many
topics, and a single application can subscribe to multiple topics. With Snowflake, the typical pattern is that one topic supplies messages (rows) for one Snowflake table.

The current version of the Kafka connector is limited to loading data into Snowflake. The Kafka connector supports two data loading methods:

* [Snowpipe](data-load-snowpipe-intro.md)
* [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

For more information, refer to [Load Data into Snowflake](../guides-overview-loading-data.md) and [Using Snowflake Connector for Kafka With Snowpipe Streaming](snowpipe-streaming/snowpipe-streaming-classic-kafka.md).

## Target tables for Kafka topics

Kafka topics can be mapped to existing Snowflake tables in the Kafka configuration. If the topics are not mapped, then the Kafka connector creates a new table for each topic using the topic name.

The connector converts the topic name to a valid Snowflake table name using the following rules:

* Lowercase topic names are converted to uppercase table names.
* If the first character in the topic name is not a letter (`a-z`, or `A-Z`) or an underscore character (`_`), then the connector prepends an underscore to the table name.
* If any character inside the topic name is not a legal character for a Snowflake table name, then that character is replaced with the underscore character. For more information about which characters are valid in table names, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

Note that if the Kafka connector needs to adjust the name of the table created for a Kafka topic, it is possible that the names of two tables in the same schema could be identical. For example, if you are reading data from topics `numbers+x` and `numbers-x`, the tables created for these topics would both be `NUMBERS_X`. To avoid accidental duplication of table names, the connector appends a suffix to the table name. The suffix is an underscore followed by a generated hash code.

> **Tip:**
>
> Snowflake recommends that, when possible, you choose topic names that follow the rules for Snowflake identifier names.

## Schema of tables for Kafka topics

The schema for a table loaded by the Kafka connector depends on the table type and how you configure the connector:

* Most tables use the default schema described in this section.
* When you use [schema detection and evolution](snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md), the schema contains columns that match the user-defined schema.
* When you ingest into an Iceberg table, the schema includes the same default columns (`record_content` and `record_metadata`). However, they are [structured type](../sql-reference/data-types-structured.md) columns instead of VARIANT.

By default, with [Snowpipe](data-load-snowpipe-intro.md) or [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md), every Snowflake table loaded by the Kafka connector has a schema consisting of two VARIANT columns:

* RECORD_CONTENT. This contains the Kafka message.
* RECORD_METADATA. This contains metadata about the message, for example, the topic from which the message was read.

If Snowflake creates the table, then the table contains only these two columns. If the user creates the table for the
Kafka Connector to add rows to, then the table can contain more than these two columns (any additional columns must
allow NULL values because data from the connector does not include values for those columns).

The RECORD_CONTENT column contains the Kafka message.

A Kafka message has an internal structure that depends upon the information being sent. For example, a message from an IoT (Internet of Things) weather sensor
might include the timestamp at which the data was recorded, the location of the sensor, the temperature, humidity, etc. A message from an inventory system
might include the product ID and the number of items sold, perhaps along with a timestamp indicating when they were sold or shipped.

Typically, each message in a specific topic has the same basic structure. Different topics typically use different structure.

Each Kafka message is passed to Snowflake in JSON format or Avro format. The Kafka connector stores that formatted information in a single column of
type [VARIANT](../sql-reference/data-types-semistructured.md). The data is not parsed, and the data is not split into multiple columns in the Snowflake table.

The RECORD_METADATA column contains the following information by default:

| Field | Java . Data Type | SQL . Data Type | Required | Description |
| --- | --- | --- | --- | --- |
| topic | String | VARCHAR | Yes | The name of the Kafka topic that the record came from. |
| partition | String | VARCHAR | Yes | The number of the partition within the topic. (Note that this is the Kafka partition, not the Snowflake micro-partition.) |
| offset | long | INTEGER | Yes | The offset in that partition. |
| CreateTime / . LogAppendTime | long | BIGINT | No | This is the timestamp associated with the message in the Kafka topic. The value is milliseconds since midnight January 1, 1970, UTC. For more information, see: <https://kafka.apache.org/0100/javadoc/org/apache/kafka/clients/producer/ProducerRecord.html> |
| SnowflakeConnectorPushTime | long | BIGINT | No | Available only when using Snowpipe Streaming. A timestamp when a record was pushed into an Ingest SDK buffer. The value is the number of milliseconds since midnight January 1, 1970, UTC. For more information, see [Estimating ingestion latency](snowpipe-streaming/snowpipe-streaming-classic-kafka.md). |
| key | String | VARCHAR | No | If the message is a Kafka KeyedMessage, this is the key for that message. In order for the connector to store the key in the RECORD_METADATA, the key.converter parameter in the [Kafka configuration properties](kafka-connector-install.md) must be set to “org.apache.kafka.connect.storage.StringConverter”; otherwise, the connector ignores keys. |
| schema_id | int | INTEGER | No | When using Avro with a schema registry to specify a schema, this is the schema’s ID in that registry. |
| headers | Object | OBJECT | No | A header is a user-defined key-value pair associated with the record. Each record can have 0, 1, or multiple headers. |

The amount of metadata recorded in the RECORD_METADATA column is configurable using optional Kafka configuration properties. For information, see [Installing and configuring the Kafka connector](kafka-connector-install.md).

The field names and values are case-sensitive.

Expressed in JSON syntax, a sample message might look similar to the following:

```sqljson
{
    "meta":
    {
        "offset": 1,
        "topic": "PressureOverloadWarning",
        "partition": 12,
        "key": "key name",
        "schema_id": 123,
        "CreateTime": 1234567890,
        "headers":
        {
            "name1": "value1",
            "name2": "value2"
        }
    },
    "content":
    {
        "ID": 62,
        "PSI": 451,
        "etc": "..."
    }
}
```

You can query the Snowflake tables directly by using the appropriate [syntax for querying VARIANT columns](querying-semistructured.md).

Here is a simple example of extracting data based on the topic in the RECORD_METADATA:

```sqlexample
select
       record_metadata:CreateTime,
       record_content:ID
    from table1
    where record_metadata:topic = 'PressureOverloadWarning';
```

The output would look similar to:

```sqlexample
+------------+-----+
| CREATETIME | ID  |
+------------+-----+
| 1234567890 | 62  |
+------------+-----+
```

Alternatively, you can extract the data from these tables, flatten the data into individual columns, and store the data in other tables, which typically are
easier to query.

## Workflow for the Kafka connector

The Kafka connector completes the following process to subscribe to Kafka topics and create Snowflake objects:

1. The Kafka connector subscribes to one or more Kafka topics based on the configuration information provided via the Kafka configuration file or command line (or the Confluent Control Center; Confluent only).
2. The connector creates the following objects for each topic:

   * One internal stage to temporarily store data files for each topic.
   * One pipe to ingest the data files for each topic partition.
   * One table for each topic. If the table specified for each topic does not exist, the connector creates it; otherwise, the connector creates the RECORD_CONTENT and RECORD_METADATA columns in the existing table and verifies that the other columns are nullable (and produces an error if they are not).

The following diagram shows the ingest flow for Kafka with the Kafka connector:

1. One or more applications publish JSON or Avro records to a Kafka cluster. The records are split into one or more topic partitions.
2. The Kafka connector buffers messages from the Kafka topics. When a threshold (time or memory or number of messages) is reached, the connector writes the messages to a temporary file in the internal stage. The connector triggers [Snowpipe](data-load-snowpipe-intro.md) to ingest the temporary file. Snowpipe copies a pointer to the data file into a queue.
3. A Snowflake-provided virtual warehouse loads data from the staged file into the target table (i.e. the table specified in the configuration file for the topic) via the pipe created for the Kafka topic partition.
4. (Not shown) The connector monitors Snowpipe and deletes each file in the internal stage after confirming that the file data was loaded into the table.

   If a failure prevented the data from loading, the connector moves the file into the table stage and produces an error message.
5. The connector repeats steps 2-4.

> **Attention:**
>
> Snowflake polls the `insertReport` API for one hour. If the status of an ingested file does not
> succeed within this hour, the files being ingested are moved to a table stage.
>
> It may take at least one hour for these files to be available on the table stage. Files are
> only moved to the table stage when their ingestion status could not be found within the
> previous hour.

## Fault tolerance

Both Kafka and the Kafka connector are fault-tolerant. Messages are neither duplicated nor silently dropped.

Data deduplication logic in the Snowpipe workflow in the data loading chain eliminates duplicate copies of repeating data except
in rare cases. If an error is detected while Snowpipe loads a record (for example, the record was not well-formed JSON or Avro), then the
record is not loaded; instead, the record is moved to a table stage.

The Kafka connector with Snowpipe Streaming supports dead-letter queues (DLQ) for error handling. For more information, refer to [Error Handling and DLQ Properties for the Kafka Connector with Snowpipe Streaming](snowpipe-streaming/snowpipe-streaming-classic-kafka.md).

### Limitations of fault tolerance with the connector

Kafka Topics can be configured with a limit on storage space or retention time.

* The default retention time is 7 days. If the system is offline for more than the retention time, then expired records will
  not be loaded. Similarly, if Kafka’s storage space limit is exceeded, some messages will not be delivered.
* If messages in the Kafka topic are deleted or updated, these changes might not be reflected in the Snowflake table.

> **Attention:**
>
> Instances of the Kafka connector do not communicate with each other. If you start multiple instances of the connector on the
> same topics or partitions, then multiple copies of the same row might be inserted into the table. This is not recommended;
> each topic should be processed by only one instance of the connector.

It is theoretically possible for messages to flow from Kafka faster than Snowflake can ingest them. In practice, however, this
is unlikely. If it does occur, then solving the problem would require performance tuning of the Kafka Connect cluster. For
example:

* Tuning the number of nodes in the Connect cluster.
* Tuning the number of tasks allocated to the connector.
* Understanding the impact of the network bandwidth between the connector and the Snowflake deployment.

> **Important:**
>
> There is no guarantee that rows are inserted in the order that they were originally published.

## Supported platforms

The Kafka connector can run in any Kafka Connect cluster, and can send data to a Snowflake account on any supported [cloud platform](intro-cloud-platforms.md).

## Protobuf data support

Kafka connector 1.5.0 (or higher) supports protocol buffers (protobuf) via a protobuf converter. For details, see [Loading protobuf data using the Snowflake Connector for Kafka](kafka-connector-protobuf.md).

## Billing information

There is no direct charge for using the Kafka connector. However, there are indirect costs:

* Snowpipe is used to load the data that the connector reads from Kafka, and Snowpipe processing time is charged to your account.
* Data storage is charged to your account.

## Kafka connector limitations

Single Message Transformations (SMTs) are applied to messages as they flow through Kafka Connect. When you configure the [Kafka configuration properties](kafka-connector-install.md), if you set either `key.converter` or `value.converter` to one of the following values, then SMTs are not supported on the corresponding key or value:

* `com.snowflake.kafka.connector.records.SnowflakeJsonConverter`
* `com.snowflake.kafka.connector.records.SnowflakeAvroConverter`
* `com.snowflake.kafka.connector.records.SnowflakeAvroConverterWithoutSchemaRegistry`

When neither `key.converter` or `value.converter` is set, then most SMTs are supported, with the current exception of `regex.router`.

Although the Snowflake converters do not support SMTs, Kafka connector version 1.4.3 (or higher) supports many community-based converters such as the following:

* `io.confluent.connect.avro.AvroConverter`
* `org.apache.kafka.connect.json.JsonConverter`

For more information about SMTs, see <https://docs.confluent.io/current/connect/transforms/index.html>.

---
title: Overview of the Snowpipe REST endpoints to load data
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-rest-overview.md
section: User Guide
---

# Overview of the Snowpipe REST endpoints to load data

This topic provides an overview of the usage details when calling the public REST endpoints to load data and retrieve load history reports.

## Authentication

Calls to the public Snowpipe REST endpoints use key-based authentication, rather than the typical username/password authentication, because the ingestion service does not maintain client sessions.

To follow the general principle of least privilege, we recommend creating a separate user and role to use for ingesting files using a pipe. The user should be created with this role as its default role, and the role should have the minimum set of permissions needed to insert files into the target table for data loading.

## Process flow

Your client application calls a public REST endpoint with a list of data filenames and a referenced pipe name (Java and Python SDKs are provided for your convenience). If new data files matching the list are discovered in the stage, they are queued for loading. Snowflake-provided compute resources load data from the queue into a Snowflake table based on parameters defined in the pipe.

The following diagram shows the Snowpipe REST API process flow:

1. Data files are copied to an internal (Snowflake) or external (Amazon S3, Google Cloud Storage, or Microsoft Azure) stage.
2. A client calls the `insertFiles` endpoint with a list of files to ingest and a defined pipe.

   The endpoint moves these files to an ingest queue.
3. A Snowflake-provided virtual warehouse loads data from the queued files into the target table based on parameters defined in the specified pipe.

## Workflow

This section provides a high-level overview of the setup and load workflow.

### Configuring Snowpipe

1. Create a named stage object where your data files will be staged. Snowpipe supports both internal (Snowflake) stages and external stages, i.e. S3 buckets.
2. Create a pipe object using [CREATE PIPE](../sql-reference/sql/create-pipe.md).
3. Configure security for the user who will execute the continuous data load. If you plan to restrict Snowpipe data loads to a single user, you only need to configure key pair authentication for the user once. After that, you only need to grant access control privileges on the database objects used for each data load.
4. Install a client SDK (Java or Python) for calling the Snowpipe public REST endpoints.

### Using the Snowpipe REST API to load data

#### Option 1: Using a client to call the REST API

Use a client to call the REST API. Java and Python SDK sample code is provided. For more information, see [Option 1: Load data with the Snowpipe REST API](data-load-snowpipe-rest-load.md).

1. Call a REST endpoint with a list of files to load when staged.
2. Retrieve the load history.

#### Option 2: Using AWS Lambda to call the REST API

Automate Snowpipe by using an AWS Lambda function to call the REST API. A Lambda function can call the REST API to load data from files stored in Amazon S3 only. For more information, see [Option 2: Automate Snowpipe with AWS Lambda](data-load-snowpipe-rest-lambda.md).

1. Create an AWS Lambda function that calls the Snowpipe REST API to load data from your external (i.e. S3) stage .
2. Retrieve the load history.

---
title: Overview of the Spark Connector
source: https://docs.snowflake.com/en/user-guide/spark-connector-overview.md
section: User Guide
---

# Overview of the Spark Connector

The Snowflake Connector for Spark enables using Snowflake as an Apache Spark data source, similar to other data sources (PostgreSQL, HDFS, S3, etc.).

> **Note:**
>
> As an alternative to using Spark, consider writing your code to use [Snowpark API](../developer-guide/snowpark/index.md) instead. Snowpark
> allows you to perform all of your work within Snowflake (rather than in a separate Spark compute cluster). Snowpark also
> supports pushdown of all operations, including Snowflake UDFs. However, when you want to enforce row and column policies on Iceberg tables,
> use the Snowflake Spark Connector. For more information, see
> [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Interaction between Snowflake and Spark

The connector supports bi-directional data movement between a Snowflake cluster and a Spark cluster. The Spark cluster can be self-hosted or accessed through another service, such as Qubole, AWS EMR,
or Databricks.

Using the connector, you can perform the following operations:

* Populate a Spark DataFrame from a table (or query) in Snowflake.
* Write the contents of a Spark DataFrame to a table in Snowflake.

The connector uses Scala 2.12.x or 2.13.x to perform these operations and uses the Snowflake JDBC driver to communicate with
Snowflake.

> **Note:**
>
> The Snowflake Connector for Spark is not strictly required to connect Snowflake and Apache Spark; other 3rd-party JDBC drivers can be used. However, we recommend using the Snowflake Connector for
> Spark because the connector, in conjunction with the Snowflake JDBC driver, has been optimized for transferring large amounts of data between the two systems. It also provides enhanced performance
> by supporting query pushdown from Spark into Snowflake.

## Data Transfer

The Snowflake Spark Connector supports two transfer modes:

* Internal transfer uses a temporary location created and managed internally/transparently by Snowflake.
* External transfer uses a storage location, usually temporary, created and managed by the user.

> **Tip:**
>
> Use external data transfer if either of the following is true:
>
> * You are using version 2.1.x or lower of the Spark Connector (which does not support internal transfer).
> * Your transfer is likely to take 36 hours or more (internal transfer uses temporary credentials that expire after 36 hours).
>
> Otherwise, we recommend using internal data transfer.

### Internal Data Transfer

The transfer of data between the two systems is facilitated through a Snowflake internal stage that the connector automatically creates and manages:

* Upon connecting to Snowflake and initializing a session in Snowflake, the connector creates the internal stage.
* Throughout the duration of the Snowflake session, the connector uses the stage to store data while transferring it to its destination.
* At the end of the Snowflake session, the connector drops the stage, thereby removing all the temporary data in the stage.

Note that support for internal transfer requires a specific version (or higher) of the connector, based on the cloud platform for your Snowflake account:

AWS:
:   The internal data transfer mode is supported only in version 2.2.0 (and higher) of the connector.

Azure:
:   The internal data transfer mode is supported only in version 2.4.0 (and higher) of the connector.

GCP:
:   The internal data transfer mode is supported only in version 2.7.0 (and higher) of the connector.

### External Data Transfer

The transfer of data between the two systems is facilitated through a storage location that the user specifies and files automatically created by the connector:

AWS:
:   Transfer data files are created and stored in an S3 bucket.

Azure:
:   Transfer data files are created and stored in a Blob storage container. External transfer via Azure is supported only in version 2.4.0 (and higher) of the connector.

The parameter(s) to specify the storage location are documented in [Setting Configuration Options for the Connector](spark-connector-use.md):

> **Note:**
>
> For external data transfer, the storage location must be created and configured as part of the Spark connector installation/configuration.
>
> Also, the files created by the connector during external transfer are intended to be temporary, but the connector does not automatically delete the files from the storage location. To delete the
> files, use any of the following methods:
>
> * Delete them manually.
> * Set the `purge` parameter for the connector. For more information about this parameter, see [Setting Configuration Options for the Connector](spark-connector-use.md).
> * Set a storage system parameter, such as the Amazon S3 lifecycle policy parameter, to clean up the files after the transfer is done.

## Column Mapping

When you copy data from a Spark table to a Snowflake table, if the column names do not match, you can map column names from Spark to Snowflake using the `columnmapping` parameter, which is
documented in [Setting Configuration Options for the Connector](spark-connector-use.md).

> **Note:**
>
> Column mapping is supported only for internal data transfer.

## Query Pushdown

For optimal performance, you typically want to avoid reading lots of data or transferring large intermediate results between systems. Ideally, most of the processing should happen close to where
the data is stored to leverage the capabilities of the participating stores to dynamically eliminate data that is not needed.

Query pushdown leverages these performance efficiencies by enabling large and complex Spark logical plans (in their entirety or
in parts) to be processed in Snowflake, thus using Snowflake to do most of the actual work.

Query pushdown is supported in Version 2.1.0 (and higher) of the Snowflake Connector for Spark.

Pushdown is not possible in all situations. For example, Spark UDFs cannot be pushed down to Snowflake. See
[Pushdown](spark-connector-use.md) for the list of operations supported for pushdown.

> **Note:**
>
> If you need pushdown for all operations, consider writing your code to use [Snowpark API](../developer-guide/snowpark/index.md) instead.
> Snowpark also supports pushdown of Snowflake UDFs.

## Databricks Integration

Databricks has integrated the Snowflake Connector for Spark into the Databricks Unified Analytics Platform to provide native connectivity between Spark and Snowflake.

For more details, including code examples using Scala and Python, see [Data Sources — Snowflake](https://docs.databricks.com/spark/latest/data-sources/snowflake.html)
(in the Databricks documentation) or [Configuring Snowflake for Spark in Databricks](spark-connector-databricks.md).

## Qubole Integration

Qubole has integrated the Snowflake Connector for Spark into the Qubole Data Service (QDS) ecosystem to provide native connectivity between Spark and Snowflake. Through this integration, Snowflake
can be added as a Spark data store directly in Qubole.

Once Snowflake has been added as a Spark data store, data engineers and data scientists can use Spark and the QDS UI, API, and Notebooks to:

* Perform advanced data transformations, such as preparing and consolidating external data sources into Snowflake, or refining and transforming Snowflake data.
* Build, train, and execute machine learning and AI models in Spark using the data that already exists in Snowflake.

For more details, see the [Qubole-Snowflake Integration Guide](http://docs.qubole.com/en/latest/partner-integration/snowflake-integration/index.html)
(in the Qubole Documentation) or [Configuring Snowflake for Spark in Qubole](spark-connector-qubole.md).

---
title: Overview of Views
source: https://docs.snowflake.com/en/user-guide/views-introduction.md
section: User Guide
---

# Overview of Views

This topic covers concepts for understanding and using views.

## What is a View?

A view allows the result of a query to be accessed as if it were a table. The query is specified in the CREATE VIEW statement.

Views serve a variety of purposes, including combining, segregating, and protecting data. For example, you can create separate views
that meet the needs of different types of employees, such as doctors and accountants at a hospital:

> ```sqlexample
> CREATE TABLE hospital_table (patient_id INTEGER,
>                              patient_name VARCHAR,
>                              billing_address VARCHAR,
>                              diagnosis VARCHAR,
>                              treatment VARCHAR,
>                              cost NUMBER(10,2));
> INSERT INTO hospital_table
>         (patient_ID, patient_name, billing_address, diagnosis, treatment, cost)
>     VALUES
>         (1, 'Mark Knopfler', '1982 Telegraph Road', 'Industrial Disease',
>             'a week of peace and quiet', 2000.00),
>         (2, 'Guido van Rossum', '37 Florida St.', 'python bite', 'anti-venom',
>             70000.00)
>         ;
> ```
>
> ```sqlexample
> CREATE VIEW doctor_view AS
>     SELECT patient_ID, patient_name, diagnosis, treatment FROM hospital_table;
>
> CREATE VIEW accountant_view AS
>     SELECT patient_ID, patient_name, billing_address, cost FROM hospital_table;
> ```

A view can be used almost anywhere that a table can be used (joins, subqueries, etc.). For example, using the views created above:

* Show all of the types of medical problems for each patient:

  > ```sqlexample
  > SELECT DISTINCT diagnosis FROM doctor_view;
  > +--------------------+
  > | DIAGNOSIS          |
  > |--------------------|
  > | Industrial Disease |
  > | python bite        |
  > +--------------------+
  > ```
* Show the cost of each treatment (without showing personally identifying information about specific patients):

  > ```sqlexample
  > SELECT treatment, cost
  >     FROM doctor_view AS dv, accountant_view AS av
  >     WHERE av.patient_ID = dv.patient_ID;
  > +---------------------------+----------+
  > | TREATMENT                 |     COST |
  > |---------------------------+----------|
  > | a week of peace and quiet |  2000.00 |
  > | anti-venom                | 70000.00 |
  > +---------------------------+----------+
  > ```

A [CREATE VIEW](../sql-reference/sql/create-view.md) command can use a fully-qualified, partly-qualified, or unqualified table
name. For example:

> ```sqlexample
> CREATE VIEW v1 AS SELECT ... FROM my_database.my_schema.my_table;
>
> CREATE VIEW v1 AS SELECT ... FROM my_schema.my_table;
>
> CREATE VIEW v1 AS SELECT ... FROM my_table;
> ```

If the schema is not specified, then Snowflake assumes that the table is in the same schema as the view.
(If the table were assumed to be in the active schema, then the view could refer to different tables at different
times.)

## Types of Views

Snowflake supports two types of views:

* Non-materialized views (usually simply referred to as “views”)
* Materialized views.

### Non-materialized Views

The term “view” generically refers to all types of views; however, the term is used here to refer specifically to non-materialized
views.

A view is basically a named definition of a query. A non-materialized view’s results are created by executing the query at the
time that the view is referenced in a query. The results are not stored for future use. Performance is slower than with materialized
views. Non-materialized views are the most common type of view.

Any query expression that returns a valid result can be used to create a non-materialized view, such as:

* Selecting some (or all) columns in a table.
* Selecting a specific range of data in table columns.
* Joining data from two or more tables.

### Materialized Views

Although a materialized view is named as though it were a type of view, in many ways it behaves more like a table. A materialized
view’s results are stored, almost as though the results were a table. This allows faster access, but requires storage space and active
maintenance, both of which incur additional costs.

In addition, materialized views have some restrictions that non-materialized views do not have.

For more details, see [Working with Materialized Views](views-materialized.md).

## Secure Views

Both non-materialized and materialized views can be defined as *secure*. Secure views have advantages over standard views, including
improved data privacy and data sharing; however, they also have some performance impacts to take into consideration.

For more details, see [Working with Secure Views](views-secure.md).

## Recursive Views (Non-materialized Views Only)

A non-materialized view can be recursive (i.e. the view can refer to itself).

Use of recursion in views is similar to the use of recursion in [recursive CTEs](queries-cte.md).
In fact, a view can be defined with a recursive CTE. For example:

> ```sqlexample
> CREATE VIEW employee_hierarchy (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>    WITH RECURSIVE employee_hierarchy_cte (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>       -- Start at the top of the hierarchy ...
>       SELECT title, employee_ID, manager_ID, NULL AS "MGR_EMP_ID (SHOULD BE SAME)", 'President' AS "MGR TITLE"
>         FROM employees
>         WHERE title = 'President'
>       UNION ALL
>       -- ... and work our way down one level at a time.
>       SELECT employees.title,
>              employees.employee_ID,
>              employees.manager_ID,
>              employee_hierarchy_cte.employee_id AS "MGR_EMP_ID (SHOULD BE SAME)",
>              employee_hierarchy_cte.title AS "MGR TITLE"
>         FROM employees INNER JOIN employee_hierarchy_cte
>        WHERE employee_hierarchy_cte.employee_ID = employees.manager_ID
>    )
>    SELECT *
>       FROM employee_hierarchy_cte
> );
> ```

Instead of using a recursive CTE, you can create a recursive view with the keyword `RECURSIVE`, for example:

> ```sqlexample
> CREATE RECURSIVE VIEW employee_hierarchy_02 (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>       -- Start at the top of the hierarchy ...
>       SELECT title, employee_ID, manager_ID, NULL AS "MGR_EMP_ID (SHOULD BE SAME)", 'President' AS "MGR TITLE"
>         FROM employees
>         WHERE title = 'President'
>       UNION ALL
>       -- ... and work our way down one level at a time.
>       SELECT employees.title,
>              employees.employee_ID,
>              employees.manager_ID,
>              employee_hierarchy_02.employee_id AS "MGR_EMP_ID (SHOULD BE SAME)",
>              employee_hierarchy_02.title AS "MGR TITLE"
>         FROM employees INNER JOIN employee_hierarchy_02
>         WHERE employee_hierarchy_02.employee_ID = employees.manager_ID
> );
> ```

For more details, including examples, see [CREATE VIEW](../sql-reference/sql/create-view.md).

## Advantages of Views

### Views Enable Writing More Modular Code

Views help you to write clearer, more modular SQL code. For example, suppose that your hospital database has a table listing information
about all employees. You can create views to make it convenient to extract information about only the medical staff or only the maintenance
staff. You can even create hierarchies of views.

For example, you can create one view for the doctors, and one for the nurses, and then create the `medical_staff` view by referring to
the doctors view and nurses view:

> ```sqlexample
> CREATE TABLE employees (id INTEGER, title VARCHAR);
> INSERT INTO employees (id, title) VALUES
>     (1, 'doctor'),
>     (2, 'nurse'),
>     (3, 'janitor')
>     ;
>
> CREATE VIEW doctors as SELECT * FROM employees WHERE title = 'doctor';
> CREATE VIEW nurses as SELECT * FROM employees WHERE title = 'nurse';
> CREATE VIEW medical_staff AS
>     SELECT * FROM doctors
>     UNION
>     SELECT * FROM nurses
>     ;
> ```
>
> ```sqlexample
> SELECT *
>     FROM medical_staff
>     ORDER BY id;
> +----+--------+
> | ID | TITLE  |
> |----+--------|
> |  1 | doctor |
> |  2 | nurse  |
> +----+--------+
> ```

In many cases, rather than writing one large and difficult-to-understand query, you can decompose the query into smaller pieces, and create
a view for each of those pieces. This not only makes the code easier to understand, but in many cases it also makes the code easier to debug
because you can debug one view at a time, rather than the entire query.

One view can be referenced by many different queries, so views help increase code re-use.

### Views Allow Granting Access to a Subset of a Table

Views allow you to grant access to just a portion of the data in a table(s). For example, suppose that you have a table of medical patient
records. The medical staff should have access to all of the medical information (for example, diagnosis) but not the financial information
(for example, the patient’s credit card number). The accounting staff should have access to the billing-related information, such as the costs
of each of the prescriptions given to the patient, but not to the private medical data, such as diagnosis of a mental health condition. You can
create two separate views, one for the medical staff, and one for the billing staff, so that each of those roles sees only the information
needed to perform their jobs. Views allow this because you can grant privileges on a particular view to a particular role, without the grantee
role having privileges on the table(s) underlying the view.

In the medical example:

* The medical staff would not have privileges on the data table(s), but would have privileges on the view showing diagnosis and treatment.
* The accounting staff would not have privileges on the data table(s), but would have privileges on the view showing billing information.

For additional security, Snowflake supports defining a view as secure. For more details about secure views, see [Working with Secure Views](views-secure.md).

> **Note:**
>
> * If a user has enough privilege to access the content of a view, but has no access to the underlying table of the view, then the user
>   cannot query the view unless the owner role of the view has access to the underlying table.
> * If a user has enough privilege to access the content of both the view and the underlying table of the view, the user can query the view
>   successfully, regardless of whether the owner role of the view has access to the underlying table.

### Materialized Views Can Improve Performance

Materialized Views are designed to improve performance. Materialized Views contain a copy of a subset of the data in a table.
Depending upon the amount of data in the table and in the materialized view, scanning the materialized view can be much faster
than scanning the table. Materialized views also support clustering, and you can create multiple materialized views on the same
data, with each materialized view being clustered on a different column, so that different queries can each run on the view with
the best clustering for that query.

For more details, see [Working with Materialized Views](views-materialized.md).

## Limitations on Views

* For limitations and usage notes related to creating views, see [CREATE VIEW](../sql-reference/sql/create-view.md).
* The definition for a view cannot be updated (i.e. you cannot use [ALTER VIEW](../sql-reference/sql/alter-view.md) or
  [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md) to change the definition of a view). To change a view definition, you must recreate
  the view with the new definition.
* Views are read-only (i.e. you cannot execute DML commands directly on a view). However, you can use a view in a subquery within a DML
  statement that updates the underlying base table. For example:

  ```sqlexample
  DELETE FROM hospital_table
      WHERE cost > (SELECT AVG(cost) FROM accountant_view);
  ```
* Changes to a table are not automatically propagated to views created on that table. For example, if you drop a column in a table, the
  views on that table might become invalid.

---
title: Overview of warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses-overview.md
section: User Guide
---

# Overview of warehouses

Warehouses are required for queries, as well as all DML operations, including loading data into tables. In addition to being defined by its
type as either Standard or Snowpark-optimized, a warehouse is defined by its size, as well as the other properties that can be set to help
control and automate warehouse activity.

Warehouses can be started and stopped at any time. They can also be resized at any time, even while running, to accommodate the need for more
or less compute resources, based on the type of operations being performed by the warehouse.

## Warehouse size

Size specifies the amount of compute resources available per cluster in a warehouse. Snowflake supports the following warehouse sizes:

| Warehouse size | Credits / hour (Gen1 warehouses) | Credits / second (Gen1 warehouses) | Notes |
| --- | --- | --- | --- |
| X-Small | 1 | 0.0003 | Default size for warehouses created in Snowsight and using [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md). |
| Small | 2 | 0.0006 |  |
| Medium | 4 | 0.0011 |  |
| Large | 8 | 0.0022 |  |
| X-Large | 16 | 0.0044 | Default size for warehouses created using Snowsight. |
| 2X-Large | 32 | 0.0089 |  |
| 3X-Large | 64 | 0.0178 |  |
| 4X-Large | 128 | 0.0356 |  |
| 5X-Large | 256 | 0.0711 | Generally available in Amazon Web Services (AWS) and Microsoft Azure regions, and in preview in US Government regions. |
| 6X-Large | 512 | 0.1422 | Generally available in Amazon Web Services (AWS) and Microsoft Azure regions, and in preview in US Government regions. |

The numbers in the preceding table refer to the first generation (Gen1) of Snowflake standard warehouses.
For usage information about the newer Gen2 warehouses, see [Snowflake generation 2 standard warehouses](warehouses-gen2.md).
For information about credit consumption for generation 2 standard warehouses,
see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
Gen2 warehouses aren’t yet available for all cloud service providers or for all regions, and currently are not the default
when you create a standard warehouse.

> **Tip:**
>
> For information about cost implications of changing the RESOURCE_CONSTRAINT property, see
> [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](warehouses-gen2.md).

Another way that you can scale the capacity of Snowflake warehouses without changing the warehouse size is by using
multi-cluster warehouses. For more information about that feature, see [Multi-cluster warehouses](warehouses-multicluster.md).

### Larger warehouse sizes

Larger warehouse sizes 5X-Large and 6X-Large are generally available in all Amazon Web Services (AWS) and Microsoft Azure regions.

Larger warehouse sizes are in preview in US Government regions (requires FIPS support on ARM).

### Impact on credit usage and billing

As shown in the above table, there is a doubling of credit usage as you increase in size to the next larger warehouse size for each full
hour that the warehouse runs; however, note that Snowflake utilizes per-second billing (with a 60-second minimum each time the warehouse
starts) so warehouses are billed only for the credits they actually consume.

The total number of credits billed depends on how long the warehouse runs continuously. For comparison purposes, the following table shows
the billing totals for three different size Gen1 standard warehouses based on their running time (totals rounded to the nearest 1000th of a credit):

| Running Time | Credits . (X-Small) | Credits . (X-Large) | Credits . (5X-Large) |
| --- | --- | --- | --- |
| 0-60 seconds | 0.017 | 0.267 | 4.268 |
| 61 seconds | 0.017 | 0.271 | 4.336 |
| 2 minutes | 0.033 | 0.533 | 8.532 |
| 10 minutes | 0.167 | 2.667 | 42.668 |
| 1 hour | 1.000 | 16.000 | 256.000 |

> **Note:**
>
> * For a [multi-cluster warehouse](warehouses-multicluster.md), the number of credits billed is calculated based on the
>   warehouse size and the number of clusters that run within the time period.
>
>   For example, if a 3X-Large multi-cluster warehouse runs 1 cluster for one full hour and then runs 2 clusters for the next full
>   hour, the total number of credits billed would be 192 (i.e. 64 + 128).
>
>   Multi-cluster warehouses are an [Enterprise Edition](intro-editions.md) feature.

### Impact on data loading

Increasing the size of a warehouse does not always improve data loading performance. Data loading performance is influenced more by
the number of files being loaded (and the size of each file) than the size of the warehouse.

> **Tip:**
>
> Unless you are bulk loading a large number of files concurrently (i.e. hundreds or thousands of files), a smaller warehouse
> (Small, Medium, Large) is generally sufficient. Using a larger warehouse (X-Large, 2X-Large, etc.) will consume more credits and may not
> result in any performance increase.
>
> For more data loading tips and guidelines, see [Data loading considerations](data-load-considerations.md).

### Impact on query processing

The size of a warehouse can impact the amount of time required to execute queries submitted to the warehouse, particularly for larger, more
complex queries. In general, query performance scales with warehouse size because larger warehouses have more compute resources available to
process queries.

If queries processed by a warehouse are running slowly, you can always resize the warehouse to provision more compute resources. The
additional resources do not impact any queries that are already running, but once they are fully provisioned they become available for use
by any queries that are queued or newly submitted.

> **Tip:**
>
> Larger is not necessarily faster for small, basic queries.
>
> For more warehouse tips and guidelines, see [Warehouse considerations](warehouses-considerations.md).

## Auto-suspension and auto-resumption

You can set a warehouse to automatically resume or suspend, based on activity:

* By default, auto-suspend is enabled. Snowflake automatically suspends the warehouse if it is inactive for the specified period of time.
* By default, auto-resume is enabled. Snowflake automatically resumes the warehouse when any statement that requires a warehouse is submitted
  and the warehouse is the current warehouse for the session.

These properties can be used to simplify and automate your monitoring and usage of warehouses to match your workload. Auto-suspend ensures
that you don’t leave a warehouse running (and consuming credits) when there are no incoming queries. Similarly, auto-resume ensures that
the warehouse starts up again as soon as it is needed.

> **Note:**
>
> Auto-suspend and auto-resume apply only to the entire warehouse and not to the individual clusters in the warehouse.
> For a [multi-cluster warehouse](warehouses-multicluster.md):
>
> * Auto-suspend only occurs when the minimum number of clusters is running and there is no activity for the specified period of time. The
>   minimum is typically 1 (cluster), but could be more than 1.
> * Auto-resume only applies when the entire warehouse is suspended (i.e. no clusters are running).

## Query processing and concurrency

The number of queries that a warehouse can concurrently process is determined by the size and complexity of each query. As queries are
submitted, the warehouse calculates and reserves the compute resources needed to process each query. If the warehouse does not have enough
remaining resources to process a query, the query is queued, pending resources that become available as other running queries complete.

Snowflake provides some object-level parameters that can be set to help control query processing and concurrency:

* [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md)
* [STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md)

> **Note:**
>
> If queries are queuing more than desired, another warehouse can be created and queries can be manually redirected to the new warehouse.
> In addition, resizing a warehouse can enable limited scaling for query concurrency and queuing; however, warehouse resizing is primarily
> intended for improving query performance.
>
> To enable fully automated scaling for concurrency, Snowflake recommends [multi-cluster warehouses](warehouses-multicluster.md),
> which provide essentially the same benefits as creating additional warehouses and redirecting queries, but without requiring manual
> intervention.
>
> Multi-cluster warehouses are an [Enterprise Edition](intro-editions.md) feature.

## Warehouse usage in sessions

When a session is initiated in Snowflake, the session does not, by default, have a warehouse associated with it. Until a session has a
warehouse associated with it, queries cannot be submitted within the session.

### Default warehouse for users

To facilitate querying immediately after a session is initiated, Snowflake supports specifying a default warehouse for each individual user.
The default warehouse for a user is used as the warehouse for all sessions initiated by the user.

A default warehouse can be specified when creating or modifying the user, either through the web interface or using
[CREATE USER](../sql-reference/sql/create-user.md)/[ALTER USER](../sql-reference/sql/alter-user.md).

### Default warehouse for client utilities/drivers/connectors

In addition to default warehouses for users, any of the Snowflake clients (Snowflake CLI, SnowSQL, JDBC driver, ODBC driver, Python connector, etc.) can
have a default warehouse:

* Snowflake CLI and SnowSQL support both a configuration file and command line options for specifying a default warehouse.
* The drivers and connectors support specifying a default warehouse as a connection parameter when initiating a session.

For more information, see [Applications and tools for connecting to Snowflake](../guides-overview-connecting.md).

### Default warehouse for notebooks

To enhance cost efficiency for notebook workloads, a multi-cluster X-Small warehouse, SYSTEM$STREAMLIT_NOTEBOOK_WH, is automatically
provisioned within each account. This warehouse, featuring a maximum of 10 clusters and a 60-second default timeout, uses improved bin
packing. The ACCOUNTADMIN role has OWNERSHIP privileges.

#### Recommendations for cost management

* Snowflake recommends using the SYSTEM$STREAMLIT_NOTEBOOK_WH warehouse exclusively for notebook workloads.
* To improve bin-packing efficiency and reduce cluster fragmentation, direct SQL queries from Notebook apps to a separate customer-managed query warehouse. Using a single shared warehouse for all Notebook apps in an account further enhances bin-packing efficiency.
* Separating notebook Python workloads from SQL queries minimizes cluster fragmentation. This approach optimizes overall costs by ensuring that notebook Python workloads are not co-located with larger warehouses, which are typically used for query execution.

#### Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | SYSTEM$STREAMLIT_NOTEBOOK_WH | By default, the PUBLIC role has USAGE privileges. ACCOUNTADMIN can grant and revoke USAGE privileges. |
| MONITOR, OPERATE, APPLYBUDGET | SYSTEM$STREAMLIT_NOTEBOOK_WH | Available to the ACCOUNTADMIN and grantable by the ACCOUNTADMIN to other roles. |

### Precedence for warehouse defaults

When a user connects to Snowflake and start a session, Snowflake determines the default warehouse for the session in the following order:

1. Default warehouse for the user,

   » **overridden by…**
2. Default warehouse in the configuration file for the client utility (SnowSQL, JDBC driver, etc.) used to connect to Snowflake (if the
   client supports configuration files),

   » **overridden by…**
3. Default warehouse specified on the client command line or through the driver/connector parameters passed to Snowflake.

> **Note:**
>
> In addition, the default warehouse for a session can be changed at any time by executing the [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md)
> command within the session.

---
title: Parameter management
source: https://docs.snowflake.com/en/user-guide/admin-account-management.md
section: User Guide
---

# Parameter management

Snowflake provides three types of parameters that can be set for your account:

* Account parameters that affect your entire account.
* Session parameters that default to users and their sessions.
* Object parameters that default to objects (warehouses, databases, schemas, and tables).

All parameters have default values, which can be overridden at the account level. To override default values at the account level, you must
be an account administrator (i.e. user granted the ACCOUNTADMIN role).

In addition, the default values for session and object parameters can be overridden at each level in the parameter hierarchy.

## Viewing parameters for your account

To see a list of the parameters and their current values for your account, as well as their default values, use the
[SHOW PARAMETERS](../sql-reference/sql/show-parameters.md) command with the following syntax:

```sqlsyntax
SHOW PARAMETERS [ LIKE '<pattern>' ] IN ACCOUNT
```

For example, to see a complete list of all account-level parameters:

> ```sqlexample
> SHOW PARAMETERS IN ACCOUNT;
>
> +-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | key                                 | value                            | default                          | level   | description                                                                                                                                                                         |
> |-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> | ABORT_DETACHED_QUERY                | false                            | false                            |         | If true, Snowflake will automatically abort queries when it detects that the client has disappeared.                                                                                |
> | AUTOCOMMIT                          | true                             | true                             |         | The autocommit property determines whether is statement should to be implicitly                                                                                                     |
> |                                     |                                  |                                  |         | wrapped within a transaction or not. If autocommit is set to true, then a                                                                                                           |
> |                                     |                                  |                                  |         | statement that requires a transaction is executed within a transaction                                                                                                              |
> |                                     |                                  |                                  |         | implicitly. If autocommit is off then an explicit commit or rollback is required                                                                                                    |
> |                                     |                                  |                                  |         | to close a transaction. The default autocommit value is true.                                                                                                                       |
> | AUTOCOMMIT_API_SUPPORTED            | true                             | true                             |         | Whether autocommit feature is enabled for this client. This parameter is for                                                                                                        |
> |                                     |                                  |                                  |         | Snowflake use only.                                                                                                                                                                 |
> | BINARY_INPUT_FORMAT                 | HEX                              | HEX                              |         | input format for binary                                                                                                                                                             |
> | BINARY_OUTPUT_FORMAT                | HEX                              | HEX                              |         | display format for binary                                                                                                                                                           |
> | CLIENT_ENCRYPTION_KEY_SIZE          | 128                              | 128                              |         | Client-side encryption key size in bits. Either 128 or 256.                                                                                                                         |
> | CLIENT_SESSION_KEEP_ALIVE           | false                            | false                            |         | If true, client session will not expire automatically                                                                                                                               |
> | DATA_RETENTION_TIME_IN_DAYS         | 1                                | 1                                |         | number of days to retain the old version of deleted/updated data                                                                                                                    |
> | DATE_INPUT_FORMAT                   | AUTO                             | AUTO                             |         | input format for date                                                                                                                                                               |
> | DATE_OUTPUT_FORMAT                  | YYYY-MM-DD                       | YYYY-MM-DD                       |         | display format for date                                                                                                                                                             |
> | ERROR_ON_NONDETERMINISTIC_MERGE     | true                             | true                             |         | raise an error when attempting to merge-update a row that joins many rows                                                                                                           |
> | ERROR_ON_NONDETERMINISTIC_UPDATE    | false                            | false                            |         | raise an error when attempting to update a row that joins many rows                                                                                                                 |
> | LOCK_TIMEOUT                        | 43200                            | 43200                            |         | Number of seconds to wait while trying to lock a resource, before timing out                                                                                                        |
> |                                     |                                  |                                  |         | and aborting the statement. A value of 0 turns off lock waiting i.e. the                                                                                                            |
> |                                     |                                  |                                  |         | statement must acquire the lock immediately or abort. If multiple resources                                                                                                         |
> |                                     |                                  |                                  |         | need to be locked by the statement, the timeout applies separately to each                                                                                                          |
> |                                     |                                  |                                  |         | lock attempt.                                                                                                                                                                       |
> | MAX_CONCURRENCY_LEVEL               | 8                                | 8                                |         | Concurrency level for SQL statements (i.e. queries and DML) executed by a warehouse cluster (used to determine when statements are queued or additional clusters are started).      |
> | NETWORK_POLICY                      |                                  |                                  |         | Network policy assigned for the given target.                                                                                                                                       |
> | PERIODIC_DATA_REKEYING              | false                            | false                            |         | If true, Snowflake will re-encrypt data that was encrypted more than a year ago.                                                                                                    |
> | QUERY_TAG                           |                                  |                                  |         | String (up to 2000 characters) used to tag statements executed by the session                                                                                                       |
> | QUOTED_IDENTIFIERS_IGNORE_CASE      | false                            | false                            |         | If true, the case of quoted identifiers is ignored                                                                                                                                  |
> | ROWS_PER_RESULTSET                  | 0                                | 0                                |         | maxium number of rows in a result set                                                                                                                                               |
> | SAML_IDENTITY_PROVIDER              |                                  |                                  |         | Authentication attributes for the SAML Identity provider                                                                                                                            |
> | SSO_LOGIN_PAGE                      | true                             | false                            | ACCOUNT | Enable federated authentication for console login and redirects preview page to console login                                                                                       |
> | STATEMENT_QUEUED_TIMEOUT_IN_SECONDS | 0                                | 0                                |         | Timeout in seconds for queued statements: statements will automatically be canceled if they are queued on a warehouse for longer than this amount of time; disabled if set to zero. |
> | STATEMENT_TIMEOUT_IN_SECONDS        | 0                                | 0                                |         | Timeout in seconds for statements: statements will automatically be canceled if they run for longer than this amount of time; disabled if set to zero.                              |
> | TIMESTAMP_DAY_IS_ALWAYS_24H         | false                            | true                             | SYSTEM  | If set, arithmetic on days always uses 24 hours per day,                                                                                                                            |
> |                                     |                                  |                                  |         | possibly not preserving the time (due to DST changes)                                                                                                                               |
> | TIMESTAMP_INPUT_FORMAT              | AUTO                             | AUTO                             |         | input format for timestamp                                                                                                                                                          |
> | TIMESTAMP_LTZ_OUTPUT_FORMAT         |                                  |                                  |         | Display format for TIMESTAMP_LTZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                 |
> | TIMESTAMP_NTZ_OUTPUT_FORMAT         | YYYY-MM-DD HH24:MI:SS.FF3        | YYYY-MM-DD HH24:MI:SS.FF3        | SYSTEM  | Display format for TIMESTAMP_NTZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                 |
> | TIMESTAMP_OUTPUT_FORMAT             | YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM | YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM | SYSTEM  | Default display format for all timestamp types.                                                                                                                                     |
> | TIMESTAMP_TYPE_MAPPING              | TIMESTAMP_NTZ                    | TIMESTAMP_NTZ                    | SYSTEM  | If TIMESTAMP type is used, what specific TIMESTAMP* type it should map to:                                                                                                          |
> |                                     |                                  |                                  |         |   TIMESTAMP_LTZ (default), TIMESTAMP_NTZ or TIMESTAMP_TZ                                                                                                                            |
> | TIMESTAMP_TZ_OUTPUT_FORMAT          |                                  |                                  |         | Display format for TIMESTAMP_TZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                  |
> | TIMEZONE                            | America/Los_Angeles              | America/Los_Angeles              |         | time zone                                                                                                                                                                           |
> | TIME_INPUT_FORMAT                   | AUTO                             | AUTO                             |         | input format for time                                                                                                                                                               |
> | TIME_OUTPUT_FORMAT                  | HH24:MI:SS                       | HH24:MI:SS                       |         | display format for time                                                                                                                                                             |
> | TRANSACTION_ABORT_ON_ERROR          | false                            | false                            |         | If this parameter is true, and a statement issued within a non-autocommit                                                                                                           |
> |                                     |                                  |                                  |         | transaction returns with an error, then the non-autocommit transaction is                                                                                                           |
> |                                     |                                  |                                  |         | aborted. All statements issued inside that transaction will fail until an                                                                                                           |
> |                                     |                                  |                                  |         | commit or rollback statement is executed to close that transaction.                                                                                                                 |
> | TRANSACTION_DEFAULT_ISOLATION_LEVEL | READ COMMITTED                   | READ COMMITTED                   |         | The default isolation level when starting a starting a transaction, when no                                                                                                         |
> |                                     |                                  |                                  |         | isolation level was specified                                                                                                                                                       |
> | TWO_DIGIT_CENTURY_START             | 1970                             | 1970                             |         | For 2-digit dates, defines a century-start year.                                                                                                                                    |
> |                                     |                                  |                                  |         | For example, when set to 1980:                                                                                                                                                      |
> |                                     |                                  |                                  |         |   - parsing a string '79' will produce 2079                                                                                                                                         |
> |                                     |                                  |                                  |         |   - parsing a string '80' will produce 1980                                                                                                                                         |
> | UNSUPPORTED_DDL_ACTION              | ignore                           | ignore                           |         | The action to take upon encountering an unsupported ddl statement                                                                                                                   |
> | USE_CACHED_RESULT                   | true                             | true                             |         | If enabled, query results can be reused between successive invocations of the same query as long as the original result has not expired                                             |
> +-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> ```

In the output, note the `value` and `level` columns:

* `value` specifies the current value for each parameter.
* `level` specifies where the current value comes from. If the `level` column is empty for a parameter, the parameter is not
  explicitly set and the current value is the default value.

## Altering parameters for your account

To alter a parameter for your account, log into Snowflake as an account administrator and use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md)
command with the following syntax:

```sqlsyntax
ALTER ACCOUNT SET <param> = <value>
```

For example, to set the [DATE_OUTPUT_FORMAT](../sql-reference/parameters.md) session parameter:

> ```sqlexample
> ALTER ACCOUNT SET DATE_OUTPUT_FORMAT = 'DD/MM/YYYY';
>
> SHOW PARAMETERS LIKE 'DATE_OUTPUT%' IN ACCOUNT;
>
> +--------------------+------------+------------+---------+-------------------------+
> | key                | value      | default    | level   | description             |
> |--------------------+------------+------------+---------+-------------------------|
> | DATE_OUTPUT_FORMAT | DD/MM/YYYY | YYYY-MM-DD | ACCOUNT | display format for date |
> +--------------------+------------+------------+---------+-------------------------+
> ```
>
> Note that the `level` column in the output shows the value for the parameter is currently set at the account level.

This specifies that the default display format for all dates in a session will be DD/MM/YYYY instead of YYYY-MM-DD (e.g. 23/04/2016 instead
of 2016-04-23). Note that this date display format is only the default and can be overridden for any individual user or session.

### Resetting parameters

To reset a parameter for your account back to the default, use [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) with the following syntax:

```sqlsyntax
ALTER ACCOUNT UNSET <param>
```

For example, to set the [DATE_OUTPUT_FORMAT](../sql-reference/parameters.md) session parameter back to its default value:

> ```sqlexample
> ALTER ACCOUNT UNSET DATE_OUTPUT_FORMAT;
> ```

---
title: Partner support for Snowflake authentication methods
source: https://docs.snowflake.com/en/user-guide/partner-authentication-support.md
section: User Guide
---

# Partner support for Snowflake authentication methods

Use this topic to determine which Snowflake authentication methods you can use to connect to Snowflake from a partner application.

## Supported authentication for `TYPE = PERSON` users

When the Snowflake user is a human user, the `TYPE` property of the user object is set to `PERSON`. This section details which
Snowflake authentication methods are available to human users when connecting from a partner application. For a description of these
authentication methods, see [Overview of authentication methods for applications](security-authentication-overview.md).

Snowflake recommends configuring your partner application to authenticate with OAuth because it is stronger than other authentication
methods. For help choosing between External OAuth and Snowflake OAuth, see [Choosing authentication for interactive applications](security-authentication-overview.md). A person
authenticates with the OAuth authorization code flow because the user can interact with the authorization server during authentication.

Alternatively, you can use a programmatic access token (PAT) as a replacement for a password when authenticating to Snowflake as long as the
password field accepts 256 characters. However, be aware that PATs aren’t as strong as OAuth.

| Application | External OAuth | Snowflake OAuth | Key pair authentication | Programmatic access token (PAT) |
| --- | --- | --- | --- | --- |
| [PowerBI Cloud (BI)](https://learn.microsoft.com/en-us/fabric/data-factory/connector-snowflake#authentication) | **Yes** (Only Microsoft Entra ID is supported) | No | **Yes** | No |
| [PowerBI Desktop (BI)](https://learn.microsoft.com/en-us/fabric/data-factory/connector-snowflake#authentication) | **Yes** (Only Microsoft Entra ID is supported) | No | **Yes** | No |
| [Tableau Cloud (BI)](https://help.tableau.com/current/pro/desktop/en-us/examples_snowflake.htm) | **Yes** | **Yes** | **Yes** | **Yes** |
| [Tableau Server (BI)](https://help.tableau.com/current/server/en-us/config_oauth_snowflake.htm) | **Yes** | **Yes** | No | No |
| [DBT Cloud (Transform)](https://docs.getdbt.com/docs/cloud/connect-data-platform/connect-snowflake) | No | **Yes** | **Yes** | **Yes** |
| [DBT Core (Transform)](https://docs.getdbt.com/docs/core/connect-data-platform/snowflake-setup) | No | **Yes** | **Yes** | **Yes** |
| [Airflow (Workflow orchestration)](../developer-guide/python-connector/python-connector-connect.md) | N/A | N/A | **Yes** | **Yes** |
| [Qlik Sense Cloud (BI)](https://help.qlik.com/en-US/connectors/Subsystems/ODBC_connector_help/Content/Connectors_ODBC/Snowflake/Create-Snowflake-connection.htm) | **Yes** | **Yes** | **Yes** | **Yes** |
| [Qlik Sense Desktop (BI)](https://help.qlik.com/en-US/connectors/Subsystems/ODBC_connector_help/Content/Connectors_ODBC/Snowflake/Create-Snowflake-connection.htm) | No | No | **Yes** | **Yes** |
| [Fivetran (EL)](https://fivetran.com/docs/destinations/snowflake/setup-guide#optionalkeypairauthentication) | No | No | **Yes** | No |
| [Matillion (ELT)](https://docs.matillion.com/data-productivity-cloud/administration/docs/snowflake-key-pair-authentication/) | No | No | **Yes** | **Yes** |
| [Informatica (ETL)](https://docs.informatica.com/integration-cloud/data-integration-connectors/current-version/snowflake-data-cloud-connector/part-1--getting-started-with-snowflake-data-cloud-connector/connections-for-snowflake-data-cloud/connect-to-snowflake/authentication-typesdwsnowflakev2conn-authentication.html) | No | **Yes** | **Yes** | **Yes** |
| [ThoughtSpot (BI - interactive)](https://docs.thoughtspot.com/software/10.1.0.sw/connections-snowflake-add) | **Yes** | **Yes** | **Yes** | No |
| Strategy Cloud (BI) | **Yes** | No | **Yes** | **Yes** |
| Strategy Workstation/Developer (BI) | **Yes** | No | No | **Yes** |

## Supported authentication for `TYPE = SERVICE` users

When a service — for example, an application or workflow — is authenticating to Snowflake, the `TYPE` property of the user object is set to `SERVICE`. This section details which Snowflake authentication methods are available when connecting from a partner application as a service. For a description of these authentication methods, see [Overview of authentication methods for applications](security-authentication-overview.md).

Snowflake recommends configuring your partner application to authenticate with OAuth, because it is stronger than other available authentication methods. A service authenticates using the OAuth client credentials flow, because there isn’t a person to interact with the authorization server.

Alternatively, you can use a programmatic access token (PAT) as a replacement for a password when authenticating to Snowflake as long as the
password field accepts 256 characters. However, be aware that PATs aren’t as strong as OAuth.

| Application | External OAuth | Key pair authentication | Programmatic access token (PAT) |
| --- | --- | --- | --- |
| [PowerBI Cloud (BI)](https://learn.microsoft.com/en-us/fabric/data-factory/connector-snowflake#authentication) | No | **Yes** | No |
| [PowerBI Desktop (BI)](https://learn.microsoft.com/en-us/fabric/data-factory/connector-snowflake#authentication) | No | **Yes** | No |
| [Tableau Cloud (BI)](https://help.tableau.com/current/pro/desktop/en-us/examples_snowflake.htm) | No | **Yes** | **Yes** |
| [Tableau Server (BI)](https://help.tableau.com/current/server/en-us/config_oauth_snowflake.htm) | No | No | No |
| [DBT Cloud (Transform)](https://docs.getdbt.com/docs/cloud/connect-data-platform/connect-snowflake) | No | **Yes** | **Yes** |
| [DBT Core (Transform)](https://docs.getdbt.com/docs/core/connect-data-platform/snowflake-setup) | No | **Yes** | **Yes** |
| [Airflow (Workflow orchestration)](../developer-guide/python-connector/python-connector-connect.md) | **Yes** | **Yes** | **Yes** |
| [Qlik Sense Cloud (BI)](https://help.qlik.com/en-US/connectors/Subsystems/ODBC_connector_help/Content/Connectors_ODBC/Snowflake/Create-Snowflake-connection.htm) | No | **Yes** | **Yes** |
| [Qlik Sense Desktop (BI)](https://help.qlik.com/en-US/connectors/Subsystems/ODBC_connector_help/Content/Connectors_ODBC/Snowflake/Create-Snowflake-connection.htm) | No | **Yes** | **Yes** |
| [Fivetran (EL)](https://fivetran.com/docs/destinations/snowflake/setup-guide#optionalkeypairauthentication) | No | **Yes** | No |
| [Matillion (ELT)](https://docs.matillion.com/data-productivity-cloud/administration/docs/snowflake-key-pair-authentication/) | No | **Yes** | **Yes** |
| [Informatica (ETL)](https://docs.informatica.com/integration-cloud/data-integration-connectors/current-version/snowflake-data-cloud-connector/part-1--getting-started-with-snowflake-data-cloud-connector/connections-for-snowflake-data-cloud/connect-to-snowflake/authentication-typesdwsnowflakev2conn-authentication.html) | **Yes** | **Yes** | **Yes** |
| [ThoughtSpot (BI - interactive)](https://docs.thoughtspot.com/software/10.1.0.sw/connections-snowflake-add) | **Yes** | **Yes** | **No** |
| Strategy Cloud (BI) | No | **Yes** | No |
| Strategy Workstation/Developer (BI) | No | No | No |

---
title: PCI DSS
source: https://docs.snowflake.com/en/user-guide/cert-pci-dss.md
section: User Guide
---

# PCI DSS

This topic describes how Snowflake supports customers with PCI-DSS compliance requirements.

## Understanding PCI DSS compliance requirements

The Payment Card Industry Data Security Standards are a set of requirements prescribed by the Payment Card Industry Security Standards
Council. Snowflake is a Level 1 Service Provider compliant under PCI DSS version 3.2.1 and undergoes a third party assessment from a QSA
(Qualified Security Assessor) on an annual basis. The AoC (Attestation of Compliance) is available upon request. Snowflake’s PCI DSS
compliance allows customers to store, process, or transmit cardholder data utilizing the Snowflake Service. However, there are PCI
compliance responsibilities that fall to customers outside of those managed by Snowflake, for a breakdown of these responsibilities
customers can request the Snowflake PCI Shared Responsibility Matrix.

---
title: Performance testing
source: https://docs.snowflake.com/en/user-guide/tables-hybrid-test.md
section: User Guide
---

# Performance testing

This topic provides information for testing [hybrid tables](tables-hybrid.md) in Snowflake. When
evaluating hybrid tables for the first time in your environment, you will likely want to do some basic performance
testing. This section refers to the
[getting started with hybrid tables tutorial](tutorials/getting-started-with-hybrid-tables-tutorial.md). If
you have not completed that tutorial, now is a good time to do so.

> **Attention:**
>
> Performance statistics reported in Snowsight are not indicative of query performance for driver-based workloads.

## Understand your use case

Testing for the outcome you are looking for is very important. Understanding how hybrid tables will augment
your architecture is important when designing your tests.

Design your test scenario:

* Do you require a high volume of UPDATE, INSERT, or DELETE statements?
* Does your application need fast access to indexed data?
* Do you have batch jobs you would like to run more often without impacting SELECT performance?
* What do you want to measure during the test?

## Select a test framework

Performance testing frameworks are ubiquitous in software development. Most customers have testing frameworks that
are already in place and can be used to test hybrid tables. Regardless of the test framework you select,
it needs to be able to:

* Authenticate with Snowflake using shared key authentication
* Support multi-threaded query execution
* Issue queries as prepared statements, binding variables as needed
* Create a mix of INSERT, UPDATE, DELETE, and SELECT queries

Ideally, your framework will track query execution time for each request in each thread to calculate:

* Total query throughput
* Min, max, average, and standard deviation of response time
* Total bytes received per query

## Execute the test

The hybrid tables query optimizer takes some time to “warm up” and establish a steady-state latency. This
warm-up period can vary based on the amount of data, the number of indexes, and the complexity of the query.
For most test cases, a warm-up period of 1-2 minutes is sufficient. Longer warm-up periods may be required.

> **Tip:**
>
> The warm-up period ends when the throughput and latency curves converge to a steady state.

This is a typical performance test result for random queries on a single hybrid table. Note that the
performance improves over time and achieves a steady state after a few seconds:

> **Note:**
>
> The time to achieve steady-state response times varies depending on many factors and can take
> several minutes.

---
title: Personal Databases
source: https://docs.snowflake.com/en/user-guide/personal-databases.md
section: User Guide
---

# Personal Databases

## What is a Personal Database?

A Personal Database (PDB) is a system-owned, user-managed database instance that is automatically provisioned by Snowflake. It serves as a
dedicated, personal storage location where users can create, organize, and manage their own database objects.

Automatic provisioning removes the administrative requirement for users to manually select or request access to a shared database, which
ensures a dedicated development environment. When a user is dropped from the system, their associated PDB and all its objects are
automatically transferred to ACCOUNTADMIN ownership.

### Advantages of a PDB

* **Organize personal projects:** Users can organize their own projects in an isolated environment, reducing clutter and potential naming
  conflicts in shared databases.
* **Easy administrator governance:** All file-related developments are fully governed by RBAC.

## PDB object types

Currently, PDBs support two primary object types that provide a dedicated development environment for the user: workspaces and notebooks.

### Workspaces

The PDB is created when a user first interacts with the [Workspaces UI](ui-snowsight/workspaces.md).
Workspaces are file-based entities and require storage within a Snowflake database.

### Notebooks

PDBs support managed compute services for [Snowflake notebooks](ui-snowsight/notebooks.md). To enable code execution, a
Snowflake-managed service object is automatically created within the PDB. This ensures that the notebook’s execution context is bound to all
the roles and permissions the user already possesses. This object connects the workspace to a Snowpark Container Services (SPCS) compute pool, allowing developers
to execute their Snowflake Notebooks code.

> **Important:**
>
> The user must have the USAGE privilege on the associated compute pool before they can create a service to execute code. This privilege can be granted by any role with the
> MANAGE GRANTS privilege.

## Security

The PDB’s architecture is intentionally streamlined and adheres to the principle of least privilege, which ensures that all operations are strictly
limited to the user’s existing security context:

* **No new data access:** PDBs do not introduce any new or expanded access to data or any additional ability to share data. **Users can’t
  move data from a regular database to a PDB**.
* **Permissions context:** Any SQL queries executed within a workspace are run with the exact same set of roles and permissions that the user
  already possesses. This mirrors the execution environment of a standard Snowflake workspace file.

> **Note:**
>
> Personal databases also support personal secrets. [Secret objects](../sql-reference/sql/create-secret.md) are owned exclusively by the user. This ensures, by default, that the secret
> remains private, is accessible only to the user, and is not shared unintentionally.

## PDB management and visibility

Administrators can monitor and control usage of PDBs, which are owned by the system, not by any role. Usage on a PDB is limited
to the user it is assigned to. Objects inside a PDB cannot be shared.

### Administrator visibility

Roles with the MANAGE GRANTS privilege have visibility into all objects within the account, including personal objects owned by individual
users. For example, roles like ACCOUNTADMIN can view all databases, including personal databases, by default. These roles can also access
details about schemas and their objects within personal databases.

* To view details for all personal databases within an account, query the [DATABASES Account Usage view](../sql-reference/account-usage/databases.md):

  > ```sqlexample
  > SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.DATABASES
  > WHERE DATABASE_NAME LIKE 'USER$%';
  > ```
* To view the workspaces that exist in a specific personal database, use the following code:

  > ```sqlexample
  > SHOW WORKSPACES IN DATABASE "USER$CMEYER";
  > ```
* To view a specific user’s personal database, use the following code:

  > ```sqlexample
  > SHOW DATABASES LIKE 'USER$BOBR';
  > ```
  >
  > For personal databases, the value in the `kind` column is `PERSONAL DATABASE`.
* To view objects in a specific personal database, use the following code:

  > ```sqlsyntax
  > SHOW OBJECTS IN DATABASE "USER$<username>";
  > ```

### Drop a workspace

* To drop a workspace in a personal database, use the following code:

  > ```sqlexample
  > DROP WORKSPACE "USER$JSMITH_DROP_WS_TEST".PUBLIC."drop_this_ws";
  > ```

## Cost considerations

* Users cannot store data in tables in their PDBs.
* Storage costs reflect only the size of the workspace files and associated metadata.

## Limitations

Administrators cannot perform the following tasks:

* View filenames or file contents that belong to other users.
* View how much storage is used for PDBs. PDBs do not appear in `DATABASE_STORAGE_USAGE_HISTORY`.
* Limit how much storage is used for each PDB.
* Drop PDBs, or prevent individual users from using them.
* Create new PDBs. New PDBs are created on demand when a user creates a workspace.

---
title: Pinning private connectivity endpoints for inbound traffic
source: https://docs.snowflake.com/en/user-guide/pin-private-endpoints.md
section: User Guide
---

# Pinning private connectivity endpoints for inbound traffic

For Snowflake accounts on Amazon Web Services (AWS) and Microsoft Azure (Azure), you can pin (specify, register, and map) private connectivity endpoints to your
account. By pinning private endpoints to your account, Snowflake ensures that the inbound traffic originating from the pinned endpoints
only goes to the account that pinned them. Snowflake recommends using pinned endpoints, network policies, and network rules to harden your
security posture by reducing the network attack surface to your Snowflake account.

> **Tip:**
>
> Pinning allows only authorized private endpoint(s) to be used to send traffic from the customer network to a specific
> Snowflake account. If you want to restrict inbound access to Snowflake accounts from specific lists of IPs and VPCE IDs/LinkIDs, use
> [network policies](network-policies.md) and [network rules](network-rules.md).

Snowflake enforces a private endpoint pinning check at the point of ingress for every request received over private connectivity.
This check compares two key pieces of information:

* The endpoint ID provided in the request header.
* The account that pinned the endpoint, as recorded in Snowflake’s metadata.

If these match — in other words, if the request originates from the account that registered the endpoint — then Snowflake allows the
connection. Otherwise, Snowflake blocks the connection.

For example:

| Pinned private endpoint | Snowflake account that pinned private endpoint | Request’s target Snowflake account | Snowflake pinning check decision |
| --- | --- | --- | --- |
| PE1 | A1 | A1 | ALLOW |
| PE1 | A1 | A2 | DENY |
| PE2 | A2 | A1 | DENY |
| PE2 | A2 | A2 | ALLOW |

## Prerequisites

Before pinning a private endpoint, you must:

* Configure a private link for your Snowflake account on AWS or Azure.
* Limit the scope of the access token you use to register an endpoint with your Snowflake account.

For more information about configuring private links, see [AWS PrivateLink](admin-security-privatelink.md) or
[Azure Private Link](privatelink-azure.md).

> **Important:**
>
> Before you pin a private endpoint, when [Configuring private connectivity for Snowsight](ui-snowsight-gs.md), ensure that the endpoint uses a *regionless*
> Snowsight privatelink URL for all your accounts. A regional Snowsight privatelink URL will not connect to a pinned private endpoint.

## Manage enforcement with the delay time argument

After configuring your private links, you call the [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md) system function to
register a private connectivity endpoint with your Snowflake account. In that function call, you can optionally specify a delay time.
The delay time is the number of minutes to wait before enforcing the private endpoint registration. The delay time value helps prevent you
from accidentally blocking yourself from accessing Snowflake when you register a new private endpoint. The maximum delay time is 1440
minutes (24 hours) and the default value is 60 minutes.

The private endpoint that you register for your Snowflake account can also be registered for other Snowflake accounts. For example, you
might have three Snowflake accounts and you want to ensure that the connection to each Snowflake account only goes through one registered
private endpoint. By setting the delay time argument to 60 minutes, you allow for sufficient time to register the private connectivity
endpoint with each Snowflake account.

However, when you register a private connectivity endpoint and specify a delay time, you must be mindful of the local timestamp of
the first account in which you call the system function. The enforcement time is based on the local timestamp of the first account
when you call the system function plus any delay time that you specify, relative to a specific private connectivity endpoint.

For example, consider pinning a single private connectivity endpoint with three accounts in the same time zone:

* If you call the system function in `account1` at 10:00 AM and specify a delay time of 60 minutes, the enforcement time is 11:00 AM.
* If you call the system function in `account2` at 10:30 AM, the enforcement time is 11:00 AM.
* If you call the system function in `account3` at 11:01 AM, the enforcement time is immediate (now).

> **Tip:**
>
> Store the timestamp of when you register the private endpoint in the first account. Maintain a record of the accounts that are pinned
> to a particular private endpoint.
>
> If you anticipate registering multiple accounts and a delay time of 1440 minutes is not enough time, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Managing access token scope on Microsoft Azure

Before pinning a private endpoint to your Snowflake account on Azure, you must limit the scope of the access token that you pass into the
[SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md) system function. Requiring the caller to scope the access token to the
private endpoint helps Snowflake authorize the caller’s access to the endpoint. This means that the token is only valid for the private
endpoint and the Snowflake account where you call the system function.

> **Important:**
>
> Do not use the token used in the [SYSTEM$AUTHORIZE_PRIVATELINK](../sql-reference/functions/system_authorize_privatelink.md) system function.
> The following steps generate a token unique to [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md).

To limit the scope of the access token for your private endpoint on Azure, do the following steps in your Microsoft Azure account:

1. [Create](https://learn.microsoft.com/en-us/cli/azure/role/definition?view=azure-cli-latest#az-role-definition-create) a subscription
   custom role definition for a role called `snowflake-pep-role`, and replace the `subscription_id` placeholder with the ID
   of your subscription.

   ```bash
   az role definition create --role-definition '{"Name":"snowflake-pep-role","Description":
   "To generate advanced proof of access token for Snowflake private endpoint pinning","Actions":
   ["Microsoft.Network/privateEndpoints/read"],"AssignableScopes":["/subscriptions/<subscription_id>"]}'
   ```

   The subscription ID must match the subscription where the private endpoint exists. You only need to create the role definition once for
   your subscription.
2. Create the role assignment and
   [assign](https://learn.microsoft.com/en-us/cli/azure/role/assignment?view=azure-cli-latest#az-role-assignment-create)
   the `snowflake-pep-role` role and private endpoint scope to a user (or a group).
   Replace the placeholders for the `user` and the `private_endpoint_resource_id`.

   ```bash
   az role assignment create --assignee <user> --role snowflake-pep-role --scope <private_endpoint_resource_id>
   ```
3. Generate the [access token](https://learn.microsoft.com/en-us/cli/azure/account?view=azure-cli-latest#az-account-get-access-token) to
   use with the [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md) system function. Replace the `subscription_id`
   placeholder with the ID of your subscription.

   ```bash
   az account get-access-token --subscription <subscription_id>
   ```

## Managing access token scope on Amazon Web Services

Before pinning a private endpoint to your Snowflake account on AWS, you must limit the scope of the access token that you pass into the
[SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md) system function. Requiring the caller to scope the access token to the
private endpoint helps Snowflake authorize the caller’s access to the endpoint. This means that the token is only valid for the private
endpoint and the Snowflake account where you call the system function.

> **Important:**
>
> Do not use the token used in the [SYSTEM$AUTHORIZE_PRIVATELINK](../sql-reference/functions/system_authorize_privatelink.md) system function. The following steps
> generate a token unique to [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md).

To limit the scope of the access token for your private endpoint on AWS, generate a federated token, as shown in the following example:

```bash
aws sts get-federation-token --name snowflake --policy
'{ "Version": "2012-10-17", "Statement":
  [ {
  "Effect": "Allow", "Action": ["ec2:DescribeVpcEndpoints"],
  "Resource": ["*"] }
  ] }'
```

## Example

As a representative example, register an endpoint to route your connection to the Snowflake service.

1. Configure [AWS PrivateLink](admin-security-privatelink.md) or
   [Azure Private Link](privatelink-azure.md) for your Snowflake account. If you already have this service configured,
   skip to the next step.
2. Log in to Snowflake by using the public internet, and use the URL that doesn’t contain a `privatelink` segment in the URL.
3. Call the [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md) system function to register the VPC endpoint with your
   Snowflake account. The `token` arguments contain truncated values and the delay time unit is minutes:

   **AWS**

   ```sqlexample
   SELECT SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
     'vpce-0c1...',
     '123.....',
     '{
       "Credentials": {
         "AccessKeyId": "ASI...",
         "SecretAccessKey": "alD...",
         "SessionToken": "IQo...",
         "Expiration": "2024-12-10T08:20:20+00:00"
       },
       "FederatedUser": {
         "FederatedUserId": "0123...:snowflake",
         "Arn": "arn:aws:sts::174...:federated-user/snowflake"
       },
       "PackedPolicySize": 9,
       }',
     120
     );
   ```

   **Azure**

   ```sqlexample
   SELECT SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
     '123....',
     '/subscriptions/0cc51670-.../resourceGroups/dbsec_test_rg/providers/Microsoft.Network/
     privateEndpoints/...',
     'eyJ...',
     120
   );
   ```
4. To confirm the private connectivity endpoint mapping, call the
   [SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS](../sql-reference/functions/system_get_privatelink_endpoint_registrations.md) system function.

You can unregister the private connectivity endpoint from your Snowflake account by calling the
[SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_unregister_privatelink_endpoint.md) system function.

> **Important:**
>
> If you register a VPC endpoint or private endpoint in Snowflake and delete the endpoint in your VPC or VNet, you must call the
> [SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_unregister_privatelink_endpoint.md) system function in your Snowflake account to unregister the
> endpoint. Otherwise, your connection to the Snowflake Service can’t use private connectivity. It uses the public internet.

---
title: Planning a data load
source: https://docs.snowflake.com/en/user-guide/data-load-considerations-plan.md
section: User Guide
---

# Planning a data load

This topic provides best practices, general guidelines, and important considerations for planning a data load.

## Dedicating separate warehouses to load and query operations

Loading large data sets can affect query performance. We recommend dedicating separate warehouses for loading and querying operations to optimize performance for each.

The number of data files that can be processed in parallel is determined by the amount of compute resources in a warehouse. If you follow the file sizing guidelines described in [Preparing your data files](data-load-considerations-prepare.md), a data load requires minimal resources. Splitting larger data files allows the load to scale linearly. Unless you are bulk loading a large number of files concurrently (i.e. hundreds or thousands of files), a smaller warehouse (Small, Medium, Large) is generally sufficient. Using a larger warehouse (X-Large, 2X-Large, etc.) will consume more credits and may not result in any performance increase.

---
title: Planning for the deprecation of single-factor password sign-ins
source: https://docs.snowflake.com/en/user-guide/security-mfa-rollout.md
section: User Guide
---

# Planning for the deprecation of single-factor password sign-ins

To improve the security posture of all of its customers, Snowflake is rolling out changes to require multi-factor authentication (MFA) for
all human users using passwords, and disallow passwords for all service users. These service users must switch to a stronger authentication
method that doesn’t require interaction with a person. This topic describes how single-factor passwords will be deprecated so you can plan
accordingly.

> **Important:**
>
> Snowflake provides a tool that guides you through the process of implementing strong authentication for all users, so you are ready for
> the deprecation of single-factor passwords. For more information, see [Strong Authentication Hub](strong-authentication-hub.md).

The phases described in this topic don’t apply to reader accounts, trial accounts, or Snowflake Postgres. You can continue to sign in to
these types of accounts with a single-factor password.

## Human users vs. service users

User objects in Snowflake don’t always correspond to human users. There are users who sign in to Snowflake without human interaction — for
example, an application or service. These users are considered *service users*.

Administrators use the `TYPE` parameter of a user object to define whether a user is a human user or a service user.

* For human users, `TYPE=PERSON`. If you don’t set the `TYPE` parameter or set it to NULL, the user is treated as a human user.
* For service users, `TYPE=SERVICE`.

  > **Note:**
  >
  > The `LEGACY_SERVICE` user type helps customers transition service users to using a secure form of authentication. Setting
  > a user’s type to `LEGACY_SERVICE` temporarily allows the user to authenticate with a password even though it’s an application or
  > service. The rollout described in this topic involves the gradual deprecation of this user type.

The distinction between a human user and a service user is important because this rollout affects these two types of users differently.
To harden the security posture for both types of users, the enforcement of strong authentication consists of the following:

* All *human users* who use password authentication will be required to use a second factor of authentication.
* All *legacy service users* who currently use password authentication will be required to migrate to a more secure authentication method.

## Enforcement timeline

The following table provides the timeline for the enforcement of strong authentication methods.

| Estimated date | Affected users | Phase |
| --- | --- | --- |
| Sep. 2025 - Jan. 2026 | * Human users | Mandatory MFA for all Snowsight users |
| May 2026 - Jul. 2026 | * Human users * Legacy service users | Strong authentication for NEW users |
| Aug. 2026 - Oct. 2026 | * Human users * Legacy service users | Strong authentication for ALL users |

To learn how to implement strong authentication to meet these deadline, see [Strong Authentication Hub](strong-authentication-hub.md).

### Phase 1: Mandatory MFA for all *Snowsight* users (new and existing)

Phase 1 is implemented using Snowflake’s established behavior change release process. In this process, Snowflake releases a
*behavior change bundle* each month. Because changes will be included in a behavior change bundle, enforcement of the new restrictions
coincide with the lifecycle of the bundle.

For more information about the lifecycle of behavior change bundles so you can plan for the enforcement of this phase, see
[Behavior change policy](../release-notes/behavior-change-policy.md).

**2025_06 bundle (September 2025 - January 2026)** [1]

| Objective | New behavior |
| --- | --- |
| Mandatory MFA for all Snowsight users | Human users must authenticate with a second factor when using a password to access Snowsight, with no exceptions.  Keep in mind the following:   * This phase affects Snowsight only. Human users can continue to use a single-factor password to access the Snowflake   service from business intelligence (BI) and similar tools, even after they use Snowsight to enroll in MFA. You can choose to   enforce MFA for these other tools; users who are already enrolled in MFA and use MFA outside Snowsight will continue to use   MFA. * Authentication policies that implemented optional MFA enrollment for Snowsight are overridden. * Because users authenticating with Snowflake OAuth use the Snowsight login interface, they must be enrolled in MFA. * Single sign-on users are not impacted by this change and can continue to access Snowsight with no changes. * Legacy service users (`TYPE=LEGACY_SERVICE`) are not impacted by this change and can continue to access   Snowsight with a single-factor password. |

For detailed information about how the changes in this bundle affect password and SSO authentication for your users, see [Upcoming Multi-Factor Authentication (MFA) enforcement for Snowsight logins with single-factor passwords](https://community.snowflake.com/s/article/Upcoming-MFA-enforcement-for-Snowsight-logins) (Knowledge Base article).

[1]

These dates are estimated, and are dependent on the release and lifecycle of the bundle. To understand this lifecycle, see [Monthly behavior change bundles](../release-notes/behavior-change-policy.md).

### Phase 2: Strong authentication for *new* users

Phase 2 will be enforced in accounts on a rolling basis during a three-month period. You’ll receive a notification with the enforcement
date for your account.

**May 2026 - July 2026** [2]

| Objective | New behavior |
| --- | --- |
| Mandatory MFA for all new human users | All human users that are created *after* this phase is enforced must use a second factor when authenticating with a password, including those using BI tools or similar.  Human users who existed *before* the phase is enforced are not affected. These password users can continue to use BI tools or similar (anything but Snowsight) without a second factor of authentication until the next phase.  For example, suppose this phase is enforced on May 15, 2026. All human users created on or after this date must use a second factor of authentication regardless of the surface. Human users who existed before this date can continue to use password-only authentication for BI tools, but not Snowsight. |
| No new legacy service users | All non-human users created after the phase is enforced must be of type `SERVICE`, which prevents them from using a password. The `LEGACY_SERVICE` type is no longer available when creating a new user object. In addition, administrators cannot change the type of an existing user to `LEGACY_SERVICE`.  For example, suppose this phase is enforced on May 15, 2026. After this date, `TYPE=LEGACY_SERVICE` is an invalid option when executing a CREATE USER or ALTER USER command. |

[2]

These dates don’t correspond to a behavior change bundle, but are subject to change.

### Phase 3: Strong authentication for all users

Phase 3 will be enforced in accounts on a rolling basis during a three-month period. You’ll receive a notification with the enforcement
date for your account.

**August 2026 - October 2026** [3]

| Objective | New behavior |
| --- | --- |
| Mandatory MFA for all human users | When this phase is enforced, all new and existing human users must use a second factor when authenticating with a password, with no exceptions. |
| No legacy service users | When this phase is enforced, all non-human users are blocked from using a password to authenticate.  The `LEGACY_SERVICE` user type is fully deprecated. All existing user objects with `TYPE=LEGACY_SERVICE` are migrated to `TYPE=SERVICE`, which prevents them from using a password. |

To learn how to implement strong authentication to meet the requirements of this phase, see [Strong Authentication Hub](strong-authentication-hub.md).

[3]

These dates don’t correspond to a behavior change bundle, but are subject to change.

---
title: Power BI SSO to Snowflake
source: https://docs.snowflake.com/en/user-guide/oauth-powerbi.md
section: User Guide
---

# Power BI SSO to Snowflake

This topic describes how to use Microsoft Power BI to instantiate a Snowflake session and access Snowflake using single sign-on (SSO).

## Overview

Snowflake allows Microsoft Power BI users to connect to Snowflake using Identity Provider credentials and an OAuth 2.0 implementation to
provide an SSO experience to access Snowflake data.

This feature eliminates the need for on-premises Power BI Gateway implementations since the Power BI service uses an embedded Snowflake
driver to connect to Snowflake.

### General workflow

The following diagram summarizes the authorization flow to instantiate a Snowflake session from Power BI:

1. The user logs into the Power BI service using Microsoft Entra ID.
2. Optionally, Microsoft Entra ID can verify the user through an IdP via SAML. Currently, Microsoft only supports Microsoft Entra ID as the IdP for Power BI SSO.
3. When the user connects to Snowflake, the Power BI service asks Microsoft Entra ID to give it a token for Snowflake.
4. The Power BI service uses the embedded Snowflake driver to send the Microsoft Entra ID token to Snowflake as part of the connection string.
5. Snowflake validates the token, extracts the username from the token, maps it to the Snowflake user, and creates a Snowflake session for
   the Power BI service using the user’s default role.

## Prerequisites

For your Snowflake account, please verify the following before using the Power BI SSO feature:

* In Snowflake, if you’re using [Controlling network traffic with network policies](network-policies.md), you should allow the [Azure IP range](https://www.microsoft.com/en-us/download/details.aspx?id=56519) that includes the Azure region where your Snowflake account is hosted
  and any additional Azure regions as necessary.

  > **Important:**
  >
  > To create a network policy that is specific to Power BI for the Azure [region](intro-regions.md) where your Snowflake on
  > Azure account is located, search the JSON download from Microsoft for your region.
  >
  > For example, if your Snowflake on Azure account is located in the Canada Central region, search the JSON download for
  > `PowerBI.CanadaCentral`. Select the IP address ranges from the `addressPrefixes` list. Use these IP address ranges to
  > create or update a network policy in Snowflake.
  >
  > If the `addressPrefixes` list is empty, please contact Microsoft to request an update.
  >
  > If you are using multiple Microsoft Azure services (e.g. Power BI, SCIM), contact your Azure administrator to verify the correct IP
  > address ranges to ensure the Snowflake network policy contains the correct IP address ranges to allow users to access Snowflake.
* Either the `login_name`, `name`, or the `email` attribute for the user in Snowflake must map to the Microsoft Entra ID
  `upn` attribute. If the `login_name` attribute is not defined, then the process defaults to the `name` attribute.

## Considerations

With the Power BI gateway:
:   Private connectivity to the Snowflake service is supported. If it is necessary to use any of these two services to connect to Snowflake,
    use the on-premises gateway to connect.

Without the Power BI gateway:
:   Private connectivity to the Snowflake service is not supported. For the Power BI Service and Power BI Desktop, create a network policy to
    allow the Microsoft Entra ID public IP address ranges. Note that network policies have a 100,000 character limit for the allowed IP
    addresses.

Tokens and Keys:
:   Snowflake tries to verify Microsoft Entra ID through the URL value in the `external_oauth_jws_keys_url` property (shown below)
    or through the allowed IP addresses in the network policy, if the network policy exists. Microsoft updates its tokens and keys every 24
    hours. For more information on the Microsoft updates, see
    [Overview of tokens in Microsoft Entra ID B2C](https://docs.microsoft.com/en-us/azure/active-directory-b2c/tokens-overview).

Setting allowed roles:
:   By default, the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN system roles are blocked from using Microsoft Power BI to instantiate a
    Snowflake session. If it is necessary to use these highly privileged roles, update the `EXTERNAL_OAUTH_ALLOWED_ROLES` security
    integration parameter to specify these roles. Exercise caution before specifying the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN system
    roles in the `EXTERNAL_OAUTH_ALLOWED_ROLES` security integration parameter.

    For more information, see [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md) and [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-external.md).

## Getting started

This section explains how to create a Power BI security integration in Snowflake and how to access Snowflake through Power BI.

### Creating a Power BI security integration

> **Note:**
>
> This step is not required if you are using the Power BI gateway for Power BI service to connect to Snowflake or are using your Snowflake
> username and password for authentication.

To use Power BI to access Snowflake data through SSO, it is necessary to create a security integration for Power BI using
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md) as shown below.

The security integration must have the correct value for the `external_oauth_issuer` parameter. Part of this value maps to your
Microsoft Entra tenant. You can find this value in the About section of your Power BI tenant.

If your organization has an advanced deployment of the Power BI service, then check with your Microsoft Entra ID administrator to get the correct
value of the Microsoft Entra tenant to use in constructing the Issuer URL.

For example, if your Microsoft Entra tenant ID is `a828b821-f44f-4698-85b2-3c6749302698`, then construct the `AZURE_AD_ISSUER` value
similar to `https://sts.windows.net/a828b821-f44f-4698-85b2-3c6749302698/`. It is important to include the forward slash (i.e.
`/`) at the end of the value.

After constructing the value for `AZURE_AD_ISSUER`, execute the [CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-external.md) command.

If your Snowflake account or Microsoft Power BI service is in the Microsoft Azure Government cloud [region](intro-regions.md), set
the `external_oauth_audience_list` property value to `https://analysis.usgovcloudapi.net/powerbi/connector/Snowflake`.

**Security integration for Microsoft Power BI**

> ```sqlexample
> create security integration powerbi
>     type = external_oauth
>     enabled = true
>     external_oauth_type = azure
>     external_oauth_issuer = '<AZURE_AD_ISSUER>'
>     external_oauth_jws_keys_url = 'https://login.windows.net/common/discovery/keys'
>     external_oauth_audience_list = ('https://analysis.windows.net/powerbi/connector/Snowflake', 'https://analysis.windows.net/powerbi/connector/snowflake')
>     external_oauth_token_user_mapping_claim = 'upn'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name'
> ```

**Microsoft Azure Government security integration for Microsoft Power BI**

> ```sqlexample
> create security integration powerbi_mag
>     type = external_oauth
>     enabled = true
>     external_oauth_type = azure
>     external_oauth_issuer = '<AZURE_AD_ISSUER>'
>     external_oauth_jws_keys_url = 'https://login.windows.net/common/discovery/keys'
>     external_oauth_audience_list = ('https://analysis.usgovcloudapi.net/powerbi/connector/Snowflake', 'https://analysis.usgovcloudapi.net/powerbi/connector/snowflake')
>     external_oauth_token_user_mapping_claim = 'upn'
>     external_oauth_snowflake_user_mapping_attribute = 'login_name'
> ```

> **Important:**
>
> Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute
> this SQL command.
>
> The security integration parameter values are case-sensitive, and the values you put into the security integration must match those
> values in your environment. If the case does not match, it is possible that the access token will not be validated, resulting in a failed authentication attempt.
>
> The list values that you specify for the `EXTERNAL_OAUTH_AUDIENCE_LIST` property are URLs with an uppercase and lowercase Snowflake
> name. Include both URLs in this list to ensure that your client can connect to Snowflake based on the values that Microsoft might expect
> to form a connection.
>
> Verify that all parameter values are an exact match. For example, if the `<AZURE_AD_ISSUER>` URL value does not end with a
> backslash and the security integration is created with a backslash character at the end of the URL, an error message will occur. It would
> then be necessary to drop the security integration object (using DROP INTEGRATION) and then create the object again with the correct URL
> value (using CREATE SECURITY INTEGRATION).
>
> In your environment, if the user’s `UPN`
> [attribute value](https://docs.microsoft.com/en-us/azure/active-directory/hybrid/plan-connect-userprincipalname#what-is-userprincipalname)
> matches the user’s email field instead of the `login_name` in Snowflake, then replace `login_name` with
> `email_address`. For example:
>
> ```sqlexample
> create security integration powerbi
>     type = external_oauth
>     ...
>     external_oauth_snowflake_user_mapping_attribute = 'email_address';
> ```

### Using Power BI SSO with B2B guest users

To allow Microsoft Entra ID business to business (i.e. B2B) guest users to access Snowflake using SSO from Microsoft Power BI, set the
`EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM` property value to `'unique_name'`. For example:

> ```sqlexample
> create security integration powerbi
>   type = external_oauth
>   enabled = true
>   external_oauth_type = azure
>   external_oauth_issuer = '<AZURE_AD_ISSUER>'
>   external_oauth_jws_keys_url = 'https://login.windows.net/common/discovery/keys'
>   external_oauth_audience_list = ('https://analysis.windows.net/powerbi/connector/Snowflake', 'https://analysis.windows.net/powerbi/connector/snowflake')
>   external_oauth_token_user_mapping_claim = 'unique_name'
>   external_oauth_snowflake_user_mapping_attribute = 'login_name';
> ```

For more information, see [Understand the B2B User](https://docs.microsoft.com/en-us/azure/active-directory/external-identities/user-properties).

### Modifying Your External OAuth Security Integration

You can update your External OAuth security integration by executing an ALTER statement on the security integration.

For more information, see [ALTER SECURITY INTEGRATION (External OAuth)](../sql-reference/sql/alter-security-integration-oauth-external.md).

### Using secondary roles with Power BI SSO to Snowflake

The desired scope for the primary role is passed in the external token. This role is a specific role that was granted to the user (`session:role:<role_name>`).

By default, the default [secondary roles](security-access-control-overview.md) for a user (i.e. the DEFAULT_SECONDARY_ROLES user
property) are not activated in the session.

To activate the default secondary roles for a user in a session and allow executing the [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md)
command while using External OAuth, complete this step:

1. Configure the security integration for the connection. Set the EXTERNAL_OAUTH_ANY_ROLE_MODE parameter value to either ENABLE or
   ENABLE_FOR_PRIVILEGE when you create the security integration (using CREATE SECURITY INTEGRATION) or later (using ALTER SECURITY
   INTEGRATION).

### Using Client Redirect with Power BI SSO to Snowflake

Snowflake supports using Client Redirect with Power BI SSO to Snowflake.

For more information, see [Redirecting client connections](client-redirect.md).

### Using replication with Power BI SSO

Snowflake supports replication and failover/failback of the External OAuth security integration from a source account to a target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

### Connecting to Snowflake from Power BI

For more details on how to connect to Snowflake from Power BI, refer to the Power BI documentation.

* [Power BI Desktop documentation](https://docs.microsoft.com/en-us/power-bi/desktop-connect-snowflake)
* [Power BI Service documentation](https://docs.microsoft.com/en-us/power-bi/service-connect-snowflake)

### Using network policies with External OAuth

Currently, network policies cannot be added to an External OAuth security integration, which means that you cannot define a network policy
that applies only to the Power BI integration. However, you can still implement network policies that apply broadly to the entire
Snowflake account. For information about the Microsoft IP range that should be included in the network policy, see the Prerequisites
section (in this topic).

## Troubleshooting

* Warehouse resumption. If a given user attempts to use a suspended warehouse, Microsoft Power BI displays an error message that is not
  described in Error Messages. Verify, and if necessary, configure the warehouse to resume automatically to resolve the error message.
  For more information, see [Starting or resuming a warehouse](warehouses-tasks.md).
* While attempting to connect Power BI to Snowflake, errors may occur. Depending on the error message it may require troubleshooting in
  Microsoft, Snowflake, or both.

  + Error Messages describes common error messages Snowflake can return that display in Power BI.
  + Login History describes how to use Snowflake to verify whether or when a user last accessed Snowflake.

### Error messages

The following table describes error messages Snowflake returns while a user authenticates in Power BI:

| Behavior | Error Message | Troubleshooting Action |
| --- | --- | --- |
| Invalid access token or audience value. | Failed to update data source credentials: ODBC:ERROR [28000] Invalid OAuth access token. [<number>]. | Verify that the `external_oauth_issuer` parameter contains the correct value. . In Microsoft Entra ID, verify the access token is current. |
| AAD user not found in Snowflake account. | Failed to update data source credentials: ODBC:ERROR [28000] Incorrect username or password was specified. | Verify that the user exists in Snowflake (either the `name` or `login_name` attribute value matches with the user’s UPN value in Microsoft Entra ID). If you are adding a user, then verify that the UPN value does not already exist in Microsoft Entra ID. |
| Snowflake user present, but disabled. | Failed to update data source credentials: ODBC:ERROR [28000] User access disabled. Contact your local system administrator. | In Snowflake, run `desc user <username>` to verify if the `disabled` attribute is set to `true`. If you want this user to be allowed, run `alter user <username> set disabled = true;`. Try to access Snowflake from Power BI again. |
| Snowflake receives an expired AAD token from Power BI. | Failed to update data source credentials: ODBC:ERROR [28000] OAuth access token expired. [<number>]. | Contact Snowflake Support. |
| Security integration not created or disabled in Snowflake account. | Failed to update data source credentials: ODBC:ERROR [28000] OAuth Authz Server Integration is not enabled. | Run `desc <security_integration_name>` to verify or recreate the security integration. |
| Default role is not set for the user. | Failed to update data source credentials: ODBC: ERROR [28000] No default role has been assigned to the user, contact a local system administrator to assign a default role and retry. | Set default role for the user. |
| Default role for the user is not granted to the user. | Test failed because of 250001 (08001): Failed to connect to DB: <host>. User’s configured default role ‘<ROLE>’ is not granted to this user. Contact your local system administrator, or attempt to login using a CLI client with a connect string selecting another role, e.g. PUBLIC. | Check the default role for the user and grant it to them. |

### Login history

If a user is able to access Power BI but not instantiate a Snowflake session, you can determine when the user last accessed Snowflake by
running the following commands using any supported [connector](../guides-overview-connecting.md) or the Snowflake web interface. Note that only
successful authentications are logged.

```sqlexample
use role accountadmin;
select *
from table(information_schema.login_history(dateadd('hours',-1,current_timestamp()),current_timestamp()))
order by event_timestamp;
```

For each result, evaluate the `USER_NAME` and `FIRST_AUTHENTICATION_FACTOR` columns.

> * The `USER_NAME` value should align with the attribute mappings described the Prerequisites section.
> * The `FIRST_AUTHENTICATION_FACTOR` should be set to `OAUTH_ACCESS_TOKEN`.

---
title: Premium views in the organization account
source: https://docs.snowflake.com/en/user-guide/organization-accounts-premium-views.md
section: User Guide
---

# Premium views in the organization account

The [ORGANIZATION_USAGE schema](../sql-reference/organization-usage.md) contains views that provide organization-level data. The
ORGANIZATION_USAGE schema in the [organization account](organization-accounts.md)
contains views that are not available in the ORGANIZATION_USAGE schema of a regular account. These views are considered *premium views*
because they aggregate usage and object data from all accounts into a single view that is not otherwise available, and therefore incur
additional costs.

Premium views correspond to views in the ACCOUNT_USAGE schema, but provide organization-level data rather than account-level data. For
example, someone could query the TAG_REFERENCES view in the ACCOUNT_USAGE schema to learn about how tags are used in a specific account, but
someone could query the TAG_REFERENCES view in the ORGANIZATION_USAGE schema of the organization account to learn how tags are used
throughout the organization.

For a list of premium views, see [Organization Usage](../sql-reference/organization-usage.md).

> **Note:**
>
> It can take two weeks from the time the organization account is created until premium views are fully populated with 365 days of
> historical data from accounts.

## Costs associated with premium views

Premium views incur additional costs based on how many records were processed to generate the views. For the current rate for premium views, find the Organization Usage table in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Grant access to the premium views

For information about granting access to premium views, see [Access schema in the organization account](../sql-reference/organization-usage.md).

## Organizations without a capacity contract

By default, premium views are only available in organizations that have a capacity contract. If you have on demand accounts and want to
access premium views, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Effect on views in the ACCOUNT_USAGE schema

Snowflake uses the hidden schema `snowflake.organization_usage_local` to store internal objects used in conjunction with premium views.
These objects might be visible in the ACCOUNT_USAGE views in the organization account. Because these objects are internal, they might
change without notice in the future.

---
title: Preparing to load data
source: https://docs.snowflake.com/en/user-guide/data-load-prepare.md
section: User Guide
---

# Preparing to load data

This topic provides an overview of supported data file formats and data compression. Depending on your data’s structure, you might need to
[prepare](data-load-considerations-prepare.md) the data before loading it.

## Supported data types

See [SQL data types reference](../sql-reference-data-types.md) for descriptions of the data types supported by Snowflake.

## Data file compression

We recommend that you compress your data files when you are loading large data sets. See [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md) for the compression algorithms supported for each data type.

When loading compressed data, Snowflake will automatically determine the file and codec compression method for your data files. The COMPRESSION file format option describes how your data files are already compressed in the stage. Set the COMPRESSION option in one of the following ways:

> * As a file format option specified directly in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement.
> * As a file format option specified for a named file format or stage object. The named file format/stage object can then be referenced in the COPY INTO *<table>* statement.

## Supported file formats

The following file formats are supported:

> | Structured/Semi-structured | Type | Notes |
> | --- | --- | --- |
> | Structured | Delimited (CSV, TSV, etc.) | Any valid singlebyte delimiter is supported; default is comma (i.e. CSV). |
> | Semi-structured | JSON |  |
> |  | Avro | Includes automatic detection and processing of compressed Avro files. |
> |  | ORC | Includes automatic detection and processing of compressed ORC files. |
> |  | Parquet | Includes automatic detection and processing of compressed Parquet files. . . Currently, Snowflake supports the schema of Parquet files produced using the Parquet writer v1. Files produced using v2 of the writer are not supported. |
> |  | XML |  |

File format options specify the type of data contained in a file, as well as other related characteristics about the format of the data. The file format options you can specify are different depending on the type of data you plan to load. Snowflake provides a full set of file format option defaults.

### Semi-structured file formats

Snowflake natively supports semi-structured data, which means semi-structured data can be loaded into relational tables without requiring the definition of a schema in advance. Snowflake supports loading semi-structured data directly into columns of type VARIANT (see [Semi-structured data types](../sql-reference/data-types-semistructured.md) for more details).

Currently supported semi-structured data formats include JSON, Avro, ORC, Parquet, or XML:

* For JSON, Avro, ORC, and Parquet data, each top-level, complete object is loaded as a separate row in the table. Each object can contain new line characters and spaces as long as the object is valid.
* For XML data, each top-level element is loaded as a separate row in the table. An element is identified by a start and close tag of the same name.

Typically, tables used to store semi-structured data consist of a single VARIANT column. Once the data is loaded, you can query the data similar to structured data. You can also perform other tasks, such as extracting values and objects from arrays. For more information, see the [FLATTEN](../sql-reference/functions/flatten.md) table function.

> **Note:**
>
> Semi-structured data can be loaded into tables with multiple columns, but the semi-structured data must be stored as a field in a structured file (e.g. CSV file). Then, the data can be loaded into a specified column in the table.

### Named file formats

Snowflake supports creating named file formats, which are database objects that encapsulate all of the required
format information. Named file formats can then be used as input in all the same places where you can specify individual file format options, thereby
helping to streamline the data loading process for similarly-formatted data.

Named file formats are optional, but are recommended when you plan to load similarly formatted data on a regular basis.

#### Creating a named file format

You can create a file format using either Snowsight or SQL:

> Snowsight:
> :   1. In the navigation menu, select Catalog » Database Explorer.
>     2. Locate a database and select the schema to which you want to add the file format.
>     3. Select Create » File Format.
>     4. Complete the SQL statement and select Create File Format.
>
> SQL:
> :   [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md)

For descriptions of all file format options and the default values, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).

## Supported copy options

Copy options determine the behavior of a data load with regard to error handling, maximum data size, and so on.

For descriptions of all copy options and the default values, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

### Overriding default file format and copy options

You can specify the desired load behavior (i.e. override the default settings) in any of the following locations:

In the table definition:
:   Not recommended.

In the named stage definition:
:   Not recommended.

Directly in the COPY INTO TABLE statement when loading data:
:   Explicitly set the options separately. For more information, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

> **Note:**
>
> Do not specify copy options using the CREATE STAGE, ALTER STAGE, CREATE TABLE, or ALTER TABLE commands. We recommend that you use the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to specify copy options.

If file format options or copy options are specified in multiple locations, the load operation applies the options in the following order of precedence:

1. COPY INTO TABLE statement.
2. Stage definition.
3. Table definition.

> **Note:**
>
> File format options set in multiple locations are not cumulative. Any options set in one place override all options (whether the same or different options) set lower in the order of precedence.
>
> Copy options set in multiple locations are cumulative. Individual options set in one place override the same option set lower in the order of precedence.

---
title: Preparing your data files
source: https://docs.snowflake.com/en/user-guide/data-load-considerations-prepare.md
section: User Guide
---

# Preparing your data files

This topic provides best practices, general guidelines, and important considerations for preparing your data files for loading.

## File sizing best practices

For best load performance and to avoid size limitations, consider
the following data file sizing guidelines. Note that these recommendations apply to bulk data loads as well as
continuous loading using Snowpipe.

### General file sizing recommendations

The number of load operations that run in parallel can’t exceed the number of data files to be loaded. To optimize the number of parallel operations for a load, we recommend aiming to produce data files roughly 100-250 MB (or larger) in size compressed.

> **Note:**
>
> Loading very large files (for example, 100 GB or larger) is not recommended.
>
> If you must load a large file, carefully consider the [ON_ERROR](../sql-reference/sql/copy-into-table.md) copy option value. Aborting or
> skipping a file due to a small number of errors could result in delays and wasted credits. In addition, if a data loading operation
> continues beyond the maximum allowed duration of 24 hours, it could be aborted without any portion of the file being committed.

Aggregate smaller files to minimize the processing overhead for each file. Split larger files into a greater number of smaller files to distribute the load among the compute resources in an active warehouse. The number of data files that are processed in parallel is determined by the amount of compute resources in a warehouse. We recommend splitting large files by line to avoid records that span chunks.

If your data source doesn’t allow exporting data files in smaller chunks, you can use a third-party utility to split large CSV files.

If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

#### Linux or macOS

The `split` utility enables you to split a CSV file into multiple smaller files.

**Syntax:**

> ```bash
> split [-a suffix_length] [-b byte_count[k|m]] [-l line_count] [-p pattern] [file [name]]
> ```

For more information, type `man split` in a terminal window.

**Example:**

> ```bash
> split -l 100000 pagecounts-20151201.csv pages
> ```

This example splits a file named `pagecounts-20151201.csv` by line length. Suppose the large single file is 8 GB and contains 10 million lines. Split by 100,000, each of the 100 smaller files is 80 MB (10 million / 100,000 = 100). The split files are named `pagessuffix`.

#### Windows

Windows does not include a native file split utility; however, Windows supports many third-party tools and scripts that can split large data files.

## Size limits for database objects

When you use any of the available methods for [loading data into Snowflake](data-load-overview.md),
you can store objects with sizes up to the following limits:

| Data type | Storage limit |
| --- | --- |
| ARRAY | 128 MB |
| BINARY | 64 MB |
| GEOGRAPHY | 64 MB |
| GEOMETRY | 64 MB |
| OBJECT | 128 MB |
| VARCHAR | 128 MB |
| VARIANT | 128 MB |

The default size for VARCHAR columns is 16 MB (8 MB for binary). To create tables with column sizes larger than 16 MB,
specify the size explicitly. For example:

```sqlexample
CREATE OR REPLACE TABLE my_table (
  c1 VARCHAR(134217728),
  c2 BINARY(67108864));
```

To use the new limits for VARCHAR columns, you can alter tables to change the column size. For example:

```sqlexample
ALTER TABLE my_table ALTER COLUMN col1 SET DATA TYPE VARCHAR(134217728);
```

To apply the new size to columns of type BINARY in these tables, recreate the tables. You can’t alter the length
of a BINARY column in an existing table.

For columns of type ARRAY, GEOGRAPHY, GEOMETRY, OBJECT, and VARIANT, you can store objects larger than 16 MB
in existing tables and new tables, by default, without specifying the length. For example:

```sqlexample
CREATE OR REPLACE TABLE my_table (c1 VARIANT);
```

If you have procedures and functions that were created in the past and that use VARIANT, VARCHAR, or BINARY values as input,
you might need to recreate them (without specified length) to support objects larger than 16 MB. For example:

```sqlexample
CREATE OR REPLACE FUNCTION udf_varchar(g1 VARCHAR)
  RETURNS VARCHAR
  AS $$
    'Hello' || g1
  $$;
```

For externally managed [Iceberg tables](tables-iceberg-create.md), the default length for VARCHAR and BINARY columns is 128 MB.
This default length applies to newly created or refreshed tables. If you have tables that were created in the past, smaller limits might
apply to them. You can refresh these tables so that they support larger size limits.

For managed Iceberg tables, the default length for VARCHAR and BINARY columns is 128 MB. Tables that were created before the new size limits
were enabled still have the previous default lengths. To apply the new size to columns of type VARCHAR in these tables, recreate the tables
or alter the columns. The following example alters a column to use the new size limit:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN col1 SET DATA TYPE VARCHAR(134217728);
```

To apply the new size to columns of type BINARY in these tables, recreate the tables. You can’t alter the length
of a BINARY column in an existing table.

### Driver versions that support large objects in the result set

Drivers support objects larger than 16 MB (8 MB for BINARY, GEOMETRY, and GEOGRAPHY). You might need to update your drivers to the
versions that support larger objects. The following driver versions are required:

| Driver | Minimum supported version | Release date |
| --- | --- | --- |
| Snowpark Library for Python | 1.21.0 | August 19, 2024 |
| Snowflake Connector for Python | 3.10.0 | April 29, 2024 |
| JDBC | 3.17.0 | July 8, 2024 |
| ODBC | 3.6.0 | March 17, 2025 |
| Go Snowflake Driver | 1.1.5 | April 17, 2022 |
| .NET | 2.0.11 | March 15, 2022 |
| Snowpark Library for Scala and Java | 1.14.0 | September 14, 2024 |
| Node.js | 1.6.9 | April 21, 2022 |
| Spark connector | 3.0.0 | July 31, 2024 |
| PHP | 3.0.2 | August 29, 2024 |
| Snowflake CLI | 3.0.0 | October 1, 2024 |
| SnowSQL | 1.3.2 | August 12, 2024 |

If you try to use a driver that doesn’t support larger objects, an error similar to the following example is returned:

```output
100067 (54000): The data length in result column <column_name> is not supported by this version of the client.
Actual length <actual_size> exceeds supported length of 16777216.
```

## Continuous data loads — that is, Snowpipe — and file sizing

Snowpipe is designed to load new data typically within a minute after a file notification is sent; however, loading can take significantly longer for really large files or in cases where an unusual amount of compute resources is necessary to decompress, decrypt, and transform the new data.

In addition to resource consumption, an overhead to manage files in the internal load queue is included in the utilization costs charged for Snowpipe. This overhead increases in relation to the number of files queued for loading. This overhead charge appears as Snowpipe charges in your billing statement
because Snowpipe is used for event notifications for the automatic external table refreshes.

For the most efficient and cost-effective load experience with Snowpipe, we recommend following the file sizing recommendations in File sizing best practices (in this topic). Loading data files roughly 100-250 MB or larger reduces the overhead charge relative to the amount of total data loaded to the point where the overhead cost is immaterial.

If it takes longer than one minute to accumulate MBs of data in your source application, consider creating a new (potentially smaller) data file once per minute. This approach typically leads to a good balance between cost (that is, resources spent on Snowpipe queue management and the actual load) and performance (that is, load latency).

Creating smaller data files and staging them in cloud storage more often than once per minute has the following disadvantages:

* A reduction in latency between staging and loading the data can’t be guaranteed.
* An overhead to manage files in the internal load queue is included in the utilization costs charged for Snowpipe. This overhead increases in relation to the number of files queued for loading.

Various tools can aggregate and batch data files. One convenient option is Amazon Data Firehose. Firehose allows defining both the
desired file size, called the *buffer size*, and the wait interval after which a new file is sent (to cloud storage in this case), called
the *buffer interval*. For more information, see the
[Amazon Data Firehose documentation](https://docs.aws.amazon.com/firehose/latest/dev/create-configure.html). If your source application
typically accumulates enough data within a minute to populate files larger than the recommended maximum for optimal parallel processing,
you could decrease the buffer size to trigger delivery of smaller files. Keeping the buffer interval setting at 60 seconds (the minimum
value) helps avoid creating too many files or increasing latency.

## Preparing delimited text files

Consider the following guidelines when preparing your delimited text (CSV) files for loading:

* UTF-8 is the default character set, however, additional encodings are supported. Use the ENCODING file format option to specify the character set for the data files. For more information, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).
* Fields that contain delimiter characters should be enclosed in quotes (single or double). If the data contains single or double quotes, then those quotes must be escaped.
* Carriage returns are commonly introduced on Windows systems in conjunction with a line feed character to mark the end of a line (`\r \n`). Fields that contain carriage returns should also be enclosed in quotes (single or double).
* The number of columns in each row should be consistent.

## Semi-structured data files and subcolumnarization

When semi-structured data is inserted into a VARIANT column, Snowflake uses certain rules to extract as much of the data as possible
to a columnar form. The rest of the data is stored as a single column in a parsed semi-structured structure.

By default, Snowflake extracts a maximum of 200 elements per partition, per table. To increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Elements that are not extracted

Elements with the following characteristics are not extracted into a column:

* Elements that contain even a single “null” value are not extracted into a column.
  This applies to elements with “null” values and not to elements with missing values, which are represented in columnar form.

  This rule ensures that no information is lost (that is, that the difference between VARIANT “null” values and SQL NULL values is not lost).
* Elements that contain multiple data types. For example:

  The `foo` element in one row contains a number:

  ```sqljson
  {"foo":1}
  ```

  The same element in another row contains a string:

  ```sqljson
  {"foo":"1"}
  ```

### How extraction impacts queries

When you query a semi-structured element, Snowflake’s execution engine behaves differently according to whether an element was extracted.

* If the element was extracted into a column, the engine scans only the extracted column.
* If the element was not extracted into a column, the engine must scan the entire JSON structure,
  and then for each row traverse the structure to output values. This impacts performance.

To avoid the performance impact for elements that were not extracted, do the following:

* Extract semi-structured data elements containing “null” values into relational columns before you load them.

  Alternatively, if the “null” values in your files indicate missing values and have no other special meaning,
  we recommend setting the [file format option](../sql-reference/sql/create-file-format.md) STRIP_NULL_VALUES to TRUE
  when you load the semi-structured data files. This option removes OBJECT elements or ARRAY elements containing “null” values.
* Ensure each unique element stores values of a single data type that is native to the format (for example, string or number for JSON).

## Numeric data guidelines

* Avoid embedded characters, such as commas (for example, `123,456`).
* If a number includes a fractional component, it should be separated from the whole number portion by a decimal point (for example, `123456.789`).
* Oracle only. The Oracle NUMBER or NUMERIC types allow for arbitrary scale, meaning they accept values with decimal components even if the data type was not defined with a precision or scale. Whereas in Snowflake, columns designed for values with decimal components must be defined with a scale to preserve the decimal portion.

## Date and timestamp data guidelines

* For information on the supported formats for date, time, and timestamp data, see [Date and time input and output formats](../sql-reference/date-time-input-output.md).
* Oracle only. The Oracle DATE data type can contain date *or* timestamp information. If your Oracle database includes DATE columns that also store time-related information, map these columns to a TIMESTAMP data type in Snowflake rather than DATE.

> **Note:**
>
> Snowflake checks temporal data values at load time. Invalid date, time, and timestamp values (for example, `0000-00-00`) produce an error.

---
title: Pricing plan manifest reference
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/pricing-plan-manifest-reference.md
section: User Guide
---

# Pricing plan manifest reference

Creating Snowflake pricing plans programmatically requires a manifest, written in YAML (<https://yaml.org/spec/>). Use the information provided here to learn about the parameters available in the pricing plan manifest.

## Pricing plan manifest

```yaml
#
# Pricing plan manifest
#
pricing_plan_display_name: <pricing_plan_name>
currency: <three_letter_currency_code>
pricing_model: <pricing_plan_pricing_model>
usage_details:
  free_units: <number_of_free_monthly_queries>
  free_unit_kind: <free_unit_kind>
  usage_unit_price: <price_per_unit>
  usage_unit_kind: <usage_unit_kind>
  max_fee: <maximum_fee_per_month>
billing_events:
   class: <class_name>
   display_name: <billing_event_display_name>
   billing_quantity: <price_per_unit>
   billing_unit: <display_units>
   description: <description_for_the_billing_event>
compute_pool_surcharge:
   surcharge_type: <surcharge_type>
   compute_pool_rates:
    - identifier_type: <compute_pool_type>
      identifier_name: <compute_pool_name>
      surcharge_price: <price_per_credit>
      description: <compute_pool_rate_description>
    - identifier_type: <compute_pool_type>
      identifier_name: <compute_pool_name>
      surcharge_price: <price_per_credit>
      description: <compute_pool_rate_description>
base_fee: <monthly_fixed_fee>
billing_duration: <billing_duration_in_months>
sales_motion: <pricing_plan_type>
comment: <a_note_visible_only_to_the_provider>
metadata:
  description: <pricing_plan_description>
  price_prefix: <pricing_plan_prefix>
  pricing_unit: <|sf-web-interface|_pricing_plan_pricing_unit>
  button_text: <|sf-web-interface|_button_text>
  index: <|sf-web-interface|_pricing_plan_index>
  value_propositions: <pricing_plan_value_proposition>
visibility: <pricing_plan_visibility>
#
# Default offer fields
#
contract_type: <pricing plan contract type>
contract_duration_months: <pricing plan contract duration>
```

## Pricing plan parameters

The parameters within the pricing plan manifest allow you to create pricing plans that meet your specific business requirements. Required and optional parameters are identified.

pricing_plan_display_name
:   Required. String. The pricing plan name that is visible to providers and consumers.

currency
:   Required. String. The three-letter currency code for the pricing plan. The default is USD.

pricing_model
:   Required. String. The pricing model for the pricing plan. The available values are FLAT_FEE and USAGE_BASED. For more information about pricing models, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).

usage_details
:   Optional. Defines pricing plan usage limitations. You can specify the following optional parameters.

    `free_units``free_unit_kind``usage_unit_price``usage_unit_kind``max_fee`

    Long. The number of queries a consumer can make without incurring a monthly usage fee.

    String. The free unit type. The accepted value is QUERY.

    Double. The price per unit kind.

    String. The usage unit type. The accepted value is QUERY.

    Double. The maximum fee a consumer can be charged monthly. The accepted value is QUERY.

billing_events
:   Optional. Defines consumer billing events. You can specify the following optional parameters.

    `class``display_name``billing_quantity``billing_unit``description`

    String. The billing class name.

    String. The billing event name.

    Double. The price per unit.

    String. The billing unit to display.

    String. The billing event description.

compute_pool_surcharge
:   Optional. Defines rates for compute pool use. You can specify the following optional parameters.

    `surcharge_type``compute_pool_rates`

    String. The compute pool surcharge type. The accepted values are HOUR or CREDIT.

    String. Defines the rates charged for compute pool access. You can specify the following parameters.

    `identifier_type``identifier_name``surcharge_price``description`

    String. The compute pool identifier type. The accepted value is COMPUTE_POOL_NAME.

    String. The compute pool identifier name. This value must be identical to the name of the compute pool being used in the app.

    Double. The price per compute pool credit when the value is CREDIT, or the price per compute hour when the value is HOUR. When a compute node is started or resumed, a minimum of 5 minutes worth of Snowflake credits are consumed. After a compute node is started or resumed, virtual warehouses and compute nodes are charged on a per second basis, rounded up to the nearest whole second.

    String. The compute pool description.

base_fee
:   Required. Double. The pricing plan monthly fixed fee.

billing_duration
:   Required. Long. The pricing plan duration in months.

sales_motion
:   Required. String. The pricing plan type. The accepted values are SELF_SERVE and TALK_TO_SALES.

comment
:   Optional. String. Pricing plan information that is visible only to a provider.

metadata
:   Optional. Provides additional pricing plan information. You can specify the following optional parameters.

    `description``price_prefix``pricing_unit``button_text``index``value_propositions`

    String. A description of the pricing plan.

    String. The prefix for pricing plan pricing.

    String. The pricing plan pricing unit to display in Snowsight.

    String. The text to display on the Snowsight pricing plan button.

    String. The pricing plan index displayed in Snowsight.

    String. The pricing plan value proposition.

visibility
:   Required. String. Defines pricing plan visibility. The accepted values are VISIBLE and HIDDEN.

contract_type
:   Required. String. The pricing plan contract type. The accepted values are SUBSCRIPTION and LIMITED_TIME.

contract_duration_months
:   Required. Long. The pricing plan contract duration in months.

## Examples

The following example defines a flat-fee, self-serve pricing plan.

```yaml
display_name: Default pricing plan display name
currency: USD
pricing_model: FLAT_FEE
base_fee: 100.0
billing_duration_months: 1
sales_motion: SELF_SERVE
comment: Comment for the pricing plan
metadata:
  description: Pricing plan description
  price: $100 / unit
  button_text: Buy Now
  value_propositions:
    - val 1
    - val 2
visibility: VISIBLE
contract_type: LIMITED_TIME
contract_duration_months: 12
state: PUBLISHED
```

---
title: Pricing plans and offers
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md
section: User Guide
---

# Pricing plans and offers

## Understand pricing plans

A pricing plan lets providers specify a [pricing model](../../../../collaboration/provider-listings-pricing-model.md) (flat-fee versus usage-based), a base price, a billing frequency, and more for their listings. These listings can be public (available on the Snowflake Marketplace) or private (shared directly with a consumer).

* Providers can create public listings on the Snowflake Marketplace that include multiple pricing plans.
* For both public and private listings, providers can optionally specify to show the price and pricing information on the listing page, allowing consumers to review this information before they purchase.
* For private listings, where providers opt to not show pricing information on the pricing plan, providers can still reference the listing when extending offers to consumers.

### Self-serve pricing plans

Self-serve pricing plans allow consumers to purchase products directly from listings without provider interaction. This type of pricing plan includes a price, pricing model, and billing frequency on the provider’s listing page.

Providers can configure multiple pricing plans for a listing, such as “Good-Better-Best” pricing. This allows consumers to select the pricing option that works best for them. And when a consumer selects a pricing plan, they can immediately complete a purchase.

## Understand offers

Providers can create offers that define purchase terms for a listing and then extend those offers to consumers. Offers provide individualized billing, payment terms, payment schedules, and contract start and end dates. Before accepting or rejecting an offer, consumers can review the terms and request changes.

When consumers purchase an offer on the Snowflake Marketplace, all existing billing methods are supported, including the [Marketplace Capacity Drawdown (MCD) program](../../../../collaboration/marketplace-capacity-drawdown.md).

Providers can extend offers to consumers in Snowsight or programmatically.

### Understand offer types

Providers can create two types of offers: standard and private.

* Standard offers are public on the Snowflake Marketplace and are tied to pricing plans. When a consumer views a listing with a standard offer, they can see pricing details and accept the offer directly from the listing page.
* Private offers are individualized offers that providers can extend directly to specific consumers. Private offers aren’t visible on the Snowflake Marketplace. They also don’t have to be tied to a pricing plan. For private offers that are tied to pricing plans, providers can apply negotiated pricing that includes discounts and custom terms. For private offers that aren’t tied to a pricing plan, providers can create a [one-time pricing offer](providers-create-manage-offers.md) for the listing.

## FAQs

**Can I add offers to my existing paid listings?**

No, existing paid listings are v1 listings. Offers are only available on v2 paid listings.

**How can I convert my existing v1 paid listing to a v2 paid listing?**

A migration feature that will allow converting a v1 paid listing to a v2 paid listing will be available to partners at a future date.

**If my consumer is on a free trial and they accept an offer, what happens?**

After a consumer accepts an offer, the offer will govern access to and metering of the product. Specifically, the offer’s access start date determines when the consumer can start using the product. To ensure a seamless experience for your consumers, set the access start date to When offer accepted. This way, the consumer can start using the product immediately after accepting the offer, even if they are still on a free trial.

**How can I give free access for a period of time on a paid offer?**

You can set the access start time to When offer accepted and set the First invoice date to the date when you want to start charging the consumer.

---
title: Private connectivity for inbound network traffic
source: https://docs.snowflake.com/en/user-guide/private-connectivity-inbound.md
section: User Guide
---

# Private connectivity for inbound network traffic

Your connection to Snowflake can be routed over the public Internet or through a private IP address associated with the cloud platform that
hosts your Snowflake account. By using your cloud platform’s private connectivity solution to create private endpoints, you can harden your
security posture so that inbound network traffic uses private connectivity when accessing the following features:

* To the Snowflake Service
* To Snowsight
* To Streamlit in Snowflake
* To internal stages
* To Snowflake-managed storage volumes
* To Snowpark Container Services
* To Snowflake Intelligence

## To the Snowflake Service

When the routing is through a private IP address *from your VPC or VNET to the Snowflake VPC or VNet*, that is *private
connectivity to the Snowflake Service*. These connections use [AWS PrivateLink](admin-security-privatelink.md),
[Azure Private Link](privatelink-azure.md), or
[Google Cloud Private Service Connect](private-service-connect-google.md). The service depends on the cloud platform that
hosts your Snowflake account.

## To Snowsight

To use private connectivity to access Snowsight, see [Configuring private connectivity for Snowsight](ui-snowsight-gs.md).

After private connectivity is configured, users can [sign in using private connectivity](ui-snowsight-gs.md).

## To Streamlit in Snowflake

To access Streamlit in Snowflake with AWS PrivateLink, Azure Private Link, or Google Cloud Private Service Connect, see [Private connectivity for Streamlit in Snowflake](../developer-guide/streamlit/object-management/privatelink.md).

## To internal stages

You can use private connectivity to connect to Snowflake internal stages. For information, see the following:

* [AWS VPC interface endpoints for internal stages](private-internal-stages-aws.md)
* [Azure private endpoints for internal stages](private-internal-stages-azure.md)
* [Google Private Service Connect endpoints for internal stages](private-internal-stages-gcp.md)

## To Snowflake-managed storage volumes

You can use private connectivity to connect to Snowflake-managed storage volumes for Apache Iceberg tables. For information, see
the following:

* [AWS VPC interface endpoints for Snowflake-managed storage volumes](private-managed-volumes-aws.md)
* [Azure private endpoints for Snowflake-managed storage volumes](private-managed-volumes-azure.md)

## To Snowpark Container Services

You can use private connectivity to connect to Snowpark Container Services. For information, see [Inbound connectivity](../developer-guide/snowpark-container-services/private-connectivity.md).

## To Snowflake Intelligence

You can use private connectivity to connect to Snowflake Intelligence. For information, see [Configure Snowflake Intelligence with private connectivity](snowflake-cortex/snowflake-intelligence/deploy-agents.md).

---
title: Private connectivity for inbound network traffic in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-inbound.md
section: User Guide
---

# Private connectivity for inbound network traffic in Snowflake Open Catalog

Your connection to Snowflake Open Catalog can be routed over the public internet or through a private IP address associated with the cloud
platform that hosts your Open Catalog account. By using your cloud platform’s private connectivity solution to create private endpoints,
you can harden your security posture so that inbound network traffic uses private connectivity.

When your query engine connects to your Snowflake Open Catalog account, inbound network traffic is generated for your account. In addition,
inbound network traffic is generated when you access the Open Catalog UI.

> **Note:**
>
> For Snowflake to query Open Catalog–managed tables through private connectivity, Snowflake and Open Catalog must both be located
> in the same deployment.

## Configuring private connectivity for your Open Catalog account

Private connectivity for inbound network traffic is supported for the following cloud platforms:

* [AWS](private-connectivity-inbound-configure-aws.md)
* [Azure](private-connectivity-inbound-configure-azure.md)

## Configuring private connectivity for the Open Catalog UI

You can access the Open Catalog UI through private connectivity. To configure this access, see
[Configure private connectivity for the Snowflake Open Catalog UI](private-connectivity-ui-configure.md).

## Billing

Snowflake calculates costs for inbound private connectivity based on endpoint usage. For details on pricing for inbound private connectivity,
see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: Private connectivity for outbound network traffic
source: https://docs.snowflake.com/en/user-guide/private-connectivity-outbound.md
section: User Guide
---

# Private connectivity for outbound network traffic

Snowflake features such as external functions and external stages generate outbound network traffic from Snowflake to a cloud platform. For
increased security, you can create private endpoints in Snowflake to access the cloud platform by using the platform’s private connectivity
solution rather than traversing the public Internet. This lets you access cloud platform services privately and securely from Snowflake.

Outbound private connectivity is available for the following Snowflake features:

* External network locations using external access integrations
* External functions
* External stages
* External tables
* Catalog integrations for Apache Iceberg™ tables
* External volumes for Apache Iceberg™ tables
* Apache Iceberg™ REST catalog integrations
* Snowpipe automation

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Basic workflow

Each Snowflake feature that can use outbound private connectivity has its own prerequisites and configuration procedures. However,
there are common steps to establish outbound private connectivity.

For example, a Snowflake account administrator (user with the ACCOUNTADMIN role) or a user that has a role with the appropriate privileges
can do the following:

1. Complete any prerequisite configuration for the feature generating outbound network traffic.
2. In Snowflake, provision a private connectivity endpoint to connect to the cloud platform.
3. Authorize the private connectivity endpoint.
4. Retrieve the private connectivity endpoint URL that points to the service or resource.
5. Integrate the private connectivity endpoint URL into the Snowflake configuration of your Snowflake feature.
6. Deprovision private connectivity endpoints that are not actively being used to avoid cloud platform
   limitations.

> **Tip:**
>
> These steps are self-service but might require collaboration with different parties to complete the setup. Consult with the
> administrators that own the different services before starting.
>
> The placement of these steps depends on the Snowflake feature. For details, refer to the configuration procedure for feature.

## Scaling considerations

Your implementation of outbound private connectivity must conform to the following limitations associated with cloud providers:

Cannot have more than five private endpoints per Snowflake account
:   Private endpoints that have been deprovisioned within the last seven days count toward this limit.

    To increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Cannot have more than one endpoint to the same AWS service or Azure subresource
:   For AWS, this limitation is per service. So if you have one endpoint to an S3 bucket, you cannot have a different endpoint to another S3
    bucket because the endpoint-to-S3 service combination would be duplicated.

    For Azure, if a resource has only one subresource, you can only have one endpoint. But if the resource has different subresources
    available, you can have multiple endpoints to the resource as long as they connect to different subresources.

    > **Note:**
    >
    > You can duplicate an endpoint-to-service or endpoint-to-subresource combination in a different Snowflake account.

## U.S. government regions

Support for outbound private connectivity from [U.S. SnowGov regions](intro-regions.md) is as follows:

Microsoft Azure:
:   Outbound private connectivity from regions on Microsoft Azure Government is supported. The source region with the private endpoints and
    the target Microsoft service must both be on Microsoft Azure Government.

AWS:
:   Outbound private connectivity from regions on AWS GovCloud is supported.

## External network locations using external access integrations

You can use outbound private connectivity and external access integrations to reach external network locations from Snowpark Container
Services or from UDF/UDTF and stored procedures within Snowpark.

From Snowpark
:   * [External network access and private connectivity on AWS](../developer-guide/external-network-access/creating-using-private-aws.md)
    * [External network access and private connectivity on Microsoft Azure](../developer-guide/external-network-access/creating-using-private-azure.md)

From Snowpark Container Services
:   * [Network egress using private connectivity](../developer-guide/snowpark-container-services/service-network-communications.md)

## External functions

* [Private connectivity with external functions: Azure Portal](../sql-reference/external-functions-creating-azure-ui-private-connect.md)
* [Private connectivity with external functions: Azure ARM template](../sql-reference/external-functions-creating-azure-template-private-connect.md)

## External stages

* [Private connectivity to external stages for Amazon Web Services](data-load-aws-private.md)
* [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](data-load-azure-private.md)
* [Private connectivity to external stages for Google Cloud](data-load-gcs-private.md)

## External tables

An [external table](tables-external-intro.md) is a Snowflake feature that allows you to query data stored in an external
stage as if the data were inside a table in Snowflake. If you configure the external stage to use private connectivity, then network traffic
to the external table uses private connectivity rather than the public internet.

## Catalog integrations for Apache Iceberg™ tables

* [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](tables-iceberg-configure-catalog-integration-rest-private.md)

## External volumes for Apache Iceberg™ tables

* [Private connectivity to external volumes for Amazon Web Services](tables-iceberg-configure-external-volume-s3-private.md)
* [Private connectivity to external volumes for Microsoft Azure](tables-iceberg-configure-external-volume-azure-private.md)
* [Private connectivity to external volumes for Google Cloud](tables-iceberg-configure-external-volume-gcs-private.md)

## Apache Iceberg™ REST catalog integrations

* [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](tables-iceberg-configure-catalog-integration-rest-private.md)

## Snowpipe automation

* [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](data-load-azure-private.md)

---
title: Private connectivity for outbound network traffic in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/private-connectivity-outbound.md
section: User Guide
---

# Private connectivity for outbound network traffic in Snowflake Open Catalog

When you work with tables in Open Catalog, you generate outbound network traffic from your Open Catalog account to your external cloud
storage. For example:

* When you select a table in Open Catalog, Open Catalog displays the schema for the table by retrieving the metadata for the table. This
  metadata is stored in your external cloud storage.
* When your query engine attempts to load data from Open Catalog, Open Catalog accesses the external cloud storage to read the metadata
  for your Iceberg table and then returns the metadata for the table to the query engine.

By default, outbound network traffic traverses the public internet. For increased security, you can enable private connectivity for outbound
network traffic to route this traffic through private endpoints instead of the public internet.

> **Note:**
>
> Private connectivity for outbound network traffic is only supported for the following cloud storage providers:
>
> * [Amazon S3](private-connectivity-outbound-manage-endpoints-aws.md)
> * [Azure](private-connectivity-outbound-manage-endpoints-azure.md)

## Scaling considerations

Your implementation of outbound private connectivity must conform to the following limitations associated with cloud providers:

**Cannot have more than five private endpoints per Snowflake account**

> Private endpoints that have been deprovisioned within the last seven days count toward this limit.

> To increase this limit, contact Snowflake Support.

**Cannot have more than one endpoint to the same AWS service or Azure subresource**

> For AWS, this limitation is per service. So if you have one endpoint to an S3 bucket, you cannot have a different endpoint to another S3
> bucket because the endpoint-to-S3 service combination would be duplicated.

> For Azure, if a resource has only one subresource, you can only have one endpoint. But if the resource has different subresources
> available, you can have multiple endpoints to the resource as long as they connect to different subresources.

> > **Note:**
> >
> > You can duplicate an endpoint-to-service or endpoint-to-subresource combination in a different Snowflake account.

## Billing

Snowflake calculates costs for outbound private connectivity based on private endpoint usage. For details on pricing for outbound private
connectivity, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: Private connectivity to external stages and Snowpipe automation for Microsoft Azure
source: https://docs.snowflake.com/en/user-guide/data-load-azure-private.md
section: User Guide
---

# Private connectivity to external stages and Snowpipe automation for Microsoft Azure

This topic provides configuration details to set up [outbound private connectivity](private-connectivity-outbound.md) for the
following Snowflake features:

* Bulk loading from Microsoft Azure using an external stage.
* Automating Snowpipe for Microsoft Azure Blob Storage.

The differences between configuring bulk loading and Snowpipe automation for private connectivity and configuring them for public network
traffic consists of the following:

* Setting `USE_PRIVATELINK_ENDPOINT = TRUE` for the required storage integration, stage, or notification integration.
* Creating a private connectivity endpoint for the external stage (bulk loading and Snowpipe automation).
* Creating a private connectivity endpoint for the notification integration (Snowpipe automation only).

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations

> **Note:**
>
> Private connectivity isn’t supported for Microsoft Fabric OneLake storage.

You can configure outbound public connectivity and outbound private connectivity for the same storage account. If you want to do this,
create a dedicated storage integration for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Private connectivity property

The `USE_PRIVATELINK_ENDPOINT` property of a storage integration or external stage determines whether it is accessed through private
connectivity or by traversing the public network. To use private connectivity, set `USE_PRIVATELINK_ENDPOINT = TRUE`.

A stage that references a storage integration that specifies `USE_PRIVATELINK_ENDPOINT = TRUE` inherits the private endpoint
configuration. As a result, if you are using a storage integration that is configured to use private connectivity, you do not need to
specify the `USE_PRIVATELINK_ENDPOINT` property in the stage, and you cannot modify the stage to set the
`USE_PRIVATELINK_ENDPOINT` property.

## Configure external stage access

These steps are unique to using outbound private connectivity with a storage integration to unload data to an external stage on Microsoft Azure.
You need to modify the flow if you are using the stage’s `CREDENTIALS` property instead of referencing a storage integration.

These steps are required for both bulk loading and Snowpipe automation.

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a
   private connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to your external Blob storage account using private
   connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
     'mystorageaccount.blob.core.windows.net',
     'blob'
   );
   ```

   This function binds the private endpoint to the hostname, which enables the storage integration to use the private endpoint to connect
   to the storage location.
2. In the Azure Portal and as the owner of the Microsoft Azure Blob storage resource, approve the private endpoint. For details, see the
   [approval process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
3. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity (after the other necessary Snowflake objects are enabled for outbound private connectivity).

   You can continue with the next steps while waiting for the `"APPROVED"` status.
4. Create a storage integration and be sure to specify the `USE_PRIVATELINK_ENDPOINT` property:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION outbound_private_link_int
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = AZURE
     AZURE_TENANT_ID = 'cc2909f2-ed22-4c89-8e5d-bdc40e5eac26'
     STORAGE_ALLOWED_LOCATIONS = ('azure://mystorageaccount.blob.core.windows.net/mycontainer/snowflake_privatelink_external_stage_test/')
     USE_PRIVATELINK_ENDPOINT = TRUE
     ENABLED = TRUE;
   ```

   > **Note:**
   >
   > After you create the storage integration, you must grant Snowflake access to your storage locations. For more information, see
   > [Configuring a Snowflake storage integration](data-load-azure-config.md).
5. Create an external stage that references the storage integration:

   ```sqlexample
   CREATE OR REPLACE STAGE my_storage_private_stage
     URL = 'azure://mystorageaccount.blob.core.windows.net/mycontainer/snowflake_privatelink_external_stage_test/'
     STORAGE_INTEGRATION = outbound_private_link_int;
   ```
6. After the private endpoint has an `"APPROVED"` status, test unloading data from Snowflake to the external stage:

   ```sqlexample
   COPY INTO @my_storage_private_stage
     FROM mytable
     FILE_FORMAT = (FORMAT_NAME = my_csv_format);
   ```
7. View the result in your Microsoft Azure stage.

## Syntax update for notification integrations

Automating Snowpipe for Microsoft Azure Blob Storage requires you to create a notification integration. The following syntax update allows
you to configure the notification integration for private connectivity.

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ...
  USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
```

## Configure Snowpipe automation

This section modifies the procedures described in [Automating Snowpipe for Microsoft Azure Blob Storage](data-load-snowpipe-auto-azure.md) to highlight how to implement Snowpipe
automation with private connectivity. The only differences are provisioning private connectivity endpoints and configuring the
`USE_PRIVATELINK_ENDPOINT` property of the storage integration and notification integration.

1. Create a storage integration and stage, along with its dedicated private connectivity endpoint, as described
   earlier in this document.
2. [Grant Snowflake access to the storage locations](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-azure#step-2-grant-snowflake-access-to-the-storage-locations),
   as described in the Automating Snowpipe for Microsoft Azure Blob Storage topic.
3. [Configure the Event Grid Subscription](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-azure#step-1-configuring-the-event-grid-subscription),
   as described in the Automating Snowpipe for Microsoft Azure Blob Storage topic.
4. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a
   private endpoint in your Snowflake VNet to enable Snowflake to connect to your Azure queue using private
   connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/mystorageaccount',
       'mystorageaccount.queue.core.windows.net',
       'queue'
   );
   ```
5. In the Azure Portal and as the owner of the Microsoft Azure Storage resource, approve the private endpoint. For information, see the
   [approval process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
6. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity (after the other necessary Snowflake objects are enabled for outbound private connectivity).

   > **Important:**
   >
   > You must wait until the status is `APPROVED` before continuing with the next step.
7. [Retrieve the storage queue URL and tenant ID](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-azure#retrieve-the-storage-queue-url-and-tenant-id),
   as described in the Automating Snowpipe for Microsoft Azure Blob Storage topic.
8. Create a notification integration and be sure to specify the `USE_PRIVATELINK_ENDPOINT` property:

   ```sqlexample
   CREATE OR REPLACE NOTIFICATION INTEGRATION ni_pl
     ENABLED = TRUE
     TYPE = QUEUE
     NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
     AZURE_STORAGE_QUEUE_PRIMARY_URI = "https://storageaccount.queue.core.windows.net/queuename"
     AZURE_TENANT_ID = '00000000-0000-0000-0000-000000000000'
     USE_PRIVATELINK_ENDPOINT = TRUE;
   ```
9. [Grant Snowflake access to the storage queue](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-azure#grant-snowflake-access-to-the-storage-queue),
   as described in the Automating Snowpipe for Microsoft Azure Blob Storage topic.
10. [Create a pipe with auto-ingest enabled](data-load-snowpipe-auto-azure.md), as described in the Automating Snowpipe for
    Microsoft Azure Blob Storage topic.

## Disable private connectivity

The process of disabling private connectivity varies depending on whether the endpoint was provisioned for a storage integration, an
external stage, or a notification integration.

Storage integration/external stage
:   If you no longer need the private connectivity endpoint for the external stage, unset the
    `USE_PRIVATELINK_ENDPOINT` property on the stage or storage integration, and then call the
    [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

Notification integration
:   Unlike storage integrations and external stages, you cannot unset the `USE_PRIVATELINK_ENDPOINT` property of a notification
    integration. If you no longer need private connectivity, you need to drop the notification integration, then create a new one. After
    recreating the notification integration, you can call the SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT system function to deprovision the
    endpoint.

---
title: Private connectivity to external stages for Amazon Web Services
source: https://docs.snowflake.com/en/user-guide/data-load-aws-private.md
section: User Guide
---

# Private connectivity to external stages for Amazon Web Services

This topic provides configuration details to set up outbound private connectivity to an external stage on AWS. The primary difference
between the outbound public connectivity and outbound private connectivity is how you configure the storage integration or stage. For
example, you can specify the `USE_PRIVATELINK_ENDPOINT` property for the storage integration and then reference this storage
integration in the external stage. The external stage inherits the private endpoint configuration from the storage integration.
Subsequently, your connection to the AWS S3 stage goes through the AWS internal network. By configuring your storage
integration and stage to use outbound private connectivity, you add additional security to your data unloading operations by blocking
public access to the storage account.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations

You can configure outbound public connectivity and outbound private connectivity for the same storage account. If you want to do this,
create a dedicated storage integration for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Syntax updates

Storage integration
:   You can specify the `USE_PRIVATELINK_ENDPOINT` property when you create a storage integration that has one or more locations:

    ```sqlsyntax
    CREATE OR REPLACE STORAGE INTEGRATION my_int
      ...
      USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
    ```

    You can modify a storage integration and set the `USE_PRIVATELINK_ENDPOINT` property:

    ```sqlsyntax
    ALTER STORAGE INTEGRATION my_int
      SET USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
    ```

External stages
:   A stage that references a storage integration that specifies the `USE_PRIVATELINK_ENDPOINT` property inherits the private endpoint
    configuration. As a result, you do not need to specify the `USE_PRIVATELINK_ENDPOINT` property in the stage, and
    you cannot modify the stage to set the `USE_PRIVATELINK_ENDPOINT` property.

    If you are using the stage’s `CREDENTIALS` property instead of referencing a storage integration, you need to specify the
    `USE_PRIVATELINK_ENDPOINT` property when you create or modify the stage.

    ```sqlsyntax
    CREATE OR REPLACE STAGE my_sas_private_stage
      URL = '...'
      CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z')
      USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }

    ALTER STAGE my_sas_private_stage
      SET USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
    ```

    The [DESCRIBE STAGE](../sql-reference/sql/desc-stage.md) command includes the `USE_PRIVATELINK_ENDPOINT` property and its value.

## Configure external stage access

These steps are unique to using outbound private connectivity with a storage integration to unload data to an external stage on AWS. You
need to modify the flow if you are using the stage’s `CREDENTIALS` property instead of referencing a storage integration.

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system
   function to provision a private connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to your external AWS S3
   storage using private connectivity.

   As the following example demonstrates, you must use a wildcard character (`*`) instead of specifying an individual AWS S3 bucket. Using
   the wildcard does not mean that all S3 buckets are accessed over a private connection. Only buckets referenced by an external stage that
   is configured for private connectivity can be accessed via the VPC endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
       'com.amazonaws.us-west-2.s3',
       '*.s3.us-west-2.amazonaws.com');
   ```

   This function binds the private endpoint to the hostname, which enables the storage integration to use the private endpoint to connect
   to the storage location.
2. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity (after the other necessary Snowflake objects are enabled for outbound private connectivity).

   You can continue with the next steps while waiting for the `"APPROVED"` status.
3. Restrict access on the S3 bucket to access over the VPC endpoint only by updating the bucket policy with the following.

   ```JSON
   {
     "Sid": "AccesstospecificVPCEonly",
     "Effect": "Deny",
     "Principal": {
       "AWS": "arn:aws:iam::001234567890:role/myrole"
     },
     "Action": "s3:*",
     "Resource": [
       "arn:aws:s3:::mybucket1",
       "arn:aws:s3:::mybucket1/*"
     ],
     "Condition": {
       "StringNotEquals": {
         "aws:SourceVpce": "vpce-01c31eb5f4a1e817d"
       }
     }
   }
   ```
4. Create a storage integration which specifies both the limited `STORAGE_AWS_ROLE_ARN` role and the `USE_PRIVATELINK_ENDPOINT` property:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION outbound_private_link_int
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = 'S3'
     STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
     STORAGE_ALLOWED_LOCATIONS = ('s3://mybucket1/path1/')
     USE_PRIVATELINK_ENDPOINT = TRUE
     ENABLED = TRUE;
   ```

   > **Note:**
   >
   > For information about creating a role for the storage integration, see
   > [Configuring a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md).
5. Create an external stage that references the storage integration:

   ```sqlexample
   CREATE OR REPLACE STAGE my_storage_private_stage
     URL = 's3://mybucket1/path1/'
     STORAGE_INTEGRATION = outbound_private_link_int;
   ```
6. After the private endpoint has an “APPROVED” status, test unloading data from Snowflake to the external stage:

   ```sqlexample
   COPY INTO @my_storage_private_stage
     FROM mytable
     FILE_FORMAT = (FORMAT_NAME = my_csv_format);
   ```
7. View the result in your AWS stage.

## Deprovision an endpoint

If you no longer need the private connectivity endpoint for the external stage, unset the
`USE_PRIVATELINK_ENDPOINT` property on the stage or storage integration, and then call the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

---
title: Private connectivity to external stages for Google Cloud
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-private.md
section: User Guide
---

# Private connectivity to external stages for Google Cloud

This topic describes how to configure outbound private connectivity to an external stage on Google Cloud. The primary
difference between the outbound public connectivity and outbound private connectivity is how you configure the storage integration.
For example, you can specify the USE_PRIVATELINK_ENDPOINT property for the storage integration and then reference this storage
integration in the external stage. The external stage inherits the private endpoint configuration from the storage integration.
Subsequently, your connection to the Google Cloud stage goes through the Google Cloud internal network. By configuring your storage
integration and stage to use outbound private connectivity, you add additional security to your data unloading operations by blocking
public access to the storage account.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations

You can configure outbound public connectivity and outbound private connectivity for the same storage account. If you want to do this,
create a dedicated storage integration for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Limitations

Outbound private connectivity to a Google Cloud stage doesn’t support multi-region buckets.

## Specify private connectivity for a storage integration

To specify private connectivity when creating, replacing, or modifying a storage integration, include the USE_PRIVATELINK_ENDPOINT
property as shown in the following examples. To use private connectivity, set `USE_PRIVATELINK_ENDPOINT = TRUE` for the integration.

Storage integration
:   The following examples shows how you can specify the USE_PRIVATELINK_ENDPOINT property when you create a storage integration that has one or more locations:

    ```sqlsyntax
    CREATE OR REPLACE STORAGE INTEGRATION my_int
      TYPE=EXTERNAL_STAGE
      STORAGE_PROVIDER='gcs'
      STORAGE_ALLOWED_LOCATIONS=('gcs://<bucket>/<prefix>/')
      USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
    ```

    The following example shows how you can modify a storage integration and set the USE_PRIVATELINK_ENDPOINT property:

    ```sqlsyntax
    ALTER STORAGE INTEGRATION my_int
      SET USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
    ```

External stages
:   Updates for USE_PRIVATELINK_ENDPOINT syntax aren’t supported when you create or modify the stage. The following example shows how you must alter the storage
    integration to use the URL of the new or modified stage:

    ```sqlsyntax
    CREATE OR REPLACE STAGE my_gcs_stage
      URL = 'gcs://<bucket>/<prefix>/'
      STORAGE_INTEGRATION=my_int
    ```

## Configure external stage access

These steps are unique to using outbound private connectivity with a storage integration to unload data to an external stage on Google Cloud.

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function.
   Provide as arguments a regional Storage API endpoint and a host name. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'storage.us-east4.rep.googleapis.com',
     'storage.us-east4.rep.googleapis.com');
   ```

   > **Note:**
   >
   > Snowflake supports only Google Cloud regional Storage API endpoints.
   > Google Cloud multi-region buckets aren’t supported.

   Using SYSTEM$PROVISION_PRIVATELINK_ENDPOINT to provision a private endpoint in your Snowflake VNet binds the private endpoint to the host name. This enables the storage integration to connect to your external Google Cloud stage by using private connectivity.
2. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO includes `"status": "APPROVED"`, your connection from Snowflake to your storage account can use private connectivity (after the other necessary Snowflake objects are enabled for outbound private connectivity).

   You can continue with the next steps while awaiting the `"APPROVED"` status.
3. Create a storage integration and be sure to specify TRUE as the value for the USE_PRIVATELINK_ENDPOINT property. For example:

   ```sqlexample
   CREATE OR REPLACE STORAGE INTEGRATION outbound_private_link_int
     TYPE = EXTERNAL_STAGE
     STORAGE_PROVIDER = 'gcs'
     STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/'')
     USE_PRIVATELINK_ENDPOINT = true
     ENABLED = true;
   ```

   For information about creating a role for the storage integration, see [Configure an integration for Google Cloud Storage](data-load-gcs-config.md).
4. Create an external stage that references the storage integration. For example:

   ```sqlexample
   CREATE OR REPLACE STAGE my_gcs_stage
     URL = 'gcs://mybucket1/path1/'
     STORAGE_INTEGRATION = outbound_private_link_int;
   ```
5. After the private endpoint has “APPROVED” status, test unloading data from Snowflake to the external stage. For example:

   ```sqlexample
   COPY INTO @my_gcs_stage
     FROM mytable
     FILE_FORMAT = (FORMAT_NAME = my_csv_format);
   ```
6. View the result in your Google Cloud stage.

## Disable private connectivity

If you no longer require private connectivity for the external stage, you can set the USE_PRIVATELINK_ENDPOINT property on the storage integration
to FALSE, and then call the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function to deprovision the endpoint.
For example:

```sqlexample
USE ROLE ACCOUNTADMIN;

ALTER STORAGE INTEGRATION my_int
  SET USE_PRIVATELINK_ENDPOINT = false;

SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT('storage.us-east4.rep.googleapis.com');
```

---
title: Private connectivity to external volumes for Amazon Web Services
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-s3-private.md
section: User Guide
---

# Private connectivity to external volumes for Amazon Web Services

This topic provides configuration details to set up outbound private connectivity to an external volume on AWS. The primary difference
between the outbound public connectivity and outbound private connectivity is how you set the `USE_PRIVATELINK_ENDPOINT` property for
the external volume.

When the external volume is configured to use private connectivity, your connection to the AWS cloud storage service goes through the
AWS internal network. By configuring your external volume to use outbound private connectivity, you add additional security to your
data-unloading operations by blocking public access to the storage account.

For more information about using external volumes to connect to your external cloud storage for Iceberg tables, see
[Configure an external volume](tables-iceberg-configure-external-volume.md).

> **Note:**
>
> You can use AWS PrivateLink to access Snowflake-managed Iceberg tables and Iceberg tables that use a catalog integration for object
> storage. In addition, you can use AWS PrivateLink to access externally managed Iceberg tables and Iceberg tables created from Delta files
> in object storage.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations

You can configure outbound public connectivity and outbound private connectivity for the same cloud storage service. If you want to do this,
create a dedicated external volume for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Set up outbound private connectivity to an external volume

To set up outbound private connectivity to an external volume, you can use SQL
or use Snowsight.

### Use SQL

#### Syntax updates

The `USE_PRIVATELINK_ENDPOINT` property of an external volume determines whether it is accessed through private connectivity or
by traversing the public network. To use private connectivity, set `USE_PRIVATELINK_ENDPOINT = TRUE` when creating or modifying an external
volume.

The new syntax for CREATE EXTERNAL VOLUME and ALTER EXTERNAL VOLUME is as follows:

```sqlsyntax
CREATE OR REPLACE EXTERNAL VOLUME <ext_volume_name>
  STORAGE_LOCATIONS =
  (
    (
      NAME = 'my-s3-loc'
      STORAGE_PROVIDER = 's3'
      STORAGE_BASE_URL = 's3://<bucket>[/<path>/]'
      STORAGE_AWS_ROLE_ARN = '<iam_role>'
      USE_PRIVATELINK_ENDPOINT = [ TRUE | FALSE ]
    )
  )
  ALLOW_WRITES=true;

ALTER EXTERNAL VOLUME <ext_volume_name>
  UPDATE STORAGE_LOCATION = '<storage_location_name>'
  USE_PRIVATELINK_ENDPOINT = [ TRUE | FALSE ];
```

The [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command includes the `USE_PRIVATELINK_ENDPOINT` property and its value.

#### Configure external volume access

Use the following steps to use outbound private connectivity to unload data to an external volume on AWS:

1. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system
   function to provision a private endpoint in your Snowflake VNet to enable Snowflake to connect to external AWS cloud
   storage over private connectivity.

   As the following example demonstrates, you must use a wildcard character (`*`) instead of specifying an individual AWS S3 bucket. Using
   the wildcard does not mean that all S3 buckets are accessed over a private connection. Only buckets referenced by an external volume that
   has the USE_PRIVATELINK_ENDPOINT parameter enabled can be accessed via the endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
       'com.amazonaws.us-west-2.s3',
       '*.s3.us-west-2.amazonaws.com');
   ```

   This function binds the private endpoint to the hostname, which enables the external volume to use the private endpoint to connect
   to the storage location, as long as AWS PrivateLink is enabled on the object.
2. Call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity.

   You can continue with the next steps while waiting for the `"APPROVED"` status.
3. Create the external volume, being sure to set the `USE_PRIVATELINK_ENDPOINT` property to `TRUE`. For example:

   ```sqlexample
   CREATE EXTERNAL VOLUME external_volume
     STORAGE_LOCATIONS =
       (
         (
           NAME = 'my-s3-loc'
           STORAGE_PROVIDER = 's3'
           STORAGE_BASE_URL = 's3://bucketinuswest2/'
           STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
           USE_PRIVATELINK_ENDPOINT = TRUE
         )
       )
     ALLOW_WRITES=TRUE;
   ```
4. Use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command to create an Iceberg table that references the external volume. For example:

   ```sqlexample
   CREATE ICEBERG TABLE rand_table (data string)
     BASE_LOCATION='table'
     EXTERNAL_VOLUME=external_volume
     CATALOG='snowflake';
   ```
5. After the private endpoint has an `"APPROVED"` status, test unloading data from Snowflake to the external volume.

### Use Snowsight

To set up external volume access using private connectivity in Snowsight, follow these steps:

1. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system
   function to provision a private endpoint in your Snowflake VNet to enable Snowflake to connect to external AWS cloud
   storage over private connectivity.

   As the following example demonstrates, you must use a wildcard character (`*`) instead of specifying an individual AWS S3 bucket. Using
   the wildcard does not mean that all S3 buckets are accessed over a private connection. Only buckets referenced by an external volume that
   has the USE_PRIVATELINK_ENDPOINT parameter enabled can be accessed via the endpoint.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
       'com.amazonaws.us-west-2.s3',
       '*.s3.us-west-2.amazonaws.com');
   ```

   This function binds the private endpoint to the hostname, which enables the external volume to use the private endpoint to connect
   to the storage location, as long as AWS PrivateLink is enabled on the object.
2. Call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity.

   You can continue with the next steps while waiting for the `"APPROVED"` status.
3. Follow the steps to
   [create an external volume for S3 by using Snowsight](tables-iceberg-configure-external-volume-s3.md) and enable private
   connectivity when you configure the external volume.

   > **Important:**
   >
   > To enable private connectivity, on the Configure external volume page, from the Connectivity field, you must select
   > Private (Azure Private Endpoint).
4. Use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command to create an Iceberg table that references the external volume. For example:

   ```sqlexample
   CREATE ICEBERG TABLE rand_table (data string)
     BASE_LOCATION='table'
     EXTERNAL_VOLUME=external_volume
     CATALOG='snowflake';
   ```
5. After the private endpoint has an `"APPROVED"` status, test unloading data from Snowflake to the external volume.

## Deprovision an endpoint

If you no longer need the private connectivity endpoint for the external volume, unset the
`USE_PRIVATELINK_ENDPOINT` property on the external volume, and then call the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

---
title: Private connectivity to external volumes for Google Cloud
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-gcs-private.md
section: User Guide
---

# Private connectivity to external volumes for Google Cloud

This topic describes how to configure outbound private connectivity to an external volume on Google Cloud Storage (GCS). The
primary difference between outbound public connectivity and outbound private connectivity is how you set the USE_PRIVATELINK_ENDPOINT
property for the external volume.

When the external volume is configured to use private connectivity, your connection to the Google Cloud Storage service goes through the
Google Cloud internal network. By configuring your external volume to use outbound private connectivity, you add additional security to your
data-unloading operations by blocking public access to the storage account.

For more information about using external volumes to connect to your external cloud storage for Iceberg tables, see
[Configure an external volume](tables-iceberg-configure-external-volume.md).

> **Note:**
>
> You can use Google Cloud Private Service Connect to access Snowflake-managed Iceberg tables and Iceberg tables that use a catalog
> integration for object storage. In addition, you can use Google Cloud Private Service Connect to access externally managed Iceberg tables and Iceberg tables
> created from Delta files in object storage.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations

You can configure outbound public connectivity and outbound private connectivity for the same cloud storage service. If you want to do this,
create a dedicated external volume for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Limitations

Outbound private connectivity to Google Cloud Storage volumes does not support multi-region buckets.

## Set up outbound private connectivity to an external volume

To set up outbound private connectivity to an external volume, you use SQL or
use Snowsight.

### Use SQL

#### Specify private connectivity for an external volume

The USE_PRIVATELINK_ENDPOINT property of an external volume determines whether it is accessed through private connectivity or
by traversing the public network. To use private connectivity, set `USE_PRIVATELINK_ENDPOINT = TRUE` when creating or modifying an external
volume, as shown in the following examples.

Use the following syntax to create an external volume:

```sqlsyntax
CREATE OR REPLACE EXTERNAL VOLUME <ext_volume_name>
  STORAGE_LOCATIONS =
  (
    (
      NAME = 'my-gcs-loc'
      STORAGE_PROVIDER = 'gcs'
      STORAGE_BASE_URL = 'gcs://<bucket>/<prefix>/'
      USE_PRIVATELINK_ENDPOINT = [ TRUE | FALSE ]
    )
  )
  ALLOW_WRITES=true;
```

Use the following syntax to alter an existing external volume:

```sqlsyntax
ALTER EXTERNAL VOLUME <ext_volume_name>
  UPDATE STORAGE_LOCATION = '<storage_location_name>'
  USE_PRIVATELINK_ENDPOINT = [ TRUE | FALSE ]
```

The [DESCRIBE EXTERNAL VOLUME](../sql-reference/sql/desc-external-volume.md) command includes the USE_PRIVATELINK_ENDPOINT property and its value.

#### Provision a private endpoint

Use the following steps to provision a private endpoint for your Google Cloud Storage volume:

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function.
   Provide as arguments a regional Storage API endpoint and host name. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'storage.us-east4.rep.googleapis.com',
     'storage.us-east4.rep.googleapis.com');
   ```

   > **Note:**
   >
   > Snowflake supports only Google Cloud regional Storage API endpoints.
   > Google Cloud multi-region buckets aren’t supported.

   Using SYSTEM$PROVISION_PRIVATELINK_ENDPOINT to provision a private endpoint in your Snowflake VNet to enable Snowflake to connect to
   external Google Cloud Storage over private connectivity. Only buckets referenced by an external volume that has the USE_PRIVATELINK_ENDPOINT
   property enabled can be accessed using the endpoint.
2. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO includes `"status": "APPROVED"`, your connection from Snowflake to your storage
   account can use private connectivity.

   You can continue with the next steps while awaiting the `"APPROVED"` status.

#### Configure external volume access

Use the following steps to configure private connectivity to your external storage volume:

1. Create the external volume, and set the USE_PRIVATELINK_ENDPOINT property to TRUE. For example:

   ```sqlexample
   CREATE EXTERNAL VOLUME external_volume
     STORAGE_LOCATIONS =
     (
       (
         NAME = 'my-gcs-loc'
         STORAGE_PROVIDER = 'gcs'
         STORAGE_BASE_URL =  'gcs://<bucket>/<prefix>/'
         USE_PRIVATELINK_ENDPOINT = true
       )
     )
     ALLOW_WRITES=true;
   ```
2. Use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command to create an Iceberg table that references the external volume. For example:

   ```sqlexample
   CREATE ICEBERG TABLE rand_table (data STRING)
     BASE_LOCATION='table'
     EXTERNAL_VOLUME=external_volume
     CATALOG='snowflake';
   ```
3. After the private endpoint has “APPROVED” status, test unloading data from Snowflake to the external volume.

### Use Snowsight

To set up external volume access using private connectivity in Snowsight, follow these steps:

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function.
   Provide as arguments a regional Storage API endpoint and host name. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'storage.us-east4.rep.googleapis.com',
     'storage.us-east4.rep.googleapis.com');
   ```

   > **Note:**
   >
   > Snowflake supports only Google Cloud regional Storage API endpoints.
   > Google Cloud multi-region buckets aren’t supported.

   Using SYSTEM$PROVISION_PRIVATELINK_ENDPOINT to provision a private endpoint in your Snowflake VNet to enable Snowflake to connect to
   external Google Cloud Storage over private connectivity. Only buckets referenced by an external volume that has the USE_PRIVATELINK_ENDPOINT
   property enabled can be accessed using the endpoint.
2. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO includes `"status": "APPROVED"`, your connection from Snowflake to your storage
   account can use private connectivity.

   You can continue with the next steps while awaiting the `"APPROVED"` status.
3. Follow the steps to
   [configure an external volume for Google Cloud Storage by using Snowsight](tables-iceberg-configure-external-volume-gcs.md)
   and enable private connectivity when you configure the external volume.

   > **Important:**
   >
   > To enable private connectivity, on the Configure external volume page, from the Connectivity field, you must select
   > Private (Private Service Connect).
4. Use the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table.md) command to create an Iceberg table that references the external volume. For example:

   ```sqlexample
   CREATE ICEBERG TABLE rand_table (data STRING)
     BASE_LOCATION='table'
     EXTERNAL_VOLUME=external_volume
     CATALOG='snowflake';
   ```
5. After the private endpoint has “APPROVED” status, test unloading data from Snowflake to the external volume.

## Disable private connectivity

If you no longer require private connectivity for the external volume, you can set the USE_PRIVATELINK_ENDPOINT property for the volume to
FALSE, and then call the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function tp deprovision the endpoint.
For example:

```sqlexample
ALTER EXTERNAL VOLUME <ext_volume_name>
  UPDATE STORAGE_LOCATION = '<storage_location_name>'
  USE_PRIVATELINK_ENDPOINT = false;

SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT('storage.us-east4.rep.googleapis.com');
```

---
title: Private connectivity to external volumes for Microsoft Azure
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume-azure-private.md
section: User Guide
---

# Private connectivity to external volumes for Microsoft Azure

This topic provides configuration details to set up outbound private connectivity to an external volume on Microsoft Azure. The primary difference
between outbound public connectivity and outbound private connectivity is how you set the `USE_PRIVATELINK_ENDPOINT` property for
the external volume.

When the external volume is configured to use private connectivity, your connection to the Microsoft Azure cloud storage services goes through the
Microsoft Azure internal network. By configuring your external volume to use outbound private connectivity, you add additional security to your
operations by blocking public access to the storage account.

For more information about using external volumes to connect to your external cloud storage for Iceberg tables, see
[Configure an external volume](tables-iceberg-configure-external-volume.md).

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Considerations and limitations

* You can use private connectivity to access Snowflake-managed Iceberg tables and Iceberg tables that use a catalog integration for object
  storage. In addition, you can use private connectivity to access externally managed Iceberg tables and Iceberg tables created from Delta files
  in object storage.
* You can configure outbound public connectivity and outbound private connectivity for the same cloud storage service. If you want to do
  this, create a dedicated external volume for outbound public connectivity and specify `USE_PRIVATELINK_ENDPOINT = FALSE`.

## Set up outbound private connectivity to an external volume

To set up outbound private connectivity to an external volume, you can use SQL or
use Snowsight.

### Use SQL

#### Private connectivity property

The `USE_PRIVATELINK_ENDPOINT` property of an external volume determines whether it is accessed through private connectivity or
by traversing the public network. To use private connectivity, set `USE_PRIVATELINK_ENDPOINT = TRUE` when creating or modifying an external
volume.

#### Configure external volume access using private connectivity

Use the following steps to use outbound private connectivity to unload data to an external volume on Microsoft Azure:

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a
   private connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to your external Microsoft Azure cloud storage services using
   private connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
     'mystorageaccount.blob.core.windows.net',
     'blob'
   );
   ```

   This function binds the private endpoint to the hostname, which enables the external volume to use the private endpoint to connect
   to the storage location.
2. In the Azure Portal and as the owner of the Microsoft Azure storage resource, approve the private endpoint. For details, see the
   [approval process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
3. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity.

   You can continue with the next steps while waiting for the `"APPROVED"` status.
4. Create the external volume, being sure to set the `USE_PRIVATELINK_ENDPOINT` property to `TRUE`:

   ```sqlexample
   CREATE EXTERNAL VOLUME exvol
     STORAGE_LOCATIONS =
       (
         (
           NAME = 'my-azure-northeurope'
           STORAGE_PROVIDER = 'AZURE'
           STORAGE_BASE_URL = 'azure://exampleacct.blob.core.windows.net/my_container_northeurope/'
           AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
           USE_PRIVATELINK_ENDPOINT = TRUE
         )
       );
   ```
5. After the private endpoint has an `"APPROVED"` status, test accessing the external volume with a supported operation.

### Use Snowsight

To set up external volume access through private connectivity in Snowsight, follow these steps:

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a
   private connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to your external Microsoft Azure cloud storage services using
   private connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
     'mystorageaccount.blob.core.windows.net',
     'blob'
   );
   ```

   This function binds the private endpoint to the hostname, which enables the external volume to use the private endpoint to connect
   to the storage location.
2. In the Azure Portal and as the owner of the Microsoft Azure storage resource, approve the private endpoint. For details, see the
   [approval process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
3. In Snowflake, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../sql-reference/functions/system_get_privatelink_endpoints_info.md) function.

   When the output of the function includes `"status": "APPROVED`, your connection from Snowflake to your storage account will be
   able to use private connectivity.

   You can continue with the next steps while waiting for the `"APPROVED"` status.
4. Follow the steps to
   [Configure an external volume for Azure in Snowsight](tables-iceberg-configure-external-volume-azure.md)
   and enable private connectivity when you configure the external volume.

   > **Important:**
   >
   > To enable private connectivity, on the Configure external volume page, from the Connectivity field, you must select
   > Private (Azure Private Endpoint).
5. After the private endpoint has an `"APPROVED"` status, test accessing the external volume with a supported operation.

## Deprovision an endpoint

If you no longer need the private connectivity endpoint for the external volume, unset the
`USE_PRIVATELINK_ENDPOINT` property on the external volume, and then call the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

**Next Topics:**

* [Manage private connectivity endpoints: Azure](private-manage-endpoints-azure.md)

---
title: Process unstructured data with UDF and procedure handlers
source: https://docs.snowflake.com/en/user-guide/unstructured-data-java.md
section: User Guide
---

# Process unstructured data with UDF and procedure handlers

This topic provides examples of reading and processing unstructured data in staged files with handler code written for the following:

* [Java user-defined functions (UDFs)](../developer-guide/udf/java/udf-java-introduction.md)
* [Java user-defined table functions (UDTFs)](../developer-guide/udf/java/udf-java-tabular-functions.md)
* [Java procedures](../developer-guide/stored-procedure/java/procedure-java-overview.md)

You can also read a file with handlers written in other languages:

Python:
:   * [Python UDFs](../developer-guide/udf/python/udf-python-examples.md)
    * [Python procedures](../developer-guide/stored-procedure/python/procedure-python-read-files.md)

Scala:
:   * [Scala UDFs](../developer-guide/udf/scala/udf-scala-examples.md)
    * [Scala procedures](../developer-guide/stored-procedure/scala/procedure-scala-read-files.md)

> **Note:**
>
> To make your code resilient to file injection attacks, always use a scoped URL when passing a file’s location to a UDF, particularly
> when the function’s caller is not also its owner. You can create a scoped URL in SQL using the built-in function
> BUILD_SCOPED_FILE_URL. For more information about what the BUILD_SCOPED_FILE_URL does, see
> [Introduction to unstructured data](unstructured-intro.md).

## Process a PDF with a UDF and procedure

The examples in this section process staged unstructured files using Java handler code – first with a UDF, then with a procedure. Both
handlers extract the contents of a specified PDF file using the [Apache PDFBox library](https://pdfbox.apache.org/).

The handler code is very similar between the UDF and procedure. They differ in how the read the incoming PDF file.

* In the UDF, the handler reads the file using a Java `InputStream`.
* In the procedure, the handler reads the file using a Snowflake `SnowflakeFile`.

The examples use in-line handler code (as opposed to compiled in a staged JAR), which means that you do not need to compile,
package, and upload the handler code to a stage. For more information on the difference between in-line and staged handlers, see
[Keeping handler code in-line or on a stage](../developer-guide/inline-or-staged.md).

### Download the PDFBox library

Before you begin writing the UDF, download the PDFBox library JAR file if you don’t have it already. It will be a dependency for your
handler code. You’ll later upload the library JAR file to a stage.

Download the latest released version of the library from the
[Apache PDFBox library download page](https://pdfbox.apache.org/download.html).

### Create stages

Create stages in which to keep your handler code’s dependency libraries and the data file the handler code will read.

Using the code below, you’ll create separate internal stages to hold:

* A library JAR file that’s a dependency for your handler. You’ll reference the stage and JAR file from the UDF.
* A data file that your handler code will read.

Code in the following example uses the [CREATE STAGE](../sql-reference/sql/create-stage.md) command to create the stages you’ll need.

```sqlexample
-- Create an internal stage to store the JAR files.
CREATE OR REPLACE STAGE jars_stage;

-- Create an internal stage to store the data files. The stage includes a directory table.
CREATE OR REPLACE STAGE data_stage DIRECTORY=(ENABLE=TRUE) ENCRYPTION = (TYPE='SNOWFLAKE_SSE');
```

### Upload the required library and the PDF file to read

Complete the following steps to upload the dependency JAR file (with the library code that processes the PDF) and the data file (the PDF
file the handler code will process).

You can use the PDF file of your choosing in this example.

1. Copy the JAR file for Apache PDFBox from the local temporary directory to the stage that stores JAR files:

   Linux/Mac:
   :   ```sqlexample
       PUT file:///tmp/pdfbox-app-2.0.27.jar @jars_stage AUTO_COMPRESS=FALSE;
       ```

   Windows:
   :   ```sqlexample
       PUT file://C:\temp\pdfbox-app-2.0.27.jar @jars_stage AUTO_COMPRESS=FALSE;
       ```
2. Copy the PDF file from the local temporary directory to the stage that stores data files:

   Linux/Mac:
   :   ```sqlexample
       PUT file:///tmp/myfile.pdf @data_stage AUTO_COMPRESS=FALSE;
       ```

   Windows:
   :   ```sqlexample
       PUT file://C:\temp\myfile.pdf @data_stage AUTO_COMPRESS=FALSE;
       ```

### Create and call the UDF

Complete the following steps to create a UDF that reads and processes PDF files.

1. Paste and run the following code to create a UDF.

   This UDF’s handler parses PDF documents and retrieves their content. The handler uses the `InputStream` class to read the file.
   For more on reading files with `InputStream`, refer to [Reading a dynamically-specified file with InputStream](../developer-guide/udf/java/udf-java-cookbook.md).

   ```sqlexample
   CREATE FUNCTION process_pdf_func(file STRING)
   RETURNS STRING
   LANGUAGE JAVA
   RUNTIME_VERSION = 11
   IMPORTS = ('@jars_stage/pdfbox-app-2.0.27.jar')
   HANDLER = 'PdfParser.readFile'
   AS
   $$
   import org.apache.pdfbox.pdmodel.PDDocument;
   import org.apache.pdfbox.text.PDFTextStripper;
   import org.apache.pdfbox.text.PDFTextStripperByArea;

   import java.io.File;
   import java.io.FileInputStream;
   import java.io.IOException;
   import java.io.InputStream;

   public class PdfParser {

       public static String readFile(InputStream stream) throws IOException {
           try (PDDocument document = PDDocument.load(stream)) {

               document.getClass();

               if (!document.isEncrypted()) {

                   PDFTextStripperByArea stripper = new PDFTextStripperByArea();
                   stripper.setSortByPosition(true);

                   PDFTextStripper tStripper = new PDFTextStripper();

                   String pdfFileInText = tStripper.getText(document);
                   return pdfFileInText;
               }
           }
           return null;
       }
   }
   $$;
   ```
2. Refresh the directory table for the `data_stage` stage with the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command:

   ```sqlexample
   ALTER STAGE data_stage REFRESH;
   ```
3. Call the UDF to read the staged PDF file and extract the content.

   Code in the following example calls the UDF, passing a scoped URL to make the code resilient to file injection attacks. Always use a
   scoped URL when the function’s caller is not also its owner. You can pass the URL argument as a scoped URL or another form when the UDF’s
   caller is also its owner.

   ```sqlexample
   SELECT process_pdf_func(BUILD_SCOPED_FILE_URL('@data_stage', '/myfile.pdf'));
   ```

### Create and call the procedure

Complete the following steps to create a procedure that reads and processes PDF files.

1. Paste and run the following code to create a procedure.

   This procedure’s handler parses PDF documents and retrieves their content. The handler uses the `SnowflakeFile` class to read the
   file. For more on reading files with `SnowflakeFile`, refer to [Reading a dynamically-specified file with SnowflakeFile](../developer-guide/stored-procedure/java/procedure-java-read-files.md).

   ```sqlexample
   CREATE PROCEDURE process_pdf_proc(file STRING)
   RETURNS STRING
   LANGUAGE JAVA
   RUNTIME_VERSION = 11
   IMPORTS = ('@jars_stage/pdfbox-app-2.0.28.jar')
   HANDLER = 'PdfParser.readFile'
   PACKAGES = ('com.snowflake:snowpark:latest')
   AS
   $$
   import org.apache.pdfbox.pdmodel.PDDocument;
   import org.apache.pdfbox.text.PDFTextStripper;
   import org.apache.pdfbox.text.PDFTextStripperByArea;
   import com.snowflake.snowpark_java.types.SnowflakeFile;
   import com.snowflake.snowpark_java.Session;

   import java.io.File;
   import java.io.FileInputStream;
   import java.io.IOException;
   import java.io.InputStream;

   public class PdfParser {

       public static String readFile(Session session, String fileURL) throws IOException {
           SnowflakeFile file = SnowflakeFile.newInstance(fileURL);
           try (PDDocument document = PDDocument.load(file.getInputStream())) {

               document.getClass();

               if (!document.isEncrypted()) {

                   PDFTextStripperByArea stripper = new PDFTextStripperByArea();
                   stripper.setSortByPosition(true);

                   PDFTextStripper tStripper = new PDFTextStripper();

                   String pdfFileInText = tStripper.getText(document);
                   return pdfFileInText;
               }
           }

           return null;
       }
   }
   $$;
   ```
2. Refresh the directory table for the `data_stage` stage with the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command:

   ```sqlexample
   ALTER STAGE data_stage REFRESH;
   ```
3. Call the procedure to read the staged PDF file and extract the content.

   Code in the following example passes a scoped URL pointing to the PDF file on the stage you created.

   ```sqlexample
   CALL process_pdf_proc(BUILD_SCOPED_FILE_URL('@data_stage', '/UsingThird-PartyPackages.pdf'));
   ```

## Process a CSV with a UDTF

The example in this section extracts and returns data from staged files using Java UDTFs.

### Create data stage

Create a stage using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command:

The following SQL statement creates an internal stages to store the data files for the example:

```sqlexample
-- Create an internal stage to store the data files. The stage includes a directory table.
CREATE OR REPLACE STAGE data_stage DIRECTORY=(ENABLE=TRUE) ENCRYPTION = (TYPE='SNOWFLAKE_SSE');
```

### Upload the CSV file to read

Copy the CSV file from the local temporary directory to the stage that stores data files:

Linux/Mac:
:   ```sqlexample
    PUT file:///tmp/sample.csv @data_stage AUTO_COMPRESS=FALSE;
    ```

Windows:
:   ```sqlexample
    PUT file://C:\temp\sample.csv @data_stage AUTO_COMPRESS=FALSE;
    ```

### Create and call the UDTF

This example extracts the contents of a specified set of CSV files and returns the rows in a table. By processing file data as it’s read
from the source, you can avoid potential out-of-memory errors that might arise when the file is very large.

Code in the following UDTF handler example uses `SnowflakeFile` to generate an `InputStream` from a file URL to read a CSV
file. (In a Java UDTF handler, row processing begins when Snowflake calls the `process` method you implement.) The code uses the
stream when constructing an instance of a `CsvStreamingReader` class defined in the handler itself.

The `CsvStreamingReader` class reads the contents of the received CSV file stream row by row, providing a way for other code to
retrieve each row as a record where commas delimit columns. The `process` method returns each record as it is read from the stream.

For more about writing tabular user-defined functions (UDTFs) with a Java handler, see
[Tabular Java UDFs (UDTFs)](../developer-guide/udf/java/udf-java-tabular-functions.md).

Complete the following steps to create the Java UDTF and upload the required files:

1. Create a Java UDTF that uses the `SnowflakeFile` class:

   ```sqlexample
   CREATE OR REPLACE FUNCTION parse_csv(file STRING)
   RETURNS TABLE (col1 STRING, col2 STRING, col3 STRING )
   LANGUAGE JAVA
   HANDLER = 'CsvParser'
   AS
   $$
   import org.xml.sax.SAXException;

   import java.io.*;
   import java.util.ArrayList;
   import java.util.List;
   import java.util.stream.Stream;
   import com.snowflake.snowpark_java.types.SnowflakeFile;

   public class CsvParser {

     static class Record {
       public String col1;
       public String col2;
       public String col3;

       public Record(String col1_value, String col2_value, String col3_value)
       {
         col1 = col1_value;
         col2 = col2_value;
         col3 = col3_value;
       }
     }

     public static Class getOutputClass() {
       return Record.class;
     }

     static class CsvStreamingReader {
       private final BufferedReader csvReader;

       public CsvStreamingReader(InputStream is) {
         this.csvReader = new BufferedReader(new InputStreamReader(is));
       }

       public void close() {
         try {
           this.csvReader.close();
         } catch (IOException e) {
           e.printStackTrace();
         }
       }

       Record getNextRecord() {
         String csvRecord;

         try {
           if ((csvRecord = csvReader.readLine()) != null) {
             String[] columns = csvRecord.split(",", 3);
             return new Record(columns[0], columns[1], columns[2]);
           }
         } catch (IOException e) {
           throw new RuntimeException("Reading CSV failed.", e);
         } finally {
           // No more records, we can close the reader.
           close();
         }

         // Return null to indicate the end of the stream.
         return null;
       }
     }

     public Stream<Record> process(String file_url) throws IOException {
       SnowflakeFile file = SnowflakeFile.newInstance(file_url);

       CsvStreamingReader csvReader = new CsvStreamingReader(file.getInputStream());
       return Stream.generate(csvReader::getNextRecord);
     }
   }
   $$
   ;
   ```
2. Refresh the directory table for the `data_stage` stage:

   ```sqlexample
   ALTER STAGE data_stage REFRESH;
   ```
3. Call the Java UDTF to read one or more staged CSV files and extract the contents in a table format:

   Code in the following example calls the UDF, passing a scoped URL to reduce the risk of file injection attacks. Always used a scoped
   URL when the function’s caller is not also its owner. You can pass the URL argument as a scoped URL or another supported form when the
   UDF’s caller is also its owner.

   > ```sqlexample
   > -- Input a file URL.
   > SELECT * FROM TABLE(PARSE_CSV(BUILD_SCOPED_FILE_URL(@data_stage, 'sample.csv')));
   > ```

---
title: Programmatically work with cost anomalies
source: https://docs.snowflake.com/en/user-guide/cost-anomalies-class.md
section: User Guide
---

# Programmatically work with cost anomalies

You can use the ANOMALY_INSIGHTS [class](../sql-reference/snowflake-db-classes.md) to programmatically identify and investigate cost
anomalies. The fully qualified instance that you use to work with anomalies is SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS.

You must have the [required privileges](cost-anomalies-access-control.md) to run the
[class methods](../sql-reference/classes/anomaly_insights.md).

For an overview of cost anomalies, see [Introduction to cost anomalies](cost-anomalies.md).

## Identify cost anomalies with ANOMALY_INSIGHTS

Snowflake creates an instance of the ANOMALY_INSIGHTS class that you can use to programmatically identify cost anomalies. The
[ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA](../sql-reference/classes/anomaly-insights/methods/get_daily_consumption_anomaly_data.md) method returns consumption data for an account or
organization along with a boolean value that indicates whether that consumption is a cost anomaly.

### Identify organization-level cost anomalies

Users call the GET_DAILY_CONSUMPTION_ANOMALY_DATA method from the organization account or an ORGADMIN-enabled account to identify
[organization-level cost anomalies](cost-anomalies.md). To focus on organization-level cost anomalies, the user passes NULL as
an argument instead of the name of an account.

Example: Organization-level cost anomaly
:   To identify organization-level cost anomalies between January 1, 2024, and March 31, 2024, do the following:

    1. Sign in to the [organization account](organization-accounts.md) or an
       [ORGADMIN-enabled account](organization-administrators.md).
    2. Call the method:

       ```sqlexample
       CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
         '2024-01-01', '2024-03-31', NULL);
       ```
    3. In the output, find days where the value of the `is_anomaly` column is `TRUE`.

### Identify account-level cost anomalies

You can use the GET_DAILY_CONSUMPTION_ANOMALY_DATA method to identify account-level cost anomalies for the current account or, if you are
signed in to the organization account or an ORGADMIN-enabled account, any account in the organization.

Example: Cost anomalies in the current account
:   To identify cost anomalies in the current account between January 1, 2024, and March 31, 2024, call the following method when signed in
    to the account.

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
      '2024-01-01', '2024-03-31', CURRENT_ACCOUNT_NAME() );
    ```

    To use the output to identify the cost anomalies, look for the days where the value of the `is_anomaly` column is `TRUE`.

Example: Cost anomalies in a different account
:   If you are signed in to the organization account or an ORGADMIN-enabled account, and want to identify cost anomalies in a different
    account, specify the name of the account when you call the GET_DAILY_CONSUMPTION_ANOMALY_DATA method.

    For example, suppose you are signed in to the organization account `my_orgacct`. You can identify cost anomalies in the account
    `prod_acct` between November 1, 2024, and December 31, 2024 by executing the following command:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
      '2024-11-01', '2024-12-31', 'prod_acct');
    ```

    To use the output to identify the cost anomalies, look for the days where the value of the `is_anomaly` column is `TRUE`.

## Investigate cost anomalies with ANOMALY_INSIGHTS

The ANOMALY_INSIGHTS class provides methods that you can use to investigate why a cost anomaly occurred. These methods allow you to drill
down into the following:

* Account-level consumption
* Warehouse-level consumption
* Query-level consumption
* Hourly consumption by service type

### Account-level consumption

Call the [ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION](../sql-reference/classes/anomaly-insights/methods/get_top_accounts_by_consumption.md) method to retrieve a list of accounts with
the highest change in consumption on a given day. Change in consumption is determined by comparing the consumption on a specified day with
consumption on the previous day. This is useful to investigate organization-level cost anomalies.

For example, if you are an administrator who wants to know the top five accounts in terms of change in consumption when comparing
December 14, 2024, and December 15, 2024, execute the following from the organization account or an ORGADMIN-enabled account:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION('2024-12-15', 5);
```

### Warehouse-level consumption

Call the [ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE](../sql-reference/classes/anomaly-insights/methods/get_top_warehouses_on_date.md) method to retrieve a list of warehouses with the
highest change in consumption on a given day. Change in consumption is determined by comparing the consumption of a warehouse on a specified
day with consumption on the previous day. You can focus on the top warehouses within a specific account or identify top warehouses across
the organization.

Example: Identify top warehouses in the organization
:   To find the top six warehouses in the organization in terms of change in consumption when comparing August 9, 2024, and August 10, 2024,
    sign in to the organization account or an ORGADMIN-enabled account and execute the following:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
      '2024-08-10', 6, NULL);
    ```

Example: Identify top warehouses in current account
:   To find the top five warehouses in the current account in terms of change in consumption when comparing December 8, 2024, and December 9,
    2024, execute the following:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
      '2024-12-09', 5, CURRENT_ACCOUNT_NAME());
    ```

Example: Identify top warehouses in a different account
:   To find the top three warehouses in the account `my_acct` in terms of change in consumption when comparing November 8, 2024, and November 9,
    2024, sign in to the organization account or an ORGADMIN-enabled account and execute the following:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
      '2024-11-09', 5, 'my_acct');
    ```

### Query-level consumption

Call the [ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE](../sql-reference/classes/anomaly-insights/methods/get_top_queries_from_warehouse.md) method to retrieve a list of queries that ran
on a specific warehouse so you can identify which queries resulted in high consumption. The returned queries are listed in the order of
consumption, from highest to lowest.

You use a Warehouse ID to specify which warehouse you are investigating. You can find the Warehouse ID by calling the
[ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE](../sql-reference/classes/anomaly-insights/methods/get_top_warehouses_on_date.md) method or querying the
[WAREHOUSE_METERING_HISTORY view](../sql-reference/account-usage/warehouse_metering_history.md).

For example, to investigate consumption of a warehouse whose Warehouse ID is `838`, execute the following to list the top six queries that
consumed the most credits on December 1, 2024:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE(838, '2024-12-01', 6);
```

### Hourly consumption by service type

Call the [ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE](../sql-reference/classes/anomaly-insights/methods/get_hourly_consumption_by_service_type.md) method to retrieve the hourly
consumption for a given day, broken down by service type. This allows you to see which service types (for example, `AI_SERVICES`)
are contributing to your consumption during each hour of the day. You can only retrieve data for the account that you are currently
signed in to.

You can specify the number of top service types to return. If you specify `NULL` instead of a number, the method returns all service types
that had non-zero consumption on the specified day.

Example: Top 5 service types
:   To return the hourly consumption on January 15, 2026, broken down by the five services that had the most consumption, run the following:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE(
      '2026-01-15',
      5);
    ```

Example: All service types
:   To return the hourly consumption on January 15, 2026, for all service types, run the following:

    ```sqlexample
    CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE(
      '2026-01-15',
      NULL);
    ```

---
title: Projection policies
source: https://docs.snowflake.com/en/user-guide/projection-policies.md
section: User Guide
---

# Projection policies

This topic shows how to use projection policies to allow or prevent column projection in the final output of a SQL query result.

## Overview

A projection policy is a first-class, schema-level object that defines whether a column can be projected in the output of a SQL query
result. A column with a projection policy assigned to it is said to be *projection constrained*. Projection policies can be used to
constrain sensitive or private information (for example, name or phone number) when [sharing data securely](data-sharing-gs.md) between partners.

However, note that columns that are hidden by projection policies can still be used in inner queries or in WHERE clauses, which can
disclose information about a given field. For details, see the Considerations section (in this topic).

After creating the projection policy, a policy administrator can assign the projection policy to a column. A column can only have one
projection policy assigned to it at any given time. A user can project the column only if their active role matches a projection policy
condition that allows the column to be projected.

Note that a projection constrained column can also be protected by a masking policy and the table containing the projection constrained
column can be protected by a row access policy. For more details, see Masking & row access policies (in this topic).

Projection policies affect only columns visible in the final results table. So a user running the following query would see NULL as indicated when column “protected_C” denies projection:

```sqlexample
SELECT protected_C,                         -- NULL, in outer query
       my_func(protected_C),                -- Any functions on NULL returns NULL
       nonprotected_C
       FROM (SELECT protected_C,            -- Projection policies are ignored in a nested query.
                    nonprotected_C from T)
WHERE protected_C > 5;                      -- Not shown in results, so this works
```

### Column usage

Snowflake tracks column usage. Indirect references to a column, such as a view definition,
UDF (in this topic), and common table expression, impact column projection when a projection
policy is set on a column.

When a projection policy is set on the column and the column cannot be projected, the column:

* Is not included in the output of a query result.
* Cannot be inserted into another table.
* Cannot be an argument for an external function or stored procedure.

### Limitations

* For limitations regarding user-defined functions (UDFs), see User-defined functions (UDFs) (in this topic).
* A projection policy cannot be applied to:

  + A tag assigned to a table or column (that is, a tag-based projection policy).
  + A virtual column or to the VALUE column in an external table. As a workaround, create a view and assign a projection policy to each
    column that should not be projected.
  + The `value_column` in a [PIVOT](../sql-reference/constructs/pivot.md) construct. For related details, see
    UNPIVOT (in this topic).
* A projection policy `body` cannot reference a column protected by a masking policy or a table protected by a row access
  policy. For additional details, see Masking & row access policies (in this topic).

### Considerations

Use projection policies when the use case calls for querying a sensitive column without directly exposing the column value to an analyst or
similar role. The column value within a projection constrained column can be analyzed with greater flexibility than a masked or tokenized
value. However, consider the following prior to setting a projection policy on a column:

* A projection policy does not prevent the targeting of an individual.

  For example, a user can filter rows where the `name` column corresponds to a particular individual, even if the column is
  projection constrained. However, the user cannot run a SELECT statement to view names of the individuals in the table.
* When a projection constrained column is the join key for a query that combines data from the protected table with data from an unprotected
  table, nothing prevents the user from projecting values from the column in the unprotected table. As a result, if a value in the
  unprotected table matches a value in the protected column, the user can obtain that value by projecting it from the unprotected table.

  For example, suppose a projection policy was assigned to the `email` column of the `t_protected` table. A user can still ascertain
  values in the `t_protected.email` column by executing:

  > ```sqlexample
  > SELECT t_unprotected.email
  >   FROM t_unprotected JOIN t_protected ON t_unprotected.email = t_protected.email;
  > ```
* A projection constraint does not guarantee that a malicious actor could not use deliberate queries to obtain potentially sensitive data
  from a projection-constrained column. Projection policies are best suited for use with partners and customers with whom you have an
  existing level of trust. In addition, providers should be vigilant about potential misuses of their data (e.g. reviewing the access
  history for their listings).
* In rare instances, an error message for a query containing a projection-constrained column can contain a single value from the column.
* For all of these reasons, if you need to prevent leakage about a specific column or entity, you should omit the column entirely from
  your data, or employ [differential privacy](diff-privacy/differential-privacy-overview.md).

## Create a projection policy

A projection policy contains a `body` that calls the PROJECTION_CONSTRAINT function to determine whether to project
a column.

> ```sqlsyntax
> CREATE OR REPLACE PROJECTION POLICY <name>
>   AS () RETURNS PROJECTION_CONSTRAINT -> <body>
> ```
>
> Where:
>
> * `name` specifies the name of the policy.
> * `AS () RETURNS PROJECTION_CONSTRAINT` is the signature and return type of the policy. The signature does not accept any
>   arguments and the return type is PROJECTION_CONSTRAINT, which is an internal data type. All projection policies have the same
>   signature and return type.
> * `body` is a SQL expression that determines whether to project the column. This can include CASE and other valid SQL
>   statements, and can also include SELECT clauses that evaluate to TRUE or FALSE. **Do not return NULL to disallow projection.** You
>   must return the PROJECTION_CONSTRAINT function with values specifying whether to allow projection of the specified column,
>   and how to treat queries that request that column. See [CREATE PROJECTION POLICY](../sql-reference/sql/create-projection-policy.md) to learn the syntax.

### Example policies

The simplest projection policies call the PROJECTION_CONSTRAINT function directly:

Allow column projection
:   ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY mypolicy
    AS () RETURNS PROJECTION_CONSTRAINT ->
    PROJECTION_CONSTRAINT(ALLOW => true);
    ```

Prevent column projection
:   ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY mypolicy
    AS () RETURNS PROJECTION_CONSTRAINT ->
    PROJECTION_CONSTRAINT(ALLOW => false);
    ```

Prevent column projection to specific roles
:   More complicated SQL expressions can be written to call the PROJECTION_CONSTRAINT function. The expression can use
    [Conditional expression functions](../sql-reference/expressions-conditional.md) and [Context functions](../sql-reference/functions-context.md) to introduce logic to allow certain users with a
    particular role to project a column and prevent all other users from projecting a column.

    > **Tip:**
    >
    > You can use the following strategies when using context functions in a conditional policy:
    >
    > * Context functions return strings, so comparisons using them are case-sensitive. You can use
    >   [LOWER](../sql-reference/functions/lower.md) to convert strings to all lowercase if you’d like to do a case-insensitive comparison.
    > * The [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function helps you evaluate whether a policy body is returning the correct value
    >   when a context function returns a certain value. The POLICY_CONTEXT function simulates query results based upon a specified value of
    >   one or more context functions.

    The following example includes a [CASE](../sql-reference/functions/case.md) expression and [CURRENT_ROLE](../sql-reference/functions/current_role.md) context
    function to create a conditional policy that allows only users with the `analyst` custom role to project a column:

    ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY mypolicy
    AS () RETURNS PROJECTION_CONSTRAINT ->
    CASE
      WHEN CURRENT_ROLE() = 'ANALYST'
        THEN PROJECTION_CONSTRAINT(ALLOW => true)
      ELSE PROJECTION_CONSTRAINT(ALLOW => false)
    END;
    ```

    The next example allows users with the `analyst` role to access the column, but anyone else will see only NULL values for that column or
    any column that derives from that column.

    ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY mypolicy
    AS () RETURNS PROJECTION_CONSTRAINT ->
    CASE
      WHEN CURRENT_ROLE() = 'ANALYST'
        THEN PROJECTION_CONSTRAINT(ALLOW => true)
      ELSE PROJECTION_CONSTRAINT(ALLOW => false, ENFORCEMENT => 'NULLIFY')
    END;
    ```

Using tags in projection policies:
:   The following example uses the [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../sql-reference/functions/system_get_tag_on_current_column.md) function so that a tag that is assigned to
    a column determines whether the column can be projected. In this case, when the policy is assigned to a column, the value of the
    `tags.accounting_col` tag on that column must be `public` in order to project the column.

    ```sqlexample
    CREATE PROJECTION POLICY mypolicy
    AS () RETURNS PROJECTION_CONSTRAINT ->
    CASE
      WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_col') = 'public'
        THEN PROJECTION_CONSTRAINT(ALLOW => true)
      ELSE PROJECTION_CONSTRAINT(ALLOW => false)
    END;
    ```

For data sharing use cases, the provider can write a projection policy to constrain column projection for all consumer accounts using the
[CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) context function, or selectively restrict column projection in specific shares using the
[INVOKER_SHARE](../sql-reference/functions/invoker_share.md) context function. For example:

Restrict all consumer accounts
:   In this example, `provider.account` is the [account identifier](admin-account-identifier.md) in the account name format:

    ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY restrict_consumer_accounts
    AS () RETURNS PROJECTION_CONSTRAINT ->
    CASE
      WHEN CURRENT_ACCOUNT() = 'provider.account'
        THEN PROJECTION_CONSTRAINT(ALLOW => true)
      ELSE PROJECTION_CONSTRAINT(ALLOW => false)
    END;
    ```

Restrict to specific shares
:   Consider a data sharing provider account that has a projection policy set on a column of a secure view. There are two different shares
    (`SHARE1` and `SHARE2`) that can access the secure view to support two different data sharing consumers.

    If a user in the data sharing consumer account attempts to project the column through either share they can project the column,
    otherwise the column cannot be projected:

    ```sqlexample
    CREATE OR REPLACE PROJECTION POLICY projection_share
    AS () RETURNS PROJECTION_CONSTRAINT ->
    CASE
      WHEN INVOKER_SHARE() IN ('SHARE1', 'SHARE2')
        THEN PROJECTION_CONSTRAINT(ALLOW => true)
      ELSE PROJECTION_CONSTRAINT(ALLOW => false)
    END;
    ```

Query a separate table to determine the projection policy
:   You can use a SELECT query in your policy logic to help determine whether to allow or block projection. If you query a table (a
    *mapping table*) in this way, we recommend puting the mapping table in the same database as the protected table. This is particularly
    important if the `body` section calls [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md).

    Here is an extended example of creating and populating a simple mapping table of role names and projection permission, and then querying
    that table to determine whether a column can be projected to the current user according to their role.

    ```sqlexample
    -- Create mapping table with two columns: role name, whether that role can project the column
    CREATE OR REPLACE TABLE roles_with_access(role string, allowed boolean)
    AS SELECT * FROM VALUES ('ACCOUNTADMIN', true), ('RANDOM_ROLE', false);

    -- Create a policy that queries the mapping table, and allows projection when current
    -- user role has an `allowed` value of TRUE.
    -- Note that the logic is written to default to FALSE in all other cases, including the
    -- current role not being in the queried table.
    CREATE OR REPLACE PROJECTION POLICY pp AS () RETURNS projection_constraint ->
      CASE WHEN
        exists(
          SELECT 1 FROM roles_with_access WHERE role = current_role() AND allowed = true
        ) THEN projection_constraint(ALLOW=>true)
      ELSE projection_constraint(ALLOW=>false) END;

    -- Create a new table with the policy and query it in one step.
    CREATE OR REPLACE TABLE t(user string, address string WITH PROJECTION POLICY pp)
      AS SELECT * FROM VALUES ('Carson', 'CA'), ('Emily', 'NY'), ('John', 'NV');

    -- Succeeds
    USE ROLE ACCOUNTADMIN;
    SELECT * FROM t;

    -- Fails with projection policy error on column ADDRESS
    USE ROLE any_other_role;
    SELECT * FROM t;
    ```

## Assign a projection policy

A projection policy is applied to a table column using an [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) command and a view column using an
[ALTER VIEW](../sql-reference/sql/alter-view.md) command. Each column supports only one projection policy.

> ```sqlsyntax
> ALTER { TABLE | VIEW } <name>
> { ALTER | MODIFY } COLUMN <col1_name>
> SET PROJECTION POLICY <policy_name> [ FORCE ]
> [ , <col2_name> SET PROJECTION POLICY <policy_name> [ FORCE ] ... ]
> ```

Where:

* `name` specifies the name of the table or view.
* `col1_name` specifies the name of the column in the table or view.
* `col2_name` specifies the name of an additional column in the table or view.
* `policy_name` specifies the name of the projection policy set on the column.
* `FORCE` is an optional parameter that allows the command to assign the projection policy to a column that already has a projection
  policy assigned to it. The new projection policy atomically replaces the existing one.

For example, to set a projection policy `proj_policy_acctnumber` on the `account_number` column of a table:

> ```sqlexample
> ALTER TABLE finance.accounting.customers
>  MODIFY COLUMN account_number
>  SET PROJECTION POLICY proj_policy_acctnumber;
> ```

You can also use the WITH clause of the [CREATE TABLE](../sql-reference/sql/create-table.md) and [CREATE VIEW](../sql-reference/sql/create-view.md) commands to assign
a projection policy to a column when the table or view is created. For example, to assign the policy `my_proj_policy` to the
`account_number` column of a new table, execute:

> ```sqlexample
> CREATE TABLE t1 (account_number NUMBER WITH PROJECTION POLICY my_proj_policy);
> ```

You can also use the WITH clause when adding a new column to an existing table. For example, to assign the policy `my_proj_policy` to the
`zipcode` column, which is being added to the existing table `customers`, execute:

> ```sqlexample
> ALTER TABLE customers ADD COLUMN account_number NUMBER WITH PROJECTION POLICY my_proj_policy;
> ```

### Replace a projection policy

The recommended method of replacing a projection policy is to use the `FORCE` parameter to detach the existing projection policy and
assign the new one in a single command. This allows you to atomically replace the old policy, leaving no gap in protection.

For example, to assign a new projection policy to a column that is already projection-constrained:

```sqlexample
ALTER TABLE finance.accounting.customers
  MODIFY COLUMN account_number
  SET PROJECTION POLICY proj_policy2 FORCE;
```

You can also detach the projection policy from a column in one statement (… UNSET PROJECTION POLICY) and then set a new policy on the
column in a different statement (… SET PROJECTION POLICY <name>). If you choose this method, the column is not protected by a projection policy
in between detaching one policy and assigning another. A query could potentially access sensitive data during this time.

## Detach a projection policy

Use the UNSET PROJECTION POLICY clause of an ALTER TABLE or ALTER VIEW command to detach a projection policy from the column of a table or
view. The name of the projection policy is not required because a column cannot have more than one projection policy attached.

> ```sqlsyntax
> ALTER { TABLE | VIEW } <name>
> { ALTER | MODIFY } COLUMN <col1_name>
> UNSET PROJECTION POLICY
> [ , <col2_name> UNSET PROJECTION POLICY ... ]
> ```

Where:

* `name` specifies the name of the table or view.
* `col1_name` specifies the name of the column in the table or view.
* `col2_name` specifies the name of an additional column in the table or view.

For example, to remove the projection policy from the `account_number` column:

> ```sqlexample
> ALTER TABLE finance.accounting.customers
>  MODIFY COLUMN account_number
>  UNSET PROJECTION POLICY;
> ```

## View projection policies with Snowsight

To determine whether a column has a projection policy, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then navigate to the table that contains the column.
3. Select the Columns tab.
4. Use the Policy column to determine if the column has any data governance policies.
5. Hover over each policy to determine whether it’s a projection policy.

   If it is a projection policy, you can also determine whether the projection policy prevents the query from executing or returns NULL
   values in the output. If the body of the policy is complex and behaves differently under different conditions, Snowsight
   displays the contents of the body instead of simply stating whether the query fails or returns NULL values.

## Monitor projection policies with SQL

It can be helpful to think of two general approaches to determine how to monitor projection policy usage.

* Discover projection policies
* Identify projection policy references

### Discover projection policies

You can use the [PROJECTION_POLICIES](../sql-reference/account-usage/projection_policies.md) view in the Account Usage schema of the shared
SNOWFLAKE database. This view is a *catalog* for all projection policies in your Snowflake account. For example:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.PROJECTION_POLICIES
> ORDER BY POLICY_NAME;
> ```

### Identify projection policy references

The [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) Information Schema table function can identify projection policy references. There
are two different syntax options:

1. Return a row for each object (that is, table or view) that has the specified projection policy set on a column:

   ```sqlexample
   USE DATABASE my_db;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(policy_name => 'my_db.my_schema.projpolicy'));
   ```
2. Return a row for each policy assigned to the table named `my_table`:

   ```sqlexample
   USE DATABASE my_db;
   USE SCHEMA information_schema;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(ref_entity_name => 'my_db.my_schema.my_table', ref_entity_domain => 'table'));
   ```

## Extended example

Creating a projection policy and assigning the projection policy to a column follows the same general procedure as creating and assigning
other policies, such as masking and row access policies:

1. For a centralized management approach, create a custom role (e.g. `proj_policy_admin`) to manage the policy.
2. Grant this role the privileges to create and assign a projection policy.
3. Create the projection policy.
4. Assign the projection policy to a column.

Based on this general procedure, complete the following steps to assign a projection policy to a column:

1. Create a custom role to manage the projection policy:

   ```sqlexample
   USE ROLE useradmin;

   CREATE ROLE proj_policy_admin;
   ```
2. Grant the `proj_policy_admin` custom role the privileges to create a projection policy in a schema and assign the projection policy
   to any table or view column in the Snowflake account.

   This step assumes the projection policy will be stored in a database and schema named `privacy.projpolicies` and this database and
   schema already exist:

   ```sqlexample
   GRANT USAGE ON DATABASE privacy TO ROLE proj_policy_admin;
   GRANT USAGE ON SCHEMA privacy.projpolicies TO ROLE proj_policy_admin;

   GRANT CREATE PROJECTION POLICY
     ON SCHEMA privacy.projpolicies TO ROLE proj_policy_admin;

   GRANT APPLY PROJECTION POLICY ON ACCOUNT TO ROLE proj_policy_admin;
   ```

   For details, see Privileges and commands (in this topic).
3. Create a projection policy to prevent column projection:

   ```sqlexample
   USE ROLE proj_policy_admin;
   USE SCHEMA privacy.projpolicies;

   CREATE OR REPLACE PROJECTION POLICY proj_policy_false
   AS () RETURNS PROJECTION_CONSTRAINT ->
   PROJECTION_CONSTRAINT(ALLOW => false);
   ```
4. Assign the projection policy to a table column:

   ```sqlexample
   ALTER TABLE customers MODIFY COLUMN active
   SET PROJECTION POLICY privacy.projpolicies.proj_policy_false;
   ```

## Projection policies with Snowflake features

The following subsections briefly summarize how projection policies interact with various Snowflake features and services.

### Masking & row access policies

This section describes how a projection policy interacts with a [masking policy](security-column-intro.md) and a
[row access policy](security-row-intro.md).

Multiple policies:
:   A column can have a masking policy and a projection policy at the same time, and the table containing this column can be protected by a
    row access policy. If all three policies are present, Snowflake processes the table and policies as follows:

    1. Apply row filters according to the row access policy.
    2. Determine if the query is attempting to project any columns that are restricted by the projection policy, and if so, reject the query.
    3. Apply column masks according to the masking policy.

    A column protected by a masking policy can also be projection constrained. For example, a masking policy set on a column containing
    account numbers can have a condition that allows users with the `finance_admin` custom role to see the account numbers and another
    condition to replace the account numbers with a hash for all other roles.

    A projection policy can further restrict the column such that users with the `analyst` custom role cannot project the column. Note that
    users with the `analyst` custom role can still analyze the column by grouping hashes or joining on these hashes.

    Snowflake recommends that policy administrators work with internal compliance and regulatory officers to determine the columns that
    should be projection constrained.

Policy evaluation:
:   A projection constrained column cannot be referenced by a masking policy or a row access policy when:

    * Assigning a row access policy to a table.
    * Enumerating one or more columns in a [conditional masking policy](security-column-intro.md).
    * Performing a mapping table lookup.

    As mentioned in the Limitations (in this topic), a projection policy `body` cannot reference a
    column protected by a masking policy or a table protected by a row access policy.

### Dependent objects with other projection policies

Consider the following series of objects:

> `base_table` » `v1` » `v2`
>
> Where:
>
> * `v1` is a view built from the table named `base_table`.
> * `v2` is a view built from `v1`.

If there is a query on a column in a view that is projection-constrained and that column depends on a projection constrained column in
`base_table`, the view column will be projected only if both projection policies allow the column to be projected.

Snowflake checks the column lineage chain all the way to the base table to ensure that any references to the column are not projection
constrained. If any column in the lineage chain is projection constrained and the column is not allowed to be projected, Snowflake blocks
the query.

### Views & materialized views

A projection policy on a view column constrains the view column and not the underlying base table column.

Regarding references, a projection policy that constrains a table column carries over to a view that references the constrained table
column.

### Streams & tasks

Projection policies on columns in a table carry over to a stream on the same table. Note that a projection policy cannot be set on a stream.

Similarly, a projection constrained column remains constrained when a task references the constrained column.

### UNPIVOT

The result of an [UNPIVOT](../sql-reference/constructs/unpivot.md) construct depends on whether a column was initially constrained by a
projection policy. Note:

* Constrained columns prior to and after executing UNPIVOT remain projection constrained.
* The `name_column` always appears in the query result.
* If any columns in the `column_list` are projection constrained, the `value_column` is also projection constrained.

### Cloned objects

The following approach helps to safeguard data from users with the SELECT privilege on a cloned table or view that is stored in the cloned
database or schema:

* Cloning an individual projection policy object is not supported.
* Cloning a schema results in the cloning of all projection policies within the schema.
* A cloned table maps to the same projection policies as the source table.

  + When a table is cloned in the context of its parent schema cloning, if the source table has a reference to a projection policy in the
    same parent schema (i.e. a local reference), the cloned table will have a reference to the cloned projection policy.
  + If the source table refers to a projection policy in a different schema (i.e. a foreign reference), then the cloned table retains the
    foreign reference.

For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

### Replication

Projection policies and their assignments can be replicated using database replication and replication groups.

For [database replication](database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
  replication are on lower editions.
* A table or view contained in the primary database has a [dangling reference](database-replication-considerations.md) to a
  projection policy in another database.

The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
[replication group](account-replication-intro.md).

### User-defined functions (UDFs)

Note the following regarding projection constraints and UDFs:

Scalar SQL UDFs:
:   Snowflake evaluates the UDF and then applies the projection policy to the projection constrained column.

    If a column in a SELECT statement is transitively derived from a UDF, which is also derived from a projection constrained column,
    Snowflake blocks the query. In other words:

    `pc_column` » UDF » column (in SELECT statement)

    Where:

    * `pc_column` refers to a projection constrained column.

    Because the column in the SELECT statement can be traced to a projection constrained column, Snowflake blocks the query.

SQL UDTFs:
:   SQL user-defined table functions (UDTF) follow the same behavior as SQL UDFs, except that because rows are returned in the function
    output, Snowflake evaluates each table column independently to determine whether to project the column in the function output.

Other UDFs:
:   The following applies to [Introduction to Java UDFs](../developer-guide/udf/java/udf-java-introduction.md), [Introduction to JavaScript UDFs](../developer-guide/udf/javascript/udf-javascript-introduction.md),
    [Introduction to Python UDFs](../developer-guide/udf/python/udf-python-introduction.md):

    * A projection constrained column is constrained in the UDTF output.

Logging & Event Tables:
:   When a UDF, UDTF, or JavaScript UDF has a projection-constrained argument, Snowflake does not capture log and event details in the
    corresponding event table. However, Snowflake allows the UDF/UDTF to execute and does not fail the statement calling the UDF/UDTF due to
    logging reasons.

## Privileges and commands

The following subsections provide information to help manage projection policies.

### Projection policy privileges

Snowflake supports the following privileges on the projection policy object.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables the set and unset operations for a projection policy on a column. |
| OWNERSHIP | Transfers ownership of the projection policy, which grants full control over the projection policy. Required to alter most properties of a projection policy. |

For details, see Summary of DDL commands, operations, and privileges (in this topic).

### Projection policy DDL reference

Snowflake supports the following DDL to create and manage projection policies.

* [CREATE PROJECTION POLICY](../sql-reference/sql/create-projection-policy.md)
* [ALTER PROJECTION POLICY](../sql-reference/sql/alter-projection-policy.md)
* [DESCRIBE PROJECTION POLICY](../sql-reference/sql/desc-projection-policy.md)
* [DROP PROJECTION POLICY](../sql-reference/sql/drop-projection-policy.md)
* [SHOW PROJECTION POLICIES](../sql-reference/sql/show-projection-policies.md)

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between projection policy privileges and DDL operations.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege required |
| --- | --- |
| Create projection policy. | A role with the CREATE PROJECTION POLICY privilege in the same schema. |
| Alter projection policy. | The role with the OWNERSHIP privilege on the projection policy. |
| Describe projection policy | One of the following:   * A role with the global APPLY PROJECTION POLICY privilege, or * A role with the OWNERSHIP privilege on the projection policy, or * A role with the APPLY privilege on the projection policy. |
| Drop projection policy. | A role with the OWNERSHIP privilege on the projection policy. |
| Show projection policies. | One of the following:   * A role with the USAGE privilege on the schema in which the projection policy exists, or * A role with the APPLY PROJECTION POLICY on the account. |
| Set or unset a projection policy on a column. | One of the following:   * A role with the APPLY PROJECTION POLICY privilege on the account, or * A role with the APPLY privilege on the projection policy and the OWNERSHIP privilege on the table or view. |

Snowflake supports different permissions to create and set a projection policy on an object.

1. For a centralized projection policy management approach in which the `projection_policy_admin` custom role creates and sets projection
   policies on all columns, the following permissions are necessary:

   ```sqlexample
   USE ROLE securityadmin;
   GRANT USAGE ON DATABASE mydb TO ROLE projection_policy_admin;
   GRANT USAGE ON SCHEMA mydb.schema TO ROLE projection_policy_admin;

   GRANT CREATE PROJECTION POLICY ON SCHEMA mydb.schema TO ROLE projection_policy_admin;
   GRANT APPLY ON PROJECTION POLICY ON ACCOUNT TO ROLE projection_policy_admin;
   ```
2. In a hybrid management approach, a single role has the CREATE PROJECTION POLICY privilege to ensure projection policies are named
   consistently and individual teams or roles have the APPLY privilege for a specific projection policy.

   For example, the custom role `finance_role` role can be granted the permission to set the projection policy `cost_center` on tables
   and views the role owns (i.e. the role has the OWNERSHIP privilege on the table or view):

   ```sqlexample
   USE ROLE securityadmin;
   GRANT CREATE PROJECTION POLICY ON SCHEMA mydb.schema TO ROLE projection_policy_admin;
   GRANT APPLY ON PROJECTION POLICY cost_center TO ROLE finance_role;
   ```

---
title: Python and Java support for serverless tasks
source: https://docs.snowflake.com/en/user-guide/tasks-python-jvm.md
section: User Guide
---

# Python and Java support for serverless tasks

[Serverless tasks](tasks-intro.md) can invoke the following object types and functions: user-defined functions (UDFs) and stored procedures written in Python, Java, and Scala.

You can use Python or Java in your tasks in a few different ways. To understand the difference between these options, see
[Choosing whether to write a stored procedure or a user-defined function](../developer-guide/stored-procedures-vs-udfs.md).

## User-defined functions

You can create UDFs to call in your task’s AS clause. You can use UDFs to perform operations not available in SQL. For more information
about UDFs, see [User-defined functions overview](../developer-guide/udf/udf-overview.md).

The following examples in Python and Java create a function that adds one to the input value.

PythonJava

```sqlexample
CREATE OR REPLACE FUNCTION addone(i int)
  RETURNS int
  LANGUAGE python
  RUNTIME_VERSION = '3.8'
  HANDLER = 'addone_py'
  AS
    $$
    def addone_py(i):
      return i+1
    $$;
```

```sqlexample
CREATE OR REPLACE FUNCTION add_one(i int)
  RETURNS int
  LANGUAGE java
  CALLED ON NULL INPUT
  HANDLER = 'TestFunc.addOne'
  TARGET_PATH = '@~/testfunc.jar'
  AS
    'class TestFunc {
      public static int addOne(int i) {
        return i+1;
      }
    }';
```

The following examples create `my_task2` that adds one to the return value of `my_task1`.

PythonJava

```sqlexample
CREATE OR REPLACE TASK IF NOT EXISTS my_task2
  AFTER my_task1
  AS
    SELECT addone(SYSTEM$GET_PREDECESSOR_RETURN_VALUE());
```

```sqlexample
CREATE OR REPLACE TASK IF NOT EXISTS my_task2
  AFTER my_task1
  AS
    SELECT add_one(SYSTEM$GET_PREDECESSOR_RETURN_VALUE());
```

## Stored procedures

You can create stored procedures to call in your task’s AS clause. Stored procedures generally perform administrative operations by
executing SQL statements. For more information about stored procedures, see
[Stored procedures overview](../developer-guide/stored-procedure/stored-procedures-overview.md).

The following examples in Python and Java accept a table name and role name to return a filtered table with rows that match the specified
role.

PythonJava

```sqlexample
CREATE OR REPLACE PROCEDURE filterByRole(tableName VARCHAR, role VARCHAR)
  RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.8'
  PACKAGES = ('snowflake-snowpark-python')
  HANDLER = 'filter_by_role'
  AS
    $$
    from snowflake.snowpark.functions import col

    def filter_by_role(session, table_name, role):
      df = session.table(table_name)
      return df.filter(col("role") == role)
    $$;
```

```sqlexample
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
  RETURNS TABLE()
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:latest')
  HANDLER = 'FilterClass.filterByRole'
  AS
    $$
    import com.snowflake.snowpark_java.*;

    public class FilterClass {
      public DataFrame filterByRole(Session session, String tableName, String role) {
        DataFrame table = session.table(tableName);
        DataFrame filteredRows = table.filter(Functions.col("role").equal_to(Functions.lit(role)));
        return filteredRows;
      }
    }
    $$;
```

The following examples create `task2` that calls the stored procedure with the table returned from task1 and the role of `dev`.

PythonJava

```sqlexample
CREATE OR REPLACE TASK IF NOT EXISTS my_task2
  AFTER my_task1
  AS
    CALL filterByRole(SYSTEM$GET_PREDECESSOR_RETURN_VALUE(), 'dev');
```

```sqlexample
CREATE OR REPLACE TASK IF NOT EXISTS my_task2
  AFTER my_task1
  AS
    CALL filter_by_role(SYSTEM$GET_PREDECESSOR_RETURN_VALUE(), 'dev');
```

## SQL AS clause

You can also define Python or Java code directly in the AS clause of your task definition.

The following example uses Python to set the return value of `task2` to a string.

```sqlexample
CREATE OR REPLACE TASK IF NOT EXISTS task2
  SCHEDULE = '1 minute'
  AS
    $$
    print(Task completed successfully.)
    $$
  ;
```

---
title: Queries too large to fit in memory
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-memory.md
section: User Guide
---

# Queries too large to fit in memory

This topic discusses how a warehouse owner or administrator can resolve memory spillage in order to improve the performance of a query.

Performance degrades drastically when a warehouse runs out of memory while executing a query because memory bytes must “spill” onto local
disk storage. If the query requires even more memory, it spills onto remote cloud-provider storage, which results in even worse performance.

> **Note:**
>
> You must have [access to the shared SNOWFLAKE database](../sql-reference/account-usage.md) to execute the diagnostic queries provided in this topic. By default, only the ACCOUNTADMIN role has the privileges needed to execute the queries.

## Finding queries that spill to storage

This query identifies the top 10 worst offending queries in terms of bytes spilled to local and remote storage.

```sqlexample
SELECT query_id, SUBSTR(query_text, 1, 50) partial_query_text, user_name, warehouse_name,
  bytes_spilled_to_local_storage, bytes_spilled_to_remote_storage
FROM  snowflake.account_usage.query_history
WHERE (bytes_spilled_to_local_storage > 0
  OR  bytes_spilled_to_remote_storage > 0 )
  AND start_time::date > dateadd('days', -45, current_date)
ORDER BY bytes_spilled_to_remote_storage, bytes_spilled_to_local_storage DESC
LIMIT 10;
```

## Recommendations

Data spilling to storage can have a negative impact on query performance (especially if the query has to spill to remote storage). To alleviate this, Snowflake recommends:

* Using a larger warehouse (effectively increasing the available memory/local storage space for the operation)
* Processing data in smaller batches.

You can use the [Query Profile](ui-snowsight-activity.md) to identify which operation nodes are causing data to spill to storage.
For considerations for selecting the appropriate warehouse sizing, please refer to [Warehouse considerations](warehouses-considerations.md).

For more information about the performance implications of spilling, see the community article
[Performance impact from local and remote disk spilling](https://community.snowflake.com/s/article/Performance-impact-from-local-and-remote-disk-spilling).

> **Tip:**
>
> When the query acceleration service (QAS) is enabled, Snowflake writes a small amount of data to remote storage
> for each eligible query, even if QAS isn’t used for that query. Therefore, don’t be concerned by a nonzero
> value for `bytes_spilled_to_remote_storage` in the QUERY_HISTORY view when QAS is enabled.

---
title: Query a table in Snowflake Open Catalog using a third-party engine
source: https://docs.snowflake.com/en/user-guide/opencatalog/query-table-using-third-party-engine.md
section: User Guide
---

# Query a table in Snowflake Open Catalog using a third-party engine

This topic provides instructions for using a third-party query engine to query a table in Snowflake Open Catalog.

## Prerequisites

Before you can query a table in Open Catalog, you must do the following:

* [Create a catalog](create-catalog.md).
* Grant read privileges to the catalog you created. For more
  information, see [Secure catalogs](secure-catalogs.md).
* [Configure a service connection](configure-service-connection.md).
* Register the service connection you configured. For more information, see [Register a service connection](register-service-connection.md).

## Considerations for querying Snowflake-managed Apache Iceberg™ tables

If you use Snowflake and sync a Snowflake-managed Iceberg table to Open Catalog, be aware of the following considerations when querying
the table in Open Catalog:

* [Unquoted identifiers](https://docs.snowflake.com/sql-reference/identifiers-syntax#label-unquoted-identifier): If you create a database, schema,
  or Iceberg table in Snowflake and give it a name that contains letters *without* enclosing the name in double quotes, you must specify the
  name in all caps when you reference it in Open Catalog. For example, if `iceberg_tables.public.table1` is the name in Snowflake, use
  `ICEBERG_TABLES.PUBLIC.TABLE1` in Open Catalog.
* [Double-quoted identifiers](https://docs.snowflake.com/sql-reference/identifiers-syntax#label-delimited-identifier): If you create an object in
  Snowflake with the name in double quotes, when referencing the object in a query in Open Catalog, you must do the following:

  + Enclose the object name with backticks.
  + Specify the object name exactly as it appears in Open Catalog to account for any character that was rendered as a different character,
    when applicable.

  The following example shows the `My 'Identifier'` Snowflake identifier, which was created with double quotes, being referenced in a query in
  Open Catalog:

  ```python
    spark.sql ("select * from `My+'Identifier'`.PUBLIC.TABLE1").show()
  ```

  Open Catalog renders the space character in double-quoted Snowflake identifiers as `+`.

## Example: Query a table

The following example code shows how to use Apache Spark to query the `customers` table in the catalog `catalog1`. The `customers` table is located
under `namespace1a`, which is nested under the top-level namespace `namespace1`:

```python
spark.sql("use catalog1").show()
spark.sql("use namespace1.namespace1a").show()
spark.sql("SELECT * FROM customers").show()
```

---
title: Query a table in Snowflake Open Catalog using Snowflake
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-open-catalog-query.md
section: User Guide
---

# Query a table in Snowflake Open Catalog using Snowflake

To query a table registered in [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) using Snowflake,
you can create an [externally managed](tables-iceberg.md) Apache Iceberg™ table and a catalog integration.

The table represents the Iceberg table in Snowflake Open Catalog and provides read-only access.

> **Note:**
>
> This topic covers how to create a single externally managed Iceberg table. Alternatively,
> use a [catalog-linked database](tables-iceberg-catalog-linked-database.md) to automatically discover multiple tables in Open Catalog.

## Prerequisites

Before you start, you need the following:

* An Iceberg table registered with Open Catalog.
* A service connection that Snowflake can use to connect to Open Catalog.
  You can use an existing service connection that you’ve set up roles and privileges for,
  or [Configure a service connection](https://other-docs.snowflake.com/en/opencatalog/configure-service-connection#configure-a-service-connection) for Snowflake. If you configure a new service connection, you must also configure access control for it.

## Step 1: Create an external volume in Snowflake

If you don’t have one already, start by creating an external volume in Snowflake that provides access to the
cloud storage location where you store your table data and metadata.

Complete the instructions for your cloud storage service:

* [Amazon S3](tables-iceberg-configure-external-volume-s3.md)
* [Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
* [Azure Storage](tables-iceberg-configure-external-volume-azure.md)

## Step 2: Create a catalog integration for Open Catalog

Next, use the [CREATE CATALOG INTEGRATION](../sql-reference/sql/create-catalog-integration-open-catalog.md) command to
create a catalog integration in Snowflake that uses OAuth to connect to Open Catalog using your service connection credentials. The
CATALOG_NAMESPACE parameter is optional. However, if you don’t specify it with the catalog integration, you must specify it when you create
an externally managed table. This section includes the following examples:

* If you don’t use private connectivity for inbound network traffic in Open Catalog, see the
  example Snowflake catalog integration that uses the public internet.
* If you use [private connectivity for inbound network traffic in Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-inbound),
  see the example Snowflake catalog integration that uses a private IP address.

### Example: Catalog integration that uses the public internet

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION open_catalog_int
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE= 'myOpenCatalogNamespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = 'myOpenCatalogName'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'my-client-id'
    OAUTH_CLIENT_SECRET = 'my-client-secret'
    OAUTH_ALLOWED_SCOPES = ( 'PRINCIPAL_ROLE:ALL' )
  )
  ENABLED = TRUE;
```

> **Note:**
>
> * To find your Snowflake organization name (`<orgname>`), follow the steps in [Finding the organization and account name for an account](admin-account-identifier.md).
> * To find `<my-snowflake-open-catalog-account-name`,
>   see [Find the account name for a Snowflake Open Catalog account](https://other-docs.snowflake.com/en/opencatalog/find-account-name) in
>   the Snowflake Open Catalog documentation.

### Example: Catalog integration that uses a private IP address

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION open_catalog_int
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE= 'myOpenCatalogNamespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://<open_catalog_privatelink_account_url>/polaris/api/catalog'
    CATALOG_API_TYPE = PRIVATE
    CATALOG_NAME = 'myOpenCatalogName'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'my-client-id'
    OAUTH_CLIENT_SECRET = 'my-client-secret'
    OAUTH_ALLOWED_SCOPES = ( 'PRINCIPAL_ROLE:ALL' )
  )
  ENABLED = TRUE;
```

> **Note:**
>
> For `<open_catalog_privatelink_account_url>`, enter one of the following values:
>
> * **PrivateLink Account URL**
> * **Regionless PrivateLink Account URL**
>
> To obtain these values, retrieve your Open Catalog account settings for private connectivity. For details, see the instructions for the
> cloud platform where your Open Catalog account is hosted:
>
> * [AWS](http://docs.snowflake.com/user-guide/opencatalog/private-connectivity-inbound-configure-aws#step-3-retrieve-your-open-catalog-account-settings)
> * [Azure](http://docs.snowflake.com/user-guide/opencatalog/private-connectivity-inbound-configure-azure#step-1-retrieve-your-open-catalog-account-settings)

## Step 3: Create an externally managed table

Create an Iceberg table in Snowflake using the external volume and catalog integration that you previously configured.

For CATALOG_TABLE_NAME, specify the table name as it appears in Open Catalog.

```sqlexample
CREATE ICEBERG TABLE open_catalog_iceberg_table
  CATALOG = 'open_catalog_int'
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG_TABLE_NAME = 'my_iceberg_table';
```

You can optionally enable automated refreshes of the table metadata by specifying `AUTO_REFRESH = TRUE`.
For more information, see [Automatically refresh Apache Iceberg™ tables](tables-iceberg-auto-refresh.md). If you didn’t specify a CATALOG_NAMESPACE with the catalog integration
you created in the previous step, you must specify this parameter to set a catalog namespace for the table.

> **Note:**
>
> To retrieve a list of tables or namespaces in your remote catalog, you can use the following functions:
>
> * [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](../sql-reference/functions/system_list_iceberg_tables_from_catalog.md)
> * [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](../sql-reference/functions/system_list_namespaces_from_catalog.md)

## Step 4: Query the table using Snowflake

You can now use Snowflake to query the table in Open Catalog. You can also join the query results with other Snowflake tables.

```sqlexample
SELECT id, date
  FROM open_catalog_iceberg_table
  LIMIT 10;
```

---
title: Query data in staged files
source: https://docs.snowflake.com/en/user-guide/querying-stage.md
section: User Guide
---

# Query data in staged files

Snowflake supports using standard SQL to query data files located in an internal (i.e. Snowflake) stage or *named* external (Amazon S3, Google Cloud Storage, or Microsoft Azure) stage. This can be useful for inspecting/viewing the contents of the staged files, particularly before loading or after unloading data.

In addition, by referencing [metadata columns](querying-metadata.md) in a staged file, a staged data query can return additional information, such as filename and row numbers, about the file.

Snowflake utilizes support for staged data queries to enable [transforming data during loading](data-load-transform.md).

> **Note:**
>
> This functionality is primarily for performing simple queries only, particularly when loading and/or transforming data, and is not intended to replace loading data into tables and performing queries on the tables.

## Query syntax and parameters

Query staged data files using a [SELECT](../sql-reference/sql/select.md) statement with the following syntax:

> ```sqlsyntax
> SELECT [<alias>.]$<file_col_num>[:<element>] [ , [<alias>.]$<file_col_num>[:<element>] , ...  ]
>   FROM { <internal_location> | <external_location> }
>   [ ( FILE_FORMAT => '<namespace>.<named_file_format>', PATTERN => '<regex_pattern>' ) ]
>   [ <alias> ]
> ```

For the syntax for transforming data during a load, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

> **Important:**
>
> The list of objects returned for an external stage may include one or more “directory blobs”; essentially, paths that end in a forward slash character (`/`), e.g.:
>
> ```sqlexample
> LIST @my_gcs_stage;
>
> +---------------------------------------+------+----------------------------------+-------------------------------+
> | name                                  | size | md5                              | last_modified                 |
> |---------------------------------------+------+----------------------------------+-------------------------------|
> | my_gcs_stage/load/                    |  12  | 12348f18bcb35e7b6b628ca12345678c | Mon, 11 Sep 2019 16:57:43 GMT |
> | my_gcs_stage/load/data_0_0_0.csv.gz   |  147 | 9765daba007a643bdff4eae10d43218y | Mon, 11 Sep 2019 18:13:07 GMT |
> +---------------------------------------+------+----------------------------------+-------------------------------+
> ```
>
> These blobs are listed when directories are created in the Google Cloud console rather than using any other tool provided by Google.
>
> SELECT statements that reference a stage can fail when the object list includes directory blobs. To avoid errors, we recommend using file pattern matching to identify the files for inclusion (i.e. the PATTERN clause) when the file list for a stage includes directory blobs.

### Required parameters

`[alias.]$file_col_num[:element] [ , [alias.]$file_col_num[:element] , ...  ]`
:   Specifies an explicit set of fields/columns in data files staged in either an internal or external location, where:

    `alias`
    :   Specifies the optional “table” alias defined, if any, in the FROM clause.

    `file_col_num`
    :   Specifies the positional number of the field/column (in the file) that contains the data to be loaded (`1` for the first field, `2` for the second field, etc.)

    `element`
    :   Specifies the path and element name of a repeating value (applies only to semi-structured data files).

`internal_location` or `external_location`
:   Specifies the location where the data files are staged:

    * `internal_location` is the URI specifier for the location in Snowflake where files containing data are staged:

      |  |  |
      | --- | --- |
      | `@[namespace.]internal_stage_name[/path]` | Files are in the specified named internal stage. |
      | `@[namespace.]%table_name[/path]` | Files are in the stage for the specified table. |
      | `@~[/path]` | Files are in the stage for the current user. |
    * `external_location` is the URI specifier for the named external stage or external location (Amazon S3, Google Cloud Storage, or Microsoft Azure) where files containing data are staged:

      |  |  |
      | --- | --- |
      | `@[namespace.]external_stage_name[/path]` | Files are in the specified named external stage. |

    Where:

    > * `namespace` is the database and/or schema in which the internal or external stage resides. It is optional if a database and schema are currently in use within the user session; otherwise, it is required.
    > * The optional `path` parameter restricts the set of files being queried to the files under the folder prefix. If `path` is specified, but no file is explicitly named in the path, all data files in the path are queried.

    > **Note:**
    >
    > * The URI string for an external storage location (Amazon S3, Google Cloud Storage, or Microsoft Azure) must be enclosed in single quotes; however, you can enclose any URI string in single quotes, which allows special characters, including spaces, in location and file names. For example:
    >
    >   > Internal:
    >   > :   `'@~/path 1/file 1.csv'`
    >   >
    >   >     `'@%my table/path 1/file 1.csv'`
    >   >
    >   >     `'@my stage/path 1/file 1.csv'`
    > * Relative path modifiers such as `/./` and `/../` are interpreted literally, because “paths” are literal prefixes for a name. For example:
    >
    >   > S3:
    >   > :   `COPY INTO mytable FROM @mystage/./../a.csv`
    >
    >   In these COPY statements, the system look for a file literally named `./../a.csv` in the storage location.

### Optional parameters

`( FILE_FORMAT => 'namespace.named_file_format' )`
:   Specifies a named file format that describes the format of the staged data files to query.

    Note that this parameter is optional if either of the following conditions are true:

    * The files are formatted in the default file format (CSV) with the default delimiters: `,` (as the field delimiter) and the new line character (as the record delimiter).
    * The files are in an internal or external stage and the stage definition describes the file format.

    If referencing a file format in the current namespace for your user session, you can omit the single quotes around the format identifier.

    Otherwise, this parameter is required. For more details, see File Formats (in this topic).

    `namespace` optionally specifies the database and/or schema for the table, in the form of `database_name.schema_name` or `schema_name`. It is optional
    if a database and schema are currently in use within the user session; otherwise, it is required.

    If the identifier contains spaces, special characters, or mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`PATTERN => 'regex_pattern'`
:   A regular expression pattern string, enclosed in single quotes, specifying the file names and/or paths on the external stage to match.

    > **Tip:**
    >
    > For the best performance, try to avoid applying patterns that filter on a large number of files.

`alias`
:   Specifies a “table” alias for the internal/external location where the files are staged.

## File formats

To parse a staged data file, it is necessary to describe its file format. The default file format is character-delimited UTF-8 text (i.e. CSV), with the comma character (`,`) as the field delimiter
and new line character as the record delimiter. If the source data is in another format (JSON, Avro, etc.), you must specify the corresponding file format type (and options).

To explicitly specify file format options, set them in one of the following ways:

Querying staged data files:
:   As file format options specified for a named file format or stage object. The named file format/stage object can then be referenced in the SELECT statement.

Loading columns from staged data files:
:   * As file format options specified directly in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).
    * As file format options specified for a named file format or stage object. The named file format/stage object can then be referenced in the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement.

## Query examples

### Example 1: Query columns in a CSV file

The following example illustrates staging multiple CSV data files (with the same file format) and then querying the data columns in the files.

This example assumes the files have the following names and are located in the root directory in a macOS or Linux environment:

* `/tmp/data1.csv` contains two records:

  ```bash
  a|b
  c|d
  ```
* `/tmp/data2.csv` contains two records:

  ```bash
  e|f
  g|h
  ```

To stage and query the files:

> ```sqlexample
> -- Create a file format.
> CREATE OR REPLACE FILE FORMAT myformat TYPE = 'csv' FIELD_DELIMITER = '|';
>
> -- Create an internal stage.
> CREATE OR REPLACE STAGE mystage1;
>
> -- Stage the data files.
> PUT file:///tmp/data*.csv @mystage1;
>
> -- Query the filename and row number metadata columns and the regular data columns in the staged file.
> -- Optionally apply pattern matching to the set of files in the stage and optional path.
> -- Note that the table alias is provided to make the statement easier to read and is not required.
> SELECT t.$1, t.$2 FROM @mystage1 (file_format => 'myformat', pattern=>'.*data.*[.]csv.gz') t;
>
> +----+----+
> | $1 | $2 |
> |----+----|
> | a  | b  |
> | c  | d  |
> | e  | f  |
> | g  | h  |
> +----+----+
>
> SELECT t.$1, t.$2 FROM @mystage1 t;
>
> +-----+------+
> | $1  | $2   |
> |-----+------|
> | a|b | NULL |
> | c|d | NULL |
> | e|f | NULL |
> | g|h | NULL |
> +-----+------+
> ```

> **Note:**
>
> The file format is required in this example to correctly parse the fields in the staged files. In the second query, the file format is omitted, causing the `|` field delimiter to
> be ignored and resulting in the values returned for `$1` and `$2`.
>
> However, if the file format is included in the stage definition, you can omit it from the SELECT statement. See Example 3: Query elements in a JSON file.

### Example 2: Call functions when querying a staged data file

Get the ASCII code for the first character of each column in the data files staged in Example 1: Query columns in a CSV file:

> ```sqlexample
> SELECT ascii(t.$1), ascii(t.$2) FROM @mystage1 (file_format => myformat) t;
>
> +-------------+-------------+
> | ASCII(T.$1) | ASCII(T.$2) |
> |-------------+-------------|
> |          97 |          98 |
> |          99 |         100 |
> |         101 |         102 |
> |         103 |         104 |
> +-------------+-------------+
> ```

> **Note:**
>
> If the file format is included in the stage definition, you can omit it from the SELECT statement. See Example 3: Query elements in a JSON file.

### Example 3: Query elements in a JSON file

This example illustrates staging a JSON data file containing the following objects and then querying individual elements within the objects in the file:

> ```sqljson
> {"a": {"b": "x1","c": "y1"}},
> {"a": {"b": "x2","c": "y2"}}
> ```

This example assumes the file is named `/tmp/data1.json` and is located in the root directory in a macOS or Linux environment.

To stage and query the file:

> ```sqlexample
> -- Create a file format
> CREATE OR REPLACE FILE FORMAT my_json_format TYPE = 'json';
>
> -- Create an internal stage
> CREATE OR REPLACE STAGE mystage2 FILE_FORMAT = my_json_format;
>
> -- Stage the data file
> PUT file:///tmp/data1.json @mystage2;
>
> -- Query the repeating a.b element in the staged file
> SELECT parse_json($1):a.b FROM @mystage2/data1.json.gz;
>
> +--------------------+
> | PARSE_JSON($1):A.B |
> |--------------------|
> | "x1"               |
> | "x2"               |
> +--------------------+
> ```

---
title: Query directory tables
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-query.md
section: User Guide
---

# Query directory tables

This topic covers how to query a directory table to retrieve a list of all files on a stage with metadata,
such as the Snowflake file URL, for each file.

Syntax for querying a directory table:

```sqlexample
SELECT * FROM DIRECTORY( @<stage_name> )
```

Where:

`stage_name`
:   Name of a stage that has a directory table enabled.

For information about SELECT as a statement, and the other clauses within the statement, see [Query syntax](../sql-reference/constructs.md) in the Snowflake SQL Command Reference.

## Output

The output from a directory table query can include the following columns:

> | Column | Data Type | Description |
> | --- | --- | --- |
> | RELATIVE_PATH | TEXT | Path to the files to access using the file URL. |
> | SIZE | NUMBER | Size of the file (in bytes). |
> | LAST_MODIFIED | TIMESTAMP_TZ | Timestamp when the file was last updated in the stage. |
> | MD5 | HEX | MD5 checksum for the file. |
> | ETAG | HEX | ETag header for the file. |
> | FILE_URL | TEXT | Snowflake file URL to the file.  The file URL has the following format:  ```sqlsyntax https://<account_identifier>/api/files/<db_name>.<schema_name>.<stage_name>/<relative_path> ```  Where:  `account_identifier`  Hostname of the Snowflake account for your stage. The hostname starts with an account locator (provided by Snowflake) and ends with the Snowflake domain (`snowflakecomputing.com`):  `account_locator.snowflakecomputing.com`  For more details, see [Account identifiers](admin-account-identifier.md).  **Note:** For [Business Critical](intro-editions.md) accounts, a `privatelink` segment is prepended to the URL just before `snowflakecomputing.com` (`privatelink.snowflakecomputing.com`), even if private connectivity to the Snowflake service is not enabled for your account.  `db_name`  Name of the database that contains the stage where your files are located.  `schema_name`  Name of the schema that contains the stage where your files are located.  `stage_name`  Name of the stage where your files are located.  `relative_path`  Path to the files to access using the file URL. |

## Usage notes

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

This example retrieves all metadata columns in a directory table for a stage named `mystage`:

> ```sqlexample
> SELECT * FROM DIRECTORY(@mystage);
> ```

This example retrieves the FILE_URL column values from a directory table for files greater than 100 K bytes in size:

> ```sqlexample
> SELECT FILE_URL FROM DIRECTORY(@mystage) WHERE SIZE > 100000;
> ```

This example retrieves the FILE_URL column values from a directory table for comma-separated value files:

> ```sqlexample
> SELECT FILE_URL FROM DIRECTORY(@mystage) WHERE RELATIVE_PATH LIKE '%.csv';
> ```

## Create a view for unstructured data using a directory table

You can join a directory table with other Snowflake tables to produce a view of unstructured data that combines the file URLs with
metadata about the files.

The following diagram illustrates how you can use a stage with a directory table enabled along with a separate data table to create a comprehensive
view for unstructured files on a stage.

**Example: Create a view of PDF files and their data**

The following example creates a view called `reports_information` by joining a directory table on a stage named `my_pdf_stage` with a table named
`report_metadata` using the `file_url` key. The stage contains PDF reports, while the `report_metadata` table contains
structured information about each PDF report such as the `author` and `publish_date`.
The resulting view provides a way to get information about the unstructured PDFs and their related, structured metadata.

```sqlexample
CREATE VIEW reports_information AS
  SELECT
    file_url as report_link,
    author,
    publish_date,
    approved_date,
    geography,
    num_of_pages
  FROM directory(@my_pdf_stage) s
  JOIN report_metadata m
  ON s.file_url = m.file_url
```

---
title: Query metadata for staged files
source: https://docs.snowflake.com/en/user-guide/querying-metadata.md
section: User Guide
---

# Query metadata for staged files

Snowflake automatically generates metadata for files in internal (i.e. Snowflake) stages or external (Amazon S3, Google Cloud Storage, or Microsoft Azure) stages. This metadata is “stored” in virtual columns that can be:

* Queried using a standard [SELECT](../sql-reference/sql/select.md) statement.
* Loaded into a table, along with the regular data columns, using [COPY INTO <table>](../sql-reference/sql/copy-into-table.md). For general information about querying staged data files, see [Query data in staged files](querying-stage.md).

## Metadata columns

Currently, the following metadata columns can be queried or copied into tables:

METADATA$FILENAME
:   Name of the staged data file the current row belongs to. Includes the full path to the data file.

METADATA$FILE_ROW_NUMBER
:   Row number for each record in the staged data file.

METADATA$FILE_CONTENT_KEY
:   Checksum of the staged data file the current row belongs to.

METADATA$FILE_LAST_MODIFIED
:   Last modified timestamp of the staged data file the current row belongs to. Returned as TIMESTAMP_NTZ.

METADATA$START_SCAN_TIME
:   Start timestamp of operation for each record in the staged data file. Returned as TIMESTAMP_LTZ.

## Query limitations

* Metadata cannot be inserted into existing table rows.
* Metadata columns can only be queried by name; as such, they are not included in the output of any of the following statements:

  > + [SELECT \*](../sql-reference/sql/select.md)
  > + [SHOW <objects>](../sql-reference/sql/show.md)
  > + [DESCRIBE <object>](../sql-reference/sql/desc.md)
  > + [Queries on INFORMATION_SCHEMA views](../sql-reference/info-schema.md)

## Query examples

### Example 1: Query the metadata columns for a CSV file

The following example illustrates staging multiple CSV data files (with the same file format) and then querying the metadata columns, as well as the regular data columns, in the files.

This example assumes the files have the following names and are located in the root directory in a macOS or Linux environment:

* `/tmp/data1.csv` contains two records:

  ```bash
  a|b
  c|d
  ```
* `/tmp/data2.csv` contains two records:

  ```bash
  e|f
  g|h
  ```

To stage and query the files:

> ```sqlexample
> -- Create a file format
> CREATE OR REPLACE FILE FORMAT myformat
>   TYPE = 'csv' FIELD_DELIMITER = '|';
>
> -- Create an internal stage
> CREATE OR REPLACE STAGE mystage1;
>
> -- Stage a data file
> PUT file:///tmp/data*.csv @mystage1;
>
> -- Query the filename and row number metadata columns and the regular data columns in the staged file
> -- Note that the table alias is provided to make the statement easier to read and is not required
> SELECT METADATA$FILENAME, METADATA$FILE_ROW_NUMBER, METADATA$FILE_CONTENT_KEY, METADATA$FILE_LAST_MODIFIED, METADATA$START_SCAN_TIME, t.$1, t.$2 FROM @mystage1 (file_format => myformat) t;
>
> +-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+----+----+
> | METADATA$FILENAME | METADATA$FILE_ROW_NUMBER | METADATA$FILE_CONTENT_KEY | METADATA$FILE_LAST_MODIFIED |      METADATA$START_SCAN_TIME | $1 | $2 |
> |-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+----+----|
> | data2.csv.gz      |                        1 | aaa11bb2cccccaaaaac1234d9 |     2022-05-01 10:15:57.000 |  2023-02-02 01:31:00.713 +0000| e  | f  |
> | data2.csv.gz      |                        2 | aaa11bb2cccccaaaaac1234d9 |     2022-05-01 10:05:35.000 |  2023-02-02 01:31:00.755 +0000| g  | h  |
> | data1.csv.gz      |                        1 | 39ab11bb2cdeacdcdac1234d9 |     2022-08-03 10:15:26.000 |  2023-02-02 01:31:00.778 +0000| a  | b  |
> | data1.csv.gz      |                        2 | 39ab11bb2cdeacdcdac1234d9 |     2022-08-03 11:15:55.000 |  2023-02-02 01:31:00.778 +0000| c  | d  |
> +-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+----+----+
>
> SELECT METADATA$FILENAME, METADATA$FILE_ROW_NUMBER, METADATA$FILE_CONTENT_KEY, METADATA$FILE_LAST_MODIFIED, METADATA$START_SCAN_TIME, t.$1, t.$2 FROM @mystage1 t;
>
> +-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+-----+------+
> | METADATA$FILENAME | METADATA$FILE_ROW_NUMBER | METADATA$FILE_CONTENT_KEY | METADATA$FILE_LAST_MODIFIED |      METADATA$START_SCAN_TIME | $1  | $2   |
> |-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+-----+------|
> | data2.csv.gz      |                        1 | aaa11bb2cccccaaaaac1234d9 |     2022-05-01 10:15:57.000 |  2023-02-02 01:31:00.713 +0000| e|f | NULL |
> | data2.csv.gz      |                        2 | aaa11bb2cccccaaaaac1234d9 |     2022-05-01 10:05:35.000 |  2023-02-02 01:31:00.755 +0000| g|h | NULL |
> | data1.csv.gz      |                        1 | 39ab11bb2cdeacdcdac1234d9 |     2022-08-03 10:15:26.000 |  2023-02-02 01:31:00.778 +0000| a|b | NULL |
> | data1.csv.gz      |                        2 | 39ab11bb2cdeacdcdac1234d9 |     2022-08-03 11:15:55.000 |  2023-02-02 01:31:00.778 +0000| c|d | NULL |
> +-------------------+--------------------------+---------------------------+-----------------------------+-------------------------------+-----+------+
> ```

> **Note:**
>
> The file format is required in this example to correctly parse the fields in the staged files. In the second query, the file format is omitted, causing the `|` field delimiter to
> be ignored and resulting in the values returned for `$1` and `$2`.
>
> However, if the file format is included in the stage definition, you can omit it from the SELECT statement. See the next example for details.

### Example 2: Query the metadata columns for a JSON file

This example illustrates staging a JSON data file containing the following objects and then querying the metadata columns, as well as the objects, in the file:

> ```sqljson
> {"a": {"b": "x1","c": "y1"}},
> {"a": {"b": "x2","c": "y2"}}
> ```

This example assumes the file is named `/tmp/data1.json` and is located in the root directory in a macOS or Linux environment.

To stage and query the file:

> ```sqlexample
> -- Create a file format
> CREATE OR REPLACE FILE FORMAT my_json_format
>   TYPE = 'json';
>
> -- Create an internal stage
> CREATE OR REPLACE STAGE mystage2
>   FILE_FORMAT = my_json_format;
>
> -- Stage a data file
> PUT file:///tmp/data1.json @mystage2;
>
> -- Query the filename and row number metadata columns and the regular data columns in the staged file
> SELECT METADATA$FILENAME, METADATA$FILE_ROW_NUMBER, parse_json($1) FROM @mystage2/data1.json.gz;
>
> +-------------------+--------------------------+----------------+
> | METADATA$FILENAME | METADATA$FILE_ROW_NUMBER | PARSE_JSON($1) |
> |-------------------+--------------------------+----------------|
> | data1.json.gz     |                        1 | {              |
> |                   |                          |   "a": {       |
> |                   |                          |     "b": "x1", |
> |                   |                          |     "c": "y1"  |
> |                   |                          |   }            |
> |                   |                          | }              |
> | data1.json.gz     |                        2 | {              |
> |                   |                          |   "a": {       |
> |                   |                          |     "b": "x2", |
> |                   |                          |     "c": "y2"  |
> |                   |                          |   }            |
> |                   |                          | }              |
> +-------------------+--------------------------+----------------+
> ```

### Example 3: Load metadata columns into a table

The [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command supports copying metadata from staged data files into a target table. Use the data transformation syntax (i.e. a SELECT list) in your COPY statement.
For more information about transforming data using a COPY statement, see [Transform data during a load](data-load-transform.md).

The following example loads the metadata columns and regular data columns from Example 1: Query the metadata columns for a CSV file into a table:

> ```sqlexample
> CREATE OR REPLACE TABLE table1 (
>   filename varchar,
>   file_row_number int,
>   file_content_key varchar,
>   file_last_modified timestamp_ntz,
>   start_scan_time timestamp_ltz,
>   col1 varchar,
>   col2 varchar
> );
>
> COPY INTO table1(filename, file_row_number, file_content_key, file_last_modified, start_scan_time, col1, col2)
>   FROM (SELECT METADATA$FILENAME, METADATA$FILE_ROW_NUMBER, METADATA$FILE_CONTENT_KEY, METADATA$FILE_LAST_MODIFIED, METADATA$START_SCAN_TIME, t.$1, t.$2 FROM @mystage1/data1.csv.gz (file_format => myformat) t);
>
> SELECT * FROM table1;
>
> +--------------+-----------------+---------------------------+-------------------------+-------------------------------+------+------+
> | FILENAME     | FILE_ROW_NUMBER | FILE_CONTENT_KEY          | FILE_LAST_MODIFIED      |  START_SCAN_TIME              | COL1 | COL2 |
> |--------------+-----------------+---------------------------+-------------------------+-------------------------------+------+------+
> | data1.csv.gz | 1               | 39ab11bb2cdeacdcdac1234d9 | 2022-08-03 10:15:26.000 | 2023-02-02 01:31:00.778 +0000 | a    | b    |
> | data1.csv.gz | 2               | 39ab11bb2cdeacdcdac1234d9 | 2022-09-10 11:15:55.000 | 2023-02-02 01:31:00.778 +0000 | c    | d    |
> +--------------+-----------------+---------------------------+-------------------------+-------------------------------+------+------+
> ```

---
title: Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-query-using-microsoft-fabric.md
section: User Guide
---

# Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric

To view Snowflake-managed Iceberg tables in Microsoft Fabric, you can connect a standard Snowflake database to Fabric.

This topic provides the steps for you to connect a standard Snowflake database to Fabric, which syncs the database with Fabric. When you connect a
database, you can either select an existing database or create a new one.
You can then view any Snowflake-managed Iceberg tables in the database in Fabric.

For more information about
Microsoft OneLake Fabric, see
[OneLake in Microsoft Fabric](https://learn.microsoft.com/en-us/fabric/onelake/)
in the Microsoft Fabric documentation.

## Prerequisites

Before you begin, complete the following prerequisites for Microsoft Fabric and Snowflake.

**Microsoft Fabric**

* Create a Microsoft Fabric account. For more information, see [Get started with Microsoft Fabric](https://www.microsoft.com/microsoft-fabric/getting-started).
* Create a workspace in your Fabric account. For instructions, see [Create a workspace](https://learn.microsoft.com/en-us/fabric/fundamentals/create-workspaces)
  in the Microsoft Fabric documentation. You use this workspace to query Snowflake-managed Iceberg tables.

  > **Note:**
  >
  > We recommend that you name your Fabric workspace by using only alphanumeric characters. If your Fabric workspace name contains special
  > characters or non-alphanumeric characters such as spaces, you will need to copy the ID of the workspace for specifying this ID later.
  > To find your workspace ID, open your workspace in the Fabric UI, and then refer to the URL in your browser.
* You must be an administrator of the Fabric workspace.
* Your Fabric tenant administrator must enable the Enable Snowflake database item (Preview) tenant setting or delegate this decision
  to your Fabric capacity administrator. You can enable this setting in the admin portal of the Fabric web UI. To get to the admin portal,
  see [How to get to the admin portal](https://learn.microsoft.com/en-us/fabric/admin/admin-center#how-to-get-to-the-admin-portal) in the
  Microsoft Fabric documentation. You can enable this setting at the tenant level, have it delegated to Fabric capacity administrators, or have it enabled
  only for certain security groups.

**Snowflake**

* You must have access to the ACCOUNTADMIN role or another role in Snowflake with the CREATE USER privilege on the account.
* You must have access to the ACCOUNTADMIN role or another role in Snowflake with privileges to create an external volume.
* You must have a standard database in Snowflake. For instructions, see [CREATE DATABASE](../sql-reference/sql/create-database.md). This guide refers to an
  example standard database named `SnowflakeFabricIcebergDB`.

  > **Note:**
  >
  > To complete the steps in this topic, you should have an existing standard database. The topic includes steps for you to grant privileges to that database. However, you have the
  > option to create a database when you connect a Snowflake database to Fabric.
  > If you choose to create a new database when you connect a
  > database to Fabric, you would then need to grant the necessary privileges to the database in Snowflake.

## Step 1: Find your Microsoft Fabric Tenant ID, Snowflake organization name, and Snowflake account name

To connect to Microsoft Fabric from Snowflake, you need your Microsoft Fabric Tenant ID. To connect to Microsoft OneLake from Snowflake,
you need your Snowflake organization name and Snowflake account name.

* To find your Microsoft Fabric Tenant ID, follow these steps:

  1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/) and sign in.
  2. Select ?.
  3. From the Help pane, select About Fabric.
  4. From the Fabric window, see the value for Tenant URL and copy the portion of the URL after `ctid` into a text editor.

     For example: `a111a1a1-1111-111a-a11a-1a11a11111a1`
* To find your Snowflake organization name, (`<orgname>`), and Snowflake account name (`<accountname>`), see [Finding the organization and account name for an account](admin-account-identifier.md).

## Step 2: Create a role in Snowflake

In this step, you create a role in Snowflake, and then grant it the privileges required to use your standard database and execute
a SELECT statement on tables in the database. Later, you grant this role to a user.

Complete the following steps with the ACCOUNTADMIN role:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Create a worksheet in Snowsight. For more information, see [Create worksheets in Snowsight](ui-snowsight-worksheets-gs.md).
3. Use the [CREATE ROLE](../sql-reference/sql/create-role.md) command to create a role:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE IF NOT EXISTS R_ICEBERG_METADATA;
   ```
4. To grant the Iceberg metadata role privileges to a standard database, follow this example, which grants them to a
   `SnowflakeFabricIcebergDB` database:

   ```sqlexample
   BEGIN
      LET db STRING := 'SnowflakeFabricIcebergDB';
      EXECUTE IMMEDIATE 'GRANT USAGE ON DATABASE ' || db || ' TO ROLE R_ICEBERG_METADATA';
      EXECUTE IMMEDIATE 'GRANT USAGE ON ALL SCHEMAS IN DATABASE ' || db || ' TO ROLE R_ICEBERG_METADATA';
      EXECUTE IMMEDIATE 'GRANT USAGE ON FUTURE SCHEMAS IN DATABASE ' || db || ' TO ROLE R_ICEBERG_METADATA';
      EXECUTE IMMEDIATE 'GRANT SELECT ON ALL ICEBERG TABLES IN DATABASE ' || db || ' TO ROLE R_ICEBERG_METADATA';
      EXECUTE IMMEDIATE 'GRANT SELECT ON FUTURE ICEBERG TABLES IN DATABASE ' || db || ' TO ROLE R_ICEBERG_METADATA';
   END;
   ```
5. To grant the role the permissions to run queries on an existing warehouse, follow this example, which grants the role
   permissions to run queries on a `COMPUTE_WH` warehouse:

   ```sqlexample
   GRANT USAGE ON WAREHOUSE COMPUTE_WH TO ROLE R_ICEBERG_METADATA;
   ```

## Step 3: Create a user in Snowflake

In this step, you create a user in Snowflake, and then grant the user with the role you created earlier. This grant allows the user to use
the standard database. Later, you specify this user’s credentials when you create a Snowflake connection in
Microsoft Fabric.

If you previously created a user in Snowflake, you can skip this step.

1. To create a user with the role you created as the default, use the [CREATE USER](../sql-reference/sql/create-user.md) command:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE USER IF NOT EXISTS SVC_FABRIC_ICEBERG_METADATA
      TYPE = LEGACY_SERVICE
      LOGIN_NAME = 'SVC_FABRIC_ICEBERG_METADATA'
      DISPLAY_NAME = 'Service - Fabric Iceberg Metadata'
      PASSWORD = '<strong_password>'
      MUST_CHANGE_PASSWORD = FALSE
      DEFAULT_ROLE = R_ICEBERG_METADATA;
   ```
2. Grant the role that you created to the user:

   ```sqlexample
   GRANT ROLE R_ICEBERG_METADATA TO USER SVC_FABRIC_ICEBERG_METADATA;
   ```

## Step 4: Create a Snowflake connection in Microsoft Fabric

In this step you create a Snowflake connection in Microsoft Fabric, which allows you to connect your standard database in Snowflake to
Microsoft Fabric.

> **Important:**
>
> If you already have an existing Snowflake connection configured in Microsoft Fabric that meets the following conditions, you can skip
> this step:
>
> > * It uses the correct Snowflake username and password credentials.
> > * It has access to the required warehouse in Snowflake.

1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/), and then sign in.
2. Select the Settings icon.
3. In Settings, select Manage connections and gateways.
4. Select + New.
5. In the New connection dialog, create a Snowflake connection:

   1. Select Cloud.
   2. For Connection name, enter a connection name.
   3. For Connection type, select Snowflake.
   4. For Server, enter your identifier for your Snowflake account:

      ```text
      https://<orgname>-<accountname>.snowflakecomputing.com
      ```

      Where:

      * `<orgname>` is the name of your Snowflake organization and `<accountname>` is the name of your Snowflake account. To find
        these names, see Step 1: Find your Microsoft Fabric Tenant ID, Snowflake organization name, and Snowflake account name.
   5. For Warehouse, enter the name of the warehouse in Snowflake that you granted the R_ICEBERG_METADATA role usage access to, such
      as `COMPUTE_WH`, when you created a role.
   6. For Authentication method, select Snowflake.
   7. For Username, enter the name of the user you created in Snowflake.
   8. For Password, enter the password for the user that you created in Snowflake.
   9. Select Create.
   > **Note:**
   >
   > For more information about creating a Snowflake connection in Microsoft Fabric,
   > see [Set up your Snowflake database connection](https://learn.microsoft.com/fabric/data-factory/connector-snowflake) in the Microsoft Fabric documentation.
6. After your connection is created, copy the Connection ID for your connection into a text editor.

   For example: `1111a111-11a1-1111-11a1-11aa1111aaa1`. You must specify this Connection ID later in Snowflake when you
   connect your Snowflake standard database to Microsoft Fabric.

## Step 5: Retrieve your Azure multi-tenant application name

In this step, you use Snowflake to retrieve your Azure multi-tenant application name. You specify this application name later when you
give your Azure multi-tenant application access to your Fabric workspace.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Add Data.
3. On the Add Data page, select Microsoft OneLake.
4. Enter your Fabric tenant ID, and then select Continue.
5. Near the top of the Create an item in Microsoft Fabric dialog, copy your Multi-tenant app name into a text editor.

## Step 6: Give your Azure multi-tenant application access to your workspace

In this step, you give Azure multi-tenant application access to your workspace in Fabric.

1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/), and then sign in.
2. Open your Microsoft Fabric workspace.

   To create a workspace, see Prerequisites.
3. Select Manage access.
4. Select + Add people or groups.
5. In the Enter name or email field, paste your Azure multi-tenant application name from Snowflake.

   To retrieve your Azure multi-tenant app
   name, see Step 5: Retrieve your Azure multi-tenant application name.
6. In the drop-down menu, select Contributor access or higher to allow the app to create the necessary Fabric item.
7. Select Add.
8. In the top-right area, select Settings, and then select Manage connections and gateways.
9. In the top-right area, search for your connection ID.

   You copied this connection ID when you created a Snowflake connection in Microsoft Fabric.
10. On the Connections tab, hover on your connection, select the … icon for your connection, and then select Manage users.
11. In the Search by name or email field, search for your multi-tenant application name, and then select it.
12. Select the appropriate level of privileges for the user.
13. To allow the multi-tenant application to use the Snowflake connection, select Share.

## Step 7: Enable access to Fabric public APIs

In this step, you enable access to Fabric public APIs by allowing your Snowflake service principal to call the Fabric public APIs.

### Step 7.1: Allow your service principal to call the Fabric public APIs

To allow your Snowflake service principal to call Fabric public APIs, follow these steps:

1. Sign in to Microsoft Fabric.
2. Go to the tenant settings. For instruction, see [How to get to the tenant settings](https://learn.microsoft.com/fabric/admin/about-tenant-settings?source=recommendations#how-to-get-to-the-tenant-settings)
   in the Microsoft Fabric documentation.
3. From the tenant settings, enable the Service principals can call Fabric public APIs setting by selecting one of the following options:

   * Entire organization
   * Specific security groups and then select the security group that will contain your Snowflake service principal.

   For more information about this setting, see [Service principals can call Fabric public APIs](https://learn.microsoft.com/en-us/fabric/admin/service-admin-portal-developer#service-principals-can-call-fabric-public-apis).

### Step 7.2: Add your multi-tenant app to the allowed security group

> **Important:**
>
> If you enabled the Service principals can call Fabric public APIs setting in Fabric for your *entire organization*,
> you can skip this step.

In this section, you add your multi-tenant app as a member of the security group that you granted access to the Fabric public APIs.

* In the Microsoft Entra admin center, add your multi-tenant app name to the allowed security group. You copied your multi-tenant app
  name when you
  retrieved your Azure multi-tenant application name.

  For instructions, see
  [Add members or owners of a group](https://learn.microsoft.com/en-us/entra/fundamentals/how-to-manage-groups#add-members-or-owners-of-a-group)
  in the Microsoft Entra documentation.

  > **Important:**
  >
  > When you add a member, search for and select your multi-tenant app name.

## Step 8: Connect your Snowflake standard database to Microsoft Fabric

In this step, you connect a standard Snowflake database to Microsoft Fabric.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Add Data.
3. On the Add Data page, select Microsoft OneLake.
4. Enter your Fabric tenant ID and select Continue.

   To find your Fabric tenant ID, see Step 1: Find your Microsoft Fabric Tenant ID, Snowflake organization name, and Snowflake account name.
5. To provide consent to the use of your Snowflake account’s multi-tenant application in your Entra tenant, select Provide consent.

   > * If you haven’t performed this step before, you should see a prompt to consent. Review the permissions, provide your consent, and then proceed to the next step.
   > * It’s possible that this step is already completed for your Snowflake account. If so, close the pop-up that appears, and then proceed to the next step.
   > * If you can’t complete the consent flow, ask your Entra tenant administrator to complete this step for you.
6. Select Continue.

   > **Note:**
   >
   > If you receive a “You have not consented to the application. Please provide the consent to continue” error message when you select
   > Continue, make sure you have enabled access to Fabric public APIs
7. In the Create an item in Microsoft Fabric dialog, fill in the fields:

   * For Fabric workspace name, enter the name of the workspace in Fabric where you want to view your Iceberg tables.
   * To validate that your connection ID is in the correct format, for Snowflake connection ID in Fabric, enter your
     Snowflake connection ID that you copied when you created a Snowflake connection in Microsoft Fabric.

     > **Note:**
     >
     > You must create a Snowflake connection object in Fabric before you can read your Snowflake-managed tables.
   * For Snowflake database, select the Snowflake database that contains the Snowflake-managed Iceberg tables that you want to view in Fabric.

     > **Note:**
     >
     > If you want to create a new Snowflake database and connect it to Fabric, select + Create a new database.
8. To create a Fabric item and database, select Continue.
9. In the Create External Volume dialog, to create an external volume, review the volume details, and then select Create Volume.

   A Fabric item is created in Microsoft Fabric and an external volume is created in Microsoft Fabric OneLake.

## Step 9: Verify your external volume

Verify the external volume you configured to check that Snowflake can successfully authenticate to your storage provider using the
external volume. For instructions, see [Verify an external volume](tables-iceberg-configure-external-volume.md).

## Step 10: Create an Iceberg table

In this step, you create a Snowflake-managed Iceberg table in your standard database in Snowflake.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Open your worksheet in Snowsight.

   For more information, see [Opening worksheets in tabs](ui-snowsight-worksheets-gs.md).
3. In your standard database, create a sample Iceberg table:

   ```sqlexample
   CREATE ICEBERG TABLE SnowflakeFabricIcebergDB.PUBLIC.SampleIcebergTable (
      id INT,
      name STRING
   )
   CATALOG = 'SNOWFLAKE';
   ```
4. In the sample Iceberg table, insert two rows:

   > ```sqlexample
   > INSERT INTO SnowflakeFabricIcebergDB.PUBLIC.SampleIcebergTable VALUES
   >    (1, 'Alice'),
   >    (2, 'Bob');
   > ```

## Step 11: View the Iceberg table in Fabric

1. Navigate to [Microsoft Fabric](https://app.fabric.microsoft.com/), and then sign in.
2. Open your workspace.

   You should see a new Snowflake database item named after your database. If needed, refresh the page.
3. Where you created your table in Snowflake, open the database item and schema.

   You should see the Iceberg table you created in Snowflake. As you update the table in Snowflake, you can refresh
   the table updates in Microsoft Fabric.
4. In the upper-right corner, select SQL analytics endpoint.

   You can use SQL to interact with your table or try using other Fabric workloads to query this table alongside your other Fabric data.

---
title: Querying ACCOUNT_USAGE and INFORMATION_SCHEMA views for semantic view information
source: https://docs.snowflake.com/en/user-guide/views-semantic/views.md
section: User Guide
---

# Querying ACCOUNT_USAGE and INFORMATION_SCHEMA views for semantic view information

You can query the following views for information about semantic views:

* In the ACCOUNT_USAGE schema:

  + [SEMANTIC_VIEWS view](../../sql-reference/account-usage/semantic_views.md)
  + [SEMANTIC_TABLES view](../../sql-reference/account-usage/semantic_tables.md)
  + [SEMANTIC_RELATIONSHIPS view](../../sql-reference/account-usage/semantic_relationships.md)
  + [SEMANTIC_FACTS view](../../sql-reference/account-usage/semantic_facts.md)
  + [SEMANTIC_DIMENSIONS view](../../sql-reference/account-usage/semantic_dimensions.md)
  + [SEMANTIC_METRICS view](../../sql-reference/account-usage/semantic_metrics.md)
* In the INFORMATION_SCHEMA schema:

  + [SEMANTIC_VIEWS view](../../sql-reference/info-schema/semantic_views.md)
  + [SEMANTIC_TABLES view](../../sql-reference/info-schema/semantic_tables.md)
  + [SEMANTIC_RELATIONSHIPS view](../../sql-reference/info-schema/semantic_relationships.md)
  + [SEMANTIC_FACTS view](../../sql-reference/info-schema/semantic_facts.md)
  + [SEMANTIC_DIMENSIONS view](../../sql-reference/info-schema/semantic_dimensions.md)
  + [SEMANTIC_METRICS view](../../sql-reference/info-schema/semantic_metrics.md)

---
title: Querying data protected by differential privacy
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-analyst.md
section: User Guide
---

# Querying data protected by differential privacy

This topic helps an analyst run queries against data protected by differential privacy (that is, privacy-protected tables and views), and
understand and adjust the results returned by the queries.

To execute a query against a privacy-protected table, a user must have the SELECT privilege on the table.

**Limitations**

* Differential privacy supports a subset of Snowflake data types, operators, query syntax, and functions. For a list of supported SQL that
  you can use in a query, see [Differential privacy SQL reference](differential-privacy-sql-reference.md).
* Queries against privacy-protected tables take longer because Snowflake must run additional computations to determine how much
  noise to add. For basic queries, this latency is at least 7 seconds. Complex
  queries, such as the following, can take much longer:

  + Queries with many joins and sub-queries.
  + Queries that output multiple rows in the result, for example, when using GROUP BY clauses that result in hundreds or thousands of
    groups.
* In tables protected by differential privacy, fields in the outermost SELECT clause can only have aggregation, GROUP BY, or
  DP_INTERVAL_LOW/HIGH applied. Other actions, such as math and concatenation, are not allowed. Examples:

  + `SELECT key, COUNT(*) AS 'c', DP_INTERVAL_LOW('c') FROM T GROUP BY key`

    **Succeeds:** No unsupported actions taken on `key`, `COUNT(*)`, or `c`.
  + `SELECT key, 1 + COUNT(*) AS 'c', DP_INTERVAL_LOW('c') FROM T GROUP BY key`

    **Fails:** `1 + COUNT(*)` is specified on a field in the outermost SELECT clause.
  + `SELECT key, COUNT(1 + x) AS 'cnt', DP_INTERVAL_LO('cnt') FROM T GROUP BY key`

    **Succeeds:** `COUNT`, a permitted aggregation, happens after `1 + x` in the outermost SELECT clause.
  + `SELECT key, COUNT(x) AS 'c', DP_INTERVAL_LOW('c') FROM (SELECT key, 1 + income AS x FROM table) GROUP BY key`

    **Succeeds:** `1 + income` is applied in a nested SELECT clause, which is allowed.

## Query Fundamentals

This section discusses the basic components of a query that will succeed when run against a privacy-protected table. It includes:

* Aggregating data
* Using joins
* Querying data protected by entity-level privacy

### Aggregating data

All queries against a privacy-protected table must aggregate results rather than retrieve individual records. Not every part of a query
needs to use an aggregation function as long as the final result is aggregated.

With the exception of a COUNT function, a query cannot aggregate a column unless the column has a
[privacy domain](differential-privacy-privacy-domains.md).

For a list of supported aggregations, see [Aggregate functions](differential-privacy-sql-reference.md).

### Using joins

The following sections provide guidelines for using joins in a differentially private query:

* Join operators
* Supported joins
* Using entity keys in joins
* Data types and privacy domains
* Uniqueness requirement

To learn about the implications that joining two privacy-protected tables has on privacy domains, see
[Privacy domains and joins](differential-privacy-privacy-domains.md).

#### Join operators

Each join must be an equi join that uses a single operator. For example, `t1.c1 == t2.c1` is supported, but `col1 > col2` and
`col1 + 10 = col2` are not. Unconditioned joins are not supported.

Joins must use the JOIN operator. The WHERE syntax for joins is not supported. For more information about join
syntax, see [Implementing joins](../querying-joins.md).

#### Supported joins

Joins in a differentially private query must be one of the following:

* INNER
* { LEFT | RIGHT | FULL } OUTER
* NATURAL

Both sides of the join must have the same query pattern. For example, the following joins are supported:

**Both sides are identifiers**

> ```sqlexample
> SELECT COUNT(*)
> FROM t1 INNER JOIN t2 ON t1.a=t2.a;
> ```

**Both sides are subqueries**

> ```sqlexample
> SELECT COUNT(*)
> FROM (SELECT a, COUNT(b) FROM t1 GROUP BY a) AS g1
>     INNER JOIN (SELECT * FROM t2) AS g2
>     ON g1.a=g2.a;
> ```

Joining an identifier with a subquery is currently not supported.

For information about the supported query syntax related to joins, see [Query syntax](differential-privacy-sql-reference.md).

#### Using entity keys in joins

When working with tables protected with [entity-level privacy](differential-privacy-admin.md), you can minimize the amount
of noise by including the entity key column as part of the join key, especially if it doesn’t semantically change the query.

For example, consider the following tables where the entity is customers:

> | Table | Description |
> | --- | --- |
> | `customers` | Customer directory, where each row is a customer and has a `customer_id`. |
> | `transactions` | Customer transactions, where each customer can have multiple transactions. |
> | `transaction_lines` | Unique items that were purchased in a transaction. There can be multiple rows in a single transaction. |

If they are following best practices, the data provider has structured the data so that each of these tables has the entity key
`customer_id`. For this data schema, each transaction line can only belong to one transaction, and each transaction can only belong to
one customer. This relationship is not evident from the data itself, so without additional information the amount of noise added for
differential privacy will be higher than it needs to be.

You can minimize the amount of noise by including the entity key `customer_id` as part of the join key, even if it is redundant. For
example, joining the table `transactions` with `transaction_lines` typically only requires the join key `transaction_id`. However,
joining on both `transaction_id` and `customer_id` will result in a lower amount of noise.

#### Data types and privacy domains

When joining two tables, the data types of the join key columns from either side must be the same. For differential privacy, the data type
of a column includes whether or not it has a [privacy domain](differential-privacy-privacy-domains.md).

For example, if you had a privacy-protected table `transactions` and an unprotected table `product_lookup`, and you wanted to join them
on `product_id`, the `product_id` column in both tables must be the same data type (for example, a string) and must each have a privacy
domain.

To meet this requirement, the administrator for the analyst might need to define a privacy domain just like the data provider defines them.
For information on how to set a privacy domain for a table, see [Setting a privacy domain](differential-privacy-privacy-domains-admin.md).

#### Uniqueness requirement

Joins can potentially duplicate rows of data, which can cause the amount of noise added to a query result to become unbounded. To ensure
that privacy-protected data is not duplicated in a join, the join key (that is, the columns on which the tables are joined) for
privacy-protected tables must match only one record in the other table. This means that when joining with a privacy-protected table, the
join key on the opposite side must be de-duplicated.

> **Important:**
>
> The uniqueness requirement for joins doesn’t always apply to queries against tables that are protected by [entity-level privacy](differential-privacy-admin.md). For entity-level privacy, queries must de-duplicate on the entity key before the aggregation.
> As long as this is done after a join but before the aggregation, the join doesn’t need to be on de-duplicated data. For more
> information about meeting these requirements, see Querying data protected by entity-level privacy.

To satisfy the uniqueness requirement for joins, the query can use a GROUP BY on a subset of the join columns to group duplicate rows into
one result.

For example, suppose the `patients` table is protected by differential privacy and the `geo_lookup` table is not. The analyst wants to
join these two tables on `zip_code` so that they can filter the `patients` table on `State`. In order to ensure that the records in
the privacy-protected `patients` table are not duplicated, the query must de-duplicate the `zip_code` table on the join key. This must
be done explicitly even if the `geo_lookup` table is already unique on `zip_code`. This ensures that Snowflake can correctly account
for privacy.

```sqlexample
SELECT COUNT(*)
  FROM patients
  LEFT JOIN (SELECT zip_code, ANY_VALUE(state) AS residence_state
            FROM geo_lookup
            GROUP BY zip_code)
  USING zip_code
  WHERE birth_state = residence_state;
```

### Querying data protected by entity-level privacy

Most data providers use an entity key to implement [entity-level privacy](differential-privacy-admin.md) when configuring
differential privacy. When a table is protected by entity-level privacy, Snowflake does not allow aggregates on fields if there might be an
unbounded number of rows per entity. This means queries must meet the following requirements:

* At some point in the query, the privacy-protected table must be deduplicated on the entity key. Operations that can be used to deduplicate
  data are:

  + COUNT( DISTINCT <entity_key_column> )
  + GROUP BY <entity_key_column>
  + UNION (but not UNION ALL) when only the entity key is projected.
* If a join uses a join key other than the entity key column, that join cannot occur between the deduplication and the final SELECT clause
  with aggregation.

> **Note:**
>
> If the data provider implemented row-level privacy, the deduplication requirement for joins is different. For more information about these
> requirements, see Uniqueness requirement.

To help illustrate the requirements for entity-level privacy, suppose you have a privacy-protected table `patients` with the entity key
column `patient_id`. You also have a non-sensitive, unprotected table `geo_lookup`. The following examples show a query that fails
followed by a re-written version that succeeds.

Example: Deduplication
:   The following query fails because it doesn’t meet the deduplication requirement. Even though the table `patients` might already be
    unique on `patient_id`, the query fails because it does not explicitly deduplicate.

    ```sqlexample
    SELECT COUNT(*)
      FROM patients
      WHERE insurance_type = 'Commercial';
    ```

    To re-write the query so it succeeds, include a distinct count on the entity key column in order to explicitly deduplicate on the entity
    key. For example:

    ```sqlexample
    SELECT COUNT(DISTINCT patient_id)
      FROM patients
      WHERE insurance_type = 'Commercial';
    ```

Example: Location of join
:   The following query fails even though it is using a GROUP BY clause to meet the deduplication requirement. It fails because the table is
    being joined with another table using a column that is not the entity key column.

    ```sqlexample
    SELECT COUNT(bmi)
      FROM (SELECT patient_id, ANY_VALUE(zip_code) AS zip_code
        FROM patients
        GROUP BY patient_id) AS p
      JOIN geo_lookup AS g
        ON p.zip_code = g.zip_code
      WHERE state='CA';
    ```

    To re-write the query so it succeeds, use the GROUP BY clause *after* the join. The join cannot occur in between the deduplication and
    the SELECT clause with aggregation.

    ```sqlexample
    SELECT COUNT(bmi)
      FROM (SELECT patient_id, ANY_VALUE(bmi) as bmi, ANY_VALUE(state) as state
          FROM patients AS p
          JOIN geo_lookup AS g
            ON p.zip_code = g.zip_code
          GROUP BY patient_id)
      WHERE state='CA';
    ```

#### Executing transaction-level queries

The deduplication requirement for entity-level differential privacy does not prevent you from executing transaction-level queries. However,
you must first group the data to the entity-level, and then count those groups.

For example, suppose you have a table `doctor_visits` and that the data provider has defined an entity key `patient_id` to implement
entity-level privacy. A transaction-level query might be: “How many doctor visits weren’t for a regular checkup?” The following is an
example of how to write this query:

```sqlexample
SELECT COUNT(num_visits)
  FROM (SELECT COUNT((visit_reason<>'Regular checkup')::INT) AS num_visits
        WHERE visit_reason IS NOT NULL
        GROUP BY patient_id)
  WHERE num_visits > 0 AND num_visits < 20;
```

* The subquery groups by `patient_id` to deduplicate the data.
* The aggregate column `num_visits` captures the number of visits per patient that were not for a regular checkup.
* The query then aggregates again on that per-patient column to get the total number of visits.
* The WHERE clause on the outer query is required in order to
  [specify a privacy domain on the data](differential-privacy-privacy-domains-analyst.md).

> **Note:**
>
> While not a requirement, a best practice when joining tables protected by entity-level differential privacy is to include the entity key
> column as part of the join key (if it doesn’t semantically change the query). For more information, see
> Using entity keys in joins.

## Understanding query results

Queries against a privacy-protected table don’t return the exact value of an aggregation. Differential privacy introduces
[noise](differential-privacy-overview.md) into the result so it becomes an approximation of the actual value. The returned value
differs enough from the actual value to conceal whether an individual’s data is included in the aggregation. This applies to all queries
except for a query that returns the total number of rows in the privacy-protected table, for example, `SELECT COUNT(*) FROM t`.

An analyst needs to be able to determine whether the noise introduced into the result has decreased the usefulness of the query. Snowflake
uses a *noise interval* to help analysts interpret the results. A noise interval is a closed mathematical interval that, in most cases,
includes the actual value of the aggregation. There is a 95% chance that the actual result of a query falls within the noise interval.

Adding the following functions to a query allows the analyst to use the noise interval to make decisions about the utility of a query:

* [DP_INTERVAL_LOW](../../sql-reference/functions/dp_interval_low.md) — Returns the lower bound of the noise interval. The actual value is most
  likely to be equal to or larger than this number.
* [DP_INTERVAL_HIGH](../../sql-reference/functions/dp_interval_high.md) — Returns the upper bound of the noise interval. The actual value is most
  likely to be equal to or smaller than this number.

To use these functions, pass in the alias of an aggregated column in the main query. For example, the following query returns the count of
the `num_claims` column along with the noise interval for that aggregation:

```sqlexample
SELECT COUNT(num_claims) AS count_claims,
    DP_INTERVAL_LOW(count_claims),
    DP_INTERVAL_HIGH(count_claims)
FROM t1;
```

The output might be:

```output
+----------------+----------------------------------+----------------------------------+
|  count_claims  |  dp_interval_low("count_claims") |  dp_interval_high("count_claims")|
|----------------+----------------------------------+----------------------------------+
|  50            |  35                              |    75                            |
+----------------+----------------------------------+----------------------------------+
```

In this case, the return value is a count of 50. But the analyst has also determined with 95% certainty that the actual value of the
aggregation is between 35 and 75.

> **Tip:**
>
> For information about techniques that can potentially reduce noise in results, see
>
> * [Narrowing a privacy domain to improve results](differential-privacy-privacy-domains-analyst.md)
> * Using entity keys in joins

## Tracking privacy budget spending

You can use the [ESTIMATE_REMAINING_DP_AGGREGATES](../../sql-reference/functions/estimate_remaining_dp_aggregates.md) function to estimate how many more queries you can run
within the current budget window (that is, until the cumulative privacy loss is reset to 0). The estimate is based on the number of
aggregates, not queries. For example, the query `SELECT COUNT(a), COUNT(b) FROM T` contains two aggregate functions: `COUNT(a)` and
`COUNT(b)`.

When executing the ESTIMATE_REMAINING_DP_AGGREGATES function, be sure to use the exact conditions you’re using to execute queries, for
example, the same user, role, and account.

If you’re running a query that uses multiple tables, you should run ESTIMATE_REMAINING_DP_AGGREGATES once per table, then use the lowest
`NUMBER_OF_REMAINING_DP_AGGREGATES` value as the estimated usage cap.

The following example shows how a series of queries affect how much of the privacy budget’s limit has been spent (that is, the cumulative
privacy loss of the queries) and the estimated number of remaining aggregates.

**1. Initial check**

Let’s look at privacy budget numbers on the table `my_table`. You’ve never run any queries on this table.

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('my_table'));
```

No budget used so far:

```output
+-----------------------------------+--------------+---------------+--------------+
| NUMBER_OF_REMAINING_DP_AGGREGATES | BUDGET_LIMIT | BUDGET_WINDOW | BUDGET_SPENT |
|-----------------------------------+--------------+---------------+--------------|
|                 996               |     233      |     WEEKLY    |     0.0      |
+-----------------------------------+--------------+---------------+--------------+
```

**2. Run a query**

Let’s run a query with one aggregate function and check our numbers again:

```sqlexample
SELECT COUNT(salary) FROM my_table;

-- results omitted ...

SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('my_table'));
```

Estimate of remaining aggregate calls has dropped by one and the cumulative privacy loss (budget spent) has increased.

```output
+-----------------------------------+--------------+---------------+--------------+
| NUMBER_OF_REMAINING_DP_AGGREGATES | BUDGET_LIMIT | BUDGET_WINDOW | BUDGET_SPENT |
|-----------------------------------+--------------+---------------+--------------|
|                 995               |     233      |     WEEKLY    |     0.6      |
+-----------------------------------+--------------+---------------+--------------+
```

**3. Run another query with two aggregate functions**

```sqlexample
SELECT COUNT(salary), COUNT(age) FROM my_table GROUP BY STATE;

-- results omitted ...

SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('my_table'));
```

Estimated remaining queries has dropped by two. Remember, this is an estimate.

```output
+-----------------------------------+--------------+---------------+--------------+
| NUMBER_OF_REMAINING_DP_AGGREGATES | BUDGET_LIMIT | BUDGET_WINDOW | BUDGET_SPENT |
|-----------------------------------+--------------+---------------+--------------|
|                 993               |     233      |     WEEKLY    |     1.8      |
+-----------------------------------+--------------+---------------+--------------+
```

**4. Rerun a query**

Let’s rerun a previous query to show that privacy budget is always charged, even on identical queries. A duplicate query incurs the same
privacy loss each time it runs (that is, it spends the same amount of privacy budget).

```sqlexample
SELECT COUNT(salary), COUNT(age) FROM T GROUP BY STATE;

-- results omitted ...

SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('my_table'));
```

Same charge for the query as before: 1.2 units of privacy loss.

```output
+-----------------------------------+--------------+---------------+--------------+
| NUMBER_OF_REMAINING_DP_AGGREGATES | BUDGET_LIMIT | BUDGET_WINDOW | BUDGET_SPENT |
|-----------------------------------+--------------+---------------+--------------|
|                 991               |     233      |     WEEKLY    |     3.0      |
+-----------------------------------+--------------+---------------+--------------+
```

---
title: Querying data using worksheets
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-query.md
section: User Guide
---

# Querying data using worksheets

After you [create or open a worksheet](ui-snowsight-worksheets-gs.md), you can
[manage the worksheet](ui-snowsight-worksheets.md), write and execute queries, explore query results and history,
and set up filters using Snowsight.

## Writing queries in worksheets

After you open a worksheet, you can write SQL queries and statements.

> **Note:**
>
> Multiple SQL statements in a single API call are not supported. Ensure that each SQL query in the worksheet ends with a single
> semicolon (;).

### Set worksheet context

When you set a database and optionally, a database schema, as the worksheet context, you can reference objects in the schema
without fully qualifying the object names in your query.

### Write queries with autocomplete

As you enter your script in the query editor, the autocomplete feature suggests:

* Query syntax keywords such as SQL functions or aliases.
* Values that match table or column names within a schema.

Select a function to view its syntax and a brief description.

Snowflake tracks table aliases and suggests them as autocomplete options. For example, if you execute a query using `posts as p` or
`posts p` as an alias, the next time you type `p`, the autocomplete feature suggests the alias as an option.

### Use Snowflake Copilot to write queries

Snowflake Copilot is an LLM-powered assistant that simplifies data analysis. You can use natural language requests to explore a new dataset,
generate queries or refine existing queries.

See [Using Snowflake Copilot](snowflake-copilot.md) to learn more about Snowflake Copilot and for example prompts to get you started.

### Append a SQL script to an existing worksheet

If you have a SQL script in a file, you can append it to an existing worksheet by doing the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Hover over the tab for the worksheet and select , then choose Import SQL from File.
5. Browse to the SQL file on your computer.

   The file contents are appended to your worksheet.

### Refer to database object names in worksheets

While you write queries in your worksheet, refer to the database objects relevant to the queries in the Databases explorer. You can
drill down to specific database objects, or use search to locate a database, schema, or object that you have access to.

Using the Databases explorer, you can pin databases and database objects for quick reference. When you hover over a database object,
select the Pin icon to pin them. Pinned objects appear at the top of the Databases explorer in the Pinned section.
You might need to expand the section to view of all your pinned objects.

After you locate a database object, you can place the name of the object in the worksheet that you’re editing:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Locate the database object in the Databases explorer.
5. Hover over the object name and select … more menu » Place Name in Editor.

   The fully qualified object name appears after your cursor location in the worksheet.

For database tables and views, you can also add the column names to the worksheet that you’re editing:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Locate the database object in the Databases explorer.
5. Hover over the object name and select … more menu » Add Columns in Editor.

   The comma-separated column names appear after your cursor location in the worksheet.

### Format your queries

When a worksheet is open, you can select the name of the worksheet to format the queries in your worksheet, and view the keyboard shortcuts.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Hover over the tab for the worksheet and select .
5. In the drop-down list, select Format query to format the query text for readability.

### Load data to a table

If you’re using a worksheet and want to add some data to work with, you can load data into a table without leaving your worksheet:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Select Objects to view the object explorer.
5. Locate a specific table using search or browsing.
6. Hover over a specific table name and select  » Load Data.
7. Follow the prompts to upload one or more structured or unstructured files of 50MB or less.

Refer to [Load data using Snowsight](data-load-web-ui.md) for more details.

## Executing and running queries

You can run a single query or multiple queries sequentially in the same worksheet.

* To run a single query, in the query editor link, place your cursor in the query, and then select the Run button.
* To run the entire worksheet, from the More options dropdown menu next to the Run button, select Run All.

### Running worksheets in folders

Folders no longer have a role assigned to them. An owner or editor of a worksheet in a folder can change the worksheet to run as any role.
You can also add [USE ROLE](../sql-reference/sql/use-role.md) to a worksheet in a folder to run different statements in the worksheet as different roles.

When you create a worksheet inside a folder, the worksheet is created with the role of your current session.

> **Note:**
>
> To run a worksheet in a folder that was shared with you, even if you have View and Run or Edit permissions on the folder,
> you must use the same role as the worksheet. If you do not have the same role, duplicate the worksheet and run it as one of your own roles.

## Exploring the worksheet results

> **Note:**
>
> Available to most accounts. Accounts in U.S. government regions, accounts using Virtual Private Snowflake (VPS), and accounts
> that use Private Connectivity to access Snowflake continue to see query results limited to 10,000 rows.

When you run one query or all queries in a worksheet, you see the query results.

The query results display as a table. You can navigate the query results with the arrow keys on your keyboard, as you would with a
spreadsheet. You can select columns, cells, rows, or ranges in the results table. You can copy and paste any selection.

For up to 1 million rows of results, you can review generated statistics that display contextual information for any selection,
as well as overall statistics. See Automatic contextual statistics for more details.

If you want to view your results as a chart, select Chart. For more details about charts, see
[Visualizing worksheet data](ui-snowsight-visualizations.md).

Query results are cached. For more details, see [Stored results for past worksheet versions](ui-snowsight-worksheets.md) and
[Manage worksheet history and versions](ui-snowsight-worksheets.md).

### Cost considerations for transforming query results

> **Note:**
>
> Available to most accounts. Accounts in U.S. government regions, accounts using Virtual Private Snowflake (VPS), and accounts
> that use Private Connectivity to access Snowflake are not charged when transforming query results.

Note that some column transformation activities performed on the query results of Snowsight worksheets incur
compute cost. The compute cost is billed against the same warehouse used to run the query.

For example, when you sort a column by ascending or descending order using the column options, the changes affect all of your results,
instead of just the first 10,000 rows returned, and you incur compute cost.

To identify the interactions that incur compute cost, filter the Query History page to view only SQL statements that contain the
SQL Text: `snowsight_transform_cte`.

The following transformations do not incur cost:

* Showing a thousands separator for numeric columns.
* Displaying a column as a percentage.
* Increasing or decreasing decimal precision.
* Formatting date and timestamp columns.

In addition, transformations performed by the recipient of a shared worksheet operating on a limited set of results do not incur cost.
For more details about shared worksheet results, see [Viewing results for past runs of a worksheet](ui-snowsight-worksheets.md).

For more details about compute cost, see [Exploring compute cost](cost-exploring-compute.md).

### Automatic contextual statistics

Select columns, cells, rows, or ranges in the results table to view relevant information about the selected data in the inspector pane (to
the right of the results table). Contextual statistics are automatically generated for all column types. The statistics are intended to help
you make sense of your data at a glance.

The column overview displays a preview of the statistics for each column. Select a column from the inspector or the column header to view
detailed column statistics.

The statistics pane generates different metrics for different types of columns. You can interact with and filter using the items in the
statistics pane.

Filled/empty meters
:   All columns show how many rows are filled and empty. Columns displaying some data types, such as email and JSON, also indicate the
    number of invalid rows.

Histograms
:   Displayed for all date, time, and numeric columns.

    The histogram indicates the rows that fall into a particular range. Click a bar or drag over the histogram to select a range. You can
    fine tune your selection by clicking the value labels above the histogram to input specific values.

Frequency distributions
:   Displayed for all categorical columns. Categorical columns are text columns where the same values are used more than once.

Email domain distributions
:   Displayed for email columns. The email domain distribution shows the frequency distribution of domain name occurrences.

Key distributions
:   Displayed for JSON columns. The key distribution shows the frequency of the top keys present in the result set if all the rows contain
    JSON objects. If the column includes JSON arrays, the key distribution shows the relative types of JSON values in the column.

### View query details

The Query Details includes information about the execution of the query, including:

* The duration of the query execution.
* The number of rows in the results.
* When the execution completed.
* The quantity of data scanned by the query.
* The role used to execute the query.
* The warehouse used to execute the query.

Some query details are only available for only 14 days.

### View the query profile

To access a detailed profile of your query, on the Query Details pane select the … more menu » View Query Profile.

The query profile opens in a new browser tab.

For information on reviewing the query profile, see [Review Query Profile](ui-snowsight-activity.md).

### Download your query results

To download your query results as a CSV-formatted or TSV-formatted file, select Download results.

The size of your file depends on the amount of data returned by your query. Snowflake does not limit the size of files
exported for query results.

### View query history

After you run SQL in a worksheet, you can review the history of queries run in the worksheet, for example to compare results of different
query runs. You must use the same role as the worksheet to view the query history for the worksheet.

When the Results pane is visible, select  (Query history) to review the queries that have been
run in the worksheet, as well as the results for those queries.
The history includes up to 25 queries run in that worksheet during your current session and previous sessions over the last 14 days.

You can review the following information:

* The status of a query that is in progress.
* What time the query was run.
* How long the query took to run, in milliseconds or seconds.
* Which query was run.
* The query ID.

Select a row to see the results for that query execution in the Results pane. If you do not have the primary role used to run a query
that you view in Query history, you cannot view the results for that query. Subqueries spawned by stored procedures or Python
worksheets do not display.

To filter the query history for the worksheet by status, warehouse, or other aspects:

* Filter the query executions by status. For example, review queries that are still in the Running or Queued status
  and do not yet display results.
* Select  to filter by warehouse, SQL text in the query, a specific query ID, or a duration greater
  than a specific time period.

Hover over a query execution row to see a full preview of the SQL statement that was run, copy the query ID, and optionally open the
query details for the query execution. See [Review Query History in Snowsight](ui-snowsight-activity.md) for more information about query details.

### Query history data redacted from a Snowflake Native App

For queries related to a Snowflake Native App, the `query_text` and `error_message` fields are redacted
from the [query history](ui-snowsight-activity.md) in the following contexts:

* Queries run when the app is installed or upgraded.
* Queries that originate from a child job of a stored procedure owned by the app.

In each of these situations, the cell of the query history in Snowsight appears blank.

---
title: Querying Hierarchical Data
source: https://docs.snowflake.com/en/user-guide/queries-hierarchical.md
section: User Guide
---

# Querying Hierarchical Data

This topic describes how to store and query hierarchical data using:

* JOINs
* Recursive CTEs (common table expressions)
* CONNECT BY

See also:
:   [CONNECT BY](../sql-reference/constructs/connect-by.md) , [the recursive CTE portion of the WITH command](../sql-reference/constructs/with.md) , [Working with CTEs (Common Table Expressions)](queries-cte.md) , [Tabular SQL UDFs (UDTFs)](../developer-guide/udf/sql/udf-sql-tabular-functions.md)

## Storing Hierarchical Data

Many types of data are best represented as a hierarchy, such as a tree.

For example, employees are usually organized in a hierarchy, with a company President at the top of the hierarchy.

Another example of a hierarchy is a “parts explosion”. For example, a car contains an engine; an engine contains a
fuel pump; and a fuel pump contains a hose.

You can store hierarchical data in:

* A hierarchy of tables.
* A single table with one (or more) columns representing the hierarchy (e.g. indicating each employee’s direct manager).

Both techniques are described below.

> **Note:**
>
> This topic focuses on hierarchical data stored as *structured* data. Hierarchical data can also be stored as semi-structured
> data (e.g. JSON data can be stored in ARRAY, OBJECT, or VARIANT data types). For information about semi-structured data, see:
>
> > * [Introduction to loading semi-structured data](semistructured-intro.md)
> > * [Querying Semi-structured Data](querying-semistructured.md)

### Hierarchical Data Across Multiple Tables

Relational databases often store hierarchical data by using different tables. For example, one table might
contain “parent” data and another table might contain “child” data. When the entire hierarchy is known in advance,
one table can be created for each layer in the hierarchy.

For example, consider a Human Resources database that stores employee information and manager information. If the
company is small, then there might be only two levels, for example, one manager and two employees.

> ```sqlexample
> CREATE OR REPLACE TABLE managers  (title VARCHAR, employee_ID INTEGER);
> ```
>
> ```sqlexample
> CREATE OR REPLACE TABLE employees (title VARCHAR, employee_ID INTEGER, manager_ID INTEGER);
> ```
>
> ```sqlexample
> INSERT INTO managers (title, employee_ID) VALUES
>     ('President', 1);
> INSERT INTO employees (title, employee_ID, manager_ID) VALUES
>     ('Vice President Engineering', 10, 1),
>     ('Vice President HR', 20, 1);
> ```

### Hierarchical Data in a Single Table

In some situations, the number of levels in the hierarchy might change.

For example, a company that started with a two-level hierarchy (President and other employees) might increase the
number of levels as the company grows. The company might expand to include a President, Vice Presidents, and regular
employees.

If the number of levels is unknown, so that it is not possible to create a hierarchy with a known number of tables,
then in some cases the hierarchical data can be stored in one table. For example, a single table can contain all
employees, and can include a column that stores each employee’s manager_ID, which points to another employee in that
same table. For example:

> ```sqlexample
> CREATE OR REPLACE TABLE employees (title VARCHAR, employee_ID INTEGER, manager_ID INTEGER);
> ```
>
> ```sqlexample
> INSERT INTO employees (title, employee_ID, manager_ID) VALUES
>     ('President', 1, NULL),  -- The President has no manager.
>         ('Vice President Engineering', 10, 1),
>             ('Programmer', 100, 10),
>             ('QA Engineer', 101, 10),
>         ('Vice President HR', 20, 1),
>             ('Health Insurance Analyst', 200, 20);
> ```

Storing an entire hierarchy of data in one table works best if all levels
of the hierarchy store the same data – in our example, employee ID, title, etc.
If the data at different levels doesn’t fit the same record structure, then
storing all the data in one table might not be practical.

## Using Joins to Query Hierarchical Data

In a two-level hierarchy (for example, managers and employees), the data can be queried with a two-way join:

> ```sqlexample
> SELECT
>         employees.title,
>         employees.employee_ID,
>         managers.employee_ID AS MANAGER_ID,
>         managers.title AS "MANAGER TITLE"
>     FROM employees, managers
>     WHERE employees.manager_ID = managers.employee_ID
>     ORDER BY employees.title;
> +----------------------------+-------------+------------+---------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MANAGER TITLE |
> |----------------------------+-------------+------------+---------------|
> | Vice President Engineering |          10 |          1 | President     |
> | Vice President HR          |          20 |          1 | President     |
> +----------------------------+-------------+------------+---------------+
> ```

In a three-level hierarchy, you can use a 3-way join:

> ```sqlexample
> SELECT
>      emps.title,
>      emps.employee_ID,
>      mgrs.employee_ID AS MANAGER_ID,
>      mgrs.title AS "MANAGER TITLE"
>   FROM employees AS emps LEFT OUTER JOIN employees AS mgrs
>     ON emps.manager_ID = mgrs.employee_ID
>   ORDER BY mgrs.employee_ID NULLS FIRST, emps.employee_ID;
> +----------------------------+-------------+------------+----------------------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MANAGER TITLE              |
> |----------------------------+-------------+------------+----------------------------|
> | President                  |           1 |       NULL | NULL                       |
> | Vice President Engineering |          10 |          1 | President                  |
> | Vice President HR          |          20 |          1 | President                  |
> | Programmer                 |         100 |         10 | Vice President Engineering |
> | QA Engineer                |         101 |         10 | Vice President Engineering |
> | Health Insurance Analyst   |         200 |         20 | Vice President HR          |
> +----------------------------+-------------+------------+----------------------------+
> ```

This concept can be extended to as many levels as needed, as long as you know how many levels are needed. But if
the number of levels changes, the queries need to change.

## Using CONNECT BY or Recursive CTEs to Query Hierarchical Data

Snowflake provides two ways to query hierarchical data in which the number of levels is not known in advance:

* Recursive CTEs (common table expressions).
* `CONNECT BY` clauses.

A recursive CTE allows you to create a [WITH](../sql-reference/constructs/with.md) clause that can refer to itself. This lets
you iterate through each level of your hierarchy and accumulate results.

A `CONNECT BY` clause allows you to create a type of `JOIN` operation that processes the hierarchy one level
at a time, and allows each level to refer to data in the prior level.

For more details, see:

* [WITH](../sql-reference/constructs/with.md) and [Working with CTEs (Common Table Expressions)](queries-cte.md).
* [CONNECT BY](../sql-reference/constructs/connect-by.md).

## Differences between Self-Join, Recursive CTE, and CONNECT BY

`CONNECT BY` allows only self-joins. Recursive CTEs are more flexible and allow a table to be joined to one
or more other tables.

A `CONNECT BY` clause has most of the power of a recursive CTE. However,
a recursive CTE can do some things that a `CONNECT BY` cannot.

For example, if you look at the recursive CTE examples, you see that one
of the queries indents the output and also sorts the output so that each
“child” appears underneath the corresponding “parent”. The sorting is done
by creating a sort key that contains the chain of IDs from the top all the
way down to the current level. In the manager/employee example, the chain
contains the President’s ID, followed by the Vice President’s ID, etc.
This sort key groups rows in a way that looks similar to a sideways tree.
The `CONNECT BY` syntax doesn’t support this because the
“START WITH” clause does not allow the code to specify additional columns
(beyond those in the table itself), such as the sort_key. Contrast the two
code snippets below:

```sqlexample
SELECT indent(LEVEL) || employee_ID, manager_ID, title
  FROM employees
    -- This sub-clause specifies the record at the top of the hierarchy,
    -- but does not allow additional derived fields, such as the sort key.
    START WITH TITLE = 'President'
    CONNECT BY ...

WITH RECURSIVE current_layer
   (employee_ID, manager_ID, sort_key) AS (
     -- This allows us to add columns, such as sort_key, that are not part
     -- of the employees table.
     SELECT employee_ID, manager_ID, employee_ID AS sort_key
     ...
     )
```

You can, however, use the `SYS_CONNECT_BY_PATH` function to achieve a similar effect with the
`CONNECT BY` clause.

Although the `CONNECT BY` clause version is limited because the START WITH
clause cannot add columns to those already in the row (even derived columns
based on values already in the row), it also has some advantages:

* You have access to all columns of each row without specifying those columns
  in a column list. In a recursive CTE, the recursive clause
  does not have access to columns that are not explicitly specified in the CTE.
* In a recursive CTE, you must specify the columns in the
  CTE, and the projection lists of the selects in the anchor clause and the
  recursive clause, must both match the columns in the CTE. If the order of
  the columns in the various projection clauses does not match, you can
  cause problems such as infinite loops.
* The `CONNECT BY` syntax supports convenient pseudo-columns such as `LEVEL`,
  `CONNECT_BY_ROOT`, and `CONNECT_BY_PATH`

A minor difference between `CONNECT BY` and recursive CTE is that in `CONNECT BY`
you use the keyword `PRIOR` to indicate which column values should be taken
from the previous iteration, whereas in a recursive CTE you use the table
name and the CTE name to indicate which values are taken from the current
iteration and which are taken from the previous iteration. (In a recursive CTE,
you can also distinguish between current and previous iterations by using
different column names in the CTE column list than in the source table or table
expression.)

## Non-Contiguous Hierarchies

This topic described hierarchies and how parent-child relationships
can be used by recursive CTEs (common table expressions) and `CONNECT BY`
clauses. In all of this topic’s examples, as well as all the examples in the
`CONNECT BY` documentation and the recursive CTE documentation, the hierarchies are
contiguous. None of the examples has a parent and a grandchild without having a corresponding child between them.

For example, if you do a “parts explosion” of a car, you’re not going to have
a component for the car, and a component for the tire, without having a
component for the wheel that contains the tire (and that is contained by the
car).

However, there can be cases where data is incomplete. For example, in an
employee/manager hierarchy, suppose that the Vice President of Engineering
retires and the company doesn’t hire a replacement immediately. If the
VP’s employee record is deleted, then employees below that VP are “cut off”
from the rest of the hierarchy, so the employees table no longer contains a single contiguous hierarchy.

If you use recursive CTEs or `CONNECT BY` to process data, you need to think about whether the data in your table
represents a single, contiguous tree. You can use recursive CTEs and `CONNECT BY` on
a single table that contains multiple trees, but you can only
query one tree at a time, and that tree must be contiguous.

---
title: Querying semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/querying.md
section: User Guide
---

# Querying semantic views

To query a semantic view, you can use a standard [SELECT statement](../../sql-reference/constructs.md). Within this statement, you
can use one of the following approaches:

* Specify the SEMANTIC_VIEW clause in the FROM clause. For example:

  ```sqlexample
  SELECT * FROM SEMANTIC_VIEW(
      tpch_analysis
      DIMENSIONS customer.customer_market_segment
      METRICS orders.order_average_value
    )
    ORDER BY customer_market_segment;
  ```

  For information, see Specifying the SEMANTIC_VIEW clause in the FROM clause.
* Specify the name of the semantic view in the FROM clause. For example:

  ```sqlexample
  SELECT customer_market_segment, AGG(order_average_value)
    FROM tpch_analysis
    GROUP BY customer_market_segment
    ORDER BY customer_market_segment;
  ```

  For information, see Specifying the name of the semantic view in the FROM clause.

## Privileges required to query a semantic view

If you are using a role that does not own the semantic view, you must be granted the SELECT privilege on that semantic view to
query that semantic view.

> **Note:**
>
> To query a semantic view, you don’t need the SELECT privilege on the tables used in the semantic view. You only need the
> SELECT privilege on the semantic view itself.
>
> This behavior is consistent with [the privileges required to query standard views](../views-introduction.md).

For information about granting privileges on semantic views, see [Granting privileges on semantic views](sql.md).

## Specifying the SEMANTIC_VIEW clause in the FROM clause

To query a semantic view, you can specify the [SEMANTIC_VIEW clause](../../sql-reference/constructs/semantic_view.md) in the FROM
clause.

The following example selects the `customer_market_segment` dimension and the `order_average_value` metric from the
`tpch_analysis` semantic view, [which you defined earlier](example.md):

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_market_segment
    METRICS orders.order_average_value
  )
  ORDER BY customer_market_segment;
```

```output
+-------------------------+---------------------+
| CUSTOMER_MARKET_SEGMENT | ORDER_AVERAGE_VALUE |
+-------------------------+---------------------+
| AUTOMOBILE              |     142570.25947219 |
| FURNITURE               |     142563.63314267 |
| MACHINERY               |     142655.91550608 |
| HOUSEHOLD               |     141659.94753445 |
| BUILDING                |     142425.37987558 |
+-------------------------+---------------------+
```

Note that you can define an alias for a dimension or metric by specifying the alias after the dimension or metric name. You can
also specify the optional keyword AS before the alias. The following example runs the same query but uses the aliases `segment`
and `average` for the dimension and metric returned in the results.

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_market_segment AS segment
    METRICS orders.order_average_value average
  )
  ORDER BY segment;
```

```output
+------------+-----------------+
| SEGMENT    |         AVERAGE |
|------------+-----------------|
| AUTOMOBILE | 142570.25947219 |
| BUILDING   | 142425.37987558 |
| FURNITURE  | 142563.63314267 |
| HOUSEHOLD  | 141659.94753445 |
| MACHINERY  | 142655.91550608 |
+------------+-----------------+
```

The following example selects the `customer_name` dimension and the `c_customer_order_count` fact from the
`tpch_analysis` semantic view:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_name
    FACTS customer.c_customer_order_count
  )
  ORDER BY customer_name
  LIMIT 5;
```

```output
+--------------------+------------------------+
| CUSTOMER_NAME      | C_CUSTOMER_ORDER_COUNT |
|--------------------+------------------------|
| Customer#000000001 |                      9 |
| Customer#000000002 |                     11 |
| Customer#000000003 |                      0 |
| Customer#000000004 |                     20 |
| Customer#000000005 |                     10 |
+--------------------+------------------------+
```

### Guidelines for specifying the SEMANTIC_VIEW clause

When specifying the SEMANTIC_VIEW clause, follow these guidelines:

* In the SEMANTIC_VIEW clause, you must specify at least one of the following clauses:

  + METRICS
  + DIMENSIONS
  + FACTS

  You cannot omit all of these clauses from the SEMANTIC_VIEW clause.
* When specifying a combination of these clauses, note the following:

  + You cannot specify FACTS and METRICS in the same SEMANTIC_VIEW clause.
  + Although you can specify both FACTS and DIMENSIONS in a query, you should do so only if the dimensions can uniquely determine
    the facts.

    The query groups the results by dimensions. if the facts do not depend on the dimensions, the results can be
    non-deterministic.
  + If you specify both FACTS and DIMENSIONS, all facts and dimensions used in the query (including those specified in the WHERE
    clause) must be defined in the same logical table.
  + If you specify a dimension and a metric, the logical table for the dimension must be related to the logical table for the
    metric.

    In addition, the logical table for the dimension must have an equal or lower level of granularity than the logical table for
    the metric.

    To determine which dimensions meet this criteria, you can run the
    [SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md) command.

    For details, see Choosing the dimensions that you can return for a given metric.
* In the DIMENSIONS clause, you can specify an expression that refers to a fact. Similarly, in the FACTS clause, you can specify
  an expression that refers to a dimension. For example:

  ```sqlexample
  -- Dimension expression that refers to a fact
  DIMENSIONS my_table.my_fact

  -- Fact expression that refers to a dimension
  FACTS my_table.my_dimension
  ```

  One of the main differences between using DIMENSIONS and FACTS is that the query groups the results by the dimensions and
  expressions specified in the DIMENSIONS clause.
* In the METRICS clause, you can specify an expression that includes:

  + A scalar expression referring to metrics.
  + An aggregation of dimensions or facts.
* Specify the METRICS, DIMENSIONS, and FACTS clauses in the order in which you want them to appear in the results.

  If you want the dimensions to appear first in the results, specify DIMENSIONS before METRICS. Otherwise, specify METRICS first.

  For example, suppose that you specify the METRICS clause first:

  ```sqlexample
  SELECT * FROM SEMANTIC_VIEW(
      tpch_analysis
      METRICS customer.customer_order_count
      DIMENSIONS customer.customer_name
    )
    ORDER BY customer_name
    LIMIT 5;
  ```

  In the output, the first column is the metric column (`customer_order_count`) and the second column is the dimension column
  (`customer_name`):

  ```output
  +----------------------+--------------------+
  | CUSTOMER_ORDER_COUNT | CUSTOMER_NAME      |
  |----------------------+--------------------|
  |                    6 | Customer#000000001 |
  |                    7 | Customer#000000002 |
  |                    0 | Customer#000000003 |
  |                   20 | Customer#000000004 |
  |                    4 | Customer#000000005 |
  +----------------------+--------------------+
  ```

  If you instead specify the DIMENSIONS clause first:

  ```sqlexample
  SELECT * FROM SEMANTIC_VIEW(
      tpch_analysis
      DIMENSIONS customer.customer_name
      METRICS customer.customer_order_count
    )
    ORDER BY customer_name
    LIMIT 5;
  ```

  In the output, the first column is the dimension column (`customer_name`) and the second column is the metric column
  (`customer_order_count`):

  ```output
  +--------------------+----------------------+
  | CUSTOMER_NAME      | CUSTOMER_ORDER_COUNT |
  |--------------------+----------------------|
  | Customer#000000001 |                    6 |
  | Customer#000000002 |                    7 |
  | Customer#000000003 |                    0 |
  | Customer#000000004 |                   20 |
  | Customer#000000005 |                    4 |
  +--------------------+----------------------+
  ```
* You can use the relation defined by a SEMANTIC_VIEW clause in other SQL constructs, including
  [JOIN](../../sql-reference/constructs/join.md), [PIVOT](../../sql-reference/constructs/pivot.md), [UNPIVOT](../../sql-reference/constructs/unpivot.md),
  [GROUP BY](../../sql-reference/constructs/group-by.md), and [common table expressions (CTEs)](../queries-cte.md).
* The output column headers use the unqualified names of the metrics and dimensions.

  If you have multiple metrics and dimensions with the same names, use a table alias to assign different names to the column
  headers. See Handling duplicate column names in the output.

To return all metrics or dimensions in a given logical table, use an asterisk as a wildcard, qualified by the name of the logical
table. For example, to return all metrics and dimensions defined in the `customer` logical table:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
  tpch_analysis
  DIMENSIONS customer.*
  METRICS customer.*
);
```

```output
+-----------------------+-------------------------+--------------------+----------------------+----------------------+----------------+----------------------+
| CUSTOMER_COUNTRY_CODE | CUSTOMER_MARKET_SEGMENT | CUSTOMER_NAME      | CUSTOMER_NATION_NAME | CUSTOMER_REGION_NAME | CUSTOMER_COUNT | CUSTOMER_ORDER_COUNT |
|-----------------------+-------------------------+--------------------+----------------------+----------------------+----------------+----------------------|
| 18                    | BUILDING                | Customer#000034857 | INDIA                | ASIA                 |              1 |                    0 |
| 14                    | AUTOMOBILE              | Customer#000145116 | EGYPT                | MIDDLE EAST          |              1 |                    0 |
...
```

### Examples of specifying the SEMANTIC_VIEW clause

The following examples use the `tpch_analysis` view defined in [Example of using SQL to create a semantic view](example.md):

#### Retrieving a metric

The following statement retrieves the total count of customers by querying a metric:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    METRICS customer.customer_count
  );
```

```output
+----------------+
| CUSTOMER_COUNT |
+----------------+
|          15000 |
+----------------+
```

#### Grouping metric data by a dimension

The following statement groups metric data (`order_average_value`) by a dimension (`customer_market_segment`):

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_market_segment
    METRICS orders.order_average_value
  );
```

```output
+-------------------------+---------------------+
| CUSTOMER_MARKET_SEGMENT | ORDER_AVERAGE_VALUE |
+-------------------------+---------------------+
| AUTOMOBILE              |     142570.25947219 |
| FURNITURE               |     142563.63314267 |
| MACHINERY               |     142655.91550608 |
| HOUSEHOLD               |     141659.94753445 |
| BUILDING                |     142425.37987558 |
+-------------------------+---------------------+
```

#### Using the SEMANTIC_VIEW subclause with other constructs

The following example demonstrates how you can use dimensions and metrics in the SEMANTIC_VIEW subclause with other SQL
constructs to filter, sort, and limit results:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_name
    METRICS orders.average_line_items_per_order,
            orders.order_average_value
  )
  WHERE average_line_items_per_order > 4
  ORDER BY average_line_items_per_order DESC
  LIMIT 5;
```

```output
+--------------------+------------------------------+---------------------+
| CUSTOMER_NAME      | AVERAGE_LINE_ITEMS_PER_ORDER | ORDER_AVERAGE_VALUE |
+--------------------+------------------------------+---------------------+
| Customer#000045678 |                         6.87 |           175432.21 |
| Customer#000067890 |                         6.42 |           182376.58 |
| Customer#000012345 |                         5.93 |           169847.42 |
| Customer#000034567 |                         5.76 |           178952.36 |
| Customer#000056789 |                         5.64 |           171248.75 |
+--------------------+------------------------------+---------------------+
```

#### Specifying scalar expressions that use dimensions

The following example uses a scalar expression that refers to a dimension in the DIMENSIONS clause:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS DATE_PART('year', orders.order_date) AS year
  )
  ORDER BY year;
```

```output
+------+
| YEAR |
|------|
| 1992 |
| 1993 |
| 1994 |
| 1995 |
| 1996 |
| 1997 |
| 1998 |
+------+
```

#### Specifying the WHERE clause

The following example specifies a WHERE clause that refers to a dimension in the DIMENSIONS clause:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS orders.order_date
    METRICS orders.average_line_items_per_order,
            orders.order_average_value
    WHERE orders.order_date > '1995-01-01'
  )
  ORDER BY order_date ASC
  LIMIT 5;
```

```output
+------------+------------------------------+---------------------+
| ORDER_DATE | AVERAGE_LINE_ITEMS_PER_ORDER | ORDER_AVERAGE_VALUE |
|------------+------------------------------+---------------------|
| 1995-01-02 |                     3.884547 |     151237.54900533 |
| 1995-01-03 |                     3.894819 |     145751.84384615 |
| 1995-01-04 |                     3.838863 |     145331.39167457 |
| 1995-01-05 |                     4.040689 |     150723.67353678 |
| 1995-01-06 |                     3.990755 |     152786.54109399 |
+------------+------------------------------+---------------------+
```

#### Specifying facts in the WHERE clause

The following example uses the `region.r_name` fact in a condition in the WHERE clause:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    FACTS customer.c_customer_order_count
    WHERE orders.order_date < '2021-01-01' AND region.r_name = 'AMERICA'
  );
```

## Specifying the name of the semantic view in the FROM clause

You can specify the name of the semantic view in the FROM clause of a SELECT statement, as you would when querying a standard SQL
view:

```sqlsyntax
SELECT [ DISTINCT ]
    {
      [<qualifiers>.]<dimension_or_fact>                          |
      <scalar_expression_over_dimension_or_fact>                  |
      AGG( [<qualifiers>.]<metric> )                              |
      <aggregate_function>( [<qualifiers>.]<dimension_for_fact> )
    }
    [ , ... ]
  FROM <semantic_view> [ AS <alias> ]
  [ WHERE <expr_using_dimensions_or_facts> ]
  [ GROUP BY <expr_using_dimensions_or_facts> [ , ... ] ]
  [ HAVING <expr_using_metrics> ]
  [ ORDER BY ... ]
  [ LIMIT ... ]
```

Internally, this statement is rewritten as a SELECT statement that uses the
SEMANTIC_VIEW clause:

* The expressions that you specify in the GROUP BY clause are rewritten into the DIMENSIONS clause in the SEMANTIC_VIEW clause.

  In the SELECT statement, if you use an expression that is not in the GROUP BY clause (for example, a dimension
  expression in the SELECT list), the rewrite uses that expression in the FACTS clause in the SEMANTIC_VIEW clause.
* When you refer to a metric that is defined in a semantic view, you must pass the metric to the AGG function.
* You can select ad-hoc metrics by passing a dimension or fact to any
  [aggregate function](../../sql-reference/functions-aggregation.md).
* Any other calculated values that don’t fall into the first two categories are considered to be fact references.

The next sections explain these requirements in more detail:

### Requirements for dimensions and metrics in a SELECT statement

In the SELECT statement, you can only refer to dimensions and metrics that have distinct names and that are not distinguished by
their logical table name. For example, suppose that a semantic view has two dimensions that have the unqualified name `name`:

```sqlexample
DIMENSIONS (
  nation.name AS nation.n_name,
  region.name AS region.r_name
);
```

In the SELECT statement, when you specify the qualified name of a dimension or metric, the qualifier is interpreted as the name
of the semantic view, not the name of a logical table:

```sqlexample
SELECT nation.name, region.name
  FROM duplicate_names
  GROUP BY nation.name, region.name;
```

```output
000904 (42000): SQL compilation error: error line 1 at position 7
invalid identifier 'NATION.NAME'
```

### Selecting metrics

If you want to select a metric that is defined in a semantic view, you must pass the metric to the
[AGG](../../sql-reference/functions/agg.md) function, which is a special aggregate function for metrics in semantic views.

For example:

```sqlexample
SELECT AGG(order_average_value) FROM tpch_analysis;
```

> **Note:**
>
> The AGG function has no effect on the metric because the function evaluates one value of the metric.

In the SELECT list, you can specify an expression that uses a metric. For example:

```sqlexample
SELECT AGG(order_average_value) * 10 FROM tpch_analysis;
```

You can also define and select ad-hoc metrics by passing a dimension or fact to any
[aggregate function](../../sql-reference/functions-aggregation.md). For example:

```sqlexample
SELECT COUNT(customer_market_segment) FROM tpch_analysis;
```

### Selecting dimensions

If the SELECT list includes dimensions, you must specify those dimensions in the GROUP BY clause. For example:

```sqlexample
SELECT customer_market_segment, customer_nation_name, AGG(order_average_value)
  FROM tpch_analysis
  GROUP BY customer_market_segment, customer_nation_name;
```

In the SELECT list and in the GROUP BY clause, you can specify a dimension or a scalar expression that uses a dimension or a fact.
For example:

```sqlexample
SELECT LOWER(customer_nation_name), AGG(order_average_value)
  FROM tpch_analysis
  GROUP BY customer_nation_name;
```

### Specifying the WHERE clause

In the WHERE clause, you can only use conditional expressions that refer to dimensions or facts. For example:

```sqlexample
SELECT customer_market_segment, AGG(order_average_value)
  FROM tpch_analysis
  WHERE customer_market_segment = 'BUILDING'
  GROUP BY customer_market_segment;
```

The dimensions must be reachable by every metric used in the query.

### Specifying the HAVING clause

In the HAVING clause, you can only specify metrics, and you must pass them to one of the aggregate functions listed in
Selecting metrics. For example:

```sqlexample
SELECT customer_market_segment, AGG(order_average_value)
  FROM tpch_analysis
  GROUP BY customer_market_segment
  HAVING AGG(order_average_value) > 142500;
```

### Limitations with specifying the semantic view name in the FROM clause

You cannot specify the following in the SELECT statement:

* Extensions of the FROM clause, including:

  + PIVOT
  + UNPIVOT
  + MATCH_RECOGNIZE
  + LATERAL
* Joins
* Window function calls
* QUALIFY
* Subqueries

## Choosing the dimensions that you can return for a given metric

When you specify a dimension and a metric to return, the base table for the dimension must be related to the base table for the
metric. In addition, the base table for the dimension must have an equal or lower level of granularity than the base table for
the metric.

For example, suppose that you query the `tpch_analysis` semantic view that you created in [Example of using SQL to create a semantic view](example.md), and you want to return
the `orders.order_date` dimension and the `customer.customer_order_count` metric:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  tpch_analysis
  DIMENSIONS orders.order_date
  METRICS customer.customer_order_count
);
```

This query fails because the `orders` table for the `order_date` dimension has a higher level of granularity than the
`customer` table for the `customer_order_count` metric:

```output
010234 (42601): SQL compilation error:
Invalid dimension specified: The dimension entity 'ORDERS' must be related to and
  have an equal or lower level of granularity compared to the base metric or dimension entity 'CUSTOMER'.
```

To list the dimensions that you can return with a specific metric, run the
[SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md) command. For example:

```sqlexample
SHOW SEMANTIC DIMENSIONS IN tpch_analysis FOR METRIC customer_order_count;
```

```output
+------------+-------------------------+-------------+----------+----------+---------+
| table_name | name                    | data_type   | required | synonyms | comment |
|------------+-------------------------+-------------+----------+----------+---------|
| CUSTOMER   | CUSTOMER_COUNTRY_CODE   | VARCHAR(15) | false    | NULL     | NULL    |
| CUSTOMER   | CUSTOMER_MARKET_SEGMENT | VARCHAR(10) | false    | NULL     | NULL    |
| CUSTOMER   | CUSTOMER_NAME           | VARCHAR(25) | false    | NULL     | NULL    |
| CUSTOMER   | CUSTOMER_NATION_NAME    | VARCHAR(25) | false    | NULL     | NULL    |
| CUSTOMER   | CUSTOMER_REGION_NAME    | VARCHAR(25) | false    | NULL     | NULL    |
| NATION     | NATION_NAME             | VARCHAR(25) | false    | NULL     | NULL    |
+------------+-------------------------+-------------+----------+----------+---------+
```

## Handling duplicate column names in the output

The output columns use the unqualified names of the metrics and dimensions. If you have multiple metrics and dimensions
with the same names, multiple columns will use the same name.

To work around this, use a table alias to assign different names to the columns.

For example, suppose that you define the following semantic view, which defines the dimensions `nation.name` and
`region.name`:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW duplicate_names

  TABLES (
    nation AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.NATION PRIMARY KEY (n_nationkey),
    region AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.REGION PRIMARY KEY (r_regionkey)
  )

  RELATIONSHIPS (
    nation (n_regionkey) REFERENCES region
  )

  DIMENSIONS (
    nation.name AS nation.n_name,
    region.name AS region.r_name
  );
```

If you query this view and select these two dimensions, the output includes two columns named `name` without any qualifiers:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    duplicate_names
    DIMENSIONS nation.name, region.name
  );
```

```output
+----------------+-------------+
| NAME           | NAME        |
+----------------+-------------+
| BRAZIL         | AMERICA     |
| MOROCCO        | AFRICA      |
| UNITED KINGDOM | EUROPE      |
| IRAN           | MIDDLE EAST |
| FRANCE         | EUROPE      |
| ...            | ...         |
+----------------+-------------+
```

To disambiguate the columns, use a table alias to assign different column names (for example, `nation_name` and
`region_name`):

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    duplicate_names
    DIMENSIONS nation.name, region.name
  ) AS table_alias(nation_name, region_name);
```

```output
+----------------+-------------+
| NATION_NAME    | REGION_NAME |
+----------------+-------------+
| BRAZIL         | AMERICA     |
| MOROCCO        | AFRICA      |
| UNITED KINGDOM | EUROPE      |
| IRAN           | MIDDLE EAST |
| FRANCE         | EUROPE      |
| ...            | ...         |
+----------------+-------------+
```

## Defining and querying window function metrics

You can define metrics that call [window functions](../../sql-reference/functions-window-syntax.md) and pass in aggregated values.
These metrics are called *window function metrics*.

The following examples illustrate the difference between a window function metric and a metric that passes a row-level
expression to a window function:

* The following metric is a window function metric:

  ```sqlexample
  METRICS (
    table_1.metric_1 AS SUM(table_1.metric_3) OVER( ... )
  )
  ```

  In this example, the SUM window function takes another metric (`table_1.metric_3`) as an argument.

  The following metric is also a window function metric:

  ```sqlexample
  METRICS (
    table_1.metric_2 AS SUM(
      SUM(table_1.column_1)
    ) OVER( ... )
  )
  ```

  In this example, the SUM window function takes a valid metric expression (`SUM(table_1.column_1)`) as an argument.
* The following metric is not a window function metric:

  ```sqlexample
  METRICS (
    table_1.metric_1 AS SUM(
      SUM(table_1.column_1) OVER( ... )
    )
  )
  ```

  In this example, the SUM window function takes a column (`table_1.column_1`) as an argument, and the result of that window
  function call is passed to a separate SUM aggregate function call.

The following sections explain how to define and query window function metrics:

### Defining window function metrics

When specifying a window function call, use [this syntax](../../sql-reference/sql/create-semantic-view.md), which is
described in [Parameters for window function metrics](../../sql-reference/sql/create-semantic-view.md).

The following example creates a semantic view that includes the definitions of several window function metrics. The example uses
tables from the [TPC-DS](../sample-data-tpcds.md) sample database. For information on accessing this database, see
[Add the TPC-DS data set to your account](../sample-data-tpcds.md).

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW sv_window_function_example
  TABLES (
    store_sales AS SNOWFLAKE_SAMPLE_DATA.TPCDS_SF10TCL.store_sales,
    date AS SNOWFLAKE_SAMPLE_DATA.TPCDS_SF10TCL.date_dim PRIMARY KEY (d_date_sk)
  )
  RELATIONSHIPS (
    sales_to_date AS store_sales(ss_sold_date_sk) REFERENCES date(d_date_sk)
  )
  DIMENSIONS (
    date.date AS d_date,
    date.d_date_sk AS d_date_sk,
    date.year AS d_year
  )
  METRICS (
    store_sales.total_sales_quantity AS SUM(ss_quantity)
      WITH SYNONYMS = ('Total sales quantity'),

    store_sales.avg_7_days_sales_quantity as AVG(total_sales_quantity)
      OVER (PARTITION BY EXCLUDING date.date, date.year ORDER BY date.date
        RANGE BETWEEN INTERVAL '6 days' PRECEDING AND CURRENT ROW)
      WITH SYNONYMS = ('Running 7-day average of total sales quantity'),

    store_sales.total_sales_quantity_30_days_ago AS LAG(total_sales_quantity, 30)
      OVER (PARTITION BY EXCLUDING date.date, date.year ORDER BY date.date)
      WITH SYNONYMS = ('Sales quantity 30 days ago'),

    store_sales.avg_7_days_sales_quantity_30_days_ago AS AVG(total_sales_quantity)
      OVER (PARTITION BY EXCLUDING date.date, date.year ORDER BY date.date
        RANGE BETWEEN INTERVAL '36 days' PRECEDING AND INTERVAL '30 days' PRECEDING)
      WITH SYNONYMS = ('Running 7-day average of total sales quantity 30 days ago')

  );
```

You can also use other metrics from the same logical table in the metric definition. For example:

```sqlexample
METRICS (
  orders.m3 AS SUM(m2) OVER (PARTITION BY m1 ORDER BY m2),
  orders.m4 AS ((SUM(m2) OVER (..)) / m1) + 1
)
```

> **Note:**
>
> You can’t use window function metrics in row-level calculations (facts and dimensions) or in the definitions of other metrics.

### Querying window function metrics

When you query a semantic view and the query returns a window function metric, you must also return the dimensions specified in
PARTITION BY `dimension`, PARTITION BY EXCLUDING `dimension`, and ORDER BY `dimension` in the
[CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) statement for the semantic view.

For example, suppose that you specify the `date.date` and `date.year` dimensions in the PARTITION BY EXCLUDING and ORDER BY
clauses in the definition of the `store_sales.avg_7_days_sales_quantity` metric:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW sv_window_function_example
  ...
  DIMENSIONS (
    ...
    date.date AS d_date,
    ...
    date.year AS d_year
    ...
  )
  METRICS (
    ...
    store_sales.avg_7_days_sales_quantity as AVG(total_sales_quantity)
      OVER (PARTITION BY EXCLUDING date.date, date.year ORDER BY date.date
        RANGE BETWEEN INTERVAL '6 days' PRECEDING AND CURRENT ROW)
      WITH SYNONYMS = ('Running 7-day average of total sales quantity'),
    ...
  );
```

If you return the `store_sales.avg_7_days_sales_quantity` metric in a query, you must also return the `date.date` and
`date.year` dimensions:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  sv_window_function_example
  DIMENSIONS date.date, date.year
  METRICS store_sales.avg_7_days_sales_quantity
);
```

If you omit the `date.date` and `date.year` dimensions, an error occurs.

```output
010260 (42601): SQL compilation error:
Invalid semantic view query: Dimension 'DATE.DATE' used in a
   window function metric must be requested in the query.
```

To determine which dimensions you must specify in the query, execute the
[SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md) command. For example, to determine the dimensions that you must
specify when retrieving the `store_sales.avg_7_days_sales_quantity` metric, run this command:

```sqlexample
SHOW SEMANTIC DIMENSIONS IN sv_window_function_example FOR METRIC avg_7_days_sales_quantity;
```

In the output of the command, the `required` column contains `true` for the dimensions that you must specify in the query.

```output
+------------+-----------+--------------+----------+----------+---------+
| table_name | name      | data_type    | required | synonyms | comment |
|------------+-----------+--------------+----------+----------+---------|
| DATE       | DATE      | DATE         | true     | NULL     | NULL    |
| DATE       | D_DATE_SK | NUMBER(38,0) | false    | NULL     | NULL    |
| DATE       | YEAR      | NUMBER(38,0) | true     | NULL     | NULL    |
+------------+-----------+--------------+----------+----------+---------+
```

The following additional examples query the window function metrics defined in
Defining window function metrics. Note that the DIMENSIONS clause includes the dimensions specified in the
PARTITION BY EXCLUDING and ORDER BY clauses of the metric definitions.

The following example returns the sales quantity 30 days ago:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  sv_window_function_example
  DIMENSIONS date.date, date.year
  METRICS store_sales.total_sales_quantity_30_days_ago
);
```

The following example returns the running 7-day average of the total sales quantity 30 days ago:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  sv_window_function_example
  DIMENSIONS date.date, date.year
  METRICS store_sales.avg_7_days_sales_quantity_30_days_ago
);
```

---
title: Querying Semi-structured Data
source: https://docs.snowflake.com/en/user-guide/querying-semistructured.md
section: User Guide
---

# Querying Semi-structured Data

This topic explains how to use special operators and functions to query complex hierarchical data stored in a VARIANT.

(For simple examples of how to extract values from ARRAYs and OBJECTs, see [Accessing elements of an array by index or by slice](../sql-reference/data-types-semistructured.md) and
[Accessing elements of an OBJECT value by key](../sql-reference/data-types-semistructured.md).)

Typically, hierarchical data has been imported into a VARIANT from one of the following supported data formats:

> * JSON
> * Avro
> * ORC
> * Parquet

For information about querying XML data (for example, data that originated in XML data format and was converted to an OBJECT
value by calling [PARSE_XML](../sql-reference/functions/parse_xml.md)), see [Examples of querying XML data](semistructured-data-formats.md) and
[XMLGET](../sql-reference/functions/xmlget.md).

> **Tip:**
>
> You can use the search optimization service to improve query performance.
> For details, see [Search optimization service](search-optimization-service.md).

## Sample Data Used in Examples

Except where noted, the examples in this topic refer to a table named `car_sales` that contains a single
[VARIANT](../sql-reference/data-types-semistructured.md) column named `src`. This VARIANT contains nested [ARRAYs](../sql-reference/data-types-semistructured.md)
and [OBJECTs](../sql-reference/data-types-semistructured.md).

Create the table and load it:

```sqlexample
CREATE OR REPLACE TABLE car_sales
(
  src variant
)
AS
SELECT PARSE_JSON(column1) AS src
FROM VALUES
('{
    "date" : "2017-04-28",
    "dealership" : "Valley View Auto Sales",
    "salesperson" : {
      "id": "55",
      "name": "Frank Beasley"
    },
    "customer" : [
      {"name": "Joyce Ridgely", "phone": "16504378889", "address": "San Francisco, CA"}
    ],
    "vehicle" : [
      {"make": "Honda", "model": "Civic", "year": "2017", "price": "20275", "extras":["ext warranty", "paint protection"]}
    ]
}'),
('{
    "date" : "2017-04-28",
    "dealership" : "Tindel Toyota",
    "salesperson" : {
      "id": "274",
      "name": "Greg Northrup"
    },
    "customer" : [
      {"name": "Bradley Greenbloom", "phone": "12127593751", "address": "New York, NY"}
    ],
    "vehicle" : [
      {"make": "Toyota", "model": "Camry", "year": "2017", "price": "23500", "extras":["ext warranty", "rust proofing", "fabric protection"]}
    ]
}') v;
```

Select the data:

```sqlexample
SELECT * FROM car_sales;
+-------------------------------------------+
| SRC                                       |
|-------------------------------------------|
| {                                         |
|   "customer": [                           |
|     {                                     |
|       "address": "San Francisco, CA",     |
|       "name": "Joyce Ridgely",            |
|       "phone": "16504378889"              |
|     }                                     |
|   ],                                      |
|   "date": "2017-04-28",                   |
|   "dealership": "Valley View Auto Sales", |
|   "salesperson": {                        |
|     "id": "55",                           |
|     "name": "Frank Beasley"               |
|   },                                      |
|   "vehicle": [                            |
|     {                                     |
|       "extras": [                         |
|         "ext warranty",                   |
|         "paint protection"                |
|       ],                                  |
|       "make": "Honda",                    |
|       "model": "Civic",                   |
|       "price": "20275",                   |
|       "year": "2017"                      |
|     }                                     |
|   ]                                       |
| }                                         |
| {                                         |
|   "customer": [                           |
|     {                                     |
|       "address": "New York, NY",          |
|       "name": "Bradley Greenbloom",       |
|       "phone": "12127593751"              |
|     }                                     |
|   ],                                      |
|   "date": "2017-04-28",                   |
|   "dealership": "Tindel Toyota",          |
|   "salesperson": {                        |
|     "id": "274",                          |
|     "name": "Greg Northrup"               |
|   },                                      |
|   "vehicle": [                            |
|     {                                     |
|       "extras": [                         |
|         "ext warranty",                   |
|         "rust proofing",                  |
|         "fabric protection"               |
|       ],                                  |
|       "make": "Toyota",                   |
|       "model": "Camry",                   |
|       "price": "23500",                   |
|       "year": "2017"                      |
|     }                                     |
|   ]                                       |
| }                                         |
+-------------------------------------------+
```

## Traversing Semi-structured Data

Insert a colon `:` between the VARIANT column name and any first-level element: `<column>:<level1_element>`.

> **Note:**
>
> In the following examples, the query output is enclosed in double quotes because the query output is
> VARIANT, not VARCHAR. (The VARIANT values are not strings; the VARIANT values contain strings.) Operators `:` and subsequent `.` and `[]` always return VARIANT values containing strings.

For example, get a list of all dealership names:

```sqlexample
SELECT src:dealership
    FROM car_sales
    ORDER BY 1;
+--------------------------+
| SRC:DEALERSHIP           |
|--------------------------|
| "Tindel Toyota"          |
| "Valley View Auto Sales" |
+--------------------------+
```

There are two ways to access elements in a JSON object:

* Dot Notation (in this topic).
* Bracket Notation (in this topic).

> **Important:**
>
> Regardless of which notation you use, the column name is case-insensitive but element names are case-sensitive.
> For example, in the following list, the first two paths are equivalent, but the third is not:
>
> * src:salesperson.name
> * SRC:salesperson.name
> * SRC:Salesperson.Name

### Dot Notation

Use dot notation to traverse a path in a JSON object: `<column>:<level1_element>.<level2_element>.<level3_element>`. Optionally enclose element names in double quotes:
`<column>:"<level1_element>"."<level2_element>"."<level3_element>"`.

> **Note:**
>
> The rules for JSON keys (element names) are different from the rules for
> Snowflake SQL identifiers.
>
> For more information about the rules for Snowflake SQL identifiers, see: [Identifier requirements](../sql-reference/identifiers-syntax.md).
>
> For more information about JSON keys, see <http://json.org>, in particular the description of a “string”.
>
> If an element name does not conform to Snowflake SQL identifier rules,
> for example if it contains spaces, then you must enclose the
> name in double quotes. Below are some examples (not all of which are
> from the car_sales example above) of valid JSON element names that are not valid Snowflake identifier names
> unless they are surrounded by double quotes:
>
> ```sqlexample
> -- This contains a blank.
> SELECT src:"company name" FROM partners;
>
> -- This does not start with a letter or underscore.
> SELECT zipcode_info:"94987" FROM addresses;
>
> -- This contains characters that are not letters, digits, or underscores, and
> -- it does not start with a letter or underscore.
> SELECT measurements:"#sPerSquareInch" FROM english_metrics;
> ```

Get the names of all salespeople who sold cars:

```sqlexample
SELECT src:salesperson.name
    FROM car_sales
    ORDER BY 1;
+----------------------+
| SRC:SALESPERSON.NAME |
|----------------------|
| "Frank Beasley"      |
| "Greg Northrup"      |
+----------------------+
```

### Bracket Notation

Alternatively, use bracket notation to traverse the path in an object: `<column>['<level1_element>']['<level2_element>']`. Enclose element names in single quotes. Values are retrieved as strings.

Get the names of all salespeople who sold cars:

```sqlexample
SELECT src['salesperson']['name']
    FROM car_sales
    ORDER BY 1;
+----------------------------+
| SRC['SALESPERSON']['NAME'] |
|----------------------------|
| "Frank Beasley"            |
| "Greg Northrup"            |
+----------------------------+
```

## Retrieving a Single Instance of a Repeating Element

Retrieve a specific numbered instance of a child element in a repeating array by adding a numbered predicate (starting from 0) to the array reference.

Note that to retrieve all instances of a child element in a repeating array, it is necessary to flatten the array. See an example in Using the FLATTEN Function to Parse Arrays in this topic.

Get the vehicle details for each sale:

```sqlexample
SELECT src:customer[0].name, src:vehicle[0]
    FROM car_sales
    ORDER BY 1;
+----------------------+-------------------------+
| SRC:CUSTOMER[0].NAME | SRC:VEHICLE[0]          |
|----------------------+-------------------------|
| "Bradley Greenbloom" | {                       |
|                      |   "extras": [           |
|                      |     "ext warranty",     |
|                      |     "rust proofing",    |
|                      |     "fabric protection" |
|                      |   ],                    |
|                      |   "make": "Toyota",     |
|                      |   "model": "Camry",     |
|                      |   "price": "23500",     |
|                      |   "year": "2017"        |
|                      | }                       |
| "Joyce Ridgely"      | {                       |
|                      |   "extras": [           |
|                      |     "ext warranty",     |
|                      |     "paint protection"  |
|                      |   ],                    |
|                      |   "make": "Honda",      |
|                      |   "model": "Civic",     |
|                      |   "price": "20275",     |
|                      |   "year": "2017"        |
|                      | }                       |
+----------------------+-------------------------+
```

Get the price of each car sold:

```sqlexample
SELECT src:customer[0].name, src:vehicle[0].price
    FROM car_sales
    ORDER BY 1;
+----------------------+----------------------+
| SRC:CUSTOMER[0].NAME | SRC:VEHICLE[0].PRICE |
|----------------------+----------------------|
| "Bradley Greenbloom" | "23500"              |
| "Joyce Ridgely"      | "20275"              |
+----------------------+----------------------+
```

## Explicitly Casting Values

When you extract values from a VARIANT, you can explicitly cast the values to the desired data type.
For example, you can extract the prices as numeric values and perform calculations on them:

```sqlexample
SELECT src:vehicle[0].price::NUMBER * 0.10 AS tax
    FROM car_sales
    ORDER BY tax;
+--------+
|    TAX |
|--------|
| 2027.5 |
| 2350.0 |
+--------+
```

By default, when VARCHARs, DATEs, TIMEs, and TIMESTAMPs are retrieved from a VARIANT column, the values are surrounded by double
quotes. You can eliminate the double quotes by explicitly casting the values. For example:

```sqlexample
SELECT src:dealership, src:dealership::VARCHAR
    FROM car_sales
    ORDER BY 2;
+--------------------------+-------------------------+
| SRC:DEALERSHIP           | SRC:DEALERSHIP::VARCHAR |
|--------------------------+-------------------------|
| "Tindel Toyota"          | Tindel Toyota           |
| "Valley View Auto Sales" | Valley View Auto Sales  |
+--------------------------+-------------------------+
```

For more information about casting VARIANT values, see [Inserting VARIANT data](../sql-reference/data-types-semistructured.md).

For more information about casting in general, see [Data type conversion](../sql-reference/data-type-conversion.md).

## Using FLATTEN to Filter the Results in a WHERE Clause

The [FLATTEN](../sql-reference/functions/flatten.md) function explodes nested values into separate columns. You can use the function to filter query results in a [WHERE](../sql-reference/constructs/where.md) clause.

The following example returns key-value pairs that match a WHERE clause and displays them in separate columns:

```sqlexample
CREATE TABLE pets (v variant);

INSERT INTO pets SELECT PARSE_JSON ('{"species":"dog", "name":"Fido", "is_dog":"true"} ');
INSERT INTO pets SELECT PARSE_JSON ('{"species":"cat", "name":"Bubby", "is_dog":"false"}');
INSERT INTO pets SELECT PARSE_JSON ('{"species":"cat", "name":"dog terror", "is_dog":"false"}');

SELECT a.v, b.key, b.value FROM pets a,LATERAL FLATTEN(input => a.v) b
WHERE b.value LIKE '%dog%';

+-------------------------+---------+--------------+
| V                       | KEY     | VALUE        |
|-------------------------+---------+--------------|
| {                       | species | "dog"        |
|   "is_dog": "true",     |         |              |
|   "name": "Fido",       |         |              |
|   "species": "dog"      |         |              |
| }                       |         |              |
| {                       | name    | "dog terror" |
|   "is_dog": "false",    |         |              |
|   "name": "dog terror", |         |              |
|   "species": "cat"      |         |              |
| }                       |         |              |
+-------------------------+---------+--------------+
```

## Using FLATTEN to List Distinct Key Names

When working with unfamiliar semi-structured data, you might not know the key names in an OBJECT. You can use the FLATTEN function
with the RECURSIVE argument to return the list of distinct key names in all nested elements in an OBJECT:

```sqlexample
SELECT REGEXP_REPLACE(f.path, '\\[[0-9]+\\]', '[]') AS "Path",
  TYPEOF(f.value) AS "Type",
  COUNT(*) AS "Count"
FROM <table>,
LATERAL FLATTEN(<variant_column>, RECURSIVE=>true) f
GROUP BY 1, 2 ORDER BY 1, 2;
```

The [REGEXP_REPLACE](../sql-reference/functions/regexp_replace.md) function removes the array index values (e.g. `[0]`) and replaces them with brackets (`[]`) to group array elements.

For example:

```sqljson
{"a": 1, "b": 2, "special" : "data"}   <--- row 1 of VARIANT column
{"c": 3, "d": 4, "normal" : "data"}    <----row 2 of VARIANT column

Output from query:

+---------+---------+-------+
| Path    | Type    | Count |
|---------+---------+-------|
| a       | INTEGER |     1 |
| b       | INTEGER |     1 |
| c       | INTEGER |     1 |
| d       | INTEGER |     1 |
| normal  | VARCHAR |     1 |
| special | VARCHAR |     1 |
+---------+---------+-------+
```

## Using FLATTEN to List Paths in an OBJECT

Related to Using FLATTEN to List Distinct Key Names, you can use the FLATTEN function with the RECURSIVE argument to retrieve all keys and paths in an OBJECT.

The following query returns keys, paths, and values (including VARIANT “null” values) for all data types stored in a VARIANT
column. The code assumes that the VARIANT column contains an OBJECT in each row.

```sqlexample
SELECT
  t.<variant_column>,
  f.seq,
  f.key,
  f.path,
  REGEXP_COUNT(f.path,'\\.|\\[') +1 AS Level,
  TYPEOF(f.value) AS "Type",
  f.index,
  f.value AS "Current Level Value",
  f.this AS "Above Level Value"
FROM <table> t,
LATERAL FLATTEN(t.<variant_column>, recursive=>true) f;
```

The following query is similar to the first query, but excludes nested OBJECTs and ARRAYs:

```sqlexample
SELECT
  t.<variant_column>,
  f.seq,
  f.key,
  f.path,
  REGEXP_COUNT(f.path,'\\.|\\[') +1 AS Level,
  TYPEOF(f.value) AS "Type",
  f.value AS "Current Level Value",
  f.this AS "Above Level Value"
FROM <table> t,
LATERAL FLATTEN(t.<variant_column>, recursive=>true) f
WHERE "Type" NOT IN ('OBJECT','ARRAY');
```

The queries return the following values:

> *<variant_column>*
> :   OBJECT stored as a row in the VARIANT column.
>
> Seq
> :   Unique sequence number associated with the data in the row.
>
> Key
> :   String associated with a value in the data structure.
>
> Path
> :   Path to the element within the data structure.
>
> Level
> :   Level of the key-value pair within the data structure.
>
> Type
> :   Data type for the value.
>
> Index
> :   Index of the element in the data structure. Applies to ARRAY values only; otherwise NULL.
>
> Current Level Value
> :   Value at the current level in the data structure.
>
> Above Level Value
> :   Value one level higher in the data structure.

## Using the FLATTEN Function to Parse Arrays

Parse an array using the [FLATTEN](../sql-reference/functions/flatten.md) function. FLATTEN is a table function that produces a lateral view of a VARIANT, OBJECT, or ARRAY column. The function returns a row for each object, and the LATERAL modifier joins the data with any information outside of the object.

Get the names and addresses of all customers. Cast the VARIANT output to string values:

```sqlexample
SELECT
  value:name::string as "Customer Name",
  value:address::string as "Address"
  FROM
    car_sales
  , LATERAL FLATTEN(INPUT => SRC:customer);

+--------------------+-------------------+
| Customer Name      | Address           |
|--------------------+-------------------|
| Joyce Ridgely      | San Francisco, CA |
| Bradley Greenbloom | New York, NY      |
+--------------------+-------------------+
```

## Using the FLATTEN Function to Parse Nested Arrays

The `extras` array is nested within the `vehicle` array in the sample data:

```sqlexample
"vehicle" : [
     {"make": "Honda", "model": "Civic", "year": "2017", "price": "20275", "extras":["ext warranty", "paint protection"]}
   ]
```

Add a second FLATTEN clause to flatten the `extras` array within the flattened `vehicle` array and retrieve the “extras” purchased for each car sold:

```sqlexample
SELECT
  vm.value:make::string as make,
  vm.value:model::string as model,
  ve.value::string as "Extras Purchased"
  FROM
    car_sales
    , LATERAL FLATTEN(INPUT => SRC:vehicle) vm
    , LATERAL FLATTEN(INPUT => vm.value:extras) ve
  ORDER BY make, model, "Extras Purchased";
+--------+-------+-------------------+
| MAKE   | MODEL | Extras Purchased  |
|--------+-------+-------------------|
| Honda  | Civic | ext warranty      |
| Honda  | Civic | paint protection  |
| Toyota | Camry | ext warranty      |
| Toyota | Camry | fabric protection |
| Toyota | Camry | rust proofing     |
+--------+-------+-------------------+
```

## Parsing Text as VARIANT Values Using the PARSE_JSON Function

Parse text as a JSON document using the [PARSE_JSON](../sql-reference/functions/parse_json.md) function.

If the input is NULL, the output will also be NULL. However, if the input string is `null`, it is interpreted as a VARIANT `null` value; that is, the result is not a SQL NULL but a real value used to represent a null value in semi-structured formats.

For an example, see Sample Data Used in Examples in this topic.

## Extracting Values Using the GET Function

[GET](../sql-reference/functions/get.md) accepts a VARIANT, OBJECT, or ARRAY value as the first argument and extracts the VARIANT value of the element in the path provided as the second argument.

Compute and extract the last element of each array in a VARIANT column using the GET and [ARRAY_SIZE](../sql-reference/functions/array_size.md) functions. ARRAY_SIZE returns the size of the input array:

> **Note:**
>
> This example departs from the `car_sales` table used elsewhere in this topic.

```sqlexample
CREATE OR replace TABLE colors (v variant);

INSERT INTO
   colors
   SELECT
      parse_json(column1) AS v
   FROM
   VALUES
     ('[{r:255,g:12,b:0},{r:0,g:255,b:0},{r:0,g:0,b:255}]'),
     ('[{c:0,m:1,y:1,k:0},{c:1,m:0,y:1,k:0},{c:1,m:1,y:0,k:0}]')
    v;

SELECT *, GET(v, ARRAY_SIZE(v)-1) FROM colors;

+---------------+-------------------------+
| V             | GET(V, ARRAY_SIZE(V)-1) |
|---------------+-------------------------|
| [             | {                       |
|   {           |   "b": 255,             |
|     "b": 0,   |   "g": 0,               |
|     "g": 12,  |   "r": 0                |
|     "r": 255  | }                       |
|   },          |                         |
|   {           |                         |
|     "b": 0,   |                         |
|     "g": 255, |                         |
|     "r": 0    |                         |
|   },          |                         |
|   {           |                         |
|     "b": 255, |                         |
|     "g": 0,   |                         |
|     "r": 0    |                         |
|   }           |                         |
| ]             |                         |
| [             | {                       |
|   {           |   "c": 1,               |
|     "c": 0,   |   "k": 0,               |
|     "k": 0,   |   "m": 1,               |
|     "m": 1,   |   "y": 0                |
|     "y": 1    | }                       |
|   },          |                         |
|   {           |                         |
|     "c": 1,   |                         |
|     "k": 0,   |                         |
|     "m": 0,   |                         |
|     "y": 1    |                         |
|   },          |                         |
|   {           |                         |
|     "c": 1,   |                         |
|     "k": 0,   |                         |
|     "m": 1,   |                         |
|     "y": 0    |                         |
|   }           |                         |
| ]             |                         |
+---------------+-------------------------+
```

## Extracting Values by Path Using the GET_PATH Function

Extract a value from a VARIANT column using the [GET_PATH , :](../sql-reference/functions/get_path.md) function. The function is a variation of [GET](../sql-reference/functions/get.md), used to extract a value using a path name. GET_PATH is equivalent to a chain of GET functions.

Get the vehicle make for the car purchased by each customer:

```sqlexample
SELECT GET_PATH(src, 'vehicle[0]:make') FROM car_sales;

+----------------------------------+
| GET_PATH(SRC, 'VEHICLE[0]:MAKE') |
|----------------------------------|
| "Honda"                          |
| "Toyota"                         |
+----------------------------------+
```

Traversing Semi-structured Data describes the path syntax used to retrieve elements in a VARIANT column. The syntax is shorthand for the GET or [GET_PATH , :](../sql-reference/functions/get_path.md) function. Unlike the path syntax, these functions can handle irregular paths or path elements.

The following queries produce the same results:

```sqlexample
SELECT GET_PATH(src, 'vehicle[0].make') FROM car_sales;

SELECT src:vehicle[0].make FROM car_sales;
```

## Parsing Arrays Directly from a Staged Data File

Assume a staged file named `contacts.json.gz` contains the following data:

```sqljson
{
    "root": [
        {
            "employees": [
                {
                    "firstName": "Anna",
                    "lastName": "Smith"
                },
                {
                    "firstName": "Peter",
                    "lastName": "Jones"
                }
            ]
        }
    ]
}
```

Also assume a file format named `my_json_format` includes `TYPE=JSON` in its definition.

Query the name of the first employee in the staged file. In this example, the file is located in the `customers` table stage, but it could be located in any internal (i.e. Snowflake) or external stage:

```sqlexample
SELECT 'The First Employee Record is '||
    S.$1:root[0].employees[0].firstName||
    ' '||S.$1:root[0].employees[0].lastName
FROM @%customers/contacts.json.gz (file_format => 'my_json_format') as S;

+----------------------------------------------+
| 'THE FIRST EMPLOYEE RECORD IS '||            |
|      S.$1:ROOT[0].EMPLOYEES[0].FIRSTNAME||   |
|      ' '||S.$1:ROOT[0].EMPLOYEES[0].LASTNAME |
|----------------------------------------------|
| The First Employee Record is Anna Smith      |
+----------------------------------------------+
```

## Use lambda functions on data with Snowflake higher-order functions

Snowflake higher-order functions enable you to use lambda functions to filter, reduce, and transform semi-structured
and structured data. When you call a Snowflake higher-order function, you use a lambda expression to create
the lambda function that operates on the data, which is specified in an [array](../sql-reference/data-types-semistructured.md).
Snowflake higher-order functions provide a concise, readable, and efficient way to perform data manipulation
and advanced analysis.

The following higher-order functions are available:

* [FILTER](../sql-reference/functions/filter.md)
* [REDUCE](../sql-reference/functions/reduce.md)
* [TRANSFORM](../sql-reference/functions/transform.md)

### Benefits of higher-order functions

When you use semi-structured data in data analytics, you typically need to loop over an array and perform actions
for each value in the array. You can perform these operations with a call to a Snowflake higher-order function.
Higher-order functions provide the following benefits:

* **Streamline advanced analytics** - By simplifying the iteration over array elements, the functions facilitate the implementation
  of custom logic for data filtering, reduction, and transformation, which streamlines analytical processes. Without higher-order functions,
  this type of manipulation requires LATERAL FLATTEN operations or user-defined functions (UDFs).
* **Enhance the developer experience** - Higher-order functions encapsulate the manipulation logic in lambda expressions, enabling
  more readable and maintainable SQL statements. By using higher-order functions, you can avoid writing verbose and convoluted
  SQL queries.
* **Avoid unnecessary UDFs** - With higher-order functions, there is less need to create, maintain, and manage access to
  UDFs for ad-hoc array manipulation logic. These functions can reduce overhead and simplify data manipulation processes.

### Lambda expressions

A lambda expression is a short block of code that takes an argument and returns a value. In the lambda expression,
you specify the argument on the left side of the lambda operator (`->`) and an expression on the right side. You
can use lambda expressions to complete a variety of operations.

For example, you can use a lambda expression to generate numeric output. The following lambda expression multiplies
elements by two:

```sqlsyntax
a -> a * 2
```

You can use a lambda expression to filter elements and return the elements for which the filter condition
returns TRUE. For example, the following lambda expression returns elements with a `value` greater than `50`:

```sqlsyntax
a -> a:value > 50
```

You can use a lambda expression to add text to elements. For example, the following lambda expression
adds the text `some string` to elements:

```sqlsyntax
a -> a || ' some string'
```

You can reference table columns in lambda expressions. For example, the following lambda expression
subtracts the value of `table1.col2` from elements:

```sqlsyntax
a -> a - table1.col2
```

When you reference columns in lambda expressions, you can specify unqualified or qualified column names.
You can also use aliases for column names in lambda expressions. The resolution of identifiers
prioritizes lambda arguments first, then column names (using the standard rules for object name resolution).
For more information, see [Object name resolution](../sql-reference/name-resolution.md).

You can specify the data types of lambda arguments. For example, the following lambda expression
specifies two INTEGER values and adds them:

```sqlsyntax
(x INT, y INT) -> (x + y)
```

You can use function calls in a lambda expression. For example, the following lambda expression
calls the [UPPER](../sql-reference/functions/upper.md) function:

```sqlsyntax
a -> UPPER(a)
```

You can execute an [uncorrelated scalar subquery](querying-subqueries.md) in a lambda
expression. For example, the following lambda expression includes such a subquery:

```sqlsyntax
a -> a + (SELECT MAX(c1) FROM mytable)
```

You can call a [user-defined SQL function](../developer-guide/udf/sql/udf-sql-introduction.md) in a lambda expression.
For example, the following lambda expression calls the function `mysqlfunction`:

```sqlsyntax
a -> mysqlfunction(5)
```

### Limitations

* Lambda expressions aren’t supported as standalone objects. They must be specified as arguments to Snowflake
  higher-order functions.
* Lambda expressions must be anonymous. Named functions can’t be passed in as lambda arguments to Snowflake
  higher-order functions.
* Lambda expressions only accept built-in functions (excluding aggregate and window functions), SQL user-defined functions,
  and uncorrelated scalar subqueries. They don’t support user-defined functions created in languages other than SQL,
  referencing nested context (such as Snowflake Scripting variables), CTE expressions, arguments in user-defined functions,
  or correlated subqueries.

---
title: Reconcile a billing usage statement
source: https://docs.snowflake.com/en/user-guide/billing-reconcile.md
section: User Guide
---

# Reconcile a billing usage statement

Snowflake generates [billing usage statements](billing-usage-statement.md) for customers with at least one active contract,
also known as the Snowflake Order Form.

This topic describes how to use queries to reconcile a statement with the usage data found in the billing views of the
[Organization Usage schema](../sql-reference/organization-usage.md). To execute these queries, you need to do one of the following:

* Sign in to the [organization account](organization-accounts.md) as a user with the GLOBALORGADMIN role.
* Sign in to an ORGADMIN-enabled account as a user with the ACCOUNTADMIN role.

> **Note:**
>
> When executing queries to reconcile usage incurred prior to March 1, 2024, the results of the query might differ slightly from those in
> the usage statement. Before this date, some billing views did not round values to match the usage statement. For example, prior to March
> 1, 2024:
>
> * Usage of $0.001 or -$0.001 were not included in usage statements but were included in the billing views.
> * Usage of $1.004 was rounded down to $1.00 in the usage statement but not in the billing views.
> * Usage of $1.006 was rounded up to $1.01 in the usage statement but not in the billing views.
>
> Differences between query results and usage statements are small, ranging from a few cents to less than 10 dollars, depending on how long
> the contract has been active.

## Reconcile the remaining balance

Snowflake customers with a contract make an upfront financial commitment to pay for a specified amount of usage (that is, a capacity
commitment). As the customer uses Snowflake, the currency spent is deducted from this capacity commitment. The Summary section of each
usage statement identifies the remaining balance on a contract, which is calculated by subtracting the total usage since the start of the
contract from the original capacity commitment.

Use the following query to reconcile the remaining balance shown on a usage statement with data in the
[REMAINING_BALANCE_DAILY view](../sql-reference/organization-usage/remaining_balance_daily.md). Replace the date with the last day of the month shown on the usage
statement.

```sqlexample
SELECT date,
       contract_number,
       (capacity_balance + free_usage_balance + rollover_balance) AS remaining_balance
  FROM snowflake.organization_usage.remaining_balance_daily
  WHERE TRUE
    AND date = LAST_DAY(TO_DATE('2024-01-01'));
```

> **Note:**
>
> If the subscription term of a contract has ended, the preceding query correctly returns 0, but the value in the usage statement might
> be a number other than zero. This is a known discrepancy that will be addressed in a future update.

## Reconcile total usage for a contract

Snowflake keeps track of how much has been spent on usage since the start of a contract, and classifies this amount as Total Consumed, which
is found in the Summary section of a usage statement. This consumption is tracked in currency spent, not credits consumed.

Use the following query to reconcile the total consumption shown on a usage statement with data in the
[USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). The total consumption returned by the query does not include usage whose
`balance_source` is `overage`. Replace the date with the last day of the month shown on the usage statement.

```sqlexample
SELECT contract_number,
       SUM(usage_in_currency) AS total_consumed
  FROM snowflake.organization_usage.usage_in_currency_daily
  WHERE TRUE
    AND usage_date <= LAST_DAY(TO_DATE('2024-01-01'))
    AND LOWER(balance_source) != 'overage'
  GROUP BY 1
  ORDER BY 1;
```

## Reconcile total monthly usage by account

The Monthly Usage section of a statement includes a line item for each account in the organization. Each line item shows the total usage in
an account for the month. It shows how many credits were consumed and the amount spent in currency.

Use the following query to reconcile the total monthly usage of each account with the data in the
[USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). The total usage returned by the query does not include usage whose
`balance_source` is `overage`. Replace the date with the last day of the month shown on the usage statement.

```sqlexample
SELECT contract_number,
       DATE_TRUNC(month, usage_date) AS usage_month,
       CONCAT(account_locator,'-',region) AS account_name,
       SUM(usage_in_currency) AS total_consumed,
  FROM snowflake.organization_usage.usage_in_currency_daily
  WHERE TRUE
    AND usage_month = DATE_TRUNC(month,to_date('2024-01-01'))
    AND LOWER(balance_source) != 'overage'
  GROUP BY 1,2,3
  ORDER BY 1,2,3;
```

> **Note:**
>
> There are different naming conventions for regions within Snowflake. The name of the region returned by the preceding query might not
> match what you see in the Monthly Usage section of the usage statement, but it refers to the same region. This is a known
> discrepancy that will be addressed in a future update.

## Reconcile each type of usage

Snowflake usage can be attributed to different features and [architectural components](intro-key-concepts.md). The Monthly
Usage section of a statement itemizes usage based on the source of the usage, grouped by the account where the usage occurred. For example,
usage attributed to automatic clustering in account `account_1` appears on a different line than automatic clustering usage in account
`account_2`. Each line shows how many credits were consumed and the amount spent in currency.

Use the following query to reconcile individual categories of usage shown in the statement’s Monthly Usage section with data in the
[USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md). Like the statement, each type of usage is grouped by account in the
query results. Replace the date with the last day of the month shown on the usage statement.

```sqlexample
SELECT contract_number,
       DATE_TRUNC(month, usage_date) AS usage_month,
       CONCAT(account_locator,'-',region) AS account_name,
       usage_type AS usage_category,
       SUM(usage) AS units_consumed,
       SUM(usage_in_currency) AS total_usage
  FROM snowflake.organization_usage.usage_in_currency_daily
  WHERE TRUE
    AND usage_month = DATE_TRUNC(month, TO_DATE('2024-01-01'))
  GROUP BY 1,2,3,4
  ORDER BY 1,2,3,4;
```

> **Note:**
>
> There are different naming conventions for regions within Snowflake. The name of the region returned by the preceding query might not
> match what you see in the Monthly Usage section of the usage statement, but it refers to the same region. This is a known
> discrepancy that will be addressed in a future update.

---
title: Redirecting client connections
source: https://docs.snowflake.com/en/user-guide/client-redirect.md
section: User Guide
---

# Redirecting client connections

Client Redirect enables redirecting your client connections to Snowflake accounts in different
[regions](intro-regions.md) without changing the connection settings for your application.
You can use Client Redirect in combination with the
[account replication](account-replication-intro.md) feature for business continuity
and disaster recovery. You can also use Client Redirect to minimize changes needed in your application
settings when migrating your account to another region or cloud platform.

## Introduction to Client Redirect

Client Redirect is implemented through a Snowflake *connection* object. The connection object stores a secure *connection URL* that you use
with a Snowflake client to connect to Snowflake.

The hostname in the connection URL is composed of your organization name and the connection object name in addition to a common domain name:

> `organization_name-connection_name.snowflakecomputing.com`

Note that this hostname does not specify the account to which you are connecting. An account administrator determines the account to use by
designating the connection in that account to serve as the *primary connection*. When you use the connection URL to connect to Snowflake,
you are connecting to the account that contains the primary connection.

If an outage occurs in a region or cloud platform and the outage affects the account with the primary connection, the administrator can
promote a connection in a different account in a different region or cloud platform to serve as the primary connection.

Through this outage, you can continue to use the same connection URL to connect to Snowflake. Snowflake resolves the connection URL to the
account with the newly promoted connection (the account outside of the region or cloud platform affected by the outage).

> **Note:**
>
> The Snowflake accounts that store the primary and secondary connections must be hosted in different
> [regions](intro-regions.md).

## Client Redirect flow

1. Complete the steps in Configuring Client Redirect (in this topic) to create a connection URL for client connections. This
   includes creating a primary connection and linked secondary connection(s).
2. Update Snowflake clients to connect using the connection URL. Using a connection URL (in this topic) contains a list of
   supported clients and connection details.
3. In the event of a service outage in the region where the primary connection is located, complete the steps in
   Redirecting client connections (in this topic) to update the connection URL to redirect to a secondary connection.
4. When the outage is resolved, complete the steps in Redirecting client connections to redirect client connections back to the
   original primary connection.

The following diagrams illustrate the Client Redirect flow for two accounts in the same organization but different regions (`Region A` and
`Region B`) on either the same or different cloud platforms.

The primary connection is in `Account 1` in `Region A`. Snowflake clients using the connection URL connect to `Account 1`.

A service outage in `Region A` results in failed client connections:

The connection in `Account 2` in `Region B` is promoted to act as the primary connection. Snowflake clients using the connection URL
now connect to `Account 2`.

### Example

The following SQL statements go through the client redirect workflow. Each step is explained in detail in the sections that follow in this
topic.

#### Normal client connections: Configure Client Redirect

##### Create a primary connection in the source account

Create a new primary connection and enable failover to other accounts in your organization. Each account that is enabled for failover
must be in a different region than the account with the primary connection.

Note the `account_name` column in the output of [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md) for each account to be
enabled for failover.

Execute the following statements in the *source* account:

```sqlexample
-- Create a new primary connection
CREATE CONNECTION myconnection;

-- View accounts in your organization that are enabled for replication
SHOW REPLICATION ACCOUNTS;

-- Configure failover accounts for the primary connection
ALTER CONNECTION myconnection
  ENABLE FAILOVER TO ACCOUNTS myorg.myaccount2, myorg.myaccount3;

-- View the details for the connection
SHOW CONNECTIONS;
```

If private connectivity to the Snowflake service is enabled for your Snowflake account, you must create
and manage a DNS CNAME record for your connection URL. For more details, see Configuring the DNS settings for private connectivity to the Snowflake service.

##### Executed on target account

Create a secondary connection linked to the primary connection. The name of the secondary connection must be the same name as the primary
connection.

```sqlexample
CREATE CONNECTION myconnection
  AS REPLICA OF myorg.myaccount1.myconnection;
```

If private connectivity to the Snowflake service is enabled for your Snowflake account, you must create
or update a DNS CNAME record for your connection URL. For more details, see Modifying the DNS settings for private connectivity to the Snowflake service.

#### Outage occurs in source region: Failover

If an outage occurs in the region where the primary connection is located, promote a secondary connection in a different region
to serve as the primary connection.

##### Executed on target account

1. Sign in to the target account that you want to promote to serve as the new source account.
2. Promote the secondary connection to serve as the primary connection:

   ```sqlexample
   ALTER CONNECTION myconnection PRIMARY;
   ```

If private connectivity to the Snowflake service is enabled for your Snowflake account, you must create
or update a DNS CNAME record for your connection URL. For more details, see Modifying the DNS settings for private connectivity to the Snowflake service.

#### Outage resolved: Failback

Once the outage is resolved, promote the original primary connection to serve as the primary connection again.

##### Executed on the target account that previously served as the source account

1. Sign in to the target account that served as the source account prior to the outage.
2. Promote the secondary connection back to primary connection:

   ```sqlexample
   ALTER CONNECTION myconnection PRIMARY;
   ```

If private connectivity to the Snowflake service is enabled for your Snowflake account, you must create
or update a DNS CNAME record for your connection URL. For more details, see Modifying the DNS settings for private connectivity to the Snowflake service.

## Configuring Client Redirect

This section describes how to create a primary connection and one or more secondary connections in a connection group.

### Prerequisite

To enable the Client Redirect feature for your accounts, an [organization administrator](organization-administrators.md) must enable
replication for two or more accounts. To enable replication, see [Prerequisite: Enable replication for accounts in the organization](account-replication-config.md) for detailed instructions.

### Create a primary connection

> **Important:**
>
> Snowflake assigned your organization a unique, generated name when it was created in the system. The organization name is a part of the
> connection URL defined in a connection object and submitted by Snowflake clients to access an account. Before you create any connection
> objects, verify that your organization name in Snowflake is satisfactory. To change your organization name in the system, contact
> [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

You can create a primary connection in the source account using Snowsight or
SQL.

#### Create a primary and secondary connection using Snowsight

To create a connection using Snowsight, complete the following steps:

> **Note:**
>
> * Only a user with the ACCOUNTADMIN role can create a connection using Snowsight.
> * You must be signed in to the target account as a user with the ACCOUNTADMIN role. If not, you will be prompted to sign in.
> * Currently, if your account uses private connectivity, you can’t use Snowsight to create a primary and secondary
>   connection.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Client Redirect.
4. Select + Connection.
5. Select Target Account.
6. In the Connection Name box, enter a connection name that meets the following requirements:

   * Must start with an alphabetic character and may only contain letters, decimal digits (0-9), and underscores (_).
   * Must be unique across connection names and account names in the organization.
7. Select Create Connection.

#### Create a primary connection using SQL

> **Note:**
>
> Only a user with the ACCOUNTADMIN role can execute the SQL commands in this section.

1. Create a new primary connection using the [CREATE CONNECTION](../sql-reference/sql/create-connection.md) command. The name of each primary
   connection must be unique across all connection and account names in the organization.

   The connection name is included as part of the connection URL used to connect to Snowflake accounts.

   For example, to create a connection named `myconnection`:

   ```sqlexample
   CREATE CONNECTION myconnection;
   ```
2. Modify this primary connection using an [ALTER CONNECTION … ENABLE FAILOVER TO ACCOUNTS](../sql-reference/sql/alter-connection.md)
   statement. Provide a comma-separated list of accounts in your organization that can store a failover option for this connection (i.e. a
   secondary connection).

   Any account that stores a secondary connection must be hosted in a region different from the account that stores the primary connection.
   Client Redirect only operates successfully across regions. For example, if you try to redirect client connections from `account1` to
   `account2` in the same region, client redirect does not work.

   To see the complete list of accounts in your organization that are enabled for replication, execute
   [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md).

   For example, allow accounts `myaccount2` and `myaccount3` in the `myorg` organization to each store a secondary connection for the
   `myconnection` connection:

   ```sqlexample
   ALTER CONNECTION myconnection ENABLE FAILOVER TO ACCOUNTS myorg.myaccount2, myorg.myaccount3;
   ```
3. Execute the [SHOW CONNECTIONS](../sql-reference/sql/show-connections.md) command to view the details for the connection.

   ```sqlexample
   SHOW CONNECTIONS;

   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
   |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
   | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   ```

### Create a secondary connection in each target account

Create a secondary connection in one or more accounts, linked to a primary connection using
[CREATE CONNECTION … AS REPLICA OF](../sql-reference/sql/create-connection.md). Note that you can only create a secondary connection in
an account specified in the ALTER CONNECTION … ENABLE FAILOVER TO ACCOUNTS statement in
Create a Primary Connection.

Execute a CREATE CONNECTION … AS REPLICA OF statement in each target account to create a replica of the specified primary connection.

> **Important:**
>
> Each secondary connection must have the same name as its primary connection. The connection name is included in the connection
> URL.

Execute the SQL statements in this section in the *target* account where you want to create a secondary connection.

> **Note:**
>
> Only a user with the ACCOUNTADMIN role can execute the SQL commands in this section.

1. Execute the SHOW CONNECTIONS command to view all connections. Copy the value of the `primary` column for the primary connection.
   You will use this value when creating the secondary connection in the next step.

   ```sqlexample
   SHOW CONNECTIONS;

   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
   |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
   | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   ```
2. Execute the CREATE CONNECTION … AS REPLICA OF command to create a secondary connection.

   For example, create a secondary connection named `myconnection` that is linked to the `myorg.myaccount1.myconnection` primary
   connection. After `AS REPLICA OF`, paste in the fully qualified name of the primary connection (the name that you copied from the
   SHOW CONNECTIONS output in the previous step).

   ```sqlexample
   CREATE CONNECTION myconnection
     AS REPLICA OF MYORG.MYACCOUNT1.MYCONNECTION;
   ```
3. Execute the SHOW CONNECTIONS command to verify the secondary connection was created.

   ```sqlexample
   SHOW CONNECTIONS;

   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
   |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
   | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
   | AWS_US_EAST_1      | 2020-07-22 13:52:04.925 -0700 | MYORG.MYACCOUNT2    | MYCONNECTION      | NULL            | false         | MYORG.MYACCOUNT1.MYCONNECTION |                                     | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR2 |
   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   ```

### Grant the FAILOVER Privilege to a Role — *Optional*

An account administrator can grant the FAILOVER privilege on a connection object to an account role.
This enables a user other than the account administrator to promote a secondary connection to serve as the primary connection.

For example, to grant the role `my_failover_role` the ability to fail over the connection `myconnection`, execute
the following statement on the *target* account:

```sqlexample
GRANT FAILOVER ON CONNECTION myconnection TO ROLE my_failover_role;
```

A user with the role `my_failover_role` can now promote the secondary connection `myconnection` to serve as
primary connection in the case of failover:

```sqlexample
USE ROLE my_failover_role;

ALTER CONNECTION myconnection PRIMARY;
```

For more information on redirecting client connections, see Redirecting client connections.

## Configuring the DNS settings for private connectivity to the Snowflake service

If private connectivity to the Snowflake service is enabled for your Snowflake account, then your network administrator must create and
manage a DNS record for your connection URL. Your network administrator can use a CNAME record, alias record, or an alias based on the
configuration of the network architecture. For consistency, the following example uses a CNAME record.

These steps use AWS PrivateLink as an example, and the steps are the same if your Snowflake account uses Azure Private Link or Google Cloud
Private Service Connect:

1. Execute SHOW CONNECTIONS in one of your accounts in which client redirect is enabled. For example, suppose AWS PrivateLink is enabled
   for `myaccount1` and `myaccount2`.

   ```sqlexample
   SHOW CONNECTIONS;

   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
   |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
   | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
   |--------------------|-------------------------------|---------------------|-------------------|-----------------|---------------|-------------------------------|-------------------------------------|-------------------------------------------|-------------------|-------------------|
   | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
   | AWS_US_EAST_1      | 2020-07-22 13:52:04.925 -0700 | MYORG.MYACCOUNT2    | MYCONNECTION      | NULL            | false         | MYORG.MYACCOUNT1.MYCONNECTION |                                     | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR2 |
   +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
   ```

   Note that the output of this command in the CONNECTION_URL column should match the `privatelink-connection-urls` list when calling
   the [SYSTEM$GET_PRIVATELINK_CONFIG](../sql-reference/functions/system_get_privatelink_config.md) function in either `myaccount1` or `myaccount2`. This list already
   contains the connection URL formatted with the `privatelink` segment (as shown in the next step). You can optionally run the command
   in this step or call the function. If calling the function, use the URLs as is without any further modification.
2. Record the CONNECTION_URL column value, and create two URLs to support private connectivity and [OCSP](ocsp.md).

   1. Add a `privatelink` segment to the URL just before `snowflakecomputing.com`
      (`myorg-myconnection.privatelink.snowflakecomputing.com`, in this example).
   2. Add an `ocsp` segment to the beginning of the URL (`ocsp.myorg-myconnection.privatelink.snowflakecomputing.com`, in this example).
3. Using a tool provided by your DNS provider, create a CNAME record for the connection URL and the OCSP URL:

   * Set the domain (or alias) using the modified CONNECTION_URL column value.
   * Configure the record to have the connection URL resolve to the primary Snowflake account URL. Be sure to include all URL segments for
     the cloud region and AWS PrivateLink based on the URL format that you choose. This is the primary account URL and it is where client
     connections to the connection URL will redirect.
   * Configure the record to have the OCSP URL resolve to either the private endpoint IP address for an account on Azure or the private
     endpoint ID value for an account on AWS.
   * In the case of failover, you must manually update the DNS setting to have the connection URL point to the secondary account URL as
     shown in Modifying the DNS settings for private connectivity to the Snowflake service. Similarly, you must update your OCSP settings to point to the private endpoint IP
     address or private endpoint ID value.

     For example:

     ```bash
     myaccount1.us-west-2.privatelink.snowflakecomputing.com.
     ocsp.myaccount1.us-west-2.privatelink.snowflakecomputing.com.
     ```

     Alternatively, use the organization and account name URL.

     For example:

     ```bash
     myorg-myaccount1.privatelink.snowflakecomputing.com.
     ocsp.myorg-myaccount1.privatelink.snowflakecomputing.com.
     ```

     Note the trailing period, which must be included.

Users connect to Snowflake using the following connection URL format:

`organization_name-connection_name.privatelink.snowflakecomputing.com`

Where:

`organization_name`
:   Name of your Snowflake organization. The Snowflake accounts that your users connect to are contained in this organization.

`connection_name`
:   Name of the connection object.

For more information, see:

* Using a connection URL (in this topic).
* Modifying the DNS settings for private connectivity to the Snowflake service (in this topic).

## Configuring Client Redirect and reader accounts

If you are a data provider with [reader accounts](data-sharing-reader-create.md), you can use Client Redirect
to provide continued access to shared data in the event of a service outage. The configuration steps for creating connections are
the same as those described in the Configuring Client Redirect section for source and target reader accounts:

1. Create two reader accounts. Each reader account must be in a different region.
2. Create a primary connection in the source reader account. Enable failover to the other reader account.
3. Create a secondary connection in each target account in the reader account that you enabled for failover from the source account.
4. Share the connection URL with your data consumers.

If a service outage occurs, redirect client connections. Data consumers using the
connection URL to connect to your reader account now connect to the newly promoted source reader account.

## Using a connection URL

This section provides instructions for referencing a connection URL in the configuration for various Snowflake clients.

### Supported Snowflake clients

Client Redirect is supported by Snowsight and Classic Console.
In addition, the following Snowflake client versions (and higher) support Client Redirect:

| Snowflake Client | Minimum Supported Version |
| --- | --- |
| Snowflake CLI | 3.0.0 |
| SnowSQL | 1.1.82 |
| Snowflake Connector for Python | 1.8.3 |
| Snowflake Connector for Spark | All versions |
| Node.js Driver | 1.2.0 |
| Go Snowflake Driver | 1.2.0 |
| .NET Driver | 1.0.0 |
| JDBC Driver | 3.8.4 |
| ODBC Driver | 2.19.4 |
| Snowpark | All versions |

### Configure Snowflake clients

Use the following host name for the connection URL when connecting to Snowflake:

> Host name: `organization_name-connection_name.snowflakecomputing.com`

Where:

`organization_name`
:   Name of your Snowflake organization. The Snowflake accounts that your users connect to are contained in this organization.

`connection_name`
:   Name of the connection object.

> **Important:**
>
> **Private Connectivity to the Snowflake Service**
>
> Customers using private connectivity to the Snowflake service need to add a `privatelink` segment to the URL just before
> `snowflakecomputing.com`:
>
> `organization_name-connection_name.privatelink.snowflakecomputing.com`

#### Snowsight

Enter the following in the account name field on [app.snowflake.com](https://app.snowflake.com):

```bash
<organization-name>-<connection-name>
```

For example:

```bash
myorg-myconnection
```

When using `organization-connection` to log in, Snowsight navigates to the specific region and locator of the current
primary connection. During an outage, once the connection has been redirected, users must log in again via
`organization-connection` to connect to the new primary.

#### Classic Console

Enter the following URL in a web browser:

```bash
https://<organization_name>-<connection_name>.snowflakecomputing.com/
```

For example:

```bash
https://myorg-myconnection.snowflakecomputing.com/
```

#### Snowflake CLI

Specify the host name for the connection URL in the `account` connection parameter in the Snowflake CLI `config.toml` file. For information
about the `config.toml` file, see [Configuring Snowflake CLI](../developer-guide/snowflake-cli/connecting/configure-cli.md).

```none
account = <organization_name>-<connection_name>
username = <username>
password = <password>
```

For example:

```toml
[connections.myconnection]
account = "myaccount"
user = "jondoe"
password = "password"
```

#### SnowSQL

Specify the host name for the connection URL in the `accountname` connection parameter in the SnowSQL `config` file. For information
about the `config` file, see [Configuring SnowSQL](snowsql-config.md).

```bash
accountname = <organization_name>-<connection_name>
username = <username>
password = <password>
```

For example:

```bash
accountname = myorg-myconnection
username = jsmith
password = mySecurePassword
```

#### Snowflake Connector for Python

Specify the host name for the connection URL in the `account` connection parameter when calling the connect function. For more
information, see [Python Connector API](../developer-guide/python-connector/python-connector-api.md) and [Using the Python Connector](../developer-guide/python-connector/python-connector-example.md).

```bash
con = snowflake.connector.connect (
      account = <organization_name>-<connection_name>
      user = <username>
      password = <password>
)
```

For example:

```bash
con = snowflake.connector.connect (
      account = myorg-myconnection
      user = jsmith
      password = mySecurePassword
)
```

##### Snowflake Connector for Spark

Specify the connection URL in the `URL` property in the properties file or `Map` that you use
to establish the session.

```properties
# Properties file (a text file) for establishing a Connector for Spark session
URL = https://<organization_name>-<connection_name>.snowflakecomputing.com
```

For example:

```properties
URL = https://myorg-myconnection.snowflakecomputing.com
```

For more information about using the Snowflake Connector for Spark, see [Snowflake Connector for Spark](spark-connector.md).
For configuration options, see [Setting Configuration Options for the Connector](spark-connector-use.md).
Depending upon which language you use with the connector, also see
[Using the Connector in Scala](spark-connector-use.md) or
[Using the Connector with Python](spark-connector-use.md).

#### JDBC Driver

Specify the host name for the connection URL in the connection string. For more information, see [Configuring the JDBC Driver](../developer-guide/jdbc/jdbc-configure.md).

```bash
jdbc:snowflake://<organization_name>-<connection_name>.snowflakecomputing.com/?user=<username>&password=<password>
```

For example:

```bash
jdbc:snowflake://myorg-myconnection.snowflakecomputing.com/?user=jsmith&password=mySecurePassword
```

#### ODBC Driver

Specify the host name for the connection URL in the Server connection parameter. For more information about the connection parameters, see
[ODBC configuration and connection parameters](../developer-guide/odbc/odbc-parameters.md).

```bash
[ODBC Data Sources]
<account_name> = SnowflakeDSIIDriver

[<dsn_name>]
Description     = SnowflakeDB
Driver          = SnowflakeDSIIDriver
Locale          = en-US
SERVER          = <organization_name>-<connection_name>.snowflakecomputing.com
```

For example:

```bash
[ODBC Data Sources]
myaccount = SnowflakeDSIIDriver

[client_redirect]
Description     = SnowflakeDB
Driver          = SnowflakeDSIIDriver
Locale          = en-US
SERVER          = myorg-myconnection.snowflakecomputing.com
```

#### Node.js Driver

Specify the host name for the connection URL in the `account` connection option. For more information about the connection parameters,
see [Node.js options reference](../developer-guide/node-js/nodejs-driver-options.md).

```bash
var configuration = {
  username: '<username>',
  password: '<password>',
  account: <organization_name>-<connection_name>.
}

var connection = snowflake.createConnection(configuration)
```

For example:

```bash
var configuration = {
  username: 'jsmith',
  password: 'mySecurePassword',
  account: myorg-myconnection.
}

var connection = snowflake.createConnection(configuration)
```

#### Go Snowflake Driver

Specify the host name for the connection URL in the `Account` parameter. For more information, see [Go Snowflake Driver](../developer-guide/golang/go-driver.md).

```bash
cfg := &Config{
  Account: "<organization_name>-<connection_name>",
  User: "<username>",
  Password: "<password>"
}

dsn, err := DSN(cfg)
```

For example:

```bash
cfg := &Config{
  Account: "myorg-myconnection",
  User: "jsmith",
  Password: "mySecurePassword"
}

dsn, err := DSN(cfg)
```

#### Snowpark

##### Snowpark Python

Specify the host name for the connection URL in the `account` connection parameter in the Python dictionary (`dict`) used to
establish a session. For more information about creating a session, see [Creating a Session for Snowpark Python](../developer-guide/snowpark/python/creating-session.md).

```python
connection_parameters = {
  "account": "<organization_name>-<connection_name>",
  "user": "<snowflake_user>",
  "password": "<snowflake_password>"
}
```

For example:

```python
connection_parameters = {
  "account": "myorg-myconnection",
  "user": "jsmith",
  "password": "mySecurePassword"
}
```

##### Snowpark Java

Specify the connection URL in the `URL` property in the properties file or `Map` that you use to establish the session. For more
information about creating a session, see [Creating a Session for Snowpark Java](../developer-guide/snowpark/java/creating-session.md).

```properties
# Properties file (a text file) for establishing a Snowpark session
URL = https://<organization_name>-<connection_name>.snowflakecomputing.com
```

For example:

```properties
# Properties file (a text file) for establishing a Snowpark session
URL = https://myorg-myconnection.snowflakecomputing.com
```

##### Snowpark Scala

Specify the connection URL in the `URL` property in the properties file or `Map` that you use to establish the session. For more
information about creating a session, see [Creating a Session for Snowpark Scala](../developer-guide/snowpark/scala/creating-session.md).

```properties
# Properties file (a text file) for establishing a Snowpark session
URL = https://<organization_name>-<connection_name>.snowflakecomputing.com
```

For example:

```properties
# Properties file (a text file) for establishing a Snowpark session
URL = https://myorg-myconnection.snowflakecomputing.com
```

## Authentication and Client Redirect

Users must be provisioned in the source account and on each target account if security integrations are not
[replicated](account-replication-security-integrations.md).

### Federated authentication & SSO

Configure federated authentication separately in each target account. Provide the identity provider (IdP) details using the setup
options in [Configuring Snowflake to use federated authentication](admin-security-fed-auth-security-integration.md):

> **Note:**
>
> Snowflake recommends configuring your SAML 2.0-compliant identity provider (IdP) with the connection URL rather than an account URL so
> users are redirected to the correct account in case of failover.

### OAuth

Configure a security integration object for OAuth in each target account. The security integration object must be identical to the same
object in the source account. For instructions, see the appropriate topic:

* [Snowflake OAuth](oauth-intro.md)
* [External OAuth](oauth-ext-overview.md)

To retrieve security integration properties, query the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command for each security integration
in the source account. Then recreate each security integration in a target account by executing the
[CREATE INTEGRATION](../sql-reference/sql/create-integration.md) command.

#### OAuth redirect behavior

If you are using Snowflake OAuth for authenticating a client connection and are connecting to Snowflake using a connection URL, you
are prompted to re-authenticate if the connection URL is redirected to another account (e.g. in case of failover). Snowflake OAuth
tokens are valid for use in a specific account. When a connection URL is updated to point to an account in a different region, the
existing OAuth token becomes invalid.

In the case of a failover, when the connection URL is updated to the new account, the client will disconnect with an
`invalid OAuth access token` error. You must re-authenticate and consent to permissions to re-establish the connection.

> **Note:**
>
> You will not be prompted for re-authentication when the connection URL is updated to a new account if the
> [OAuth security integration is replicated](account-replication-security-integrations.md) to that account. For more
> information, refer to [Replicating OAuth security integrations](account-replication-security-integrations.md).

## Redirecting client connections

In the event of a service outage in the region where the primary connection is located, redirect the client connection to an account that
stores a secondary connection.

### Promoting a secondary connection to serve as the primary connection

Initiating the redirect involves promoting a secondary connection in an available region to serve as the primary connection using
[ALTER CONNECTION](../sql-reference/sql/alter-connection.md). Concurrently, the former primary connection becomes a secondary connection.

1. Sign in to the target account in an available region that contains the secondary connection to be promoted to serve
   as the primary connection.
2. Execute the SQL statements in this section:

   * View all connections in the account:

     ```sqlexample
     SHOW CONNECTIONS;
     ```

     The statement returns the following output:

     ```output
     +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
     | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
     |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
     | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
     | AWS_US_EAST_1      | 2020-07-22 13:52:04.925 -0700 | MYORG.MYACCOUNT2    | MYCONNECTION      | NULL            | false         | MYORG.MYACCOUNT1.MYCONNECTION |                                     | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR2 |
     +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
     ```
   * Promote a secondary connection to serve as the primary connection:

     ```sqlexample
     ALTER CONNECTION myconnection PRIMARY;
     ```
   * Verify that the former secondary connection was promoted successfully:

     ```sqlexample
     SHOW CONNECTIONS;
     ```

     The statement returns the following output:

     ```output
     +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
     | snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
     |--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
     | AWS_US_WEST_2      | 2020-07-19 14:49:11.183 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | false         | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
     | AWS_US_EAST_1      | 2020-07-22 13:52:04.925 -0700 | MYORG.MYACCOUNT2    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION |                                     | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR2 |
     +--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
     ```

### Modifying the DNS settings for private connectivity to the Snowflake service

To redirect client connections to a secondary account, your network administrator must modify the DNS setting created in
Configuring the DNS settings for private connectivity to the Snowflake service.

Using a tool provided by your DNS provider, modify the DNS setting for the connection URL.

Set the destination hostname as the complete Snowflake account URL for the account that stores your new primary connection, including the
additional segments that identify the region and cloud platform where your account is hosted and the support for AWS PrivateLink, Azure
Private Link, or Google Cloud Private Service Connect. This is the account name where client connections to the connection URL will now
redirect. Be sure to include the private connectivity OCSP URL when updating the DNS settings.

For example:

```bash
myaccount1.us-east-1.privatelink.snowflakecomputing.com.
ocsp.myaccount1.us-east-1.privatelink.snowflakecomputing.com.
```

(Note the trailing period, which must be included.)

> **Note:**
>
> You can configure private connectivity and client redirect to work with Snowsight. Ensure your DNS updates include the Snowsight
> values from the output of the SYSTEM$GET_PRIVATELINK_CONFIG function. For details, refer to
> [private connectivity and Snowsight](ui-snowsight-gs.md).

### Verifying the connection URL is updated

To verify the connection URL has been updated, you can confirm the region of your current connection. Use the connection URL to connect to
Snowflake and execute the [CURRENT_REGION](../sql-reference/functions/current_region.md) function.

```sqlexample
SELECT CURRENT_REGION();
```

## Modifying a connection

You can edit the target accounts for a connection after creating it using Snowsight or SQL.

### Modify target accounts for a connection using Snowsight

You can modify the target account for a connection after creating it, but you cannot change the connection name.

> **Note:**
>
> * To edit a connection, you must be signed in as a user with the ACCOUNTADMIN role to the following accounts:
>
>   + The source account with the primary connection.
>   + The current target account with the secondary connection.
>   + The new target account you want to add for the primary connection.
> * You can only add one target account for a primary connection using Snowsight. To add additional
>   target accounts, use the ALTER CONNECTION command.
> * Currently, if your account uses private connectivity, you can’t use Snowsight to modify target
>   accounts for a connection.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Client Redirect.
4. Locate the connection you want to edit. Select the More menu (…) in the last column of the row.

### Modify target accounts for a connection using SQL

You can add more than one target account for a primary connection using the [ALTER CONNECTION](../sql-reference/sql/alter-connection.md) command.
For an example, see [Examples](../sql-reference/sql/alter-connection.md).

## Dropping a connection

You can drop a connection using Snowsight or SQL.

### Drop a connection using Snowsight

> **Note:**
>
> Currently, if your account uses private connectivity, you can’t use Snowsight to drop a connection.

To delete a connection, you must sign in as a user with the ACCOUNTADMIN role to the *source* account with the primary connection.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Client Redirect.
4. Locate the connection you want to delete. Select the More menu (…) in the last column of the row.
5. Select Drop, then select Drop Connection

### Drop a connection using SQL

You can use the [DROP CONNECTION](../sql-reference/sql/drop-connection.md) command to delete a connection.

1. Delete all secondary connections in target accounts.
2. Delete the primary connection in the source account.

For an example, see [Examples](../sql-reference/sql/drop-connection.md).

## Monitoring Client Redirect

You can monitor Client Redirect connections and usage for accounts in an organization using Snowsight or SQL.

### Monitor Client Redirect using Snowsight

> **Note:**
>
> * Only a user with the ACCOUNTADMIN role can view connection details using Snowsight.
> * You must be signed in to the target account as a user with the ACCOUNTADMIN role. If you are not, you will be
>   prompted to sign in.
> * Currently, if your account uses private connectivity, you can’t use Snowsight to monitor Client Redirect.

To view the Client Redirect connection details, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication and then select Client Redirect.
4. If there is no active warehouse for the session, you will be prompted to select a warehouse.

Monitor a specific connection using search and filters.

* You can search by connection name. In the  (search) box, enter the connection name to filter results.
* Choose Redirecting to filter the results by primary (To) or secondary (From) connection.
* Choose the  (accounts) menu to filter the results by account name.

You can review the following information about each connection:

| Column | Description |
| --- | --- |
| Name | Connection name. |
| Redirecting | Indicates if the connection is To a target account or From a source account and the account name.  If this column contains *destinations available*, there are no secondary connections. The number of destinations available indicates the number of target accounts the primary connection can be replicated to.  If there is more than one secondary connection, each connection is detailed in a separate row. |
| Usage | Displays the number of times the connection has been used in the last 7 days. You must sign in to the target account to view usage data for that account. |
| Connection URL | The connection URL to use with Snowflake clients. Select the connection URL in the column to copy the URL. |

### Monitor Client Redirect using SQL

You can view connection details and monitor usage using the SHOW CONNECTIONS command and LOGIN_HISTORY function.

#### View connection details

You can retrieve connection names and details using the SHOW CONNECTIONS command:

```sqlexample
SHOW CONNECTIONS;
```

Returns:

```output
+--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------+
| snowflake_region   | created_on                    | account_name        | name              | comment         | is_primary    | primary                       | failover_allowed_to_accounts        | connection_url                            | organization_name | account_locator   |
|--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
| AWS_US_WEST_2      | 2023-07-05 08:57:11.143 -0700 | MYORG.MYACCOUNT1    | MYCONNECTION      | NULL            | true          | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
| AWS_US_EAST_1      | 2023-07-08 09:15:11.143 -0700 | MYORG.MYACCOUNT2    | MYCONNECTION      | NULL            | false         | MYORG.MYACCOUNT1.MYCONNECTION | MYORG.MYACCOUNT2, MYORG.MYACCOUNT3  | myorg-myconnection.snowflakecomputing.com | MYORG             | MYACCOUNTLOCATOR1 |
|--------------------+-------------------------------+---------------------+-------------------+-----------------+---------------+-------------------------------+-------------------------------------+-------------------------------------------+-------------------+-------------------|
```

#### Verify the connection URL used by your users

Query the [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](../sql-reference/functions/login_history.md) family of table functions to view the login activity for your users within the last
7 days. The output indicates which users and Snowflake clients have been using a connection URL. The REPORTED_CLIENT_TYPE and
REPORTED_CLIENT_VERSION columns display the client and version used for each connection to Snowflake, and the CONNECTION column displays
the connection URL used, if any.

> **Note:**
>
> If a client authenticates through an identity provider (IdP) that is configured with the account URL rather than the connection URL, the
> IdP directs the client to the account URL after authentication is complete. The CONNECTION column for this login event is NULL. See
> Authentication and Client Redirect (in this topic).

For example, retrieve up to 100 login events of every user your current role is allowed to monitor in the last 72 hours:

```sqlexample
SELECT event_timestamp, user_name, client_ip, reported_client_type, is_success, connection
  FROM TABLE(INFORMATION_SCHEMA.LOGIN_HISTORY(
    DATEADD('HOURS',-72,CURRENT_TIMESTAMP()),
    CURRENT_TIMESTAMP()))
  ORDER BY EVENT_TIMESTAMP;
```

## Current limitations of Client Redirect

* Client connections using a connection URL and OAuth integration require re-authentication when the connection URL is updated to point to a
  different account if the OAuth security integration is not replicated to that account. For more information, refer to
  OAuth redirect behavior.
* Web browsers may take several minutes to redirect due to browser cache.

  If you need to verify that the redirect works, you can connect to Snowflake with a different client.

  Alternatively, open a new private browser window (e.g. incognito mode in Google Chrome) to avoid browser caching issues. Note that some web
  browsers in private or incognito mode might still cache data. To avoid using the browser cache, close any open private browsers windows and
  tabs before you open a new private browser window.
* You can only add one target account using Snowsight. To add more than one target account to the list of allowed failover
  accounts, use the [ALTER CONNECTION … ENABLE FAILOVER TO ACCOUNTS](../sql-reference/sql/alter-connection.md) command.

---
title: Reducing queues
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-queue.md
section: User Guide
---

# Reducing queues

This topic discusses how a warehouse owner or administrator can reduce queuing in order to improve the performance of queries running
on a warehouse.

If too many queries are sent to a warehouse at the same time, the warehouse’s compute resources become exhausted and subsequent queries
are queued until resources become available. The time between submitting a query and getting its results is longer when the query must
wait in a queue before starting.

> **Note:**
>
> You must have [access to the shared SNOWFLAKE database](../sql-reference/account-usage.md) to execute the diagnostic queries provided in this topic. By default, only the ACCOUNTADMIN role has the privileges needed to execute the queries.

## Finding queues

Snowsight:
:   To determine if a particular warehouse is experiencing queues:

    1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Compute » Warehouses.
    3. Select the warehouse.
    4. In the Warehouse Activity chart, use the color associated with Queued load to identify queues.
    5. Look for patterns in the height of the bars to determine if the queues are associated with usage spikes.

SQL:
:   **Query: Warehouses with queueing**

    This query lists the warehouses that had a queue in the last month, sorted by date.

    ```sqlexample
    SELECT TO_DATE(start_time) AS date
      ,warehouse_name
      ,SUM(avg_running) AS sum_running
      ,SUM(avg_queued_load) AS sum_queued
    FROM snowflake.account_usage.warehouse_load_history
    WHERE TO_DATE(start_time) >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    GROUP BY 1,2
    HAVING SUM(avg_queued_load) > 0;
    ```

    You can also write queries against the [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) to calculate the time
    that queries spend in the queue.

## Options for reducing queues

You have several options to stop warehouse queuing:

> * For a regular warehouse (i.e. not a multi-cluster warehouse), consider creating additional warehouses, and then distribute the queries
>   among them. If specific queries are causing usage spikes, focus on moving those queries.
> * Consider converting a warehouse to a [multi-cluster warehouse](warehouses-multicluster.md) so the warehouse can elastically
>   provision additional compute resources when demand spikes. Multi-cluster warehouses require the
>   [Enterprise Edition](intro-editions.md) of Snowflake.
> * If you are already using a multi-cluster warehouse, increase the maximum number of clusters.

## Cost considerations

For a description of how running a multi-cluster warehouse affects credit consumption, refer to [Multi-cluster size and credit usage](warehouses-multicluster.md).

If you are running a multi-cluster warehouse in Auto-scale mode, you can use a [scaling policy](warehouses-multicluster.md) to help
control the costs. The Economy scaling policy favors conserving credits over cluster elasticity by keeping running clusters fully-loaded
rather than starting additional clusters. This might result in queries being queued and taking longer to complete.

## How to configure warehouses to reduce queues

Regular Warehouses:
:   To create new warehouses to which queries can be distributed, sign in to [Snowsight](ui-snowsight-gs.md), and in the navigation menu, select Compute » Warehouses.
    You can also use the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) command.

Multi-Cluster Warehouses:
:   To convert an existing warehouse to a multi-cluster warehouse or to increase the maximum number of clusters for an existing warehouse:

    1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Compute » Warehouses.
    3. Find the warehouse, and select … » Edit.
    4. If you are converting to a multi-cluster warehouse, turn on the Multi-cluster Warehouse option. If you do not see this option,
       upgrade to Enterprise Edition or higher.
    5. Use the Max Clusters drop-down to adjust the maximum number of clusters.

---
title: Reference organizational listings in queries
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-query.md
section: User Guide
---

# Reference organizational listings in queries

> **Note:**
>
> Organizational listings can be queried without mounting.

To reference an organizational listing’s datasets in a SQL query, use the Uniform Listing Locator (ULL).
The ULL serves as a unique identifier that points to a listing in the Internal Marketplace, making it
easy to query its datasets directly.

SnowsightSQL

1. Sign in to [Snowsight](../../../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Browse or search for a data product.
4. Select a listing and select Copy ULL.
5. In the navigation menu, select Projects » Notebooks or another project tool.
6. Write a SQL query, using the ULL in place of the database name.

To query an organizational listing, use the following syntax:

```sqlsyntax
SELECT * FROM <ull>.<schema>.<view>
```

Example queries:

```sqlexample
SELECT * FROM "<orgdatacloud$internal$organizational_listing_name>".<schema_name>.<object_within_listing>;
SELECT * FROM <orgdatacloud$internal$organizational_listing_name>.<schema_name>.<object_within_listing>;
```

The following query example uses the ULL as a replacement for the database name. Replace `<object_within_listing>` with the name of a table or view that’s part of the listing:

```sqlexample
SELECT * FROM <orgdatacloud$internal$organizational_listing_name>.<schema_name>.<object_within_listing>;
```

If you prefer a more convenient name, consider creating a view:

```sqlexample
CREATE OR REPLACE VIEW <view_name>
AS
SELECT *
FROM <orgdatacloud$internal$organizational_listing_name>.<schema_name>.<object_within_listing>;
```

---
title: Refresh directory tables automatically for Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-auto-s3.md
section: User Guide
---

# Refresh directory tables automatically for Amazon S3

This topic provides instructions for creating a directory table on an external stage and refreshing the directory table metadata automatically using [Amazon SQS
(Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. This operation synchronizes the metadata with the
latest set of associated files in the external stage and path when the following occur:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

> **Note:**
>
> * To perform the tasks described in this topic, you must use a role that has the CREATE STAGE privilege on a schema.
>
>   In addition, you must have administrative access to AWS. If you are not an AWS administrator, ask your AWS administrator to complete
>   the steps required to configure AWS event notifications.
> * Snowflake recommends that you only send supported events for directory tables to reduce costs, event noise, and latency.

## Limitations of automatic refreshing of directory tables using Amazon SQS

* [Virtual Private Snowflake (VPS)](intro-editions.md) and [AWS PrivateLink](admin-security-privatelink.md)
  customers: Although AWS services within a VPC (including VPS) can communicate with SQS, this traffic is not within the VPC, and therefore is not protected by the VPC.
* SQS notifications notify Snowflake when new files arrive in monitored S3 buckets and are ready to load. SQS notifications contain the S3
  event and a list of the file names. They do not include the actual data in the files.

## Cloud platform support

Triggering automated refreshes using S3 event messages is supported for Snowflake accounts hosted on any of the
[supported cloud platforms](intro-cloud-platforms.md).

## Configure secure access to cloud storage

> **Note:**
>
> If you have already configured secure access to the S3 bucket that stores your data files, you can skip this section.

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage
to a Snowflake identity and access management (IAM) entity.

> **Note:**
>
> We highly recommend this option, which avoids the need to supply IAM credentials when accessing cloud storage. See
> [Configuring secure access to Amazon S3](data-load-s3-config.md) for additional storage access options.

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Amazon S3 bucket referenced in an external (i.e. S3) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an AWS identity and access management (IAM) user ID. An administrator in your organization grants the integration IAM user permissions in the AWS account.

An integration can also list buckets (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires permissions in AWS to create and manage IAM policies and roles. If you are not an AWS administrator, ask your AWS administrator to perform these tasks.
> * Note that currently, accessing S3 storage in [government regions](intro-regions.md)
>   using a storage integration is limited to Snowflake accounts hosted on AWS in the same government
>   region. Accessing your S3 storage from an account hosted outside of the government region using
>   direct credentials is supported.

The following diagram shows the integration flow for a S3 stage:

1. An external (i.e. S3) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a S3 IAM user created for your account. Snowflake creates a single IAM user that is referenced by all S3 storage integrations in your Snowflake account.
3. An AWS administrator in your organization grants permissions to the IAM user to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same storage integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the IAM user on the bucket before allowing or denying access.

### Step 1: Configure access permissions for the S3 bucket

#### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

As a best practice, Snowflake recommends creating an IAM policy for Snowflake access to the S3 bucket. You can then attach the policy to
the role and use the security credentials generated by AWS for the role to access files in the bucket.

#### Create an IAM policy

The following step-by-step instructions describe how to configure access permissions for Snowflake in your AWS Management Console to access
your S3 bucket.

1. Log into the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy document that will allow Snowflake to access the S3 bucket and folder.

   The following policy (in JSON format) provides Snowflake with the required permissions to load or unload data using a single bucket and
   folder path.

   Copy and paste the text into the policy editor:

   > **Note:**
   > * Make sure to replace `bucket` and `prefix` with your actual bucket name and folder path prefix.
   > * The Amazon Resource Names (ARN) for buckets in
   >   [government regions](intro-regions.md) have a `arn:aws-us-gov:s3:::` prefix.

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:GetObject",
                 "s3:GetObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   >
   > Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the
   > specified bucket or path in the bucket, respectively.

   Note that AWS policies support a variety of different security use cases.
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Step 2: Create the IAM role in AWS

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Select Another AWS account
5. In the Account ID field, enter your own AWS account ID temporarily. Later, you modify the trust relationship and grant
   access to Snowflake.
6. Select the Require external ID option. An external ID is used to grant access to your AWS resources
   (such as S3 buckets) to a third party like Snowflake.

   Enter a placeholder ID such as `0000`.
   In a later step, you will modify the trust relationship for your IAM role and specify the external ID for your storage integration.
7. Select Next.
8. Select the policy you created in Step 1: Configure access permissions for the S3 bucket (in this topic).
9. Select Next.
10. Enter a name and description for the role, then select Create role.

    You have now created an IAM policy for a bucket, created an IAM role, and attached the policy to the role.
11. On the role summary page, locate and record the Role ARN value. In the next step, you will create a Snowflake integration that
    references this role.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and access data from the cloud storage location until the cache expires.

### Step 3: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake
object that stores a generated identity and access management (IAM) user for your S3 cloud storage, along with an optional set of allowed
or blocked storage locations (that is, buckets). Cloud provider administrators in your organization grant permissions on the storage locations
to the generated user. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, S3) stages. The URL in the stage definition must align with the S3
buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this
> SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = '<iam_role>'
  STORAGE_ALLOWED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `iam_role` is the Amazon Resource Name (ARN) of the role you created in Step 2: Create the IAM role in AWS (in this topic).
* `protocol` is one of the following:

  + `s3` refers to S3 storage in public AWS regions outside of China.
  + `s3china` refers to S3 storage in public AWS regions in China.
  + `s3gov` refers to S3 storage in [government regions](intro-regions.md).
* `bucket` is the name of a S3 bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS
  parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that
  reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that allows access to all buckets in the account but blocks access to the defined `sensitivedata` folders.

Additional external stages that also use this integration can reference the allowed buckets and paths:

```sqlexample
CREATE STORAGE INTEGRATION s3_int
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
  STORAGE_ALLOWED_LOCATIONS = ('*')
  STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket1/mypath1/sensitivedata/', 's3://mybucket2/mypath2/sensitivedata/');
```

> **Note:**
>
> Optionally, use the [STORAGE_AWS_EXTERNAL_ID](../sql-reference/sql/create-storage-integration.md) parameter to specify
> your own external ID. You might choose this option
> to use the same external ID across multiple external volumes and/or storage integrations.

### Step 4: Retrieve the AWS IAM user for your Snowflake account

1. To retrieve the ARN for the IAM user that was created automatically for your Snowflake account, use the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md).

   ```sqlsyntax
   DESC INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 3: Create a Cloud Storage Integration in Snowflake
     (in this topic).

   For example:

   ```sqlexample
   DESC INTEGRATION s3_int;
   ```

   ```output
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   | property                  | property_type | property_value                                                                 | property_default |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------|
   | ENABLED                   | Boolean       | true                                                                           | false            |
   | STORAGE_ALLOWED_LOCATIONS | List          | s3://mybucket1/mypath1/,s3://mybucket2/mypath2/                                | []               |
   | STORAGE_BLOCKED_LOCATIONS | List          | s3://mybucket1/mypath1/sensitivedata/,s3://mybucket2/mypath2/sensitivedata/    | []               |
   | STORAGE_AWS_IAM_USER_ARN  | String        | arn:aws:iam::123456789001:user/abc1-b-self1234                                 |                  |
   | STORAGE_AWS_ROLE_ARN      | String        | arn:aws:iam::001234567890:role/myrole                                          |                  |
   | STORAGE_AWS_EXTERNAL_ID   | String        | MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=                               |                  |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   ```
2. Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `STORAGE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account; for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All S3 storage integrations in your account use that IAM user. |
   | `STORAGE_AWS_EXTERNAL_ID` | The external ID that Snowflake uses to establish a trust relationship with AWS. If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created the storage integration, Snowflake generates an ID for you to use. |

   You provide these values in the next section.

### Step 5: Grant the IAM user permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your AWS Management Console so that you can use a S3 bucket to load and unload data:

1. Sign in to the AWS Management Console.
2. Select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the role you created in Step 2: Create the IAM role in AWS (in this topic).
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC STORAGE INTEGRATION output values you recorded in
   Step 4: Retrieve the AWS IAM user for your Snowflake account (in this topic):

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<snowflake_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   > * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   > * `snowflake_external_id` is the STORAGE_AWS_EXTERNAL_ID value you recorded.
   >
   >   In this example, the `snowflake_external_id` value is `MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=`.
   >
   >   > **Note:**
   >   >
   >   > For security reasons, if you create a new storage integration (or recreate an existing storage integration using the CREATE OR
   >   > REPLACE STORAGE INTEGRATION syntax) without specifying an external ID, the new integration has a *different* external ID and
   >   > can’t resolve the trust relationship unless you update the trust policy.
8. Select Update policy to save your changes.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Determine the correct option

Before proceeding, determine whether an S3 event notification exists for the target path (or “prefix,” in AWS terminology) in your S3
bucket where your data files are located. AWS rules prohibit creating conflicting notifications for the same path.

The following options for automating the refreshing of directory table metadata using Amazon SQS are supported:

* **Option 1. New S3 event notification:** Create an event notification for the target path in your S3 bucket. The event notification
  informs Snowflake via an SQS queue when new, removed, or modified files in the path require a refresh of the directory table metadata.

  > **Important:**
  >
  > If a conflicting event notification exists for your S3 bucket, use Option 2 instead.
* **Option 2. Existing event notification:** Configure [Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a
  broadcaster to share notifications for a given path with multiple endpoints (or “subscribers,” e.g. SQS queues or AWS Lambda workloads),
  including the Snowflake SQS queue for directory table refresh automation. An S3 event notification published by SNS informs Snowflake of
  file changes in the path via an SQS queue.

  > **Note:**
  >
  > We recommend this option if you plan to use [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md). You can also migrate from option 1 to
  > option 2 after you create a replication or failover group. For more information, see [Migrate to Amazon Simple Notification Service (SNS)](account-replication-stages-pipes-load-history.md).

## Option 1: Create a new S3 event notification

This section describes the most common option for automatically refreshing directory table metadata using [Amazon SQS (Simple Queue
Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps explain how to create an event notification for the
target path (or “prefix,” in AWS terminology) in your S3 bucket where your data files are stored.

> > **Important:**
> >
> > If a conflicting event notification exists for your S3 bucket, use Option 2: Configure Amazon SNS (in this topic) instead. AWS
> > rules prohibit creating conflicting notifications for the same target path.

### Step 1: Create a stage with an included directory table

Create an external stage that references your S3 bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads
your staged data files into the directory table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

```sqlsyntax
-- External stage
CREATE [ OR REPLACE ] [ TEMPORARY ] STAGE [ IF NOT EXISTS ] <external_stage_name>
      <cloud_storage_access_settings>
    [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
    [ directoryTable ]
    [ COPY_OPTIONS = ( copyOptions ) ]
    [ COMMENT = '<string_literal>' ]
```

> **Note:**
>
> The storage location in the URL value must end in a forward slash (`/`).

Where:

> ```sqlsyntax
> directoryTable (for Amazon S3) ::=
>   [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
>                   [ AUTO_REFRESH = { TRUE | FALSE } ] ) ]
> ```

#### Directory table parameters (`directoryTable`)

`ENABLE = TRUE | FALSE`
:   Specifies whether to add a directory table to the stage. When the value is TRUE, a directory table is created with the stage.

    Default: `FALSE`

`AUTO_REFRESH = TRUE | FALSE`
:   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated data
    files are available in the named external stage specified in the URL value.

    `TRUE`
    :   Snowflake enables triggering automatic refreshes of the directory table metadata.

    `FALSE`
    :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory
        table metadata periodically using [ALTER STAGE](../sql-reference/sql/alter-stage.md) … REFRESH to synchronize the metadata with the current list
        of files in the stage path.

    Default: `FALSE`

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA mydb.public;
> ```

```sqlexample
CREATE STAGE mystage
  URL='s3://load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (
    ENABLE = true
    AUTO_REFRESH = true
  );
```

When new or updated data files are added to the cloud storage location, the event notification informs Snowflake to scan them into the
directory table metadata.

### Step 2: Configure event notifications

Configure event notifications for your S3 bucket to notify Snowflake when new or updated data is available to read into the directory table
metadata. The auto-refresh feature relies on SQS queues to deliver event notifications from S3 to Snowflake.

For ease of use, these SQS queues are created and managed by Snowflake. The [DESCRIBE STAGE](../sql-reference/sql/desc-stage.md) command output displays
the Amazon Resource Name (ARN) of your SQS queue.

1. Execute the DESCRIBE STAGE command:

   > ```sqlsyntax
   > DESC STAGE <stage_name>;
   > ```

   For example:

   > ```sqlexample
   > DESC STAGE mystage;
   > ```

   Note the ARN of the SQS queue for the directory table in the `directory_notification_channel` field. Copy the ARN to a convenient location.

   > **Note:**
   >
   > Following AWS guidelines, Snowflake designates no more than one SQS queue per AWS S3 region. This SQS queue can be shared among multiple
   > buckets in the same AWS account. The SQS queue coordinates notifications for all directory tables reading data files from the same S3
   > bucket. When a new or modified data file is uploaded into the bucket, all directory table definitions that match the stage directory
   > path read the file details into their metadata.
2. Log into the AWS Management Console.
3. Configure an event notification for your S3 bucket using the instructions provided in the
   [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html). Complete the fields
   as follows:

   > * Name: Name of the event notification (e.g. `Auto-ingest Snowflake`).
   > * Events: Select the ObjectCreate (All) and ObjectRemoved options.
   > * Send to: Select SQS Queue from the dropdown list.
   > * SQS: Select Add SQS queue ARN from the dropdown list.
   > * SQS queue ARN: Paste the SQS queue name from the DESC STAGE output.

> **Note:**
>
> These instructions create a single event notification that monitors activity for the entire S3 bucket. This is the simplest approach.
> This notification handles all directory tables configured at a more granular level in the S3 bucket directory.
>
> Alternatively, in the above steps, configure one or more paths and/or file extensions (or *prefixes* and *suffixes*, in AWS terminology)
> to filter event activity. For instructions, see the object key name filtering information in the relevant
> [AWS documentation topic](https://docs.aws.amazon.com/AmazonS3/latest/dev/NotificationHowTo.html). Repeat these steps for each
> additional path or file extension you want the notification to monitor.
>
> Note that AWS limits the number of these notification *queue configurations* to a maximum of 100 per S3 bucket.
>
> Also note that AWS does not allow overlapping queue configurations (across event notifications) for the same S3 bucket. For example, if
> an existing notification is configured for `s3://mybucket/files/path1`, then you cannot create another notification at a higher
> level, such as `s3://mybucket/files`, or vice-versa.

The external stage with auto-refresh is now configured!

When new or updated data files are added to the S3 bucket, the event notification informs Snowflake to scan them into the directory table
metadata.

### Step 3: Manually refresh directory table metadata

Refresh the metadata in a directory table manually using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command.

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> REFRESH [ SUBPATH = '<relative-path>' ]
```

Where:

`REFRESH`
:   Accesses the staged data files referenced in the directory table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    Currently, it is necessary to execute this command each time files are added to the stage, updated, or dropped. This step synchronizes
    the metadata with the latest set of associated files in the stage definition for the directory table.

`SUBPATH = '<relative-path>'`
:   Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

For example, manually refresh the directory table metadata in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH;
```

> **Important:**
>
> If this step is not completed successfully at least once after the directory table is created, querying the directory table returns no
> results until a notification event triggers the directory table metadata to refresh automatically for the first time.

### Step 4: Configure security

For each additional role that will be used to query the directory table, grant sufficient access control privileges on the various objects
(i.e. the database(s), schema(s), stage, and table) using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE |  |

## Option 2: Configure Amazon SNS

This section describes how to trigger directory table metadata refreshing automatically using
[Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps explain how to configure
[Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a broadcaster to publish event notifications for your S3
bucket to multiple subscribers (e.g. SQS queues or AWS Lambda workloads), including the Snowflake SQS queue for directory table refresh
automation.

> > **Note:**
> >
> > These instructions assume an event notification exists for the target path in your S3 bucket where your data files are located. If no
> > event notification exists, either:
> >
> > * Follow Option 1: Create a new S3 event notification (in this topic) instead.
> > * Create an event notification for your S3 bucket, then proceed with the instructions in this topic. For information, see the
> >   [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html).

### Prerequisite: Create an Amazon SNS Topic and Subscription

1. Create an SNS topic in your AWS account to handle all messages for the Snowflake stage location on your S3 bucket.
2. Subscribe your target destinations for the S3 event notifications (for example, other SQS queues or AWS Lambda workloads) to this topic. SNS publishes event notifications for your bucket to all subscribers to the topic.

For instructions, see the [SNS documentation](https://aws.amazon.com/documentation/sns/).

### Step 1: Subscribe the Snowflake SQS Queue to the SNS Topic

1. Sign in to the AWS Management Console.
2. From the home dashboard, choose Simple Notification Service (SNS).
3. Choose Topics from the left-hand navigation pane.
4. Locate the topic for your S3 bucket. Note the topic ARN.
5. Using a Snowflake client, query the [SYSTEM$GET_AWS_SNS_IAM_POLICY](../sql-reference/functions/system_get_aws_sns_iam_policy.md) system function with your SNS topic ARN:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('<sns_topic_arn>');
   > ```

   The function returns an IAM policy that grants a Snowflake SQS queue permission to subscribe to the SNS topic.

   For example:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('arn:aws:sns:us-west-2:001234567890:s3_mybucket');
   >
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | SYSTEM$GET_AWS_SNS_IAM_POLICY('ARN:AWS:SNS:US-WEST-2:001234567890:S3_MYBUCKET')                                                                                                                                                                   |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | {"Version":"2012-10-17","Statement":[{"Sid":"1","Effect":"Allow","Principal":{"AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"},"Action":["sns:Subscribe"],"Resource":["arn:aws:sns:us-west-2:001234567890:s3_mybucket"]}]}                 |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > ```
6. Return to the AWS Management Console. Choose Topics from the left-hand navigation pane.
7. Select the topic for your S3 bucket, and click the Edit button. The Edit page opens.
8. Click Access policy - Optional to expand this area of the page.
9. Merge the IAM policy addition from the SYSTEM$GET_AWS_SNS_IAM_POLICY function results into the JSON document.

   For example:

   **Original IAM policy (abbreviated):**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      }
   >    ]
   >  }
   > ```

   **Merged IAM policy:**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      },
   >      {
   >         "Sid":"1",
   >         "Effect":"Allow",
   >         "Principal":{
   >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
   >          },
   >          "Action":[
   >            "sns:Subscribe"
   >          ],
   >          "Resource":[
   >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
   >          ]
   >      }
   >    ]
   >  }
   > ```
10. Add an additional policy grant to allow S3 to publish event notifications for the bucket to the SNS topic.

    For example (using the SNS topic ARN and S3 bucket used throughout these instructions):

    > ```sqljson
    > {
    >     "Sid":"s3-event-notifier",
    >     "Effect":"Allow",
    >     "Principal":{
    >        "Service":"s3.amazonaws.com"
    >     },
    >     "Action":"SNS:Publish",
    >     "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >     "Condition":{
    >        "ArnLike":{
    >           "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >        }
    >     }
    >  }
    > ```

    **Merged IAM policy:**

    > ```sqljson
    > {
    >   "Version":"2008-10-17",
    >   "Id":"__default_policy_ID",
    >   "Statement":[
    >      {
    >         "Sid":"__default_statement_ID",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "AWS":"*"
    >         }
    >         ..
    >      },
    >      {
    >         "Sid":"1",
    >         "Effect":"Allow",
    >         "Principal":{
    >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
    >          },
    >          "Action":[
    >            "sns:Subscribe"
    >          ],
    >          "Resource":[
    >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
    >          ]
    >      },
    >      {
    >         "Sid":"s3-event-notifier",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "Service":"s3.amazonaws.com"
    >         },
    >         "Action":"SNS:Publish",
    >         "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >         "Condition":{
    >            "ArnLike":{
    >               "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >            }
    >         }
    >       }
    >    ]
    >  }
    > ```
11. Click Save changes.

### Step 2: Create a stage with an included directory table

Create an external stage that references your S3 bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads
your staged data files into the directory table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

```sqlsyntax
-- External stage
CREATE [ OR REPLACE ] [ TEMPORARY ] STAGE [ IF NOT EXISTS ] <external_stage_name>
      <cloud_storage_access_settings>
    [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
    [ directoryTable ]
    [ COPY_OPTIONS = ( copyOptions ) ]
    [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> directoryTable (for Amazon S3) ::=
>   [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
>                   [ AUTO_REFRESH = { TRUE | FALSE } ]
>                   [ AWS_SNS_TOPIC = '<sns_topic_arn>' ] ) ]
> ```

#### Directory table parameters (`directoryTable`)

`ENABLE = TRUE | FALSE`
:   Specifies whether to add a directory table to the stage. When the value is TRUE, a directory table is created with the stage.

    Default: `FALSE`

`AUTO_REFRESH = TRUE | FALSE`
:   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated data
    files are available in the named external stage specified in the URL value.

    `TRUE`
    :   Snowflake enables triggering automatic refreshes of the directory table metadata.

    `FALSE`
    :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory table
        metadata periodically using [ALTER STAGE](../sql-reference/sql/alter-stage.md) … REFRESH to synchronize the metadata with the current list of
        files in the stage path.

    Default: `FALSE`

**Amazon S3**

> `AWS_SNS_TOPIC = '<sns_topic_arn>'`
> :   Specifies the ARN for the SNS topic for your S3 bucket. The CREATE directory table statement subscribes the Snowflake SQS queue to the
>     specified SNS topic.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA mydb.public;
> ```

```sqlexample
CREATE STAGE mystage
  URL='s3://load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (
    ENABLE = true
    AUTO_REFRESH = true
    AWS_SNS_TOPIC = 'arn:aws:sns:us-west-2:001234567890:s3_mybucket'
  );
```

When new or updated data files are added to the cloud storage location, the event notification informs Snowflake to scan them into the
directory table metadata.

### Step 3: Manually refresh the directory table metadata

Refresh the metadata in a directory table manually using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command.

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> REFRESH [ SUBPATH = '<relative-path>' ]
```

Where:

`REFRESH`
:   Accesses the staged data files referenced in the directory table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    Currently, it is necessary to execute this command each time files are added to the stage, updated, or dropped. This step synchronizes
    the metadata with the latest set of associated files in the stage definition for the directory table.

`SUBPATH = '<relative-path>'`
:   Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

For example, manually refresh the directory table metadata in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH;
```

> **Important:**
>
> If this step is not completed successfully at least once after the directory table is created, querying the directory table returns no
> results until a notification event triggers the directory table metadata to refresh automatically for the first time.

### Step 4: Configure security

For each additional role that will be used to query the directory table, grant sufficient access control privileges on the various objects
(i.e. the database(s), schema(s), stage, and table) using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE |  |

---
title: Refresh directory tables automatically for Azure Blob Storage
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-auto-azure.md
section: User Guide
---

# Refresh directory tables automatically for Azure Blob Storage

This topic provides instructions for creating directory tables and refreshing the directory table metadata automatically using
[Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/) notifications for an Azure container. This operation
synchronizes the metadata with the latest set of associated files in the external stage and path, i.e.:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

Snowflake supports the following types of blob storage accounts:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v2

Automatic refresh isn’t supported for Microsoft Fabric OneLake.

> **Note:**
>
> Only `Microsoft.Storage.BlobCreated` and `Microsoft.Storage.BlobDeleted` events trigger refreshes for directory tables. Adding new objects to blob storage
> triggers these events. Renaming a directory or object doesn’t trigger these events. Snowflake recommends that you only send supported events for directory tables to reduce costs, event noise, and latency.

Snowflake supports the following `Microsoft.Storage.BlobCreated` APIs:

* `CopyBlob`
* `PutBlob`
* `PutBlockList`
* `FlushWithClose`
* `SftpCommit`

Snowflake supports the following `Microsoft.Storage.BlobDeleted` APIs:

* `DeleteBlob`
* `DeleteFile`
* `SftpRemove`

For Data Lake Storage Gen2 storage accounts, `Microsoft.Storage.BlobCreated` events are triggered when clients use the `CreateFile`
and `FlushWithClose` operations. If the SSH File Transfer Protocol (SFTP) is used, `Microsoft.Storage.BlobCreated` events are triggered with `SftpCreate` and `SftpCommit` operations. The `CreateFile` or `SftpCreate` API alone does not indicate a commit of a file in the storage account. If the
`FlushWithClose` or `SftpCommit` message is not sent, Snowflake does not refresh the directory table.

> **Note:**
>
> To perform the tasks described in this topic, you must use a role that has the CREATE STAGE privilege on a schema.
>
> In addition, you must have administrative access to Microsoft Azure. If you are not an Azure administrator, ask your Azure
> administrator to complete the steps in Step 1: Configure the Event Grid subscription.
>
> Snowflake only supports the [Azure Event Grid event schema](https://learn.microsoft.com/en-us/azure/event-grid/event-schema); it doesn’t support the [CloudEvents schema with Azure Event Grid](https://learn.microsoft.com/en-us/azure/event-grid/cloud-event-schema).

## Cloud platform support

Triggering automated refreshes using Azure Event Grid messages is supported for Snowflake accounts hosted on any of the
[supported cloud platforms](intro-cloud-platforms.md).

## Configure secure access to cloud storage

> **Note:**
>
> If you have already configured secure access to the Azure blob storage container that stores your data files, you can skip this section.

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage
to a Snowflake identity and access management (IAM) entity.

> **Note:**
>
> We highly recommend this option, which avoids the need to supply IAM credentials when accessing cloud storage. See
> [Configure an Azure container for loading data](data-load-azure-config.md) for additional storage access options.

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Azure container referenced in an external (Azure) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an Azure identity and access management (IAM) user ID called the *app registration*. An administrator in your organization grants this app the necessary permissions in the Azure account.

An integration must also specify containers (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> Completing the instructions in this section requires permissions in Azure to manage storage accounts. If you are not an Azure administrator, ask your Azure administrator to perform these tasks.

**In this Section:**

### Step 1: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake object that stores a generated service principal for your Azure cloud storage, along with an optional set of allowed or blocked storage locations (that is, containers). Cloud provider administrators in your organization grant permissions on the storage locations to the generated service principal. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, Azure) stages. The URL in the stage definition must align with the Azure containers (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'AZURE'
  ENABLED = TRUE
  AZURE_TENANT_ID = '<tenant_id>'
  STORAGE_ALLOWED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `tenant_id` is the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can authenticate to only one tenant, so the allowed and blocked storage locations must refer to storage accounts that all belong this tenant.

  To find your tenant ID, sign in to the Azure portal and click Azure Active Directory » Properties. The tenant ID is displayed in the Tenant ID field.
* `container` is the name of an Azure container that stores your data files (for example, `mycontainer`). The STORAGE_ALLOWED_LOCATIONS and STORAGE_BLOCKED_LOCATIONS parameters allow or block access to these containers, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over logical directories in the container.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two containers and paths. In a later step, we will create an external stage that references one of these containers and paths. Multiple external stages that use this integration can reference the allowed containers and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>   STORAGE_ALLOWED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/')
>   STORAGE_BLOCKED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/sensitivedata/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/sensitivedata/');
> ```

### Step 2: Grant Snowflake Access to the Storage Locations

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC STORAGE INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage Integration in Snowflake.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed storage locations.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to be granted an access token on specified resources inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Services » Storage Accounts. Click the name of the storage account you are granting the Snowflake service principal access to.
6. Click Access Control (IAM) » Add role assignment.
7. Select the desired role to grant to the Snowflake service principal:

   * `Storage Blob Data Reader` grants read access only. This allows loading data from files staged in the storage account.
   * `Storage Blob Data Contributor` grants read and write access. This allows loading data from or unloading data to files staged in
     the storage account. The role also allows executing the [REMOVE](../sql-reference/sql/remove.md) command to remove files staged in the
     storage account.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC STORAGE INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the storage integration stops working.
9. Click the Review + assign button.

   > **Note:**
   > * According to the Microsoft Azure documentation, role assignments may take up to five minutes to propagate.
   > * Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configure automation with Azure Event Grid

### Step 1: Configure the Event Grid subscription

This section describes how to set up an Event Grid subscription for Azure Storage events using the Azure CLI. For more information about the steps described in this section, see the following articles in the Azure documentation:

* <https://docs.microsoft.com/en-us/azure/event-grid/custom-event-to-queue-storage>
* <https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blob-event-quickstart>

#### Create a resource group

An Event Grid *topic* provides an endpoint where the source (that is, Azure Storage) sends events. A topic is used for a collection of related events. Event Grid topics are Azure resources, and must be placed in an Azure resource group.

Execute the following command to create a resource group:

```bash
az group create --name <resource_group_name> --location <location>
```

Where:

* `resource_group_name` is the name of the new resource group.
* `location` is the location, or *region* in Snowflake terminology, of your Azure Storage account.

#### Enable the Event Grid resource provider

Execute the following command to register the Event Grid resource provider. Note that this step is only required if you have not previously used Event Grid with your Azure account:

```bash
az provider register --namespace Microsoft.EventGrid
az provider show --namespace Microsoft.EventGrid --query "registrationState"
```

#### Create a storage account for data files

Execute the following command to create a storage account to store your data files. This account must be either a Blob storage (that is, a `BlobStorage` kind) or GPv2 (that is, a `StorageV2` kind) account, because only these two account types support event messages.

> **Note:**
>
> If you already have a Blob storage or GPv2 account, you can use that account instead.

For example, create a Blob storage account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind BlobStorage --access-tier Hot
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a Resource Group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a storage account for the storage queue

Execute the following command to create a storage account to host your storage queue. This account must be a GPv2 account, because only this kind of account supports event messages to a storage queue.

> **Note:**
>
> If you already have a GPv2 account, you can use that account to host both your data files and your storage queue.

For example, create a GPv2 account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind StorageV2
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a resource group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a storage queue

A single Azure Queue Storage queue can collect the event messages for many Event Grid subscriptions. For best performance, Snowflake recommends creating a single storage queue to accommodate all of your subscriptions related to Snowflake.

Execute the following command to create a storage queue. A storage queue stores a set of messages, in this case event messages from Event Grid:

```bash
az storage queue create --name <storage_queue_name> --account-name <storage_account_name>
```

Where:

* `storage_queue_name` is the name of the new storage queue.
* `storage_account_name` is the name of the storage account you created in Create a storage account for the storage queue.

#### Export the storage account and queue IDs for Reference

Execute the following commands to set environment variables for the storage account and queue IDs that will be requested later in these instructions:

* Linux or macOS:

  ```bash
  export storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queueid="$queuestorageid/queueservices/default/queues/<storage_queue_name>"
  ```
* Windows:

  ```bash
  set storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queueid="%queuestorageid%/queueservices/default/queues/<storage_queue_name>"
  ```

Where:

* `data_storage_account_name` is the name of the storage account you created in Create a storage account for data files.
* `queue_storage_account_name` is the name of the storage account you created in Create a storage account for the storage queue.
* `resource_group_name` is the name of the resource group you created in Create a resource group.
* `storage_queue_name` is the name of the storage queue you created in Create a storage queue.

#### Install the Event Grid extension

Execute the following command to install the Event Grid extension for Azure CLI:

```bash
az extension add --name eventgrid
```

#### Create the Event Grid subscription

Execute the following command to create the Event Grid subscription. Subscribing to a topic informs Event Grid which events to track:

* Linux or macOS:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id $storageid \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint $queueid \
  --advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit DeleteBlob DeleteFile SftpRemove
  ```
* Windows:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id %storageid% \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint %queueid% \
  -advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit DeleteBlob DeleteFile SftpRemove
  ```

Where:

* `storageid` and `queueid` are the storage account and queue ID environment variables you set in Export the storage account and queue IDs for reference.
* `subscription_name` is the name of the new Event Grid subscription.

### Step 2: Create the notification integration

A *notification integration* is a Snowflake object that provides an interface between Snowflake and a third-party cloud message queuing service such as Azure Event Grid.

> **Note:**
>
> A single notification integration supports a single Azure Storage queue. Referencing the same storage queue in multiple notification integrations can result in missing data in target tables because event notifications are split between notification integrations.

#### Retrieve the storage queue URL and tenant ID

1. Sign in to the Microsoft Azure portal.
2. Navigate to Storage account » Queue service » Queues. Record the URL for the queue you created in Create a storage queue for reference later. The URL has the following format:

   ```bash
   https://<storage_account_name>.queue.core.windows.net/<storage_queue_name>
   ```
3. Navigate to Azure Active Directory » Properties. Record the Tenant ID value for reference later. The directory ID, or *tenant ID*, is needed to generate the consent URL that grants Snowflake access to the Event Grid subscription.

#### Create the notification integration

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-azure.md) command.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The Azure service principal for notification integrations is different from the service principal created for storage integrations.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = '<queue_URL>'
  AZURE_TENANT_ID = '<directory_ID>';
```

Where:

* `integration_name` is the name of the new integration.
* `queue_URL` and `directory_ID` are the queue URL and tenant ID you recorded in Retrieve the storage queue URL and tenant ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = 'https://myqueue.queue.core.windows.net/mystoragequeue'
  AZURE_TENANT_ID = 'a123bcde-1234-5678-abc1-9abc12345678';
```

#### Grant Snowflake access to the storage queue

Note that specific steps in this section require a local installation of the Azure CLI.

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Create the notification integration.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed topic.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to obtain an access
   token on any resource inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate
   permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Active Directory » Enterprise applications. Verify that the Snowflake application identifier you
   recorded in Step 2 in this section is listed.

   > **Important:**
   >
   > If you delete the Snowflake application in Azure Active Directory at a later time, the notification integration stops working.
6. Navigate to Queues » `storage_queue_name`, where `storage_queue_name` is the name of the storage queue you created in Create a storage queue.
7. Click Access Control (IAM) » Add role assignment.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC NOTIFICATION
   INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in
   >   this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the notification integration stops working.
9. Grant the Snowflake app the following permissions:

   * Role: Storage Queue Data Message Processor (the minimum required role), or Storage Queue Data Contributor.
   * Assign access to: Azure AD user, group, or service principal.
   * Select: The `appDisplayName` value.

   The Snowflake application identifier should now be listed under Storage Queue Data Message Processor or Storage Queue Data Contributor (on the same dialog).

### Step 3: Create a stage with an included directory table

Create an external stage that references your Azure container using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads
your staged data files into the directory table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

```sqlsyntax
-- External stage
CREATE [ OR REPLACE ] [ TEMPORARY ] STAGE [ IF NOT EXISTS ] <external_stage_name>
      <cloud_storage_access_settings>
    [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
    [ directoryTable ]
    [ COPY_OPTIONS = ( copyOptions ) ]
    [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> directoryTable (for Microsoft Azure) ::=
>   [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
>                   [ AUTO_REFRESH = { TRUE | FALSE } ]
>                   [ NOTIFICATION_INTEGRATION = '<notification_integration_name>' ] ) ]
> ```

#### Directory table parameters (`directoryTable`)

`ENABLE = TRUE | FALSE`
:   Specifies whether to add a directory table to the stage. When the value is TRUE, a directory table is created with the stage.

    Default: `FALSE`

`AUTO_REFRESH = TRUE | FALSE`
:   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated data
    files are available in the named external stage specified in the URL value.

    `TRUE`
    :   Snowflake enables triggering automatic refreshes of the directory table metadata.

    `FALSE`
    :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory table
        metadata periodically using [ALTER STAGE](../sql-reference/sql/alter-stage.md) … REFRESH to synchronize the metadata with the current list of
        files in the stage path.

    Default: `FALSE`

**Microsoft Azure**

> `NOTIFICATION_INTEGRATION = '<notification_integration_name>'`
> :   Specifies the name of the notification integration used to automatically refresh the directory table metadata using Azure Event Grid
>     notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud
>     message queuing services.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA mydb.public;
> ```

```sqlexample
CREATE STAGE mystage
  URL='azure://myaccount.blob.core.windows.net/load/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (
    ENABLE = true
    AUTO_REFRESH = true
    NOTIFICATION_INTEGRATION = 'MY_NOTIFICATION_INT'
  );
```

> **Note:**
>
> * Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
> * The storage location in the URL value must end in a forward slash (`/`).

The NOTIFICATION_INTEGRATION parameter references the `my_notification_int` integration you created in
Step 2: Create the notification integration. The integration name must be provided in all uppercase.

When new or updated data files are added to the cloud storage location, the event notification informs Snowflake to scan them into the
directory table metadata.

### Step 4: Manually refresh the directory table metadata

Refresh the metadata in a directory table manually using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command.

#### Syntax

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> REFRESH [ SUBPATH = '<relative-path>' ]
```

Where:

`REFRESH`
:   Accesses the staged data files referenced in the directory table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    Currently, it is necessary to execute this command each time files are added to the stage, updated, or dropped. This step synchronizes
    the metadata with the latest set of associated files in the stage definition for the directory table.

`SUBPATH = '<relative-path>'`
:   Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

#### Examples

Manually refresh the directory table metadata in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH;
```

> **Important:**
>
> If this step is not completed successfully at least once after the directory table is created, querying the directory table returns no
> results until a notification event triggers the directory table metadata to refresh automatically for the first time.

### Step 5: Configure security

For each additional role that will be used to query the directory table, grant sufficient access control privileges on the various objects
(i.e. the database(s), schema(s), stage, and table) using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created references a named file format. |

---
title: Refresh directory tables automatically for Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-dirtables-auto-gcs.md
section: User Guide
---

# Refresh directory tables automatically for Google Cloud Storage

This topic provides instructions for triggering directory table metadata refreshes using
[Google Cloud Pub/Sub](https://cloud.google.com/storage/docs/reporting-changes) messages for Google Cloud Storage (GCS) events.

> **Note:**
>
> To complete the steps described in this topic, you must use a role that has the CREATE STAGE privilege on a schema.
>
> In addition, you must have administrative access to Google Cloud (GC). If you are not a GCP administrator, ask your GCP
> administrator to complete the Prerequisites steps.
>
> Note that only `OBJECT_DELETE` and `OBJECT_FINALIZE` events trigger refreshes for directory tables. Snowflake recommends that you only send supported events for directory tables to reduce costs, event noise, and latency.

## Cloud platform support

Triggering automated refreshes using GCS Pub/Sub event messages is supported for Snowflake accounts hosted on any of the
[supported cloud platforms](intro-cloud-platforms.md).

## Configure secure access to Cloud Storage

> **Note:**
>
> If you have already configured secure access to the GCS bucket that stores your data files, you can skip this section.

This section describes how to configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage
to a Snowflake identity and access management (IAM) entity.

This section describes how to use storage integrations to allow Snowflake to read data from and write to a Google Cloud Storage bucket referenced in an external
(that is, Cloud Storage) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as
secret keys or access tokens; instead, integration objects reference a Cloud Storage service account. An administrator in your organization grants the service
account permissions in the Cloud Storage account.

Administrators can also restrict users to a specific set of Cloud Storage buckets (and optional paths) accessed by external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires access to your Cloud Storage project as a project editor. If you are not a project
>   editor, ask your Cloud Storage administrator to perform these tasks.
> * Confirm that Snowflake supports the Google Cloud Storage region that your storage is hosted in. For more information, see
>   [Supported cloud regions](intro-regions.md).

The following diagram shows the integration flow for a Cloud Storage stage:

1. An external (that is, Cloud Storage) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a Cloud Storage service account created for your account. Snowflake creates a single service account that is referenced by all GCS storage integrations in your Snowflake account.
3. A project editor for your Cloud Storage project grants permissions to the service account to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the service account on the bucket before allowing or denying access.

**In this Section:**

### Step 1: Create a Cloud Storage integration in Snowflake

Create an integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. An integration is a Snowflake object that delegates authentication responsibility for external cloud storage to a Snowflake-generated entity (that is, a Cloud Storage service account). For accessing Cloud Storage buckets, Snowflake creates a service account that can be granted permissions to access the bucket(s) that store your data files.

A single storage integration can support multiple external (that is, GCS) stages. The URL in the stage definition must align with the GCS buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'GCS'
  ENABLED = TRUE
  STORAGE_ALLOWED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `bucket` is the name of a Cloud Storage bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two buckets and paths. In a later step, we will create an external stage that references one of these buckets and paths.

Additional external stages that also use this integration can reference the allowed buckets and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/', 'gcs://mybucket2/path2/')
>   STORAGE_BLOCKED_LOCATIONS = ('gcs://mybucket1/path1/sensitivedata/', 'gcs://mybucket2/path2/sensitivedata/');
> ```

### Step 2: Retrieve the Cloud Storage service account for your Snowflake account

Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the ID for the Cloud Storage service account that was created automatically for your Snowflake account:

```sqlsyntax
DESC STORAGE INTEGRATION <integration_name>;
```

Where:

> * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage integration in Snowflake (in this topic).

For example:

> ```sqlexample
> DESC STORAGE INTEGRATION gcs_int;
>
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> | property                    | property_type | property_value                                                              | property_default |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------|
> | ENABLED                     | Boolean       | true                                                                        | false            |
> | STORAGE_ALLOWED_LOCATIONS   | List          | gcs://mybucket1/path1/,gcs://mybucket2/path2/                               | []               |
> | STORAGE_BLOCKED_LOCATIONS   | List          | gcs://mybucket1/path1/sensitivedata/,gcs://mybucket2/path2/sensitivedata/   | []               |
> | STORAGE_GCP_SERVICE_ACCOUNT | String        | service-account-id@project1-123456.iam.gserviceaccount.com                  |                  |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> ```

The STORAGE_GCP_SERVICE_ACCOUNT property in the output shows the Cloud Storage service account created for your Snowflake account (that is, `service-account-id@project1-123456.iam.gserviceaccount.com`). We provision a single Cloud Storage service account for your entire Snowflake account. All Cloud Storage integrations use that service account.

### Step 3: Grant the service account permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your Google Cloud console so that you can use a Cloud Storage bucket to load and unload data:

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. Filter the list of permissions, and add the following from the list:

   > | Action(s) | Required permissions |
   > | --- | --- |
   > | Data loading only | * `storage.buckets.get` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading with purge option, executing the REMOVE command on the stage | * `storage.buckets.get` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading and unloading | * `storage.buckets.get` (for calculating data transfer costs) * `storage.objects.create` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data unloading only | * `storage.buckets.get` * `storage.objects.create` * `storage.objects.delete` * `storage.objects.list` |
   > | Using [COPY FILES](../sql-reference/sql/copy-files.md) to copy files to an external stage | You must have the following additional permissions:  * `storage.multipartUploads.abort` * `storage.multipartUploads.create` * `storage.multipartUploads.list` * `storage.multipartUploads.listParts` |
7. Select Add.
8. Select Create.

#### Assign the custom role to the Cloud Storage Service Account

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created your storage integration.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name that you retrieved from the DESC STORAGE INTEGRATION command output.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

> **Important:**
>
> If your Google Cloud organization was created on or after May 3, 2024, Google Cloud enforces a
> [domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains)
> in project organization policies. The default constraint lists your domain as the only allowed value.
>
> To allow the Snowflake service account access to your storage, you must
> [update the domain restriction](data-load-gcs-allow.md).

#### Grant the Cloud Storage service account permissions on the Cloud Key Management Service cryptographic keys

> **Note:**
>
> This step is required only if your GCS bucket is encrypted using a key stored in the Google Cloud Key Management Service (Cloud KMS).

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, search for and select Security » Key Management.
3. Select the key ring that is assigned to your GCS bucket.
4. Click SHOW INFO PANEL in the upper-right corner. The information panel for the key ring slides out.
5. Click the ADD PRINCIPAL button.
6. In the New principals field, search for the service account name from the DESCRIBE INTEGRATION output in Step 2: Retrieve the Cloud Storage service account for your Snowflake account (in this topic).
7. From the Select a role dropdown, select the `Cloud KMS CrytoKey Encryptor/Decryptor` role.
8. Click the Save button. The service account name is added to the Cloud KMS CrytoKey Encryptor/Decryptor role dropdown in the information panel.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configure automation using GCS Pub/Sub

### Prerequisites

The instructions in this topic assume the following items have been created and configured:

GCP account:
:   * Pub/Sub topic that receives event messages from the GCS bucket. For more information, see Create the Pub/Sub topic (in this topic).
    * Subscription that receives event messages from the Pub/Sub topic. For more information, see Create the Pub/Sub subscription (in this topic).

    For instructions, see the [Pub/Sub documentation](https://cloud.google.com/pubsub/docs).

Snowflake:
:   * Target table in the Snowflake database where your data will be loaded.

#### Create the Pub/Sub topic

Create a Pub/Sub topic using [Cloud Shell](https://cloud.google.com/shell) or [Cloud SDK](https://cloud.google.com/sdk).

Execute the following command to create the topic and enable it to listen for activity in the specified GCS bucket:

```bash
$ gsutil notification create -t <topic> -f json -e OBJECT_FINALIZE -e OBJECT_DELETE gs://<bucket-name>
```

Where:

* `<topic>` is the name for the topic.
* `<bucket-name>` is the name of your GCS bucket.

If the topic already exists, the command uses it; otherwise, a new topic is created.

For more information, see [Using Pub/Sub notifications for Cloud Storage](https://cloud.google.com/storage/docs/reporting-changes) in the Pub/Sub documentation.

#### Create the Pub/Sub subscription

Create a subscription with pull delivery to the Pub/Sub topic using the Cloud Console, `gcloud` command-line tool, or the Cloud Pub/Sub API. For instructions, see [Managing topics and subscriptions](https://cloud.google.com/pubsub/docs/admin) in the Pub/Sub documentation.

> **Note:**
>
> * Only Pub/Sub subscriptions that use the default pull delivery are supported with Snowflake. Push delivery is not supported.

#### Retrieve the Pub/Sub subscription ID

The Pub/Sub topic subscription ID is used in these instructions to allow Snowflake access to event messages.

1. Log into the Google Cloud Platform Console as a project editor.
2. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
3. Copy the ID in the Subscription ID column for the topic subscription

### Step 1: Create a notification integration in Snowflake

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-gcp.md) command.

The notification integration references your Pub/Sub subscription. Snowflake associates the notification integration with a GCS
service account created for your account. Snowflake creates a single service account that is referenced by all GCS notification
integrations in your Snowflake account.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The GCS service account for notification integrations is different from the service account created for storage integrations.
> * A single notification integration supports a single Google Cloud Pub/Sub subscription. Referencing the same Pub/Sub subscription in multiple notification integrations can result in missing data in target tables because event notifications are split between notification integrations.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = '<subscription_id>';
```

Where:

* `integration_name` is the name of the new integration.
* `subscription_id` is the subscription name you recorded in Retrieve the Pub/Sub subscription ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = 'projects/project-1234/subscriptions/sub2';
```

### Step 2: Grant Snowflake access to the Pub/Sub subscription

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the Snowflake service account ID:

   ```sqlsyntax
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Notification Integration in Snowflake.

   For example:

   > ```sqlexample
   > DESC NOTIFICATION INTEGRATION my_notification_int;
   > ```
2. Record the service account name in the GCP_PUBSUB_SERVICE_ACCOUNT column, which has the following format:

   ```bash
   <service_account>@<project_id>.iam.gserviceaccount.com
   ```
3. Log into the Google Cloud Platform Console as a project editor.
4. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
5. Select the subscription to configure for access.
6. Click SHOW INFO PANEL in the upper-right corner. The information panel for the subscription slides out.
7. Click the ADD PRINCIPAL button.
8. In the New principals field, search for the service account name you recorded.
9. From the Select a role dropdown, select Pub/Sub Subscriber.
10. Click the Save button. The service account name is added to the Pub/Sub Subscriber role dropdown in the information panel.
11. Navigate to the Dashboard page in the Cloud Console, and select your project from the dropdown list.
12. Click the ADD PEOPLE TO THIS PROJECT button.
13. Add the service account name you recorded.
14. From the Select a role dropdown, select Monitoring Viewer.
15. Click the Save button. The service account name is added to the Monitoring Viewer role.

### Step 3: Create a stage with an included directory table

Create an external stage that references your GCS bucket using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads
your staged data files into the directory table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to Cloud Storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

```sqlsyntax
-- External stage
CREATE [ OR REPLACE ] [ TEMPORARY ] STAGE [ IF NOT EXISTS ] <external_stage_name>
      <cloud_storage_access_settings>
    [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
    [ directoryTable ]
    [ COPY_OPTIONS = ( copyOptions ) ]
    [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> directoryTable ::=
>   [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
>                   [ AUTO_REFRESH = { TRUE | FALSE } ]
>                   [ NOTIFICATION_INTEGRATION = '<notification_integration_name>' ] ) ]
> ```

#### Directory table parameters (`directoryTable`)

`ENABLE = TRUE | FALSE`
:   Specifies whether to add a directory table to the stage. When the value is TRUE, a directory table is created with the stage.

    Default: `FALSE`

`AUTO_REFRESH = TRUE | FALSE`
:   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated data
    files are available in the named external stage specified in the URL value.

    `TRUE`
    :   Snowflake enables triggering automatic refreshes of the directory table metadata.

    `FALSE`
    :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory table
        metadata periodically using [ALTER STAGE](../sql-reference/sql/alter-stage.md) … REFRESH to synchronize the metadata with the current list of
        files in the stage path.

    Default: `FALSE`

`NOTIFICATION_INTEGRATION = '<notification_integration_name>'`
:   Specifies the name of the notification integration used to automatically refresh the directory table metadata using Pub/Sub
    notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud
    message queuing services.

    The integration name must be provided in all uppercase.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`.

The NOTIFICATION_INTEGRATION parameter references the `my_notification_int` integration you created in
Step 1: Create a Notification Integration in Snowflake:

> ```sqlexample
> USE SCHEMA mydb.public;
> ```

```sqlexample
CREATE STAGE mystage
  URL='gcs://mybucket/files/'
  STORAGE_INTEGRATION = my_storage_int
  DIRECTORY = (
    ENABLE = true
    AUTO_REFRESH = true
    NOTIFICATION_INTEGRATION = 'MY_NOTIFICATION_INT'
  );
```

> **Note:**
>
> * The storage location in the URL value must end in a forward slash (`/`).
> * The integration name must be provided in all uppercase.

When new or updated data files are added to the cloud storage location, the event notification informs Snowflake to scan them into the
directory table metadata.

### Step 4: Manually refresh the directory table metadata

Refresh the metadata in a directory table manually using the [ALTER STAGE](../sql-reference/sql/alter-stage.md) command.

#### Syntax

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> REFRESH [ SUBPATH = '<relative-path>' ]
```

Where:

`REFRESH`
:   Accesses the staged data files referenced in the directory table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    Currently, it is necessary to execute this command each time files are added to the stage, updated, or dropped. This step synchronizes
    the metadata with the latest set of associated files in the stage definition for the directory table.

`SUBPATH = '<relative-path>'`
:   Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

#### Examples

Manually refresh the directory table metadata in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH;
```

> **Important:**
>
> If this step is not completed successfully at least once after the directory table is created, querying the directory table returns no
> results until a notification event triggers the directory table metadata to refresh automatically for the first time.

### Step 5: Configure security

For each additional role that will be used to query the directory table, grant sufficient access control privileges on the various objects
(i.e. the database(s), schema(s), and stage) using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created references a named file format. |

---
title: Refresh external tables automatically
source: https://docs.snowflake.com/en/user-guide/tables-external-auto.md
section: User Guide
---

# Refresh external tables automatically

Event notifications for cloud storage can start refreshes of the external table metadata or add or drop file references.

> **Important:**
>
> If you transfer ownership on an external table or its parent database by using the [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md) command,
> this sets the table’s `AUTO_REFRESH` property to `FALSE`. This blocks automatic refreshes of the table metadata.
> To restore automatic refreshes after you transfer ownership, set `AUTO_REFRESH = TRUE`
> by using the [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) command.

**Next topics:**

* [Refresh external tables automatically for Amazon S3](tables-external-s3.md)
* [Refresh external tables automatically for Google Cloud Storage](tables-external-gcs.md)
* [Refresh external tables automatically for Azure Blob Storage](tables-external-azure.md)

---
title: Refresh external tables automatically for Amazon S3
source: https://docs.snowflake.com/en/user-guide/tables-external-s3.md
section: User Guide
---

# Refresh external tables automatically for Amazon S3

You can create external tables and refresh the external table metadata automatically by using [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. This operation synchronizes the metadata with the latest set of associated files in the external stage and path:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

## Prerequisites

Before you proceed, ensure you meet the following prerequisites:

> * This feature is limited to Snowflake accounts on Amazon Web Services (AWS).
> * To perform the tasks described in this topic, you must use a role that has the CREATE STAGE and CREATE EXTERNAL TABLE privileges on a schema.
>
>   In addition, you must have administrative access to AWS. If you are not an AWS administrator, ask your AWS administrator to complete the steps required to configure AWS event notifications.
> * Snowflake recommends that you only send supported events for external tables to reduce costs, event noise, and latency.
> * External tables don’t support storage versioning (S3 versioning, Object Versioning in Google Cloud Storage, or versioning for Azure Storage).

## Limitations of refreshing external tables automatically by using Amazon SQS

* [Virtual Private Snowflake (VPS)](intro-editions.md) and [AWS PrivateLink](admin-security-privatelink.md) customers: Amazon SQS isn’t currently supported by AWS as a [VPC endpoint](https://docs.aws.amazon.com/AmazonVPC/latest/UserGuide/vpc-endpoints.html). Although AWS services within a VPC (including VPS) can communicate with SQS, this traffic isn’t within the VPC, and therefore isn’t protected by the VPC.
* SQS notifications notify Snowflake when new files arrive in monitored S3 buckets and are ready to load. SQS notifications contain the S3 event and a list of the file names. They don’t include the actual data in the files.

## Cloud platform support

Triggering automated external metadata refreshes by using S3 event messages is supported by Snowflake accounts hosted on AWS only.

## Configure secure access to cloud storage

> **Important:**
>
> If you have already configured secure access to the S3 bucket that stores your data files, skip this section and proceed to Create a new S3 event notification or use an existing notification.

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Amazon S3 bucket referenced in an external (i.e. S3) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an AWS identity and access management (IAM) user ID. An administrator in your organization grants the integration IAM user permissions in the AWS account.

An integration can also list buckets (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires permissions in AWS to create and manage IAM policies and roles. If you are not an AWS administrator, ask your AWS administrator to perform these tasks.
> * Note that currently, accessing S3 storage in [government regions](intro-regions.md)
>   using a storage integration is limited to Snowflake accounts hosted on AWS in the same government
>   region. Accessing your S3 storage from an account hosted outside of the government region using
>   direct credentials is supported.

The following diagram shows the integration flow for a S3 stage:

1. An external (i.e. S3) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a S3 IAM user created for your account. Snowflake creates a single IAM user that is referenced by all S3 storage integrations in your Snowflake account.
3. An AWS administrator in your organization grants permissions to the IAM user to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same storage integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the IAM user on the bucket before allowing or denying access.

> **Important:**
>
> Snowflake strongly recommends that you configure secure access so that you don’t need to supply IAM credentials when you access cloud storage. For more storage access options, see [Configuring secure access to Amazon S3](data-load-s3-config.md).

### Step 1: Configure access permissions for the S3 bucket

#### AWS access control requirements

Snowflake requires the following permissions on an S3 bucket and folder to be able to access files in the folder (and sub-folders):

* `s3:GetBucketLocation`
* `s3:GetObject`
* `s3:GetObjectVersion`
* `s3:ListBucket`

As a best practice, Snowflake recommends that you create an IAM policy for Snowflake access to the S3 bucket. You can then attach the policy to
the role, and then use the security credentials generated by AWS for the role to access files in the bucket.

#### Create an IAM policy

Complete the following steps to configure access permissions for Snowflake to access
your S3 bucket:

1. Sign in to the AWS Management Console.
2. From the home dashboard, search for and then select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](intro-regions.md) where your account is located.
5. If the STS status is inactive, move the toggle to Active.
6. From the left-hand navigation pane, select Policies.
7. Select Create Policy.
8. For Policy editor, select JSON.
9. To add a policy document that allows Snowflake to access the S3 bucket and folder, copy and paste the following syntax block into the policy editor:

   ```sqljson
   {
       "Version": "2012-10-17",
       "Statement": [
           {
               "Effect": "Allow",
               "Action": [
                 "s3:GetObject",
                 "s3:GetObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<bucket>/<prefix>/*"
           },
           {
               "Effect": "Allow",
               "Action": [
                   "s3:ListBucket",
                   "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<bucket>",
               "Condition": {
                   "StringLike": {
                       "s3:prefix": [
                           "<prefix>/*"
                       ]
                   }
               }
           }
       ]
   }
   ```

   > **Note:**
   > * This policy document (in JSON format) provides Snowflake with the required permissions to load or unload data by using a single bucket and folder path.
   > * Amazon Resource Names (ARN) for buckets in [government regions](intro-regions.md) have a `arn:aws-us-gov:s3:::` prefix.
   > * Setting the `"s3:prefix":` condition to either `["*"]` or `["<path>/*"]` grants access to all prefixes in the specified bucket or path in the bucket, respectively.
   > * AWS policies support a variety of different security use cases.
10. Replace `bucket` and `prefix` with your actual bucket name and folder path prefix.
11. Select Next.
12. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
13. Select Create policy.

### Step 2: Create the IAM role in AWS

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. Select AWS account as the trusted entity type.
4. Select Another AWS account
5. In the Account ID field, enter your own AWS account ID temporarily. Later, you modify the trust relationship and grant
   access to Snowflake.
6. Select the Require external ID option. An external ID is used to grant access to your AWS resources
   (such as S3 buckets) to a third party like Snowflake.

   Enter a placeholder ID such as `0000`.
   In a later step, you will modify the trust relationship for your IAM role and specify the external ID for your storage integration.
7. Select Next.
8. Select the policy you created in Step 1: Configure access permissions for the S3 bucket (in this topic).
9. Select Next.
10. Enter a name and description for the role, then select Create role.

    You have now created an IAM policy for a bucket, created an IAM role, and attached the policy to the role.
11. On the role summary page, locate and record the Role ARN value. In the next step, you will create a Snowflake integration that
    references this role.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and access data from the cloud storage location until the cache expires.

### Step 3: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake
object that stores a generated identity and access management (IAM) user for your S3 cloud storage, along with an optional set of allowed
or blocked storage locations (that is, buckets). Cloud provider administrators in your organization grant permissions on the storage locations
to the generated user. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, S3) stages. The URL in the stage definition must align with the S3
buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this
> SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = '<iam_role>'
  STORAGE_ALLOWED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('<protocol>://<bucket>/<path>/', '<protocol>://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `iam_role` is the Amazon Resource Name (ARN) of the role you created in Step 2: Create the IAM role in AWS (in this topic).
* `protocol` is one of the following:

  + `s3` refers to S3 storage in public AWS regions outside of China.
  + `s3china` refers to S3 storage in public AWS regions in China.
  + `s3gov` refers to S3 storage in [government regions](intro-regions.md).
* `bucket` is the name of a S3 bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS
  parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that
  reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that allows access to all buckets in the account but blocks access to the defined `sensitivedata` folders.

Additional external stages that also use this integration can reference the allowed buckets and paths:

```sqlexample
CREATE STORAGE INTEGRATION s3_int
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'S3'
  ENABLED = TRUE
  STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
  STORAGE_ALLOWED_LOCATIONS = ('*')
  STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket1/mypath1/sensitivedata/', 's3://mybucket2/mypath2/sensitivedata/');
```

> **Note:**
>
> Optionally, use the [STORAGE_AWS_EXTERNAL_ID](../sql-reference/sql/create-storage-integration.md) parameter to specify
> your own external ID. You might choose this option
> to use the same external ID across multiple external volumes and/or storage integrations.

### Step 4: Retrieve the AWS IAM user for your Snowflake account

1. To retrieve the ARN for the IAM user that was created automatically for your Snowflake account, use the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md).

   ```sqlsyntax
   DESC INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 3: Create a Cloud Storage Integration in Snowflake
     (in this topic).

   For example:

   ```sqlexample
   DESC INTEGRATION s3_int;
   ```

   ```output
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   | property                  | property_type | property_value                                                                 | property_default |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------|
   | ENABLED                   | Boolean       | true                                                                           | false            |
   | STORAGE_ALLOWED_LOCATIONS | List          | s3://mybucket1/mypath1/,s3://mybucket2/mypath2/                                | []               |
   | STORAGE_BLOCKED_LOCATIONS | List          | s3://mybucket1/mypath1/sensitivedata/,s3://mybucket2/mypath2/sensitivedata/    | []               |
   | STORAGE_AWS_IAM_USER_ARN  | String        | arn:aws:iam::123456789001:user/abc1-b-self1234                                 |                  |
   | STORAGE_AWS_ROLE_ARN      | String        | arn:aws:iam::001234567890:role/myrole                                          |                  |
   | STORAGE_AWS_EXTERNAL_ID   | String        | MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=                               |                  |
   +---------------------------+---------------+--------------------------------------------------------------------------------+------------------+
   ```
2. Record the values for the following properties:

   | Property | Description |
   | --- | --- |
   | `STORAGE_AWS_IAM_USER_ARN` | The AWS IAM user created for your Snowflake account; for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`. Snowflake provisions a single IAM user for your entire Snowflake account. All S3 storage integrations in your account use that IAM user. |
   | `STORAGE_AWS_EXTERNAL_ID` | The external ID that Snowflake uses to establish a trust relationship with AWS. If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created the storage integration, Snowflake generates an ID for you to use. |

   You provide these values in the next section.

### Step 5: Grant the IAM user permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your AWS Management Console so that you can use a S3 bucket to load and unload data:

1. Sign in to the AWS Management Console.
2. Select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the role you created in Step 2: Create the IAM role in AWS (in this topic).
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC STORAGE INTEGRATION output values you recorded in
   Step 4: Retrieve the AWS IAM user for your Snowflake account (in this topic):

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<snowflake_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   > * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   > * `snowflake_external_id` is the STORAGE_AWS_EXTERNAL_ID value you recorded.
   >
   >   In this example, the `snowflake_external_id` value is `MYACCOUNT_SFCRole=2_a123456/s0aBCDEfGHIJklmNoPq=`.
   >
   >   > **Note:**
   >   >
   >   > For security reasons, if you create a new storage integration (or recreate an existing storage integration using the CREATE OR
   >   > REPLACE STORAGE INTEGRATION syntax) without specifying an external ID, the new integration has a *different* external ID and
   >   > can’t resolve the trust relationship unless you update the trust policy.
8. Select Update policy to save your changes.

> **Note:**
>
> Snowflake caches the temporary credentials for a period that cannot exceed the 60-minute expiration time. If you revoke access from
> Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Create a new S3 event notification or use an existing notification

Before you proceed, determine whether an S3 event notification exists for the target path (or *prefix*, in AWS terminology) in your S3 bucket where your data files are located. AWS rules prohibit creating conflicting notifications for the same path.

You have two options to automate the refreshing of external table metadata by using Amazon SQS:

Option 1: Create a new S3 event notification
:   This is the most common option. Create an event notification for the target path in your S3 bucket. The event notification informs Snowflake via an SQS queue when new, removed, or modified files in the path require a refresh of the external table metadata.

    > **Important:**
    >
    > If a conflicting event notification exists for your S3 bucket, use Option 2 instead.

    For step-by-step instructions, see Option 1: Create a new S3 event notification.

Option 2: Configure Amazon SNS
:   When you have an existing event notification, configure [Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a broadcaster to share notifications for a given path with multiple endpoints (or *subscribers*, for example, SQS queues or AWS Lambda workloads), including the Snowflake SQS queue for external table refresh automation. An S3 event notification published by SNS informs Snowflake of file changes in the path through an SQS queue.

    For step-by-step instructions, see Option 2: Configure Amazon SNS later in this topic.

## Option 1: Create a new S3 event notification

This section provides step-by-step instructions for the most common option to automatically refresh external table metadata by using [Amazon Simple Queue Service (SQS)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps show you how to create an event notification for the target path (or *prefix*, in AWS terminology) in your S3 bucket where your data files are stored.

> > **Important:**
> >
> > If a conflicting event notification exists for your S3 bucket, use Option 2: Configure Amazon SNS later in this topic instead. AWS rules prohibit creating conflicting notifications for the same target path.

### (Optional) Step 1: Create a stage

Create an external stage that references your S3 bucket by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads your staged data files into the external table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to cloud storage (in this topic).
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `files`. The stage references a storage integration named `my_storage_int`.

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE mystage
>   URL = 's3://mybucket/files'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 2: Create an external table

Create an external table by using the [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) command. For example, create an external table in the `mydb.public` schema that reads JSON data from staged files.

The stage reference includes a folder path named `path1`. The external table appends this path to the stage definition, that is, the external table references the data files in `@mystage/files/path1`.

The `AUTO_REFRESH` parameter is `TRUE` by default:

> ```sqlexample
> CREATE OR REPLACE EXTERNAL TABLE ext_table
>  WITH LOCATION = @mystage/path1/
>  FILE_FORMAT = (TYPE = JSON);
> ```

### Step 3: Configure event notifications

Configure event notifications for your S3 bucket to notify Snowflake when new or updated data is available to read into the external table metadata. The auto-refresh feature relies on SQS queues to deliver event notifications from S3 to Snowflake.

For ease of use, these SQS queues are created and managed by Snowflake. The SHOW EXTERNAL TABLES command output displays the Amazon Resource Name (ARN) of your SQS queue.

1. Run the SHOW EXTERNAL TABLES command:

   > ```sqlexample
   > SHOW EXTERNAL TABLES;
   > ```
2. In the `notification_channel` column, find the ARN of the SQS queue for the external table, and then copy the ARN to a convenient location.

   > **Note:**
   >
   > Following AWS guidelines, Snowflake designates no more than one SQS queue per AWS S3 region. This SQS queue can be shared among multiple buckets in the same AWS account. The SQS queue coordinates notifications for all external tables reading data files from the same S3 bucket. When a new or modified data file is uploaded into the bucket, all external table definitions that match the stage directory path read the file details into their metadata.
3. Sign in to the AWS Management Console.
4. Configure an event notification for your S3 bucket by using the instructions provided in the [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html). Complete the fields as shown in the following list:

   > * Name: Name of the event notification (for example, `Auto-ingest Snowflake`).
   > * Events: Select the ObjectCreate (All) and ObjectRemoved options.
   > * Send to: Select SQS Queue from the dropdown list.
   > * SQS: Select Add SQS queue ARN from the dropdown list.
   > * SQS queue ARN: Paste the SQS queue name from the SHOW EXTERNAL TABLES output.

> **Note:**
>
> These instructions create a single event notification that monitors activity for the entire S3 bucket. This is the simplest approach. This notification handles all external tables configured at a more granular level in the S3 bucket directory.
>
> Alternatively, in the previous steps, configure one or more paths and file extensions (or *prefixes* and *suffixes*, in AWS terminology) to filter event activity. For instructions, see the object key name filtering information in the relevant [AWS documentation topic](https://docs.aws.amazon.com/AmazonS3/latest/dev/NotificationHowTo.html). Repeat these steps for each additional path or file extension that you want the notification to monitor.
>
> AWS limits the number of these notification *queue configurations* to a maximum of 100 per S3 bucket.
>
> AWS doesn’t allow overlapping queue configurations (across event notifications) for the same S3 bucket. For example, if an existing notification is configured for `s3://mybucket/files/path1`, then you can’t create another notification at a higher level, such as `s3://mybucket/files`, or vice versa.

After you complete this step, the external stage with auto-refresh is configured.

When new or updated data files are added to the S3 bucket, the event notification informs Snowflake to scan them into the external table metadata.

### Step 4: Manually refresh external table metadata

Manually refresh the external table metadata one time by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) with the REFRESH parameter; for example:

> ```sqlexample
> ALTER EXTERNAL TABLE ext_table REFRESH;
> ```

This step ensures the metadata is synchronized with any changes to the file list that occurred after Step 2. Thereafter, the S3 event notifications trigger the metadata refresh automatically.

### Step 5: Configure security

For each additional role that you will use to query the external table, grant sufficient access control privileges on the various objects (that is, the databases, schemas, stage, and table) by using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE |  |
| External table | SELECT |  |

## Option 2: Configure Amazon SNS

This section provides step-by-step instructions about how to trigger external table metadata refreshing automatically by using [Amazon SQS (Simple Queue Service)](https://aws.amazon.com/sqs/) notifications for an S3 bucket. The steps show you how to configure [Amazon Simple Notification Service (SNS)](https://aws.amazon.com/sns/) as a broadcaster to publish event notifications for your S3 bucket to multiple subscribers (for example, SQS queues or AWS Lambda workloads), including the Snowflake SQS queue for external table refresh automation.

> > **Note:**
> >
> > For these instructions to work, you must have an event notification for the target path in your S3 bucket where your data files are located. If no event notification exists, do one of the following tasks:
> >
> > * Follow Option 1: Create a New S3 Event Notification earlier in this topic instead.
> > * Create an event notification for your S3 bucket, and then proceed with the instructions in this section. For more information, see the [Amazon S3 documentation](https://docs.aws.amazon.com/AmazonS3/latest/user-guide/enable-event-notifications.html).

### Prerequisite: Create an Amazon SNS Topic and Subscription

1. Create an SNS topic in your AWS account to handle all messages for the Snowflake stage location on your S3 bucket.
2. Subscribe your target destinations for the S3 event notifications (for example, other SQS queues or AWS Lambda workloads) to this topic. SNS publishes event notifications for your bucket to all subscribers to the topic.

For instructions, see the [SNS documentation](https://aws.amazon.com/documentation/sns/).

### Step 1: Subscribe the Snowflake SQS Queue to the SNS Topic

1. Sign in to the AWS Management Console.
2. From the home dashboard, choose Simple Notification Service (SNS).
3. Choose Topics from the left-hand navigation pane.
4. Locate the topic for your S3 bucket. Note the topic ARN.
5. Using a Snowflake client, query the [SYSTEM$GET_AWS_SNS_IAM_POLICY](../sql-reference/functions/system_get_aws_sns_iam_policy.md) system function with your SNS topic ARN:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('<sns_topic_arn>');
   > ```

   The function returns an IAM policy that grants a Snowflake SQS queue permission to subscribe to the SNS topic.

   For example:

   > ```sqlexample
   > select system$get_aws_sns_iam_policy('arn:aws:sns:us-west-2:001234567890:s3_mybucket');
   >
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | SYSTEM$GET_AWS_SNS_IAM_POLICY('ARN:AWS:SNS:US-WEST-2:001234567890:S3_MYBUCKET')                                                                                                                                                                   |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > | {"Version":"2012-10-17","Statement":[{"Sid":"1","Effect":"Allow","Principal":{"AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"},"Action":["sns:Subscribe"],"Resource":["arn:aws:sns:us-west-2:001234567890:s3_mybucket"]}]}                 |
   > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   > ```
6. Return to the AWS Management Console. Choose Topics from the left-hand navigation pane.
7. Select the topic for your S3 bucket, and click the Edit button. The Edit page opens.
8. Click Access policy - Optional to expand this area of the page.
9. Merge the IAM policy addition from the SYSTEM$GET_AWS_SNS_IAM_POLICY function results into the JSON document.

   For example:

   **Original IAM policy (abbreviated):**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      }
   >    ]
   >  }
   > ```

   **Merged IAM policy:**

   > ```sqljson
   > {
   >   "Version":"2008-10-17",
   >   "Id":"__default_policy_ID",
   >   "Statement":[
   >      {
   >         "Sid":"__default_statement_ID",
   >         "Effect":"Allow",
   >         "Principal":{
   >            "AWS":"*"
   >         }
   >         ..
   >      },
   >      {
   >         "Sid":"1",
   >         "Effect":"Allow",
   >         "Principal":{
   >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
   >          },
   >          "Action":[
   >            "sns:Subscribe"
   >          ],
   >          "Resource":[
   >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
   >          ]
   >      }
   >    ]
   >  }
   > ```
10. Add an additional policy grant to allow S3 to publish event notifications for the bucket to the SNS topic.

    For example (using the SNS topic ARN and S3 bucket used throughout these instructions):

    > ```sqljson
    > {
    >     "Sid":"s3-event-notifier",
    >     "Effect":"Allow",
    >     "Principal":{
    >        "Service":"s3.amazonaws.com"
    >     },
    >     "Action":"SNS:Publish",
    >     "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >     "Condition":{
    >        "ArnLike":{
    >           "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >        }
    >     }
    >  }
    > ```

    **Merged IAM policy:**

    > ```sqljson
    > {
    >   "Version":"2008-10-17",
    >   "Id":"__default_policy_ID",
    >   "Statement":[
    >      {
    >         "Sid":"__default_statement_ID",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "AWS":"*"
    >         }
    >         ..
    >      },
    >      {
    >         "Sid":"1",
    >         "Effect":"Allow",
    >         "Principal":{
    >           "AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"
    >          },
    >          "Action":[
    >            "sns:Subscribe"
    >          ],
    >          "Resource":[
    >            "arn:aws:sns:us-west-2:001234567890:s3_mybucket"
    >          ]
    >      },
    >      {
    >         "Sid":"s3-event-notifier",
    >         "Effect":"Allow",
    >         "Principal":{
    >            "Service":"s3.amazonaws.com"
    >         },
    >         "Action":"SNS:Publish",
    >         "Resource":"arn:aws:sns:us-west-2:001234567890:s3_mybucket",
    >         "Condition":{
    >            "ArnLike":{
    >               "aws:SourceArn":"arn:aws:s3:*:*:s3_mybucket"
    >            }
    >         }
    >       }
    >    ]
    >  }
    > ```
11. Click Save changes.

### (Optional) Step 2: Create a stage

Create an external stage that references your S3 bucket by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads your staged data files into the external table metadata.

Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure Secure Access to Cloud Storage earlier in this topic.
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `files`. The stage references a storage integration named `my_storage_int`:

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE mystage
>   URL = 's3://mybucket/files'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 3: Create an external table

Create an external table by using [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md). Identify the SNS topic ARN from Prerequisite: Create an Amazon SNS Topic and Subscription:

```sqlsyntax
CREATE EXTERNAL TABLE <table_name>
 ..
 AWS_SNS_TOPIC = '<sns_topic_arn>';
```

Where:

`AWS_SNS_TOPIC = '<sns_topic_arn>'`
:   Specifies the ARN for the SNS topic for your S3 bucket. The CREATE EXTERNAL TABLE statement subscribes the Snowflake SQS queue to the specified SNS topic.

For example, create an external table in the `mydb.public` schema that reads JSON data from staged files. The stage reference includes a folder path named `path1`. The external table appends this path to the stage definition, that is, the external table references the data files in `@mystage/files/path1`. The `AUTO_REFRESH` parameter is `TRUE` by default:

```sqlexample
CREATE EXTERNAL TABLE ext_table
 WITH LOCATION = @mystage/path1/
 FILE_FORMAT = (TYPE = JSON)
 AWS_SNS_TOPIC = 'arn:aws:sns:us-west-2:001234567890:s3_mybucket';
```

To remove this parameter from an external table, you must recreate the external table by using the CREATE OR REPLACE EXTERNAL TABLE syntax.

### Step 4: Manually refresh external table metadata

Manually refresh the external table metadata one time by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) with the REFRESH parameter; for example:

> ```sqlexample
> ALTER EXTERNAL TABLE ext_table REFRESH;
> ```

This step ensures that the metadata is synchronized with any changes to the file list that occurred after Step 3. Thereafter, the S3 event notifications trigger the metadata refresh automatically.

### Step 5: Configure security

For each additional role that you will use to query the external table, grant sufficient access control privileges on the various objects (that is, the databases, schemas, stage, and table) by using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE |  |
| External table | SELECT |  |

---
title: Refresh external tables automatically for Azure Blob Storage
source: https://docs.snowflake.com/en/user-guide/tables-external-azure.md
section: User Guide
---

# Refresh external tables automatically for Azure Blob Storage

You can create external tables and refresh the external table metadata automatically by using [Microsoft Azure Event Grid](https://azure.microsoft.com/en-us/services/event-grid/) notifications for an Azure container. This operation synchronizes the metadata with the latest set of associated files in the external stage and path.

The following list shows how the state of files in the path affects the table metadata:

> * New files in the path are added to the table metadata.
> * Changes to files in the path are updated in the table metadata.
> * Files no longer in the path are removed from the table metadata.

## Supported accounts, APIs, and schemas

Snowflake supports the following types of blob storage accounts:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v2

Automatic refresh of external table isn’t supported for Microsoft Fabric OneLake.
For OneLake external tables, you must manually refresh a table with [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) with the REFRESH parameter.

> **Note:**
>
> Only `Microsoft.Storage.BlobCreated` and `Microsoft.Storage.BlobDeleted` events trigger the refreshing of external table metadata. Adding new objects to blob storage triggers these events. Renaming a directory or object doesn’t trigger these events. Snowflake recommends that you only send supported events for external tables to reduce costs, event noise, and latency.

For cloud platform support, triggering automated external metadata refreshes using Azure Event Grid messages is supported by Snowflake accounts
hosted on Microsoft Azure (Azure).

Snowflake supports the following `Microsoft.Storage.BlobCreated` APIs:

* `CopyBlob`
* `PutBlob`
* `PutBlockList`
* `FlushWithClose`
* `SftpCommit`

Snowflake supports the following `Microsoft.Storage.BlobDeleted` APIs:

* `DeleteBlob`
* `DeleteFile`
* `SftpRemove`

For Data Lake Storage Gen2 storage accounts, `Microsoft.Storage.BlobCreated` events are triggered when clients use the `CreateFile`
and `FlushWithClose` operations. If the SSH File Transfer Protocol (SFTP) is used, `Microsoft.Storage.BlobCreated` events are triggered with `SftpCreate` and `SftpCommit` operations. The `CreateFile` or `SftpCreate` API alone does not indicate a commit of a file in the storage account. If the
`FlushWithClose` or `SftpCommit` message is not sent, Snowflake does not refresh the external table metadata.

Snowflake only supports the [Azure Event Grid event schema](https://learn.microsoft.com/en-us/azure/event-grid/event-schema); it doesn’t support the [CloudEvents schema with Azure Event Grid](https://learn.microsoft.com/en-us/azure/event-grid/cloud-event-schema).

External tables don’t support storage versioning (S3 versioning, Object Versioning in Google Cloud Storage, or versioning for Azure Storage).

## Prerequisites

Before you proceed, ensure you meet the following prerequisites:

> * A role that has the CREATE STAGE and CREATE EXTERNAL TABLE privileges on a schema.
> * Administrative access to Microsoft Azure. If you aren’t an Azure administrator, ask your Azure administrator to complete the steps in Step 1: Configure the Event Grid subscription.
> * A notification integration so that you can refresh external tables automatically for Azure Blob Storage.

## Configure secure access to Cloud Storage

> **Important:**
>
> If you already configured secure access to the Azure blob storage container that stores your data files, you can skip this section, and proceed to Step 1: Configure the Event Grid subscription.

You must configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage to a Snowflake identity and access management (IAM) entity.

> **Note:**
>
> Snowflake strongly recommends that you configure secure access so that you don’t need to supply IAM credentials when you access cloud storage. For information about additional storage access options, see [Configure an Azure container for loading data](data-load-azure-config.md).

This section describes how to use storage integrations to allow Snowflake to read data from and write data to an Azure container referenced in an external (Azure) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as secret keys or access tokens. Integration objects store an Azure identity and access management (IAM) user ID called the *app registration*. An administrator in your organization grants this app the necessary permissions in the Azure account.

An integration must also specify containers (and optional paths) that limit the locations users can specify when creating external stages that use the integration.

> **Note:**
>
> Completing the instructions in this section requires permissions in Azure to manage storage accounts. If you are not an Azure administrator, ask your Azure administrator to perform these tasks.

**In this Section:**

### Step 1: Create a cloud storage integration in Snowflake

Create a storage integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. A storage integration is a Snowflake object that stores a generated service principal for your Azure cloud storage, along with an optional set of allowed or blocked storage locations (that is, containers). Cloud provider administrators in your organization grant permissions on the storage locations to the generated service principal. This option allows users to avoid supplying credentials when creating stages or loading data.

A single storage integration can support multiple external (that is, Azure) stages. The URL in the stage definition must align with the Azure containers (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'AZURE'
  ENABLED = TRUE
  AZURE_TENANT_ID = '<tenant_id>'
  STORAGE_ALLOWED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('azure://<account>.blob.core.windows.net/<container>/<path>/', 'azure://<account>.blob.core.windows.net/<container>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `tenant_id` is the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can authenticate to only one tenant, so the allowed and blocked storage locations must refer to storage accounts that all belong this tenant.

  To find your tenant ID, sign in to the Azure portal and click Azure Active Directory » Properties. The tenant ID is displayed in the Tenant ID field.
* `container` is the name of an Azure container that stores your data files (for example, `mycontainer`). The STORAGE_ALLOWED_LOCATIONS and STORAGE_BLOCKED_LOCATIONS parameters allow or block access to these containers, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over logical directories in the container.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two containers and paths. In a later step, we will create an external stage that references one of these containers and paths. Multiple external stages that use this integration can reference the allowed containers and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>   STORAGE_ALLOWED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/')
>   STORAGE_BLOCKED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer1/mypath1/sensitivedata/', 'azure://myaccount.blob.core.windows.net/mycontainer2/mypath2/sensitivedata/');
> ```

### Step 2: Grant Snowflake Access to the Storage Locations

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC STORAGE INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage Integration in Snowflake.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed storage locations.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to be granted an access token on specified resources inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Services » Storage Accounts. Click the name of the storage account you are granting the Snowflake service principal access to.
6. Click Access Control (IAM) » Add role assignment.
7. Select the desired role to grant to the Snowflake service principal:

   * `Storage Blob Data Reader` grants read access only. This allows loading data from files staged in the storage account.
   * `Storage Blob Data Contributor` grants read and write access. This allows loading data from or unloading data to files staged in
     the storage account. The role also allows executing the [REMOVE](../sql-reference/sql/remove.md) command to remove files staged in the
     storage account.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC STORAGE INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the storage integration stops working.
9. Click the Review + assign button.

   > **Note:**
   > * According to the Microsoft Azure documentation, role assignments may take up to five minutes to propagate.
   > * Snowflake caches the temporary credentials for a period that cannot exceed the 60 minute expiration time. If you revoke access from Snowflake, users might be able to list files and load data from the cloud storage location until the cache expires.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configure automation with Azure Event Grid

### Step 1: Configure the Event Grid subscription

This section describes how to set up an Event Grid subscription for Azure Storage events using the Azure CLI. For more information about the steps described in this section, see the following articles in the Azure documentation:

* <https://docs.microsoft.com/en-us/azure/event-grid/custom-event-to-queue-storage>
* <https://docs.microsoft.com/en-us/azure/storage/blobs/storage-blob-event-quickstart>

#### Create a resource group

An Event Grid *topic* provides an endpoint where the source (that is, Azure Storage) sends events. A topic is used for a collection of related events. Event Grid topics are Azure resources, and must be placed in an Azure resource group.

Execute the following command to create a resource group:

```bash
az group create --name <resource_group_name> --location <location>
```

Where:

* `resource_group_name` is the name of the new resource group.
* `location` is the location, or *region* in Snowflake terminology, of your Azure Storage account.

#### Enable the Event Grid resource provider

Execute the following command to register the Event Grid resource provider. Note that this step is only required if you have not previously used Event Grid with your Azure account:

```bash
az provider register --namespace Microsoft.EventGrid
az provider show --namespace Microsoft.EventGrid --query "registrationState"
```

#### Create a storage account for data files

Execute the following command to create a storage account to store your data files. This account must be either a Blob storage (that is, a `BlobStorage` kind) or GPv2 (that is, a `StorageV2` kind) account, because only these two account types support event messages.

> **Note:**
>
> If you already have a Blob storage or GPv2 account, you can use that account instead.

For example, create a Blob storage account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind BlobStorage --access-tier Hot
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a Resource Group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a storage account for the storage queue

Execute the following command to create a storage account to host your storage queue. This account must be a GPv2 account, because only this kind of account supports event messages to a storage queue.

> **Note:**
>
> If you already have a GPv2 account, you can use that account to host both your data files and your storage queue.

For example, create a GPv2 account:

```bash
az storage account create --resource-group <resource_group_name> --name <storage_account_name> --sku Standard_LRS --location <location> --kind StorageV2
```

Where:

* `resource_group_name` is the name of the resource group you created in Create a resource group.
* `storage_account_name` is the name of the new storage account.
* `location` is the location of your Azure Storage account.

#### Create a storage queue

A single Azure Queue Storage queue can collect the event messages for many Event Grid subscriptions. For best performance, Snowflake recommends creating a single storage queue to accommodate all of your subscriptions related to Snowflake.

Execute the following command to create a storage queue. A storage queue stores a set of messages, in this case event messages from Event Grid:

```bash
az storage queue create --name <storage_queue_name> --account-name <storage_account_name>
```

Where:

* `storage_queue_name` is the name of the new storage queue.
* `storage_account_name` is the name of the storage account you created in Create a storage account for the storage queue.

#### Export the storage account and queue IDs for Reference

Execute the following commands to set environment variables for the storage account and queue IDs that will be requested later in these instructions:

* Linux or macOS:

  ```bash
  export storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  export queueid="$queuestorageid/queueservices/default/queues/<storage_queue_name>"
  ```
* Windows:

  ```bash
  set storageid=$(az storage account show --name <data_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queuestorageid=$(az storage account show --name <queue_storage_account_name> --resource-group <resource_group_name> --query id --output tsv)
  set queueid="%queuestorageid%/queueservices/default/queues/<storage_queue_name>"
  ```

Where:

* `data_storage_account_name` is the name of the storage account you created in Create a storage account for data files.
* `queue_storage_account_name` is the name of the storage account you created in Create a storage account for the storage queue.
* `resource_group_name` is the name of the resource group you created in Create a resource group.
* `storage_queue_name` is the name of the storage queue you created in Create a storage queue.

#### Install the Event Grid extension

Execute the following command to install the Event Grid extension for Azure CLI:

```bash
az extension add --name eventgrid
```

#### Create the Event Grid subscription

Execute the following command to create the Event Grid subscription. Subscribing to a topic informs Event Grid which events to track:

* Linux or macOS:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id $storageid \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint $queueid \
  --advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit DeleteBlob DeleteFile SftpRemove
  ```
* Windows:

  ```bash
  az eventgrid event-subscription create \
  --source-resource-id %storageid% \
  --name <subscription_name> --endpoint-type storagequeue \
  --endpoint %queueid% \
  -advanced-filter data.api stringin CopyBlob PutBlob PutBlockList FlushWithClose SftpCommit DeleteBlob DeleteFile SftpRemove
  ```

Where:

* `storageid` and `queueid` are the storage account and queue ID environment variables you set in Export the storage account and queue IDs for reference.
* `subscription_name` is the name of the new Event Grid subscription.

### Step 2: Create the notification integration

A *notification integration* is a Snowflake object that provides an interface between Snowflake and a third-party cloud message queuing service such as Azure Event Grid.

> **Note:**
>
> A single notification integration supports a single Azure Storage queue. Referencing the same storage queue in multiple notification integrations can result in missing data in target tables because event notifications are split between notification integrations.

#### Retrieve the storage queue URL and tenant ID

1. Sign in to the Microsoft Azure portal.
2. Navigate to Storage account » Queue service » Queues. Record the URL for the queue you created in Create a storage queue for reference later. The URL has the following format:

   ```bash
   https://<storage_account_name>.queue.core.windows.net/<storage_queue_name>
   ```
3. Navigate to Azure Active Directory » Properties. Record the Tenant ID value for reference later. The directory ID, or *tenant ID*, is needed to generate the consent URL that grants Snowflake access to the Event Grid subscription.

#### Create the notification integration

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-azure.md) command.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The Azure service principal for notification integrations is different from the service principal created for storage integrations.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = '<queue_URL>'
  AZURE_TENANT_ID = '<directory_ID>';
```

Where:

* `integration_name` is the name of the new integration.
* `queue_URL` and `directory_ID` are the queue URL and tenant ID you recorded in Retrieve the storage queue URL and tenant ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  ENABLED = true
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = 'https://myqueue.queue.core.windows.net/mystoragequeue'
  AZURE_TENANT_ID = 'a123bcde-1234-5678-abc1-9abc12345678';
```

#### Grant Snowflake access to the storage queue

Note that specific steps in this section require a local installation of the Azure CLI.

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the consent URL:

   ```sqlexample
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Create the notification integration.

   Note the values in the following columns:

   AZURE_CONSENT_URL:
   :   URL to the Microsoft permissions request page.

   AZURE_MULTI_TENANT_APP_NAME:
   :   Name of the Snowflake client application created for your account. In a later step in this section, you will need to grant this
       application the permissions necessary to obtain an access token on your allowed topic.
2. In a web browser, navigate to the URL in the AZURE_CONSENT_URL column. The page displays a Microsoft permissions request page.
3. Click the Accept button. This action allows the Azure service principal created for your Snowflake account to obtain an access
   token on any resource inside your tenant. Obtaining an access token succeeds only if you grant the service principal the appropriate
   permissions on the container (see the next step).

   The Microsoft permissions request page redirects to the Snowflake corporate site (snowflake.com).
4. Sign in to the Microsoft Azure portal.
5. Navigate to Azure Active Directory » Enterprise applications. Verify that the Snowflake application identifier you
   recorded in Step 2 in this section is listed.

   > **Important:**
   >
   > If you delete the Snowflake application in Azure Active Directory at a later time, the notification integration stops working.
6. Navigate to Queues » `storage_queue_name`, where `storage_queue_name` is the name of the storage queue you created in Create a storage queue.
7. Click Access Control (IAM) » Add role assignment.
8. Search for the Snowflake service principal. This is the identity in the AZURE_MULTI_TENANT_APP_NAME property in the DESC NOTIFICATION
   INTEGRATION output (in Step 1). Search for the string before the underscore in the AZURE_MULTI_TENANT_APP_NAME property.

   > **Important:**
   > * It can take an hour or longer for Azure to create the Snowflake service principal requested through the Microsoft request page in
   >   this section. If the service principal is not available immediately, we recommend waiting an hour or two and then searching again.
   > * If you delete the service principal, the notification integration stops working.
9. Grant the Snowflake app the following permissions:

   * Role: Storage Queue Data Message Processor (the minimum required role), or Storage Queue Data Contributor.
   * Assign access to: Azure AD user, group, or service principal.
   * Select: The `appDisplayName` value.

   The Snowflake application identifier should now be listed under Storage Queue Data Message Processor or Storage Queue Data Contributor (on the same dialog).

### (Optional) Step 3: Creating a stage

Create an external stage that references your Azure container by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads your staged data files into the external table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure Secure Access to Cloud Storage earlier in this topic.
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the path `files`. The stage references a storage integration named `my_storage_int`:

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE mystage
>   URL='azure://myaccount.blob.core.windows.net/mycontainer/files/'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

> **Note:**
>
> Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.

### Step 4: Create an external table

Create an external table by using the [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) command.

For example, create an external table in the `mydb.public` schema that reads JSON data from files staged in the `mystage` stage with the `path1/` path:

```sqlexample
CREATE OR REPLACE EXTERNAL TABLE ext_table
 INTEGRATION = 'MY_NOTIFICATION_INT'
 WITH LOCATION = @mystage/path1/
 FILE_FORMAT = (TYPE = JSON);
```

The INTEGRATION parameter references the `my_notification_int` notification integration you created in Step 2: Create the notification integration. You must enter the integration name in all uppercase letters.

When a notification integration is provided, the `AUTO_REFRESH` parameter is `TRUE` by default. If there is no notification integration, AUTO_REFRESH is always `FALSE`.

After you complete this step, the external stage with auto-refresh is configured.

When new or updated data files are added to the Azure container, the event notification informs Snowflake to scan them into the external table metadata.

### Step 5: Manually refresh the external table metadata

Manually refresh the external table metadata once by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) with the REFRESH parameter; for example:

> ```sqlexample
> ALTER EXTERNAL TABLE ext_table REFRESH;
>
> +---------------------------------------------+----------------+-------------------------------+
> | file                                        | status         | description                   |
> |---------------------------------------------+----------------+-------------------------------|
> | files/path1/file1.json                      | REGISTERED_NEW | File registered successfully. |
> | files/path1/file2.json                      | REGISTERED_NEW | File registered successfully. |
> | files/path1/file3.json                      | REGISTERED_NEW | File registered successfully. |
> +---------------------------------------------+----------------+-------------------------------+
> ```

This step synchronizes the metadata with the list of files in the stage and path in the external table definition. Also, this step ensures that the external table can read the data files in the specified stage and path, and that no files were missed in the external table definition.

If the list of files in the `file` column doesn’t match your expectations, verify the paths in the external table definition and external stage definition. Any path in the external table definition is appended to any path specified in the stage definition. For more information, see [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md).

> **Important:**
>
> If this step is not completed successfully at least once after the external table is created, querying the external table returns no results until an Event Grid notification refreshes the external table metadata automatically for the first time.

This step ensures that the metadata is synchronized with any changes to the file list that occurred after Step 4. Thereafter, Event Grid notifications trigger the metadata refresh automatically.

### Step 6: Configure security

For each additional role that you will use to query the external table, grant sufficient access control privileges on the various objects (that is, the databases, schemas, stage, and table) by using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created in (Optional) Step 3: Creating a stage references a named file format. |
| External table | SELECT |  |

---
title: Refresh external tables automatically for Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/tables-external-gcs.md
section: User Guide
---

# Refresh external tables automatically for Google Cloud Storage

You can trigger external table metadata refreshes by using
[Google Cloud Pub/Sub](https://cloud.google.com/storage/docs/reporting-changes) messages for Google Cloud Storage (GCS) events.

## Prerequisites

Before you proceed, ensure you meet the following prerequisites:

> * A role that has the CREATE STAGE and CREATE EXTERNAL TABLE privileges on a schema.
> * Administrative access to Google Cloud (GC). If you aren’t a GC administrator, ask your GC
>   administrator to complete the prerequisite steps.
> * Only `OBJECT_DELETE` and `OBJECT_FINALIZE` events trigger refreshes for external table metadata.
>   To reduce costs, event noise, and latency, send only supported events for external tables.
> * External tables don’t support storage versioning (S3 versioning, Object Versioning in Google Cloud Storage, or versioning for Azure Storage).

## Cloud platform support

Triggering automated external metadata refreshes by using GCS Pub/Sub event messages is supported by Snowflake accounts
hosted on Google Cloud (GC).

## Configure secure access to Cloud Storage

> **Important:**
>
> If you have already configured secure access to the GCS bucket that stores your data files, you can skip this section and proceed to Configure automation using GCS Pub/Sub.

You must configure a Snowflake storage integration object to delegate authentication responsibility for cloud storage
to a Snowflake identity and access management (IAM) entity.

This section describes how to use storage integrations to allow Snowflake to read data from and write to a Google Cloud Storage bucket referenced in an external
(that is, Cloud Storage) stage. Integrations are named, first-class Snowflake objects that avoid the need for passing explicit cloud provider credentials such as
secret keys or access tokens; instead, integration objects reference a Cloud Storage service account. An administrator in your organization grants the service
account permissions in the Cloud Storage account.

Administrators can also restrict users to a specific set of Cloud Storage buckets (and optional paths) accessed by external stages that use the integration.

> **Note:**
>
> * Completing the instructions in this section requires access to your Cloud Storage project as a project editor. If you are not a project
>   editor, ask your Cloud Storage administrator to perform these tasks.
> * Confirm that Snowflake supports the Google Cloud Storage region that your storage is hosted in. For more information, see
>   [Supported cloud regions](intro-regions.md).

The following diagram shows the integration flow for a Cloud Storage stage:

1. An external (that is, Cloud Storage) stage references a storage integration object in its definition.
2. Snowflake automatically associates the storage integration with a Cloud Storage service account created for your account. Snowflake creates a single service account that is referenced by all GCS storage integrations in your Snowflake account.
3. A project editor for your Cloud Storage project grants permissions to the service account to access the bucket referenced in the stage definition. Note that many external stage objects can reference different buckets and paths and use the same integration for authentication.

When a user loads or unloads data from or to a stage, Snowflake verifies the permissions granted to the service account on the bucket before allowing or denying access.

**In this Section:**

### Step 1: Create a Cloud Storage integration in Snowflake

Create an integration using the [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) command. An integration is a Snowflake object that delegates authentication responsibility for external cloud storage to a Snowflake-generated entity (that is, a Cloud Storage service account). For accessing Cloud Storage buckets, Snowflake creates a service account that can be granted permissions to access the bucket(s) that store your data files.

A single storage integration can support multiple external (that is, GCS) stages. The URL in the stage definition must align with the GCS buckets (and optional paths) specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.

```sqlsyntax
CREATE STORAGE INTEGRATION <integration_name>
  TYPE = EXTERNAL_STAGE
  STORAGE_PROVIDER = 'GCS'
  ENABLED = TRUE
  STORAGE_ALLOWED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/')
  [ STORAGE_BLOCKED_LOCATIONS = ('gcs://<bucket>/<path>/', 'gcs://<bucket>/<path>/') ]
```

Where:

* `integration_name` is the name of the new integration.
* `bucket` is the name of a Cloud Storage bucket that stores your data files (for example, `mybucket`). The required STORAGE_ALLOWED_LOCATIONS parameter and optional STORAGE_BLOCKED_LOCATIONS parameter restrict or block access to these buckets, respectively, when stages that reference this integration are created or modified.
* `path` is an optional path that can be used to provide granular control over objects in the bucket.

The following example creates an integration that explicitly limits external stages that use the integration to reference either of two buckets and paths. In a later step, we will create an external stage that references one of these buckets and paths.

Additional external stages that also use this integration can reference the allowed buckets and paths:

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/', 'gcs://mybucket2/path2/')
>   STORAGE_BLOCKED_LOCATIONS = ('gcs://mybucket1/path1/sensitivedata/', 'gcs://mybucket2/path2/sensitivedata/');
> ```

### Step 2: Retrieve the Cloud Storage service account for your Snowflake account

Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the ID for the Cloud Storage service account that was created automatically for your Snowflake account:

```sqlsyntax
DESC STORAGE INTEGRATION <integration_name>;
```

Where:

> * `integration_name` is the name of the integration you created in Step 1: Create a Cloud Storage integration in Snowflake (in this topic).

For example:

> ```sqlexample
> DESC STORAGE INTEGRATION gcs_int;
>
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> | property                    | property_type | property_value                                                              | property_default |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------|
> | ENABLED                     | Boolean       | true                                                                        | false            |
> | STORAGE_ALLOWED_LOCATIONS   | List          | gcs://mybucket1/path1/,gcs://mybucket2/path2/                               | []               |
> | STORAGE_BLOCKED_LOCATIONS   | List          | gcs://mybucket1/path1/sensitivedata/,gcs://mybucket2/path2/sensitivedata/   | []               |
> | STORAGE_GCP_SERVICE_ACCOUNT | String        | service-account-id@project1-123456.iam.gserviceaccount.com                  |                  |
> +-----------------------------+---------------+-----------------------------------------------------------------------------+------------------+
> ```

The STORAGE_GCP_SERVICE_ACCOUNT property in the output shows the Cloud Storage service account created for your Snowflake account (that is, `service-account-id@project1-123456.iam.gserviceaccount.com`). We provision a single Cloud Storage service account for your entire Snowflake account. All Cloud Storage integrations use that service account.

### Step 3: Grant the service account permissions to access bucket objects

The following step-by-step instructions describe how to configure IAM access permissions for Snowflake in your Google Cloud console so that you can use a Cloud Storage bucket to load and unload data:

#### Create a custom IAM role

Create a custom role that has the permissions required to access the bucket and get objects.

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select IAM & Admin » Roles.
3. Select Create Role.
4. Enter a Title and optional Description for the custom role.
5. Select Add Permissions.
6. Filter the list of permissions, and add the following from the list:

   > | Action(s) | Required permissions |
   > | --- | --- |
   > | Data loading only | * `storage.buckets.get` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading with purge option, executing the REMOVE command on the stage | * `storage.buckets.get` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data loading and unloading | * `storage.buckets.get` (for calculating data transfer costs) * `storage.objects.create` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` |
   > | Data unloading only | * `storage.buckets.get` * `storage.objects.create` * `storage.objects.delete` * `storage.objects.list` |
   > | Using [COPY FILES](../sql-reference/sql/copy-files.md) to copy files to an external stage | You must have the following additional permissions:  * `storage.multipartUploads.abort` * `storage.multipartUploads.create` * `storage.multipartUploads.list` * `storage.multipartUploads.listParts` |
7. Select Add.
8. Select Create.

#### Assign the custom role to the Cloud Storage Service Account

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, select Cloud Storage » Buckets.
3. Filter the list of buckets, and select the bucket that you specified when you created your storage integration.
4. Select Permissions » View by principals, then select Grant access.
5. Under Add principals, paste the name of the service account name that you retrieved from the DESC STORAGE INTEGRATION command output.
6. Under Assign roles, select the custom IAM role that you created previously, then select Save.

> **Important:**
>
> If your Google Cloud organization was created on or after May 3, 2024, Google Cloud enforces a
> [domain restriction constraint](https://cloud.google.com/resource-manager/docs/organization-policy/restricting-domains)
> in project organization policies. The default constraint lists your domain as the only allowed value.
>
> To allow the Snowflake service account access to your storage, you must
> [update the domain restriction](data-load-gcs-allow.md).

#### Grant the Cloud Storage service account permissions on the Cloud Key Management Service cryptographic keys

> **Note:**
>
> This step is required only if your GCS bucket is encrypted using a key stored in the Google Cloud Key Management Service (Cloud KMS).

1. Sign in to the Google Cloud console as a project editor.
2. From the home dashboard, search for and select Security » Key Management.
3. Select the key ring that is assigned to your GCS bucket.
4. Click SHOW INFO PANEL in the upper-right corner. The information panel for the key ring slides out.
5. Click the ADD PRINCIPAL button.
6. In the New principals field, search for the service account name from the DESCRIBE INTEGRATION output in Step 2: Retrieve the Cloud Storage service account for your Snowflake account (in this topic).
7. From the Select a role dropdown, select the `Cloud KMS CrytoKey Encryptor/Decryptor` role.
8. Click the Save button. The service account name is added to the Cloud KMS CrytoKey Encryptor/Decryptor role dropdown in the information panel.

> **Note:**
>
> You can use the [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../sql-reference/functions/system_validate_storage_integration.md)
> function to validate the configuration for your storage integration.

## Configure automation using GCS Pub/Sub

### Prerequisites

The instructions in this topic assume the following items have been created and configured:

GCP account:
:   * Pub/Sub topic that receives event messages from the GCS bucket. For more information, see Create the Pub/Sub topic (in this topic).
    * Subscription that receives event messages from the Pub/Sub topic. For more information, see Create the Pub/Sub subscription (in this topic).

    For instructions, see the [Pub/Sub documentation](https://cloud.google.com/pubsub/docs).

Snowflake:
:   * Target table in the Snowflake database where your data will be loaded.

#### Create the Pub/Sub topic

Create a Pub/Sub topic using [Cloud Shell](https://cloud.google.com/shell) or [Cloud SDK](https://cloud.google.com/sdk).

Execute the following command to create the topic and enable it to listen for activity in the specified GCS bucket:

```bash
$ gsutil notification create -t <topic> -f json -e OBJECT_FINALIZE -e OBJECT_DELETE gs://<bucket-name>
```

Where:

* `<topic>` is the name for the topic.
* `<bucket-name>` is the name of your GCS bucket.

If the topic already exists, the command uses it; otherwise, a new topic is created.

For more information, see [Using Pub/Sub notifications for Cloud Storage](https://cloud.google.com/storage/docs/reporting-changes) in the Pub/Sub documentation.

#### Create the Pub/Sub subscription

Create a subscription with pull delivery to the Pub/Sub topic using the Cloud Console, `gcloud` command-line tool, or the Cloud Pub/Sub API. For instructions, see [Managing topics and subscriptions](https://cloud.google.com/pubsub/docs/admin) in the Pub/Sub documentation.

> **Note:**
>
> * Only Pub/Sub subscriptions that use the default pull delivery are supported with Snowflake. Push delivery is not supported.

#### Retrieve the Pub/Sub subscription ID

The Pub/Sub topic subscription ID is used in these instructions to allow Snowflake access to event messages.

1. Log into the Google Cloud Platform Console as a project editor.
2. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
3. Copy the ID in the Subscription ID column for the topic subscription

### Step 1: Create a notification integration in Snowflake

Create a notification integration using the
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-queue-inbound-gcp.md) command.

The notification integration references your Pub/Sub subscription. Snowflake associates the notification integration with a GCS
service account created for your account. Snowflake creates a single service account that is referenced by all GCS notification
integrations in your Snowflake account.

> **Note:**
>
> * Only account administrators (users with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege can execute this SQL command.
> * The GCS service account for notification integrations is different from the service account created for storage integrations.
> * A single notification integration supports a single Google Cloud Pub/Sub subscription. Referencing the same Pub/Sub subscription in multiple notification integrations can result in missing data in target tables because event notifications are split between notification integrations.

```sqlsyntax
CREATE NOTIFICATION INTEGRATION <integration_name>
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = '<subscription_id>';
```

Where:

* `integration_name` is the name of the new integration.
* `subscription_id` is the subscription name you recorded in Retrieve the Pub/Sub subscription ID.

For example:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_notification_int
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  ENABLED = true
  GCP_PUBSUB_SUBSCRIPTION_NAME = 'projects/project-1234/subscriptions/sub2';
```

### Step 2: Grant Snowflake access to the Pub/Sub subscription

1. Execute the [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command to retrieve the Snowflake service account ID:

   ```sqlsyntax
   DESC NOTIFICATION INTEGRATION <integration_name>;
   ```

   Where:

   * `integration_name` is the name of the integration you created in Step 1: Create a Notification Integration in Snowflake.

   For example:

   > ```sqlexample
   > DESC NOTIFICATION INTEGRATION my_notification_int;
   > ```
2. Record the service account name in the GCP_PUBSUB_SERVICE_ACCOUNT column, which has the following format:

   ```bash
   <service_account>@<project_id>.iam.gserviceaccount.com
   ```
3. Log into the Google Cloud Platform Console as a project editor.
4. From the home dashboard, choose Big Data » Pub/Sub » Subscriptions.
5. Select the subscription to configure for access.
6. Click SHOW INFO PANEL in the upper-right corner. The information panel for the subscription slides out.
7. Click the ADD PRINCIPAL button.
8. In the New principals field, search for the service account name you recorded.
9. From the Select a role dropdown, select Pub/Sub Subscriber.
10. Click the Save button. The service account name is added to the Pub/Sub Subscriber role dropdown in the information panel.
11. Navigate to the Dashboard page in the Cloud Console, and select your project from the dropdown list.
12. Click the ADD PEOPLE TO THIS PROJECT button.
13. Add the service account name you recorded.
14. From the Select a role dropdown, select Monitoring Viewer.
15. Click the Save button. The service account name is added to the Monitoring Viewer role.

### (Optional) Step 3: Create a stage

Create an external stage that references your GCS bucket by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command. Snowflake reads your
staged data files into the external table metadata. Alternatively, you can use an existing external stage.

> **Note:**
>
> * To configure secure access to the cloud storage location, see Configure secure access to Cloud Storage earlier in this topic.
> * To reference a storage integration in the CREATE STAGE statement, the role must have the USAGE privilege on the storage integration
>   object.

The following example creates a stage named `mystage` in the active schema for the user session. The cloud storage URL includes the
path `files`. The stage references a storage integration named `my_storage_int`:

> ```sqlexample
> USE SCHEMA mydb.public;
>
> CREATE STAGE mystage
>   URL='gcs://load/files/'
>   STORAGE_INTEGRATION = my_storage_int;
> ```

### Step 4: Create an external table

Create an external table by using the [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) command.

For example, create an external table in the `mydb.public` schema that reads JSON data from files staged in the `mystage` stage with
the `path1/` path.

The INTEGRATION parameter references the `my_notification_int` notification integration you created in
Step 1: Create a notification integration in Snowflake. You must enter the integration name in all uppercase letters.

The `AUTO_REFRESH` parameter is `TRUE` by default:

```sqlexample
CREATE OR REPLACE EXTERNAL TABLE ext_table
 INTEGRATION = 'MY_NOTIFICATION_INT'
 WITH LOCATION = @mystage/path1/
 FILE_FORMAT = (TYPE = JSON);
```

After you complete this step, the external stage with auto-refresh is configured.

When new or updated data files are added to the GCS bucket, the event notification informs Snowflake to scan them into the external
table metadata.

### Step 5: Manually refresh the external table metadata

Manually refresh the external table metadata once by using [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) with the REFRESH parameter; for example:

> ```sqlexample
> ALTER EXTERNAL TABLE ext_table REFRESH;
>
> +---------------------------------------------+----------------+-------------------------------+
> | file                                        | status         | description                   |
> |---------------------------------------------+----------------+-------------------------------|
> | files/path1/file1.json                      | REGISTERED_NEW | File registered successfully. |
> | files/path1/file2.json                      | REGISTERED_NEW | File registered successfully. |
> | files/path1/file3.json                      | REGISTERED_NEW | File registered successfully. |
> +---------------------------------------------+----------------+-------------------------------+
> ```

This step synchronizes the metadata with the list of files in the stage and path in the external table definition. Also, this step ensures
that the external table can read the data files in the specified stage and path, and that no files were missed in the external table definition.

If the list of files in the `file` column doesn’t match your expectations, verify the paths in the external table definition and
external stage definition. Any path in the external table definition is appended to any path specified in the stage definition. For more
information, see [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md).

> **Important:**
>
> If this step is not completed successfully at least once after the external table is created, querying the external table returns no
> results until a Pub/Sub notification refreshes the external table metadata automatically for the first time.

This step ensures that the metadata is synchronized with any changes to the file list that occurred after Step 4. Thereafter, Pub/Sub
notifications trigger the metadata refresh automatically.

### Step 6: Configure security

For each additional role that you will use to query the external table, grant sufficient access control privileges on the various
objects (that is, the databases, schemas, stage, and table) by using [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md):

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Named stage | USAGE , READ |  |
| Named file format | USAGE | Optional; only needed if the stage you created in (Optional) Step 3: Create a stage references a named file format. |
| External table | SELECT |  |

---
title: Register a service connection
source: https://docs.snowflake.com/en/user-guide/opencatalog/register-service-connection.md
section: User Guide
---

# Register a service connection

This topic covers how to register your service connection credentials with Snowflake or your third-party service (for example, Apache Spark™).
The Snowflake Open Catalog administrator registers a service connection.

The example code in this topic shows how to register a service connection in Spark, and the example code is in PySpark.

## Prerequisites

Before you can register a service connection, you need to configure a service connection. For instructions, see [Configure a service connection](configure-service-connection.md).

## Register a service connection

The following example code is for registering a single service connection.

**Note**

> You can also register multiple service connections; see Example 2: Register two service connections.

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<maven_coordinate>` | Specifies the Maven coordinate for your external cloud storage provider:  * **S3:** software.amazon.awssdk:bundle:2.20.160 * **Cloud Storage (from Google):** org.apache.iceberg:iceberg-gcp-bundle:1.5.2 * **Azure:** org.apache.iceberg:iceberg-azure-bundle:1.5.2  If you don’t see this parameter, the correct value is already specified in the code sample. |
| `<client_id>` | Specifies the client ID for the service principal to use.   Enter the **Client ID** that you copied when you configured a new service connection. |
| `<client_secret>` | Specifies the client secret for the service principal to use.   Enter the **Secret** that you copied when you configured a new service connection. |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account.   Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<principal_role_name>` | Specifies the principal role that is granted to the service principal.  To view this principal role, in Open Catalog, select the **Connections** page, select your service connection, and in the **Principal Details** dialog, refer to **Principal Roles.** |

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,<maven_coordinate>') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','<client_id>:<client_secret>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
    .getOrCreate()
```

## Register a cross-region service connection (Amazon S3 only)

The following example code is for registering a service connection when the following is true:

* Your Open Catalog account is hosted on Amazon S3.
* Your external storage provider is Amazon S3.
* Your Open Catalog account is hosted in an S3 region that is different from the S3 region where the storage bucket containing your Apache Iceberg™ tables is located.

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','<client_id>:<client_secret>') \
    .config('spark.sql.catalog.opencatalog.warehouse','<catalog_name>') \
    .config('spark.sql.catalog.opencatalog.client.region','<target_s3_region>') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:<principal_role_name>') \
    .getOrCreate()
```

### Parameters

| Parameter | Description |
| --- | --- |
| `<catalog_name>` | Specifies the name of the catalog to connect to.   **Important**: <catalog_name> is case sensitive. |
| `<client_id>` | Specifies the client ID for the service principal to use. |
| `<client_secret>` | Specifies the client secret for the service principal to use. |
| `<open_catalog_account_identifier>` | Specifies the account identifier for your Open Catalog account. Depending on the region and cloud platform for the account, this identifier might be the account locator by itself (for example, `xy12345`) or include additional segments. For more information, see [Using an account locator as an identifier](https://docs.snowflake.com/en/user-guide/admin-account-identifier#using-an-account-locator-as-an-identifier). |
| `<target_s3_region>` | Specifies the region code where the S3 bucket containing your Apache Iceberg tables is located. For the region codes, see [AWS service endpoints](https://docs.aws.amazon.com/general/latest/gr/s3.html#s3_region) and refer to the Region column in the table. |
| `<principal_role_name>` | Specifies the principal role that is granted to the service principal. |

## Examples

This section contains examples of registering a service connection in Spark.

* Example 1: Register a single service connection (S3)
* Example 2: Register two service connections (S3)
* Example 3: Register a single service connection (Cloud Storage from Google)
* Example 4: Register a single service connection (Azure)

### Example 1: Register a single service connection (S3)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','000000000000000000000000000=:1111111111111111111111111111111111111111111=') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:data_engineer') \
    .getOrCreate()
```

### Example 2: Register two service connections (S3)

**Important**

> When registering multiple service connections, you must change the `opencatalog` instances in the code for the first connection to
> unique text in the code for each subsequent connection. For example, in the following code, the `opencatalog` instances for the first connection
> are changed to `opencatalog1` for the second connection:

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,software.amazon.awssdk:bundle:2.20.160') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','000000000000000000000000000=:1111111111111111111111111111111111111111111=') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:data_scientist') \
    .config('spark.sql.catalog.opencatalog1', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog1.type', 'rest') \
    .config('spark.sql.catalog.opencatalog1.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog1.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog1.credential','222222222222222222222222222=:3333333333333333333333333333333333333333333=') \
    .config('spark.sql.catalog.opencatalog1.warehouse','Catalog2') \
    .config('spark.sql.catalog.opencatalog1.scope','PRINCIPAL_ROLE:data_scientist') \
    .getOrCreate()
```

### Example 3: Register a single service connection (Cloud Storage from Google)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-gcp-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','000000000000000000000000000=:1111111111111111111111111111111111111111111=') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:data_engineer') \
    .getOrCreate()
```

### Example 4: Register a single service connection (Azure)

```python
import pyspark
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('iceberg_lab') \
    .config('spark.jars.packages', 'org.apache.iceberg:iceberg-spark-runtime-3.5_2.12:1.4.1,org.apache.iceberg:iceberg-azure-bundle:1.5.2') \
    .config('spark.sql.extensions', 'org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions') \
    .config('spark.sql.defaultCatalog', 'opencatalog') \
    .config('spark.sql.catalog.opencatalog', 'org.apache.iceberg.spark.SparkCatalog') \
    .config('spark.sql.catalog.opencatalog.type', 'rest') \
    .config('spark.sql.catalog.opencatalog.uri','https://ab12345.snowflakecomputing.com/polaris/api/catalog') \
    .config('spark.sql.catalog.opencatalog.header.X-Iceberg-Access-Delegation','vended-credentials') \
    .config('spark.sql.catalog.opencatalog.credential','000000000000000000000000000=:1111111111111111111111111111111111111111111=') \
    .config('spark.sql.catalog.opencatalog.warehouse','Catalog1') \
    .config('spark.sql.catalog.opencatalog.scope','PRINCIPAL_ROLE:data_engineer') \
    .getOrCreate()
```

## Verify the connection to Open Catalog

To verify that Spark is connected to Open Catalog, list the namespaces for the catalog. For more information,
see [List namespaces](spark-code-examples.md).

---
title: Regulatory compliance
source: https://docs.snowflake.com/en/user-guide/intro-compliance.md
section: User Guide
---

# Regulatory compliance

Snowflake is committed to meeting industry-standard regulatory compliance requirements to provide our customers the highest levels of
assurance for data integrity, security, and governance.

This topic is a reference for Snowflake certifications based on geographic regions.

## Global

* [CSA Star Level 1](cert-csa-star-level-1.md)
* [ISO-9001:2015](cert-iso-9001_2015.md)
* [ISO-27001](cert-iso-27001.md)
* [ISO-27017](cert-iso-27017.md)
* [ISO-27018](cert-iso-27018.md)
* [SOC 1 Type II](cert-soc-1.md)
* [SOC 2 Type II](cert-soc-2.md)

## U.S. government

* [CJIS](cert-cjis.md)
* [Department of Defense (DOD) Impact Level 5 (IL5)](cert-dodIL5.md)
* [FedRAMP (Moderate and High)](cert-fedramp.md)
* [GovRAMP (Moderate and High)](cert-stateramp.md)
* [IRS Publication 1075](cert-irspub1075.md)
* [ITAR](cert-itar.md)
* [NIST SP 800-171](cert-nist.md)
* [SSDF](cert-ssdf.md)
* [TX-RAMP](cert-txramp.md)

## Healthcare and life sciences

* [HITRUST CSF](cert-hitrust.md)

## Financial services

* [PCI DSS](cert-pci-dss.md)

## Regional — Australia

* [IRAP (Protected)](cert-irap.md)

## Regional — Germany

* [C5 (Cloud Computing Compliance Controls Catalog)](cert-c5.md)
* [TISAX (Assessment Level) AL 3](cert-tisax.md)

## Regional — Korea

* [K-FSI (Korean Financial Security Institute) with RSEFT](cert-kfsi-rseft.md)

## Regional — United Kingdom

* [CE+ (Cyber Essentials Plus)](cert-cyber-essentials-plus.md)

---
title: Remediation of data quality issues
source: https://docs.snowflake.com/en/user-guide/data-quality-fixing.md
section: User Guide
---

# Remediation of data quality issues

Data quality metrics (DMFs) let you identify how many records in a table might contain quality issues. For example, the
SNOWFLAKE.CORE.NULL_COUNT DMF can identify how many records contain a NULL value in a specific column.

To help you fix these possible quality issues, you can call the [SYSTEM$DATA_METRIC_SCAN](../sql-reference/functions/system_data_metric_scan.md) system function to
return the individual records identified by the DMF as containing data that failed a data quality check. For example, if you pass the
NULL_COUNT DMF into the SYSTEM$DATA_METRIC_SCAN function as an argument, then you can obtain the actual records that contain a NULL value,
not just the number of records that contain a NULL value.

## Supported DMFs

The SYSTEM$DATA_METRIC_SCAN function accepts a DMF as an argument to return the records identified by the DMF as containing problematic
data. The following system DMFs can be used as arguments:

> * [ACCEPTED_VALUES](../sql-reference/functions/dmf_accepted_values.md)
> * [BLANK_COUNT](../sql-reference/functions/dmf_blank_count.md)
> * [BLANK_PERCENT](../sql-reference/functions/dmf_blank_percent.md)
> * [DUPLICATE_COUNT](../sql-reference/functions/dmf_duplicate_count.md)
> * [NULL_COUNT](../sql-reference/functions/dmf_null_count.md)
> * [NULL_PERCENT](../sql-reference/functions/dmf_null_percent.md)

## Limitations and considerations

* You cannot use custom DMFs as an argument of the SYSTEM$DATA_METRIC_SCAN function.
* If a table is protected by a policy, such as a masking policy or row access policy, the SYSTEM$DATA_METRIC_SCAN function might return
  unexpected or incomplete data because results depend on the user’s role when executing the function.

## Calling the SYSTEM$DATA_METRIC_SCAN function

When you call the SYSTEM$DATA_METRIC_SCAN function, it analyses a table with a DMF to identify possible data quality issues. You must pass
in the following arguments to the SYSTEM$DATA_METRIC_SCAN function: the name of the table, the DMF, and any arguments being passed to the
DMF to help identify problematic records.

For example, given that the SNOWFLAKE.CORE.NULL_COUNT system metric function returns the total number of NULL values in a particular column,
the following returns the rows of the `employeesTable` table that have NULL values in the `SSN` column.

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME  => 'governance.sch.employeesTable'
    ,METRIC_NAME  => 'snowflake.core.null_count'
    ,ARGUMENT_NAME => 'SSN'
    ,AT_TIMESTAMP => '2024-08-28 02:00:00 -0700'
  ));
```

To check the results of a DMF evaluation on the table or view in the past, you can pass the AT_TIMESTAMP argument. The AT_TIMESTAMP
argument allows you to use [Time Travel](../sql-reference/functions/to_timestamp.md) to cast the timestamp string to return only those
records that existed in the table at the ‘2024-08-28 02:00:00 -0700’ timestamp.

### ACCEPTED_VALUES DMF

You must specify an additional argument when calling the SYSTEM$DATA_METRIC_SCAN function to return records identified by the
[ACCEPTED_VALUES](../sql-reference/functions/dmf_accepted_values.md) DMF as containing data quality issues. With this argument,
ARGUMENT_EXPRESSION, you can specify a Boolean expression that determines which rows
to return. If a value in the column does *not* match the expression, the row is returned.

The following command returns the rows where the value of the `age` column is equal to or less than five (that is, the rows that *don’t*
match the condition specified by ARGUMENT_EXPRESSION).

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME  => 'governance.sch.employeesTable',
    METRIC_NAME  => 'snowflake.core.accepted_values',
    ARGUMENT_NAME => 'age',
    ARGUMENT_EXPRESSION => 'age > 5'
  ));
```

## Using SYSTEM$DATA_METRIC_SCAN to fix data

The SYSTEM$DATA_METRIC_SCAN function is a [table function](../sql-reference/functions-table.md) that returns a set of rows. The output of
the function can be used within a DML statement to take action on the records that have been identified as containing data that failed a
data quality check.

Suppose you want to replace blank values in the `email` column of the `t` table with NULL values. Because the BLANK_COUNT data metric
function identifies blank values, you could run the following statement:

```sqlexample
UPDATE T
  SET email = null
  WHERE T.ID IN (SELECT ID FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME => 't'
    ,METRIC_NAME => 'snowflake.core.blank_count'
    ,ARGUMENT_NAME => 'email'
  )));
```

---
title: Renaming an account
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-rename.md
section: User Guide
---

# Renaming an account

An organization administrator can rename an account.

When an account is renamed, Snowflake creates a new [account URL](organizations-connect.md) that is used to access the account.
During the renaming, the administrator can accept the default to save the original account URL so users can continue to use it, or they
can delete the original URL to force users to use the new URL. Saved URLs can be
[deleted at a later time](organizations-manage-accounts-urls.md).

> **Note:**
>
> Renaming an account has no effect on [replication and failover](account-replication-intro.md).

[As the organization administrator](organization-administrators.md), you can rename an account using [Snowsight](ui-snowsight-gs.md)
or SQL. You must use SQL to rename a reader account.

Snowsight:
:   1. In the navigation menu, select Admin » Accounts.
    2. Find the active account, and select … » Edit account name.
    3. In the Account Name box, enter the new account name.
    4. If you want to force users to access the account using the new account URL, clear the Save Current URL checkbox. Otherwise,
       accept the default to allow users to continue to use the original account URL.
    5. Select Save.

SQL:
:   Execute the [ALTER ACCOUNT … RENAME TO](../sql-reference/sql/alter-account.md) command.

    For example, the following command renames an account called `original_acctname` to `new_acctname`:

    ```sqlexample
    ALTER ACCOUNT original_acctname RENAME TO new_acctname;
    ```

    To force users to access the account with the new account URL, set the optional SAVE_OLD_URL parameter to FALSE when renaming the account.

> **Note:**
>
> Organization administrators who are using the ORGADMIN role cannot rename an account while they are logged in to it, so they must log in
> to a different account before executing the renaming command. If your organization consists of a single account that needs to be renamed,
> contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: Replicating databases across multiple accounts
source: https://docs.snowflake.com/en/user-guide/db-replication-config.md
section: User Guide
---

# Replicating databases across multiple accounts

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

This topic describes the steps necessary to replicate databases across multiple Snowflake accounts and keep the database objects and
stored data synchronized. Database replication can occur across Snowflake accounts in the same or different
[regions](intro-regions.md).

## Region support for database replication and failover/failback

All Snowflake regions across Amazon Web Services, Google Cloud Platform, and Microsoft Azure support Database Replication and Failover/Failback.

Note that accounts can replicate databases between [Region groups](admin-account-identifier.md) (for example, between Virtual Private Snowflake (VPS) and
multi-tenant regions) to facilitate data sharing and account migrations between these regions. This ability is disabled by default. You can
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable access.

## Web interface for database replication and failover/failback

> **Attention:**
>
> Managing and monitoring replication and failover/failback in Snowsight are only available to accounts
> using private connectivity.
>
> For all other accounts, see [Use Snowsight to monitor replication](account-replication-monitor.md) and [Replicating account objects and databases](account-replication-config.md).

Account administrators (users with the ACCOUNTADMIN role) can manage replication and failover/failback actions in Snowsight.

### Snowsight

Navigation:
:   Catalog » Database Explorer

#### Manage primary databases

> **Attention:**
>
> Only available to accounts using private connectivity. For all other accounts, see
> [Use Snowsight to monitor replication](account-replication-monitor.md) and [Replicating account objects and databases](account-replication-config.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md) to a Snowflake account that contains a primary database.
2. To switch to the account administrator role, in the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
3. In the navigation menu, select Catalog » Database Explorer. Select a primary database in the database object explorer.
   The database details page opens.

   Alternatively, to view only databases that have been enabled for replication, use the Replication Status » Primary
   filter to list primary databases in the account. Select a database from the list to open the details page.

   > **Note:**
   >
   > The Replication Status filter is only available if an account is a source or target account for database replication.
4. Select  » Enable Replication. The Enable replication dialog opens.

   Choose the action that you want to perform:

   * Enable failover. This feature requires [Business Critical Edition](intro-editions.md) (or higher).
   * Create a secondary database in one or more target accounts.

     If a primary database in another account is enabled for replication to the current account, you can create a
     secondary database in the current account. To add additional target accounts, use the
     [ALTER DATABASE](../sql-reference/sql/alter-database.md) command in the source account to update the primary database.
   * Refresh each secondary database once, after it is created.
5. For each target account for this database, check the options to create a secondary database and refresh the database.
6. Sign in to the target account as a user who was previously granted the ACCOUNTADMIN role in that account.

   Snowflake performs the requested actions and displays a success dialog.

   Manage replication for this database from the Replication tab in the database details.

#### Manage secondary databases

> **Attention:**
>
> Only available to accounts using private connectivity. For all other accounts, see
> [Use Snowsight to monitor replication](account-replication-monitor.md) and [Replicating account objects and databases](account-replication-config.md).

1. Sign in to [Snowsight](ui-snowsight-gs.md) to a Snowflake account that contains a secondary database.
2. Select the dropdown menu in the upper left (next to your login name) » Switch Role » `ACCOUNTADMIN`.
3. In the navigation menu, select Catalog » Database Explorer.

   The following actions are available from the actions (…) button in the upper-right corner of the page:

   * Create a secondary database.

     > **Note:**
     >
     > This option is only available if an account is a source or target account for database replication.
     >
     > If a primary database in another account is enabled for replication to the current account, you can create a
     > secondary database in the current account. To add additional target accounts, use the
     > [ALTER DATABASE](../sql-reference/sql/alter-database.md) command in the source account to update the primary database.
4. Select a secondary database in the database object explorer. The database details page opens.
5. Select the Replication tab.

   The following actions are available from the actions (…) button in the upper-right corner of the page:

   * Promote the secondary database to serve as the primary database. This feature requires Business Critical Edition (or higher).

     > **Note:**
     >
     > In order to promote a secondary database to serve as the primary, the primary database must have failover enabled
     > to the target account where the secondary database is located.
     >
     > If this option is not available, you can use the ALTER DATABASE command in the source account to enable failover
     > for the primary database to the target account. For more information, see Step 3: Enabling failover for a primary database.
   * Refresh the secondary database.
   * Copy a template to create a task that refreshes the secondary database on a schedule. Paste the template into a Snowsight worksheet
     and edit it to specify the desired schedule.

## Replicating a database to another account

The instructions in this section explain how to prepare your accounts for replication, promote a local database to serve as a primary database, perform the initial replication of this primary database to another account, and schedule refreshing of secondary databases.

> **Important:**
>
> Target accounts do not have Tri-Secret Secure or private connectivity to the Snowflake service, such as
> [AWS PrivateLink](admin-security-privatelink.md), enabled by default. If you require Tri-Secret Secure or private
> connectivity to the Snowflake service for compliance, security or other purposes, it is your responsibility to configure and enable
> those features in the target account.

### Prerequisite: Enable replication for accounts in the organization

The [organization administrator](organization-administrators.md) must enable replication for the source and target accounts before replicating
a database. For detailed instructions, see [Prerequisite: Enable replication for accounts in the organization](account-replication-config.md).

### Enable database replication and failover, and refresh secondary databases

> **Note:**
>
> Except where noted, only account administrators (users with the ACCOUNTADMIN role) can execute the SQL statements in this section.

#### Step 1: Viewing all accounts in your organization

Retrieve the list of accounts in your organization in which replication has been enabled. Any existing permanent or transient database in these accounts can be modified to serve as a primary database. Replicas of a primary database (i.e. secondary databases) can only be created in these accounts.

To view the list of accounts in your organization, query [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md).

```sqlexample
SHOW REPLICATION ACCOUNTS;

+------------------+---------------------------------+---------------+------------------+---------+-------------------+
| snowflake_region | created_on                      | account_name  | account_locator  | comment | organization_name |
|------------------+---------------------------------+---------------+------------------+---------+-------------------|
| AWS_US_WEST_2    | 2018-11-19 16:11:12.720 -0700   | ACCOUNT1      | MYACCOUNT1       |         | MYORG             |
| AWS_US_EAST_1    | 2019-06-02 14:12:23.192 -0700   | ACCOUNT2      | MYACCOUNT2       |         | MYORG             |
+------------------+---------------------------------+---------------+------------------+---------+-------------------+
```

See the complete list of [Region IDs](admin-account-identifier.md).

#### Step 2: Promoting a local database to serve as a primary database

Modify an existing permanent or transient database to serve as a primary database using an [ALTER DATABASE … ENABLE REPLICATION TO ACCOUNTS](../sql-reference/sql/alter-database.md) statement. Provide a comma-separated list of accounts in your organization that can store a replica of this database (i.e. a secondary database), allowing users in those accounts to query objects in the secondary database.

##### Example

Promote local database `mydb1` (in account `account1`) to serve as a primary database and specify that accounts `account2` and
`account3` can each store a replica of this database:

```sqlexample
ALTER DATABASE mydb1 ENABLE REPLICATION TO ACCOUNTS myorg.account2, myorg.account3;
```

#### Step 3: Enabling failover for a primary database

> **Note:**
>
> Failover/Failback requires Business Critical (or higher). To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Enable failover for a primary database to one or more accounts in your organization using an [ALTER DATABASE … ENABLE FAILOVER TO ACCOUNTS](../sql-reference/sql/alter-database.md) statement. The replica of this primary database in any one of these accounts (i.e. a secondary database) can be promoted to serve as the primary database.

Note that enabling failover for a primary database can be done either before or after a replica of the primary database has been created in a specified account.

##### Example

Enable failover for primary database `mydb1` to accounts `account2` and `account3`.

```sqlexample
-- Executed from primary account
ALTER DATABASE mydb1 ENABLE FAILOVER TO ACCOUNTS myorg.account2, myorg.account3;
```

#### Step 4: Creating a secondary database

Create a replica of an existing primary database in the same account that stores the primary database, or a different account (in the same or a different region). Note that you can only create a secondary database in an account specified in the [ALTER DATABASE … ENABLE REPLICATION TO ACCOUNTS](../sql-reference/sql/alter-database.md) statement in Step 2: Promoting a Local Database to Serve as a Primary Database.

> **Note:**
>
> Replication commands (e.g. promoting a database to a primary database in a source account) typically trigger operations across regions
> and can take a few seconds to take effect. For example, if you are programmatically promoting a database to serve as the primary database
> in a source account and creating a secondary database in a target account, it may be a few seconds before you can create the
> secondary database.

Execute a [CREATE DATABASE … AS REPLICA OF](../sql-reference/sql/create-database.md) statement in each target account to create a replica of the specified primary database.

> **Important:**
>
> As a best practice, we recommend giving each secondary database the same name as its primary database. This practice supports referencing fully-qualified objects (i.e. `'<db>.<schema>.<object>'`) by other objects in the same database, such as querying a fully-qualified table name in a view.
>
> If a secondary database has a different name from the primary database, then these object references would break in the secondary database.

To view the list of primary and secondary databases in your organization, query
[SHOW REPLICATION DATABASES](../sql-reference/sql/show-replication-databases.md). After a secondary database
is created, an account administrator can transfer ownership of the database to
another role (using [GRANT OWNERSHIP](../sql-reference/sql/grant-ownership.md).)

##### Example

The following example creates a replica of the `myorg.account1.mydb1` primary database in the `myorg.account2` account:

```sqlexample
-- Log into the ACCOUNT2 account.

-- Query the set of primary and secondary databases in your organization.
-- In this example, the MYORG.ACCOUNT1 primary database is available to replicate.
SHOW REPLICATION DATABASES;

+------------------+-------------------------------+-----------------+----------+---------+------------+----------------------------+---------------------------------+------------------------------+-------------------+-----------------+
| snowflake_region | created_on                    | account_name    | name     | comment | is_primary | primary                    | replication_allowed_to_accounts | failover_allowed_to_accounts | organization_name | account_locator |
|------------------+-------------------------------+-----------------+----------+---------+------------+----------------------------+---------------------------------+------------------------------+-------------------+-----------------|
| AWS_US_WEST_2    | 2019-11-15 00:51:45.473 -0700 | ACCOUNT1        | MYDB1    | NULL    | true       | MYORG.ACCOUNT1.MYDB1       | MYORG.ACCOUNT2, MYORG,ACCOUNT1  | MYORG.ACCOUNT1               | MYORG             | MYACCOUNT1      |
+------------------+-------------------------------+-----------------+----------+---------+------------+----------------------------+---------------------------------+------------------------------+-------------------+-----------------+

-- Create a replica of the 'mydb1' primary database
-- If the primary database has the DATA_RETENTION_TIME_IN_DAYS parameter set to a value other than the default value,
-- set the same value for the parameter on the secondary database.
CREATE DATABASE mydb1
  AS REPLICA OF myorg.account1.mydb1
  DATA_RETENTION_TIME_IN_DAYS = 10;

-- Verify the secondary database
SHOW REPLICATION DATABASES;

+------------------+-------------------------------+---------------+----------+---------+------------+-------------------------+---------------------------------+------------------------------+-------------------+-----------------+
| snowflake_region | created_on                    | account_name  | name     | comment | is_primary | primary                 | replication_allowed_to_accounts | failover_allowed_to_accounts | organization_name | account_locator |
|------------------+-------------------------------+---------------+----------+---------+------------+------------------------------------------+----------------+------------------------------+-------------------------------------|
| AWS_US_WEST_2    | 2019-11-15 00:51:45.473 -0700 | ACCOUNT1      | MYDB1    | NULL    | true       | MYORG.ACCOUNT1.MYDB1    | MYORG.ACCOUNT2, MYORG.ACCOUNT1  | MYORG.ACCOUNT1               | MYORG             | MYACCOUNT1      |
| AWS_US_EAST_1    | 2019-08-15 15:51:49.094 -0700 | ACCOUNT2      | MYDB1    | NULL    | false      | MYORG.ACCOUNT1.MYDB1    |                                 |                              | MYORG             | MYACCOUNT2      |
+------------------+-------------------------------+---------------+----------+---------+------------+-------------------------+---------------------------------+------------------------------+-------------------+-----------------+
```

#### Step 5. Refreshing each secondary database

The instructions in this section explain how to refresh a secondary database
from a snapshot of its primary database (using ALTER DATABASE … REFRESH). A
snapshot includes changes to the objects and data. For the initial replication
of a very large primary database, we recommend increasing the statement
timeout.

> **Note:**
>
> * To refresh a secondary database, the role used to perform the operation must have the OWNERSHIP privilege on the database or the role
>   must be a granted a role that has the OWNERSHIP privilege on the database.
> * The role that executes the refresh operation owns any new objects added as a result of a database refresh.

To verify the current region after you log into an account, query the [CURRENT_REGION](../sql-reference/functions/current_region.md) function.

```sqlexample
ALTER DATABASE mydb1 REFRESH;
```

#### Step 6. Refreshing a secondary database on a schedule

As a best practice, we recommend scheduling your secondary database refreshes. This section provides instructions for starting a database refresh automatically on a specified schedule.

The frequency with which you refresh a secondary database depends on the Recovery Point Objective (RPO) for the data in the secondary database. For example, if applications that rely on the data can tolerate up to 1 hour of data loss, then you must refresh the data at least every hour. If the data loss tolerance is 5 minutes, then refresh the secondary database at least every 5 minutes.

> **Note:**
>
> * We recommend that you execute the initial replication of a primary database manually (using [ALTER DATABASE](../sql-reference/sql/alter-database.md) … REFRESH), and only schedule subsequent refreshes.
> * There is a 60 minute default limit on a single run of a task. This limitation was implemented as a safeguard against non-terminating tasks. In rare circumstances, a refresh of a very large database could exceed the default task run limit. To determine if this occurred, query the [TASK_HISTORY](../sql-reference/functions/task_history.md) table function. Consider increasing the timeout limit for the task by executing [ALTER TASK](../sql-reference/sql/alter-task.md) … SET USER_TASK_TIMEOUT_MS = *<num>*.

Complete the steps in this section to start a database refresh automatically on a specified schedule.

Prerequisites:
:   The following Snowflake objects are required in the account that stores the secondary database:

    * The secondary database.
    * A separate database to store the new objects created in this section. Because secondary databases are read-only, this database must be separate from the secondary database. This database must also include the following objects:

      + Schema. Use the PUBLIC schema, or create a new schema using [CREATE SCHEMA](../sql-reference/sql/create-schema.md).
      + Warehouse. Any warehouse can be provided here to meet the syntax requirement but is not used for the database refresh. Create
        a new warehouse using [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md).
      + Task that refreshes the secondary database on a schedule.

Required privileges:
:   The steps in this section require a role with the following privileges in the account in which the secondary database is refreshed:

    | Object Type | Object | Privilege | Notes |
    | --- | --- | --- | --- |
    | Account | Account that stores the secondary database | EXECUTE TASK | Required to run the new task. |
    | Database | Secondary database | OWNERSHIP | Required to refresh the secondary database. |
    | Database | Database that stores the new task | USAGE |  |
    | Schema | Schema that stores the new task | USAGE, CREATE TASK |  |
    | Task |  | OWNERSHIP | The role that creates the task owns the object by default. Ownership can be transferred to a different role using GRANT `privileges` … TO ROLE. |
    | Warehouse | Warehouse used to configure the task | USAGE | Specifying a warehouse is required to configure the task, however the warehouse is not used to run the task or for the refresh operation. |

Steps:
:   Complete the following steps for each secondary database you want to refresh on a schedule:

    1. Create a task that starts the database refresh on a schedule (using [CREATE TASK](../sql-reference/sql/create-task.md)). Note that although the
       CREATE TASK syntax for specifying a replication schedule requires a warehouse, the warehouse is not used for replication.

       For example, create a task named `refresh_mydb1_task` that refreshes a secondary database named `mydb1` every 10 minutes with a 4 hour timeout. The task is configured using the existing warehouse `mywh`:

       ```sqlexample
       CREATE TASK refresh_mydb1_task
         WAREHOUSE = mywh
         SCHEDULE = '10 minute'
         USER_TASK_TIMEOUT_MS = 14400000
       AS
         ALTER DATABASE mydb1 REFRESH;
       ```
    2. A task is suspended by default when it is created. Resume the task to allow it to run based on the parameters specified in the task definition:

    > ```sqlexample
    > ALTER TASK refresh_mydb1_task RESUME;
    > ```

#### Example

Execute the following SQL statements in your preferred Snowflake client to
enable replication and failover, do an initial database refresh and set up
scheduled refreshes.

##### Execute from source account

```sqlexample
-- The commands below are executed from the source account

-- View replication enabled accounts
SHOW REPLICATION ACCOUNTS;

ALTER DATABASE mydb ENABLE REPLICATION TO ACCOUNTS myorg.account2, myorg.account3;
ALTER DATABASE mydb ENABLE FAILOVER TO ACCOUNTS myorg.account2, myorg.account3;
```

##### Execute from each target account

```sqlexample
-- The commands below are executed from each target account

-- View replication enabled databases
-- Note the primary column of the source database for the CREATE DATABASE statement below
SHOW REPLICATION DATABASES;

-- If the primary database has the DATA_RETENTION_TIME_IN_DAYS parameter set to a value other than the default value,
-- set the same value for the parameter on the secondary database.
CREATE DATABASE mydb
  AS REPLICA OF myorg.account1.mydb
  DATA_RETENTION_TIME_IN_DAYS = 10;

-- Increase statement timeout for initial refresh
-- Optional but recommended for initial refresh of a large database
ALTER SESSION SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
-- If you have an active warehouse in current session, update warehouse statement timeout
SELECT CURRENT_WAREHOUSE();
ALTER WAREHOUSE my_wh SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
-- Reset warehouse statement timeout after initial refresh
ALTER WAREHOUSE my_wh UNSET STATEMENT_TIMEOUT_IN_SECONDS;

-- Refresh a secondary database
ALTER DATABASE mydb REFRESH;

-- Create task
-- Set up refresh schedule for each secondary database using a separate database
USE DATABASE my_db2;

-- Create a task and RESUME the task for each secondary database
-- Edit the task schedule and timeout for your specific use case
CREATE TASK my_refresh_task
  WAREHOUSE = my_wh
  SCHEDULE = '10 minute'
  USER_TASK_TIMEOUT_MS = 14400000
AS
  ALTER DATABASE mydb REFRESH;

-- Start task
ALTER TASK my_refresh_task RESUME;
```

#### Using the legacy account locator

Though the legacy `snowflake_region.account_locator` format is currently supported when identifying an account in
replication and failover commands, its use is discouraged as it may stop working in the future.

## Increasing the statement timeout for the initial replication

Database replication uses Snowflake-provided compute resources instead of your own virtual warehouse to copy objects and data. However, the [STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) session/object parameter still controls how long a statement runs before it is canceled. The default value is `172800` (2 days). Because the initial replication of a very large primary database can take longer than 2 days to complete (depending on the amount of metadata in the database as well as the amount of data in database objects), we recommend increasing the STATEMENT_TIMEOUT_IN_SECONDS value to `604800` (7 days, the maximum value) for the session in which you run the replication operation.

Run the following [ALTER SESSION](../sql-reference/sql/alter-session.md) statement prior to executing the `ALTER DATABASE secondary_db_name REFRESH` statement in the same session:

```sqlexample
ALTER SESSION SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
```

Note that the STATEMENT_TIMEOUT_IN_SECONDS parameter also applies to the active warehouse in a session. The parameter honors the *lower* value set at the session or warehouse level. If you have an active warehouse in the current session, set STATEMENT_TIMEOUT_IN_SECONDS to `604800` for this warehouse (using [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md)), too.

For example:

```sqlexample
-- determine the active warehouse in the current session (if any)
SELECT CURRENT_WAREHOUSE();

+---------------------+
| CURRENT_WAREHOUSE() |
|---------------------|
| MY_WH               |
+---------------------+

-- change the STATEMENT_TIMEOUT_IN_SECONDS value for the active warehouse

ALTER WAREHOUSE my_wh SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
```

You can reset the parameter value to the default after the replication operation is completed:

```sqlexample
ALTER WAREHOUSE my_wh UNSET STATEMENT_TIMEOUT_IN_SECONDS;
```

## Monitoring the progress of a database refresh

To determine the current status of the initial database replication or a subsequent secondary database refresh, query the [DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB](../sql-reference/functions/database_refresh_progress.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

A database refresh operation can require several hours or longer to complete depending on the amount of data to replicate.

To view the replication history for a specified database within a specified date range, query either of the following:

* [DATABASE_REPLICATION_USAGE_HISTORY](../sql-reference/functions/database_replication_usage_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)). This
  function returns replication usage activity within the last 14 days.
* [DATABASE_REPLICATION_USAGE_HISTORY view](../sql-reference/account-usage/database_replication_usage_history.md) (in [Account Usage](../sql-reference/account-usage.md)). This view returns
  replication usage activity within the last 365 days (1 year).

### Example

Monitor the progress of the `mydb1` secondary database refresh:

```sqlexample
select *
  from table(information_schema.database_refresh_progress(mydb1));
```

## Viewing the database refresh history

To view the history of secondary database refresh operations, query the [DATABASE_REFRESH_HISTORY](../sql-reference/functions/database_refresh_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)). This function returns database refresh activity within the last 14 days.

or

Query the [DATABASE_REPLICATION_USAGE_HISTORY view](../sql-reference/account-usage/database_replication_usage_history.md) (in the [Account Usage](../sql-reference/account-usage.md) schema in the shared Snowflake database). This view returns database replication usage activity within the last 365 days (1 year).

### Example

View the history of the `mydb1` secondary database refresh operation:

```sqlexample
select *
  from table(information_schema.database_refresh_history(mydb1));
```

## Monitoring database replication cost

For individual databases replicated using database replication, users with the
ACCOUNTADMIN role can use [Snowsight](ui-snowsight-gs.md) or SQL to view the amount of replication data transferred
(in bytes) for your Snowflake account within a specified date range.

To view the data transfer amounts for your account:

> 1. In the navigation menu, select Admin » Cost management.
>
> SQL:
> :   Query either of the following:
>
>     * [DATABASE_REPLICATION_USAGE_HISTORY](../sql-reference/functions/database_replication_usage_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
>       This function returns database replication usage activity within the last 14 days.
>     * [DATABASE_REPLICATION_USAGE_HISTORY view](../sql-reference/account-usage/database_replication_usage_history.md) view (in [Account Usage](../sql-reference/account-usage.md)). This
>       view returns database replication usage activity within the last 365 days (1 year).
>
>       The following queries can be executed against the DATABASE_REPLICATION_USAGE_HISTORY view:
>
>       **Query: Replication cost history (by day, by object)**
>
>       This query provides a full list of replicated databases and the volume of credits consumed via the replication service over the last 30
>       days, broken out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional
>       investigation.
>
>       ```sqlexample
>       SELECT TO_DATE(start_time) AS date,
>         database_name,
>         SUM(credits_used) AS credits_used
>       FROM snowflake.account_usage.database_replication_usage_history
>       WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
>       GROUP BY 1,2
>       ORDER BY 3 DESC;
>       ```
>
>       **Query: Replication History & m-day average**
>
>       This query shows the average daily credits consumed by Replication grouped by week over the last year. This helps identify any
>       anomalies in the daily average so you can investigate any spikes or changes in consumption.
>
>       ```sqlexample
>       WITH credits_by_day AS (
>         SELECT TO_DATE(start_time) AS date,
>           SUM(credits_used) AS credits_used
>         FROM snowflake.account_usage.database_replication_usage_history
>         WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
>         GROUP BY 1
>         ORDER BY 2 DESC
>       )
>
>       SELECT DATE_TRUNC('week',date),
>         AVG(credits_used) AS avg_daily_credits
>       FROM credits_by_day
>       GROUP BY 1
>       ORDER BY 1;
>       ```

## Comparing data sets in primary and secondary databases

Optionally use the [HASH_AGG](../sql-reference/functions/hash_agg.md) function to compare the rows in a random set of tables in a primary and secondary database to verify data consistency. The HASH_AGG function returns an aggregate signed 64-bit hash value over the (unordered) set of input rows. Query this function on all or a random subset of tables in a secondary database and on the primary database (as of the timestamp for the primary database snapshot) and compare the output.

### Example

#### Executed on the secondary database

1. On the secondary database, query the [DATABASE_REFRESH_PROGRESS](../sql-reference/functions/database_refresh_progress.md) table function
   (in the [Snowflake Information Schema](../sql-reference/info-schema.md)). Note the `snapshot_transaction_timestamp` in the `DETAILS` column for the
   `PRIMARY_UPLOADING_DATA` phase. This is the timestamp for the latest snapshot of the primary database.

   ```sqlexample
   select parse_json(details)['snapshot_transaction_timestamp']
   from table(information_schema.database_refresh_progress(mydb))
   where phase_name = 'PRIMARY_UPLOADING_DATA';
   ```
2. Query the HASH_AGG function for a specified table. The following query returns a hash value for all rows in the `mytable` table:

   ```sqlexample
   SELECT HASH_AGG( * ) FROM mytable;
   ```

#### Executed on the primary database

3. On the primary database, query the HASH_AGG function for the same table. Using Time Travel, specify the timestamp when the latest snapshot was taken for the secondary database:

   ```sqlexample
   SELECT HASH_AGG( * ) FROM mytable AT(TIMESTAMP => '<snapshot_transaction_timestamp>'::TIMESTAMP);
   ```
4. Compare the results from the two queries. The output should be identical.

## Dropping a secondary database

You can drop a secondary database at any time using the [DROP DATABASE](../sql-reference/sql/drop-database.md) command. Only the database owner (i.e. the role with the OWNERSHIP privilege on the database) can drop the database.

## Dropping a primary database

A primary database cannot be dropped if one or more replicas of the database (i.e. secondary databases) exist. To drop the primary database, first promote a secondary database to serve as the primary database, and then drop the former primary database. Alternatively, drop all of the secondary databases for the primary database, and then drop the primary database.

Note that only the database owner can drop the database.

---
title: Replicating databases and account objects across multiple accounts
source: https://docs.snowflake.com/en/user-guide/account-replication-config.md
section: User Guide
---

# Replicating databases and account objects across multiple accounts

This topic describes the steps necessary to replicate account objects and data across Snowflake accounts in the same organization,
and keep the objects and data synchronized. Account replication can occur across Snowflake accounts in different
[regions](intro-regions.md) and across [cloud platforms](intro-cloud-platforms.md).

> **Note:**
>
> When you upgrade an account to Business Critical Edition (or higher), it might take up to 12 hours for failover capabilities
> to become available.

## Region support for replication and failover/failback

Customers can replicate across all regions within a Region Group. To replicate between regions in different [Region groups](admin-account-identifier.md)
(for example, from a Snowflake commercial region to a Snowflake government region), please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable
access.

## Transitioning from database replication to group-based replication

Databases that have been enabled for replication using [ALTER DATABASE](../sql-reference/sql/alter-database.md) must have replication
disabled before they can be added to a replication or failover group.

> **Note:**
>
> Execute the SQL statements in this section using the ACCOUNTADMIN role.

### Step 1. Disable replication for a replication enabled database

Execute the [SYSTEM$DISABLE_DATABASE_REPLICATION](../sql-reference/functions/system_disable_database_replication.md) function to disable replication for a primary database,
along with any secondary databases linked to it, in order to add it to a replication or failover group.

Execute the following SQL statement from the source account with the primary database:

```sqlexample
SELECT SYSTEM$DISABLE_DATABASE_REPLICATION('mydb');
```

### Step 2. Add the database to a primary failover group and create a secondary failover group

Once you have successfully disabled replication for a database, you can add the primary database to a failover group in the source account.

Then create a secondary failover group in the target account. When the secondary failover
group is refreshed in the target account, the previously secondary database will automatically be added as a member of the secondary
failover group and refreshed with the changes from the primary database.

For more details on creating primary and secondary failover groups, see Workflow.

> **Note:**
>
> When you add a previously replicated database to a replication or failover group, Snowflake does not re-replicate the data that
> has already been replicated for that database. Only changes since the last refresh are replicated when the group is refreshed.

## Workflow

The following SQL statements demonstrate the workflow for enabling account and database object replication and refreshing objects. Each step
is discussed in detail below.

> **Note:**
>
> The following examples require replication be enabled for the source and target accounts. For details, see
> Prerequisite: Enable replication for accounts in the organization.

### Examples

Execute the following SQL statements in your preferred Snowflake client to enable account and database object replication and failover,
and refresh objects.

#### Executed on source account

1. Create a role and grant it the CREATE FAILOVER GROUP privilege. This step is *optional*:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE myrole;

   GRANT CREATE FAILOVER GROUP ON ACCOUNT
     TO ROLE myrole;
   ```
2. Create a failover group in the source account and enable replication to specific target accounts.

   > **Note:**
   > * If you have databases to add to a replication or failover group that have been previously enabled for database replication and failover
   >   using [ALTER DATABASE](../sql-reference/sql/alter-database.md), follow the Transitioning from database replication to group-based replication instructions (in this
   >   topic) before adding them to a group.
   > * To add a database to a failover group, the active role must have the MONITOR privilege on the database. For details
   >   on database privileges, see [Database privileges](security-access-control-privileges.md) (in a separate topic).

   ```sqlexample
   USE ROLE myrole;

   CREATE FAILOVER GROUP myfg
     OBJECT_TYPES = USERS, ROLES, WAREHOUSES, RESOURCE MONITORS, DATABASES
     ALLOWED_DATABASES = db1, db2
     ALLOWED_ACCOUNTS = myorg.myaccount2, myorg.myaccount3
     REPLICATION_SCHEDULE = '10 MINUTE';
   ```

#### Executed on target account

3. Create a role in the target account and grant it the CREATE FAILOVER GROUP privilege. This step is *optional*:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE myrole;

   GRANT CREATE FAILOVER GROUP ON ACCOUNT
     TO ROLE myrole;
   ```
4. Create a failover group in the target account as a replica of the failover group in the source account.

   > **Note:**
   >
   > If account objects (for example, users or roles) exist in the target account that do not exist in the source account, refer to
   > Initial replication of users and roles before creating a secondary group.

   ```sqlexample
   USE ROLE myrole;

   CREATE FAILOVER GROUP myfg
     AS REPLICA OF myorg.myaccount1.myfg;
   ```
5. Manually refresh the secondary failover group. This is an *optional* step. If the primary failover group is created with
   a replication schedule, the initial refresh of the secondary failover group is automatically executed when the secondary
   failover group is created.

   1. Create a role with the REPLICATE privilege on the failover group. This step is *optional*.

      Execute in the target account using a role with the OWNERSHIP privilege on the failover group:

      ```sqlexample
      GRANT REPLICATE ON FAILOVER GROUP myfg TO ROLE my_replication_role;
      ```
   2. Execute the refresh statement using a role with the REPLICATE privilege:

      ```sqlexample
      USE ROLE my_replication_role;

      ALTER FAILOVER GROUP myfg REFRESH;
      ```
6. Create a role with the FAILOVER privilege on the failover group. This step is *optional*.

   Execute in the target account using a role with the OWNERSHIP privilege on the failover group:

   ```sqlexample
   GRANT FAILOVER ON FAILOVER GROUP myfg TO ROLE my_failover_role;;
   ```

## Replicating account objects and databases

The instructions in this section explain how to prepare your accounts for replication, enable the replication of specific objects from the
source account to the target account, and synchronize the objects in the target account.

> **Important:**
>
> Target accounts do not have Tri-Secret Secure or private connectivity to the Snowflake service, such as
> [AWS PrivateLink](admin-security-privatelink.md), enabled by default. If you require Tri-Secret Secure or private
> connectivity to the Snowflake service for compliance, security or other purposes, it is your responsibility to configure and enable
> those features in the target account.

### Prerequisite: Enable replication for accounts in the organization

The organization administrator must enable replication for the source and target accounts.

To enable replication for accounts, an [organization administrator](organization-administrators.md) uses the
[SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER](../sql-reference/functions/system_global_account_set_parameter.md) function to set the `ENABLE_ACCOUNT_DATABASE_REPLICATION`
parameter to `true`.

[As an organization administrator](organization-administrators.md), enable replication for each source and target account in your organization.

```sqlexample
-- View the list of the accounts in your organization
-- Note the organization name and account name for each account for which you are enabling replication
SHOW ACCOUNTS;

-- Enable replication by executing this statement for each source and target account in your organization
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER('<organization_name>.<account_name>', 'ENABLE_ACCOUNT_DATABASE_REPLICATION', 'true');
```

Though the SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER function supports the legacy [account locator](admin-account-identifier.md) identifier,
it causes unexpected results when an organization has multiple accounts that share the same locator (in different regions).

### Step 1: Create a role with the CREATE FAILOVER GROUP privilege in the source account — *Optional*

Create a role and grant it the CREATE FAILOVER GROUP privilege. This step is optional. If you have already created this role, skip to
Step 3: Create a primary failover group in a source account.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE myrole;

GRANT CREATE FAILOVER GROUP ON ACCOUNT
    TO ROLE myrole;
```

### Step 2: Identify accounts enabled for replication and group membership

Before creating a primary failover group, identify the accounts enabled for replication and the existing
failover and replication groups.

#### View all accounts enabled for replication

To retrieve the list of accounts in your organization that are enabled for replication, use
[SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md).

Execute the following SQL statement using the ACCOUNTADMIN role:

```sqlexample
SHOW REPLICATION ACCOUNTS;
```

Returns:

```output
+------------------+-------------------------------+--------------+-----------------+-----------------+-------------------+--------------+
| snowflake_region | created_on                    | account_name | account_locator | comment         | organization_name | is_org_admin |
+------------------+-------------------------------+--------------+-----------------+-----------------+-------------------+--------------+
| AWS_US_WEST_2    | 2020-07-15 21:59:25.455 -0800 | myaccount1   | myacctlocator1  |                 | myorg             | true         |
+------------------+-------------------------------+--------------+-----------------+-----------------+-------------------+--------------+
| AWS_US_EAST_1    | 2020-07-23 14:12:23.573 -0800 | myaccount2   | myacctlocator2  |                 | myorg             | false        |
+------------------+-------------------------------+--------------+-----------------+-----------------+-------------------+--------------+
| AWS_US_EAST_2    | 2020-07-25 19:25:04.412 -0800 | myaccount3   | myacctlocator3  |                 | myorg             | false        |
+------------------+-------------------------------+--------------+-----------------+-----------------+-------------------+--------------+
```

See the complete list of [Region IDs](admin-account-identifier.md).

#### View failover and replication group membership

Account, database, and share objects have [constraints on group membership](account-replication-considerations.md). Before creating new
groups or adding objects to existing groups, you can review the list of existing failover groups and the objects in each group.

> **Note:**
>
> Only an account administrator (user with the ACCOUNTADMIN role) or the group owner (role with the OWNERSHIP privilege on the group) can
> execute the SQL statements in this section.

View all failover groups linked to the current account, and the object types in each group:

```sqlexample
SHOW FAILOVER GROUPS;
```

View all the databases in failover group `myfg`:

```sqlexample
SHOW DATABASES IN FAILOVER GROUP myfg;
```

View all the shares in failover group `myfg`:

```sqlexample
SHOW SHARES IN FAILOVER GROUP myfg;
```

### Step 3: Create a primary failover group in a source account

Create a primary failover group and enable the replication and failover of specific objects from the current (source) account to one or more
target accounts in the same organization.

You can create a replication or failover group using Snowsight
or SQL.

* Create a replication or failover group using Snowsight
* Create a failover group using SQL

> **Note:**
>
> If you have databases to add to a replication or failover group that have been previously enabled for database replication
> using [ALTER DATABASE](../sql-reference/sql/alter-database.md), follow the Transitioning from database replication to group-based replication instructions (in this
> topic) before adding them to a group.

#### Create a replication or failover group using Snowsight

> **Note:**
>
> * Only account administrators can create a replication or failover group using Snowsight (refer to
>   Limitations of using Snowsight for replication configuration).
> * You must be signed in to the target account as a user with the ACCOUNTADMIN role. If you are not, you will be
>   prompted to sign in.
>
>   Both the source account and the target account must use the same connection type (public internet). Otherwise, signing
>   in to the target account fails.

Complete the following steps to create a new replication or failover group:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, then complete one of these actions on the Groups tab:

   * For Business Critical Edition (or higher) accounts, complete one of these actions:

     + If there are no replication groups or connections, select Get started to configure a replication group and
       a connection. The Setup business continuity wizard appears.
     + Select + Group to configure a replication group without configuring a connection. The
       Create a group wizard appears.
   * For Standard Edition and Enterprise Edition accounts, complete one of these actions:

     + If there are no replication groups or connections, select Get started to configure a replication group.
       The Setup replication wizard appears.
     + If one or more replication groups exist, select + Group to configure a replication group. The
       Create a group wizard appears.
4. On the Select a target account page, select a target account and sign into it, then select Next.
5. On the Create a group page, in the Group name box, enter a name for the group that meets the
   following requirements:

   * Must start with an alphabetic character and cannot contain spaces or special characters unless the identifier string is
     enclosed in double quotes (for example, “My object”). Identifiers enclosed in double quotes are also case-sensitive.

     For more information, see [Identifier requirements](../sql-reference/identifiers-syntax.md).
   * Must be unique across failover and replication groups in an account.
6. Choose Edit objects to add share and account objects to your group.

   > **Note:**
   >
   > Account objects can only be added to one replication or failover group. If a replication or failover group with any account
   > objects already exists in your account, you cannot select those objects.
7. Choose Select databases to add database objects to your group.
8. Select the Replication frequency.
9. If the account is Business Critical Edition or higher, a failover group is created by default. You can choose to create a replication group
   instead. To create a replication group, select Advanced options, then unselect Enable failover.
10. Complete one of the following actions:

    * For Business Critical Edition (or higher) accounts, select Next.
    * For Standard Edition and Enterprise Edition accounts, select Start replication to create the replication group.
11. For Business Critical Edition (or higher) accounts, on the Create connection page, enter a connection name in
    the Connection name box, then select Start replication.

If creating the replication group is unsuccessful, refer to Troubleshoot issues with creating and editing replication groups using Snowsight for common errors
and how to resolve them.

#### Create a failover group using SQL

Create a failover group of specified account and database objects in the source account and enable replication and failover to a list of
target accounts. See [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) for syntax.

For example, enable replication of users, roles, warehouses, resources monitors, and databases `db1` and `db2` from the source account
to the `myaccount2` account in the same organization. Set the replication schedule to automatically refresh `myaccount2` every 10
minutes.

Execute the following statement on the source account:

```sqlexample
USE ROLE myrole;

CREATE FAILOVER GROUP myfg
    OBJECT_TYPES = USERS, ROLES, WAREHOUSES, RESOURCE MONITORS, DATABASES, INTEGRATIONS, NETWORK POLICIES
    ALLOWED_DATABASES = db1, db2
    ALLOWED_INTEGRATION_TYPES = API INTEGRATIONS
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

### Step 4: Create a role with the CREATE FAILOVER GROUP privilege in the target account — *Optional*

Create a role in the target account and grant it the CREATE FAILOVER GROUP privilege. This step is optional. If you have already created
this role, skip to Step 5: Create a secondary failover group in the target account.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE myrole;

GRANT CREATE FAILOVER GROUP ON ACCOUNT
    TO ROLE myrole;
```

### Step 5: Create a secondary failover group in the target account

> **Note:**
>
> If account objects (for example, users or roles) exist in the target account that do not exist in the source account, refer to
> Initial replication of users and roles before creating a secondary group.

Create a secondary failover group in the target account as a replica of the primary failover group in the source account.

Execute a [CREATE FAILOVER GROUP … AS REPLICA OF](../sql-reference/sql/create-failover-group.md) statement in each target account for which you
enabled replication in Step 3: Create a primary failover group in a source account (in this topic).

Executed from each target account:

```sqlexample
USE ROLE myrole;

CREATE FAILOVER GROUP myfg
  AS REPLICA OF myorg.myaccount1.myfg;
```

### Step 6. Refresh a secondary failover group in the target account manually — *Optional*

To manually refresh the objects in a target account, execute the [ALTER FAILOVER GROUP … REFRESH](../sql-reference/sql/alter-failover-group.md)
command.

As a best practice, we recommend scheduling your secondary refreshes by setting the REPLICATION_SCHEDULE parameter using
[CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) or [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md).

> **Note:**
>
> If the user who calls the function in the target account was dropped in the source account, the refresh operation fails.

#### Grant the REPLICATE privilege on failover group to role — *Optional*

To execute the command to refresh a secondary replication or failover group in the target account, you must use a role with the
REPLICATE privilege on the failover group. The REPLICATE privilege is currently not replicated and must be granted on a
failover (or replication) group in both the source and target accounts.

> Execute this statement from the source account using a role with the OWNERSHIP privilege on the group:
>
> ```sqlexample
> GRANT REPLICATE ON FAILOVER GROUP myfg TO ROLE my_replication_role;
> ```
>
> Execute this statement from the target account using a role with the OWNERSHIP privilege on the group:
>
> ```sqlexample
> GRANT REPLICATE ON FAILOVER GROUP myfg TO ROLE my_replication_role;
> ```

#### Manually refresh a secondary failover group

For example, to refresh the objects in the failover group `myfg`, execute the following statement from the target account:

> ```sqlexample
> USE ROLE my_replication_role;
>
> ALTER FAILOVER GROUP myfg REFRESH;
> ```

### Step 7. Grant the FAILOVER privilege on failover group to role — *Optional*

To execute the command to fail over a secondary failover group in a target account, you must use a role with the
[FAILOVER privilege](account-replication-considerations.md) on the failover group. The FAILOVER privilege is currently not
replicated and must be granted in each source and target account.

For more information, see [Replication of roles and grants](account-replication-intro.md).

For example, to grant the FAILOVER privilege to role `my_failover_role` on failover group `my_fg`, execute the
following statement in the *target account* using a role with the OWNERSHIP privilege on the group:

```sqlexample
GRANT FAILOVER ON FAILOVER GROUP myfg TO ROLE my_failover_role;
```

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

## Schema-level replication for failover groups

For databases in failover groups, you can optionally configure the REPLICABLE_WITH_FAILOVER_GROUPS parameter on the database
and/or individual schemas in the database to specify a subset of schemas for replication.

This feature enables you to control the schemas in a failover group that are replicated, which is useful if only a subset of data
in a database needs the added disaster recovery protection provided by failover.

Because this parameter is enabled by default for all databases and the schemas they contain, you adjust the replication
granularity by choosing which databases and/or schemas to omit from replication. You can further fine-tune the
replication settings by allowing certain schemas to be replicated even though the database that contains them isn’t replicated.

### Specify schemas to replicate or skip

You can explicitly specify schemas to replicate or skip in a database in a failover group using the optional REPLICABLE_WITH_FAILOVER_GROUPS
parameter.

#### REPLICABLE_WITH_FAILOVER_GROUPS parameter

The REPLICABLE_WITH_FAILOVER_GROUPS parameter specifies whether a schema that belongs to a database in a failover group is replicated.
This parameter can be set on a database and any/all schemas in the database. If the parameter is set for a database, all of the schemas
in the database inherit the value, unless a different value is explicitly set for any given schema.

The parameter accepts two values, `'YES'` or `'NO'` (case-insensitive), and is optional:

* If REPLICABLE_WITH_FAILOVER_GROUPS is not explicitly set on a database (or explicitly unset), the database follows the standard replication
  behavior, which is equivalent to setting the parameter to `'YES'`.
* If REPLICABLE_WITH_FAILOVER_GROUPS is not explicitly set on a schema (or explicitly unset), the replication behavior is inherited from its
  parent database.

```sqlsyntax
ALTER DATABASE <name> SET REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' }
ALTER DATABASE <name> UNSET REPLICABLE_WITH_FAILOVER_GROUPS

ALTER SCHEMA <name> SET REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' }
ALTER SCHEMA <name> UNSET REPLICABLE_WITH_FAILOVER_GROUPS
```

#### Security requirements

To set or unset this parameter on a database or a schema, the following privileges are required:

* REPLICATE ([account-level privilege](security-access-control-privileges.md)). Prior to the schema-level replication feature, this privilege was
  only an object-level privilege on replication groups and failover groups.
  Users with the ACCOUNTADMIN role can grant this privilege to other roles.
* USAGE ([database](security-access-control-privileges.md) and [schema](security-access-control-privileges.md) privilege)
  or any similar privileges that enable taking action on the database and schema.

#### Examples

Grant necessary privileges on a pre-existing role, `replicationadmin`:

```sqlexample
USE ROLE ACCOUNTADMIN;

GRANT REPLICATE ON ACCOUNT TO ROLE replicationadmin;
GRANT USAGE ON DATABASE db1 TO ROLE replicationadmin;
GRANT USAGE ON SCHEMA db1.sch1 TO ROLE replicationadmin;
```

Replicate only one schema, `sch1`, in the `db1` database:

```sqlexample
USE ROLE replicationadmin;

ALTER DATABASE db1 SET REPLICABLE_WITH_FAILOVER_GROUPS = 'NO';
ALTER SCHEMA sch1 SET REPLICABLE_WITH_FAILOVER_GROUPS = 'YES';
```

Replicate all schemas except one schema, `sch2`, in the `db2` database:

```sqlexample
USE ROLE replicationadmin;

ALTER DATABASE db2 SET REPLICABLE_WITH_FAILOVER_GROUPS = 'YES';
ALTER SCHEMA sch2 SET REPLICABLE_WITH_FAILOVER_GROUPS = 'NO';
```

### Refresh of schemas with REPLICABLE_WITH_FAILOVER_GROUPS set in target accounts

During a database refresh:

* Schemas with REPLICABLE_WITH_FAILOVER_GROUPS set to `'YES'` are replicated from the source account to the target account.
* Schemas with REPLICABLE_WITH_FAILOVER_GROUPS set to `'NO'` are not replicated, except in the following two scenarios:

  + The target schema is a replica of the source account schema. In this case, the target schema is always synchronized with its source schema.
  + The target schema has a name conflict with the source account schema. In this situation, the replication job fails due to the name
    conflict.

### List databases and schemas with REPLICABLE_WITH_FAILOVER_GROUPS set in your account

You can list the values set for the REPLICABLE_WITH_FAILOVER_GROUPS parameter in the current account
by querying the ACCOUNT_USAGE and INFORMATION_SCHEMA views.

> **Tip:**
>
> If you aren’t familiar with why you might use the ACCOUNT_USAGE or INFORMATION_SCHEMA views,
> see [Differences between Account Usage and Information Schema](../sql-reference/account-usage.md).

#### Examples

For these examples, we’ll use the INFORMATION_SCHEMA views. That way, you can see the settings immediately
after making any changes.

Using the pre-existing `replicationadmin` role, return all the parameter values for the account, which has two databases:

* `db1` database explicitly set to `NO` and `sch1` schema in the database explicitly set to `YES`. Only that
  one schema in the database is eligible for replication.
* `db2` database explicitly set to `YES` and `sch2` schema in the database explicitly set to `NO`. All schemas
  in the database are eligible for replication except for that one schema.

```sqlexample
USE ROLE replicationadmin;

SELECT database_name, replicable_with_failover_groups
  FROM db1.INFORMATION_SCHEMA.DATABASES;
```

```output
+---------------+---------------------------------+
| DATABASE_NAME | REPLICABLE_WITH_FAILOVER_GROUPS |
+---------------+---------------------------------+
| DB1           | NO                              |
| DB2           | YES                             |
| DB3           | UNSET                           |
+---------------+---------------------------------+
```

```sqlexample
SELECT schema_name, catalog_name, replicable_with_failover_groups
  FROM db1.INFORMATION_SCHEMA.SCHEMATA ORDER BY catalog_name;
```

```output
+--------------------+--------------+---------------------------------+
| SCHEMA_NAME        | CATALOG_NAME | REPLICABLE_WITH_FAILOVER_GROUPS |
+--------------------+--------------+---------------------------------+
| PUBLIC             | DB1          | NO                              |
| SCH1               | DB1          | YES                             |
| SCH2               | DB1          | NO                              |
| SCH3               | DB1          | NO                              |
| INFORMATION_SCHEMA | DB1          | UNSET                           |
+--------------------+--------------+---------------------------------+
```

```sqlexample
USE ROLE replicationadmin;

SELECT schema_name, catalog_name, replicable_with_failover_groups
  FROM db2.INFORMATION_SCHEMA.SCHEMATA
  ORDER BY catalog_name;
```

```output
+--------------------+--------------+---------------------------------+
| SCHEMA_NAME        | CATALOG_NAME | REPLICABLE_WITH_FAILOVER_GROUPS |
+--------------------+--------------+---------------------------------+
| PUBLIC             | DB2          | YES                             |
| SCH1               | DB2          | YES                             |
| SCH2               | DB2          | NO                              |
| SCH3               | DB2          | YES                             |
| INFORMATION_SCHEMA | DB2          | UNSET                           |
+--------------------+--------------+---------------------------------+
```

## Apply global IDs to objects created by scripts in target accounts

If you created account objects, for example, users and roles, in your target account by any means other than via replication (for example,
using scripts), these users and roles have no global identifier by default. The refresh operation uses global identifiers to synchronize
these objects to the same objects in the source account.

In most cases, when a target account is refreshed from the source account, the refresh operation drops any account objects of the
types in the `OBJECT_TYPES` list in the target account that have no global identifier. The initial replication of users and roles to
a target account, however, might cause the first refresh operation to fail. For details on this behavior, refer to
Initial replication of users and roles.

### Use SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME() to apply global IDs

You can prevent the loss of some object types by linking matching objects with the same name in the source and target accounts. The
SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME function adds a global identifier to account objects in the target account.

> **Note:**
>
> Global identifiers are only added to account objects that are included in a replication or failover group for the
> following object types:
>
> * `RESOURCE_MONITOR`
> * `ROLE`
> * `USER`
> * `WAREHOUSE`

Apply global identifiers to account objects in the target account of the types included in the `object_types` list for failover
group `myfg`:

Execute the following SQL statement using the ACCOUNTADMIN role:

```sqlexample
SELECT SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME('myfg');
```

### Initial replication of users and roles

The behavior of the initial refresh operation for USERS and ROLES object types can vary depending on whether or not there are matching
objects with the same name in the target account.

> **Note:**
>
> * The behavior described in this section applies only the first time these object types are replicated to the target account.
> * The scenarios below describe the replication of USERS. The same also applies to the replication of ROLES.

* If there are existing users in the target account with the same name as users in the source account, the initial refresh operation
  fails and describes the two options you have to continue:

  > + Force the refresh operation and allow any existing users in the target account to be dropped. The users in the source account
  >   will be replicated to the target account.
  >
  >   To force a refresh for a group, use the FORCE parameter for the refresh command. For example, to force the refresh of a failover
  >   group, execute the following command:
  >
  >   ```sqlexample
  >   ALTER FAILOVER GROUP <fg_name> REFRESH FORCE;
  >   ```
  > + Link the account objects by name. The [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](../sql-reference/functions/system_link_account_objects_by_name.md) function links
  >   users with the same name in both the target account and the source account. Users in the target account that are linked are
  >   not deleted.
  >
  >   To link account objects by name, execute the following command:
  >
  >   ```sqlexample
  >   SELECT SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME('<rg_name>');
  >   ```
  >
  >   > **Note:**
  >   >
  >   > Any user in the target account that *does not* have a matching user in the source account with the same name is dropped.
* If there are no users in the target account with names matching users in the source account, the initial refresh operation in
  the target account drops all users. This can result in the following data and metadata loss:

  > + If USERS are included in the OBJECT_TYPES list for a replication or failover group:
  >
  >   > - Worksheets are lost.
  >   > - Query history is lost.
  > + If USERS are included in the OBJECT_TYPES list, but ROLES is not:
  >
  >   > - Privilege grants to users are lost.
  > + If ROLES are included in the OBJECT_TYPES list:
  >
  >   > - Privilege grants to share objects are lost.

To avoid dropping users or roles in the target account:

1. In the source account, manually recreate any users or roles that exist *only* in the target account before the initial replication.
2. In the target account, link matching objects with the same name in both accounts using the
   [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](../sql-reference/functions/system_link_account_objects_by_name.md) function.

## Configure cloud storage access for secondary storage integrations

If you enable storage integration replication, you must take additional steps after the
storage integration is replicated to target accounts.
The replicated integration has its own identity and access management (IAM) entity that is different from the identity
and IAM entity of the primary integration. Therefore, you must update your cloud provider permissions
to grant the replicated integration access to your cloud storage.

You only need to configure this trust relationship on target accounts one time.

The process is similar to granting access in the source account.
See the following pages for more information:

* [Configuring a Snowflake storage integration to access Amazon S3](data-load-s3-config-storage-integration.md)
* [Configure an integration for Google Cloud Storage](data-load-gcs-config.md)
* [Configuring a Snowflake storage integration for Azure](data-load-azure-config.md)

## Configure automated refresh for directory tables on secondary stages

If you replicate an external stage with a directory table, and you have configured automated refresh for the source directory table,
you must take steps to configure [automated refresh](data-load-dirtables-auto.md) for the secondary directory table.

The process is similar to setting up automated refresh in your source account. See the following for more information:

* Amazon S3: The configuration process depends on how you set up event notifications.

  + If you use Amazon S3 Event Notifications with Amazon Simple Queue Service (SQS),
    follow the instructions in [Step 2: Configure event notifications](data-load-dirtables-auto-s3.md).
    You can also migrate from SQS to SNS. For more information, see [Migrate to Amazon Simple Notification Service (SNS)](account-replication-stages-pipes-load-history.md).
  + If you use Amazon Simple Notification Service (SNS), see [Subscribing the Snowflake SQS Queue to your SNS topic](data-load-dirtables-auto-s3.md).
* Google Cloud Storage: Create a new subscription to your Pub/Sub topic and a new notification integration in your target account.
  Then, grant Snowflake access to the Pub/Sub subscription. For instructions,
  see [Configure automation using GCS Pub/Sub](data-load-dirtables-auto-gcs.md).
* Azure Blob Storage: Create a new Event Grid subscription and storage queue. Then, create a new notification integration in the
  target account and grant Snowflake access to your storage queue. For instructions,
  see [Configure automation with Azure Event Grid](data-load-dirtables-auto-azure.md).

> **Important:**
>
> * After you complete these configuration steps in your target account,
>   you should perform a full refresh of your directory table to ensure that it has not missed any notifications.
> * For Google Cloud Storage and Azure Blob Storage, the name of the notification integration in each target account must match the name of
>   the notification integration in the source account.

## Configure notifications for secondary auto-ingest pipes

You must take additional steps to configure cloud notifications for secondary auto-ingest pipes before failover.
This section covers why this additional configuration is required, and how to complete it for each supported cloud provider.

### Amazon S3

The configuration process depends on how you set up event notifications. For example,
suppose you have an auto-ingest pipe that relies on an Amazon Simple Notification Service (SNS) topic
to publish messages about the Snowflake stage location.

When you replicate the pipe to a target account, Snowflake automatically creates a new Amazon Simple Queue Service (SQS) queue.
You must subscribe this SQS queue for your target account to the SNS topic to get notifications about the stage location.

* If you use Amazon S3 Event Notifications with Amazon Simple Queue Service (SQS),
  follow the instructions in [Step 4: Configure event notifications](data-load-snowpipe-auto-s3.md).

  > **Important:**
  >
  > To ensure that the pipe has not missed any notifications, you should refresh the pipe after switching to the new SQS queue.

  You can also migrate from SQS to SNS. For more information, see [Migrate to Amazon Simple Notification Service (SNS)](account-replication-stages-pipes-load-history.md).
* If you use Amazon Simple Notification Service (SNS), see
  [Subscribing the Snowflake SQS Queue to your SNS topic](data-load-snowpipe-auto-s3.md).
* If you use Amazon EventBridge, see [Option 3: Setting up Amazon EventBridge to automate Snowpipe](data-load-snowpipe-auto-s3.md).

### Microsoft Azure Blob Storage

A pipe that automatically loads data from files located on a stage in Microsoft Azure blob storage requires an Event Grid
subscription, storage queue, and a notification integration bound to the storage queue. A secondary pipe in a target account needs a separate
Event Grid, storage queue, and notification integration bound to the storage queue. The Event Grid in both source and target accounts must be
configured as endpoints for the same Azure Storage source.

See the diagram below for configuration details:

Create a new Event Grid subscription and storage queue. Then, create a new notification integration in the target account and
grant Snowflake access to your storage queue. For instructions, see [Configuring Automation With Azure Event Grid](data-load-snowpipe-auto-azure.md).

> **Important:**
>
> The name of the notification integration in each target account must match the name of the notification integration in
> the source account.

### External stage for Google Cloud Storage

A pipe that automatically loads data from files located in Google Cloud Storage requires a
Google Pub/Sub subscription and a notification integration
that references that subscription. Each replicated pipe in a target account also requires a Google Pub/Sub subscription and a
notification integration that references that subscription.
The Pub/Sub subscription in each source and target account must be subscribed to the same Pub/Sub Topic
that receives notifications from the Google Cloud Storage source.

See the diagram below for configuration details:

Create a new subscription to your Pub/Sub topic and a new notification integration in your target account.
:   Then, grant Snowflake access to the Pub/Sub subscription. For instructions,
    see [Configuring Automation Using GCS Pub/Sub](data-load-snowpipe-auto-gcs.md).

> **Important:**
>
> The name of the notification integration in each target account must match the name of the notification integration in
> the source account.

## Updating the remote service for API integrations

If you have enabled API integration replication, additional steps are required after the API integration is replicated to the target account.
The replicated integration has its own identity and access management (IAM) entity that are different from the identity and IAM entity
of the primary integration. Therefore, you must update the permissions on the remote service to grant access to replicated functions.
The process is similar to granting access to the functions on the primary account. See the below links for more details:

* Amazon Web Services [Set up the trust relationship(s) between Snowflake and the new IAM role](../sql-reference/external-functions-creating-aws-common-api-integration-proxy-link.md).
* Google Cloud Platform:
  [Create a GCP Security Policy for the Proxy Service](../sql-reference/external-functions-creating-gcp-ui-security-policy.md).
* Microsoft Azure:

  + Step 1. [Link the API integration for Azure](../sql-reference/external-functions-creating-azure-common-api-integration-proxy-link.md)
  + Step 2. [Create a validate-JWT policy](../sql-reference/external-functions-creating-azure-ui-security-policy.md)

## Comparing data sets in primary and secondary databases

Snowflake performs automatic verification checks as part of each replication refresh operation.
If a verification failure occurs, the refresh fails. Therefore, you don’t have to manually verify
the replicated data. If you need additional verification for compliance reasons, you
can perform manual verification steps after the refresh operation finishes.

### Automatic verification by Snowflake

Snowflake currently performs the following checks between the primary and secondary account, after each refresh operation:

* Snowflake compares the hash values between the primary and secondary account, for all files that were replicated.
* For each table, Snowflake compares the following values between the primary and secondary account:

  + File count.
  + Row count.
  + Byte count.

### Manual verification

If database objects are replicated in a replication or failover group, you can use the
[HASH_AGG](../sql-reference/functions/hash_agg.md) function to compare the rows in some or all tables in a
primary and secondary database to verify data consistency. The HASH_AGG function returns an
aggregate signed 64-bit hash value over the set of input rows. The hash value is the same regardless
of the ordering of the input rows.

Query this function on all tables, or a random subset of tables, in both the secondary account and
the primary account. On the primary account, use an [AT | BEFORE](../sql-reference/constructs/at-before.md) clause
to specify the point in time of the latest refresh for the associated database. Compare the output
between the queries on both accounts.

#### Example of manually verifying data after a refresh

In the following examples, the database `mydb` is included in the failover group `myfg`. The
database `mydb` contains the table `myschema.mytable`.

##### Commands to run on target account

1. Query the [REPLICATION_GROUP_REFRESH_PROGRESS](../sql-reference/functions/replication_group_refresh_progress.md) table function
   (in the [Snowflake Information Schema](../sql-reference/info-schema.md)). Note the `primarySnapshotTimestamp` in the `DETAILS` column for the
   `PRIMARY_UPLOADING_METADATA` phase. This is the timestamp for the latest refresh of that database on the primary account.

   ```sqlexample
   SELECT PARSE_JSON(details)['primarySnapshotTimestamp']
     FROM TABLE(information_schema.replication_group_refresh_progress('myfg'))
     WHERE PHASE_NAME = 'PRIMARY_UPLOADING_METADATA';
   ```
2. Query the HASH_AGG function for a specified table in the secondary account. The following query returns a hash value for all rows
   in the `myschema.mytable` table:

   ```sqlexample
   SELECT HASH_AGG( * ) FROM mydb.myschema.mytable;
   ```

##### Commands to run on source account

3. Query the HASH_AGG function for the same table in the primary account. Using Time Travel, specify the timestamp when the latest
   refresh was performed for the secondary database:

   ```sqlexample
   SELECT HASH_AGG( * ) FROM mydb.myschema.mytable
     AT(TIMESTAMP => '<primarySnapshotTimestamp>'::TIMESTAMP);
   ```
4. Compare the results from the two queries. The output should be identical.

## Modifying a replication or failover group in a source account

You can edit the name, included objects, and replication schedule of a replication or failover group in a source
account using Snowsight or SQL.

> **Note:**
>
> Replication groups can’t be changed to failover groups or vice versa. To enable or disable failover, delete
> the group and recreate it with the correct failover setting.

### Modify a replication or failover group in a source account using Snowsight

> **Note:**
>
> Only account administrators can edit a replication or failover group using Snowsight (refer to
> Limitations of using Snowsight for replication configuration).

To perform these actions, you must be signed in to the source account. If you are not signed in, the Status column
displays a sign in message instead of the refresh status.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, then select Groups.
4. Locate the replication or failover group you want to edit, and select the More menu (…) in the last column of the row.
5. Select Edit.
6. To change the group name, enter a new name in the Group name box that meets the following requirements:

   * Must start with an alphabetic character and cannot contain spaces or special characters unless the identifier string is
     enclosed in double quotes (for example, “My object”). Identifiers enclosed in double quotes are also case-sensitive.

     For more information, see [Identifier requirements](../sql-reference/identifiers-syntax.md).
   * Names for failover groups and replication groups in an account must be unique.
7. Choose Edit objects to add or remove share and account objects.

   > **Note:**
   >
   > Account objects can only be added to one replication or failover group. If a replication or failover group with any account
   > objects already exists in your account, you can’t select those objects.
8. Choose Select databases to add or remove database objects.
9. Select the Replication frequency to change the replication schedule for a group.
10. Select Save to update the group.

    If saving the changes to the group is unsuccessful, refer to Troubleshoot issues with creating and editing replication groups using Snowsight for common errors
    and how to resolve them.

### Modify a replication or failover group in a source account using SQL

You can modify a replication or failover group properties using the [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) or
[ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command.

## Pause or resume a replication schedule in a target account

You can pause (suspend) or resume a replication schedule in a target account using Snowsight or
SQL.

### Pause or resume a replication schedule in a target account using Snowsight

> **Note:**
>
> Only account administrators can edit a replication or failover group using Snowsight (refer to
> Limitations of using Snowsight for replication configuration).

To pause or resume a replication schedule, you must be signed in to the target account.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, then select Groups.
4. Locate the replication or failover group you want to edit, and select the More menu (…) in the last column of the row.
5. Select Pause or Resume.

### Pause or resume a replication schedule in a target account using SQL

You can pause or resume a replication schedule in a target account using the [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) or
[ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command. To pause, specify the `SUSPEND` parameter. To resume, specify the
`RESUME` parameter.

## Dropping a secondary replication or failover group

You can drop a secondary replication or failover using the [DROP REPLICATION GROUP](../sql-reference/sql/drop-replication-group.md) or the
[DROP FAILOVER GROUP](../sql-reference/sql/drop-failover-group.md) command. Only the replication or failover group owner (that is, the role with the OWNERSHIP
privilege on the group) can drop the group.

To drop a secondary replication or failover group using Snowsight, you must drop the group in the source account. See
Drop a replication or failover group using Snowsight.

## Dropping a primary replication or failover group

You can drop a primary replication or failover group using Snowsight or SQL. If you are deleting a primary group using SQL,
you must first drop all secondary groups. See Dropping a secondary replication or failover group.

### Drop a primary replication or failover group using SQL

A primary replication or failover group can only be dropped after all the replicas of the group (that is, secondary replication or failover
groups) have been dropped. Alternatively, you can promote a secondary failover group to serve as the primary failover group,
then drop the former primary failover group.

Note that only the group owner can drop the group.

### Drop a replication or failover group using Snowsight

> **Note:**
>
> Only account administrators can delete a replication or failover group using Snowsight (refer to
> Limitations of using Snowsight for replication configuration).

You can delete a primary replication or failover group and any linked secondary groups.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Accounts.
3. Select Replication, select Groups.
4. Locate the replication or failover group you want to delete. Select the More menu (…) in the last column of the row.
5. Select Drop, then select Drop group.

## Troubleshoot issues with creating and editing replication groups using Snowsight

The following scenarios can help you troubleshoot common issues that can occur when creating or editing replication or
failover group using Snowsight.

* You cannot add a database to a group
* You cannot add a share to a group

### You cannot add a database to a group

|  |  |
| --- | --- |
| Error | ```output Database '<database_name>' is already configured to replicate to account '<account_name>' by replication group '<group_name>'. ``` |
| Cause | A database can only be in one replication or failover group. One of the databases you selected for the group is already included in another replication or failover group. |
| Solution | Choose Select Databases and unselect any database(s) that are already included in another group. |

|  |  |
| --- | --- |
| Error | ```output Cannot directly add previously replicated object '<database_name>' to a replication group. Please use the provided system functions to convert this object first. ``` |
| Cause | The database you want to add to a replication or failover group was previously configured for database replication. |
| Solution | Disable database replication for the database. See Transitioning from database replication to group-based replication. |

### You cannot add a share to a group

|  |  |
| --- | --- |
| Error | ```output Share '<share_name>' is already configured to replicate to account '<account_name>' by replication group '<group_name>'. ``` |
| Cause | A share can only be in one replication or failover group. One of the shares you selected for the group is already included in another replication or failover group. |
| Solution | Choose Select Objects and unselect any share(s) that are already included in another group. |

## Limitations of using Snowsight for replication configuration

* Only a user with the ACCOUNTADMIN role can create a replication or failover group using Snowsight. A user with a role with the
  CREATE REPLICATION GROUP or CREATE FAILOVER GROUP privilege can create a group using the respective SQL commands.
* Only a user with the ACCOUNTADMIN role can edit or drop a replication or failover group using Snowsight. A user with a role
  with the OWNERSHIP privilege on a replication or failover group can edit and drop groups using the respective SQL commands.
* If your account uses private connectivity, you can’t use Snowsight to create, modify, or drop groups. You can use SQL
  to complete these actions.

---
title: Replication considerations
source: https://docs.snowflake.com/en/user-guide/account-replication-considerations.md
section: User Guide
---

# Replication considerations

This topic describes the behavior of certain Snowflake features in secondary databases and objects when replicated with
[replication or failover groups](account-replication-intro.md) or
[database replication](db-replication-config.md), and provides general guidance for working with replicated
objects and data.

If you have previously enabled database replication for individual databases using the ALTER DATABASE … ENABLE REPLICATION TO ACCOUNTS
command, see [Database replication considerations](database-replication-considerations.md) for additional considerations specific to database replication.

## Replication group and failover group constraints

The following sections explain the constraints around adding account objects, databases, and shares to replication and failover groups.

### Database and share objects

The following constraints apply to database and share objects:

* An object can only be in one failover group.
* An object can be in multiple replication groups as long as each group is replicated to a *different* target account.
* An object cannot be in both a failover group and a replication group.

You can only replicate outbound shares. Replication of [inbound shares](data-share-consumers.md) (shares from providers)
is not supported.

### Account objects

An account can only have one replication or failover group that contains objects other than databases or shares.

## Replication privileges

This section describes the replication privileges that are available to be granted to roles to specify the operations users can perform on
replication and failover group objects in the system. For the syntax of the GRANT command, see
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md).

> **Note:**
>
> For [database replication](db-replication-config.md), only a user with the ACCOUNTADMIN role can enable
> and manage database replication and failover. For additional information on required privileges for database replication,
> see the [required privileges table](db-replication-config.md) in [Step 6. Refreshing a secondary database on a schedule](db-replication-config.md).

| Privilege | Object | Usage | Notes |
| --- | --- | --- | --- |
| OWNERSHIP | Replication Group  Failover Group | Grants the ability to delete, alter, and grant or revoke access to an object. | Can be granted by:  The ACCOUNTADMIN role or  A role that has the MANAGE GRANTS privilege or  A role that has the OWNERSHIP privilege on the group. |
| CREATE REPLICATION GROUP | Account | Grants the ability to create a replication group. | Must be granted by the ACCOUNTADMIN role. |
| CREATE FAILOVER GROUP | Account | Grants the ability to create a failover group. | Must be granted by the ACCOUNTADMIN role. |
| FAILOVER | Failover Group | Grants the ability to promote a secondary failover group to serve as primary failover group. | Can be granted or revoked by a role with the OWNERSHIP privilege on the group. |
| REPLICATE | Replication Group  Failover Group | Grants the ability to refresh a secondary group. | Can be granted or revoked by a role with the OWNERSHIP privilege on the group. |
| MODIFY | Replication Group  Failover Group | Grants the ability to change the settings or properties of an object. | Can be granted or revoked by a role with the OWNERSHIP privilege on the group. |
| MONITOR | Replication Group  Failover Group | Grants the ability to view details within an object. | Can be granted or revoked by a role with the OWNERSHIP privilege on the group. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

## Replication and references across replication groups

Objects in a replication (or failover) group that have dangling references (i.e. references to objects in another replication or failover
group) might successfully replicate to a target account in some circumstances. If the replication operation results in behavior in the
target account consistent with behavior that can occur in the source account, replication succeeds.

For example, if a column in a table in failover group `fg_a` references a sequence in failover group `fg_b`, replication of both
groups succeeds. If `fg_a` is replicated before `fg_b`, insert operations (after failover) on the table that references the
sequence fails if `fg_b` was not replicated. This behavior can occur in a source account. If a sequence is dropped in a
source account, insert operations on a table with a column referencing the dropped sequence fails.

When the dangling reference is a security policy that protects data, the replication (or failover) group with the security policy
must be replicated before any replication group that contains objects that reference the policy is replicated.

> **Attention:**
>
> Making updates to security policies that protect data in separate replication or failover groups may result in inconsistencies
> and should be done with care.

For database objects, you can view [object dependencies](object-dependencies.md) in the Account Usage
[OBJECT_DEPENDENCIES view](../sql-reference/account-usage/object_dependencies.md).

### Dangling references and network policies

Dangling references in network policies can cause replication to fail with the following error message:

```output
Dangling references in the snapshot. Correct the errors before refreshing again.
The following references are missing (referred entity <- [referring entities])
```

To avoid dangling references, specify the following object types in the `OBJECT_TYPES` list when executing the CREATE or
ALTER command for the replication or failover group:

* If a network policy uses a network rule, include the database that contains the schema where the network rule was created.
* If a network policy is associated with the account, include `NETWORK POLICIES` and `ACCOUNT PARAMETERS` in the
  `OBJECT_TYPES` list.
* If a network policy is associated with a user, include `NETWORK POLICIES` and `USERS` in the `OBJECT_TYPES` list.

For more details, see [Replicating network policies](account-replication-security-integrations.md).

### Dangling references and packages policies

If there is a [packages policy](../developer-guide/udf/python/packages-policy.md) set on the account, the following dangling references
error occurs during the refresh operation for a replication or failover group that contains account objects:

```output
003131 (55000): Dangling references in the snapshot. Correct the errors before refreshing again.
The following references are missing (referred entity <- [referring entities]):
POLICY '<policy_db>.<policy_schema>.<packages_policy_name>' <- [ACCOUNT '<account_locator>']
```

To avoid dangling references, replicate the database that contains the packages policy to the target account. The database containing the
policy can be in the same or different replication or failover group.

### Dangling references and secrets

For details, see Replication and secrets.

### Dangling references and streams

Dangling references for streams cause replication to fail with the following error message:

```output
Primary database: the source object ''<object_name>'' for this stream ''<stream_name>'' is not included in the replication group.
Stream replication does not support replication across databases in different replication groups. Please see Streams Documentation
https://docs.snowflake.com/en/user-guide/account-replication-considerations#replication-and-streams for options.
```

To avoid dangling reference errors:

* The primary database must include both the stream and its base object or
* The database that contains the stream and the database that contains the base object referenced by the stream must be included in the
  same replication or failover group.

## Replication and read-only secondary objects

All secondary objects in a target account, including secondary databases and shares, are read-only. Changes to replicated objects or object types
cannot be made locally in a target account. For example, if the `USERS` object type is replicated from a source
account to a target account, new users cannot be created or modified in the target account.

New, local databases and shares *can* be created and modified in a target account. If `ROLES` are also replicated
to the target account, new roles cannot be created or modified in that target account. Therefore, privileges cannot be granted to (or revoked from)
a
role on a secondary object in the target account. However, privileges *can* be granted to (or revoked from) a secondary role on local
objects (for example, databases, shares, or replication or failover groups) created in the target account.

## Replication and objects in target accounts

If you created account objects, for example, users and roles, in your target account by *any means other than via replication* (for example,
using scripts), these users and roles have no global identifier by default. When a target account is refreshed from the source account, the
refresh operation **drops** any account objects of the types in the `OBJECT_TYPES` list in the target account that have no
global identifier.

> **Note:**
>
> The initial refresh operation to replicate USERS or ROLES might result in an error. This is to help prevent accidental deletion of
> data and metadata associated with users and roles. For more information about the circumstances that determine whether these
> object types are dropped or the refresh operation fails, see [Initial replication of users and roles](account-replication-config.md).

To avoid dropping these objects, see [Apply global IDs to objects created by scripts in target accounts](account-replication-config.md).

### Objects recreated in target accounts

If an existing object in the source account is replaced using a CREATE OR REPLACE statement, the existing object is dropped, and then
a new object with the same name is created in a single transaction. For example, if you execute a CREATE OR REPLACE statement for an
existing table `t1`, table `t1` is dropped, and then a new table `t1` is created. For more information, see the
[usage notes for CREATE TABLE](../sql-reference/sql/create-table.md).

When objects are replaced on the target account, the DROP and CREATE statements do not execute atomically during a refresh operation.
This means the object might disappear briefly from the target account while it is being recreated as a new object.

## Replication and security policies

The database containing a security policy and the references (i.e. assignments) can be replicated using replication and failover
groups. Security policies include:

* [Aggregation policies](aggregation-policies.md)
* [Authentication policies](authentication-policies.md)
* [Masking policies](security-column-intro.md)
* [Password policies](password-authentication.md)
* [Privacy policies](diff-privacy/differential-privacy-admin-privacy-policies.md)
* [Projection policies](projection-policies.md)
* [Row access policies](security-row-intro.md)
* [Session policies](session-policies.md), including
  session policies with secondary roles
* [Tag-based masking policies](tag-based-masking-policies.md)

If you are using [database replication](db-replication-intro.md),
see [Database replication and security objects](database-replication-considerations.md).

### Authentication, password, & session policies

Authentication, password, and session policy references for users are replicated when specifying the database containing policy
(`ALLOWED_DATABASES = policy_db`) and `USERS` in a replication group or failover group.

If either the policy database or users have already been replicated to a target account, update the replication or failover group
in the source account to include the databases and object types required to successfully replicate the policy. Then execute a refresh
operation to update the target account.

If user-level policies are not in use, `USERS` do not need to be included in the replication or failover group.

> **Note:**
>
> The policy must be in the same account as the account-level policy assignment and the user-level policy assignment.
>
> If you have a security policy set on the account or a user in the account and you do not update the
> replication or failover group to include the `policy_db` containing the policy and `USERS`, a dangling reference occurs in
> the target account. In this case, a dangling reference means that Snowflake cannot locate the policy in the target account because the
> fully-qualified name of the policy points to the database in the source account. Consequently, the target account or users in the target
> account are not required to comply with the security policy.
>
> To successfully replicate a security policy, verify the replication or failover group includes the object types and databases required
> to prevent a dangling reference.

### Privacy policies

Consider the following when replicating privacy policies and privacy-protected tables and views associated with
[differential privacy](diff-privacy/differential-privacy-overview.md):

* If a privacy policy is assigned to a table or view in the source account, the policy needs to be replicated in the target account.
* Cumulative privacy loss for a privacy budget is not replicated.
* Cumulative privacy loss in the target and source accounts are tracked separately.
* Administrators in the target account cannot adjust the replicated privacy budget. The privacy budget is synced with the one in the source
  account.
* If an analyst has access to the privacy-protected table or view in both the source account and the target account, they can incur twice
  the amount of privacy loss before reaching the privacy budget’s limit.
* Privacy domains set on the columns are also replicated.

### Session policies with secondary roles

If you are using session policies with secondary roles, you must specify the policy database
in the same replication group that contains the roles. For example:

```sqlexample
CREATE REPLICATION GROUP myrg
  OBJECT_TYPES = DATABASES, ROLES, USERS
  ALLOWED_DATABASES = session_policy_db
  ALLOWED_ACCOUNTS = myorg.myaccount
  REPLICATION_SCHEDULE = '10 MINUTE';
```

If you specify the session policy database that references secondary roles in a different replication or failover group (`rg2`) than the
replication or failover group that contains account-level objects (`myrg`) and you replicate or fail over `rg2` first, a
dangling reference occurs. An error message tells you to place the session policy
database in the replication or failover group that contains the roles. This behavior occurs when the session policy is set on the account
or users.

If the session policy and account level objects are in different replication groups, and the session policy is not set on the account or
users, you can replicate and refresh the target account. Be sure to refresh for the replication group that contains the account level
objects first.

If you refresh the target account after replicating or failing over the session policy with secondary roles and role objects, the target
account reflects the session policy and secondary roles behavior in the source account.

Additionally, when you refresh the database in the target account and the database contains a session policy that references secondary
roles, `ALLOWED_SECONDARY_ROLES` always evaluates to `[ALL]`.

## Replication and secrets

You can only replicate the secret using a replication or failover group. Specify the database that contains the secret, the database that
contains UDFs or procedures that reference the secret, and the integrations that reference the secret in a single replication or failover
group.

If you have the database that contains the secret in one replication or failover group and the integration that references the secret in a
different replication or failover group, then:

* If you replicate the integration first and then the secret, the operation is successful: all objects are replicated and there are no
  dangling references.
* If you replicate the secret before the integration and the secret does not already exist in the target account, a “placeholder secret” is
  added in the target account to prevent a dangling reference. Snowflake maps the placeholder secret to the integration.

  After you replicate the group that contains the integration, on the next refresh operation for the group that contains the secret,
  Snowflake updates the target account to replace the placeholder secret with the secret that is referenced in the integration.
* If you replicate the secret and do not replicate the integration from `account1` to `account2`, the integration doesn’t work in the
  target account (`account2`) because there is no integration to use the secret. Additionally, if you failover and the target account is
  promoted to source account, the integration will not work.

  When you decide to failover to make `account1` as the source account, the secret and integration references match and the placeholder
  secret is not used. This allows you to use the security integration and the secret that contains the credentials because the objects can
  reference each other.

## Replication and cloning

Historically [Cloned objects](object-clone.md) were replicated physically rather than logically to secondary databases. That is,
cloned tables in a standard database don’t contribute to the overall data storage unless or until DML operations on the clone
add to or modify existing data. However, when a cloned table is replicated to a secondary database, the physical data is also replicated,
increasing the data storage usage for your account.

A logically replicated cloned table shares the micro-partitions of the original table it was cloned from,
reducing the physical storage of the secondary table in the target account.

If the original table and cloned table are included in the same replication or failover group, the cloned table can be replicated
logically to the target account.

### Logical replication of clones

If the original and cloned table are included in the same replication or failover group, the cloned table can be replicated
logically to the target account.

For example, if table `t2` in database `db2` is a clone of table `t1` in database `db1`, and both databases are included
in replication group `rg1`, then table `t2` is created as a logical clone in the target account.

A cloned object can be cloned to create additional clones of the original object. The original object and the cloned objects are part
of the same [clone group](tables-storage-considerations.md). For example, if table `t3` in database `db3` is created as a clone of `t2`, it is in the same clone group
as the original table `t1` and the cloned table `t2`.

If database `db3` is later added to the replication group `rg1`, table `t3` is created in the target account as a logical clone of
table `t1`.

#### Considerations

* Tables that are in the same clone group in the source account might not be in the same clone group in the target account.
* The original table and its cloned table must be in the same replication or failover group.
* In some cases, not all micro-partitions of the clone group can be shared with the cloned table. This can result in additional storage usage
  for the cloned table in the target account.

#### Example

Table `t2` in database `db2` is a clone of table `t1` in database `db1`. Include both databases in
replication group `myrg` to logically replicate `t2` to the target account:

```sqlexample
CREATE REPLICATION GROUP myrg
    OBJECT_TYPES = DATABASES
    ALLOWED_DATABASES = db1, db2
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

## Replication and automatic clustering

In a primary database, Snowflake monitors clustered tables using [Automatic Clustering](tables-auto-reclustering.md) and reclusters them as
needed. As part of a refresh operation, clustered tables are replicated to a secondary database with the current sorting of the table
micro-partitions. As such, reclustering is not performed again on the clustered tables in the secondary database, which would be
redundant.

If a secondary database contains clustered tables and the database is promoted to become the primary database, Snowflake begins Automatic
Clustering of the tables in this database while simultaneously suspending the monitoring of clustered tables in the previous primary
database.

See Replication and Materialized Views (in this topic) for information about Automatic Clustering for materialized views.

## Replication and large, high-churn tables

When one or more rows of a table are updated or deleted, all of the impacted micro-partitions that store this data in a primary database
are re-created and must be synchronized to secondary databases. For large, high-churn dimension tables, the replication costs can be
significant.

For large, high-churn dimension tables that incur significant replication costs, the following mitigations are available:

* Replicate any primary databases that store such tables at a lower frequency.
* Change your data model to reduce churn.

For more information, see [Managing costs for large, high-churn tables](tables-storage-considerations.md).

## Replication and Time Travel

[Time Travel](data-time-travel.md) and [Fail-safe](data-failsafe.md) data is maintained independently for a
secondary database and is not replicated from a primary database. Querying tables and views in a secondary database using Time Travel
can produce different results than when executing the same query in the primary database.

Historical Data:
:   Historical data available to query in a primary database using Time Travel is not replicated to secondary databases.

    For example, suppose data is loaded continuously into a table every 10 minutes using Snowpipe, and a secondary database is refreshed
    every hour. The refresh operation only replicates the latest version of the table. While every hourly version of the table within the
    retention window is available for query using Time Travel, none of the iterative versions within each hour (the individual Snowpipe
    loads) are available.

Data Retention Period:
:   The data retention period for tables in a secondary database begins when the secondary database is refreshed with the DML operations
    (i.e. changing or deleting data) written to tables in the primary database.

    > **Note:**
    >
    > The data retention period parameter, [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md), is only replicated to database objects in the secondary
    > database, not to the database itself. For more details about parameter replication, see [Parameters](db-replication-intro.md).

## Replication and materialized views

In a primary database, Snowflake performs automatic background maintenance of materialized views. When a base table changes, all
materialized views defined on the table are updated by a background service that uses compute resources provided by Snowflake. In addition,
if Automatic Clustering is enabled for a materialized view, then the view is monitored and reclustered as necessary in a primary database.

A refresh operation replicates the materialized view definitions to a secondary database; the materialized view data is not
replicated. Automatic background maintenance of materialized views in a secondary database is enabled by default. If Automatic
Clustering is enabled for a materialized view in a primary database, automatic monitoring and reclustering of the materialized view in the
secondary database is also enabled.

> **Note:**
>
> The charges for automated background synchronization of materialized views are billed to each account that contains a secondary
> database.

## Replication and Apache Iceberg™ tables

Consider the following points when you use replication for Iceberg tables:

* Snowflake currently supports replication of Snowflake-managed tables only.
* Replicating converted Iceberg tables isn’t supported. Snowflake skips converted tables during refresh operations.
* For replicated tables, you must configure access to a storage location in the *same region* as the target account.
* If you drop or alter a storage location that is used for replication on the primary external volume, refresh operations might fail.
* Secondary tables in the target account are read-only until you promote the target account to serve as the source account.
* Snowflake maintains the [directory hierarchy](tables-iceberg-storage.md)
  of the primary Iceberg table for the secondary table.
* Replication costs apply for this feature. For more information, see [Understanding replication cost](account-replication-cost.md).
* For considerations about the account objects for replication and failover groups, see Account objects.
* Replicating dynamic Iceberg tables isn’t supported. Snowflake skips converted tables during refresh operations.

## Replication and dynamic tables

Dynamic table replication behavior varies based on whether the primary database containing the dynamic table is part
of a replication group or a failover group.

### Dynamic tables and replication groups

A database that contains a dynamic table can be replicated using a replication group. The source object(s) it depends
on are not required to be in the same replication group.

Replicated objects in each target account are referred to as *secondary* objects and are replicas of the *primary*
objects in the source account. Secondary objects are read-only in
the target account. If a secondary replication group is dropped in a target account, the databases that were
included in the group become read/write. However, any dynamic tables included in a replication group remain
read-only even after the secondary group is dropped in the target account. No DML or dynamic table refreshes can
happen on these read-only dynamic tables.

### Dynamic tables and failover groups

A database that contains a dynamic table can be replicated using a failover group. If a dynamic table references
source objects outside the failover group or database replication, it can still be replicated. After a failover, the
dynamic table resolves source objects using name resolution during refresh. The refresh might succeed or fail,
depending on the state of the source objects. If successful, the dynamic table is reinitialized with the latest data
from the source objects.

Secondary dynamic tables are read-only and do not get refreshed. After a failover occurs and a secondary dynamic
table is promoted to primary dynamic table, the first refresh is a reinitialization followed by incremental
refreshes if the dynamic table is configured for incremental refresh of data.

> **Note:**
>
> The reinitialized dynamic table might differ from the original replica because the source objects and dynamic table
> are not guaranteed to share the same replication snapshot.

**Example: Refresh failure due to missing source objects**

If a dynamic table depends on a source table outside the failover group, it cannot refresh after a failover. In the
above diagram, the dynamic table `dt` in the primary account is replicated to the secondary account. `dt`
depends on `source_table`, which is not included in the same failover group as the primary account. After failover,
the refresh in the secondary account fails because `source_table` cannot be resolved.

**Example: Successful refresh when source objects exist in secondary account via separate replication**

In the above diagram, the dynamic table `dt` depends on `source_table`. Both `dt` and `source_table` in the
primary account are replicated to the secondary account through independent failover groups. After replication and
failover, when `dt` is refreshed in the secondary account, the refresh succeeds because `source_table` can be
found through name resolution.

**Example: Successful refresh when source objects exist in secondary account locally**

In the above diagram, the dynamic table `dt` depends on `source_table` and is replicated through a failover group
from the primary account to the secondary account. A `source_table` is created locally in the secondary account.
After failover, when `dt1` is refreshed in the secondary account, the refresh can succeed because `source_table`
can be found through name resolution.

## Replication and Snowpipe Streaming

A table populated by [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) in a primary database is replicated to the secondary database in a target account.

In the primary database, tables are created and rows are inserted through [channels](snowpipe-streaming/snowpipe-streaming-channels.md). [Offset tokens](snowpipe-streaming/snowpipe-streaming-channels.md) track the ingestion progress. A refresh operation replicates the table object, table data, and the channel offsets associated with the table from the primary database to the secondary database.

### Snowpipe Streaming architectures

Snowflake supports two underlying architectures for Snowpipe Streaming, which determine the available client APIs and performance characteristics.

#### Snowpipe Streaming with classic architecture

**Read-only operations (available in source and target accounts):**

* The channel `getLatestCommittedOffsetToken` API
* `SHOW CHANNELS` command

**Write operations (only available in the source account):**

* The client [openChannel](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/net/snowflake/ingest/streaming/SnowflakeStreamingIngestClient.html#openChannel(net.snowflake.ingest.streaming.OpenChannelRequest)) API
* The channel [insertRow](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/net/snowflake/ingest/streaming/SnowflakeStreamingIngestChannel.html#insertRow(java.util.Map,java.lang.String)) API
* The channel [insertRows](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/net/snowflake/ingest/streaming/SnowflakeStreamingIngestChannel.html#insertRows(java.lang.Iterable,java.lang.String)) API

#### Snowpipe Streaming with high-performance architecture

This architecture offers optimized features, including bulk operations and enhanced status checks, crucial for managing high-volume, replicated environments.

All functions described below are accessible via both [Snowpipe Streaming SDKs](snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) and the [Snowpipe Streaming REST API](snowpipe-streaming/snowpipe-streaming-high-performance-rest-api.md), allowing for flexible integration based on your infrastructure needs.

Write & management operations (available only in the source account):

* Channel lifecycle management: Open and manage the ingestion channels required to establish a data stream. For example, the [openChannel](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/com/snowflake/ingest/streaming/package-summary.html) method in the Java SDK.
* Transactionally consistent ingestion: The core function for appending rows. Data inserted here is guaranteed to be included in the replication snapshot once committed. For example, the [appendRows](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/com/snowflake/ingest/streaming/SnowflakeStreamingIngestChannel.html) method in the Java SDK.
* Offset token tracking: Retrieve the latest committed offset tokens to ensure data integrity and prevent duplication during ingestion. For example, the [getLatestCommittedOffsetToken](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/com/snowflake/ingest/streaming/SnowflakeStreamingIngestClient.html) method in the Java SDK.
* Bulk status monitoring: Efficiently monitor health and lag metrics across multiple channels. This is critical for verifying that data latency is acceptable before replication occurs. For example, the [getChannelStatus](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/com/snowflake/ingest/streaming/SnowflakeStreamingIngestClient.html) method in the Java SDK.

Read-only operations (available in both source and target accounts):

* Channel inspection: Use metadata commands, such as `SHOW CHANNELS`, to view configuration details, status, and properties of existing ingestion channels across the replicated environment.

### Data loss avoidance

To avoid data loss in the case of failover, the data retention time for successfully inserted rows in your upstream data source must be greater than the configured replication schedule. If data is inserted into a table in a primary database, and failover occurs before the data can be replicated to the secondary database, the same data will need to be inserted into the table in the newly promoted primary database. The following example shows a failover scenario:

1. Table `t1` in primary database `repl_db` is populated with data with Snowpipe Streaming and the Kafka connector.
2. The `offsetToken` is 100 for channel 1 and 100 for channel 2 for `t1` in the primary database.
3. A refresh operation completes successfully in the target account.
4. The `offsetToken` is 100 for channel 1 and 100 for channel 2 for the `t1` in the secondary database.
5. More rows are inserted into `t1` in the primary database.
6. The `offsetToken` is now 200 for channel 1 and 200 for channel 2 for the `t1` in the primary database.
7. A failover occurs before the additional rows and new channel offsets can be replicated to the secondary database.

In this case, there are 100 missing offsets in each channel for table `t1` in the newly promoted primary database. To insert the missing data, see [Reopen active channels for Snowpipe Streaming in newly promoted source account](account-replication-failover-failback.md).

### Replication support requirements

#### Snowpipe Streaming with classic architecture

Snowpipe Streaming replication support for the classic architecture requires the following minimum versions:

* Snowflake Ingest SDK version 1.1.1 or later.
* If you use the Kafka connector: Kafka connector version 1.9.3 or later.

#### Snowpipe Streaming with high-performance architecture

Snowpipe Streaming replication support for the high-performance architecture requires the following minimum versions:

* Snowpipe Streaming SDK version 1.1.0 or later.

#### Data retention requirement for both architectures

The data retention time for successfully inserted rows in your upstream data source must be greater than the configured replication schedule. If you use the Kafka connector, ensure that your `log.retention` configuration is set with a sufficient buffer.

## Replication and stages

The following constraints apply to stage objects:

* Snowflake currently supports stage replication as part of group-based replication (replication and failover groups).
  Stage replication is not supported for database replication.
* You can replicate an external stage. However, the files on an external stage are not replicated.
* You can replicate an internal stage. To replicate the files on an internal stage, you must enable a directory table on the stage.
  Snowflake replicates only the files that are mapped by the directory table.
* When you replicate an internal stage with a directory table, you cannot disable the directory table on the primary or secondary stage.
  The directory table contains critical information about replicated files and files loaded using a COPY statement.
* A refresh operation will fail if the directory table on an internal stage contains a file that is larger than 5GB. To work around this
  limitation, move any files larger than 5GB to a different stage.

  You cannot disable the directory table on a primary or secondary stage, or any stage that has previously been replicated. Follow
  these steps *before* you add the database that contains the stage to a replication or failover group.

  1. [Disable the directory table](../sql-reference/sql/alter-stage.md) on the primary stage.
  2. Move the files that are larger than 5GB to another stage that does not have a directory table enabled.
  3. After you move the files to another stage, re-enable the directory table on the primary stage.
* Files on user stages and table stages are not replicated.
* For named external stages that use a storage integration, you must configure the trust relationship for secondary storage integrations
  in your target accounts prior to failover. For more information, see [Configure cloud storage access for secondary storage integrations](account-replication-config.md).
* If you replicate an external stage with a directory table, and you have configured
  [automated refresh](data-load-dirtables-auto.md) for the source
  directory table, you must configure automated refresh for the secondary directory table before failover. For more information,
  see [Configure automated refresh for directory tables on secondary stages](account-replication-config.md).
* A copy command might take longer than expected if the directory table on a replicated stage is not consistent with the
  replicated files on the stage. To make a directory table consistent, refresh it with an
  [ALTER STAGE … REFRESH](../sql-reference/sql/alter-stage.md) statement.
  To check the consistency status of a directory table, use the [SYSTEM$GET_DIRECTORY_TABLE_STATUS](../sql-reference/functions/system_get_directory_table_status.md) function.

## Replication and pipes

The following constraints apply to pipe objects:

* Snowflake currently supports pipe replication as part of group-based replication (replication and failover groups).
  Pipe replication is not supported for database replication.
* Snowflake replicates the copy history of a pipe only when the pipe belongs to the same replication group as its target table.
* Replication of notification integrations is not supported.
* Snowflake only replicates load history after the latest table truncate.
* To receive notifications, you must configure a secondary auto-ingest pipe in a target account prior to failover.
  For more information, see [Configure notifications for secondary auto-ingest pipes](account-replication-config.md).
* Use the [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function to resolve any pipes not in their expected execution state after failover.
* Snowflake doesn’t support replication and failover for Snowpipe with the Kafka connector, but Snowflake does support replication and failover for Snowpipe Streaming with the Kafka connector. For more information, see [Snowpipe Streaming and the Kafka connector](account-replication-failover-failback.md).

## Replication of data metric functions (DMFs)

The following behaviors apply to [DMF](data-quality-intro.md) replication:

Event tables
:   The event table that stores the results of manually calling or scheduling a DMF to run is not replicated because the event table is local
    to your Snowflake account, and Snowflake does not support replicating event tables.

Replication groups
:   When you add the database(s) that contain your DMFs to a replication group, the following occurs in the target account:

    * DMFs are replicated from the source account.
    * Tables or views that the [DMF definition](../sql-reference/sql/create-data-metric-function.md) specifies, such as with a
      [foreign key reference](../sql-reference/sql/create-data-metric-function.md) are replicated from the source account, unless the table
      or view is associated with
      [Cross-Cloud Auto-Fulfillment](../collaboration/provider-listings-auto-fulfillment.md).
    * Scheduled DMFs in the target account are suspended. The secondary DMFs resume their schedule when you promote the target account to
      source account and the secondary DMFs become primary DMFs.

Failover groups
:   When you replicate the database(s) that contain your DMFs using a failover group, the following occurs in the case of failover:

    * Resumes the schedule of suspended DMFs when you promote the target account to source account.
    * Suspends scheduled DMFs in the target account after you promote a different account to source account.

    If you do not replicate the database that contains the DMF to a target account, the DMF associations to a table or view are
    dropped when the target account is promoted to source account because they are not available in the
    newly promoted source account.

    > **Tip:**
    >
    > Prior to failing over your account, [check the DMF references](data-quality-monitor.md) by calling the
    > DATA_METRIC_FUNCTION_REFERENCES Information Schema table function to determine the table objects that are associated with a DMF
    > before the promotion and refresh operations.

## Replication of stored procedures and user-defined functions (UDFs)

Stored procedures and UDFs are replicated from a primary database to secondary databases.

### Stored Procedures and UDFs and Stages

If a stored procedure or UDF depends on files in a stage (for example, if the stored
procedure is defined in Python code that is uploaded from a stage), you must replicate the stage and its files to the secondary
database. For more information about replicating stages, see [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md).

For example, if a primary database has an in-line Python UDF that imports any code that is stored on a stage, the UDF does not work unless
the stage and its imported code are replicated in the secondary database.

### Stored Procedures and UDFs and External Network Access

If a stored procedure or UDF depends on access to an
[external network location](../developer-guide/external-network-access/creating-using-external-network-access.md), you must
replicate the following objects:

* EXTERNAL ACCESS INTEGRATIONS must be included in the `allowed_integration_types` list for the replication or
  failover group.
* The database that contains the network rule.
* The database that contains the secret that stores the credentials to authenticate with the external network location.
* If the secret object references a security integration, you must include SECURITY INTEGRATIONS in the `allowed_integration_types`
  list for the replication or failover group.

## Replication and storage lifecycle policies

Snowflake replicates [storage lifecycle policies](storage-management/storage-lifecycle-policies.md)
and their associations with tables to target accounts, but doesn’t run the policies.
Snowflake doesn’t replicate archived data in the COOL or COLD tiers.
Archived data in your source account isn’t available in the target account.

After failover to a target account, Snowflake pauses storage lifecycle policy execution in the
original source account. After *failback* to the source account, Snowflake resumes
policy execution.

Snowflake never automatically runs secondary storage lifecycle policies on secondary tables,
even after failover. However, you can use secondary policies in a target account by attaching
them to new tables. For those new tables, Snowflake runs the policies.

## Replication and streams

This section describes recommended practices and potential areas of concern when replicating streams in [Replicating databases across multiple accounts](db-replication-config.md) or [Account Replication and Failover/Failback](account-replication-intro.md).

### Supported Source Objects for Streams

Replicated streams can successfully track the change data for tables and views in the same database.

Currently, the following source object types are not supported:

* External tables
* Tables or views in databases separate from the stream databases, unless both the stream database and the database that stores the source object are included in the same
  [replication or failover group](account-replication-intro.md).
* Tables or views in a shared databases (i.e. databases shared from provider accounts to your account)

Replicating streams on directory tables is supported when you enable [Stage, pipe, and load history replication](account-replication-stages-pipes-load-history.md).

A database replication or refresh operation fails if the primary database includes a stream with an unsupported source object. The operation also fails if the source object for any stream has been dropped.

Append-only streams are not supported on replicated source objects.

### Avoiding Data Duplication

> **Note:**
>
> In addition to the scenario described in this section, streams in a secondary database could return duplicate rows the first time they are included in a refresh operation. In this case, *duplicate rows* refers to a single row with multiple METADATA$ACTION column values.
>
> After the initial refresh operation, you should not encounter this specific issue in a secondary database.

Data duplication occurs when DML operations write the same change data from a stream multiple times without a uniqueness check. This can occur if a stream and a destination table for the stream change data are stored in separate databases, and these databases are not replicated and failed over in the same group.

For example, suppose you regularly insert change data from stream `s` into table `dt`. (For this example, the source object for the stream does not matter.) Separate databases store the stream and destination table.

1. At timestamp `t1`, a row is inserted into the source table for stream `s`, creating a new table version. The stream stores the offset for this table version.
2. At timestamp `t2`, the secondary database that stores the stream is refreshed. Replicated stream `s` now stores the offset.
3. At timestamp `t3`, the change data for stream `s` is inserted into table `dt`.
4. At timestamp `t4`, the secondary database that stores stream `s` is failed over.
5. At timestamp `t5`, the change data for stream `s` is inserted again into table `dt`.

To avoid this situation, replicate and fail over together the databases that store streams and their destination tables.

### Stream References in Task WHEN Clause

To avoid unexpected behavior when running replicated tasks that reference streams in the `WHEN boolean_expr` clause, we recommend that you either:

* Create the tasks and streams in the same database, or
* If streams are stored in a different database from the tasks that reference them, include both databases in the same [failover group](account-replication-intro.md).

If a task references a stream in a separate database, and both databases are not included in the same failover group, then the database that contains the task could be failed over without the database that contains the stream. In this scenario, when the task is resumed in the failed over database, it records an error when it attempts to run and cannot find the referenced stream. This issue can be resolved by either failing over the database that contains the stream or recreating the database and stream in the same account as the failed over database that contains the task.

### Stream Staleness

If a stream in the primary database has become [stale](streams-intro.md), the replicated stream in a secondary database is also stale and cannot be queried or its change data consumed. To resolve this issue, recreate the stream in the primary database (using [CREATE OR REPLACE STREAM](../sql-reference/sql/create-stream.md)). When the secondary database is refreshed, the replicated stream is readable again.

Note that the offset for a recreated stream is the current table version by default. You can recreate a stream that points to an earlier table version using Time Travel; however, the replicated stream would remain unreadable. For more information, see Stream Replication and Time Travel (in this topic).

### Stream Replication and Time Travel

After a primary database is failed over, if a stream in the database uses [Time Travel](data-time-travel.md) to read a [table version](streams-intro.md) for the source object from a point in time before the last refresh timestamp, the replicated stream cannot be queried or the change data consumed. Likewise, querying the change data for a source object from a point in time before the last refresh timestamp using the [CHANGES](../sql-reference/constructs/changes.md) clause for [SELECT](../sql-reference/sql/select.md) statements fails with an error.

This is because a refresh operation collapses the table history into a single table version. Iterative table versions created before the refresh operation timestamp are not preserved in the table history for the replicated source objects.

Consider the following example:

1. Table `t1` is created in the primary database with change tracking enabled (table version `tv0`). Subsequent DML transactions create table versions `tv1` and `tv2`.
2. A secondary database that contains table `t1` is refreshed. The table version for this replicated table is `tv2`; however, the table history is not replicated.
3. A stream is created in the primary database with its offset set to table version `tv1` using Time Travel.
4. The secondary database is failed over, becoming the primary database.
5. Querying stream `s1` returns an error, because table version `tv1` is not in the table history.

Note that when a subsequent DML transaction on table `t1` iterates the table version to `tv3`, the offset for stream `s1` is advanced. The stream is readable again.

### Avoiding Data Loss

Data loss can occur when the most recent refresh operation for a secondary database is not completed prior to the failover operation. We recommend refreshing your secondary databases frequently to minimize the risk.

## Replication and tasks

This section describes task replication in [Replicating databases across multiple accounts](db-replication-config.md) or [Account Replication and Failover/Failback](account-replication-intro.md).

> **Note:**
>
> Database replication does not work for task graphs if the graph is owned by a different role than the role that performs replication.

### Replication Scenarios

The following table describes different task scenarios and specifies whether the tasks are replicated or not. Except where noted, the scenarios pertain to both standalone tasks and tasks in a [task graph](tasks-graphs.md):

| Scenario | Replicated | Notes |
| --- | --- | --- |
| Task was created and either resumed or executed manually (using [EXECUTE TASK](../sql-reference/sql/execute-task.md)). Resuming or executing a task creates an initial task version. | ✔ |  |
| Task was created but never resumed or executed. | ❌ |  |
| Task was recreated (using [CREATE OR REPLACE TASK](../sql-reference/sql/create-task.md) but never resumed or executed). | ✔ | The latest version before the task was recreated is replicated.  Resuming or manually executing the task commits a new version. When the database is replicated again, the new, or latest, version is replicated to the secondary database. |
| Task was created and resumed or executed, but subsequently dropped. | ❌ |  |
| Task graph was created and resumed or executed. Subsequently, a task in the task graph was modified, but the task graph’s root task wasn’t resumed or executed again. Examples of modifications include the following:   * Using [ALTER TASK … SET/UNSET/MODIFY](../sql-reference/sql/alter-task.md) on a root task, child task, or finalizer task. * Using [ALTER TASK … SUSPEND](../sql-reference/sql/alter-task.md) on a child task or finalizer task. | ✔ | The latest version of the task graph before the task was modified is replicated.  Resuming or manually executing a task commits a new version that includes any changes to the parameters of the tasks within the task graph. Because the new changes were never committed, only the previous version of the task graph is replicated.  Note that if the modified task graph is not resumed within a retention period (currently 30 days), the latest version of the task is dropped. After this period, the task is not replicated to a secondary database unless it’s resumed again. |
| Root task in a task graph was created and resumed or executed, but was subsequently suspended and dropped. | ❌ | The entire task graph is not replicated to a secondary database. |
| Child task in a task graph is created and resumed or executed, but is subsequently suspended and dropped. | ✔ | The latest version of the task graph (before the task was suspended and dropped) is replicated to a secondary database. |

### Resumed or Suspended State of Replicated Tasks

If all of the following conditions are met, a task is replicated to a secondary database in a resumed state:

* A standalone or root task is in a resumed state in the primary database when the replication or refresh operation begins until the operation is completed. If a task is in a resumed state during only part of this period, it might still be replicated in a resumed state.

  A child task is in a resumed state in the latest version of the task.
* The parent database was replicated to the target account along with role objects in the same, or different, [replication or failover group](account-replication-intro.md).

  After the roles and database are replicated, you must refresh the objects in the target account by executing either [ALTER REPLICATION GROUP … REFRESH](../sql-reference/sql/alter-replication-group.md) or [ALTER FAILOVER GROUP … REFRESH](../sql-reference/sql/alter-failover-group.md), respectively. If you refresh the database by executing [ALTER DATABASE … REFRESH](../sql-reference/sql/alter-database.md), the state of the tasks in the database is changed to suspended.

  A replication or refresh operation includes the privilege grants for a task that were current when the latest table version was committed. For more information, see Replicated Tasks and Privilege Grants (in this topic).

If these conditions are not met, the task is replicated to a secondary database in a suspended state.

> **Note:**
>
> Secondary tasks aren’t scheduled until after a failover, regardless of their `state`. For more details, refer to Task Runs After a Failover

### Replicated Tasks and Privilege Grants

If the parent database is replicated to a target account along with role objects in the same, or different, replication or failover group, the privileges granted on the tasks in the database are replicated as well.

The following logic determines which task privileges are replicated in a replication or refresh operation:

* If the current task owner (that is, the role that has the OWNERSHIP privilege on a task) is the same role as when the task was resumed last, then all current grants on the task are replicated to the secondary database.
* If the current task owner is not the same role as when the task was resumed last, then only the OWNERSHIP privilege granted to the owner role in the task version is replicated to the secondary database.
* If the current task owner role is not available (for example, a child task is dropped but a new version of the task graph is not committed yet), then only the OWNERSHIP privilege granted to the owner role in the task version is replicated to the secondary database.

### Task Runs After a Failover

After a secondary failover group is promoted to serve as the primary group, any resumed tasks in databases within the failover group are scheduled gradually. The amount of time required to restore normal scheduling of all resumed standalone tasks and task graphs depends on the number of resumed tasks in a database.

## Replication and dbt projects

* dbt project objects are replicated from a primary database to secondary databases.
* All secondary objects in a target account, including secondary databases, are read-only. A secondary dbt project cannot be executed.
* Any objects that a dbt project references, such as source tables and views, should be replicated with the dbt project in order for
  executions of that dbt project to succeed after failover.

### dbt projects and external network access

If a dbt project depends on access to an
[external network location](../developer-guide/external-network-access/creating-using-external-network-access.md), you must replicate
the following objects:

* EXTERNAL ACCESS INTEGRATIONS must be included in the `allowed_integration_types` list for the replication or
  failover group.
* The database that contains the network rule.
* The database that contains the secret that stores the credentials to authenticate with the external network location.
* If the secret object references a security integration, you must include SECURITY INTEGRATIONS in the `allowed_integration_types`
  list for the replication or failover group.

A dbt project does not store the external network access integrations that it is associated with. External network access
integrations are specified when the user runs the EXECUTE DBT PROJECT command. This makes the requirement to replicate external
access integrations separately more apparent.

## Replication and tags

Tags and their assignments can be replicated from a source account to a target account.

Tag assignments cannot be modified in the target account after the initial replication from the source account. For example,
setting a tag on a secondary (i.e. replicated) database is not allowed. To modify tag assignments in the target account, modify
them in the source account and replicate them to the target account.

To successfully replicate tags, ensure that the replication or failover group includes:

* The database containing the tags in the `ALLOWED_DATABASES` property.
* Other account-level objects that have a tag in the `OBJECT_TYPES` property (e.g. `ROLES`, `WAREHOUSES`).

  For more information, see [CREATE REPLICATION GROUP](../sql-reference/sql/create-replication-group.md) and [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md).

## Replication and instances of Snowflake classes

An instance of the [CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier.md) class is replicated when the database that contains
the instance is replicated. Replication of instances of other Snowflake [classes](../sql-reference-classes.md) is *not* supported.

## Historical usage data

Historical usage data for activity in a primary database is not replicated to secondary databases. Each account has its own query history,
login history, etc.

Historical usage data includes the query data returned by the following [Snowflake Information Schema](../sql-reference/info-schema.md) table functions or
[Account Usage](../sql-reference/account-usage.md) views:

* COPY_HISTORY
* LOGIN_HISTORY
* QUERY_HISTORY
* etc.

---
title: Replication of security integrations & network policies across multiple accounts
source: https://docs.snowflake.com/en/user-guide/account-replication-security-integrations.md
section: User Guide
---

# Replication of security integrations & network policies across multiple accounts

This topic provides information on how to replicate security integrations, along with using failover/failback with
each of these objects, and assumes familiarity with replication and failover/failback with other account-level objects
(e.g. users, roles, warehouses).

For details, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

These objects and services are supported across [regions](intro-regions.md) and across
[cloud platforms](intro-cloud-platforms.md).

## Overview

Snowflake supports replicating network policies and security integrations for federated SSO (i.e. SAML2), OAuth, and SCIM along with
enabling failover/failback for each network policy and integration.

The general approach to test replication and failover/failback with each network policy and security integration is as follows:

1. Identify the source account and target account for replication, and identify the connection URL.
2. Complete steps in the source account.
3. Complete steps in the target account.
4. Test failover/failback.

Note that because network policies and security integrations have different use cases, the exact steps for the source account and target
account with respect to each object differ slightly.

For details, see:

* Replicating SAML2 security integrations
* Replicating SCIM security integrations
* Replicating OAuth security integrations
* Replicating network policies
* Replicating integrations and objects for the Snowflake Connector for ServiceNow

## Replicating SAML2 security integrations

Replicating a SAML2 security integration links the source account and the target account to the identity provider by specifying the
[connection URL](client-redirect.md) in the SAML2 security integration definition.

It is important to update the identity provider to specify the connection URL and that users exist in the source account. Without these
updates, user verification cannot occur, which will result in the inability of the user to access the target account.

Current Limitation:
:   For SAML SSO to Snowflake, replicating a SAML2 security integration that specifies the connection URL is only supported on the current
    primary connection and not supported on the secondary connection. Note that for failover, the result is promoting a secondary connection
    to serve as the primary connection. After failover, SAML SSO works on the new primary connection.

    If SAML SSO is needed for both primary and secondary connections, then create and manage SAML2 security integrations independently on
    both Snowflake accounts.

For this procedure, assume the following:

* Source account: `https://example-northamericawest.snowflakecomputing.com/`
* Target account: `https://example-northamericaeast.snowflakecomputing.com/`
* Connection URL: `https://example-global.snowflakecomputing.com`
* A secondary connection does not exist in the target account.

This procedure is a representative example to do the following:

* Replicate a SAML2 security integration from the source account to the target account.
* Test failover.
* Promote the secondary connection in the source account to serve as the primary connection.

**Source account steps (includes IdP steps):**

1. If the source account is already configured for [Database Failover/Failback and Client Redirect](replication-intro.md),
   skip to the next step.

   Otherwise, enable failover using an [ALTER CONNECTION](../sql-reference/sql/alter-connection.md) command:

   > ```sqlexample
   > ALTER CONNECTION global
   > ENABLE FAILOVER TO ACCOUNTS example.northamericaeast;
   > ```
2. Using Okta as a representative example for the identity provider, create a
   [Snowflake application in Okta](https://www.okta.com/integrations/snowflake/#capabilities) that specifies the connection URL. Update
   the Okta fields as follows:

   * Label: `Snowflake`
   * Subdomain: `example-global`
   * Browser plugin auto-submit: Check the box to enable automatic login when a user lands on the login page.
3. In the source account, update the SAML2 security integration to specify the connection URL for the `saml2_snowflake_issuer_url`
   and `saml2_snowflake_acs_url` security integration properties.

   > ```sqlexample
   > CREATE OR REPLACE SECURITY INTEGRATION my_idp
   >   TYPE = saml2
   >   ENABLED = true
   >   SAML2_ISSUER = 'http://www.okta.com/exk6e8mmrgJPj68PH4x7'
   >   SAML2_SSO_URL = 'https://example.okta.com/app/snowflake/exk6e8mmrgJPj68PH4x7/sso/saml'
   >   SAML2_PROVIDER = 'OKTA'
   >   SAML2_X509_CERT = 'MIIDp...'
   >   SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'OKTA'
   >   SAML2_ENABLE_SP_INITIATED = true
   >   SAML2_SNOWFLAKE_ISSUER_URL = 'https://example-global.snowflakecomputing.com'
   >   SAML2_SNOWFLAKE_ACS_URL = 'https://example-global.snowflakecomputing.com/fed/login';
   > ```
4. In Okta, assign the Snowflake application to users. For details, see
   [Assign an app integration to a user](https://help.okta.com/en/prod/Content/Topics/Provisioning/lcm/lcm-assign-app-user.htm).
5. Verify that SSO to the source account works for users that are specified in the Snowflake application in Okta and users in the source
   account.

   Note that SSO should work for both IdP-initiated and Snowflake-initiated SSO flows. For details, see
   [Supported SSO workflows](admin-security-fed-auth-overview.md).
6. In the source account, if a failover group does not already exist, [create](../sql-reference/sql/create-failover-group.md) a failover group to
   include security integrations. Note that this example is representative and includes other account objects that might or might not be
   necessary to replicate.

   If a failover group already exists, [alter](../sql-reference/sql/alter-failover-group.md) the failover group to include integrations.

   > ```sqlexample
   > CREATE FAILOVER GROUP FG
   >   OBJECT_TYPES = users, roles, warehouses, resource monitors, integrations
   >   ALLOWED_INTEGRATION_TYPES = security integrations
   >   ALLOWED_ACCOUNTS = example.northamericaeast
   >   REPLICATION_SCHEDULE = '10 MINUTE';
   > ```

**Target Account Steps:**

1. Prior to replication, verify the number of users and security integrations that are present in the target
   account by executing the [SHOW USERS](../sql-reference/sql/show-users.md) and [SHOW INTEGRATIONS](../sql-reference/sql/show-integrations.md) commands, respectively.
2. Create a secondary connection. For details, see [CREATE CONNECTION](../sql-reference/sql/create-connection.md).

   > ```sqlexample
   > CREATE CONNECTION global AS REPLICA OF example.northamericawest.global;
   > ```
3. Create a secondary failover group in the target account. For details, see [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md).

   > ```sqlexample
   > CREATE FAILOVER GROUP fg
   > AS REPLICA OF example.northamericawest.fg;
   > ```
4. When creating a secondary failover group, an initial refresh is automatically executed.

   To manually refresh a secondary failover group in the target account, execute the following statement. For details, see
   [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg REFRESH;
   > ```
5. If the refresh operation was successful, the target account should include new users that were added to the source account and not
   previously present in the target account. Similarly, the target account should include the SAML2 security integration that specifies
   the connection URL.

   Verify the refresh operation was successful by executing the following commands:

   * [SHOW INTEGRATIONS](../sql-reference/sql/show-integrations.md) (should include 1 new integration)
   * [SHOW USERS](../sql-reference/sql/show-users.md) (should include the number of new users added)
   * [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) (for the integration `myidp`)
6. Promote the secondary connection in the target account to serve as the primary connection. After executing the following command, users
   can use SAML SSO to authenticate to the new target account.

   > ```sqlexample
   > ALTER CONNECTION global PRIMARY;
   > ```

## Replicating SCIM security integrations

Replicating a SCIM security integration allows the target account to incorporate SCIM updates that are made to the source account
(e.g. adding new users, adding new roles) after refreshing the target account.

After replicating the SCIM security integration, both Snowflake accounts have the ability to receive SCIM updates from the identity
provider. However, Snowflake allows specifying only one account as the *primary* (i.e. source) account and it is the primary account that
receives SCIM updates from the identity provider.

You can optionally designate a different account as the primary account to receive SCIM updates after replicating the SCIM integration.
Note that the target account can receive SCIM updates from the source account only after refreshing the target account.

For this procedure, assume the following:

* Source account: `https://example-northamericawest.snowflakecomputing.com/`
* Target account: `https://example-northamericaeast.snowflakecomputing.com/`
* Connection URL: `https://example-global.snowflakecomputing.com`
* A secondary connection exists in the target account (i.e. only refresh operations are needed).

This procedure is a representative example to do the following:

* Replicate a SCIM security integration from the source account to the target account.
* Add a new user in Okta, push the new user to the source account, and replicate the new user to the target account.
* Refresh the failover group.
* Promote the secondary connection in the target account to serve as the primary connection.

**Source account steps:**

1. Execute [SHOW CONNECTIONS](../sql-reference/sql/show-connections.md) to verify that the connection in the source account is the primary connection. If it
   is not the primary connection, use the [ALTER CONNECTION](../sql-reference/sql/alter-connection.md) command to promote the connection in the source
   account to serve as the primary connection.
2. If an Okta SCIM security integration is already configured in the source account, skip to the next step.

   Otherwise, configure an [Okta SCIM](scim-okta.md) security integration in the source account.

   > ```sqlexample
   > CREATE ROLE IF NOT EXISTS okta_provisioner;
   > GRANT CREATE USER ON ACCOUNT TO ROLE okta_provisioner;
   > GRANT CREATE ROLE ON ACCOUNT TO ROLE okta_provisioner;
   > GRANT ROLE okta_provisioner TO ROLE ACCOUNTADMIN;
   > CREATE OR REPLACE SECURITY INTEGRATION okta_provisioning
   >    TYPE = scim
   >    SCIM_CLIENT = 'okta'
   >    RUN_AS_ROLE = 'OKTA_PROVISIONER';
   >
   > select system$generate_scim_access_token('OKTA_PROVISIONING');
   > ```

   Be sure to update the Okta SCIM application for Snowflake. For details, see [Okta configuration](scim-okta.md).
3. In Okta, [create a new user](https://www.okta.com/integrations/snowflake/#capabilities) in the Okta application for Snowflake.

   Verify the user is pushed to Snowflake by executing a [SHOW USERS](../sql-reference/sql/show-users.md) command in Snowflake.
4. If the failover group already specifies `security integrations`, skip to the next step. This would be true if you have already
   configured the failover group for the purposes of
   SAML SSO in the target account (in this topic).

   Otherwise, modify the existing failover group using an ALTER FAILOVER GROUP command to specify `security integrations`.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg SET
   >   OBJECT_TYPES = users, roles, warehouses, resource monitors, integrations
   >   ALLOWED_INTEGRATION_TYPES = security integrations;
   > ```
5. At this point, you can optionally refresh the secondary failover group as shown in the
   target account steps for SCIM to ensure the new user in the source account is in the target
   account.

   Choosing to refresh the secondary failover group now allows for an easy check to make sure that the change to the source account, adding
   a new user in this sequence, is visible in the target account.

   However, if you need or prefer to do additional work in the identity provider, such as modifying other users or updating role
   assignments, you can continue doing that work now and then refresh the secondary failover group in one operation later.

**Target account steps:**

1. Prior to replication, verify the number of users and security integrations that are present in the target
   account by executing the [SHOW USERS](../sql-reference/sql/show-users.md) and [SHOW INTEGRATIONS](../sql-reference/sql/show-integrations.md) commands, respectively.
2. Refresh the secondary failover group to update the target account to include the new user
   (and any other changes that were made in Okta and the source account).

   > ```sqlexample
   > ALTER FAILOVER GROUP fg REFRESH;
   > ```
3. Verify that the new user is added to the target account by executing a [SHOW USERS](../sql-reference/sql/show-users.md) command.
4. Optionally, promote the secondary failover group and the secondary connection in the target account to primary. This will promote the
   target account to serve as the new source account.

   Failover group:

   > ```sqlexample
   > ALTER FAILOVER GROUP fg PRIMARY;
   > ```

   Connection:

   > ```sqlexample
   > ALTER CONNECTION global PRIMARY;
   > ```

## Replicating OAuth security integrations

Replicating OAuth security integrations includes both Snowflake OAuth security integrations and External OAuth security integrations.

Note the following:

Snowflake OAuth:
:   After replication and configuring failover/failback, a user connecting to either the source account or target account via an OAuth client
    does not need to re-authenticate to the target account.

External OAuth:
:   After replication and configuring failover/failback, a user connecting to either the source account or target account via an OAuth client
    *might* need to re-authenticate to the target account.

    Re-authentication is likely to be necessary if the OAuth authorization server is not configured to issue a refresh token. Therefore,
    ensure that the OAuth authorization server issues refresh tokens so that the OAuth client can connect to the source and target Snowflake
    accounts.

For this procedure, assume the following:

* Source account: `https://example-northamericawest.snowflakecomputing.com/`
* Target account: `https://example-northamericaeast.snowflakecomputing.com/`
* Connection URL: `https://example-global.snowflakecomputing.com`
* A secondary connection exists in the target account (i.e. only refresh operations are needed).
* The Snowflake OAuth or External OAuth security integrations already exist in the source account.

This procedure is a representative example to do the following:

* Replicate an OAuth security integration.
* Refresh the failover group.
* Promote the secondary connection in the target account to serve as the primary connection.

**Source account steps:**

1. If the failover group already specifies `security integrations`, skip to the next step. This would be true if you have already
   configured the failover group for the purposes of
   SAML SSO in the target account (in this topic) or
   SCIM (also in this topic).

   Otherwise, modify the existing failover group using an ALTER FAILOVER GROUP command to specify `security integrations`.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg SET
   >   OBJECT_TYPES = users, roles, warehouses, resource monitors, integrations
   >   ALLOWED_INTEGRATION_TYPES = security integrations;
   > ```

**Target account steps:**

1. Refresh the secondary failover group to update the target account to include the OAuth security integration objects.

   ```sqlexample
   ALTER FAILOVER GROUP fg REFRESH;
   ```
2. Verify connecting to each Snowflake account using the OAuth client of your choice.
3. Optionally, promote the secondary failover group and the secondary connection in the target account to primary. This will promote the
   target account to serve as the new source account.

   Failover group:

   > ```sqlexample
   > ALTER FAILOVER GROUP fg PRIMARY;
   > ```

   Connection:

   > ```sqlexample
   > ALTER CONNECTION global PRIMARY;
   > ```
4. If you completed the previous step, reverify that you can connect to each Snowflake account using the OAuth client of your choice.

## Replicating network policies

Replicating a network policy from the source account to the target account allows administrators to restrict access to the target account
based on the network identifier of the origin of an incoming request.

### Replicating network policy references and assignments

Replicating a network policy replicates the network policy object and any network policy references/assignments. For example, if a
network policy references a network rule in the source account, and both objects exist in the target account, then the network policy uses
the same network rule in the target account. Similarly, if a network policy is assigned to a user and the user exists in both the source and
target accounts, replicating the network policy assigns the network policy to the user in the target account.

Replicating network policy references and assignments assumes referenced objects and objects to which the network policy is assigned are
also replicated. If you do not replicate the supporting object types properly, Snowflake fails the refresh operation in the target account.

If a referenced object or object to which the network policy is assigned does not already exist in the target account, include its object
type in the same replication or failover group as the network policy. The following examples demonstrate the required settings if the
supporting objects do not already exist in the target account.

Network policies that use network rules
:   The replication or failover group must include `network policies` and `databases`. Network rules are schema-level objects
    and are replicated with the database in which they are contained. For example:

    ```sqlexample
    CREATE FAILOVER GROUP fg
       OBJECT_TYPES = network policies, databases
       ALLOWED_DATABASES = testdb2
       ALLOWED_ACCOUNTS = myorg.myaccount2;
    ```

Network policies assigned to an account
:   The replication or failover group must include `network policies` and `account parameters`. If the network policy uses
    network rules, you must also include `databases`. For example:

    ```sqlexample
    CREATE FAILOVER GROUP fg
       OBJECT_TYPES = network policies, account parameters, databases
       ALLOWED_DATABASES = testdb2
       ALLOWED_ACCOUNTS = myorg.myaccount2;
    ```

Network policies assigned to a user
:   The replication or failover group must include `network policies` and `users`. If the network policy uses network rules, you
    must also include `databases`. For example:

    ```sqlexample
    CREATE FAILOVER GROUP fg
       OBJECT_TYPES = network policies, users, databases
       ALLOWED_DATABASES = testdb2
       ALLOWED_ACCOUNTS = myorg.myaccount2;
    ```

Network policies assigned to a security integration
:   Network policy replication applies to network policies that are specified in Snowflake OAuth and SCIM
    [security integrations](../sql-reference/sql/create-security-integration.md), provided that the replication or failover group includes
    `integrations`, `security integrations` and `network policies`. If the network policy uses network rules, you must also
    include `databases`.

    > ```sqlexample
    > CREATE FAILOVER GROUP fg
    >    OBJECT_TYPES = network policies, integrations, databases
    >    ALLOWED_DATABASES = testdb2
    >    ALLOWED_INTEGRATION_TYPES = security integrations
    >    ALLOWED_ACCOUNTS = myorg.myaccount2;
    > ```

### Example

For this example, assume the following:

* Source account: `https://example-northamericawest.snowflakecomputing.com/`
* Target account: `https://example-northamericaeast.snowflakecomputing.com/`
* Connection URL: `https://example-global.snowflakecomputing.com`
* A secondary connection exists in the target account (i.e. only refresh operations are needed).
* Network policies exist in the source account.
* The Snowflake OAuth and/or SCIM security integration already exists in the source account and the integration specifies a network policy.

This procedural example does the following:

* Replicates network policies along with the network rules that is uses to restrict network traffic.
* Replicates a security integration to which the network policy is assigned.
* Refreshes the failover group.
* Verifies the network policy activation.
* Promotes the secondary connection in the source account to serve as the primary connection.

**Source account steps:**

1. Verify that network policies exist in the source Snowflake account by executing a [SHOW NETWORK POLICIES](../sql-reference/sql/show-network-policies.md)
   command.
2. Verify the Snowflake OAuth and/or SCIM security integrations include a network policy by executing a
   [SHOW INTEGRATIONS](../sql-reference/sql/show-integrations.md) command to identify the security integration and then execute a
   [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) command on the Snowflake OAuth security integration.
3. Update the failover group to include `network policies` and `account parameters` using an ALTER FAILOVER GROUP command.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg SET
   >   OBJECT_TYPES = users, roles, warehouses, resource monitors, integrations, network policies, account parameters
   >   ALLOWED_INTEGRATION_TYPES = security integrations;
   > ```

**Target account steps:**

1. Refresh the secondary failover group to update the target account to include the network policy objects and the Snowflake OAuth
   security integration that specifies the network policy.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg REFRESH;
   > ```
2. Verify the network policy object exists by executing a SHOW NETWORK POLICIES command, and verify the Snowflake OAuth security
   integration specifies the replicated network policy by executing a DESCRIBE SECURITY INTEGRATION command on the security integration.
3. Verify the network policy activation as shown in [Identify an activated network policy](network-policies.md).
4. Verify connecting to each Snowflake account using the Snowflake OAuth client of your choice.
5. Optionally promote the secondary failover group and the secondary connection in the target account to primary. This will promote the
   target account to serve as the new source account.

   Failover group:

   > ```sqlexample
   > ALTER FAILOVER GROUP fg PRIMARY;
   > ```

   Connection:

   > ```sqlexample
   > ALTER CONNECTION global PRIMARY;
   > ```
6. If you completed the previous step, reverify that you can connect to each Snowflake account using the Snowflake OAuth client of your
   choice.

## Replicating integrations and objects for the Snowflake Connector for ServiceNow

The [Snowflake Connector for ServiceNow](https://other-docs.snowflake.com/connectors/servicenow/about.html) allows Snowflake to ingest data from ServiceNow. The connector requires the following objects in
your Snowflake account:

* Secret.
* Security integration of `type = api_authentication`.
* API integration.
* Database to store the ingested data.
* Warehouse for the connector to use.
* Account roles to manage the access to these objects.

You create these objects prior to installing the connector and you can replicate these objects to the target account. After replicating
these objects, you can install the connector in the target account. The connector must be installed in the target account because the
installation depends on a share that Snowflake provides. You need to create a database from the share during the connector installation and
you cannot replicate a database that is created from a share.

Depending on how you want to manage the replication of account objects, you can have one or more replication or failover groups. A single
replication group centralizes the replication management of the objects and avoids scenarios where some objects are replicated and other
objects are not replicated. Otherwise, you must coordinate the replication operation carefully to ensure that all objects are replicated to
the target account.

For example, you can have a replication group for databases. This replication group (e.g. `rg1`) specifies the database that contains the
secret and the database to store the ServiceNow data. The other replication group (e.g. `rg2`) specifies the user, role, and integration
objects and the grants of these roles to users. In this scenario, if you replicate the integrations first and then decide to refresh the
target account to include the secret database, users, and roles, the replication refresh operation is successful.

However, if you replicate the users and roles and the database that contains the secret in a group before you replicate the integration,
then a placeholder secret is used until the security integration is replicated; the placeholder secret prevents a dangling reference. Once
the security integration is replicated, the placeholder secret is replaced with the real secret.

This procedure is a representative example to do the following:

* Replicate the integrations and the databases containing the secret and ingested data.
* Refresh the failover group.
* Promote the secondary connection in the source account to serve as the primary connection.
* Install and use the connector after replication.

For this procedure, assume the following:

* Source account: `https://example-northamericawest.snowflakecomputing.com/`
* Target account: `https://example-northamericaeast.snowflakecomputing.com/`
* Connection URL: `https://example-global.snowflakecomputing.com`
* A secondary connection exists in the target account (i.e. only refresh operations are needed).
* Other security integrations for authentication and network policies to restrict access are already replicated.

**Source account steps:**

1. Verify that the objects for the connector exist in the source Snowflake account by executing SHOW commands on each of these object types.

   > ```sqlexample
   > show secrets in database secretsdb;
   > show security integrations;
   > show api integrations;
   > show tables in database destdb;
   > show warehouses;
   > show roles;
   > ```

   Note that `secretsdb` is the name of the database that contains the secret and `destdb` is the name of the database that contains
   the ingested data from ServiceNow.
2. Update the failover group to include API integrations and the databases containing the secret and ingested data using an ALTER FAILOVER
   GROUP command.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg SET
   >   OBJECT_TYPES = databases, users, roles, warehouses, resource monitors, integrations, network policies, account parameters
   >   ALLOWED_DATABASES = secretsdb, destdb
   >   ALLOWED_INTEGRATION_TYPES = security integrations, api integrations;
   > ```

**Target account steps:**

1. Refresh the secondary failover group to replicate the integrations and databases to the target account.

   > ```sqlexample
   > ALTER FAILOVER GROUP fg REFRESH;
   > ```
2. Verify the replicated objects exist using the following SHOW commands.

   ```sqlexample
   show secrets;
   show security integrations;
   show api integrations;
   show database;
   show tables in database destdb;
   show roles;
   ```
3. Verify connecting to each Snowflake account using the method of your choice (such as Snowflake CLI, a browser, or SnowSQL).
4. Optionally promote the secondary failover group and the secondary connection in the target account to primary. This will promote the
   target account to serve as the new source account.

   Failover group:

   > ```sqlexample
   > ALTER FAILOVER GROUP fg PRIMARY;
   > ```

   Connection:

   > ```sqlexample
   > ALTER CONNECTION global PRIMARY;
   > ```
5. If you completed the previous step, reverify that you can connect to each Snowflake account.

   At this point, the target account contains the replicated objects and users can login. However, there are additional steps in the target
   account to use the connector.
6. Update the remote service associated with the API integration in the cloud platform that hosts your Snowflake account.

   For details, refer to [Updating the remote service for API integrations](account-replication-config.md).
7. Install the connector manually or with Snowsight. For details, refer to:

   * [Install the connector manually](https://other-docs.snowflake.com/connectors/servicenow/installing-sql.html)
   * [Install the connector with Snowsight](https://other-docs.snowflake.com/connectors/servicenow/installing-snowsight.html)
8. [Access the ServiceNow Data in Snowflake](https://other-docs.snowflake.com/connectors/servicenow/accessing-data.html).

---
title: Request a new Data Exchange
source: https://docs.snowflake.com/en/user-guide/data-exchange-requesting.md
section: User Guide
---

# Request a new Data Exchange

To request a new Data Exchange for your Snowflake account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and provide the following information:

* **Description**: Description of the business case to use the Snowflake Data Exchange.
* **Name for the New Data Exchange**: Unique identifier for use in SQL. The Data Exchange name cannot include spaces, or special characters, but may include the underscore (_) character. See also [Identifier requirements](../sql-reference/identifiers-syntax.md).
* **Data Exchange Display Name**: Name displayed in the web user interface and Snowsight to Data Exchange members.
* **Account URL**: URL of the account used when the Data Exchange was created.

> **Attention:**
>
> Enabling a Data Exchange for your account may take up to 2 business days. When you request a Data Exchange to be enabled, please be sure to provide Snowflake account URL. Providing incorrect or incomplete information may delay the process.

---
title: REST API for unstructured data support
source: https://docs.snowflake.com/en/user-guide/data-load-unstructured-rest-api.md
section: User Guide
---

# REST API for unstructured data support

This topic describes the REST API used to access staged files.

## `GET /api/files/`

Retrieves (downloads) a data file from an internal or external stage.

### Authentication

Authenticate to the REST API endpoint using OAuth for custom clients. Create a security integration (using
[CREATE SECURITY INTEGRATION](../sql-reference/sql/create-security-integration-oauth-snowflake.md)) to enable an HTTP client that supports OAuth (such as [cURL](https://curl.se/))
to redirect users to an authorization page and generate access tokens for access to the REST API endpoint. For information on configuring
OAuth for custom clients, see [Configure Snowflake OAuth for custom clients](oauth-custom.md).

### Usage notes

* Send the scoped URL or file URL for a staged file in the GET request.

  + Generate a scoped URL by calling the [BUILD_SCOPED_FILE_URL](../sql-reference/functions/build_scoped_file_url.md) SQL function.
  + Generate a file URL by calling the [BUILD_STAGE_FILE_URL](../sql-reference/functions/build_stage_file_url.md) SQL function. Alternatively, query the directory
    table for the stage, if available.
* Authenticate to Snowflake via the Snowflake SQL API using OAuth or key pair authentication. For instructions, see
  [Authenticating to the server](../developer-guide/sql-api/authenticating.md).
* The authorization to access files differs depending on whether a scoped URL or file URL is sent in the GET request:

  Scoped URL:
  :   Only the user who generated the scoped URL can use the URL to access the referenced file.

  File URL:
  :   Any role that has sufficient privileges on the stage can access the file:

      + External stage: USAGE
      + Internal stage: READ
* An HTTP client that sends a URL (either scoped URL or file URL) to the REST API must be configured to allow redirects.

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

### Request headers

The following request headers apply to all operations:

| Header | Description |
| --- | --- |
| `Authorization` | Set this to `Bearer`, followed by the generated OAuth token used to authenticate to Snowflake.  For more information, see [Authenticating to the Server Using OAuth](../developer-guide/sql-api/authenticating.md).  For example:  `Authorization: Bearer token` |
| `Accept` | Set this to `*/*`. |
| `User-Agent` | Set this to the name and version of your application (e.g. `applicationName/applicationVersion`). You must use a value that complies with [RFC 7231](https://tools.ietf.org/html/rfc7231#section-5.5.3). |
| `X-Snowflake-Authorization-Token-Type` | (Optional) Set this to `OAUTH`.  If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.  Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:   * `KEYPAIR_JWT` (for key-pair authentication) * `OAUTH` (for OAuth) * `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](programmatic-access-tokens.md)) |

### Example

The following Python example issues an HTTP request for client `myApplication` version 1.0:

```none
import requests
response = requests.get(url,
    headers={
      "User-Agent": "reg-tests",
      "Accept": "*/*",
      "Authorization": """Bearer {}""".format(token)
      },
    allow_redirects=True)
print(response.status_code)
print(response.content)
```

---
title: Retrieve archived data
source: https://docs.snowflake.com/en/user-guide/storage-management/storage-lifecycle-policies-retrieving-archived-data.md
section: User Guide
---

# Retrieve archived data

Read archived data by using the [CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md) command.

For example, the following statement creates a new table from archived rows where the value in the `event_timestamp` column is between
January 15 and January 20 of 2023:

```sqlexample
CREATE TABLE my_table
  FROM ARCHIVE OF my_source_table AS st
  WHERE st.event_timestamp BETWEEN '01/15/2023' AND '01/20/2023';
```

For syntax details and parameter descriptions, see [CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md)
in the [CREATE TABLE](../../sql-reference/sql/create-table.md) documentation.

> **Note:**
>
> * Using this command requires the OWNERSHIP privilege on the source table.
> * Specifying column definitions, policies, tags, or other constraints isn’t supported. Snowflake automatically retrieves
>   the table schema, policies, tags, and constraints from the source table.
> * The WHERE clause is required. Reading archived data is expensive, and should be performed infrequently.
>   Filtering results using the WHERE clause helps you minimize costs by ensuring that Snowflake reads only the data that you
>   require from archival storage.
> * To estimate the number of files that Snowflake will retrieve from archive storage, run the [EXPLAIN](../../sql-reference/sql/explain.md) command before
>   this operation. The output includes a `createTableFromArchiveData` operation and displays `ARCHIVE OF <table>` in
>   the `objects` column for the TableScan operation. For more information, see Estimate retrieval costs with EXPLAIN.
> * To see a history of data retrieval from archive storage, use the [ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY view](../../sql-reference/account-usage/archive_storage_data_retrieval_usage_history.md).
> * To retrieve data from the COLD tier of archive storage, Snowflake must first restore the files from external cloud storage. This process
>   can take up to 48 hours.
>
>   To support this process, set the following parameters appropriately:
>
>   + [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) must be at least 48 hours.
>   + [ABORT_DETACHED_QUERY](../../sql-reference/parameters.md) must be FALSE.
>
>   COLD storage tier restore operations support a maximum of 1 million files per restore operation.
> * If you cancel a CREATE TABLE operation that retrieves data from archive storage, you might still incur retrieval costs.

## View archive metadata before retrieval

Before retrieving archived data, you can inspect metadata about the archive to understand what data
is available. Use the [SYSTEM$GET_TABLE_ARCHIVE_METADATA](../../sql-reference/functions/system_get_table_archive_metadata.md)
function to view:

* Total row count in the archive
* Column data types
* Minimum and maximum values for numeric and timestamp columns

This helps you decide which data to retrieve without incurring retrieval costs.

> **Note:**
>
> The table owner or an account administrator (a user with the ACCOUNTADMIN role) who has
> access to the table can execute this function.

## Estimate retrieval costs with EXPLAIN

To estimate how many files Snowflake will retrieve from archive storage, use the [EXPLAIN](../../sql-reference/sql/explain.md) command.

The command output includes the following data:

* A `createTableFromArchiveData` operation in the `operation` column.
* `ARCHIVE OF <table>` in the `objects` column for the TableScan operation.
* The number of partitions that will be retrieved in the `assignedPartitions` column for the archive
  TableScan operation. This value indicates the number of partitions
  that Snowflake will restore from cold tier to retrieve the data from archive storage.
* The number of bytes that will be retrieved in the `bytesAssigned` column.

For example:

```sqlexample
EXPLAIN
CREATE TABLE my_table
  FROM ARCHIVE OF my_source_table AS st
  WHERE st.event_timestamp BETWEEN '01/15/2023' AND '01/20/2023';
```

---
title: Sample data sets
source: https://docs.snowflake.com/en/user-guide/sample-data.md
section: User Guide
---

# Sample data sets

Snowflake provides sample data sets, such as the industry-standard TPC-DS and TPC-H benchmarks, for evaluating and testing a broad range of Snowflake’s SQL support.

Sample data sets are provided in a database named SNOWFLAKE_SAMPLE_DATA that has been
[shared with your account](data-sharing-intro.md) from the Snowflake SFC_SAMPLES account.
If you do not see the database, you can create it yourself. Refer to [Use the sample database](sample-data-using.md).

The database contains a schema for each data set, with the sample data stored in the tables in each schema. The database and schemas
do not use any data storage so they do not incur storage charges for your account. You can execute queries on the tables in
these databases just as you would with any other databases in your account. Executing queries requires a running, current warehouse
for your session, which consumes credits.

**Next Topics:**

* [Use the sample database](sample-data-using.md)
* [Sample data: TPC-DS](sample-data-tpcds.md)
* [Sample data: TPC-H](sample-data-tpch.md)
* [Sample Data: OpenWeatherMap — *Deprecated*](sample-data-openweathermap.md)

---
title: Sample Data: OpenWeatherMap — Deprecated
source: https://docs.snowflake.com/en/user-guide/sample-data-openweathermap.md
section: User Guide
---

# Sample Data: OpenWeatherMap — *Deprecated*

[OpenWeatherMap](http://openweathermap.org/) is a repository of recent historical and forecasted weather data in JSON format. Snowflake imports this
weather data and makes it available to all Snowflake accounts free of charge so you can experiment with our unique, high-performance semi-structured
columnar functionality using real-world data.

> **Important:**
>
> The sample weather data is provided for evaluation and testing purposes. The data is updated regularly in Snowflake, but is not maintained in real-time,
> which may result in occasional lapses in updates (i.e. we do not guarantee that the data is always current and/or gap-free).
>
> As such, we do not recommend using the data in production systems.

## Tables

The data set includes the following tables, all stored in native JSON format and accumulated over time:

| Table Name | Description | JSON Description |
| --- | --- | --- |
| DAILY_14_TOTAL | 12 days of daily weather forecasts for 20,000+ cities. | Click [here](http://openweathermap.org/forecast16#JSON) |
| DAILY_16_TOTAL | 12 days of daily weather forecasts for 200,000+ cities (lower frequency of updates). | Click [here](http://openweathermap.org/forecast16#JSON) |
| HOURLY_14_TOTAL | 4 days of hourly weather forecasts for 20,000+ cities. | Click [here](http://openweathermap.org/forecast5#JSON) |
| HOURLY_16_TOTAL | 4 days of hourly weather forecasts for 200,000+ cities (lower frequency of updates). | Click [here](http://openweathermap.org/forecast5#JSON) |
| WEATHER_14_TOTAL | Recent weather for 20,000 cities. | Click [here](http://openweathermap.org/current#current_JSON) |

## Query Examples

The following query retrieves the recent high and low temperature readings for New York City, converted from celsius to fahrenheit temperatures, along with the latitude and longitude for
the readings:

> ```sqlexample
> select (V:main.temp_max - 273.15) * 1.8000 + 32.00 as temp_max_far,
>        (V:main.temp_min - 273.15) * 1.8000 + 32.00 as temp_min_far,
>        cast(V:time as TIMESTAMP) time,
>        V:city.coord.lat lat,
>        V:city.coord.lon lon,
>        V
> from snowflake_sample_data.weather.WEATHER_14_TOTAL
> where v:city.name = 'New York'
> and   v:city.country = 'US'
> order by time desc
> limit 10;
> ```

The following query compares weather forecasts to actual weather readings:

> ```sqlexample
> with
> forecast as
> (select ow.V:time         as prediction_dt,
>         ow.V:city.name    as city,
>         ow.V:city.country as country,
>         cast(f.value:dt   as timestamp) as forecast_dt,
>         f.value:temp.max  as forecast_max_k,
>         f.value:temp.min  as forecast_min_k,
>         f.value           as forecast
>  from snowflake_sample_data.weather.daily_16_total ow, lateral FLATTEN(input => V, path => 'data') f),
>
> actual as
> (select V:main.temp_max as temp_max_k,
>         V:main.temp_min as temp_min_k,
>         cast(V:time as timestamp)     as time_dt,
>         V:city.name     as city,
>         V:city.country  as country
>  from snowflake_sample_data.weather.WEATHER_14_TOTAL)
>
> select cast(forecast.prediction_dt as timestamp) prediction_dt,
>        forecast.forecast_dt,
>        forecast.forecast_max_k,
>        forecast.forecast_min_k,
>        actual.temp_max_k,
>        actual.temp_min_k
> from actual
> left join forecast on actual.city = forecast.city and
>                       actual.country = forecast.country and
>                       date_trunc(day, actual.time_dt) = date_trunc(day, forecast.forecast_dt)
> where actual.city = 'New York'
> and   actual.country = 'US'
> order by forecast_dt desc, prediction_dt desc;
> ```

---
title: Sample data: TPC-DS
source: https://docs.snowflake.com/en/user-guide/sample-data-tpcds.md
section: User Guide
---

# Sample data: TPC-DS

TPC-DS is a benchmark that models a retail product supplier’s decision support system. It has customer, order, and product data. Snowflake provides 10TB and 100TB versions of TPC-DS data for you to explore, in schemas named TPCDS_SF10TCL and TPCDS_SF100TCL, respectively, within the SNOWFLAKE_SAMPLE_DATA shared database.

As described in the [TPC Benchmark™ DS (TPC-DS)](http://www.tpc.org/TPC_Documents_Current_Versions/pdf/TPC-DS_v2.5.0.pdf) specification:

> “In order to address the enormous range of query types and user behaviors encountered by a decision support
> system, TPC-DS utilizes a generalized query model. This model allows the benchmark to capture important
> aspects of the interactive, iterative nature of on-line analytical processing (OLAP) queries, the longer-running
> complex queries of data mining and knowledge discovery, and the more planned behavior of well known report queries.”

## Add the TPC-DS data set to your account

You can access TPC-DS data sets in two ways:

* To access TPC-DS data sets that are provided by Snowflake directly, go to [Snowflake Marketplace](https://other-docs.snowflake.com/collaboration/collaboration-marketplace-about) in Snowsight.

  For more information, see Getting TPC-DS data from Snowflake Marketplace.
* To access the list of TPC-DS queries, download [`this script`](../_downloads/0eec2c68e78863a07eb994c85e76b188/tpc-ds-all-queries.sql).

## Database entities, relationships, and characteristics

The TPC-DS data set consists of 7 fact tables and 17 dimensions in the following schemas:

* TPCDS_SF100TCL: The 100 TB (*scale factor* 100,000) version represents 100 million customers and over 500,000 items stored, with sales data spanning 3 channels — stores, catalogs,
  and the web — covering a period of 5 years. The largest table, STORE_SALES, contains nearly 300 billion rows, and the fact tables contain over 560 billion rows in total.
* TPCDS_SF10TCL: The 10 TB (scale factor 10,000) version represents 65 million customers and over 400,000 items stored, with sales data spanning 3 channels — stores, catalogs, and
  the web — covering a period of 5 years. The largest table, STORE_SALES, contains nearly 29 billion rows, and the fact tables contain over 56 billion rows in total.

The relationships between facts and dimensions are represented through joins on surrogate keys. The detailed relationships are too numerous to display here, but can be found in the TPC-DS specification.

## Query definitions

TPC-DS contains a set of 99 queries with wide variation in complexity and range of data scanned. Each TPC-DS query asks a business question and includes the corresponding query to
answer the question. We have generated samples of all 99 TPC-DS queries for you to explore. Alternatively, you can use the tools in the TPC-DS Benchmark Kit to generate many different
versions of these queries that vary by parameter values.

Below, we describe just one of the queries. More information about TPC-DS and all the queries involved can be found in the official TPC-DS specification.

The [`TPC-DS script`](../_downloads/0eec2c68e78863a07eb994c85e76b188/tpc-ds-all-queries.sql), provided by Snowflake, contains the full list of TPC-DS queries. You can save the file to your local file system for reference.

### An example: Catalog sales call center outliers (Q57)

This query looks at a year’s worth of CATALOG_SALES table data and reveals the categories and brands where sales in a month vary more than 10% from average for a given call center.

#### Business question

Find the item brands and categories for each call center and their monthly sales figures for a specified year, where the monthly sales figure deviated
more than 10% of the average monthly sales for the year, sorted by deviation and call center. Report the sales deviation from the previous and following months.

#### Functional query definition

The query lists the following totals:

* Extended price
* Discounted extended price
* Discounted extended price plus tax
* Average quantity
* Average extended price
* Average discount

These aggregates are grouped by RETURNFLAG and LINESTATUS and are listed in ascending order of RETURNFLAG and LINESTATUS. A count of the number of line items in each group is included:

> ```sqlexample
> use schema snowflake_sample_data.tpcds_sf10Tcl;
>
> -- QID=TPC-DS_query57
>
> with
> v1 as(
>   select i_category, i_brand,
>          cc_name,
>          d_year, d_moy,
>          sum(cs_sales_price) sum_sales,
>          avg(sum(cs_sales_price)) over
>            (partition by i_category, i_brand, cc_name, d_year) avg_monthly_sales,
>          rank() over (partition by i_category, i_brand, cc_name order by d_year, d_moy) rn
>   from item, catalog_sales, date_dim, call_center
>   where cs_item_sk = i_item_sk and
>         cs_sold_date_sk = d_date_sk and
>         cc_call_center_sk= cs_call_center_sk and
>           (
>             d_year = 2001 or
>           ( d_year = 2001-1 and d_moy =12) or
>           ( d_year = 2001+1 and d_moy =1)
>           )
>   group by i_category, i_brand,
>            cc_name , d_year, d_moy),
> v2 as(
>   select v1.i_brand
>     ,v1.d_year, v1.d_moy
>     ,v1.avg_monthly_sales
>     ,v1.sum_sales, v1_lag.sum_sales psum, v1_lead.sum_sales nsum
>   from v1, v1 v1_lag, v1 v1_lead
>   where v1.i_category = v1_lag.i_category and
>     v1.i_category = v1_lead.i_category and
>     v1.i_brand = v1_lag.i_brand and
>     v1.i_brand = v1_lead.i_brand and
>     v1.cc_name = v1_lag. cc_name and
>     v1.cc_name = v1_lead. cc_name and
>     v1.rn = v1_lag.rn + 1 and
>     v1.rn = v1_lead.rn - 1)
> select  *
> from v2
> where d_year = 2001 and
>   avg_monthly_sales > 0 and
>   case when avg_monthly_sales > 0
>        then abs(sum_sales - avg_monthly_sales) / avg_monthly_sales
>        else null
>        end > 0.1
> order by sum_sales - avg_monthly_sales, nsum
> limit 100;
> ```

## Getting TPC-DS data from Snowflake Marketplace

You can access TPC-DS data directly by going to Snowflake Marketplace in Snowsight. You can create and query your own instance of the following data sets:

* TPC-DS 10 TB (standard table format)
* TPC-DS 10 TB Managed Iceberg ([Iceberg table format](tables-iceberg.md))

The data in the Managed Iceberg data set is physically stored in Iceberg format, rather than the Snowflake proprietary table format. You can get both data sets and compare the behavior of the two formats.

To get these data sets:

1. [Search for TPC-DS](https://app.snowflake.com/marketplace/data-products/search?search=tpc-ds) in Snowflake Marketplace. (Log in to Snowsight if prompted.)
2. Select one of the TPC-DS data sets.
3. Select Get.

   Request access from your administrator if necessary. Your login role might not have access to these data sets.
4. Under Options, give your TPC-DS database a user-defined name and select the role that you will use to access it. Alternatively, proceed with the default selections.
5. Select Get it for Free.

   In a few seconds, you should see the following pop-up window, which indicates that your instance of the TPC-DS database has been created and is available to inspect and query.
6. Select Query Data.
7. Query the data in the database, using either the worksheet provided or the [`TPC-DS script`](../_downloads/0eec2c68e78863a07eb994c85e76b188/tpc-ds-all-queries.sql), which contains all of the queries.

If you have already used Get to create one of these databases, you can go to it by selecting Open on the Marketplace [search results](https://app.snowflake.com/marketplace/data-products/search?search=tpc-ds).

---
title: Sample data: TPC-H
source: https://docs.snowflake.com/en/user-guide/sample-data-tpch.md
section: User Guide
---

# Sample data: TPC-H

As described in the [TPC Benchmark™ H (TPC-H)](http://www.tpc.org/tpch/) specification:

> “TPC-H is a decision support benchmark. It consists of a suite of business-oriented ad hoc queries and concurrent data modifications. The queries and the data populating the database have been chosen
> to have broad industry-wide relevance. This benchmark illustrates decision support systems that examine large volumes of data, execute queries with a high degree of complexity, and give answers to
> critical business questions.”

## Database and schemas

TPC-H comes with various data set sizes to test different scaling factors. For demonstration purposes, we’ve shared four versions of the TPC-H data. The data is provided in the following schemas in the
SNOWFLAKE_SAMPLE_DATA shared database:

* TPCH_SF1: Consists of the base row size (several million elements).
* TPCH_SF10: Consists of the base row size x 10.
* TPCH_SF100: Consists of the base row size x 100 (several hundred million elements).
* TPCH_SF1000: Consists of the base row size x 1000 (several billion elements).

## Database entities, relationships, and characteristics

The components of TPC-H consist of eight separate and individual tables (the Base Tables). The relationships between columns in these tables are illustrated in the following ER diagram:

(source: [TPC Benchmark H Standard Specification](http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.17.1.pdf))

## Query definitions

Each TPC-H query asks a business question and includes the corresponding query to answer the question. Some of the TPC-H queries are included in Snowflake’s Get Started tutorials.

This section describes one of the queries. For more information about TPC-H and all the queries that are involved, see the official
[TPC Benchmark H Standard Specification](http://www.tpc.org/tpc_documents_current_versions/pdf/tpc-h_v2.17.1.pdf).

### Q1: Pricing summary report query

This query reports the amount of business that was billed, shipped, and returned.

#### Business question

The Pricing Summary Report Query provides a summary pricing report for all line items that were shipped as of a given date. The date is within 60-120 days of the greatest ship date contained in the database.

#### Functional query definition

The query lists totals for extended price, discounted extended price, discounted extended price plus tax, average quantity, average extended price, and average discount. These aggregates are grouped by
RETURNFLAG and LINESTATUS, and listed in ascending order of RETURNFLAG and LINESTATUS. A count of the number of line items in each group is included:

> ```sqlexample
> use schema snowflake_sample_data.tpch_sf1;   -- or snowflake_sample_data.{tpch_sf10 | tpch_sf100 | tpch_sf1000}
>
> select
>        l_returnflag,
>        l_linestatus,
>        sum(l_quantity) as sum_qty,
>        sum(l_extendedprice) as sum_base_price,
>        sum(l_extendedprice * (1-l_discount)) as sum_disc_price,
>        sum(l_extendedprice * (1-l_discount) * (1+l_tax)) as sum_charge,
>        avg(l_quantity) as avg_qty,
>        avg(l_extendedprice) as avg_price,
>        avg(l_discount) as avg_disc,
>        count(*) as count_order
>  from
>        lineitem
>  where
>        l_shipdate <= dateadd(day, -90, to_date('1998-12-01'))
>  group by
>        l_returnflag,
>        l_linestatus
>  order by
>        l_returnflag,
>        l_linestatus;
> ```

---
title: Schedule runs of dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-schedule-project-execution.md
section: User Guide
---

# Schedule runs of dbt Projects on Snowflake

You can use Snowflake [tasks](../tasks-intro.md) to schedule runs of dbt Projects on Snowflake with the [EXECUTE DBT PROJECT](../../sql-reference/sql/execute-dbt-project.md) command. You can use a workspace for dbt Projects on Snowflake to quickly create and schedule a user-managed task. You can also use SQL commands to create a task. If a workspace is connected to a dbt project object, from within the workspace, you can view all tasks that run the EXECUTE DBT PROJECT command for that project.

You must create a task that runs the EXECUTE DBT PROJECT command in the same database and schema as the dbt project object.

> **Note:**
>
> Serverless tasks can’t be used to run dbt Projects. You must specify a user-managed warehouse when creating a task that executes the EXECUTE DBT PROJECT command.

## Create a task from within a workspace

When you create a schedule from within a workspace for dbt Projects on Snowflake, Snowflake creates the schedule by creating a user-managed task that is saved in the same database and schema as the dbt project object. The task runs with the privileges of the task owner, but task runs are not associated with a user.

**To create a task that schedules execution of a dbt project object from within a workspace:**

1. From the dbt project menu in the upper right of the workspace editor, under Scheduled runs, choose Create schedule.
2. In the Schedule a dbt run dialog box, do the following:

   * For Schedule name, enter a name for the task.
   * For Frequency, choose a frequency that ranges from Hourly to Monthly with an at qualifier, or choose Custom and enter a Cron expression. For more information about scheduling tasks, see [SCHEDULE = ...](../../sql-reference/sql/create-task.md) in the CREATE TASK command reference.
   * Under dbt properties:

     + For Operation, select the dbt command that you want to execute on a schedule. For a list of supported commands, see [Supported dbt commands and flags](dbt-projects-on-snowflake-supported-commands.md).
     + For Profile, select one of the profiles defined in the `profiles.yml` file of your dbt project.
     + For Additional flags, enter any additional [command-line options](https://docs.getdbt.com/reference/global-configs/about-global-configs#available-flags) for the dbt command.
3. Choose Create.

   Snowflake creates a task that runs an EXECUTE DBT PROJECT command using the parameters you specify.

## Viewing a task from within a workspace

From within workspace for dbt Projects on Snowflake, you can view all tasks in the database and schema that EXECUTE DBT PROJECT on the dbt project object that is connected to a workspace. You can choose a task to view its details in the object explorer, including the task definition, the run history of the task, and the task graph.

**To view tasks associated with a dbt project object from within a workspace:**

* From the dbt project menu, select View schedules and then choose your schedule (task) from the list.

  > The Task Details for the task opens in the object explorer. Task details, the SQL statement that comprises the task definition, and the privileges granted on the task object are shown.
  >
  > Choose the Run History tab to view the task run history, or choose the Task Graph tab to view the relationship of this task to other tasks in a [task graph](../tasks-graphs.md), if applicable.
  >
  > For more information, see [View tasks and task graphs in Snowsight](../ui-snowsight-tasks.md).

## Create a task using SQL

You can use the [CREATE TASK](../../sql-reference/sql/create-task.md) command to create tasks that run the EXECUTE DBT PROJECT command. Using SQL to create tasks that execute different dbt commands with different dbt CLI options provides a powerful way to orchestrate dbt deployments in Snowflake.

The following SQL example creates a task for a production dbt target that executes a dbt `run` command on a six-hour interval.

```sqlexample
CREATE OR ALTER TASK my_database.my_schema.run_dbt_project
  WAREHOUSE = my_warehouse
  SCHEDULE = '6 hours'
AS
  EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project args='run --target prod';
```

Then, the following SQL creates a task that executes the dbt `test` command after each completion of the previous `run_dbt_project` task.

```sqlexample
CREATE OR ALTER TASK change_this.public.test_dbt_project
        WAREHOUSE = my_warehouse
        AFTER run_dbt_project
AS
  EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project args='test --target prod';
```

---
title: Schema detection and evolution for Kafka connector with Snowpipe Streaming classic
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md
section: User Guide
---

# Schema detection and evolution for Kafka connector with Snowpipe Streaming classic

The Kafka connector with Snowpipe Streaming supports schema detection and evolution. The structure of tables in Snowflake can be defined and evolved automatically to support the structure of new Snowpipe streaming data loaded by the Kafka connector.

Without schema detection and evolution, the Snowflake table loaded by the Kafka connector only consists of two VARIANT columns, RECORD_CONTENT and RECORD_METADATA. With schema detection and evolution enabled, Snowflake can detect the schema of the streaming data and load data into tables that automatically match any user-defined schema. Snowflake also allows adding new columns or dropping the NOT NULL constraint from columns missing in new data files.

> **Note:**
>
> This feature only works with Kafka connector with Snowpipe Streaming. It doesn’t support Kafka connector with file-based Snowpipe.

## Prerequisites

Before enabling this feature, make sure to set up the following prerequisites.

> * Download Kafka connector version 2.0.0 or later. For more information, see [Installing and configuring the Kafka connector](../kafka-connector-install.md).
> * Use the [ALTER TABLE](../../sql-reference/sql/alter-table.md) command to set the `ENABLE_SCHEMA_EVOLUTION` parameter to TRUE on the table. You must also use a role that has the OWNERSHIP privilege on the table. For more information, see [Enable automatic table schema evolution](../data-load-schema-evolution.md).

## Configure required Kafka properties

Configure the following required properties in your Kafka connector properties file:

`snowflake.ingestion.method`
:   Specify to use `SNOWPIPE_STREAMING` to load your Kafka topic data. Note that this feature doesn’t support `SNOWPIPE`.

`snowflake.enable.schematization`
:   Specify to `TRUE` to enable schema detection and evolution for Kafka Connector with Snowpipe Streaming. The default value is `FALSE`.

    When this property is set to `TRUE`,

    * For any new tables that are created by Kafka connector, the table parameter `ENABLE_SCHEMA_EVOLUTION` is automatically set to `TRUE`.
    * For any existing tables, you still need to manually set the table parameter `ENABLE_SCHEMA_EVOLUTION` to `TRUE`.

`schema.registry.url`
:   Specify to the URL of the schema registry service. The default value is empty.

    Depending on the file format, `schema.registry.url` is required or optional. Schema detection with Kafka connector is supported in either of the scenarios below:

    > * Schema registry is required for Avro and Protobuf. The column is created with the data types defined in the provided schema registry.
    > * Schema registry is optional for JSON. If there is no schema registry, the data type will be inferred based on the data provided.

Configure additional properties in your Kafka connector properties file as usual. For more information, see [Configuring the Kafka connector](../kafka-connector-install.md).

## Converters

Structured data converters, such as Json, Avro and Protobuf, are supported.
Note that we have only tested the following structured data converters:

* `io.confluent.connect.avro.AvroConverter`
* `io.confluent.connect.protobuf.ProtobufConverter`
* `org.apache.kafka.connect.json.JsonConverter`
* `io.confluent.connect.json.JsonSchemaConverter`

Any unstructured data converters are not supported with schematization. For example,

* `org.apache.kafka.connect.converters.ByteArrayConverter`
* `org.apache.kafka.connect.storage.StringConverter`

Snowflake converters are not supported with Snowpipe Streaming. Some customized data converters are untested and may also not be supported.

## Usage notes

* Schema detection with Kafka connector is supported with or without a provided schema registry. If using schema registry (Avro and Protobuf), the column will be created with the data types defined in the provided schema registry. If there is no schema registry (JSON), the data type will be inferred based on the data provided.
* Schema evolution with Kafka connector supports the following table column modifications:

  > + Adding new columns
  > + Dropping NOT NULL constraint if the source data column is missing.
* If Kafka connector creates the target table, schema evolution is enabled by default. However, if schema evolution is disabled for an existing table then Kafka connector will try to send the rows with mismatched schemas to the configured dead-letter queues (DLQ).
* JSON ARRAY is not supported for further schematization.
* For the Kafka connector with Snowpipe Streaming, schema evolution is not tracked by the `SchemaEvolutionRecord` output in the following views and commands: [INFORMATION_SCHEMA COLUMNS View](../../sql-reference/info-schema/columns.md), [ACCOUNT_USAGE COLUMNS View](../../sql-reference/account-usage/columns.md), [DESCRIBE TABLE command](../../sql-reference/sql/desc-table.md), and [SHOW COLUMNS command](../../sql-reference/sql/show-columns.md). The `SchemaEvolutionRecord` output always shows NULL.

## Examples

The following examples demonstrate the tables that are created before and after the schema detection and evolution are enabled for Kafka connector with Snowpipe Streaming.

> ```sqlexample
> -- Before schema detection and evolution is enabled, the table only consists of two VARIANT columns, RECORD_CONTENT and RECORD_METADATA, as the following example demonstrates.
> +------+---------------------------------------------------------+---------------------------------------------------+
> | Row  | RECORD_METADATA                                         | RECORD_CONTENT                                    |
> |------+---------------------------------------------------------+---------------------------------------------------|
> | 1    |{"CreateTime":1669074170090, "headers": {"current.iter...| "account": "ABC123", "symbol": "ZTEST", "side":...|
> | 2    |{"CreateTime":1669074170400, "headers": {"current.iter...| "account": "XYZ789", "symbol": "ZABZX", "side":...|
> | 3    |{"CreateTime":1669074170659, "headers": {"current.iter...| "account": "XYZ789", "symbol": "ZTEST", "side":...|
> | 4    |{"CreateTime":1669074170904, "headers": {"current.iter...| "account": "ABC123", "symbol": "ZABZX", "side":...|
> | 5    |{"CreateTime":1669074171063, "headers": {"current.iter...| "account": "ABC123", "symbol": "ZTEST", "side":...|
> +------+---------------------------------------------------------+---------------------------------------------------|
>
> -- After schema detection and evolution is enabled, the table contains the columns that match the user-defined schema. The table can also automatically evolve to support the structure of new Snowpipe streaming data loaded by the Kafka connector.
> +------+---------------------------------------------------------+---------+--------+-------+----------+
> | Row  | RECORD_METADATA                                         | ACCOUNT | SYMBOL | SIDE  | QUANTITY |
> |------+---------------------------------------------------------+---------+--------+-------+----------|
> | 1    |{"CreateTime":1669074170090, "headers": {"current.iter...| ABC123  | ZTEST  | BUY   | 3572     |
> | 2    |{"CreateTime":1669074170400, "headers": {"current.iter...| XYZ789  | ZABZX  | SELL  | 3024     |
> | 3    |{"CreateTime":1669074170659, "headers": {"current.iter...| XYZ789  | ZTEST  | SELL  | 799      |
> | 4    |{"CreateTime":1669074170904, "headers": {"current.iter...| ABC123  | ZABZX  | BUY   | 2033     |
> | 5    |{"CreateTime":1669074171063, "headers": {"current.iter...| ABC123  | ZTEST  | BUY   | 1558     |
> +------+---------------------------------------------------------+---------+--------+-------+----------|
> ```

---
title: SCIM API references
source: https://docs.snowflake.com/en/user-guide/scim-api-references.md
section: User Guide
---

# SCIM API references

Snowflake provides the following SCIM APIs, which allow identity providers to make requests to Snowflake:

* [User API](scim-user-api-reference.md): Allows identity providers to do the following actions:

  + Check if users exist.
  + Get details about users.
  + Create and activate users.
  + Update user attributes.
  + Delete and activate users.
* [Group API](scim-group-api-reference.md): Allows identity providers to do the following actions:

  + Get details about groups.
  + Create groups.
  + Update groups.
  + Delete groups.

For additional examples, see the [Postman collection](https://documenter.getpostman.com/view/5462540/S1Lzx6gY?version=latest#intro).

---
title: SCIM group API reference
source: https://docs.snowflake.com/en/user-guide/scim-group-api-reference.md
section: User Guide
---

# SCIM group API reference

You can use the SCIM group API to access, create, and modify roles.

Snowflake uses SCIM to import roles from Okta, Azure AD and custom-built applications. The roles in these identity providers map one-to-one
with Snowflake roles.

Roles, sometimes called groups, are a collection of access privileges. To access securable objects in Snowflake, privileges must be assigned
to roles, and roles are assigned to other roles or users.

Access permissions and rights that are granted to the role are automatically inherited by every member, such as a user, of the role. For
more information, see [Overview of Access Control](security-access-control-overview.md).

A user’s access requirements to Snowflake can change. For example, a user can change from being an individual contributor to a manager in
their organization, which may require their role in Snowflake to change, or they may require access to data sets only available to managers.

As the user’s role changes in the identity provider, their access to Snowflake automatically changes when their organization role maps to the
corresponding Snowflake role.

## HTTP headers

The Snowflake SCIM API uses bearer tokens for HTTP authentication.

Each HTTP request to the Snowflake SCIM API allows the following HTTP headers:

| Header | Value |
| --- | --- |
| `Authorization` (Required) | `Bearer <access_token>` |
| `Content-Type` | `application/scim+json` |
| `Accept-Encoding` | `utf-8` |
| `Accept-Charset` | `utf-8` |

## Group attributes

You can specify group (that is, a role) attributes in the body of the API requests as key-value pairs in JSON format. These pairs contain
information about the group, such as the group’s display name. Identity providers can specify their own key names for each attribute.

Snowflake supports the following SCIM attributes for role lifecycle management. Attributes are writable unless otherwise noted.

| SCIM Group Attribute | Snowflake Group Attribute | Type | Description |
| --- | --- | --- | --- |
| `id` | `id` | String | The immutable, unique identifier (GUID) of the role in Snowflake.  Snowflake does not return this value.  You can find this value by calling the Information Schema table function [REST_EVENT_HISTORY](../sql-reference/functions/rest_event_history.md). Check the IdP logs to ensure the values match. |
| `displayName` | `name` | String | The text shown in the user interface when referring to the group. |
| `members.value` | N/A | String | The `id` of the user who is a member of the role. |
| `schemas` | N/A | String | An array of strings to indicate the namespace URIs. For example, `urn:ietf:params:scim:schemas:core:2.0:Group`. |

## Get details about a group by displayName

Method and endpoint:
:   `GET /scim/v2/Groups?filter=displayName eq "{{group_name}}"`

Description:
:   Returns details about a group associated with the `displayName` query parameter.

    Returns the HTTP response status code `200` if the HTTP request successfully completed.

## Get details about a group by groupId

Method and endpoint:
:   `GET /scim/v2/Groups/{{group_id}}`

Description:
:   Returns details about a group associated with the `group_id` path parameter.

    Returns the HTTP response status code `200` if the HTTP request successfully completed.

## Create a group

Method and endpoint:
:   `POST /scim/v2/Groups`

Description:
:   Creates a new group in Snowflake.

    Returns the HTTP response status code `201` if the HTTP request successfully completed.

Examples:
:   Create a group with the `displayName` `scim_test_group2`:

    ```sqljson
    {
      "schemas": ["urn:ietf:params:scim:schemas:core:2.0:Group"],
      "displayName":"scim_test_group2"
    }
    ```

## Update a group

Method and endpoint:
:   `PATCH /scim/v2/Groups/{{group_id}}`

Description:
:   Updates the display name attribute or group membership of the group associated with the `group_id` path parameter.

    You must set `op` to `add` or `replace` to perform this HTTP request.

    Returns a `200` or `204` HTTP response status code if the HTTP request successfully completed. A `200` status code indicates the
    SCIM client is Okta.

Examples:
:   Update a group `displayName`, remove a member and add a member:

    ```sqljson
    {
      "schemas": ["urn:ietf:params:scim:api:messages:2.0:PatchOp"],
      "Operations": [{
        "op": "replace",
        "value": { "displayName": "updated_name" }
      },
      {
        "op" : "remove",
        "path": "members[value eq \"user_id_1\"]"
      },
      {
        "op": "add",
        "value": [{ "value": "user_id_2" }]
      }]
    }
    ```

## Delete a group

Method and endpoint:
:   `DELETE /scim/v2/Groups/{{group_id}}`

Description:
:   Deletes the group associated with the `group_id` path parameter.

---
title: SCIM security integrations
source: https://docs.snowflake.com/en/user-guide/scim-security-integrations.md
section: User Guide
---

# SCIM security integrations

Snowflake supports SCIM integration with the following identity providers to provision, manage, and synchronize users and groups in Snowflake:

* [Okta](scim-okta.md)
* [Microsoft Azure Active Directory](scim-azure.md)
* [Custom integrations](scim-custom.md)

> **Note:**
>
> You can use custom SCIM integrations with identity providers that do not have a dedicated integration to provision, manage, and
> synchronize users and groups in Snowflake.
>
> You should use custom SCIM integrations for identity providers that are neither Okta nor Microsoft Azure AD.

---
title: SCIM user API reference
source: https://docs.snowflake.com/en/user-guide/scim-user-api-reference.md
section: User Guide
---

# SCIM user API reference

You can use the SCIM user API to access, create, and modify user data.

## HTTP headers

The Snowflake SCIM API uses bearer tokens for HTTP authentication.

Each HTTP request to the Snowflake SCIM API allows the following HTTP headers:

| Header | Value |
| --- | --- |
| `Authorization` (Required) | `Bearer <access_token>` |
| `Content-Type` | `application/scim+json` |
| `Accept-Encoding` | `utf-8` |
| `Accept-Charset` | `utf-8` |

## User attributes

You can specify user attributes in the body of the API requests as key-value pairs in JSON format. These pairs contain information about the
user, such as the user’s display name or their email address. Identity providers can specify their own key names for each attribute. For
example, identity providers can use the key `lastName`, instead of `familyName`, to represent the user’s last name. Snowflake
does not support multi-value user attributes.

> **Important:**
>
> In the table below, the attributes `userName` and `loginName` both refer to the attribute `userName`. Snowflake
> supports specifying different values for the `userName` and `loginName` attributes.

Snowflake supports the following attributes for user lifecycle management:

| SCIM User Attribute | Snowflake User Attribute | Type | Description |
| --- | --- | --- | --- |
| `id` | `ID` | string | The immutable, unique identifier (GUID) of the user in Snowflake.  Snowflake does not return this value in the [DESCRIBE USER](../sql-reference/sql/desc-user.md) or [SHOW USERS](../sql-reference/sql/show-users.md) output.  For requests on endpoints that require this attribute, such as `PATCH scim/v2/Users/{{id}}`, the `id` attribute can be found using the [REST_EVENT_HISTORY](../sql-reference/functions/rest_event_history.md) function. Check the IdP logs to ensure the values match. |
| `userName` | `NAME`, `LOGIN_NAME` | string | The identifier used to login into Snowflake. For more information about these attributes, see [CREATE USER](../sql-reference/sql/create-user.md). |
| `name.givenName` | `FIRST_NAME` | string | The first name of the user. |
| `name.familyName` | `LAST_NAME` | string | The last name of the user. |
| `emails` | `EMAIL` | string | The email address of the user. |
| `displayName` | `DISPLAY_NAME` | string | The text shown in the user interface when referring to the user. |
| `externalID` | N/A | string | The unique identifier set by the provisioning client (e.g. Azure, Okta). |
| `password` | `PASSWORD` | string | The password for the user.  This value is not returned in the JSON response.  If the `SYNC_PASSWORD` property in the SCIM security integration is set to `FALSE`, and the SCIM API request specifies the `password` attribute, Snowflake ignores the value for the `password` attribute. All other attributes in the API request are processed normally. |
| `active` | `DISABLED` | boolean | Disables the user when set to `false`. |
| `groups` | N/A | string | A list of groups to which the user belongs. The group `displayName` is required.  The user’s groups are immutable and their membership must be updated using the [SCIM groups API](scim-group-api-reference.md). |
| `meta.created` | `CREATED_ON` | string | The time the user was added to Snowflake. |
| `meta.lastModified` | `UPDATED_ON` | string | The time the user was last modified in Snowflake. |
| `meta.resourceType` | N/A | string | The type of resource for the user. You should use `user` as a value for this attribute. |
| `schemas` | N/A | string | A comma-separated array of strings specifying the namespace URIs. Snowflake supports the following values:   * `urn:ietf:params:scim:schemas:core:2.0:User` * `urn:ietf:params:scim:schemas:extension:enterprise:2.0:User` * `urn:ietf:params:scim:schemas:extension:2.0:User` |

## Custom attributes

You can set custom attributes that are not defined by [RFC 7643](https://datatracker.ietf.org/doc/html/rfc7643), such as
`defaultRole`.

You can use the following namespaces to set custom attributes when making POST, PUT, and PATCH requests:

`urn:ietf:params:scim:schemas:extension:enterprise:2.0:User`
:   This namespace was part of the original SCIM implementation in Snowflake. You can only use this namespace for setting custom attributes in
    [Okta SCIM security integrations](scim-okta.md).

    You cannot use this namespace to set custom attributes in [Microsoft Azure SCIM security integrations](scim-azure.md) or
    [custom SCIM integrations](scim-custom.md).

`urn:ietf:params:scim:schemas:extension:2.0:User`
:   You can use this namespace to set custom attributes for all SCIM integrations. You must use this namespace for setting custom attributes
    in [Microsoft Azure SCIM security integrations](scim-azure.md) or
    [Custom SCIM security integrations](scim-custom.md).

Snowflake supports the following custom attributes:

| SCIM User Custom Attribute | Snowflake User Attribute | Type | Description |
| --- | --- | --- | --- |
| `allowedInterfaces` | `ALLOWED_INTERFACES` | string | Defines which Snowflake interfaces the user can access. Specified as a comma-delimited list of interfaces. For a list of possible interfaces, see [CREATE USER](../sql-reference/sql/create-user.md). If a value other than `ALL` is specified, then users can only access the interface specified and cannot interact with any Snowflake data outside of the interface specified. |
| `defaultWarehouse` | `DEFAULT_WAREHOUSE` | string | The virtual warehouse that is active by default for the user’s session upon login. |
| `defaultRole` | `DEFAULT_ROLE` | string | The primary role that is active by default for the user’s session upon login. |
| `defaultSecondaryRoles` | `DEFAULT_SECONDARY_ROLES` | string | The list of secondary roles that are active for the user’s session upon login. You can set this attribute to `ALL` as an alias for `('ALL')`, or you can set this attribute to `NONE` or `""` as an alias for `()`. |
| `type` | `TYPE` | string | The type of user. Default: `person`. You can set this attribute to `person`, `service`, `legacy_service`, or `null`. For more information about types of users, see [Types of users](admin-user-management.md). |
| `snowflakeTags` | `SNOWFLAKE_TAGS` | string | Assign or update [tag values](object-tagging/introduction.md) associated with a user. Specify a comma-separated list of tags and tag values that are to be provisioned. These tags must already exist in the account and the integration must be properly provisioned. The format of each entry is as follows: `database name.schema name.tag name:tag value`. Each entry must point to a different tag. Each entry may optionally be surrounded by `[ ]` (square brackets). Here is an example of creating two tags and assigning values to them:  `[cost_management.tags.cost_center:finance,cost_management.tags.type:PowerUser]` |

### Usage notes for TYPE attribute

> **Note:**
>
> The LEGACY_SERVICE type is being deprecated. Use the SERVICE type for services and applications. For a timeline of the deprecation of
> LEGACY_SERVICE, see [Planning for the deprecation of single-factor password sign-ins](security-mfa-rollout.md).

This list describes the effects of setting the TYPE attribute in the following SCIM requests:

* `POST` request to create a user, and the `type` attribute is unspecified or `NULL`, the `TYPE` property is set to
  `PERSON`.
* `PATCH` request with a replace operation that specifies the `type` attribute as `NULL`, `TYPE` property doesn’t
  change.
* `PUT` request with a replace operation, and the `type` attribute is unspecified or `NULL`, the `TYPE` property is set to
  `PERSON`.
* `PATCH` request with a remove operation that unsets the `type` attribute, the `TYPE` property doesn’t change.

## Check if a user exists

Method and endpoint:
:   `GET /scim/v2/Users?filter=userName eq "{{user_name}}"`

Description:
:   Returns details about a user associated with the `userName` query parameter.

    Returns the HTTP response status code `200` if the HTTP request successfully completed.

## Get details about a user

Method and endpoint:
:   `GET /scim/v2/Users/{{user_id}}`

Description:
:   Returns details about a user associated with the `user_id` path parameter.

    Returns the HTTP response status code `200` if the HTTP request successfully completed.

## Create a user

Method and endpoint:
:   `POST /scim/v2/Users`

Description:
:   Creates a user in Snowflake.

    Returns the HTTP response status code `201` if the HTTP request successfully completed.

    If the user already exists or the HTTP request failed for a different reason, then Snowflake returns the HTTP response status code
    `409`.

Examples:
:   Create a user with `userName` and `loginName` mapped to the same value:

    ```json
    {
      "schemas": [
        "urn:ietf:params:scim:schemas:core:2.0:User",
        "urn:ietf:params:scim:schemas:extension:2.0:User"
      ],
      "userName": "test_user_1",
      "password": "test",
      "name": {
        "givenName": "test",
        "familyName": "user"
      },
      "emails": [
        {"value": "test.user@example.com"}
      ],
      "displayName": "test user",
      "active": true
    }
    ```

    Create a user with `userName` and `loginName` mapped to different values:

    > **Note:**
    >
    > If you use Okta as your identity provider, follow this [procedure](scim-okta.md).

    ```json
    {
      "active": true,
      "displayName": "test user",
      "emails": [
        {"value": "test.user@example.com"}
      ],
      "name": {
        "familyName": "test_last_name",
        "givenName": "test_first_name"
      },
      "password": "test_password",
      "schemas": [
        "urn:ietf:params:scim:schemas:core:2.0:User",
        "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User"
      ],
      "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User": {
        "snowflakeUserName": "TEST_USER"
      },
      "userName": "test.user@example.com"
    }
    ```

    Create a user and assign two tags: `cost_center = finance` and `type = PowerUser`:

    ```json
    {
      "active": true,
      "displayName": "test user",
      "emails": [
        {"value": "test.user@snowflake.com"}
      ],
      "name": {
        "familyName": "test_last_name",
        "givenName": "test_first_name"
      },
      "password": "test_password",
      "schemas": [
        "urn:ietf:params:scim:schemas:core:2.0:User",
        "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User"
      ],
      "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User": {
        "snowflakeUserName": "USER5"
        "snowflakeTags": "[cost_management.tags.cost_center:finance,cost_management.tags.type:PowerUser]"
      },
      "userName": "USER5"
    }
    ```

## Replace user attributes

Method and endpoint:
:   `PATCH /scim/v2/Users/{{id}}`

Description:
:   Replaces attributes of the user associated with the `id` path parameter.

    You must set `op` to `replace` to perform this HTTP request.

    `active` allows the following values:

    * `false`: deactivates the user.
    * `true`: activates the user.

    Returns the HTTP response status code `200` if the HTTP request was successfully completed.

    If unsuccessful, returns the HTTP response code `204`.

Examples:
:   Deactivate a user and update their `givenName` to `deactivated_user`:

    ```json
    {
      "schemas": ["urn:ietf:params:scim:api:messages:2.0:PatchOp"],
      "Operations": [
        {"op": "replace", "value": { "active": false }}
        {"op": "replace", "value": { "givenName": "deactivated_user" }}
      ],
    }
    ```

    Update a user with `userName` and `loginName` mapped to the same value:

    ```json
    {
      "schemas": ["urn:ietf:params:scim:api:messages:2.0:PatchOp"],
      "Operations": [
        {"op": "replace", "value": { "active": false }}
      ]
    }
    ```

    Update a user with `userName` and `loginName` mapped to different values.
    If Okta is your identity provider, follow [this procedure](scim-okta.md) instead.

    ```json
    {
      "Operations": [
        {
          "op": "replace",
          "path": "userName",
          "value": "test_updated_name"
        },
        {
          "op": "replace",
          "path": "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User.snowflakeUserName",
          "value": "USER5"
        }
      ],
      "schemas": ["urn:ietf:params:scim:api:messages:2.0:PatchOp"]
    }
    ```

    Update tag values assigned to a user:

    ```json
    {
      "Operations": [
        {
          "op": "replace",
          "path": "userName",
          "value": "test_updated_name"
        },
        {
          "op": "replace",
          "path": "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User.snowflakeTags",
          "value": "[cost_management.tags.cost_center:finance,cost_management.tags.type:PowerUser]"
        }
      ],
      "schemas": ["urn:ietf:params:scim:api:messages:2.0:PatchOp"]
    }
    ```

## Update a user

Method and endpoint:
:   `PUT /scim/v2/Users/{{id}}`

Description:
:   Updates the attributes of the user associated with the `id` path parameter.

    If unsuccessful, returns the HTTP response code `400`. The HTTP request is unsuccessful if the request tries to change immutable
    attributes or if the attributes being changed do not exist in Snowflake.

Examples:
:   Update a user and their `"defaultRole"`, `"defaultSecondaryRoles"`, and `"defaultWarehouse"` attributes.

    To specify the `"defaultRole"`, `"defaultSecondaryRoles"`, and `"defaultWarehouse"` attributes, you must use one of the
    `extension` schemas. The `defaultSecondaryRoles` attribute only accepts `"ALL"` as
    a value.

    > **Note:**
    >
    > The PUT method is more expensive than the PATCH method. Use the PATCH operation instead.

    ```json
    {
      "schemas": [
       "urn:ietf:params:scim:schemas:core:2.0:User",
       "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User"
      ],
      "userName": "test_user_1",
      "password": "test",
      "name": {
        "givenName": "test",
        "familyName": "user"
      },
      "emails": [{
        "primary": true,
        "value": "test.user@example.com",
        "type": "work"
      }
      ],
      "displayName": "test user",
      "active": true,
      "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User": {
        "defaultRole" : "test_role",
        "defaultSecondaryRoles" : "ALL",
        "defaultWarehouse" : "test_warehouse"
      }
    }
    ```

    Update a user and their tag values. Omitting a tag value or omitting the tag entirely from the request will remove the tag from the user.

    ```json
    {
      "schemas": [
       "urn:ietf:params:scim:schemas:core:2.0:User",
       "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User"
      ],
      "userName": "test_user_1",
      "password": "test",
      "name": {
        "givenName": "test",
        "familyName": "user"
      },
      "emails": [{
        "primary": true,
        "value": "test.user@snowflake.com",
        "type": "work"
      }
      ],
      "displayName": "test user",
      "active": true,
      "urn:ietf:params:scim:schemas:extension:enterprise:2.0:User": {
        "defaultRole" : "test_role",
        "defaultSecondaryRoles" : "ALL",
        "defaultWarehouse" : "test_warehouse",
        "snowflakeTags": "[cost_management.tags.cost_center:finance,cost_management.tags.type:PowerUser]"
      }
    }
    ```

## Delete a user

Method and endpoint:
:   `DELETE /scim/v2/Users/{{id}}`

Description:
:   Deletes the user associated with the `id` path parameter.

---
title: Search optimization cost estimation and management
source: https://docs.snowflake.com/en/user-guide/search-optimization/cost-estimation.md
section: User Guide
---

# Search optimization cost estimation and management

The search optimization service impacts costs for both storage and compute resources:

* Storage resources: The search optimization service creates a search access path data structure that requires space
  for each table on which search optimization is enabled. The storage cost of the search access path depends upon
  multiple factors, including:

  + The number of distinct values in the table. In the extreme case where all columns have data types that use
    the search access path, and all data values in each column are unique, the required storage can be as much as
    the original table’s size.

    Typically, however, the size is approximately 1/4 of the original table’s size.
* Compute resources:

  + Adding search optimization to a table consumes resources during the initial build phase.
  + Maintaining the search optimization service also requires resources. Resource consumption is higher when there is
    high churn (i.e. when large volumes of data in the table change). These costs are roughly proportional to the
    amount of data ingested (added or changed). Deletes also have some cost.

    [Automatic clustering](../tables-auto-reclustering.md), while improving the latency of queries in tables with
    search optimization, can further increase the maintenance costs of search optimization. If a table has a high churn rate,
    enabling automatic clustering and configuring search optimization for the table can result in higher maintenance costs than
    if the table is just configured for search optimization.

    Snowflake ensures efficient credit usage by billing your account only for the actual resources used. Billing is
    calculated in 1-second increments.

    See the “Serverless Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf)
    for the costs per compute hour.

    Once you enable the search optimization service, you can
    view the costs for your use of the service.

> **Tip:**
>
> Snowflake recommends starting slowly with this feature (i.e. adding search optimization to only a few tables at
> first) and closely monitoring the costs and benefits.

## Estimating the costs of search optimization

To estimate the cost of adding search optimization to a table and configuring specific columns for search optimization, use the
[SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS](../../sql-reference/functions/system_estimate_search_optimization_costs.md) function.

In general, the costs are proportional to:

* The number of columns on which the feature is enabled and the number of distinct values in those columns.
* The amount of data that changes in these tables.

> **Important:**
>
> Cost estimates returned by the SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS function are best efforts. The actual realized
> costs can vary by up to 50% (or, in rare cases, by several times) from the estimated costs.
>
> * Build and storage cost estimates are based on sampling a subset of the rows in the table
> * Maintenance cost estimates are based on recent create, delete, and update activity in the table

## Viewing the costs of search optimization

You can view the actual billed costs for the search optimization service by using either the web interface or SQL.
See [Exploring compute cost](../cost-exploring-compute.md).

## Reducing the costs of search optimization

You can control the cost of the search optimization service by carefully
[choosing the tables and columns for which to enable search optimization](queries-that-benefit.md).

In addition, to reduce the cost of the search optimization service:

* Snowflake recommends batching DML operations on the table:

  + `DELETE`: If tables store data for the most recent time period (e.g. the most recent day or week or month),
    then when you trim your table by deleting old data, the search optimization service must take into account the
    updates. In some cases, you might be able to reduce costs by deleting less frequently (e.g. daily rather than
    hourly).
  + `INSERT`, `UPDATE`, and `MERGE`: Batching these types of DML statements on the
    table can reduce the cost of maintenance by the search optimization service.
* If you recluster the entire table, consider
  dropping the SEARCH OPTIMIZATION property for that table before
  reclustering, and then
  [add the SEARCH OPTIMIZATION property](enabling.md) back to the table
  after reclustering.
* Before enabling search optimization for substring searches (`ON SUBSTRING(col)`) or VARIANTs (`ON EQUALITY(variant_col)`),
  call [SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS](../../sql-reference/functions/system_estimate_search_optimization_costs.md) to estimate the costs. The initial build and maintenance
  for these search methods can be computationally intensive, so you should assess the trade-off between performance and cost.

---
title: Search optimization service
source: https://docs.snowflake.com/en/user-guide/search-optimization-service.md
section: User Guide
---

# Search optimization service

The search optimization service can significantly improve the performance of certain types of lookup and analytical
queries. An extensive set of filtering predicates are supported (see [Identifying queries that can benefit from search optimization](search-optimization/queries-that-benefit.md)).

> **Note:**
>
> To start with a tutorial that compares execution time with and without search optimization, see
> [Getting Started with Search Optimization](https://quickstarts.snowflake.com/guide/getting_started_with_search_optimization/index.html).

The search optimization service aims to significantly improve the performance of certain types of queries on tables, including:

* Selective point lookup queries on tables. A point lookup query returns only one or a small number of distinct rows. Use case
  examples include:

  + Business users who need fast response times for critical dashboards with highly selective filters.
  + Data scientists who are exploring large data volumes and looking for specific subsets of data.
  + Data applications retrieving a small set of results based on an extensive set of filtering predicates.

  For more information, see [Speeding up point lookup queries with search optimization](search-optimization/point-lookup-queries.md).
* Character data (text) and IP address searches executed with the [SEARCH](../sql-reference/functions/search.md) and
  [SEARCH_IP](../sql-reference/functions/search_ip.md) functions. For more information, see [Speeding up text queries with search optimization](search-optimization/text-queries.md).
* Substring and regular expression searches (for example, [LIKE](../sql-reference/functions/like.md), [ILIKE](../sql-reference/functions/ilike.md),
  [RLIKE](../sql-reference/functions/rlike.md), and so on). For more information, see [Speeding up substring and regular expression queries with search optimization](search-optimization/substring-queries.md).
* Queries on elements in [VARIANT, OBJECT, and ARRAY](../sql-reference/data-types-semistructured.md) (semi-structured)
  columns that use the following types of predicates:

  + Equality predicates.
  + IN predicates.
  + Predicates that use [ARRAY_CONTAINS](../sql-reference/functions/array_contains.md).
  + Predicates that use [ARRAYS_OVERLAP](../sql-reference/functions/arrays_overlap.md).
  + Predicates that use full-text search with [SEARCH](../sql-reference/functions/search.md).
  + Substring and regular expression predicates.
  + Predicates that check for NULL values.

  For more information, see [Speeding up queries of semi-structured data with search optimization](search-optimization/semi-structured-queries.md).
* Queries on elements in [structured ARRAY, OBJECT, and MAP](../sql-reference/data-types-structured.md) (structured)
  columns that use the following types of predicates:

  + Equality predicates.
  + IN predicates.
  + Substring predicates (on STRING fields).

  For more information, see [Speeding up queries of structured data with search optimization](search-optimization/structured-queries.md).
* Queries that use selected geospatial functions with [GEOGRAPHY](../sql-reference/data-types-geospatial.md) values.
  For more information, see [Speeding up geospatial queries with search optimization](search-optimization/geospatial-queries.md).

Once you identify the queries that can benefit from the search optimization service, you can
[enable search optimization](search-optimization/enabling.md) for the columns and tables used in those queries.

The search optimization service is generally transparent to users. Queries work the same as they do without search
optimization; some are just faster. However, search optimization does have effects on certain other table operations. For
more information, see [Working with search-optimized tables](search-optimization/working-with-tables.md).

## How the search optimization service works

To improve performance of search queries, the search optimization service creates and maintains a persistent data
structure called a *search access path*. The search access path keeps track of which values of the table’s columns might
be found in each of its [micro-partitions](tables-clustering-micropartitions.md), allowing some micro-partitions to be
skipped when scanning the table.

A maintenance service is responsible for creating and maintaining the search access path:

* When you enable search optimization, the maintenance service creates and populates the search access path with the
  data needed to perform the lookups.

  Building the search access path can take significant time, depending on the size of the table. The maintenance service
  works in the background and does not block any operations on the table. Queries are not accelerated until the search
  access path has been fully built.
* When data in the table is updated (for example, by loading new data sets or through DML operations), the maintenance service
  automatically updates the search access path to reflect the changes to the data.

  If queries are run while the search access path is still being updated, queries might run more slowly, but will still
  return correct results.

The progress of each table’s maintenance service appears in the `search_optimization_progress` column in the
output of [SHOW TABLES](../sql-reference/sql/show-tables.md). Before you measure the performance improvement of search
optimization on a newly-optimized table, make sure this column shows that the table has been fully optimized.

Search access path maintenance is transparent. You don’t need to create a virtual warehouse for running the
maintenance service. However, there is a cost for the storage and compute resources of maintenance. For more details
on costs, see [Search optimization cost estimation and management](search-optimization/cost-estimation.md).

## Other options for optimizing query performance

The search optimization service is one of several ways to optimize query performance. The following list shows
other techniques:

* Query acceleration
* Creating one or more materialized views (clustered or unclustered)
* Clustering a table

For more information, see [Optimizing query performance](performance-query-options.md).

## Examples

Start by creating a table with data:

```sqlexample
CREATE OR REPLACE TABLE test_table (id INT, c1 INT, c2 STRING, c3 DATE) AS
  SELECT * FROM VALUES
    (1, 3, '4',  '1985-05-11'),
    (2, 4, '3',  '1996-12-20'),
    (3, 2, '1',  '1974-02-03'),
    (4, 1, '2',  '2004-03-09'),
    (5, NULL, NULL, NULL);
```

Add the SEARCH OPTIMIZATION property to the table using [ALTER TABLE](../sql-reference/sql/alter-table.md):

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION;
```

The following queries can use the search optimization service:

```sqlexample
SELECT * FROM test_table WHERE id = 2;
```

```sqlexample
SELECT * FROM test_table WHERE c2 = '1';
```

```sqlexample
SELECT * FROM test_table WHERE c3 = '1985-05-11';
```

```sqlexample
SELECT * FROM test_table WHERE c1 IS NULL;
```

```sqlexample
SELECT * FROM test_table WHERE c1 = 4 AND c3 = '1996-12-20';
```

The following query can use the search optimization service because the implicit cast is on the constant, not the column:

```sqlexample
SELECT * FROM test_table WHERE c2 = 2;
```

The following can’t use the search optimization service because the cast is on the table’s column:

```sqlexample
SELECT * FROM test_table WHERE CAST(c2 AS NUMBER) = 2;
```

An [IN](../sql-reference/functions/in.md) clause is supported by the search optimization service:

```sqlexample
SELECT id, c1, c2, c3
  FROM test_table
  WHERE id IN (2, 3)
  ORDER BY id;
```

If predicates are individually supported by the search optimization service, then they can be joined by the conjunction
`AND` and still be supported by the search optimization service:

```sqlexample
SELECT id, c1, c2, c3
  FROM test_table
  WHERE c1 = 1
    AND c3 = TO_DATE('2004-03-09')
  ORDER BY id;
```

DELETE and UPDATE (and MERGE) can also use the search optimization service:

```sqlexample
DELETE FROM test_table WHERE id = 3;
```

```sqlexample
UPDATE test_table SET c1 = 99 WHERE id = 4;
```

---
title: Search Snowflake objects and resources
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-universal-search.md
section: User Guide
---

# Search Snowflake objects and resources

With Universal Search, you can quickly and easily find objects in your account, data products available to you in the Snowflake Marketplace,
relevant Snowflake Documentation topics, and relevant Snowflake Community Knowledge Base articles all from the [Snowsight home page](ui-snowsight-homepage.md).

Universal Search understands your query and information about your database objects and can find objects with names that differ from
your search terms. Even if you misspell or type only part of your search term, you can still see useful results.

When you use Universal Search, you can use natural language to describe what you’re looking for. For example, you can use keyword search
terms, like “opportunities” or “sales opportunities”, or use more conversational natural language search terms, like
“sales opportunities that are likely to close” or “which opportunities came from partner referrals”.

For example, if you search for “zip codes”, Universal Search returns results such as listings on the Snowflake Marketplace that mention postal
code data and a table with the column name `postal_code`.

To make it easier to find the right data for your project, object metadata such as names, comments, and tags for objects and columns
are searched. Universal Search searches only the object metadata, not the contents of your database objects.

## Search for objects in Snowsight

When you search for objects in Snowsight, the results displayed are based on the privileges of your currently active role and any
secondary roles.

To search, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Enter search terms in the home page search window.

   If you’re not on the home page, select Search and then enter search terms.
3. Press the **return** or **enter** key to execute the search.

   Results are displayed in categories. If a category isn’t displayed, there are no results for that category or your currently active role
   doesn’t have access to those results.
4. Select a search result to view details. For a database object, you can select Open in Worksheets to query the object in a worksheet.

To get or purchase listings offered on the Snowflake Marketplace that appear in the results, you must have agreed to the Snowflake Provider
and Consumer Terms. See
[Legal requirements for providers and consumers of listings](../collaboration/collaboration-listings-legal.md).

## Supported object types

Universal Search returns results for:

* Tables & views (including standard, Apache Iceberg™, dynamic, and hybrid)
* Workspace files
* Worksheets & dashboards
* Notebooks
* Streamlits
* ML models
* Streams, tasks, pipes, and stages
* Installed applications
* Application packages
* Databases & schemas
* User-defined functions (UDF) and stored procedures
* Feature views
* Data products in the Snowflake Marketplace
* Data products in your Internal Marketplace
* Documentation pages on <https://docs.snowflake.com> and <https://other-docs.snowflake.com>
* Knowledge Base articles on <https://community.snowflake.com/>

## Limitations and considerations

Universal Search is optimized for search terms in English.

New objects can take up to a few hours after they are created to appear in search results. Existing objects that are dropped and recreated,
such as by scheduled tasks or automated pipelines, can disappear from search results for up to a few hours until the recreated objects are
indexed.

---
title: Secure catalogs
source: https://docs.snowflake.com/en/user-guide/opencatalog/secure-catalogs.md
section: User Guide
---

# Secure catalogs

The catalog admin manages a catalog.

When you secure a catalog, you define the actions that a service principal can perform on the following securable objects for the catalog:

* The entire catalog
* A namespace in the catalog and its objects (tables, views, etc.) and optionally any child namespaces nested under it and their objects
* A table in the catalog
* A view in the catalog

**Note**

> If you haven’t already created catalog roles, which you grant with access control privileges when you secure a catalog, create them now. For details, see [Create a catalog](create-catalog.md). If you haven’t already created the principal roles you need to logically group Open Catalog service principals together, create them now. For details, see [Create a principal role](create-principal-role.md).

## Secure a catalog

The workflow to secure a catalog is as follows:

Step 1: Grant catalog privileges on a catalog role.

Step 2: Grant the catalog role to a principal role.

If needed, you can later update the privileges granted on a secured catalog.

### Step 1: Grant catalog privileges on a catalog role

To grant privileges on a catalog, you first grant privileges on a catalog role. These privileges specify a set of permissions for actions that a service principal can take on the catalog. For more information about catalog roles, see [Catalog role](access-control.md).

1. Sign in to Open Catalog.
2. In the menu on the left, select **Catalogs**.
3. In the list of catalogs, select the catalog for which you want to grant privileges.
4. Select the **Catalog Details** tab.
5. Select **+ Privilege**.
6. In the **Grant new privileges on** dialog, complete the fields:

   1. For **Catalog role**, select the catalog role you want to grant privileges on.
   2. For **Privileges**, select each privilege to grant on the catalog.

      For a description of the available privileges, see [Access control privileges for a catalog](access-control.md).
   3. Select **Grant privileges**.

      The privileges are granted to the catalog role.

### Step 2: Grant the catalog role to a principal role

To bestow a catalog role’s privileges to the service principals that a principal role is granted to, grant the catalog role to that principal role. For more information about principal roles and service principals, see [Principal role](access-control.md) and [Service principal](overview.md).

1. Sign in to Open Catalog.
2. In the menu on the left, select **Catalogs**.
3. In the list of catalogs, select the catalog for which you want to grant a catalog role to a principal role.
4. Select the **Roles** tab.
5. Select **Grant to Principal Role**.
6. In the **Grant Catalog Role** dialog, complete the fields:

   1. For **Catalog role to grant**, select the catalog role you granted privileges on.
   2. For **Principal role to receive grant**, select the principal role that is granted to the service principal that you want to grant the privileges to.
   3. Select **Grant**.

      The catalog role is granted to the principal role, and the catalog role’s privileges are now bestowed to the service principals that the principal role is granted to.

### Update the privileges granted to a catalog role at the catalog level

If needed, you can update the catalog privileges granted to a catalog role, which updates the privileges bestowed to the service principal.

**Note**

> If you update the privileges bestowed to a service principal, the updates won’t take effect for up to one hour. This means that if you revoke or grant some privileges for a catalog, namespace, or table, the updated privileges won’t take effect on any service principal with access to that catalog for up to one hour.

1. Sign in to Snowflake Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog whose privileges you want to update.
4. From the **Catalog Details** tab, in the Privileges section, select the **Edit** icon for the catalog role whose privileges you want to update.
5. Optional: To add a privilege to the catalog role, for **Privileges**, select the privilege you want to add.
6. Optional: To remove a privilege from the catalog role, select the **x** icon next to the privilege you want to remove.
7. Select **Update privileges**.

## Secure a namespace

The workflow to secure a namespace is as follows:

Step 1: Navigate to the namespace and grant privileges on a catalog role. If you need to grant the same privileges to another namespace, repeat this step for each namespace.

Step 2: Grant the catalog role to a principal role.

If needed, you can later update the privileges granted on a secured namespace.

### Step 1: Grant namespace privileges on a catalog role

To grant privileges on a namespace, you first navigate to the namespace and grant privileges on a catalog role. These privileges specify a set of permissions for actions that a service principal can take on the namespace. For more information about catalog roles, see [Catalog role](access-control.md).

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog containing the namespace you want to grant privileges on.
4. From the catalog object explorer on the left, select the namespace you want to grant privileges on.
5. On the **Namespace Details** tab, select **+ Privilege**.
6. In the **Grant new privileges on** dialog, complete the fields:

   1. For **Catalog role**, select the catalog role you want to grant privileges on.
   2. For **Privileges**, select each privilege to grant on the namespace.

      For a description of the available privileges, see [Access control privileges for a namespace](access-control.md).
   3. Select **Grant privileges**.

      The privileges are granted to the catalog role.

**Note**

> If you need to secure additional namespaces with the same privileges, repeat the previous steps for the other namespaces. When selecting the catalog role, make sure you select the same catalog role for each namespace.

### Step 2: Grant the catalog role to a principal role

To bestow a catalog role’s privileges to the service principals that a principal role is granted to, grant the catalog role to that principal role. For more information about principal roles and service principals, see [Principal role](access-control.md) and [Service principal](overview.md).

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog for which you want to grant a catalog role to a principal role.
4. On the **Roles** tab, select **Grant to Principal Role**.
5. In the **Grant Catalog Role** dialog, complete the fields:

   1. For **Catalog role to grant**, select the catalog role you granted privileges on.
   2. For **Principal role to receive grant**, select the principal role that is granted to the service principal that you want to grant the privileges.
   3. Select **Grant**.

      The catalog role is granted to the principal role, and the catalog role’s privileges are now bestowed to the service principals that the principal role is granted to.

### Update the privileges granted to a catalog role at the namespace level

If needed, you can update the catalog privileges granted to a catalog role, which updates the privileges bestowed to the service principal.

**Note**

> If you update the privileges bestowed to a service principal, the updates won’t take effect for up to one hour. This means that if you revoke or grant some privileges for a catalog, namespace, or table, the updated privileges won’t take effect on any service principal with access to that catalog for up to one hour.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog containing the namespaces whose privileges you want to update.
4. From the catalog object explorer on the left, select the namespace whose privileges you want to update.
5. On the **Namespace Details** tab, in the **Privileges** section, select the **Edit** icon for the catalog role whose privileges you want to update.
6. Optional: To add a privilege to the catalog role, for **Privileges**, select the privilege you want to add.
7. Optional: To remove a privilege from the catalog role, select the **x** icon next to the privilege you want to remove.
8. Select **Update privileges**.
9. Optional: Repeat these steps for any additional namespaces whose privileges you need to update.

## Secure a table

The workflow to secure a table is as follows:

Step 1: Navigate to the table and grant privileges on a catalog role. If you need to grant the same privileges to another table, repeat this step for each table.

Step 2: Grant the catalog role to a principal role.

If needed, you can later update the privileges granted on a secured table.

### Step 1: Grant table privileges on a catalog role

To grant privileges on a table, you first grant privileges on a catalog role. These privileges specify a set of permissions for actions that a service principal can take on the table. For more information about catalog roles, see [Catalog role](access-control.md).

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog containing the table you want to grant privileges on.
4. From the catalog object explorer on the left, expand the applicable namespaces, and then select the table you want to grant privileges on.
5. On the **Table Details** tab, select **+ Privilege**.
6. In the **Grant new privileges on** dialog, complete the fields:

   1. For **Catalog role**, select the catalog role you want to grant privileges on.
   2. For **Privileges**, select each privilege to grant on the namespace.

      For a description of the available privileges, see [access control privileges for a table](access-control.md).
   3. Select **Grant privileges**.

      The privileges are granted to the catalog role.

**Note**

> If you need to secure additional tables with the same privileges, repeat the previous steps for the other tables. When selecting the catalog role, make sure you select the same catalog role for each table.

### Step 2: Grant the catalog role to a principal role

To bestow a catalog role’s privileges to the service principals that a principal role is granted to, grant the catalog role to that principal role. For more information about principal roles and service principals, see [Principal role](access-control.md) and [Service principal](overview.md).

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog for which you want to grant a catalog role to a principal role.
4. On the **Roles** tab, select **Grant to Principal Role**.
5. In the **Grant Catalog Role** dialog, complete the fields:

   1. For **Catalog role to grant**, select the catalog role you granted privileges on.
   2. For **Principal role to receive grant**, select the principal role that is granted to the service principal that you want to grant the privileges to.
   3. Select **Grant**.

      The catalog role is granted to the principal role, and the catalog role’s privileges are now bestowed to the service principals that the principal role is granted to.

### Update the privileges granted to a catalog role at the table level

If needed, you can update the catalog privileges granted to a catalog role, which updates the privileges bestowed to the service principal.

**Note**

> If you update the privileges bestowed to a service principal, the updates won’t take effect for up to one hour. This means that if you revoke or grant some privileges for a catalog, namespace, or table, the updated privileges won’t take effect on any service principal with access to that catalog for up to one hour.

1. Sign in to Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the list of catalogs, select the catalog containing the table(s) whose privileges you want to update.
4. From the catalog object explorer on the left, expand the applicable namespace(s) and then select the table whose privileges you want to update.
5. From the **Table Details** tab, in the Privileges section, select the **Edit** icon for the catalog role whose privileges you want to update.
6. Optional: To add a privilege to the catalog role, for **Privileges**, select the privilege you want to add.
7. Optional: To remove a privilege from the catalog role, select the **x** icon next to the privilege you want to remove.
8. Select **Update privileges**.
9. Optional: Repeat these steps for any additional namespaces whose privileges you need to update.

---
title: Securing ingress of Snowflake requests with egress IP addresses
source: https://docs.snowflake.com/en/user-guide/egress-ip/network-egress.md
section: User Guide
---

# Securing ingress of Snowflake requests with egress IP addresses

You can securely allow ingress access from Snowflake to your external resources by allowing egress IP address ranges generated from
Snowflake through the resource’s network firewall.

You can generate a list of Snowflake egress IP address ranges (as Classless Inter-Domain Routing (CIDR) addresses) that you can use to
represent Snowflake in allowing access through your external server’s network firewall.

## Supported deployments

Stable egress IP addresses are available on AWS Commercial deployments.

## Supported uses

Using egress IP addresses you generate with Snowflake, you can allow ingress access from the following Snowflake features:

* External access from [UDFs and procedures](../../developer-guide/external-network-access/external-network-access-overview.md)
* [Snowpark Container Services external access](../../developer-guide/snowpark-container-services/service-network-communications.md) and Snowflake Openflow on Snowpark Container Services
* [Snowflake Git integration](../../developer-guide/git/git-setting-up.md) with IP-restricted Git servers

## Generate egress IP address ranges

You can generate IP address ranges that Snowflake uses for egress traffic by using the
[SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](../../sql-reference/functions/system_get_snowflake_egress_ip_ranges.md) function.

The generated IP addresses expire, so for ongoing needs you should set up a means to automate refreshing your external server’s firewall
with fresh egress IP addressess, as described in Automate IP address range refreshes.

To generate and use Snowflake egress IP addresses, follow these steps:

1. Call [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](../../sql-reference/functions/system_get_snowflake_egress_ip_ranges.md) to get the current and upcoming
   IP ranges and their expiration times.

   The following code shows example output of the function.

   ```sqlexample
   SELECT
    value: "ipv4_prefix":: VARCHAR AS IP_CIDR_RANGE_FOR_REGION,
    value: "effective":: TIMESTAMP AS IP_CIDR_RANGE_EFFECTIVE,
    value: "expires":: TIMESTAMP AS IP_CIDR_RANGE_EXPIRATION
   FROM TABLE(FLATTEN (INPUT => PARSE_JSON(SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES())));
   ```

   ```output
   +--------------------------+-------------------------+--------------------------+
   | IP_CIDR_RANGE_FOR_REGION | IP_CIDR_RANGE_EFFECTIVE | IP_CIDR_RANGE_EXPIRATION |
   +--------------------------+-------------------------+--------------------------+
   | 153.45.34.0/24           | 2025-08-01 00:00:00.000 | 2026-05-06 01:33:26.726  |
   | 153.45.77.0/24           | 2025-08-01 00:00:00.000 | 2026-05-06 01:33:26.726  |
   +--------------------------+-------------------------+--------------------------+
   ```

   * The `IP CIDR RANGE_EFFECTIVE` column shows the start date when a range starts carrying traffic. A new range should emerge in function output at least 60 days before being “effective”.
   * The `IP CIDR RANGE_EXPIRATION` column shows the date when an IP range stops carrying traffic.
2. Use the IP ranges you obtain to update firewall rules by using APIs, CLIs, or configuration management tools, as described in
   Automate IP address range refreshes.

## Automate IP address range refreshes

Snowflake egress IP addresses expire. To keep access secure, you must update the Snowflake egress IP addresses allowed through your
external server’s firewall so that they’re current.

To keep IP addresses fresh, implement a mechanism to trigger these updates to your external server regularly, such as daily or weekly.
You might do this, for example, by using your environment’s tools.

To make updates, follow these steps in your script:

1. Retrieve Snowflake egress IP address ranges by using [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](../../sql-reference/functions/system_get_snowflake_egress_ip_ranges.md).
2. Compare the newly-retrieved ranges with those you’re currently using.

   You can avoid unnecessary changes by only making updates if the address ranges are different.

   * If they aren’t different, have your script use expiration dates to set a time to check again, such as a few days before the
     expiration.
   * If the newly-retrieved list is different, update your firewall rule programmatically with the new addresses. You can then have the
     script set a new date to check, such as a few days before the new expiration.
3. Log changes made by the script and set up alerts on successful updates or failures.

### Automate updates using your environment’s tools

You can automate the tasks needed to keep Snowflake IP addresses fresh by using scripts and tools. The following describes two examples:

* Scripting with APIs and CLIs on cloud providers such as AWS, Azure, and Google Cloud.

  For cloud environments, you can write scripts by using tools such as Python, PowerShell, and Bash. Your tools can perform the following
  tasks:

  1. Call [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](../../sql-reference/functions/system_get_snowflake_egress_ip_ranges.md) to retrieve the latest IP
     address ranges and expiration dates.
  2. Use the cloud provider’s API or CLI to update security group rules, network ACLs, or firewall policies.
  3. Schedule scripts that perform these actions to run periodically (such as daily or weekly) or based on expiry dates using cron
     jobs. You can run these by using stored procedures with Snowflake tasks.
* Infrastructure-as-code (IaC) tools

  You can use tools such as Terraform, Ansible, or CloudFormation to manage firewall rules as code. The approach described below also
  provides version control and audit trails for firewall rule changes.

  Using these tools, you can perform the following tasks:

  1. Define firewall rules in IaC configurations.
  2. Call [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](../../sql-reference/functions/system_get_snowflake_egress_ip_ranges.md) to retrieve the latest IP
     address ranges and expiration dates.
  3. When new Snowflake egress IP ranges are available, update your IaC configuration with the new ranges.
  4. Apply the changes by using your IaC tool, ensuring that firewall rules are updated programmatically and idempotently.

---
title: Security, Governance & Observability
source: https://docs.snowflake.com/en/user-guide/ecosystem-security.md
section: User Guide
---

# Security, Governance & Observability

Security and governance tools ensure sensitive data maintained by an organization is protected from inappropriate access and tampering,
as well as helping organizations to achieve and maintain regulatory compliance. These tools are often used in conjunction with
observability solutions/services to provide organizations with visibility into the status, quality, and integrity of their data,
including identifying potential issues.

Together, these tools support a wide range of operations, including risk assessment, intrusion detection/monitoring/notification, data
masking, data cataloging, data health/quality checks, issue identification/troubleshooting/resolution, and more.

The following security, governance, and observability tools and technologies are known to provide native connectivity to Snowflake:

| Solution |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Acryl Data:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Metadata Ingestion > Sources > Snowflake](https://datahubproject.io/docs/generated/ingestion/sources/snowflake)     (DataHub Documentation) |
|  |  | **Alation:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Configure Snowflake OAuth for partner applications](oauth-partner.md) (Snowflake Documentation) |
|  |  | **ALTR:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md) (free forever plan). * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [Snowflake data access control and security](https://www.altr.com/ecosystem/snowflake) (ALTR website)   + [Set Up ALTR to Work with Snowflake - Accounts created via altr.com](https://docs.altr.com/docs/set-up-altr-to-work-with-snowflake)     (ALTR Documentation)   + [Snowflake Data Governance Buying Guide](https://www.altr.com/resource/ebook-snowflake-data-governance-buying-guide)     (ALTR eBook)   + [ALTR: Data Query and Anomaly Event Log](https://www.snowflake.com/datasets/altr-data-query-and-anomaly-event-log/)     (Snowflake website) |
|  |  | **Anomalo:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Anomalo Partners With Snowflake to Help Enterprises Trust Their Data](https://anomalo.com/post/anomalo-partners-with-snowflake-to-help-enterprises-trust-their-data)     (Anomalo Blog) |
|  |  | **Atlan:** No requirements  **Snowflake:** No requirements | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [How to Set Up Snowflake in Atlan](https://ask.atlan.com/hc/en-us/articles/4417168972689) (Atlan Documentation)   + [All-in-one Modern Data Workspace for your Snowflake Data Cloud](https://atlan.com/partners/snowflake/) (Atlan website)   + [Atlan + Snowflake](https://6880682.fs1.hubspotusercontent-na1.net/hubfs/6880682/Datasheet%20-%20Snowflake%20+%20Atlan.pdf)     (Atlan Data Sheet)   + [Atlan Is a Snowflake Ready Technology Partner](https://humansofdata.atlan.com/2022/04/atlan-first-data-catalog-snowflake-ready-technology-partner/)     (Atlan Blog) |
|  |  | **Baffle:** No requirements  **Snowflake:** No requirements; however, external tokenization requires Snowflake [Enterprise Edition](intro-editions.md) (or higher) | * Additional resources:    + [De-identifying Data into Snowflake](https://baffle.io/blog/de-identifying-data-into-snowflake) (Baffle Blog)   + [External Tokenization](security-column-ext-token-intro.md) (Snowflake Documentation) |
|  |  | **Bigeye:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [The Field Guide to Trustworthy Data in Snowflake](https://bigeye.com/learn/field-guide-snowflake-data-quality/)     (Bigeye Whitepaper; requires sign-up) |
|  |  | **BigID:** Any supported version of BigID  **Snowflake:** No requirements | * Additional resources:    + [Know Your Data in Snowflake](https://bigid.com/data-coverage/snowflake/) (BigID website)   + [Reduce Risk, Accelerate Governance, and Achieve Compliance in Snowflake](https://bigid.com/blog/accelerate-governance-in-snowflake/)     (BigID Blog)   + [Native Data Access and Masking Control for Snowflake](https://bigid.com/blog/snowflake-data-access-control/) (BigID Blog)   + [BigID Documentation for Snowflake Connector](https://www.docs.bigid.com/bigid/docs/snowflake) (BigID Documentation; login required) |
|  |  | **Collibra:** No requirements  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/). * Additional resources:    + [JDBC Driver for Snowflake](https://marketplace.collibra.com/listings/jdbc-driver-for-snowflake/) (Collibra Marketplace)   + [Snowflake](https://marketplace.collibra.com/wp-content/uploads/2018/09/PRES-Snowflake-240918-1553-809.pdf)     (Collibra Marketplace) |
|  |  | **Comforte:** No requirements  **Snowflake:** No requirements; however, external tokenization requires Snowflake [Enterprise Edition](intro-editions.md) (or higher) | * Additional resources:    + [How to Protect Data on Snowflake with comforte Data Security Platform](https://insights.comforte.com/how-to-protect-data-on-snowflake) (Comforte Blog)   + [External Tokenization](security-column-ext-token-intro.md) (Snowflake Documentation) |
|  |  | **CyberRes Voltage:** SecureData for Snowflake (includes version 6.20 of the Voltage SecureData Simple API, packaged within the JAR file `voltage-snowflake-aws-1.0.0.jar` and `bash` scripts for automating the setup, verification, and removal of Snowflake resources and AWS service instances)  In addition, the AWS account should be provisioned with privileges to create minimally the following services:   * AWS Service (Minimum Privileges Required) * S3 (S3 Buckets: List, Create, Delete Objects (files): Read, Write) * Identity and Access Management IAM (Identity and Access Management IAM) * Lambda (List, Create, Delete, Execute) * API Gateway (API Gateway) * Secrets Manager, if used (Secrets: Access) * Virtual Private Cloud, if used (Subnet: Access Security Group: Access)   **Snowflake:** No requirements | * Additional resources:    + [Voltage SecureData Cloud](https://www.microfocus.com/media/white-paper/voltage-securedata-cloud-wp.pdf)     (CyberRes White Paper)   + [Voltage SecureData for Snowflake Data](https://www.microfocus.com/media/data-sheet/voltage-securedata-for-snowflake-ds.pdf)     (CyberRes Data Sheet)   + [Snowflake + Voltage SecureData](https://www.youtube.com/watch?v=ULZEcYTvjlc) (CyberRes Demo in Youtube) |
|  |  | **Datadog:** Agent 7.23.0  **Snowflake:** No requirements | * Additional resources:    + [Integrations > Snowflake](https://docs.datadoghq.com/integrations/snowflake/) (Datadog Documentation) |
|  |  | **Dataguise:** Data Discovery + Protection Software  **Snowflake:** No requirements | * Additional resources:    + [Snowflake — Overview](https://www.dataguise.com/snowflake-computing/) (Dataguise website)   + [Sensitive Data Governance in Snowflake](https://www.dataguise.com/sensitive-data-governance-in-snowflake/)     (ebook on Dataguise website) |
|  |  | **data.world:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). |
|  |  | **Domo:** No requirements  **Snowflake:** No requirements | Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md).   * Additional resources:    + [Trust Program](https://www.domo.com/platform/security)   + [Advanced governance, robust security and AI driven intelligence](https://www.domo.com/platform/governance)   + [What is data governance?](https://www.domo.com/glossary/what-is-data-governance) |
|  |  | **DvSum:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Configure Snowflake as a Source](https://dvsum.zendesk.com/hc/en-us/articles/10627532986004-Configure-Snowflake-as-a-Source)     (DvSum Help Center) |
|  |  | **Fortanix:** Data Security Manager SaaS  **Snowflake:** External function & AWS API Gateway | * Additional resources:    + [Using Data Security Manager with Snowflake](https://support.fortanix.com/hc/en-us/articles/4407049792148-Using-Data-Security-Manager-with-Snowflake)     (Fortanix Documentation) |
|  |  | **HashiCorp:** Vault 1.7 (or higher), HCP Vault  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Database Secrets Engine](https://www.vaultproject.io/docs/secrets/databases/snowflake)     (Vault Documentation) |
|  |  | **Hunters:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [Creating a Security Data Platform with Snowflake and Hunters.AI](https://www.snowflake.com/blog/creating-a-security-data-platform-with-snowflake-and-hunters-ai/)     (Snowflake Blog)   + [Hunters: The Open XDR Dataset](https://www.snowflake.com/datasets/hunters/) (Snowflake website) |
|  |  | **Immuta:** v2.7 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Integration Overview](https://documentation.immuta.com/SaaS/prologue/about-immuta-platform/data-access-patterns/snowflake/snowflake/)     (Immuta Documentation) |
|  |  | **Informatica Data Governance and Compliance:**   * Cloud Connector for Snowflake — available directly in the Informatica Cloud interface * Secure Agent — download and install from the Informatica Cloud interface   **Snowflake:** No requirements | * Additional resources:    + [Three Steps to Accelerate Data Governance on Snowflake Data Cloud](https://www.informatica.com/blogs/3-steps-to-accelerate-data-governance-on-snowflake-data-cloud.html) (Informatica Blog)   + [Using Snowflake Object Tags with Informatica Data Governance](https://video.informatica.com/detail/video/6282661204001/using-snowflake-object-tags-with-informatica-data-governance?autoStart=true&q=snowflake)     (Informatica Videos) |
|  |  | **jSonar:** No requirements  **Snowflake:** No requirements |  |
|  |  | **Lacework:** No requirements  **Snowflake:** No requirements |  |
|  |  | **Monte Carlo:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Delivering End-to-End Data Trust with Snowflake and Monte Carlo](https://montecarlodata.com/blog-snowflake/)     (Monte Carlo Blog)   + [Monte Carlo: Data Observability Insights](https://snowflake.com/datasets/monte-carlo-data-observability-insights/)     (Snowflake Marketplace) |
|  |  | **Normalyze:** No requirements  **Snowflake:** No requirements | * Additional resources * [Normalyze DSPM Connector for Snowflake](https://app.snowflake.com/marketplace/listing/GZT1ZVNQ93/normalyze-normalyze-dspm-connector-for-snowflake?search=normalyze)   (Snowflake Marketplace) * [Normalyze DSPM connected app for Snowflake](https://normalyze.ai/use-cases/#snowflake-access/connected-app) (Normalyze website) * [Normalyze DSPM native app for Snowflake](https://normalyze.ai/use-cases/#snowflake-access/native-app) (Normalyze website) |
|  |  | **Okera:** Okera SaaS or Okera v2.10+  **Snowflake:**   * For Standard Edition accounts, use Okera BI Gateway. * For Enterprise Edition accounts (or higher), no requirements. | * Additional resources:    + [Create a Snowflake Connection](https://docs.okera.com/odas/latest/catalog/sf_connect/) (Okera Documentation)   + [Migrate Sensitive Data to Snowflake Data Cloud](https://www.okera.com/wp-content/uploads/2022/06/Okera-for-Snowflake-Datasheet-2022-June-15.pdf) (Okera datasheet)   + [Simplify data access control for Snowflake](https://www.okera.com/partners/technology-snowflake/) (Okera demo) |
|  |  | **OneTrust:** Data Governance  **Snowflake:** No requirements | * Additional resources:    + [OneTrust Partners with Snowflake to Simplify Data Classification & Enforce Policy](https://www.onetrust.com/blog/onetrust-partners-with-snowflake-to-simplify-data-classification-enforce-policy/)     (OneTrust Blog) |
|  |  | **OvalEdge:** OvalEdge v3.0 or greater  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Set Up](https://support.ovaledge.com/snowflake) (OvalEdge Documentation)   + [Snowflake Data Security](https://support.ovaledge.com/snowflake-rdam) (OvalEdge Documentation)   + [How Progressive, End-to-End Data Governance Supports Business Agility](https://www.ovaledge.com/blog/data-governance-in-snowflake) (OvalEdge Blog) |
|  |  | **Privacera:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Privacera PolicySync](https://docs.privacera.com/latest/pm-ig/policysync/#snowflake) (Privacera Documentation)   + [PrivaceraCloud](https://docs.privacera.com/cloud/en/snowflake.html) for Snowflake (Privacera Documentation) |
|  |  | **Protegrity:** No requirements  **Snowflake:** No requirements; however, external tokenization requires Snowflake [Enterprise Edition](intro-editions.md) (or higher) | * Additional resources:    + [Snowflake + Protegrity ‘Experience’ in Action - Free Trial](https://www.protegrity.com/snowflake-partnership)     (Protegrity website)   + [External Tokenization](security-column-ext-token-intro.md) (Snowflake Documentation) |
|  |  | Validated by the [Snowflake Ready Technology Validation Program](https://www.snowflake.com/partners/technology-partners/snowflake-ready-technology-validation-program/).  **QLIK QUALITY AND GOVERNANCE**: No requirements  **Snowflake:** No requirements | * Additional resources:    + [Start a trial](https://www.qlik.com/us/trial/talend-data-fabric)   + [Qlik Talend Trust Score](https://www.qlik.com/us/products/data-quality-governance) |
|  |  | **Satori:** No requirements, but must change the hostname to use Satori as the hostname  **Snowflake:** No requirements | * Additional resources:    + [Snowflake Guide](https://satoricyber.com/docs/datastores/snowflake/) (Satori Documentation) |
|  |  | **SecuPi:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Don’t Fear the Cloud: A Snowflake Security Solution](http://www.secupi.com/dont-fear-the-cloud-a-snowflake-security-solution/)     (SecuPi website) |
|  |  | **Select Star:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Select Star and Snowflake Partner to Take Data Governance to a New Level](https://blog.selectstar.com/selectstar-and-snowflake-partner-to-take-data-governance-to-a-new-level-a9d274e1d4c6) (Select Star Blog)   + [Snowflake - Select Star Integration](https://docs.selectstar.com/integrations/snowflake) (Select Star Documentation)   + [Getting Started: Snowflake](https://docs.selectstar.com/learning-data/getting-started-snowflake) (Select Star Documentation) |
|  |  | **Skyflow:** No requirements  **Snowflake:** No requirements | Additional resources:   * [Skyflow + Snowflake Demo Registration](https://info.skyflow.com/snowflake-partner-skyflow) (Skyflow website) |
|  |  | **Sled:** No requirements  **Snowflake:** No requirements | Additional resources:   * [Get started with Sled on Snowflake in 30min](https://docs.sled.so/getstarted) (Sled Documentation) * [Data Catalog, Data Observability and Metric Store for Snowflake](https://www.sled.so/) (Sled website) |
|  |  | **Spring Labs:** No requirements  **Snowflake:** No requirements | Additional resources:   * [Spring Labs + Snowflake](https://springlabs.com/spring-labs-snowflake) (Spring Labs website) |
|  |  | **Tamr:** 2019.011.0-0.3  **Snowflake:** No requirements | * Additional resources:    + [Tamr + Snowflake](https://www.tamr.com/snowflake-automate-data-mastering-for-customer-data-get-insights-faster/)     (Tamr website) |
|  |  | **Thales:** No requirements  **Snowflake:** No requirements | * Additional resources * [CipherTrust Integrations > Snowflake](https://thalesdocs.com/ctp/ig/snowflake/index.html) (Thales Documentation) |
|  |  | **ThoughtSpot:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Software > Connections > Snowflake](https://docs.thoughtspot.com/software/latest/connections-snowflake) (Thoughtspot Documentation)   + [Configure Snowflake OAuth for partner applications](oauth-partner.md) (Snowflake Documentation) |
|  |  | **Trustlogix:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Snowflake](https://trustlogix.io/snowflake/) (Trustlogix website) |

---
title: Semantic View Autopilot
source: https://docs.snowflake.com/en/user-guide/views-semantic/autopilot.md
section: User Guide
---

# Semantic View Autopilot

In Snowsight, you can create and manage semantic views to define logical tables over your data in Cortex Analyst. Semantic views abstract the physical tables and provide a business-friendly layer over your data. You can use the semantic views with Cortex Analyst to answer business questions and perform data analysis. You can create a semantic view manually or use the Semantic View Autopilot, an AI-assisted generator, to create a semantic view.

> **Note:**
>
> You can use the instructions in this section to also create a semantic model, but we recommend using semantic views instead. Semantic views provide the following features:
>
> * Semantic views support advanced features such as Derived Metrics.
> * Semantic views support access modification. They’re public by default, but you can make them private.
> * Semantic views are schema objects that integrate with Snowflake’s privilege system, sharing mechanisms, and metadata catalog. Semantic models are YAML files stored in a stage and lack these native database integrations.

The generator uses the following inputs to build your view:

* Query History: Surfaces historical SQL queries to identify common usage patterns, relationships, and verified query suggestions.
* Table Metadata: Extracts descriptions, primary/unique keys, and cardinality to determine relationships.
* Context (Highly Recommended): Uses example SQL queries or Tableau files you provide to validate relationships and extract relevant business logic.

## Prerequisites

To create a semantic view, you must use a role with the following privileges:

* CREATE SEMANTIC VIEW on the schema where you are creating the view
* USAGE on the database and schema
* SELECT on the tables and views used in the semantic view

You can export a model from Tableau and use it to automatically generate a semantic view. In addition to the preceding prerequisites, the Tableau ingestion feature requires:

* A stage where you have write permissions.
* If your Tableau file contains Custom SQL, you must also have the CREATE VIEW privilege on the schema because the SQL is parsed into a regular Snowflake view.

## Options for providing context

While providing context is optional, it’s extremely useful in creating a high-quality semantic view. Without it, the model only uses the database schema information, which might lack business nuance. We support the following options for providing context:

### Option 1: Upload Tableau file

Semantic View Autopilot supports using a file from Tableau to automatically generate a semantic model. This lets you migrate your existing business logic and metadata directly into Snowflake.

You can either use Tableau Desktop or Tableau Online to provide the file to Semantic View Autopilot. Semantic View Autopilot supports the following file formats:

* `TWB`
* `TWBX`
* `TDS`

The file must meet the following constraints:

> * File Size: Must be under 50 MB.
> * No Published Datasources: Files containing published datasources are not currently supported.
> * No Large Extracts: If using a .twbx file, ensure it does not contain a large extract. If using a .twb file, ensure it does not contain large filters or parameters.
> * LOD Calculations: Level of Detail (LOD) calculations are not supported.

You can get the `TWB` or `TWBX` file from Tableau Desktop. If you can’t find it, you can go to `File | Save As` and choose to save as a `TWB`.

For information about getting a view or workbook from Tableau Online, see [Download Views and Workbooks](https://help.tableau.com/current/pro/desktop/en-us/export.htm).

After you provide the Tableau file to Semantic View Autopilot, autopilot parses it to extract the following metadata:

> * Tables and Columns
> * Relationships between tables
> * Tableau calculated fields
> * Parameters and Filters
> * Custom SQL (parsed and turned into a regular Snowflake view)

### Option 2: Provide SQL queries

You can add example natural language questions and their corresponding SQL queries. This helps the model learn your specific business logic and create relationships.

Snowflake uses these queries to pre-select tables and columns in subsequent steps, and will also auto-add these queries as “verified queries” in the semantic model. Additionally, if valid relationships can be inferred, these will get added to the semantic view.

## Create a semantic view

To create a semantic view, first navigate to the generator:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Cortex Analyst.
3. At the top, select Create new.
4. Select Create new Semantic View.

After you’ve navigated to the generator, you can define the basic information for the semantic view:

1. Select the Location (Database and Schema) to store the view.
2. Enter a Name for the semantic view.
3. Enter a Description. Use clear business terminology to help the AI understand the view’s purpose.
4. Select Next.

To provide context and data as a Tableau file, do the following:

1. Select Tableau Files to upload a Tableau .twb, .tds, or .twbx file.
2. Select the Tableau file to upload.
3. Select Next.

You’ve now successfully provided the context and data for the semantic view as a Tableau file.

To provide context and data as SQL queries, do the following:

1. Select SQL Queries to manually add gold standard example SQL queries.
2. Enter the SQL query.
3. Select Next.
4. Review the tables and columns selected from your tables.
5. Choose the specific columns to include.
6. Configure the AI Options:

> * Sample Values: Select whether to add sample values. This significantly improves Cortex Analyst’s accuracy by helping it recognize specific data values, such as specific region names.
> * AI-Generated Descriptions: Select whether to auto-generate descriptions for tables and columns based on their names and content. This is also a feature that significantly improves accuracy.

To create the semantic view, do the following:

1. Select Create and save.
2. Select Save and run.

It might take a few minutes to generate the semantic view. You can view the progress on the screen.

## Best practices for creating semantic views

When you’re creating a semantic view, follow these tips to ensure high precision.

* Think from the end-user’s perspective. Use names and synonyms that match the vocabulary your business users actually use (for example, “Revenue” instead of AMT_TOT).
* Start simple. Start with a small, focused scope. For example, Sales Analytics with 3-5 tables and expand gradually. This ensures higher accuracy than a massive, “do-it-all” model.
* Review generated content. Always review AI-generated descriptions and relationships. Ensure they align with your actual business logic.
* Capture complex logic. Use Metrics and Verified Queries to handle complex calculations so users don’t have to rely on the LLM deducing them from raw columns.
* Test and iterate. After creation, test the view with real business questions in Cortex Analyst. If an answer is wrong, add a Verified Query or update a Description to fix it.

---
title: Semantic View Editor
source: https://docs.snowflake.com/en/user-guide/views-semantic/editor.md
section: User Guide
---

# Semantic View Editor

The Semantic View Editor in Snowsight provides a visual interface for creating and editing
[Semantic Views](overview.md). Whether you’re refining a view created by the [Autopilot](autopilot.md),
building one from scratch, or editing an uploaded YAML specification, the editor helps you define business concepts,
metrics, and relationships over your data.

The Semantic View Editor allows you to:

* Define logical tables that map to your physical database tables
* Create dimensions (categorical attributes), facts (row-level data), and metrics (aggregated measures)
* Establish relationships between tables
* Add verified queries as examples for Cortex Analyst
* Provide custom instructions for query generation
* Configure synonyms and descriptions to improve discoverability

## Accessing the editor

You can access the Semantic View Editor through the data catalog or through Cortex Analyst.

### Through the data catalog

To access an existing semantic view through the data catalog:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer
3. Navigate to your database and schema.
4. Select Semantic Views in the object list.
5. Select the semantic view you want to edit.
6. Select the Semantic information tab to open the editor.

### Through Cortex Analyst

To access semantic views through Cortex Analyst:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Cortex Analyst.
3. Select the Semantic Views tab.
4. Either:

   * Select an existing view to edit it
   * Select Create new to create a new semantic view

## Edit semantic view metadata

The semantic view name and description help users discover and understand the purpose of the view.

To edit the semantic view name or description:

1. In the editor, select Edit next to the semantic view name at the top of the page.
2. Update the Name or Description fields.
3. Select Save.

> **Tip:**
>
> Write clear, detailed descriptions that explain:
>
> * What business questions this view can answer
> * What data sources it includes
> * Who should use this view
>
> Example: “Revenue analysis across products and customers, including year-over-year trends.
> Use this view to analyze sales performance by region, product category, and customer segment.”

## Manage logical tables

Logical tables represent business entities (such as customers, orders, or products) and map to physical database
tables or views. Each semantic view contains one or more logical tables.

### Add a logical table

To add a logical table to your semantic view:

1. In the editor, select + Logical Table.
2. Browse and select the physical table or view from your database.
3. Select Next.
4. Choose which columns to include from the table.
5. Select Generate logical table.

The editor automatically generates dimensions and facts based on the selected columns.

### Edit a logical table

To modify an existing logical table:

1. Select Edit next to the table name (or select More options » Edit Logical Table).
2. Modify the table properties:

   * Name: The business-friendly name for this table
   * Description: Explanation of what this table represents
   * Synonyms: Alternative names (comma-separated)
   * Primary Key: Columns that uniquely identify rows
3. Select Save.

> **Tip:**
>
> Use the Generate fields button to let AI automatically fill in descriptions and synonyms based on your
> data and column names. This can significantly speed up the initial setup process.

## Managing facts, dimensions, and metrics

Within each logical table, you define the business concepts that users can query: dimensions, facts, and metrics.

### Understanding the content types

* **Dimensions**: Categorical attributes that provide context (such as, customer name, product category, or order date)
* **Facts**: Row-level quantitative data (such as, sale amount, quantity, or unit price)
* **Metrics**: Aggregated measures calculated from functions like SUM, AVG, or COUNT (such as, total revenue, average order value)

### Adding dimensions, facts, or metrics

To add a new item to a logical table:

1. Navigate to the logical table in the editor.
2. Select + next to Dimensions, Facts, or Metrics.
3. Enter the required details:

   * Name: Descriptive name for this item
   * Expression: SQL expression to calculate the value
   * Data Type: The data type of the result
4. Select Add

### Edit or remove items

To modify or delete an existing dimension, fact, or metric:

1. Select the item to open its details and edit properties.
2. Or select More options » Remove to delete the item.
3. Select Save to apply changes.

### Advanced features

**Derived Metrics**: You can create view-level metrics that combine metrics from multiple tables.
For more information, see [Defining derived metrics](sql.md).

**Private Access Modifiers**: Mark facts or metrics as private to hide them from queries while still using them in
other calculations. For more information, see [Marking a fact or metric as private](sql.md).

**Preferred join paths for metrics**: If there are
[multiple relationship paths between two logical tables](sql.md), you can
choose the relationship to use from the Preferred join path menu.

## Managing relationships

Relationships define how logical tables join together, enabling queries that span multiple tables. Each relationship
defines which columns in one table reference columns in another table.

### Adding a relationship

To create a relationship between two logical tables:

1. In the editor, select + next to Relationships.
2. Enter a descriptive Name for the relationship (for example, “orders_to_customers”).
3. Select the Left Table (the table with the foreign key).
4. Select the Right Table (the table being referenced).
5. Specify the Join Columns for each table:

   * Left Column: The foreign key column(s) in the left table
   * Right Column: The primary key or unique column(s) in the right table
6. Select Add.

The relationship now appears in the Relationships list and enables Cortex Analyst to generate queries that join these tables.

> **Note:**
>
> For semantic views, you typically don’t need to specify join types (left outer, inner) or relationship types
> (one-to-one, many-to-one). These are automatically inferred from the data and primary key definitions at query time.

### Editing or removing relationships

To modify or delete a relationship:

1. Select the relationship to view its details.
2. Edit the properties as needed, or select Remove to delete it.
3. Select Save to apply changes.

## Advanced features for Cortex Analyst

To improve the accuracy and reliability of Cortex Analyst, you can add context and guidance through verified queries,
synonyms, and custom instructions.

### Verified queries

Verified queries provide example questions with their correct SQL answers. They serve two purposes:

* Help Cortex Analyst understand how to answer similar questions
* Provide suggested questions for users to get started

Adding a verified query:

1. Select + next to Verified Queries.
2. Enter a natural language Question (for example, “What are the top 10 products by revenue?”).
3. Enter the corresponding SQL Query that correctly answers the question.
4. (Optional) Check Use as onboarding question to show this as a suggestion to users.
5. Select Add.

> **Tip:**
>
> Add verified queries for:
>
> * Common business questions users are likely to ask
> * Complex queries that require specific logic
> * Edge cases or unusual calculations
> * Questions that demonstrate the view’s capabilities

### Synonyms

> **Note:**
>
> Add synonyms manually rather than auto-generating them with AI. Focus on domain-specific alternatives like internal terminology, abbreviations, or legacy names. Auto-generated synonyms often reduce semantic view quality.

Synonyms help users discover and query your data using alternative terminology. For example, users might refer to
“customers” as “clients” or “accounts.”

Adding synonyms to a table or field:

1. Navigate to the table, dimension, fact, or metric you want to add synonyms for.
2. Select Edit to open the item’s properties.
3. In the Synonyms field, enter alternative terms separated by commas.
4. Select Save.

Example synonyms:

* For a “customer_name” dimension: “client name, account name, buyer name”
* For a “revenue” metric: “sales, income, earnings”
* For an “orders” table: “sales orders, purchases”

### Custom instructions

Custom instructions provide specific guidance to Cortex Analyst for SQL generation and question categorization.
Use custom instructions to:

* Define business rules and constraints
* Specify default behaviors
* Handle ambiguous questions
* Reject certain types of questions

Add a custom instruction by:

1. In the editor, select the Custom Instructions section.
2. Enter instructions in natural language. Examples:

   * “Always filter by active customers (status = ‘ACTIVE’) unless specified otherwise”
   * “Round all monetary values to 2 decimal places”
   * “When asked about revenue, use net_revenue metric unless gross revenue is explicitly requested”
   * “If a question asks about users without specifying a region, ask the user to clarify which region”
3. Select Save.

For more information about custom instructions on semantic views, see [Providing custom instructions for Cortex Analyst](sql.md).

## Uploading a YAML file

If you have an existing semantic view YAML specification or a legacy semantic model YAML file, you can upload it
to create a new semantic view or update an existing one.

To upload a YAML file:

1. In the navigation menu, select AI & ML » Cortex Analyst.
2. Select Create new » Upload YAML file.
3. Browse and select your YAML file.
4. Review the generated semantic view structure in the editor.
5. Select Convert and save to create the semantic view as a schema-level object.

The editor converts the YAML specification into a native Snowflake semantic view, which you can then edit using the
visual interface.

For information about the YAML specification format, see [YAML specification for semantic views](semantic-view-yaml-spec.md).

For information about converting a specification to a semantic view programmatically, see [Creating a semantic view from a YAML specification](sql.md).

## Sharing and granting privileges

To allow other users or roles to use your semantic view, you need to grant them appropriate privileges.

### Granting access through the editor

To quickly grant access to a semantic view:

1. In the editor, select Share (or More options » Share).
2. Select the role to grant access to.
3. Confirm the grant operation.

This grants both SELECT and REFERENCES privileges on the semantic view, which allows the role to:

* Query the semantic view
* Use the semantic view with Cortex Analyst

### Understanding privileges

Semantic views support Snowflake’s standard privilege model:

* **SELECT**: Required to query the semantic view and view its contents
* **REFERENCES**: Required to use the semantic view with Cortex Analyst and see its structure
* **OWNERSHIP**: Full control over the semantic view

For more information about granting privileges on semantic views, including future grants and more complex scenarios,
see [Granting privileges on semantic views](sql.md).

### Sharing semantic views

You can share semantic views across accounts using Snowflake’s sharing mechanisms. For more information,
see [Sharing semantic views](sharing-semantic-views.md).

---
title: Sending email notifications
source: https://docs.snowflake.com/en/user-guide/notifications/email-notifications.md
section: User Guide
---

# Sending email notifications

To send an email notification:

1. Make sure that the intended recipients verify their email addresses.
2. Create a notification integration.
3. Call a stored procedure to send the notification.

## Verify the email addresses of the email notification recipients

You can send email notifications only to Snowflake users within the same account.

Users can verify their own email addresses through [Snowsight (the Snowflake web interface)](../ui-snowsight-profile.md).

Administrators can verify the email address of other users by calling the [SYSTEM$START_USER_EMAIL_VERIFICATION](../../sql-reference/functions/system_start_user_email_verification.md) function.

## Create an email notification integration

To send email notifications, use an email notification integration that you create with the
[CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-email.md) command.

> **Note:**
>
> You must use a role that has the global CREATE INTEGRATION privilege to run this command.

For example, to create an email notification integration named `my_email_int`, execute the following statement:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_email_int
  TYPE=EMAIL
  ENABLED=TRUE;
```

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

### Restrict the list of email addresses that can receive notifications

If you want to restrict the list of email addresses that can receive notifications through this integration, set
ALLOWED_RECIPIENTS to the list of those email addresses. If you do not set ALLOWED_RECIPIENTS, the integration can be used to
send notifications to any user in the account, provided that the
email address has been verified.

> **Note:**
>
> For each email address in ALLOWED_RECIPIENTS, make sure that the email address has been verified. If you specify an email
> address that hasn’t been verified, the CREATE NOTIFICATION INTEGRATION command fails with an error.

For example, to restrict the notification integration so that email messages can be sent only to `first.last@example.com` and
`first2.last2@example.com`, set ALLOWED_RECIPIENTS to the list of those addresses:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_email_int
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=('first.last@example.com','first2.last2@example.com');
```

For details about the syntax of this command, see [CREATE NOTIFICATION INTEGRATION (email)](../../sql-reference/sql/create-notification-integration-email.md).

### Specify a default list of recipients and a default subject line

If you are using the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](snowflake-notifications.md) stored
procedure to send email notifications, you can configure the notification integration with a default list of email addresses
and a default subject line to use. You can override the default list and subject line when you call the stored procedure.

* To specify a default list of email addresses, set the DEFAULT_RECIPIENTS property of the notification integration.
* To specify a default subject line, set the DEFAULT_SUBJECT property of the notification integration.

For example, suppose that you want to set up an email notification integration for the following purpose:

* You want to send most email notifications to `person_a@example.com` and `person_b@example.com`, but you also want the
  ability to send the notifications to the validated email addresses of any users in your account.
* You want most messages to use the subject line “Service status”, but you want to be able to use a different subject line for
  specific messages.

To create an email notification for this purpose, execute the following command:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_email_int
  TYPE=EMAIL
  ENABLED=TRUE
  DEFAULT_RECIPIENTS = ('person_a@example.com','person_b@example.com')
  DEFAULT_SUBJECT = 'Service status';
```

When sending the notification, you can override the list of default recipients and the default subject line. See
[Override the default values in the email notification integration](snowflake-notifications.md).

## Send the email notification

You can call one of the following stored procedures to send an email notification:

* [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md)

  For details, see [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](snowflake-notifications.md).
* [SYSTEM$SEND_EMAIL](../../sql-reference/stored-procedures/system_send_email.md)

  For details, see [Using SYSTEM$SEND_EMAIL to send email notifications](email-stored-procedures.md).

---
title: Sending email notifications about Trust Center findings
source: https://docs.snowflake.com/en/user-guide/trust-center/notifications-trust-center.md
section: User Guide
---

# Sending email notifications about Trust Center findings

Using the Trust Center Snowsight interface, you can configure the Trust Center to send email
notifications when its scanners generate findings. You can specify that the Trust Center sends notifications for all of the
enabled scanners in a scanner package or for individual scanners. You can also specify the severity of the
findings for which email notifications are sent.

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

> **Note:**
>
> Snowflake trial accounts can’t use this feature to send email notifications.

## Email notification recipients

For a scanner package or individual scanner, the Trust Center can send email notifications to users
with verified email addresses. When you
configure notifications, you can specify the users who will receive the email notifications:

* Admin users

  > > The Trust Center sends notifications to administrative users who are
  > > [configured to receive security notifications](../ui-snowsight-contacts.md).
  > >
  > > When this option is selected, the Trust Center sends notifications to users in the following order:
  > >
  > > 1. The security notification contact at the organization level.
  > > 2. If no security notification contact at the organization level is found, the security notification contact at the account level.
  > > 3. If no security notification contact at the organization level is found, the ACCOUNTADMIN users with
  > >    verified email addresses.
  >
  > > **Note:**
  > >
  > > When an organization account and a customer account within that organization are located in different deployments,
  > > Security Updates emails configured at the organization level are not visible from the customer account.
* Custom

  > The Trust Center sends notifications to a custom list of users. Add each user who should receive notifications
  > to the list. You can remove a user from the list by selecting the trash can icon associated with the user.

The Trust Center can send email notifications to at most 50 users.

> **Attention:**
>
> By default, the **Security Essentials** scanner package sends email notifications to
> Admin users with verified email addresses for findings at the critical severity level. By default,
> the **Security Essentials** scanners run once a month. When a scanner runs, it sends an
> email notification to the configured recipients every time it generates a finding at or above the threshold level.
>
> By default, email notifications aren’t sent for other scanner packages or scanners.
>
> You can modify the email notifications settings for scanner packages and for individual scanners.

## Verifying the email addresses of the email notification recipients

The Trust Center can send email notifications only to users who verify their email addresses in Snowsight. For more information,
see [Snowsight (the Snowflake web interface)](../ui-snowsight-profile.md).

## Managing email notifications for a scanner package

Complete the following tasks to manage email notifications for a scanner package:

* Configure email notifications for a scanner package
* Turn off email notifications for a scanner package

### Configure email notifications for a scanner package

A scanner package must be enabled before you can configure email notifications for it. For information about
enabling a scanner package, see [Enable scanner packages](using-the-trust-center.md).

When you configure email notifications for a scanner package, the Trust Center sends notifications for all of the enabled
scanners in the package.

To configure email notifications for a scanner package, complete the following steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role with the [SNOWFLAKE.TRUST_CENTER_ADMIN](overview.md) application role granted to it.

   For more information about granting these roles, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select the Settings tab.
7. Under Notifications, select one of the following:

   * If notifications are turned off for the scanner package, select Set up notification.
   * If notification are turned on for the scanner package, select the edit icon.
8. Set the Minimum severity level trigger.

   The Trust Center sends email notifications for findings at the specified level or higher. For example, if
   the Minimum severity level trigger is set to Medium, the Trust Center sends findings with a severity of
   medium, high, or critical, but not low.
9. For Recipients, select Admin users or Custom. For more information,
   see Email notification recipients.
10. To save your changes, select Done, or select Cancel to cancel them.

### Turn off email notifications for a scanner package

When you turn off email notifications for a scanner package, you can’t enable email notifications for individual
scanners in the package.

To turn off email notifications for a scanner package, complete the following steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role with the [SNOWFLAKE.TRUST_CENTER_ADMIN](overview.md) application role granted to it.

   For more information about granting these roles, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select the Settings tab.
7. Under Notifications, select the edit icon.
8. Select Turn off notification, and then select Turn off in the confirmation window.

   If email notifications aren’t turned on for the scanner package, the Turn off notification
   button doesn’t appear.

## Managing email notifications for a scanner

Complete the following tasks to manage email notifications for a scanner:

* Configure email notifications for a scanner
* Turn off email notifications for a scanner

### Configure email notifications for a scanner

The following conditions must be met before you can configure email notifications for a scanner:

* The scanner must be enabled. For information about enabling a scanner, see
  [Enable or disable a scanner in a scanner package](using-the-trust-center.md).
* The scanner’s package must have email notifications enabled. For more information, see
  Configure email notifications for a scanner package.

To configure email notifications for a scanner, complete the following steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select  More for the scanner, and then select Edit notification.
7. Specify whether to inherit the email notification from the scanner package:

   * To inherit the email notification configuration from the scanner package, select
     Use the same trigger and recipients as package notification.
   * To specify a notification configuration that’s different from the scanner package, make sure
     Use the same trigger and recipients as package notification isn’t selected, and then set
     the Minimum severity level trigger and Recipients for the scanner:

     1. Set the Minimum severity level trigger.

        The Trust Center sends email notifications for findings at the specified level or higher. For example, if
        the Minimum severity level trigger is set to Medium, the Trust Center sends findings
        with a severity of medium, high, or critical, but not low.
     2. For Recipients, select Admin users or Custom. For more information,
        see Email notification recipients.
8. To save your changes, select Done, or select Cancel to cancel them.

### Turn off email notifications for a scanner

To turn off email notifications for a scanner, complete the following steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select  More for the scanner, and then select Edit notification.
7. Select Turn off notification, and then select Turn off in the confirmation window.

   If email notifications aren’t turned on for the scanner, the Turn off notification
   button doesn’t appear.

---
title: Sending notifications for data quality issues
source: https://docs.snowflake.com/en/user-guide/data-quality-notifications.md
section: User Guide
---

# Sending notifications for data quality issues

Snowflake provides the following features that identify when the value returned by a data metric function (DMF) indicates a data
quality issue:

* [Expectations](data-quality-expectations.md) — Lets you use a Boolean expression to compare the output of a DMF to
  an expected value. A return value that doesn’t match the Boolean expression is considered an expectation violation.
* [Anomaly detection](data-quality-anomaly.md) — Snowflake automatically detects when the output of the DMF
  constitutes an anomaly. An anomaly occurs when the value returned by a DMF is above or below an expected range based on historical
  data.

You can send a notification when either of these features identifies a data quality issue. After
Snowflake is configured, a notification is sent whenever an expectation is violated or Snowflake identifies an anomaly.

You enable notifications at the database level. Once enabled, all objects with an associated DMF in that database generate
notifications when there is a quality issue. Within a database that is enabled for notifications, you can turn off notifications for
a specific association between an object in the database and a DMF.

## Workflow

Configuring Snowflake to send notifications for data quality issues consists of the following tasks:

1. Configure who receives notifications.
2. Grant access control privileges to the database owner.
3. Modify the database to configure and turn on notifications.

For an end-to-end example of this workflow, see Extended example.

## Configure who receives notifications

Define notification recipients by adding email addresses directly, or by creating a notification integration.

A notification integration is a Snowflake object that provides an interface between Snowflake and third-party messaging services. To
send notifications for data quality issues, create a notification integration for the messaging service. Data quality monitoring
supports the following types of notifications:

* Email notifications
* Notifications sent via external systems such as Slack, using webhooks.

If you want to add email addresses without creating an email integration, see Configure database settings for data quality notifications.

### Send notifications via email

To send notifications to a list of email addresses, execute a
[CREATE NOTIFICATION INTEGRATION](../sql-reference/sql/create-notification-integration-email.md) statement to create an integration
of type `EMAIL`. Your integration must use the ALLOWED_RECIPIENTS parameter to specify a list of email addresses where
notifications are sent. You can only add email addresses that are verified. For information about verifying an email address, see
[Verify the email addresses of the email notification recipients](notifications/email-notifications.md).

> **Tip:**
>
> You can send email notifications to a distribution list or group that is managed outside of Snowflake. For more information, see
> the related [Knowledge Base article](https://community.snowflake.com/s/article/How-to-send-Alerts-and-Notifications-to-an-email-distribution-list-or-group-and-manage-the-group-membership-outside-of-Snowflake).

For example, to create a notification integration so user `joe.smith@example.com` can be emailed when there is a data quality
issue, run the following command:

```sqlexample
CREATE NOTIFICATION INTEGRATION my_email_int
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS = ('joe.smith@example.com');
```

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

### Send notifications by using a webhook for an external system

You can send data quality notifications via an external system by creating a webhook integration. For a list of the external
systems that you can use, see [Sending webhook notifications](notifications/webhook-notifications.md).

To use webhooks to send data quality notifications, complete the following steps:

1. [Create a secret for a webhook URL](notifications/webhook-notifications.md).
2. [Create a webhook notification integration](notifications/webhook-notifications.md).

For example, if you want to use Slack to send notifications, you might run the following commands:

```sqlexample
CREATE OR REPLACE SECRET my_slack_webhook_secret
  TYPE = GENERIC_STRING
  SECRET_STRING = 'T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX';

CREATE OR REPLACE NOTIFICATION INTEGRATION my_slack_webhook_int
  TYPE=WEBHOOK
  ENABLED=TRUE
  WEBHOOK_URL='https://hooks.slack.com/services/SNOWFLAKE_WEBHOOK_SECRET'
  WEBHOOK_SECRET=my_db.sch1.my_slack_webhook_secret
  WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
  WEBHOOK_HEADERS=('Content-Type'='application/json');
```

## Grant privileges

To set up notifications for objects within a database, the database owner must have the following privileges:

* MANAGE DATA QUALITY on the account
* USAGE on any notification integration that is used to send notifications. This is required only if a notification integration is used.

For example, suppose a user with the `data_steward` role is the owner of database `my_db`. To use the notification integration
`my_email_int` to send notifications for quality issues uncovered by DMFs associated with tables and views in `my_db`, run the
following commands:

```sqlexample
GRANT MANAGE DATA QUALITY ON ACCOUNT TO ROLE data_steward;
GRANT USAGE ON INTEGRATION my_email_int TO ROLE data_steward;
```

## Configure database settings for data quality notifications

You can turn on notifications for a database by
running an [ALTER DATABASE](../sql-reference/sql/alter-database.md) statement with the DATA_QUALITY_MONITORING_SETTINGS property.
This property uses a [dollar-quoted](../sql-reference/data-types-text.md) YAML specification to define the notification settings.

DATA_QUALITY_MONITORING_SETTINGS specifies the following aspects of data quality notifications:

* Whether notifications are enabled or disabled for the database.
* Which email addresses receive notifications (specified without a notification integration).
* Which [notification integrations](../sql-reference/sql/create-notification-integration.md), if any, send the notifications. You can specify
  multiple notification integrations to send notifications through different channels.
* How often notifications are sent.
* Whether the notifications include the name of the specific table or view that has the data quality issue. This metadata helps
  quickly identify and address the problem.

For example:

> ```sqlexample-yaml
> ALTER DATABASE my_db SET DATA_QUALITY_MONITORING_SETTINGS =
>   $$
>   notification:
>     enabled: TRUE
>     email_recipients: [ 'joe@example.com', 'mary@example.com']
>     integrations:
>       - WEBHOOK_NOTIFY_INT
>     cooldown_hours: 4
>     metadata_included: TRUE
>   $$;
> ```

This example specifies the following configuration:

* Notifications are enabled for the database `my_db`.
* Notifications are sent to two email addresses and one external channel.
* Notifications aren’t sent more frequently than once every four hours.
* Notifications include metadata that identifies the object and its associated DMF.

## Turn off notifications for a specific DMF association

By default, after you turn on notifications for a database, data quality issues in any object within the database generate a
notification. You can turn off notifications for a specific association between an object and a DMF to prevent notifications from
being sent. To turn off notifications for an association, run an ALTER <object> MODIFY DATA METRIC FUNCTION statement to set the
DATA_QUALITY_NOTIFICATION parameter to FALSE.

For example, suppose notifications are turned on for the database that contains view `v2`. If you don’t want notifications to be
sent when the BLANK_COUNT DMF finds quality issues with column `c1`, run the following command:

```sqlexample
ALTER VIEW v2
  MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.BLANK_COUNT ON (c1)
    SET DATA_QUALITY_NOTIFICATION = FALSE;
```

## Determine whether notifications are turned on

The [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/functions/data_metric_function_references.md) function returns information about the association between an
object and a DMF. The output includes a column `data_quality_notification_status`, which you can use to determine whether
notifications are turned on for the association.

## Extended example

Suppose you have the following items in your account:

* A database `my_db` that contains two tables (`t1` and `t2`) and one view (`v1`).
* Tables `t1` and `t2` that are associated with the ROW_COUNT DMF, and anomaly detection is turned on for both associations.
* Role `analyst` is the owner of `my_db`.
* View `v1` is associated with the NULL_COUNT DMF, and there is an expectation defined for the association.

You want users to receive an email when there is an anomaly in tables `t1` or `t2`, but you don’t want a notification sent when
there is a quality issue with view `v1`.

> **Note:**
>
> This example demonstrates how to use a notification integration to specify email addresses. You can also specify email addresses directly
> when running the ALTER DATABASE command.

1. Create a notification integration that indicates who should receive
   notifications when there is a data quality issue:

   ```sqlexample
   CREATE NOTIFICATION INTEGRATION notify_int
     TYPE=EMAIL
     ENABLED=TRUE
     ALLOWED_RECIPIENTS=('joe.smith@example.com');
   ```
2. Grant privileges to the role `analyst`,
   which is the owner of `my_db`:

   ```sqlexample
   GRANT MANAGE DATA QUALITY ON ACCOUNT TO ROLE analyst;
   GRANT USAGE ON INTEGRATION notify_int TO ROLE analyst;
   ```
3. Configure the database settings to turn on notifications. These notifications
   will include the name of the object that had the data quality issue.

   ```sqlexample-yaml
   ALTER DATABASE my_db SET DATA_QUALITY_MONITORING_SETTINGS =
     $$
     notification:
       enabled: TRUE
       integrations:
         - NOTIFY_INT
       metadata_included: TRUE
     $$
   ```
4. Turn off notifications for an association between view `v1` and the
   NULL_COUNT DMF:

   ```sqlexample
   ALTER VIEW v1
     MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT ON (c1)
       SET DATA_QUALITY_NOTIFICATION = FALSE;
   ```

---
title: Sending notifications to cloud provider queues (Amazon SNS, Google Cloud PubSub, and Azure Event Grid)
source: https://docs.snowflake.com/en/user-guide/notifications/queue-notifications.md
section: User Guide
---

# Sending notifications to cloud provider queues (Amazon SNS, Google Cloud PubSub, and Azure Event Grid)

You can configure Snowflake to send notifications to a queue provided by a cloud service (Amazon SNS, Google Cloud PubSub, or
Azure Event Grid).

* To configure [Snowpipe](../data-load-snowpipe-intro.md) or specific [tasks](../tasks-intro.md) to send
  notifications about errors to a queue, see the following topics:

  + [Snowpipe error notifications](../data-load-snowpipe-errors.md)
  + [Set up error notifications for tasks](../tasks-errors.md)
* To call a stored procedure to send a notification to a queue:

  1. Create a notification integration for the cloud provider queue. For details, see the following topics:

     + [Creating a notification integration to send notifications to an Amazon SNS topic](creating-notification-integration-amazon-sns.md)
     + [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](creating-notification-integration-azure-event-grid.md)
     + [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](creating-notification-integration-google-pubsub.md)
     > **Note:**
     >
     > Your account must be on the same [cloud platform](../intro-cloud-platforms.md) as the cloud provider queue.
  2. Call the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure to send the notification
     message to the queue. For details, see [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](snowflake-notifications.md).

---
title: Sending webhook notifications
source: https://docs.snowflake.com/en/user-guide/notifications/webhook-notifications.md
section: User Guide
---

# Sending webhook notifications

You can integrate Snowflake notifications with the following external systems by using the webhooks that these systems provide:

* [Slack](https://api.slack.com/messaging/webhooks)
* [Microsoft Teams](https://support.microsoft.com/en-us/office/create-incoming-webhooks-with-workflows-for-microsoft-teams-8ae491c7-0394-4861-ba59-055e33f75498)
* [PagerDuty](https://developer.pagerduty.com/docs/ZG9jOjExMDI5NTgw-events-api-v2-overview)

> **Note:**
>
> Snowflake does not send webhook notifications to external systems other than the ones listed above.

To send a notification to one of these systems:

1. Create the secret for the webhook URL for the external system.
2. Create the webhook notification integration for the external system.
3. Send the notification to the external system, using the webhook notification integration.

The next sections provide more details about how to set up and send notifications to these external systems.

## Creating a secret for a webhook URL

Most webhooks require a secret or integration key in the incoming HTTP request. For example:

* When you [create an incoming webhook in Slack](https://api.slack.com/messaging/webhooks#create_a_webhook), the URL for the webhook includes a secret:

  ```none
  https://hooks.slack.com/services/<secret>
  ```
* When you [create an incoming webhook with Workflows for Microsoft Teams](https://support.microsoft.com/en-us/office/create-incoming-webhooks-with-workflows-for-microsoft-teams-8ae491c7-0394-4861-ba59-055e33f75498), the URL for the webhook includes a secret.

  Up until November 30, 2025, Microsoft Teams supports URLs in the following format:

  ```none
  https://<hostname>.<region>.logic.azure.com:443/workflows/<secret>
  ```

  [From November 30, 2025 onward](https://learn.microsoft.com/en-us/troubleshoot/power-platform/power-automate/flow-run-issues/triggers-troubleshoot?tabs=new-designer#changes-to-http-or-teams-webhook-trigger-flows),
  Microsoft Teams supports URLs in the following format:

  ```none
  https://default<hostname>.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/<secret>/triggers/manual/paths/invoke
  ```
* When you [set up an integration for your PagerDuty service](https://support.pagerduty.com/docs/services-and-integrations), the integration provides an integration key that you must
  include in webhook requests:

  ```json
  {
     "routing_key" : "<integration_key>",
     /* ... */
  ```

For this secret or integration key, we recommend creating a secret object of the generic string type. This secret object is used
in the following ways:

* When you create a webhook notification integration, you specify this secret object in the
  [CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-webhooks.md) statement.
* When you send a notification, the secret object is used to construct the HTTP request for the webhook.

Note the following:

* When you create the webhook notification integration, you must use a role that has the USAGE privilege on this secret.
* When you send a notification to this webhook, you must use a role that has the READ privilege on this secret as well as the
  USAGE privileges on the database and schema containing the secret.

To create this object, use the [CREATE SECRET](../../sql-reference/sql/create-secret.md) command, and specify TYPE=GENERIC_STRING. You must use a
role that has the CREATE SECRET privilege on the schema where you plan to create that object.

The next sections provide examples of creating the secret object.

* Example 1: Creating a secret for a Slack webhook
* Example 2: Creating a secret for a Workflows for Microsoft Teams webhook
* Example 3: Creating a secret for a PagerDuty webhook

### Example 1: Creating a secret for a Slack webhook

Suppose that you want to send notifications to a [Slack webhook](https://api.slack.com/messaging/webhooks#create_a_webhook) with the URL:

```none
https://hooks.slack.com/services/T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX
```

In this example, the webhook URL contains the secret `T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX`.

Execute the following statement to create a secret object for this secret:

```sqlexample
CREATE OR REPLACE SECRET my_slack_webhook_secret
  TYPE = GENERIC_STRING
  SECRET_STRING = 'T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX';
```

### Example 2: Creating a secret for a Workflows for Microsoft Teams webhook

Suppose that you want to send notifications to a [Workflows for Microsoft Teams webhook](https://support.microsoft.com/en-us/office/create-incoming-webhooks-with-workflows-for-microsoft-teams-8ae491c7-0394-4861-ba59-055e33f75498) with one of the following URLs:

* Up until November 30, 2025:

  ```none
  https://prod-114.westeurope.logic.azure.com:443/workflows/xxxxxxxx
  ```
* [From November 30, 2025 onward](https://learn.microsoft.com/en-us/troubleshoot/power-platform/power-automate/flow-run-issues/triggers-troubleshoot?tabs=new-designer#changes-to-http-or-teams-webhook-trigger-flows):

  ```none
  https://defaultcac999b557e445acf1fefefe4ae5ff4.34.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/xxxxxxxx/triggers/manual/paths/invoke
  ```

For information about the Microsoft API data format, see <https://adaptivecards.io/> .

In this example, the webhook URL contains the secret `xxxxxxxx`.

Execute the following statement to create a secret object for this secret:

```sqlexample
CREATE OR REPLACE SECRET my_teams_webhook_secret
  TYPE = GENERIC_STRING
  SECRET_STRING = 'xxxxxxxx';
```

### Example 3: Creating a secret for a PagerDuty webhook

Suppose that you want to send notifications to a [PagerDuty webhook](https://support.pagerduty.com/docs/services-and-integrations) and that your integration key (the value that you must
include in the `routing_key` field in requests) is:

```none
xxxxxxxx
```

Execute the following statement to create a secret object for this secret:

```sqlexample
CREATE OR REPLACE SECRET my_pagerduty_webhook_secret
  TYPE = GENERIC_STRING
  SECRET_STRING = 'xxxxxxxx';
```

## Creating a webhook notification integration

To create a notification integration of the webhook type, use the
[CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration-webhooks.md) command.

When executing this command, set the following properties to set up the HTTP request that should be sent for the notification.

* Set TYPE to WEBHOOK.
* If you created a secret object for a secret to be included in the URL, HTTP request
  body, or header, set WEBHOOK_SECRET to the name of that secret object.
* Set WEBHOOK_URL to the URL for the webhook.

  If the webhook URL includes a secret and you created a secret object for this secret, replace the secret in the URL with
  SNOWFLAKE_WEBHOOK_SECRET.
* If the body of the message for the webhook needs to be in a specific format for this external system (for example, if all
  messages sent to this system need to use the same format), set WEBHOOK_BODY_TEMPLATE to a template for the message. In this
  template:

  + Use the SNOWFLAKE_WEBHOOK_SECRET placeholder where the secret should appear in the body of the message.
  + Use the SNOWFLAKE_WEBHOOK_MESSAGE placeholder where the notification message should appear.

  When you call [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) and pass in a message, the stored
  procedure uses the template to construct the body of the webhook request. The stored procedure replaces the
  SNOWFLAKE_WEBHOOK_MESSAGE placeholder with the message that you pass in.
* If the HTTP request to the webhook must include specific HTTP headers, set WEBHOOK_HEADERS to the list of the header names and
  values.

  Use the SNOWFLAKE_WEBHOOK_SECRET placeholder where the secret should appear in the value of a header.

The next sections provide examples of creating webhook notification integrations for different types of external systems.

* Example 1: Creating a notification integration for a Slack webhook
* Example 2: Creating a notification integration for a Workflows for Microsoft Teams webhook
* Example 3: Creating a notification integration for a PagerDuty webhook

### Example 1: Creating a notification integration for a Slack webhook

Suppose that you want to send notifications to a Slack webhook with the URL:

```none
https://hooks.slack.com/services/T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX
```

Suppose that you created a secret object named `my_slack_webhook_secret`
for the secret `T00000000/B00000000/XXXXXXXXXXXXXXXXXXXXXXXX` that appears in the URL.

Execute the following statement to create a notification integration for this webhook:

```sqlexample
CREATE OR REPLACE NOTIFICATION INTEGRATION my_slack_webhook_int
  TYPE=WEBHOOK
  ENABLED=TRUE
  WEBHOOK_URL='https://hooks.slack.com/services/SNOWFLAKE_WEBHOOK_SECRET'
  WEBHOOK_SECRET=my_secrets_db.my_secrets_schema.my_slack_webhook_secret
  WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
  WEBHOOK_HEADERS=('Content-Type'='application/json');
```

### Example 2: Creating a notification integration for a Workflows for Microsoft Teams webhook

Suppose that you want to send notifications to a Workflows for Microsoft Teams webhook with one of the following URLs:

* Up until November 30, 2025:

  ```none
  https://prod-114.westeurope.logic.azure.com:443/workflows/xxxxxxxx
  ```
* [From November 30, 2025 onward](https://learn.microsoft.com/en-us/troubleshoot/power-platform/power-automate/flow-run-issues/triggers-troubleshoot?tabs=new-designer#changes-to-http-or-teams-webhook-trigger-flows):

  ```none
  https://defaultcac999b557e445acf1fefefe4ae5ff4.34.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/xxxxxxxx/triggers/manual/paths/invoke
  ```

Suppose that you created a secret object named `my_teams_webhook_secret`
for the secret `xxxxxxxx` that appears in the URL.
(For information about the Microsoft API data format, see <https://adaptivecards.io/> .)

Execute one of the following statements to create a notification integration for this webhook:

* For the `logic.azure.com` URL:

  ```sqlexample
  CREATE OR REPLACE NOTIFICATION INTEGRATION my_teams_webhook_int
    TYPE=WEBHOOK
    ENABLED=TRUE
    WEBHOOK_URL='https://prod-114.westeurope.logic.azure.com/workflows/SNOWFLAKE_WEBHOOK_SECRET'
    WEBHOOK_SECRET=my_secrets_db.my_secrets_schema.my_teams_webhook_secret
    WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
    WEBHOOK_HEADERS=('Content-Type'='application/json');
  ```
* For the `environment.api.powerplatform.com` URL:

  ```sqlexample
  CREATE OR REPLACE NOTIFICATION INTEGRATION my_teams_webhook_int
    TYPE=WEBHOOK
    ENABLED=TRUE
    WEBHOOK_URL='https://defaultcac999b557e445acf1fefefe4ae5ff4.34.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/xxxxxxxx/triggers/manual/paths/invoke'
    WEBHOOK_SECRET=my_secrets_db.my_secrets_schema.my_teams_webhook_secret
    WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
    WEBHOOK_HEADERS=('Content-Type'='application/json');
  ```

> **Note:**
>
> You must omit the port number (`:443`) from the URL in the WEBHOOK_URL parameter.

### Example 3: Creating a notification integration for a PagerDuty webhook

Suppose that you want to send notifications to a PagerDuty webhook with the URL:

```none
https://events.pagerduty.com/v2/enqueue
```

Suppose that you created a secret object named `my_pagerduty_webhook_secret`
for the integration key `xxxxxx` that should be included in the `routing_key` field in the body of the message.

Execute the following statement to create a notification integration for this webhook:

```sqlexample
CREATE OR REPLACE NOTIFICATION INTEGRATION my_pagerduty_webhook_int
  TYPE=WEBHOOK
  ENABLED=TRUE
  WEBHOOK_URL='https://events.pagerduty.com/v2/enqueue'
  WEBHOOK_SECRET=my_secrets_db.my_secrets_schema.my_pagerduty_webhook_secret
  WEBHOOK_BODY_TEMPLATE='{
    "routing_key": "SNOWFLAKE_WEBHOOK_SECRET",
    "event_action": "trigger",
    "payload": {
      "summary": "SNOWFLAKE_WEBHOOK_MESSAGE",
      "source": "Snowflake monitoring",
      "severity": "INFO"
    }
  }'
  WEBHOOK_HEADERS=('Content-Type'='application/json');
```

## Sending a notification to a webhook

To send a notification to a webhook:

1. Pass the [SANITIZE_WEBHOOK_CONTENT](../../sql-reference/functions/sanitize_webhook_content.md) function to remove any placeholders (like
   SNOWFLAKE_WEBHOOK_SECRET) from the message.
2. Call the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored
   procedure, passing in the sanitized message and specifying the name of the webhook notification integration to use.

For example, the following statement sends a JSON message to a Slack webhook, using the notification integration that you
created earlier:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_PLAIN(
    SNOWFLAKE.NOTIFICATION.SANITIZE_WEBHOOK_CONTENT('my message')
  ),
  SNOWFLAKE.NOTIFICATION.INTEGRATION('my_slack_webhook_int')
);
```

In this example, the statement passes in a message in plain text (`my message`). When constructing the body of the webhook
request from the template specified by the WEBHOOK_BODY_TEMPLATE property of the notification integration,
SYSTEM$SEND_SNOWFLAKE_NOTIFICATION replaces the SNOWFLAKE_WEBHOOK_MESSAGE placeholder with the message that you pass in.

For example, suppose that you specified the following template for the body of the request:

```sqlexample
CREATE OR REPLACE NOTIFICATION INTEGRATION my_slack_webhook_int
  ...
  WEBHOOK_BODY_TEMPLATE='{"text": "SNOWFLAKE_WEBHOOK_MESSAGE"}'
  ...
```

SYSTEM$SEND_SNOWFLAKE_NOTIFICATION constructs a request with the following body:

```json
{"text": "my message"}
```

---
title: Set up and manage notification contacts for Snowflake
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-contacts.md
section: User Guide
---

# Set up and manage notification contacts for Snowflake

Snowflake sends security, privacy, and product notifications via email. You can use Snowsight to set up and manage which
notifications you want to be sent to which email addresses in your organization. You can set up notifications at the
Snowflake organization level or the account level.

## About notification contacts for Snowflake

To receive security, privacy, and product notifications from Snowflake, specify the email addresses to receive those notifications in the
Notification Contacts page of Snowsight.

Setting up contacts depends on your role:

* Users without the ACCOUNTADMIN role or the ORGADMIN role cannot view Contacts.
* If the ORGADMIN role is granted to your user, you can view and set up contact information for the organization.
* If the ACCOUNTADMIN role is granted to your user, you can view and set up contact information for a specific account.
  If no contacts are provided at the account level, notification contacts set at the organization level are used.
  If you have the ACCOUNTADMIN role but are not granted the ORGADMIN role, you cannot see notification contacts set up at the organization
  level.

If you have privileges to do so, you can set up notification contacts for your entire Snowflake organization, or configure specific
notification contacts for each account. You can specify multiple email addresses for each type of notification.

The following notification types are sent by Snowflake:

* Security notifications
* Privacy notifications
* Product notifications, including:

  + Behavior change notifications, such as upcoming behavior changes that might affect your account. See [About Behavior Changes](../release-notes/intro-bcr-releases.md).
  + Driver support notifications, such as notifications related to client versions and driver support. See [Client versions & support policy](../release-notes/requirements.md).
  + Operational status notifications, such as notifications related to [Snowflake’s operational status](https://status.snowflake.com/).
  + Time-sensitive notifications, such as notifications that might require immediate attention or action.

Emails are sent in accordance with the [Snowflake Privacy Notice](https://www.snowflake.com/privacy-policy/).

## Set up notification contacts for Snowflake

To set up notification contacts for Snowflake, you must be granted the ACCOUNTADMIN role or the ORGADMIN role.

Set up notification contacts:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Admin contacts.
3. If you have the ORGADMIN role, you see the Organization tab. If you want to specify account-level notification contacts, select
   Account.
4. For each type of notification that you want to set up, select Edit or the pencil icon.
5. Add one or more email addresses, pressing Enter to add each address. You cannot use commas to separate multiple email addresses.
6. Select Save to save the email addresses as notification contacts.

> **Note:**
>
> When logging into Snowsight, users with the ACCOUNTADMIN or ORGADMIN role will receive a prompt to add any missing critical contact
> emails or update outdated ones.

### Example: Set up separate time sensitive notifications

For example, if you want the Snowflake administrators to get all product notifications, but want the on-call Snowflake administrators to
see the time-sensitive notifications, do the following:

* For Product Notifications, enter the email address: `snowflake-admin-info@example.com`.
* For Time Sensitive Notifications, enter the email address: `snowflake-admin-oncall@example.com`.

In this example, all product notifications, including the time-sensitive notifications, are sent to `snowflake-admin-info@example.com`.
Only the time-sensitive notifications are sent to the on-call administrators.

### Example: Set up different email addresses for each notification type

Alternatively, if you want different groups to get different types of notifications, you could set up notification contacts differently:

* For Product Notifications, enter the email address: `snowflake-admin-info@example.com` to get all product
  notifications including the Behavior Change notifications, Driver Support notifications, Operational Status notifications, and Time
  Sensitive notifications.
* For Behavior Change Notifications, enter the email address: `snowflake-admin-testing@example.com` to send behavior change
  emails to the administrators responsible for testing Snowflake.
* For Driver Support Notifications, enter the email address: `snowflake-admin-connect@example.com` to send client version and
  driver support details to the team responsible for maintaining the clients and drivers that people in your organization use to connect to Snowflake.
* For Operational Status Notifications, enter the email address: `snowflake-users@example.com` to send all Snowflake users
  status messages about Snowflake, to reduce the number of questions sent to your team.
* For Time Sensitive Notifications, enter the email address: `snowflake-admin-oncall@example.com` to alert the administrators
  on call about any time-sensitive issues.

### Example: Set up some organization-level and some account-level notifications

If you want one group to get all product notifications for the entire organization, the privacy testing group to get specific notifications
for the VPS account in your organization, and the analyst testing group to get specific notifications for your other account, you could do
something like the following example:

* On the Organization tab, for Product Notifications, enter the email address: `snowflake-admin-org@example.com`.
* As a user signed in to the VPS account that is granted the ACCOUNTADMIN role, go to the Account tab. In the
  Behavior Change Notifications section, enter the email address: `snowflake-admin-vps-account-testing@example.com`.
* As a user signed in to the other account that is granted the ACCOUNTADMIN role, go to the Account tab. In the
  Behavior Change Notifications section, enter the email address: `snowflake-admin-testing@example.com`.

In this example, the administrators for the entire Snowflake organization get all product notifications for all accounts in the organization,
and the testing groups for each account get the behavior change notifications for the accounts that they manage.

---
title: Set up error notifications for tasks
source: https://docs.snowflake.com/en/user-guide/tasks-errors.md
section: User Guide
---

# Set up error notifications for tasks

Snowflake can push notifications to a cloud messaging service when it encounters errors while executing tasks, or when a task graph finishes successfully.
The notifications describe the errors encountered when a task executes SQL code, or identify the successfully completed task graphs.

This topic explains how to configure notification support for tasks that use cloud messaging.

Snowflake task integration is implemented using notification integration objects, which provide an interface between Snowflake and
third-party cloud message queuing services.

Snowflake guarantees at-least-once message delivery of notifications; that is, multiple attempts are made to deliver messages to ensure at
least one attempt succeeds, which can result in duplicate messages.

The task notification feature is supported for both serverless tasks and user-managed tasks; that is, tasks that rely on a virtual warehouse
to provide the compute resources.

Notifications rely on cloud messaging that uses one of the following services:

* Amazon Simple Notification Service (SNS)
* Microsoft Azure Event Grid
* Google Pub/Sub

Currently, cross-cloud support isn’t available for push notifications.
You must configure notification support for the messaging service that is provided by the cloud platform where your Snowflake account is hosted.

The email and webhook notification integration types aren’t supported for task error notifications.

You can use the NOTIFICATION_HISTORY table function to query the history of notifications sent through Snowpipe. For more information, see [NOTIFICATION_HISTORY](../sql-reference/functions/notification_history.md).

To set up task notifications, complete the following steps:

1. Create a topic to receive the notifications, and set up a notification integration for that topic.

   For more information, see the instructions for your platform:

   * [AWS SNS](notifications/creating-notification-integration-amazon-sns.md)
   * [Google Pub/Sub](notifications/creating-notification-integration-google-pubsub.md)
   * [Azure Event Grid](notifications/creating-notification-integration-azure-event-grid.md)
2. Create or configure the task to use the notification integration for error and success notifications.

   See [Configure a task to send error notifications](tasks-errors-integrate.md) and [Configure a task to send success notifications](tasks-success-integrate.md).

---
title: Setting up alerts based on data in Snowflake
source: https://docs.snowflake.com/en/user-guide/alerts.md
section: User Guide
---

# Setting up alerts based on data in Snowflake

This topic explains how to set up an alert that periodically performs an action under specific conditions, based on data within
Snowflake.

## Introduction

In some cases, you might want to be notified or take action when data in Snowflake meets certain conditions. For example, you
might want to receive a notification when:

* The warehouse credit usage increases by a specified percentage of your current quota.
* The resource consumption for your pipelines, tasks, materialized views, etc. increases beyond a specified amount.
* Your data fails to comply with a particular business rule that you have set up.

To do this, you can set up a Snowflake alert. A Snowflake alert is a schema-level object that specifies:

* A condition that triggers the alert (e.g. the presence of queries that take longer than a second to complete).
* The action to perform when the condition is met (e.g. send an email notification, capture some data in a table, etc.).
* When and how often the condition should be evaluated (e.g. every 24 hours, every Sunday at midnight, etc.).

For example, suppose that you want to send an email notification when the credit consumption exceeds a certain limit for a
warehouse. Suppose that you want to check for this every 30 minutes. You can create an alert with the following properties:

* Condition: The credit consumption for a warehouse (the sum of the `credits_used` column in the
  [WAREHOUSE_METERING_HISTORY](../sql-reference/account-usage/warehouse_metering_history.md) view in the
  [ACCOUNT_USAGE](../sql-reference/account-usage.md)) schema exceeds a specified limit.
* Action: Email the administrator.
* Frequency / schedule: Check for this condition every 30 minutes.

## Choosing the type of alert

You can create the following types of alerts:

* Alert on a schedule: Snowflake evaluates the condition against the existing data on a
  scheduled basis.

  For example, you can set up a alert on a schedule to check if any of the existing rows in a table has a column value that
  exceeds a specified amount.
* Alert on new data: Snowflake evaluates the condition against any new rows in a specified
  table or a view.

  For example, you can set up an alert on new data to notify you when new rows for error messages are inserted into the
  [event table](../developer-guide/logging-tracing/event-table-setting-up.md) for your account. Because dynamic table refreshes
  and task executions log events to the event table, you can set up an alert on new data to:

  + [Monitor dynamic table refreshes](dynamic-tables-monitor-event-table-alerts.md).
  + [Monitor task executions](tasks-events.md).

### Alerts on a schedule

With an alert on a schedule, you can set up an alert to execute every `n` minutes or on a schedule specified by a cron
expression.

The condition of the alert is evaluated on all of the data (as opposed to alerts on new data, where conditions are evaluated
against only the new rows that have been inserted).

### Alerts on new data

With an alert on new data, you can set up an alert to execute only when new rows are inserted in a table or are made available
in a view.

Whenever new rows are inserted, the alert executes, evaluating the condition against just the new rows, and performing the action
if the condition evaluates to TRUE.

If you want to evaluate a condition on newly inserted rows, use an alert on new data, rather than setting up an alert on a
schedule (which executes on a fixed schedule, regardless of whether or not data has been added).

Because the alert operates only on newly inserted rows in a table or view, there are restrictions on the condition that you can
specify:

* In the SELECT statement, the FROM clause can specify only one regular table, view, or event table.
* You must [enable change tracking](streams-manage.md) on that table or view.
* You cannot use:

  + [Common table expressions (CTEs)](queries-cte.md)
  + [Data Manipulation Language (DML) commands](../sql-reference/sql-dml.md)
  + Calls to stored procedures
  + Joins

> **Note:**
>
> You cannot use the [EXECUTE ALERT](../sql-reference/sql/execute-alert.md) command to execute an alert on new data.

## Choosing the warehouse for the alerts

An alert requires a [warehouse](warehouses.md) for execution. You can either use
the serverless compute model or
a virtual warehouse that you specify.

### Using the serverless compute model (serverless alerts)

Alerts that use the serverless compute model called *serverless alerts*. If you use the serverless compute model, Snowflake
automatically resizes and scales the compute resources required for the alert. Snowflake determines the ideal size of the compute
resources for a given run based on a dynamic analysis of statistics for the most recent previous runs of the same alert. The
maximum size for a serverless alert run is equivalent to an XXLARGE warehouse. Multiple workloads in your account share a common
set of compute resources.

Billing is similar to other serverless features (such as serverless tasks). See Understanding the costs of alerts.

> **Note:**
>
> If you are creating an alert on new data that is added infrequently, consider
> configuring this as a serverless alert. If you configure the alert to use a warehouse instead, even a simple action that sends
> an email notification incurs at least one minute of warehouse cost.

### Using a virtual warehouse that you specify

If you want to specify a virtual warehouse, you must choose a warehouse that is sized appropriately for the SQL actions that
are executed by the alert. For guidelines on choosing a warehouse, see [Warehouse considerations](warehouses-considerations.md).

## Understanding the costs of alerts

The costs associated with running an alert to execute SQL code differ depending on the compute resources used for the alert:

* For serverless alerts, Snowflake bills your account based on compute resource usage. Charges are calculated based on your
  total usage of the resources, including cloud service usage, measured in *compute-hours* credit usage. The compute-hours cost
  changes based on warehouse size and query runtime. For more information, see [Serverless credit usage](cost-understanding-compute.md).

  To learn how many credits are consumed by alerts, refer to the “Serverless Feature Credit Table” in
  the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

  To view the usage history of serverless alerts, you can:

  + Call the [SERVERLESS_ALERT_HISTORY](../sql-reference/functions/serverless_alert_history.md) function.
  + Query the [SERVERLESS_ALERT_HISTORY view](../sql-reference/account-usage/serverless_alert_history.md).
* For alerts that use a virtual warehouse that you specify, Snowflake bills your account for
  [credit usage](cost-understanding-compute.md) based on the warehouse usage when an alert is running. This is
  similar to the warehouse usage for executing the same SQL statements in a client or Snowsight. Per-second credit
  billing and warehouse auto-suspend give you the flexibility to start with larger warehouse sizes and then adjust the size to
  match your alert workloads.

> **Tip:**
>
> If you want to set up an alert that evaluates new rows added to a table or view, use an
> alert on new data, rather than an alert on a schedule. An alert on a schedule will
> execute at a scheduled time, regardless of whether or not new rows have been inserted.

## Granting the privileges to create alerts

In order to create an alert, you must use a role that has the following privileges:

* The EXECUTE ALERT privilege on the account.

  > **Note:**
  >
  > This privilege can only be granted by a user with the ACCOUNTADMIN role.
* One of the following privileges:

  + The EXECUTE MANAGED ALERT privilege on the account, if you are creating a serverless alert.
  + The USAGE privilege on the warehouse used to execute the alert, if you are specifying a virtual warehouse for the alert.
* The USAGE and CREATE ALERT privileges on the schema in which you want to create the alert.
* The USAGE privilege on the database containing the schema.
* The SELECT privilege on the table or view that you want to query in the alert condition (if you are creating an
  alert on new data).

To grant these privileges to a role, use the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command.

For example, suppose that you want to create a custom role named `my_alert_role` that has the privileges to create an alert in
the schema named `my_schema`. You want the alert to use the warehouse `my_warehouse`.

To do this:

1. Have a user with the ACCOUNTADMIN role do the following:

   1. [Create the custom role](security-access-control-configure.md).

      For example:

      ```sqlexample
      USE ROLE ACCOUNTADMIN;

      CREATE ROLE my_alert_role;
      ```
   2. Grant the EXECUTE ALERT global privilege to that custom role.

      For example:

      ```sqlexample
      GRANT EXECUTE ALERT ON ACCOUNT TO ROLE my_alert_role;
      ```
   3. If you want to create a serverless alert, grant the EXECUTE MANAGED ALERT global privilege to that custom role.

      For example:

      ```sqlexample
      GRANT EXECUTE MANAGED ALERT ON ACCOUNT TO ROLE my_alert_role;
      ```
   4. Grant the custom role to a user.

      For example:

      ```sqlexample
      GRANT ROLE my_alert_role TO USER my_user;
      ```
2. Have the owners of the database, schema, and warehouse grant the privileges needed for creating the alert to the custom role:

   * The owner of the schema must grant the CREATE ALERT and USAGE privileges on the schema:

     ```sqlexample
     GRANT CREATE ALERT ON SCHEMA my_schema TO ROLE my_alert_role;
     GRANT USAGE ON SCHEMA my_schema TO ROLE my_alert_role;
     ```
   * The owner of the database must grant the USAGE privilege on the database:

     ```sqlexample
     GRANT USAGE ON DATABASE my_database TO ROLE my_alert_role;
     ```
   * If you want to specify a warehouse for the alert, the owner of that warehouse must grant the USAGE privilege on the
     warehouse:

     ```sqlexample
     GRANT USAGE ON WAREHOUSE my_warehouse TO ROLE my_alert_role;
     ```

## Creating an alert

The following sections provide the basic steps and an example of creating different types of alerts:

* Creating an alert on a schedule
* Creating an alert on new data

### Creating an alert on a schedule

Suppose that whenever one or more rows in a table named `gauge` has a value in the `gauge_value` column that exceeds 200,
you want to insert the current timestamp into a table named `gauge_value_exceeded_history`.

You can create an alert that:

* Evaluates the condition that `gauge_value` exceeds 200.
* Inserts the timestamp into `gauge_value_exceeded_history` if this condition evaluates to true.

To create an alert named `my_alert` that does this:

1. Verify that you are using a role that has the privileges to create an alert.

   If you are not using that role, execute the [USE ROLE](../sql-reference/sql/use-role.md) command to use that role.
2. Verify that you are using the database and schema in which you plan to create the alert.

   If you are not using that database and schema, execute the [USE DATABASE](../sql-reference/sql/use-database.md) and
   [USE SCHEMA](../sql-reference/sql/use-schema.md) commands to use that database and schema.
3. Execute the [CREATE ALERT](../sql-reference/sql/create-alert.md) command to create the alert:

   ```sqlexample
   CREATE OR REPLACE ALERT my_alert
     WAREHOUSE = mywarehouse
     SCHEDULE = '1 minute'
     IF( EXISTS(
       SELECT gauge_value FROM gauge WHERE gauge_value>200))
     THEN
       INSERT INTO gauge_value_exceeded_history VALUES (current_timestamp());
   ```

   If you want to create a serverless alert, omit the WAREHOUSE parameter:

   ```sqlexample
   CREATE OR REPLACE ALERT my_alert
     SCHEDULE = '1 minute'
     IF( EXISTS(
       SELECT gauge_value FROM gauge WHERE gauge_value>200))
     THEN
       INSERT INTO gauge_value_exceeded_history VALUES (current_timestamp());
   ```

   For the full description of the CREATE ALERT command, refer to [CREATE ALERT](../sql-reference/sql/create-alert.md).

   > **Note:**
   >
   > When you create an alert, the alert is suspended by default. You must resume the newly created alert in order for the alert
   > to execute.
4. Resume the alert by executing the [ALTER ALERT … RESUME](../sql-reference/sql/alter-alert.md) command. For example:

   ```sqlexample
   ALTER ALERT my_alert RESUME;
   ```

### Creating an alert on new data

Suppose that you want to receive an email notification when a stored procedure named `my_stored_proc` in the database and
schema `my_db.my_schema` logs a FATAL message to the
[active event table for your account](../developer-guide/logging-tracing/event-table-setting-up.md).

To create an alert named `my_alert` that does this:

1. Find the name of the active event table for your account:

   ```sqlexample
   SHOW PARAMETERS LIKE 'EVENT_TABLE' IN ACCOUNT;
   ```

   ```output
   +-------------+---------------------------+----------------------------+---------+-----------------------------------------+--------+
   | key         | value                     | default                    | level   | description                             | type   |
   |-------------+---------------------------+----------------------------+---------+-----------------------------------------+--------|
   | EVENT_TABLE | my_db.my_schema.my_events | snowflake.telemetry.events | ACCOUNT | Event destination for the given target. | STRING |
   +-------------+---------------------------+----------------------------+---------+-----------------------------------------+--------+
   ```
2. [Enable change tracking](streams-manage.md) on the table or view that you plan to query in the alert
   condition.

   ```sqlexample
   ALTER TABLE my_db.my_schema.my_events SET CHANGE_TRACKING = TRUE;
   ```
3. [Set up a notification integration for sending email](notifications/email-notifications.md).
4. Verify that you are using a role that has the privileges to create an alert.

   If you are not using that role, execute the [USE ROLE](../sql-reference/sql/use-role.md) command to use that role.
5. Verify that you are using database and schema in which you plan to create the alert.

   If you are not using that database and schema, execute the [USE DATABASE](../sql-reference/sql/use-database.md) and
   [USE SCHEMA](../sql-reference/sql/use-schema.md) commands to use that database and schema.
6. Execute the [CREATE ALERT](../sql-reference/sql/create-alert.md) command to create the alert, and omit the SCHEDULE parameter.

   For example, the following example creates an alert on new data that monitors the event table for errors in dynamic table
   refreshes and sends a notification to a Slack channel. The example assumes the following:

   * Your active event table is the [default event table](../developer-guide/logging-tracing/event-table-setting-up.md)
     (SNOWFLAKE.TELEMETRY.EVENTS).
   * You have [set the severity level](dynamic-tables-monitor-event-table-alerts.md) to capture events for your dynamic
     table.
   * You have [set up a webhook notification integration](notifications/webhook-notifications.md) for that Slack
     channel.

   ```sqlexample
   CREATE OR REPLACE ALERT my_alert
     WAREHOUSE = mywarehouse
     IF( EXISTS(
       SELECT * FROM SNOWFLAKE.TELEMETRY.EVENTS
         WHERE
           resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
           record_type='EVENT' AND
           value:"state"='ERROR'
     ))
     THEN
       BEGIN
         LET result_str VARCHAR;
         (SELECT ARRAY_TO_STRING(ARRAY_AGG(name)::ARRAY, ',') INTO :result_str
           FROM (
             SELECT resource_attributes:"snow.executable.name"::VARCHAR name
               FROM TABLE(RESULT_SCAN(SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID()))
               LIMIT 10
           )
         );
         CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
           SNOWFLAKE.NOTIFICATION.TEXT_PLAIN(:result_str),
           '{"my_slack_integration": {}}'
         );
       END;
   ```

   If you want to create a serverless alert, omit the WAREHOUSE parameter:

   ```sqlexample
   CREATE OR REPLACE ALERT my_alert
     IF( EXISTS(
       SELECT * FROM SNOWFLAKE.TELEMETRY.EVENTS
         WHERE
           resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
           record_type='EVENT' AND
           value:"state"='ERROR'
     ))
     THEN
       BEGIN
         LET result_str VARCHAR;
         (SELECT ARRAY_TO_STRING(ARRAY_AGG(name)::ARRAY, ',') INTO :result_str
           FROM (
             SELECT resource_attributes:"snow.executable.name"::VARCHAR name
               FROM TABLE(RESULT_SCAN(SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID()))
               LIMIT 10
           )
         );
         CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
           SNOWFLAKE.NOTIFICATION.TEXT_PLAIN(:result_str),
           '{"my_slack_integration": {}}'
         );
       END;
   ```

   For the full description of the CREATE ALERT command, refer to [CREATE ALERT](../sql-reference/sql/create-alert.md).

   > **Note:**
   >
   > When you create an alert, the alert is suspended by default. You must resume the newly created alert in order for the alert
   > to execute.
7. Resume the alert by executing the [ALTER ALERT … RESUME](../sql-reference/sql/alter-alert.md) command. For example:

   ```sqlexample
   ALTER ALERT my_alert RESUME;
   ```

## Specifying timestamps based on alert schedules

In some cases, you might need to define a condition or action based on the alert schedule.

For example, suppose that a table has a timestamp column that represents when a row was added, and you want to send an alert
if any new rows were added between the last alert that was successfully evaluated and the current scheduled alert. In other
words, you want to evaluate:

```sqlsyntax
<now> - <last_execution_of_the_alert>
```

If you use [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) and the scheduled time of the alert to calculate this range of
time, the calculated range does not account for latency between the time that the alert is scheduled and the time when the
alert condition is actually evaluated.

Instead, when you need the timestamps of the current schedule alert and the last alert that was successfully evaluated, use the
following functions:

* [SCHEDULED_TIME](../sql-reference/functions/scheduled_time.md) returns the timestamp representing when the current alert was scheduled.
* [LAST_SUCCESSFUL_SCHEDULED_TIME](../sql-reference/functions/last_successful_scheduled_time.md) returns the timestamp representing when the last successfully
  evaluated alert was scheduled.

These functions are defined in the [SNOWFLAKE.ALERT schema](../sql-reference/snowflake-db.md). To call these functions, you need
to use a role that has been granted the [SNOWFLAKE.ALERT_VIEWER database role](../sql-reference/snowflake-db-roles.md). To
grant this role to another role, use the [GRANT DATABASE ROLE](../sql-reference/sql/grant-database-role.md) command. For example, to grant this role
to the custom role `alert_role`, execute:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.ALERT_VIEWER TO ROLE alert_role;
```

The following example sends an email message if any new rows were added to `my_table` between the time that the last
successfully evaluated alert was scheduled and the time when the current alert has been scheduled:

```sqlexample
CREATE OR REPLACE ALERT alert_new_rows
  WAREHOUSE = my_warehouse
  SCHEDULE = '1 MINUTE'
  IF (EXISTS (
      SELECT *
      FROM my_table
      WHERE row_timestamp BETWEEN SNOWFLAKE.ALERT.LAST_SUCCESSFUL_SCHEDULED_TIME()
       AND SNOWFLAKE.ALERT.SCHEDULED_TIME()
  ))
  THEN CALL SYSTEM$SEND_EMAIL(...);
```

## Checking the results of the SQL statement for the condition in the alert action

Within the action of an alert, if you need to check the results of the SQL statement for the condition:

1. Call the [GET_CONDITION_QUERY_UUID](../sql-reference/functions/get_condition_query_uuid.md) function to get the query ID for the SQL statement for the
   condition.
2. Pass the query ID to the [RESULT_SCAN](../sql-reference/functions/result_scan.md) function to get the results of the execution of that SQL
   statement.

For example:

```sqlexample
CREATE ALERT my_alert
  WAREHOUSE = my_warehouse
  SCHEDULE = '1 MINUTE'
  IF (EXISTS (
    SELECT * FROM my_source_table))
  THEN
    BEGIN
      LET condition_result_set RESULTSET :=
        (SELECT * FROM TABLE(RESULT_SCAN(SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID())));
      ...
    END;
```

## Manually executing alerts

In some cases, you might need to execute an alert manually. For example:

* If you are creating a new alert, you might want to verify that the alert works as you would expect.
* You might want to execute the alert at a specific point in your data pipeline. For example, you might want to execute the
  alert at the end of a stored procedure call.

To execute an alert manually, run the [EXECUTE ALERT](../sql-reference/sql/execute-alert.md) command:

```sqlexample
EXECUTE ALERT my_alert;
```

> **Note:**
>
> You cannot use EXECUTE ALERT to execute an alert on new data.

The EXECUTE ALERT command manually triggers a single run of an alert, independent of the schedule defined for the alert.

You can execute this command interactively. You can also execute this command from within a stored procedure or a Snowflake
Scripting block.

For details on the privileges required to run this command and the effect of this command on suspended, running, and scheduled
alerts, see [EXECUTE ALERT](../sql-reference/sql/execute-alert.md).

## Suspending and resuming an alert

If you need to prevent an alert from executing temporarily, you can suspend the alert by executing the
[ALTER ALERT … SUSPEND](../sql-reference/sql/alter-alert.md) command. For example:

```sqlexample
ALTER ALERT my_alert SUSPEND;
```

To resume a suspended alert, execute the [ALTER ALERT … RESUME](../sql-reference/sql/alter-alert.md) command. For example:

```sqlexample
ALTER ALERT my_alert RESUME;
```

> **Note:**
>
> If you are not the owner of the alert, you must have the OPERATE privilege on the alert to suspend or resume the alert.

## Modifying an alert

To modify the properties of an alert, execute the [ALTER ALERT](../sql-reference/sql/alter-alert.md) command.

> **Note:**
>
> * You must be the owner of the alert to modify the properties of the alert.
> * You cannot change an alert on new data to an
>   alert on a schedule. Similarly, you cannot change an alert on a schedule to an alert
>   on new data.

For example:

* To change the warehouse for the alert named `my_alert` to `my_other_warehouse`, execute:

  ```sqlexample
  ALTER ALERT my_alert SET WAREHOUSE = my_other_warehouse;
  ```
* To change the schedule for the alert named `my_alert` to be evaluated every 2 minutes, execute:

  ```sqlexample
  ALTER ALERT my_alert SET SCHEDULE = '2 minutes';
  ```
* To change the condition for the alert named `my_alert` so that you are alerted if any rows in the table named `gauge` have
  values greater than `300` in the `gauge_value` column, execute:

  ```sqlexample
  ALTER ALERT my_alert MODIFY CONDITION EXISTS (SELECT gauge_value FROM gauge WHERE gauge_value>300);
  ```
* To change the action for the alert named `my_alert` to `CALL my_procedure()`, execute:

  ```sqlexample
  ALTER ALERT my_alert MODIFY ACTION CALL my_procedure();
  ```

## Dropping an alert

To drop an alert, execute the [DROP ALERT](../sql-reference/sql/drop-alert.md) command. For example:

```sqlexample
DROP ALERT my_alert;
```

To drop an alert without raising an error if the alert does not exist, execute:

```sqlexample
DROP ALERT IF EXISTS my_alert;
```

> **Note:**
>
> You must be the owner of the alert to drop the alert.

## Viewing details about an alert

To list the alerts that have been created in an account, database, or schema, execute the [SHOW ALERTS](../sql-reference/sql/show-alerts.md)
command. For example, to list the alerts that were created in the current schema, run the following command:

```sqlexample
SHOW ALERTS;
```

This command lists the alerts that you own and the alerts that you have the MONITOR or OPERATE privilege on.

To view the details about a specific alert, execute the [DESCRIBE ALERT](../sql-reference/sql/desc-alert.md) command. For example:

```sqlexample
DESC ALERT my_alert;
```

> **Note:**
>
> If you are not the owner of the alert, you must have the MONITOR or OPERATE privilege on the alert to view the details of the
> alert.

## Cloning an alert

You can clone an alert (either by using [CREATE ALERT … CLONE](../sql-reference/sql/create-alert.md) or by cloning the
database or schema containing the alert).

If you are cloning a serverless alert, you don’t need to use a role that has the global EXECUTE MANAGED ALERT privilege. However,
you will not be able to resume that alert until the role that owns the alert has been granted the EXECUTE MANAGED ALERT privilege.

## Monitoring the execution of alerts

To monitor the execution of the alerts, you can:

* Check the results of the action that was specified for the alert. For example, if the action inserted rows into a table, you can
  check the table for new rows.
* View the history of alert executions by using one of the following:

  + The [ALERT_HISTORY](../sql-reference/functions/alert_history.md) table function in the INFORMATION_SCHEMA schema.

    For example, to view the executions of alerts over the past hour, execute the following statement:

    ```sqlexample
    SELECT *
    FROM
      TABLE(INFORMATION_SCHEMA.ALERT_HISTORY(
        SCHEDULED_TIME_RANGE_START
          =>dateadd('hour',-1,current_timestamp())))
    ORDER BY SCHEDULED_TIME DESC;
    ```
  + The [ALERT_HISTORY](../sql-reference/account-usage/alert_history.md) view in the ACCOUNT_USAGE schema in the shared
    SNOWFLAKE database.

In the query history, the name of the user who executed the query will be SYSTEM. (The alerts are run by the
[system service](tasks-intro.md).)

## Viewing the query history of a serverless alert

To view the query history of a serverless alert, you must be the owner of the alert, or you must use a role that has the
MONITOR or OPERATE privilege on the alert itself. (This differs from alerts that use one your warehouses, which require the
MONITOR or OPERATOR privilege on the warehouse.)

For example, suppose that you want to use the `my_alert_role` role when viewing the query history of the alert `my_alert`.
If `my_alert_role` is not the owner of `my_alert`, you must [grant](../sql-reference/sql/grant-privilege.md) that role the
MONITOR or OPERATE privilege on the alert:

```sqlexample
GRANT MONITOR ON ALERT my_alert TO ROLE my_alert_role;
```

After the role is granted this privilege, you can use the role to view the query history of the alert:

```sqlexample
USE ROLE my_alert_role;
```

```sqlexample
SELECT query_text FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY())
  WHERE query_text LIKE '%Some condition%'
    OR query_text LIKE '%Some action%'
  ORDER BY start_time DESC;
```

---
title: sfsql Tips and Hints — Obsoleted
source: https://docs.snowflake.com/en/user-guide/sfsql-hints.md
section: User Guide
---

# sfsql Tips and Hints — *Obsoleted*

This topic provides tips, hints, and other useful information for using
`sfsql`.

## Setting Session Defaults

If you did not set a default role, database, schema, or warehouse for your session
either in the `login.defaults` file or on the command line when starting `sfsql`,
you should set these values to make executing SQL queries and performing DDL or
DML operations easier. For more information, see:

> * [USE ROLE](../sql-reference/sql/use-role.md)
> * [USE DATABASE](../sql-reference/sql/use-database.md)
> * [USE SCHEMA](../sql-reference/sql/use-schema.md)
> * [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md)

Note that these defaults can also be set at the user level by individual users
or an account administrator.

## Specifying Directory Paths and Files

When performing any file operation in `sfsql`, by default, the client looks for
the file in the directory path from which the client was started. To use files
located in a different directory path, provide the fully-qualified path, e.g.
`/<path>/<to>/<file>` (in a Linux or macOS environment).

## Escaping Control Characters

`sfsql` pre-processes user input for control characters. As a result, to insert
a single backslash character into a SQL string literal in the client, the backslash
character needs to be double-escaped (i.e. `\` must be written as `\\\\`).

## Formatting Output

HenPlus forces output to display in delimited table/column format. This may result
in trailing blank spaces added to field values and a column delimiter (e.g. `|`
or `,`) added to the end of each row in query results. If you don’t want these
additional characters in your results, you will need to remove them manually.

---
title: sfsql — Obsoleted
source: https://docs.snowflake.com/en/user-guide/sfsql.md
section: User Guide
---

# sfsql — *Obsoleted*

`sfsql` provides a command-line interface for connecting to Snowflake through JDBC to execute SQL queries and perform DDL and DML operations, including loading and unloading data from database tables.
`sfsql` is a Bash shell script (on Linux/macOS) or batch file (on Microsoft Windows) implemented on top of [HenPlus](http://henplus.sourceforge.net/).

`sfsql` uses the [Snowflake JDBC driver](../developer-guide/jdbc/jdbc.md) to connect to Snowflake; however, the driver is not a prerequisite for installing the client. The driver is bundled in the
`sfsql` distribution and is automatically installed along with the client.

**Next Topics:**

* [Configuring sfsql — *Obsoleted*](sfsql-install-config.md)
* [Starting and Stopping sfsql — *Obsoleted*](sfsql-start-stop.md)
* [Using sfsql — *Obsoleted*](sfsql-use.md)
* [sfsql Tips and Hints — *Obsoleted*](sfsql-hints.md)
* [Differences between sfsql and SnowSQL](snowsql-sfsql-diff.md)

---
title: Share data from multiple databases
source: https://docs.snowflake.com/en/user-guide/data-sharing-multiple-db.md
section: User Guide
---

# Share data from multiple databases

Snowflake data providers can share data from multiple databases by using secure views. A secure view can reference objects such
as schemas, tables, and other views contained in one or more databases, as long as those databases belong to the same account.

Sharing a secure view that references objects from multiple databases is different from sharing data contained in a
single database.

In addition to performing all the [standard steps to share data](data-sharing-provider.md), you must also grant the REFERENCE_USAGE privilege
on each database referenced by a secure view that you wish to share. However, you do not need to grant REFERENCE_USAGE on the
database that contains the secure view.

> **Note:**
>
> You cannot use database roles to share data from multiple databases. You cannot grant the REFERENCE_USAGE privilege to a
> [database role](security-access-control-overview.md) and you cannot use a database role to grant a secure view
> that references objects from multiple databases to a share.

You must grant the REFERENCE_USAGE privilege separately on each database referenced in a secure view, before granting the
secure view to a share.

To share a secure view that references objects from multiple databases:

1. Connect to your Snowflake account as a user with the ACCOUNTADMIN role or a role granted the CREATE SHARE global privilege.
   For more details about the CREATE SHARE privilege, see [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).
2. Create a share using [CREATE SHARE](../sql-reference/sql/create-share.md).
3. Grant the USAGE privilege on the database you wish to share using [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md).

   > **Note:**
   >
   > If you are sharing a secure view that references objects contained in multiple databases, you only need to grant the USAGE privilege
   > to the database where the secure view is created. You can only grant USAGE to one database per share.
   >
   > Granting the USAGE privilege to the database associates the share with a database, which is required to grant other privileges
   > to the share.
4. Grant the USAGE privilege on each schema in the database you wish to share using [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md).
5. Grant the REFERENCE_USAGE privilege on each additional database that contains objects referenced by the view you wish to share using
   [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md).
6. Add the view to the share by granting the SELECT privilege on the view using [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md).
7. Add one or more consumer accounts to the share using [ALTER SHARE](../sql-reference/sql/alter-share.md).

The share is now ready to be consumed by the specified accounts.

> **Note:**
>
> To share a secure view that references a UDF in a different database, you must make the UDF secure. For more details about creating a
> secure UDF, see [Creating a Secure UDF or Stored Procedure](../developer-guide/secure-udf-procedure.md).

## Examples

Refer to the following examples for creating secure views.

### Example 1: Create and share a secure view in an existing database

A provider who organizes data into different databases based on the characteristics of data and business needs wants to share a secure view
in one database that joins data in the database with objects (e.g. schema, table, view) in other databases.

1. Create database `database1` and data:

   ```sqlexample
   CREATE DATABASE database1;
   CREATE SCHEMA database1.sch;
   CREATE TABLE database1.sch.table1 (id INT);
   CREATE VIEW database1.sch.view1 AS SELECT * FROM database1.sch.table1;
   ```
2. Create database `database2` and data:

   ```sqlexample
   CREATE DATABASE database2;
   CREATE SCHEMA database2.sch;
   CREATE TABLE database2.sch.table2 (id INT);
   ```
3. Create database `database3` and data:

   ```sqlexample
   CREATE DATABASE database3;
   CREATE SCHEMA database3.sch;
   CREATE TABLE database3.sch.table3 (id INT);
   ```
4. Create the secure view with the data to be shared in `database3`:

   ```sqlexample
   CREATE SECURE VIEW database3.sch.view3 AS
     SELECT view1.id AS View1Id,
            table2.id AS table2id,
            table3.id AS table3id
     FROM database1.sch.view1 view1,
          database2.sch.table2 table2,
          database3.sch.table3 table3;
   ```
5. Create the share and grant required privileges to set up the share.

   ```sqlexample
   CREATE SHARE share1;
   GRANT USAGE ON DATABASE database3 TO SHARE share1;
   GRANT USAGE ON SCHEMA database3.sch TO SHARE share1;
   ```
6. Grant the required privileges necessary to add the secure view `view3` to the share.

   The data referenced in additional databases by secure view `view3` requires granting the REFERENCE_USAGE privilege on `database1`
   and `database2` to the share:

   ```sqlexample
   GRANT REFERENCE_USAGE ON DATABASE database1 TO SHARE share1;
   GRANT REFERENCE_USAGE ON DATABASE database2 TO SHARE share1;

   GRANT SELECT ON VIEW database3.sch.view3 TO SHARE share1;
   ```

You can share this data with consumers in other regions by using a replication group to replicate data to an account in another region.
For instructions, see [Example 3: Share data from multiple databases](secure-data-sharing-across-regions-platforms.md).

### Example 2: Create and share a secure view in a separate database

A provider stores customer data in separate databases and does not want to create new objects in those databases. To share data, the provider
creates a new database with a secure view. The secure view references objects (schema, table, view) in the databases with customer data.

Sample Code:

1. Create the customer database `customer1_db` and data:

   ```sqlexample
   CREATE DATABASE customer1_db;
   CREATE SCHEMA customer1_db.sch;
   CREATE TABLE customer1_db.sch.table1 (id INT);
   CREATE VIEW customer1_db.sch.view1 AS SELECT * FROM customer1_db.sch.table1;
   ```
2. Create the customer database `customer2_db` and data:

   ```sqlexample
   CREATE DATABASE customer2_db;
   CREATE SCHEMA customer2_db.sch;
   CREATE TABLE customer2_db.sch.table2 (id INT);
   ```
3. Create the new database `new_db` and schema `sch`:

   ```sqlexample
   CREATE DATABASE new_db;
   CREATE SCHEMA new_db.sch;
   ```
4. Create the secure view in `new_db` that references objects in `customer1_db` and `customer2_db`:

   ```sqlexample
   CREATE SECURE VIEW new_db.sch.view3 AS
     SELECT view1.id AS view1Id,
            table2.id AS table2ID
     FROM customer1_db.sch.view1 view1,
          customer2_db.sch.table2 table2;
   ```
5. Create the share and grant required privileges to set up the share:

   ```sqlexample
   CREATE SHARE share1;

   GRANT USAGE ON DATABASE new_db TO SHARE share1;
   GRANT USAGE ON SCHEMA new_db.sch TO SHARE share1;
   ```
6. Grant the required privileges necessary to add the secure view `view3` to the share.

   The data referenced in additional databases by secure view `view3` requires granting the REFERENCE_USAGE privilege on
   `customer1_db` and `customer2_db` to the share:

   ```sqlexample
   GRANT REFERENCE_USAGE ON DATABASE customer1_db TO SHARE share1;
   GRANT REFERENCE_USAGE ON DATABASE customer2_db TO SHARE share1;

   GRANT SELECT ON VIEW new_db.sch.view3 TO SHARE share1;
   ```

## Sharing data from multiple database with consumers in other regions

You can share data from multiple databases with consumer accounts in other regions and cloud platforms by using a replication group.
Include the share and each database the share references in the group to replicate data to a Snowflake account in another region.
You can then add consumer accounts to the replicated share. For detailed instructions,
see [Share data securely across regions and cloud platforms](secure-data-sharing-across-regions-platforms.md).

---
title: Share data in non-secured views
source: https://docs.snowflake.com/en/user-guide/data-sharing-views.md
section: User Guide
---

# Share data in non-secured views

To take full advantage of the performance gains of query
optimizations on the views that you share, you can
create a share that lets you share non-secure views with other accounts.

> **Note:**
>
> When possible, use secure views to enforce the security of your data.
> See [Use secure objects to control data access](data-sharing-secure-views.md).

## Create a share that allows non-secure objects

To share non-secure views,
create a share that allows non-secure objects.

For example, run the following:

```sqlexample
CREATE OR REPLACE SHARE allow_non_secure_views
 SECURE_OBJECTS_ONLY=FALSE
 COMMENT="Share views that require query optimization";
```

> **Note:**
>
> For full syntax, see Syntax for sharing non-secure views
> in this topic.

After you create a share that allows sharing views,
use the [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) command to
grant a view to a share. For example, to grant a view named
`non_secure_view` to the share, run the following:

```sqlexample
GRANT SELECT ON VIEW non_secure_view TO SHARE allow_non_secure_views;
```

Alternatively, you can grant the SELECT privilege on the view to a
database role, and then grant that database role to the share.
For example, to grant SELECT privileges on the view `non_secure_view`
to the database role `performance_engineer` and then
grant the role to the share, run the following:

```sqlexample
GRANT SELECT ON VIEW non_secure_view TO DATABASE ROLE performance_engineer;
GRANT DATABASE ROLE performance_engineer TO SHARE allow_non_secure_views;
```

## Convert an existing share to allow sharing non-secure views

You can convert an existing share with secure views into a share that
supports sharing non-secure views.

For example, to convert an existing share `secure_views_only` into
one that supports sharing non-secure views, do the following:

1. Use the [SHOW GRANTS](../sql-reference/sql/show-grants.md) command to determine
   which objects are granted to the share, and which accounts
   have access to the share, respectively:

   ```sqlexample
   SHOW GRANTS TO SHARE secure_views_only;
   SHOW GRANTS OF SHARE secure_views_only;
   ```
2. Convert the existing share with one that allows sharing views:

   ```sqlexample
   ALTER SHARE secure_views_only
    SET SECURE_OBJECTS_ONLY = FALSE,
    COMMENT = "Convert to allow sharing non-secure views that require
    query optimization";
   ```
3. Optionally convert an existing secure view into a view. In this example,
   alter `secure_view2` into a non-secure view:

   ```sqlexample
   ALTER VIEW secure_view2 UNSET SECURE;
   ```

> For more details, see Convert a secure view in a share to a non-secure view.

## Convert a secure view in a share to a non-secure view

If you want to convert an existing secure view into a view, you can do
that before or after granting the view to a share.

To convert an existing secure view in a share to a view, the following
must be true:

* The secure view must only be granted to shares that are
  configured to allow sharing non-secure objects.
* The secure view cannot be granted to:

  + Database roles granted to shares that do not allow sharing non-secure
    objects.
  + Shares that do not allow sharing non-secure objects.

For example, for an existing secure view named `high_performance_view`,
unset the SECURE property:

```sqlexample
ALTER VIEW high_performance_view UNSET SECURE;
```

Alternatively, you can recreate the secure view as a view:

```sqlexample
CREATE OR REPLACE VIEW high_performance_view WITH COPY GRANTS;
```

## Limitations of sharing non-secure objects

If you plan to share objects, consider the following:

* After you create a share with the SECURE_OBJECTS_ONLY property set to FALSE, you cannot unset this property or set this property to TRUE.
* You can only add non-secure views to shares that have been explicitly configured to allow non-secure objects.

## Syntax for sharing non-secure views

```sqlsyntax
CREATE [ OR REPLACE ] SHARE <name>
[ SECURE_OBJECTS_ONLY = <boolean> ]
[ COMMENT = '<string_literal>' ]
```

### Required Parameters

`name`
:   Specifies the identifier for the share;
    must be unique for the account in which the share is created.

    In addition, the identifier must start with an alphabetic character
    and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes. For example, `"My object"`.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information about identifier requirements, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

### Optional Parameters

`SECURE_OBJECTS_ONLY = boolean`
:   Specifies whether allow granting only secure objects,
    or also allow granting non-secure objects to the share.

    Default: true

`COMMENT = 'string_literal'`
:   Specifies a comment for the share.

    Default: No value

### Access control requirements

A [role](security-access-control-overview.md) used to execute this operation must have the following
[privileges](security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SHARE | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](security-access-control-overview.md), see [Overview of Access Control](security-access-control-overview.md).

For more information about access control requirements for Snowflake
Secure Data Sharing specifically, see
[Enable non-ACCOUNTADMIN roles to perform data sharing tasks](security-access-privileges-shares.md).

### Usage notes

* You cannot see the value of the SECURE_OBJECTS_ONLY property when you
  run [SHOW SHARES](../sql-reference/sql/show-shares.md). Use the COMMENT property to note the
  value of the SECURE_OBJECTS_ONLY property.
* The existing notes for [CREATE SHARE](../sql-reference/sql/create-share.md) also apply.

### Examples

For an example on how to create a share with non-secure views, see Create a share that allows non-secure objects.

For an example using ALTER SHARE,
see Convert an existing share to allow sharing non-secure views.

---
title: Share data protected by a policy
source: https://docs.snowflake.com/en/user-guide/data-sharing-policy-protected-data.md
section: User Guide
---

# Share data protected by a policy

Data sharing consumers can use a shared database role to access shared data protected by a masking policy or a row access policy.

## Overview

A data sharing provider can share a database role to enable a data sharing consumer to access policy protected data. The provider defines
the policy to call the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function to evaluate the shared database role or
a mapping table column that contains the database role. This provides more options to the provider to share data and allows the consumer to
access sensitive data that the provider makes available.

When the policies and protected tables are in different databases, the provider must:

* Create the database role in the same database as the protected table.
* Grant the database role to the share containing the protected table.
* Share the database that contains the protected table to the consumer account.

When the consumer creates a database from the share, the database roles in the share are granted to the role that creates the database from
the share. This allows the account role in the consumer account to meet the policy conditions that specify the database role and access the
shared data.

To access the shared data protected by the policy, the consumer must specify the database containing the shared database role to make the
shared database role active in the current session. In this context, making the database role active means that the database role is
available in the role hierarchy of the current role for the user. If you do not specify this shared database, users in the consumer account
cannot access shared data that is protected by a policy. You can specify this database using either of the following options:

* Activate the database in the session with the [USE <object>](../sql-reference/sql/use.md) command or select the database in the worksheet. For example:

  ```sqlexample
  USE DATABASE mounted_db;
  ```

  Where `mounted_db` is the name of the database the consumer creates from the share.
* For a specific query, use the fully qualified name of the object that is in the same database as the database role. For example:

  ```sqlexample
  SELECT * FROM mounted_db.myschema.mytable;
  ```

### Call the function

There are two different ways to specify arguments in the IS_DATABASE_ROLE_IN_SESSION function: a string literal or a nonliteral
(i.e. column name).

* When you specify a database role as a string in the IS_DATABASE_ROLE_IN_SESSION function, the result of calling the
  function depends on how the function is called. For example:

  + In a worksheet, Snowflake looks at the database that is in use for the session or the database that is specified in the query. This
    applies to both the provider account and the consumer account.
  + With a policy, UDF, or view, Snowflake looks at the database that contains the protected object. When these objects are not
    shared and the database role being is defined in a different database, the function evaluates to `False`.
* When you specify a column name as the argument in the IS_DATABASE_ROLE_IN_SESSION function:

  + If a table query calls the function, the column maps to the table identifier of the table containing the column. Snowflake then looks
    at the database roles in the database containing the table. For example, to specify the AUTHZ_ROLE (i.e. authorized role) column as the
    argument:

    ```sqlexample
    SELECT * FROM mydb.myschema.t WHERE IS_DATABASE_ROLE_IN_SESSION(AUTHZ_ROLE);
    ```
  + If a masking policy, row access policy, or UDF calls the function, the lookup occurs in the database that contains the protected table.

## General workflow

Sharing policy-protected data with the IS_DATABASE_ROLE_IN_SESSION function in the policy requires the same steps to create a policy to
call the function and share data. To summarize:

1. The provider creates an account role.
2. The provider creates a policy and sets the policy on a table or column.
3. The provider tests the policy with the account role.
4. The provider creates a database role and tests the policy with the database role.
5. The provider creates a share and grants privileges to the share, including granting the database role to the share.
6. The consumer creates a database from the share (the *mounted database*).
7. The consumer queries the shared object that is protected by the policy.

## Example: All objects in the same database

In this example, the database roles, masking policy, and the protected table are all in the same database named `mydb`.

For reference:

* The database roles are `analyst_dbrole` and `support_dbrole`.
* The masking policy is defined as follows:

  ```sqlexample
  CREATE OR REPLACE MASKING POLICY mydb.policies.email_mask
    AS (val string) RETURNS string ->
    CASE
      WHEN IS_DATABASE_ROLE_IN_SESSION('ANALYST_DBROLE')
        THEN val
      WHEN IS_DATABASE_ROLE_IN_SESSION('SUPPORT_DBROLE')
        THEN REGEXP_REPLACE(val,'.+\@','*****@')
      ELSE '********'
    END
    COMMENT = 'use database role for shared data'
    ;
  ```
* The EMAIL column is in a table named `mydb.tables.empl_info` and the masking policy is set on this column.

Complete the following steps to share the database `mydb` and allow the consumer to use the shared database role to query the shared data
protected by the shared masking policy. These steps assume the provider has already tested the masking policy on the EMAIL column with
their account roles and database roles.

1. In the provider account, execute the [CREATE SHARE](../sql-reference/sql/create-share.md) command to create a share for the analyst database role:

   ```sqlexample
   USE ROLE r1;
   CREATE SHARE analyst_share;
   ```
2. Grant privileges to the share. The same privileges are required for each share:

   ```sqlexample
   USE ROLE r1;
   GRANT USAGE ON DATABASE mydb TO SHARE analyst_share;
   GRANT USAGE ON SCHEMA mydb.tables TO SHARE analyst_share;
   GRANT SELECT ON TABLE mydb.tables.empl_info TO SHARE analyst_share;
   GRANT DATABASE ROLE analyst_dbrole TO SHARE analyst_share;
   ```
3. Add the consumer account to the share:

   ```sqlexample
   ALTER SHARE analyst_share ADD ACCOUNTS = consumer_account;
   ```
4. In the consumer account, create the account role `r1` and grant privileges to this role to import the share:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE r1;

   GRANT USAGE ON WAREHOUSE my_warehouse TO ROLE r1;
   GRANT CREATE DATABASE ON ACCOUNT TO ROLE r1;
   GRANT IMPORT SHARE ON ACCOUNT TO ROLE r1;
   GRANT ROLE r1 TO ROLE ACCOUNTADMIN;
   ```
5. Import the share:

   ```sqlexample
   USE ROLE r1;
   CREATE DATABASE mounted_db FROM SHARE provider_account.analyst_share;
   ```
6. Verify the database role is in session:

   ```sqlexample
   USE DATABASE mounted_db;
   USE SCHEMA mounted_db.tables;

   SELECT IS_DATABASE_ROLE_IN_SESSION('ANALYST_DBROLE');
   ```

   The SELECT statement should return `True`.
7. Query the protected table:

   ```sqlexample
   SELECT * FROM empl_info;
   ```

   The SELECT statement should return the unmasked email addresses.
8. Grant the database roles to account roles so that users with these roles can query the protected table and view data based on the
   masking policy definition.

   After repeating the previous two steps, a user that is granted the `support_dbrole` database role should see a partially masked email
   address.

## Example: Masking policy and protected data in different databases

When the policy and the protected table are in different databases, share the database that contains the protected table with the consumer.

For example:

* `mydb1` contains the masking policy.
* `mydb2` contains the table named `mydb2.tables.empl_info`, which contains the EMAIL column. The masking policy is set on this column.

  You must group the table and the database role, `analyst_dbrole`, in the same database.

The provider follows the same procedure as the previous example in terms of creating a share, granting privileges to the share, and
granting the database role to the share.

The consumer follows the same procedure as the previous example in terms of creating a database from the share. However, the consumer must
have the database containing the protected table in use to activate the database role. Then the consumer can query the protected table by
specifying the fully qualified name of the table.

1. In the provider account, execute the [CREATE SHARE](../sql-reference/sql/create-share.md) command to create a share for each database:

   ```sqlexample
   USE ROLE r1;
   CREATE SHARE analyst_policy_share;
   CREATE SHARE analyst_table_share;
   ```
2. Grant privileges to the share named `analyst_table_share`:

   ```sqlexample
   USE ROLE r1;
   GRANT USAGE ON SCHEMA mydb2.tables TO SHARE analyst_table_share;
   GRANT SELECT ON TABLE mydb2.tables.empl_info TO SHARE analyst_table_share;
   GRANT DATABASE ROLE mydb2.analyst_dbrole TO SHARE analyst_table_share;
   ```
3. Add the consumer account to the share:

   ```sqlexample
   ALTER SHARE analyst_table_share ADD ACCOUNTS = consumer_account;
   ```
4. In the consumer account, create the account role `r1` and grant privileges to this role to import the share:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE r1;

   GRANT USAGE ON WAREHOUSE my_warehouse TO ROLE r1;
   GRANT CREATE DATABASE ON ACCOUNT TO ROLE r1;
   GRANT IMPORT SHARE ON ACCOUNT TO ROLE r1;
   GRANT ROLE r1 TO ROLE ACCOUNTADMIN;
   ```
5. Import the share that contains the protected table and the database role:

   ```sqlexample
   USE ROLE r1;
   CREATE DATABASE mounted_db2 FROM SHARE provider_account.analyst_table_share;
   ```
6. Verify the database role is in session:

   ```sqlexample
   USE DATABASE mounted_db2;
   USE SCHEMA mounted_db2.tables;

   SELECT IS_DATABASE_ROLE_IN_SESSION('ANALYST_DBROLE');
   ```

   The SELECT statement should return `True`.
7. Query the protected table:

   ```sqlexample
   SELECT * FROM mounted_db2.tables.empl_info;
   ```

   The SELECT statement should return the unmasked email addresses.

## Example: Row access policy without mapping table

In this example, the row access policy calls the IS_DATABASE_ROLE_IN_SESSION function to lookup the role name in the `authz_role`
(authorized role) column. The nonliteral syntax and that function lookup occurs in the database that contains the protected table.

Create the policy:

> ```sqlexample
> CREATE OR REPLACE ROW ACCESS POLICY rap_authz_role AS (authz_role string)
> RETURNS boolean ->
> IS_DATABASE_ROLE_IN_SESSION(authz_role);
> ```

Add the policy to a table:

> ```sqlexample
> ALTER TABLE allowed_roles
>   ADD ROW ACCESS POLICY rap_authz_role ON (authz_role);
> ```

The provider can choose to share objects in a single database or in multiple databases as shown in the masking policy examples. The
consumer follows the same procedure to create a database from a share for each database that the provider makes available.

## Example: Row access policy with mapping table

In this example, the row access policy calls the IS_DATABASE_ROLE_IN_SESSION function to look up the authorized role from a mapping table
column called `role_name`. The nonliteral syntax and that function lookup occurs in the database that contains the protected
table. In this scenario, the mapping table must be in the same database as the protected table. After creating the policy, add the policy
to the table containing the `authz_role` column.

> Create the policy:
>
> > ```sqlexample
> > CREATE OR REPLACE ROW ACCESS POLICY rap_authz_role_map AS (authz_role string)
> > RETURNS boolean ->
> > EXISTS (
> >   SELECT 1 FROM mapping_table m
> >   WHERE authz_role = m.key AND IS_DATABASE_ROLE_IN_SESSION(m.role_name)
> > );
> > ```
>
> Add the policy to a table:
>
> > ```sqlexample
> > ALTER TABLE allowed_roles
> >   ADD ROW ACCESS POLICY rap_authz_role_map ON (authz_role);
> > ```

The provider can choose to share objects in a single database or in multiple databases as shown in the masking policy examples. The
consumer follows the same procedure to create a database from a share for each database that the provider makes available.

---
title: Share data securely across regions and cloud platforms
source: https://docs.snowflake.com/en/user-guide/secure-data-sharing-across-regions-platforms.md
section: User Guide
---

# Share data securely across regions and cloud platforms

This topic provides instructions on using [replication](account-replication-intro.md) to allow data providers
to securely share data with data consumers across different [regions](intro-regions.md) and
[cloud platforms](intro-cloud-platforms.md).

> **Note:**
>
> If you use listings to share data with specific consumer accounts, or you use the Snowflake Marketplace,
> you can use [Cross-Cloud Auto-fulfillment](../collaboration/provider-listings-auto-fulfillment.md)
> to automatically fulfill your data product to other regions.

Cross-region data sharing is supported by Snowflake accounts hosted on any of the following cloud platforms:

* Amazon Web Services (AWS)
* Google Cloud Platform (GCP)
* Microsoft Azure (Azure)

> **Important:**
>
> If you replicate a primary database to accounts in a geographic region or country that is different from that in which your source
> Snowflake account is located, you should confirm that your organization does not have any legal or regulatory restrictions as to where
> your data can be transferred or hosted.

## Data sharing considerations

* Since cross-region data sharing utilizes Snowflake data replication functionality, understand how replication works in Snowflake
  as part of your planning process. For more information, see:

  + [Introduction to replication and failover across multiple accounts](account-replication-intro.md)
  + [Replication considerations](account-replication-considerations.md)
  + [Replicating databases and account objects across multiple accounts](account-replication-config.md)
* Data providers only need to create one copy of the dataset per region; and not a copy per consumer.
* When sharing a view that references objects in multiple databases, each of these other databases must be included in the replication
  group. Sharing data from more than one database requires additional steps. For instructions, see
  [Share data from multiple databases](data-sharing-multiple-db.md).
* For information related to using [Virtual Private Snowflake (VPS)](intro-editions.md) with data sharing,
  see [About collaboration in VPS environments](../collaboration/virtual-private-snowflake/about-vps-collaboration.md).

## Sharing data with data consumers in a different region and cloud platform

Snowflake data providers can share data with data consumers in a different region in a few simple steps.

### Step 1: Set up data replication

> **Note:**
>
> Before configuring data replication, you must create an account in a region where you wish to share data and link it to your local
> account. For more information, see [Working with organizations and accounts](../guides-overview-manage.md).

Setting up data replication involves the following tasks:

1. Enable replication for your accounts.

   An [organization administrator](organization-administrators.md) must enable replication for the source account that
   contains the data to share and the target accounts in regions where you want to share data with consumers. For instructions on enabling
   replication, see [Prerequisite: Enable replication for accounts in the organization](account-replication-config.md).
2. Create a replication group and add databases and shares.
3. Replicate the group with the databases and shares to the regions where you want to share data with consumers.

### Step 2. Share data with data consumers

Sharing data with data consumers in the same region involves adding one or more consumer accounts to the
secondary shares that you replicated from the source account.

For detailed instructions, see [Getting Started with Secure Data Sharing](data-sharing-gs.md).

## Example 1: Share data

A data provider, Acme, wants to share data with data consumers in a different region.

### Execute from source account

To create a replication group that contains the databases and shares to replicate to another region, execute the following
SQL statement.

> **Note:**
>
> If you have previously enabled replication for an individual database, you must disable database replication for the
> database *before* you add it to a replication group. For details, see [Transitioning from database replication to group-based replication](account-replication-config.md).

Create a replication group `my_rg` that includes database `db1` and share `share1` to replicate to the account `account_2`
in the `acme` org.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE REPLICATION GROUP my_rg
  OBJECT_TYPES = databases, shares
  ALLOWED_DATABASES = db1
  ALLOWED_SHARES = share1
  ALLOWED_ACCOUNTS = acme.account_2;
```

### Execute from target account

From the target account in the other region, execute the following SQL statements.
Any account that you add to the share should be local to the region of the target account.
After you alter the share to set a list of accounts (targets), your added accounts won’t be overwritten in the next refresh.

1. Create a secondary replication group in `account_2`:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE REPLICATION GROUP my_rg
     AS REPLICA OF acme.account1.my_rg;
   ```
2. Manually refresh the replication group to replicate the databases and shares to `account_2`:

   ```sqlexample
   ALTER REPLICATION GROUP my_rg REFRESH;
   ```
3. Add one or more consumer accounts to `share1`:

   ```sqlexample
   ALTER SHARE share1 ADD ACCOUNTS = consumer_org.consumer_account_name;
   ```

You can automate refresh operations by setting the REPLICATION_SCHEDULE parameter for the *primary* replication group using the
[ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) command in the source account. For more information,see
[Replication schedule](account-replication-intro.md).

## Example 2: Share a subset of data from a database

A data provider, Acme, wants to share a subset of data with data consumers in a different region. To reduce replication costs, they
would like to only replicate the relevant rows from their master table. Since replication is done at the database level, this example
describes how Acme can use streams and tasks to copy the desired rows from the main database to a new database, create a share and
grant privileges on the view, and replicate both in a replication group to an account in a different region for consumer access.
In this scenario the new database and share are designated as primary objects for data replication.

### Execute from source account

Use the following SQL commands to create a new database in the source account and enable replication.

> **Note:**
>
> If you have previously enabled replication for an individual database, you must disable database replication for the
> database *before* you add it to a replication group. For details, see [Transitioning from database replication to group-based replication](account-replication-config.md).

1. In your local account, create a database `db1` with a subset of data from the database with the source data:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE DATABASE db1;

   CREATE SCHEMA db1.sch;

   CREATE TABLE db1.sch.table_b AS
     SELECT customerid, user_order_count, total_spent
     FROM source_db.sch.table_a
     WHERE REGION='azure_eastus2';
   ```
2. Create a secure view with the data to share:

   ```sqlexample
   CREATE SECURE VIEW db1.sch.view1 AS
     SELECT customerid, user_order_count, total_spent
     FROM db1.sch.table_b;
   ```
3. Create a stream to record changes made to the source table:

   ```sqlexample
   CREATE STREAM mystream ON TABLE source_db.sch.table_a APPEND_ONLY = TRUE;
   ```
4. Create a task to insert data into the table in `db1` with changes from the source data:

   ```sqlexample
   CREATE TASK mytask1
     WAREHOUSE = mywh
     SCHEDULE = '5 minute'
   WHEN
     SYSTEM$STREAM_HAS_DATA('mystream')
   AS
     INSERT INTO table_b(CUSTOMERID, USER_ORDER_COUNT, TOTAL_SPENT)
       SELECT customerid, user_order_count, total_spent
       FROM mystream
       WHERE region='azure_eastus2'
       AND METADATA$ACTION = 'INSERT';
   ```
5. Start the task to update data:

   ```sqlexample
   ALTER TASK mytask1 RESUME;
   ```
6. Create a share and grant privileges to the share:

   ```sqlexample
   CREATE SHARE share1;

   GRANT USAGE ON DATABASE db1 TO SHARE share1;
   GRANT USAGE ON SCHEMA db1.sch TO SHARE share1;
   GRANT SELECT ON VIEW db1.sch.view1 TO SHARE share1;
   ```
7. Create a primary replication group with the database and share:

   ```sqlexample
   CREATE REPLICATION GROUP my_rg
     OBJECT_TYPES = DATABASES, SHARES
     ALLOWED_DATABASES = db1
     ALLOWED_SHARES = share1
     ALLOWED_ACCOUNTS = acme_org.account_2;
   ```

### Execute from target account

Execute the following SQL commands from the target account in the other region.

1. Create a secondary replication group to replicate the databases and shares from the source account:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE REPLICATION GROUP my_rg
     AS REPLICA OF acme_org.account_1.my_rg;
   ```
2. Manually refresh the group to replicate objects to the current account:

   ```sqlexample
   ALTER REPLICATION GROUP my_rg REFRESH;
   ```
3. Add one or more consumer accounts to the share:

   ```sqlexample
   ALTER SHARE share1 ADD ACCOUNTS = consumer_org.consumer_account_name;
   ```

You can automate refresh operations by setting the REPLICATION_SCHEDULE parameter for the *primary* replication group using the
[ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) command in the source account. For more information,see
[Replication schedule](account-replication-intro.md).

## Example 3: Share data from multiple databases

A data provider, Acme, wants to share data from multiple databases with data consumers in a different region. They
create a secure view and share (for instructions, see [Share data from multiple databases](data-sharing-multiple-db.md)), then
replicate all the databases and share in a replication group to replicate data to accounts in other regions.

### Execute from source account

Create a replication group `my_rg` that includes the databases and share from [Example 1: Create and share a secure view in an existing database](data-sharing-multiple-db.md) to replicate
to `account_2` in the `acme` org:

```sqlexample
CREATE REPLICATION GROUP my_rg
  OBJECT_TYPES = databases, shares
  ALLOWED_DATABASES = database1, database2, database3
  ALLOWED_SHARES = share1
  ALLOWED_ACCOUNTS = acme.account_2;
```

### Execute from target account

Execute the following SQL commands from the target account in the other region.

1. Create a secondary replication group to replicate the databases and shares from the source account:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE REPLICATION GROUP my_rg
     AS REPLICA OF acme_org.account_1.my_rg;
   ```
2. Manually refresh the group to replicate objects to the current account:

   ```sqlexample
   ALTER REPLICATION GROUP my_rg REFRESH;
   ```
3. Add one or more consumer accounts to the share:

   ```sqlexample
   ALTER SHARE share1 ADD ACCOUNTS = consumer_org.consumer_account_name;
   ```

You can automate refresh operations by setting the REPLICATION_SCHEDULE parameter for the *primary* replication group using the
[ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) command in the source account. For more information,see
[Replication schedule](account-replication-intro.md).

---
title: Share secure database objects
source: https://docs.snowflake.com/en/user-guide/data-sharing-gs.md
section: User Guide
---

# Share secure database objects

Use the information provided here to share a database and its objects with one or more accounts by creating a share.
You can provide a share to consumers using direct shares or listings.

You can attach a share to a listing, or convert a direct share with active consumers to a listing.
For instructions, see [Convert a direct share to a listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing#convert-a-direct-share-to-a-private-listing).

Are you a consumer interested in consuming shared data? See [Consume imported data](data-share-consumers.md).

## How to share database objects

The following are the options available for adding objects to a share:

* **Grant a database role to a share**

  Segment the securable objects in a share by creating multiple database roles in a database to a share. Grant privileges on a subset of
  the objects in the database to each database role. Then grant each database role to the share.

  After creating a database from a share that includes database roles, data consumers grant each shared database role to one or more
  [account roles](security-access-control-overview.md) in their own account.

  Without database roles, account administrators in data consumer accounts grant a single privilege, IMPORTED PRIVILEGES, to roles
  to allow their users to access all databases and database objects (tables, secure views, etc.) in a share. There is no option
  to allow different groups of users in a data consumer account to access a subset of the shared objects. This all or nothing approach
  requires you to create multiple shares to grant access to different objects in the same database.

  > **Note:**
  >
  > If you plan to include data from multiple databases in a single share, you cannot use this option because the REFERENCE_USAGE
  > privilege cannot be granted to a database role. For guidance sharing data from multiple databases,
  > see [Share data from multiple databases](data-sharing-multiple-db.md).
  >
  > Alternatively, you could create a share that grants database roles to a share (Option 1), but also grants privileges on objects
  > directly to the same share without granting privileges on those objects to a database role (Option 2). Data consumers who create
  > databases from the share can access objects granted to the share directly by granting the IMPORTED PRIVILEGES privilege on the
  > database to local roles.

  > **Tip:**
  >
  > A shared database role does not support future grants on objects. For details, see [GRANT DATABASE ROLE … TO SHARE](../sql-reference/sql/grant-database-role-share.md).
* **Grant privileges on objects directly to a share**

  Grants privileges on specific objects in the database directly to a share. This option allows you to include data from multiple databases
  in a share, as long as these databases belong to the same account. For guidance sharing data from multiple databases,
  see [Share data from multiple databases](data-sharing-multiple-db.md).

  Account administrators in data consumer accounts grant the IMPORTED PRIVILEGES privilege on shared databases to one or more roles
  to allow their users to access the databases and database objects (tables, secure views, and so on) in a share.

  This option does not support segmenting database objects in a share based on roles.

## Grant database roles to a share

This section provides instructions for data providers to restrict access to databases and database objects in a share using database roles.

> **Note:**
>
> To perform the tasks described in this topic, your role must have the global CREATE DATABASE and CREATE SHARE privileges.

In the extended example throughout this section, a data provider shares the following objects with data consumers:

|  |  |  |
| --- | --- | --- |
| Databases | `d1` |  |
| Schemas | `d1.s1` |  |
| Secure views | `d1.s1.v1`  The result set for this view includes records from table `d1.s1.t1`. | `d1.s1.v2`  The result set for this view includes records from tables `d1.s1.t2` and `d1.s1.t3`. |

The data provider creates two database roles in database `d1` to control access to these objects: `d1.r1` and `d1.r2`.

The following diagram shows the relationships among these objects and indicates the privileges that are granted to the database roles:

For more information about the privileges, see [Access control privileges](security-access-control-privileges.md).

### Create database roles

Create a new database role or replace an existing database role using [CREATE DATABASE ROLE](../sql-reference/sql/create-database-role.md).

For example, create database roles `d1.r1` and `d1.r2` using fully-qualified identifiers:

```sqlexample
CREATE DATABASE ROLE d1.r1;

CREATE DATABASE ROLE d1.r2;
```

Alternatively, set the desired database as the current database in the session, and then create the database roles:

```sqlexample
USE DATABASE d1;

CREATE DATABASE ROLE r1;

CREATE DATABASE ROLE r2;
```

### Grant privileges on objects to database roles

Grant privileges on a single database and subset of objects in the database to each database role using
[GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md). Only grant privileges on objects that the database role should allow access to.

Either specify the fully-qualified name of a database role, or set the database as the active database in a session and then specify
the relative name.

> **Note:**
>
> * To perform the tasks described in this topic, you must use the ACCOUNTADMIN role or a
>   [role granted the relevant privileges](security-access-privileges-shares.md). For more information, including
>   additional data sharing scenarios, see [Create and configure shares](data-sharing-provider.md).
> * Privileges granted to a database role are limited to USAGE on the database and schema that contain the database role and privileges on
>   other objects in the same database. In particular, note that the REFERENCE_USAGE privilege cannot be granted to a database role to
>   include objects from multiple databases in a share.

Continuing the extended example in these instructions, the following privileges are granted to the database roles:

| Database Role | Privilege | Object |
| --- | --- | --- |
| `d1.r1` | USAGE | Database `d1` |
|  | USAGE | Schema `d1.s1` |
|  | SELECT | Secure view `d1.s1.v1` |
| `d1.r2` | USAGE | Database `d1` |
|  | USAGE | Schema `d1.s1` |
|  | SELECT | Secure view `d1.s1.v2` |

The following SQL statements grant the privileges to the `d1.r1` database role:

```sqlexample
GRANT USAGE ON SCHEMA d1.s1 TO DATABASE ROLE d1.r1;
GRANT SELECT ON VIEW d1.s1.v1 TO DATABASE ROLE d1.r1;
```

The following SQL statements grant the privileges to the `d1.r2` database role:

```sqlexample
GRANT USAGE ON SCHEMA d1.s1 TO DATABASE ROLE d1.r2;
GRANT SELECT ON VIEW d1.s1.v2 TO DATABASE ROLE d1.r2;
```

Granting the USAGE privilege on the parent database is not necessary. This privilege is granted implicitly when a database role
is created.

To view all privileges granted to a database role, execute SHOW GRANTS TO DATABASE ROLE using fully-qualified identifiers:

```sqlexample
SHOW GRANTS TO DATABASE ROLE d1.r1;
SHOW GRANTS TO DATABASE ROLE d1.r2;
```

Alternatively, set the desired database as the current database in the session, and then execute the command:

```sqlexample
USE DATABASE d1;

SHOW GRANTS TO DATABASE ROLE r1;
SHOW GRANTS TO DATABASE ROLE r2;
```

### Create a share

Create a share using [CREATE SHARE](../sql-reference/sql/create-share.md). The share is an empty container at this stage in the process.

For example, create a new share named `share1`:

```sqlexample
CREATE SHARE share1;
```

### Add the database by granting the USAGE privilege to the share

Currently, it is necessary to grant the USAGE privilege on a database to include it in a share.

For example, grant the USAGE privilege on the `d1` database to share `share1`:

```sqlexample
GRANT USAGE ON DATABASE d1 TO SHARE share1;
```

### Add objects by granting database roles to the share

Add databases and database objects to a share by granting database roles to the share using [GRANT DATABASE ROLE … TO SHARE](../sql-reference/sql/grant-database-role-share.md).

For example, grant database roles `d1.r1` and `d1.r2` to share `share1`:

```sqlexample
GRANT DATABASE ROLE d1.r1 TO SHARE share1;
GRANT DATABASE ROLE d1.r2 TO SHARE share1;
```

### Share the database objects with one or more data consumer accounts

Modify the share [ALTER SHARE … ADD ACCOUNTS](../sql-reference/sql/alter-share.md) and add database consumer accounts with which
you want to share the database objects.

The following example adds accounts `consumer1` and `consumer2` in organization `org1` to share `share1`:

```sqlexample
ALTER SHARE share1 ADD ACCOUNTS = org1.consumer1,org1.consumer2;
```

### Manage database roles

This section provides instructions for managing database roles that are granted to shares.

#### Data providers: Renaming shared database roles

Rename database roles using an [ALTER DATABASE ROLE … RENAME TO](../sql-reference/sql/alter-database-role.md) statement.

For example, rename database role `d1.r1` to `d1.r3`:

```sqlexample
ALTER DATABASE ROLE d1.r1 RENAME TO d1.r3;
```

All privileges granted to `d1.r1` are retained after the database role is renamed.

Notify any data consumers of a share that the name of the database role has changed.

Note that moving a database role to a different database using the RENAME TO clause is prohibited. For example:

```sqlexample
ALTER DATABASE ROLE d1.r1 RENAME TO d2.r1;
```

#### Data providers: Dropping shared database roles

Drop database roles using DROP DATABASE ROLE.

For example, drop database role `d1.r2`:

```sqlexample
DROP DATABASE ROLE d1.r2;
```

Notify any data consumers of a share that includes the database role. Access to any objects granted to the database role is revoked.

#### Data providers: Creating new shared database roles

Create new database roles using CREATE DATABASE ROLE. For information, see Create database roles
(in this topic). Grant privileges on database objects to a database role, and then grant the database role to a share.

Notify any data consumers of a share that includes the new database role. They must grant the new database role to their own account
roles to allow those roles to access the objects associated with the database role.

## Grant privileges directly to a share

This section provides instructions for data providers to allow consumers to access all databases and database objects in a share by
granting a single privilege on shared databases.

### Create a share

Use [CREATE SHARE](../sql-reference/sql/create-share.md) to create a share. At this step, the share is simply a container waiting for objects and
accounts to be added.

### Add objects to the share by granting privileges

Use [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) to grant the following object privileges to the share:

* USAGE privilege on the database you wish to share.
* USAGE privilege on each database schema containing the objects you wish to share.
* SELECT privilege for sharing specific objects in each shared schema:

  + Tables
  + External tables
  + Secure views
  + Secure materialized views
  + Secure UDFs

> **Important:**
>
> If you plan to securely share data with data consumers across different [regions](intro-regions.md) or
> [cloud platforms](intro-cloud-platforms.md), note that currently, replicating a primary database is blocked if one
> or more external tables exist in the database.

> **Note:**
>
> Streams cannot be shared directly. Avoid creating secure views on streams and then sharing those views with consumers.
> Instead, allow consumers to create their own streams on the tables and secure views that you share.
> For more information, see [Streams on shared objects](data-sharing-provider.md).

Optionally use [SHOW GRANTS](../sql-reference/sql/show-grants.md) to view the object grants for the share.

> **Tip:**
>
> Perform this minimal amount of validation of the share at this point, because after you complete
> the next step, the share is visible to all accounts that are added to the share.
>
> To perform a more in-depth validation of the share, you can simulate a consumer account in your account.
> For more details, refer to [Use secure objects to control data access](data-sharing-secure-views.md).

### Add one or more accounts to the share

Use [ALTER SHARE](../sql-reference/sql/alter-share.md) to add one or more accounts to the share. To review the accounts added to the share, you can
use [SHOW GRANTS](../sql-reference/sql/show-grants.md).

The share is now ready to be consumed by the specified accounts. For more detailed instructions for performing these and
other data provider tasks, refer to [Create and configure shares](data-sharing-provider.md).

### Example

The following example illustrates the entire provider process as described above.

Note that this example assumes:

> * A database named `sales_db` exists with a schema named `aggregates_eula` and a table named `aggregate_1`.
> * The database, schema, and table will be shared with two accounts named `xy12345` and `yz23456`.
>
> ```sqlexample
> USE ROLE accountadmin;
>
> CREATE SHARE sales_s;
>
> GRANT USAGE ON DATABASE sales_db TO SHARE sales_s;
> GRANT USAGE ON SCHEMA sales_db.aggregates_eula TO SHARE sales_s;
> GRANT SELECT ON TABLE sales_db.aggregates_eula.aggregate_1 TO SHARE sales_s;
>
> SHOW GRANTS TO SHARE sales_s;
>
> ALTER SHARE sales_s ADD ACCOUNTS=xy12345, yz23456;
>
> SHOW GRANTS OF SHARE sales_s;
> ```

---
title: Share unstructured data with a secure view
source: https://docs.snowflake.com/en/user-guide/unstructured-data-sharing.md
section: User Guide
---

# Share unstructured data with a secure view

This topic briefly covers how to share unstructured data files by using a secure view and [Secure Data Sharing](data-sharing-intro.md).
With Secure Data Sharing, data providers can share selected objects in a database from one Snowflake account
with data consumers in another Snowflake account.

For more information and additional examples, see [Create and configure shares](data-sharing-provider.md).

## Step 1: Create a secure view

First, use the [CREATE SECURE VIEW](../sql-reference/sql/create-view.md) command to create a secure view from unstructured data on a stage.
A view allows the result of a query to be accessed like a table, and a secure view is specifically designated for data privacy. For more information, see [Overview of Views](views-introduction.md).

You can allow data consumers to retrieve either scoped or pre-signed URLs from the secure view.
Scoped URLs provide better security, while pre-signed URLs can be accessed without authorization or authentication.
To choose the correct URL for your use case, see [Types of URLs available to access files](unstructured-intro.md).

> **Note:**
>
> Snowflake does not create scoped or pre-signed URLs until a user in a consumer account queries a secure view.
> This create-on-demand behavior helps you manage the lifetime of pre-signed URLs. To minimize the risk of leaking pre-signed URLs,
> you can also set a short time interval for the EXPIRATION_TIME parameter of the [GET_PRESIGNED_URL](../sql-reference/functions/get_presigned_url.md) function.

The following examples create secure views that allow data consumers to query the scoped or pre-signed URLs for a specific set of staged files.
Both views query the RELATIVE_PATH column in a directory table to retrieve the scoped or pre-signed URL.

### Scoped URL

This example calls the [BUILD_SCOPED_FILE_URL](../sql-reference/functions/build_scoped_file_url.md) function to create a secure view with the scoped URLs for a set of staged files.
The example passes the RELATIVE_PATH column in a directory table on a stage named `mystage` to the BUILD_SCOPED_FILE_URL function:

```sqlexample
CREATE OR REPLACE SECURE VIEW images_scoped_v AS
SELECT BUILD_SCOPED_FILE_URL(@mystage, relative_path) AS scoped_file_url
FROM DIRECTORY(@mystage);
```

You can also create a secure view from a subset of files on a stage so that you do not have to share the entire stage.
The following example creates a secure view of images on a stage where the `client_name` field is equal to `abc`:

```sqlexample
CREATE OR REPLACE SECURE VIEW images_for_client_abc AS
SELECT build_scoped_file_url(@myStage, relative_path) AS scoped_file_url
FROM directory(@mystage) d join clients c on d.relative_path = c.relative_path
WHERE c.client_name = 'abc';
```

### Pre-signed URL

This example calls the [GET_PRESIGNED_URL](../sql-reference/functions/get_presigned_url.md) function to retrieve the pre-signed URLs for a set of staged files.
The example specifies 60 seconds for the EXPIRATION_TIME parameter so that the pre-signed URLs will only be accessible for one minute.

```sqlexample
CREATE OR REPLACE SECURE VIEW images_presigned_v AS
SELECT GET_PRESIGNED_URL(@mystage, relative_path, 60) AS presigned_url
FROM DIRECTORY(@mystage);
```

## Step 2: Create a share

Next, create an empty share, and then grant access privileges for your secure view to the share.
Doing so adds the secure view object to the share.

The following example creates a share with the [CREATE SHARE](../sql-reference/sql/create-share.md) command
and then uses the [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) command to grant the SELECT privilege for a secure view to the share.

```sqlexample
CREATE SHARE my_share;
GRANT SELECT ON my_secure_view TO SHARE my_share;
```

## Step 3: Add accounts to the share

Finally, you must provide access for consumer accounts to your share by adding the accounts to your share.

The following example uses the [ALTER SHARE](../sql-reference/sql/alter-share.md) command to add an account named `consumer_account_1` to the share named `my_share`.

```sqlexample
ALTER SHARE my_share ADD ACCOUNTS=consumer_account_1;
```

After you complete this step, the `consumer_account_1` account can see the share and access the files in the secure view.

---
title: Sharing semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/sharing-semantic-views.md
section: User Guide
---

# Sharing semantic views

Providers can share semantic views in [private listings](../../collaboration/provider-listings-creating-publishing.md), in public listings on the [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace), and in [organizational listings](../collaboration/listings/organizational/org-listing-about.md).

## Share a semantic view in a listing

The example below describes how to share a semantic view on the Snowflake Marketplace.

SnowsightSQL

To use Snowsight to share a semantic view, follow these steps:

> **Note:**
>
> You can also attach a semantic view to a [private listing](../../collaboration/provider-listings-creating-publishing.md) or an [organizational listing](../collaboration/listings/organizational/org-listing-create.md).

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Snowflake Marketplace.
4. In the Create Listing window, enter a name for your listing.
5. Enter a subtitle and select a profile for your listing.
6. Select + Add data product.
7. Click + Select and select the database and schema that has the semantic view or views that you want to share.
8. In the database, select the semantic view or views that you want to attach to the listing.
9. To create the share, select Done and then Save.
10. Fill in the remaining details for the listing. For more information about these fields, refer to [Configure listings](../../collaboration/provider-listings-reference.md).

    * Access type

      + Free to offer a data product that is freely available to consumers.
      + Limited trial to offer a trial of your data product, with unlimited access to the data product available on request.
    * Description
    * Data dictionary
    * Business needs
    * Quick Start Examples
    * Categories
    * Documentation
    * Legal Terms
    * Attributes
    * Region Availability
11. Select Submit for approval, and then select one of the following:

    * Publish once approved
    * Submit for approval only

To use SQL to share a semantic view, follow these steps:

1. To create a share for your listing, use the [CREATE SHARE](../../sql-reference/sql/create-share.md) command:

   ```sqlexample
   CREATE SHARE my_share;
   ```
2. To ensure that the tables referenced in the view are also shared, run the following [GRANT <privilege> … TO SHARE](../../sql-reference/sql/grant-privilege-share.md) commands:

   ```sqlexample
   GRANT REFERENCES ON SEMANTIC VIEW my_view TO SHARE my_share;
   GRANT SELECT ON SEMANTIC VIEW my_view TO SHARE my_share;
   ```
3. Semantic views reference underlying tables. To ensure that the necessary privileges are granted on these tables, run the following [GRANT <privilege> … TO SHARE](../../sql-reference/sql/grant-privilege-share.md) command:

   ```sqlexample
   GRANT SELECT ON TABLE my_table TO SHARE my_share;
   ```

   Repeat this step for each table used by the semantic view.
4. To identify the tables that are referenced, run the [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md) command:

   ```sqlexample
   DESCRIBE SEMANTIC VIEW my_semantic_view;
   ```
5. To create a new secure object in the current account, use the [CREATE LISTING](../../sql-reference/sql/create-listing.md) command and attach the semantic view to the listing.

---
title: Sign in to Snowflake
source: https://docs.snowflake.com/en/user-guide/connecting.md
section: User Guide
---

# Sign in to Snowflake

You can sign in to Snowflake in several ways. If you’re getting started with Snowflake, start by using
[Snowsight](ui-snowsight.md), Snowflake’s browser-based web interface. After you get
comfortable with using Snowsight, you can explore other ways to connect.

To sign in to Snowflake, you need the following information:

* Your account identifier.

  All access to Snowflake is through your account identifier. If you don’t know your account identifier, ask
  your Snowflake account administrator. For more information, see [Account identifiers](admin-account-identifier.md).
* Your authentication method.

  The way you sign in to Snowflake depends on the authentication method used by your organization. Two
  common authentication methods for users are federated authentication with single sign-on (SSO) and password with
  multi-factor authentication (MFA):

  + **SSO:** Users authenticate with a third-party identity provider (IdP) instead of authenticating with Snowflake directly.
    Examples of IdPs are Okta and Microsoft Entra ID.
  + **Password with MFA:** When users sign in with a username and password, they must use a second factor of authentication.
    These users enter their username and password, and then use the second factor to complete the authentication. Currently,
    Snowflake allows the following MFA methods: passkeys, authenticator apps, and Duo.

  If you don’t know which authentication method your organization uses, check with your Snowflake account
  administrator.

  If your organization uses password with MFA authentication, you can configure MFA when you
  sign in to Snowsight.

  Other authentication methods are supported for both users and applications. For more information about authentication
  methods, see [Overview of Snowflake authentication](security-authentication-overview.md).

## Sign in by using Snowsight

You can access [Snowsight](ui-snowsight.md) over the public internet or through private connectivity
to the Snowflake service.

> **Note:**
>
> * Check with your Snowflake account administrator about instructions for signing in.
> * If your organization uses private connectivity to access Snowsight, see
>   [Using private connectivity](ui-snowsight-gs.md).

To access Snowsight over the public internet, complete the following steps:

1. In a supported web browser, navigate to <https://app.snowflake.com>.
2. Provide your [account identifier](admin-account-identifier.md) or account URL.
   If you previously signed in to Snowsight, you might see an account name that you can select.
3. Choose your authentication method, and then sign in.

If you are signing in to Snowsight for the first time, you might be prompted to configure
MFA. For instructions, see [Snowsight and MFA](ui-snowsight-gs.md).

For information about the tasks that you can perform in Snowsight, see
[Snowsight quick tour](ui-snowsight-quick-tour.md).

## Connect by using other methods

In addition to Snowsight, you can use the following methods to connect to Snowflake:

* Using [Snowflake CLI](../developer-guide/snowflake-cli/index.md).
* Using third-party client services and applications that support JDBC or ODBC.
* Developing applications that connect through the Snowflake connectors and drivers for Python, Node.js, Spark,
  and so on.

These methods require additional installation, configuration, and development tasks. For more information, see
[Applications and tools for connecting to Snowflake](../guides-overview-connecting.md).

---
title: Sign in to Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/signin-snowflake-customer.md
section: User Guide
---

# Sign in to Snowflake Open Catalog

This topic describes how to sign in to Snowflake Open Catalog.

## Sign in using your Open Catalog credentials

If your sign-in credentials are managed by Open Catalog, you can access Open Catalog over the public internet or through private connectivity:

* Using the internet
* Using private connectivity

### Using the internet

To access Open Catalog over the public internet, follow these steps:

1. In a supported web browser, navigate to
   <https://app.snowflake.com/>.
2. For **Account identifier**, enter your account identifier. If you’ve previously signed in to Open Catalog, you might see an account name
   that you can select.
3. Enter your username and password, and select **Sign in**.
4. If prompted, enter your multi-factor authentication (MFA) passcode.

### Using private connectivity

After completing the [configuration to use private connectivity](private-connectivity-ui-configure.md), access Open Catalog:

1. In a supported web browser, navigate to one of the following URLs for your Open Catalog account:

   * **PrivateLink Account URL**
   * **Regionless PrivateLink Account URL**

   To retrieve these URLs, see [Retrieve your PrivateLink URLs](private-connectivity-ui-configure.md).
2. Enter your username and password, and select **Sign in**.
3. If prompted, enter your multi-factor authentication (MFA) passcode.

## Sign in using SSO

If your sign-in credentials are managed by an identity provider, follow these steps to sign in:

1. In a supported web browser, navigate to
   <https://app.snowflake.com/>.
2. For **Account identifier**, enter your account identifier.
3. Select **Sign in using [identity provider name]**.
4. In the window that appears, enter your username and password and select **Sign in**.
5. If prompted, enter your multi-factor authentication (MFA) passcode.

---
title: Single-use refresh tokens for Snowflake OAuth security integrations
source: https://docs.snowflake.com/en/user-guide/single-use-refresh-tokens.md
section: User Guide
---

# Single-use refresh tokens for Snowflake OAuth security integrations

This topic describes how to enable single-use refresh tokens for [Snowflake OAuth security integrations](oauth-snowflake-overview.md).

Single-use refresh tokens are a feature that you can enable to prevent stolen refresh tokens from being reused in your Snowflake account.
When you enable single-use refresh tokens, the following changes occur to the behavior of the refresh token grant flow:

* You can only use a refresh token one time during the 90 days that the refresh token is valid.
* After you use a refresh token, the refresh token becomes invalid.
* The refresh token grant flow returns a new refresh token and a new access token, instead of only a new access token. The new refresh token
  will have the same expiration time as specified by OAUTH_REFRESH_TOKEN_VALIDITY when the [integration was created](../sql-reference/sql/create-security-integration-oauth-snowflake.md)
  (or the default system validity period, if not specified).
* After you get a new refresh token, all previous refresh tokens and access tokens become invalid.

## Benefits of single-use refresh tokens

Single-use refresh tokens offer the following security benefits:

* **Reduced effective token lifetime**: When a legitimate application uses a refresh token, any stolen copies of the refresh token become invalidated. This single-use behavior
  makes distributing stolen tokens and using stolen tokens in timed attacks more difficult.

  For example, if your application uses the refresh token grant flow one time every 10 minutes, then a malicious actor who steals the
  refresh token can only use the stolen token within 10 minutes, or before the application gets a new refresh token, even if the token is
  valid for 90 days.
* **Intrusion detection**: You cannot reuse a refresh token. When a refresh token is reused, all previous refresh tokens and access tokens
  become invalid.

  For example, if a malicious actor steals a single-use refresh token and attempts to reuse the token, then the attempt to reuse the
  single-use refresh token invalidates all previous refresh tokens and access tokens, which exposes the malicious use of a refresh token.

## Enabling single-use refresh tokens

You can enable refresh token rotation when you exchange an authorization code for an access token and a refresh token using an authorization
code grant flow.

You can enable single-use refresh tokens by using any of the following methods:

* Use a request parameter in the body of an HTTP request
* Set a property in a Snowflake OAuth security integration

### Use a request parameter in the body of an HTTP request

A client application can set the `enable_single_use_refresh_tokens` request parameter to `TRUE` in the body of an HTTP POST request to
the token request endpoint for [Snowflake OAuth](oauth-snowflake-overview.md) during the authorization code grant flow.

After a client application sets the `enable_single_use_refresh_tokens` request parameter to `TRUE` during the authorization code grant
flow, all future refresh token grant flows return a new refresh token and a new access token, and invalidates all previous access tokens and
refresh tokens.

For example, you can make the following HTTP POST request to do an **authorization code grand flow** and set the
`enable_single_use_refresh_tokens` request parameter to `TRUE` to get your first access token and refresh token:

HTTP requestHTTP response

```http
POST /oauth/token-request HTTP/1.1
Host: <my_subdomain>.snowflakecomputing.com
Authorization: Basic <client_id:client_secret>
Content-Type: application/x-www-form-urlencoded

grant_type=authorization_code&code=123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ12345&redirect_uri=http://127.0.0.1:8080&enable_single_use_refresh_tokens=true
```

```json
{
  "access_token":  "<your_new_access_token>",
  "expires_in": 600,
  "refresh_token": "<your_new_refresh_token>",
  "token_type": "Bearer",
  "username": "<user1>",
}
```

You can then make the following HTTP POST request to do a **refresh token grant flow**, using your old refresh token, to get a new access
token and a new refresh token:

HTTP requestHTTP response

```http
POST /oauth/token-request HTTP/1.1
Host: <my_subdomain>.snowflakecomputing.com
Authorization: Basic <client_id:client_secret>
Content-Type: application/x-www-form-urlencoded

grant_type=refresh_token&refresh_token=<your_old_refresh_token>
```

```json
{
  "access_token":  "<your_new_access_token>",
  "expires_in": 600,
  "refresh_token": "<your_new_refresh_token>",
  "token_type": "Bearer",
}
```

### Set a property in a Snowflake OAuth security integration

If a client application updates its cached refresh token after each refresh token grant flow, then you can enable single-use refresh tokens
for a [Snowflake OAuth security integration](oauth-snowflake-overview.md) by setting the
`OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED` property to `TRUE`.

After you enable single-use refresh tokens for a Snowflake OAuth security integration, all authorization code grant flows and refresh token
grant flows that use the `client_id` of the security integration issue single-use refresh tokens, regardless of whether the client
application specifies the `enable_single_use_refresh_tokens` request parameter during an authorization code grant flow.

For example, you can use [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-snowflake.md) to enable
single-use refresh tokens for a Snowflake OAuth security integration:

```sqlexample
ALTER SECURITY INTEGRATION my_integration
  SET OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = TRUE;
```

---
title: SnowCD (Connectivity Diagnostic Tool)
source: https://docs.snowflake.com/en/user-guide/snowcd.md
section: User Guide
---

# SnowCD (Connectivity Diagnostic Tool)

SnowCD (i.e. Snowflake Connectivity Diagnostic Tool) helps users to diagnose and troubleshoot their network connection to Snowflake.

## Overview

SnowCD leverages the Snowflake hostname IP addresses and ports listed by either the `SYSTEM$ALLOWLIST()` or `SYSTEM$ALLOWLIST_PRIVATELINK()` functions to run a series of connection checks to evaluate and help troubleshoot the network connection to Snowflake.

> **Important:**
>
> If your Snowflake account uses private connectivity to the Snowflake service, execute the
> [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md) function to obtain the Snowflake hostname IP address and ports to evaluate
> and troubleshoot network connections to Snowflake.
>
> For more information, see:
>
> * [AWS PrivateLink and Snowflake](admin-security-privatelink.md)
> * [Azure Private Link and Snowflake](privatelink-azure.md)
> * [Google Cloud Private Service Connect and Snowflake](private-service-connect-google.md)

SnowCD returns one of the following:

> 1. `All checks passed` to indicate a healthy network connection.
> 2. A message to state that one or more checks failed with a troubleshooting suggestion.

Users can leverage SnowCD to evaluate the network connection to Snowflake at any time to verify the required configuration settings are correct. For example, users can integrate SnowCD into these use cases:

> 1. Automated deployment scripts.
> 2. A prerequisite check before deploying a service that connects to Snowflake.
> 3. Environment checks while starting a new machine.
> 4. Periodic checks on running machines.

SnowCD works with either direct connections or connections through proxy servers.

SnowCD checks access to the Snowflake database and to stages used to temporarily store
data (for example, for loading).

SnowCD verifies that an HTTP response was returned from the HTTP host. This can detect
problems such as the following:

* No HTTP server is running at the specified IP address and port.
* There was a DNS (Domain Name System) lookup failure.
* A Man-In-The-Middle attack occurred and used an invalid certificate to impersonate
  the desired service.
* Certain types of other network failures below the HTTP level.

SnowCD does not detect all possible problems. The known limitations include:

* Stages require additional authentication information that SnowCD does not have.
  Although SnowCD verifies basic access to a stage, SnowCD does not perform a strict
  check on the HTTP response code from the stage. Therefore, SnowCD does not detect
  problems such as:

  + Access policy denial for Amazon S3 Bucket, Azure Blob storage, or Google Cloud Storage
    for stages.
  + There is a problem connecting to the customer’s proxy server, for example the proxy server returns an HTTP
    403 error.

Because SnowCD does not detect all possible problems, Snowflake recommends that after you successfully verify access to a stage
through SnowCD, you follow up by running a PUT command to load a file to the stage.
The simplest way to run a PUT command is usually through SnowSQL.

> **Attention:**
>
> Troubleshooting one or more network connection issues is challenging. Depending on the environment, it may be necessary to use SnowCD with other troubleshooting approaches. For example, if SnowCD returns information on an OCSP issue, consult the OCSP sections on this page.

## Using SnowCD

### Step 1: Run the SYSTEM$ALLOWLIST or SYSTEM$ALLOWLIST_PRIVATELINK Function

This is a prerequisite step and needs to be completed once unless the hostnames or ports change.

1. Connect to Snowflake through the web interface.
2. Execute `SELECT SYSTEM$ALLOWLIST();` or `SELECT SYSTEM$ALLOWLIST_PRIVATELINK();`.
3. Save the query result to a file (i.e. `allowlist.json`).

For more information about these functions, see [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) or
[SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md).

The examples below use JSON as the output format from the corresponding SQL function. SnowCD accepts either JSON, CSV, or TSV format as its input for Step 3: Run SnowCD.

To save the query result in CSV or TSV format, in the web interface, click the Download or View Results icon, select either CSV or TSV, and click Export.

**Example file (Not indented and redacted)**

Execute [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) and save the output to a file (e.g. `allowlist.json`).

> Where:
>
> > `<storage_location>` is the storage location (Amazon S3, Google Cloud Storage, or Microsoft Azure) that stores the file that a Snowflake
> > client can read or write.
> >
> > `<region_id>` is the AWS region where your VPCs and Snowflake account are located.
>
> ```sqljson
> [{"type":"STAGE","host":"<storage_location>.s3.<region_id>.amazonaws.com","port":443},
> {"type":"STAGE","host":"<storage_location>.s3-<region_id>.amazonaws.com","port":443},
> {"type":"STAGE","host":"<storage_location>.s3.amazonaws.com","port":443},
> {"type":"SNOWSQL_REPO","host":"<repository_name_1>.s3.<region_id>.amazonaws.com","port":443},
> {"type":"SNOWSQL_REPO","host":"<repository_name_2>.snowflakecomputing.com","port":443},
> {"type":"OUT_OF_BAND_TELEMETRY","host":"<telemetry_subdomain>.snowflakecomputing.com","port":443},
> {"type":"OCSP_CACHE","host":"ocsp.snowflakecomputing.com","port":80},
> {"type":"OCSP_RESPONDER","host":"ocsp.digicert.com","port":80}]
> ```

**Example file (Indented and redacted)**

Execute [SYSTEM$ALLOWLIST](../sql-reference/functions/system_allowlist.md) and save the output to a file (e.g. `allowlist.json`).

> Where:
>
> > `<storage_location>` is the storage location (Amazon S3, Google Cloud Storage, or Microsoft Azure) that stores the file that a Snowflake
> > client can read or write.
> >
> > `<region_id>` is the AWS region where your VPCs and Snowflake account are located.
>
> ```sqljson
> [{
>   "type": "STAGE",
>   "host": "<storage_location>.s3.<region_id>.amazonaws.com",
>   "port": 443
> }, {
>   "type": "STAGE",
>   "host": "<storage_location>.s3-<region_id>.amazonaws.com",
>   "port": 443
> }, {
>   "type": "STAGE",
>   "host": "<storage_location>.s3.amazonaws.com",
>   "port": 443
> }, {
>   "type": "SNOWSQL_REPO",
>   "host": "<repository_name_1>.s3.<region_id>.amazonaws.com",
>   "port": 443
> }, {
>   "type": "SNOWSQL_REPO",
>   "host": "<repository_name_2>.snowflakecomputing.com",
>   "port": 443
> }, {
>   "type": "OUT_OF_BAND_TELEMETRY",
>   "host": "<telemetry_subdomain>.snowflakecomputing.com",
>   "port": 443
> }, {
>   "type": "OCSP_CACHE",
>   "host": "ocsp.snowflakecomputing.com",
>   "port": 80
> }, {
>   "type": "OCSP_RESPONDER",
>   "host": "ocsp.digicert.com",
>   "port": 80
> }]
> ```

**Example file (Not indented and redacted)**

Execute [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md) and save the output to a file (e.g. `allowlist.json`).

> Where:
>
> > `<storage_location>` is the storage location (Amazon S3, Google Cloud Storage, or Microsoft Azure) that stores the file that a Snowflake
> > client can read or write.
> >
> > `<region_id>` is the AWS region where your VPCs and Snowflake account are located.
> >
> > ```sqljson
> > [{"type":"SNOWFLAKE_DEPLOYMENT","host":"<storage_location>.<region>.privatelink.snowflakecomputing.com","port":443},
> > {"type":"STAGE","host":"<storage_location>.<region>.amazonaws.com","port":443},
> > {"type":"STAGE","host":"<storage_location>-<region>.amazonaws.com","port":443},
> > {"type":"STAGE","host":"<storage_location>.amazonaws.com","port":443},
> > {"type":"SNOWSQL_REPO","host":"<repository_name_1>.s3.<region>.amazonaws.com","port":443},
> > {"type":"SNOWSQL_REPO","host":"<repository_name_2>.snowflakecomputing.com","port":443},
> > {"type":"OUT_OF_BAND_TELEMETRY","host":"<telemetry_subdomain>.snowflakecomputing.com","port":443},
> > {"type":"OCSP_CACHE","host":"ocsp.<storage_location>.<region>.privatelink.snowflakecomputing.com","port":80}]
> > ```

**Example file (Indented and redacted)**

Execute [SYSTEM$ALLOWLIST_PRIVATELINK](../sql-reference/functions/system_allowlist_privatelink.md) and save the output to a file (e.g. `allowlist.json`).

> Where:
>
> > `<storage_location>` is the storage location (Amazon S3, Google Cloud Storage, or Microsoft Azure) that stores the file that a Snowflake
> > client can read or write.
> >
> > `<region_id>` is the AWS region where your VPCs and Snowflake account are located.
> >
> > ```sqljson
> > [{
> >   "type": "SNOWFLAKE_DEPLOYMENT",
> >   "host": "<storage_location>.<region>.privatelink.snowflakecomputing.com",
> >   "port": 443
> > }, {
> >   "type": "STAGE",
> >   "host": "<storage_location>.<region>.amazonaws.com",
> >   "port": 443
> > }, {
> >   "type": "STAGE",
> >   "host": "<storage_location>-<region>.amazonaws.com",
> >   "port": 443
> > }, {
> >   "type": "STAGE",
> >   "host": "<storage_location>.amazonaws.com",
> >   "port": 443
> > }, {
> >   "type": "SNOWSQL_REPO",
> >   "host": "<repository_name_1>.s3.<region>.amazonaws.com",
> >   "port": 443
> > }, {
> >   "type": "SNOWSQL_REPO",
> >   "host": "<repository_name_2>.snowflakecomputing.com",
> >   "port": 443
> > }, {
> >   "type": "OUT_OF_BAND_TELEMETRY",
> >   "host": "<telemetry_subdomain>.snowflakecomputing.com",
> >   "port": 443
> > }, {
> >   "type": "OCSP_CACHE",
> >   "host": "ocsp.<storage_location>.<region>.privatelink.snowflakecomputing.com",
> >   "port": 80
> > }]
> > ```

> **Attention:**
>
> Save the `allowlist.json` file in the location where other external allowed hostnames and ports are defined for your environment.

> **Tip:**
>
> If you do not want the output in JSON format and instead prefer table format, execute the following:
>
> ```sqlexample
> use warehouse my_warehouse;
> select value:type as type,
>        value:host as host,
>        value:port as port
>    from table(flatten(input => parse_json(system$allowlist())));
> ```

### Step 2: Download and Install SnowCD

#### Linux

To download and install SnowCD on Linux, complete the following steps:

1. Download the latest version of the SnowCD from the [SnowCD Download](https://developers.snowflake.com/snowcd/) page.
2. Open the Linux Terminal application and navigate to the directory where you downloaded the file.
3. Verify the SHA256 checksum matches.

   > ```bash
   > $ sha256sum <filename>
   > ```
4. Extract the file.

   > ```bash
   > $ gunzip <filename>
   > ```
5. Make the file executable.

   > ```bash
   > $ chmod +x <filename>
   > ```
6. Rename the executable to `snowcd`.

   > ```bash
   > $ mv <filename> snowcd
   > ```

> **Note:**
>
> Linux users running RHEL or CentOS can install SnowCD using yum while Debian users can install using apt.

#### macOS

To download and install SnowCD on macOS, complete the following steps:

1. Download the latest version of the notarized SnowCD `pkg` file from the [SnowCD Download](https://developers.snowflake.com/snowcd/) page.

   The pkg files use the following naming convention:

   > snowcd-<version_number>-darwin_x86_64.pkg

   For example:

   > snowcd-1.0.5-darwin_x86_64.pkg
2. Open the Terminal application and navigate to the directory where you downloaded the file.
3. Verify the SHA256 checksum matches.

   To get the checksum of the file, execute the command:

   ```bash
   $ shasum -a 256 <filename>
   ```

   Compare the checksum of the file to the checksum shown at the download site.
4. Open the Finder application and navigate to the directory where you downloaded the pkg file.
5. Extract and install SnowCD by double clicking on the pkg file.

The files, including the snowcd executable, are installed in the /opt/snowflake/snowcd directory.

#### Windows

To download and install SnowCD on Windows, complete the following steps:

1. Download the latest version of the SnowCD from the [SnowCD Download](https://developers.snowflake.com/snowcd/) page.
2. Run the MSI file using the Windows Installer.

### Step 3: Run SnowCD

Before running SnowCD in macOS and Linux environments, you can add its directory to your `$PATH`. In Windows environments, you can add SnowCD to your Environment Variables.

1. In macOS or Linux environments, you can run the snowcd executable from the
   command line by executing `snowcd <path_to_allowlist.json> [flags]`.
2. In Windows environments, execute `snowcd.exe <path_to_allowlist.json> [flags]`.

> **Tip:**
>
> For a full description on the flags `snowcd` supports, execute `snowcd -h`.

If all checks are valid, SnowCD returns the number of checks on the number of hosts with the message `All checks passed` as follows.

```text
Performing 30 checks on 12 hosts
All checks passed
```

If you try to run SnowCD without passing in the JSON allow list information from SELECT SYSTEM$ALLOWLIST(), the following error message displays as a reminder to include the file, with the list of currently supported flags, their data type where applicable, and a brief description of the flag.

```text
Error: please provide whitelist generated by SYSTEM$ALLOWLIST()
Usage:
./snowcd <path to input json file> [flags]

Examples:
./snowcd test.json

Flags:
  -h, --help                   help for ./snowcd
  --logLevel string            log level (panic, fatal[default], error, warning, info, debug, trace) (default "fatal")
  --logPath string             Output directory for log. When not specified, no log is generated
  --proxyHost string           host for http proxy. (When not specified, does not use proxy at all)
  --proxyIsHTTPS               Is connection to proxy secure, i.e. https. (default false)
  --proxyPassword string       password for http proxy.(default empty)
  --proxyPort int              port for http proxy.(default 8080) (default 8080)
  --proxyUser string           user name for http proxy.(default empty)
  -t, --timeout int            timeout for each hostname's checks in seconds (default 5) (default 5)
  --version                    version for ./snowcd
```

If SnowCD detects an incorrect setting or configuration, information on the failed check(s) displays with a
troubleshooting suggestion. For example, the response below indicates an invalid hostname.

```text
Check for 1 hosts failed, display as follow:
==============================================
Host: www.google1.com
Port: 443
Type: SNOWFLAKE_DEPLOYMENT
Failed Check: DNS Check
Error: lookup www.google1.com: no such host
Suggestion: Check your configuration on DNS server
```

### Using SnowCD with an HTTP Proxy

SnowCD can be run against an HTTP proxy to determine its connectivity status.

> **Important:**
>
> Currently, Snowflake does not support SSL-terminating proxy servers.
>
> During the configuration of your firewall and proxy allow list, use SSL pass through (i.e. bypass SSL decryption).

Using Linux as a representative example, execute the following command to run SnowCD against a proxy, replacing the
flag values where necessary.

```text
snowcd allowlist.json \
  --proxyHost <hostname> \
  --proxyPort <port_number> \
  --proxyUser <username> \
  --proxyPassword <password>
```

Logging is optional and you can add the two logging flags to the proxy command. It is important to include a path to the log
file to ensure logging occurs when running the command.

```text
snowcd allowlist.json \
  --proxyHost <hostname> \
  --proxyPort <port_number> \
  --proxyUser <username> \
  --proxyPassword <password> \
  --logLevel trace \
  --logPath test.log
```

After executing this command, you can view the trace in the `test.log` file.

---
title: Snowflake Catalog SDK
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-catalog.md
section: User Guide
---

# Snowflake Catalog SDK

The Snowflake Catalog SDK is available for Apache Iceberg™ versions 1.2.0 or later.

With the Snowflake Catalog SDK, you can query [Iceberg tables](tables-iceberg.md) using a third-party engine
such as Apache Spark™ or Trino.

## Supported catalog operations

The SDK supports the following commands for browsing Iceberg metadata in Snowflake:

* SHOW NAMESPACES
* USE NAMESPACE
* SHOW TABLES
* USE DATABASE
* USE SCHEMA

The SDK currently supports read operations (SELECT statements) only.

## Install and connect

To install the Snowflake Catalog SDK,
download the [latest version of the Iceberg libraries](https://iceberg.apache.org/releases/).

Before you can use the Snowflake Catalog SDK, you need a Snowflake database
with one or more Iceberg tables.
To create an Iceberg table, see [Create an Apache Iceberg™ table in Snowflake](tables-iceberg-create.md).

After you establish a connection and the SDK confirms that Iceberg metadata is present,
Snowflake accesses your Parquet data using the external volume that is associated with your Iceberg table(s).

## Examples using Spark

> **Note:**
>
> To learn about using Trino with the Snowflake Catalog SDK, see the
> [Trino documentation](https://trino.io/docs/current/object-storage/metastores.html#iceberg-snowflake-catalog).

To read table data with the SDK, start by configuring the following properties for your Spark cluster:

```bash
spark-shell --packages org.apache.iceberg:iceberg-spark-runtime-3.3_2.13:1.2.0,net.snowflake:snowflake-jdbc:3.13.28
# Configure a catalog named "snowflake_catalog" using the standard Iceberg SparkCatalog adapter
--conf spark.sql.catalog.snowflake_catalog=org.apache.iceberg.spark.SparkCatalog
# Specify the implementation of the named catalog to be Snowflake's Catalog implementation
--conf spark.sql.catalog.snowflake_catalog.catalog-impl=org.apache.iceberg.snowflake.SnowflakeCatalog
# Provide a Snowflake JDBC URI with which the Snowflake Catalog will perform low-level communication with Snowflake services
--conf spark.sql.catalog.snowflake_catalog.uri='jdbc:snowflake://<account_identifier>.snowflakecomputing.com'
# Configure the Snowflake user on whose behalf to perform Iceberg metadata lookups
--conf spark.sql.catalog.snowflake_catalog.jdbc.user=<user_name>
# Provide the user password. To configure the credentials, you can provide either password or private_key_file.
--conf spark.sql.catalog.snowflake_catalog.jdbc.password=<password>
# Configure the private_key_file to use when connecting to Snowflake services; additional connection options can be found at https://docs.snowflake.com/en/user-guide/jdbc-configure.html
--conf spark.sql.catalog.snowflake_catalog.jdbc.private_key_file=<location of the private key>
```

> **Note:**
>
> You can use any Snowflake-supported [JDBC driver connection parameter](../developer-guide/jdbc/jdbc-parameters.md)
> in your configuration by using the following syntax: `--conf spark.sql.catalog.snowflake_catalog.jdbc.property-name=property-value`

After you configure your Spark cluster, you can check which tables are available to query. For example:

```scala
spark.sessionState.catalogManager.setCurrentCatalog("snowflake_catalog");
spark.sql("SHOW NAMESPACES").show()
spark.sql("SHOW NAMESPACES IN my_database").show()
spark.sql("USE my_database.my_schema").show()
spark.sql("SHOW TABLES").show()
```

Then you can select a table to query.

```scala
spark.sql("SELECT * FROM my_database.my_schema.my_table WHERE ").show()
```

You can use the `DataFrame` structure with languages like Python and Scala to query data.

```scala
df = spark.table("my_database.my_schema.my_table")
df.show()
```

> **Note:**
>
> If you receive vectorized read errors while running queries, you can disable the vectorized reads for your session
> by configuring: `spark.sql.iceberg.vectorization.enabled=false`. To keep using vectorized reads,
> you can set the [STORAGE_SERIALIZATION_POLICY](../sql-reference/parameters.md) parameter.

## Query caching

When you issue a query, Snowflake caches the result within a certain time frame (90 seconds by default).
You might experience latency up to that duration. If you plan to access data programmatically for comparison purposes,
you can set the `spark.sql.catalog.cache-enabled` property to `false` to disable caching.

If your application is designed to tolerate a specific amount of latency, you can use the following property
to specify the latency period: `spark.sql.catalog.cache.expiration-interval-ms`.

## Limitations

The following limitations apply to the Snowflake Catalog SDK and are subject to change:

> * The SDK currently supports read operations (SELECT statements) only.
> * Only Apache Spark and Trino are supported for reading Iceberg tables.
> * You cannot use the SDK to access non-Iceberg Snowflake tables.

---
title: Snowflake client connectivity and troubleshooting
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/overview.md
section: User Guide
---

# Snowflake client connectivity and troubleshooting

This topic provides an architecture overview explaining the various service endpoints required for normal client operations. It also provides a methodology for self-service troubleshooting general connectivity issues and error patterns for JDBC, ODBC, and for Snowflake Connector for Python and Snowflake CLI as additional references.

* Architecture
* [Common connectivity issues and resolutions](common-issues.md)
* [Troubleshooting steps](troubleshooting-steps.md)
* [Error messages](error-messages.md)

> **Note:**
>
> The term client as used in this article refers to any custom or third-party application using a Snowflake command-line client (e.g., [Snowflake CLI](../../developer-guide/snowflake-cli/index.md)), driver (e.g., [Go](../../developer-guide/golang/go-driver.md), [JDBC](../../developer-guide/jdbc/jdbc.md), [NodeJs](../../developer-guide/node-js/nodejs-driver.md), [ODBC](../../developer-guide/odbc/odbc.md), [PHP](../../developer-guide/php-pdo/php-pdo-driver.md), [Python](../../developer-guide/python-connector/python-connector.md)), or API (e.g., [Snowpipe REST API](../data-load-snowpipe-rest-apis.md), [SQL API](../../developer-guide/sql-api/index.md)). For completeness, it also includes browser access to the [Snowflake Web Interface](../ui-snowsight.md).

## Architecture

For more information regarding the configuration steps for the architectures, refer to [Securing Snowflake](../../guides-overview-secure.md).

1 Configuration details for this feature are out of scope for this article. For more information, refer to [Securing Snowflake](../../guides-overview-secure.md).

---
title: Snowflake Connector for Kafka
source: https://docs.snowflake.com/en/user-guide/kafka-connector.md
section: User Guide
---

# Snowflake Connector for Kafka

The Snowflake Connector for Kafka (“Kafka connector”) reads data from one or more [Apache Kafka](https://kafka.apache.org/) topics and loads the data into a Snowflake table.

**Next Topics:**

* [Overview of the Kafka connector](kafka-connector-overview.md)
* [Installing and configuring the Kafka connector](kafka-connector-install.md)
* [Managing the Kafka connector](kafka-connector-manage.md)
* [Monitoring the Kafka connector using Java Management Extensions (JMX)](kafka-connector-monitor.md)
* [Loading protobuf data using the Snowflake Connector for Kafka](kafka-connector-protobuf.md)
* [Using the Snowflake Connector for Kafka with Apache Iceberg™ tables](kafka-connector-iceberg.md)
* [Troubleshooting the Kafka connector](kafka-connector-ts.md)
* [Snowflake Connector for Kafka with Snowpipe Streaming classic](snowpipe-streaming/snowpipe-streaming-classic-kafka.md)

---
title: Snowflake Connector for Kafka with Snowpipe Streaming classic
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka.md
section: User Guide
---

# Snowflake Connector for Kafka with Snowpipe Streaming classic

You can replace Snowpipe with [Snowpipe Streaming](data-load-snowpipe-streaming-overview.md) in your data loading chain from Kafka. When the specified flush buffer threshold (time, memory, or number of messages) is reached, the connector calls the Snowpipe Streaming API (“API”) to write rows of data to Snowflake tables. This architecture results in lower load latencies with corresponding lower costs for loading similar volumes of data.

Version 2.0.0 (or later) of the Kafka connector is required for use with Snowpipe Streaming Classic. The Kafka connector with Snowpipe Streaming Classic includes the Snowflake Ingest SDK and supports streaming rows from Apache Kafka topics directly into target tables.

## Minimum required version

The minimum required Kafka connector version that supports Snowpipe Streaming is 2.0.0.

## Kafka configuration properties

Save your connection settings in the Kafka connector properties file. For more information, see [Configuring the Kafka connector](../kafka-connector-install.md).

### Required properties

Add or edit your connection settings in the Kafka connector properties file. For more information, see [Configuring the Kafka connector](../kafka-connector-install.md).

`snowflake.ingestion.method`
:   *Required only if using the Kafka connector as the streaming ingest client.* Specifies whether to use Snowpipe Streaming or
    standard Snowpipe to load your Kafka topic data. The supported values are as follows:

    * `SNOWPIPE_STREAMING`
    * `SNOWPIPE` (default)

    No additional settings are required to choose the backend service to queue and load topic data. Configure additional properties in your
    Kafka connector properties file as usual.

`snowflake.role.name`
:   Access control role to use when inserting the rows into the table.

### Client optimization properties

`enable.streaming.client.optimization`
:   Specifies whether to enable one-client optimization. This property is supported by Kafka connector release version 2.1.2 and later. It is enabled by default.

    With one-client optimization, only one client is created for multiple topic partitions per Kafka connector. This feature can reduce client runtime and lower migration cost by creating larger files.

    Values:
    :   * `true`
        * `false`

    Default:
    :   `true`

    Note that in a high throughput scenario (for example, 50 MB/s per connector), enabling this property can result in a higher latency or cost. We recommend that you disable this property for high-throughput scenarios.

### Buffer and polling properties

`buffer.flush.time`
:   Number of seconds between buffer flushes; each flush results in insert operations for the buffered records. The Kafka connector calls the Snowpipe Streaming API once after each flush.

    The minimum value supported for the `buffer.flush.time` property is `1` (in seconds). For higher average data flow rates, we suggest that you decrease the default value for improved latency. If cost is a greater concern than latency, you could increase the buffer flush time. Be careful to flush the Kafka memory buffer before it becomes full to avoid out-of-memory exceptions.

    Values:
    :   * Minimum: `1`
        * Maximum: No upper limit

    Default:
    :   `10`

    Note that Snowpipe Streaming automatically flushes data every one second, which is different from the buffer flush time for the Kafka connector. After the Kafka buffer flush time is reached, data will be sent with one second latency to Snowflake through Snowpipe Streaming. For more information, see [Snowpipe Streaming latency](snowpipe-streaming-classic-recommendation.md).

`buffer.count.records`
:   Number of records buffered in memory per Kafka partition before ingesting to Snowflake.

    Values:
    :   * Minimum: `1`
        * Maximum: No upper limit

    Default:
    :   `10000`

`buffer.size.bytes`
:   Cumulative size in bytes of records buffered in memory per the Kafka partition before they are ingested in Snowflake as data files.

    The records are compressed when they are written to data files. As a result, the size of the records in the buffer may be larger than the size of the data files created from the records.

    Values:
    :   * Minimum: `1`
        * Maximum: No upper limit

    Default:
    :   `20000000` (20 MB)

`snowflake.streaming.max.client.lag`
:   Specifies how often [Snowflake Ingest Java](https://github.com/snowflakedb/snowflake-ingest-java) flushes the data to Snowflake, in seconds.

    A low value keeps the latency low, but it might result in a worse query performance especially when `snowflake.streaming.enable.single.buffer` is enabled.
    For more information, see the [recommended latency configurations for Snowpipe Streaming](snowpipe-streaming-classic-recommendation.md).

    Values:
    :   * Minimum: `1` second
        * Maximum: `600` seconds

    Default:
    :   `30` seconds for version 3.1.1 and later, `120` seconds for versions 3.0.0 and 3.1.0, `1` second otherwise

`snowflake.streaming.enable.single.buffer`
:   Specifies whether to enable single buffer for Snowpipe Streaming and to skip buffering data in the connector’s internal buffer.

    This property is supported by the Kafka connector version 2.3.1 and later.

    Streaming connector uses internal buffer alongside with the one provided by [Snowflake Ingest Java](https://github.com/snowflakedb/snowflake-ingest-java).
    Setting this property to `true` makes Kafka connector skip the internal buffer in order to achieve lower latency.

    Note that setting this property to `true` makes `buffer.flush.time` and `buffer.count.records` irrelevant.

    Values:
    :   * `true`
        * `false`

    Default:
    :   `true` for version 3.0.0 and later, `false` otherwise

In addition to the Kafka connector properties, note the Kafka consumer `max.poll.records` property, which controls the maximum number of records returned by Kafka to Kafka Connect in a single poll. The default value of `500` can be increased, but be mindful of memory constraints. For more information about this property, see the documentation for your Kafka package:

* [Apache Kafka](https://kafka.apache.org/documentation/#consumerconfigs_max.poll.records)
* [Confluent](https://docs.confluent.io/platform/current/installation/configuration/consumer-configs.html#consumerconfigs_max.poll.records)

### Error handling and DLQ properties

`errors.tolerance`
:   Specifies how to handle errors encountered by the Kafka connector:

    This property supports the following values:

    Values:
    :   * `NONE`: Stop loading data when the first error is encountered.
        * `ALL`: Ignore all errors and continue to load data.

    Default:
    :   `NONE`

`errors.log.enable`
:   Specifies whether to write error messages to the Kafka Connect log file.

    This property supports the following values:

    Values:
    :   * `TRUE`: Write error messages.
        * `FALSE`: Do not write error messages.

    Default:
    :   `FALSE`

`errors.deadletterqueue.topic.name`
:   Specifies the name of the DLQ (dead-letter queue) topic in Kafka for delivering messages to Kafka that could not be ingested into Snowflake tables. For more information, see Dead-letter Queues (in this topic).

    Values:
    :   Custom text string

    Default:
    :   None

## Exactly-once semantics

Exactly-once semantics ensure the delivery of Kafka messages without duplication or data loss. This delivery guarantee is set by default for the Kafka connector with Snowpipe Streaming.

The Kafka connector adopts a one-to-one mapping between partition and channel and uses two distinct offsets:

> * Consumer offset: This tracks the most recent offset consumed by the consumer and is managed by Kafka.
> * Offset token: This tracks the most recent committed offset in Snowflake and is managed by Snowflake.

Note that the Kafka connector doesn’t always handle missing offsets. Snowflake expects that all records to have sequentially increasing offsets. The missing offsets will break the Kafka connector in specific use cases. It is recommended that you use tombstone records instead of NULL records.

The Kafka connector achieves exactly-once delivery by implementing the following best practices:

Opening/reopening a channel:

> * When opening or reopening a channel for a given partition, the Kafka Connector uses the latest committed offset token retrieved from Snowflake through the `getLatestCommittedOffsetToken` API as the source of truth and resets the consumer offset in Kafka accordingly.
> * If the consumer offset is no longer within the data retention period, an exception is thrown, and you can determine the appropriate action to take.
> * The only scenario in which the Kafka Connector does not reset the consumer offset in Kafka and uses it as the source of truth is when the offset token from Snowflake is NULL. In this case, the connector accepts the offset sent by Kafka, and the offset token is subsequently updated.

Processing records:

> * To ensure an additional layer of safety against non-continuous offsets that could arise from potential bugs in Kafka, Snowflake maintains an in-memory variable that tracks the latest processed offset. Snowflake only accepts rows if the current row’s offset equals the latest processed offset plus one, thereby adding an extra layer of protection to ensure that the ingestion process is continuous and accurate.

Dealing with exceptions, failures, crashes recovery:

> * As part of the recovery process, Snowflake consistently adheres to the channel open/reopen logic outlined earlier by reopening the channel and resetting the consumer offset with the latest committed offset token. By doing this, Snowflake signals Kafka to send the data from the offset value that is one greater than the latest committed offset token, which enables the resumption of ingestion from the point of failure with no data loss.

Implementing a retry mechanism:

> * To account for potential transient issues, Snowflake incorporates a retry mechanism in the API calls. Snowflake retries these API calls multiple times to increase the chances of success and mitigate the risk of intermittent failures affecting the ingestion process.

Advancing the consumer offset:

> * At regular intervals, Snowflake advances the consumer offset using the latest committed offset token to ensure that the ingestion process is continuously aligned with the latest state of data in Snowflake.

## Converters

Snowpipe Streaming supports many community-based converters such as the following:

* `io.confluent.connect.avro.AvroConverter`
* `org.apache.kafka.connect.json.JsonConverter`
* `io.confluent.connect.protobuf.ProtobufConverter`
* `io.confluent.connect.json.JsonSchemaConverter`
* `org.apache.kafka.connect.converters.ByteArrayConverter`
* `org.apache.kafka.connect.storage.StringConverter`

Other community-based converters may be supported but have not been validated. Snowflake converters are not supported with Snowpipe Streaming.

## Dead-letter queues

The Kafka connector with Snowpipe Streaming supports dead-letter queues (DLQ) for broken records or records that cannot be processed successfully due to a failure.

For more information about monitoring, see the Apache Kafka [documentation](https://kafka.apache.org/documentation/#connect_monitoring).

## Schema detection and schema evolution

The Kafka connector with Snowpipe Streaming supports schema detection and evolution. The structure of tables in Snowflake can be defined and evolved automatically to support the structure of new Snowpipe Streaming data loaded by the Kafka connector.
To enable schema detection and evolution for the Kafka connector with Snowpipe Streaming, configure the following Kafka properties:

* `snowflake.ingestion.method`
* `snowflake.enable.schematization`
* `schema.registry.url`

For more information, see [Schema detection and evolution for Kafka connector with Snowpipe Streaming classic](snowpipe-streaming-classic-kafka-schema-detection.md).

## Estimating ingestion latency

To estimate ingestion latency, use the `SnowflakeConnectorPushTime` field in RECORD_METADATA.
This timestamp represents a point in time when a record was pushed into an Ingest SDK buffer.

For more information about the RECORD_METADATA format, see [Schema of tables for Kafka topics](../kafka-connector-overview.md).

> **Note:**
>
> This field **does not** represent when a record became visible in a Snowflake table, because it doesn’t take into account your configured [Snowpipe Streaming latency](snowpipe-streaming-classic-recommendation.md).

## Billing and usage

For Snowpipe Streaming billing information, see [Costs for Snowpipe Streaming Classic](snowpipe-streaming-classic-billing.md).

## Limitations

### Snowpipe streaming limitations

See [Snowpipe Streaming limitations](data-load-snowpipe-streaming-overview.md).

### Failover limitations

When a secondary failover group is promoted to primary, the Kafka connector with Snowpipe Streaming requires manual interaction. Exactly-once semantics are still preserved.

If `enable.streaming.client.optimization` property is set to `false`, the Kafka connector should be restarted. After you restart the connector, it will target a new primary deployment.

If `enable.streaming.client.optimization` property is set to `true`, the host JVM that the connector is running on should be shut down and restarted. After you restart the host JVM, a newly started Kafka connector will target a new primary deployment.

---
title: Snowflake Connector for Spark
source: https://docs.snowflake.com/en/user-guide/spark-connector.md
section: User Guide
---

# Snowflake Connector for Spark

The Snowflake Connector for Spark (“Spark connector”) brings Snowflake into the Apache Spark ecosystem, enabling Spark to read
data from, and write data to, Snowflake.
From Spark’s perspective, Snowflake looks similar to other Spark data sources (PostgreSQL, HDFS, S3, etc.).

> **Note:**
>
> You can also use [Snowpark Connect for Spark](../developer-guide/snowpark-connect/snowpark-connect-overview.md) as an alternative to the Snowflake Connector for Spark.

Snowflake supports multiple versions of the Spark connector:

> * Spark Connector 2.x: Spark versions 3.2, 3.3, and 3.4.
>
>   + There’s a separate version of the Snowflake connector for each version of Spark. Use the correct version of the connector for your version of Spark.
> * Spark Connector 3.x: Spark versions 3.2, 3.3, 3.4, and 3.5.
>
>   + Each Spark Connector 3 package supports most versions of Spark.

The connector runs as a Spark plugin and is provided as a Spark package (`spark-snowflake`).

## Enforce data protection policies on Apache Iceberg tables accessed from Spark

Snowflake supports enforcing row access and data masking policies on Apache Iceberg tables that you query from Apache Spark™ through
Snowflake Horizon Catalog. To enable this enforcement, you must install 3.1.6 or a later version of the Spark connector. The Spark connector
connects Spark to Snowflake to evaluate policies that are configured on the Iceberg tables.
For more information, see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

**Next Topics:**

* [Overview of the Spark Connector](spark-connector-overview.md)
* [Installing and Configuring the Spark Connector](spark-connector-install.md)
* [Configuring Snowflake for Spark in Databricks](spark-connector-databricks.md)
* [Configuring Snowflake for Spark in Qubole](spark-connector-qubole.md)
* [Using the Spark Connector](spark-connector-use.md)

---
title: Snowflake DCM Projects
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-overview.md
section: User Guide
---

# Snowflake DCM Projects

Snowflake DCM Projects (Database Change Management Projects) enable a declarative approach to managing Snowflake objects as code. You define the desired target
state of your databases, schemas, tables, and other objects in definition files, and Snowflake determines and applies the necessary changes
to reach that state. It enables version-controlled, repeatable deployments across environments, such as dev, staging, and production, using a
plan-then-deploy workflow common among infrastructure-as-code tools.

If your definitions contain repetitive patterns, you can parameterize your code by using Jinja templating, including dictionaries, loops,
conditions, and macros.

The high-level workflow for managing a DCM project is as follows:

1. Create DCM project files (`manifest.yml` and SQL definition files) in a Snowflake Workspace, remote Git repository, or local
   directory.
2. Create a new DCM project for each target environment.
3. Define Snowflake objects in the DCM project files.

   Convert your existing SQL deployment scripts by using the DEFINE keyword (*for supported object types*).
4. (Optional) Add shared or alternative templating variables and macros.
5. Execute a DCM PLAN command to mimic a deployment and preview the changes.
6. Deploy the project version to apply changes in Snowflake.
7. Monitor project executions.
8. Iterate on your DCM project. Update the project files, review the plan output, and deploy new versions as needed.

This lifecycle helps you build, test, deploy, and monitor database changes in a controlled, versioned, and auditable way.

The following diagram illustrates the DCM Projects lifecycle outlined above.

## Key terms

The following are the key terms you should know when working with DCM Projects.

Declarative definitions
:   In DCM Projects, you define the desired state of your Snowflake environment, such as which tables, schemas, or roles should exist, independent of
    the current state of the objects. You do not specify each step to create or modify them. You describe *what* you want, and Snowflake
    figures out *how* to make it happen.

    Specifically, DCM Projects leverage DEFINE statements with templating features. This makes project files reusable and customizable for different
    environments. The order and location of DEFINE statements within a project don’t affect the results. Snowflake collects and sorts all
    statements before applying changes, so you don’t need to manually handle sequencing or dependencies.

DCM project files
:   A DCM project is based on a set of SQL and YAML source files, usually managed in a Git repository or your local workspace. You define
    Snowflake objects, their attributes, relationships, and constraints for a DCM project in your project definition files (SQL files). You
    update your project files in a development workspace. Changes only take effect in Snowflake after you deploy them through a DCM project
    object.

DCM project object
:   A DCM project is a schema-level object in Snowflake that you use to deploy and manage the objects defined in DCM project files. You need
    a DCM project object for each target environment.

    The DCM project object is used to execute DCM commands and stores the immutable artifacts and definition files of all executed deployments.

    Though a DCM project is a schema-level object, you can use it to create and manage objects in other databases. You can also execute a
    DCM project to perform a dry run of changes to your workflow so that you can preview changes before deploying them.

## Requirements

* Use Snowsight, Snowflake CLI, SQL, or Cortex CLI to manage DCM Projects.
* You need a database and a schema where you can create your DCM project objects.
* Store your DCM project definitions locally or in Snowflake Workspace.
* Use Git for collaboration, versioning, and for synchronizing changes.
* If you want to execute local definitions using Snowflake CLI, you also need privileges to create a temporary stage in the schema of your target
  DCM project object.

## Considerations and limitations

* Project size

  + Currently, DCM Projects supports up to 1,000 source files and 10,000 rendered object definitions or grants.

    Beyond 1,000 files or 10,000 definitions, you can experience performance degradation and, in some cases, execution failure.

    Consolidating definitions into fewer files generally shows faster execution times for PLAN and DEPLOY commands.

    This limit will be raised during the public preview period as performance and scalability continue to improve.
* Changeset

  Both PLAN and DEPLOY commands list all DDL changes inside the `plan_result.json` file. The changeset lists the operations
  performed or planned (CREATE, ALTER, DROP) and the individual attributes affected, such as comment, schedule, and timeout.

  > **Important:**
  >
  > During the preview phase of DCM Projects, it’s not guaranteed that the changeset captures every granular change across all properties of each object.
* Templating

  + Because definition files are Jinja2 templates, all of the limitations for Jinja2 templates apply.
  + DCM templating variables are not intended for sensitive information like credentials. The rendered SQL definitions don’t redact any
    values inserted by environment variables.

## Key use cases for DCM Projects

This section describes key use cases for DCM Projects and how they help address challenges that data businesses face at scale.
These use cases fall into two general categories based on team’s responsibility:

* Platform teams that manage infrastructure and governance
* Feature teams that manage individual data products and pipelines

### DCM Projects for managing infrastructure

DCM Projects help address the following challenges that platform teams often encounter:

When platform teams want to deploy and maintain standardized infrastructure for multiple business units, they can use DCM Projects to define a
standard set of objects in code as SQL files. And with Jinja, this template can be parameterized, for example, by team name, and deployed
multiple times.

#### Example: Create a dedicated DCM project for each business unit

One approach is to create a dedicated DCM project for each business unit, with all projects referencing the same parameterized definition
files, as shown in the following `definitions.sql` example:

```sqlexample
DEFINE DATABASE {{team_name}}_DB;

DEFINE ROLE {{team_name}}_ADMIN;

DEFINE WAREHOUSE {{team_name}}_WH WITH
  warehouse_size = '{{wh_size}}'
  auto_suspend = 300;

GRANT OWNERSHIP ON DATABASE {{team_name}}_DB TO ROLE {{team_name}}_ADMIN;

GRANT OWNERSHIP ON WAREHOUSE {{team_name}}_WH TO ROLE {{team_name}}_ADMIN;

GRANT ROLE {{team_name}}_ADMIN TO ROLE SYSADMIN;
```

Execute the DCM project with the following command:

```sqlexample
EXECUTE DCM PROJECT FINANCE_INFRA PLAN
  USING (team_name => 'Finance', wh_size => 'LARGE')
FROM
  ...
```

#### Example: Create a single DCM project for multiple business units

In this approach, you manage infrastructure for multiple business units in one DCM project by using loops in your Jinja template, as shown in the
following `definitions.sql` example:

```sqlexample
{% for team_name in teams %}

  DEFINE DATABASE {{team_name}}_DB;
  DEFINE ROLE {{team_name}}_ADMIN;
  DEFINE WAREHOUSE {{team_name}}_WH
    WITH
      warehouse_size = '{{wh_size}}'
      auto_suspend = 300;

  GRANT OWNERSHIP ON DATABASE {{team_name}}_DB TO ROLE {{team_name}}_ADMIN;
  GRANT OWNERSHIP ON WAREHOUSE {{team_name}}_WH TO ROLE {{team_name}}_ADMIN;
  GRANT ROLE {{team_name}}_ADMIN TO ROLE SYSADMIN;

{% endfor %}
```

Execute the DCM project with the following command:

```sqlexample
EXECUTE DCM PROJECT FINANCE_INFRA PLAN
  USING (teams => ['Finance', 'HR', 'Engineering'], wh_size => 'MEDIUM')
FROM
  ...
```

This makes it easy for platform teams and admins to make changes such as:

* Add a new team to the list to deploy the existing infrastructure template for that team.
* Remove a team from the list to drop the infrastructure of that team.
* Add a new READ_ONLY role for all teams.
* Change specific configurations such as grants or warehouse size across all teams or for a specific team.
* Run PLAN to compare the current state against expected standards and re-deploy to reinstate standards.

### DCM Projects for data pipelines

DCM Projects help address the following challenges that feature teams often encounter:

Business units that want to easily author and manage their data pipelines can use DCM Projects to define, test, deploy, and iterate over
their business logic.

You can:

* Manage Snowflake object types like tables, dynamic tables, views, warehouses, roles, grants, data metric functions, and expectations all in one project.
* Test and deploy incremental changes to pipelines. You can change configurations, implement transformation logic, and add columns and views.
* Preview data samples to validate transformation logic before deploying objects.
* Deploy the same pipeline definition to multiple environments.
* Test data quality expectations on pre-prod environments before deploying changes to production.

DCM Projects provide additional functionality for authoring and managing data pipelines. See [DCM Projects for data pipelines](dcm-projects-pipelines.md)
for details.

---
title: Snowflake Ecosystem
source: https://docs.snowflake.com/en/user-guide/ecosystem.md
section: User Guide
---

# Snowflake Ecosystem

Snowflake works with a wide array of industry-leading tools and technologies, enabling you to access Snowflake through an extensive network
of connectors, drivers, programming languages, and utilities, including:

* Certified partners who have developed cloud-based and on-premises solutions for connecting to Snowflake.
* Other 3rd-party tools and technologies that are known to work with Snowflake.
* Snowflake-provided clients, including the [Snowflake CLI](../developer-guide/snowflake-cli/index.md) and [SnowSQL](snowsql.md) command line tools, connectors for
  [Python](../developer-guide/python-connector/python-connector.md) and [Spark](spark-connector.md), and drivers for
  [Node.js](../developer-guide/node-js/nodejs-driver.md), [JDBC](../developer-guide/jdbc/jdbc.md), [ODBC](../developer-guide/odbc/odbc.md), and more.

The next topics describe the solutions in more detail. The solutions are listed both alphabetically and grouped according to the categories
shown in the diagram above.

> **Tip:**
>
> If you don’t find a solution here that works for you, we have an extensive network of partners who can help you integrate with Snowflake.
> For more details, see [Solutions Partners](https://www.snowflake.com/partners/solutions-partners/) (Snowflake website).

**Next Topics:**

* [All Partners & Technologies (Alphabetical)](ecosystem-all.md)
* [Data Integration](ecosystem-etl.md)
* [Business Intelligence (BI)](ecosystem-bi.md)
* [Machine Learning & Data Science](ecosystem-analytics.md)
* [Security, Governance & Observability](ecosystem-security.md)
* [SQL Development & Management](ecosystem-editors.md)
* [Native Programmatic Interfaces](ecosystem-lang.md)

---
title: Snowflake editions
source: https://docs.snowflake.com/en/user-guide/intro-editions.md
section: User Guide
---

# Snowflake editions

Snowflake offers multiple editions to choose from, ensuring that your usage fits your organization’s specific requirements. Each successive
edition builds on the previous edition through the addition of edition-specific features and/or higher levels of service. As your
organization’s needs change and grow, changing editions is easy.

For information about working with editions, including viewing and changing an account’s edition, see [Working with account editions](organizations-manage-accounts-editions.md).

> **Note:**
>
> The Snowflake Edition that your organization chooses determines the unit costs for the credits and the data storage you use. Other factors
> that impact unit costs are the [region](intro-regions.md) where your Snowflake account is located and whether it is
> an *On Demand* or *Capacity* account:
>
> * On Demand: Usage-based pricing with no long-term licensing requirements.
> * Capacity: Discounted pricing based on an upfront Capacity commitment.
>
> For pricing details, see the [pricing page](http://www.snowflake.com/pricing) (on the Snowflake website).

## Overview of editions

### Standard Edition

Standard Edition is our introductory level offering, providing full, unlimited access to all of Snowflake’s standard features. It provides
a strong balance between features, level of support, and cost.

### Enterprise Edition

Enterprise Edition provides all the features and services of Standard Edition, with additional features
that are designed specifically for the needs of large-scale enterprises and organizations.

### Business Critical Edition

Business Critical Edition, formerly known as Enterprise for Sensitive Data (ESD), offers even higher levels of data protection to support
the needs of organizations with extremely sensitive data, particularly PHI data that must comply with HIPAA and
[HITRUST CSF](intro-cloud-platforms.md) regulations.

It includes all the features and services of Enterprise Edition, with the addition of enhanced security and data
protection. In addition, account failover/failback adds support for business continuity and disaster recovery.

> **Note:**
>
> As required by HIPAA and [HITRUST CSF](intro-cloud-platforms.md) regulations, before any PHI data can be stored in Snowflake, a
> signed business associate agreement (BAA) must be in place between your agency/organization and Snowflake Inc.

### Virtual Private Snowflake (VPS)

Virtual Private Snowflake offers our highest level of security for organizations that have the strictest requirements, such as financial
institutions and any other large enterprises that collect, analyze, and share highly sensitive data.

It includes all the features and services of Business Critical Edition, but in a completely separate Snowflake
environment, isolated from all other Snowflake accounts (i.e. VPS accounts do not share any type of hardware resources with accounts outside the VPS).

> **Note:**
>
> To access your account, you can use an [account identifier](admin-account-identifier.md) that specifies your
> organization name and account name.
>
> If you instead choose to use an [account locator](admin-account-identifier.md) as the account identifier, note that
> the account locator for VPS accounts uses a different format than the accounts for other Snowflake Editions. For details, see
> [Finding the account locator format for a VPS account](admin-account-identifier.md).

## Find your current edition

You can find the Snowflake edition for your account in the following ways:

* To find your Snowflake edition by using Snowsight, follow the instructions in
  [Locate your Snowflake account information in Snowsight](ui-snowsight-gs.md).
* To find your Snowflake edition by using SQL, query the
  [ACCOUNTS view](../sql-reference/organization-usage/accounts.md) in the ORGANIZATION_USAGE schema,
  and select the `edition` column:

  ```sqlexample
  SELECT edition
    FROM SNOWFLAKE.ORGANIZATION_USAGE.ACCOUNTS
    WHERE account_name = CURRENT_ACCOUNT();
  ```

  To query this view, you must have [access to the ORGANIZATION_USAGE schema](../sql-reference/organization-usage.md).

## Feature and edition matrix

The following tables provide a list of the major features and services included with each edition.

> **Note:**
>
> This is only a partial list of the features. For a more complete and detailed list, see [Overview of key features](intro-supported-features.md).

### Release management

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| 24-hour [early access to weekly new releases](intro-releases.md), which can be used for additional testing or validation before each release is deployed to your production accounts. |  | ✔ | ✔ | ✔ |

### Security, governance, and data protection

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| SOC 2 Type II certification. | ✔ | ✔ | ✔ | ✔ |
| [Federated authentication and SSO](admin-security-fed-auth-overview.md) for centralizing and streamlining user authentication. | ✔ | ✔ | ✔ | ✔ |
| [OAuth](oauth-intro.md) for authorizing account access without sharing or storing user login credentials. | ✔ | ✔ | ✔ | ✔ |
| [Network policies](network-policies.md) for limiting/controlling site access by user IP address. | ✔ | ✔ | ✔ | ✔ |
| Automatic [encryption of all data](../guides-overview-secure.md). | ✔ | ✔ | ✔ | ✔ |
| Support for [multi-factor authentication](security-mfa.md). | ✔ | ✔ | ✔ | ✔ |
| Object-level [access control](security-access-control-overview.md). | ✔ | ✔ | ✔ | ✔ |
| Standard [Time Travel](data-time-travel.md) (up to 1 day) for accessing/restoring modified and deleted data. | ✔ | ✔ | ✔ | ✔ |
| [Object tags](object-tagging/introduction.md) that can be applied to Snowflake objects to help track sensitive data and resource usage. Some tagging features require Enterprise Edition or higher. | ✔ | ✔ | ✔ | ✔ |
| Disaster recovery of modified/deleted data (for 7 days beyond Time Travel) through [Fail-safe](data-failsafe.md). | ✔ | ✔ | ✔ | ✔ |
| [Generating synthetic data](synthetic-data.md) |  | ✔ | ✔ | ✔ |
| [Extended Time Travel](data-time-travel.md) (up to 90 days). |  | ✔ | ✔ | ✔ |
| [Periodic rekeying of encrypted data](security-encryption-manage.md) for increased protection. |  | ✔ | ✔ | ✔ |
| [Column-level Security](security-column-intro.md) to apply masking policies to columns in tables or views. |  | ✔ | ✔ | ✔ |
| [Row-level Security](security-row-intro.md) to apply row access policies to determine which rows are visible in a query result. |  | ✔ | ✔ | ✔ |
| [Aggregation policies](aggregation-policies.md) that enforce privacy by requiring queries to aggregate data to return results. |  | ✔ | ✔ | ✔ |
| [Projection policies](projection-policies.md) that restrict who can use a SELECT statement to project a column. |  | ✔ | ✔ | ✔ |
| [Differential privacy](diff-privacy/differential-privacy-overview.md) to protect data against targeted privacy attacks. |  | ✔ | ✔ | ✔ |
| Support for classifying potentially sensitive data using [classification](classify-intro.md). |  | ✔ | ✔ | ✔ |
| Audit the user access history through the Account Usage [ACCESS_HISTORY](../sql-reference/account-usage/access_history.md) view. |  | ✔ | ✔ | ✔ |
| Event tables associated with a database by [associating an event table with an object](../developer-guide/logging-tracing/event-table-setting-up.md). |  | ✔ | ✔ | ✔ |
| Customer-managed encryption keys through [Tri-Secret Secure](security-encryption-manage.md). |  |  | ✔ | ✔ |
| Support for private connectivity [to the Snowflake service](private-connectivity-inbound.md). |  |  | ✔ | ✔ |
| Support for private connectivity [to Snowflake internal stages](private-connectivity-inbound.md). |  |  | ✔ | ✔ |
| Support for [Pinning private connectivity endpoints for inbound traffic](pin-private-endpoints.md). |  |  | ✔ | ✔ |
| Support for [private connectivity for inbound network traffic in Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-inbound). |  |  | ✔ | ✔ |
| Support for private connectivity for outbound network traffic [to external stages](private-connectivity-outbound.md). |  |  | ✔ | ✔ |
| Support for private connectivity for outbound network traffic [to external volumes for Apache Iceberg tables](private-connectivity-outbound.md). |  |  | ✔ | ✔ |
| Support for private connectivity to a key management service through [Tri-Secret Secure](security-encryption-tss-self-serve-private.md). |  | . | ✔ | ✔ |
| Support for private connectivity to the Snowflake service using AWS PrivateLink, Azure Private Link, or Google Cloud Private Service Connect. |  |  | ✔ | ✔ |
| Support for private connectivity to Snowflake internal stages using [AWS PrivateLink](private-internal-stages-aws.md), [Azure Private Link](private-internal-stages-azure.md), and [Google Cloud](private-internal-stages-gcp.md) |  |  | ✔ | ✔ |
| Support for [Pinning private connectivity endpoints for inbound traffic](pin-private-endpoints.md). |  |  | ✔ | ✔ |
| Support for [Cross-Region Connectivity for AWS PrivateLink](admin-security-privatelink.md). |  | . | ✔ | ✔ |
| Support for [private connectivity for inbound network traffic in Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-inbound). |  |  | ✔ | ✔ |
| Support for [private connectivity for outbound network traffic in Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-outbound). |  |  | ✔ | ✔ |
| Support for PHI data (in accordance with HIPAA and [HITRUST CSF](intro-cloud-platforms.md) regulations). |  |  | ✔ | ✔ |
| Support for PCI DSS. |  |  | ✔ | ✔ |
| Support for public sector workloads that meet U.S. Federal and state government requirements, such as [FedRAMP and ITAR](intro-regions.md). |  |  | ✔ | ✔ |
| Support for IRAP - Protected (P) data (in specified [Asia Pacific regions](intro-regions.md)). |  |  | ✔ | ✔ |
| Dedicated metadata store and pool of compute resources (used in virtual warehouses). |  |  |  | ✔ |
| [Data Quality and data metric functions](data-quality-intro.md) to monitor the state and integrity of data. |  | ✔ | ✔ | ✔ |

### Compute Resource Management

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Virtual warehouses](warehouses.md), separate compute clusters for isolating query and data loading workloads. | ✔ | ✔ | ✔ | ✔ |
| [Resource monitors](resource-monitors.md) for monitoring virtual warehouse credit usage. | ✔ | ✔ | ✔ | ✔ |
| [Multi-cluster virtual warehouses](warehouses-multicluster.md) for scaling compute resources to meet concurrency needs. |  | ✔ | ✔ | ✔ |

### SQL support

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Standard SQL](../sql-reference-commands.md), including most DDL and DML defined in SQL:1999. | ✔ | ✔ | ✔ | ✔ |
| [Advanced DML](../sql-reference/sql-dml.md) such as multi-table INSERT, MERGE, and multi-merge. | ✔ | ✔ | ✔ | ✔ |
| Broad support for standard [data types](../sql-reference-data-types.md). | ✔ | ✔ | ✔ | ✔ |
| Native support for [semi-structured data](semistructured-intro.md) (JSON, Avro, ORC, Parquet, and XML). | ✔ | ✔ | ✔ | ✔ |
| Native support for [geospatial data](../sql-reference/data-types-geospatial.md). | ✔ | ✔ | ✔ | ✔ |
| Native support for [unstructured data](unstructured-intro.md). | ✔ | ✔ | ✔ | ✔ |
| [Collation rules](../sql-reference/collation.md) for string/text data in table columns. | ✔ | ✔ | ✔ | ✔ |
| [Integrity constraints](../sql-reference/constraints.md) (not enforced) on table columns for informational and modeling purposes. | ✔ | ✔ | ✔ | ✔ |
| Multi-statement [transactions](../sql-reference/transactions.md). | ✔ | ✔ | ✔ | ✔ |
| [User-defined functions (UDFs)](../developer-guide/udf/udf-overview.md) with support for Java, JavaScript, Python, and SQL. | ✔ | ✔ | ✔ | ✔ |
| [External access](../developer-guide/external-network-access/external-network-access-overview.md) for enabling user-defined functions (UDFs) or stored procedures to securely connect to external network locations, such as a third-party API or another database. | ✔ | ✔ | ✔ | ✔ |
| [External functions](../sql-reference/external-functions.md) for extending Snowflake to other development platforms. | ✔ | ✔ | ✔ | ✔ |
| [Amazon API Gateway private endpoints for external functions](../sql-reference/external-functions-creating-aws-planning.md). |  |  | ✔ | ✔ |
| [Stored procedures](../developer-guide/stored-procedure/stored-procedures-overview.md) with support for Java, JavaScript, Python, Scala, and SQL (Snowflake Scripting). | ✔ | ✔ | ✔ | ✔ |
| [Dynamic tables](dynamic-tables-about.md) for automatically materializing the results of a specified SQL query and keeping them up to date to meet your data freshness target. | ✔ | ✔ | ✔ | ✔ |
| [External tables](tables-external-intro.md) for referencing data in a cloud storage data lake. | ✔ | ✔ | ✔ | ✔ |
| [Hybrid tables](tables-hybrid.md) for data in transactional and analytical workloads. | ✔ | ✔ | ✔ | ✔ |
| Support for [clustering data](tables-clustering-keys.md) in very large tables to improve query performance, with automatic maintenance of clustering. | ✔ | ✔ | ✔ | ✔ |
| [Query acceleration](query-acceleration-service.md) for parallel processing portions of eligible queries. |  | ✔ | ✔ | ✔ |
| [Search optimization](search-optimization-service.md) for point lookup queries, with automatic maintenance. |  | ✔ | ✔ | ✔ |
| [Snowflake Optima](snowflake-optima.md) for automatic workload performance improvements. | ✔ | ✔ | ✔ | ✔ |
| [Materialized views](views-materialized.md), with automatic maintenance of results. |  | ✔ | ✔ | ✔ |
| [Iceberg tables](tables-iceberg.md) for referencing data in a cloud storage data lake. | ✔ | ✔ | ✔ | ✔ |
| [Schema detection](data-load-overview.md) for automatically detecting the schema in a set of staged semi-structured data files and retrieving the column definitions. | ✔ | ✔ | ✔ | ✔ |
| [Schema evolution](data-load-schema-evolution.md) for automatically evolving tables to support the structure of new data received from the data sources. | ✔ | ✔ | ✔ | ✔ |

### Interfaces and tools

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Snowsight](ui-snowsight.md), the next-generation SQL worksheet for advanced query development, data analysis, and visualization. | ✔ | ✔ | ✔ | ✔ |
| [Snowflake CLI](../developer-guide/snowflake-cli/index.md), Open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations, including querying, executing DDL/DML commands, and bulk loading/unloading of data. | ✔ | ✔ | ✔ | ✔ |
| [SnowSQL](snowsql.md), a command line client for building/testing queries, loading/unloading bulk data, and automating DDL operations. | ✔ | ✔ | ✔ | ✔ |
| [SnowCD](snowcd.md), a command line diagnostic tool for identifying and fixing client connectivity issues. | ✔ | ✔ | ✔ | ✔ |
| Programmatic interfaces for [Python](../developer-guide/python-connector/python-connector.md), [Spark](spark-connector.md), [Node.js](../developer-guide/node-js/nodejs-driver.md), [.NET.js](../developer-guide/dotnet/dotnet-driver.md), [PHP](../developer-guide/php-pdo/php-pdo-driver.md), and [Go](../developer-guide/golang/go-driver.md). | ✔ | ✔ | ✔ | ✔ |
| Native support for [JDBC](../developer-guide/jdbc/jdbc.md) and [ODBC](../developer-guide/odbc/odbc.md). | ✔ | ✔ | ✔ | ✔ |
| [Snowflake SQL API](../developer-guide/sql-api/index.md), a REST API for accessing and updating data in a Snowflake database. | ✔ | ✔ | ✔ | ✔ |
| Extensive [ecosystem](ecosystem.md) for connecting to ETL, BI, and other third-party vendors and technologies. | ✔ | ✔ | ✔ | ✔ |
| [Snowflake Partner Connect](ecosystem-partner-connect.md) for initiating free software/service trials with a growing network of partners in the Snowflake ecosystem. | ✔ | ✔ | ✔ | ✔ |
| [Snowpark](../developer-guide/snowpark/index.md), the set of libraries and runtimes that securely deploy and process non-SQL code, including Python, Java, and Scala. | ✔ | ✔ | ✔ | ✔ |
| [Streamlit in Snowflake](../developer-guide/streamlit/about-streamlit.md) for building, deploying, and sharing Streamlit apps on Snowflake data cloud. | ✔ | ✔ | ✔ | ✔ |

### Data import and export

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Bulk loading](../guides-overview-loading-data.md) from delimited flat files (CSV, TSV, etc.) and semi-structured data files (JSON, Avro, ORC, Parquet, and XML). | ✔ | ✔ | ✔ | ✔ |
| [Bulk unloading](data-unload-overview.md) to delimited flat files and JSON files. | ✔ | ✔ | ✔ | ✔ |
| [Snowpipe](data-load-snowpipe-intro.md) for continuous micro-batch loading. | ✔ | ✔ | ✔ | ✔ |
| [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) for low-latency loading of streaming data. | ✔ | ✔ | ✔ | ✔ |
| [Snowflake Connector for Kafka](kafka-connector.md) for loading data from Apache Kafka topics. | ✔ | ✔ | ✔ | ✔ |

### Data pipelines

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Streams](streams-intro.md) for tracking table changes. | ✔ | ✔ | ✔ | ✔ |
| [Tasks](tasks-intro.md) for scheduling the execution of SQL statements, often in conjunction with table streams. | ✔ | ✔ | ✔ | ✔ |

### Data replication and failover

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Database and share replication](account-replication-intro.md) between Snowflake accounts (within an organization) to synchronize databases, shared objects, and stored data. | ✔ | ✔ | ✔ | ✔ |
| [Failover and failback](account-replication-failover-failback.md) between Snowflake accounts for business continuity and disaster recovery. |  |  | ✔ | ✔ |
| [Redirecting client connections](client-redirect.md) between Snowflake accounts for business continuity and disaster recovery. |  |  | ✔ | ✔ |

### Data sharing

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| Snowflake Marketplace | ✔ | ✔ | ✔ |  |
| Universal Search | ✔ | ✔ | ✔ | ✔ |
| Build data products, monetize listings, and analyze your successes in Snowflake Marketplace. | ✔ | ✔ | ✔ |  |
| Public Listings | ✔ | ✔ | ✔ |  |
| Private Listings | ✔ | ✔ | ✔ | ✔ |
| With VPS, collaborate privately while strictly upholding requirements for security and isolation. |  |  |  | ✔ |
| Make data accessible without moving it using cross-cloud auto-fulfillment powered by Snowgrid™. | ✔ | ✔ | ✔ | ✔ |
| Collaborate with [Snowflake Data Clean Rooms](cleanrooms/overview.md). | ✔ | ✔ | ✔ |  |
| Create and manage your own Snowflake Data Clean Rooms. |  | ✔ | ✔ | ✔ |
| Collaborate using one of Snowflake’s many collaborative technologies. | ✔ | ✔ | ✔ | ✔ |
| Replicate shared data to keep it synchronized within your organization. | ✔ | ✔ | ✔ | ✔ |

### Artificial intelligence and machine learning

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| Use [Snowflake Cortex AI Functions](snowflake-cortex/aisql.md) to respond to plain-language prompts, answer questions, summarize or translate text, find similar documents, and more. | ✔ | ✔ | ✔ | ✔ |
| Use [Snowflake Copilot](snowflake-copilot.md) to engage in conversations about your structured data. | ✔ | ✔ | ✔ | ✔ |
| Use [Cortex Analyst](snowflake-cortex/cortex-analyst.md) to help write applications that can engage in conversations about your structured data. | ✔ | ✔ | ✔ | ✔ |
| Use [Cortex Fine-tuning](snowflake-cortex/cortex-finetuning.md) to create large language models specialized for your needs without the usual training costs. | ✔ | ✔ | ✔ | ✔ |
| Use [Cortex Search](snowflake-cortex/cortex-search/cortex-search-overview.md) to enable high-quality semantic search over your Snowflake data. | ✔ | ✔ | ✔ | ✔ |
| Use [ML Functions](../guides-overview-ml-functions.md) to analyze your data using our machine-learning models trained on your data. | ✔ | ✔ | ✔ | ✔ |
| Use the [Snowflake Model Registry](../developer-guide/snowflake-ml/model-registry/overview.md) as a central repository for machine learning models within your organization. | ✔ | ✔ | ✔ | ✔ |
| Use the [Snowflake Feature Store](../developer-guide/snowflake-ml/feature-store/overview.md) to create a repository of data transformations that can be used to train machine learning models. | ✔ | ✔ | ✔ | ✔ |

### Customer support

| Feature/Service | Standard | Enterprise | Business Critical | VPS |
| --- | --- | --- | --- | --- |
| [Snowflake Community](https://community.snowflake.com), Snowflake’s online Knowledge Base and support portal (for logging and tracking Snowflake Support tickets). | ✔ | ✔ | ✔ | ✔ |
| [Premier support](https://www.snowflake.com/wp-content/uploads/2019/02/Snowflake-Support-Policy-02202019.pdf), which includes 24/7 coverage and 1-hour response window for Severity 1 issues. | ✔ [1] | ✔ | ✔ | ✔ |

[1] Applies only to Standard accounts provisioned after May 1, 2020; Standard accounts provisioned before May 1 will continue to receive Standard support (as defined in ‘Support Policy and Service Level Agreement’) until the account is transitioned to Premier support.

---
title: Snowflake Extension for Visual Studio Code
source: https://docs.snowflake.com/en/user-guide/vscode-ext.md
section: User Guide
---

# Snowflake Extension for Visual Studio Code

The Snowflake [Visual Studio Code](https://code.visualstudio.com/) (VS Code) extension enables you to write and execute Snowflake SQL statements directly in VS Code. The extension also integrates with [Snowpark Python](../developer-guide/snowpark/python/index.md) to provide debugging, syntax highlighting, and autocomplete features for SQL in Python code.

You can either install the VS Code extension from the Visual Studio marketplace or download and install the `.vsix` file.

## Install the VS Code extension from Visual Studio Marketplace

1. In VS Code, select Code > Settings > Extensions.
2. In the Search Extensions in Marketplace field, enter **Snowflake**, and then select the **Snowflake** extension.

   To confirm you’ve selected the correct extension, look for the Snowflake badge as shown in the following image:
3. Select Install.

## Install the VS Code extension from a .vsix file

1. Download the extension:

   * Go to <https://marketplace.visualstudio.com/items?itemName=snowflake.snowflake-vsc>.
   * Select the Version History tab.
   * Select Download for the VS Code extension version you want to install. Note where the file is downloaded.
2. In VS Code, select Code > Settings > Extensions.
3. Select More (…) > Install from VSIX.
4. Browse to the location of the `snowflake-x.y.z.vsix` file on your computer, select the file, and then select Install.

   After the installation completes, the Snowflake Extension for Visual Studio Code appears in the INSTALLED section of the Extensions menu in VS Code.

## Sign in to Snowflake with the VS Code extension

Before you can execute SQL statements, use Snowpark Python, or use Snowflake Native App Framework features, you must sign in to a Snowflake account.

Methods for signing in to Snowflake with the VS Code extension:

* Use your Snowflake account identifier, username, and password.
* Use federated authentication such as Security Assertion Markup Language (SAML) or Single sign-on (SSO).
* Use key-pair authentication.
* Use OAuth authentication in your `connections.toml` configuration file. See Edit the Snowflake connections.toml file.

The first time you use the VS Code extension to sign in to Snowflake, you need to enter either the account identifier for your Snowflake account or the URL that you use to connect to Snowflake. To determine your account identifier, see [Account identifiers](admin-account-identifier.md).

1. Select the Snowflake icon in the VS Code Activity Bar.
2. In the Account Identifier/URL field, enter the account identifier for your Snowflake account or the URL that you use to connect to Snowflake, and then select Continue.

   The Account Identifier/URL field isn’t available if you’ve previously provided your Snowflake account credentials.
3. Select one of the following options in the Auth Method list:

   * Select Single sign-on to use your SSO credentials to sign in to Snowflake.
   * Select Username/password to use your Snowflake username and password to sign in to Snowflake.
   * Select Key Pair to use your Snowflake username and password to sign in to Snowflake. For more information about key-pair authentication, see [Key-pair authentication and key-pair rotation](key-pair-auth.md).
4. Enter your credentials and then select Sign in.

   When you select SSO, a separate authentication page opens after you enter your username and select Sign in with single sign-on. Enter your SSO credentials and then return to VS Code to complete the Snowflake sign in.

   After you successfully sign in, the sidebar displays your account information, your default role, the OBJECT EXPLORER with a **Databases** list, and your QUERY HISTORY.

## Edit the Snowflake `connections.toml` file

You can add and modify connection definitions in the Snowflake `connections.toml` configuration file. A connection definition is a collection of connection-related parameters.

To connect to Snowflake with a TOML file, see [Connecting using the connections.toml file](../developer-guide/python-connector/python-connector-connect.md). To learn more about managing connections with a TOML file, see [Managing Snowflake connections](../developer-guide/snowflake-cli/connecting/configure-connections.md).

1. In VS Code, open the Snowflake VS Code extension and sign in to Snowflake.
2. In the ACCOUNT pane, select Snowflake:Edit Connections File .
3. Edit the TOML file.
4. Select Save, and then close the TOML file.

## The VS Code extension interface

The following table provides descriptions of the VS Code extension interface functional areas.

| Item | Description |
| --- | --- |
| 1 | The sidebar pane contains the *Account*, *Native App*, *Object Explorer*, and *Query History* panes. Use this pane to specify account details, examine database objects, and examine query results. |
| 2 | Use the Snowflake Native App pane to create and manage a Snowflake Native App. |
| 3 | The Query History pane shows recent queries. |
| 4 | The current session information, including the current role, database, schema, and active warehouse. |
| 5 | Snowflake SQL pane. Displays Snowflake SQL files. |
| 6 | The Query Results pane shows query results. Select a query to display its execution result. |

## Use the VS Code extension with SnowSQL configuration files

The Snowflake Extension for Visual Studio Code can use [Snow SQL configuration files](snowsql-config.md)
for loading connection configurations.

> **Note:**
>
> Only connection configuration values are used. Other SnowSQL configuration values are ignored.

1. In the VS Code search field, enter `>user settings` and then select Preferences: Open User Settings.
2. On the User tab, expand Extensions.
3. Scroll down and select Snowflake.
4. In the right pane, scroll down to Snowsql Config Path.
5. Enter a path to a valid SnowSQL configuration file.

   All connections defined in the configuration display in the **Account** pane.

## Work with SQL files

You can use the Snowflake Extension for Visual Studio Code to create and load SQL files.
SQL files are text files that contain one or more SQL statements.

### Open or create SQL files

1. In VS Code, select File > Open, browse to the location of a SQL file, and then open it.

   To create a new SQL file, select File > New File and
   create a file of type Snowflake SQL File.
2. Add one or more Snowflake SQL statements to the file.
3. Optional. Select Snowflake: Execute All Statements () to execute a command.

### Execute commands or queries

> **Important:**
>
> To display Snowflake query results, the VS Code extension automatically runs `DESC RESULT '<query_id>'` in the background after every query. This process makes `LAST_QUERY_ID()` inaccurate. For more information about the DESCRIBE RESULT command and its parameters, see [DESCRIBE RESULT](../sql-reference/sql/desc-result.md).

In VS Code, select one of the following options:

* To execute all SQL statements in a file, select Snowflake: Execute All Statements ().
* To execute a specific command, place your cursor on the statement you want to run and then select Execute.
* To execute multiple commands, select the statements you want to run and then select Execute. The commands execute in order from top to bottom.

To use keyboard shortcuts to execute statements, select the SQL statements you want to run, press  + [enter] on a Mac keyboard, or  + [enter] on a Windows keyboard.

Executed SQL statement results display in the SNOWFLAKE pane.

To cancel in-progress queries, select a query in QUERY HISTORY list and select Cancel query ().

### View query history

1. In VS Code, expand Query History.
2. Select a statement.
3. Review the results in the SNOWFLAKE pane.
4. Optional. Select one of the following:

   * Select Snowflake: Copy to Clipboard () to copy the query text to the clipboard.
   * Select Snowflake: Remove Query () to delete a query.

### Work with query results

You can sort, reorder, hide, freeze, or save query results to disk.

1. In QUERY HISTORY, select a query.
2. In the SNOWFLAKE pane, choose a column.
3. Select the expander arrow (↓) and then one of: Sort Ascending, Sort Descending,
   Hide column “column name”, or Freeze columns up to “column name”.

   If a column was previously hidden, choose any other column and select Unhide N columns.
4. Optional. Select one of the following:

   * Select Cloud () to save the results as a compressed gzip file.
   * Select Save () to save the results as a comma-separated (CSV) file.

## Work with Snowpark Python code

You can use the Snowflake Extension for Visual Studio Code to create, load, and execute SQL files.

### Debug Snowpark Python functions

1. Write a Snowflake stored procedure in a Python function where the first parameter is a Snowpark `Session` object.
2. An inline Snowflake: Debug option appears above the function name. Choose this option to run the stored procedure in
   the function, using your current active session through the extension. You can also set debug breakpoints.

### Detect SQL statements automatically

To set up automatic SQL syntax highlighting, enable the extension setting
Auto Detect Sql in Python. The extension automatically detects SQL statements by looking for a SQL keyword in all capital letters as
the first word in a Python string, as shown in the following image.

### Denote SQL statements manually

1. Optional. Disable the extension setting Auto Detect Sql in Python.
2. Use comments to denote the start and end of a SQL statement. You can use any combination of the following markers:

   * Start markers: `-–startsql`, `-–beginsql`, `-–start-sql`, `-–begin-sql`
   * End markers: `–-endsql`, `–-end-sql`

   The following image shows how the `--begin-sql` and `--end-sql` markers manually denote a SQL statement.

### Use SQL autocomplete in Python strings

1. In a Python file, create a Python string while connected to an active Snowflake session with the VS Code extension.
2. Write a SQL statement. The autocomplete suggestions appear.

   For example, when you start writing a statement such as `SELECT * FROM db1.public`, the extension automatically suggests table names.

   Similarly, when you start filling out columns inside a SELECT statement that references a table, the extension automatically suggests column
   names, as shown in the following image.

### Jinja template syntax highlighting

By default, the VS Code extension adds basic syntax highlighting and bracket autocomplete for writing
[Jinja templates](https://jinja.palletsprojects.com/en/3.1.x/) in Snowflake SQL, as shown in the following image.

## Work with the Snowflake Native App Framework

[Preview Feature](../release-notes/preview-features.md) — Open

Available to all accounts. To use the Snowpark Python features of the VS Code extension, you must enable the extension setting
Enable Public Preview Features. For more information, see VS Code extension preview settings.

You can use the VS Code extension to create and manage a Snowflake Native App. For more information about the Snowflake Native App Framework, see [About the Snowflake Native App Framework](../developer-guide/native-apps/native-apps-about.md).

> **Note:**
>
> To make sure you have the latest VS Code extension Snowflake Native App functionality, Snowflake recommends upgrading to the most recent version of the Snowflake CLI. See [Installing Snowflake CLI](../developer-guide/snowflake-cli/installation/installation.md).
>
> Snowflake CLI versions 2.2.X and 3.X.X are supported.

### View the VS Code extension Snowflake Native App command palette

The VS Code extension Snowflake Native App command palette provides access to the following Snowflake Native App commands:

* Create Native App
* Deploy a Native App
* Focus on a Native App View
* Open a Native App
* Run (deploy and re-install) a Native App
* Teardown a Native App

To access these commands, type `>Snowflake Native` in the search field at the top of the VS Code window.

### Create a Snowflake Native App

1. In VS Code, open the Snowflake VS Code extension and sign in to Snowflake.
2. Expand the NATIVE APP pane and then select Create new from template.
3. Select one of the following:

   * Enter the folder name where you want to create the Snowflake Native App and then press Enter.
   * Press Enter to accept the default directory as the location for the Snowflake Native App.
4. Select one of the following:

   * Enter the URL for the GitHub repository where your Snowflake Native App templates are stored, and then select Enter.
   * Enter the path to a local templates folder.
   * To accept the default GitHub Snowflake Native App template repository URL, select Enter.
5. Select one of the following templates:

   * Select basic to create a Snowflake Native App with minimal code examples and guidance.
   * Select streamlit-python to create a Snowflake Native App with Python extension code and Streamlit code examples.
   * Select streamlit-java to create a Snowflake Native App with Java extension code and Streamlit code examples.
   * Select spcs-basic to create a Snowflake Native App with SPCS extension code and Streamlit code examples.

### Deploy and open a Snowflake Native App

When you use the Run (deploy and re-install) or Deploy options, the application selected in the NATIVE APP pane is used. When multiple Snowflake Native App applications are available, a prompt appears, and you can select which `snowflake.yml` file to use for the deployment.

After you deploy your Snowflake Native App, you can open it in Snowflake to manage access, view, add, and validate app packages, view logs and events, and modify privileges.

1. Select one of the following:

   * In the VS Code extension NATIVE APP pane, select Run (deploy and re-install). This is the recommended option when you have made significant changes and an application object is required.
   * In the VS Code extension NATIVE APP pane, select Deploy. This is the recommended option when you are deploying application packages and stage files and an application object is not required.
2. Optional. Select the OUTPUT tab in the query results pane to view deployment progress.
3. In the NATIVE APP pane, select Open.

### View Snowflake Native App application object status

> **Note:**
>
> Snowflake Native App application object status is not available in Snowflake CLI version 3.0.0.

* In the VS Code extension NATIVE APP pane, expand your application.

  A blue font and a blue circle indicate that the application object has not been installed or deployed.

### View the Snowflake Environment Variables Manager

Use the Snowflake Environment Variables Manager to create and manage environment variables in environment variable profiles. You can use an environment variable profile to customize object behavior in Snowflake Native App project definition files. For example, you can create environment variable profiles that change object behavior in development, stage, and production environments. For more information about environment variables and Snowflake connections, see [Project definition files](../developer-guide/snowflake-cli/native-apps/project-definitions.md).

Use one of the following methods to view the Snowflake Environment Variables Manager:

* In the VS Code extension NATIVE APP pane, select Environment Variables.
* In the command palette, select `Open Environment Variables Manager`.

### Add an environment variable to an environment variable profile

An environment variable profile stores environment variables.

1. In the VS Code extension NATIVE APP pane, select Environment Variables.
2. In the Selected Profile list, select a profile.
3. In the Environment Variable column, enter an environment variable.
4. In the Value column, enter a value for the environment variable.
5. Optional: To add additional environment variables to the environment variable profile, repeat steps 3 and 4.
6. Optional: To add additional rows to the environment variable profile, select + (Add Row).

### Add an environment variable profile

An environment variable profile stores environment variables. To customize object behavior in Snowflake Native App project definition files, create a new profile.

1. In the VS Code extension NATIVE APP pane, select Environment Variables.
2. Select Add Profile.
3. Enter a name for the profile, and then press Enter.

### Add a row to an environment variable profile

> To add an environment variable, add a row to the environment variable profile.

1. In the VS Code extension NATIVE APP pane, select Environment Variables.
2. Select a profile in the Selected Profile list.
3. Select + (Add Row).

### Rename an environment variable profile

1. In the VS Code extension NATIVE APP pane, select Environment Variables.
2. Select a profile in the Selected Profile list.
3. Select Rename Profile.
4. Enter a name for the profile, and then press Enter.

### Delete an environment variable profile

1. In the VS Code extension NATIVE APP pane, select Environment Variables.
2. Select a profile in the Selected Profile list.
3. Select Delete Profile.

### Enable Snowflake Native App debug mode

Use debug mode to view application objects that are not visible to consumers, such as shared content objects or objects not granted to a specific database role. For more information about debug mode and turning it on programmatically, see [About debug mode](../developer-guide/native-apps/installing-testing-application.md).

* In the VS Code extension NATIVE APP pane, select App Debug Mode: OFF.

### Drop Snowflake Native App packages and application objects

Use the Teardown option to drop the application object and package defined in the resolved project definition.

* In the VS Code extension NATIVE APP pane, select Teardown. A confirmation message appears when teardown is complete.

## Change session context

You can use the Account section of the Side Bar pane to select roles, databases, schemas, and warehouses.
Use the associated dropdown to select each as appropriate.

Use the account drop down to sign in to, or switch between different accounts.

## Navigate with the Object Explorer

Use the OBJECT EXPLORER section of the Side Bar pane to examine and display characteristics of database objects.

1. Sign in to an account.
2. Expand OBJECT EXPLORER.
3. Expand the databases list.
4. Expand a database.
5. Expand a schema.
6. Navigate to a child component.
7. Select any element to view its characteristics in the SNOWFLAKE Query Results pane.
8. Optional. Select one of the following:

   * Select Open data preview () to display a preview of up to 100 rows of content associated with a table, view, materialized view, or similar object.
   * Select Show SQL () to display the SQL code associated with the object.

## Manage stage content

The Snowflake Extension for Visual Studio Code supports managing stage content directly in the Object Explorer.

### List all the files in a stage

1. In the VS Code OBJECT EXPLORER, navigate to a stage.
2. Expand the stage to see all staged files.

### Upload files from the local file system to a stage

The Snowflake Extension for Visual Studio Code only supports uploads for internal stages, all other operations work for both internal and external stages.

1. In the VS Code OBJECT EXPLORER, navigate to a stage and select Upload ().
2. Enter optional parameters for the upload operation. See [PUT](../sql-reference/sql/put.md) for a list of optional parameters.
3. Navigate to the folder containing upload files, and then select and upload one or more files.

### Download files from a stage to a local file system

1. In the VS Code OBJECT EXPLORER, navigate to a stage.
2. Select Download () to download all files, or expand the stage.
3. Select and download a file.
4. Select a directory to complete the download.

### Remove files from a stage

See also [REMOVE](../sql-reference/sql/remove.md).

1. In the VS Code OBJECT EXPLORER, navigate to a stage.
2. Select a file.
3. Select Remove ().

## VS Code extension settings

The following table lists the Snowflake Extension for Visual Studio Code settings.

| Setting | Description | Default |
| --- | --- | --- |
| Autocomplete Object Details | Show details of a Snowflake object after you select its autocomplete entry | Disabled |
| Autocomplete Variant Keys | Show OBJECT/VARIANT key autocomplete suggestions | Disabled |
| Connections Config File | Specifies the location of the `config.toml` file | Unset |
| Enable Frequency Based Completion | Enable frequency-based auto-completion suggestions | Enabled |
| Enable Native App Panel | Enable the Snowflake Native App pane | Disabled |
| Export CSV > Delimiter | Specifies delimiter for columns | Comma |
| Export CSV > Header | Enable inclusion of header row in exported CSV file | Enabled |
| Export CSV > Include Empty Rows | Enable inclusion of empty rows in exported CSV file | Exclude |
| Export CSV > Quotes | Enable double quotes around all values in exported CSV file | Enabled |
| Highlight Query | Enable background highlight on the current SQL statement | Enabled |
| Native App: Activate Snowflake CLI Debugging | Enable debugging mode for Snowflake CLI operations (Snowflake Native App) | Disabled |
| Object Explorer: Search | Enable search in object explorer | Enabled |
| Query History: Item Limit | Specifies the maximum number of queries shown in history. Showing more queries might affect performance. | 1000 |
| Set Client Session Keep Alive | Specifies whether to keep the session active indefinitely when the connection is active, regardless of activity. If this is not enabled, you must sign in again after four hours of inactivity. | Enabled |
| Set HTTP Agent Keep Alive | Enable Node.js driver socket reuse for requests | Enabled |
| Show Execute Above Statement | Enable a selectable execute action above each statement | Enabled |
| Skip Native App Support Message | Hides the support message when a Snowflake Native App project is detected | Disabled |
| Skip YAML Support Message | Hides the YAML extension recommendation message | Disabled |
| Snowsql Config Path | If set, connection configuration will be loaded from this file | Unset |
| Syntax Highlighting: Auto Detect SQL In Python | Enable SQL statement syntax highlighting in Python strings | Enabled |

### VS Code extension preview settings

[Preview Feature](../release-notes/preview-features.md) — Open

Available to all accounts.

The following table lists the VS Code extension preview settings.

| Setting | Description | Default |
| --- | --- | --- |
| Enable Public Preview Features | Enable public preview features for the extension | Disabled |

### Change VS Code extension settings

1. Select one of the following:

   * On Windows/Linux select File > Preferences > Settings.
   * On macOS select Code > Settings > Settings.
2. In the Search settings field, enter *Snowflake*.
3. Select the User or Workspace tabs to view or modify user specific or workspace specific settings.
4. Close the Settings tab.

## Show the VS Code extension changelog

1. Press `CMD+Shift+P` (Mac), or `CTRL+Shift+P` (Windows).
2. Enter the following command:

   ```none
   Show Change Log
   ```

## Uninstall the VS Code extension

1. Select Code > Settings > Extensions
2. Select the extension.
3. Right-click and select Disable or Uninstall.

---
title: Snowflake generation 2 standard warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses-gen2.md
section: User Guide
---

# Snowflake generation 2 standard warehouses

Generation 2 Standard Warehouse (Gen2) is an updated version (the “next generation”) of the
current standard virtual warehouse in Snowflake, focused on improving performance for
analytics and data engineering workloads. Gen2 is built on top of faster underlying hardware
and intelligent software optimizations, such as enhancements to delete, update, and merge operations,
and table scan operations. With Gen2, you can expect the majority of queries finish faster, and you can do more work
at the same time. The exact details depend on your configuration and workload. Conduct tests to verify how much this feature
improves your costs, performance, or both.

You can specify the generation for standard warehouses in the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md)
or [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) commands, using either the GENERATION clause or the RESOURCE_CONSTRAINT clause:

**Using GENERATION clause (recommended):**

* `GENERATION = '1'` represents Snowflake’s original, industry-leading standard virtual warehouses.
* `GENERATION = '2'` represents the next generation of Snowflake’s standard virtual warehouses.

**Using RESOURCE_CONSTRAINT clause:**

* STANDARD_GEN_1 represents Snowflake’s original, industry-leading standard virtual warehouses.
* STANDARD_GEN_2 represents the next generation of Snowflake’s standard virtual warehouses.

> **Note:**
>
> Currently, the GENERATION clause and the
> STANDARD_GEN_1 and STANDARD_GEN_2 values aren’t available in Snowsight. You must specify them
> with SQL commands.
>
> Generation 2 standard warehouses aren’t available for warehouse sizes X5LARGE and X6LARGE.
>
> This feature applies to standard warehouses. It doesn’t apply to Snowpark-optimized warehouses.
>
> STANDARD_GEN_1 provides the same memory capacity for standard warehouses as MEMORY_1X does
> for Snowpark-optimized warehouses.

## Default value for the RESOURCE_CONSTRAINT for standard warehouses

For the following regions, any account associated with a new organization created after June 27th, 2025 will have standard
warehouses default to Gen2:

* AWS US West (Oregon)
* AWS EU (Frankfurt)
* Azure East US 2 (Virginia)
* Azure West Europe (Netherlands)

For all other regions where Gen2 warehouses are available, all new organizations created after July 15th, 2025 will have standard
warehouses default to Gen2. For information about region availability, see
Region availability.

For any regions or organizations where the preceding factors don’t apply, if you don’t specify the GENERATION or RESOURCE_CONSTRAINT clause when
you create a standard warehouse, Snowflake creates a Gen1 standard warehouse.

## Changing a warehouse to or from a generation 2 warehouse

You can alter a standard warehouse and specify a different GENERATION clause or RESOURCE_CONSTRAINT clause to change
it from generation 1 to generation 2, or from generation 2 to generation 1. You can make that change
whether the warehouse is running or suspended.

You can also switch between a Gen2 standard warehouse and a Snowpark-optimized warehouse by
changing the value of the WAREHOUSE_TYPE and RESOURCE_CONSTRAINT clauses. You can make that change
whether the warehouse is running or suspended. Note that the GENERATION clause applies only to standard warehouses
and cannot be used with Snowpark-optimized warehouses.

> **Note:**
>
> When you convert a Gen1 warehouse to Gen2 without suspending it first, existing queries that were running on Gen1 continue to run
> to completion using the Gen1 compute resources. At the same time, the warehouse runs any new queries on the Gen2 compute
> resources. While the existing queries are running, you are charged for both sets of compute resources. The warehouse doesn’t
> automatically suspend during this period, whether or not any queries are using the Gen2 compute resources. When the existing
> queries complete, the workload shifts entirely to the Gen2 compute resources. Therefore, you can maximize availability by
> converting the warehouse while it’s running. Or, you can reduce costs by converting the warehouse while it’s suspended and no
> queries are running.
>
> The same consideration applies to converting between standard and Snowpark-optimized warehouses, or any other change
> to the RESOURCE_CONSTRAINT property. Existing queries will complete on the warehouse they began on and with the
> RESOURCE_CONSTRAINT that was in effect at the initialization of the query, while new queries will operate on the new warehouse
> type or the new RESOURCE_CONSTRAINT that you set.

You can see the setting for a standard warehouse in the `"resource_constraint"` column of
the SHOW WAREHOUSES output.

This setting isn’t reflected in the INFORMATION_SCHEMA views for warehouses.

## Region availability

Gen2 standard warehouses are available for the Amazon Web Services (AWS), Microsoft Azure, and Google Cloud Platform (GCP)
cloud service providers (CSPs).

Gen2 standard warehouses are available in all [CSP regions](intro-regions.md),
with some exceptions. Currently, Gen2 standard warehouses *aren’t* available in these CSP regions:

* AWS EU (Zurich)
* AWS Africa (Cape Town)
* GCP Middle East Central2 (Dammam)
* Azure US Gov Virginia (FedRAMP High Plus)
* Azure US Gov Virginia

> **Important:**
>
> If you use account replication for your warehouses, and you create any Gen2 warehouses, any secondary regions must
> also have Gen2 warehouse support. Otherwise, the Gen2 warehouses might not be able to resume in the
> secondary regions after a failover. Make sure to test that any Gen2 warehouses can be resumed in secondary regions.
>
> The defaults for Snowflake standard warehouses are changing, based on the availability of Gen2 standard warehouses. Currently, the
> default value of the RESOURCE_CONSTRAINT property depends on your organization and the CSP region of your account. For more
> information, see Default value for the RESOURCE_CONSTRAINT for standard warehouses.

## Cost and billing for Gen2 standard warehouses

For general information about credit usage with Snowflake virtual warehouses,
see [Virtual warehouse credit usage](cost-understanding-compute.md).

For information about credit consumption for Gen2 standard warehouses,
see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Examples

The following examples show how you can specify Gen2 standard warehouses when creating a new
warehouse or altering an existing one. The examples show variations such as changing the warehouse size,
type, and memory capacity at the same time.

### Examples using GENERATION clause (recommended approach)

The following example creates a Gen2 warehouse with all other properties left as defaults. The warehouse
type is STANDARD and the size is XSMALL. Those defaults are the same for both generation 1 and generation 2
standard warehouses.

```sqlexample
CREATE OR REPLACE WAREHOUSE next_generation_default_size
  GENERATION = '2';
```

The following example creates a Gen2 standard warehouse with size SMALL.

```sqlexample
CREATE OR REPLACE WAREHOUSE next_generation_size_small
  GENERATION = '2'
  WAREHOUSE_SIZE = SMALL;
```

### Examples using RESOURCE_CONSTRAINT clause

The following example creates a Gen2 warehouse using the RESOURCE_CONSTRAINT syntax:

```sqlexample
CREATE OR REPLACE WAREHOUSE next_generation_default_size
  RESOURCE_CONSTRAINT = STANDARD_GEN_2;
```

The following example creates a Gen2 standard warehouse with size SMALL.

```sqlexample
CREATE OR REPLACE WAREHOUSE next_generation_size_small
  RESOURCE_CONSTRAINT = STANDARD_GEN_2
  WAREHOUSE_SIZE = SMALL;
```

### Examples of converting between generations

The following example shows how to convert a generation 1 standard warehouse to generation 2. The
warehouse size remains the same, XLARGE, throughout the operation. This example uses the
GENERATION clause (recommended):

```sqlexample
CREATE OR REPLACE WAREHOUSE old_to_new_xlarge_gen
  WAREHOUSE_SIZE = XLARGE;

ALTER WAREHOUSE old_to_new_xlarge_gen
  SET GENERATION = '2';
```

The following example shows the same conversion using the RESOURCE_CONSTRAINT clause:

```sqlexample
CREATE OR REPLACE WAREHOUSE old_to_new_xlarge
  WAREHOUSE_SIZE = XLARGE;

ALTER WAREHOUSE old_to_new_xlarge
  SET RESOURCE_CONSTRAINT = STANDARD_GEN_2;
```

### Examples of converting to or from Snowpark-optimized warehouses

The following example shows how to convert a Gen2 standard warehouse to Snowpark-optimized.
Snowpark-optimized warehouses currently aren’t available as Gen2 warehouses. Because the warehouse
has size XSMALL when it has the type STANDARD, we specify a RESOURCE_CONSTRAINT value of MEMORY_1X.
That RESOURCE_CONSTRAINT produces a memory size that’s compatible with Snowpark-optimized warehouses
of XSMALL size.

```sqlexample
CREATE OR REPLACE WAREHOUSE gen2_to_snowpark_optimized
  RESOURCE_CONSTRAINT = STANDARD_GEN_2;

ALTER WAREHOUSE gen2_to_snowpark_optimized
  SET WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED' RESOURCE_CONSTRAINT = MEMORY_1X;
```

The following example shows how to convert a Snowpark-optimized warehouse to a standard Gen2
warehouse. The Snowpark-optimized warehouse starts with size MEDIUM and a relatively large memory
capacity represented by a RESOURCE_CONSTRAINT value of MEMORY_16X. After the change, the warehouse
is of type STANDARD, still with size MEDIUM. However, its memory capacity is lower. That’s because
the RESOURCE_CONSTRAINT value of STANDARD_GEN_2 has the same memory capacity as a Snowpark-optimized
warehouse with a resource constraint of MEMORY_1X.

```sqlexample
CREATE OR REPLACE WAREHOUSE snowpark_optimized_medium_to_gen2
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  WAREHOUSE_SIZE = MEDIUM
  RESOURCE_CONSTRAINT = MEMORY_16X;

ALTER WAREHOUSE snowpark_optimized_medium_to_gen2
  SET WAREHOUSE_TYPE = STANDARD GENERATION = '2';
```

---
title: Snowflake Horizon Catalog
source: https://docs.snowflake.com/en/user-guide/snowflake-horizon.md
section: User Guide
---

# Snowflake Horizon Catalog

Horizon Catalog is the universal catalog for your entire data estate. It provides context and governance for AI, enables any architecture
across clouds and regions, works with any engine and data format — and has zero risk of vendor lock-in.

## Context for AI

[Snowflake Intelligence](snowflake-cortex/snowflake-intelligence.md) empowers users across the organization to engage with
data intuitively by allowing them to ask questions and immediately obtain answers, insights, and visualizations. Snowflake Intelligence brings all your
structured and unstructured data together and uses helpful [AI agents](snowflake-cortex/cortex-agents.md) that understand
your business through your [semantic views](views-semantic/overview.md) and search services. Powered by [Cortex AI Functions](snowflake-cortex/aisql.md), [Cortex Analyst](snowflake-cortex/cortex-analyst.md), and [Cortex Search](snowflake-cortex/cortex-search/cortex-search-overview.md), Snowflake Intelligence delivers clear, trustworthy insights while keeping everything secure and
fully governed inside Snowflake. With easy access to [leading models](snowflake-cortex/snowflake-intelligence/reference.md) and cross-region inference,
every user can explore, discover, and act with confidence.

## Easy and safe data discovery across clouds

Horizon Catalog gives users one place to find all data resources with consistent metadata about Snowflake data, Apache Iceberg™ data, and
external relational sources and BI tools. Horizon Catalog expands visibility through
[Internal Marketplace](collaboration/listings/organizational/org-listing-about.md) listings so teams can discover governed
data products without copying data. Horizon Catalog enforces [access control](security-access-control-overview.md), protects
sensitive fields with [dynamic data masking](security-column-intro.md), applies
[row access policies](security-row-intro.md), and [identifies sensitive data](classify-intro.md) through data
classification.

Horizon Catalog supports enforcing row access and data masking policies on Apache Iceberg tables that you query from Apache Spark™ through Snowflake Horizon
Catalog. For more information,
see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Enterprise-grade security and governance

Horizon Catalog makes it easy to keep data safe, consistent, and well understood across your entire organization. It applies
[sensitive data classification](classify-intro.md), [retention](tables-iceberg-metadata.md), and
[data access policies](security-access-control-overview.md) the same way everywhere, giving every engine a shared view of your
metadata, [lineage](ui-snowsight-lineage.md), and rules. With [access history](ui-snowsight-lineage.md) and
[Time Travel](data-time-travel.md), teams can confidently review past activity and data states. Governance flows naturally to
external storage, Iceberg tables, and the [Snowflake Marketplace](../collaboration/collaboration-marketplace-about.md), so shared data
products always carry their tags and permissions wherever they go.

## Interoperability without vendor lock-in

Horizon Catalog connects all compute engines and formats through one governed environment. It presents consistent metadata and permissions
to Snowflake, Spark, and engines that read [Apache Iceberg](tables-iceberg.md). Horizon Catalog governs data inside Snowflake and in
external storage through [external tables](tables-external-intro.md) and Iceberg tables. It carries governance into the
Marketplace by [sharing](../guides-overview-sharing.md) data products through
[internal exchanges](collaboration/listings/organizational/org-listing-about.md) while preserving tags and access rules.
Horizon Catalog ensures every engine sees the same definitions, lineage, and policy behavior.

## A central spot for managing business continuity

Within Horizon, you [manage](account-replication-failover-failback.md) primary and secondary environments with ease, keeping
them consistent through database and account replication so your data, policies, and configurations stay aligned across every region and
account.

---
title: Snowflake in 20 minutes
source: https://docs.snowflake.com/en/user-guide/tutorials/snowflake-in-20minutes.md
section: User Guide
---

Snowflake

Getting Started

# Snowflake in 20 minutes

## Introduction

This tutorial uses the Snowflake command-line client, [SnowSQL](../snowsql.md), to introduce key concepts and tasks, including:

* Creating Snowflake objects—You create a database and a table for storing data.
* Loading data—We provide small sample CSV data files for you to load into the table.
* Querying—You explore sample queries.

> **Note:**
>
> Snowflake bills a minimal amount for the on-disk storage used for any sample data in
> this tutorial. The tutorial provides steps to drop objects and minimize storage
> cost. Snowflake requires a [virtual warehouse](../warehouses.md) to load the
> data and execute queries. A running virtual warehouse consumes Snowflake credits.
>
> If you are using a [30-day trial account](https://signup.snowflake.com/),
> which provides free credits, you won’t incur any costs.

### What you’ll learn

In this tutorial you’ll learn how to:

* Create Snowflake objects—You create a database and a table for storing data.
* Install SnowSQL—You install and use SnowSQL, the Snowflake command-line query tool.

  Users of Visual Studio Code might consider using the [Snowflake Extension for Visual Studio Code](../vscode-ext.md) instead of SnowSQL.
* Load CSV data files—You use various mechanisms to load data into tables from CSV files.
* Write and execute sample queries—You write and execute a variety of queries against newly loaded data.

## Prerequisites

This tutorial requires a database, table, and virtual warehouse to load and query data.
Creating these Snowflake objects requires a Snowflake user with a role with the
necessary access control privileges. In addition, [SnowSQL](../snowsql.md)
is required to execute the SQL statements in the tutorial. Lastly, the tutorial requires CSV files that contain sample data to load.

You can complete this tutorial using an existing Snowflake warehouse, database, and table, and your own local data files, but we recommend using the Snowflake objects and the set of
provided data.

To set up Snowflake for this tutorial, complete the following before continuing:

1. Create a user

   To create the database, table, and virtual warehouse, you must be logged in as a
   Snowflake user with a role that grants you the privileges to create these objects.

   * If you’re using a 30-day trial account, you can log in as the user that was created for the account.
     This user has the role with the privileges needed to create the objects.
   * If you don’t have a Snowflake user, you can’t perform this tutorial.
     If you don’t have a role that lets you create a user, ask someone who does to perform this step for you.
     Users with the ACCOUNTADMIN or SECURITYADMIN role can create users.
2. Install SnowSQL

   To install SnowSQL, see [Installing SnowSQL](../snowsql-install-config.md).
3. Download sample data files

   For this tutorial you download sample employee data files in CSV format that Snowflake provides.

   To download and unzip the sample data files:

   1. Download the set of sample data files. Right-click the name of the archive
      file, [`getting-started.zip`](../../_downloads/34f4a66f56d00340f8f7a92acaccd977/getting-started.zip), and save the link/file to your local file system.
   2. Unzip the sample files. The tutorial assumes you unpacked files into one of the following directories:
   > * Linux/macOS: `/tmp`
   > * Windows: `C:\\temp`

   Each file has five data records. The data uses a comma (,) character as field
   delimiter. The following is an example record:

   ```none
   Althea,Featherstone,afeatherstona@sf_tuts.com,"8172 Browning Street, Apt B",Calatrava,7/12/2017
   ```

There are no blank spaces before or after the commas separating the
fields in each record. This is the default that Snowflake expects when loading CSV data.

## Log in to SnowSQL

After you have [SnowSQL](../snowsql.md), start SnowSQL to connect to Snowflake:

1. Open a command-line window.
2. Start SnowSQL:

   ```bash
   $ snowsql -a <account_identifier> -u <user_name>
   ```

   Where:

   > * `<account_identifier>` is the unique identifier for your Snowflake account.
   >   :   The preferred format of the [account identifier](../admin-account-identifier.md) is as follows:
   >
   >       `organization_name-account_name`
   >       :   Names of your Snowflake organization and account. For more information, see [Format 1 (preferred): Account name in your organization](../admin-account-identifier.md).
   >
   >       If you don’t know your account identifier, see [Finding the organization and account name for an account](../admin-account-identifier.md).
   > * `<user_name>` is the login name for your Snowflake user.

   > **Note:**
   >
   > If your account has an identity provider (IdP) that has been defined for your account, you can use a web browser to authenticate instead of a password, as the following example demonstrates:
   >
   > ```bash
   > $ snowsql -a <account_identifier> -u <user_name> --authenticator externalbrowser
   > ```

   For more information, see [Using a web browser for federated authentication/SSO](../snowsql-start.md).
3. When SnowSQL prompts you, enter the password for your Snowflake user.

If you log in successfully, SnowSQL displays a command prompt that includes
your current warehouse, database, and schema.

> **Note:**
>
> If you get locked out of the account and can’t obtain the account identifier, you can find it in the Welcome email that Snowflake sent to
> you when you signed up for the trial account, or you can work with your
> ORGADMIN to [get the account details](../../sql-reference/sql/show-accounts.md).
> You can also find the values for `locator`, `cloud`, and `region`
> in the Welcome email.

If your Snowflake user doesn’t have a default warehouse, database, and schema, or if
you didn’t configure SnowSQL to specify a default warehouse, database, and schema,
the prompt displays `no warehouse`, `no database`, and `no schema`. For example:

```none
user-name#(no warehouse)@(no database).(no schema)>
```

This prompt indicates that there is no warehouse, database, and schema
selected for the current session. You create these objects
in the next step. As you follow the next steps in this tutorial to create
these objects, the prompt automatically updates to include the names of these objects.

For more information, see [Connecting through SnowSQL](../snowsql-start.md).

## Create Snowflake objects

During this step you create the following Snowflake objects:

* A database (`sf_tuts`) and a table (`emp_basic`). You load sample data into this table.
* A [virtual warehouse](../warehouses-overview.md) (`sf_tuts_wh`).
  This warehouse provides the compute resources needed to load data into
  the table and query the table. For this tutorial, you create an X-Small warehouse.

At the completion of this tutorial, you will remove these objects.

### Create a database

Create the `sf_tuts` database using the [CREATE DATABASE](../../sql-reference/sql/create-database.md) command:

```sqlexample
CREATE OR REPLACE DATABASE sf_tuts;
```

In this tutorial, you use the default schema (`public`) available for each database, rather than creating a new schema.

Note that the database and schema you just created are now in use for your current
session, as reflected in the SnowSQL command prompt. You can also use the context
functions to get this information.

```sqlexample
SELECT CURRENT_DATABASE(), CURRENT_SCHEMA();
```

The following is an example result:

```output
+--------------------+------------------+
| CURRENT_DATABASE() | CURRENT_SCHEMA() |
|--------------------+------------------|
| SF_TUTS            | PUBLIC           |
+--------------------+------------------+
```

### Create a table

Create a table named `emp_basic` in `sf_tuts.public` using the [CREATE TABLE](../../sql-reference/sql/create-table.md) command:

```sqlexample
CREATE OR REPLACE TABLE emp_basic (
   first_name STRING ,
   last_name STRING ,
   email STRING ,
   streetaddress STRING ,
   city STRING ,
   start_date DATE
   );
```

Note that the number of columns in the table, their positions, and their data types correspond to the fields in the sample CSV data files that you stage in the next step in this tutorial.

### Create a virtual warehouse

Create an X-Small warehouse named `sf_tuts_wh` using the [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) command:

```sqlexample
CREATE OR REPLACE WAREHOUSE sf_tuts_wh WITH
   WAREHOUSE_SIZE='X-SMALL'
   AUTO_SUSPEND = 180
   AUTO_RESUME = TRUE
   INITIALLY_SUSPENDED=TRUE;
```

The `sf_tuts_wh` warehouse is initially suspended, but the DML statement also sets
`AUTO_RESUME = true`. The AUTO_RESUME setting causes a warehouse to automatically start
when SQL statements that require compute resources are executed.

After you create the warehouse, it’s now in use for your current session.
This information is displayed in your SnowSQL command prompt. You can also retrieve
the name of the warehouse by using the following context function:

```sqlexample
SELECT CURRENT_WAREHOUSE();
```

The following is an example result:

```output
+---------------------+
| CURRENT_WAREHOUSE() |
|---------------------|
| SF_TUTS_WH          |
+---------------------+
```

## Stage data files

A Snowflake stage is a location in cloud storage that you use to load and
unload data from a table. Snowflake supports the following types of stages:

* **Internal stages**—Used to store data files internally within Snowflake. Each user and table in Snowflake gets an internal stage by default for staging data files.
* **External stages**—Used to store data files externally in Amazon S3, Google Cloud Storage, or Microsoft Azure.
  If your data is already stored in these cloud storage services, you can use an external stage to load data in Snowflake tables.

In this tutorial, we upload the sample data files
(downloaded in Prerequisites)
to the internal stage for the `emp_basic` table that you created earlier. You use the [PUT](../../sql-reference/sql/put.md) command
to upload the sample data files to that stage.

### Staging sample data files

Execute the [PUT](../../sql-reference/sql/put.md) command in [SnowSQL](../snowsql.md) to upload local data files to the table stage
provided for the `emp_basic` table you created.

```sqlexample
PUT file://<file-path>[/\]employees0*.csv @sf_tuts.public.%emp_basic;
```

For example:

* Linux or macOS

  ```sqlexample
  PUT file:///tmp/employees0*.csv @sf_tuts.public.%emp_basic;
  ```
* Windows

  ```sqlexample
  PUT file://C:\temp\employees0*.csv @sf_tuts.public.%emp_basic;
  ```

Let’s take a closer look at the command:

* `file://<file-path>[/]employees0*.csv` specifies the full directory path and
  names of the files on your local machine to stage. Note that file system wildcards are allowed, and if multiple files fit the pattern they are all displayed.
* `@<namespace>.%<table_name>` indicates to use the stage for the specified table, in this case the `emp_basic` table.

The command returns the following result, showing the staged files:

```output
+-----------------+--------------------+-------------+-------------+--------------------+--------------------+----------+---------+
| source          | target             | source_size | target_size | source_compression | target_compression | status   | message |
|-----------------+--------------------+-------------+-------------+--------------------+--------------------+----------+---------|
| employees01.csv | employees01.csv.gz |         360 |         287 | NONE               | GZIP               | UPLOADED |         |
| employees02.csv | employees02.csv.gz |         355 |         274 | NONE               | GZIP               | UPLOADED |         |
| employees03.csv | employees03.csv.gz |         397 |         295 | NONE               | GZIP               | UPLOADED |         |
| employees04.csv | employees04.csv.gz |         366 |         288 | NONE               | GZIP               | UPLOADED |         |
| employees05.csv | employees05.csv.gz |         394 |         299 | NONE               | GZIP               | UPLOADED |         |
+-----------------+--------------------+-------------+-------------+--------------------+--------------------+----------+---------+
```

The PUT command compresses files by default using `gzip`, as indicated in the TARGET_COMPRESSION column.

### Listing the staged files (Optional)

You can list the staged files using the [LIST](../../sql-reference/sql/list.md) command.

```sqlexample
LIST @sf_tuts.public.%emp_basic;
```

The following is an example result:

```output
+--------------------+------+----------------------------------+------------------------------+
| name               | size | md5                              | last_modified                |
|--------------------+------+----------------------------------+------------------------------|
| employees01.csv.gz |  288 | a851f2cc56138b0cd16cb603a97e74b1 | Tue, 9 Jan 2018 15:31:44 GMT |
| employees02.csv.gz |  288 | 125f5645ea500b0fde0cdd5f54029db9 | Tue, 9 Jan 2018 15:31:44 GMT |
| employees03.csv.gz |  304 | eafee33d3e62f079a054260503ddb921 | Tue, 9 Jan 2018 15:31:45 GMT |
| employees04.csv.gz |  304 | 9984ab077684fbcec93ae37479fa2f4d | Tue, 9 Jan 2018 15:31:44 GMT |
| employees05.csv.gz |  304 | 8ad4dc63a095332e158786cb6e8532d0 | Tue, 9 Jan 2018 15:31:44 GMT |
+--------------------+------+----------------------------------+------------------------------+
```

## Copy data into target tables

To load your staged data into the target table, execute [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md).

The [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command uses the virtual warehouse you created
in Create Snowflake objects to copy files.

```sqlexample
COPY INTO emp_basic
  FROM @%emp_basic
  FILE_FORMAT = (type = csv field_optionally_enclosed_by='"')
  PATTERN = '.*employees0[1-5].csv.gz'
  ON_ERROR = 'skip_file';
```

Where:

* The FROM clause specifies the location containing the data files (the internal stage for the table).
* The FILE_FORMAT clause specifies the file type as CSV, and specifies the double-quote
  character (`"`) as the character used to enclose strings. Snowflake supports
  diverse file types and options. These are described
  in [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md).
* The PATTERN clause specifies that the command should load data from the filenames matching
  this regular expression (`.*employees0[1-5].csv.gz`).
* The ON_ERROR clause specifies what to do when the COPY command encounters errors in the files. By default, the command stops loading data
  when the first error is encountered. This example skips any file containing an error and moves on to loading
  the next file. (None of the files in this tutorial contain errors; this is included for illustration purposes.)

The COPY command also provides an option for validating files before they are loaded. For more information about additional error checking and validation instructions, see the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) topic and the other [data loading tutorials](../../guides-overview-loading-data.md).

The COPY command returns a result showing the list of files copied and related information:

```output
+--------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
| file               | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
|--------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
| employees02.csv.gz | LOADED |           5 |           5 |           1 |           0 | NULL        |             NULL |                  NULL | NULL                    |
| employees04.csv.gz | LOADED |           5 |           5 |           1 |           0 | NULL        |             NULL |                  NULL | NULL                    |
| employees05.csv.gz | LOADED |           5 |           5 |           1 |           0 | NULL        |             NULL |                  NULL | NULL                    |
| employees03.csv.gz | LOADED |           5 |           5 |           1 |           0 | NULL        |             NULL |                  NULL | NULL                    |
| employees01.csv.gz | LOADED |           5 |           5 |           1 |           0 | NULL        |             NULL |                  NULL | NULL                    |
+--------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
```

## Query loaded data

You can query the data loaded in the `emp_basic` table using standard [SQL](../../sql-reference/constructs.md) and any supported
[functions](../../sql-reference-functions.md) and
[operators](../../sql-reference/operators.md).

You can also manipulate the data, such as updating the loaded data or inserting more data, using standard [DML commands](../../sql-reference/sql-dml.md).

### Retrieve all data

Return all rows and columns from the table:

```sqlexample
SELECT * FROM emp_basic;
```

The following is a partial result:

```output
+------------+--------------+---------------------------+-----------------------------+--------------------+------------+
| FIRST_NAME | LAST_NAME    | EMAIL                     | STREETADDRESS               | CITY               | START_DATE |
|------------+--------------+---------------------------+-----------------------------+--------------------+------------|
| Arlene     | Davidovits   | adavidovitsk@sf_tuts.com  | 7571 New Castle Circle      | Meniko             | 2017-05-03 |
| Violette   | Shermore     | vshermorel@sf_tuts.com    | 899 Merchant Center         | Troitsk            | 2017-01-19 |
| Ron        | Mattys       | rmattysm@sf_tuts.com      | 423 Lien Pass               | Bayaguana          | 2017-11-15 |
 ...
 ...
 ...
| Carson     | Bedder       | cbedderh@sf_tuts.co.au    | 71 Clyde Gallagher Place    | Leninskoye         | 2017-03-29 |
| Dana       | Avory        | davoryi@sf_tuts.com       | 2 Holy Cross Pass           | Wenlin             | 2017-05-11 |
| Ronny      | Talmadge     | rtalmadgej@sf_tuts.co.uk  | 588 Chinook Street          | Yawata             | 2017-06-02 |
+------------+--------------+---------------------------+-----------------------------+--------------------+------------+
```

### Insert additional data rows

In addition to loading data from staged files into a table, you can insert rows directly into a table using the [INSERT](../../sql-reference/sql/insert.md) DML command.

For example, to insert two additional rows into the table:

```sqlexample
INSERT INTO emp_basic VALUES
   ('Clementine','Adamou','cadamou@sf_tuts.com','10510 Sachs Road','Klenak','2017-9-22') ,
   ('Marlowe','De Anesy','madamouc@sf_tuts.co.uk','36768 Northfield Plaza','Fangshan','2017-1-26');
```

### Query rows based on email address

Return a list of email addresses with United Kingdom top-level domains using the [[ NOT ] LIKE](../../sql-reference/functions/like.md) function:

```sqlexample
SELECT email FROM emp_basic WHERE email LIKE '%.uk';
```

The following is an example result:

```output
+--------------------------+
| EMAIL                    |
|--------------------------|
| gbassfordo@sf_tuts.co.uk |
| rtalmadgej@sf_tuts.co.uk |
| madamouc@sf_tuts.co.uk   |
+--------------------------+
```

### Query rows based on start date

For example, to calculate when certain employee benefits might start, add 90 days to employee start
dates using the [DATEADD](../../sql-reference/functions/dateadd.md) function. Filter the list by employees whose start date occurred earlier than January 1, 2017:

```sqlexample
SELECT first_name, last_name, DATEADD('day',90,start_date) FROM emp_basic WHERE start_date <= '2017-01-01';
```

The following is an example result:

```output
+------------+-----------+------------------------------+
| FIRST_NAME | LAST_NAME | DATEADD('DAY',90,START_DATE) |
|------------+-----------+------------------------------|
| Granger    | Bassford  | 2017-03-30                   |
| Catherin   | Devereu   | 2017-03-17                   |
| Cesar      | Hovie     | 2017-03-21                   |
| Wallis     | Sizey     | 2017-03-30                   |
+------------+-----------+------------------------------+
```

## Summary, clean up, and additional resources

Congratulations! You’ve successfully completed this introductory tutorial.

Take a few minutes to review a short summary and the key points covered in the tutorial.
You might also want to consider cleaning up by dropping any objects you created in the tutorial.
Learn more by reviewing other topics in the Snowflake Documentation.

### Summary and key points

In summary, data loading is performed in two steps:

1. Stage the data files to load. The files can be staged internally (in Snowflake) or in an external location. In this tutorial, you stage files internally.
2. Copy data from the staged files into an existing target table. A running
   warehouse is required for this step.

Remember the following key points about loading CSV files:

* A CSV file consists of 1 or more records, with 1 or more fields in each record, and sometimes a header record.
* Records and fields in each file are separated by delimiters. The default delimiters are:

  > Records:
  > :   newline characters
  >
  > Fields:
  > :   commas

  In other words, Snowflake expects each record in a CSV file to be separated by new lines and the fields (i.e. individual values) in each record to be separated by commas. If different
  characters are used as record and field delimiters, you must explicitly specify this as part of the file format when loading.
* There is a direct correlation between the fields in the files and the columns in the table you will be loading, in terms of:

  > + Number of fields (in the file) and columns (in the target table).
  > + Positions of the fields and columns within their respective file/table.
  > + Data types, such as string, number, or date, for fields and columns.

  The records will not be loaded if the numbers, positions, and data types don’t align with the data.

  > **Note:**
  >
  > Snowflake supports loading files in which the fields don’t exactly align with the columns in the target table;
  > however, this is a more advanced data loading topic (covered in
  > [Transform data during a load](../data-load-transform.md)).

### Tutorial cleanup (Optional)

If the objects you created in this tutorial are no longer needed,
you can remove them from the system with [DROP <object>](../../sql-reference/sql/drop.md) statements.

```sqlexample
DROP DATABASE IF EXISTS sf_tuts;

DROP WAREHOUSE IF EXISTS sf_tuts_wh;
```

### Exit the connection

To exit a connection, use the `!exit` command for SnowSQL (or its alias, `!disconnect`).

Exit drops the current connection and quits SnowSQL if it is the last connection.

### What’s next?

Continue learning about Snowflake using the following resources:

* Complete the other tutorials provided by Snowflake:

  + [Tutorials to get started with Snowflake](../../learn-tutorials.md)
* Familiarize yourself with key Snowflake concepts and features, as well as the SQL commands to perform queries and insert/update data:

  + [Get started with Snowflake for users](../../getting-started-for-users.md)
  + [Query syntax](../../sql-reference/constructs.md)
  + [Data Manipulation Language (DML) commands](../../sql-reference/sql-dml.md)

---
title: Snowflake interactive tables and interactive warehouses
source: https://docs.snowflake.com/en/user-guide/interactive.md
section: User Guide
---

# Snowflake interactive tables and interactive warehouses

## Overview

Snowflake interactive tables and interactive warehouses are specialized types of Snowflake objects that are optimized for low-latency, high concurrency workloads. Ideal for use cases such as real-time dashboards, data-powered APIs, and serving high-concurrency workloads.

Interactive warehouse
:   A warehouse that’s optimized for low-latency, interactive workloads. The warehouse contains a query engine that is optimized for low-latency, high concurrency queries.

Interactive table
:   A type of Snowflake table that’s optimized for low latency, high concurrency workloads that works well with interactive warehouses and can be used with standard Snowflake warehouses. You get the best performance gains when you query these tables through interactive warehouses.

## Use cases for interactive tables

Real-time dashboards
:   Serving dashboard queries that powers thousands of users requests with low-latency, high concurrency. Especially useful for serving use cases where some aggregations and flexibility are required.

Data-powered APIs
:   Serving data-powered APIs that require predictable, consistency latency, that contain repetitive query shapes.

Alerting and agentic AI workloads
:   For observability and AI agentic workloads that can generate unpredictable query load spikes and requires low cost per query.

## Getting started with interactive tables

To get started with interactive tables, complete the following sequence of steps:

1. Create an interactive table, using a standard warehouse. For more information, see
   Creating an interactive table.
2. Create an interactive warehouse. For more information, see
   Creating an interactive warehouse.
3. Resume the interactive warehouse. For more information, see
   Resuming and suspending an interactive warehouse.
4. Add the interactive table to the interactive warehouse. For more information, see
   Adding an interactive table to an interactive warehouse.
5. Start querying the interactive table through the interactive warehouse. For more information, see
   Querying an interactive table.

## Working with interactive tables and interactive warehouses

The following procedures explain how to create and manage all the required
objects to run queries using interactive tables. When you are trying this
feature for the first time, perform these procedures in the following order.

### Creating an interactive table

Table creation follows the standard CTAS ([CREATE TABLE AS SELECT](../sql-reference/sql/create-table.md)) syntax,
with the additional INTERACTIVE keyword that defines the table type.

The CREATE INTERACTIVE TABLE command also requires a CLUSTER BY clause.
Specify one or more columns in the CLUSTER BY clause to match the WHERE clauses in your most time-critical queries.
The columns you specify in the CLUSTER BY clause can significantly affect the performance
of queries on the interactive table. Therefore, choose the clustering columns carefully.
For more information about choosing the best clustering columns, see [Clustering Keys & Clustered Tables](tables-clustering-keys.md).

> **Note:**
>
> You run the CREATE INTERACTIVE TABLE command with a standard warehouse.
> You only use the interactive warehouse in later steps, to query the interactive table.

The following command creates an interactive table containing the same columns and data
as a standard table. The CLUSTER BY clause refers to a column named `id` from the source table.

```sqlexample
CREATE INTERACTIVE TABLE
  IF NOT EXISTS orders
  CLUSTER BY (id)
AS
  SELECT * FROM demoSource;
```

#### Specifying auto-refresh for an interactive table

To make an interactive table automatically refresh using data from some
other table, specify the TARGET_LAG clause with an interval.
When you specify TARGET_LAG, you must also specify the WAREHOUSE clause
and the name of a standard warehouse that Snowflake will use for regular
maintenance refreshes.

You can also optionally specify INITIALIZATION_WAREHOUSE to run initial
refreshes on a separate warehouse. Initial refreshes often process more
data than maintenance refreshes. In many cases, you can use a larger
warehouse, such as XL, for the initial refresh and a smaller warehouse,
such as S, for ongoing maintenance refreshes.

The time interval for the TARGET_LAG clause lets you specify the maximum
lag in terms of some number of seconds, minutes, hours, or days:

```sqlsyntax
TARGET_LAG = '<num> { seconds | minutes | hours | days }'
```

If you don’t specify a unit, the number represents seconds. The minimum value
is 60 seconds, or 1 minute.

For example, the following CREATE INTERACTIVE TABLE statement defines an
interactive table that lags no more than 20 minutes behind a specified
source table, uses a larger warehouse for the initial refresh, and uses
a smaller warehouse for ongoing maintenance refreshes:

```sqlexample
CREATE INTERACTIVE TABLE my_dynamic_interactive_table
  CLUSTER BY (c1, c2)
  TARGET_LAG = '20 minutes'
  WAREHOUSE = s_maintenance_wh
  INITIALIZATION_WAREHOUSE = xl_initial_wh
AS SELECT c1, SUM(c2) FROM my_source_table GROUP BY c1;
```

For more information about choosing an appropriate lag time that balances costs and freshness of data,
see [How Snowflake schedules refreshes](dynamic-tables-target-lag.md). For guidance on using separate warehouses for initial and
maintenance refreshes, see [Adjust your warehouse configuration](dynamic-tables-performance-optimize.md). Similar considerations apply to
interactive tables as to dynamic tables.

You can also manually trigger a refresh for a dynamic interactive table by running

```sqlexample
ALTER INTERACTIVE TABLE ``my_dynamic_interactive_table`` REFRESH.
```

### Creating an interactive warehouse

After you create an interactive table, querying that table with optimal performance requires an interactive warehouse.
Specify the keyword INTERACTIVE in the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) or CREATE OR REPLACE WAREHOUSE command.

Optionally, you can specify a TABLES clause with a comma-separated list of interactive table names.
Using that clause immediately associates those interactive tables with the interactive warehouse.

The following command creates an interactive warehouse that’s associated with the interactive table
named `orders`. In this case, you can immediately run a [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md)
command for the interactive warehouse, and begin running queries for the interactive table:

```sqlexample
CREATE OR REPLACE INTERACTIVE WAREHOUSE interactive_demo
  TABLES (orders)
  WAREHOUSE_SIZE = 'XSMALL';
```

The following command creates an interactive warehouse with no associated interactive tables.
In this case, you run ALTER WAREHOUSE commands afterward to associate interactive tables
with the interactive warehouse:

```sqlexample
CREATE OR REPLACE INTERACTIVE WAREHOUSE interactive_demo
  WAREHOUSE_SIZE = 'XSMALL';
```

After you create an interactive warehouse, it remains in a suspended state until you resume it.
You can configure auto-suspend and auto-resume for interactive warehouses. The minimum
auto-suspend interval for an interactive warehouse is 24 hours (86400 seconds). For details,
see Resuming and suspending an interactive warehouse.

## Interactive table performance considerations

The following sections explain how to solve performance issues that you might encounter due to
the special characteristics of interactive tables and the workloads they’re best suited for.

### Query best practices for interactive warehouses

Interactive warehouses are optimized for queries with **selective workloads**. This means queries
with good selectivity see substantially more improvements on performance than other query types.

| Expect more performance benefits with interactive warehouses | Expect limited performance benefits with interactive warehouses |
| --- | --- |
| ```sqlexample SELECT col1, col4, AVG(col_x)   FROM my_table   GROUP BY col1, col4; ```  This query is highly selective because it only requires a few columns. Snowflake can optimize loading only columns required for this one query. | ```sqlexample SELECT * FROM my_table; ```  This query processes all columns. Although the query is simple, Snowflake must process a large amount of data, which might exceed the size of the cache. Even if the contents of the table can fit in the cache, that leaves less room to cache data from other queries, leading to lower concurrency. |
| ```sqlexample SELECT col1, col2   FROM my_table   WHERE     col_x IN (1,4,7,8)     AND event_time >=       DATEADD(hour, -1, CURRENT_TIMESTAMP()); ```  The conditions in the WHERE clause make this query highly selective. The IN clause limits the results to a relatively few items, and the time comparison further limits the data to a certain time period. | ```sqlexample SELECT col1, col2   FROM my_table   WHERE     event_time >=       DATEADD(day, -365, CURRENT_TIMESTAMP()); ```  Asking for data for an entire year makes this query less selective. If your dataset is big, this query might process all rows in the table. |

Other complexities such as large joins (for example, by joining two fact tables), or compute-intensive expressions such
as regular expressions, might result in lower concurrency due to higher use of compute resources.
See Choosing a size for an interactive warehouse for information about optimizing for
those situations.

### Data layout best practices for interactive tables

Interactive tables follow standard Snowflake best practices for performance. In particular,
interactive tables benefit from a **well-clustered table**, a table that’s sorted based on the same
column or columns that you are filtering on. For example, if your query often filters on a TIMESTAMP
column such as `sale_date`, then it makes sense to use that column as the clustering key when creating
the interactive table. For example, you might create the interactive table as follows:

```sqlexample
CREATE INTERACTIVE TABLE product_sales (<column definitions>) CLUSTER BY (sale_date);
```

That way, SELECT queries that filter on `sale_date` can quickly skip all irrelevant data
and return results. For example, the following query filters on a date range by testing the
`sale_date` column:

```sqlexample
SELECT ... WHERE sale_date > '2025-10-24' AND ...
```

For more details about choosing the best clustering keys, see
[Clustering Keys & Clustered Tables](tables-clustering-keys.md).

### Choosing a size for an interactive warehouse

Once you’ve completed all your queries and layout optimizations, consider **scaling your warehouse**
to meet demand. Interactive warehouses have a range of sizes from XSMALL to 3XLARGE, as well as
[Multi-cluster warehouses](warehouses-multicluster.md).

We recommend that you start by sizing your warehouse based on the approximate size of the *working
data set* in the interactive table. The working data set refers to the portion of the data that is
frequently queried. For example, if your queries typically only query the last seven days of sales data,
the working set is the fraction of the interactive table corresponding to those seven days.

This is because the interactive warehouse utilizes *local storage caching*. While the data for
your entire data set (table) is always accessible, accessing non-cached data does incur higher read
latency on the first read.

Choose a warehouse size to fit the needs of your workloads. Experiment with your particular data and
workload to determine the optimal size for your interactive warehouse.

> **Tip:**
>
> For good performance, you don’t need to fit the entire working set of your queries in the cache.
> Pick a cache size that’s sufficient to hold your *hot data*, that is, the data from your
> frequently accessed rows. In fact, many customers can serve most of their queries from the interactive warehouse cache even with only a portion of their data cached.

We recommend starting with the following warehouse sizes based on the working data set size.
(Subject to change and hardware differences depending on cloud provider and region)

| Working Set | Warehouse Size |
| --- | --- |
| Less than ~350 GB | XSMALL |
| ~350 GB to ~600 GB | SMALL |
| ~600 GB to ~1.2 TB | MEDIUM |
| ~1.2 TB to ~2.5 TB | LARGE |
| ~2.5 TB to ~5.5 TB | XLARGE |
| 5.5 TB to ~11 TB | 2XLARGE |
| ~11 TB to ~22 TB | 3XLARGE |
| ~22 TB to ~44 TB | 4XLARGE |

#### Performance troubleshooting for interactive tables

##### Problem 1: My single query is taking too long

This is likely due to your query requiring more computing resources to finish. It’s possible that your
query has a lot of complex processing, thus requiring more CPUs. For example, queries with a lot of
regular expression filters and CASE clauses. It’s also possible that your queries require a lot of memory, such
as queries that do a lot of `COUNT(DISTINCT ...)`. To lower the run time of a single query,
consider a **larger warehouse size**. Start with the recommended size above, and keep
increasing the size of the warehouse until you are satisfied with a single query’s latency.

##### Problem 2: My queries are suddenly taking a long time to run (High tail latency, high P95 latency)

A sudden increase in query time is likely due to insufficient caching. Each warehouse size has a
local SSD cache that Snowflake uses to cache the most recently used data. Snowflake manages the cache to
only store parts of the table that are accessed frequently. If your queries are selective, then
increasing warehouse size can potentially reduce tail latency.

Also note, the newly spun-up warehouse takes a while to **warm the cache**. Snowflake proactively
warms the newly added data. For benchmarking, wait for a while before starting the benchmark so that
the cache has time to warm up. Cache warm-up speed is based on warehouse size and table size. The
bigger your interactive table is, the longer Snowflake takes to warm the cache. On the other hand,
the larger the size you specify for the interactive warehouse, the shorter the warming time.

##### Problem 3: My query is queuing or I’m not able to achieve the expected concurrency

You can scale out your warehouse by setting the MIN_CLUSTER_COUNT and MAX_CLUSTER_COUNT parameters.
That way, you can create a multi-cluster interactive warehouse. If MAX_CLUSTER_COUNT is set to value greater than the MIN_CLUSTER_COUNT, the warehouse will scale out automatically.

### Adding an interactive table to an interactive warehouse

To get optimal query performance for an interactive table, you should use an interactive warehouse.

Before you can query the interactive table from an interactive warehouse, you must perform a
one-time operation to add the interactive table to the interactive warehouse. Otherwise, you’ll see
an `object not found` error when running a query against such a table from the interactive
warehouse. If you didn’t specify the interactive tables to associate with the interactive warehouses
by using the TABLES clause in your CREATE INTERACTIVE WAREHOUSE command, you can do that later by
using an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.

The following command associates the `orders` table with the `interactive_demo` warehouse. You can specify multiple table
names, separated by commas, with the ADD TABLES clause.

```sqlexample
ALTER WAREHOUSE interactive_demo ADD TABLES (orders);
```

If the interactive table is already associated with the interactive warehouse, the command
succeeds but has no effect. You can associate an interactive table with multiple interactive warehouses.

This action starts the cache-warming process. Cache warming time is based on the size of the data and the warehouse size. A XS warehouse warms roughly at 300-350MB/s. The bigger the table, the longer the cache warming time. Larger warehouses warms faster.

Warming process do not block the warehouse from accepting new queries. Priority of warming is:
1. User issued queries
2. Newly added micropartitions through auto-refresh or other means of data ingestion
3. Any existing data

Because cache warming depends on your queries, the best way to monitor whether the cache is warm is to review the remote read percentage in [Snowsight Query Profile](ui-snowsight-activity.md). For programmatic access to query operator statistics, see [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md). In ideal execution scenarios, low-latency queries should have a remote read percentage of 0%.

### Removing an interactive table from an interactive warehouse

You can detach one or more interactive tables from an interactive warehouse by running an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command
with the DROP TABLES clause.

```sqlexample
ALTER WAREHOUSE interactive_demo DROP TABLES (orders, customers);
```

> **Note:**
>
> The interactive tables still exist after this operation. This ALTER WAREHOUSE clause isn’t the same as performing the SQL command DROP TABLE.

### Using search optimization for point lookups

We recommend adding [search optimization](search-optimization/enabling.md) when you
perform point lookup queries on your interactive table. Point lookups are queries that filter on a
single column to retrieve one or a few rows of data. A good example is `WHERE some_id =
some_UUID`.

## Materialized view support for interactive tables

You can create materialized views on interactive tables. An *interactive materialized view*
precomputes and stores the results of a query on an interactive table, which can further improve
query performance for common aggregation patterns.

To create an interactive materialized view, use the INTERACTIVE keyword in the
[CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md) statement:

```sqlexample
CREATE INTERACTIVE MATERIALIZED VIEW IF NOT EXISTS mv_order_summary
  AS
    SELECT region, SUM(quantity) AS total_quantity, SUM(net_paid) AS total_net_paid
      FROM orders
      GROUP BY region;
```

After you create the interactive materialized view, you must add **both** the materialized view
and the underlying base table to your interactive warehouse:

```sqlexample
ALTER WAREHOUSE interactive_demo ADD TABLES (mv_order_summary, orders);
```

### Best practices for interactive materialized views

Follow these guidelines when you create materialized views using interactive tables:

* Interactive Materlized View works just like regular materialized views. It must be based on an interactive table.
* Interactive Materlized View and the source interactive table it’s based on must be added to the same interactive warehouse. Otherwise, you’ll see an `object not found` error when running a query against such a materialized view from the Interactive warehouse.
* Joins aren’t supported in materialized views, whether interactive or standard. Structure your
  queries to aggregate or filter from a single base table.
* You can’t use an interactive table or an interactive materialized view as the source for a
  [dynamic table](dynamic-tables-about.md).
* When you’re considering candidates for interactive materialized views,
  choose aggregation queries that are frequently run and expensive to compute

### Resuming and suspending an interactive warehouse

The following command resumes an interactive warehouse. You must do this after creating the warehouse, because it’s
created in a suspended state:

```sqlexample
ALTER WAREHOUSE interactive_demo RESUME;
```

You also do this to start running queries through the warehouse, if you manually suspended the warehouse.

Queries will be slow while the cache is being warmed after resuming. It might take a
few minutes to an hour or so, depending on how much data you have in that table.

The following command suspends an interactive warehouse:

```sqlexample
ALTER WAREHOUSE interactive_demo SUSPEND;
```

#### Auto-suspend and auto-resume for interactive warehouses

Interactive warehouses support auto-suspend and auto-resume. You can set the AUTO_SUSPEND
and AUTO_RESUME properties when creating or altering an interactive warehouse.

The minimum AUTO_SUSPEND value for an interactive warehouse is 86400 seconds (24 hours).
This minimum ensures that the cache stays warm long enough to provide consistent low-latency
performance. If you specify a value less than 86400, Snowflake uses 86400 instead.

The following example creates an interactive warehouse with auto-suspend after 24 hours of
inactivity, and auto-resume enabled:

```sqlexample
CREATE INTERACTIVE WAREHOUSE interactive_demo
  WAREHOUSE_SIZE = 'XSMALL'
  AUTO_SUSPEND = 86400
  AUTO_RESUME = TRUE;
```

You can also set these properties on an existing interactive warehouse:

```sqlexample
ALTER WAREHOUSE interactive_demo SET
  AUTO_SUSPEND = 86400
  AUTO_RESUME = TRUE;
```

> **Note:**
>
> In a production environment, you typically use interactive warehouses for workloads running
> many concurrent queries 24x7, or where low latency is crucial for queries. Suspending and
> resuming an interactive warehouse (whether manually or through auto-suspend) incurs
> significant cache warm-up time, so evaluate whether auto-suspend is appropriate for your
> workload pattern.

## Region availability

Interactive tables and interactive warehouses are available in the following Amazon Web Services (AWS),
Google Cloud Platform (GCP), and Microsoft Azure regions. For more information about Snowflake regions,
see [Supported cloud regions](intro-regions.md).

* `us-east-1` - AWS US East (N. Virginia)
* `us-west-2` - AWS US West (Oregon)
* `us-east-2` - AWS US East (Ohio)
* `ca-central-1` - AWS Canada (Central)
* `ap-northeast-1` - AWS Asia Pacific (Tokyo)
* `ap-southeast-2` - AWS Asia Pacific (Sydney)
* `eu-central-1` - AWS EU (Frankfurt)
* `eu-west-1` - AWS EU (Ireland)
* `eu-west-2` - AWS Europe (London)
* `us-central1` - GCP US Central1 (Iowa)
* `us-east4` - GCP US East4 (N. Virginia)
* `europe-west2` - GCP Europe West2 (London)
* `europe-west3` - GCP Europe West3 (Frankfurt)
* `europe-west4` - GCP Europe West4 (Netherlands)
* `australia-southeast2` - GCP Australia Southeast2 (Melbourne)
* Azure: all Azure regions.

### Task based multi-cluster sizing

You can adjsut the MIN_CLUSTER_COUNT parameters via scheduled tasks.

```sqlexample
-- 1) Task to scale OUT during business hours
CREATE OR REPLACE TASK mcw_scale_out_morning
  WAREHOUSE = my_wh          -- the warehouse that *executes the task*
  SCHEDULE = 'USING CRON 0 8 * * * UTC'   -- 08:00 UTC daily
AS
  ALTER WAREHOUSE my_wh      -- the warehouse you want to change (can be same or different)
    SET
      MIN_CLUSTER_COUNT = 10;  -- optional: ECONOMY or STANDARD

-- 2) Task to scale IN after hours
CREATE OR REPLACE TASK mcw_scale_in_evening
  WAREHOUSE = my_wh
  SCHEDULE = 'USING CRON 0 20 * * * UTC'  -- 20:00 UTC daily
AS
  ALTER WAREHOUSE my_wh
    SET
      MIN_CLUSTER_COUNT = 2;
```

It’s recommended to set the MAX_CLUSTER_COUNT using auto-scaling policy to acommendate peak concurrency for your workload..

### Dropping an interactive warehouse

You can run the [DROP WAREHOUSE](../sql-reference/sql/drop-warehouse.md) command to remove an interactive warehouse entirely. Dropping an
interactive warehouse removes the associations between that warehouse and any interactive tables. However, you can still use other
interactive warehouses to query those same interactive tables.

### Querying an interactive table

In your query session, make sure that the warehouse for your current session is an interactive warehouse:

```sqlexample
USE WAREHOUSE interactive_demo;
```

After this, you can query your interactive table normally.

> **Note:**
>
> * In an interactive warehouse, you can only query interactive tables. To query other types of Snowflake tables, such as standard
>   tables or hybrid tables, switch to a standard warehouse first.
> * Certain types of queries are especially suited for interactive tables. For more information, see
>   Use cases for interactive tables.

### Benchmarking best practices

When assessing the performance of interactive tables in a test environment, follow these
best practices to avoid inconsistent or misleading results:

* Turn off the query result cache to make the benchmark results consistent between multiple
  benchmark runs. You can turn off the query result cache at the account, user, and session level by
  setting the [USE_CACHED_RESULT](../sql-reference/parameters.md) session parameter. That way, the
  queries only use the table data cache from the interactive warehouse. When you turn result caching
  on in your production environment, you can expect equal or better performance than in your
  benchmark testing.
* Because an interactive warehouse takes some time to warm the table data cache, wait for a while
  after you create or resume an interactive warehouse before testing query performance. This
  simulates the typical production configuration, where the warehouse remains active for long
  periods. Snowflake applies optimizations to the cache warming process. Therefore, it’s more
  efficient to let Snowflake complete this process than to warm the cache yourself by running sample
  queries.
* When comparing performance of interactive tables against standard Snowflake tables, don’t
  interleave the queries between standard and interactive tables. Instead, run the full benchmark on
  standard tables, then run the same tests on interactive tables.
* When doing comparative benchmarks with other database systems, make sure that the clustering
  columns in your interactive tables match the WHERE clause predicates in your queries. For more
  information about choosing the best clustering columns, see
  [Clustering Keys & Clustered Tables](tables-clustering-keys.md). In particular, don’t cluster on columns with
  high cardinality, such as unique IDs or timestamps.
* If your queries are short and simple, you can achieve higher concurrency by setting the
  [MAX_CONCURRENCY_LEVEL](../sql-reference/parameters.md) parameter to a higher value for
  your interactive warehouse.

## Interactive tables and storage lifecycle policies

You can use [storage lifecycle policies](storage-management/storage-lifecycle-policies.md) to
archive or expire specific table rows based on conditions that you define, such as data age or other criteria.

Currently, you can’t use storage lifecycle policies for interactive tables that use auto-refresh.
You can use the TARGET_LAG parameter, or a storage lifecycle policy, but not both.

## Disaster recovery and replication

When added to a replication group, interactive tables and warehouses are replicated to the target account.

Interactive table replication behaves the same as standard table replication.
Interactive warehouse replication behaves the same as standard warehouse replication, except it assume Interactive warehouse is supported in the target region. There is no validation of Interactive warehouse replication in the target region at this time.

The Interactive warehouse in target account wil auto-resume. However, due to cache warming requirements, the performance of the warehouse is not guaranteed. To ensure consistent performance, you can keep the warehouse running in the target region.

## Cost and billing considerations

Interactive warehouses incur compute charges when active. The minimum billable period for
an interactive warehouse is one hour, and at one-second granularity thereafter.

> **Note:**
>
> If you resume an interactive warehouse that was suspended (whether manually or through
> auto-resume), that operation results in a new minimum billable period charge. That charge
> applies even if you were already being billed for that period because of other recent activity
> in the warehouse. Therefore, avoid suspending and resuming an interactive warehouse multiple
> times within a short period. The 24-hour minimum auto-suspend interval helps prevent
> excessive suspend/resume cycles.

Interactive tables incur standard storage costs. The price for storage of interactive tables is the
same as for standard tables. Interactive tables may be larger than equivalent standard tables, due to
differences in data encoding and additional indexes. The larger data size and indexes are factored into
the storage volume.

For more information about cost and billing for interactive warehouses and interactive tables, see the
[Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Limitations of interactive warehouses and interactive tables

The following limitations apply to interactive warehouses and interactive tables. Some limitations
are due to architectural differences between interactive tables and standard Snowflake tables;
those limitations are intended to be permanent.

### Limitations of interactive warehouses

* Snowflake interactive warehouses is optimized for short-running queries. The query timeout for SELECT
  commands defaults to five seconds. After five seconds, the query is canceled. You can reduce the
  query timeout value, but you can’t increase it. This is by design to prevent long-running queries from starving the interactive warehouse of resources and degrading the performance of low latency queries. Certain kinds of commands, such as SHOW and INSERT OVERWRITE, aren’t subject to the five-second timeout interval.
* If a query consistently times out, that’s a signal that it might not be suitable for use with
  interactive warehouses. Commonly, applying some of the performance tuning techniques can help reduce query latency. See Interactive table performance considerations for more details.
* Interactive warehouses support auto-suspend and auto-resume with a minimum auto-suspend interval
  of 24 hours (86400 seconds). Expect significant query latency when you resume an interactive
  warehouse, because the data cache needs to warm up again. For more information, see
  Resuming and suspending an interactive warehouse.
* You can’t query standard Snowflake tables from an interactive warehouse. To query both standard
  tables and interactive tables in the same session, run [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md)
  to switch to the appropriate warehouse type.
* You can add a maximum of 10 interactive tables to an interactive warehouse. This is a temporary limitation to prevent overloading of the system. This limit will be increased in the future. In the f you need to add more than 10 interactive tables, please contact Snowflake Support.
* You can’t run [CALL commands](../sql-reference/sql/call.md) to call stored procedures in an interactive warehouse.
* You can’t use the `->>` [pipe operator](../sql-reference/operators-flow.md). That operator uses stored procedures behind the scenes.

### Limitations of interactive tables

* Interactive tables don’t support the following features:

  + [Data manipulation language (DML) commands](../sql-reference/sql-dml.md) such as UPDATE and DELETE.
    The recommended workflow is to use auto-refresh interactive tables (i.e. by setting TARGET_LAG) and apply DML to the source table instead. The auto-refresh mechanism is more efficient and cost-effective than using DML on the interactive table. The only DML that you can perform is INSERT OVERWRITE.
  + Fail-safe. This data recovery mechanism isn’t available for interactive tables. However, you can still
    use Time Travel with interactive tables.
  + [Row timestamps](data-engineering/row-timestamps.md). You can’t enable row timestamps on an interactive table. This is a temporary limitation.
  + [Query insights](query-insights.md). They currently aren’t collected or available for queries executing on
    interactive tables to help reduce query execution latency.
* You can’t perform the following operations:

  + Use an interactive table as the source for a standard (non-interactive) materialized view. To create a materialized view
    on an interactive table, use the INTERACTIVE keyword. See Materialized view support for interactive tables.
  + Modify properties of an interactive table by using
    [ALTER TABLE](../sql-reference/sql/alter-table.md) clauses such as ADD COLUMN or REMOVE COLUMN.
    ALTER TABLE operations that you **can** perform include:

    - Renaming the table.
    - Modifying columns to set or unset comments.
    - Setting or unsetting masking policies on columns.
    - Adding or unsetting a [masking policy](security-column-ddm-use.md),
      [join policy](join-policies.md), [aggregation policy](aggregation-policies.md),
      or [row access policy](security-row-intro.md) on the table.
    - Adding a [storage lifecycle policy](storage-management/storage-lifecycle-policies.md)
      to the table, or dropping a storage lifecycle policy from the table.
  + Use [streams](streams-intro.md) with an interactive table.
  + Create a [dynamic table](dynamic-tables-about.md) with an interactive table as a base table.
  + Use the [RESAMPLE clause](../sql-reference/constructs/resample.md) for queries on an interactive table.
  + Set the Time Travel retention period using CREATE INTERACTIVE TABLE or ALTER TABLE.
    Interactive tables inherit the DATA_RETENTION_TIME_IN_DAYS value from their parent
    schema, database, or account.

## Affected SQL statements

This feature introduces changes to the following Snowflake SQL commands:

* [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md): new ADD TABLES and DROP TABLES clauses.
* [CREATE INTERACTIVE TABLE](../sql-reference/sql/create-interactive-table.md): creates interactive tables with required CLUSTER BY clause.
* [CREATE INTERACTIVE WAREHOUSE](../sql-reference/sql/create-interactive-warehouse.md): creates interactive warehouses with an
  optional TABLES clause.
* [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md): new optional INTERACTIVE keyword for creating
  materialized views on interactive tables.

---
title: Snowflake key concepts and architecture
source: https://docs.snowflake.com/en/user-guide/intro-key-concepts.md
section: User Guide
---

# Snowflake key concepts and architecture

Snowflake is powered by an advanced data platform that is provided to you as a self-managed
service. Snowflake’s data platform brings together data storage, processing, and analytic solutions
that are faster, easier to use, and far more flexible than traditional offerings.

Snowflake combines a completely new SQL query engine with an innovative architecture that is
natively designed for the cloud. It offers full enterprise analytic database functionality,
and unique features and capabilities.

## Data platform as a self-managed service

As a *self-managed service*, Snowflake has the following advantages:

* There is no hardware (virtual or physical) for you to select, install, configure, or manage.
* There is virtually no software for you to install, configure, or manage.
* Ongoing maintenance, management, upgrades, and tuning are handled by Snowflake.

Snowflake uses public cloud infrastructure to host virtual compute instances and persistent data storage.
Snowflake manages software updates and infrastructure so you don’t have to. You can’t install and run Snowflake
locally or on private cloud infrastructures, whether on-premises or hosted.

## Snowflake architecture

Snowflake’s architecture is a hybrid of traditional shared-disk and shared-nothing database architectures.
Similar to shared-disk architectures, Snowflake uses a central data repository for persisted data that is
accessible from all compute nodes in the platform. But similar to shared-nothing architectures, Snowflake
processes queries using massively parallel processing (MPP) compute clusters, where each node in the cluster
stores a portion of the entire data set locally. This hybrid architecture, which is shown in the following diagram,
offers the data management simplicity of a shared-disk architecture, but with the performance and scale-out benefits
of a shared-nothing architecture:

Snowflake’s unique architecture has the following key layers:

* Database storage
* Compute
* Cloud services

### Database storage

Snowflake supports the following kinds of data:

* *Structured data* — such as rows and columns in a table — follows a strict tabular schema.
* *Semi-structured data* — such as a JSON file or an XML file — has a flexible schema.
* *Unstructured data* — such as a document, image, or audio file — has no inherent schema.

Snowflake supports several types of tables for data storage, including the following table types:

* Snowflake tables
* Apache Iceberg™ tables
* Hybrid tables

#### Snowflake tables

When data is loaded into a Snowflake table, Snowflake reorganizes that data into its internally optimized,
compressed, columnar format. Snowflake stores this optimized data in cloud storage. Snowflake tables
are ideal for data warehouses.

Snowflake manages all aspects of how this data is stored — including the organization, file size,
structure, compression, metadata, and statistics. All data in Snowflake tables is automatically divided
into *micro-partitions*, which are contiguous units of storage. Micro-partitions improve efficiency and
provide other benefits.

You can use Snowflake tables to store structured and semi-structured data. You can also use the
[FILE data type](../sql-reference/data-types-unstructured.md) for unstructured data.

For more information about Snowflake tables, see [Understanding Snowflake Table Structures](tables-micro-partitions.md).

#### Apache Iceberg™ tables

Apache Iceberg™ tables for Snowflake combine the performance and query semantics of typical
Snowflake tables with external cloud storage that you manage. They are
ideal for existing data lakes and data lakehouses that you can’t, or choose not to, store in Snowflake.

Iceberg tables store their data and metadata files in an external cloud storage location; for example,
Amazon S3, Google Cloud Storage, or Microsoft Azure Storage. The external storage isn’t part of Snowflake.

You can use Iceberg tables to store structured and semi-structured data.

For more information, see [Apache Iceberg™ tables](tables-iceberg.md).

#### Hybrid tables

Hybrid tables are optimized for low latency and high throughput by using index-based random reads and writes.
Hybrid tables support row locking and enforce unique and referential integrity constraints, which are
critical for transactional workloads. You can use a hybrid table along with other Snowflake
tables and features for [Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/)
that bring transactional and analytical data together in a single platform.

You can use hybrid tables to store structured and semi-structured data.

For more information, see [Hybrid tables](tables-hybrid.md).

### Compute

A *virtual warehouse* is a cluster of compute resources in Snowflake. Virtual warehouses process
SQL statements and, using [Snowpark](../developer-guide/snowpark/index.md), run code in languages,
such as Java, Python, and Scala. With
[Snowpark Connect for Spark](../developer-guide/snowpark-connect/snowpark-connect-overview.md), you
can also run Apache Spark™ workloads on virtual warehouses.

Each virtual warehouse is an independent compute cluster that doesn’t share compute resources with other
virtual warehouses. As a result, each virtual warehouse has no effect on the performance of other virtual
warehouses.

For more information, see [Virtual warehouses](warehouses.md).

### Cloud services

The cloud services layer is a collection of services that coordinate activities across Snowflake. These services
tie together all of the different components of Snowflake in order to process user requests, from sign-in to query
dispatch. The cloud services layer also runs on compute instances that are provisioned by Snowflake from the cloud
provider.

Services managed in this layer include the following:

* [Security, authentication, and access control](../guides-overview-secure.md)
* [Snowflake Horizon Catalog](snowflake-horizon.md)
* [Infrastructure management with cloud platforms](intro-cloud-platforms.md)
* Metadata management, including the [SNOWFLAKE database](../sql-reference/snowflake-db.md) and the [Snowflake Information Schema](../sql-reference/info-schema.md)
* [Query parsing and optimization](../guides-overview-performance.md)
* [Regulatory compliance](intro-compliance.md)

## Integrated features for your workloads

Instead of moving data to different systems so that different teams can complete specific operations and tasks,
you can bring all of your workloads directly to their data with an integrated set of features.

These features support the following broad areas of data integration and development:

* Data engineering
* Analytics
* AI and ML
* Applications and collaboration

### Data engineering

Snowflake separates storage and compute, which simplifies some traditional challenges of data engineering,
such as infrastructure management and performance tuning. Data engineers can focus on implementing pipelines that
ingest, transform, and deliver data.

Snowflake provides several ways to ingest data, including the following options:

* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command — Loads data from files to a table.
* [Snowpipe](data-load-snowpipe-intro.md) — Loads data from files as soon as they are available in a stage.
* [Snowpipe Streaming](snowpipe-streaming/data-load-snowpipe-streaming-overview.md) — Loads row-level data continuously and with
  low latency, using the Snowflake SDKs or a REST API, directly into Snowflake tables and Snowflake-managed Iceberg tables,
  instead of loading data from files.
* [Openflow connectors](data-integration/openflow/connectors/about-openflow-connectors.md) — Ingest data from specific sources
  by using connectors built on Apache NiFi, such as Microsoft Sharepoint and Google Drive.
* [Snowflake Connectors](https://other-docs.snowflake.com/connectors.html) — Connect from external applications and systems and stream data into Snowflake.

Snowflake also provides several ways to transform data, including the following options:

* [Dynamic tables](dynamic-tables-about.md) — Define tables that automatically refresh based on target freshness and a query
  that performs data transformations.
* [Streams and tasks](data-pipelines-intro.md) — Capture changes made to base objects with streams and
  define tasks to perform data transformations.
* [Snowpark](../developer-guide/snowpark/index.md) — Perform more complex transformations by using programming languages,
  such as Python, Java, and Scala.
* [dbt](data-engineering/dbt-projects-on-snowflake.md) — Use an open-source data transformation
  tool and framework to define, test, and deploy SQL transformations.

In addition, [SnowConvert AI](../migrations/snowconvert-docs/overview.md) can ingest and transform data, and
[Snowpark Migration Accelerator](../migrations/sma-docs/general/introduction.md) can convert code from various platforms
to Snowflake.

For more information, see [Overview of data loading](data-load-overview.md).

### Analytics

With Snowflake, you can scale workloads dynamically based on demand, access different types of data — including structured,
semi-structured, and unstructured — and share data easily. These features let you analyze data stored in Snowflake
to extract meaningful insights, patterns, and trends for analytical use cases, such as business intelligence or predictive
modeling.

Snowflake provides several ways to analyze data, including the following options:

* System functions and SQL constructs — Perform calculations and statistical analysis with the following Snowflake system
  functions and SQL constructs:

  + [Aggregate functions](../sql-reference/functions-aggregation.md) — Summarize data by performing calculations on a set of related rows
    and returning a single value.
  + [Window functions](../sql-reference/functions-window.md) — Perform calculations on a set of related rows in partitions for
    rolling operations on subsets of the rows in each partition, such as calculating running totals or moving averages.
  + [Common table expressions (CTEs)](queries-cte.md) — Improve the readability and reusability of
    complex queries, which might perform multiple steps of data transformation.
* [Cortex AI Functions](snowflake-cortex/aisql.md) — Run unstructured analytics on text and images
  with large language models (LLMs) from OpenAI, Anthropic, Meta, Mistral AI, and DeepSeek.
* [Semantic views](views-semantic/overview.md) — Store semantic business concepts directly in the database to
  define business metrics and model business entities and their relationships.

### AI and ML

Snowflake simplifies the use of artificial intelligence (AI) and machine learning (ML) capabilities so you can
perform AI and ML feature engineering, training, and inference with your Snowflake data. Models can access your most
up-to-date data in a secure environment. With Snowflake, you can avoid the cost and complexity of moving your data to a
separate platform for AI and ML tasks.

Snowflake offers AI and ML capabilities in two broad suites of features:

* Snowflake Cortex — AI features that use LLMs to understand unstructured data, answer freeform
  questions, and provide intelligent assistance. [Cortex AI functions](snowflake-cortex/aisql.md) can automate routine tasks, such
  as simple summaries and quick translations.
* Snowflake ML — Features that you can use to build your own models. [ML functions](../guides-overview-ml-functions.md)
  give you automated predictions and insights into your data by using ML.
  [Snowflake ML](../developer-guide/snowflake-ml/overview.md) is a unified environment for ML development.

For more information, see [Snowflake AI and ML](../guides-overview-ai-features.md).

### Applications and collaboration

Snowflake offers many ways to build applications and share them with your teams, partners, and customers. When
you use Snowflake to share data, you control access to the data, and avoid the challenges of keeping it synchronized
in different places.

The following list shows some of the tools and services you can use to build, deploy, and manage applications in Snowflake:

* [Streamlit](../developer-guide/streamlit/about-streamlit.md) — Use an open-source Python library to create and
  share custom web apps with an interactive user interface (UI) for ML and data science.
* [Snowpark Container Services](../developer-guide/snowpark-container-services/overview.md) — Deploy, manage, and scale
  containerized applications from directly inside Snowflake.
* [Snowflake Native App Framework](../developer-guide/native-apps/native-apps-about.md) — Build applications that
  expand the capabilities of other Snowflake features by sharing data and related business logic with other Snowflake
  accounts. The business logic of an application might include a Streamlit app, stored procedures, and functions
  written by using Snowpark API, JavaScript, and SQL. A Snowflake Native App can also run container workloads with
  Snowpark Container Services.

Snowflake includes support for the following kinds of collaboration:

* [Secure Data Sharing](data-sharing-intro.md) — Share selected objects in a database in your
  account with other Snowflake accounts.
* [Listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about) — Provide data and
  other information to other Snowflake users, or access data and other information shared by Snowflake providers.
  You can explore, access, and provide listings to consumers privately and on the [Snowflake Marketplace](https://other-docs.snowflake.com/collaboration/collaboration-marketplace-about).
* [Data Clean Rooms](cleanrooms/overview.md) — Define what analyses can be run against the shared data,
  which allows the consumer to gather insights from the data without having unrestricted access to it.

## Snowgrid

Snowgrid is Snowflake’s cross-region, cross-cloud technology layer. With Snowgrid, you can achieve the following
goals:

* Connect a data ecosystem across different cloud regions and providers — such as, Amazon Web Services (AWS),
  Microsoft Azure, and Google Cloud — by using
  [listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about) and other
  [collaboration features](../guides-overview-sharing.md).
* Apply consistent
  [security and governance policies across clouds and regions](secure-data-sharing-across-regions-platforms.md).
* Enable disaster recovery and business continuity capabilities across regions by using
  [replication](replication-intro.md).

For more information, see [Snowgrid](https://www.snowflake.com/en/product/features/cross-cloud-snowgrid/).

## Connecting to Snowflake

Snowflake supports multiple ways for you to connect to the service:

* [Snowsight](ui-snowsight-quick-tour.md), a web-based UI that you can use to access all aspects of managing
  and using Snowflake can be accessed.
* Command-line clients that you can also use to access all aspects of managing and using Snowflake; for example,
  [Snowflake CLI](../developer-guide/snowflake-cli/index.md).
* Native APIs that you can use to create and manage Snowflake resources programmatically; for example,
  [Snowflake Python APIs](../developer-guide/snowflake-python-api/snowflake-python-overview.md) and
  [Snowflake REST APIs](../developer-guide/snowflake-rest-api/snowflake-rest-api.md).
* [Drivers](../developer-guide/drivers.md) that other applications can use to connect to Snowflake; for example, JDBC
  and ODBC.
* Native [connectors](connectors.md) that you can use to develop applications for
  connecting to Snowflake; for example, Apache Kafka and Apache Spark.
* [Third-party technologies](ecosystem-all.md) that you can use to connect applications to Snowflake;
  for example, extract, transform, load (ETL) tools such as Informatica, and business intelligence (BI) tools such as
  ThoughtSpot.

For more information, see [Sign in to Snowflake](connecting.md).

---
title: Snowflake Marketplace and Listings
source: https://docs.snowflake.com/en/user-guide/data-marketplace.md
section: User Guide
---

# Snowflake Marketplace and Listings

You can provide and consume listings offered privately or publicly using the Snowflake Marketplace,
discovering and accessing a variety of third-party datasets. For more details about the Snowflake Marketplace, refer to
[About Snowflake Marketplace](../collaboration/collaboration-marketplace-about.md).

Listings and the Snowflake Marketplace use Secure Data Sharing to connect providers of data with consumers across clouds and regions.

When you use listings and the Snowflake Marketplace, you can do the following:

* Access shared datasets directly in your Snowflake account without needing to transform the data.
* Join the datasets with your own data.
* View usage data for listings that you provide.
* Automatically fulfill data to other regions for listings that you provide.

For more details about listings, refer to [About listings](../collaboration/collaboration-listings-about.md).

For more details about data sharing at Snowflake, refer to [Data sharing and collaboration in Snowflake](../guides-overview-sharing.md).

---
title: Snowflake OAuth overview
source: https://docs.snowflake.com/en/user-guide/oauth-snowflake-overview.md
section: User Guide
---

# Snowflake OAuth overview

Snowflake OAuth uses Snowflake’s built-in OAuth service to provide OAuth-based authentication.

This topic describes Snowflake OAuth and how to use Snowflake as an OAuth resource and authorization server for accessing Snowflake data
securely.

Snowflake OAuth uses Snowflake’s built-in OAuth service and supports the following applications:

* [Tableau Desktop, Tableau Cloud](oauth-partner.md)
* [Looker](oauth-partner.md)
* [Alation](oauth-partner.md)
* [ThoughtSpot](oauth-partner.md)
* [Custom clients configured by your organization](oauth-custom.md)

## Snowflake OAuth authorization flow

The OAuth authorization flow is as follows:

1. In the client, the user attempts to connect to Snowflake using OAuth.

   The application sends an authorization request to the Snowflake authorization server, which in turn displays an authorization screen
   that asks the user to authorize access.
2. The user submits the Snowflake login name and password, and is in turn presented with a consent screen to allow the client access to
   Snowflake using a specific role in a user session (e.g. SYSADMIN or CUSTOM_ROLE1).

   The user submits consent to use the specific role in a session.

   The Snowflake authorization server sends an authorization code back to the client.
3. The client sends the authorization code back to the Snowflake authorization server to request an access token and, optionally, a refresh
   token that allows the client to obtain new access tokens.

   The Snowflake authorization server accepts the authorization code and provides the client with an access token specific to the user
   resources in the Snowflake resource server. Based on the settings in the authorization request, the authorization server issues a
   refresh token to obtain new access tokens tied to the specific resource.
4. The client sends the access token to the Snowflake resource server.

   The resource server recognizes the valid access token and creates a user session with the authorized role. The client now has access to
   the Snowflake resources limited by the role specified by the access token.

   By default, Snowflake prevents the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles from authenticating. To allow these
   privileged roles to authenticate, use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the [OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../sql-reference/parameters.md) account parameter to `FALSE`.

Access tokens have a short life; typically 10 minutes. When the access token expires, the client can send a refresh token to obtain new
access tokens. A refresh token is sent to the Snowflake authorization server to request a new access token each time the current access
token expires (Steps 3-6). If the integration is configured to prevent sending refresh tokens, the user must repeat the above steps to
re-authorize the client. If you want to limit the risk posed by long-lived tokens in your Snowflake OAuth flow, you can use single-use
refresh tokens.

## Single-use refresh tokens

You can use single-use refresh tokens to mitigate theft or reuse of refresh tokens. For more information, see [Single-use refresh tokens for Snowflake OAuth security integrations](single-use-refresh-tokens.md)

## Local applications

Snowflake provides a simplified way to set up local applications — that is, desktop applications — to use Snowflake OAuth to
authenticate. The application can authenticate by setting a single connection option; no additional set up is required. For more information,
see [Using Snowflake OAuth for local applications](oauth-local-applications.md).

## Partner applications

To configure support, refer to [Configure Snowflake OAuth for partner applications](oauth-partner.md).

To learn about using OAuth without traversing the public Internet, refer to Partner applications.

## Custom clients

Snowflake supports custom clients configured by your organization. To configure support, refer to [Configure Snowflake OAuth for custom clients](oauth-custom.md).

## Restricting network traffic for Snowflake OAuth

You can associate a [network policy](network-policies.md) with the Snowflake OAuth security integration to restrict network
traffic when the client requests a token from Snowflake as the authorization server. This network policy also governs network traffic when
the client queries Snowflake as the resource server.

To associate a network policy with the Snowflake OAuth security integration, set the NETWORK_POLICY parameter when creating or updating the integration. For example:

```sqlexample
CREATE SECURITY INTEGRATION td_oauth_int2
  TYPE = oauth
  ENABLED = true
  OAUTH_CLIENT = tableau_desktop
  OAUTH_REFRESH_TOKEN_VALIDITY = 36000
  BLOCKED_ROLES_LIST = ('SYSADMIN');
  NETWORK_POLICY = 'allow_private_ip_only';
```

A network policy associated with the Snowflake OAuth security integration does not affect network traffic between the user and Snowflake as
the authorization server. When the user authenticates by using a browser, the network traffic is restricted by a network policy associated
with the user.

The following diagram shows which network policy governs network traffic from the client and user.

1. Network policy associated with the user governs. If no user-level network policy exists, the account-level policy governs.
2. Network policy associated with the security integration governs. If no integration-level network policy exists, the account-level
   policy governs.

## Error codes

Refer to the table below for descriptions of error codes associated with Snowflake OAuth:

| Error Code | Error | Description |
| --- | --- | --- |
| 390302 | OAUTH_CONSENT_INVALID | Issue generating or validating consent for a given user. |
| 390303 | OAUTH_ACCESS_TOKEN_INVALID | Access token provided used when attempting to create a Snowflake session is expired or invalid. |
| 390304 | OAUTH_AUTHORIZE_INVALID_RESPONSE_TYPE | Invalid `response_type` was provided as a parameter to the authorization endpoint (it should most likely be `code`). |
| 390305 | OAUTH_AUTHORIZE_INVALID_STATE_LENGTH | State parameter provided as a parameter to the authorization endpoint exceeds 2048 characters. |
| 390306 | OAUTH_AUTHORIZE_INVALID_CLIENT_ID | Integration associated with a provided client id does not exist. |
| 390307 | OAUTH_AUTHORIZE_INVALID_REDIRECT_URI | `redirect_uri` given as a parameter to the authorization endpoint does not match the `redirect_uri` of the integration associated with the provided `client_id` or the `redirect_uri` is not properly formatted. |
| 390308 | OAUTH_AUTHORIZE_INVALID_SCOPE | Either the scope requested is not a valid scope, or the scopes requested cannot fully be granted to the user. |
| 390309 | OAUTH_USERNAMES_MISMATCH | The user you were trying to authenticate as differs from the user tied to the access token. |
| 390311 | OAUTH_AUTHORIZE_INVALID_CODE_CHALLENGE_PARAMS | Either the code challenge or code challenge method is missing, invalid, or not supported. |

Additionally, the following errors are taken from the RFC and are returned in the JSON blob generated during an unsuccessful token request
or exchange:

| Error | Description |
| --- | --- |
| invalid_client | There was a failure relating to client authentication, such as the client being unknown, a client secret mismatch, etc. |
| invalid_grant | The provided authorization grant or refresh token is invalid, expired, revoked, does not match the redirection URI used in the authorization request, or was issued to another client. |
| unsupported_grant_type | A grant type was provided that Snowflake currently does not support (“refresh_token” and “authorization_code” are the only two supported grant types at the moment). |
| invalid_request | The request was malformed or could not be processed. |

---
title: Snowflake Open Catalog overview
source: https://docs.snowflake.com/en/user-guide/opencatalog/overview.md
section: User Guide
---

# Snowflake Open Catalog overview

Snowflake Open Catalog is a catalog implementation for Apache Iceberg™ tables and is built on the open source Apache Iceberg™ REST protocol. Snowflake Open Catalog is a managed service for [Apache Polaris™ (incubating)](https://github.com/apache/polaris).

With Open Catalog, you can provide centralized, secure read and write access to your Iceberg tables across different REST-compatible query engines.

Open Catalog is currently offered as a service hosted in Snowflake-managed infrastructure.

## Signing up

Open Catalog offers the following signup options:

* **For existing Snowflake customers:** Sign in to an existing Snowflake account as an organization administrator and create a new Open
  Catalog account in your Snowflake organization. Users with the ORGADMIN role in Snowflake can manage the Open Catalog account from
  Snowflake. For instructions, see [Create a Snowflake Open Catalog account](create-open-catalog-account.md).
* **If you are not an existing Snowflake customer:** You can try Snowflake Open Catalog for free for 30 days by signing up for a
  Snowflake trial account. For instructions, see
  [Try Snowflake Open Catalog for free](try-open-catalog-for-free.md).

## Key concepts

This section introduces key concepts associated with using Open Catalog hosted in Snowflake.

In the following diagram, a sample Open Catalog structure with nested namespaces is shown for Catalog1. No tables
or namespaces have been created yet for Catalog2 or Catalog3.

### Catalog

In Open Catalog, you can create one or more catalog resources to organize Iceberg tables.

Configure your catalog by setting values in the storage configuration for Amazon S3, Azure, or Google Cloud Storage. An Iceberg catalog enables a
query engine to manage and organize tables. The catalog forms the first architectural layer in the [Apache Iceberg™ table specification](https://iceberg.apache.org/spec/#overview) and must support the following tasks:

* Storing the current metadata pointer for one or more Iceberg tables. A metadata pointer maps a table name to the location of that table’s
  current metadata file.
* Performing atomic operations so that you can update the current metadata pointer for a table to the metadata pointer of a new version of
  the table.

To learn more about Iceberg catalogs, see the [Apache Iceberg™ documentation](https://iceberg.apache.org/terms/#catalog).

#### Catalog types

A catalog can be one of the following two types:

* Internal: The catalog is managed by Open Catalog. A third-party query engine can read and write to tables from this catalog. In addition,
  Snowflake can also read and write to tables from this catalog.
* External: The catalog is externally managed by another Iceberg catalog provider (for example, Snowflake, Glue, Dremio Arctic). Tables from
  this catalog are synced to Open Catalog. These tables are read-only in Open Catalog. In the current release, only a Snowflake external
  catalog is provided.

A catalog is configured with a storage configuration that can point to Amazon S3, Azure Storage, or Cloud Storage from Google.

To create a new catalog, see [Create a catalog](create-catalog.md).

### Namespace

You create *namespaces* to logically group Iceberg tables within a catalog. A catalog can have multiple namespaces. You can also create
nested namespaces. Iceberg tables belong to namespaces.

### Apache Iceberg™ tables and catalogs

In an internal catalog, an Iceberg table is registered in Open Catalog, but read and written via query engines. The table data and
metadata is stored in your external cloud storage. The table uses Open Catalog as the Iceberg catalog.

> **Important:**
>
> If you drop a table in Snowflake Open Catalog without purging it, don’t create a new table with the same name and location as the dropped
> table. If you do, a user could gain access to the original table’s data when they shouldn’t have permission to access it. For example, if
> you drop but don’t purge `Table1` where its storage directory location is `/MyCatalog/Schema1/Table1`, don’t create a new `Table1` within
> the same `Table1` storage directory. When you drop a table without purging it, its data is retained in the external cloud storage.

If you have tables that use Snowflake as the Iceberg catalog (Snowflake-managed tables), you can sync these tables to an external
catalog in Open Catalog. If you sync this catalog to Open Catalog, it appears as an external catalog in Open Catalog. The table data and
metadata is stored in your external cloud storage. The Snowflake query engine can read from or write to these tables. However, the other query
engines can only read from these tables.

> **Important:**
>
> To ensure that the access privileges defined for a catalog are enforced correctly, the following conditions must be met:
>
> * A directory only contains the data files that belong to a single table.
> * A directory hierarchy matches the namespace hierarchy for the catalog.
>
> For example, if a catalog includes the following items:
>
> * Top-level namespace `namespace1`
> * Nested namespace `namespace1a`
> * A `customers` table grouped under nested namespace `namespace1a`
> * An `orders` table grouped under nested namespace `namespace1a`
>
> The directory hierarchy for the catalog must be:
>
> * `/namespace1/namespace1a/customers/<files for the customers table *only*>`
> * `/namespace1/namespace1a/orders/<files for the orders table *only*>`
>
> These conditions apply to both internal and external catalogs, including external catalogs that contain
> [Snowflake-managed Apache Iceberg™ tables](https://docs.snowflake.com/en/user-guide/tables-iceberg). When you create a table in an
> internal catalog, Open Catalog prohibits you from creating the table within the directory or subdirectory for an existing table. When you
> create Snowflake-managed Iceberg tables in an external catalog, Open Catalog doesn’t prohibit overlapping directory locations. Therefore,
> when you create these tables, use the BASE_LOCATION parameter to specify a unique parent directory for each table. For more information, see
> [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-snowflake).
>
> For more information about internal and external catalogs, see Catalog types.

### Service principal

A service principal is an entity that you create in Open Catalog. Each service principal encapsulates credentials that you use to connect
to Open Catalog.

Query engines use service principals to connect to catalogs.

Open Catalog generates a Client ID and Client Secret pair for each service principal.

The following table displays example service principals that you might create in Open Catalog:

| Service connection name | Purpose |
| --- | --- |
| Flink ingestion | For Apache Flink® to ingest streaming data into Apache Iceberg™ tables. |
| Spark ETL pipeline | For Apache Spark™ to run ETL pipeline jobs on Iceberg tables. |
| Snowflake data pipelines | For Snowflake to run data pipelines for transforming data in Apache Iceberg™ tables. |
| Trino BI dashboard | For Trino to run BI queries for powering a dashboard. |
| Snowflake AI team | For Snowflake to run AI jobs on data in Apache Iceberg™ tables. |

### Service connection

A service connection represents a REST-compatible engine (such as Apache Spark™, Apache Flink®, or Trino) that can read from and write to Open
Catalog. When creating a new service connection, the Open Catalog administrator grants the service principal that is created with the new service
connection either a new or existing principal role. A principal role is a resource in Open Catalog that you can use to logically group Open
Catalog service principals together and grant privileges on securable objects. For more information, see [Principal role](access-control.md). Open Catalog uses a role-based access control (RBAC) model to grant service principals access to resources. For more information,
see [Access control](access-control.md). For a diagram of this model, see [RBAC model](access-control.md).

If the Open Catalog administrator grants the service principal for the new service connection a new principal role, the service principal
doesn’t have any privileges granted to it yet. When securing the catalog that the new service connection will connect to, the Open
Catalog administrator grants privileges to catalog roles and then grants these catalog roles to the new principal role. As a result, the service
principal for the new service connection has these privileges. For more information about catalog roles, see [Catalog role](access-control.md).

If the Open Catalog administrator grants an existing principal role to the service principal for the new service connection, the service principal
is bestowed with the privileges granted to the catalog roles that are granted to the existing principal role. If needed, the Open Catalog
administrator can grant additional catalog roles to the existing principal role or remove catalog roles from it to adjust the privileges
bestowed to the service principal. For an example of how RBAC works in Open Catalog, see [RBAC example](access-control.md).

### Storage configuration

A storage configuration stores a generated identity and access management (IAM) entity for your external cloud storage and is created
when you create a catalog. The storage configuration is used to set the values to connect Open Catalog to your cloud storage. During the
catalog creation process, an IAM entity is generated and used to create a trust relationship between the cloud storage provider and Open
Catalog.

When you create a catalog, you supply the following information about your external cloud storage:

| Cloud storage provider | Information |
| --- | --- |
| Amazon S3 | * Default base location for your Amazon S3 bucket * Locations for your Amazon S3 bucket * S3 role ARN * External ID (optional) |
| Cloud Storage from Google | * Default base location for your Cloud Storage from Google bucket * Locations for your Cloud Storage from Google bucket |
| Azure | * Default base location for your Microsoft Azure container * Locations for your Microsoft Azure container * Azure tenant ID |

## Example workflow

In the following example workflow, Bob creates an Apache Iceberg™ table named Table1 and Alice reads data from Table1.

1. Bob uses Apache Spark™ to create the Table1 table under the Namespace1 namespace in the Catalog1 catalog and insert values into Table1.

   Bob can create Table1 and insert data into it because he is using a service connection with a service principal that has
   the privileges to perform these actions.
2. Alice uses Snowflake to read data from Table1.

   Alice can read data from Table1 because she is using a service connection with a service principal with a catalog integration that
   has the privileges to perform this action. Alice creates an externally managed table in Snowflake to read data from Table1.

## Security and access control

This section describes security and access control.

### Credential vending

Credential vending simplifies access control in Open Catalog by centralizing access management for the following items:

* The metadata within Open Catalog
* The storage location to your Apache Iceberg tables

When credential vending is enabled for a catalog, Open Catalog provides the query engine executing the query with a temporary storage
credential. This credential allows the query engine to access an Iceberg table’s underlying directory location. If you enable credential
vending, you don’t have to manage storage access separately, outside of Open Catalog.

#### Credential vending for external catalogs

You have the option to enable credential vending for each external catalog. If you don’t enable credential vending for a catalog, you must
provide your own storage credential separately to the query engine, outside of Open Catalog.

Before you enable credential vending for an external catalog, be aware that Open Catalog doesn’t prevent Iceberg tables in the catalog from
having overlapping storage directory locations. When tables have overlapping storage directory locations, a user could gain access to tables
that they shouldn’t have permission to access. Before you enable credential vending for an external catalog, ensure your tables in the
catalog don’t have overlapping storage directory locations.

For example, consider the following directory locations:

* Storage directory location for Table1 is `/MyCatalog/Schema1/Table1`.
* Storage directory location for Table2 is `/MyCatalog/Schema1/Table1/Table2`.

Users with vended credentials to Table1 will also have access to the storage location for Table2.

Here’s an example of how to resolve the overlapping storage directory locations:

* Storage directory location for Table1 is `/MyCatalog/Schema1/Table1`.
* Storage directory location for Table2 is `/MyCatalog/Schema1/Table2`.

To enable credential vending for an external catalog, see [Enable credential vending for an external catalog](enable-credential-vending-external-catalog.md).

#### Credential vending for internal catalogs

Credential vending for internal catalogs is enabled by default when you create the catalog. You don’t need to enable it. When you create a
table in an internal catalog, Open Catalog prohibits you from creating the table within the directory or subdirectory for an existing table.

### Identity and access management (IAM)

Open Catalog uses the identity and access management (IAM) entity to securely connect to your storage for accessing table data, Iceberg
metadata, and manifest files that store the table schema, partitions, and other metadata. Open Catalog retains the IAM entity for your
storage location.

### Access control

Open Catalog enforces the access control that you configure across all tables registered with the service and governs security for all
queries from query engines in a consistent manner.

Open Catalog uses a role-based access control (RBAC) model that lets you centrally configure access for Open Catalog service principals
to catalogs, namespaces, and tables.

Open Catalog RBAC uses two different role types to delegate privileges:

* **Principal roles:** Granted to Open Catalog service principals and
  analogous to roles in other access control systems that you grant to service principals.
* **Catalog roles:** Configured with certain privileges on Open Catalog resources and granted to principal roles.

For more information, see [Access control](access-control.md).

## Billing

Open Catalog is currently free to use with general availability. Billing will begin in the first half of 2026.

When billing begins, Snowflake bills your account for the requests to the REST APIs supported by the Open Catalog service. For more
information, see the Serverless Feature Table in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

Snowflake does not bill your account for Iceberg tables stored outside of Snowflake. Your cloud storage provider bills you directly for data storage usage.

## Legal notices

Apache®, Apache Iceberg™, Apache Spark™, Apache Flink®, and Flink® are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.

---
title: Snowflake Open Catalog release notes
source: https://docs.snowflake.com/en/user-guide/opencatalog/release-notes.md
section: User Guide
---

# Snowflake Open Catalog release notes

Overview of the most recent releases of Snowflake Open Catalog.

## September 29, 2025

### Support for External OAuth and key pair authentication

You can now connect to Snowflake Open Catalog with External OAuth or key pair authentication. For more
information, see:

* [Overview of External OAuth in Open Catalog](external-oauth-overview.md)
* [Overview of key pair authentication in Open Catalog](key-pair-auth-overview.md)

## May 27, 2025

### Access the Open Catalog UI with private connectivity

You can now access the Open Catalog UI through private connectivity instead of over the public internet. For more information, see:

* [Configure private connectivity for the Open Catalog UI](private-connectivity-ui-configure.md)
* [Sign in to Open Catalog using private connectivity](signin-snowflake-customer.md)

## May 5, 2025

### Support for private connectivity

Snowflake supports using private connectivity with Snowflake Open Catalog. Use private connectivity
to route your connection to Snowflake Open Catalog through private endpoints or private IP addresses instead of the public internet.
For more information, see:

* [Private connectivity for inbound network traffic in Snowflake Open Catalog](private-connectivity-inbound-configure-aws.md)
* [Private connectivity for outbound network traffic in Snowflake Open Catalog](private-connectivity-outbound.md)

## March 31, 2025

### Support for SAML-based SSO

With this release, we are pleased to announce support for using SAML-based single sign-on (SSO) with Snowflake Open Catalog.

SSO for Open Catalog lets you integrate Open Catalog with a third-party identity provider. Users can now sign in
to the Open Catalog web interface by using existing credentials managed by an identity provider (IdP). You don’t have to manage separate
usernames and passwords in Open Catalog. To learn more, see [Overview of SSO in Snowflake Open Catalog](sso-overview.md).

## November 7, 2024

### Credential vending for external catalogs is now disabled by default

With this release, credential vending for external catalogs is now disabled by default. External catalogs aren’t managed by Snowflake Open
Catalog, so Open Catalog can’t enforce certain directory structure hierarchies. For more information about directory structure hierarchies,
see [Organize catalog content](https://other-docs.snowflake.com/en/opencatalog/organize-catalog-content). Therefore, credential vending for
external catalogs is disabled by default when you create the catalog. In addition, it’s also now disabled for existing external catalogs.
However, you have the option to enable credential vending for an external catalog. For details, see
[Enable credential vending for an external catalog](enable-credential-vending-external-catalog.md).

## October 18, 2024

### Snowflake Open Catalog: General Availability

With this release, we are pleased to announce the general availability of Snowflake Open Catalog, which was previously named Polaris Catalog
and was available as a preview feature. With general availability, we’ve made the following updates:

* A service admin can now create additional users for the Open Catalog account. These users can manage the account through the Open Catalog
  web interface. For details, see [Manage users](manage-users.md).
* A catalog admin can now secure individual namespaces or tables within a catalog. You can also continue to secure a catalog at the catalog
  level. For details, see [Secure catalogs](secure-catalogs.md).
* When viewing the schema for a table in Snowflake Open Catalog, you can now view the nested schema for a column. For details, see
  [View the schema for a table](view-table-schema.md).
* We’ve added billing support for Open Catalog, but you can use Open Catalog for free until April 30, 2025. For more information, see
  [Billing](overview.md).

## August 8, 2024

With this release, we are pleased to announce the availability of the following new enhancements in Open Catalog.

### Snowflake now supports queries on tables with nested namespaces

Previously, we listed a limitation where Snowflake couldn’t read tables registered in Open Catalog that were located under a nested
namespace. Snowflake now supports querying tables located under a nested namespace. For example, if you create a nested namespace
`namespace1.namespace1a.namespace1ab`, Snowflake can read tables grouped under the namespace `namespace1ab`. For more information, see
[Create a namespace](organize-catalog-content.md).

## July 30, 2024

With this release, we are pleased to announce the initial public preview release of Open Catalog hosted on Snowflake with the following features:

### Apache Iceberg™ Rest API

Open Catalog provides an Apache Iceberg Rest Catalog API, which enables support for any query engine that supports the Apache Iceberg™ Rest catalog specification.

### Authentication

Users can create service connections that provide a Client ID and Client Secret service credentials. These credentials are used for authentication by using OAuth 2.0.

### Open Catalog user interface

Open Catalog is provided with a web application to simplify the management of the catalog. Within the UI, users can manage catalogs, service principals, and the privileges for service principals.

### Role-based security model

A role-based access control (RBAC) security model is included, so customers can manage the level of access each user or user group is allowed on the catalog. For more information, see [Access control](access-control.md).

### Credential vending

Access to the storage objects where the data resides is managed by Open Catalog. When a user requests access to a table, whether for read or write, a temporary scoped storage credential is generated and passed back to the calling engine, which provides the appropriate access permissions to the folder in which the data resides within the storage.

### Snowflake warehouse catalog integration for Open Catalog

A new catalog integration for Open Catalog is available within Snowflake. This catalog integration allows users to create externally managed Apache Iceberg™ tables that point to tables that reside in Open Catalog for querying.

## Considerations and limitations

The following considerations and limitations apply to Open Catalog, and are subject to change:

**Signup**

* Only Snowflake customers can sign up for Open Catalog.

**Catalogs**

* Open Catalog currently supports Apache Iceberg™ tables that use either:

  + Open Catalog as the Iceberg catalog
  + Snowflake as the Iceberg catalog. External Iceberg catalogs other than Snowflake aren’t currently supported. If you want to add Iceberg tables from other external catalogs, you must migrate them.
* You can’t import existing Iceberg tables from vendors such as Glue or Tabular into an internal catalog in Open Catalog, but they can be added to external catalogs.
* Snowflake can query but can’t write to tables managed by Open Catalog.
* Snowflake Iceberg tables, which are available in external catalogs, are read only in Open Catalog.
* For internal catalogs, you can’t rename a table across namespaces. For example, you can’t rename a table from `/mytables/ns1/table1` to `/mytables/ns2/table1`.
* When creating an internal or external catalog, you can’t specify a default base location or allowed location that overlaps with the directory hierarchy for a different catalog. For example, if the default base location for catalog1 is `s3://mytables/db1/schema1/table1`, you can’t specify the default base location for your new catalog as `s3://mytables/db1/`.

**Access control**

* The scoped access policy for a table is limited to the `<table_base>/metadata/` and `<table_base>/data/` directories.

**Iceberg**

* When calling the registerTable API, you can’t register a table in a location that is outside of the parent namespace directory. For example, if the folder hierarchy for a catalog is `s3://teambucket/iceberg/namespace1/namespace1a/`, you can’t create `mytbl3` with a base location `s3://teambucket/iceberg/namespace1`. You can create it, for example, with a base location `s3://teambucket/iceberg/namespace1/mytbl3`.
* If you call the `dropTable` API and request to purge the table’s data and metadata by setting the `purgeRequested` parameter to `true`, Open Catalog
  makes a best effort to delete the following items:

  + All data and metadata files associated with the table
  + The storage directory for the table

  However, some of these items might not be deleted. If so, navigate to your external cloud storage to identify and delete
  the orphaned files or storage directory yourself.

---
title: Snowflake Optima
source: https://docs.snowflake.com/en/user-guide/snowflake-optima.md
section: User Guide
---

# Snowflake Optima

Snowflake Optima extends Snowflake’s core principles of performance and simplicity by applying an
intelligent approach to workload optimization. Instead of requiring manual tuning, Snowflake Optima continuously analyzes
workload patterns and implements the most effective strategies automatically. Snowflake Optima ensures that queries run faster
and more cost-efficiently, without added configuration or maintenance. By anticipating and adapting to the evolving
nature of SQL workloads, Snowflake Optima automatically improves performance.

> **Note:**
>
> * Snowflake Optima is included in all [Snowflake editions](intro-editions.md).
> * Snowflake Optima is only available on
>   [Snowflake generation 2 standard warehouses](warehouses-gen2.md).

The following sections describe Snowflake Optima in more detail:

## Optima Indexing

*Optima Indexing* is a Snowflake Optima feature that automatically analyzes workloads to create and maintain
indexes in the background. Optima Indexing is built on top of the
[search optimization service](search-optimization-service.md).

By continuously monitoring SQL workloads, Optima Indexing identifies opportunities to improve performance — such
as repetitive point-lookup queries on a table — and automatically generates hidden indexes to accelerate those workloads.
These indexes are built and maintained on a best-effort basis, without requiring user intervention.

There are no additional costs for Optima Indexing, and because it is fully integrated into Snowflake, no additional
configuration or effort is required to benefit from improved performance.

For specialized workloads that demand guaranteed performance — for example, threat detection in the cybersecurity industry —
you can still directly apply search optimization. This option provides consistent index freshness and ultimately consistent
performance for scenarios where near real-time results are critical.

## Optima Metadata

*Optima Metadata* is a Snowflake Optima feature that automatically optimizes your workloads without any user input.
Snowflake Optima analyzes your query patterns, identifies inefficient usage of columns in pruning, and creates additional
metadata to optimize these queries. Even if you don’t know all the nuances of Snowflake’s query engine, Optima still ensures
that you prune unused micro-partitions as effectively as possible.

For example, one of the scenarios that Snowflake Optima has optimized is usage of the [UPPER](../sql-reference/functions/upper.md) and
[LOWER](../sql-reference/functions/lower.md) functions in the WHERE clause. These functions are inefficient in pruning. So, if Snowflake
Optima observes frequent use of these functions in your query filter predicates, it automatically creates metadata to aid in
pruning.

In general, the best practice is to avoid scenarios that lead to inefficient pruning. However, Snowflake Optima can improve
performance when these scenarios occur. That is, you should continue to follow all existing query performance best practices and think of Optima
Metadata as a feature that works in the background to catch optimizations you might have missed.

## Monitor Snowflake Optima use

You can monitor Snowflake Optima use on the following panes in the [Query Profile tab](ui-snowsight-activity.md)
under Query History in Snowsight:

* Query insights pane
* Statistics pane

You can also monitor Snowflake Optima use by querying the [QUERY_INSIGHTS view](../sql-reference/account-usage/query_insights.md).
For more information about query insights, see [Using query insights to improve performance](query-insights.md).

### Query insights pane

The [Query insights](query-insights.md) pane displays each type of insight detected
for a query and lists each instance of that insight type.

* To learn more about the condition that was detected, select View next to an entry in the
  Query insights pane.

If Snowflake Optima was used to optimize the given query, Snowflake Optima used appears and the details
are displayed.

The following image shows an example of the Query insights pane that indicates that Snowflake Optima was used:

### Statistics pane

To view pruning statistics for Snowflake Optima, open the
[Statistics](search-optimization/monitoring-search-optimization.md) pane on the Query Profile tab.
Look for the row labeled Partitions pruned by Snowflake Optima. This row shows the number of partitions skipped during
query execution, indicating how Snowflake Optima improved performance by reducing the amount of data scanned.

The following image shows an example of the Statistics pane that indicates that Snowflake Optima was used:

---
title: Snowflake Partner Connect
source: https://docs.snowflake.com/en/user-guide/ecosystem-partner-connect.md
section: User Guide
---

# Snowflake Partner Connect

Partner Connect lets you easily create trial accounts with selected Snowflake business partners and integrate these accounts with
Snowflake. This feature provides a convenient option for trying various 3rd-party tools and services, and then adopting the ones
that best meet your business needs.

## Supported Partners

> **Important:**
>
> Snowflake neither determines nor dictates the conditions or terms (length, supported features, etc.) for partner trial accounts; these
> policies are set by each Snowflake partner and vary according to the partner.
>
> For details about a specific trial, please contact the partner directly.

Currently, Partner Connect includes the following partners:

| Partner | Category | Notes |
| --- | --- | --- |
|  | [Security, Governance & Observability](ecosystem-security.md) | Free forever plan |
|  | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  | [Data Integration](ecosystem-etl.md) | dbt Cloud |
|  | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Data Integration](ecosystem-etl.md) | Hevo Data CDC for ETL |
|  | [Machine Learning & Data Science](ecosystem-analytics.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Security, Governance & Observability](ecosystem-security.md) |  |
|  | [Data Integration](ecosystem-etl.md) | Informatica Cloud |
|  | [Data Integration](ecosystem-etl.md) | Informatica Data Loader |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Data Integration](ecosystem-etl.md) | Matillion Data Productivity Cloud |
|  | [Business Intelligence (BI)](ecosystem-bi.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [SQL Development & Management](ecosystem-editors.md) |  |
|  | [Data Integration](ecosystem-etl.md) |  |
|  | [Business Intelligence (BI)](ecosystem-bi.md) |  |

## Security Requirements

Partner Connect is limited to account administrators (i.e. users with the ACCOUNTADMIN role) who have a verified email address in
Snowflake:

* To use Partner Connect, you must switch to the ACCOUNTADMIN role or contact someone in your organization who has the role.
* To verify your email address:

  Snowsight:
  :   In some cases, you automatically receive an email prompting you to Please Validate Your Email. If you didn’t, follow these
      steps to verify your email address:

      1. Sign in to [Snowsight](ui-snowsight-gs.md).
      2. In the lower-left corner, select your name » Settings.
      3. In My Profile, configure your email address:

         + If you don’t have an email address listed, enter an email address in the Email field, and then select Save.
         + If you can’t enter an email address, an account administrator must either add an email address on your behalf or grant your user
           the role with the OWNERSHIP privilege on your user.
         + If you didn’t receive an email, select Resend verification email. Snowflake sends a verification email to the address listed.
      4. Open your email, and then select the link in the email to validate your email address.

## Connecting with a Snowflake Partner

To initiate a trial account with any Snowflake partner currently in Partner Connect:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To switch to the account administrator role, in the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
3. In the navigation menu, select Admin » Partner Connect.
4. Click on the corresponding tile for the partner to which you wish to connect.

   A dialog displays the requirements for connecting to the partner, as well as a list of the objects automatically created in Snowflake
   during the connection process, including an empty database, warehouse, default user, and custom role. The partner application uses
   these objects when reading from or writing to your account.
5. Optionally specify one or more existing databases in Snowflake to automatically use with the trial. This creates an additional
   custom role that makes existing data in Snowflake quickly and easily available to the partner application.

   If you do not specify any databases during the initial connection process, you can specify them later; however, specifying them later
   is a manual task.

   > To use shared databases with a trial:
   >
   > * Use [Snowsight](ui-snowsight-gs.md) to complete the initial connection process.
   > * Manually specify the shared database after the process completes.
6. Click the Connect button below the partner description to initiate creating a trial account with the partner and connecting the
   partner application to Snowflake.

When the process is complete and the objects have been created, the partner tile is updated with a checkmark.

### Objects Created for the Partner

During the connection process, the following Snowflake objects for the partner application are created in your account:

| Object Name | Type | Notes |
| --- | --- | --- |
| PC_<*partner*>_DB | Database | This database is empty and can be used to load/store data for querying. If you wish to use existing databases that already contain data, during the initial connection process, you can specify any non-shared databases to use in the field provided. You can also manually specify other databases after the process completes. |
| PC_<*partner*>_WH | Warehouse | The default size of the warehouse is X-Small, but can be changed if needed. |
| PC_<*partner*>_USER | System User | This is the user that connects to Snowflake from the partner application. As noted in the dialog, a random password for the user is automatically generated. |
| PC_<*partner*>_ROLE | Role | The PUBLIC role is granted to this custom role, which enables the role to access any objects owned/granted to the PUBLIC role. In addition, this role is granted to the SYSADMIN role, which enables users with the SYSADMIN role (or higher) to also access any Snowflake objects created for partner access. |

In addition, if you optionally chose to specify one or more existing databases during the initial connection process, a second custom
role is created with all of the necessary privileges to access the tables in the databases:

PC_<*partner*>_DB_PICKER_ROLE

This role is then granted to the PC_<*partner*>_ROLE, which enables all the tables in the specified databases to be used by the partner
application with minimal (or no) additional configuration.

Note that this second role is not displayed in the dialog, but the role is created automatically after all the other objects listed in
the dialog are created.

> **Tip:**
>
> The above objects are created to enable a quick, convenient setup:
>
> * If you prefer to use existing Snowflake objects (databases, warehouses, users, etc.), you can update the preferences in the partner
>   application to reference the desired objects in Snowflake.
> * An account administrator can use [ALTER USER](../sql-reference/sql/alter-user.md) to change the generated password for
>   PC_<*partner*>_USER.
> * To enable access to objects owned by (or granted to) roles other than PUBLIC, grant the other roles to PC_<*partner*>_ROLE.

### Automated Application Features and Resource Usage

Partner applications may include automated features such as dashboards that run on a schedule and consume compute resources. We
encourage you to read the product documentation for a partner application and to
[monitor usage](warehouses-load-monitoring.md) of the PC_<*partner*>_WH warehouse to avoid unexpected Snowflake
credit usage by the application.

## Adding Partner IP Addresses to Network Policies

If you use a [network policy](network-policies.md) to restrict access to your Snowflake account based on user IP
address, partner applications will not be able to access your account unless you add the partner’s IP addresses to the list of
allowed IP addresses in the network policy. For detailed instructions, see [Modify a network policy](network-policies.md).

The following table lists the IP addresses to add for each partner (if available and supported) or provides links to pages on the
partner sites for this information:

| Partner | IP Addresses | Notes |
| --- | --- | --- |
| ALTR | `3.145.219.176/28` . `35.89.45.128/28` . `44.203.133.160/28` |  |
| CARTO | N/A |  |
| Coalesce | N/A |  |
| Dataiku | N/A |  |
| dbt Labs | `52.22.161.231` . `52.45.144.63` . `54.81.134.249` |  |
| Domo | N/A |  |
| Etleap | N/A |  |
| Fivetran | `52.0.2.4` | For more setup details, see the [Fivetran Documentation](https://fivetran.com/docs/warehouses/snowflake). |
| Hunters | `18.192.165.147` . `34.223.20.125` . `34.223.186.164` . `34.223.221.217` . `52.32.222.121` . `52.35.55.27` . `52.35.219.75` . `52.40.78.172` . `54.68.155.124` . `54.72.125.231` . `54.73.199.243` . `54.75.50.99` . `54.212.81.93` . `54.214.94.117` . `54.220.191.11` |  |
| Hevo Data CDC for ETL | TBD |  |
| Hex | N/A |  |
| Hightouch | N/A |  |
| Informatica | N/A |  |
| Informatica Data Loader | N/A |  |
| Keboola | N/A |  |
| Matillion Data Productivity Cloud | N/A |  |
| Sigma | `104.197.169.18` . `104.197.193.23` |  |
| SnapLogic | Various | For the IP addresses, see the [SnapLogic Documentation](https://docs-snaplogic.atlassian.net/wiki/spaces/SD/pages/1439269/Network+Setup#NetworkSetup-IPAddressWhitelisting). |
| SqlDBM | N/A |  |
| Striim | N/A |  |
| ThoughtSpot | `35.164.213.211` |  |

## Launching a Partner Application

After a partner application is connected to Snowflake:

1. On the Snowflake Partner Connect page, click the corresponding tile.
2. Click the Launch button to open the partner web site.

## Disconnecting from a Partner Account

If you decide to discontinue a trial account initiated through Partner Connect for any reason, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To switch to the account administrator role, in the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
3. In the navigation menu, select Admin » Partner Connect.
4. Click the tile for the partner application you are disconnecting from. In the dialog that opens, note the names of the database,
   warehouse, system user, and custom role objects that were created for the partner application during the initial connection process.
5. Use the appropriate [DROP <object>](../sql-reference/sql/drop.md) command to remove each of the objects created for the partner application.

   > **Tip:**
   >
   > During the initial connection process, if you specified existing databases to use with the partner application, remember to also
   > drop the PC_<*partner*>_DB_PICKER_ROLE role that was automatically created along with the other objects.
6. Open a new worksheet in [Snowsight](ui-snowsight-gs.md) and run the following command to complete the removal of the partner
   connection:

   ```sqlsyntax
   select system$remove_etl_integration('partnername');
   ```

   Replace `<partner_name>` with the name of the partner application you are disconnecting from.
7. If the trial does not expire on its own, contact the partner to end your participation in the trial.

## Troubleshooting a Connection

### Connection Already Exists

If your organization already has an account with the partner, initiated either with the partner directly or using Partner Connect on
another one of your Snowflake accounts, initiating another trial account might fail with a message that a connection already exists.

In this case, the trial for this account must be initiated directly through the partner.

---
title: Snowflake releases
source: https://docs.snowflake.com/en/user-guide/intro-releases.md
section: User Guide
---

# Snowflake releases

Snowflake is committed to providing a seamless, always up-to-date experience for our users while also delivering ever-increasing value
through rapid development and continual innovation.

To meet this commitment, we deploy new releases each week. This allows us to regularly deliver service improvements in the form of new
features, enhancements, and fixes. The deployments happen transparently in the background; users experience no downtime
or disruption of service, and are always assured of running on the most-recent release with access to the latest features.

This topic describes the process we follow for weekly releases, including the option to request 24-hour early access for Enterprise Edition
and higher accounts to enable additional release testing (if desired).

## Release types (weekly)

Each week, Snowflake deploys two planned/scheduled releases:

Full release:
:   A full release may include any of the following:

    * New features
    * Feature enhancements or updates
    * Fixes
    * Behavior changes (see next section in this topic)

    In addition, a full release includes updated Snowflake release notes documentation per weekly cycle. See [Snowflake server release notes and feature updates](../release-notes/new-features.md).

    Full releases may be deployed on any day of the week, except we typically do not plan full releases on Friday to mitigate against issues
    that may be encountered during off-hours.

Patch release:
:   A patch release includes fixes only. Note that the scheduled/planned patch release for a given week may be canceled if the
    full release for the week is sufficiently delayed or prolonged.

    Additionally, patch releases are deployed (as needed) during or after the completion of the full release to address any issues that are
    encountered.

## Behavior changes (monthly)

Each month — except for November and December — Snowflake selects one of the weekly full releases for the month to introduce behavior changes.
The weekly release selected for the behavior changes may vary, but is typically the 3rd or 4th release for the month.

A behavior change is defined as any change to existing behavior that returns different results from before and may impact customer code or
workloads. Behavior changes are provided in bundles that utilize the following naming convention:

`YYYY_NN`

Where `YYYY` is the year and `NN` is the ordinal number of the release within the year. For example, `2022_06` would be the 6th behavior
change bundle introduced in 2022.

For more details, see [Behavior change management](../release-notes/bcr-bundles/managing-behavior-change-releases.md).

### Bundle lifecycle

The behavior change bundle lifecycle consists of the following two periods:

Testing period (1st month):
:   The bundle is introduced “Disabled by Default”. During this period, you can choose to *enable* the bundle in
    one or more accounts. Typically, you would choose accounts designated for development or QA (quality assurance) so that you can test the
    changes without impacting your production accounts.

Opt-out period (2nd month):
:   The bundle moves from “Disabled by Default” to “Enabled by Default”. During this period, you can choose to
    *disable* the bundle in your accounts. This allows you to postpone the changes in the bundle, typically for production accounts, while
    making any necessary adjustments to mitigate the impact of the changes.

During these two periods, Snowflake doesn’t override the setting for a given bundle. For example, if you disable a bundle during the testing
period, we do not enable it at the beginning of the opt-out period.

At the end of the opt-out period, Snowflake enables the behavior changes in the bundle across all accounts, at which time the bundle is
considered “Generally Enabled”. From this time onwards, you cannot enable or disable the bundle. However, you can still request to temporarily
disable individual behavior changes in the bundle by contacting [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Behavior change documentation

A release that contains behavior change bundles includes the following documentation (in addition to the Release Notes for the
release):

* A listing of upcoming and recently implemented bundled changes. See [Behavior change announcements](../release-notes/behavior-changes.md).
* A description of each behavior change. Behavior changes are listed on the landing page for each bundle.
* A listing of upcoming and recently implemented unbundled changes. See [Unbundled behavior changes](../release-notes/bcr-bundles/un-bundled/unbundled-behavior-changes.md).

## Pre-release testing and validation

At Snowflake, release quality is a top priority. Before each release is deployed, it goes through a full suite of validation tests that
include:

* Regular build testing.
* Continuous workload and performance testing.

In addition, before any customer accounts are moved to a release, the following validation is performed:

* Full round of regression testing in internal accounts across all supported cloud platforms.
* Simulating execution of select impacted customer workloads (e.g. queries on customer data), with a focus on workloads that are most
  likely impacted by changes in the release.

## Staged release process

After a full release has been deployed, Snowflake doesn’t move all accounts to the release at the same time. Accounts are moved to the
release using a three-stage approach over multiple days. Accounts are moved to the full release in the following order, based on
their [Snowflake Edition](intro-editions.md):

Day 1:
:   Stage 1 (*early access*) for designated Enterprise (or higher) accounts.

Day 1 or 2:
:   Stage 2 (*regular access*) for Standard accounts.

Day 2:
:   Stage 3 (*final*) for Enterprise (or higher) accounts.

Typically, the minimum amount of time between the early access and final stages is 24 hours, but
it may be shorter or longer. This staged approach enables Snowflake to monitor activity as accounts are moved and respond to any issues that
may occur. It also enables designating Enterprise accounts for early access testing (see the next section in this topic).

> **Note:**
>
> This staged approach only applies to full releases. For patch releases, all accounts are moved on the same day.
>
> In addition, if issues are discovered while moving accounts to a full release or patch release, the release might be halted or
> rolled back. In most cases, the follow-up to a halted or rolled-back release is completed within 24-48 hours.

## Early access to full releases

If you have multiple Enterprise Edition (or higher) accounts, you can designate one or more of these accounts as early access to take advantage
of the period between the early access and final stages for full releases. This can be particularly useful if you maintain separate accounts
for development/testing and production.

To designate an account for early access, please contact your Snowflake account representative.

After you have designated an account for early access, you can implement a testing framework similar to the following:

1. Use [CURRENT_VERSION](../sql-reference/functions/current_version.md) (or a UDF that returns similar results) to verify when your early access account is on
   the full release.
2. Use your early access accounts to test your production workloads against the full release.
3. If any issues are encountered, notify [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support), who can work with you to prevent the issues from disrupting your other
   accounts.

> **Tip:**
>
> Early access is not required or recommended for all organizations with Enterprise Edition accounts. Snowflake’s rigorous release testing
> and monitoring during deployments is usually sufficient to prevent most issues. Early access is intended primarily for organizations that
> desire added certainty that their production accounts will not be affected by full releases.

---
title: Snowflake SCIM support
source: https://docs.snowflake.com/en/user-guide/scim-intro.md
section: User Guide
---

# Snowflake SCIM support

Snowflake supports SCIM 2.0, lets you integrate Snowflake with Okta and Microsoft Azure AD as identity providers. You can use
custom identity providers, which are identity providers that are neither Okta nor Microsoft Azure. You can provision users and groups
(roles) from the identity provider into Snowflake, which functions as the service provider.

> **Note:**
>
> SCIM roles in Snowflake must own any users or roles that are imported from the identity provider. If the Snowflake SCIM role does not own
> the imported users or roles, updates in the identity provider are not be synced to Snowflake. Snowflake SCIM roles correlate with their
> identity provider (IdP):
>
> * Okta SCIM Role: `OKTA_PROVISIONER`
> * Microsoft Entra ID SCIM Role: `AAD_PROVISIONER`
> * Custom SCIM Role: `GENERIC_SCIM_PROVISIONER`
>
> For more information on how to use the Snowflake SCIM Role, see the SCIM configuration sections for [Okta](scim-okta.md),
> [Microsoft Entra ID](scim-azure.md), and the [Custom SCIM integration](scim-custom.md).

## Use cases

The Snowflake SCIM API can address the following use cases.

* [Managing users](scim-user-api-reference.md): Administrators can provision and manage their users from their organization’s
  identity provider to Snowflake. User management is a one-to-one mapping from the identity provider to Snowflake.
* [Managing groups](scim-group-api-reference.md): Administrators can provision and manage their groups (i.e. Roles) from
  their organization’s identity provider to Snowflake. Role management is a one-to-one mapping from the identity provider to Snowflake.
* Auditing SCIM API requests: Administrators can query the `rest_event_history` table to determine whether the
  identity provider is sending updates (i.e. SCIM API requests) to Snowflake.

## SCIM API

Identity providers can use a SCIM client to make RESTful API requests to the Snowflake SCIM server. After validating the API request,
Snowflake performs actions requested by the identity providers on users or groups.

Snowflake authenticates SCIM API requests from identity providers through an OAuth Bearer token in the `Authorization` header of HTTP
requests. The token is valid for six months. You must ensure your token is not expired when authenticating. If your token expires, you can
generate a new access token using the [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](../sql-reference/functions/system_generate_scim_access_token.md) function.

> **Caution:**
>
> The Snowflake SCIM API lets administrators manage users and groups from the customer’s identity provider to Snowflake. If you make
> changes to users and groups in Snowflake directly, the changes do not synchronize back to the customer’s identity provider.

For more information about making SCIM API requests to Snowflake, see [SCIM API references](scim-api-references.md).

## Auditing SCIM API requests

You can query Snowflake to find information about SCIM API requests that were made over a span of time. You can use this information to see
if your organization’s active users match the users provisioned into Snowflake.

For example, to determine which SCIM API requests were made in the last five minutes, with a maximum of 200 requests to be returned, you can
use the Information Schema table function [REST_EVENT_HISTORY](../sql-reference/functions/rest_event_history.md):

```sqlexample
use role accountadmin;
use database demo_db;
use schema information_schema;
select *
    from table(rest_event_history(
        'scim',
        dateadd('minutes',-5,current_timestamp()),
        current_timestamp(),
        200))
    order by event_timestamp;
```

For more information on how to modify this query, see [DATEADD](../sql-reference/functions/dateadd.md) and
[CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md).

## Supported SCIM security integrations

See [SCIM security integrations](scim-security-integrations.md).

## Replicating security integrations

Snowflake supports replication and failover/failback with the SCIM security integration from the source account to the target account.

For details, see [Replication of security integrations & network policies across multiple accounts](account-replication-security-integrations.md).

## User invitation emails

Snowflake sends invitation emails to users created using SCIM by default.
Invitation emails are sent within 24-48 hours of users being created.
To opt out of this feature, contact [Snowflake Support](https://community.snowflake.com/s/article/How-To-Submit-a-Support-Case-in-Snowflake-Lodge).

---
title: Snowflake sessions and session policies
source: https://docs.snowflake.com/en/user-guide/session-policies.md
section: User Guide
---

# Snowflake sessions and session policies

This topic describes Snowflake sessions and session policies and provides instructions for configuring session policies at the account or
user level.

## Snowflake sessions

A session begins when a user connects to Snowflake and authenticates successfully using a Snowflake programmatic client or [Snowsight](ui-snowsight-gs.md).
A session is independent of an identity provider (IdP) session. If the Snowflake session expires but the IdP session remains active,
a user can log in to Snowflake without entering their login credentials again (i.e. silent authentication).

A session is maintained indefinitely with continued user activity. After a period of inactivity in the session, known as the
idle session timeout, the user must authenticate to Snowflake again. The idle session timeout has a maximum value of four hours and
a session policy can modify the idle session timeout period. The idle session timeout applies to the following:

* [Snowsight](ui-snowsight-gs.md).
* [Snowflake CLI](../developer-guide/snowflake-cli/index.md).
* [SnowSQL (CLI client)](snowsql.md).
* Supported [connectors and drivers](../guides-overview-connecting.md).
* Third-party clients that connect to Snowflake using a supported connector or driver.

Snowflake recommends reusing existing sessions when possible and to close the connection to Snowflake when a session is no longer needed.

### Snowsight session expiration and logout behavior

* A Snowsight session remains active as long as the user is interacting with the application and has not exceeded the configured idle
  session timeout.
* The session idle timeout is controlled by your organization’s session policy (the default is 4 hours). If there is no activity for longer
  than this period, the session will expire and you will be logged out automatically.
* In addition to idle timeout, session persistence is also affected by authentication cookies:

  > + In most cases, closing and reopening your browser will end your Snowsight session, regardless of your idle time.
  > + If your authentication cookie expires (typically after 24 hours), you will be required to log in again, even if you have not been idle
  >   for longer than the session timeout.
* If your network connection is lost or you attempt to access Snowsight from a disallowed network, your session may be closed and you will be logged out.
* When a session is closed for any reason, any running queries or jobs associated with that session will be terminated after a short delay
  (usually within a few minutes).

> **Note:**
>
> Session expiration can occur due to idle timeout, cookie expiration, browser restarts, or network policy violations. Closing your browser
> or being inactive for an extended period may require you to log in again, even if you have not reached the configured idle timeout.

### Monitor session usage

You can monitor active sessions and session usage using Snowsight or a SQL view. You can view your own sessions,
or use a role with access to view the SESSIONS view to view sessions for your account. See [ACCOUNT_USAGE schema SNOWFLAKE database roles](../sql-reference/account-usage.md).

SQL:
:   Query the [SESSIONS](../sql-reference/account-usage/sessions.md) view in the ACCOUNT USAGE schema
    of the shared SNOWFLAKE database to monitor session usage.

Snowsight:
:   In the navigation menu, select Governance & security » Network policies, and then select the Sessions tab.
    You can review the session ID, user name, start time, client driver in use for the session, client net address, and authentication method.
    Hover over the start time to view the exact date and time that the session started, in your local time zone.

## Session policies

A session policy defines the idle session timeout period in minutes and provides the option to override the default idle timeout
value. The timeout period begins upon a successful authentication to Snowflake. The minimum configurable idle timeout value for a session
policy is `5` minutes.

If a session policy is not set, Snowflake uses a default value of `240` minutes (four hours).

When the session expires, the user must authenticate to Snowflake again. However, Snowflake does not enforce any setting defined by the
[Custom logout endpoint](admin-security-fed-auth-security-integration.md).

The session policy can be set for an account or user with configurable idle timeout periods to address compliance requirements. If a user
is associated with both an account and user-level session policy, the user-level session policy takes precedence. After the session policy
is set on the account or user, Snowflake enforces the session policy.

There are two properties that govern the session policy behavior:

* `SESSION_IDLE_TIMEOUT_MINS` for programmatic and Snowflake clients.
* `SESSION_UI_IDLE_TIMEOUT_MINS` for Snowsight.

For more information, see [Managing session policies](../sql-reference/ddl-user-security.md).

### Secondary roles in a session policy

When a user connects to Snowflake and the session begins, the user can activate
[secondary roles](security-access-control-overview.md) with a [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md) command. However, as a
security administrator, you might want to manage the secondary roles that are available to an individual user, groups of users, or the
entire account. Managing secondary roles helps to scope the set of privileges available to a user for the duration of the session.

To meet these management needs, you can set the `ALLOWED_SECONDARY_ROLES` property in a session policy and set the session policy on
the account or a user in the account. This property controls the secondary roles that can be activated in a session. Setting this property
to an empty list `ALLOWED_SECONDARY_ROLES=()` disables secondary roles in a session.

For examples, see [Specifying secondary roles in a session policy](session-policies-using.md).

> **Note:**
>
> When you set the `ALLOWED_SECONDARY_ROLES` property in a session policy, the enforcement of the secondary roles begins immediately,
> including existing sessions.
>
> Prior to updating the session policy to limit secondary roles, consider your workload schedule and the access control for each
> workload to avoid unnecessary workload disruption.

### Considerations

* If a client supports the CLIENT_SESSION_KEEP_ALIVE option and the option is set to `TRUE`, the client preserves the Snowflake
  session indefinitely as long as the connection to Snowflake is active. Otherwise, if the option is set to `FALSE`, the session ends
  after 4 hours. When possible, avoid using this option since it can result in many open sessions and place a greater demand on resources
  which can lead to a performance degradation.
* You can use the [CLIENT_SESSION_KEEP_ALIVE_HEARTBEAT_FREQUENCY](../sql-reference/parameters.md) parameter to specify the number of seconds
  between client attempts to update the token for the session. The web interface session can be refreshed as Snowflake objects continue to
  be used, such as executing DDL and DML statements. Snowflake checks for this behavior every 30 seconds.
* Creating a new worksheet or opening an existing worksheet continues to use the established user session but with its idle session timeout
  reset to 0.

### Limitations

Future grants:
:   [Future grants](../sql-reference/sql/grant-privilege.md) of privileges on session policies are not supported.

    As a workaround, grant the APPLY SESSION POLICY privilege to a custom role to allow that role to apply session policies on a user or the
    Snowflake account.

---
title: Snowflake storage for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-internal-storage.md
section: User Guide
---

# Snowflake storage for Apache Iceberg™ tables

Snowflake supports Snowflake storage for Apache Iceberg™ tables.

Just like standard Snowflake tables, this feature lets you create [Snowflake-managed Iceberg tables](tables-iceberg.md)
in Snowflake. With this option, Snowflake stores and manages the Iceberg table files for you by using Snowflake (internal) storage, so
you don’t need to set up access to external cloud storage.

This feature works with the [Snowflake Horizon Catalog](snowflake-horizon.md),
so you can use an external query engine to connect to an Iceberg table that uses Snowflake storage.
For more information, see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md). In addition, you can
query these same tables in Snowflake.

> **Note:**
>
> This feature is currently available only for accounts hosted on Amazon Web Services (AWS) or Azure.
> This feature is not available in government regions or in the People’s Republic of China.

## How Snowflake storage works

When you create an Iceberg table with Snowflake storage, Snowflake manages all data and metadata files internally.
You don’t need to configure an external volume or grant Snowflake access to your cloud storage.

### Create an Iceberg table with Snowflake storage

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table_defaults (col1 int)
  CATALOG = SNOWFLAKE
  EXTERNAL_VOLUME = SNOWFLAKE_MANAGED;
```

Explicit `TRANSIENT` table with Snowflake-managed storage:

```sqlexample
CREATE TRANSIENT ICEBERG TABLE my_iceberg_table_internal (col1 int)
  CATALOG = SNOWFLAKE
  EXTERNAL_VOLUME = SNOWFLAKE_MANAGED;
```

* `CATALOG` must be `SNOWFLAKE` for this storage model. If your account default catalog is Snowflake, you can omit `CATALOG`.
* `EXTERNAL_VOLUME` must be `SNOWFLAKE_MANAGED` when you are using Snowflake storage. If your default external volume
  is `SNOWFLAKE_MANAGED`, you can omit `EXTERNAL_VOLUME`.

### The `SNOWFLAKE_MANAGED` external volume

`EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'` selects Snowflake-provided storage for the table. `SNOWFLAKE_MANAGED` is a reserved
value, not a user-created [external volume](../sql-reference/sql/create-external-volume.md) object. You don’t run `CREATE EXTERNAL VOLUME`
for this path.

For Iceberg tables that store files in **your** cloud storage instead, you create an external volume, grant `USAGE`, and set
`EXTERNAL_VOLUME` to that volume’s name. For instructions, see [Configure an external volume](tables-iceberg-configure-external-volume.md).

### Permanent and transient tables

Iceberg tables that use Snowflake storage can be permanent or transient:

* **Permanent** (default): Table data is protected by [Fail-safe](data-failsafe.md),
  the same 7-day data recovery feature that Snowflake provides for standard tables.
* **Transient**: Table data is not protected by Fail-safe. Storage and time travel behavior follow
  [transient tables](tables-temp-transient.md) in Snowflake. Transient tables don’t incur Fail-safe storage costs.

Use the `TRANSIENT` keyword in the [CREATE ICEBERG TABLE](../sql-reference/sql/create-iceberg-table-snowflake.md) statement
to create a transient Iceberg table.

> **Note:**
>
> Transient Iceberg tables are only supported with Snowflake storage. You can’t create a transient Iceberg table
> that uses a customer-managed external volume.

> **Tip:**
>
> To check whether an existing Iceberg table is permanent or transient, run [SHOW TABLES](../sql-reference/sql/show-tables.md)
> and look at the `kind` column. The value is `TRANSIENT` for transient tables and `TABLE` for permanent tables.

### Default catalog and external volume

If you omit `CATALOG` and `EXTERNAL_VOLUME` on the statement, Snowflake resolves them from schema, database, and account
defaults (schema overrides database, database overrides account). When the effective catalog is Snowflake (`CATALOG = 'SNOWFLAKE'`),
the default external volume is `SNOWFLAKE_MANAGED` unless a different default is set at a lower level. For more information, see
[Set a default catalog at the account, database, or schema level](tables-iceberg-configure-catalog-integration.md) and [Set a default external volume at the account, database, or schema level](tables-iceberg-configure-external-volume.md).

When you set `CATALOG = 'SNOWFLAKE'` explicitly, the default external volume is `SNOWFLAKE_MANAGED` unless you override it with
`EXTERNAL_VOLUME` or a schema, database, or account default that names another volume.

## Replication

You can replicate Iceberg tables that use Snowflake storage by using a failover or replication group.
To enable replication for these tables, you must first enable replication for Snowflake-managed Iceberg tables
by following the steps in [Configure replication for Snowflake-managed Apache Iceberg™ tables](tables-iceberg-replication.md).

Unlike standard Snowflake-managed Iceberg tables, you don’t need to include `EXTERNAL VOLUMES` in the
`OBJECT_TYPES` list of your failover or replication group. Snowflake automatically manages the storage for
replicated tables that use the `SNOWFLAKE_MANAGED` external volume.

For example, create a failover group that replicates a database containing Iceberg tables that use Snowflake storage:

```sqlexample
CREATE FAILOVER GROUP my_iceberg_fg
  OBJECT_TYPES = DATABASES
  ALLOWED_DATABASES = my_iceberg_database
  ALLOWED_ACCOUNTS = myorg.my_account_1;
```

### Considerations for replication

* Replication to accounts hosted on Google Cloud Platform (GCP) isn’t supported. Snowflake skips Iceberg tables that
  use Snowflake storage during refresh operations when the target account is hosted on GCP.
* If you created Iceberg tables during the private preview using an external volume other than `SNOWFLAKE_MANAGED`,
  Snowflake automatically migrates the replicated table on the secondary account to use the `SNOWFLAKE_MANAGED`
  volume. Note the following about this migration:

  + If you include `EXTERNAL VOLUMES` in the `OBJECT_TYPES` list of the failover or replication group,
    the private preview external volume is replicated to the secondary account, but it isn’t attached to the table.
    All usages of the private preview external volume on the secondary account are blocked.
  + Snowflake recommends that you drop any Iceberg tables that use a private preview external volume and recreate them
    using `EXTERNAL_VOLUME = SNOWFLAKE_MANAGED` before you enable replication.

## Billing

Snowflake bills your account for the following usage:

### Storage cost

* Snowflake charges for every byte stored in Snowflake.

  Snowflake aggregates the storage usage for Iceberg
  tables that use Snowflake storage in the `STORAGE_BYTES` column of the [STORAGE_USAGE view](../sql-reference/account-usage/storage_usage.md), together
  with storage usage for non-Iceberg tables.
  Only files that are committed to the catalog are included in `STORAGE_BYTES`. Snowflake doesn’t bill for abandoned commits.

  The cost for this storage cost usage is described in Table 3(a) of the
  [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) on the Snowflake website.

### Request cost

> **Note:**
>
> Any time you use a query engine through Horizon Catalog to access Iceberg tables that are stored in Snowflake, the query engine is
> considered an external query engine. When you use an external query engine to access these tables, Snowflake bills your account for this access.
>
> The following list describes some cases where external query engines access Iceberg tables that are stored in Snowflake:
>
> * Snowflake engines that access the table through Horizon Catalog from another Snowflake account. For example, if a table is managed by Snowflake account A but you access
>   the table from the Snowflake engine in account B through Horizon Catalog, you are charged for this access. You are charged for this access because
>   the Snowflake engine in account B is an external query engine.
> * Third-party query engines that you deploy within the Snowflake network by using Snowflake Container Services.
>   When you use these engines through Horizon Catalog to access the table, the engine is external and
>   their requests are billed in the same way as other third-party query engines.
> * Third-party query engines that you deploy outside of Snowflake that you use to connect to the table through Horizon Catalog.
>
> Snowflake doesn’t bill your account when you use the Snowflake query engine to *directly* access these Iceberg tables, which
> means you don’t access them through Horizon Catalog. For example, if a table is managed by account A and you
> use the Snowflake engine in account A to access the table, you aren’t charged for this access.

* When you use an external query engine through Snowflake Horizon Catalog to access Iceberg tables that use Snowflake storage, Snowflake
  bills your account a per-request fee for each HTTP request sent to the underlying storage system. The rate depends on the request type:

  + PUT, COPY, POST, PATCH and LIST operations, which are billed as “class 1”.
  + GET and SELECT operations, which are billed as “class 2”.

  To view the request counts for these operation types, use the
  [STORAGE_REQUEST_HISTORY](../sql-reference/account-usage/storage_request_history.md) Account Usage view.
  This usage is billed under the `STORAGE_REQUEST-1` and `STORAGE_REQUEST-2` SKUs on the billing report.

  This rate is described in Table 3(g) of the [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Data transfer cost

* When you use an external query engine through Horizon Catalog to access the table from a different region or with another cloud provider,
  a standard data transfer charge is billed on a per-byte basis.

  This data transfer charge is described in Tables 4(a), 4(b), and 4(c) of the
  [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

  For more information, see [Understanding data transfer cost](cost-understanding-data-transfer.md).

## Private connectivity

When you use an external query engine to access Iceberg tables that use Snowflake storage, you can configure
private connectivity so that traffic doesn’t traverse the public internet.

For setup instructions, see [To Snowflake-managed storage volumes](private-connectivity-inbound.md).

## Considerations and limitations

Consider the following when you work with Iceberg tables that use Snowflake storage.

### Cloud provider support

This feature is currently available only for accounts hosted on Amazon Web Services (AWS) or Microsoft Azure.
This feature is not available in government regions or in the People’s Republic of China.

### Encryption

Iceberg tables that use Snowflake storage support only server-side encryption (SSE).
Customer-managed keys (CMK) are not supported, even if your account has
[Tri-Secret Secure](security-encryption-tss.md) enabled.

### Cloning behavior

> **Warning:**
>
> The Iceberg table that you create uses catalog-vended credentials. When you clone an Iceberg table that uses catalog-vended credentials, the cloned table
> shares the same base location as the source table. The same credentials can be used to access the shared base location, so the cloned table has write
> access to the source table.

For tables that use Snowflake-managed storage (`EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'`), `CREATE ICEBERG TABLE ... CLONE` succeeds only when the
source table and the new table are **both** transient or **both** permanent. If one is transient and the other is permanent, the statement fails.

| Source table | Clone | Result |
| --- | --- | --- |
| Transient | Transient | Supported |
| Permanent | Permanent | Supported |
| Transient | Permanent | Not supported |
| Permanent | Transient | Not supported |

For command syntax and more cloning behavior, see [CREATE ICEBERG TABLE … CLONE](../sql-reference/sql/create-iceberg-table-snowflake.md) in [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md)
and [Cloning and Apache Iceberg™ tables](object-clone.md).

### Ingesting data

You can ingest data into Iceberg tables that use Snowflake storage using the following methods:

* **Snowpipe**: Use [Snowpipe](data-load-snowpipe-intro.md) to load data from files in cloud storage
  using [COPY INTO](../sql-reference/sql/copy-into-table.md). Snowpipe works with both permanent and transient Iceberg tables.
* **Snowpipe Streaming**: Use [Snowpipe Streaming high-performance](snowpipe-streaming/snowpipe-streaming-high-performance-overview.md)
  to ingest streaming data. Snowpipe Streaming works with both permanent and transient Iceberg tables.

---
title: Snowflake Terraform provider
source: https://docs.snowflake.com/en/user-guide/terraform.md
section: User Guide
---

# Snowflake Terraform provider

[HashiCorp Terraform](https://developer.hashicorp.com/terraform) is an open-source Infrastructure as Code (IaC) tool that allows you to dynamically build, change, and version infrastructure resources. You use the [Terraform language](https://developer.hashicorp.com/terraform/language) to create configuration files that describe the configuration you want. Terraform compares your configuration to the current state and then generates a plan to create new resources or update and delete existing resources. The plan runs as a directed acyclic graph (DAG), which allows Terraform to understand and handle dependencies between resources.

The [Snowflake Terraform provider](https://registry.terraform.io/providers/snowflakedb/snowflake/latest) allows you to establish a consistent workflow to manage Snowflake resources like warehouses, databases, schemas, tables, roles, grants, and more. For more information about other features and building blocks that support Snowflake DevOps workflows, see [DevOps with Snowflake](../developer-guide/builders/devops-with-snowflake.md).

After you [install Terraform](https://developer.hashicorp.com/terraform/tutorials/aws-get-started/install-cli#install-terraform), see the following resources to get started using the Snowflake provider.

| Resource | Description |
| --- | --- |
| [Snowflake provider documentation](https://registry.terraform.io/providers/snowflakedb/snowflake/latest/docs) | Guides and reference documentation in the [Terraform Registry](https://registry.terraform.io/) for the Snowflake provider. Documentation includes the [resource blocks](https://developer.hashicorp.com/terraform/language/resources/syntax) that describe objects in Snowflake (for example, [snowflake_database](https://registry.terraform.io/providers/snowflakedb/snowflake/latest/docs/resources/database)) and the [data sources](https://developer.hashicorp.com/terraform/language/data-sources) that you can use to name and dynamically fetch configuration state from Snowflake objects (for example, [snowflake_users](https://registry.terraform.io/providers/snowflakedb/snowflake/latest/docs/data-sources/users)). |
| [terraform-provider-snowflake](https://github.com/snowflakedb/terraform-provider-snowflake) | The GitHub project where you can do the following:   * Stay up to date on feature developments and status, including the [project roadmap](https://github.com/snowflakedb/terraform-provider-snowflake/blob/main/ROADMAP.md) and [issues](https://github.com/snowflakedb/terraform-provider-snowflake/issues). * Get support from the community in [discussion forums](https://github.com/snowflakedb/terraform-provider-snowflake/discussions). Snowflake Support and subject matter experts participate actively in the GitHub community and make a best effort to resolve issues. Snowflake provides official support as detailed below in Officially supported versions. * Review supplementary documentation and source code. * Review the [change log](https://github.com/snowflakedb/terraform-provider-snowflake/blob/main/CHANGELOG.md) and [migration guide](https://github.com/snowflakedb/terraform-provider-snowflake/blob/main/MIGRATION_GUIDE.md) to follow releases. |
| [Terraforming Snowflake](https://quickstarts.snowflake.com/guide/terraforming_snowflake/#0) | This Quickstart tutorial from Snowflake Labs guides you through creating a Terraform project in GitHub that uses the Snowflake provider to create a demo database and warehouse. |

## Versioning and preview features

The Snowflake Terraform provider follows semantic versioning. Major version releases include breaking changes. We announce these well in advance on GitHub. Minor version releases may sometimes include unexpected changes, depending on the configuration or environment. We balance the occasional one-time inconvenience for some users against the overall benefits these updates bring to the community.

### New features and fixes

* Generally, we introduce new features and fixes in the latest minor version. This is due to the resource-intensive development process and the need for extensive regression testing.
* If we discover a security vulnerability, we consider backporting critical fixes to earlier versions on a case-by-case basis.
* We assess BCRs introduced by underlying Snowflake features for impacts to the provider. The [migration guide](https://github.com/snowflakedb/terraform-provider-snowflake/blob/main/MIGRATION_GUIDE.md) provides information about how to manage potential breaking changes. We prioritize BCR fixes in each latest version release of the provider and recommend updating your version of the provider regularly.

### Preview features

Some resources and data sources are labeled “preview features” with each release.

* Please consider these features to be preview features in the provider, regardless of their state in Snowflake.
* Preview features are disabled by default. You must add the relevant feature name to the `preview_features_enabled` field in the provider configuration. The GitHub repository always contains a list of preview features.
* Each preview feature will be reworked and marked as a stable feature in future releases. Please expect that preview features might introduce breaking changes, even when the provider’s major version number does not change.
* Preview features, much like other Snowflake preview features, do not receive official Snowflake Support. However, the Product and Engineering teams can offer help.

## Officially supported versions

* Snowflake offers official support only for the latest version. When a new version is released, it immediately becomes the officially supported version. You can submit a case for official support of a Terraform provider issue using the processes described in [Contacting Snowflake Support](contacting-support.md).
* Official Snowflake Support began exclusively with version 2.0.0 and later. All other versions, including major versions earlier than 2.0.0, are not officially supported.
* Although the latest version of the provider is the only officially supported version, we make a best effort to support resolution of issues with earlier versions. After assessing the issue, Snowflake Support may at its discretion require an update to the latest version to support the troubleshooting process.

---
title: Snowflake Time Travel & Fail-safe
source: https://docs.snowflake.com/en/user-guide/data-availability.md
section: User Guide
---

# Snowflake Time Travel & Fail-safe

Snowflake provides powerful CDP features for ensuring the maintenance and availability of your historical data (i.e. data that has been changed or deleted):

> * Querying, cloning, and restoring historical data in tables, schemas, and databases for up to 90 days through Snowflake Time Travel.
> * Disaster recovery of historical data (by Snowflake) through Snowflake Fail-safe.

These features are included standard for all accounts, i.e. no additional licensing is required; however, standard Time Travel is 1 day. Extended Time Travel (up to 90 days) requires Snowflake Enterprise Edition. In addition,
both Time Travel and Fail-safe require additional data storage, which has associated fees.

**Next Topics:**

* [Understanding & using Time Travel](data-time-travel.md)
* [Understanding and viewing Fail-safe](data-failsafe.md)
* [Storage costs for Time Travel and Fail-safe](data-cdp-storage-costs.md)

---
title: Snowpark-optimized warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses-snowpark-optimized.md
section: User Guide
---

# Snowpark-optimized warehouses

Snowpark-optimized warehouses let you configure the available memory resources and CPU architecture on a single-node instance for
your workloads.

## When to use a Snowpark-optimized warehouse

While [Snowpark](https://www.snowflake.com/en/data-cloud/snowpark/) workloads can be run on both standard and Snowpark-optimized warehouses,
Snowpark-optimized warehouses are recommended for running Snowpark workloads such as code that has large
memory requirements or dependencies on a specific CPU architecture. Example workloads include Machine Learning (ML) training
use cases using a [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md) on a single virtual warehouse
node. Snowpark workloads, utilizing [UDF](../developer-guide/udf/udf-overview.md) or
[UDTF](../developer-guide/udf/python/udf-python-tabular-functions.md), might also benefit from Snowpark-optimized warehouses.
Workloads that don’t use Snowpark might not benefit from running on Snowpark-optimized warehouses.

> **Note:**
>
> Initial creation and resumption of a Snowpark-optimized virtual warehouse might take longer than standard warehouses.

## Configuration options for Snowpark-optimized warehouses

[Preview Feature](../release-notes/preview-features.md) — Open

The 1 TB resource constraints (MEMORY_64X and MEMORY_64X_x86) are available as a preview feature.
The 1 TB constraints are available only on the Amazon Web Services (AWS) cloud platform.

All other MEMORY_\* resource constraint sizes are generally available and are available for all cloud platforms.

The default configuration for a Snowpark-optimized warehouse provides 16x memory per node compared to a standard warehouse. You can
optionally configure additional memory per node and specify CPU architecture using the `resource_constraint` property
of the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) or [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.
The following options are available:

| Memory (up to) | CPU architecture | RESOURCE_CONSTRAINT values | Minimum warehouse size |
| --- | --- | --- | --- |
| 16GB | Default or x86 | MEMORY_1X, MEMORY_1X_x86 | XSMALL |
| 256GB | Default or x86 | MEMORY_16X, MEMORY_16X_x86 | M |
| 1TB | Default or x86 | MEMORY_64X, MEMORY_64X_x86 | L |

## Creating a Snowpark-optimized warehouse

To create a new Snowpark-optimized warehouse, you can set the warehouse type property in the following interfaces.

SQLPython

Set the WAREHOUSE_TYPE property to `'SNOWPARK-OPTIMIZED'` when running the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) command. For example:

```sqlexample
CREATE OR REPLACE WAREHOUSE snowpark_opt_wh WITH
  WAREHOUSE_SIZE = 'MEDIUM'
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED';
```

Create a large Snowpark-optimized warehouse `so_warehouse` with 256 GB of memory by specifying the resource constraint
`MEMORY_16X_X86`:

```sqlexample
CREATE WAREHOUSE so_warehouse WITH
  WAREHOUSE_SIZE = 'LARGE'
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  RESOURCE_CONSTRAINT = 'MEMORY_16X_X86';
```

> **Note:**
>
> The default resource constraint is `MEMORY_16X`.

Set the `warehouse_type` property to `'SNOWPARK-OPTIMIZED'` when constructing a [Warehouse](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.Warehouse) object.

Then, pass this `Warehouse` object to the [WarehouseCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseCollection)
method to create the warehouse in Snowflake. For example:

```python
from snowflake.core import CreateMode
from snowflake.core.warehouse import Warehouse

my_wh = Warehouse(
  name="snowpark_opt_wh",
  warehouse_size="MEDIUM",
  warehouse_type="SNOWPARK-OPTIMIZED"
)
root.warehouses.create(my_wh, mode=CreateMode.or_replace)
```

> **Note:**
>
> Resource constraints are currently not supported in the Snowflake Python APIs.

## Modifying Snowpark-optimized warehouse properties

To modify warehouse properties including the warehouse type, you can use the following interfaces.

> **Note:**
>
> You can change the warehouse type whether the warehouse is in the `STARTED` or `SUSPENDED` state.
> If you suspend a warehouse before changing the `warehouse_type` property, execute the following operation:
>
> SQLPython
>
> ```sqlexample
> ALTER WAREHOUSE snowpark_opt_wh SUSPEND;
> ```
>
> ```python
> root.warehouses["snowpark_opt_wh"].suspend()
> ```

SQLPython

Use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command to modify the memory resources and CPU architecture for Snowpark-optimized
warehouse `so_warehouse`:

```sqlexample
ALTER WAREHOUSE so_warehouse SET
  RESOURCE_CONSTRAINT = 'MEMORY_1X_x86';
```

Resource constraints are currently not supported in the Snowflake Python APIs.

## Using Snowpark Python Stored Procedures to run ML training workloads

For information on Machine Learning Models and Snowpark Python, see [Training Machine Learning Models with Snowpark Python](../developer-guide/snowpark/python/python-snowpark-training-ml.md).

## Billing for Snowpark-optimized warehouses

For information on Snowpark-optimized warehouse credit consumption, see
`Table 1` in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Tip:**
>
> For information about cost implications of changing the RESOURCE_CONSTRAINT property, see
> [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](warehouses-gen2.md).

## Region availability

Snowpark-optimized warehouses are available in all regions across AWS, Azure, and Google Cloud.

1 TB memory options are not currently available for the Microsoft Azure and Google Cloud Platform (GCP)
[cloud platforms](intro-cloud-platforms.md). On the Amazon Web Services (AWS) cloud platform,
the 1 TB memory option is also still a preview feature.

---
title: Snowpipe
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-intro.md
section: User Guide
---

# Snowpipe

Snowpipe enables loading data from files as soon as they’re available in a stage. This means you can load data from files in micro-batches, making it available to users within minutes, rather than manually executing COPY statements on a schedule to load larger batches.

## How does Snowpipe work?

Snowpipe loads data from files as soon as they are available in a stage. The data is loaded according to the COPY statement defined in a referenced pipe.

A pipe is a named, first-class Snowflake object that contains a COPY statement used by Snowpipe. The COPY statement identifies the source location of the data files (i.e., a stage) and a target table. All data types are supported, including semi-structured data types such as JSON and Avro.

Different mechanisms for detecting the staged files are available:

* Automating Snowpipe using cloud messaging

  Automated data loads leverage event notifications for cloud storage to inform Snowpipe of the arrival of new data files to load. Snowpipe
  polls the event notifications from a queue. By using the metadata in the queue, Snowpipe loads the new data files into the target table in a continuous, serverless fashion based on the parameters
  defined in a specified pipe object.
* Calling Snowpipe REST endpoints

  Your client application calls a public REST endpoint with the name of a pipe object and a list of data filenames. If new data files
  matching the list are discovered in the stage referenced by the pipe object, they are queued for loading. Snowflake-provided compute
  resources load data from the queue into a Snowflake table based on parameters defined in the pipe.

### Supported Cloud Storage services

The following table indicates the cloud storage service support for automated Snowpipe and Snowpipe REST API calls from Snowflake accounts hosted on each
[cloud platform](intro-cloud-platforms.md):

| Snowflake Account Host | Amazon S3 | Google Cloud Storage | Microsoft Azure Blob storage | Microsoft Data Lake Storage Gen2 | Microsoft Azure General-purpose v2 |
| --- | --- | --- | --- | --- | --- |
| Amazon Web Services | ✔ | ✔ | ✔ | ✔ | ✔ |
| Google Cloud | ✔ | ✔ | ✔ | ✔ | ✔ |
| Microsoft Azure | ✔ | ✔ | ✔ | ✔ | ✔ |

For more information, see [Automate continuous data loading with cloud messaging](data-load-snowpipe-auto.md) and [Overview of the Snowpipe REST endpoints to load data](data-load-snowpipe-rest-overview.md).

Note that the government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions. For more information, see [AWS GovCloud (US)](https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-s3.html) and [Azure Government](https://learn.microsoft.com/en-us/azure/azure-government/).

> **Important:**
>
> Snowflake recommends that you enable cloud event filtering for Snowpipe to reduce costs, event noise, and latency. For more information about configuring event filtering for each cloud provider, see the following pages:
>
> * [Configuring event notifications using object key name filtering - Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-how-to-filtering.html)
> * [Understand event filtering for Event Grid subscriptions - Azure](https://docs.microsoft.com/en-us/azure/event-grid/event-filtering)
> * [Filtering messages - Google Pub/Sub](https://cloud.google.com/pubsub/docs/filtering)

## How is Snowpipe different from bulk data loading?

This section briefly describes the primary differences between Snowpipe and a bulk data load workflow using the COPY command. Additional details are provided throughout the Snowpipe documentation.

### Authentication

Bulk data load:
:   Relies on the security options supported by the client for authenticating and initiating a user session.

Snowpipe:
:   When calling the REST endpoints: Requires key pair authentication with JSON Web Token (JWT). JWTs are signed using a public/private key pair with RSA encryption.

### Load history

Bulk data load:
:   Stored in the metadata of the target table for 64 days. Available upon completion of the COPY statement as the statement output.

Snowpipe:
:   Stored in the metadata of the pipe for 14 days. Must be requested from Snowflake via a REST endpoint, SQL table function, or ACCOUNT_USAGE view.

> **Important:**
>
> To avoid reloading files (and duplicating data), we recommend loading data from a specific set of files using either bulk data loading or Snowpipe but not both.

### Transactions

Bulk data load:
:   Loads are always performed in a single transaction. Data is inserted into table alongside any other SQL statements submitted manually by users.

Snowpipe:
:   Loads are combined or split into a single or multiple transactions based on the number and size of the rows in each data file. Rows of partially loaded files (based on the ON_ERROR copy option setting) can also be combined or split into one or more transactions.

### Compute resources

Bulk data load:
:   Requires a user-specified warehouse to execute COPY statements.

Snowpipe:
:   Uses Snowflake-supplied compute resources.

### Cost

Bulk data load:
:   Billed for the amount of time each virtual warehouse is active.

Snowpipe:
:   Billed according to the compute resources used in the Snowpipe warehouse while loading the files.

## Recommended load file size

For the most efficient and cost-effective load experience with Snowpipe, we recommend following the file sizing recommendations in [File sizing best practices](data-load-considerations-prepare.md) and staging files once per minute. This approach typically leads to a good balance between cost (i.e. resources spent on Snowpipe queue management and the actual load) and performance (i.e. load latency). For more information, see [Continuous data loads — that is, Snowpipe — and file sizing](data-load-considerations-prepare.md).

## Load order of data files

For each pipe object, Snowflake establishes a single queue to sequence data files awaiting loading. As new data files are discovered in a stage, Snowpipe appends them to the queue. However, multiple processes pull files from the queue; and so, while Snowpipe generally loads older files first, there is no guarantee that files are loaded in the same order they are staged.

## Data duplication

Snowpipe uses file loading metadata associated with each pipe object to prevent reloading the same files (and duplicating data) in a table. This metadata stores the path (i.e. prefix) and name of each loaded file, and prevents loading files with the same name even if they were later modified (i.e. have a different eTag).

## Estimating Snowpipe latency

Given the number of factors that can differentiate Snowpipe loads, it is very difficult for Snowflake to estimate latency. File formats and sizes, and the complexity of COPY statements (including SELECT statement used for transformations), all impact the amount of time required for a Snowpipe load.

We suggest that you experiment by performing a typical set of loads to estimate average latency.

## Pipe security

### Access control privileges

#### Creating pipes

Creating and managing pipes requires a role with a minimum of the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE, CREATE PIPE |  |
| Stage in the pipe definition | USAGE | External stages only. |
| Stage in the pipe definition | READ | Internal stages only. |
| Table in the pipe definition | SELECT, INSERT |  |

#### Owning pipes

After a pipe is created, the pipe owner (i.e. the role that has the OWNERSHIP privilege on the pipe) must have the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Pipe | OWNERSHIP |  |
| Stage in the pipe definition | USAGE | External stages only. |
| Stage in the pipe definition | READ | Internal stages only. |
| Table in the pipe definition | SELECT, INSERT |  |

#### Pausing or resuming pipes

In addition to the pipe owner, a role that has the following minimum permissions can pause or resume the pipe:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Pipe | OPERATE |  |
| Stage in the pipe definition | USAGE | External stages only. |
| Stage in the pipe definition | READ | Internal stages only. |
| Table in the pipe definition | SELECT, INSERT |  |

## Snowpipe DDL

To support creating and managing pipes, Snowflake provides the following set of special DDL commands:

* [CREATE PIPE](../sql-reference/sql/create-pipe.md)
* [ALTER PIPE](../sql-reference/sql/alter-pipe.md)
* [DROP PIPE](../sql-reference/sql/drop-pipe.md)
* [DESCRIBE PIPE](../sql-reference/sql/desc-pipe.md)
* [SHOW PIPES](../sql-reference/sql/show-pipes.md)

In addition, providers can view, grant, or revoke access to the necessary database objects for Snowpipe using the following standard access control DDL:

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](../sql-reference/sql/revoke-privilege.md)
* [SHOW GRANTS](../sql-reference/sql/show-grants.md)

---
title: Snowpipe costs
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-billing.md
section: User Guide
---

# Snowpipe costs

With Snowpipe’s serverless compute model, users can initiate any size load without managing a virtual warehouse. Instead, Snowflake provides and manages the compute resources, automatically growing or shrinking capacity based on the current Snowpipe load.

> **Important:**
>
> Snowpipe ingestion is billed based on a fixed credit amount per GB. This simplified model provides you with more predictable data-loading expenses and simplifies cost estimation. The former cost model had two components: the actual compute resources used to load data, measured per-second/per-core, and a per-1,000-files charge.
>
> This credit-per-GB billing model applies to all Snowflake editions: Standard, Enterprise, Business Critical, and Virtual Private Snowflake (VPS).
>
> For text files — such as CSV, JSON, XML — you are charged based on their uncompressed size. For binary files — such as Parquet, Avro, ORC — you are charged based on their observed size regardless of compression.
>
> For more information, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Resource consumption and management overhead

With the credit-per-GB pricing model, Snowpipe billing is calculated based on a fixed credit amount per GB of data that you loaded. This simplified approach means that you don’t need to track or manage compute utilization, which was formerly measured with per-second/per-core granularity.

File sizes and staging frequency might impact the performance of Snowpipe. For recommended best practices, see [Continuous data loads — that is, Snowpipe — and file sizing](data-load-considerations-prepare.md).

## Estimation of Snowpipe charges

Estimating Snowpipe charges is straightforward. You can calculate your expected costs by using your anticipated data volume and the fixed credit amount per GB. Because text files — such as CSV, JSON, XML — are charged based on their uncompressed size, you must know the compression ratio of your text files.

You can verify these calculations against your actual usage by examining the BILLED_BYTES column in the relevant Account Usage views. The BILLED_BYTES column was introduced in the [2025_05 BCR bundle](../release-notes/bcr-bundles/2025_05/bcr-2045.md).

To understand the actual credit consumption for your specific workloads, we suggest that you experiment by performing a typical set of loads.

## View data-load history and cost

Account administrators (users with the ACCOUNTADMIN role) or users with a role granted the MONITOR USAGE global privilege can use [Snowsight](ui-snowsight-gs.md) or SQL to view the credits billed to your Snowflake account within a specified date range.

Occasionally, the data compaction and maintenance process can consume Snowflake credits. For example, the returned results might show that you consumed credits with 0 BYTES_INSERTED and 0 FILES_INSERTED. This means that your data is not being loaded, but the data compaction and maintenance process has consumed some credits.

To view the credits billed for Snowpipe data loading for your account:

> Snowsight:
> :   In the navigation menu, select Admin » Cost management.
>
> SQL:
> :   Query either of the following:
>
>     * [PIPE_USAGE_HISTORY](../sql-reference/functions/pipe_usage_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).
>     * [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md) (in [Account Usage](../sql-reference/account-usage.md)).
>
>       You can run the following queries against the PIPE_USAGE_HISTORY view. You can verify costs based on volume by using the `BYTES_BILLED` column.
>
>       **Query: Snowpipe cost history (by day, by object)**
>
>       The following query provides a full list of pipes and the volume of credits that you consumed through the service over the last 30 days, broken out by day.
>
>       ```sqlexample
>       SELECT TO_DATE(start_time) AS date,
>         pipe_name,
>         SUM(credits_used) AS credits_used,
>         SUM(bytes_billed) AS bytes_billed_total
>       FROM snowflake.account_usage.pipe_usage_history
>       WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
>       GROUP BY 1,2
>       ORDER BY bytes_billed_total DESC;
>       ```
>
>       **Query: Snowpipe History & m-day average**
>
>       The following query shows the average daily credits consumed by Snowpipe that are grouped by week over the last year. This query can help you identify anomalies in daily consumption averages over the year so that you can investigate sudden increases or unexpected changes in consumption.
>
>       ```sqlexample
>       WITH credits_by_day AS (
>         SELECT TO_DATE(start_time) AS date,
>           SUM(credits_used) AS credits_used,
>           SUM(bytes_billed) AS bytes_billed_total
>         FROM snowflake.account_usage.pipe_usage_history
>         WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
>         GROUP BY 1
>       )
>       SELECT DATE_TRUNC('week',date),
>         AVG(credits_used) AS avg_daily_credits,
>         AVG(bytes_billed_total) AS avg_daily_bytes_billed
>       FROM credits_by_day
>       GROUP BY 1
>       ORDER BY 1;
>       ```

> **Note:**
>
> [Resource monitors](resource-monitors.md) provide control over virtual warehouse credit usage; however, you cannot use them to control
> credit usage for the Snowflake-provided warehouses, including the  SNOWPIPE warehouse.

---
title: Snowpipe error notifications
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-errors.md
section: User Guide
---

# Snowpipe error notifications

Snowpipe can push error notifications to a cloud messaging service when it encounters errors while loading data. The notifications describe the errors encountered in each file, enabling further analysis of the data in the files.

> **Note:**
>
> Snowpipe error notifications only work when the ON_ERROR copy option is set to SKIP_FILE (the default). Snowpipe will not send any error notifications if the ON_ERROR copy option is set to CONTINUE.
>
> You can use the NOTIFICATION_HISTORY table function to query the history of notifications sent through Snowpipe. For more information, refer to [NOTIFICATION_HISTORY](../sql-reference/functions/notification_history.md).

Currently, cross-cloud support is not available for push notifications. Configure error notification support for the messaging service provided by the cloud platform where your Snowflake account is hosted.

**Next Topics:**

* [Enabling Snowpipe error notifications for Amazon SNS](data-load-snowpipe-errors-sns.md)
* [Enabling Snowpipe error notifications for Google Pub/Sub](data-load-snowpipe-errors-gcs.md)
* [Enabling Snowpipe error notifications for Microsoft Azure Event Grid](data-load-snowpipe-errors-azure.md)

---
title: Snowpipe REST API
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-rest-apis.md
section: User Guide
---

# Snowpipe REST API

You interact with a pipe by making calls to REST endpoints. This topic describes the Snowpipe REST API for defining the list of files to ingest and fetching reports of the load history.

Snowflake also provides Java and Python APIs that simplify working with the Snowpipe REST API.

## Data file ingestion

The Snowpipe API provides a REST endpoint for defining the list of files to ingest.

### Endpoint: `insertFiles`

Informs Snowflake about the files to be ingested into a table. A successful response from this endpoint means that Snowflake has recorded the list of files to add to the table. It does not necessarily mean the files have been ingested. For more details, see the response codes below.

In most cases, Snowflake inserts fresh data into the target table within a few minutes.

**Method:** `POST`

**POST URL:**

> `https://{account}.snowflakecomputing.com/v1/data/pipes/{pipeName}/insertFiles?requestId={requestId}`

**URL Parameters:**

* `account` (Required): Account identifier for your Snowflake account.
* `pipeName` (Required): Case-sensitive, fully qualified pipe name. For example, `myDatabase.mySchema.myPipe`.
* `requestId` (Optional): String used to track requests through the system. We recommend providing a random string with each request, for example, a UUID. This should be appended to the URL like this: `?requestId=<your_uuid>`.

**Request Headers**

* `Content-Type:`:

  + `text/plain`: For a plain text list of file paths and filenames, one per line. The size parameter is not allowed in this format.
  + `application/json`: For a JSON object containing a list of files with optional size information.
* `Authorization`: `BEARER <jwt_token>`

**Request Body (for application/json Content-Type)**

The request body must be a JSON object with a single key named “files”. The value associated with this key is an array of JSON objects, where each object represents a file to be ingested.

```JSON
{
  "files":[
    {
      "path":"filePath/file1.csv",
      "size":100
    },
    {
      "path":"filePath/file2.csv",
      "size":100
    }
   ]
}
```

Each element in the “files” array is a JSON object with the following attributes:

* `path` (Required): The path and filename of the staged file. If you follow our recommended best practices by partitioning your data in the stage using logical, granular paths, the path values in the payload include the complete paths to the staged files.
* `size` (Optional, but recommended for better performance): The size of the file in bytes.

**Request Body (for text/plain Content-Type)**

The request body should be a plain text list of file paths and filenames, with one entry per line.

```text
filePath/file_a.csv
another/path/file_b.json
yet/another/file_c.txt
```

> **Note:**
>
> The post can contain at most 5000 files. Each file path given must be <= 1024 bytes long when serialized as UTF-8.

**Response Body**

> Response Codes:
>
> * 200 — Success. Files added to the queue of files to ingest.
> * 400 — Failure. Invalid request due to an invalid format, or limit exceeded.
> * 404 — Failure. `pipeName` not recognized.
>
>   This error code can also be returned if the role used when calling the endpoint does not have sufficient privileges. For more information, see [Grant access privileges](data-load-snowpipe-rest-gs.md).
> * 429 — Failure. Request rate limit exceeded.
> * 500 — Failure. Internal error occurred.
>
> Response Payload:
>
> > With a successful API request (i.e. code 200), the response payload contains the `requestId` and `status` elements in JSON format. If an error occurs, the response payload may contain details about the error.
> >
> > ```JSON
> > {
> >   "requestId": "your_request_uuid",
> >   "status": "success"
> > }
> > ```
> >
> > If the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement in the pipe definition includes the PATTERN copy option, the `unmatchedPatternFiles` attribute lists any files submitted in the header that did not match the regular expression and were therefore skipped.
> >
> > > ```JSON
> > > {
> > >   "requestId": "your_request_uuid",
> > >   "status": "success",
> > >   "unmatchedPatternFiles": ["some_file.txt", "another_file.dat"]
> > > }
> > > ```

## Load history reports

The Snowpipe API provides REST endpoints for fetching load reports.

### Endpoint: `insertReport`

Retrieves a report of files submitted via `insertFiles` whose contents were recently ingested into a table. Note that for large files, this may only be part of the file.

Note the following limitations for this endpoint:

* The 10,000 most recent events are retained.
* Events are retained for a maximum of 10 minutes.

An event occurs when data from a file submitted via `insertFiles` has been committed to the table and is available to queries. The `insertReport` endpoint can be thought of like the UNIX command tail. By calling this command repeatedly, it is possible to see the full history of events on a pipe over time. Note that the command must be called often enough to not miss events. How often depends on the rate files are sent to `insertFiles`.

**Method:** `GET`

**GET URL:**

> `https://<account_identifier>.snowflakecomputing.com/v1/data/pipes/<pipeName>/insertReport?requestId=<requestId>&beginMark=<beginMark>`

**URL Parameters:**

* `account_identifier` (Required): Your unique Snowflake account identifier. The preferred format is `organization_name-account_name`. For alternative formats (account locator with region and cloud platform), see [Format 1 (preferred): Account name in your organization](admin-account-identifier.md).
* `pipeName` (Required): The case-sensitive, fully qualified name of the Snowpipe. For example, `myDatabase.mySchema.myPipe`.
* `requestId` (Optional): A string you can provide to track this specific request through Snowflake’s system. Using a random string like a UUID is highly recommended for easier debugging and monitoring. Append this to the URL like so: `?requestId=<your_uuid>`.
* `beginMark` (Optional): A marker value returned in the `nextBeginMark` field of a previous `insertReport` response. Including this marker helps optimize subsequent calls by potentially reducing the number of duplicate events returned. Note: While `beginMark` is intended as a hint to avoid duplicates, occasional repetition of events might still occur. If `beginMark` is not specified, the report will show the ingestion history from the last 10 minutes. Append this to the URL like so: `?beginMark=<previous_nextBeginMark>`.

**Request Headers:**

* Accept: Specifies the desired response format. Accepted values are `text/plain` or `application/json`.
* Authorization : Your Snowflake authentication token. Use the format BEARER <jwt_token>.

**Request Body:**

This endpoint does not accept a request body for GET requests. The necessary parameters are provided in the URL and headers.

**Response Body:**

> Response Codes:
>
> * 200 — Success. Report returned.
> * 400 — Failure. Invalid request due to an invalid format, or limit exceeded.
> * 404 — Failure. `pipeName` not recognized.
>
>   This error code can also be returned if the role used when calling the endpoint does not have sufficient privileges. For more information, see [Grant access privileges](data-load-snowpipe-rest-gs.md).
> * 429 — Failure. Request rate limit exceeded.
> * 500 — Failure. Internal error occurred.
>
> Response Payload:
>
> > A success response (200) contains information about files that have recently been added to the table. Note that this report may only represent a portion of a large file.
> >
> > For example:
> >
> > > ```JSON
> > > {
> > >   "pipe": "TESTDB.TESTSCHEMA.pipe2",
> > >   "completeResult": true,
> > >   "nextBeginMark": "1_39",
> > >   "files": [
> > >     {
> > >       "path": "data2859002086815673867.csv",
> > >       "stageLocation": "s3://mybucket/",
> > >       "fileSize": 57,
> > >       "timeReceived": "2017-06-21T04:47:41.453Z",
> > >       "lastInsertTime": "2017-06-21T04:48:28.575Z",
> > >       "rowsInserted": 1,
> > >       "rowsParsed": 1,
> > >       "errorsSeen": 0,
> > >       "errorLimit": 1,
> > >       "complete": true,
> > >       "status": "LOADED"
> > >     }
> > >   ]
> > > }
> > > ```
>
> Response Fields:
>
> > | Field | Type | Description |
> > | --- | --- | --- |
> > | `pipe` | String | The fully-qualified name of the pipe. |
> > | `completeResult` | Boolean | `false` if an event was missed between the supplied `beginMark` and the first event in this report history. Otherwise, `true`. |
> > | `nextBeginMark` | String | `beginMark` to use on the next request to avoid seeing duplicate records. Note that this value is a hint. Duplicates can still occasionally occur. |
> > | `files` | Array | An array of JSON objects, one object for each file that is part of the history response. |
> > | `path` | String | The file path relative to the stage location. |
> > | `stageLocation` | String | Either the stage ID (internal stage) or the S3 bucket (external stage) defined in the pipe. |
> > | `fileSize` | Long | File size, in bytes. |
> > | `timeReceived` | String | Time that this file was received for processing. Format is ISO-8601 in UTC time zone. |
> > | `lastInsertTime` | String | Time that data from this file was last inserted into the table. Format is ISO-8601 in UTC time zone. |
> > | `rowsInserted` | Long | Number of rows inserted into the target table from the file. |
> > | `rowsParsed` | Long | Number of rows parsed from the file. Rows with errors may be skipped. |
> > | `errorsSeen` | Integer | Number of errors seen in the file |
> > | `errorLimit` | Integer | Number of errors allowed in the file before it is considered failed (based on ON_ERROR copy option). |
> > | `firstError` [1] | String | Error message for the first error encountered in this file. |
> > | `firstErrorLineNum` [1] | Long | Line number of the first error. |
> > | `firstErrorCharacterPos` [1] | Long | Character position of the first error. |
> > | `firstErrorColumnName` [1] | String | Column name where the first error occurred. |
> > | `systemError` [1] | String | General error describing why the file was not processed. |
> > | `complete` | Boolean | Indicates whether the file was completely processed successfully. |
> > | `status` | String | Load status for the file: |
> > |  |  | * LOAD_IN_PROGRESS: Part of the file has been loaded into the table, but the load process has not completed yet. |
> > |  |  | * LOADED: The entire file has been loaded into the table. |
> > |  |  | * LOAD_FAILED: The file load failed. |
> > |  |  | * PARTIALLY_LOADED: Some rows from this file were loaded successfully, but others were not loaded due to errors. Processing of this file is completed. |
> >
> > [1] Values are only supplied for these fields when files include errors.

### Endpoint: `loadHistoryScan`

Fetches a report about ingested files whose contents have been added to table. Note that for large files, this may only be part of the file. This endpoint differs from `insertReport` in that it views the history between two points in time. There is a maximum of 10,000 items returned, but multiple calls can be issued to cover the desired time range.

> **Important:**
>
> This endpoint is rate limited to avoid excessive calls. To help avoid exceeding the rate limit (error code 429), we recommend relying more heavily on `insertReport` than `loadHistoryScan`. When calling `loadHistoryScan`, specify the most narrow time range that includes a set of data loads. For example, reading the last 10 minutes of history every 8 minutes would work well. Trying to read the last 24 hours of history every minute will result in 429 errors indicating a rate limit has been reached. The rate limits are designed to allow each history record to be read a handful of times.

For a more comprehensive view, without these limits, Snowflake provides an Information Schema table function, [COPY_HISTORY](../sql-reference/functions/copy_history.md), that returns the load history of a pipe or table.

**Method:** `GET`

**GET URL:**

> `https://{account}.snowflakecomputing.com/v1/data/pipes/{pipeName}/loadHistoryScan?startTimeInclusive=<startTime>&endTimeExclusive=<endTime>&requestId=<requestId>`

**URL Parameters:**

* `account` (Required): Your unique Snowflake account identifier.
* `pipeName` (Required): The case-sensitive, fully qualified name of the Snowpipe. Example: `myDatabase.mySchema.myPipe`.
* `startTimeInclusive` (Required): The beginning of the time range for retrieving load history data, specified as a timestamp in ISO-8601 format (for example, 2023-10-26T10:00:00Z). This timestamp marks the inclusive lower bound of the query.
* `endTimeExclusive` (Optional): The end of the time range for retrieving load history data, specified as a timestamp in ISO-8601 format (for example, 2023-10-26T10:15:00Z). This timestamp marks the exclusive upper bound of the query. If this parameter is omitted, the current server timestamp (CURRENT_TIMESTAMP()) will be used as the end of the time range.
* `requestId` (Optional): A string you can provide to track this specific request through Snowflake’s system. We recommend using a random string like a UUID for easier debugging and monitoring. Append this to the URL like so: `?requestId=<your_uuid>`.

**Request Headers:**

* `Accept`: Specifies the desired response format. Accepted values are `text/plain` or `application/json`.
* `Authorization`: Your Snowflake authentication token. Use the format `BEARER <jwt_token>`.

**Request Body:**

This endpoint does not accept a request body for `GET` requests. All necessary parameters are provided in the URL and headers.

**Response Body:**

> Response Codes:
>
> * 200 — Success. Load History scan results are returned.
> * 400 — Failure. Invalid request due to an invalid format, or limit exceeded.
> * 404 — Failure. `pipeName` not recognized.
> * 429 — Failure. Request rate limit exceeded.
> * 500 — Failure. Internal error occurred.
>
> Response Payload:
>
> > A success response (200) contains information about files that have recently been added to the table. Note that this report may only represent a portion of a large file.
> >
> > For example:
> >
> > > ```JSON
> > > {
> > >   "pipe": "TESTDB.TESTSCHEMA.pipe2",
> > >   "completeResult": true,
> > >   "startTimeInclusive": "2017-08-25T18:42:31.081Z",
> > >   "endTimeExclusive":"2017-08-25T22:43:45.552Z",
> > >   "rangeStartTime":"2017-08-25T22:43:45.383Z",
> > >   "rangeEndTime":"2017-08-25T22:43:45.383Z",
> > >   "files": [
> > >     {
> > >       "path": "data2859002086815673867.csv",
> > >       "stageLocation": "s3://mystage/",
> > >       "fileSize": 57,
> > >       "timeReceived": "2017-08-25T22:43:45.383Z",
> > >       "lastInsertTime": "2017-08-25T22:43:45.383Z",
> > >       "rowsInserted": 1,
> > >       "rowsParsed": 1,
> > >       "errorsSeen": 0,
> > >       "errorLimit": 1,
> > >       "complete": true,
> > >       "status": "LOADED"
> > >     }
> > >   ]
> > > }
> > > ```
>
> Response Fields:
>
> | Field | Type | Description |
> | --- | --- | --- |
> | `pipe` | String | Fully-qualified name of the pipe. |
> | `completeResult` | Boolean | `false` if the report is incomplete (i.e. the number of entries in the specified time range exceeds the 10,000 entry limit). If `false`, the user can specify the current `rangeEndTime` value as the `startTimeInclusive` value for the next request to proceed to the next set of entries. |
> | `startTimeInclusive` | String | Starting timestamp (in ISO-8601 format) provided in the request. |
> | `endTimeExclusive` | String | Ending timestamp (in ISO-8601 format) provided in the request. |
> | `rangeStartTime` | String | Timestamp (in ISO-8601 format) of the oldest entry in the files included in the response. |
> | `rangeEndTime` | String | Timestamp (in ISO-8601 format) of the latest entry in the files included in the response. |
> | `files` | Array | An array of JSON objects, one object for each file that is part of the history response. Within the array, the response fields are the same as those returned in the `insertReport` response. |

---
title: Snowpipe Streaming
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md
section: User Guide
---

# Snowpipe Streaming

Snowpipe Streaming is Snowflake’s real-time ingestion service built on the high-performance architecture. It enables applications to load streaming data directly into Snowflake tables as rows arrive, without staging files or managing intermediate storage. Data becomes available for query within seconds of ingestion, supporting use cases from IoT telemetry and Change Data Capture (CDC) pipelines to fraud detection and live analytics.

Snowpipe Streaming delivers:

* Up to **10 GB/s** throughput per table
* **As low as 5 second** end-to-end ingest-to-query latency
* **Exactly-once delivery** through built-in offset token tracking
* **Ordered ingestion** within each channel
* Streaming into Snowflake-managed [Apache Iceberg](../tables-iceberg.md) tables

## Why use Snowpipe Streaming

* **Exactly-once delivery**: Built-in offset token tracking enables exactly-once semantics. Your application tracks committed offsets and replays from the last committed position on recovery, preventing duplicate data and data loss. For more information, see [Offset tokens and exactly-once delivery](snowpipe-streaming-channels.md).
* **Ordered ingestion**: Rows are ingested in order within each [channel](snowpipe-streaming-channels.md). Channels map naturally to source partitions (for example, Kafka topic partitions), enabling deterministic replay and zero-loss recovery.
* **High throughput, low latency**: Designed to support ingest speeds of up to 10 GB/s per table, with data available for query in as low as 5 seconds.
* **In-flight transformations**: Cleanse, reshape, and transform data during ingestion by using COPY command syntax within the PIPE object. Filter rows, reorder columns, cast types, and apply expressions before data is committed to the target table, with no separate ETL step needed.
* **Pre-clustering at ingest time**: Sort data during ingestion for optimized query performance on tables with clustering keys.
* **Apache Iceberg table support**: Stream data into Snowflake-managed Iceberg tables, including both Iceberg v2 and [Iceberg v3](../tables-iceberg-v3-specification-support.md) tables. For more information, see [Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables](snowpipe-streaming-high-performance-iceberg.md).
* **Schema evolution**: Automatically adapt table schemas to changing data structures. Snowflake can add new columns detected in the incoming stream without manual DDL changes.
* **Simplified pipelines**: SDKs write rows directly into tables, bypassing the need for staging files or intermediate cloud storage.
* **Serverless and scalable**: Compute resources scale automatically based on ingestion load. No infrastructure to manage.
* **Transparent pricing**: Throughput-based billing calculated by credits per uncompressed GB of data ingested. For more information, see [Snowpipe Streaming high-performance architecture: Understand your costs](snowpipe-streaming-high-performance-cost.md).

## How to connect

Snowpipe Streaming supports multiple ingestion paths to fit different workloads:

| Integration | Best for |
| --- | --- |
| [Java SDK](https://central.sonatype.com/artifact/com.snowflake/snowpipe-streaming) ([Java API reference](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/com/snowflake/ingest/streaming/package-summary.html)) | High-throughput custom applications. Requires Java 11 or later. |
| [Python SDK](https://pypi.org/project/snowpipe-streaming/) ([Python API reference](https://docs.snowflake.com/en/user-guide/snowpipe-streaming-sdk-python/reference/latest/index)) | Data engineering and Python-native workflows. Requires Python 3.9 or later. |
| [REST API](snowpipe-streaming-high-performance-rest-api.md) | Lightweight workloads, IoT devices, and edge deployments. |
| [Snowflake Connector for Kafka](../kafka-connector.md) | Apache Kafka topic ingestion. |

Both the Java and Python SDKs use a Rust-based client core for improved client-side performance and lower resource usage.

> **Note:**
>
> We recommend that you begin with the Snowpipe Streaming SDK over the REST API to benefit from the improved performance and getting-started experience.

To get started, see [Tutorial: Get started with the SDK](snowpipe-streaming-high-performance-getting-started.md) or [Tutorial: Get started with the REST API](snowpipe-streaming-high-performance-rest-tutorial.md).

For technical details about the PIPE object, channels, offset tokens, and supported data types, see [Key concepts](snowpipe-streaming-high-performance-overview.md).

## Recommended for

* High-volume streaming workloads requiring up to 10 GB/s throughput
* Real-time analytics and dashboards with data freshness as low as 5 seconds
* IoT and edge deployments using the REST API
* CDC (Change Data Capture) pipelines with exactly-once delivery guarantees
* Apache Kafka topic ingestion using the [Snowflake Connector for Kafka](../kafka-connector.md)
* Streaming into [Apache Iceberg](../tables-iceberg.md) tables for open table format analytics

> **Note:**
>
> Looking for SQL-native streaming? See [Dynamic Tables](../dynamic-tables-about.md) and [Streams](../streams-intro.md) with [Tasks](../tasks-intro.md) for declarative streaming pipelines.

## Snowpipe Streaming versus Snowpipe

Snowpipe Streaming is intended to complement Snowpipe, not replace it. Use Snowpipe Streaming in scenarios where data arrives as rows (for example, from Apache Kafka topics, IoT devices, or application events) instead of files. With Snowpipe Streaming, you don’t need to create files to load data into Snowflake tables.

The following table describes the differences between Snowpipe Streaming and Snowpipe:

| Category | Snowpipe Streaming | Snowpipe |
| --- | --- | --- |
| Form of data to load | Rows | Files. If your existing data pipeline generates files in blob storage, we recommend using Snowpipe instead. |
| Data ordering | Ordered insertions within each channel | Not supported. Snowpipe can load data from files in an order different from the file creation timestamps in cloud storage. |
| Load history | Load history recorded in [SNOWPIPE_STREAMING_FILE_MIGRATION_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_file_migration_history.md) (Account Usage) | Load history recorded in [COPY_HISTORY](../../sql-reference/account-usage/copy_history.md) (Account Usage) and [COPY_HISTORY function](../../sql-reference/functions/copy_history.md) (Information Schema) |
| Pipe object | The PIPE object is the server-side processing layer for all streaming ingestion. It handles schema validation, in-flight transformations, and pre-clustering. A default pipe is created automatically for each table, or you can create a custom pipe for advanced processing. | A pipe object queues and loads staged file data into target tables. |

## In this section

**Key concepts**

* [Channels and exactly-once delivery](snowpipe-streaming-channels.md)
* [The PIPE object](snowpipe-streaming-pipe-object.md)
* [Table support and schema](snowpipe-streaming-table-support.md)
* [Operations and reference](snowpipe-streaming-operations.md)

**Get started**

* [Tutorial: Get started with the SDK](snowpipe-streaming-high-performance-getting-started.md)
* [Tutorial: Get started with the REST API](snowpipe-streaming-high-performance-rest-tutorial.md)
* [Configurations and examples](snowpipe-streaming-high-performance-configurations.md)

**Ingestion targets**

* [Iceberg tables](snowpipe-streaming-high-performance-iceberg.md)

**Operations**

* [Best practices](snowpipe-streaming-high-performance-best-practices.md)
* [Error handling](snowpipe-streaming-high-performance-error-handling.md)
* [Error logging](snowpipe-streaming-error-tables.md)
* [Costs](snowpipe-streaming-high-performance-cost.md)
* [Limitations and considerations](snowpipe-streaming-high-performance-limitations.md)
* [Migration from classic architecture](snowpipe-streaming-high-performance-migration.md)

**Reference**

* [REST API endpoints](snowpipe-streaming-high-performance-rest-api.md)
* [Python SDK Reference](https://docs.snowflake.com/en/user-guide/snowpipe-streaming-sdk-python/reference/latest/index)
* [Java SDK Reference](https://docs.snowflake.com/user-guide/snowpipe-streaming-sdk/reference/java/index.html)
* [Comparison: Classic vs current SDK](snowpipe-streaming-high-performance-comparison.md)

## Classic architecture

> **Important:**
>
> The classic architecture, which uses the [snowflake-ingest-sdk](https://mvnrepository.com/artifact/net.snowflake/snowflake-ingest-sdk) Java SDK, is planned for deprecation. No immediate changes are required. Current workloads continue to be fully supported.
>
> For full details, see [Notice of planned deprecation](snowpipe-streaming-classic-deprecation.md).

If you have existing workloads running on the classic architecture, see [Classic architecture](snowpipe-streaming-classic-overview.md). For a detailed comparison of differences, see [Comparison between high-performance and classic SDKs](snowpipe-streaming-high-performance-comparison.md).

If you’re upgrading to the high-performance architecture, see [Migration guide](snowpipe-streaming-high-performance-migration.md).

---
title: Snowpipe Streaming classic architecture
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-overview.md
section: User Guide
---

# Snowpipe Streaming classic architecture

> **Important:**
>
> * Advance notice: Snowpipe Streaming classic architecture is fully supported today, but it is planned for future deprecation.
> * Action: No immediate changes are required. Your current workloads are safe and continue to be fully supported.
> * Timeline: Snowflake plans to issue a formal deprecation announcement in mid-2026. This milestone refers to the announcement date only. After the deprecation announcement, an 18-month migration window begins before the end-of-life date.
> * Recommendation: Use the high-performance architecture for all new implementations.
>
> For full details, FAQs, and migration guidance, see [Notice of planned deprecation](snowpipe-streaming-classic-deprecation.md).

> **Note:**
>
> New to Snowpipe Streaming? See the [Snowpipe Streaming overview](data-load-snowpipe-streaming-overview.md) for current capabilities built on the high-performance architecture.

The Snowpipe Streaming classic architecture offers a proven and efficient method for continuous, low-latency, row-based data ingestion directly into Snowflake tables. This implementation, referred to as Snowpipe Streaming Classic in the documentation, remains a reliable choice for diverse streaming workloads such as application event data, Internet of things (IoT) sensor readings, and low-latency Change Data Capture (CDC).

Snowpipe Streaming Classic uses the `snowflake-ingest-java` SDK and operates without the explicit `PIPE` object concept for managing data flow that is central to the Snowpipe Streaming high-performance architecture. Instead, in Snowpipe Streaming Classic, channels are configured more directly against tables, offering a familiar and established approach to streaming data into Snowflake.

## Software requirements

* SDK: Use the [snowflake-ingest-sdk](https://mvnrepository.com/artifact/net.snowflake/snowflake-ingest-sdk) version 4.X or later.
* Java version: Requires Java 8 or later.
* Additional prerequisite: [Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction Policy Files](../../developer-guide/jdbc/java-install.md) must be installed for your Java 8 environment.
* For documentation on the classes and interfaces for the classic architecture, see [Snowflake Ingest SDK API](https://javadoc.io/doc/net.snowflake/snowflake-ingest-sdk/latest/overview-summary.html).

For the differences between the classic and high-performance architectures, see [API differences](snowpipe-streaming-high-performance-comparison.md).

### Custom client application

The API requires a custom Java application interface that can accept rows of data and handle errors that occur. You must ensure that the application runs continuously and can recover from failure. For a given batch of rows, the API supports the equivalent of `ON_ERROR = CONTINUE | SKIP_BATCH | ABORT`.

* `CONTINUE`: Continue to load the acceptable rows of data and return all errors.
* `SKIP_BATCH`: Skip loading and return all errors if any error is encountered in the entire batch of rows.
* `ABORT` (default setting): Abort the entire batch of rows and throw an exception when the first error is encountered.

For Snowpipe Streaming classic, the application does schema validations using the response from the `insertRow` (single row) or `insertRows` (set of rows) methods. For the error handling for the high-performance architecture, see [Error handling](snowpipe-streaming-high-performance-error-handling.md).

## Loading data into Apache Iceberg™ tables

With Snowflake Ingest SDK versions 3.0.0 and later, Snowpipe Streaming can ingest data into Snowflake-managed [Apache Iceberg](../tables-iceberg.md) tables. The Snowpipe Streaming Ingest Java SDK supports loading into both standard Snowflake tables (non-Iceberg) and Iceberg tables.

For more information, see [Snowpipe Streaming Classic with Apache Iceberg™ tables](snowpipe-streaming-classic-iceberg.md).

## Migration to optimized files in the classic architecture

The API writes the rows from channels into blobs in cloud storage, which are then committed to the target table. Initially, the streamed data written to a target table is stored in a temporary intermediate file format. At this stage, the table is considered a “mixed table” because partitioned data is stored in a mixture of native and intermediary files. An automated background process migrates data from the active intermediate files to native files that are optimized for query and DML operations as needed.

## Replication in the classic architecture

Snowpipe streaming supports the [replication and failover](../account-replication-intro.md) of Snowflake tables populated by Snowpipe Streaming and its associated channel offsets from a source account to a target account in different [regions](../intro-regions.md) and across [cloud platforms](../intro-cloud-platforms.md).

For more information, see [Replication and Snowpipe Streaming](../account-replication-considerations.md).

---
title: Snowpipe Streaming Classic with Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-classic-iceberg.md
section: User Guide
---

# Snowpipe Streaming Classic with Apache Iceberg™ tables

With Snowflake Ingest SDK versions 3.0.0 and later, Snowpipe Streaming can ingest data into Snowflake-managed [Apache Iceberg](../tables-iceberg.md) tables. The Snowpipe Streaming Ingest Java SDK supports loading into both standard Snowflake tables (non-Iceberg) and Iceberg tables.

Data sent through the Snowpipe Streaming API ingests rows through one or more channels, which are automatically flushed according to the specified `MAX_CLIENT_LAG`.

The `MAX_CLIENT_LAG` property controls the latency of streaming ingestion:

* For standard Snowflake tables (non-Iceberg), the default `MAX_CLIENT_LAG` is 1 second.
* For Iceberg tables, the default `MAX_CLIENT_LAG` is 30 seconds.

Snowflake connects to your storage location using an [external volume](../tables-iceberg.md), and Snowpipe Streaming flushes the data to create Iceberg-compatible Parquet data files with corresponding Iceberg metadata. These Parquet data and metadata files are uploaded to your configured external cloud storage location and made available as Snowflake-managed Iceberg tables registered with Snowflake as the Iceberg catalog.

## Configurations

Create your [Snowflake-managed Iceberg table with your configured external volume](../../sql-reference/sql/create-iceberg-table-snowflake.md) and specify the Iceberg table name in your open [channel](snowpipe-streaming-channels.md) request.

To enable Snowpipe Streaming with the Snowflake-managed Iceberg table, you need to set the following property `ENABLE_ICEBERG_STREAMING=true` in the `profile.json` file.

## Supported data types

* The Snowflake Ingest SDK supports most of the Iceberg data types, the same as what Snowflake currently supports. For more information, see [Data types for Apache Iceberg™ tables](../tables-iceberg-data-types.md).
* The Snowflake Ingest SDK supports ingestion into the three [structured data types](../../sql-reference/data-types-structured.md): Structured ARRAY, Structured OBJECT, Structured MAP.

## Usage notes

* The default `MAX_CLIENT_LAG` for streaming to Snowflake-managed Iceberg tables is 30 seconds to ensure optimized Parquet files. You can set the property to a lower value, but we recommend *not* doing this unless there is a significantly high throughput.
* The Ingest SDK supports automatic serverless compaction of small Parquet files asynchronously.
* The same client application cannot be used for Iceberg and non-Iceberg tables simultaneously.
* Snowflake-managed Iceberg tables do not support client-side encryption.
* The Iceberg compatible Parquet files are created based on the [STORAGE_SERIALIZATION_POLICY](../../sql-reference/parameters.md) specified on the Iceberg table.
* Snowpipe Streaming only supports Snowflake as the Iceberg catalog, but it also supports [syncing with Snowflake Open Catalog](../tables-iceberg-open-catalog-sync.md).
* Snowflake connects to your storage location using an [external volume](../tables-iceberg.md). You are responsible for [data storage](../tables-iceberg.md) for Iceberg tables.
* Only Iceberg v2 tables are supported. [Iceberg v3 tables](../tables-iceberg-v3-specification-support.md) aren’t supported. To use Snowpipe Streaming with Iceberg v3 tables, you must use
  [Snowpipe Streaming with high-performance architecture](snowpipe-streaming-high-performance-iceberg.md).

## Limitations

The [Snowpipe Streaming limitations](data-load-snowpipe-streaming-overview.md) and [Iceberg tables limitations](../tables-iceberg.md) also apply to Snowpipe Streaming with Iceberg tables.

---
title: Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-iceberg.md
section: User Guide
---

# Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables

Snowpipe Streaming with high-performance architecture supports ingesting data into Snowflake-managed [Apache Iceberg](../tables-iceberg.md) tables, including both Iceberg v2 and [Iceberg v3](../tables-iceberg-v3-specification-support.md) tables. This enables near real-time streaming of data into Iceberg tables with all the performance benefits of the high-performance architecture.

> **Note:**
>
> The classic architecture supports Iceberg v2 tables only. If you need Iceberg v3 support, you must use the high-performance architecture. For more information about Iceberg support in the classic architecture, see [Snowpipe Streaming Classic with Apache Iceberg™ tables](snowpipe-streaming-classic-iceberg.md).

## How it works

Snowpipe Streaming ingests data through the PIPE object into your target Iceberg table. Snowflake creates Iceberg-compatible Apache Parquet data files with corresponding Iceberg metadata, and uploads them to your configured external cloud storage location. The data is made available as a Snowflake-managed Iceberg table registered with Snowflake as the Iceberg catalog.

Snowflake connects to your storage location using an [external volume](../tables-iceberg.md).

## Get started

This section provides a step-by-step example of how to set up Snowpipe Streaming with high-performance architecture to ingest data into an Iceberg table.

### Step 1: Create an external volume

Create an [external volume](../tables-iceberg-configure-external-volume.md) that specifies a storage location for your Iceberg table data.

Grant the streaming role USAGE on the external volume:

```sqlexample
GRANT USAGE ON EXTERNAL VOLUME my_external_volume TO ROLE my_streaming_role;
```

### Step 2: Create a Snowflake-managed Iceberg table

Create a [Snowflake-managed Iceberg table](../../sql-reference/sql/create-iceberg-table-snowflake.md) with your configured external volume:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    event_id NUMBER,
    event_type STRING,
    event_data VARIANT,
    event_timestamp TIMESTAMP_NTZ
)
    CATALOG = 'SNOWFLAKE'
    EXTERNAL_VOLUME = 'my_external_volume'
    BASE_LOCATION = 'my_iceberg_table/'
    ICEBERG_VERSION = 3;
```

> **Note:**
>
> If you omit the `ICEBERG_VERSION` parameter, the table defaults to Iceberg v2.

### Step 3: Create a pipe for ingestion

Create a pipe that targets the Iceberg table. You can use the default pipe (automatically created) or create a custom pipe:

```sqlexample
-- Option 1: Use the default pipe.
-- The default pipe is automatically created when you open a channel
-- against the table using the SDK. The default pipe name follows the
-- convention: <TABLE_NAME>-STREAMING (for example, MY_ICEBERG_TABLE-STREAMING).

-- Option 2: Create a custom pipe with explicit column mapping.
CREATE OR REPLACE PIPE my_iceberg_pipe AS
    COPY INTO my_iceberg_table (event_id, event_type, event_data, event_timestamp)
    FROM (SELECT $1:event_id, $1:event_type, $1:event_data, $1:event_timestamp);
```

### Step 4: Stream data using the SDK

Configure the SDK to stream data into your Iceberg table through the pipe. Use the same SDK setup as described in [Tutorial: Get started with Snowpipe Streaming high-performance architecture SDK](snowpipe-streaming-high-performance-getting-started.md), specifying your Iceberg table’s pipe in the client configuration.

## Supported Iceberg versions

The high-performance architecture supports both Iceberg v2 and [Iceberg v3](../tables-iceberg-v3-specification-support.md) tables.

The [classic architecture](snowpipe-streaming-classic-iceberg.md) supports only Iceberg v2 tables.

## Supported data types

The Snowflake Ingest SDK supports most of the Iceberg data types that Snowflake currently supports. For more information, see [Data types for Apache Iceberg™ tables](../tables-iceberg-data-types.md).

The SDK also supports ingestion into the three [structured data types](../../sql-reference/data-types-structured.md): Structured ARRAY, Structured OBJECT, and Structured MAP.

## Usage notes

* Snowpipe Streaming only supports **Snowflake as the Iceberg catalog**. Externally managed Iceberg tables that use external catalogs (such as AWS Glue or Hive Metastore) aren’t supported. However, you can [sync your Snowflake-managed Iceberg tables with Snowflake Open Catalog](../tables-iceberg-open-catalog-sync.md).
* Snowflake connects to your storage location using an [external volume](../tables-iceberg.md). You are responsible for [data storage](../tables-iceberg.md) for Iceberg tables.
* The Iceberg-compatible Parquet files are created based on the [STORAGE_SERIALIZATION_POLICY](../../sql-reference/parameters.md) specified on the Iceberg table.

## Limitations

The following limitations apply to Snowpipe Streaming with high-performance architecture and Iceberg tables:

* Partitioned Iceberg tables aren’t supported.
* Schema evolution isn’t supported for Iceberg tables.
* Length-constrained VARCHAR columns (for example, `VARCHAR(100)`) aren’t supported for Iceberg tables. Use STRING or VARCHAR without a length constraint.

The [Snowpipe Streaming high-performance architecture limitations](snowpipe-streaming-high-performance-limitations.md) and [Iceberg tables limitations](../tables-iceberg.md) also apply.

---
title: Snowpipe Streaming high-performance architecture: Understand your costs
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-cost.md
section: User Guide
---

# Snowpipe Streaming high-performance architecture: Understand your costs

This document outlines the billing model for the new high-performance architecture of Snowpipe Streaming, designed for transparent and predictable pricing.

## Billing model: Throughput based

The high-performance architecture introduces a flat-rate pricing model based on the volume of uncompressed data ingested.

* Rate: Charged per uncompressed gigabyte (GB). For the current rate, see the [Snowflake Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
* Metering: Data is metered by the Snowpipe Streaming service during the ingestion process.
* Measurement basis: Billing is based on the input bytes received by Snowpipe Streaming, not the final byte count produced in the target table. This means the raw, uncompressed data volume sent to Snowpipe Streaming is what’s measured for billing.

> **Note:**
>
> Snowflake charges only for the data values ingested, not for the structural elements like keys. For instance, when ingesting a JSON file that contains both keys and values, billing is based solely on the byte size of the values. This is because the values represent the actual data being ingested, similar to how data in a CSV file (without headers explicitly included in the data charge) would be measured.

**Key change from Snowpipe Streaming Classic**: This new model differs significantly from the Snowpipe Streaming Classic billing, where credits are primarily based on serverless compute usage and active client connections.

### Billing example

Let’s consider an example where you are ingesting uncompressed data values at a rate of 1 Megabyte per second (MB/s).

* Data Values Ingested per Second: 1 MB
* Data Values Ingested per Hour: 1 MB/s \* 3600 s/hour = 3600 MB/hour = 3.6 GB/hour (assuming 1 GB = 1000 MB for this billing context).
* Credits Consumed per Hour: 3.6 GB/hour \* current rate per GB

For the exact credit calculation, see the current rate provided on the [Snowflake Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Monitoring your usage and costs

To understand the data ingested and the corresponding credits consumed within the high-performance architecture, you can query the ACCOUNT_USAGE.METERING_HISTORY view.

Here is an example:

```sqlexample
SELECT *
FROM snowflake.account_usage.metering_history m
JOIN snowflake.account_usage.pipes p
  ON m.entity_id = p.pipe_id
 AND m.name = p.pipe_name
 AND m.service_type = 'SNOWPIPE_STREAMING';
```

### Distinguishing costs: High-performance vs. Classic Snowpipe Streaming

It’s important to be able to differentiate costs originating from the new high-performance architecture versus the existing Snowpipe Streaming Classic model. You can achieve this by querying your billing history and filtering based on the service type or other distinguishing attributes.

### Snowpipe Streaming Classic credits

For information on the billing model for Snowpipe Streaming Classic, see [Costs for Snowpipe Streaming Classic](snowpipe-streaming-classic-billing.md).

---
title: Snowpipe Streaming key concepts
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md
section: User Guide
---

# Snowpipe Streaming key concepts

This section describes the key concepts for Snowpipe Streaming. For an introduction to Snowpipe Streaming and its capabilities, see [Snowpipe Streaming](data-load-snowpipe-streaming-overview.md).

[Channels and exactly-once delivery](snowpipe-streaming-channels.md)
:   Learn how channels provide ordered ingestion and how offset tokens enable exactly-once delivery, including crash recovery examples for Kafka and log file ingestion.

[The PIPE object](snowpipe-streaming-pipe-object.md)
:   Understand how the PIPE object manages server-side data processing, including in-flight transformations, pre-clustering, and the default pipe for simplified setup.

[Table support and schema](snowpipe-streaming-table-support.md)
:   Review supported table types (including Apache Iceberg v2 and v3), schema evolution, supported Java data types, and insert-only operation constraints.

[Operations and reference](snowpipe-streaming-operations.md)
:   Monitor ingestion health, understand required access privileges, and reference observability tools.

---
title: Snowpipe Streaming migration guide
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-migration.md
section: User Guide
---

# Snowpipe Streaming migration guide

This guide describes how to migrate from the classic Snowpipe Java SDK to the high-performance Snowpipe Streaming SDK. The architectural changes and API updates discussed here also apply to migrations to the Python SDK, because the high-performance architecture is available in both languages. Although the code examples in this document are in Java, the core migration principles remain consistent across languages.

## Key architectural changes

The following table summarizes the most important architectural changes in the high-performance Snowpipe Streaming SDK. For a detailed comparison of the SDKs, see [Comparison between Snowpipe Streaming high-performance and classic SDKs](snowpipe-streaming-high-performance-comparison.md).

| Area | Classic (snowflake-ingest-java) | High-Performance (snowpipe-streaming SDK) |
| --- | --- | --- |
| Entry point | Data is ingested directly into tables. | Data is ingested through PIPE objects, which support transforms and schema enforcement. |
| SDK / Core | Java SDK only. | SDK in multiple languages (Java and Python) with a shared Rust core. |
| API names | `insertRow`/`insertRows`, `openChannel(request)` | `appendRow`/`appendRows`, `openChannel(channelName, offsetToken)` |
| Error handling | Client-side validation is performed. | Server-side validation with richer error feedback is provided. |
| Backpressure handling | Puts the thread to sleep, leading to a blocked/unresponsive state. | Returns an error, allowing the caller to implement a backoff/retry strategy. |
| Client-to-table mapping | A single client object could open channels to any table. | A single client object is now exclusively tied to one pipe object. |
| Billing | Based on compute and client count. | Flat, per-GB ingested. |
| Schema / transforms | Managed on the client side. | Managed on the server side through the PIPE definition. |

## Migration process

To migrate your application to the high-performance SDK, complete the following high-level steps:

1. For each target table, [create a PIPE](../../sql-reference/sql/create-pipe.md).

   ```sqlexample
   CREATE PIPE my_pipe
   AS COPY INTO my_table
     FROM TABLE (DATA_SOURCE(TYPE => 'STREAMING'))
     MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE
     [CLUSTER_AT_INGEST_TIME = TRUE];
   ```
2. Stop ingestion from all classic clients.
3. For each channel in the classic client, confirm the last committed offsets. To retrieve these offsets, use the `getLatestCommittedOffsetTokens()` method from the classic SDK. Verify that these offsets align with your client-side records.
4. Update your application code.

   * Switch your project dependencies to the high-performance SDK (Java or Python).
   * Update your API calls as detailed in the following API and configuration changes section.
   * Initialize one client per table/PIPE by using the last committed offset from Snowflake.
5. After your new client is configured and stable, resume ingestion.

## API and configuration changes

The following changes must be made to your API calls and configuration settings during migration:

### Client initialization

* Classic: `builder(name)`
* High-performance: `builder(name, db, schema, pipeName)`

### Channels

* Classic: `openChannel(OpenChannelRequest)`
* High-performance: `openChannel(channelName, offsetToken)` returns both channel and status

### Ingestion methods

* Classic: `insertRow/insertRows(...)`
* High-performance: `appendRow/appendRows(...)`

### Offset tracking

* The classic SDK’s `getLatestCommittedOffsetTokens(channels)` method offers limited visibility and lacks error context.
* The high-performance SDK still supports `getLatestCommittedOffsetTokens(...)`, but for robust monitoring, we recommend that you use `getChannelStatuses(...)`. This method performs the following tasks:

  + Confirms that offsets are advancing as expected.
  + Returns error counts and detailed error information per channel.
  + Enables proactive monitoring and troubleshooting of your data pipelines.

### Handling semi-structured data

When migrating to the high-performance SDK, review how your application provides data for ARRAY and VARIANT columns to avoid data being stored as literal strings.

#### Behavioral change

Passing a serialized string literal — for example, “[1, 2, 3]” — to an ARRAY column in v2 results in a single-element array containing that string literal. To maintain the classic architecture behavior, select one of the following options:

#### Option 1: Pass native objects (Recommended)

Update your client application to deserialize JSON strings into native objects before calling `appendRow`.

* **Java**: Use `java.util.List` for arrays and `java.util.Map` for objects.
* **Python**: Use native `list` and `dict` types.

**Benefit**: Compatible with the default pipe and automatic schema evolution.

#### Option 2: Pipe-side transformation

Explicitly define the `Pipe` object with transformation logic by using the `PARSE_JSON` function.

**Example SQL**

```sqlexample
CREATE PIPE my_pipe AS
COPY INTO my_table (my_array_col)
FROM (SELECT PARSE_JSON($1:my_array_col) FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING')));
```

> **Note:**
>
> This method is incompatible with the default pipe and the automatic schema evolution features.

---
title: Snowpipe Streaming REST API endpoints
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-rest-api.md
section: User Guide
---

# Snowpipe Streaming REST API endpoints

> **Note:**
>
> We recommend that you begin with the Snowpipe Streaming SDK over the REST API to benefit from the improved performance and getting-started experience.

The Snowpipe Streaming REST API is designed for lightweight workloads and provides a flexible way to integrate with external applications without using the Snowpipe Streaming SDK.

The following  diagram provides a visual overview of how data flows from the client to the Snowflake server, detailing each of the key API endpoints in the process.

## Request headers

The following request headers apply to all the endpoints for the Snowpipe Streaming REST API:

| Header | Description |
| --- | --- |
| `Authorization` | Authentication token |
| `X-Snowflake-Authorization-Token-Type` (optional) | JWT/OAuth |
| `Content-Encoding` (optional) | Specifies the compression format of the payload. Supported: `gzip`, `zstd`. |

> **Note:**
>
> The maximum allowed size for a single request payload is 16 MB. If your data is larger, you must split it into multiple requests.

## Get Hostname

The `Get Hostname` returns the hostname used to interact with the Snowpipe Streaming REST API. Each account has a unique hostname.

```output
GET /v2/streaming/hostname
```

Response:

```json
{
  "hostname": "string"
}
```

Description of response fields:

| Field | Type | Description |
| --- | --- | --- |
| Hostname | String | The hostname of the account. |

## Exchange Scoped Token

The `Exchange Scoped Token` returns a security token that can be used to access only the Snowpipe Streaming API-related service. This provides security protection for the customer.

```output
POST /oauth/token
```

Request:

| Attribute | Required | Component | Description |
| --- | --- | --- | --- |
| content_type | Yes | Header | “application/x-www-form-urlencoded” |
| grant_type | Yes | Payload | “<urn:ietf:params:oauth:grant-type:jwt-bearer>” |
| scope | Yes | Payload | The hostname of the account. |

Response:

```json
{
  "token": "string"
}
```

Description of response fields:

| Field | Type | Description |
| --- | --- | --- |
| Token | String | The scoped token. |

## Open Channel

The `Open Channel` operation creates or opens a new channel against a pipe or table. If the channel already exists, Snowflake bumps the client sequencer of the channel and returns the last committed offset token.

```output
PUT /v2/streaming/databases/{databaseName}/schemas/{schemaName}/pipes/{pipeName}/channels/{channelName}
```

Request:

| Attribute | Required | Component | Description |
| --- | --- | --- | --- |
| databaseName | Yes | URI | Database name, case-insensitive. |
| schemaName | Yes | URI | Schema name, case-insensitive. |
| pipeName | Yes | URI | Pipe name, case-insensitive. |
| channelName | Yes | URI | The name of the channel that you create or re-open, case-insensitive. |
| offset_token | No | Payload | String used to set an offset token when opening a channel. |
| requestId | No | Query parameter | A universally unique identifier (UUID) used to track requests through the system. |

Response:

```json
{
  "next_continuation_token": "string",
  "channel_status": {
    "database_name": "string",
    "schema_name": "string",
    "pipe_name": "string",
    "channel_name": "string",
    "channel_status_code": "string",
    "last_committed_offset_token": "string",
    "created_on_ms": "long",
    "rows_inserted": "int",
    "rows_parsed": "int",
    "rows_error_count": "int",
    "last_error_offset_upper_bound": "string",
    "last_error_message": "string",
    "last_error_timestamp": "timestamp_utc",
    "snowflake_avg_processing_latency_ms": "int"
  }
}
```

Description of response fields:

| Field | Type | Description |
| --- | --- | --- |
| next_continuation_token | String | An API-managed token that must be used in the subsequent Append Rows request. The token links a series of calls, ensuring a contiguous, in-order stream of data and maintaining the session state for exactly once delivery. |
| channel_status | Object | A nested object with the following detailed information about the channel:   * database_name (String): The name of the database where the pipe is located. * schema_name (String): The name of the schema where the pipe is located. * pipe_name (String): The name of the specific pipe being used. * channel_name (String): The name of the streaming channel. * channel_status_code (String): A code that indicates the current status of the channel; for example, “ACTIVE”. * last_committed_offset_token (String): The token that represents the last successfully committed offset. * created_on_ms (Long): The timestamp, in milliseconds, when the channel was created. * rows_inserted (Int): The total number of rows successfully inserted. * rows_parsed (Int): The total number of rows parsed. * rows_error_count (Int): The total number of rows that encountered an error. * last_error_offset_upper_bound (String): A token that indicates the upper bound of the offset where the last error occurred. * last_error_message (String): The message of the last error that occurred. * last_error_timestamp (Long): The timestamp, in milliseconds, of the last error. * snowflake_avg_processing_latency_ms (Int): The average processing latency of Snowflake in milliseconds. |

## Append Row(s)

The `Append Rows` operation inserts a batch of rows to the given channel.

```output
POST /v2/streaming/data/databases/{databaseName}/schemas/{schemaName}/pipes/{pipeName}/channels/{channelName}/rows
```

Request:

| Attribute | Required | Component | Description |
| --- | --- | --- | --- |
| databaseName | Yes | URI | Database name, case-insensitive. |
| schemaName | Yes | URI | Schema name, case-insensitive. |
| pipeName | Yes | URI | Pipe, case-insensitive. |
| channelName | Yes | URI | Channel name, case-insensitive. |
| continuationToken | Yes | Query parameter | Continuation token from Snowflake, encapsulates both client and row sequencers. |
| offsetToken | No | Query parameter | String used to set an offset token per batch. |
| rows | Yes | Payload | The actual data payload to be ingested in NDJSON format. The maximum allowed size for this attribute is 4 MB. |
| requestId | No | Query parameter | A UUID used to track requests through the system. |

> **Note:**
>
> The JSON text within the NDJSON payload must strictly conform to the `RFC 8259` standard. Each JSON text must be followed by a newline character `\n` (`0x0A`). You can also insert a carriage return `\r` (`0x0D`) before the newline character.

Response:

```json
{
  "next_continuation_token": "string"
}
```

Description of response fields:

| Field | Type | Description |
| --- | --- | --- |
| next_continuation_token | string | The next continuation token from Snowflake, which encapsulates both client and row sequencers. It should be used for inserting the next batch. |

## Drop Channel

The `Drop Channel` operation drops a channel at server side along with its metadata.

```output
DELETE /v2/streaming/databases/{databaseName}/schemas/{schemaName}/pipes/{pipeName}/channels/{channelName}
```

Request:

| Attribute | Required | Component | Description |
| --- | --- | --- | --- |
| databaseName | Yes | URI | Database name, case-insensitive |
| schemaName | Yes | URI | Schema name, case-insensitive |
| pipeOrTableName | Yes | URI | Pipe or table name, case-insensitive |
| channelName | Yes | URI | Channel name, case-insensitive |
| requestId | No | Query parameter | A UUID used to track requests through the system |

Response:

This operation returns a payload with no specific successful response other than the HTTP status code.

## Bulk Get Channel Status

The `Bulk Get Channel Status` operation returns the status of a channel for a specific client sequencer.

```output
POST /v2/streaming/databases/{databaseName}/schemas/{schemaName}/pipes/{pipeName}:bulk-channel-status
```

Request:

| Attribute | Required | Component | Description |
| --- | --- | --- | --- |
| databaseName | Yes | URI | Database name, case-insensitive |
| schemaName | Yes | URI | Schema name, case-insensitive |
| pipeName | Yes | URI | Pipe name, case-insensitive |
| channel_names | Yes | Payload | An array of String channel names that the customer wants to get status for; the names are case-sensitive. For example, `{"channel_names":["channel1", "channel2"]}`. |

Response:

```json
{
  "channel_statuses": {
    "channel1": {
      "channel_status_code": "String",
      "last_committed_offset_token": "String",
      "database_name": "String",
      "schema_name": "String",
      "pipe_name": "String",
      "channel_name": "String",
      "rows_inserted": "int",
      "rows_parsed": "int",
      "rows_errors": "int",
      "last_error_offset_upper_bound": "String",
      "last_error_message": "String",
      "last_error_timestamp": "timestamp_utc",
      "snowflake_avg_processing_latency_ms": "int"
    },
    "channel2": {
      "comment": "same structure as channel1"
    }
    "comment": "potentially other channels"
  }
}
```

> **Note:**
>
> If no requested channel is found in the service, the response payload doesn’t have an entry for that channel within the `channel_statuses` object.

Description of `channel_statuses` fields for each channel:

| Field | Type | Description |
| --- | --- | --- |
| channel_status_code | String | Indicates the status of the channel. |
| last_committed_offset_token | String | Latest committed offset token. |
| database_name | String | The name of the database that the channel belongs to. |
| schema_name | String | The name of the schema that the channel belongs to. |
| pipe_name | String | The name of the pipe that the channel belongs to. |
| channel_name | String | The name of the channel. |
| rows_inserted | int | A count of all rows inserted into this channel. |
| rows_parsed | int | A count of all rows parsed, but not necessarily inserted into this channel. |
| rows_errors | int | A count of all rows that experienced errors when inserted into this channel and were therefore rejected. |
| last_error_offset_upper_bound | String | The upper bound for an ingestion error. The error will be located at or before this committed offset token. |
| last_error_message | String | A human readable message corresponding to the latest error code for that channel, with sensitive customer data redacted. |
| last_error_timestamp | timestamp_utc | Timestamp at the time when the last error occurred. |
| snowflake_avg_processing_latency_ms | int | Average end-to-end processing time for this channel. |

## Error response structure

The Snowpipe Streaming REST APIs return a JSON payload for error responses. This structure provides actionable information for both automated error handling and human analysis.

The response payload has the following structure:

```json
{
  "code": "...",
  "message": "..."
}
```

### Response fields

| Field | Type | Description |
| --- | --- | --- |
| Code | String | A stable, programmatic error code. This value can be used for automated error handling and logging. For example, an application’s logic can check for a specific code to trigger a predefined action. |
| Message | String | A human-readable message that describes the error. This message is subject to change and shouldn’t be used for automated parsing. |

### Example

The following example shows an error response you might receive:

```json
{
  "code": "STALE_CONTINUATION_TOKEN_SEQUENCER",
  "message": "Channel sequencer in the continuation token is stale. Please reopen the channel"
}
```

This example shows the response for an attempt to use a continuation token with a stale channel sequencer. The code provides a clear, machine-readable identifier for the error, and the message offers a helpful, descriptive text for a user.

---
title: Snowsight navigation menu
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-navigation.md
section: User Guide
---

# Snowsight navigation menu

## Overview

Use this guide to familiarize yourself with the location of various features and workflows in Snowsight.

Snowflake has rolled out updates to Snowsight’s navigation menu. The Snowsight navigation menu contains new groups, and some pages have been separated into their own items. Features are now grouped under key categories, helping you find what you need more quickly.

Work with data:

* **Projects:** Analyze data and develop applications using tools such as Worksheets, Notebooks, Streamlit, Dashboards, and Native Apps.
* **Ingestion:** Use connectors and tools to ingest data into Snowflake.
* **Transformation:** Monitor and manage data transformation jobs and pipelines using dynamic tables and tasks.
* **AI & ML:** Use Snowflake Cortex AI and Snowflake ML to analyze unstructured data, build models, and create intelligent agents.
* **Monitoring:** Monitor query history, container services, job history, and traces and logs.
* **Marketplace:** Discover and access third-party data, apps, and agentic products.

Horizon Catalog:

* **Catalog:** Browse all of your data in one place, including the databases in your account and other data products and apps published within your organization.
* **Data sharing:** Publish data products to your Internal Marketplace, share privately with other Snowflake accounts, or sell on Snowflake Marketplace.
* **Governance & security:** Manage permissions, monitor data access, and enforce security policies. Keep your data protected while maintaining compliance.

Manage:

* **Compute:** Manage your warehouse and compute pool resources in Snowflake.
* **Admin:** Administer your accounts, oversee billing and terms, and manage admin contacts and integrations.

## Shortcuts

You can pin up to three Snowsight pages for quick access. To pin a page to the shortcut menu, hover over the page name in the navigation and select the pin icon.

To unpin a shortcut, select the filled pin icon under Shortcuts.

To rearrange pinned shortcuts, hover over the shortcut and drag it to its new position.

## Navigation mapping

This table outlines the new location of items in the navigation based on where they were located in the previous version.

| Previous navigation | New navigation |
| --- | --- |
| Data » Databases | Catalog » Database Explorer |
| Data » Add Data | Ingestion » Add data |
| Data » Migrations | Ingestion » Migrations |
| Data » Openflow | Ingestion » Openflow |
| Data Products » Marketplace | Marketplace |
| Data Products » Marketplace » Internal Marketplace | Catalog » Internal Marketplace |
| Data Products » Apps | Catalog » Apps |
| Data Products » Private Sharing | Data sharing » Private sharing |
| Data Products » Provider Studio | Data sharing » Provider Studio |
| Data Products » Partner Connect | Admin » Partner Connect |
| Monitoring » Copy History | Ingestion » Copy history |
| Monitoring » dbt Projects | Transformation » dbt projects |
| Monitoring » Tasks | Transformation » Tasks |
| Monitoring » Dynamic Tables | Transformation » Dynamic tables |
| Monitoring » Trust Center | Governance & security » Trust Center |
| Monitoring » Governance | Governance & security » Tags & policies |
| Admin » Warehouses | Compute » Warehouses |
| Admin » Compute Pools | Compute » Compute pools |
| Admin » Users & Roles | Governance & security » Users & roles |
| Admin » Security | Governance & security » Network policies |
| Admin » Contacts | Admin » Admin contacts |

---
title: Snowsight quick tour
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-quick-tour.md
section: User Guide
---

# Snowsight quick tour

In Snowsight, you can perform data analysis and engineering tasks, monitor query and data loading and transformation activity,
explore your Snowflake database objects, and administer your Snowflake database, including managing the cost and adding users and roles.

You can use Snowsight to perform the following tasks:

**Work with data**

* Build and develop in Workspaces and Notebooks with SQL, Python, and multi-file projects.
* Ingest and transform data using Snowpipe, connectors, tasks, and streams.
* Analyze with AI using Snowflake Cortex functions, agents, and ML models.
* Monitor activity including query history, task graphs, and data loading.
* Discover and share on the Snowflake Marketplace.

**Explore the Horizon Catalog**

* Discover data across your data estate with Universal Search and the catalog.
* Share data products securely with other Snowflake accounts through listings.
* Govern and protect data with masking policies, row access policies, and tags.

**Manage your account**

* Optimize compute resources including warehouses and compute pools.
* Administer users, roles, and access control.
* Monitor and control costs with budgets and cost management views.
* Manage Postgres instances within Snowflake.

For more information about these and other tasks that you can perform, see [Snowsight: The Snowflake web interface](ui-snowsight.md).

## Work with data

### Workspaces

Workspaces is the unified editor for creating, organizing, and managing code across multiple file types. Workspaces provides a file-based
development environment where you can write SQL and Python, organize projects with folders, and integrate with Git for version control.

For more information, see:

* [Workspaces](ui-snowsight/workspaces.md)
* [Integrate workspaces with a Git repository](ui-snowsight/workspaces-git.md)
* [Work with worksheets in Snowsight](ui-snowsight-worksheets.md)

### Notebooks in Workspaces

Notebooks in Workspaces provide an interactive, cell-based environment for Python, SQL, and Markdown. Use notebooks for exploratory data analysis,
machine learning model development, and data science workflows with embedded visualizations and Git integration.

For more information, see:

* [Snowflake Notebooks in Workspaces](ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md)

### Streamlit

Build and deploy interactive data applications with Streamlit in Snowflake. Create custom dashboards, reports, and data apps
using Python without managing infrastructure.

For more information, see:

* [About Streamlit in Snowflake](../developer-guide/streamlit/about-streamlit.md)
* [Create your Streamlit app](../developer-guide/streamlit/app-development/creating-your-app.md)

### dbt projects

Develop and manage dbt projects with a web-based IDE that connects to Git repositories. Build, test, and run SQL-based data
transformation pipelines directly in Snowflake with version control integration.

For more information, see:

* [Workspaces for dbt Projects on Snowflake](data-engineering/dbt-projects-on-snowflake-using-workspaces.md)

### Ingestion

Load data into Snowflake using Snowpipe for continuous ingestion, connectors for various data sources, and file uploads through the UI.

For more information, see:

* [Load data using Snowsight](data-load-web-ui.md)
* [Snowpipe](data-load-snowpipe-intro.md)
* [Staging files using Snowsight](data-load-local-file-system-stage-ui.md)

### Transformation

Transform your data with dbt projects for analytics engineering, dynamic tables for continuously refreshed materialized views,
and tasks for scheduling transformation workflows.

For more information, see:

* [Dynamic tables](dynamic-tables-about.md)
* [Introduction to tasks](tasks-intro.md)
* [View tasks and task graphs in Snowsight](ui-snowsight-tasks.md)

### AI & ML

Build AI-powered applications with AI Studio, interact with data conversationally using Snowflake Intelligence, and leverage
Cortex AI functions for text analysis and LLM capabilities. Use Cortex Agents for natural language interactions, Cortex Analyst
for data analysis, Cortex Search for vector similarity search, and Cortex Code for AI-powered coding assistance. Manage machine
learning models, features, and experiments for production ML workflows.

For more information, see:

* [Overview of Snowflake Intelligence](snowflake-cortex/snowflake-intelligence.md)
* [Snowflake Cortex AI Functions (including LLM functions)](snowflake-cortex/aisql.md)
* [Cortex Code](cortex-code/cortex-code.md)

### Monitoring

Monitor and track query performance, container services and jobs, task execution, data loading activity, and system health.
Review query history with Performance Explorer to analyze and optimize queries, view traces and logs for observability,
and debug failed operations.

For more information, see:

* [Monitor query activity with Query History](ui-snowsight-activity.md)
* [View tasks and task graphs in Snowsight](ui-snowsight-tasks.md)
* [Monitor data loading activity by using Copy History](data-load-monitor.md)

### Marketplace

Discover and share data products on the Snowflake Marketplace. As a provider, publish data products and application packages
on the Snowflake Marketplace to share with the broader Snowflake community. As a consumer, access datasets and application packages
from providers to derive real-time data insights without needing to set up a data pipeline or write any code.

For more information, see:

* [About Snowflake Marketplace](../collaboration/collaboration-marketplace-about.md)
* [Create and configure shares](data-sharing-provider.md)
* [About the Snowflake Native App Framework](../developer-guide/native-apps/native-apps-about.md)

## Explore the Horizon Catalog

### Catalog

Discover database objects across your entire data estate with Universal Search and the Horizon Catalog. Explore databases, tables,
functions, views, and more using the Database Explorer. Browse the Internal Marketplace to find data products shared within
your organization, and manage apps and native application packages.

To learn more, see:

* [Snowflake Horizon Catalog](snowflake-horizon.md)
* [Explore and manage database objects in Snowsight](ui-snowsight-data.md)
* [About organizational listings](collaboration/listings/organizational/org-listing-about.md)

### Data sharing

Collaborate with users in other Snowflake accounts by sharing data and application packages securely through Internal sharing
(within your organization) or External sharing (to other organizations). As a provider, create data product listings, manage
sharing agreements, and use auto-fulfillment to provide data across regions. As a consumer, access datasets and application
packages shared with your account.

For more information, see:

* [Create and configure shares](data-sharing-provider.md)
* [About organizational listings](collaboration/listings/organizational/org-listing-about.md)
* [Access Provider Studio](../collaboration/provider-studio-accessing.md)

### Governance & security

Apply data governance policies to protect sensitive information, manage user access control, and monitor security posture.
Use masking policies for column-level security, row access policies for row-level filtering, tags for data classification,
create and manage users and roles, and evaluate account security in the Trust Center.

For more information, see:

* [Data Governance in Snowflake](../guides-overview-govern.md)
* [Configuring access control](security-access-control-configure.md)
* [Trust Center](trust-center/overview.md)

## Manage your account

### Compute

Manage virtual warehouses for query execution and compute pools for container-based workloads. Optimize resource allocation,
monitor utilization, and configure auto-suspend and auto-resume settings.

For more information, see:

* [Working with warehouses](warehouses-tasks.md)
* [Overview of warehouses](warehouses-overview.md)

### Postgres

Create and manage Postgres instances within Snowflake. Deploy Postgres databases for compatibility with existing applications
while leveraging Snowflake’s infrastructure and management capabilities.

For more information, see:

* [Snowflake Postgres](snowflake-postgres/about.md)

### Admin

Manage cost and billing, configure account settings, set up integrations with external systems and services,
and connect with partner tools through Partner Connect.

For more information, see:

* [Exploring overall cost](cost-exploring-overall.md)
* [Managing integrations in Snowsight](ui-snowsight-integrations.md)
* [Snowflake Partner Connect](ecosystem-partner-connect.md)

## User menu

Access account information, switch roles and accounts, manage your user profile and settings, file support cases, and sign out from the user menu in the lower-left corner.

For more information, see:

* [Manage your user settings in Snowsight](ui-snowsight-profile.md)
* [Getting started with Snowsight](ui-snowsight-gs.md)
* [Overview of Access Control](security-access-control-overview.md)

---
title: SnowSQL (CLI client)
source: https://docs.snowflake.com/en/user-guide/snowsql.md
section: User Guide
---

# SnowSQL (CLI client)

> **Note:**
>
> [Snowflake CLI](../developer-guide/snowflake-cli/index.md) is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations. Snowflake CLI is a more modern, robust, and efficient CLI client than legacy SnowSQL. Snowflake CLI not only lets you execute SQL commands, but also lets you execute commands for other Snowflake products like Streamlit in Snowflake, Snowpark Container Services, and Snowflake Native App Framework. Snowflake will only add new features and enhancements to Snowflake CLI. Consequently, Snowflake recommends that you begin transitioning from SnowSQL to Snowflake CLI.
>
> To help you with the transition from SnowSQL to Snowflake CLI, see [Migrating from SnowSQL to Snowflake CLI](snowsql-migrate.md).
>
> As of July 2025, Snowflake will provide support based on the minor releases for SnowSQL, as follows:
>
> > | SnowSQL version | Initial release date | Support end date |
> > | --- | --- | --- |
> > | 1.2.x | February 02, 2023 | December 19, 2025 |
> > | 1.3.x | May 02, 2024 | May 02, 2026 |
> > | 1.4.x | May 22, 2025 | May 22, 2027 |
> > | 1.5.x | April 16, 2026 | April 16, 2028 |

SnowSQL is a legacy command-line client for connecting to Snowflake to execute SQL queries and perform all DDL and DML operations, including loading data into and unloading data out of database tables.

SnowSQL (`snowsql` executable) can be run as an interactive shell or in batch mode through `stdin` or using the `-f` option.

SnowSQL is an example of an application developed using the [Snowflake Connector for Python](../developer-guide/python-connector/python-connector.md); however, the connector is not a prerequisite for installing SnowSQL. All required software for installing SnowSQL
is bundled in the installers.

Snowflake provides platform-specific versions of SnowSQL for download for the following platforms:

| Operating System | Supported Versions |
| --- | --- |
| Linux | CentOS 7, 8 |
|  | Red Hat Enterprise Linux (RHEL) 7, 8 |
|  | Ubuntu 16.04, 18.04, 20.04 or later |
| macOS | 10.14 or later |
| Microsoft Windows | Microsoft Windows 8 or later |
|  | Microsoft Windows Server 2012, 2016, 2019, 2022 |

## Related videos

> Snowflake 101 | SnowSQL

**Next Topics:**

* [Installing SnowSQL](snowsql-install-config.md)
* [Configuring SnowSQL](snowsql-config.md)
* [Connecting through SnowSQL](snowsql-start.md)
* [Using SnowSQL](snowsql-use.md)

---
title: SOC 1 Type II
source: https://docs.snowflake.com/en/user-guide/cert-soc-1.md
section: User Guide
---

# SOC 1 Type II

This topic describes how Snowflake supports customers with SOC 1 compliance requirements.

## Understanding SOC 1 compliance requirements

The SOC (System Organization Controls) 1 Type II report is an independent auditor’s attestation of the design and operating effectiveness
of internal controls over financial reporting that Snowflake has had in place during the report’s coverage period. The framework was
created by the American Institute of Certified Public Accountants (AICPA).

> **Tip:**
>
> For information about requesting a copy of the report, see the [Snowflake Compliance Center](https://trust.snowflake.com/).

---
title: SOC 2 Type II
source: https://docs.snowflake.com/en/user-guide/cert-soc-2.md
section: User Guide
---

# SOC 2 Type II

This topic describes how Snowflake supports customers with SOC 2 compliance requirements.

## Understanding SOC 2 compliance requirements

The SOC (System and Organization Controls) 2 Type II report is an independent auditor’s attestation of the design and operating
effectiveness of the security, availability, and confidentiality controls that Snowflake has had in place during the report’s coverage
period. The framework was created by the American Institute of Certified Public Accountants (AICPA). The control criteria for a SOC 2 Type
II certification are based on the AICPA Trust Service Principles.

> **Tip:**
>
> For information about requesting a copy of the report, see the [Snowflake Compliance Center](https://trust.snowflake.com/).

---
title: Speeding up geospatial queries with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/geospatial-queries.md
section: User Guide
---

# Speeding up geospatial queries with search optimization

The search optimization service can improve the performance of queries with predicates that use geospatial functions with
GEOGRAPHY objects.

The following sections provide more information about search optimization support for geospatial queries:

* Enabling search optimization for geospatial queries
* Supported predicates with geospatial functions
* Other performance considerations
* Examples that use geospatial functions

> **Note:**
>
> GEOMETRY objects aren’t yet supported.

## Enabling search optimization for geospatial queries

To improve the performance of geospatial queries on a table, use the
[ON GEO clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command](../../sql-reference/sql/alter-table.md)
for specific columns. Enabling search optimization at the table level doesn’t enable it for columns with geospatial data types.

For example:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON GEO(mygeocol);
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported predicates with geospatial functions

For queries with predicates that use the following functions:

* [ST_INTERSECTS](../../sql-reference/functions/st_intersects.md)
* [ST_CONTAINS](../../sql-reference/functions/st_contains.md)
* [ST_WITHIN](../../sql-reference/functions/st_within.md)
* [ST_DWITHIN](../../sql-reference/functions/st_dwithin.md)
* [ST_COVERS](../../sql-reference/functions/st_covers.md)
* [ST_COVEREDBY](../../sql-reference/functions/st_coveredby.md)

The search optimization service can improve performance if:

* One input expression is a GEOGRAPHY column in a table, and
* The other input expression is a GEOGRAPHY constant (created through a
  [conversion or constructor function](../../sql-reference/functions-geospatial.md)).
* For ST_DWITHIN, the distance argument is a non-negative REAL constant.

Note that this feature has the same
[limitations that apply to the search optimization service](queries-that-benefit.md).

## Other performance considerations

Because the search optimization service is designed for predicates that are highly selective and because predicates filter by proximity
between geospatial objects, clustering geospatial objects by proximity in the table can result in better performance. You can cluster
your data either by specifying the sort order when loading the data or by using Automatic Clustering, depending on whether the base
table changes frequently:

Loading Pre-Sorted Data
:   If the data in your base table does not change often, you can specify the sort order when loading the data. You can then enable search
    optimization on the GEOGRAPHY column. For example:

    ```sqlexample
    CREATE TABLE new_table AS SELECT * FROM source_table ORDER BY st_geohash(geom);
    ALTER TABLE new_table ADD SEARCH OPTIMIZATION ON GEO(geom);
    ```

    After every large change made to your base data, you can manually re-sort the data.

### Automatic clustering

If there are frequent updates to your base table, you can use the [ALTER TABLE … CLUSTER BY …](../../sql-reference/sql/alter-table.md)
command to enable [Automatic Clustering](../tables-auto-reclustering.md) so the table is automatically reclustered as it
changes.

The following example adds a new column `geom_geohash` of the type VARCHAR and stores the geohash or H3 index of the GEOGRAPHY column
`geom` in that new column. It then enables Automatic Clustering with the new column as the cluster key. This approach will
automatically recluster the parts of the table that change.

```sqlexample
CREATE TABLE new_table AS SELECT *, ST_GEOHASH(geom) AS geom_geohash FROM source_table;
ALTER TABLE new_table CLUSTER BY (geom_geohash);
ALTER TABLE new_table ADD SEARCH OPTIMIZATION ON GEO(geom);
```

## Examples that use geospatial functions

The following statements create and configure the table used in the examples in this section. The last statement uses the
[ON clause in ALTER TABLE … ADD SEARCH OPTIMIZATION](enabling.md) command
to add search optimization for the `g1` GEOGRAPHY column.

```sqlexample
CREATE OR REPLACE TABLE geospatial_table (id NUMBER, g1 GEOGRAPHY);
INSERT INTO geospatial_table VALUES
  (1, 'POINT(-122.35 37.55)'),
  (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)'),
  (3, 'POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))');
ALTER TABLE geospatial_table ADD SEARCH OPTIMIZATION ON GEO(g1);
```

## Examples of supported predicates

The following query is an example of a query supported by the search optimization service. The search optimization service can
use search access paths to improve the performance of this query:

```sqlexample
SELECT id FROM geospatial_table WHERE
  ST_INTERSECTS(
    g1,
    TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'));
```

The following are examples of additional predicates that are supported by the search optimization service:

```sqlexample
...
  ST_INTERSECTS(
    TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'),
    g1)
```

```sqlexample
...
  ST_CONTAINS(
    TO_GEOGRAPHY('POLYGON((-74.17 40.64, -74.1796875 40.58, -74.09 40.58, -74.09 40.64, -74.17 40.64))'),
    g1)
```

```sqlexample
...
  ST_CONTAINS(
    g1,
    TO_GEOGRAPHY('MULTIPOINT((0 0), (1 1))'))
```

```sqlexample
...
  ST_WITHIN(
   TO_GEOGRAPHY('{"type" : "MultiPoint","coordinates" : [[-122.30, 37.55], [-122.20, 47.61]]}'),
   g1)
```

```sqlexample
...
  ST_WITHIN(
    g1,
    TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'))
```

```sqlexample
...
  ST_COVERS(
    TO_GEOGRAPHY('POLYGON((-1 -1, -1 4, 4 4, 4 -1, -1 -1))'),
    g1)
```

```sqlexample
...
  ST_COVERS(
    g1,
    TO_GEOGRAPHY('POINT(0 0)'))
```

```sqlexample
...
  ST_COVEREDBY(
    TO_GEOGRAPHY('POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))'),
    g1)
```

```sqlexample
...
  ST_COVEREDBY(
    g1,
    TO_GEOGRAPHY('POINT(-122.35 37.55)'))
```

```sqlexample
...
  ST_DWITHIN(
    TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'),
    g1,
    100000)
```

```sqlexample
...
  ST_DWITHIN(
    g1,
    TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'),
    100000)
```

## Examples of constructing GEOGRAPHY constants

The following are examples of predicates that use different
[conversion and constructor functions](../../sql-reference/functions-geospatial.md) for the GEOGRAPHY constant.

```sqlexample
...
  ST_INTERSECTS(
    g1,
    ST_GEOGRAPHYFROMWKT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'))
```

```sqlexample
...
  ST_INTERSECTS(
    ST_GEOGFROMTEXT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'),
    g1)
```

```sqlexample
...
  ST_CONTAINS(
    ST_GEOGRAPHYFROMEWKT('POLYGON((-74.17 40.64, -74.1796875 40.58, -74.09 40.58, -74.09 40.64, -74.17 40.64))'),
    g1)
```

```sqlexample
...
  ST_WITHIN(
    ST_GEOGRAPHYFROMWKB('01010000006666666666965EC06666666666C64240'),
    g1)
```

```sqlexample
...
  ST_COVERS(
    g1,
    ST_MAKEPOINT(0.2, 0.8))
```

```sqlexample
...
  ST_INTERSECTS(
    g1,
    ST_MAKELINE(
      TO_GEOGRAPHY('MULTIPOINT((0 0), (1 1))'),
      TO_GEOGRAPHY('POINT(0.8 0.2)')))
```

```sqlexample
...
  ST_INTERSECTS(
    ST_POLYGON(
      TO_GEOGRAPHY('SRID=4326;LINESTRING(0.0 0.0, 1.0 0.0, 1.0 2.0, 0.0 2.0, 0.0 0.0)')),
    g1)
```

```sqlexample
...
  ST_WITHIN(
    g1,
    TRY_TO_GEOGRAPHY('POLYGON((-1 -1, -1 4, 4 4, 4 -1, -1 -1))'))
```

```sqlexample
...
  ST_COVERS(
    g1,
    ST_GEOGPOINTFROMGEOHASH('s00'))
```

---
title: Speeding up join queries with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/join-queries.md
section: User Guide
---

# Speeding up join queries with search optimization

The search optimization service can improve the performance of join queries that have a small number of distinct values on the
build side of the join.

For example, the search optimization service can improve the performance of these types of joins:

* Suppose that `products` is a table containing a row for each product, and `sales` is a table containing a row for
  each sale of a product. The `products` table contains fewer rows and is smaller than the `sales` table. To find all sales of
  a specific product, you join the `sales` table (the larger table) with the `products` table (the smaller table). Because
  the `products` table is small, there are few distinct values on the build side of the join.

  > **Note:**
  >
  > In data warehousing, the large table is often referred to as the [fact table](https://en.wikipedia.org/wiki/Fact_table). The small table is referred to as the
  > [dimension table](https://en.wikipedia.org/wiki/Dimension_%28data_warehouse%29#Dimension_table). The rest of this topic uses these terms when referring to the large table and the small table in a join.
* Suppose that `customers` is a table containing a row for each customer, and `sales` is a table containing a row for
  each sale. Both tables are large. To find all sales for a specific customer, you join the `sales` table (the probe side)
  with the `customers` table (the build side) and use a filter so that there are a small number of distinct values on the
  build side of the join.

The following sections provide more information about search optimization support for join queries:

* Enabling search optimization for join queries
* Supported join predicates
* Examples of supported join queries
* Limitations

## Enabling search optimization for join queries

To improve the performance of join queries, make sure search optimization is enabled for columns in the join
predicate of the query. In addition, make sure the build side of the join has a small number of distinct values, either
because it’s a small dimension table or because of a selective filter. The search optimization runtime costs of a query
are proportionate to the number of distinct values that must be looked up on the build side of the join. If this number
is too large, Snowflake might decide against using the search access path and use the regular table access path instead.

To improve the performance of join queries, [enable search optimization](enabling.md)
for the table on the probe side of the join. This table is usually a large table that isn’t filtered in join queries,
such as a fact table.

Use the [ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md)
command to:

* Enable search optimization for specific columns.
* Enable search optimization for all columns of the table.

In general, enabling search optimization only for specific columns is the best practice. Use the ON EQUALITY clause
to specify the columns. This example enables search optimization for a specific column:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(mycol);
```

To specify EQUALITY for all columns of the supported data types (except for
[semi-structured](../../sql-reference/data-types-semistructured.md) and [GEOGRAPHY](../../sql-reference/data-types-geospatial.md)):

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION;
```

## Supported join predicates

The search optimization service can improve the performance of queries with the following types of join predicates:

* Equality predicates of the form `probe_side_table.column = build_side_table.column`.
* Transformations on the build-side operand of the predicate (for example, string concatenation, addition, and so on).
* Conjunctions (`AND`) of multiple equality predicates.

## Examples of supported join queries

This section shows examples of join queries that can benefit from search optimization.

### Example: Simple equality predicate

The following is an example of a supported query that uses a simple equality predicate as the join predicate. This query joins a
table named `sales` with a table named `customers`. The probe-side table `sales` is large and has search optimization
enabled. The build-side table `customers` is also large, but the input from this table is small, due to the selective filter on the
`customer_id` column.

```sqlexample
SELECT sales.date, customer.name
  FROM sales JOIN customers ON (sales.customer_id = customers.customer_id)
  WHERE customers.customer_id = 2094;
```

### Example: Predicate transformed on the dimension-side operand

The following query joins a fact table named `sales` with a dimension table named `products`. The fact table is large and
has search optimization enabled. The dimension table is small.

This query transforms the dimension-side operand of the predicate (for example, by multiplying values in the join condition)
and can benefit from search optimization:

```sqlexample
SELECT sales.date, product.name
  FROM sales JOIN products ON (sales.product_id = product.old_id * 100)
  WHERE product.category = 'Cutlery';
```

### Example: Predicate spanning multiple columns

Queries in which a join predicate spans multiple columns can benefit from search optimization:

```sqlexample
SELECT sales.date, product.name
  FROM sales JOIN products ON (sales.product_id = product.id and sales.location = product.place_of_production)
  WHERE product.category = 'Cutlery';
```

### Example: Query using point-lookup filters and join predicates

In a query that uses both regular point-lookup filters and join predicates, the search optimization service can improve the
performance of both. In the following query, the search optimization service can improve the `sales.location` point-lookup
predicate as well as the `product_id` join predicate:

```sqlexample
SELECT sales.date, product.name
  FROM sales JOIN products ON (sales.product_id = product.id)
  WHERE product.category = 'Cutlery'
  AND sales.location = 'Buenos Aires';
```

## Limitations

The following limitations apply to the search optimization service and join queries:

* Disjuncts (`OR`) in join predicates currently aren’t supported.
* LIKE, ILIKE, and RLIKE join predicates currently aren’t supported.
* Join predicates on VARIANT columns currently aren’t supported.
* [[ NOT ] EQUAL_NULL](../../sql-reference/functions/equal_null.md) equality predicates currently aren’t supported.
* The [current limitations of the search optimization service](queries-that-benefit.md) also apply to
  join queries.

---
title: Speeding up point lookup queries with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/point-lookup-queries.md
section: User Guide
---

# Speeding up point lookup queries with search optimization

Point lookup queries are queries that are expected to return a small number of rows. The search optimization service can
improve the performance of point lookup queries that use:

* Equality predicates (for example, `column_name = constant`).
* Predicates that use [IN](../../sql-reference/functions/in.md) (see example).

The following sections provide more information about search optimization support for point lookup queries:

* Enabling search optimization for point lookup queries
* Examples of supported point lookup queries

## Enabling search optimization for point lookup queries

Point lookup queries aren’t improved unless you enable search optimization for the columns referenced by the predicate of
the query. To improve the performance of point lookup queries on a table, use the
[ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command to:

* Enable search optimization for specific columns.
* Enable search optimization for all columns of the table.

In general, enabling search optimization only for specific columns is the best practice. Use the ON EQUALITY clause
to specify the columns. This example enables search optimization for a specific column:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(mycol);
```

To specify EQUALITY for all columns of the supported data types (except for
[semi-structured](../../sql-reference/data-types-semistructured.md) and [GEOGRAPHY](../../sql-reference/data-types-geospatial.md)):

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION;
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Examples of supported point lookup queries

The search optimization service can improve the performance of the following query that uses an equality predicate:

```sqlexample
SELECT * FROM test_table WHERE id = 3;
```

The [IN](../../sql-reference/functions/in.md) clause is supported by the search optimization service:

```sqlexample
SELECT id, c1, c2, c3
  FROM test_table
  WHERE id IN (2, 3)
  ORDER BY id;
```

---
title: Speeding up queries of semi-structured data with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/semi-structured-queries.md
section: User Guide
---

# Speeding up queries of semi-structured data with search optimization

The search optimization service can improve the performance of point lookup and substring queries on semi-structured
data in Snowflake tables (that is, data in [VARIANT, OBJECT, and ARRAY columns](../../sql-reference/data-types-semistructured.md)).
You can configure search optimization on columns of these types even when the structure is deeply nested and
changes frequently. You can also enable search optimization for specific elements within a semi-structured column.

The following sections provide more information about search optimization support for queries of semi-structured data:

* Enabling search optimization for queries of semi-structured data
* Supported data types for constants and casts in predicates for semi-structured types
* Support for semi-structured data type values cast to VARCHAR
* Supported predicates for point lookups on VARIANT types
* Substring search in VARIANT types
* Current limitations in support for semi-structured types

## Enabling search optimization for queries of semi-structured data

To improve the performance for queries of semi-structured data on a table, use the
[ON clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command](../../sql-reference/sql/alter-table.md)
for specific columns or elements in columns. Queries against VARIANT, OBJECT, and ARRAY columns aren’t optimized if you
omit the ON clause. Enabling search optimization at the table level doesn’t enable it for columns with semi-structured
data types.

For example:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(myvariantcol);
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c4:user.uuid);

ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(myvariantcol);
ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON SUBSTRING(c4:user.uuid);

ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(object_column);
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(object_column);

ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(array_column);
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(array_column);
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported data types for constants and casts in predicates for semi-structured types

The search optimization service can improve the performance of
[point lookups of semi-structured data](../querying-semistructured.md) where the
following types are used for the constant and the [implicit or explicit cast](../../sql-reference/data-type-conversion.md) for the
element:

* FIXED (including casts that specify a valid precision and scale)
* INTEGER (including synonymous types)
* VARCHAR (including synonymous types)
* DATE (including casts that specify a scale)
* TIME (including casts that specify a scale)
* TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ (including casts that specify a scale)

The search optimization service supports the casting of types using:

* [CAST and the :: operator](../../sql-reference/functions/cast.md)
* [TRY_CAST](../../sql-reference/functions/try_cast.md)

## Support for semi-structured data type values cast to VARCHAR

The search optimization service can also improve the performance of point lookups in which columns with semi-structured data types are cast
to VARCHAR and are compared to constants that are cast to VARCHAR.

For example, suppose that `src` is a VARIANT column containing BOOLEAN, DATE, and TIMESTAMP values that have been converted to VARIANT:

```sqlexample
CREATE OR REPLACE TABLE test_table
(
  id INTEGER,
  src VARIANT
);

INSERT INTO test_table SELECT 1, TO_VARIANT('true'::BOOLEAN);
INSERT INTO test_table SELECT 2, TO_VARIANT('2020-01-09'::DATE);
INSERT INTO test_table SELECT 3, TO_VARIANT('2020-01-09 01:02:03.899'::TIMESTAMP);
```

For this table, the search optimization service can improve the following queries, which cast the VARIANT column to VARCHAR and
compare the column to string constants:

```sqlexample
SELECT * FROM test_table WHERE src::VARCHAR = 'true';
SELECT * FROM test_table WHERE src::VARCHAR = '2020-01-09';
SELECT * FROM test_table WHERE src::VARCHAR = '2020-01-09 01:02:03.899';
```

## Supported predicates for point lookups on VARIANT types

The search optimization service can improve point lookup queries with the types of predicates listed below. In the examples
below, `src` is the column with a semi-structured data type, and `path_to_element` is a
[path to an element in the column with a semi-structured data type](../querying-semistructured.md).

* Equality predicates of the following form:

  `WHERE path_to_element[::target_data_type] = constant`

  In this syntax, `target_data_type` (if specified) and the data type of `constant` must be one
  of the supported types.

  For example, the search optimization service supports:

  + Matching a VARIANT element against a NUMBER constant without explicitly casting the element.

    ```sqlexample
    WHERE src:person.age = 42;
    ```
  + Explicitly casting a VARIANT element to NUMBER with a specified precision and scale.

    ```sqlexample
    WHERE src:location.temperature::NUMBER(8, 6) = 23.456789;
    ```
  + Matching a VARIANT element against a VARCHAR constant without explicitly casting the element.

    ```sqlexample
    WHERE src:sender_info.ip_address = '123.123.123.123';
    ```
  + Explicitly casting a VARIANT element to VARCHAR.

    ```sqlexample
    WHERE src:salesperson.name::VARCHAR = 'John Appleseed';
    ```
  + Explicitly casting a VARIANT element to DATE.

    ```sqlexample
    WHERE src:events.date::DATE = '2021-03-26';
    ```
  + Explicitly casting a VARIANT element to TIMESTAMP with a specified scale.

    ```sqlexample
    WHERE src:event_logs.exceptions.timestamp_info(3) = '2021-03-26 15:00:00.123 -0800';
    ```
  + Matching an ARRAY element against a value of a supported type,
    with or without explicitly casting to the type. For example:

    ```sqlexample
    WHERE my_array_column[2] = 5;

    WHERE my_array_column[2]::NUMBER(4, 1) = 5;
    ```
  + Matching an OBJECT element against a value of a supported type,
    with or without explicitly casting to the type. For example:

    ```sqlexample
    WHERE object_column['mykey'] = 3;

    WHERE object_column:mykey = 3;

    WHERE object_column['mykey']::NUMBER(4, 1) = 3;

    WHERE object_column:mykey::NUMBER(4, 1) = 3;
    ```
* Predicates that use the ARRAY functions, such as:

  + `WHERE ARRAY_CONTAINS(value_expr, array)`

    In this syntax, `value_expr` must not be NULL and must evaluate to VARIANT. The data type of the value must be one of
    the supported types.

    For example:

    ```sqlexample
    WHERE ARRAY_CONTAINS('77.146.211.88'::VARIANT, src:logs.ip_addresses)
    ```

    In this example, the value is a constant that is implicitly cast to a VARIANT:

    ```sqlexample
    WHERE ARRAY_CONTAINS(300, my_array_column)
    ```
  + `WHERE ARRAYS_OVERLAP(ARRAY_CONSTRUCT(constant_1, constant_2, .., constant_N), array)`

    The data type of each constant (`constant_1`, `constant_2`, and so on) must be one of the
    supported types. The constructed ARRAY can
    include NULL constants.

    In this example, the array is in a VARIANT value:

    ```sqlexample
    WHERE ARRAYS_OVERLAP(
      ARRAY_CONSTRUCT('122.63.45.75', '89.206.83.107'), src:senders.ip_addresses)
    ```

    In this example, the array is an ARRAY column:

    ```sqlexample
    WHERE ARRAYS_OVERLAP(
      ARRAY_CONSTRUCT('a', 'b'), my_array_column)
    ```
* The following predicates that check for NULL values:

  + `WHERE IS_NULL_VALUE(path_to_element)`

    Note that [IS_NULL_VALUE](../../sql-reference/functions/is_null_value.md) applies to JSON null values and not to SQL NULL values.
  + `WHERE path_to_element IS NOT NULL`
  + `WHERE semistructured_column IS NULL`

    where `semistructured_column` refers to the column and not a path to an element in the semi-structured data.

    For example, the search optimization service supports using the VARIANT column `src` but not the path to the element
    `src:person.age` in that VARIANT column.

## Substring search in VARIANT types

The search optimization service can optimize [wildcard or regular expression searches](substring-queries.md)
in [semi-structured columns](../../sql-reference/data-types-semistructured.md) — that is, VARIANT, OBJECT, and ARRAY columns —
or elements in such columns.

The search optimization service can optimize predicates that use the following functions:

* [LIKE](../../sql-reference/functions/like.md)
* [LIKE ANY](../../sql-reference/functions/like_any.md)
* [LIKE ALL](../../sql-reference/functions/like_all.md)
* [ILIKE](../../sql-reference/functions/ilike.md)
* [ILIKE ANY](../../sql-reference/functions/ilike_any.md)
* [CONTAINS](../../sql-reference/functions/contains.md)
* [ENDSWITH](../../sql-reference/functions/endswith.md)
* [STARTSWITH](../../sql-reference/functions/startswith.md)
* [SPLIT_PART](../../sql-reference/functions/split_part.md)
* [RLIKE](../../sql-reference/functions/rlike.md)
* [REGEXP](../../sql-reference/functions/regexp.md)
* [REGEXP_LIKE](../../sql-reference/functions/regexp_like.md)

You can enable substring search optimization for a column or for multiple individual elements within a column. For
example, the following statement enables substring search optimization for a nested element in a column:

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data.search);
```

After the search access path has been built, the following query can be optimized:

```sqlexample
SELECT * FROM test_table WHERE col2:data.search LIKE '%optimization%';
```

However, the following queries aren’t optimized because the WHERE clause filters don’t apply to the element
that was specified when search optimization was enabled (`col2:data.search`):

```sqlexample
SELECT * FROM test_table WHERE col2:name LIKE '%simon%parker%';
SELECT * FROM test_table WHERE col2 LIKE '%hello%world%';
```

You can specify multiple elements to be optimized. In the following example, search optimization is enabled for two specific
elements in the column `col2`:

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:name);
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data.search);
```

If you enable search optimization for a given element, it is enabled for any nested elements. The second ALTER TABLE statement
below is redundant because the first statement enables search optimization for the entire `data` element, including
the nested `search` element.

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data);
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data.search);
```

Similarly, enabling search optimization for an entire column allows all substring searches on that column to be optimized,
including elements nested to any depth within it.

For an example that enables FULL_TEXT search optimization on a VARIANT column in the `car_sales` table and its data,
which is described in [Querying Semi-structured Data](../querying-semistructured.md), see
[Enable FULL_TEXT search optimization on a VARIANT column](text-queries.md).

### How constants are evaluated for VARIANT substring searches

When it evaluates the constant string in a query — for example, `LIKE 'constant_string'` — the search optimization service splits the
string into tokens by using the following characters as delimiters:

* Square brackets (`[` and `]`).
* Curly braces (`{` and `}`).
* Colons (`:`).
* Commas (`,`).
* Double quotes (`"`).

After it splits the string into tokens, the search optimization service considers only tokens that are at least five characters long.
The following table explains how the search optimization service handles various predicate examples:

| Example of a predicate | How the search optimization service handles the query |
| --- | --- |
| `LIKE '%TEST%'` | The search optimization service *doesn’t use* search access paths for the following predicate because the substring is shorter than five characters. |
| `LIKE '%SEARCH%IS%OPTIMIZED%'` | The search optimization service can optimize this query, by using search access paths to search for `SEARCH` and `OPTIMIZED` but not `IS`. `IS` is shorter than five characters. |
| `LIKE '%HELLO_WORLD%'` | The search optimization service can optimize this query, by using search access paths to search for `HELLO_WORLD`. |
| `LIKE '%COL:ON:S:EVE:RYWH:ERE%'` | The search optimization service splits this string into `COL`, `ON`, `S`, `EVE`, `RYWH`, `ERE`. Because all of these tokens are shorter than five characters, the search optimization service can’t optimize this query. |
| `LIKE '%{\"KEY01\":{\"KEY02\":\"value\"}%'` | The search optimization service splits this string into the tokens `KEY01`, `KEY02`, `VALUE` and uses the tokens when it optimizes the query. |
| `LIKE '%quo\"tes_and_com,mas,\"are_n\"ot\"_all,owed%'` | The search optimization service splits this string into the tokens `quo`, `tes_and_com`, `mas`, `are_n`, `ot`, `_all`, `owed`. The search optimization service can only use the tokens that are five characters or longer (`tes_and_com`, `are_n`) when it optimizes the query. |

## Current limitations in support for semi-structured types

Support for semi-structured types in the search optimization service is limited in the following ways:

* Predicates of the form `path_to_element IS NULL` aren’t supported.
* Predicates where the constants are results of scalar subqueries aren’t supported.
* Predicates that specify paths to elements that contain sub-elements aren’t supported.
* Predicates that use the [XMLGET](../../sql-reference/functions/xmlget.md) function aren’t supported.

The [current limitations of the search optimization service](queries-that-benefit.md) also apply to
semi-structured types.

---
title: Speeding up queries of structured data with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/structured-queries.md
section: User Guide
---

# Speeding up queries of structured data with search optimization

The search optimization service can improve the performance of point-lookup and substring queries on
structured data in Snowflake tables; that is, data in
[structured ARRAY, OBJECT, and MAP columns](../../sql-reference/data-types-structured.md). You can configure
search optimization on columns of these types even when the structure is deeply nested and changes frequently.
You can also enable search optimization for specific elements within a structured column.

The following sections provide more information about search optimization support for queries of structured data:

* Enabling search optimization for queries of structured data
* Supported predicates for point lookups on structured types
* Substring search in structured types
* Schema evolution support
* Current limitations in support for structured types

## Enabling search optimization for queries of structured data

To improve the performance for queries of structured data types on a table, use the
[ON clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command](../../sql-reference/sql/alter-table.md)
for specific columns or elements in columns. Queries against structured ARRAY, OBJECT, and MAP columns aren’t
optimized if you omit the ON clause. Enabling search optimization at the table level doesn’t enable it for columns
with structured data types.

For example:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(array_column);
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(array_column[1]);

ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(object_column);
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(object_column:key);

ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(map_column);
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(map_column:user.uuid);
```

The following rules apply to the keywords you use in these ALTER TABLE … ADD SEARCH OPTIMIZATION commands:

* You can use the EQUALITY keyword with any inner element or the column itself.
* You can use the SUBSTRING keyword only with inner elements that have
  [text string](../../sql-reference/data-types-text.md) data types.

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported data types for constants and casts in predicates for structured types

The search optimization service can improve the performance of point lookups of structured data where the
following types are used for the constant and the [implicit or explicit cast](../../sql-reference/data-type-conversion.md)
for the element:

* FIXED (including casts that specify a valid precision and scale)
* INTEGER (including synonymous types)
* VARCHAR (including synonymous types)
* DATE (including casts that specify a scale)
* TIME (including casts that specify a scale)
* TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ (including casts that specify a scale)

The search optimization service supports the casting of types by using the following conversion functions:

* [CAST and the :: operator](../../sql-reference/functions/cast.md)
* [TRY_CAST](../../sql-reference/functions/try_cast.md)

## Supported predicates for point lookups on structured types

The search optimization service can improve point-lookup queries with the types of predicates shown in the following
list. In the examples, `src` is the column with a structured data type, and `path_to_element` is a
path to an element in the column with a structured data type:

* Equality predicates of the following form:

  `WHERE path_to_element[::target_data_type] = constant`

  In this syntax, `target_data_type` (if specified) and the data type of `constant` must be one
  of the supported types.

  For example, the search optimization service supports the following predicates:

  + Matching an OBJECT or MAP element against a NUMBER constant without explicitly casting the element:

    ```sqlexample
    WHERE src:person.age = 42;
    ```
  + Explicitly casting an OBJECT or MAP element to NUMBER with a specified precision and scale:

    ```sqlexample
    WHERE src:location.temperature::NUMBER(8, 6) = 23.456789;
    ```
  + Matching an OBJECT or MAP element against a VARCHAR constant without explicitly casting the element:

    ```sqlexample
    WHERE src:sender_info.ip_address = '123.123.123.123';
    ```
  + Explicitly casting an OBJECT or MAP element to VARCHAR:

    ```sqlexample
    WHERE src:salesperson.name::VARCHAR = 'John Appleseed';
    ```
  + Explicitly casting an OBJECT or MAP element to DATE:

    ```sqlexample
    WHERE src:events.date::DATE = '2021-03-26';
    ```
  + Explicitly casting an OBJECT or MAP element to TIMESTAMP with a specified scale:

    ```sqlexample
    WHERE src:event_logs.exceptions.timestamp_info(3) = '2021-03-26 15:00:00.123 -0800';
    ```
  + Matching an ARRAY element against a value of a [supported type](semi-structured-queries.md),
    with or without an explicit cast:

    ```sqlexample
    WHERE my_array_column[2] = 5;

    WHERE my_array_column[2]::NUMBER(4, 1) = 5;
    ```
  + Matching an OBJECT or MAP element against a value of a [supported type](semi-structured-queries.md),
    with or without an explicit cast:

    ```sqlexample
    WHERE object_column['mykey'] = 3;

    WHERE object_column:mykey = 3;

    WHERE object_column['mykey']::NUMBER(4, 1) = 3;

    WHERE object_column:mykey::NUMBER(4, 1) = 3;
    ```
* Predicates that use the ARRAY functions, such as the following predicates:

  + `WHERE ARRAY_CONTAINS(value_expr, array)`

    In this syntax, `value_expr` must not be NULL and must evaluate to VARIANT. The data type of the
    value must be one of the [supported types](semi-structured-queries.md):

    ```sqlexample
    WHERE ARRAY_CONTAINS('77.146.211.88'::VARIANT, src:logs.ip_addresses)
    ```

    In this example, the value is a constant that is implicitly cast to an OBJECT:

    ```sqlexample
    WHERE ARRAY_CONTAINS(300, my_array_column)
    ```
  + `WHERE ARRAYS_OVERLAP(ARRAY_CONSTRUCT(constant_1, constant_2, .., constant_N), array)`

    The data type of each constant — `constant_1`, `constant_2`, and so on — must be one of the
    [supported types](semi-structured-queries.md). The constructed ARRAY can
    include NULL constants.

    In this example, the array is in an OBJECT value:

    ```sqlexample
    WHERE ARRAYS_OVERLAP(
      ARRAY_CONSTRUCT('122.63.45.75', '89.206.83.107'), src:senders.ip_addresses)
    ```

    In this example, the array is in an ARRAY column:

    ```sqlexample
    WHERE ARRAYS_OVERLAP(
      ARRAY_CONSTRUCT('a', 'b'), my_array_column)
    ```
* The following predicates check for NULL values:

  + `WHERE IS_NULL_VALUE(path_to_element)`

    > **Note:**
    >
    > [IS_NULL_VALUE](../../sql-reference/functions/is_null_value.md) applies to JSON null values and not to SQL NULL values.
  + `WHERE path_to_element IS NOT NULL`
  + `WHERE structured_column IS NULL`

    where `structured_column` refers to the column and not a path to an element in the structured data.

    For example, the search optimization service supports using the OBJECT column `src` but not the path to the element
    `src:person.age` in that OBJECT column.

## Substring search in structured types

You can enable substring search only if the target structured element is a
[text string](../../sql-reference/data-types-text.md) data type.

For example, consider the following table:

```sqlexample
CREATE TABLE t(
  col OBJECT(
    a INTEGER,
    b STRING,
    c MAP(INTEGER, STRING),
    d ARRAY(STRING)
  )
);
```

For this table, search optimization for SUBSTRING search *can* be added on the following target structured elements:

* `col:b` because its type is STRING.
* `col:c[value]` — for example, `col:c[0]`, `col:c[100]` — if the values are text string types.

For this table, search optimization for SUBSTRING search *can’t* be added on the following target structured elements:

* `col` because its type is structured OBJECT.
* `col:a` because its type is INTEGER.
* `col:c` because its type is MAP.
* `col:d` because its type is ARRAY.

The search optimization service can optimize predicates that use the following functions:

* [LIKE](../../sql-reference/functions/like.md)
* [LIKE ANY](../../sql-reference/functions/like_any.md)
* [LIKE ALL](../../sql-reference/functions/like_all.md)
* [ILIKE](../../sql-reference/functions/ilike.md)
* [ILIKE ANY](../../sql-reference/functions/ilike_any.md)
* [CONTAINS](../../sql-reference/functions/contains.md)
* [ENDSWITH](../../sql-reference/functions/endswith.md)
* [STARTSWITH](../../sql-reference/functions/startswith.md)
* [SPLIT_PART](../../sql-reference/functions/split_part.md)
* [RLIKE](../../sql-reference/functions/rlike.md)
* [REGEXP](../../sql-reference/functions/regexp.md)
* [REGEXP_LIKE](../../sql-reference/functions/regexp_like.md)

You can enable substring search optimization for a column or for multiple individual elements within a column. For
example, the following statement enables substring search optimization for a nested element in a column:

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data.search);
```

After the search access path has been built, the following query can be optimized:

```sqlexample
SELECT * FROM test_table WHERE col2:data.search LIKE '%optimization%';
```

However, the following queries aren’t optimized because the WHERE clause filters don’t apply to the element
that was specified when search optimization was enabled (`col2:data.search`):

```sqlexample
SELECT * FROM test_table WHERE col2:name LIKE '%simon%parker%';
SELECT * FROM test_table WHERE col2 LIKE '%hello%world%';
```

You can specify multiple elements to be optimized. In the following example, search optimization is enabled for two specific
elements in the column `col2`:

```sqlexample
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:name);
ALTER TABLE test_table ADD SEARCH OPTIMIZATION ON SUBSTRING(col2:data.search);
```

If you enable search optimization for a given element, it is enabled for any unnested elements of a text string type.
Search optimization isn’t enabled for nested elements or elements of non-text string types.

### How constants are evaluated for structured substring searches

When it evaluates the constant string in a query — for example, `LIKE 'constant_string'` — the search optimization service splits the
string into tokens by using the following characters as delimiters:

* Square brackets (`[` and `]`).
* Curly braces (`{` and `}`).
* Colons (`:`).
* Commas (`,`).
* Double quotes (`"`).

After it splits the string into tokens, the search optimization service considers only tokens that are at least five characters long.
The following table explains how the search optimization service handles various predicate examples:

| Example of a predicate | How the search optimization service handles the query |
| --- | --- |
| `LIKE '%TEST%'` | The search optimization service *doesn’t use* search access paths for the following predicate because the substring is shorter than five characters. |
| `LIKE '%SEARCH%IS%OPTIMIZED%'` | The search optimization service can optimize this query, by using search access paths to search for `SEARCH` and `OPTIMIZED` but not `IS`. `IS` is shorter than five characters. |
| `LIKE '%HELLO_WORLD%'` | The search optimization service can optimize this query, by using search access paths to search for `HELLO_WORLD`. |
| `LIKE '%COL:ON:S:EVE:RYWH:ERE%'` | The search optimization service splits this string into `COL`, `ON`, `S`, `EVE`, `RYWH`, `ERE`. Because all of these tokens are shorter than five characters, the search optimization service can’t optimize this query. |
| `LIKE '%{\"KEY01\":{\"KEY02\":\"value\"}%'` | The search optimization service splits this string into the tokens `KEY01`, `KEY02`, `VALUE` and uses the tokens when it optimizes the query. |
| `LIKE '%quo\"tes_and_com,mas,\"are_n\"ot\"_all,owed%'` | The search optimization service splits this string into the tokens `quo`, `tes_and_com`, `mas`, `are_n`, `ot`, `_all`, `owed`. The search optimization service can only use the tokens that are five characters or longer (`tes_and_com`, `are_n`) when it optimizes the query. |

## Schema evolution support

The schema of structured columns can evolve over time. For more information about schema evolution, see
[ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)](../../sql-reference/sql/alter-iceberg-table-alter-column-set-data-type.md).

As part of a single schema-evolution operation, the following modifications can occur:

* Type widening
* Reordering elements
* Adding elements
* Removing elements
* Renaming elements

The search optimization service isn’t invalidated as part of the schema-evolution operation. Instead,
the search optimization service handles operations in the following ways:

Type widening (for example, INT to NUMBER)
:   Search optimization access paths aren’t affected.

Adding elements
:   The newly added elements are automatically reflected in the existing search optimization access paths.

Removing elements
:   When elements are removed from a structured column, the search optimization service automatically
    drops access paths that are prefixed by the removed element.

    For example, create a table with a column of OBJECT type, and then insert data:

    ```sqlexample
    CREATE OR REPLACE TABLE test_struct (
      a OBJECT(
        b INTEGER,
        c OBJECT(
          d STRING,
          e VARIANT
          )
      )
    );

    INSERT INTO test_struct (a) SELECT
      {
        'b': 100,
        'c': {
            'd': 'value1',
            'e': 'value2'
      }
      }::OBJECT(
        b INTEGER,
        c OBJECT(
            d STRING,
            e VARIANT
        )
    );
    ```

    To view the data, query the table:

    ```sqlexample
    SELECT * FROM test_struct;
    ```

    ```output
    +--------------------+
    | A                  |
    |--------------------|
    | {                  |
    |   "b": 100,        |
    |   "c": {           |
    |     "d": "value1", |
    |     "e": "value2"  |
    |   }                |
    | }                  |
    +--------------------+
    ```

    The following statement removes element `c` from the object:

    ```sqlexample
    ALTER TABLE test_struct ALTER COLUMN a
      SET DATA TYPE OBJECT(
        b INTEGER);
    ```

    When this statement runs, the access paths at `a`, `a:c`, `a:c:d`
    and `a:c:e` are dropped.

Renaming elements
:   When an element is renamed, the search optimization service automatically drops access paths prefixed
    by the renamed element and adds them back with the newly named path. This operation incurs an additional
    maintenance cost to process the newly added path in the search optimization service.

    For example, create a table with a column of OBJECT type, and then insert data:

    ```sqlexample
    CREATE OR REPLACE TABLE test_struct (
      a OBJECT(
        b INTEGER,
        c OBJECT(
          d STRING,
          e VARIANT
          )
      )
    );

    INSERT INTO test_struct (a) SELECT
      {
        'b': 100,
        'c': {
            'd': 'value1',
            'e': 'value2'
      }
      }::OBJECT(
        b INTEGER,
        c OBJECT(
            d STRING,
            e VARIANT
        )
    );
    ```

    To view the data, query the table:

    ```sqlexample
    SELECT * FROM test_struct;
    ```

    ```output
    +--------------------+
    | A                  |
    |--------------------|
    | {                  |
    |   "b": 100,        |
    |   "c": {           |
    |     "d": "value1", |
    |     "e": "value2"  |
    |   }                |
    | }                  |
    +--------------------+
    ```

    The following statement renames element `c` to `c_new` in the object:

    ```sqlexample
    ALTER TABLE test_struct ALTER COLUMN a
      SET DATA TYPE OBJECT(
        b INTEGER,
        c_new OBJECT(
          d STRING,
          e VARIANT
        )
      ) RENAME FIELDS;
    ```

    The access paths at `a`, `a:c`, `a:c:d`, `a:c:e` are dropped and re-added as `a`, `a:c_new`,
    `a:c_new:d`, `a:c_new:e`.

Reordering elements
:   Search optimization access paths aren’t affected.

## Current limitations in support for structured types

Support for structured types in the search optimization service is limited in the following ways:

* Predicates of the form `path_to_element IS NULL` aren’t supported.
* Predicates where the constants are results of scalar subqueries aren’t supported.
* Predicates that specify paths to elements that contain sub-elements aren’t supported.
* Predicates that use the [XMLGET](../../sql-reference/functions/xmlget.md) function aren’t supported.

* Predicates that use the [MAP_CONTAINS_KEY](../../sql-reference/functions/map_contains_key.md) function aren’t supported.

The [current limitations of the search optimization service](queries-that-benefit.md) also apply to
structured types.

---
title: Speeding up queries with scalar functions using search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/scalar-functions.md
section: User Guide
---

# Speeding up queries with scalar functions using search optimization

A scalar function returns a single value for each invocation. The search optimization service can improve the
performance of queries that use scalar functions in equality predicates. The scalar function can be a
[system-defined scalar function](../../sql-reference/functions.md) or a
[user-defined scalar SQL function](../../developer-guide/udf/sql/udf-sql-introduction.md).

The following sections provide more information about search optimization support for queries that use scalar
functions:

* Enabling search optimization for queries that use scalar functions
* Supported data types
* Examples of supported queries with scalar functions

## Enabling search optimization for queries that use scalar functions

Queries aren’t improved unless you enable search optimization for the columns that are specified in equality
predicates that use scalar function calls. To improve the performance of queries with scalar functions on a table,
use the [ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md)
command to do the following:

* Enable search optimization for specific columns.
* Enable search optimization for all columns of the table.

In general, enabling search optimization only for specific columns is the best practice. Use the ON EQUALITY clause
to specify the columns. This example enables search optimization for a specific column:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(mycol);
```

To specify EQUALITY for all columns of the supported data types (except for
[semi-structured](../../sql-reference/data-types-semistructured.md) and [GEOGRAPHY](../../sql-reference/data-types-geospatial.md)):

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION;
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported data types

The search optimization service can improve the performance of queries that use columns of the following
data types in equality predicates that use scalar function calls:

* [Data types for fixed-point numbers](../../sql-reference/data-types-numeric.md), including the following:

  + All INTEGER data types, which have a scale of 0.
  + Fixed-point non-integers, which have a scale other than 0 (such as `NUMBER(10,2)`).
  + [Casts](../../sql-reference/data-type-conversion.md) of fixed-point numbers (for example,
    `NUMBER(30, 2)::NUMBER(30, 5)`).
* [String & binary data types](../../sql-reference/data-types-text.md) (for example, VARCHAR and BINARY).
* [Date & time data types](../../sql-reference/data-types-datetime.md) (for example, DATE, TIME, and TIMESTAMP).

Queries that involve other types of values (for example, VARIANT, FLOAT, GEOGRAPHY, or GEOMETRY) don’t benefit.

## Examples of supported queries with scalar functions

The following queries use scalar functions and are supported by the search optimization service.

### Use a system-defined scalar function in the predicate of a query

This query uses the [SHA2](../../sql-reference/functions/sha2.md) system-defined scalar function in an
equality predicate. To improve performance, make sure the EQUALITY search method
is enabled for the `mycol` column in the `test_so_scalar_function_system` table.

```sqlexample
SELECT *
  FROM test_so_scalar_function_system
  WHERE mycol = SHA2('Snowflake');
```

### Use a user-defined scalar SQL function in the predicate of a query

Create a user-defined scalar function:

```sqlexample
CREATE OR REPLACE FUNCTION test_scalar_udf(x INTEGER)
RETURNS INTEGER
AS
$$
  SELECT x + POW(2, 3)::INTEGER + 2
$$
;
```

This query uses the `test_scalar_udf` function in an equality predicate. To improve performance,
make sure the EQUALITY search method is enabled for the `mycol` column in the
`test_so_scalar_function_udf` table.

```sqlexample
SELECT *
  FROM test_so_scalar_function_udf
  WHERE mycol = test_scalar_udf(15750);
```

---
title: Speeding up queries with scalar subqueries using search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/scalar-subqueries.md
section: User Guide
---

# Speeding up queries with scalar subqueries using search optimization

A scalar subquery returns a single value (one column of one row). If no rows qualify to be returned, the subquery
returns NULL. The search optimization service can improve the performance of queries with scalar subqueries. For
more information about subqueries, see [Working with Subqueries](../querying-subqueries.md).

The following sections provide more information about search optimization support for queries with subqueries:

* Enabling search optimization for queries with scalar subqueries
* Supported data types
* Examples of supported queries with scalar subqueries

## Enabling search optimization for queries with scalar subqueries

Queries with subqueries aren’t improved unless you enable search optimization for the column that is
equal to the result of the subquery. To improve the performance of queries with scalar subqueries on a table, use the
[ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command to
do either of the following:

* Enable search optimization for specific columns.
* Enable search optimization for all columns of the table.

In general, enabling search optimization only for specific columns is the best practice. Use the ON EQUALITY clause
to specify the columns. This example enables search optimization for a specific column:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON EQUALITY(mycol);
```

To specify EQUALITY for all columns of the supported data types (except for
[semi-structured](../../sql-reference/data-types-semistructured.md) and [GEOGRAPHY](../../sql-reference/data-types-geospatial.md)):

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION;
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported data types

The search optimization service can improve the performance of scalar subqueries on columns of the following
data types:

* [Data types for fixed-point numbers](../../sql-reference/data-types-numeric.md), including the following:

  + All INTEGER data types, which have a scale of 0.
  + Fixed-point non-integers, which have a scale other than 0 (such as `NUMBER(10,2)`).
  + [Casts](../../sql-reference/data-type-conversion.md) of fixed-point numbers (for example,
    `NUMBER(30, 2)::NUMBER(30, 5)`).
* [String & binary data types](../../sql-reference/data-types-text.md) (for example, VARCHAR and BINARY).
* [Date & time data types](../../sql-reference/data-types-datetime.md) (for example, DATE, TIME, and TIMESTAMP).

Subqueries that involve other types of values (for example, VARIANT, FLOAT, GEOGRAPHY, or GEOMETRY) don’t benefit.

## Examples of supported queries with scalar subqueries

The following queries are examples of queries with scalar subqueries that are supported by the search
optimization service.

This query has a scalar subquery that queries the same table as the table in the outer query. To improve performance,
make sure search optimization is enabled for the `salary` column in the `employees` table.

```sqlexample
SELECT employee_id
  FROM employees
  WHERE salary = (
    SELECT MAX(salary)
      FROM employees
      WHERE department = 'Engineering');
```

This query has a scalar subquery that queries a table that is different from the table in the outer query. To improve
performance, make sure search optimization is enabled for the `product_id` column in the `products` table.

```sqlexample
SELECT *
  FROM products
  WHERE products.product_id = (
    SELECT product_id
      FROM sales
      GROUP BY product_id
      ORDER BY COUNT(product_id) DESC
      LIMIT 1);
```

---
title: Speeding up substring and regular expression queries with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/substring-queries.md
section: User Guide
---

# Speeding up substring and regular expression queries with search optimization

Search optimization can improve the performance of queries with predicates that search for substrings or use
regular expressions in text or semi-structured data. For details on how substring searches work with semi-structured
data, see [Speeding up queries of semi-structured data with search optimization](semi-structured-queries.md).

The following sections provide more information about search optimization support for substring and regular
expression queries:

* Enabling search optimization for substring and regular expression queries
* Supported predicates

## Enabling search optimization for substring and regular expression queries

To improve the performance of substring and regular expression queries on a table, use the
[ON SUBSTRING clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command](../../sql-reference/sql/alter-table.md)
for specific columns.

For example:

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(mycol);
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Supported predicates

The search optimization service can improve the performance of queries with predicates that use:

* [LIKE](../../sql-reference/functions/like.md)
* [LIKE ANY](../../sql-reference/functions/like_any.md)
* [LIKE ALL](../../sql-reference/functions/like_all.md)
* [ILIKE](../../sql-reference/functions/ilike.md)
* [ILIKE ANY](../../sql-reference/functions/ilike_any.md)
* [CONTAINS](../../sql-reference/functions/contains.md)
* [ENDSWITH](../../sql-reference/functions/endswith.md)
* [STARTSWITH](../../sql-reference/functions/startswith.md)
* [SPLIT_PART](../../sql-reference/functions/split_part.md)
* [RLIKE](../../sql-reference/functions/rlike.md)
* [REGEXP](../../sql-reference/functions/regexp.md)
* [REGEXP_LIKE](../../sql-reference/functions/regexp_like.md)

The search optimization service can improve performance when searching for substrings that are five or more characters
long. (More selective substrings can result in better performance.) The search optimization service doesn’t
use search access paths for the following predicate because the substring is shorter than five characters:

```sqlexample
LIKE '%TEST%'
```

For the following predicate, the search optimization service can optimize this query, using search access paths to search for the
substrings for `SEARCH` and `OPTIMIZED`. However, search access paths are not used for `IS` because the substring is shorter
than five characters.

```sqlexample
LIKE '%SEARCH%IS%OPTIMIZED%'
```

For queries that use RLIKE, REGEXP, and REGEXP_LIKE against text:

* The `subject` argument must be a TEXT column in a table that has search optimization enabled.
* The `pattern` argument must be a string constant.

For regular expressions, the search optimization service works best when:

* The pattern contains at least one substring literal that is five or more characters long.
* The pattern specifies that the substring should appear at least once.

For example, the following pattern specifies that `string` should appear one or more times in the subject:

```sqlexample
RLIKE '(string)+'
```

The search optimization service can improve the performance of queries with the following patterns because each predicate
specifies that a substring of five or more characters must appear at least once. (Note that the first example uses a
[dollar-quoted string constant](../../sql-reference/data-types-text.md) to avoid escaping the backslash characters.)

```sqlexample
RLIKE $$.*email=[\w\.]+@snowflake\.com.*$$
```

```sqlexample
RLIKE '.*country=(Germany|France|Spain).*'
```

```sqlexample
RLIKE '.*phone=[0-9]{3}-?[0-9]{3}-?[0-9]{4}.*'
```

In contrast, search optimization does not use search access paths for queries with the following patterns:

* Patterns without any substrings:

  ```sqlexample
  RLIKE '.*[0-9]{3}-?[0-9]{3}-?[0-9]{4}.*'
  ```
* Patterns that only contain substrings shorter than five characters:

  ```sqlexample
  RLIKE '.*tel=[0-9]{3}-?[0-9]{3}-?[0-9]{4}.*'
  ```
* Patterns that use the alternation operator where one option is a substring shorter than five characters:

  ```sqlexample
  RLIKE '.*(option1|option2|opt3).*'
  ```
* Patterns in which the substring is optional:

  ```sqlexample
  RLIKE '.*[a-zA-z]+(string)?[0-9]+.*'
  ```

Even when the substring literals are shorter than five characters, the search optimization service can still improve query
performance if expanding the regular expression produces a substring literal that is five characters or longer.

For example, consider the pattern:

```output
.*st=(CA|AZ|NV).*(-->){2,4}.*
```

In this example:

* Although the substring literals (e.g. `st=`, `CA`, etc) are shorter than five characters, the search optimization service
  recognizes that the substring `st=CA`, `st=AZ`, or `st=NV` (each of which is five characters long) must appear in the text.
* Similarly, even though the substring literal `-->` is shorter than five characters, the search optimization service determines
  that the substring `-->-->` (which is longer than five characters) must appear in the text.

The search optimization service can use search access paths to match these substrings, which can improve the performance of the
query.

---
title: Speeding up text queries with search optimization
source: https://docs.snowflake.com/en/user-guide/search-optimization/text-queries.md
section: User Guide
---

# Speeding up text queries with search optimization

Search optimization can improve the performance of queries that use the [SEARCH](../../sql-reference/functions/search.md)
and [SEARCH_IP](../../sql-reference/functions/search_ip.md) functions. These queries search for character data (text) and IP
addresses in specified columns from one or more tables, including elements in VARIANT, OBJECT, and ARRAY columns.

The following sections provide more information about search optimization support for text queries:

* Enabling search optimization for text queries
* Conditions for runtime use of FULL_TEXT search optimization
* Examples of ADD (and DROP) FULL_TEXT search optimization

## Enabling search optimization for text queries

To improve the performance of text queries on a table, use the
[ON FULL_TEXT clause in the ALTER TABLE … ADD SEARCH OPTIMIZATION command](../../sql-reference/sql/alter-table.md)
for specific columns. Enabling search optimization at the table level doesn’t enable it for queries that use the
SEARCH or SEARCH_IP function.

For example:

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(play, character, line);
```

For more information, see [Enabling and disabling search optimization](enabling.md).

## Conditions for runtime use of FULL_TEXT search optimization

After you have enabled FULL_TEXT search optimization on a table that is queried with the
SEARCH function, the search access path for the optimization can be used during query planning and execution.
The following conditions must be met:

* The search optimization must be ready for use (`active` column = TRUE in the DESCRIBE SEARCH
  OPTIMIZATION output).
* The search optimization must be enabled on a superset of the columns specified in the SEARCH predicate. For example,
  if a table contains VARCHAR columns `c1,c2,c3,c4,c5`, the search optimization covers columns `c1,c2,c3`, and the function
  searches one, two, or three of those columns (but not `c4` or `c5`), the query can benefit from FULL_TEXT search
  optimization.
* The analyzer defined for the search optimization in the ALTER TABLE command must be the same as the analyzer specified in
  the SEARCH function call.

> **Tip:**
>
> To find out if a specific search access path was used for a query, look for a `Search Optimization Access`
> node in the query profile.

## Examples of ADD (and DROP) FULL_TEXT search optimization

The following examples show how to enable FULL_TEXT search optimization on columns in a table to improve query performance
when the SEARCH function is used to query those columns.

### Enable FULL_TEXT search optimization with a specific analyzer

The following example enables FULL_TEXT search optimization on one column and specifies an analyzer.
The combination of optimization type and analyzer (`method`) is reflected in the DESCRIBE output.

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(line, ANALYZER => 'UNICODE_ANALYZER');
```

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+--------+------------------+--------+
| expression_id | method                     | target | target_data_type | active |
|---------------+----------------------------+--------+------------------+--------|
|             1 | FULL_TEXT UNICODE_ANALYZER | LINE   | VARCHAR(2000)    | true   |
+---------------+----------------------------+--------+------------------+--------+
```

If you enable FULL_TEXT search optimization on the same column with the default analyzer, the DESCRIBE output
returns two rows and differentiates the two entries by expression ID and method.

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(line);
```

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+--------+------------------+--------+
| expression_id | method                     | target | target_data_type | active |
|---------------+----------------------------+--------+------------------+--------|
|             1 | FULL_TEXT UNICODE_ANALYZER | LINE   | VARCHAR(2000)    | true   |
|             2 | FULL_TEXT DEFAULT_ANALYZER | LINE   | VARCHAR(2000)    | false  |
+---------------+----------------------------+--------+------------------+--------+
```

### Enable FULL_TEXT search optimization on a VARIANT column

The following command enables FULL_TEXT search optimization on a VARIANT column.
(This `car_sales` table and its data are described under [Querying Semi-structured Data](../querying-semistructured.md).)

```sqlexample
ALTER TABLE car_sales ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(src);

DESCRIBE SEARCH OPTIMIZATION ON car_sales;
```

```output
+---------------+----------------------------+--------+------------------+--------+
| expression_id | method                     | target | target_data_type | active |
|---------------+----------------------------+--------+------------------+--------|
|             1 | FULL_TEXT DEFAULT_ANALYZER | SRC    | VARIANT          | true   |
+---------------+----------------------------+--------+------------------+--------+
```

### Enable FULL_TEXT search optimization on an OBJECT column

The following example enables FULL_TEXT search optimization on an OBJECT column.

First, create a table with an OBJECT column and insert data:

```sqlexample
CREATE OR REPLACE TABLE so_object_example (object_column OBJECT);

INSERT INTO so_object_example (object_column)
  SELECT OBJECT_CONSTRUCT('a', 1::VARIANT, 'b', 2::VARIANT);
```

The following command enables FULL_TEXT search optimization on the OBJECT column.

```sqlexample
ALTER TABLE so_object_example ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(object_column);

DESCRIBE SEARCH OPTIMIZATION ON so_object_example;
```

```output
+---------------+----------------------------+---------------+------------------+--------+
| expression_id | method                     | target        | target_data_type | active |
|---------------+----------------------------+---------------+------------------+--------|
|             1 | FULL_TEXT DEFAULT_ANALYZER | OBJECT_COLUMN | OBJECT           | true   |
+---------------+----------------------------+---------------+------------------+--------+
```

### Enable FULL_TEXT search optimization on an ARRAY column

The following example enables FULL_TEXT search optimization on an ARRAY column.

First, create a table with an ARRAY column and insert data:

```sqlexample
CREATE OR REPLACE TABLE so_array_example (array_column ARRAY);

INSERT INTO so_array_example (array_column)
  SELECT ARRAY_CONSTRUCT('a', 'b', 'c');
```

The following command enables FULL_TEXT search optimization on the ARRAY column.

```sqlexample
ALTER TABLE so_array_example ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(array_column);

DESCRIBE SEARCH OPTIMIZATION ON so_array_example;
```

```output
+---------------+----------------------------+--------------+------------------+--------+
| expression_id | method                     | target       | target_data_type | active |
|---------------+----------------------------+--------------+------------------+--------|
|             1 | FULL_TEXT DEFAULT_ANALYZER | ARRAY_COLUMN | ARRAY            | true   |
+---------------+----------------------------+--------------+------------------+--------+
```

### Drop FULL_TEXT optimization from one or more columns

You can enable FULL_TEXT optimization on multiple columns, then later drop the optimization
from one or more of those columns. The remaining columns are still optimized.

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(play, act_scene_line, character, line, ANALYZER => 'UNICODE_ANALYZER');

DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+----------------+------------------+--------+
| expression_id | method                     | target         | target_data_type | active |
|---------------+----------------------------+----------------+------------------+--------|
|             1 | FULL_TEXT UNICODE_ANALYZER | PLAY           | VARCHAR(50)      | true   |
|             2 | FULL_TEXT UNICODE_ANALYZER | ACT_SCENE_LINE | VARCHAR(10)      | true   |
|             3 | FULL_TEXT UNICODE_ANALYZER | CHARACTER      | VARCHAR(30)      | true   |
|             4 | FULL_TEXT UNICODE_ANALYZER | LINE           | VARCHAR(2000)    | true   |
+---------------+----------------------------+----------------+------------------+--------+
```

```sqlexample
ALTER TABLE lines DROP SEARCH OPTIMIZATION ON 1, 2, 3;
```

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+--------+------------------+--------+
| expression_id | method                     | target | target_data_type | active |
|---------------+----------------------------+--------+------------------+--------|
|             4 | FULL_TEXT UNICODE_ANALYZER | LINE   | VARCHAR(2000)    | true   |
+---------------+----------------------------+--------+------------------+--------+
```

### Use the wildcard (\*) to enable search optimization on all qualifying columns

The following ALTER TABLE command enables FULL_TEXT search optimization on all four VARCHAR columns in the
`lines` table:

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(*);
```

```sqlexample
DESCRIBE SEARCH OPTIMIZATION ON lines;
```

```output
+---------------+----------------------------+----------------+------------------+--------+
| expression_id | method                     | target         | target_data_type | active |
|---------------+----------------------------+----------------+------------------+--------|
|             1 | FULL_TEXT DEFAULT_ANALYZER | PLAY           | VARCHAR(50)      | true   |
|             2 | FULL_TEXT DEFAULT_ANALYZER | ACT_SCENE_LINE | VARCHAR(10)      | true   |
|             3 | FULL_TEXT DEFAULT_ANALYZER | CHARACTER      | VARCHAR(30)      | true   |
|             4 | FULL_TEXT DEFAULT_ANALYZER | LINE           | VARCHAR(2000)    | true   |
+---------------+----------------------------+----------------+------------------+--------+
```

### Expected error when enabling FULL_TEXT optimization

The following ALTER TABLE command fails with an expected error because one of the specified columns
is a NUMBER column:

```sqlexample
ALTER TABLE lines ADD SEARCH OPTIMIZATION
  ON FULL_TEXT(play, speech_num, act_scene_line, character, line);
```

```output
001128 (42601): SQL compilation error: error line 1 at position 76
Expression FULL_TEXT(IDX_SRC_TABLE.SPEECH_NUM) cannot be used in search optimization.
```

---
title: SQL Development & Management
source: https://docs.snowflake.com/en/user-guide/ecosystem-editors.md
section: User Guide
---

# SQL Development & Management

Snowflake provides the following native SQL development and data querying interfaces:

| Solution |  | Description | Notes |
| --- | --- | --- | --- |
|  | [Snowsight Worksheets](ui-snowsight-worksheets.md) | Browser-based SQL development and editing. | * No installation or configuration required. * Supports multiple, independent working environments that can be opened/closed, named, and reused across multiple sessions   (all work is automatically saved). |
|  | [Snowflake CLI](../developer-guide/snowflake-cli/index.md) | Open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations, including querying, executing DDL/DML commands, and bulk loading/unloading of data. | * Download the installer from the [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html) page. |
|  | [SnowSQL](snowsql.md) | Python-based client for performing all tasks in Snowflake, including querying, executing DDL/DML commands, and bulk loading/unloading of data. | * Download the installer from the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page. |
|  | [Snowflake Extension for Visual Studio Code](vscode-ext.md) | Snowflake provides an extension for Visual Studio Code to enable Snowflake users to write and execute Snowflake SQL statements directly in VSC, using either SQL files or Python files containing Snowpark Python code. | * Install the extension directly from within Visual Studio Code, or indirectly by downloading a specific version. |

In addition, Snowflake works with a variety of 3rd-party SQL tools for managing the modeling, development, and deployment of SQL code in
your Snowflake applications, including, but not limited to:

| Solution |  | Version / Installation Requirements | Notes |
| --- | --- | --- | --- |
|  |  | **Agile Data Engine:** No requirements  **Snowflake:** No requirements |  |
|  |  | **Aginity:** Pro or Team  **Snowflake:** No requirements |  |
|  |  | **DataOps.live:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). |
|  |  | **DBeaver:** 4.3.4 (or higher)  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — automatically downloaded and installed by DBeaver |  |
|  |  | **erwin:** Data Modeler 2020 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [User Guides > erwin Data Modeler … Snowflake Object Support](https://erwin.com/bookshelf/public_html/2020R2/Content/User%20Guides/erwin%20Help/Snowflake_Object_Support.html) (erwin Documentation)   + [User Guides > erwin Data Modeler … Database Connection Parameters](https://erwin.com/bookshelf/public_html/2020R2/Content/User%20Guides/erwin%20Help/Database_Connection_Parameters.html) (erwin Documentation) |
|  |  | **Hackolade:** Studio 5.2.0 (or higher)  **Snowflake:** No requirements | * Additional resources:    + [Snowflake](https://hackolade.com/help/Snowflake.html) (Hackolade Documentation)   + [Connect to a Snowflake instance](https://hackolade.com/help/ConnecttoaSnowflakeinstance.html) (Hackolade Documentation) |
|  |  | **SeekWell:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Connecting to Snowflake](https://intercom.help/seekwell/articles/2725663-connecting-to-snowflake) (SeekWell Help Center) |
|  |  | **Solita Agile Data Engine:** No requirements  **Snowflake:** No requirements |  |
|  |  | **SqlDBM:** No requirements  **Snowflake:** No requirements | * Available for trial via [Snowflake Partner Connect](ecosystem-partner-connect.md). * Additional resources:    + [SqlDBM Partnership with Snowflake](http://blog.sqldbm.com/snowflake-data-modelling-with-sqldbm/) (SqlDBM Blog) |
|  |  | **SQL Workbench:** No requirements  **Snowflake:** [JDBC Driver](../developer-guide/jdbc/jdbc.md) — download from the [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc) | * Additional resources:    + [Configuring SQL Workbench/J to Use Snowflake](https://community.snowflake.com/s/article/configuring-sql-workbenchj-to-use-snowflake)     (Snowflake Community) |
|  |  | **Statsig:** No requirements  **Snowflake:** No requirements | * Additional resources:    + [Integrations > Data Imports > Snowflake](https://docs.statsig.com/integrations/data-imports/snowflake) (Statsig Documentation) |

> **Note:**
>
> This is not a complete list of SQL management tools that work with Snowflake; these are known tools that have been validated for use
> with Snowflake. Other tools can be used with Snowflake; however, we do not guarantee that all features/functionality in these 3rd-party
> tools will interoperate with Snowflake.

---
title: SQL Statements Supported for Preparation
source: https://docs.snowflake.com/en/user-guide/sql-prepare.md
section: User Guide
---

# SQL Statements Supported for Preparation

Some drivers and connectors support the ability to send a SQL statement for preparation before execution. Snowflake supports
preparation for the following types of SQL statements:

* [SELECT](../sql-reference/sql/select.md)
* [Data Manipulation Language (DML) commands](../sql-reference/sql-dml.md)
* [SHOW <objects>](../sql-reference/sql/show.md)

Note that if a driver or connector sends other types of SQL statements for preparation, those statements will not be prepared. For
example, if you send a DDL statement for preparation and execution, Snowflake just executes the statement without preparing it.

---
title: SSDF
source: https://docs.snowflake.com/en/user-guide/cert-ssdf.md
section: User Guide
---

# SSDF

This topic describes how Snowflake supports customers with SSDF compliance requirements.

## Understanding SSDF compliance requirements

The Cybersecurity and Infrastructure Security Agency (CISA) Secure Software Development Framework (SSDF) reinforces
secure by design principles advanced by CISA, Federal government partners, and international allies and requires
software producers serving the federal government to confirm implementation of specific security practices.

Snowflake maintains service offerings that have completed a National Institute of Standards and Technology (NIST)
Special Publication (SP) 800-218 SSDF assessment by a FedRAMP authorized third-party assessment organization (3PAO)
with an accompanying attestation letter available upon request.

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Stage, pipe, and load history replication
source: https://docs.snowflake.com/en/user-guide/account-replication-stages-pipes-load-history.md
section: User Guide
---

# Stage, pipe, and load history replication

This topic provides information about replication support for data pipeline objects and related metadata,
including stages, storage integrations, pipes, and load history. You can replicate these objects to configure failover for ingest and ETL
pipelines across [regions](intro-regions.md) and across [cloud platforms](intro-cloud-platforms.md).

Before you get started, we recommend that you have familiarity with Snowflake support for replication and failover/failback.
For more information, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

## Requirements

> **Important:**
>
> If a database in a target account that you plan to use already contains stages and pipes, we recommend that you contact support
> before enabling replication. When a replication or failover group in your source account includes that database, any pre-existing stages
> and pipes are dropped from the database.

To replicate any external stages that use a storage integration, you must configure your replication or failover group to replicate
`STORAGE INTEGRATIONS`. Otherwise, external stages are replicated without the associated storage integration.

You can use an [ALTER REPLICATION GROUP](../sql-reference/sql/alter-replication-group.md) or
[ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) statement to modify these properties for an existing group.

If you add `INTEGRATIONS` to the `OBJECT_TYPES` list in your ALTER statement,
include any other existing objects in the list to avoid dropping those objects in the target account.
The same applies if you add `STORAGE INTEGRATIONS` to the `ALLOWED_INTEGRATION_TYPES` list.

For example:

```sqlexample
ALTER FAILOVER GROUP my_failover_group SET
  OBJECT_TYPES = ROLES, INTEGRATIONS
  ALLOWED_INTEGRATION_TYPES = API INTEGRATIONS, STORAGE INTEGRATIONS;
```

> **Note:**
>
> Your cloud storage provider might limit replication of data pipeline objects between commercial and government cloud regions. To avoid
> government cloud data replication limitations, configure your failover resources in any region accessible to your government cloud region.
> For more information about government cloud limitations, review your cloud storage provider’s documentation.

## Replication and stages

This section describes the current level of replication functionality that Snowflake supports for different types of stages.

### Replication of internal stages

The following table describes how replication works for each type of internal stage.

| Type | Description of Replication Support |
| --- | --- |
| Table stage | Empty table stages are created for tables in a replicated database. Files on table stages are not replicated. |
| User stage | User and user stage replication requires Business Critical Edition (or higher).  Empty user stages are created for replicated users. Files on user stages are not replicated. |
| Named stage | Named internal stages are replicated when you replicate a database.  The stage must have a directory table enabled on it in order to replicate the files on the stage. |

### Replication of external stages

> **Note:**
>
> Snowflake does not replicate files on an external stage.
> The cloud storage URL points to the same location for external stages in primary and secondary databases.

The following table describes how replication works for each type of external stage.

| Type | Description of Replication Support |
| --- | --- |
| Named stage with no credentials (public storage location) | Named external stages are replicated when you replicate a database. The files on an external stage are not replicated. |
| Named stage with credentials (private storage location) | Replicated stages include the cloud provider credentials, such as secret keys or access tokens. |
| Named stage with storage integration (private storage location) | Storage integration replication requires Business Critical Edition (or higher).  The replication or failover group must include `STORAGE INTEGRATIONS` in the `ALLOWED_INTEGRATION_TYPES` list. For more information, see [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md).  You must also take action to configure the trust relationships for your cloud storage in the target accounts. For more information, see [Configure cloud storage access for secondary storage integrations](account-replication-config.md). |

> **Note:**
>
> To associate a secondary stage or pipe with a different cloud storage location than the one associated with the primary object,
> contact the support team. For example, you might choose a location in another region.

### Considerations

The following constraints apply to stage objects:

* Snowflake currently supports stage replication as part of group-based replication (replication and failover groups).
  Stage replication is not supported for database replication.
* You can replicate an external stage. However, the files on an external stage are not replicated.
* You can replicate an internal stage. To replicate the files on an internal stage, you must enable a directory table on the stage.
  Snowflake replicates only the files that are mapped by the directory table.
* When you replicate an internal stage with a directory table, you cannot disable the directory table on the primary or secondary stage.
  The directory table contains critical information about replicated files and files loaded using a COPY statement.
* A refresh operation will fail if the directory table on an internal stage contains a file that is larger than 5GB. To work around this
  limitation, move any files larger than 5GB to a different stage.

  You cannot disable the directory table on a primary or secondary stage, or any stage that has previously been replicated. Follow
  these steps *before* you add the database that contains the stage to a replication or failover group.

  1. [Disable the directory table](../sql-reference/sql/alter-stage.md) on the primary stage.
  2. Move the files that are larger than 5GB to another stage that does not have a directory table enabled.
  3. After you move the files to another stage, re-enable the directory table on the primary stage.
* Files on user stages and table stages are not replicated.
* For named external stages that use a storage integration, you must configure the trust relationship for secondary storage integrations
  in your target accounts prior to failover. For more information, see [Configure cloud storage access for secondary storage integrations](account-replication-config.md).
* If you replicate an external stage with a directory table, and you have configured
  [automated refresh](data-load-dirtables-auto.md) for the source
  directory table, you must configure automated refresh for the secondary directory table before failover. For more information,
  see [Configure automated refresh for directory tables on secondary stages](account-replication-config.md).
* A copy command might take longer than expected if the directory table on a replicated stage is not consistent with the
  replicated files on the stage. To make a directory table consistent, refresh it with an
  [ALTER STAGE … REFRESH](../sql-reference/sql/alter-stage.md) statement.
  To check the consistency status of a directory table, use the [SYSTEM$GET_DIRECTORY_TABLE_STATUS](../sql-reference/functions/system_get_directory_table_status.md) function.

## Replication and pipes

This section describes the current level of replication functionality supported for different types of pipes.

Snowflake supports replication for the following:

* Pipe objects, including auto-ingest and REST endpoint pipes that load data from external stages.
* Pipe-level parameters.
* Privilege grants on pipe objects.

> **Note:**
>
> To associate a secondary stage or pipe with a different cloud storage location than the one associated with the primary object,
> contact the support team. For example, you might choose a location in another region.

### Pipes in secondary databases

Pipes in a secondary database are in a `READ_ONLY` execution state and receive notifications
but do not load data until you promote the secondary database to serve as the primary.
After you promote a secondary database, the pipes will transition to a `FAILING_OVER` execution state.
Once failover is complete, the pipes should be in the `RUNNING` execution state
and begin to load any data that is available since the last refresh time (that is, the last time that the former primary database was updated).

### Replication of auto-ingest pipes

In the event of a failover, a replicated auto-ingest pipe becomes the new primary pipe and can do the following:

* Load any data that has not yet been loaded.
  This includes any data that is new since the newly promoted primary database was last refreshed.
* Continue to receive notifications when the stage has new files to load, and loads data from those files.

  > **Note:**
  >
  > To receive notifications, you must configure a secondary auto-ingest pipe in a target account prior to failover.
  > For more information, see [Configure notifications for secondary auto-ingest pipes](account-replication-config.md).

### Replication of REST endpoint pipes

For pipes that use the [Snowpipe REST API](data-load-snowpipe-rest-load.md) to load data,
Snowflake replicates the pipes and their load history metadata to each target account that you specify.
There are no additional configuration steps you need to take on the target accounts.
For a detailed list of load history metadata, see [Load metadata](data-load-considerations-load.md).

To continue data loading in the event of a failover, call the REST API from the newly-promoted source account.

### Considerations

The following constraints apply to pipe objects:

* Snowflake currently supports pipe replication as part of group-based replication (replication and failover groups).
  Pipe replication is not supported for database replication.
* Snowflake replicates the copy history of a pipe only when the pipe belongs to the same replication group as its target table.
* Replication of notification integrations is not supported.
* Snowflake only replicates load history after the latest table truncate.
* To receive notifications, you must configure a secondary auto-ingest pipe in a target account prior to failover.
  For more information, see [Configure notifications for secondary auto-ingest pipes](account-replication-config.md).
* Use the [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md) function to resolve any pipes not in their expected execution state after failover.
* Snowflake doesn’t support replication and failover for Snowpipe with the Kafka connector, but Snowflake does support replication and failover for Snowpipe Streaming with the Kafka connector. For more information, see [Snowpipe Streaming and the Kafka connector](account-replication-failover-failback.md).

## Example 1: Replicate a named internal stage

This example demonstrates how replication works for internal stages. In particular, the example shows how the directory table
is the single source of truth for stage metadata before and after replication.

The first part of the example completes the following tasks in a source account.

1. Create an internal stage named `my_int_stage` with a directory table enabled to replicate the files on the stage. Then copy data
   from a table named `my_table` into files on the stage.

   > **Note:**
   >
   > The example refreshes the directory table after loading `file1` and `file2` onto the stage to synchronize
   > the table metadata with the latest set of files in the stage definition for the directory tables.
   > However, no refresh operation occurs after loading `file3`.

   ```sqlexample
   CREATE OR REPLACE STAGE my_stage
     DIRECTORY = (ENABLE = TRUE);

   COPY INTO @my_stage/folder1/file1 from my_table;
   COPY INTO @my_stage/folder2/file2 from my_table;
   ALTER STAGE my_stage REFRESH;

   COPY INTO @my_stage/folder3/file3 from my_table;
   ```
2. Create a failover group:

   ```sqlexample
   CREATE FAILOVER GROUP my_stage_failover_group
     OBJECT_TYPES = DATABASES
     ALLOWED_DATABASES = my_database_1
     ALLOWED_ACCOUNTS = myorg.my_account_2;
   ```

The second part of the example completes the replication and failover process in a target account:

1. Create a failover group as a replica of the failover group in the source account, refresh the objects in the new failover group,
   and promote the target account to serve as the source account.

   ```sqlexample
   CREATE FAILOVER GROUP my_stage_failover_group
     AS REPLICA OF myorg.my_account_1.my_stage_failover_group;

   ALTER FAILOVER GROUP my_stage_failover_group REFRESH;

   ALTER FAILOVER GROUP my_stage_failover_group PRIMARY;
   ```
2. Next, refresh the directory table on the replicated stage and copy all of the
   files tracked by the directory table on `my_stage` into a table named `my_table` .

   > **Note:**
   >
   > The COPY INTO statement loads `file1` and `file2` into the table, but not `file3`.
   > This is because the directory table was not refreshed after adding `file3` in the source account.

   ```sqlexample
   ALTER STAGE my_stage REFRESH;

   COPY INTO my_table FROM @my_stage;
   ```

## Example 2: Replicate an external stage and storage integration

This example provides a sample workflow for replicating an external stage and storage integration to a target account.

The example assumes that you have already completed the following:
[Configured secure access to your Amazon S3 bucket](data-load-snowpipe-auto-s3.md).

The first part of the example completes the following tasks in a source account.

1. Create a storage integration for an Amazon S3 bucket in database `my_database_2`.

   ```sqlexample
   CREATE STORAGE INTEGRATION my_storage_int
     TYPE = external_stage
     STORAGE_PROVIDER = 's3'
     STORAGE_ALLOWED_LOCATIONS = ('s3://mybucket/path')
     STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket/blockedpath')
     ENABLED = true;
   ```
2. Create an external stage in database `my_database_2` using storage integration `my_storage_int`.

   ```sqlexample
   CREATE STAGE my_ext_stage
     URL = 's3://mybucket/path'
     STORAGE_INTEGRATION = my_storage_int
   ```
3. Create a failover group and include database `my_database_2` and storage integration objects.

   ```sqlexample
   CREATE FAILOVER GROUP my_external_stage_fg
     OBJECT_TYPES = databases, integrations
     ALLOWED_INTEGRATION_TYPES = storage integrations
     ALLOWED_DATABASES = my_database_2
     ALLOWED_ACCOUNTS = myorg.my_account_2;
   ```

The second part of the example completes the replication and failover process in a target account:

1. Create a failover group as a replica of the failover group in the source account and refresh.

   ```sqlexample
   CREATE FAILOVER GROUP my_external_stage_fg
     AS REPLICA OF myorg.my_account_1.my_external_stage_fg;

   ALTER FAILOVER GROUP my_external_stage_fg REFRESH;
   ```
2. After you replicate the storage integration to the target account, you must take additional steps to update your cloud
   provider permissions to grant the replication integration access to your cloud storage. For more information, see
   [Configure cloud storage access for secondary storage integrations](account-replication-config.md).

## Example 3: Replicate an auto-ingest pipe

This example provides a sample workflow for replicating a pipe that uses
an [Amazon Simple Notification Service (SNS) topic with Amazon Simple Queue Service (SQS) to automate Snowpipe](data-load-snowpipe-auto-s3.md).

The example assumes that you have already completed the following tasks:

* [Created and configured a storage integration for Amazon S3](data-load-snowpipe-auto-s3.md). For example
  purposes, we use a storage integration named `my_s3_storage_int`.
* [Created an Amazon SNS topic and subscription, and subscribed the Snowflake SQS queue to your SNS topic](data-load-snowpipe-auto-s3.md).
* Created an external stage that references your storage integration. For example
  purposes, we use a stage named `my_s3_stage`. For instructions, see [CREATE STAGE](../sql-reference/sql/create-stage.md).

Start with the following tasks in a source account.

1. Use the [CREATE PIPE](../sql-reference/sql/create-pipe.md) command to create a pipe with auto-ingest enabled that loads data from the external stage into a table named `mytable`.

   ```sqlexample
   CREATE PIPE snowpipe_db.public.mypipe AUTO_INGEST=TRUE
    AWS_SNS_TOPIC='<topic_arn>'
    AS
      COPY INTO snowpipe_db.public.mytable
      FROM @snowpipe_db.public.my_s3_stage
      FILE_FORMAT = (TYPE = 'JSON');
   ```
2. Refresh the pipe with an [ALTER PIPE](../sql-reference/sql/alter-pipe.md) statement to load data from the stage from the last 7 days.

   ```sqlexample
   ALTER PIPE mypipe REFRESH;
   ```
3. Finally, use [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) to create a failover group
   that allows replication of storage integrations.

   ```sqlexample
   CREATE FAILOVER GROUP my_pipe_failover_group
     OBJECT_TYPES = DATABASES, INTEGRATIONS
     ALLOWED_INTEGRATION_TYPES = STORAGE INTEGRATIONS
     ALLOWED_DATABASES = snowpipe_db
     ALLOWED_ACCOUNTS = myorg.my_account_2;
   ```

The second part of the example completes the replication and failover process in a target account:

1. Create a failover group as a replica of the failover group in the source account.

   ```sqlexample
   CREATE FAILOVER GROUP my_pipe_failover_group
     AS REPLICA OF myorg.my_account_1.my_pipe_failover_group;
   ```
2. Execute a [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md) statement to retrieve the ARN for the
   AWS IAM User for your Snowflake account on the secondary deployment.

   Use the ARN to grant the IAM user permissions to access your S3 bucket.
   [See Step 5: Grant the IAM User Permissions to Access Bucket Objects](data-load-s3-config-storage-integration.md).

   ```sqlexample
   DESC INTEGRATION my_s3_storage_int;
   ```
3. Call the [SYSTEM$GET_AWS_SNS_IAM_POLICY](../sql-reference/functions/system_get_aws_sns_iam_policy.md) system function to generate an IAM policy that grants the new SQS queue permission
   to subscribe to your SNS topic. Snowflake created the new SQS queue in your target account when you replicated the failover group from your
   source account.

   ```sqlsyntax
   SELECT SYSTEM$GET_AWS_SNS_IAM_POLICY('<topic_arn>');
   ```

   `topic_arn` is the Amazon Resource Name (ARN) of the SNS topic that you created for the original pipe in your source account.

   Then, [Subscribe the new Amazon SQS queue to your SNS topic](data-load-snowpipe-auto-s3.md).
4. Refresh the objects in your new failover group.

   ```sqlexample
   ALTER FAILOVER GROUP my_pipe_failover_group REFRESH;
   ```
5. Finally, promote the target account to serve as the source account with the [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md) command.

   ```sqlexample
   ALTER FAILOVER GROUP my_pipe_failover_group PRIMARY;
   ```

   The `mypipe` pipe will begin to load any data that was made available since the
   last time the failover group was refreshed in the source account.

   To verify that the replicated pipe is working, query the table from the pipe’s COPY statement.

   ```sqlexample
   SELECT * FROM mytable;
   ```

## Migrate to Amazon Simple Notification Service (SNS)

This section covers how to migrate from sending Amazon S3 event notifications directly to an Amazon Simple Queue Service (SQS)
queue to using an Amazon Simple Notification Service (SNS) topic for the following scenarios:

* [Refresh directory tables automatically for Amazon S3](data-load-dirtables-auto-s3.md)
* [Automating Snowpipe for Amazon S3](data-load-snowpipe-auto-s3.md)

When you replicate a directory table or pipe,
Snowflake creates a new SQS queue in your target account to handle automation. You can configure a single SNS topic to
deliver event notifications from your S3 bucket to all SQS queues across multiple accounts.
By broadcasting your S3 event notification(s) to every SQS queue, you reduce the risk of losing notifications and data after failover.

> **Note:**
>
> If you already use SNS, migration is not necessary.
> Instead, follow the usual steps to configure automation with SNS for secondary directory tables or auto-ingest pipes before failover.
>
> * [Configure automated refresh for directory tables on secondary stages](account-replication-config.md)
> * [Configure notifications for secondary auto-ingest pipes](account-replication-config.md)

### Prerequisites

To migrate, you must meet the following conditions:

* You have already set up one or more event notifications for your S3 bucket. For instructions, see the topic for your use case:

  + [Refreshing Directory Tables Automatically for Amazon S3: Creating a New S3 Event Notification](data-load-dirtables-auto-s3.md)
  + [Creating a New S3 Event Notification to Automate Snowpipe](data-load-snowpipe-auto-s3.md)
* You have already created a replication or failover group in a target account that includes a stage with a directory table or a pipe.

### Migrate to an SNS Topic

1. Create an SNS topic in your AWS account.
   For instructions, see [Creating an Amazon SNS topic](https://docs.aws.amazon.com/sns/latest/dg/sns-create-topic.html)
   in the AWS SNS documentation.
2. Subscribe your target destinations (for example, other SQS queues or AWS Lambda workloads) for your S3 event notification(s)
   to your SNS topic. SNS publishes event notifications for your bucket to all subscribers to the topic.
   For instructions, see the [AWS SNS documentation](https://docs.aws.amazon.com/sns/latest/dg/sns-create-subscribe-endpoint-to-topic.html).
3. Update the access policy for your topic with the following permissions:

   * Allow the Snowflake IAM user to subscribe the SQS queue that is in your *target* account
     to your topic.
   * Allow Amazon S3 to publish event notifications from your bucket to the SNS topic.

   For instructions, see [Step 1: Subscribe the Snowflake SQS Queue to the SNS Topic](data-load-snowpipe-auto-s3.md).
4. In your target Snowflake account, call the [SYSTEM$CONVERT_PIPES_SQS_TO_SNS](../sql-reference/functions/system_convert_pipes_sqs_to_sns.md) function.
   The function subscribes the SQS queue in your *target* account to your SNS topic without interrupting metadata
   synchronization or ingestion work.

   Specify your S3 bucket name and SNS topic ARN.

   ```sqlexample
   SELECT SYSTEM$CONVERT_PIPES_SQS_TO_SNS('s3_mybucket', 'arn:aws:sns:us-west-2:001234567890:MySNSTopic')
   ```
5. Update your S3 event notifications to use your SNS topic as a destination. For instructions, see the
   [Amazon S3 User Guide](https://docs.aws.amazon.com/AmazonS3/latest/userguide/enable-event-notifications.html).

After you complete these steps, the SQS queue automatically unbinds from your S3 event notification(s).
All of the directory tables and pipes that use the specified S3 bucket will start using SNS as the source of notifications.

---
title: Staging data
source: https://docs.snowflake.com/en/user-guide/data-load-considerations-stage.md
section: User Guide
---

# Staging data

This topic provides best practices, general guidelines, and important considerations for preparing your data files for loading.

## Organizing data by path

Both internal (i.e. Snowflake) and external (Amazon S3, Google Cloud Storage, or Microsoft Azure) stage references can include a path (or *prefix* in AWS terminology). When staging regular data sets, we recommend partitioning the data into logical paths that include identifying details such as geographical location or other source identifiers, along with the date when the data was written.

Organizing your data files by path lets you copy any fraction of the partitioned data into Snowflake with a single command. This allows you to execute concurrent COPY statements that match a subset of files, taking advantage of parallel operations.

For example, if you were storing data for a North American company by geographical location, you might include identifiers such as continent, country, and city in paths along with data write dates:

> * Canada/Ontario/Toronto/2016/07/10/05/
> * United_States/California/Los_Angeles/2016/06/01/11/
> * United_States/New York/New_York/2016/12/21/03/
> * United_States/California/San_Francisco/2016/08/03/17/

When you create a named stage, you can specify any part of a path. For example, create an external stage using one of the above example paths:

> ```sqlexample
> CREATE STAGE my_stage URL='s3://mybucket/United_States/California/Los_Angeles/' CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z');
> ```

You can also add a path when you stage files in an internal user or table stage. For example, stage `mydata.csv` in a
specific path in the `t1` table stage:

> ```sqlexample
> PUT file:///data/mydata.csv @%t1/United_States/California/Los_Angeles/2016/06/01/11/
> ```

When loading your staged data, narrow the path to the most granular level that includes your data for improved data load performance.

Use any of the following options to further confine the list of files to load:

> * If the file names match except for a suffix or extension, include the matching part of the file names in the path, e.g.:
>
>   ```sqlexample
>   COPY INTO t1 from @%t1/United_States/California/Los_Angeles/2016/06/01/11/mydata;
>   ```
> * Add the FILES or PATTERN options (see [Options for selecting staged data files](data-load-considerations-load.md)), e.g.:
>
>   > ```sqlexample
>   > COPY INTO t1 from @%t1/United_States/California/Los_Angeles/2016/06/01/11/
>   >   FILES=('mydata1.csv', 'mydata1.csv');
>   >
>   > COPY INTO t1 from @%t1/United_States/California/Los_Angeles/2016/06/01/11/
>   >   PATTERN='.*mydata[^[0-9]{1,3}$$].csv';
>   > ```

---
title: Staging data files from a local file system
source: https://docs.snowflake.com/en/user-guide/data-load-local-file-system-stage.md
section: User Guide
---

# Staging data files from a local file system

Execute [PUT](../sql-reference/sql/put.md) using the [Snowflake CLI](../developer-guide/snowflake-cli/index.md) client, the [SnowSQL client](snowsql.md), or [Drivers](../developer-guide/drivers.md) to upload (stage) local data files into an internal stage.

If you want to load a few small local data files into a named internal stage, you can also use Snowsight.
Refer to [Staging files using Snowsight](data-load-local-file-system-stage-ui.md).

## Staging the data files

User Stage
:   The following example uploads a file named `data.csv` in the `/data` directory on your local machine to
    your user stage and prefixes the file with a folder named `staged`.

    Note that the `@~` character combination identifies a user stage.

    * Linux or macOS

      > ```sqlexample
      > PUT file:///data/data.csv @~/staged;
      > ```
    * Windows

      > ```sqlexample
      > PUT file://C:\data\data.csv @~/staged;
      > ```

Table Stage
:   The following example uploads a file named `data.csv` in the `/data` directory on your local machine to
    the stage for a table named `mytable`.

    Note that the `@%` character combination identifies a table stage.

    * Linux or macOS

      > ```sqlexample
      > PUT file:///data/data.csv @%mytable;
      > ```
    * Windows

      > ```sqlexample
      > PUT file://C:\data\data.csv @%mytable;
      > ```

Named Stage
:   The following example uploads a file named `data.csv` in the `/data` directory on your local machine to a
    named internal stage called `my_stage`. See [Choosing an internal stage for local files](data-load-local-file-system-create-stage.md) for information on named stages.

    In SQL, note that the `@` character by itself identifies a named stage.

    * Linux or macOS

      SQLPython

      ```sqlexample
      PUT file:///data/data.csv @my_stage;
      ```

      ```python
      my_stage_res = root.databases["<database>"].schemas["<schema>"].stages["my_stage"]
      my_stage_res.put("/data/data.csv", "/")
      ```
    * Windows

      SQLPython

      ```sqlexample
      PUT file://C:\data\data.csv @my_stage;
      ```

      ```python
      my_stage_res = root.databases["<database>"].schemas["<schema>"].stages["my_stage"]
      my_stage_res.put("C:/data/data.csv", "/")
      ```

## Listing staged data files

To see files that have been uploaded to a Snowflake stage, use the [LIST](../sql-reference/sql/list.md) command:

User stage:

```sqlexample
LIST @~;
```

Table stage:

```sqlexample
LIST @%mytable;
```

Named stage:

SQLPython

```sqlexample
LIST @my_stage;
```

```python
stage_files = root.databases["<database>"].schemas["<schema>"].stages["my_stage"].list_files()
for stage_file in stage_files:
  print(stage_file)
```

**Next:** [Copy data from an internal stage](data-load-local-file-system-copy.md)

---
title: Staging files using Snowsight
source: https://docs.snowflake.com/en/user-guide/data-load-local-file-system-stage-ui.md
section: User Guide
---

# Staging files using Snowsight

With Snowsight, you can create and manage named stages without writing SQL. You can also upload files onto a named internal stage so
that you can view your files, reference the files in a Python worksheet, or
[load data from the files into a table](data-load-web-ui.md).

You can’t upload files onto user stages or table stages using Snowsight.
For more information about stages, see [Overview of data loading](data-load-overview.md).

## Creating a stage

You can use Snowsight to create a named internal or external stage.

> **Note:**
>
> To create a stage, you must use a role that is granted or inherits the necessary privileges.
> For more information, see [Access control requirements](../sql-reference/sql/create-stage.md) for [CREATE STAGE](../sql-reference/sql/create-stage.md).

### Create a named internal stage

To use Snowsight to create a named internal stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Stage » Snowflake Managed.
3. In the Create Stage dialog, enter a Stage Name.
4. Select the database and schema where you want to create the stage.
5. Optionally deselect Directory table. Directory tables let you see files on the stage, but require a warehouse and thus incur a cost.
   You can choose to deselect this option for now and enable a directory table later.
6. Select the type of Encryption supported for all files on your stage. For details, see [encryption for internal stages](../sql-reference/sql/create-stage.md). You can’t change the encryption type after you create the stage.

   > > **Note:**
   > >
   > > To enable data access, use server-side encryption. Otherwise, staged files are client-side
   > > encrypted by default and unreadable when downloaded. For more information, see [Server-side encryption for unstructured data access](unstructured-intro.md).
7. Complete the fields to describe your stage. For more information, see [CREATE STAGE](../sql-reference/sql/create-stage.md).
8. Select Create.

### Create a named external stage

To use Snowsight to create a named external stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Stage » External Stage.
3. Select your external cloud storage provider: Amazon S3, Microsoft Azure, or Google Cloud Platform.
4. In the Create Stage dialog, enter a Stage Name.
5. Select the database and schema where you want to create the stage.
6. Enter the URL of your external cloud storage location.
7. If your external storage isn’t public, enable Authentication and enter your details. For more information,
   see [CREATE STAGE](../sql-reference/sql/create-stage.md).
8. Optionally deselect Directory table. Directory tables let you see files on the stage,
   but require a warehouse and thus incur a cost. You can choose to deselect this option for now and enable a directory table later.

   > If you enable Directory table, optionally select Enable auto-refresh, and then select your event notification or
   > notification integration to automatically refresh the directory table when files are added or removed.
   > For more information, see [Automated directory table metadata refreshes](data-load-dirtables-auto.md).
9. If your files are encrypted, enable Encryption, and then enter your details.
10. (Optional) To view a generated SQL statement, expand the SQL Preview.
    To specify additional options for your stage, such as AUTO_REFRESH, you can open this SQL preview in a worksheet.
11. Select Create.

## Uploading files onto a stage

You can use Snowsight to upload files onto a named internal stage.

To upload files onto an external stage, use the tools provided by your external cloud service
(Amazon S3, Microsoft Azure, or Google Cloud Storage).

### Upload files onto a named internal stage

> **Note:**
>
> The maximum file size is 250 MB.
>
> To upload files onto an internal stage, you must use a role that is granted or inherits the USAGE privilege on the database and schema
> and the WRITE privilege on the stage. For more information, see [Stage privileges](security-access-control-privileges.md).

To upload files onto your stage, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Ingestion » Add Data.
3. On the Add Data page, select Load files into a Stage.
4. In the Upload Your Files dialog that appears, select the files that you want to upload. You can upload multiple files at the same time.
5. Select the database schema in which you created the stage, then select the stage.
6. Optionally, select or create a path where you want to save your files within the stage.
7. Select Upload.

After you upload files onto the stage, you can take one of the following actions depending on the file:

* Use the files in a Python worksheet. For more information, see [Add a Python File from a Stage to a Worksheet](../developer-guide/snowpark/python/python-worksheets.md).
* Copy data from the staged files into a table. For more information, see [Load data into an existing table using Snowsight](data-load-web-ui.md) or [Copy data from an internal stage](data-load-local-file-system-copy.md).
* Query the data in the stage. For more information, see [Query data in staged files](querying-stage.md).

## Viewing staged files

You can view staged files using Snowsight. You can view files on both internal and external stages.

> **Note:**
>
> You must use a role that is granted or inherits the USAGE privilege on the database and schema and the READ privilege on the stage
> to perform these steps.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select the database and schema that contain the stage.
4. Select Stages and select the stage for which you want to view files.
5. If prompted, select Enable Directory Table to enable a directory table for the stage so that you can see files.
6. If prompted, select a warehouse to refresh the directory table.

To refresh the directory table on a stage, select the refresh icon.

## Managing staged files

You can use Snowsight to take the following actions on staged files:

* Select  » Load into table to
  [load the file from the stage into a table](data-load-web-ui.md).
* Select  » Copy path to copy the path to the file for use elsewhere, such as in a worksheet.

For files on an internal stage, you can also take the following actions:

* Select  » Download to download the file from the stage.
* Select  » Remove to remove the file from the stage.

> **Note:**
>
> To download a file from an external stage, see [Download staged files in Snowsight](unstructured-intro.md).

## Managing stages

To manage a stage in Snowsight, do the following:

> **Note:**
>
> You must use a role that is granted or inherits the USAGE privilege on the database and schema and the OWNERSHIP privilege on the stage
> to perform these steps.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Select the database and schema that contain the stage.
4. Select Stages and select the stage.
5. Select Stage Details.

You can manage the stage in the following ways:

* Select  » Edit to edit properties or enable a directory table for the stage object.
* Select  » Clone to clone the stage.
* Select  » Drop to drop, or remove, the stage.
* Select  » Transfer Ownership to transfer OWNERSHIP privileges of the stage to another role.

If you want to manage privileges for the stage, use the Privileges section to view, grant, and revoke privileges.

## Troubleshooting

### Files are not visible on an external stage

This issue can occur when an external stage does not have a directory table enabled, or when information about the external storage
location is incorrect.

To fix this issue, try the following:

* Make sure the stage owner has enabled a [directory table](data-load-dirtables.md) on the stage.
* Check that the directory table has been refreshed.
  To refresh the directory table, select your stage in Snowsight, then select the refresh icon.
* Verify that the cloud provider URL is correct. If your URL contains a subpath, ensure that there is a trailing slash.

### Upload files button is unavailable (inactive)

This issue can occur when you don’t have the required privileges to upload files onto an internal stage,
or if another upload is in progress.

To fix this issue, try the following:

* Make sure that you have selected an internal stage.
* Use a role that is granted or inherits the USAGE privilege on the database and schema and the WRITE privilege on the stage.
* Check whether another upload is in progress. Hovering over the inactive button displays information about any in-progress uploads.
  Snowsight also displays a notification for in-progress uploads. If another upload is in progress,
  it must complete before you can upload additional files onto the stage.

---
title: Starting and Stopping sfsql — Obsoleted
source: https://docs.snowflake.com/en/user-guide/sfsql-start-stop.md
section: User Guide
---

# Starting and Stopping sfsql — *Obsoleted*

This topic describes how to use `sfsql` to connect to Snowflake, initiate a session to execute queries and DDL/DML statements, and close the session when you’re finished.

## Connecting to Snowflake and Initiating a Session

To connect to Snowflake and initiate a session, navigate to the directory where the `sfsql` script is located and execute the script using the following syntax.

> ```bash
> sfsql [ -u <user> ] [ -c <password> ] [ -d <database> ] [ -s <schema> ] ... [ -h ]
> ```

> **Note:**
>
> In a Linux environment, you must precede the script name with a dot-slash, e.g. `./sfsql`. If you start the client from any directory other than the `client` install directory, you must also
> include the path after the forward slash.

### Parameters

| Connection Parameter | Equivalent in `login.defaults` | Description |
| --- | --- | --- |
| `-g <host>` | `GSIP=<host>` | Host/IP to connect to. Set by default in `login.defaults` when the client was downloaded from Snowflake. |
|  |  | Format of `<host>` for accounts in US West: `<account_name>.snowflakecomputing.com` |
|  |  | Format of `<host>` for accounts in all other regions: `<account_name>.<region_id>.snowflakecomputing.com` |
| `-a <account_name>` | `ACCOUNT=<name>` | Snowflake account to connect to. Set by default in `login.defaults` when the client was downloaded from Snowflake. |
| `-u <user>` | `USER=<login_name>` | Login name of user to connect with. If this parameter is specified, the `-c` parameter also should be specified. |
| `-c <password>` | `PASSWORD=<password>` | Password for the user. |
| `-b <authenticator>` | `AUTHENTICATOR=<authenticator>` | Use a SAML 2.0-compliant IdP, instead of Snowflake, to authenticate. |
| `-r <role>` | `ROLE=<name>` | Role to use by default for accessing objects in Snowflake (can be changed after login). |
| `-d <database>` | `DATABASE=<name>` | Database to use by default (can be changed after login). |
| `-s <schema>` | `SCHEMA=<name>` | Database schema to use by default (can be changed after login). |
| `-w <warehouse>` | `WAREHOUSE=<name>` | Virtual warehouse to use by default for queries, loading, etc. (can be changed after login). |
| `-f <sqlfile>` | N/A | Execute the specified SQL file. If this parameter is not specified, the client connects in interactive mode. |
| `-t` | `TRACING=<level>` | Logging level. |
| `-y <proxy host>` | `PROXY_HOST=<host>` | HTTP proxy host. |
| `-z <proxy port>` | `PROXY_PORT=<port>` | Port for HTTP proxy host. |
| `-m <mfa_passcode>` | `PASSCODE=<mfa_passcode>` | MFA passcode. |
| `-n` | `PASSCODEINPASSWORD=true` | MFA passcode embedded in password. |
| `-k` | `EXITONERROR=true` | Exit the client when an error is encountered. |
| `-h` | N/A | Help for login parameters (i.e. this list). |

> **Note:**
>
> If you do not specify a login name or password either in `login.defaults` or in the command line, the client prompts you to enter them during login.
>
> If you provide an incorrect login name or password, the client does not connect to Snowflake and exits to the HenPlus shell command line. You must then exit the shell (by typing `exit`, `quit`, or
> using the **[CTRL]-d** keyboard combo) before attempting to log in again. Or, in the HenPlus shell, you can type `connect` followed by a valid JDBC connect string to log in.

During login, the client displays the version of the JDBC driver used by the client, as well as the latest available version of the driver (if it is different from the version in use). This information
can be useful when troubleshooting client issues.

After successful login, the command line displays the login name of the user and the host to which the session is connected in the form `<login_name>@snowflake:<account_name>.snowflakecomputing.com`.

### Example

The following example starts the client installed in a Linux or macOS environment in a directory named `/Users/user1` with a Snowflake user named `user1` and password `1234567a` for the `xy12345`
account:

> ```bash
> $ cd /Users/user1/client
> $ ./sfsql -u user1 -c 1234567a
>
> using GNU readline (Brian Fox, Chet Ramey), Java wrapper by Bernhard Bablok
> henplus config at /Users/ybrenman/.henplus
> ----------------------------------------------------------------------------
>  HenPlus II 0.9.8 "Yay Labor Day"
>  Copyright(C) 1997..2009 Henner Zeller <H.Zeller@acm.org>
>  HenPlus is provided AS IS and comes with ABSOLUTELY NO WARRANTY
>  This is free software, and you are welcome to redistribute it under the
>  conditions of the GNU Public License <http://www.gnu.org/licenses/gpl2.txt>
> ----------------------------------------------------------------------------
> HenPlus II connecting
>  url 'jdbc:snowflake://xy12345.snowflakecomputing.com:443/?account=xy12345&user=user1&ssl=on'
>  driver version 2.3
>  Snowflake - 1.0 (driver change version: 2.3.1, latest change version: 2.4.38)
> no transactions.
>  No Transaction *
>
> user1@snowflake:xy12345.snowflakecomputing.com>
> ```

## Closing a Session and Exiting the Client

To close the current Snowflake session and exit `sfsql`, type `exit` or `quit` on the command line.

When you close a Snowflake session:

* All in-process queries and DDL/DML statements are canceled.
* All temporary tables created during the session are dropped.

> **Note:**
>
> Typing **[CTRL]-d** exits `sfsql`, but does not close the HenPlus shell. You must then type `exit` or `quit` (or type **[CTRL]-d** again) to close the HenPlus shell.

---
title: Storage costs for Time Travel and Fail-safe
source: https://docs.snowflake.com/en/user-guide/data-cdp-storage-costs.md
section: User Guide
---

# Storage costs for Time Travel and Fail-safe

Storage fees are incurred for maintaining historical data during both the Time Travel and Fail-safe periods.

## Storage usage and fees

The fees are calculated for each 24-hour period (that is, 1 day) from the time that the data changed. The number of days that Snowflake maintains
historical data is based on the table type and the Time Travel retention period for the table.

Also, Snowflake minimizes the amount of storage required for historical data by maintaining only the information required to restore the individual table rows that were updated or deleted. As a result,
storage usage is calculated as a percentage of the table that changed. Snowflake only maintains full copies of tables when tables are dropped or truncated.

## Temporary and transient tables

To help manage the storage costs associated with Time Travel and Fail-safe, Snowflake provides two table types, temporary and transient, which do not incur the same fees as standard (that is, permanent) tables:

* Transient tables can have a Time Travel retention period of either 0 or 1 day.
* Temporary tables can also have a Time Travel retention period of 0 or 1 day; however, this retention period ends as soon as the table is dropped or the session in which the table was created ends.
* Transient and temporary tables have no Fail-safe period.

As a result, the maximum additional fees incurred for Time Travel and Fail-safe by these types of tables is limited to 1 day. The following table illustrates the different scenarios, based on
table type:

| Table Type | Time Travel Retention Period (Days) | Fail-safe Period (Days) | Min , Max Historical Data Maintained (Days) |
| --- | --- | --- | --- |
| Permanent | 0 or 1 (for Snowflake Standard Edition) | 7 | **7 , 8** |
| 0 to 90 (for Snowflake Enterprise Edition) | 7 | **7 , 97** |
| Transient | 0 or 1 | 0 | **0 , 1** |
| Temporary | 0 or 1 | 0 | **0 , 1** |

## Considerations for using temporary and transient tables to manage storage costs

When you choose whether to store data in permanent, temporary, or transient tables, consider the following details:

* Temporary tables are dropped when the session in which they were created ends. Data stored in temporary tables is not recoverable after the table is dropped.
* Historical data in transient tables can’t be recovered by Snowflake after the Time Travel retention period ends. Use transient tables only for data you can replicate or reproduce
  independently from Snowflake.
* Long-lived tables, such as fact tables, should always be defined as permanent to ensure they are fully protected by Fail-safe.
* You can define short-lived tables as transient to eliminate Fail-safe costs. For example, you might use transient tables for data with a lifetime of less than 1 day, such as ETL work tables.
* If downtime and the time required to reload lost data are factors, permanent tables, even with their added Fail-safe costs, might offer a better overall solution than transient tables.

> **Note:**
>
> The default type for tables is permanent. To define a table as temporary or transient, you must explicitly specify the type during table creation:
>
> > `CREATE [ OR REPLACE ] [ TEMPORARY | TRANSIENT ] TABLE <name> ...`
>
> For more information, see [CREATE TABLE](../sql-reference/sql/create-table.md).

## Migrating data from permanent tables to transient tables

Migrating data from permanent tables to transient tables involves performing the following tasks:

1. Use [CREATE TABLE … AS SELECT](../sql-reference/sql/create-table.md) to create and populate the transient tables with the data from the original, permanent tables.
2. Apply all access control privileges granted on the original tables to the new tables. For more information about access control, see
   [Overview of Access Control](security-access-control-overview.md).
3. Use [DROP TABLE](../sql-reference/sql/drop-table.md) to delete the original tables.
4. Optionally, use [ALTER TABLE](../sql-reference/sql/alter-table.md) to rename the new tables to match the original tables.

## Cost for backups

The following table describes charges for backups.

For information about credit consumption, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

| Cost component | Description | Billed |
| --- | --- | --- |
| Backup compute | Snowflake-managed compute service generates scheduled backup creation and expiration. | Yes |
| Restore compute | Snowflake-managed warehouses are used to restore objects from backups. | Yes |
| Backup storage | Snowflake-managed cloud object storage to store backup data. | Billed for bytes retained for backups, similar to bytes retained for clones. |

You can monitor costs for backup storage in the [TABLE_STORAGE_METRICS](../sql-reference/account-usage/table_storage_metrics.md)
view using the `RETAINED_FOR_CLONE_BYTES` column, and in the
[BACKUP_STORAGE_USAGE](../sql-reference/account-usage/backup_storage_usage.md) view.

---
title: Storage for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-storage.md
section: User Guide
---

# Storage for Apache Iceberg™ tables

Snowflake tables typically use storage that Snowflake manages.
In contrast, Apache Iceberg™ tables in Snowflake use external storage that you configure and maintain.

This topic provides conceptual information and best practices for Iceberg table storage.

## External volumes

> **Note:**
>
> To connect Snowflake to your external cloud storage for Iceberg tables without using an external volume, use catalog-vended
> credentials. This option is only available for externally managed Iceberg tables.
> For more information, see [Use catalog-vended credentials for Apache Iceberg™ tables](tables-iceberg-configure-catalog-integration-vended-credentials.md).

An external volume is a named, account-level Snowflake object that you use to connect Snowflake to your
external cloud storage for Iceberg tables. An external volume stores an identity and access management (IAM) entity
for your storage location. Snowflake uses the IAM entity to securely connect to your storage for accessing
table data, Iceberg metadata, and manifest files that store the table schema, partitions, and other metadata.

A single external volume can support one or more Iceberg tables.

Each external volume is associated with a particular Active storage location,
and a single external volume can support multiple Iceberg tables. However, the number of external volumes you need depends on how you want to store,
organize, and secure your table data.

You can use a single external volume if you want the data and metadata
for *all* of your Snowflake-Iceberg tables in subdirectories under the same storage location (for example, in the same S3 bucket).
To configure these directories for Snowflake-managed tables, see Data and metadata directories.

Alternatively, you can create multiple external volumes to secure various storage locations differently. For example,
you might create the following external volumes:

* A read-only external volume for externally managed Iceberg tables.
* An external volume configured with read and write access for Snowflake-managed tables.

## Granting Snowflake access to your storage

### Cloud provider storage

To grant Snowflake access to your cloud storage locations for Iceberg tables,
you use the identity and access management service for your cloud provider.
You grant an identity, or principal, limited access to your storage without exchanging secrets.
This is the same access model that Snowflake uses for other integrations, including storage integrations.

Snowflake provisions a principal for your entire Snowflake account when you create an [external volume](tables-iceberg.md).
The principal is as follows, depending on your cloud provider:

| Cloud provider | Snowflake-provisioned principal |
| --- | --- |
| Amazon Web Services (AWS) | [IAM user](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_elements_principal.html#principal-users) |
| Google Cloud | [Service account](https://cloud.google.com/iam/docs/overview#service_account) |
| Azure | [Service principal](https://learn.microsoft.com/en-us/entra/identity-platform/app-objects-and-service-principals?tabs=browser#service-principal-object) |

Snowflake authenticates directly with your storage provider, and the Snowflake-provisioned principal assumes a role that you specify.
The role must have permission to perform operations on your storage location.
For example, Snowflake can read from a storage location only if the role has permission to read from that storage location.

Snowflake requires permission to perform the following actions on Iceberg tables:

|  | Snowflake-managed tables | Tables that use an external Iceberg catalog |
| --- | --- | --- |
| **Amazon S3** | * `s3:GetBucketLocation` * `s3:GetObject` * `s3:ListBucket` * `s3:PutObject` * `s3:DeleteObject` * `s3:GetObjectVersion` * `s3:DeleteObjectVersion` | * `s3:GetBucketLocation` * `s3:GetObject` * `s3:ListBucket` * `s3:GetObjectVersion` |
| **Google Cloud Storage** | * `storage.objects.create` * `storage.objects.delete` * `storage.objects.get` * `storage.objects.list` | * `storage.buckets.get` * `storage.objects.get` * `storage.objects.list` |
| **Azure Storage** | All allowed actions for the [Storage Blob Data Contributor role](https://learn.microsoft.com/en-us/azure/role-based-access-control/built-in-roles/storage#storage-blob-data-contributor) | All allowed actions for the [Storage Blob Data Reader role](https://learn.microsoft.com/en-us/azure/role-based-access-control/built-in-roles/storage#storage-blob-data-reader) |

> **Note:**
>
> The `s3:PutObject` permission grants write access to the external volume location.
> To completely configure write access, you must set the `ALLOW_WRITES` parameter of the external volume to `TRUE` (the default value).

For full instructions on granting Snowflake access to your storage for Iceberg tables, see the following topics:

* [Configure an external volume for Amazon S3](tables-iceberg-configure-external-volume-s3.md)
* [Configure an external volume for Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
* [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md)

### S3-compatible storage

To grant Snowflake access to an [S3-compatible storage location](data-load-s3-compatible-storage.md) for Iceberg tables,
you specify an [S3-compatible storage endpoint](data-load-s3-compatible-storage.md) with credentials when you create an external volume.

For instructions, see [Configure an external volume for S3-compatible storage](tables-iceberg-s3-compatible.md).

## Active storage location

Each external volume supports a single active storage location.
If you specify multiple storage locations in a [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) statement,
Snowflake assigns one location as the active location.
The active location remains the same for the lifetime of the external volume.

Assignment occurs the first time you use the external volume in a CREATE ICEBERG TABLE statement.
Snowflake uses the following logic to choose an active location:

* If the `STORAGE_LOCATIONS` list contains one or more *local* storage locations, Snowflake uses the first local storage location in the list.
  A local storage location is one with the same cloud provider and in the same region as your Snowflake account.
* If the `STORAGE_LOCATIONS` list does not contain any local storage locations, Snowflake selects the first location in the list.

> **Note:**
>
> External volumes that were created before Snowflake version 7.44 might have used different logic to select an active location.

## Verifying storage access

> **Note:**
>
> To verify storage access by using Snowsight, see [Verify an external volume by using Snowsight](tables-iceberg-configure-external-volume.md)

To check that Snowflake can successfully authenticate to your storage provider, call the [SYSTEM$VERIFY_EXTERNAL_VOLUME](../sql-reference/functions/system_verify_external_volume.md)
function.

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_external_volume');
```

> **Note:**
>
> If you receive the following error, your account administrator must activate AWS STS in the Snowflake deployment region.
> For instructions, see
> [Manage AWS STS in an AWS Region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html)
> in the AWS documentation.
>
> ```output
> Error assuming AWS_ROLE:
> STS is not activated in this region for account:<external volume id>. Your account administrator can activate STS in this region using the IAM Console.
> ```

For Snowflake-managed tables, Snowflake automatically verifies access to the active storage location
on your external volume in the following situations:

* The first time you specify that external volume in a CREATE ICEBERG TABLE statement for a Snowflake-managed table.
* The first time you convert a table to use Snowflake as the Iceberg catalog.

The `ALLOW_WRITES` property of the external volume must be set to `TRUE`.

Snowflake tries the following storage operations to verify the storage location.

1. Writing a test file.
2. Reading the file.
3. Listing the contents of the file’s path.
4. Deleting the file.

If any one of the operations fails,
the CREATE ICEBERG TABLE (or ALTER ICEBERG TABLE … CONVERT TO MANAGED) statement fails and you receive an error message.

## File management

This section explains how management of Iceberg table files in storage works, according to the type of Iceberg table.

### Snowflake-managed tables

> **Important:**
>
> * Don’t allow other tools access to delete or overwrite objects that are associated with Snowflake-managed Iceberg tables.
> * Ensure that the Snowflake principal maintains access to your table storage.
>   For more information, see Granting Snowflake access to your storage.

Though *you* configure and manage storage locations for Iceberg tables, Snowflake exclusively operates on the objects
in your storage (data and metadata files) that belong to Snowflake-managed tables. Snowflake runs periodic maintenance
on these table objects to optimize query performance and clean up deleted data.

Queries might fail if other tools delete or overwrite Snowflake-managed table objects.
Similarly, queries on the table and Snowflake’s table maintenance operations will fail
if you revoke the Snowflake principal’s access to your storage.

Snowflake deletes objects after the table retention period expires when Snowflake-managed table data is deleted or the table is dropped.

To configure replication for Snowflake-managed Iceberg tables, see Configure replication for Snowflake-managed Iceberg tables.

#### Data and metadata directories

This section describes the data and metadata directories for Snowflake-managed tables.

These directories can either be organized in a flat or hierarchical layout:

* Flat layout
* Hierarchical layout

> **Note:**
>
> To find the data and metadata directories for any Iceberg table, you can use the [SHOW ICEBERG TABLES](../sql-reference/sql/show-iceberg-tables.md) command.
> The command output includes a `base_location` property that indicates the location of each table’s data and metadata files.

##### Flat layout

This section describes the flat layout in Snowflake for data and metadata directories for Snowflake-managed tables.

When you create a Snowflake-managed table that uses the default flat directory layout (PATH_LAYOUT = FLAT), Snowflake writes all Parquet
data files under a single `data/` directory and all table metadata data files under a single `metadata/` directory. Snowflake also writes
metadata for [Delta-based tables](tables-iceberg-metadata.md).

Snowflake constructs paths using the following patterns, depending on the values specified
for [BASE_LOCATION](../sql-reference/sql/create-iceberg-table-snowflake.md) or
the [BASE_LOCATION_PREFIX](../sql-reference/parameters.md) parameter.
If you specify a `BASE_LOCATION`, Snowflake does not use the BASE_LOCATION_PREFIX in the path.

Where:

* `STORAGE_BASE_URL` is the base URL for the active storage location associated with your external volume.
* `BASE_LOCATION` is the path for a directory where Snowflake should write the table files (specified in CREATE ICEBERG TABLE),
  relative to your external volume location. Specifying a BASE_LOCATION is required for Delta-based tables.
* `randomId` is a random, Snowflake-generated 8-character string.

| BASE_LOCATION defined | BASE_LOCATION_PREFIX defined | Path |
| --- | --- | --- |
| No | No | `STORAGE_BASE_URL/database/schema/table_name.randomId/[data | metadata]/` |
| No | Yes | `STORAGE_BASE_URL/BASE_LOCATION_PREFIX/table_name.randomId/[data | metadata]/` |
| Yes | N/A (ignored) | `STORAGE_BASE_URL/BASE_LOCATION.randomId/[data | metadata]/` |
| ‘’ (empty string) | N/A (ignored) | `STORAGE_BASE_URL/randomId/[data | metadata]/` |

**Organizing table storage with BASE_LOCATION**

> **Note:**
>
> We don’t recommend this option if you plan to rename tables in the future.
>
> After you create a Snowflake-managed table,
> the path to its files in external storage does not change, even if you rename the table.

To organize files in storage for multiple Iceberg tables under the same `STORAGE_BASE_URL`,
consider using the table name as the `BASE_LOCATION` in your CREATE ICEBERG TABLE statement. This way, Snowflake writes data and
metadata to a directory that includes the name of the table.

For example:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE iceberg_table_1 (
  col_1 int,
  col_2 string
)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'iceberg_external_volume'
  BASE_LOCATION = 'iceberg_table_1';

CREATE OR REPLACE ICEBERG TABLE iceberg_table_2 (
  col_1 int,
  col_2 string
)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'iceberg_external_volume'
  BASE_LOCATION = 'iceberg_table_2';
```

The statement results in the following directory structure in your external cloud storage:

```bash
STORAGE_BASE_URL
|-- iceberg_table_1.<randomId>
|   |-- data/
|   |-- metadata/
|-- iceberg_table_2.<randomId>
|   |-- data/
|   |-- metadata/
```

##### Hierarchical layout

This section describes the hierarchical layout in Snowflake for data and metadata directories for Snowflake-managed tables.

When you create a Snowflake-managed table that uses the default flat directory layout (PATH_LAYOUT = HIERARCHICAL), Snowflake writes all
Parquet data files by organizing them in a hierarchical directory structure under the `data/` directory that is based on transforms that
you define when you create a
table. For instructions on how to enable this layout, see [Partitioning with hierarchical paths](tables-iceberg-metadata.md). Snowflake writes all
table metadata data files under a single `metadata/` directory.

Snowflake constructs paths using the following patterns, depending on the values specified
for [BASE_LOCATION](../sql-reference/sql/create-iceberg-table-snowflake.md) or
the [BASE_LOCATION_PREFIX](../sql-reference/parameters.md) parameter.
If you specify a `BASE_LOCATION`, Snowflake does not use the BASE_LOCATION_PREFIX in the path.

Where:

* `STORAGE_BASE_URL` is the base URL for the active storage location associated with your external volume.
* `BASE_LOCATION` is the path for a directory where Snowflake should write the table files (specified in CREATE ICEBERG TABLE),
  relative to your external volume location. Specifying a BASE_LOCATION is required for Delta-based tables.
* `randomId` is a random, Snowflake-generated 8-character string.

| BASE_LOCATION defined | BASE_LOCATION_PREFIX defined | Path |
| --- | --- | --- |
| No | No | `STORAGE_BASE_URL/database/schema/table_name.randomId/[data/<hierarchical_layout> | metadata]/` |
| No | Yes | `STORAGE_BASE_URL/BASE_LOCATION_PREFIX/table_name.randomId/[data/<hierarchical_layout> | metadata]/` |
| Yes | N/A (ignored) | `STORAGE_BASE_URL/BASE_LOCATION.randomId/[data/<hierarchical_layout> | metadata]/` |
| ‘’ (empty string) | N/A (ignored) | `STORAGE_BASE_URL/randomId/[data/<hierarchical_layout> | metadata]/` |

**Organizing table storage with BASE_LOCATION**

> **Note:**
>
> We don’t recommend this option if you plan to rename tables in the future.
>
> After you create a Snowflake-managed table,
> the path to its files in external storage does not change, even if you rename the table.

To organize files in storage for multiple Iceberg tables under the same `STORAGE_BASE_URL`,
consider using the table name as the `BASE_LOCATION` in your CREATE ICEBERG TABLE statement. This way, Snowflake writes data and
metadata to a directory that includes the name of the table.

The following example creates `customer_region_summary` and `orders_by_status` tables, which each use a hierarchical path layout
for their data files based on the following transforms:

* The `customer_region_summary` table is partitioned by `region`
* the `orders_by_status` table is partitioned by `order_status`

```sqlexample
CREATE OR REPLACE ICEBERG TABLE customer_region_summary (
  customer_id int,
  region string
)
  PARTITION BY (region)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'iceberg_external_volume'
  PATH_LAYOUT = HIERARCHICAL
  BASE_LOCATION = 'customer_region_summary';

CREATE OR REPLACE ICEBERG TABLE orders_by_status (
  order_id int,
  order_status string
)
  PARTITION BY (order_status)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'iceberg_external_volume'
  PATH_LAYOUT = HIERARCHICAL
  BASE_LOCATION = 'orders_by_status';
```

The statement results in the following directory structure in your external cloud storage:

```bash
STORAGE_BASE_URL
|-- customer_region_summary.<randomId>
|   |-- data/
|   |   |-- REGION=US/
|   |   |   |-- part-00001-abc123.parquet
|   |   |-- REGION=EU/
|   |       |-- part-00002-def456.parquet
|   |-- metadata/
|
|-- orders_by_status.<randomId>
    |-- data/
    |   |-- ORDER_STATUS=SHIPPED/
    |   |   |-- part-00001-ghi789.parquet
    |   |-- ORDER_STATUS=PENDING/
    |       |-- part-00002-jkl012.parquet
    |-- metadata/
```

### Tables that use an external catalog

Snowflake doesn’t write or delete storage objects for externally managed Iceberg tables
or on external volumes with the `ALLOW_WRITES` property set to `FALSE`.

For external catalogs that you connect to with an external volume, to access your table data and metadata, Snowflake assumes the
access control role that you configure for your external volume.
You grant the role permission to access a storage location (in a bucket or container). All of your table data and metadata files must
be in that location.
For example, if your storage location is an S3 bucket, all of your data and metadata files must exist somewhere in that bucket.

For external catalogs that you connect to by using catalog-vended credentials, Snowflake obtains short-lived, scoped credentials from the
external catalog that allow Snowflake access only to the paths that store the table’s data and metadata.
For more information, see [Use catalog-vended credentials for Apache Iceberg™ tables](tables-iceberg-configure-catalog-integration-vended-credentials.md).

Additionally, [converting a table](tables-iceberg-conversion.md) does not rewrite any data or metadata files.
Snowflake writes to an Iceberg table only after you convert a table to use Snowflake as the catalog.

#### Data and metadata directories

This section describes the data and metadata directories for externally managed tables that you create in a catalog-linked database.
These directories can either be organized in a flat or hierarchical layout:

* Catalog-linked database: Flat layout
* Catalog-linked database: Hierarchical layout

> **Note:**
>
> * To find the data and metadata directories for any Iceberg table that you specified a `base_location` for when you created it, you can use
>   the [SHOW ICEBERG TABLES](../sql-reference/sql/show-iceberg-tables.md) command.
>   The command output includes a `base_location` property that indicates the location of each table’s data and metadata files.
> * For externally managed tables in a standard Snowflake database, Snowflake infers the location of the table from the remote catalog
>   metadata and then writes to the `/data` directory for the table.

##### Catalog-linked database: Flat layout

This section describes the flat layout for data and metadata directories for externally managed Iceberg tables that you create in a
catalog-linked database.

When you create an externally managed table in a catalog-linked database that uses the default flat directory layout (PATH_LAYOUT = FLAT),
Snowflake writes all Parquet data files under a single `data/` directory and all table metadata data files under a
single `metadata/` directory.

Snowflake constructs paths using the following patterns, depending on the values specified
for [BASE_LOCATION](../sql-reference/sql/create-iceberg-table-snowflake.md) or
the [BASE_LOCATION_PREFIX](../sql-reference/parameters.md) parameter.
If you specify a `BASE_LOCATION`, Snowflake does not use the BASE_LOCATION_PREFIX in the path.

> **Note:**
>
> The `BASE_LOCATION_PREFIX` parameter is only supported when you use an external volume to connect to your catalog-linked database.
> The `BASE_LOCATION_PREFIX` parameter isn’t supported when you use catalog-vended credentials to connect to your catalog-linked database.

Where:

* `STORAGE_BASE_URL` is the base URL for the active storage location associated with your external volume or vended credentials.
* `BASE_LOCATION` is the path for a directory where Snowflake should write the table files (specified in CREATE ICEBERG TABLE),
  relative to your external volume location. If you’re using catalog-vended credentials, this must be an absolute path that points to
  an allowed location defined by the remote catalog. Specifying a BASE_LOCATION is required for Delta-based tables.
* `randomId` is a random, Snowflake-generated 8-character string.

| BASE_LOCATION defined | BASE_LOCATION_PREFIX defined | Path |
| --- | --- | --- |
| No | No | `STORAGE_BASE_URL/database/schema/table_name/[data | metadata]/` |
| No | Yes | `STORAGE_BASE_URL/BASE_LOCATION_PREFIX/table_name.randomId/[data | metadata]/` |
| Yes | N/A (ignored) | `STORAGE_BASE_URL/BASE_LOCATION.randomId/[data | metadata]/` |

**Organizing table storage with BASE_LOCATION**

To organize files in storage for multiple Iceberg tables under the same `STORAGE_BASE_URL`,
consider using the table name as the `BASE_LOCATION` in your CREATE ICEBERG TABLE statement. This way, Snowflake writes data and
metadata to a directory that includes the name of the table.

For example:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE iceberg_table_1 (
  col_1 int,
  col_2 string
)
  BASE_LOCATION = 's3://my-bucket/customer_iceberg/my_base_location1';

CREATE OR REPLACE ICEBERG TABLE iceberg_table_2 (
  col_1 int,
  col_2 string
)
  BASE_LOCATION = 's3://my-bucket/customer_iceberg/my_base_location2';
```

The statement results in the following directory structure in your external cloud storage:

```bash
STORAGE_BASE_URL
|-- iceberg_table_1.<randomId>
|   |-- data/
|   |-- metadata/
|-- iceberg_table_2.<randomId>
|   |-- data/
|   |-- metadata/
```

##### Catalog-linked database: Hierarchical layout

This section describes the hierarchical layout for data and metadata directories for externally managed Iceberg tables that you
create in a catalog-linked database.

When you create an externally managed Iceberg table in a catalog-linked database that uses the default flat directory layout
(PATH_LAYOUT = HIERARCHICAL), Snowflake writes all
Parquet data files by organizing them in a hierarchical directory structure under the `data/` directory that is based on transforms that you
define when you create a table. For instructions on
how to enable this layout, see [Partitioning with hierarchical paths](tables-iceberg-metadata.md). Snowflake writes all table metadata data files
under a single `metadata/` directory.

> **Note:**
>
> If you set PATH_LAYOUT = HIERARCHICAL without specifying a PARTITION BY clause, Snowflake uses the
> flat layout for the table. However, if you later
> enable partitioning on the table, Snowflake begins using a hierarchical layout with partitioned writes.
> For more information, see [Partitioning with hierarchical paths](tables-iceberg-metadata.md).

For externally managed tables with a hierarchical layout, Snowflake writes Parquet data files and table metadata to your external cloud
storage. The Parquet data files are organized in a hierarchical directory structure that is based
on transforms that you define when you create a table.

Snowflake constructs paths using the following patterns, depending on the values specified
for [BASE_LOCATION](../sql-reference/sql/create-iceberg-table-snowflake.md) or
the [BASE_LOCATION_PREFIX](../sql-reference/parameters.md) parameter.
If you specify a `BASE_LOCATION`, Snowflake does not use the BASE_LOCATION_PREFIX in the path.

> **Note:**
>
> The `BASE_LOCATION_PREFIX` parameter is only supported when you use an external volume to connect to your catalog-linked database.
> The `BASE_LOCATION_PREFIX` parameter isn’t supported when you use catalog-vended credentials to connect to your catalog-linked database.

Where:

* `STORAGE_BASE_URL` is the base URL for the active storage location associated with your external volume or vended credentials.
* `BASE_LOCATION` is the path for a directory where Snowflake should write the table files (specified in CREATE ICEBERG TABLE),
  relative to your external volume location. If you’re using catalog-vended credentials, this must be an absolute path that points to
  an allowed location defined by the remote catalog. Specifying a BASE_LOCATION is required for Delta-based tables.
* `randomId` is a random, Snowflake-generated 8-character string.

| BASE_LOCATION defined | BASE_LOCATION_PREFIX defined | Path |
| --- | --- | --- |
| No | No | `STORAGE_BASE_URL/database/schema/table_name.randomId/[data/<hierarchical_layout> | metadata]/` |
| No | Yes | `STORAGE_BASE_URL/BASE_LOCATION_PREFIX/table_name.randomId/[data/<hierarchical_layout> | metadata]/` |
| Yes | N/A (ignored) | `STORAGE_BASE_URL/BASE_LOCATION.randomId/[data/<hierarchical_layout> | metadata]/` |

**Organizing table storage with BASE_LOCATION**

To organize files in storage for multiple Iceberg tables under the same `STORAGE_BASE_URL`,
consider using the table name as the `BASE_LOCATION` in your CREATE ICEBERG TABLE statement. This way, Snowflake writes data and
metadata to a directory that includes the name of the table.

The following example creates `customer_region_summary` and `orders_by_status` tables, which each use a hierarchical path layout
for their data files based on the following transforms:

* The `customer_region_summary` table is partitioned by `region`
* the `orders_by_status` table is partitioned by `order_status`

```sqlexample
CREATE OR REPLACE ICEBERG TABLE customer_region_summary (
  customer_id int,
  region string
)
  PARTITION BY (region)
  PATH_LAYOUT = HIERARCHICAL
  BASE_LOCATION = 's3://my-bucket/customer_iceberg/my_base_location1';

CREATE OR REPLACE ICEBERG TABLE orders_by_status (
  order_id int,
  order_status string
)
  PARTITION BY (order_status)
  BASE_LOCATION = 's3://my-bucket/customer_iceberg/my_base_location2';
```

The statement results in the following directory structure in your external cloud storage:

```bash
STORAGE_BASE_URL
|-- customer_region_summary.<randomId>
|   |-- data/
|   |   |-- REGION=US/
|   |   |   |-- part-00001-abc123.parquet
|   |   |-- REGION=EU/
|   |       |-- part-00002-def456.parquet
|   |-- metadata/
|
|-- orders_by_status.<randomId>
    |-- data/
    |   |-- ORDER_STATUS=SHIPPED/
    |   |   |-- part-00001-ghi789.parquet
    |   |-- ORDER_STATUS=PENDING/
    |       |-- part-00002-jkl012.parquet
    |-- metadata/
```

## Enabling storage access logs

To diagnose issues and audit access to the storage locations associated with an external volume, you can enable storage logging.
Storage logs help you identify the cause of missing or corrupted files.

Enable logging with your storage provider. Because you own and manage storage for Iceberg tables,
Snowflake can’t enable logging or auditing on your Iceberg storage locations.

To learn about storage access logs for your storage provider, see the following external topics:

* [Logging options for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/logging-with-S3.html)
* [Usage logs and storage logs for Google Cloud Storage](https://cloud.google.com/storage/docs/access-logs)
* [Azure Storage analytics logging](https://learn.microsoft.com/en-us/azure/storage/common/storage-analytics-logging)

## Protecting files with versioning and object retention

If your Iceberg table data is in a central data repository (or data lake) that is operated on by multiple tools and services,
accidental deletion or corruption might occur. To protect Iceberg table data and ensure retrieval
of accidentally deleted or overwritten data, use storage lifecycle management and versioning offered by your storage provider.

With lifecycle management, you can set retention and tracking rules for storage objects.
To learn about lifecycle management for your storage provider, see the following external topics:

* [Managing your storage lifecycle for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/object-lifecycle-mgmt.html)
* [Object Lifecycle Management for Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle)
* [Lifecycle management policies in Azure](https://learn.microsoft.com/en-us/azure/storage/blobs/lifecycle-management-overview)

To support object recovery, you can also enable versioning for your external cloud storage.

* To enable versioning for Amazon S3, see [Enabling versioning on buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/manage-versioning-examples.html).
* To enable versioning for Google Cloud Storage, see [Use Object Versioning](https://cloud.google.com/storage/docs/using-object-versioning).
* To enable versioning for Azure, see [Enable blob versioning](https://learn.microsoft.com/en-us/azure/storage/blobs/versioning-enable?tabs=portal#enable-blob-versioning).

## Encrypting table files

Snowflake can read Iceberg table files in storage that you encrypt using common server-side encryption (SSE) schemes.
You should use your cloud service provider to manage encryption keys,
and grant the Snowflake principal access to your keys if you use a customer-managed key.

For Amazon S3, Snowflake supports the following SSE options:

| SSE option | Configuration |
| --- | --- |
| SSE with Amazon S3 managed keys (SSE-S3) | Specify `ENCRYPTION = ( TYPE = 'AWS_SSE_S3' )` in the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command. |
| SSE with AWS KMS keys (SSE-KMS) | Specify `ENCRYPTION = ( TYPE = 'AWS_SSE_KMS' KMS_KEY_ID='my_key' )` in the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.  You must also grant privileges required for SSE-KMS encryption. For instructions, see Step 3 in [Configure an external volume for Amazon S3](tables-iceberg-configure-external-volume-s3.md). |

For Google Cloud Storage, Snowflake supports the following SSE option:

| SSE option | Configuration |
| --- | --- |
| SSE using keys stored in Google Cloud KMS | Specify `ENCRYPTION = ( TYPE = 'GCS_SSE_KMS' KMS_KEY_ID = 'my_key' )` in the [CREATE EXTERNAL VOLUME](../sql-reference/sql/create-external-volume.md) command.  You must also [Grant the GCS service account permissions on the Google Cloud Key Management Service keys](tables-iceberg-configure-external-volume-gcs.md). |

## Configure replication for Snowflake-managed Iceberg tables

You can replicate Snowflake-managed Iceberg tables by using a failover or replication group. Snowflake replicates
a Snowflake-managed Iceberg table when you add the following objects to a failover or replication group:

* The parent database for the table
* The external volume that the table uses

For more information, see [Configure replication for Snowflake-managed Apache Iceberg™ tables](tables-iceberg-replication.md).

---
title: Storage lifecycle policies
source: https://docs.snowflake.com/en/user-guide/storage-management/storage-lifecycle-policies.md
section: User Guide
---

# Storage lifecycle policies

A *storage lifecycle policy* is a schema-level object that automatically manages the data lifecycle for
standard and interactive Snowflake tables.
Use these policies to archive or expire specific table rows that are based on conditions that you define, such as data age or other criteria.
Snowflake automatically executes these policies daily by using shared compute resources.

## How storage lifecycle policies work

To get started with storage lifecycle policies, complete the following steps:

1. [Create a policy](storage-lifecycle-policies-create-manage.md) with an expression that identifies rows to archive or expire.
2. [Attach the policy to one or more tables](storage-lifecycle-policies-create-manage.md).

After you attach a storage lifecycle policy to a table, Snowflake waits approximately 24 hours before running the policy for the first time.
Following this initial delay, Snowflake automatically runs the policy daily by using shared compute resources to identify
and process rows that meet your defined conditions.

When the policy runs, it checks each row against your expression, and then either archives the data to
COOL or COLD storage or expires the data, which deletes it permanently. You can retrieve archived data by using the
[CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md) command before expiration occurs. Snowflake waits until the
specified archive period elapses before expiring the data from archive storage.

### Key capabilities

Storage lifecycle policies provide the following benefits for managing your Snowflake data.

Reduced storage costs
:   Storage lifecycle policies help optimize costs by automatically moving older data to
    more cost-effective archival tiers.
    For data that must be retained long-term but
    accessed infrequently, archival storage can significantly reduce storage costs compared
    to standard storage tiers.

Regulatory compliance
:   Automatically meet compliance requirements by configuring policies to archive or expire data according to regulatory standards.
    You can archive data for a specific time before expiration, or expire it directly without archiving.
    This ensures that your data management follows your organization’s governance standards.

Simple data management
:   Storage lifecycle policies eliminate manual data management tasks by automatically executing
    archival and expiration rules. For more information, see [Monitor storage lifecycle policies](storage-lifecycle-policies-monitoring.md).

Flexible data retrieval
:   [Retrieve archived data](storage-lifecycle-policies-retrieving-archived-data.md)
    with precision by creating a new table that contains only the
    rows you need. Use a simple command with a WHERE clause to specify exactly which
    archived data to restore.

## Archive storage tiers

Snowflake supports archiving data in the following storage tiers:

| Archive tier | Description |
| --- | --- |
| COOL | Offers fast retrieval time, so data is readily available. The minimum archival period is 90 days. |
| COLD | Offers greater cost savings than the COOL tier; it is four times less expensive. The minimum archival period is 180 days. Compared to the COOL tier, COLD has a longer data retrieval time, which is up to 48 hours. Data retrieval operations from the COLD storage tier support a maximum of 1 million files per restore operation. |

### Choosing an archive tier

When you select an archive tier, consider the following factors:

* **Archiving costs**: The one-time cost to archive data is the same for both tiers.
* **Storage costs**: COLD tier storage is less expensive than COOL tier storage.
* **Retrieval costs**: COLD tier data retrieval is less expensive than COOL tier retrieval.
* **Retrieval time**: The COOL storage tier offers instant data retrieval, whereas COLD tier retrieval can take up to 48 hours.

> **Important:**
>
> If you attach an archival storage policy to a table, the table is permanently assigned to the specified archive tier for its lifetime. You can’t change the archive tier by applying a new policy. For example, you can’t specify a policy created with a COOL archive tier in ALTER TABLE…DROP STORAGE LIFECYCLE POLICY and then subsequently alter the table to add a policy created with a COLD archive tier. To alter the archive tier for a table, contact Snowflake Support to request deletion of the currently archived data. For additional considerations, see Archival storage policies.

For detailed pricing information, see
tables 3(e) and 4(f) in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

For more information about archiving data, see [Create a storage lifecycle policy](storage-lifecycle-policies-create-manage.md)
and Archive storage considerations.

## Considerations

Consider the following information when you work with storage lifecycle policies.

### Cloud provider support

* **Expiration policies**: Supported for accounts hosted on all cloud providers: Amazon Web Services (AWS), Microsoft Azure, and Google Cloud.
* **Archival policies**:

  + COOL tier: Available for accounts hosted on AWS and Microsoft Azure.
  + COLD tier: Available for accounts hosted on AWS only.

### Supported tables and features

* **Supported tables**: Storage lifecycle policies are supported for standard Snowflake tables, and for
  [interactive tables](../interactive.md) that don’t auto-refresh.
  To evaluate and apply storage lifecycle policy expressions, Snowflake internally and temporarily bypasses any governance policies on a table.
* **Replication**:

  + Snowflake replicates storage lifecycle policies and their associations with tables to target accounts, but doesn’t run the policies.
  + Snowflake doesn’t replicate archived data in the COOL or COLD tiers. After failover,
    archived data in your source account isn’t available in the target account.
  + After failover to a target account, Snowflake pauses storage lifecycle policy execution in the original primary account. After failback to the original primary account, Snowflake resumes policy execution.
  + Snowflake never automatically runs secondary storage lifecycle policies on secondary tables, even after failover. However, you can use secondary policies in a target account by attaching them to *new* tables. For those new tables, Snowflake runs the policies.
* **Cloning**: Snowflake doesn’t automatically apply storage lifecycle policies to cloned tables. If you apply a storage lifecycle policy to
  a table in a clone group, Snowflake archives rows only from that specific table. The policy doesn’t affect clones. This creates copies of the data in both the standard and archive
  storage tiers, and you pay for storage in each tier. For cost information, see [Billing for storage lifecycle policies](storage-lifecycle-policies-billing.md).
* **Unsupported features**

  Storage lifecycle policies aren’t supported for the following features:

  + All object types other than regular Snowflake tables, dynamic tables, and interactive tables that
    don’t auto-refresh.
  + Write once read many (WORM) snapshots, which are immutable snapshots that can’t be modified after creation.
  + Both provider and consumer tables shared through Snowflake data sharing.
  + Native Apps.
  + User-defined functions (UDFs) with external access and external functions.
  + Python, Java, or Scala UDFs.
  + [Row timestamps](../data-engineering/row-timestamps.md).

### Policy behavior and execution

Storage lifecycle policies use performance guidelines that are similar to
[guidelines for row-level access policies](../security-row-intro.md),
and operate automatically with the following characteristics:

* When you attach a storage lifecycle policy to a table, Snowflake waits approximately 24 hours before running it for the first time.
* Snowflake runs storage lifecycle policies every day by using shared compute resources. For information about cost
  for storage lifecycle policies, see
  [Billing for storage lifecycle policies](storage-lifecycle-policies-billing.md).
* To prevent excessively long archive or expiration runs, Snowflake processes large data operations incrementally in smaller chunks.
  A large operation might not complete in one daily run and might instead complete across multiple daily runs.
* When a storage lifecycle policy is running on a table, Snowflake locks UPDATE, DELETE, and MERGE operations.
  You can still perform INSERT and COPY operations during this time. For more information,
  see [Resource locking](../../sql-reference/transactions.md).

### Archival storage policies

Consider the following information when you work with tables that have an archival storage lifecycle policy attached:

* **Accessing archived data**: After Snowflake archives rows, you can’t query them directly. To access them, use
  the [CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md) command
  to create a new table with a copy of the archived data. For more information, see
  [Retrieving archived data](storage-lifecycle-policies-retrieving-archived-data.md).
* **Viewing archive metadata**: To view information about archived data (such as row count and column
  min/max values) without incurring retrieval costs, use the
  [SYSTEM$GET_TABLE_ARCHIVE_METADATA](../../sql-reference/functions/system_get_table_archive_metadata.md)
  function.
* **Security**: You can use Tri-Secret Secure ([TSS](../security-encryption-tss.md)) to protect archived data with regular key rotation.
* **Rekeying**: Snowflake doesn’t rekey archived data. If you suspect a key compromise, perform the following steps:

  1. Retrieve the archived data to a new table with the [CREATE TABLE … FROM ARCHIVE OF](../../sql-reference/sql/create-table.md)
     command.
  2. Archive data in the new table when needed.

     Each table has its own encryption key, so the new table effectively uses a new key.
  3. Drop the archive of the original table in which the keys were compromised.
* **Archive tier limitations**:

  + You can’t change the archive tier for a policy from COOL to COLD or from COLD to COOL. Create a new policy instead. For instructions, see [Recreate a storage lifecycle policy](storage-lifecycle-policies-create-manage.md).
  + A table can only use one archive tier *for its lifetime*. For example, you can’t attach a policy that uses a COLD archive tier to a table that already uses a COOL archive tier or vice versa. In addition, you can’t alter a table to drop a policy and then subsequently attach a policy that specifies a different archive tier.
* **Removing policies**: When you remove a policy from a table, the archived data remains in archive storage and can still be retrieved.
* **Dropping or truncating a table**:

  + Truncating a table doesn’t affect archived data for that table. You can still retrieve data from archive storage after truncating the table.
  + When you use [UNDROP TABLE](../../sql-reference/sql/undrop-table.md) to restore a table in an applicable
    [Time Travel data retention period](../data-time-travel.md), Snowflake also restores any data in archive storage.
  + When a table is within the [Fail-safe](../data-failsafe.md) period, the data in archive storage might be recoverable
    by using Fail-safe data recovery steps through Snowflake Support.
  + Table data in archive storage that you delete before the ARCHIVE_FOR_DAYS period has elapsed is subject to storage cost.
    For more information, see [Minimum storage duration charges](storage-lifecycle-policies-billing.md).

---
title: Stream examples
source: https://docs.snowflake.com/en/user-guide/streams-examples.md
section: User Guide
---

# Stream examples

This topic provides practical examples of use cases for streams on objects.

## Streams on tables

### Basic example

The following example shows how the contents of a stream change as DML statements execute on the source table:

```sqlexample
-- Create a table to store the names and fees paid by members of a gym
CREATE OR REPLACE TABLE members (
  id number(8) NOT NULL,
  name varchar(255) default NULL,
  fee number(3) NULL
);

-- Create a stream to track changes to date in the MEMBERS table
CREATE OR REPLACE STREAM member_check ON TABLE members;

-- Create a table to store the dates when gym members joined
CREATE OR REPLACE TABLE signup (
  id number(8),
  dt DATE
  );

INSERT INTO members (id,name,fee)
VALUES
(1,'Joe',0),
(2,'Jane',0),
(3,'George',0),
(4,'Betty',0),
(5,'Sally',0);

INSERT INTO signup
VALUES
(1,'2018-01-01'),
(2,'2018-02-15'),
(3,'2018-05-01'),
(4,'2018-07-16'),
(5,'2018-08-21');

-- The stream records the inserted rows
SELECT * FROM member_check;

+----+--------+-----+-----------------+-------------------+------------------------------------------+
| ID | NAME   | FEE | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+--------+-----+-----------------+-------------------+------------------------------------------|
|  1 | Joe    |   0 | INSERT          | False             | d200504bf3049a7d515214408d9a804fd03b46cd |
|  2 | Jane   |   0 | INSERT          | False             | d0a551cecbee0f9ad2b8a9e81bcc33b15a525a1e |
|  3 | George |   0 | INSERT          | False             | b98ad609fffdd6f00369485a896c52ca93b92b1f |
|  4 | Betty  |   0 | INSERT          | False             | e554e6e68293a51d8e69d68e9b6be991453cc901 |
|  5 | Sally  |   0 | INSERT          | False             | c94366cf8a4270cf299b049af68a04401c13976d |
+----+--------+-----+-----------------+-------------------+------------------------------------------+

-- Apply a $90 fee to members who joined the gym after a free trial period ended:
MERGE INTO members m
  USING (
    SELECT id, dt
    FROM signup s
    WHERE DATEDIFF(day, '2018-08-15'::date, s.dt::DATE) < -30) s
    ON m.id = s.id
  WHEN MATCHED THEN UPDATE SET m.fee = 90;

SELECT * FROM members;

+----+--------+-----+
| ID | NAME   | FEE |
|----+--------+-----|
|  1 | Joe    |  90 |
|  2 | Jane   |  90 |
|  3 | George |  90 |
|  4 | Betty  |   0 |
|  5 | Sally  |   0 |
+----+--------+-----+

-- The stream records the updated FEE column as a set of inserts
-- rather than deletes and inserts because the stream contents
-- have not been consumed yet
SELECT * FROM member_check;

+----+--------+-----+-----------------+-------------------+------------------------------------------+
| ID | NAME   | FEE | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+--------+-----+-----------------+-------------------+------------------------------------------|
|  1 | Joe    |  90 | INSERT          | False             | 957e84b34ef0f3d957470e02bddccb027810892c |
|  2 | Jane   |  90 | INSERT          | False             | b00168a4edb9fb399dd5cc015e5f78cbea158956 |
|  3 | George |  90 | INSERT          | False             | 75206259362a7c89126b7cb039371a39d821f76a |
|  4 | Betty  |   0 | INSERT          | False             | 9b225bc2612d5e57b775feea01dd04a32ce2ad18 |
|  5 | Sally  |   0 | INSERT          | False             | 5a68f6296c975980fbbc569ce01033c192168eca |
+----+--------+-----+-----------------+-------------------+------------------------------------------+

-- Create a table to store member details in production
CREATE OR REPLACE TABLE members_prod (
  id number(8) NOT NULL,
  name varchar(255) default NULL,
  fee number(3) NULL
);

-- Insert the first batch of stream data into the production table
INSERT INTO members_prod(id,name,fee) SELECT id, name, fee FROM member_check WHERE METADATA$ACTION = 'INSERT';

-- The stream position is advanced
select * from member_check;

+----+------+-----+-----------------+-------------------+-----------------+
| ID | NAME | FEE | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID |
|----+------+-----+-----------------+-------------------+-----------------|
+----+------+-----+-----------------+-------------------+-----------------+

-- Access and lock the stream
BEGIN;

-- Increase the fee paid by paying members
UPDATE members SET fee = fee + 15 where fee > 0;

+------------------------+-------------------------------------+
| number of rows updated | number of multi-joined rows updated |
|------------------------+-------------------------------------|
|                      3 |                                   0 |
+------------------------+-------------------------------------+

-- These changes are not visible because the change interval of the stream object starts at the current offset and ends at the current
-- transactional time point, which is the beginning time of the transaction
SELECT * FROM member_check;

+----+------+-----+-----------------+-------------------+-----------------+
| ID | NAME | FEE | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID |
|----+------+-----+-----------------+-------------------+-----------------|
+----+------+-----+-----------------+-------------------+-----------------+

-- Commit changes
COMMIT;

-- The changes surface now because the stream object uses the current transactional time as the end point of the change interval that now
-- includes the changes in the source table
SELECT * FROM member_check;

+----+--------+-----+-----------------+-------------------+------------------------------------------+
| ID | NAME   | FEE | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+--------+-----+-----------------+-------------------+------------------------------------------|
|  1 | Joe    | 105 | INSERT          | True              | 123a45b67cd0e8f012345g01abcdef012345678a |
|  2 | Jane   | 105 | INSERT          | True              | 456b45b67cd1e8f123456g01ghijkl123456779b |
|  3 | George | 105 | INSERT          | True              | 567890c89de2f9g765438j20jklmn0234567890d |
|  1 | Joe    |  90 | DELETE          | True              | 123a45b67cd0e8f012345g01abcdef012345678a |
|  2 | Jane   |  90 | DELETE          | True              | 456b45b67cd1e8f123456g01ghijkl123456779b |
|  3 | George |  90 | DELETE          | True              | 567890c89de2f9g765438j20jklmn0234567890d |
+----+--------+-----+-----------------+-------------------+------------------------------------------+
```

### Differences between standard and append-only streams

The following example shows the differences in behavior between standard (delta) and append-only streams:

```sqlexample
-- Create a source table.
create or replace table t(id int, name string);

-- Create a standard stream on the source table.
create or replace  stream delta_s on table t;

-- Create an append-only stream on the source table.
create or replace  stream append_only_s on table t append_only=true;

-- Insert 3 rows into the source table.
insert into t values (0, 'charlie brown');
insert into t values (1, 'lucy');
insert into t values (2, 'linus');

-- Delete 1 of the 3 rows.
delete from t where id = '0';

-- The standard stream removes the deleted row.
select * from delta_s order by id;

+----+-------+-----------------+-------------------+------------------------------------------+
| ID | NAME  | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+-------+-----------------+-------------------+------------------------------------------|
|  1 | lucy  | INSERT          | False             | 7b12c9ee7af9245497a27ac4909e4aa97f126b50 |
|  2 | linus | INSERT          | False             | 461cd468d8cc2b0bd11e1e3c0d5f1133ac763d39 |
+----+-------+-----------------+-------------------+------------------------------------------+

-- The append-only stream does not remove the deleted row.
select * from append_only_s order by id;

+----+---------------+-----------------+-------------------+------------------------------------------+
| ID | NAME          | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+---------------+-----------------+-------------------+------------------------------------------|
|  0 | charlie brown | INSERT          | False             | e83abf629af50ccf94d1e78c547bfd8079e68d00 |
|  1 | lucy          | INSERT          | False             | 7b12c9ee7af9245497a27ac4909e4aa97f126b50 |
|  2 | linus         | INSERT          | False             | 461cd468d8cc2b0bd11e1e3c0d5f1133ac763d39 |
+----+---------------+-----------------+-------------------+------------------------------------------+

-- Create a table to store the change data capture records in each of the streams.
create or replace  table t2(id int, name string, stream_type string default NULL);

-- Insert the records from the streams into the new table, advancing the offset of each stream.
insert into t2(id,name,stream_type) select id, name, 'delta stream' from delta_s;
insert into t2(id,name,stream_type) select id, name, 'append_only stream' from append_only_s;

-- Update a row in the source table.
update t set name = 'sally' where name = 'linus';

-- The standard stream records the update operation.
select * from delta_s order by id;

+----+-------+-----------------+-------------------+------------------------------------------+
| ID | NAME  | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|----+-------+-----------------+-------------------+------------------------------------------|
|  2 | sally | INSERT          | True              | 461cd468d8cc2b0bd11e1e3c0d5f1133ac763d39 |
|  2 | linus | DELETE          | True              | 461cd468d8cc2b0bd11e1e3c0d5f1133ac763d39 |
+----+-------+-----------------+-------------------+------------------------------------------+

-- The append-only stream does not record the update operation.
select * from append_only_s order by id;

+----+------+-----------------+-------------------+-----------------+
| ID | NAME | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID |
|----+------+-----------------+-------------------+-----------------|
+----+------+-----------------+-------------------+-----------------+
```

The following example shows how streams can be used in ELT (extract, load, transform) processes. In this example, new data inserted into a
staging table is tracked by a stream. A set of SQL statements transform and insert the stream contents into a set of production tables:

### DML operations in explicit transactions

```sqlexample
-- Create a staging table that stores raw JSON data
CREATE OR REPLACE TABLE data_staging (
  raw variant);

-- Create a stream on the staging table
CREATE OR REPLACE STREAM data_check ON TABLE data_staging;

-- Create 2 production tables to store transformed
-- JSON data in relational columns
CREATE OR REPLACE TABLE data_prod1 (
    id number(8),
    ts TIMESTAMP_TZ
    );

CREATE OR REPLACE TABLE data_prod2 (
    id number(8),
    color VARCHAR,
    num NUMBER
    );

-- Load JSON data into staging table
-- using COPY statement, Snowpipe,
-- or inserts

SELECT * FROM data_staging;

+--------------------------------------+
| RAW                                  |
|--------------------------------------|
| {                                    |
|   "id": 7077,                        |
|   "x1": "2018-08-14T20:57:01-07:00", |
|   "x2": [                            |
|     {                                |
|       "y1": "green",                 |
|       "y2": "35"                     |
|     }                                |
|   ]                                  |
| }                                    |
| {                                    |
|   "id": 7078,                        |
|   "x1": "2018-08-14T21:07:26-07:00", |
|   "x2": [                            |
|     {                                |
|       "y1": "cyan",                  |
|       "y2": "107"                    |
|     }                                |
|   ]                                  |
| }                                    |
+--------------------------------------+

--  Stream table shows inserted data
SELECT * FROM data_check;

+--------------------------------------+-----------------+-------------------+------------------------------------------+
| RAW                                  | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
|--------------------------------------+-----------------+-------------------|------------------------------------------|
| {                                    | INSERT          | False             | 789012e01ef4j3k890123k35mnopqr567890124j |
|   "id": 7077,                        |                 |                   |                                          |
|   "x1": "2018-08-14T20:57:01-07:00", |                 |                   |                                          |
|   "x2": [                            |                 |                   |                                          |
|     {                                |                 |                   |                                          |
|       "y1": "green",                 |                 |                   |                                          |
|       "y2": "35"                     |                 |                   |                                          |
|     }                                |                 |                   |                                          |
|   ]                                  |                 |                   |                                          |
| }                                    |                 |                   |                                          |
| {                                    | INSERT          | False             | 765432u89tk3l6y456789012rst7vx678912456k |
|   "id": 7078,                        |                 |                   |                                          |
|   "x1": "2018-08-14T21:07:26-07:00", |                 |                   |                                          |
|   "x2": [                            |                 |                   |                                          |
|     {                                |                 |                   |                                          |
|       "y1": "cyan",                  |                 |                   |                                          |
|       "y2": "107"                    |                 |                   |                                          |
|     }                                |                 |                   |                                          |
|   ]                                  |                 |                   |                                          |
| }                                    |                 |                   |                                          |
+--------------------------------------+-----------------+-------------------+------------------------------------------+

-- Access and lock the stream
BEGIN;

-- Transform and copy JSON elements into relational columns
-- in the production tables
INSERT INTO data_prod1 (id, ts)
SELECT t.raw:id, to_timestamp_tz(t.raw:x1)
FROM data_check t
WHERE METADATA$ACTION = 'INSERT';

INSERT INTO data_prod2 (id, color, num)
SELECT t.raw:id, f.value:y1, f.value:y2
FROM data_check t
, lateral flatten(input => raw:x2) f
WHERE METADATA$ACTION = 'INSERT';

-- Commit changes in the stream objects participating in the transaction
COMMIT;

SELECT * FROM data_prod1;

+------+---------------------------+
|   ID | TS                        |
|------+---------------------------|
| 7077 | 2018-08-14 20:57:01 -0700 |
| 7078 | 2018-08-14 21:07:26 -0700 |
+------+---------------------------+

SELECT * FROM data_prod2;

+------+-------+-----+
|   ID | COLOR | NUM |
|------+-------+-----|
| 7077 | green |  35 |
| 7078 | cyan  | 107 |
+------+-------+-----+

SELECT * FROM data_check;

+-----+-----------------+-------------------+
| RAW | METADATA$ACTION | METADATA$ISUPDATE |
|-----+-----------------+-------------------|
+-----+-----------------+-------------------+
```

## Streams on views

### Stream on a view with multi-table joins

```sqlexample
-- Create multiple tables with matching column values.
CREATE TABLE birds (
  id number,
  common varchar(100),
  class varchar(100)
);

CREATE TABLE sightings (
  d date,
  loc varchar(100),
  b_id number,
  c number
);

-- Create a view that queries the tables with a join.
CREATE VIEW bird_sightings AS
SELECT b.id AS id,
       b.common AS common_name,
       b.class AS classification,
       s.d AS date,
       s.loc AS location,
       s.c AS count
FROM birds b
INNER JOIN sightings s ON b.id = s.b_id;

-- Create a stream on the view.
CREATE STREAM bird_sightings_s ON VIEW bird_sightings;

-- Insert values into the tables.
INSERT INTO birds
VALUES
    (1,'Scarlet Tanager','P. olivacea'),
    (14,'Mallard','A. platyrhynchos'),
    (48,'Spotted Sandpiper','A. macularius'),
    (92,'Great Blue Heron','A. herodias');

INSERT INTO sightings
VALUES
    (current_date(),'Gibson Island',1,4),
    (current_date(),'Lake Los Pajaro',14,12),
    (current_date(),'Lake Los Pajaro',92,12),
    (current_date(),'Gibson Island',14,21),
    (current_date(),'Gibson Island',92,5);

-- Query the stream.
-- The stream displays a record for each row added to the view.
SELECT * FROM bird_sightings_s;

+----+------------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------+
| ID | COMMON_NAME      | CLASSIFICATION   | DATE       | LOCATION        | COUNT | METADATA$ROW_ID                          | METADATA$ACTION | METADATA$ISUPDATE |
|----+------------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------|
|  1 | Scarlet Tanager  | P. olivacea      | 2021-09-07 | Gibson Island   |     4 | a2522b47726ac2a922104c8e2f668d065ff6fcd0 | INSERT          | False             |
| 14 | Mallard          | A. platyrhynchos | 2021-09-07 | Lake Los Pajaro |    12 | fceb4ad5cb6d2df2865d0f572b8a2aa98f240b70 | INSERT          | False             |
| 92 | Great Blue Heron | A. herodias      | 2021-09-07 | Lake Los Pajaro |    12 | 0db99176fe8bd50749b2b48fb2befab416ff9272 | INSERT          | False             |
| 14 | Mallard          | A. platyrhynchos | 2021-09-07 | Gibson Island   |    21 | 2e94ef3a33e52ba5de5d816dc41c60fedf9cb1eb | INSERT          | False             |
| 92 | Great Blue Heron | A. herodias      | 2021-09-07 | Gibson Island   |     5 | a1df477ac8e388e1cf0ada77e9097c6effa346a7 | INSERT          | False             |
+----+------------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------+

-- Consume the stream records in a DML statement (INSERT, MERGE, etc.).

-- Query the stream.
-- The stream is empty.
+----+-------------+----------------+------+----------+-------+-----------------+-----------------+-------------------+
| ID | COMMON_NAME | CLASSIFICATION | DATE | LOCATION | COUNT | METADATA$ROW_ID | METADATA$ACTION | METADATA$ISUPDATE |
|----+-------------+----------------+------+----------+-------+-----------------+-----------------+-------------------|
+----+-------------+----------------+------+----------+-------+-----------------+-----------------+-------------------+

-- Delete a row from the birds table.
DELETE FROM birds WHERE id = 14;

-- Query the stream.
-- The stream displays two records for the single DELETE operation.
SELECT * FROM bird_sightings_s;

+----+-------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------+
| ID | COMMON_NAME | CLASSIFICATION   | DATE       | LOCATION        | COUNT | METADATA$ROW_ID                          | METADATA$ACTION | METADATA$ISUPDATE |
|----+-------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------|
| 14 | Mallard     | A. platyrhynchos | 2021-09-07 | Lake Los Pajaro |    12 | 83c22ff4be80d65a2e9776df0e35b22079cb4430 | DELETE          | False             |
| 14 | Mallard     | A. platyrhynchos | 2021-09-07 | Gibson Island   |    21 | e29cfae8c3c7d261ed903c2303f61e4d49c01ba1 | DELETE          | False             |
+----+-------------+------------------+------------+-----------------+-------+------------------------------------------+-----------------+-------------------+
```

### Stream on a view that calls a non-deterministic SQL function

```sqlexample
-- Create a table.
CREATE TABLE ndf (
  c1 number
);

-- Create a view that queries the table and
-- also returns the CURRENT_USER and CURRENT_TIMESTAMP values
-- for the query transaction.
CREATE VIEW ndf_v AS
SELECT CURRENT_USER() AS u,
       CURRENT_TIMESTAMP() AS ts,
       c1 AS num
FROM ndf;

-- Create a stream on the view.
CREATE STREAM ndf_s ON VIEW ndf_v;

-- User peter inserts rows into table ndf.
INSERT INTO ndf
VALUES
    (1),
    (2),
    (3);

-- User marie inserts rows into table ndf.
INSERT INTO ndf
VALUES
    (4),
    (5),
    (6);

-- User PETER queries the stream.
-- The stream returns the username for the user.
-- The stream also returns the current timestamp for the query transaction in each row,
-- NOT the timestamp when each row was inserted.
SELECT * FROM ndf_s;

+-------+-------------------------------+-----+-----------------+------------------------------------------+
| U     | TS                            | NUM | METADATA$ACTION | METADATA$ROW_ID                          |
|-------+-------------------------------+-----+-----------------+------------------------------------------|
| PETER | 2021-08-16 11:56:33.778 -0700 |   1 | INSERT          | d200504bf3049a7d515214408d9a804fd03b46cd |
| PETER | 2021-08-16 11:56:33.778 -0700 |   2 | INSERT          | d0a551cecbee0f9ad2b8a9e81bcc33b15a525a1e |
| PETER | 2021-08-16 11:56:33.778 -0700 |   3 | INSERT          | b98ad609fffdd6f00369485a896c52ca93b92b1f |
| PETER | 2021-08-16 11:56:33.778 -0700 |   4 | INSERT          | 62d34abc3fac85c037fb9f47f7758f08d025d9ed |
| PETER | 2021-08-16 11:56:33.778 -0700 |   5 | INSERT          | e554e6e68293a51d8e69d68e9b6be991453cc901 |
| PETER | 2021-08-16 11:56:33.778 -0700 |   6 | INSERT          | f6fa32c498a28b2349d2c6f6be55c30eb1d5310f |
+-------+-------------------------------+-----+-----------------+------------------------------------------+

-- User MARIE queries the stream.
-- The stream returns the username for the user
-- and the current timestamp for the query transaction in each row.
SELECT * FROM ndf_s;
+-------+-------------------------------+-----+-----------------+------------------------------------------+
| U     | TS                            | NUM | METADATA$ACTION | METADATA$ROW_ID                          |
|-------+-------------------------------+-----+-----------------+------------------------------------------|
| MARIE | 2021-08-16 12:04:21.768 -0700 |   1 | INSERT          | d200504bf3049a7d515214408d9a804fd03b46cd |
| MARIE | 2021-08-16 12:04:21.768 -0700 |   2 | INSERT          | d0a551cecbee0f9ad2b8a9e81bcc33b15a525a1e |
| MARIE | 2021-08-16 12:04:21.768 -0700 |   3 | INSERT          | b98ad609fffdd6f00369485a896c52ca93b92b1f |
| MARIE | 2021-08-16 12:04:21.768 -0700 |   4 | INSERT          | 62d34abc3fac85c037fb9f47f7758f08d025d9ed |
| MARIE | 2021-08-16 12:04:21.768 -0700 |   5 | INSERT          | e554e6e68293a51d8e69d68e9b6be991453cc901 |
| MARIE | 2021-08-16 12:04:21.768 -0700 |   6 | INSERT          | f6fa32c498a28b2349d2c6f6be55c30eb1d5310f |
+-------+-------------------------------+-----+-----------------+------------------------------------------+
```

---
title: Strong Authentication Hub
source: https://docs.snowflake.com/en/user-guide/strong-authentication-hub.md
section: User Guide
---

# Strong Authentication Hub

The Strong Authentication Hub is a user interface that helps you implement strong authentication for all of your users. It is an essential
tool for meeting the deadlines associated with the [deprecation of single-factor passwords](security-mfa-rollout.md).

The hub identifies users who don’t meet Snowflake requirements for strong authentication and provides step-by-step instructions to bring
these users into conformance.

## What is the Strong Authentication Hub?

The Strong Authentication Hub helps your account conform to Snowflake’s strong authentication requirements by providing the following
capabilities:

Provide visibility
:   Gives you clarity on your account’s readiness for the multi-factor authentication (MFA) enforcement and the deprecation of
    `LEGACY_SERVICE` users, with a clear path to ensure 100% compliance before enforcement deadlines.

Identify risks
:   Identifies specific authentication issues and at-risk users, including:

    * Users who have logged in using only a password via specific applications (for example, Power BI) within the last 90 days.
    * Users who have a password but aren’t enrolled in MFA, and who have not logged in in the past 90 days.
    * Service users (`TYPE=LEGACY_SERVICE`) that need to migrate to stronger authentication methods and be converted to `TYPE=SERVICE`
      users.
    * Users who have logged in using a strong authentication method like federated authentication with single sign-on, but still have a
      password that is not protected by MFA. If a malicious actor obtained the password, they could sign in and enable MFA for themselves.

Manage remediation
:   Provides progress tracking, step-by-step remediation guidance, and the ability to prioritize by issue type or by individual user.

Manage enforcement timelines
:   Displays the rollout timeline for enforcement phases and allows you to extend enforcement dates if needed.

## Access the hub and remediate users

To meet the requirement for strong authentication, every human user who uses a password must be enrolled in MFA and
every service user must use an authentication method that is stronger than a password. Use the Strong Authentication Hub to find users who
don’t meet these requirements and bring them into conformance.

> **Note:**
>
> Results and violations displayed in the hub are based on Trust Center scanner results that update periodically. Changes you make to
> remediate users might not be reflected immediately.

To access the Strong Authentication Hub and remediate users:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with the required access control privileges.
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Overview tab.
5. In the Strong authentication section, use the Strong authentication progress tile to determine whether you have users who
   aren’t ready for the enforcement of strong authentication.
6. If you have users who need to be migrated to a stronger authentication method, select View hub.
7. Find the Prioritize your remediation efforts section and do one of the following:

   * If you choose to prioritize by issue, select By issue, and then select the card that corresponds to the problem that you want to
     remediate.
   * If you choose to prioritize by user, select By user, and then select the user that you want to remediate.
8. In the side panel, follow the instructions on how to migrate users to a strong authentication method that meets Snowflake requirements.

## Access control requirements

To use the Strong Authentication Hub, you need the following privileges/roles:

| Task | Required privilege/role | Notes |
| --- | --- | --- |
| View the hub | One of the following application roles:   * `SNOWFLAKE.TRUST_CENTER_ADMIN` application role. * `SNOWFLAKE.TRUST_CENTER_VIEWER` application role. | The ACCOUNTADMIN role meets this requirement. |
| Extend enforcement dates | MODIFY privilege on the account. | The ACCOUNTADMIN role meets this requirement. |

---
title: Summary of data loading features
source: https://docs.snowflake.com/en/user-guide/intro-summary-loading.md
section: User Guide
---

# Summary of data loading features

This topic provides a quick-reference of the supported features for using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command to load data from files
into Snowflake tables.

## Data file details

The following table describes the general details for the files used to load data:

| Feature | Supported | Notes |
| --- | --- | --- |
| Location of files | Local environment | Files are first copied (“staged”) to an internal (Snowflake) stage, then loaded into a table. |
|  | Amazon S3 | Files can be loaded directly from any user-supplied bucket. |
|  | Google Cloud Storage | Files can be loaded directly from any user-supplied bucket. |
|  | Microsoft Azure cloud storage   * Blob storage * Data Lake Storage Gen2 * General-purpose v1 * General-purpose v2 | Files can be loaded directly from any user-supplied container. |
| File formats | Delimited files (CSV, TSV, etc.) | Any valid delimiter is supported; default is comma (i.e. CSV). |
|  | [Semi-structured formats](semistructured-intro.md)   * JSON * Avro * ORC * Parquet * XML |  |
|  | [Unstructured formats](unstructured-intro.md) |  |
| File encoding | File format-specific | For delimited files (CSV, TSV, etc.), the default character set is UTF-8. To use any other characters sets, you must explicitly specify the encoding to use for loading. For the list of supported character sets, see Supported Character Sets for Delimited Files (in this topic). |
|  |  | For semi-structured file formats (JSON, Avro, etc.), the only supported character set is UTF-8. |
|  |  | Snowflake doesn’t support loading data from tar (tape archive) files. |

### Supported character sets for delimited files

The following table lists the encoding character sets supported for loading data from delimited files (CSV, TSV, etc.):

| Character Set | `ENCODING` Value | Supported Languages | Notes |
| --- | --- | --- | --- |
| Big5 | `BIG5` | Traditional Chinese |  |
| EUC-JP | `EUCJP` | Japanese |  |
| EUC-KR | `EUCKR` | Korean |  |
| GB18030 | `GB18030` | Chinese |  |
| IBM420 | `IBM420` | Arabic |  |
| IBM424 | `IBM424` | Hebrew |  |
| IBM949 | `IBM949` | Korean |  |
| ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
| ISO-2022-JP | `ISO2022JP` | Japanese |  |
| ISO-2022-KR | `ISO2022KR` | Korean |  |
| ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
| ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
| ISO-8859-5 | `ISO88595` | Russian |  |
| ISO-8859-6 | `ISO88596` | Arabic |  |
| ISO-8859-7 | `ISO88597` | Greek |  |
| ISO-8859-8 | `ISO88598` | Hebrew |  |
| ISO-8859-9 | `ISO88599` | Turkish |  |
| ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
| KOI8-R | `KOI8R` | Russian |  |
| Shift_JIS | `SHIFTJIS` | Japanese |  |
| UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
| UTF-16 | `UTF16` | All languages |  |
| UTF-16BE | `UTF16BE` | All languages |  |
| UTF-16LE | `UTF16LE` | All languages |  |
| UTF-32 | `UTF32` | All languages |  |
| UTF-32BE | `UTF32BE` | All languages |  |
| UTF-32LE | `UTF32LE` | All languages |  |
| windows-874 | `WINDOWS874` | Thai |  |
| windows-949 | `WINDOWS949` | Korean |  |
| windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
| windows-1251 | `WINDOWS1251` | Russian |  |
| windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
| windows-1253 | `WINDOWS1253` | Greek |  |
| windows-1254 | `WINDOWS1254` | Turkish |  |
| windows-1255 | `WINDOWS1255` | Hebrew |  |
| windows-1256 | `WINDOWS1256` | Arabic |  |

## Compression of staged files

The following table describes how Snowflake handles compression of data files for loading. The options are different depending on whether the files are staged, uncompressed, or already-compressed:

| Feature | Supported | Notes |
| --- | --- | --- |
| Uncompressed files | gzip | When staging uncompressed files in a Snowflake stage, the files are automatically compressed using gzip, unless compression is explicitly disabled. |
| Already-compressed files | * gzip * bzip2 * deflate * raw_deflate * Brotli * Zstandard | Snowflake can automatically detect any of these compression methods, or you can explicitly specify the method that was used to compress the files.  Auto-detection isn’t supported for Brotli-compressed files; when staging or loading Brotli-compressed files, you must explicitly specify the compression method that was used.  Snowflake doesn’t support uploading compressed tar (tape archive) files. |

## Encryption of staged files

The following table describes how Snowflake handles encryption of data files for loading. The options are different depending on whether the files are staged
unencrypted or already-encrypted:

| Feature | Supported | Notes |
| --- | --- | --- |
| Unencrypted files | 128-bit or 256-bit keys | All files stored on internal stages for data loading and unloading operations are automatically encrypted using AES-256 strong encryption on the server side. By default, Snowflake provides additional client-side encryption with a 128-bit key (with the option to configure a 256-bit key). |
| Already-encrypted files | User-supplied key | Files that are already encrypted can be loaded into Snowflake from external cloud storage; the key used to encrypt the files must be provided to Snowflake. |

---
title: Summary of Data Unloading Features
source: https://docs.snowflake.com/en/user-guide/intro-summary-unloading.md
section: User Guide
---

# Summary of Data Unloading Features

This topic provides a quick-reference of the supported features for using the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from Snowflake
tables into flat files.

Note that some of the supported features, particularly compression and encryption, are dictated by whether you are unloading to a Snowflake internal location or an
external location (Amazon S3, Google Cloud Storage, or Microsoft Azure).

## Output Data File Details

The following table describes the general details for the output files generated by Snowflake when unloading data:

| Feature | Supported | Notes |
| --- | --- | --- |
| **Location of files** | Local files | Files are first unloaded to a Snowflake internal location, then can be downloaded locally using [GET](../sql-reference/sql/get.md). |
| Files in Amazon S3 | Files can be unloaded directly to any user-supplied bucket in S3, then can be downloaded locally using AWS utilities. |
| Files in Google Cloud Storage | Files can be unloaded directly to any user-supplied container in Cloud Storage, then can be downloaded locally using Cloud Storage utilities. |
| Files in Microsoft Azure | Files can be unloaded directly to any user-supplied container in Azure, then can be downloaded locally using Azure utilities. |
| **File formats** | Delimited files (CSV, TSV, etc.) | Any valid delimiter is supported; default is comma (i.e. CSV). |
| JSON |  |
| Parquet |  |
| **File encoding** | UTF-8 | Output files are always encoded using UTF-8, regardless of the file format; no other character sets are supported. |

> **Note:**
>
> Unloads running on machines with memory pressure may result in files of smaller size.

## Compression of Output Data Files

The following table describes how Snowflake handles compression for the output files generated by Snowflake when unloading data:

| Location of Files | Supported | Notes |
| --- | --- | --- |
| **Internal or external location** | gzip | By default, all unloaded data files are compressed using gzip, unless compression is explicitly disabled or one of the other supported compression methods is explicitly specified. |
| bzip2 |
| Brotli |
| Zstandard |

> **Note:**
>
> It is a known issue that we currently do not support setting CONTENT-ENCODING for Azure and Google Cloud Platform when `compression=gzip`.

## Encryption of Output Data Files

The following table describes how Snowflake handles encryption for the output files generated by Snowflake when unloading data. The options are different depending
on whether the files are unloaded to an internal location (i.e. Snowflake stage) or external location (Amazon S3, Google Cloud Storage, or Microsoft Azure):

| Location of Files | Supported | Notes |
| --- | --- | --- |
| **Internal location** | 128-bit or 256-bit keys | All data files unloaded to Snowflake internal locations are automatically encrypted using 128-bit keys. The files are unencrypted when they are downloaded to the local directory.  256-bit keys can be enabled (for stronger encryption); however, additional configuration is required. |
| **External location** | User-supplied key | Data files unloaded to cloud storage can be encrypted if a security key (for encrypting the files) is provided to Snowflake. |

---
title: Supported cloud platforms
source: https://docs.snowflake.com/en/user-guide/intro-cloud-platforms.md
section: User Guide
---

# Supported cloud platforms

Snowflake is provided as a self-managed service that runs completely on cloud infrastructure. This means that all three layers of
[Snowflake’s architecture](intro-key-concepts.md) (storage, compute, and cloud services) are deployed and managed entirely
on a selected cloud platform.

A Snowflake account can be hosted on any of the following cloud platforms:

* [Amazon Web Services (AWS)](https://aws.amazon.com/)
* [Google Cloud](https://cloud.google.com/)
* [Microsoft Azure (Azure)](https://azure.microsoft.com/en-us/)

On each platform, Snowflake provides one or more [regions](intro-regions.md) where the account is provisioned.

If your organization’s other cloud services are already hosted on one of these platforms, you can choose to host all your Snowflake
accounts on the same platform. However, you can also choose to host your accounts on a different platform.

> **Note:**
>
> The cloud platform you choose for each Snowflake account is completely independent from your other Snowflake accounts. In fact, you can choose to
> host each Snowflake account on a different platform, although this may have some impact on data transfer billing when loading data.

## Pricing

Differences in unit costs for credits and data storage are calculated by [region](intro-regions.md) on each cloud platform.
For more information about pricing as it pertains to a specific region and platform, see the [pricing page](http://www.snowflake.com/pricing)
(on the Snowflake website).

## Data Loading

Snowflake supports loading data from files staged in any of the following locations, regardless of the cloud platform for your Snowflake account:

* Internal (that is, Snowflake) stages
* Amazon S3
* Google Cloud Storage
* Microsoft Azure blob storage

Snowflake supports both bulk data loading and continuous data loading (Snowpipe). Likewise, Snowflake supports unloading data from tables into any of
the above staging locations.

For more information, see [Load data into Snowflake](../guides-overview-loading-data.md).

> **Note:**
>
> Some data transfer billing charges may apply when loading data from files staged across different platforms. For more information, see
> [Understanding data transfer cost](cost-understanding-data-transfer.md).

## HITRUST CSF Certification

This certification enhances Snowflake’s security posture in regulatory compliance and risk management, and applies to Snowflake editions
that are Business Critical (or higher). For more information, see [Snowflake Security and Trust Center](https://www.snowflake.com/product/security-and-trust-center/).

## Partner Applications

Many partner applications work with Snowflake accounts. For more information, refer to [Snowflake ecosystem](ecosystem.md).

## Current Limitations for Accounts on Google Cloud

We strive to provide the same Snowflake experience regardless of the cloud platform you choose for your account; however, some services and
features are currently unavailable (or have limited availability) for Snowflake accounts hosted on Google Cloud.

### Google Cloud Private Service Connect

See the [limitations](private-service-connect-google.md) section for using Google Cloud Private Service Connect and Snowflake.

Note that the following Snowflake system functions for self-service management aren’t supported currently for Google Cloud Private Service Connect for
your Snowflake account on Google Cloud:

* [SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS](../sql-reference/functions/system_get_privatelink_endpoint_registrations.md)
* [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_register_privatelink_endpoint.md)
* [SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT](../sql-reference/functions/system_unregister_privatelink_endpoint.md)

### Network Rules

[Network rules](network-rules.md) that use private service connect endpoints aren’t supported currently on Google Cloud.

### Private connectivity to key management services through Tri-Secret Secure

Private connectivity to key management services through Tri-Secret Secure isn’t supported currently on Google Cloud.

### Snowflake Open Catalog

[Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) is currently not available in government regions.

## Current Limitations for Accounts on AWS

We strive to provide the same Snowflake experience regardless of the cloud platform you choose for your account; however, some services and
features are currently unavailable (or have limited availability) for Snowflake accounts hosted on AWS.

### Access to External Network Locations

[Access to external network locations](../developer-guide/external-network-access/external-network-access-overview.md) from UDF and
procedure handler code isn’t supported currently in the Gov region.

### Snowflake Open Catalog

[Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) is currently not available in government regions.

## Current limitations for accounts on Azure

We strive to provide the same Snowflake experience regardless of the cloud platform you choose for your account; however, some services and
features are currently unavailable (or have limited availability) for Snowflake accounts hosted on Microsoft Azure.

### Azure Private Link

See [Azure Private Link Requirements and Limitations](privatelink-azure.md).

### Snowflake Clients

Currently, using the account name URL format for private connectivity to the Snowflake service with
[Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](snowsql.md), [connectors](connectors.md) and [drivers](../developer-guide/drivers.md) is not supported. As
a workaround, use the account locator format with Snowflake CLI, SnowSQL, connectors, and drivers.

For details, see:

* [Account identifiers](admin-account-identifier.md)
* [Connecting to your accounts](organizations-connect.md)

### Access to External Network Locations

[Access to external network locations](../developer-guide/external-network-access/external-network-access-overview.md) from UDF and
procedure handler code isn’t supported currently in the Gov region.

### Snowflake Open Catalog

[Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) is currently not available in government regions.

---
title: Supported cloud regions
source: https://docs.snowflake.com/en/user-guide/intro-regions.md
section: User Guide
---

# Supported cloud regions

Regions let your organization choose where your data is geographically stored across your regional, national, and international operations.
Regions also determine where your compute resources are provisioned.

Snowflake supports regions across all of the Snowflake-supported [cloud platforms](intro-cloud-platforms.md), grouped into
three global geographic segments (North/South America, Europe/Middle East/Africa, and Asia Pacific/China).

> **Important:**
>
> Each Snowflake account is hosted in a single region. If you wish to use Snowflake across multiple regions, you must maintain a Snowflake
> account in each of the desired regions.

> **Note:**
>
> For details about the cloud regions where your data can be hosted when using the Egress Cost Optimizer,
> see [Optimizing data transfer costs with Egress Cost Optimizer](../collaboration/provider-listings-auto-fulfillment-eco.md) documentation.

## North and South America

These regions are supported for organizations that prefer or require their data to be stored in the United States, Canada, or Brazil. Multiple
regions are provided to allow your organization to meet its individual compliance requirements for general purpose use.

Additional regions are provided in the United States for organizations that must comply with US government regulations.

### Commercial regions

Snowflake supports the following regions in North America (U.S., Canada, and Mexico) and South America (Brazil) for general commercial use:

| Cloud Platform | Cloud Region ID [1] | Region Name | Additional Notes |
| --- | --- | --- | --- |
| **Amazon Web Services (AWS)** | | | |
|  | ca-central-1 | Canada (Central) | Completed assessment and supports compliance with Canadian government’s CCCS Medium Cloud Control Security Profile (fka, Protected B, Medium Integrity, Medium Availability [PB/M/M]) |
| sa-east-1 | South America (Sao Paulo) |  |
| us-west-2 | US West (Oregon) | Also supports some U.S. government compliance. See U.S. regions supporting public sector workloads. |
| us-east-2 | US East (Ohio) |  |
| us-east-1 | US East (N. Virginia) | Also supports some U.S. government compliance. See U.S. regions supporting public sector workloads. |
| **Google** **Cloud** **Platform** **(GCP)** | | | |
|  | us-central1 | US Central1 (Iowa) |  |
| us-east4 | US East4 (N. Virginia) |  |
| **Microsoft Azure** | | | |
|  | canadacentral | Canada Central (Toronto) | Completed assessment and supports compliance with the CCCS Medium Cloud Control Security Profile (fka, Protected B, Medium Integrity, Medium Availability [PB/M/M]) |
| centralus | Central US (Iowa) |  |
| eastus | East US (Virginia) |  |
| eastus2 | East US 2 (Virginia) |  |
| mexicocentral | Mexico Central (Querétaro) |  |
| southcentralus | South Central US (Texas) | Also supports some U.S. government compliance (see next section for details). |
| westus2 | West US 2 (Washington) |  |

[1] See the map preceding this table for the location of each supported region, labeled by cloud region ID.

### U.S. regions supporting public sector workloads

Snowflake makes the following regions available to customers that require compliance with common U.S. Federal and state government
standards. These regions are only supported for Snowflake accounts on Business Critical Edition
(or higher).

> **Note:**
>
> If your Snowflake account is in a U.S. government region and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

U.S. commercial regions with some support for government standards
:   The following commercial regions meet the requirements for Snowflake’s compliance with the U.S. government standards set forth in the
    table below. By uploading workloads covered by the below compliance standards, customers agree to
    [Snowflake’s U.S. Government Commercial Compliance Addendum](https://www.snowflake.com/legal-gov/us-gov-commercial-compliance-addendum/).

    | Cloud Region ID | Region Name | Compliance standards |
    | --- | --- | --- |
    | **Amazon** **Web** **Services** |  |  |
    | us-east-1 | US East (Commercial Gov - N. Virginia) | * [FedRAMP (Moderate)](cert-fedramp.md) * [GovRAMP (Moderate)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) * FIPS 140-2 * DOJ Criminal Justice Information Systems (CJIS) Security Policy - Requires Supplemental Contract Terms * IRS Publication 1075 - Requires Supplemental Contract Terms * NIST 800-171 |
    | us-east-1 | US East (N. Virginia) | * [TX-RAMP (Level 2)](cert-txramp.md) |
    | us-west-2 | US West (Commercial Gov - Oregon) | * [FedRAMP (Moderate)](cert-fedramp.md) * [GovRAMP (Moderate)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) * FIPS 140-2 * DOJ Criminal Justice Information Systems (CJIS) Security Policy - Requires Supplemental Contract Terms * IRS Publication 1075 - Requires Supplemental Contract Terms * NIST 800-171 |
    | **Microsoft Azure** |  |  |
    | southcentralus | South Central US (Texas) | * [GovRAMP (Moderate)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) |

U.S. SnowGov Regions
:   Snowflake makes the following SnowGov Regions on AWS GovCloud (US) and Microsoft Azure Government
    available to customers who require additional security designed for US government regulated workloads and other types of sensitive data.
    These regions are operated by Snowflake personnel who are U.S. persons located within the U.S. Certain features that are available in
    Snowflake’s commercial regions might not be available or might be different in its SnowGov Regions. Use of and access to Snowflake in any of
    the SnowGov Regions are limited solely to U.S. Government Customers or U.S. Government Contractors (unless otherwise agreed upon by
    Snowflake in its sole discretion) and are subject to Snowflake’s [U.S. SnowGov Region Terms of Service](https://www.snowflake.com/en/legal-gov/terms-of-service/us-snowgov-terms-of-service/). As used here, a
    “U.S. Government Customer” means a Snowflake customer that is: (a) a U.S. Federal, state, or local government entity or (b) a tribal
    government entity; and a “U.S. Government Contractor” means a commercial entity that is required to process data provided by a U.S.
    Government Customer to perform a prime contract or subcontract with or for such entity.

    The SnowGov Regions support U.S. government standards, such as FedRAMP, Department of War Impact Levels, GovRAMP, TX-RAMP, FIPS-140-2, and the
    International Traffic in Arms Regulations (ITAR) among others. As noted in the table below, certain standards require the customer to
    accept supplemental contract terms. You must contact Snowflake and agree to the supplemental terms before uploading workloads covered by
    these standards.

    Self-provisioning of initial Snowflake accounts is not available in the SnowGov Regions. To provision an initial account in these regions,
    you must contact Snowflake.

    | Cloud Region ID | Region Name | Workloads Supported by Default | Workloads Requiring Supplemental Contract Terms |
    | --- | --- | --- | --- |
    | **Amazon** **Web** **Services** |  |  |  |
    | us-gov-east-1 | US Gov East 1 (FedRAMP High Plus) | * [FedRAMP (High)](cert-fedramp.md) * [Department of War (DoW) Impact Level 4 (IL4)](cert-dodIL5.md) * [GovRAMP (High)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) * FIPS 140-2 * NIST 800-171 * [ITAR](cert-itar.md) | * DFARS 252.204-7012 * DFARS 252.239-7010 * DoJ [CJIS](cert-cjis.md) Security Policy * IRS Publication 1075 |
    | us-gov-west-1 | US Gov West 1 (FedRAMP High Plus) | * [FedRAMP High](cert-fedramp.md) * [DoW IL4](cert-dodIL5.md) * [GovRAMP (High)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) * FIPS 140-2 * NIST 800-171 * [ITAR](cert-itar.md) | * DFARS 252.204-7012 * DFARS 252.239-7010 * DoJ [CJIS](cert-cjis.md) Security Policy * IRS Publication 1075 |
    | us-gov-west-1 | US Gov West 1 (DoW) | * [DoW IL5](cert-dodIL5.md) * BCAP * FIPS 140-2 * NIST 800-171 * [ITAR](cert-itar.md) | * DFARS 252.204-7012 * DFARS 252.239-7010 * DoJ [CJIS](cert-cjis.md) Security Policy * IRS Publication 1075 |
    | **Microsoft Azure Government** |  |  |  |
    | usgovvirginia | US Gov Virginia (FedRAMP High Plus) | * [FedRAMP (Moderate and High)](cert-fedramp.md) * [GovRAMP (High)](cert-stateramp.md) — *Planned* * [TX-RAMP (Level 2)](cert-txramp.md) * [DoW IL4](cert-dodIL5.md) — *Planned* * FIPS 140-2 * NIST 800-171 * [ITAR](cert-itar.md) | * DFARS 252.204-7012 * DOJ [CJIS](cert-cjis.md) Security Policy * IRS Publication 1075 |
    | usgovvirginia | US Gov Virginia | * [GovRAMP (Moderate)](cert-stateramp.md) * [TX-RAMP (Level 2)](cert-txramp.md) * FIPS 140-2 * NIST 800-171 * [ITAR](cert-itar.md) | * DFARS 252.204-7012 * DOJ [CJIS](cert-cjis.md) Security Policy * IRS Publication 1075 |

Note that the government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.
For more information, see [AWS GovCloud (US)](https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-s3.html) and
[Azure Government](https://learn.microsoft.com/en-us/azure/azure-government/).

## Europe, Middle East, and Africa

These regions are supported for organizations that prefer or require their data to be stored in the European Union (EU), United
Kingdom (UK), Middle East, or Africa. Multiple regions are provided to allow your organization to meet its individual compliance and data
sovereignty requirements.

Snowflake supports the following European, Middle East, and African regions:

| Cloud Platform | Cloud Region ID [1] | Region Name | Additional Notes |
| --- | --- | --- | --- |
| **Amazon Web Services (AWS)** | | | |
|  | af-south-1 | Africa (Cape Town) |  |
| eu-central-1 | EU (Frankfurt) |  |
| eu-central-2 | EU (Zurich) |  |
| eu-north-1 | EU (Stockholm) |  |
| eu-west-1 | EU (Ireland) |  |
| eu-west-2 | Europe (London) |  |
| eu-west-3 | EU (Paris) |  |
| me-central-1 | Middle East (UAE) |  |
| **Google Cloud Platform (GCP)** | | | |
|  | europe-west2 | Europe West2 (London) |  |
| europe-west3 | Europe West3 (Frankfurt) |  |
| europe-west4 | Europe West4 (Netherlands) |  |
| me-central2 | Middle East Central2 (Dammam) |  |
| **Microsoft Azure** | | | |
|  | northeurope | North Europe (Ireland) |  |
| swedencentral | Sweden Central (Gävle) |  |
| switzerlandnorth | Switzerland North (Zurich) |  |
| westeurope | West Europe (Netherlands) |  |
| uaenorth | UAE North (Dubai) |  |
| uksouth | UK South (London) |  |

[1] See the map preceding this table for the location of each supported region, labeled by cloud region ID.

## Asia Pacific and China

These regions are supported for organizations that prefer or require their data to be stored in Japan, Korea, India, Southeast Asia,
Australia, and China. Multiple regions are provided to allow your organization to meet its individual compliance and data sovereignty
requirements.

Snowflake supports the following Asia Pacific and China regions:

| Cloud Platform | Cloud Region ID [1] | Region Name | Additional Notes |
| --- | --- | --- | --- |
| **Amazon Web Services (AWS)** | | | |
|  | ap-northeast-1 | Asia Pacific (Tokyo) | Completed assessment and supports compliance with the Japanese government’s Information System Security Management and Assessment Program (ISMAP) |
| ap-northeast-2 | Asia Pacific (Seoul) |  |
| ap-northeast-3 | Asia Pacific (Osaka) | Completed assessment and supports compliance with ISMAP |
| ap-south-1 | Asia Pacific (Mumbai) |  |
| ap-southeast-1 | Asia Pacific (Singapore) |  |
| ap-southeast-2 | Asia Pacific (Sydney) | Completed assessment and supports compliance with the Australian government’s Infosec Registered Assessors Program (IRAP) - Protected |
| ap-southeast-3 | Asia Pacific (Jakarta) |  |
| cn-northwest-1 | China (Ningxia) | The China region is separate from other Snowflake regions. It utilizes a separate domain name (`snowflakecomputing.cn`) and is wholly operated by Digital China Cloud Technology Limited (DCC), an authorized operating partner of Snowflake, Inc. Customers who wish to create and use Snowflake accounts in the China region must sign a separate agreement with DCC in accordance with all applicable rules and regulations.  Additionally, customers cannot use self-service to create their initial account in the China region. Instead, they must request the account through [DCC](mailto:snowflake.hosting%40dcclouds.com). Once the initial account is created within their org, they can create additional accounts in the org using all other supported methods.  Customers with existing Snowflake accounts are not able to access resources in the China region, and vice versa. Some features might not be available in the China region. |
| **Google Cloud Platform (GCP)** | | | |
|  | australia-southeast2 | Australia Southeast 2 (Melbourne) |  |
| **Microsoft Azure** | | | |
|  | australiaeast | Australia East (New South Wales) | Completed assessment and supports compliance with IRAP - Protected |
| centralindia | Central India (Pune) |  |
| japaneast | Japan East (Tokyo) | Completed assessment and supports compliance with ISMAP |
| koreacentral | Korea Central (Seoul) |  |
| southeastasia | Southeast Asia (Singapore) |  |

[1] See the map preceding this table for the location of each supported region, labeled by cloud region ID.

## Region time zones for support

Snowflake supports multiple [editions](intro-editions.md) with each edition offering different levels of service.

Effective May 1, 2020:

* All new accounts, regardless of Snowflake Edition, receive Premier support, which includes 24/7 coverage.
* Standard Edition accounts that were provisioned before this date will continue to receive Standard support until the accounts are
  transitioned to Premier support.

  Standard support hours are Monday - Friday, 6:00 AM - 6:00 PM, across all regions, but the time zones vary depending on the geographic
  location of the region:

  > North America:
  > :   Pacific Time (PST or PDT)
  >
  > Europe, Middle East, & Africa:
  > :   Central Europe Time (CET or CEST)
  >
  > Asia Pacific:
  > :   Australian Eastern Time (AEST or AEDT)

## Differences between regions

Snowflake features and services are identical across regions except for some newly-introduced features (based on cloud platform or region).
However, there are some differences in unit costs for credits and data storage between regions.

Another factor that impacts unit costs is whether your Snowflake account is *On Demand* or *Capacity*.

For more information about pricing as it pertains to a specific region and account type, see the
[pricing page](http://www.snowflake.com/pricing) (on the Snowflake website).

## View a list of regions available for an organization

An [organization administrator](organization-administrators.md) can view a list of regions available for an organization
through [Snowsight](ui-snowsight-gs.md) or using SQL:

> Snowsight:
> :   In the navigation menu, select Admin » Accounts, and then select + Account. Browse through Region.
>
> SQL:
> :   Execute the [SHOW REGIONS](../sql-reference/sql/show-regions.md) command.

## Considerations for choosing a region for your account

When you request a Snowflake account, either through self-service or a Snowflake representative, you can choose the region where the
account is located. For example, you can decide to locate an account in a particular region on a particular cloud platform to address
latency concerns and/or provide additional backup and disaster recovery beyond the standard recovery support provided by Snowflake.
Snowflake does not place any restrictions on the region where you choose to locate each account.

If latency is a concern, you should choose the available region with the closest geographic proximity to your end users; however, this might
have cost implications, due to pricing differences between the regions. For more details, see the
[pricing page](http://www.snowflake.com/pricing) (on the Snowflake website).

If you are a government agency or a commercial organization that must comply with specific privacy and security requirements of the US
government, you can choose between two dedicated government regions provided by Snowflake.

> **Important:**
>
> Regions do not limit user access to Snowflake; they only dictate the geographic location where data is stored and compute resources are
> provisioned.
>
> In addition, Snowflake does not move data between accounts, so any data in an account in a region remains in the region unless users
> explicitly choose to copy, move, or [replicate](account-replication-intro.md) the data.

## Specify region information in your account hostname

A hostname for a Snowflake account starts with an *account identifier* and ends with the Snowflake domain
(`snowflakecomputing.com`). Snowflake supports two formats to use as the
[account identifier](admin-account-identifier.md) in your hostname:

* Account name (preferred)
* Account locator

> **Important:**
>
> If you choose the account locator as your account identifier, you might need to include additional segments in the locator that
> specify the cloud region and [cloud platform](intro-cloud-platforms.md) where your account is hosted.
>
> For more details, see [Format 2: Account locator in a region](admin-account-identifier.md).

---
title: Supported dbt commands and flags
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-supported-commands.md
section: User Guide
---

# Supported dbt commands and flags

The following table shows the dbt commands that are supported in dbt Projects on Snowflake. Any [dbt command](https://docs.getdbt.com/reference/dbt-commands) that isn’t listed here isn’t supported.

> dbt Projects on Snowflake, supported dbt commands by execution method
>
>
>
>
>
>
> | dbt command | Workspaces | EXECUTE DBT PROJECT | `snow dbt execute` (CLI) |
> | --- | --- | --- | --- |
> | [build](https://docs.getdbt.com/reference/commands/build) | ✔ | ✔ | ✔ |
> | [compile](https://docs.getdbt.com/reference/commands/compile) | ✔ | ✔ | ✔ |
> | [deps](https://docs.getdbt.com/reference/commands/deps) [1] | ✔ | ✔ | ✔ |
> | [docs generate](https://docs.getdbt.com/reference/commands/cmd-docs#dbt-docs-generate) [2] | ✔ | ✔ | ❌ |
> | [list](https://docs.getdbt.com/reference/commands/list) | ✔ | ✔ | ✔ |
> | [parse](https://docs.getdbt.com/reference/commands/parse) | ❌ | ✔ | ✔ |
> | [run](https://docs.getdbt.com/reference/commands/run) | ✔ | ✔ | ✔ |
> | [retry](https://docs.getdbt.com/reference/commands/retry) | ✔ | ❌ | ❌ |
> | [run-operation](https://docs.getdbt.com/reference/commands/run-operation) | ✔ | ✔ | ✔ |
> | [seed](https://docs.getdbt.com/reference/commands/seed) | ✔ | ✔ | ✔ |
> | [show](https://docs.getdbt.com/reference/commands/show) | ✔ | ✔ | ✔ |
> | [snapshot](https://docs.getdbt.com/reference/commands/snapshot) | ✔ | ✔ | ✔ |
> | [test](https://docs.getdbt.com/reference/commands/test) | ✔ | ✔ | ✔ |

[1] A dbt project object is a versioned snapshot of your project. Running the deps command on it doesn’t modify any files; it’s primarily
used to verify that your external access configuration is correct. When a dbt project object is created with an external access integration, dbt deps is run before dbt compile to package all dependencies and project files.

[2] dbt Projects on Snowflake don’t support dbt docs serve.

## About flags

In dbt Core, you run commands (for example, `dbt build`) and modify their behavior with flags. Flags are configuration options that modify how a command behaves; some are command-specific, others are global. For more information, see [flags](https://docs.getdbt.com/reference/global-configs/about-global-configs).

You always run a command, and you attach flags to scope or alter it. For example, to run only incremental models and rebuild them, you would run the following command and flags:

```sqlexample
dbt run --select config.materialized:incremental --full-refresh;
```

The following flags aren’t supported in dbt Projects on Snowflake:

* `--state`
* `--target-path`
* `--log-path`
* `--profiles-dir`
* `--project-dir`
* `--log-format`
* `--log-format-file`

---
title: Supported dbt Core versions for dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-dbt-core-versions.md
section: User Guide
---

# Supported dbt Core versions for dbt Projects on Snowflake

Snowflake provides managed runtimes for dbt Projects to ensure a secure and predictable execution environment. Because dbt Core releases can
introduce breaking changes or security vulnerabilities, Snowflake follows a structured lifecycle for each version. This policy allows users
to pin specific versions for governance and reproducibility while providing a clear timeline for required migrations.

Supported versions for dbt Projects

| dbt Core Version Supported | Snowflake Support Level | dbt Labs Support |
| --- | --- | --- |
| 1.10.15 | Active support | Critical support until Jun 15, 2026 |
| 1.9.4 | Active support | Deprecated |

The DBT_VERSION parameter implicitly defines the execution engine based on the version, as shown in the table below.

Version based engine mapping

| User Input (DBT_VERSION) | Condition | Resulting Engine |
| --- | --- | --- |
| ‘1.x’ (for example, `1.9.4`) | Version `< 2.0` | dbt Core (Python-based) |

## View supported dbt Core versions

To view supported dbt Core versions, run the [SYSTEM$SUPPORTED_DBT_VERSIONS](../../sql-reference/functions/system_supported_dbt_versions.md) system function, as shown
in the following example:

```sqlexample
SELECT SYSTEM$SUPPORTED_DBT_VERSIONS();
```

```output
[{"dbt_version":"1.9.4","type":"dbt Core"},{"dbt_version":"1.10.15","type":"dbt Core"}]
```

## Alter dbt Core execution version

To alter the dbt Core version that the dbt project object will execute, run the [ALTER DBT PROJECT](../../sql-reference/sql/alter-dbt-project.md) command as shown
in the following example:

```sqlexample
ALTER DBT PROJECT my_dbt_project SET DBT_VERSION = '1.10.15';
```

## Create a dbt project pinned to a version

The following example creates a dbt project pinned to the 1.10.15 dbt version:

```sqlexample
CREATE OR REPLACE DBT PROJECT my_dbt_project
  FROM '@my_stage/dbt_files'
  DBT_VERSION = '1.10.15';
```

For more information and examples, see [CREATE DBT PROJECT](../../sql-reference/sql/create-dbt-project.md) and [ALTER DBT PROJECT](../../sql-reference/sql/alter-dbt-project.md).

## How deprecation and decommissioning work

* Snowflake supported versions: These versions are available for all new and existing projects. Snowflake provides full technical support,
  including security patches.
* Snowflake deprecated versions: These versions have reached the end of their active development cycle. While they remain fully functional for
  existing projects, users are discouraged from starting new projects on a deprecated version.
* Snowflake decommissioned versions: These versions are officially removed from the Snowflake environment. At this stage, any project pinned to
  a decommissioned version will fail to execute until it’s updated to a currently supported version.
* dbt Core Support Levels: Even if a version reaches *Critical Support*, *Deprecated*, or *End of Life* status according to
  [dbt Labs](https://docs.getdbt.com/docs/dbt-versions/core#latest-releases), it remains supported on Snowflake. This means that you aren’t
  forced into immediate upgrades and can maintain your existing environment for as long as you choose.

---
title: Supported dbt project source file locations
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-sources.md
section: User Guide
---

# Supported dbt project source file locations

dbt project source files can be in any one of the following locations:

> * **A Git repository stage**, for example:
>
>   `'@my_db.my_schema.my_git_repository_stage/branches/my_branch/path/to/dbt_project_or_projects_parent'`
>
>   For more information about creating a Git repository object in Snowflake that connects a Git repository to a workspace for dbt Projects on Snowflake, see [Create a workspace connected to your Git repository](../tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md). For more information about creating and managing a Git repository object and stage without using a workspace, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md) and [CREATE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md).
> * **An existing dbt project stage**, for example:
>
>   `'snow://dbt/my_db.my_schema.my_existing_dbt_project_object/versions/last'`
>
>   The version specifier is required and can be `last` (as shown in the previous example), `first`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).
> * **An internal named stage**, for example:
>
>   `'@my_db.my_schema.my_internal_named_stage/path/to/dbt_projects_or_projects_parent'`
>
>   Internal user stages and table stages aren’t supported.
> * **A workspace for dbt on Snowflake**, for example:
>
>   `'snow://workspace/user$.public."my_workspace_name"/versions/live/path/to/dbt_projects_or_projects_parent'`
>
>   We recommend enclosing the workspace name in double quotes because workspace names are case-sensitive and can contain special characters.
>
>   The version specifier is required and can be `last`, `first`, `live`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).

---
title: Supported formats for semi-structured data
source: https://docs.snowflake.com/en/user-guide/semistructured-data-formats.md
section: User Guide
---

# Supported formats for semi-structured data

This topic describes the supported formats for semi-structured data.

Snowflake natively supports the semi-structured data formats below. Specifically, Snowflake provides options in COPY commands to
load and unload data files in these formats.

## JSON

### About JSON

JSON (JavaScript Object Notation) is a lightweight, plain-text, data-interchange format based on a subset of the JavaScript Programming Language.

JSON data can be produced by any application. Some common examples include:

* JavaScript applications using native methods to generate JSON.
* Non-JavaScript applications using libraries (usually with extensions) to generate JSON data.
* Ad hoc JavaScript generators.
* Concatenation of JSON documents (which may or may not be line-separated).

Because there is no formal specification, there are significant differences between various implementations. These differences makes import of JSON-like data sets impossible if the JSON parser is strict in
its language definition. To make import of JSON data sets as problem-free as possible, Snowflake follows the rule “be liberal in what you accept”. The intent is to accept the widest possible range of JSON
and JSON-like inputs that permit unambiguous interpretation.

This topic describes the syntax for JSON documents accepted by Snowflake.

For more information about JSON, see [json.org](http://www.json.org).

### Basic JSON syntax

JSON data is a hierarchical collection of name/value pairs grouped into objects and arrays:

* Colons `:` separate names and values in name/value pairs.
* Curly braces `{}` denote objects.
* Square brackets `[]` denote arrays.
* Commas `,` separate entities in objects and arrays.

### Name/value pairs

JSON name/value pairs consist of a field name (in double quotes), followed by a colon, then a value.

For example:

```sqljson
{"firstName":"John", "empid":45611}
```

### Supported data types

A value in a name/value pair can be:

* A number (integer or floating point)
* A string (in double quotes)
* A Boolean (true or false)
* An array (in square brackets)
* An object (in curly braces)
* Null

### Objects

JSON objects are written inside curly braces. An object can contain multiple name/values pairs, separated by commas. For example:

```sqljson
{"firstName":"John", "lastName":"Doe"}
```

### Arrays

JSON arrays are written inside square brackets. An array can contain multiple objects, separated by commas. For example:

```sqljson
{"employees":[
    {"firstName":"John", "lastName":"Doe"},
    {"firstName":"Anna", "lastName":"Smith"},
    {"firstName":"Peter", "lastName":"Jones"}
  ]
}
```

### Examples of JSON documents

**FILE NAME:** `json_sample_data1`

Contains an array with 3 simple employee records (objects):

> ```sqljson
> {"root":[{"employees":[
>     {"firstName":"John", "lastName":"Doe"},
>     {"firstName":"Anna", "lastName":"Smith"},
>     {"firstName":"Peter", "lastName":"Jones"}
> ]}]}
> ```

**FILE NAME:** `json_sample_data2`

Contains an array with 3 employee records (objects) and their associated dependent data (children, the children’s names and ages, cities where the employee has lived, and the years that the employee has
lived in those cities):

> ```sqljson
> {"root":
>    [
>     { "kind": "person",
>       "fullName": "John Doe",
>       "age": 22,
>       "gender": "Male",
>       "phoneNumber":
>         {"areaCode": "206",
>          "number": "1234567"},
>       "children":
>          [
>            {
>              "name": "Jane",
>              "gender": "Female",
>              "age": "6"
>            },
>            {
>               "name": "John",
>               "gender": "Male",
>               "age": "15"
>            }
>          ],
>       "citiesLived":
>          [
>             {
>                "place": "Seattle",
>                "yearsLived": ["1995"]
>             },
>             {
>                "place": "Stockholm",
>                "yearsLived": ["2005"]
>             }
>          ]
>       },
>       {"kind": "person", "fullName": "Mike Jones", "age": 35, "gender": "Male", "phoneNumber": { "areaCode": "622", "number": "1567845"}, "children": [{ "name": "Earl", "gender": "Male", "age": "10"}, {"name": "Sam", "gender": "Male", "age": "6"}, { "name": "Kit", "gender": "Male", "age": "8"}], "citiesLived": [{"place": "Los Angeles", "yearsLived": ["1989", "1993", "1998", "2002"]}, {"place": "Washington DC", "yearsLived": ["1990", "1993", "1998", "2008"]}, {"place": "Portland", "yearsLived": ["1993", "1998", "2003", "2005"]}, {"place": "Austin", "yearsLived": ["1973", "1998", "2001", "2005"]}]},
>       {"kind": "person", "fullName": "Anna Karenina", "age": 45, "gender": "Female", "phoneNumber": { "areaCode": "425", "number": "1984783"}, "citiesLived": [{"place": "Stockholm", "yearsLived": ["1992", "1998", "2000", "2010"]}, {"place": "Russia", "yearsLived": ["1998", "2001", "2005"]}, {"place": "Austin", "yearsLived": ["1995", "1999"]}]}
>     ]
> }
> ```

## Avro

### About Avro

Avro is an open-source data serialization and RPC framework originally developed for use with Apache Hadoop. It utilizes schemas defined in JSON to produce serialized data in a compact binary format. The
serialized data can be sent to any destination (i.e. application or program) and can be easily deserialized at the destination because the schema is included in the data.

An Avro schema consists of a JSON string, object, or array that defines the type of schema and the data attributes (field names, data types, etc.) for the schema type. The attributes differ depending on
the schema type. Complex data types such as arrays and maps are supported.

Snowflake reads Avro data into a single VARIANT column. You can query the data in a VARIANT column just as you would JSON data, using similar commands and functions.

For more information, see [avro.apache.org](http://avro.apache.org).

### Example of an Avro schema

```sqljson
{
 "type": "record",
 "name": "person",
 "namespace": "example.avro",
 "fields": [
     {"name": "fullName", "type": "string"},
     {"name": "age",  "type": ["int", "null"]},
     {"name": "gender", "type": ["string", "null"]}
     ]
}
```

## ORC

### About ORC

ORC (Optimized Row Columnar) is a binary format used to store Hive data. ORC was designed for efficient compression and improved
performance for reading, writing, and processing data over earlier Hive file formats. For more information about ORC, see [https://orc.apache.org/](https://orc.apache.org//).

Snowflake reads ORC data into a single VARIANT column. You can query the data in a VARIANT column just as you would JSON data, using similar commands and functions.

Alternatively, you can extract columns from a staged ORC file into separate table columns using a CREATE TABLE AS SELECT statement.

> **Note:**
>
> * Map data is deserialized into an array of objects, e.g.:
>
>   ```sqljson
>   "map": [{"key": "chani", "value": {"int1": 5, "string1": "chani"}}, {"key": "mauddib", "value": {"int1": 1, "string1": "mauddib"}}]
>   ```
> * Union data is deserialized into a single object, e.g.:
>
>   ```sqljson
>   {"time": "1970-05-05 12:34:56.197", "union": {"tag": 0, "value": 3880900}, "decimal": 3863316326626557453.000000000000000000}
>   ```

### Example of ORC data loaded into a VARIANT column

```output
+--------------------------------------+
| SRC                                  |
|--------------------------------------|
| {                                    |
|   "boolean1": false,                 |
|   "byte1": 1,                        |
|   "bytes1": "0001020304",            |
|   "decimal1": 12345678.654745,       |
|   "double1": -1.500000000000000e+01, |
|   "float1": 1.000000000000000e+00,   |
|   "int1": 65536,                     |
|   "list": [                          |
|     {                                |
|       "int1": 3,                     |
|       "string1": "good"              |
|     },                               |
|     {                                |
|       "int1": 4,                     |
|       "string1": "bad"               |
|     }                                |
|   ]                                  |
| }                                    |
+--------------------------------------+
```

## Parquet

### About Parquet

Parquet is a compressed, efficient columnar data representation designed for projects in the Hadoop ecosystem. The file format supports complex nested data structures and uses Dremel record shredding and assembly algorithms. Parquet files can’t be opened in a text editor.
For more information, see [parquet.apache.org/docs/](https://parquet.apache.org/docs/).

> **Note:**
>
> Snowflake supports Parquet files produced using the Parquet writer V2 for Apache Iceberg™ tables or when you use
> a [vectorized scanner](../sql-reference/sql/copy-into-table.md).

Depending on your loading use case, Snowflake either reads Parquet data into a single VARIANT column or directly into table columns
(such as when you [load data from Iceberg-compatible Parquet files](tables-iceberg-load.md)).

You can query the data in a VARIANT column just as you would JSON data, using similar commands and functions.
Alternatively, you can extract select columns from a staged Parquet file into separate table columns using a CREATE TABLE AS SELECT statement.

### Example of Parquet data loaded into a VARIANT column

```output
+------------------------------------------+
| SRC                                      |
|------------------------------------------|
| {                                        |
|   "continent": "Europe",                 |
|   "country": {                           |
|     "city": {                            |
|       "bag": [                           |
|         {                                |
|           "array_element": "Paris"       |
|         },                               |
|         {                                |
|           "array_element": "Nice"        |
|         },                               |
|         {                                |
|           "array_element": "Marseilles"  |
|         },                               |
|         {                                |
|           "array_element": "Cannes"      |
|         }                                |
|       ]                                  |
|     },                                   |
|     "name": "France"                     |
|   }                                      |
| }                                        |
+------------------------------------------+
```

## XML

### About XML

XML (eXtensible Markup Language) is a markup language that defines a set of rules for encoding documents. It was
originally based on SGML, another markup language developed for standardizing the structure and elements that comprise
a document.

Since its introduction, XML has grown beyond an initial focus on documents to encompass a wide range of uses, including
representation of arbitrary data structures and serving as the base language for communication protocols. Because of its
extensibility, versatility, and usability, it has become one of the most commonly-used standards for data interchange
on the Web.

An XML document consists primarily of the following constructs:

* Tags (identified by angle brackets, `<` and `>`)
* Elements

Elements typically consist of a “start” tag and matching “end” tag, with the text between the tags constituting the content
of the element. An element can also consist of an “empty-element” tag with no “end” tag. “start” and “empty-element” tags might
contain attributes, which help define the characteristics or metadata for the element.

When you query XML data, the dollar sign operator (`$`) returns the contents, as a VARIANT value, of the value it operates on.
For an element, the contents of that element are returned:

* If the element contains text, text is returned as a VARIANT value.
* If the element contains another element, the element is returned as a VARIANT value in XML format.
* If the element contains a series of elements, an array of the elements is returned as a VARIANT value in JSON format.

Use the following operators to access the VARIANT value in a query:

* `$` for the contents of the value.
* `@` for the name of the value. This operator is useful when you are iterating through elements with different names.

  Use `@attribute_name` for the contents of a named attribute. For example, for `@attr`, the attribute name is `attr`.
  The query returns the contents of the attribute with the name that directly follows the ampersand. If no attribute is found,
  NULL is returned.

For examples that query XML data, see Examples of querying XML data.

You can use the following functions to work with XML data:

* [CHECK_XML](../sql-reference/functions/check_xml.md)
* [PARSE_XML](../sql-reference/functions/parse_xml.md)
* [TO_XML](../sql-reference/functions/to_xml.md)
* [XMLGET](../sql-reference/functions/xmlget.md)

### Examples of working with XML

The following examples show you how to load and query XML data.

#### Example of loading an XML document

This example shows you how to load the following XML document:

```xml
<?xml version="1.0"?>
<!DOCTYPE parts system "parts.dtd">
<?xml-stylesheet type="text/css" href="xmlpartsstyle.css"?>
<parts>
   <part count="4">
      <item>Spark Plugs</item>
      <partnum>A3-400</partnum>
      <manufacturer>ABC company</manufacturer>
      <price units="dollar"> 27.00</price>
   </part>
   <part count="1">
      <item>Motor Oil</item>
      <partnum>B5-200</partnum>
      <source>XYZ company</source>
      <price units="dollar"> 14.00</price>
   </part>
   <part count="1">
      <item>Motor Oil</item>
      <partnum>B5-300</partnum>
      <source>XYZ company</source>
      <price units="dollar"> 16.75</price>
   </part>
   <part count="1">
      <item>Engine Coolant</item>
      <partnum>B6-120</partnum>
       <source>XYZ company</source>
      <price units="dollar"> 19.00</price>
   </part>
   <part count="1">
      <item>Engine Coolant</item>
      <partnum>B6-220</partnum>
      <source>XYZ company</source>
      <price units="dollar"> 18.25</price>
   </part>
</parts>
```

Complete the following steps to load the XML document:

1. Copy the content of the XML document into a file on your file system.

   This example assumes that the file is named `auto-parts.xml` in the `/examples/xml/` directory.
2. Stage the file in the internal staging location:

   ```sqlexample
   PUT FILE:///examples/xml/auto-parts.xml @~/xml_stage;
   ```
3. Create a table for the XML document:

   ```sqlexample
   CREATE OR REPLACE TABLE sample_xml_parts(src VARIANT);
   ```
4. Load the staged XML file into the table:

   ```sqlexample
   COPY INTO sample_xml_parts
     FROM @~/xml_stage
     FILE_FORMAT=(TYPE=XML) ON_ERROR='CONTINUE';
   ```

#### Examples of querying XML data

These examples query XML data.

##### Query XML data directly

Query the column that contains the XML data to return the XML document.

This example queries the XML data loaded in Example of loading an XML document directly:

```sqlexample
SELECT src FROM sample_xml_parts;
```

```output
+----------------------------------------------+
| SRC                                          |
|----------------------------------------------|
| <parts>                                      |
|   <part count="4">                           |
|     <item>Spark Plugs</item>                 |
|     <partnum>A3-400</partnum>                |
|     <manufacturer>ABC company</manufacturer> |
|     <price units="dollar">27.00</price>      |
|   </part>                                    |
|   <part count="1">                           |
|     <item>Motor Oil</item>                   |
|     <partnum>B5-200</partnum>                |
|     <source>XYZ company</source>             |
|     <price units="dollar">14.00</price>      |
|   </part>                                    |
|   <part count="1">                           |
|     <item>Motor Oil</item>                   |
|     <partnum>B5-300</partnum>                |
|     <source>XYZ company</source>             |
|     <price units="dollar">16.75</price>      |
|   </part>                                    |
|   <part count="1">                           |
|     <item>Engine Coolant</item>              |
|     <partnum>B6-120</partnum>                |
|     <source>XYZ company</source>             |
|     <price units="dollar">19.00</price>      |
|   </part>                                    |
|   <part count="1">                           |
|     <item>Engine Coolant</item>              |
|     <partnum>B6-220</partnum>                |
|     <source>XYZ company</source>             |
|     <price units="dollar">18.25</price>      |
|   </part>                                    |
| </parts>                                     |
+----------------------------------------------+
```

##### Query XML data using operators

Query the column that contains the XML data using the `$` and `@` operators.

This example queries the XML data loaded in Example of loading an XML document using the `$`
operator. The query shows metadata about the values (`$`) and names (`@`) of the elements.

```sqlexample
SELECT src:"$" FROM sample_xml_parts;
```

```output
+--------------------------------+
| SRC:"$"                        |
|--------------------------------|
| [                              |
|   {                            |
|     "$": [                     |
|       {                        |
|         "$": "Spark Plugs",    |
|         "@": "item"            |
|       },                       |
|       {                        |
|         "$": "A3-400",         |
|         "@": "partnum"         |
|       },                       |
|       {                        |
|         "$": "ABC company",    |
|         "@": "manufacturer"    |
|       },                       |
|       {                        |
|         "$": 27,               |
|         "@": "price",          |
|         "@units": "dollar"     |
|       }                        |
|     ],                         |
|     "@": "part",               |
|     "@count": 4,               |
|     "item": 0,                 |
|     "manufacturer": 2,         |
|     "partnum": 1,              |
|     "price": 3                 |
|   },                           |
|   {                            |
|     "$": [                     |
|       {                        |
|         "$": "Motor Oil",      |
|         "@": "item"            |
|       },                       |
|       {                        |
|         "$": "B5-200",         |
|         "@": "partnum"         |
|       },                       |
|       {                        |
|         "$": "XYZ company",    |
|         "@": "source"          |
|       },                       |
|       {                        |
|         "$": 14,               |
|         "@": "price",          |
|         "@units": "dollar"     |
|       }                        |
|     ],                         |
|     "@": "part",               |
|     "@count": 1,               |
|     "item": 0,                 |
|     "partnum": 1,              |
|     "price": 3,                |
|     "source": 2                |
|   },                           |
|                                |
|              ...               |
|                                |
+--------------------------------+
```

This example queries the same XML data using the `@` operator. The query shows the name of the root element.

```sqlexample
SELECT src:"@" FROM sample_xml_parts;
```

```output
+---------+
| SRC:"@" |
|---------|
| "parts" |
+---------+
```

This example queries the same XML data using `$` operator and the `@` operator. In the array of child
elements in the root element, the query shows the value of the `count` attribute for the element at the first (0)
and second (1) index.

```sqlexample
SELECT src:"$"[0]."@count", src:"$"[1]."@count" FROM sample_xml_parts;
```

```output
+---------------------+---------------------+
| SRC:"$"[0]."@COUNT" | SRC:"$"[1]."@COUNT" |
|---------------------+---------------------|
| 4                   | 1                   |
+---------------------+---------------------+
```

##### Query XML data using the XMLGET function

Query the column that contains the XML data using the [XMLGET](../sql-reference/functions/xmlget.md) function.

This example queries the XML data loaded in Example of loading an XML document and returns the first
instance of an element in the root element of the XML data. The instance number is 0-based, not 1-based.
So, the following queries are equivalent:

```sqlexample
SELECT XMLGET(src, 'part') FROM sample_xml_parts;

SELECT XMLGET(src, 'part', 0) FROM sample_xml_parts;
```

```output
+--------------------------------------------+
| XMLGET(SRC, 'PART')                        |
|--------------------------------------------|
| <part count="4">                           |
|   <item>Spark Plugs</item>                 |
|   <partnum>A3-400</partnum>                |
|   <manufacturer>ABC company</manufacturer> |
|   <price units="dollar">27.00</price>      |
| </part>                                    |
+--------------------------------------------+
```

This query returns the third element (0-based) in the root element of the XML data.

```sqlexample
SELECT XMLGET(src, 'part', 3) FROM sample_xml_parts;
```

```output
+---------------------------------------+
| XMLGET(SRC, 'PART', 3)                |
|---------------------------------------|
| <part count="1">                      |
|   <item>Engine Coolant</item>         |
|   <partnum>B6-120</partnum>           |
|   <source>XYZ company</source>        |
|   <price units="dollar">19.00</price> |
| </part>                               |
+---------------------------------------+
```

##### Query XML data to extract element contents using multiple functions

This example uses the [FLATTEN](../sql-reference/functions/flatten.md) function with the [XMLGET](../sql-reference/functions/xmlget.md)
function to extract the contents of the elements in the XML data loaded in Example of loading an XML document.

The example uses the [COALESCE](../sql-reference/functions/coalesce.md) function to return either the child element `manufacturer`
or `source` if it exists, cast to a VARCHAR value. The `SRC:"$"` passed to FLATTEN specifies the value in the root
element `parts`. The LATERAL FLATTEN iterates through all of the repeating elements that are passed in.

```sqlexample
SELECT XMLGET(VALUE, 'item'):"$"::VARCHAR AS item,
       XMLGET(VALUE, 'partnum'):"$"::VARCHAR AS partnum,
       COALESCE(XMLGET(VALUE, 'manufacturer'):"$"::VARCHAR,
                XMLGET(VALUE, 'source'):"$"::VARCHAR) AS manufacturer_or_source,
       XMLGET(VALUE, 'price'):"$"::VARCHAR AS price,
  FROM sample_xml_parts,
    LATERAL FLATTEN(INPUT => SRC:"$");
```

```output
+----------------+---------+------------------------+-------+
| ITEM           | PARTNUM | MANUFACTURER_OR_SOURCE | PRICE |
|----------------+---------+------------------------+-------|
| Spark Plugs    | A3-400  | ABC company            | 27    |
| Motor Oil      | B5-200  | XYZ company            | 14    |
| Motor Oil      | B5-300  | XYZ company            | 16.75 |
| Engine Coolant | B6-120  | XYZ company            | 19    |
| Engine Coolant | B6-220  | XYZ company            | 18.25 |
+----------------+---------+------------------------+-------+
```

---
title: Supported object types in DCM Projects
source: https://docs.snowflake.com/en/user-guide/dcm-projects/dcm-projects-supported-entities.md
section: User Guide
---

# Supported object types in DCM Projects

The DEFINE statement is a special command used exclusively in DCM project definition files. Its syntax is similar to the
[CREATE OR ALTER](../../sql-reference/sql/create-or-alter.md) command, but with the following key
differences:

* The order and location of DEFINE statements don’t matter. Snowflake collects and sorts all statements from all definition
  files during project execution.
* If you remove a DEFINE statement that was previously deployed, Snowflake drops the corresponding object the next time you deploy the
  project. The same applies to GRANT and ATTACH statements that are removed after being previously deployed.
* Only a subset of Snowflake object types are supported.
* All objects must be defined with a fully qualified name (`database.schema.object_name`).
* References to other objects must use fully qualified names.

The following object types are natively supported in DCM Projects definition files with the DEFINE, GRANT, or ATTACH statements.

* Database
* Schema
* Table
* (Secure) View
* Dynamic table
* Task
* File format
* Internal stage
* SQL function
* Data metric function
* Warehouse
* Role, Database Role
* Grant
* Tag
* Authentication policy

## Database

**Limitations:**

All [CREATE OR ALTER DATABASE limitations](https://docs.snowflake.com/sql-reference/sql/create-database#create-or-alter-database-usage-notes)
apply, including:

* Renaming the database

## Schema

**Limitations:**

All [CREATE OR ALTER SCHEMA limitations](https://docs.snowflake.com/sql-reference/sql/create-schema#create-or-alter-schema-usage-notes)
apply, including:

* Renaming the schema

## Table

**Limitations:**

All [CREATE OR ALTER TABLE limitations](https://docs.snowflake.com/sql-reference/sql/create-table#create-or-alter-table-usage-notes)
apply, including:

* Renaming tables
* Renaming columns
* Reordering columns
* Changing column types to incompatible types
* Adding search optimization to a table or columns
* Adding tags and policies to a table or columns

## View

**Limitations:**

All [CREATE OR ALTER VIEW limitations](https://docs.snowflake.com/sql-reference/sql/create-view#create-or-alter-view-usage-notes)
apply, including:

* Renaming views
* Reordering columns

## Dynamic table

**Supported changes:**

Without a full refresh:

* Warehouse
* Target-lag

With re-initialization or a full refresh:

* Refresh mode
* Any changes of the body including:

  + Dropping columns
  + Adding columns at the end

**Immutable arguments:**

* INITIALIZE

**Limitations:**

All [CREATE OR ALTER DYNAMIC TABLE limitations](https://docs.snowflake.com/sql-reference/sql/create-dynamic-table#create-or-alter-dynamic-table-usage-notes)
apply, including:

* Reordering columns
* Renaming dynamic tables

## Task

When definition changes are deployed for a task that is already started, Snowflake automatically suspends that task (or its root task)
temporarily, applies the change, and then resumes it again.

Newly deployed tasks are suspended by default.

**Limitations:**

* All [CREATE OR ALTER TASK limitations](https://docs.snowflake.com/sql-reference/sql/create-task#create-or-alter-task-usage-notes)
  apply.

## File format

**Limitations:**

* All [CREATE OR ALTER FILE FORMAT limitations](https://docs.snowflake.com/sql-reference/sql/create-file-format#create-or-alter-file-format-usage-notes)
  apply.

## Internal stage

**Supported changes:**

* Directory table
* Comment

**Immutable attributes:**

* Encryption type

**Limitations:**

* All [CREATE OR ALTER STAGE limitations](https://docs.snowflake.com/sql-reference/sql/create-stage#create-or-alter-stage-usage-notes)
  apply.

## SQL function

**Limitations:**

* All [CREATE OR ALTER FUNCTION limitations](https://docs.snowflake.com/sql-reference/sql/create-function#create-or-alter-function-usage-notes)
  apply.

## Data metric function

Data metric functions (DMFs) let you define data quality expectations and attach those expectations to tables. You can select from existing
system DMFs or write your own user-defined data metric functions (UDMFs). You can then attach them to tables, views, and dynamic tables with
a many-to-many relationship. For more information, see [Use SQL to set up data metric functions](../data-quality-working.md).

To attach data metric functions, you first need to add a `DATA_METRIC_SCHEDULE` to each table, dynamic table, or view definition. For
example: `DATA_METRIC_SCHEDULE = TRIGGER_ON_CHANGES`. The `TRIGGER_ON_CHANGES` schedule is not available for views.

The user-defined names of expectations must be unique per project and attachment.

Defining expectations is optional, but recommended, when attaching DMFs to table columns.
Attached DMFs without set expectations aren’t considered when running `EXECUTE DCM PROJECT <my_project> TEST ALL`.

**Supported changes:**

* Defining UDMFs (user-defined data metric functions)
* Attaching system DMFs and UDMFs to tables, views, or dynamic tables inside and outside a DCM project
* Defining data expectations for table columns

**Example:**

An example of defining a UDMF:

```sqlexample
DEFINE DATA METRIC FUNCTION DCM_DEMO.TESTS.INVENTORY_SPREAD(
  TABLE_NAME TABLE(
    COLUMN_VALUE number
  )
)
  RETURNS number
AS
$$
  SELECT
    MAX(COLUMN_VALUE) - MIN(COLUMN_VALUE)
  FROM
    TABLE_NAME
  WHERE
    COLUMN_VALUE IS NOT NULL
$$;
```

An example of attaching a system DMF with an expectation:

```sqlexample
ATTACH DATA METRIC FUNCTION SNOWFLAKE.CORE.MIN
  TO TABLE DCM_PROJECT_{{db}}.RAW.INVENTORY
  ON (IN_STOCK)
  EXPECTATION MIN_10_ITEMS_INVENTORY (value > 10);
```

An example of attaching a UDMF with an expectation:

```sqlexample
ATTACH DATA METRIC FUNCTION DCM_DEMO.TESTS.INVENTORY_SPREAD
  TO TABLE DCM_PROJECT_{{db}}.RAW.INVENTORY
  ON (IN_STOCK)
  EXPECTATION EVEN_ITEM_INVENTORY (VALUE < 50);
```

To see all available system DMFs, query `SHOW DATA METRIC FUNCTIONS IN DATABASE SNOWFLAKE`.

## Warehouse

**Immutable attributes:**

* INITIALLY_SUSPENDED

**Limitations:**

* All [CREATE OR ALTER WAREHOUSE limitations](https://docs.snowflake.com/sql-reference/sql/create-warehouse#create-or-alter-warehouse-usage-notes)
  apply.

## Role and Database Role

**Unsupported types:**

* Application Role

## Grant

Just like each object can be defined only once in DCM Projects, each privilege-grantee relationship can only be defined once across all DCM Projects.

When removing a GRANT OWNERSHIP statement that was previously deployed, DCM Projects attempts to use the current owner role to grant
ownership back to the DCM project owner. If the project owner role doesn’t hold the object’s owner role, ownership needs to be
transferred back manually outside of DCM Projects.

DCM Projects is only aware of grants that were defined and deployed through DCM Projects. Any grants that were added outside of DCM Projects coexist,
and DCM Projects doesn’t remove them.

**Unsupported GRANT types:**

* APPLICATION ROLE grants
* CALLER grants

## Tag

**Unsupported attributes:**

* Propagate

**Limitations:**

* All [CREATE OR ALTER TAG limitations](https://docs.snowflake.com/sql-reference/sql/create-tag#create-or-alter-tag-usage-notes)
  apply.

## Authentication policy

**Limitations:**

* All [CREATE OR ALTER AUTHENTICATION POLICY limitations](https://docs.snowflake.com/sql-reference/sql/create-authentication-policy#usage-notes)
  apply.

## Attaching tags, masking policies, and row access policies (unsupported)

Tags, masking policies, and row access policies can’t be added to DCM Projects table column definitions.

You can attach masking and row access policies manually outside of DCM Projects.
DCM Projects definitions for table objects ignore any attached masking or row access policies.
They are not revoked by redeploying table definitions, even when those definitions do not contain the policies.

---
title: Supported queries for dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-supported-queries.md
section: User Guide
---

# Supported queries for dynamic tables

Dynamic tables support standard SQL expressions and Snowflake-supported functions, including mathematical operations, string functions, date
functions, etc. This topic describes the expressions, constructs, functions, operators, and clauses that dynamic tables support in
incremental and full refresh modes.

If a query uses expressions, keywords, operators, or clauses that are not supported for incremental refresh, the automated refresh process
uses a full refresh instead, [which might incur an additional cost](dynamic-tables-cost.md).

For guidance on how different operators affect incremental refresh *performance*, see
[Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

## Supported data types

Dynamic tables support all [Snowflake SQL data types](../sql-reference/intro-summary-data-types.md)
for both incremental and full refresh, except:

* Structured data types.
* Geospatial data types (full refresh only).

## Supported queries in incremental and full refresh modes

| Keyword | Incremental Refresh Mode | Full Refresh Mode |
| --- | --- | --- |
| [DISTINCT](../sql-reference/sql/select.md) | Supported | Supported |
| [External functions](../sql-reference/external-functions-introduction.md) | Not supported | Not supported |
| [FROM](../sql-reference/constructs/from.md) | Source tables, views, Snowflake-managed Apache Iceberg™ tables, and other dynamic tables.  Subqueries outside of FROM clauses (for example, WHERE EXISTS) are not supported. | Supported |
| [GROUP BY](../sql-reference/constructs/group-by.md) | Supported | Supported |
| [CROSS JOIN](../sql-reference/constructs/join.md) | Supported. You can specify any number of tables in the join, and updates to all tables in the join are reflected in the results of the query. | Supported |
| [INNER JOIN](../sql-reference/constructs/join.md) | Supported. You can specify any number of tables in the join, and updates to all tables in the join are reflected in the results of the query. | Supported |
| [LATERAL](../sql-reference/constructs/join-lateral.md) JOIN | Not supported. However, you can use [LATERAL with FLATTEN()](lateral-join-using.md). For example:  ```sqlexample CREATE TABLE persons  AS   SELECT column1 AS id, parse_json(column2) AS entity   FROM values    (12712555,    '{ name:  { first: "John", last: "Smith"},      contact: [      { business:[        { type: "phone", content:"555-1234" },        { type: "email", content:"j.smith@example.com" } ] } ] }'),    (98127771,     '{ name:  { first: "Jane", last: "Doe"},      contact: [      { business:[        { type: "phone", content:"555-1236" },        { type: "email", content:"j.doe@example.com" } ] } ] }'); ```  ```sqlexample CREATE DYNAMIC TABLE my_dynamic_table  TARGET_LAG = DOWNSTREAM  WAREHOUSE = mywh  AS   SELECT p.id, f.value, f.path   FROM persons p,   LATERAL FLATTEN(input => p.entity) f; ```  Note the following behavior for using lateral flatten with incremental refresh:   * Selecting the flatten SEQ column from a lateral flatten join is not supported. * When using the [AUTO](dynamic-tables-refresh.md) parameter, Snowflake typically chooses incremental refresh for queries with lateral flatten joins, unless prevented by other limitations. | Supported. |
| OUTER-EQUI JOIN. | Supported. You can specify any number of tables in the join, and updates to all tables in the join are reflected in the results of the query. | Supported |
| [{LEFT | RIGHT | FULL }] [OUTER JOIN](querying-joins.md) | The following is not supported:   * Outer joins where both sides are the same table. * Outer joins where both sides are a subquery with GROUP BY clauses. * Outer joins with non-equality predicates.   Otherwise, you can specify any number of tables in an outer join, and updates to all tables in the join are reflected in the results of the query. | Supported |
| [ML or LLM functions](snowflake-cortex/aisql.md) | Supported in the SELECT clause. | Supported |
| [PIVOT](../sql-reference/constructs/pivot.md) and [UNPIVOT](../sql-reference/constructs/unpivot.md) | Not supported | Not supported |
| [SAMPLE / TABLESAMPLE](../sql-reference/constructs/sample.md) | Not supported | Not supported |
| Scalar Aggregates | Supported | Supported |
| [SELECT](../sql-reference/sql/select.md) | Expressions including those using deterministic built-in functions and [immutable](../sql-reference/sql/create-function.md) [user-defined functions](../developer-guide/udf/udf-overview.md). | Supported |
| [Set operators](../sql-reference/operators-query.md) (UNION, MINUS, EXCEPT, INTERSECT) | Not supported, except for UNION. In incremental refresh, the UNION set operator works like the combination of the UNION ALL and SELECT DISTINCT operators. | Supported |
| [Sequences](querying-sequences.md). | Not supported | Not supported |
| All [subquery operators](../sql-reference/operators-subquery.md). | Not supported | Supported |
| [UNION ALL](../sql-reference/operators-query.md) | Supported | Supported |
| [User-defined functions](../developer-guide/udf/udf-overview.md) (UDFs) | Supported, except for the following limitations:   * UDFs written in Python, Java, Scala, or Javascript that specify the [VOLATILE](../sql-reference/sql/create-function.md) parameter are not supported. * UDFs written in SQL that contain subqueries are not supported (for example, a SELECT statement). * Replacing an [IMMUTABLE](../sql-reference/sql/create-function.md) UDF while it’s in use by a dynamic table that uses incremental refresh results in failed refreshes. * Importing UDFs from an external stage is not supported. | Supported |
| [User-defined table functions](../developer-guide/udf/udf-overview.md) (UDTFs) | Supported, except for the following limitations:   * UDTFs written in SQL are not supported. * SELECT blocks that read from UDTFs must explicitly specify columns and can’t use `*`. | Supported |
| [WHERE](../sql-reference/constructs/where.md) / [HAVING](../sql-reference/constructs/having.md) / [QUALIFY](../sql-reference/constructs/qualify.md) | Filters with the same expressions that are valid in SELECT are supported.  Filters with the CURRENT_TIMESTAMP, CURRENT_TIME, and CURRENT_DATE functions and their aliases are supported. | Supported.  Filters with the CURRENT_TIMESTAMP, CURRENT_TIME, and CURRENT_DATE functions and their aliases are supported. |
| [Window functions](../sql-reference/functions-window.md) | Supported, except for the following limitations:   * Using the window functions PERCENT_RANK, DENSE_RANK, RANK with sliding window frames is not supported. * Using ANY_VALUE is not supported since it’s a non-deterministic function. | Supported |
| [WITH](../sql-reference/constructs/with.md) | [Common table expressions (CTEs)](queries-cte.md) that use incremental refresh supported features in the subquery are supported.  WITH RECURSIVE is not supported. | Supported |

## Supported non-deterministic functions in incremental and full refresh modes

| Non-deterministic Function | Incremental Refresh Mode | Full Refresh Mode |
| --- | --- | --- |
| [ANY_VALUE](../sql-reference/functions/any_value.md) | Not supported | Not supported |
| [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](../sql-reference/functions/classify_text-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [COMPLETE (SNOWFLAKE.CORTEX)](../sql-reference/functions/complete-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) | Not supported | Supported |
| [CURRENT_DATE](../sql-reference/functions/current_date.md) (and aliases) | Supported only as a part of a WHERE/HAVING/QUALIFY clause. | Supported only as a part of a WHERE/HAVING/QUALIFY clause. |
| [CURRENT_REGION](../sql-reference/functions/current_region.md) | Not supported | Supported |
| [CURRENT_ROLE](../sql-reference/functions/current_role.md) | Not supported | Supported |
| [CURRENT_TIME](../sql-reference/functions/current_time.md) (and aliases) | Supported only as a part of a WHERE/HAVING/QUALIFY clause. | Supported only as a part of a WHERE/HAVING/QUALIFY clause. |
| [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) (and aliases) | Supported only as a part of a WHERE/HAVING/QUALIFY clause. | Supported only as a part of a WHERE/HAVING/QUALIFY clause. |
| Functions that rely on [CURRENT_USER](../sql-reference/functions/current_user.md). | Not supported. Dynamic table refreshes act as their owner role with a special SYSTEM user. | Not supported. Dynamic table refreshes act as their owner role with a special SYSTEM user. |
| [CURRENT_WAREHOUSE](../sql-reference/functions/current_warehouse.md) | Not supported | Supported |
| [DENSE_RANK](../sql-reference/functions/dense_rank.md) | Supported | Supported |
| [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](../sql-reference/functions/embed_text-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../sql-reference/functions/embed_text_1024-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](../sql-reference/functions/extract_answer-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [FINETUNE (SNOWFLAKE.CORTEX)](../sql-reference/functions/finetune-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [FIRST_VALUE](../sql-reference/functions/first_value.md) | Supported | Supported |
| [LAST_VALUE](../sql-reference/functions/last_value.md) | Supported | Supported |
| [MAX_BY](../sql-reference/functions/max_by.md) | Supported | Supported |
| [MIN_BY](../sql-reference/functions/min_by.md) | Supported | Supported |
| [NTH_VALUE](../sql-reference/functions/nth_value.md) | Supported | Supported |
| [RANK](../sql-reference/functions/rank.md) | Supported | Supported |
| [ROW_NUMBER](../sql-reference/functions/row_number.md) | Supported | Supported |
| [SENTIMENT (SNOWFLAKE.CORTEX)](../sql-reference/functions/sentiment-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [Sequence functions](../sql-reference/functions/seq1.md) (e.g., `SEQ1`, `SEQ2`) | Not supported | Supported |
| [TRANSLATE (SNOWFLAKE.CORTEX)](../sql-reference/functions/translate-snowflake-cortex.md) | Supported in the SELECT clause | Supported |
| [VOLATILE](../sql-reference/sql/create-function.md) user-defined functions | Not supported | Supported |

## Supported Snowflake Cortex AI functions

You can use [Snowflake Cortex AI Functions (including LLM functions)](snowflake-cortex/aisql.md) in the SELECT clause for dynamic tables in incremental refresh mode. The same
availability restrictions as described in [Cortex AI functions](snowflake-cortex/aisql.md) apply.

Cortex AI Functions let you add AI-powered insights directly to your dynamic tables, automatically analyzing data as it updates. For example, it can
classify customer reviews, support tickets, or survey responses as positive/negative or assign categories.

In the following example, `review_sentiment` uses AI_FILTER to evaluate each review with an LLM. Cortex AI Functions combine
the prompt `The reviewer enjoyed the restaurant` with the actual review text. The output column `enjoyed` is the classification
generated using Cortex AI Functions based on the prompt, indicating whether the reviewer enjoyed the restaurant.

```sqlexample
CREATE OR REPLACE TABLE reviews AS
  SELECT 'Wow... Loved this place.' AS review
  UNION ALL
  SELECT 'The pizza is not good.' AS review;

CREATE OR REPLACE DYNAMIC TABLE review_sentiment
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
  AS
    SELECT review, AI_FILTER(CONCAT('The reviewer enjoyed the restaurant', review), {'model': 'llama3.1-70b'}) AS enjoyed FROM reviews;
```

---
title: Suspend or resume dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-suspend-resume.md
section: User Guide
---

# Suspend or resume dynamic tables

This topic discusses why dynamic tables automatically suspend and how to manually suspend or resume your dynamic tables.

Suspended dynamic tables aren’t automatically refreshed; you can [manually refresh them](dynamic-tables-manual-refresh.md).

## Automatic dynamic table suspension

Dynamic tables are automatically suspended after five consecutive scheduled refresh errors. A successful refresh, including a manual refresh,
resets the error count to zero. For example, if a table fails two consecutive scheduled refreshes, then succeeds on the next, the error count
resets to zero.

Errors from manually triggered refreshes don’t count toward this limit.

Any dynamic tables dependent on a suspended table are also suspended.

You can view the current state (ACTIVE or SUSPENDED) of your dynamic tables using one of the following options:

SQLSnowsight

Execute the [DYNAMIC_TABLE_GRAPH_HISTORY](../sql-reference/functions/dynamic_table_graph_history.md) table function:

```sqlexample
SELECT name, scheduling_state
  FROM TABLE (INFORMATION_SCHEMA.DYNAMIC_TABLE_GRAPH_HISTORY());
```

In the output, the `SCHEDULING_STATE` column shows the state of your dynamic table (ACTIVE or SUSPENDED):

```output
+-------------------+---------------------------------------------------------------------------------+
  | NAME              | SCHEDULING_STATE                                                                |
  |-------------------+---------------------------------------------------------------------------------|
  | DTSIMPLE          | {                                                                               |
  |                   |   "reason_code": "SUSPENDED_DUE_TO_ERRORS",                                     |
  |                   |   "reason_message": "The DT was suspended due to 5 consecutive refresh errors", |
  |                   |   "state": "SUSPENDED",                                                         |
  |                   |   "suspended_on": "2023-06-06 19:27:29.142 -0700"                               |
  |                   | }                                                                               |
  | DT_TEST           | {                                                                               |
  |                   |   "state": "ACTIVE"                                                             |
  |                   | }                                                                               |
  +-------------------+---------------------------------------------------------------------------------+
```

To view the state of your dynamic tables, sign in to [Snowsight](ui-snowsight-gs.md). In the navigation menu, select Transformation » Dynamic tables.

You can view the state and last refresh status for your dynamic tables on this page. You can also filter by database or schema to narrow
the results.

## Manually suspend dynamic tables

Manually suspend a dynamic table when you don’t need it now but want to avoid refresh costs without dropping it, keeping it available for
future use. Suspension can also give you better control over refresh frequency, for example, if skips occur and you need time for
troubleshooting.

If you want to ensure refreshes at a specific time or occurrence, you can use a task or script that runs regularly to execute a manual
refresh because dynamic tables don’t guarantee exact refresh timing. This allows precise control over when your table refreshes.

You can use either the ALTER DYNAMIC TABLE … SUSPEND command or Snowsight to manually suspend dynamic tables, with the following limitations:

* Suspending a dynamic table also suspends the dynamic tables that are [downstream](dynamic-tables-target-lag.md) from it.
* Suspending a dynamic table with incremental refresh beyond the Time Travel retention period of its base tables will cause it to fail on the
  next refresh after the dynamic table resumes.

SQLSnowsight

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SUSPEND;
```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Find your dynamic table in the list and then select  » Suspend.
4. In the popup, confirm that you want to suspend your dynamic table.

## Resume dynamic tables

To resume your dynamic tables, use either the ALTER DYNAMIC TABLE … RESUME command or Snowsight.

SQLSnowsight

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table RESUME;
```

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Dynamic tables.
3. Find your dynamic table in the list and then select  » Resume.
4. In the popup, confirm that you want to resume your dynamic table.

---
title: Sync a Snowflake-managed table with Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-open-catalog-sync.md
section: User Guide
---

# Sync a Snowflake-managed table with Snowflake Open Catalog

To query a Snowflake-managed Apache Iceberg™ table using a third-party engine such as Apache Spark™, you can sync the table with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).

> **Note:**
>
> Alternatively, to query a Snowflake-managed Iceberg table by using a third-party engine, you can use an external query engine through Snowflake
> Horizon Catalog. By using an external query engine through Horizon, you don’t need to sync the tables with Open Catalog. For more information,
> see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

This topic explains how to sync a Snowflake-managed Iceberg table with Snowflake Open Catalog by using a catalog integration in Snowflake and an external catalog
in Open Catalog.

> **Important:**
>
> If your third-party engine can only query tables located up to the second namespace level in a catalog, you must sync
> the Snowflake-managed Iceberg table to Open Catalog with one parent namespace. Otherwise, Snowflake will sync the table to the third namespace
> level in Open Catalog and you can’t query the table.
>
> To sync a Snowflake-managed Iceberg table with one parent namespace instead of two, set the CATALOG_SYNC_NAMESPACE_MODE property to `FLATTEN`
> when you create the database. For information, see [CREATE DATABASE](../sql-reference/sql/create-database.md). You can’t alter this mode for an existing database.
> Tables in an existing database with CATALOG_SYNC enabled will sync to Open Catalog with two parent namespaces.

## Step 1: Set a BASE_LOCATION_PREFIX

Snowflake writes the files for each Iceberg table under a directory that includes a dynamically generated
string (random ID).

To ensure that Open Catalog can see all of the Snowflake-managed tables that you sync, we recommend that you use a
[BASE_LOCATION_PREFIX](../sql-reference/parameters.md) (such as `my-open-catalog-tables`) at the account, database, or schema level, and
omit the BASE_LOCATION parameter in your CREATE ICEBERG TABLE statements. Doing so organizes the files for all Iceberg tables
that you create in the account, database, or schema under a known directory with the same name as the prefix. For more information, see
[Data and metadata directories for Snowflake-managed tables](tables-iceberg-storage.md).

The following statement sets a BASE_LOCATION_PREFIX for a schema named `open_catalog`:

```sqlexample
ALTER SCHEMA open_catalog
  SET BASE_LOCATION_PREFIX = 'my-open-catalog-tables';
```

## Step 2: Create an external volume

If you don’t have one already, start by creating an external volume in Snowflake that provides access to the
cloud storage location where you want to store your table data and metadata.

> **Note:**
>
> Don’t include the BASE_LOCATION_PREFIX in the path that you specify for the STORAGE_BASE_URL.

Complete the instructions for your cloud storage service:

* [Amazon S3](tables-iceberg-configure-external-volume-s3.md)
* [Google Cloud Storage](tables-iceberg-configure-external-volume-gcs.md)
* [Azure Storage](tables-iceberg-configure-external-volume-azure.md)

## Step 3: Configure Open Catalog resources

Next, complete the steps in this section to create an external catalog and service connection in your Open Catalog account.

1. Follow the instructions in [Create a catalog](https://other-docs.snowflake.com/en/opencatalog/create-catalog)
   to create an external catalog in your Open Catalog account. Make sure that the following settings for the external catalog are configured:

   * The External toggle is enabled.
   * The Default base location combines the `STORAGE_BASE_URL`
     for the external volume you created in Step 2: Create an external volume and the `BASE_LOCATION_PREFIX`
     that you set for the schema; for example `s3://<storage_base_url>/<base_url_prefix>/`.

   Open Catalog syncs your Snowflake-managed tables to this external catalog.
2. If you don’t already have a service connection for Snowflake, follow the instructions in [Configure a service connection](https://other-docs.snowflake.com/en/opencatalog/configure-service-connection#configure-a-service-connection)
   to create a connection for the Snowflake engine in your Open Catalog account.
3. Configure a catalog role for your external catalog with privileges that allow access to your external catalog.
   For instructions, see [Grant privileges to a catalog](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#step-2-grant-privileges-to-a-catalog).

   The catalog role must have the following privileges on the catalog:

   * TABLE_CREATE
   * TABLE_WRITE_PROPERTIES
   * TABLE_DROP
   * NAMESPACE_CREATE
   * NAMESPACE_DROP

   You can either grant each of these privileges to the catalog role, or grant the CATALOG_MANAGE_CONTENT privilege, which includes
   these privileges. For more information, see
   [Catalog privileges for Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/access-control#catalog-privileges).
4. Attach the catalog role to the principal role for your service connection. This lets the service connection access the catalog.
   For instructions, see [Grant a catalog role to a principal role](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#step-3-grant-a-catalog-role-to-a-principal-role).

## Step 4: Create a catalog integration for Open Catalog

Create a catalog integration for Open Catalog by using the [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](../sql-reference/sql/create-catalog-integration-open-catalog.md) command.

For CATALOG_NAME, specify the name of the external catalog that you configured in your Open Catalog account. Snowflake syncs the table and its parent
namespace in Snowflake to this external catalog in Open Catalog. For example, if you have a `db1.public.table1` Iceberg table registered in
Snowflake and you specify `catalog1` in the catalog integration, Snowflake syncs the table with Open Catalog with the following fully
qualified name: `catalog1.db1.public.table1`.

To troubleshoot issues with creating a catalog integration, see [You can’t create a catalog integration for Open Catalog](tables-iceberg-open-catalog-troubleshooting.md).

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_open_catalog_int
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  REST_CONFIG = (
    CATALOG_URI = 'https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = 'myOpenCatalogExternalCatalogName'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'myClientId'
    OAUTH_CLIENT_SECRET = 'myClientSecret'
    OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:ALL')
  )
  ENABLED = TRUE;
```

> **Note:**
>
> You can use this catalog integration to sync one or more Snowflake-managed tables.

## Step 5: Set up catalog sync

For Snowflake to sync Snowflake-managed Iceberg tables to Open Catalog, you must specify the external catalog in Open Catalog that Snowflake
should sync the tables to. To configure this, you set the CATALOG_SYNC parameter to the name of a catalog integration for Open Catalog.

* Set CATALOG_SYNC at the database level
* Set CATALOG_SYNC at the schema level

### Set CATALOG_SYNC at the database level

This example sets the CATALOG_SYNC parameter at the database level. After you run these statements, Snowflake syncs all Snowflake-managed Iceberg tables in
the `db1` database to the external catalog in Open Catalog that you specified for the `my_open_catalog_int` catalog integration.
For more information, see the [ALTER DATABASE](../sql-reference/sql/alter-database.md) command.

```sqlexample
ALTER DATABASE db1 SET CATALOG_SYNC = 'my_open_catalog_int';
```

You can also set CATALOG_SYNC at the database level when you create a database. For example:

```sqlexample
CREATE DATABASE db2
  CATALOG_SYNC = 'my_open_catalog_int';
```

For more information, see [CREATE DATABASE](../sql-reference/sql/create-database.md).

### Set CATALOG_SYNC at the schema level

This example sets the CATALOG_SYNC parameter at the schema level. After you run these statements, Snowflake syncs all Snowflake-managed Iceberg tables in the
`public` schema to the external catalog in Open Catalog that you specified for the `my_open_catalog_int` catalog integration. For more
information, see the [ALTER SCHEMA](../sql-reference/sql/alter-schema.md) command.

```sqlexample
ALTER SCHEMA public SET CATALOG_SYNC = 'my_open_catalog_int';
```

You can also set CATALOG_SYNC at the schema level when you create a schema. For example:

```sqlexample
CREATE SCHEMA schema1
  CATALOG_SYNC = 'my_open_catalog_int';
```

For more information, see [CREATE SCHEMA](../sql-reference/sql/create-schema.md).

> **Note:**
>
> * You can also do the following:
>
>   + Set CATALOG_SYNC at the account or table level.
>   + Override CATALOG_SYNC at different levels. For example, you can set CATALOG_SYNC
>     at the database level but then override its value for the `myschema` schema within the database. As a result, the Snowflake-managed
>     Iceberg tables in the `myschema` schema sync to a different external catalog in Open Catalog than the other Snowflake-managed
>     Iceberg tables in the database.
>
>   For more information, see [CATALOG_SYNC](../sql-reference/parameters.md) and [Parameter hierarchy and types](../sql-reference/parameters.md).
> * To see the name of the catalog integration for Open Catalog that a Snowflake-managed Iceberg table syncs to, run the [SHOW ICEBERG TABLES](../sql-reference/sql/show-iceberg-tables.md)
>   command and see the `catalog_sync_name` column in the output.

## Step 6: Create a Snowflake-managed table

Create a Snowflake-managed Iceberg table by using the [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../sql-reference/sql/create-iceberg-table-snowflake.md) command.

> **Important:**
>
> To ensure that access privileges in Open Catalog are enforced correctly on the table, make sure the table meets certain conditions
> before creating it. These conditions relate to the directory structure hierarchy for the catalog. For these conditions and instructions on
> how to meet them, see the note in
> [Organize catalog content](https://other-docs.snowflake.com/en/opencatalog/organize-catalog-content#conditions-correct-access-privileges)
> in the Snowflake Open Catalog documentation.

```sqlexample
USE SCHEMA open_catalog;

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (col1 INT)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume';
```

For the BASE_LOCATION_PREFIX (`my-open-catalog-tables`) and table name (`my_iceberg_table`) used in the previous example statements,
Snowflake writes the table files to the following paths:

* `STORAGE_BASE_URL/my-open-catalog-tables/my_iceberg_table.randomId/data/`
* `STORAGE_BASE_URL/my-open-catalog-tables/my_iceberg_table.randomId/metadata/`

When you modify the table in Snowflake, the changes are automatically synchronized with the external catalog in your Open Catalog account. Other
engines such as Apache Spark™ can query the table by connecting to Open Catalog.

To troubleshoot issues with creating a Snowflake-managed table, see [You can’t create a Snowflake-managed table](tables-iceberg-open-catalog-troubleshooting.md).

---
title: System data metric functions
source: https://docs.snowflake.com/en/user-guide/data-quality-system-dmfs.md
section: User Guide
---

# System data metric functions

This topic is a reference for the system data metric functions (DMFs) that Snowflake provides to all accounts. DMFs are the building block
of [data quality checks](data-quality-intro.md).

## About system DMFs

Snowflake provides system DMFs in the CORE schema of the shared [SNOWFLAKE database](../sql-reference/snowflake-db.md). System DMFs are
maintained by Snowflake; you cannot change the name or functionality of any system DMF.

Each system DMF enables you to measure a different data quality attribute. You can assign more than one system DMF to a table or view to
allow for a more comprehensive data quality measurement to address your governance and compliance needs.

## System DMFs

Currently, Snowflake supports these system DMFs to measure common metrics without having to define them:

| Category | System DMF | Description |
| --- | --- | --- |
| Accuracy | [BLANK_COUNT](../sql-reference/functions/dmf_blank_count.md) | Determine how many blank values are in a column. |
|  | [BLANK_PERCENT](../sql-reference/functions/dmf_blank_percent.md) | Determine what percentage of a column’s values are blank. |
|  | [NULL_COUNT](../sql-reference/functions/dmf_null_count.md) | Determine how many NULL values are in a column. |
|  | [NULL_PERCENT](../sql-reference/functions/dmf_null_percent.md) | Determine what percentage of a column’s values are NULL. |
| Freshness | [FRESHNESS](../sql-reference/functions/dmf_freshness.md) | Determine the freshness of a table’s data based on a timestamp column or the most recent [DML operation](../sql-reference/sql-dml.md). |
|  | [DATA_METRIC_SCHEDULE_TIME](../sql-reference/functions/dmf_data_metric_schedule_time.md) | Define custom freshness metrics. |
| Statistics | [AVG](../sql-reference/functions/dmf_avg.md) | Determine the average value of a column. |
|  | [MAX](../sql-reference/functions/dmf_max.md) | Determine the maximum value of a column. |
|  | [MIN](../sql-reference/functions/dmf_min.md) | Determine the minimum value of a column. |
|  | [STDDEV](../sql-reference/functions/dmf_stddev.md) | Determine the standard deviation value for a column. |
| Uniqueness | [ACCEPTED_VALUES](../sql-reference/functions/dmf_accepted_values.md) | Determine whether values in a column match a Boolean expression. |
|  | [DUPLICATE_COUNT](../sql-reference/functions/dmf_duplicate_count.md) | Determine the number of duplicate values in a column, including NULL values. |
|  | [UNIQUE_COUNT](../sql-reference/functions/dmf_unique_count.md) | Determine the number of unique, non-NULL values in a column. |
| Volume | [ROW_COUNT](../sql-reference/functions/dmf_row_count.md) | Determine how many records are in the table or view. |

---
title: Table Design Considerations
source: https://docs.snowflake.com/en/user-guide/table-considerations.md
section: User Guide
---

# Table Design Considerations

This topic provides best practices, general guidelines, and important considerations when designing and managing tables.

## Date/Time Data Types for Columns

When defining columns to contain dates or timestamps, Snowflake recommends choosing a
[date or timestamp data type](../sql-reference/data-types-datetime.md) rather than a character data type. Snowflake stores DATE and
TIMESTAMP data more efficiently than VARCHAR, resulting in better query performance. Choose an appropriate date or timestamp data type,
depending on the level of granularity required.

## Referential Integrity Constraints

When they are created on standard tables, referential integrity constraints, as defined by primary-key/foreign-key relationships, are informational; they are not enforced. NOT NULL constraints are enforced, but other constraints are not. However, constraints on
[hybrid tables](tables-hybrid.md) are enforced; see [Overview of constraints](../sql-reference/constraints-overview.md).

In general, constraints provide valuable metadata. Primary and foreign keys enable your project team to understand the schema design and see the relationships between the tables and their columns.

Additionally, most business intelligence (BI) and visualization tools import the foreign key definitions with the tables and build the
proper join conditions. This approach saves time and is potentially less prone to error than someone having to guess how to join
the tables and manually configure the tool. Basing joins on primary and foreign keys also brings integrity to the design,
because the joins are not left to different developers to interpret. Some BI and visualization tools also take advantage of constraint
information to rewrite queries more efficiently, for example, by using join elimination.

Specify a constraint when creating or modifying a table using the [CREATE | ALTER TABLE … CONSTRAINT](../sql-reference/sql/create-table-constraint.md) commands.

In the following example, the CREATE TABLE statement for the second table (`salesorders`) defines an out-of-line foreign key constraint that references a column in the first table (`salespeople`):

SQLPython

```sqlexample
CREATE OR REPLACE TABLE salespeople (
  sp_id INT NOT NULL UNIQUE,
  name VARCHAR DEFAULT NULL,
  region VARCHAR,
  constraint pk_sp_id PRIMARY KEY (sp_id)
);
CREATE OR REPLACE TABLE salesorders (
  order_id INT NOT NULL UNIQUE,
  quantity INT DEFAULT NULL,
  description VARCHAR,
  sp_id INT NOT NULL UNIQUE,
  constraint pk_order_id PRIMARY KEY (order_id),
  constraint fk_sp_id FOREIGN KEY (sp_id) REFERENCES salespeople(sp_id)
);
```

```python
from snowflake.core import CreateMode
from snowflake.core.table import ForeignKey, PrimaryKey, Table, TableColumn, UniqueKey

my_table = Table(
  name="salespeople",
  columns=[
      TableColumn(name="sp_id", datatype="int", nullable=False, constraints=[UniqueKey(name='unk')]),
      TableColumn(name="name", datatype="varchar", default="NULL"),
      TableColumn(name="region", datatype="varchar")
  ],
  constraints=[PrimaryKey(name="pk_sp_id", column_names=["sp_id"])]
)
root.databases["<database>"].schemas["<schema>"].tables.create(my_table, mode=CreateMode.or_replace)

my_table = Table(
  name="salesorders",
  columns=[
      TableColumn(name="order_id", datatype="int", nullable=False, constraints=[UniqueKey(name='unk')]),
      TableColumn(name="quantity", datatype="int", default="NULL"),
      TableColumn(name="description", datatype="varchar"),
      TableColumn(name="sp_id", datatype="int", nullable=False, constraints=[UniqueKey(name='unk')])
  ],
  constraints=[
      ForeignKey(referenced_table_name = "salespeople", referenced_column_names=["sp_id"], name="fk_sp_id", column_names=["sp_id"]),
      PrimaryKey(name="pk_order_id", column_names=["order_id"])
  ]
)
root.databases["<database>"].schemas["<schema>"].tables.create(my_table, mode=CreateMode.or_replace)
```

Query the [GET_DDL](../sql-reference/functions/get_ddl.md) function to retrieve a DDL statement that could be executed to recreate the specified
table. The statement includes the constraints currently set on a table.

For example:

```sqlexample
SELECT GET_DDL('TABLE', 'mydb.public.salesorders');
```

```output
+-----------------------------------------------------------------------------------------------------+
| GET_DDL('TABLE', 'MYDB.PUBLIC.SALESORDERS')                                                         |
|-----------------------------------------------------------------------------------------------------|
| create or replace TABLE SALESORDERS (                                                               |
|   ORDER_ID NUMBER(38,0) NOT NULL,                                                                   |
|   QUANTITY NUMBER(38,0),                                                                            |
|   DESCRIPTION VARCHAR(16777216),                                                                    |
|   SP_ID NUMBER(38,0) NOT NULL,                                                                      |
|   unique (SP_ID),                                                                                   |
|   constraint PK_ORDER_ID primary key (ORDER_ID),                                                    |
|   constraint FK_SP_ID foreign key (SP_ID) references MYDATABASE.PUBLIC.SALESPEOPLE(SP_ID)           |
| );                                                                                                  |
+-----------------------------------------------------------------------------------------------------+
```

Alternatively, retrieve a list of all table constraints by schema (or across all schemas in a database) by querying the
[TABLE_CONSTRAINTS view](../sql-reference/info-schema/table_constraints.md) in the Information Schema.

For example:

```sqlexample
SELECT table_name, constraint_type, constraint_name
  FROM mydb.INFORMATION_SCHEMA.TABLE_CONSTRAINTS
  WHERE constraint_schema = 'PUBLIC'
  ORDER BY table_name;
```

```output
+-------------+-----------------+-----------------------------------------------------+
| TABLE_NAME  | CONSTRAINT_TYPE | CONSTRAINT_NAME                                     |
|-------------+-----------------+-----------------------------------------------------|
| SALESORDERS | UNIQUE          | SYS_CONSTRAINT_fce2257e-c343-4e66-9bea-fc1c041b00a6 |
| SALESORDERS | FOREIGN KEY     | FK_SP_ID                                            |
| SALESORDERS | PRIMARY KEY     | PK_ORDER_ID                                         |
| SALESORDERS | UNIQUE          | SYS_CONSTRAINT_bf90e2b3-fd4a-4764-9576-88fb487fe989 |
| SALESPEOPLE | PRIMARY KEY     | PK_SP_ID                                            |
+-------------+-----------------+-----------------------------------------------------+
```

## When to Set a Clustering Key

Specifying a [clustering key](tables-clustering-keys.md) is not necessary for most tables. Snowflake performs automatic tuning via the
optimization engine and micro-partitioning. In many cases, data is loaded and organized into micro-partitions by date or timestamp, and
is queried along the same dimension.

When should you specify a clustering key for a table? First, note that clustering a small table typically doesn’t improve query performance
significantly.

For larger data sets, you might consider specifying a clustering key for a table when:

* The order in which the data is loaded doesn’t match the dimension by which it is most commonly queried (for example, the data is loaded by
  date, but reports filter the data by ID). If your existing scripts or reports query the data by both date and ID (and potentially
  a third or fourth column), you might see some performance improvement by creating a multi-column clustering key.
* [Query Profile](ui-snowsight-activity.md) indicates that a significant percentage of the total duration time for typical
  queries against the table is spent scanning. This applies to queries that filter on one or more specific columns.

Note that reclustering rewrites existing data with a different order. The previous ordering is stored for 7 days to provide Fail-safe
protection. Reclustering a table incurs compute costs that correlate to the size of the data that is reordered.

For more information, see [Automatic Clustering](tables-auto-reclustering.md).

## When to Specify Column Lengths

Snowflake compresses column data effectively; therefore, creating columns larger than necessary has minimal impact on the size of data
tables. Likewise, there is no query performance difference between a column with a maximum length declaration (for example, `VARCHAR(134217728)`),
and a smaller precision.

However, when the size of your column data is predictable, Snowflake recommends defining an appropriate column length, for the following
reasons:

* Data loading operations are more likely to detect issues such as columns loaded out of order; for example, a 50-character string loaded
  erroneously into a VARCHAR(10) column. Such issues produce errors.
* When the column length is unspecified, some third-party tools might anticipate consuming the maximum size value, which can translate into
  increased client-side memory usage or unusual behavior.

## Storing Semi-structured Data in a VARIANT Column vs. Flattening the Nested Structure

If you are not sure yet what types of operations you want perform on your semi-structured data, Snowflake recommends storing the data in a
VARIANT column for now. For data that is mostly regular and uses only native types (strings and integers), the storage requirements and
query performance for operations on relational data and data in a VARIANT column is very similar.

For better pruning and less storage consumption, Snowflake recommends flattening your object and key data into separate relational columns
if your semi-structured data includes:

* Dates and timestamps, especially non-ISO 8601 dates and timestamps, as string values
* Numbers within strings
* Arrays

Non-native values such as dates and timestamps are stored as strings when loaded into a VARIANT column, so operations on these values
could be slower and also consume more space than when stored in a relational column with the corresponding data type.

If you know your use cases for the data, perform tests on a typical data set. Load the data set into a VARIANT column in a table. Use the
FLATTEN function to extract the objects and keys you plan to query into a separate table. Run a typical set of queries against both tables
to see which structure provides the best performance.

## Converting a Permanent Table to a Transient Table or Vice-Versa

Currently, it is not possible to change a permanent table to a [transient](tables-temp-transient.md) table using the
[ALTER TABLE](../sql-reference/sql/alter-table.md) command. The TRANSIENT property is set at table creation and cannot be modified.

Similarly, it is not possible to directly change a transient table to a permanent table.

To convert an existing permanent table to a transient table (or vice versa) while preserving data and other
characteristics such as column defaults and granted privileges, you can create a new table using one of the interfaces as described in the
following examples:

SQLPython

Use the COPY GRANTS clause of the CREATE TABLE command:

```sqlexample
CREATE TRANSIENT TABLE my_new_table LIKE my_old_table COPY GRANTS;
```

Use the `like_table` and `copy_grants` arguments of the [TableCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.TableCollection) method:

```python
from snowflake.core.table import Table

my_table = Table(
  name="my_new_table",
  kind="TRANSIENT"
)
tables = root.databases["<database>"].schemas["<schema>"].tables
tables.create(my_table, like_table="my_old_table", copy_grants=True)
```

Then use the INSERT command to copy the data:

```sqlexample
INSERT INTO my_new_table SELECT * FROM my_old_table;
```

If you want to preserve all of the data, but not the granted privileges and other characteristics, you can use one of the following
interfaces:

SQLPython

Use a [CREATE TABLE AS SELECT (CTAS)](../sql-reference/sql/create-table.md) statement:

```sqlexample
CREATE TRANSIENT TABLE my_transient_table AS SELECT * FROM mytable;
```

Use the `as_select` argument of the [TableCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.TableCollection) method:

```python
from snowflake.core.table import Table

my_table = Table(
  name="my_transient_table",
  kind="TRANSIENT"
)
tables = root.databases["<database>"].schemas["<schema>"].tables
tables.create(my_table, as_select="SELECT * FROM mytable")
```

Another way to make a copy of a table (but change the lifecycle from permanent to transient) is to clone the table using one of the
following interfaces:

SQLPython

Use the CLONE clause of the CREATE TABLE command:

```sqlexample
CREATE TRANSIENT TABLE foo CLONE bar COPY GRANTS;
```

Use the `clone_table` argument of the [TableCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.TableCollection) method:

```python
from snowflake.core.table import Table

my_table = Table(
  name="foo",
  kind="TRANSIENT"
)
tables = root.databases["<database>"].schemas["<schema>"].tables
tables.create(my_table, clone_table="bar", copy_grants=True)
```

Old partitions are *not* affected (they do not become transient), but new partitions added to the clone
will follow the transient lifecycle.

You cannot clone a transient table to a permanent table.

---
title: Table support and schema
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-table-support.md
section: User Guide
---

# Table support and schema

This topic covers the table types, data types, and schema capabilities supported by Snowpipe Streaming.

## Apache Iceberg™ table support

Snowpipe Streaming supports ingestion into Snowflake-managed Apache Iceberg™ tables, including both Iceberg v2 and [Iceberg v3](../tables-iceberg-v3-specification-support.md) tables. For more information, see [Snowpipe Streaming high-performance architecture with Apache Iceberg™ tables](snowpipe-streaming-high-performance-iceberg.md).

## Schema evolution

Snowpipe Streaming supports automatic table schema evolution. When enabled, Snowflake can automatically add new columns that are detected in the incoming stream and drop NOT NULL constraints to accommodate new data patterns. For more information, see [Table schema evolution](../data-load-schema-evolution.md).

Limitations of schema evolution:

* Supported exclusively for standard Snowflake tables. External tables and Apache Iceberg™ tables aren’t supported.
* The precision, scale, or length of existing columns can’t be increased automatically.
* Schema evolution isn’t supported for structured data types. However, new columns that contain structured types are inferred as VARIANT.

## Insert-only operations

The API is currently limited to inserting rows. To modify, delete, or combine data, write the “raw” records to one or more staging tables. Merge, join, or transform the data by using [continuous data pipelines](../data-pipelines-intro.md) to insert modified data into destination reporting tables.

## Supported Java data types

The following table summarizes which Java data types are supported for ingestion into Snowflake columns:

| Snowflake column type | Allowed Java data type |
| --- | --- |
| * CHAR * VARCHAR | * String * primitive data types (int, boolean, char, …) * BigInteger, BigDecimal |
| * BINARY | * byte[] * String (hex-encoded) |
| * NUMBER | * numeric types (BigInteger, BigDecimal, byte, int, double, …) * String |
| * FLOAT | * numeric types (BigInteger, BigDecimal, byte, int, double, …) * String |
| * BOOLEAN | * boolean * numeric types (BigInteger, BigDecimal, byte, int, double, …) * String   See [boolean conversion details](../../sql-reference/data-types-logical.md). |
| * TIME | * java.time.LocalTime * java.time.OffsetTime * String    + [Integer-stored time](../../sql-reference/date-time-input-output.md)   + `HH24:MI:SS.FFTZH:TZM` (for example, `20:57:01.123456789+07:00`)   + `HH24:MI:SS.FF` (for example, `20:57:01.123456789`)   + `HH24:MI:SS` (for example, `20:57:01`)   + `HH24:MI` (for example, `20:57`) |
| * DATE | * java.time.LocalDate * java.time.LocalDateTime * java.time.OffsetDateTime * java.time.ZonedDateTime * java.time.Instant * String    + [Integer-stored date](../../sql-reference/date-time-input-output.md)   + `YYYY-MM-DD` (for example, `2013-04-28`)   + `YYYY-MM-DDTHH24:MI:SS.FFTZH:TZM` (for example, `2013-04-28T20:57:01.123456789+07:00`)   + `YYYY-MM-DDTHH24:MI:SS.FF` (for example, `2013-04-28T20:57:01.123456`)   + `YYYY-MM-DDTHH24:MI:SS` (for example, `2013-04-28T20:57:01`)   + `YYYY-MM-DDTHH24:MI` (for example, `2013-04-28T20:57`)   + `YYYY-MM-DDTHH24:MI:SSTZH:TZM` (for example, `2013-04-28T20:57:01-07:00`)   + `YYYY-MM-DDTHH24:MITZH:TZM` (for example, `2013-04-28T20:57-07:00`) |
| * TIMESTAMP_NTZ * TIMESTAMP_LTZ * TIMESTAMP_TZ | * java.time.LocalDate * java.time.LocalDateTime * java.time.OffsetDateTime * java.time.ZonedDateTime * java.time.Instant * String    + [Integer-stored timestamp](../../sql-reference/date-time-input-output.md)   + `YYYY-MM-DD` (for example, `2013-04-28`)   + `YYYY-MM-DDTHH24:MI:SS.FFTZH:TZM` (for example, `2013-04-28T20:57:01.123456789+07:00`)   + `YYYY-MM-DDTHH24:MI:SS.FF` (for example, `2013-04-28T20:57:01.123456`)   + `YYYY-MM-DDTHH24:MI:SS` (for example, `2013-04-28T20:57:01`)   + `YYYY-MM-DDTHH24:MI` (for example, `2013-04-28T20:57`)   + `YYYY-MM-DDTHH24:MI:SSTZH:TZM` (for example, `2013-04-28T20:57:01-07:00`)   + `YYYY-MM-DDTHH24:MITZH:TZM` (for example, `2013-04-28T20:57-07:00`) |
| * VARIANT * ARRAY | * String (must be a valid JSON) * primitive data types and their arrays * BigInteger, BigDecimal * java.time.LocalTime * java.time.OffsetTime * java.time.LocalDate * java.time.LocalDateTime * java.time.OffsetDateTime * java.time.ZonedDateTime * java.util.Map<String, T> where T is a valid VARIANT type * T[] where T is a valid VARIANT type * List<T> where T is a valid VARIANT type |
| * OBJECT | * String (must be a valid JSON object) * Map<String, T> where T is a valid variant type |
| * GEOGRAPHY | * Supported |
| * GEOMETRY | * Supported |

---
title: Tag inheritance
source: https://docs.snowflake.com/en/user-guide/object-tagging/inheritance.md
section: User Guide
---

# Tag inheritance

A tag is inherited based on the Snowflake securable object hierarchy. A descendant of an object in the hierarchy inherits tags from its
ancestors. For example, a schema in an account inherits tags set on the account. Similarly, if a tag is applied to a table, the tag gets
applied to the columns in that table.

The following diagram shows the Snowflake securable object hierarchy:

> **Note:**
>
> Tag inheritance does not include propagation to nested objects. In the following example, `materialized_view_1` does not inherit
> tags from `table_1` or `view_1`.
>
> `table_1` » `view_1` » `materialized_view_1`
>
> If you want tags from `view_1` to get automatically assigned to `materialized_view_1`, see
> [Automatic tag propagation with user-defined tags](propagation.md).

## Overriding tag inheritance

It’s possible to override the value of an inherited tag on a given object by [manually setting](work.md) the tag on
the object. For example, if a table column inherits the tag named `cost_center` with a tag string value called `sales`, the tag can be
updated with a more specific tag string value such as `sales_na`, to specify the North America sales cost center.

The value of an inherited tag is overwritten when the tag is applied to the object as a result of
[automatic propagation](propagation.md).

The value of an inherited tag is overwritten by [sensitive data classification](../classify-intro.md).

---
title: Tag-based masking policies
source: https://docs.snowflake.com/en/user-guide/tag-based-masking-policies.md
section: User Guide
---

# Tag-based masking policies

This topic provides concepts about tag-based masking policies and examples of tag-based masking policies to protect column data.

## Overview

A tag-based masking policy combines the object tagging and masking policy features to allow a masking policy to be set on a tag using an
ALTER TAG command. When the data type in the masking policy signature and the data type of the column match, the tagged column is
automatically protected by the conditions in the masking policy. This simplifies the data protection efforts because column data that
should be protected no longer needs a masking policy manually applied to the column to protect the data. You can set a tag-based masking
policy on a database, schema, or table.

The tag can support one masking policy for each [data type](../sql-reference-data-types.md) that Snowflake supports. To simplify the
initial column data protection efforts, create a generic masking policy for each data type (e.g. STRING, NUMBER, TIMESTAMP_LTZ) that allows
authorized roles to see the raw data and unauthorized roles to see a fixed masked value.

The masking policy conditions can be written to protect the column data based on the policy assigned to the tag or protect the column data
based on the tag string value of the tag assigned to the column, depending upon the decisions of the policy administrator, tag
administrator, and data steward.

## Choose a database, schema, or table to assign the policy

Data engineers and data stewards can choose to assign the tag-based masking policy to a database, schema, table, or column.

Database & Schema:
:   When you set a tag-based masking policy on a database or schema, you leverage tag inheritance to
    protect table and view columns in the schema or database. Setting a tag-based masking policy on the database or schema protects the
    columns in that database or schema when the data type of the column matches the data type of the masking policy that is
    set on the tag.

    The main benefit of setting the tag-based masking policy on the database or schema is that the columns in all newly added tables and
    views are automatically protected when the column data type matches the masking policy data type. This approach simplifies data
    protection management because it is no longer necessary to set tags on every table. The result is that the policy protects new data in
    Snowflake until a data protection officer decides to assign either a masking policy to the column directly or a row
    access policy to the table or view.

Tables:
:   When you set a tag-based masking policy on a table, the tag is set on all columns in the table. The masking policy protects the column
    data when the data type of the column matches the data type of the masking policy.

    A column can be protected by a masking policy directly assigned to the column and a tag-based masking policy. If a column references both
    of these masking policies, the masking policy that is directly assigned to the column takes precedence over the tag-based masking policy.

For examples of tag-based masking policies, refer to the Use Tag-Based Masking Policies section (in this topic).

## Benefits

Ease of Use:
:   Assigning one or more masking policies to a tag is simple. Policy administrators can add or replace policies without breaking
    existing workflows.

Scalable:
:   Tag-based policies allow policy administrators to write a policy once, assign a policy to a tag once, and, depending on the
    [level](object-tagging/inheritance.md) at which the tag is set, have the policy apply to many objects. This results in the vast
    reduction of manually assigning a single policy to a single column every time a new column is created or replaced.

Comprehensive:
:   Policy administrators can create a policy for each data type and assign all of those policies to a single tag. Once the tag is applied at
    the table level, all columns in the table are protected, provided that the column data type matches the data type specified in the policy.

Protect future objects:
:   Assigning a tag-based masking policy to a table automatically applies the masking policy to any new table columns. This behavior is
    analogous to [future grants](../sql-reference/sql/grant-privilege.md).

Flexibility:
:   Tag-based masking policies offer an alternative to specifying the masking policy in a [CREATE TABLE](../sql-reference/sql/create-table.md) statement,
    which helps to simplify table DDL management. Administrators can choose to assign the masking policy either at table creation or by
    assigning the policy to the tag, which uses [tag inheritance](object-tagging/inheritance.md).

## Considerations

* For a tag-based masking policy where the tag is stored in a different schema than the masking policy and table, cloning the schema
  containing the masking policy and table results in the cloned table being protected by the masking policy in the source schema not the
  cloned schema.

  However, for a tag-based masking policy where the tag, masking policy, and table all exist in the schema, cloning the schema results in the
  table being protected by the masking policy in the cloned schema, not the source schema.

  If the table is cloned or moved to a different schema or database and was originally protected by a tag-based masking policy set on the
  schema or database, the table is not protected by the tag-based masking policy set on the source schema or database. The table is
  protected by the tag-based masking policy set on the target schema or database, if there is a tag-based masking policy set on the target
  schema or database.

* Regarding replication and tag-based masking policies, see
  [policy replication considerations](database-replication-considerations.md).
* For details about [Secure Data Sharing](data-sharing-gs.md) and this feature, see:

  + [Masking Policies & Data Sharing](security-column-intro.md)
  + [Object Tagging & Data Sharing](object-tagging/interaction.md)

## Limitations

All of the existing [masking policy limitations](security-column-intro.md) apply to tag-based masking policies.

Note the following additional limitations when using tag-based masking policies:

Data types:
:   A tag can support one masking policy for each data type. For example, if a tag already has a masking policy for the NUMBER data type, you
    cannot assign another masking policy with the NUMBER data type to the same tag.

System tags:
:   A masking policy cannot be assigned to a [system tag](classify-intro.md).

Dropping objects:
:   Neither the masking policy nor the tag can be dropped if the masking policy is assigned to a tag. Similarly, the parent schema and
    database containing the tag and the masking policy cannot be dropped if the policy is assigned to a tag. For more information,
    see Assign a masking policy to a tag (in this topic).

Materialized Views:
:   A materialized view cannot be created if the underlying table is protected by a tag-based masking policy. For additional details,
    see [masking policies and materialized views](security-column-intro.md).

    If a materialized view exists and a tag-based masking policy is added to the underlying table later, the materialized view cannot
    be queried; the materialized view is now invalidated. To continue using the materialized view, unset the tag-based masking
    policy, recreate or [resume](../sql-reference/sql/alter-materialized-view.md), and then query the materialized view.

Row access policies:
:   A given table or view column can be specified in either a masking policy signature or a row access policy signature.
    In other words, the same column cannot be specified in both a masking policy signature and a row access policy signature at the same time.

Conditional columns:
:   A masked column cannot be used as a conditional column in a masking policy.

Mapping tables:
:   A table containing a column protected by a tag-based masking policy cannot be used as a mapping table.

Snowflake Native App Framework:
:   For details about using tag-based masking policies with a Snowflake Native App, see:

    * [Restrictions on sharing data content that contains policies](../developer-guide/native-apps/preparing-data-content.md).
    * [Define policies on proxy views](../developer-guide/native-apps/preparing-data-content.md).
    * [Blocked context functions](../developer-guide/native-apps/redacted-content.md).

## Manage tag-based masking policies

The existing privileges for masking policies and tags, along with the commands to manage masking policies and tags, apply to tag-based
masking policies.

### Privilege

There are different privilege requirements depending on whether you choose to set the tag-based masking policy on a
database, schema, or table.

With tag-based masking on a database or schema, the current role or a role in the current role hierarchy must inherit the privileges as
shown in either of the following two options.

Option 1:
:   The role must have both the global APPLY MASKING POLICY and the global APPLY TAG privileges. For example, grant these privileges to the
    `data_engineer` custom role:

    ```sqlexample
    USE ROLE ACCOUNTADMIN;

    GRANT APPLY MASKING POLICY ON ACCOUNT TO ROLE data_engineer;

    GRANT APPLY TAG ON ACCOUNT TO ROLE data_engineer;
    ```

    This is the most [centralized approach](object-tagging/work.md) to protect columns with a tag-based masking policy
    in a schema or database.

Option 2:
:   A schema owner (i.e. a role with the OWNERSHIP privilege on the schema) can have the global APPLY MASKING POLICY privilege and the
    APPLY privilege on the tag. For example, if the tag is named `governance.tags.schema_mask` and the custom role that owns the schema
    is `schema_owner`:

    ```sqlexample
    USE ROLE ACCOUNTADMIN;

    GRANT APPLY MASKING POLICY ON ACCOUNT TO ROLE schema_owner;

    GRANT APPLY ON TAG governance.tags.schema_mask TO ROLE schema_owner;
    ```

    This approach provides more flexibility by delegating column protection to schema owners.

With tag-based masking on tables and views, a role with the global APPLY MASKING POLICY privilege can assign and replace a masking policy
on a tag.

For example, grant the global APPLY MASKING POLICY privilege to the `tag_admin` custom role:

> ```sqlexample
> USE ROLE SECURITYADMIN;
>
> GRANT APPLY MASKING POLICY ON ACCOUNT TO ROLE tag_admin;
> ```

#### Privileges for tag owners

A tag owner must have the APPLY MASKING POLICY privilege to unset a masking policy from the tag.

In some cases, tag owners can work with tag-based masking policies without having the APPLY MASKING POLICY privilege. If your role has the
OWNERSHIP or APPLY privilege on a tag that has a masking policy set on it, then you can apply the tag to your table or view without the
APPLY MASKING POLICY privilege. However, you’d still need the APPLY MASKING POLICY privilege to apply the same tag to a database or schema.

### Assign a masking policy to a tag

Assigning a tag-based masking policy on a schema or database follows the same procedure as setting a tag-based masking policy on a
table:

1. Create a tag using the [CREATE TAG](../sql-reference/sql/create-tag.md) command.
2. Create a masking policy using the [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md) command.

   You can optionally use the [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../sql-reference/functions/system_get_tag_on_current_column.md) and
   [SYSTEM$GET_TAG_ON_CURRENT_TABLE](../sql-reference/functions/system_get_tag_on_current_table.md) system functions in the masking policy conditions.
3. Set the masking policy on the tag using an [ALTER TAG](../sql-reference/sql/alter-tag.md) command.
4. Set the tag on the object based on how you want to protect your data using one of the following commands:

   * [ALTER DATABASE](../sql-reference/sql/alter-database.md)
   * [ALTER SCHEMA](../sql-reference/sql/alter-schema.md)
   * [ALTER TABLE](../sql-reference/sql/alter-table.md)

> **Tip:**
>
> To avoid conflicts with tags and masking policies when setting a tag-based masking policy on a schema or database, prior to
> assigning the tag-based masking policy:
>
> * Query the Account Usage TAG_REFERENCES view to verify the existing tags set on a table or a column in a table.
> * Query the Account Usage POLICY_REFERENCES view to determine if a tag-based masking policy is set on a table or a column. For details,
>   refer to Tag and policy discovery.

In addition to the ALTER TAG usage notes, note the following:

* A tag can have only one masking policy per data type. For example, one policy for the STRING data type, one policy for NUMBER data type,
  and so on.
* If a masking policy already protects a column and a tag with a masking policy is set on the same column, the masking policy
  directly assigned to the column takes precedence over the masking policy assigned to the tag.
* A tag cannot be [dropped](../sql-reference/sql/drop-tag.md) if it is assigned to a masking policy.
* A masking policy cannot be [dropped](../sql-reference/sql/drop-masking-policy.md) if it is assigned to a tag.

For more information on managing masking policies and tags, see:

* [Managing Column-level Security](security-column-intro.md)
* [Access control privileges](object-tagging/work.md)

### Replace a masking policy on a tag

After setting a masking policy on a tag, there are two different pathways to replace the masking policy on the tag with a different
masking policy. The ALTER TAG statement must specify the masking policy name as shown in the following options.

Option 1:
:   Unset the policy from a tag in one statement and then set a new policy on the tag in a different statement:

    ```sqlexample
    ALTER TAG security UNSET MASKING POLICY ssn_mask;

    ALTER TAG security SET MASKING POLICY ssn_mask_2;
    ```

Option 2:
:   Use the `FORCE` keyword to replace the policy in a single statement.

    Note that using the `FORCE` keyword replaces the policy when a policy of the same [data type](../sql-reference-data-types.md) is
    already set on the tag.

    > ```sqlexample
    > ALTER TAG security SET MASKING POLICY ssn_mask_2 FORCE;
    > ```

The option you select in the Privilege section and the order of operations in the
Assign a masking policy to a tag section can impact tag management if you need to replace or unset a tag on a database or schema.

If a schema owner sets a tag on a schema and then a different role sets a masking policy on the same tag, the schema owner cannot unset
the tag from the schema unless the schema owner is granted the global APPLY MASKING POLICY privilege. Snowflake fails the
[ALTER SCHEMA … UNSET TAG](../sql-reference/sql/alter-schema.md) operation for the schema owner. This scenario ensures that column data
that is protected by a tag-based masking policy stays protected. To avoid this scenario, use option 1 in the
Privilege section.

> > **Important:**
> >
> > Exercise caution when replacing a masking policy on a tag.
> >
> > Depending on the timing of the replacement and the query on the column, choosing to replace the policy in two separate statements could
> > lead to a data leak because the column data is unprotected in the time interval between the UNSET and SET operations.
> >
> > However, if the policy conditions are different in the replacement policy, specifying the `FORCE` keyword could lead to a lack of
> > access because (previously) users could access data and the replacement no longer allows access.
> >
> > Prior to replacing a policy, consult your internal data administrators to coordinate the best approach to protect data with tag-based
> > masking policies and replace masking policies as needed.

### Update a tag value

If a schema owner (i.e. `sch_role`) sets a tag on a schema and then a different role sets a masking policy on the same tag
(i.e. `masking_admin_role`), the schema owner cannot change the tag value. Snowflake fails the ALTER SCHEMA … SET TAG operation for
the schema owner.

To change the tag value:

1. Using the `masking_admin_role`, unset the masking policy from the tag.
2. Using the `sch_role`, modify the tag value.
3. Reassign the masking policy to the tag using the `masking_admin_role`.

### Parent database and schema

You cannot perform DROP and REPLACE operations on the database and schema when:

* The tag and masking policy are in the same schema.
* The table or view is in a different schema.
* The protected column in the table or view exists in a different schema than the schema that contains the masking policy and tag.

These four commands refer to DROP and replace operations on the database and schema:

* DROP DATABASE
* DROP SCHEMA
* CREATE OR REPLACE DATABASE
* CREATE OR REPLACE SCHEMA

### Conditional arguments

A [conditional masking policy](security-column-intro.md) can be assigned to a tag. After assigning the tag to a column,
the conditional arguments map to a column in the table by name if the data type of the argument matches with the data type of the column.

A query will fail if a conditional masking policy is assigned to a column in the following cases:

* The table does not have a column with the same name as a conditional argument of the policy.
* The table has a column that matches the name of the conditional argument of the policy but the data type doesn’t match.

For more information on these errors, see Troubleshoot Tag-based Masking Policies (in this topic).

### Tag inheritance

The tag with the masking policy can be assigned to all base table objects, or the tag can be assigned to a column in a base table.
When the tag-based masking policy is assigned to a base table, the columns are protected by the policy provided that the column
data type matches the data type in the masking policy signature.

Since the masking policy protects the base table columns, view columns that are derived from the underlying base table columns are also
protected, based on the current [limitations](security-column-intro.md),
[considerations](security-column-intro.md), and [behaviors](security-column-intro.md) regarding
masking policies with tables and views.

### Data Sharing

Tag-based masking policies that are set on a shared schema or shared database in the provider account are enforced in the consumer account.
This scenario ensures protected data that is shared remains protected, even if a consumer creates a new database from the share.

Additionally, note the following:

* Tag inheritance is preserved in the consumer account.

  When the provider sets a tag-based masking policy on their database and shares that database, Snowflake references the shared provider
  database in the consumer account in terms of the database that contains the tag.
* Snowflake does not honor tag inheritance with shared objects when the tags and tag-based masking policies originate in the consumer account.

  Tags and tag-based masking policies from the consumer account are not enforced on any shared objects.

### Snowsight

You can monitor and assign tag-based masking policies in Snowsight. For details, see:

* [Monitor tags with Snowsight](object-tagging/monitor.md)
* [Monitor masking policies with Snowsight](security-column-intro.md)

## Use tag-based masking policies

The subsections below provide the following information:

* A common procedure to use with tag-based masking policies for data protection and validation.
* Prerequisite steps to complete before implementing tag-based masking policies.
* A list of common assumptions for the examples.
* Representative examples of tag-based masking policy usage, including the usage of the following system functions:

  + [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../sql-reference/functions/system_get_tag_on_current_column.md)
  + [SYSTEM$GET_TAG_ON_CURRENT_TABLE](../sql-reference/functions/system_get_tag_on_current_table.md)

### Tag and policy discovery

The Information Schema table function [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) and the Account Usage
[POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) view can help determine whether a masking policy and a tag
reference each other by looking at the following columns:

* TAG_DATABASE
* TAG_SCHEMA
* TAG_NAME
* POLICY_STATUS

The POLICY_STATUS column can have four possible values:

`ACTIVE`
:   Specifies that the column (i.e. REF_COLUMN_NAME) is only associated with a single policy.

`MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`
:   Specifies that multiple masking policies are assigned to the same column.

`COLUMN_IS_MISSING_FOR_SECONDARY_ARG`
:   Specifies that the policy (i.e. POLICY_NAME) is a conditional masking policy and the table (i.e. REF_ENTITY_NAME) does not have a
    column with the same name.

`COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`
:   Specifies that the policy is a conditional masking policy and the table has a column with the same name but a different data type than
    the data type in the masking policy signature.

For details on related error messages with the possible values in the POLICY_STATUS column, refer to
Troubleshoot Tag-Based Masking Policies (in this topic).

### Data protection and validation steps

Generally, Snowflake recommends the following approach when using tag-based masking policies:

1. Create any tags that are needed for tag-based masking policies.
2. Create one masking policy for each data type based on the table columns that you intend to protect with the tag-based masking policies.
3. Assign the masking policies to the tag.
4. Assign the tag with the masking policies to the table column directly or to the table.
5. Check the Information Schema to verify the tag-based policy is assigned to the columns.
6. Query the data to verify the tag-based masking policy protects the data as intended.

### Prerequisite steps

1. Identify the existing tags and their string values in your Snowflake account.

   * Query the Account Usage [TAG REFERENCES](../sql-reference/account-usage/tag_references.md) view to obtain all tags and their assigned
     string values.
   * Optionally:

     + Query the Account Usage [TAGS](../sql-reference/account-usage/tags.md) view (i.e. the *tag catalog*) to obtain a list of
       tags to ensure that duplicate tag naming does not occur later while using tag-based masking policies.
     + Compare the outputs from the TAG_REFERENCES and TAGS queries to determine if there are any unassigned tags that can be used later.
     + Create any tags that will be needed later using the [CREATE TAG](../sql-reference/sql/create-tag.md) command. Otherwise, create tags as needed.
2. Identify the existing policies and their definitions in your Snowflake account.

   * Execute the [SHOW MASKING POLICIES](../sql-reference/sql/show-masking-policies.md) command to obtain a list of existing masking policies.
   * Decide whether these policies, in their current form, can be assigned to tags. If necessary, execute the
     [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md) command to obtain the policy definition. Otherwise, plan to create new policies to
     assign to tags.
3. Determine how to protect the column data with the masking policy in terms of whether the policy conditions should evaluate the tag
   string value that is set on the table column.

### Common assumptions with the examples

The examples make the following assumptions:

* The prerequisite steps were completed.
* The `tag_admin` custom role has the following privileges:

  + The schema-level CREATE TAG privilege.
  + The global APPLY TAG privilege.

  For more information, see [tag privileges](object-tagging/work.md).
* The `masking_admin` custom role has the following privileges:

  + The schema-level CREATE MASKING POLICY privilege.
  + The USAGE privilege on the `governance` database and the `governance.masking_policies` schema.
  + The global APPLY MASKING POLICY privilege to assign masking policies to tags (see Privilege in this topic).
  + The global APPLY TAG privilege, to assign the tag (with the masking policies) to objects.

  For details, see, [tag privileges](object-tagging/work.md).
* The `row_access_admin` custom role has the following privileges:

  + The schema-level CREATE ROW ACCESS POLICY privilege.
  + The USAGE privilege on the `governance` database and the `governance.row_access_policies` schema.
  + The global APPLY ROW ACCESS POLICY privilege.

  For more information, see [row access policy privileges](security-row-intro.md).
* The `accounting_admin` custom role has the following privileges:

  + The USAGE privilege on the `finance` database and the `finance.accounting` schema.
  + The SELECT privilege on tables in the `finance.accounting` schema.
* The `analyst` custom role has the following privileges:

  + The USAGE privilege on the `finance` database and on the `finance.accounting` schema.
  + The SELECT privilege on the `finance.accounting.name_number` table.
* The custom roles described above are granted to the appropriate users.

  For details, see [Configuring access control](security-access-control-configure.md).

### Example 1: Protect column data based on the masking policy directly assigned to the tag

This example assigns two masking policies to a tag and then assigns the same tag to a table. The result is that the masking policies
protect all table columns whose data types match the data types in the policies.

The following steps create a tag-based masking policy to mask accounting data. For example, consider the table named
`finance.accounting.name_number`, which has two columns, `ACCOUNT_NAME` and `ACCOUNT_NUMBER`. The data types in these columns are
STRING and NUMBER, respectively.

> ```output
> ---------------+----------------+
>   ACCOUNT_NAME | ACCOUNT_NUMBER |
> ---------------+----------------+
>   ACME         | 1000           |
> ---------------+----------------+
> ```

Create a tag-based masking policy to protect the ACCOUNT_NAME and ACCOUNT_NUMBER columns as follows:

1. Create a tag named `accounting` in the schema named `governance.tags`.

   > ```sqlexample
   > USE ROLE tag_admin;
   > USE SCHEMA governance.tags;
   > CREATE OR REPLACE TAG accounting;
   > ```
2. Create different masking policies to protect the ACCOUNT_NAME and ACCOUNT_NUMBER columns. In each of these policies, only the
   `ACCOUNTING_ADMIN` custom role can view the raw data.

   Account name policy:

   > ```sqlexample
   > USE ROLE masking_admin;
   > USE SCHEMA governance.masking_policies;
   >
   > CREATE OR REPLACE MASKING POLICY account_name_mask
   > AS (val string) RETURNS string ->
   >   CASE
   >     WHEN CURRENT_ROLE() IN ('ACCOUNTING_ADMIN') THEN val
   >     ELSE '***MASKED***'
   >   END;
   > ```

   Account number policy:

   > ```sqlexample
   > CREATE OR REPLACE MASKING POLICY account_number_mask
   > AS (val number) RETURNS number ->
   >   CASE
   >     WHEN CURRENT_ROLE() IN ('ACCOUNTING_ADMIN') THEN val
   >     ELSE -1
   >   END;
   > ```
3. Assign both masking policies to the `accounting` tag. Note that both policies can be assigned to the tag in a single statement.

   > ```sqlexample
   > ALTER TAG governance.tags.accounting SET
   >   MASKING POLICY account_name_mask,
   >   MASKING POLICY account_number_mask;
   > ```
4. Assign the `accounting` tag to the `finance.accounting.name_number` table.

   > ```sqlexample
   > ALTER TABLE finance.accounting.name_number
   >   SET TAG governance.tags.accounting = 'tag-based policies';
   > ```
5. Verify the `ACCOUNT_NAME` and `ACCOUNT_NUMBER` table columns are protected by the tag-based masking policy by calling the
   Information Schema [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) table function.

   For each protected column, the row in the query result should specify the appropriate values for the column name, policy name, and tag
   name:

   > ```sqlexample
   > USE ROLE masking_admin;
   > SELECT *
   > FROM TABLE (governance.INFORMATION_SCHEMA.POLICY_REFERENCES(
   >   REF_ENTITY_DOMAIN => 'TABLE',
   >   REF_ENTITY_NAME => 'governance.accounting.name_number' )
   > );
   > ```

   Returns (note the updated columns):

   > ```output
   > -------------+------------------+---------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+------------+---------------+
   >   POLICY_DB  | POLICY_SCHEMA    | POLICY_NAME         | POLICY_KIND    | REF_DATABASE_NAME | REF_SCHEMA_NAME | REF_ENTITY_NAME | REF_ENTITY_DOMAIN | REF_COLUMN_NAME | REF_ARG_COLUMN_NAMES | TAG_DATABASE | TAG_SCHEMA | TAG_NAME   | POLICY_STATUS |
   > -------------+------------------+---------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+------------+---------------+
   >   GOVERNANCE | MASKING_POLICIES | ACCOUNT_NAME_MASK   | MASKING_POLICY | FINANCE           | ACCOUNTING      | NAME_NUMBER     | TABLE             | ACCOUNT_NAME    | NULL                 | GOVERNANCE   | TAGS       | ACCOUNTING | ACTIVE        |
   >   GOVERNANCE | MASKING_POLICIES | ACCOUNT_NUMBER_MASK | MASKING_POLICY | FINANCE           | ACCOUNTING      | NAME_NUMBER     | TABLE             | ACCOUNT_NUMBER  | NULL                 | GOVERNANCE   | TAGS       | ACCOUNTING | ACTIVE        |
   > -------------+------------------+---------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+------------+---------------+
   > ```
6. Query the table columns with authorized and unauthorized roles to ensure Snowflake returns the correct query result.

   Authorized:

   > ```sqlexample
   > USE ROLE accounting_admin;
   > SELECT * FROM finance.accounting.name_number;
   > ```
   >
   > Returns:
   >
   > > ```output
   > > ---------------+----------------+
   > >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > > ---------------+----------------+
   > >   ACME         | 1000           |
   > > ---------------+----------------+
   > > ```

   Unauthorized:

   > ```sqlexample
   > USE ROLE analyst;
   > SELECT * FROM finance.accounting.name_number;
   > ```
   >
   > Returns:
   >
   > > ```output
   > > ---------------+----------------+
   > >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > > ---------------+----------------+
   > >   ***MASKED*** | -1             |
   > > ---------------+----------------+
   > > ```

### Example 2: Protect column data based on the column tag string value

This example uses a tag-based masking policy to determine whether data should be masked based upon the tag string value of the tag assigned
to a column. The masking policy dynamically evaluates the tag string value by calling the
[SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../sql-reference/functions/system_get_tag_on_current_column.md) function into the masking policy conditions and writing the masking policy
conditions to match the tag string value.

The following steps create a tag-based masking policy to mask accounting data. For brevity, the table columns have two data types,
STRING and NUMBER, respectively. For example, a table named `finance.accounting.name_number`:

> ```output
> ---------------+----------------+
>   ACCOUNT_NAME | ACCOUNT_NUMBER |
> ---------------+----------------+
>   ACME         | 1000           |
> ---------------+----------------+
> ```

Create a tag-based masking policy to protect the `ACCOUNT_NAME` and `ACCOUNT_NUMBER` columns as follows:

1. Create a tag named `accounting_col_string` in the schema named `governance.tags`.

   > ```sqlexample
   > USE ROLE tag_admin;
   > USE SCHEMA governance.tags;
   > CREATE TAG accounting_col_string;
   > ```
2. Create different masking policies to protect the ACCOUNT_NAME and ACCOUNT_NUMBER columns. In each of these policies, the raw data is
   visible only when the current tag string value on the column is set to `'visible'`.

   Account name policy:

   > ```sqlexample
   > USE ROLE masking_admin;
   > USE SCHEMA governance.masking_policies;
   >
   > CREATE MASKING POLICY account_name_mask_tag_string
   > AS (val string) RETURNS string ->
   >   CASE
   >     WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_col_string') = 'visible' THEN val
   >     ELSE '***MASKED***'
   >   END;
   > ```

   Account number policy:

   > ```sqlexample
   > CREATE MASKING POLICY account_number_mask_tag_string
   > AS (val number) RETURNS number ->
   >   CASE
   >     WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_col_string') = 'visible' THEN val
   >     ELSE -1
   >   END;
   > ```
   >
   > > **Note:**
   > >
   > > These policies use the `schema_name.tag_name` object name format in the function argument because the `tags` schema and the
   > > `masking_policies` schema both exist in the `governance` database. Alternatively, you can also use the fully-qualified name
   > > for the tag in the function argument.
   > >
   > > Snowflake returns an error at query runtime on a column protected by a tag-based masking policy if the system function argument
   > > in the policy conditions contains a tag name that is not sufficiently qualified. For example, the argument uses the tag name as
   > > `accounting_col_string` only, without specifying the schema name or the database name.
   > >
   > > For more information, see [Object name resolution](../sql-reference/name-resolution.md).
3. Assign both masking policies to the `accounting_col_string` tag. Note that both policies can be assigned to the tag in a single
   statement.

   > ```sqlexample
   > ALTER TAG accounting_col_string SET
   >   MASKING POLICY account_name_mask_tag_string,
   >   MASKING POLICY account_number_mask_tag_string;
   > ```
4. Assign the `accounting_col_string` tag to each table column. In this example, the tag string value on the `ACCOUNT_NAME` column is
   `'visible'`, however, the tag string value on the `ACCOUNT_NUMBER` column is set to `'protect'`.

   > ```sqlexample
   > ALTER TABLE finance.accounting.name_number MODIFY COLUMN
   >   account_name SET TAG governance.tags.accounting_col_string = 'visible',
   >   account_number SET TAG governance.tags.accounting_col_string = 'protect';
   > ```
5. Verify the `ACCOUNT_NAME` and `ACCOUNT_NUMBER` table columns are protected by the tag-based masking policy by calling the
   Information Schema [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) table function.

   For each protected column, the row in the query result should specify the appropriate values for the column name, policy name, and tag
   name.

   > ```sqlexample
   > SELECT *
   > FROM TABLE(
   >  governance.INFORMATION_SCHEMA.POLICY_REFERENCES(
   >    REF_ENTITY_DOMAIN => 'TABLE',
   >    REF_ENTITY_NAME => 'finance.accounting.name_number'
   >    )
   > );
   > ```

   Returns (note the updated columns):

   > ```output
   > ------------+----------------+--------------------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+-----------------------+---------------+
   >  POLICY_DB  | POLICY_SCHEMA  | POLICY_NAME                    |  POLICY_KIND   | REF_DATABASE_NAME | REF_SCHEMA_NAME | REF_ENTITY_NAME | REF_ENTITY_DOMAIN | REF_COLUMN_NAME | REF_ARG_COLUMN_NAMES | TAG_DATABASE | TAG_SCHEMA | TAG_NAME              | POLICY_STATUS |
   > ------------+----------------+--------------------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+-----------------------+---------------+
   >  GOVERNANCE | MASKING_POLICY | ACCOUNT_NAME_MASK_TAG_STRING   | MASKING_POLICY | FINANCE           | ACCOUNTING      | NAME_NUMBER     | TABLE             | ACCOUNT_NAME    | NULL                 | GOVERNANCE   | TAGS       | ACCOUNTING_COL_STRING | ACTIVE        |
   >  GOVERNANCE | MASKING_POLICY | ACCOUNT_NUMBER_MASK_TAG_STRING | MASKING_POLICY | FINANCE           | ACCOUNTING      | NAME_NUMBER     | TABLE             | ACCOUNT_NUMBER  | NULL                 | GOVERNANCE   | TAGS       | ACCOUNTING_COL_STRING | ACTIVE        |
   > ------------+----------------+--------------------------------+----------------+-------------------+-----------------+-----------------+-------------------+-----------------+----------------------+--------------+------------+-----------------------+---------------+
   > ```
6. Query the table columns to ensure Snowflake returns the correct query result, which should only mask the value in the `ACCOUNT_NUMBER`
   column.

   > ```sqlexample
   > USE ROLE accounting_admin;
   > SELECT * FROM finance.accounting.name_number;
   > ```
   >
   > Returns:
   >
   > > ```output
   > > ---------------+----------------+
   > >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > > ---------------+----------------+
   > >   ACME         | -1             |
   > > ---------------+----------------+
   > > ```

### Example 3: Protect a table based on the table tag string value

This example uses a row access policy to protect a table based on a tag string value assigned to the table and a tag-based masking policy
to protect the columns in the table. For simplicity, this example uses one tag, assigns the masking policies to the tag, and assigns the
tag to the table. The columns in the table will automatically have the same tag and its string value because of
[tag inheritance](object-tagging/inheritance.md).

The row access policy dynamically evaluates the tag string value of the tag assigned to the table by calling the
[SYSTEM$GET_TAG_ON_CURRENT_TABLE](../sql-reference/functions/system_get_tag_on_current_table.md) function in the row access policy conditions. As with the previous example,
the masking policy conditions call the [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../sql-reference/functions/system_get_tag_on_current_column.md) function to evaluate the tag
string value on the table columns.

> **Important:**
>
> Note that you cannot assign a row access policy to the tag.
>
> Instead, assign the row access policy to the table directly using an [ALTER TABLE](../sql-reference/sql/alter-table.md) command.

The table `finance.accounting.name_number` has two columns, which have the data types STRING and NUMBER:

> ```output
> ---------------+----------------+
>   ACCOUNT_NAME | ACCOUNT_NUMBER |
> ---------------+----------------+
>   ACME         | 1000           |
> ---------------+----------------+
> ```

Protect the table and its columns with a row access policy and a tag-based masking policy as follows:

1. [Create a row access policy](../sql-reference/sql/create-row-access-policy.md) that calls the
   [SYSTEM$GET_TAG_ON_CURRENT_TABLE](../sql-reference/functions/system_get_tag_on_current_table.md) function in the policy conditions:

   > ```sqlexample
   > USE ROLE row_access_admin;
   > USE SCHEMA governance.row_access_policies;
   >
   > CREATE ROW ACCESS POLICY rap_tag_value
   > AS (account_number number)
   > RETURNS BOOLEAN ->
   > SYSTEM$GET_TAG_ON_CURRENT_TABLE('tags.accounting_row_string') = 'visible'
   > AND
   > 'accounting_admin' = CURRENT_ROLE();
   > ```

   The policy specifies that Snowflake return rows in the query result only when the `accounting_row_string` tag is assigned to
   the table with a string value as `'visible'` and the role executing the query on the table or its columns is the
   `accounting_admin` custom role.

   Snowflake does not return rows in the query result if any of the following are true:

   * The `accounting_row_string` tag is not set on the table.
   * The `accounting_row_string` tag is set on the table but with a different string value.
   * The role executing a query on the table or its columns is not the `accounting_admin` custom role.
2. [Assign](../sql-reference/sql/alter-table.md) the row access policy to the table:

   > ```sqlexample
   > ALTER TABLE finance.accounting.name_number
   >   ADD ROW ACCESS POLICY rap_tag_value ON (account_number);
   > ```

   Note that at this point in the procedure, a query on the table should not return any rows in the query result for any role in
   Snowflake because the `accounting_row_string` tag is not assigned to the table. So, the expected result from a query on the table
   should be:

   > ```sqlexample
   > USE ROLE accounting_admin;
   > SELECT * FROM finance.accounting.name_number;
   > ```

   Returns:

   > ```output
   > ---------------+----------------+
   >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > ---------------+----------------+
   >                |                |
   > ---------------+----------------+
   > ```

   By choosing to assign the row access policy to the table before assigning the tag-based masking policy to the table, all of the table
   data is protected as early as possible.
3. Create a tag named `accounting_row_string` in the schema named `governance.tags`.

   > ```sqlexample
   > USE ROLE tag_admin;
   > USE SCHEMA governance.tags;
   > CREATE TAG accounting_row_string;
   > ```
4. Create different masking policies to protect the ACCOUNT_NAME and ACCOUNT_NUMBER columns. In each of these policies, the raw data is
   visible only when the current tag string value on the column is set to `'visible'`.

   Account name policy:

   > ```sqlexample
   > USE ROLE masking_admin;
   > USE SCHEMA governance.masking_policies;
   >
   > CREATE MASKING POLICY account_name_mask AS (val string) RETURNS string ->
   >   CASE
   >     WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_row_string') = 'visible' THEN val
   >     ELSE '***MASKED***'
   >   END;
   > ```

   Account number policy:

   > ```sqlexample
   > CREATE MASKING POLICY account_number_mask AS (val number) RETURNS number ->
   >   CASE
   >     WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_row_string') = 'visible' THEN val
   >     ELSE -1
   >   END;
   > ```
5. Assign both masking policies to the `accounting_row_string` tag. Note that both policies can be assigned to the tag in a single
   statement.

   > ```sqlexample
   > ALTER TAG governance.tags.accounting_row_string SET
   >   MASKING POLICY account_name_mask,
   >   MASKING POLICY account_number_mask;
   > ```
6. Assign the `accounting_row_string` tag to the table with the tag string value `'visible'`:

   > ```sqlexample
   > ALTER TABLE finance.accounting.name_number
   >   SET TAG governance.tags.accounting_row_string = 'visible';
   > ```

   Now that the tag is assigned to the table with a string value of `visible`, only the `accounting_admin` custom role can view the
   table data; a query made by a user with any other role should result in no rows being returned as shown earlier in this example. In
   other words, the conditions of the row access policy now evaluate to true.

   Similarly, the table columns also have the tag string value of `visible` tag because the columns inherit the tag and its string value
   through tag inheritance. The result is that when a user with the `accounting_admin` custom role queries the table, Snowflake returns
   unmasked data:

   > ```sqlexample
   > USE ROLE accounting_admin;
   > SELECT * FROM finance.accounting.name_number;
   > ```

   Returns:

   > ```output
   > ---------------+----------------+
   >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > ---------------+----------------+
   >   ACME         | 1000           |
   > ---------------+----------------+
   > ```
7. To mask data in either column, update the tag string value for the column directly. For example, to mask the data in the ACCOUNT_NUMBER
   column:

   > ```sqlexample
   > ALTER TABLE finance.accounting.name_number MODIFY COLUMN
   >   account_number SET TAG governance.tags.accounting_row_string = 'protect';
   > ```

   Now when a user with the `accounting_admin` custom role queries the table or the ACCOUNT_NUMBER column, Snowflake returns masked data:

   > ```sqlexample
   > USE ROLE accounting_admin;
   > SELECT * FROM finance.accounting.name_number;
   > ```
   >
   > Returns:
   >
   > > ```output
   > > ---------------+----------------+
   > >   ACCOUNT_NAME | ACCOUNT_NUMBER |
   > > ---------------+----------------+
   > >   ACME         | -1             |
   > > ---------------+----------------+
   > > ```

## Enforce tag-based masking policies on Apache Iceberg tables queried from Apache Spark™

Snowflake supports enforcing tag-based masking policies on Apache Iceberg tables that you query from Apache Spark™ through Snowflake Horizon
Catalog. For more information,
see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Troubleshoot tag-based masking policies

The following table lists some error messages that Snowflake can return when using tag-based masking policies:

| Behavior | Error message | Troubleshooting action |
| --- | --- | --- |
| Cannot query a column: too many policies. | SQL execution error: Column <col_name> is mapped to multiple masking policies by tags.Please contact your local administrator to fix the issue. | A given column can be protected by only one masking policy.  Call the [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) function to identify the masking policies set on the column. Modify the tags by unsetting the masking policy from the tag so that the column is protected by only one policy. |
| Cannot query a column: no conditional column. | SQL execution error: Column <col_name> is mapped to a masking policy where the table doesn’t have a column for a secondary argument name of the policy.Please contact your local administrator to fix the issue. | A masking policy that uses [conditional arguments](security-column-intro.md) must have all of the specified columns in the same table or view. Do one of the following to protect the column data:   * Assign a different policy to the column [directly](security-column-intro.md). * Modify the tag by assigning a different masking policy to the tag. |
| Column data is not masked due to a data type mismatch for the column and the policy. | SQL execution error: Column <col_name> is mapped to a masking policy where the table has a column with different data-type for a secondary argument name.Please contact your local administrator to fix the issue. | To mask the column data, the data type for the column and the data type in the masking policy signature must match. Do one of the following to protect the column data:   * Assign a different policy to the column [directly](security-column-intro.md). * Assign a masking policy to the tag, making sure that the data type for the policy and the   data type for the column match. |

---
title: The PIPE object
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-pipe-object.md
section: User Guide
---

# The PIPE object

The PIPE object is the server-side processing layer for Snowpipe Streaming. Every streaming ingestion flows through a pipe, which handles schema validation, optional in-flight data transformations, and optional pre-clustering before committing data to the target table.

The PIPE object provides the following capabilities:

* **In-flight transformations**: Filter rows, reorder columns, cast types, and apply expressions during ingestion by using COPY command transformation syntax. This enables data cleansing and reshaping at ingest time, with no separate ETL step required.
* **Pre-clustering**: Sort data during ingestion based on table clustering keys for optimized query performance.
* **Server-side schema validation**: Validate incoming data against the schema defined in the pipe before committing.
* **Table feature support**: Ingest into tables with defined clustering keys, DEFAULT value columns, and AUTOINCREMENT (or IDENTITY) columns.

For quick setup, Snowflake automatically creates a default pipe for every table. The default pipe handles ingestion with no manual DDL required. For advanced use cases that require transformations or pre-clustering, you can create a custom named pipe. For more information, see [CREATE PIPE](../../sql-reference/sql/create-pipe.md).

## Default pipe

Snowflake provides a default pipe for every target table. The default pipe is created on demand after the first successful pipe-info or open-channel call is made against the target table. This lets you start streaming data immediately without needing to manually execute CREATE PIPE DDL statements.

* On-demand creation: You can only view or describe the pipe (using [SHOW PIPES](../../sql-reference/sql/show-pipes.md) or [DESCRIBE PIPE](../../sql-reference/sql/desc-pipe.md)) after it has been instantiated by one of these calls.
* Naming convention: `<TABLE_NAME>-STREAMING` (for example, `MY_TABLE-STREAMING`)
* Fully Snowflake managed: You can’t run CREATE, ALTER, or DROP on the default pipe.
* Visibility: You can inspect the default pipe using [SHOW PIPES](../../sql-reference/sql/show-pipes.md), [DESCRIBE PIPE](../../sql-reference/sql/desc-pipe.md), and [SHOW CHANNELS](../../sql-reference/sql/show-channels.md). The default pipe is also included in the [ACCOUNT_USAGE.PIPES](../../sql-reference/account-usage/pipes.md), [ACCOUNT_USAGE.METERING_HISTORY](../../sql-reference/account-usage/metering_history.md), and [ORGANIZATION_USAGE.PIPES](../../sql-reference/organization-usage/pipes.md) views.

The default pipe has the following limitations:

* No transformations: The default pipe uses `MATCH_BY_COLUMN_NAME` in the underlying copy statement. It doesn’t support specific data transformations.
* No pre-clustering: The default pipe doesn’t support pre-clustering for the target table.

If your workflow requires transformations or pre-clustering, create your own named pipe. For more information, see [CREATE PIPE](../../sql-reference/sql/create-pipe.md).

When you configure the Snowpipe Streaming SDK or REST API, you can reference the default pipe name in your client configuration to begin streaming. For more information, see [Tutorial: Get started with Snowpipe Streaming high-performance architecture SDK](snowpipe-streaming-high-performance-getting-started.md) and [Tutorial: Get started with Snowpipe Streaming REST API using cURL and a JWT](snowpipe-streaming-high-performance-rest-tutorial.md).

## Pre-clustering data during ingestion

Snowpipe Streaming can cluster in-flight data during ingestion, which improves query performance on your target tables. This feature sorts your data directly during ingestion before the data is committed.

To use pre-clustering, your target table must have clustering keys defined. You can then enable this feature by setting the parameter `CLUSTER_AT_INGEST_TIME` to `TRUE` in your COPY INTO statement when creating or replacing your Snowpipe Streaming pipe.

For more information, see [CLUSTER_AT_INGEST_TIME](../../sql-reference/sql/copy-into-table.md).

> **Important:**
>
> When you use the pre-clustering feature, don’t disable the auto-clustering feature on the destination table. Disabling auto-clustering can lead to degraded query performance over time.

---
title: Time-Series Forecasting (Snowflake ML Functions)
source: https://docs.snowflake.com/en/user-guide/ml-functions/forecasting.md
section: User Guide
---

# Time-Series Forecasting (Snowflake ML Functions)

Forecasting uses a machine learning algorithm that predicts future numeric data from historical time series data.
A common use case is to forecast sales by item for the next two weeks.

## Quickstart to forecasting

This section gives the quickest way to get started with forecasting.

### Prerequisites

To get started you must do the following:

* Select a database, schema and virtual warehouse.
* Confirm that you own your schema or have CREATE SNOWFLAKE.ML.FORECAST privileges in the schema you’ve chosen.
* Have a table or view with at least two columns: one timestamp column and one numeric column. Be sure your timestamp column has
  timestamps at a fixed interval and isn’t missing too many timestamps. The following example shows a dataset with timestamp intervals
  of one day:

  ```none
  ('2020-01-01 00:00:00.000', 2.0),
  ('2020-01-02 00:00:00.000', 3.0),
  ('2020-01-03 00:00:00.000', 4.0);
  ```

### Create forecasts

Once you have the prerequisites, you can use the AI & ML Studio in Snowsight to guide you through setup or you can use the
following SQL commands to train a model and start creating forecasts:

```sqlexample
-- Train your model
CREATE SNOWFLAKE.ML.FORECAST my_model(
  INPUT_DATA => TABLE(my_view),
  TIMESTAMP_COLNAME => 'my_timestamps',
  TARGET_COLNAME => 'my_metric'
);

-- Generate forecasts using your model
SELECT * FROM TABLE(my_model!FORECAST(FORECASTING_PERIODS => 7));
```

For more details on syntax and available methods, see the [FORECAST (SNOWFLAKE.ML)](../../sql-reference/classes/forecast.md) reference.

## Dive deeper into forecasting

The forecasting function is built to predict any numeric time series data into the future. In addition to the simple case presented in the
Quickstart to forecasting section, you can do the following:

* Predict for multiple series at once. For example, you can predict the sales of multiple items for the next two weeks.
* Train and predict using features. Features are additional factors that you believe influence the metric you want to forecast.
* Assess your model’s accuracy.
* Understand the relative importance of the features the model was trained on.
* Debug training errors.

The following sections provide examples of these scenarios and additional details on how forecasting works.

## Examples

This section provide examples of how to set up your data for forecasting and how to create a forecasting model based on your time-series
data.

> **Note:**
>
> Ideally, the training data for a Forecasting model has time steps at equally spaced intervals (for example, daily).
> However, model training can handle real-world data that has missing, duplicate, or misaligned time steps. For more
> information, see [Dealing with real-world data in Time-Series Forecasting](preprocessing.md).

### Set up example data

The example below creates two tables. Views of these tables are included in the examples later in this topic.

The `sales_data` table contains sales data. Each sale includes a store ID, an item identifier, a timestamp, and
the sales amount. Additional columns, which are additional features (temperature, humidity, and holiday) are also included.

The `future_features` table contains future values of the feature columns, which are necessary when forecasting
using features as part of your prediction process.

```sqlexample
CREATE OR REPLACE TABLE sales_data (store_id NUMBER, item VARCHAR, date TIMESTAMP_NTZ,
  sales FLOAT, temperature NUMBER, humidity FLOAT, holiday VARCHAR);

INSERT INTO sales_data VALUES
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-01'), 2.0, 50, 0.3, 'new year'),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-02'), 3.0, 52, 0.3, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-03'), 4.0, 54, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-04'), 5.0, 54, 0.3, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-05'), 6.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-06'), 7.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-07'), 8.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-08'), 9.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-09'), 10.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-10'), 11.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-11'), 12.0, 55, 0.2, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-12'), 13.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-01'), 2.0, 50, 0.3, 'new year'),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-02'), 3.0, 52, 0.3, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-03'), 4.0, 54, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-04'), 5.0, 54, 0.3, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-05'), 6.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-06'), 7.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-07'), 8.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-08'), 9.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-09'), 10.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-10'), 11.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-11'), 12.0, 55, 0.2, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-12'), 13.0, 55, 0.2, NULL);

-- Future values for additional columns (features)
CREATE OR REPLACE TABLE future_features (store_id NUMBER, item VARCHAR,
  date TIMESTAMP_NTZ, temperature NUMBER, humidity FLOAT, holiday VARCHAR);

INSERT INTO future_features VALUES
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-13'), 52, 0.3, NULL),
  (1, 'jacket', TO_TIMESTAMP_NTZ('2020-01-14'), 53, 0.3, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-13'), 52, 0.3, NULL),
  (2, 'umbrella', TO_TIMESTAMP_NTZ('2020-01-14'), 53, 0.3, NULL);
```

### Forecasting on a single series

This example uses a single time series (that is, all the rows are part of a single series) that has two columns, a
timestamp column and a target value column, without additional features.

First, prepare the example dataset to train the model:

```sqlexample
CREATE OR REPLACE VIEW v1 AS SELECT date, sales
  FROM sales_data WHERE store_id=1 AND item='jacket';
SELECT * FROM v1;
```

The SELECT statement returns:

```output
+-------------------------+-------+
| DATE                    | SALES |
+-------------------------+-------+
| 2020-01-01 00:00:00.000 | 2     |
| 2020-01-02 00:00:00.000 | 3     |
| 2020-01-03 00:00:00.000 | 4     |
| 2020-01-04 00:00:00.000 | 5     |
| 2020-01-05 00:00:00.000 | 6     |
+-------------------------+-------+
```

Now, train a forecasting model using this view:

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model1(
  INPUT_DATA => TABLE(v1),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales'
);
```

The following message appears after the model is trained:

```output
Instance MODEL1 successfully created.
```

Next, use the forecasting model to forecast the next three timestamps:

```sqlexample
call model1!FORECAST(FORECASTING_PERIODS => 3);
```

**Output**

Note that the model has inferred the interval between timestamps from the training data.

```output
+--------+-------------------------+-----------+--------------+--------------+
| SERIES | TS                      | FORECAST  | LOWER_BOUND  | UPPER_BOUND  |
+--------+-------------------------+-----------+--------------+--------------+
| NULL   | 2020-01-13 00:00:00.000 | 14        | 14           | 14           |
| NULL   | 2020-01-14 00:00:00.000 | 15        | 15           | 15           |
| NULL   | 2020-01-15 00:00:00.000 | 16        | 16           | 16           |
+--------+-------------------------+-----------+--------------+--------------+
```

In this example, because the forecast yields a perfectly linear prediction that has zero errors compared to the actual
values, the prediction interval (LOWER_BOUND, UPPER_BOUND) is the same as the FORECAST value.

To customize the size of the prediction interval, pass `prediction_interval` as part of a configuration object:

```sqlexample
CALL model1!FORECAST(FORECASTING_PERIODS => 3, CONFIG_OBJECT => {'prediction_interval': 0.8});
```

To save your results directly to a table, use [CREATE TABLE … AS SELECT …](../../sql-reference/sql/create-table.md) and
[call the FORECAST method in the FROM clause](../../sql-reference/snowflake-db-classes.md):

```sqlexample
CREATE TABLE my_forecasts AS
  SELECT * FROM TABLE(model1!FORECAST(FORECASTING_PERIODS => 3));
```

As shown in the example above, when calling the method, omit the [CALL](../../sql-reference/sql/call.md) command. Instead, put the call
in parentheses, preceded by the TABLE keyword.

### Forecast on multiple series

To create a forecasting model for multiple series at once, use the `series_colname` parameter.

In this example, the data contains `store_id` and `item` columns. To forecast sales separately for every store/item
combination in the dataset, create a new column that combines these values, and specify that as the series
column.

The following query creates a new view combining `store_id` and `item` into a new column named
`store_item`:

```sqlexample
CREATE OR REPLACE VIEW v3 AS SELECT [store_id, item] AS store_item, date, sales FROM sales_data;
SELECT * FROM v3;
```

**Output**

The first five rows for each series for the resulting dataset are:

```output
+-------------------+-------------------------+-------+
| STORE_ITEM        | DATE                    | SALES |
+-------------------+-------------------------+-------+
| [ 1, "jacket" ]   | 2020-01-01 00:00:00.000 | 2     |
| [ 1, "jacket" ]   | 2020-01-02 00:00:00.000 | 3     |
| [ 1, "jacket" ]   | 2020-01-03 00:00:00.000 | 4     |
| [ 1, "jacket" ]   | 2020-01-04 00:00:00.000 | 5     |
| [ 1, "jacket" ]   | 2020-01-05 00:00:00.000 | 6     |
| [ 2, "umbrella" ] | 2020-01-01 00:00:00.000 | 2     |
| [ 2, "umbrella" ] | 2020-01-02 00:00:00.000 | 3     |
| [ 2, "umbrella" ] | 2020-01-03 00:00:00.000 | 4     |
| [ 2, "umbrella" ] | 2020-01-04 00:00:00.000 | 5     |
| [ 2, "umbrella" ] | 2020-01-05 00:00:00.000 | 6     |
+-------------------+-------------------------+-------+
```

Now use the forecasting function to train a model for each series, all in one step. Note that the `series_colname` parameter is set
to `store_item`:

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model2(
  INPUT_DATA => TABLE(v3),
  SERIES_COLNAME => 'store_item',
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales'
);
```

Next, use that model to forecast the next two timestamps for all series:

```sqlexample
CALL model2!FORECAST(FORECASTING_PERIODS => 2);
```

**Output**

```output
+-------------------+------------------------+----------+-------------+-------------+
| SERIES            | TS                     | FORECAST | LOWER_BOUND | UPPER_BOUND |
+-------------------+------------------------+----------+-------------+-------------+
| [ 1, "jacket" ]   | 2020-01-13 00:00:00.000 | 14      | 14          | 14          |
| [ 1, "jacket" ]   | 2020-01-14 00:00:00.000 | 15      | 15          | 15          |
| [ 2, "umbrella" ] | 2020-01-13 00:00:00.000 | 14      | 14          | 14          |
| [ 2, "umbrella" ] | 2020-01-14 00:00:00.000 | 15      | 15          | 15          |
+-------------------+-------------------------+---------+-------------+-------------+
```

You can also forecast a specific series with:

```sqlexample
CALL model2!FORECAST(SERIES_VALUE => [2,'umbrella'], FORECASTING_PERIODS => 2);
```

**Output**

The result shows only the next two steps for store 2’s sales of umbrellas.

```output
+-------------------+------------ ------------+-----------+-------------+-------------+
| SERIES            | TS                      | FORECAST  | LOWER_BOUND | UPPER_BOUND |
+-------------------+---------- --------------+-----------+-------------+-------------+
| [ 2, "umbrella" ] | 2020-01-13 00:00:00.000 | 14        | 14          | 14          |
| [ 2, "umbrella" ] | 2020-01-14 00:00:00.000 | 15        | 15          | 15          |
+-------------------+-------------------------+-----------+-------------+-------------+
```

> **Tip:**
>
> Specifying one series with the FORECAST method is more efficient than filtering the results of a multi-series
> forecast to include only the series you’re interested in, because only one series’ forecast is generated.

### Forecasting with features

If you want additional features (for example, holidays or weather) to influence your forecasts, you must include these features
in your training data. Here you create a view containing those fields from the `sales_data` table:

```sqlexample
CREATE OR REPLACE VIEW v2 AS SELECT date, sales, temperature, humidity, holiday
  FROM sales_data WHERE store_id=1 AND item='jacket';
SELECT * FROM v2;
```

**Output**

This is the first five rows of the result of the SELECT query.

```output
+-------------------------+--------+-------------+----------+----------+
| DATE                    | SALES  | TEMPERATURE | HUMIDITY | HOLIDAY  |
+-------------------------+--------+-------------+----------+----------+
| 2020-01-01 00:00:00.000 | 2      | 50          | 0.3      | new year |
| 2020-01-02 00:00:00.000 | 3      | 52          | 0.3      | null     |
| 2020-01-03 00:00:00.000 | 4      | 54          | 0.2      | null     |
| 2020-01-04 00:00:00.000 | 5      | 54          | 0.3      | null     |
| 2020-01-05 00:00:00.000 | 6      | 55          | 0.2      | null     |
+-------------------------+--------+-------------+----------+----------+
```

Now you can use this view to train a model. You are only required to specify the timestamp and target column names;
additional columns in the input data are assumed to be features for use in training.

```sqlexample
CREATE SNOWFLAKE.ML.FORECAST model3(
  INPUT_DATA => TABLE(v2),
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales'
);
```

To generate forecasts with this model, you must provide future values for the features to the model: in this case, `TEMPERATURE`,
`HUMIDITY` and `HOLIDAY`. This allows the model to adjust its sales forecasts based on temperature, humidity, and holiday
forecasts.

Now create a view from the `future_features` table containing this data for future timestamps:

```sqlexample
CREATE OR REPLACE VIEW v2_forecast AS select date, temperature, humidity, holiday
  FROM future_features WHERE store_id=1 AND item='jacket';
SELECT * FROM v2_forecast;
```

**Output**

```output
+-------------------------+-------------+----------+---------+
| DATE                    | TEMPERATURE | HUMIDITY | HOLIDAY |
+-------------------------+-------------+----------+---------+
| 2020-01-13 00:00:00.000 | 52          | 0.3      | null    |
| 2020-01-14 00:00:00.000 | 53          | 0.3      | null    |
+-------------------------+-------------+----------+---------+
```

Now you can generate a forecast using this data:

```sqlexample
CALL model3!FORECAST(
  INPUT_DATA => TABLE(v2_forecast),
  TIMESTAMP_COLNAME =>'date'
);
```

In this variation of the FORECAST method, you do not specify the number of timestamps to predict. Instead, the timestamps
of the forecast come from the `v2_forecast` view.

```output
+--------+-------------------------+-----------+--------------+--------------+
| SERIES | TS                      | FORECAST  | LOWER_BOUND  | UPPER_BOUND  |
+--------+-------------------------+-----------+--------------+--------------+
| NULL   | 2020-01-13 00:00:00.000 | 14        | 14           | 14           |
| NULL   | 2020-01-14 00:00:00.000 | 15        | 15           | 15           |
+--------+-------------------------+-----------+--------------+--------------+
```

## Troubleshooting and model assessment

You can use the following helper functions to assess your model performance, understand which features are most impactful to your model,
and to help you debug the training process if any error occurred:

* [model!SHOW_EVALUATION_METRICS()](../../sql-reference/classes/forecast/methods/show_evaluation_metrics.md);
* [model!EXPLAIN_FEATURE_IMPORTANCE()](../../sql-reference/classes/forecast/methods/explain_feature_importance.md);
* [model!SHOW_TRAINING_LOGS()](../../sql-reference/classes/forecast/methods/show_training_logs.md);

### Evaluation metrics

To get the evaluation metrics for your model, call the [<model_name>!SHOW_EVALUATION_METRICS](../../sql-reference/classes/forecast/methods/show_evaluation_metrics.md) method.
By default, the forecasting function evaluates all models it trains using a method called
[cross-validation](https://en.wikipedia.org/wiki/Cross-validation_(statistics)). This means that under the hood,
in addition to training the final model on all of the training data you provide, the function also trains models on subsets of your
training data. Those models are then used to predict your target metric on the withheld data, allowing the function to compare those
predictions to actual values in your historical data.

If you don’t need these evaluation metrics, you can set `evaluate` to FALSE. If you want to control the way cross-validation is run,
you can use the following parameters:

* **n_splits**: Represents the number of splits in your data for cross validation. Default is 1.
* **max_train_size**: Represents the maximum number of rows for a single training set.
* **test_size**: Limits number of rows included in each test set.
* **gap**: Represents the gap between the end of each training set and the start of the test set.

For complete details on evaluation parameters, see [Evaluation configuration](../../sql-reference/classes/forecast/commands/create-forecast.md).

> **Note:**
>
> Small datasets may not have enough data to perform evaluation. The total number of training rows must be equal to or greater
> than (n_splits \* test_size) + gap. If not enough data is available to train an evaluation model, no evaluation metrics are available
> even when `evaluate` is set to TRUE.
>
> When **n_splits** is 1 (the default), the standard deviation for evaluation metric values is NULL, as only a validation dataset is used.

#### Example

```sqlexample
CREATE OR REPLACE VIEW v_random_data AS SELECT
  DATEADD('minute', ROW_NUMBER() over (ORDER BY 1), '2023-12-01')::TIMESTAMP_NTZ ts,
  UNIFORM(1, 100, RANDOM(0)) exog_a,
  UNIFORM(1, 100, RANDOM(0)) exog_b,
  (MOD(SEQ1(),10) + exog_a) y
FROM TABLE(GENERATOR(ROWCOUNT => 500));

CREATE OR REPLACE SNOWFLAKE.ML.FORECAST model(
  INPUT_DATA => TABLE(v_random_data),
  TIMESTAMP_COLNAME => 'ts',
  TARGET_COLNAME => 'y'
);

CALL model!SHOW_EVALUATION_METRICS();
```

**Output**

```none
+--------+--------------------------+--------------+--------------------+------+
| SERIES | ERROR_METRIC             | METRIC_VALUE | STANDARD_DEVIATION | LOGS |
+--------+--------------------------+--------------+--------------------+------+
| NULL   | "MAE"                    |         2.49 |                NaN | NULL |
| NULL   | "MAPE"                   |        0.084 |                NaN | NULL |
| NULL   | "MDA"                    |         0.99 |                NaN | NULL |
| NULL   | "MSE"                    |        8.088 |                NaN | NULL |
| NULL   | "SMAPE"                  |        0.077 |                NaN | NULL |
| NULL   | "WINKLER_ALPHA=0.05"     |       12.101 |                NaN | NULL |
| NULL   | "COVERAGE_INTERVAL=0.95" |            1 |                NaN | NULL |
+--------+--------------------------+--------------+--------------------+------+
```

### Feature importance

To understand the relative importance of the features used in your model, use the [<model_name>!EXPLAIN_FEATURE_IMPORTANCE](../../sql-reference/classes/forecast/methods/explain_feature_importance.md)
method.

When you train a forecasting model, your model uses provided data, such as timestamps, your target metric, additional columns
you provide (features), and features that are automatically generated to improve the performance of your forecasts, to learn patterns
in your data. Training detects how important each of these is to making accurate predictions than others. Understanding the
relative importance of these features on a scale of 0 to 1 is the purpose of this helper function.

Under the hood, this helper function counts the number of times the model used each feature to make a decision. These feature importance
scores are then normalized to values between 0 and 1 so that their sum is 1. The resulting scores represent an approximate ranking of the
features in your trained model.

#### Key considerations for this feature

* Features that are close in score have similar importance.
* For extremely simple series (for example, when the target column has a constant value), all feature importance scores may be zero.
* Using multiple features that are very similar to each other may result in reduced importance scores for those features. For example,
  if two features are exactly identical, the model may treat them as interchangeable when making decisions, resulting in feature
  importance scores that are half of what those scores would be if only one of the identical features were included.

#### Example

This example uses the data from the evaluation example and calls the feature
importance method. You can see that the `exog_a` variable that was created is the second most important feature - behind all rolling
averages, which are aggregated under the `aggregated_endogenous_trend_features` feature name.

Execute the following statements to get the importance of the features:

```sqlexample
CALL model!EXPLAIN_FEATURE_IMPORTANCE();
```

**Output**

```output
+--------+------+--------------+---------------+---------------+
| SERIES | RANK | FEATURE_NAME | SCORE         | FEATURE_TYPE  |
+--------+------+--------------+---------------+---------------+
| NULL   |    1 | exog_a       |  31.414947903 | user_provided |
| NULL   |    2 | exog_b       |             0 | user_provided |
+--------+------+--------------+---------------+---------------+
```

### Troubleshooting

When you train multiple series with `CONFIG_OBJECT => 'ON_ERROR': 'SKIP'`, individual time series models can
fail to train without the overall training process failing. To understand which time series failed and why, call the
[<model_name>!SHOW_TRAINING_LOGS](../../sql-reference/classes/forecast/methods/show_training_logs.md) method.

### Example

```sqlexample
CREATE TABLE t_error(date TIMESTAMP_NTZ, sales FLOAT, series VARCHAR);
INSERT INTO t_error VALUES
  (TO_TIMESTAMP_NTZ('2019-12-30'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2019-12-31'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-01'), 2.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-02'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-03'), 3.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-04'), 7.0, 'A'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 10.0, 'B'), -- the same timestamp used again and again
  (TO_TIMESTAMP_NTZ('2020-01-06'), 13.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 12.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 15.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 14.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 18.0, 'B'),
  (TO_TIMESTAMP_NTZ('2020-01-06'), 12.0, 'B');

CREATE SNOWFLAKE.ML.FORECAST error_model(
  INPUT_DATA => TABLE(SELECT date, sales, series FROM t_error),
  SERIES_COLNAME => 'series',
  TIMESTAMP_COLNAME => 'date',
  TARGET_COLNAME => 'sales',
  CONFIG_OBJECT => {'ON_ERROR': 'SKIP'}
);

CALL error_model!SHOW_TRAINING_LOGS();
```

**Output**

```output
+--------+--------------------------------------------------------------------------+
| SERIES | LOGS                                                                     |
+--------+--------------------------------------------------------------------------+
| "B"    | {   "Errors": [     "At least two unique timestamps are required."   ] } |
| "A"    | NULL                                                                     |
+--------+--------------------------------------------------------------------------+
```

## Model management

To view a list of your models, use the [SHOW SNOWFLAKE.ML.FORECAST](../../sql-reference/classes/forecast/commands/show-forecast.md) command:

```sqlexample
SHOW SNOWFLAKE.ML.FORECAST;
```

To delete a model, use the [DROP SNOWFLAKE.ML.FORECAST](../../sql-reference/classes/forecast/commands/drop-forecast.md) command:

```sqlexample
DROP SNOWFLAKE.ML.FORECAST my_model;
```

Models are immutable and cannot be updated in place. Train a new model instead.

## Warehouse selection

A Snowflake [virtual warehouse](../warehouses.md) provides the compute resources for training and using the
machine learning models for this feature. This section provides general guidance on selecting the best type and size of
warehouse for this purpose, focusing on the training step, the most time-consuming and memory-intensive part of
the process.

There are two key factors to keep in mind when choosing a warehouse:

1. The number of rows and columns your data contains.
2. The number of distinct series your data contains.

You can use the following rules of thumb to choose your warehouse:

1. If you are training on a longer time series (> 5 million rows) or on many columns (many features), consider upgrading to
   [Snowpark-optimized warehouses](../warehouses-snowpark-optimized.md).
2. If you are training on many series, size up. The forecasting function distributes model training across all available nodes in your
   warehouse when you are training for multiple series at once.

The following table provides this same guidance:

| Series type | < 5 million rows | > 5 million rows and ≤ 100 million rows | > 100 million rows |
| --- | --- | --- | --- |
| One series | Standard warehouse; XS | Snowpark optimized warehouse; XS | Consider aggregating to a less frequent timestamp interval (e.g., hourly to daily) |
| Multiple series | Standard warehouse; Size up | Snowpark optimized warehouse; Size up | Consider batching training by series into multiple jobs |

As a rough estimate, training time is proportional to the number of rows in your time series. For example, on a XS standard warehouse,
with evaluation turned off (`CONFIG_OBJECT => {'evaluate': False}`), training on a 100,000-row dataset takes about
400 seconds. Training on a 1,000,000-row dataset takes about 850 seconds. With
evaluation turned on, training time increases roughly linearly by the number of splits used.

## Algorithm details

The forecasting algorithm used is specified by the (`CONFIG_OBJECT => {'method': '<method>'}`) config object
parameter. This parameter defaults to (`'method': 'best'`). When the method is set to `'best'`, the
algorithm used is an ensemble of multiple models, including [Prophet](https://facebook.github.io/prophet/),
[ARIMA](https://en.wikipedia.org/wiki/Autoregressive_integrated_moving_average) ,
[Exponential Smoothing](https://en.wikipedia.org/wiki/Exponential_smoothing) , and a
[gradient boosting machine](https://en.wikipedia.org/wiki/Gradient_boosting) (described further below).

When the method is set to `fast`, the algorithm used is a gradient boosting machine (GBM). Like an ARIMA model,
it uses a differencing transformation to model data with a non-stationary trend and uses auto-regressive lags of the
historical target data as model variables. Additionally, the algorithm uses rolling averages of historical target data
to help predict trends, and automatically produces cyclic calendar variables (such as day of week and week of year) from
timestamp data.

You can fit models with only historical target and timestamp data, or you may include features (extra columns) that
might have influenced the target value. Exogenous variables can be numerical or categorical and may be NULL (rows
containing NULLs for exogenous variables are not dropped).

The algorithm does not rely on one-hot encoding when training on categorical variables, so you can use categorical data
with many dimensions (high cardinality).

If your model incorporates features, when generating a forecast you must provide values for those features
at every timestamp of the full forecast horizon. Appropriate features could include weather data
(temperature, rainfall), company-specific information (historic and planned company holidays, advertisement campaigns,
event schedules), or any other external factors you believe may help predict your target variable.

The algorithm also generates prediction intervals, in addition to forecasts. A prediction interval is an estimated range
of values within an upper bound and a lower bound in which a certain percentage of data is likely to fall. For example,
a 0.95 value means that 95% of the data likely appears within the interval. You may specify a prediction interval
percentage, or use the default, which is 0.95. Lower and upper bounds of the prediction interval are returned as part of
the forecast output.

> **Important:**
>
> From time to time, Snowflake may refine the forecasting algorithm. Such improvements roll out through
> the regular Snowflake release process. You cannot revert to a previous version of the feature, but models you
> created with a previous version continue to use that version for predictions until deprecation through the Behavior Change Release process.

### Current Limitations

The current release has the following limitations:

* You cannot choose or adjust the forecasting algorithm.
* The minimum number of rows for the main forecasting algorithm is 12 per time series. For time series with between 2 and 11
  observations, forecasting produces a “naive” forecast where all forecasted values are equal to the last observed target
  value.
* The forecasting function does not provide parameters to override trend, seasonality, or seasonal amplitudes; these are
  inferred from the data.
* The minimum acceptable granularity of data is one second. (Timestamps must not be less than one second apart.)
* The minimum granularity of seasonal components is one minute. (The function cannot detect cyclic patterns at
  smaller time deltas.)
* The “season length” of autoregressive features is tied to the input frequency (24 for hourly data, 7 for daily data,
  and so on).
* Forecast models, once trained, are immutable. You cannot update existing models with new data; you must train an
  entirely new model.
* Models do not support versioning. Snowflake recommends retraining a model on a regular cadence,
  perhaps daily, weekly, or monthly, depending on how frequently you receive new data, allowing the model to adjust
  to changing patterns and trends.
* You cannot clone models or share models across roles or accounts. When cloning a schema or database, model objects are skipped.
* You cannot [replicate](../account-replication-intro.md) an instance of the FORECAST class.

## Granting privileges to create forecast objects

Training a forecasting model results in a schema-level object. Therefore, the role you use to create models must
have the CREATE SNOWFLAKE.ML.FORECAST privilege on the schema where the model is created, allowing the
model to be stored there. This privilege is similar to other schema privileges like CREATE TABLE or CREATE VIEW.

Snowflake recommends that you create a role named `analyst` to be used by people who need to create forecasts.

In the following example, the `admin` role is the owner of the schema `admin_db.admin_schema`. The
`analyst` role needs to create models in this schema.

```sqlexample
USE ROLE admin;
GRANT USAGE ON DATABASE admin_db TO ROLE analyst;
GRANT USAGE ON SCHEMA admin_schema TO ROLE analyst;
GRANT CREATE SNOWFLAKE.ML.FORECAST ON SCHEMA admin_db.admin_schema TO ROLE analyst;
```

To use this schema, a user assumes the role `analyst`:

```sqlexample
USE ROLE analyst;
USE SCHEMA admin_db.admin_schema;
```

If the `analyst` role has CREATE SCHEMA privileges in database `analyst_db`, the role can create a new schema
`analyst_db.analyst_schema` and create forecast models in that schema:

```sqlexample
USE ROLE analyst;
CREATE SCHEMA analyst_db.analyst_schema;
USE SCHEMA analyst_db.analyst_schema;
```

To revoke a role’s forecast model creation privilege on the schema, use [REVOKE <privileges> … FROM ROLE](../../sql-reference/sql/revoke-privilege.md):

```sqlexample
REVOKE CREATE SNOWFLAKE.ML.FORECAST ON SCHEMA admin_db.admin_schema FROM ROLE analyst;
```

## Cost considerations

For details on costs for using ML functions, see [Cost Considerations](../../guides-overview-ml-functions.md) in the ML functions overview.

---
title: TISAX (Assessment Level) AL 3
source: https://docs.snowflake.com/en/user-guide/cert-tisax.md
section: User Guide
---

# TISAX (Assessment Level) AL 3

This topic describes how Snowflake supports customers with TISAX compliance requirements.

## Understanding TISAX compliance requirements

Developed by the ENX Association and published by the German Association of the Automotive Industry or VDA, Trusted Information Security
Assessment Exchange or TISAX is a certification specifically designed to address the automotive industry’s cybersecurity requirements.
TISAX focuses on the secure processing of information from business partners, the protection of prototypes and data protection in
accordance with the General Data Protection Regulation (GDPR) for potential business transactions between automobile manufacturers and
their service providers or suppliers. TISAX was established in 2017 by VDA and the ENX Association. All organizations involved in business
with major German automotive industry partners must obtain a TISAX certification. Assessment Level 3 is required for data with a very high
need for protection, such as data classified as confidential or secret. Snowflake’s TISAX is scoped to Information Security and Assessment
Level 3.

For more information, please visit the official [TISAX website](https://portal.enx.com/en-US/TISAX).

---
title: Top Insights (Snowflake ML Functions)
source: https://docs.snowflake.com/en/user-guide/ml-functions/top-insights.md
section: User Guide
---

# Top Insights (Snowflake ML Functions)

Top Insights is an [ML Function](../../guides-overview-ml-functions.md) for key driver analysis, helping you to identify
drivers of a metric’s change over time or explain differences in a metric among various verticals. Top Insights is
powered by a decision tree model that separates a dataset into segments that have different behavior in relation to the
metric you want to analyze. With a few lines of SQL, you can integrate Top Insights into your BI workflows to
automatically monitor segments responsible for changes in any metric.

Use cases for Top Insights include:

* *Time-series analysis:* Identify drivers of a metric’s change over time. For example, automatically identify the
  locations, salespeople, customers, verticals, and other factors that are responsible for a recent revenue shortfall.
* *Vertical analysis:* Identify the drivers of differences in a metric among various verticals. For example, to understand
  which user segments are responsible for differences in new user growth between the United States and EMEA countries,
  to help shape targeted marketing campaigns.

## About Top Insights

Top Insights uses a decision tree model that separates a dataset into segments that have different behavior in relation
to the metric you want to analyze. The algorithm analyzes inter-segment differences between the metric in the control
group and the test group.

* The control group consists of the data points the model will use as a baseline.
* The test group consists of points of interest to be analyzed.

Top Insights then produces a number of possible contributor combinations, which are filtered based on their significance
and distinctiveness. Top Insights does not return redundant segments.

Good candidate datasets for analysis with Top Insights typically have a large number of columns or dimensions used to
segment data that make it difficult to intuitively identify what segments influence a metric. Dimensions can be
categorical (location, market segment, etc.) or continuous (that is, quantitative, such as temperature or attendance).

A Top Insights model is a schema-level object. You only need one instance, since the instance does not hold any state.

> **Tip:**
>
> Dimensions are inferred as categorical or continuous based on their type. Numeric values are taken to be continuous
> dimensions, while string and boolean values are considered categorical. To use a numeric value as a categorical
> dimension, cast it to a string.

## Required privileges

A TOP_INSIGHTS instance is a schema-level object. Therefore, the role you use to create the instance must have the
CREATE SNOWFLAKE.ML.TOP_INSIGHTS privilege on the schema where the instance is created. This privilege is similar to
other schema privileges like CREATE TABLE or CREATE VIEW.

If you are not the owner of the instance, you must have the USAGE privilege on it to be able to call its GET_DRIVERS
method.

## Using Top Insights

To use Top Insights in your queries and pipelines, first create an instance of the [TOP_INSIGHTS (SNOWFLAKE.ML)](../../sql-reference/classes/top-insights.md)
class. The SQL statement below creates an instance named `my_insights`. Creating the instance does not require
any arguments.

```sqlexample
CREATE SNOWFLAKE.ML.TOP_INSIGHTS IF NOT EXISTS my_insights();
```

After creating an instance, you can use the GET_DRIVERS method to extract key drivers from the dataset
you want to perform key driver analytics on. You pass the input data all in one piece (a
[reference](../../sql-reference/references.md) to a single table, view, or query) and provide the names of the
metric and label columns within the input data as additional arguments. Categorical and continuous dimensions
are inferred by their type and do not need to be specified explicitly.

```sqlexample
CALL my_insights!get_drivers (
  INPUT_DATA => TABLE(my_table),
  LABEL_COLNAME => 'label',
  METRIC_COLNAMe => 'sales');
```

## Preparing data for Top Insights

To use Top Insights, make sure you have a Boolean label column that distinguishes rows that are part of the control
group (labeled FALSE) from rows in the test group (labeled TRUE). This column is usually derived from other values in
the dataset, such as a timestamp or the name of a vertical, so it is common to create a view to do this. The view is
also a good place to filter out columns that are not part of your analysis.

The example below, for time-series analysis, creates a view with a label column based on a date range. Specifically, it
labels records in the latest month as TRUE (test data) and all previous records as FALSE (control data). Top
Insights can then analyze the continuous and categorical dimensions that explain differences in month-to-month changes
for the specified metric.

```sqlexample
CREATE VIEW input_table_time_series_label (
  ds, metric, dim_country, dim_vertical, label ) AS
  SELECT
    ds,
    metric,
    dim_country,
    dim_vertical,
    ds >= dateadd(month, -1, current_date) AS label
  FROM input_table;
```

The following example, for vertical analysis, creates a view with a label column based on the country. Specifically, it
labels records in non-US countries as TRUE, and labels records in the USA as FALSE. Top Insights will then analyze the
continuous and categorical dimensions that explain differences in a metric between these population groups.

```sqlexample
CREATE VIEW input_table_vertical_label (
  ds, metric,  dim_country, dim_vertical, label ) AS
  SELECT
    ds,
    metric,
    dim_country,
    dim_vertical,
    dim_country <> 'USA' as label
  FROM input_table;
```

## Interpreting the results

Top Insights returns a row for each segment of interest it finds in your data. Each row contains a plain-English
description of the segment, which can contain multiple criteria (for example, “COUNTRY = france, not VERTICAL = fashion,
not VERTICAL = tech” might describe a single segment). For each segment, Top Insights provides the following values
which quantify how much the segment contributes to the changes between the control and the test group.

| Output column | Description |
| --- | --- |
| METRIC_CONTROL | The total value of the metric in the control period in a specific segment. |
| METRIC_TEST | The total value of the metric in the test period in a specific segment. |
| CONTRIBUTION | The absolute impact of the segment on the change in the metric. |
| RELATIVE_CONTRIBUTION | The impact of the segment as a proportion of the overall change in the metric between test and control. |
| GROWTH_RATE | The change in the metric in the segment as a proportion of the metric in the control group in the segment. |

The contribution, relative contribution, and growth rate may be negative, indicating that a segment has a negative impact.

## Cost considerations

Using Top Insights incurs compute costs. Execution time scales with the number of rows and dimensions processed. See
[Understanding compute cost](../cost-understanding-compute.md) for general information about Snowflake
compute costs.

Top Insights performance does not generally benefit from using a larger warehouse than is needed to load all the data
being analyzed, which must fit into memory. Datasets that surpass about 1,000,000 rows and 1,000 columns may exhaust
memory. Snowflake recommends using a Snowpark-optimized warehouse rather than a larger standard warehouse.
Snowpark-optimized warehouses have more memory than standard warehouses of the corresponding size.

While instances of the Top Insights class are schema-level objects, they do not store any data and have negligible
impact on storage costs.

## Examples

The following examples demonstrate how to use Top Insights for time-series analysis and vertical analysis.

* Time-series analysis example
* Vertical analysis example

### Time-series analysis example

This example finds the segments contributing to differences in the metric between two time periods, specifically how
the country and vertical dimensions affect the metric after 2021.

Create the input table containing synthetic data for this example using the following SQL statements.

```sqlexample
CREATE OR REPLACE TABLE input_table(
  ds DATE, metric NUMBER, dim_country VARCHAR, dim_vertical VARCHAR);

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'usa' AS dim_country,
    'tech' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'usa' AS dim_country,
    'auto' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, seq4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'usa' AS dim_country,
    'fashion' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'usa' AS dim_country,
    'finance' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'canada' AS dim_country,
    'fashion' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'canada' AS dim_country,
    'finance' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'canada' AS dim_country,
    'tech' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'canada' AS dim_country,
    'auto' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'france' AS dim_country,
    'fashion' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'france' AS dim_country,
    'finance' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'france' AS dim_country,
    'tech' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 4, 1)) AS ds,
    UNIFORM(1, 10, RANDOM()) AS metric,
    'france' AS dim_country,
    'auto' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

-- Data for the test group

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 8, 1)) AS ds,
    UNIFORM(300, 320, RANDOM()) AS metric,
    'usa' AS dim_country,
    'auto' AS dim_vertica
  FROM TABLE(GENERATOR(ROWCOUNT => 365));

INSERT INTO input_table
  SELECT
    DATEADD(day, SEQ4(), DATE_FROM_PARTS(2020, 8, 1))  AS ds,
    UNIFORM(400, 420, RANDOM()) AS metric,
    'usa' AS dim_country,
    'finance' AS dim_vertical
  FROM TABLE(GENERATOR(ROWCOUNT => 365));
```

Create a view with a label column based on the datestamp.

```sqlexample
CREATE OR REPLACE VIEW input_view AS (
    SELECT
        metric,
        dim_country as country,
        dim_vertical as vertical,
        ds >= '2021-01-01' AS label
    FROM input_table
);
```

Now analyze this data by calling the GET_DRIVERS method of a TOP_INSIGHTS instance.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.TOP_INSIGHTS my_insights_model()

CALL my_insights_model!GET_DRIVERS(
  INPUT_DATA => TABLE(input_view),
  LABEL_COLNAME => 'label',
  METRIC_COLNAME => 'metric'
)
```

The output resembles the following:

```output
+---------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+---------------+
| CONTRIBUTOR                                                         | METRIC_CONTROL | METRIC_TEST | CONTRIBUTION | RELATIVE_CONTRIBUTION |   GROWTH_RATE |
|---------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+---------------|
| ["Overall"]                                                         |         128445 |      158456 |        30011 |         1             |  0.2336486434 |
| ["COUNTRY = usa"]                                                   |         116238 |      154574 |        38336 |         1.277398287   |  0.3298060875 |
| ["COUNTRY = usa","VERTICAL = finance"]                              |          64281 |       87423 |        23142 |         0.771117257   |  0.3600130676 |
| ["COUNTRY = usa","VERTICAL = auto"]                                 |          48930 |       66131 |        17201 |         0.5731565093  |  0.3515430206 |
| ["COUNTRY = usa","VERTICAL = tech"]                                 |           1543 |         503 |        -1040 |        -0.03465396021 | -0.6740116656 |
| ["COUNTRY = canada","VERTICAL = finance"]                           |           1538 |         482 |        -1056 |        -0.03518709806 | -0.6866059818 |
| ["COUNTRY = canada","VERTICAL = fashion"]                           |           1519 |         446 |        -1073 |        -0.03575355703 | -0.7063857801 |
| ["COUNTRY = france","VERTICAL = auto"]                              |           1534 |         460 |        -1074 |        -0.03578687814 | -0.7001303781 |
| ["COUNTRY = usa","not VERTICAL = auto","not VERTICAL = finance"]    |           3027 |        1020 |        -2007 |        -0.06687547899 | -0.6630327056 |
| ["COUNTRY = france","not VERTICAL = fashion","not VERTICAL = tech"] |           3100 |         962 |        -2138 |        -0.07124054513 | -0.6896774194 |
| ["COUNTRY = france","not VERTICAL = fashion"]                       |           4687 |        1456 |        -3231 |        -0.1076605245  | -0.689353531  |
| ["COUNTRY = france"]                                                |           6202 |        1947 |        -4255 |        -0.1417813468  | -0.68606901   |
| ["not COUNTRY = usa"]                                               |          12207 |        3882 |        -8325 |        -0.2773982873  | -0.6819857459 |
+---------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+---------------+
```

> **Note:**
>
> Since the input data is randomly generated, your results will differ from the results above.

The output is ordered by CONTRIBUTION, with the Overall segment always at the top. The CONTRIBUTOR column contains an
array of strings describing the segment; the rest of the columns quantify how that segment contributes to the metric value.
For more details, see Interpreting the results.

In the example output above, simply being in the United States has the largest impact on the metric. Two additional
segments, based on the finance and automotive verticals within the United States, also have outsize impact. After those,
the contribution of the segments turns negative.

### Vertical analysis example

This example compares credit usage of companies in two regions, USA and EMEA, with a goal of understanding how credit
usage in each segment differs between the regions.

Create the input table containing synthetic data for this example using the following SQL statements.

```sqlexample
CREATE OR REPLACE TABLE vertical_input_table(
  region VARCHAR, industry VARCHAR, num_employee NUMBER, credits FLOAT);

INSERT INTO vertical_input_table
  SELECT
    'USA' as region,
    ['technology', 'finance', 'healthcare', 'consumer'][MOD(ABS(RANDOM()), 4)] as industry,
    UNIFORM(100, 10000, RANDOM()) as num_employee,
    UNIFORM(1000, 3000, RANDOM()) AS credits,
  FROM TABLE(GENERATOR(ROWCOUNT => 450));

INSERT INTO vertical_input_table
  SELECT
    'EMEA' as region,
    ['technology', 'finance', 'healthcare', 'consumer'][MOD(ABS(RANDOM()), 4)] as industry,
    UNIFORM(100, 10000, RANDOM()) as num_employee,
    UNIFORM(100, 5000, RANDOM()) AS credits,
  FROM TABLE(GENERATOR(ROWCOUNT => 350));
```

Create a view with a label column based on the region.

```sqlexample
CREATE OR REPLACE VIEW vertical_input_view AS (
    SELECT
        credits,
        industry,
        num_employee,
        region = 'EMEA' AS label
    FROM vertical_input_table
);
```

Now analyze this data by calling the GET_DRIVERS method of a TOP_INSIGHTS instance.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.ML.TOP_INSIGHTS my_insights_model();

CALL my_insights_model!get_drivers(
  INPUT_DATA => TABLE(vertical_input_view),
  LABEL_COLNAME => 'label',
  METRIC_COLNAME => 'credits'
);
```

The output resembles the following:

```output
+-------------------------------------------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+------------------+|
| CONTRIBUTOR                                                                                           | METRIC_CONTROL | METRIC_TEST | CONTRIBUTION | RELATIVE_CONTRIBUTION |      GROWTH_RATE |
|-------------------------------------------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+------------------|
| ["Overall"]                                                                                           |         896672 |      895326 |        -1346 |           1           |  -0.001501106313 |
| ["not INDUSTRY = consumer","NUM_EMPLOYEE <= 6248.0","NUM_EMPLOYEE > 4235.0"]                          |         141138 |       70337 |       -70801 |          52.601040119 |  -0.5016437813   |
| ["NUM_EMPLOYEE <= 6248.0","NUM_EMPLOYEE > 4235.0"]                                                    |         188770 |      127320 |       -61450 |          45.653789004 |  -0.3255284208   |
| ["not INDUSTRY = technology","NUM_EMPLOYEE <= 8670.0","NUM_EMPLOYEE > 7582.5"]                        |         100533 |       42925 |       -57608 |          42.799405646 |  -0.5730257726   |
| ["not INDUSTRY = consumer","NUM_EMPLOYEE <= 5562.5","NUM_EMPLOYEE > 4235.0"]                          |         103851 |       47052 |       -56799 |          42.198365527 |  -0.54692781     |
+-------------------------------------------------------------------------------------------------------+----------------+-------------+--------------+-----------------------+------------------+
```

> **Note:**
>
> Since the input data is randomly generated, your results will differ from the results above.

The output is ordered by CONTRIBUTION, with the Overall segment always at the top. The CONTRIBUTOR column contains an
array of strings describing the segment; the rest of the columns describe how that segment contributes to the metric value.
For more details, see [<instance_name>!GET_DRIVERS](../../sql-reference/classes/top-insights/methods/get_drivers.md).

In the example output above, you can see that the segments are based on the industry and the number of employees that
the customer has. Top Insights automatically selects such ranges for continuous dimensions. Customers of a certain size
(between about 4,000 and 6,000 employees) seem to have an outsize negative impact.

## Current limitations

* The input metric must be an individual observation or an aggregate.
* For categorical features having more than 25 values, Top Insights uses only the top 25 most influential values to create segments.
* Processing more than 100 million rows in a single job may exhaust memory, even with Snowpark-optimized warehouses.

---
title: Top-K pruning for improved query performance
source: https://docs.snowflake.com/en/user-guide/querying-top-k-pruning-optimization.md
section: User Guide
---

# Top-K pruning for improved query performance

If a SELECT statement contains [LIMIT](../sql-reference/constructs/limit.md) and [ORDER BY](../sql-reference/constructs/order-by.md)
clauses, Snowflake ordinarily scans all eligible rows because any row might be part of the top-K results, where K
is the value from the LIMIT clause. With top-K pruning, Snowflake stops scanning when it determines that none of
the remaining rows can be in a result set that consists of K records.

Top-K pruning can improve the performance of SELECT statements that contain LIMIT and ORDER BY clauses. Queries on large
tables benefit the most from top-K pruning.

## Queries that use top-K pruning

Snowflake applies top-K pruning only when all of the following are true:

* The query contains both an ORDER BY clause and a LIMIT clause.
* The first column specified in the ORDER BY clause has one of the following data types:

  + An integer-representable data type (that is, an [INTEGER type](../sql-reference/data-types-numeric.md), a [DATE type](../sql-reference/data-types-datetime.md),
    or a [TIMESTAMP type](../sql-reference/data-types-datetime.md)). Expressions that return integers, such as casts, are not supported.
  + A [string or binary data type](../sql-reference/data-types-text.md), including [collated strings](../sql-reference/collation.md).
  + A field in a [VARIANT](../sql-reference/data-types-semistructured.md) column with a supported underlying type (that is, a type listed
    in the previous two bulleted list items) and cast to that underlying type.

  If multiple columns are specified, Snowflake considers only the first column.
* When the query contains a join, the ORDER BY column is a column from the larger table. In data warehousing, the
  larger table is often referred to as the [fact table](https://en.wikipedia.org/wiki/Fact_table) or probe side. The smaller table is referred to as the [dimension
  table](https://en.wikipedia.org/wiki/Dimension_%28data_warehouse%29#Dimension_table).

Queries with LIMIT clauses that are already fast (such as queries in which a full table scan is fast) might not benefit
from top-K pruning. Queries that return fewer than K rows also don’t benefit.

Queries that contain [ORDER BY](../sql-reference/constructs/order-by.md) … DESCENDING on a nullable field are pruned only if
they also specify NULLS LAST.

## Queries on VARIANT columns

This section provides examples of queries on a field in a VARIANT column to show the types of queries that can use top-K
pruning.

Create a table with a VARIANT column and insert data:

```sqlexample
CREATE OR REPLACE TABLE variant_topk_test (var_col VARIANT);

INSERT INTO variant_topk_test
  SELECT PARSE_JSON(column1)
    FROM VALUES
      ('{"s": "aa", "i": 1}'),
      ('{"s": "bb", "i": 2}'),
      ('{"s": "cc", "i": 3}'),
      ('{"s": "dd", "i": 4}'),
      ('{"s": "ee", "i": 5}'),
      ('{"s": "ff", "i": 6}'),
      ('{"s": "gg", "i": 7}'),
      ('{"s": "hh", "i": 8}'),
      ('{"s": "ii", "i": 9}'),
      ('{"s": "jj", "i": 10}');
```

This table is relatively small to provide an example, but remember that top-K pruning benefits larger tables.

The following queries on this table can use top-K pruning:

```sqlexample
SELECT * FROM variant_topk_test ORDER BY TO_VARCHAR(var_col:s) LIMIT 5;
```

```sqlexample
SELECT * FROM variant_topk_test ORDER BY var_col:s::VARCHAR LIMIT 5;
```

```sqlexample
SELECT * FROM variant_topk_test ORDER BY TO_NUMBER(var_col:i) LIMIT 5;
```

```sqlexample
SELECT * FROM variant_topk_test ORDER BY var_col:i::NUMBER LIMIT 5;
```

The following query can’t use top-K pruning because the value isn’t cast to the underlying data type:

```sqlexample
SELECT * FROM variant_topk_test ORDER BY var_col:s LIMIT 5;
```

The following query can’t use top-K pruning because the value is cast to a data type that is different from
the underlying data type:

```sqlexample
SELECT * FROM variant_topk_test ORDER BY var_col:i::VARCHAR LIMIT 5;
```

## Queries that contain an aggregate function

Queries that contain an [aggregate function](../sql-reference/functions-aggregation.md) are pruned only if they
meet all of the following conditions:

* They include a [GROUP BY](../sql-reference/constructs/group-by.md) clause.
* The first ORDER BY column is also a GROUP BY column.

For example, the following query can use top-K pruning because the first ORDER BY column `c2` is also a GROUP BY
column and isn’t an aggregated column:

```sqlexample
SELECT c1, c2, c3, COUNT(*) AS agg_col
  FROM mytable
  GROUP BY c1, c2, c3
  ORDER BY c2, c1, agg_col, c3
  LIMIT 5;
```

The following query can’t use top-K pruning because the first ORDER BY column `agg_col` is an aggregated column:

```sqlexample
SELECT c1, c2, c3, COUNT(*) AS agg_col
  FROM mytable
  GROUP BY c1, c2, c3
  ORDER BY agg_col, c2, c1
  LIMIT 5;
```

---
title: Track the use of data metric functions
source: https://docs.snowflake.com/en/user-guide/data-quality-monitor.md
section: User Guide
---

# Track the use of data metric functions

## List your DMFs

Use the [SHOW DATA METRIC FUNCTIONS](../sql-reference/sql/show-data-metric-functions.md) or [SHOW FUNCTIONS](../sql-reference/sql/show-functions.md) command to list data metric functions (DMFs)
in your account, database, or schema. For example, to list all DMFs in the account, execute the following:

```sqlexample
SHOW DATA METRIC FUNCTIONS IN ACCOUNT;
```

Alternatively, you can query the [Information Schema FUNCTIONS view](../sql-reference/info-schema/functions.md) or the
[Account Usage FUNCTIONS view](../sql-reference/account-usage/functions.md) to list your DMFs in the specified database or your account.

The `is_data_metric` column specifies whether the function is a DMF.

## List objects associated with a DMF

You can call the [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/functions/data_metric_function_references.md)
Information Schema table function to identify the tables or views associated with a given DMF.

To return a row for each object (table or view) that has the DMF named `count_positive_numbers` set on that table or
view, execute the following:

> ```sqlexample
> USE DATABASE governance;
> USE SCHEMA INFORMATION_SCHEMA;
> SELECT *
>   FROM TABLE(
>     INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_REFERENCES(
>       METRIC_NAME => 'governance.dmfs.count_positive_numbers'
>     )
>   );
> ```

You can also query the [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/account-usage/data_metric_function_references.md)
Account Usage view to determine these associations.

## List DMFs associated with an object

You can call the [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/functions/data_metric_function_references.md)
Information Schema table function to identify the DMFs associated with a given table or view.

To return a row for each DMF assigned to the table named `hr.tables.empl_info`, execute the following:

> ```sqlexample
> USE DATABASE governance;
> USE SCHEMA INFORMATION_SCHEMA;
> SELECT *
>   FROM TABLE(
>     INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_REFERENCES(
>       REF_ENTITY_NAME => 'hr.tables.empl_info',
>       REF_ENTITY_DOMAIN => 'table'
>     )
>   );
> ```

You can also query the [DATA_METRIC_FUNCTION_REFERENCES](../sql-reference/account-usage/data_metric_function_references.md)
Account Usage view to determine these associations.

---
title: Transactions and Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-transactions.md
section: User Guide
---

# Transactions and Apache Iceberg™ tables

This topic provides information about how Snowflake specifically handles transactions for Apache Iceberg™ tables.
The rules described in the Snowflake [Transactions](../sql-reference/transactions.md) topic also apply to Iceberg tables.

## Tables that use Snowflake as the catalog

For a [table that uses Snowflake as the catalog](tables-iceberg.md),
Snowflake manages the Iceberg metadata so that other query engines, such as Spark, can read from the table.

### Queries

When you use Snowflake to query this type of table, the table follows the general Snowflake transaction principles.

Snowflake currently supports [read committed isolation](../sql-reference/transactions.md) for transactions
for better concurrency and throughput, while Iceberg currently supports serializable or snapshot isolation.

### DDL statements

Snowflake processes [DDL](../sql-reference/sql-ddl-summary.md) statements as individual transactions and doesn’t isolate
DDL statements across multiple concurrent transactions. For more information, see [DDL in implicit transactions](../sql-reference/transactions.md).

This differs from how Iceberg tables typically handle transactions with DDL statements,
where a single committed transaction can include both [DML](../sql-reference/sql-dml.md) and DDL statements, or multiple bundled DDL statements.

> **Note:**
>
> * The Iceberg metadata doesn’t always show a new schema version for each individual DDL change. In some instances,
>   Snowflake groups DDL statements together and records the group as a single new schema version in the Iceberg metadata.
> * DDL changes might appear out of order in the Iceberg metadata, especially if a DDL change occurs in close proximity to other DDL or DML operations.

### Writes from external engines to Snowflake-managed tables

Snowflake doesn’t currently support writes to Snowflake-managed tables from external query engines, such as Spark.

## Tables that use an external catalog

For an Iceberg table that uses an external catalog,
Snowflake retrieves the latest table state from the external catalog when you run the [ALTER ICEBERG TABLE … REFRESH](../sql-reference/sql/alter-iceberg-table-refresh.md) command.

### Refresh transactions

Snowflake automatically commits ALTER ICEBERG TABLE … REFRESH statements inside a single-statement transaction.

In an [implicit transaction](../sql-reference/transactions.md),
Snowflake processes the statement in the same way it handles any other statement when [AUTOCOMMIT](../sql-reference/transactions.md) is enabled.

In an [explicit transaction](../sql-reference/transactions.md) (with multiple statements),
Snowflake executes and automatically commits the refresh as a single-statement transaction before committing the explicit transaction block.

### Writes to externally managed tables

Snowflake supports writes to externally managed tables that use a remote Iceberg REST catalog.
For more information, see [Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md).

## Multi-statement transactions

Snowflake supports multi-statement transactions by committing multiple DML statements atomically, and uses the following logic:

* Each DDL statement executes as an individual transaction when encountered.
* Each ALTER ICEBERG TABLE … REFRESH operation executes as a single transaction when encountered.
* All other statements within an explicit or implicit transaction are grouped and committed as a single transaction

Consider the following example of an explicit transaction block for an Iceberg table in Snowflake:

```sqlexample
BEGIN
  INSERT INTO table1 VALUES (1, "One");
  INSERT INTO table1 VALUES (2, "Two");
  ALTER ICEBERG TABLE table1 ALTER COLUMN c3 SET DATA TYPE ARRAY(long);
  INSERT INTO table1 VALUES (3, "Three");
  INSERT INTO table1 VALUES (4, "Four");
COMMIT;
```

1. When Snowflake encounters the ALTER ICEBERG TABLE statement,
   it commits the first two INSERT INTO TABLE statements (everything processed so far) as a transaction.
2. Snowflake then commits the ALTER ICEBERG TABLE statement as a separate transaction.
3. Finally, Snowflake creates a new transaction and processes the remaining INSERT INTO statements.
   Because the rest of the block contains no DDL or refresh statements, it commits the remaining transactions at the end of the block (at COMMIT).

---
title: Transform data during a load
source: https://docs.snowflake.com/en/user-guide/data-load-transform.md
section: User Guide
---

# Transform data during a load

Snowflake supports transforming data while loading it into a table using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command, dramatically simplifying your ETL pipeline for basic transformations. This feature helps you avoid the use of temporary tables to store pre-transformed data when reordering columns during a data load. This feature applies to both bulk loading and Snowpipe.

The COPY command supports:

* Column reordering, column omission, and casts using a [SELECT](../sql-reference/sql/select.md) statement. There is no requirement for your data files to have the same number and ordering of columns as your target table.
* The ENFORCE_LENGTH | TRUNCATECOLUMNS option, which can truncate text strings that exceed the target column length.

For general information about querying staged data files, see [Query data in staged files](querying-stage.md).

## Usage notes

This section provides usage information for transforming staged data files during a load.

### Supported file formats

The following file format types are supported for COPY transformations:

* CSV
* JSON
* Avro
* ORC
* Parquet
* XML

To parse a staged data file, it is necessary to describe its file format:

CSV:
:   The default format is character-delimited UTF-8 text. The default field delimiter is a comma character (`,`). The default record delimiter is the new line character. If the source data is in another format, specify the file format type and options.

    When querying staged data files, the `ERROR_ON_COLUMN_COUNT_MISMATCH` option is ignored. There is no requirement for your data files to have the same number and ordering of columns as your target table.

All other file format types:
:   Specify the format type and options that match your data files.

To explicitly specify file format options, set them in one of the following ways:

|  |  |
| --- | --- |
| **Querying staged data files using a SELECT statement:** | * As file format options specified for a named file format or stage object. The named file format/stage object can then be referenced in the SELECT statement. |
| **Loading columns from staged data files using a COPY INTO** *<table>* **statement:** | * As file format options specified directly in the COPY INTO *<table>* statement. * As file format options specified for a named file format or stage object. The named file format/stage object can then be referenced in the COPY INTO *<table>* statement. |

### Supported functions

Snowflake currently supports the following subset of functions for COPY transformations:

* [ARRAY_CONSTRUCT](../sql-reference/functions/array_construct.md)
* [ARRAY_SIZE](../sql-reference/functions/array_size.md)
* [ASCII](../sql-reference/functions/ascii.md)
* [CASE](../sql-reference/functions/case.md)
* [CAST , ::](../sql-reference/functions/cast.md)
* [CEIL](../sql-reference/functions/ceil.md)
* [CHECK_JSON](../sql-reference/functions/check_json.md)
* [CHECK_XML](../sql-reference/functions/check_xml.md)
* [CHR , CHAR](../sql-reference/functions/chr.md)
* [CONCAT , ||](../sql-reference/functions/concat.md)
* [CONVERT_TIMEZONE](../sql-reference/functions/convert_timezone.md)
* [ENDSWITH](../sql-reference/functions/endswith.md)
* [[ NOT ] EQUAL_NULL](../sql-reference/functions/equal_null.md)
* [FLOOR](../sql-reference/functions/floor.md)
* [GET](../sql-reference/functions/get.md)
* [GET_PATH , :](../sql-reference/functions/get_path.md)
* [HEX_DECODE_STRING](../sql-reference/functions/hex_decode_string.md)
* [HEX_ENCODE](../sql-reference/functions/hex_encode.md)
* [IFF](../sql-reference/functions/iff.md)
* [IFNULL](../sql-reference/functions/ifnull.md)
* [[ NOT ] ILIKE](../sql-reference/functions/ilike.md)
* [[ NOT ] IN](../sql-reference/functions/in.md)
* [IS_ARRAY](../sql-reference/functions/is_array.md)
* [IS_BOOLEAN](../sql-reference/functions/is_boolean.md)
* [IS_DECIMAL](../sql-reference/functions/is_decimal.md)
* [IS_INTEGER](../sql-reference/functions/is_integer.md)
* [IS_NULL_VALUE](../sql-reference/functions/is_null_value.md)
* [IS_OBJECT](../sql-reference/functions/is_object.md)
* [IS_TIME](../sql-reference/functions/is_time.md)
* [IS_TIMESTAMP_\*](../sql-reference/functions/is_timestamp.md)
* [LENGTH, LEN](../sql-reference/functions/length.md)
* [[ NOT ] LIKE](../sql-reference/functions/like.md)
* [LPAD](../sql-reference/functions/lpad.md)
* [LTRIM](../sql-reference/functions/ltrim.md)
* [MD5 , MD5_HEX](../sql-reference/functions/md5.md)
* [NULLIF](../sql-reference/functions/nullif.md)
* [NVL](../sql-reference/functions/nvl.md)
* [NVL2](../sql-reference/functions/nvl2.md)
* [OBJECT_CONSTRUCT](../sql-reference/functions/object_construct.md)
* [PARSE_IP](../sql-reference/functions/parse_ip.md)
* [PARSE_JSON](../sql-reference/functions/parse_json.md)
* [PARSE_URL](../sql-reference/functions/parse_url.md)
* [PARSE_XML](../sql-reference/functions/parse_xml.md)
* [RANDOM](../sql-reference/functions/random.md)
* [REGEXP_REPLACE](../sql-reference/functions/regexp_replace.md)
* [REGEXP_SUBSTR](../sql-reference/functions/regexp_substr.md)
* [REPLACE](../sql-reference/functions/replace.md)
* [REVERSE](../sql-reference/functions/reverse.md)
* [RPAD](../sql-reference/functions/rpad.md)
* [RTRIM](../sql-reference/functions/rtrim.md)
* [SPLIT](../sql-reference/functions/split.md)
* [SPLIT_PART](../sql-reference/functions/split_part.md)
* [STARTSWITH](../sql-reference/functions/startswith.md)
* [SUBSTR , SUBSTRING](../sql-reference/functions/substr.md)
* [TO_ARRAY](../sql-reference/functions/to_array.md)
* [TO_BINARY](../sql-reference/functions/to_binary.md)
* [TO_BOOLEAN](../sql-reference/functions/to_boolean.md)
* [TO_CHAR , TO_VARCHAR](../sql-reference/functions/to_char.md)
* [TO_DATE , DATE](../sql-reference/functions/to_date.md)

  Note that when this function is used to explicitly cast a value, neither the DATE_FORMAT file format option nor the [DATE_INPUT_FORMAT](../sql-reference/parameters.md) parameter is applied.
* [TO_DECFLOAT](../sql-reference/functions/to_decfloat.md)
* [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](../sql-reference/functions/to_decimal.md)
* [TO_DOUBLE](../sql-reference/functions/to_double.md)
* [TO_OBJECT](../sql-reference/functions/to_object.md)
* [TO_TIME , TIME](../sql-reference/functions/to_time.md)

  Note that when this function is used to explicitly cast a value, neither the TIME_FORMAT file format option nor the [TIME_INPUT_FORMAT](../sql-reference/parameters.md) parameter is applied.
* [TO_TIMESTAMP / TO_TIMESTAMP_\*](../sql-reference/functions/to_timestamp.md)

  Note that when this function is used to explicitly cast a value, neither the TIMESTAMP_FORMAT file format option nor the [TIMESTAMP_INPUT_FORMAT](../sql-reference/parameters.md) parameter is applied.
* [TO_VARIANT](../sql-reference/functions/to_variant.md)
* [TRIM](../sql-reference/functions/trim.md)
* [TRY_CAST](../sql-reference/functions/try_cast.md)
* [TRY_HEX_DECODE_STRING](../sql-reference/functions/try_hex_decode_string.md)
* [TRY_TO_BINARY](../sql-reference/functions/try_to_binary.md)
* [TRY_TO_BOOLEAN](../sql-reference/functions/try_to_boolean.md)
* [TRY_TO_DATE](../sql-reference/functions/try_to_date.md)

  Note that the COPY INTO *<table>* command does not support the optional `format` argument for this function.
* [TRY_TO_DECFLOAT](../sql-reference/functions/try_to_decfloat.md)
* [TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC](../sql-reference/functions/try_to_decimal.md)
* [TRY_TO_DOUBLE](../sql-reference/functions/try_to_double.md)
* [TRY_TO_TIME](../sql-reference/functions/try_to_time.md)

  Note that the COPY INTO *<table>* command does not support the optional `format` argument for this function.
* [UNICODE](../sql-reference/functions/unicode.md)
* [UUID_STRING](../sql-reference/functions/uuid_string.md)
* [XMLGET](../sql-reference/functions/xmlget.md)

Note in particular that the [VALIDATE](../sql-reference/functions/validate.md) function ignores the
SELECT list in a COPY INTO *<table>* statement. The function parses the files referenced in the statement
and returns any parsing errors. This behavior can be surprising if you expect the function to
evaluate the files in the context of the COPY INTO *<table>* expressions.

Note that COPY transformations do not support the [FLATTEN](../sql-reference/functions/flatten.md) function, or [JOIN](../sql-reference/constructs/join.md) or [GROUP BY](../sql-reference/constructs/group-by.md) (aggregate) syntax.

The list of supported functions might expand over time.

The following categories of functions are also supported:

* Scalar [SQL UDFs](../developer-guide/udf/sql/udf-sql-introduction.md).

> **Note:**
>
> For Scalar SQL UDFs, Snowflake has limited support for transformation error handling, and you may encounter inconsistent or unexpected ON_ERROR copy option behavior.

### Filter results

Filtering the results of a [FROM](../sql-reference/constructs/from.md) clause using a [WHERE](../sql-reference/constructs/where.md) clause is not supported. The ORDER BY, LIMIT,FETCH,TOP keywords in SELECT statements are also not supported.

The DISTINCT keyword in SELECT statements is not fully supported. Specifying the keyword can lead to inconsistent or unexpected ON_ERROR copy option behavior.

### VALIDATION_MODE parameter

The VALIDATION_MODE parameter does not support COPY statements that transform data during a load.

### CURRENT_TIME, CURRENT_TIMESTAMP default column values

Instead of using CURRENT_TIME, CURRENT_TIMESTAMP default column values to capture load time, we recommend that you query METADATA$START_SCAN_TIME to get an accurate time value of record loading. For more information, refer to [Query metadata for staged files](querying-metadata.md).

### MATCH_BY_COLUMN_NAME copy option

You are not allowed to use the MATCH_BY_COLUMN_NAME copy option with a SELECT statement for transforming data during a load in all cases. These two options can still be used separately, but cannot be used together. Any attempt to do so will result in the following error: `SQL compilation error: match_by_column_name is not supported with copy transform`.

## Transform CSV data

### Load a subset of table data

Load a subset of data into a table. For any missing columns, Snowflake inserts the default values.
The following example loads data from columns 1, 2, 6, and 7 of a staged CSV file:

> ```sqlexample
> copy into home_sales(city, zip, sale_date, price)
>    from (select t.$1, t.$2, t.$6, t.$7 from @mystage/sales.csv.gz t)
>    FILE_FORMAT = (FORMAT_NAME = mycsvformat);
> ```

### Reorder CSV columns during a load

The following example reorders the column data from a staged CSV file before loading it into a table.
Additionally, the COPY statement uses the [SUBSTR , SUBSTRING](../sql-reference/functions/substr.md) function to remove the first few characters of a string before
inserting it:

> ```sqlexample
> copy into home_sales(city, zip, sale_date, price)
>    from (select SUBSTR(t.$2,4), t.$1, t.$5, t.$4 from @mystage t)
>    FILE_FORMAT = (FORMAT_NAME = mycsvformat);
> ```

### Convert data types during a load

Convert staged data into other data types during a data load. All [conversion functions](../sql-reference/functions-conversion.md) are supported.

For example, convert strings as binary values, decimals, or timestamps using the [TO_BINARY](../sql-reference/functions/to_binary.md), [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](../sql-reference/functions/to_decimal.md), and [TO_TIMESTAMP / TO_TIMESTAMP_\*](../sql-reference/functions/to_timestamp.md) functions, respectively.

Sample CSV file:

> ```bash
> snowflake,2.8,2016-10-5
> warehouse,-12.3,2017-01-23
> ```

SQL statements:

> ```sqlexample
> -- Stage a data file in the internal user stage
> PUT file:///tmp/datafile.csv @~;
>
> -- Query the staged data file
> select t.$1,t.$2,t.$3 from @~/datafile.csv.gz t;
>
> -- Create the target table
> create or replace table casttb (
>   col1 binary,
>   col2 decimal,
>   col3 timestamp_ntz
>   );
>
> -- Convert the staged CSV column data to the specified data types before loading it into the destination table
> copy into casttb(col1, col2, col3)
> from (
>   select to_binary(t.$1, 'utf-8'),to_decimal(t.$2, '99.9', 9, 5),to_timestamp_ntz(t.$3)
>   from @~/datafile.csv.gz t
> )
> file_format = (type = csv);
>
> -- Query the target table
> select * from casttb;
>
> +--------------------+------+-------------------------+
> | COL1               | COL2 | COL3                    |
> |--------------------+------+-------------------------|
> | 736E6F77666C616B65 |    3 | 2016-10-05 00:00:00.000 |
> | 77617265686F757365 |  -12 | 2017-01-23 00:00:00.000 |
> +--------------------+------+-------------------------+
> ```

### Include sequence columns in loaded data

Create a sequence object using [CREATE SEQUENCE](../sql-reference/sql/create-sequence.md). When loading data into a table using the COPY command, access the object using a `NEXTVAL` expression to sequence the data in a target number column. For more information about using sequences in queries, see [Using Sequences](querying-sequences.md).

> ```sqlexample
> -- Create a sequence
> create sequence seq1;
>
> -- Create the target table
> create or replace table mytable (
>   col1 number default seq1.nextval,
>   col2 varchar,
>   col3 varchar
>   );
>
> -- Stage a data file in the internal user stage
> PUT file:///tmp/myfile.csv @~;
>
> -- Query the staged data file
> select $1, $2 from @~/myfile.csv.gz t;
>
> +-----+-----+
> | $1  | $2  |
> |-----+-----|
> | abc | def |
> | ghi | jkl |
> | mno | pqr |
> | stu | vwx |
> +-----+-----+
>
> -- Include the sequence nextval expression in the COPY statement
> copy into mytable (col1, col2, col3)
> from (
>   select seq1.nextval, $1, $2
>   from @~/myfile.csv.gz t
> )
> ;
>
> select * from mytable;
>
> +------+------+------+
> | COL1 | COL2 | COL3 |
> |------+------+------|
> |    1 | abc  | def  |
> |    2 | ghi  | jkl  |
> |    3 | mno  | pqr  |
> |    4 | stu  | vwx  |
> +------+------+------+
> ```

### Include AUTOINCREMENT / IDENTITY columns in loaded data

Set the AUTOINCREMENT or IDENTITY default value for a number column. When loading data into a table using the COPY command, omit the column in the SELECT statement. The statement automatically populates the column.

> ```sqlexample
> -- Create the target table
> create or replace table mytable (
>   col1 number autoincrement start 1 increment 1,
>   col2 varchar,
>   col3 varchar
>   );
>
> -- Stage a data file in the internal user stage
> PUT file:///tmp/myfile.csv @~;
>
> -- Query the staged data file
> select $1, $2 from @~/myfile.csv.gz t;
>
> +-----+-----+
> | $1  | $2  |
> |-----+-----|
> | abc | def |
> | ghi | jkl |
> | mno | pqr |
> | stu | vwx |
> +-----+-----+
>
> -- Omit the sequence column in the COPY statement
> copy into mytable (col2, col3)
> from (
>   select $1, $2
>   from @~/myfile.csv.gz t
> )
> ;
>
> select * from mytable;
>
> +------+------+------+
> | COL1 | COL2 | COL3 |
> |------+------+------|
> |    1 | abc  | def  |
> |    2 | ghi  | jkl  |
> |    3 | mno  | pqr  |
> |    4 | stu  | vwx  |
> +------+------+------+
> ```

## Transform semi-structured data

The examples in this section apply to any semi-structured data type except where noted.

### Load semi-structured data into separate columns

The following example loads repeating elements from a staged semi-structured file into separate table columns with different data types.

This example loads the following semi-structured data into separate columns in a relational table, with the `location` object values loaded
into a VARIANT column and the remaining values loaded into relational columns:

```sqlexample
-- Sample data:
{"location": {"city": "Lexington","zip": "40503"},"dimensions": {"sq_ft": "1000"},"type": "Residential","sale_date": "4-25-16","price": "75836"},
{"location": {"city": "Belmont","zip": "02478"},"dimensions": {"sq_ft": "1103"},"type": "Residential","sale_date": "6-18-16","price": "92567"},
{"location": {"city": "Winchester","zip": "01890"},"dimensions": {"sq_ft": "1122"},"type": "Condo","sale_date": "1-31-16","price": "89921"}
```

The following SQL statements load the file `sales.json` from the internal stage `mystage`:

> **Note:**
>
> This example loads JSON data, but the SQL statements are similar when loading semi-structured data of other types (e.g. Avro, ORC, etc.).
>
> For an additional example using Parquet data, see Load Parquet Data into Separate Columns (in this topic).

```sqlexample
 -- Create an internal stage with the file type set as JSON.
 CREATE OR REPLACE STAGE mystage
   FILE_FORMAT = (TYPE = 'json');

 -- Stage a JSON data file in the internal stage.
 PUT file:///tmp/sales.json @mystage;

 -- Query the staged data. The data file comprises three objects in NDJSON format.
 SELECT t.$1 FROM @mystage/sales.json.gz t;

 +------------------------------+
 | $1                           |
 |------------------------------|
 | {                            |
 |   "dimensions": {            |
 |     "sq_ft": "1000"          |
 |   },                         |
 |   "location": {              |
 |     "city": "Lexington",     |
 |     "zip": "40503"           |
 |   },                         |
 |   "price": "75836",          |
 |   "sale_date": "2022-08-25", |
 |   "type": "Residential"      |
 | }                            |
 | {                            |
 |   "dimensions": {            |
 |     "sq_ft": "1103"          |
 |   },                         |
 |   "location": {              |
 |     "city": "Belmont",       |
 |     "zip": "02478"           |
 |   },                         |
 |   "price": "92567",          |
 |   "sale_date": "2022-09-18", |
 |   "type": "Residential"      |
 | }                            |
 | {                            |
 |   "dimensions": {            |
 |     "sq_ft": "1122"          |
 |   },                         |
 |   "location": {              |
 |     "city": "Winchester",    |
 |     "zip": "01890"           |
 |   },                         |
 |   "price": "89921",          |
 |   "sale_date": "2022-09-23", |
 |   "type": "Condo"            |
 | }                            |
 +------------------------------+

 -- Create a target table for the data.
 CREATE OR REPLACE TABLE home_sales (
   CITY VARCHAR,
   POSTAL_CODE VARCHAR,
   SQ_FT NUMBER,
   SALE_DATE DATE,
   PRICE NUMBER
 );

 -- Copy elements from the staged file into the target table.
 COPY INTO home_sales(city, postal_code, sq_ft, sale_date, price)
 FROM (select
 $1:location.city::varchar,
 $1:location.zip::varchar,
 $1:dimensions.sq_ft::number,
 $1:sale_date::date,
 $1:price::number
 FROM @mystage/sales.json.gz t);

 -- Query the target table.
 SELECT * from home_sales;

+------------+-------------+-------+------------+-------+
| CITY       | POSTAL_CODE | SQ_FT | SALE_DATE  | PRICE |
|------------+-------------+-------+------------+-------|
| Lexington  | 40503       |  1000 | 2022-08-25 | 75836 |
| Belmont    | 02478       |  1103 | 2022-09-18 | 92567 |
| Winchester | 01890       |  1122 | 2022-09-23 | 89921 |
+------------+-------------+-------+------------+-------+
```

### Load Parquet data into separate columns

Similar to the previous example, but loads semi-structured data from a file in the Parquet format. This example is provided for users who
are familiar with Apache Parquet:

> ```sqlexample
> -- Create a file format object that sets the file format type. Accept the default options.
> create or replace file format my_parquet_format
>   type = 'parquet';
>
> -- Create an internal stage and specify the new file format
> create or replace temporary stage mystage
>   file_format = my_parquet_format;
>
> -- Create a target table for the data.
> create or replace table parquet_col (
>   custKey number default NULL,
>   orderDate date default NULL,
>   orderStatus varchar(100) default NULL,
>   price varchar(255)
> );
>
> -- Stage a data file in the internal stage
> put file:///tmp/mydata.parquet @mystage;
>
> -- Copy data from elements in the staged Parquet file into separate columns
> -- in the target table.
> -- Note that all Parquet data is stored in a single column ($1)
> -- SELECT list items correspond to element names in the Parquet file
> -- Cast element values to the target column data type
> copy into parquet_col
>   from (select
>   $1:o_custkey::number,
>   $1:o_orderdate::date,
>   $1:o_orderstatus::varchar,
>   $1:o_totalprice::varchar
>   from @mystage/mydata.parquet);
>
> -- Query the target table
> SELECT * from parquet_col;
>
> +---------+------------+-------------+-----------+
> | CUSTKEY | ORDERDATE  | ORDERSTATUS | PRICE     |
> |---------+------------+-------------+-----------|
> |   27676 | 1996-09-04 | O           | 83243.94  |
> |  140252 | 1994-01-09 | F           | 198402.97 |
> ...
> +---------+------------+-------------+-----------+
> ```

### Flatten semi-structured data

[FLATTEN](../sql-reference/functions/flatten.md) is a table function that produces a lateral view of a VARIANT, OBJECT, or ARRAY column. Using the sample data from Load semi-structured Data into Separate Columns, create a table with a separate row for each element in the objects.

```sqlexample
-- Create an internal stage with the file delimiter set as none and the record delimiter set as the new line character
create or replace stage mystage
  file_format = (type = 'json');

-- Stage a JSON data file in the internal stage with the default values
put file:///tmp/sales.json @mystage;

-- Create a table composed of the output from the FLATTEN function
create or replace table flattened_source
(seq string, key string, path string, index string, value variant, element variant)
as
  select
    seq::string
  , key::string
  , path::string
  , index::string
  , value::variant
  , this::variant
  from @mystage/sales.json.gz
    , table(flatten(input => parse_json($1)));

  select * from flattened_source;

+-----+-----------+-----------+-------+-------------------------+-----------------------------+
| SEQ | KEY       | PATH      | INDEX | VALUE                   | ELEMENT                     |
|-----+-----------+-----------+-------+-------------------------+-----------------------------|
| 1   | location  | location  | NULL  | {                       | {                           |
|     |           |           |       |   "city": "Lexington",  |   "location": {             |
|     |           |           |       |   "zip": "40503"        |     "city": "Lexington",    |
|     |           |           |       | }                       |     "zip": "40503"          |
|     |           |           |       |                         |   },                        |
|     |           |           |       |                         |   "price": "75836",         |
|     |           |           |       |                         |   "sale_date": "2017-3-5",  |
|     |           |           |       |                         |   "sq__ft": "1000",         |
|     |           |           |       |                         |   "type": "Residential"     |
|     |           |           |       |                         | }                           |
...
| 3   | type      | type      | NULL  | "Condo"                 | {                           |
|     |           |           |       |                         |   "location": {             |
|     |           |           |       |                         |     "city": "Winchester",   |
|     |           |           |       |                         |     "zip": "01890"          |
|     |           |           |       |                         |   },                        |
|     |           |           |       |                         |   "price": "89921",         |
|     |           |           |       |                         |   "sale_date": "2017-3-21", |
|     |           |           |       |                         |   "sq__ft": "1122",         |
|     |           |           |       |                         |   "type": "Condo"           |
|     |           |           |       |                         | }                           |
+-----+-----------+-----------+-------+-------------------------+-----------------------------+
```

### Split semi-structured elements and load as VARIANT values into separate columns

Following the instructions in Load semi-structured Data into Separate Columns, you can load individual elements from semi-structured data into different columns in your target table. Additionally, using the [SPLIT](../sql-reference/functions/split.md) function, you can split element values that contain a separator and load them as an array.

For example, split IP addresses on the dot separator in repeating elements. Load the IP addresses as arrays in separate columns:

> ```sqlexample
> -- Create an internal stage with the file delimiter set as none and the record delimiter set as the new line character
> create or replace stage mystage
>   file_format = (type = 'json');
>
> -- Stage a semi-structured data file in the internal stage
> put file:///tmp/ipaddress.json @mystage auto_compress=true;
>
> -- Query the staged data
> select t.$1 from @mystage/ipaddress.json.gz t;
>
> +----------------------------------------------------------------------+
> | $1                                                                   |
> |----------------------------------------------------------------------|
> | {"ip_address": {"router1": "192.168.1.1","router2": "192.168.0.1"}}, |
> | {"ip_address": {"router1": "192.168.2.1","router2": "192.168.3.1"}}  |
> +----------------------------------------------------------------------+
>
> -- Create a target table for the semi-structured data
> create or replace table splitjson (
>   col1 array,
>   col2 array
>   );
>
> -- Split the elements into individual arrays using the SPLIT function and load them into separate columns
> -- Note that all JSON data is stored in a single column ($1)
> copy into splitjson(col1, col2)
> from (
>   select split($1:ip_address.router1, '.'),split($1:ip_address.router2, '.')
>   from @mystage/ipaddress.json.gz t
> );
>
> -- Query the target table
> select * from splitjson;
>
> +----------+----------+
> | COL1     | COL2     |
> |----------+----------|
> | [        | [        |
> |   "192", |   "192", |
> |   "168", |   "168", |
> |   "1",   |   "0",   |
> |   "1"    |   "1"    |
> | ]        | ]        |
> | [        | [        |
> |   "192", |   "192", |
> |   "168", |   "168", |
> |   "2",   |   "3",   |
> |   "1"    |   "1"    |
> | ]        | ]        |
> +----------+----------+
> ```

---
title: Tri-Secret Secure in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-tss.md
section: User Guide
---

# Tri-Secret Secure in Snowflake

## Tri-Secret Secure overview

Using a dual-key encryption model together with Snowflake’s built-in user authentication enables three levels of data protection, known as
*Tri-Secret Secure*. Tri-Secret Secure offers you a level of security and control above Snowflake’s standard encryption.

Our dual-key encryption model combines a Snowflake-maintained key and a customer-managed key (CMK), which you create on the cloud provider
platform that hosts your Snowflake account. The model creates a composite master key that protects your Snowflake data. This composite master key
acts as an account master key by wrapping all of the keys in your account hierarchy. The composite master key is never used to encrypt raw data.
For example, the composite master key wraps table master keys, which are used to derive file keys that encrypt the raw data.

> **Attention:**
>
> Before engaging with Snowflake to enable Tri-Secret Secure for your account, you should carefully consider your responsibility for
> safeguarding your key as mentioned in [Customer-managed keys](security-encryption-manage.md). If the customer-managed key (CMK) in the composite master key
> hierarchy is revoked, your data can no longer be decrypted by Snowflake.
>
> If you have any questions or concerns, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> Snowflake also bears the same responsibility for the keys that we maintain. As with all security-related aspects of our service, we treat
> this responsibility with the utmost care and vigilance.
>
> All of our keys are maintained under strict policies that have enabled us to earn the highest security accreditations, including SOC 2
> Type II, PCI-DSS, HIPAA and [HITRUST CSF](intro-cloud-platforms.md).

### Tri-Secret Secure compatibility with hybrid tables

You must enable Dedicated Storage Mode if you intend to create hybrid tables in your account and TSS is already enabled or
will be enabled. For information, see [Hybrid Tables Dedicated Storage Mode for TSS](tables-hybrid-dedicated-storage-mode.md).

### Understanding CMK self-registration with support activation of Tri-Secret Secure

You can register a CMK for use with Tri-Secret Secure using Snowflake system functions. If you
decide to replace a CMK for use with Tri-Secret Secure, the SYSTEM$GET_CMK_INFO function informs you whether your new CMK is registered and
activated. After you self-register your CMK, you can contact Snowflake Support to enable your Snowflake account to use
Tri-Secret Secure with your CMK.

CMK self-registration with support activation provides the following benefits to you:

* Streamlines the steps to register and authorize your CMK.
* Provides transparency to the status of your CMK registration and activation with Tri-Secret Secure.
* Facilitates working with the key management service (KMS) in the cloud platform that hosts your Snowflake account.
* Enables you to rotate your CMK and register the new CMK for use with Tri-Secret Secure.

The following list shows how CMK self-registration with support activation works:

1. As the customer, you do the following actions:

   1. Create the CMK.
   2. Register the CMK.
   3. Generate information for the cloud provider.
   4. Apply the KMS policy.
   5. Confirm the connectivity between your Snowflake account and your CMK.
   6. Contact Snowflake Support to enable your Snowflake account to use Tri-Secret Secure.
2. Snowflake Support enables your Snowflake account to use Tri-Secret Secure based on the CMK that you register.

The steps in the following section avoid terms like *Amazon Resource Number* (ARN) to keep the procedure cloud agnostic. The steps are the
same regardless of the cloud platform that hosts your Snowflake account. However, the system function arguments for some of the steps are
different because each cloud platform service is different.

## Self-register a CMK

To self-register your CMK for use with Tri-Secret Secure, complete the following steps:

1. On the cloud provider, create a CMK.

   Do this step in the key management service (KMS) on the cloud platform that hosts your Snowflake account.
2. In Snowflake, call the [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md) system function to register your CMK with the KMS
   integration.

   Double-check the system function arguments for the cloud platform that hosts your Snowflake account.
3. In Snowflake, call the [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) system function to view the details for the CMK that you registered.
4. In Snowflake, call the [SYSTEM$GET_CMK_CONFIG](../sql-reference/functions/system_get_cmk_config.md) system function to generate the required information for
   the cloud provider.

   This policy allows Snowflake to access your CMK.

   > **Note:**
   >
   > If Microsoft Azure hosts your Snowflake account, you must pass the `tenant_id` value into the function.
5. On your cloud provider platform, use the output of the SYSTEM$GET_CMK_CONFIG function to authorize your CMK.
6. In Snowflake, call the [SYSTEM$VERIFY_CMK_INFO](../sql-reference/functions/system_verify_cmk_info.md) system function to confirm the connectivity between your
   Snowflake account and your CMK.
7. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and request that your Snowflake account be enabled to use Tri-Secret Secure.

   Be sure to mention the specific account that you want to use with Tri-Secret Secure.

If you want to enable private connectivity for a CMK that is already activated with Tri-Secret Secure, see [Enable a private connectivity endpoint for an active CMK](security-encryption-tss-self-serve-private.md)
for more information.

## View the status of your CMK

You can call [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) at any time, to check the registration and activation status of your CMK.

For example, depending on when you call SYSTEM$GET_CMK_INFO, the function returns the following output:

* Immediately after activating Tri-Secret Secure, returns `...is being activated...`. This means that rekeying isn’t complete.
* After the Tri-Secret Secure activation process completes, returns output that includes `...is activated...`. This means that your
  Snowflake account is using Tri-Secret Secure with the CMK that you registered.

## Change the CMK for Tri-Secret Secure

Snowflake system functions support changing your customer-managed key (CMK), based on your security needs. Use the same steps to register a new CMK as the
steps that you followed to register your initial CMK. When you complete those steps again by using a new key, the output of the system functions
differs. Read the output from each system function that you call during self-registration to confirm that you have changed your key. For
example, when you change your CMK, calling the SYSTEM$GET_CMK_INFO function returns a message that contains `...is being rekeyed...`.

## Integrate Tri-Secret Secure with AWS external key stores

Snowflake supports integrating Tri-Secret Secure with AWS external key stores to securely store and manage a customer-managed
key outside AWS. Snowflake officially tests and supports only Thales Hardware Security Modules (HSM) and Thales CipherTrust Cloud Key Manager (CCKM) data encryption products.

For more information about setting up and configuring Tri-Secret Secure with Thales solutions, see [How to use Thales External Key Store for Tri-Secret Secure on an AWS Snowflake account](https://community.snowflake.com/s/article/thales-xks-for-tss-aws#e3).

---
title: Tri-Secret Secure self-service in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-tss-self-serve.md
section: User Guide
---

# Tri-Secret Secure self-service in Snowflake

## Tri-Secret Secure overview

Using a dual-key encryption model together with Snowflake’s built-in user authentication enables three levels of data protection, known as
*Tri-Secret Secure*. Tri-Secret Secure offers you a level of security and control above Snowflake’s standard encryption.

Our dual-key encryption model combines a Snowflake-maintained key and a customer-managed key (CMK), which you create on the cloud provider
platform that hosts your Snowflake account. The model creates a composite master key that protects your Snowflake data. This composite master key
acts as an account master key by wrapping all of the keys in your account hierarchy. The composite master key is never used to encrypt raw data.
For example, the composite master key wraps table master keys, which are used to derive file keys that encrypt the raw data.

> **Attention:**
>
> Before enabling Tri-Secret Secure
> for your account, you should carefully consider your responsibility for safeguarding your key as mentioned in [Customer-managed keys](security-encryption-manage.md).
> If the CMK in the composite master key hierarchy is revoked, your data can no longer be decrypted by Snowflake.
>
> If you have any questions or concerns, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> Snowflake also bears the same responsibility for the keys that we maintain. As with all security-related aspects of our service, we treat
> this responsibility with the utmost care and vigilance.
>
> All of our keys are maintained under strict policies that have enabled us to earn the highest security accreditations, including SOC 2
> Type II, PCI-DSS, HIPAA and [HITRUST CSF](intro-cloud-platforms.md).

### Tri-Secret Secure compatibility with hybrid tables

You must enable Dedicated Storage Mode if you intend to create hybrid tables in your account and TSS is already enabled or
will be enabled. For information, see [Hybrid Tables Dedicated Storage Mode for TSS](tables-hybrid-dedicated-storage-mode.md).

### Understanding Tri-Secret Secure self-service

You can use Snowflake system functions to first register a CMK and then activate Tri-Secret Secure to use the CMK.
If you decide to replace a CMK for use with Tri-Secret Secure, the SYSTEM$GET_CMK_INFO function informs you whether your new
CMK is registered and activated. You can continue to use your account during the rekeying process.

Tri-Secret Secure self-service provides the following benefits to you:

* Facilitates working with the key management service (KMS) in the cloud platform that hosts your Snowflake account.
* Streamlines the steps to register and authorize your CMK.
* Provides transparency to your CMK registration and Tri-Secret Secure activation status.
* Enables you to manage Tri-Secret Secure without any downtime of your Snowflake account.

## Activate Tri-Secret Secure

This procedure works on all cloud provider platforms that Snowflake supports. See your specific cloud provider documentation for any steps
taken on the cloud provider platform.

To create and register your CMK, and then activate Tri-Secret Secure, complete the following steps:

1. On the cloud provider, create a CMK.

   Do this step in the key management service (KMS) on the cloud platform that hosts your Snowflake account.
2. In Snowflake, call the [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md) system function.

   * This system function registers your CMK with your Snowflake account.
   * Double-check the system function arguments to make sure they are correct for the cloud platform that hosts your Snowflake account.
   * When you call the SYSTEM$REGISTER_CMK_INFO function, Snowflake sends an email message to account administrators who have a validated email
     address. The message notifies the account administrator when to call the SYSTEM$ACTIVATE_CMK_INFO function to activate Tri-Secret Secure.
   > **Important:**
   >
   > You must wait 72 hours before activating Tri-Secret Secure (step 7). If you attempt to activate Tri-Secret Secure during this waiting
   > period, you see an error message that advises you to wait.
3. In Snowflake, call the [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) system function.

   This system function returns the registration status and details for the CMK that you registered.
4. In Snowflake, call the [SYSTEM$GET_CMK_CONFIG](../sql-reference/functions/system_get_cmk_config.md) system function.

   This system function generates the information required for your cloud provider to allow Snowflake to access your CMK.

   > **Note:**
   >
   > If Microsoft Azure hosts your Snowflake account, you must pass the `tenant_id` value into the function.
5. On your cloud provider platform, use the output of the SYSTEM$GET_CMK_CONFIG function to authorize your CMK.
6. In Snowflake, call the [SYSTEM$VERIFY_CMK_INFO](../sql-reference/functions/system_verify_cmk_info.md) system function.

   This system function confirms connectivity between your Snowflake account and your CMK.
7. In Snowflake, call the [SYSTEM$ACTIVATE_CMK_INFO](../sql-reference/functions/system_activate_cmk_info.md) system function.

   This system function activates Tri-Secret Secure with your registered CMK. This system function starts the rekeying process and
   generates an email message that notifies system administrators when the process finishes. The rekeying process can complete in under an
   hour, but might require up to 24 hours.

   > **Warning:**
   >
   > Snowflake uses the old CMK until the rekeying process completes. Do not remove access to the old CMK until receiving email notification
   > that the rekeying process completed.

To enable private connectivity for a CMK already activated with Tri-Secret Secure, see [Enable a private connectivity endpoint for an active CMK](security-encryption-tss-self-serve-private.md).

## View the status of your CMK

You can call [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) at any time, to check the registration and activation status of your CMK.

For example, depending on when you call SYSTEM$GET_CMK_INFO, the function returns the following output:

* Immediately after activating Tri-Secret Secure, returns `...is being activated...`. This means that rekeying isn’t complete.
* After the Tri-Secret Secure activation process completes, returns output that includes `...is activated...`. This means that your
  Snowflake account is using Tri-Secret Secure with the CMK that you registered.

## Change the CMK for Tri-Secret Secure

Snowflake system functions support changing your customer-managed key (CMK), based on your security needs. Use the same steps to register a new CMK as the
steps that you followed to register your initial CMK. When you complete those steps again by using a new key, the output of the system functions
differs. Read the output from each system function that you call during self-registration to confirm that you have changed your key. For
example, when you change your CMK, calling the SYSTEM$GET_CMK_INFO function returns a message that contains `...is being rekeyed...`.

## Use Tri-Secret Secure self-service with automatic key rotation

If you use your cloud provider’s automatic key rotation feature to maintain the lifecycle of your customer-managed keys (CMKs), you can rekey with
the latest version of your CMK by calling the SYSTEM$ACTIVATE_CMK_INFO function and providing the `'REKEY_SAME_CMK'` argument.

For more information, see [Customer-managed keys](security-encryption-manage.md).

## Deactivate Tri-Secret Secure

To deactivate Tri-Secret Secure in your account, call the [SYSTEM$DEACTIVATE_CMK_INFO](../sql-reference/functions/system_deactivate_cmk_info.md) system function.

## Deregister your current CMK

You can only register one CMK at a time with Tri-Secret Secure. When you register your CMK, if the [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md)
function fails because a different CMK exists, call the [SYSTEM$DEREGISTER_CMK_INFO](../sql-reference/functions/system_deregister_cmk_info.md) system function, as prompted.

## Integrate Tri-Secret Secure with AWS external key stores

Snowflake supports integrating Tri-Secret Secure with AWS external key stores to securely store and manage a customer-managed
key outside AWS. Snowflake officially tests and supports only Thales Hardware Security Modules (HSM) and Thales CipherTrust Cloud Key Manager (CCKM) data encryption products.

For more information about setting up and configuring Tri-Secret Secure with Thales solutions, see [How to use Thales External Key Store for Tri-Secret Secure on an AWS Snowflake account](https://community.snowflake.com/s/article/thales-xks-for-tss-aws#e3).

---
title: Tri-Secret Secure self-service with private connectivity in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-tss-self-serve-private.md
section: User Guide
---

# Tri-Secret Secure self-service with private connectivity in Snowflake

## Tri-Secret Secure overview

Using a dual-key encryption model together with Snowflake’s built-in user authentication enables three levels of data protection, known as
*Tri-Secret Secure*. Tri-Secret Secure offers you a level of security and control above Snowflake’s standard encryption.

Our dual-key encryption model combines a Snowflake-maintained key and a customer-managed key (CMK), which you create on the cloud provider
platform that hosts your Snowflake account. The model creates a composite master key that protects your Snowflake data. This composite master key
acts as an account master key by wrapping all of the keys in your account hierarchy. The composite master key is never used to encrypt raw data.
For example, the composite master key wraps table master keys, which are used to derive file keys that encrypt the raw data.

> **Attention:**
>
> Before enabling Tri-Secret Secure for your account, you should carefully consider your responsibility for safeguarding your key as
> mentioned in [Customer-managed keys](security-encryption-manage.md).
>
> * If the CMK in the composite master key hierarchy is revoked, your data can no longer be decrypted by Snowflake.
> * Do not deprovision a private endpoint on Microsoft Azure for an active CMK. Deleting such an endpoint
>   makes that CMK inaccessible, renders your account unusable, and means that your data can no longer be decrypted by Snowflake.
>
> If you have any questions or concerns, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> Snowflake also bears the same responsibility for the keys that we maintain. As with all security-related aspects of our service, we treat
> this responsibility with the utmost care and vigilance.
>
> All of our keys are maintained under strict policies that have enabled us to earn the highest security accreditations, including SOC 2
> Type II, PCI-DSS, HIPAA, and [HITRUST CSF](intro-cloud-platforms.md).

### Tri-Secret Secure compatibility with hybrid tables

You must enable Dedicated Storage Mode if you intend to create hybrid tables in your account and Tri-Secret Secure is already enabled or
will be enabled. For information, see [Hybrid Tables Dedicated Storage Mode for TSS](tables-hybrid-dedicated-storage-mode.md).

### Understanding Tri-Secret Secure self-service with private connectivity

You can use Snowflake system functions to register a CMK, provision private endpoints, and then activate a CMK for use with Tri-Secret Secure
through those endpoints. If you decide to replace a CMK for use with Tri-Secret Secure, the SYSTEM$GET_CMK_INFO function informs you
whether your new CMK is registered and activated. You can continue to use your account during the rekeying process.

Tri-Secret Secure self-service with private connectivity provides the following benefits:

* Facilitates working with the key management service (KMS) in the cloud platform that hosts your Snowflake account.
* Streamlines the steps to register and authorize your CMK.
* Provides transparency to your CMK registration and Tri-Secret Secure activation status.
* Enables you to manage Tri-Secret Secure without any downtime of your Snowflake account.
* Allows communication between your Snowflake-managed key and your cloud provider’s key vault through your private endpoints.

## Activate Tri-Secret Secure with private connectivity

This procedure works on all cloud provider platforms that Snowflake supports. See your specific cloud provider documentation
for any steps taken on the cloud provider platform. To enable private connectivity for a CMK already activated with Tri-Secret Secure, see
Enable a private connectivity endpoint for an active CMK.

To activate Tri-Secret Secure with private connectivity, complete the following steps:

1. On the cloud provider, create a CMK.

   Do this step in the KMS on the cloud platform that hosts your Snowflake account.
2. In Snowflake, call [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](../sql-reference/functions/system_provision_privatelink_endpoint_tss.md).

   This system function provisions a private endpoint for use with your KMS and Tri-Secret Secure.
3. On the cloud provider, approve the private endpoint.

   Do this step in the Azure portal as the owner of the Azure API Management resource.
   For more information, see the [Microsoft Azure](https://learn.microsoft.com/en-us/azure/key-vault/keys/quick-create-portal)
   , [AWS](https://docs.aws.amazon.com/vpc/latest/privatelink/use-resource-endpoint.html), or [Google Cloud](https://docs.cloud.google.com/vpc/docs/configure-private-service-connect-producer#publish-service) documentation.
4. In Snowflake, call [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md) with both arguments, as shown in the following example:

   ```sqlexample
   SELECT SYSTEM$REGISTER_CMK_INFO('<your_cmk_value>', 'true');
   ```

   * This system function registers your CMK with your Snowflake account.
   * Double-check the system function arguments to ensure that they are correct for the cloud platform that hosts your Snowflake account.
   * When you call the SYSTEM$REGISTER_CMK_INFO function, Snowflake sends an email message to account administrators who have a validated email
     address. The message notifies the account administrator when they can call ACTIVATE_CMK_INFO to activate Tri-Secret Secure.
   > **Important:**
   >
   > You must wait 72 hours before activating Tri-Secret Secure (step 9). If you attempt to activate Tri-Secret Secure during this waiting
   > period, you see an error message that advises you to wait.
5. In Snowflake, call [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md).

   This system function returns the registration status and details for the CMK that you registered.
6. In Snowflake, call [SYSTEM$GET_CMK_CONFIG](../sql-reference/functions/system_get_cmk_config.md).

   This system function generates the information required for your cloud provider to allow Snowflake to access your CMK.

   > **Note:**
   >
   > If Microsoft Azure hosts your Snowflake account, you must pass the `tenant_id` value into the function.
7. On your cloud provider platform, use the output of the SYSTEM$GET_CMK_CONFIG function to authorize your CMK.
8. In Snowflake, call [SYSTEM$VERIFY_CMK_INFO](../sql-reference/functions/system_verify_cmk_info.md).

   This system function confirms connectivity between your Snowflake account and your CMK.
9. In Snowflake, call [SYSTEM$ACTIVATE_CMK_INFO](../sql-reference/functions/system_activate_cmk_info.md).

   This system function activates Tri-Secret Secure with your registered CMK. This system function starts the rekeying process and
   generates an email message that notifies system administrators when the process finishes. The rekeying process can complete in under an
   hour, but might require up to 24 hours.

   > **Warning:**
   >
   > Snowflake uses the old CMK until the rekeying process completes. Don’t remove access to the old CMK until you receive email notification
   > that the rekeying process completed.

## View the status of your CMK

You can call [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) at any time, to check the registration and activation status of your CMK.

For example, depending on when you call SYSTEM$GET_CMK_INFO, the function returns the following output:

* Immediately after activating Tri-Secret Secure, returns `...is being activated...`. This means that rekeying isn’t complete.
* After the Tri-Secret Secure activation process completes, returns output that includes `...is activated...`. This means that your
  Snowflake account is using Tri-Secret Secure with the CMK that you registered.

If you have enabled private connectivity, calling SYSTEM$GET_CMK_INFO returns information about the registration and
activation status of your private connectivity endpoint and Tri-Secret Secure.

## Change the CMK for Tri-Secret Secure

Snowflake system functions support changing your customer-managed key (CMK), based on your security needs. Use the same steps to register a new CMK as the
steps that you followed to register your initial CMK. When you complete those steps again by using a new key, the output of the system functions
differs. Read the output from each system function that you call during self-registration to confirm that you have changed your key. For
example, when you change your CMK, calling the SYSTEM$GET_CMK_INFO function returns a message that contains `...is being rekeyed...`.

## Enable a private connectivity endpoint for an active CMK

If you have an active CMK and you want to enable private connectivity for it *without any rekeying activity*, complete the following steps:

1. In Snowflake, call [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](../sql-reference/functions/system_provision_privatelink_endpoint_tss.md).

   This system function provisions a private endpoint for use with your key management service (KMS) and Tri-Secret Secure.
2. On the cloud provider, approve the private endpoint.

   Do this step in the Azure portal as the owner of the Azure API Management resource.
   For more information, see the [Microsoft Azure](https://learn.microsoft.com/en-us/azure/key-vault/keys/quick-create-portal)
   , [AWS](https://docs.aws.amazon.com/vpc/latest/privatelink/use-resource-endpoint.html), or [Google Cloud](https://docs.cloud.google.com/vpc/docs/configure-private-service-connect-producer#publish-service) documentation.
3. In Snowflake, call [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md) by using both arguments, as shown in the following example:

   ```sqlexample
   SELECT SYSTEM$REGISTER_CMK_INFO('<your_cmk_value>', 'true');
   ```

   * This system function registers your CMK with your Snowflake account.
   * Double-check the system function arguments to make sure they are correct for the cloud platform that hosts your Snowflake account.
4. In Snowflake, call [SYSTEM$ACTIVATE_CMK_INFO](../sql-reference/functions/system_activate_cmk_info.md) by providing the `UPDATE_PRIVATELINK`
   argument, as shown in the following example:

   ```sqlexample
   SELECT SYSTEM$ACTIVATE_CMK_INFO('UPDATE_PRIVATELINK');
   ```

   When you run the SYSTEM$ACTIVATE_CMK_INFO function with the UPDATE_PRIVATELINK argument, it reads the value from the previous
   SYSTEM$REGISTER_CMK_INFO call. No rekeying occurs, so the function completes quickly. Optionally, call the SYSTEM$GET_CMK_INFO function
   again, to view your private connectivity status.

## Deprovision a private connectivity endpoint for Tri-Secret Secure

To prevent Snowflake from connecting to an external KMS resource using private connectivity, call the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS](../sql-reference/functions/system_deprovision_privatelink_endpoint_tss.md) function.

## Restore a private connectivity endpoint for Tri-Secret Secure

To re-establish Snowflake connectivity to an external KMS resource using a deprovisioned private connectivity endpoint, call the
[SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS](../sql-reference/functions/system_restore_privatelink_endpoint_tss.md) function.

## Use Tri-Secret Secure self-service with automatic key rotation

If you use your cloud provider’s automatic key rotation feature to maintain the lifecycle of your customer-managed keys (CMKs), you can rekey with
the latest version of your CMK by calling the SYSTEM$ACTIVATE_CMK_INFO function and providing the `'REKEY_SAME_CMK'` argument.

For more information, see [Customer-managed keys](security-encryption-manage.md).

## Deactivate Tri-Secret Secure

To deactivate Tri-Secret Secure in your account, call the [SYSTEM$DEACTIVATE_CMK_INFO](../sql-reference/functions/system_deactivate_cmk_info.md) system function.

## Deregister your current CMK

You can only register one CMK at a time with Tri-Secret Secure. When you register your CMK, if the [SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md)
function fails because a different CMK already exists, call the [SYSTEM$DEREGISTER_CMK_INFO](../sql-reference/functions/system_deregister_cmk_info.md) function, as prompted.

## Integrate Tri-Secret Secure with AWS external key stores

Snowflake supports integrating Tri-Secret Secure with AWS external key stores to securely store and manage a customer-managed
key outside AWS. Snowflake officially tests and supports only Thales Hardware Security Modules (HSM) and Thales CipherTrust Cloud Key Manager (CCKM) data encryption products.

For more information about setting up and configuring Tri-Secret Secure with Thales solutions, see
[How to use Thales External Key Store for Tri-Secret Secure on an AWS Snowflake account](https://community.snowflake.com/s/article/thales-xks-for-tss-aws#e3).

---
title: Tri-Secret Secure with secure share area accounts in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-tss-ssa.md
section: User Guide
---

# Tri-Secret Secure with secure share area accounts in Snowflake

## Tri-Secret Secure overview

Using a dual-key encryption model together with Snowflake’s built-in user authentication enables three levels of data protection, known as
*Tri-Secret Secure*. Tri-Secret Secure offers you a level of security and control above Snowflake’s standard encryption.

Our dual-key encryption model combines a Snowflake-maintained key and a customer-managed key (CMK), which you create on the cloud provider
platform that hosts your Snowflake account. The model creates a composite master key that protects your Snowflake data. This composite master key
acts as an account master key by wrapping all of the keys in your account hierarchy. The composite master key is never used to encrypt raw data.
For example, the composite master key wraps table master keys, which are used to derive file keys that encrypt the raw data.

> **Attention:**
>
> Before enabling Tri-Secret Secure for your secure share area account, carefully consider your responsibility for
> safeguarding your key, as described in [Customer-managed keys](security-encryption-manage.md). If the customer-managed key (CMK) in the composite master key
> hierarchy is revoked, your data can no longer be decrypted by Snowflake.
>
> If you have any questions or concerns, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> Snowflake also bears the same responsibility for the keys that we maintain. As with all security-related aspects of our service, we treat
> this responsibility with the utmost care and vigilance.
>
> All of our keys are maintained under strict policies that have enabled us to earn the highest security accreditations, including SOC 2
> Type II, PCI-DSS, HIPAA and [HITRUST CSF](intro-cloud-platforms.md).

### Tri-Secret Secure compatibility with hybrid tables

You must enable Dedicated Storage Mode if you intend to create hybrid tables in your account and TSS is already enabled or
will be enabled. For information, see [Hybrid Tables Dedicated Storage Mode for TSS](tables-hybrid-dedicated-storage-mode.md).

### Understanding secure share area accounts

When you publish a listing and enable cross-cloud auto-fulfillment, Snowflake can automatically create one or more secure share area (SSA) accounts in consumer regions. These SSA accounts have the following qualities:

* Are owned and billed to you, the provider.
* Are managed by Snowflake; you cannot access them directly.
* Store replicated copies of your data product for use by consumers in other regions.

Because SSA accounts contain your data, you can protect them with Tri-Secret Secure, just like your primary accounts. However:

* TSS must be enabled separately on each SSA account.
* You can’t run Snowflake commands directly inside an SSA account.
* You may still want KMS API events (for example, GenerateDataKeyWithoutPlaintext and Decrypt) for these accounts to appear consistently in your cloud provider logs for alerting and audit.

## Identify your SSA accounts

SSA accounts for auto-fulfillment follow a standard naming pattern and appear as global accounts in your organization.

To list your SSA accounts, run the following commands:

> ```sqlexample
> USE ROLE ORGADMIN;
>
> SHOW GLOBAL ACCOUNTS LIKE '%AUTO_FULFILLMENT_AREA%' IN ORGANIZATION <org_name>;
> ```

The output returns all accounts whose names include `AUTO_FULFILLMENT_AREA`; for example:

* `AUTO_FULFILLMENT_AREA$PUBLIC_AWS_US_EAST_1`
* `AUTO_FULFILLMENT_AREA$PUBLIC_AZURE_EASTUS2`

These account names are the values you will pass into the Tri-Secret Secure system functions when working with SSA accounts.

> **Note:**
>
> Older deployments might still contain SSA accounts with names that start with `SNOWFLAKE_MANAGED$PUBLIC_<CLOUD>_<REGION>`. You can include both patterns in your filters if needed.

### Understanding Tri-Secret Secure with secure share area accounts

You can use Tri-Secret Secure with SSA accounts to provide enhanced security for data shared through SSAs.
SSA accounts benefit from the same three-layer encryption protection as standard accounts, with the customer-managed key
(CMK) providing an additional layer of control over the encryption keys.

Tri-Secret Secure with secure share area accounts provides the following benefits:

* Enhanced security for data shared through secure share areas
* Control over encryption keys for secure share area data
* Compliance with regulatory requirements for data protection
* Ability to revoke access to encrypted data by revoking the CMK

## Activate Tri-Secret Secure for secure share area accounts

To activate Tri-Secret Secure for an SSA account, complete the following steps. These steps assume that you already [registered your CMK](security-encryption-tss.md).

1. In Snowflake, call the [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) system function to view the details for the CMK that you
   registered, and include the SSA account name.
2. In Snowflake, call the [SYSTEM$GET_CMK_CONFIG](../sql-reference/functions/system_get_cmk_config.md) system function to generate the required information for
   the cloud provider.

   This policy allows Snowflake to access your CMK.

   > **Note:**
   >
   > If Microsoft Azure hosts your Snowflake account, you must pass the `tenant_id` value into the function.
3. On your cloud provider platform, use the output of the SYSTEM$GET_CMK_CONFIG function to authorize your CMK.
4. In Snowflake, call the [SYSTEM$VERIFY_CMK_INFO](../sql-reference/functions/system_verify_cmk_info.md) system function, and include the SSA account name to
   confirm the connectivity between your Snowflake account and your CMK.
5. In Snowflake, call the [SYSTEM$ACTIVATE_CMK_INFO](../sql-reference/functions/system_activate_cmk_info.md) system function to activate Tri-Secret Secure for
   your secure share area account.

   This system function activates Tri-Secret Secure with your registered CMK. This system function starts the rekeying process and
   generates an email message that notifies system administrators when the process finishes. The rekeying process can complete in under an
   hour, but might require up to 24 hours.

   > **Warning:**
   >
   > Snowflake uses the old CMK until the rekeying process is complete. Do not remove access to the old CMK until you receive an email
   > notification indicating that the rekeying process is complete.

## View the status of your CMK

You can call [SYSTEM$GET_CMK_INFO](../sql-reference/functions/system_get_cmk_info.md) at any time, to check the registration and activation status of your CMK.

For example, depending on when you call SYSTEM$GET_CMK_INFO, the function returns the following output:

* Immediately after activating Tri-Secret Secure, returns `...is being activated...`. This means that rekeying isn’t complete.
* After the Tri-Secret Secure activation process completes, returns output that includes `...is activated...`. This means that your
  Snowflake account is using Tri-Secret Secure with the CMK that you registered.

## Change the CMK for Tri-Secret Secure

Snowflake system functions support changing your customer-managed key (CMK), based on your security needs. Use the same steps to register a new CMK as the
steps that you followed to register your initial CMK. When you complete those steps again by using a new key, the output of the system functions
differs. Read the output from each system function that you call during self-registration to confirm that you have changed your key. For
example, when you change your CMK, calling the SYSTEM$GET_CMK_INFO function returns a message that contains `...is being rekeyed...`.

## Deactivate Tri-Secret Secure

To deactivate Tri-Secret Secure in your secure share area account, call the [SYSTEM$DEACTIVATE_CMK_INFO](../sql-reference/functions/system_deactivate_cmk_info.md) system function.

## Deregister your current CMK

You can only register one CMK at a time with Tri-Secret Secure. When you register your CMK, if the
[SYSTEM$REGISTER_CMK_INFO](../sql-reference/functions/system_register_cmk_info.md) function fails because a different CMK exists, call the
[SYSTEM$DEREGISTER_CMK_INFO](../sql-reference/functions/system_deregister_cmk_info.md) system function, as prompted.

---
title: Trial accounts
source: https://docs.snowflake.com/en/user-guide/admin-trial-account.md
section: User Guide
---

# Trial accounts

A Snowflake trial account lets you evaluate/test Snowflake’s full range of innovative and powerful features with no cost or contractual obligations. To sign
up for a trial account, all you need is a valid email address; no payment information or other qualifying information is required.

## Signing up for a trial account

You can sign up for a free trial using the [self-service form](https://signup.snowflake.com/) (on the Snowflake website).

When you sign up for a trial account, you select your [cloud platform](intro-cloud-platforms.md), [region](intro-regions.md),
and [Snowflake Edition](intro-editions.md). These selections can affect how quickly you exhaust your free usage balance. For example, some features available in the Enterprise Edition consume additional [credits](cost-understanding-compute.md).

The balance of your free usage decreases as you consume credits to use [compute resources](cost-understanding-compute.md) and accrue costs associated with [storage](cost-understanding-data-storage.md). You can track your remaining balance at any time.

The trial continues for 30 days (from the sign-up date) or until you’ve depleted your free usage balance, whichever occurs first. At any time during the trial, you can cancel the trial or convert the account to a paid account.

At the end of the trial, the account is suspended. You can still log into a suspended account, but you cannot use any features, such as running a virtual warehouse,
loading data, or performing queries.

To reactivate a suspended trial account, you must enter a credit card, which converts it to a paid account.

## Using compute resources

[Virtual warehouses](warehouses.md) provide the compute power to [load data](../guides-overview-loading-data.md) and
[perform queries](../guides-overview-queries.md). These warehouses consume credits, which reduces your free usage balance. To begin, simply start a warehouse; any credits consumed by the warehouse will be deducted from your balance. If your credit consumption fully depletes your free usage balance, you must add a credit card to the account to continue using Snowflake.

Free credits are only consumed by the virtual warehouses you create in your account, and only when they are running.

> **Tip:**
>
> To prevent unintentional usage of your free credits:
>
> * Verify the size of your virtual warehouses before you start/resume them. The larger the warehouse, the more credits it consumes while running.
>   In many situations, Small or Medium size warehouses are sufficient for evaluating Snowflake’s loading and querying capabilities.
> * Do not disable [auto-suspend](warehouses-overview.md) when creating a warehouse. Choosing a short auto-suspend time period (e.g. 5 minutes or less) can reduce credit consumption.
>
> For additional tips on using your trial account:
>
> 1. In the left navigation bar, find the tile showing your remaining balance.
> 2. Select … » Using your trial credits.

## Using storage

As you load data into your trial account, the cost of that storage is subtracted from your free usage balance based on the standard On-Demand cost of a TB in your cloud platform and region. In addition to the cost of storage, loading data also consumes credits as it uses the compute resources of a warehouse.

## Tutorials for trial accounts

The following tutorials are available for trial accounts:

* [Load and query sample data using SQL](tutorials/tasty-bytes-sql-load.md)
* [Load data from cloud storage: Amazon S3](tutorials/load-from-cloud-tutorial.md)
* [Load data from cloud storage: Microsoft Azure](tutorials/load-from-cloud-tutorial-azure.md)
* [Load data from cloud storage: Google Cloud Storage](tutorials/load-from-cloud-tutorial-gcs.md)
* [Create users and grant roles](tutorials/users-and-roles-tutorial.md)

## Tracking your remaining balance

Users with the ACCOUNTADMIN role can track the remaining balance of their trial using a tile in the left navigation bar of
Snowsight.

From this tile you can also:

* Select Upgrade to convert the trial account to a paid account.
* Select … » see organization usage details to access the Usage page, which allows you to drill down into your credit
  consumption and storage costs.
* Select the … button to access resources that help you get the most out of your trial account.

## Converting to a paid account

You can add a credit card to a trial account at any time to convert it to a paid account.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Do one of the following:

   * Select Upgrade in the left navigation.
   * In the navigation menu, select Admin » Billing.
   * Click the Snowflake billing tab.
3. In the Payment method pane, click + Credit Card.
4. Enter the required information, and then select Add Card.

Adding a payment method converts your trial account to a self-service account. Snowflake knows trial account subscribers who convert to paid
accounts as on-demand, self-service (ODSS) customers. ODSS customers can edit the billing contact information for a self-service account using
Snowsight. For more information, see [Update billing contact information](billing-contacts.md).

Note that you can also change the credit card for a trial account, at any time, using the same interface in which you added the card.

> **Note:**
>
> Adding a credit card to a trial account converts it to a paid account without ending the trial period. During the remainder of the trial period, you can continue using your free credits and storage until the balance is exhausted, after which all additional credit consumption and storage costs will be charged.
>
> Unused balances expire when the trial period ends, at which time costs (for consuming credits and storing data) are charged to the credit card on file at the end of each billing cycle (typically monthly).
>
> For pricing details, see the [pricing page](https://www.snowflake.com/pricing/) (on the Snowflake website).

## Canceling a trial account

You can cancel a trial account at any time by contacting [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support)
and requesting the account to be canceled.

> **Note:**
>
> Currently, trial accounts cannot be canceled through the web interface. To cancel an account, you must contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Current limitations for trial accounts

The following features are not available for trial accounts:

* [External network access](../developer-guide/external-network-access/external-network-access-overview.md)
* [Hybrid tables](tables-hybrid.md)
* [Outbound private connectivity](private-connectivity-outbound.md)
* [Snowflake Openflow](data-integration/openflow/about.md)
* Using Duo for multi-factor authentication (MFA)
* [Cortex Code CLI](cortex-code/cortex-code-cli.md) (requires a Cortex Code CLI trial account; sign up [here](https://signup.snowflake.com/cortex-code?utm_source=docs&utm_medium=docs&utm_campaign=-us-en-all&utm_content=-admin-trial-account-current-limitations-for-trial-accounts)).
* Trial accounts without a valid payment method are limited to roughly ten credits of usage per day of
  [Snowflake Cortex AI Functions](snowflake-cortex/aisql.md).
  To remove this restriction, convert your trial account to a paid account.

---
title: Triggered tasks
source: https://docs.snowflake.com/en/user-guide/tasks-triggered.md
section: User Guide
---

# Triggered tasks

Use triggered tasks to run tasks whenever there’s a change in a [stream](streams-intro.md). This eliminates the need to poll a source frequently when the availability of new data is unpredictable. It also reduces latency because data is processed immediately.

Triggered tasks don’t use compute resources until the event is triggered.

## Considerations

Triggered tasks are supported with the following items:

* Tables
* Views
* Dynamic tables
* Apache Iceberg™ tables (managed and unmanaged)
* Data shares
* Directory tables. A directory table must be refreshed before a triggered task can detect the changes. To detect changes, you can perform either of the following tasks:

  + Set the [directory table to auto-refresh](data-load-dirtables-auto.md).
  + Refresh the directory table manually by using the [ALTER STAGE name REFRESH](../sql-reference/sql/alter-stage.md) command.

Triggered tasks aren’t supported with the following items:

* Hybrid tables
* Streams on external tables

For consumers to create streams on shared tables or secure views, the data provider must enable change tracking on the tables and views that are intended for sharing in their account; that is, `ALTER VIEW <view_name> SET CHANGE_TRACKING = TRUE;`. Without change tracking enabled, consumers can’t create streams on the shared data. For more information, see [Streams on shared objects](data-sharing-provider.md).

## Create a triggered task

Use [CREATE TASK](../sql-reference/sql/create-task.md), and set the following parameters:

* Define the target stream using the `WHEN` clause. (Do not include the `SCHEDULE` parameter.)
* Additional requirements based on [compute resources](tasks-intro.md):

  + To create a task that runs on a user-managed warehouse, include the `WAREHOUSE` parameter and define the warehouse.
  + To create a serverless task, you must include the `TARGET_COMPLETION_INTERVAL` parameter. Do not include the `WAREHOUSE` parameter. Snowflake estimates the resources needed using the target completion interval, and adjusts to complete the task in this time.

The following example creates a serverless triggered task that runs whenever data changes in a stream.

```sqlsyntax
CREATE TASK my_triggered_task
  TARGET_COMPLETION_INTERVAL='15 MINUTES'
  WHEN SYSTEM$STREAM_HAS_DATA('my_order_stream')
  AS
    INSERT INTO customer_activity
    SELECT customer_id, order_total, order_date, 'order'
    FROM my_order_stream;
```

### Migrate an existing task from a scheduled task to a triggered task

1. Suspend the task.
2. Use [ALTER TASK](../sql-reference/sql/alter-task.md) to update the task. Unset the `SCHEDULE` parameter, and then add the `WHEN` clause to define the target stream.
3. Resume the task.

```sqlsyntax
ALTER TASK task SUSPEND;
ALTER TASK task UNSET SCHEDULE;
ALTER TASK task MODIFY WHEN SYSTEM$STREAM_HAS_DATA('my_return_stream');
ALTER TASK task RESUME;
```

### Migrate an existing user-managed triggered task to a serverless triggered task

1. Suspend the task.
2. Use [ALTER TASK](../sql-reference/sql/alter-task.md) to update the task. Remove the `WAREHOUSE` parameter, and then set the `TARGET_COMPLETION_INTERVAL` parameter.
3. Resume the task.

```sqlsyntax
ALTER TASK task SUSPEND;
ALTER TASK task UNSET WAREHOUSE;
ALTER TASK task RESUME;
```

For more information, see [serverless tasks](tasks-intro.md).

## Allow a triggered task to run

When you create a triggered task, it starts in the suspended state.

To begin monitoring the stream:

* Resume the task using [ALTER TASK … RESUME](../sql-reference/sql/alter-task.md).

The task runs in the following conditions:

* When you first resume a triggered task, the task checks the stream for changes after the last task was run. If there is a change, the task runs; otherwise, it skips the task without using compute resources.
* If a task is running and the stream has new data, the task pauses until the current task is complete. Snowflake ensures only one instance of a task runs at a time.
* After a task is complete, Snowflake checks for changes in the stream again. If there are changes, the task runs again; if not, it skips the task.
* The task runs whenever new data is detected in the stream.
* If the stream data is hosted on a directory table, you detect changes by performing either of the following tasks:
* If a task hasn’t run for 12 hours, Snowflake schedules a health check to prevent streams from becoming stale.
  The timing of this health check isn’t guaranteed.
  If Snowflake detects no changes, the task is skipped without using compute resources.
  Task instructions must consume stream data before data retention expires; otherwise, the stream becomes stale.
  For more information, see [Avoiding stream staleness](streams-manage.md).
* Triggered tasks run at most every 30 seconds by default. If a task gets triggered again while running, the next run starts 30 seconds after the previous one was scheduled. You can lower this interval to 10 seconds by setting the [USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS](../sql-reference/parameters.md) parameter.
* When a task is triggered by [Streams on views](streams-intro.md), then any changes to tables referenced by the Streams on Views query will also trigger the task, regardless of any joins, aggregations, or filters in the query.

## Monitor triggered tasks

* In the `SHOW TASKS` and `DESC TASK` output, the `SCHEDULE` property displays `NULL` for triggered tasks.
* In the output of the task_history view of the information_schema and account_usage schemas, the SCHEDULED_FROM column displays TRIGGER.

## Examples

Example 1: Create a user-managed task that runs whenever data changes in either of two streams.

```sqlsyntax
CREATE TASK triggered_task_either_of_two_streams
  WAREHOUSE = my_warehouse
  WHEN SYSTEM$STREAM_HAS_DATA('my_return_stream')
    OR SYSTEM$STREAM_HAS_DATA('my_order_stream')
  AS
    INSERT INTO customer_activity
    SELECT customer_id, return_total, return_date, 'return'
    FROM my_return_stream
    UNION ALL
    SELECT customer_id, order_total, order_date, 'order'
    FROM my_order_stream;
```

Example 2: Create a user-managed task to run whenever data changes are detected in two different data streams. Because the task uses the AND conditional, the task is skipped if only one of the two streams has new data.

```sqlsyntax
CREATE TASK triggered_task_both_streams
  WAREHOUSE = my_warehouse
  WHEN SYSTEM$STREAM_HAS_DATA('orders_stream')
    AND SYSTEM$STREAM_HAS_DATA('my_order_stream')
  AS
    INSERT INTO completed_promotions
    SELECT order_id, order_total, order_time, promotion_id
    FROM orders_stream
    WHERE promotion_id IS NOT NULL;
```

Example 3: Create a user-managed task that runs whenever data changes in a directory table. In the example, a stream — my_directory_table_stream — is hosted on a [directory table](data-load-dirtables-manage.md) on a stage called my_test_stage.

```sqlsyntax
CREATE TASK triggered_task_directory_table
  WAREHOUSE = my_warehouse
  WHEN SYSTEM$STREAM_HAS_DATA('my_directory_table_stream')
  AS
    INSERT INTO tasks_runs
    SELECT 'trigger_t_internal_stage', relative_path, size,
            last_modified, file_url, etag, metadata$action
    FROM my_directory_table_stream;
```

To validate the triggered task, data is added to the stage.

```sqlsyntax
COPY INTO @my_test_stage/my_test_file
  FROM (SELECT 100)
  OVERWRITE=TRUE
```

The directory table is then refreshed manually, which triggers the task.

```sqlsyntax
ALTER STAGE my_test_stage REFRESH
```

---
title: Troubleshoot budgets
source: https://docs.snowflake.com/en/user-guide/budgets/troubleshoot.md
section: User Guide
---

# Troubleshoot budgets

This topic explains how to monitor budgets for problems and provides
solutions to common issues.

## Using an event table to monitor budgets

You can use an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) to collect telemetry data related to budgets.
After Snowflake starts collecting the data in the event table, you can query the table, create a stream to track changes, or
set alerts to send notifications when certain events occur.

If you don’t want to collect telemetry data for budgets, you must set the
[ENABLE_BUDGET_EVENT_LOGGING](../../sql-reference/parameters.md) account parameter to `FALSE` to turn it off.

### Understanding the events

The following table describes the values in the event table that correspond to budget events so you can focus on the appropriate
events. For detailed information about the structure of an event table, see [Event table columns](../../developer-guide/logging-tracing/event-table-columns.md).

| Event table column | Field | Value | Description |
| --- | --- | --- | --- |
| `resource_attributes` | `snow.cost.budget.id` | `budget_id` | Unique ID of the budget instance. |
|  | `snow.cost.budget.name` | `budget_name` | Fully qualified name of the budget instance. |
| `scope` | `name` | `snow.cost.budget` | Constant identifier for all budget telemetry events. |
| `record_type` | n/a | `EVENT` | Indicates a budget log event. |
| `record` | `name` | `event_name` | Descriptive event name. Possible values include the following:   * `BUDGET_UNVERIFIED_RECIPIENTS` — Occurs when email addresses are not in the integration’s allowed recipients list or there   are email addresses that are not verified. * `BUDGET_INVALID_INTEGRATION` — Occurs when a notification integration doesn’t exist or the user lacks access to it. |
|  | `severity_text` | `INFO`, `WARNING`, or `ERROR` | Severity level of budget event. |
| `value` | `message` | `message` | Descriptive event message, often including contextual details such as an integration name or operation. |

Use the following examples to better understand how to identify budget events in an event table.

Query: Find all events related to the propagation of all budgets within the account
:   ```sqlexample
    SELECT
        TIMESTAMP,
        RESOURCE_ATTRIBUTES,
        SCOPE,
        RECORD_TYPE,
        RECORD,
        VALUE
      FROM SNOWFLAKE.TELEMETRY.EVENTS
      WHERE
        RECORD_TYPE = 'EVENT' AND
        SCOPE['name'] = 'snow.cost.budget';
    ```

Query: Find all events related to a specific budget (for example, `MY_DB.SCH1.MY_BUDGET`)
:   ```sqlexample
    SELECT
        TIMESTAMP,
        RESOURCE_ATTRIBUTES,
        SCOPE,
        RECORD_TYPE,
        RECORD,
        VALUE
      FROM SNOWFLAKE.TELEMETRY.EVENTS
      WHERE
        RECORD_TYPE = 'EVENT' AND
        SCOPE['name'] = 'snow.cost.budget'
        AND RESOURCE_ATTRIBUTES['snow.cost.budget.name'] ILIKE 'MY_DB.SCH1.MY_BUDGET';
    ```

## Troubleshooting specific problems

The following scenarios can help you troubleshoot issues that can occur when creating or editing budgets:

* You can’t activate the account budget
* You can’t create a custom budget
* You can’t activate a custom budget
* You can’t call methods on the account budget
* You can’t add or remove objects from a custom budget
* You can’t set email notifications for a budget
* You can’t successfully call the GET_SERVICE_TYPE_USAGE method
* The Budgets feature is not available for your account

### You can’t activate the account budget

There are multiple reasons you might be unable to activate your account budget:

|  |  |
| --- | --- |
| Error | ```output Unknown user-defined function SNOWFLAKE.LOCAL.ACTIVATE ``` |
| Cause | If your Snowflake account is new, the account budget is not yet available in your account. |
| Solution | Wait for the account budget to be available in your newly created account. You can activate it after it becomes available. |

|  |  |
| --- | --- |
| Error | ```output FAILURE: Uncaught exception of type 'BUDGET_ALREADY_ACTIVATED' on line X at position X ``` |
| Cause | The account budget has already been activated. |
| Solution | You can call the [<budget_name>!GET_CONFIG](../../sql-reference/classes/budget/methods/get_config.md) method to view the activation timestamp:  ```sqlexample CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_CONFIG(); ``` |

|  |  |
| --- | --- |
| Error | ```output -20000 (P0001): Uncaught exception of type 'NO_PERMISSION_TO_ACTIVATE_BUDGET' on line X at position X ``` |
| Cause | Your account does not yet support the Budgets feature. |
| Solution | The Budgets feature is not available for accounts in Gov regions. Support for Gov regions will be available in a future release. |

### You can’t create a custom budget

There are multiple reasons you might be unable to create a custom budget.

|  |  |
| --- | --- |
| Error | ```output FAILURE: SQL access control error: Insufficient privileges to operate on class 'BUDGET' ``` |
| Cause | The role you are using does not have the privileges required to create custom budgets. |
| Solution | Use a role with the required privileges. See [Create a custom role to create budgets](custom-budget.md). |

|  |  |
| --- | --- |
| Error | ```output FAILURE: Uncaught exception of type 'STATEMENT_ERROR' on line 0 at position -1 : Uncaught exception of type 'UNSUPPORTED_BUDGET_TYPE' on line X at position X ``` |
| Cause | You pass arguments to the constructor method to create a budget. |
| Solution | See [CREATE BUDGET](../../sql-reference/classes/budget/commands/create-budget.md) and edit your create statement. |

### You can’t activate a custom budget

|  |  |
| --- | --- |
| Error | ```output FAILURE: Uncaught exception of type 'STATEMENT_ERROR' on line 0 at position -1 : Uncaught exception of type 'UNSUPPORTED_BUDGET_TYPE' on line X at position X ``` |
| Cause | You tried to call the ACTIVATE method on a custom budget. |
| Solution | The ACTIVATE method is only available on the account budget. After you have created a custom budget, use the [<budget_name>!SET_SPENDING_LIMIT](../../sql-reference/classes/budget/methods/set_spending_limit.md) and [<budget_name>!SET_EMAIL_NOTIFICATIONS](../../sql-reference/classes/budget/methods/set_email_notifications.md) methods to configure the budget and start receiving notification emails. |

### You can’t call methods on the account budget

There are multiple reasons why calling a method on the account budget might fail.

|  |  |
| --- | --- |
| Error | ```output -20000 (P0001): Uncaught exception of type 'FUNCTION_NOT_SUPPORTED_FOR_ACCOUNT_ROOT_BUDGET' on line 11 at position 18 ``` |
| Cause | You tried to call any of the following methods on the account budget:   * [ADD_RESOURCE()](../../sql-reference/classes/budget/methods/add_resource.md) * [REMOVE_RESOURCE()](../../sql-reference/classes/budget/methods/remove_resource.md) * [GET_LINKED_RESOURCES()](../../sql-reference/classes/budget/methods/get_linked_resources.md) |
| Solution | These methods are not available on the account budget. The account budget monitors all supported objects in the account and objects cannot be added or removed. For more information, see [Account budget and custom budgets](../budgets.md). |

|  |  |
| --- | --- |
| Error | ```output -20000 (P0001): Uncaught exception of type 'ACCOUNT_ROOT_BUDGET_NOT_ACTIVATED' on line X at position X ``` |
| Cause | You tried to call a method on the account budget before the account budget is activated. |
| Solution | [Activate the account budget](account-budget.md). |

### You can’t add or remove objects from a custom budget

To successfully add or remove an object from a custom budget, the role used to call the method must have the
[required privileges and role](monitor.md).

|  |  |
| --- | --- |
| Error | ```output 002141 (42601): SQL compilation error: Unknown user-defined function <budget_db>.<budget_schema>.<budget_name>!ADD_RESOURCE ``` |
| Cause | The role you used to call the instance method does not have the required privileges to add (or remove) objects from the budget. |
| Solution | Grant the required instance role and privileges to the role used to call the method. For more information, see [Create a custom role to monitor a custom budget](monitor.md). |

|  |  |
| --- | --- |
| Error | ```output 002003 (02000): SQL compilation error: <object_type> '<object_name>' does not exist or not authorized. ``` |
| Cause | You tried to add an object to a custom budget but the role you used to call the method doesn’t have the required privileges. |
| Solution | To add (or remove) an object from a budget, the role used to call the method must have the APPLYBUDGET privilege on the object. If the parent object is a database or schema, you must also have the USAGE privilege on the database and schema that contain the object.  For more information, see the list of [required object privileges](../budgets.md). |

|  |  |
| --- | --- |
| Error | ```output Uncaught exception of type 'EXPRESSION_ERROR' on line 10 at position 21 : Privilege 'APPLYBUDGET' is not authorized on the reference object. ``` |
| Cause | You tried to create a reference for an object without specifying the PRIVILEGE parameter in the SYSTEM$REFERENCE statement. |
| Solution | Create the reference with the APPLYBUDGET privilege on the object. |

|  |  |
| --- | --- |
| Error | ```output 505001 (55000): Uncaught exception of type 'EXPRESSION_ERROR' on line 10 at position 21 : Specified object does not exist or not authorized for the reference. ``` |
| Cause | There are multiple causes for this error message:   * You tried to add the SNOWFLAKE database to a custom budget with an inline SYSTEM$REFERENCE statement. * You don’t have the required privileges on the object to create a reference for it. The valid reference is required to add   the object to a budget. |
| Solution | * The SNOWFLAKE database cannot be added to a budget. See the [usage notes for ADD_RESOURCE](../../sql-reference/classes/budget/methods/add_resource.md). * Grant the required privileges on the object you want to add to the budget. For more information, see the list of   [required object privileges](../budgets.md). |

### You can’t set email notifications for a budget

The following scenarios can help you troubleshoot common issues when calling the
[<budget_name>!SET_EMAIL_NOTIFICATIONS](../../sql-reference/classes/budget/methods/set_email_notifications.md) method.

|  |  |
| --- | --- |
| Error | ```output Unknown user-defined function <database_name>.<schema_name>.<budget_name>.SET_EMAIL_NOTIFICATIONS ``` |
| Cause | The role you used to set the email notifications for a custom budget does not have the ADMIN instance role. |
| Solution | Use a role with the required privileges and roles. See the [Access control requirements](../../sql-reference/classes/budget/methods/set_email_notifications.md) for SET_EMAIL_NOTIFICATIONS. |

|  |  |
| --- | --- |
| Error | ```output Integration '<INTEG_NAME>' does not exist or not authorized. ``` |
| Cause | The notification integration does not exist. |
| Solution | Use a valid notification integration. For more information, see [Create an email notification integration](../notifications/email-notifications.md). Include the email addresses for budgets notifications in the ALLOWED_RECIPIENTS list. |

|  |  |
| --- | --- |
| Error | ```output FAILURE: Uncaught exception of type 'EXPRESSION_ERROR' on line 16 at position 34 : Following email address(es) are not allowed by the email integration <INTEGRATION_NAME>: [<email>] ``` |
| Cause | The email addresses are not included in the notification integration. |
| Solution | Add the email addresses to the notification integration, or use a notification integration that includes all the email addresses in the ALLOWED_RECIPIENTS list. |

|  |  |
| --- | --- |
| Error | ```output Email recipients in the given list at indexes [<index_list>] are not allowed. Either these email addresses are not yet validated or do not belong to any user in the current account. ``` |
| Cause | Some or all of the email addresses you tried to add are not validated. |
| Solution | See [Verify the email addresses of the email notification recipients](../notifications/email-notifications.md). |

### You can’t successfully call the GET_SERVICE_TYPE_USAGE method

The following scenarios can help you troubleshoot common issues when calling the
[<budget_name>!GET_SERVICE_TYPE_USAGE](../../sql-reference/classes/budget/methods/get_service_type_usage.md) method.

|  |  |
| --- | --- |
| Error | ```output 001044 (42P13): SQL compilation error: error line 0 at position -1 Invalid argument types for function 'GET_SERVICE_TYPE_USAGE': (VARCHAR(X), VARCHAR(X), VARCHAR(X), VARCHAR(X)) ``` |
| Cause | You called the method with invalid arguments or the wrong number of arguments. |
| Solution | Check that the arguments you use to call the method are valid and that you’ve included all required arguments. |

|  |  |
| --- | --- |
| Error | ```output 002151 (22023): Uncaught exception of type 'STATEMENT_ERROR' on line 16 at position 23 : SQL compilation error: [:TIME_DEPART] is not a valid date/time component for function DATE_TRUNC. ``` |
| Cause | The TIME_DEPART argument is an invalid string. |
| Solution | Use one of the valid values listed for the [TIME_DEPART argument](../../sql-reference/classes/budget/methods/get_service_type_usage.md) in the reference topic. |

|  |  |
| --- | --- |
| Error | ```output 100094 (22000): Uncaught exception of type 'STATEMENT_ERROR' on line 16 at position 23 : Unknown timezone: '<invalid_timezone>' ``` |
| Cause | The USER_TIMEZONE argument is an invalid string. |
| Solution | Use a valid timezone string. For more information, see the [usage notes for GET_SERVICE_TYPE_USAGE](../../sql-reference/classes/budget/methods/get_service_type_usage.md). |

### The Budgets feature is not available for your account

|  |  |
| --- | --- |
| Errors | ```output FAILURE: SQL compilation error: Class 'SNOWFLAKE.CORE.BUDGET' does not exist or not authorized. ```  ```output 000002 (0A000): Uncaught exception of type 'STATEMENT_ERROR' on line 0 at position -1 : Unsupported feature 'TOK_RESOURCE_GROUP'. ``` |
| Cause | Your account does not yet support the Budgets feature. |
| Solution | The Budgets feature is not available for accounts in Gov regions. Support for Gov regions will be available in a future release. |

---
title: Troubleshooting access control issues
source: https://docs.snowflake.com/en/user-guide/security-access-control-troubleshooting.md
section: User Guide
---

# Troubleshooting access control issues

If a SQL statement fails because the role being used to run the query lacks the required access control privileges, you can use the
[EXPLAIN_PRIVILEGES](../sql-reference/functions/explain_privileges.md) function to determine exactly which privileges are missing.

## Troubleshooting as an administrator

An administrator who has privileges on all objects in Snowflake can call the EXPLAIN_PRIVILEGES function on any SQL statement.

> **Tip:**
>
> If you want someone who doesn’t have privileges on objects to be able to diagnose access control issues using EXPLAIN_PRIVILEGES, grant them the RESOLVE ALL ON ACCOUNT privilege.

**Example: List all privileges needed to run a SQL statement**

```sqlexample
CALL EXPLAIN_PRIVILEGES(statement => 'DESC SCHEMA mydb.myschema');
```

Example output:

```json
{
  "allOf": [
    {
      "privilege": "<ANY>",
      "objectType": "DATABASE",
      "objectName": "MYDB"
    },
    {
      "privilege": "MONITOR",
      "objectType": "SCHEMA",
      "objectName": "MYDB.MYSCHEMA"
    }
  ]
}
```

This output indicates that you need any privilege on the database `MYDB` AND the `MONITOR` privilege
on the schema `MYDB.MYSCHEMA`.

**Example: List the missing privileges for a specific role**

The following call determines whether the `analyst_role` (including privileges from its granted roles) has
the necessary privileges to execute the SELECT statement and, if not, returns the
missing privileges.

```sqlexample
CALL EXPLAIN_PRIVILEGES(
  statement => 'SELECT * FROM mydb.myschema.mytable',
  missing_only => true,
  for_role => 'analyst_role');
```

## Troubleshooting your own query

You must have at least one privilege on the objects referenced in your query to call the EXPLAIN_PRIVILEGES function. If those privileges on the object aren’t enough to successfully run your query, call the EXPLAIN_PRIVILEGES function with the `missing_only`
argument set to `true` to determine the additional privileges that are required.

For example, if you have privileges on the `mydb`, `myschema`, and `mytable` objects, but your query is still failing because of access control issues, run the following command:

```sqlexample
CALL EXPLAIN_PRIVILEGES(
  statement => 'SELECT * FROM mydb.myschema.mytable',
  missing_only => true);
```

If your current role is missing privileges, the function returns the specific privileges you need. For example:

```json
{
  "allOf": [
    {
      "privilege": "SELECT",
      "objectType": "TABLE",
      "objectName": "MYDB.MYSCHEMA.MYTABLE"
    }
  ]
}
```

---
title: Troubleshooting bulk data loads
source: https://docs.snowflake.com/en/user-guide/data-load-bulk-ts.md
section: User Guide
---

# Troubleshooting bulk data loads

This topic describes a methodical approach to troubleshooting issues with bulk data loads.

## Data load failures

### Step 1: Viewing the COPY history for the table

Query the load activity history for a table. For information, see [COPY_HISTORY](../sql-reference/functions/copy_history.md). The `STATUS` column indicates whether a particular set of files was loaded, partially loaded, or failed to load. The `FIRST_ERROR_MESSAGE` column provides a reason when an attempt partially loaded or failed.

Note that if a set of files has multiple issues, the `FIRST_ERROR_MESSAGE` column only indicates the first error encountered. To view all errors in the files, see Step 2: Validating the Data Load for instructions.

### Step 2: Validating the data load

The VALIDATION_MODE copy option instructs a COPY statement to validate the data to be loaded and return results based on the validation option specified. No data is loaded when this copy option is specified. For more information about the copy option, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

Execute a COPY statement with the VALIDATION_MODE copy option set to `RETURN_ALL_ERRORS`. In the statement, reference the set of files you had attempted to load.

The following example validates a set of files that contain errors. To facilitate analysis of the errors, a [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) statement then unloads the problematic records into a text file so they could be analyzed and fixed in the original data files. The statement queries the [RESULT_SCAN](../sql-reference/functions/result_scan.md) table function to retrieve the records. Note that the statements in this section must be run in succession in order to retrieve the applicable records using the [LAST_QUERY_ID](../sql-reference/functions/last_query_id.md) function.

```sqlexample
COPY INTO mytable
  FROM @mystage/myfile.csv.gz
  VALIDATION_MODE=RETURN_ALL_ERRORS;

SET qid=last_query_id();

COPY INTO @mystage/errors/load_errors.txt FROM (SELECT rejected_record FROM TABLE(result_scan($qid)));
```

## Other issues

### Error: Integration `{0}` associated with the stage `{1}` cannot be found

```bash
003139=SQL compilation error:\nIntegration ''{0}'' associated with the stage ''{1}'' cannot be found.
```

This error can occur when the association between the external stage and the storage
integration linked to the stage has been broken. This happens when the storage integration
object has been recreated (using
[CREATE OR REPLACE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md)).
A stage links to a storage integration using a hidden ID rather than the name of the storage
integration. Behind the scenes, the CREATE OR REPLACE syntax drops the object and recreates
it with a different hidden ID.

If you must recreate a storage integration after it has been linked to one or more stages,
you must reestablish the association between each stage and the storage integration by
executing [ALTER STAGE](../sql-reference/sql/alter-stage.md)
`stage_name` SET STORAGE_INTEGRATION = `storage_integration_name`, where:

* `stage_name` is the name of the stage.
* `storage_integration_name` is the name of the storage integration.

### Load times inserted using CURRENT_TIMESTAMP earlier than LOAD_TIME values in COPY_HISTORY view

Table designers may add a timestamp column that inserts the current timestamp as the default value as records are loaded into a table. The intent is to capture the time when each record was loaded into the table; however, the timestamps are earlier than the LOAD_TIME column values returned by the [COPY_HISTORY function](../sql-reference/functions/copy_history.md) (Information Schema) or the [COPY_HISTORY view](../sql-reference/account-usage/copy_history.md) (Account Usage). The reason is, [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) is evaluated when the load operation is compiled in cloud services rather than when the record is inserted into the table (i.e. when the transaction for the load operation is committed).

It is recommended to include and query [METADATA$START_SCAN_TIME](querying-metadata.md) instead, which provides a more accurate representation of record loading.

---
title: Troubleshooting external tables
source: https://docs.snowflake.com/en/user-guide/tables-external-ts.md
section: User Guide
---

# Troubleshooting external tables

This topic describes how to troubleshoot issues with external tables.

## Automatic metadata refreshing is disabled

If ownership of an external table (that is, the OWNERSHIP privilege on the external table) is transferred to a different role, the AUTO_REFRESH parameter for the external table is set to FALSE by default. To re-enable automatic refreshing of the external table metadata, set the AUTO_REFRESH parameter to TRUE by using an [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md) statement.

Verify that the configured settings for the external cloud messaging service are still accurate. For more information, see the instructions for your cloud storage provider:

* [Refresh external tables automatically for Amazon S3](tables-external-s3.md)
* [Refresh external tables automatically for Azure Blob Storage](tables-external-azure.md)

## Checking the progress of automatic metadata refreshes

Retrieve the current status of the internal, hidden pipe used by the external table to refresh its metadata. The results are displayed in JSON format. For information, see [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](../sql-reference/functions/system_external_table_pipe_status.md).

Verify the following values:

> `lastReceivedMessageTimestamp`
> :   Specifies the timestamp of the last event message received from the message queue.
>
>     If the timestamp is earlier than expected, this likely indicates an issue with either the cloud event notification service configuration or the service itself. If the field is empty, verify your service configuration settings. If the field contains a timestamp but it’s earlier than expected, verify whether any settings were changed in your service configuration.
>
> `lastForwardedMessageTimestamp`
> :   Specifies the timestamp of the last event message that was forwarded to the pipe.

### Error: Integration `{0}` associated with the stage `{1}` cannot be found

```bash
003139=SQL compilation error:\nIntegration ''{0}'' associated with the stage ''{1}'' cannot be found.
```

This error can occur when the association between the external stage and the storage
integration linked to the stage has been broken. This happens when the storage integration
object has been recreated (using
[CREATE OR REPLACE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md)).
A stage links to a storage integration using a hidden ID rather than the name of the storage
integration. Behind the scenes, the CREATE OR REPLACE syntax drops the object and recreates
it with a different hidden ID.

If you must recreate a storage integration after it has been linked to one or more stages,
you must reestablish the association between each stage and the storage integration by
executing [ALTER STAGE](../sql-reference/sql/alter-stage.md)
`stage_name` SET STORAGE_INTEGRATION = `storage_integration_name`, where:

* `stage_name` is the name of the stage.
* `storage_integration_name` is the name of the storage integration.

## Error: External table `{0}` marked invalid. Stage `{1}` location altered

Querying an external table might produce an error similar to the following error:

```bash
091093 (55000): External table ''{0}'' marked invalid. Stage ''{1}'' location altered.
```

This error can occur when the URL for the referenced stage is modified after the external table was created (by using [ALTER STAGE … SET URL](../sql-reference/sql/alter-stage.md)).

If you must modify the stage URL, you must recreate any existing external tables that reference the stage (by using [CREATE OR REPLACE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md)).

---
title: Troubleshooting loads from Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-load-gcs-ts.md
section: User Guide
---

# Troubleshooting loads from Google Cloud Storage

This topic provides instructions for resolving issues specific to loading data from Google Cloud Storage stages.

For general data loading troubleshooting steps, see [Troubleshooting bulk data loads](data-load-bulk-ts.md).

## Error: Failure using stage area

When attempting to load data from a Google Cloud Storage (GCS) bucket, you could encounter the following error:

```none
Failure using stage area. Cause: [Request violates VPC Service Controls. (Status Code: 403)]
```

This error indicates a violation of restrictions established for a GCP service perimeter, which is configured using VPC Service Controls to secure sensitive data. Although the GCS service account created for your Snowflake account may have been granted permission to read and write to the bucket, the access rules for the service perimeter are applied at the GCP organization level, potentially affecting multiple projects. To review additional details associated with the error message, access the VPC Service Control error logs. See the GCP documentation for descriptions of the `violationReason` values in the logs.

The simplest option to resolve the error is to load data from a bucket that is excluded from the service perimeter. If that option is not allowed by your established security rules, you could exclude the GCS service account for your Snowflake account from the service perimeter filters by adding the service account in an access level policy. Note that the service account still requires access to approved resources using the standard IAM policy described in the instructions for configuring the integration with GCS.

The access policy for a GCP organization contains access levels. Access levels are created and managed using the Access Context Manager and either the Google Cloud Console, the gcloud command-line tool, or the Cloud API. The following instructions rely on the gcloud command-line tool.

To add the GCS service account for your Snowflake account to an access level policy:

1. Using a Snowflake client, retrieve the ID for the Cloud Storage service account that was created automatically for your Snowflake account (using [DESCRIBE INTEGRATION](../sql-reference/sql/desc-integration.md)):

   ```sqlexample
   DESC STORAGE INTEGRATION <integration_name>;
   ```

   Where `integration_name` is the name of a storage integration in your account. For more information, see [Configure an integration for Google Cloud Storage](data-load-gcs-config.md).
2. Create a file named `snowflake_policy.yaml` on your local machine. Specify the service account ID in the `members` attribute:

   ```none
   - members:
      - serviceAccount:<service_account>
   ```

   For example:

   ```none
   - members:
      - serviceAccount:service-account-id@project1-123456.iam.gserviceaccount.com
   ```
3. Using the gcloud command-line tool, execute the following command to create an access level.

   > **Note:**
   >
   > This command requires a GCP role with the necessary permissions to change VCP Service Control.

   ```shell
   gcloud access-context-manager levels create <access_level_name> \
      --title snowflake \
      --basic-level-spec snowflake_policy.yaml \
      --combine-function=OR \
      --policy=<policy_name>
   ```

> Where:
>
> * `policy_name` is the access policy name for your GCP organization.
> * `access_level_name` is the name of your choice for the access level name.

---
title: Troubleshooting processing of unstructured data
source: https://docs.snowflake.com/en/user-guide/unstructured-ts.md
section: User Guide
---

# Troubleshooting processing of unstructured data

This topic provides instructions for resolving issues specific to processing unstructured data.

## Downloaded files cannot be opened

When attempting to open a file downloaded from a stage using a URL, you could encounter an error such as `invalid format`.

Verify that the ENCRYPTION property for the stage is configured for server-side encryption. Server-side encryption is required to access
staged files using a URL. For more information, see [Server-side encryption for unstructured data access](unstructured-intro.md).

For external stages, the server-side encryption setting varies by cloud storage provider. For internal stages, set the ENCRYPTION property
value to `SNOWFLAKE_SSE`.

The encryption type is specified when creating a stage (using [CREATE STAGE](../sql-reference/sql/create-stage.md)) or later
(using [ALTER STAGE](../sql-reference/sql/alter-stage.md)).

---
title: Troubleshooting sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-troubleshooting.md
section: User Guide
---

# Troubleshooting sensitive data classification

The simplest way to start troubleshooting a table that wasn’t classified by
[sensitive data classification](classify-intro.md) is to query the table directly (for example, `SELECT * FROM my_table`). If
a table can’t be queried, it can’t be classified.

If an object can’t be classified, Snowflake logs an event to an
[event table](../developer-guide/logging-tracing/event-table-setting-up.md). By default, the event is logged to the account-level event
table. If you have an event table defined for the failed object’s database, then the event is logged there instead.

In general, there is a delay before Snowflake tries to classify the object again. Every additional failed attempt is logged to the event
table. This delay and retry process continues until the object is fixed or removed from automatic classification.

> **Note:**
>
> To help avoid unnecessary costs, Snowflake waits additional time to retry classification for some errors, such as timeouts. For these
> timeout errors, Snowflake doesn’t retry classification until all objects are reclassified; the schedule on which objects are reclassified
> is controlled by the `maximum_classification_validity_days` key of the classification profile.

If you want prevent classification events from being logged, set the [ENABLE_AUTOMATIC_SENSITIVE_DATA_CLASSIFICATION_LOG](../sql-reference/parameters.md) account
parameter to FALSE.

## Listing general errors

The following query returns general errors related to sensitive data classification from the event table:

```sqlexample
SELECT
  record_type,
  record:severity_text::string log_level,
  parse_json(value) error_message
  FROM <event_db>.<event_schema>.<event_table>
  WHERE record_type='LOG' and scope:name ='snow.automatic_sensitive_data_classification'
  ORDER BY log_level;
```

For a subset of the possible error messages returned by this query, see Tag-related error messages.

## Listing object-level classification errors

The following query against the event table returns errors related to the classification of a specific object. For example, it returns
errors that occurred when Snowflake tried to classify a specific table.

```sqlexample
SELECT
  RECORD_ATTRIBUTES:"object_name"::string AS object_name,
  parse_json(value):"error_message" error_message,
  PARSE_JSON(VALUE):"profile_name" classification_profile_name,
  timestamp,
  FROM <event_db>.<event_schema>.<event_table>
  WHERE record_type='LOG'
    AND scope:name ='snow.automatic_sensitive_data_classification'
    AND RECORD_ATTRIBUTES:"event_type" = 'CLASSIFICATION_ERROR'
  ORDER BY TIMESTAMP DESC;
```

## Tag-related error messages

|  |  |
| --- | --- |
| Error | ```output "failure_reason":"NO_TAGGING_PRIVILEGE" ``` |
| Cause | The role that was used for sensitive data classification does not have the correct privileges to set tags. |
| Solution | Grant the necessary privileges to the role used for sensitive data classification. For more information, see [Tag privileges](object-tagging/work.md). |

|  |  |
| --- | --- |
| Error | ```output "failure_reason":"MANUALLY_APPLIED_VALUE_PRESENT" ``` |
| Cause | Another tag is manually set on the column. |
| Solution | Determine whether you want to keep the tag that was manually set on the column. If not, unset the tag before classifying the table using automatic classification or the SYSTEM$CLASSIFY stored procedure. |

|  |  |
| --- | --- |
| Error | ```output "failure_reason":"TAG_NOT_ACCESSIBLE_OR_AUTHORIZED" ``` |
| Cause | The role that was used for classification cannot access the tag. |
| Solution | * If the tag does not exist, create the tag. * If the tag exists, grant privileges on the tag, or the database and schema that contains the tag, to the role that was used to   classify the database or schema. |

For more information about event table messages, see [Viewing log messages](../developer-guide/logging-tracing/logging-accessing-messages.md).

---
title: Troubleshooting skipped or failed dynamic table refreshes
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-troubleshoot-refresh.md
section: User Guide
---

# Troubleshooting skipped or failed dynamic table refreshes

This topic helps you troubleshoot skipped or failed refreshes. For slow refresh diagnostics,
see [Monitor dynamic table performance](dynamic-tables-performance-monitor.md).

When [monitoring your dynamic table refreshes](dynamic-tables-monitor.md), note the following:

* If you see many SKIPPED entries, see Skipped refreshes.
* If you see consistent FAILED entries, see Failed refreshes.
* If you see a SCHEDULED or EXECUTING entry stuck for a long time, see
  [Monitor dynamic table performance](dynamic-tables-performance-monitor.md).

## Skipped refreshes

Dynamic tables refresh on a schedule. When a scheduled refresh starts, the following situations might cause the refresh to skip:

* If the dynamic table being refreshed has another dynamic table upstream, and the refresh for the upstream failed or was skipped.
* If a previous refresh for the dynamic table is still running.
* If the dynamic table’s refresh often takes longer than the target lag or there’s a significant difference between the target and actual lag,
  Snowflake might skip a refresh to reduce the rate of future skips.

  For instance, if a dynamic table has a 1-minute target lag but typically takes one hour to refresh, the system adjusts the “actual lag”
  accordingly.

  To improve refresh performance, see [Optimize dynamic table performance](dynamic-tables-performance-optimize.md).

Manual refreshes are never skipped but they can cause other scheduled refreshes to skip, especially if you perform frequent manual refreshes
on a dynamic table. Doing so can prevent downstream dynamic tables from refreshing. For this reason, Snowflake recommends that you avoid
frequently performing manual refreshes on a dynamic table with downstream dynamic tables that are expected to refresh according to target lag.

## Failed refreshes

Refresh failures are typically caused by issues with the dynamic table’s query definition, input
data (for example, parsing errors), or upstream failures.

### Find failed refreshes

To find failed refreshes, query the refresh history:

```sqlexample
SELECT
  name,
  data_timestamp,
  state,
  state_code,
  state_message
FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY(
  NAME_PREFIX => 'MY_DB.MY_SCHEMA',
  ERROR_ONLY => TRUE
));
```

You can also use the Refresh History page in Snowsight to view failed refreshes.
The Source Data Timestamp column shows the time of the last successful refresh. A failed
refresh doesn’t advance this value. If it’s far behind the target lag, your dynamic table is
lagging.

### Diagnose failed refreshes

Use the Query Profile to troubleshoot by selecting Show query profile next to
each refresh. This shows the execution graph of the query.

Use the Graph view in Snowsight to visualize dependencies. A failed or suspended
upstream dynamic table causes its downstream tables to fail. For more information, see
[View the graph of tables connected to your dynamic tables](dynamic-tables-monitor.md).

### Query event tables for failures

You can query an event table to find refresh failures across your dynamic tables:

```sqlexample
SELECT
  timestamp,
  resource_attributes:"snow.executable.name"::VARCHAR AS dt_name,
  resource_attributes:"snow.query.id"::VARCHAR AS query_id,
  value:message::VARCHAR AS error
FROM my_event_table
WHERE
  resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
  resource_attributes:"snow.database.name" = 'MY_DB' AND
  value:state = 'FAILED'
ORDER BY timestamp DESC;
```

For more information about configuring event tables and setting up alerts, see
[Event table monitoring and alerts for dynamic tables](dynamic-tables-monitor-event-table-alerts.md).

---
title: Troubleshooting Snowpipe
source: https://docs.snowflake.com/en/user-guide/data-load-snowpipe-ts.md
section: User Guide
---

# Troubleshooting Snowpipe

This topic describes a methodical approach to troubleshooting issues with loading data using Snowpipe.

The steps to troubleshoot issues with Snowpipe differ depending on the workflow used to load data files.

## Automatically loading data using Cloud Storage event notifications

### Error notifications

Configure error notifications for Snowpipe. When Snowpipe encounters errors during a load, the feature pushes a notification to a configured cloud messaging service, enabling analysis of your data files. For more information, see [Snowpipe error notifications](data-load-snowpipe-errors.md).

### General troubleshooting steps

Complete the following steps to identify the cause of most issues preventing the automatic loading of files.

#### Step 1: Check the pipe status

Retrieve the current status of the pipe. The results are displayed in JSON format. For information, see [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md).

Check the following values:

> `lastReceivedMessageTimestamp`
> :   Specifies the timestamp of the last event message received from the message queue. This message might not apply to the specific pipe, for example, if the path associated with the message doesn’t match the path in the pipe definition. In addition, only messages triggered by created data objects are consumed by auto-ingest pipes.
>
>     If the timestamp is earlier than expected, this likely indicates an issue with either the service configuration — for example, Amazon SQS or Amazon SNS, or Azure Event Grid — or the service itself. If the field is empty, verify your service configuration settings. If field contains a timestamp but it is earlier than expected, verify whether any settings were changed in your service configuration.
>
> `lastForwardedMessageTimestamp`
> :   Specifies the timestamp of the last “create object” event message with a matching path that was forwarded to the pipe.
>
>     If event messages are getting received from the message queue but are not forwarded to the pipe, then there is likely a mismatch between the blob storage path where the new data files are created and the combined path specified in the Snowflake stage and pipe definitions. Verify any paths specified in the stage and pipe definitions. Note that a path specified in the pipe definition is appended to any path in the stage definition.

#### Step 2. View the COPY history for the table

If event messages are getting received and forwarded, then query the load activity history for the target table. For information, see [COPY_HISTORY](../sql-reference/functions/copy_history.md).

The `STATUS` column indicates whether a particular set of files was loaded, partially loaded, or failed to load. The `FIRST_ERROR_MESSAGE` column provides a reason when an attempt partially loaded or failed.

Note that if a set of files has multiple issues, the `FIRST_ERROR_MESSAGE` column only indicates the first error encountered. To view all errors in the files, execute a [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement with the VALIDATION_MODE copy option set to `RETURN_ALL_ERRORS`. The VALIDATION_MODE copy option instructs a COPY statement to validate the data to be loaded and return results based on the validation option specified. No data is loaded when this copy option is specified. In the statement, reference the set of files you had attempted to load using Snowpipe. For more information about the copy option, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

If the COPY_HISTORY output does not include a set of expected files, query an earlier time period. If the files were duplicates of earlier files, the load history might have recorded the activity when the attempt to load the original files was made.

#### Step 3: Validate the data files

If the load operation encounters errors in the data files, the COPY_HISTORY table function describes the first error encountered in each file. To validate the data files, query the [VALIDATE_PIPE_LOAD](../sql-reference/functions/validate_pipe_load.md) function.

### Files generated in Microsoft Azure Data Lake Storage Gen2 storage not loaded

Currently, some third-party clients do not call `FlushWithClose` in the ADLS Gen 2 REST API. This step is necessary to trigger events that notify Snowpipe to load the files. Try calling the REST API manually to trigger Snowpipe to load these files.

For more information about the `Flush` method with the `close` argument, see <https://docs.microsoft.com/en-us/dotnet/api/azure.storage.files.datalake.datalakefileclient.flush>. For additional REST API reference information about the load for the `close` parameter, see <https://docs.microsoft.com/en-us/rest/api/storageservices/datalakestoragegen2/path/update>.

### Snowpipe stops loading files after Amazon SNS topic subscription is deleted

The first time a user creates a pipe object that references a specific Amazon Simple Notification Service (SNS) topic, Snowflake subscribes
a Snowflake-owned Amazon Simple Queue Service (SQS) queue to the topic. If an AWS administrator deletes the SQS subscription to the SNS
topic, any pipe that references the topic no longer receives event messages from Amazon S3.

To resolve the issue:

1. Wait 72 hours from the time when the SNS topic subscription was deleted.

   After 72 hours, Amazon SNS clears the deleted subscription. For more information, see the
   [Amazon SNS documentation](https://aws.amazon.com/premiumsupport/knowledge-center/sns-cross-account-subscription/).
2. Recreate any pipes that reference the topic (using CREATE OR REPLACE PIPE). Reference the same SNS topic in the pipe definition.
   For instructions, see [Step 3: Create a pipe with auto-ingest enabled](data-load-snowpipe-auto-s3.md).

All pipes that worked prior to the deletion of the SNS topic subscription should now begin to receive event messages from S3 again.

To circumvent the 72-hour delay, you can create a SNS topic with a different name. Recreate any pipes that reference the topic using the
CREATE OR REPLACE PIPE command, and specify the new topic name.

### Loads from Google Cloud Storage delayed or files missed

When automatic data loading from Google Cloud Storage (GCS) using Pub/Sub messages is configured, the event message for only a single staged file could be read. Alternatively, the data loads from GCS could be delayed from between several minutes and one day or longer. In general, either issue is caused when a GCS administrator has not granted the Snowflake service account the `Monitoring Viewer` role.

For instructions, see “Step 2: Grant Snowflake Access to the Pub/Sub Subscription” in [Configuring secure access to Cloud Storage](data-load-snowpipe-auto-gcs.md).

## Calling Snowpipe REST endpoints to load data

### Error notifications

The support for Snowpipe error notifications is available for Snowflake accounts hosted on Amazon Web Services (AWS). Errors
encountered during a data load trigger notifications that enable analysis of your data files. For more information, see
[Snowpipe error notifications](data-load-snowpipe-errors.md).

### General troubleshooting steps

Complete the following steps to identify the cause of most issues preventing the loading of files.

#### Step 1: Checking authentication issues

The Snowpipe REST endpoints use key pair authentication with JSON Web Token (JWT).

The Python/Java ingest SDKs generate the JWT for you. When calling the REST API directly, you need to generate them. If no JWT token is provided in the request, error `400` is returned by the REST endpoint. If an invalid token is provided, an error similar to the following is returned:

```bash
snowflake.ingest.error.IngestResponseError: Http Error: 401, Vender Code: 390144, Message: JWT token is invalid.
```

#### Step 2. Viewing the COPY history for the table

Query the load activity history for a table, including any attempted data loads using Snowpipe. For information, see [COPY_HISTORY](../sql-reference/functions/copy_history.md). The `STATUS` column indicates whether a particular set of files was loaded, partially loaded, or failed to load. The `FIRST_ERROR_MESSAGE` column provides a reason when an attempt partially loaded or failed.

Note that if a set of files has multiple issues, the `FIRST_ERROR_MESSAGE` column only indicates the first error encountered. To view all errors in the files, execute a [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement with the VALIDATION_MODE copy option set to `RETURN_ALL_ERRORS`. The VALIDATION_MODE copy option instructs a COPY statement to validate the data to be loaded and return results based on the validation option specified. No data is loaded when this copy option is specified. In the statement, reference the set of files you had attempted to load using Snowpipe. For more information about the copy option, see [COPY INTO <table>](../sql-reference/sql/copy-into-table.md).

#### Step 3: Checking the pipe status

If the COPY_HISTORY table function returns 0 results for the data load you are investigating, retrieve the current state of the pipe. The results are displayed in JSON format. For information, see [SYSTEM$PIPE_STATUS](../sql-reference/functions/system_pipe_status.md).

The `executionState` key identifies the execution state of the pipe. For example, `PAUSED` indicates the pipe is currently paused. The pipe owner could resume running the pipe using [ALTER PIPE](../sql-reference/sql/alter-pipe.md).

If the `executionState` value indicates an issue with starting the pipe, check the `error` key for more information.

#### Step 4: Validate the data files

If the load operation encounters errors in the data files, the COPY_HISTORY table function describes the first error encountered in each file. To validate the data files, query the [VALIDATE_PIPE_LOAD](../sql-reference/functions/validate_pipe_load.md) function.

## Other issues

### Set of files not loaded

#### Missing COPY_HISTORY record for the load

Check whether the COPY INTO *<table>* statement in the pipe includes the PATTERN clause. If so, verify whether the regular expression
specified as the PATTERN value is filtering out all of the staged files to load.

To modify the PATTERN value, it is necessary to recreate the pipe using the `CREATE OR REPLACE PIPE` syntax.

For more information, see [CREATE PIPE](../sql-reference/sql/create-pipe.md).

#### COPY_HISTORY record indicates unloaded subset of files

If the COPY_HISTORY function output indicates a subset of files was not loaded, you may try to “refresh” the pipe.

This situation can arise in any of the following situations:

* The external stage was previously used to bulk load data using the COPY INTO *table* command.
* **REST API:**

  + External event-driven functionality is used to call the REST APIs, and a backlog of data files already existed in the external stage before the events were configured.
* **Auto-ingest:**

  + A backlog of data files already existed in the external stage before event notifications were configured.
  + An event notification failure prevented a set of files from getting queued.

To load the data files in your external stage using the configured pipe, execute a [ALTER PIPE … REFRESH](../sql-reference/sql/alter-pipe.md) statement.

### Duplicate data in target tables

Compare the COPY INTO *<table>* statements in the definitions of all pipes in the account by executing [SHOW PIPES](../sql-reference/sql/show-pipes.md)
or by querying either the [PIPES](../sql-reference/account-usage/pipes.md) view in Account Usage or the
[PIPES](../sql-reference/info-schema/pipes.md) view in the Information Schema. If multiple pipes reference the same cloud storage location
in the COPY INTO *<table>* statements, verify that the directory paths do not overlap. Otherwise, multiple pipes could load the same set of
data files into the target tables. For example, this situation can occur when multiple pipe definitions reference the same storage location
with different levels of granularity, such as `<storage_location>/path1/` and `<storage_location>/path1/path2/`. In this example, if
files are staged in `<storage_location>/path1/path2/`, both pipes would load a copy of the files.

### Unable to reload modified data, modified data loaded unintentionally

Snowflake uses file loading metadata to prevent reloading the same files and duplicating data in a table. Snowpipe prevents loading files with the same name even if they were later modified; that is, they have a different eTag.

Because file-loading metadata is associated with the pipe object rather than the table, the following results occur:

* Staged files with the same name as files that were already loaded are ignored, even if they were modified; for example, if new rows were added or errors in the file were corrected.
* Files that couldn’t load during a pipe’s COPY operation — for example, because of invalid file content or stage access failures — are still registered in the pipe’s metadata. The registered file names are ignored by subsequent pipe activity, including ALTER PIPE … REFRESH. You can use a COPY statement to load the skipped files manually.
* Truncating the table by using the [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md) command doesn’t delete the Snowpipe file-loading metadata.

However, pipes only maintain the load history metadata for 14 days. Therefore:

Files modified and staged again within 14 days:
:   Snowpipe ignores modified files that are staged again. To reload modified data files, it is currently necessary to recreate the pipe object using the `CREATE OR REPLACE PIPE` syntax.

    The following example recreates the `mypipe` pipe based on the example in Step 1 of [Data loading preparation using the Snowpipe REST API](data-load-snowpipe-rest-gs.md):

    ```sqlexample
    create or replace pipe mypipe as copy into mytable from @mystage;
    ```

Files modified and staged again after 14 days:
:   Snowpipe loads the data again, potentially resulting in duplicate records in the target table.

In addition, duplicate records can be loaded into the target table if [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statements are executed that reference the same bucket/container, path, and target table as in your active Snowpipe loads. The load histories for the COPY command and Snowpipe are stored separately in Snowflake. After you have loaded any historic staged data, if you need to load data manually using the pipe configuration, execute an ALTER PIPE … REFRESH statement. See Set of Files Not Loaded in this topic for more information.

### Load times inserted using CURRENT_TIMESTAMP earlier than LOAD_TIME values in COPY_HISTORY view

Table designers might add a timestamp column that inserts the current timestamp as the default value as records are loaded into a table. The intent is to capture the time when each record is loaded into the table; however, the timestamps are earlier than the LOAD_TIME column values returned by the [COPY_HISTORY function](../sql-reference/functions/copy_history.md) (Information Schema) or the [COPY_HISTORY view](../sql-reference/account-usage/copy_history.md) (Account Usage). This time discrepancy is because [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) is evaluated when the load operation is compiled in cloud services rather than when the record is inserted into the table; that is, when the transaction for the load operation is committed.

> **Note:**
>
> We currently do not recommend using the following functions in the `copy_statement` for Snowpipe:
>
> * CURRENT_DATE
> * CURRENT_TIME
> * CURRENT_TIMESTAMP
> * GETDATE
> * LOCALTIME
> * LOCALTIMESTAMP
> * SYSDATE
> * SYSTIMESTAMP
>
> It is a known issue that the time values inserted using these functions can be a few hours earlier than the LOAD_TIME values returned by the [COPY_HISTORY function](../sql-reference/functions/copy_history.md) or the [COPY_HISTORY view](../sql-reference/account-usage/copy_history.md).
>
> Use the copy option `INCLUDE_METADATA` with [METADATA$START_SCAN_TIME](querying-metadata.md) instead, which provides a more accurate representation of record loading. For more information, see [CREATE PIPE examples](../sql-reference/sql/create-pipe.md).

### Error: Integration `{0}` associated with the stage `{1}` cannot be found

```bash
003139=SQL compilation error:\nIntegration ''{0}'' associated with the stage ''{1}'' cannot be found.
```

This error can occur when the association between the external stage and the storage
integration linked to the stage has been broken. This happens when the storage integration
object has been recreated (using
[CREATE OR REPLACE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md)).
A stage links to a storage integration using a hidden ID rather than the name of the storage
integration. Behind the scenes, the CREATE OR REPLACE syntax drops the object and recreates
it with a different hidden ID.

If you must recreate a storage integration after it has been linked to one or more stages,
you must reestablish the association between each stage and the storage integration by
executing [ALTER STAGE](../sql-reference/sql/alter-stage.md)
`stage_name` SET STORAGE_INTEGRATION = `storage_integration_name`, where:

* `stage_name` is the name of the stage.
* `storage_integration_name` is the name of the storage integration.

### Errors for Snowpipe referencing government regions

You may get an error when Snowpipe referencing a bucket in a government region while the account is in a commercial region. Note that the government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions. For more information, see [AWS GovCloud (US)](https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-s3.html) and [Azure Government](https://learn.microsoft.com/en-us/azure/azure-government/).

### Large files not loading

Snowpipe auto-ingest relies on AWS S3 event notifications to trigger data loads. When large files are uploaded to S3 using multipart uploads, the event notification generated is `S3:ObjectCreated:CompleteMultipartUpload`. If your S3 bucket’s event notification configuration only includes `S3:ObjectCreated:Put`, `S3:ObjectCreated:Post`, or `S3:ObjectCreated:Copy`, Snowpipe will not automatically ingest these large files. The large files are not visible in `COPY_HISTORY` views or `SYSTEM$PIPE_STATUS` function results.

To avoid this issue, ensure that your S3 bucket event notification configuration includes `S3:ObjectCreated:CompleteMultipartUpload` or, for simplicity, set it to All object create events to capture all object creation events.

You can take the following troubleshooting steps:

1. Verify file size:

   * Confirm that the files not being ingested are larger than the typical threshold for multipart uploads (often around 16 MiB, but this can be configured).
2. Check S3 event notification configuration:

   * Navigate to the AWS S3 console.
   * Select the S3 bucket associated with your Snowpipe stage.
   * Go to Properties and then Event notifications.
   * Verify that the event notification configuration includes the `S3:ObjectCreated:CompleteMultipartUpload` event.
3. Recommended solution: configure All object create events:

   * In the S3 event notification configuration, change the setting to `All object create events`. This ensures that all object creation event types are sent to Snowflake.
4. Confirm event delivery:

   * After making changes, upload a large file to the S3 bucket and monitor AWS CloudWatch logs (if configured) or Snowflake’s `COPY_HISTORY` to ensure that the event is being delivered and the file is being ingested.
   * You can also check the `SYSTEM$PIPE_STATUS` function.
5. Review S3 multipart upload settings:

   * If you still experience issues, review the applications or processes that are uploading the large files to S3. Confirm that they use multipart uploads and that the configurations are correct.

---
title: Troubleshooting steps
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/troubleshooting-steps.md
section: User Guide
---

# Troubleshooting steps

This topic provides additional steps you can take to troubleshoot connectivity issues when the [common issues resolutions](common-issues.md) are not successful. If these resolutions don’t work, you can try the following steps, *in order*, until the issue is resolved:

1. [Use Snowflake troubleshooting tools](snowflake-tools.md).
2. [Follow alternative troubleshooting steps](alternate-steps.md).
3. [Follow-up actions](followup-actions.md).

## Prerequisites

* Ensure that all tests are performed directly on the system experiencing connectivity issues. For example, if the issue is occurring in the Tableau server, perform the following troubleshooting steps in that system and not on your own workstation.
* Admin access might be required on systems with connectivity issues, such as a Tableau server.
* Verification of client connectivity before any scheduled production release or network change is recommended to prevent unexpected downtime due to a client’s inability to access one or more of the required endpoints.

---
title: Troubleshooting tasks
source: https://docs.snowflake.com/en/user-guide/tasks-ts.md
section: User Guide
---

# Troubleshooting tasks

This section describes a methodical approach to troubleshooting tasks that do not run as expected.

## Task did not run

### Step 1: Verify the task did not run

Query the [TASK_HISTORY](../sql-reference/functions/task_history.md) table function to verify the task did not run. It is possible that the task ran successfully but the SQL statement in the task definition failed. In particular, note the scheduled and completed times, as well as any error code and message.

If the task has a predecessor task (in a [task graph](tasks-graphs.md)), verify whether the predecessor task completed successfully.

### Step 2: Verify the task was resumed

Snowflake creates all tasks in the SUSPENDED state. Verify the state of the task (or each task in a task graph) is RESUMED (using [DESCRIBE TASK](../sql-reference/sql/desc-task.md) or [SHOW TASKS](../sql-reference/sql/show-tasks.md)). Or verify that the task was manually executed using [EXECUTE TASK](../sql-reference/sql/execute-task.md).

To resume an individual task, execute [ALTER TASK](../sql-reference/sql/alter-task.md) … RESUME. To recursively enable all dependent tasks tied to a root task, query the [SYSTEM$TASK_DEPENDENTS_ENABLE](../sql-reference/functions/system_task_dependents_enable.md) function rather than enabling each task individually.

While you are reviewing the task details, if the task has a schedule, also check the cron expression. Verify that at least one occurrence of the scheduled time has passed.

### Step 3: Verify the permissions granted to the task owner

Verify the task owner (i.e. the role that has the OWNERSHIP privilege on the task) has the following privileges, which are required for the task to run:

| Object | Privilege | Notes |
| --- | --- | --- |
| Account | EXECUTE TASK | Required to run any tasks the role owns. Revoking the EXECUTE TASK privilege on a role prevents all subsequent task runs from starting under that role. |
| Database | USAGE |  |
| Schema | USAGE |  |
| Task | OWNERSHIP |  |
| Warehouse | USAGE |  |

Verify the privileges granted to the role using [SHOW GRANTS](../sql-reference/sql/show-grants.md) TO ROLE `role_name`.

### Step 4: Verify the condition

If the task includes a WHEN clause with a [SYSTEM$STREAM_HAS_DATA](../sql-reference/functions/system_stream_has_data.md) condition, verify that the specified stream contained change data capture (CDC) records when the task was last scheduled to run. Historical data for a stream can be queried using an [AT | BEFORE](../sql-reference/constructs/at-before.md) clause.

### Step 5: Check predecessor tasks

If the task is a child task in a task graph, check that the predecessor tasks (parent tasks) ran to completion successfully. If A parent task failed to run to completion, any child tasks are skipped. For more information, see [Create a sequence of tasks with a task graph](tasks-graphs.md).

## Task timed out or exceeded the schedule window

There is a 60 minute default limit on a single run of a task. This limitation was implemented as a safeguard against non-terminating tasks. Query the [TASK_HISTORY](../sql-reference/functions/task_history.md) table function. If the task was canceled or exceeded the window scheduled for the task, the cause is often an undersized warehouse. Review the warehouse size and consider increasing it to fit within the schedule window or the one-hour limit.

Alternatively, consider increasing the timeout limit for the task by executing [ALTER TASK](../sql-reference/sql/alter-task.md) … SET USER_TASK_TIMEOUT_MS = *<num>*. To determine if the USER_TASK_TIMEOUT_MS parameter has been set for a specific task, execute the following statement:

```sqlsyntax
SHOW PARAMETERS LIKE 'USER_TASK_TIMEOUT_MS' IN TASK <task_name>;
```

Where `<task_name>` is the name of the task whose timeout limit you are adjusting. If the statement returns no record, the task currently has the default `3600000` millisecond (60 minute) timeout.

Note that neither increasing the warehouse size nor increasing the timeout limit might help if there are query parallelization issues. Consider looking at alternate ways to rewrite the SQL statement run by the task.

---
title: Troubleshooting the Kafka connector
source: https://docs.snowflake.com/en/user-guide/kafka-connector-ts.md
section: User Guide
---

# Troubleshooting the Kafka connector

This section describes how to troubleshoot issues encountered while ingesting data using the Kafka connector.

## Error notifications

Configure error notifications for Snowpipe. When Snowpipe encounters file errors during a load, the feature pushes a notification to a configured cloud messaging service, enabling analysis of your data files. For more information, see [Snowpipe error notifications](data-load-snowpipe-errors.md).

## General troubleshooting steps

Complete the following steps to troubleshoot issues with loads using the Kafka connector.

### Step 1: View the COPY history for the table

Query the load activity history for the target table. For information, see [COPY_HISTORY view](../sql-reference/account-usage/copy_history.md). If the COPY_HISTORY output does not include a set of expected files, query an earlier time period. If the files were duplicates of earlier files, the load history might have recorded the activity when the attempt to load the original files was made. The `STATUS` column indicates whether a particular set of files was loaded, partially loaded, or failed to load. The `FIRST_ERROR_MESSAGE` column provides a reason when an attempt partially loaded or failed.

The Kafka connector moves files it could not load to the stage associated with the target table. The syntax for referencing a table stage is `@[namespace.]%table_name`.

List all files located in the table stage using [LIST](../sql-reference/sql/list.md).

For example:

```sqlexample
LIST @mydb.public.%mytable;
```

File names are in one of the following formats. The conditions that produce each format are described in the table:

| File Type | Description |
| --- | --- |
| Raw bytes | These files match the following pattern:  `<connector_name>/<table_name>/<partition>/offset_(<key>/<value>_)<timestamp>.gz`  For these files, the Kafka records could not be converted from raw bytes to the source file format (Avro, JSON, or Protobuf).  A common cause for this issue is a network failure that resulted in a character getting dropped from the record. The Kafka connector could no longer parse the raw bytes, resulting in a broken record. |
| Source file format (Avro, JSON, or Protobuf) | These files match the following pattern:  `<connector_name>/<table_name>/<partition>/<start_offset>_<end_offset>_<timestamp>.<file_type>.gz`  For these files, after the Kafka connector converted the raw bytes back to the source file format, Snowpipe encountered an error and could not load the file. |

The following sections provide instructions for resolving issues with each of the file types:

#### Raw bytes

The filename `<connector_name>/<table_name>/<partition>/offset_(<key>/<value>_)<timestamp>.gz` includes the exact offset of the record that was not converted from raw bytes to the source file format. To resolve issues, resend the record to the Kafka connector as a new record.

#### Source file format (Avro, JSON, or protobuf)

If Snowpipe could not load data from files in the internal stage created for the Kafka topic, the Kafka connector moves the files to the stage for the target table in the source file format.

If a set of files has multiple issues, the `FIRST_ERROR_MESSAGE` column in the COPY_HISTORY output only indicates the first error encountered. To view all errors in the files, it is necessary to retrieve the files from the table stage, upload them to a named stage, and then execute a [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) statement with the VALIDATION_MODE copy option set to `RETURN_ALL_ERRORS`. The VALIDATION_MODE copy option instructs a COPY statement to validate the data to be loaded and return results based on the validation option specified. No data is loaded when this copy option is specified. In the statement, reference the set of files you had attempted to load using the Kafka connector.

When any issues with the data files are resolved, you can load the data manually using one or more COPY statements.

The following example references data files located in the table stage for the `mytable` table in the `mydb.public` database and schema.

To validate data files in the table stage and resolve errors:

1. List all files located in the table stage using [LIST](../sql-reference/sql/list.md).

   For example:

   ```sqlexample
   LIST @mydb.public.%mytable;
   ```

   The examples in this section presume that JSON is the source format for the data files.
2. Download the files created by Kafka connector to your local machine using [GET](../sql-reference/sql/get.md).

   For example, download the files to a directory named `data` on your local machine:

   Linux or macOS:
   :   ```sqlexample
       GET @mydb.public.%mytable file:///data/;
       ```

   Microsoft Windows:
   :   ```sqlexample
       GET @mydb.public.%mytable file://C:\data\;
       ```
3. Create a named internal stage using [CREATE STAGE](../sql-reference/sql/create-stage.md) that stores data files with the same format as your source Kafka files.

   For example, create a internal stage named `kafka_json` that stores JSON files:

   ```sqlexample
   CREATE STAGE kafka_json FILE_FORMAT = (TYPE = JSON);
   ```
4. Upload the files you downloaded from the table stage using [PUT](../sql-reference/sql/put.md).

   For example, upload the files downloaded to the `data` directory on your local machine:

   Linux or macOS:
   :   ```sqlexample
       PUT file:///data/ @mydb.public.kafka_json;
       ```

   Microsoft Windows:
   :   ```sqlexample
       PUT file://C:\data\ @mydb.public.kafka_json;
       ```
5. Create a temporary table with two variant columns for testing purposes. The table is only used to validate staged data file. No data is loaded into the table. The table is dropped automatically when the current user session ends:

   ```sqlexample
   CREATE TEMPORARY TABLE t1 (col1 variant);
   ```
6. Retrieve all errors encountered in the data file by executing a [COPY INTO \*table\* … VALIDATION_MODE = ‘RETURN_ALL_ERRORS’](../sql-reference/sql/copy-into-table.md) statement. The statement validates the file in the specified stage. No data is loaded into the table:

   ```sqlexample
   COPY INTO mydb.public.t1
     FROM @mydb.public.kafka_json
     FILE_FORMAT = (TYPE = JSON)
     VALIDATION_MODE = 'RETURN_ALL_ERRORS';
   ```
7. Fix all reported errors in the data files on your local machine.
8. Upload the fixed files to either the table stage or the named internal stage using [PUT](../sql-reference/sql/put.md).

   The following example uploads the files to the table stage, overwriting the existing files:

   Linux or macOS:
   :   ```sqlexample
       PUT file:///tmp/myfile.csv @mydb.public.%mytable OVERWRITE = TRUE;
       ```

   Windows:
   :   ```sqlexample
       PUT file://C:\temp\myfile.csv @mydb.public.%mytable OVERWRITE = TRUE;
       ```
9. Load the data into the target table using COPY INTO *table* without the VALIDATION_MODE option.

   You can optionally use the PURGE = TRUE copy option to delete the data files from the stage once the data is loaded successfully, or manually delete the files from the table stage using [REMOVE](../sql-reference/sql/remove.md):

   ```sqlexample
   COPY INTO mydb.public.mytable(RECORD_METADATA, RECORD_CONTENT)
     FROM (SELECT $1:meta, $1:content FROM @mydb.public.%mytable)
     FILE_FORMAT = (TYPE = 'JSON')
     PURGE = TRUE;
   ```

### Step 2: Analyze the Kafka connector log file

If the COPY_HISTORY view has no record of the data load, then analyze the log file for the Kafka connector. The connector writes events to the log file. Note that the Snowflake Kafka connector shares the same log file with all Kafka connector plugins. The name and location of this log file should be in your Kafka Connect configuration file. For more information, see the documentation provided for your Apache Kafka software.

Search the Kafka connector log file for Snowflake-related error messages. Most messages will have the string `ERROR` and will contain the file name
`com.snowflake.kafka.connector...` to make these messages easier to find.

Possible errors that you might encounter include:

Configuration error:
:   Possible causes of the error:

    * The connector doesn’t have the proper information to subscribe to the topic.
    * The connector doesn’t have the proper information to write to the Snowflake table (e.g. the key pair for authentication might be wrong).

    Note that the Kafka connector validates its parameters. The connector throws an error for each incompatible configuration parameter. The error message is written
    to the Kafka Connect cluster’s log file. If you suspect a configuration problem, check the errors in that log file.

Read error:
:   The connector might not have been able to read from Kafka for the following reasons:

    * Kafka or Kafka Connect might not be running.
    * The message might not have been sent yet.
    * The message might have been deleted (expired).

Write error (stage):
:   Possible causes of the error:

    * Insufficient privileges on the stage.
    * Stage is out of space.
    * Stage was dropped.
    * Some other user or process wrote unexpected files to the stage.

Write error (table):
:   Possible causes of the error:

    * Insufficient privileges on the table.

### Step 3: Check Kafka Connect

If no error is reported in the Kafka connect log file, check Kafka Connect. For troubleshooting instructions, see the documentation provided by your Apache Kafka software vendor.

## Resolving specific issues

### Duplicate rows with the same topic partition and offset

When loading data using version 1.4 of the Kafka connector (or higher), duplicate rows in the target table with the same topic partition and offset can indicate that the load operation exceeded the default execution timeout of 300000 milliseconds (300 seconds). To verify the cause, check the Kafka Connect log file for the following error:

```bash
org.apache.kafka.clients.consumer.CommitFailedException: Commit cannot be completed since the group has already rebalanced and assigned the partitions to another member.

This means that the time between subsequent calls to poll() was longer than the configured max.poll.interval.ms, which typically implies that the poll loop is spending too much time message processing. You can address this either by increasing max.poll.interval.ms or by reducing the maximum size of batches returned in poll() with max.poll.records.

at org.apache.kafka.clients.consumer.internals.ConsumerCoordinator.sendOffsetCommitRequest(ConsumerCoordinator.java:1061)
```

To resolve the error, in the Kafka configuration file (e.g. `<kafka_dir>/config/connect-distributed.properties`), change either of the following properties:

`consumer.max.poll.interval.ms`
:   Increase the execution timeout to `900000` (900 seconds).

`consumer.max.poll.records`
:   Decrease the number of records loaded with each operation to `50`.

### Failure in Streaming Channel Offset Migration Response Error Code: 5023

When upgrading to the v2.1.0 (or higher) connector version, there was a change introduced in Snowpipe Streaming Channel name format. As a result, the logic detecting information about previously committed offsets
will not find any information about previously committed ones.
This will manifest as the following exception:

```bash
com.snowflake.kafka.connector.internal.SnowflakeKafkaConnectorException: [SF_KAFKA_CONNECTOR] Exception: Failure in Streaming Channel Offset Migration Response Error Code: 5023

Detail: Streaming Channel Offset Migration from Source to Destination Channel has no/invalid response, please contact Snowflake Support

Message: Snowflake experienced a transient exception, please retry the migration request.
```

To resolve this error, in the Kafka configuration file (for example, `<kafka_dir>/config/connect-distributed.properties`), add the following configuration property:

`enable.streaming.channel.offset.migration`
:   Disable automatic offset migration by setting it to `false`.

### Configuring connector to support multiple topics

We have encountered issue with a single kafka connector instance supporting a large number of topics, each having multiple partitions. The connector’s configuration, even though seemed to be valid, resulted in endless re-balance cycle without possibility to ingest any data into the Snowflake.
The issue was specific to Snowpipe Streaming ingestion mode (`snowflake.ingestion.method=SNOWPIPE_STREAMING`), but guidelines are also applicable to Snowpipe ingestion mode (`snowflake.ingestion.method=SNOWPIPE`).
The issue manifests itself in the log file by repeatedly logging this log message:

`[Worker-xyz] [timestamp] INFO [my-connector|task-id] [SF_INGEST] Channel is marked as closed`

This can typically happen when you configure your connector to ingest topics via regex.
We recommend applying the following set of options to the Kafka configuration file (for example, `<kafka_dir>/config/connect-distributed.properties`):

`consumer.override.partition.assignment.strategy`
:   Configure partition assignment strategy to tasks as `org.apache.kafka.clients.consumer.CooperativeStickyAssignor` - this will cause even distribution of ingested channels to available tasks, reducing the risk of re-balancing. Note that `CooperativeStickyAssignor` requires Kafka Connect version 3.0.1 or later because of [this known issue](https://issues.apache.org/jira/browse/KAFKA-12487).

`tasks.max`
:   The number of instantiated tasks per connector shouldn’t exceed number of available CPU’s - the underlying driver implements throttling mechanism based on the available CPU’s. Increasing number of concurrent requests will increase memory pressure on your system, but also will result in longer insert processing times, directly leading to missing connector’s heartbeats.

When speaking about connector’s timeout values, there is a set of configuration properties directly affecting these:

`consumer.override.heartbeat.interval.ms`
:   Defines how often the monitor thread (there is one associated with each task) will send heartbeat to Kafka. Default is `3000` ms, but in case of higher system load - you can experiment with increasing it to `5000` ms.

`consumer.override.session.timeout.ms`
:   Defines how long the broker will wait before assuming the consumer is in an invalid state and attempting re-balance. This setting should be typically 3 times higher than heartbeat interval, so if you configured heartbeat to `5000` ms, set this one to `15000` ms.

`consumer.override.max.poll.interval.ms`
:   Defines the maximum interval between call to `poll()` from underlying Kafka. The time spent between the polls basically maps to the connector processing batch of data (including upload to Snowflake and committing). In scenarios when you have multiple tasks processing data, underlying Snowflake Connection may start throttling requests, resulting in longer processing times. Depending on your scenario, you can increase this value to even 20 minutes (`1200000` ms) - especially when you start the connector with a large initial record count to be ingested.

`consumer.override.rebalance.timeout.ms`
:   When re-balance happens, in a scenario with large number of channels per task, there is a lot of underlying logic per channel to figure out where to resume processing. This code is executed sequentially, so the more channels per task, the longer initial setup will last. Configure this property to value large enough, to give each channel to complete its initialization. Value of 3 minutes (`180000` ms) is a good starting point.

It is also important to be aware of available heap memory for the connector. This is especially important in scenarios, where there are multiple connectors running simultaneously or you have one connector ingesting data from multiple topics. Each topic’s partition maps to a single channel and as such, requires memory.

Make sure you adjust your Kafka connect process memory settings via Xmx setting. One way of doing that is to define the
`KAFKA_OPTS` environment variable and set it accordingly (that is, `KAFKA_OPTS=-Xmx4G`).

### File cleaner purging files unexpectedly

When using the Kafka connector with SNOWPIPE, you might encounter an issue where you ingest data into a single table from multiple topics.
If your configuration doesn’t have the `snowflake.topic2table.map` entry or there is a 1:1 mapping between the topic and the table, this issue doesn’t apply.

The Kafka connector is generating files with records to be uploaded to a stage. These files are formatted according to the following pattern:
`snowflake_kafka_connector_<connector-name>_stage_<table-name>/<connector-name>/<table-name>/<partition-id>/<low-watermark>_<high-watermark>_<timestamp>.json.gz`. The issue is located in the `<partition-id>`: if multiple topics load data into a single table, duplicates are likely on the `partition-id` value. This is not a problem in a normal connector operation. However, if the connector restarts or rebalances, the cleaner process might inaccurately associate files loaded to stage (but not yet ingested) with the wrong partition and decide to delete them, which might result in a loss-of-data event.

The connector with version 2.5.0 fixes this issue by including the source topic’s hashcode in the `partition-id` to ensure unique file names that exactly match a single topic’s partition.
This fix is enabled by default - `snowflake.snowpipe.stageFileNameExtensionEnabled` - and affects only configurations where a target table is listed more than once in `snowflake.topic2table.map`.

If your configuration is affected by this functionality, you might end up having stale files uploaded to your stage. When the connector starts, it will check if your stage contains such files. You need to look for the log entries starting with `NOTE: For table`, followed by the list of detected files.

You can also check if there are some files affected at the stage manually:

1. Find the affected stage:

   ```sqlexample
   show stages like 'snowflake_kafka_connector%<your table name>';
   ```
2. List the stage files:

   ```sqlexample
   list @<your stage name> pattern = '.+/<your-table-name>/[0-9]{1,4}/[0-9]+_[0-9]+_[0-9]+\.json\.gz$';
   ```

The command above lists all files matching your table’s stage and having partition IDs in the range 0-9999.
These files won’t be ingested anymore, so you can download or delete them.

## Reporting issues

When contacting [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for assistance, please have the following files available:

* Configuration file for your Kafka connector.

  > **Important:**
  >
  > Remove the private key before providing the file to Snowflake.
* Copy of the Kafka Connector log. Ensure that the file does not contain confidential or sensitive information.
* JDBC log file.

  To generate the log file, set the `JDBC_TRACE = true` environment variable on your Kafka Connect cluster before you run the Kafka
  connector.

  For more information about the JDBC log file, see
  [this article](https://community.snowflake.com/s/article/How-to-generate-log-file-on-Snowflake-connectors) in the Snowflake Community.
* Connect log file.

  To produce the log file, edit the `etc/kafka/connect-log4j.properties` file. Set the
  `log4j.appender.stdout.layout.ConversionPattern` property as follows:

  > `log4j.appender.stdout.layout.ConversionPattern=[%d] %p %X{connector.context}%m (%c:%L)%n`

  Connector contexts are available in Kafka version 2.3 and higher.

  For more information, see the [Logging Improvements](https://www.confluent.io/blog/kafka-connect-improvements-in-apache-kafka-2-3/)
  information on the Confluent website.

---
title: Troubleshooting using Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-open-catalog-troubleshooting.md
section: User Guide
---

# Troubleshooting using Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake

The following scenarios can help you troubleshoot issues that might occur when using Apache Iceberg™ tables with Snowflake Open Catalog in
Snowflake.

## You can’t create a catalog integration for Open Catalog

This section describes how to troubleshoot creating a catalog integration for Open Catalog.

To troubleshoot, identify the error message you received in the SQL output when the creation of your catalog integration failed.

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Cannot create catalog integration <catalog_integration_name> due to error: Unable to process: Unable to find warehouse <catalog_name>. Check the REST configuration and ensure the warehouse name '<catalog_name>' matches the Polaris catalog name. ``` |
| Cause | The `<open_catalog_name>` you specified for the `CATALOG_NAME` parameter in your catalog integration doesn’t match the name of any external catalog in the Open Catalog account at the `<polaris_account_url>` you specified for the `CATALOG_URI` parameter. |
| Solution | Update `<open_catalog_name>` for the `CATALOG_NAME` parameter to exactly match the name of the external catalog in Open Catalog, and try creating the catalog integration again. If you haven’t created the external catalog yet, follow the instructions in [Create a catalog](https://other-docs.snowflake.com/en/opencatalog/create-catalog).  **Important:** `<open_catalog_name>` is case-sensitive. |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: User provided authentication credentials are invalid for catalog integration <catalog_integration_name> due to error: Malformed request: unauthorized_client: The client is not authorized. ``` |
| Cause | The OAuth token you specified in the catalog integration isn’t valid. |
| Solution | Ensure that the values specified for `OAUTH_CLIENT_ID` and `OAUTH_CLIENT_SECRET` in your catalog integration are valid values for an existing service connection. To validate, compare these values with the service credential values you saved when you [configured the service connection](https://other-docs.snowflake.com/en/opencatalog/configure-service-connection#configure-a-service-connection). If they don’t match, update the values to match. |

## You can’t create a Snowflake-managed table

This section describes how to troubleshoot creating a Snowflake-managed table.

To troubleshoot, identify the error message you received in the SQL output when the creation of your table failed.

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: The Snowflake service connection associated with the Polaris catalog integration does not have the required privileges to send notifications. The minimum required privileges are TABLE_CREATE, TABLE_WRITE_PROPERTIES, TABLE_DROP, NAMESPACE_CREATE, and NAMESPACE_DROP. ``` |
| Cause | The catalog role for the external catalog you want to connect to doesn’t have the necessary privileges to send notifications to Open Catalog. |
| Solution | Update the catalog role by granting all of the following privileges to the catalog role for your external catalog:   * TABLE_CREATE * TABLE_WRITE_PROPERTIES * TABLE_DROP * NAMESPACE_CREATE * NAMESPACE_DROP   Where you update the catalog role depends on whether the grants it has are applied at the catalog, namespace, or table level. See the applicable procedure for your catalog role:   * [Update the privileges granted to a catalog role at the catalog level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-catalog-level) * [Update the privileges granted to a catalog role at the namespace level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-namespace-level) * [Update the privileges granted to a catalog role at the table level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-table-level) |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to access the REST endpoint of catalog integration <catalog_integration_name> with error: Unable to process: Failed to get subscoped credentials: Error assuming AWS_ROLE: User: <IAM_user_arn> is not authorized to perform: sts:AssumeRole on resource: <S3_role_arn>. Check the accessibility of the REST catalog URI or warehouse. ``` |
| Cause | The AWS IAM user for your external catalog can’t assume the role that has permission to access S3. |
| Solution | Modify the policy document in AWS to allow the IAM user for your Open Catalog account to assume the role that has permission to access your S3 bucket. To modify the policy document, you need to update the IAM role in AWS. For details, see [Retrieve the AWS IAM user for your Snowflake Open Catalog account](https://other-docs.snowflake.com/en/opencatalog/create-catalog#step-4-retrieve-the-aws-iam-user-for-your-open-catalog-account) and then [Grant the IAM user permissions to access bucket objects](https://other-docs.snowflake.com/en/opencatalog/create-catalog#step-5-grant-the-iam-user-permissions-to-access-bucket-objects).  Remember that the policy document must include the IAM user ARN and external ID for both your external volume and external catalog in Open Catalog. In the following example policy document, note the following values:   * `arn:aws:iam::111111111111:user/----0000-s` is the STORAGE_AWS_IAM_USER_ARN for the external volume. * `arn:aws:iam::222222222222:user/----0000-s` is the IAM user ARN for the external catalog in Snowflake Open Catalog. * `Iceberg_table_external_id` is the STORAGE_AWS_EXTERNAL_ID for your external volume and also the external ID for your external   Catalog in Open Catalog.  ```sqljson   {        "Version": "2012-10-17",        "Statement": [          {            "Sid": "",            "Effect": "Allow",            "Principal": {              "AWS": [                  "arn:aws:iam::111111111111:user/----0000-s",                  "arn:aws:iam::222222222222:user/----0000-s"               ]            },            "Action": "sts:AssumeRole",            "Condition": {              "StringEquals": {                "sts:ExternalId": "iceberg_table_external_id"              }            }          }        ]      }   ``` |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: The associated Polaris catalog cannot be of type INTERNAL. ``` |
| Cause | You’re attempting to sync a Snowflake-managed table to an internal catalog in Open Catalog. You can only sync a Snowflake-managed table to an external catalog in Open Catalog. |
| Solution | You can’t update an existing internal catalog to an external catalog, so you must create a new external catalog:  1. Follow the instructions in [Create a catalog](https://other-docs.snowflake.com/en/opencatalog/create-catalog) to create an external catalog in your Open Catalog account. When creating the catalog,    ensure that the External toggle is enabled. 2. Update `<open_catalog_name>` for the `CATALOG_NAME` parameter in your catalog integration to the name of the external    catalog you created. |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: SQL Execution Error: Resource on the REST endpoint of catalog integration CATINT is forbidden due to error: Forbidden: Invalid locations '[<path to metadata file>]' for identifier '<identifier>': <path to metadata file> is not in the list of allowed locations: [<list of allowed locations>]. ``` |
| Cause | The path to the metadata file for the table you want to create isn’t included in the list of allowed locations for your external cloud provider. As a result, Open Catalog can’t access the metadata file for the table. |
| Solution | Ensure that the location of the metadata file falls under the file path of the default base location for the catalog that the service admin created in Open Catalog, or that it falls under any of the additional allowed locations, if applicable. For the list of allowed locations, select the catalog in Open Catalog and refer to the **Locations** field. |

## You can’t alter an Iceberg table when specifying the CATALOG_SYNC parameter

This section describes how to troubleshoot altering the CATALOG_SYNC parameter.

To troubleshoot, identify the error message you received in the SQL output when your table alteration failed.

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: The Snowflake service connection associated with the Polaris catalog integration does not have the required privileges to send notifications. The minimum required privileges are TABLE_CREATE, TABLE_WRITE_PROPERTIES, TABLE_DROP, NAMESPACE_CREATE, and NAMESPACE_DROP. ``` |
| Cause | The catalog role for the external catalog you want to connect to doesn’t have the necessary privileges to send notifications to Open Catalog. |
| Solution | Grant all of the following privileges to the catalog role for your external catalog:   * TABLE_CREATE * TABLE_WRITE_PROPERTIES * TABLE_DROP * NAMESPACE_CREATE * NAMESPACE_DROP   Where you update the catalog role depends on whether its grants are applied at the catalog, namespace, or table level. See the applicable procedure for your catalog role:   * [Update the privileges granted to a catalog role at the catalog level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-catalog-level) * [Update the privileges granted to a catalog role at the namespace level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-namespace-level) * [Update the privileges granted to a catalog role at the table level](https://other-docs.snowflake.com/en/opencatalog/secure-catalogs#update-the-privileges-granted-to-a-catalog-role-at-the-table-level) |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to access the REST endpoint of catalog integration <catalog_integration_name> with error: Unable to process: Failed to get subscoped credentials: Error assuming AWS_ROLE: User: <IAM_user_arn> is not authorized to perform: sts:AssumeRole on resource: <S3_role_arn>. Check the accessibility of the REST catalog URI or warehouse. ``` |
| Cause | The AWS IAM user for your external catalog doesn’t have permission to access S3 bucket objects. |
| Solution | Modify the policy document in AWS to allow the IAM user for your Open Catalog account to access objects in your S3 bucket. To modify the policy document, you need to update the IAM role in AWS. For details, see [Retrieve the AWS IAM user for your Polaris Open Catalog account](https://other-docs.snowflake.com/en/opencatalog/create-catalog#step-4-retrieve-the-aws-iam-user-for-your-open-catalog-account) and then [Grant the IAM user permissions to access bucket objects](https://other-docs.snowflake.com/en/opencatalog/create-catalog#step-5-grant-the-iam-user-permissions-to-access-bucket-objects).  Remember that the policy document must include the IAM user ARN and external ID for both your external volume and external catalog in Open Catalog. In the following example policy document, note the following values:   * `arn:aws:iam::111111111111:user/----0000-s` is the STORAGE_AWS_IAM_USER_ARN for the external volume * `arn:aws:iam::222222222222:user/----0000-s` is the IAM user ARN for the external catalog in Snowflake Open Catalog. * `Iceberg_table_external_id` is the STORAGE_AWS_EXTERNAL_ID for your external volume and also the external ID for your   external catalog in Open Catalog.  ```sqljson   {        "Version": "2012-10-17",        "Statement": [          {            "Sid": "",            "Effect": "Allow",            "Principal": {              "AWS": [                  "arn:aws:iam::111111111111:user/----0000-s",                  "arn:aws:iam::222222222222:user/----0000-s"               ]            },            "Action": "sts:AssumeRole",            "Condition": {              "StringEquals": {                "sts:ExternalId": "iceberg_table_external_id"              }            }          }        ]      }   ``` |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: The associated Polaris catalog cannot be of type INTERNAL. ``` |
| Cause | You’re attempting to sync a Snowflake-managed Iceberg table to a catalog integration for an internal catalog in Open Catalog. You can only sync a Snowflake-managed Iceberg table to an external catalog in Open Catalog. |
| Solution | You can’t update an existing internal catalog to an external catalog, so you must create a new external catalog:   1. Follow the instructions in [Create a catalog](https://other-docs.snowflake.com/en/opencatalog/create-catalog) to create an    external catalog in your Open Catalog account. When creating the catalog, ensure that the External toggle is enabled. 2. Update `open_catalog_name` for the `CATALOG_NAME` parameter in your catalog integration to the name of the external    catalog you created. |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: Failed to validate CATALOG_SYNC target '<catalog_integration_name>' due to error: SQL Execution Error: Resource on the REST endpoint of catalog integration CATINT is forbidden due to error: Forbidden: Invalid locations '[<path to metadata file>]' for identifier '<identifier>': <path to metadata file> is not in the list of allowed locations: [<list of allowed locations>]. ``` |
| Cause | The path to the metadata file for the table you want to create isn’t included in the list of allowed locations for your external cloud provider. As a result, Open Catalog can’t access the metadata file for the table. |
| Solution | Ensure that the location of the metadata file falls under the file path of the default base location for the catalog that the service admin created in Open Catalog, or that it falls under any of the additional allowed locations, if applicable. For the list of allowed locations, select the catalog in Open Catalog and refer to the Locations field. |

---
title: Trust Center
source: https://docs.snowflake.com/en/user-guide/trust-center/overview.md
section: User Guide
---

# Trust Center

You can use the Trust Center to evaluate, monitor, and reduce potential security risks in your Snowflake accounts. The Trust
Center evaluates each Snowflake account against recommendations that are specified in scanners. Scanners
might generate *findings*. Trust Center findings provide information about how to reduce potential security risks in your Snowflake
account. Not every scanner run generates a finding. A scanner run that finds no security concern generates no finding in the Trust Center.
You can also use the Trust Center to [configure proactive notifications](notifications-trust-center.md) that help
you monitor your account for security risks.

## Overview

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

You can use the Overview tab to gain high-level insights into the security posture of your account, including a summary of findings
generated by current scanners.

For more information, see [Use the Trust Center Snowsight interface](using-the-trust-center.md).

## Common Trust Center use cases

For more information about how to use the Trust Center to reduce security risks in your Snowflake account, see the following topics:

* [Ensure multi-factor authentication (MFA) is enforced for all human users using password-based authentication](getting-started.md)
* [Find over-privileged roles](getting-started.md).
* [Ensure the amount of users with the ACCOUNTADMIN and SECURITYADMIN system roles is limited](getting-started.md).
* [Find users who have not logged in for 90 days](getting-started.md).
* [Find risky users and mitigate authentication risks](getting-started.md).
* Detect anomalous access.

## Limitations

Snowflake reader accounts aren’t supported.

## Required roles

To view or manage scanners and their findings by using the Trust Center, a user with the [ACCOUNTADMIN role](../security-access-control-overview.md)
must grant the `SNOWFLAKE.TRUST_CENTER_VIEWER` or `SNOWFLAKE.TRUST_CENTER_ADMIN` [application role](../../developer-guide/native-apps/creating-setup-script.md) to your role.

The following table lists common tasks that you perform by using the Trust Center user interface, and the minimum application role that your
role requires to perform those tasks:

> **Note:**
>
> If you are using the Trust Center in the [organization account](../organization-accounts.md), use the GLOBALORGADMIN role, not
> ACCOUNTADMIN, to grant the Trust Center application roles.

See the following table for information about which application roles you need to access specific tabs in the Trust Center:

| Task | Trust Center tab | Minimum required application role | Notes |
| --- | --- | --- | --- |
| [View detection findings](using-the-trust-center.md) | Detections | `SNOWFLAKE.TRUST_CENTER_VIEWER` | `SNOWFLAKE.TRUST_CENTER_ADMIN` role can also view detections. |
| [View violation findings](using-the-trust-center.md) | Violations | `SNOWFLAKE.TRUST_CENTER_VIEWER` | `SNOWFLAKE.TRUST_CENTER_ADMIN` role can also view violations. |
| [Manage violation findings Lifecycle](using-the-trust-center.md) | Violations | `SNOWFLAKE.TRUST_CENTER_ADMIN` | None. |
| [Manage scanner packages](using-the-trust-center.md) | Manage scanners | `SNOWFLAKE.TRUST_CENTER_ADMIN` | None. |
| [Manage scanners](using-the-trust-center.md) | Manage scanners | `SNOWFLAKE.TRUST_CENTER_ADMIN` | None. |
| View org-level violations | Organization | `ORGANIZATION_SECURITY_VIEWER` and `SNOWFLAKE.TRUST_CENTER_ADMIN` | The Organization tab is visible *only* in an [Organization account](../organization-accounts.md). |

You can create a custom role that provides view-only access to the Violations and Detections tabs. You
can also create a separate, administrator-level role to manage violations and scanners by using the Violations and
Manage scanners tabs. For example, to create these two different roles, run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE trust_center_admin_role;
GRANT APPLICATION ROLE SNOWFLAKE.TRUST_CENTER_ADMIN TO ROLE trust_center_admin_role;

CREATE ROLE trust_center_viewer_role;
GRANT APPLICATION ROLE SNOWFLAKE.TRUST_CENTER_VIEWER TO ROLE trust_center_viewer_role;

GRANT ROLE trust_center_admin_role TO USER example_admin_user;

GRANT ROLE trust_center_viewer_role TO USER example_nonadmin_user;
```

> **Note:**
>
> This example isn’t intended to recommend a complete role hierarchy for using the Trust Center. For more information, see each sub-section
> in [Using the Trust Center](using-the-trust-center.md).

## Using private connectivity with Trust Center

The Trust Center supports private connectivity. For more information, see
[Using private connectivity](../ui-snowsight-gs.md).

## Trust Center findings

Trust Center findings include two kinds of findings: violations and detections. Both findings are generated by scanners
as they run in your Snowflake accounts.

You can review findings at the organization level or you can examine more closely the
findings for a specific account.

> **Note:**
>
> Currently, you can’t view detection findings at the organization level.

### Organization-level findings

The Organization tab provides insights into the violation findings that are generated in all of the accounts in the organization. This tab
includes the following information:

* The number of violations in the organization.
* The accounts with the most critical violations.
* The number of violations for each account in the organization. You can select an account to drill down into the individual violations in
  the account.

> **Note:**
>
> You can’t use the Organization tab to resolve or reopen violations. To perform these actions, sign in to the account with the
> violation, and then access the Violations tab.

To access the Organization tab, you must meet the following requirements:

* Sign in to the [organization account](../organization-accounts.md).
* Use a role that has the ORGANIZATION_SECURITY_VIEWER application role. You must also have a Trust Center application role.

### Account-level findings

Scanners find and report violations and detections findings through the Trust Center. A violation persists over
time and represents a configuration that doesn’t conform with a scanner’s requirements. A detection occurs one time and represents a
unique event. You can use the Trust Center to view and manage findings for your account. For more information, see [Using the Trust Center](using-the-trust-center.md).

#### Violations

A scanner can examine an entity at any point and determine whether it is in violation based only on its current configuration. Scanners
continue to report on violations unless you change the configuration to remediate the violations. For example, a scanner reports a violation if some
users haven’t configured multi-factor authentication (MFA).

The Violations tab provides account-level information about scanner results. It includes the following information:

* A graph of scanner violations over time, color coded by low, medium, high, and critical severity.
* An interactive list for each violation that is found. Each row in the list contains details about the violation, when the
  scanner was last run, and how to remediate the violation.

Violations let you identify Snowflake configurations in the account that violate the requirements of
enabled scanner packages. For each violation, the Trust Center provides an explanation of how to
remediate the violation. After you remediate a violation, the violation still appears in the Violations tab until the
next scheduled run of the scanner package containing the scanner that reported the violation begins, or until you
[run the scanner package on demand](using-the-trust-center.md).

When you are signed in to the account with the violations, you can use the Violations tab to perform the
following actions:

* Triage the violations that apply to you and record evidence or progress notes.
* Resolve or reopen violations for any reason and record justification for audit needs.
* Sort or filter violations by severity, scanner package, scanner version, scanned time, updated time, or status.
* Add reasons for a violation status change to provide a clear record of actions taken.

You can remediate violations by changing the configuration. For a violation, the Trust Center provides suggestions for
remediation. After you remediate the issue, the Trust Center no longer reports the violation. You can also manage the [lifecycle of a violation finding](using-the-trust-center.md)
by changing its status to Resolved. Email notifications are suppressed for resolved violations. Suppression prevents more notifications
while you work to remediate the underlying misconfigurations. A resolved violation finding no longer generates a notification.

#### Detections

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

A *detection* represents an event that happened at a specific time. The following findings are examples of events that might be reported as
detections:

* Login events originated from an unrecognized IP address.
* A large amount of data was transferred to an external stage.
* A task had a high error rate between two points in time.

Scanners report each detection based on an event trigger. For example, a scanner reports a detection when it detects a suspicious sign-in event
and reports a separate detection when it detects another suspicious sign-in event at a different time. For a detection, the Trust Center provides
information about the event. Because the event is unique and happened in the past, direct remediation of a detection isn’t possible.

Based on the information that the Trust Center provides, you can investigate whether the detection is meaningful. If the detection is meaningful,
you can take actions to prevent similar events in the future.

> **Note:**
>
> If the scanner that reported the detection runs again, it might or might not report similar detections. Currently, you can’t manage the
> lifecycle of a detection.

For more information about managing detections, see [View detections](using-the-trust-center.md).

## Scanners

A *scanner* is a background process that checks your account for security risks that are based on the following criteria:

* How you configured your account.
* Anomalous events.

The Trust Center groups scanners into scanner packages. Scanner details provide information
about what security risks the scanner checks for in your account, when the scanner runs, and who receives notifications about the scanner’s
findings for your account. To see the details for a specific scanner, follow the instructions in [View details for a scanner](using-the-trust-center.md).

### Schedule-based scanners

*Schedule-based scanners* run at specific times, according to their schedules. You must enable a scanner package before you can change the
schedule for a scanner. For more information about changing the schedule for a scanner, see [Change the schedule for a scanner](using-the-trust-center.md).

### Event-driven scanners

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

*Event-driven scanners* generate detections that are based on relevant events. Examples include scanners that detect sign-ins from unusual IP
addresses and scanners that detect changes to sensitive parameters. You can’t schedule an event-driven scanner, because an event, not a schedule,
drives the detection that an event-driven scanner generates. The Trust Center reports detections that are generated by event-driven scanners
within an hour of the time that an event occurs.

An event-based scanner can detect events that a schedule-based scanner could miss. For example, consider a schedule-based scanner that detects
the `TRUE` or `FALSE` state of a Boolean parameter once every 10 minutes. Toggling — that is, changing the state of — the value of
that parameter from `TRUE` to `FALSE`, and then back to `TRUE` again before 10 minutes pass would occur undetected by the
schedule-based scanner. An event-based scanner that detects each state change would detect both events.

For a current list of event-driven scanners, see Threat Intelligence scanner package.

> **Note:**
>
> Event-driven scanners might appear as multiple items in the [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md).

### Scanner Packages

*Scanner packages* contain a description and a list of scanners that run when you [enable the scanner package](using-the-trust-center.md).
After you enable a scanner package, the scanner package runs immediately, regardless of the configured schedule. After you enable a scanner
package, you can [enable or disable individual scanners in the scanner package](using-the-trust-center.md). Your role must have the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role to manage scanners
by using the Manage scanners tab. For more information, see Required roles.

The following scanner packages are available:

* Security Essentials scanner package
* CIS Benchmarks scanner package
* Threat Intelligence scanner package

For information about enabling scanner packages, the cost that can occur from enabled scanners, how to change the schedule for a scanner
package, and how to view the list of current scanners in a package, see the following topics:

* [Enable scanner packages](using-the-trust-center.md)
* [Monitoring cost](using-the-trust-center.md)
* [Change the schedule for a scanner package](using-the-trust-center.md)
* [View the list of scanners in a package](using-the-trust-center.md)

Scanner packages are deactivated by default, except for the Security Essentials scanner package.

### Security Essentials scanner package

The **Security Essentials** scanner package scans your account to check whether you have set up the following recommendations:

* You have an authentication policy that enforces all human users to enroll in MFA if they use passwords to
  authenticate.
* All human users are enrolled in MFA if they use passwords to authenticate.
* You set up an account-level network policy that was configured to only allow access from trusted IP addresses.
* You [set up an event table](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#label-nativeapps-consumer-logging-setting-up) if your account [enabled event sharing for a native app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#label-nativeapps-consumer-logging-enabling), so your account receives a copy of the log
  messages and event information that is shared with the application provider.

This scanner package only scans users that are human users; that is, user objects with a TYPE property of PERSON or NULL. For more information,
see [Types of users](../admin-user-management.md).

The **Security Essentials** scanner package:

* Is enabled by default. You can’t deactivate it.
* Runs regularly on a fixed schedule without incurring any serverless compute cost. You can’t change this schedule.
* Any other run of this scanner will incur incremental charges for the serverless compute cost. For more information, see
  [Monitoring cost](using-the-trust-center.md).

For information about running a scanner at the package-level and the scanner-level, see
[Run a scanner package on demand](using-the-trust-center.md) and [Run a scanner on demand](using-the-trust-center.md).

### CIS Benchmarks scanner package

You can access additional security insights by enabling the **CIS Benchmarks** scanner package, which contains scanners that evaluate your
account against the Center for Internet Security (CIS) Snowflake Benchmarks. The CIS Snowflake Benchmarks are a list of best practices for
Snowflake account configurations meant to reduce security vulnerabilities. The CIS Snowflake Benchmarks were created through community
collaboration and consensus among subject matter experts.

To obtain a copy of the CIS Snowflake Benchmarks document, see the
[CIS Snowflake Benchmark website](https://www.cisecurity.org/benchmark/snowflake).

The recommendations found in the CIS Snowflake Benchmarks are numbered by section and recommendation. For example, the first recommendation
of the first section is numbered `1.1`. In the Violations tab, the Trust Center provides section numbers for each
violation if you want to reference the Snowflake CIS Benchmarks.

This scanner package runs once a day by default, but you can change the schedule.

For information about enabling scanner packages, the cost that can occur from enabled scanners, how to change the schedule for a scanner
package, and how to view the list of current scanners in a package, see the following topics:

* [Enable scanner packages](using-the-trust-center.md)
* [Monitoring cost](using-the-trust-center.md)
* [Change the schedule for a scanner package](using-the-trust-center.md)
* [View the list of scanners in a package](using-the-trust-center.md)

> **Note:**
>
> For specific Snowflake CIS benchmarks, Snowflake only determines whether you have implemented a specific security measure, but does not
> evaluate whether the security measure was implemented in a way that achieves its objective. For these benchmarks, the absence of a
> violation does not guarantee that the security measure is implemented in an effective manner. The following benchmarks either do not
> evaluate whether your security implementations were implemented in a way that achieve their goal, or the Trust Center does not perform
> checks for them:
>
> * **All of section 2**: Ensure that activities are monitored and provide recommendations for configuring Snowflake to address
>   activities that require attention. These scanners contain complex queries whose violations don’t appear in the Snowsight console.
>
>   A security officer can derive valuable insights from section 2 scanners by executing the following query against the
>   `snowflake.trust_center.findings` view:
>
>   ```sqlexample
>   SELECT start_timestamp,
>          end_timestamp,
>          scanner_id,
>          scanner_short_description,
>          impact,
>          severity,
>          total_at_risk_count,
>          AT_RISK_ENTITIES
>     FROM snowflake.trust_center.findings
>     WHERE scanner_type = 'Threat' AND
>           completion_status = 'SUCCEEDED'
>     ORDER BY event_id DESC;
>   ```
>
>   In the output, the `AT_RISK_ENTITIES` column contains JSON content with details about activities that require review
>   or remediation. For example, the CIS_BENCHMARKS_CIS2_1 scanner monitors high privilege grants, and security officers should
>   review events reported by this scanner carefully, such as the following sample event:
>
>   ```output
>   [
>     {
>       "entity_detail": {
>         "granted_by": joe_smith,
>         "grantee_name": "SNOWFLAKE$SUSPICIOUS_ROLE",
>         "modified_on": "2025-01-01 07:00:00.000 Z",
>         "role_granted": "ACCOUNTADMIN"
>       },
>       "entity_id": "SNOWFLAKE$SUSPICIOUS_ROLE",
>       "entity_name": "SNOWFLAKE$SUSPICIOUS_ROLE",
>       "entity_object_type": "ROLE"
>     }
>   ]
>   ```
>
>   Snowflake suggests the following best practices for section 2 scanners:
>
>   + Don’t disable section 2 scanners unless you’re confident that you have sufficient monitoring measures in place.
>   + Inspect the violations of section 2 scanners on a regular cadence or configure a monitoring task for detections. Specifically,
>     configure monitoring as described in the `SUGGESTED_ACTION` column of the `snowflake.trust_center.findings` view.
> * **3.1**: Ensure that an account-level network policy was configured to only allow access from trusted IP addresses. Trust Center
>   displays a violation if you don’t have an account-level [network policy](../network-policies.md), but doesn’t evaluate
>   whether the appropriate IP addresses have been allowed or blocked.
> * **4.3**: Ensure that the DATA_RETENTION_TIME_IN_DAYS parameter is set to 90 for critical data. Trust Center displays a violation if the
>   [DATA_RETENTION_TIME_IN_DAYS](../../sql-reference/parameters.md) parameter associated with [Time Travel](../data-time-travel.md) isn’t set to 90
>   days for the account or at least one object, but doesn’t evaluate which data is considered critical.
> * **4.10**: Ensure that data masking is enabled for sensitive data. Trust Center displays a violation if the account does not have at
>   least one [masking policy](../security-column-intro.md), but does not evaluate whether sensitive data is protected
>   appropriately. The Trust Center does not evaluate whether a masking policy is assigned to at least one table or view.
> * **4.11**: Ensure that row-access policies are configured for sensitive data. Trust Center displays a violation if the account doesn’t
>   have at least one [row access policy](../security-row-intro.md), but does not evaluate whether sensitive data is protected.
>   The Trust Center does not evaluate whether a row access policy is assigned to at least one table or view.

### Threat Intelligence scanner package

You can access additional security insights in the Trust Center by enabling the **Threat Intelligence** scanner package. This package
identifies risks based on the following criteria:

* [User types](../admin-user-management.md): Whether a Snowflake account user is a human or a service.
* Authentication methods or policies: Whether a user logs in to their account with a password without being enrolled in MFA.
* Login activity: Whether a user hasn’t logged in recently.
* Abnormal failure rates: Whether a user has a high number of authentication failures or job errors.
* **New!** Detection findings: all new scanners that report detection findings.

Specific scanners in the Threat Intelligence package identify users that demonstrate potentially risky behavior as risky. The following table provides examples:

#### Threat Intelligence scanners

| Scanner | Type | Description |
| --- | --- | --- |
| Migrate human users away from password-only sign-in | Schedule-based | Identifies human users who (a) haven’t set up MFA and signed in with a password at least once in the past 90 days *and* (b) have a password but haven’t set up MFA and haven’t signed in for 90 days. |
| Migrate legacy service users away from password-only sign-in | Schedule-based | Identifies legacy service users who have a password and (a) have signed in with only a password at least once in the past 90 days *and* (b) haven’t signed in for 90 days. |
| Identify users with a high volume of authentication failures | Schedule-based | Identifies users with a high number of authentication failures or job errors, which might indicate attempted takeovers of an account, misconfigurations, exceeded quotas, or permission issues. Provides a risk-severity finding and a risk-mitigation recommendation. |

#### New Threat Intelligence scanners

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

Both schedule-based scanners and event-based scanners
can report detections. This preview adds new scanners of both types. All of the added scanners generate detections instead of violation findings.

This preview adds the following new scanners to the Threat Intelligence scanner package:

| Scanner | Type | Description |
| --- | --- | --- |
| Authentication policy changes | Event-driven | Finds changes to [authentication polices](../authentication-policies.md) at both the account level and the user level. |
| Dormant user sign-ins | Event-driven | Analyzes sign-in history events and flags sign-ins from users who haven’t signed in during the last 90 days. |
| Entities with long-running queries | Schedule-based | Finds users and query IDs associated with long-running queries, which are queries with durations that are two standard deviations away from an average query duration over the last 7 days, or the last time the scanner ran, whichever is more recent. We recommend setting this scanner to run once a day. This scanner might cost more initially, as it builds a 30-day cache, which it stores thereafter. Trust Center reports a detection event the first time this scanner runs. |
| Login protection | Event-driven | Finds recent logins from unusual IP addresses.  **Important:** These events originate from the [Malicious IP Protection service](../malicious-ip-protection.md) and require immediate attention. |
| Sensitive parameter protection | Event-driven | Reports disablement of the following sensitive account-level parameters: [PREVENT_UNLOAD_TO_INLINE_URL](../../sql-reference/parameters.md), [REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_CREATION](../../sql-reference/parameters.md), and [REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_OPERATION](../../sql-reference/parameters.md). This scanner only reports detections of a change from `TRUE` to `FALSE` for these parameters, which are set to `TRUE` by default for the best security posture. |
| Users with administrator privileges | Schedule-based | Finds newly created users whose default role is an administrator role, as well as recent grants to existing users that grant them an administrator role. |
| Users with unusual applications used in sessions | Schedule-based | Finds users who have used unusual client applications that connect to Snowflake. |

The Threat Intelligence scanner package runs once a day by default, but you can change the schedule.

For information about enabling scanner packages, the cost that can occur from enabled scanners, how to change the schedule for a scanner
package, and how to view the list of current scanners in a package, see the following topics:

* [Enable scanner packages](using-the-trust-center.md)
* [Monitoring cost](using-the-trust-center.md)
* [Change the schedule for a scanner package](using-the-trust-center.md)
* [View the list of scanners in a package](using-the-trust-center.md)

## Next steps

* [Getting started with the Trust Center](getting-started.md)

---
title: Try Snowflake Open Catalog for free
source: https://docs.snowflake.com/en/user-guide/opencatalog/try-open-catalog-for-free.md
section: User Guide
---

# Try Snowflake Open Catalog for free

You can try Snowflake Open Catalog for free for 30 days. To try Open Catalog, sign up for a Snowflake trial account, and then use this trial
account to create an Open Catalog account. Snowflake Open Catalog is a managed service for [Apache Polaris™ (incubating)](https://github.com/apache/polaris).

To create a Snowflake trial account, all you need is a valid email address; no payment information or contract is required.

When your trial period ends, you can retain access to your Open Catalog account by signing up
for Snowflake. Also, you can drop your Open Catalog account at any time.

## Create a free Open Catalog account

1. [Sign up for a Snowflake trial account](https://docs.snowflake.com/en/user-guide/admin-trial-account).

**Note**

> Your 30-day free access to Open Catalog begins when you sign up for your Snowflake trial account.

2. From your Snowflake trial account, create an Open Catalog account. For details,
   see [create a Snowflake Open Catalog account](create-open-catalog-account.md).

## Sign in to an Open Catalog account

See [Sign in to Snowflake Open Catalog](signin-snowflake-customer.md).

## Retain access to your Open Catalog account

To retain access to your Open Catalog account after the trial period for your Snowflake account ends, convert your Snowflake trial account
to a paid account. This conversion also converts your Open Catalog account to a paid account. For more information, see
[Convert your Snowflake trial account to a paid account](https://docs.snowflake.com/en/user-guide/admin-trial-account#converting-to-a-paid-account).

## Drop your Open Catalog account

You can drop your account at any time to delete it from the system. For details, see
[Dropping an account](https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-delete#label-delete-account-drop).

**Note**

> If you want to cancel your Snowflake trial account, see
> [Canceling a trial account](https://docs.snowflake.com/en/user-guide/admin-trial-account#canceling-a-trial-account).

---
title: Trying query acceleration
source: https://docs.snowflake.com/en/user-guide/performance-query-warehouse-qas.md
section: User Guide
---

# Trying query acceleration

This topic provides an overview of how a warehouse owner or administrator can use the query acceleration service to improve the performance
of queries running on a warehouse. For complete details about query acceleration, refer to [Using the Query Acceleration Service (QAS)](query-acceleration-service.md).

The query acceleration service offloads portions of query processing to
[serverless compute resources](cost-understanding-compute.md), which speeds up the processing of a query while reducing its demand
on the warehouse’s compute resources.

When a warehouse has outlier queries (i.e. queries that use more resources than a typical query), the query acceleration service might
also improve the performance of the warehouse’s other queries because the extra computing demands of the outlier queries are offloaded
to serverless compute resources.

Examples of workloads that might benefit from the query acceleration service include ad hoc analytics, workloads with unpredictable data
volume per query, and queries with large scans and selective filters.

> **Note:**
>
> You must have [access to the shared SNOWFLAKE database](../sql-reference/account-usage.md) to execute the diagnostic queries provided in this topic. By default, only the ACCOUNTADMIN role has the privileges needed to execute the queries.

## Finding candidates for query acceleration

You can use a function or queries to determine whether enabling the query acceleration service might improve the performance of a query
or set of queries.

**Function: Determine if a specific query might benefit**

The [SYSTEM$ESTIMATE_QUERY_ACCELERATION](../sql-reference/functions/system_estimate_query_acceleration.md) function allows you to check whether a specific query is a good
candidate for query acceleration service.

The function accepts a query id as its sole argument. Wrapping the function in the PARSE_JSON function makes it easier to interpret the
results. For example:

```sqlexample
SELECT PARSE_JSON(system$estimate_query_acceleration('8cd54bf0-1651-5b1c-ac9c-6a9582ebd20f'));
```

If a query is a candidate for query acceleration service and has not yet been accelerated, the `status` of the response is `eligible`.
A status of `ineligible` indicates the query will not benefit if you enable query acceleration service for a warehouse.

For additional information about evaluating the query acceleration service for a particular query, including estimated execution times for
different scale factors, refer to the [reference documentation](../sql-reference/functions/system_estimate_query_acceleration.md).

**Query: Best query candidates across warehouses**

This query identifies the queries in the past week that might benefit most from the query acceleration service by calculating the amount of query execution
time that is eligible for acceleration.

```sqlexample
SELECT query_id, eligible_query_acceleration_time
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY eligible_query_acceleration_time DESC;
```

**Query: Best warehouse candidates by execution time**

This query identifies the warehouses that might benefit the most from the query acceleration service in the past week. For each warehouse, it calculates
the total query execution time eligible for acceleration.

```sqlexample
SELECT warehouse_name, SUM(eligible_query_acceleration_time) AS total_eligible_time
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  GROUP BY warehouse_name
  ORDER BY total_eligible_time DESC;
```

**Query: Best warehouse candidates by number of queries**

This query identifies the warehouses with the most queries, in the past week, eligible for the query acceleration service.

```sqlexample
SELECT warehouse_name, COUNT(query_id) AS num_eligible_queries
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  GROUP BY warehouse_name
  ORDER BY num_eligible_queries DESC;
```

## Cost considerations

The serverless compute resources leased by a warehouse for query acceleration consume credits independent of the credits consumed by the
warehouse, and are billed separately.

Query acceleration service is enabled for an entire warehouse, but unlike upsizing a warehouse, it is only used for queries that benefit
from increased compute power. This can be cost effective for warehouses that run a mixed workload because queries that do not require
additional compute resources do not incur the additional cost of using a larger warehouse.

You can use the warehouse’s [scale factor](query-acceleration-service.md) to help control the cost of the query acceleration
service. This scale factor, which is a multiplier of the warehouse’s credit consumption, sets a limit on how much serverless compute can
be used by a warehouse. For example, if a warehouse has a scale factor of 5, the credit consumption rate of serverless compute resources
cannot exceed the consumption rate of the warehouse by more than 5 times.

You can use the [SYSTEM$ESTIMATE_QUERY_ACCELERATION](../sql-reference/functions/system_estimate_query_acceleration.md) function to gauge how the scale factor affects the
performance of a query.

To maximize performance without considering cost, set the scale factor to 0.

## How to enable Query Acceleration Service

To enable the query acceleration service with a maximized performance boost, use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command as
follows:

```sqlexample
ALTER WAREHOUSE my_wh SET
  ENABLE_QUERY_ACCELERATION = true
  QUERY_ACCELERATION_MAX_SCALE_FACTOR = 0;
```

---
title: Tutorial: Automatically classify and tag sensitive data
source: https://docs.snowflake.com/en/user-guide/tutorials/sensitive-data-auto-classification.md
section: User Guide
---

Snowflake

Data Governance

Sensitive Data Classification

# Tutorial: Automatically classify and tag sensitive data

## Introduction

Identifying and tracking your sensitive data is simple and straightforward. Snowflake provides a built-in algorithm to identify your
sensitive data and automatically tag that data with system tags to help track the type of data and how sensitive it is.

With minimal setup, you can also configure a database so Snowflake automatically performs this classification process for new and changing
data and applies user-defined tags along with the system tags.

In this tutorial, you’ll do the following:

* Set up the resources you need to complete the tutorial, including a user-defined tag that is applied to the sensitive data.
* Create a classification profile, which Snowflake uses to automatically classify data when it’s added to a database.
* Add a tag map to the classification profile so the user-defined tag is applied to data that Snowflake identifies as sensitive.
* View the results of the classification.

## Set up governance database

In this tutorial, you’ll create the Snowflake objects (a user-defined tag and a classification profile) needed to govern your data. Based on
best practice, these object are created in a database dedicated to governance.

[Open a SQL worksheet](../ui-snowsight-worksheets-gs.md), and then execute the following statements to create a database and schema
for the governance objects:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE DATABASE IF NOT EXISTS governance_db;
CREATE SCHEMA IF NOT EXISTS governance_db.sch;
```

> **Note:**
>
> For simplicity, you will use the ACCOUNTADMIN system role to avoid setting up the privileges needed to configure sensitive data
> classification. In practice, you should not use this powerful role but rather create custom roles with the required privileges.

## Set up your data

Before setting up the data for this tutorial, create a warehouse to populate a table:

```sqlexample
CREATE WAREHOUSE IF NOT EXISTS tutorial_wh;
```

### Create a table

1. Create the database and schema that will contain the table to be classified.

   ```sqlexample
   CREATE DATABASE IF NOT EXISTS data_db;
   CREATE SCHEMA IF NOT EXISTS data_db.sch;
   ```
2. Create the table structure that will contain the sensitive data.

   ```sqlexample
   CREATE TABLE data_db.sch.customers (
     account_number NUMBER(38,0),
     first_name VARCHAR(16777216),
     last_name VARCHAR(16777216),
     email VARCHAR(16777216)
   );
   ```

### Insert values into the table

Add data to the table you created:

```sqlexample
USE WAREHOUSE tutorial_wh;

INSERT INTO data_db.sch.customers (account_number, first_name, last_name, email)
  VALUES
    (1589420, 'john', 'doe', 'john.doe@example.com'),
    (2834123, 'jane', 'doe', 'jane.doe@example.com'),
    (4829381, 'jim', 'doe', 'jim.doe@example.com'),
    (9821802, 'susan', 'smith', 'susan.smith@example.com'),
    (8028387, 'bart', 'simpson', 'bart.barber@example.com');
```

## Create a classification profile

Great, you now have a table full of data that you need to classify to help protect your sensitive data. Because you want Snowflake to
automatically classify data when it is added to a database, you’ll need to create a classification profile.

A classification profile controls how often data in a database is classified, along with what happens during that classification process.
Every classification profile is an instance of the CLASSIFICATION_PROFILE class.

To create the classification profile for your database, run the following command:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  governance_db.sch.my_classification_profile(
      {
        'minimum_object_age_for_classification_days': 0,
        'maximum_classification_validity_days': 30,
        'auto_tag': true,
        'classify_views': true
      });
```

When this classification profile is set on your database, the following actions happens:

* Classification starts in less than one day (`'minimum_object_age_for_classification_days': 0`).
* After the initial classification, Snowflake rechecks every 30 days to see if tables need to be reclassified
  (`'maximum_classification_validity_days': 30`).
* Classification tags will be automatically set on columns identified as containing sensitive data (`'auto_tag': true`).
* Snowflake classifies data in tables *and* views (`'classify_views': true`).

## Add tag map to classification profile

Because you set `'auto_tag': true` in your classification profile, Snowflake will automatically apply [system classification tags](../classify-intro.md) when it classifies data as being sensitive. The SEMANTIC_CATEGORY tag classifies the type of
data, for example identifying the data as a name or address. The PRIVACY_CATEGORY tag classifies the sensitivity of the data, for
example identifying the data as an identifier or quasi-identifier.

Now suppose you want to go one step further and automatically apply your own user-defined tag based on how data is classified. This tutorial
shows you how!

To create the custom tag that you want applied to sensitive data, execute the following statement:

```sqlexample
CREATE TAG governance_db.sch.tutorial_pii;
```

Next, you’ll modify the classification profile so this user-defined tag gets applied when Snowflake identifies that a column contains names.
Adding a tag map to the classification profile configures how and when the user-defined tag gets applied.

To add the tag map to your classification profile, execute the `classification_profile_name!SET_TAG_MAP` method:

```sqlexample
CALL governance_db.sch.my_classification_profile!SET_TAG_MAP(
  {'column_tag_map':[
    {
      'tag_name':'governance_db.sch.tutorial_pii',
      'tag_value':'sensitive_name',
      'semantic_categories':['NAME']
    }]});
```

Now, if sensitive data classification determines the system-defined semantic category is `NAME`, then the user-defined tag `tutorial_pii` is
set on the column. Based on the classification profile, the value of the user-defined `tutorial_pii` tag is set to `sensitive_name`.

> **Note:**
>
> You can also define a tag map when creating the classification profile.

## Set classification profile on a database

You have your classification profile configured, so you’re ready to set it on the database. This starts the automatic classification process.

```sqlexample
ALTER DATABASE data_db
  SET CLASSIFICATION_PROFILE = 'governance_db.sch.my_classification_profile';
```

That’s it, Snowflake does the rest! Snowflake starts classifying the existing data and classifies new data when it is added to
the database.

## View classification results

Before completing this part of the tutorial, you’ll have to wait one hour for Snowflake to complete the classification process.

After one hour, execute the following statement to retrieve the results of the classification:

```sqlexample
CALL SYSTEM$GET_CLASSIFICATION_RESULT('data_db.sch.customers');
```

In the results, notice the following:

* The ACCOUNT_NUMBER column was not classified as sensitive, so it wasn’t assigned classification tags.
* The EMAIL column was flagged as having a semantic category of EMAIL and a privacy category of IDENTIFIER.
* Based on the tag map of the classification profile, the `governance_db.sch.tutorial_pii` user-defined tag got assigned to columns that
  had a semantic category of NAME (see highlighted lines in output).

```output
  {
  "classification_profile_config": {
    "classification_profile_name": "GOVERNANCE_DB.SCH.MY_CLASSIFICATION_PROFILE"
  },
  "classification_result": {
    "ACCOUNT_NUMBER": {
      "alternates": []
    },
    "EMAIL": {
      "alternates": [],
      "recommendation": {
        "confidence": "HIGH",
        "coverage": 1,
        "details": [],
        "privacy_category": "IDENTIFIER",
        "semantic_category": "EMAIL",
        "tags": [
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.semantic_category",
            "tag_value": "EMAIL"
          },
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.privacy_category",
            "tag_value": "IDENTIFIER"
          }
        ]
      },
      "valid_value_ratio": 1
    },
    "FIRST_NAME": {
      "alternates": [],
      "recommendation": {
        "confidence": "HIGH",
        "coverage": 1,
        "details": [],
        "privacy_category": "IDENTIFIER",
        "semantic_category": "NAME",
        "tags": [
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.semantic_category",
            "tag_value": "NAME"
          },
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.privacy_category",
            "tag_value": "IDENTIFIER"
          },
          {
            "tag_applied": true,
            "tag_name": "governance_db.sch.tutorial_pii",
            "tag_value": "sensitive_name"
          }
        ]
      },
      "valid_value_ratio": 1
    },
    "LAST_NAME": {
      "alternates": [],
      "recommendation": {
        "confidence": "HIGH",
        "coverage": 1,
        "details": [],
        "privacy_category": "IDENTIFIER",
        "semantic_category": "NAME",
        "tags": [
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.semantic_category",
            "tag_value": "NAME"
          },
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.privacy_category",
            "tag_value": "IDENTIFIER"
          },
          {
            "tag_applied": true,
            "tag_name": "governance_db.sch.tutorial_pii",
            "tag_value": "sensitive_name"
          }
        ]
      },
      "valid_value_ratio": 1
    }
  }
}
```

## Clean up, summary, and additional resources

Congratulations! You’ve successfully completed this tutorial.

In summary, you learned how to do the following:

* Create a classification profile to control how automatic classification is implemented.
* Add a tag map to the classification profile so user-defined tags are automatically set on columns containing sensitive data.
* Set the classification profile on a database to kick off automatic classification.
* View the results of automatic classification.

### Drop the tutorial objects

If you plan to repeat the tutorial, you can keep the objects that you created.

Otherwise, drop the tutorial objects as follows:

```sqlexample
DROP TAG governance_db.sch.tutorial_pii;
DROP DATABASE governance_db;
DROP DATABASE data_db;
DROP WAREHOUSE tutorial_wh;
```

### What’s next?

For complete details about implementing automatic sensitive data classification, including associated costs and implementing custom
classification, see [Use SQL to set up sensitive data classification](../classify-auto.md).

---
title: Tutorial: Bulk loading from a local file system using COPY
source: https://docs.snowflake.com/en/user-guide/tutorials/data-load-internal-tutorial.md
section: User Guide
---

Getting Started

# Tutorial: Bulk loading from a local file system using COPY

This tutorial describes how to load data from files in your local file system into a table.

## Introduction

In this tutorial, you will learn how to:

* Create named file format objects that describe your data files.
* Create named stage objects.
* Upload your data to the internal stages.
* Load your data into tables.
* Resolve errors in your data files.

The tutorial covers how to load both CSV and JSON data using SnowSQL.

## Prerequisites

The tutorial assumes the following:

* You have a Snowflake account and a user with a role that grants the necessary
  privileges to create a database, tables, and virtual warehouse objects.
* You have SnowSQL installed.

The [Snowflake in 20 minutes](snowflake-in-20minutes.md) tutorial provides the related step-by-step instructions to meet these requirements.

In addition, you need to do the following before you start the tutorial:

* Download sample files provided for this exercise.
* Create a database, tables, and a virtual warehouse for this tutorial.
  These are the basic Snowflake objects needed for most Snowflake activities.

### Download the sample data files

For this tutorial you need to download the sample data files provided by Snowflake.

To download and unzip the sample data files:

1. Right-click the name of the
   archive file, [`data-load-internal.zip`](../../_downloads/22c3a6290f5d1f4d97075282729f3859/data-load-internal.zip)
   and save the link/file to your local file system.
2. Unzip the sample files. The tutorial assumes you unpacked files in
   to the following directories:

> * Linux/macOS: `/tmp/load`
> * Windows: `C:\tempload`

These data files include sample contact data in the following formats:

* CSV files that contain a header row and five records. The field
  delimiter is the pipe (`|`) character.
  The following example shows a header row and one record:

  > ```sqlexample
  > ID|lastname|firstname|company|email|workphone|cellphone|streetaddress|city|postalcode
  > 6|Reed|Moses|Neque Corporation|eget.lacus@facilisis.com|1-449-871-0780|1-454-964-5318|Ap #225-4351 Dolor Ave|Titagarh|62631
  > ```
* A single file in JSON format that contains one array and three objects.
  The following is an example of an array that contains one of the objects:

  > ```sqlexample
  > [
  >  {
  >    "customer": {
  >      "address": "509 Kings Hwy, Comptche, Missouri, 4848",
  >      "phone": "+1 (999) 407-2274",
  >      "email": "blankenship.patrick@orbin.ca",
  >      "company": "ORBIN",
  >      "name": {
  >        "last": "Patrick",
  >        "first": "Blankenship"
  >      },
  >      "_id": "5730864df388f1d653e37e6f"
  >    }
  >  },
  > ]
  > ```

### Create the database, tables, and warehouse

Execute the following statements to create a database, two tables
(for csv and json data), and a virtual warehouse needed for this tutorial.
After you complete the tutorial, you can drop these objects.

> ```sqlexample
> -- Create a database. A database automatically includes a schema named 'public'.
>
> CREATE OR REPLACE DATABASE mydatabase;
>
> /* Create target tables for CSV and JSON data. The tables are temporary, meaning they persist only for the duration of the user session and are not visible to other users. */
>
> CREATE OR REPLACE TEMPORARY TABLE mycsvtable (
>   id INTEGER,
>   last_name STRING,
>   first_name STRING,
>   company STRING,
>   email STRING,
>   workphone STRING,
>   cellphone STRING,
>   streetaddress STRING,
>   city STRING,
>   postalcode STRING);
>
> CREATE OR REPLACE TEMPORARY TABLE myjsontable (
>   json_data VARIANT);
>
> -- Create a warehouse
>
> CREATE OR REPLACE WAREHOUSE mywarehouse WITH
>   WAREHOUSE_SIZE='X-SMALL'
>   AUTO_SUSPEND = 120
>   AUTO_RESUME = TRUE
>   INITIALLY_SUSPENDED=TRUE;
> ```

The `CREATE WAREHOUSE` statement sets up the warehouse to be suspended initially.
The statement also sets `AUTO_RESUME = true`, which starts the warehouse automatically
when you execute SQL statements that require compute resources.

## Create file format objects

When you load data from a file into a table, you must describe the format of the file
and specify how the data in the file should be interpreted and processed. For example,
if you are loading pipe-delimited data from a CSV file, you must specify that the file
uses the CSV format with pipe symbols as delimiters.

When you execute the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command, you specify this format information. You can
either specify this information as options in the command (e.g.
`TYPE = CSV`, `FIELD_DELIMITER = '|'`, etc.) or you can specify a
file format object that contains this format information. You can create a named file
format object using the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command.

In this step, you create file format objects describing the data format of the sample CSV and
JSON data provided for this tutorial.

### Create a file format object for CSV data

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command
to create the `mycsvformat` file format.

```sqlexample
CREATE OR REPLACE FILE FORMAT mycsvformat
  TYPE = 'CSV'
  FIELD_DELIMITER = '|'
  SKIP_HEADER = 1;
```

Where:

* `TYPE = 'CSV'` indicates the source file format type. CSV is the default file format type.
* `FIELD_DELIMITER = '|'` indicates the ‘|’ character is a field separator. The default value is ‘,’.
* `SKIP_HEADER = 1` indicates the source file includes one header line. The COPY command skips these header lines when loading data. The default value is 0.

### Create a file format object for JSON data

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command to create
the `myjsonformat` file format.

```sqlexample
CREATE OR REPLACE FILE FORMAT myjsonformat
  TYPE = 'JSON'
  STRIP_OUTER_ARRAY = TRUE;
```

Where:

* `TYPE = 'JSON'` indicates the source file format type.
* `STRIP_OUTER_ARRAY = TRUE` directs the COPY command to exclude the root brackets ([]) when loading data to the table.

## Create stage objects

A stage specifies where data files are stored (i.e. “staged”) so that the data
in the files can be loaded into a table.
A named [internal stage](../data-load-overview.md)
is a cloud storage location managed by Snowflake.

Creating a named stage is useful if you want multiple users or processes
to upload files. If you plan to stage data files to load only
by you, or to load only into a single table, then you may prefer
to use your user stage or the table stage. For information, see
[Bulk loading from a local file system](../data-load-local-file-system.md).

In this step, you create named stages for the different types of sample data files.

### Create a stage for CSV data files

Execute CREATE STAGE to create the `my_csv_stage` stage:

```sqlexample
CREATE OR REPLACE STAGE my_csv_stage
  FILE_FORMAT = mycsvformat;
```

Note that if you specify the `FILE_FORMAT` option when creating
the stage, it is not necessary to specify the same `FILE_FORMAT`
option in the COPY command used to load data from the stage.

### Create a stage for JSON data files

Execute CREATE STAGE to create the `my_json_stage` stage:

```sqlexample
CREATE OR REPLACE STAGE my_json_stage
  FILE_FORMAT = myjsonformat;
```

## Stage the data files

Execute [PUT](../../sql-reference/sql/put.md) to upload (stage) sample data files from your local
file system to the stages you created in Tutorial: Bulk loading from a local file system using COPY.

### Staging the CSV sample data files

Execute the PUT command to upload the CSV files from your local file system.

* Linux or macOS

  > ```sqlexample
  > PUT file:///tmp/load/contacts*.csv @my_csv_stage AUTO_COMPRESS=TRUE;
  > ```
* Windows

  > ```sqlexample
  > PUT file://C:\temp\load\contacts*.csv @my_csv_stage AUTO_COMPRESS=TRUE;
  > ```

Let us take a closer look at the command:

* `file://<file-path>[/]contacts*.csv` specifies the full directory path and names of the files on your local machine to stage. Note that file system wildcards are allowed.
* `@my_csv_stage` is the stage name where to stage the data.
* `auto_compress=true;` directs the command to compress the data when staging. This is also the default.

The command returns the following result, showing the staged files:

```output
+---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------+
| source        | target           | source_size | target_size | source_compression | target_compression | status   | message |
|---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------|
| contacts1.csv | contacts1.csv.gz |         694 |         506 | NONE               | GZIP               | UPLOADED |         |
| contacts2.csv | contacts2.csv.gz |         763 |         565 | NONE               | GZIP               | UPLOADED |         |
| contacts3.csv | contacts3.csv.gz |         771 |         567 | NONE               | GZIP               | UPLOADED |         |
| contacts4.csv | contacts4.csv.gz |         750 |         561 | NONE               | GZIP               | UPLOADED |         |
| contacts5.csv | contacts5.csv.gz |         887 |         621 | NONE               | GZIP               | UPLOADED |         |
+---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------+
```

### Stage the JSON sample data files

Execute the PUT command to upload the JSON file from your local file system to the named stage.

* Linux or macOS

  > ```sqlexample
  > PUT file:///tmp/load/contacts.json @my_json_stage AUTO_COMPRESS=TRUE;
  > ```
* Windows

  > ```sqlexample
  > PUT file://C:\temp\load\contacts.json @my_json_stage AUTO_COMPRESS=TRUE;
  > ```

The command returns the following result, showing the staged files:

```output
+---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------+
| source        | target           | source_size | target_size | source_compression | target_compression | status   | message |
|---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------|
| contacts.json | contacts.json.gz |         965 |         446 | NONE               | GZIP               | UPLOADED |         |
+---------------+------------------+-------------+-------------+--------------------+--------------------+----------+---------+
```

### List the staged files (optional)

You can list the staged files by using the [LIST](../../sql-reference/sql/list.md) command.

#### CSV

> ```sqlexample
> LIST @my_csv_stage;
> ```

Snowflake returns a list of your staged files.

#### JSON

> ```sqlexample
> LIST @my_json_stage;
> ```

Snowflake returns a list of your staged files.

## Copy data into the target tables

Execute [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) to load staged data into the target tables.

### CSV

To load the data from the sample CSV files:

1. Start by loading the data from one of the files (`contacts1.csv.gz`). Execute the following:

   ```sqlexample
   COPY INTO mycsvtable
     FROM @my_csv_stage/contacts1.csv.gz
     FILE_FORMAT = (FORMAT_NAME = mycsvformat)
     ON_ERROR = 'skip_file';
   ```

   Where:

   * The `FROM` clause specifies the location of the staged data
     file (stage name followed by the file name).
   * The `ON_ERROR` clause specifies what to do when the COPY command
     encounters errors in the files. By default, the command stops
     loading data when the first error is encountered; however, we’ve instructed it to skip any file containing an error and move on to loading the next file. Note that this is just for illustration purposes; none of the files in this tutorial contain errors.

   The COPY command returns a result showing the name of the file copied and related information:

   ```output
   +-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   | file                        | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
   |-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
   | mycsvtable/contacts1.csv.gz | LOADED |           5 |           5 |           1 |           0 |        NULL |             NULL |                  NULL |                    NULL |
   +-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   ```
2. Load the rest of the staged files in the `mycsvtable` table.

   The following example uses pattern matching to load data from all files
   that match the regular expression `.*contacts[1-5].csv.gz` into the `mycsvtable` table.

   ```sqlexample
   COPY INTO mycsvtable
     FROM @my_csv_stage
     FILE_FORMAT = (FORMAT_NAME = mycsvformat)
     PATTERN='.*contacts[1-5].csv.gz'
     ON_ERROR = 'skip_file';
   ```

   Where the `PATTERN` clause specifies that the command should load data
   from the filenames matching this regular expression `(.*employees0[1-5].csv.gz)`.

   The COPY command returns a result showing the name of the file copied and related information:

   ```output
   +-----------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------+
   | file                        | status      | rows_parsed | rows_loaded | error_limit | errors_seen | first_error                                                                                                                                                          | first_error_line | first_error_character | first_error_column_name |
   |-----------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------|
   | mycsvtable/contacts2.csv.gz | LOADED      |           5 |           5 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   | mycsvtable/contacts3.csv.gz | LOAD_FAILED |           5 |           0 |           1 |           2 | Number of columns in file (11) does not match that of the corresponding table (10), use file format option error_on_column_count_mismatch=false to ignore this error |                3 |                     1 | "MYCSVTABLE"[11]        |
   | mycsvtable/contacts4.csv.gz | LOADED      |           5 |           5 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   | mycsvtable/contacts5.csv.gz | LOADED      |           6 |           6 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   +-----------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------+
   ```

   Note the following highlights in the result:

   * The data in `contacts1.csv.gz` is ignored because you already loaded
     the data successfully.
   * The data in these files was loaded successfully:
     `contacts2.csv.gz`, `contacts4.csv.gz`, and
     `contacts5.csv.gz`.
   * The data in `contacts3.csv.gz` was skipped due to 2 data errors.
     The next step in this tutorial addresses how to validate and fix
     the errors.

### JSON

Load the `contacts.json.gz` staged data file into the `myjsontable` table.

```sqlexample
COPY INTO myjsontable
  FROM @my_json_stage/contacts.json.gz
  FILE_FORMAT = (FORMAT_NAME = myjsonformat)
  ON_ERROR = 'skip_file';
```

The COPY command returns a result showing the name of the file copied
and related information:

```output
+------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
| file                         | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
|------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
| myjsontable/contacts.json.gz | LOADED |           3 |           3 |           1 |           0 |        NULL |             NULL |                  NULL |                    NULL |
+------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
```

## Resolve data load errors

In the preceding step, the COPY INTO command skipped loading one of the files when
it encountered the first error. You need to find all the errors and fix them.
In this step, you use the [VALIDATE](../../sql-reference/functions/validate.md) function
to validate the previous execution of the COPY INTO command and returns all errors.

### Validate the sample data files and retrieve any errors

You first need the query ID associated with the COPY INTO command
that you previously executed. You then call the `VALIDATE` function,
specifying the query ID.

1. Retrieve the query ID.

   1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   2. Make sure the role in Snowsight is the same as the role you are using
      in SnowSQL to run SQL statements for this tutorial.
   3. In the navigation menu, select Monitoring » Query History.
   4. Select the row for the specific COPY INTO command to open the query
      information pane.
   5. Copy the Query ID value.
2. Validate the COPY INTO command execution, represented by the query ID,
   and save errors to a new table named `save_copy_errors`.

   1. In SnowSQL, execute the following command. Replace `query_id` with the Query ID value.

      ```sqlexample
      CREATE OR REPLACE TABLE save_copy_errors AS SELECT * FROM TABLE(VALIDATE(mycsvtable, JOB_ID=>'<query_id>'));
      ```
   2. Query the `save_copy_errors` table.

      ```sqlexample
      SELECT * FROM SAVE_COPY_ERRORS;
      ```

      The query returns the following results:

      ```output
      +----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------+
      | ERROR                                                                                                                                                                | FILE                                | LINE | CHARACTER | BYTE_OFFSET | CATEGORY |   CODE | SQL_STATE | COLUMN_NAME                   | ROW_NUMBER | ROW_START_LINE | REJECTED_RECORD                                                                                                                                     |
      |----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------|
      | Number of columns in file (11) does not match that of the corresponding table (10), use file format option error_on_column_count_mismatch=false to ignore this error | mycsvtable/contacts3.csv.gz         |    3 |         1 |         234 | parsing  | 100080 |     22000 | "MYCSVTABLE"[11]              |          1 |              2 | 11|Ishmael|Burnett|Dolor Elit Pellentesque Ltd|vitae.erat@necmollisvitae.ca|1-872|600-7301|1-513-592-6779|P.O. Box 975, 553 Odio, Road|Hulste|63345 |
      | Field delimiter '|' found while expecting record delimiter '\n'                                                                                                      | mycsvtable/contacts3.csv.gz         |    5 |       125 |         625 | parsing  | 100016 |     22000 | "MYCSVTABLE"["POSTALCODE":10] |          4 |              5 | 14|Sophia|Christian|Turpis Ltd|lectus.pede@non.ca|1-962-503-3253|1-157-|850-3602|P.O. Box 824, 7971 Sagittis Rd.|Chattanooga|56188                  |
      +----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------+
      ```

The result shows two data errors in `mycsvtable/contacts3.csv.gz`:

* `Number of columns in file (11) does not match that of the corresponding table (10)`

  In Row 1, a hyphen was mistakenly replaced with the pipe (`|`) character, the data file delimiter, effectively creating an additional column in the record.
* `Field delimiter '|' found while expecting record delimiter 'n'`

  In Row 5, an additional pipe (`|`) character was introduced after a hyphen, breaking the record.

### Fix the errors and load the data files again

1. Fix the errors in the records manually in the `contacts3.csv` file in your local environment.
2. Use the [PUT](../../sql-reference/sql/put.md) command to upload the modified data file to the stage. The modified file overwrites the existing staged file.

   * Linux or macOS:

     ```sqlexample
     PUT file:///tmp/load/contacts3.csv @my_csv_stage AUTO_COMPRESS=TRUE OVERWRITE=TRUE;
     ```
   * Windows:

     ```sqlexample
     PUT file://C:\temp\load\contacts3.csv @my_csv_stage AUTO_COMPRESS=TRUE OVERWRITE=TRUE;
     ```
3. Copy the data from the staged files into the tables.

   ```sqlexample
   COPY INTO mycsvtable
     FROM @my_csv_stage/contacts3.csv.gz
     FILE_FORMAT = (FORMAT_NAME = mycsvformat)
     ON_ERROR = 'skip_file';
   ```

Snowflake returns the following results, indicating the data in `contacts3.csv.gz` was loaded successfully.

> ```output
> +-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
> | file                        | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
> |-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
> | mycsvtable/contacts3.csv.gz | LOADED |           5 |           5 |           1 |           0 |        NULL |             NULL |                  NULL |                    NULL |
> +-----------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
> ```

### Verify the loaded data

Execute a [SELECT](../../sql-reference/sql/select.md) query to verify that the data was loaded successfully.

#### CSV

> ```sqlexample
> SELECT * FROM mycsvtable;
> ```

The query returns the following results:

> ```output
> +----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+---------------------------------+------------------+------------+
> | ID | LAST_NAME | FIRST_NAME | COMPANY                          | EMAIL                                  | WORKPHONE      | CELLPHONE      | STREETADDRESS                   | CITY             | POSTALCODE |
> |----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+---------------------------------+------------------+------------|
> |  6 | Reed      | Moses      | Neque Corporation                | eget.lacus@facilisis.com               | 1-449-871-0780 | 1-454-964-5318 | Ap #225-4351 Dolor Ave          | Titagarh         |      62631 |
> |  7 | Audrey    | Franks     | Arcu Eu Limited                  | eu.dui@aceleifendvitae.org             | 1-527-945-8935 | 1-263-127-1173 | Ap #786-9241 Mauris Road        | Bergen           |      81958 |
> |  8 | Jakeem    | Erickson   | A Ltd                            | Pellentesque.habitant@liberoProinmi.ca | 1-381-591-9386 | 1-379-391-9490 | 319-1703 Dis Rd.                | Pangnirtung      |      62399 |
> |  9 | Xaviera   | Brennan    | Bibendum Ullamcorper Limited     | facilisi.Sed.neque@dictum.edu          | 1-260-757-1919 | 1-211-651-0925 | P.O. Box 146, 8385 Vel Road     | Béziers          |      13082 |
> | 10 | Francis   | Ortega     | Vitae Velit Egestas Associates   | egestas.rhoncus.Proin@faucibus.com     | 1-257-584-6487 | 1-211-870-2111 | 733-7191 Neque Rd.              | Chatillon        |      33081 |
> | 16 | Aretha    | Sykes      | Lobortis Tellus Justo Foundation | eget@Naminterdumenim.net               | 1-670-849-1866 | 1-283-783-3710 | Ap #979-2481 Dui. Av.           | Thurso           |      66851 |
> | 17 | Akeem     | Casey      | Pharetra Quisque Ac Institute    | dictum.eu@magna.edu                    | 1-277-657-0361 | 1-623-630-8848 | Ap #363-6074 Ullamcorper, Rd.   | Idar-Oberstei    |      30848 |
> | 18 | Keelie    | Mendez     | Purus In Foundation              | Nulla.eu.neque@Aeneanegetmetus.co.uk   | 1-330-370-8231 | 1-301-568-0413 | 3511 Tincidunt Street           | Lanklaar         |      73942 |
> | 19 | Lane      | Bishop     | Libero At PC                     | non@dapibusligula.ca                   | 1-340-862-4623 | 1-513-820-9039 | 7459 Pede. Street               | Linkebeek        |      89252 |
> | 20 | Michelle  | Dickson    | Ut Limited                       | Duis.dignissim.tempor@cursuset.org     | 1-202-490-0151 | 1-129-553-7398 | 6752 Eros. St.                  | Stornaway        |      61290 |
> | 20 | Michelle  | Dickson    | Ut Limited                       | Duis.dignissim.tempor@cursuset.org     | 1-202-490-0151 | 1-129-553-7398 | 6752 Eros. St.                  | Stornaway        |      61290 |
> | 21 | Lance     | Harper     | Rutrum Lorem Limited             | Sed.neque@risus.com                    | 1-685-778-6726 | 1-494-188-6168 | 663-7682 Et St.                 | Gisborne         |      73449 |
> | 22 | Keely     | Pace       | Eleifend Limited                 | ante.bibendum.ullamcorper@necenim.edu  | 1-312-381-5244 | 1-432-225-9226 | P.O. Box 506, 5233 Aliquam Av.  | Woodlands County |      61213 |
> | 23 | Sage      | Leblanc    | Egestas A Consulting             | dapibus@elementum.org                  | 1-630-981-0327 | 1-301-287-0495 | 4463 Lorem Road                 | Woodlands County |      33951 |
> | 24 | Marny     | Holt       | Urna Nec Luctus Associates       | ornare@vitaeorci.ca                    | 1-522-364-3947 | 1-460-971-8360 | P.O. Box 311, 4839 Nulla Av.    | Port Coquitlam   |      36733 |
> | 25 | Holly     | Park       | Mauris PC                        | Vestibulum.ante@Maecenasliberoest.org  | 1-370-197-9316 | 1-411-413-4602 | P.O. Box 732, 8967 Eu Avenue    | Provost          |      45507 |
> |  1 | Imani     | Davidson   | At Ltd                           | nec@sem.net                            | 1-243-889-8106 | 1-730-771-0412 | 369-6531 Molestie St.           | Russell          |      74398 |
> |  2 | Kelsie    | Abbott     | Neque Sed Institute              | lacus@pede.net                         | 1-467-506-9933 | 1-441-508-7753 | P.O. Box 548, 1930 Pede. Road   | Campbellton      |      27022 |
> |  3 | Hilel     | Durham     | Pede Incorporated                | eu@Craspellentesque.net                | 1-752-108-4210 | 1-391-449-8733 | Ap #180-2360 Nisl. Street       | Etalle           |      84025 |
> |  4 | Graiden   | Molina     | Sapien Institute                 | sit@fermentum.net                      | 1-130-156-6666 | 1-269-605-7776 | 8890 A, Rd.                     | Dundee           |      70504 |
> |  5 | Karyn     | Howard     | Pede Ac Industries               | sed.hendrerit@ornaretortorat.edu       | 1-109-166-5492 | 1-506-782-5089 | P.O. Box 902, 5398 Et, St.      | Saint-Hilarion   |      26232 |
> | 11 | Ishmael   | Burnett    | Dolor Elit Pellentesque Ltd      | vitae.erat@necmollisvitae.ca           | 1-872-600-7301 | 1-513-592-6779 | P.O. Box 975, 553 Odio, Road    | Hulste           |      63345 |
> | 12 | Ian       | Fields     | Nulla Magna Malesuada PC         | rutrum.non@condimentumDonec.co.uk      | 1-138-621-8354 | 1-369-126-7068 | P.O. Box 994, 7053 Quisque Ave  | Ostra Vetere     |      90433 |
> | 13 | Xanthus   | Acosta     | Tortor Company                   | Nunc.lectus@a.org                      | 1-834-909-8838 | 1-693-411-2633 | 282-7994 Nunc Av.               | Belcarra         |      28890 |
> | 14 | Sophia    | Christian  | Turpis Ltd                       | lectus.pede@non.ca                     | 1-962-503-3253 | 1-157-850-3602 | P.O. Box 824, 7971 Sagittis Rd. | Chattanooga      |      56188 |
> | 15 | Dorothy   | Watson     | A Sollicitudin Orci Company      | diam.dictum@fermentum.co.uk            | 1-158-596-8622 | 1-402-884-3438 | 3348 Nec Street                 | Qu�bec City      |      63320 |
> +----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+---------------------------------+------------------+------------+
> ```

#### JSON

> ```sqlexample
> SELECT * FROM myjsontable;
> ```

The query returns the following results:

> ```output
> +-----------------------------------------------------------------+
> | JSON_DATA                                                       |
> |-----------------------------------------------------------------|
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864df388f1d653e37e6f",                          |
> |     "address": "509 Kings Hwy, Comptche, Missouri, 4848",       |
> |     "company": "ORBIN",                                         |
> |     "email": "blankenship.patrick@orbin.ca",                    |
> |     "name": {                                                   |
> |       "first": "Blankenship",                                   |
> |       "last": "Patrick"                                         |
> |     },                                                          |
> |     "phone": "+1 (999) 407-2274"                                |
> |   }                                                             |
> | }                                                               |
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864d4d8523c8baa8baf6",                          |
> |     "address": "290 Lefferts Avenue, Malott, Delaware, 1575",   |
> |     "company": "SNIPS",                                         |
> |     "email": "anna.glass@snips.name",                           |
> |     "name": {                                                   |
> |       "first": "Anna",                                          |
> |       "last": "Glass"                                           |
> |     },                                                          |
> |     "phone": "+1 (958) 411-2876"                                |
> |   }                                                             |
> | }                                                               |
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864e375e08523150fc04",                          |
> |     "address": "756 Randolph Street, Omar, Rhode Island, 3310", |
> |     "company": "ESCHOIR",                                       |
> |     "email": "sparks.ramos@eschoir.co.uk",                      |
> |     "name": {                                                   |
> |       "first": "Sparks",                                        |
> |       "last": "Ramos"                                           |
> |     },                                                          |
> |     "phone": "+1 (962) 436-2519"                                |
> |   }                                                             |
> | }                                                               |
> +-----------------------------------------------------------------+
> ```

## Remove the successfully copied data files

After you verify that you successfully copied data from your stage into the tables,
you can remove data files from the internal stage using the [REMOVE](../../sql-reference/sql/remove.md)
command to save on [data storage](../cost-understanding-compute.md).

> ```sqlexample
> REMOVE @my_csv_stage PATTERN='.*.csv.gz';
> ```

Snowflake returns the following results:

> ```output
> +-------------------------------+---------+
> | name                          | result  |
> |-------------------------------+---------|
> | my_csv_stage/contacts1.csv.gz | removed |
> | my_csv_stage/contacts4.csv.gz | removed |
> | my_csv_stage/contacts2.csv.gz | removed |
> | my_csv_stage/contacts3.csv.gz | removed |
> | my_csv_stage/contacts5.csv.gz | removed |
> +-------------------------------+---------+
> ```
>
> ```sqlexample
> REMOVE @my_json_stage PATTERN='.*.json.gz';
> ```

Snowflake returns the following results:

> ```output
> +--------------------------------+---------+
> | name                           | result  |
> |--------------------------------+---------|
> | my_json_stage/contacts.json.gz | removed |
> +--------------------------------+---------+
> ```

## Clean up

Congratulations, you have successfully completed the tutorial.

### Tutorial clean up (optional)

Execute the following [DROP <object>](../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

> ```sqlexample
> DROP DATABASE IF EXISTS mydatabase;
> DROP WAREHOUSE IF EXISTS mywarehouse;
> ```

Dropping the database automatically removes all child database objects such as tables.

### Other data loading tutorials

* [Snowflake in 20 minutes](snowflake-in-20minutes.md)
* [Tutorial: Bulk loading from Amazon S3 using COPY](data-load-external-tutorial.md)

---
title: Tutorial: Bulk loading from Amazon S3 using COPY
source: https://docs.snowflake.com/en/user-guide/tutorials/data-load-external-tutorial.md
section: User Guide
---

Getting Started

# Tutorial: Bulk loading from Amazon S3 using COPY

## Introduction

This tutorial describes how to load data from files in an existing Amazon Simple Storage Service (Amazon S3) bucket into a table. In this tutorial, you will learn how to:

* Create named file formats that describe your data files.
* Create named stage objects.
* Load data located in your S3 bucket into Snowflake tables.
* Resolve errors in your data files.

The tutorial covers loading of both CSV and JSON data.

## Prerequisites

The tutorial assumes the following:

* You have a Snowflake account that is configured to use Amazon Web Services (AWS) and a user with a role that grants the necessary
  privileges to create a database, tables, and virtual warehouse objects.
* You have SnowSQL installed.

Refer to the [Snowflake in 20 minutes](snowflake-in-20minutes.md)
for instructions to meet these requirements.

Snowflake provides sample data files in a public Amazon S3 bucket for use in this tutorial.
But before you start, you need to create a database, tables, and a virtual warehouse for
this tutorial. These are the basic Snowflake objects needed for most Snowflake activities.

### About the sample data files

Snowflake provides sample data files staged in a public S3 bucket.

> **Note:**
>
> In regular use, you would stage your own data files using the AWS Management Console, AWS Command
> Line Interface, or an equivalent client application. See the
> [Amazon Web Services](https://aws.amazon.com/console/) documentation for instructions.

The sample data files include sample contact information in the following formats:

* CSV files that contain a header row and five records. The field delimiter is the pipe (`|`) character.
  The following example shows a header row and one record:

  > ```sqlexample
  > ID|lastname|firstname|company|email|workphone|cellphone|streetaddress|city|postalcode
  > 6|Reed|Moses|Neque Corporation|eget.lacus@facilisis.com|1-449-871-0780|1-454-964-5318|Ap #225-4351 Dolor Ave|Titagarh|62631
  > ```
* A single file in JSON format that contains one array and three objects.
  The following is an example of an array that contains one of the objects:

  > ```sqlexample
  > [
  >  {
  >    "customer": {
  >      "address": "509 Kings Hwy, Comptche, Missouri, 4848",
  >      "phone": "+1 (999) 407-2274",
  >      "email": "blankenship.patrick@orbin.ca",
  >      "company": "ORBIN",
  >      "name": {
  >        "last": "Patrick",
  >        "first": "Blankenship"
  >      },
  >      "_id": "5730864df388f1d653e37e6f"
  >    }
  >  },
  > ]
  > ```

### Create the database, tables, and warehouse

Execute the following statements to create a database, two tables (for csv and json data),
and a virtual warehouse needed for this tutorial. After you complete the tutorial, you can
drop these objects.

```sqlexample
CREATE OR REPLACE DATABASE mydatabase;

CREATE OR REPLACE TEMPORARY TABLE mycsvtable (
     id INTEGER,
     last_name STRING,
     first_name STRING,
     company STRING,
     email STRING,
     workphone STRING,
     cellphone STRING,
     streetaddress STRING,
     city STRING,
     postalcode STRING);

CREATE OR REPLACE TEMPORARY TABLE myjsontable (
     json_data VARIANT);

CREATE OR REPLACE WAREHOUSE mywarehouse WITH
     WAREHOUSE_SIZE='X-SMALL'
     AUTO_SUSPEND = 120
     AUTO_RESUME = TRUE
     INITIALLY_SUSPENDED=TRUE;
```

Note the following:

* The `CREATE DATABASE` statement creates a database. The database automatically includes a schema named ‘public’.
* The `CREATE TABLE` statements create target tables for CSV and JSON data. The tables are temporary, that is, they
  persist only for the duration of the user session and are not visible to other users.
* The `CREATE WAREHOUSE` statement creates an initially suspended warehouse. The
  statement also sets `AUTO_RESUME = true`, which starts the warehouse automatically when
  you execute SQL statements that require compute resources.

## Create file format objects

When you load data files from an S3 bucket into a table, you must describe the format of the file
and specify how the data in the file should be interpreted and processed. For example,
if you are loading pipe-delimited data from a CSV file, you must specify that the file
uses the CSV format with pipe symbols as delimiters.

When you execute the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command, you specify this format information. You can
either specify this information as options in the command (e.g.
`TYPE = CSV`, `FIELD_DELIMITER = '|'`, etc.) or you can specify a
file format object that contains this format information. You can create a named file
format object using the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command.

In this step, you create file format objects describing the data format of the sample CSV and
JSON data provided for this tutorial.

### Create a file format object for CSV data

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command
to create the `mycsvformat` file format.

```sqlexample
CREATE OR REPLACE FILE FORMAT mycsvformat
   TYPE = 'CSV'
   FIELD_DELIMITER = '|'
   SKIP_HEADER = 1;
```

Where:

* `TYPE = 'CSV'` indicates the source file format type. CSV is the default file format type.
* `FIELD_DELIMITER = '|'` indicates the ‘|’ character is a field separator. The default value is ‘,’.
* `SKIP_HEADER = 1` indicates the source file includes one header line. The COPY command skips these header lines when loading data. The default value is 0.

### Create a file format object for JSON data

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command to create
the `myjsonformat` file format.

> ```sqlexample
> CREATE OR REPLACE FILE FORMAT myjsonformat
>   TYPE = 'JSON'
>   STRIP_OUTER_ARRAY = TRUE;
> ```

Where:

* `TYPE = 'JSON'` indicates the source file format type.
* `STRIP_OUTER_ARRAY = TRUE` directs the COPY command to exclude the root brackets ([]) when loading data to the table.

## Create stage objects

A stage specifies where data files are stored (i.e. “staged”) so that the data
in the files can be loaded into a table.
A named [external stage](../data-load-overview.md)
is a cloud storage location managed by Snowflake.
An external stage references data files stored in a S3 bucket. In this case, we are creating a
stage that references the sample data files necessary to complete the tutorial.

Creating a named external stage is useful if you want multiple users or processes
to upload files. If you plan to stage data files to load only
by you, or to load only into a single table, then you may prefer
to use your user stage or the table stage. For information, see
[Bulk loading from Amazon S3](../data-load-s3.md).

In this step, you create named stages for the different types of sample data files.

### Create a stage for CSV data files

Execute CREATE STAGE to create the `my_csv_stage` stage:

```sqlexample
CREATE OR REPLACE STAGE my_csv_stage
  FILE_FORMAT = mycsvformat
  URL = 's3://snowflake-docs';
```

### Create a stage for JSON data files

Execute CREATE STAGE to create the `my_json_stage` stage:

```sqlexample
CREATE OR REPLACE STAGE my_json_stage
  FILE_FORMAT = myjsonformat
  URL = 's3://snowflake-docs';
```

> **Note:**
>
> In regular use, if you were creating a stage that pointed to your private data files, you would reference a storage integration created using [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md) by an account administrator (i.e. a user with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege:
>
> > ```sqlexample
> > CREATE OR REPLACE STAGE external_stage
> >   FILE_FORMAT = mycsvformat
> >   URL = 's3://private-bucket'
> >   STORAGE_INTEGRATION = myint;
> > ```

## Copy data into the target table

Execute [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) to load staged data into the target tables.

### CSV

To load the data from the sample CSV files:

1. Start by loading the data from one of the files
   in the `/tutorials/dataloading/` prefix (folder) named `contacts1.csv`
   in the `mycsvtable` table.
   Execute the following:

   ```sqlexample
   COPY INTO mycsvtable
     FROM @my_csv_stage/tutorials/dataloading/contacts1.csv
     ON_ERROR = 'skip_file';
   ```

   Where:

   * The `FROM` clause specifies the location of the staged data
     file (stage name followed by the file name).
   * The `ON_ERROR = 'skip_file'` clause specifies what to do when the COPY command encounters errors
     in the files. In this case, when the command encounters a data error on any of the records
     in a file, it skips the file. If you do not specify an ON_ERROR clause, the default
     is `abort_statement`, which aborts the COPY command on the first error
     encountered on any of the records in a file.

   The COPY command returns a result showing the name of the file copied and related information:

   ```output
   +---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   | file                                                    | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
   |---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
   | s3://snowflake-docs/tutorials/dataloading/contacts1.csv | LOADED |           5 |           5 |           1 |           0 |        NULL |             NULL |                  NULL |                    NULL |
   +---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   ```
2. Load the rest of the staged files in the `mycsvtable` table.

   The following example uses pattern matching to load data from files that match the
   regular expression `.*contacts[1-5].csv` into the `mycsvtable` table.

   ```sqlexample
   COPY INTO mycsvtable
     FROM @my_csv_stage/tutorials/dataloading/
     PATTERN='.*contacts[1-5].csv'
     ON_ERROR = 'skip_file';
   ```

   Where the `PATTERN` clause specifies that the command should load data
   from the filenames matching this regular expression `.*contacts[1-5].csv`.

   The COPY command returns a result showing the name of the file copied and
   related information:

   ```output
   +---------------------------------------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------+
   | file                                                    | status      | rows_parsed | rows_loaded | error_limit | errors_seen | first_error                                                                                                                                                          | first_error_line | first_error_character | first_error_column_name |
   |---------------------------------------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------|
   | s3://snowflake-docs/tutorials/dataloading/contacts2.csv | LOADED      |           5 |           5 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   | s3://snowflake-docs/tutorials/dataloading/contacts3.csv | LOAD_FAILED |           5 |           0 |           1 |           2 | Number of columns in file (11) does not match that of the corresponding table (10), use file format option error_on_column_count_mismatch=false to ignore this error |                3 |                     1 | "MYCSVTABLE"[11]        |
   | s3://snowflake-docs/tutorials/dataloading/contacts4.csv | LOADED      |           5 |           5 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   | s3://snowflake-docs/tutorials/dataloading/contacts5.csv | LOADED      |           6 |           6 |           1 |           0 | NULL                                                                                                                                                                 |             NULL |                  NULL | NULL                    |
   +---------------------------------------------------------+-------------+-------------+-------------+-------------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------+-----------------------+-------------------------+
   ```

   Note the following highlights in the result:

   > * The data in `contacts1.csv` is ignored because you already loaded
   >   the data successfully.
   > * The data in these files was loaded successfully:
   >   `contacts2.csv`, `contacts4.csv`, and
   >   `contacts5.csv`.
   > * The data in `contacts3.csv` was skipped due to 2 data errors.
   >   The next step in this tutorial addresses how to validate and fix
   >   the errors.

### JSON

Load the `contacts.json` staged data file into the `myjsontable` table.

> ```sqlexample
> COPY INTO myjsontable
>   FROM @my_json_stage/tutorials/dataloading/contacts.json
>   ON_ERROR = 'skip_file';
> ```

The COPY returns a result showing the name of the file copied and related information:

```output
+---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
| file                                                    | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
|---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
| s3://snowflake-docs/tutorials/dataloading/contacts.json | LOADED |           3 |           3 |           1 |           0 |        NULL |             NULL |                  NULL |                    NULL |
+---------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
```

## Resolve data load errors related to data issues

In the preceding step, the COPY INTO command skipped loading one of the files when
it encountered the first error. You need to find all the errors.
In this step, you use the [VALIDATE](../../sql-reference/functions/validate.md) function
to validate the previous execution of the COPY INTO command and return all errors.

### Validate the sample data files and retrieve any errors

You first need the retrieve query ID associated with the COPY INTO command
that you previously executed. You then call the `VALIDATE` function,
specifying the query ID.

1. Retrieve the query ID.

   1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   2. Make sure the role in Snowsight is the same as the role you are using
      in SnowSQL to run SQL statements for this tutorial.
   3. In the navigation menu, select Monitoring » Query History.
   4. Select the row for the specific COPY INTO command to open the query
      information pane.
   5. Copy the Query ID value.
2. Validate the COPY INTO command execution, represented by the query ID,
   and save errors to a new table named `save_copy_errors`.

   1. In SnowSQL, execute the following command. Replace `query_id` with the Query ID value.

      ```sqlexample
      CREATE OR REPLACE TABLE save_copy_errors AS SELECT * FROM TABLE(VALIDATE(mycsvtable, JOB_ID=>'<query_id>'));
      ```
   2. Query the `save_copy_errors` table.

      ```sqlexample
      SELECT * FROM SAVE_COPY_ERRORS;
      ```

      The query returns the following results:

      ```output
      +----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------+
      | ERROR                                                                                                                                                                | FILE                                | LINE | CHARACTER | BYTE_OFFSET | CATEGORY |   CODE | SQL_STATE | COLUMN_NAME                   | ROW_NUMBER | ROW_START_LINE | REJECTED_RECORD                                                                                                                                     |
      |----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------|
      | Number of columns in file (11) does not match that of the corresponding table (10), use file format option error_on_column_count_mismatch=false to ignore this error | mycsvtable/contacts3.csv.gz         |    3 |         1 |         234 | parsing  | 100080 |     22000 | "MYCSVTABLE"[11]              |          1 |              2 | 11|Ishmael|Burnett|Dolor Elit Pellentesque Ltd|vitae.erat@necmollisvitae.ca|1-872|600-7301|1-513-592-6779|P.O. Box 975, 553 Odio, Road|Hulste|63345 |
      | Field delimiter '|' found while expecting record delimiter '\n'                                                                                                      | mycsvtable/contacts3.csv.gz         |    5 |       125 |         625 | parsing  | 100016 |     22000 | "MYCSVTABLE"["POSTALCODE":10] |          4 |              5 | 14|Sophia|Christian|Turpis Ltd|lectus.pede@non.ca|1-962-503-3253|1-157-|850-3602|P.O. Box 824, 7971 Sagittis Rd.|Chattanooga|56188                  |
      +----------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+------+-----------+-------------+----------+--------+-----------+-------------------------------+------------+----------------+-----------------------------------------------------------------------------------------------------------------------------------------------------+
      ```

The result shows two data errors in `mycsvtable/contacts3.csv.gz`:

* `Number of columns in file (11) does not match that of the corresponding table (10)`

  In Row 1, a hyphen was mistakenly replaced with the pipe (`|`) character, the data file delimiter, effectively creating an additional column in the record.
* `Field delimiter '|' found while expecting record delimiter 'n'`

  In Row 5, an additional pipe (`|`) character was introduced after a hyphen, breaking the record.

### Fix the errors and load the data files again

In regular use, you would fix the problematic records manually and write them to a new data file.
You would then stage the fixed data files to the S3 bucket and attempt to reload the data from
the files. For this tutorial, you are using Snowflake provided sample data, which you do not correct.

### Verify the loaded data

Execute a [SELECT](../../sql-reference/sql/select.md) statement to verify that the data was loaded successfully.

#### CSV

> ```sqlexample
> SELECT * FROM mycsvtable;
> ```

The query returns the following results:

> ```output
> +----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+--------------------------------+------------------+------------+
> | ID | LAST_NAME | FIRST_NAME | COMPANY                          | EMAIL                                  | WORKPHONE      | CELLPHONE      | STREETADDRESS                  | CITY             | POSTALCODE |
> |----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+--------------------------------+------------------+------------|
> |  6 | Reed      | Moses      | Neque Corporation                | eget.lacus@facilisis.com               | 1-449-871-0780 | 1-454-964-5318 | Ap #225-4351 Dolor Ave         | Titagarh         |      62631 |
> |  7 | Audrey    | Franks     | Arcu Eu Limited                  | eu.dui@aceleifendvitae.org             | 1-527-945-8935 | 1-263-127-1173 | Ap #786-9241 Mauris Road       | Bergen           |      81958 |
> |  8 | Jakeem    | Erickson   | A Ltd                            | Pellentesque.habitant@liberoProinmi.ca | 1-381-591-9386 | 1-379-391-9490 | 319-1703 Dis Rd.               | Pangnirtung      |      62399 |
> |  9 | Xaviera   | Brennan    | Bibendum Ullamcorper Limited     | facilisi.Sed.neque@dictum.edu          | 1-260-757-1919 | 1-211-651-0925 | P.O. Box 146, 8385 Vel Road    | Béziers          |      13082 |
> | 10 | Francis   | Ortega     | Vitae Velit Egestas Associates   | egestas.rhoncus.Proin@faucibus.com     | 1-257-584-6487 | 1-211-870-2111 | 733-7191 Neque Rd.             | Chatillon        |      33081 |
> | 16 | Aretha    | Sykes      | Lobortis Tellus Justo Foundation | eget@Naminterdumenim.net               | 1-670-849-1866 | 1-283-783-3710 | Ap #979-2481 Dui. Av.          | Thurso           |      66851 |
> | 17 | Akeem     | Casey      | Pharetra Quisque Ac Institute    | dictum.eu@magna.edu                    | 1-277-657-0361 | 1-623-630-8848 | Ap #363-6074 Ullamcorper, Rd.  | Idar-Oberstei    |      30848 |
> | 18 | Keelie    | Mendez     | Purus In Foundation              | Nulla.eu.neque@Aeneanegetmetus.co.uk   | 1-330-370-8231 | 1-301-568-0413 | 3511 Tincidunt Street          | Lanklaar         |      73942 |
> | 19 | Lane      | Bishop     | Libero At PC                     | non@dapibusligula.ca                   | 1-340-862-4623 | 1-513-820-9039 | 7459 Pede. Street              | Linkebeek        |      89252 |
> | 20 | Michelle  | Dickson    | Ut Limited                       | Duis.dignissim.tempor@cursuset.org     | 1-202-490-0151 | 1-129-553-7398 | 6752 Eros. St.                 | Stornaway        |      61290 |
> | 20 | Michelle  | Dickson    | Ut Limited                       | Duis.dignissim.tempor@cursuset.org     | 1-202-490-0151 | 1-129-553-7398 | 6752 Eros. St.                 | Stornaway        |      61290 |
> | 21 | Lance     | Harper     | Rutrum Lorem Limited             | Sed.neque@risus.com                    | 1-685-778-6726 | 1-494-188-6168 | 663-7682 Et St.                | Gisborne         |      73449 |
> | 22 | Keely     | Pace       | Eleifend Limited                 | ante.bibendum.ullamcorper@necenim.edu  | 1-312-381-5244 | 1-432-225-9226 | P.O. Box 506, 5233 Aliquam Av. | Woodlands County |      61213 |
> | 23 | Sage      | Leblanc    | Egestas A Consulting             | dapibus@elementum.org                  | 1-630-981-0327 | 1-301-287-0495 | 4463 Lorem Road                | Woodlands County |      33951 |
> | 24 | Marny     | Holt       | Urna Nec Luctus Associates       | ornare@vitaeorci.ca                    | 1-522-364-3947 | 1-460-971-8360 | P.O. Box 311, 4839 Nulla Av.   | Port Coquitlam   |      36733 |
> | 25 | Holly     | Park       | Mauris PC                        | Vestibulum.ante@Maecenasliberoest.org  | 1-370-197-9316 | 1-411-413-4602 | P.O. Box 732, 8967 Eu Avenue   | Provost          |      45507 |
> |  1 | Imani     | Davidson   | At Ltd                           | nec@sem.net                            | 1-243-889-8106 | 1-730-771-0412 | 369-6531 Molestie St.          | Russell          |      74398 |
> |  2 | Kelsie    | Abbott     | Neque Sed Institute              | lacus@pede.net                         | 1-467-506-9933 | 1-441-508-7753 | P.O. Box 548, 1930 Pede. Road  | Campbellton      |      27022 |
> |  3 | Hilel     | Durham     | Pede Incorporated                | eu@Craspellentesque.net                | 1-752-108-4210 | 1-391-449-8733 | Ap #180-2360 Nisl. Street      | Etalle           |      84025 |
> |  4 | Graiden   | Molina     | Sapien Institute                 | sit@fermentum.net                      | 1-130-156-6666 | 1-269-605-7776 | 8890 A, Rd.                    | Dundee           |      70504 |
> |  5 | Karyn     | Howard     | Pede Ac Industries               | sed.hendrerit@ornaretortorat.edu       | 1-109-166-5492 | 1-506-782-5089 | P.O. Box 902, 5398 Et, St.     | Saint-Hilarion   |      26232 |
> +----+-----------+------------+----------------------------------+----------------------------------------+----------------+----------------+--------------------------------+------------------+------------+
> ```

#### JSON

> ```sqlexample
> SELECT * FROM myjsontable;
> ```

The query returns the following results:

> ```output
> +-----------------------------------------------------------------+
> | JSON_DATA                                                       |
> |-----------------------------------------------------------------|
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864df388f1d653e37e6f",                          |
> |     "address": "509 Kings Hwy, Comptche, Missouri, 4848",       |
> |     "company": "ORBIN",                                         |
> |     "email": "blankenship.patrick@orbin.ca",                    |
> |     "name": {                                                   |
> |       "first": "Blankenship",                                   |
> |       "last": "Patrick"                                         |
> |     },                                                          |
> |     "phone": "+1 (999) 407-2274"                                |
> |   }                                                             |
> | }                                                               |
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864d4d8523c8baa8baf6",                          |
> |     "address": "290 Lefferts Avenue, Malott, Delaware, 1575",   |
> |     "company": "SNIPS",                                         |
> |     "email": "anna.glass@snips.name",                           |
> |     "name": {                                                   |
> |       "first": "Anna",                                          |
> |       "last": "Glass"                                           |
> |     },                                                          |
> |     "phone": "+1 (958) 411-2876"                                |
> |   }                                                             |
> | }                                                               |
> | {                                                               |
> |   "customer": {                                                 |
> |     "_id": "5730864e375e08523150fc04",                          |
> |     "address": "756 Randolph Street, Omar, Rhode Island, 3310", |
> |     "company": "ESCHOIR",                                       |
> |     "email": "sparks.ramos@eschoir.co.uk",                      |
> |     "name": {                                                   |
> |       "first": "Sparks",                                        |
> |       "last": "Ramos"                                           |
> |     },                                                          |
> |     "phone": "+1 (962) 436-2519"                                |
> |   }                                                             |
> | }                                                               |
> +-----------------------------------------------------------------+
> ```

## Clean up

Congratulations, you have successfully completed the tutorial.

### Tutorial clean up (optional)

Execute the following [DROP <object>](../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

> ```sqlexample
> DROP DATABASE IF EXISTS mydatabase;
> DROP WAREHOUSE IF EXISTS mywarehouse;
> ```

Dropping the database automatically removes all child database objects such as tables.

### Other data loading tutorials

* [Snowflake in 20 minutes](snowflake-in-20minutes.md)
* [Tutorial: Bulk loading from a local file system using COPY](data-load-internal-tutorial.md)

---
title: Tutorial: Create and manage an organizational listing
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listing-tutorial.md
section: User Guide
---

# Tutorial: Create and manage an organizational listing

Organizational listings in Snowflake allow you to share data products securely within your organization, making it easier
for internal teams to discover and use trusted resources. As a provider, you can create listings that centralize access
to datasets, Native Apps, and other resources, simplifying data sharing and collaboration across your teams. This guide
will help you understand the steps and requirements to create and manage organizational listings effectively, ensuring
that your data products are accessible while maintaining control over who can see and use them.

Before you begin, make sure you have the necessary privileges to create and manage organizational listings.

In this tutorial, we create a custom role (ORG_LISTING_PROVIDER) to manage listings on behalf of the organization.

## Create a role to manage organizational listings

Switch to the ORGADMIN role (or ACCOUNTADMIN) to create a new role and add one or more users. These users will be the administrators
for organizational listings. Then GRANT the new role the required privileges to create and share organizational listings.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE ORG_LISTING_PROVIDER;
GRANT ROLE ORG_LISTING_PROVIDER TO USER <user_name>;
GRANT CREATE SHARE ON ACCOUNT TO ROLE ORG_LISTING_PROVIDER;
```

## Create a share and grant usage to it

Switch to the ORG_LISTING_PROVIDER custom role that you just created to create a share and grant usage to the share.

```sqlexample
USE ROLE ORG_LISTING_PROVIDER;
CREATE OR REPLACE DATABASE DEVORGDB;
USE DATABASE DEVORGDB;
CREATE SHARE ORG_SHARE SECURE_OBJECTS_ONLY=FALSE;
GRANT USAGE ON DATABASE DEVORGDB TO SHARE ORG_SHARE;
GRANT USAGE ON SCHEMA PUBLIC TO SHARE ORG_SHARE;
CREATE OR REPLACE TABLE TUTORIAL_TABLE ( item_id INT, item_name STRING );
GRANT SELECT ON TABLE DEVORGDB.PUBLIC.TUTORIAL_TABLE TO SHARE ORG_SHARE;
INSERT INTO TUTORIAL_TABLE (item_id, item_name) VALUES (1,'Tutorial table');
```

## Create an organizational listing

Create an organizational listings from the share with the required attributes included
in YAML (entered in $$ delimiters).

This example shares the listing with all accounts in the organization:

```sqlexample
USE ROLE ORG_LISTING_PROVIDER;
CREATE ORGANIZATION LISTING ORG_LISTING
SHARE ORG_SHARE AS
$$
title : "My title"
organization_profile: INTERNAL
organization_targets:
    access:
    - all_accounts : true
locations:
  access_regions:
  - name: "ALL"
auto_fulfillment:
  refresh_type: "SUB_DATABASE"
  refresh_schedule: "10 MINUTE"
$$;
```

For a complete list of all fields and values for an Organization listing see [Organization listing manifest reference](org-listing-manifest-reference.md).

## Alter an organizational listing

Alter the organizational listings by including any changes or additional attributes in the YAML.

> **Caution:**
>
> When altering an organizational listing you must include all the attributes from the original listing manifest.
> Failure to include all attributes can cause errors or the unexpected removal of existing attributes from the listing manifest.
> Snowflake recommends capturing the existing listing manifest with the [DESCRIBE LISTING](../../../../sql-reference/sql/desc-listing.md) command and then using the results as the input in the [ALTER LISTING](../../../../sql-reference/sql/alter-listing.md) command.

This example shares the listing with a single account and adds a description to the listing:

```sqlexample
USE ROLE ORG_LISTING_PROVIDER;
ALTER LISTING ORG_LISTING
AS
$$
title : "My title"
organization_profile: INTERNAL
organization_targets:
    access:
    - all_accounts : false
locations:
  access_regions:
  - name: "ALL"
auto_fulfillment:
  refresh_type: "SUB_DATABASE"
  refresh_schedule: "10 MINUTE"
$$;
```

## View a list of organizational listings

To view organizational listings, run the following command:

```sqlexample
SHOW LISTINGS;
DESCRIBE LISTING ORG_LISTING;
```

## (Optional) Add auto-fulfillment for organizational listings

To enable auto-fulfillment for your organizational listings, run the following commands:

> **Important:**
>
> Before you run the command to enable auto-fulfillment, check to see if it’s already enabled and note the current settings.
> If it’s already turned on, you don’t need to run the command.

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT MANAGE LISTING AUTO FULFILLMENT ON ACCOUNT TO ROLE ORG_LISTING_PROVIDER;

USE ROLE ORG_LISTING_PROVIDER;
SHOW ORGANIZATION ACCOUNTS;
SELECT SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT('<ORGACCOUNT>');

CALL SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('<ORGACCOUNT>');
```

## Clean up after the tutorial

To drop any unwanted objects you created during this tutorial, run one or more of the
following commands as needed:

> **Important:**
>
> If auto-fulfillment was enabled when you ran the last step, DO NOT disable it when you clean up after the query.
> Doing so will stop all auto-fulfillment on your account!

```sqlexample
DROP LISTING <organizational_listing_name>;
DROP SHARE org_listing1_share1;
DROP DATABASE org_listing_db1;
--CALL SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('ORGACCOUNT');
DROP ROLE ORG_LISTING_PROVIDER;
```

---
title: Tutorial: Create your first Apache Iceberg™ table
source: https://docs.snowflake.com/en/user-guide/tutorials/create-your-first-iceberg-table.md
section: User Guide
---

Snowflake

Iceberg

Data lake

# Tutorial: Create your first Apache Iceberg™ table

## Introduction

This tutorial covers how to create [Apache Iceberg™ tables](../tables-iceberg.md) that use Snowflake as the catalog
and support read and write operations. Iceberg tables for Snowflake combine the performance and query semantics
of regular Snowflake tables with external cloud storage that you manage.

Complete this tutorial using a worksheet in Snowsight or using a Snowflake client such as [SnowSQL](../snowsql.md).
You can copy and paste the code examples, and then run them.

### What you’ll learn

In this tutorial, you’ll learn how to do the following:

* Create and configure an [external volume](../tables-iceberg.md) for Snowflake-managed Iceberg tables.
  For demonstration purposes, the tutorial creates an external volume for Amazon S3.
* Create two Iceberg tables that use Snowflake as the Iceberg catalog (Snowflake-managed tables).
* Insert data into the Iceberg tables.
* Query the Iceberg tables.
* Delete rows from an Iceberg table.

### Prerequisites

Before you start, you should be familiar with the following:

* Snowflake [object identifiers](../../sql-reference/identifiers.md) and their requirements.
* Apache Iceberg and Iceberg tables in Snowflake. For more information, see [Apache Iceberg™ tables](../tables-iceberg.md).
* Cloud object storage.
* If using S3, you should be familiar with
  [AWS Identity and Access Management (IAM)](https://docs.aws.amazon.com/IAM/latest/UserGuide/introduction.html) and
  [IAM policy elements](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_policies_elements.html).

You need:

* A Snowflake user with a role that has the privileges to perform
  the following actions:

  + [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md)
  + [CREATE DATABASE](../../sql-reference/sql/create-database.md)
  + [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md)
  + [CREATE ICEBERG TABLE](../../sql-reference/sql/create-iceberg-table-snowflake.md)

  If using a 30-day trial account, you can log in as the user that was created for the account.
  This user has the role with the privileges needed to create the objects.

  If you don’t have a user with the necessary permissions, ask someone who does to create one for you.
  Users with the ACCOUNTADMIN role can create new users and grant them the required privileges.
* Administrator access for your cloud storage provider in order to configure an external volume.
* A storage bucket (or container) with the same cloud provider, in the same region that hosts your Snowflake account.

  > **Note:**
  >
  > Snowflake can’t support external volumes with S3 bucket names that contain dots (for example, `my.s3.bucket`).
  > S3 doesn’t support SSL for virtual-hosted-style buckets with dots in the name, and
  > Snowflake uses virtual-host-style paths and HTTPS to access data in S3.
* Access to the SNOWFLAKE_SAMPLE_DATA database in your account. Snowflake creates the sample database in new accounts by default.
  If the database has not been created in your account, see [Use the sample database](../sample-data-using.md).

## Set up a warehouse and database

Set up your environment by creating a warehouse and database for this tutorial.

```sqlexample
CREATE WAREHOUSE iceberg_tutorial_wh
  WAREHOUSE_TYPE = STANDARD
  WAREHOUSE_SIZE = XSMALL;

USE WAREHOUSE iceberg_tutorial_wh;

CREATE OR REPLACE DATABASE iceberg_tutorial_db;
USE DATABASE iceberg_tutorial_db;
```

## Create an external volume

Before you can create an Apache Iceberg™ table for Snowflake, you must have an external volume.
An external volume is an account-level Snowflake object that stores an identity and access management (IAM)
entity for your external cloud storage.

Snowflake uses the external volume to securely connect to your cloud storage to access table data and metadata.

For demonstration purposes, this step covers how to create an external volume for Amazon S3.
To create an external volume for a different cloud storage service, see the following topics:

* [Configure an external volume for Google Cloud Storage](../tables-iceberg-configure-external-volume-gcs.md)
* [Configure an external volume for Azure](../tables-iceberg-configure-external-volume-azure.md)

### Create an IAM policy that grants access to your S3 location

To configure access permissions for Snowflake in the AWS Management Console, do the following:

1. Log in to the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Account settings.
4. Under Security Token Service (STS) in the Endpoints list, find the Snowflake
   [region](../intro-regions.md) where your account is located. If the STS status is inactive,
   move the toggle to Active.
5. From the left-hand navigation pane, select Policies.
6. Select Create Policy.
7. For Policy editor, select JSON.
8. Add a policy to provide Snowflake with the required permissions to read and write data to your S3 location.

   The following example policy grants access to all locations in the specified bucket.

   > **Note:**
   > * Replace `my_bucket` with your actual bucket name. You can also specify a path in the bucket; for example, `my_bucket/path`.
   > * Setting the `"s3:prefix":` condition to `["*"]` grants access to all prefixes in the
   >   specified bucket; setting it to `["path/*"]` grants access to a specified path in the bucket.
   > * For buckets in [government regions](../intro-regions.md), the bucket ARNs use the `arn:aws-us-gov:s3:::` prefix.
   > * If you’re using an S3 access point, specify the access point ARN instead of a bucket ARN. For more information, see
   >   [Configuring IAM policies for using access points](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-policies.html).

   ```sqljson
   {
      "Version": "2012-10-17",
      "Statement": [
            {
               "Effect": "Allow",
               "Action": [
                  "s3:PutObject",
                  "s3:GetObject",
                  "s3:GetObjectVersion",
                  "s3:DeleteObject",
                  "s3:DeleteObjectVersion"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>/*"
            },
            {
               "Effect": "Allow",
               "Action": [
                  "s3:ListBucket",
                  "s3:GetBucketLocation"
               ],
               "Resource": "arn:aws:s3:::<my_bucket>",
               "Condition": {
                  "StringLike": {
                        "s3:prefix": [
                           "*"
                        ]
                  }
               }
            }
      ]
   }
   ```
9. Select Next.
10. Enter a Policy name (for example, `snowflake_access`) and an optional Description.
11. Select Create policy.

### Create an IAM role

Create an AWS IAM role to grant privileges on the S3 bucket containing your data files.

1. From the left-hand navigation pane in the Identity and Access Management (IAM) Dashboard, select Roles.
2. Select Create role.
3. For the trusted entity type, select AWS account.
4. Under An AWS account, select This account. In a later step,
   you modify the trust relationship and grant access to Snowflake.
5. Select the Require external ID option. Enter an
   [external ID](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html) of your choice.
   For example, `iceberg_table_external_id`.

   An external ID is used to grant access to your AWS resources (such as S3 buckets) to a third party like Snowflake.
6. Select Next.
7. Select the policy that you created for the external volume, then select Next.
8. Enter a Role name and description for the role, then select Create role.

   You have now created an IAM policy for an S3 location, created an IAM role, and attached the policy to the role.
9. Select View role to view the role summary page. Locate and record the ARN (Amazon Resource Name) value for the role.

### Create an external volume in Snowflake

Create an external volume using the [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md) command.
The following example creates an external volume named `iceberg_external_volume`
that defines a single Amazon S3 storage location with encryption.

```sqlexample
CREATE OR REPLACE EXTERNAL VOLUME iceberg_external_volume
   STORAGE_LOCATIONS =
      (
         (
            NAME = 'my-s3-us-west-2'
            STORAGE_PROVIDER = 'S3'
            STORAGE_BASE_URL = 's3://<my_bucket>/'
            STORAGE_AWS_ROLE_ARN = '<arn:aws:iam::123456789012:role/myrole>'
            STORAGE_AWS_EXTERNAL_ID = 'iceberg_table_external_id'
         )
      )
      ALLOW_WRITES = TRUE;
```

The example specifies the
external ID (`iceberg_table_external_id`) associated with the IAM role that you created for the external volume.
Specifying an external ID lets you use the same IAM role (and external ID) across multiple external volumes.

> **Note:**
>
> Specify ARNs exactly as provided by AWS. ARNs are case-sensitive.

### Retrieve the AWS IAM user for your Snowflake account

1. Retrieve the ARN for the AWS IAM user that was created automatically
   for your Snowflake account using the [DESCRIBE EXTERNAL VOLUME](../../sql-reference/sql/desc-external-volume.md) command.
   Specify the name of your external volume.

   The following example describes an external volume named `iceberg_external_volume`.

   ```sqlexample
   DESC EXTERNAL VOLUME iceberg_external_volume;
   ```
2. Record the value for the `STORAGE_AWS_IAM_USER_ARN` property, which is the AWS IAM user created for your Snowflake account;
   for example, `arn:aws:iam::123456789001:user/abc1-b-self1234`.

   Snowflake provisions a single IAM user for your entire Snowflake account. All S3 external volumes in your account use that IAM user.

   > **Note:**
   >
   > If you didn’t specify an external ID (`STORAGE_AWS_EXTERNAL_ID`) when you created an external volume,
   > Snowflake generates an ID for you to use. Record the value so that you can update your IAM role trust policy with the generated external ID.

### Grant the IAM user permissions to access bucket objects

In this step, you configure permissions that allow the IAM user for your Snowflake account to access objects in your S3 bucket.

1. Log in to the AWS Management Console.
2. From the home dashboard, search for and select IAM.
3. From the left-hand navigation pane, select Roles.
4. Select the IAM role that you created for your external volume.
5. Select the Trust relationships tab.
6. Select Edit trust policy.
7. Modify the policy document with the DESC EXTERNAL VOLUME output values that you recorded.

   **Policy document for IAM role**

   ```sqljson
   {
     "Version": "2012-10-17",
     "Statement": [
       {
         "Sid": "",
         "Effect": "Allow",
         "Principal": {
           "AWS": "<snowflake_user_arn>"
         },
         "Action": "sts:AssumeRole",
         "Condition": {
           "StringEquals": {
             "sts:ExternalId": "<iceberg_table_external_id>"
           }
         }
       }
     ]
   }
   ```

   Where:

   * `snowflake_user_arn` is the STORAGE_AWS_IAM_USER_ARN value you recorded.
   * `iceberg_table_external_id` is your external ID. If you *already* specified an external ID when you created the role, and used the same
     ID to create your external volume, leave the value as-is. Otherwise, update `sts:ExternalId` with the value that you recorded.
   > **Note:**
   >
   > You must update this policy document if you create a new external volume (or recreate an existing external volume using the CREATE OR
   > REPLACE EXTERNAL VOLUME syntax) and don’t provide your own external ID.
   > For security reasons, a new or recreated external volume has a different external ID and cannot
   > resolve the trust relationship unless you update this trust policy.
8. Select Update policy to save your changes.

## Create a table

In this step, you’ll create two Apache Iceberg™ tables: one with the standard CREATE ICEBERG TABLE syntax, and another with the
CREATE ICEBERG TABLE … AS SELECT variant. Both tables use the external volume configured in the previous step.

You’ll also learn how to set the Iceberg catalog and external volume at the database level.

### Create a table using the standard syntax

First, create an Iceberg table using the standard CREATE ICEBERG TABLE syntax.

Specify `CATALOG = 'SNOWFLAKE'` so that the table uses Snowflake as the
Iceberg catalog.

To tell Snowflake where to write table data and metadata, specify a value for the `BASE_LOCATION` parameter.
The example sets the table name (`customer_iceberg`) as the `BASE_LOCATION`. This way,
Snowflake writes data and metadata under a directory that includes the table name in your external volume
location.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE customer_iceberg (
    c_custkey INTEGER,
    c_name STRING,
    c_address STRING,
    c_nationkey INTEGER,
    c_phone STRING,
    c_acctbal INTEGER,
    c_mktsegment STRING,
    c_comment STRING
)
    CATALOG = 'SNOWFLAKE'
    EXTERNAL_VOLUME = 'iceberg_external_volume'
    BASE_LOCATION = 'customer_iceberg';
```

Later in the tutorial, you load data into this table from the `snowflake_sample_data.tpch_sf1.customer` table
in the [SNOWFLAKE_SAMPLE_DATA](../sample-data-using.md) database. The column definitions in the CREATE ICEBERG TABLE statement match the sample table.

> **Note:**
>
> If you check your cloud storage location, you should now see a directory named `metadata/` that Snowflake wrote during table
> creation under your `BASE_LOCATION`. The directory stores the metadata files for your table.

### Set the catalog integration and external volume for the database

Next, set the `CATALOG` and `EXTERNAL_VOLUME` parameters for the `iceberg_tutorial_db` that you created in this tutorial.
Setting the parameters tells Snowflake to use the specific catalog and external volume that you choose for *all* Iceberg tables created after the change.

```sqlexample
ALTER DATABASE iceberg_tutorial_db SET CATALOG = 'SNOWFLAKE';
ALTER DATABASE iceberg_tutorial_db SET EXTERNAL_VOLUME = 'iceberg_external_volume';
```

To verify, check the parameters for the current database (`iceberg_tutorial_db`):

```sqlexample
SHOW PARAMETERS IN DATABASE ;
```

### Create a table using CTAS

Finally, create a second Iceberg table called `nation_iceberg` using the CREATE ICEBERG TABLE … AS SELECT syntax.
We’ll base the new table on the `snowflake_sample_data.tpch_sf1.nation` table in the
[Snowflake sample database](../sample-data-using.md).

> **Note:**
>
> Since you just set the `CATALOG` and `EXTERNAL_VOLUME` parameters for the `iceberg_tutorial_db` database,
> you can omit both parameters from the CREATE ICEBERG TABLE statement.
> The `nation_iceberg` table will inherit the values from the database.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE nation_iceberg (
  n_nationkey INTEGER,
  n_name STRING
)
  BASE_LOCATION = 'nation_iceberg'
  AS SELECT
    N_NATIONKEY,
    N_NAME
  FROM snowflake_sample_data.tpch_sf1.nation;
```

## Load data and query the tables

In this step, you start by loading data from the [Snowflake sample database](../sample-data-using.md)
into the `customer_iceberg` table using `INSERT INTO <table>`:

```sqlexample
INSERT INTO customer_iceberg
  SELECT * FROM snowflake_sample_data.tpch_sf1.customer;
```

> **Note:**
>
> If you check your cloud storage location, you should now see a directory that contains your table data files.

Now that there’s data in the table, you can query the table.
The following query joins the `customer_iceberg` table with the `nation_iceberg` table (which already contains data).

```sqlexample
SELECT
    c.c_name AS customer_name,
    c.c_mktsegment AS market_segment,
    n.n_name AS nation
  FROM customer_iceberg c
  INNER JOIN nation_iceberg n
    ON c.c_nationkey = n.n_nationkey
  LIMIT 15;
```

Output:

```output
+--------------------+----------------+----------------+
| CUSTOMER_NAME      | MARKET_SEGMENT | NATION         |
|--------------------+----------------+----------------|
| Customer#000015001 | HOUSEHOLD      | MOROCCO        |
| Customer#000015002 | BUILDING       | VIETNAM        |
| Customer#000015003 | BUILDING       | INDONESIA      |
| Customer#000015004 | FURNITURE      | SAUDI ARABIA   |
| Customer#000015005 | HOUSEHOLD      | KENYA          |
| Customer#000015006 | BUILDING       | UNITED KINGDOM |
| Customer#000015007 | MACHINERY      | FRANCE         |
| Customer#000015008 | HOUSEHOLD      | INDIA          |
| Customer#000015009 | FURNITURE      | EGYPT          |
| Customer#000015010 | HOUSEHOLD      | ETHIOPIA       |
| Customer#000015011 | FURNITURE      | UNITED KINGDOM |
| Customer#000015012 | BUILDING       | FRANCE         |
| Customer#000015013 | FURNITURE      | SAUDI ARABIA   |
| Customer#000015014 | HOUSEHOLD      | KENYA          |
| Customer#000015015 | MACHINERY      | ROMANIA        |
+--------------------+----------------+----------------+
```

## Delete rows

In this step, you use a [DELETE](../../sql-reference/sql/delete.md) statement to remove specific rows from the
`customer_iceberg` table.

Start by querying the first 10 rows of the table and notice that four rows belong to the `AUTOMOBILE` market segment:

```sqlexample
SELECT
    c_name AS customer_name,
    c_mktsegment AS market_segment
  FROM customer_iceberg
  LIMIT 10;
```

Output:

```output
+--------------------+----------------+
| CUSTOMER_NAME      | MARKET_SEGMENT |
|--------------------+----------------|
| Customer#000000001 | BUILDING       |
| Customer#000000002 | AUTOMOBILE     |
| Customer#000000003 | AUTOMOBILE     |
| Customer#000000004 | MACHINERY      |
| Customer#000000005 | HOUSEHOLD      |
| Customer#000000006 | AUTOMOBILE     |
| Customer#000000007 | AUTOMOBILE     |
| Customer#000000008 | BUILDING       |
| Customer#000000009 | FURNITURE      |
| Customer#000000010 | HOUSEHOLD      |
+--------------------+----------------+
```

Next, let’s use a DELETE statement to remove all of the rows from the table where the market segment is `AUTOMOBILE`:

```sqlexample
DELETE FROM customer_iceberg WHERE c_mktsegment = 'AUTOMOBILE';
```

Output:

```output
+------------------------+
| number of rows deleted |
|------------------------|
|                  29752 |
+------------------------+
```

Finally, you can double-check that the rows are gone:

```sqlexample
SELECT
    c_name AS customer_name,
    c_mktsegment AS market_segment
 FROM customer_iceberg
 WHERE c_mktsegment = 'AUTOMOBILE';
```

Output:

```output
+---------------+----------------+
| CUSTOMER_NAME | MARKET_SEGMENT |
|---------------+----------------|
+---------------+----------------+
0 Row(s) produced. Time Elapsed: 1.426s
```

Congratulations!

You’ve just written to, read from, and modified your first Snowflake-managed Iceberg tables. You’ve also learned how to configure an
external volume for Iceberg table storage and set the Iceberg catalog and external volume for all Iceberg tables in a database.

## Clean up

To delete all of the objects created for this tutorial, run the following DROP statements.

Replace the following values:

* `my_other_database` with the name of a database to use so that you can drop the one created for this tutorial.
* `my_other_warehouse` with the name of the external volume that you created.

```sqlexample
DROP ICEBERG TABLE customer_iceberg;
DROP ICEBERG TABLE nation_iceberg;
DROP EXTERNAL VOLUME iceberg_external_volume;
USE DATABASE <my_other_database>;
DROP DATABASE iceberg_tutorial_db;
USE WAREHOUSE <my_other_warehouse>;
DROP WAREHOUSE iceberg_tutorial_wh;
```

## Summary and additional resources

In this tutorial, you followed an end-to-end workflow for creating and using Snowflake-managed Apache Iceberg™ tables.

Along the way, you completed the following tasks:

* **Created an external volume for Iceberg tables**.
  For more information about external volumes and Iceberg table storage, see [Configure an external volume](../tables-iceberg-configure-external-volume.md).
* **Created a Snowflake-managed Iceberg table** using sample data from the Snowflake sample database.
  For related information, see the following topics:

  + [Catalog options](../tables-iceberg.md).
  + For more information about loading data into tables, see [Load data into Apache Iceberg™ tables](../tables-iceberg-load.md) and [Load data into Snowflake](../../guides-overview-loading-data.md).
* **Set the Iceberg catalog and external volume for a database**.
  For more information about setting these parameters, see the following topics:

  + [Set a default catalog at the account, database, or schema level](../tables-iceberg-configure-catalog-integration.md)
  + [Set a default external volume at the account, database, or schema level](../tables-iceberg-configure-external-volume.md)
* **Loaded data into, queried, and deleted rows from Iceberg tables**.
  For more information about managing an Iceberg table and its data, see [Load data into Apache Iceberg™ tables](../tables-iceberg-load.md) and
  [Manage Apache Iceberg™ tables](../tables-iceberg-manage.md).

To learn more about Iceberg tables for Snowflake, see the [Iceberg tables documentation](../tables-iceberg.md).
For additional Iceberg tutorials and quickstarts, see the [Snowflake tutorials](https://docs.snowflake.com/en/tutorials) page.

---
title: Tutorial: Get started with budgets
source: https://docs.snowflake.com/en/user-guide/tutorials/budgets.md
section: User Guide
---

Snowflake

Cost Monitoring

Getting Started

Audit

# Tutorial: Get started with budgets

## Introduction

This tutorial introduces you to account-level credit usage monitoring with Budgets by setting up the account budget
and creating a custom budget that monitors a group of specified objects.

With budgets, you can monitor credit usage for the compute costs of supported objects, including credit usage for
background maintenance tasks and serverless features. Budgets enables you to set a monthly spending limit for each budget
and sends a notification email when your current spending is projected to exceed the monthly spending limit.

You can complete this tutorial using a worksheet in Snowsight or using a CLI client such as [SnowSQL](../snowsql.md).
Some portions of this tutorial can be completed using Snowsight.

By the end of this tutorial, you will learn how to do the following:

* Create custom roles to monitor and manage budgets.
* Grant the required privileges to add objects to a custom budget.
* Activate and set up an account budget.
* Create a custom budget and add objects to it.

### Prerequisites

To complete this tutorial, the following prerequisites are required:

* You must be able to use the ACCOUNTADMIN role to create the roles used in this tutorial.
* You must [verify your email address](../notifications/email-notifications.md). Only
  verified email addresses can be added to a budget notification list.

## Create a notification integration

Budgets use a notification integration to send notification emails when current credit usage is expected to exceed the monthly
spending limit. The `ALLOWED_RECIPIENTS` list *must* include the verified email addresses
of the users to receive budgets notifications.

A notification integration is required if you are completing the tutorial using SQL. Follow the steps below to create one.

When you use Snowsight to set up a budget, the notification integration is automatically
created for you. If you are going to use Snowsight to set up your budgets, you can skip to the next step.

1. Execute the following statement to create a notification integration. Use your verified email address in the ALLOWED_RECIPIENTS
   list:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE NOTIFICATION INTEGRATION budgets_notification_integration
     TYPE=EMAIL
     ENABLED=TRUE
     ALLOWED_RECIPIENTS=('<YOUR_EMAIL_ADDRESS>');
   ```
2. After you create the notification integration, grant the USAGE privilege to the SNOWFLAKE application. This privilege
   is required in order for Budgets to use the notification integration to send emails.

   Execute the following statement to grant the USAGE privilege on the notification integration:

   ```sqlexample
   GRANT USAGE ON INTEGRATION budgets_notification_integration
     TO APPLICATION snowflake;
   ```

## Create a database, schema, and custom roles

In this step, the following objects are created for the tutorial to create, manage, and monitor budgets:

* A database and schema in which to create custom budgets.
* A custom role to manage the account budget.
* A custom role to monitor the account budget.
* A custom role to create custom budgets.

1. Create a database and schema in which to create a custom budget using the following steps:

   SQLSnowsight

   1. Create the database and schema in which to create the custom budget:

      ```sqlexample
      USE ROLE ACCOUNTADMIN;

      CREATE DATABASE budgets_db;

      CREATE SCHEMA budgets_db.budgets_schema;
      ```

   1. Create the database and schema in which to create the custom budget:

      1. Sign in to [Snowsight](../ui-snowsight-gs.md).
      2. Switch to the ACCOUNTADMIN role.
      3. In the navigation menu, select Catalog » Database Explorer, and then select + Database.
      4. In the Name field, enter `budgets_db`.
      5. Select Create.
      6. After the database is created, select the `budgets_db`.
      7. Select Schemas » + Schema.
      8. In the Name field, enter `budgets_schema`.
      9. Select Create.
2. Create custom role `account_budget_admin` for the account budget administrator. The account budget administrator
   can take the following actions on the account budget:

   * Activate and deactivate the account budget.
   * Set the spending limit.
   * Edit notification settings.
   * Monitor credit usage for the account.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE account_budget_admin;

   GRANT APPLICATION ROLE SNOWFLAKE.BUDGET_ADMIN TO ROLE account_budget_admin;

   GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE TO ROLE account_budget_admin;
   ```
3. Create custom role `account_budget_monitor` to be granted to account budget monitors. An account budget monitor
   can take the following actions on the account budget:

   * Monitor credit usage for the account.
   * View the email notification settings.
   * View the monthly spending limit for the account.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE account_budget_monitor;

   GRANT APPLICATION ROLE SNOWFLAKE.BUDGET_VIEWER TO ROLE account_budget_monitor;

   GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE TO ROLE account_budget_monitor;
   ```
4. Create a custom role `budget_owner` with the required role and privileges to create custom
   budgets in the schema `budgets_db.budgets_schema`:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE budget_owner;

   GRANT USAGE ON DATABASE budgets_db TO ROLE budget_owner;
   GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO ROLE budget_owner;

   GRANT DATABASE ROLE SNOWFLAKE.BUDGET_CREATOR TO ROLE budget_owner;

   GRANT CREATE SNOWFLAKE.CORE.BUDGET ON SCHEMA budgets_db.budgets_schema
     TO ROLE budget_owner;
   ```
5. Create two custom roles to manage and monitor custom budgets. These roles will be granted additional privileges later in the
   tutorial after the custom budget is created. To create the custom roles, follow these steps:

   1. Create a custom `budget_admin` role that can manage and monitor a custom budget:

      ```sqlexample
      USE ROLE ACCOUNTADMIN;

      CREATE ROLE budget_admin;

      GRANT USAGE ON DATABASE budgets_db TO ROLE budget_admin;

      GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO ROLE budget_admin;

      GRANT DATABASE ROLE SNOWFLAKE.USAGE_VIEWER TO ROLE budget_admin;
      ```
   > 1. Create a custom `budget_monitor` role that can monitor a custom budget:
   > > ```sqlexample
   > > USE ROLE ACCOUNTADMIN;
   > >
   > > CREATE ROLE budget_monitor;
   > >
   > > GRANT USAGE ON DATABASE budgets_db TO ROLE budget_monitor;
   > >
   > > GRANT USAGE ON SCHEMA budgets_db.budgets_schema TO ROLE budget_monitor;
   > >
   > > GRANT DATABASE ROLE SNOWFLAKE.USAGE_VIEWER TO ROLE budget_monitor;
   > > ```
6. Grant custom budget roles to yourself to use in future steps of the tutorial:

   > SQLSnowsight
   >
   > 1. Grant the `account_budget_admin` role to yourself:
   >
   >    ```sqlexample
   >    GRANT ROLE account_budget_admin
   >      TO USER <YOUR_USER_NAME>;
   >    ```
   > 2. Grant the `account_budget_monitor` role to yourself:
   >
   >    ```sqlexample
   >    GRANT ROLE account_budget_monitor
   >      TO USER <YOUR_USER_NAME>;
   >    ```
   > 3. Grant the `budget_owner` role to yourself:
   >
   >    ```sqlexample
   >    GRANT ROLE budget_owner
   >      TO USER <YOUR_USER_NAME>;
   >    ```
   > 4. Grant the `budget_monitor` role to yourself:
   >
   >    ```sqlexample
   >    GRANT ROLE budget_monitor
   >      TO USER <YOUR_USER_NAME>;
   >    ```
   >
   > Grant custom budget roles to yourself:
   >
   > 1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   > 2. Switch to the ACCOUNTADMIN role.
   > 3. In the navigation menu, select Governance & security » Users & roles, and then select Roles.
   > 4. Select Table and locate and select the role `account_budget_admin`.
   > 5. In the section 0 users have been granted ACCOUNT_BUDGET_ADMIN, select Grant to User.
   > 6. For User to receive grant, select your username to grant the role to.
   > 7. Select Grant.
   > 8. After the role is granted, return to the previous page.
   > 9. Select the role `account_budget_monitor`.
   > 10. In the section 0 users have been granted ACCOUNT_BUDGET_MONITOR, select Grant to User.
   > 11. Select Grant.
   > 12. Repeat the previous four steps (h-k) to grant yourself the following additional roles:
   >
   >     * `budget_owner`
   >     * `budget_monitor`

In this section, you created custom roles to manage and monitor budgets, and create custom budgets.

## Create the objects for the custom budget

In this step, create objects to add to a custom budget and grant privileges to the custom roles you created in the previous
step. You will be creating the following objects:

* A warehouse to add to a custom budget.
* A database to add to a custom budget.

1. Create a warehouse and grant the USAGE and APPLYBUDGET privileges on the warehouse to the custom roles you created.
   The APPLYBUDGET privilege is required to add an object to a budget.

   > SQLSnowsight
   >
   > 1. Create warehouse `na_finance_wh`:
   >
   >    ```sqlexample
   >    CREATE WAREHOUSE na_finance_wh;
   >    ```
   > 2. Grant the USAGE privilege to custom budget roles:
   >
   >    ```sqlexample
   >    GRANT USAGE ON WAREHOUSE na_finance_wh TO ROLE account_budget_admin;
   >    GRANT USAGE ON WAREHOUSE na_finance_wh TO ROLE account_budget_monitor;
   >    GRANT USAGE ON WAREHOUSE na_finance_wh TO ROLE budget_admin;
   >    GRANT USAGE ON WAREHOUSE na_finance_wh TO ROLE budget_owner;
   >    GRANT USAGE ON WAREHOUSE na_finance_wh TO ROLE budget_monitor;
   >    ```
   > 3. Grant the APPLYBUDGET privilege on the warehouse to role `budget_owner`:
   >
   >    ```sqlexample
   >    GRANT APPLYBUDGET ON WAREHOUSE na_finance_wh TO ROLE budget_owner;
   >    ```
   >
   > 1. Create warehouse `na_finance_wh`:
   >
   >    1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   >    2. In the navigation menu, select Compute » Warehouses » + Warehouse.
   >    3. In the Warehouse Name field, enter `na_finance_wh`.
   >    4. Select Create Warehouse.
   > 2. Grant the USAGE privilege on the warehouse to custom roles, `account_budget_admin` and `budget_admin`:
   >
   >    1. In the navigation menu, select Compute » Warehouses.
   >    2. Select warehouse you just created `na_finance_wh`.
   >    3. In the Privileges tile, select + Privilege.
   >    4. For the Role, select the `account_budget_admin` role.
   >    5. For the Privileges, select USAGE.
   >    6. Select Grant Privileges.
   >    7. Repeat the previous 4 steps for the role `budget_admin`.
   > 3. Grant the USAGE and APPLYBUDGET privileges on the warehouse to role `budget_owner`:
   >
   >    1. In the navigation menu, select Compute » Warehouses.
   >    2. Select warehouse you just created `na_finance_wh`.
   >    3. In the Privileges tile, select + Privilege.
   >    4. For the Role, select the `budget_owner` role.
   >    5. For the Privileges, select APPLYBUDGET and USAGE.
   >    6. Select Grant Privileges.
2. Create a database and grant the APPLYBUDGET privilege on the warehouse to the custom budget owner role you created.
   The APPLYBUDGET privilege is required to add an object to a budget.

   > SQLSnowsight
   >
   > 1. Create a database:
   >
   >    ```sqlexample
   >    CREATE DATABASE na_finance_db;
   >    ```
   > 2. Grant the APPLYBUDGET privilege on the database to role `budget_owner`:
   >
   >    ```sqlexample
   >    GRANT APPLYBUDGET ON DATABASE  na_finance_db TO ROLE budget_owner;
   >    ```
   >
   > 1. Create a database:
   >
   >    1. Sign in to [Snowsight](../ui-snowsight-gs.md).
   >    2. In the navigation menu, select Catalog » Database Explorer, and then select + Database.
   >    3. In the Name field, enter `na_finance_db`.
   >    4. Select Create.
   > 2. Grant the APPLYBUDGET privilege on the database to role `budget_owner`:
   >
   >    1. In the navigation menu, select Catalog » Database Explorer.
   >    2. Select the database you just created `na_finance_db`.
   >    3. In the Privileges tile, select + Privilege.
   >    4. For the Role, select the `budget_owner` role.
   >    5. For the Privileges, select APPLYBUDGET.
   >    6. Select Grant Privileges.

In this section, you created the objects to be added to a custom budget and granted the APPLYBUDGET privilege required to add those objects
to a budget. You also created the database and schema in which to create the custom budget and granted the USAGE privilege required to
create a budget in the schema. Now you are ready to activate, create, and set up budgets.

## Activate and set up the account budget

The account budget monitors credit usage for the compute costs of all Budgets supported objects in the account, including
background maintenance tasks (for example, automatic clustering) and serverless features. The account budget must be
activated before it can start monitoring credit usage. After it is activated, you
can set the monthly spending limit for the account and the email list of notification recipients. Budgets sends
a notification email when current credit usage is expected to exceed the monthly spending limit.

Activate and set up the account budget using the following steps:

SQLSnowsight

1. Use the `account_budget_admin` role you created in a previous step to activate the account budget:

   ```sqlexample
   USE ROLE account_budget_admin;

   CALL snowflake.local.account_root_budget!ACTIVATE();
   ```
2. Set the spending limit for the account budget to 500 credits per month:

   ```sqlexample
   CALL snowflake.local.account_root_budget!SET_SPENDING_LIMIT(500);
   ```
3. To set up the email notification list, use your verified email address and the notification integration you
   created earlier in the tutorial:

   ```sqlexample
   CALL snowflake.local.account_root_budget!SET_EMAIL_NOTIFICATIONS(
      'budgets_notification_integration',
      '<YOUR_EMAIL_ADDRESS>');
   ```

Activate and set up the account budget:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Select the ACCOUNT_BUDGET_ADMIN role you created in a previous step.
3. In the navigation menu, select Admin » Cost management.
4. Select Budgets.
5. If prompted, select `na_finance_wh` for the warehouse.
6. In the upper-right corner of the dashboard, select Set up Account Budget.
7. Enter 500 for the spending limit for the account.

   To help you set your monthly spending limit, the configuration tool displays your projected spend for the month
   and your average monthly spend for the previous 3 months. For example, see the screenshot below.
8. Enter your email address to receive notification emails.
9. Select Finish Setup.

In this section, you activated the account budget and set the spending limit and the email address to receive budget notifications.

## Create a custom budget

Now that you have activated and set up your account budget, create a custom budget to monitor the credit usage
in your account for a specified group of objects. In this tutorial, you’ll:

* Use the `budget_owner` role to create a custom budget `na_finance_budget` in `budgets_db.budgets_schema`.
* Set the monthly spending limit and email notification list for the budget.
* Add the `na_finance_wh` warehouse and `na_finance_db` database to the custom budget.

To create the custom budget, complete the following steps:

SQLSnowsight

1. Create the custom budget:

   ```sqlexample
   USE ROLE budget_owner;
   USE SCHEMA budgets_db.budgets_schema;
   USE WAREHOUSE na_finance_wh;

   CREATE SNOWFLAKE.CORE.BUDGET na_finance_budget();
   ```
2. Set the monthly spending limit to 500 credits:

   ```sqlexample
   CALL na_finance_budget!SET_SPENDING_LIMIT(500);
   ```
3. To set up the notification list, use your verified email address and the notification integration created in the first
   step of the tutorial:

   ```sqlexample
   CALL na_finance_budget!SET_EMAIL_NOTIFICATIONS('budgets_notification_integration',
                                                  '<YOUR_EMAIL_ADDRESS>');
   ```
4. Add database `na_finance_db` and warehouse `na_finance_wh` to budget `na_finance_budget`:

   ```sqlexample
   CALL na_finance_budget!ADD_RESOURCE(
     SYSTEM$REFERENCE('database', 'na_finance_db', 'SESSION', 'applybudget'));

   CALL na_finance_budget!ADD_RESOURCE(
     SYSTEM$REFERENCE('warehouse', 'na_finance_wh', 'SESSION', 'applybudget'));
   ```

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Select the BUDGET_OWNER role you created in a previous step.
3. In the navigation menu, select Admin » Cost management.
4. Select Budgets.
5. Select + Budget.
6. On the Basic Information page, complete the following steps:

   1. From the Location to store drop-down, select `budgets_db` » `budgets_schema`.
   2. In the Name field, specify `na_finance_budget`.
   3. In the Budget (credits per month) field, specify `500`.
   4. In the Notify field, enter your email address to receive notification emails.
   5. Select Next.
7. On the Budget scope page, complete the following steps:

   1. Expand the Resources drop-down.

      > **Note:**
      >
      > If you are directly adding individual objects, you can only add an object to one custom budget. In this case, if an object is currently
      > included in one custom budget and you add that object to a second custom budget, Budgets removes the object from the first custom budget
      > without issuing a warning.
      >
      > This behavior does not apply to using tags to add objects to budgets; an object with one or more tags can be
      > included in multiple custom budgets if you are using tags to add the object to the budgets.
   2. Select the search field.
   3. Select Databases, then select `na_finance_db`.

      When you select a database, all the Budgets supported objects the database
      contains are also selected. Additionally, any future objects created in the database are automatically
      added to the budget.
   4. Select Warehouses, then select `na_finance_wh`.
   5. Select Done.
8. Select Create.

To grant instance roles to the custom roles you created in a previous step, complete the following steps:

> 1. Grant the required roles and privileges to the `budget_admin` role to let the `budget_admin`
>    role modify and monitor the custom budget `na_finance_budget`:
>
>    ```sqlexample
>    USE ROLE budget_owner;
>
>    GRANT SNOWFLAKE.CORE.BUDGET ROLE budgets_db.budgets_schema.na_finance_budget!ADMIN
>      TO ROLE budget_admin;
>    ```
> 2. Grant the VIEWER instance role to the `budget_monitor` role to let the `budget_monitor`
>    role monitor the custom budget `na_finance_budget`:
>
>    ```sqlexample
>    USE ROLE budget_owner;
>
>    GRANT SNOWFLAKE.CORE.BUDGET ROLE budgets_db.budgets_schema.na_finance_budget!VIEWER
>      TO ROLE budget_monitor;
>    ```

In this section, you created a custom budget, added objects for the budget to monitor, and set up the email address
to receive budget notifications.

## Monitoring credit usage

You have completed all the steps in the tutorial to activate your account budget, create a custom budget, and create custom
roles to monitor and manage both account and custom budgets. Credit usage data for your budgets takes some time to populate.

Budgets uses serverless tasks to collect credit usage data for the budgets in your account. After you activate the account
budget or create a custom budget, it takes a while for the serverless task to execute. After credit usage data becomes
available, you can monitor credit usage for budgets using Snowsight.

To monitor credit usage after usage data becomes available, use the following steps:

SQLSnowsight

Use the `account_budget_monitor` role created in a previous step and view the spending history for the account budget
in the past week by executing the following statements:

```sqlexample
USE ROLE account_budget_monitor;

CALL snowflake.local.account_root_budget!GET_SPENDING_HISTORY(
  TIME_LOWER_BOUND => DATEADD('days', -7, CURRENT_TIMESTAMP()),
  TIME_UPPER_BOUND => CURRENT_TIMESTAMP()
);
```

You can monitor spending history by service type. To view the spending history for the search optimization serverless feature
for the account budget in the past week, execute the following statement:

```sqlexample
USE ROLE account_budget_monitor;

SELECT *
   FROM table(snowflake.local.account_root_budget!GET_SERVICE_TYPE_USAGE_V2(
         '2025-05', '2025-12'))
   WHERE service_type = 'SEARCH_OPTIMIZATION';
```

Use the `budget_monitor` role to view the spending history for the past week for custom budget `na_finance_budget`:

```sqlexample
USE ROLE budget_monitor;

CALL budgets_db.budgets_schema.na_finance_budget!GET_SPENDING_HISTORY(
  TIME_LOWER_BOUND => DATEADD('days', -7, CURRENT_TIMESTAMP()),
  TIME_UPPER_BOUND => CURRENT_TIMESTAMP()
);
```

Use the `account_budget_monitor` role to view spending history for the account budget:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Select the ACCOUNT_BUDGET_MONITOR role you created in a previous step.
3. In the navigation menu, select Admin » Cost management.
4. Select Budgets.
5. If prompted, select the `na_finance_wh`.

Use the `budget_monitor` role to view spending history for the `na_finance_budget` custom budget:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Select the BUDGET_MONITOR role you created in a previous step.
3. In the navigation menu, select Admin » Cost management.
4. Select Budgets.
5. If prompted, select the `na_finance_wh`.

## Clean up, summary, and additional resources

Congratulations! You have successfully completed this tutorial.

After credit usage data is populated for your account budget and custom budget, see [Use Snowsight to monitor budgets](../budgets/monitor.md).

### Summary and key points

In summary, you learned how to:

* Create custom roles to manage and monitor budgets.

  Custom roles enable non-account administrators to monitor credit usage for a budget and modify budget settings. For more
  information, see [Budgets roles and privileges](../budgets.md).
* Grant the required privileges to add objects to a custom budget.

  The APPLYBUDGET privilege must be granted on an object to add or remove it from a custom budget. Objects are added or removed
  by [reference](../../sql-reference/references.md). For more information, see [Add or remove objects from a custom budget](../budgets/custom-budget.md).
* Activate and set up the account budget.

  The account budget must be activated and set up to start monitoring credit usage for your account. The account budget monitors
  compute costs including background maintenance tasks and serverless features and sends an email notification when current
  spending is expected to exceed the monthly spending limit.

  For more information, see [Activating the account budget](../budgets/account-budget.md).
* Create a custom budget to monitor a specified group of objects in your account.

  Custom budgets monitor credit usage for a group of objects in your account. Custom budgets monitor credit usage for compute
  costs for the objects in the group including background maintenance tasks and serverless features.

  For more information, see [Custom budgets](../budgets/custom-budget.md).

For more information, see the following topics:

* For a list of supported objects and the serverless features monitored by custom budgets, see [Supported objects for custom budgets](../budgets/custom-budget.md)
  and [Supported services](../budgets.md).
* For more information on monitoring budgets spending, see [Use Snowsight to monitor budgets](../budgets/monitor.md).

### Delete objects created in the tutorial

You can choose to keep the custom roles and custom budget you created in the tutorial to monitor credit usage. Otherwise,
drop the budget and the related custom roles:

To delete the custom budget created in the tutorial, execute the following statements:

```sqlexample
USE ROLE budget_owner;

DROP SNOWFLAKE.CORE.BUDGET budgets_db.budgets_schema.na_finance_budget;
```

To delete the objects created in this tutorial, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;

DROP DATABASE na_finance_db;
DROP WAREHOUSE na_finance_wh;
DROP DATABASE budgets_db;
```

To delete the custom roles created for managing and monitoring the custom budget, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;

DROP ROLE budget_monitor;
DROP ROLE budget_admin;
DROP ROLE budget_owner;
```

Snowflake recommends leaving the account budget activated. However, if you decide to deactivate it, see
[Deactivating the account budget](../budgets/account-budget.md) for more information and instructions.

To delete the account budget monitor and administrator roles, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;

DROP ROLE account_budget_monitor;
DROP ROLE account_budget_admin;
```

To delete the notification integration, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;

DROP NOTIFICATION INTEGRATION budgets_notification_integration;
```

### Additional resources

Continue learning about budgets and Snowflake using the following resources:

* [Monitor credit usage with budgets](../budgets.md)
* [Understand budget costs](../budgets/cost.md)
* [Troubleshoot budgets](../budgets/troubleshoot.md)

---
title: Tutorial: Get started with dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md
section: User Guide
---

dbt

data engineering

tasty bytes

getting started

# Tutorial: Get started with dbt Projects on Snowflake

## Introduction

This tutorial guides you through creating a workspace for [dbt Projects on Snowflake](../data-engineering/dbt-projects-on-snowflake.md) that is connected to a GitHub repository that you fork from our [getting-started-with-dbt-on-snowflake repository](https://github.com/Snowflake-Labs/getting-started-with-dbt-on-snowflake) in Snowflake Labs. You then use the workspace to update dbt project files, and test and run the dbt project, which materializes the data model output of the dbt project in target Snowflake databases and schemas. You deploy the project to create a dbt project object on Snowflake. Finally, you set up a task to execute the project on a schedule that you define.

### Prerequisites

* **GitHub**

  + A GitHub account that can create a repository and manage access to that repository.
  + Git on the command line. For more information about installation, see [Set up Git](https://docs.github.com/en/get-started/git-basics/set-up-git).
* **Snowflake**

  + A Snowflake account and user with privileges as described in [Access control for dbt projects on Snowflake](../data-engineering/dbt-projects-on-snowflake-access-control.md).
  + Privileges to create and edit the following objects or access to an administrator who can create each of them on your behalf:

    - An API integration
    - If your GitHub repository is private, a secret
    - A network rule
    - (Optional) An external access integration that references the network rule
    - Your user object

## Set up your environment

Complete the following steps to set up your environment for this tutorial:

1. Fork and clone the dbt Projects on Snowflake getting started repository
2. (Optional) Create a warehouse for executing workspace actions
3. Create a database and schema for integrations and model materializations
4. Create an API integration in Snowflake for connecting to GitHub
5. (Optional) Create an external access integration in Snowflake for dbt dependencies

### Fork and clone the dbt Projects on Snowflake getting started repository

1. Go to <https://github.com/Snowflake-Labs/getting-started-with-dbt-on-snowflake>, select the down arrow next to Fork, and then select Create a new fork.
2. Specify the owner and name of your forked repository and other details. Later in the tutorial, we use the following URL to represent your forked repository:

   ```none
   https://github.com/my-github-account/getting-started-with-dbt-on-snowflake.git
   ```

### (Optional) Create a warehouse for executing workspace actions

A dedicated warehouse assigned to your workspace can help you log, trace, and identify actions initiated from within that workspace. In this tutorial, we use a warehouse named TASTY_BYTES_DBT_WH. Alternatively, you can use an existing warehouse in your account. For more information about creating a warehouse, see [Creating a warehouse](../warehouses-tasks.md).

The Tasty Bytes data model that you create for source data is fairly large, so we recommend using an XL warehouse.

To create a warehouse, run the following SQL command:

```sqlexample
CREATE WAREHOUSE tasty_bytes_dbt_wh WAREHOUSE_SIZE = XLARGE;
```

### Create a database and schema for integrations and model materializations

This tutorial uses a database named TASTY_BYTES_DBT_DB. Within that database, you create a schema named INTEGRATIONS to store the objects that Snowflake needs for GitHub integration. You create schemas named DEV and PROD to store materialized objects that your dbt project creates.

To create the database and schemas, run the following SQL commands:

```sqlexample
CREATE DATABASE tasty_bytes_dbt_db;
CREATE SCHEMA tasty_bytes_dbt_db.dev;
CREATE SCHEMA tasty_bytes_dbt_db.prod;
CREATE SCHEMA tasty_bytes_dbt_db.integrations;
```

### Create an API integration in Snowflake for connecting to GitHub

Snowflake needs an API integration to interact with GitHub.

If your repository is private, you must also create a secret in Snowflake to store GitHub credentials for your repository. You then specify the secret in the API integration definition as one of the ALLOWED_AUTHENTICATION_SECRETS. You also specify this secret when you create the workspace for your dbt project later in this tutorial.

Creating a secret requires a personal access token for your repository. For more information about creating a token, see [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens) in GitHub documentation.

This tutorial uses a secret named TB_DBT_GIT_SECRET. For more information about creating a secret, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

To create a secret for GitHub, run the following SQL commands:

```sqlexample
USE tasty_bytes_dbt_db.integrations;
CREATE OR REPLACE SECRET tasty_bytes_dbt_db.integrations.tb_dbt_git_secret
  TYPE = password
  USERNAME = 'your-gh-username'
  PASSWORD = 'YOUR_PERSONAL_ACCESS_TOKEN';
```

To create an API integration for GitHub that uses the secret you just created, run the following SQL command. Replace `https://github.com/my-github-account` with the HTTPS URL of the GitHub account for your forked repository:

```sqlexample
CREATE OR REPLACE API INTEGRATION tb_dbt_git_api_integration
  API_PROVIDER = git_https_api
  API_ALLOWED_PREFIXES = ('https://github.com/my-github-account')
  -- Comment out the following line if your forked repository is public
  ALLOWED_AUTHENTICATION_SECRETS = (tasty_bytes_dbt_db.integrations.tb_dbt_git_secret)
  ENABLED = TRUE;
```

### (Optional) Create an external access integration in Snowflake for dbt dependencies

When you run dbt commands in a workspace, dbt might need to access remote URLs to download dependencies. For example, dbt might need to download packages from the dbt Package hub or from GitHub.

Most dbt projects specify dependencies in their `packages.yml` file. You must install these dependencies in the dbt project workspace. You can’t update a deployed dbt project object with dependencies.

To get dependency files from remote URLs, Snowflake needs an external access integration that relies on a network rule.

For more information about external access integrations in Snowflake, see [Creating and using an external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

To create a network rule and an external access integration, run the following SQL commands:

```sqlexample
-- Create NETWORK RULE for external access integration

CREATE OR REPLACE NETWORK RULE dbt_network_rule
  MODE = EGRESS
  TYPE = HOST_PORT
  -- Minimal URL allowlist that is required for dbt deps
  VALUE_LIST = (
    'hub.getdbt.com',
    'codeload.github.com'
    );

-- Create EXTERNAL ACCESS INTEGRATION for dbt access to external dbt package locations

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION dbt_ext_access
  ALLOWED_NETWORK_RULES = (dbt_network_rule)
  ENABLED = TRUE;
```

## Create a workspace connected to your Git repository

In this step, you create a workspace in Snowsight that is connected to your GitHub repository. For more information about workspaces, see [Workspaces](../ui-snowsight/workspaces.md).

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. From the Workspaces list above the workspace files area, under Create Workspace, select From Git repository. (The Workspaces list has a default selection of My Workspace.)
4. For Repository URL, enter the HTTPS URL of your forked GitHub repository; for example, *https://github.com/my-github-account/getting-started-with-dbt-on-snowflake.git*
5. For Workspace name, enter a name. Later in this tutorial, we use *tasty_bytes_dbt*.
6. Under API integration, select the name of the API integration that you created earlier; for example, *TB_DBT_GIT_API_INTEGRATION*.
7. If your GitHub repository is public, select Public repository, and then select Create.

   > **Note:**
   >
   > Workspaces don’t support committing and pushing changes from a workspace to a public repository.
8. If your GitHub repository is private, and you created a secret for your API integration during setup, do the following:

   1. Select Personal access token.
   2. Under Credentials secret, select Select database and schema.
   3. Select the database from the list (for example, **TASTY_BYTES_DBT_DB**), and then select the schema from the list (for example, **INTEGRATIONS**) where you stored the API integration.
   4. Select Select secret, and then select your secret from the list; for example, **tb_dbt_git_secret**.
9. Select Create.

   Snowflake connects to the GitHub repository that you specified and opens your new workspace. A single folder in the workspace named `tasty_bytes_dbt_demo` contains the dbt project that you will work with.

### Verify the contents of the profiles.yml file in your dbt project root

Each dbt project folder in your Snowflake workspace must contain a `profiles.yml` file that specifies a target `warehouse`, `database`, `schema`, and `role` in Snowflake for the project. The `type` must be set to `snowflake`. dbt requires an `account` and `user`, but these can be left with an empty or arbitrary string because the dbt project runs in Snowflake under the current account and user context.

When you run dbt commands, your workspace reads the `profiles.yml` file. When you have at least one valid `target` specified in `profiles.yml`, each target is available to select from the Profile list in the menu bar above the workspace editing pane. When you run a dbt command, the workspace uses the selected profile (`target`) to run the command.

Open the `tasty_bytes_dbt_demo/profiles.yml` file, and then verify that your contents match the following example. If you specified different database or warehouse names earlier, replace them with your own.

```yaml
tasty_bytes:
  target: dev
  outputs:
    dev:
      type: snowflake
      account: 'not needed'
      user: 'not needed'
      role: accountadmin
      database: tasty_bytes_dbt_db
      schema: dev
      warehouse: tasty_bytes_dbt_wh
      threads: 8
    prod:
      type: snowflake
      account: 'not needed'
      user: 'not needed'
      role: accountadmin
      database: tasty_bytes_dbt_db
      schema: prod
      warehouse: tasty_bytes_dbt_wh
      threads: 8
```

## Run the SQL commands in tasty_bytes_setup.sql to set up source data

As source data for its transformations, the dbt project in your repository uses the foundational data model for the fictitious Tasty Bytes food truck brand. The SQL script to create the data model is in the workspace.

1. In your workspace, navigate to the `tasty_bytes_dbt_demo/setup/tasty_bytes_setup.sql` file, and then open it.
2. From the context selector in the upper right of the workspace editor, select the warehouse you created earlier; for example, **TASTY_BYTES_DBT_WH**.
3. The SQL file contains commands that you already ran in this tutorial. Near the beginning of the file, find the following commands, and then comment them out so that you don’t run them again and create duplicate resources:

   ```sqlexample
   CREATE OR REPLACE WAREHOUSE ...;
   CREATE OR REPLACE API INTEGRATION ...;
   CREATE OR REPLACE NETWORK RULE ...;
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION ...;
   ```
4. Run the uncommented SQL commands in the file.

   > **Tip:**
   >
   > Use `cmd` + `Shift` + `Enter` to run all uncommented commands.

   The Output tab displays the following message:

   `tasty_bytes_dbt_db setup is now complete`

### Enable logging, tracing, and metrics

You can capture logging and tracing events for a dbt project object and for the task that runs it on a schedule, if applicable. For more information, see [Monitor dbt Projects on Snowflake](../data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

To enable this feature, you must set logging, tracing, and metrics on the schema where the dbt project object and task are deployed.

The following commands in the `tasty_bytes_setup.sql` file enable logging, tracing, and metrics for the DEV and PROD schemas in the TASTY_BYTES_DBT_DB database. You ran these in the previous step. They are shown here for reference so that you can enable logging, tracing, and metrics for your own projects.

```sqlexample
ALTER SCHEMA tasty_bytes_dbt_db.dev SET LOG_LEVEL = 'INFO';
ALTER SCHEMA tasty_bytes_dbt_db.dev SET TRACE_LEVEL = 'ALWAYS';
ALTER SCHEMA tasty_bytes_dbt_db.dev SET METRIC_LEVEL = 'ALL';

ALTER SCHEMA tasty_bytes_dbt_db.prod SET LOG_LEVEL = 'INFO';
ALTER SCHEMA tasty_bytes_dbt_db.prod SET TRACE_LEVEL = 'ALWAYS';
ALTER SCHEMA tasty_bytes_dbt_db.prod SET METRIC_LEVEL = 'ALL';
```

## (Optional) Execute the dbt deps command for your project

You can use the workspace to execute common dbt commands for a project. For a list of available commands, see [Supported dbt commands and flags](../data-engineering/dbt-projects-on-snowflake-supported-commands.md). To run a command, you select the dbt Project, Profile, and dbt command from the lists above the workspace editor. You then select the execute button. Use the down arrow next to the execute button to specify additional arguments that the dbt command supports.

When you execute any dbt command within the workspace, the Output tab shows the command that executes on Snowflake (in green) and the stdout for that command so that you can monitor command success or failure.

The first command you must execute for any dbt project is `deps`, which updates the dependencies for your project specified in your project’s `packages.yml` file. Other commands will fail unless you have updated dependencies. For more information, see [Limitations, requirements, and considerations for dbt dependencies](../data-engineering/dbt-projects-on-snowflake-dependencies.md).

1. Below the workspace editor, open the Output tab so that you can see stdout after you run dbt commands from the workspace.
2. From the menu bar above the workspace editor, confirm that the default Project (tasty_bytes_dbt_demo) is selected. You can have any Profile selected. This project has the profiles `dev` and `prod` defined in the `profiles.yml` file.
3. From the command list, select Deps.
4. Next to the execute button, select the down arrow.
5. In the dbt Deps window, do the following:

   * Select Run with defaults.
   * Enter the name of the External Access Integration you created during setup in the space provided; for example, *dbt_ext_access*.
6. To run the command, select Deps.

   The Output tab displays the SQL command that runs on Snowflake, which is similar to the following:

   ```sqlexample
   execute dbt project from workspace "USER$"."PUBLIC"."tasty_bytes_dbt" project_root='tasty_bytes_dbt_demo' args='deps --target dev' external_access_integrations = (dbt_ext_access)
   ```

   When the command finishes, stdout messages appear that are similar to the following:

   ```output
   14:47:19  Running with dbt=1.8.9
   14:47:19  Updating lock file in file path: /tmp/dbt/package-lock.yml
   14:47:19  Installing dbt-labs/dbt_utils
   14:47:19  Installed from version 1.3.0
   14:47:19  Up to date!
   Uploading /tmp/dbt/package-lock.yml to snow://workspace/USER$ADMIN.PUBLIC."tasty_bytes_dbt"/versions/live/dbt//package-lock.yml
   ```

   The `package_lock.yml` file is created and appears in your list of workspace files with an A next to it. This indicates that the file was added in the workspace for your dbt project, with contents that are similar to the following example:

   ```yaml
   packages:
     - package: dbt-labs/dbt_utils
       version: 1.3.0
   ```

## Compile the dbt project, view the DAG, and view compiled SQL

Compiling a project in dbt creates executable SQL from modeled SQL files and a visual representation of the directed acyclic graph (DAG) for the project in the workspace. For more information about dbt project compilation, see [compile](https://docs.getdbt.com/reference/commands/compile) in dbt documentation.

After you compile the project in the workspace, you can view the DAG. You also can open any SQL file in the `models` folder to see the model SQL and the compiled SQL in side-by-side tabs.

1. Select the project and target that you want to compile.
2. From the command list, select Compile, and then select the execute button (optionally, you can select the down arrow and specify compile command arguments).
3. In the area below the workspace editor, select the DAG tab.

   You can use the DAG pane to visualize your dbt project transformations from source files to materialized data model objects in Snowflake.

   * Click and drag anywhere in the pane to pan the view.
   * Use the + and – buttons to zoom in and out.
4. To view the contents of an object’s source file in the editor, select a tile for any object.
5. To see compiled SQL in a split-pane view in the workspace editor:

   1. In the DAG, select the tile for a dbt SQL model file; for example, orders.

      –OR–

      From the workspace file listing, select any file in the `models` subdirectory of your dbt project to open it in the workspace editor.
   2. Choose View Compiled SQL in the upper-right of the workspace editor to see the compiled SQL in a split-pane view.

## Run the dbt dev project and verify the materialized Snowflake objects

Executing the dbt `run` command executes your compiled SQL against the target database and schema using the Snowflake warehouse and role that are specified in the `profiles.yml` file of the project. In this step, you’ll materialize the output of the `Dev` target in your dbt demo project. You then create a SQL worksheet named `dbt_sandbox.sql` in the workspace where you can run SQL to verify object creation.

> **Important:**
>
> Choosing the dbt Run or Build command for a project from within a workspace materializes target output using the `role` defined in the project’s `profiles.yml` file. Both the user and the role specified must have the required privileges to use the `warehouse`, perform operations on the `database` and `schema` that are specified in the project’s `profiles.yml` file, and perform operations on any other Snowflake objects that the dbt model specifies.

1. From the Profile list, select **Dev**.
2. From the command list, select Run, and then select the execute button.

   The output pane shows the completion status of the run.
3. In your **tasty_bytes_dbt_demo** project, navigate to the `examples` folder, select the + next to the folder name, and then select SQL File.
4. Enter *dbt_sandbox.sql*, and then press `Enter`.
5. In the workspace tab for `dbt_sandbox.sql`, run the following query:

   ```sqlexample
   SHOW TABLES IN DATABASE tasty_bytes_dbt_db;
   ```

   In the Status and Results pane, you should see the tables CUSTOMER_LOYALTY_METRICS, ORDERS, and SALES_METRICS_BY_LOCATION.
6. To see the views that your dbt project run created, run the following command :

   ```sqlexample
   SHOW VIEWS IN DATABASE tasty_bytes_dbt_db;
   ```

## Push your file updates from the workspace to your repository

Now that you have updated your workspace and compiled, tested, run, and deployed your project as a dbt project object, you can push the changes you made in the workspace to your private GitHub repository. This step isn’t supported for public repositories.

1. With your workspace open, select Changes.

   The workspace file listing is filtered to show only files that have changed since you synchronized with the Git repository.

   * A indicates a file added in the workspace and not to the Git repository.
   * M indicates a modified file.
   * D indicates a deleted file.
2. Select a file to view its diff with GitHub since the last pull (in this case, when the workspace was created).
3. On the menu bar above the workspace file listing, verify that the branch selector is set to main for this tutorial.
4. Select the Push button, and then type a commit message in the box provided; for example, *Updating project with initial changes from dbt on Snowflake*.
5. Select Push.

   A push to your repository might take several minutes.

## Deploy the dbt project object from the workspace

Deploying your dbt project from a workspace creates a dbt project object. You can use the object to schedule, run, and monitor a dbt project in Snowflake outside of the workspace.

When you deploy your dbt project object from the workspace to a Snowflake database and schema, you can create or overwrite an object that you previously created.

1. On the right side of the workspace editor, select Connect » Deploy dbt project.
2. Select Select database and schema, and then select the **TASTY_BYTES_DBT_DB** database and the **DEV** schema.
3. Under Select or Create dbt Object, select Create dbt Object.
4. Under Enter Name, type *TASTY_BYTES_DBT_PROJECT*, and then select Deploy.

   The Output tab displays the command that runs on Snowflake, which is similar to the following example:

   ```sqlexample
   create or replace DBT PROJECT "TASTY_BYTES_DBT_DB"."DEV"."TASTY_BYTES_DBT_PROJECT" from snow://workspace/USER$MYUSER.PUBLIC."tasty_bytes_dbt_demo"/versions/live/dbt

   tasty_bytes_dbt_project successfully created.
   ```

   The Connect menu now displays the name of the dbt project object that you created, with the following options:

   * Redeploy dbt project - Updates the dbt project object with the current workspace version of the project by using ALTER. This increments the version of the dbt project object by one. For more information, see [Versions for dbt project objects and files](../data-engineering/dbt-projects-on-snowflake-versions.md).
   * Disconnect - Disconnects the workspace from the dbt project object, but doesn’t delete the dbt project object.
   * View project - Opens the dbt project object in the object explorer, where you can view the CREATE DBT PROJECT command for the dbt project object and run history for the project.
   * Create schedule - Provides options for you to create a task that runs the dbt project object on a schedule. For more information, see Create a task to schedule dbt project execution.
   * View schedules - Opens a list of schedules (tasks) that run the dbt project object, with the option to view task details in the object explorer.
5. To verify the creation of the project, do one or both of the following tasks:

   * From the menu for the dbt project, select View project to open the dbt project object in the object explorer.

     –OR–
   * From the `dbt_sandbox.sql` file worksheet that you created earlier, run the following command:

     ```sqlexample
     SHOW DBT PROJECTS LIKE 'tasty%';
     ```

## Create a task to schedule dbt project execution

Now that you have deployed your dbt project object, you can use the workspace or SQL to set up a task that executes a dbt command on your dbt project object.

The following steps set up a schedule to execute the dbt project every hour at one minute after the hour. The task executes the dbt `run` command with the `--select` option to run the `customer_loyalty_metrics` model in the dbt project.

1. From the dbt project menu in the upper right of the workspace editor, choose Create schedule.
2. In the Schedule a dbt run dialog box, do the following:

   * For Schedule name, enter a name for the task; for example, *run_prepped_data_dbt*.
   * For Frequency, leave Hourly at 01 for your time zone selected.
   * Under dbt properties:

     + For Operation, select run.
     + For Profile, select dev.
     + For Additional flags, enter `--select customer_loyalty_metrics`.
3. Choose Create.

   Snowflake creates a task that runs an EXECUTE DBT PROJECT command using these parameters. For more information about tasks and task options, see [Introduction to tasks](../tasks-intro.md) and [CREATE TASK](../../sql-reference/sql/create-task.md).
4. From the dbt project menu, select View schedules, and then choose your schedule from the list.

   The object explorer opens to your database with the Task Details pane opened for the task. The Task Definition shows a [CREATE TASK](../../sql-reference/sql/create-task.md) command similar to the following:

   ```sqlexample
   CREATE OR REPLACE TASK tasty_bytes_dbt_db.dev.run_prepped_data_dbt
     WAREHOUSE=tasty_bytes_dbt_wh
     SCHEDULE ='USING CRON 1 * * * * America/Los_Angeles'
   AS
     EXECUTE DBT PROJECT tasty_bytes_dbt_project ARGS='run --select customer_loyalty_metrics --target dev';
   ```

## Clean up

You can delete the databases, workspaces, and warehouse that you created to clean up after this tutorial.

Run the following SQL commands from your `dbt_sandbox.sql` worksheet to remove the warehouse, the TASTY_BYTES_DBT_DB and TB_101 databases that you created, and all schemas and objects created in the databases:

```sqlexample
DROP WAREHOUSE IF EXISTS tasty_bytes_dbt_wh;
DROP DATABASE IF EXISTS tasty_bytes_dbt_db;
DROP DATABASE IF EXISTS tb_101;
```

**To delete your tasty_bytes_dbt_demo workspace:**

* From the vertical ellipsis menu  next to the workspace menu at the top of the workspace explorer, select Delete, and then confirm the deletion when you’re prompted.

---
title: Tutorial: Get started with Snowpipe Streaming high-performance architecture SDK
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-getting-started.md
section: User Guide
---

# Tutorial: Get started with Snowpipe Streaming high-performance architecture SDK

This tutorial provides step-by-step instructions for setting up and running a demo application that utilizes the new high-performance architecture with the `snowpipe-streaming` SDK.

## Prerequisites

Before you run the demo, ensure that you meet the following prerequisites:

* Snowflake account: Verify that you have access to a Snowflake account. You will need a user with sufficient privileges (e.g., ACCOUNTADMIN or USERADMIN for the initial setup) to create the dedicated user and custom role detailed in Step 1: Configure Snowflake objects.
* Network access: Ensure that your network allows outbound connectivity to Snowflake and Amazon S3 or Google Cloud Platform (GCS) or Azure Blob Storage. Adjust firewall rules if necessary because the SDK makes REST API calls to Snowflake and to your cloud storage provider.

  + To verify network connectivity, use the following command:

  ```bash
  # Test connectivity to Snowflake; replace with your account URL
  curl -I https://<your_account_identifier>.snowflakecomputing.com

  # Test connectivity to AWS S3
  curl -I https://s3.amazonaws.com

  # Test connectivity to GCS
  curl -I https://storage.googleapis.com

  # Test connectivity to Azure Blob Storage
  curl -I https://azure.blob.core.windows.net  or curl -I https://<your_account_name>.blob.core.windows.net
  ```
* Java Development Environment: Install Java 11 or later, and Maven for dependency management.
* Python: Install Python version 3.9 or later.
* System requirements: The SDK requires glibc version 2.26 or later. You can check your current glibc version with:

  ```bash
  ldd --version
  ```
* Snowpipe Streaming SDKs and the sample code:

  + For **AWS**: Obtain the [Java SDK](https://central.sonatype.com/artifact/com.snowflake/snowpipe-streaming) or [Python SDK](https://pypi.org/project/snowpipe-streaming/) (any version).
  + For **Azure**: Requires SDK version 1.1.0 or later.
  + For **GCP**: Requires SDK version 1.1.0 or later.

  Download the sample code for your preferred language from the [Snowpipe Streaming SDK examples in the GitHub repository](https://github.com/snowflakedb/snowpipe-streaming-sdk-examples).

## Get started

This section outlines the steps required to set up and run the demo application.

### Step 1: Configure Snowflake objects

Before you can use the `snowpipe-streaming` SDK, you must create a target table within your Snowflake environment. Unlike the classic architecture, the high-performance architecture requires a PIPE object for data ingestion. This tutorial uses the default pipe that is automatically created at ingest time for your target table. If you require additional features, such as in-flight transformations or clustering at ingest time, see [CREATE PIPE](../../sql-reference/sql/create-pipe.md).

#### Generate a key pair for authentication

Generate a private-public key pair for authentication using OpenSSL. For more information, see [Key-pair authentication and key-pair rotation](../key-pair-auth.md).

Run the following commands in your terminal to generate the keys:

```bash
openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
```

```bash
PUBK=$(cat ./rsa_key.pub | grep -v KEY- | tr -d '\012')
echo "ALTER USER MY_USER SET RSA_PUBLIC_KEY='$PUBK';"
```

> **Important:**
>
> Save the generated `rsa_key.p8` (private key) and `rsa_key.pub` (public key) files securely. You will use these keys in subsequent authentication steps.

#### Create database, schema, table, and configure user authentication

Run the following SQL commands in your Snowflake account; for example, by using Snowsight or Snowflake CLI). You must have a role with permissions to create users, roles, and databases — such as ACCOUNTADMIN or USERADMIN for the first few lines, and then switching to the new role. Replace placeholders like MY_USER, MY_ROLE, MY_DATABASE, and so on, with the names that you want.

```sqlexample
-- 1. Create a dedicated role and user (Run with a highly-privileged role)
CREATE OR REPLACE USER MY_USER;
CREATE ROLE IF NOT EXISTS MY_ROLE;
GRANT ROLE MY_ROLE TO USER MY_USER;

-- 2. Set the public key for key-pair authentication
-- NOTE: Replace 'YOUR_FORMATTED_PUBLIC_KEY' with the output of the PUBK variable from the key generation step.
ALTER USER MY_USER SET RSA_PUBLIC_KEY='YOUR_FORMATTED_PUBLIC_KEY';

-- 3. Set the default role (Recommended)
ALTER USER MY_USER SET DEFAULT_ROLE=MY_ROLE;

-- 4. Switch to the new role and create objects
USE ROLE MY_ROLE;
-- NOTE: You may also need to run USE WAREHOUSE YOUR_WH; here if a default warehouse isn't set.

-- Create database and schema
CREATE OR REPLACE DATABASE MY_DATABASE;
CREATE OR REPLACE SCHEMA MY_SCHEMA;

-- Create a target table
CREATE OR REPLACE TABLE MY_TABLE (
    data VARIANT,
    c1 NUMBER,
    c2 STRING
);

-- 5. Configure authentication policy (Optional, but recommended for explicit control)
CREATE OR REPLACE AUTHENTICATION POLICY testing_auth_policy
  AUTHENTICATION_METHODS = ('KEYPAIR')
  CLIENT_TYPES = ('DRIVERS');

-- Apply authentication policy (if created)
ALTER USER MY_USER SET AUTHENTICATION POLICY testing_auth_policy;
```

> **Note:**
>
> The `data` column in the sample table is a VARIANT type. The high-performance SDK requires that data for this column be passed as a native object; for example, a Java `Map` or Python dictionary. Passing a raw JSON string results in the data being stored as a string literal.

### Step 2: Configure an authentication profile

The demo application requires a `profile.json` file to store connection settings, including authentication details. The SDK uses key-pair authentication for secure connections.

#### Create a profile configuration file

Create or update the `profile.json` file in the root directory of your demo project.

#### profile.json template

```json
{
    "user": "MY_USER",
    "account": "your_account_identifier",
    "url": "https://your_account_identifier.snowflakecomputing.com:443",
    "private_key_file": "rsa_key.p8",
    "role": "MY_ROLE"
}
```

Replace the placeholders:

* `MY_USER`: Your Snowflake username configured in Step 1: Configure Snowflake objects.
* `your_account_identifier`: Your Snowflake account identifier (for example, `xy12345`).
* `rsa_key.p8`: The private key file you generated in Step 1: Configure Snowflake objects.
* `MY_ROLE`: The dedicated role (`MY_ROLE`) you created and granted to the user in Step 1: Configure Snowflake objects.

### Step 3: Set up the demo project

JavaPython

**Download:** [Sample Java code](https://github.com/snowflakedb/snowpipe-streaming-sdk-examples/tree/main/java-example)

**Add the JAR dependency**

To include the Snowpipe Streaming SDK, add the following dependency to your Maven `pom.xml`. Maven automatically downloads the JAR from the public repository.

```xml
<dependency>
    <groupId>com.snowflake</groupId>
    <artifactId>snowpipe-streaming</artifactId>
    <version>YOUR_SDK_VERSION</version>
</dependency>
<dependency>
    <groupId>com.fasterxml.jackson.core</groupId>
    <artifactId>jackson-databind</artifactId>
    <version>2.18.1</version>
</dependency>
```

> **Important:**
>
> Replace `YOUR_SDK_VERSION` with the specific version available on [Maven Central](https://central.sonatype.com/artifact/com.snowflake/snowpipe-streaming).

**Download:** [Sample Python code](https://github.com/snowflakedb/snowpipe-streaming-sdk-examples/tree/main/python-example)

**Add the Python dependency**

The SDK requires Python version 3.9 or later.

To install the Snowpipe Streaming SDK for Python, run the following command:

```bash
pip install snowpipe-streaming
```

For more information about the package, see [PyPI](https://pypi.org/project/snowpipe-streaming/).

#### Place the profile file

Ensure that the `profile.json` file that you configured in Step 2: Configure an authentication profile is located in the root directory of your project.

### Step 4: Use the provided code example and run the demo application

In your terminal, navigate to the project’s root directory.

JavaPython

**Build and execute**

* Build the project:

  > ```bash
  > mvn clean install
  > ```
* Run the main class:

  > ```bash
  > mvn exec:java -Dexec.mainClass="com.snowflake.snowpipestreaming.demo.Main"
  > ```

**Run the demo application**

Run the Python demo:

```bash
python example.py
```

### Step 5: Verify the data

After running the demo, verify the ingested data in Snowflake:

```sqlexample
SELECT COUNT(*) FROM MY_DATABASE.MY_SCHEMA.MY_TABLE;
SELECT * FROM MY_DATABASE.MY_SCHEMA.MY_TABLE LIMIT 10;
```

Verify that your data was ingested as a structured object rather than a string literal:

```sqlexample
SELECT
    data,
    TYPEOF(data) as data_type
FROM MY_DATABASE.MY_SCHEMA.MY_TABLE
LIMIT 10;
```

* If `data_type` returns `OBJECT`, the ingestion is correct.
* If `data_type` returns `VARCHAR`, your application is passing a string literal that isn’t being parsed.

---
title: Tutorial: Get started with Snowpipe Streaming REST API using cURL and a JWT
source: https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-rest-tutorial.md
section: User Guide
---

# Tutorial: Get started with Snowpipe Streaming REST API using cURL and a JWT

> **Note:**
>
> We recommend that you begin with the Snowpipe Streaming SDK over the REST API to benefit from the improved performance and getting-started experience.

This guide shows you how to stream data into Snowflake using the [Snowpipe Streaming REST API](snowpipe-streaming-high-performance-rest-api.md) and [a JSON Web Token (JWT) generated with SnowSQL](../../developer-guide/sql-api/authenticating.md).

## Prerequisites

Before you begin, ensure you have the following items:

**Snowflake User and Objects:**

A Snowflake user that is configured for key-pair authentication. Register your public key by using the following SQL command:

```sqlexample
ALTER USER MY_USER SET RSA_PUBLIC_KEY='<your-public-key>';
```

A Snowflake database, schema, and a target table for streaming ingestion. You can create them by using the following SQL commands and replacing placeholders like `MY_DATABASE`, `MY_SCHEMA`, `MY_TABLE` with the names that you want:

```sqlexample
-- Create Database and Schema
CREATE OR REPLACE DATABASE MY_DATABASE;
CREATE OR REPLACE SCHEMA MY_SCHEMA;

-- Create Target Table
CREATE OR REPLACE TABLE MY_TABLE (
    id NUMBER,
    c1 NUMBER,
    ts STRING
);
```

**ACCOUNT_IDENTIFIER:**

We suggest that you use Format 1 for the ACCOUNT_IDENTIFIER, which uses the account name within your organization; for example, `myorg-account123`. For more information on the format, see [Account identifiers](../admin-account-identifier.md).

**Installed tools:**

* `curl`: For making HTTP requests.
* `jq`: For parsing JSON responses.
* `SnowSQL`: For running commands, Snowflake’s command-line client.

**Generated JWT:**

Generate your JWT by using SnowSQL:

```bash
snowsql --private-key-path rsa_key.p8 --generate-jwt \
  -a <ACCOUNT_IDENTIFIER> \
  -u MY_USER
```

> **Caution:**
>
> Store your JWT securely. Avoid exposing it in logs or scripts.

## Step-by-step instructions

Complete the following steps to stream data into Snowflake.

### Step 1: Set environment variables

Set up the necessary environment variables for your Snowflake account and the streaming operation. Note that the `PIPE` variable targets the default streaming pipe associated with your table.

```bash
# Paste the JWT token obtained from SnowSQL
export JWT_TOKEN="PASTE_YOUR_JWT_TOKEN_HERE"

# Configure your Snowflake account and resources:
export ACCOUNT="<ACCOUNT_IDENTIFIER>" # For example, ab12345
export USER="MY_USER"
export DB="MY_DATABASE"
export SCHEMA="MY_SCHEMA"
export PIPE="MY_TABLE-STREAMING"
export CHANNEL="MY_CHANNEL"

# Replace ACCOUNT with your Account URL Host to form the control plane host:
export CONTROL_HOST="${ACCOUNT}.snowflakecomputing.com"
```

### Step 2: Discover ingest host

> **Important:**
>
> If your Snowflake account name contains underscores (e.g., MY_ACCOUNT), a known issue can cause an internal error when calling the ingestion service.
>
> You must replace all underscores with dashes in the INGEST_HOST before generating the scoped token. This converted format (with dashes) must be used for all subsequent REST API calls, including the generation of the scoped token itself.
>
> For example, if the hostname returned is `my_account.region.ingest.snowflakecomputing.com`, you must change it to `my-account.region.ingest.snowflakecomputing.com` for all subsequent REST API calls.

The ingest host is the endpoint for streaming data. Discover the ingest host by using your JWT:

```bash
export INGEST_HOST=$(curl -sS -X GET \
  -H "Authorization: Bearer $JWT_TOKEN" \
  -H "X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT" \
  "https://${CONTROL_HOST}/v2/streaming/hostname")

echo "Ingest Host: $INGEST_HOST"
```

Obtain a scoped token to authorize operations on the ingest host:

```bash
export SCOPED_TOKEN=$(curl -sS -X POST "https://$CONTROL_HOST/oauth/token" \
  -H 'Content-Type: application/x-www-form-urlencoded' \
  -H "Authorization: Bearer $JWT_TOKEN" \
  -d "grant_type=urn:ietf:params:oauth:grant-type:jwt-bearer&scope=${INGEST_HOST}")

echo "Scoped Token obtained for ingest host"
```

### Step 3: Open the channel

Open a streaming channel to begin data ingestion:

```bash
curl -sS -X PUT \
  -H "Authorization: Bearer $SCOPED_TOKEN" \
  -H "Content-Type: application/json" \
  "https://${INGEST_HOST}/v2/streaming/databases/$DB/schemas/$SCHEMA/pipes/$PIPE/channels/$CHANNEL" \
  -d '{}' | tee open_resp.json | jq .
```

### Step 4: Append a row of data

Append a single row of data to the open channel.

#### 4.1 Extract continuation and offset tokens

These tokens are crucial for maintaining the state of your streaming session.

```bash
export CONT_TOKEN=$(jq -r '.next_continuation_token' open_resp.json)
export OFFSET_TOKEN=$(jq -r '.channel_status.last_committed_offset_token' open_resp.json)
export NEW_OFFSET=$((OFFSET_TOKEN + 1))
```

#### 4.2 Create sample row

Generate a sample data row in NDJSON format:

```bash
export NOW_TS=$(date -u +"%Y-%m-%dT%H:%M:%SZ")

cat <<EOF > rows.ndjson
{
  "id": 1,
  "c1": $RANDOM,
  "ts": "$NOW_TS"
}
EOF
```

#### 4.3 Append row

Send the sample row to the streaming channel:

```bash
curl -sS -X POST \
  -H "Authorization: Bearer $SCOPED_TOKEN" \
  -H "Content-Type: application/x-ndjson" \
  -H "Content-Encoding: zstd" \
  "https://${INGEST_HOST}/v2/streaming/data/databases/$DB/schemas/$SCHEMA/pipes/$PIPE/channels/$CHANNEL/rows?continuationToken=$CONT_TOKEN&offsetToken=$NEW_OFFSET" \
  --data-binary @rows.ndjson | jq .
```

> **Note:**
>
> This example includes the `Content-Encoding: zstd` header to demonstrate compression support. For this simple example with uncompressed data, you can omit this header. When you send compressed data, specify either `zstd` or `gzip` to match the compression format of your payload.

> **Important:**
>
> * After each append operation, you must update the `continuationToken` for the next append call. The response from the append rows call contains a `next_continuation_token` field that you should use to make your updates.
> * The success of the append operation confirms only that the data was received by the service, not that it is persisted to the table. Take the next step to verify persistence before querying or moving to the next batch.

#### 4.4 Verify data persistence and committed offset by using `getChannelStatus`

Complete this critical step to ensure application reliability. Data isn’t guaranteed to be persistent until the `committedOffset` has advanced. To confirm that the rows that you just appended are successfully persisted, use `getChannelStatus`.

Check the current status of your streaming channel:

```bash
curl -sS -X POST \
  -H "Authorization: Bearer $SCOPED_TOKEN" \
  -H "Content-Type: application/json" \
  "https://${INGEST_HOST}/v2/streaming/databases/$DB/schemas/$SCHEMA/pipes/$PIPE:bulk-channel-status" \
  -d "{\"channel_names\": [\"$CHANNEL\"]}" | jq ".channel_statuses.\"$CHANNEL\""
```

**Verification check**

You must ensure that the `committedOffset` returned in the response is greater than or equal to the offset of the rows you just appended. Only after the `committedOffset` advances can you be certain that the data is safely available in the table.

#### 4.5 Query the table for persisted data

After you confirm that the `committedOffset` has advanced in the previous step (4.4), you can query to confirm that the data is ingested into your Snowflake table.

Run the following SQL query in Snowflake:

```sqlexample
SELECT * FROM MY_DATABASE.MY_SCHEMA.MY_TABLE WHERE id = 1;
```

### (Optional) Step 5: Clean up

Remove temporary files and unset environment variables:

```bash
rm -f rows.ndjson open_resp.json
unset JWT_TOKEN SCOPED_TOKEN ACCOUNT USER DB SCHEMA PIPE CHANNEL CONTROL_HOST INGEST_HOST CONT_TOKEN OFFSET_TOKEN NEW_OFFSET NOW_TS
```

## Troubleshooting

* **HTTP 401 (Unauthorized):** Verify that your JWT token is valid and not expired. If needed, regenerate it.
* **HTTP 404 (Not Found):** Double-check that the database, schema, pipe, and channel names are spelled correctly and exist in your Snowflake account.
* **No Ingest Host:** Ensure your control plane host URL is correct and accessible.

---
title: Tutorial: Getting started with data metric functions
source: https://docs.snowflake.com/en/user-guide/tutorials/data-quality-tutorial-start.md
section: User Guide
---

Snowflake

Data Governance

Data Quality

# Tutorial: Getting started with data metric functions

## Introduction

You can complete this tutorial using a worksheet in Snowsight or using a CLI client such as [SnowSQL](../snowsql.md).
Simply paste the code examples and run them.

By the end of this tutorial, you will learn how to:

* Create a custom data metric function (DMF) to measure data quality.
* Manage the DMF to optimize serverless credit usage.
* Monitor the serverless credit usage associated with calling the scheduled DMF.

## Access control setup

To complete this tutorial, use a single custom role that has all of the required access, which includes the following:

* Creating a database, which subsequently allows creating a schema, creating a DMF in the schema, and creating a table in the schema
* Creating a warehouse to perform query operations
* Querying the view that contains the results of calling the scheduled DMF
* Querying the view that contains serverless compute usage information

Create the `dq_tutorial_role` role to use throughout the tutorial:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> CREATE ROLE IF NOT EXISTS dq_tutorial_role;
> ```

Grant privileges, and grant the application role and database roles to the `dq_tutorial_role`:

> ```sqlexample
> GRANT CREATE DATABASE ON ACCOUNT TO ROLE dq_tutorial_role;
> GRANT EXECUTE DATA METRIC FUNCTION ON ACCOUNT TO ROLE dq_tutorial_role;
> GRANT APPLICATION ROLE SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER TO ROLE dq_tutorial_role;
> GRANT DATABASE ROLE SNOWFLAKE.USAGE_VIEWER TO ROLE dq_tutorial_role;
> GRANT DATABASE ROLE SNOWFLAKE.DATA_METRIC_USER TO ROLE dq_tutorial_role;
> ```

Create a warehouse to query the table that contains the data and grant the USAGE privilege on the role to the `dq_tutorial_role` role:

> ```sqlexample
> CREATE WAREHOUSE IF NOT EXISTS dq_tutorial_wh;
> GRANT USAGE ON WAREHOUSE dq_tutorial_wh TO ROLE dq_tutorial_role;
> ```

Confirm the grants to the `dq_tutorial_role` role:

> ```sqlexample
> SHOW GRANTS TO ROLE dq_tutorial_role;
> ```

Establish a role hierarchy and grant the role to a user who can complete this tutorial (replace the `jsmith` value):

> ```sqlexample
> GRANT ROLE dq_tutorial_role TO ROLE SYSADMIN;
> GRANT ROLE dq_tutorial_role TO USER jsmith;
> ```

## Data setup

To facilitate managing the data and the DMF for this tutorial, create a dedicated database to contain these objects:

### Create a table

```sqlexample
USE ROLE dq_tutorial_role;
CREATE DATABASE IF NOT EXISTS dq_tutorial_db;
CREATE SCHEMA IF NOT EXISTS sch;

CREATE TABLE customers (
  account_number NUMBER(38,0),
  first_name VARCHAR(16777216),
  last_name VARCHAR(16777216),
  email VARCHAR(16777216),
  phone VARCHAR(16777216),
  created_at TIMESTAMP_NTZ(9),
  street VARCHAR(16777216),
  city VARCHAR(16777216),
  state VARCHAR(16777216),
  country VARCHAR(16777216),
  zip_code NUMBER(38,0)
);
```

### Insert values into a table

Add data to the table:

> ```sqlexample
> USE WAREHOUSE dq_tutorial_wh;
>
> INSERT INTO customers (account_number, city, country, email, first_name, last_name, phone, state, street, zip_code)
>   VALUES (1589420, 'san francisco', 'usa', 'john.doe@', 'john', 'doe', 1234567890, null, null, null);
>
> INSERT INTO customers (account_number, city, country, email, first_name, last_name, phone, state, street, zip_code)
>   VALUES (8028387, 'san francisco', 'usa', 'bart.simpson@example.com', 'bart', 'simpson', 1012023030, null, 'market st', 94102);
>
> INSERT INTO customers (account_number, city, country, email, first_name, last_name, phone, state, street, zip_code)
>   VALUES
>     (1589420, 'san francisco', 'usa', 'john.doe@example.com', 'john', 'doe', 1234567890, 'ca', 'concar dr', 94402),
>     (2834123, 'san mateo', 'usa', 'jane.doe@example.com', 'jane', 'doe', 3641252911, 'ca', 'concar dr', 94402),
>     (4829381, 'san mateo', 'usa', 'jim.doe@example.com', 'jim', 'doe', 3641252912, 'ca', 'concar dr', 94402),
>     (9821802, 'san francisco', 'usa', 'susan.smith@example.com', 'susan', 'smith', 1234567891, 'ca', 'geary st', 94121),
>     (8028387, 'san francisco', 'usa', 'bart.simpson@example.com', 'bart', 'simpson', 1012023030, 'ca', 'market st', 94102);
> ```

## Create and work with DMFs

In the following sections, we will create a user-defined DMF to measure the count of invalid email addresses and subsequently do the
following:

* Schedule the DMF to run every 5 minutes.
* Check the DMF table references (find the tables the DMF is set on).
* Query a built-in view that contains the result of calling the scheduled DMF.
* Unset the DMF from the table to avoid unnecessary serverless credit usage.

### Create a DMF

Create a data metric function (DMF) to return the number of email addresses in a column that don’t match the specified regular expression:

> ```sqlexample
> CREATE DATA METRIC FUNCTION IF NOT EXISTS
>   invalid_email_count (ARG_T table(ARG_C1 STRING))
>   RETURNS NUMBER AS
>   'SELECT COUNT_IF(FALSE = (
>     ARG_C1 REGEXP ''^[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,4}$''))
>     FROM ARG_T';
> ```

### Set the schedule on the table

The DMF schedule defines when all DMFs on the table run. Currently, 5 minutes is the shortest possible time interval:

> ```sqlexample
> ALTER TABLE customers SET DATA_METRIC_SCHEDULE = '5 MINUTE';
> ```

> **Note:**
>
> For the purpose of the tutorial, the schedule is set for 5 minutes. However, after you optimize your DMF use cases, experiment with the
> other schedule settings, such as cron expressions or trigger events associated with DML operations that affect the table.

### Set the DMFs on the table and check the references

Associate the DMF to the table:

> ```sqlexample
> ALTER TABLE customers ADD DATA METRIC FUNCTION
>   invalid_email_count ON (email);
> ```

Because the schedule is set for 5 minutes, we need to wait 5 minutes in order for Snowflake to call the DMF and process the results. For
now, we can check to see that the DMF is associated with the table by calling the
[DATA_METRIC_FUNCTION_REFERENCES](../../sql-reference/functions/data_metric_function_references.md) Information Schema table function:

> ```sqlexample
> SELECT * FROM TABLE(INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_REFERENCES(
>   REF_ENTITY_NAME => 'dq_tutorial_db.sch.customers',
>   REF_ENTITY_DOMAIN => 'TABLE'));
> ```

### View the DMF results

The results of calling the scheduled DMF are stored in the DATA_QUALITY_MONITORING_RESULTS view. To determine the number of invalid email
addresses, query the [DATA_QUALITY_MONITORING_RESULTS](../../sql-reference/local/data_quality_monitoring_results.md) view to see the results
of calling the scheduled DMF:

> ```sqlexample
> SELECT scheduled_time, measurement_time, table_name, metric_name, value
> FROM SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS
> WHERE TRUE
> AND METRIC_NAME = 'INVALID_EMAIL_COUNT'
> AND METRIC_DATABASE = 'DQ_TUTORIAL_DB'
> LIMIT 100;
> ```

The results show that the `value` column contains `1`. This number corresponds to one improperly formatted email
address, which corresponds to the first INSERT statement in the Insert values into a table section.

### Unset the DMFs from the table

You have established that the DMF is working as expected based on the definition of the DMF, the schedule, and the expected results.

To avoid unnecessary serverless credit usage, unset the DMF from the table:

> ```sqlexample
> ALTER TABLE customers DROP DATA METRIC FUNCTION
>   invalid_email_count ON (email);
> ```

## Use DMF to return failed records

In this section, you will return records that failed a data quality check because they had blank values.

The data quality metric function identifies rows that contain data that failed the quality check. You can run a data metric scan to
extract and return these records.

To return the rows identified by a DMF, follow these steps:

* Create a table.
* Add bad records to the table.
* Run the data metric scan to return records with blank values.
* View the scan results.
* Update records with a new value.

### Create a table

Paste and run the following statement to create a table.

> ```sqlexample
> CREATE or REPLACE table dq_tutorial_db.sch.employeesTable (
>   id NUMBER,
>   name VARCHAR,
>   last_name VARCHAR,
>   email VARCHAR,
>   zip_code NUMBER
>  );
> ```

### Insert values into a table

Add data with a few bad records, such as blank values, to the table:

> ```sqlexample
> INSERT INTO dq_tutorial_db.sch.employeesTable (id, name, last_name, email, zip_code)
> VALUES
>   (8, 'John', 'Doe', 'johndoe@example.com', 12345),
>   (23, '', 'Smith', 'smithj@example.com', 23456),
>   (1, NULL, 'Taylor', 'taylorj@example.com', 34567),
>   (99, 'Jane', 'Adams', 'jadams@example.com', 45678),
>   (50, 'Alice', 'Brown', '', 56789),
>   (51, NULL, 'Lee', 'lee@example.com', 67890),
>   (234, 'Michael', '', 'michael@example.com', 78901),
>   (56, 'Sara', 'Jones', 'sjones@example.com', 89012),
>   (11, '', NULL, 'blanklast@example.com', 90123),
>   (12, 'Tom', 'Harris', NULL, 10234);
> ```

### Return the number of blank values by running the BLANK_COUNT data metric function

Execute the BLANK_COUNT data metric function to return the number of blank values:

```sqlexample
SELECT snowflake.core.blank_count (SELECT name FROM dq_tutorial_db.sch.employeesTable)
```

### Return rows by running the SYSTEM$DATA_METRIC_SCAN function

To return the table rows containing blank values in the `name` column, execute the SYSTEM$DATA_METRIC_SCAN function on the `name`
column.

> ```sqlexample
> SELECT *
>   FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
>     REF_ENTITY_NAME  => 'dq_tutorial_db.sch.employeesTable',
>     METRIC_NAME  => 'snowflake.core.blank_count',
>     ARGUMENT_NAME => 'name'
>    ));
> ```

### View the system metric scan results

The results show the rows of the `employeeTable` table that contain blank values.

```output
+-----+-------+--------------+-----------------------+-----------+------- --+
| ID  | NAME  | LAST_NAME    | EMAIL                 | CREATEDAT | ZIP_CODE |
|-----+-------+--------------+-----------------------+----------------------|
| 23  |       |   Smith      | smith@example.com     | null      | 23456    |
| 11  |       |   null       | blanklast@example.com | null      | 90123    |
+-----+-------+--------------+-----------------------+-----------+----------+
```

### Update records with a new value

To replace the blank values in the `name` column, run a query on the target table that includes the SYSTEM$DATA_METRIC_SCAN function.
It sets the blank values in the `name` column to NULL by running the UPDATE command on each of the rows returned by the system
function:

```sqlexample
UPDATE dq_tutorial_db.sch.employeesTable
  SET name = null
  WHERE dq_tutorial_db.sch.employeesTable.ID IN (
    select ID from table(system$data_metric_scan(
  REF_ENTITY_NAME => 'dq_tutorial_db.sch.employeesTable',
  METRIC_NAME => 'snowflake.core.blank_count',
  ARGUMENT_NAME => 'name'
  )));
```

After you update the values, running the following returns 0:

```sqlexample
SELECT snowflake.core.blank_count (SELECT name FROM dq_tutorial_db.sch.employeesTable)
```

In this section, you extracted records with data that failed the quality check. In the next section, you will learn how to view your
serverless credit consumption.

## View your serverless credit consumption

Calling scheduled data metric functions (DMFs) requires [serverless compute resources](../cost-understanding-compute.md). You
can query the Account Usage view
[DATA_QUALITY_MONITORING_USAGE_HISTORY](../../sql-reference/account-usage/data_quality_monitoring_usage_history.md) to view the
[DMF serverless compute cost](../data-quality-intro.md).

Because the view has a latency of 1-2 hours, wait for that time to pass before querying the view. You can come back to this step later.

Query the view and filter the results to include the time interval of your scheduled DMF:

> ```sqlexample
> USE ROLE dq_tutorial_role;
> SELECT *
> FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_QUALITY_MONITORING_USAGE_HISTORY
> WHERE TRUE
> AND START_TIME >= CURRENT_TIMESTAMP - INTERVAL '3 days'
> LIMIT 100;
> ```

## Clean up, summary, and additional resources

Congratulations! You’ve completed this tutorial.

Take a few minutes to review the summary and the key points covered in this tutorial.

Consider cleaning up by dropping the objects you created in this tutorial. Learn more by reviewing other topics in the Snowflake
documentation.

### Summary and key points

In summary, you learned how to do the following:

* Create a custom DMF to measure data quality and manage the DMF to optimize serverless credit usage.
* Monitor the serverless credit usage associated with calling the scheduled DMF.

### Drop the tutorial objects

If you plan to repeat the tutorial, you can keep the objects that you created.

Otherwise, drop the tutorial objects as follows:

```sqlexample
USE ROLE ACCOUNTADMIN;
DROP DATABASE dq_tutorial_db;
DROP WAREHOUSE dq_tutorial_wh;
DROP ROLE dq_tutorial_role;
```

### What’s next?

Continue learning about Snowflake using the following resources:

* Learn more about DMFs by starting with [Introduction to data quality checks](../data-quality-intro.md).
* Complete the other tutorials provided by Snowflake in the [Tutorials to get started with Snowflake](../../learn-tutorials.md) topic.

---
title: Tutorial: Improve Workload Performance with the Query Acceleration Service
source: https://docs.snowflake.com/en/user-guide/tutorials/query-acceleration-service.md
section: User Guide
---

Data Engineering

# Tutorial: Improve Workload Performance with the Query Acceleration Service

## Introduction

Snowflake offers a variety of performance enhancements to accelerate its various workloads. In this tutorial you will learn how to
leverage the Query Acceleration Service (QAS) to improve your overall workload performance.

### Prerequisites

* A Snowflake account that is Enterprise Edition (or higher)
* A role granted the following privileges:

  + The privileges required to execute the [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) and
    [ALTER WAREHOUSE](../../sql-reference/sql/alter-warehouse.md) commands.

    - [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md)
    - [MODIFY WAREHOUSE](../security-access-control-privileges.md)
  + The privileges required to query the Account Usage views in the tutorial:

    - [GOVERNANCE_VIEWER](../../sql-reference/snowflake-db-roles.md)
    - [USAGE_VIEWER](../../sql-reference/snowflake-db-roles.md)
  + The privilege required to execute the Information Schema table functions in the tutorial:

    - [MONITOR USAGE](../security-access-control-configure.md)
* Intermediate knowledge of SQL.
* [Snowsight](../ui-snowsight-gs.md) or [SnowSQL (CLI client)](../snowsql.md) for executing SQL commands.

### What You Will Learn

In this tutorial you will learn how to:

* Find a query in your query history that is eligible for acceleration.
* Execute the query in two separate warehouses to identify the effects of query acceleration.
* Compare query performance and cost with and without acceleration.
* Identify which of your warehouses would benefit most from the query acceleration service.
* Enable the service for an existing warehouse.

## Find an Eligible Query

Find an eligible query to accelerate. You can use the following example query to find a query that is eligible for acceleration.

This query identifies queries with a high eligible time ratio as identified by the ratio of eligible_query_acceleration_time
field and total query duration in the QUERY_ACCELERATION_ELIGIBLE view in the ACCOUNT_USAGE schema.

```sqlexample
SELECT query_id,
       query_text,
       start_time,
       end_time,
       warehouse_name,
       warehouse_size,
       eligible_query_acceleration_time,
       upper_limit_scale_factor,
       DATEDIFF(second, start_time, end_time) AS total_duration,
       eligible_query_acceleration_time / NULLIF(DATEDIFF(second, start_time, end_time), 0) AS eligible_time_ratio
FROM
    SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
WHERE
    start_time >= DATEADD(day, -30, CURRENT_TIMESTAMP())
    AND eligible_time_ratio <= 1.0
    AND total_duration BETWEEN 3 * 60 and 5 * 60
ORDER BY (eligible_time_ratio, upper_limit_scale_factor) DESC NULLS LAST
LIMIT 100;
```

1. From the results, select a query with the highest UPPER_LIMIT_SCALE_FACTOR value.
2. Copy the query text, warehouse size, and upper limit scale factor.

If the query above does not yield any results, you can still follow this tutorial by using the following example query. The example
dataset for this query is a snapshot of the [TPC-DS data](../sample-data-tpcds.md) in the Snowflake sample data that is shared
with you:

```sqlexample
SELECT d.d_year as "Year",
       i.i_brand_id as "Brand ID",
       i.i_brand as "Brand",
       SUM(ss_net_profit) as "Profit"
FROM   snowflake_sample_data.tpcds_sf10tcl.date_dim    d,
       snowflake_sample_data.tpcds_sf10tcl.store_sales s,
       snowflake_sample_data.tpcds_sf10tcl.item        i
WHERE  d.d_date_sk = s.ss_sold_date_sk
  AND s.ss_item_sk = i.i_item_sk
  AND i.i_manufact_id = 939
  AND d.d_moy = 12
GROUP BY d.d_year,
         i.i_brand,
         i.i_brand_id
ORDER BY 1, 4, 2
LIMIT 200;
```

1. If you use this example query, the WAREHOUSE_SIZE is ‘X-Small’ and UPPER_LIMIT_SCALE_FACTOR is 64.
2. Copy the query text, warehouse size, and upper limit scale factor.

## Create Two New Warehouses

This tutorial needs two warehouses to execute the query: one with the query acceleration service enabled and one without.
Executing the same query in new, separate warehouses will allow you to compare both performance and cost for the query acceleration
service in this tutorial.

To create the warehouses, connect to Snowflake and run the following command in Snowsight or using SnowSQL. Replace the
`warehouse_size` and `upper_limit_scale_factor` with the values selected in the previous step:

```sqlexample
CREATE WAREHOUSE noqas_wh WITH
  WAREHOUSE_SIZE='<warehouse_size>'
  ENABLE_QUERY_ACCELERATION = false
  INITIALLY_SUSPENDED = true
  AUTO_SUSPEND = 60;

CREATE WAREHOUSE qas_wh WITH
  WAREHOUSE_SIZE='<warehouse_size>'
  ENABLE_QUERY_ACCELERATION = true
  QUERY_ACCELERATION_MAX_SCALE_FACTOR = <upper_limit_scale_factor>
  INITIALLY_SUSPENDED = true
  AUTO_SUSPEND = 60;
```

## Query Without QAS

After setting up your environment and finding a query eligible for query acceleration, execute the query without enabling the query
acceleration service to see how it performs.

If you are using the example query provided rather than an eligible query from your query history, execute the following statement
first:

```sqlexample
USE SCHEMA snowflake_sample_data.tpcds_sf10tcl;
```

Select a warehouse and execute your query:

1. Use the warehouse that does not have QAS enabled.

   ```sqlexample
   USE WAREHOUSE noqas_wh;
   ```
2. Execute your test query (the query text from the previous step).
3. Get the query ID of the last executed query.

   If you are using Snowsight,
   you can copy and paste the query ID from the Query Profile panel in the Results panel. Alternatively, you can execute
   the following statement:

   ```sqlexample
   SELECT LAST_QUERY_ID();
   ```
4. Copy this query ID for additional future steps.

## Query With QAS

After executing the query in a warehouse without query acceleration, execute the same query in the QAS enabled warehouse.

1. Use the warehouse with QAS enabled to execute your query:

   ```sqlexample
   USE WAREHOUSE qas_wh;
   ```
2. Execute your test query (the query text from the previous step).
3. Get the query ID of the last executed query

   If you are using Snowsight, you can copy and paste the query ID from the Query Profile panel in the Results panel.
   Alternatively, you can execute the following statement:

   ```sqlexample
   SELECT LAST_QUERY_ID();
   ```
4. Copy this query ID for additional future steps.

## Compare Query Performance and Cost

In the previous steps, you executed the same query twice, once using a warehouse with QAS enabled and another without. Now, you can
compare the query performance of the query.

To do that, you can execute the Information Schema [QUERY_HISTORY](../../sql-reference/functions/query_history.md)
table function to compare the execution time for the queries using their query IDs:

```sqlexample
SELECT query_id,
       query_text,
       warehouse_name,
       total_elapsed_time
FROM TABLE(snowflake.information_schema.query_history())
WHERE query_id IN ('<non_accelerated_query_id>', '<accelerated_query_id>')
ORDER BY start_time;
```

Compare the TOTAL_ELAPSED_TIME for the same query executed with and without acceleration.

Next, compare the costs for each warehouse, you can execute the Information Schema [WAREHOUSE_METERING_HISTORY](../../sql-reference/functions/warehouse_metering_history.md)
table function for each warehouse:

> **Note:**
>
> If you skipped creating new warehouses for this tutorial and instead used pre-existing warehouses, the results of this table
> function are likely not going to be useful.

1. Execute the following query to view the costs for the `noqas_wh` warehouse:

   ```sqlexample
   SELECT start_time,
          end_time,
          warehouse_name,
          credits_used,
          credits_used_compute,
          credits_used_cloud_services,
          (credits_used + credits_used_compute + credits_used_cloud_services) AS credits_used_total
     FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.WAREHOUSE_METERING_HISTORY(
       DATE_RANGE_START => DATEADD('days', -1, CURRENT_DATE()),
       WAREHOUSE_NAME => 'NOQAS_WH'
     ));
   ```
2. For the QAS enabled warehouse, add the costs for the warehouse and the query acceleration service to calculate the total cost
   of QAS.

   * View the costs for the `qas_wh` warehouse:

     ```sqlexample
     SELECT start_time,
            end_time,
            warehouse_name,
            credits_used,
            credits_used_compute,
            credits_used_cloud_services,
            (credits_used + credits_used_compute + credits_used_cloud_services) AS credits_used_total
       FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.WAREHOUSE_METERING_HISTORY(
         DATE_RANGE_START => DATEADD('days', -1, CURRENT_DATE()),
         WAREHOUSE_NAME => 'QAS_WH'
       ));
     ```
   * View the costs for the query acceleration service with the Information Schema
     [QUERY_ACCELERATION_HISTORY](../../sql-reference/functions/query_acceleration_history.md) table function:

     > ```sqlexample
     >   SELECT start_time,
     >          end_time,
     >          warehouse_name,
     >          credits_used,
     >          num_files_scanned,
     >          num_bytes_scanned
     >     FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.QUERY_ACCELERATION_HISTORY(
     >       DATE_RANGE_START => DATEADD('days', -1, CURRENT_DATE()),
     >       WAREHOUSE_NAME => 'QAS_WH'
     > ));
     > ```

   Add the `credits_used_total` value from the first query with the `credits_used` value from the second query for
   the total cost of QAS.

So far, you have tested a query in two warehouses, one warehouse with QAS enabled and one without, and been able to compare
the performance and cost of QAS. Next, you will learn how to identify which of your warehouses will benefit the most from
QAS.

## Find Eligible Warehouses in Your Workloads

You can find the warehouses that would benefit the most for query acceleration by determining which warehouses have the largest
number of queries that are eligible for acceleration and/or the warehouses with the most query acceleration eligible time.

* Identify the warehouses with the most queries eligible for the query acceleration service in the last month,
  by counting the `query_id` values:

  ```sqlexample
  SELECT warehouse_name,
      COUNT(query_id) as num_eligible_queries,
      MAX(upper_limit_scale_factor)
    FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
    WHERE start_time > DATEADD(month, -1, CURRENT_TIMESTAMP())
    GROUP BY warehouse_name
    ORDER BY num_eligible_queries DESC;
  ```
* Identify the warehouses with the most eligible time for the query acceleration service in the last month,
  by summing the `eligible_query_acceleration_time` values:

  ```sqlexample
  SELECT warehouse_name,
      SUM(eligible_query_acceleration_time) AS total_eligible_time,
      MAX(upper_limit_scale_factor)
    FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
    WHERE start_time > DATEADD(month, -1, CURRENT_TIMESTAMP())
    GROUP BY warehouse_name
    ORDER BY total_eligible_time DESC;
  ```

Typically, the warehouses that benefit the most are the ones that either have the largest number of eligible queries,
the largest amount of eligible query acceleration time, or a combination of the two.
For example, if a warehouse is in the top of the results for both of the queries above,
that warehouse might be a good candidate for query acceleration.

## Enabling Query Acceleration

After you have decided which warehouses would benefit the most from the query acceleration service,
you can enable query acceleration by executing the following [ALTER WAREHOUSE](../../sql-reference/sql/alter-warehouse.md) statement:

```sqlexample
ALTER WAREHOUSE <warehouse_name> SET
  ENABLE_QUERY_ACCELERATION = TRUE;
```

Now that you have enabled QAS for your warehouses, you are ready to take advantage of query acceleration for eligible queries.

## Clean up and Additional Resources

To clean up, drop the warehouses created for this tutorial:

```sqlexample
DROP WAREHOUSE noqas_wh;

DROP WAREHOUSE qas_wh;
```

### What to Read Next

* For more information about the query acceleration service, see [Using the Query Acceleration Service (QAS)](../query-acceleration-service.md).
* For additional example queries for identifying eligible queries and warehouses, see
  [Identifying queries and warehouses that might benefit from query acceleration](../query-acceleration-service.md).
* For more information about the QAS scale factor:

  + For a description of the scale factor, see [QUERY_ACCELERATION_MAX_SCALE_FACTOR](../../sql-reference/sql/create-warehouse.md)
    in the [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) topic.
  + For more information about setting the scale factor, see [Adjusting the scale factor](../query-acceleration-service.md).
* To monitor QAS usage and costs after you start using the query acceleration service:

  + For more information about monitoring query acceleration service usage, see [Monitoring query acceleration service usage](../query-acceleration-service.md).
  + For more information about the costs of the service, see [Query acceleration service cost](../query-acceleration-service.md).
  + For example queries to help evaluate QAS performance and cost, see [Evaluating cost and performance](../query-acceleration-service.md).

---
title: Tutorial: JSON basics for Snowflake
source: https://docs.snowflake.com/en/user-guide/tutorials/json-basics-tutorial.md
section: User Guide
---

Getting Started

# Tutorial: JSON basics for Snowflake

## Introduction

In this tutorial you will learn the basics of using JSON with Snowflake.

### What you will learn

In this tutorial, you learn how to do the following:

* Upload sample JSON data from a public S3 bucket into a column of the `variant` type in a
  Snowflake table.
* Test simple queries for JSON data in the table.
* Explore the FLATTEN function to flatten JSON data into a relational representation and save it
  in another table.
* Explore ways to ensure uniqueness as you insert rows in the flattened version of the data.

## Prerequisites

The tutorial assumes the following:

* You have a Snowflake account that is configured to use Amazon AWS and a user with
  a role that grants the necessary privileges to create a database, tables, and
  virtual warehouse objects.
* You have [SnowSQL (CLI client)](../snowsql.md) installed.

The [Snowflake in 20 minutes](snowflake-in-20minutes.md) tutorial provides the related
step-by-step instructions to meet these requirements.

Snowflake provides sample data files in a public S3 bucket for use in this tutorial.
But before you start, you need to create a database, tables, a virtual warehouse,
and an external stage for this tutorial. These are the basic Snowflake objects
needed for most Snowflake activities.

### About the sample data file

For this tutorial, you use the following sample application events JSON data provided in a public S3 bucket.

```sqlexample
{
"device_type": "server",
"events": [
  {
    "f": 83,
    "rv": "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19",
    "t": 1437560931139,
    "v": {
      "ACHZ": 42869,
      "ACV": 709489,
      "DCA": 232,
      "DCV": 62287,
      "ENJR": 2599,
      "ERRS": 205,
      "MXEC": 487,
      "TMPI": 9
    },
    "vd": 54,
    "z": 1437644222811
  },
  {
    "f": 1000083,
    "rv": "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22",
    "t": 1437036965027,
    "v": {
      "ACHZ": 6953,
      "ACV": 346795,
      "DCA": 250,
      "DCV": 46066,
      "ENJR": 9033,
      "ERRS": 615,
      "MXEC": 0,
      "TMPI": 112
    },
    "vd": 626,
    "z": 1437660796958
  }
],
"version": 2.6
}
```

The data represents sample events that applications upload to S3. A variety of devices and applications, such as servers, cell phones, and browsers publish events. In a common data
collection scenario, a scalable web endpoint collects POSTed data from different sources and writes them to a queuing
system. An ingest service/utility then writes the data to a S3
bucket, from which you can load the data into Snowflake.

The sample data illustrates the following concepts:

* Applications can choose to group events in batches. A batch is a container
  that holds header information common to all of the events in the batch. For example, the preceding JSON is a batch of two
  events with common header information: `device_type` and `version` that generated these events.
* Amazon S3 supports using folders concept to organize a bucket. Applications can leverage this feature to partition event data.
  Partitioning schemes typically identify details, such as application or location that generated the event, along with
  an event date when it was written to S3. Such a partitioning scheme enables you to copy any fraction of the partitioned
  data to Snowflake with a single COPY command. For example, you can copy event data by the hour, data, month, or year
  when you initially populate tables.

  For example:

  > `s3://bucket_name/application_a/2016/07/01/11/`
  >
  > `s3://bucket_name/application_b/location_c/2016/07/01/14/`

  Note the `application_a`, `application_b`, `location_c`, etc. identify details for the source
  of all data in the path. The data can be organized by the date when it was written.
  An optional 24-hour directory reduces the amount of data in each directory.

  > **Note:**
  >
  > S3 transmits a directory list with each COPY statement used by Snowflake, so reducing
  > the number of files in each directory improves the performance of your COPY statements.
  > You may even consider creating 10-15 minute increment folders in each hour.

  The sample data provided in the S3 bucket uses a similar partitioning scheme. In a COPY command you
  will specify a specific folder path to copy events data.

### Creating the database, table, warehouse, and external stage

Execute the following statements to create a database, a table, a virtual warehouse,
and an external stage needed for this tutorial. After you complete the tutorial,
you can drop these objects.

> ```sqlexample
> CREATE OR REPLACE DATABASE mydatabase;
>
> USE SCHEMA mydatabase.public;
>
> CREATE OR REPLACE TABLE raw_source (
>   SRC VARIANT);
>
> CREATE OR REPLACE WAREHOUSE mywarehouse WITH
>   WAREHOUSE_SIZE='X-SMALL'
>   AUTO_SUSPEND = 120
>   AUTO_RESUME = TRUE
>   INITIALLY_SUSPENDED=TRUE;
>
> USE WAREHOUSE mywarehouse;
>
> CREATE OR REPLACE STAGE my_stage
>   URL = 's3://snowflake-docs/tutorials/json';
> ```

Note the following:

* The `CREATE DATABASE` statement creates a database. The database automatically
  includes a schema named ‘public’.
* The `USE SCHEMA` statement specifies an active database and schema for the current user session.
  Specifying a database now enables you to perform your work in this database without having
  to provide the name each time it is requested.
* The `CREATE TABLE` statement creates a target table for JSON data.
* The `CREATE WAREHOUSE` statement creates an initially suspended warehouse. The
  statement also sets AUTO_RESUME = true, which starts the warehouse automatically
  when you execute SQL statements that require compute resources.
  The `USE WAREHOUSE` statement specifies the warehouse you created as the active
  warehouse for the current user session.
* The `CREATE STAGE` statement creates an external stage that points to the S3 bucket
  containing the sample file for this tutorial.

## Copy data into the target table

Execute [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) to load your staged data into
the target `RAW_SOURCE` table.

```sqlexample
COPY INTO raw_source
  FROM @my_stage/server/2.6/2016/07/15/15
  FILE_FORMAT = (TYPE = JSON);
```

The command copies all new data from the specified path on the external stage
to the target `RAW_SOURCE` table. In this example, the specified path targets data
written on the 15th hour (3 PM) of July 15th, 2016.
Note that Snowflake checks each file’s S3 ETag value to ensure it is
copied only once.

Execute a SELECT query to verify the data is copied successfully.

```sqlexample
SELECT * FROM raw_source;
```

The query returns the following result:

```sqlexample
+-----------------------------------------------------------------------------------+
| SRC                                                                               |
|-----------------------------------------------------------------------------------|
| {                                                                                 |
|   "device_type": "server",                                                        |
|   "events": [                                                                     |
|     {                                                                             |
|       "f": 83,                                                                    |
|       "rv": "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19",   |
|       "t": 1437560931139,                                                         |
|       "v": {                                                                      |
|         "ACHZ": 42869,                                                            |
|         "ACV": 709489,                                                            |
|         "DCA": 232,                                                               |
|         "DCV": 62287,                                                             |
|         "ENJR": 2599,                                                             |
|         "ERRS": 205,                                                              |
|         "MXEC": 487,                                                              |
|         "TMPI": 9                                                                 |
|       },                                                                          |
|       "vd": 54,                                                                   |
|       "z": 1437644222811                                                          |
|     },                                                                            |
|     {                                                                             |
|       "f": 1000083,                                                               |
|       "rv": "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22", |
|       "t": 1437036965027,                                                         |
|       "v": {                                                                      |
|         "ACHZ": 6953,                                                             |
|         "ACV": 346795,                                                            |
|         "DCA": 250,                                                               |
|         "DCV": 46066,                                                             |
|         "ENJR": 9033,                                                             |
|         "ERRS": 615,                                                              |
|         "MXEC": 0,                                                                |
|         "TMPI": 112                                                               |
|       },                                                                          |
|       "vd": 626,                                                                  |
|       "z": 1437660796958                                                          |
|     }                                                                             |
|   ],                                                                              |
|   "version": 2.6                                                                  |
| }                                                                                 |
+-----------------------------------------------------------------------------------+
```

In this sample JSON data, there are two events. The `device_type`,
and `version` key values identify a data source and version for
events from a specific device.

## Query data

In this section, you explore SELECT statements to query the JSON data.

1. Retrieve `device_type`.

   ```sqlexample
   SELECT src:device_type
     FROM raw_source;
   ```

   The query return the following result:

   ```sqlexample
   +-----------------+
   | SRC:DEVICE_TYPE |
   |-----------------|
   | "server"        |
   +-----------------+
   ```

   The query uses the `src:device_type` notation
   to specify the column name and the JSON element name to retrieve. This
   notation is similar to the
   familiar SQL `table.column` notation.
   Snowflake allows you to specify a
   sub-column within a parent column, which Snowflake dynamically derives from the
   schema definition embedded in the JSON data. For more information,
   refer to [Querying Semi-structured Data](../querying-semistructured.md).

   > > **Note:**
   > >
   > > The column name is case-insensitive, however JSON element names
   > > are case-sensitive.
2. Retrieve the `device_type` value without the quotes.

   The preceding query returns the JSON data value in quote. You can remove
   the quotes by casting the data to a specific data type,
   in this example a string.

   This query also optionally assigns a name to the column using an alias.

   ```sqlexample
   SELECT src:device_type::string AS device_type
     FROM raw_source;
   ```

   The query returns the following result:

   ```sqlexample
   +-------------+
   | DEVICE_TYPE |
   |-------------|
   | server      |
   +-------------+
   ```
3. Retrieve repeating `f` keys nested within the array event objects.

   The sample JSON data includes `events` array. Each event object in the array
   has the `f` field as shown.

   ```sqlexample
   {
   "device_type": "server",
   "events": [
     {
       "f": 83,
       ..
     }
     {
       "f": 1000083,
       ..
     }
   ]}
   ```

   To retrieve these nested keys, you can use the [FLATTEN](../../sql-reference/functions/flatten.md)
   function. The function flattens the events into separate rows.

   ```sqlexample
   SELECT
     value:f::number
     FROM
       raw_source
     , LATERAL FLATTEN( INPUT => SRC:events );
   ```

   The query returns the following result:

   ```sqlexample
   +-----------------+
   | VALUE:F::NUMBER |
   |-----------------|
   |              83 |
   |         1000083 |
   +-----------------+
   ```

   Note the `value` is one of the columns that FLATTEN function returns.
   The next step provides more details about using the FLATTEN function.

## Flatten data

[FLATTEN](../../sql-reference/functions/flatten.md) is a table function that produces a lateral
view of a VARIANT, OBJECT, or ARRAY column. In this step, you use this function
to explore different levels of flattening.

### Flatten array objects in a variant column

You can flatten the event objects in the `events` array into separate rows
using the `FLATTEN` function. The function output includes a
VALUE column that stores these individual events.

You can then use the LATERAL modifier to join the `FLATTEN` function output
with any information outside of the object — in this example,
the `device_type` and `version`.

1. Query the data for each event:

   ```sqlexample
   SELECT src:device_type::string,
       src:version::String,
       VALUE
   FROM
       raw_source,
       LATERAL FLATTEN( INPUT => SRC:events );
   ```

   The query returns the following result:

   ```output
   +-------------------------+---------------------+-------------------------------------------------------------------------------+
   | SRC:DEVICE_TYPE::STRING | SRC:VERSION::STRING | VALUE                                                                         |
   |-------------------------+---------------------+-------------------------------------------------------------------------------|
   | server                  | 2.6                 | {                                                                             |
   |                         |                     |   "f": 83,                                                                    |
   |                         |                     |   "rv": "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19",   |
   |                         |                     |   "t": 1437560931139,                                                         |
   |                         |                     |   "v": {                                                                      |
   |                         |                     |     "ACHZ": 42869,                                                            |
   |                         |                     |     "ACV": 709489,                                                            |
   |                         |                     |     "DCA": 232,                                                               |
   |                         |                     |     "DCV": 62287,                                                             |
   |                         |                     |     "ENJR": 2599,                                                             |
   |                         |                     |     "ERRS": 205,                                                              |
   |                         |                     |     "MXEC": 487,                                                              |
   |                         |                     |     "TMPI": 9                                                                 |
   |                         |                     |   },                                                                          |
   |                         |                     |   "vd": 54,                                                                   |
   |                         |                     |   "z": 1437644222811                                                          |
   |                         |                     | }                                                                             |
   | server                  | 2.6                 | {                                                                             |
   |                         |                     |   "f": 1000083,                                                               |
   |                         |                     |   "rv": "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22", |
   |                         |                     |   "t": 1437036965027,                                                         |
   |                         |                     |   "v": {                                                                      |
   |                         |                     |     "ACHZ": 6953,                                                             |
   |                         |                     |     "ACV": 346795,                                                            |
   |                         |                     |     "DCA": 250,                                                               |
   |                         |                     |     "DCV": 46066,                                                             |
   |                         |                     |     "ENJR": 9033,                                                             |
   |                         |                     |     "ERRS": 615,                                                              |
   |                         |                     |     "MXEC": 0,                                                                |
   |                         |                     |     "TMPI": 112                                                               |
   |                         |                     |   },                                                                          |
   |                         |                     |   "vd": 626,                                                                  |
   |                         |                     |   "z": 1437660796958                                                          |
   |                         |                     | }                                                                             |
   +-------------------------+---------------------+-------------------------------------------------------------------------------+
   ```
2. Use a CREATE TABLE AS SELECT statement to store the preceding query result in a table:

   ```sqlexample
   CREATE OR REPLACE TABLE flattened_source AS
     SELECT
       src:device_type::string AS device_type,
       src:version::string     AS version,
       VALUE                   AS src
     FROM
       raw_source,
       LATERAL FLATTEN( INPUT => SRC:events );
   ```

   Query the resulting table.

   ```sqlexample
   SELECT * FROM flattened_source;
   ```

   The query returns the following result:

   ```output
   +-------------+---------+-------------------------------------------------------------------------------+
   | DEVICE_TYPE | VERSION | SRC                                                                           |
   |-------------+---------+-------------------------------------------------------------------------------|
   | server      | 2.6     | {                                                                             |
   |             |         |   "f": 83,                                                                    |
   |             |         |   "rv": "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19",   |
   |             |         |   "t": 1437560931139,                                                         |
   |             |         |   "v": {                                                                      |
   |             |         |     "ACHZ": 42869,                                                            |
   |             |         |     "ACV": 709489,                                                            |
   |             |         |     "DCA": 232,                                                               |
   |             |         |     "DCV": 62287,                                                             |
   |             |         |     "ENJR": 2599,                                                             |
   |             |         |     "ERRS": 205,                                                              |
   |             |         |     "MXEC": 487,                                                              |
   |             |         |     "TMPI": 9                                                                 |
   |             |         |   },                                                                          |
   |             |         |   "vd": 54,                                                                   |
   |             |         |   "z": 1437644222811                                                          |
   |             |         | }                                                                             |
   | server      | 2.6     | {                                                                             |
   |             |         |   "f": 1000083,                                                               |
   |             |         |   "rv": "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22", |
   |             |         |   "t": 1437036965027,                                                         |
   |             |         |   "v": {                                                                      |
   |             |         |     "ACHZ": 6953,                                                             |
   |             |         |     "ACV": 346795,                                                            |
   |             |         |     "DCA": 250,                                                               |
   |             |         |     "DCV": 46066,                                                             |
   |             |         |     "ENJR": 9033,                                                             |
   |             |         |     "ERRS": 615,                                                              |
   |             |         |     "MXEC": 0,                                                                |
   |             |         |     "TMPI": 112                                                               |
   |             |         |   },                                                                          |
   |             |         |   "vd": 626,                                                                  |
   |             |         |   "z": 1437660796958                                                          |
   |             |         | }                                                                             |
   +-------------+---------+-------------------------------------------------------------------------------+
   ```

### Flatten object keys in separate columns

In the preceding example, you flattened the event objects in the `events` array
into separate rows. The resulting `flattened_source` table retained the event structure
in the `src` column of the VARIANT type.

One benefit of retaining the
event objects in the `src` column of the VARIANT type is that when event format changes,
you don’t have to recreate and repopulate such tables. But you also have the option to
copy individual keys in the event object into separate typed columns as shown
in the following query.

The following CREATE TABLE AS SELECT statement creates a new table named `events` with the event
object keys stored in separate columns. Each value is cast to a data type that is appropriate
for the value, using a double-colon (::) followed by the type. If you omit the casting,
the column assumes the VARIANT data type, which can hold any value:

```sqlexample
create or replace table events as
  select
    src:device_type::string                             as device_type
  , src:version::string                                 as version
  , value:f::number                                     as f
  , value:rv::variant                                   as rv
  , value:t::number                                     as t
  , value:v.ACHZ::number                                as achz
  , value:v.ACV::number                                 as acv
  , value:v.DCA::number                                 as dca
  , value:v.DCV::number                                 as dcv
  , value:v.ENJR::number                                as enjr
  , value:v.ERRS::number                                as errs
  , value:v.MXEC::number                                as mxec
  , value:v.TMPI::number                                as tmpi
  , value:vd::number                                    as vd
  , value:z::number                                     as z
  from
    raw_source
  , lateral flatten ( input => SRC:events );
```

The statement flattens the nested data in the EVENTS.SRC:V key, adding a separate column for each value.
The statement outputs a row for each key/value pair. The following output shows the first two records in the new `events` table:

```sqlexample
SELECT * FROM events;

+-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------+
| DEVICE_TYPE | VERSION |       F | RV                                                                   |             T |  ACHZ |    ACV | DCA |   DCV | ENJR | ERRS | MXEC | TMPI |  VD |             Z |
|-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------|
| server      | 2.6     |      83 | "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19"   | 1437560931139 | 42869 | 709489 | 232 | 62287 | 2599 |  205 |  487 |    9 |  54 | 1437644222811 |
| server      | 2.6     | 1000083 | "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22" | 1437036965027 |  6953 | 346795 | 250 | 46066 | 9033 |  615 |    0 |  112 | 626 | 1437660796958 |
+-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------+
```

## Update data

So far in this tutorial, you did the following:

* Copied sample JSON event data from an S3 bucket into the `RAW_SOURCE` table
  and explored simple queries.
* You also explored the FLATTEN function to flatten the JSON data and obtain a relational
  representation of the data. For example, you extracted event keys and stored the keys
  in separate columns in another EVENTS table.

At the beginning, the tutorial explains the application scenario where multiple sources generate
events and a web endpoint saves it to your S3 bucket. As new events are added to the S3 bucket,
you might use a script to continuously copy new data into the `RAW_SOURCE` table.
But how do insert only new event data into the `EVENTS` table.

There are numerous ways to maintain data consistency. This section explains two options.

### Use primary key columns for comparison

In this section you add a primary key to the `EVENTS` table. The primary key then guarantees uniqueness.

1. Examine your JSON data for any values that are naturally unique and would be good
   candidates for a primary key. For example, assume that the combination of
   `src:device_type` and `value:rv` can be a primary key. These two JSON keys
   correspond to the `DEVICE_TYPE` and `RV` columns in the `EVENTS` table.

   > **Note:**
   >
   > Snowflake does not enforce the primary key constraint. Rather, the constraint
   > serves as metadata that identifies the natural key in the Information Schema.
2. Add the primary key constraint to the `EVENTS` table:

   > ```sqlexample
   > ALTER TABLE events ADD CONSTRAINT pk_DeviceType PRIMARY KEY (device_type, rv);
   > ```
3. Insert a new JSON event record into the `RAW_SOURCE` table:

   ```sqlsyntax
   insert into raw_source
     select
     PARSE_JSON ('{
       "device_type": "cell_phone",
       "events": [
         {
           "f": 79,
           "rv": "786954.67,492.68,3577.48,40.11,343.00,345.8,0.22,8765.22",
           "t": 5769784730576,
           "v": {
             "ACHZ": 75846,
             "ACV": 098355,
             "DCA": 789,
             "DCV": 62287,
             "ENJR": 2234,
             "ERRS": 578,
             "MXEC": 999,
             "TMPI": 9
           },
           "vd": 54,
           "z": 1437644222811
         }
       ],
       "version": 3.2
     }');
   ```
4. Insert the new record that you added to the `RAW_SOURCE` table
   into the `EVENTS` table based on a comparison of the primary key values:

   ```sqlsyntax
   insert into events
   select
         src:device_type::string
       , src:version::string
       , value:f::number
       , value:rv::variant
       , value:t::number
       , value:v.ACHZ::number
       , value:v.ACV::number
       , value:v.DCA::number
       , value:v.DCV::number
       , value:v.ENJR::number
       , value:v.ERRS::number
       , value:v.MXEC::number
       , value:v.TMPI::number
       , value:vd::number
       , value:z::number
       from
         raw_source
       , lateral flatten( input => src:events )
       where not exists
       (select 'x'
         from events
         where events.device_type = src:device_type
         and events.rv = value:rv);
   ```

   Querying the `EVENTS` table shows the added row:

   ```sqlexample
   select * from EVENTS;
   ```

   The query returns the following result:

   ```sqlexample
   +-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------+
   | DEVICE_TYPE | VERSION |       F | RV                                                                   |             T |  ACHZ |    ACV | DCA |   DCV | ENJR | ERRS | MXEC | TMPI |  VD |             Z |
   |-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------|
   | server      | 2.6     |      83 | "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19"   | 1437560931139 | 42869 | 709489 | 232 | 62287 | 2599 |  205 |  487 |    9 |  54 | 1437644222811 |
   | server      | 2.6     | 1000083 | "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22" | 1437036965027 |  6953 | 346795 | 250 | 46066 | 9033 |  615 |    0 |  112 | 626 | 1437660796958 |
   | cell_phone  | 3.2     |      79 | "786954.67,492.68,3577.48,40.11,343.00,345.8,0.22,8765.22"           | 5769784730576 | 75846 |  98355 | 789 | 62287 | 2234 |  578 |  999 |    9 |  54 | 1437644222811 |
   +-------------+---------+---------+----------------------------------------------------------------------+---------------+-------+--------+-----+-------+------+------+------+------+-----+---------------+
   ```

### Use all columns for comparison

If the JSON data does not have fields that can be primary key candidates, you
could compare all repeating JSON keys in the `RAW_SOURCE` table with the
corresponding column values in the `EVENTS` table.

No changes to your existing `EVENTS` table are required.

1. Insert a new JSON event record into the `RAW_SOURCE` table:

   ```sqlsyntax
   insert into raw_source
     select
     parse_json ('{
       "device_type": "web_browser",
       "events": [
         {
           "f": 79,
           "rv": "122375.99,744.89,386.99,12.45,78.08,43.7,9.22,8765.43",
           "t": 5769784730576,
           "v": {
             "ACHZ": 768436,
             "ACV": 9475,
             "DCA": 94835,
             "DCV": 88845,
             "ENJR": 8754,
             "ERRS": 567,
             "MXEC": 823,
             "TMPI": 0
           },
           "vd": 55,
           "z": 8745598047355
         }
       ],
       "version": 8.7
     }');
   ```
2. Insert the new record in the `RAW_SOURCE` table into the `EVENTS` table based on a comparison of all repeating key values:

   ```sqlsyntax
   insert into events
   select
         src:device_type::string
       , src:version::string
       , value:f::number
       , value:rv::variant
       , value:t::number
       , value:v.ACHZ::number
       , value:v.ACV::number
       , value:v.DCA::number
       , value:v.DCV::number
       , value:v.ENJR::number
       , value:v.ERRS::number
       , value:v.MXEC::number
       , value:v.TMPI::number
       , value:vd::number
       , value:z::number
       from
         raw_source
       , lateral flatten( input => src:events )
       where not exists
       (select 'x'
         from events
         where events.device_type = src:device_type
         and events.version = src:version
         and events.f = value:f
         and events.rv = value:rv
         and events.t = value:t
         and events.achz = value:v.ACHZ
         and events.acv = value:v.ACV
         and events.dca = value:v.DCA
         and events.dcv = value:v.DCV
         and events.enjr = value:v.ENJR
         and events.errs = value:v.ERRS
         and events.mxec = value:v.MXEC
         and events.tmpi = value:v.TMPI
         and events.vd = value:vd
         and events.z = value:z);
   ```

   Querying the `EVENTS` table shows the added row:

   ```sqlexample
   select * from EVENTS;
   ```

   The query returns the following result:

   ```sqlexample
   +-------------+---------+---------+----------------------------------------------------------------------+---------------+--------+--------+-------+-------+------+------+------+------+-----+---------------+
   | DEVICE_TYPE | VERSION |       F | RV                                                                   |             T |   ACHZ |    ACV |   DCA |   DCV | ENJR | ERRS | MXEC | TMPI |  VD |             Z |
   |-------------+---------+---------+----------------------------------------------------------------------+---------------+--------+--------+-------+-------+------+------+------+------+-----+---------------|
   | server      | 2.6     |      83 | "15219.64,783.63,48674.48,84679.52,27499.78,2178.83,0.42,74900.19"   | 1437560931139 |  42869 | 709489 |   232 | 62287 | 2599 |  205 |  487 |    9 |  54 | 1437644222811 |
   | server      | 2.6     | 1000083 | "8070.52,54470.71,85331.27,9.10,70825.85,65191.82,46564.53,29422.22" | 1437036965027 |   6953 | 346795 |   250 | 46066 | 9033 |  615 |    0 |  112 | 626 | 1437660796958 |
   | cell_phone  | 3.2     |      79 | "786954.67,492.68,3577.48,40.11,343.00,345.8,0.22,8765.22"           | 5769784730576 |  75846 |  98355 |   789 | 62287 | 2234 |  578 |  999 |    9 |  54 | 1437644222811 |
   | web_browser | 8.7     |      79 | "122375.99,744.89,386.99,12.45,78.08,43.7,9.22,8765.43"              | 5769784730576 | 768436 |   9475 | 94835 | 88845 | 8754 |  567 |  823 |    0 |  55 | 8745598047355 |
   +-------------+---------+---------+----------------------------------------------------------------------+---------------+--------+--------+-------+-------+------+------+------+------+-----+---------------+
   ```

## Congratulations

Congratulations, you have successfully completed the tutorial.

### Tutorial key points

* Partitioning the event data in your S3 bucket using logical, granular paths allows you to copy a subset of the partitioned data into Snowflake with a single command.
* Snowflake’s `column:key` notation, similar to the familiar SQL `table.column` notation,
  allows you to effectively query a column within the column (i.e., a sub-column), which is
  dynamically derived based on the schema definition embedded in the JSON data.
* The [FLATTEN](../../sql-reference/functions/flatten.md) function allows you to parse JSON data into separate columns.
* Several options are available to update table data based on comparisons with staged data files.

### Tutorial clean up (optional)

Execute the following [DROP <object>](../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

> ```sqlexample
> DROP DATABASE IF EXISTS mydatabase;
> DROP WAREHOUSE IF EXISTS mywarehouse;
> ```

Dropping the database automatically removes all child database objects such as tables.

---
title: Tutorial: Loading and unloading Parquet data
source: https://docs.snowflake.com/en/user-guide/tutorials/script-data-load-transform-parquet.md
section: User Guide
---

Getting started

# Tutorial: Loading and unloading Parquet data

## Introduction

This tutorial describes how you can upload Parquet data
by transforming elements of a staged Parquet file directly into table columns using
the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command. The tutorial also describes how you can use the
[COPY INTO <location>](../../sql-reference/sql/copy-into-location.md) command to unload table data into a Parquet file.

## Prerequisites

For this tutorial you need to:

* Download a Snowflake provided Parquet data file.
* Create a database, a table, and a virtual warehouse.

Database, table, and virtual warehouse are basic Snowflake objects required for most Snowflake activities.

### Downloading the sample data file

To download the sample Parquet data file, click [`cities.parquet`](../../_downloads/0c1e6c4f4140561029eeb20afdd02664/cities.parquet).
Alternatively, right-click the link and save the
link/file to your local file system.

The tutorial assumes you unpacked files in to the following directories:

> * Linux/macOS: `/tmp/load`
> * Windows: `C:\tempload`

The Parquet data file includes sample continent data. The following is a representative example:

```sqljson
{
  "continent": "Europe",
  "country": {
    "city": [
      "Paris",
      "Nice",
      "Marseilles",
      "Cannes"
    ],
    "name": "France"
  }
}
```

### Creating the database, table, and virtual warehouse

The following commands create objects specifically for use with this tutorial.
When you have completed the tutorial, you can drop these objects.

```sqlexample
 create or replace database mydatabase;

 use schema mydatabase.public;

  create or replace temporary table cities (
    continent varchar default null,
    country varchar default null,
    city variant default null
  );

create or replace warehouse mywarehouse with
  warehouse_size='X-SMALL'
  auto_suspend = 120
  auto_resume = true
  initially_suspended=true;

use warehouse mywarehouse;
```

Note these commands create a temporary table. Temporary tables persist only for
the duration of the user session and is not visible to other users.

## Create file format object

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command
to create the `sf_tut_parquet_format` file format.

```sqlexample
CREATE OR REPLACE FILE FORMAT sf_tut_parquet_format
  TYPE = parquet;
```

`TYPE = 'parquet'` indicates the source file format type. CSV is the default file format type.

## Create stage object

Execute the [CREATE STAGE](../../sql-reference/sql/create-stage.md) command to create the
internal `sf_tut_stage` stage.

```sqlexample
CREATE OR REPLACE TEMPORARY STAGE sf_tut_stage
FILE_FORMAT = sf_tut_parquet_format;
```

Similar to temporary tables, temporary stages are automatically dropped
at the end of the session.

## Stage the data file

Execute the [PUT](../../sql-reference/sql/put.md) command to upload the parquet file from your local file system to the
named stage.

* Linux or macOS

  > ```sqlexample
  > PUT file:///tmp/load/cities.parquet @sf_tut_stage;
  > ```
* Windows

  > ```sqlexample
  > PUT file://C:\temp\load\cities.parquet @sf_tut_stage;
  > ```

## Copy data into the target table

Copy the `cities.parquet` staged data file into the `CITIES` table.

```sqlexample
copy into cities
 from (select $1:continent::varchar,
              $1:country:name::varchar,
              $1:country:city::variant
      from @sf_tut_stage/cities.parquet);
```

Note the following:

* `$1` in the SELECT query refers to the single column where the Parquet
  data is stored.
* The query casts each of the Parquet element values it retrieves to specific column types.

Execute the following query to verify data is copied.

```sqlexample
SELECT * from cities;
```

The query returns the following result:

```sqlexample
+---------------+---------+-----------------+
| CONTINENT     | COUNTRY | CITY            |
|---------------+---------+-----------------|
| Europe        | France  | [               |
|               |         |   "Paris",      |
|               |         |   "Nice",       |
|               |         |   "Marseilles", |
|               |         |   "Cannes"      |
|               |         | ]               |
|---------------+---------+-----------------|
| Europe        | Greece  | [               |
|               |         |   "Athens",     |
|               |         |   "Piraeus",    |
|               |         |   "Hania",      |
|               |         |   "Heraklion",  |
|               |         |   "Rethymnon",  |
|               |         |   "Fira"        |
|               |         | ]               |
|---------------+---------+-----------------|
| North America | Canada  | [               |
|               |         |   "Toronto",    |
|               |         |   "Vancouver",  |
|               |         |   "St. John's", |
|               |         |   "Saint John", |
|               |         |   "Montreal",   |
|               |         |   "Halifax",    |
|               |         |   "Winnipeg",   |
|               |         |   "Calgary",    |
|               |         |   "Saskatoon",  |
|               |         |   "Ottawa",     |
|               |         |   "Yellowknife" |
|               |         | ]               |
+---------------+---------+-----------------+
```

## Unload the table

Unload the `CITIES` table into another Parquet file.

> **Note:**
>
> By default, Snowflake optimizes table columns in unloaded Parquet data files by
> setting the smallest precision that accepts all of the values. If you prefer
> consistent output file schema determined by the “logical” column data types (i.e.
> the types in the unload SQL query or source table), set the
> [ENABLE_UNLOAD_PHYSICAL_TYPE_OPTIMIZATION](../../sql-reference/parameters.md)
> session parameter to FALSE.

```sqlexample
copy into @sf_tut_stage/out/parquet_
from (select continent,
             country,
             c.value::string as city
     from cities,
          lateral flatten(input => city) c)
  file_format = (type = 'parquet')
  header = true;
```

Note the following:

* The `file_format = (type = 'parquet')` specifies parquet as the format of the data file on the stage. When the Parquet file type is specified, the `COPY INTO <location>` command unloads data to a single column by default.
* The `header=true` option directs the command to retain the column names in the output file.
* In the nested SELECT query:

  + The [FLATTEN](../../sql-reference/functions/flatten.md) function first flattens the `city` column array elements into separate columns.
  + The LATERAL modifier joins the output of the FLATTEN function with information
    outside of the object - in this example, the `continent` and `country`.

Execute the following query to verify data is copied into staged Parquet file.

```sqlexample
select t.$1 from @sf_tut_stage/out/ t;
```

The query returns the following results (only partial result is shown):

```sqlexample
+---------------------------------+
| $1                              |
|---------------------------------|
| {                               |
|   "CITY": "Paris",              |
|   "CONTINENT": "Europe",        |
|   "COUNTRY": "France"           |
| }                               |
|---------------------------------|
| {                               |
|   "CITY": "Nice",               |
|   "CONTINENT": "Europe",        |
|   "COUNTRY": "France"           |
| }                               |
|---------------------------------|
| {                               |
|   "CITY": "Marseilles",         |
|   "CONTINENT": "Europe",        |
|   "COUNTRY": "France"           |
| }                               |
+---------------------------------+
```

## Remove the successfully copied data files

After you verify that you successfully copied data from your stage into the tables,
you can remove data files from the internal stage using the [REMOVE](../../sql-reference/sql/remove.md)
command to save on [data storage](../cost-understanding-compute.md).

```sqlexample
REMOVE @sf_tut_stage/cities.parquet;
```

## Clean up

Execute the following [DROP <object>](../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

```sqlexample
DROP DATABASE IF EXISTS mydatabase;
DROP WAREHOUSE IF EXISTS mywarehouse;
```

Dropping the database automatically removes all child database objects such as tables.

---
title: Tutorial: Loading JSON data into a relational table
source: https://docs.snowflake.com/en/user-guide/tutorials/script-data-load-transform-json.md
section: User Guide
---

Getting Started

# Tutorial: Loading JSON data into a relational table

## Introduction

When uploading JSON data into a table, you have these options:

* Store JSON objects natively in a VARIANT type column (as shown in [Tutorial: Bulk loading from a local file system using COPY](data-load-internal-tutorial.md)).
* Store JSON object natively in an intermediate table and then use FLATTEN function to extract JSON elements into separate columns in a table (as shown in [Tutorial: JSON basics for Snowflake](json-basics-tutorial.md))
* Transform JSON elements directly into table columns as shown in this tutorial.

The COPY command in this tutorial uses a SELECT statement to query for individual elements in a staged JSON file.

The example commands provided in this tutorial includes a [PUT](../../sql-reference/sql/put.md) statement.
We recommend executing these commands in SnowSQL which supports the PUT command.
Clients such as [Snowsight](../ui-snowsight-gs.md) do not support the PUT command.

## Prerequisites

For this tutorial you need to:

* Download a Snowflake provided JSON data file.
* Create a database, a table, and a virtual warehouse for this tutorial.

Database, table, and virtual warehouse are basic Snowflake objects required for
most Snowflake activities.

### Data file for loading

To download the sample JSON data file, click [`sales.json`](../../_downloads/b50c24de20be843b34f2535dfe67fd5e/sales.json).
If clicking the link does not download the file, right-click the link and save the
link/file to your local file system.

The tutorial assumes you unpacked the JSON data file in to the following directories:

> * Linux/macOS: `/tmp/load`
> * Windows: `C:\tempload`

The data file include sample home sales JSON data. An example JSON object is shown:

```sqljson
{
   "location": {
      "state_city": "MA-Lexington",
      "zip": "40503"
   },
   "sale_date": "2017-3-5",
   "price": "275836"
}
```

### Creating the database, table, and virtual warehouse

The following commands create objects specifically for use with this tutorial.
When you have completed the tutorial, you can drop the objects.

```sqlexample
 create or replace database mydatabase;

 use schema mydatabase.public;

CREATE OR REPLACE TEMPORARY TABLE home_sales (
  city STRING,
  zip STRING,
  state STRING,
  type STRING DEFAULT 'Residential',
  sale_date timestamp_ntz,
  price STRING
  );

create or replace warehouse mywarehouse with
  warehouse_size='X-SMALL'
  auto_suspend = 120
  auto_resume = true
  initially_suspended=true;

use warehouse mywarehouse;
```

Note these commands creates temporary table. Temporary tables persist only for
the duration of the user session and is not visible to other users.

## Create file format object

Execute the [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md) command
to create the `sf_tut_json_format` file format.

```sqlexample
CREATE OR REPLACE FILE FORMAT sf_tut_json_format
  TYPE = JSON;
```

`TYPE = 'JSON'` indicates the source file format type. CSV is the default file format type.

## Create stage object

Execute [CREATE STAGE](../../sql-reference/sql/create-stage.md) to create the
internal `sf_tut_stage` stage.

> ```sqlexample
> CREATE OR REPLACE TEMPORARY STAGE sf_tut_stage
>  FILE_FORMAT = sf_tut_json_format;
> ```

Similar to temporary tables, temporary stages are automatically dropped
at the end of the session.

## Stage the data file

Execute the [PUT](../../sql-reference/sql/put.md) command to upload the JSON file from your local file system to the
named stage.

* Linux or macOS

  > ```sqlexample
  > PUT file:///tmp/load/sales.json @sf_tut_stage AUTO_COMPRESS=TRUE;
  > ```
* Windows

  > ```sqlexample
  > PUT file://C:\temp\load\sales.json @sf_tut_stage AUTO_COMPRESS=TRUE;
  > ```

## Copy data into the target table

Load the `sales.json.gz` staged data file into the `home_sales` table.

```sqlexample
COPY INTO home_sales(city, state, zip, sale_date, price)
   FROM (SELECT SUBSTR($1:location.state_city,4),
                SUBSTR($1:location.state_city,1,2),
                $1:location.zip,
                to_timestamp_ntz($1:sale_date),
                $1:price
         FROM @sf_tut_stage/sales.json.gz t)
   ON_ERROR = 'continue';
```

Note the $1 in the SELECT query refers to the single column where the JSON is stored.
The query also uses the following functions:

* The [SUBSTR , SUBSTRING](../../sql-reference/functions/substr.md) function to extract city and state values from state_city JSON key.
* The [TO_TIMESTAMP / TO_TIMESTAMP_\*](../../sql-reference/functions/to_timestamp.md) to cast the sale_date JSON key value to a timestamp.

Execute the following query to verify data is copied.

```sqlexample
SELECT * from home_sales;
```

## Remove the successfully copied data files

After you verify that you successfully copied data from your stage into the tables,
you can remove data files from the internal stage using the [REMOVE](../../sql-reference/sql/remove.md)
command to save on [data storage](../cost-understanding-compute.md).

> ```sqlexample
> REMOVE @sf_tut_stage/sales.json.gz;
> ```

## Clean up

Execute the following [DROP <object>](../../sql-reference/sql/drop.md) commands to return your system to its state before you began the tutorial:

> ```sqlexample
> DROP DATABASE IF EXISTS mydatabase;
> DROP WAREHOUSE IF EXISTS mywarehouse;
> ```

Dropping the database automatically removes all child database objects such as tables.

---
title: Tutorial: Optimize dynamic table performance for SCD Type 1 workloads
source: https://docs.snowflake.com/en/user-guide/tutorials/optimize-dynamic-table-performance.md
section: User Guide
---

Snowflake

Dynamic Tables

Performance

# Tutorial: Optimize dynamic table performance for SCD Type 1 workloads

## Introduction

This tutorial shows you how to identify and resolve performance bottlenecks in a [dynamic table](../dynamic-tables-about.md)
pipeline for slowly changing dimension (SCD) Type 1 workloads. Dynamic tables automatically materialize query
results and handle scheduling and orchestration for your data pipelines. Optimizing dynamic table performance
helps you maintain data freshness and control costs.

### About SCD Type 1 tables

Slowly changing dimension (SCD) tables store data that changes occasionally and unpredictably over time.
Common examples include tables that track changes to customer addresses or product prices.

This tutorial implements an SCD Type 1 table, often called an “SCD-1 live” table.
This type overwrites old data with new data and doesn’t keep a history of past values.
SCD Type 1 tables are useful when you only care about the latest state of each record, such as a customer’s current phone
number or a product’s current category.

In real-world data pipelines, you typically build a Type 1 SCD table by consuming a changelog table.

### What you’ll learn

In this tutorial, you’ll learn how to complete the following tasks:

* Create a sample source table with product change data.
* Build two SCD Type 1 dynamic tables: one with a suboptimal SQL pattern and one with an optimized pattern with
  the [QUALIFY](../../sql-reference/constructs/qualify.md) clause.
* Understand how the `QUALIFY` clause enables efficient incremental processing and significantly reduces refresh time.
* Monitor key performance metrics like refresh duration and partition scans to identify optimization opportunities.
* Compare the incremental refresh performance of both dynamic tables on the same data.

### Prerequisites

You need access to a Snowflake environment with the following resources:

* A [warehouse](../warehouses-overview.md) for compute resources. We recommend using an x-small warehouse.
* The privileges required to create databases, schemas, and dynamic tables.
  For more information, see [Access control privileges](../security-access-control-privileges.md).

If you don’t have a user with the necessary permissions, ask someone who does to create one for you.
Users with the ACCOUNTADMIN role can create new users and grant them the required privileges.

> **Note:**
>
> For the best experience, complete this tutorial in Snowsight so that you can quickly view the query history
> and monitor your dynamic table performance.

## Step 1: Create the source data

Start by setting up a source table with sample data that simulates streaming product changes.

Create a database and schema for the tutorial, then create a source table:

```sqlexample
CREATE DATABASE IF NOT EXISTS dt_perf_demo_db;
CREATE SCHEMA IF NOT EXISTS dt_perf_demo_db.tutorial;

USE SCHEMA dt_perf_demo_db.tutorial;

CREATE OR REPLACE TABLE product_changes (
    product_code VARCHAR(50),
    product_name VARCHAR(200),
    price NUMBER(10, 2),
    price_start_date TIMESTAMP_NTZ(9)
);
```

Next, insert sample data into the `product_changes` source table. The following command
generates 100 million rows of sample product data by repeating 10,000 unique product codes and names.
It assigns each product a price that changes slightly with each row, and sets a timestamp that increases by a few minutes
for each new entry.

```sqlexample
INSERT INTO product_changes (product_code, product_name, price, price_start_date)
  SELECT
      'PC-' || LPAD(TO_VARCHAR(MOD(SEQ4(), 10000) + 1), 3, '0') AS product_code,
      'Product ' || LPAD(TO_VARCHAR(MOD(SEQ4(), 10000) + 1), 3, '0') AS product_name,
      ROUND(10.00 + (MOD(SEQ4(), 10000) * 5) + (SEQ4() * 0.01), 2) AS price,
      DATEADD(MINUTE, SEQ4() * 5, '2025-01-01 00:00:00') AS PRICE_START_DATE
  FROM
      TABLE(GENERATOR(ROWCOUNT => 100000000));
```

## Step 2: Create dynamic tables for comparison

In this step, you create two SCD Type 1 dynamic tables that consume from the source table. The first dynamic table
uses a suboptimal SQL pattern to find the most recent price change for every product, while the second uses an
optimized pattern. Creating both tables simultaneously lets you directly compare their refresh performance
on the same data.

### Create a suboptimal dynamic table

Create a dynamic table by using an INNER JOIN with a subquery that gets the latest timestamp for each product code.
This is a common but inefficient pattern that triggers costly re-computation on every update.

> **Note:**
>
> Replace `my_warehouse` with the name of your warehouse.

```sqlexample
CREATE DYNAMIC TABLE product_current_price_v1
    TARGET_LAG = DOWNSTREAM
    WAREHOUSE = <my_warehouse>
    INITIALIZE = ON_SCHEDULE
    REFRESH_MODE = INCREMENTAL
  AS
  SELECT
      h.product_code,
      h.product_name,
      h.price,
      h.price_start_date
  FROM product_changes h
  INNER JOIN (
      SELECT product_code, MAX(price_start_date) max_price_start_date
      FROM product_changes
      GROUP BY product_code
  ) m ON h.price_start_date = m.max_price_start_date AND h.product_code = m.product_code;
```

Key details about this dynamic table configuration:

* This dynamic table uses `TARGET_LAG = DOWNSTREAM`, which means it refreshes only when downstream
  tables or queries need fresh data. This setting works well for intermediate tables in a pipeline.
* The `REFRESH_MODE = INCREMENTAL` setting tells Snowflake to process only changed data instead of
  recomputing the entire table.

### Create an optimized dynamic table

Now create a second dynamic table named `product_current_price_v2` with an optimized SQL pattern.
This table uses the `QUALIFY` clause to efficiently filter to the latest price for each product:

```sqlexample
CREATE DYNAMIC TABLE product_current_price_v2
    TARGET_LAG = DOWNSTREAM
    WAREHOUSE = <my_warehouse>
    REFRESH_MODE = INCREMENTAL
    INITIALIZE = ON_SCHEDULE
  AS
  SELECT
      product_code,
      product_name,
      price,
      price_start_date
  FROM product_changes
  QUALIFY RANK() OVER (PARTITION BY product_code ORDER BY price_start_date DESC) = 1;
```

Using the `QUALIFY` clause with a ranking window function like `RANK()` lets Snowflake efficiently detect which
product partitions changed. Instead of rescanning all historical data, the engine finds affected partitions
and recalculates rankings only for those specific products. This results in more efficient incremental refreshes.

This optimization works because of the following factors:

* Ranking functions like `RANK`, `ROW_NUMBER`, or `DENSE_RANK` used with `PARTITION BY` let the engine isolate changes by product.
* Filtering to `RANK() ... = 1` keeps only the latest record for each product, which is what SCD Type 1 tables require.
* Placing the `QUALIFY RANK() ... = 1` clause at the top level of the dynamic table query, not within a subquery,
  ensures that the optimization applies.
* Persisting the `product_code` and `price_start_date` keys as columns in the dynamic table lets the engine track partition changes between
  refreshes and avoids full table scans.

This pattern also demonstrates good *data locality*, which describes how closely Snowflake stores rows
with matching keys together. The pattern isolates changes to specific partition keys, which avoids
full table scans.

### Refresh both dynamic tables

To fill in the initial data for both tables, manually refresh them. This establishes a baseline for comparing
their incremental refresh performance in the next step:

```sqlexample
ALTER DYNAMIC TABLE product_current_price_v1 REFRESH;

ALTER DYNAMIC TABLE product_current_price_v2 REFRESH;
```

## Step 3: Compare incremental refresh performance

Now compare how each table handles incremental refreshes.

### Add new data to the source table

This step simulates new data arriving in the source table,
as would happen in a real-world streaming scenario. Insert 1,000 new rows into the `product_changes` source table
that update the price for five of the existing products:

```sqlexample
INSERT INTO product_changes (product_code, product_name, price, price_start_date)
  SELECT
      'PC-' || LPAD(TO_VARCHAR(MOD(SEQ4(), 5) + 1), 3, '0') AS product_code,
      'Product ' || LPAD(TO_VARCHAR(MOD(SEQ4(), 5) + 1), 3, '0') AS product_name,
      ROUND(50.00 + (MOD(SEQ4(), 10) * 5) + ((SEQ4() + 100000000) * 0.01), 2) AS price,
      DATEADD(MINUTE, (SEQ4() + 100000000) * 5, '2025-01-01 00:00:00') AS price_start_date
  FROM
      TABLE(GENERATOR(ROWCOUNT => 1000));
```

### Monitor refresh performance

Dynamic table performance depends on several factors: how you write queries, how you organize data,
and the resources you allocate. The key metrics to monitor are refresh duration, partition scans,
and bytes spilled. In this step, you’ll compare these metrics between the two dynamic table implementations.

To pick up the changes, start by refreshing the suboptimal dynamic table:

```sqlexample
ALTER DYNAMIC TABLE product_current_price_v1 REFRESH;
```

Check the execution time and scan metrics:

1. Navigate to Transformation » Dynamic Tables.
2. Filter the list by selecting the `dt_perf_demo_db` database, then select `product_current_price_v1`.
3. Select the Refresh History tab and notice the REFRESH DURATION value for the most recent refresh.
4. Select Show query profile for the latest refresh entry.
5. Find the Statistics section and notice the Partitions scanned value.

   The `product_current_price_v1` table is inefficient because the subquery recalculates the maximum timestamp for all 10,000 products,
   even though only five products received new price changes. This forces the dynamic table engine to scan many more partitions than necessary,
   driving up both time and cost as the source table grows. This pattern demonstrates poor data locality
   because changes don’t align well with how the data is organized for incremental processing.
6. Now refresh the optimized `product_current_price_v2` dynamic table:

   ```sqlexample
   ALTER DYNAMIC TABLE product_current_price_v2 REFRESH;
   ```
7. Repeat the previous steps to check the Refresh History for the optimized table:

   Compare the two refresh operations. The optimized `product_current_price_v2` dynamic table should complete significantly faster
   than the suboptimal `product_current_price_v1` dynamic table. In the example results, the suboptimal table took 2.8 seconds
   while the optimized table took only 804 milliseconds.

   Open the Query Profile and compare the Statistics section:

   The `product_current_price_v2` uses the `QUALIFY` clause with a ranking window function, which lets the engine
   efficiently identify and process only the five products that changed, resulting in a much faster incremental refresh.
   This query pattern has good data locality because Snowflake can isolate which partition keys (product codes) contain changes.

> **Tip:**
>
> Even at the small scale used in this tutorial, this optimization leads to noticeable performance improvements.
> In production, with millions of products and billions of records,
> this optimization can cut refresh times from hours to seconds.
> Performance depends on the percentage of changed products, so efficiency remains high as your data grows.
>
> Faster refreshes translate directly to fresher data. If you need data fresh within minutes,
> optimizing query patterns like this helps you meet aggressive target lag requirements without
> oversizing warehouses.

## Clean up

To delete all objects created for this tutorial, run the following DROP statement:

```sqlexample
DROP DATABASE dt_perf_demo_db;
```

## Summary and additional resources

In this tutorial, you optimized a [dynamic table](../dynamic-tables-about.md) pipeline by replacing
a suboptimal subquery pattern with the highly efficient `QUALIFY RANK() = 1` pattern for an SCD Type 1 table.
This lets the dynamic table engine apply performance optimizations for
[incremental refresh](../dynamic-tables-refresh.md) and leads to faster and cheaper pipeline runs.
Faster refreshes mean you can maintain data freshness with tighter
[target lag](../dynamic-tables-target-lag.md) requirements without increasing cost.

Along the way, you completed the following tasks:

* **Created a source table** with sample product data simulating a changelog.
* **Created a suboptimal SCD Type 1 dynamic table** that demonstrated the common pitfall of using
  a nested query with `MAX()` to find the latest records.
* **Applied the QUALIFY optimization** to significantly improve dynamic table refresh performance with
  efficient [incremental processing](../dynamic-tables-refresh.md). This pattern improves
  [data locality](../dynamic-tables-performance-optimize.md) by letting the engine isolate changes to
  specific partition keys.
* **Monitored refresh performance** by comparing partition scans and execution times between different
  implementations using the [query profile](../dynamic-tables-performance-monitor.md). These metrics
  help you identify whether your queries work efficiently with incremental refresh.

**Key performance concepts demonstrated:**

* **Incremental refresh efficiency**: The [optimized query](../dynamic-tables-performance-optimize-query.md)
  processes only changed data, while the suboptimal query rescans the entire dataset.
* **Data locality**: When changes align with partition keys (product codes),
  [incremental refresh](../dynamic-tables-refresh.md) performs well. When changes scatter across
  many keys or require full rescans, performance suffers. See [Improve data locality](../dynamic-tables-performance-optimize.md) for more details.
* **Target lag and freshness**: Optimizing query patterns lets you meet tighter
  [data freshness requirements](../dynamic-tables-target-lag.md) without oversizing
  [warehouses](../dynamic-tables-warehouses.md).

For more information about dynamic tables and optimization techniques, explore the following resources:

**Query and pipeline optimization:**

* **Query optimization for incremental refresh**: Learn which operators perform well with incremental
  refresh and how to restructure queries for better performance. See
  [Optimize queries for incremental refresh](../dynamic-tables-performance-optimize-query.md).
* **Data locality**: Understand how data organization affects incremental refresh performance and
  how to cluster source tables. See [Improve data locality](../dynamic-tables-performance-optimize.md).
* **Immutability constraints**: To avoid reprocessing unchanged historical data, use the
  [IMMUTABLE WHERE](../dynamic-tables-performance-optimize-immutability.md)
  option. This can greatly reduce refresh costs and time.

**Infrastructure and monitoring:**

* **Target lag**: Learn how to balance data freshness requirements with compute costs by choosing
  appropriate target lag settings. See [Understanding dynamic table target lag](../dynamic-tables-target-lag.md).
* **Warehouse sizing**: Learn how warehouse size affects refresh performance and cost. See
  [Adjust your warehouse configuration](../dynamic-tables-performance-optimize.md).
* **Performance monitoring**: Track key metrics like refresh duration, partition scans, and warehouse
  utilization to identify optimization opportunities. See [Monitor dynamic table performance](../dynamic-tables-performance-monitor.md).
* **Refresh modes**: Understand when to use incremental vs. full refresh mode and how Snowflake chooses
  between them. See [Understanding dynamic table initialization and refresh](../dynamic-tables-refresh.md).
* **Dynamic Iceberg tables**: Use dynamic tables with Apache Iceberg™ tables to build interoperable data pipelines
  for your data lake. See [Create dynamic Apache Iceberg™ tables](../dynamic-tables-create-iceberg.md).

---
title: Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog
source: https://docs.snowflake.com/en/user-guide/tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md
section: User Guide
---

Snowflake

Iceberg

Data lake

Databricks

# Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog

## Introduction

This tutorial covers how to connect Snowflake to a catalog in Databricks Unity Catalog by using a writable
catalog-linked database with catalog-vended credentials. This setup enables bidirectional data collaboration between Snowflake and
Databricks.

A [catalog-linked database](../tables-iceberg-catalog-linked-database.md) is a Snowflake database connected to an external
Iceberg REST catalog, such as a catalog in Unity Catalog. Snowflake automatically syncs with the external catalog to detect namespaces
and Iceberg tables, and registers the remote tables to the catalog-linked database. When a catalog-linked database is writable, it also
supports creating and dropping schemas or Iceberg tables.

[Catalog-vended credentials for Iceberg tables](../tables-iceberg-configure-catalog-integration-vended-credentials.md) let you
give Snowflake access to your table data and metadata in cloud storage without using an external volume. When you connect Snowflake to a
catalog in Unity Catalog by using catalog-vended credentials, Unity
Catalog provides temporary credentials to Snowflake for accessing your table data in cloud storage.

With this setup, you can perform the following tasks:

* Use Snowflake to query Iceberg tables that are managed by Unity Catalog.
* Use Snowflake to insert data into Iceberg tables that are managed by Unity Catalog.
* Use Snowflake to create Iceberg tables that are managed by Unity Catalog.
* Use Databricks to work with Unity Catalog-managed Iceberg tables that you created or modified from Snowflake.

To complete the steps in this tutorial for working with Snowflake, use a worksheet in Snowsight or use a Snowflake client such
as [SnowSQL](../snowsql.md).
You can copy and paste the code examples, and then run them. To complete the steps in this tutorial for working with Databricks,
use your Databricks workspace to copy and paste the code examples or follow the instructions in the linked Databricks documentation.

### What you’ll learn

In this tutorial, you’ll learn how to do the following:

* Create a catalog in Unity Catalog.
* Configure authentication credentials for Snowflake to use by adding a service principal and OAuth secret in Databricks.
* Use Databricks to enable Snowflake access to your catalog in Unity Catalog.
* Create a catalog integration in Snowflake that uses vended credentials to connect Snowflake to your catalog in Unity Catalog.
* Create a writable catalog-linked database in Snowflake that syncs with your catalog in Unity Catalog and allows you to write
  to your catalog in Unity Catalog from Snowflake.
* Work with Unity Catalog-managed Iceberg tables from Snowflake, which includes querying and inserting data into these tables and
  creating a Unity Catalog-managed Iceberg table from Snowflake.
* Work with Unity Catalog-managed Iceberg tables from Databricks.

### Prerequisites

Before you start, you should be familiar with the following concepts:

* Snowflake [object identifiers](../../sql-reference/identifiers.md) and their requirements.
* Apache Iceberg and Iceberg tables in Snowflake. For more information, see [Apache Iceberg™ tables](../tables-iceberg.md).
* Databricks Unity Catalog. For more information, see
  [What is Unity Catalog?](https://docs.databricks.com/aws/data-governance/unity-catalog)
  in the Databricks documentation.

You need:

**Databricks**

* A Databricks account hosted on AWS, Azure, or Google Cloud.
* A Databricks workspace with Unity Catalog enabled.

  For instructions on how to enable a workspace for Unity Catalog, see the topic for where your Databricks account is hosted:

  + **Databricks on AWS**: [Databricks on AWS: Enable a workspace for Unity Catalog](https://docs.databricks.com/aws/en/data-governance/unity-catalog/enable-workspaces)
  + **Azure Databricks**: [Azure Databricks: Enable a workspace for Unity Catalog](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/enable-workspaces)
  + **Databricks on Google Cloud**: [Databricks on Google Cloud: Enable a workspace for Unity Catalog](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/enable-workspaces)
* Required access:

  + Metastore admin privilege or the CREATE CATALOG privilege on the metastore to create a catalog in Unity Catalog.

    > **Note:**
    >
    > In this tutorial, you’ll create a catalog in Unity Catalog, which makes you an owner of the catalog. As a catalog owner, you can
    > grant a Databricks service principal privileges to your catalog, which you’ll do in this tutorial.
  + Account admin or workspace admin privilege to your Databricks workspace to create a service principal and OAuth secret
  + Metastore admin privilege to enable external data access on the metastore

**Snowflake**

* A Snowflake user with a role that has the privileges to perform the following actions:

  + [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../../sql-reference/sql/create-catalog-integration-rest.md)
  + [CREATE DATABASE (catalog-linked)](../../sql-reference/sql/create-database-catalog-linked.md)
  + [CREATE ICEBERG TABLE](../../sql-reference/sql/create-iceberg-table-rest.md)
  + [CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md)

  If using a 30-day trial account, you can log in as the user that was created for the account.
  This user has the role with the privileges needed to create the objects.

  If you don’t have a user with the necessary permissions, ask someone who does to create one for you.
  Users with the ACCOUNTADMIN role can create new users and grant them the required privileges.

## Step 1: Create a catalog in Databricks Unity Catalog

In Databricks, run the following statements to create an `example_sales_catalog` catalog in Unity Catalog with the following objects:

* A `customers` schema
* A `customer_accounts` table with some sample data, which is nested under the `customers` schema

```sqlexample
CREATE CATALOG example_sales_catalog;

CREATE SCHEMA example_sales_catalog.customers;

CREATE TABLE example_sales_catalog.customers.customer_accounts (
  customer_account_id INT,
  customer_id INT,
  account_status STRING,
  created_at TIMESTAMP,
  updated_at TIMESTAMP
) USING ICEBERG;

INSERT INTO example_sales_catalog.customers.customer_accounts VALUES
  (1, 1001, 'Active', CURRENT_TIMESTAMP(), CURRENT_TIMESTAMP()),
  (2, 1002, 'Active', CURRENT_TIMESTAMP(), CURRENT_TIMESTAMP()),
  (3, 1003, 'Inactive', CURRENT_TIMESTAMP(), CURRENT_TIMESTAMP()),
  (4, 1004, 'Active', CURRENT_TIMESTAMP(), CURRENT_TIMESTAMP()),
  (5, 1005, 'Pending', CURRENT_TIMESTAMP(), CURRENT_TIMESTAMP());
```

## Step 2: Add a service principal in Databricks

In this step, you’ll add a service principal in Databricks.

To allow Snowflake to authenticate with Unity Catalog, you need to add a service principal
in Databricks and then create an OAuth secret for your service principal in Databricks.

### Add a service principal in Databricks

1. To add a service principal, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Add service principals to your account](https://docs.databricks.com/aws/admin/users-groups/manage-service-principals?language=Account%C2%A0console#-add-service-principals-to-your-account)
   * **Azure Databricks**: [Azure Databricks: Add service principals to your account](https://learn.microsoft.com/azure/databricks/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Add service principals to your account](https://docs.databricks.com/gcp/admin/users-groups/manage-service-principals#-add-service-principals-to-your-account)
2. Copy the *Application ID* value for your service principal into a text editor and store it securely. You specify this value later
   when you create a catalog integration in Snowflake.

### Create an OAuth secret for your service principal

1. To create an OAuth secret for your service principal, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Create an OAuth secret](https://docs.databricks.com/aws/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
   * **Azure Databricks**: [Azure Databricks: Create an OAuth secret](https://learn.microsoft.com/azure/databricks/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Create an OAuth secret](https://docs.databricks.com/gcp/dev-tools/auth/oauth-m2m#-step-1-create-an-oauth-secret)
2. Copy the *Secret* value that you generated into a text editor and store it securely. You specify this value later when you create a
   catalog integration in Snowflake.

   > **Important:**
   >
   > The client secret is only displayed once. Make sure to copy it before closing the dialog.

In the next step, you’ll grant privileges to the service principal that you created, which enables Snowflake access to your
`example_sales_catalog` catalog in Unity Catalog.

## Step 3: Enable Snowflake access to Unity Catalog

In this step, you use Databricks to enable Snowflake access to your catalog in Unity Catalog.

To enable Snowflake access to your catalog in Unity Catalog through vended credentials, first, at the metastore level, you must enable
external data access on the metastore. Next, you need to grant your service principal Unity Catalog
privileges to your catalog.

### Enable external data access on the metastore

First, enable external data access on the metastore.

For instructions on how to enable external data access on the metastore, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Enable external data access on the metastore](https://docs.databricks.com/aws/en/external-access/admin#enable-external-data-access-on-the-metastore)
* **Azure Databricks**: [Azure Databricks: Enable external data access on the metastore](https://learn.microsoft.com/en-us/azure/databricks/external-access/admin#enable-external-data-access-on-the-metastore)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Enable external data access on the metastore](https://docs.databricks.com/gcp/en/external-access/admin#enable-external-data-access-on-the-metastore)

### Grant your service principal access to your catalog

Next, you must grant your service principal Unity Catalog privileges. You need to grant
these privileges to your service principal to allow Snowflake access to the catalog based on the privileges that you specify.

Catalog ExplorerSQL

Use the Catalog Explorer to select the `example_sales_catalog` catalog and then grant the following privileges to your service
principal at the catalog level:

* `CREATE TABLE`
* `EXTERNAL USE SCHEMA`
* `MODIFY`
* `SELECT`
* `USE CATALOG`
* `USE SCHEMA`

To grant permissions by using the Databricks Catalog Explorer, see the topic for where your Databricks account is hosted:

* **Databricks on AWS**: [Databricks on AWS: Grant permissions on an object](https://docs.databricks.com/aws/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Azure Databricks**: [Azure Databricks: Grant permissions on an object](https://learn.microsoft.com/en-us/azure/databricks/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)
* **Databricks on Google Cloud**: [Databricks on Google Cloud: Grant permissions on an object](https://docs.databricks.com/gcp/en/data-governance/unity-catalog/manage-privileges#-grant-permissions-on-an-object)

> **Important:**
>
> In the Principals field, you must enter the name of your service principal, not the email address for a user or the
> name of a group.

Run the following statements to grant your service principal the privileges needed to complete this tutorial. See
Description of the privileges
for a description of each privilege.

```sqlexample
GRANT CREATE TABLE ON CATALOG example_sales_catalog TO `<application_id>`;
GRANT EXTERNAL USE SCHEMA ON CATALOG example_sales_catalog TO `<application_id>`;
GRANT MODIFY ON CATALOG example_sales_catalog TO `<application_id>`;
GRANT SELECT ON CATALOG example_sales_catalog TO `<application_id>`;
GRANT USE CATALOG ON CATALOG example_sales_catalog TO `<application_id>`;
GRANT USE SCHEMA ON CATALOG example_sales_catalog TO `<application_id>`;
```

Where:

* `<application_id>` is the *Application ID* for your service principal, which you copied in a previous step.
  `1aaa1a1a-11a1-1111-1111-1a11111aaa1a` is an example of an Application ID for a service principal.

#### Description of the privileges

The following table describes each Unity Catalog privilege that you granted to your service principal and what access each privilege
gives Snowflake:

| Privilege | Description |
| --- | --- |
| `CREATE TABLE` | Allows Snowflake to create new Iceberg tables in the catalog. Required to create Unity Catalog-managed Iceberg tables from Snowflake. |
| `EXTERNAL USE SCHEMA` | Allows Unity Catalog to generate and provide temporary, scoped credentials to Snowflake for accessing table data in cloud storage.  **Important:** This privilege is required when you use vended credentials with a catalog-linked database. |
| `MODIFY` | Allows Snowflake to insert, update, or delete data in existing tables. Required to write data to Unity Catalog-managed Iceberg tables from Snowflake. |
| `SELECT` | Allows Snowflake to query tables and access table metadata. Required for all operations in Snowflake, including reading data and discovering tables in the catalog-linked database. |
| `USE CATALOG` | Allows Snowflake to access the catalog. Required to connect to and interact with any objects in the Unity Catalog. |
| `USE SCHEMA` | Allows Snowflake access to schemas (namespaces) within the catalog. Required to view and work with tables in specific schemas. |

## Step 4: Gather your Databricks workspace information

In this step, you use Databricks to gather information about your workspace. You need this information to specify it later when you create
a catalog integration in Snowflake.

Gather the following information from your Databricks workspace:

1. **Databricks workspace URL**: This is the URL you use to access your Databricks workspace.

   For instructions on how to find this URL, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Workspace instance names, URLs, and IDs](https://docs.databricks.com/aws/workspace/workspace-details#workspace-instance-names-urls-and-ids)
   * **Azure Databricks**: [Azure Databricks: Determine per-workspace URL](https://learn.microsoft.com/azure/databricks/workspace/workspace-details#determine-per-workspace-url)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Workspace instance names, URLs, and IDs](https://docs.databricks.com/gcp/workspace/workspace-details#workspace-instance-names-urls-and-ids)

   Here is an example of a Databricks workspace URL for each cloud platform:

   * **Databricks on AWS**: `https://dbc-a1a1a1a1-a1a1.cloud.databricks.com`
   * **Azure Databricks**: `https://adb-1111111111111111.1.azuredatabricks.net`
   * **Databricks on Google Cloud**: `https://1111111111111111.1.gcp.databricks.com`
2. **Catalog name in Unity Catalog**: The name of the catalog in Unity Catalog that you want to access from Snowflake,
   which is `example_sales_catalog`.
3. **Application ID**: The Application ID for the service principal that you added in Databricks.
4. **OAuth secret**: The OAuth secret for the service principal that you added in Databricks.

Copy these values into a text editor. You’ll use them when you create a catalog integration in Snowflake.

## Step 5: Set up a warehouse and catalog integration in Snowflake

In Snowflake, set up your environment by creating a warehouse and catalog integration for this tutorial.

### Create a warehouse

Run the following statements to create a warehouse.

```sqlexample
CREATE WAREHOUSE catalog_linked_database_tutorial_wh
  WAREHOUSE_TYPE = STANDARD
  WAREHOUSE_SIZE = XSMALL;

USE WAREHOUSE catalog_linked_database_tutorial_wh;
```

### Create a catalog integration

In Snowflake, create a catalog integration that connects Snowflake to your `example_sales_catalog` catalog in Unity Catalog
by using OAuth authentication and vended credentials.

To create a catalog integration for the Databricks Unity Catalog, use
the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../../sql-reference/sql/create-catalog-integration-rest.md) command.

The following example creates a REST catalog integration for connecting to your `example_sales_catalog` catalog by using OAuth:

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_unity_catalog_int_vended_creds
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  REST_CONFIG = (
    CATALOG_URI = '<databricks_workspace_url>/api/2.1/unity-catalog/iceberg-rest'
    CATALOG_NAME = 'example_sales_catalog'
    ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_TOKEN_URI = '<databricks_workspace_url>/oidc/v1/token'
    OAUTH_CLIENT_ID = '<client_id>'
    OAUTH_CLIENT_SECRET = '<oauth_secret>'
    OAUTH_ALLOWED_SCOPES = ('all-apis')
  )
  ENABLED = TRUE;
```

Where:

* `<databricks_workspace_url>` is your Databricks workspace URL, which you found in the previous step.
* `example_sales_catalog` is the name of your catalog in Unity Catalog that you want to connect to.
* `<client_id>` is the OAuth client ID for the service principal that you created in Databricks.

  > **Important:**
  >
  > In Databricks, this value is called the *Application ID*, not Client ID.
* `<oauth_secret>` is the OAuth secret that you generated for the service principal that you created in Databricks.

### Verify your catalog integration

* To verify the configuration for your catalog integration, call the SYSTEM$VERIFY_CATALOG_INTEGRATION function.

  For more information, including an example, see [Use SYSTEM$VERIFY_CATALOG_INTEGRATION to check your catalog integration configuration](../tables-iceberg-configure-catalog-integration-rest-check-config.md).

Next, you’ll specify the catalog integration that you created when you create a catalog-linked database that discovers and syncs with your
`example_sales_catalog` catalog in Unity Catalog.

## Step 6: Create a catalog-linked database

In this step, you create a catalog-linked database that connects to your `example_sales_catalog` catalog in Unity Catalog by using the catalog
integration you created in the previous step.

To create a catalog-linked database, use the [CREATE DATABASE (catalog-linked)](../../sql-reference/sql/create-database-catalog-linked.md) command.

The following example creates a catalog-linked database that is writable, uses vended credentials, and specifies one blocked namespace.

```sqlexample
CREATE DATABASE unity_linked_db
  LINKED_CATALOG = (
    CATALOG = 'my_unity_catalog_int_vended_creds'
    BLOCKED_NAMESPACES = ('information_schema')
  );
```

Where:

* `my_unity_catalog_int_vended_creds` is the name of the catalog integration that you created in the previous step.

> **Note:**
>
> * You don’t need to specify the `ALLOWED_WRITE_OPERATIONS` parameter to create a catalog-linked database that is writable.
>   The reason is that the default for this parameter is `ALL`, which means that your catalog-linked database is writable.
> * `BLOCKED_NAMESPACES = ('information_schema')` prevents Snowflake from syncing with the `information_schema` schema in
>   your Unity Catalog. Otherwise, this schema returns irrelevant SQL execution errors when you check the catalog sync status later.
>   These errors are irrelevant because the tables nested under this schema aren’t compatible with Iceberg.
>
>   The tables and views nested under the `information_schema` schema are built-in Databricks tables and views; they aren’t Iceberg tables.
> * The example code configures your catalog-linked database with vended credentials, so you don’t need to specify an external volume.
> * The default for the `ALLOWED_WRITE_OPERATIONS` parameter is `ALL`, which means that your catalog-linked database supports read
>   and write operations.

Now that you’ve created a catalog-linked database, Snowflake automatically syncs with your Unity Catalog
to discover namespaces and Iceberg tables. The default sync interval is 30 seconds.

## Step 7: Check the catalog sync status

To verify that Snowflake has successfully linked your catalog in Unity Catalog to your database,
use the [SYSTEM$CATALOG_LINK_STATUS](../../sql-reference/functions/system_catalog_link_status.md) function:

```sqlexample
SELECT SYSTEM$CATALOG_LINK_STATUS('unity_linked_db');
```

The function returns information about the sync status, including any tables that failed to sync.

If the sync is successful, you should see your Unity Catalog namespaces appear as schemas
in the catalog-linked database, and Iceberg tables appear under their respective schemas.

> **Note:**
>
> To identify tables that Snowflake created but couldn’t initialize, use the
> [SHOW ICEBERG TABLES](../../sql-reference/sql/show-iceberg-tables.md) command. For more information, see [Identify tables that were created but couldn’t be initialized](../tables-iceberg-catalog-linked-database.md).

## Step 8: Work with Iceberg tables

After the catalog sync completes, you can query and insert data into your Unity Catalog-managed Iceberg tables
directly from Snowflake and create a Unity Catalog-managed Iceberg table from Snowflake.

You can then use Databricks to work with the tables that you created or modified from Snowflake.

### Query Iceberg tables from Snowflake

To query a Unity Catalog-managed Iceberg table, follow these steps:

1. Select your catalog-linked database.

   ```sqlexample
   USE DATABASE unity_linked_db;
   ```
2. Query a table in your catalog in Unity Catalog.

   The following example queries the `customer_accounts` table, which is nested under the top-level `customers` schema
   in the `example_sales_catalog` catalog in Unity Catalog.

   ```sqlexample
   SELECT * FROM customers.customer_accounts
     LIMIT 20;
   ```

> **Note:**
>
> For requirements about identifying objects in a catalog-linked database,
> see [Requirements for identifier resolution in a catalog-linked database](../tables-iceberg-catalog-linked-database.md).

### Insert data into Iceberg tables from Snowflake

With a catalog-linked database, you can use Snowflake to insert data into your Unity Catalog-managed Iceberg tables.

To insert data into a Unity Catalog-managed Iceberg table from Snowflake, follow these steps:

1. Select your catalog-linked database.

   ```sqlexample
   USE DATABASE unity_linked_db;
   ```
2. Insert data into the `customer_accounts` table.

   ```sqlexample
   INSERT INTO customers.customer_accounts (
     customer_account_id,
     customer_id,
     account_status,
     created_at,
     updated_at
   )
     VALUES
       (6, 1006, 'ACTIVE', '2025-12-15 10:23:45', '2025-12-15 10:23:45');
   ```

### Create a new Iceberg table from Snowflake

With a catalog-linked database, you can also use Snowflake to create a Unity Catalog-managed Iceberg table.

To create a Unity Catalog-managed Iceberg table from Snowflake, follow these steps:

1. Select your catalog-linked database.

   ```sqlexample
   USE DATABASE unity_linked_db;
   ```
2. Select the schema in your catalog in Unity Catalog where you want to create an Iceberg table.

   ```sqlexample
   USE SCHEMA customers;
   ```
3. Create an Iceberg table.

   ```sqlexample
   CREATE ICEBERG TABLE table_created_from_snowflake (
     id INT,
     name STRING,
     created_date DATE
   );
   ```

   When you create a table in a catalog-linked database, Snowflake creates the table both in
   Snowflake and in your catalog in Unity Catalog. As you update the table by using either Snowflake or Databricks,
   Snowflake keeps both table instances in sync.

### Drop an Iceberg table from Snowflake

Because your catalog-linked database is writable, you can use the [DROP ICEBERG TABLE](../../sql-reference/sql/drop-iceberg-table.md) command to
drop an Iceberg table from Snowflake. To drop a table from Snowflake, your Databricks service principal must be granted the MANAGE
privilege.

> **Warning:**
>
> When your catalog-linked database has write permissions enabled, Snowflake propagates table drops to the remote catalog, which removes
> the table and data from both systems.

### Work with Iceberg tables from Databricks

In Databricks, try the following tasks:

* Insert data into the `table_created_from_snowflake` Iceberg table that you created from Snowflake.

  ```sqlexample
  USE CATALOG example_sales_catalog;
  USE SCHEMA customers;
  INSERT INTO table_created_from_snowflake VALUES
    (1, 'John', CURRENT_TIMESTAMP());
  ```

  > **Note:**
  >
  > If you receive a PERMISSION_DENIED error, you might need to first grant the MODIFY and SELECT privileges for the table to your Databricks user.
* Query the `customer_accounts` table to view the data that you inserted from Snowflake.

  ```sqlexample
  USE CATALOG example_sales_catalog;
  USE SCHEMA customers;
  SELECT * FROM customer_accounts;
  ```

### Automated refresh for your Iceberg tables

[Automated refresh](../tables-iceberg-auto-refresh.md) is enabled by default for Iceberg tables in your catalog-linked database.
As a result, when you update your Iceberg table from Databricks, the corresponding Iceberg table in Snowflake is automatically refreshed
with the updates.

To check the automated refresh status for your Unity Catalog-managed Iceberg tables, use the [SYSTEM$AUTO_REFRESH_STATUS](../../sql-reference/functions/system_auto_refresh_status.md)
system function in Snowflake.

## Clean up

### Clean up in Snowflake

To delete the objects that you created for this tutorial, run the following DROP statements.

Replace the following values:

* `my_other_database` with the name of a database to use so that you can drop the one that you created for this tutorial.
* `my_other_warehouse` with the name of a warehouse to use so that you can drop the one that you created for this tutorial.

```sqlexample
USE DATABASE <my_other_database>;
DROP DATABASE unity_linked_db;
DROP CATALOG INTEGRATION my_unity_catalog_int_vended_creds;
USE WAREHOUSE <my_other_warehouse>;
DROP WAREHOUSE catalog_linked_database_tutorial_wh;
```

### Clean up in Databricks

1. Drop the `example_sales_catalog` catalog.

   ```sqlexample
   DROP CATALOG example_sales_catalog cascade;
   ```
2. Remove your service principal.

   For instructions, see the topic for where your Databricks account is hosted:

   * **Databricks on AWS**: [Databricks on AWS: Manage service principals](https://docs.databricks.com/aws/en/admin/users-groups/manage-service-principals)
   * **Azure Databricks**: [Azure Databricks: Manage service principals](https://learn.microsoft.com/en-us/azure/databricks/admin/users-groups/manage-service-principals)
   * **Databricks on Google Cloud**: [Databricks on Google Cloud: Manage service principals](https://docs.databricks.com/gcp/en/admin/users-groups/manage-service-principals)

## Summary and additional resources

In this tutorial, you followed an end-to-end workflow to set up bidirectional access to Unity Catalog
by using a catalog-linked database with vended credentials.

Along the way, you completed the following tasks:

* **Created a catalog in Unity Catalog** with a Unity Catalog-managed Iceberg table.
* **Created OAuth credentials in Databricks** for Snowflake to authenticate with Unity Catalog by adding a service principal and OAuth secret.
* **Enabled Snowflake access to your catalog in Unity Catalog**, which
  included granting your service principal with Unity Catalog privileges.

  In the tutorial, we granted privileges at the catalog level. However, for more granular control, you can grant privileges at the schema
  or table level.

  Here’s an example of granting privileges at the schema level:

  ```sqlexample
  GRANT CREATE TABLE ON SCHEMA example_sales_catalog.customers TO `<application_id>`;
  GRANT EXTERNAL USE SCHEMA ON SCHEMA example_sales_catalog.customers TO `<application_id>`;
  GRANT MODIFY ON SCHEMA example_sales_catalog.customers TO `<application_id>`;
  GRANT SELECT ON SCHEMA example_sales_catalog.customers TO `<application_id>`;
  GRANT USE CATALOG ON CATALOG example_sales_catalog TO `<application_id>`;
  GRANT USE SCHEMA ON SCHEMA example_sales_catalog.customers TO `<application_id>`;
  ```

  For more information about service principals and OAuth in Databricks, see
  [Authentication using OAuth tokens for service principals](https://docs.databricks.com/aws/dev-tools/authentication-oauth)
  in the Databricks documentation. For more information about Unity Catalog privileges, see [Manage privileges in Unity Catalog](https://docs.databricks.com/aws/data-governance/unity-catalog/manage-privileges).
* **Created an OAuth catalog integration with vended credentials** to connect Snowflake to your catalog in Unity Catalog.
  For more information about vended credentials in Snowflake, see [Use catalog-vended credentials for Apache Iceberg™ tables](../tables-iceberg-configure-catalog-integration-vended-credentials.md).

  > **Note:**
  >
  > Alternatively, you can create a bearer catalog integration. For instructions, see [Configure a bearer token catalog integration](../tables-iceberg-configure-catalog-integration-rest-unity.md).
  > With a bearer catalog integration, you specify a personal access token (PAT) in the catalog integration for authentication. For more information
  > about PATs in Databricks, see [Authenticate with Databricks personal access tokens (legacy)](https://docs.databricks.com/aws/en/dev-tools/auth/pat)
  > in the Databricks documentation.
* **Created a catalog-linked database** that automatically syncs with your catalog in Unity Catalog.
  For more information about using catalog-linked databases, see [Use a catalog-linked database for Apache Iceberg™ tables](../tables-iceberg-catalog-linked-database.md).

  In the tutorial, you blocked a set of namespaces from your catalog-linked databases by using the BLOCKED_NAMESPACES parameter.
  Alternatively, to instead limit automatic table discovery to a specific set of namespaces, use the ALLOWED_NAMESPACES parameter
  when you create or modify a catalog-linked database.

  In the tutorial, you used the default sync interval that Snowflake uses to automatically discover schemas and tables in your remote catalog,
  which is 30 seconds. However, you can change this interval by using the SYNC_INTERVAL_SECONDS parameter when you create or modify your catalog-linked
  database. For example, you might want to decrease the sync interval to prevent rate limit issues.

  For more information, see the following topics:

  + [CREATE DATABASE (catalog-linked)](../../sql-reference/sql/create-database-catalog-linked.md)
  + [ALTER DATABASE (catalog-linked)](../../sql-reference/sql/alter-database-catalog-linked.md)
* **Worked with Unity Catalog-managed Iceberg tables** by using Snowflake and Databricks.

  In the tutorial, in Snowflake, you used an INSERT INTO statement to insert data into the `customer_accounts` table from Snowflake.

  However, you have other options for inserting data into Unity Catalog-managed Iceberg tables from Snowflake. For example, you can use
  an INSERT INTO … SELECT FROM command. For more information about write support for externally managed Iceberg tables,
  see [Write support for externally managed Apache Iceberg™ tables](../tables-iceberg-externally-managed-writes.md), including [Writing to externally managed Iceberg tables](../tables-iceberg-externally-managed-writes.md).

To learn more about Iceberg tables for Snowflake, see the [Iceberg tables documentation](../tables-iceberg.md).

---
title: Tutorial: Set up CI/CD integrations on dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/tutorials/dbt-projects-on-snowflake-ci-cd-tutorial.md
section: User Guide
---

dbt

data engineering

tasty bytes

getting started

# Tutorial: Set up CI/CD integrations on dbt Projects on Snowflake

## Introduction

This tutorial is a continuation of the
[Getting started with dbt Projects on Snowflake](dbt-projects-on-snowflake-getting-started-tutorial.md) tutorial.
It assumes you’ve already completed that tutorial and have a working Snowflake environment with your database, schemas, warehouse, and source
data set up.

This tutorial guides you through building a secure CI/CD pipeline for dbt Projects on Snowflake using GitHub Actions, OIDC authentication, Snowflake CLI, and
dbt project objects to automate testing, deployment, and orchestration with minimal overhead.

For more information, see [CI/CD integrations on dbt Projects on Snowflake](../data-engineering/dbt-projects-on-snowflake-ci-cd.md).

### Overview

This tutorial walks you through the following steps:

1. Setting up your Snowflake environment:

   * You choose one of three ways to prepare dev and prod targets (full database clone, partial clone, or brand-new databases).
   * Your dbt project must include a `profiles.yml` that refers to these dev and prod targets.
2. Setting up an OIDC service user for secure authentication: Instead of passwords or long-lived tokens, you create a Snowflake service user
   that trusts GitHub through [OpenID Connect](../workload-identity-federation.md). This enables secure, short-lived, per-run
   authentication.
3. Setting up network policies: (Optional) If your Snowflake account restricts inbound IPs, you can use [Snowflake-managed network rules](../network-rules.md)
   to add Github Actions runner IPs to your service user’s network policy. Otherwise, you can skip this step.
4. Storing GitHub secrets and repository variables to configure Snowflake CLI in your workflows:

   * Your Snowflake account identifier
   * (Optionally) the Snowflake username
   * The target database and schema where dbt project objects will be deployed
5. Creating GitHub Actions workflows:

> * CI workflow that triggers on pull requests, deploys a tester dbt project, and runs `dbt build` to build models and test them in DAG order.
>   If anything breaks, the pull request fails.
> * CD workflow that triggers on merges to main, deploys the production dbt project object, and optionally applies scheduling.

At the end of the tutorial, you will have:

* A fully automated, GitHub-driven dbt workflow
* Secure OIDC authentication
* Consistent, tested deployments into Snowflake
* Version-controlled orchestration (optional)
* A repeatable template for scaling dbt workflows across teams

### Prerequisites

* **GitHub**

  + An existing dbt Project in a GitHub account that can create a repository and manage access to that repository.
* **Snowflake**

  + Completion of the
    [Getting started with dbt Projects on Snowflake](dbt-projects-on-snowflake-getting-started-tutorial.md) tutorial,
    which sets up the `tasty_bytes_dbt_db` database, `dev`/`prod` schemas, `tasty_bytes_dbt_wh` warehouse, and source data.
  + Basic understanding of dbt Projects on Snowflake. For more information, see [dbt Projects on Snowflake](../data-engineering/dbt-projects-on-snowflake.md).
  + A Snowflake account and user with privileges as described in [Access control for dbt projects on Snowflake](../data-engineering/dbt-projects-on-snowflake-access-control.md).
  + Privileges or administrator assistance to create and edit the following:

    - GitHub repository secrets to specify the account and (optional) username
    - A Snowflake service user
    - Network policy

## Set up your environment

Set up where your dbt project will read and write in Snowflake, then update your `profiles.yml` file.

### Create dev and prod databases and schemas

> **Note:**
>
> If you’ve already completed the
> [Getting started with dbt Projects on Snowflake](dbt-projects-on-snowflake-getting-started-tutorial.md) tutorial
> and run the `tasty_bytes_setup.sql` file, your database (`tasty_bytes_dbt_db`) and schemas (`dev`, `prod`) already exist. You can
> skip this step. For details, see
> [Run the SQL commands in tasty_bytes_setup.sql to set up source data](dbt-projects-on-snowflake-getting-started-tutorial.md).

To set up where your dbt project will read and write in Snowflake, choose one of the following options:

1. Create an empty database with dev and prod schemas
2. Clone your production database using zero-copy cloning
3. Create an empty dev database and clone the production schemas you need

#### Create an empty database with dev and prod schemas

This is the simplest approach when you’re starting from scratch.

```sqlexample
CREATE DATABASE IF NOT EXISTS tasty_bytes_dbt_db;
CREATE SCHEMA IF NOT EXISTS tasty_bytes_dbt_db.dev;
CREATE SCHEMA IF NOT EXISTS tasty_bytes_dbt_db.prod;
```

#### Clone your production database

Use Snowflake’s [zero-copy cloning](../../sql-reference/sql/create-clone.md) to create a full replica of your production database, as shown in
the following example. This gives you a high-fidelity testing environment and is cost-effective because you only pay storage for tables that
change during dbt runs.

```sqlexample
-- This assumes that other_tasty_bytes_dbt_db has the two schemas dev and prod
CREATE DATABASE IF NOT EXISTS tasty_bytes_dbt_db CLONE other_tasty_bytes_dbt_db;
```

#### Create an empty dev database and clone the production schemas you need

Use this method when you only need specific schemas for testing.

```sqlexample
CREATE DATABASE IF NOT EXISTS tasty_bytes_dbt_db;

-- Repeat the line below for other necessary schemas
CREATE SCHEMA IF NOT EXISTS tasty_bytes_dbt_db.dev CLONE other_tasty_bytes_dbt_db.dev;
CREATE SCHEMA IF NOT EXISTS tasty_bytes_dbt_db.prod CLONE other_tasty_bytes_dbt_db.prod;
```

### Update your profiles.yml file

To manage CI/CD for a dbt project object in GitHub Actions, you must include a `profiles.yml` file inside your dbt project folder (for
example, `my_dbt_project/profiles.yml`). This file defines your dev and prod targets and uses placeholder values that GitHub repository secrets will later replace.

Edit this file *directly on GitHub* to reference the dev and prod databases and schemas you created, as shown below:

```yaml
tasty_bytes:
  target: dev
  outputs:
    dev:
      account: '_' # Put any value here, it will be overwritten by a GitHub repository secret
      database: tasty_bytes_dbt_db
      schema: dev
      role: ACCOUNTADMIN # Use whichever role has USAGE on the database and schema
      type: snowflake
      warehouse: tasty_bytes_dbt_wh
      user: '_' # Put any value here, it will be overwritten by a GitHub repository secret
      threads: 8 # Snowflake recommends 8 threads
    prod:
      account: '_' # Put any value here, it will be overwritten by a GitHub repository secret
      database: tasty_bytes_dbt_db
      schema: prod
      role: ACCOUNTADMIN # Use whichever role has USAGE on the database and schema
      type: snowflake
      warehouse: tasty_bytes_dbt_wh
      user: '_' # Put any value here, it will be overwritten by a GitHub repository secret
      threads: 8 # Snowflake recommends 8 threads
```

Key points from the example:

* `target: dev` sets the default target of the dbt project. This value can be overridden by Snowflake CLI or a dbt project object.
* `dev` and `prod` both use `type: snowflake`.
* Database and schema point to the databases and schemas you created in the previous step (or already set up from the getting-started tutorial).
* Warehouse is the warehouse created in the getting-started tutorial (`tasty_bytes_dbt_wh`).
* Account and user are set to dummy values like ‘_’ because they’ll be replaced by GitHub repository secrets later.
* `threads: 8` sets the number of concurrent threads dbt uses. Snowflake recommends 8 threads.

## Create a GitHub service user in Snowflake (recommended)

GitHub Actions run using the Snowflake user specified in your Snowflake CLI commands. To keep things clean and secure, create a dedicated
Snowflake user for all GitHub workflows and grant it the required privileges.

### Recommended: OIDC-based service user

This approach uses OpenID Connect (OIDC) rather than long-lived credentials. The service user trusts GitHub as an identity provider, allowing
GitHub Actions to request short-lived tokens for each workflow run. You will map this user to an environment subject like
`environment:prod` in a later step.

Each OIDC service user must have a unique subject. We recommend using a repo path and an environment name, for example
`repo:<org>/<repo>:environment:<environment_name>`. The environment name can be anything, as long as it matches exactly in your GitHub
Action YAML file. For more information, see [Workload identity federation](../workload-identity-federation.md).

Create an OIDC-based service user as follows:

```sqlexample
CREATE USER IF NOT EXISTS github_actions_service_user
  TYPE = SERVICE
  WORKLOAD_IDENTITY = (
    TYPE = OIDC
    ISSUER = 'https://token.actions.githubusercontent.com',
    SUBJECT = 'repo:your_repo_org/your_dbt_repo:environment:prod'
  )
  DEFAULT_ROLE = ACCOUNTADMIN
  COMMENT = 'Service user for GitHub Actions';
```

After you create your user, explicitly grant the default role for the service user to assume that role. The DEFAULT_ROLE parameter only sets the
user’s default role and doesn’t grant it.

```sqlexample
GRANT ROLE ACCOUNTADMIN TO USER github_actions_service_user;
```

Set a default warehouse:

```sqlexample
ALTER USER github_actions_service_user SET DEFAULT_WAREHOUSE = 'tasty_bytes_dbt_wh';
```

### Alternative: PAT-based authentication (less secure)

If you prefer to use one Snowflake user across multiple repositories, or cannot use OIDC, you can create the user with a personal access
token (PAT) instead.

This method is easier to reuse across repositories but less secure because it relies on long-lived credentials and requires manual rotation.

```sqlexample
CREATE USER IF NOT EXISTS github_actions_service_user
TYPE = SERVICE
COMMENT = 'Service user for GitHub Actions';

-- Grant the level of access to your user that can create network, auth policies,
-- and objects such as DBs and schemas
GRANT ROLE ACCOUNTADMIN TO USER github_actions_service_user;

-- Setting up databases and schemas to store policies and network rules
CREATE DATABASE IF NOT EXISTS github_actions_access_management;
CREATE SCHEMA IF NOT EXISTS github_actions_access_management.NETWORKS;
CREATE SCHEMA IF NOT EXISTS github_actions_access_management.POLICIES;

CREATE AUTHENTICATION POLICY github_actions_access_management.POLICIES.github_auth_policy
authentication_methods = ('PROGRAMMATIC_ACCESS_TOKEN')
pat_policy = (
default_expiry_in_days = 15, -- default value
max_expiry_in_days = 365, -- default value
network_policy_evaluation = ENFORCED_NOT_REQUIRED -- this is needed to ensure you can generate a PAT on Snowsight
);

ALTER USER github_actions_service_user SET AUTHENTICATION POLICY github_actions_access_management.POLICIES.github_auth_policy;
```

## (Optional) Set up a network policy for GitHub Actions

Now that you’ve created the service user that Snowflake CLI will use, let’s configure this user to connect to your Snowflake account from within GitHub Actions.

> **Note:**
>
> Creating or modifying network policies requires ACCOUNTADMIN or an equivalent role.

### Determine whether you need a network policy

* If your account restricts inbound access, you must create or update a network policy to add GitHub Actions runner IPs to your allowlist. Snowflake
  simplifies this with Snowflake-managed network rules. For more information, see [Network rules](../network-rules.md).
* If your account does *not* restrict inbound access, no network policy changes are required.

If you’re unsure, skip this step for now and return only if you see an error like: `Incoming request with IP/Token <IP> is not allowed to access Snowflake.`

To create and apply a network policy to a user, choose one of the following options:

* Create a new network policy and assign it to the service user, or
* Add the GitHub Actions network rule to an existing network policy that the user already uses.

> **Note:**
>
> Before doing this, consult your Snowflake account admin. They must ensure the policy includes not only the GitHub Actions network rule but
> also any other IP ranges your organization requires.
>
> Once a network policy is applied, Snowflake restricts user access based on its allowed and blocked IP ranges. Your account admin might need
> to adjust the policy or apply it account wide to avoid unintentionally blocking essential access.

#### Option 1: Create a new network policy and apply it to the user

A Snowflake user can have only one network policy at a time. If the user doesn’t have one or you want to replace the existing policy, complete
the following steps:

```sqlexample
CREATE NETWORK POLICY github_actions_policy
  ALLOWED_NETWORK_RULE_LIST = ('SNOWFLAKE.NETWORK_SECURITY.GITHUBACTIONS_GLOBAL', <other required rules>)
  BLOCKED_NETWORK_RULE_LIST = ();

ALTER USER GitHub_Actions_Service_User
  SET NETWORK_POLICY = github_actions_policy;
```

#### Option 2: Add a network rule to an existing network policy

If the user already has a network policy, you can add the GitHub Actions rule to it.

```sqlexample
-- Check the user's current network policy:
SHOW PARAMETERS LIKE 'NETWORK_POLICY' FOR USER github_actions_service_user;
```

> **Note:**
>
> If the network policy is applied at the account level or shared by many users, updating it will affect everyone.

```sqlexample
-- Add the new rule:
ALTER NETWORK POLICY <name>
  ADD ALLOWED_NETWORK_RULE_LIST = ('SNOWFLAKE.NETWORK_SECURITY.GITHUBACTIONS_GLOBAL');
```

The user inherits the update automatically since they’re already assigned to this policy.

## Configure GitHub repository secrets and variables

GitHub Actions use the Snowflake CLI to connect to your Snowflake account, so you must configure GitHub repository secrets and variables
first. This is how the CI/CD integration passes Snowflake account info into Snowflake CLI inside GitHub Action workflows.

### Configure GitHub repository secrets

Add secrets to securely store the information Snowflake CLI needs to identify your Snowflake account and, if required, the user it should
authenticate as:

1. In your GitHub repository, go to Settings.
2. From the left-hand side navigation, select Secrets and variables » Actions.
3. Under Secrets, select New repository secret.
4. Add a secret to connect your Snowflake account:

   * Name: `SNOWFLAKE_ACCOUNT`
   * Value: Your Snowflake account identifier (for example, `org_name-account_name`). This value tells Snowflake CLI which
     account you want to connect to.
5. Select Add secret.
6. (Optional) If you aren’t using OIDC, select New repository secret to specify the Snowflake username the CLI should use when connecting. It specifies which
   user credentials to run commands under.

   * Name: `SNOWFLAKE_USER`
   * Value: Optional if you’re using OIDC or credential-less authentication.

     + With OIDC, Snowflake CLI automatically matches the GitHub Action’s subject to the OIDC service user (created in Step 3), so this is
       not required.
     + Without OIDC, you must specify a user (and supply password or key credentials). As a recommended best practice, you should create
       a personal access token in Snowsight. For more information, see [Generating a programmatic access token](../programmatic-access-tokens.md).
7. Select Add secret.
8. (Optional) If you aren’t using OIDC, select New repository secret to specify the service user’s personal access token that the CLI should use when connecting.

   * Name: `SNOWFLAKE_PAT`
   * Value: Optional if you’re using OIDC or credential-less authentication.

### Configure GitHub repository variables

These help Snowflake CLI connect to the right database and schema. Complete the following steps:

1. In your GitHub repository, go to Settings.
2. From the left-hand side navigation, select Secrets and variables » Actions.
3. Under Variables, select New repository variable.
4. Add a database variable:

   * Name: `SNOWFLAKE_DATABASE`
   * Value: Enter an existing database where the dbt project object will be created.
5. Select Add variable.
6. Add a schema variable:

   * Name: `SNOWFLAKE_SCHEMA`
   * Value: Enter an existing schema where the dbt project object will be created.
7. Select Add variable.

## Create your Continuous Integration (CI) GitHub Action

This step is where automation starts. This CI workflow runs whenever a pull request targets main. It:

1. Creates a tester dbt project object in Snowflake
2. Runs `dbt build` against your dev target, which builds all models and runs tests in DAG order, failing early if any test fails
3. Fails the pull request if the dbt execution fails

### Create your CI workflow file

1. In your GitHub repository, go to Actions.
2. From the left-hand side navigation, select New workflow.
3. Select set up a workflow yourself to create an empty workflow.
4. Name the file `incoming_pr.yml`.
5. Copy and paste the following into the file:

   ```yaml
   name: Incoming PR
   run-name: PR opened by ${{ github.actor }}
   on:
     pull_request:
       types: [opened, synchronize, reopened, ready_for_review]
       branches: [main]

   permissions:
     contents: read
     id-token: write

   jobs:
     run-snowflake-test-dbt-job:
       name: "Run on Incoming PR"
       runs-on: ubuntu-latest
       environment: prod # Must match the OIDC subject's environment
       env:
         SNOWFLAKE_CLI_FEATURES_ENABLE_DBT: true
         SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
         # SNOWFLAKE_PASSWORD: ${{ secrets.SNOWFLAKE_PAT }} # Snowflake password is required if you aren't using OIDC
         SNOWFLAKE_DATABASE: ${{ vars.SNOWFLAKE_DATABASE }}
         SNOWFLAKE_SCHEMA: ${{ vars.SNOWFLAKE_SCHEMA }}

       steps:
         # Check out repository code
         # Gets the latest code from the incoming pull request
         - name: Check out repository code
           uses: actions/checkout@v4

         - name: Install Snowflake CLI
           uses: snowflakedb/snowflake-cli-action@v2.0
           with: # Ensures Snowflake CLI will search for OIDC users matching this subject
             use-oidc: true

         - name: Check Snowflake CLI Version
           run: snow --version

         # The -x is shorthand for --temporary-connection
         - run: snow connection test -x

         # The --force setting creates the object or updates it if it already exists
         # You can remove the "--source" flag if your dbt_project.yml is at root of your repo
         - name: Create a new tester dbt project object in ${{ vars.SNOWFLAKE_DATABASE }}.${{ vars.SNOWFLAKE_SCHEMA }}
           run: snow dbt deploy tester_tasty_bytes_dbt_project_object_gh_action --source ./tasty_bytes --dbt-version 1.10.15 --force -x

         - name: List all of the snowflake dbt project objects in your account
           run: snow dbt list -x

         # Builds all models and runs tests in DAG order, failing early if any upstream test breaks
         - name: Build and test dbt project in ${{ vars.SNOWFLAKE_DATABASE }}.${{ vars.SNOWFLAKE_SCHEMA }}
           run: snow dbt execute -x tester_tasty_bytes_dbt_project_object_gh_action build --target dev
   ```
6. Select Commit changes.
7. Select Create a new branch for this commit and start a pull request.
8. Select Propose changes.
9. After you finish submitting the pull request, you should see your `incoming_pr.yml` action start to run.
10. After it’s merged, the file will be saved to `.github/workflows/incoming_pr.yml`.

#### Key pieces from the workflow file

* Triggers on pull requests to `main`:

  ```yaml
  on:
    pull_request:
      types: [opened, synchronize, reopened, ready_for_review]
      branches: [main]
  ```
* Grants permissions and sets environment variables for Snowflake CLI:

  ```yaml
  permissions:
    contents: read
    id-token: write

  jobs:
    run-snowflake-test-dbt-job:
      runs-on: ubuntu-latest
      environment: prod # Must match the OIDC subject's environment
      env:
        SNOWFLAKE_CLI_FEATURES_ENABLE_DBT: true
        SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
        # SNOWFLAKE_PASSWORD: ${{ secrets.SNOWFLAKE_PAT }} # Snowflake password is required if you aren't using OIDC
        SNOWFLAKE_DATABASE: ${{ vars.SNOWFLAKE_DATABASE }}
        SNOWFLAKE_SCHEMA: ${{ vars.SNOWFLAKE_SCHEMA }}
  ```
* Steps in the job:

  1. Check out repository code (`actions/checkout@v4`).
  2. Install Snowflake CLI using `snowflakedb/snowflake-cli-action@v2.0` with `use-oidc: true`.
  3. Run `snow --version`.
  4. Run `snow connection test -x` to verify OIDC connection.
  5. Deploy a tester dbt project object using `snow dbt deploy ... --force -x` (with `--source` if the dbt project is in a subfolder).
  6. Run `snow dbt list -x` to show dbt project objects.
  7. Build and test the dbt project in DAG order:

     `snow dbt execute -x tester_tasty_bytes_dbt_project_object_gh_action build --target dev`

     Using `build` instead of separate `run` and `test` commands ensures that tests execute immediately after each model is built, in
     dependency order. If an upstream model’s test fails, downstream models aren’t built, providing faster feedback and preventing invalid
     data from propagating.
* Once you commit this new workflow on a branch and open a pull request, GitHub Actions will run it. If the dbt project object fails to build a model
  or any test fails, the CI check fails and the pull request can’t be merged.

## Create your Continuous Deployment (CD) GitHub Action

The CD workflow runs after code is merged to main (or any direct push to main), ensuring the dbt project object in Snowflake reflects the
latest code.

### Create your CD workflow file

1. In your GitHub repository, go to Actions.
2. From the left-hand side navigation, select New workflow.
3. Select set up a workflow yourself to create an empty workflow.
4. Name the file `pr_merged.yml`.
5. Copy and paste the following into the file:

   ```yaml
   name: PR Accepted Deployment
   run-name: PR from ${{ github.actor }} accepted - triggered a ${{ github.event_name }}
   on:
     push:
       branches: [ main ]

   permissions:
     contents: read
     id-token: write

   jobs:
     run-snowflake-dbt-job:
       name: "Run on Accepted PR"
       runs-on: ubuntu-latest
       environment: prod # Must match the OIDC subject's environment
       env:
         SNOWFLAKE_CLI_FEATURES_ENABLE_DBT: true
         SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
         # SNOWFLAKE_PASSWORD: ${{ secrets.SNOWFLAKE_PAT }} # Snowflake password is required if you aren't using OIDC
         SNOWFLAKE_DATABASE: ${{ vars.SNOWFLAKE_DATABASE }}
         SNOWFLAKE_SCHEMA: ${{ vars.SNOWFLAKE_SCHEMA }}

       steps:
         # Check out repository code
         # Gets the latest code from the pull request branch
         - name: Check out repository code
           uses: actions/checkout@v4

         - name: Install Snowflake CLI
           uses: snowflakedb/snowflake-cli-action@v2.0
           with: # Ensures Snowflake CLI will search for OIDC users matching this subject
             use-oidc: true

         - name: Check Snowflake CLI Version
           run: snow --version

         # The -x is shorthand for --temporary-connection
         - run: snow connection test -x

         # The --force setting creates the object or updates it if it already exists
         # You can remove the "--source" flag if your dbt_project.yml is at root of your repo
         # The --default-target flag ensures the dbt project object compiles and executes with your prod target
         - name: Create a new dbt project object in ${{ vars.SNOWFLAKE_DATABASE }}.${{ vars.SNOWFLAKE_SCHEMA }}
           run: snow dbt deploy tasty_bytes_dbt_object_gh_action --source ./tasty_bytes --default-target prod --dbt-version 1.10.15 --force -x

         - name: List all of the snowflake dbt project objects on your account
           run: snow dbt list -x

         # (optional) Uncomment the lines below and follow Step 7 if you want to manage Task orchestration via source control
         # - name: Run schedules.sql to create or alter tasks for tasty_bytes_dbt_object_gh_action
         #   run: snow sql -f ${{ github.workspace }}/tasty_bytes/schedules.sql -x
   ```
6. Select Commit changes to save the file to `.github/workflows/pr_merged.yml`.
7. Navigate to the Actions tab of your repository to see your `pr_merged.yml` action start to run.

#### Key pieces from the workflow file

* Triggers on pushes to `main`:

  ```yaml
  on:
   push:
     branches: [ main ]
  ```
* Similar permissions/env as CI:

  ```yaml
  permissions:
  contents: read
  id-token: write

  jobs:
    run-snowflake-dbt-job:
      runs-on: ubuntu-latest
      environment: prod # Must match the OIDC subject's environment
      env:
        SNOWFLAKE_CLI_FEATURES_ENABLE_DBT: true
        SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
        # SNOWFLAKE_PASSWORD: ${{ secrets.SNOWFLAKE_PAT }} # Snowflake password is required if you aren't using OIDC
        SNOWFLAKE_DATABASE: ${{ vars.SNOWFLAKE_DATABASE }}
        SNOWFLAKE_SCHEMA: ${{ vars.SNOWFLAKE_SCHEMA }}
  ```
* Steps in the job:

  1. Check out the repository code.
  2. Install Snowflake CLI with OIDC.
  3. Run `snow --version`.
  4. Run `snow connection test -x` to verify OIDC connection.
  5. Deploy/update the production dbt project object with `snow dbt deploy ... --default-target prod --force -x`.
  6. Run `snow dbt list -x` to show dbt project objects.
  7. (Optional) Run a `schedules.sql` file to manage tasks (see next section).
* Once this workflow is in place, every successful merge to main (or push) updates the dbt project object in Snowflake.

## (Optional) Add orchestration with Snowflake tasks

Orchestrate runs of your dbt project object using a `schedules.sql` file and Snowflake tasks (triggered from the CD workflow):

1. In your GitHub repository, navigate to your dbt project (for example, `tasty_bytes/`).
2. Create a file named `schedules.sql` and copy and paste the following into the file.

   This file:

   * Suspends any existing tasks
   * Creates or alters tasks to:

     + Run a subset of the DAG on a schedule, failing early if any test fails
     + Run the full project, failing early if any test fails
   * Resumes tasks in the correct order (child → root)

   ```sqlexample
   -- To avoid issues with CREATE OR ALTER, suspend all of the tasks from root to child
   -- ALTER TASK IF EXISTS ensures this file can execute on first run each time a task is added
   ALTER TASK IF EXISTS run_tasty_bytes_subset SUSPEND;
   ALTER TASK IF EXISTS run_tasty_bytes_full SUSPEND;

   -- Example of a subset that needs to be available early for business needs.
   -- If tests fail here, the next task won't run
   CREATE OR ALTER TASK run_tasty_bytes_subset
     WAREHOUSE = tasty_bytes_dbt_wh
     SCHEDULE = '12 hours'
     AS
         execute dbt project tasty_bytes_dbt_object_gh_action args='build --select raw_customers stg_customers customers --target prod';

   -- Builds all models and runs tests in DAG order, failing early if any upstream test breaks
   CREATE OR ALTER TASK run_tasty_bytes_full
     WAREHOUSE = tasty_bytes_dbt_wh
     AFTER run_tasty_bytes_subset
     AS
         execute dbt project tasty_bytes_dbt_object_gh_action args='build --target prod';

   -- When a task is first created or if an existing task it paused, it MUST BE RESUMED to be activated
   -- The tasks must be enabled in REVERSE ORDER from child to root
   ALTER TASK IF EXISTS run_tasty_bytes_full RESUME;
   ALTER TASK IF EXISTS run_tasty_bytes_subset RESUME;
   ```
3. Select Commit changes.
4. In your GitHub repository, navigate to `.github/workflows/pr_merged.yml` and uncomment the `Run schedules.sql to create...`
   step at the end of the file.
5. Select Commit changes.

## Next steps

Next steps to improve your workflow:

* Use [zero-copy cloning](../../sql-reference/sql/create-clone.md) in CI:

  Test against fresh production data by adding a step in `incoming_pr.yml` before deploying your tester dbt object:

  ```snowcli
  snow sql -q "CREATE DATABASE <your_user_name>_dev_dbt_DB CLONE YOUR_PRODUCTION_DATABASE".
  ```
* Add alerting:

  Configure Slack or email notifications in GitHub Actions, or use Snowflake task error notifications.

  For more information, see [Configure a task to send error notifications](../tasks-errors-integrate.md).
* Explore [Managing dbt Projects on Snowflake using Snowflake CLI](../../developer-guide/snowflake-cli/data-pipelines/dbt-projects.md).

---
title: Tutorial: Use primary keys to optimize dynamic table pipelines
source: https://docs.snowflake.com/en/user-guide/tutorials/dynamic-table-primary-keys.md
section: User Guide
---

Snowflake

Dynamic Tables

Performance

Primary Keys

# Tutorial: Use primary keys to optimize dynamic table pipelines

## Introduction

This tutorial shows you how to use primary keys to enable efficient
[incremental refresh](../dynamic-tables-refresh.md) in a
[dynamic table](../dynamic-tables-about.md) pipeline where the base table is
periodically rewritten through INSERT OVERWRITE. You’ll build two dimension-fact join pipelines
with the same data and query definition, one with a primary key and one without, and compare
how they handle a dimension table rewrite.

### About dimension-fact joins with INSERT OVERWRITE

In many data pipelines, dimension tables are periodically rewritten by external processes using
INSERT OVERWRITE. This is common with ETL connectors and batch data loads. The rewrite replaces
all rows in the table, even when only a small fraction of the data actually changed.

Without primary keys, Snowflake can’t determine what changed across a rewrite and treats every
row as new. This forces the pipeline to recompute everything in each refresh cycle.

When the dimension table has a primary key with the `RELY` property, Snowflake uses the key
to identify which rows actually changed between rewrites. The dynamic table then processes only
the affected rows, even though the underlying data was fully replaced.

### What you’ll learn

In this tutorial, you’ll learn how to complete the following tasks:

* Create two dimension tables with the same data: one with a primary key and one without.
* Build two dynamic table pipelines with the same join query and compare their behavior.
* Simulate a dimension table rewrite with INSERT OVERWRITE where only 10% of rows change.
* Compare the incremental refresh performance of both dynamic tables on the same data.

### Prerequisites

You need access to a Snowflake environment with the following resources:

* A [warehouse](../warehouses-overview.md) for compute resources. We recommend using
  an x-small warehouse.
* The privileges required to create databases, schemas, and dynamic tables.
  For more information, see [Access control privileges](../security-access-control-privileges.md).

If you don’t have a user with the necessary permissions, ask someone who does to create one for
you. Users with the ACCOUNTADMIN role can create new users and grant them the required
privileges.

> **Note:**
>
> For the best experience, complete this tutorial in Snowsight so that you can quickly
> view the query history and monitor your dynamic table performance.

## Step 1: Create the source data

Start by setting up two dimension tables (one with a primary key, one without) and a shared fact
table.

Create a database and schema for the tutorial:

```sqlexample
CREATE DATABASE IF NOT EXISTS temp;
CREATE SCHEMA IF NOT EXISTS temp.tutorial;

USE SCHEMA temp.tutorial;
```

Create a dimension table with a primary key and the `RELY` property. The `RELY` property tells
Snowflake that it can trust the primary key for optimizations such as change tracking:

```sqlexample
CREATE OR REPLACE TABLE dimension_products_with_pk (
    product_id INT PRIMARY KEY RELY,
    product_name VARCHAR(200),
    category VARCHAR(100),
    price NUMBER(10, 2)
) CHANGE_TRACKING = TRUE;
```

Create a second dimension table with the same schema but no primary key:

```sqlexample
CREATE OR REPLACE TABLE dimension_products_no_pk (
    product_id INT,
    product_name VARCHAR(200),
    category VARCHAR(100),
    price NUMBER(10, 2)
) CHANGE_TRACKING = TRUE;
```

Create a fact table for order transactions:

```sqlexample
CREATE OR REPLACE TABLE fact_orders (
    order_id INT,
    product_id INT,
    quantity INT,
    order_date TIMESTAMP_NTZ
) CHANGE_TRACKING = TRUE;
```

Insert sample data into the dimension table with a primary key. This generates 100,000 products
across 10 categories:

```sqlexample
INSERT INTO dimension_products_with_pk (product_id, product_name, category, price)
  SELECT
      SEQ4() + 1 AS product_id,
      'Product ' || LPAD(TO_VARCHAR(SEQ4() + 1), 6, '0') AS product_name,
      'Category ' || LPAD(TO_VARCHAR(MOD(SEQ4(), 10) + 1), 2, '0') AS category,
      ROUND(5.00 + MOD(SEQ4(), 500) * 0.50, 2) AS price
  FROM TABLE(GENERATOR(ROWCOUNT => 100000));
```

Copy the same data into the dimension table without a primary key:

```sqlexample
INSERT INTO dimension_products_no_pk
  SELECT * FROM dimension_products_with_pk;
```

Insert sample order data. This generates 10 million orders that reference the products:

```sqlexample
INSERT INTO fact_orders (order_id, product_id, quantity, order_date)
  SELECT
      SEQ4() + 1 AS order_id,
      MOD(SEQ4(), 100000) + 1 AS product_id,
      MOD(SEQ4(), 10) + 1 AS quantity,
      DATEADD(SECOND, SEQ4(), '2025-01-01 00:00:00') AS order_date
  FROM TABLE(GENERATOR(ROWCOUNT => 10000000));
```

## Step 2: Create dynamic tables for comparison

In this step, you will create two dynamic table pipelines with the same join query. The only
difference is whether the dimension table has a primary key. This lets you directly compare
their refresh performance on the same data.

> **Note:**
>
> Replace `my_warehouse` with the name of your warehouse.

### Pipeline with primary key

Create a dynamic table that joins the dimension table (with primary key) to the fact table.
Because the dimension table has a reliable primary key, this dynamic table can use incremental
refresh:

```sqlexample
CREATE OR REPLACE DYNAMIC TABLE dt_enriched_orders_with_pk
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = <my_warehouse>
  REFRESH_MODE = INCREMENTAL
AS
  SELECT
      f.order_id,
      f.product_id,
      d.product_name,
      d.category,
      f.quantity,
      d.price,
      f.quantity * d.price AS order_total,
      f.order_date
  FROM fact_orders f
  INNER JOIN dimension_products_with_pk d ON f.product_id = d.product_id;
```

### Pipeline without primary key

Create the same join query using the dimension table without a primary key. Without a primary
key, Snowflake can’t track row-level changes across INSERT OVERWRITE rewrites, so this dynamic
table recomputes the entire join on each refresh:

```sqlexample
CREATE OR REPLACE DYNAMIC TABLE dt_enriched_orders_no_pk
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = <my_warehouse>
  REFRESH_MODE = INCREMENTAL
AS
  SELECT
      f.order_id,
      f.product_id,
      d.product_name,
      d.category,
      f.quantity,
      d.price,
      f.quantity * d.price AS order_total,
      f.order_date
  FROM fact_orders f
  INNER JOIN dimension_products_no_pk d ON f.product_id = d.product_id;
```

### Refresh both pipelines

Perform the initial refresh for both dynamic tables to establish a baseline:

```sqlexample
ALTER DYNAMIC TABLE dt_enriched_orders_with_pk REFRESH;
ALTER DYNAMIC TABLE dt_enriched_orders_no_pk REFRESH;
```

## Step 3: Simulate a dimension table rewrite and compare

Now simulate the common scenario where an external process rewrites the dimension table through
INSERT OVERWRITE with a small percentage of rows changed.

### Rewrite the dimension tables

Rewrite both dimension tables with INSERT OVERWRITE. This updates the price for 10% of products
(those in Category 01) while keeping the rest identical:

```sqlexample
INSERT OVERWRITE INTO dimension_products_with_pk
  SELECT
      product_id,
      product_name,
      category,
      CASE
          WHEN category = 'Category 01' THEN ROUND(price * 1.10, 2)
          ELSE price
      END AS price
  FROM dimension_products_with_pk;

INSERT OVERWRITE INTO dimension_products_no_pk
  SELECT
      product_id,
      product_name,
      category,
      CASE
          WHEN category = 'Category 01' THEN ROUND(price * 1.10, 2)
          ELSE price
      END AS price
  FROM dimension_products_no_pk;
```

Even though every row was rewritten in both tables, only about 10,000 products (Category 01)
actually have different values. The pipeline with a primary key can detect this; the pipeline
without a primary key can’t.

### Refresh and compare performance

Refresh both dynamic tables to pick up the changes:

```sqlexample
ALTER DYNAMIC TABLE dt_enriched_orders_no_pk REFRESH;
```

Check the execution time and scan metrics:

1. Navigate to Transformation » Dynamic Tables.
2. Filter the list by selecting the `temp` database, then select `dt_enriched_orders_no_pk`.
3. Select the Refresh History tab and notice the REFRESH DURATION value for the most recent refresh.

   Because the dimension table has no primary key, Snowflake can’t distinguish changed rows from unchanged rows after
   the INSERT OVERWRITE. Every row in the rewritten table looks like a new insertion, so the engine treats all 100,000
   dimension rows as changed and must re-join them against the entire 10-million-row fact table. This produces far more
   inserted and deleted rows and a much higher refresh duration, even though only 10% of the dimension data actually changed.
4. Now refresh the optimized `dt_enriched_orders_with_pk` dynamic table:

   ```sqlexample
   ALTER DYNAMIC TABLE dt_enriched_orders_with_pk REFRESH;
   ```
5. Repeat the previous steps to check the Refresh History for the optimized table:

   Compare the two refresh operations. The optimized `dt_enriched_orders_with_pk` dynamic table should complete significantly faster
   than the suboptimal `dt_enriched_orders_no_pk` dynamic table. In the example results, the suboptimal dynamic table took 34 seconds
   and updated 20 million rows in total, while the optimized table took only 12 seconds and updated only 2 million rows in total.

The results show the difference between the two approaches:

* **dt_enriched_orders_with_pk**: Uses the primary key to identify the ~10% of dimension rows
  that actually changed, then processes only the orders that reference those products. The
  `rows_inserted` and `rows_deleted` counts reflect just the affected rows, and the refresh
  duration is significantly lower.
* **dt_enriched_orders_no_pk**: Can’t determine what changed in the rewrite, so it reprocesses
  the entire 10-million-row join. The row counts and refresh duration are much higher.

> **Tip:**
>
> The performance difference increases when a) the fact table grows and b) fewer dimension rows
> actually changed. In production pipelines where dimension tables contain
> millions of rows and only a small fraction changes on each load cycle, primary key-based
> change tracking can reduce refresh times by an order of magnitude.

## Clean up

To delete all objects created for this tutorial, run the following DROP statement:

```sqlexample
DROP DATABASE temp;
```

## Summary and additional resources

In this tutorial, you used [primary keys](../dynamic-tables-primary-keys.md) to
enable an efficient incremental refresh in a dynamic table pipeline where the dimension table is
periodically rewritten through INSERT OVERWRITE. By comparing two pipelines with the same data
and query, you saw how a primary key lets Snowflake identify only the rows that actually changed
and process just those changes through the join.

Along the way, you completed the following tasks:

* **Created two dimension tables** with the same data: one with a primary key (`RELY`) and one
  without.
* **Built two dynamic table pipelines** with the same join query and compared their behavior
  after an INSERT OVERWRITE rewrite.
* **Simulated a dimension table rewrite** with INSERT OVERWRITE, changing 10% of the rows, and
  compared the refresh performance of both pipelines.

**Key concepts demonstrated:**

* **Primary key-based change tracking**: When a base table has a primary key with `RELY`,
  Snowflake uses it to compute row-level changes across rewrites. See
  [Understanding primary keys in dynamic tables](../dynamic-tables-primary-keys.md).
* **INSERT OVERWRITE compatibility**: Primary keys solve the change-tracking gap that occurs
  when tables are fully replaced. Without a primary key, Snowflake treats every row as changed.
  See [Use primary keys to optimize dynamic table pipelines](../dynamic-tables-performance-optimize-primary-keys.md).

For more information about dynamic tables and optimization techniques, explore the following
resources:

* [Understanding primary keys in dynamic tables](../dynamic-tables-primary-keys.md) – Conceptual overview of primary keys in
  dynamic tables.
* [Use primary keys to optimize dynamic table pipelines](../dynamic-tables-performance-optimize-primary-keys.md) – Additional examples for
  using primary keys to optimize pipelines.
* [Optimize queries for incremental refresh](../dynamic-tables-performance-optimize-query.md) – Query optimization for
  incremental refresh.
* [Use immutability constraints](../dynamic-tables-performance-optimize-immutability.md) – Use immutability
  constraints for historical data.
* [Monitor dynamic table performance](../dynamic-tables-performance-monitor.md) – Monitor dynamic table performance.
* [Understanding dynamic table initialization and refresh](../dynamic-tables-refresh.md) – Understand refresh modes and scheduling.

---
title: Tutorials: Bulk load data
source: https://docs.snowflake.com/en/user-guide/bulk-load-tutorials.md
section: User Guide
---

# Tutorials: Bulk load data

The following tutorials provide examples and step-by-step instructions you can follow as you learn to bulk
load data into Snowflake:

> **Note:**
>
> These tutorials show you how to load data into a table by using the
> [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command. For information about other options
> for loading data, see [Overview of data loading](data-load-overview.md).

[Bulk load from a local file system using COPY](tutorials/data-load-internal-tutorial.md)
:   Describes how to [bulk load data](data-load-local-file-system.md) from files in your
    local file system into a table.

[Bulk load from Amazon S3 using COPY](tutorials/data-load-external-tutorial.md)
:   Describes how to bulk load data from files in an existing Amazon Simple Storage Service (Amazon S3)
    bucket into a table.

---
title: Tutorials: Load and query data
source: https://docs.snowflake.com/en/user-guide/data-load-tutorials.md
section: User Guide
---

# Tutorials: Load and query data

The following tutorials provide examples and step-by-step instructions you can follow as you learn to load data into Snowflake:

> **Note:**
>
> These tutorials show you how to load data into a table by using the
> [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command. For information about other options
> for loading data, see [Overview of data loading](data-load-overview.md).

[Load and query sample data using SQL](tutorials/tasty-bytes-sql-load.md)
:   Uses a fictitious food truck brand named Tasty Bytes to show you how to
    [load](data-load-overview.md) and query data in Snowflake using
    SQL. You can access a pre-loaded
    [Snowsight template](ui-snowsight/snowsight-templates.md) worksheet
    to complete these tasks.

[Load data from cloud storage: Amazon S3](tutorials/load-from-cloud-tutorial.md)
:   Shows you how to load data from an Amazon S3 bucket into Snowflake using SQL. You can
    access a pre-loaded Snowsight template worksheet to complete these tasks.

[Load data from cloud storage: Microsoft Azure](tutorials/load-from-cloud-tutorial-azure.md)
:   Shows you how to load data from Microsoft Azure cloud storage into Snowflake using SQL.
    You can access a pre-loaded Snowsight template worksheet to complete these tasks.

[Load data from cloud storage: Google Cloud Storage](tutorials/load-from-cloud-tutorial-gcs.md)
:   Shows you how to load data from Google Cloud Storage into Snowflake using SQL.
    You can access a pre-loaded Snowsight template worksheet to complete these tasks.

---
title: Tutorials: Work with semi-structured data
source: https://docs.snowflake.com/en/user-guide/semi-structured-tutorials.md
section: User Guide
---

# Tutorials: Work with semi-structured data

The following tutorials provide examples and step-by-step instructions you can follow as you learn to
work with semi-structured data in Snowflake:

> **Note:**
>
> These tutorials show you how to load data into a table by using the
> [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command. For information about other options
> for loading data, see [Overview of data loading](data-load-overview.md).

[Learn the basics of using JSON with Snowflake](tutorials/json-basics-tutorial.md)
:   Describes the basics of using [JSON](semistructured-data-formats.md) with Snowflake.

[Load JSON data into a relational table](tutorials/script-data-load-transform-json.md)
:   Uses a [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command with a SELECT statement to load individual
    elements in a staged JSON file into a table.

[Load and unload Parquet data](tutorials/script-data-load-transform-parquet.md)
:   Describes how you can upload [Parquet](semistructured-data-formats.md) data by transforming elements of
    a staged Parquet file directly into table columns using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command. The
    tutorial also describes how you can use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload table data
    into a Parquet file.

---
title: TX-RAMP
source: https://docs.snowflake.com/en/user-guide/cert-txramp.md
section: User Guide
---

# TX-RAMP

This topic describes how Snowflake supports customers with TX-RAMP compliance requirements.

## Understanding TX-RAMP compliance requirements

TX-RAMP provides a standardized approach for security assessment, authorization, and continuous monitoring of
cloud computing services used by Texas state agencies. Texas state agencies can use Snowflake to comply with TX-RAMP.

For more information about the service offerings that are currently authorized, see [U.S. regions supporting public sector workloads](intro-regions.md).

You can find an up-to-date inventory of authorized cloud solutions on the
[TX-RAMP website](https://dir.texas.gov/resource-library-item/tx-ramp-certified-cloud-products).

> **Note:**
>
> If your Snowflake account is in a [U.S. government region](intro-regions.md) and you want to access data products that are
> offered privately or on the Snowflake Marketplace, or offer listings either privately or on the Snowflake Marketplace, you must review and
> acknowledge a cross-region disclaimer for your [organization](organizations.md).
>
> For details, see:
>
> * [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
> * [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
> * [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: Understand budget costs
source: https://docs.snowflake.com/en/user-guide/budgets/cost.md
section: User Guide
---

# Understand budget costs

Using budgets incurs the following costs:

* **Compute costs** — Snowflake runs serverless background tasks (_MEASUREMENT_TASK and _BACKFILL_TASK)
  that collect credit usage data for the account budget and custom budgets in your account. The compute resources used for these tasks are
  billed to your account.
* **Storage costs** — Snowflake stores metadata for Budgets in your account. Storage for this metadata is billed to your account.

## Exploring compute costs

You can view costs for serverless tasks using Snowsight or the Account Usage
[SERVERLESS_TASK_HISTORY view](../../sql-reference/account-usage/serverless_task_history.md).

> **Note:**
>
> The _MEASUREMENT_TASK task runs when you add or remove object tags, which incurs cost for the serverless compute needed to run the task.

Example: Compute cost of all budgets
:   The following example sums the credit usage for the measure task for the previous 28 days, which helps you understand the total compute
    cost of using budgets:

    ```sqlexample
    SELECT SUM(credits_used)
       FROM snowflake.account_usage.serverless_task_history
       WHERE task_name = '_MEASUREMENT_TASK'
         AND start_time >= DATEADD('day', -28, current_timestamp());
    ```

Example: Compute cost of individual budgets
:   The following example lists the budgets in the account along with the compute costs associated with each budget within the specified time
    period.

    ```sqlexample
    WITH costs AS (
      SELECT instance_id, SUM(credits_used) AS sum_credits
        FROM snowflake.account_usage.serverless_task_history
        WHERE start_time >= DATE_TRUNC('month',  CURRENT_TIMESTAMP())
          AND instance_id IS NOT NULL
       GROUP BY 1)
    SELECT ci.name, ci.schema_name, ci.database_name, costs.sum_credits
    FROM snowflake.account_usage.class_instances ci
      JOIN costs
        ON costs.instance_id = ci.id
    WHERE class_name = 'BUDGET' AND class_database_name = 'SNOWFLAKE' AND deleted IS NULL;
    ```

## Exploring storage costs

The data and metadata needed for budgets is stored in the following internal tables:

* _CONFIGURATION_TABLE
* _MEASUREMENT_TABLE
* _NOTIFICATION_TABLE
* _BUDGET_HOT_USAGE_DATA
* _BUDGET_COLD_USAGE_DATA
* _BUDGET_CUSTOM_ACTIONS

To determine costs associated with these tables, you can query the TABLES view in the Account Usage or Organization Usage schema to return
the amount of storage being used for the tables.

The following examples returns the sum of the storage being used for the internal tables associated with budgets in the current account:

```sqlexample
SELECT SUM(bytes)
   FROM snowflake.account_usage.tables
   WHERE table_name IN (
      '_CONFIGURATION_TABLE',
      '_MEASUREMENT_TABLE',
      '_NOTIFICATION_TABLE',
      '_BUDGET_HOT_USAGE_DATA',
      '_BUDGET_COLD_USAGE_DATA');
```

---
title: Understand dbt project objects
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-understanding-dbt-project-objects.md
section: User Guide
---

# Understand dbt project objects

A DBT PROJECT is a schema-level object that contains versioned source files for your dbt project in Snowflake. You can connect a dbt project
object to a workspace, or you can create and manage the object independently of a workspace.

A dbt project object is typically based on a dbt project directory that contains a `dbt-project.yml` file. This is the pattern that
Snowflake uses when you [deploy](dbt-projects-on-snowflake-deploy.md) (create) a dbt project object from
within a workspace.

dbt project objects support role-based access control (RBAC). You can CREATE, ALTER, and DROP dbt project objects like other schema-level objects in Snowflake. You can use the [EXECUTE DBT PROJECT](../../sql-reference/sql/execute-dbt-project.md) command from a Snowflake warehouse to run dbt
commands like `test` and `run`. You can also use [tasks](../tasks-intro.md) to schedule execution of these commands.

## How dbt project objects get updated

dbt project objects don’t automatically update as you edit the workspace; you must deploy (that is, add a new version) each time you want the
object to pick up code changes.

To create a production pipeline, we recommend creating a dbt project object and [scheduling its execution with a task](dbt-projects-on-snowflake-schedule-project-execution.md).
Because each dbt project object version is immutable, doing so ensures nothing changes between runs unless someone explicitly adds a new
version.

To update the dbt Project’s files, you must add a new version in a workspace, for example:

```sqlexample
ALTER DBT PROJECT testdbt.public.my_dbt_project_object
  ADD VERSION FROM 'snow://workspace/user$.public."all_my_dbt_projects"/versions/last';
```

If your dbt Project is backed by Git and you want to automate your testing and deployment, run the Snow CLI `snow dbt deploy` command
with the `--force` option, as shown in the following example:

```snowcli
snow dbt deploy --source 'snow://workspace/user$.public."all_my_dbt_projects"/versions/last'  --force my_dbt_project;
```

`--force` enables you to add a version; without it, it would be the equivalent of running CREATE DBT PROJECT on an already created
object, which would fail.

For more information about versioning, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).

---
title: Understand dependencies for dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-dependencies.md
section: User Guide
---

# Understand dependencies for dbt Projects on Snowflake

In dbt Projects on Snowflake, dbt dependencies are the packages that you declare in your `packages.yml` file (for example, `dbt-labs/dbt_utils` from the
[Getting started tutorial](../tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md)). They get installed into a
`dbt_packages` folder when you run `dbt deps`, just like in dbt Core.

You must execute the `dbt deps` command within a Snowflake workspace to populate the `dbt_packages` folder for your dbt Project.
Alternatively, you can run `dbt deps` on your local machine or git orchestrator (for example, GitHub Actions) and deploy with
`snow dbt deploy`.

Once a dbt project version is created, think of it as read-only code. You don’t modify its files with `dbt deps`; you create a new
version if you need updated dependencies.

## About executing the dbt deps command

You can execute the `dbt deps` command in one of the following ways:

* **In a Snowflake Workspace:** (Recommended for dev environments.) You can execute the `dbt deps` command inside your workspace in
  Snowsight to populate `dbt_packages` before you deploy your dbt Project as a DBT PROJECT object.

  This requires external network access so Snowflake can access the repositories for the dependencies. For more information, see
  Create an external access integration in Snowflake for dbt dependencies.
* **Outside Snowflake:** (For example, in the build step of your deployment pipeline.) You can execute the `dbt deps` command on your
  local machine or in your continuous integration (CI), which downloads packages into `dbt_packages`, then deploy the whole project
  (including that folder) into Snowflake.

  This doesn’t require an external network access integration because all dependencies are already included in the dbt project.

  Because the files in a dbt project version are immutable, if you try to execute `dbt deps` against a deployed object, this would have
  no effect on the `dbt_packages` folder within the object.

## Cross dbt project dependencies

In order to reference another dbt project within your dbt project, the dbt project being referenced must be copied into the root of your dbt
project. Snowflake only supports references in the same folder. For example, `:local: ../some_other_project` isn’t supported.

Although local dependencies don’t require an external access integration, if you need a mix of local packages and remote packages (for example, from dbt Packages hub or Git), you must configure a real external access integration.

Take, for example, the following two dbt projects. You want `core_project` to include `metrics_project` locally so that everything
is self-contained when you deploy to Snowflake (no external access needed).

```text
/Projects
├─ core_project/
│   ├─ dbt_project.yml
│   ├─ packages.yml
│   ├─ models/
│   └─ ...
└─ metrics_project/
    ├─ dbt_project.yml
    ├─ models/
    └─ ...
```

* `core_project`: This is your main project (the one that you’ll deploy).
* `metrics_project`: This is the project you want to use as a local dependency.

To reference `metrics_project` inside `core_project`, complete the following steps:

1. Inside of `core_project`, create a folder named `local_packages`. Copy `metrics_project` into this folder.

   Make sure that `metrics_project` has a different name in its `dbt_project.yml` than `core_project`. They must be unique.

   ```bash
   cd /Projects/core_project
   mkdir local_packages
   cp -R ../metrics_project ./local_packages/metrics_project
   ```

   Now, your layout looks like this:

   ```text
   core_project/
     ├─ dbt_project.yml
     ├─ packages.yml
     ├─ models/
     ├─ local_packages/
     │   └─ metrics_project/
     │       ├─ dbt_project.yml
     │       ├─ models/
     │       └─ ...
   ```
2. In `core_project/packages.yml`, declare the local dependency using the relative path.

   ```yaml
   packages:
     - local: local_packages/metrics_project
   ```
3. From inside `core_project`, run `dbt deps`.

   dbt will now treat `metrics_project` as a package and macros from `metrics_project` are available to `core_project`.

## Run dbt deps automatically at compilation

When you deploy or update a dbt project object and give it an external access integration, Snowflake can automatically run `dbt deps`
during compilation so that dependencies are installed as part of that step. This means you no longer need to include `/dbt_packages`
when deploying projects with external dependencies.

SnowsightSQLSnowflake CLI

When you deploy your dbt project object from the workspace to a Snowflake database and schema, you can create or update an object that
you previously created.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Workspaces.
3. In the Workspaces menu, select the workspace that contains your dbt project.
4. On the right side of the workspace editor, select Connect » Deploy dbt project.
5. In the Deploy dbt project popup window, select the following:

   * Under Select location, select your database and schema.
   * Under Select or Create dbt project, select Create dbt project.
   * Enter a name and description.
   * Optionally, enter a default target to choose which profile will be used for compilation and subsequent runs (for example, prod). The
     target of a dbt project run can still be overridden with `--target` in `ARGS`.
   * Optionally, select Run dbt deps, then select your external access integration to execute `dbt deps` automatically during
     deployment.
6. Select Deploy.

The Output tab displays the command that runs on Snowflake, which is similar to the following example:

```sqlexample
CREATE DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  FROM 'snow://workspace/mydb.my_dbt_projects_schema.sales_model/versions/version$2'
  DEFAULT_TARGET = 'prod'
  EXTERNAL_ACCESS_INTEGRATIONS = (my_dbt_ext_access);
```

```output
my_dbt_project successfully created.
```

The Connect menu now displays the name of the dbt project object that you created, with the following options:

* Redeploy dbt project: Updates the dbt project object with the current workspace version of the project by using ALTER. This
  increments the version of the dbt project object by one. For more information, see [Versions for dbt project objects and files](dbt-projects-on-snowflake-versions.md).
* Disconnect: Disconnects the workspace from the dbt project object, but doesn’t delete the dbt project object.
* Edit project: Update the comment, default target, and external access integration for the dbt project object.
* View project: Opens the dbt project object in the object explorer, where you can view the CREATE DBT PROJECT command for the dbt
  project object and run history for the project.
* Create schedule: Provides options for you to create a task that runs the dbt project object on a schedule. For more information,
  see [Create a task to schedule dbt project execution](../tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md).
* View schedules: Opens a list of schedules (tasks) that run the dbt project object, with the option to view task details in the
  object explorer.

To automatically run `dbt deps` during compile, run the CREATE DBT PROJECT or ALTER DBT PROJECT command with the
EXTERNAL_ACCESS_INTEGRATIONS parameter, as shown in the following example.

You can pass an empty array into the EXTERNAL_ACCESS_INTEGRATIONS parameter or you can specify one or more external access integrations,
depending on your use case. Local dependencies don’t require an external access integration, but if you need a mix of local packages and
remote packages (for example, from dbt Packages hub or Git), you must configure a real external access integration.

```sqlexample
-- Create a dbt project object that runs dbt deps on compile for remote packages
CREATE DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  FROM 'snow://workspace/mydb.my_dbt_projects_schema.sales_model/versions/version$2'
  EXTERNAL_ACCESS_INTEGRATIONS = (my_dbt_ext_access);

-- Create a dbt project object that runs dbt deps on compile for only local dependencies
CREATE DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  FROM 'snow://workspace/mydb.my_dbt_projects_schema.sales_model/versions/version$2'
  EXTERNAL_ACCESS_INTEGRATIONS = ();
```

```sqlexample
-- Update the Git repository object to fetch the latest code
ALTER GIT REPOSITORY mydb.dev_schema.my_dbt_git_stage FETCH;

-- Set external access integrations
ALTER DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  SET EXTERNAL_ACCESS_INTEGRATIONS = ();

-- Add a new version to the dbt project object based on the updated Git repository object
-- After an external access integration is set, the next ALTER DBT PROJECT ... ADD VERSION will call dbt deps during compile
ALTER DBT PROJECT mydb.my_dbt_projects_schema.my_dbt_project
  ADD VERSION
  FROM '@mydb.dev_schema.my_dbt_git_stage/branches/main/sales_dbt_project';
```

To automatically run `dbt deps` during compile, run the [snow dbt deploy](../../developer-guide/snowflake-cli/command-reference/dbt-commands/deploy.md)
command with either the `--external-access-integration` or `--install-local-deps` flag, as shown in the following example.

The `--install-local-deps` flag creates an object that has an empty external access integration. On a regular compile, it runs
`dbt deps` and replaces the previous state of the `dbt_packages` folder.

The `--external-access-integration` flag adds an external access integration, which takes precedence over the
`--install-local-deps` flag.

```snowcli
snow dbt deploy my_dbt_project --install-local-deps;
```

## Create an external access integration in Snowflake for dbt dependencies

When you run dbt commands in a workspace, dbt might need to access remote URLs to download dependencies. For example, dbt might need to
download packages from the dbt Package hub or from GitHub.

Most dbt projects specify dependencies in their `packages.yml` file. You must install these dependencies in the dbt project workspace.

You can’t update a deployed dbt project object with dependencies. To update a dbt project object with new dependencies, you must add a new
version to the object. For more information, see [How dbt project objects get updated](dbt-projects-on-snowflake-understanding-dbt-project-objects.md).

To get dbt package from remote URLs, Snowflake needs an external access integration that relies on a network rule, as shown in the
following example:

```sqlexample
-- Create NETWORK RULE for external access integration

CREATE OR REPLACE NETWORK RULE my_dbt_network_rule
  MODE = EGRESS
  TYPE = HOST_PORT
  -- Minimal URL allowlist that is required for dbt deps
  VALUE_LIST = (
    'hub.getdbt.com',
    'codeload.github.com'
    );

-- Create EXTERNAL ACCESS INTEGRATION for dbt access to external dbt package locations

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION my_dbt_ext_access
  ALLOWED_NETWORK_RULES = (my_dbt_network_rule)
  ENABLED = TRUE;
```

For more information about external access integrations in Snowflake, see [Creating and using an external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

## Limitations, requirements, and considerations for dbt dependencies

The following requirements, considerations, and limitations apply to dbt dependencies for dbt projects in dbt Projects on Snowflake:

* You must execute the `dbt deps` command within a Snowflake workspace to populate the `dbt_packages` folder for your dbt Project.
  Alternatively, you can run `dbt deps` on your local machine or Git orchestrator and deploy with `snow dbt deploy`.

  A dbt Project object is a versioned snapshot, so running `dbt deps` with EXECUTE DBT PROJECT or `snow dbt execute` doesn’t
  modify any files; it mainly checks that your external access is configured correctly.
* You can specify public [Git packages](https://docs.getdbt.com/docs/build/packages#git-packages) in the `packages.yml` file. As a best practice, Snowflake recommends using private Git packages
  only if they are stored securely. We don’t recommend embedding unencrypted Git tokens.
* A network rule and external access integration are required to allow Snowflake to access the repositories for the dependencies. For more
  information, see Create an external access integration in Snowflake for dbt dependencies.
* A dbt project object is a versioned snapshot of your project. Running the `deps` command on it doesn’t modify any files; it’s
  primarily used to verify that your external access configuration is correct. When a dbt project object is created with an external access
  integration, `dbt deps` is run before `dbt compile` to package all dependencies and project files.
* Snowflake only supports referencing another dbt project in the same folder. For example, `:local: ../some_other_project` isn’t
  supported. For a workaround, see Cross dbt project dependencies.

---
title: Understand schema generation and customization
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-schema-customization.md
section: User Guide
---

# Understand schema generation and customization

dbt uses the default macro `generate_schema_name` to decide where a model is built.

By default, it uses your target schema (`target.schema`) specified from your dbt environment or profile. Unlike dbt Core behavior, the
target schema specified in the `profiles.yml` file must already exist in Snowflake before you
[deploy your dbt project object](dbt-projects-on-snowflake-deploy.md).
Otherwise, the project fails to compile or execute.

Typically, each developer has their own target schema, for example `analytics_dev`. For larger projects, you can set a custom schema to
group models and specify the schema configuration key in your `dbt_project.yml` file. dbt appends it to the target schema (for example,
`<target_schema>_<custom_schema>`) to keep intermediate and user-facing models separate.

```sqlexample
--Models in `models/tasty_bytes/ will be built in the "*_staging" schema
models:
  tasty_bytes:
      +schema: staging
```

A model’s custom schema doesn’t replace the target schema; rather, dbt combines them to avoid collisions. For example, `analytics_dev_staging`.
This is because if dbt ignored the target schema and only used the custom schema (in this case, `staging`), every developer would write to
the same schema and overwrite each other.

If you want different behavior (for example, use only the custom schema, prepend user names, add environment prefixes, etc.), override
`generate_schema_name` in `/macros/` to change how the final schema name is built. For more information and examples, see
[Changing the way dbt generates a schema name](https://docs.getdbt.com/docs/build/custom-schemas#changing-the-way-dbt-generates-a-schema-name) in the dbt documentation.

---
title: Understand warehouse usage for dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-warehouses.md
section: User Guide
---

# Understand warehouse usage for dynamic tables

Every dynamic table requires a warehouse to run its refreshes. You specify this warehouse when you
create the dynamic table, and Snowflake uses it automatically for all scheduled refreshes.

For guidance on configuring warehouses for your dynamic tables, see
[Adjust your warehouse configuration](dynamic-tables-performance-optimize.md).

## How warehouse size affects refresh performance

A larger warehouse doesn’t always result in higher costs. In many cases, doubling the warehouse size
doubles the per-second cost but halves the runtime. This results in similar total cost with faster
refreshes. Larger warehouses improve performance in two ways:

* **Memory**: When a refresh needs more memory than the warehouse provides, data spills to local storage.
  This spillage increases the total compute work and slows the refresh process. A larger warehouse has more memory and can
  avoid spills entirely.
* **Parallelism**: Larger warehouses run more tasks simultaneously. Refreshes that scan large amounts
  of data across many partitions benefit the most. Small data sets and sequential operations see
  diminishing returns when you use a larger warehouse.

For more information about warehouse sizing, see [Warehouse size](warehouses-overview.md).

## Dual warehouse support

Dynamic tables support separate warehouses for different refresh types:

* **WAREHOUSE**: Runs regular incremental refreshes.
* **INITIALIZATION_WAREHOUSE**: Runs [initializations and full refreshes](dynamic-tables-refresh.md), which perform full data scans and are typically more
  resource-intensive.

  > **Important:**
  >
  > When you [refresh manually](dynamic-tables-manual-refresh.md) by running
  > [ALTER DYNAMIC TABLE REFRESH COPY SESSION](../sql-reference/sql/alter-dynamic-table.md), the
  > command uses the current session’s warehouse. Snowflake ignores the INITIALIZATION_WAREHOUSE in this scenarios,
  > even for initializations.

This separation lets you use a larger warehouse for resource-intensive initializations without
paying for that capacity during regular incremental refreshes. Dual warehouse support is useful in the following common scenarios:

* You want to enable faster recovery when you promote a secondary dynamic table to primary and must reinitialize the table.
* You must meet strict RTO/RPO requirements, but don’t want to increase costs for day-to-day operations.

When you don’t set the INITIALIZATION_WAREHOUSE parameter, Snowflake runs all refreshes on the warehouse specified by WAREHOUSE.

---
title: Understanding & using Time Travel
source: https://docs.snowflake.com/en/user-guide/data-time-travel.md
section: User Guide
---

# Understanding & using Time Travel

Snowflake Time Travel enables accessing historical data (that is, data that has been changed or deleted) at any point within a defined period.

It serves as a powerful tool for performing the following tasks:

* Restoring objects that might have been accidentally or intentionally deleted. You can restore individual objects,
  such as tables, or restore all the objects inside a container object by restoring an entire schema or database.
* Duplicating and backing up data from key points in the past.
* Analyzing data usage/manipulation over specified periods of time.

## Introduction to Time Travel

Using Time Travel, you can perform the following actions within a defined period of time:

* Query data in the past that has since been updated or deleted.
* Create clones of entire tables, schemas, and databases at or before specific points in the past.
* Restore tables, schemas, databases, and some other kinds of objects that have been dropped.

> **Note:**
>
> When querying historical data in a table or non-materialized view, the current table or view schema is used. For more
> information, see [Usage notes](../sql-reference/constructs/at-before.md) for AT | BEFORE.

After the defined period of time has elapsed, the data is moved into [Snowflake Fail-safe](data-failsafe.md) and these actions
can no longer be performed.

> **Note:**
>
> A long-running Time Travel query will delay moving any data and objects (tables, schemas, and databases) in the account into Fail-safe,
> until the query completes.

### Time Travel SQL extensions

To support Time Travel, the following SQL extensions have been implemented:

* [AT | BEFORE](../sql-reference/constructs/at-before.md) clause which can be specified in SELECT statements and CREATE … CLONE commands (immediately
  after the object name). The clause uses one of the following parameters to pinpoint the exact historical data you want to access:

  + TIMESTAMP
  + OFFSET (time difference in seconds from the present time)
  + STATEMENT (query ID for statement)
* [UNDROP <object>](../sql-reference/sql/undrop.md) command for tables, schemas, databases, accounts, external volumes, and tags.

### Data retention period

A key component of Snowflake Time Travel is the data retention period.

When data in a table is modified, including deletion of data or dropping an object containing data, Snowflake preserves the state of the data
before the update. The data retention period specifies the number of days for which this historical data is preserved and, therefore,
Time Travel operations (SELECT, CREATE … CLONE, UNDROP) can be performed on the data.

The standard retention period is 1 day (24 hours) and is automatically enabled for all Snowflake accounts:

* For Snowflake Standard Edition, the retention period can be set to 0 (or unset back to the default of 1 day) at the account and object
  level (that is, databases, schemas, and tables).
* For Snowflake Enterprise Edition (and higher):

  + For transient databases, schemas, and tables, the retention period can be set to 0 (or unset back to the default of 1 day). The same
    is also true for temporary tables.
  + For permanent databases, schemas, and tables, the retention period can be set to any value from 0 up to 90 days.

> **Note:**
>
> A retention period of 0 days for an object effectively deactivates Time Travel for the object.

When the retention period ends for an object, the historical data is moved into [Snowflake Fail-safe](data-failsafe.md):

* Historical data is no longer available for querying.
* Past objects can no longer be cloned.
* Past objects that were dropped can no longer be restored.

To specify the data retention period for Time Travel:

* The [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) object parameter can be used by users with the ACCOUNTADMIN role to set the default
  retention period for your account.
* The same parameter can be used to explicitly override the default when creating a database, schema, and individual table.
* The data retention period for a database, schema, or table can be changed at any time.
* The [MIN_DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) account parameter can be set by users with the ACCOUNTADMIN role to set a minimum
  retention period for the account. This parameter does not alter or replace the DATA_RETENTION_TIME_IN_DAYS parameter value. However it
  may change the effective data retention time. When this parameter is set at the account level, the effective minimum data retention
  period for an object is determined by MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

### Limitations

When using Time Travel, the following object types are not cloned:

> * External tables
> * Internal (Snowflake) stages
> * Hybrid tables can be cloned for databases but not for schemas.
> * User tasks in a database or schema are not cloned when using CREATE SCHEMA … TIMESTAMP. In the following example, tasks in the source schema (S1) are not cloned to the schema with a timestamp (S2) but are cloned to the schema without a timestamp (S3).
>
>   ```sqlexample
>   CREATE SCHEMA S1;
>   USE SCHEMA S1;
>   CREATE TASK T1 AS SELECT 1;
>   CREATE SCHEMA S2 CLONE S1 AT(TIMESTAMP => '2025-04-01 12:00:00');
>     -- T1 is not cloned into S2
>   CREATE SCHEMA S3 CLONE S1;
>     -- T1 is cloned into S3
>   ```

## Enabling and deactivating Time Travel

No tasks are required to enable Time Travel. It is automatically enabled with the standard, 1-day retention period.

However, you may want to upgrade to Snowflake Enterprise Edition to enable configuring longer data retention periods of up to 90 days
for databases, schemas, and tables. Note that extended data retention requires additional storage which will be reflected in your monthly
storage charges. For more information about storage charges, see [Storage costs for Time Travel and Fail-safe](data-cdp-storage-costs.md).

Time Travel cannot be deactivated for an account. A user with the ACCOUNTADMIN role can set [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) to 0 at
the account level, which means that all databases (and subsequently all schemas and tables) created in the account have no retention period
by default; however, this default can be overridden at any time for any database, schema, or table.

A user with the ACCOUNTADMIN role can also set the [MIN_DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) at the account level. This parameter
setting enforces a minimum data retention period for databases, schemas, and tables. Setting MIN_DATA_RETENTION_TIME_IN_DAYS does not
alter or replace the DATA_RETENTION_TIME_IN_DAYS parameter value. It may, however, change the effective data retention period for objects.
When MIN_DATA_RETENTION_TIME_IN_DAYS is set at the account level, the data retention period for an object is determined by
MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

Time Travel can be deactivated for individual databases, schemas, and tables by specifying [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) with a
value of 0 for the object. However, if DATA_RETENTION_TIME_IN_DAYS is set to a value of 0, and MIN_DATA_RETENTION_TIME_IN_DAYS is set
at the account level and is greater than 0, the higher value setting takes precedence.

> **Attention:**
>
> Before setting [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) to 0 for any object, consider whether you want to deactivate Time Travel for the object,
> particularly as it pertains to recovering the object if it is dropped. When an object with no retention period is dropped, you will not
> be able to restore the object.
>
> As a general rule, we recommend maintaining a value of (at least) 1 day for any given object.

If the Time Travel retention period is set to 0, any modified or deleted data is moved into Fail-safe (for permanent tables)
or deleted (for transient tables) by a background process. This may take a short time to complete. During that time, the
TIME_TRAVEL_BYTES in table storage metrics might contain a non-zero value even when the Time Travel retention period is 0 days.

## Specifying the data retention period for an object

By default, the maximum retention period is 1 day (one 24-hour period). With Snowflake Enterprise Edition (and higher), the default
for your account can be set to any value up to 90 days:

* When creating a table, schema, or database, the account default can be overridden using the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md)
  parameter in the command.
* If a retention period is specified for a database or schema, the period is inherited by default for all objects created in the
  database/schema.

A minimum retention period can be set on the account using the [MIN_DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter. If this parameter is
set at the account level, the data retention period for an object is determined by
MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

## Checking the data retention period for an object

To check the current retention period for a table, schema, or database, you can check the value of the `retention_time`
column in the output of the corresponding SHOW command, such as [SHOW TABLES](../sql-reference/sql/show-tables.md),
[SHOW SCHEMAS](../sql-reference/sql/show-schemas.md), or [SHOW DATABASES](../sql-reference/sql/show-databases.md).

For objects that are derived from tables, schemas, or databases, such as materialized views, you can examine the
retention periods of the parent objects.

For streams, you can check the value of the `stale_after` column in the output from the
[SHOW STREAMS](../sql-reference/sql/show-streams.md) command.

To include information about objects that have already been dropped, include the HISTORY clause with the
SHOW command.

The following example shows how you might check the retention periods of certain objects by filtering the output of
the SHOW commands.

The following example checks the retention period for specific named tables:

```sqlexample
SHOW TABLES
  ->> SELECT "name", "retention_time"
        FROM $1
        WHERE "name" IN ('MY_TABLE1', 'MY_TABLE2');
```

The following example checks for schemas where Time Travel is turned off:

```sqlexample
SHOW SCHEMAS
  ->> SELECT "name", "retention_time"
        FROM $1
        WHERE "retention_time" = 0;
```

The following example checks for databases where retention time is larger than the default.
The results include databases that have already been dropped.

```sqlexample
SHOW DATABASES HISTORY
  ->> SELECT "name", "retention_time", "dropped_on"
        FROM $1
        WHERE "retention_time" > 1;
```

## Changing the data retention period for an object

If you change the data retention period for a table, the new retention period impacts all data that is active, as well as any data currently
in Time Travel. The impact depends on whether you increase or decrease the period:

Increasing Retention:
:   Causes the data currently in Time Travel to be retained for the longer time period.

    For example, if you have a table with a 10-day retention period and increase the period to 20 days, data that would have been removed
    after 10 days is now retained for an additional 10 days before moving into Fail-safe.

    Note that this doesn’t apply to any data that is older than 10 days and has already moved into Fail-safe.

Decreasing Retention:
:   Reduces the amount of time data is retained in Time Travel:

    * For active data modified after the retention period is reduced, the new shorter period applies.
    * For data that is currently in Time Travel:

      > + If the data is still within the new shorter period, it remains in Time Travel.
      > + If the data is outside the new period, it moves into Fail-safe.

    For example, if you have a table with a 10-day retention period and you decrease the period to 1-day, data from days 2 to 10 will be moved
    into Fail-safe, leaving only the data from day 1 accessible through Time Travel.

    However, the process of moving the data from Time Travel into Fail-safe is performed by a background process, so the change is not immediately
    visible. Snowflake guarantees that the data will be moved, but does not specify when the process will complete; until the background process
    completes, the data is still accessible through Time Travel.

> **Note:**
>
> If you change the data retention period for a database or schema, the change only affects active objects contained within
> the database or schema. Any objects that have been dropped (for example, tables) remain unaffected.
>
> For example, if you have a schema `s1` with a 90-day retention period and table `t1` is in schema `s1`,
> table `t1` inherits the 90-day retention period. If you drop table `s1.t1`, `t1` is retained in Time Travel
> for 90 days. Later, if you change the schema’s data retention period to 1 day, the retention
> period for the dropped table `t1` is unchanged. Table `t1` will still be retained in Time Travel for 90 days.
>
> To alter the retention period of a dropped object, you must undrop the object, then alter its retention period.

To change the retention period for an object, use the appropriate [ALTER <object>](../sql-reference/sql/alter.md) command. For example, to change the
retention period for a table:

```sqlexample
CREATE TABLE mytable(col1 NUMBER, col2 DATE) DATA_RETENTION_TIME_IN_DAYS=90;

ALTER TABLE mytable SET DATA_RETENTION_TIME_IN_DAYS=30;
```

> **Attention:**
>
> Changing the retention period for your account or individual objects changes the value for all lower-level objects that do not have a
> retention period explicitly set. For example:
>
> * If you change the retention period at the account level, all databases, schemas, and tables that do not have an explicit retention period
>   automatically inherit the new retention period.
> * If you change the retention period at the schema level, all tables in the schema that do not have an explicit retention period inherit the
>   new retention period.
>
> Keep this in mind when changing the retention period for your account or any objects in your account because the change might have
> Time Travel consequences that you did not anticipate or intend. In particular, we do not recommend changing the retention period to 0
> at the account level.

### Dropped containers and object retention inheritance

> **Warning:**
>
> Currently, when a database is dropped, the data retention period for child schemas or tables, if explicitly set to be different from the
> retention of the database, is not honored. The child schemas or tables are retained for the same period of time as the database.
>
> Similarly, when a schema is dropped, the data retention period for child tables, if explicitly set to be different from the retention of
> the schema, is not honored. The child tables are retained for the same period of time as the schema.
>
> To honor the data retention period for these child objects (schemas or tables), drop them explicitly before you drop the database
> or schema.

## Querying historical data

When any DML operations are performed on a table, Snowflake retains previous versions of the table data for a defined period of time. This
enables querying earlier versions of the data using the [AT | BEFORE](../sql-reference/constructs/at-before.md) clause.

This clause supports querying data either exactly at or immediately preceding a specified point in the table’s history within the
retention period. The specified point can be time-based (for example, a timestamp or time offset from the present) or it can be the ID for a
completed statement (for example, SELECT or INSERT).

For example:

* The following query selects historical data from a table as of the date and time represented by the specified
  [timestamp](../sql-reference/data-types-datetime.md):

  > ```sqlexample
  > SELECT * FROM my_table AT(TIMESTAMP => 'Wed, 26 Jun 2024 09:20:00 -0700'::timestamp_tz);
  > ```
* The following query selects historical data from a table as of 5 minutes ago:

  > ```sqlexample
  > SELECT * FROM my_table AT(OFFSET => -60*5);
  > ```
* The following query selects historical data from a table up to, but not including any changes made by the specified statement:

  > ```sqlexample
  > SELECT * FROM my_table BEFORE(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726');
  > ```

> **Note:**
>
> If the TIMESTAMP, OFFSET, or STATEMENT specified in the [AT | BEFORE](../sql-reference/constructs/at-before.md) clause falls outside the data
> retention period for the table, the query fails and returns an error.

## Cloning historical objects

In addition to queries, the [AT | BEFORE](../sql-reference/constructs/at-before.md) clause can be used with the CLONE keyword in the CREATE command
for a table, schema, or database to create a logical duplicate of the object at a specified point in the object’s history. If you don’t
specify a point in time, the clone defaults to the state of the object as of now
(the [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) value).

For example:

* The following [CREATE TABLE](../sql-reference/sql/create-table.md) statement creates a clone of a table as of the date and time represented by the
  specified timestamp:

  > ```sqlexample
  > CREATE TABLE restored_table CLONE my_table
  >   AT(TIMESTAMP => 'Wed, 26 Jun 2024 01:01:00 +0300'::timestamp_tz);
  > ```
* The following [CREATE SCHEMA](../sql-reference/sql/create-schema.md) statement creates a clone of a schema and all its objects as they existed 1 hour
  before the current time:

  > ```sqlexample
  > CREATE SCHEMA restored_schema CLONE my_schema AT(OFFSET => -3600);
  > ```
* The following [CREATE DATABASE](../sql-reference/sql/create-database.md) statement creates a clone of a database and all its objects as they existed prior
  to the completion of the specified statement:

  > ```sqlexample
  > CREATE DATABASE restored_db CLONE my_db
  >   BEFORE(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726');
  > ```

> **Note:**
>
> The cloning operation for a database or schema fails:
>
> > * If the specified Time Travel time is beyond the retention time of any current child (for example, a table) of the entity.
> >
> >   As a workaround for child objects that have been purged from Time Travel, use the
> >   [IGNORE TABLES WITH INSUFFICIENT DATA RETENTION](../sql-reference/sql/create-clone.md) parameter of the
> >   CREATE <object> … CLONE command. For more information, see [Child objects and data retention time](object-clone.md).
> > * If the specified Time Travel time is at or before the point in time when the object was created.

* The following [CREATE DATABASE](../sql-reference/sql/create-database.md) statement creates a clone of a database and all its objects as they existed
  four days ago, skipping any tables that have a data retention period of less than four days:

  > ```sqlexample
  > CREATE DATABASE restored_db CLONE my_db
  >   AT(TIMESTAMP => DATEADD(days, -4, current_timestamp)::timestamp_tz)
  >   IGNORE TABLES WITH INSUFFICIENT DATA RETENTION;
  > ```

## Dropping and restoring objects

The following sections explain the Time Travel considerations for the DROP, SHOW, and UNDROP commands.

### Dropping objects

When a table, schema, or database is dropped, it is not immediately overwritten or removed from the system. Instead, it is retained for the
data retention period for the object, during which time the object can be restored. Once dropped objects are moved to
[Fail-safe](data-failsafe.md), you cannot restore them.

To drop a notebook, table, schema, or database, use the following commands:

* [DROP NOTEBOOK](../sql-reference/sql/drop-notebook.md)
* [DROP TABLE](../sql-reference/sql/drop-table.md)
* [DROP SCHEMA](../sql-reference/sql/drop-schema.md)
* [DROP DATABASE](../sql-reference/sql/drop-database.md)

> **Note:**
>
> After dropping an object, creating an object with the same name does not restore the object. Instead, it creates a new version of the
> object. The original, dropped version is still available and can be restored.
>
> Restoring a dropped object restores the object in place (that is, it does not create a new object).

### Listing dropped objects

Dropped objects can be listed using the following commands with the HISTORY keyword specified:

* [SHOW TABLES](../sql-reference/sql/show-tables.md)
* [SHOW SCHEMAS](../sql-reference/sql/show-schemas.md)
* [SHOW DATABASES](../sql-reference/sql/show-databases.md)
* [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md)

For example:

> ```sqlexample
> SHOW TABLES HISTORY LIKE 'load%' IN mytestdb.myschema;
>
> SHOW SCHEMAS HISTORY IN mytestdb;
>
> SHOW DATABASES HISTORY;
> ```

The output includes all dropped objects and an additional DROPPED_ON column, which displays the date and time when the object was dropped.
If an object has been dropped more than once, each version of the object is included as a separate row in the output.

> **Note:**
>
> After the retention period for an object has passed and the object has been purged, it is no longer displayed in the
> SHOW *<object_type>* HISTORY output.

### Restoring objects

A dropped object that has not been purged from the system (that is, the object is displayed in the SHOW *<object_type>* HISTORY output) can be
restored using the following commands:

* [UNDROP NOTEBOOK](../sql-reference/sql/undrop-notebook.md)
* [UNDROP TABLE](../sql-reference/sql/undrop-table.md)
* [UNDROP SCHEMA](../sql-reference/sql/undrop-schema.md)
* [UNDROP DATABASE](../sql-reference/sql/undrop-database.md)
* [UNDROP ICEBERG TABLE](../sql-reference/sql/undrop-iceberg-table.md)
* [UNDROP DYNAMIC TABLE](../sql-reference/sql/undrop-dynamic-table.md)
* [UNDROP EXTERNAL VOLUME](../sql-reference/sql/undrop-external-volume.md)
* [UNDROP TAG](../sql-reference/sql/undrop-tag.md)
* [UNDROP ACCOUNT](../sql-reference/sql/undrop-account.md)

Calling UNDROP restores the object to its most recent state before the DROP command was issued.

For example:

> ```sqlexample
> UNDROP TABLE mytable;
>
> UNDROP SCHEMA myschema;
>
> UNDROP DATABASE mydatabase;
>
> UNDROP NOTEBOOK mynotebook;
> ```

> **Note:**
>
> If an object with the same name already exists, UNDROP fails. You must rename the existing object, which then enables you to restore
> the previous version of the object.

### Access control requirements and name resolution

Similar to dropping an object, a user must have OWNERSHIP privileges for an object to restore it. In addition, the user must have CREATE
privileges on the object type for the database or schema where the dropped object will be restored.

Restoring tables and schemas is only supported in the current schema or current database, even if a fully-qualified object name is specified.

### Example: Dropping and restoring a table multiple times

In the following example, the `mytestdb.public` schema contains two tables: `loaddata1` and `proddata1`. The `loaddata1` table is
dropped and recreated twice, creating three versions of the table:

> * Current version
> * Second (most recent) dropped version
> * First dropped version

The example then illustrates how to restore the two dropped versions of the table:

> 1. First, the current table with the same name is renamed to `loaddata3`. This enables restoring the most recent version of the dropped
>    table, based on the timestamp.
> 2. Then, the most recent dropped version of the table is restored.
> 3. The restored table is renamed to `loaddata2` to enable restoring the first version of the dropped table.
> 4. Lastly, the first version of the dropped table is restored.
>
> ```sqlexample
> SHOW TABLES HISTORY;
>
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> | created_on                      | name      | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | dropped_on                      |
> |---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------|
> | Tue, 17 Mar 2016 17:41:55 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 48   | 16248 | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:51:30 -0700 | PRODDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 12   | 4096  | PUBLIC | 1              | [NULL]                          |
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
>
> DROP TABLE loaddata1;
>
> SHOW TABLES HISTORY;
>
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> | created_on                      | name      | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | dropped_on                      |
> |---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------|
> | Tue, 17 Mar 2016 17:51:30 -0700 | PRODDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 12   | 4096  | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:41:55 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 48   | 16248 | PUBLIC | 1              | Fri, 13 May 2016 19:04:46 -0700 |
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
>
> CREATE TABLE loaddata1 (c1 number);
> INSERT INTO loaddata1 VALUES (1111), (2222), (3333), (4444);
>
> DROP TABLE loaddata1;
>
> CREATE TABLE loaddata1 (c1 varchar);
>
> SHOW TABLES HISTORY;
>
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> | created_on                      | name      | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | dropped_on                      |
> |---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------|
> | Fri, 13 May 2016 19:06:01 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 0    | 0     | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:51:30 -0700 | PRODDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 12   | 4096  | PUBLIC | 1              | [NULL]                          |
> | Fri, 13 May 2016 19:05:32 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 4    | 4096  | PUBLIC | 1              | Fri, 13 May 2016 19:05:51 -0700 |
> | Tue, 17 Mar 2016 17:41:55 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 48   | 16248 | PUBLIC | 1              | Fri, 13 May 2016 19:04:46 -0700 |
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
>
> ALTER TABLE loaddata1 RENAME TO loaddata3;
>
> UNDROP TABLE loaddata1;
>
> SHOW TABLES HISTORY;
>
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> | created_on                      | name      | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | dropped_on                      |
> |---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------|
> | Fri, 13 May 2016 19:05:32 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 4    | 4096  | PUBLIC | 1              | [NULL]                          |
> | Fri, 13 May 2016 19:06:01 -0700 | LOADDATA3 | MYTESTDB      | PUBLIC      | TABLE |         |            | 0    | 0     | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:51:30 -0700 | PRODDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 12   | 4096  | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:41:55 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 48   | 16248 | PUBLIC | 1              | Fri, 13 May 2016 19:04:46 -0700 |
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
>
> ALTER TABLE loaddata1 RENAME TO loaddata2;
>
> UNDROP TABLE loaddata1;
>
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> | created_on                      | name      | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | dropped_on                      |
> |---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------|
> | Tue, 17 Mar 2016 17:41:55 -0700 | LOADDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 48   | 16248 | PUBLIC | 1              | [NULL]                          |
> | Fri, 13 May 2016 19:05:32 -0700 | LOADDATA2 | MYTESTDB      | PUBLIC      | TABLE |         |            | 4    | 4096  | PUBLIC | 1              | [NULL]                          |
> | Fri, 13 May 2016 19:06:01 -0700 | LOADDATA3 | MYTESTDB      | PUBLIC      | TABLE |         |            | 0    | 0     | PUBLIC | 1              | [NULL]                          |
> | Tue, 17 Mar 2016 17:51:30 -0700 | PRODDATA1 | MYTESTDB      | PUBLIC      | TABLE |         |            | 12   | 4096  | PUBLIC | 1              | [NULL]                          |
> +---------------------------------+-----------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+---------------------------------+
> ```

---
title: Understanding and viewing Fail-safe
source: https://docs.snowflake.com/en/user-guide/data-failsafe.md
section: User Guide
---

# Understanding and viewing Fail-safe

Separate and distinct from Time Travel, Fail-safe ensures historical data is protected in the event of a system failure or other event (e.g. a
security breach).

## What is Fail-safe?

Fail-safe provides a (non-configurable) 7-day period during which historical data may be recoverable by Snowflake. This period starts
immediately after the Time Travel retention period ends. Note, however, that a long-running Time Travel query will delay moving any data and
objects (tables, schemas, and databases) in the account into Fail-safe, until the query completes.

> **Attention:**
>
> Fail-safe is a data recovery service that is provided on a best effort basis and is intended only for use when all other recovery options have been attempted.
>
> Fail-safe is not provided as a means for accessing historical data after the Time Travel retention period has ended. It is for use only by
> Snowflake to recover data that may have been lost or damaged due to extreme operational failures.
>
> Data recovery through Fail-safe may take from several hours to several days to complete.

## View Fail-safe storage for your account

When you review the total data storage usage for your account in Snowsight, you can view the
historical data storage in Fail-safe.

You must use the ACCOUNTADMIN role to view the amount of data that is stored in Snowflake.

In Snowsight, follow these steps:

1. In the navigation menu, select Admin » Cost management, and then select Consumption.
2. Use the Usage Type filter to select Storage.
3. Review the graph and table for Fail-safe storage. The Storage Breakdown column in the table uses color-coded bars
   to represent the different kinds of storage, including Fail-safe storage. Hover the mouse pointer over
   each bar to see the size for each kind of storage.

## Billing for Fail-safe

Data recovery through Fail-safe uses Snowflake-managed serverless compute. Standard serverless compute billing applies. For billing
details, see “Table 5: Serverless Feature Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). To view the related credits that are consumed by data recovery
through Fail-safe, use the following metering history views. Filter for the FAILSAFE_RECOVERY service type:

* [METERING_DAILY_HISTORY view](../sql-reference/account-usage/metering_daily_history.md)
* [METERING_HISTORY view](../sql-reference/account-usage/metering_history.md)

## Considerations

For fail-safe and Snowpipe Streaming Classic, be aware of the following limitations:

* Fail-safe doesn’t support tables that contain data ingested by Snowpipe Streaming Classic. For such tables, you can’t use fail-safe for recovery because fail-safe operations on that table will fail completely. For more information, see [Snowpipe Streaming limitations](snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

---
title: Understanding Column-level Security
source: https://docs.snowflake.com/en/user-guide/security-column-intro.md
section: User Guide
---

# Understanding Column-level Security

This topic provides a general overview of Column-level Security and describes the features and support that are common to both Dynamic Data
Masking and External Tokenization.

To learn more about using a masking policy with a tag, see [Tag-based masking policies](tag-based-masking-policies.md).

## What is Column-level Security?

Column-level Security in Snowflake allows the application of a masking policy to a column within a table or view. Currently, Column-level Security includes two features:

1. [Dynamic Data Masking](security-column-ddm-intro.md)
2. [External Tokenization](security-column-ext-token-intro.md)

Dynamic Data Masking is a Column-level Security feature that uses masking policies to selectively mask plain-text data in table and view columns at query time.

External Tokenization enables accounts to tokenize data before loading it into Snowflake and detokenize the data at query runtime. Tokenization is the process of removing sensitive data by replacing it with an undecipherable token. External Tokenization makes use of masking policies with [external functions](../sql-reference/external-functions.md).

### What are masking policies?

Snowflake supports masking policies as a schema-level object to protect sensitive data from unauthorized access while allowing authorized users to access sensitive data at query runtime. This means that sensitive data in Snowflake is not modified in an existing table (i.e. no static masking). Rather, when users execute a query in which a masking policy applies, the masking policy conditions determine whether unauthorized users see masked, partially masked, obfuscated, or tokenized data. Masking policies as a schema-level object also provide flexibility in choosing a centralized, decentralized, or hybrid management approach. For more information, see Managing Column-level Security (in this topic).

Masking policies can include conditions and functions to transform the data at query runtime when those conditions are met. The policy-driven approach supports segregation of duties to allow security teams to define policies that can limit sensitive data exposure, even to the owner of an object (i.e. the role with the OWNERSHIP privilege on the object, such as a table or view) who normally have full access to the underlying data.

For example, masking policy administrators can implement a masking policy such that analysts (i.e. users with the custom ANALYST role) can only view the last four digits of a phone number and none of the social security number, while customer support representatives (i.e. users with the custom SUPPORT role) can view the entire phone number and social security number for customer verification use cases.

A masking policy consists of a single [data type](../sql-reference-data-types.md), one or more conditions, and one or more masking
functions.

* You can apply the masking policy to one or more table/view columns with the matching data type. For example, you can define a policy for an email address once and apply it to 1000s of email columns across databases and schemas.
* Masking policy conditions can be expressed using [Conditional expression functions](../sql-reference/expressions-conditional.md) and [Context functions](../sql-reference/functions-context.md) or by querying a custom entitlement table. You can use the context functions [INVOKER_ROLE](../sql-reference/functions/invoker_role.md) and [INVOKER_SHARE](../sql-reference/functions/invoker_share.md) for use with views and shares, respectively.
* Masking functions can be any of the built-in functions (e.g. [REGEXP_REPLACE](../sql-reference/functions/regexp_replace.md), [SHA2 , SHA2_HEX](../sql-reference/functions/sha2.md)), [User-defined functions overview](../developer-guide/udf/udf-overview.md), or [Writing external functions](../sql-reference/external-functions.md) (for de-tokenization using an external tokenization provider).

While Snowflake offers [secure views](views-secure.md) to restrict access to sensitive data, secure views present management challenges due to large numbers of views and derived business intelligence (BI) dashboards from each view. Masking policies solve this management challenge by avoiding an explosion of views and dashboards to manage.

Masking policies support segregation of duties (SoD) through the role separation of policy administrators from object owners. Secure views do not have SoD, which is a profound limitation to their utility. This role separation leads to the following default settings:

* Object owners (i.e. the role that has the OWNERSHIP privilege on the object) do not have the privilege to unset masking policies.
* Object owners cannot view column data in which a masking policy applies.

For more information on managing roles and privileges, see Managing Column-level Security (in this topic) and [Access control privileges](security-access-control-privileges.md).

> **Note:**
>
> In some cases, error messages related to masking policies might be redacted. For more information, see
> [Secure objects: Redaction of information in error messages](../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

#### How does a masking policy work?

Masking policies for Dynamic Data Masking and External Tokenization adopt the same structure and format with one notable exception: masking policies for External Tokenization require using [Writing external functions](../sql-reference/external-functions.md) in the masking policy body.

The reason for this exception is that External Tokenization requires a third-party tokenization provider to tokenize data before loading data into Snowflake. At query runtime, Snowflake uses the external function to make a REST API call to the tokenization provider, which then evaluates a tokenization policy (that is created outside of Snowflake) to return either tokenized or detokenized data based on the masking policy conditions. Note that role mapping must exist in Snowflake and the tokenization provider to ensure that the correct data can be returned from a given query.

Snowflake supports creating masking policies using [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md). For example:

```sqlexample
-- Dynamic Data Masking

CREATE MASKING POLICY employee_ssn_mask AS (val string) RETURNS string ->
  CASE
    WHEN CURRENT_ROLE() IN ('PAYROLL') THEN val
    ELSE '******'
  END;

-- External Tokenization

  CREATE MASKING POLICY employee_ssn_detokenize AS (val string) RETURNS string ->
  CASE
    WHEN CURRENT_ROLE() IN ('PAYROLL') THEN ssn_unprotect(VAL)
    ELSE val -- sees tokenized data
  END;
```

Where:

> `employee_ssn_mask`
> :   The name of the Dynamic Data Masking policy.
>
> `employee_ssn_detokenize`
> :   The name of the External Tokenization policy.
>
> `AS (val string) RETURNS string`
> :   Specifies the input and output data types. The data types must match.
>
> `->`
> :   Separates the policy signature from its body.
>
> `case ... end;`
> :   Specifies the masking policy body (i.e. SQL expression) conditions.
>
>     In these two examples, if the query operator is using the PAYROLL custom role in the current session, the operator sees the unmasked/detokenized value. Otherwise, a fixed masked/tokenized value is seen.
>
> `ssn_unprotect`
> :   The external function to operate on the tokenized data.

> **Tip:**
>
> If you want to update an existing masking policy and need to see the current definition of the policy, call the [GET_DDL](../sql-reference/functions/get_ddl.md) function or run the [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md) command.
>
> The masking policy definition can then be updated with the [ALTER MASKING POLICY](../sql-reference/sql/alter-masking-policy.md) command. This command does not
> require unsetting a masking policy from a column, if the masking policy is set on a column. So, a column that is protected by a policy
> remains protected while the policy definition is being updated.

For more details on using masking policies, see:

* Using Column-level Security on Tables and Views (in this topic)
* [Dynamic Data Masking](security-column-ddm-intro.md)
* [External Tokenization](security-column-ext-token-intro.md)
* [Advanced Column-level Security topics](security-column-advanced.md)
* [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) (for policy examples when role hierarchy and role activation is important)

#### Use conditional columns

Conditional masking uses a masking policy to selectively protect the column data in a table or view based on the values in one or more
different columns.

Using a different column to determine whether data in a given column should be protected offers policy administrators
(i.e. users with the `POLICY_ADMIN` custom role) more freedom to create policy conditions.

Note the difference between the two representative policy examples:

Masking policy:
:   This policy can be used for dynamic data masking.

    ```sqlexample
    CREATE MASKING POLICY email_mask AS
    (val string) RETURNS string ->
      CASE
        WHEN CURRENT_ROLE() IN ('PAYROLL') THEN val
        ELSE '******'
      END;
    ```

    This policy specifies only one argument, `val`, which represents any column that contains string data. This policy can be created
    once and applied to any column containing string data. Only users whose [CURRENT_ROLE](../sql-reference/functions/current_role.md) is the `PAYROLL`
    custom role can see the column data. Otherwise, Snowflake returns a fixed masked value in the query result.

    For more information, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).

Conditional masking policy:
:   Note the arguments, `(email varchar, visibility string)`, in this example.

    ```sqlexample
    CREATE MASKING POLICY email_visibility AS
    (email varchar, visibility string) RETURNS varchar ->
      CASE
        WHEN CURRENT_ROLE() = 'ADMIN' THEN email
        WHEN VISIBILITY = 'PUBLIC' THEN email
        ELSE '***MASKED***'
      END;
    ```

    This policy specifies two arguments, `email` and `visibility`, and these arguments are column names. The first column always
    specifies the column to mask. The second column is a conditional column to evaluate whether the first column should be masked. Multiple
    conditional columns can be specified. In this policy, users whose CURRENT_ROLE is the `ADMIN` custom role can view the email address.
    If the email address also has a visibility column value of `Public`, then the email address is visible in the query result.
    Otherwise, Snowflake returns a query result with a fixed masked value for the email column.

    This policy can be used on multiple tables and views provided that column structure in the table or view matches the columns specified in
    the policy. For more information, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).

Since the same object type is used for each representative example, the overall behavior of the policies should be similar, including, but not limited to:

* Query runtime evaluation.
* Utility (e.g. protecting sensitive data, using [Context functions](../sql-reference/functions-context.md)).
* Privilege structure.
* Usage with different management approaches to support segregation of duties (SoD).

Limitations:
:   In addition to the existing masking policy limitations, conditional masking policies have
    the following limitations:

    * Row access policies. For details, see Row Access Policies (in this topic).
    * External tables. For details, see: External tables (in this topic).

Considerations:
:   In addition to the existing normal masking policy considerations, evaluate the following
    points prior to using conditional masking policies:

    * Ensure all columns specified in the CREATE MASKING POLICY statement reside in the same table or view.
    * Minimize the number of column arguments in the policy definition. Snowflake must evaluate each column at query runtime. Specifying
      fewer columns leads to faster performance overall.
    * Track conditional column usage in a masking policy by calling the Information Schema table function
      [POLICY_REFERENCES](../sql-reference/functions/policy_references.md).

For more details on setting masking policies with conditional columns, see
Apply a conditional masking policy on a column (in this topic).

#### Masking policies at query runtime

At runtime, Snowflake rewrites the query to apply the masking policy expression to the columns specified in the masking policy. The masking
policy is applied to the column regardless of where in a SQL expression the column is referenced, including:

* Projections.
* JOIN predicates.
* WHERE clause predicates.
* ORDER BY and GROUP BY clauses.

> **Important:**
>
> A masking policy is deliberately applied wherever the relevant column is referenced by a SQL construct to prevent the de-anonymization of
> data through creative queries to include masked column data.
>
> Therefore, if executing a query results in masked data in one or more columns, the query output may not provide the anticipated value
> because the masked data prevents evaluating all of the query output data in the desired context.

**Masking policies with nested objects:**

> Snowflake supports nested masking policies, such as a masking policy on a table and a masking policy on a view for the same table. At
> query runtime, Snowflake evaluates all masking policies that are relevant to a given query in the following sequence:
>
> 1. The masking policy that is applicable to the table is always executed first.
> 2. The policy for the view is executed after evaluating the policy for the table.
> 3. If nested views exist (e.g. `table_1` » `view_1` » `view_2` » … `view_n`), the policies are applied in sequential
>    order from left to right.
>
> This pattern continues for however many masking policies exist with respect to the data in the query. The following diagram illustrates
> the relationship between a query performer, tables, views, and policies.

**User queries:**

> The following example shows a user-submitted query followed by the Snowflake runtime query rewrite in which the masking policy
> (i.e. `sql_expression`) applies to the email column only.
>
> ```sqlexample
> SELECT email, city
> FROM customers
> WHERE city = 'San Mateo';
>
> SELECT <SQL_expression>(email), city
> FROM customers
> WHERE city = 'San Mateo';
> ```

**Query with a protected column in the WHERE clause predicate (anti-pattern):**

> The following examples show a user-submitted query followed by the Snowflake runtime query rewrite in which the masking policy
> (i.e. `sql_expression`) applies to only one side of a comparison (e.g. the email column but not the string to which the email column
> is compared). The results of the query are not what the user intended. Masking only one side of a comparison is a common
> anti-pattern.
>
> ```sqlexample
> SELECT email
> FROM customers
> WHERE email = 'user@example.com';
>
> SELECT <SQL_expression>(email)
> FROM customers
> WHERE <SQL_expression>(email) = 'user@example.com';
> ```

**Query with a protected column in the JOIN predicate (anti-pattern):**

> ```sqlexample
> SELECT b.email, d.city
> FROM
>   sf_tuts.public.emp_basic AS b
>   JOIN sf_tuts.public.emp_details AS d ON b.email = d.email;
>
> SELECT
>   <SQL_expression>(b.email),
>   d.city
> FROM
>   sf_tuts.public.emp_basic AS b
>   JOIN sf_tuts.public.emp_details AS d ON <SQL_expression>(b.email) = <SQL_expression>(d.email);
> ```

#### Query runtime considerations

Snowflake recommends considering the following factors when trying to predict the effect of applying a masking policy to a column and whether the query operator sees masked data:

The current session:
:   Masking policy conditions using [CURRENT_ROLE](../sql-reference/functions/current_role.md).

The executing role:
:   Masking policy conditions using [INVOKER_ROLE](../sql-reference/functions/invoker_role.md).

Role hierarchy:
:   Masking policy conditions using [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) or [IS_GRANTED_TO_INVOKER_ROLE](../sql-reference/functions/is_granted_to_invoker_role.md).

Data sharing:
:   Whether the data is shared using [Secure Data Sharing](data-sharing-gs.md). For details, see
    Data Sharing (in this topic).

Replication:
:   See Replication (in this topic).

Subqueries:
:   A masking policy can reference a subquery in the policy definition, however, there are limits to subquery support in Snowflake. For more information, see [Working with Subqueries](querying-subqueries.md).

UDFs in a masking policy:
:   Ensure the data type of the column, UDF, and masking policy match. For more information, see
    User-defined functions in a masking policy (in this topic).

Search optimization service:
:   The search optimization service can improve the query performance on a table that uses a masking or row access policy.

    For details, see
    [Support for Tables With Masking Policies and Row Access Policies in the Search Optimization Service](search-optimization/working-with-tables.md).

The first three items are explained in greater detail in [Advanced Column-level Security topics](security-column-advanced.md). Data sharing only applies to Dynamic Data Masking because external functions cannot be invoked in the context of a share.

Ultimately, the specific use case determines whether Dynamic Data Masking or External Tokenization is the best fit.

## Choosing Dynamic Data Masking or External Tokenization

To choose the correct feature that best meets the need of your organization, evaluate the major use cases for your data along with relevant considerations and limitations. The following two sections summarize the benefits and limitations between the two features.

### Benefits

The following table compares the benefits of Dynamic Data Masking and External Tokenization.

| Factor | Dynamic Data Masking | External Tokenization | Notes |
| --- | --- | --- | --- |
| Preserve analytical value after de-identification. |  | ✔ | Since tokenization provides a unique value for a given set of characters, it is possible to group records by a tokenized value without revealing the sensitive information.  For example, group medical records by diagnosis code with the patient diagnosis code tokenized. Data analysts can then query a view on the diagnosis code to obtain a count of the number of patients with a unique diagnosis code. |
| Pre-load tokenized Data. |  | ✔ | Unauthorized users never see the real data value. Requires third-party tokenization provider. |
| Pre-load unmasked data. | ✔ |  | Only need built-in Snowflake functionality, no third-parties required. |
| Data Sharing. | ✔ |  | For details, see Data Sharing (in this topic). |
| Ease of use and Change management. | ✔ | ✔ | Write a policy once and have it apply to thousands of columns across databases and schemas. |
| Data administration and SoD. | ✔ | ✔ | A security or privacy officer decides which columns to protect, not the object owner.  Masking policies are easy to manage and support centralized and decentralized administration models. |
| Data authorization and governance. | ✔ | ✔ |  |
| Contextual data access by role or custom entitlements. | ✔ | ✔ | Supports data governance as implemented by security or privacy officers and can prohibit privileged users with the ACCOUNTADMIN or SECURITYADMIN system roles from unnecessarily viewing data. |
| Database replication and account object replication. | ✔ | ✔ | See: Replication (in this topic). |

### Limitations

The following table describes the current limitations for Column-level Security. A checkmark (i.e. ✔) indicates a limitation or lack of current support for the feature.

| Limitation | Dynamic Data Masking | External Tokenization | Notes |
| --- | --- | --- | --- |
| Materialized views (MV). | ✔ | ✔ | For a complete summary, see Materialized views (in this topic). |
| [DROP MASKING POLICY](../sql-reference/sql/drop-masking-policy.md) | ✔ | ✔ | Prior to dropping a policy, unset the policy from the table or view column using [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) or [ALTER VIEW](../sql-reference/sql/alter-view.md). |
| [DROP DATABASE](../sql-reference/sql/drop-database.md) and [DROP SCHEMA](../sql-reference/sql/drop-schema.md) | ✔ | ✔ | Dropping a database or schema requires the masking policy and its mappings to be self-contained within the database or schema.  For example, `database_1` contains `policy_1` and `policy_1` is only used in `database_1`. |
| External tables. | ✔ | ✔ | An external table cannot be referenced as a lookup table (i.e. in a subquery) to determine whether column values should be masked. For more information, see External Tables (in this topic) |
| Different data types in the input and output of a policy definition. | ✔ | ✔ | A masking policy definition must have the same data type for the input and output. In other words, as a representative example, you cannot define the input datatype as a timestamp and return a string. |
| Masking policy change management. | ✔ | ✔ | You can optionally store and track masking policy changes in a version control system of your choice. |
| [Future grants](../sql-reference/sql/grant-privilege.md). | ✔ | ✔ | [Future grants](../sql-reference/sql/grant-privilege.md) of privileges on masking policies are not supported.  As a workaround, grant the APPLY MASKING POLICY privilege to a custom role to allow that role to apply masking policies on table or view columns. |

### Considerations

* Use caution when inserting values from a source column that has a masking policy on the source column to a target column without a masking policy on the target column. Since a masking policy is set on the source column, a role that views unmasked column data can insert unmasked data into another column, where any role with sufficient privileges on the table or view can see the value.
* If a role that sees masked data in the source column inserts those values into a target column, the inserted values remain masked. If a masking policy is not set on the target column, then users with sufficient privileges on the table or view may see a combination of masked and unmasked values in the target column. Therefore, as a best practice:

  + Exercise caution when applying masking policies to columns.
  + Verify queries using columns that have masking policies before making tables and views available to users.
  + Determine additional tables and views (i.e. target columns) where the data in the source column may appear.
  + For more information, see Obtain Columns with a Masking Policy (in this topic).
* Use caution when creating the setup script for a Snowflake Native App when the masking policy exists in a versioned schema. For details, see
  [version schema considerations](../developer-guide/native-apps/creating-setup-script.md).

## Using Column-level Security on tables and views

Snowflake supports masking policies with tables and views. The following describes how masking policies affect tables and views in
Snowflake.

> **Tip:**
>
> Call the [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function to simulate a query on a column that is protected by a masking policy,
> a table or view protected by a row access policy, or both types of policies.

### Active role hierarchy & mapping tables

The policy conditions can evaluate the user’s active primary and secondary roles in a session directly, look up active roles in a mapping
table, or do both depending on how the policy administrator wants to write the policy. If the policy contains a mapping table lookup,
create a centralized mapping table and store the mapping table in the same database as the protected table. This is particularly important
if the policy calls the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function. For details, see the function
[usage notes](../sql-reference/functions/is_database_role_in_session.md).

For these use cases, Snowflake recommends writing the policy conditions to call the [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) or
the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function depending on whether you want to specify an account role or
database role. For examples, see:

* [Examples](../sql-reference/functions/is_role_in_session.md) section in the IS_ROLE_IN_SESSION function.
* IS_DATABASE_ROLE_IN_SESSION
* [Share data protected by a policy](data-sharing-policy-protected-data.md)

For additional examples using context functions and masking policies, see [Advanced Column-level Security topics](security-column-advanced.md).

### Apply masking policies to columns

When a column is not protected by a masking policy, there are two options to apply a masking policy to a column in a table or view:

1. With a new table or view, apply the policy to a table column with a [CREATE TABLE](../sql-reference/sql/create-table.md) statement or a view
   column with a [CREATE VIEW](../sql-reference/sql/create-view.md) statement.
2. With an existing table or view, apply the policy to a table column with an [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md)
   statement or a view column with an [ALTER VIEW](../sql-reference/sql/alter-view.md) statement.

For a new table or view, execute the following statements:

> ```sqlexample
> -- table
> CREATE OR REPLACE TABLE user_info (ssn string masking policy ssn_mask);
>
> -- view
> CREATE OR REPLACE VIEW user_info_v (ssn masking policy ssn_mask_v) AS SELECT * FROM user_info;
> ```

For an existing table or view, execute the following statements:

> ```sqlexample
> -- table
> ALTER TABLE IF EXISTS user_info MODIFY COLUMN ssn_number SET MASKING POLICY ssn_mask;
>
> -- view
> ALTER VIEW user_info_v MODIFY COLUMN ssn_number SET MASKING POLICY ssn_mask_v;
> ```

For more information on syntax and usage, see [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) and [ALTER VIEW](../sql-reference/sql/alter-view.md).

If the masking policy uses a UDF, see User-defined functions in a masking policy (in this topic).

### Apply a conditional masking policy on a column

After [creating](../sql-reference/sql/create-masking-policy.md) a masking policy using
conditional columns, there are two options to set a conditional masking policy on a column
when the column is not already protected by a masking policy:

1. For a new table or view, apply the policy to a table or view column with the corresponding CREATE statement.

   For more information on syntax and usage, see:

   * [CREATE TABLE](../sql-reference/sql/create-table.md)
   * [CREATE VIEW](../sql-reference/sql/create-view.md)
   * [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md)

   For a new table or view, execute the following statements:

   > ```sqlexample
   > -- table
   > CREATE OR REPLACE TABLE user_info (email string masking policy email_visibility) USING (email, visibility);
   >
   > --view
   > CREATE OR REPLACE VIEW user_info_v (email masking policy email_visibility) USING (email, visibility) AS SELECT * FROM user_info;
   > ```
2. For an existing table or view, set the policy on a table or view column with the corresponding ALTER statement.

   For more information on syntax and usage, see:

   * [ALTER TABLE](../sql-reference/sql/alter-table.md)
   * [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md)
   * [ALTER VIEW](../sql-reference/sql/alter-view.md)
   * [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md)

   For an existing table or view, execute the following statements:

   > ```sqlexample
   > -- table
   > ALTER TABLE IF EXISTS user_info MODIFY COLUMN email
   > SET MASKING POLICY email_visibility USING (EMAIL, VISIBILITY);
   >
   > -- VIEW
   > ALTER VIEW user_info_v MODIFY COLUMN email
   > SET MASKING POLICY email_visibility USING (email, visibility);
   > ```

### Replace a masking policy on a column

Once a masking policy is set on a column, there are two different pathways to replace the masking policy on the column with a different
masking policy without having to replace the entire table or view.

These examples use [ALTER TABLE](../sql-reference/sql/alter-table.md) commands. The same approach applies to views with the
[ALTER VIEW](../sql-reference/sql/alter-view.md) command:

* Unset the policy from a table column in one statement and then set a new policy on the column in a different statement:

  ```sqlexample
  ALTER TABLE t1 MODIFY COLUMN c1 UNSET MASKING POLICY;

  ALTER TABLE t1 MODIFY COLUMN c1 SET MASKING POLICY p2;
  ```
* Use the `FORCE` keyword to replace the policy in a single statement:

  ```sqlexample
  ALTER TABLE t1 MODIFY COLUMN c1 SET MASKING POLICY p2 FORCE;
  ```

  Note:

  + The `FORCE` keyword requires the [data type](../sql-reference-data-types.md) of the policy in the ALTER TABLE statement
    (i.e. STRING) to match the data type of the masking policy currently set on the column (i.e. STRING).
  + The `FORCE` keyword can be combined with the `USING` clause to set a conditional masking policy on column:

    ```sqlexample
    ALTER TABLE t1 MODIFY COLUMN c1 SET MASKING POLICY policy1 USING (c1, c3, c4) FORCE;
    ```

> **Important:**
>
> Exercise caution when replacing a masking policy on a column.
>
> Depending on the timing of the replacement and the query on the column, choosing to replace the policy in two separate statements could
> lead to a data leak because the column data is unprotected in the time interval between the UNSET and SET operations.
>
> However, if the policy conditions are different in the replacement policy, specifying the `FORCE` keyword could lead to a lack of
> access because (previously) users could access data and the replacement no longer allows access.
>
> Prior to replacing a policy, consult your internal data administrators to coordinate the best approach to protect data with masking
> policies and replace masking policies as needed.

### Row access policies

A given table or view column can be specified in either a masking policy signature or a row access policy signature. In other words, the
same column cannot be specified in both a masking policy signature and a row access policy signature at the same time.

This behavior also applies to column used as conditional columns in a masking policy.

For more information, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md) and [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md).

### Simulate how a policy will work

Call the [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function to simulate a query on a column that is protected by a masking policy,
a table or view protected by a row access policy, or both types of policies.

### Materialized views

Snowflake lets you set a masking policy on a materialized view column. At query runtime, the query plan executes any masking policy that
is present prior to creating the materialized view rewrite. Once the materialized view rewrite occurs, masking policies cannot be set on
any materialized view columns.

There are two options to set a masking policy on a materialized view column:

1. For a new materialized view, execute a [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md) statement:

   ```sqlexample
   CREATE OR REPLACE MATERIALIZED VIEW user_info_mv
     (ssn_number masking policy ssn_mask)
   AS SELECT ssn_number FROM user_info;
   ```
2. For an existing materialized view, execute an
   [ALTER VIEW … MODIFY COLUMN](../sql-reference/sql/alter-view.md) statement on the materialized view as shown in the
   Apply Masking Policies to Columns section (in this topic).

Additionally, the following two limitations exist regarding masking policies and materialized views:

1. A masking policy cannot be set on a table column if a materialized view is already created from the underlying table. Snowflake returns
   the following error message when this attempt is made:

   ```none
   SQL execution error: One or more materialized views exist on the table. number of mvs=<number>, table name=<table_name>.
   ```
2. If a masking policy is set on an underlying table column and a materialized view is created from that table, the
   materialized view only contains columns that are not protected by a masking policy. Snowflake also returns the following error message
   if the attempting to include one or more columns protected by a masking policy:

   ```none
   Unsupported feature 'CREATE ON MASKING POLICY COLUMN'.
   ```

> **Tip:**
>
> If you prefer to set a masking policy on a column in the base table, consider creating a dynamic table from the base table. For more
> information, see [Masking and row access policies](dynamic-tables-limitations.md).

### Dynamic tables

You can create a dynamic table with a row access policy, masking policy, and tag. For more information, see:

* [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md)
* [Masking and row access policies](dynamic-tables-limitations.md)

### Obtain columns with a masking policy

To obtain a list of columns with masking policies, execute the following statement. For more information, see [POLICY_REFERENCES](../sql-reference/functions/policy_references.md).

> ```sqlexample
> SELECT * from table(
>   INFORMATION_SCHEMA.POLICY_REFERENCES(
>     policy_name=>'<policy_name>'
>   )
> );
> ```

Execute a [DESCRIBE TABLE](../sql-reference/sql/desc-table.md) or [DESCRIBE VIEW](../sql-reference/sql/desc-view.md) statement to view the masking policy on column in a table or view.

### Object Tagging and masking policies

For details, see [Tag-based masking policies](tag-based-masking-policies.md).

Note that a masking policy that is directly assigned to a column takes precedence over a tag-based masking policy.

### Hashing, cryptographic, and encryption functions in masking policies

[Hashing](../sql-reference/functions-hash-scalar.md) and [cryptographic/checksum](../sql-reference/functions-string.md) can be used in masking policies to mask sensitive data.

For a more information, see [Advanced Column-level Security topics](security-column-advanced.md).

### External tables

You cannot assign a masking policy to the external table VALUE column when creating the external table with a
[CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) statement because this column is automatically created by default.

You can assign the masking policy to the external table VALUE column by executing an [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md)
statement on the external table. The data type of the masking policy that protects the VALUE column must be VARIANT.

> ```sqlexample
> ALTER TABLE t1 MODIFY COLUMN VALUE SET MASKING POLICY p1;
> ```

You can assign a masking policy to a virtual column in an external table as follows:

* Set the `EXEMPT_OTHER_POLICIES` masking policy property to `TRUE` in the masking policy that protects VALUE column in the external
  table.

  If this property is not already set, execute a CREATE OR REPLACE statement on the masking policy the protects the VALUE column and
  specify the `EXEMPT_OTHER_POLICIES` property. The virtual column inherits the policy that protects the VALUE column, and this property
  allows the policy on the virtual column to override the inherited policy. For details, see
  [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).
* Assign a different masking policy to the virtual column using an ALTER TABLE command. This policy can be less strict than the policy for
  the VALUE column because the virtual column is less sensitive. The virtual column contains a lesser amount of data than the VALUE
  column; the VALUE column contains all of the data for each row in the external table.

  The data type in the policy that protects the virtual column depends on the data type of the virtual column.

Regarding conditional columns in a masking policy, a virtual column can be listed as an conditional column argument to determine whether
the first column argument should be masked or tokenized. However, a virtual column cannot be specified as the first column to mask or
tokenize.

For more information, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).

> **Important:**
>
> Snowflake does not support using an external table as a lookup table (i.e. in a subquery) in a masking policy. While cloning a database,
> Snowflake clones the masking policy, but not the external table. Therefore, the policy in the cloned database refers to a table that
> is not present in the cloned database.
>
> If the data in the external table is necessary for the policy, consider moving the external table data to a dedicated schema
> within the database in which the masking policy exists prior to completing a clone operation. Update the masking policy to
> reference the fully qualified table name to ensure the policy refers to a table in the cloned database.

### Streams

Masking policies on columns in a table carry over to a stream on the same table.

The result is that unauthorized users see masked data; streams created by authorized users see the data as defined by the masking policy.

For masking policies, streams use the latest table version available at the query time for any tables referenced in the policy.

### Cloned objects

The following approach helps to safeguard data from users with the SELECT privilege on the table or view when accessing a cloned object:

* Cloning an individual policy object is not supported.
* Cloning a schema results in the cloning of all policies within the schema.
* A cloned table maps to the same policies as the source table. In other words, if a policy is set on the base table or its columns, the
  policy is attached to the cloned table or its columns.

  + If a table or view exists in the source schema/database and has references to policies in the same schema/database, the cloned table or
    view is mapped to the corresponding cloned policy (in the target schema/database) instead of the policy in the source schema/database.
  + If the source table refers to a policy in a different schema (i.e. a foreign reference), then the cloned table retains the
    foreign reference.

For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

### CREATE TABLE … AS SELECT (CTAS) statements

Executing a CREATE TABLE … AS SELECT (CTAS) statement applies any masking policies on columns included in the statement before the data is populated in the new table (i.e. the applicable column data is masked in the new table). This flow is adhered to because a table created using a CTAS statement may have a different set of columns than the source objects, and Snowflake cannot apply masking policies to the new table columns implicitly.

If there is a need to copy unmasked data, use a role authorized for protected data to run the CTAS statement. After creating the new table, transfer ownership of the new table to another role and ask the masking policy administrator to apply the masking policies to the columns of the new table.

For more information, see [CREATE TABLE](../sql-reference/sql/create-table.md).

### Queries using aggregate functions and masked columns

It is possible to use [Aggregate functions](../sql-reference/functions-aggregation.md) on columns with masked data.

A representative use case is that a data analyst wants to obtain the [COUNT](../sql-reference/functions/count.md) for a column of social security numbers without needing to see the actual data. However, if the data analyst runs a query using [SELECT](../sql-reference/sql/select.md) on a masked table column, the query returns a fixed masked value. Users with the PAYROLL custom role in the current session see the unmasked data and everyone else sees masked data.

To achieve this outcome:

1. The table owner creates a view on the column that contains the aggregate function.

   ```sqlexample
   CREATE VIEW ssn_count AS SELECT DISTINCT(ssn) FROM table1;
   ```
2. Grant the ANALYST role full privileges on the view. Do not grant the analyst any privileges on the table.
3. Apply a masking policy to the table column. Note that the table policy is always applied before the view policy, if there is a policy on a view column.

   ```sqlexample
   CASE
     WHEN CURRENT_ROLE() IN ('PAYROLL') THEN val
     ELSE '***MASKED***'
   END;
   ```
4. Execute a query on the view column.

   ```sqlexample
   USE ROLE analyst;
   SELECT COUNT(DISTINCT ssn) FROM v1;
   ```

### User-defined functions in a masking policy

A [UDF](../developer-guide/udf/udf-overview.md) can be passed into the masking policy conditions.

It is important to ensure that the data type for the table or view column, the UDF, and the masking policy match. If the data
types are different, such as having a table column and UDF with data type VARIANT and the masking policy (with this UDF in the policy
conditions) returns VARCHAR data type, Snowflake returns an error when making a query on the table column when this masking policy is set
on the table column.

For a representative example of matching the data type for a table column, UDF, and masking policy, see the *Using JavaScript
UDFs on JSON (Variant)* example in [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).

### Data Sharing

Usage:
:   * If the provider assigns a policy to a shared table or view and the policy conditions call the
      [CURRENT_ROLE](../sql-reference/functions/current_role.md) or [CURRENT_USER](../sql-reference/functions/current_user.md) function, or the policy conditions call a [secure UDF](../developer-guide/secure-udf-procedure.md), Snowflake returns a NULL value for the function or the UDF in the consumer account.

      The reason is that the owner of the data being shared does not typically control the users or roles in the account in which the table
      or view is being shared. As a workaround, use the [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) function in the policy conditions.

      Alternatively, as a provider, write the policy conditions to call the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md)
      function and share the database role. As a consumer, grant the shared database role to an account role. For details, see
      [Share data protected by a policy](data-sharing-policy-protected-data.md).

Limitations:
:   * A data sharing provider cannot create a policy in a [reader account](data-sharing-reader-create.md).
    * Data sharing consumers cannot apply a policy to a shared table or view. As a workaround, import the shared database and create a local
      view from the shared table or view.
    * Data sharing consumers cannot query a shared table or view that references two different providers. For example:

      + `rap1` is a row access policy that protects the table named `t1`, where `t1` is in the share named `share1` from a provider.
      + The `rap1` policy conditions reference a mapping table named `t2`, where `t2` comes from `share2` and a different provider.
      + The consumer query on `t1` fails.
      + The provider for `t1` can query `t1`.
    * External functions:

      Snowflake returns an error if:

      + The policy assigned to a shared table or view is updated to call an external function.
      + The policy calls an external function and you attempt to assign the policy to a shared table or view.

> **Note:**
>
> For External Tokenization, [Secure Data Sharing](data-sharing-intro.md) is not applicable because external functions
> cannot be invoked in the context of a share.

### Replication

Masking policies and their assignments can be replicated using database replication and replication groups.

For [database replication](database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
  replication are on lower editions.
* A table or view contained in the primary database has a [dangling reference](database-replication-considerations.md) to a
  masking policy in another database.

The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
[replication group](account-replication-intro.md).

> > **Note:**
> >
> > If using failover or failback actions, the Snowflake account must be Business Critical Edition or higher.
> >
> > For more information, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

### Query profile

When used on a column with a masking policy, the [EXPLAIN](../sql-reference/sql/explain.md) command output includes the masked data, not the masking policy body.

The following example generates the EXPLAIN plan for a query on a table of employee identification numbers and social security numbers. The command in this example generates the example in JSON format.

The column containing the social security numbers has a masking policy.

```sqlexample
EXPLAIN USING JSON SELECT * FROM mydb.public.ssn_record;
```

```sqljson
{
  "GlobalStats": {
    "partitionsTotal": 0,
    "partitionsAssigned": 0,
    "bytesAssigned": 0
  },
  "Operations": [
    [
      {
        "id": 0,
        "operation": "Result",
        "expressions": [
          "1",
          "'**MASKED**'"
        ]
      },
      {
        "id": 1,
        "parent": 0,
        "operation": "Generator",
        "expressions": [
          "1"
        ]
      }
    ]
  ]
}
```

### Unload data

Using the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command on a column that has a masking policy results in the masking policy being applied to the data. Therefore, unauthorized users see masked data after executing the command.

### Snowflake Native App Framework

For details about using masking policies with a Snowflake Native App, see:

* [Restrictions on sharing data content that contains policies](../developer-guide/native-apps/preparing-data-content.md).
* [Define policies on proxy views](../developer-guide/native-apps/preparing-data-content.md).
* [Blocked context functions](../developer-guide/native-apps/redacted-content.md).

## Managing Column-level Security

This section provides information useful for determining your overall management approach to masking policies, describes the privileges required to manage Column-level Security, and lists supported DDL commands.

### Choosing a centralized, hybrid, or decentralized approach

To manage Dynamic Data Masking and External Tokenization policies effectively, it is helpful to consider whether your approach to masking data in columns should follow a centralized security approach, a decentralized approach, or a hybrid of each of these two approaches.

The following table summarizes some of the considerations with each of these two approaches.

| Policy Action | Centralized Management | Hybrid Management | Decentralized Management |
| --- | --- | --- | --- |
| Create policies | Security officer | Security officer | Individual teams |
| Apply policies to columns | Security officer | Individual teams | Individual teams |

As a best practice, Snowflake recommends that your organization gathers all relevant stakeholders to determine the best management approach for implementing Column-level Security in your environment.

### Masking policy privileges

This section describes the Column-level Security masking policy privileges and how they apply to a centralized, decentralized, or hybrid management approach.

Snowflake provides the following privileges for Column-level Security masking policies.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a masking policy on a column.  Note that granting the global APPLY MASKING POLICY privilege (i.e. APPLY MASKING POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see Masking policy privileges. |
| OWNERSHIP | Grants full control over the masking policy. Required to alter most properties of a masking policy. Only a single role can hold this privilege on a specific object at a time. |

> **Note:**
>
> Operating on a masking policy also requires the USAGE privilege on the parent database and schema.

The following examples show how granting privileges apply to different management approaches. After granting the APPLY privilege to a role, the masking policy can be set on a table column using an [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) statement or set on a view column using an [ALTER VIEW](../sql-reference/sql/alter-view.md) statement (by a member of the role with the APPLY privilege on the masking policy).

**Centralized Management**

> In a centralized management approach, only the security officer custom role (e.g. `security_officer`) creates and applies masking policies to columns in tables or views. This approach can provide the most consistency in terms of masking policy management and masking sensitive data.
>
> > ```sqlexample
> > -- create a security_officer custom role
> >
> > USE ROLE ACCOUNTADMIN;
> > CREATE ROLE security_officer;
> >
> > -- grant CREATE AND APPLY masking policy privileges to the SECURITY_OFFICER custom role.
> >
> > GRANT CREATE MASKING POLICY ON SCHEMA mydb.mysch TO ROLE security_officer;
> >
> > GRANT APPLY MASKING POLICY ON ACCOUNT TO ROLE security_officer;
> > ```
> >
> > Where:
> >
> > * `schema_name`
> >   :   Specifies the identifier for the schema; must be unique for the database in which the schema is created.
> >
> >       In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also case-sensitive.
> >
> >       For more details, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

**Hybrid Management**

> In a hybrid management approach, the security officer custom role (e.g. `security_officer`) creates masking policies and individual teams (e.g. finance, payroll, human resources) apply the masking policies to columns in tables or views owned by the teams. This approach can lead to consistent policy creation and maintenance but requires individual teams to have the increased responsibility to mask sensitive data.
>
> > ```sqlexample
> > USE ROLE ACCOUNTADMIN;
> > CREATE ROLE security_officer;
> > GRANT CREATE MASKING POLICY ON SCHEMA mydb.mysch TO ROLE security_officer;
> > ```
>
> The SECURITY_OFFICER custom role grants the APPLY privilege to the human resources team (i.e. users with the HUMAN_RESOURCES custom role) to mask social security numbers (e.g. masking policy: `ssn_mask`) in columns for objects owned by the HUMAN_RESOURCES custom role.
>
> > ```sqlexample
> > USE ROLE security_officer;
> > GRANT APPLY ON MASKING POLICY ssn_mask TO ROLE human_resources;
> > ```
> >
> > Where:
> >
> > * `grant apply on masking policy policy_name to role role_name;`
> >   :   Used by a policy owner to decentralize the unset and set operations of a given masking policy on columns to the object owners.
> >
> >       This privilege supports [discretionary access control](security-access-control-overview.md) where object owners are also considered data stewards.

**Decentralized Approach**

> In a decentralized management approach, individual teams create and apply masking policies to columns in tables or views. This approach can lead to inconsistent policy management, with the possibility of sensitive data not being masked properly, since individual teams assume all responsibility for managing masking policies and masking sensitive data.
>
> In this representative example, the support team (i.e. users with the custom role SUPPORT) and the finance team (i.e. users with the custom role FINANCE) can create masking policies. Note that these custom roles may not include the SECURITY_OFFICER custom role.
>
> > ```sqlexample
> > USE ROLE ACCOUNTADMIN;
> > GRANT CREATE MASKING POLICY ON SCHEMA mydb.mysch TO ROLE support;
> > GRANT CREATE MASKING POLICY ON SCHEMA <DB_NAME.SCHEMA_NAME> TO ROLE FINANCE;
> > ```
>
> The support team grants the APPLY privilege to the human resources team (i.e. users with the custom role HUMAN_RESOURCES) to mask social security numbers (e.g. masking policy: `ssn_mask`) in columns for objects owned by the HUMAN_RESOURCES custom role.
>
> > ```sqlexample
> > USE ROLE support;
> > GRANT APPLY ON MASKING POLICY ssn_mask TO ROLE human_resources;
> > ```
>
> The finance team grants the APPLY privilege to the internal audit team (i.e. users with the custom role AUDIT_INTERNAL) to mask cash flow data (e.g. masking policy: `cash_flow_mask`) in columns for objects owned by the AUDIT_INTERNAL custom role.
>
> > ```sqlexample
> > USE ROLE finance;
> > GRANT APPLY ON MASKING POLICY cash_flow_mask TO ROLE audit_internal;
> > ```

For more information on masking policy privileges, see:

* [Using Dynamic Data Masking](security-column-ddm-use.md)
* [Using External Tokenization](security-column-ext-token-use.md)

### Masking policy DDL

Snowflake provides the following set of commands to manage Column-level Security masking policies.

* [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md)
* [ALTER MASKING POLICY](../sql-reference/sql/alter-masking-policy.md) (see also: [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md), and [ALTER VIEW](../sql-reference/sql/alter-view.md))
* [DROP MASKING POLICY](../sql-reference/sql/drop-masking-policy.md)
* [SHOW MASKING POLICIES](../sql-reference/sql/show-masking-policies.md)
* [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md)

The following table summarizes the relationship between the Column-level Security masking policy DDL operations and their necessary privileges.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege |
| --- | --- |
| Create masking policy | A role with the CREATE MASKING POLICY on SCHEMA privilege. |
| Alter masking policy | The masking policy owner (i.e. the role with the OWNERSHIP privilege on the masking policy). |
| Drop masking policy | The masking policy owner (i.e. the role with the OWNERSHIP privilege on the masking policy). |
| Show masking policies | One of the following: . A role with the global APPLY MASKING POLICY privilege, or . The masking policy owner (i.e. the role with the OWNERSHIP privilege on the masking policy) or . A role with the APPLY privilege on the masking policy. |
| Describe masking policy | One of the following: . A role with the global APPLY MASKING POLICY privilege or . The masking policy owner (i.e. the role with the OWNERSHIP privilege on the masking policy) or . A role with the APPLY privilege on the masking policy. |
| List of columns having a masking policy | One of the following: . The role with the APPLY MASKING POLICY privilege, or . The role with the APPLY on MASKING POLICY privilege on a given masking policy and has OWNERSHIP on the target object. |
| Using UDFs in a masking policy | If creating a new or altering an existing masking policy, the policy administrator role must have usage on the UDF, all scalar UDFs in the policy expression should have the same data type, and the UDF must exist.  At the query runtime, Snowflake verifies if the UDF exists; if not, the SQL expression will not resolve and the query fails. |

## Monitor masking policies with SQL

You can monitor masking policy usage through two different Account Usage views and an Information Schema table.

It can be helpful to think of two general approaches to determine how to monitor masking policy usage.

* Discover Masking Policies
* Identify Assignments

### Discover masking policies

You can use the [MASKING_POLICIES](../sql-reference/account-usage/masking_policies.md) view in the Account Usage schema of the shared
SNOWFLAKE database. This view is a *catalog* for all masking policies in your Snowflake account. For example:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.MASKING_POLICIES
> ORDER BY POLICY_NAME;
> ```

### Identify assignments

Snowflake supports different options to identify masking policy assignments, depending on whether the query needs to target the account
or a specific database.

* Account-level query:

  Use the Account Usage [POLICY_REFERENCES](../sql-reference/account-usage/tag_references.md) view to determine all of the columns
  that have a masking policy. For example:

  > ```sqlexample
  > SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.POLICY_REFERENCES
  > ORDER BY POLICY_NAME, REF_COLUMN_NAME;
  > ```
* Database-level query:

  Every Snowflake database includes an [Snowflake Information Schema](../sql-reference/info-schema.md). Use the Information Schema table function
  [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) to determine all of the masking policies on columns for a given table:

  > ```sqlexample
  > SELECT *
  > FROM TABLE(
  >   my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
  >     'my_table',
  >     'table'
  >   )
  > );
  > ```

  You can also use this function to query by the name of the masking policy to find the objects that are associated with a
  given masking policy.

## Monitor masking policies with Snowsight

You can use the Snowsight Governance & security » Tags & policies area to monitor and report on the usage of
policies and tags with tables, views, and columns. There are two different interfaces: Dashboard and Tagged Objects.

When using the Dashboard and the Tagged Objects interface, note the following details.

* The Dashboard and Tagged Objects interfaces require a running warehouse.
* Snowsight updates the Dashboard every 12 hours.
* The Tagged Objects information latency can be up to two hours and returns up to 1000 objects.

### Accessing the Governance area in Snowsight

To access the Tags & policies area, your Snowflake account must be [Enterprise Edition or higher](intro-editions.md).
Additionally, you must do either of the following:

* Use the ACCOUNTADMIN role.
* Use an account role that is directly granted the GOVERNANCE_VIEWER and OBJECT_VIEWER database roles.

  You must use an account role with these database role grants. Currently, Snowsight does not evaluate role hierarchies
  and user-defined database roles that have access to tables, views, data access policies, and tags.

  To determine if your account role is granted these two database roles, use a [SHOW GRANTS](../sql-reference/sql/show-grants.md) command:

  > ```sqlexample
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```
  >
  > ```output
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | created_on                    | privilege | granted_on    | name                        | granted_to | grantee_name    | grant_option | granted_by |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | 2024-01-24 17:12:26.984 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.GOVERNANCE_VIEWER | ROLE       | DATA_ENGINEER   | false        |            |
  > | 2024-01-24 17:12:47.967 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.OBJECT_VIEWER     | ROLE       | DATA_ENGINEER   | false        |            |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > ```

  If your account role is not granted either or both of these database roles, use the [GRANT DATABASE ROLE](../sql-reference/sql/grant-database-role.md) command
  and run the SHOW GRANTS command again to confirm the grants:

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  > GRANT DATABASE ROLE SNOWFLAKE.GOVERNANCE_VIEWER TO ROLE data_engineer;
  > GRANT DATABASE ROLE SNOWFLAKE.OBJECT_VIEWER TO ROLE data_engineer;
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```

  For details about these database roles, see [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md).

### Dashboard

As a data administrator, you can use the Dashboard interface to monitor tag and policy usage in the following ways.

* Coverage: specifies the count and percentage based on whether a table, view, or column has a policy or tag.
* Prevalence: lists and counts the most frequently used policies and tags.

The coverage and prevalence provide a snapshot as to how well the data is protected and tagged.

When you select a count number, percentage, policy name, or tag name, the Tagged Objects interface opens. The Tagged Objects
interface updates the filters automatically based on your selection in the Dashboard.

The monitoring information is an alternative or complement to running complex and query-intensive operations on multiple Account
Usage views.

These views might include, but are not limited to, the [COLUMNS](../sql-reference/account-usage/columns.md),
[POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md), [TABLES](../sql-reference/account-usage/tables.md),
[TAG_REFERENCES](../sql-reference/account-usage/tag_references.md), and [VIEWS](../sql-reference/account-usage/views.md) views.

### Tagged Objects

As a data administrator, you can use this table to associate the coverage and prevalence in the Dashboard to a list of specific
tables, view, or columns quickly. You can also filter the table results manually as follows.

* Choose Tables or Columns.
* For tags, you can filter with tags, without tags, or by a specific tag.
* For policies, you can filter with policies, without policies, or by a specific policy.

When you select a row in the table, the Table Details or Columns tab in Catalog » Database Explorer opens. You can edit
the tag and policy assignments as needed.

> **Tip:**
>
> You can use Snowsight to troubleshoot masking policy assignments. In the Columns tab, the MASKING POLICY column
> shows Policy Error when there is a conflict with the masking policy assignment on the column. You can select the Policy
> Error for more information.
>
> Additionally, the Data Preview tab does not render a data preview when there is a error with a masking policy assignment on a
> column. Instead, the Data Preview tab returns the SQL error message. This message corresponds to one of the error values in the
> POLICY_STATUS column of the Account Usage POLICY_REFERENCES view and the Information Schema POLICY_REFERENCES table function.
>
> To correct the error, use the SQL error message and the Policy Error message to modify the tag or policy assignment.

For additional details, refer to [Tag and policy discovery](tag-based-masking-policies.md)

---
title: Understanding compute cost
source: https://docs.snowflake.com/en/user-guide/cost-understanding-compute.md
section: User Guide
---

# Understanding compute cost

Compute costs represent credits used for:

* Virtual Warehouse compute — Virtual warehouses consume credits as they execute queries, load
  data and perform other DML operations. Virtual Warehouses are user-managed, which means you can directly
  control credit consumption of these resources.
* Serverless compute —
  Serverless features use compute resources that are managed by Snowflake instead of using virtual warehouses.
* Compute pools — Compute pools provide the compute resources for Snowpark Container Services.
* Cloud Services compute — Cloud Services is the layer of the Snowflake architecture that
  performs services that tie together all the different components of Snowflake to process user requests, login, query display, and more.
  Cloud Services compute resources are managed by Snowflake.

## Virtual warehouse credit usage

A virtual warehouse is one or more clusters of compute resources that enable executing queries, loading data, and performing other DML
operations. The web interface and other features use warehouses, such as [Cross-Cloud Auto-Fulfillment](../collaboration/provider-understand-cost-auto-fulfillment.md) or display information in dashboards.

Snowflake credits are used to pay for the processing time used by each virtual warehouse.
Snowflake credits are charged based on the number of virtual warehouses you use, how long they run, and their size.

Warehouses come in many sizes. In this table, the size specifies the compute resources per cluster available to the warehouse.
Each increase in size to the next larger warehouse approximately doubles the computing power and the number of credits billed per
full hour that the warehouse runs.

For information on credit consumption, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Important:**
>
> Warehouses are only billed for credit usage while running. When a warehouse is suspended, it does not use any credits.
>
> The credit numbers shown above are for a full hour of usage; however, credits are billed per-second, with a 60-second (i.e. 1-minute)
> minimum:
>
> * Each time a warehouse is started or resumed, the warehouse is billed for 1 minute’s worth of usage based on the hourly
>   rate shown above.
> * Each time a warehouse is resized to a larger size, the warehouse is billed for 1 minute’s worth of usage; however, the number
>   of credits billed are only for the additional compute resources that are provisioned. For example, resizing from Small
>   (2 credits/hour) to Medium (4 credits/hour) results in billing charges for 1 minute’s worth of 2 additional credits.
> * After 1 minute, all subsequent billing is per-second as long as the warehouse runs continuously.
> * Suspending and then resuming a warehouse within the first minute results in multiple charges because the 1-minute minimum starts
>   over each time a warehouse is resumed.
> * Resizing a warehouse from 5X-Large or 6X-Large to 4X-Large (or smaller) results in a brief period during which the warehouse is
>   billed for both the new compute resources and the old resources while the old resources are quiesced.
>
> For more information on warehouses in general, see [Overview of warehouses](warehouses-overview.md) and [Warehouse considerations](warehouses-considerations.md).

To learn how to view the historical cost of consuming compute resources with virtual warehouses, see [Exploring compute cost](cost-exploring-compute.md).

## Serverless credit usage

Serverless credit usage is the result of features relying on compute resources provided by Snowflake rather than user-managed
virtual warehouses. These compute resources are automatically resized and scaled up or down by Snowflake as required for each workload.

For these serverless features, which usually require continuous and/or maintenance operations, this model is more efficient, allowing
Snowflake to charge based on the time spent using the resources. In contrast, user-managed virtual warehouses consume credits while running,
regardless of whether they are performing any work, which may cause them to be overutilized or sit idle.

Charges for serverless features are calculated based on total usage of snowflake-managed compute resources measured in *compute-hours*.
Compute-Hours are calculated on a per second basis, rounded up to the nearest whole second. The number of credits consumed per compute
hour varies depending on the serverless feature.

To learn how many credits are consumed by a serverless feature, refer to the “Serverless Feature Credit Table” in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

Charges for the use of a serverless feature appear on your bill as an individual line item. Charges for both Snowflake-managed compute
resources and Cloud Services appear as a single line item for that serverless feature.

To learn how to view the historical cost of using serverless compute resources, see [Exploring compute cost](cost-exploring-compute.md).

## Compute pool credit usage

[Snowpark Container Services](../developer-guide/snowpark-container-services/overview.md) uses compute pools to run its jobs and services.
A compute pool is a collection of one or more virtual machine (VM) nodes. The number and type of these nodes determine how many credits the
job or service consumes as it uses the compute pool.

For more information about the cost of compute pools, including how to monitor these costs, see [Compute pool cost](../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md).

## Cloud service credit usage

The cloud services layer of the Snowflake architecture is a collection of services that coordinate activities across Snowflake.
This layer authenticates users, enforces security, performs query compilation and optimization, handles request query caching, and more.
Cloud services tie together all of the different components of Snowflake, including supporting the use of virtual warehouses.

The cloud services layer is constructed of stateless compute resources, running across multiple availability zones and using a highly
available, distributed metadata store for global state management. The cloud services layer runs on compute instances provisioned by
Snowflake from the cloud provider.

Similar to virtual warehouse usage, Snowflake credits are used to pay for the usage of the cloud
services.

Snowflake Marketplace calculates compute costs for listing auto-fulfillment to VPS regions by using VPS rates. For details on VPS rates, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Understanding billing for cloud services usage

Usage for cloud services is charged only if the daily consumption of cloud services exceeds 10% of the daily usage of virtual warehouses.
The charge is calculated daily (in the UTC time zone). This ensures that the 10% adjustment is accurately applied each day, at the credit
price for that day.

Keep the following in mind:

* Serverless compute does not factor into the 10% adjustment for cloud services.
* The 10% adjustment for cloud services is calculated daily (in the UTC time zone) by multiplying daily warehouse usage by 10%.
* The adjustment on the monthly usage statement is equal to the sum of these daily calculations.
* If cloud services consumption is less than 10% of warehouse compute credits on a given day, then the adjustment for that day is equal to
  the cloud services used by your account. The daily adjustment never exceeds actual cloud services usage for that day. Thus, the total
  monthly adjustment may be significantly less than 10%.

For example:

| Date | Compute Credits Used (Warehouses only) | Cloud Services Credits Used | Credit Adjustment for Cloud Services (Lesser of 10% of Compute or Cloud Services) | Credits Billed (Sum of Compute, Cloud Services, and Adjustment) |
| --- | --- | --- | --- | --- |
| Nov 1 | 100 | 20 | -10 | 110 |
| Nov 2 | 120 | 10 | -10 | 120 |
| Nov 3 | 80 | 5 | -5 | 80 |
| Nov 4 | 100 | 13 | -10 | 103 |
| **Total** | **400** | **48** | **-35** | **413** |

### More about cloud services

* To learn how to view the historical cost of consuming cloud services resources, see [Exploring compute cost](cost-exploring-compute.md), which
  includes [sample queries](cost-exploring-compute.md) you can run to see how much of cloud services consumption was
  actually billed and which queries and warehouses have the highest cloud services usage.
* To learn about patterns that drive cloud services consumption and ways that you might be able to reduce that consumption, see
  [Optimizing cloud services for cost](cost-optimize-cloud-services.md).

## What are credits?

Snowflake credits are used to pay for the consumption of resources on Snowflake. A Snowflake credit is a unit of measure, and it is
consumed only when a customer is using resources, such as when a virtual warehouse is running, the cloud services layer is performing work,
or serverless features are used.

**Next Topic**
:   * [Exploring compute cost](cost-exploring-compute.md)

---
title: Understanding costs for dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-cost.md
section: User Guide
---

# Understanding costs for dynamic tables

This topic provides an overview of the compute and storage costs associated with dynamic tables. For
general information about Snowflake costs, see [Understanding overall cost](cost-understanding-overall.md).

## Compute costs

There are two compute costs associated with dynamic tables: virtual warehouses and Cloud Services compute.

Dynamic tables require at least one [virtual warehouse](cost-understanding-compute.md) to perform [refreshes](dynamic-tables-refresh.md).
You can optionally assign a second warehouse if you want to separate compute costs for different operations. For more information, see
[Understand warehouse usage for dynamic tables](dynamic-tables-warehouses.md).

Dynamic table refreshes consume [compute credits](cost-understanding-compute.md), and their frequency is determined by the configured
[target lag](dynamic-tables-target-lag.md): lower target lag values trigger more frequent refreshes and therefore higher
compute costs.

Dynamic tables also require [Cloud Services compute](cost-understanding-compute.md) to identify changes in underlying base objects
and determine whether a virtual warehouse must run. If Cloud Services compute finds no changes, no warehouse compute credits are consumed
because there’s no new data to refresh. If changes do exist, even if the dynamic table query filters them, the virtual warehouse consumes
credits because the dynamic table refreshes to evaluate whether those changes apply.

If the associated virtual warehouses are suspended and Cloud Services compute detects no changes in the base tables, the warehouses remain
suspended and the dynamic tables don’t consume any credits. When Cloud Services compute identifies changes in the base tables, the appropriate
warehouse automatically resumes. If the changes support incremental refresh, the dynamic table refreshes by using the WAREHOUSE parameter. If
reinitialization is required — for example, because of a base table schema change — the dynamic table uses the INITIALIZATION_WAREHOUSE to
perform a full reinitialization. For information on how dynamic tables automatically suspend, see [Automatic dynamic table suspension](dynamic-tables-suspend-resume.md).

### Check your consumption of virtual warehouse credits

To check whether your dynamic table refreshes consumed virtual warehouse credits, use the Refresh History tab in Snowsight:

1. In the navigation menu, select Transformation » Dynamic tables.
2. Select your dynamic table, and then select the Refresh History tab.
3. To view refreshes that used the warehouse to update, select the Warehouse used only checkbox.

> **Tip:**
>
> To better understand costs related to your dynamic table pipelines, Snowflake recommends that you test dynamic tables by using dedicated
> warehouses. This way, you can isolate the virtual warehouse consumption that is attributed to dynamic tables. You can move your dynamic
> tables to a shared warehouse after you establish a cost baseline.

For more information, see [Understand warehouse usage for dynamic tables](dynamic-tables-warehouses.md).

### Compute cost for immutability constraints

If you use the IMMUTABLE WHERE constraint, Snowflake recomputes only the rows that don’t match the immutability condition, which helps reduce
reinitialization costs. This is useful in situations where reinitialization can occur, such as the following scenarios:

* Recreating upstream tables or views.
* Changes in upstream data governance policies.
* Failover to a secondary region in a failover group.

Using the IMMUTABLE WHERE constraint can help you reduce the cost of incremental and full refresh because the constraint ignores changes and
data that match its predicate.

Adding immutability constraints to a dynamic table doesn’t trigger extra computation, but removing them does because it causes
[reinitialization](dynamic-tables-refresh.md) on the next refresh. Modifying the predicate in an IMMUTABLE WHERE constraint
might trigger reinitialization depending on whether Snowflake can determine the rows that are returned with the original condition are still
returned with the new condition.

For example, the following modifications don’t trigger reinitialization:

* From `(ts < CURRENT_TIMESTAMP() - INTERVAL '2 days')` to `(ts < CURRENT_TIMESTAMP() - INTERVAL '1 days')`
* From `(year <= 2023)` to `(year <= 2024)`

The following modifications trigger reinitialization:

* From `(ts < '2025-01-02')` to `(ts < '2025-01-01')`
* From `(year < 2024)` to `(month < 10)`

## Storage cost

Dynamic tables require storage to store the materialized results. Similar to regular tables, you might incur additional storage cost for Time
Travel, fail-safe storage, and cloning features.

[Dynamic Apache Iceberg™ tables](dynamic-tables-create-iceberg.md) don’t incur Snowflake storage costs. For more information,
see [Billing](tables-iceberg.md).

This section discusses the following storage considerations for dynamic tables:

* Time Travel and fail-safe storage
* Replication of dynamic tables
* Suspended dynamic tables
* Transient dynamic tables
* Additional storage for incremental refresh operations

For detailed information about how this storage incurs cost, see [Understanding storage cost](cost-understanding-data-storage.md)
and [Data storage considerations](tables-storage-considerations.md).

### Time Travel and fail-safe storage

With Snowflake Time Travel, you can access and query historical versions of dynamic tables at specific points in time, which can help provide
insights into historical trends, changes, and anomalies in your data.

Frequent refreshes can increase buildup of Time Travel data, which adds to your overall storage usage. For more information, see
[Understanding & using Time Travel](data-time-travel.md).

Fail-safe features help protect your dynamic tables from data loss or corruption. Based on the configured fail-safe period, additional storage
charges might apply.

### Replication of dynamic tables

Dynamic tables support cross-account, cross-region replication, which lets you copy data from a primary database to a secondary database for
either disaster recovery or data sharing. It can serve as either a failover preparation strategy for disaster recovery or as a means of
sharing data across deployments for read-only purposes. Using replication with dynamic tables is subject to
[replication costs](account-replication-cost.md). For more information, see [Replication and dynamic tables](account-replication-considerations.md).

### Suspended dynamic tables

Suspended dynamic tables don’t incur additional costs beyond standard storage fees and don’t consume compute resources. If you have ongoing
maintenance tasks or scheduled jobs that interact with the suspended table, your dynamic tables might consume compute resources.

### Transient dynamic tables

Snowflake supports [transient](tables-temp-transient.md) dynamic tables, similar to regular tables, that persist until
explicitly dropped, and are available to all users with the appropriate privileges without a fail-safe period. Transient dynamic tables are
best used for transitory data that doesn’t need the same level of data protection and recovery that permanent tables provide. Using them
helps you save on storage charges for fail-safe storage.

### Additional storage for incremental refresh operations

For incremental refresh operations, dynamic tables maintain an additional internal metadata column for identifying each row within the table.
Internal row identifiers consume a constant amount of storage per row and increase storage cost linearly to the number of rows in the table,
independent of the number of columns.

For tables with very few columns, the increase in storage compared to an equivalent [CTAS](../sql-reference/sql/create-table.md) table can be significant,
or even dominant. In wider dynamic tables, this effect is less pronounced.

## Refresh schedule cost

The schedule at which a dynamic table refreshes, whether [full or incremental](dynamic-tables-refresh.md), has an effect
on its overall cost. This section discusses the factors that you should consider when you decide on a refresh schedule, with the assumption
that every refresh is non-empty:

* Full refresh schedule
* Incremental refresh schedule

> **Note:**
>
> Refreshes are relatively inexpensive if the sources haven’t changed. For more information, see Compute costs (in this topic).

### Full refresh schedule

The cost of a full refresh typically depends on how much data your dynamic table scans and how often it refreshes. To save on costs, you can
refresh your dynamic tables only when you need to; for example, you can suspend your dynamic tables outside of business hours. For precise
timing control, set the [downstream target lag](dynamic-tables-target-lag.md) for your dynamic tables and use
[manual refresh](../sql-reference/sql/alter-dynamic-table.md) from a [task](tasks-intro.md) to automate your custom schedules.

### Incremental refresh schedule

The cost of an incremental refresh is typically proportional to the volume of changes in the source objects, plus some fixed overhead.

If the overhead is low, you can set a high refresh frequency without much downside. This means that you can refresh often for best results.
For instance, a simple `SELECT ... FROM ... WHERE` dynamic table only processes changed rows between refreshes, which has minimal
overhead and the dynamic table can run frequently at low added cost.

If the overhead is high, you must balance the credit consumption of high refresh frequency with the business benefits of freshness. For
example, in a dynamic table with a join, you must join the changes in one table with the other table. No matter how small the set of changes,
this join usually involves a minimum cost for you to execute. If this overhead is significant, it can accumulate as the refresh frequency
increases.

To reduce overhead and optimize incremental refresh performance, see
[Optimize queries for incremental refresh](dynamic-tables-performance-optimize-query.md).

---
title: Understanding data transfer cost
source: https://docs.snowflake.com/en/user-guide/cost-understanding-data-transfer.md
section: User Guide
---

# Understanding data transfer cost

Data transfer is the process of moving data into (ingress) and out of (egress) Snowflake.

Snowflake charges a per-byte fee for data egress when users transfer data from a Snowflake account into a different region on the same
cloud platform or into a completely different cloud platform. Data transfers within the same region are free.

The per-byte rate for transferring data out of a region depends where your Snowflake account is hosted. For data transfer pricing, see
the [pricing guide](https://www.snowflake.com/pricing/pricing-guide/).

> **Note:**
>
> Snowflake does not charge *data ingress* fees. However, a cloud storage provider might charge a data egress fee for transferring
> data from the provider to your Snowflake account.
>
> Contact your cloud storage provider (Amazon S3, Google Cloud Storage, or Microsoft Azure) to determine whether they apply data egress
> charges to transfer data from their network and region of origin to the cloud provider’s network and region where your Snowflake
> account is hosted.

## Snowflake features that incur transfer costs

Snowflake features that transfer data from a Snowflake account into a different region on the same cloud platform or into a completely
different cloud platform incur data transfer costs. For example, the following actions incur data transfer costs:

* [Unloading data](data-unload-overview.md) - Unloading data from Snowflake to Amazon, Google Cloud Storage, or Microsoft Azure.

  Typically this involves the use of [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) to unload data to cloud storage in a region or cloud
  platform different from where your Snowflake account is hosted.
  In addition, unloading data typically involves a stage, with its associated costs.
* [Replicating Data](account-replication-intro.md) - Replication of databases, creating a snapshot of the database to a
  secondary database.

  Typically this involves replicating data to a Snowflake account in a region or cloud platform different from where your primary (origin)
  Snowflake account is hosted. See also [Replication Billing](account-replication-cost.md).
* [External network access](../developer-guide/external-network-access/external-network-access-overview.md) - Accessing network locations
  external to Snowflake from procedure or UDF handler code using external access. See also
  [Costs of external network access](../developer-guide/external-network-access/external-network-access-billing.md).
* [Copy files](../sql-reference/sql/copy-files.md) - Copy files from a source stage to an output stage. For example, with [Writing files from Snowpark Python UDFs and UDTFs](../developer-guide/snowpark/python/creating-udfs.md), you can copy files to an external stage that is on a different region/cloud.
* [Writing external functions](../sql-reference/external-functions.md) - Use of external functions to transfer data from your Snowflake account to AWS, Microsoft
  Azure, or Google Public cloud. See also [External Functions Billing](../sql-reference/external-functions-introduction.md).
* [Cross-Cloud Auto-Fulfillment](../collaboration/provider-understand-cost-auto-fulfillment.md) -
  Using auto-fulfillment to offer listings to consumers in other cloud regions.
* [FileOperation.put](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.FileOperation.put) - PUT calls to external stages from Snowpark stored procedures.
* [Cross-region/cross-cloud Iceberg writes](tables-iceberg.md) - When you [use Snowflake as the catalog](tables-iceberg.md),
  writing new data into the Iceberg table incurs costs for data transfer usage if the active storage location is in a different region or
  with another cloud provider. However, data transfers within the same region are free.
* Cross-region/cross-cloud reads from an Iceberg table created with Snowflake Storage - When you create an Iceberg table
  using [Snowflake Storage](tables-iceberg-internal-storage.md), reading existing data through the
  Snowflake Horizon Catalog incurs costs for data transfer usage if the client is in a different region or with another cloud provider.

> **Note:**
>
> Snowflake does not apply data egress charges when a Snowflake client or driver retrieves query results across regions within
> the same cloud platform or across different cloud platforms.

**Next Topic**

> * [Exploring data transfer cost](cost-exploring-data-transfer.md)

---
title: Understanding Dynamic Data Masking
source: https://docs.snowflake.com/en/user-guide/security-column-ddm-intro.md
section: User Guide
---

# Understanding Dynamic Data Masking

This topic provides a general overview of the Dynamic Data Masking feature.

To learn more about using a masking policy with a tag, see [Tag-based masking policies](tag-based-masking-policies.md).

## What is Dynamic Data Masking?

Dynamic Data Masking is a Column-level Security feature that uses masking policies to selectively mask plain-text data in table and view columns at query time.

In Snowflake, masking policies are schema-level objects, which means a database and schema must exist in Snowflake before a masking policy can be applied to a column. Currently, Snowflake supports using Dynamic Data Masking on tables and views.

At query runtime, the masking policy is applied to the column at every location where the column appears. Depending on the masking policy conditions, the SQL execution context, and role hierarchy, Snowflake query operators may see the plain-text value, a partially masked value, or a fully masked value.

For more details about how masking policies work, including the query runtime behavior, creating a policy, usage with tables and views, and management approaches using masking policies, see: [Understanding Column-level Security](security-column-intro.md).

For more details on the effects of the SQL execution context and role hierarchy, see [Advanced Column-level Security topics](security-column-advanced.md).

## Dynamic Data Masking benefits

The following summarizes some of the key benefits of Dynamic Data Masking.

Ease of use:
:   You can write a policy once and have it apply to thousands of columns across databases and schemas.

Data administration and SoD:
:   A security or privacy officer decides which columns to protect, not the object owner. Masking policies are easy to manage and support centralized and decentralized administration models.

Data authorization and governance:
:   Contextual data access by role or custom entitlements.

    Supports data governance as implemented by security or privacy officers and can prohibit privileged users with the ACCOUNTADMIN or SECURITYADMIN role from unnecessarily viewing data.

Data sharing:
:   Easily mask data before sharing.

Change management:
:   Easily change masking policy content without having to reapply the masking policy to thousands of columns.

For a comparison of benefits between Dynamic Data Masking and External Tokenization, see: [Column-level Security Benefits](security-column-intro.md).

## Dynamic Data Masking limitations

For an overview of the limitations, see [Column-level Security Limitations](security-column-intro.md).

## Dynamic Data Masking considerations

For additional Dynamic Data Masking Considerations, see [Column-level Security Considerations](security-column-intro.md).

## Dynamic Data Masking privileges

The following table summarizes the privileges related to Dynamic Data Masking.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a [masking policy](security-column-intro.md) on a column.  Note that granting the global APPLY MASKING POLICY privilege (i.e. APPLY MASKING POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see [Masking policy privileges](security-column-intro.md). |
| OWNERSHIP | Grants full control over the masking policy. Required to alter most properties of a masking policy. Only a single role can hold this privilege on a specific object at a time. |

> **Note:**
>
> Operating on a masking policy also requires the USAGE privilege on the parent database and schema.

## Dynamic Data Masking DDL

Snowflake provides the following set of commands to manage Dynamic Data Masking policies.

* [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md)
* [ALTER MASKING POLICY](../sql-reference/sql/alter-masking-policy.md) (see also: [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md), and [ALTER VIEW](../sql-reference/sql/alter-view.md))
* [DROP MASKING POLICY](../sql-reference/sql/drop-masking-policy.md)
* [SHOW MASKING POLICIES](../sql-reference/sql/show-masking-policies.md)
* [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md)

## Auditing Dynamic Data Masking

Snowflake provides two Account Usage views to obtain information about masking policies:

* The [MASKING POLICIES](../sql-reference/account-usage/masking_policies.md) view provides a list of all masking policies in your
  Snowflake account.
* The [POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) view provides a list of all objects in which a masking
  policy is set.

The Information Schema table function [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) can be used to either:

* Return a list of all objects (i.e. tables, views) that have the masking policy set on a column.
* Return a list of policy associations that have the specified object name and object type.

Snowflake records the original query run by the user on the [History page](ui-snowsight-activity.md) (in the web interface). The query
is found in the SQL Text column.

The masking policy names that were used in a specific query can be found in the [Query Profile](ui-snowsight-activity.md).

The query history is specific to the Account Usage [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view only. In this
view, the Query Text column contains the text of the SQL statement. Masking policy names are not included in the QUERY_HISTORY
view.

## Troubleshooting Dynamic Data Masking

You can use error messages to help troubleshoot masking policy issues.

### Error Messages

The following table describes error messages Snowflake can return while using masking policies.

| Behavior | Error Message | Troubleshooting Action |
| --- | --- | --- |
| Cannot apply a masking policy to a Snowflake feature. | Unsupported feature `CREATE ON MASKING POLICY COLUMN`. | Masking policies are currently not applicable to this feature. |
| An active role cannot create or replace a masking policy. | SQL access control error: Insufficient privileges to operate on account <account_name> | Grant the CREATE MASKING POLICY privilege to the specified role using `grant create masking policy on account to role <role_name>;` . Verify the role has the privilege using `show grants to role <role_name>`, and try the CREATE OR REPLACE masking statement again. |
| A given role cannot attach a masking policy to a table. | SQL compilation error: Database <database_name> does not exist or not authorized. | Grant the APPLY MASKING POLICY privilege to the role using `grant apply masking policy on account to role <role_name>;` |
| A given role that does not own a masking policy on a table tries to apply a masking policy on a table they can use. | SQL compilation error: Masking policy <policy_name> does not exist or not authorized. | Grant the given role usage on the masking policy using `grant apply on masking policy <policy_name> to role <role_name>;` |
| Cannot drop or remove a policy using `drop masking policy <policy_name>;` | SQL compilation error: Policy <policy_name> cannot be dropped/replaced as it is associated with one or more entities. | Use an ALTER TABLE … MODIFY COLUMN or ALTER VIEW … MODIFY COLUMN statement to UNSET the policy first, then try the DROP statement again. |
| Restoring a dropped table produces a masking policy error. | SQL execution error: Column <column_name> already attached to a masking policy that does not exist. Please contact the policy administrator. | Unset the currently attached masking policy with an ALTER Table/View MODIFY COLUMN statement and then reapply the masking policy to the column with a CREATE OR REPLACE statement. |
| Cannot apply a masking policy to a specific column, but the masking policy can be applied to a different column. | Specified column already attached to another masking policy.A column cannot be attached to multiple masking policies.please drop the current association in order to attach a new masking policy. | Decide which masking policy should apply to the column, update, and try again. |
| Updating a policy with an ALTER statement fails. | SQL compilation error: Masking policy <policy_name> does not exist or not authorized. | Verify the policy name in the ALTER command matches an existing policy by executing `show masking policies;` |
| The role that owns the cloned table cannot unset a masking policy. | SQL access control error: Insufficient privileges to operate on ALTER TABLE UNSET MASKING POLICY ‘<policy_name>’ | Grant the APPLY privilege to the role that owns the cloned table using `grant apply on masking policy <policy_name> to role <role_name>;` . Verify that the role that owns the cloned table has the grant using `show grants to role <role_name>;` and try the ALTER statement again. |
| Updating a policy using IF EXISTS returns a successful result but does not update the policy. | No error message returned; Snowflake returns Statement executed successfully. | Remove IF EXISTS from the ALTER statement and try again. |
| While creating or replacing a masking policy with CASE, the data types do not match (e.g. (VAL string) -> returns number). | SQL compilation error: Masking policy function argument and return type mismatch. | Update the masking policy using CASE with matching data types using a CREATE OR REPLACE statement or an ALTER MASKING POLICY statement. |
| Applying a masking policy to a virtual column. | SQL compilation error: Masking policy cannot be attached to a VIRTUAL_COLUMN column. | Apply the masking policy to the column(s) in the source table. |
| Applying a masking policy to a materialized view. | SQL compilation error: syntax error line <number> at position <number> unexpected ‘modify’. . SQL compilation error: error line <number> at position <number> invalid identifier ‘<character>’ . SQL execution error: One or more materialized views exist on the table. number of mvs=<number>, table name=<table_name>. | Apply the masking policy to the column(s) in the source table. For more information, see [Limitations](security-column-intro.md). |
| Applying a masking policy to a table column used to create a materialized view. | SQL compilation error: Masking policy cannot be attached to a MATERIALIZED_VIEW column. | To apply the masking policy to the table column, drop the materialized view. |
| Including a masked column while creating a materialized view. | Unsupported feature ‘CREATE ON MASKING POLICY COLUMN’. | Create the materialized view without including the masked columns or do not set any masking policies on the base table or views, create the materialized view, and then apply the masking policies to the materialized view columns. |
| Cannot create a masking policy with a user-defined function (UDF) in the masking policy body. | SQL access control error: Insufficient privileges to operate on function ‘<udf_name>’ | Verify the role creating the masking policy has the USAGE privilege on the UDF. |

**Next Topics:**

* [Using Dynamic Data Masking](security-column-ddm-use.md)

---
title: Understanding dynamic table initialization and refresh
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-refresh.md
section: User Guide
---

# Understanding dynamic table initialization and refresh

A dynamic table’s content is defined by a query and automatically updates — called a refresh — when the underlying data changes.
This process analyzes the query to keep the table current.

> **Note:**
>
> Information in this topic applies to dynamic tables with the `SCHEDULER` attribute set to `ENABLE` or not explicitly set. Dynamic
> tables with the `SCHEDULER` attribute set to `DISABLE` can only be refreshed manually. For more information, see
> [Manually refresh dynamic tables](dynamic-tables-manual-refresh.md).

The following sections explain dynamic table refresh in more detail:

| Section | Description |
| --- | --- |
| Understanding dynamic table initialization | Introduces initialization, or in other words, the initial data population when you create a dynamic table. You can specify when the initial refresh occurs. |
| Understanding manual and scheduled refresh options | An overview of dynamic table refresh. Dynamic tables refresh on a schedule unless manually refreshed. |
| Dynamic table refresh modes | Dynamic tables support different refresh modes: incremental, full, and AUTO. |
| How data is refreshed when a dynamic table depends on other dynamic tables | Learn how dynamic tables refresh in relation to their dependencies. |
| Understanding the effects of changes to columns in base tables |  |

## Understanding dynamic table initialization

When you [create a dynamic table](dynamic-tables-create.md), its initial refresh takes place either synchronously at creation
or at a scheduled time. The initial data population, or initialization, depends on when this initial refresh occurs.

Dynamic tables refresh based on the specified [target lag](dynamic-tables-target-lag.md), which sets the maximum allowed delay
between updates to the base tables and the dynamic table’s content. If you set `INITIALIZE = ON_CREATE` (default), the table is initialized
immediately. If you set `INITIALIZE = ON_SCHEDULE`, initialization happens within the specified target lag timeframe.

For example, consider a dynamic table, `DT1`, with a target lag of 30 minutes. The initial data population for `DT1` can occur as follows:

* If `DT1` is set to refresh synchronously at creation (`ON_CREATE`), it initializes at creation.
* If `DT1` is set to refresh at a scheduled time (`ON_SCHEDULE`), it initializes within 30 minutes.

In scenarios with downstream dependencies, refresh behavior depends on the dependencies. For example, if dynamic table `DT1` has a
[downstream](dynamic-tables-target-lag.md) target lag and `DT2`, which depends on `DT1`, has a 30-minute target lag, `DT1`
refreshes only when `DT2` refreshes.

For `DT1`:

* If set to refresh synchronously at creation, it initializes immediately. If initialization fails, the creation process stops, providing
  immediate feedback on any errors.
* If set to refresh at a scheduled time, initialization depends on when `DT2` refreshes.

Initialization can take some time, depending on how much data is scanned. To track progress, see [Troubleshoot dynamic table creation](dynamic-tables-create.md).

## Understanding manual and scheduled refresh options

Dynamic tables are refreshed on a schedule that’s determined by the [target lag](dynamic-tables-target-lag.md). Every time a
dynamic table is read, the data freshness is within the time period defined by the target lag.

You can manually refresh your dynamic tables to get the latest data using the ALTER DYNAMIC TABLE … REFRESH command or Snowsight.
For more information, see [Manually refresh dynamic tables](dynamic-tables-manual-refresh.md).

Dynamic table refresh timeouts are controlled by the [STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) parameter, which sets the maximum allowed
duration at the account or warehouse level before a refresh is automatically canceled.

### How target lag affects scheduled refreshes

Target lag controls the frequency of scheduled refreshes. To manually manage refreshes, set your dynamic table’s target lag to DOWNSTREAM and
ensure that all downstream dynamic tables are also set to DOWNSTREAM.

Setting the entire Directed Acyclic Graph (DAG)’s target lag to DOWNSTREAM essentially disables scheduled refreshes because the final dynamic
table controls the refresh schedule. If no dynamic table has a time-based target lag, the pipeline is suspended for scheduled refreshes. In
this case, manually refreshing the most downstream table automatically refreshes any upstream dependencies.

Setting the target lag to DOWNSTREAM doesn’t specify exact times. Instead, Snowflake picks a refresh cadence to attempt to keep the lag under
the target value. For example, a dynamic table with a target lag of 4 hours might refresh every 3.5 hours.

To specify exact times, you can use a task with a CRON schedule. For more information, see [Manually refresh dynamic tables](dynamic-tables-manual-refresh.md).

## Dynamic table refresh modes

Dynamic tables support three refresh modes: auto, incremental, and full.
You can either set the refresh mode to [AUTO](../sql-reference/sql/create-dynamic-table.md)
or set it explicitly:

* **AUTO refresh mode:** When using the `AUTO` parameter, Snowflake automatically selects the most cost- and time-effective refresh mode
  based on query complexity, supported constructs, operators, functions, and expected performance. This decision is made only once at the time
  of table creation. If incremental refresh is [unsupported](dynamic-tables-supported-queries.md) or
  [inefficient](dynamic-tables-performance-optimize-query.md), Snowflake chooses full refresh instead.

  For example, if a dynamic table references a view and the view’s definition changes asynchronously, the refresh mode remains unchanged. If
  the original decision was incremental but becomes unsupported (for example, due to an upstream view change), the refresh will fail with an
  error like `Dynamic table can no longer be refreshed incrementally because an upstream view changed.`

  To change the refresh mode, recreate the dynamic table using the CREATE OR REPLACE DYNAMIC TABLE command.
* **Incremental refresh mode:** This mode analyzes the dynamic table’s query and calculates changes since the last refresh. It then merges these
  changes into the table.
* **Full refresh mode:** This mode executes the dynamic table’s query and completely replaces the previously materialized results.

For guidance on when to use incremental refresh versus full refresh, see [Choose a refresh mode](dynamic-tables-performance-optimize.md).
To check which refresh mode an existing dynamic table uses, see
[Refresh mode](dynamic-tables-performance-monitor.md).

> **Important:**
>
> Dynamic tables in incremental refresh mode can only be downstream from dynamic tables with full refresh mode if the
> upstream full refresh table has a system-derived unique key or an immutability constraint.
>
> For more information, see [Understanding primary keys in dynamic tables](dynamic-tables-primary-keys.md) and [Understanding immutability constraints](dynamic-tables-immutability-constraints.md).

## How data is refreshed when a dynamic table depends on other dynamic tables

When a dynamic table’s lag is set as a time measure, the automated refresh process schedules refreshes to best meet the target lag times.

In order to keep data consistent in cases when [one dynamic table depends on another](dynamic-tables-create.md), the
process refreshes all dynamic tables in an account at compatible times. The timing of less frequent refreshes coincides with the timing of
more frequent refreshes. If refreshes take too long, the scheduler may skip refreshes to try to stay up to date. However, snapshot isolation
is preserved.

For example, suppose that dynamic table `DT1` has a target lag of two minutes and queries dynamic table `DT2`, which has a target lag of
one minute. The process might determine that `DT1` should be refreshed every 96 seconds, and `DT2` every 48 seconds. As a result, the
process might apply the following schedule:

| Specific Point in Time | Dynamic Tables Refreshed |
| --- | --- |
| 2022-12-01 00:00:00 | DT1, DT2 |
| 2022-12-01 00:00:48 | DT2 |
| 2022-12-01 00:01:36 | DT1, DT2 |
| 2022-12-01 00:02:24 | DT2 |

The target lag of a dynamic table can’t be shorter than the target lag of the dynamic tables it depends on, unless the upstream dynamic table
is referenced through [DYNAMIC_TABLE_REFRESH_BOUNDARY()](dynamic-tables-refresh-boundary.md). For example, suppose that:

* `DT1` queries dynamic tables `DT2` and `DT3`.
* `DT2` has a target lag of five minutes.
* `DT3` has a target lag of one minute.

This means that the target lag time for `DT1` must not be shorter than five minutes (that is, not shorter than the longer of the lag times
for `DT2` and `DT3`).

If you set the lag for `DT1` to five minutes, the process sets up a refresh schedule with these goals:

* Refresh `DT3` often enough to keep its lag below one minute.
* Refresh `DT1` and `DT2` together and often enough to keep their lags below five minutes.
* Ensure that the refresh for `DT1` and `DT2` coincides with a refresh of `DT3` to ensure snapshot isolation.

> **Important:**
>
> Dynamic tables in incremental refresh mode can only be downstream from dynamic tables with full refresh mode if the
> upstream full refresh table has a system-derived unique key or a immutability constraint.
>
> For more information, see [Understanding primary keys in dynamic tables](dynamic-tables-primary-keys.md) and [Understanding immutability constraints](dynamic-tables-immutability-constraints.md).

### Snapshot isolation

When a dynamic table refreshes, it ensures a consistent state by Time Traveling to the same data timestamp across all upstream dependencies.

For non-dynamic base tables, Time Travel works as usual, where it looks at the “wall-clock” commit time. This means that the contents of a
dynamic table are always consistent with a “snapshot” of the data in the base tables.

For upstream dynamic tables, Snowflake looks up the specific table version tagged with that data timestamp. This ensures that downstream tables
are always consistent with their ancestors. You don’t need to coordinate refresh schedules or worry about different lags; Snowflake automatically
aligns the snapshots to ensure data integrity across the pipeline.

Snapshot isolation isn’t guaranteed in the following cases:

* **Manual SELECT statements:** When you join multiple dynamic tables using a manual SELECT statement, ad hoc queries use the current
  version of each table. Because each dynamic table commits its refresh independently, a manual join might capture different refresh states,
  even if the dynamic tables share the same target lag or an upstream refresh is delayed. This means the results might not reflect a single,
  consistent snapshot of the base data.
* **Refresh boundaries:** When a dynamic table references an upstream dynamic table through
  [DYNAMIC_TABLE_REFRESH_BOUNDARY()](dynamic-tables-refresh-boundary.md), the upstream dynamic table is treated as belonging
  to a separate pipeline. The downstream dynamic table reads whatever version of the upstream data is available at refresh time, rather than a
  coordinated data timestamp.

## Understanding the effects of changes to columns in base tables

When the underlying objects associated with a dynamic table change, the following behaviors apply:

| Change | Impact |
| --- | --- |
| * New column added to the base table. * Existing unused column removed in the base table. | None. If a new column is added to the base table or an unused column is deleted, no action occurs and refreshes continue as before. |
| * Underlying base table is recreated with identical column names and types. * Underlying base table column is recreated with the same name and type. * Changes to the policies on underlying base tables of dynamic tables with incremental refresh. | Reinitialization: The first refresh after recreation is initialization. |
| * Changes to underlying base table for dynamic tables created with `SELECT *` from base table. | The dynamic table fails to refresh and must be recreated to respond to the change. |
| * Changes to underlying base table for dynamic tables created with a column definition. | No impact to the dynamic table. |

---
title: Understanding dynamic table target lag
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-target-lag.md
section: User Guide
---

# Understanding dynamic table target lag

Dynamic table refresh is triggered by the data’s target lag, which determines how outdated it can be. You can set a fixed target lag
or set the dynamic table to DOWNSTREAM, making its refresh timing depend on the dynamic tables that depend on it.

The target lag for a dynamic table is measured relative to the dynamic tables at the root of the graph, not the dynamic tables directly
upstream. To see the graph of tables connected to your dynamic table, see [View the graph of tables connected to your dynamic tables](dynamic-tables-monitor.md).

Snowflake schedules refreshes to keep the actual lag of your dynamic tables below their target lag. The duration of each refresh depends on
the query, data pattern, and warehouse size. When choosing a target lag, consider the time needed to refresh each
[dynamic table in a chain](dynamic-tables-create.md) to the root. If you don’t, some refreshes might be skipped, leading
to a higher actual lag.

> **Note:**
>
> For dynamic tables with the `SCHEDULER` attribute explicitly set to `DISABLE`, target lag doesn’t apply and automatic refreshes
> are disabled. These dynamic tables can only be refreshed manually. For more information, see
> [Manually refresh dynamic tables](dynamic-tables-manual-refresh.md).

## Types of target lag

You specify target lag in one of the following ways. Target lag is inversely proportional to the dynamic table’s refresh frequency: frequent
refreshes imply a lower lag.

1. **Measure of freshness**: Defines the maximum amount of time that the dynamic table’s content should lag behind updates to the base tables.

   > The following example sets `my_dynamic_table` to refresh and maintain freshness within every hour:
   >
   > ```sqlexample
   > ALTER DYNAMIC TABLE my_dynamic_table SET TARGET_LAG = '1 hour';
   > ```
2. **Downstream**: Specifies that the dynamic table should refresh on demand when downstream tables (tables that depend on this table)
   refresh. This refresh can be triggered by [initialization at creation](dynamic-tables-refresh.md),
   [manual refresh](dynamic-tables-manual-refresh.md), or [scheduled refresh](dynamic-tables-refresh.md)
   of a downstream table.

   When `refresh_mode` is set to `downstream`, the refresh schedule of a dynamic table is driven by the most demanding (shortest) lag of its
   downstream dependents. For example, if one downstream dependent table requires data that is no older than 10 minutes and another
   downstream dependent table requires data that is no older than 1 hour, the refresh schedule of this dynamic table will be every 10
   minutes because that is the shortest lag of its downstream dependents.

   In the following example, `my_dynamic_table` is set to refresh based on the target lag of its downstream dynamic tables. If
   `my_dynamic_table` doesn’t have any dynamic tables that depend on it, then it won’t refresh.

   ```sqlexample
   ALTER DYNAMIC TABLE my_dynamic_table SET TARGET_LAG = DOWNSTREAM;
   ```

   For more examples of downstream target lag, see Example: Target lag for dynamic table chains.

## How Snowflake schedules refreshes

Snowflake schedules refreshes slightly earlier than the target lag to allow time for the refresh to complete. For example, if you set the
target lag to 5 minutes, the table might refresh more frequently than every five minutes. Actual refresh intervals are often shorter than
the specified lag.

> **Note:**
>
> Target lag is a target, not a guarantee. Snowflake attempts to keep data within the target lag, but actual lag may exceed the target
> because of factors such as warehouse size, data volume, and query complexity.

For guidance on adjusting target lag for your workload, see [Alter the warehouse or target lag for dynamic tables](dynamic-tables-alter.md).
For information about optimizing your target lag, see [Identify the right target lag](dynamic-tables-performance-optimize.md).

## How upstream and downstream relationships affect target lag

The following diagram illustrates suspend, resume, and manual refresh operations in the context of upstream and downstream relationships to
other dynamic tables.

The diagram depicts a simple declarative data pipeline built with dynamic tables:

* `DT2` is described as *downstream* of `DT1` because it depends on that dynamic table, and as *upstream* of `DT3`, which depends on it.
* `DT3` is downstream of both `DT2` and `DT1` because it depends on `DT2` directly and on `DT1` indirectly.
* `DT1` is directly or indirectly upstream of the other dynamic tables.

## Example: Target lag for dynamic table chains

Consider the following example where a dynamic table (`DT2`) reads from another dynamic table (`DT1`) to materialize its contents. In
this scenario, a report consumes `DT2`’s data via a query.

The following results are possible, depending on how each dynamic table specifies its lag:

| `DT1` | `DT2` | Refresh results |
| --- | --- | --- |
| `TARGET_LAG = DOWNSTREAM` | `TARGET_LAG = 10minutes` | `DT2` is updated at least every 10 minutes. `DT1` infers its lag from `DT2` and is updated every time `DT2` requires updates. |
| `TARGET_LAG = 10minutes` | `TARGET_LAG = DOWNSTREAM` | This scenario should be avoided. The report query will not receive any data. DT1 is frequently refreshed and `DT2` is not refreshed because there’s no dynamic table that’s based on `DT2`. |
| `TARGET_LAG = 5minutes` | `TARGET_LAG = 10minutes` | `DT2` is updated approximately every 10 minutes with data from `DT1` that’s at most 5 minutes old. |
| `TARGET_LAG = DOWNSTREAM` | `TARGET_LAG = DOWNSTREAM` | Neither `DT1` nor `DT2` is refreshed periodically because both of them have a downstream lag, and neither has a downstream consumer with a defined lag. |

---
title: Understanding Encryption Key Management in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-manage.md
section: User Guide
---

# Understanding Encryption Key Management in Snowflake

This topic provides concepts related to Snowflake-managed keys and customer-managed keys.

## Overview

Snowflake manages data encryption keys to protect customer data. This management can occur automatically without any need for customer
intervention.

Customers can also use the key management service in the cloud platform that hosts their Snowflake account to maintain their own additional
encryption key.

When enabled, the combination of a Snowflake-maintained key and a customer-managed key creates a composite
[master key](https://csrc.nist.gov/glossary/term/master_key) to protect customer data in Snowflake. This is called [Tri-Secret Secure](security-encryption-tss.md). For more information, see [Tri-Secret Secure overview](security-encryption-tss.md).

## Snowflake-managed keys

All Snowflake customer data is encrypted by default. Snowflake uses strong AES encryption with a hierarchical key model rooted in a
cloud-provider-hosted hardware security module.

Keys are automatically rotated on a regular basis by the Snowflake service, and customer data can be automatically re-encrypted (“rekeyed”)
on a regular basis. Customer data encryption and key management require no configuration or management.

### Hierarchical key model

A hierarchical key model provides a framework for Snowflake’s encryption key management. The hierarchy is composed of several layers of
keys in which each higher layer of keys (parent keys) encrypts the layer below (child keys). In security terminology, a parent key
encrypting all child keys is known as “wrapping”.

Snowflake’s hierarchical key model consists of four levels of keys:

* The root key
* Account master keys
* Table master keys
* File keys

Each customer account has a separate key hierarchy of account-level, table-level, and file-level keys, as shown in the following image:

In a multi-tenant cloud service like Snowflake, the hierarchical key model isolates every account with the use of separate account master
keys. In addition to the [access control model](security-access-control-overview.md), which separates storage of customer
data, the hierarchical key model provides another layer of account isolation.

A hierarchical key model reduces the scope of each layer of keys. For example, a table master key encrypts a single table. A file key
encrypts a single file. A hierarchical key model constrains the amount of customer data each key protects and the duration of time for which
it is usable.

### Encryption key rotation

Keys in the Snowflake-managed key hierarchy are automatically rotated by Snowflake when they are more than 30 days old. Active keys are
retired, and new keys are created. When Snowflake determines the retired key is no longer needed, the key is automatically destroyed. When
active, a key is used to encrypt customer data and is available for usage by the customer. When retired, the key is used solely to decrypt
customer data and is only available for accessing the data.

When wrapping child keys in the key hierarchy, or when inserting customer data into a table, only the current, active key is
used to encrypt data. When a key is destroyed, it is not used for either encryption or decryption. Regular key rotation limits the
life cycle for the keys to a limited period of time.

The following image illustrates key rotation for one table master key (TMK) over a period of three months:

The TMK rotation works as follows:

* Version 1 of the TMK is active in April. Customer data inserted into this table in April is protected with TMK v1.
* In May, this TMK is rotated: TMK v1 is retired, and a new, completely random key, TMK v2, is created. TMK v1 is now used only to decrypt
  customer data from April. New customer data inserted into the table is encrypted using TMK v2.
* In June, the TMK is rotated again: TMK v2 is retired and a new TMK, v3, is created. TMK v1 is used to decrypt customer data from April,
  TMK v2 is used to decrypt customer data from May, and TMK v3 is used to encrypt and decrypt new customer data inserted into the table in
  June.

As stated previously, key rotation limits the duration of time in which a key is actively used to encrypt customer data. In conjunction with
the hierarchical key model, key rotation further constrains the amount of customer data a key version protects. Limiting the lifetime of a
key is [recommended](https://csrc.nist.gov/pubs/sp/800/57/pt1/r5/final) by the National Institute of Standards
and Technology (NIST) to enhance security.

### Periodic rekeying

This section continues with an explanation of the account and table master key lifecycle. Encryption Key Rotation describes key rotation,
which replaces active keys with new keys on a periodic basis and retires the old keys. Periodic data rekeying completes the life cycle.

While key rotation ensures that a key is transferred from its active state to a retired state, rekeying ensures that a key is transferred
from its retired state to being destroyed.

If periodic rekeying is enabled, then when the retired encryption key for a table is older than one year, Snowflake automatically creates
a new encryption key and re-encrypts all customer data previously protected by the retired key using the new key. The new key is used to
decrypt the table data going forward.

> **Note:**
> > For Enterprise Edition accounts, users with the ACCOUNTADMIN role can enable periodic rekeying for the account, using
> > [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) and the [PERIODIC_DATA_REKEYING](../sql-reference/parameters.md) parameter, as shown in the following example.
> >
> > > ```sqlexample
> > > ALTER ACCOUNT SET ENABLE_TRI_SECRET_AND_REKEY_OPT_OUT_FOR_IMAGE_REPOSITORY = TRUE;
> > >
> > > ALTER ACCOUNT SET PERIODIC_DATA_REKEYING = TRUE;
> > > ```
>
> This does not apply additional security to any images stored in your Snowpark image repository.

The following image shows periodic rekeying for a TMK for a single table:

Periodic rekeying works as follows:

* In April of the following year, after TMK v1 has been retired for an entire year, it is rekeyed (generation 2) using a fully new random
  key.

  The customer data files protected by TMK v1 generation 1 are decrypted and re-encrypted using TMK v1 generation 2. Having no further
  purpose, TMK v1 generation 1 is destroyed.
* In May, Snowflake performs the same rekeying process on the table data protected by TMK v2.
* And so on.

In this example, the lifecycle of a key is limited to a total duration of one year.

Rekeying constrains the total duration in which a key is used for recipient usage, following NIST recommendations. Furthermore, when
rekeying customer data, Snowflake can increase encryption key sizes and utilize better encryption algorithms that may be standardized since
the previous key generation was created.

Snowflake rekeys customer data files online, in the background, without any impact to currently running customer workloads. Customer data
that is being rekeyed is always available to you. No service downtime is necessary to rekey the data, and you encounter no performance
impact on your workload. This benefit is a direct result of Snowflake’s architecture of separating storage and compute resources.

#### Impact of rekeying on Time Travel and Fail-safe

[Time Travel](data-time-travel.md) and [Fail-safe](data-failsafe.md) retention periods are not affected by
rekeying. However, some additional storage charges are associated with rekeying of customer data in Fail-safe (see next section).

#### Impact of rekeying on storage utilization

Snowflake customers are charged with additional storage for Fail-safe protection of customer data files that were rekeyed. For these
files, 7 days of Fail-safe protection is charged.

That is, for example, the customer data files with the old key on Amazon S3 are already protected by Fail-safe, and the customer data files
with the new key on Amazon S3 are also added to Fail-safe, leading to a second charge, but only for the 7-day period.

### Hardware security module

Snowflake relies on cloud-hosted hardware security modules (HSMs) to help ensure that key storage and usage are secure. Each cloud platform has
different HSM services, and that affects how Snowflake uses the HSM service on each platform:

* On AWS and Azure, Snowflake uses the HSM to create and store the root key.
* On Google Cloud, the HSM service is made available through the Google Cloud KMS (key management service) API. Snowflake uses Google Cloud
  KMS to create and store the root key in multi-tenant HSM partitions.

For all cloud platforms and all keys in the key hierarchy, a key that is stored in the HSM is used to unwrap a key in the hierarchy. For
example, to decrypt the table master key, the key in the HSM unwraps the account master key. This process occurs in the HSM. After this
process completes, a software operation decrypts the table master key with the account master key.

The following image shows the relationship between the HSM, the account master keys, the table master keys, and the file keys:

## Customer-managed keys

A customer-managed key (CMK) is a master encryption key that the customer maintains in the key management service for the cloud provider that
hosts the customer’s Snowflake account. The key management services for each platform are:

* **AWS:** [AWS Key Management Service (KMS)](https://aws.amazon.com/kms/)
* **Google Cloud:** [Cloud Key Management Service (Cloud KMS)](https://cloud.google.com/kms)
* **Microsoft Azure:** [Azure Key Vault](https://azure.microsoft.com/en-us/services/key-vault/)

The CMK can then be combined with a Snowflake-managed key to create a composite master key. When this occurs, Snowflake refers to this
as Tri-Secret Secure. For more information, see [Tri-Secret Secure overview](security-encryption-tss.md).

You can call these system functions in your Snowflake account to obtain information about your keys:

* AWS: [SYSTEM$GET_CMK_KMS_KEY_POLICY](../sql-reference/functions/system_get_cmk_kms_key_policy.md)
* Microsoft Azure: [SYSTEM$GET_CMK_AKV_CONSENT_URL](../sql-reference/functions/system_get_cmk_akv_consent_url.md)
* Google Cloud: [SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD](../sql-reference/functions/system_get_gcp_kms_cmk_grant_access_cmd.md)

Snowflake supports the following:

* **Configuring automatic key rotation in your cloud service provider.** Configuring key rotation creates a new version of the same CMK with
  updated encryption material. You can enable automatic key rotation using the key rotation feature specific to your cloud provider
  without any action in Snowflake.

  When your CMK is rotated, a new cryptographic version is created in your cloud provider. Snowflake does not immediately use this new version.
  Instead, Snowflake incorporates the new CMK version the next time it rotates its internal Account Master Key (AMK), which occurs automatically,
  every 30 days.

  After automatic key rotation occurs, Snowflake uses the new AMK to encrypt newly created database files. Existing files continue to be
  decrypted using prior CMK versions. If you wish to rekey all the existing files with the new CMK version, please see [Use Tri-Secret Secure self-service with automatic key rotation](security-encryption-tss-self-serve.md).

  > **Important:**
  >
  > Do not delete or change or revoke permissions on older CMK versions in any cloud service provider. Deleting a rotated key can
  > lead to data loss. Snowflake cannot decrypt data encrypted with a deleted key.

  For more information about configuring automatic key rotation for customer-managed keys in the cloud platform that supports your Snowflake
  account(s), see:

  + [AWS KMS automatic key rotation policy for Customer Managed Keys used by Snowflake Tri-Secret Secure](https://community.snowflake.com/s/article/Does-enabling-AWS-automatic-key-rotation-CMKs-feature-impact-the-snowflake-account-which-has-tri-secret-secure-enabled)
  + [Microsoft Azure automatic key rotation policy for Customer Managed Keys used by Snowflake Tri-Secret Secure](https://community.snowflake.com/s/article/azure-automatic-key-rotation-tss)
  + [GCP KMS automatic key rotation policy for Customer Managed Keys used by Snowflake Tri-Secret Secure](https://community.snowflake.com/s/article/GCP-KMS-automatic-key-rotation-tri-secret-secure)
* **Manually changing the CMK used by Tri-Secret Secure.** Changing your CMK for Tri-Secret Secure requires creating and self-registering a new
  CMK, and then updating Tri-Secret Secure to use it. Follow the self-registration steps in [Tri-Secret Secure](security-encryption-tss.md) and reach out to
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to coordinate the key change.

  > **Important:**
  >
  > Don’t revoke access to or delete the current CMK until Snowflake Support confirms it is safe to do so.

For more information about using Snowflake system functions to change your CMK, see [Understanding Tri-Secret Secure self-service](security-encryption-tss-self-serve.md) and [Change the CMK for Tri-Secret Secure](security-encryption-tss-self-serve.md).

### Benefits of customer-managed keys

Benefits of customer-managed keys include:

Control over customer data access:
:   You have complete control over your master key in the key management service and, therefore, your customer data in Snowflake. You must
    release this key to decrypt data stored in your Snowflake account.

Disable access due to a customer data breach:
:   If you experience a security breach, you can disable access to your key and halt all data operations running in your Snowflake account.

Ownership of the customer data lifecycle:
:   Using customer-managed keys, you can align your data protection requirements with your business processes. Explicit control over your key
    provides safeguards throughout the entire customer data lifecycle, from creation to deletion.

### Important requirements for customer-managed keys

Customer-managed keys provide significant security benefits, but they also have crucial, fundamental requirements that you must
continuously follow to safeguard your master key:

Confidentiality:
:   You must keep your key secure and confidential at all times.

Integrity:
:   You must ensure your key is protected against improper modification or deletion.

Availability:
:   To execute queries and access your data, you must ensure your key is continuously available to Snowflake.

By design, an invalid or unavailable key will result in a disruption to your Snowflake data operations until a valid key is made available
again to Snowflake.

Snowflake is designed to handle temporary availability issues (up to 10 minutes) caused by common issues, such as network
communication failures. After 10 minutes, if the key remains unavailable, all data operations in your Snowflake account will cease
completely. Once access to the key is restored, data operations can be started again.

Failure to comply with these requirements can significantly jeopardize the integrity of your data, ranging from your data being
temporarily inaccessible to it being permanently disabled. In addition, Snowflake cannot be responsible for 3rd-party
issues that occur or administrative mishaps caused by your organization in the course of maintaining your key.

For example, if an issue with the key management service results in your key becoming unavailable, your data operations will be impacted.
These issues must be resolved between you and the Support team for the key management service. Similarly, if your key is
tampered with or destroyed, all existing data in your Snowflake account will become unreadable until the key is restored.

---
title: Understanding end-to-end encryption in Snowflake
source: https://docs.snowflake.com/en/user-guide/security-encryption-end-to-end.md
section: User Guide
---

# Understanding end-to-end encryption in Snowflake

This topic provides concepts related to end-to-end encryption in Snowflake.

## Overview

End-to-end encryption (E2EE) is a method to secure customer data that prevents third parties from reading the data while at-rest or in
transit to and from Snowflake and to minimize the attack surface.

The figure illustrates the E2EE system in Snowflake:

The E2EE system includes the following components:

* The Snowflake customer in a corporate network.
* A customer-provided or Snowflake-provided data file staging area.
* Snowflake runs in a secure virtual private cloud (VPC) or virtual network (VNet), depending on the cloud platform.

Snowflake supports both internal (Snowflake-provided) and external (customer-provided) stages for data files. Snowflake provides internal
stages where you can upload and group your data files before loading the data into tables (image B).

Customer-provided stages are containers or directories in a supported cloud storage service (e.g. Amazon S3) that you control and manage
(image A). Customer-provided stages are an attractive option for customers who already have data stored in a cloud storage service that
they want to copy into Snowflake.

Per the figure in this section, the flow of E2EE in Snowflake is as follows:

1. A user uploads one or more data files to a stage.

   If the stage is an external stage (Image A), the user may optionally encrypt the data files using client-side encryption (see
   Client-Side Encryption for more information). We recommend client-side encryption for data files in external stages; but if the data
   is not encrypted, Snowflake immediately encrypts the data when it is loaded into a table within Snowflake.

   If the stage is an internal (i.e., Snowflake) stage (Image B) data files are automatically encrypted by the Snowflake client on the
   user’s local machine prior to being transmitted to the internal stage, in addition to being encrypted after they are loaded into the
   stage.
2. The user loads the data from the stage into a table.

   The data is transformed into Snowflake’s proprietary file format and stored in a cloud storage container. In Snowflake, all customer data
   at rest is encrypted and encrypted with TLS in transit to/from the Snowflake service. Snowflake also decrypts customer data when the data
   is transformed or operated on in a table, and then re-encrypts the data when the transformations and operations are complete.
3. The user can unload query results into an external or internal stage.

   Results are optionally encrypted using client-side encryption when unloaded into a customer-managed stage, and are automatically
   encrypted when unloaded to a Snowflake-provided stage.
4. The user downloads data files from the stage and decrypts the data on the client side.

## Client-side encryption

Client-side encryption means that a client encrypts data before copying it into a cloud
storage staging area. Client-side encryption provides a secure system for managing data
in cloud storage.

Client-side encryption follows a specific protocol defined by the cloud storage service. The service SDK and third-party tools implement
this protocol.

The following image summarizes client-side encryption:

The client-side encryption protocol works as follows:

1. The customer creates a secret [master key](https://csrc.nist.gov/glossary/term/master_key), which is shared with Snowflake.
2. The client, which is provided by the cloud storage service, generates a random encryption key and encrypts the file before uploading it
   into cloud storage. The random encryption key, in turn, is encrypted with the customer’s master key.
3. Both the encrypted file and the encrypted random key are uploaded to the cloud storage service. The encrypted random key is stored with
   the file’s metadata.

When downloading data, the client downloads both the encrypted file and the encrypted random key. The client decrypts the encrypted random
key using the customer’s master key.

Next, the client decrypts the encrypted file using the now decrypted random key. This encryption and decryption happens on the client side.

At no time does the third-party cloud storage service or any other third party (such as an ISP) see the data in the clear. Customers may upload
client-side encrypted data using any client or tool that supports client-side encryption.

## Ingesting client-side encrypted data into Snowflake

Snowflake supports the client-side encryption protocol using a client-side master key when reading or writing data between a cloud storage
service stage and Snowflake, as shown in the following image:

To load client-side encrypted data from a customer-provided stage, you create a named stage object with an additional `MASTER_KEY`
parameter using a [CREATE STAGE](../sql-reference/sql/create-stage.md) command, and then load data from the stage into your Snowflake tables. The
`MASTER_KEY` parameter requires either a 128-bit or 256-bit Advanced Encryption Standard (AES) key encoded in Base64.

A named stage object stores settings related to a stage and provides a convenient way to load or unload data between Snowflake and a
specific container in cloud storage. The following SQL snippet creates an example Amazon S3 stage object in Snowflake that supports
client-side encryption:

```sqlexample
-- create encrypted stage
create stage encrypted_customer_stage
url='s3://customer-bucket/data/'
credentials=(AWS_KEY_ID='ABCDEFGH' AWS_SECRET_KEY='12345678')
encryption=(MASTER_KEY='eSxX...=');
```

The truncated master key specified in this SQL command is the Base64-encoded string of the customer’s secret master key. As with all other
credentials, this master key is transmitted over Transport Layer Security (HTTPS) to Snowflake and is stored encrypted in metadata storage.
Only the customer and the query-processing components of Snowflake are exposed to the master key.

A benefit of named stage objects is that they can be granted to other users within a Snowflake account without revealing access credentials
or client-side encryption keys to those users. Users with the appropriate access control privileges simply reference the named stage object
when loading or unloading data.

The following SQL commands create a table named `users` and copy data from the encrypted stage into the `users` table:

```sqlexample
-- create table and ingest data from stage
CREATE TABLE users (id bigint, name varchar(500), purchases int);
COPY INTO users FROM @encrypted_customer_stage/users;
```

The data is now ready to be analyzed using Snowflake.

You can also unload data into the stage. The following SQL command creates a `most_purchases` table and populates it with the results of a
query that finds the top 10 users with the most purchases, and then unloads the table data into the stage:

```sqlexample
-- find top 10 users by purchases, unload into stage
CREATE TABLE most_purchases as select * FROM users ORDER BY purchases desc LIMIT 10;
COPY INTO @encrypted_customer_stage/most_purchases FROM most_purchases;
```

Snowflake encrypts the data files copied into the customer’s stage using the master key stored in the stage object. Snowflake adheres to
the client-side encryption protocol for the cloud storage service. A customer can download the encrypted data files using any client or
tool that supports client-side encryption.

**Next Topics:**

* [Understanding Encryption Key Management in Snowflake](security-encryption-manage.md)

---
title: Understanding External Tokenization
source: https://docs.snowflake.com/en/user-guide/security-column-ext-token-intro.md
section: User Guide
---

# Understanding External Tokenization

This topic provides a general overview of the External Tokenization feature.

Note that an external tokenization masking policy can be assigned to a tag to provide tag-based external tokenization. For details about
assigning a masking policy to a tag, see [Tag-based masking policies](tag-based-masking-policies.md).

> **Important:**
>
> External tokenization requires [Writing external functions](../sql-reference/external-functions.md), which are included in the Snowflake [Standard Edition](intro-editions.md), and you can use external functions with a tokenization provider.
>
> However, if you choose to integrate your tokenization provider with Snowflake External Tokenization, you must upgrade to
> [Enterprise Edition](intro-editions.md) or higher.
>
> To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## What is External Tokenization?

External Tokenization enables accounts to tokenize data before loading it into Snowflake and detokenize the data at query runtime. Tokenization is the process of removing sensitive data by replacing it with an undecipherable token. External Tokenization makes use of masking policies with [external functions](../sql-reference/external-functions.md).

In Snowflake, masking policies are schema-level objects, which means a database and schema must exist in Snowflake before a masking policy can be applied to a column. Currently, Snowflake supports using Dynamic Data Masking on tables and views.

At query runtime, the masking policy is applied to the column at every location where the column appears. Depending on the masking policy conditions, the SQL execution context, and role hierarchy, Snowflake query operators may see the plain-text value, a partially masked value, or a fully masked value.

For more details about how masking policies work, including the query runtime behavior, creating a policy, usage with tables and views, and management approaches using masking policies, see: [Understanding Column-level Security](security-column-intro.md).

For more details on the effects of the SQL execution context and role hierarchy, see [Advanced Column-level Security topics](security-column-advanced.md).

Tokenizing data before loading into Snowflake ensures that sensitive data is never exposed unnecessarily. Using masking policies with external functions ensures that only the appropriate audiences can view de-tokenized data at query runtime.

## External Tokenization benefits

The following summarizes some of the key benefits of External Tokenization.

Pre-load Tokenized Data:
:   Using a tokenization provider, tokenized data is pre-loaded into Snowflake. Therefore, even without applying a masking policy to a column in a table or view, users never see the real data value. This provides enhanced data security to the most sensitive data in your organization.

Ease of use:
:   You can write a policy once and have it apply to thousands of columns across databases and schemas.

Data administration and SoD:
:   A security or privacy officer decides which columns to protect, not the object owner. Masking policies are easy to manage and support centralized and decentralized administration models.

Data authorization and governance:
:   Contextual data access by role or custom entitlements.

    Supports data governance as implemented by security or privacy officers and can prohibit privileged users with the ACCOUNTADMIN or SECURITYADMIN role from unnecessarily viewing data.

Change management:
:   Easily change masking policy content without having to reapply the masking policy to thousands of columns.

For a comparison of benefits between Dynamic Data Masking and External Tokenization, see: [Column-level Security Benefits](security-column-intro.md).

## External Tokenization limitations

For an overview on the limitations, see [Column-level Security Limitations](security-column-intro.md).

## External Tokenization considerations

For additional External Tokenization Considerations, see [Column-level Security Considerations](security-column-intro.md).

## External Tokenization privileges and dependencies

The following table summarizes the privileges related to External Tokenization masking policies.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the unset and set operations for a [masking policy](security-column-intro.md) on a column.  Note that granting the global APPLY MASKING POLICY privilege (i.e. APPLY MASKING POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see [Masking policy privileges](security-column-intro.md). |
| OWNERSHIP | Grants full control over the masking policy. Required to alter most properties of a masking policy. Only a single role can hold this privilege on a specific object at a time. |

> **Note:**
>
> Operating on a masking policy also requires the USAGE privilege on the parent database and schema.

Since the external tokenization masking policy requires an external function that depends on an API integration, the following table summarizes the privileges the custom role (e.g. MASKING_ADMIN) must have on Snowflake objects. Note that these privileges apply to the custom role only and are not necessary for the role of the user querying the column with a masking policy.

| Custom role | Privilege | Object |
| --- | --- | --- |
| External tokenization policy owner | USAGE | External function |
| External function owner (i.e. the role with the OWNERSHIP privilege on the external function) | USAGE | Any API integration objects that are referenced by the external function. |

## External Tokenization DDL

Snowflake provides the following set of commands to manage External Tokenization policies.

* [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md)
* [ALTER MASKING POLICY](../sql-reference/sql/alter-masking-policy.md) (see also: [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md), and [ALTER VIEW](../sql-reference/sql/alter-view.md))
* [DROP MASKING POLICY](../sql-reference/sql/drop-masking-policy.md)
* [SHOW MASKING POLICIES](../sql-reference/sql/show-masking-policies.md)
* [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md)

## Auditing External Tokenization

Snowflake provides two Account Usage views to obtain information about masking policies:

* The [MASKING POLICIES](../sql-reference/account-usage/masking_policies.md) view provides a list of all masking policies in your
  Snowflake account.
* The [POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) view provides a list of all objects in which a masking
  policy is set.

The Information Schema table function [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) can be used to either:

* Return a list of all objects (i.e. tables, views) that have the masking policy set on a column.
* Return a list of policy associations that have the specified object name and object type.

Snowflake records the original query run by the user on the [History page](ui-snowsight-activity.md) (in the web interface). The query
is found in the SQL Text column.

The masking policy names that were used in a specific query can be found in the [Query Profile](ui-snowsight-activity.md).

The query history is specific to the Account Usage [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view only. In this
view, the Query Text column contains the text of the SQL statement. Masking policy names are not included in the QUERY_HISTORY
view.

## Troubleshooting External Tokenization

You can use error messages to help troubleshoot masking policy issues.

### Error Messages

The following table describes error messages Snowflake can return while using masking policies.

| Behavior | Error Message | Troubleshooting Action |
| --- | --- | --- |
| Cannot apply a masking policy to a Snowflake feature. | Unsupported feature `CREATE ON MASKING POLICY COLUMN`. | Masking policies are currently not applicable to this feature. |
| An active role cannot create or replace a masking policy. | SQL access control error: Insufficient privileges to operate on account <account_name> | Grant the CREATE MASKING POLICY privilege to the specified role using `grant create masking policy on account to role <role_name>;` . Verify the role has the privilege using `show grants to role <role_name>`, and try the CREATE OR REPLACE masking statement again. |
| A given role cannot attach a masking policy to a table. | SQL compilation error: Database <database_name> does not exist or not authorized. | Grant the APPLY MASKING POLICY privilege to the role using `grant apply masking policy on account to role <role_name>;` |
| A given role that does not own a masking policy on a table tries to apply a masking policy on a table they can use. | SQL compilation error: Masking policy <policy_name> does not exist or not authorized. | Grant the given role usage on the masking policy using `grant apply on masking policy <policy_name> to role <role_name>;` |
| Cannot drop or remove a policy using `drop masking policy <policy_name>;` | SQL compilation error: Policy <policy_name> cannot be dropped/replaced as it is associated with one or more entities. | Use an ALTER TABLE … MODIFY COLUMN or ALTER VIEW … MODIFY COLUMN statement to UNSET the policy first, then try the DROP statement again. |
| Restoring a dropped table produces a masking policy error. | SQL execution error: Column <column_name> already attached to a masking policy that does not exist. Please contact the policy administrator. | Unset the currently attached masking policy with an ALTER Table/View MODIFY COLUMN statement and then reapply the masking policy to the column with a CREATE OR REPLACE statement. |
| Cannot apply a masking policy to a specific column, but the masking policy can be applied to a different column. | Specified column already attached to another masking policy.A column cannot be attached to multiple masking policies.please drop the current association in order to attach a new masking policy. | Decide which masking policy should apply to the column, update, and try again. |
| Updating a policy with an ALTER statement fails. | SQL compilation error: Masking policy <policy_name> does not exist or not authorized. | Verify the policy name in the ALTER command matches an existing policy by executing `show masking policies;` |
| The role that owns the cloned table cannot unset a masking policy. | SQL access control error: Insufficient privileges to operate on ALTER TABLE UNSET MASKING POLICY ‘<policy_name>’ | Grant the APPLY privilege to the role that owns the cloned table using `grant apply on masking policy <policy_name> to role <role_name>;` . Verify that the role that owns the cloned table has the grant using `show grants to role <role_name>;` and try the ALTER statement again. |
| Updating a policy using IF EXISTS returns a successful result but does not update the policy. | No error message returned; Snowflake returns Statement executed successfully. | Remove IF EXISTS from the ALTER statement and try again. |
| While creating or replacing a masking policy with CASE, the data types do not match (e.g. (VAL string) -> returns number). | SQL compilation error: Masking policy function argument and return type mismatch. | Update the masking policy using CASE with matching data types using a CREATE OR REPLACE statement or an ALTER MASKING POLICY statement. |
| Applying a masking policy to a virtual column. | SQL compilation error: Masking policy cannot be attached to a VIRTUAL_COLUMN column. | Apply the masking policy to the column(s) in the source table. |
| Applying a masking policy to a materialized view. | SQL compilation error: syntax error line <number> at position <number> unexpected ‘modify’. . SQL compilation error: error line <number> at position <number> invalid identifier ‘<character>’ . SQL execution error: One or more materialized views exist on the table. number of mvs=<number>, table name=<table_name>. | Apply the masking policy to the column(s) in the source table. For more information, see [Limitations](security-column-intro.md). |
| Applying a masking policy to a table column used to create a materialized view. | SQL compilation error: Masking policy cannot be attached to a MATERIALIZED_VIEW column. | To apply the masking policy to the table column, drop the materialized view. |
| Including a masked column while creating a materialized view. | Unsupported feature ‘CREATE ON MASKING POLICY COLUMN’. | Create the materialized view without including the masked columns or do not set any masking policies on the base table or views, create the materialized view, and then apply the masking policies to the materialized view columns. |
| Cannot create a masking policy with a user-defined function (UDF) in the masking policy body. | SQL access control error: Insufficient privileges to operate on function ‘<udf_name>’ | Verify the role creating the masking policy has the USAGE privilege on the UDF. |

**Next Topics:**

* [Using External Tokenization](security-column-ext-token-use.md)
* [Using Conditional Tokenization](security-column-intro.md)
  (For an external tokenization policy example with conditional columns, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).)

---
title: Understanding How Snowflake Can Eliminate Redundant Joins
source: https://docs.snowflake.com/en/user-guide/join-elimination.md
section: User Guide
---

# Understanding How Snowflake Can Eliminate Redundant Joins

In some cases, a join on a key column can refer to tables that are not needed for the join. If your tables have key columns and
you are using and enforcing the UNIQUE, PRIMARY KEY, and FOREIGN KEY constraints, Snowflake can improve query performance by
eliminating unnecessary joins on key columns.

These optimizations are performed only if you use the RELY constraint property to indicate that the data in your tables complies
with the constraints around primary keys and foreign keys.

## Setting the RELY Constraint Property to Eliminate Unnecessary Joins

Snowflake only performs this optimization on joins if you indicate that the data in your tables comply with the UNIQUE, PRIMARY
KEY, and FOREIGN KEY constraints.

As mentioned in [Supported constraint types](../sql-reference/constraints-overview.md), Snowflake does not enforce UNIQUE, PRIMARY KEY, and FOREIGN KEY
constraints on standard tables, but does enforce them on [hybrid tables](tables-hybrid.md). For standard tables, you are
responsible for enforcing constraints on the data.

If you have ensured that the data complies with these constraints and you want Snowflake to eliminate unnecessary joins, set the
RELY constraint property on the UNIQUE, PRIMARY KEY, FOREIGN KEY constraints.

> **Note:**
>
> You are responsible for maintaining the integrity of your constraints (UNIQUE, PRIMARY KEY, and FOREIGN KEY). If the integrity
> of your constraints is not maintained, the query results might differ if the RELY constraint property is set (compared to the
> results with NORELY).

## Examples of Eliminating Unnecessary Joins

The following examples demonstrates cases in which Snowflake eliminates joins and references to tables that are not necessary:

* Example 1: Eliminating an Unnecessary Left Outer Join
* Example 2: Eliminating an Unnecessary Self-Join
* Example 3: Eliminating an Unnecessary Join on a Primary Key and Foreign Key

In these examples:

* `dim_products` is a table that contains a row for each product available for purchase.

  In this table, `product_id` is a column that uniquely identifies a product.
* `fact_sales` is a table that contains a row for each sale of a product.

  In this table, `product_id` is a column that identifies the product that was sold. The IDs in this column correspond to the
  IDs in the `product_id` column of the `dim_products` table.

### Example 1: Eliminating an Unnecessary Left Outer Join

This following is an example of an unnecessary left outer join that Snowflake can optimize:

```sqlexample
SELECT f.*
FROM fact_sales f
LEFT OUTER JOIN dim_products p
ON f.product_id = p.product_id;
```

The join is unnecessary because the statement does not refer to any columns in the `dim_products` table on the right (other than
the primary key column for the join).

If the `dim_products.product_id` column has the UNIQUE or PRIMARY KEY constraint with the RELY property, Snowflake can identify
this join as unnecessary and can eliminate the reference to the `dim_products` table on the right.

### Example 2: Eliminating an Unnecessary Self-Join

This following is an example of an unnecessary self-join that Snowflake can optimize:

```sqlexample
SELECT p1.product_id, p2.product_name
FROM dim_products p1, dim_products p2
WHERE p1.product_id = p2.product_id;
```

The statement unnecessarily joins the `dim_products` table with itself and selects columns from that table.

If the `dim_products.product_id` column has the UNIQUE or PRIMARY KEY constraint with the RELY property, Snowflake can identify
this join as unnecessary and can eliminate the reference to the `dim_products` table on the right.

### Example 3: Eliminating an Unnecessary Join on a Primary Key and Foreign Key

This following is an example of an unnecessary inner join that Snowflake can optimize.

```sqlexample
SELECT p.product_id, f.units_sold
FROM   fact_sales f, dim_products p
WHERE  f.product_id = p.product_id;
```

The statement does not refer to any columns in the `dim_products` table on the right, other than the primary key column for the
join.

If the `dim_products.product_id` column has the PRIMARY KEY constraint and the `fact_sales.product_id` column has the FOREIGN
KEY constraint, Snowflake can identify this join as unnecessary and can eliminate the reference to the `dim_products` table on
the right.

---
title: Understanding immutability constraints
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-immutability-constraints.md
section: User Guide
---

# Understanding immutability constraints

Immutability constraints let you mark portions of a dynamic table as static. When you define
an immutability constraint, Snowflake skips those rows during refresh, which improves
performance, especially for tables that contain large amounts of historical data.

You define immutability constraints with the `IMMUTABLE WHERE` clause when you create or
alter a dynamic table. The clause specifies a condition, or predicate, that identifies which rows are
immutable.

Key behaviors:

* **Initial refresh**: Snowflake ignores the IMMUTABLE WHERE predicate during the initial refresh
  but applies to all subsequent refreshes.
* **Full refresh mode**: The predicate limits recomputation to only the rows that don’t match
  the condition.
* **Incremental refresh**: Streams and incremental refresh dynamic tables can read from full
  refresh dynamic tables that have immutability constraints.
* **Cloning and replication**: Snowflake copies IMMUTABLE WHERE constraints with no limitations.

For information about compute costs, see [Compute cost for immutability constraints](dynamic-tables-cost.md).

## When to use immutability constraints

Immutability constraints are useful in the following scenarios:

**Avoid reprocessing historical data**
:   When your dynamic table contains historical data that you don’t want to reprocess, mark older
    rows as immutable.

**Optimize full refresh mode**
:   Dynamic tables that use full [refresh mode](dynamic-tables-refresh.md)
    normally recompute all rows during each refresh. Immutability constraints limit recomputation
    to only the mutable rows, significantly reducing work when most data is historical.

**Facilitate incremental downstream refreshes**
:   Some query constructs, such as Python user-defined table functions, require a dynamic table to
    use full refresh mode. Normally, this prevents downstream tables from benefiting from
    incremental refresh. When the upstream table has immutability constraints, downstream
    tables can still benefit from incremental processing.

## Use backfill with immutability

Backfill extends immutability constraints. Backfill is a zero-copy operation that lets you instantly
copy existing data into a dynamic table
without recomputing it. Use it to migrate existing pipelines, change dynamic table definitions,
or avoid expensive initialization when creating tables
with years of historical data.

Backfilled data can’t change during future refreshes.

When you create a dynamic table with both `IMMUTABLE WHERE` and `BACKFILL FROM`:

* Backfill copies the **immutable region** from the source table. The immutable region consists of rows that match the
  `IMMUTABLE WHERE` condition.
* The query definition computes the **mutable region**. The mutable region consists of rows that don’t match the condition.

## Interaction with primary key and unique constraints (RELY)

Dynamic tables can have [primary key and unique constraints](../sql-reference/constraints-overview.md) with the
[RELY property](join-elimination.md). When both of the following are true on a dynamic table:

* An `IMMUTABLE WHERE` predicate is set, and
* At least one primary key or unique constraint has the RELY property set,

then the columns referenced in the `IMMUTABLE WHERE` predicate must be a subset of the columns referenced in the
set of all RELY primary key and RELY unique constraints on that table.
Only constraints with the RELY property are included in set of allowed columns, if any are present.
Consider the following examples:

* If the table has a RELY primary key on column `A` and a NORELY unique constraint on column `B`, the
  `IMMUTABLE WHERE` predicate may only reference column `A` (or a subset of the RELY constraint columns).
* If the table has a RELY primary key on column `A`, a RELY unique constraint on column `B`, and a NORELY unique constraint on column `C`, the
  `IMMUTABLE WHERE` predicate may only reference `A` and `B` or any subset of these columns.

Validity is checked when a RELY constraint or the `IMMUTABLE WHERE` predicate is added or changed.
If the resulting state would violate the rule (e.g., the predicate references a column not in any RELY constraint),
the statement fails with an error.

## Next steps

For implementation guidance and examples, see [Backfill examples](dynamic-tables-performance-optimize-immutability.md) and [Immutability constraints](dynamic-tables-limitations.md).

---
title: Understanding overall cost
source: https://docs.snowflake.com/en/user-guide/cost-understanding-overall.md
section: User Guide
---

# Understanding overall cost

> **Note:**
>
> This topic describes foundational costs associated with using Snowflake (compute costs, storage costs, and data transfer costs).
> Specific Snowflake features (for example, Snowflake Cortex and Snowpark Container Services) incur costs in unique ways, and are not
> discussed in this topic.

## How are costs incurred?

The total cost of using Snowflake is the aggregate of the cost of using data transfer, storage, and compute resources. Snowflake’s
innovative [cloud architecture](intro-key-concepts.md) separates the cost of accomplishing any task into one of these
usage types.

Compute Resources
:   Using compute resources within Snowflake consumes Snowflake credits. The billed cost of using compute resources is
    calculated by multiplying the number of consumed credits by the price of a credit. For the current price of a credit, see the
    [Snowflake Pricing Guide](https://www.snowflake.com/pricing/pricing-guide/).

    There are three types of compute resources that consume credits within Snowflake:

    * **Virtual Warehouse Compute**: [Virtual warehouses](warehouses.md) are user-managed compute resources that consume
      credits when loading data, executing queries, and performing other DML operations. Because Snowflake utilizes per-second billing (with a
      60-second minimum each time the warehouse starts), warehouses are billed only for the credits they actually consume when they are
      actively working.
    * **Serverless Compute**: There are Snowflake features such as Search Optimization and Snowpipe that use Snowflake-managed compute
      resources rather than virtual warehouses. To minimize cost, these serverless compute resources are automatically resized and scaled
      up or down by Snowflake as required for each workload.
    * **Cloud Services Compute**: The cloud services layer of the Snowflake architecture consumes credits as it performs behind-the-scenes
      tasks such as authentication, metadata management, and access control. Usage of the cloud services layer is charged only if the daily
      consumption of cloud services resources exceeds 10% of the daily warehouse usage.

    For more details about compute costs, see [Understanding compute cost](cost-understanding-compute.md).

Storage Resources
:   The monthly cost for storing data in Snowflake is based on a flat rate per terabyte (TB). For the current rate, which
    varies depending on your type of account (Capacity or On Demand) and region (US or EU), see the
    [Snowflake Pricing Guide](https://www.snowflake.com/pricing/pricing-guide/).

    Storage is calculated monthly based on the average number of on-disk bytes stored each day in your Snowflake account.

    For more details about storage costs, see [Understanding storage cost](cost-understanding-data-storage.md).

Data Transfer Resources
:   Snowflake does not charge data ingress fees to bring data into your account, but does charge for data egress.

    Snowflake charges a per-terabyte fee when you transfer data from a Snowflake account into a different region on the same cloud platform or into a completely different cloud platform. This fee for data egress depends on the region where your Snowflake account is hosted. For details,
    see the [Snowflake Pricing Guide](https://www.snowflake.com/pricing/pricing-guide/).

    For more details about data transfer costs, see [Understanding data transfer cost](cost-understanding-data-transfer.md).

## Total cost example

The following example provides insight into the total cost in Snowflake to load and query data.

Suppose an organization loads data constantly, 24x7. It has two different groups of users (Finance and Sales) using the database in
overlapping, but different times of the day. It also runs a weekly batch report. This organization:

* Uses the Standard Edition of Snowflake.
* Stores an average of 65 TBs of compressed data (compare with 325 TB without compression).
* Loads data 24x7x365. They use a Small Standard virtual warehouse for this purpose.
* Enables seven finance users to work 5 days a week from 8am until 5pm using a Large Standard virtual warehouse.
* Enables twelve sales users in different geographies to work a total of 16 hours a day (across Europe and the Americas), 5 days a
  week using a Medium Standard virtual warehouse.
* Runs a complex weekly report every Friday. This report takes approximately 2 hours to run on a 2X-Large standard warehouse.

**Data Loading Requirements**

| Parameter | Customer Requirement | Configuration | Cost |
| --- | --- | --- | --- |
| Loading Window | 24 x 7 x 365 | Small Standard Virtual Warehouse (2 credits/hr) | 1,488 credits (2 credits/hr x 24 hours per day x 31 days per month) |

**Storage Requirements**

|  |  |
| --- | --- |
| Data set size (per month) | 65 TB (after compression) |

**Compute Requirements**

| Parameter | Customer Requirement | Configuration | Cost |
| --- | --- | --- | --- |
| Finance Users | 5 Users, 8am-5pm (9 hours) | Large Standard Virtual Warehouse (8 credits/hr) | 1,440 credits (8 credits/hr x 9 hours per day x 20 days per month) |
| Sales Users | 12 Users, 16 hour time slot | Medium Standard Virtual Warehouse (4 credits/hr) | 1,280 (4 credits/hr x 16 hours per day x 20 days per month) |
| Complex Query Users | 1 User, 2 hours/day | 2X Standard Virtual Warehouse (32 credits/hr) | 256 (32 credits/hr x 2 hours per day x 4 days per month) |

**Total Cost**

| Usage Type | Monthly Cost | Total Billed Cost |
| --- | --- | --- |
| Compute Cost | 4,464 credits (@ $2/credit) | $8928 |
| Storage Cost | 65 TB (@ $23/TB) | $1495 |
|  |  | $10,423 |

**Next Topics**

* [Understanding compute cost](cost-understanding-compute.md)
* [Understanding storage cost](cost-understanding-data-storage.md)
* [Understanding data transfer cost](cost-understanding-data-transfer.md)

---
title: Understanding primary keys in dynamic tables
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-primary-keys.md
section: User Guide
---

# Understanding primary keys in dynamic tables

Snowflake can use primary keys in dynamic tables and dynamic iceberg tables to track row-level
changes more efficiently and to enable incremental refresh downstream of full refresh dynamic
tables. Instead of relying on change-tracking columns, Snowflake uses primary keys as stable
row identifiers to compute the minimal set of changes between refreshes.

This is especially useful in the following scenarios:

* Base tables are periodically rewritten through INSERT OVERWRITE rather than updated in place,
  which normally prevents Snowflake from detecting what changed between versions.
* The pipeline reads from an externally-managed Apache Iceberg™ v2 table, which preceed row lineage.
* Some dynamic tables must use full refresh mode because of unsupported incremental constructs,
  but downstream tables would benefit from incremental processing.

## Types of primary keys in dynamic tables

Snowflake supports two types of primary key use cases for dynamic tables: 1) row-level change tracking and 2)
derivation of a unique key for the dynamic table itself.

### Primary key row-level lineage-based change tracking

When a base table has a primary key constraint with the `RELY` property set, Snowflake
uses that key for row-level change tracking in downstream dynamic tables. This is particularly
useful when the base table is periodically rewritten through INSERT OVERWRITE, which normally
prevents change tracking across table versions.

With a reliable primary key, Snowflake identifies which rows changed between refreshes by
comparing primary key values instead of relying on internal change-tracking columns. This
enables incremental processing even when the underlying data is fully replaced.

To set the RELY property on a base table primary key:

```sql
ALTER TABLE my_base_table ALTER CONSTRAINT my_pk_constraint RELY;
```

### Unique key derivation

Snowflake can automatically derive a reliable unique key from the query definition of a dynamic table.
For example, the following SQL constructs produce derived unique keys:

* **GROUP BY**: The grouping columns form a unique key because each group produces exactly
  one output row.
* **QUALIFY ROW_NUMBER() = 1**: The partition-by columns form a unique key because the filter
  keeps exactly one row per partition.
* **Reliable base table primary keys**: The primary key of the base table is used as the unique key of the dynamic table.

Snowflake registers derived primary keys as unique constraints on the dynamic table. Because
these constraints come from the query structure, they’re fully reliable without additional
validation.

To check whether a dynamic table has a derived primary key, run:

```sql
SHOW UNIQUE KEYS IN my_dynamic_table;
```

## When to use primary keys

Primary keys are useful in the following scenarios:

**Improve change tracking for INSERT OVERWRITE workloads**
:   When a base table is periodically rewritten through INSERT OVERWRITE, Snowflake can’t use
    standard change-tracking columns to detect what changed. A primary key lets Snowflake compare
    rows by key value and process only the actual changes, avoiding a full recomputation of the
    dynamic table.

**Enable incremental refresh downstream of full refresh dynamic tables**
:   Normally, a dynamic table in incremental refresh mode can’t be downstream of a dynamic table
    in full refresh mode. When the upstream full refresh dynamic table has a system-derived unique key,
    Snowflake can compute the changes between full refreshes, allowing downstream tables to
    refresh incrementally. This removes a major blocker for incremental pipelines.

**Reduce change propagation in pipelines**
:   Primary keys enable value-based change reduction at each stage of a pipeline. Snowflake can
    filter out rows where the primary key exists in both the old and new versions with identical
    values, reducing the volume of changes that propagate to downstream tables.

## Key behaviors

* **Opt-in for downstream incremental refresh**: To use incremental refresh on a dynamic table
  that reads from a full refresh dynamic table with a derived unique key, you must explicitly set
  `REFRESH_MODE = INCREMENTAL` on the downstream table. Setting `REFRESH_MODE = AUTO`
  continues to resolve to FULL.
* **Verify primary key-based change tracking support**: Use `SHOW UNIQUE KEYS IN <dt_name>` to
  check whether a dynamic table has a derived unique key. Alternatively, create a downstream dynamic
  table with `REFRESH_MODE = INCREMENTAL` and check whether the creation succeeds.
* **Masking policies**: Masking policies that obfuscate primary key columns prevent Snowflake
  from using those keys for change tracking. In this case, Snowflake falls back to standard
  change-tracking columns.

## Next steps

For examples and implementation guidance, see [Use primary keys to optimize dynamic table pipelines](dynamic-tables-performance-optimize-primary-keys.md).

For the full syntax of CREATE DYNAMIC TABLE, see [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md).

---
title: Understanding replication cost
source: https://docs.snowflake.com/en/user-guide/account-replication-cost.md
section: User Guide
---

# Understanding replication cost

Charges based on replication are divided into two categories: data transfer and compute resources. Both categories are billed on the
target account (i.e. the account that stores the secondary database or secondary replication/failover group that is refreshed).

Data transfer:
:   The initial replication and subsequent synchronization operations transfer data between regions. Cloud providers charge for
    data transferred from one region to another within their own network.

    The data transfer rate is determined by the location of the source account (i.e. the account that stores the primary replication
    or failover group). For data transfer pricing, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

    For more information, see [Understanding data transfer cost](cost-understanding-data-transfer.md).

Compute resources:
:   Replication operations use Snowflake-provided compute resources for the following:

    * To determine the delta of both metadata and data to be copied during the refresh operation.
    * To copy the data between accounts across regions.

    The service type for compute costs for replication in the [account usage](../sql-reference/account-usage.md) and
    [organization usage](../sql-reference/organization-usage.md) views is REPLICATION.

    For more information, see [Understanding compute cost](cost-understanding-compute.md).

> **Note:**
>
> * The target account also incurs standard storage costs for the data in each secondary database in the account.
> * The target account also incurs costs for the automatic background processes that service
>   [materialized views](account-replication-considerations.md)
>   and [search optimization](search-optimization/working-with-tables.md). For details, see the “Serverless
>   Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for the costs per compute hour.
> * Replication charges are applied even if the initial replication or a refresh operation doesn’t succeed. Any data that is copied
>   before the initial replication or refresh operation fails can be reused by a subsequent refresh operation (if performed within 14 days)
>   and doesn’t need to be copied again.

## Estimating and controlling costs

In general, monthly billing for replication is proportional to:

* Amount of table data in the primary database, or databases in a replication/failover group, that changes as a result of data loading
  or DML operations.
* Frequency of secondary database, or replication/failover group, refreshes from the primary database or replication/failover group.

You can control the cost of replication by carefully choosing which databases or objects to replicate and their refresh frequency. You
can stop incurring replication costs by ceasing refresh operations.

## Viewing actual costs

Users with the ACCOUNTADMIN role can use SQL to view the amount of data transferred (in bytes) and the credit usage for
replication using replication or failover groups for your Snowflake account within a specified date range.

Users with the ACCOUNTADMIN role can use [Snowsight](ui-snowsight-gs.md) or SQL to view the amount of replication data transferred
(in bytes) for your Snowflake account within a specified date range.

> Snowsight:
> :   In the navigation menu, select Admin » Cost management.

To view the data transfer amounts and credit usage for replication for your account:

> SQL:
> :   Query either of the following:
>
>     * [REPLICATION_GROUP_USAGE_HISTORY](../sql-reference/functions/replication_group_usage_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)). This
>       function returns replication usage activity within the last 14 days.
>     * [REPLICATION_GROUP_USAGE_HISTORY view](../sql-reference/account-usage/replication_group_usage_history.md) (in [Account Usage](../sql-reference/account-usage.md)). This view returns
>       replication usage activity within the last 365 days (1 year).
>
>     For examples, see [Monitor replication costs](account-replication-monitor.md).

To view the cost of replication for individual databases replicated with Database Replication, see
[Monitoring database replication cost](db-replication-config.md).

---
title: Understanding row access policies
source: https://docs.snowflake.com/en/user-guide/security-row-intro.md
section: User Guide
---

# Understanding row access policies

This topic provides an introduction to row access policies and row-level security.

## What is Row-level Security?

Snowflake supports row-level security through the use of row access policies to determine which rows to return in the query result. The row
access policy can be relatively simple to allow one particular role to view rows, or be more complex to include a
[mapping table](https://en.wikipedia.org/wiki/Associative_entity) in the policy definition to determine access to rows in the query
result. If the policy contains a mapping table lookup, create a centralized mapping table and store the mapping table in the
same database as the protected table. This is particularly important if the policy calls the
[IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function. For details, see the function usage notes.

A row access policy is a schema-level object that determines whether a given row in a table or view can be viewed from the following types
of statements:

* [SELECT](../sql-reference/sql/select.md) statements
* Rows selected by [UPDATE](../sql-reference/sql/update.md), [DELETE](../sql-reference/sql/delete.md), and [MERGE](../sql-reference/sql/merge.md) statements.

Row access policies can include conditions and functions in the policy expression to transform the data at query runtime when those
conditions are met. The policy-driven approach supports segregation of duties to allow governance teams to define policies that can limit
sensitive data exposure. This approach also includes the object owner (i.e. the role with the OWNERSHIP privilege on the object, such as a
table or view) who normally has full access to the underlying data. A single policy can be set on different tables and views at the same
time.

Row access policies do not currently prevent rows from being inserted, or prevent visible rows from being updated or deleted.

A row access policy can be added to a table or view either when the object is created or after the object is created. For more information, see, Apply a Row Access Policy to a Table or View (in this topic).

> **Note:**
>
> In some cases, error messages related to row access policies might be redacted. For more information, see
> [Secure objects: Redaction of information in error messages](../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

### How does a row access policy work?

A row access policy contains an expression that can specify Snowflake database objects (e.g. table or view), and use
[Conditional expression functions](../sql-reference/expressions-conditional.md) and [Context functions](../sql-reference/functions-context.md) to determine which rows should be visible in a
given context.

Snowflake evaluates the policy expression by using the role of the [policy owner](../developer-guide/stored-procedure/stored-procedures-rights.md), not the
role of the operator who executed the query. This approach allows Snowflake not to return a row in a query result because the query
operator does not require access to the mapping tables in the row access policy.

> **Tip:**
>
> If you want to update an existing row access policy and need to see the current definition of the policy, call the [GET_DDL](../sql-reference/functions/get_ddl.md) function or run the [DESCRIBE ROW ACCESS POLICY](../sql-reference/sql/desc-row-access-policy.md) command.
>
> The row access policy expression can then be updated with the [ALTER ROW ACCESS POLICY](../sql-reference/sql/alter-row-access-policy.md) command. This command
> does not require dropping a row access policy from a table or view. So, a table or view that is protected by a row access policy remains protected while the policy expression is being updated.

### Row access policies at query runtime

At query runtime, Snowflake goes through the following process:

1. Snowflake determines whether a row access policy is set on a database object. If a policy is added to the database object, all rows are
   protected by the policy.
2. Snowflake creates a dynamic secure view (i.e. a secure inline view) of the database object.
3. The values of the columns specified in the ALTER TABLE or ALTER VIEW command (i.e when adding a row access policy to a table or view)
   are bound to the corresponding parameters in the policy, and the policy expression is evaluated.
4. Snowflake generates the query output for the user, and the query output only contains rows based on the policy definition evaluating
   to `TRUE`.

For more details on the specific execution plan, see Query profile (in this topic).

Snowflake supports nested row access policies, such as a row access policy on a table and a row access policy on a view for the same
table. At query runtime, Snowflake evaluates all row access policies that are relevant to a given query in the following sequence:

* The row access policy that is applicable to the table is always executed first.
* The policy for the view is executed after evaluating the policy for the table.
* If nested views exist (e.g. Table 1 -> View 1 -> View 2 -> … View n), the policies are applied in sequential order from left to right.

This pattern continues for however many row access policies exist with respect to the data in the query. The following diagram illustrates
the relationship between a query operator, tables, views, and policies.

For more information on row access policy privileges, commands, and a step-by-step implementation, see:

* Row access policy privileges
* Row access policy DDL
* [Use row access policies](security-row-using.md)

### Representative use case: Simple row filtering

A simple application of a row access policy is to specify an attribute in the policy and a role that is allowed to see that attribute in
the query result. The advantage of simple policies like this is that there is a negligible performance cost for Snowflake to evaluate
these policies to return query results compared to using row access policies with mapping tables.

As a representative example, it may be necessary for information technology administrators (e.g. `it_admin` custom role) to query an
employee identification number (i.e. `empl_id`) before granting the employee additional privileges to use internal systems. Therefore,
the row access policy should return rows in the query result if the [CURRENT_ROLE](../sql-reference/functions/current_role.md) matches the `it_admin`
custom role and not return rows for all other roles. For example:

```sqlexample
CREATE OR REPLACE ROW ACCESS POLICY rap_it
AS (empl_id varchar) RETURNS BOOLEAN ->
  'it_admin' = current_role()
;
```

This policy is the most concise version of a row access policy because there are no other conditions to evaluate, only the value of the
CURRENT_ROLE.

If role hierarchy needs to be considered, this policy could similarly use [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) to be more
inclusive of other roles to see the employee ID number in the query result.

Alternatively, to consider additional conditions, using the [CASE](../sql-reference/functions/case.md) function allows including WHEN/THEN/ELSE
clauses to support more detailed conditional logic.

### Representative use case: Use a mapping table to filter the query result

A row access policy condition can reference a mapping table to filter the query result set, however using mapping tables may result in
decreased performance compared to the more simple example.

For example, use a mapping table to determine the revenue values a sales manager can see in a specified sales region. The mapping table
should specify the sales manager and the sales region (e.g. WW: Worldwide, NA: North America, EU: European Union).

> | Sales Manager | Region |
> | --- | --- |
> | Alice | WW |
> | Bob | NA |
> | Simon | EU |

Next, define a policy with one or more conditions to query the mapping table with a subquery. At query runtime, Snowflake determines
whether the user executing the query matches the sales region specified in the mapping table.

If a match occurs, the user can see those rows in the query result. Based on the mapping table, the expected query results are as follows:

> | Company | Region | Revenue | Who can view |
> | --- | --- | --- | --- |
> | Acme | EU | 2.5B | Alice, Simon |
> | Acme | NA | 1.5B | Alice, Bob |

For details on implementing a row access policy with a mapping table, see:

* External Tables (in this topic)
* [Use row access policies](security-row-using.md)

### Policy performance guidelines

Row Access Policies are designed to perform well in a wide variety of real-world scenarios. Use the following tips to secure data and enhance performance:

Limit the policy arguments:
:   Snowflake needs to scan columns that the policy is bound to, even if they are not referenced in queries. Therefore, policies with fewer
    arguments will generally perform better than policies with many arguments.

Simplify the SQL expression:
:   Policies with simple SQL expressions, such as CASE statements, generally perform better than policies that access mapping (i.e. lookup)
    tables. Minimizing the number of table lookups improves performance.

    When specifying a mapping table, replace the mapping table reference with a memoizable function. For details, refer to:

    * [Memoizable function](../developer-guide/udf/sql/udf-sql-scalar-functions.md) (in the scalar SQL UDF overview).
    * [Using a memoizable function in a policy](security-row-using.md)
      (in the Using Row Access Policies topic).

Test with realistic workloads:
:   Without a row access policy, the query `SELECT COUNT(*) FROM t1` executes in milliseconds since Snowflake already knows the number of
    rows in the table. However, adding a row access policy means Snowflake must scan the table to count the number of rows that are
    accessible in the current context. Although the performance difference is large, this query is not representative of most real-world
    workloads.

    For more information on this example, see the Considerations section (in this topic).

Cluster by attributes:
:   For very large tables, clustering by attributes used for policy filtering can improve performance.

    For more information, see [Clustering Keys & Clustered Tables](tables-clustering-keys.md).

Search optimization service:
:   The search optimization service can improve the query performance on a table that uses a masking or row access policy.

    For details, see
    [Support for Tables With Masking Policies and Row Access Policies in the Search Optimization Service](search-optimization/working-with-tables.md).

### Benefits

The primary benefit of a row access policy is that the policy enables an organization to properly balance data security, governance, and
analytics through an extensible policy. The extensible nature of the row access policy allows one or more conditions to be added or
removed at any time to ensure the policy is consistent with updates to data, mapping tables, and the RBAC hierarchy.

Additional benefits include:

Ease of Use:
:   Write a policy once and apply it to tables across databases and schemas.

Change Management:
:   Easily change row access policy definitions without having to reapply the policy to tables.

    If using a mapping table, update the entitlement information in the mapping table referenced by the policy without having to change the
    policy.

Data Administration and SoD:
:   A central data administrator decides which objects to protect, not the object owner. Row access policies are easy to manage and support
    through centralized, decentralized, and hybrid administration models to support segregation of duties (i.e. SoD).

Data Authorization and Governance:
:   The row access policy supports contextual data access by role or custom entitlements.

### Limitations

* Using the [CHANGES](../sql-reference/constructs/changes.md) clause on a view protected by a row access policy is not supported.
* Snowflake does not support using external tables as a mapping table in a row access policy. For more information, see
  External Tables (in this topic).
* Snowflake does not support attaching a row access policy to the stream object itself, but does apply the row access policy to the table
  when the stream accesses a table protected by a row access policy. For more information, see Streams (in this topic).
* [Future grants](../sql-reference/sql/grant-privilege.md) of privileges on row access policies are not supported.

  As a workaround, grant the APPLY ROW ACCESS POLICY privilege to a custom role to allow that role to apply row access policies on a table
  or view.

### Considerations

* Attaching row access policies to tables that are protected by other row access policies or masking policies may cause errors. For more
  information, see [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md), and
  [ALTER VIEW](../sql-reference/sql/alter-view.md).
* Including one or more [subqueries](querying-subqueries.md) in the policy body may cause errors. When possible, limit the
  number of subqueries, limit the number of JOIN operations, and simplify WHERE clause conditions.
* Snowflake maintains statistics about table and view columns that make it possible to answer many simple queries in milliseconds.
  Examples of such queries include using the [COUNT](../sql-reference/functions/count.md) function, `select count(*) from my_table`, and the
  [MAX](../sql-reference/functions/max.md) function, `select max(c) from my_table`.

  Generally, these statistics and optimizations are not applicable with a row access policy since Snowflake must identify
  the subset of rows the query is permitted to access. Executing queries of this type on tables and views with a row access
  policy may take longer than expected to obtain the query results since these statistics and optimizations are not used, and the
  returned statistics are only based on what is permissible to access, not the “true” statistical values (i.e. statistics on the
  table or view without a row access policy).
* Use caution when creating the setup script for a Snowflake Native App when the row access policy exists in a versioned schema. For details, see
  [version schema considerations](../developer-guide/native-apps/creating-setup-script.md).
* If you specify the [CURRENT_DATABASE](../sql-reference/functions/current_database.md) or [CURRENT_SCHEMA](../sql-reference/functions/current_schema.md) function in the
  body of a masking or row access policy, the function returns the database or schema that contains the protected table, not the database or
  schema in use for the session.

## Use row access policies with Snowflake objects and features

The following sections describe how row access policies affect tables and views along with other Snowflake features.

### Obtain database objects with a row access policy

The Information Schema [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) table function can return information about the row access policy
assigned to a given object.

* All objects for a given policy:

  Specify the name of the row access policy (e.g. `mydb.policies.rap1`):

  > ```sqlexample
  > SELECT *
  > FROM TABLE(
  >   mydb.INFORMATION_SCHEMA.POLICY_REFERENCES(
  >     POLICY_NAME=>'mydb.policies.rap1'
  >   )
  > );
  > ```
* The policy assigned to a specific object:

  Specify the name of the object (e.g. `mydb.tables.t1`) and the object domain (e.g. `table`):

  > ```sqlexample
  > SELECT *
  > FROM TABLE(
  >   mydb.INFORMATION_SCHEMA.POLICY_REFERENCES(
  >     REF_ENTITY_NAME => 'mydb.tables.t1',
  >     REF_ENTITY_DOMAIN => 'table'
  >   )
  > );
  > ```

Note that this table function is complementary to the Account Usage
[POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md) view.

### Active role hierarchy & mapping tables

The policy conditions can evaluate the user’s active primary and secondary roles in a session directly, look up active roles in a mapping
table, or do both depending on how the policy administrator wants to write the policy. If the policy contains a mapping table lookup,
create a centralized mapping table and store the mapping table in the same database as the protected table. This is particularly important
if the policy calls the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function. For details, see the function
[usage notes](../sql-reference/functions/is_database_role_in_session.md).

For these use cases, Snowflake recommends writing the policy conditions to call the [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) or
the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function depending on whether you want to specify an account role or
database role. For examples, see:

* [Examples](../sql-reference/functions/is_role_in_session.md) section in the IS_ROLE_IN_SESSION function.
* IS_DATABASE_ROLE_IN_SESSION
* [Share data protected by a policy](data-sharing-policy-protected-data.md)

### Apply a row access policy to a table or view

There are two options to add a row access policy to a table or view:

1. With a new table or view, apply the policy to a table with a [CREATE TABLE](../sql-reference/sql/create-table.md) statement or a view with a
   [CREATE VIEW](../sql-reference/sql/create-view.md) statement.
2. With an existing table or view, apply the policy to a table with an [ALTER TABLE](../sql-reference/sql/alter-table.md) statement or a view
   with an [ALTER VIEW](../sql-reference/sql/alter-view.md) statement.

For a new table or view, execute the following statements:

> ```sqlexample
> -- table
> CREATE TABLE sales (
>   customer   varchar,
>   product    varchar,
>   spend      decimal(20, 2),
>   sale_date  date,
>   region     varchar
> )
> WITH ROW ACCESS POLICY sales_policy ON (region);
>
> -- view
> CREATE VIEW sales_v WITH ROW ACCESS POLICY sales_policy ON (region)
> AS SELECT * FROM sales;
> ```

For an existing table or view, execute the following statements:

> ```sqlexample
> -- table
>
> ALTER TABLE t1 ADD ROW ACCESS POLICY rap_t1 ON (empl_id);
>
> -- view
>
> ALTER VIEW v1 ADD ROW ACCESS POLICY rap_v1 ON (empl_id);
> ```

### Masking policies

When a database object has both a row access policy and one or more [masking policies](security-column-intro.md),
Snowflake evaluates the row access policy first.

A given table or view column can be specified in either a row access policy signature or a masking policy signature. In other words, the
same column cannot be specified in both a row access policy signature and a masking policy signature at the same time.

For more information, see [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md) and [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md).

### Simulate how a policy will work

Call the [POLICY_CONTEXT](../sql-reference/functions/policy_context.md) function to simulate a query on a column that is protected by a masking policy,
a table or view protected by a row access policy, or both types of policies.

### External tables

You can create an external table with a row access policy by executing a [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) statement and
apply the policy to the VALUE column.

You can apply the row access policy to VALUE column of an existing external table by executing an [ALTER TABLE](../sql-reference/sql/alter-table.md)
statement on the external table.

A row access policy cannot be added to a virtual column directly. Instead, create a view on the external table and apply the row access
policy to the columns on the view.

> **Important:**
>
> Snowflake does not support using an external table as a mapping table in a row access policy. While cloning a database, Snowflake clones
> the row access policy, but not the external table. Therefore, the policy in the cloned database refers to a table that is not present in
> the cloned database.
>
> If the data in the external table is necessary for the row access policy, consider moving the external table data to a dedicated schema
> within the database in which the row access policy exists prior to completing a clone operation. Update the row access policy to
> reference the fully qualified table name to ensure the policy refers to a table in the cloned database.

### Streams

If a row access policy is added to a table, Snowflake applies the row access policy to the table data when the stream accesses the table
data.

For masking policies, streams use the latest table version available at the query time for any tables referenced in the policy.

For more information, see Limitations.

### Views

Snowflake supports setting row access policies on the base table and view. The base table or view policy can apply to the view owner (i.e.
[INVOKER_ROLE](../sql-reference/functions/invoker_role.md)) or the query operator role (i.e. [CURRENT_ROLE](../sql-reference/functions/current_role.md)).

For more information, see Limitations.

### Materialized views

Snowflake supports adding a row access policy to a materialized view provided that a row access policy is not set on the underlying
table or view.

Row access policies and materialized views do have the following limitations:

* A materialized view cannot be created from a table if a row access policy is added to the underlying table.
* A row access policy cannot be added to a table if a materialized view has been created from that underlying table.

> **Tip:**
>
> If you prefer to set a row access policy on the base table, consider creating a dynamic table from the base table. For more
> information, see [Masking and row access policies](dynamic-tables-limitations.md).

### Dynamic tables

You can create a dynamic table with a row access policy, masking policy, and tag. For more information, see:

* [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md)
* [Masking and row access policies](dynamic-tables-limitations.md)

### CREATE TABLE statements

The following summarizes how row access policies affect [CREATE TABLE](../sql-reference/sql/create-table.md) statements:

CREATE TABLE … CLONE:
:   The following approach helps to safeguard data from users with the SELECT privilege on the table or view when accessing a cloned object:

    * Cloning an individual policy object is not supported.
    * Cloning a schema results in the cloning of all policies within the schema.
    * A cloned table maps to the same policies as the source table. In other words, if a policy is set on the base table or its columns, the
      policy is attached to the cloned table or its columns.

      + If a table or view exists in the source schema/database and has references to policies in the same schema/database, the cloned table or
        view is mapped to the corresponding cloned policy (in the target schema/database) instead of the policy in the source schema/database.
      + If the source table refers to a policy in a different schema (i.e. a foreign reference), then the cloned table retains the
        foreign reference.

    For more information, see [CREATE <object> … CLONE](../sql-reference/sql/create-clone.md).

CREATE TABLE … LIKE:
:   If a row access policy is set on the base table, the row access policy is not set on a column in the new table. The new table
    is empty.

CREATE TABLE … AS SELECT:
:   If a row access policy is set on the base table, the new table contains the filtered rows based on the row access policy definition. The
    new table does not have a row access policy set on a column.

### Query profile

At query runtime, Snowflake creates a dynamic secure view.

When using the [EXPLAIN](../sql-reference/sql/explain.md) command on a database object in which a row access policy is set, the query result
indicates that a row access policy is present. When a row access policy is set on the database object, the EXPLAIN query result specifies
the following column values:

* The `operation` column includes the value `DynamicSecureView`.
* The `object` column includes the value `"<object_name> (+ RowAccessPolicy)"`.

Each step in the query plan that requires invoking the row access policy results in the `operation` and `object` columns specifying the
corresponding values for that step in the query plan. If the row access policy was invoked only once in the query, only one row in the
EXPLAIN query result includes the `DynamicSecureView` and `"<object_name> (+ RowAccessPolicy)"` values.

In the EXPLAIN command result and the [Query History](ui-snowsight-activity.md) page, Snowflake does not show users any
row access policy [information](../sql-reference/sql/create-row-access-policy.md) (i.e. policy name, policy signature, policy expression) or
the objects accessed by the policy.

The following example indicates a row access policy being invoked only once.

> ```sqlexample
> EXPLAIN SELECT * FROM my_table;
> ```
>
> ```output
> +-------+--------+--------+-------------------+--------------------------------+--------+-------------+-----------------+--------------------+---------------+
> |  step |   id   | parent |     operation     |           objects              | alias  | expressions | partitionsTotal | partitionsAssigned | bytesAssigned |
> +-------+--------+--------+-------------------+--------------------------------+--------+-------------+-----------------+--------------------+---------------+
> ...
>
> | 1     | 2      | 1      | DynamicSecureView | "MY_TABLE (+ RowAccessPolicy)" | [NULL] | [NULL]      | [NULL]          | [NULL]             | [NULL]        |
> +-------+--------+--------+-------------------+--------------------------------+--------+-------------+-----------------+--------------------+---------------+
> ```

The following example indicates a row access policy being invoked twice on the same table:

> ```sqlexample
> EXPLAIN SELECT product FROM sales
>   WHERE revenue > (SELECT AVG(revenue) FROM sales)
>   ORDER BY product;
> ```
>
> ```output
> +--------+--------+--------+-------------------+-----------------------------+--------+-------------+-----------------+--------------------+---------------+
> |  step  |   id   | parent |     operation     |           objects           | alias  | expressions | partitionsTotal | partitionsAssigned | bytesAssigned |
> +--------+--------+--------+-------------------+-----------------------------+--------+-------------+-----------------+--------------------+---------------+
> ...
> | 1      | 0      | [NULL] | DynamicSecureView | "SALES (+ RowAccessPolicy)" | [NULL] | [NULL]      | [NULL]          | [NULL]             | [NULL]        |
> ...
> | 2      | 2      | 1      | DynamicSecureView | "SALES (+ RowAccessPolicy)" | [NULL] | [NULL]      | [NULL]          | [NULL]             | [NULL]        |
> +--------+--------+--------+-------------------+-----------------------------+--------+-------------+-----------------+--------------------+---------------+
> ```

### Time Travel

Snowflake supports time travel on tables and views with a row access policy.

At query run time, Snowflake evaluates the row access policy’s mapping tables at the time of the query; in other words, time travel does
not affect the mapping table.

For more information, see [Understanding & using Time Travel](data-time-travel.md).

### Replication

Row access policies and their assignments can be replicated using database replication and replication groups.

For [database replication](database-replication-considerations.md), the replication operation fails if either of the
following conditions is true:

* The primary database is in an Enterprise (or higher) account and contains a policy but one or more of the accounts approved for
  replication are on lower editions.
* A table or view contained in the primary database has a [dangling reference](database-replication-considerations.md) to a
  row access policy in another database.

The dangling reference behavior for database replication can be avoided when replicating multiple databases in a
[replication group](account-replication-intro.md).

> > **Note:**
> >
> > If using failover or failback actions, the Snowflake account must be Business Critical Edition or higher.
> >
> > For more information, see [Introduction to replication and failover across multiple accounts](account-replication-intro.md).

### Data Sharing

Usage:
:   * If the provider assigns a policy to a shared table or view and the policy conditions call the
      [CURRENT_ROLE](../sql-reference/functions/current_role.md) or [CURRENT_USER](../sql-reference/functions/current_user.md) function, or the policy conditions call a [secure UDF](../developer-guide/secure-udf-procedure.md), Snowflake returns a NULL value for the function or the UDF in the consumer account.

      The reason is that the owner of the data being shared does not typically control the users or roles in the account in which the table
      or view is being shared. As a workaround, use the [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) function in the policy conditions.

      Alternatively, as a provider, write the policy conditions to call the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md)
      function and share the database role. As a consumer, grant the shared database role to an account role. For details, see
      [Share data protected by a policy](data-sharing-policy-protected-data.md).

Limitations:
:   * A data sharing provider cannot create a policy in a [reader account](data-sharing-reader-create.md).
    * Data sharing consumers cannot apply a policy to a shared table or view. As a workaround, import the shared database and create a local
      view from the shared table or view.
    * Data sharing consumers cannot query a shared table or view that references two different providers. For example:

      + `rap1` is a row access policy that protects the table named `t1`, where `t1` is in the share named `share1` from a provider.
      + The `rap1` policy conditions reference a mapping table named `t2`, where `t2` comes from `share2` and a different provider.
      + The consumer query on `t1` fails.
      + The provider for `t1` can query `t1`.
    * External functions:

      Snowflake returns an error if:

      + The policy assigned to a shared table or view is updated to call an external function.
      + The policy calls an external function and you attempt to assign the policy to a shared table or view.

### Snowflake Native App Framework

For details about using row access policies with a Snowflake Native App, see:

* [Restrictions on sharing data content that contains policies](../developer-guide/native-apps/preparing-data-content.md).
* [Define policies on proxy views](../developer-guide/native-apps/preparing-data-content.md).
* [Blocked context functions](../developer-guide/native-apps/redacted-content.md).

### Streamlit in Snowflake

Row access policies that are used in Streamlit in Snowflake apps have limitations with context functions in the body of a row access policy. For more information, see:

* [Context functions and row access policies in Streamlit in Snowflake](../developer-guide/streamlit/features/row-access.md)
* [Example: Access data in a table with row access policy using CURRENT_USER](../developer-guide/streamlit/features/row-access.md)

## Enforce row access policies on Apache Iceberg tables queried from Apache Spark™

Snowflake supports enforcing row access policies that are set on Apache Iceberg tables that you query from Apache Spark™ through
Snowflake Horizon Catalog. For more information,
see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

## Manage row access policies

### Choosing a centralized, hybrid, or decentralized management approach

To manage row access policies effectively, it is helpful to consider whether your approach to filtering rows should follow a centralized,
decentralized, or hybrid governance approach.

The following table summarizes some of the considerations with each of these three approaches.

| Policy Action | Centralized | Hybrid | Decentralized |
| --- | --- | --- | --- |
| Create policies | Governance officer | Governance officer | Individual teams |
| Apply policies to columns | Governance officer | Individual teams | Individual teams |

For syntax examples, see Summary of DDL commands, operations, and privileges.

> **Tip:**
>
> As a best practice, Snowflake recommends that your organization gathers all relevant stakeholders to determine the best management
> approach for implementing row access policies in your environment.

### Row access policy privileges

Snowflake supports the following row access policy privileges to determine whether users can create, set, and own row access policies.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Enables executing the add and drop operations for the row access policy on a table or view.  Note that granting the global APPLY ROW ACCESS POLICY privilege (i.e. APPLY ROW ACCESS POLICY on ACCOUNT) enables executing the DESCRIBE operation on tables and views.  For syntax examples, see Summary of DDL commands, operations, and privileges. |
| OWNERSHIP | Grants full control over the row access policy. Required to alter most properties of a row access policy. Only a single role can hold this privilege on a specific object at a time. |

### Row access policy DDL

Snowflake supports the following DDL commands and operations to manage row access policies:

* [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md)
* [ALTER ROW ACCESS POLICY](../sql-reference/sql/alter-row-access-policy.md)
* [DROP ROW ACCESS POLICY](../sql-reference/sql/drop-row-access-policy.md)
* [SHOW ROW ACCESS POLICIES](../sql-reference/sql/show-row-access-policies.md)
* [DESCRIBE ROW ACCESS POLICY](../sql-reference/sql/desc-row-access-policy.md)
* [ALTER TABLE](../sql-reference/sql/alter-table.md), [ALTER EXTERNAL TABLE](../sql-reference/sql/alter-external-table.md), and [ALTER VIEW](../sql-reference/sql/alter-view.md) (to add/drop a policy on a table or view)

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between the row access policy DDL operations and their necessary privileges.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege required |
| --- | --- |
| Create row access policy | A role with the CREATE ROW ACCESS POLICY privilege in the same schema. |
| Alter row access policy | The role with the OWNERSHIP privilege on the row access policy. |
| `Add/Drop` row access policy | A role with the APPLY ROW ACCESS POLICY privilege on the account or a role with the OWNERSHIP privilege on the database object and the APPLY privilege on the row access policy object. |
| Drop row access policy | One of the following: A role with the OWNERSHIP privilege on the row access policy or . A role with the OWNERSHIP privilege on the schema in which the row access policy exists. |
| Show row access policies | One of the following: . A role with the APPLY ROW ACCESS POLICY privilege, or . The OWNERSHIP privilege on the row access policy, or . The APPLY privilege on the row access policy. |
| Describe row access policy | One of the following: A role with the APPLY ROW ACCESS POLICY privilege, or . The OWNERSHIP privilege on the row access policy, or . The APPLY privilege on the row access policy. |

Snowflake supports different permissions to create and set a row access policy on an object.

1. For a centralized row access policy management approach, in which the `rap_admin` custom role creates and sets row access policies on
   all objects, the following permissions are necessary:

   ```sqlexample
   use role securityadmin;
   grant create row access policy on schema <db_name.schema_name> to role rap_admin;
   grant apply row access policy on account to role rap_admin;
   ```
2. In a hybrid management approach, a single role has the CREATE ROW ACCESS POLICY privilege to ensure consistent policy creation to
   optimize query performance and individual teams or roles have the APPLY privilege for a specific row access policy to protect their tables and views.

   For example, the custom role `finance_role` role can be granted the permission to add the row access policy `rap_finance` on tables
   and views the role owns:

   ```sqlexample
   use role securityadmin;
   grant create row access policy on schema <db_name.schema_name> to role rap_admin;
   grant apply on row access policy rap_finance to role finance_role;
   ```

## Monitor row access policies with SQL

You can monitor row access policy usage through two different Account Usage views and an Information Schema table.

It can be helpful to think of two general approaches to determine how to monitor row access policy usage.

* Discover row access policies
* Identify assignments

### Discover row access policies

You can use the [ROW_ACCESS_POLICIES](../sql-reference/account-usage/row_access_policies.md) view in the Account Usage schema of the
shared SNOWFLAKE database. This view is a *catalog* for all row access policies in your Snowflake account. For example:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.ROW_ACCESS_POLICIES
> ORDER BY POLICY_NAME;
> ```

### Identify assignments

Snowflake supports different options to identify row access policy assignments, depending on whether the query needs to target the
account or a specific database.

* Account-level query:

  Use the Account Usage [POLICY_REFERENCES](../sql-reference/account-usage/tag_references.md) view to determine all of the tables
  that have a row access policy. For example:

  > ```sqlexample
  > SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.POLICY_REFERENCES
  > ORDER BY POLICY_NAME, REF_COLUMN_NAME;
  > ```
* Database-level query:

  Every Snowflake database includes a [Snowflake Information Schema](../sql-reference/info-schema.md). Use the Information Schema table function
  [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) to determine all of the objects associated with a specific row access policy:

  > ```sqlexample
  > SELECT *
  > FROM TABLE(
  >   my_db.INFORMATION_SCHEMA.POLICY_REFERENCES(
  >     POLICY_NAME => 'rap_t1'
  >   )
  > );
  > ```

## Monitor row access policies with Snowsight

You can use the Snowsight Governance & security » Tags & policies area to monitor and report on the usage of
policies and tags with tables, views, and columns. There are two different interfaces: Dashboard and Tagged Objects.

When using the Dashboard and the Tagged Objects interface, note the following details.

* The Dashboard and Tagged Objects interfaces require a running warehouse.
* Snowsight updates the Dashboard every 12 hours.
* The Tagged Objects information latency can be up to two hours and returns up to 1000 objects.

### Accessing the Governance area in Snowsight

To access the Tags & policies area, your Snowflake account must be [Enterprise Edition or higher](intro-editions.md).
Additionally, you must do either of the following:

* Use the ACCOUNTADMIN role.
* Use an account role that is directly granted the GOVERNANCE_VIEWER and OBJECT_VIEWER database roles.

  You must use an account role with these database role grants. Currently, Snowsight does not evaluate role hierarchies
  and user-defined database roles that have access to tables, views, data access policies, and tags.

  To determine if your account role is granted these two database roles, use a [SHOW GRANTS](../sql-reference/sql/show-grants.md) command:

  > ```sqlexample
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```
  >
  > ```output
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | created_on                    | privilege | granted_on    | name                        | granted_to | grantee_name    | grant_option | granted_by |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > | 2024-01-24 17:12:26.984 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.GOVERNANCE_VIEWER | ROLE       | DATA_ENGINEER   | false        |            |
  > | 2024-01-24 17:12:47.967 +0000 | USAGE     | DATABASE_ROLE | SNOWFLAKE.OBJECT_VIEWER     | ROLE       | DATA_ENGINEER   | false        |            |
  > |-------------------------------+-----------+---------------+-----------------------------+------------+-----------------+--------------+------------|
  > ```

  If your account role is not granted either or both of these database roles, use the [GRANT DATABASE ROLE](../sql-reference/sql/grant-database-role.md) command
  and run the SHOW GRANTS command again to confirm the grants:

  > ```sqlexample
  > USE ROLE ACCOUNTADMIN;
  > GRANT DATABASE ROLE SNOWFLAKE.GOVERNANCE_VIEWER TO ROLE data_engineer;
  > GRANT DATABASE ROLE SNOWFLAKE.OBJECT_VIEWER TO ROLE data_engineer;
  > SHOW GRANTS LIKE '%VIEWER%' TO ROLE data_engineer;
  > ```

  For details about these database roles, see [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md).

### Dashboard

As a data administrator, you can use the Dashboard interface to monitor tag and policy usage in the following ways.

* Coverage: specifies the count and percentage based on whether a table, view, or column has a policy or tag.
* Prevalence: lists and counts the most frequently used policies and tags.

The coverage and prevalence provide a snapshot as to how well the data is protected and tagged.

When you select a count number, percentage, policy name, or tag name, the Tagged Objects interface opens. The Tagged Objects
interface updates the filters automatically based on your selection in the Dashboard.

The monitoring information is an alternative or complement to running complex and query-intensive operations on multiple Account
Usage views.

These views might include, but are not limited to, the [COLUMNS](../sql-reference/account-usage/columns.md),
[POLICY_REFERENCES](../sql-reference/account-usage/policy_references.md), [TABLES](../sql-reference/account-usage/tables.md),
[TAG_REFERENCES](../sql-reference/account-usage/tag_references.md), and [VIEWS](../sql-reference/account-usage/views.md) views.

### Tagged Objects

As a data administrator, you can use this table to associate the coverage and prevalence in the Dashboard to a list of specific
tables, view, or columns quickly. You can also filter the table results manually as follows.

* Choose Tables or Columns.
* For tags, you can filter with tags, without tags, or by a specific tag.
* For policies, you can filter with policies, without policies, or by a specific policy.

When you select a row in the table, the Table Details or Columns tab in Catalog » Database Explorer opens. You can edit
the tag and policy assignments as needed.

## Audit row access policies

Snowflake supports the following approaches to facilitate row access policy auditing and governance operations.

* Use [SHOW ROW ACCESS POLICIES](../sql-reference/sql/show-row-access-policies.md) to produce a list of row access policies that have not been dropped from your
  account.
* Row access policy administrators (i.e. users with the row access policy OWNERSHIP privilege) can
  use [Time Travel](data-time-travel.md) or [streams](streams-intro.md) to capture historical data about any
  mapping tables referenced in their row access policies.
* To determine the data a given user can access, the row access policy administrator can assume the role of the user and run a query.

  + Snowflake supports defining a row access policy `expression` with custom logic to support this behavior in the
    [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md) command.
  + Snowflake does not currently have a default mechanism (e.g. a dedicated system or context function) to support this operation.
* If a given row access policy uses mapping tables to determine which role and user populations can access row data, the row access policy
  owner can query the mapping tables to determine authorized user access on demand.
* Snowflake captures and logs error message information related to row access policies in the account usage
  [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md) view. If an error occurs in a query, Snowflake records the first error message that
  occurs during the query evaluation. For more information on row access policy error messages, see Troubleshoot Row Access Policies.
* To determine the data a given user accessed in the past as it relates to row access policies on database objects, use Time Travel in
  combination with the ROW_ACCESS_POLICIES Account Usage view and the POLICY_REFERENCES Information Schema table function.

  + If the policy and mapping tables, if present, have not changed, the row access policy administrator can assume the role of the user and
    run a Time Travel query. The values of relevant session parameters, such as [CURRENT_ROLE](../sql-reference/functions/current_role.md), are available
    in the query result.
  + If the policy or mapping tables have changed, the row access policy administrator must run a time travel query on the mapping table and
    reconstruct the row access policy that existed at the specified incident time. After those steps, the row access policy administrator can
    begin to query the data and proceed with their analysis.

## Troubleshoot row access policies

The following behaviors and error messages apply to row access policies.

| Behavior | Error Message | Troubleshooting Action |
| --- | --- | --- |
| Cannot set a row access policy (Materialized view). | Row access policy cannot be attached to a Materialized view. | Verify that a row access policy can be set on the materialized view. See Materialized Views (in this topic). |
| Cannot create a row access policy (Boolean). | `003551=SQL compilation error:` Row access policy return type ‘’{0}’’ is not BOOLEAN. | A row access policy definition must have `RETURNS BOOLEAN`. Rewrite the row access policy as shown in [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md). |
| Cannot create a row access policy (Database). | This session does not have a current database. Call ‘USE DATABASE’, or use a qualified name. | Since a row access policy is a schema-level object, define a database and schema for the current session or use the fully qualified name in the CREATE ROW ACCESS POLICY command. For more information, see [Object name resolution](../sql-reference/name-resolution.md). |
| Cannot create a row access policy (Object exists) | SQL compilation error: Object ‘<name>’ already exists. | Since a row access policy in the schema already exists with the stated name, recreate the row access policy with a different `name` value. |
| Cannot create a row access policy (Schema ownership). | SQL access control error: Insufficient privileges to operate on schema ‘S1’ | Verify the privileges to create a row access policy in Summary of DDL Commands, Operations, and Privileges (in this topic). |
| Cannot create a row access policy (Schema usage). | SQL compilation error: Schema ‘<schema_name>’ does not exist or not authorized. | Verify that the specified schema exists and the privileges to create a row access policy in Summary of DDL Commands, Operations, and Privileges (in this topic). |
| Cannot describe a row access policy (Usage only). | SQL compilation error: Row access policy ‘RLS_AUTHZ_DB.S_B.P1’ does not exist or not authorized. | Having the USAGE privilege on the parent database and schema in which the row access policy exists is not sufficient to execute a DESCRIBE operation on the row access policy. Verify the row access policy exists and the privileges to describe a row access policy in Summary of DDL Commands, Operations, and Privileges (in this topic). |
| Cannot drop a row access policy. (Maintenance). | SQL compilation error: Row access policy ‘RLS_AUTHZ_DB.S_B.P1’ does not exist or not authorized. | Verify the specified row access policy exists and the privileges to drop a row access policy in Summary of DDL Commands, Operations, and Privileges (in this topic). |
| Cannot execute `UNDROP` on a row access policy. (Maintenance) | Unsupported feature ‘UNDROP not supported for objects of type ROW_ACCESS_POLICY’. | To reinstate a row access policy, execute a CREATE ROW ACCESS POLICY command, and then add the row access policy to a database object using an ALTER TABLE or ALTER VIEW command as shown in [ALTER TABLE](../sql-reference/sql/alter-table.md) or [ALTER VIEW](../sql-reference/sql/alter-view.md). |
| Cannot update a row access policy (Name/Operation). | SQL compilation error: Object found is of type ‘ROW_ACCESS_POLICY’, not specified type ‘MASKING_POLICY’ | Double-check the query to verify the name of the object and the intended operation on the object. . . For example, Snowflake does not support `ALTER ROW ACCESS POLICY <name>;`. . . Instead, use a CREATE OR REPLACE ROW ACCESS POLICY command to update a row access policy. For more information on row access policy operations, see Summary of DDL Commands, Operations, and Privileges (in this topic). |
| Cannot use row access policies with a Snowflake feature or service (Unsupported feature). | Unsupported feature ‘CREATE ON OBJECTS ENFORCED BY ROW ACCESS POLICY’. | Some Snowflake features and services do not support row access policies. For more information, see the Limitations and Use Row Access Policies with Snowflake Objects and Features sections in this topic. |
| Cannot update a row access policy (Unsupported token). | Unsupported feature ‘TOK_ROW_ACCESS_POLICY’. | `TOK` refers to token, which can be returned if an query is unsupported and/or inaccurate; Snowflake’s SQL compiler does not know how to process the given query. . For example `alter row access policy p1_test set comment = 'test policy 1';`. In this example, the `ALTER` command cannot be used on the policy object directly; use an ALTER TABLE or ALTER VIEW command instead as shown in Summary of DDL Commands, Operations, and Privileges (in this topic). |

**Next Topics:**

* [Use row access policies](security-row-using.md)

---
title: Understanding Snowflake Table Structures
source: https://docs.snowflake.com/en/user-guide/tables-micro-partitions.md
section: User Guide
---

# Understanding Snowflake Table Structures

All data in Snowflake is stored in database tables, logically structured as collections of columns and rows. To best utilize Snowflake tables, particularly large tables, it is helpful to have an
understanding of the physical structure behind the logical structure.

These topics describe *micro-partitions* and *data clustering*, two of the principal concepts utilized in Snowflake physical table structures. They also provides guidance for explicitly defining
*clustering keys* for very large tables (in the multi-terabyte range) to help optimize table maintenance and query performance.

**Next Topics:**

* [Micro-partitions & Data Clustering](tables-clustering-micropartitions.md)
* [Clustering Keys & Clustered Tables](tables-clustering-keys.md)
* [Automatic Clustering](tables-auto-reclustering.md)
* [Manual Reclustering — *Deprecated*](tables-clustering-manual.md)

---
title: Understanding storage cost
source: https://docs.snowflake.com/en/user-guide/cost-understanding-data-storage.md
section: User Guide
---

# Understanding storage cost

Storage cost represents the cost of:

* Files [staged](data-load-considerations-stage.md) for bulk data loading/unloading (stored compressed or uncompressed).
* Database tables, including historical data for [Time Travel](data-time-travel.md).
* [Fail-safe](data-failsafe.md) for database tables.
* [Clones](object-clone.md) of database tables that reference data deleted in the table that owns the clones.

The monthly costs for storing data in Snowflake is based on a flat rate per terabyte (TB).
The amount charged depends on your type of account (Capacity or On Demand) and region (US or EU).

For storage pricing, see the [Snowflake Pricing Guide](https://www.snowflake.com/pricing/pricing-guide/).

## Staged file costs

Files staged for bulk data loading/unloading incur storage costs based on the size of the files. For more information on loading data, see [Load data into Snowflake](../guides-overview-loading-data.md).

## Database costs

Database costs include data stored in database tables. Database costs also include historical data maintained for Time Travel.
Snowflake automatically compresses all data stored in tables and uses the compressed file size to calculate the total storage used for
an account.

See also [Data storage considerations](tables-storage-considerations.md).

## Time Travel and Fail-safe costs

Time Travel and Fail-safe fees are calculated for each 24-hour period (i.e. 1 day) from the time the data changed.
The number of days historical data is maintained is based on the table type and the Time Travel retention period for the table.

Snowflake minimizes the amount of storage required for historical data by maintaining only the information required to restore the
individual table rows that were updated or deleted. As a result, storage usage is calculated as a percentage of the table that changed.
Full copies of tables are only maintained when tables are dropped or truncated.

See also [Storage costs for Time Travel and Fail-safe](data-cdp-storage-costs.md).

## Temporary and transient tables costs

To help manage the storage costs associated with Time Travel and Fail-safe, Snowflake provides two table types, temporary and transient.
Temporary and transient tables do not incur the same fees as permanent tables:

* Transient and temporary tables contribute to the storage charges that Snowflake bills your account until explicitly dropped.
  Data stored in these table types contributes to the overall storage charges Snowflake bills your account while they exist.
* Temporary tables are typically used for non-permanent session specific transitory data such as ETL or other session specific data.
  Temporary tables only exist for the lifetime or their associated session. On session end, temporary table data is purged and
  unrecoverable. Temporary tables are not accessible outside the specific session which created them.
* Transient tables exist until explicitly dropped and are available to all users with appropriate privileges.
* Transient and temporary tables can have a Time Travel retention period of either 0 or 1 day.
* Transient and temporary tables have no Fail-safe period.
* Transient and temporary tables can, at most, incur a one day’s worth of storage cost.

The following table illustrates the different scenarios, based on table type:

| Table Type | Time Travel Retention Period (Days) | Fail-safe Period (Days) | Min , Max Historical Data Maintained (Days) |
| --- | --- | --- | --- |
| Permanent | 0 or 1 (for Snowflake Standard Edition) | 7 | **7 , 8** |
| 0 to 90 (for Snowflake Enterprise Edition) | 7 | **7 , 97** |
| Transient | 0 or 1 | 0 | **0 , 1** |
| Temporary | 0 or 1 | 0 | **0 , 1** |

### Using temporary and transient tables to manage storage costs

When choosing whether to store data in permanent, temporary, or transient tables, consider the following:

* Temporary tables are dropped when the session in which they were created ends. Data stored in temporary tables is not recoverable
  after the table is dropped.
* Historical data in transient tables cannot be recovered by Snowflake after the Time Travel retention period ends. Use transient
  tables only for data you can replicate or reproduce
  independently from Snowflake.
* Long-lived tables, such as fact tables, should always be defined as permanent to ensure they are fully protected by Fail-safe.
* Short-lived tables (i.e. <1 day), such as ETL work tables, can be defined as transient to eliminate Fail-safe costs.
* If downtime and the time required to reload lost data are factors, permanent tables, even with their added Fail-safe costs, may offer a
  better overall solution than transient tables.

> **Note:**
>
> The default type for tables is permanent. To define a table as temporary or transient, you must explicitly specify the type during table
> creation.

## Hybrid table storage costs

Cost for storage of hybrid tables depends on the amount of data that you are storing.
Storage cost is based on a flat monthly rate per gigabyte (GB). See Table 3(b) in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf), which covers unit pricing for hybrid table storage.

Note that hybrid table storage *for the row-store copy of the data* is more expensive than traditional
Snowflake storage. The copy of the current data in the column store (object storage) is not billed.

Historical time travel data is billed at standard storage prices.

For more information, see [Evaluate cost for hybrid tables](tables-hybrid-cost.md).

## Cloning tables, schemas, and databases costs

Snowflake’s zero-copy cloning feature provides a convenient way to quickly take a “snapshot” of any table (excluding hybrid tables), schema, or database and create
a derived copy of that object which initially shares the underlying storage. This can be extremely useful for creating instant backups that
do not incur any additional costs (until changes are made to the cloned object).

However, cloning makes calculating total storage usage more complex because each clone has its own separate life-cycle. This means that
changes can be made to the original object or the clone independently of each other and these changes are protected through CDP.

For example, when a clone is created of a table, the clone utilizes no storage because it shares all the existing micro-partitions of the
original table at the time it was cloned; however, rows can then be added, deleted, or updated in the clone independently from the original
table. Each change to the clone results in new micro-partitions that are owned exclusively by the clone and are protected through CDP.

In addition, clones can be cloned, with no limitations on the number or iterations of clones that can be created (e.g. you can create a
clone of a clone of a clone, and so on), which results in an n-level hierarchy of cloned objects, each with their own portion of shared and
independent storage.

## Cross-Cloud Auto-Fulfillment costs

Cross-Cloud Auto-Fulfillment lets you provide a data product to consumers in other cloud regions without manual data replication.
When your data product is auto-fulfilled to another region, you incur storage and other costs. For details, see
[Auto-fulfillment costs](../collaboration/provider-understand-cost-auto-fulfillment.md).

## Storage request costs

When an external query engine accesses an Apache Iceberg™ table that uses
[Snowflake Storage](tables-iceberg-internal-storage.md) through the
[Snowflake Horizon Catalog](snowflake-horizon.md), Snowflake charges a per-request fee for each
HTTP request sent to the underlying storage system. The rate depends on the request type:

* PUT, COPY, POST, PATCH, and LIST operations are billed as “class 1” requests.
* GET and SELECT operations are billed as “class 2” requests.

Snowflake only bills for table accesses through the Horizon Catalog. Direct access using the Snowflake query engine
isn’t charged. Non-Iceberg accesses, such as FDN and stage accesses, don’t incur storage request fees.
For more information, see [Request cost](tables-iceberg-internal-storage.md).

**Next Topic**

> * [Exploring storage cost](cost-exploring-data-storage.md)

---
title: Unload into a Snowflake stage
source: https://docs.snowflake.com/en/user-guide/data-unload-snowflake.md
section: User Guide
---

# Unload into a Snowflake stage

This set of topics describes how to use the COPY command to unload data from a table into an internal (i.e. Snowflake) stage. You can then download the unloaded data files to your local file system.

As illustrated in the diagram below, unloading data to a local file system is performed in two, separate steps:

Step 1:
:   Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to copy the data from the Snowflake database table into one or more files in a Snowflake stage. In the SQL statement, you specify the
    stage (named stage or table/user stage) where the files are written.

    Regardless of the stage you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to write rows from the table.

Step 2:
:   Use the [GET](../sql-reference/sql/get.md) command to download the data files to your local file system.

> **Tip:**
>
> The instructions in this set of topics assume you have read [File formats to unload data](data-unload-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data unloading considerations](data-unload-considerations.md) for best practices, tips, and other guidance.

## Unload the data

This section provides instructions for unloading table data to a named internal stage, table stage, or user stage.

### Unload data to a named internal stage

Internal stages are named database objects that provide the greatest degree of flexibility for data unloading. Because they are database objects, privileges for named stages can be granted to any role.

You can create an internal stage using either the web interface or SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer. Then select the *<db_name>* » Stages.
>
> SQL:
> :   [CREATE STAGE](../sql-reference/sql/create-stage.md)

#### Create a named stage

The following example creates an internal stage that references the named file format object called `my_csv_unload_format` that was created in [File formats to unload data](data-unload-prepare.md):

> ```sqlexample
> CREATE OR REPLACE STAGE my_unload_stage
>   FILE_FORMAT = my_csv_unload_format;
> ```

#### Unload data to the named stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload all the rows from a table into one or more files into the `my_csv_unload_format` stage. The statement prefixes the unloaded
   file(s) with `unload/` to organize the files in the stage:

   For example:

   > ```sqlexample
   > COPY INTO @mystage/unload/ from mytable;
   > ```

   Note that the `@` character by itself identifies a named stage.

   > **Note:**
   >
   > Because the file format options were defined for the stage, it is not necessary to specify the same file format options in the COPY command.
2. Use the [LIST](../sql-reference/sql/list.md) command to view a list of files that have been unloaded to the stage:

   ```sqlexample
   LIST @mystage;

   +----------------------------------+------+----------------------------------+-------------------------------+
   | name                             | size | md5                              | last_modified                 |
   |----------------------------------+------+----------------------------------+-------------------------------|
   | mystage/unload/data_0_0_0.csv.gz |  112 | 6f77daba007a643bdff4eae10de5bed3 | Mon, 11 Sep 2017 18:13:07 GMT |
   +----------------------------------+------+----------------------------------+-------------------------------+
   ```
3. Use the [GET](../sql-reference/sql/get.md) command to download the generated file(s) from the table stage to your local machine. The following example downloads the files to the `data/unload` directory:

   For example:

   Linux or macOS:

   > ```sqlexample
   > GET @mystage/unload/data_0_0_0.csv.gz file:///data/unload;
   > ```

   Windows:

   > ```sqlexample
   > GET @mystage/unload/data_0_0_0.csv.gz file://C:\data\unload;
   > ```

### Unload data to a table stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload all the rows from a table into one or more files in the stage for the table. The following example unloads data files to the stage
   using the named `my_csv_unload_format` file format created in [File formats to unload data](data-unload-prepare.md). The statement prefixes the unloaded file(s) with `unload/` to organize the files
   in the stage:

   For example:

   > ```sqlexample
   > COPY INTO @%mytable/unload/ from mytable FILE_FORMAT = (FORMAT_NAME = 'my_csv_unload_format' COMPRESSION = NONE);
   > ```

   Note that the `@%` character combination identifies a table stage.
2. Use the [LIST](../sql-reference/sql/list.md) command to view a list of files that have been unloaded to the stage:

   ```sqlexample
   LIST @%mytable;

   +-----------------------+------+----------------------------------+-------------------------------+
   | name                  | size | md5                              | last_modified                 |
   |-----------------------+------+----------------------------------+-------------------------------|
   | unload/data_0_0_0.csv |   96 | 29918f18bcb35e7b6b628ca41024236c | Mon, 11 Sep 2017 17:45:20 GMT |
   +-----------------------+------+----------------------------------+-------------------------------+
   ```
3. Use the [GET](../sql-reference/sql/get.md) command to download the generated file(s) from the table stage to your local machine. The following example downloads the files to the `data/unload` directory:

   For example:

   Linux or macOS:

   > ```sqlexample
   > GET @%mytable/unload/data_0_0_0.csv file:///data/unload;
   > ```

   Windows:

   > ```sqlexample
   > GET @%mytable/unload/data_0_0_0.csv file://C:\data\unload;
   > ```

### Unload data to your user stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload all the rows from a table into one or more files in your stage. The following example unloads data files to your user stage using
   the named `my_csv_unload_format` file format created in [File formats to unload data](data-unload-prepare.md). The statement prefixes the unloaded file(s) with `unload/` to organize the files in the stage:

   For example:

   > ```sqlexample
   > COPY INTO @~/unload/ from mytable FILE_FORMAT = (FORMAT_NAME = 'my_csv_unload_format' COMPRESSION = NONE);
   > ```

   Note that the `@~` character combination identifies a user stage.
2. Use the [LIST](../sql-reference/sql/list.md) command to view a list of files that have been unloaded to the stage:

   ```sqlexample
   LIST @~;

   +-----------------------+------+----------------------------------+-------------------------------+
   | name                  | size | md5                              | last_modified                 |
   |-----------------------+------+----------------------------------+-------------------------------|
   | unload/data_0_0_0.csv |   96 | 94a306c55733b95a0887511ff355936b | Mon, 11 Sep 2017 17:25:07 GMT |
   +-----------------------+------+----------------------------------+-------------------------------+
   ```
3. Use the [GET](../sql-reference/sql/get.md) command to download the generated file(s) from your stage to your local machine. The following example downloads the files to the `data/unload` directory:

   For example:

   Linux or macOS:

   > ```sqlexample
   > GET @~/unload/data_0_0_0.csv file:///data/unload;
   > ```

   Windows:

   > ```sqlexample
   > GET @~/unload/data_0_0_0.csv file://C:\data\unload;
   > ```

## Manage unloaded data files

Staged files can be deleted from a Snowflake stage using the [REMOVE](../sql-reference/sql/remove.md) command to remove the files in the stage after you are finished with them.

Removing files improves performance when loading data, because it reduces the number of files that the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command must scan to verify whether existing files in a
stage were loaded already.

---
title: Unload into Amazon S3
source: https://docs.snowflake.com/en/user-guide/data-unload-s3.md
section: User Guide
---

# Unload into Amazon S3

If you already have a Amazon Web Services (AWS) account and use S3 buckets for storing and managing your data files, you can make use of your existing buckets and folder paths when unloading data from
Snowflake tables. This topic describes how to use the COPY command to unload data from a table into an Amazon S3 bucket. You can then download the unloaded data files to your local file system.

As illustrated in the diagram below, unloading data to an S3 bucket is performed in two steps:

Step 1:
:   Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to copy the data from the Snowflake database table into one or more files in an S3 bucket. In the command, you specify a named
    external stage object that references the S3 bucket (recommended) or you can choose to unload directly to the bucket by specifying the URI and either the storage integration or the security credentials (if required) for the bucket.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to write rows from the table.

Step 2:
:   Use the interfaces/tools provided by Amazon to download the files from the S3 bucket.

> **Tip:**
>
> The instructions in this set of topics assume you have read [File formats to unload data](data-unload-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data unloading considerations](data-unload-considerations.md) for best practices, tips, and other guidance.

## Allow the Amazon Virtual Private Cloud IDs

If an AWS administrator in your organization has not explicitly granted Snowflake access to your AWS S3 storage account, you can do so now.
Follow the steps in [Allowing the Virtual Private Cloud IDs](data-load-s3-allow.md) in the data loading configuration instructions.

## Configure an S3 bucket for unloading data

Snowflake requires the following permissions on an S3 bucket and folder to create new files in the folder (and any sub-folders):

* `s3:DeleteObject`
* `s3:PutObject`

As a best practice, Snowflake recommends configuring a storage integration object to delegate authentication responsibility for external cloud storage to a Snowflake identity and access management (IAM) entity.

For configuration instructions, see [Configuring secure access to Amazon S3](data-load-s3-config.md).

## (Optional) Configure support for Amazon S3 access control lists

Snowflake storage integrations support AWS access control lists (ACLs) to grant the bucket owner full control. Files created in Amazon S3 buckets from unloaded table data are owned by an AWS Identity and Access Management (IAM) role. ACLs support the use case where IAM roles in one AWS account are configured to access S3 buckets in one or more other AWS accounts. Without ACL support, users in the bucket-owner accounts could not access the data files unloaded to an external (S3) stage using a storage integration. When users unload Snowflake table data to data files in an external (S3) stage using [COPY INTO <location>](../sql-reference/sql/copy-into-location.md), the unload operation applies an ACL to the unloaded data files. The data files apply the `"s3:x-amz-acl":"bucket-owner-full-control"` privilege to the files, granting the S3 bucket owner full control over them.

Enable ACL support in the storage integration for an S3 stage via the optional `STORAGE_AWS_OBJECT_ACL = 'bucket-owner-full-control'` parameter. A storage integration is a Snowflake object that stores a generated identity and access management (IAM) user for your S3 cloud storage, along with an optional set of allowed or blocked storage locations (i.e. S3 buckets). An AWS administrator in your organization adds the generated IAM user to the role to grant Snowflake permissions to access specified S3 buckets. This feature allows users to avoid supplying credentials when creating stages or loading data. An administrator can set the `STORAGE_AWS_OBJECT_ACL` parameter when creating a storage integration (using [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md)) or later (using [ALTER STORAGE INTEGRATION](../sql-reference/sql/alter-storage-integration.md)).

## Unload data into an external stage

External stages are named database objects that provide the greatest degree of flexibility for data unloading. Because they are database objects, privileges for named stages can be granted to any role.

You can create an external named stage using either Snowsight or SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer » *<db_name>* » Stages » Create
>
> SQL:
> :   [CREATE STAGE](../sql-reference/sql/create-stage.md)

### Create a named stage

Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
This process might leave incomplete uploads in the storage location for your external stage.

To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.

The following example creates an external stage named `my_ext_unload_stage` using an S3 bucket named `unload` with a
folder path named `files`. The stage accesses the S3 bucket using an existing storage integration named `s3_int`.

The stage references a named file format object called `my_csv_unload_format`. For instructions, see [File formats to unload data](data-unload-prepare.md).

```sqlexample
CREATE OR REPLACE STAGE my_ext_unload_stage URL='s3://unload/files/'
  STORAGE_INTEGRATION = s3_int
  FILE_FORMAT = my_csv_unload_format;
```

### Unload data to the named stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table into an S3 bucket using the external stage.

   The following example uses the `my_ext_unload_stage` stage to unload all the rows in the `mytable` table into one or more files into the S3 bucket. A `d1` filename prefix is applied
   to the files:

   ```sqlexample
   COPY INTO @my_ext_unload_stage/d1 from mytable;
   ```
2. Use the S3 console (or equivalent client application) to retrieve the objects (i.e. files generated by the command) from the bucket.

## Unload data directly into an S3 bucket

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table directly into a specified S3 bucket. This option works well for ad hoc unloading, when you aren’t planning regular data unloading with the same table and bucket parameters.

   You must specify the URI for the S3 bucket and the storage integration or credentials for accessing the bucket in the COPY command.

   The following example unloads all the rows in the `mytable` table into one or more files with the folder path prefix `unload/` in the `mybucket` S3 bucket:

   ```sqlexample
   COPY INTO 's3://mybucket/unload/'
     FROM mytable
     STORAGE_INTEGRATION = s3_int;
   ```

   > **Note:**
   >
   > In this example, the referenced S3 bucket is accessed using a referenced storage integration named `s3_int`.
2. Use the S3 console (or equivalent client application) to retrieve the objects (i.e. files generated by the command) from the bucket.

---
title: Unload into Google Cloud Storage
source: https://docs.snowflake.com/en/user-guide/data-unload-gcs.md
section: User Guide
---

# Unload into Google Cloud Storage

If you already have a Google Cloud Storage account and use Cloud Storage buckets for storing and managing your files, you can make use of your existing buckets and folder paths when unloading data from Snowflake tables. This topic describes how to use the COPY command to unload data from a table into a Cloud Storage bucket. You can then download the unloaded data files to your local file system.

As illustrated in the diagram below, unloading data into a Cloud Storage bucket is performed in two steps:

Step 1:
:   Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to copy the data from the Snowflake database table into one or more files in a Cloud Storage bucket. In the command, you specify a named external stage object that references the Cloud Storage bucket (recommended) or you can choose to unload directly to the bucket by specifying the URI and storage integration (if required) for the bucket.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to write rows from the table.

Step 2:
:   Use the interfaces/tools provided by Google to download the files from the Cloud Storage bucket.

> **Tip:**
>
> The instructions in this set of topics assume you have read [File formats to unload data](data-unload-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data unloading considerations](data-unload-considerations.md) for best practices, tips, and other guidance.

## Configure Cloud Storage for unloading data

For Snowflake to write to a Cloud Storage bucket, you must configure a storage integration object to delegate authentication responsibility for external cloud storage to a Snowflake identity and access management (IAM) entity.

For configuration instructions, see [Configure an integration for Google Cloud Storage](data-load-gcs-config.md).

## Unload data into an external stage

External stages are named database objects that provide the greatest degree of flexibility for data unloading. Because they are database objects, privileges for named stages can be granted to any role.

You can create an external named stage using either the web interface or SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer. Then select the *<db_name>* » Stages.
>
> SQL:
> :   [CREATE STAGE](../sql-reference/sql/create-stage.md)

### Create a named stage

Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
This process might leave incomplete uploads in the storage location for your external stage.

To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.

The following example creates an external stage named `my_ext_unload_stage` with a folder path named `unload`. The stage references the following objects:

* A named storage integration called `gcs_int`. For instructions, see [Configure an integration for Google Cloud Storage](data-load-gcs-config.md).
* A named file format called `my_csv_unload_format`. For instructions, see [File formats to unload data](data-unload-prepare.md).

  ```sqlexample
  CREATE OR REPLACE STAGE my_ext_unload_stage
    URL='gcs://mybucket/unload'
    STORAGE_INTEGRATION = gcs_int
    FILE_FORMAT = my_csv_unload_format;
  ```

### Unload data to the named stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table into a Cloud Storage bucket using the external stage.

   The following example uses the `my_ext_unload_stage` stage to unload all the rows in the `mytable` table into one or more files into the Cloud Storage bucket. A `d1` filename prefix is applied to the files:

   > ```sqlexample
   > COPY INTO @my_ext_unload_stage/d1
   > FROM mytable;
   > ```
2. Use the tools provided by Cloud Storage to retrieve the objects (i.e. files generated by the command) from the bucket.

## Unload data directly into a Cloud Storage bucket

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table directly into a specified Cloud Storage bucket. This option works well for ad hoc unloading, when you aren’t planning regular data unloading with the same table and bucket parameters.

   You must specify the URI for the Cloud Storage bucket and the storage integration for accessing the bucket.

   The following example unloads all the rows in the `mytable` table into one or more files with the folder path prefix `unload/` in a Cloud Storage bucket:

   > ```sqlexample
   > COPY INTO 'gcs://mybucket/unload/'
   >   FROM mytable
   >   STORAGE_INTEGRATION = gcs_int;
   > ```
2. Use the Cloud Storage console (or equivalent client application) to retrieve the objects (i.e. files generated by the command) from the bucket.

---
title: Unload into Microsoft Azure
source: https://docs.snowflake.com/en/user-guide/data-unload-azure.md
section: User Guide
---

# Unload into Microsoft Azure

If you already have a Microsoft Azure account and use Azure containers for storing and managing your files, you can make use of your existing containers and folder paths when unloading data from
Snowflake tables. This topic describes how to use the COPY command to unload data from a table into an Azure container. You can then download the unloaded data files to your local file system.

Snowflake supports the following types of blob storage accounts:

* Blob storage
* Data Lake Storage Gen2
* General-purpose v1
* General-purpose v2

Snowflake does not support Data Lake Storage Gen1.

As illustrated in the diagram below, unloading data into an Azure container is performed in two steps:

Step 1:
:   Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to copy the data from the Snowflake database table into one or more files in an Azure container bucket. In the command, you specify
    a named external stage object that references the Azure container (recommended) or you can choose to unload directly to the container by specifying the URI and security credentials (if required) for
    the container.

    Regardless of the method you use, this step requires a running, current virtual warehouse for the session if you execute the command
    manually or within a script. The warehouse provides the compute resources to write rows from the table.

Step 2:
:   Use the interfaces/tools provided by Microsoft to download the files from the Azure container.

> **Tip:**
>
> The instructions in this set of topics assume you have read [File formats to unload data](data-unload-prepare.md) and have created a named file format, if desired.
>
> Before you begin, you may also want to read [Data unloading considerations](data-unload-considerations.md) for best practices, tips, and other guidance.

## Allow the Azure Virtual Network subnet IDs

If an Azure administrator in your organization has not explicitly granted Snowflake access to your Azure storage account, you can do so now.
Follow the steps in [Allow the VNet subnet IDs](data-load-azure-allow.md) in the data loading configuration instructions.

## Configure an Azure container for unloading data

For Snowflake to write to an Azure container, you must configure access to your storage
account. For instructions, see [Configure an Azure container for loading data](data-load-azure-config.md). Note that we
provide a single set of instructions, which call out the specific permissions required for
data loading or unloading operations.

## Unload data into an external stage

External stages are named database objects that provide the greatest degree of flexibility for data unloading. Because they are database objects, privileges for named stages can be granted to any role.

You can create an external named stage using either Snowsight or SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer » *<db_name>* » Stages » Create
>
> SQL:
> :   [CREATE STAGE](../sql-reference/sql/create-stage.md)

### Create a named stage

The following example creates an external stage named `my_ext_unload_stage` with a container named `mycontainer` and a folder path named `unload`. The stage references the named file format object called `my_csv_unload_format` that was created in [File formats to unload data](data-unload-prepare.md):

> ```sqlexample
> CREATE OR REPLACE STAGE my_ext_unload_stage
>   URL='azure://myaccount.blob.core.windows.net/mycontainer/unload'
>   CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=bgqQwoXwxzuD2GJfagRg7VOS8hzNr3QLT7rhS8OFRLQ%3D')
>   ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'kPxX0jzYfIamtnJEUTHwq80Au6NbSgPH5r4BDDwOaO8=')
>   FILE_FORMAT = my_csv_unload_format;
> ```

> **Note:**
>
> Use the `blob.core.windows.net` endpoint for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.

Note that the AZURE_SAS_TOKEN and MASTER_KEY values used in this example are for illustration purposes only.

### Unload data to the named stage

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table into an Azure container using the external stage.

   The following example uses the `my_ext_unload_stage` stage to unload all the rows in the `mytable` table into one or more files into the Azure container. A `d1` filename prefix is
   applied to the files:

   > ```sqlexample
   > COPY INTO @my_ext_unload_stage/d1 from mytable;
   > ```
2. Use the tools provided by Azure to retrieve the objects (i.e. files generated by the command) from the container.

## Unload data directly into an Azure container

1. Use the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command to unload data from a table directly into a specified Azure container. This option works well for ad hoc unloading, when you aren’t planning regular data unloading with the same table and container parameters.

   You must specify the URI for the Azure container and the security credentials for accessing the container in the COPY command.

   The following example unloads all the rows in the `mytable` table into one or more files with the folder path prefix `unload/` in an Azure container.

   This example references a storage integration created using [CREATE STORAGE INTEGRATION](../sql-reference/sql/create-storage-integration.md) by an account administrator (i.e. a user with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege. A storage integration allows users to avoid supplying credentials to access a private storage location:

   > ```sqlexample
   > COPY INTO 'azure://myaccount.blob.core.windows.net/mycontainer/unload/' FROM mytable STORAGE_INTEGRATION = myint;
   > ```
2. Use the Azure console (or equivalent client application) to retrieve the objects (i.e. files generated by the command) from the container.

---
title: Update billing contact information
source: https://docs.snowflake.com/en/user-guide/billing-contacts.md
section: User Guide
---

# Update billing contact information

[Converting to a paid account](admin-trial-account.md) allows you to see the billing contact information for your account in Snowsight.
On-demand, self-service (ODSS) [organization administrators](organization-administrators.md) can edit billing information
shown on Billing communication. Non-ODSS customers see their billing information with READ-ONLY access.

Customers with Marketplace accounts (Marketplace customers) have limited ability to edit their billing information on the Contacts tab. Only
ODSS Marketplace customers (who have the ORGADMIN role) can edit the information shown in Snowflake Marketplace billing communication
cards. More specifically, ODSS Marketplace **providers** cannot update their billing name and address fields using Snowsight. ODSS
Marketplace **consumers** can only update the Country field, by selecting one from the list of [Supported consumer locations](../collaboration/consumer-listings-paying.md). Non-ODSS Marketplace
customers see their billing information with READ-ONLY access.

Customers with at least one active Snowflake order form (Capacity customers) can see the information on Snowflake Marketplace billing
communication with READ-ONLY access.

## Edit billing contact information

To edit your Snowflake billing contact information:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. On the Billing and Payments page, select the Contacts tab.
4. Find the field that you want to edit, and select .
5. Type your updated information.

   If you are updating email addresses, you can update multiple email addresses at the same time. The list can be comma-separated or
   space-separated. For example, `updated-usage@snowflake.com, existing-usage@snowflake.com`.

   > **Tip:**
   >
   > Snowsight might suggest information that matches existing addresses when you update address information. You can accept (select)
   > or ignore such suggestions.
6. Update other fields, as necessary.
7. Select Save to save all updates.

### Receiving notification of changes

Snowflake notifies customers who update billing information about the change by sending an email message to the current email addresses
shown in the Usage emails and Invoices emails fields on the Billing communication card. If a customer changes the email address
in a specific email address field, Snowflake sends an email message to both the previous and current email addresses entered in that field.

Snowflake sends notification emails to customers about successful and unsuccessful update attempts. Unsuccessful update attempts also
generate a banner in Snowsight, which remains visible for up to 35 days or until the next update attempt.

## Access control requirements

You must have been granted the ORGADMIN role to edit information on the Snowflake billing communication card.

## Usage notes

* Updates to billing contact information might take several minutes to process.
* Pending updates appear as In-progress in Snowsight.

---
title: Use a catalog-linked database for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-catalog-linked-database.md
section: User Guide
---

# Use a catalog-linked database for Apache Iceberg™ tables

With a catalog-linked database, you can access multiple remote Iceberg tables
from Snowflake without creating individual [externally managed tables](tables-iceberg.md).

A catalog-linked database is a Snowflake database connected to an external Iceberg REST catalog.
Snowflake automatically syncs with the external catalog to detect namespaces and Iceberg tables,
and registers the remote tables to the catalog-linked database. Catalog-linked databases also support creating and dropping schemas or Iceberg tables.

## Billing for catalog-linked databases

Snowflake bills your account for the following usage:

* Automatic table discovery, create schema, drop schema, and drop table. Snowflake will bill your account for this usage under the
  CREDITS_USED_CLOUD_SERVICES usage type. Usage for
  cloud services is charged only if the daily consumption of cloud services exceeds 10% of the daily usage of virtual warehouses. For more
  information, see [Understanding billing for cloud services usage](cost-understanding-compute.md).
* Create table. Snowflake will bill your account for this usage under the CREDITS_USED_COMPUTE usage type through auto refresh.
  The cost for this usage is described in Table 5 of the [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) on the Snowflake website.
  Refer to the Snowflake-managed compute column for the Automated Refresh and Data Registration row.

Snowflake won’t bill you for any cloud services that you use during table creation.

> **Note:**
>
> To view the credit usage for your catalog-linked databases, use the [CATALOG_LINKED_DATABASE_USAGE_HISTORY view](../sql-reference/account-usage/catalog_linked_database_usage_history.md).

## Workflow to configure access to your external catalog and table storage

The following steps cover how to create a catalog-linked database, check the sync status between
Snowflake and your catalog, and create or query a table in the database.

1. Configure access to your external catalog and table storage
2. Create a catalog-linked database
3. Check the catalog sync status
4. Query a table in your catalog-linked database or Write to your remote catalog

> **Note:**
>
> * If your external data is in Unity Catalog, see [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md) to get started with catalog-linked databases.
> * If your external data is in AWS Glue, see [Build Data Lakes using Apache Iceberg with Snowflake and AWS Glue](https://www.snowflake.com/en/developers/guides/data-lake-using-apache-iceberg-with-snowflake-and-aws-glue/)

## Configure access to your external catalog and table storage

Before you create a catalog-linked database, you need to configure access to
your external catalog and table storage. To configure this access, you configure a catalog integration with vended credentials. With this
option, your remote Iceberg catalog must support credential vending.

For instructions, see [Use catalog-vended credentials for Apache Iceberg™ tables](tables-iceberg-configure-catalog-integration-vended-credentials.md).

> **Note:**
>
> If your remote Iceberg catalog doesn’t support credential vending, you must configure an [external volume](tables-iceberg.md) and a
> [catalog integration](tables-iceberg.md) to configure access to your external catalog and table storage.
> First,
> [configure an external volume for your cloud storage provider](tables-iceberg-configure-external-volume.md). Then,
> [configure a Apache Iceberg™ REST catalog integration for your remote Iceberg catalog](tables-iceberg-configure-catalog-integration-rest.md).

## Create a catalog-linked database

Create a catalog-linked database with the [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md) command:

The following example creates a catalog-linked database that uses vended credentials. The sync interval is 30 seconds, which is the default.
The sync interval tells Snowflake how often to poll your remote catalog.

```sqlexample
CREATE DATABASE my_linked_db
  LINKED_CATALOG = (
    CATALOG = 'my_catalog_int'
  );
```

> **Note:**
>
> To create a catalog-linked database that uses an external volume, see [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md), including
> the [example](../sql-reference/sql/create-database-catalog-linked.md).

Your catalog-linked database includes a link icon.

## Check the configuration of a catalog-linked database

After you create a catalog-linked database, use the [SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG](../sql-reference/functions/system_get_catalog_linked_database_config.md) function to
check the configuration for the database.

```sqlexample
SELECT SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG('my_linked_db');
```

## Check the catalog sync status

To check whether Snowflake has successfully linked your remote catalog to your database, use the [SYSTEM$CATALOG_LINK_STATUS](../sql-reference/functions/system_catalog_link_status.md)
function.

The function also provides information to help you identify tables in the remote catalog that fail to sync.

```sqlexample
SELECT SYSTEM$CATALOG_LINK_STATUS('my_linked_db');
```

### Identify tables that were created but couldn’t be initialized

To identify tables in the remote catalog that synced successfully but fail to refresh automatically, run the [SHOW ICEBERG TABLES](../sql-reference/sql/show-iceberg-tables.md)
command, and then refer to the `auto_refresh_status` column in the output. These tables
have an `executionState` of `ICEBERG_TABLE_NOT_INITIALIZED` in the output.

For example, Snowflake might successfully discover and create a table in your remote catalog to your catalog-linked database, but this
table has a corrupted data file in your remote catalog. As a result, Snowflake can’t automatically refresh the table until you resolve
the error.

Automated refresh is turned off for these kinds of tables, so querying the table in Snowflake returns an error that says the
table was never initialized. To query the table, you must fix the error, and then [turn on automated refresh for the table](tables-iceberg-auto-refresh.md).

## Query a table in your catalog-linked database

After you create a catalog-linked database, Snowflake starts the table discovery process and
automatically polls your linked catalog using the value of the SYNC_INTERVAL_SECONDS parameter (with a default interval of 30 seconds) to check for changes.

In the database, allowed namespaces from the remote catalog appear as schemas, and Iceberg tables appear under their respective schemas.

You can query the remote tables by using a SELECT statement.

> **Note:**
>
> For the requirements for identifying objects in a catalog-linked database, see Requirements for identifier resolution in a catalog-linked database.
>
> For more information about object identifiers, see [Identifier requirements](../sql-reference/identifiers-syntax.md).

For example:

```sqlexample
USE DATABASE my_linked_db;

SELECT * FROM my_namespace.my_iceberg_table
  LIMIT 20;
```

## Write to your remote catalog

You can use Snowflake to create namespaces and Iceberg tables in your linked catalog. For more information, see the
following topics:

* [Write support for externally managed Apache Iceberg™ tables](tables-iceberg-externally-managed-writes.md)
* [Use CREATE SCHEMA to create namespaces in your external catalog](tables-iceberg-externally-managed-writes.md)
* [Create an Iceberg table in a catalog-linked database](tables-iceberg-externally-managed-writes.md)

## Requirements for identifier resolution in a catalog-linked database

The requirement for resolving an identifier depends on the following:

* The value that you specified for the CATALOG_CASE_SENSITIVITY parameter when you
  [created your catalog-linked database](../sql-reference/sql/create-database-catalog-linked.md)
* Whether your external Iceberg catalog uses case-sensitive or case-insensitive identifiers.

> **Note:**
>
> * These requirements apply to identifying existing schemas, tables, and table columns. They also include some special cases for
>   creating or altering an object.
> * When you create a new
>   schema, table, or column in a case-sensitive catalog such as AWS Glue or Unity Catalog, you must use lowercase letters and surround
>   the schema, table, and column names in double quotes. This is also required for other Iceberg REST catalogs that only support
>   lowercase identifiers.

The following table shows the requirement for each scenario:

| CATALOG_CASE_SENSITIVITY value | External Iceberg catalog uses | Requirement |
| --- | --- | --- |
| CASE_SENSITIVE | Case sensitive identifiers | Snowflake matches identifiers exactly as they appear, including case. Snowflake automatically converts unquoted identifiers to uppercase, but quoted identifiers must match exactly the case in your external catalog.  The following example shows a valid query for creating a table:  ```sqlexample CREATE TABLE "Table1" (id INT, name STRING); ```  Snowflake creates the table in the external catalog as `Table1`, which preserves the capitalization you used. Note that you can also create a lowercase `table1` table, if needed.  The following example shows a valid query for selecting the `Table1` table:  ```sqlexample SELECT * FROM "Table1"; ```  In the previous example, the double quotes are required for matching the capitalization exactly.  The following example shows an invalid query, unless a `TABLE1` table exists:  ```sqlexample SELECT * FROM table1; ```  In the previous example, the query is invalid if `TABLE1` doesn’t exist because the identifier isn’t surrounded with double quotes. As a result, Snowflake converts the identifier to uppercase.  The following example shows an invalid query for the case when an all uppercase `TABLE1` doesn’t exist:  ```sqlexample SELECT * FROM TABLE1; ``` |
| CASE_SENSITIVE | Case insensitive identifiers | If the external Iceberg catalog is actually case insensitive, and normalizes to lowercase, you must surround identifiers in double quotes.  The following example shows valid queries:  ```sqlexample SELECT * from "s1"; SELECT * from "lowercasetablename"; ``` |
| CASE_INSENSITIVE | Case insensitive identifiers | * If your case insensitive catalog has a lowercase `table1` table, all of the following queries are valid:  ```sqlexample   SELECT * from table1;   SELECT * from TABLE1;   SELECT * from Table1;   SELECT * from "table1";   ``` * For any of the following commands, you must surround the schema, table, and column names in double quotes:    + CREATE ICEBERG TABLE   + CREATE SCHEMA   + ALTER ICEBERG TABLE ADD COLUMN   + ALTER ICEBERG TABLE RENAME COLUMN |
| CASE_INSENSITIVE | Case sensitive identifiers | If the external Iceberg catalog is actually case sensitive, Snowflake treats unquoted identifiers as case-insensitive and automatically converts unquoted identifiers to uppercase. When you create or query objects, Snowflake matches identifiers regardless of case, as long as they are unquoted.  Using this pattern is discouraged because Snowflake can’t resolve two different identifiers that differ in casing. This pattern only works when no two identifiers are different in casing only.  Consider the case where the remote catalog has a `Table1` table. All of the following queries are valid for querying that table.  ```sqlexample SELECT * from table1; SELECT * from TABLE1; SELECT * from Table1; SELECT * from "Table1"; ```  Quoted identifiers preserve case and match exactly. However, in CASE_INSENSITIVE mode, unquoted and quoted forms are both supported. |

## Considerations for using a catalog-linked database for Iceberg tables

Consider the following items when you use a catalog-linked database:

* Supported only when you use a catalog integration for Iceberg REST (for example, Snowflake Open Catalog).
* To limit automatic table discovery to a specific set of namespaces, use the ALLOWED_NAMESPACES parameter. You can also use the
  BLOCKED_NAMESPACES parameter to block a set of namespaces.
* Snowflake doesn’t sync remote catalog access control for users or roles.
* You can create schemas, externally managed Iceberg tables, or database roles in a catalog-linked database. Creating other Snowflake objects
  isn’t currently supported.
* When you create a catalog-linked database, you can’t specify the default Iceberg version or merge-on-read behavior to use for
  Iceberg tables.

  However, you can modify these properties for an existing database by using the [ALTER DATABASE (catalog-linked)](../sql-reference/sql/alter-database-catalog-linked.md)
  command to set the following parameters:

  + ICEBERG_VERSION_DEFAULT
  + ENABLE_ICEBERG_MERGE_ON_READ
* For Iceberg tables in a catalog-linked database:

  + Snowflake doesn’t copy remote catalog table properties, such as retention policies or buffers, and doesn’t currently support altering table properties.
  + [Automated refresh](tables-iceberg-auto-refresh.md) is enabled by default. If the `table-uuid` of an external table
    and the catalog-linked database table don’t match, refresh fails and Snowflake drops the table from the catalog-linked database; Snowflake doesn’t change the remote table.
  + If you drop a table from the remote catalog, Snowflake drops the table from the catalog-linked database.
    This action is asynchronous, so you might not see the change in the remote catalog right away.
  + If you rename a table in the remote catalog, Snowflake drops the existing table from the catalog-linked database and creates a table with the new name.
  + Masking policies and tags are supported. Other Snowflake-specific features, including replication and cloning, aren’t supported.
  + The character that you choose for the NAMESPACE_FLATTEN_DELIMITER parameter can’t appear in your remote namespaces. During the auto discovery process,
    Snowflake skips any namespace that contains the delimiter, and doesn’t create a corresponding schema in your catalog-linked database.
  + If you specify anything other than `_`, `$`, or numbers for the NAMESPACE_FLATTEN_DELIMITER parameter,
    you must put the schema name in quotes when you query the table.
  + For databases linked to AWS Glue, you must use lowercase letters and surround the schema, table, and column names in double quotes.
    This is also required for other Iceberg REST catalogs that only support lowercase identifiers.

    The following example shows a valid query:

    ```sqlexample
    CREATE SCHEMA "s1";
    ```

    The following statements aren’t valid, because they use uppercase letters or omit the double quotes:

    ```sqlexample
    CREATE SCHEMA s1;
    CREATE SCHEMA "Schema1";
    ```
  + Using UNDROP ICEBERG TABLE isn’t supported.
  + Sharing:

    - Sharing with a listing isn’t currently supported
    - Direct sharing is supported
* For writing to tables in a catalog-linked database:

  + Creating tables in nested namespaces isn’t currently supported.
  + Writing to tables in nested namespaces isn’t currently supported.
  + Position [row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) are supported for tables stored
    on Amazon S3, Azure, or Google Cloud. Row-level deletes with equality delete files aren’t supported. For more information about row-level deletes,
    see [Use row-level deletes](tables-iceberg-manage.md). To turn off position deletes, which enable
    running the Data Manipulation Language (DML) operations in copy-on-write mode, set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or
    database level.

---
title: Use an external query engine with Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-use-external-query-engine.md
section: User Guide
---

# Use an external query engine with Apache Iceberg™ tables

Snowflake offers the following two options for using an external query engine to query Apache Iceberg™ tables:

* External query engines through Snowflake Horizon Catalog
* Microsoft Fabric

## External query engines through Snowflake Horizon Catalog

You can access Snowflake-managed Apache Iceberg™ tables by using any
external query engine that supports the open Iceberg REST protocol, such as Apache Spark™. To ensure this interoperability with
external engines, [Apache Polaris™ (incubating)](https://github.com/apache/polaris) is integrated into Horizon Catalog. You can access
these tables in a Snowflake account by using a single Horizon Catalog endpoint and you can use your existing users, roles, policies,
and authentication in Snowflake.

For more information, see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

## Microsoft Fabric

To view Snowflake-managed Iceberg tables in [Microsoft Fabric](https://learn.microsoft.com/fabric/), you can connect a standard Snowflake
database to Fabric, which syncs the database with Fabric. You can then view any Snowflake-managed Iceberg tables in the database in Fabric.

For more information, see [Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric](tables-iceberg-query-using-microsoft-fabric.md).

---
title: Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-open-catalog.md
section: User Guide
---

# Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake

Use Apache Iceberg™ tables in Snowflake to work with Snowflake Open Catalog.

## What is Snowflake Open Catalog?

Open Catalog is a catalog implementation for Iceberg built on the open source Apache Iceberg REST protocol. To learn more,
see the [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) documentation.

Snowflake supports the following options for working with Open Catalog:

* [Query a table registered in Snowflake Open Catalog using Snowflake](tables-iceberg-open-catalog-query.md).
* [Write to a table registered in Open Catalog using Snowflake](tables-iceberg-externally-managed-writes.md).
* [Sync a Snowflake-managed Iceberg table with Open Catalog](tables-iceberg-open-catalog-sync.md).

## Considerations

When using Snowflake with Open Catalog, be aware of the following considerations:

**Storage**

* Just like [Snowflake-managed Iceberg tables](tables-iceberg.md), you store Iceberg tables managed by Open Catalog
  in external cloud storage.
* Iceberg tables in Snowflake use an [external volume](tables-iceberg.md) to provide access to your cloud storage,
  while tables managed by Open Catalog use a [storage configuration](https://other-docs.snowflake.com/en/opencatalog/overview#storage-configuration).

**Configuration for syncing Snowflake-managed Iceberg tables**

* To sync a Snowflake-managed table with Open Catalog, you must first create an external volume in Snowflake and then create an external
  catalog in Open Catalog that points to the same location as the external volume. For more information, see
  [Sync a Snowflake-managed table with Snowflake Open Catalog](tables-iceberg-open-catalog-sync.md).

**Table access**

* Snowflake-managed Iceberg tables that you sync with Open Catalog are read-only in Open Catalog.
* Snowflake can query but can’t write to tables managed by Open Catalog.

## Terminology differences

This section summarizes the key differences in terminology between Snowflake and Open Catalog.

| Snowflake term | Open Catalog term |
| --- | --- |
| [Database](../guides-overview-db.md) | Open Catalog uses *catalogs*, which are like databases in Snowflake. In Open Catalog, you create one or more catalog resources to organize Iceberg tables under namespaces. For more information, see [Catalog](https://other-docs.snowflake.com/en/opencatalog/overview#catalog) in the Open Catalog documentation.  When you sync a Snowflake-managed table with Open Catalog, Snowflake syncs the table with the catalog associated with the table’s catalog integration using two parent namespaces. The namespaces correspond to the table’s database and schema in Snowflake. For example, if you have a `db1.public.table1` Iceberg table registered in Snowflake and you specify `catalog1` in the catalog integration, it gets synced to Open Catalog with the following fully qualified name: `catalog1.db1.public.table1`. |
| [Schema](../sql-reference/ddl-database.md) | In Open Catalog, the concepts of schema and namespace are synonymous and can be used interchangeably.  Namespace is displayed in the Open Catalog user interface. Open Catalog uses namespaces to hold a collection of objects and the term _namespace_ is primarily used in the Open Catalog documentation. For more information about namespaces, see Namespace.  However, if you’re using a third-party query engine, such as Apache Spark, and you run the CREATE SCHEMA or CREATE DATABASE command, you create a namespace in Open Catalog. You can also run the CREATE NAMESPACE command to create a namespace. |
| [Namespace](../sql-reference/ddl-database.md) | Like Snowflake, Open Catalog also uses namespaces but with key differences compared to how Snowflake uses namespaces.  A catalog in Open Catalog comprises top-level namespaces, which you define, along with any number of nested namespaces beneath them, which you also define.  Nested namespaces allow you to register tables with the same name within the same catalog. For example, a catalog named `customers` can contain the following `customerdata` tables, which are grouped under a top-level namespace `<region>` and a nested namespace `<state>`:   * `customers.northeast.maine.customerdata` * `customers.northeast.vermont.customerdata`   Also, in Open Catalog, you can group tables under any namespace in the namespace hierarchy, including top-level namespaces.  For more information about namespaces, including a conceptual diagram of a sample Open Catalog structure, see [key concepts of Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview#key-concepts). |
| [Role](security-access-control-overview.md) | In Open Catalog, *principal roles* are like roles in Snowflake but with key differences. You don’t grant privileges to a principal role. Instead, you grant privileges to a catalog role, which you then grant to a principal role, and then you grant the principal role to a service principal, thus bestowing the privileges on the service principal. Also, you can’t assign principal roles to other principal roles. You can only grant one principal role to a service principal.  You can use a principal role to logically group service principals together. The scope of a principal role is across all catalogs. Also, there aren’t different types of principal roles. For more information, see [Principal role](https://other-docs.snowflake.com/en/opencatalog/access-control#principal-role) in the Open Catalog documentation. |
| [Database role](security-access-control-overview.md) | Open Catalog uses *catalog roles*, which are like database roles in Snowflake. Catalog roles specify a set of permissions for actions on a catalog or objects in the catalog. The scope of a catalog role is the catalog where it is created.  In Open Catalog, you grant privileges to catalog roles. Next, you grant catalog roles to principal roles, and then you grant principal roles to service principals, which grants access to resources. You can grant multiple catalog roles to a principal role but only one principal role to a service principal. For more information, see [Catalog role](https://other-docs.snowflake.com/en/opencatalog/access-control#catalog-role) in the Open Catalog documentation. |
| [User](security-access-control-overview.md) | In the context of access control, there is no concept of a user in Open Catalog.  In Open Catalog, privileges are bestowed on *service principals*, not users. Query engines use service principals to connect to catalogs. For more information, see [Service principal](https://other-docs.snowflake.com/en/opencatalog/overview#service-principal) in the Open Catalog documentation. |

## Legal Notices

Apache®, Apache Iceberg™, Apache Spark™, Apache Flink®, and Flink® are either registered trademarks or trademarks of the Apache Software
Foundation in the United States and/or other countries.

---
title: Use catalog-vended credentials for Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-catalog-integration-vended-credentials.md
section: User Guide
---

# Use catalog-vended credentials for Apache Iceberg™ tables

Vended credential support for Iceberg tables lets you give Snowflake access to your table data and
metadata in cloud storage without using an [external volume](tables-iceberg.md).

Instead, you configure and delegate access control with your third-party Iceberg REST catalog (such as [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview)), then create a
catalog integration in Snowflake configured for vended credentials. For any Iceberg table associated
with the catalog integration, Snowflake uses credentials vended by your catalog provider to securely connect to your external cloud storage.

> **Note:**
>
> Using catalog-vended credentials is supported for [externally managed Iceberg tables](tables-iceberg.md)
> that use a [REST catalog integration](../sql-reference/sql/create-catalog-integration-rest.md).
> To use this feature, your external catalog must also support credential vending.

## Considerations

Consider the following when you use catalog-vended credentials for Iceberg tables:

* This feature is supported for tables that store their data and metadata in Amazon S3, Azure Storage, or Google Cloud Storage.
* Table files must be stored in a single bucket; they can’t be spread across multiple buckets.

  However, you can spread your tables across multiple buckets if each table is stored in one bucket.
* The service principal configured with your REST catalog must have permission to read from *all* of the locations that contain your
  table files in your bucket. If you use AWS Lake Formation with AWS Glue, you might need to take extra steps to enable this access. For more information,
  see [(Optional) Configure Lake Formation access control](tables-iceberg-configure-catalog-integration-rest-glue.md).
* Snowflake expects your catalog to provide one of the following tokens, based on your cloud storage provider:

  + AWS: An expiration time for the AWS session token. Snowflake searches for a key-value pair where the key is
    `s3.session-token-expires-at-ms`, and the value is a timestamp that specifies the expiration time in milliseconds.
  + Azure: An expiration time for the SAS token. Snowflake searches for a key-value pair where the key is
    `adls.sas-token-expires-at-ms`, and the value is a timestamp that specifies the expiration time in milliseconds.
  + Google Cloud Storage: An expiration time for the OAuth 2.0 access token. Snowflake searches for a key-value pair where the key is
    `gcs.oauth2.token-expires-at`, and the value is a timestamp that specifies the expiration time in milliseconds.

  If your catalog doesn’t provide a token, Snowflake expects your catalog to provide an expiration time for vended credentials, and searches for a key-value pair
  where the key is `expiration-time`,
  and the value is a timestamp that specifies the expiration time in milliseconds; for example, `1730234407000`.

  If your catalog doesn’t provide an expiration time, Snowflake assumes that the credentials expire 60 minutes after
  receipt.
* Table creation fails if your catalog provides credentials that aren’t valid.
* The CREATE ICEBERG TABLE … AS SELECT command isn’t supported.
* Private connectivity isn’t supported; to use private connectivity, you must [configure an external volume](tables-iceberg-configure-external-volume.md).

## Create a catalog integration for vended credentials

To create a catalog integration for vended credentials, use the [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md)
command with the `ACCESS_DELEGATION_MODE` property set to `VENDED_CREDENTIALS`.

Where:

`ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS }`
:   Specifies the access delegation mode to use for accessing Iceberg table files in your external cloud storage.

    * `VENDED_CREDENTIALS` specifies that Snowflake should use vended credentials.
    * `EXTERNAL_VOLUME_CREDENTIALS` specifies that Snowflake should use an external volume.

    Default: `EXTERNAL_VOLUME_CREDENTIALS`

You can specify the `ACCESS_DELEGATION_MODE` property in the list of `REST_CONFIG` properties in any
[CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql-reference/sql/create-catalog-integration-rest.md) statement.

> **Important:**
>
> If you use AWS Lake Formation for access control, you must ensure that Snowflake can access your
> AWS Glue catalog or Amazon S3 table. For more information, see
> [(Optional) Configure Lake Formation access control](tables-iceberg-configure-catalog-integration-rest-glue.md).

### Example: AWS Glue

The following example creates a catalog integration for AWS Glue that uses vended credentials. For more information,
see [Configure a catalog integration for AWS Glue Iceberg REST](tables-iceberg-configure-catalog-integration-rest-glue.md).

```sqlexample
CREATE CATALOG INTEGRATION glue_rest_catalog_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'rest_catalog_integration'
  REST_CONFIG = (
    CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'
    CATALOG_API_TYPE = AWS_GLUE
    CATALOG_NAME = '123456789012'
    ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789012:role/my-role'
    SIGV4_SIGNING_REGION = 'us-west-2'
  )
  ENABLED = TRUE;
```

### Example: Amazon S3 Tables

This example creates a catalog integration for
[Amazon S3 tables](https://docs.aws.amazon.com/AmazonS3/latest/userguide/s3-tables-tables.html) with SigV4 credential vending
enabled using Lake Formation.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION my_s3_tables_catalog_integration
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'my_namespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'
    CATALOG_API_TYPE = AWS_GLUE
    CATALOG_NAME = '123456789012:S3tablescatalog/my_table_bucket'
    ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789012:role/my_api_permissions_role'
  )
  ENABLED = TRUE;
```

Where:

> `CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'`
> :   Specifies the [AWS Glue Iceberg REST endpoint](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html).
>
> `CATALOG_NAME = 'aws_account_id:s3tablescatalog/s3_table_bucket`
> :   Specifies an S3 table bucket in your AWS account.

## Create an Iceberg table that uses vended credentials

After you set up access control with your third-party Iceberg REST catalog and create a catalog integration for vended credentials,
you can create an Iceberg table.

When you create an Iceberg table that uses vended credentials, you specify a catalog integration configured with
`ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS` and exclude the `EXTERNAL_VOLUME` parameter from the
[CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) statement.

For example:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table
  CATALOG = open_catalog_int_vended_credentials
  CATALOG_TABLE_NAME = 'my_table'
  AUTO_REFRESH = TRUE;
```

> **Note:**
>
> If you’ve set a default external volume at the account, database, or schema level, Snowflake ignores the default external volume during
> table creation as long as you specify a catalog integration configured to use vended credentials.

---
title: Use data profiling to understand your data
source: https://docs.snowflake.com/en/user-guide/data-quality-profile.md
section: User Guide
---

# Use data profiling to understand your data

Data profiling helps you understand the structure, content, and quality of your data sets by automatically gathering statistics such as data
types, value distributions, counts of NULL values, and uniqueness. The data profile reveals patterns, anomalies, and potential quality
issues, which lets you assess data reliability and make informed decisions about how to clean, transform, or effectively use your data. Data
profiling simplifies the path to continuous data quality monitoring by providing insights without manual setup.

The data profile includes the following statistics:

* Number of rows in the table.
* Last time the table was updated.
* How many NULL values are in a column.
* Minimum and maximum values in a column.
* Most common values in a column.

## Get started

To view the data profile of a table or view, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the table or view.
3. Select the Data Quality tab.
4. Select Data Profile.

## Warehouse considerations

Data profiling runs background SQL queries to display information about a table or view. Snowflake recommends using an X-Small warehouse to
run these queries; however, heavier workloads might see a performance improvement by using a larger warehouse. In general, larger warehouses
consume more credits.

By default, data profiling uses the warehouse that is set as the default for the current user. To select a different warehouse, use the
drop-down list at the top of the page.

---
title: Use immutability constraints
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance-optimize-immutability.md
section: User Guide
---

# Use immutability constraints

To tell Snowflake that certain rows won’t change in a dynamic table,
use the `IMMUTABLE WHERE` clause in a [CREATE DYNAMIC TABLE](../sql-reference/sql/create-dynamic-table.md) or
[ALTER DYNAMIC TABLE](../sql-reference/sql/alter-dynamic-table.md) statement.

Immutability makes refreshes faster by skipping rows that don’t change.
*Backfill* with immutability provides both immediate and ongoing performance benefits:

* **Initial creation**: Backfill copies historical data instantly without computation costs. This
  makes tables with years of historical data immediately available instead of requiring expensive
  initial refreshes.
* **Ongoing refreshes**: Immutability constraints protect backfilled data from reprocessing during
  future refreshes. Only the mutable region gets refreshed, keeping refresh times fast even as the
  table grows.

For conceptual background, see
[Understanding immutability constraints](dynamic-tables-immutability-constraints.md).

## Basic examples

### Example: Prevent recomputation when a dimension table changes

When you update a row in a dimension table, reprocess only the facts from the mutable period:

```sqlexample
CREATE DYNAMIC TABLE joined_data
  TARGET_LAG = '1 minute'
  WAREHOUSE = mywh
  IMMUTABLE WHERE (timestamp_col < CURRENT_TIMESTAMP() - INTERVAL '1 day')
AS
  SELECT F.primary_key primary_key, F.timestamp_col timestamp_col, D.value dim_value
  FROM fact_table F
  LEFT OUTER JOIN dimension_table D USING (primary_key);
```

### Example: Retain data longer than the source table

Create a dynamic table that retains parsed data longer than the staging table,
and delete old staging data with a task:

```sqlexample
CREATE TABLE staging_data (raw TEXT, ts TIMESTAMP);

CREATE DYNAMIC TABLE parsed_data
  TARGET_LAG = '1 minute'
  WAREHOUSE = mywh
  IMMUTABLE WHERE (ts < CURRENT_TIMESTAMP() - INTERVAL '7 days')
AS
  SELECT
    parse_json(raw):event_id::string event_id,
    parse_json(raw):name::string name,
    parse_json(raw):region::string region,
    ts
  FROM staging_data
  WHERE region = 'US';

CREATE TASK delete_old_staging_data
  WAREHOUSE = mywh
  SCHEDULE = '24 hours'
AS
  DELETE FROM staging_data WHERE ts < CURRENT_TIMESTAMP() - INTERVAL '30 days';
```

### Example: Let downstream tables use incremental refresh from a full refresh table

Some query constructs (like Python user-defined table functions) require full refresh mode.
Immutability constraints let downstream tables still use incremental refresh:

```sqlexample
CREATE DYNAMIC TABLE udtf_dt
  TARGET_LAG = '1 hour'
  WAREHOUSE = mywh
  REFRESH_MODE = FULL
  IMMUTABLE WHERE (ts < current_timestamp() - interval '1 day')
AS
  SELECT ts, data, output, join_key
  FROM input_table, TABLE(my_udtf(data));

CREATE DYNAMIC TABLE incremental_join_dt
  TARGET_LAG = '1 hour'
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
  IMMUTABLE WHERE (ts < current_timestamp() - interval '1 day')
AS
  SELECT * FROM udtf_dt JOIN dim_table USING (join_key);
```

## Backfill examples

The following examples show how to create new dynamic tables from tables with backfilled data.

The backfill table must contain matching columns with compatible data types in the same order as your dynamic table.
Snowflake doesn’t copy table properties or privileges from the backfill table.

If you specify the Time Travel parameters `AT | BEFORE`, Snowflake copies data from the backfill table at the specified time.

The following limitations apply when you work with [immutability constraints](dynamic-tables-immutability-constraints.md)
and backfilled data:

* Currently, only regular and dynamic tables can be used for backfilling.
* You can’t specify policies or tags in the new dynamic table because they are copied from the backfill table.
* Clustering keys in the new dynamic table and backfill table must be the same.

### Example: Backfill from a part of the table

The following example backfills the immutable region of `my_dynamic_table` from `my_backfill_table` and the mutable region from the dynamic
table’s definition.

When you reinitialize this dynamic table:

* **Incremental refresh mode**: Snowflake deletes all mutable rows and repopulates only the mutable region.
* **Full refresh mode**: Snowflake performs a full refresh with the same effect.

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table (day TIMESTAMP, totalSales NUMBER)
  IMMUTABLE WHERE (day < '2025-01-01')
  BACKFILL FROM my_backfill_table
  TARGET_LAG = '20 minutes'
  WAREHOUSE = 'mywh'
  AS SELECT DATE_TRUNC('day', ts) AS day, sum(price)
    FROM my_base_table
    GROUP BY day;
```

### Example: Use backfill to recover or modify data in a dynamic table

You can’t directly edit a dynamic table’s data or definition. To recover or fix data, complete the following workaround steps:

1. Clone the dynamic table to a regular table.
2. Modify the cloned table as needed.
3. Backfill from the edited table into a new dynamic table.

In the following example, `my_dynamic_table` aggregates daily sales data from the `sales` base table:

```sqlexample
CREATE OR REPLACE TABLE sales(item_id INT, ts TIMESTAMP, sales_price FLOAT);

INSERT INTO sales VALUES (1, '2025-05-01 01:00:00', 10.0), (1, '2025-05-01 02:00:00', 15.0), (1, '2025-05-01 03:00:00', 11.0);
INSERT INTO sales VALUES (1, '2025-05-02 00:00:00', 11.0), (1, '2025-05-02 05:00:00', 13.0);

CREATE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = 'DOWNSTREAM'
  WAREHOUSE = mywh
  INITIALIZE = on_create
  IMMUTABLE WHERE (day <= '2025-05-01')
  AS
    SELECT item_id, date_trunc('DAY', ts) day, count(sales_price) AS sales_count FROM sales
    GROUP BY item_id, day;

SELECT item_id, to_char(day, 'YYYY-MM-DD') AS day, sales_count FROM my_dynamic_table;
```

```output
+---------+------------+-------------+
| ITEM_ID | DAY        | SALES_COUNT |
|---------+------------+-------------|
| 1       | 2025-05-01 | 3           |
| 1       | 2025-05-02 | 2           |
+---------+------------+-------------+
```

Optionally, you can archive the old data to save storage cost:

```sqlexample
DELETE FROM sales WHERE ts < '2025-05-02';

ALTER DYNAMIC TABLE my_dynamic_table REFRESH;

SELECT item_id, to_char(day, 'YYYY-MM-DD') AS day, sales_count FROM my_dynamic_table;
```

Later, you find a sales error on `2025-05-01`, where `sales_count` should be 2. To correct this:

1. Clone `my_dynamic_table` to a regular table:

   ```sqlexample
   CREATE OR REPLACE TABLE my_dt_clone_table CLONE my_dynamic_table;
   ```
2. Update the cloned table:

   ```sqlexample
   UPDATE my_dt_clone_table SET
     sales_count = 2
     WHERE day = '2025-05-01';

   SELECT item_id, to_char(day, 'YYYY-MM-DD') AS day, sales_count FROM my_dt_clone_table;
   ```

   ```output
   +---------+------------+-------------+
   | ITEM_ID | DAY        | SALES_COUNT |
   |---------+------------+-------------|
   | 1       | 2025-05-01 | 2           |
   | 1       | 2025-05-02 | 2           |
   +---------+------------+-------------+
   ```
3. Recreate the dynamic table by using the edited clone as the backfill source.

   ```sqlexample
   CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
     BACKFILL FROM my_dt_clone_table
     IMMUTABLE WHERE (day <= '2025-05-01')
     TARGET_LAG = 'DOWNSTREAM'
     WAREHOUSE = mywh
     INITIALIZE = on_create
     AS
       SELECT item_id, date_trunc('DAY', ts) day, count(sales_price) AS sales_count FROM sales
       GROUP BY item_id, day;
   ```

   This method lets you recover or correct data in a dynamic table without modifying the base table:

   ```sqlexample
   SELECT item_id, to_char(day, 'YYYY-MM-DD') AS day, sales_count FROM my_dynamic_table;
   ```

   ```output
   +---------+------------+-------------+
   | ITEM_ID | DAY        | SALES_COUNT |
   |---------+------------+-------------|
   | 1       | 2025-05-01 | 2           |
   | 1       | 2025-05-02 | 2           |
   +---------+------------+-------------+
   ```

### Example: Modify a dynamic table’s schema by using backfill

You can’t directly alter the schema of a dynamic table. To update the schema — for example, add a column — follow these steps:

1. Clone the dynamic table to a regular table. The following example uses `my_dynamic_table` created from `sales`
   (earlier).

   ```sqlexample
   CREATE OR REPLACE TABLE my_dt_clone_table CLONE my_dynamic_table;
   ```
2. Modify the schema of the cloned table:

   ```sqlexample
   ALTER TABLE my_dt_clone_table ADD COLUMN sales_avg FLOAT;

   SELECT item_id, to_char(day, 'YYYY-MM-DD') as DAY, SALES_COUNT, SALES_AVG FROM my_dt_clone_table;
   ```
3. Optionally, add data to the new column.
4. Recreate the dynamic table by using the edited clone as the backfill source.

   ```sqlexample
   CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
     BACKFILL FROM my_dt_clone_table
     IMMUTABLE WHERE (day <= '2025-05-01')
     TARGET_LAG = 'DOWNSTREAM'
     WAREHOUSE = mywh
     INITIALIZE = on_create
     AS
       SELECT item_id, date_trunc('DAY', ts) day, count(sales_price) AS sales_count, avg(sales_price) as sales_avg FROM sales
       GROUP BY item_id, day;
   ```
5. Verify that the new column appears in the dynamic table:

   ```sqlexample
   SELECT item_id, to_char(day, 'YYYY-MM-DD') as DAY, SALES_COUNT, SALES_AVG, metadata$is_immutable as IMMUTABLE from my_dynamic_table ORDER BY ITEM_ID, DAY;
   ```

   ```output
   +---------+------------+-------------+-----------+-----------+
   | ITEM_ID | DAY        | SALES_COUNT | SALES_AVG | IMMUTABLE |
   |---------+------------+-------------|-----------|-----------|
   | 1       | 2025-05-01 | 3           | NULL      | TRUE      |
   | 1       | 2025-05-02 | 2           | 12        | FALSE     |
   +---------+-------------+------------+-----------+-----------+
   ```

## Check immutability status

To check whether a row is mutable in a dynamic table, query the `METADATA$IS_IMMUTABLE` column:

```sqlexample
SELECT *, METADATA$IS_IMMUTABLE FROM my_dynamic_table;
```

To view the immutability constraint on a dynamic table, run [SHOW DYNAMIC TABLES](../sql-reference/sql/show-dynamic-tables.md) and check the
`immutable_where` column.

---
title: Use offers as a consumer
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/consumers-pricing-plans-offers.md
section: User Guide
---

# Use offers as a consumer

As a consumer, you receive private offers from your listing provider. An offer defines the purchase terms for a listing. Offers are specific to each consumer, and they provide individualized billing, payment terms, payment schedules, and contract start and end dates. All existing billing methods are supported, and you can continue to use the Snowflake Marketplace Capacity Drawdown (MCD) program to pay for Snowflake Marketplace purchases.

When you receive an offer from your listing provider, you can review the terms and then accept the offer, reject the offer, or request changes. You can also perform the following actions to manage your offers:

* Review the pricing plan details.
* Add or update payment information.
* Cancel a subscription.
* View invoices on Stripe.
* Review the subscription renewal date.
* Add a PO number.

## Prerequisites for consumers to accept offers

* The consumer’s organization has accepted the legal terms. See [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).
* The consumer’s account has the privileges necessary to accept or reject offers. See [Set up required privileges](https://other-docs.snowflake.com/collaboration/consumer-becoming#label-consumer-required-privileges).
* The consumer’s organization has set up payment information and can pay for listings. See [Pay for listings](https://other-docs.snowflake.com/collaboration/consumer-listings-paying#label-set-up-billing-consumer).
* The consumer’s organization has accepted the cross-region disclaimer for accounts located in U.S. government regions. See [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer).

## Required privileges

* The ACCOUNTADMIN role is required to access a listing.
* The PURCHASE DATA EXCHANGE LISTING privilege is required to pay for a paid listing. If you don’t have a role with this privilege, contact your account administrator.

## Types of offers

The following table lists the Snowflake offer types.

| Offer type | Description |
| --- | --- |
| Public offer | An offer that is visible to all Snowflake Marketplace consumers. |
| Private offer | An individualized offer for a specific consumer. |

---
title: Use organization user groups with organizational listings
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/organizational/org-listings-org-user-groups.md
section: User Guide
---

# Use organization user groups with organizational listings

Providers can use [organization user groups](../../../organization-users.md) to assign consumers to organizational listings. For example, an organization account can create a marketing organization user group, and then providers can assign specific consumers to organization listings that are specific to the marketing team. With this functionality, the provider no longer needs to assign individual consumers to specific organizational listings. To identify the organization user group to associate for an organizational listing, the provider modifies the following fields in the listing manifest:

```yaml
organization_targets:
  access:
    - account: <account_name>
    - organization_user_group: <organization_user_group_1>
    - organization_user_group: <organization_user_group_2>
    - account: <account_name>
      roles: [<roles_list>]
```

To create an organizational listing programmatically, select the SQL tab in [Create an organizational listing](org-listing-create.md).

For more information about the listing manifest, see [Listing manifest reference](../../../../progaccess/listing-manifest-reference.md).

To allow consumers to access their assigned organizational listings, the consumers must import the organization user group.

> **Note:**
>
> You can make an organizational listing discoverable or accessible to an organization user group. Making an organizational listing discoverable or accessible to all accounts makes it discoverable or accessible to the organization user group as well. For this reason, you don’t need to add the organization user group to the listing manifest when making a listing discoverable or accessible to all accounts.

---
title: Use pricing plans and offers as a provider
source: https://docs.snowflake.com/en/user-guide/collaboration/listings/pricing-plans-offers/providers-pricing-plans-offers.md
section: User Guide
---

# Use pricing plans and offers as a provider

Pricing plans are similar to stock keeping units (SKUs) and they allow providers to create multiple, individualized pricing, terms, and discounts for a single paid listing. Providers don’t need to create a listing for every new pricing plan they offer consumers. After creating a pricing plan, providers create offers to package and present pricing plan and contract information to consumers.

All existing billing methods are supported, and eligible consumers can continue to use the Snowflake Marketplace Capacity Drawdown (MCD) program to pay for Snowflake Marketplace purchases.

Providers can create both standard and private offers. Standard offers are listed on the Snowflake Marketplace. Private offers are listed in Snowsight on the consumer’s External sharing page. Unlike standard offers, private offers allow providers to offer customized pricing and allow consumers to negotiate terms and conditions that are specific to their business requirements.

Pricing plans and offers can be created and managed in Snowsight or programmatically with SQL.

---
title: Use primary keys to optimize dynamic table pipelines
source: https://docs.snowflake.com/en/user-guide/dynamic-tables-performance-optimize-primary-keys.md
section: User Guide
---

# Use primary keys to optimize dynamic table pipelines

Snowflake can use primary keys to track row-level changes in dynamic tables without relying
on change-tracking columns. This enables incremental refresh for pipelines that run insert overwrite workloads including
full refresh dynamic tables, which normally block downstream incremental processing.

Primary keys are especially effective when an INSERT OVERWRITE is performed on a base table where only a small fraction of
the data is actually changed. In these cases, primary key-based change tracking processes only the
changed rows instead of recomputing the entire table. A primary key provides a stable row identifier that persists
across overwrites.

For conceptual background, see [Understanding primary keys in dynamic tables](dynamic-tables-primary-keys.md).

## Improve performance for INSERT OVERWRITE workloads

When a base table is periodically rewritten through INSERT OVERWRITE, standard change-tracking columns are reset and a
dynamic table consuming the base table will see a set of inserts and deletes for all rows in the base table.

In the following example, an external process rewrites the `dimension_table` periodically, but
most rows remain the same:

```sqlexample
CREATE TABLE dimension_table (
  dim_id INT PRIMARY KEY RELY,
  dim_name VARCHAR,
  category VARCHAR
);

CREATE TABLE fact_table (
  fact_id INT,
  dim_id INT,
  measure FLOAT,
  ts TIMESTAMP
);

CREATE DYNAMIC TABLE enriched_facts
  TARGET_LAG = '30 minutes'
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
AS
  SELECT f.fact_id, f.measure, d.dim_name, d.category, f.ts
  FROM fact_table f
  INNER JOIN dimension_table d ON f.dim_id = d.dim_id;
```

When the dimension table is rewritten through INSERT OVERWRITE, Snowflake uses the primary key
to identify which dimension rows actually changed and refreshes only the affected facts, rather
than recomputing the entire join.

## Enable incremental refresh downstream of a full refresh dynamic table

Normally, a dynamic table with `REFRESH_MODE = INCREMENTAL` can’t read from a dynamic table
with `REFRESH_MODE = FULL`. When the full refresh dynamic table has a system-derived unique key,
you can explicitly set the refresh mode to INCREMENTAL.

### Example: Use a base table primary key

Create a base table with a primary key and set the `RELY` property so Snowflake uses it
for row-level change tracking:

```sqlexample
CREATE TABLE raw_events (
  event_id INT PRIMARY KEY RELY,
  event_type VARCHAR,
  payload VARIANT,
  created_at TIMESTAMP
);
```

Create a full refresh dynamic table that reads from the base table. Because the base table has
a reliable primary key, Snowflake can derive an unique key from the base table and register
it as an unique constraint for the dynamic table:

```sqlexample
CREATE DYNAMIC TABLE transformed_events
  TARGET_LAG = '10 minutes'
  WAREHOUSE = mywh
  REFRESH_MODE = FULL
AS
  SELECT event_id, event_type, payload:user_id::STRING AS user_id, created_at
  FROM raw_events;
```

Create an incremental dynamic table downstream. This works because the upstream table has
a system-derived reliable unique key:

```sqlexample
CREATE DYNAMIC TABLE event_summary
  TARGET_LAG = '10 minutes'
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
AS
  SELECT user_id, COUNT(*) AS event_count, MAX(created_at) AS last_event
  FROM transformed_events
  GROUP BY user_id;
```

### Example: Use a query-derived primary key

When a dynamic table’s query includes a GROUP BY clause, Snowflake automatically derives an
unique key from the grouping columns. Downstream tables can use this derived key for
primary key-based change tracking and enable incremental refreshes.

```sqlexample
CREATE DYNAMIC TABLE daily_sales
  TARGET_LAG = '1 hour'
  WAREHOUSE = mywh
  REFRESH_MODE = FULL
AS
  SELECT DATE_TRUNC('day', sale_ts) AS sale_day, product_id, SUM(amount) AS total_sales
  FROM sales
  GROUP BY sale_day, product_id;
```

The `daily_sales` table has a derived unique key on `(sale_day, product_id)` because the
GROUP BY guarantees one row per combination. A downstream table can refresh incrementally:

```sqlexample
CREATE DYNAMIC TABLE product_trends
  TARGET_LAG = '1 hour'
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
AS
  SELECT product_id, AVG(total_sales) AS avg_daily_sales, COUNT(*) AS days_with_sales
  FROM daily_sales
  GROUP BY product_id;
```

## Check system-derived unique keys on a dynamic table

To see whether a dynamic table has a derived unique key, use the SHOW UNIQUE KEYS command:

```sqlexample
SHOW UNIQUE KEYS IN daily_sales;
```

If the output contains a unique key, the dynamic table supports primary key-based change
tracking. Downstream dynamic tables can use `REFRESH_MODE = INCREMENTAL` to read from it,
even if it uses full refresh mode.

You can also verify support by creating a downstream dynamic table with
`REFRESH_MODE = INCREMENTAL`. If the upstream table doesn’t have a reliable unqiue key,
the creation fails with an error.

---
title: Use row access policies
source: https://docs.snowflake.com/en/user-guide/security-row-using.md
section: User Guide
---

# Use row access policies

This topic provides an introduction to implementing row access policies.

## Implement row access policies

The following sections provide examples on how to implement row access policies:

* Use a typical row access policy with a mapping table lookup.
* Replace existing row access policy subqueries with memoizable functions to increase query performance.
* Reference a mapping table protected by a row access policy in a different row access policy.

## Example: Mapping table lookup

The following steps are a representative guide to configure row access policy privileges and add row access policies to tables and views.

These steps make the following assumptions:

* The management approach is centralized.

  If the row access policy use case includes a hybrid, or decentralized management approach, see [Manage row access policies](security-row-intro.md)
  for a representative distribution of roles and privileges.
* A mapping table is necessary, similar to the [Representative use case: Use a mapping table to filter the query result](security-row-intro.md).

  The following steps use the [CURRENT_ROLE](../sql-reference/functions/current_role.md) context function to determine whether users see rows in a
  query result, while the representative use case focuses on the user’s first name (i.e. [CURRENT_USER](../sql-reference/functions/current_user.md)).

  If role activation and role hierarchy are important, Snowflake recommends that the policy conditions use the
  [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) function for account roles and the
  [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) function for database roles. For details, see
  [Active role hierarchy & mapping tables](security-row-intro.md).

  The overall process to implement a row access policy with mapping tables remains the same even though the context functions are different.
* The SECURITYADMIN system role grants privileges to custom roles to manage and implement row access policies.

  If you do not want to use higher privileged roles (i.e. SECURITYADMIN or ACCOUNTADMIN) in a production environment in favor of less
  privileged custom roles (e.g. `database_admin`, `finance_admin`), verify that the lower-privileged roles have the necessary
  privileges to manage and implement row access policies.

  For more information, see [Row access policy privileges](security-row-intro.md) and [Summary of DDL commands, operations, and privileges](security-row-intro.md).
* There are separate steps to create a table to be protected by a row access policy (step 1) and adding the row access policy to the
  table (step 5). It is possible to add row access policy to the table when the table is created, assuming that a row access policy already exists. For more information on the syntax, see [CREATE TABLE](../sql-reference/sql/create-table.md).

For example:

1. Create a table for the sales data:

   ```sqlexample
   CREATE TABLE sales (
     customer   varchar,
     product    varchar,
     spend      decimal(20, 2),
     sale_date  date,
     region     varchar
   );
   ```
2. In the `security` schema, create a mapping table as shown in the
   [representative example](security-row-intro.md). This table defines which rows sales managers can see in the
   `sales` table:

   ```sqlexample
   CREATE TABLE security.salesmanagerregions (
     sales_manager varchar,
     region        varchar
   );
   ```
3. Next, a security administrator creates the `mapping_role` custom role and grants the SELECT privilege to the custom role. This grant
   allows users with the custom role to query the mapping table:

   ```sqlexample
   USE ROLE SECURITYADMIN;

   CREATE ROLE mapping_role;

   GRANT SELECT ON TABLE security.salesmanagerregions TO ROLE mapping_role;
   ```
4. Using the schema owner role, create a row access policy with the following two conditions:

   * Users with the `sales_executive_role` custom role can view all rows.
   * Users with the `sales_manager` custom role can view rows based on the `salesmanagerregions` mapping table.

   Note that the schema owner role is automatically granted the CREATE ROW ACCESS POLICY privilege. If other roles should be able to create
   row access policies, the schema owner role can grant the CREATE ROW ACCESS policy privilege to other roles.

   ```sqlexample
   USE ROLE schema_owner_role;

   CREATE OR REPLACE ROW ACCESS POLICY security.sales_policy
   AS (sales_region varchar) RETURNS BOOLEAN ->
     'sales_executive_role' = CURRENT_ROLE()
       OR EXISTS (
         SELECT 1 FROM salesmanagerregions
           WHERE sales_manager = CURRENT_ROLE()
           AND region = sales_region
       )
   ;
   ```

   Where:

   `security.sales_policy`
   :   The name of the row access policy in the `security` schema.

   `AS (sales_region varchar)`
   :   The signature for the row access policy.

       A signature specifies the mapping table attribute and data type. The returned value determines whether the user has access to a given
       row on the table or view to which the row access policy is added.

   `'sales_executive_role' = CURRENT_ROLE()`
   :   The beginning of the `body` in the row access policy.

       The first condition of the row access policy expression that allows users with the `sales_executive_role` custom role to view data.

   `OR EXISTS (select 1 from salesmanagerregions WHERE sales_manager = CURRENT_ROLE() AND region = sales_region)`
   :   The second condition of the row access policy expression which uses a subquery.

       The subquery requires the CURRENT_ROLE to be the `sales_manager` custom role with the executed query on the data to specify a region
       listed in the `{salesmanagerregions}` mapping table.

       > **Tip:**
       >
       > To increase query performance on the policy-protected table, replace the mapping table lookup subquery in the `EXISTS` clause with a [memoizable function](../developer-guide/udf/sql/udf-sql-scalar-functions.md).
       >
       > For details, see the memoizable function example (in this topic).
5. Using the SECURITYADMIN system role, execute the following two statements:

   ```sqlexample
   GRANT OWNERSHIP ON ROW ACCESS POLICY security.sales_policy TO mapping_role;

   GRANT APPLY ON ROW ACCESS POLICY security.sales_policy TO ROLE sales_analyst_role;
   ```

   These two [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) statements have the following effects:

   * Ownership of the policy does not rest with the SECURITYADMIN system role. At query runtime, Snowflake uses the privileges granted to
     the custom role because policies are executed with [owner’s rights](security-row-intro.md), not the more privileged SECURITYADMIN system role. This approach supports the Principle of Least Privilege.
   * The `sales_analyst_role` custom role can add or drop the row access policy from a table as needed.
6. Add (bind) the row access policy to the region column in the `sales` data table:

   ```sqlexample
   USE ROLE SECURITYADMIN;

   ALTER TABLE sales ADD ROW ACCESS POLICY security.sales_policy ON (region);
   ```
7. Grant the SELECT privilege on the protected `sales` data to the `sales_manager_role` custom role:

   ```sqlexample
   GRANT SELECT ON TABLE sales TO ROLE sales_manager_role;
   ```
8. After the sales data populates the `sales` data, test the row access policy:

   ```sqlexample
   USE ROLE sales_manager_role;
   SELECT product, SUM(spend)
   FROM sales
   WHERE YEAR(sale_date) = 2020
   GROUP BY product;
   ```

## Example: Replace Policy subqueries with a memoizable function

The steps in this example create a [memoizable function](../developer-guide/udf/sql/udf-sql-scalar-functions.md) for each mapping table lookup in the row
access policy conditions. The subquery in each `EXISTS` clause specifies the mapping table lookup, where the tables are named
`regions`, `customers`, and `products`, respectively:

> ```sqlexample
> CREATE OR REPLACE ROW ACCESS POLICY rap_NO_memoizable_function
>   AS (region_id number, customer_id number, product_id number)
>   RETURNS BOOLEAN ->
>     EXISTS(SELECT 1 FROM regions WHERE id = region_id) OR
>     EXISTS(SELECT 1 FROM customers WHERE id = customer_id) OR
>     EXISTS(SELECT 1 FROM products WHERE id = product_id)
>   ;
> ```

For the following steps, assume that the `rap_admin` custom role can create row access policies
(i.e. has the CREATE ROW ACCESS POLICY on SCHEMA privilege).

Complete the following steps to replace each of the row access policy mapping table lookups with a memoizable function:

1. Create a custom role named `functions_admin` to manage the memoizable function:

   ```sqlexample
   USE ROLE USERADMIN;

   CREATE ROLE functions_admin;
   ```
2. Grant the following privileges to the `functions_admin` role to allow creating the memoizable function in an existing schema named
   `governance.functions`:

   ```sqlexample
   USE ROLE SECURITYADMIN;

   GRANT USAGE ON DATABASE governance TO ROLE functions_admin;

   GRANT USAGE ON SCHEMA governance.functions TO ROLE functions_admin;

   GRANT CREATE FUNCTION ON SCHEMA governance.functions TO ROLE functions_admin;
   ```
3. Create a memoizable function for each of the `EXISTS` subquery clauses in the row access policy. Each memoizable function definition
   takes the same form. For brevity, only one function example is shown:

   ```sqlexample
   USE ROLE functions_admin;

   USE SCHEMA governance.functions;

   CREATE OR REPLACE function allowed_regions()
     RETURNS array
     memoizable
     AS 'SELECT ARRAY_AGG(id) FROM regions';
   ```
4. Use a [CREATE ROW ACCESS POLICY](../sql-reference/sql/create-row-access-policy.md) statement to define a new row access policy that replaces the subqueries with
   memoizable functions:

   The new row access policy allows for testing queries on a protected table, when the policy uses or does not use the memoizable
   functions, to quantify the performance impact of using memoizable functions in the policy conditions:

   ```sqlexample
   USE ROLE rap_admin;

   CREATE OR REPLACE ROW ACCESS POLICY rap_with_memoizable_function
     AS (region_id number, customer_id number, product_id number)
     RETURNS BOOLEAN ->
       ARRAY_CONTAINS(region_id, allowed_regions()) OR
       ARRAY_CONTAINS(customer_id, allowed_customers()) OR
       ARRAY_CONTAINS(product_id, allowed_products())
     ;
   ```

## Example: Protect the mapping table with a row access policy

This example shows how to reference a mapping table that is protected by a row access policy in a different row access policy. The row
access policy that protects the mapping table calls the [IS_ROLE_IN_SESSION](../sql-reference/functions/is_role_in_session.md) context function to account for
role hierarchy. A different row access policy protects the table that the user queries. This row access policy uses a subquery to perform a
mapping table lookup. For example:

1. Create a mapping table to define allowed roles based on geographic sales regions, and insert data into the table:

   ```sqlexample
   CREATE OR REPLACE TABLE sales.tables.regional_managers (
     allowed_regions varchar
     allowed_roles varchar
   );
   ```

   ```sqlexample
   INSERT INTO sales.tables.regional_managers
   (allowed_regions, allowed_roles)
   VALUES
   ('na', 'NA_MANAGER'),
   ('eu', 'EU_MANAGER'),
   ('apac', 'APAC_MANAGER');
   ```
2. Create a row access policy to specify the ALLOWED_ROLES column in the mapping table:

   ```sqlexample
   CREATE OR REPLACE ROW ACCESS POLICY governance.policies.rap_map_exempt
   AS (allowed_roles varchar) RETURNS BOOLEAN ->
   IS_ROLE_IN_SESSION(allowed_roles);
   ```
3. Add the row access policy on the mapping table using an [ALTER TABLE](../sql-reference/sql/alter-table.md) statement:

   ```sqlexample
   ALTER TABLE sales.tables.regional_managers
     ADD ROW ACCESS POLICY governance.policies.rap_map_exempt
     ON (allowed_roles);
   ```
4. Create a new row access policy that specifies the mapping table lookup on the protected mapping table:

   ```sqlexample
    CREATE OR REPLACE ROW ACCESS POLICY governance.policies.rap_map_lookup
    AS (allowed_regions varchar) RETURNS BOOLEAN ->
    EXISTS (
      SELECT * FROM sales.tables.regional_managers
      WHERE
        REGION = allowed_regions
   );
   ```
5. Add the row access policy named `governance.policies.rap_map_lookup` on the table named `sales.tables.data` using an ALTER
   TABLE statement:

   ```sqlexample
   ALTER TABLE sales.tables.data
     ADD ROW ACCESS POLICY governance.policies.rap_map_lookup
     ON (allowed_regions);
   ```
6. Grant privileges to the roles in the mapping table to allow users with these roles to query the protected data. For example, these
   grants are for the `na_manager` custom role:

   ```sqlexample
   USE ROLE SECURITYADMIN;
   GRANT USAGE ON DATABASE sales TO ROLE na_manager;
   GRANT USAGE ON SCHEMA sales.tables TO ROLE na_manager;
   GRANT SELECT ON TABLE sales.tables.regional_managers TO ROLE na_manager;
   GRANT SELECT ON TABLE sales.tables.data TO ROLE na_manager;
   ```

   As necessary, repeat the commands in this step for each role in the mapping table.

---
title: Use row timestamps to measure latency in your pipelines
source: https://docs.snowflake.com/en/user-guide/data-engineering/row-timestamps.md
section: User Guide
---

# Use row timestamps to measure latency in your pipelines

Row timestamps provide a precise, chronological record of when each row in a table was last updated. Rows modified in the
same transaction share the exact same timestamp and rows modified in different transactions are ordered by when they were
committed.

Key use cases include the following:

* **Pipeline observability:** Measure end-to-end latency and data freshness for streaming ingest, CDC, and ETL workloads
  with higher accuracy than client-side timestamps.
* **Reliable incremental processing:** Capture delayed or backfilled records that event timestamps might skip by using
  definitive commit times.
* **Definitive audit trails:** Establish a chronological order of events for regulatory compliance or SCD2-style
  milestoning.

To set row timestamps on your tables, choose one of the following options:

* **Set row timestamps on a table or dynamic table:** Using a role that has the OWNERSHIP privilege on the table, set the ROW_TIMESTAMP
  property to TRUE when executing the [CREATE TABLE](../../sql-reference/sql/create-table.md), [ALTER TABLE](../../sql-reference/sql/alter-table.md),
  [CREATE DYNAMIC TABLE](../../sql-reference/sql/create-dynamic-table.md), or [ALTER DYNAMIC TABLE](../../sql-reference/sql/alter-dynamic-table.md) command.

  For example, `CREATE TABLE … ROW_TIMESTAMP = TRUE`, `ALTER TABLE … SET ROW_TIMESTAMP = TRUE`, or
  `ALTER DYNAMIC TABLE … SET ROW_TIMESTAMP = TRUE`.
* **Set row timestamps by default for new tables in a container:** Set the ROW_TIMESTAMP_DEFAULT property to TRUE on the container.

  For example, `ALTER SCHEMA … SET ROW_TIMESTAMP_DEFAULT = TRUE` means that every new table created in the schema after setting the
  parameter will have row timestamps on by default.
* **Bulk enable row timestamps on existing tables:** Use the system function SELECT SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES.

  For example, `SELECT SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES('schema', '{my_db}.my_schema')`.

  + The first argument is level: one of `schema`, `database`, or `account`.
  + The second argument is the fully qualified name of the container.

  This function adds the row timestamp column to all existing eligible tables within the container and ensures newly created tables automatically
  have row timestamp enabled.

  To successfully execute the function, you need MODIFY privileges on the container you’re invoking the function on.

After row timestamps are enabled, tables expose the METADATA$ROW_LAST_COMMIT_TIME column, which returns the timestamp when each row was last
modified. This enables change tracking, incremental processing, and time-travel queries based on row modification time.

> **Note:**
>
> In a data sharing scenario, consumers can’t select METADATA$ROW_LAST_COMMIT_TIME even if the producer table has row timestamp enabled. Producers
> must create a view that selects METADATA$ROW_LAST_COMMIT_TIME and then share the view if they want to share row timestamps with consumers.

The following statements demonstrate how to create a table that supports row timestamps. The statements insert data into the table and retrieve
the timestamp of each row.

```sqlexample
CREATE OR REPLACE TABLE table1(value1 STRING)
  ROW_TIMESTAMP = TRUE;

INSERT INTO table1 VALUES('some-value-a');

INSERT INTO table1 VALUES('some-value-b');

SELECT METADATA$ROW_LAST_COMMIT_TIME AS row_timestamp, *
  FROM table1
  ORDER BY 1;
```

## Primary use cases

The METADATA$ROW_LAST_COMMIT_TIME metadata column helps track latency. For example, if you aim for a five-second total latency, this column
helps you determine Snowflake’s contribution to that latency.

Key use cases include:

* **Measuring ingestion latency**: Track the time between when a row is created on the client and when it becomes visible in Snowflake,
  allowing users to calculate data ingestion time.
* **Measuring end-to-end latency**: Combine ingestion latency and pipeline latency to measure the total time from data generation to its
  final state.
* **Measuring pipeline latency**: Tracks timestamps as data moves through a pipeline. By comparing the timestamp of the initial table to the
  final table, users can measure how long the pipeline takes to process data.

  + Supported for pipelines based on streams, dynamic tables, and tasks.

### Example: measure ingestion latency

To measure ingestion latency using the METADATA$ROW_LAST_COMMIT_TIME metadata column, do the following:

1. Create an ingestion pipeline that sends data to Snowflake using one of the following methods:

   * [Snowpipe Streaming Ingest SDK](../snowpipe-streaming/data-load-snowpipe-streaming-overview.md). For a simple example that shows how the client
     SDK could be used to build a Snowpipe Streaming application, see
     [this Java file](https://github.com/snowflakedb/snowflake-ingest-java/blob/master/src/main/java/net/snowflake/ingest/streaming/example/SnowflakeStreamingIngestExample.java) (GitHub).
   * [Snowpipe](../data-load-snowpipe-intro.md)
   * [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command
2. Execute the following:

   ```sqlexample
   ALTER SESSION SET TIMESTAMP_TZ_OUTPUT_FORMAT = 'YYYY-MM-DDTHH:MI:SS.FF3 TZH';

   ALTER SESSION SET TIMEZONE = 'UTC';

   CREATE OR REPLACE DATABASE mydb;

   CREATE OR ALTER SCHEMA myschema;

   CREATE OR REPLACE TABLE table1(record_id STRING, client_timestamp TIMESTAMP_LTZ);

   -- The rows inserted from server-side-insert-1 up to this point will not have a valid METADATA$ROW_LAST_COMMIT_TIME timestamp.
   INSERT INTO table1 VALUES('server-side-insert-1', current_timestamp());
   ```
3. Modify the table to enable the METADATA$ROW_LAST_COMMIT_TIME feature.

   ```sqlexample
   ALTER TABLE table1 SET ROW_TIMESTAMP = TRUE;
   ```
4. Ingest data that includes the `record_id` and `client_timestamp` columns to your Snowflake table using the ingestion pipeline defined in Step 1.
5. Insert a new row as an immediate example if not using an ingestion pipeline. Unlike the insert in Step 2, this insert will have a valid METADATA$ROW_LAST_COMMIT_TIME timestamp because the table property is enabled.

   ```sqlexample
   INSERT INTO table1 VALUES('server-side-insert-2', current_timestamp());
   ```
6. Run your client-side program again, and then do the following:

   ```sqlexample
   SELECT *, METADATA$ROW_LAST_COMMIT_TIME AS ROW_TIMESTAMP, TIMESTAMPDIFF(ms, CLIENT_TIMESTAMP, ROW_TIMESTAMP)
     AS INGEST_LATENCY FROM table1 ORDER BY 2;
   ```

### Example: measure pipeline latency with dynamic tables

Row timestamps are supported on dynamic tables. By enabling ROW_TIMESTAMP on both a source table and a dynamic table, you can
measure pipeline latency — the time it takes for data to flow from the source table through to the dynamic table.

The following example creates a source table, inserts data, creates a dynamic table that materializes the source row timestamp,
refreshes the dynamic table, and then queries it to compute pipeline latency.

1. Create a source table with row timestamps enabled and insert data:

   ```sqlexample
   CREATE OR REPLACE TABLE raw_events (
     event_id INT,
     event_type STRING,
     event_data STRING
   ) ROW_TIMESTAMP = TRUE;

   INSERT INTO raw_events VALUES
     (1, 'click', '{"page":"home"}'),
     (2, 'view',  '{"page":"product"}');
   ```
2. Create a dynamic table that materializes the source table’s `METADATA$ROW_LAST_COMMIT_TIME` as a column. Enable
   ROW_TIMESTAMP on the dynamic table as well so it has its own `METADATA$ROW_LAST_COMMIT_TIME`:

   ```sqlexample
   CREATE OR REPLACE DYNAMIC TABLE processed_events
     TARGET_LAG = '1 minute'
     WAREHOUSE = my_warehouse
     REFRESH_MODE = INCREMENTAL
     ROW_TIMESTAMP = TRUE
   AS
   SELECT
     event_id,
     event_type,
     event_data,
     METADATA$ROW_LAST_COMMIT_TIME AS source_last_commit_time
   FROM raw_events;
   ```
3. Refresh the dynamic table to process the inserted rows:

   ```sqlexample
   ALTER DYNAMIC TABLE processed_events REFRESH;
   ```
4. Query the dynamic table to measure pipeline latency. The `source_last_commit_time` column captures when each row was
   committed in the source table, while the dynamic table’s own `METADATA$ROW_LAST_COMMIT_TIME` records when the dynamic
   table refresh committed that row:

   ```sqlexample
   SELECT
     event_id,
     event_type,
     source_last_commit_time,
     METADATA$ROW_LAST_COMMIT_TIME AS dt_last_commit_time,
     TIMESTAMPDIFF('second', source_last_commit_time, METADATA$ROW_LAST_COMMIT_TIME) AS pipeline_latency_seconds
   FROM processed_events
   ORDER BY event_id;
   ```

## Secondary use cases

Row timestamps can also be used in the following cases:

* **Data retention**: Row timestamps can help delete old records to save on storage costs.
* **Event ordering and change tracking**: You can use row timestamps to track changes. The row with the largest timestamp represents the
  latest change.
* **Append-only data**: If rows are only appended, row timestamps can help filter for table states from specific points in time, enabling
  you to use [Time Travel](../data-time-travel.md) regardless of data retention policy.

## Limitations and considerations

* Row timestamps are only guaranteed to maintain chronological order within the same table, except in the event of failover where ordering
  isn’t guaranteed. Ordering across tables, different regions, or other time sources isn’t guaranteed. You shouldn’t compare row timestamps
  across tables or other sources because doing so can lead to inconsistencies.
* Row timestamps reflect the last updated time, not the creation time. For instance, if the data row is updated after it has been committed,
  the row timestamp reflects the last updated time, not the creation time of the data.
* Timestamps on rows created before the row timestamps were enabled for a table are set to NULL.
* Row timestamps are stored as long as the rows are stored.
* Setting the ROW_TIMESTAMP property to FALSE permanently deletes all stored METADATA$ROW_LAST_COMMIT_TIME values. Re-enabling
  it will not restore them and Time Travel queries will return nothing.
* Row timestamps are not supported for Apache Iceberg™ tables, external tables, hybrid tables, streams, or views.
* The metadata column METADATA$ROW_LAST_COMMIT_TIME can’t be referenced in the following:

  + [CHANGES](../../sql-reference/constructs/changes.md) clause
  + Policies, including row or column access policies and [storage lifecycle policies](../storage-management/storage-lifecycle-policies.md)
  + Constraints
  + CLUSTER BY expressions
* Row timestamps can’t be restored by archive table restore. As a workaround, you can materialize METADATA$ROW_LAST_COMMIT_TIME as a persisted
  column of another table to use in archive restore.

### Cloning considerations for row timestamps

Cloning a table preserves row timestamps exactly. Operations that create a physical copy of data, such as CREATE TABLE AS SELECT (CTAS) and
INSERT INTO … SELECT, assign fresh row timestamps reflecting when the copy was made. The original row timestamps from the source table aren’t
preserved. If you would like to keep a record of them, then select them explicitly into a persisted column, as shown in the
following example:

```sqlexample
CREATE TABLE my_archive AS
 SELECT *, METADATA$ROW_LAST_COMMIT_TIME AS original_commit_time
 FROM my_source_table;
```

---
title: Use secure objects to control data access
source: https://docs.snowflake.com/en/user-guide/data-sharing-secure-views.md
section: User Guide
---

# Use secure objects to control data access

To ensure sensitive data in a shared database is not exposed to users in consumer accounts, Snowflake strongly recommends sharing
[secure views](views-secure.md) and/or [secure UDFs](../developer-guide/secure-udf-procedure.md) instead of directly
sharing tables.

In addition, for optimal performance, especially when sharing data in extremely large tables, we recommend defining
[clustering keys](tables-clustering-keys.md) on the base table(s) for your secure objects.

This topic describes using clustering keys in base tables for shared secure objects and provides step-by-step instructions for sharing
a secure view with a consumer account. It provides sample scripts for both data providers and consumers.

> **Note:**
>
> The instructions for sharing a secure object are essentially the same as sharing a table, with the addition of the following objects:
>
> * A “private” schema containing the base table and a “public” schema containing the secure object. Only the public schema and secure
>   object are shared.
> * A “mapping table” (also in the “private” schema), which is only required if you wish to share the data in the base table with multiple
>   consumer accounts and share specific rows in the table with specific accounts.

## Using clustering keys for shared data

On very large (i.e. multi-terabyte) tables, clustering keys provide significant query performance benefits. By defining one or more
clustering keys on the base tables used in shared secure views or secure UDFs, you ensure users in your consumer accounts are not negatively
impacted when using these objects.

When choosing the columns to use as the clustering key for a table, please note some
[important considerations](tables-clustering-keys.md).

## Sample setup and tasks

These sample instructions assume a database named `mydb` exists in the data provider account and has two schemas, `private`
and `public`. If the database and schemas do not exist, you should create them before proceeding.

### Step 1: Create data and mapping tables in private schema

Create the following two tables in the `mydb.private` schema and populate them with data:

`sensitive_data` — contains the data to share, and an `access_id` column for controlling data access by account.

`sharing_access` — uses the `access_id` column to map the shared data and the accounts that can access the data.

### Step 2: Create secure view in public schema

Create the following secure view in the `mydb.public` schema:

`paid_sensitive_data` — displays data based on account.

Note that the `access_id` column from the base table (`sensitive_data`) does not need to be included in the view.

### Step 3: Validate tables and secure view

Validate the tables and secure view to ensure the data is filtered properly by account.

To enable validating secure views that will be shared with other accounts, Snowflake provides a session parameter,
[SIMULATED_DATA_SHARING_CONSUMER](../sql-reference/parameters.md). Set this session parameter to the name of the consumer account you wish to simulate
access for. You can then query the view and see the results that a user in the consumer account will see.

### Step 4: Create a share

1. Create a [share](data-sharing-provider.md).

   To create a share, you must use the ACCOUNTADMIN role or a role granted the global CREATE SHARE privilege.
   The role must also have one of the following to grant objects to the share:

   * A role with the OWNERSHIP privilege on the shared database.
   * A role with the [USAGE privilege on the database WITH GRANT OPTION](../sql-reference/sql/grant-privilege.md). For example:

     ```sqlsyntax
     GRANT USAGE ON <database-name> TO ROLE <role-name> WITH GRANT OPTION;
     ```
2. Add the database (`mydb`), schema (`public`), and secure view (`paid_sensitive_data`) to the share. You can choose
   to either add privileges on these objects to a share via a database role, or grant privileges on the objects directly to the
   share. For more information on these options, see [How to share database objects](data-sharing-gs.md).
3. Confirm the contents of the share. At the most basic level, you should use the [SHOW GRANTS](../sql-reference/sql/show-grants.md) command to confirm
   the objects in the share have the necessary privileges.

   Note that the secure view `paid_sensitive_data` is displayed in the command output as a table.
4. Add one or more accounts to the share.

## Sample script

The following script illustrates performing all the tasks described in the previous section:

1. Create two tables in the ‘private’ schema and populate the first one with stock data from three different companies (Apple, Microsoft,
   and IBM). You will then populate the second one with data that maps the stock data to individual accounts:

   ```sqlexample
   use role sysadmin;

   create or replace table mydb.private.sensitive_data (
       name string,
       date date,
       time time(9),
       bid_price float,
       ask_price float,
       bid_size int,
       ask_size int,
       access_id string /* granularity for access */ )
       cluster by (date);

   insert into mydb.private.sensitive_data
       values('AAPL',dateadd(day,  -1,current_date()), '10:00:00', 116.5, 116.6, 10, 10, 'STOCK_GROUP_1'),
             ('AAPL',dateadd(month,-2,current_date()), '10:00:00', 116.5, 116.6, 10, 10, 'STOCK_GROUP_1'),
             ('MSFT',dateadd(day,  -1,current_date()), '10:00:00',  58.0,  58.9, 20, 25, 'STOCK_GROUP_1'),
             ('MSFT',dateadd(month,-2,current_date()), '10:00:00',  58.0,  58.9, 20, 25, 'STOCK_GROUP_1'),
             ('IBM', dateadd(day,  -1,current_date()), '11:00:00', 175.2, 175.4, 30, 15, 'STOCK_GROUP_2'),
             ('IBM', dateadd(month,-2,current_date()), '11:00:00', 175.2, 175.4, 30, 15, 'STOCK_GROUP_2');

   create or replace table mydb.private.sharing_access (
     access_id string,
     snowflake_account string
   );

   /* In the first insert, CURRENT_ACCOUNT() gives your account access to the AAPL and MSFT data.       */

   insert into mydb.private.sharing_access values('STOCK_GROUP_1', CURRENT_ACCOUNT());

   /* In the second insert, replace <consumer_account> with an account name; this account will have     */
   /* access to IBM data only. Note that account names are case-sensitive and must be in uppercase      */
   /* enclosed in single-quotes, e.g.                                                                   */
   /*                                                                                                   */
   /*      insert into mydb.private.sharing_access values('STOCK_GROUP_2', 'ACCT1')                */
   /*                                                                                                   */
   /* To share the IBM data with multiple accounts, repeat the second insert for each account.          */

   insert into mydb.private.sharing_access values('STOCK_GROUP_2', '<consumer_account>');
   ```
2. Create a secure view in the ‘public’ schema. This view filters the stock data from the first table by account, using the mapping
   information in the second table:

   ```sqlexample
   create or replace secure view mydb.public.paid_sensitive_data as
       select name, date, time, bid_price, ask_price, bid_size, ask_size
       from mydb.private.sensitive_data sd
       join mydb.private.sharing_access sa on sd.access_id = sa.access_id
       and sa.snowflake_account = current_account();

   grant select on mydb.public.paid_sensitive_data to public;

   /* Test the table and secure view by first querying the data as the provider account. */

   select count(*) from mydb.private.sensitive_data;

   select * from mydb.private.sensitive_data;

   select count(*) from mydb.public.paid_sensitive_data;

   select * from mydb.public.paid_sensitive_data;

   select * from mydb.public.paid_sensitive_data where name = 'AAPL';

   /* Next, test the secure view by querying the data as a simulated consumer account. You specify the  */
   /* account to simulate using the SIMULATED_DATA_SHARING_CONSUMER session parameter.                  */
   /*                                                                                                   */
   /* In the ALTER command, replace <consumer_account> with one of the accounts you specified in the    */
   /* mapping table. Note that the account name is not case-sensitive and does not need to be enclosed  */
   /* in single-quotes, e.g.                                                                            */
   /*                                                                                                   */
   /*      alter session set simulated_data_sharing_consumer=acct1;                                     */

   alter session set simulated_data_sharing_consumer=<account_name>;

   select * from mydb.public.paid_sensitive_data;
   ```
3. Create a share using the ACCOUNTADMIN role.

   ```sqlexample
   use role accountadmin;

   create or replace share mydb_shared
     comment = 'Example of using Secure Data Sharing with secure views';

   show shares;
   ```
4. Add the objects to the share. You can choose to either add privileges on these objects to a share via a database role
   (Option 1), or grant privileges on the objects directly to the share (Option 2):

   ```sqlexample
   /* Option 1: Create a database role, grant privileges on the objects to the database role, and then grant the database role to the share */

   create database role mydb.dr1;

   grant usage on database mydb to database role mydb.dr1;

   grant usage on schema mydb.public to database role mydb.dr1;

   grant select on mydb.public.paid_sensitive_data to database role mydb.dr1;

   grant usage on database mydb to share mydb_shared;

   grant database role mydb.dr1 to share mydb_shared;

   /* Option 2: Grant privileges on the database objects to include in the share.  */

   grant usage on database mydb to share mydb_shared;

   grant usage on schema mydb.public to share mydb_shared;

   grant select on mydb.public.paid_sensitive_data to share mydb_shared;

   /*  Confirm the contents of the share. */

   show grants to share mydb_shared;
   ```
5. Add accounts to the share.

   ```sqlexample
   /* In the alter statement, replace <consumer_accounts> with the  */
   /* consumer account(s) you assigned to STOCK_GROUP2 earlier,     */
   /* with each account name separated by commas, e.g.              */
   /*                                                               */
   /*    alter share mydb_shared set accounts = acct1, acct2;       */

   alter share mydb_shared set accounts = <consumer_accounts>;
   ```

## Sample script (for consumers)

The following script can be used by consumers to create a database (from the share created in the above script) and query the secure view
in the resulting database:

1. Bring the shared database into your account by creating a database from the share.

   ```sqlexample
   /* In the following commands, the share name must be fully qualified by replacing     */
   /* <provider_account> with the name of the account that provided the share, e.g.      */
   /*                                                                                    */
   /*    desc prvdr1.mydb_shared;                                                        */

   use role accountadmin;

   show shares;

   desc share <provider_account>.mydb_shared;

   create database mydb_shared1 from share <provider_account>.mydb_shared;
   ```
2. Grant privileges on the database to other roles in your account (e.g. CUSTOM_ROLE1). The GRANT statement differs depending on whether
   the data consumer added objects to the share using database roles (Option 1) or by granting privileges on the objects directly to the
   share (Option 2):

   ```sqlexample
   /* Option 1 */
   grant database role mydb_shared1.db1 to role custom_role1;

   /* Option 2 */
   grant imported privileges on database mydb_shared1 to custom_role1;
   ```
3. Use the CUSTOM_ROLE1 role to query the view in the database you created. Note that there must be an active warehouse in use in the
   session to perform queries. In the USE WAREHOUSE command, replace <warehouse_name> with the name of one of the warehouses in your
   account. The CUSTOM_ROLE1 role must have the USAGE privilege on the warehouse:

   ```sqlexample
   use role custom_role1;

   show views;

   use warehouse <warehouse_name>;

   select * from paid_sensitive_data;
   ```

---
title: Use Snowflake troubleshooting tools
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/snowflake-tools.md
section: User Guide
---

# Use Snowflake troubleshooting tools

Snowflake provides the following tools to test client connectivity by simplifying tests for a client’s ability to access Snowflake URLs:

* Snowflake Connector for Python diagnostics

  Use the connector’s diagnostic feature for comprehensive testing. For more information and detailed steps,
  see [Running connectivity tests and diagnostics](../../developer-guide/python-connector/python-connector-connect.md).
* SnowCD

  You can use SnowCD to diagnose connectivity for all Snowflake-related URLs. See the [SnowCD User Guide](../snowcd.md) for installation and usage instructions.

If you cannot use or install these tools in your system, you can follow the [alternate steps](alternate-steps.md), based on your platform.

If you successfully verified connectivity, proceed to [follow-up actions](followup-actions.md).

---
title: Use Snowsight to set up data quality checks
source: https://docs.snowflake.com/en/user-guide/data-quality-ui-setup.md
section: User Guide
---

# Use Snowsight to set up data quality checks

This topic describes how to use [Snowsight](ui-snowsight-gs.md) to set up data quality checks. You can use the following strategies to set up data
quality checks:

* Use AI to intelligently suggest data quality checks based on characteristics of your data and usage patterns.
  See Set up quality checks using Cortex Data Quality.
* Manually define the expected values to be returned by a data metric function (DMF). See Set up quality checks manually.

For an introduction to concepts of data quality checks, see [Core concepts of data quality checks](data-quality-intro.md).

## Set up quality checks using Cortex Data Quality

Cortex Data Quality uses AI to suggest data quality checks based on characteristics of your metadata. If you accept
the suggestions, Snowflake checks your data for quality issues at regular intervals to identify problems.

Cortex Data Quality leverages the [Snowflake Cortex AI_COMPLETE function](../sql-reference/functions/ai_complete.md) to
intelligently suggest data quality checks. Because it runs securely inside Snowflake Cortex, your enterprise data and metadata always stay
securely inside Snowflake. Cortex Data Quality also fully respects Snowflake access control and provides suggestions that are based only
on the data that you can access.

To use Cortex Data Quality to set up data quality checks, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the object.
3. Select the Data Quality tab.
4. Select Monitoring.
5. Do one of the following:

   * **If this is the first time you are setting up quality checks**, select Get started.
   * **If you are setting up additional quality checks**, select Add quality check, and then select Suggested quality checks.
6. Review the suggested data quality checks. To change the criteria that determine whether data passes a quality check, edit the contents
   of the What should the result be? column.
7. Select the quality checks that you want to implement, and then select Apply.

For more information about Cortex Data Quality, see More about Cortex Data Quality.

## Set up quality checks manually

To create data quality checks based on your knowledge of your data, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the object.
3. Select the Data Quality tab.
4. Select Monitoring.
5. Do one of the following:

   * **If this is the first time you are setting up quality checks**, select Start manually.
   * **If you are setting up additional quality checks**, select Add quality check, and then select Build checks manually.
6. In the Set up a quality check dialog, select the type of check that you want to create.
7. Configure the criteria that determine if data passes the quality check, and then select Save.

> **Tip:**
>
> If you want to enable anomaly detection so that Snowflake can automatically detect data quality issues based on the historical volume and
> freshness of your data, either use Cortex Data Quality and accept its suggestions for anomaly
> detection or [set up anomaly detection manually](data-quality-anomaly.md).

## Adjust how often quality checks run

The schedule of a table or view determines how often the DMF that is powering the data quality check runs. The schedule can be based on time
or on updates to the table.

> **Note:**
>
> You can’t use Snowsight to adjust the schedule until you have added at least one quality check. You can use an ALTER <object>
> command to set the schedule for a table or view at anytime.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer, and then select the object.
3. Select the Data Quality tab.
4. Select Monitoring.
5. Select Settings.
6. Specify how often you want to run the DMF:

   * To run the DMF at a regular interval of one day or less, select Interval-based timing and select the interval from the drop-down
     list.
   * To run the DMF on a custom schedule, select Select schedule and set the schedule.
   * To run the DMF whenever there is a DML change to the table — for example, when a row is added — select
     Trigger-based execution.

## More about Cortex Data Quality

The following sections provide additional information about Cortex Data Quality.

### Required LLMs

Cortex Data Quality won’t work unless the [CORTEX_MODELS_ALLOWLIST](../sql-reference/parameters.md) account parameter allows the `mistral-7b` and
`llama3.1-8b` models within the account. By default, both models are allowed. For more information about setting this parameter, see
[Account-level allowlist parameter](snowflake-cortex/aisql.md).

### Access control requirements

Administrators with the ACCOUNTADMIN role have all the privileges that they need to use Cortex to suggest data quality checks.

Other users must have the following privileges and roles:

* OWNERSHIP privilege on the table
* EXECUTE DATA METRIC FUNCTION privilege on the account
* SNOWFLAKE.DATA_METRIC_USER database role
* SNOWFLAKE.CORTEX_USER database role

#### Limit access

By default, the CORTEX_USER database role is granted to the PUBLIC role, which means every user has it. If you don’t want all users to be
able use Snowflake Cortex features, you can revoke this database role from the PUBLIC role and then grant
it to specific roles.

To stop users from using Cortex to suggest quality checks, revoke the CORTEX_USER database role from the PUBLIC role by
running the following commands. Be sure to use the ACCOUNTADMIN role.

```sqlexample
USE ROLE ACCOUNTADMIN;

REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER
  FROM ROLE PUBLIC;
```

You can now selectively provide access by granting the CORTEX_USER database role to specific roles. In the following example, use the
ACCOUNTADMIN role and grant the user `some_user` the CORTEX_USER database role through the account role `cortex_access_role`, which you
create for this purpose.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE cortex_access_role;
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO ROLE cortex_access_role;

GRANT ROLE cortex_access_role TO USER some_user;
```

You can also grant the CORTEX_USER database role to existing roles.

### Cost considerations

The cost of using Cortex Data Quality consists of the following:

* Costs associated with the [COMPLETE (SNOWFLAKE.CORTEX)](../sql-reference/functions/complete-snowflake-cortex.md) function. These charges appear on a bill as
  AI-Services, which includes all uses of Snowflake Cortex.
* Compute cost of the default warehouse that runs Snowsight.

### Legal notices

Cortex Data Quality leverages third-party models and/or services, as previously described on this page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Usage Data | Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information about the use of AI, see [Snowflake AI and ML](../guides-overview-ai-features.md).

---
title: Use Snowsight to work with cost anomalies
source: https://docs.snowflake.com/en/user-guide/cost-anomalies-ui.md
section: User Guide
---

# Use Snowsight to work with cost anomalies

This topic describes how to use Snowsight to identify and investigate cost anomalies, which occur when daily consumption in an
account or organization is above or below the expected range of consumption for the day. It also describes how to use Snowsight to
configure notifications so specific users are emailed when cost anomalies occur.

For an overview of cost anomalies, see [Introduction to cost anomalies](cost-anomalies.md).

## Configure notifications with Snowsight

When Snowflake identifies a cost anomaly, it sends a notification to a list of email addresses. When deciding who will receive notifications for cost anomalies, be aware that email notifications might contain details about how much was spent by an account.

Each account can have a notification list for account-level anomalies within the account. You can also define a separate notification list for the organization to
control who is notified when there is an organization-level anomaly.

Each email address must have been [verified by the user](ui-snowsight-profile.md).

You can use a group email address, such as a distribution list, for notifications, but this email address must be verified. Before adding a group email address to the notification list, you might need to create a new Snowflake user with the group email address so you can verify it.

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

To add email addresses where notifications are sent when there is a cost anomaly, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](cost-anomalies-access-control.md).
2. In the navigation menu, select Admin » Cost management, and then select Anomalies.
3. Select Notifications.
4. To specify who gets notified for an [account-level anomaly](cost-anomalies.md), do the following:

   1. In the Notify for account anomalies field, enter the email address of a Snowflake user you want contacted for anomalies.
   2. Press Enter.
   3. Repeat for additional users.
5. To specify who gets notified for an [organization-level anomaly](cost-anomalies.md), do the following:

   1. In the Notify for organization anomalies field, enter the email addresses of a Snowflake user you want contacted for anomalies.
   2. Press Enter.
   3. Repeat for additional users.
6. Select Save changes.

## Identify and investigate cost anomalies with Snowsight

**Step 1: Identify cost anomalies**

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](cost-anomalies-access-control.md).
2. In the navigation menu, select Admin » Cost management, and then select Anomalies.
3. Use the filters to select a timeframe and account. If you want to identify
   [organization-level anomalies](cost-anomalies.md), select All accounts.
4. Do one of the following:

   1. Use the chart to visually track actual consumption and the expected range of consumption over time. Cost anomalies where actual
      consumption went above or below the expected range are visually represented in the chart.
   2. Use the table to view a list of all cost anomalies within the timeframe. Sort as desired.

**Step 2: Investigate a cost anomaly**

After identifying a cost anomaly, you can investigate it using the side panel or by using Cortex Code to ask natural-language questions.

### Investigate using the side panel

1. Select a cost anomaly by clicking the indicator in the chart or selecting a row in the table. A side panel opens.
2. If you are investigating an account-level anomaly (you selected a specific account in the filter), you can use the side panel to drill down into the following:

   * Use the **Top consumption drivers** section to investigate hourly consumption within the account. You can view consumption for all service types or you can focus on the services that consumed the most credits during the day.
   * Use the **Top warehouses** section to identify the warehouses within the account that had the greatest absolute change in consumption.
   * If you are investigating anomalies in the account that you are currently signed in to, use the **Top queries** section to identify the most expensive queries in the warehouse that had the greatest change in consumption. This might not show the most expensive query in the account because it focuses on queries in a specific warehouse (the one with the greatest change in consumption).
   * Drill down into the most expensive queries by selecting the **Open in Worksheet** icon that is located near the Query ID. A worksheet opens that shows the query that was executed.
3. If you are investigating an organization-level anomaly (you selected **All Accounts** in the filter), you can use the side panel to drill down into the following:

   * Use the **Top accounts** section to identify the accounts that had the greatest absolute change in consumption.
   * Use the **Top warehouses** section to drill down into the account with the greatest change in consumption. You can identify the warehouses within the account that had the greatest change in consumption.

   This might not show the warehouse with the greatest change within the entire organization because it focuses on warehouses in a specific account (the one with the greatest change in consumption). To programmatically retrieve the top warehouses in a different account or within the organization, see [Warehouse-level consumption](cost-anomalies-class.md).

> **Tip:**
>
> If the Anomalies tab does not provide the consumption data you need to identify the root cause of the cost anomaly, you can select the **Consumption** tab for further investigation.

### Investigate with Cortex Code

Cortex Code is an AI-driven intelligent agent integrated into the Snowflake platform. You can use Cortex Code to investigate cost anomalies by highlighting a section of the consumption chart and asking natural-language questions.

> **Note:**
>
> **First-time users:** When you first access the Anomalies tab, you might see an introductory prompt highlighting the Snap and Ask feature. This prompt appears near the consumption chart and introduces the Add to Chat and Explain quick actions. Select either action to begin using Cortex Code for cost investigation.

**Prerequisites**

Before you can use Cortex Code to investigate cost anomalies, you must be granted the following privileges:

* The [required privileges](cortex-code/cortex-code-snowsight.md) to access Cortex Code in Snowsight.

**Investigate a cost anomaly with Cortex Code**

To investigate a cost anomaly with Cortex Code, do the following:

1. Identify and highlight activity in the consumption chart that you want to investigate, such as a spike in compute costs.

   The Add to Chat and Explain quick actions appear.
2. Select one of the following quick actions:

   * Add to Chat: Start a Cortex Code chat where you can enter prompts and interact with Cortex Code.
   * Explain: Cortex Code will analyze the highlighted area of the chart and return an analysis.
3. Cortex Code analyzes the cost activity for the highlighted area and reports its findings. It might ask you to run SQL statements to gather more information about the anomaly. For example, if you ask about a cost spike, it might generate a SQL statement that identifies the warehouses, queries, or users that contributed to the increase.

**Example prompts**

The following example prompts cover different types of analysis that Cortex Code supports for cost anomalies:

| Use case | Example prompt |
| --- | --- |
| Gather general information about a cost change | What changed in this highlighted window? |
| Determine the cause of a cost spike | Why did this cost spike occur? |
| Identify cost drivers | Which top warehouses contributed the most to this increase? |
| Get recommendations to reduce costs | What can I do to reduce these costs? |
| Investigate specific cost categories | What queries caused this compute cost increase? |

For more information, see [Cortex Code](cortex-code/cortex-code.md).

---
title: Use SQL to set up data metric functions
source: https://docs.snowflake.com/en/user-guide/data-quality-working.md
section: User Guide
---

# Use SQL to set up data metric functions

This topic describes how to use SQL to associate a data metric function (DMF) with a table or view so it runs at regular
intervals. It also describes how to call a DMF directly, for example, if you want to test a DMF before associating it with a table or view.

> **Note:**
>
> To use a user interface to set up data quality checks, which includes associating a DMF with a table, see [Use Snowsight to set up data quality checks](data-quality-ui-setup.md).

## Associate a DMF

You can associate a DMF with a table or view to automatically call it on regular intervals. When associating the DMF, you specify which
columns are passed to the DMF as arguments.

Use an [ALTER TABLE](../sql-reference/sql/alter-table.md) or [ALTER VIEW](../sql-reference/sql/alter-view.md) command to associate a DMF and specify which
columns are passed as arguments. For example, the following command associates the NULL_COUNT system DMF with table `t`. When the
DMF runs, it will return the number of NULL values in the column `c1`.

```sqlexample
ALTER TABLE t
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT
    ON (c1);
```

Some DMFs don’t accept a column as an argument. For example, to associate the ROW_COUNT system DMF with view `v2`, run the following command:

```sqlexample
ALTER VIEW v2
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.ROW_COUNT
    ON ();
```

The [ACCEPTED_VALUES](../sql-reference/functions/dmf_accepted_values.md) DMF contains a lambda expression as well as the column name, which
allows you to check how many records do not match an expected value. For example, the following statement associates the function with table
`t1` so the function returns the number of records where the value of the column `age` is *not* equal to five.

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.ACCEPTED_VALUES ON (age, age -> age = 5);
```

### Drop a DMF from an object

You can drop a DMF using an ALTER TABLE or ALTER VIEW command. For example:

> ```sqlexample
> ALTER TABLE t
>   DROP DATA METRIC FUNCTION governance.dmfs.count_positive_numbers
>     ON (c1, c2, c3);
> ```

## Adjust the schedule for DMFs

The [DATA_METRIC_SCHEDULE](../sql-reference/parameters.md) object parameter for a table, view, or materialized view controls how often DMFs run. By default, the schedule is set to one hour.
All data metric functions on a table or view follow the same schedule.

You can use the following approaches to schedule your DMF to run:

* Set the DMF to run after a specified number of minutes.
* Use a cron expression to schedule the DMF to run at a particular frequency.
* Use a trigger event to schedule the DMF to run when there is a [DML change](../sql-reference/sql-dml.md) to the table, such as
  inserting a new row into the table. However:

  + The [reclustering of tables](tables-auto-reclustering.md) doesn’t trigger a DMF to run.
  + The trigger approach is only available for certain kinds of tables. For more information, see
    [ALTER TABLE … SET DATA_METRIC_SCHEDULE](../sql-reference/sql/alter-table.md).

For example:

Set the data metric function schedule to run every 5 minutes:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = '5 MINUTE';
> ```

Set the data metric function schedule to run at 8:00 AM daily:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 8 * * * UTC';
> ```

Set the data metric function schedule to run at 8:00 AM on weekdays only:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 8 * * MON,TUE,WED,THU,FRI UTC';
> ```

Set the data metric function schedule to run three times daily at 0600, 1200, and 1800 UTC:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 6,12,18 * * * UTC';
> ```

Set the data metric function to run when a general DML operation, such as inserting a new row, modifies the table:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'TRIGGER_ON_CHANGES';
> ```

You can use the [SHOW PARAMETERS](../sql-reference/sql/show-parameters.md) command to view the DMF schedule for a supported table object:

> ```sqlexample
> SHOW PARAMETERS LIKE 'DATA_METRIC_SCHEDULE' IN TABLE hr.tables.empl_info;
> ```
>
> ```output
> +----------------------+--------------------------------+---------+-------+------------------------------------------------------------------------------------------------------------------------------+--------+
> | key                  | value                          | default | level | description                                                                                                                  | type   |
> +----------------------+--------------------------------+---------+-------+------------------------------------------------------------------------------------------------------------------------------+--------+
> | DATA_METRIC_SCHEDULE | USING CRON 0 6,12,18 * * * UTC |         | TABLE | Specify the schedule that data metric functions associated to the table must be executed in order to be used for evaluation. | STRING |
> +----------------------+--------------------------------+---------+-------+------------------------------------------------------------------------------------------------------------------------------+--------+
> ```

For view and materialized view objects, specify `TABLE` as the object domain and check the schedule as follows:

> ```sqlexample
> SHOW PARAMETERS LIKE 'DATA_METRIC_SCHEDULE' IN TABLE mydb.public.my_view;
> ```

> **Note:**
>
> There is a 10-minute lag from when you modify the DMF from a table for any scheduling changes to take effect on previous DMFs that are
> assigned to the table. However, new DMF assignments to the table are not subject to the 10 minute delay. Plan the DMF scheduling and DMF
> unsetting operations carefully to align with your expected [DMF costs](data-quality-intro.md).
>
> Additionally, when you evaluate the DMF results, such as by querying the
> [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/local/data_quality_monitoring_results.md) view, specify the `measurement_time`
> column in your query as the basis for the evaluation. There is an internal process that initiates the DMF evaluation, and it is possible
> that table updates, such as INSERT operations, can occur between the scheduled time and the measurement time. When you use the
> `measurement_time` column, you have a more accurate assessment of the DMF results because the measurement time indicates the
> evaluation time of the DMF.

## Suspend DMFs

You can suspend a DMF to prevent it from running even though it is associated with a table. Alternatively, you can suspend all DMFs
associated with a table with a single statement.

* **To suspend a specific DMF** associated with a table, modify the association to set the SUSPEND parameter. For example:

  ```sqlexample
  ALTER TABLE t1
    MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT ON ( col1 )
      SUSPEND;
  ```

  To resume running the DMF, use another MODIFY DATA METRIC FUNCTION statement to set the RESUME parameter.
* **To suspend all DMFs** associated with a table, set the table’s schedule to an empty string. For example:

  ```sqlexample
  ALTER TABLE t1 SET DATA_METRIC_SCHEDULE = '';
  ```

  To resume the DMFs, set the DATA_METRIC_SCHEDULE parameter to a valid value.

## Call a DMF manually

Calling a DMF directly can be useful to test the output of the DMF before associating it with a table or view.

Use the following syntax to call a DMF:

```sqlsyntax
SELECT <data_metric_function>(<query>)
```

Where:

`data_metric_function`
:   Specifies a system- or user-defined DMF.

`query`
:   Specifies a SQL query on a table or view.

    The columns projected by the query must match the column arguments in the DMF signature.

> **Note:**
>
> The following system DMFs don’t follow this syntax because they don’t take any arguments:
>
> * [DATA_METRIC_SCHEDULED_TIME (system data metric function)](../sql-reference/functions/dmf_data_metric_schedule_time.md)
> * [ROW_COUNT (system data metric function)](../sql-reference/functions/dmf_row_count.md)

For example, to call a custom DMF `count_positive_numbers`, which accepts three columns as arguments, run the following command:

```sqlexample
SELECT governance.dmfs.count_positive_numbers(
  SELECT c1, c2, c3
  FROM t);
```

For example, to call the [NULL_COUNT (system data metric function)](../sql-reference/functions/dmf_null_count.md) system DMF to view the number of NULL values
in the `ssn` column, run the following command:

```sqlexample
SELECT SNOWFLAKE.CORE.NULL_COUNT(
  SELECT ssn
  FROM hr.tables.empl_info);
```

If a custom DMF accepts arguments from multiple tables, each query that projects a column must be enclosed in parentheses. For example, if
you want to manually call the REFERENTIAL_CHECK DMF, execute the following:

```sqlexample
SELECT referential_check( (SELECT id FROM salesorders), (SELECT id FROM salespeople) );
```

---
title: Use SQL to set up sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-auto.md
section: User Guide
---

# Use SQL to set up sensitive data classification

The following sections describe how to use SQL to set up the automatic classification of sensitive data within a database. If you want to
use a web interface to set up sensitive data classification, see [Use the Trust Center to set up sensitive data classification](classify-ui-trust-center.md).

The basic workflow for using SQL to classify sensitive data consists of the following steps:

1. Create a *classification profile* that controls what happens during sensitive data classification.
2. Set the classification profile on a database or schema to automatically classify tables in the entity.

For end-to-end examples of this workflow, see Examples.

## About classification profiles

A classification profile defines the criteria that are used to automatically classify tables in a database. This
criteria includes:

* How long a table should exist before automatically classifying it.
* How long before previously classified tables should be reclassified.
* Whether system and custom tags are automatically set on columns after the classification. You can decide whether you want Snowflake
  to automatically apply suggested tags or prefer to review proposed tag assignments, then apply them yourself.
* A mapping between system classification tags and user-defined object tags so the user-defined tags
  can be applied automatically.
* Whether [custom classifiers](classify-custom.md) are used to classify data.

When a data engineer assigns the classification profile to a database, sensitive data in the tables that belong to the database is
automatically classified on the schedule defined by the profile. A data engineer can assign the same classification profile to multiple
databases, or create multiple classification profiles to set different classification criteria for different databases.

To use SQL to create a classification profile, run the [CREATE CLASSIFICATION_PROFILE](../sql-reference/classes/classification_profile/commands/create-classification-profile.md)
command to create an instance of the CLASSIFICATION_PROFILE
[class](../sql-reference/snowflake-db-classes.md).

For an example of using the CREATE CLASSIFICATION_PROFILE command to create a classification profile, see Examples.

## About tag mapping

You can use the classification profile to map [SEMANTIC_CATEGORY system tags](classify-intro.md) to one or more
[object tags](object-tagging/introduction.md). With this tag mapping, a column with sensitive data can be automatically
assigned a user-defined tag based on its classification. The tag map can be added while creating the classification profile or later by
calling the [<classification_profile_name>!SET_TAG_MAP](../sql-reference/classes/classification_profile/methods/set_tag_map.md) method.

Regardless of whether you are defining the tag map while creating the classification profile or after, the contents of the map are specified
as a JSON object. This JSON object contains the `'column_tag_map'` key, which is an array of objects that specify a user-defined tag,
the string value of that tag, and the semantic categories to which the tag is being mapped.

The following is an example of a tag map:

```javascript
'tag_map': {
  'column_tag_map': [
    {
      'tag_name':'tag_db.sch.pii',
      'tag_value':'Highly Confidential',
      'semantic_categories':[
        'NAME',
        'NATIONAL_IDENTIFIER'
      ]
    },
    {
      'tag_name': 'tag_db.sch.pii',
      'tag_value':'Confidential',
      'semantic_categories': [
        'EMAIL'
      ]
    }
  ]
}
```

Based on this mapping, if you have a column of email addresses and the classification process determines that the column contains these
addresses, the `tag_db.sch.pii = 'Confidential'` tag is set on the column containing the email addresses.

If your tag map includes multiple JSON objects that map tags, tag values, and category values, the order of the JSON objects determines
which tag and value to set on the column if there is a conflict. Specify the JSON objects in the desired assignment order from left to
right, or top to bottom if you are formatting JSON.

> **Tip:**
>
> Each object in the `column_tag_map` field has only has one required key: `tag_name`. If you omit the `tag_value` and
> `semantic_categories` keys, the user-defined tag gets applied to every column to which the SEMANTIC_CATEGORY system tag is applied,
> and the value of the user-defined tag will match the value of the SEMANTIC_CATEGORY tag for a given column.

If there is a conflict with a manually assigned tag and a tag applied by automatic classification, an error occurs. For information about
tracking these errors, see [Troubleshooting sensitive data classification](classify-troubleshooting.md).

## Classify data using a subset of native semantic categories

By default, Snowflake classifies data into its [native semantic categories](classify-native.md) whenever it identifies
sensitive data. A semantic category represents a type of data, such as email addresses, credit card numbers, or social security numbers.

You can configure the classification profile to limit which types of data (semantic categories) to classify as sensitive.
Snowflake classifies data only if it belongs to the subset of semantic categories that you specify in the profile.

The `snowflake_semantic_categories` key in a classification profile’s configuration object defines the list of semantic categories
that you want classified. You can specify which data to classify in two ways:

* **Classify by semantic category**: Specify categories like NAME or EMAIL to classify all data of that type, regardless of location.
* **Classify by semantic category and country**: For categories with country-specific subcategories (like TAX_IDENTIFIER or PASSPORT),
  you can use the `country_codes` key to classify only data from specific countries. For a list of two-letter country codes and
  supported semantic subcategories, see [Native semantic categories of sensitive data classification](classify-native.md).

You can mix both approaches in the same classification profile. The following examples demonstrate each variation:

### Example 1: Classify specific semantic categories

This example configures a classification profile so data is classified only if Snowflake identifies it as belonging to the semantic
categories NAME and NATIONAL_IDENTIFIER. All other types of data are not classified.

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days': 0,
      'snowflake_semantic_categories':
        [
          {'category': 'NAME'},
          {'category': 'NATIONAL_IDENTIFIER'}
        ]
    });
```

### Example 2: Classify a category with specific country codes

For semantic categories that have country-specific subcategories, you can use the `country_codes` key to limit classification to
specific countries. The `country_codes` value is a two-letter country code.

This example classifies data only if Snowflake identifies that it is a tax identifier in Italy (IT) or France (FR):

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days': 0,
      'snowflake_semantic_categories':
        [
          {
            'category': 'TAX_IDENTIFIER',
            'country_codes': ['IT', 'FR']
          }
        ]
    });
```

### Example 3: Combine global and country-specific categories

You can combine semantic categories that do not have country-specific subcategories with categories that do. This example specifies that
Snowflake always classify data belonging to the NAME category and classify the PASSPORT category if the data pertains to United States
passports:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days': 0,
      'snowflake_semantic_categories':
        [
          {
            'category': 'NAME'
          },
          {
            'category': 'PASSPORT',
            'country_codes': ['US']
          }
        ]
    });
```

## Set a classification profile on a database

Implement sensitive data classification by setting a classification profile on a database. After you set the
classification profile on the database, all tables and views within that database are automatically monitored by sensitive data
classification.

You can also set a classification on a schema. If you set a classification profile on a schema that exists within a database that is also
associated with a classification profile, the profile set on the schema overrides the profile set on the database.

To set a classification profile, use an [ALTER DATABASE](../sql-reference/sql/alter-database.md) or [ALTER SCHEMA](../sql-reference/sql/alter-schema.md) command to set
the CLASSIFICATION_PROFILE parameter. For example, to set a classification profile `my_profile` so all tables and views in the `my_db`
database are monitored by sensitive data classification, run the following command:

```sqlexample
ALTER DATABASE my_db
  SET CLASSIFICATION_PROFILE = 'governance_db.classify_sch.my_profile';
```

## Access control

Here are the privileges and roles that let you work with classification profiles and enable sensitive data
classification.

| Task | Required privileges/roles | Notes |
| --- | --- | --- |
| Create a classification profile | SNOWFLAKE.CLASSIFICATION_ADMIN database role | For information about granting this database role to other roles, see [Using SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md). |
|  | CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE on schema | You need this privilege on the schema where you want to create the classification profile instance. |
|  | USAGE on database and schema | You need privileges on the schema where you want to create the classification profile instance. |
| Set the classification profile on a database/schema | One of the following:   * EXECUTE AUTO CLASSIFICATION on account * EXECUTE AUTO CLASSIFICATION on database/schema | By default, the owner of the database/schema has the EXECUTE AUTO CLASSIFICATION privilege on it. |
|  | Any privilege on schema’s database | If setting a classification profile on a schema, you need at least one privilege on the database that contains that schema. |
|  | Any privilege on database/schema | You need at least one privilege on the database/schema that contains the table that you want to automatically classify. The EXECUTE AUTO CLASSIFICATION privilege meets this requirement. |
|  | One of the following:   * OWNERSHIP on classification profile instance. * <classification_profile>!PRIVACY_USER instance role on the classification profile. | For information about granting the PRIVACY_USER instance role to other roles, see [Instance roles](../sql-reference/snowflake-db-classes.md). |
|  | APPLY TAG on Account |  |
| Call [methods](../sql-reference/classes/classification_profile.md) on a classification profile instance | <classification_profile>!PRIVACY_USER instance role | For information about granting this instance role to other roles, see [Instance roles](../sql-reference/snowflake-db-classes.md). |
| List classification profiles | <classification_profile>!PRIVACY_USER instance role |  |
| Drop classification profiles | OWNERSHIP on classification profile instance |  |

For an example of granting these privileges and database roles to the role of a data engineer, see Basic example: Automatically classifying tables in a database.

## Examples

* Basic example: Automatically classifying tables in a database
* Example: Using a tag map and custom classifiers
* Example: Testing a classification profile before enabling automatic classification

### Basic example: Automatically classifying tables in a database

Complete these steps to automatically classify a table in the database:

1. As an administrator, give the data engineer the roles and privileges they need to
   automatically classify tables in a database.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   GRANT USAGE ON DATABASE mydb TO ROLE data_engineer;
   GRANT EXECUTE AUTO CLASSIFICATION ON DATABASE mydb TO ROLE data_engineer;

   GRANT DATABASE ROLE SNOWFLAKE.CLASSIFICATION_ADMIN TO ROLE data_engineer;
   GRANT CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE ON SCHEMA mydb.sch TO ROLE data_engineer;

   GRANT APPLY TAG ON ACCOUNT TO ROLE data_engineer;
   ```
2. Switch to the data engineer role:

   ```sqlexample
   USE ROLE data_engineer;
   ```
3. [Create the classification profile](../sql-reference/classes/classification_profile/commands/create-classification-profile.md) as an
   instance of the CLASSIFICATION_PROFILE class:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
     my_classification_profile(
       {
         'minimum_object_age_for_classification_days': 0,
         'maximum_classification_validity_days': 30,
         'auto_tag': true,
         'classify_views': false
       });
   ```
4. Call the [DESCRIBE](../sql-reference/classes/classification_profile/methods/describe.md) method on the instance to confirm its properties:

   ```sqlexample
   SELECT my_classification_profile!DESCRIBE();
   ```
5. Set the classification profile instance on the schema, which starts the background process of monitoring tables in the schema and
   automatically classifying them for sensitive data.

   ```sqlexample
   ALTER DATABASE mydb
    SET CLASSIFICATION_PROFILE = 'mydb.sch.my_classification_profile';
   ```

   > **Note:**
   >
   > There is a one-hour delay between setting the classification profile on the schema and Snowflake beginning to classify the schema.
6. After waiting one hour, call the [SYSTEM$GET_CLASSIFICATION_RESULT](../sql-reference/functions/system_get_classification_result.md) stored procedure to obtain the results
   of the automatic classification.

   ```sqlexample
   CALL SYSTEM$GET_CLASSIFICATION_RESULT('mydb.sch.t1');
   ```

### Example: Using a tag map and custom classifiers

1. As an administrator, give the data engineer the roles and privileges they need to
   automatically classify tables in a database and set tags on columns.
2. Create the classification profile.

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
     my_classification_profile(
       {
         'minimum_object_age_for_classification_days': 0,
         'maximum_classification_validity_days': 30,
         'auto_tag': true,
         'classify_views': false
       });
   ```
3. Call the [SET_TAG_MAP](../sql-reference/classes/classification_profile/methods/set_tag_map.md) method on the instance to add a
   tag map to the classification profile. This allows custom tags to be automatically applied on
   columns that contain sensitive data.

   ```sqlexample
   CALL my_classification_profile!SET_TAG_MAP(
     {'column_tag_map':[
       {
         'tag_name':'my_db.sch1.pii',
         'tag_value':'sensitive',
         'semantic_categories':['NAME']
       }]});
   ```

   Alternatively, you could have added this tag map when you created the classification profile.
4. Call the [SET_CUSTOM_CLASSIFIERS](../sql-reference/classes/classification_profile/methods/set_custom_classifiers.md) method to add
   [custom classifiers](classify-custom.md) to the classification profile. This allows sensitive data to be automatically
   classified with user-defined semantic and privacy categories.

   ```sqlexample
   CALL my_classification_profile!set_custom_classifiers(
     {
       'medical_codes': medical_codes!list(),
       'finance_codes': finance_codes!list()
     });
   ```

   Alternatively, you could have added the custom classifiers when you created the classification profile.
5. Call the [DESCRIBE](../sql-reference/classes/classification_profile/methods/describe.md) method on the instance to confirm that the
   tag map and custom classifiers have been added to the classification profile.

   ```sqlexample
   SELECT my_classification_profile!DESCRIBE();
   ```
6. Set the classification profile instance on the database.

   ```sqlexample
   ALTER DATABASE mydb
    SET CLASSIFICATION_PROFILE = 'mydb.sch.my_classification_profile';
   ```
7. Attach a masking policy to the `tag_db.sch.pii` tag to enable tag-based masking.

   ```sqlexample
   ALTER TAG tag_db.sch.pii SET MASKING POLICY pii_mask;
   ```

### Example: Testing a classification profile before enabling automatic classification

1. As an administrator, give the data engineer the roles and privileges they need to
   automatically classify tables in a schema and set tags on columns.
2. Create the classification profile with a tag map and custom classifiers:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE my_classification_profile(
     {
       'minimum_object_age_for_classification_days':0,
       'auto_tag':true,
       'tag_map': {
         'column_tag_map':[
           {
             'tag_name':'tag_db.sch.pii',
             'tag_value':'highly sensitive',
             'semantic_categories':['NAME','NATIONAL_IDENTIFIER']
           },
           {
             'tag_name':'tag_db.sch.pii',
             'tag_value':'sensitive',
             'semantic_categories':['EMAIL','MEDICAL_CODE']
           }
         ]
       },
       'classify_views': false,
       'custom_classifiers': {
         'medical_codes': medical_codes!list(),
         'finance_codes': finance_codes!list()
       }
     }
   );
   ```
3. Call the [SYSTEM$CLASSIFY](../sql-reference/stored-procedures/system_classify.md) stored procedure to test the tag mappings on the `table1` table before
   enabling automatic classification.

   ```sqlexample
   CALL SYSTEM$CLASSIFY(
    'db.sch.table1',
    'db.sch.my_classification_profile'
   );
   ```

   The `tags` key in the output contains the details about whether the tag was set (`true` if set, `false` otherwise),
   the name of the tag that was set, and the value of the tag:

   ```output
   {
     "classification_profile_config": {
       "classification_profile_name": "db.schema.my_classification_profile"
     },
     "classification_result": {
       "EMAIL": {
         "alternates": [],
         "recommendation": {
           "confidence": "HIGH",
           "coverage": 1,
           "details": [],
           "privacy_category": "IDENTIFIER",
           "semantic_category": "EMAIL",
           "tags": [
             {
               "tag_applied": true,
               "tag_name": "snowflake.core.semantic_category",
               "tag_value": "EMAIL"
             },
             {
               "tag_applied": true,
               "tag_name": "snowflake.core.privacy_category",
               "tag_value": "IDENTIFIER"
             },
             {
               "tag_applied": true,
               "tag_name": "tag_db.sch.pii",
               "tag_value": "sensitive"
             }
           ]
         },
         "valid_value_ratio": 1
       },
       "FIRST_NAME": {
         "alternates": [],
         "recommendation": {
           "confidence": "HIGH",
           "coverage": 1,
           "details": [],
           "privacy_category": "IDENTIFIER",
           "semantic_category": "NAME",
           "tags": [
             {
               "tag_applied": true,
               "tag_name": "snowflake.core.semantic_category",
               "tag_value": "NAME"
             },
             {
               "tag_applied": true,
               "tag_name": "snowflake.core.privacy_category",
               "tag_value": "IDENTIFIER"
             },
             {
               "tag_applied": true,
               "tag_name": "tag_db.sch.pii",
               "tag_value": "highly sensitive"
             }
           ]
         },
         "valid_value_ratio": 1
       }
     }
   }
   ```
4. Having verified that automatic classification based on the classification profile will have the desired result, set the classification
   profile instance on the database.

   ```sqlexample
   ALTER DATABASE mydb
    SET CLASSIFICATION_PROFILE = 'mydb.sch.my_classification_profile';
   ```

---
title: Use SQL to work with expectations
source: https://docs.snowflake.com/en/user-guide/data-quality-expectations.md
section: User Guide
---

# Use SQL to work with expectations

Returning a value from a data metric function (DMF) provides useful information, but it might be hard to know whether it indicates a data
quality issue without knowing what you consider to be acceptable for your data. For example, you might consider tables that contain less
than 10 NULL values in a given column as passing the data quality check. In this case, you *expect* the value to be less than 10, and only
want to be notified if it exceeds that value.

An *expectation* lets you define criteria for whether data passes a data quality check performed by a DMF. When the DMF returns a value,
that value is compared to this criteria to determine whether the data passed or failed the check. Return values that fail are reported as
expectation violations so you can take appropriate action upon the data.

> **Note:**
>
> This topic describes how to use SQL to set up and monitor expectations. To use a user interface to set up data quality checks consisting of
> a DMF and an expectation, see [Use Snowsight to set up data quality checks](data-quality-ui-setup.md).

The following creates the expectation that the column `C1` contains less than 10 NULL values.

```sqlexample
ALTER VIEW v1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT ON (C1)
  EXPECTATION my_exp ( VALUE < 10);
```

You can define expectations for both system DMFs and custom DMFs.

## Defining what meets the expectation

An expectation includes a Boolean expression that determines whether the expectation was met or not. When this expression
evaluates to TRUE it means that the DMF result matched your expectation.

Within an expression, the keyword `VALUE` represents the value returned by the DMF. For example, suppose you had the following
definition of an expectation:

```sqlexample
EXPECTATION my_exp (VALUE < 5)
```

Snowflake replaces `VALUE` with the value returned by the DMF when evaluating the expectation. If the DMF returned `3`, the expectation
would be met because the expression evaluates to TRUE.

If an expression evaluates to FALSE, Snowflake reports it as an expectation violation. For information about tracking these violations, see Identify expectation violations.

An expression can include the following types of operators:

* [Logical operators](../sql-reference/operators-logical.md)
* [Comparison operators](../sql-reference/operators-comparison.md)

An expression cannot reference other tables or views, or user-defined functions (UDFs).

## Create an expectation

Each association between a DMF and an object can have one or more expectations.

You can add an expectation when you associate the DMF with the table or view, or you can add it to the association later. You can also
modify an existing expectation.

After you add an expectation, you can manually test it without having to wait until the DMF
runs based on its schedule.

### Add an expectation when associating a DMF

You use an ALTER TABLE or ALTER VIEW command to associate a DMF with a table or view. You can add expectations to the association in the
same SQL statement that creates the association.

For example, the syntax to add expectations when associating a DMF with a table is as follows. Views use a similar syntax.

```sqlsyntax
  ALTER TABLE <table>
    ADD DATA METRIC FUNCTION <dmf>
      ON (<col_name> [ , ... ] [ , TABLE<table_name>( <col_name> [ , ... ] ) )
      [ EXPECTATION <expectation_name> ( <expression> )
        [, <expectation_name> ( <expression> ) [ , ... ] ] ]
```

Where:

* `expectation_name` is a string that’s used to identify the expectation. You can create expectations with the same name as
  long as they belong to different associations.
* `expression` is a Boolean expression that determines whether the DMF returned an expected value. See
  Defining what meets the expectation.

Example: Add single expectation
:   Suppose you’re associating the MAX system DMF with view `v1` in order to check the maximum value in the column `c1`. You expect the
    maximum value to be between 25 and 50.

    ```sqlexample
    ALTER VIEW v1
      ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.MAX ON (C1)
        EXPECTATION my_exp ( 25 < VALUE AND VALUE < 50);
    ```

    If the MAX DMF returns a value outside this range of expected values, then Snowflake records it as an expectation violation.

Example: Add multiple expectations
:   Suppose you wanted to be notified when a table hasn’t been updated within five minutes, then again when it hasn’t been updated for 30
    minutes. You could add the following expectations, then check when those expectations were violated.

    ```sqlexample
    ALTER TABLE emp
    ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.FRESHNESS ON (last_updated)
      EXPECTATION lessThan5Mins (VALUE < 300), lessThan30Mins (VALUE < 1800);
    ```

### Add an expectation to an existing association

You use an ALTER TABLE or ALTER VIEW command to add an expectation to an existing association between a DMF and the table or view.

For example, the syntax to add expectations to an association between a table and a DMF is as follows. Views use a similar syntax.

```sqlsyntax
  ALTER TABLE <table>
    MODIFY DATA METRIC FUNCTION <dmf>
      ON (<col_name> [ , ... ] [ , TABLE<table_name>( <col_name> [ , ... ] ) )
      [ ADD EXPECTATION <expectation_name> ( <expression> )
        [, <expectation_name> ( <expression> ) [ , ... ] ] ]
```

Where:

* `expectation_name` is a string that’s used to identify the expectation. You can create expectations with the same name as
  long as they belong to different associations.
* `expression` is a Boolean expression that determines whether the DMF returned an expected value. See
  Defining what meets the expectation.

Example
:   Suppose you previously associated the NULL_COUNT system DMF with the column `c1` in the table `my_table`. To add an expectation so you
    can be notified when there are 10 or more NULL values in the column `c1`, run the following statement:

    ```sqlexample
    ALTER TABLE my_table
      MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT ON (c1)
        ADD EXPECTATION my_exp (VALUE < 10);
    ```

    If the result of NULL_COUNT is 15, it’s reported as an expectation violation.

### Modify an existing expectation

You use a MODIFY EXPECTATION clause to change the expression of an expectation that you previously added to an association.

For example, suppose you previously added the expectation `my_exp` to the association between table `t1` and the NULL_COUNT DMF. To
modify the expectation so it’s violated when there are 15 or more NULL values in column `c1`, run the following statement:

```sqlexample
ALTER TABLE t1
  MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT ON (c1)
    MODIFY EXPECTATION my_exp (VALUE < 15);
```

The previous expression of the expectation is replaced with `VALUE < 15`.

## Test an expectation

After you add expectations, you can call the [SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS](../sql-reference/functions/system_evaluate_data_quality_expectations.md) system function to
ensure that they were added correctly and to determine whether these expectations are currently violated.

For example, suppose you added at least one expectation to the association between a DMF and table `t1`. To see whether these expectations
are currently violated, run the following statement:

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS(
      REF_ENTITY_NAME => 'my_db.sch.t1'));
```

## Drop an expectation

Use a DROP EXPECTATION clause to remove an expectation from an association and remove it from the system.

For example, suppose you previously added the expectation `my_exp` to the association between the column `c1` in the table `t1`
and the NULL_COUNT DMF. To remove the `my_exp` from the association and the DMF, run the following code:

```sqlexample
ALTER TABLE t1
  MODIFY DATA METRIC FUNCTION SNOWFLAKE.CORE.NULL_COUNT on (c1)
    DROP EXPECTATION my_exp;
```

## Identify expectation violations

You can identify expectation violations using the following:

* SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW — A dedicated event table that records raw data quality results.
* DATA_QUALITY_MONITORING_EXPECTATION_STATUS view — View in the SNOWFLAKE.LOCAL schema that contains flattened results.
* DATA_QUALITY_MONITORING_EXPECTATION_STATUS function — Table function that returns expectations results.

### SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW

Data quality results are recorded in the dedicated event table SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW.

If the association between an object and a DMF has an expectation, two rows are added to the table every time Snowflake computes the result
of the DMF. The first row records information about the object the DMF is associated with, the DMF itself, and the result of the data
quality check. The second row records information related to the expectation set on the DMF association, including whether the expectation
was met or violated. If there are multiple expectations, there is a row for each expectation.

The `snow.data_metric.record_type` field in the `resource_attribute` column indicates whether a row corresponds to an
expectation. This field has two possible values:

* `EXPECTATION_VIOLATION_STATUS` - Indicates that the row corresponds to an expectation.
* `EVALUATION_RESULT` - Indicates that the row corresponds to the evaluation of the DMF.

If the row corresponds to an expectation, the `resource_attribute` column also contains the following fields related to expectations:

* `snow.data_metric.expectation_id` - System-generated identifier.
* `snow.data_metric.expectation_name`- Name of the expectation when it was added to the association.
* `snow.data_metric.expectation_expression` - Expectation’s expression.

After you have determined that a row corresponds to the evaluation of an expectation, you can check the `value` column to determine
whether the expectation was violated. If TRUE, the expectation was violated.

### DATA_QUALITY_MONITORING_EXPECTATION_STATUS view

The [DATA_QUALITY_MONITORING_EXPECTATION_STATUS view](../sql-reference/local/data_quality_monitoring_expectation_status.md), which exists in the SNOWFLAKE.LOCAL schema, flattens the
information in the event table to make it easier to access DMF results.

### DATA_QUALITY_MONITORING_EXPECTATION_STATUS function

The [DATA_QUALITY_MONITORING_EXPECTATION_STATUS](../sql-reference/functions/data_quality_monitoring_expectation_status.md) table function returns rows that provide the
same information that is available in the DATA_QUALITY_MONITORING_EXPECTATION_STATUS view. The function uses a different access control
model than the view.

## Track the use of expectations

Snowflake keeps track of all of the expectations in your account. You can run a function or
query an ACCOUNT_USAGE view to monitor the use of expectations, including performing the following
tasks:

* Monitor which objects have an expectation defined for their association with a DMF.
* Monitor which DMFs have an expectation defined for their association with an object.
* Discover whether there is an expectation defined for a specific association between an object and a DMF.
* Determine the Boolean expression of an expectation to better understand a data quality check.

### Run a function to track expectations

You can run the [DATA_METRIC_FUNCTION_EXPECTATIONS](../sql-reference/functions/data_metric_function_expectations.md) function to output expectations defined for a specific
object, a specific DMF, or the association between an object and a DMF.

**Example:** Expectations that exist for a specific object

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      REF_ENTITY_NAME => 'my_table',
      REF_ENTITY_DOMAIN => 'table'));
```

**Example:** Expectations that exist for a specific DMF

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      METRIC_NAME => 'SNOWFLAKE.CORE.NULL_COUNT'));
```

**Example:** Expectations that exist for a specific association between an object and a DMF

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      METRIC_NAME => 'SNOWFLAKE.CORE.NULL_COUNT',
      REF_ENTITY_NAME => 'my_table',
      REF_ENTITY_DOMAIN => 'table'));
```

### Query a view to track expectations

The [DATA_METRIC_FUNCTION_EXPECTATIONS view](../sql-reference/account-usage/data_metric_function_expectations.md) in the ACCOUNT_USAGE schema contains all of the expectations in
your account. You can query the view to track the use of expectations within your account and determine the Boolean expression of each
expectation.

**Example:** Return all expectations for your Snowflake account

```sqlexample
SELECT * FROM snowflake.account_usage.data_metric_function_expectations
  ORDER BY expectation_name;
```

**Example:** Identify expectations for a specific data metric function

```sqlexample
SELECT expectation_name,
    ref_database_name as object_database,
    ref_schema_name as object_schema,
    ref_entity_name as object_name
  FROM snowflake.account_usage.data_metric_function_expectations
  WHERE
    metric_database_name = 'SNOWFLAKE' AND
    metric_schema_name = 'CORE' AND
    metric_name = 'ROW_COUNT'
  ORDER BY expectation_name;
```

---
title: Use the sample database
source: https://docs.snowflake.com/en/user-guide/sample-data-using.md
section: User Guide
---

# Use the sample database

The sample database, SNOWFLAKE_SAMPLE_DATA, is identical to the databases that you create in your account, except that it is read-only.
As such, the following operations are not allowed:

* No DDL can be performed on the data set schemas (i.e. tables and other database objects cannot be added, dropped, or altered).
* No DML can be performed on the tables in the schemas.
* No cloning or Time Travel can be performed on the database or any schemas/tables in the database.

However, you can use all the same commands and syntax to view the sample database, schemas, and tables, as well as execute queries on the tables.

> **Important:**
>
> The sample database is created by default for newer accounts. If the database has not been created for your account and you want
> access to it, execute the following SQL statements with the ACCOUNTADMIN role active:
>
> ```sqlexample
> -- Create a database from the share.
> CREATE DATABASE SNOWFLAKE_SAMPLE_DATA FROM SHARE SFC_SAMPLES.SAMPLE_DATA;
>
> -- Grant the PUBLIC role access to the database.
> -- Optionally change the role name to restrict access to a subset of users.
> GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE_SAMPLE_DATA TO ROLE PUBLIC;
> ```

## View the sample database

You can view the sample database and its contents either in Snowsight or using SQL:

> Snowsight:
> :   In the navigation menu, select Catalog » Database Explorer » SNOWFLAKE_SAMPLE_DATA.
>
> SQL:
> :   Execute a [SHOW DATABASES](../sql-reference/sql/show-databases.md) command.
>
>     You can also use the relevant [SHOW <objects>](../sql-reference/sql/show.md) commands to view the objects in the sample database.

For example, in SQL:

> ```sqlexample
> show databases like '%sample%';
>
> +-------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------+
> | created_on                    | name                  | is_default | is_current | origin                  | owner        | comment | options | retention_time |
> |-------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------|
> | 2016-07-14 14:30:21.711 -0700 | SNOWFLAKE_SAMPLE_DATA | N          | N          | SFC_SAMPLES.SAMPLE_DATA | ACCOUNTADMIN |         |         | 1              |
> +-------------------------------+-----------------------+------------+------------+-------------------------+--------------+---------+---------+----------------+
> ```

Note that this example illustrates the sample database, SNOWFLAKE_SAMPLE_DATA, has been [shared with your account](data-sharing-intro.md) by Snowflake.

The `origin` column in the SHOW DATABASES output (or the Origin column in the Databases  page in the interface) displays the fully-qualified name of the shared
database, SFC_SAMPLES.SAMPLE_DATA, indicating it originated from the SFC_SAMPLES account (used by Snowflake to share the sample data).

## Query tables and views in the sample database

To use a table or view in the sample database, you can either:

* Reference the fully-qualified name of the table in your query (in the form of `snowflake_sample_data.schema_name.object_name`).

  OR
* Specify the sample database (and schema) for your session using the [USE DATABASE](../sql-reference/sql/use-database.md) and/or [USE SCHEMA](../sql-reference/sql/use-schema.md) commands.

The following two examples illustrate using both approaches to query the `lineitem` table in the `tpch_sf1` schema:

> ```sqlexample
> select count(*) from snowflake_sample_data.tpch_sf1.lineitem;
>
> +----------+
> | COUNT(*) |
> |----------|
> |  6001215 |
> +----------+
>
> use schema snowflake_sample_data.tpch_sf1;
>
> select count(*) from lineitem;
>
> +----------+
> | COUNT(*) |
> |----------|
> |  6001215 |
> +----------+
> ```

> **Note:**
>
> You must have a running, current warehouse in your session to perform queries. You set the current warehouse in a session using the [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md)
> command (or within the Worksheet in the web interface.)

---
title: Use the Trust Center to set up sensitive data classification
source: https://docs.snowflake.com/en/user-guide/classify-ui-trust-center.md
section: User Guide
---

# Use the Trust Center to set up sensitive data classification

Trust Center lets you set up [sensitive data classification](classify-intro.md) in the Snowsight user interface, so you
don’t have to write any SQL code. After it is set up, sensitive data classification automatically identifies which data in a database is sensitive and needs to be protected.

## Get started

> **Note:**
>
> The following steps apply only to the first user who accesses the Data Security tab in the Trust Center. If you aren’t the first
> user and want to set up classification, see Set up classification with advanced settings.

To use a web interface to set up sensitive data classification, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Select Get started.
5. In the Set up auto-classification dialog, do the following:

   1. Select the databases that you want to classify.
   2. Specify whether you want to auto-apply tags instead of just recommending them. For more information about tags and categories, see [Core concepts of sensitive data classification](classify-intro.md).
6. Select Enable.
7. Select Close.

Based on this default set up, sensitive data classification has the following behavior:

* Reclassifies previously classified objects every 30 days.
* Scans data for all [native semantic categories](classify-native.md).
* Excludes views from classification.
* Bases classification on a sample of up to 10,000 randomly selected rows per table.

When the classification process is complete, you are ready to [view the results](classify-results.md).

## Set up classification with advanced settings

To set up sensitive data classification with advanced settings, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Select Settings.
5. Do one of the following:

   * If you’re fine-tuning existing classification settings, find the classification profile that contains the settings and
     select  » Edit. If the first person to set up classification chose the default settings during
     setup, the profile is `Default Snowflake profile`.
   * If you are creating a new [classification profile](classify-intro.md) so different databases can be
     classified with different settings, select Create New.
6. Select the databases that you want to scan for sensitive data.

   If a database is greyed out, it’s associated with an existing
   classification profile and is already being classified. You’ll need to edit the existing classification profile to remove the database
   before you can classify it with the settings of a new profile.
7. Select Next.
8. If your account classifies sensitive data into [custom categories](classify-custom.md), select the ones that you want to use.
9. Select Next.
10. If you don’t want tags automatically applied to columns containing sensitive data, deselect Auto-apply tags.
11. If you want to apply a user-defined tag in addition to a system tag on matching columns, do the following:

    1. In the Tag to apply column, select the user-defined tag/value pair that you want applied to sensitive data.
    2. In the Detected semantic categories column, select values of the `SNOWFLAKE.CORE.SEMANTIC_CATEGORY` tag. These can be
       native and custom semantic categories.

    For example, if you select `PII = CONFIDENTIAL` as the user-defined tag/value pair in Tag to apply, and
    then select the `NAME` semantic category in Detected semantic categories, when Snowflake assigns the
    `SNOWFLAKE.CORE.SEMANTIC_CATEGORY = NAME` system tag to a column, it also applies the `PII = CONFIDENTIAL` tag.
12. Select Next.
13. Specify the database, schema, and name of the [classification profile](classify-intro.md) where all of your
    settings will be saved.
14. Select the cadence at which previously classified objects are re-classified.
15. Specify if you want to exclude certain objects from the classification process. For information about excluding specific objects, see [Excluding data from sensitive data classification](classify-auto-exclude.md).
16. Select Enable.

## Classify additional databases

You can classify additional databases with the same classification settings by editing an existing classification profile. To edit a classification profile:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the required privileges.
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Select Settings.
5. Find the classification profile in the list and select  » Edit. If the first person to set up classification used the default settings, the classification profile is `Default Snowflake profile`.
6. On the first page that appears, select the additional databases.
7. Complete the setup.

## Next steps

To view the results of sensitive data classification, see [Use the Trust Center to view classification results](classify-results.md).

## Access control requirements

> **Note:**
>
> The `DATA_SECURITY_*` application roles alone are not sufficient to access the Trust Center Data Security tab. You must have the
> SNOWFLAKE.TRUST_CENTER_VIEWER or SNOWFLAKE.TRUST_CENTER_ADMIN application role to use the Trust Center UI for classification. If your
> account previously relied on `DATA_SECURITY_*` roles, update your role grants accordingly.

| Task | Required privileges/roles | Notes |
| --- | --- | --- |
| Set up classification for a database | One of the following:   * SNOWFLAKE.TRUST_CENTER_VIEWER application role * SNOWFLAKE.TRUST_CENTER_ADMIN application role |  |
|  | EXECUTE AUTO CLASSIFICATION privilege on ACCOUNT |  |
|  | APPLY TAG privilege on ACCOUNT |  |
|  | USAGE on the database | More powerful privileges on the database meet this requirement. |
| Review classification insights and classified objects | One of the following:   * SNOWFLAKE.TRUST_CENTER_VIEWER application role * SNOWFLAKE.TRUST_CENTER_ADMIN application role |  |

**Example: Allow a user to set up classification**

To allow user `mary` to set up sensitive data classification and review classification findings, run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE trust_center_admin_role;

GRANT APPLICATION ROLE SNOWFLAKE.TRUST_CENTER_ADMIN TO ROLE trust_center_admin_role;
GRANT EXECUTE AUTO CLASSIFICATION ON ACCOUNT TO ROLE trust_center_admin_role;
GRANT APPLY TAG ON ACCOUNT TO ROLE trust_center_admin_role;
GRANT USAGE ON DATABASE mydb TO ROLE trust_center_admin_role;

GRANT ROLE trust_center_admin_role TO USER mary;
```

**Example: Allow user to review classification findings**

If you want user `joe` to be able to review classification findings, but not be able to set up classification, run the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE trust_center_viewer_role;

GRANT APPLICATION ROLE SNOWFLAKE.TRUST_CENTER_VIEWER TO ROLE trust_center_viewer_role;

GRANT ROLE trust_center_viewer_role TO USER joe;
```

---
title: User management
source: https://docs.snowflake.com/en/user-guide/admin-user-management.md
section: User Guide
---

# User management

User administrators can create and manage Snowflake users through SQL or the web interface:

* Using SQL, administrators can perform all user-related tasks, including changing login credentials and defaults for users.
* Snowsight supports most user-related tasks.

## Types of users

Some user objects correspond to human users while other user objects correspond to a service or application that interacts with Snowflake
programmatically without human interaction. When you create a user object, you specify the type of user to differentiate between people and
services. This distinction is important because people need to enroll in
[multi-factor authentication (MFA)](security-mfa.md), but services and applications should not because there is no one to
use a secondary method of authentication.

The `TYPE` property of a user object determines the type of user. Possible values of this `TYPE` property are as follows:

PERSON:
:   User is a human user who can interact with Snowflake.

NULL:
:   Functions the same as `PERSON`.

SERVICE:
:   User is a service or application that interacts with Snowflake without human interaction.

    To improve the security posture of non-interactive use cases, users with the `TYPE` property set to `SERVICE` have the
    following characteristics:

    * They cannot log in using a password.
    * They cannot log in using SAML SSO.
    * They cannot [enroll in MFA](ui-snowsight-profile.md).
    * They are not subject to authentication policy MFA enforcement.
    * They cannot have the following properties:

      + `FIRST_NAME`
      + `MIDDLE_NAME`
      + `LAST_NAME`
      + `PASSWORD`
      + `MUST_CHANGE_PASSWORD`
      + `MINS_TO_BYPASS_MFA`
    * The following commands cannot be used:

      + ALTER USER RESET PASSWORD
      + ALTER USER SET `DISABLE_MFA = TRUE`

SNOWFLAKE_SERVICE:
:   User that is created by Snowflake for [Snowpark Container Services](../developer-guide/snowpark-container-services/overview.md).
    Administrators cannot create users of type SNOWFLAKE_SERVICE, nor can they change the type of an existing user to be SNOWFLAKE_SERVICE.
    For more information about SNOWFLAKE_SERVICE users, see [Snowpark Container Services: SQL execution](../developer-guide/snowpark-container-services/spcs-execute-sql.md).

LEGACY_SERVICE:
:   A user with their `TYPE` property set to `LEGACY_SERVICE` represents a non-interactive integration. It is similar to
    `SERVICE`, but allows password and SAML authentication.

    > **Note:**
    >
    > The LEGACY_SERVICE type is being deprecated. Use the SERVICE type for services and applications. For a timeline of the deprecation of
    > LEGACY_SERVICE, see [Planning for the deprecation of single-factor password sign-ins](security-mfa-rollout.md).

## User roles

Snowflake uses roles to control the objects (virtual warehouses, databases, tables, etc.) that users can access:

* Snowflake provides a set of predefined roles, as well as a framework for defining a hierarchy of custom roles.
* All Snowflake users are automatically assigned the predefined PUBLIC role, which enables login to Snowflake and basic object access.
* In addition to the PUBLIC role, each user can be assigned additional roles, with one of these roles designated as their *default role*. A user’s default role determines the role used in the Snowflake
  sessions initiated by the user; however, this is only a default. Users can change roles within a session at any time.
* Roles can be assigned at user creation or afterwards.

> **Attention:**
>
> When deciding the additional roles to assign to a user, as well as designating their default role, consider the following for the predefined ACCOUNTADMIN role (required for performing account-level
> administrative tasks):
>
> * Snowflake recommends strictly controlling the assignment of ACCOUNTADMIN, but recommends assigning it to at least two users.
> * ACCOUNTADMIN should never be designated as a user’s default role. Instead, designate a lower-level administrative or custom role as their default.
>
> For more details and best practices related to the ACCOUNTADMIN role, see [Access control best practices](security-access-control-considerations.md). For more general information about roles, see
> [Overview of Access Control](security-access-control-overview.md).

## Privileges required to create and modify users

The following roles or privileges are required to manage users in your account:

Create users:
:   The USERADMIN system role can create users using SQL ([CREATE USER](../sql-reference/sql/create-user.md)).

    If you prefer to use a custom role for this purpose, grant the CREATE USER privilege on the account to this role.

Modify users:
:   Only the role with the OWNERSHIP privilege on a user can modify most user properties using SQL
    ([ALTER USER](../sql-reference/sql/alter-user.md)).

## Creating users

This section describes how to create a user in a specific account.

> **Note:**
>
> Snowsight, requires that you specify a password when you create a user. The [CREATE USER](../sql-reference/sql/create-user.md) command and [UserCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserCollection) Python API do not.

> **Note:**
>
> If you want to create a user who can access multiple accounts within an organization, see [Organization users](organization-users.md).

### Using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select + User.
4. In the User Name field, enter a unique identifier for the user. The user uses this identifier to sign in to Snowflake unless you
   specify a login name.
5. Optionally specify an email address for the user in the Email field.
6. In the Password and Confirm Password fields, enter the password for the user.
7. Optionally add a comment explaining why you created the user.
8. Leave the Force user to change password on first time login checkbox selected to force the user to change their password when they
   sign in.
9. Optionally select Advanced User Options to specify additional details about the user:

   * Login Name to use instead of the User Name when signing in to Snowflake.
   * Display Name that appears after signing in.
   * First Name and Last Name to complete the user profile.
   * Default Role, Default Warehouse, and Default Namespace.
10. Select Create User.

### Using SQL

Use the [CREATE USER](../sql-reference/sql/create-user.md) command to create a user.

> **Important:**
>
> When creating a user, if you assign a default role to the user, you must then explicitly grant this role to the user. For example:
>
> > ```sqlexample
> > CREATE USER janesmith PASSWORD = 'abc123' DEFAULT_ROLE = myrole MUST_CHANGE_PASSWORD = TRUE;
> >
> > GRANT ROLE myrole TO USER janesmith;
> > ```
>
> Note that the [GRANT ROLE](../sql-reference/sql/grant-role.md) command allows you to assign multiple roles to a single user. The web interface does not currently support the same capability.

### Using Python

Use the [UserCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserCollection)
Python API to create a user.

> **Important:**
>
> When creating a user, if you assign a default role to the user, you must then explicitly grant this role to the user. For example:
>
> > ```python
> > from snowflake.core.user import Securable, User
> >
> > my_user = User(
> >   name="janesmith",
> >   password="abc123",
> >   default_role="myrole",
> >   must_change_password=True)
> > root.users.create(my_user)
> >
> > root.users['janesmith'].grant_role(role_type="ROLE", role=Securable(name='myrole'))
> > ```

## Disabling or enabling a user

Disabling a user prevents the user from logging into Snowflake. You can disable a user through the following interfaces.

### Using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Locate the user that you want to disable and select  » Disable User.
4. In the confirmation dialog that opens, select Disable.

To enable a user, follow the same steps, but select Enable User.

### Using SQL

Use the [ALTER USER](../sql-reference/sql/alter-user.md) command to disable or enable a user. For example:

* Disable a user:

  > ```sqlexample
  > ALTER USER janesmith SET DISABLED = TRUE;
  > ```
* Enable a user:

  > ```sqlexample
  > ALTER USER janesmith SET DISABLED = FALSE;
  > ```

### Using Python

Use the [UserResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource)
Python API to disable or enable a user. For example:

* Disable a user:

  > ```python
  > user_parameters = root.users["janesmith"].fetch()
  > user_parameters.disabled = True
  > root.users["janesmith"].create_or_alter(user_parameters)
  > ```
* Enable a user:

  > ```python
  > user_parameters = root.users["janesmith"].fetch()
  > user_parameters.disabled = False
  > root.users["janesmith"].create_or_alter(user_parameters)
  > ```

## Unlocking a user

If a user login fails after five consecutive attempts, the user is locked out of their account for a period of time (currently 15 minutes).
After the period of time elapses, the system automatically clears the lock and the user can attempt to log in again.

To unlock the user before the time has elapsed, you can reset the timer using the [ALTER USER](../sql-reference/sql/alter-user.md) command or the
[UserResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource) Python API.

The following example resets the timer to 0, which immediately unlocks user `janesmith`:

SQLPython

```sqlexample
ALTER USER janesmith SET MINS_TO_UNLOCK = 0;
```

```python
user_parameters = root.users["janesmith"].fetch()
user_parameters.mins_to_unlock = 0
root.users["janesmith"].create_or_alter(user_parameters)
```

> **Tip:**
>
> If a single role has the OWNERSHIP privilege on all Snowflake users, we recommend granting the role to multiple users. That way, if a member of the role is locked out, another member can unlock that user.

## Altering session parameters for a user

* To show the session parameters for a user, use the following SQL syntax:

  > ```sqlsyntax
  > SHOW PARAMETERS [ LIKE '<pattern>' ] FOR USER <name>
  > ```
* To alter the session parameters for a user, use the following syntax:

  > ```sqlsyntax
  > ALTER USER <name> SET <session_param> = <value>
  > ```

  For example, allow a user to remain connected to Snowflake indefinitely without timing out:

  > ```sqlexample
  > ALTER USER janesmith SET CLIENT_SESSION_KEEP_ALIVE = TRUE;
  > ```
* To reset a session parameter for a user to the default value, use the following syntax:

  > ```sqlsyntax
  > ALTER USER <name> UNSET <session_param>
  > ```

## Modifying other user properties

You can modify all other user properties using the [ALTER USER](../sql-reference/sql/alter-user.md) command or the
[UserResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource) Python API.
You can modify many of the same user properties using Snowsight.

For example:

* Change the last name for user `janesmith` to `Jones`:

  SQL:
  :   ```sqlexample
      ALTER USER janesmith SET LAST_NAME = 'Jones';
      ```

  Python:
  :   ```python
      user_parameters = root.users["janesmith"].fetch()
      user_parameters.last_name = "Jones"
      root.users["janesmith"].create_or_alter(user_parameters)
      ```

  Snowsight:
  :   1. Sign in to [Snowsight](ui-snowsight-gs.md).
      2. In the navigation menu, select Governance & security » Users & roles.
      3. Locate the user that you want to edit and select  » Edit.
      4. For the Last Name field, enter Jones.
      5. Select Save User.
* Set or change the default warehouse, namespace, primary role, and secondary roles for user `janesmith`:

  SQL:
  :   ```sqlexample
      ALTER USER janesmith SET DEFAULT_WAREHOUSE = mywarehouse DEFAULT_NAMESPACE = mydatabase.myschema DEFAULT_ROLE = myrole DEFAULT_SECONDARY_ROLES = ('ALL');
      ```

  Python:
  :   ```python
      user_parameters = root.users["janesmith"].fetch()
      user_parameters.default_warehouse = "mywarehouse"
      user_parameters.default_namespace = "mydatabase.myschema"
      user_parameters.default_role = "myrole"
      user_parameters.default_secondary_roles = "ALL"
      root.users["janesmith"].create_or_alter(user_parameters)
      ```

  Snowsight:
  :   > **Note:**
      >
      > You cannot set default secondary roles for a user using Snowsight.

      1. Sign in to [Snowsight](ui-snowsight-gs.md).
      2. In the navigation menu, select Governance & security » Users & roles.
      3. Locate the user that you want to edit and select  » Edit.
      4. Open the Advanced User Options and enter values in the relevant fields.
      5. Select Save User.

## Viewing users

You can view information about users using the following interfaces.

### Using SQL

Use the [DESCRIBE USER](../sql-reference/sql/desc-user.md) or [SHOW USERS](../sql-reference/sql/show-users.md) command to view information about one or more users.

For example:

```sqlexample
DESC USER janesmith;
```

### Using Python

Use the [UserResource.fetch](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource)
Python API to get information about a user.

For example:

```python
my_user = root.users["janesmith"].fetch()
print(my_user.to_dict())
```

Use the [UserCollection.iter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserCollection)
Python API to list users in an account.

For example:

```python
users = root.users.iter(like="jane%")
for user in users:
  print(user.name)
```

### Using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Locate the user for which you want to view more details.

   You can review the display name, status, last login time, owning role, and whether or not the user has multi-factor authentication (MFA)
   set up. If the user has a comment, you can hover over the .
4. Optionally select the user to see more details, such as their default settings, roles that have privileges granted on the user, and
   the roles granted to the user.

## Dropping a user

Dropping a user removes the user credentials from Snowflake.

> **Important:**
>
> When you drop a user, the folders, worksheets, and dashboards owned by that user become inaccessible and **do not** transfer to another user
> unless sharing is enabled.
>
> Share recipients with [View, View + Run, and Edit permissions](ui-snowsight-worksheets.md)
> will retain their assigned permissions and can still access the shared folders, worksheets, and dashboards. However, only users with Edit
> permissions can modify or delete the shared folders, worksheets, and dashboards. If you don’t give Edit permissions to at least one other
> user before you drop the owner, that owner’s folders, worksheets, and dashboards cannot be deleted.
>
> If a dropped user’s worksheets do not have sharing enabled, an administrator can [recover up to 500 worksheets owned by the user](ui-snowsight-worksheets.md).

> **Caution:**
>
> Any worksheets in the Classic Console will be permanently deleted, and dashboards will be inaccessible if they were not previously shared
> with another user.

Objects created by the user, such as tables or views, are not dropped because they are owned by the user’s active role when the objects
were created. Another user assigned the same role or a higher role in the [role hierarchy](security-access-control-considerations.md) can manage the objects or transfer ownership to another role.

### Using Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Locate the user that you want to disable and select  » Drop.
4. In the confirmation dialog that opens, select Drop User.

### Using SQL

Use the [DROP USER](../sql-reference/sql/drop-user.md) command to drop a user.

```sqlexample
DROP USER janesmith;
```

### Using Python

Use the [UserResource.drop](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource)
Python API to drop a user.

```python
root.users["janesmith"].drop()
```

---
title: Using Arrays to Compute Distinct Values for Hierarchical Aggregations
source: https://docs.snowflake.com/en/user-guide/querying-arrays-for-distinct-counts.md
section: User Guide
---

# Using Arrays to Compute Distinct Values for Hierarchical Aggregations

If you are counting distinct values for hierarchical aggregations (e.g. multiple grouping sets, rollups, or cubes), you can
improve performance by producing [ARRAYs](../sql-reference/data-types-semistructured.md) that contain the distinct values and computing the number
of distinct values from these ARRAYs. Using this approach can be faster than using `COUNT(DISTINCT <expr>)`.

This topic explains how to use ARRAYs to count distinct values.

For other techniques for counting distinct values, see [Computing the Number of Distinct Values](querying-distinct-counts.md).

## Introduction

When computing the number of distinct values for hierarchical aggregations (e.g. multiple grouping sets, rollups, or cubes), you
can speed up the computation by calling functions that produce arrays containing the distinct values. You can then call
[ARRAY_SIZE](../sql-reference/functions/array_size.md) to compute the count of those distinct values.

These aggregation functions that produce ARRAYs of distinct values can perform better than `COUNT(DISTINCT <expression>)` in
queries of the following forms:

* GROUP BY ROLLUP aggregate queries
* queries containing multiple grouping sets.

Unlike `COUNT(DISTINCT <expression>)` (which needs to be executed for each group), you can compose and reuse ARRAYs that
contain the distinct values. For hierarchical aggregations, you avoid repeatedly computing the distinct counts by producing these
ARRAYs once and reusing them in higher aggregation levels.

In addition, to improve performance further, you can produce these ARRAYs ahead of time (e.g. in a materialized view), rather
than during the query, and you can use these precomputed ARRAYs in your query.

## Creating an ARRAY Containing Distinct Values

To create an ARRAY that contains the distinct values in a column, call the [ARRAY_UNIQUE_AGG](../sql-reference/functions/array_unique_agg.md)
function in a SELECT statement.

`ARRAY_UNIQUE_AGG` is an aggregation function. Aggregation in this context means returning only one instance of a value that
appears in multiple rows. If multiple rows contain the value 3, `ARRAY_UNIQUE_AGG` just includes 3 once in the returned
ARRAY.

For example, create the following table containing a column of numeric values, and insert some values into that column.

```sqlexample
CREATE OR REPLACE TABLE array_unique_agg_test (a INTEGER);
INSERT INTO array_unique_agg_test VALUES (5), (2), (1), (2), (1);
```

Run the following command to produce an ARRAY that contains the distinct values in the column:

```sqlexample
SELECT ARRAY_UNIQUE_AGG(a) AS distinct_values FROM array_unique_agg_test;
```

```none
+-----------------+
| DISTINCT_VALUES |
|-----------------|
| [               |
|   5,            |
|   2,            |
|   1             |
| ]               |
+-----------------+
```

## Computing the Number of Distinct Values from the ARRAYs

To get the total count of the distinct values from the ARRAY, call [ARRAY_SIZE](../sql-reference/functions/array_size.md), passing in the
ARRAY created by [ARRAY_UNIQUE_AGG](../sql-reference/functions/array_unique_agg.md).

For example:

```sqlexample
SELECT ARRAY_SIZE(ARRAY_UNIQUE_AGG(a)) AS number_of_distinct_values FROM array_unique_agg_test;
```

```none
+---------------------------+
| NUMBER_OF_DISTINCT_VALUES |
|---------------------------|
|                         3 |
+---------------------------+
```

## Using Arrays to Improve Query Performance

The following examples demonstrate how to use the aggregation functions that produce ARRAYs of distinct values as an alternative
to `COUNT(DISTINCT <expression>)`.

* Example 1: Counting the Distinct Values in a Single Table
* Example 2: Using GROUP BY to Compute the Counts by Group
* Example 3: Using GROUP BY ROLLUP to Roll up Counts by Group

### Example 1: Counting the Distinct Values in a Single Table

Suppose that you want to count the number of distinct values in `my_column`. The following table compares the SQL statements
for performing this task with `COUNT(DISTINCT expression)` and `ARRAY_UNIQUE_AGG(expression)`.

| Example With COUNT(DISTINCT <expression>) | Example With ARRAY_UNIQUE_AGG(<expression>) |
| --- | --- |
| ```sqlexample SELECT   COUNT(DISTINCT my_column_1),   COUNT(DISTINCT my_column_2) FROM my_table; ``` | ```sqlexample SELECT   ARRAY_SIZE(ARRAY_UNIQUE_AGG(my_column_1)),   ARRAY_SIZE(ARRAY_UNIQUE_AGG(my_column_2)) FROM my_table; ``` |

### Example 2: Using GROUP BY to Compute the Counts by Group

Suppose that you want to count the number of distinct values in `my_column` by `my_key_1` and `my_key_2`.
The following table compares the SQL statements for performing this task with `COUNT(DISTINCT expression)` and
`ARRAY_UNIQUE_AGG(expression)`.

| Example With COUNT(DISTINCT <expression>) | Example With ARRAY_UNIQUE_AGG(<expression>) |
| --- | --- |
| ```sqlexample SELECT   COUNT(DISTINCT my_column_1),   COUNT(DISTINCT my_column_2) FROM my_table GROUP BY my_key_1, my_key_2; ``` | ```sqlexample SELECT   ARRAY_SIZE(ARRAY_UNIQUE_AGG(my_column_1)),   ARRAY_SIZE(ARRAY_UNIQUE_AGG(my_column_2)) FROM my_table GROUP BY my_key_1, my_key_2; ``` |

### Example 3: Using GROUP BY ROLLUP to Roll up Counts by Group

`ARRAY_UNIQUE_AGG` works even more efficiently for `GROUP BY ROLLUP` aggregate queries. ARRAYs are composable (in
contrast to `COUNT(DISTINCT <expression>)`), which results in less computation work and lower execution times.

Suppose that you want to roll up the number of distinct values in `my_column` by `my_key_1` and `my_key_2`. The
following table compares the SQL statements for performing this task with `COUNT(DISTINCT expression)` and
`ARRAY_UNIQUE_AGG(expression)`.

| Example With COUNT(DISTINCT <expression>) | Example With ARRAY_UNIQUE_AGG(<expression>) |
| --- | --- |
| ```sqlexample SELECT   COUNT(DISTINCT my_column) FROM my_table GROUP BY ROLLUP(my_key_1, my_key_2); ``` | ```sqlexample SELECT   ARRAY_SIZE(ARRAY_UNIQUE_AGG(my_column)) FROM my_table GROUP BY ROLLUP(my_key_1, my_key_2); ``` |

## Precomputing the ARRAYs

To improve performance, you can precompute the ARRAYs of distinct values in a table or materialized view.

For example, suppose that your data warehouse contains a fact table with multiple dimensions. You can define a materialized view
that constructs the ARRAYs to perform a coarse-grained precomputation or pre-aggregation before computing the final aggregates or
cubes that require a `COUNT(DISTINCT <expression>)`.

To collect the distinct values from the ARRAYs in each row, call the [ARRAY_UNION_AGG](../sql-reference/functions/array_union_agg.md) function.

The following example creates a table containing the ARRAYs and uses this table to compute the number of distinct values,
aggregated by different dimensions.

The following statement creates a table named `precompute` that contains the ARRAYs:

```sqlexample
CREATE TABLE precompute AS
SELECT
  my_dimension_1,
  my_dimension_2,
  ARRAY_UNIQUE_AGG(my_column) arr
FROM my_table
GROUP BY 1, 2;
```

The following statement computes the aggregates for `my_dimension_1` and `my_dimension_2`:

```sqlexample
SELECT
  my_dimension_1,
  my_dimension_2,
  ARRAY_SIZE(arr)
FROM precompute
GROUP BY 1, 2;
```

The following statement computes the aggregate only for `my_dimension_1`:

```sqlexample
SELECT
  my_dimension_1,
  ARRAY_SIZE(ARRAY_UNION_AGG(arr))
FROM precompute
GROUP BY 1;
```

The following statement computes the aggregate only for `my_dimension_2`:

```sqlexample
SELECT
  my_dimension_2,
  ARRAY_SIZE(ARRAY_UNION_AGG(arr))
FROM precompute
GROUP BY 1;
```

## Limitations

In Snowflake, ARRAY data types are limited to 16 MiB, which means that ARRAY_UNIQUE_AGG or ARRAY_UNION_AGG will generate an error
if the physical size of the output ARRAY exceeds this size.

In these cases, consider using a [bitmap aggregation](querying-bitmaps-for-distinct-counts.md) instead. As an alternative, you
can apply a bucketization technique similar to the one used for bitmap aggregations but with a different bucketization function
than BITMAP_BUCKET_NUMBER.

---
title: Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations
source: https://docs.snowflake.com/en/user-guide/querying-bitmaps-for-distinct-counts.md
section: User Guide
---

# Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations

If you are counting distinct values for hierarchical aggregations (e.g. multiple grouping sets, rollups, or cubes), you can
improve performance by producing bitmaps that represent the distinct values and computing the number of distinct values from these
bitmaps. Using this approach can be faster than using `COUNT(DISTINCT <expr>)`.

This topic explains how to use bitmaps to count distinct values.

For other techniques for counting distinct values, see [Computing the Number of Distinct Values](querying-distinct-counts.md).

## Introduction

When computing the number of distinct values for hierarchical aggregations (e.g. multiple grouping sets, rollups, or cubes), you
can speed up the computation by producing and querying a bitmap that represents the set of all possible distinct values.

* In this bitmap, you set the bits that correspond to the distinct values that are present in the data.
* When computing the number of distinct values, you use the bitmap functions to count the bits that are set in the bitmap (rather
  than querying the table with `COUNT(DISTINCT <expression>)`).

The bitmap functions can perform better than `COUNT(DISTINCT <expression>)` under the following conditions:

* The query performs a hierarchical aggregation (e.g. for multiple grouping sets, rollups, or cubes) that counts distinct values.

  Unlike `COUNT(DISTINCT <expression>)` (which needs to be executed for each group), you can compose and reuse bitmaps by
  calling the bitmap functions. This can reduce the cost of the query plan.
* The range of values is dense (e.g. the value is generated by a sequence)

  Note that if the value range is sparse, you can use the [DENSE_RANK](../sql-reference/functions/dense_rank.md) window function to
  transform the sparse range of values into a dense range of values.
* The range of values is small. A large range of values might require multiple bitmaps that do not fit into main memory and must
  be saved to disk.

In addition, to improve performance further, you can compute these bitmaps ahead of time (e.g. in a materialized view), rather
than during the query, and you can use these precomputed bitmaps in your query.

## Understanding How Bitmaps Identify Distinct Values

A bitmap is a contiguous piece of memory that is stored as a BINARY data type. A bitmap effectively is an array of bits that can
be set individually. For example, a 4-byte bitmap consists of 32 bits (4 bytes \* 8 bits per byte).

For each possible distinct value, you can use a bit in the bitmap to represent the presence or absence of the distinct value in
the data. For example, if the values 3 and 5 are present in the data, you can set the 3rd and 5th bits to 1 in the bitmap. (If the
distinct values are not numeric values, you must map the values to numeric values.)

For the bitmap functions in Snowflake, the default size of a bitmap is 32,768 bits (4 KiB). Note that this size does not
correspond to the physical size of the BINARY value. Internally, the bitmap functions manage the physical representation of the
bitmap, which might not be an actual bitmap. (For example, the functions might use an index vector.) The physical size of a bitmap
can vary from 10 bytes to 4108 bytes.

If the number of distinct values is greater than 32,768 bits, multiple bitmaps are needed to represent all of the values. The
process of dividing up the bits for distinct values into different bitmaps is called bucketization. For example, the bits for the
distinct values ranging from 1 - 65,536 are bucketized into two separate buckets. The bitmap in one bucket represents the values
1 - 32,768, and the bitmap in the other bucket represents the values 32,769 - 65,536. The bitmap in each bucket contains a subset
of the bits representing the distinct values.

The following diagram shows the logical representation of a bitmap. (As mentioned earlier, the physical representation of the
bitmap in the BINARY value might be different.)

A distinct value is represented by the combination of a bucket containing a bitmap and a bit that is set in that bitmap. To
identify the bucket and bit that represents a specific value, use the following functions:

* Call [BITMAP_BUCKET_NUMBER](../sql-reference/functions/bitmap_bucket_number.md) to identify the bucket containing the bitmap that has the bit for the
  value.
* Call [BITMAP_BIT_POSITION](../sql-reference/functions/bitmap_bit_position.md) to identify the zero-based position of the bit within the bitmap for
  the value.

For example, the numeric value 1 is represented by the bit at position 0 in bitmap 1:

```sqlexample
select bitmap_bucket_number(1), bitmap_bit_position(1);

+-------------------------+------------------------+
| BITMAP_BUCKET_NUMBER(1) | BITMAP_BIT_POSITION(1) |
|-------------------------+------------------------|
|                       1 |                      0 |
+-------------------------+------------------------+
```

The numeric value 32,768 is represented by the bit at position 32,767 in bitmap 1:

```sqlexample
select bitmap_bucket_number(32768), bitmap_bit_position(32768);

+-----------------------------+----------------------------+
| BITMAP_BUCKET_NUMBER(32768) | BITMAP_BIT_POSITION(32768) |
|-----------------------------+----------------------------|
|                           1 |                      32767 |
+-----------------------------+----------------------------+
```

As another example, the numeric value 32,769 is represented by the bit at position 0 in bitmap 2:

```sqlexample
select bitmap_bucket_number(32769), bitmap_bit_position(32769);

+-----------------------------+----------------------------+
| BITMAP_BUCKET_NUMBER(32769) | BITMAP_BIT_POSITION(32769) |
|-----------------------------+----------------------------|
|                           2 |                          0 |
+-----------------------------+----------------------------+
```

## Creating Bitmaps

To create bitmaps that represent all possible distinct values, call the [BITMAP_CONSTRUCT_AGG](../sql-reference/functions/bitmap_construct_agg.md)
function in a SELECT statement:

1. Pass in the value returned by [BITMAP_BIT_POSITION](../sql-reference/functions/bitmap_bit_position.md) for the column to the
   [BITMAP_CONSTRUCT_AGG](../sql-reference/functions/bitmap_construct_agg.md) function.
2. In the SELECT statement, select [BITMAP_BUCKET_NUMBER](../sql-reference/functions/bitmap_bucket_number.md) and use `GROUP BY` to aggregate the
   results for a given bitmap (identified by “bucket number”).

`BITMAP_CONSTRUCT_AGG` is an aggregation function. Aggregation in this context means setting the bit for a distinct value if
any row has that distinct value. If multiple rows contain the value 3, `BITMAP_CONSTRUCT_AGG` just sets the bit for 3 once
and does not change the value of the bit for the additional rows that contain 3.

For example, create the following table containing a column of numeric values. Insert two distinct values, one of which is greater
than 32768.

```sqlexample
CREATE OR REPLACE TABLE bitmap_test_values (val INT);
insert into bitmap_test_values values (1), (32769);
```

Run the following command to produce bitmaps with bits that represent the distinct values:

```sqlexample
-- Display the bitmap in hexadecimal
alter session set binary_output_format='hex';

select bitmap_bucket_number(val) as bitmap_id,
    bitmap_construct_agg(bitmap_bit_position(val)) as bitmap
    from bitmap_test_values
    group by bitmap_id;

+-----------+----------------------+
| BITMAP_ID | BITMAP               |
|-----------+----------------------|
|         1 | 00010000000000000000 |
|         2 | 00010000000000000000 |
+-----------+----------------------+
```

> **Note:**
>
> The `BITMAP` column contains a physical representation of the bitmap, which is not necessarily the actual bitmap. In this
> example, the column contains an index vector that represents the bitmap.
>
> An index vector is one way in which the bitmap functions store the physical representation of the bitmap. Depending on the
> number of values represented by the bitmap, the bitmap functions can use different physical representations for the bitmap.
>
> You should not expect the binary value of the bitmap to be stored in a specific format. To determine which bits are set, use the
> bitmap functions (rather than examining the binary value yourself).

Inserting additional rows with the same values does not change the resulting bitmap. The `BITMAP_CONSTRUCT_AGG` function
only sets the bit for a distinct value once.

```sqlexample
insert into bitmap_test_values values (32769), (32769), (1);

select bitmap_bucket_number(val) as bitmap_id,
    bitmap_construct_agg(bitmap_bit_position(val)) as bitmap
    from bitmap_test_values
    group by bitmap_id;

+-----------+----------------------+
| BITMAP_ID | BITMAP               |
|-----------+----------------------|
|         1 | 00010000000000000000 |
|         2 | 00010000000000000000 |
+-----------+----------------------+
```

Inserting other distinct values produces a different bitmap in which the corresponding bits for those values are set.

```sqlexample
insert into bitmap_test_values values (2), (3), (4);

select bitmap_bucket_number(val) as bitmap_id,
    bitmap_construct_agg(bitmap_bit_position(val)) as bitmap
    from bitmap_test_values
    group by bitmap_id;

+-----------+----------------------+
| BITMAP_ID | BITMAP               |
|-----------+----------------------|
|         1 | 00040000010002000300 |
|         2 | 00010000000000000000 |
+-----------+----------------------+
```

## Aggregating Bitmaps

If you need to aggregate different bitmaps in the same bucket (identified by the bucket number returned by
[BITMAP_BUCKET_NUMBER](../sql-reference/functions/bitmap_bucket_number.md)), call [BITMAP_OR_AGG](../sql-reference/functions/bitmap_or_agg.md).

## Computing the Number of Distinct Values from the Bitmaps

To get the total count of the distinct values from the bitmaps, call [BITMAP_COUNT](../sql-reference/functions/bitmap_count.md), passing in a
bitmap created by [BITMAP_CONSTRUCT_AGG](../sql-reference/functions/bitmap_construct_agg.md) or [BITMAP_OR_AGG](../sql-reference/functions/bitmap_or_agg.md).

For example:

```sqlexample
select bitmap_bucket_number(val) as bitmap_id,
    bitmap_count(bitmap_construct_agg(bitmap_bit_position(val))) as distinct_values
    from bitmap_test_values
    group by bitmap_id;

+-----------+-----------------+
| BITMAP_ID | DISTINCT_VALUES |
|-----------+-----------------|
|         1 |               4 |
|         2 |               1 |
+-----------+-----------------+
```

## Using Bitmaps to Improve Query Performance

The following examples demonstrate how to use the bitmap functions as an alternative to `COUNT(DISTINCT <expression>)`.

* Example 1: Counting the Distinct Values in a Single Table
* Example 2: Using GROUP BY to Compute the Counts by Group
* Example 3: Using GROUP BY ROLLUP to Roll up Counts by Group

### Example 1: Counting the Distinct Values in a Single Table

Suppose that you want to count the number of distinct values in `my_column`. The following table compares the SQL statements
for performing this task with `COUNT(DISTINCT expression)` and the bitmap functions.

| Example With COUNT(DISTINCT <expression>) | Example With Bitmap Functions |
| --- | --- |
| ```sqlexample SELECT   COUNT(DISTINCT my_column) FROM my_table; ``` | ```sqlexample SELECT SUM(cnt) FROM (   SELECT     BITMAP_COUNT(BITMAP_CONSTRUCT_AGG(BITMAP_BIT_POSITION(my_column))) cnt   FROM my_table   GROUP BY BITMAP_BUCKET_NUMBER(my_table) ); ```  Note that if the range of values in `my_column` is 0 to 32,768, you can use this simpler statement instead:  ```sqlexample -- If the full value range of my_column fits into the bitmap: --   MIN(my_column) >= 0 AND MAX(my_column) < 32,768 SELECT   BITMAP_COUNT(BITMAP_CONSTRUCT_AGG(my_column)) FROM my_table; ``` |

### Example 2: Using GROUP BY to Compute the Counts by Group

Suppose that you want to count the number of distinct values in `my_column` by `my_key_1` and `my_key_2`. The
following table compares the SQL statements for performing this task with `COUNT(DISTINCT expression)` and the bitmap
functions.

| Example With COUNT(DISTINCT <expression>) | Example With Bitmap Functions |
| --- | --- |
| ```sqlexample SELECT   my_key_1,   my_key_2,   COUNT(DISTINCT my_column) FROM my_table GROUP BY my_key_1, my_key_2; ``` | ```sqlexample SELECT my_key_1, my_key_2, SUM(cnt) FROM (   SELECT     my_key_1,     my_key_2,     BITMAP_COUNT(BITMAP_CONSTRUCT_AGG(BITMAP_BIT_POSITION(my_column))) cnt   FROM my_table   GROUP BY my_key_1, my_key_2, BITMAP_BUCKET_NUMBER(my_column) ) GROUP BY my_key_1, my_key_2; ``` |

### Example 3: Using GROUP BY ROLLUP to Roll up Counts by Group

Bitmap functions work even more efficiently for `GROUP BY ROLLUP` aggregate queries. Bitmaps are composable (in contrast to
`COUNT(DISTINCT <expression>)`), which results in less computation work and lower execution times.

Suppose that you want to roll up the number of distinct values in `my_column` by `my_key_1` and `my_key_2`. The
following table compares the SQL statements for performing this task with `COUNT(DISTINCT expression)` and the bitmap
functions.

| Example With COUNT(DISTINCT <expression>) | Example With Bitmap Functions |
| --- | --- |
| ```sqlexample SELECT   my_key_1,   my_key_2,   COUNT(DISTINCT my_column) FROM my_table GROUP BY ROLLUP(my_key_1, my_key_2); ``` | ```sqlexample SELECT my_key_1, my_key_2, SUM(cnt) FROM (   SELECT     my_key_1,     my_key_2,     BITMAP_COUNT(BITMAP_CONSTRUCT_AGG(BITMAP_BIT_POSITION(my_column))) cnt   FROM my_table   GROUP BY ROLLUP(my_key_1, my_key_2), BITMAP_BUCKET_NUMBER(my_column) ) GROUP BY my_key_1, my_key_2; ``` |

## Precomputing the Bitmaps

To improve performance, you can precompute the counts of distinct values in a table or materialized view.

For example, suppose that your data warehouse contains a fact table with multiple dimensions. You can define a materialized view
that constructs the bitmaps to perform a coarse-grained precomputation or pre-aggregation before computing the final aggregates
or cubes that require a `COUNT(DISTINCT <expression>)`.

The following example creates a table containing the bitmaps and uses this table to compute the number of distinct values,
aggregated by different dimensions.

The following statement creates a table named `precompute` that contains the bitmaps and bucket information:

```sqlexample
CREATE TABLE precompute AS
SELECT
  my_dimension_1,
  my_dimension_2,
  BITMAP_BUCKET_NUMBER(my_column) bucket,
  BITMAP_CONSTRUCT_AGG(BITMAP_BIT_POSITION(my_column)) bmp
FROM my_table
GROUP BY 1, 2, 3;
```

The following statement computes the aggregates for `my_dimension_1` and `my_dimension_2`:

```sqlexample
SELECT
  my_dimension_1,
  my_dimension_2,
  SUM(BITMAP_COUNT(bmp))
FROM precompute
GROUP BY 1, 2;
```

The following statement computes the aggregate only for `my_dimension_1`:

```sqlexample
SELECT my_dimension_1, SUM(cnt) FROM (
  SELECT
    my_dimension_1,
    BITMAP_COUNT(BITMAP_OR_AGG(bmp)) cnt
  FROM precompute
  GROUP BY 1, bucket
)
GROUP BY 1;
```

The following statement computes the aggregate only for `my_dimension_2`:

```sqlexample
SELECT my_dimension_2, SUM(cnt) FROM (
  SELECT
    my_dimension_2,
    BITMAP_COUNT(BITMAP_OR_AGG(bmp)) cnt
  FROM precompute
  GROUP BY 1, bucket
)
GROUP BY 1;
```

---
title: Using budgets for AI features (shared resources)
source: https://docs.snowflake.com/en/user-guide/budgets/budget-shared-resources.md
section: User Guide
---

# Using budgets for AI features (shared resources)

A shared resource is a Snowflake resource that is used by more than one business unit or team. AI features (such as AI Functions, Snowflake Intelligence, Cortex Agents, and Cortex Code) are examples of shared resources. You can add these resources to a budget and configure the budget so that credits consumed by them count toward the budget’s spending limit only when selected users consume those credits. This enables tracking and controlling usage across different teams or cost centers.

For example, suppose multiple teams use the same AI function. You can track consumption for each of the teams in separate budgets based
on which users are calling the function — one budget for engineering users and another for finance users.

## Workflow for tracking consumption by shared resources

Tracking consumption by a shared resource based on the user who is using the resource consists of the following workflow:

1. Apply a tag-value pair to a user who uses the shared resource.
2. Add to the budget the tag-value pair that you applied to the user.
3. Add the shared resource to the budget.

## Apply a tag to a user

A [tag](../object-tagging/introduction.md) is a schema-level object that can be applied to another object. When you apply a
tag to an object, you can set the tag to a value, thereby creating a tag-value pair.

You can group users into logical units such as cost centers by applying the same tag-value pair to each of the users. The first step in
tracking consumption of shared resources is to apply a tag-value pair to every user that belongs to a unit. You can then use a budget to
track consumption by these users while ignoring the consumption of the same shared resource by other users.

Use the [ALTER USER](../../sql-reference/sql/alter-user.md) command to apply a tag to users. Suppose you use the `cost_center` tag to identify cost
centers within your organization, and that the user `joe` belongs to the cost center `finance`. To apply the correct tag-value pair to
the user, run the following command:

```sqlexample
ALTER USER joe SET TAG cost_management.tags.cost_center = 'FINANCE';
```

## Add the user tag to the budget

After tagging all users in the logical unit, you must add the tag-value pair to the budget so it can track consumption by the users. Use
the [SET_USER_TAGS](../../sql-reference/classes/budget/methods/set_user_tags.md) method to add the tag to the budget.

In the following example, when a shared resource consumes credits, the `finance_budget` budget will only track consumption by users with
the `cost_center = 'FINANCE'` tag-value pair.

```sqlexample
CALL finance_budget!SET_USER_TAGS(
  [
    [(SELECT SYSTEM$REFERENCE('TAG', 'COST_MANAGEMENT.TAGS.COST_CENTER', 'SESSION', 'APPLYBUDGET')),
    'FINANCE']
  ],
  'UNION');
```

The SET_USER_TAGS method lets you add all of your user tags to the budget at once. It also lets you configure the budget so that usage is
included if a user is tagged with *any* of the user tags (UNION) or configure it so usage is included only if the user is tagged with *all*
of the user tags (INTERSECTION).

In the following example, the `my_budget` budget tracks consumption when shared resources are acted upon by users tagged with *both*
the tag-value combination `cost_center = 'sales'` and the tag-value combination `project = 'phoenix'`.

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_USER_TAGS(
  [
    [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.cost_center', 'SESSION', 'APPLYBUDGET')), 'SALES'],
    [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.project', 'SESSION', 'APPLYBUDGET')), 'PHOENIX']
  ],
  'INTERSECTION');
```

To verify the results of the method, call the [GET_BUDGET_SCOPE](../../sql-reference/classes/budget/methods/get_budget_scope.md) method.

## Add AI features (shared resources) to a budget

After you have configured the users who are using AI features, you must specify which of these features will be tracked by the budget.
Use the [ADD_SHARED_RESOURCE](../../sql-reference/classes/budget/methods/add_shared_resource.md) method to add an AI feature to the budget.

Supported AI feature domains include:

* `AI FUNCTION` — Model inference functions
* `CORTEX CODE` — Cortex Code workloads (CLI, Snowsight)
* `CORTEX AGENT` — Cortex agent-based workflows (domain-level only)
* `SNOWFLAKE INTELLIGENCE` — Snowflake Intelligence workloads (domain-level only)

> **Tip:**
>
> You can use the [SYSTEM$SHOW_BUDGET_SHARED_RESOURCE_CANDIDATES](../../sql-reference/functions/system_show_budget_shared_resource_candidates.md) function to return a list of resources that can be added as
> shared resources to a budget.

**Example: Add all AI functions to the budget**

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('AI FUNCTION');
```

---

**Example: Add the AI_CLASSIFY function to the budget**

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('AI FUNCTION', 'AI_CLASSIFY');
```

## Creating a budget for AI workloads in Snowsight

You can create and configure budgets for AI workloads directly in Snowsight using a guided user interface.

> **Note:**
>
> Using tags to define the scope of a budget is required for shared resources such as AI workloads.

1. Sign in to Snowsight.
2. In the navigation menu, select Admin » Cost management.
3. Select the Budgets tab.
4. Select + Budget on the top right corner.
5. On the Basic Information page, complete the required fields.
6. On the Budget scope page, add the objects that you want to include in the budget.
7. For setting budgets on AI features (shared resources), move to the Budgets Scope page and update as follows.

   In the Tags on users section:

   * Search for and select relevant tags (for example, cost center or team).
   * This enables tracking activity for tagged users, which is required when monitoring shared resources.
   * Select AI resources to monitor.

   In the Select resources to monitor section, enable one or more of the following:

   * AI Functions
   * Cortex Code
   * Cortex Agents
   * Snowflake Intelligence
8. Configure AI Functions.

   * By default, all AI functions are selected, and future AI functions are automatically included.
   * You can also choose to selectively choose specific functions (for example, `AI_CLASSIFY`, `AI_COMPLETE`). For a complete list, see [Snowflake Cortex AI Functions (including LLM functions)](../snowflake-cortex/aisql.md).
9. Configure Cortex Code.

   * By default, future Cortex Code interfaces are automatically included.
   * You can also choose to select specific instances (for example, `CLI`, `Snowsight`).
10. Configure domain-level resources.

    * **Cortex Agents** and **Snowflake Intelligence** can be selected only at the domain level.
11. Review your selections.

    Confirm that the correct resources are selected, ensure that any selected tags correctly reflect the intended scope.
12. Complete the remaining configuration and click **Create**

> **Note:**
>
> * AI workloads are tracked as shared resources and are attributed based on user activity and applied tags.
> * Selecting **All (auto)** ensures that new instances for the domain are automatically included as they become available.

## Limitations and considerations

* For AI functions, the budget tracks the AI_SERVICES service type.

## Related methods

* [ADD_SHARED_RESOURCE](../../sql-reference/classes/budget/methods/add_shared_resource.md)
* [GET_SHARED_RESOURCES](../../sql-reference/classes/budget/methods/get_shared_resources.md)
* [REMOVE_SHARED_RESOURCE](../../sql-reference/classes/budget/methods/remove_shared_resource.md)
* [SET_USER_TAGS](../../sql-reference/classes/budget/methods/set_user_tags.md)

---
title: Using Contacts
source: https://docs.snowflake.com/en/user-guide/contacts-using.md
section: User Guide
---

# Using Contacts

Contacts are schema-level objects that contain details about which user or group of users can be contacted for a specific purpose. For
example, one contact named `data_stewards` might include an email distribution list while another named `support_department` might
include the URL of the department’s website.

Contacts can be associated with other objects such as databases and tables so the right person can be contacted for assistance with those
objects. For example, there might be a contact on a table that contains the users who can help gain access to the table. The purpose of the
contact is not a property of the contact, but rather the association between a contact and a specific object.
For example, the same contact might provide general support for one table while providing access approval for another.

An object can have more than one contact as long as the purpose of each contact is unique for the object. For example, a table might have
one contact that grants access to the table and another contact that provides general support for the table. When a user views the contacts
associated with an object, they see the purpose of each contact along with a communication method, so they know who to communicate with for
a specific reason and how to reach them.

Data users see these contacts when they are using the Database Explorer in Snowsight to navigate their databases, schemas, and
table-like objects. Snowflake features that send notifications to users can use the contact associated with an object to communicate
with the users.

## Inheriting and overriding contacts

Contacts are inherited by descendant objects. If you associate a contact with an object that is the parent of another object, the child,
grandchild, and so on inherit the contact. For example, if you associate a contact with a schema, all of the tables in the schema inherit
the contact by default.

Contact inheritance is overridden if a child object has a contact with the same purpose. For example, suppose someone associates the
following two contacts with the `ac_sch` schema, which contains the table `t1`:

| Contact | Purpose |
| --- | --- |
| `data_stewards` | Steward |
| `business_unit1` | Approver |

Now suppose someone associates the contact `finance_dept` with `t1` for the purpose of access approval. The contacts associated with
`t1` are now the following:

| Contact | Purpose |
| --- | --- |
| `data_stewards` | Steward |
| `finance_dept` | Approver |

The contact responsible for access approval on the `ac_sch` has been replaced with a contact directly associated with `t1`, but
`t1` continues to inherit the `data_stewards` contact from the schema.

All objects inherit contacts set on the account unless overridden by an association further down in the inheritance hierarchy.

## Supported objects

You can associate a contact with the following objects:

|  |  |  |
| --- | --- | --- |
| * Database * Schema * Table * Apache Iceberg™ table | * External table * Dynamic table * Event table | * View * Materialized view * Task (SQL only) |

## Create a contact

When you create a contact, you specify the name of the contact and how the contact can be reached. Communication methods include the
following:

* The URL of a website.
* An email address, which can be a distribution list.
* A list of Snowflake users.

You can create a contact using Snowsight or SQL.

> **Tip:**
>
> Creating all contacts in a dedicated schema can be helpful.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer.
    3. Navigate to the schema where you want to create the contact.
    4. Select Create » Contact.
    5. Specify the name of the contact.
    6. Choose a communication method, then specify the email address, users, or URL of the people who can be contacted for assistance with an
       object.
    7. Select the roles that will have the ability to associate the contact with objects. These roles are granted the APPLY privilege on the
       contact.
    8. Select Create.

SQL:
:   You can execute the [CREATE CONTACT](../sql-reference/sql/create-contact.md) command to create a new contact.

    **Examples**

    Create a contact for the support team that is reached through their website.

    ```sqlexample
    CREATE CONTACT support_dept URL = 'http://internalsupport.example.com';
    ```

    Create a contact for the finance team that is reached via an email address, which acts as a distribution list.

    ```sqlexample
    CREATE CONTACT finance_dept EMAIL_DISTRIBUTION_LIST = 'fd_dl@example.com';
    ```

    Create a contact for database administrators, as identified by the name of their Snowflake user objects.

    ```sqlexample
    CREATE CONTACT db_admins USERS = ('ex_admin1', 'ex_admin2');
    ```

## Associate a contact with an object

When you associate a contact with an object, you specify the name of the contact along with the purpose of the association between the
contact and the object. When users view all of the contacts associated with an object, they’ll be able to decide who to communicate with based
on the purpose of each contact. If a Snowflake feature uses the contact to reach people, it will be able to select the right contacts based
on the purpose.

The purpose of having a contact associated with an object can be one of the following:

| Purpose | Description | SQL value |
| --- | --- | --- |
| Approver | Approves or rejects requests to access data. | `ACCESS_APPROVAL` |
| Security and compliance | Receives security and compliance updates. | `SECURITY_COMPLIANCE` |
| Data steward | Provides information about the accuracy, consistency, and reliability of data. | `STEWARD` |
| Support | Provides technical support related to a dataset. | `SUPPORT` |

You can associate a contact and define its purpose when modifying an existing object or creating a new object.

### Associate a contact with an existing object

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer.
    3. Navigate to one of the supported objects.
    4. Select the Details tab.
    5. Find the Assigned Contacts section and select the Edit icon.
    6. Select a contact for one or more of the purposes. For example, if you select a contact from the Approver drop-down list, data
       users will reach out to that contact when they need access to the object.
    7. Select Save.

SQL:
:   The ALTER … SET CONTACT command for an existing object lets you associate a contact and specify the purpose of the contact for that
    object. The syntax to associate the contact is the same for all objects that can be associated with a contact:

    ```sqlsyntax
    ALTER <object_type> <object_name>
      SET CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ]
    ```

    The `purpose` must be one of the predefined purposes, which describe the contact’s
    relationship to the object. When users view the contacts of an object, they use the purpose to determine which of the contacts to
    communicate with.

    **Examples**

    Associate the `finance_dept` contact with a table so users know who to communicate with when they need access to the table:

    ```sqlexample
    ALTER TABLE t1 SET CONTACT ACCESS_APPROVAL = finance_dept;
    ```

    Associate the `security_officers` contact with the account so Snowflake features can send updates related to security and compliance:

    ```sqlexample
    ALTER ACCOUNT SET CONTACT SECURITY_COMPLIANCE = security_officers;
    ```

    Associate the `data_stewards` contact with a schema so users know who to communicate with regarding object tagging of tables in the schema:

    ```sqlexample
    ALTER SCHEMA sch1 SET CONTACT STEWARD = data_stewards;
    ```

    > **Note:**
    >
    > If you want to set a contact on an existing Iceberg table, external table, or dynamic table, you must use the ALTER TABLE
    > command.

### Associate a contact when creating a new object

The CREATE … WITH CONTACT command lets you associate a contact with a new object. The syntax for the WITH CONTACT clause is the same for all
objects that can be associated with a contact:

```sqlsyntax
WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] )
```

The `purpose` must be one of the predefined purposes, which describe the contact’s
relationship to the object. When users view the contacts of an object, they use the purpose to determine which of the contacts to
communicate with.

For tables and table-like objects, the WITH CONTACT clause is specified after the column definitions.

The organization administrator can’t associate a contact when creating an account.

#### Examples

Associate the `finance_dept` contact with a new table so users know who to communicate with when they need access to the table:

```sqlexample
CREATE TABLE t1 (col1 VARCHAR, col2 INT) WITH CONTACT (ACCESS_APPROVAL = finance_dept);
```

Associate the `data_stewards` contact with a new schema so users know who to communicate with regarding object tagging of tables in the
schema and the `finance_dept` contact so users know who to communicate with when they need access:

```sqlexample
CREATE SCHEMA sch1 WITH CONTACT (STEWARD = data_stewards, ACCESS_APPROVAL = finance_dept);
```

## Detach a contact from an object

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer.
    3. Navigate to the object.
    4. Select the Details tab.
    5. Find the Assigned Contacts section and select the Edit icon.
    6. Find the purpose (for example, Approver) that has the contact that you want to detach, and use the drop-down list to select None.
    7. Select Save.

SQL:
:   ALTER … UNSET CONTACT command lets you detach a contact from an object. The syntax to detach the contact is the same for all
    objects that can be associated with a contact:

    ```sqlsyntax
    ALTER <object_type> <object_name>
      UNSET CONTACT <purpose>
    ```

    You identify the contact to detach by specifying the purpose of the association between the contact and the object, not by the contact
    name. For a list of possible purposes that can be specified to detach a contact, see
    predefined purposes.

    For example, to detach the contact that was added as the STEWARD of a table, run:

    > ```sqlexample
    > ALTER TABLE t1 UNSET CONTACT STEWARD;
    > ```

    > **Note:**
    >
    > If you want to unset a contact on an existing Iceberg table, external table, or dynamic table, you must use the ALTER TABLE
    > command.

## View contacts for an object

Snowsight:
:   When you navigate to an object in Snowsight, the contacts that are associated with the object appear on the Details tab.

SQL:
:   Users with at least one privilege on an object can use the [GET_CONTACTS](../sql-reference/functions/get_contacts.md) table function to view the
    contacts associated with that object. The function returns a row for each contact associated with the object.

    For example, to list the contacts on the table `t1`, a user with at least one privilege can execute the following:

    ```sqlexample
    SELECT *
      FROM TABLE(SNOWFLAKE.CORE.GET_CONTACTS('t1', 'TABLE'));
    ```

    For each contact, the function lists the following:

    * Purpose of the contact.
    * Method of communication for the contact.
    * Whether the contact was associated with the object directly or inherited from a parent object.

## Governing contacts and their associations

Snowsight:
:   To list the contacts that have been created in a schema and drill down into the details for a specific contact:

    1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer, and then select the schema.
    3. Select the Contacts tab.
    4. Select a contact if you want to view details about it, including the objects with which it is associated.

SQL:
:   The ACCOUNT_USAGE schema provides the following views to help manage contacts:

    * [CONTACTS view](../sql-reference/account-usage/contacts.md) - Lists all contacts in the account.
    * [CONTACT_REFERENCES view](../sql-reference/account-usage/contact_references.md) - Lists the objects with which a contact has been associated.

## Access control privileges

The following summarizes the privileges a user needs to work with contacts.

| Task | Required privileges |
| --- | --- |
| Create a contact | Both of the following:   * CREATE CONTACT on the schema * USAGE on the schema and database |
| Associate a contact with an object | Either of the following:   * APPLY CONTACT on the account * APPLY privilege on the contact and OWNERSHIP on the object |
| List the contacts for an object | Any privilege on the object. |
| Detach a contact from an object | Either of the following:   * APPLY CONTACT on the account * APPLY privilege on the contact and OWNERSHIP on the object |
| [Modify an existing contact](../sql-reference/sql/alter-contact.md) | Either of the following:   * OWNERSHIP on the contact * MODIFY on the contact |
| [Drop a contact](../sql-reference/sql/drop-contact.md) | OWNERSHIP on the contact |

---
title: Using cost insights to save
source: https://docs.snowflake.com/en/user-guide/cost-insights.md
section: User Guide
---

# Using cost insights to save

Snowflake provides cost insights that identify opportunities to optimize Snowflake for cost within a particular account. These insights are
calculated and refreshed weekly.

Each insight indicates how many credits or terabytes could be saved by optimizing Snowflake.

To access the Cost Insights tile:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. Switch to a role with [access to cost-related features](cost-access-control.md).
3. In the navigation menu, select Admin » Cost management.
4. Select the Account Overview tab.
5. Find the Cost insights tile.

Each of the following insights includes suggestions on how to optimize your spend.

* Insight: Rarely used tables with automatic clustering
* Insight: Rarely used materialized views
* Insight: Rarely used search optimization paths
* Insight: Large tables that are never queried
* Insight: Tables over 100 GB from which data is written but not read
* Insight: Short-lived permanent tables
* Insight: Inefficient usage of multi-cluster warehouses

Insight: Rarely used tables with automatic clustering
:   This insight identifies tables with [automatic clustering](tables-auto-reclustering.md) that are queried fewer than 100
    times per week by this account.

    Enabling automatic clustering for a table can significantly improve the performance of queries against that table. However, as the table
    changes, Snowflake must use serverless compute resources to keep it in a well-clustered state. If the number of queries executed against
    the table is minimal, the cost incurred might not justify the performance improvements.

    **Recommendation:**
    Consider disabling automatic clustering on these tables. Before you turn off automatic clustering, determine whether the table exists
    solely for disaster recovery purposes or for use by other Snowflake accounts through data sharing, which might explain why it isn’t
    accessed frequently.

    For example, to disable automatic clustering for a table named `t1`, execute the following command:

    ```sqlexample
    ALTER TABLE t1 SUSPEND RECLUSTER;
    ```

Insight: Rarely used materialized views
:   This insight identifies [materialized views](views-materialized.md) that are queried fewer than 10 times per week by this
    account.

    Creating a materialized view can significantly improve performance for certain query patterns. However, materialized views incur
    additional storage costs as well as serverless compute costs associated with keeping the materialized view up to date with new data. If
    the number of queries executed against the materialized view is minimal, the cost incurred might not justify the performance improvements.

    **Recommendation:**
    Consider removing or suspending updates to the materialized views. Before you drop a materialized view, determine whether the table exists
    solely for disaster recovery purposes or for use by other Snowflake accounts through data sharing, which might explain why it isn’t
    accessed frequently.

    For example, to delete a materialized view named `mv1`, execute the following command:

    ```sqlexample
    DROP MATERIALIZED VIEW mv1;
    ```

Insight: Rarely used search optimization paths
:   This insight identifies [search optimization](search-optimization-service.md) access paths that are used fewer than
    10 times per week by this account.

    Search optimization uses search access paths to improve the performance of certain types of point lookup and analytical queries. Adding
    search optimization to a table can significantly improve performance for these queries. However, search optimization incurs additional
    storage costs as well as serverless compute costs associated with keeping that storage up to date. If the number of queries that use the
    search access path created by search optimization is minimal, the cost incurred might not justify the performance improvements.

    **Recommendation:**
    Consider removing search optimization from the table. Before you remove search optimization, determine whether the table exists solely
    for disaster recovery purposes or for use by other Snowflake accounts through data sharing, which might explain why it isn’t accessed
    frequently.

    For example, to completely remove search optimization from a table named `t1`, execute the following command:

    ```sqlexample
    ALTER TABLE t1 DROP SEARCH OPTIMIZATION;
    ```

Insight: Large tables that are never queried
:   This insight identifies large tables that have not been queried in the last week by this account.

    **Recommendation:**
    Consider deleting unused tables, which can reduce storage costs without impacting any workloads. Before you drop the tables, determine
    whether the table exists solely for disaster recovery purposes or for use by other Snowflake accounts through data sharing, which might
    explain why it isn’t accessed frequently.

    For example, to delete a table name `t1`, execute the following command:

    ```sqlexample
    DROP TABLE t1;
    ```

Insight: Tables over 100 GB from which data is written but not read
:   This insight identifies tables where data is written but never read by this account.

    **Recommendation:**
    It might be wasteful to store data and ingest new data into Snowflake if the data is never read. Consider dropping these tables to save on
    storage costs or stop writing new data to save on credits consumed by ingestion. Before you drop the tables, determine whether the table
    exists solely for disaster recovery purposes or for use by other Snowflake accounts through data sharing, which might explain why it
    isn’t being read.

    For example, to drop a table name `t1`, execute the following command:

    ```sqlexample
    DROP TABLE t1;
    ```

Insight: Short-lived permanent tables
:   This insight identifies tables over 100 GB that were deleted within 24 hours of their creation.

    **Recommendation:** If data needs to be persisted for only a short time, consider using a
    [temporary table or transient table](tables-temp-transient.md) for future tables. Using a temporary table or transient
    table might help you save on [Fail-safe and Time Travel costs](data-cdp-storage-costs.md).

    For example, to create a new transient table `t1`, execute the following command:

    ```sqlexample
    CREATE TRANSIENT TABLE t1;
    ```

Insight: Inefficient usage of multi-cluster warehouses
:   This insight identifies when you have the minimum and maximum cluster count set to the same value for a multi-cluster warehouse, which
    prevents the warehouse from scaling up or down to respond to demand. If your multi-cluster warehouse can scale down during periods of
    lighter usage, it can save credits.

    **Recommendation:** Consider lowering the minimum cluster count to allow the multi-cluster warehouse to scale down during periods of
    lighter usage.

    For example, to set the minimum cluster count to 1 for a warehouse named `wh1`, execute the following command:

    ```sqlexample
    ALTER WAREHOUSE wh1 SET MIN_CLUSTER_COUNT = 1;
    ```

---
title: Using custom classifiers to implement custom semantic categories
source: https://docs.snowflake.com/en/user-guide/classify-custom-using.md
section: User Guide
---

# Using custom classifiers to implement custom semantic categories

The CUSTOM_CLASSIFIER [class](../sql-reference-classes.md) allows data engineers to extend their sensitive data classification capabilities
based on their own knowledge of their data. To classify sensitive data into [custom semantic categories](classify-custom.md),
create an instance of the CUSTOM_CLASSIFIER class in a schema and call instance methods to add regular
expressions associated with the instance.

For an end-to-end example of using a CUSTOM_CLASSIFIER instance to create a custom semantic category, see Example.

## Commands and methods

The following methods and SQL commands are supported:

* Commands:

  + [CREATE CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/create-custom-classifier.md)
  + [DROP CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/drop-custom-classifier.md)
  + [SHOW CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/show-custom-classifiers.md)
* Methods:

  + [custom_classifier!ADD_REGEX](../sql-reference/classes/custom_classifier/methods/add_regex.md)
  + [custom_classifier!DELETE_CATEGORY](../sql-reference/classes/custom_classifier/methods/delete_category.md)
  + [custom_classifier!LIST](../sql-reference/classes/custom_classifier/methods/list.md)

## Access control

These sections summarize the roles and grants on various objects that you need to use an instance.

### Roles

You can use the following roles with custom classification:

* SNOWFLAKE.CLASSIFICATION_ADMIN: database role that enables you to create a custom classifier instance.
* `custom_classifier`!PRIVACY_USER: [instance role](../sql-reference/snowflake-db-classes.md) that enables you to call the following methods on
  the instance:

  + ADD_REGEX
  + LIST
  + DELETE_CATEGORY
* The account role with the OWNERSHIP privilege on the instance can run these commands:

  + DROP CUSTOM_CLASSIFIER
  + SHOW CUSTOM_CLASSIFIER

### Grants

To create and manage instances, you can choose to either grant the CREATE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER
[privilege](../sql-reference/sql/grant-privilege.md) to a role or grant the PRIVACY_USER instance role to a role.

You can grant the instance roles to account roles and database roles to enable other users to work with custom classifier instances:

```sqlsyntax
GRANT SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER ROLE <name>!PRIVACY_USER
  TO ROLE <role_name>

REVOKE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER ROLE <name>!PRIVACY_USER
  FROM ROLE <role_name>

GRANT SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER ROLE <name>!PRIVACY_USER
  TO DATABASE ROLE <database_role_name>

REVOKE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER ROLE <name>!PRIVACY_USER
  FROM DATABASE ROLE <database_role_name>
```

Where:

`name`
:   Specifies the name of the custom classifier instance.

`role_name`
:   Specifies the name of an account role.

`database_role_name`
:   Specifies the name of a database role.

You must use a warehouse to call methods on the instance.

To grant the custom role `my_classification_role` the required instance role and privileges to create and use an instance of the
CUSTOM_CLASSIFIER class, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT DATABASE ROLE SNOWFLAKE.CLASSIFICATION_ADMIN
  TO ROLE my_classification_role;
GRANT USAGE ON DATABASE mydb TO ROLE my_classification_role;
GRANT USAGE ON SCHEMA mydb.instances TO ROLE my_classification_role;
GRANT USAGE ON WAREHOUSE wh_classification TO ROLE my_classification_role;
```

If you would like to enable a specific role, such as `data_analyst` to use a specific instance, do the following:

```sqlexample
GRANT SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER ROLE
  mydb.sch.my_instance!PRIVACY_USER TO ROLE data_analyst;
```

## Example

The high-level approach to classify data with custom classifiers is as follows:

1. Identify a table to classify.
2. Use SQL to do the following:

   1. Create a custom classifier instance.
   2. Add the custom semantic category and regular expressions to the instance.
   3. Classify the table.

Complete these steps to create a custom classifier to classify a table:

1. Consider a table, `data.tables.employee_roster`, in which one of its columns contains internal employee identifiers.

   > ```none
   > +-------------+------------------------+
   > | EMPLOYEE_ID | EMPLOYEE_NAME          |
   > +-------------+------------------------+
   > | 100001      | Employee A             |
   > | 100002      | Employee B             |
   > | 100003      | Employee C             |
   > +-------------+------------------------+
   > ```

   This table might also include other identifying columns, such as personal email address, work location, and manager information.
   The data owner can classify the table to ensure that the columns are tagged correctly so the table can be monitored.

   In this example, the data owner already has these privileges granted to their role:

   * OWNERSHIP on the table to classify.
   * OWNERSHIP on the schema that contains the table.
   * USAGE on the database that contains the schema and table.
2. Enable the data owner to classify the table by granting the SNOWFLAKE.CLASSIFICATION_ADMIN database role to the data owner role:

   > ```sqlexample
   > USE ROLE ACCOUNTADMIN;
   > GRANT DATABASE ROLE SNOWFLAKE.CLASSIFICATION_ADMIN
   >   TO ROLE data_owner;
   > ```
3. As the data owner, create a schema to store your custom classifier instances:

   > ```sqlexample
   > USE ROLE data_owner;
   > CREATE SCHEMA data.classifiers;
   > ```
4. Use the [CREATE CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/create-custom-classifier.md) command to create a custom classifier
   instance in the `data.classifiers` schema:

   ```sqlexample
   CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER internal_ids();
   ```

   You can optionally update your [search path](../sql-reference/snowflake-db-classes.md) as follows:

   * Add `SNOWFLAKE.DATA_PRIVACY` so that you don’t have to specify the fully qualified name of the class when creating a new
     instance of the class.
   * Add `DATA.CLASSIFIERS` so that you don’t have to specify the fully qualified name of the instance when calling a method on the
     instance or using a command with the instance.
5. Use a [SHOW CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/show-custom-classifiers.md) command to list each instance that you create.
   For example:

   ```sqlexample
   SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER;
   ```

   Returns:

   ```output
   +----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
   | created_on                       | name          | database_name | schema_name | current_version | comment | owner       |
   +----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
   | 2023-09-08 07:00:00.123000+00:00 | INTERNAL_IDS  | DATA          | CLASSIFIERS | 1.0             | None    | DATA_OWNER  |
   +----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
   ```
6. Call the [custom_classifier!ADD_REGEX](../sql-reference/classes/custom_classifier/methods/add_regex.md) method on the instance to specify the system tags and regular
   expression to identify internal employee IDs in a column. In this example, the value regular expression matches six-digit employee IDs.
   The regular expression to match the column name, `EMP.*ID.*`, and the comment are optional:

   ```sqlexample
   CALL internal_ids!ADD_REGEX(
     SEMANTIC_CATEGORY => 'EMPLOYEE_ID',
     PRIVACY_CATEGORY => 'IDENTIFIER',
     VALUE_REGEX => '^[0-9]{6}$',
     COL_NAME_REGEX => 'EMP.*ID.*',
     DESCRIPTION => 'Add a regex to identify employee IDs in a column',
     THRESHOLD => 0.8
   );
   ```

   Returns:

   ```output
   +---------------+
   |   ADD_REGEX   |
   +---------------+
   | EMPLOYEE_ID   |
   +---------------+
   ```

   > **Tip:**
   >
   > Test the regular expression before adding a regular expression to the custom classifier instance. For example:
   >
   > ```sqlexample
   > SELECT employee_id
   > FROM employee_roster
   > WHERE employee_id REGEXP('^[0-9]{6}$');
   > ```
   >
   > ```output
   > +-------------+
   > | EMPLOYEE_ID |
   > +-------------+
   > | 100001      |
   > | 100002      |
   > | 100003      |
   > +-------------+
   > ```
   >
   > In this query, only valid values that match the regular expression are returned. The query does not return invalid
   > values such as `xyz`.
   >
   > For details, see [String functions (regular expressions)](../sql-reference/functions-regexp.md).
7. Call the [custom_classifier!LIST](../sql-reference/classes/custom_classifier/methods/list.md) method on the instance to verify the regular expression that
   you added to the instance:

   ```sqlexample
   SELECT internal_ids!LIST();
   ```

   Returns:

   ```output
   +--------------------------------------------------------------------------------+
   | INTERNAL_IDS!LIST()                                                            |
   +--------------------------------------------------------------------------------+
   | {                                                                              |
   |   "EMPLOYEE_ID": {                                                             |
   |     "col_name_regex": "EMP.*ID.*",                                             |
   |     "description": "Add a regex to identify employee IDs in a column",         |
   |     "privacy_category": "IDENTIFIER",                                          |
   |     "threshold": 0.8,                                                          |
   |     "value_regex": "^[0-9]{6}$"                                                |
   |   }                                                                            |
   | }                                                                              |
   +--------------------------------------------------------------------------------+
   ```

   To remove a category, call the [custom_classifier!DELETE_CATEGORY](../sql-reference/classes/custom_classifier/methods/delete_category.md) method on the instance.
8. Call the [SYSTEM$CLASSIFY_SCHEMA](../sql-reference/stored-procedures/system_classify_schema.md) stored procedure to classify the table.
9. If the instance is no longer needed, use the [DROP CUSTOM_CLASSIFIER](../sql-reference/classes/custom_classifier/commands/drop-custom-classifier.md) command to
   remove a custom classifier instance from the system:

   ```sqlexample
   DROP SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER data.classifiers.internal_ids;
   ```

## Auditing custom classifiers

You can use the following queries to audit the creation of custom classifier instances, adding regular expressions to instances, and
dropping the instance.

* To audit the creation of custom classifier instances, use the following query:

  ```sqlexample
  SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_text ILIKE 'create % snowflake.data_privacy.custom_classifier%';
  ```
* To audit adding regular expressions to a specific instance, use the following query and replace `DB.SCH.MY_INSTANCE` with the name
  of the instance that you want to audit:

  ```sqlexample
  SELECT
      QUERY_HISTORY.user_name,
      QUERY_HISTORY.role_name,
      QUERY_HISTORY.query_text,
      QUERY_HISTORY.query_id
    FROM
      SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY query_history,
      SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY access_history,
        TABLE(FLATTEN(input => access_history.direct_objects_accessed)) flattened_value
  WHERE flattened_value.value:"objectName" = 'DB.SCH.MY_INSTANCE!ADD_REGEX'
  AND QUERY_HISTORY.query_id = ACCESS_HISTORY.query_id;
  ```
* To audit dropping a custom classifier instance, use the following query:

  ```sqlexample
  SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_text ILIKE 'drop % snowflake.data_privacy.custom_classifier%';
  ```

---
title: Using Duo as a multi-factor authentication (MFA) method
source: https://docs.snowflake.com/en/user-guide/security-mfa-duo.md
section: User Guide
---

# Using Duo as a multi-factor authentication (MFA) method

This topic provides general information about using Duo in conjunction with multi-factor authentication (MFA), including administrative
tasks that must be completed before users can use Duo as an MFA method. If you are a user who wants to set up Duo as your second factor of
authentication, see [Configuring a second factor of authentication](security-mfa-second-factor.md).

> **Note:**
>
> Users in trial accounts and [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) accounts cannot use Duo
> as their second factor of authentication. For other options, see [Configuring a second factor of authentication](security-mfa-second-factor.md).

Users don’t need to separately sign up with Duo or perform any tasks, other than installing the Duo Mobile application, which is supported
on multiple smartphone platforms. For more information about supported platforms/devices and how Duo multi-factor authentication works, see
the [Duo User Guide](http://guide.duosecurity.com/) .

## Prerequisite

The Duo application service communicates through TCP port `443`.

To ensure consistent behavior, update your firewall settings to include the Duo application service on TCP port `443`.

> ```bash
> *.duosecurity.com:443
> ```

For more information, see the [Duo documentation](https://duo.com/docs/duoweb#first-steps).

## MFA login flow

The following diagram illustrates the overall login flow for a user enrolled in MFA, regardless of the interface used to connect:

## Switching phones used for MFA

Instant Restore is a Duo feature that allows a user to back up the Duo app before switching to a new phone. As long as a Snowflake user
backs up their old phone first, they can use Instant Restore to enable authentication on the new phone without interrupting MFA for
Snowflake.

If a user does not back up the old phone or loses the old phone, the Snowflake account administrator must help set up a new MFA method. For
information, see [Recovering a user who is locked out](security-mfa.md).

## MFA error codes related to Duo

The following are error codes associated with MFA that can be returned during the authentication flow when the user is using Duo as their
second factor of authentication.

The errors are displayed with each failed login attempt. Historical data is also available in [Snowflake Information Schema](../sql-reference/info-schema.md) and
[Account Usage](../sql-reference/account-usage.md):

> * Information Schema provides data from within the past seven days and can be queried using the [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](../sql-reference/functions/login_history.md)
>   table functions.
> * The Account Usage [LOGIN_HISTORY view](../sql-reference/account-usage/login_history.md) provides data from within the past year.

| Error Code | Error | Description |
| --- | --- | --- |
| 390120 | EXT_AUTHN_DENIED | Duo Security authentication is denied. |
| 390121 | EXT_AUTHN_PENDING | Duo Security authentication is pending. |
| 390122 | EXT_AUTHN_NOT_ENROLLED | User is not enrolled in Duo Security. Contact your local system administrator. |
| 390123 | EXT_AUTHN_LOCKED | User is locked from Duo Security. Contact your local system administrator. |
| 390124 | EXT_AUTHN_REQUESTED | Duo Security authentication is required. |
| 390125 | EXT_AUTHN_SMS_SENT | Duo Security temporary passcode is sent via SMS. Please authenticate using the passcode. |
| 390126 | EXT_AUTHN_TIMEOUT | Timed out waiting for your login request approval via Duo Mobile. If your mobile device has no data service, generate a Duo passcode and enter it in the connect string. |
| 390127 | EXT_AUTHN_INVALID | Incorrect passcode was specified. |
| 390128 | EXT_AUTHN_SUCCEEDED | Duo Security authentication is successful. |
| 390129 | EXT_AUTHN_EXCEPTION | Request could not be completed due to a communication problem with the external service provider. Try again later. |
| 390132 | EXT_AUTHN_DUO_PUSH_DISABLED | Duo Push is not enabled for your MFA. Provide a passcode as part of the connection string. |

---
title: Using Dynamic Data Masking
source: https://docs.snowflake.com/en/user-guide/security-column-ddm-use.md
section: User Guide
---

# Using Dynamic Data Masking

This topic provides instructions on how to configure and use Dynamic Data Masking in Snowflake.

To learn more about using a masking policy with a tag, see [Tag-based masking policies](tag-based-masking-policies.md).

## Using Dynamic Data Masking

The following lists the high-level steps to configure and use Dynamic Data Masking in Snowflake:

1. Grant masking policy management privileges to a custom role for a security or privacy officer.
2. Grant the custom role to the appropriate users.
3. The security or privacy officer creates and defines masking policies and applies them to columns with sensitive data.
4. Execute queries in Snowflake. Note the following:

   * Snowflake dynamically rewrites the query applying the masking policy SQL expression to the column.
   * The column rewrite occurs at every place where the column specified in the masking policy appears in the query (e.g. projections, join predicate, where clause predicate, order by, and group by).
   * Users see masked data based on the execution context conditions defined in the masking policies. For more information on the execution context in Dynamic Data Masking policies, see [Advanced Column-level Security topics](security-column-advanced.md).

### Enforce dynamic data masking policies on Apache Iceberg tables queried from Apache Spark™

Snowflake supports enforcing dynamic data masking policies on Apache Iceberg tables that you query from Apache Spark™ through Snowflake Horizon
Catalog. For more information,
see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

### Step 1: Grant masking policy privileges to the custom role

A [security or privacy officer](security-column-intro.md) should serve as the masking policy administrator (i.e. custom role: `MASKING_ADMIN`) and have the privileges to define, manage, and apply masking policies to columns.

Snowflake provides the following privileges to grant to a security or privacy officer for Column-level Security masking policies:

| Privilege | Object | Description |
| --- | --- | --- |
| CREATE MASKING POLICY | Schema | This privilege controls who can create masking policies. |
| APPLY MASKING POLICY | Account | This privilege controls who can [un]set masking policies on columns and is granted to the ACCOUNTADMIN role by default. . This privilege only allows applying a masking policy to a column and does not provide any additional table privileges described in [Access control privileges](security-access-control-privileges.md). |
| APPLY | Masking policy | Optional. This policy-level privilege can be used by a policy owner to decentralize the [un]set operations of a given masking policy on columns to the object owners (i.e. the role that has the OWNERSHIP privilege on the object). . Snowflake supports [discretionary access control](security-access-control-overview.md) where object owners are also considered data stewards. . If the policy administrator trusts the object owners to be data stewards for protected columns, then the policy administrator can use this privilege to decentralize applying the policy [un]set operations. |

The following example creates the `MASKING_ADMIN` role and grants masking policy privileges to that role.

Create a masking policy administrator custom role:

> ```sqlexample
> use role useradmin;
> CREATE ROLE masking_admin;
> ```

Grant privileges to `masking_admin` role:

> ```sqlexample
> use role securityadmin;
> GRANT CREATE MASKING POLICY on SCHEMA <db_name.schema_name> to ROLE masking_admin;
> GRANT APPLY MASKING POLICY on ACCOUNT to ROLE masking_admin;
> ```

Allow `table_owner` role to set or unset the `ssn_mask` masking policy (optional):

> ```sqlexample
> GRANT APPLY ON MASKING POLICY ssn_mask to ROLE table_owner;
> ```

Where:

* `db_name.schema_name`
  :   Specifies the identifier for the schema for which the privilege should be granted.

For more information, see:

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [Configuring access control](security-access-control-configure.md)
* [Access control privileges](security-access-control-privileges.md)

### Step 2: Grant the custom role to a user

Grant the `MASKING_ADMIN` custom role to a user serving as the security or privacy officer.

```sqlexample
GRANT ROLE masking_admin TO USER jsmith;
```

### Step 3: Create a masking policy

Using the MASKING_ADMIN role, create a masking policy and apply it to a column.

In this representative example, users with the ANALYST role see the unmasked value. Users without the ANALYST role see a full mask.

```sqlexample
CREATE OR REPLACE MASKING POLICY email_mask AS (val string) RETURNS string ->
  CASE
    WHEN CURRENT_ROLE() IN ('ANALYST') THEN val
    ELSE '*********'
  END;
```

> **Tip:**
>
> If you want to update an existing masking policy and need to see the current definition of the policy, call the [GET_DDL](../sql-reference/functions/get_ddl.md) function or run the [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md) command.

### Step 4: Apply the masking policy to a table or view column

These examples assume that a masking policy is not applied to the table column when the table is created and the view column when the view
is created. You can optionally apply a masking policy to a table column when you create the table with a
[CREATE TABLE](../sql-reference/sql/create-table.md) statement or a view column with a [CREATE VIEW](../sql-reference/sql/create-view.md) statement.

Execute the following statements to apply the policy to a table column or a view column.

```sqlexample
-- apply masking policy to a table column

ALTER TABLE IF EXISTS user_info MODIFY COLUMN email SET MASKING POLICY email_mask;

-- apply the masking policy to a view column

ALTER VIEW user_info_v MODIFY COLUMN email SET MASKING POLICY email_mask;
```

### Step 5: Query data in Snowflake

Execute two different queries in Snowflake, one query with the ANALYST role and another query with a different role, to verify that users without the ANALYST role see a full mask.

```sqlexample
-- using the ANALYST role

USE ROLE analyst;
SELECT email FROM user_info; -- should see plain text value

-- using the PUBLIC role

USE ROLE PUBLIC;
SELECT email FROM user_info; -- should see full data mask
```

## Masking policy with a memoizable function

This example uses a [memoizable function](../developer-guide/udf/sql/udf-sql-scalar-functions.md) to cache the result of a query on the mapping table that
determines whether a role is authorized to view PII data. A data engineer uses a masking policy to protect the columns in the table.

The following procedure references these objects:

* A table that contains PII data, `employee_data`:

  ```output
  +----------+-------------+---------------+
  | USERNAME |     ID      | PHONE_NUMBER  |
  +----------+-------------+---------------+
  | JSMITH   | 12-3456-89  | 1555-523-8790 |
  | AJONES   | 12-0124-32  | 1555-125-1548 |
  +----------+-------------+---------------+
  ```
* A mapping table that determines whether a particular role is authorized to view data, `auth_role_t`:

  ```output
  +---------------+---------------+
  | ROLE          | IS_AUTHORIZED |
  +---------------+---------------+
  | DATA_ENGINEER | TRUE          |
  | DATA_STEWARD  | TRUE          |
  | IT_ADMIN      | TRUE          |
  | PUBLIC        | FALSE         |
  +---------------+---------------+
  ```

Complete these steps to create a masking policy that calls a memoizable function with arguments:

1. Create a memoizable function that queries the mapping table. The function returns an array of roles based on the value of the
   `is_authorized` column:

   ```sqlexample
   CREATE FUNCTION is_role_authorized(arg1 VARCHAR)
   RETURNS BOOLEAN
   MEMOIZABLE
   AS
   $$
     SELECT ARRAY_CONTAINS(
       arg1::VARIANT,
       (SELECT ARRAY_AGG(role) FROM auth_role WHERE is_authorized = TRUE)
     )
   $$;
   ```
2. Call the memoizable function to cache the query results. In this example, pass the value `TRUE` as the argument value because the
   resultant array serves as the source of allowed roles to access the data protected by the masking policy:

   ```sqlexample
   SELECT is_role_authorized(IT_ADMIN);
   ```

   ```output
   +---------------------------------------------+
   |         is_role_authorized(IT_ADMIN)        |
   +---------------------------------------------+
   |                    TRUE                     |
   +---------------------------------------------+
   ```
3. Create a masking policy to protect the `id` column. The policy calls the memoizable function to determine whether the
   role used to query the table is authorized to see the data in the protected column:

   ```sqlexample
   CREATE OR REPLACE MASKING POLICY empl_id_mem_mask
   AS (val VARCHAR) RETURNS VARCHAR ->
   CASE
     WHEN is_role_authorized(CURRENT_ROLE()) THEN val
     ELSE NULL
   END;
   ```
4. Set the masking policy on the table with an [ALTER TABLE … ALTER COLUMN](../sql-reference/sql/alter-table-column.md) command:

   ```sqlexample
   ALTER TABLE employee_data MODIFY COLUMN id
     SET MASKING POLICY empl_id_mem_mask;
   ```
5. Query the table to test the policy:

   ```sqlexample
   USE ROLE data_engineer;
   SELECT * FROM employee_data;
   ```

   This query returns unmasked data.

   However, if you switch roles to the PUBLIC role and repeat the query in this step, the values in the `id` are replaced
   with `NULL`.

## Additional masking policy examples

The following are additional, representative examples that can be used in the body of the Dynamic Data Masking policy.

Allow a production [account](admin-account-identifier.md) to see unmasked values and all other accounts
(e.g. development, test) to see masked values.

> ```sqlexample
> case
>   when current_account() in ('<prod_account_identifier>') then val
>   else '*********'
> end;
> ```

Return NULL for unauthorized users:

> ```sqlexample
> case
>   when current_role() IN ('ANALYST') then val
>   else NULL
> end;
> ```

Return a static masked value for unauthorized users:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE '********'
> END;
> ```

Return a hash value using [SHA2 , SHA2_HEX](../sql-reference/functions/sha2.md) for unauthorized users. Using a hashing function in a masking policy may result in collisions; therefore, exercise caution with this approach. For more information, see [Advanced Column-level Security topics](security-column-advanced.md).

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE sha2(val) -- return hash of the column value
> END;
> ```

Apply a partial mask or full mask:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   WHEN current_role() IN ('SUPPORT') THEN regexp_replace(val,'.+\@','*****@') -- leave email domain unmasked
>   ELSE '********'
> END;
> ```

Using timestamps.

> ```sqlexample
> case
>   WHEN current_role() in ('SUPPORT') THEN val
>   else date_from_parts(0001, 01, 01)::timestamp_ntz -- returns 0001-01-01 00:00:00.000
> end;
> ```
>
> > **Important:**
> >
> > Currently, Snowflake does not support different input and output data types in a masking policy, such as defining the masking policy to target a timestamp and return a string (e.g. `***MASKED***`); the input and output data types must match.
> >
> > A workaround is to cast the actual timestamp value with a fabricated timestamp value. For more information, see [DATE_FROM_PARTS](../sql-reference/functions/date_from_parts.md) and [CAST , ::](../sql-reference/functions/cast.md).

Using a UDF:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE mask_udf(val) -- custom masking function
> END;
> ```

On variant data:

> ```sqlexample
> CASE
>    WHEN current_role() IN ('ANALYST') THEN val
>    ELSE OBJECT_INSERT(val, 'USER_IPADDRESS', '****', true)
> END;
> ```

Using a custom entitlement table. Note the use of [EXISTS](../sql-reference/operators-subquery.md) in the WHEN clause. Always use EXISTS when including a subquery in the masking policy body. For more information on subqueries that Snowflake supports, see [Working with Subqueries](querying-subqueries.md).

> ```sqlexample
> CASE
>   WHEN EXISTS
>     (SELECT role FROM <db>.<schema>.entitlement WHERE mask_method='unmask' AND role = current_role()) THEN val
>   ELSE '********'
> END;
> ```

Using [DECRYPT](../sql-reference/functions/decrypt.md) on previously encrypted data with either [ENCRYPT](../sql-reference/functions/encrypt.md) or [ENCRYPT_RAW](../sql-reference/functions/encrypt_raw.md), with a passphrase on the encrypted data:

> ```sqlexample
> case
>   when current_role() in ('ANALYST') then DECRYPT(val, $passphrase)
>   else val -- shows encrypted value
> end;
> ```

Using a [<JavaScript UDF](../developer-guide/udf/javascript/udf-javascript-introduction.md) on JSON (VARIANT):

> In this example, a JavaScript UDF masks location data in a JSON string. It is important to set the data type as VARIANT in the UDF and
> the masking policy. If the data type in the table column, UDF, and masking policy signature do not match, Snowflake returns an error
> message because it cannot resolve the SQL.
>
> ```sqlexample
> -- Flatten the JSON data
>
> create or replace table <table_name> (v variant) as
> select value::variant
> from @<table_name>,
>   table(flatten(input => parse_json($1):stationLocation));
>
> -- JavaScript UDF to mask latitude, longitude, and location data
>
> CREATE OR REPLACE FUNCTION full_location_masking(v variant)
>   RETURNS variant
>   LANGUAGE JAVASCRIPT
>   AS
>   $$
>     if ("latitude" in V) {
>       V["latitude"] = "**latitudeMask**";
>     }
>     if ("longitude" in V) {
>       V["longitude"] = "**longitudeMask**";
>     }
>     if ("location" in V) {
>       V["location"] = "**locationMask**";
>     }
>
>     return V;
>   $$;
>
>   -- Grant UDF usage to ACCOUNTADMIN
>
>   grant ownership on function FULL_LOCATION_MASKING(variant) to role accountadmin;
>
>   -- Create a masking policy using JavaScript UDF
>
>   create or replace masking policy json_location_mask as (val variant) returns variant ->
>     CASE
>       WHEN current_role() IN ('ANALYST') THEN val
>       else full_location_masking(val)
>       -- else object_insert(val, 'latitude', '**locationMask**', true) -- limited to one value at a time
>     END;
> ```

Using the [GEOGRAPHY](../sql-reference/data-types-geospatial.md) data type:

> In this example, a masking policy uses the [TO_GEOGRAPHY](../sql-reference/functions/to_geography.md) function to convert all GEOGRAPHY data in a
> column to a fixed point, the longitude and latitude for Snowflake in San Mateo, California, for users whose CURRENT_ROLE is not
> `ANALYST`.
>
> > ```sqlexample
> > create masking policy mask_geo_point as (val geography) returns geography ->
> >   case
> >     when current_role() IN ('ANALYST') then val
> >     else to_geography('POINT(-122.35 37.55)')
> >   end;
> > ```
>
> Set the masking policy on a column with the GEOGRAPHY data type and set the [GEOGRAPHY_OUTPUT_FORMAT](../sql-reference/parameters.md) value for the session to
> `GeoJSON`:
>
> > ```sqlexample
> > alter table mydb.myschema.geography modify column b set masking policy mask_geo_point;
> > alter session set geography_output_format = 'GeoJSON';
> > use role public;
> > select * from mydb.myschema.geography;
> > ```
>
> Snowflake returns the following:
>
> > ```sqlexample
> > ---+--------------------+
> >  A |         B          |
> > ---+--------------------+
> >  1 | {                  |
> >    |   "coordinates": [ |
> >    |     -122.35,       |
> >    |     37.55          |
> >    |   ],               |
> >    |   "type": "Point"  |
> >    | }                  |
> >  2 | {                  |
> >    |   "coordinates": [ |
> >    |     -122.35,       |
> >    |     37.55          |
> >    |   ],               |
> >    |   "type": "Point"  |
> >    | }                  |
> > ---+--------------------+
> > ```
>
> The query result values in column B depend on the GEOGRAPHY_OUTPUT_FORMAT parameter value for the session. For example, if the parameter
> value is set to `WKT`, Snowflake returns the following:
>
> > ```sqlexample
> > alter session set geography_output_format = 'WKT';
> > select * from mydb.myschema.geography;
> >
> > ---+----------------------+
> >  A |         B            |
> > ---+----------------------+
> >  1 | POINT(-122.35 37.55) |
> >  2 | POINT(-122.35 37.55) |
> > ---+----------------------+
> > ```

For examples using other context functions and role hierarchy, see [Advanced Column-level Security topics](security-column-advanced.md).

**Next Topics:**

* [Advanced Column-level Security topics](security-column-advanced.md)

---
title: Using External Tokenization
source: https://docs.snowflake.com/en/user-guide/security-column-ext-token-use.md
section: User Guide
---

# Using External Tokenization

This topic provides instructions on how to use External Tokenization in Snowflake with partner integrations and how to create a custom
External Tokenization integration.

Snowflake supports External Tokenization on AWS, Microsoft Azure, and Google Cloud Platform.

Note that an external tokenization masking policy can be assigned to a tag to provide tag-based external tokenization. For details about
assigning a masking policy to a tag, see [Tag-based masking policies](tag-based-masking-policies.md).

> **Important:**
>
> External tokenization requires [Writing external functions](../sql-reference/external-functions.md), which are included in the Snowflake [Standard Edition](intro-editions.md), and you can use external functions with a tokenization provider.
>
> However, if you choose to integrate your tokenization provider with Snowflake External Tokenization, you must upgrade to
> [Enterprise Edition](intro-editions.md) or higher.
>
> To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## External Tokenization partner integrations

The following partners facilitate external tokenization in Snowflake. To use these partner integrations, follow the instructions in the
partner documentation or contact the partner to begin the configuration process:

* [ALTR](https://altr.com/use-cases/tokenization/)
* [Baffle](https://baffle.io/snowflake/)
* [Capital One Databolt](https://www.capitalone.com/software/products/databolt/)
* [Comforte](https://insights.comforte.com/how-to-protect-data-on-snowflake)
* [Fortanix](https://support.fortanix.com/hc/en-us/articles/4407049792148-Using-Data-Security-Manager-with-Snowflake)
* [MicroFocus CyberRes Voltage](https://www.microfocus.com/en-us/cyberres/partners/snowflake)
* [Protegrity](https://www.protegrity.com/snowflake-partnership)
* [Privacera](https://privacera.com/partners/snowflake/)
* [SecuPI](https://www.secupi.com/solution/data-access-governance/)
* [Skyflow](https://info.skyflow.com/snowflake-partner-skyflow)
* [Spring Labs](https://springlabs.com/spring-labs-snowflake)
* [Thales](https://thalesdocs.com/ctp/ig/snowflake/index.html)

## Create a custom External Tokenization integration

Complete the following steps to create a custom integration for External Tokenization:

### Step 1: Create an external function

Create an external function in Snowflake and configure your cloud provider environment to communicate with the external function. For
details, see:

* [Creating external functions on AWS](../sql-reference/external-functions-creating-aws.md)
* [Creating external functions on Microsoft Azure](../sql-reference/external-functions-creating-azure.md)
* [Creating external functions on GCP](../sql-reference/external-functions-creating-gcp.md)

### Step 2: Grant Masking Policy Privileges to Custom Role

A [security or privacy officer](security-column-intro.md) should serve as the masking policy administrator (i.e. custom role: `MASKING_ADMIN`) and have the privileges to define, manage, and apply masking policies to columns.

Snowflake provides the following privileges to grant to a security or privacy officer for Column-level Security masking policies:

| Privilege | Object | Description |
| --- | --- | --- |
| CREATE MASKING POLICY | Schema | This privilege controls who can create masking policies. |
| APPLY MASKING POLICY | Account | This privilege controls who can [un]set masking policies on columns and is granted to the ACCOUNTADMIN role by default. . This privilege only allows applying a masking policy to a column and does not provide any additional table privileges described in [Access control privileges](security-access-control-privileges.md). |
| APPLY | Masking policy | Optional. This policy-level privilege can be used by a policy owner to decentralize the [un]set operations of a given masking policy on columns to the object owners (i.e. the role that has the OWNERSHIP privilege on the object). . Snowflake supports [discretionary access control](security-access-control-overview.md) where object owners are also considered data stewards. . If the policy administrator trusts the object owners to be data stewards for protected columns, then the policy administrator can use this privilege to decentralize applying the policy [un]set operations. |

The following example creates the `MASKING_ADMIN` role and grants masking policy privileges to that role.

Create a masking policy administrator custom role:

> ```sqlexample
> use role useradmin;
> CREATE ROLE masking_admin;
> ```

Grant privileges to `masking_admin` role:

> ```sqlexample
> use role securityadmin;
> GRANT CREATE MASKING POLICY on SCHEMA <db_name.schema_name> to ROLE masking_admin;
> GRANT APPLY MASKING POLICY on ACCOUNT to ROLE masking_admin;
> ```

Allow `table_owner` role to set or unset the `ssn_mask` masking policy (optional):

> ```sqlexample
> GRANT APPLY ON MASKING POLICY ssn_mask to ROLE table_owner;
> ```

Where:

* `db_name.schema_name`
  :   Specifies the identifier for the schema for which the privilege should be granted.

For more information, see:

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [Configuring access control](security-access-control-configure.md)
* [Access control privileges](security-access-control-privileges.md)

### Step 3: Grant the Custom Role to a User

Grant the `MASKING_ADMIN` custom role to a user serving as the security or privacy officer.

```sqlexample
use role useradmin;
grant role masking_admin to user jsmith;
```

### Step 4: Create a Masking Policy

In this representative example, users with the `ANALYST` custom role see the detokenized email values. Users without the `ANALYST` custom role see the tokenized values.

The external function to detokenize email values is `de_email()`.

```sqlexample
-- create masking policy

create or replace masking policy email_de_token as (val string) returns string ->
  case
    when current_role() in ('ANALYST') then de_email(val)
    else val
  end;
```

> **Tip:**
>
> If you want to update an existing masking policy and need to see the current definition of the policy, call the [GET_DDL](../sql-reference/functions/get_ddl.md) function or run the [DESCRIBE MASKING POLICY](../sql-reference/sql/desc-masking-policy.md) command.

### Step 5: Apply the Masking Policy to a Table or View Column

These examples assume that a masking policy is not applied to the table column when the table is created and the view column when the view
is created. You can optionally apply a masking policy to a table column when you create the table with a
[CREATE TABLE](../sql-reference/sql/create-table.md) statement or a view column with a [CREATE VIEW](../sql-reference/sql/create-view.md) statement.

Execute the following statements to apply the policy to a table column or a view column.

```sqlexample
-- apply masking policy to a table column

alter table if exists user_info modify column email set masking policy email_de_token;

-- apply the masking policy to a view column

alter view user_info_v modify column email set masking policy email_de_token;
```

### Step 6: Query Data in Snowflake

Execute two different queries in Snowflake, one query with the `ANALYST` custom role and another query with a different role, to verify that users without the `ANALYST` custom role see tokenized values.

```sqlexample
-- using the ANALYST custom role

use role ANALYST;
select email from user_info; -- should see plain text value

-- using the PUBLIC system role

use role public;
select email from user_info; -- should see tokenized value
```

## External Tokenization best practices

* Synchronizing systems. On AWS, it is helpful to synchronize users and roles in your organization’s identity provider (IdP) with Snowflake
  and Protegrity. If users and roles are not synchronized, there can be unexpected behaviors, error messages, and complex troubleshooting
  regarding external functions, API integrations, masking policies, and tokenization policies. One option is to use
  [SCIM](scim-intro.md) to keep users and roles synchronized with your IdP and Snowflake.
* Root cause for error(s). Since External Tokenization requires coordinating multiple systems (e.g. IdP, Snowflake, Protegrity, AWS, Azure, GCP), always verify the privileges, current limitations, external functions, API integration, masking policies, and the columns that have masking policies for External Tokenization in Snowflake. To help determine the root cause, see:

  + [Understanding Column-level Security](security-column-intro.md)
  + [Troubleshooting External Tokenization](security-column-ext-token-intro.md)
  + [Creating external functions on AWS](../sql-reference/external-functions-creating-aws.md)
  + [Creating external functions on Microsoft Azure](../sql-reference/external-functions-creating-azure.md)
  + [Creating external functions on GCP](../sql-reference/external-functions-creating-gcp.md)

**Next Topic:**

* [Advanced Column-level Security topics](security-column-advanced.md)

---
title: Using full-text search
source: https://docs.snowflake.com/en/user-guide/querying-with-search-functions.md
section: User Guide
---

# Using full-text search

You can use search functions to find character data (text) and IP addresses in specified columns from one or
more tables, including fields in VARIANT, OBJECT, and ARRAY columns. This function searches the text in specified
columns or strings based on a list of given search terms. The function returns TRUE if the text matches the
specified search terms based on the search semantics.

In most cases, you call the SEARCH function by specifying it in the SELECT list or the WHERE clause of a SELECT statement.
If the function is used as a WHERE clause filter, the query returns rows when the function returns TRUE.

The SEARCH function requires no setup and no additional privileges. If you’re using a role that has the
privileges to access the data in a column, you can search for that data by using the SEARCH function.

The next sections contain more information about the SEARCH function and about optimizing query performance when you
use it:

* Using the SEARCH function
* Using the SEARCH_IP function
* Optimizing queries that use the SEARCH function

## Using the SEARCH function

The [SEARCH function](../sql-reference/functions/search.md) finds character data (text) in specified columns from one
or more tables, including fields in VARIANT, OBJECT, and ARRAY columns.

When you use the SEARCH function, a text analyzer breaks the text into *tokens*, which are discrete units of text, such
as words or numbers. A default analyzer is applied if you don’t specify one. The analyzer extracts tokens from both the
search terms and the data.

If tokens extracted from the search terms match tokens extracted from a specified column or field according to the
search semantics, the function returns TRUE. The SEARCH_MODE function argument specifies one of the following search
modes:

* `'OR'` - The function uses disjunctive semantics. There is a match if *any* of the tokens
  extracted from the columns or fields being searched match *any* of the tokens extracted from the search string.
  For example, if the `search_string` value is `'blue red green'`, the function returns TRUE for a row that
  contains `blue` OR `red` OR `green` in any of the columns or fields being searched.
* `'AND'` - The function uses conjunctive semantics. There is a match if the tokens extracted from
  *at least one* of the columns or fields being searched matches *all* of the tokens extracted from the search
  string. The matching tokens must all be in one column or field; they can’t be spread across multiple columns or fields.
  For example, if the `search_string` value is `'blue red green'`, the function returns TRUE for a row that
  contains `blue` AND `red` AND `green` in at least one of the columns or fields being searched.
* `'PHRASE'` - The function uses phrase-match semantics. There is a match if the tokens extracted from
  *at least one* of the columns or fields being searched matches *all* of the tokens extracted from the search string,
  including the order and adjacency of the tokens.

  The matching semantics are the same as conjunctive semantics, except for the following differences:

  + The order of the tokens must exactly match. For example, if the `search_string` value is `'blue,red,green'`,
    the function returns FALSE for `red,green,blue`.
  + No additional tokens can be interspersed in the search data. For example, if the `search_string` value
    is `'blue,red,green'`, the function returns FALSE for `blue,yellow,red,green`.
* `'EXACT'` - The function uses exact-match semantics. There is a match if the tokens extracted from
  *at least one* of the columns or fields being searched exactly matches *all* of the tokens extracted
  from the search string, including the delimiters.

  The matching rules are the same as phrase-search semantics, except for the following differences:

  + The delimiter strings between the tokens must match exactly. For example, if the `search_string`
    value is `'blue,red,green'`, the function returns TRUE for a row that contains `blue,red,green`
    in at least one of the columns or fields being searched. The function returns FALSE for variations such as
    `blue|red|green` or `blue, red, green`.
  + When a delimiter is the first or last character in the `search_string` value, the delimiter
    is treated like a character for matching. Therefore, delimiters on the left and right of the first and
    last delimiter can result in a match. For example, if the `search_string` value is `'[blue]'`,
    the function returns TRUE for `foo [blue] bar`, `[[blue]]`, and `=[blue].`, but not for
    `(blue)` or `foo blue bar`.

The following example searches for the string `snow leopard` in the text `leopard` with the default SEARCH_MODE (`'OR'`)
and the default analyzer:

```sqlexample
SELECT SEARCH('leopard', 'snow leopard');
```

```output
+-----------------------------------+
| SEARCH('LEOPARD', 'SNOW LEOPARD') |
|-----------------------------------|
| True                              |
+-----------------------------------+
```

The following example searches for the string `snow leopard` in the text `lion`:

```sqlexample
SELECT SEARCH('lion', 'snow leopard');
```

```output
+--------------------------------+
| SEARCH('LION', 'SNOW LEOPARD') |
|--------------------------------|
| False                          |
+--------------------------------+
```

The following example searches for the string `snow leopard` in the text `leopard` and specifies `'AND'` for the
SEARCH_MODE argument:

```sqlexample
SELECT SEARCH('leopard', 'snow leopard', search_mode => 'AND');
```

```output
+---------------------------------------------------------+
| SEARCH('LEOPARD', 'SNOW LEOPARD', SEARCH_MODE => 'AND') |
|---------------------------------------------------------|
| False                                                   |
+---------------------------------------------------------+
```

For more information about this function and additional examples, see [SEARCH](../sql-reference/functions/search.md).

## Using the SEARCH_IP function

The [SEARCH_IP function](../sql-reference/functions/search_ip.md) finds valid IPv4 and IPv6 addresses in specified character-string
columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. The search is based on a single IP
address that you specify. If this IP address exactly matches an IP address in the specified column or field, the function
returns TRUE.

The following example searches for the IP address `10.10.10.1` in the text `192.0.2.146`:

```sqlexample
SELECT SEARCH_IP('192.0.2.146','10.10.10.1');
```

```output
+---------------------------------------+
| SEARCH_IP('192.0.2.146','10.10.10.1') |
|---------------------------------------|
| False                                 |
+---------------------------------------+
```

For more information about this function and additional examples, see [SEARCH_IP](../sql-reference/functions/search_ip.md).

## Optimizing queries that use the SEARCH function

To improve the performance of queries that use the function, you can optionally [enable FULL_TEXT search optimization](search-optimization/enabling.md) on a specific column or set of columns in a table. When you enable search
optimization, a new [search access path](search-optimization-service.md) is built and maintained.

---
title: Using lateral joins
source: https://docs.snowflake.com/en/user-guide/lateral-join-using.md
section: User Guide
---

# Using lateral joins

In a [FROM](../sql-reference/constructs/from.md) clause, the [LATERAL](../sql-reference/constructs/join-lateral.md) construct allows an inline view to reference columns from preceding table expressions.

For example, if the inline view is a [subquery](querying-subqueries.md), the subquery can process rows from the table to the left of the subquery. For example:

```sqlexample
SELECT ...
  FROM left_hand_table_expression AS lhte,
    LATERAL (SELECT col_1 FROM table_2 AS t2 WHERE t2.col_1 = lhte.col_1);
```

This behavior is somewhat similar to a [correlated subquery](querying-subqueries.md).
The subquery after the LATERAL keyword is similar to the correlated subquery itself, and the `left_hand_table_expression` is similar to the
outer query. A lateral join, unlike a correlated subquery, can return multiple rows, each of which can have multiple columns.

Other types of joins do not directly pass the left-hand table expression’s rows to the right-hand table
expression for processing.

A common use for a lateral join is to combine it with a call to the [FLATTEN](../sql-reference/functions/flatten.md) table function to
process a complex data structure, such as an ARRAY or VARIANT data type, and extract the values. For an example, see
[LATERAL](../sql-reference/constructs/join-lateral.md).

Unlike the output of other types of joins, the output from a lateral join includes only the rows
generated from the inline view (the subquery); after the rows from the subquery are generated, they are not cross-joined
to all the rows from the table on the left-hand side.

## Terminology

Consider the following code fragment:

```sqlexample
... FROM te1, LATERAL iv1 ...
```

The left-hand side of the lateral join is a table expression (`te1`).
The right-hand side of the lateral join is an inline view (`iv1`).

* *Table expression*: In this topic, the table expression on the left-hand side of a lateral join,
  such as the table expression above named `te1`, can be almost any valid
  expression that evaluates to a table. For example:

  + A table.
  + A view.
  + A subquery.
  + The output of a table function.
  + The result of an earlier join (a lateral join or other type of join).
* *Inline view*: In this topic, the expression on the right-hand side of a lateral join (in this case, `iv1`)
  is referred to as an “inline view.” In this context, a valid inline view can be one of the following:

  + A view that is defined within the statement, and valid only for the duration of the statement.
  + A subquery.
  + A table function: either a built-in table function such as FLATTEN or a user-defined table function (UDTF).

  The inline view cannot be a table.
* *Cross join*: In this topic, the term “cross join” refers not only to explicit cross joins, but also to
  inner joins and outer joins, including all variations (natural joins, left/right/full outer joins, and so on).

## A refresher on joins

A join is a two-step process. First, the server pairs up two rows,
which are usually in different tables, and which are almost always related in some way.
Second, the server joins the columns of each row in the pair into a single row.

Many of the example queries use the data shown below:

```sqlexample
CREATE TABLE departments (department_id INTEGER, name VARCHAR);
CREATE TABLE employees (employee_ID INTEGER, last_name VARCHAR,
  department_ID INTEGER, project_names ARRAY);

INSERT INTO departments (department_ID, name) VALUES
  (1, 'Engineering'),
  (2, 'Support');
INSERT INTO employees (employee_ID, last_name, department_ID) VALUES
  (101, 'Richards', 1),
  (102, 'Paulson',  1),
  (103, 'Johnson',  2);
```

Here’s a simple inner join (this is not a lateral join):

```sqlexample
SELECT *
  FROM departments AS d, employees AS e
  WHERE e.department_ID = d.department_ID
  ORDER BY employee_ID;
```

```output
+---------------+-------------+-------------+-----------+---------------+---------------+
| DEPARTMENT_ID | NAME        | EMPLOYEE_ID | LAST_NAME | DEPARTMENT_ID | PROJECT_NAMES |
|---------------+-------------+-------------+-----------+---------------+---------------|
|             1 | Engineering |         101 | Richards  |             1 | NULL          |
|             1 | Engineering |         102 | Paulson   |             1 | NULL          |
|             2 | Support     |         103 | Johnson   |             2 | NULL          |
+---------------+-------------+-------------+-----------+---------------+---------------+
```

As you can see, the rows are paired based on matching department IDs.

The join takes the columns from two corresponding (“paired”) input rows and generates one output row that contains all
the columns from both input rows. (Of course, by modifying the SELECT list, you can change the columns; however,
in the simplest case, all input columns are included in the output.)

A lateral join pairs rows differently. However, the second half of the process, the “join” of paired rows, is similar:
the output row will (almost always) contain one or more columns from each member of the pair of input rows.

## How a lateral join pairs rows

A lateral join behaves differently from other types of joins. A lateral join behaves as if the server executed a loop
similar to the following:

```none
for each row in left_hand_table LHT:
  execute right_hand_subquery RHS using the values from the current row in the LHT
```

This section focuses on the “pairing” part of the process, which is different for lateral joins.

The LATERAL construct allows an inline view on the right-hand side of the lateral join to reference columns from a
table expression that is outside the view. (In the example below, the “inline view” is actually a subquery.)

```sqlexample
SELECT *
  FROM departments AS d,
    LATERAL (SELECT * FROM employees AS e WHERE e.department_ID = d.department_ID) AS iv2
  ORDER BY employee_ID;
```

```output
+---------------+-------------+-------------+-----------+---------------+---------------+
| DEPARTMENT_ID | NAME        | EMPLOYEE_ID | LAST_NAME | DEPARTMENT_ID | PROJECT_NAMES |
|---------------+-------------+-------------+-----------+---------------+---------------|
|             1 | Engineering |         101 | Richards  |             1 | NULL          |
|             1 | Engineering |         102 | Paulson   |             1 | NULL          |
|             2 | Support     |         103 | Johnson   |             2 | NULL          |
+---------------+-------------+-------------+-----------+---------------+---------------+
```

In this example, the WHERE clause in the subquery on the right refers to a value from the table on the left.

The differences between a lateral join and a cross join are much greater than simply access to columns.
The next several paragraphs contrast these two types of joins, starting with the traditional cross join.

A cross join combines each row of the table on the left with each row of the table on the right. The result is a
Cartesian product.

Conceptually, a cross join is similar to a nested loop, as in the pseudo-code below:

```none
for each row in left_hand_table LHT:
  for each row in right_hand_table RHT:
    concatenate the columns of the RHT to the columns of the LHT
```

If the table on the left has *n* rows and the table on the right has *m\** rows,
the result of the cross join has *n x m* rows. For example, if the table on the left
has 1000 rows and the table on the right has 100 rows, the result of
the inner join is 100,000 rows. This is just what you would expect from
nested loops; if the outer loop executes 1000 times and the inner loop
executes 100 times *per iteration of the outer loop*, the innermost statement executes 100,000 times.
(Of course, SQL programmers rarely write pure cross joins without any join conditions in the
FROM clause or WHERE clause.)

A lateral join pairs records very differently. Here’s the pseudo-code for the
implementation of a lateral join:

```none
for each row in left_hand_table LHT:
  execute right_hand_subquery RHS using the values from the LHT row,
    and concatenate LHT columns to RHS columns
```

The lateral join has only one loop, not two nested loops, which changes the output.

For the cross join, the output was 100,000 rows. For a lateral join with
the same 1000-row table on the left-hand side, and using a right-hand inline view (such as a subquery)
that emits one output row per input row, the output of the lateral join
will be 1000 rows, not 100,000 rows.

You can think of a lateral join as follows: For each input row from the left-hand table, the inline view on the right produces
0 or more rows. Each of those output rows from the subquery is then joined to the input row (*not* to the entire table on the
left-hand side) to produce a row that contains the columns selected from the subquery and the columns from the LHT input row.

The inline view on the right-hand side of a lateral join does not necessarily produce exactly one output row for each
input row. For any one input row, the output from the right-hand side might be 0 rows, 1 row, or multiple rows. Each
of those output rows will be joined to the columns of the original input row.

If the subquery does not produce exactly one output row for each input row, the lateral join does not necessarily
produce exactly as many rows as there are in the left-hand table. If the left-hand table has 1000 rows, and the
inline view produces 2 output rows for each input row, the result of the lateral join is 2000 rows.

In each of the lateral join examples so far, there was no ON clause or WHERE clause in the outer query to pair up
records. The pairing (if any) is done by the inline view based on the individual row passed into the inline view.
This is reasonably clear when the inline view is a subquery with a WHERE clause. It is not necessarily as obvious in
other cases, such as when the right-hand expression is a table function rather than a subquery. (A later example
shows a right-hand expression that uses the FLATTEN table function instead of a subquery.)

Readers who are fluent with correlated subqueries or with joins of table
functions might find the following comparisons helpful in understanding how
lateral joins differ from cross joins. Readers not familiar with correlated
subqueries or joining table functions can skip these sections.

## Similarities between correlated subqueries and lateral joins

A lateral join is similar to a correlated subquery:

* In a correlated subquery, the subquery is executed once for each row in the outer query.
* In a lateral join, the right-hand subquery (inline view) is executed once for each row in the
  left-hand table expression.

However, correlated subqueries and lateral joins are not the same. One difference is that
in a lateral join the subquery can generate more than one output row per input row,
and each output row can contain multiple columns. Correlated subqueries return only
one output row per input row, and each output row must contain only one column.

## Similarities between joining table functions and lateral joins

A lateral join is similar to a “join” between a table and a user-defined table function (UDTF).
For example, consider the following SQL statement:

```sqlexample
SELECT *
  FROM t1, TABLE(udtf2(t1.col1))
  ...
  ;
```

The pseudo-code for implementing the join between the table and the UDTF is:

```none
for each row in left_hand_table LHT:
  udtf2(row) -- that is, call udtf2() with the value(s) from the LHT row.
```

This is essentially identical to the code for implementing a lateral join:

```none
for each row in left_hand_table LHT:
  execute right_hand_subquery RHS using the values from the LHT row
```

## Example: Using a lateral join with the FLATTEN table function

Lateral joins are frequently used with the built-in [FLATTEN](../sql-reference/functions/flatten.md) table function. The FLATTEN function is
often used with data types that can store multiple values (such as ARRAY, VARIANT, and OBJECT). For example, an array typically
contains multiple values. Similarly, a VARIANT column can contain a JSON data value, which might contain a dictionary (hash) or list.
(And that, in turn, might contain other values.)

You can create ARRAY values as follows:

```sqlexample
UPDATE employees SET project_names = ARRAY_CONSTRUCT('Materialized Views', 'UDFs')
  WHERE employee_ID = 101;
UPDATE employees SET project_names = ARRAY_CONSTRUCT('Materialized Views', 'Lateral Joins')
  WHERE employee_ID = 102;
```

The FLATTEN function can extract values from inside those values. The function takes a single expression of type VARIANT, OBJECT,
or ARRAY, and extracts the values from that expression into a set of rows (0 or more rows, each of which contains 1 or more columns).
This set of rows is equivalent to a view or a table. This view exists only for the duration of the statement in which it is
defined, so it is commonly referred to as an “inline view”.

The following example uses FLATTEN to extract values from an array (*without using a lateral join*):

```sqlexample
SELECT index, value AS project_name
  FROM TABLE(FLATTEN(INPUT => ARRAY_CONSTRUCT('project1', 'project2')));
```

```output
+-------+--------------+
| INDEX | PROJECT_NAME |
|-------+--------------|
|     0 | "project1"   |
|     1 | "project2"   |
+-------+--------------+
```

The inline view generated by FLATTEN can be (but is not required to be) used with the LATERAL keyword. For example:

```sqlexample
SELECT * FROM table1, LATERAL FLATTEN(...);
```

When used with the LATERAL keyword, the inline view can contain a reference to columns in a table that precedes it:

```sqlexample
SELECT emp.employee_ID, emp.last_name, index, value AS project_name
  FROM employees AS emp,
    LATERAL FLATTEN(INPUT => emp.project_names) AS proj_names
  ORDER BY employee_ID;
```

```output
+-------------+-----------+-------+----------------------+
| EMPLOYEE_ID | LAST_NAME | INDEX | PROJECT_NAME         |
|-------------+-----------+-------+----------------------|
|         101 | Richards  |     0 | "Materialized Views" |
|         101 | Richards  |     1 | "UDFs"               |
|         102 | Paulson   |     0 | "Materialized Views" |
|         102 | Paulson   |     1 | "Lateral Joins"      |
+-------------+-----------+-------+----------------------+
```

---
title: Using multiple identity providers for federated authentication
source: https://docs.snowflake.com/en/user-guide/admin-security-fed-auth-security-integration-multiple.md
section: User Guide
---

# Using multiple identity providers for federated authentication

You can configure Snowflake to allow users to authenticate with multiple identity providers (IdPs).

Implementing a federated environment that uses multiple IdPs consists of the following steps:

1. Enable the identifier-first login flow (in this topic).
2. [Configure each identity provider](admin-security-fed-auth-configure-idp.md).
3. [Create multiple SAML security integrations](admin-security-fed-auth-security-integration.md), one for each IdP.
4. Associate users with IdPs (in this topic).

> **Note:**
>
> Keep the following in mind as you implement an environment using multiple IdPs:
>
> * Each IdP must have a corresponding SAML security integration. If you have an existing single-IdP environment that uses the deprecated
>   SAML_IDENTITY_PROVIDER parameter, you must use the [SYSTEM$MIGRATE_SAML_IDP_REGISTRATION](../sql-reference/functions/system_migrate_saml_idp_registration.md) function to
>   migrate it to a SAML security integration.
> * Currently, only a subset of Snowflake drivers support the use of multiple identity providers. These drivers include JDBC, ODBC, and Python.

## Enable identifier-first login

When the federated environment for an account uses multiple IdPs, Snowflake must be able to determine which IdPs are associated with a user
*before* presenting the user with authentication options. In this flow, Snowflake prompts the user for only their email address or username,
then displays authentication methods after identifying the user. Only IdPs associated with the user appear as authentication options.

The identifier-first login flow must be enabled if you are using multiple IdPs. To enable identifier-first login, set the
[ENABLE_IDENTIFIER_FIRST_LOGIN](../sql-reference/parameters.md) parameter to `TRUE`:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Execute the following SQL statements:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   ALTER ACCOUNT SET ENABLE_IDENTIFIER_FIRST_LOGIN = true;
   ```

For more information about the identifier-first login flow, see [Identifier-first login](identifier-first-login.md).

## Associate users with IdPs

In an environment with multiple IdPs, you can choose how you want to associate a user with an IdP. You can use the security integration
associated with an IdP, an authentication policy, or combine the two methods.

Security Integration:
:   Use the `ALLOWED_USER_DOMAINS` and `ALLOWED_EMAIL_PATTERNS` properties of the SAML2 security integration associated with each
    IdP. In this configuration, a user only sees an IdP as an authentication option if their `EMAIL` matches an email address domain or
    pattern in the security integration.

Authentication Policy:
:   Use the `SECURITY_INTEGRATIONS` property of an [authentication policy](authentication-policies.md) to specify which
    security integrations are available to the user. In this configuration, the authentication policy is assigned to an entire account or an
    individual user. A user can only authenticate with IdPs associated with security integrations that are specified in the authentication
    policy.

    If you want a user to only see the identity providers that they are allowed to use, create multiple authentication policies and then
    assign the appropriate policy to a user.

    For an example of using an authentication policy to implement multiple IdPs, see [Allow authentication from multiple identity providers on an account](authentication-policies.md).

Combined:
:   You can combine the security integration and authentication policy methods to further refine how users authenticate in an environment that
    has multiple IdPs.

    If you use both methods, Snowflake first evaluates which security integrations are associated with the authentication policy governing the
    user’s login. Once Snowflake has identified the security integrations, the user’s `EMAIL` is matched to one of the integrations
    based on the `ALLOWED_USER_DOMAINS` and `ALLOWED_EMAIL_PATTERNS` properties. Snowflake only displays the IdP option for the
    security integration that matches the user’s `EMAIL`.

## Use multiple SAML2 security integrations with Microsoft Entra ID using the same issuer ID

This section guides you through configuring Snowflake and Microsoft Entra ID to let users authenticate through SSO using both a public or
private issuer URL. You can use two different SAML2 security integrations with Microsoft Entra ID to implement this experience. You can
configure Microsoft Entra ID to differentiate between public and private issuer URLs by appending a different application ID to each issuer
URL.

Before continuing, you must enable the identifier-first login flow.

Follow the sections below to learn how to use multiple SAML2 security integrations with Microsoft Entra ID using the same issuer
ID:

* Configure Microsoft Entra ID to append application IDs to Microsoft Entra Identifier URLs.
* Gather the Login URL, Microsoft Entra identifier, and application ID.
* Create public and private SAML2 security integrations.

### Configure Microsoft Entra ID to append application IDs to Microsoft Entra Identifier URLs

1. Log in to [Microsoft Azure](https://portal.azure.com/).
2. Under Azure services, select Microsoft Entra ID.
3. In the left navigation, select Manage » Enterprise applications.
4. Select your application.
5. In the left navigation, select Manage » Single sign-on.
6. In Attributes & Claims, select Edit.
7. Under Additional claims, expand Advanced settings.
8. Beside Advanced SAML claims options, select Edit.

   A panel on the right appears.
9. Select Append application ID to issuer.

### Gather the Login URL, Microsoft Entra identifier, and application ID

1. Ensure you configured Microsoft Entra ID.
2. In the left navigation, select Manage » Single sign-on.
3. Under Set up <your application name>, save the following values for later:

   * Login URL
   * Microsoft Entra Identifier
4. In the left navigation, select Overview
5. Under Properties, save the Application ID for later.
6. Repeat for additional applications.

### Create public and private SAML2 security integrations

1. Ensure you configured Microsoft Entra ID.
2. Ensure you gathered the Login URL, Microsoft Entra identifier, and application ID.
3. Sign in to [Snowsight](ui-snowsight-gs.md).
4. In the navigation menu, select Projects » Worksheets.
5. Switch to a role with the [CREATE INTEGRATION](../sql-reference/sql/create-security-integration-saml2.md) privilege.
6. Execute the following SQL statement to create a SAML2 security integration:

   ```sqlexample
   CREATE OR REPLACE SECURITY INTEGRATION entra_id_public
     TYPE = SAML2
     ENABLED = TRUE
     SAML2_ISSUER = '<microsoft_entra_identifier>/<application_id>'
     SAML2_SSO_URL = '<login_url>'
     SAML2_PROVIDER = 'CUSTOM'
     SAML2_X509_CERT = 'MIIC...TAs/'
     SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'Entra ID SSO Public'
     SAML2_ENABLE_SP_INITIATED = TRUE
     SAML2_SNOWFLAKE_ACS_URL = 'https://<organization_name>-<account_name>.snowflakecomputing.com/fed/login'
     SAML2_SNOWFLAKE_ISSUER_URL = 'https://<organization_name>-<account_name>.snowflakecomputing.com';
   ```

   Where the following placeholders are replaced with the values you gathered earlier:

   | Placeholder | Example value |
   | --- | --- |
   | `<login_url>` | `https://login.microsoftonline.com/91ccae45-d439-xxxx-xxxx-e22c06bfe4f9/saml2` |
   | `<microsoft_entra_identifier>` | `https://sts.windows.net/91ccae45-d439-xxxx-xxxx-e22c06bfe4f9` |
   | `<application_id>` | `456xyz00-4567-4567-4567-4567xyz5678` |
   | `<organization_name>` | `EXAMPLE-USER12_AA12` |
   | `<account_name>` | `MSMITH` |
7. Create another SAML2 security integration, appending the private application ID to the Microsoft Entra Identifier in the SAML2_ISSUER
   parameter.

---
title: Using Persisted Query Results
source: https://docs.snowflake.com/en/user-guide/querying-persisted-results.md
section: User Guide
---

# Using Persisted Query Results

When a query is executed, the result is persisted (i.e. cached) for a period of time. At the end
of the time period, the result is purged from the system.

Snowflake uses persisted query results to avoid re-generating results when nothing has changed
(i.e. “retrieval optimization”). In addition, you can use persisted query results to post-process
the results (e.g. layering a new query on top of the results already calculated).

For persisted query results of all sizes, the cache expires after 24 hours.

Note that the security token used to access large persisted query results (i.e. greater than
100 KB in size) expires after 6 hours. A new token can be retrieved to access results while they
are still in cache. Smaller persisted query results do not use an access token.

> **Note:**
>
> The token provided to the Snowflake Connector for Spark (“Spark connector”) expires after
> 24 hours regardless of the size of the persisted query results. The Spark connector leverages
> the longer cache expiration time to avoid timeouts in some use cases.

See also [Optimizing the warehouse cache](performance-query-warehouse-cache.md), which discusses how table data may be cached
and reused by an active warehouse.

## Retrieval Optimization

If a user repeats a query that has already been run, and the data in the table(s) hasn’t changed since the last time that the query was run, then the result of the query is the same.
Instead of running the query again, Snowflake simply returns the same result that it returned previously. This can substantially reduce query time because Snowflake bypasses query
execution and, instead, retrieves the result directly from the cache.

Typically, query results are reused if all of the following conditions are met:

* The new query matches the previously executed query exactly. Any difference in syntax, including lowercase versus uppercase, or the use of table aliases, will inhibit 100% cache reuse. For example, consider the following queries, run in succession:

  > ```sqlexample
  > SELECT DISTINCT(severity) FROM weather_events;
  > SELECT DISTINCT(severity) FROM weather_events;
  > SELECT DISTINCT(severity) FROM weather_events we;
  > select distinct(severity) from weather_events;
  > ```

  The first query will populate the cache, and the identical second query will benefit from 100% cache reuse. However, the third and fourth queries will not trigger cache reuse, simply because the third query introduces a table alias and the fourth query uses lowercase keywords.
* The query does not include non-reusable functions, which return different results for successive runs of the same query.
  [UUID_STRING](../sql-reference/functions/uuid_string.md), [RANDOM](../sql-reference/functions/random.md), and [RANDSTR](../sql-reference/functions/randstr.md) are good examples of non-reusable functions.
* The query does not include [external functions](../sql-reference/external-functions.md).
* The query does not select from [hybrid tables](tables-hybrid.md).
* The table data contributing to the query result has not changed.
* The persisted result for the previous query is still available.
* The role accessing the cached results has the required privileges.

  + If the query was a SELECT query, the role executing the query must have the necessary access privileges for all
    the tables used in the cached query.
  + If the query was a SHOW query, the role executing the query must match the role that generated the cached results.
* Any configuration options that affect how the result was produced have not changed.
* The table’s micro-partitions have not changed (e.g. been reclustered or consolidated) due to changes to other data in the table.

> **Note:**
>
> Meeting all these conditions does not guarantee that Snowflake reuses the query results.

By default, result reuse is enabled, but can be overridden at the account, user, and session level using the [USE_CACHED_RESULT](../sql-reference/parameters.md) session parameter.

> **Note:**
>
> Each time the persisted result for a query is reused, Snowflake resets the 24-hour retention period for the result, up to a maximum of 31 days from the date and time that the query was first
> executed. After 31 days, the result is purged and the next time the query is submitted, a new result is generated and persisted.

## Post-processing Query Results

In some cases, you might want to perform further processing on the result of a query that you’ve already run. For example:

* You are developing a complex query step-by-step and you want to add a new layer on top of the previous query and run the new query without recalculating the partial results from scratch.
* The previous query was a [SHOW <objects>](../sql-reference/sql/show.md), [DESCRIBE <object>](../sql-reference/sql/desc.md), or [CALL](../sql-reference/sql/call.md) statement, which returns results in a form that are not easy to reuse.

  For example, you can’t call a stored procedure inside a more complex SQL statement the way you can call a function inside a SQL statement, so the only way to process the output of the stored
  procedure is to post-process the stored query results.

You can perform post-processing by using the [RESULT_SCAN](../sql-reference/functions/result_scan.md) table function. The function returns the results of the previous query as a “table,” and then you can run a new query on the tabular data.

> **Tip:**
>
> You can also use the [pipe operator](../sql-reference/operators-flow.md) (`->>`) instead of the RESULT_SCAN function to process
> the results of a previous command. With the pipe operator, you don’t have to display the results of the initial SELECT, SHOW, or
> other command.

### Examples

Process the result of a [SHOW TABLES](../sql-reference/sql/show-tables.md) command and extract the following columns and rows from the result:

> * `schema_name`, `table_name`, and `rows` columns.
> * Rows for tables that are empty.
>
> ```sqlexample
> SHOW TABLES;
>
> +-----+-------------------------------+-------------+-------+-------+------+
> | Row |           created_on          | name        | ...   | ...   | rows |
> +-----+-------------------------------+-------------+-------+-------+------+
> |  1  | 2018-07-02 09:43:49.971 -0700 | employees   | ...   | ...   | 2405 |
> +-----+-------------------------------+-------------+-------+-------+------+
> |  2  | 2018-07-02 09:43:52.483 -0700 | dependents  | ...   | ...   | 5280 |
> +-----+-------------------------------+-------------+-------+-------+------+
> |  3  | 2018-07-03 11:43:52.483 -0700 | injuries    | ...   | ...   |    0 |
> +-----+-------------------------------+-------------+-------+-------+------+
> |  4  | 2018-07-03 11:43:52.483 -0700 | claims      | ...   | ...   |    0 |
> +-----+-------------------------------+-------------+-------+-------+------+
> | ...                                                                      |
> | ...                                                                      |
> +-----+-------------------------------+-------------+-------+-------+------+
>
> -- Show the tables that are empty.
> SELECT  "schema_name", "name" as "table_name", "rows"
>     FROM table(RESULT_SCAN(LAST_QUERY_ID()))
>     WHERE "rows" = 0;
>
> +-----+-------------+-------------+------+
> | Row | schema_name | name        | rows |
> +-----+-------------+-------------+------+
> |  1  |  PUBLIC     | injuries    |    0 |
> +-----+-------------+-------------+------+
> |  2  |  PUBLIC     | claims      |    0 |
> +-----+-------------+-------------+------+
> | ...                                    |
> | ...                                    |
> +-----+-------------+-------------+------+
> ```

Additional examples are provided in [RESULT_SCAN](../sql-reference/functions/result_scan.md).

---
title: Using privacy policies for differential privacy
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md
section: User Guide
---

# Using privacy policies for differential privacy

This topic describes how a data provider uses privacy policies to implement
[differential privacy](differential-privacy-overview.md).

## About privacy policies

With differential privacy, Snowflake must check each query to determine whether it will exceed the
[privacy budget](differential-privacy-overview.md) associated with the user executing the query. Privacy policies make that possible.
A data provider creates a privacy policy that associates users with privacy budgets, and then assigns that policy to tables and views to
make them privacy-protected.

When an analyst executes a query against a table with a privacy policy, Snowflake evaluates the body of the policy and does one of the
following:

* If the policy associates the user with a privacy budget, Snowflake makes sure that the
  [privacy loss](differential-privacy-overview.md) incurred by the query does not exceed that privacy budget. If the query is executed
  successfully, Snowflake adds the privacy loss incurred by the query to the cumulative privacy loss for the user so that subsequent queries
  don’t exceed the privacy budget.
* If the policy indicates that the user can query the table without restriction, then the results don’t contain
  [noise](differential-privacy-overview.md), and Snowflake does not track the privacy loss incurred by the query.

### Privacy policy best practices

You can create a single privacy policy to protect a single entity, and then assign the privacy policy to all tables and views that contain
information for that entity. This groups all privacy budgets for that entity under one privacy policy. You don’t need to create separate
privacy policies for every table and view.

## Working with privacy policies

Implementing differential privacy for a schema is a three-step process:

1. Create a privacy policy that associates privacy budgets with users based on conditions like
   name, role, or account.
2. Assign that privacy policy to a table or view to ensure that a query or set of queries against
   the data don’t exceed the privacy budget associated with the user who is executing the query.
3. Grant SELECT privileges on the privacy-protected data. Don’t grant privileges before assigning a privacy policy to the table or view
   because the analyst would have full access to the data.

As you manage your differential privacy environment, you can also:

* Modify an existing privacy policy.
* Replace a privacy policy that is currently assigned to a table or view with another policy.
* Detach a privacy policy from a table or view.

### Create a privacy policy

The most basic syntax for creating a new privacy policy is:

```sqlsyntax
CREATE PRIVACY POLICY  <name>
  AS ( ) RETURNS PRIVACY_BUDGET -> <body>
```

Where:

* `name` is the name of the privacy policy.
* `AS ( ) RETURNS PRIVACY_BUDGET` is the signature and return type of the policy. The signature doesn’t accept any arguments
  and the return type is PRIVACY_BUDGET, which is an internal data type. All privacy policies have the same signature and return
  type.
* `body` is a SQL expression that determines whether the privacy policy returns a privacy budget, and if it does, which one.

  The SQL expression of the body calls two functions to control the return value of the policy:

  `NO_PRIVACY_POLICY`
  :   Use the body’s expression to call the NO_PRIVACY_POLICY function when you want a query to have unrestricted access to the table or view
      to which the privacy policy is assigned.

  `PRIVACY_BUDGET`
  :   Use the body’s expression to call the PRIVACY_BUDGET function when you want to return a privacy budget from the policy.

For the complete syntax for the NO_PRIVACY_POLICY and PRIVACY_BUDGET functions, see [CREATE PRIVACY POLICY](../../sql-reference/sql/create-privacy-policy.md).

#### Example privacy policies

Single privacy budget without conditions
:   Create a privacy policy `my_priv_policy` that always returns a privacy budget named `analysts`:

    > ```sqlexample
    > CREATE PRIVACY POLICY my_priv_policy
    >   AS ( ) RETURNS PRIVACY_BUDGET ->
    >   PRIVACY_BUDGET(BUDGET_NAME=> 'analysts');
    > ```

Conditional privacy policy
:   Create a privacy policy `my_priv_policy` that gives `admin` unrestricted access to the privacy-protected table or view while
    associating all other users with the privacy budget `analysts`:

    > ```sqlexample
    > CREATE PRIVACY POLICY my_priv_policy
    >   AS () RETURNS PRIVACY_BUDGET ->
    >     CASE
    >       WHEN CURRENT_USER() = 'ADMIN'
    >         THEN NO_PRIVACY_POLICY()
    >       ELSE PRIVACY_BUDGET(BUDGET_NAME => 'analysts')
    >     END;
    > ```

Conditional privacy policy for cross-account sharing
:   Create a privacy policy `my_priv_policy` that does the following:

    * Gives `admin` unrestricted access to the privacy-protected table or view.
    * Associates the privacy budget `analysts` to users in the same account.
    * Names the privacy budget associated with external account users so it can be easily identified. Privacy budgets are automatically
      namespaced to a specific external account, but using a descriptive naming scheme can help manage the privacy budgets.

    ```sqlexample
    CREATE PRIVACY POLICY my_priv_policy
      AS () RETURNS PRIVACY_BUDGET ->
        CASE
          WHEN CURRENT_USER() = 'ADMIN'
            THEN NO_PRIVACY_POLICY()
          WHEN CURRENT_ACCOUNT() = 'YE74187'
            THEN PRIVACY_BUDGET(BUDGET_NAME => 'analysts')
          ELSE PRIVACY_BUDGET(BUDGET_NAME => 'external.' || CURRENT_ACCOUNT())
        END;
    ```

#### Using context functions in the policy body

You can include [context functions](../../sql-reference/functions-context.md) in the body of a privacy policy so its behavior depends on the
context in which the differentially private query is executed.

You can use the following context functions in the body of a privacy policy:

| Context function | Description |
| --- | --- |
| [CURRENT_ACCOUNT](../../sql-reference/functions/current_account.md) | Returns the account locator in use for the user’s current session. |
| [CURRENT_DATABASE](../../sql-reference/functions/current_database.md) | Returns the database that contains the table that is protected by the privacy policy. |
| [CURRENT_ORGANIZATION_NAME](../../sql-reference/functions/current_organization_name.md) | Returns the name of the organization in use for user’s the current session. |
| [CURRENT_ROLE](../../sql-reference/functions/current_role.md) | Returns the name of the role in use for the current session. |
| [CURRENT_SCHEMA](../../sql-reference/functions/current_schema.md) | Returns the schema that contains the table that is protected by the privacy policy. |
| [CURRENT_USER](../../sql-reference/functions/current_user.md) | Returns the name of the user executing the query. |
| [INVOKER_ROLE](../../sql-reference/functions/invoker_role.md) | Returns the name of the executing role. |
| [INVOKER_SHARE](../../sql-reference/functions/invoker_share.md) | Returns the name of the share that directly accessed the table or view where the INVOKER_SHARE function is invoked. |

> **Tip:**
>
> Context functions like [CURRENT_USER](../../sql-reference/functions/current_user.md) return strings,
> so comparisons using them are case-sensitive. You can use [LOWER](../../sql-reference/functions/lower.md) to convert strings to all lowercase
> if you’d like to do a case-insensitive comparison.

### Modify a privacy policy

Use the [ALTER PRIVACY POLICY](../../sql-reference/sql/alter-privacy-policy.md) command to modify a privacy policy. You can rename the policy, change its body, or
modify a comment.

For example, to replace the existing body of a privacy policy `my_priv_policy` with a new body that always returns a budget
`external_analysts`, execute:

```sqlexample
ALTER PRIVACY POLICY my_priv_policy SET BODY ->
  PRIVACY_BUDGET(BUDGET_NAME => 'external_analysts');
```

### Assign a privacy policy

A privacy policy can be applied to one or more tables or views to protect them with differential privacy. A table or view can have only one privacy policy assigned to it.

Use the ADD PRIVACY POLICY clause of an [ALTER TABLE](../../sql-reference/sql/alter-table.md) or [ALTER VIEW](../../sql-reference/sql/alter-view.md) command to assign
a privacy policy to the table or view. The syntax is:

> ```sqlsyntax
> ALTER { TABLE | [ MATERIALIZED ] VIEW } <name>
>   ADD PRIVACY POLICY <policy_name>
>   { NO ENTITY KEY | ENTITY KEY ( <column_name> ) }
> ```

Where:

* `name` specifies the name of the table or view.
* `policy_name` specifies the name of the privacy policy.
* `column_name` specifies the entity key for the table or view. The [entity key](differential-privacy-admin.md) is a
  column that uniquely identifies an entity within the table or view.

In most cases, you’ll want to define an entity key in order to implement entity-level privacy, though you can use the NO ENTITY KEY clause
to protect individual rows without considering whether data belonging to an entity could exist in multiple rows. For more information, see [About entity-level privacy](differential-privacy-admin.md).

For example, to assign the policy `my_priv_policy` to the table `t1` where the entity key is the `email` column, execute:

> ```sqlexample
> ALTER TABLE t1 ADD PRIVACY POLICY my_priv_policy ENTITY KEY (email);
> ```

### Replace a privacy policy or entity key

The recommended method of replacing a privacy policy or entity key is to use both the ADD and DROP clauses in the same ALTER TABLE or ALTER
VIEW command. This allows you to atomically make the change because both operations take place in the same transaction, leaving no gap in
protection.

To keep the same policy but change the entity key, you need to drop the policy, then add it again with the new entity key.

For example, to assign a new privacy policy to a table that is already protected by a privacy policy:

```sqlexample
ALTER TABLE finance.accounting.customers
  DROP PRIVACY POLICY priv_policy_1,
  ADD PRIVACY POLICY priv_policy_2 ENTITY KEY (email);
```

You can also detach the privacy policy from a table or view in one statement and then set a new policy
on the table or view in a different statement. If you choose this method, the table is not protected by a privacy policy in between
detaching one policy and assigning another. A query could potentially access sensitive data during this time if the users still have SELECT
privileges on the data.

### Detach a privacy policy

Use the DROP PRIVACY POLICY clause of an ALTER TABLE or ALTER VIEW command to detach a privacy policy from a table or
view. After executing this command, the table or view is no longer privacy-protected. The syntax is:

> ```sqlsyntax
> ALTER { TABLE | [ MATERIALIZED ] VIEW } <name> DROP PRIVACY POLICY <policy_name>
> ```

Where:

* `name` specifies the name of the table or view.
* `policy_name` specifies the name of the privacy policy.

For example, to detach the privacy policy `my_priv_policy` from the `finance.accounting.customers` table:

> ```sqlexample
> ALTER TABLE finance.accounting.customers
>   DROP PRIVACY POLICY my_priv_policy;
> ```

## Monitor privacy policies

To help monitor the use of privacy policies, you can list all of the privacy policies in your account, determine which tables and views are
protected by a particular privacy policy, or list all policies currently assigned to a table or view.

### List all privacy policies

You can use the [PRIVACY_POLICIES](../../sql-reference/account-usage/privacy_policies.md) view in the Account Usage schema of the shared
SNOWFLAKE database. This view is a *catalog* for all privacy policies in your Snowflake account. For example:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.PRIVACY_POLICIES
>   ORDER BY POLICY_NAME;
> ```

### Identify privacy policy references

The [POLICY_REFERENCES](../../sql-reference/functions/policy_references.md) Information Schema table function can identify which tables and views are protected by
privacy policies. There are two different syntax options:

1. Return a row for each object (that is, table or view) that has the specified privacy policy set on it:

   ```sqlexample
   USE DATABASE my_db;
   USE SCHEMA information_schema;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(policy_name => 'my_db.my_schema.privpolicy'));
   ```
2. Return a row for each policy assigned to the table named `my_table`. Use the POLICY_KIND column to identify which policies are privacy
   policies.

   ```sqlexample
   USE DATABASE my_db;
   USE SCHEMA information_schema;
   SELECT policy_name,
          policy_kind,
          ref_entity_name,
          ref_entity_domain,
          ref_column_name,
          ref_arg_column_names,
          policy_status
   FROM TABLE(information_schema.policy_references(ref_entity_name => 'my_db.my_schema.my_table', ref_entity_domain => 'table'));
   ```

## Privileges and commands

The following subsections provide information to help manage privacy policies.

### Privacy policy privileges

Snowflake supports the following privileges on the privacy policy object.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| APPLY | Lets you assign a privacy policy to, or detach a privacy policy from, a table or view. |
| OWNERSHIP | Required to alter most properties of a privacy policy. Ownership of the privacy policy can be transferred, which grants full control over the privacy policy. |

### Privacy policy DDL reference

Snowflake supports the following DDL to create and manage privacy policies.

* [CREATE PRIVACY POLICY](../../sql-reference/sql/create-privacy-policy.md)
* [ALTER PRIVACY POLICY](../../sql-reference/sql/alter-privacy-policy.md)
* [DESCRIBE PRIVACY POLICY](../../sql-reference/sql/desc-privacy-policy.md)
* [DROP PRIVACY POLICY](../../sql-reference/sql/drop-privacy-policy.md)
* [SHOW PRIVACY POLICIES](../../sql-reference/sql/show-privacy-policies.md)

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between privacy policy privileges and DDL operations.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Operation | Privilege required |
| --- | --- |
| Create privacy policy | A role with the CREATE PRIVACY POLICY privilege in the same schema. |
| Alter privacy policy | The role with the OWNERSHIP privilege on the privacy policy. |
| Describe privacy policy | **One** of the following:   * A role with the global APPLY PRIVACY POLICY privilege. * A role with the OWNERSHIP privilege on the privacy policy. * A role with the APPLY privilege on the privacy policy. |
| Drop privacy policy. | A role with the OWNERSHIP privilege on the privacy policy. |
| Show privacy policies. | **One** of the following:   * A role with the USAGE privilege on the schema in which the privacy policy exists. * A role with the APPLY PRIVACY POLICY on the account. |
| Assign a privacy policy to, or detach a privacy policy from, a table or view. | **One** of the following:   * A role with the APPLY PRIVACY POLICY privilege on the account. * A role with the APPLY privilege on the privacy policy and the OWNERSHIP privilege on the table or view. |

---
title: Using programmatic access tokens for authentication
source: https://docs.snowflake.com/en/user-guide/programmatic-access-tokens.md
section: User Guide
---

# Using programmatic access tokens for authentication

You can use a programmatic access token to authenticate to the following Snowflake endpoints:

* [Snowflake REST APIs](../developer-guide/snowflake-rest-api/snowflake-rest-api.md).
* The [Snowflake SQL API](../developer-guide/sql-api/index.md).
* The [Snowflake Catalog SDK](tables-iceberg-catalog.md).
* [Snowpark Container Services](../developer-guide/snowpark-container-services/working-with-services.md) endpoints.
* [Snowflake Horizon Catalog endpoint](tables-iceberg-access-using-external-query-engine-snowflake-horizon.md)
* The Spark runtime hosted on Snowflake with Snowpark Connect for Spark. For more information, see [Run Spark workloads from VS Code, Jupyter Notebooks, or a terminal](../developer-guide/snowpark-connect/snowpark-connect-workloads-jupyter.md).

You can also use a programmatic access token as a replacement for a password in the following:

* [Snowflake drivers](../developer-guide/drivers.md).
* Third-party applications that connect to Snowflake (such as Tableau and PowerBI).
* Snowflake APIs and libraries (such as the [Snowpark API](../developer-guide/snowpark/index.md) and the
  [Snowflake Python API](../developer-guide/snowflake-python-api/snowflake-python-overview.md).
* Snowflake command-line clients (such as the [Snowflake CLI](../developer-guide/snowflake-cli/index.md) and
  [SnowSQL](snowsql.md).

You can generate programmatic access tokens for human users (users with TYPE=PERSON) as well as service users (users with
TYPE=SERVICE).

## Prerequisites

You must fulfill the following prerequisites to generate and use programmatic access tokens:

* Network policy requirements
* Authentication policy requirements

### Network policy requirements

By default, the user must be subject to a [network policy](network-policies.md) with one or more
[network rules](network-rules.md) to generate or use programmatic access tokens:

* For service users (where TYPE=SERVICE or TYPE=LEGACY_SERVICE for the user), you can only generate or use a token if the user
  is subject to a network policy.

  This prerequisite limits the use of the token to requests from a specific set of addresses or network identifiers.
* For human users (where TYPE=PERSON for the user), you can generate a token even if the user is not subject to a network policy,
  but the user must be subject to a network policy to authenticate with this token.

  If a human user who is not subject to a network policy needs to use a programmatic access token for authentication, you can
  temporarily bypass the requirement of having a network policy, but we don’t recommend this. See Generating a programmatic access token.

  > **Note:**
  >
  > Users cannot bypass the network policy itself.

The network policy can be activated [for all users in the account](network-policies.md) or
[for a specific user](network-policies.md).

To change this requirement, create or modify an [authentication policy](authentication-policies.md) that specifies
a programmatic access token policy.

To create an authentication policy:

1. Execute the [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) or [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md)
   command. In the PAT_POLICY clause, set NETWORK_POLICY_EVALUATION to one of the following values:

   `ENFORCED_REQUIRED` (default behavior)
   :   The user must be subject to a network policy to generate and use programmatic access tokens.

       If the user is subject to a network policy, the network policy is enforced during authentication.

   `ENFORCED_NOT_REQUIRED`
   :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

       If the user is subject to a network policy, the network policy is enforced during authentication.

   `NOT_ENFORCED`
   :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

       If the user is subject to a network policy, the network policy is not enforced during authentication.

   For example, to create an authentication policy that removes the network policy requirement but enforces any network policy
   that the user is subject to:

   ```sqlexample
   CREATE AUTHENTICATION POLICY my_authentication_policy
     PAT_POLICY=(
       NETWORK_POLICY_EVALUATION = ENFORCED_NOT_REQUIRED
     );
   ```
2. [Apply the authentication policy to an account or user](authentication-policies.md).

The following example alters an existing authentication policy to remove the network policy requirement and prevent the
enforcement of any network policy that the user is subject to:

```sqlexample
ALTER AUTHENTICATION POLICY my_authentication_policy
  SET PAT_POLICY = (
    NETWORK_POLICY_EVALUATION = NOT_ENFORCED
  );
```

### Authentication policy requirements

If there is an [authentication policy](authentication-policies.md) that limits the authentication methods for a
user, the user cannot generate and use programmatic access tokens unless the AUTHENTICATION_METHODS list in that policy includes
`'PROGRAMMATIC_ACCESS_TOKEN'`.

For example, suppose that an authentication policy limits users to using the OAuth and password methods to authenticate:

```sqlexample
CREATE AUTHENTICATION POLICY my_auth_policy
  ...
  AUTHENTICATION_METHODS = ('OAUTH', 'PASSWORD')
  ...
```

Users can’t generate and use programmatic access tokens unless you add `'PROGRAMMATIC_ACCESS_TOKEN'` to the
AUTHENTICATION_METHODS list. You can use the [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md) command to update this list.

For example:

```sqlexample
ALTER AUTHENTICATION POLICY my_auth_policy
  SET AUTHENTICATION_METHODS = ('OAUTH', 'PASSWORD', 'PROGRAMMATIC_ACCESS_TOKEN');
```

## Configuring the default and maximum expiration time

Administrators (users with the ACCOUNTADMIN role) can configure the following settings that affect the expiration time of
programmatic access tokens:

* Setting the maximum expiration time
* Setting the default expiration time

### Setting the maximum expiration time

By default, you can specify an expiration time up to 365 days for a token. If you want to reduce this to a shorter time, create
or modify an [authentication policy](authentication-policies.md) that specifies a programmatic access token policy
with a maximum expiration time.

Execute the [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) or [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md)
command. In the PAT_POLICY clause, set MAX_EXPIRY_IN_DAYS to a value ranging from the
default expiration time to `365`.

For example, to create an authentication policy that sets the maximum to 100 days:

```sqlexample
CREATE AUTHENTICATION POLICY my_authentication_policy
  PAT_POLICY=(
    MAX_EXPIRY_IN_DAYS=100
  );
```

Then, [apply the authentication policy to an account or user](authentication-policies.md).

As another example, to alter an existing authentication policy to set the maximum to 90 days:

```sqlexample
ALTER AUTHENTICATION POLICY my_authentication_policy
  SET PAT_POLICY = (
    MAX_EXPIRY_IN_DAYS=90
  );
```

> **Note:**
>
> If there are existing programmatic access tokens with expiration times that exceed the new maximum expiration time, attempts to
> authenticate with those tokens will fail.
>
> For example, suppose that you generate a programmatic access token named `my_token` with the expiration time of 7 days. If you
> later change the maximum expiration time for all tokens to 2 days, authenticating with `my_token` will fail because the
> expiration time of the token exceeds the new maximum expiration time.

### Setting the default expiration time

By default, a programmatic access token expires after 15 days. If you want to change this, create or modify an
[authentication policy](authentication-policies.md) that specifies a programmatic access token policy with a
default expiration.

Execute the [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) or [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md)
command. In the PAT_POLICY clause, set DEFAULT_EXPIRY_IN_DAYS to a value ranging from `1` to the
maximum expiration time.

For example, to create an authentication policy that sets the default to 5 days:

```sqlexample
CREATE AUTHENTICATION POLICY my_authentication_policy
  PAT_POLICY=(
    DEFAULT_EXPIRY_IN_DAYS=5
  );
```

Then, [apply the authentication policy to an account or user](authentication-policies.md).

As another example, to alter an existing authentication policy to set the default to 30 days:

```sqlexample
ALTER AUTHENTICATION POLICY my_authentication_policy
  SET PAT_POLICY = (
    DEFAULT_EXPIRY_IN_DAYS=30
  );
```

## Removing the role restriction for service users

By default, if you generate a programmatic access token for a service user (a user with TYPE=SERVICE or TYPE=LEGACY_SERVICE),
you must specify the role that will be used during sessions authenticated with that token. That role will be used for privilege
evaluation and object creation.

You can lift this restriction when you use the ALTER USER ADD PROGRAMMATIC ACCESS TOKEN command to generate a programmatic access
token for a service user.

To lift this restriction:

1. Create or modify an [authentication policy](authentication-policies.md) that specifies that you can generate a
   programmatic access token without a role restriction for service users.

> * To create an authentication policy, run the [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) command, setting
>   REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS to FALSE in the PAT_POLICY clause. For example:
>
>   ```sqlexample
>   CREATE AUTHENTICATION POLICY my_authentication_policy
>     PAT_POLICY = (
>       REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
>     );
>   ```
> * To alter an existing authentication policy, run the [ALTER AUTHENTICATION POLICY](../sql-reference/sql/alter-authentication-policy.md) command, setting
>   REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS to FALSE in the PAT_POLICY clause. For example:
>
>   ```sqlexample
>   ALTER AUTHENTICATION POLICY my_authentication_policy
>     SET PAT_POLICY = (
>       REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
>     );
>   ```

1. [Apply the authentication policy to an account or user](authentication-policies.md):

   * To lift the restriction for all service users in the account, apply the authentication policy to the account.
   * To lift the restriction for specific service users, apply the authentication policy to those users.

> **Note:**
>
> * Currently, the authentication policy does not lift the restriction if you are using Snowsight to generate the
>   programmatic access token, but support will be added in the future.
> * Changing REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS from FALSE back to TRUE invalidates any programmatic access tokens for
>   service users that were generated without the role restriction.

## Privileges required for programmatic access tokens

To create and manage a programmatic access token, you need to use a role that has been granted the following privileges:

* For human users (with TYPE=PERSON), you do not need any special privileges to generate, modify, drop, or display a programmatic
  access token for yourself.
* If you’re generating, modifying, dropping, or displaying a programmatic access token for a different user or a service user
  (with TYPE=SERVICE), you must use a role that has the OWNERSHIP or MODIFY PROGRAMMATIC AUTHENTICATION METHODS privilege on that
  user.

  For example, suppose that you want to grant users with the `my_service_owner_role` custom role the ability to generate and
  manage programmatic access tokens for the service user `my_service_user`. You can grant the MODIFY PROGRAMMATIC AUTHENTICATION
  METHODS privilege on the `my_service_user` user to the role `my_service_owner_role`:

  ```sqlexample
  GRANT MODIFY PROGRAMMATIC AUTHENTICATION METHODS ON USER my_service_user
    TO ROLE my_service_owner_role;
  ```

## Generating a programmatic access token

You can generate a programmatic access token in Snowsight or by executing SQL commands.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select the user that you want to generate the programmatic access token for.
4. Under Programmatic access tokens, select Generate new token.
5. In the New programmatic access token dialog, enter the following information:

   1. In the Name field, enter a name for the token.

      In the name, you can only use letters, numbers, and underscores. The name must start with a letter or underscore.
      Letters in the name are stored and resolved as uppercase characters.
   2. In the Comment field, enter a descriptive comment about the token.

      After you create the token, this comment is displayed under the token in the Programmatic access tokens section.
   3. From Expires in, choose the number of days after which the token should expire.
   4. If you want to restrict the scope of the operations that can be performed:

      1. Select One specific role (recommended).
      2. Select the role that should be used for privilege evaluation and object creation.
      > **Note:**
      >
      > If you are generating a token for a service user (a user with TYPE=SERVICE or TYPE=LEGACY_SERVICE), you must select a
      > role.

      When you use this token for authentication, any objects that you create are owned by this role, and this role is used for
      privilege evaluation.

      > **Note:**
      >
      > Secondary roles are not used, even if [DEFAULT_SECONDARY_ROLES](../sql-reference/sql/create-user.md) is set to
      > (‘ALL’) for the user.

      If you select Any of my roles instead, any objects that you create owned by your primary role, and privileges are
      evaluated against your [active roles](security-access-control-overview.md).
   5. Select Generate.
6. Copy or download the generated programmatic access token so that you can use the token for authentication.

   > **Note:**
   >
   > After you close this message box, you will not be able to copy or download this token.

The new token is listed in the Programmatic access tokens section.

As noted earlier, to use a programmatic access token, the user associated with the token
must be subject to a network policy, unless you set up an authentication policy
to change this requirement.

If a human user who is not subject to a network policy needs to use a programmatic access token for authentication, you can
temporarily bypass the requirement of having a network policy by selecting
 » Bypass requirement for network policy.

> **Note:**
>
> Bypass requirement for network policy does not allow users to bypass the network policy itself.

Execute [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-add-programmatic-access-token.md), specifying a name for the token.

* If you’re generating the token for yourself, omit the `username` parameter. For example, to generate a token named
  `example_token`:

  ```sqlexample
  ALTER USER ADD PROGRAMMATIC ACCESS TOKEN example_token;
  ```
* If you’re generating the token on behalf of a user for a person (if the USER object has TYPE=PERSON), specify the name of
  the user. For example, to generate a token named `example_token` for the user `example_user`:

  ```sqlexample
  ALTER USER IF EXISTS example_user ADD PROGRAMMATIC ACCESS TOKEN example_token;
  ```

> **Tip:**
>
> You can use the keyword PAT as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKEN.

Note the following:

* If you’re generating the token on behalf of a user for a service (if the USER object has TYPE=SERVICE), or if you want to
  restrict the scope of the operations that can be performed, set ROLE_RESTRICTION to the role that should be used for
  privilege evaluation and object creation.

  This must be a role that has been granted to the user. You can only specify this role when generating the token.

  > **Note:**
  >
  > If you are generating a token for a service user (a user with TYPE=SERVICE or TYPE=LEGACY_SERVICE), you must specify the
  > ROLE_RESTRICTION parameter, unless you have set up an authentication policy to bypass this restriction. For information,
  > see Removing the role restriction for service users.

  When you use this token for authentication, any objects that you create are owned by this role, and this role is used for
  privilege evaluation.

  > **Note:**
  >
  > Secondary roles are not used, even if [DEFAULT_SECONDARY_ROLES](../sql-reference/sql/create-user.md) is set to
  > (‘ALL’) for the user.

  For example, suppose that you want to generate a token named `example_service_user_token` for the service user
  `example_service_user`. When the service user authenticates with this token, the `example_service_user_role` role
  (which has been granted to that service user) should be used to evaluate privileges and own any objects created by the user.

  To generate a token for this case, execute the following statement:

  ```sqlexample
  ALTER USER IF EXISTS example_service_user
    ADD PROGRAMMATIC ACCESS TOKEN example_service_user_token
      ROLE_RESTRICTION = 'example_service_user_role';
  ```

  If you omit ROLE_RESTRICTION, any objects that you create owned by your primary role, and privileges are evaluated against
  your [active roles](security-access-control-overview.md).
* To specify when the token should expire (overriding the default expiration time),
  set the DAYS_TO_EXPIRY parameter to the number of days after which the token should expire.

  You can specify a value from `1` (for 1 day) to the value of the
  maximum expiration time.

  For example, to generate a programmatic access token that expires after 10 days:

  ```sqlexample
  ALTER USER IF EXISTS example_user ADD PROGRAMMATIC ACCESS TOKEN example_token
    DAYS_TO_EXPIRY = 10
    COMMENT = 'An example of a token that expires in 10 days';
  ```
* As noted earlier, to use a programmatic access token, the user associated with the token
  must be subject to a network policy, unless you set up an authentication policy
  to change this requirement.

  For human users (where the TYPE property of the user is PERSON) that are not subject to a network policy, you can
  temporarily bypass the requirement of having a network policy by setting MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT to the
  number of minutes during which you want to bypass this requirement.

  For example, suppose that you are a user who is not subject to a network policy, and you want to use a programmatic access
  token for authentication. You can bypass the requirement of having a network policy for 4 hours by setting
  MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT to 240.

  > **Note:**
  >
  > Setting MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT does not allow users to bypass the network policy itself.

ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN prints the token in the `token_secret` column in the output:

```output
+---------------+-----------------+
| token_name    | token_secret    |
|---------------+-----------------|
| EXAMPLE_TOKEN | ... (token) ... |
+---------------+-----------------+
```

> **Note:**
>
> The output of this command is the only place where the token appears. Copy the token from the output for use when
> authenticating to an endpoint.

After you create a programmatic access token, you cannot change the expiration date. You must revoke the token and generate a new
token with the new expiration time.

For programmatic access tokens that are restricted to a role, if the role is revoked from the user or the role is dropped, the
user can no longer use the programmatic access token for authentication.

## Using a programmatic access token

The following sections explain how to use a programmatic access token as a password and for authentication to a Snowflake endpoint:

* Using a programmatic access token as a password
* Using a programmatic access token to authenticate to an endpoint

### Using a programmatic access token as a password

To authenticate with a programmatic access token as the password, you can specify the token for the value of the password in the driver settings or in the call to connect to Snowflake.

For example, if you’re using the Snowflake Connector for Python, you can specify the programmatic access token as the `password` argument when calling the `snowflake.connector.connect` method.

```python
conn = snowflake.connector.connect(
    user=USER,
    password=<programmatic_access_token>,
    account=ACCOUNT,
    warehouse=WAREHOUSE,
    database=DATABASE,
    schema=SCHEMA
)
```

In the same way, you can use programmatic access tokens in place of a password in third-party applications (such as Tableau or PowerBI). Paste the programmatic access token in the field for the password.

> **Note:**
>
> By default, using programmatic access tokens
> requires a network policy to be activated for a user or for all users in the account.
> To use programmatic access tokens with a third-party application, you must create a network policy that allows requests from
> the IP address ranges of the third-party application.

### Using a programmatic access token to authenticate to an endpoint

To authenticate with a programmatic access token, set the following HTTP headers in the request:

* `Authorization: Bearer token_secret`
* `X-Snowflake-Authorization-Token-Type: PROGRAMMATIC_ACCESS_TOKEN` (optional)

For example, if you’re using cURL to send a request to a
[Snowflake REST API](../developer-guide/snowflake-rest-api/snowflake-rest-api.md) endpoint:

```bash
curl --location "https://myorganization-myaccount.snowflakecomputing.com/api/v2/databases" \
  --header "Authorization: Bearer <token_secret>"
```

As another example, if you’re using cURL to send a request to the
[Snowflake SQL API](../developer-guide/sql-api/index.md) endpoint:

```bash
curl -si -X POST https://myorganization-myaccount.snowflakecomputing.com/api/v2/statements \
  --header "Content-Type: application/json" \
  --header "Accept: application/json" \
  --header "Authorization: Bearer <token_secret>" \
  --data '{"statement": "select 1"}'
```

If the request fails with a `PAT_INVALID` error, the error might have occurred for one of the following reasons:

* The user associated with the programmatic access token was not found.
* Validation failed.
* The role associated with the programmatic access token was not found.
* The user is not associated with the specified programmatic access token.

## Managing programmatic access tokens

The following sections explain how to use, modify, list, rotate, revoke, and re-enable programmatic access tokens:

* Listing programmatic access tokens
* Renaming a programmatic access token
* Rotating a programmatic access token
* Revoking a programmatic access token
* Re-enabling a disabled programmatic access token

> **Note:**
>
> You cannot modify, rename, rotate, or revoke a programmatic access token in a session where you used a programmatic access
> token for that same user for authentication.

### Listing programmatic access tokens

You can list the programmatic access token for a user in Snowsight or by executing SQL commands.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select the user who owns the programmatic access token.

   The programmatic access tokens for the user are listed Under Programmatic access tokens.

Execute the [SHOW USER PROGRAMMATIC ACCESS TOKENS](../sql-reference/sql/show-user-programmatic-access-tokens.md) command. For example, to view information about
the programmatic access tokens associated with the user `example_user`:

```sqlexample
SHOW USER PROGRAMMATIC ACCESS TOKENS FOR USER example_user;
```

To list the programmatic access tokens for all users in the account, query the [CREDENTIALS view](../sql-reference/account-usage/credentials.md)
for rows where the `type` column contains `'PAT'`. For example:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CREDENTIALS WHERE type = 'PAT';
```

> **Note:**
>
> After seven days, expired programmatic access tokens are deleted and no longer appear in either Snowsight or the output
> of the SHOW USER PROGRAMMATIC ACCESS TOKENS command.

### Renaming a programmatic access token

> **Note:**
>
> You cannot rename a programmatic access token in a session where you used a programmatic access token for that same user for
> authentication.

You can change the name of a programmatic access token in Snowsight or by executing SQL commands.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select the user associated with the programmatic access token.
4. Under Programmatic access tokens, locate the programmatic access token and select  »
   Edit.
5. In the Name field, change the name of the token, and select Save.

Execute
[ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN … RENAME TO](../sql-reference/sql/alter-user-modify-programmatic-access-token.md).
For example:

> ```sqlexample
> ALTER USER IF EXISTS example_user MODIFY PROGRAMMATIC ACCESS TOKEN old_token_name
>   RENAME TO new_token_name;
> ```

### Rotating a programmatic access token

> **Note:**
>
> You cannot rotate a programmatic access token in a session where you used a programmatic access token for that same user for
> authentication.

You can rotate a programmatic access token in Snowsight or by executing SQL commands.

Rotating a token returns a new token secret that has the same name and an extended expiration time. Rotating a token also expires
the existing token secret. Use the new token for authenticating to Snowflake.

> **Note:**
>
> When you rotate a programmatic access token:
>
> * Snowflake does not verify that the network policy and
>   authentication policy requirements are met.
> * If the programmatic access token is restricted to a role, Snowflake does not verify that the user associated with the token
>   has been granted that role.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select the user associated with the programmatic access token.
4. Under Programmatic access tokens, locate the programmatic access token and select  »
   Rotate.
5. If you want the previous token secret to expire immediately, select Expire current secret immediately.
6. Select Rotate token.
7. Copy or download the generated programmatic access token so that you can use the token for authentication.

   > **Note:**
   >
   > After you close this message box, you will not be able to copy or download this token.

Execute the [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-rotate-programmatic-access-token.md) command.

For example, to rotate the programmatic access token `example_token` associated with the user `example_user`:

```sqlexample
ALTER USER IF EXISTS example_user ROTATE PROGRAMMATIC ACCESS TOKEN example_token;
```

If you want to specify when the old token expires, set EXPIRE_ROTATED_TOKEN_AFTER_HOURS to the number of hours before the
old token should expire. For example, to expire the old token immediately:

```sqlexample
ALTER USER IF EXISTS example_user
  ROTATE PROGRAMMATIC ACCESS TOKEN example_token
  EXPIRE_ROTATED_TOKEN_AFTER_HOURS = 0;
```

The command prints the token in the `token_secret` column in the output:

```output
+---------------+-----------------+-------------------------------------+
| token_name    | token_secret    | rotated_token_name                  |
|---------------+-----------------+-------------------------------------|
| EXAMPLE_TOKEN | ... (token) ... | EXAMPLE_TOKEN_ROTATED_1744239049066 |
+---------------+-----------------+-------------------------------------+
```

> **Note:**
>
> The output of this command is the only place where the new token appears. Copy the token from the output for use
> when authenticating to an endpoint.

The output also includes the name of the older token that has been rotated:

* If you want to know when this token expires, you can use the [SHOW USER PROGRAMMATIC ACCESS TOKENS](../sql-reference/sql/show-user-programmatic-access-tokens.md)
  command and look for the token name. For example:

  ```sqlexample
  SHOW USER PROGRAMMATIC ACCESS TOKENS FOR USER example_user;
  ```

  ```output
  +--------------------------------------+--------------+------------------+-------------------------------+---------+---------+-------------------------------+--------------+-------------------------------------------+----------------+
  | name                                 | user_name    | role_restriction | expires_at                    | status  | comment | created_on                    | created_by   | mins_to_bypass_network_policy_requirement | rotated_to     |
  |--------------------------------------+--------------+------------------+-------------------------------+---------+---------+-------------------------------+--------------+-------------------------------------------+----------------|
  | EXAMPLE_TOKEN                        | EXAMPLE_USER | MY_CUSTOM_ROLE   | 2025-05-09 07:18:47.360 -0700 | ACTIVE  |         | 2025-04-09 07:18:47.360 -0700 | EXAMPLE_USER | NULL                                      | NULL           |
  | EXAMPLE_TOKEN_ROTATED_1744239049066  | EXAMPLE_USER | MY_CUSTOM_ROLE   | 2025-04-10 15:21:49.652 -0700 | ACTIVE  |         | 2025-04-09 15:21:49.652 -0700 | EXAMPLE_USER | NULL                                      | EXAMPLE_TOKEN  |
  +--------------------------------------+--------------+------------------+-------------------------------+---------+---------+-------------------------------+--------------+-------------------------------------------+----------------+
  ```
* If you want to revoke this token, you can use the [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-remove-programmatic-access-token.md)
  command and specify the name of the older token. For example:

  ```sqlexample
  ALTER USER IF EXISTS example_user
    REMOVE PROGRAMMATIC ACCESS TOKEN EXAMPLE_TOKEN_ROTATED_1744239049066;
  ```

  ```output
  +-------------------------------------------------------------------------------------+
  | status                                                                              |
  |-------------------------------------------------------------------------------------|
  | Programmatic access token EXAMPLE_TOKEN_ROTATED_1744239049066 successfully removed. |
  +-------------------------------------------------------------------------------------+
  ```

### Revoking a programmatic access token

> **Note:**
>
> You cannot revoke a programmatic access token in a session where you used a programmatic access token for that same user for
> authentication.

You can revoke a programmatic access token in Snowsight or by executing SQL commands.

SnowsightSQL

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Select the user associated with the programmatic access token.
4. Under Programmatic access tokens, locate the programmatic access token and select  »
   Delete.

Execute the [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-remove-programmatic-access-token.md) command.

For example, to revoke the programmatic access token `example_token` associated with the user `example_user`:

```sqlexample
ALTER USER IF EXISTS example_user REMOVE PROGRAMMATIC ACCESS TOKEN example_token;
```

### Re-enabling a disabled programmatic access token

> **Note:**
>
> You cannot modify a programmatic access token in a session where you used a programmatic access token for that same user
> authentication.

When you [disable login access for a user](admin-user-management.md) or Snowflake locks out a user from
logging in, the programmatic access tokens for that user are automatically disabled.

> **Note:**
>
> Programmatic access tokens are not disabled when a user is [temporarily locked out](admin-user-management.md)
> (for example, due to five or more failed attempts to authenticate).

If you run the [SHOW USER PROGRAMMATIC ACCESS TOKENS](../sql-reference/sql/show-user-programmatic-access-tokens.md) command, the value in the `status` column is
`DISABLED` for tokens associated with that user.

```sqlexample
SHOW USER PROGRAMMATIC ACCESS TOKENS FOR USER example_user;
```

```output
+---------------+--------------+------------------+-------------------------------+----------+---------+-------------------------------+--------------+-------------------------------------------+------------+
| name          | user_name    | role_restriction | expires_at                    | status   | comment | created_on                    | created_by   | mins_to_bypass_network_policy_requirement | rotated_to |
|---------------+--------------+------------------+-------------------------------+----------+---------+-------------------------------+--------------+-------------------------------------------+------------|
| EXAMPLE_TOKEN | EXAMPLE_USER | MY_ROLE          | 2025-04-28 12:13:46.431 -0700 | DISABLED | NULL    | 2025-04-13 12:13:46.431 -0700 | EXAMPLE_USER | NULL                                      | NULL       |
+---------------+--------------+------------------+-------------------------------+----------+---------+-------------------------------+--------------+-------------------------------------------+------------+
```

If you later enable login access for that user or Snowflake unlocks login access for that user, the programmatic access tokens
for that user remain disabled. To enable the tokens again, execute the
[ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](../sql-reference/sql/alter-user-modify-programmatic-access-token.md) command, and set DISABLED to FALSE. For example:

```sqlexample
ALTER USER example_user MODIFY PROGRAMMATIC ACCESS TOKEN example_token SET DISABLED = FALSE;
```

## Getting information about a programmatic access token from the secret

If you need information about a programmatic access token, given the secret for that token, call the
[SYSTEM$DECODE_PAT](../sql-reference/functions/system_decode_pat.md) function. You can use this function if the secret has been compromised and you
want to know the user associated with the token, the name of the token, and the state of the token.

For example:

```sqlexample
SELECT SYSTEM$DECODE_PAT('abC...Y5Z');
```

```output
+------------------------------------------------------------------------+
| SYSTEM$DECODE_PAT('☺☺☺...☺☺☺')                                         |
|------------------------------------------------------------------------|
| {"STATE":"ACTIVE","PAT_NAME":"MY_EXAMPLE_TOKEN","USER_NAME":"MY_USER"} |
+------------------------------------------------------------------------+
```

## Handling a leaked programmatic access token

Snowflake is part of the
[GitHub secret scanning partner program](https://docs.github.com/en/code-security/secret-scanning/secret-scanning-partnership-program/secret-scanning-partner-program).
If the secret for a programmatic access token has been checked in to a public GitHub repository, Snowflake is notified and
disables the programmatic access token automatically. Snowflake sends an email notification about the leaked token to your account
administrator and to the user who is associated with the token.

The notification includes:

* The name of the Snowflake account
* The name of the Snowflake user
* The name, ID, and status of the programmatic access token
* The URL of the GitHub repository

> **Note:**
>
> The account administrator and user will receive the email notification only if they have
> [verified their email addresses](ui-snowsight-profile.md).

If you own a GitHub repository, you can allow Snowflake to disable leaked tokens by
[enabling secret scanning](https://docs.github.com/en/code-security/secret-scanning/enabling-secret-scanning-features/enabling-secret-scanning-for-your-repository).
You can also enable
[push protection](https://docs.github.com/en/code-security/secret-scanning/enabling-secret-scanning-features/enabling-push-protection-for-your-repository)
to prevent Snowflake programmatic access tokens from being committed to your GitHub repository.

If a programmatic access token is leaked, you should examine the queries executed during the sessions that used the programmatic
access token for authentication. To identify these queries, you can use the following SQL statement:

```sqlexample
WITH session_ids_with_leaked_pats AS (
  SELECT DISTINCT s.session_id
    FROM SNOWFLAKE.ACCOUNT_USAGE.SESSIONS s JOIN SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY lh
      ON s.login_event_id= lh.event_id
    WHERE
      lh.first_authentication_factor_id = '<pat_id>'
)
SELECT qh.*
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY qh JOIN session_ids_with_leaked_pats slp
    ON qh.session_id = slp.session_id;
```

In addition, if the programmatic access token has been replicated to another account, you must disable the token in that account.
To determine which accounts might contain the replicated token, run the [SHOW REPLICATION GROUPS](../sql-reference/sql/show-replication-groups.md) command.

## Identifying the login sessions in which a programmatic access token was used

To determine when a programmatic access token was used for authentication, you can join the
[LOGIN_HISTORY](../sql-reference/account-usage/login_history.md) and
[CREDENTIALS](../sql-reference/account-usage/credentials.md) views in the ACCOUNT_USAGE schema on the column containing the
credential ID:

* The LOGIN_HISTORY view contains the credential ID in the `first_authentication_factor_id` column, if the
  `first_authentication_factor` column contains `PROGRAMMATIC_ACCESS_TOKEN`.
* The CREDENTIALS view contains the credential ID in the `credential_id` column.

For example:

```sqlexample
SELECT
    login.event_timestamp,
    login.user_name,
    cred.name
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOGIN_HISTORY login
    JOIN SNOWFLAKE.ACCOUNT_USAGE.CREDENTIALS cred
    ON login.first_authentication_factor_id = cred.credential_id
  WHERE login.first_authentication_factor = 'PROGRAMMATIC_ACCESS_TOKEN';
```

```output
+-------------------------------+-----------+-----------+
| EVENT_TIMESTAMP               | USER_NAME | NAME      |
|-------------------------------+-----------+-----------|
| 2025-08-01 09:01:06.098 -0700 | USER_A    | PAT_FOR_A |
| 2025-07-08 13:33:07.687 -0700 | USER_B    | MY_PAT    |
| 2025-07-08 14:15:26.234 -0700 | USER_C    | MY_TOKEN  |
+-------------------------------+-----------+-----------+
```

To get information about the queries that were run during this login session, you can join the LOGIN_HISTORY view with the
[SESSIONS](../sql-reference/account-usage/sessions.md) view on the `login_event_id` column to get the session ID, and then
use that to join the [QUERY_HISTORY](../sql-reference/account-usage/query_history.md) view.

## Best practices

* If you need to store a programmatic access token, do so securely (for example, by using a password or secrets manager).
* Avoid exposing programmatic access tokens in code.
* Restrict the use of the token to a specific role when generating the token.
* Regularly review and rotate programmatic access tokens. Users can set the expiration time when
  generating the token, and administrators can
  reduce the maximum expiration time for all tokens to encourage the rotation of
  tokens.

## Limitations

* You can only view the secret for a programmatic access token when you create it. After you create a programmatic access token,
  you can only view information about the token and not the secret for the token.
* You cannot change some of the properties of a programmatic access token after generating the token:

  + After you generate the token, you cannot change or remove the role that the token is restricted to.
  + After you generate the token, you cannot change the expiration time of the token. You can
    revoke a programmatic access token and generate a new token with a different expiration time.

* Each user can have a maximum of 15 programmatic access tokens.

  + This number includes tokens that have been disabled.
  + This number does not include tokens that have expired.

* Although there is a command that administrators can run to list all programmatic access tokens for a given user
  ([SHOW USER PROGRAMMATIC ACCESS TOKENS](../sql-reference/sql/show-user-programmatic-access-tokens.md)), there is no command for listing all programmatic access
  tokens in the account.

  Administrators can, however, query the [CREDENTIALS view](../sql-reference/account-usage/credentials.md) view to list the programmatic access
  tokens in account.
* You cannot recover a programmatic access token after you revoke it.
* You cannot modify, rename, rotate, or revoke a programmatic access token in a session where you used a programmatic access
  token for that same user for authentication.

---
title: Using query insights to improve performance
source: https://docs.snowflake.com/en/user-guide/query-insights.md
section: User Guide
---

# Using query insights to improve performance

If conditions exist that affect query performance, Snowflake provides insights about these conditions. Each insight includes a
message that explains how query performance might be affected and provides a general recommendation for improving performance.

You can access these insights in Snowsight and by querying
[the QUERY_INSIGHTS view](../sql-reference/account-usage/query_insights.md).

The next sections provide details about query insights:

* List of insight types
* Viewing the query insights in Snowsight
* Limitations

## List of insight types

The Query Insights pane and [The QUERY_INSIGHTS view](../sql-reference/account-usage/query_insights.md) provide the
insights, which include:

* A message about the condition detected and how it can affect query performance.
* Details about the part of the query that produced the condition.
* A suggested next step to address the condition, if the condition negatively affects performance.

The following table lists the types of insights by type ID.

|  |  |
| --- | --- |
| Type ID | Insight |
| `QUERY_INSIGHT_NO_FILTER_ON_TOP_OF_TABLE_SCAN` | No filter on table scan |
| `QUERY_INSIGHT_INAPPLICABLE_FILTER_ON_TABLE_SCAN` | Filter not applicable |
| `QUERY_INSIGHT_UNSELECTIVE_FILTER` | Filter not selective |
| `QUERY_INSIGHT_LIKE_WITH_LEADING_WILDCARD` | LIKE filter with leading wildcard |
| `QUERY_INSIGHT_FILTER_WITH_CLUSTERING_KEY` | Filter uses clustering key |
| `QUERY_INSIGHT_SEARCH_OPTIMIZATION_USED` | Query benefited from search optimization |
| `QUERY_INSIGHT_SNOWFLAKE_OPTIMA` | Query benefited from Snowflake Optima |
| `QUERY_INSIGHT_SEARCH_OPTIMIZATION_AND_SNOWFLAKE_OPTIMA` | Query benefited from search optimization and Snowflake Optima |
| `QUERY_INSIGHT_JOIN_WITH_NO_JOIN_CONDITION` | Join with no join condition |
| `QUERY_INSIGHT_INEFFICIENT_JOIN_CONDITION` | Join with inefficient join condition |
| `QUERY_INSIGHT_NESTED_EXPLODING_JOIN` | Exploding join (nested join) |
| `QUERY_INSIGHT_EXPLODING_JOIN` | Exploding join (not nested) |
| `QUERY_INSIGHT_INEFFICIENT_AGGREGATE` | Unnecessary aggregation |
| `QUERY_INSIGHT_UNNECESSARY_UNION_DISTINCT` | Unnecessary UNION [ DISTINCT ] clause |
| `QUERY_INSIGHT_REMOTE_SPILLAGE` | Remote spillage |
| `QUERY_INSIGHT_QUEUED_OVERLOAD` | Query was in the queue for the warehouse for too long |

### No filter on table scan

A query or subquery has no WHERE clause, which means that the query scans an entire table and might return more rows than
intended.

To improve performance, add a WHERE clause to reduce the amount of data scanned.

### Filter not applicable

A WHERE clause doesn’t filter out any rows, which means that the query might scan more data than intended.

To improve performance, add a more selective condition to the WHERE clause, or make the existing condition more selective.

### Filter not selective

A WHERE clause doesn’t significantly reduce the number of rows, which means that the query might scan more data than intended.

Unlike the Filter not applicable insight, this insight indicates that the WHERE
clause is filtering out some rows but it could have been more selective.

To improve performance, add a more selective condition to the WHERE clause, or make the existing condition more selective.

### LIKE filter with leading wildcard

The query uses a LIKE filter that starts with a wildcard character. Specifying a pattern that starts with a wildcard can result in
scanning a large amount of data.

To reduce the amount of data scanned, specify a pattern that does not start with a wildcard, if possible. If you need to specify a
pattern that starts with a wildcard, consider enabling [search optimization](search-optimization-service.md) for
more efficient pattern matching.

### Filter uses clustering key

The query benefited from filtering on a [clustering key for the table](tables-clustering-keys.md).

### Query benefited from search optimization

The query benefited from filtering on a column that is configured for
[search optimization](search-optimization-service.md).

### Query benefited from Snowflake Optima

The query benefited from [Snowflake Optima](snowflake-optima.md).

### Query benefited from search optimization and Snowflake Optima

The query benefited from [search optimization](search-optimization-service.md) and
[Snowflake Optima](snowflake-optima.md).

### Join with no join condition

The join is missing the join condition. The result is a [cross join](querying-joins.md), which returns every
possible combination of rows.

To reduce the row count produced by this join, specify one or more join conditions.

### Join with inefficient join condition

The join contains a complex join condition that is evaluated after the data sets are joined. This is less efficient than if the
condition were evaluated before the data sets were joined, which reduces the amount of data that the join must process.

To speed up this query, simplify the join condition.

### Exploding join (nested join)

A join that includes the output of at least one other join is
[returning many more rows than are in the tables being joined](ui-snowsight-activity.md). This might indicate a problem with
the join conditions for the child joins.

To prevent the join from producing more rows than the joined tables contain, add or change the join conditions for the child
joins. In addition, adding a WHERE clause to a subquery used in a child join might reduce the number of rows returned.

### Exploding join (not nested)

A join of two data sets (for example, tables, views, or output from table function calls) is
[returning many more rows than the joined tables contain](ui-snowsight-activity.md). This might indicate a problem with
the join condition.

To prevent the join from producing more rows than are in the tables being joined, add or change the join condition. In addition,
adding a WHERE clause to a subquery used by this join might reduce the number of rows returned.

### Unnecessary aggregation

The DISTINCT or GROUP BY clause produces the same number of rows as the same statement without the DISTINCT or GROUP BY clause.
Specifying the clause introduces an additional processing step that has no effect on the result.

To improve performance, remove the unnecessary DISTINCT or GROUP BY clause.

### Unnecessary UNION [ DISTINCT ] clause

The UNION [ DISTINCT ] clause isn’t necessary because the input sets are disjoint.

To improve performance, use UNION ALL, rather than UNION [ DISTINCT ].

### Remote spillage

This query scanned more data than the warehouse had capacity to store. As a result, the warehouse
[spilled data](ui-snowsight-activity.md) to storage, which slowed down the processing of the query.

To prevent this problem, use a larger warehouse that has more capacity. If using a larger warehouse is not an option, change the
query to process data in smaller batches.

### Query was in the queue for the warehouse for too long

This query was [waiting in the queue for the warehouse](warehouses-overview.md) for too long.

To avoid this problem, use a larger warehouse that has more capacity, or use a warehouse that has fewer concurrent queries.

## Viewing the query insights in Snowsight

In [Query Profile](ui-snowsight-activity.md) tab under Query History, you can view the insights for a
query. The nodes that have corresponding insights are highlighted.

The Query Insights pane on the right displays each type of insight that was detected for this query and lists each instance
of that insight type that was detected for the query. To learn more about the condition that was detected, select View
next to an entry in the Query Insights pane.

The details include the recommended next steps to take to improve the performance of the query. You can select
Learn more to view more information about this insight.

## Limitations

* Insights are produced for SQL queries that are made against databases and are processed by warehouses.
* Snowflake does not produce the “filter not selective” insight for queries that
  are accelerated by the [query acceleration service](query-acceleration-service.md).
* Insights are not produced for:

  + Queries for which the query plan takes multiple steps to finish.
  + Queries involving secure objects.
  + Queries executed against hybrid tables (Unistore).
  + Queries generated by Native Apps.
  + EXPLAIN queries.
  + Queries that [reuse results](querying-persisted-results.md).
  + Queries executing on [interactive tables](interactive.md).

---
title: Using Sequences
source: https://docs.snowflake.com/en/user-guide/querying-sequences.md
section: User Guide
---

# Using Sequences

Sequences are used to generate unique numbers across sessions and statements, including concurrent statements. They can be used to generate
values for a primary key or any column that requires a unique value.

> **Important:**
>
> Snowflake does not guarantee generating sequence numbers without gaps. The generated numbers are not necessarily contiguous.

## Sequence Semantics

Snowflake sequences currently utilize the following semantics:

* All values generated by a sequence are globally unique as long as the sign of the sequence interval does not change (e.g. by changing
  the step size). Concurrent queries never observe the same value, and values within a single query are always distinct.
* Changing the sequence interval from positive to negative (e.g. from `1` to `-1`), or vice versa may result in
  duplicates. For example, if the first query(s) return sequence values `1`, `2`, and `3`, and if the interval
  is changed from `1` to `-1`, then the next few values generated include `2`, and `1`, which were
  generated previously.
* Snowflake may calculate the next value for a sequence as soon as the current sequence number is used, rather than waiting
  until the next sequence number is requested.

  A consequence of this is that an `ALTER SEQUENCE ... SET INCREMENT ...` command might not affect the next operation
  that uses the sequence. For an example, see Understanding the Effects of Reversing the Direction of a Sequence.
* Each generated sequence value additionally reserves values depending on the sequence interval, also called the “step”. The
  reserved values span from the sequence value to

  `<value>  +  (sign(<step>) * abs(<step>))  -  (sign(<step>) * 1)`

  (inclusive).

  Thus, if the value `100` is generated:

  + With a step of `2`, values `100` and `101` are reserved.
  + With a step of `10`, values `100` to `109` are reserved.
  + With a step of `-5`, values `96` to `100` are reserved.

  A reserved value is never generated by the sequence as long as the step/interval is never modified.

* Values generated by a sequence are greater than the maximum value produced by a previous statement (or less than the
  minimum value if the step size is negative) if the following are true:

  + The sequence does not have the NOORDER property.

    NOORDER specifies that the values are not guaranteed to be in increasing order.

    For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.

    NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
    clients are executing multiple INSERT statements).
  + The previous statement completed, and an acknowledgment was received, prior to submitting the current statement.

  This behavior does not hold if the sign of the interval is changed (positive to negative or negative to positive).

There is no guarantee that values from a sequence are contiguous (gap-free) or that the sequence values are assigned in a particular order.
There is, in fact, no way to assign values from a sequence to rows in a specified order other than to use single-row statements (this still
provides no guarantee about gaps).

A sequence value can represent a 64-bit two’s complement integer (`-2^63` to `2^63 - 1`). If the internal representation of a
sequence’s next value exceeds this range (in either direction) an error results and the query fails. Note that this may result in
losing these sequence values.

In this situation, you must either use a smaller (in magnitude) increment value or create a new sequence with a smaller start value. As gaps
may occur, the internal representation of the next value may exceed the allowable range even if the returned sequence values are all within
the allowable range. Snowflake does not provide an explicit guarantee regarding how to avoid this error, but Snowflake supports sequence
objects that correctly provide unique values. A sequence object created with a start value of `1` and an increment value of
`1` is extremely unlikely to exhaust the allowable range of sequence values.

## Referencing Sequences

### `currval` Not Supported

Many databases provide a `currval` sequence reference; however, Snowflake does not. `currval` in other systems is typically used
to create primary-foreign key relationships between tables — a first statement inserts a single row into the fact table using a sequence
to create a key. Subsequent statements insert rows into the dimension tables using `currval` to refer to the fact table’s key.

This pattern is contrary to Snowflake best practices — bulk queries should be preferred over small, single-row queries. The same task can be
better accomplished using multi-table [INSERT](../sql-reference/sql/insert-multi-table.md) and sequence references in nested subqueries.
For a detailed example, see Ingesting and Normalizing Denormalized Data (in this topic).

### Sequences as Expressions

Sequences may be accessed in queries as expressions of the form `seq_name.NEXTVAL`. Each occurrence of a sequence generates a set of
distinct values. This is different from what many other databases provide, where multiple references to `NEXTVAL` of a sequence return
the same value for each row.

For example, the following query returns distinct values for columns `a` and `b`:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq1;
>
> SELECT seq1.NEXTVAL a, seq1.NEXTVAL b FROM DUAL;
> ```

To return two columns with the same generated sequence value, use nested subqueries and views:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq1;
>
> SELECT seqRef.a a, seqRef.a b FROM (SELECT seq1.NEXTVAL a FROM DUAL) seqRef;
> ```

Nested subqueries generate as many distinct sequence values as rows returned by the subquery (so a sequence reference in a query block
with several joins refers not to any of the joined objects, but the output of the query block). These generated values may not be observed
if the associated rows are later filtered out, or the values may be observed twice (as in the above example) if the sequence column or the
inline view are referred to multiple times.

> **Note:**
>
> For multi-table insert, insert values may be provided both in the VALUES clauses and in the SELECT input:
>
> * VALUES clauses referring to a sequence value aliased from the input SELECT receive the same value.
> * VALUES clauses containing a direct reference to a sequence `NEXTVAL` receive distinct values.
>
> In contrast, Oracle restricts sequence references to VALUES clauses only.

### Sequences as Table Functions

Nested queries with sequence references are often difficult to understand and verbose — any shared reference (where two columns of a row
should receive the same sequence value) requires an additional level of query nesting. To simplify nested-query syntax, Snowflake provides
an additional method to generate sequences using the table function GETNEXTVAL, as in the following example:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq1;
>
> CREATE OR REPLACE TABLE foo (n NUMBER);
>
> INSERT INTO foo VALUES (100), (101), (102);
>
> SELECT n, s.nextval FROM foo, TABLE(GETNEXTVAL(seq1)) s;
> ```

GETNEXTVAL is a special 1-row table function that generates a unique value (and joins this value) to other objects in the SELECT statement.
A call to GETNEXTVAL must be aliased; otherwise, the generated values cannot be referenced. Multiple columns may refer to a generated value
by accessing this alias. The GETNEXTVAL alias contains an attribute also named `NEXTVAL`.

The GETNEXTVAL table function additionally allows precise control over sequence generation when many tables are joined together. The order of
objects in the [FROM](../sql-reference/constructs/from.md) clause determines where values are generated. Sequence values are generated over the
result of joins between all objects listed prior to GETNEXTVAL in the FROM clause. The resulting rows are then joined to the objects to the
right. There is an implicit lateral dependence between GETNEXTVAL and all other objects in the FROM clause. Joins may not reorder around
GETNEXTVAL. This is an exception in SQL, as typically the order of objects does not affect the query semantics.

Consider the following example with tables `t1`, `t2`, `t3`, and `t4`:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq1;
>
> SELECT t1.*, t2.*, t3.*, t4.*, s.NEXTVAL FROM t1, t2, TABLE(GETNEXTVAL(seq1)) s, t3, t4;
> ```

This query will join `t1` to `t2`, generate a unique value of the result, and then join the resulting relation against `t3`
and `t4`. The order of joins between the post-sequence relation, `t3`, and `t4` is not specified because inner joins are
associative.

> **Note:**
>
> These semantics can be tricky. We recommend using GETNEXTVAL at the end of the [FROM](../sql-reference/constructs/from.md) clause, when possible
> and appropriate, to avoid confusion.

## Using Sequences to Create Default Column Values

Sequences can be used in tables to generate primary keys for table columns. The following tools provide a simple way to do this.

### Column Default Expressions

The column default expression can be a sequence reference. Omitting the column in an insert statement or setting the value to DEFAULT in an
insert or update statement will generate a new sequence value for the row.

For example:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq1;
>
> CREATE OR REPLACE TABLE foo (k NUMBER DEFAULT seq1.NEXTVAL, v NUMBER);
>
> -- insert rows with unique keys (generated by seq1) and explicit values
> INSERT INTO foo (v) VALUES (100);
> INSERT INTO foo VALUES (DEFAULT, 101);
>
> -- insert rows with unique keys (generated by seq1) and reused values.
> -- new keys are distinct from preexisting keys.
> INSERT INTO foo (v) SELECT v FROM foo;
>
> -- insert row with explicit values for both columns
> INSERT INTO foo VALUES (1000, 1001);
>
> SELECT * FROM foo;
>
> +------+------+
> |    K |    V |
> |------+------|
> |    1 |  100 |
> |    2 |  101 |
> |    3 |  100 |
> |    4 |  101 |
> | 1000 | 1001 |
> +------+------+
> ```

The advantage of using sequences as a column default value is that the sequence can be referenced in other locations, and even be the default
value for multiple columns and in multiple tables. If a sequence is named as the default expression of a column and then subsequently dropped
any attempt to insert/update the table using the default value will result in an error saying the identifier cannot be found.

## Ingesting and Normalizing Denormalized Data

Consider a schema with two tables, `people` and `contact`:

* The `people` table contains:

  + A primary key unique identifier: `id`
  + Two string columns: `firstName` and `lastName`
* The `contact` table contains:

  + A primary key unique identifier: `id`
  + A foreign key linking this contact entry to a person: `p_id`
  + Two string columns:

    - `c_type`: The type of contact (e.g. ‘email’ or ‘phone’).
    - `data`: The actual contact information.

Data in this format frequently is denormalized for ingestion or while processing semi-structured data.

This example illustrates ingesting JSON data, denormalizing it to extract the desired data, and normalizing the data as it is inserted into
tables. At the same time, it is important to create unique identifiers on rows while maintaining the intended relationships across rows of
tables. We accomplish this with sequences.

1. First, we set up the tables and sequences used in the example:

   ```sqlexample
   -- primary data tables

   CREATE OR REPLACE TABLE people (id number, firstName string, lastName string);
   CREATE OR REPLACE TABLE contact (id number, p_id number, c_type string, data string);

   -- sequences to produce primary keys on our data tables

   CREATE OR REPLACE SEQUENCE people_seq;
   CREATE OR REPLACE SEQUENCE contact_seq;

   -- staging table for json

   CREATE OR REPLACE TABLE input (json variant);
   ```
2. Next, we insert data from table `json`:

   ```sqlexample
   INSERT INTO input SELECT parse_json(
   '[
    {
      firstName : \'John\',
      lastName : \'Doe\',
      contacts : [
        {
          contactType : \'phone\',
          contactData : \'1234567890\',
        }
        ,
        {
          contactType : \'email\',
          contactData : \'jdoe@example.com\',
        }
       ]
      }
   ,
     {
      firstName : \'Mister\',
      lastName : \'Smith\',
      contacts : [
        {
          contactType : \'phone\',
          contactData : \'0987654321\',
        }
        ,
        {
          contactType : \'email\',
          contactData : \'msmith@example.com\',
        }
        ]
      }
    ,
      {
      firstName : \'George\',
      lastName : \'Washington\',
      contacts : [
        {
          contactType : \'phone\',
          contactData : \'1231231234\',
        }
        ,
        {
          contactType : \'email\',
          contactData : \'gwashington@example.com\',
        }
      ]
    }
   ]'
   );
   ```
3. Then, we parse and flatten the JSON, generate unique identifiers for each person and contact entry, and insert the data while preserving
   relationships between people and contact entries:

   ```sqlexample
   INSERT ALL
     WHEN 1=1 THEN
       INTO contact VALUES (c_next, p_next, contact_value:contactType, contact_value:contactData)
     WHEN contact_index = 0 THEN
       INTO people VALUES (p_next, person_value:firstName, person_value:lastName)

   SELECT * FROM
   (
     SELECT f1.value person_value, f2.value contact_value, f2.index contact_index, p_seq.NEXTVAL p_next, c_seq.NEXTVAL c_next
     FROM input, LATERAL FLATTEN(input.json) f1, TABLE(GETNEXTVAL(people_seq)) p_seq,
       LATERAL FLATTEN(f1.value:contacts) f2, table(GETNEXTVAL(contact_seq)) c_seq
   );
   ```
4. This produces the following data (unique IDs may change):

   ```sqlexample
   SELECT * FROM people;

   +----+-----------+------------+
   | ID | FIRSTNAME | LASTNAME   |
   |----+-----------+------------|
   |  1 | John      | Doe        |
   |  2 | Mister    | Smith      |
   |  3 | George    | Washington |
   +----+-----------+------------+

   SELECT * FROM contact;

   +----+------+--------+-------------------------+
   | ID | P_ID | C_TYPE | DATA                    |
   |----+------+--------+-------------------------|
   |  1 |    1 | phone  | 1234567890              |
   |  2 |    1 | email  | jdoe@example.com        |
   |  3 |    2 | phone  | 0987654321              |
   |  4 |    2 | email  | msmith@example.com      |
   |  5 |    3 | phone  | 1231231234              |
   |  6 |    3 | email  | gwashington@example.com |
   +----+------+--------+-------------------------+
   ```

As this example shows, rows are linked and can be joined between `people.id` and `contact.p_id`.

If additional data is added, new rows continue to receive unique IDs. For example:

> ```sqlexample
>  TRUNCATE TABLE input;
>
>  INSERT INTO input SELECT PARSE_JSON(
>  '[
>   {
>     firstName : \'Genghis\',
>     lastName : \'Khan\',
>     contacts : [
>       {
>         contactType : \'phone\',
>         contactData : \'1111111111\',
>       }
>       ,
>       {
>         contactType : \'email\',
>         contactData : \'gkahn@example.com\',
>       }
>    ]
>  }
> ,
>  {
>     firstName : \'Julius\',
>     lastName : \'Caesar\',
>     contacts : [
>       {
>         contactType : \'phone\',
>         contactData : \'2222222222\',
>       }
>       ,
>       {
>         contactType : \'email\',
>         contactData : \'gcaesar@example.com\',
>       }
>     ]
>   }
>  ]'
>  );
>
>  INSERT ALL
>    WHEN 1=1 THEN
>      INTO contact VALUES (c_next, p_next, contact_value:contactType, contact_value:contactData)
>    WHEN contact_index = 0 THEN
>      INTO people VALUES (p_next, person_value:firstName, person_value:lastName)
>  SELECT * FROM
>  (
>    SELECT f1.value person_value, f2.value contact_value, f2.index contact_index, p_seq.NEXTVAL p_next, c_seq.NEXTVAL c_next
>    FROM input, LATERAL FLATTEN(input.json) f1, table(GETNEXTVAL(people_seq)) p_seq,
>      LATERAL FLATTEN(f1.value:contacts) f2, table(GETNEXTVAL(contact_seq)) c_seq
>  );
>
>  SELECT * FROM people;
>
>  +----+-----------+------------+
>  | ID | FIRSTNAME | LASTNAME   |
>  |----+-----------+------------|
>  |  4 | Genghis   | Khan       |
>  |  5 | Julius    | Caesar     |
>  |  1 | John      | Doe        |
>  |  2 | Mister    | Smith      |
>  |  3 | George    | Washington |
>  +----+-----------+------------+
>
>  SELECT * FROM contact;
>
>  +----+------+--------+-------------------------+
>  | ID | P_ID | C_TYPE | DATA                    |
>  |----+------+--------+-------------------------|
>  |  1 |    1 | phone  | 1234567890              |
>  |  2 |    1 | email  | jdoe@example.com        |
>  |  3 |    2 | phone  | 0987654321              |
>  |  4 |    2 | email  | msmith@example.com      |
>  |  5 |    3 | phone  | 1231231234              |
>  |  6 |    3 | email  | gwashington@example.com |
>  |  7 |    4 | phone  | 1111111111              |
>  |  8 |    4 | email  | gkahn@example.com       |
>  |  9 |    5 | phone  | 2222222222              |
>  | 10 |    5 | email  | gcaesar@example.com     |
>  +----+------+--------+-------------------------+
> ```

## Altering a Sequence

### Understanding the Effects of Reversing the Direction of a Sequence

The following example shows what happens when you reverse the direction of a sequence.

This also shows that due to pre-calculation of sequence values, an ALTER SEQUENCE command might seem to take effect only after
the second use of the sequence after executing the ALTER SEQUENCE command.

Create the sequence and use it as the default value for a column in a table:

```sqlexample
CREATE OR REPLACE SEQUENCE test_sequence_wraparound_low
   START = 1
   INCREMENT = 1
   ;

CREATE or replace TABLE test_seq_wrap_low (
    i int,
    j int default test_sequence_wraparound_low.nextval
    );
```

Load the table:

```sqlexample
INSERT INTO test_seq_wrap_low (i) VALUES
     (1),
     (2),
     (3);
```

Show the sequence values in column `j`:

```sqlexample
SELECT * FROM test_seq_wrap_low ORDER BY i;
+---+---+
| I | J |
|---+---|
| 1 | 1 |
| 2 | 2 |
| 3 | 3 |
+---+---+
```

Alter the increment (step size) of the sequence:

```sqlexample
ALTER SEQUENCE test_sequence_wraparound_low SET INCREMENT = -4;
```

Insert two more rows:

```sqlexample
INSERT INTO test_seq_wrap_low (i) VALUES
    (4),
    (5);
```

Show the sequence values. Note that the first row inserted after the ALTER SEQUENCE
has the value `4`, not `-1`. The second row inserted after the ALTER SEQUENCE
does take into account the new step size.

```sqlexample
SELECT * FROM test_seq_wrap_low ORDER BY i;
+---+---+
| I | J |
|---+---|
| 1 | 1 |
| 2 | 2 |
| 3 | 3 |
| 4 | 4 |
| 5 | 0 |
+---+---+
```

---
title: Using session policies
source: https://docs.snowflake.com/en/user-guide/session-policies-using.md
section: User Guide
---

# Using session policies

This topic provides examples on how to use session policies.

## Standard session policy

The following steps are a representative guide to creating a session policy and setting the session policy on an account or user.

These steps assume a centralized management approach in which a custom role named `policy_admin` owns the session policy (i.e. has the
OWNERSHIP privilege on the session policy) and is responsible for setting the session policy on an account or user (i.e. has the APPLY
SESSION POLICY on ACCOUNT privilege or the APPLY SESSION POLICY ON USER privilege).

> **Note:**
>
> To set a policy on an account, the `policy_admin` custom role must have the following permissions:
>
> * USAGE on the database and schema that contain the session policy.
> * CREATE SESSION POLICY on the schema that contains the session policy.

Follow these steps to implement a session policy.

1. Create a custom role that allows users to create and manage session policies. Throughout this example custom role is `policy_admin`,
   although the role could have any appropriate name.

   If the custom role already exists, continue to the next step.

   Otherwise, create the `policy_admin` custom role:

   ```sqlexample
   USE ROLE USERADMIN;

   CREATE ROLE policy_admin;
   ```
2. Grant privileges to the custom role.

   If the `policy_admin` custom role does not already have the following privileges, grant these privileges as shown below:

   * USAGE on the database and schema that will contain the session policy.
   * CREATE SESSION POLICY on the schema that will contain the session policy.
   * APPLY SESSION POLICY on the account.
   * APPLY SESSION POLICY on each user, if you plan to set session policies at the user level.

   ```sqlexample
   USE ROLE SECURITYADMIN;

   GRANT USAGE ON DATABASE mydb TO ROLE policy_admin;

   GRANT USAGE, CREATE SESSION POLICY ON SCHEMA mydb.policies TO ROLE policy_admin;

   GRANT APPLY SESSION POLICY ON ACCOUNT TO ROLE policy_admin;
   ```

   If associating a session policy with an individual user:

   ```sqlexample
   GRANT APPLY SESSION POLICY ON USER jsmith TO ROLE policy_admin;
   ```

   For more information, see [Summary of commands, operations, and privileges](session-policies-managing.md).
3. Create a new session policy.

   ```sqlexample
   USE ROLE policy_admin;

   CREATE SESSION POLICY mydb.policies.session_policy_prod_1
     SESSION_IDLE_TIMEOUT_MINS = 30
     SESSION_UI_IDLE_TIMEOUT_MINS = 30
     COMMENT = 'Session policy for the prod_1 environment';
   ```

   For more information, see [CREATE SESSION POLICY](../sql-reference/sql/create-session-policy.md).
4. Set the session policy the account with the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command, or a user with the
   [ALTER USER](../sql-reference/sql/alter-user.md) command.

   ```sqlexample
   USE ROLE policy_admin;

   ALTER ACCOUNT SET SESSION POLICY mydb.policies.session_policy_prod_1;

   ALTER USER jsmith SET SESSION POLICY my_database.my_schema.session_policy_prod_1;
   ```

   > **Important:**
   > > To replace a session policy that is already set for an account or user, unset the session policy first and then set the new session
   > > policy for the account or user. For example:
   >
   > ```sqlexample
   > ALTER ACCOUNT UNSET session policy;
   >
   > ALTER ACCOUNT SET SESSION POLICY mydb.policies.session_policy_prod_2;
   > ```

## Specifying secondary roles in a session policy

The following sections detail how to specify secondary roles in a session policy:

* Set the property in a session policy
* Unset the property in a session policy
* Disallow secondary roles for all users in the account
* Disallow secondary roles for a specific user
* Allow a user to use specific secondary roles

For more information about secondary roles in a session policy, see [Secondary roles in a session policy](session-policies.md)

### Set the property in a session policy

The security administrator can create a new session policy or modify an existing session policy to set the
`ALLOWED_SECONDARY_ROLES` property. For example:

* Create a new session policy to allow all secondary roles:

  ```sqlexample
  CREATE OR REPLACE SESSION POLICY prod_env_session_policy
    SESSION_IDLE_TIMEOUT_MINS = 30
    SESSION_UI_IDLE_TIMEOUT_MINS = 30
    ALLOWED_SECONDARY_ROLES = ('ALL')
    COMMENT = 'session policy for use in the prod_1 environment';
  ```
* Modify an existing session policy to disallow secondary roles:

  ```sqlexample
  ALTER SESSION POLICY prod_env_session_policy
    SET ALLOWED_SECONDARY_ROLES = ();
  ```

  The [ALTER SESSION POLICY](../sql-reference/sql/alter-session-policy.md) command can modify the property value if the property is already set.

For details about the syntax, see the [Managing session policies](session-policies-managing.md).

You can use the [DESCRIBE SESSION POLICY](../sql-reference/sql/desc-session-policy.md) command or call the [GET_DDL](../sql-reference/functions/get_ddl.md) function to view
the value of the `ALLOWED_SECONDARY_ROLES` property.

### Unset the property in a session policy

You can use an ALTER SESSION POLICY command to unset secondary roles in the session policy:

```sqlexample
ALTER SESSION POLICY prod_env_session_policy
  UNSET ALLOWED_SECONDARY_ROLES;
```

### Disallow secondary roles for all users in the account

To prevent all users in an account from using secondary roles, set a session policy on the account that disallows secondary roles for the
session. For example:

1. Modify a session policy to disallow secondary roles:

   ```sqlexample
   ALTER SESSION POLICY prod_env_session_policy SET ALLOWED_SECONDARY_ROLES = ();
   ```
2. Assign the session policy to the account:

   ```sqlexample
   ALTER ACCOUNT SET SESSION POLICY prod_env_session_policy;
   ```

If a user tries to activate secondary roles with a USE SECONDARY ROLES command, such as `USE SECONDARY ROLES analyst;`, the following
error message occurs:

```output
SQL execution error: USE SECONDARY ROLES '[ANALYST]' not allowed as per session policy.
```

### Disallow secondary roles for a specific user

To disallow a specific user from using secondary roles, set a session policy on the user that disallows secondary roles for the session.
For example, if that session policy already exists:

```sqlexample
ALTER USER jsmith SET SESSION POLICY prod_env_session_policy;
```

If there is a session policy that is set on the account, the session policy assigned to the user overrides the session policy on the
account.

If the user runs a USE SECONDARY ROLES command to activate secondary roles, such as `USE SECONDARY ROLES (ANALYST, DATA_SCIENTIST);`
they will see the following error message:

```output
SQL execution error: USE SECONDARY ROLES '[ANALYST, DATA_SCIENTIST]' not allowed as per session policy.
```

### Allow a user to use specific secondary roles

To enable a user to use specific secondary roles, do the following:

1. Create a session policy that specifies the secondary roles a user can use:

   ```sqlexample
   CREATE OR REPLACE SESSION POLICY prod_env_session_policy
     SESSION_IDLE_TIMEOUT_MINS = 30
     SESSION_UI_IDLE_TIMEOUT_MINS = 30
     ALLOWED_SECONDARY_ROLES = (DATA_SCIENTIST, ANALYST)
     COMMENT = 'session policy for user secondary roles data_scientist and analyst';
   ```
2. Set the session policy on the user:

   ```sqlexample
   ALTER USER bsmith SET SESSION POLICY prod_env_session_policy;
   ```

The user can activate the secondary roles as needed with a USE SECONDARY ROLES command. For example:

* Activate all secondary roles:

  ```sqlexample
  USE SECONDARY ROLES ALL;
  ```
* Activate `DATA_SCIENTIST` as a secondary role:

  ```sqlexample
  USE SECONDARY ROLES DATA_SCIENTIST;
  ```

For details about the syntax, see [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md).

## Replicate the session policy to a target account

You can replicate a session policy and its references, which are the assignments to a user or the account, from the source account to the
target account using database replication and account replication. For details, see:

* [Account replication](account-replication-considerations.md).
* [Database replication](database-replication-considerations.md).

For details about replicating a session policy that specifies secondary roles, see
[replicate session policies with secondary roles](account-replication-considerations.md).

---
title: Using sfsql — Obsoleted
source: https://docs.snowflake.com/en/user-guide/sfsql-use.md
section: User Guide
---

# Using sfsql — *Obsoleted*

This topic describes how to use `sfsql`, including a list of the native Henplus commands that are not supported by the client.

> **Note:**
>
> Some Snowflake SQL commands are implemented through the JDBC driver used by `sfsql`, e.g. [PUT](../sql-reference/sql/put.md)/[GET](../sql-reference/sql/get.md) for uploading files to and downloading files from internal stages.
> As a result, these operations can be performed in `sfsql`, but cannot be performed in the Snowflake web interface.

## Navigating and editing on the command line

`sfsql` supports all standard command-line editing functions, including:

* Up and down keys to access the command history.
* Control and Meta key combinations (e.g. **[CTRL]-a**, **[CTRL]-e**) for navigating and editing text on the command line.

## Setting Parameters

HenPlus provides properties that control session behavior; however, you should not set these properties in `sfsql`. Instead, use the [session parameters](../sql-reference/parameters.md) provided by Snowflake.

In addition, HenPlus provides the following global properties that can be set across all sessions (the property settings are saved when you log out of the session). You can use these global properties to control the formatting and
appearance of your SQL statement results:

* To see the list of global parameters and their current values, type `set-property` at the command line:

  | Property | Initial Value | Description |
  | --- | --- | --- |
  | column-delimiter | | | Specifies the character(s) used to separate/format columns in the display. |
  | comments-remove | off (or false) | Not currently used. |
  | echo-commands | off (or false) | Specifies whether to display a statement before executing it. |
  | sql-result-limit | 1000000000 | Specifies the maximum number of rows returned in the statement results. |
  | sql-result-showfooter | on (or true) | Specifies whether to include a footer row in the results. |
  | sql-result-showheader | on (or true) | Specifies whether to include a header row, including column headings, in the results. |
* To set a global parameter, type `set-property` followed by the parameter name and value.

  For example, to disable headers and footers in the results:

  > ```bash
  > user1@xy12345.snowflakecomputing.com> set-property sql-result-showfooter false
  > user1@xy12345.snowflakecomputing.com> set-property sql-result-showheader false
  > ```

  Note that you do not need to type any closing characters, e.g. semi-colon (`;`), to set the global properties.

## Executing SQL Statements and Script Files

To execute a SQL query or statement:

* Type a semi-colon (`;`) immediately following the end of the statement.
* If you enter a new line after the statement, you must type two semi-colons (`;;`) to execute the statement.
* On a new line, you can also type a forward-slash (`/`) which is the command for ending a statement.

For example, any of the syntax can be used to execute the following query:

> ```bash
> user1@xy12345.snowflakecomputing.com> select * from test1;
>
>
> user1@xy12345.snowflakecomputing.com> select * from test1
>                                       ;;
>
> user1@xy12345.snowflakecomputing.com> select * from test1
>                                       /
> ```

To execute a SQL script file, use `@` or `@@` followed by the directory path and full name of the file (including the file extension, if any).

For example, to execute a file named `query.sql` located in the `/Users/user1/scripts` directory:

> ```bash
> user1@xy12345.snowflakecomputing.com> @/Users/user1/scripts/query.sql
> ```

> **Note:**
>
> HenPlus also allows using the `start` command to execute a file; however, you cannot use this command in `sfsql` to execute a file because Snowflake reserves the START keyword for initiating transactions. For more information, see
> [Transactions](../sql-reference/transactions.md).

## Canceling In-progress Queries

To cancel a query that has not yet completed, use the **[CTRL]-c** keyboard combo.

## Spooling Results

To spool the results of a SQL query or command, type `spool` followed by the directory path and name of the file in which to spool the results.

To stop spooling results, type `spool off`.

## Accessing the Snowflake Command-line Help

Snowflake provides command-line help topics. To access the help, use the following syntax:

```sqlsyntax
info [ <topic> | <subtopic> ]
```

* If no value is specified, all the top-level topics in the help are displayed.
* If a topic is specified, all the subtopics for the topic are displayed.
* If a subtopic is specified, the contents of the subtopic are displayed.

For example:

> ```sqlexample
> info;
>
> info warehouses;
>
> info alter_warehouse;
> ```

## Unsupported HenPlus Commands

HenPlus provides native commands for performing tasks such as describing objects and importing/exporting data from the system. You should not use these commands in `sfsql`. Instead, use the SQL commands provided by Snowflake:

| Unsupported HenPlus commands: | Equivalent Snowflake SQL commands: |
| --- | --- |
| `tables` , `views`, and other related commands | [SHOW <objects>](../sql-reference/sql/show.md) |
| `describe` , `idescribe` | [DESCRIBE <object>](../sql-reference/sql/desc.md) |
| `import` , `import-check`, and other related commands | [COPY INTO <table>](../sql-reference/sql/copy-into-table.md), [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) |
| `dump-out` , `dump-in`, and other related commands | [PUT](../sql-reference/sql/put.md), [GET](../sql-reference/sql/get.md) |

---
title: Using Snowflake Copilot
source: https://docs.snowflake.com/en/user-guide/snowflake-copilot.md
section: User Guide
---

# Using Snowflake Copilot

> **Important:**
>
> Snowflake Copilot is being replaced with Cortex Code. To learn more about Cortex Code, see [Cortex Code in Snowsight](cortex-code/cortex-code-snowsight.md).

This topic provides an introduction to what Snowflake Copilot is and how to use it in your data analysis workflow. This topic gives information about how to use Snowflake Copilot with a chat interface. For information about using Snowflake Copilot inline, see [Using Snowflake Copilot inline](snowflake-copilot-inline.md).
The examples in this topic use worksheets.

> **Note:**
>
> Snowflake Copilot is natively supported in the following regions:
>
> * AWS US West 2 (Oregon)
> * AWS US East 1 (N. Virginia)
> * AWS US East (Commercial Gov - N. Virginia)
> * AWS Europe Central 1 (Frankfurt)
> * AWS AP Northeast 1 (Tokyo)
> * AWS Europe West 1 (Ireland)
> * AWS AP Southeast 2 (Sydney)
> * Azure East US 2 (Virginia)
> * Azure West Europe (Netherlands)
>
> To use Snowflake Copilot in other regions, set the `CORTEX_ENABLED_CROSS_REGION` parameter.
> Within this parameter, you can either:
>
> * Provide a list of values that include at least one of the supported regions.
> * Set it to `ANY_REGION`.
>
> For information about how to use the `CORTEX_ENABLED_CROSS_REGION` parameter, see [How to use the cross-region inference parameter](snowflake-cortex/cross-region-inference.md).

## Introduction

Snowflake Copilot is an LLM-powered assistant that simplifies data analysis while maintaining robust data governance, and seamlessly integrates
into your existing Snowflake workflow.

Snowflake Copilot is powered by a model fine-tuned by Snowflake that runs securely inside [Snowflake Cortex](../guides-overview-ai-features.md),
Snowflake’s intelligent, fully managed AI service. This approach means that your enterprise data and metadata always stay securely inside
Snowflake. Snowflake Copilot also fully respects RBAC and provides suggestions based only on the datasets that you can access.

Snowflake Copilot uses natural language requests to enable data analysis from start to finish. To start, Copilot can help answer questions
about how your data is structured and guide you in exploring a new dataset. You can then ask Copilot to generate and refine SQL queries to
extract useful information from your data. Snowflake Copilot can even help improve your SQL query by recommending optimizations or suggesting
fixes for possible issues.

Snowflake Copilot can also help improve your SQL fluency or understanding of Snowflake features. Ask questions about how to perform a task in
Snowflake and Copilot will return answers based on the Snowflake documentation.

You can interact with Copilot in SQL Worksheets in Snowsight. Using the Copilot panel, you can enter a question,
and Snowflake Copilot will reply with an answer. You can run suggested SQL queries in your worksheet.

## Access control requirements

The COPILOT_USER database role in the SNOWFLAKE database includes the privileges that allow users to use Snowflake Copilot features. By default,
the COPILOT_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted
to all users and roles, so this allows all users in your account to use Snowflake Copilot features.

Users with this privilege will see Ask Copilot in the lower-right corner of their worksheet and can use the panel
to interact with Snowflake Copilot.

## Limit access to Copilot

If you don’t want all users to have access to Copilot, you can revoke the COPILOT_USER database role from the PUBLIC role and grant access to specific roles.

To revoke the COPILOT_USER database role from the PUBLIC role, run the following command using the ACCOUNTADMIN role:

```sqlexample
USE ROLE ACCOUNTADMIN;

REVOKE DATABASE ROLE SNOWFLAKE.COPILOT_USER
  FROM ROLE PUBLIC;
```

A user without this role will not see Ask Copilot in the lower-right corner of their worksheet. You can switch your
active role in the navigation menu on the left to switch to a role that has access to Copilot to see the Ask Copilot menu again.
For details, see [Switch your primary role](ui-snowsight-gs.md).

You can then selectively provide access to specific roles. The SNOWFLAKE.COPILOT_USER database role cannot be granted directly to a user.
For more information, see [Using SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md). A user with the ACCOUNTADMIN role can grant this role to a custom role
in order to allow users to access Snowflake Copilot features. In the following example, use the ACCOUNTADMIN role and grant the user
`some_user` the COPILOT_USER database role via the account role `copilot_access_role`, which you create for this purpose.

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE copilot_access_role;
GRANT DATABASE ROLE SNOWFLAKE.COPILOT_USER TO ROLE copilot_access_role;

GRANT ROLE copilot_access_role TO USER some_user;
```

You can also grant access to Snowflake Copilot through existing roles commonly used by specific groups of
users. (See [User roles](admin-user-management.md).) For example, if you have created an `analyst` role that is used
as a default role by analysts in your organization, you can easily grant these users access to Snowflake Copilot
with a single GRANT statement.

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.COPILOT_USER TO ROLE analyst;
```

### Limit models used by Snowflake Copilot

Role-based access control (RBAC) at the model level lets account administrators control which large language models (LLMs) can be used by Snowflake Copilot and other Cortex features based on user roles. Access to each LLM is controlled using a distinct application role. Organizations with regulatory or compliance restrictions might need to control the use of some LLMs, especially those hosted outside of their cloud boundary.

> **Important:**
>
> Role-based access control (RBAC) at the model level is intended for customers with strict legal or compliance requirements. Enabling this feature might cause Snowflake Copilot and other features that rely on models to stop working because required models are not accessible. Limiting access to a model impacts any Snowflake Cortex feature that relies on that model.
>
> We recommend not using RBAC at the model level unless absolutely necessary.

#### Snowflake Copilot model selection

Snowflake Copilot uses the following logic to select a model to use when model level RBAC is enabled.

1. Snowflake Copilot uses Anthropic Claude Sonnet 3.5 if the user is in AWS US West, AWS US East, or has cross-region inference enabled.
2. If Anthropic Claude Sonnet 3.5 is not available because of RBAC or region availability, Snowflake Copilot uses Mistral Large 2.
3. If Mistral Large 2 is also not available, Snowflake Copilot does not appear in the Snowsight UI and any requests to the Snowflake Copilot API fail.

#### Enable and configure model level RBAC

Complete the following steps to enable model level RBAC and configure the models that Snowflake Copilot can use.

1. Grant model-specific application roles to user roles. The user only then has access to these models when using Snowflake Copilot. Each supported LLM has a corresponding application role. For information about granting these application roles to implement model level RBAC, see [Role-based access control (RBAC)](snowflake-cortex/aisql.md).
2. Snowflake Copilot automatically selects the best available model based on access privileges and regional availability.

#### Risks and limitations

Snowflake Copilot needs access to at least one model to function. Limiting access to specific models eliminates fallback options and increases the risk of partial or total failure. If you remove access to all models using RBAC, Snowflake Copilot might become unusable. Snowflake is not responsible for Snowflake Copilot availability in those circumstances. Limiting access to models also applies to all Cortex features that might use that model.

## Supported use cases

* **Explore your data** by asking open-ended questions to learn about the structure and nuances of a new dataset.
* **Generate SQL queries** with questions in plain English.
* **Try out the SQL query** suggested by Snowflake Copilot with the click of a button. You can also edit the query
  before running it.
* **Build complex queries** through a conversation with Snowflake Copilot by asking follow-up questions to refine the suggested SQL query and
  dig deeper into the analysis.
* **Learn about Snowflake** by asking questions about Snowflake concepts, capabilities, and features.
* **Improve your queries** by asking Snowflake Copilot to help you assess query efficiency, find optimizations, or explain what the query does.
* **Provide feedback** (thumbs up or thumbs down) on each response from Snowflake Copilot, which will be used to improve the product.
* **Add custom instructions** such as a set of preferences or specific business knowledge for Snowflake Copilot to consider
  when generating responses.

## Limitations

* **Support for the following languages:**

  + English
  + French
  + German
  + Spanish
  + Italian
  + Portuguese
  + Arabic
  + Hindi
  + Chinese
  + Japanese
  + Korean
  + SQL
  + Python
  + Java
* **No access to your data:** Snowflake Copilot does not have access to the data inside your tables. If you want to
  filter on a particular value of a column, you should provide that value. For example, if you ask Snowflake Copilot to return all rows with a
  column A value equal to “X”, you should provide the value “X” in your request. See the
  Construct and run a SQL Statement example.
* **Snowflake Copilot does not support cross database or schema queries:** You can work around this by creating and using views that join data from
  different schemas and databases.
* **Delayed response:** Snowflake Copilot might take a second to complete a response, depending on the length of the response provided.
* **SQL suggestions may not always work:** Snowflake Copilot may sometimes suggest queries that contain invalid SQL syntax or non-existent
  tables or columns. Please provide feedback using the thumbs up or thumbs down buttons for the particular response. This feedback helps us
  improve this feature.
* **Delay in detecting new databases, schemas, and tables:** It may take up to 3-4 hours for Snowflake Copilot to recognize newly created
  databases, schemas, and tables.
* **Limited number of tables and columns considered:** To generate a response, Snowflake Copilot first searches for
  tables and columns most relevant for your request. The search results are then ranked by relevancy and only the top 10 tables and top
  10 columns from each of those tables in the results are considered when generating a response.

## How to use Snowflake Copilot

Snowflake Copilot is ready to use with no additional setup. Remember the following points when using Snowflake Copilot:

* Each chat session with Snowflake Copilot is associated with a particular worksheet. Opening a new worksheet opens a
  new chat session.
* You must have a database and schema in use during your session to use Snowflake Copilot. Copilot uses them to generate relevant responses.
* Snowflake Copilot uses the names of your databases, schemas, tables, and columns and also the data types of your columns to determine what
  data is available to query.
* If Snowflake Copilot cannot answer your question based on the selected database and schema, it may try to use other ways to answer, such
  as the Snowflake documentation or general SQL knowledge. If you get an unexpected response, you can leave feedback using the
  thumbs up and thumbs down buttons.
* If you need to refer to a table name or a column name in your question, prefix the name with `@`. Referring to specific tables
  and columns can help Snowflake Copilot provide more accurate responses.
* For optimal performance, use meaningful names for databases, schemas, tables, and columns, and ensure that columns are assigned the
  appropriate data type.

Follow these steps to start using Snowflake Copilot:

1. Create a new worksheet or open an existing worksheet.
2. Select Ask Copilot in the lower-right corner of the worksheet.
   The Snowflake Copilot panel opens on the right side of the worksheet.
3. Make sure a database and a schema are selected for the current worksheet. If not, you can select them by using either
   the selector on the top of the worksheet or the selector below the Snowflake Copilot message box.
4. In the message box, type in your question and then select the send icon or press `Enter` to submit it. Snowflake Copilot provides a
   response in the panel.
5. If the response from Snowflake Copilot includes SQL statements:

   * Select Run to run the query. This adds the query to your worksheet and runs it.
   * Select Add to edit the query before running it. This adds the query to your worksheet.

## Add custom instructions

Snowflake Copilot accepts custom instructions that let you customize how it responds. When enabled,
these instructions are used to enhance the prompt that’s sent to the model behind Snowflake Copilot and are considered by
Copilot when it’s generating new responses. Custom instructions can include directions to use a specific tone or respond in a certain way,
preferences on how to write SQL, or additional information about the data to consider.

Remember the following when adding custom instructions:

* There is a 2,000 character limit for custom instructions.
* Snowflake recommends specifying custom instructions in plain English.
* The instructions are specific to the user that entered them and used for all their conversations with Snowflake Copilot.

Follow these steps to add custom instructions for Snowflake Copilot:

1. Create a new worksheet or open an existing worksheet.
2. Select Ask Copilot in the lower-right corner of the worksheet.
   The Snowflake Copilot panel opens on the right side of the worksheet.
3. Select the Copilot menu at the top of the Snowflake Copilot panel.
4. Select Custom instructions from the drop-down menu.
5. To enable the custom instructions text box, select the Enable for new chats toggle on the bottom left of the custom
   instructions window.
6. Enter your instructions in plain text English.
7. Select Save when finished.
8. Continue your conversation with Snowflake Copilot in the Copilot panel.

## Examples

The following sections provide examples that demonstrate how to:

* Explore your data
* Construct and run SQL statements
* Get an explanation of a SQL statement
* Ask questions about SQL and Snowflake concepts

These examples use a sample dataset from the Snowflake Marketplace.

### Prerequisites

The examples in this section use the [Cybersyn Github Archive dataset](https://app.snowflake.com/marketplace/listing/GZTSZAS2KJ3/cybersyn-inc-cybersyn-github-archive) from the Snowflake Marketplace:

1. Install the [Cybersyn Github Archive dataset](https://app.snowflake.com/marketplace/listing/GZTSZAS2KJ3/cybersyn-inc-cybersyn-github-archive) in your account.
2. Create a new worksheet or open an existing worksheet.
3. Select Ask Copilot in the lower-right corner of the worksheet.
4. Select the Cybersyn Github Archive database and schema.

### Explore your data

The following example demonstrates how to use Snowflake Copilot to explore a dataset.

1. Enter an open-ended question such as “What types of questions can I ask about this dataset?”
2. Press `Enter` and Snowflake Copilot will generate a response based on the database and schema you’ve selected.
3. Ask further clarifying questions about the data, such as “What type of events can I filter by?”
   or “Are any of these tables joinable?”
4. If the response from Snowflake Copilot includes a SQL statement, you can select Add to add the query to the end of your
   worksheet and edit it before running or select Run to add the query and run it automatically.

### Construct and run a SQL Statement

The following example demonstrates how to use Snowflake Copilot to generate SQL queries.

1. Enter the question “How many stars were given in the past year?” in the Snowflake Copilot message box, and press `Enter`. Snowflake Copilot
   responds with a SQL query that answers your question.
2. Select Add to add the query to the end of your worksheet.
3. Enter the question “Show me this for each month,” and press `Enter`. Snowflake Copilot responds with a SQL query
   that answers your question.
4. Select Run to add the query to your worksheet and run the query.

Snowflake Copilot does not have access to the data inside your tables. If you want Snowflake Copilot to construct a
SQL statement that filters based on a specific value of a column, you must provide the value
to filter on.

1. Enter the question “what are all the repo names that start with ‘snowflake’?” in the message box and
   press `Enter`. Snowflake Copilot responds with a SQL query that uses the filter value you provided.
2. Select Add to edit the query before running or select Run to add the query to your worksheet and run it.

### Explain a SQL statement

The following example demonstrates how to use Snowflake Copilot to explain a SQL statement you’re working on.

* In the Snowflake Copilot message box, type the following question and SQL query:

  ```none
  Can you explain this query to me step-by-step?
  ```

  ```sqlexample
  SELECT
    github_repos.repo_name,
    COUNT(github_stars.repo_id) AS total_stars
  FROM
    github_repos
    JOIN github_stars ON github_repos.repo_id = github_stars.repo_id
  GROUP BY
    github_repos.repo_name
  ORDER BY
    total_stars DESC;
  ```

Snowflake Copilot responds with a step-by-step explanation of the provided query.

### Ask questions about SQL and Snowflake

Snowflake Copilot has access to Snowflake documentation and can answer general questions about Snowflake or SQL. Here are some example
questions you can try:

* How do I write a SQL join?
* What is Snowpark Cortex?
* How do I ingest data into Snowflake?

## Tips for using Snowflake Copilot

* Creating curated [views](views-introduction.md) can significantly improve the performance of Snowflake Copilot.

  Follow these guidelines when creating the views:

  | Guideline | Example |
  | --- | --- |
  | Use descriptive and easy-to-understand names for the views and their columns.  When choosing the names, use the business and data taxonomy you are likely to use while using Snowflake Copilot. | If a column contains the date for a specific sale, name the column `sale_date`. |
  | Make sure all columns have the appropriate data type. | If a column contains the date for a specific sale, make sure it has the [DATE](../sql-reference/data-types-datetime.md) type. |
  | Define commonly used metrics/expressions as new columns. | If profit is defined as `revenue - cost`, create a column `(revenue - cost) AS profit` in your view. |
  | If possible, capture common and complex joins. | If two tables `products` and `sales` are often joined, make sure that your view joins these tables.  If there are multiple join paths between commonly joined tables, use the preferred join path in your view. |
* Be as specific as possible when you ask a question. Imagine that you are asking a question to a human who may have limited knowledge
  of your data.
* If you want to filter on specific values inside columns, you might need to actively guide Snowflake Copilot. You can ask Snowflake Copilot for a
  query that returns all the distinct values in a column.

## Costs

Snowflake Copilot is currently free to use. Details on pricing and billing are planned but you will be notified before any charges are applied
for this feature.

## Legal notices

Snowflake Copilot leverages third party models and/or services, as previously described on this page. Where the models and/or services used are
provided on the [Snowflake Model and Service Flowdown Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/open-source-model-flow-down-terms/) page, use of those models and/or services are also subject to those terms.

For additional information, refer to [Snowflake AI and ML](../guides-overview-ai-features.md).

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Usage Data | Generally available functions are Covered AI Features. Preview functions are Preview AI Features. [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../guides-overview-ai-features.md).

---
title: Using Snowflake Copilot inline
source: https://docs.snowflake.com/en/user-guide/snowflake-copilot-inline.md
section: User Guide
---

# Using Snowflake Copilot inline

> **Important:**
>
> Snowflake Copilot inline now uses Cortex Code. To learn more about Cortex Code, see [Cortex Code in Snowsight](cortex-code/cortex-code-snowsight.md).

Snowflake Copilot inline is an expansion of the existing Snowflake Copilot experience that gives you the ability to query Snowflake Copilot from within your SQL code. For information about Snowflake Copilot, see [Using Snowflake Copilot](snowflake-copilot.md).

Snowflake Copilot inline is only supported in Workspaces. For information about Workspaces, see [Workspaces](ui-snowsight/workspaces.md).

> **Note:**
>
> Snowflake Copilot inline is natively supported in the following regions:
>
> * AWS US West 2 (Oregon)
> * AWS US East 1 (N. Virginia)
> * Azure US East 2 (Virginia)
>
> To use Snowflake Copilot in other regions, set the `CORTEX_ENABLED_CROSS_REGION` parameter.
> Within this parameter, you can either:
>
> * Provide a list of values that include at least one of the supported regions.
> * Set it to `ANY_REGION`.
>
> For information about how to use the `CORTEX_ENABLED_CROSS_REGION` parameter, see [How to use the cross-region inference parameter](snowflake-cortex/cross-region-inference.md).

## Access control requirements

The COPILOT_USER database role in the SNOWFLAKE database includes the privileges that allow users to use Snowflake Copilot features. By default,
the COPILOT_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted
to all users and roles, so this allows all users in your account to use Snowflake Copilot features.

In addition to the COPILOT_USER requirement, users must have the CORTEX_USER role. The CORTEX_USER database role in the SNOWFLAKE database includes the privileges that allow users to call Snowflake AI functions. By default, the CORTEX_USER role is granted to the PUBLIC role. The PUBLIC role is automatically granted to all users and roles, so this allows all users in your account to use the Snowflake AI functions.

Snowflake Copilot inline requires that the user has access to the `claude-3.5-sonnet` or `openai-gpt-4.1` model. To ensure all users have access to this model, make sure that `claude-3.5-sonnet` or `openai-gpt-4.1` is included in the model allowlist and is not limited by role-based access control (RBAC). For more information about controlling model access, see [Control model access](snowflake-cortex/aisql.md).

If users have the correct permissions, they see the Snowflake Copilot inline sparkle icon in Workspaces. They can use the inline interface to interact with Snowflake Copilot.

To remove access to Copilot inline, you must revoke access to either CORTEX_USER or COPILOT_USER. If you don’t want all users to have this privilege, you can revoke access to the PUBLIC role and grant access to specific roles. For example, to revoke access from the PUBLIC role, use the following query:

```sqlexample
USE ROLE ACCOUNTADMIN;

REVOKE DATABASE ROLE SNOWFLAKE.COPILOT_USER
  FROM ROLE PUBLIC;

REVOKE DATABASE ROLE SNOWFLAKE.CORTEX_USER
  FROM ROLE PUBLIC;
```

You can then grant access as needed. For information about limiting access to Snowflake Copilot, see [Limit access to Copilot](snowflake-copilot.md).

## Supported use cases

* **Explore your data** by asking open-ended questions to learn about the structure and nuances of a new dataset.
* **Generate SQL queries** with questions in natural language.
* **Improve your queries** by asking Snowflake Copilot to help you assess query efficiency, find optimizations, or explain what the query does.
* **Fix syntax errors** by asking Snowflake Copilot to fix your query.

## Limitations

Snowflake Copilot inline has the following limitations:

* **Support for the following languages:**

  + English
  + French
  + German
  + Spanish
  + Italian
  + Portuguese
  + Arabic
  + Hindi
  + Chinese
  + Japanese
  + Korean
  + SQL

No access to your data
:   Snowflake Copilot does not have access to the data inside your tables. If you want to filter on a particular value of a column, you should provide that value. For example, if you ask Snowflake Copilot to return all rows with a column A value equal to “X”, you should provide the value “X” in your request. For more information, see the Construct and run a SQL Statement example.

Delayed response
:   Snowflake Copilot might take a second to complete a response, depending on the length of the response provided.

SQL suggestions may not always work
:   Snowflake Copilot may sometimes suggest queries that contain invalid SQL syntax or non-existent tables or columns.

Delay in detecting new databases, schemas, and tables
:   It may take up to 3-4 hours for Snowflake Copilot to recognize newly created databases, schemas, and tables.

Limited number of tables and columns considered
:   To generate a response, Snowflake Copilot first searches for tables and columns most relevant for your request. The search results are then ranked by relevancy and only the top 10 tables and top 10 columns from each of those tables in the results are considered when generating a response.

Snowflake Copilot inline does not support feedback
:   You cannot upvote or downvote the suggestions that Snowflake Copilot inline gives you.

## How to use Snowflake Copilot inline

> Snowflake Copilot inline requires no additional setup. Remember the following points when using Snowflake Copilot:

* Each session with Snowflake Copilot inline is associated with a particular file in your Workspace.
* You don’t need to have a database and schema in use during your session to use Snowflake Copilot inline.
* Snowflake Copilot uses the names of your databases, schemas, tables, and columns and also the data types of your columns to determine what
  data is available to query.
* For optimal performance, use meaningful names for databases, schemas, tables, and columns, and ensure that columns are assigned the
  appropriate data type.
* Snowflake Copilot inline considers the following sources, but does not store the data from them:

  + Contents of the current file, including SQL queries and code.
  + Context of the current file, including database, schema, and role.
  + User supplied input.
  + Snowflake documentation or general SQL knowledge.
  + Data from your account.

To begin using Snowflake Copilot inline:

1. Open a Workspace. For information about Workspaces, see [Workspaces](ui-snowsight/workspaces.md).
2. Enter the `CMD+I` shortcut.
3. In the message dialog, enter your request. Then click the send icon to submit it. Snowflake Copilot provides a
   response inline and shows a diff with the existing code.
4. Choose one of the following:

   * Select Accept to accept the suggested changes.
   * Select Reject to reject the suggested changes.
   * Select Close to end the session.

## Add custom instructions

Snowflake Copilot inline does not accept custom instructions to customize how it responds.

## Examples

The following sections provide examples that demonstrate how to:

* Construct and run SQL statements
* Add comments to a SQL statement
* Fix a SQL Statement

These examples use a sample dataset from the Snowflake Marketplace.

### Prerequisites

The examples in this section use the [Cybersyn Github Archive dataset](https://app.snowflake.com/marketplace/listing/GZTSZAS2KJ3/cybersyn-inc-cybersyn-github-archive) from the Snowflake Marketplace:

1. Install the [Cybersyn Github Archive dataset](https://app.snowflake.com/marketplace/listing/GZTSZAS2KJ3/cybersyn-inc-cybersyn-github-archive) in your account.
2. Open a Workspace. For information about Workspaces, see [Workspaces](ui-snowsight/workspaces.md).
3. Select the Cybersyn Github Archive database and schema.

### Construct and run a SQL Statement

The following example demonstrates how to use Snowflake Copilot inline to generate SQL queries.

1. Enter the following question in the Snowflake Copilot inline message box, and select the send icon to submit it. Snowflake Copilot
   responds with a SQL query that answers your question.

   ```none
   How many stars were given in the past year?
   ```
2. Review the changes. Lines highlighted in red are removed and lines highlighted in green are added.
3. Select Accept to accept the suggested changes.

Snowflake Copilot does not have access to the data inside your tables. If you want Snowflake Copilot to construct a
SQL statement that filters based on a specific value of a column, you must provide the value
to filter on.

1. Enter the following question in the message box and select the send icon. Snowflake Copilot inline responds with a SQL query that uses the filter value that you provided.

   ```none
   What are all of the repo names that start with 'snowflake'?
   ```
2. Review the changes. Lines highlighted in red are removed and lines highlighted in green are added.
3. Select Accept to accept the suggested changes.

### Add comments to a SQL statement

The following example demonstrates how to use Snowflake Copilot to add comments to a SQL statement you’re working on.

* In the Snowflake Copilot inline message box, type the following question:

  ```none
  Can you add comments to this query?
  ```

Snowflake Copilot responds by adding a comment that explains the purpose of each line in the provided query.

### Fix a SQL Statement

The following example demonstrates how to use Snowflake Copilot inline from a Workspace to fix a SQL statement.

1. Focus your cursor over the target query with a syntax error.
2. Enter the `CMD+I` shortcut to bring up the Snowflake Copilot inline window.
3. Ask Snowflake Copilot to fix your query.
4. Review the changes. Lines highlighted in red are removed and lines highlighted in green are added.
5. Select Accept to accept the suggested changes.

## Tips for using Snowflake Copilot

For tips about using Snowflake Copilot, see [Tips for using Snowflake Copilot](snowflake-copilot.md).

## Costs

Snowflake Copilot is currently free to use. Details on pricing and billing are planned but you will be notified before any charges are applied
for this feature.

## Legal notices

This feature leverages third party models and/or services, as previously described on this page. Where the models and/or services used are
provided on the [Snowflake Model and Service Flowdown Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/open-source-model-flow-down-terms/) page, use of those models and/or services are also subject to those terms.

For additional information, refer to [Snowflake AI and ML](../guides-overview-ai-features.md).

---
title: Using Snowflake OAuth for local applications
source: https://docs.snowflake.com/en/user-guide/oauth-local-applications.md
section: User Guide
---

# Using Snowflake OAuth for local applications

This topic describes the preferred authentication method for local applications, including desktop applications and local scripts.

[Snowflake OAuth](oauth-snowflake-overview.md) is implemented by creating a *security integration* that defines an interface
between Snowflake as the OAuth authorization server and the application that is authenticating on behalf of a user by using the OAuth
authorization code flow. Snowflake OAuth is a strong authentication option because the application doesn’t have to store or manage secrets,
and you don’t have to configure a third-party identity provider like External OAuth.

To simplify how a local application uses Snowflake OAuth to authenticate, your account has a built-in
security integration called `SNOWFLAKE$LOCAL_APPLICATION`. Because the security integration already exists, if a local application
uses a Snowflake client like the Python driver or Snowflake CLI, the application can authenticate to Snowflake by setting a property or
parameter of the client. No further set up is required. The built-in integration also simplifies the setup for local applications that call
the OAuth endpoints directly rather than use a Snowflake client.

An administrator can change the parameters of the `SNOWFLAKE$LOCAL_APPLICATION` integration to adjust its behavior, such as specifying
how long OAuth access tokens and refresh tokens are valid.

Snowflake OAuth for local applications has the following additional advantages:

* Unlike user-created Snowflake OAuth integrations, in-role session switching *is* supported.
* It is a straightforward replacement for applications that are currently using passwords only to authenticate users. Snowflake is
  [deprecating single-factor passwords](security-mfa-rollout.md), so Snowflake OAuth for local applications provides a path to
  using a more secure form of authentication without requiring a lot of set up.

> **Note:**
>
> The `SNOWFLAKE$LOCAL_APPLICATION` security integration is being rolled out slowly to all accounts. To determine if this built-in
> integration exists in your account, run the following command:
>
> ```sqlexample
> SHOW SECURITY INTEGRATIONS LIKE 'SNOWFLAKE$LOCAL_APPLICATION';
> ```

## Configuring the Snowflake OAuth integration

The built-in `SNOWFLAKE$LOCAL_APPLICATION` security integration is owned by the system but can be configured by security
administrators (that is, users granted the SECURITYADMIN system role).

Security administrators can configure the following parameters of the security integration:

| Parameter | Description |
| --- | --- |
| `ENABLED` | Controls whether the integration is enabled. If the integration is disabled, local applications must use a different authentication method. |
| `OAUTH_ISSUE_REFRESH_TOKENS` | Controls whether the authorization server issues refresh tokens. |
| `OAUTH_REFRESH_TOKEN_VALIDITY` | Sets the validity duration of refresh tokens. |
| `OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED` | Controls whether the authorization server issues single-use refresh tokens. |
| `OAUTH_ACCESS_TOKEN_VALIDITY` | Sets the validity duration of access tokens. |

For example, to modify the built-in security integration so that the authorization server starts issuing single-use refresh tokens, run the
following commands:

```sqlexample
USE ROLE SECURITYADMIN;

ALTER SECURITY INTEGRATION SNOWFLAKE$LOCAL_APPLICATION
  SET OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = TRUE;
```

For more information about setting these parameters, see [ALTER SECURITY INTEGRATION](../sql-reference/sql/alter-security-integration-oauth-snowflake.md).

### Controlling the login frequency

When `OAUTH_ISSUE_REFRESH_TOKENS = TRUE`, applications can use refresh tokens to obtain new access tokens without prompting users to log
in again. Users only need to re-authenticate when the refresh token expires after the duration that is specified by the
`OAUTH_REFRESH_TOKEN_VALIDITY` parameter.

## Setting up a local application to use Snowflake OAuth

This section provides the details a developer needs to configure a local application to authenticate with Snowflake OAuth. The following
types of local applications can authenticate by using the built-in integration:

* A local application that uses a Snowflake client like the Python driver or Snowflake CLI. See Applications that use a Snowflake client.
* A local application that makes REST requests to the OAuth authorization endpoint and token endpoint directly, without the use of a
  Snowflake client. See Applications that call OAuth endpoints directly.

### Applications that use a Snowflake client

When a local application uses a Snowflake client like the Snowflake ODBC driver, it can authenticate with Snowflake OAuth by setting the
`authenticator` connection option to `oauth_authorization_code`. Additional development work isn’t required.

#### Prerequisite

With Snowflake OAuth for local applications, the Snowflake client must be able to open
the user’s web browser. For this reason, both the Snowflake client and the local application that uses it must be installed on the
user’s computer. Snowflake OAuth for local applications doesn’t work if the Snowflake client is used by code that runs on a server.

#### Supported clients

Your local application can use the following Snowflake clients to authenticate with Snowflake OAuth for local applications:

| Client | Minimum required version | Required configuration |
| --- | --- | --- |
| .NET | v4.8.0 | Set `authenticator=oauth_authorization_code` in the connection string. |
| Go | v1.14.1 | Set `authenticator=oauth_authorization_code` in the connection configuration. |
| JDBC | v3.24.1 | Set `authenticator=``oauth_authorization_code` in the connection string for the driver. |
| Node.js | v2.1.0 | Set `authenticator: 'oauth_authorization_code'` in the connection options. |
| ODBC | v3.9.0 | * For Linux and macOS, set `authenticator=oauth_authorization_code` in the `odbc.ini` file. * For Windows, in the ODBC Data Source Administrator tool, edit the DSN for Snowflake and set Authenticator to   `oauth_authorization_code`. |
| Python | v3.16.0 | Pass `AUTHENTICATOR=OAUTH_AUTHORIZATION_CODE` to the `snowflake.connector.connect()` function. |
| Snowflake CLI | v3.8.1 | Add the `authenticator = "OAUTH_AUTHORIZATION_CODE"` option to the connection definition. |
| SnowSQL | v1.4.0 | Add the `authenticator = "OAUTH_AUTHORIZATION_CODE"` parameter in the configuration file. |

### Applications that call OAuth endpoints directly

Your local application can use Snowflake OAuth by making requests to the authorization endpoint and token endpoint of Snowflake as the
authorization server. You don’t need to use a Snowflake client. The application sends a request to Snowflake’s authorization endpoint to
authenticate the user and receive an authorization code, and then sends a request to the token endpoint to exchange that code for an access
token.

For more information about making REST requests to Snowflake’s authorization and token endpoints, see [Call the OAuth endpoints](oauth-custom.md).

#### Request requirements

Your application’s REST requests to the authorization and token endpoints must conform to the following requirements:

* The redirect URL in the request to the authorization endpoint must be `http://127.0.0.1[:port][/path]`. That is, your local application
  must be listening on a loopback address for the authorization code that is returned by Snowflake as the authorization server.
* Requests to the authorization and token endpoints must implement Proof Key for Code Exchange (PKCE). For more information, see
  [Proof key for code exchange](oauth-custom.md).
* When calling the token endpoint to exchange an authorization code for an access token, the application must provide the proper client ID
  and client secret. This requirement varies slightly depending on how you choose to send these client credentials:

  + If you send client credentials in the request header, the client ID must be `LOCAL_APPLICATION` and the client secret must be
    `LOCAL_APPLICATION`.
  + If you send client credentials in the POST body, the client ID must be `LOCAL_APPLICATION`. The built-in integration configures
    the local application as a public client, so the client secret isn’t necessary if you provide the client ID as
    `client_id=LOCAL_APPLICATION` in the POST body.

## Usage notes

Every account has a `SNOWFLAKE$LOCAL_APPLICATION` integration, so this integration isn’t replicated. The
configuration of the built-in integration is unique to each account.

---
title: Using Snowflake with Kafka and Spark
source: https://docs.snowflake.com/en/user-guide/connectors.md
section: User Guide
---

# Using Snowflake with Kafka and Spark

You can connect Snowflake with systems external to it using the connectors described in this section.

For other ways to connect Snowflake with tools and technologies in its ecosystem, see [Snowflake Ecosystem](ecosystem.md).

[Snowflake Connector for Kafka](kafka-connector.md)
:   Read data from one or more Apache Kafka topics and load the data into a Snowflake table.

[Snowflake Connector for Spark](spark-connector.md)
:   Bring Snowflake into the Apache Spark ecosystem, enabling Spark to read data from, and write data to, Snowflake.

---
title: Using Snowsight to create and manage semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/ui.md
section: User Guide
---

# Using Snowsight to create and manage semantic views

In Snowsight, you can create and manage [semantic views](overview.md):

## Creating a semantic view

In Snowsight, you can create a semantic view by using a wizard or by uploading a
[YAML specification](semantic-view-yaml-spec.md).

* Uploading a YAML specification to create a semantic view

> **Note:**
>
> To create a semantic view, you must use a role with the following privileges:
>
> * CREATE SEMANTIC VIEW on the schema where you are creating the semantic view.
> * USAGE on the database and schema where you are creating the semantic view.
> * SELECT on the tables and views used in the semantic view.

### Using the AI-assisted generator to create a semantic view

Use the AI-assisted generator to create a semantic view that combines semantic information from multiple sources. Instead of creating a semantic view manually with your own YAML specification, you can use the model generator within Snowsight to save time. The process of creating a semantic view requires the following information:

* A description with basic information about the view
* Context, such as example SQL queries
* The data source (at least one table or view) that you’re using
* The columns that you’re using

The AI-assisted generator handles inputs in the following ways:

* **Example SQL queries**

  + Validate the list of queries and throw out invalid queries.
  + Extract all tables and columns from the queries and present them for review before adding to the semantic view.
  + Extract relationships from the queries.
  + Add valid queries to the semantic view as verified queries.
* **Table metadata**

  + Extract all table and column descriptions.
  + Add primary and unique keys to the semantic view by analyzing metadata or counting distinct values to determine cardinality and relationship types.
* **Query history**

  + Surface historical SQL queries as suggestions to the semantic view. The AI-assisted generator identifies the most common types of queries that fit within the bounds of the selected tables and columns.
  + Find valid relationships and column types for the semantic view.
  + Cortex Analyst uses the query history accessible by the role used to create the semantic model to generate both relationships and verified query suggestions.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select AI & ML » Cortex Analyst.
3. At the top of the navigation menu, select Create new » Create new Semantic View.
4. Select a location to store the semantic view after creation.
5. Enter a name for the semantic view.
6. For Description, specify information about the semantic view.
7. Select Next.
8. To provide context, add the following information:

   * For SQL Queries, provide example questions and their respective SQL queries that you want to use as part of the view.
9. For Select tables, provide the data source that you’re using to create the semantic view.

   > You must provide at least one table or view. There’s no limit on the tables or views that you can specify, but Snowflake recommends not using more than 10.
10. Select Next.
11. For Select columns, select the columns that you’re using to create the semantic view.

    You can select all the columns or specific columns. For performance reasons, Snowflake recommends not using more than 50 columns.
12. Select whether you want to add sample values from each column to the semantic view. Sample values help improve the accuracy of Cortex Analyst’s results.
13. Select whether you want to add AI-generated descriptions for tables and columns to the semantic view. The AI-generated descriptions are based on the column names and sample values.
14. Select Create and save.
    You can view the progress of the view generation, including details about the steps that the view generator is taking, on the semantic view page. The process can take a few minutes.
15. Optional: To make additional modifications, edit the view either by using Snowsight or by editing the YAML file directly.

Cortex Analyst automatically generates suggestions to improve the semantic view after creation. After the suggestions appear, which might take several minutes, you can review them and apply them to the view as needed.

### Uploading a YAML specification to create a semantic view

1. If you are planning to create the semantic view from Cortex Analyst,
   [create a stage](../data-load-local-file-system-stage-ui.md) for the YAML file.
2. Upload the YAML file in one of the following ways:

   * [Database object explorer](../ui-snowsight-data.md):

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select Catalog » Database Explorer.
     3. Select the database and schema where you want to create the semantic view.
     4. Select Create » Semantic View » Upload YAML file.
     5. Select the YAML file to upload.
     6. Under Select database, schema and stage, select the database, schema, and stage where you want to upload the YAML
        file.
     7. If you want the YAML file uploaded to a specific path in the stage, specify that path.
     8. Select Upload.
   * Cortex Analyst:

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select AI & ML » Cortex Analyst.
     3. Select Create new » Upload YAML file.
     4. Select the YAML file to upload.
     5. Select Convert and save.

## Editing a semantic view

> **Note:**
>
> Editing a semantic view in Snowsight effectively replaces the existing view. To replace an existing semantic view, you
> must use a role that has been granted the following privileges:
>
> * CREATE SEMANTIC VIEW on the schema where you are creating the semantic view.
> * USAGE on the database and schema where you are creating the semantic view.
> * SELECT on the tables and views used in the semantic view.

To edit a semantic view:

1. Access the semantic view in one of the following ways:

   * [Database object explorer](../ui-snowsight-data.md):

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select Catalog » Database Explorer.
     3. Select the database and schema containing the semantic view.
     4. Select Semantic views.
     5. Select the semantic view.
     6. Select the Semantic information tab.
   * Cortex Analyst:

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select AI & ML » Cortex Analyst.
     3. Select the Semantic views tab.
     4. Under Select database to see semantic views, select the database and schema containing the semantic view that you
        want to edit.
     5. Select the semantic view that you want to edit.
2. Make changes to the semantic view. You can make the following types of changes:

   * **To modify the name or description of the semantic view:**

     1. Select Edit next to the name of the semantic view.
     2. Make changes to the name or description.
     3. Select Apply.
   * **To add a new logical table to the semantic view:**

     1. Select + Logical Table in the database object explorer or + in Cortex Analyst.
     2. In the Select a table step in the wizard:

        1. Select the table or view that contains the data that you want to use in your semantic view.
        2. Select Next.
     3. In the Select columns step in the wizard:

        1. Select the columns to include in the view.

           To select all columns in a table or view, select the table or view.
        2. Select Generate logical table.
   * **To make changes to the name, description, synonyms, or primary key of a logical table in the semantic view:**

     1. Select  » Edit Logical Table next to the logical table name in the database object explorer
        or Edit next to the logical table name in Cortex Analyst.
     2. Make your changes to the name, description, synonyms, and primary key.

        If you have not specified the description or synonyms, you can select Generate fields to fill in these fields
        automatically.
     3. Select Save.
   * **To add a fact, dimension, or metric:**

     1. Open the form for adding the new item:

        + In the database object explorer, select , and select Fact, Dimension, or Metric.
        + In Cortex Analyst, select + next to Facts, Dimensions, or Metrics.
     2. Enter information about the new fact, dimension, or metric, and select Add.
   * **To modify or remove a fact, dimension, or metric:**

     1. Select Facts, Dimensions, or Metrics to display the list of facts, dimensions or metrics.
     2. For the fact, dimension, or metric that you want to change:

        + Select Edit to modify the item.
        + Select  » Remove fact, Remove dimension, or Remove metric to remove the item.
   * **To add a relationship:**

     1. Open the form for adding the new item:

        + In the database object explorer, select + Relationship.
        + In Cortex Analyst, select + next to Relationships.
     2. Enter a name for the relationship, select the tables in the relationship, and select the columns to use to join the
        tables.
     3. Select Add.
3. If you plan to use Cortex Analyst with this view, consider the following:

   * Add sample queries to the Verified Queries section. Note that this section is available only in Cortex Analyst.

     + These are example queries that help Cortex Analyst understand how to use the semantic view.
     + Add queries that represent common use cases for your data.
   * Add synonyms for your tables, facts, dimensions, or metrics.

     + These are alternative terms that users might use in queries.
     + Synonyms help Cortex Analyst correctly interpret user questions.
   * Add custom instructions.

     + These provide additional context about how the data should be interpreted.
     + Include business rules or constraints that should be considered.
4. Select Save.

## Granting the privilege to use a semantic view to another role

To grant another role the privileges to view and query a semantic view:

1. Access the semantic view in one of the following ways:

   * [Database object explorer](../ui-snowsight-data.md):

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select Catalog » Database Explorer.
     3. Select the database and schema containing the semantic view.
     4. Select Semantic views.
     5. Select the semantic view.
     6. Select  » Share.
   * Cortex Analyst:

     1. Sign in to [Snowsight](../ui-snowsight-gs.md).
     2. In the navigation menu, select AI & ML » Cortex Analyst.
     3. Select the Semantic views tab.
     4. Select the semantic view.
     5. Select Share.
2. Select the role that should be granted the privileges to view and query the semantic view.
3. Select Done.

This grants the SELECT and REFERENCES privileges on the semantic view to the selected role.

## Querying a semantic view

If you are viewing a semantic view in the database object explorer, you can open a worksheet to construct a query for that view
by selecting  » Query with SQL.

For information on how to construct the query, see [Querying semantic views](querying.md).

## Best practices for creating a semantic view

* **Provide clear descriptions:**

  + Use business terminology in all names and descriptions.
  + Make descriptions detailed enough for non-technical users to understand.
* **Include representative user questions:**

  + Include questions that can help the model generator better understand your intent.
  + Include variations of how questions might be asked.
* **Review generated suggestions carefully:**

  + Make sure the questions are relevant for the use case.
  + Make sure the suggested relationships match your business understanding.
* **Test with real questions:**

  + After creating your semantic view, test it with actual business questions.
  + Refine your semantic view, based on how well the model supports these questions.
* **Iterate on developing the semantic view:**

  + Start with a simple star schema.
  + Start with core tables and metrics, then expand. We suggest three tables to keep things simple.
  + Get feedback from business users, and refine your semantic view.

## Troubleshooting

* If your semantic view is not listed in the list of views, refresh the list of models (not the page itself).
* If errors occur with the relationships in the semantic view, ensure that these relationships match the actual data structure.
* If queries are slow, reduce the number of tables or columns.
* If Cortex Analyst produces unexpected results when using your semantic view, review the facts, dimensions, and metrics in the
  semantic view.

---
title: Using SnowSQL
source: https://docs.snowflake.com/en/user-guide/snowsql-use.md
section: User Guide
---

# Using SnowSQL

This topic describes how to use SnowSQL, including starting/stopping the client, using commands and variables within the client, and other general usage information.

## Executing commands

In a Snowflake session, you can issue commands to take specific actions. All commands in SnowSQL start with an exclamation point (`!`), followed by the command name.

For example:

> ```none
> user#> !help
>
> +------------+-------------------------------------------+-------------+--------------------------------------------------------------------------------------------+
> | Command    | Use                                       | Aliases     | Description                                                                                |
> |------------+-------------------------------------------+-------------+--------------------------------------------------------------------------------------------|
> | !abort     | !abort <query id>                         |             | Abort a query                                                                              |
> | !connect   | !connect <connection_name>                |             | Create a new connection                                                                    |
> | !define    | !define <variable>=<value>                |             | Define a variable as the given value                                                       |
> | !edit      | !edit <query>                             |             | Opens up a text editor. Useful for writing longer queries. Defaults to last query          |
> | !exit      | !exit                                     | !disconnect | Drop the current connection                                                                |
> | !help      | !help                                     | !helps, !h  | Show the client help.                                                                      |
> | !options   | !options                                  | !opts       | Show all options and their values                                                          |
> | !pause     | !pause                                    |             | Pauses running queries.                                                                    |
> | !print     | !print <message>                          |             | Print given text                                                                           |
> | !queries   | !queries help, <filter>=<value>, <filter> |             | Lists queries matching the specified filters. Write <!queries> help for a list of filters. |
> | !quit      | !quit                                     | !q          | Drop all connections and quit SnowSQL                                                      |
> | !rehash    | !rehash                                   |             | Refresh autocompletion                                                                     |
> | !result    | !result <query id>                        |             | See the result of a query                                                                  |
> | !set       | !set <option>=<value>                     |             | Set an option to the given value                                                           |
> | !source    | !source <filename>, <url>                 | !load       | Execute given sql file                                                                     |
> | !spool     | !spool <filename>, off                    |             | Turn on or off writing results to file                                                     |
> | !system    | !system <system command>                  |             | Run a system command in the shell                                                          |
> | !variables | !variables                                | !vars       | Show all variables and their values                                                        |
> +------------+-------------------------------------------+-------------+--------------------------------------------------------------------------------------------+
> ```

For a detailed description of each command, see Commands reference (in this topic).

## Using variables

You can use variables to store and reuse values in a Snowflake session. Variables enable you to use user-defined and database values in queries.

The next sections explain how to define and use variables:

* Defining variables
* Enabling variable substitution
* Substituting variables in a session
* Listing variables
* Using the built-in variables

### Defining variables

You can define variables for SnowSQL in several ways:

* Defining variables before connecting (configuration file)
* Defining variables while connecting (-D or --variable command-line flag)
* Defining variables within a session (!define command)

#### Defining variables before connecting (configuration file)

To define variables before connecting to Snowflake, add the variables in the `config` configuration file:

1. Open the [SnowSQL configuration file](snowsql-config.md) (named `config`) in a text editor. The default
   location of the file is:

   Linux/macOS:
   :   `~/.snowsql/`

   Windows:
   :   `%USERPROFILE%\.snowsql\`

   > **Note:**
   >
   > You can change the default location by specifying the `--config path` command-line flag when starting SnowSQL.

1. In the `[variables]` section, define any variables that you plan to use:

   > ```ini
   > [variables]
   > <variable_name>=<variable_value>
   > ```
   >
   > Where:
   >
   > * `variable_name` is a string of alphanumeric characters (case-insensitive) representing the name of the variable.
   > * `variable_value` is a string representing the value for the variable. If needed, the string can be enclosed by
   >   single or double quotes.

   For example:

   > ```ini
   > [variables]
   > tablename=CENUSTRACKONE
   > ```

#### Defining variables while connecting (`-D` or `--variable` command-line flag)

To define variables while connecting to Snowflake, on the terminal command line, specify the `-D` or `--variable`
command-line flag. For the argument to this flag, specify the variable name and value in the form of
`variable_name=variable_value`.

For example:

Linux/macOS:
:   ```bash
    $ snowsql ... -D tablename=CENUSTRACKONE --variable db_key=$DB_KEY
    ```

Windows:
:   ```bash
    $ snowsql ... -D tablename=CENUSTRACKONE --variable db_key=%DB_KEY%
    ```

In the above example:

* `-D` sets a variable named `tablename` to `CENUSTRACKONE`.
* `--variable` assigns a Snowflake variable named `db_key` to the `DB_KEY` environment variable.

#### Defining variables within a session (`!define` command)

To define a variable after connecting to Snowflake, execute the `!define` command in the session.

For example:

> ```none
> user#> !define tablename=CENUSTRACKONE
> ```

### Enabling variable substitution

To enable SnowSQL to substitute values for the variables, you must set the `variable_substitution` configuration option to
`true` in one of the following ways:

* To set this option before you start SnowSQL, open the [SnowSQL configuration file](snowsql-config.md) in a text
  editor, and set this option in the `[options]` section:

  ```ini
  [options]
  variable_substitution = True
  ```
* To set this option when you start SnowSQL, specify the `-o` command-line flag:

  ```bash
  $ snowsql ... -o variable_substitution=true
  ```
* To set this option when in a SnowSQL session, execute the `!set` command in the session:

  ```none
  user#> !set variable_substitution=true
  ```

  > **Note:**
  >
  > There is currently no option to unset an option value, such as the `variable_substitution` option. If you need
  > to disable variable substitution, execute the command `!set variable_substitution=false`.

### Substituting variables in a session

After you enable variable substitution, you can use variables in SQL
statements.

To use a variable in a statement, use the `&variable_name` syntax. Note that variable names are case-insensitive. For
example:

> ```none
> user#> !define snowshell=bash
>
> user#> !set variable_substitution=true
>
> user#> select '&snowshell';
>
> +--------+
> | 'BASH' |
> |--------|
> | bash   |
> +--------+
> ```

If the `variable_substitution` option is not enabled, no variable substitution occurs. For example:

> ```none
> user#> !define snowshell=bash
>
> user#> !set variable_substitution=false
>
> user#> select '&snowshell';
>
> +--------------+
> | '&SNOWSHELL' |
> |--------------|
> | &snowshell   |
> +--------------+
> ```

If you refer to a variable that has not been defined, SnowSQL displays an error. For example:

> ```none
> select '&z';
>
> Variable z is not defined
> ```

To combine a variable with text, enclose the variable reference in curly braces. For example:

> ```none
> user#> !define snowshell=bash
>
> user#> !set variable_substitution=true
>
> select '&{snowshell}_shell';
>
> +--------------+
> | 'BASH_SHELL' |
> |--------------|
> | bash_shell   |
> +--------------+
> ```

To use an ampersand sign without using substitution, escape the ampersand sign with a second ampersand sign:

> `&&variable`

For example:

> ```none
> user#> !set variable_substitution=true
>
> user#> select '&notsubstitution';
>
> Variable notsubstitution is not defined
>
> user#> select '&&notsubstitution';
>
> +--------------------+
> | '&NOTSUBSTITUTION' |
> |--------------------|
> | &notsubstitution   |
> +--------------------+
> ```

### Listing variables

To list variables, execute the `!variables` or `!vars` command in the session:

> ```none
> user#> !variables
>
> +-----------+-------+
> | Name      | Value |
> |-----------+-------|
> | snowshell | bash  |
> +-----------+-------+
> ```

### Using the built-in variables

SnowSQL includes a set of built-in variables that return metadata about statements executed in the current user session.

Each of these variable names begins with two underscore characters (“__”).

`__rowcount`
:   Returns the number of rows affected by the most recent DML statement executed by the user.

    > ```none
    > user#> insert into a values(1),(2),(3);
    >
    > +-------------------------+
    > | number of rows inserted |
    > |-------------------------|
    > |                       3 |
    > +-------------------------+
    > 3 Row(s) produced. Time Elapsed: 0.950s
    >
    > user#> !set variable_substitution=true
    >
    > user#> select &__rowcount;
    >
    > +---+
    > | 3 |
    > |---|
    > | 3 |
    > +---+
    > ```

`__sfqid`
:   Returns the query ID for the most recent query executed by the user.

    > ```none
    > user#> !set variable_substitution=true
    >
    > user#> select * from a;
    >
    > user#> select '&__sfqid';
    >
    > +----------------------------------------+
    > | 'A5F35B56-49A2-4437-BA0E-998496CE793E' |
    > |----------------------------------------|
    > | a5f35b56-49a2-4437-ba0e-998496ce793e   |
    > +----------------------------------------+
    > ```

## Using auto-complete

Various SQL functions, table names, and variables are stored in SnowSQL and are auto-completed in interactive mode. To select an auto-complete suggestion, press the `Tab` key. To choose a different
suggestion, use the `↑` and `↓` keys to highlight the desired option, and then press `Tab`.

To disable auto-complete interactively, set the `auto_completion` configuration option to `False` in the [configuration file](snowsql-config.md).

## Viewing your command-line history

Your recent command-line history can be recalled by using the `↑` key. Press the key repeatedly to scroll through the buffer.

### History file

The interactive command-line history file is named `history` and is located in `~/.snowsql/history`.

## Running batch scripts

You can run batch scripts in two ways:

* Using connection parameters (while connecting to Snowflake)
* Executing commands (on the command line in the Snowflake session)

### Running while connecting (`-f` connection parameter)

To execute a SQL script while connecting to Snowflake, use the `-f <input_filename>` connection parameter.

An output file for the script can be specified using `-o output_file=<output_filename>`. In addition, you can use `-o quiet=true` to turn off the standard output and
`-o friendly=false` to turn off the startup and exit messages.

For example:

> ```none
> snowsql -a myorganization-myaccount -u jsmith -f /tmp/input_script.sql -o output_file=/tmp/output.csv -o quiet=true -o friendly=false -o header=false -o output_format=csv
> ```

For more information about all connection parameters, see [Connection parameters reference](snowsql-start.md).

### Running in a session (`!source` or `!load` command)

To run a SQL script after connecting to Snowflake, execute the `!source` (or `!load`) command in the session.

For example:

> ```none
> user#> !source example.sql
> ```

## Exporting data

Output query results to a file in a defined format using the following [configuration options](snowsql-config.md):

* `output_format=output_format`
* `output_file=output_filename`

To remove the splash text, header text, timing, and goodbye message from the output, also set the following options:

* `friendly=false`
* `header=false`
* `timing=false`

As with all configuration options, you can set them using any of the following methods:

* In the configuration file (before connecting to Snowflake).
* Using the `-o` or `--options` connection parameter (while connecting to Snowflake).
* Executing the `!set` command (on the command line in the Snowflake session).

Note that consecutive queries are appended to the output file. Alternatively, to redirect query output to a file and overwrite the file with each new statement, use the greater-than sign (`>`) in a
script.

In the following example, SnowSQL connects to an account using a named connection and queries a table. The output is written to a CSV file named `output_file.csv` in the current local directory:

Linux/macOS:

```bash
snowsql -c my_example_connection -d sales_db -s public -q 'select * from mytable limit 10' -o output_format=csv -o header=false -o timing=false -o friendly=false  > output_file.csv
```

Windows:

```bash
snowsql -c my_example_connection -d sales_db -s public -q "select * from mytable limit 10" -o output_format=csv -o header=false -o timing=false -o friendly=false  > output_file.csv
```

## Changing the SnowSQL prompt format

The SnowSQL prompt dynamically displays context information about the current session:

* When you log into Snowflake, the prompt displays your user name, as well as the default warehouse, database, and schema (if defaults have been set).
* If you use a USE command in the session to set or change the warehouse, database, or schema, the prompt changes to reflect the context.

You can control the appearance and structure of the prompt using the `prompt_format` configuration option and a Pygments token in brackets for each object type, in the form of `[token]`
(e.g. `[user]` or `[warehouse]`).

The token affects the prompt going forward. You can change the order and color for each token, as well as the delimiters between tokens.

As with all configuration options, you can set the prompt using any of the following methods:

* In the configuration file (before connecting to Snowflake).
* Using the `-o` or `--options` connection parameter (while connecting to Snowflake).
* Executing the `!set` command (on the command line in the Snowflake session).

> **Note:**
>
> If you change the prompt using the connection parameter or directly on the command line, the change applies to the current session only. To persist the change in future sessions, set the option in
> the configuration file.

### Supported tokens

SnowSQL supports the following object types as tokens:

* `user`
* `account`
* `role`
* `database`
* `schema`
* `warehouse`

### Default prompt

The SnowSQL default prompt uses the following tokens and structure:

> ```bash
> [user]#[warehouse]@[database].[schema]>
> ```

For example:

> ```bash
> jdoe#DATALOAD@BOOKS_DB.PUBLIC>
> ```

### Prompt example

Continuing the example above, the following `!set` command executed in the command line adds the role token and changes the token order
to `user` and `role`, `database` and `schema`, then `warehouse`. It
also changes the delimiter for each token to a period (`.`) and sets the tokens to use different colors:

> ```bash
> jdoe#DATALOAD@BOOKS_DB.PUBLIC> !set prompt_format=[#FF0000][user].[role].[#00FF00][database].[schema].[#0000FF][warehouse]>
> ```

This example results in the following prompt for the session:

## Disconnecting from Snowflake and stopping SnowSQL

SnowSQL provides separate commands for:

* Exiting individual connections (i.e. sessions) without stopping SnowSQL.
* Quitting SnowSQL, which also automatically terminates all connections.

To exit a connection/session, use the `!exit` command (or its alias, `!disconnect`). You can then connect again using `!connect <connection_name>` if you can defined multiple
connections in the SnowSQL `config` file. Note that, if you only have one connection open, the `!exit` command also quits/stops SnowSQL.

To exit all connections and then quit/stop SnowSQL, use the `!quit` command (or its alias, `!q`). You can also type
`CTRL` + `d` on your keyboard.

### Exit codes

There are several possible exit codes that are returned when SnowSQL quits/exits:

`0`:
:   Everything ran smoothly.

`1`:
:   Something went wrong with the client.

`2`:
:   Something went wrong with the command-line arguments.

`3`:
:   SnowSQL could not contact the server.

`4`:
:   SnowSQL could not communicate properly with the server.

`5`:
:   The `exit_on_error` configuration option was set and SnowSQL exited because of an error.

## Default key bindings

`Tab`
:   Accept the current auto-complete suggestion.

`CTRL` + `d`
:   Quit/stop SnowSQL.

## Commands reference

### `!abort`

Aborts a query (specified by query ID). The query ID can be obtained from the History  page in the web interface.

For example:

> ```none
> user#> !abort 77589bd1-bcbf-4ec8-9ebc-6c949b00614d;
> ```

### `!connect`

SnowSQL supports multiple sessions (i.e. connections) with `!connect <connection_name>`:

* The connection parameters/options associated with `connection_name` are stored in the corresponding `[connections.<connection_name>]` section in the SnowSQL configuration file.
* If a parameter/option is not specified in the `[connections.<connection_name>]` section, the unspecified parameter will default to the parameters under `[connections]`.

When connecting, the connection is added to your connection stack, and exiting will return you to your previous connection. Quitting will exit all of your connections and quit, no matter how many connections you have.

For example:

Configuration file:

> ```bash
> [connections.my_example_connection]
> ...
> ```

Command line:

> ```none
> user#> !connect my_example_connection
> ```

### `!define`

Sets a variable to a specified value, using the following format:

> `!define <variable_name>=<variable_value>`

The name and value must be separated by a single `=` with no spaces. Valid characters that can be used in the variable are:

> `0-9a-zA-Z_`

For more information on defining and using variables, see Using variables.

### `!edit`

Opens up the editor that was set using the `editor` connection parameter (if no editor was set, the default is `vim`). The command accepts a query as an argument. If no argument is passed,
it opens up the last query that was run.

> **Note:**
>
> You must save before or while exiting the editor, or else any text that was entered/modified in the editor will not be saved.

### `!exit` , `!disconnect`

Drops the current connection and quits SnowSQL if it is the last connection.

### `!help` , `!helps` , `!h`

Displays the help for SnowSQL commands.

### `!options` , `!opts`

Returns a list of all the SnowSQL [configuration options](snowsql-config.md) and their currently-set values. These options can be set using the `!set` command in the current SnowSQL
session.

> **Note:**
>
> These options can also be set in the [configuration file](snowsql-config.md) for SnowSQL or as [connector parameters](snowsql-start.md) in the command line when
> invoking SnowSQL.

### `!pause`

Pause running queries. Press the return key to continue.

### `!print`

Prints the specified text to the screen and any files you are currently spooling to.

For example:

> ```none
> user#> !print Include This Text
> ```

### `!queries`

Lists all queries that match the specified filters. The default filters are `session` and `amount=25`, which return the 25 most recent queries in the current session.

For example:

* Return the 25 most recent queries that ran in this current session:

  ```none
  !queries session
  ```
* Return the 20 most recent queries run in the account:

  ```none
  !queries amount=20
  ```
* Return the 20 most recent queries run in the account that took more than 200 milliseconds to run:

  ```none
  !queries amount=20 duration=200
  ```
* Return the 25 most recent queries that ran in the specified warehouse:

  ```none
  !queries warehouse=mywh
  ```

This command creates a variable for each query ID returned. Note that variable substitution must be enabled for you to use these variables. For example:

> ```none
> user#> !queries session
>
> +-----+--------------------------------------+----------+-----------+----------+
> | VAR | QUERY ID                             | SQL TEXT | STATUS    | DURATION |
> |-----+--------------------------------------+----------+-----------+----------|
> | &0  | acbd6778-c68c-4e79-a977-510b2d8c08f1 | select 1 | SUCCEEDED |       19 |
> +-----+--------------------------------------+----------+-----------+----------+
>
> user#> !result &0
>
> +---+
> | 1 |
> |---|
> | 1 |
> +---+
>
> user#> !result acbd6778-c68c-4e79-a977-510b2d8c08f1
>
> +---+
> | 1 |
> |---|
> | 1 |
> +---+
> ```

### `!quit` , `!q` (also `CTRL` + `d`)

Drops all connections and exits SnowSQL.

### `!rehash`

Re-syncs the auto-complete tokens.

Normal use does not require re-syncing the auto-complete tokens. However, forcing an update to the server-side tokens could be useful in certain scenarios (e.g. if a new user-defined function is
created in a different session).

### `!result`

Returns the result of a completed query (specified by query ID). Query IDs can be obtained from the History  page in the web interface or using the `!queries` command.

If the query is still running, the command waits until the query completes.

For example:

> ```none
> user#> !result 77589bd1-bcbf-4ec8-9ebc-6c949b00614d;
> ```

### `!set`

Sets the specified SnowSQL [configuration option](snowsql-config.md) to a given value using the form `<option>=<value>`.

Note that there is no option currently to unset an option value. To change the value for an option, run the `!set` command again with the desired value.

For example:

> ```none
> user#> !options
>
> +-----------------------+-------------------+
> | Name                  | Value             |
> |-----------------------+-------------------|
>  ...
> | rowset_size           | 1000              |
>  ...
> +-----------------------+-------------------+
>
> user#> !set rowset_size=500
>
> user#> !options
>
> +-----------------------+-------------------+
> | Name                  | Value             |
> |-----------------------+-------------------|
>  ...
> | rowset_size           | 500               |
>  ...
> +-----------------------+-------------------+
>
> user#> !set rowset_size=1000
>
> user#> !options
>
> +-----------------------+-------------------+
> | Name                  | Value             |
> |-----------------------+-------------------|
>  ...
> | rowset_size           | 1000              |
>  ...
> +-----------------------+-------------------+
> ```

> **Important:**
>
> Spaces are not allowed between an option and its value. Some options support a defined set of values; SnowSQL returns an error if the provided value is unsupported. You cannot create new options.
>
> For a list of all the configuration options you can set, use the `!options` command.

### `!source` , `!load`

Executes SQL from a file. You can SQL from local files or a URL.

For example:

> ```none
> user#> !source example.sql
>
> user#> !load /tmp/scripts/example.sql
>
> user#> !load http://www.example.com/sql_text.sql
> ```

### `!spool`

This command can be executed in two ways:

* Enable spooling and write the results of all subsequent statements/queries to the specified file:

  > `!spool <file_name>`
* Turn off results spooling (if it is enabled):

  > `!spool off`

For example:

> ```none
> user#> select 1 num;
>
> +-----+
> | NUM |
> |-----|
> |   1 |
> +-----+
>
> user#> !spool /tmp/spool_example
>
> user#> select 2 num;
>
> +---+
> | 2 |
> |---|
> | 2 |
> +---+
>
> user#> !spool off
>
> user#> select 3 num;
>
> +---+
> | 3 |
> |---|
> | 3 |
> +---+
>
> user#> !exit
>
> Goodbye!
>
> $ cat /tmp/spool_example
>
> +---+
> | 2 |
> |---|
> | 2 |
> +---+
> ```

You can change the output format by first running the `!set output_format=<format>` command. The option supports the following values:

* `expanded`
* `fancy_grid`
* `grid`
* `html`
* `latex`
* `latex_booktabs`
* `mediawiki`
* `orgtbl`
* `pipe`
* `plain`
* `psql`
* `rst`
* `simple`
* `tsv`

Recommended value: `psql`, `fancy_grid`, or `grid`

For example, to output in CSV format:

> ```none
> user#> !set output_format=csv
>
> user#> !spool /tmp/spool_example
> ```

### `!system`

Executes a shell command.

> `!system <command>`

The following example runs the `ls` command in the user’s home directory:

> ```none
> user#> !system ls ~
> ```

### `!variables` , `!vars`

Lists all current variables. Returns each `<variable_name>=<variable_value>` pair currently defined.

Once a variable is assigned, it cannot be deleted, but its value can be removed by specifying the variable name with no value. For example:

> ```bash
> user#> !set variable_substitution=true
>
> user#> !define SnowAlpha=_ALPHA_
>
> user#> !variables
>
> +------------------+---------+
> | Name             | Value   |
> |------------------+---------|
> | snowalpha        | _ALPHA_ |
> +------------------+---------+
>
> user#> !define SnowAlpha
>
> user#> !variables
>
> +------------------+-------+
> | Name             | Value |
> |------------------+-------|
> | snowalpha        |       |
> +------------------+-------+
>
> user#> !define snowalpha=456
>
> user#> select &snowalpha;
>
> +-----+
> | 456 |
> |-----|
> | 456 |
> +-----+
> ```

For more information about setting variables, see Using variables (in this topic).

## Troubleshooting

### Error Message: `Variable is not defined`

Cause:
:   If you see this error message when running commands in SnowSQL, the cause might be an ampersand (`&`) inside a
    [CREATE FUNCTION](../sql-reference/sql/create-function.md) command. (The ampersand is the SnowSQL variable substitution character.) For
    example, executing the following in SnowSQL causes this error:

    ```javascript
    create function mask_bits(...)
        ...
        as
        $$
        var masked = (x & y);
        ...
        $$;
    ```

    The error occurs when the function is created, not when the function is called.

Solution:
:   If you do not intend to use variable substitution in SnowSQL, you can explicitly disable variable substitution by executing the
    following command:

    ```sqlexample
    !set variable_substitution=false;
    ```

    For more information about variable substitution, see Using variables.

---
title: Using SQL commands to create and manage semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/sql.md
section: User Guide
---

# Using SQL commands to create and manage semantic views

This topic explains how to use the following SQL commands to create and manage [semantic views](overview.md):

* [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md)
* [ALTER SEMANTIC VIEW](../../sql-reference/sql/alter-semantic-view.md)
* [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md)
* [DROP SEMANTIC VIEW](../../sql-reference/sql/drop-semantic-view.md)
* [SHOW SEMANTIC VIEWS](../../sql-reference/sql/show-semantic-views.md)
* [SHOW SEMANTIC DIMENSIONS](../../sql-reference/sql/show-semantic-dimensions.md)
* [SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md)
* [SHOW SEMANTIC FACTS](../../sql-reference/sql/show-semantic-facts.md)
* [SHOW SEMANTIC METRICS](../../sql-reference/sql/show-semantic-metrics.md)

This topic also explains how to call the following stored procedure and function to create a semantic view from a
[YAML specification](semantic-view-yaml-spec.md) and get the specification for a semantic view:

* [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md)
* [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_read_yaml_from_semantic_view.md)

## Privileges required to create or replace a semantic view

To create or replace a semantic view, you must use a role with the following privileges:

* CREATE SEMANTIC VIEW on the schema where you are creating the semantic view.
* USAGE on the database and schema where you are creating the semantic view.
* SELECT on the tables and views used in the semantic view.

For information about the privileges required to query a semantic view, see [Privileges required to query a semantic view](querying.md).

## Creating a semantic view by using the CREATE SEMANTIC VIEW command

To create a semantic view, use the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command.

> **Note:**
>
> To create a semantic view from a [YAML specification](semantic-view-yaml-spec.md), call the
> [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) stored procedure.

The semantic view must be valid. See [How Snowflake validates semantic views](validation-rules.md).

The following example uses the [TPC-H sample data](../sample-data-tpch.md) available in Snowflake. This data set
contains tables that represent a simplified business scenario with customers, orders, and line items.

The example creates a semantic view named `tpch_rev_analysis`, using the tables in the TPC-H data set. The semantic view
defines:

* Three logical tables (`orders`, `customers`, and `line_items`).
* A relationship between the `orders` and `customers` tables.
* A relationship between the `line_items` and `orders` tables.
* Facts that will be used to calculate metrics.
* Dimensions for the customer name, the order date, and the year in which the order was placed.
* Metrics for the average value of an order and the average number of line items in an order.

```sqlexample
CREATE SEMANTIC VIEW tpch_rev_analysis

  TABLES (
    orders AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
      PRIMARY KEY (o_orderkey)
      WITH SYNONYMS ('sales orders')
      COMMENT = 'All orders table for the sales domain',
    customers AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
      PRIMARY KEY (c_custkey)
      COMMENT = 'Main table for customer data',
    line_items AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.LINEITEM
      PRIMARY KEY (l_orderkey, l_linenumber)
      COMMENT = 'Line items in orders'
  )

  RELATIONSHIPS (
    orders_to_customers AS
      orders (o_custkey) REFERENCES customers,
    line_item_to_orders AS
      line_items (l_orderkey) REFERENCES orders
  )

  FACTS (
    line_items.line_item_id AS CONCAT(l_orderkey, '-', l_linenumber),
    orders.count_line_items AS COUNT(line_items.line_item_id),
    line_items.discounted_price AS l_extendedprice * (1 - l_discount)
      COMMENT = 'Extended price after discount'
  )

  DIMENSIONS (
    customers.customer_name AS customers.c_name
      WITH SYNONYMS = ('customer name')
      COMMENT = 'Name of the customer',
    orders.order_date AS o_orderdate
      COMMENT = 'Date when the order was placed',
    orders.order_year AS YEAR(o_orderdate)
      COMMENT = 'Year when the order was placed'
  )

  METRICS (
    customers.customer_count AS COUNT(c_custkey)
      COMMENT = 'Count of number of customers',
    orders.order_average_value AS AVG(orders.o_totalprice)
      COMMENT = 'Average order value across all orders',
    orders.average_line_items_per_order AS AVG(orders.count_line_items)
      COMMENT = 'Average number of line items per order'
  )

  COMMENT = 'Semantic view for revenue analysis';
```

The next sections explain this example in more detail:

> **Note:**
>
> For a full example, see [Example of using SQL to create a semantic view](example.md).

### Defining the logical tables

In the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command, use the TABLES clause to define the logical tables in the view.
In this clause, you can:

* Specify the physical table name and an optional alias.
* Identify the following columns in the logical table:

  + Columns that serve as primary keys.
  + Columns that contain unique values (other than the primary key columns).

  You can use these columns to define relationships in this semantic view.
* Add synonyms for the table (for enhanced discoverability).
* Include a descriptive comment.

In the example presented earlier, the TABLES clause defines three logical tables:

* An `orders` table containing the order information from the TPC-H `orders` table.
* A `customers` table containing the customer information from the TPC-H `customers` table.
* A `line_items` table containing the line items in orders from the TPC-H `lineitem` table.

The example uses the PRIMARY KEY clause to identify the columns to be used as primary keys for each logical table. Primary keys
and unique values help determine the types of relationships between the tables
(for example, many-to-one or one-to-one).

The example also provides synonyms and comments that describe the logical tables and make the data easier to discover.

```sqlexample
TABLES (
  orders AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
    PRIMARY KEY (o_orderkey)
    WITH SYNONYMS ('sales orders')
    COMMENT = 'All orders table for the sales domain',
  customers AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
    PRIMARY KEY (c_custkey)
    COMMENT = 'Main table for customer data',
  line_items AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.LINEITEM
    PRIMARY KEY (l_orderkey, l_linenumber)
    COMMENT = 'Line items in orders'
)
```

### Identifying the relationships between logical tables

In the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command, use the RELATIONSHIPS clause to identify the relationships between the tables in the
view. For each relationship, you specify:

* An optional name for the relationship.
* The name of the logical table containing the foreign key.
* The columns in that table that define the foreign key.
* The name of the logical table containing the primary key or columns with unique values.
* The columns in that table that define the primary key or that contain unique values.

  + If you already specified PRIMARY KEY for the logical table in the TABLES clause, you don’t need to specify the primary key
    column in the relationship.
  + If there is a single UNIQUE keyword for the logical table in the TABLES clause, you don’t need to specify the corresponding
    columns in the relationship.

  You can also specify a date, time, timestamp, or numeric column, if you want to
  join the columns on a range.

In the example presented earlier, the RELATIONSHIPS clause specifies two
relationships:

* A relationship between the `orders` and `customers` tables. In the `orders` table, `o_custkey` is the foreign key that
  refers to the primary key in the `customers` table (`c_custkey`).
* A relationship between the `line_items` and `orders` tables. In the `line_items` table, `l_orderkey` is the foreign key
  that refers to the primary key in the `orders` table (`o_orderkey`).

```sqlexample
RELATIONSHIPS (
  orders_to_customers AS
    orders (o_custkey) REFERENCES customers (c_custkey),
  line_item_to_orders AS
    line_items (l_orderkey) REFERENCES orders (o_orderkey)
)
```

### Using a date, time, timestamp, or numeric range to join logical tables

By default, when you specify a relationship between two logical tables, the tables are joined on an equality condition.

If you need to join two logical tables on a date, time, timestamp, or numeric range (where the values in a column of one table
need to be in the same range as the values in a column of another table), you can specify the ASOF keyword with the column name
in the REFERENCES clause:

```sqlexample
RELATIONSHIPS(
  my_relationship AS
    logical_table_1(
      col_table_1
    )
    REFERENCES
    logical_table_2(
      ASOF col_table_2
    )
)
```

A query of the semantic view defined above produces an [ASOF JOIN](../../sql-reference/constructs/asof-join.md) that uses the
`>=` comparison operator in the MATCH_CONDITION clause. This joins the two tables so that the values in `col_table_1` are
greater than or equal to the values in `col_table_2`:

```sqlexample
...
FROM logical_table_1 ASOF JOIN logical_table_2
  MATCH_CONDITION(
    logical_table_1.col_table_1 >= logical_table_2.col_table_2
  )
...
```

> **Note:**
>
> No other comparison operator in MATCH_CONDITION clause is supported.

You can use the ASOF keyword for columns of
[the same types that you can use with ASOF JOIN](../../sql-reference/constructs/asof-join.md).

> **Note:**
>
> You can specify at most one ASOF keyword in the definition of a given relationship. You can specify this keyword before any
> column in the list.

For example, suppose that you have tables containing customer, customer address, and order data:

```sqlexample
CREATE OR REPLACE TABLE customer(
  c_cust_id VARCHAR,
  c_first_name VARCHAR,
  c_last_name VARCHAR);

INSERT INTO customer VALUES
  ('cust001', 'Mary', 'Smith'),
  ('cust002', 'Bill', 'Wilson');

CREATE OR REPLACE TABLE customer_address(
  ca_cust_id VARCHAR,
  ca_zipcode VARCHAR,
  ca_street_addr VARCHAR,
  ca_start_date DATE,
  ca_end_date DATE
);

INSERT INTO customer_address VALUES
  ('cust001', '94025', '100 Main Street', '2024-01-01', '2024-03-31'),
  ('cust001', '94026', '200 Main Street', '2024-04-01', '2024-06-30'),
  ('cust001', '94027', '300 Main Street', '2024-07-01', NULL),
  ('cust002', '94028', '400 Main Street', '2024-01-01', '2024-04-30'),
  ('cust002', '94029', '500 Main Street', '2024-05-01', '2024-07-31'),
  ('cust002', '94030', '600 Main Street', '2024-08-01', NULL);

CREATE OR REPLACE TABLE orders(
  o_ord_id VARCHAR,
  o_cust_id VARCHAR,
  o_ord_date DATE,
  o_amount NUMBER
);

INSERT INTO orders VALUES
  ('ord100', 'cust001', '2024-02-01', 100),
  ('ord101', 'cust001', '2024-02-02', 200),
  ('ord102', 'cust001', '2024-05-01', 300),
  ('ord103', 'cust001', '2024-05-02', 400),
  ('ord104', 'cust001', '2024-08-01', 500),
  ('ord105', 'cust001', '2024-08-02', 600),
  ('ord106', 'cust002', '2024-03-01', 100),
  ('ord107', 'cust002', '2024-03-02', 200),
  ('ord108', 'cust002', '2024-06-01', 300),
  ('ord109', 'cust002', '2024-06-02', 400),
  ('ord110', 'cust002', '2024-09-01', 500),
  ('ord111', 'cust002', '2024-09-02', 600);
```

In this example, the `customer_address` table has a `ca_start_date` column, which indicates when the customer started residing
at the specified address. The `orders` table has a `o_ord_date` column, which is the date of the order.

Suppose that you want to be able to query information about customer orders and retrieve the zip codes corresponding to where the
customer resided when the orders were placed.

You can define a semantic view that specifies an ASOF join between the `ca_start_date` and `o_ord_date` columns:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW customer_orders_view
  TABLES (
    customer_address UNIQUE (ca_cust_id, ca_start_date),
    customer UNIQUE (c_cust_id),
    orders UNIQUE (o_ord_id)
  )
  RELATIONSHIPS (
    customer_address(ca_cust_id) REFERENCES customer,
    -- Defines an ASOF JOIN on the date columns.
    orders(o_cust_id, o_ord_date)
      REFERENCES
        customer_address(ca_cust_id, ASOF ca_start_date)
  )
  FACTS (
    customer_address.f_zipcode AS ca_zipcode
  )
  DIMENSIONS (
    -- Relies on the ASOF join to retrieve the zip code
    -- where the order date is greater than or equal to
    -- the address starting date.
    orders.f_cust_zipcode AS customer_address.f_zipcode,
    orders.dim_year_month AS DATE_TRUNC('month', o_ord_date)
  )
  METRICS (
    orders.m_order_amount AS SUM(o_amount)
  );
```

Suppose that you [query this semantic view](querying.md) to return the sum of the order amounts per month for each zip code:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
  customer_orders_view
  DIMENSIONS orders.dim_year_month, orders.f_cust_zipcode
  METRICS orders.m_order_amount
);
```

```output
+----------------+----------------+----------------+
| DIM_YEAR_MONTH | F_CUST_ZIPCODE | M_ORDER_AMOUNT |
|----------------+----------------+----------------|
| 2024-02-01     | 94025          |            300 |
| 2024-05-01     | 94026          |            700 |
| 2024-08-01     | 94027          |           1100 |
| 2024-03-01     | 94028          |            300 |
| 2024-09-01     | 94030          |           1100 |
| 2024-06-01     | 94029          |            700 |
+----------------+----------------+----------------+
```

The query effectively uses an ASOF JOIN to join the tables on the date columns, where the order date is greater than or equal to
the address starting date:

```sqlexample
...
FROM orders ASOF JOIN customer_address
  MATCH_CONDITION(
    orders.o_ord_date >= customer_address.ca_start_date
  )
  ON
    orders.o_cust_id = customer_address.ca_cust_id
...
```

### Joining logical tables that contain ranges of values

You can use a *range join* when you want to join a table with another table that defines a range of possible values in the
first table. For example, suppose that one table represents sales orders and has a column with the timestamp when the order
was placed. Suppose that another table represents fiscal quarters and contains the distinct ranges of time that represent
these quarters. You can create a semantic view that joins the two tables so that the row for an order includes the fiscal
quarter in which the order was placed.

In the table that contains the ranges, each range must be distinct. No two ranges can overlap.

In the table data, if you want to specify the lowest possible value for the range or the highest possible value for the range,
use NULL.

For example, the following table defines a set of ranges of times that do not overlap:

* The first row covers the range that includes everything up to (but not including) January 1, 2024.
* The last row covers the range that includes everything from March 20, 2024, onwards.

```output
+----------------+------------------+-------------------------+-------------------------+
| TIME_PERIOD_ID | TIME_PERIOD_NAME | START_TIME              | END_TIME                |
|----------------+------------------+-------------------------+-------------------------|
|              1 | Before_January   | NULL                    | 2024-01-01 00:00:00.000 |
|              2 | Early_January    | 2024-01-01 00:00:00.000 | 2024-01-15 00:00:00.000 |
|              3 | Late_January     | 2024-01-15 00:00:00.000 | 2024-02-01 00:00:00.000 |
|              4 | Early_February   | 2024-02-01 00:00:00.000 | 2024-02-15 00:00:00.000 |
|              5 | Late_February    | 2024-02-15 00:00:00.000 | 2024-03-01 00:00:00.000 |
|              6 | Early_March      | 2024-03-01 00:00:00.000 | 2024-03-20 00:00:00.000 |
|              7 | After_March20    | 2024-03-20 00:00:00.000 | NULL                    |
+----------------+------------------+-------------------------+-------------------------+
```

> **Note:**
>
> No two rows can contain NULL in the start column, and no two rows can contain NULL in the end column.

For cases like these, you can set up a [semantic view](overview.md) that supports range-join
queries. When you create the semantic view, you must do the following:

1. For the logical table containing the start and end times of a time period,
   define a constraint that specifies that no two ranges can overlap.

   In the TABLE clause of the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command, specify the CONSTRAINT clause in the logical
   table definition. For the syntax, see the
   [documentation for CONSTRAINT in the CREATE SEMANTIC VIEW topic](../../sql-reference/sql/create-semantic-view.md).
2. Define a relationship between the column containing the timestamp in one table
   and the start and end time columns in the other table.

   In the RELATIONSHIPS clause of the CREATE SEMANTIC VIEW command, use the BETWEEN clause to specify the columns containing the
   start and end times. For the syntax, see the
   [documentation for RELATIONSHIP in the CREATE SEMANTIC VIEW topic](../../sql-reference/sql/create-semantic-view.md).

For example, suppose that the `my_time_periods` table defines distinct periods of time:

```sqlexample
CREATE OR REPLACE TABLE my_time_periods (
  time_period_id INT PRIMARY KEY,
  time_period_name VARCHAR(50),
  start_time TIMESTAMP,
  end_time TIMESTAMP
);
```

```sqlexample
INSERT INTO my_time_periods (
    time_period_id, time_period_name, start_time, end_time
  ) VALUES
    (1, 'Before_January', NULL, '2024-01-01 00:00:00'::TIMESTAMP),
    (2, 'Early_January', '2024-01-01 00:00:00'::TIMESTAMP, '2024-01-15 00:00:00'::TIMESTAMP),
    (3, 'Late_January', '2024-01-15 00:00:00'::TIMESTAMP, '2024-02-01 00:00:00'::TIMESTAMP),
    (4, 'Early_February', '2024-02-01 00:00:00'::TIMESTAMP, '2024-02-15 00:00:00'::TIMESTAMP),
    (5, 'Late_February', '2024-02-15 00:00:00'::TIMESTAMP, '2024-03-01 00:00:00'::TIMESTAMP),
    (6, 'Early_March', '2024-03-01 00:00:00'::TIMESTAMP, '2024-03-20 00:00:00'::TIMESTAMP),
    (7, 'After_March20', '2024-03-20 00:00:00'::TIMESTAMP, NULL);
```

Suppose that the `my_events` table captures events that occurred within those periods of time:

```sqlexample
CREATE OR REPLACE TABLE my_events (
  event_id INTEGER PRIMARY KEY,
  event_timestamp TIMESTAMP,
  event_name VARCHAR
);
```

```sqlexample
INSERT INTO my_events (event_id, event_name, event_timestamp) VALUES
  (1, 'Login', '2024-01-15 10:00:00'::TIMESTAMP),
  (2, 'Purchase', '2024-01-15 14:30:00'::TIMESTAMP),
  (3, 'Logout', '2024-01-15 18:45:00'::TIMESTAMP),
  (4, 'Review', '2024-02-10 12:00:00'::TIMESTAMP),
  (5, 'Support', '2024-02-20 09:30:00'::TIMESTAMP),
  (6, 'Upgrade', '2024-03-05 16:00:00'::TIMESTAMP),
  (7, 'Feedback', '2024-03-25 11:00:00'::TIMESTAMP);
```

You can define a semantic view that joins the tables. Rows in `my_events` are joined with rows in `my_time_periods`,
where the value in the `event_timestamp` column in `my_events` is within the range specified by the `start_time` and
`end_time` columns in `my_time_periods`.

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW my_semantic_view_range_join
  TABLES (
    my_events PRIMARY KEY (event_id),
    my_time_periods UNIQUE (start_time, end_time)
      CONSTRAINT my_time_period_range DISTINCT RANGE BETWEEN start_time AND end_time EXCLUSIVE
  )
  RELATIONSHIPS (
    my_time_periods_for_events AS
      my_events(event_timestamp) REFERENCES
        my_time_periods(BETWEEN start_time AND end_time EXCLUSIVE)
  )
  DIMENSIONS (
    my_events.dim_event_name AS event_name,
    my_events.dim_event_timestamp AS event_timestamp,
    my_time_periods.dim_time_period_name AS time_period_name
  )
  METRICS (
    my_events.m_event_count AS COUNT(*)
  );
```

The following query demonstrates how the rows are joined:

```sqlexample
SELECT
    sv.dim_event_name,
    sv.dim_event_timestamp,
    sv.dim_time_period_name,
    sv.m_event_count
  FROM SEMANTIC_VIEW(
    my_semantic_view_range_join
    METRICS my_events.m_event_count
    DIMENSIONS
      my_events.dim_event_name,
      my_events.dim_event_timestamp,
      my_time_periods.dim_time_period_name
  ) AS sv
  ORDER BY
    sv.dim_event_timestamp,
    sv.dim_time_period_name;
```

```output
+----------------+-------------------------+----------------------+---------------+
| DIM_EVENT_NAME | DIM_EVENT_TIMESTAMP     | DIM_TIME_PERIOD_NAME | M_EVENT_COUNT |
|----------------+-------------------------+----------------------+---------------|
| Login          | 2024-01-15 10:00:00.000 | Late_January         |             1 |
| Purchase       | 2024-01-15 14:30:00.000 | Late_January         |             1 |
| Logout         | 2024-01-15 18:45:00.000 | Late_January         |             1 |
| Review         | 2024-02-10 12:00:00.000 | Early_February       |             1 |
| Support        | 2024-02-20 09:30:00.000 | Late_February        |             1 |
| Upgrade        | 2024-03-05 16:00:00.000 | Early_March          |             1 |
| Feedback       | 2024-03-25 11:00:00.000 | After_March20        |             1 |
+----------------+-------------------------+----------------------+---------------+
```

As shown in the examples, the `dim_time_period_name` dimension for each row in the results is the name of the time period that
the `dim_event_timestamp` dimension falls into.

### Defining facts, dimensions, and metrics

In the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command, use the FACTS, DIMENSIONS, and METRICS clauses to define the facts, dimensions,
and metrics in the semantic view.

You must define at least one dimension or metric in the semantic view.

For each fact, dimension, or metric, you specify:

* The logical table it belongs to.

  > **Note:**
  >
  > If you want to define a derived metric (a metric that is not specific to one logical table), you must omit the logical table
  > name. See Defining derived metrics.
* A name for the fact, dimension, or metric.
* The SQL expression to calculate it.

  > **Note:**
  >
  > For dimensions, you can specify a
  > [Cortex Search Service](../snowflake-cortex/cortex-search/cortex-search-overview.md) to use for the dimension. For
  > information, see Defining a dimension that uses a Cortex Search Service.
* Optional synonyms and comments.

> **Note:**
>
> If a metric should not be aggregated across specific dimensions, you should specify that those dimensions should be
> *non-additive*.
>
> For information, see Identifying the dimensions that should be non-additive for a metric.

The example presented earlier defines several facts, dimensions, and metrics:

```sqlexample
FACTS (
  line_items.line_item_id AS CONCAT(l_orderkey, '-', l_linenumber),
  orders.count_line_items AS COUNT(line_items.line_item_id),
  line_items.discounted_price AS l_extendedprice * (1 - l_discount)
    COMMENT = 'Extended price after discount'
)

DIMENSIONS (
  customers.customer_name AS customers.c_name
    WITH SYNONYMS = ('customer name')
    COMMENT = 'Name of the customer',
  orders.order_date AS o_orderdate
    COMMENT = 'Date when the order was placed',
  orders.order_year AS YEAR(o_orderdate)
    COMMENT = 'Year when the order was placed'
)

METRICS (
  customers.customer_count AS COUNT(c_custkey)
    COMMENT = 'Count of number of customers',
  orders.order_average_value AS AVG(orders.o_totalprice)
    COMMENT = 'Average order value across all orders',
  orders.average_line_items_per_order AS AVG(orders.count_line_items)
    COMMENT = 'Average number of line items per order'
)
```

> **Note:**
>
> For additional guidelines on defining metrics that use window functions, see [Defining and querying window function metrics](querying.md).

### Defining a dimension that uses a Cortex Search Service

To define a dimension that uses a
[Cortex Search Service](../snowflake-cortex/cortex-search/cortex-search-overview.md), set the
WITH CORTEX SEARCH SERVICE clause to the name of the Cortex Search Service. If the service is in a different database or schema,
[qualify the name of the service](../../sql-reference/name-resolution.md). For example:

```sqlexample
DIMENSIONS (
  my_table.my_dimension AS my_dimension_expression
    WITH CORTEX SEARCH SERVICE my_db.my_schema.my_dimension_search_service
)
```

### Defining derived metrics

When you define a metric, you specify the name of the logical table that the metric belongs to. This is the logical table on which
the metric is aggregated.

If you want to define a metric based on metrics from different logical tables, you can define a *derived metric*. A derived metric
is a metric that is scoped to the semantic view (rather than to a specific logical table). A derived metric can combine metrics
from multiple logical tables.

In the definition of a derived metric, omit the logical table name.

For example, suppose that you want to define a metric `my_derived_metric_1` that is the sum of the metrics `table_1.metric_1`
and `table_2.metric_2`. When you define `my_derived_metric_1`, don’t qualify the name with any logical table name:

```sqlexample
CREATE SEMANTIC VIEW sv_with_derived_metrics
  TABLES (
    table_1 PRIMARY KEY (column_1),
    table_2 PRIMARY KEY (column_2)
  )
  ...
  METRICS (
    table_1.metric_1 AS SUM(...),
    table_2.metric_2 AS SUM(...),
    my_derived_metric_1 AS table_1.metric_1 + table_2.metric_2
  )
 ...
```

You can use other derived metrics in the expression. For example:

```sqlexample
METRICS (
  ...
  my_derived_metric_1 AS table_1.metric_1 + table_2.metric_2,
  my_view_metric_2 AS my_derived_metric_1 + table_3.metric_3
)
```

Note the following restrictions when you define a derived metric:

* You cannot use the same name for a derived metric and a regular metric.
* The expression for a derived metric can use:

  + Aggregations of dimensions and facts defined in any logical table in the semantic view.
  + Scalar expressions of metrics defined in any logical table in the semantic view.
  + Other derived metrics.

  In the following example:

  + `derived_metric_1` uses a scalar expression with two metrics.
  + `derived_metric_2` uses an aggregation of a dimension.
  + `derived_metric_3` adds an aggregation of a dimension to another derived metric.

  ```sqlexample
  CREATE OR REPLACE SEMANTIC VIEW sv_derived_metrics
    TABLES (t1)
    DIMENSIONS (t1.dim1 AS t1.col1)
    METRICS (
      t1.m1 AS SUM(t1.col1),
      t2.m2 AS SUM(t1.col2),
      derived_metric_1 AS t1.m1 + t2.m2,
      derived_metric_2 AS SUM(t1.dim1),
      derived_metric_3 AS SUM(t1.dim1) + derived_metric_2
    )
    ...
  ```
* You don’t need to qualify the name of a metric, dimension, or fact in the expression if the name is not ambiguous. For example:

  ```sqlexample
  METRICS (
    table_1.metric_1 AS ...,
    table_1.my_unique_metric_name AS ...,
    table_2.metric_1 AS ...,
    my_derived_metric_1 AS table_1.metric_1 + my_unique_metric_name
  )
  ```

  Note that `metric_1` needs to be qualified by `table_1` because there are two metrics named `metric_1`, but
  `my_unique_metric_name` does not need to be qualified because the name is unique.
* In the expression for a derived metric, you cannot use the following:

  + Aggregations of metrics.
  + Window functions.
  + References to physical columns.
  + References to facts or dimensions that are not aggregated.
* You cannot use a derived metric in the expression for a regular metric, dimension, or fact. Only another derived metric
  can use a derived metric in its expression.

### Specifying the relationship for a metric when multiple relationship paths exist

In some cases, you multiple relationship paths might exist between two specific logical tables in a semantic view. In these cases,
when you define a metric, you must specify the relationship path to use.

#### The problem with multiple relationship paths

Suppose that you have two tables that contain information about flights and airports:

```sqlexample
CREATE OR REPLACE TABLE airports (
  airport_code VARCHAR PRIMARY KEY,
  city_name VARCHAR,
  airport_region_code VARCHAR
);

INSERT INTO airports VALUES
  ('SEA', 'Seattle', 'NA'),
  ('SFO', 'San Fransico', 'NA'),
  ('PVG', 'Shanghai', 'AS');

SELECT * FROM airports;
```

```output
+--------------+--------------+---------------------+
| AIRPORT_CODE | CITY_NAME    | AIRPORT_REGION_CODE |
|--------------+--------------+---------------------|
| SEA          | Seattle      | NA                  |
| SFO          | San Fransico | NA                  |
| PVG          | Shanghai     | AS                  |
+--------------+--------------+---------------------+
```

```sqlexample
CREATE OR REPLACE TABLE flights (
  flight_id INTEGER PRIMARY KEY,
  departure_airport VARCHAR,
  arrival_airport VARCHAR,
  is_late BOOLEAN,
  aircraft_id INTEGER,
  departure_time DATETIME,
  arrival_time DATETIME
);

INSERT INTO flights VALUES
  (1, 'SFO', 'SEA', true, 1, '2025-01-03 06:00:00', '2025-01-03 11:00:00'),
  (2, 'SEA', 'SFO', false, 2, '2025-01-03 11:00:00', '2025-01-03 16:00:00'),
  (3, 'SEA', 'PVG', false, 3, '2025-01-03 11:00:00', '2025-01-04 11:00:00'),
  (4, 'SFO', 'PVG', true, 1, '2025-01-03 06:00:00', '2025-01-04 11:00:00');

SELECT * FROM flights;
```

```output
+-----------+-------------------+-----------------+---------+-------------+-------------------------+-------------------------+
| FLIGHT_ID | DEPARTURE_AIRPORT | ARRIVAL_AIRPORT | IS_LATE | AIRCRAFT_ID | DEPARTURE_TIME          | ARRIVAL_TIME            |
|-----------+-------------------+-----------------+---------+-------------+-------------------------+-------------------------|
|         1 | SFO               | SEA             | True    |           1 | 2025-01-03 06:00:00.000 | 2025-01-03 11:00:00.000 |
|         2 | SEA               | SFO             | False   |           2 | 2025-01-03 11:00:00.000 | 2025-01-03 16:00:00.000 |
|         3 | SEA               | PVG             | False   |           3 | 2025-01-03 11:00:00.000 | 2025-01-04 11:00:00.000 |
|         4 | SFO               | PVG             | True    |           1 | 2025-01-03 06:00:00.000 | 2025-01-04 11:00:00.000 |
+-----------+-------------------+-----------------+---------+-------------+-------------------------+-------------------------+
```

Suppose that you define a semantic view that provides information about the total number of flights departing from and arriving
to a specific city:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW flights_sv
  TABLES (
    flights PRIMARY KEY (flight_id),
    airports PRIMARY KEY (airport_code)
  ) RELATIONSHIPS (
    flight_departure_airport AS flights(departure_airport) REFERENCES airports(airport_code),
    flight_arrival_airport AS flights(arrival_airport) REFERENCES airports(airport_code)
  ) DIMENSIONS (
    airports.city_name AS city_name
  ) METRICS (
    flights.m_flight_count AS COUNT(flight_id)
  );
```

The semantic view specifies two different relationships between the `flights` table and the `airports` table
(`flight_departure_airport` and `flight_arrival_airport`). Because there are multiple relationship paths between the tables,
querying for the `m_flight_count` metric and selecting the `airports.city_name` dimension (or any dimension in the
`airports` table) fails:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  flights_sv
  METRICS flights.m_flight_count
  DIMENSIONS airports.city_name
);
```

```output
010246 (42601): SQL compilation error:
Invalid dimension specified: Multi-path relationship between the dimension entity 'AIRPORTS'
  and the base metric or dimension entity 'FLIGHTS' is not supported.
```

Because there are multiple paths between the `flights` and `airports` tables, the query fails. If the query did not select a
dimension from the `airports` table, the query would have succeeded.

#### Specifying the relationship to use

In the metric definition in the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command, you can specify which relationship to use
in the USING clause:

```sqlsyntax
METRICS (
  <table_alias>.<metric>
    [ USING ( <relationship_name> [ , ... ] )
    AS <sql_expr>
  [ , ... ]
)
```

> **Note:**
>
> * Each relationship that you specify must start from the logical table containing the metric. For example, suppose that you want
>   to specify:
>
>   ```sqlexample
>   METRICS (
>     table_a.metric_a
>       USING ( table_a_to_table_b )
>       ...
>   ```
>
>   The relationship `table_a_to_table_b` must start from `table_a`:
>
>   ```sqlexample
>   RELATIONSHIPS (
>     table_a_to_table_b AS table_a(col_1) REFERENCES table_b(col_1)
>     ...
>   ```
> * You cannot specify a sequence of relationships (for example, `table_a_to_table_b` and `table_b_to_table_c`). Each
>   relationship must start from the logical table containing the metric.
> * If you need to identify the relationships from the logical table containing the metric to different tables, you can specify
>   the relationships in the USING clause. For example, suppose that you want the metric to be computed by specific relationships
>   from `table_a` to `table_b` and from `table_a` to `table_c`. In this case, you specify both relationships in the USING
>   clause:
>
>   ```sqlexample
>   METRICS (
>     table_a.metric_a
>       USING ( table_a_to_table_b, table_a_to_table_c )
>       ...
>   ```
> * You cannot specify the USING clause in a derived metric.

For example, the following statement defines two additional metrics that use specific relationships:

* `m_flight_departure_count`, which uses the `flight_departure_airport` relationship.
* `m_flight_arrival_count`, which uses the `flight_arrival_airport` relationship.

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW flights_sv
  TABLES (
    flights PRIMARY KEY (flight_id),
    airports PRIMARY KEY (airport_code)
  ) RELATIONSHIPS (
    flight_departure_airport AS flights(departure_airport) REFERENCES airports(airport_code),
    flight_arrival_airport AS flights(arrival_airport) REFERENCES airports(airport_code)
  ) DIMENSIONS (
    airports.city_name AS city_name
  ) METRICS (
    flights.m_flight_count AS COUNT(flight_id),
    flights.m_flight_departure_count USING (flight_departure_airport) AS flights.m_flight_count,
    flights.m_flight_arrival_count USING (flight_arrival_airport) AS flights.m_flight_count
  );
```

When querying this view, you can specify the two new metrics that use specific relationships:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  flights_sv
  METRICS flights.m_flight_arrival_count, flights.m_flight_departure_count
  DIMENSIONS airports.city_name
);
```

```output
+------------------------+--------------------------+--------------+
| M_FLIGHT_ARRIVAL_COUNT | M_FLIGHT_DEPARTURE_COUNT | CITY_NAME    |
|------------------------+--------------------------+--------------|
|                      1 |                        2 | San Fransico |
|                      1 |                        2 | Seattle      |
|                      2 |                     NULL | Shanghai     |
+------------------------+--------------------------+--------------+
```

#### Add dimensions that rely on the same relationships

The query in the previous example used the `airports.city_name` dimension, which is in the `airports` logical table that the
relationships are based on.

If you add a dimension for a different logical table to the view, queries of that dimension benefit from the relationships that
you specified earlier.

For example, suppose that you create a table named `regions` with additional information about the airport regions specified in
the `airport_region_code` column of the `airports` table:

```sqlexample
CREATE OR REPLACE TABLE regions (
  region_code VARCHAR PRIMARY KEY,
  region_name VARCHAR
);

INSERT INTO regions VALUES
  ('NA', 'North America'),
  ('AS', 'Asia');

SELECT * FROM regions;
```

```output
+-------------+---------------+
| REGION_CODE | REGION_NAME   |
|-------------+---------------|
| NA          | North America |
| AS          | Asia          |
+-------------+---------------+
```

You can extend the semantic view that you defined earlier to return the region name:

* Add a new logical table for the `regions` table.
* Add a relationship between the `regions` and `airports` tables.
* Add a dimension for the region name.

You don’t need to make any additional changes to the USING clause for the metrics because there’s a single relationship between
the `regions` and `airports` tables.

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW flights_by_regions_sv
  TABLES (
    flights PRIMARY KEY (flight_id),
    airports PRIMARY KEY (airport_code),
    regions PRIMARY KEY (region_code)
  ) RELATIONSHIPS (
    flight_departure_airport AS flights(departure_airport) REFERENCES airports(airport_code),
    flight_arrival_airport AS flights(arrival_airport) REFERENCES airports(airport_code),
    airport_region AS airports(airport_region_code) REFERENCES regions(region_code)
  ) DIMENSIONS (
    airports.city_name AS city_name,
    regions.region_name AS region_name
  ) METRICS (
    flights.m_flight_count AS COUNT(flight_id),
    flights.m_flight_departure_count USING (flight_departure_airport) AS flights.m_flight_count,
    flights.m_flight_arrival_count USING (flight_arrival_airport) AS flights.m_flight_count
  );
```

If you query the view, specifying the `region_name` dimension, and there is ambiguity about which relationship to use, the USING
clause determines the relationships to use:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  flights_by_regions_sv
  METRICS flights.m_flight_arrival_count, flights.m_flight_departure_count
  DIMENSIONS regions.region_name
);
```

```output
+------------------------+--------------------------+---------------+
| M_FLIGHT_ARRIVAL_COUNT | M_FLIGHT_DEPARTURE_COUNT | REGION_NAME   |
|------------------------+--------------------------+---------------|
|                      2 |                        4 | North America |
|                      2 |                     NULL | Asia          |
+------------------------+--------------------------+---------------+
```

#### Specify relationships to different tables

If the semantic view uses dimensions from multiple tables, and you need to specify the relationships to use for these dimensions,
you can specify multiple relationships in the USING clause.

For example, suppose that you create a table named `weather` with weather information about the airports in the `airports`
table:

```sqlexample
CREATE OR REPLACE TABLE weather (
  airport_code VARCHAR PRIMARY KEY,
  weather_condition VARCHAR,
  start_date DATETIME,
  end_date DATETIME
);

INSERT INTO weather VALUES
  ('SEA', 'rainy', '2025-01-01 10:00:00', '2025-01-01 12:00:00'),
  ('SEA', 'rainy', '2025-01-03 10:00:00', '2025-01-03 12:00:00'),
  ('SFO', 'sunny', '2025-01-03 05:00:00', '2025-01-03 09:00:00'),
  ('SFO', 'sunny', '2025-01-03 10:00:00', '2025-01-03 18:00:00'),
  ('PVG', 'cloudy', '2025-01-04 10:00:00', '2025-01-04 12:00:00');

SELECT * FROM weather;
```

```output
+--------------+-------------------+-------------------------+-------------------------+
| AIRPORT_CODE | WEATHER_CONDITION | START_DATE              | END_DATE                |
|--------------+-------------------+-------------------------+-------------------------|
| SEA          | rainy             | 2025-01-01 10:00:00.000 | 2025-01-01 12:00:00.000 |
| SEA          | rainy             | 2025-01-03 10:00:00.000 | 2025-01-03 12:00:00.000 |
| SFO          | sunny             | 2025-01-03 05:00:00.000 | 2025-01-03 09:00:00.000 |
| SFO          | sunny             | 2025-01-03 10:00:00.000 | 2025-01-03 18:00:00.000 |
| PVG          | cloudy            | 2025-01-04 10:00:00.000 | 2025-01-04 12:00:00.000 |
+--------------+-------------------+-------------------------+-------------------------+
```

You can extend the semantic view that you defined earlier to return the weather condition:

* Add a new logical table for the `weather` table.
* Add two relationships between the `weather` and `flights` tables (one for departing flights and one for arrriving flights).
* Add a dimension for the weather information.
* Specify that the metrics should also use the two new relationships between the `weather` and `flights` tables.

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW flights_and_weather_sv
  TABLES (
    flights PRIMARY KEY (flight_id),
    airports PRIMARY KEY (airport_code),
    weather PRIMARY KEY (airport_code, start_date, end_date)
  ) RELATIONSHIPS (
    flight_departure_airport AS flights(departure_airport) REFERENCES airports(airport_code),
    flight_arrival_airport AS flights(arrival_airport) REFERENCES airports(airport_code),
    flight_departure_weather AS flights(departure_airport, departure_time) REFERENCES weather(airport_code, BETWEEN start_date AND end_date EXCLUSIVE),
    flight_arrival_weather AS flights(arrival_airport, arrival_time) REFERENCES weather(airport_code, BETWEEN start_date AND end_date EXCLUSIVE)
  ) DIMENSIONS (
    airports.city_name AS city_name,
    weather.weather_condition AS weather_condition
  ) METRICS (
    flights.m_flight_count AS COUNT(flight_id),
    flights.m_flight_departure_count USING (flight_departure_airport, flight_departure_weather) AS flights.m_flight_count,
    flights.m_flight_arrival_count USING (flight_arrival_airport, flight_arrival_weather) AS flights.m_flight_count
  );
```

When you query the view and specify the `weather_condition` dimension, the USING clause determines the relationships that are
used:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  flights_by_regions_sv
  METRICS flights.m_flight_arrival_count, flights.m_flight_departure_count
  DIMENSIONS weather.weather_condition
);
```

```output
+------------------------+--------------------------+-------------------+
| M_FLIGHT_ARRIVAL_COUNT | M_FLIGHT_DEPARTURE_COUNT | WEATHER_CONDITION |
|------------------------+--------------------------+-------------------|
|                      2 |                     NULL | cloudy            |
|                      1 |                        2 | sunny             |
|                      1 |                        2 | rainy             |
+------------------------+--------------------------+-------------------+
```

#### Define derived metrics based on metrics that use specific relationships

Although you cannot specify the USING clause in a derived metric, you can
define a derived metric that uses metrics that specify the USING clause.

For example, the following semantic view defines two derived metrics:

* `global_m_departure_arrival_ratio`
* `global_m_departure_arrival_sum`

The definitions of these derived metrics use the `flights.m_flight_departure_count` and `flights.m_flight_arrival_count`
metrics, which both specify the USING clause:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW flights_derived_metrics_sv
  TABLES (
    flights PRIMARY KEY (flight_id),
    airports PRIMARY KEY (airport_code)
  ) RELATIONSHIPS (
    flight_departure_airport AS flights(departure_airport) REFERENCES airports(airport_code),
    flight_arrival_airport AS flights(arrival_airport) REFERENCES airports(airport_code)
  ) DIMENSIONS (
    airports.city_name AS city_name
  ) METRICS (
    flights.m_flight_count AS COUNT(flight_id),
    flights.m_flight_departure_count USING (flight_departure_airport) AS flights.m_flight_count,
    flights.m_flight_arrival_count USING (flight_arrival_airport) AS flights.m_flight_count,
    global_m_departure_arrival_ratio AS DIV0(flights.m_flight_departure_count, flights.m_flight_arrival_count),
    global_m_departure_arrival_sum AS flights.m_flight_departure_count + flights.m_flight_arrival_count
  );
```

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  flights_derived_metrics_sv
  METRICS global_m_departure_arrival_ratio,
    flights.m_flight_arrival_count, flights.m_flight_departure_count
  DIMENSIONS airports.city_name
);
```

```output
+------------------------+--------------------------+----------------------------------+--------------+
| M_FLIGHT_ARRIVAL_COUNT | M_FLIGHT_DEPARTURE_COUNT | GLOBAL_M_DEPARTURE_ARRIVAL_RATIO | CITY_NAME    |
|------------------------+--------------------------+----------------------------------+--------------|
|                      1 |                        2 |                         2.000000 | Seattle      |
|                      1 |                        2 |                         2.000000 | San Fransico |
|                      2 |                     NULL |                             NULL | Shanghai     |
+------------------------+--------------------------+----------------------------------+--------------+
```

### Identifying the dimensions that should be non-additive for a metric

In some cases, a metric should not be aggregated across specific dimensions. In these cases, you can mark the dimensions as
*non-additive*.

#### Understanding the problem with aggregating metrics across some dimensions

Suppose you have a table that contains the account balances of each customer’s checking and savings accounts on a specific day.

```sqlexample
CREATE OR REPLACE TABLE bank_accounts (
  customer_id VARCHAR,
  account_type VARCHAR,
  year NUMBER,
  month NUMBER,
  day NUMBER,
  balance NUMBER
);
```

```sqlexample
INSERT INTO bank_accounts VALUES
  ('cust-001', 'checking', 2024, 01, 01, 100),
  ('cust-001', 'savings', 2024, 01, 01, 110),
  ('cust-001', 'checking', 2024, 02, 10, 140),
  ('cust-001', 'savings', 2024, 02, 10, 150),
  ('cust-001', 'checking', 2024, 03, 15, 200),
  ('cust-001', 'savings', 2024, 03, 30, 210),
  ('cust-001', 'checking', 2025, 02, 15, 280),
  ('cust-001', 'savings', 2025, 02, 15, 290),
  ('cust-001', 'checking', 2025, 03, 20, 300),
  ('cust-001', 'savings', 2025, 03, 20, 310),
  ('cust-002', 'checking', 2025, 03, 30, 200),
  ('cust-002', 'savings', 2025, 03, 30, 310);
```

```sqlexample
SELECT * FROM bank_accounts;
```

```output
+-------------+--------------+------+-------+-----+---------+
| CUSTOMER_ID | ACCOUNT_TYPE | YEAR | MONTH | DAY | BALANCE |
|-------------+--------------+------+-------+-----+---------|
| cust-001    | checking     | 2024 |     1 |   1 |     100 |
| cust-001    | savings      | 2024 |     1 |   1 |     110 |
| cust-001    | checking     | 2024 |     2 |  10 |     140 |
| cust-001    | savings      | 2024 |     2 |  10 |     150 |
| cust-001    | checking     | 2024 |     3 |  15 |     200 |
| cust-001    | savings      | 2024 |     3 |  30 |     210 |
| cust-001    | checking     | 2025 |     2 |  15 |     280 |
| cust-001    | savings      | 2025 |     2 |  15 |     290 |
| cust-001    | checking     | 2025 |     3 |  20 |     300 |
| cust-001    | savings      | 2025 |     3 |  20 |     310 |
| cust-002    | checking     | 2025 |     3 |  30 |     200 |
| cust-002    | savings      | 2025 |     3 |  30 |     310 |
+-------------+--------------+------+-------+-----+---------+
```

Suppose that you want to define a semantic view that includes:

* The following dimensions:

  + Customer ID
  + Account type
  + Year
  + Month
  + Day
* A metric for the sum of the balance.

The following statement creates a semantic view that includes the dimensions and metrics listed above:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW bank_accounts_sv
  TABLES (
    bank_accounts
  )
  DIMENSIONS (
    bank_accounts.customer_id_dim AS bank_accounts.customer_id,
    bank_accounts.account_type_dim AS bank_accounts.account_type,
    bank_accounts.year_dim AS bank_accounts.year,
    bank_accounts.month_dim AS bank_accounts.month,
    bank_accounts.day_dim AS bank_accounts.day
  )
  METRICS (
    bank_accounts.m_account_balance AS SUM(balance)
  );
```

If you want to retrieve the total balance of the checking and savings accounts for each customer at the end of each year, you can
query the semantic view for the `m_account_balance` metric and specify the `customer_id_dim` and `year_dim` dimensions.

However, the `m_account_balance` metric will be the sum of the balances of each day for each customer because the metric is
aggregated by the date dimensions.

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    bank_accounts_sv
    METRICS bank_accounts.m_account_balance
    DIMENSIONS customer_id_dim, year_dim
  )
  ORDER BY customer_id_dim, year_dim;
```

```output
+-------------------+-----------------+----------+
| M_ACCOUNT_BALANCE | CUSTOMER_ID_DIM | YEAR_DIM |
|-------------------+-----------------+----------|
|               910 | cust-001        |     2024 |
|              1180 | cust-001        |     2025 |
|               510 | cust-002        |     2025 |
+-------------------+-----------------+----------+
```

In the example above, for `cust-001` in 2024, `910` is the sum of the balances for each day
(`100 + 110 + 140 + 150 + 200 + 210`).

#### Preventing a metric from being aggregated across specific dimensions

To prevent the metric from being aggregated by the date dimensions, specify the date dimensions in the NON ADDITIVE BY clause
when creating the semantic view:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW bank_accounts_sv
  TABLES (
    bank_accounts
  )
  DIMENSIONS (
    bank_accounts.customer_id_dim AS bank_accounts.customer_id,
    bank_accounts.account_type_dim AS bank_accounts.account_type,
    bank_accounts.year_dim AS bank_accounts.year,
    bank_accounts.month_dim AS bank_accounts.month,
    bank_accounts.day_dim AS bank_accounts.day
  )
  METRICS (
    bank_accounts.m_account_balance
      NON ADDITIVE BY (year_dim, month_dim, day_dim)
      AS SUM(balance)
  );
```

> **Note:**
>
> * If you specify the NON ADDITIVE BY clause in a metric, you cannot refer to that metric in the definitions of metrics that are
>   not derived. Only derived metrics can refer to metrics that specify non-additive dimensions.

Specifying the NON ADDITIVE BY clause makes the metric a *semi-additive* metric.

When you query this semantic view, the `m_account_balance` metric is no longer aggregated by the date dimensions. The query
aggregates the account balances at the end of the period in each group of queried dimensions.

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    bank_accounts_sv
    METRICS bank_accounts.m_account_balance
    DIMENSIONS customer_id_dim, year_dim
  )
  ORDER BY customer_id_dim, year_dim;
```

```output
+-------------------+-----------------+----------+
| M_ACCOUNT_BALANCE | CUSTOMER_ID_DIM | YEAR_DIM |
|-------------------+-----------------+----------|
|               210 | cust-001        |     2024 |
|               610 | cust-001        |     2025 |
|               510 | cust-002        |     2025 |
+-------------------+-----------------+----------+
```

In the example above, for `cust-001` in 2024, `210` is the sum of the checking and savings account balances for the last day
of the year that contains data:

* The last day of 2024 that contains data is `2024-03-30`.
* There is no row with that date for the checking account, so the resulting metric is the balance of the savings account
  (`210`).

As another example, if you just want the total account balance for all customers at the end of the year, you can specify the
`year_dim` dimension.

Because the date dimensions are marked as non-additive, the query sums the values at the end of the period (by date) for the
checking and savings account balances for each customer.

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    bank_accounts_sv
    METRICS bank_accounts.m_account_balance
    DIMENSIONS year_dim
  )
  ORDER BY year_dim;
```

```output
+-------------------+----------+
| M_ACCOUNT_BALANCE | YEAR_DIM |
|-------------------+----------|
|               210 |     2024 |
|               510 |     2025 |
+-------------------+----------+
```

During query processing, the rows are sorted by the non-additive dimensions, and the values from the last rows (the
*latest snapshots of values*) are aggregated to compute the metric.

> **Note:**
>
> Because the rows are sorted by the non-additive dimensions, the order in which you specify the dimensions is important. This
> is similar to the order in which you specify columns in the [ORDER BY](../../sql-reference/constructs/order-by.md) clause.

#### Specifying the sort order for non-additive dimensions

As demonstrated in the example, the metric aggregates the values of the checking and savings balances for each customer at the
end of a period. If you want to change the sort order, you can specify the ASC or DESC keyword next to the dimension name. For
example:

```sqlexample
METRICS (
  bank_accounts.m_account_balance
    NON ADDITIVE BY (year_dim DESC, month_dim DESC, day_dim DESC)
    AS SUM(balance)
);
```

In this example, the metric evaluates to the earliest date specified by `year_dim`, `month_dim`, and `day_dim`.

If the dimension includes NULL values, you can use the NULLS FIRST or NULLS LAST keywords to specify whether NULL values are
sorted first or last in the results:

```sqlexample
METRICS (
  bank_accounts.m_account_balance
    NON ADDITIVE BY (
      year_dim DESC NULLS FIRST,
      month_dim DESC NULLS FIRST,
      day_dim DESC NULLS FIRST
    )
    AS SUM(balance)
```

### Marking a fact or metric as private

If you are defining a fact or metric only for use in calculations in the semantic view and you don’t want the fact or metric to
be returned in a query, you can specify the PRIVATE keyword to mark the fact or metric as private. For example:

```sqlexample
FACTS (
  PRIVATE my_private_fact AS ...
)

METRICS (
  PRIVATE my_private_metric AS ...
)
```

> **Note:**
>
> You cannot mark a dimension as private. Dimensions are always public.

When you query a semantic view that has private facts or metrics, you cannot specify a private fact or metric in the following
clauses:

* The SELECT list
* FACTS in the [SEMANTIC_VIEW](../../sql-reference/constructs/semantic_view.md) clause
* METRICS in the [SEMANTIC_VIEW](../../sql-reference/constructs/semantic_view.md) clause
* METRICS
* WHERE in the SELECT statement or the [SEMANTIC_VIEW](../../sql-reference/constructs/semantic_view.md) clause

Some commands and functions include private facts and metrics:

* Private facts and metrics do appear in the output of the [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md) command. The rows for
  private facts and metrics have `PRIVATE` in the `access_modifier` column.
* Private facts and metrics are listed in the return value of a [GET_DDL](../../sql-reference/functions/get_ddl.md) function call, as noted
  in Getting the SQL statement for a semantic view.

Some commands and functions include private facts and metrics only under specific conditions:

* Private facts and metrics are listed in the INFORMATION_SCHEMA [SEMANTIC_FACTS](../../sql-reference/info-schema/semantic_facts.md) and
  [SEMANTIC_METRICS](../../sql-reference/info-schema/semantic_metrics.md) views only if you are using a role that has been
  granted the REFERENCES or OWNERSHIP privilege on the semantic view.

  Otherwise, these views list only the public facts and metrics.

Other commands and functions do not include private facts and metrics:

* Private facts do not appear in the output of the [SHOW SEMANTIC FACTS](../../sql-reference/sql/show-semantic-facts.md) command.
* Private metrics do not appear in the output of the [SHOW SEMANTIC METRICS](../../sql-reference/sql/show-semantic-metrics.md) command.

### Providing custom instructions for Cortex Analyst

In a semantic view, you can provide
[instructions for Cortex Analyst](../snowflake-cortex/cortex-analyst/custom-instructions.md) that explain how to:

* Generate the SQL statement
* Classify questions and prompt for additional information

To provide these custom instructions, use the following clauses:

* For instructions on how to generate the SQL statement, use the AI_SQL_GENERATION clause in the
  [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command.

  For example, to tell Cortex Analyst to generate the SQL statement so that all numeric columns are rounded to two decimal
  points, specify the following:

  ```sqlexample
  CREATE SEMANTIC VIEW my_semantic_view
    ...
    -- Definitions of logical tables, relationships, dimensions, facts, and metrics
    ...
    AI_SQL_GENERATION 'Ensure that all numeric columns are rounded to 2 decimal points.'
    ...
    -- Additional clauses
  ```
* For instructions on how to classify questions, use the AI_QUESTION_CATEGORIZATION clause.

  For example, to tell Cortex Analyst to reject questions about users, specify the following:

  ```sqlexample
  CREATE SEMANTIC VIEW my_semantic_view
    ...
    -- Definitions of logical tables, relationships, dimensions, facts, and metrics
    ...
    AI_QUESTION_CATEGORIZATION 'Reject all questions asking about users. Ask users to contact their admin.'
    ...
    -- Additional clauses
  ```

  You can also provide instructions to ask for more details, if the question isn’t clear. For example:

  ```sqlexample
  AI_QUESTION_CATEGORIZATION 'If the question asks for users without providing a product_type, consider this question UNCLEAR and ask the user to specify product_type.'
  ```

## Creating a semantic view from a YAML specification

To create a semantic view from a [YAML specification](semantic-view-yaml-spec.md), you can call the
[SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) stored procedure.

First, pass TRUE as the third argument to verify that you can create the semantic view from the YAML specification.

The following example verifies that you can use a given semantic model specification in YAML to create a semantic view named
`tpch_analysis` in the database `my_db` and schema `my_schema`:

```sqlexample-yaml
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  'my_db.my_schema',
  $$
  name: TPCH_REV_ANALYSIS
  description: Semantic view for revenue analysis
  tables:
    - name: CUSTOMERS
      description: Main table for customer data
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: CUSTOMER
      primary_key:
        columns:
          - C_CUSTKEY
      dimensions:
        - name: CUSTOMER_NAME
          synonyms:
            - customer name
          description: Name of the customer
          expr: customers.c_name
          data_type: VARCHAR(25)
        - name: C_CUSTKEY
          expr: C_CUSTKEY
          data_type: VARCHAR(134217728)
      metrics:
        - name: CUSTOMER_COUNT
          description: Count of number of customers
          expr: COUNT(c_custkey)
    - name: LINE_ITEMS
      description: Line items in orders
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: LINEITEM
      primary_key:
        columns:
          - L_ORDERKEY
          - L_LINENUMBER
      dimensions:
        - name: L_ORDERKEY
          expr: L_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: L_LINENUMBER
          expr: L_LINENUMBER
          data_type: VARCHAR(134217728)
      facts:
        - name: DISCOUNTED_PRICE
          description: Extended price after discount
          expr: l_extendedprice * (1 - l_discount)
          data_type: "NUMBER(25,4)"
        - name: LINE_ITEM_ID
          expr: "CONCAT(l_orderkey, '-', l_linenumber)"
          data_type: VARCHAR(134217728)
    - name: ORDERS
      synonyms:
        - sales orders
      description: All orders table for the sales domain
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: ORDERS
      primary_key:
        columns:
          - O_ORDERKEY
      dimensions:
        - name: ORDER_DATE
          description: Date when the order was placed
          expr: o_orderdate
          data_type: DATE
        - name: ORDER_YEAR
          description: Year when the order was placed
          expr: YEAR(o_orderdate)
          data_type: "NUMBER(4,0)"
        - name: O_ORDERKEY
          expr: O_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: O_CUSTKEY
          expr: O_CUSTKEY
          data_type: VARCHAR(134217728)
      facts:
        - name: COUNT_LINE_ITEMS
          expr: COUNT(line_items.line_item_id)
          data_type: "NUMBER(18,0)"
      metrics:
        - name: AVERAGE_LINE_ITEMS_PER_ORDER
          description: Average number of line items per order
          expr: AVG(orders.count_line_items)
        - name: ORDER_AVERAGE_VALUE
          description: Average order value across all orders
          expr: AVG(orders.o_totalprice)
  relationships:
    - name: LINE_ITEM_TO_ORDERS
      left_table: LINE_ITEMS
      right_table: ORDERS
      relationship_columns:
        - left_column: L_ORDERKEY
          right_column: O_ORDERKEY
      relationship_type: many_to_one
    - name: ORDERS_TO_CUSTOMERS
      left_table: ORDERS
      right_table: CUSTOMERS
      relationship_columns:
        - left_column: O_CUSTKEY
          right_column: C_CUSTKEY
      relationship_type: many_to_one
  $$,
TRUE);
```

If the specification is valid, the stored procedure returns the following message:

```output
+----------------------------------------------------------------------------------+
| SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML                                            |
|----------------------------------------------------------------------------------|
| YAML file is valid for creating a semantic view. No object has been created yet. |
+----------------------------------------------------------------------------------+
```

If the YAML syntax is invalid, the stored procedure throw an exception. For example, if a colon is missing:

```yaml
relationships
  - name: LINE_ITEM_TO_ORDERS
```

the stored procedure throws an exception, indicating that the YAML syntax is invalid:

```output
392400 (22023): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  Invalid semantic model YAML: while scanning a simple key
   in 'reader', line 90, column 3:
        relationships
        ^
  could not find expected ':'
   in 'reader', line 91, column 11:
          - name: LINE_ITEM_TO_ORDERS
                ^
```

If the specification refers to a physical table that does not exist, the stored procedure throws an exception:

```yaml
base_table:
  database: SNOWFLAKE_SAMPLE_DATA
  schema: TPCH_SF1
  table: NONEXISTENT
```

```output
002003 (42S02): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  SQL compilation error:
  Table 'SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.NONEXISTENT' does not exist or not authorized.
```

Similarly, if the specification refers to a primary key column that does not exist, the stored procedure throws an exception:

```yaml
primary_key:
  columns:
    - NONEXISTENT
```

```output
000904 (42000): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  SQL compilation error: error line 0 at position -1
  invalid identifier 'NONEXISTENT'
```

You can then call the stored procedure without passing in the third argument to create the semantic view.

The following example creates a semantic view named `tpch_analysis` in the database `my_db` and schema `my_schema`:

```sqlexample-yaml
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  'my_db.my_schema',
  $$
  name: TPCH_REV_ANALYSIS
  description: Semantic view for revenue analysis
  tables:
    - name: CUSTOMERS
      description: Main table for customer data
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: CUSTOMER
      primary_key:
        columns:
          - C_CUSTKEY
      dimensions:
        - name: CUSTOMER_NAME
          synonyms:
            - customer name
          description: Name of the customer
          expr: customers.c_name
          data_type: VARCHAR(25)
        - name: C_CUSTKEY
          expr: C_CUSTKEY
          data_type: VARCHAR(134217728)
      metrics:
        - name: CUSTOMER_COUNT
          description: Count of number of customers
          expr: COUNT(c_custkey)
    - name: LINE_ITEMS
      description: Line items in orders
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: LINEITEM
      primary_key:
        columns:
          - L_ORDERKEY
          - L_LINENUMBER
      dimensions:
        - name: L_ORDERKEY
          expr: L_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: L_LINENUMBER
          expr: L_LINENUMBER
          data_type: VARCHAR(134217728)
      facts:
        - name: DISCOUNTED_PRICE
          description: Extended price after discount
          expr: l_extendedprice * (1 - l_discount)
          data_type: "NUMBER(25,4)"
        - name: LINE_ITEM_ID
          expr: "CONCAT(l_orderkey, '-', l_linenumber)"
          data_type: VARCHAR(134217728)
    - name: ORDERS
      synonyms:
        - sales orders
      description: All orders table for the sales domain
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: ORDERS
      primary_key:
        columns:
          - O_ORDERKEY
      dimensions:
        - name: ORDER_DATE
          description: Date when the order was placed
          expr: o_orderdate
          data_type: DATE
        - name: ORDER_YEAR
          description: Year when the order was placed
          expr: YEAR(o_orderdate)
          data_type: "NUMBER(4,0)"
        - name: O_ORDERKEY
          expr: O_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: O_CUSTKEY
          expr: O_CUSTKEY
          data_type: VARCHAR(134217728)
      facts:
        - name: COUNT_LINE_ITEMS
          expr: COUNT(line_items.line_item_id)
          data_type: "NUMBER(18,0)"
      metrics:
        - name: AVERAGE_LINE_ITEMS_PER_ORDER
          description: Average number of line items per order
          expr: AVG(orders.count_line_items)
        - name: ORDER_AVERAGE_VALUE
          description: Average order value across all orders
          expr: AVG(orders.o_totalprice)
  relationships:
    - name: LINE_ITEM_TO_ORDERS
      left_table: LINE_ITEMS
      right_table: ORDERS
      relationship_columns:
        - left_column: L_ORDERKEY
          right_column: O_ORDERKEY
      relationship_type: many_to_one
    - name: ORDERS_TO_CUSTOMERS
      left_table: ORDERS
      right_table: CUSTOMERS
      relationship_columns:
        - left_column: O_CUSTKEY
          right_column: C_CUSTKEY
      relationship_type: many_to_one
  $$
);
```

```output
+-----------------------------------------+
| SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML   |
|-----------------------------------------|
| Semantic view was successfully created. |
+-----------------------------------------+
```

## Modifying the comment for an existing semantic view

To modify the comment for an existing semantic view, run the [ALTER SEMANTIC VIEW](../../sql-reference/sql/alter-semantic-view.md) command. For example:

```sqlexample
ALTER SEMANTIC VIEW my_semantic_view SET COMMENT = 'my comment';
```

> **Note:**
>
> You can’t use the ALTER SEMANTIC VIEW command to change properties other than the comment. To change other properties of the
> semantic view, replace the semantic view. See Replacing an existing semantic view.

You can also use the [COMMENT](../../sql-reference/sql/comment.md) command to set a comment for a semantic view:

```sqlexample
COMMENT ON SEMANTIC VIEW my_semantic_view IS 'my comment';
```

## Replacing an existing semantic view

To replace an existing semantic view (for example, to change the definition of the view), specify OR REPLACE when executing
[CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md). If you want to preserve any privileges granted on the existing semantic view,
specify COPY GRANTS. For example:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW tpch_rev_analysis

  TABLES (
    orders AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
      PRIMARY KEY (o_orderkey)
      WITH SYNONYMS ('sales orders')
      COMMENT = 'All orders table for the sales domain',
    customers AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
      PRIMARY KEY (c_custkey)
      COMMENT = 'Main table for customer data',
    line_items AS SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.LINEITEM
      PRIMARY KEY (l_orderkey, l_linenumber)
      COMMENT = 'Line items in orders'
  )

  RELATIONSHIPS (
    orders_to_customers AS
      orders (o_custkey) REFERENCES customers,
    line_item_to_orders AS
      line_items (l_orderkey) REFERENCES orders
  )

  FACTS (
    line_items.line_item_id AS CONCAT(l_orderkey, '-', l_linenumber),
    orders.count_line_items AS COUNT(line_items.line_item_id),
    line_items.discounted_price AS l_extendedprice * (1 - l_discount)
      COMMENT = 'Extended price after discount'
  )

  DIMENSIONS (
    customers.customer_name AS customers.c_name
      WITH SYNONYMS = ('customer name')
      COMMENT = 'Name of the customer',
    orders.order_date AS o_orderdate
      COMMENT = 'Date when the order was placed',
    orders.order_year AS YEAR(o_orderdate)
      COMMENT = 'Year when the order was placed'
  )

  METRICS (
    customers.customer_count AS COUNT(c_custkey)
      COMMENT = 'Count of number of customers',
    orders.order_average_value AS AVG(orders.o_totalprice)
      COMMENT = 'Average order value across all orders',
    orders.average_line_items_per_order AS AVG(orders.count_line_items)
      COMMENT = 'Average number of line items per order'
  )

  COMMENT = 'Semantic view for revenue analysis and different comment'
  COPY GRANTS;
```

## Listing semantic views

To list semantic views in the current schema or a specified schema, run the [SHOW SEMANTIC VIEWS](../../sql-reference/sql/show-semantic-views.md)
command. For example:

```sqlexample
SHOW SEMANTIC VIEWS;
```

```output
+-------------------------------+-----------------------+---------------+-------------------+----------------------------------------------+-----------------+-----------------+-----------+
| created_on                    | name                  | database_name | schema_name       | comment                                      | owner           | owner_role_type | extension |
|-------------------------------+-----------------------+---------------+-------------------+----------------------------------------------+-----------------+-----------------+-----------|
| 2025-03-20 15:06:34.039 -0700 | MY_NEW_SEMANTIC_MODEL | MY_DB         | MY_SCHEMA         | A semantic model created through the wizard. | MY_ROLE         | ROLE            | ["CA"]    |
| 2025-02-28 16:16:04.002 -0800 | O_TPCH_SEMANTIC_VIEW  | MY_DB         | MY_SCHEMA         | NULL                                         | MY_ROLE         | ROLE            | NULL      |
| 2025-03-21 07:03:54.120 -0700 | TPCH_REV_ANALYSIS     | MY_DB         | MY_SCHEMA         | Semantic view for revenue analysis           | MY_ROLE         | ROLE            | NULL      |
+-------------------------------+-----------------------+---------------+-------------------+----------------------------------------------+-----------------+-----------------+-----------+
```

The output of the [SHOW OBJECTS](../../sql-reference/sql/show-objects.md) command includes semantic views. In the `kind` column, the type of
object is listed as `VIEW`. For example:

```sqlexample
SHOW OBJECTS LIKE '%TPCH_ANALYSIS%' IN SCHEMA;
```

```output
+-------------------------------+---------------+---------------+-------------+------+---------+------------+------+-------+---------+----------------+-----------------+-----------+------------+------------+
| created_on                    | name          | database_name | schema_name | kind | comment | cluster_by | rows | bytes | owner   | retention_time | owner_role_type | is_hybrid | is_dynamic | is_iceberg |
|-------------------------------+---------------+---------------+-------------+------+---------+------------+------+-------+---------+----------------+-----------------+-----------+------------+------------|
| 2025-10-03 16:28:01.505 -0700 | TPCH_ANALYSIS | MY_DB         | MY_SCHEMA   | VIEW |         |            |    0 |     0 | MY_ROLE | 1              | ROLE            | N         | N          | N          |
+-------------------------------+---------------+---------------+-------------+------+---------+------------+------+-------+---------+----------------+-----------------+-----------+------------+------------+
```

You can also [query the views for semantic views in the ACCOUNT_USAGE and INFORMATION_SCHEMA schemas](views.md).

## Listing dimensions, facts, and metrics

To list the dimensions, facts, and metrics that are available in a view, schema, database, or account, you can run the following
commands:

* [SHOW SEMANTIC DIMENSIONS](../../sql-reference/sql/show-semantic-dimensions.md)
* [SHOW SEMANTIC FACTS](../../sql-reference/sql/show-semantic-facts.md)
* [SHOW SEMANTIC METRICS](../../sql-reference/sql/show-semantic-metrics.md)

By default, the commands list the dimensions, facts, and metrics that are available in semantic views defined in the current
schema:

```sqlexample
SHOW SEMANTIC DIMENSIONS;
```

```output
+---------------+-------------+--------------------+------------+---------------+--------------+-------------------+--------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name          | data_type    | synonyms          | comment                        |
|---------------+-------------+--------------------+------------+---------------+--------------+-------------------+--------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_NAME | VARCHAR(25)  | ["customer name"] | Name of the customer           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | C_CUSTKEY     | NUMBER(38,0) | NULL              | NULL                           |
...
```

```sqlexample
SHOW SEMANTIC FACTS;
```

```output
+---------------+-------------+--------------------+------------+------------------+--------------------+----------+-------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name             | data_type          | synonyms | comment                       |
|---------------+-------------+--------------------+------------+------------------+--------------------+----------+-------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | LINE_ITEMS | DISCOUNTED_PRICE | NUMBER(25,4)       | NULL     | Extended price after discount |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | LINE_ITEMS | LINE_ITEM_ID     | VARCHAR(134217728) | NULL     | NULL                          |
...
```

```sqlexample
SHOW SEMANTIC METRICS;
```

```output
+---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name                         | data_type    | synonyms | comment                                |
|---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_COUNT               | NUMBER(18,0) | NULL     | Count of number of customers           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | AVERAGE_LINE_ITEMS_PER_ORDER | NUMBER(36,6) | NULL     | Average number of line items per order |
...
```

The following examples demonstrate how to list the dimensions, facts, and metrics for semantic views within different scopes:

* List the dimensions, facts, and metrics in semantic views in the current database:

  ```sqlexample
  SHOW SEMANTIC DIMENSIONS IN DATABASE;

  SHOW SEMANTIC FACTS IN DATABASE;

  SHOW SEMANTIC METRICS IN DATABASE;
  ```
* List the dimensions, facts, and metrics in semantic views in a specific schema or database:

  ```sqlexample
  SHOW SEMANTIC DIMENSIONS IN SCHEMA my_db.my_other_schema;

  SHOW SEMANTIC DIMENSIONS IN DATABASE my_db;

  SHOW SEMANTIC FACTS IN SCHEMA my_db.my_other_schema;

  SHOW SEMANTIC FACTS IN DATABASE my_db;

  SHOW SEMANTIC METRICS IN SCHEMA my_db.my_other_schema;

  SHOW SEMANTIC METRICS IN DATABASE my_db;
  ```
* List the dimensions, facts, and metrics in semantic views in the account:

  ```sqlexample
  SHOW SEMANTIC DIMENSIONS IN ACCOUNT;

  SHOW SEMANTIC FACTS IN ACCOUNT;

  SHOW SEMANTIC METRICS IN ACCOUNT;
  ```
* List the dimensions, facts, and metrics in a specific semantic view:

  ```sqlexample
  SHOW SEMANTIC DIMENSIONS IN my_semantic_view;

  SHOW SEMANTIC FACTS IN my_semantic_view;

  SHOW SEMANTIC METRICS IN my_semantic_view;
  ```

If you are querying a semantic view, you can use the [SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md) command to
determine which dimensions you can return when specifying a given metric. For details, see
[Choosing the dimensions that you can return for a given metric](querying.md).

When you run the [SHOW COLUMNS](../../sql-reference/sql/show-columns.md) command for a semantic view, the output includes the dimensions, facts,
and metrics in the semantic view. The `kind` column indicates if the row represents a dimension, fact, or metric.

For example:

```sqlexample
SHOW COLUMNS IN VIEW my_db.my_schema.tpch_analysis;
```

```output
+---------------+-------------+------------------------------+-----------------------------------------------------------------------------------------+----------+---------+-----------+------------+---------+---------------+---------------+-------------------------+
| table_name    | schema_name | column_name                  | data_type                                                                               | null?    | default | kind      | expression | comment | database_name | autoincrement | schema_evolution_record |
|---------------+-------------+------------------------------+-----------------------------------------------------------------------------------------+----------+---------+-----------+------------+---------+---------------+---------------+-------------------------|
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_COUNT               | {"type":"FIXED","precision":18,"scale":0,"nullable":false}                              | NOT_NULL |         | METRIC    |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_COUNTRY_CODE        | {"type":"TEXT","length":15,"byteLength":60,"nullable":true,"fixed":false}               | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_MARKET_SEGMENT      | {"type":"TEXT","length":10,"byteLength":40,"nullable":true,"fixed":false}               | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_NAME                | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_NATION_NAME         | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_ORDER_COUNT         | {"type":"FIXED","precision":30,"scale":0,"nullable":true}                               | true     |         | METRIC    |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | CUSTOMER_REGION_NAME         | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | C_CUSTOMER_ORDER_COUNT       | {"type":"FIXED","precision":18,"scale":0,"nullable":false}                              | NOT_NULL |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | LINE_ITEM_ID                 | {"type":"TEXT","length":134217728,"byteLength":134217728,"nullable":true,"fixed":false} | true     |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | NATION_NAME                  | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | N_NAME                       | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | AVERAGE_LINE_ITEMS_PER_ORDER | {"type":"FIXED","precision":36,"scale":6,"nullable":true}                               | true     |         | METRIC    |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | COUNT_LINE_ITEMS             | {"type":"FIXED","precision":18,"scale":0,"nullable":false}                              | NOT_NULL |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | ORDER_AVERAGE_VALUE          | {"type":"FIXED","precision":30,"scale":8,"nullable":true}                               | true     |         | METRIC    |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | ORDER_COUNT                  | {"type":"FIXED","precision":18,"scale":0,"nullable":false}                              | NOT_NULL |         | METRIC    |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | ORDER_DATE                   | {"type":"DATE","nullable":true}                                                         | true     |         | DIMENSION |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | O_ORDERKEY                   | {"type":"FIXED","precision":38,"scale":0,"nullable":true}                               | true     |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | R_NAME                       | {"type":"TEXT","length":25,"byteLength":100,"nullable":true,"fixed":false}              | true     |         | FACT      |            |         | MY_DB         |               | NULL                    |
| TPCH_ANALYSIS | MY_SCHEMA   | SUPPLIER_COUNT               | {"type":"FIXED","precision":18,"scale":0,"nullable":false}                              | NOT_NULL |         | METRIC    |            |         | MY_DB         |               | NULL                    |
+---------------+-------------+------------------------------+-----------------------------------------------------------------------------------------+----------+---------+-----------+------------+---------+---------------+---------------+-------------------------+
```

## Viewing the details about a semantic view

To view the details of a semantic view, run the [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md) command. For example:

```sqlexample
DESCRIBE SEMANTIC VIEW tpch_rev_analysis;
```

```output
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
| object_kind  | object_name                  | parent_entity | property                 | property_value                         |
|--------------+------------------------------+---------------+--------------------------+----------------------------------------|
| NULL         | NULL                         | NULL          | COMMENT                  | Semantic view for revenue analysis     |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_NAME          | CUSTOMER                               |
| TABLE        | CUSTOMERS                    | NULL          | PRIMARY_KEY              | ["C_CUSTKEY"]                          |
| TABLE        | CUSTOMERS                    | NULL          | COMMENT                  | Main table for customer data           |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | TABLE                    | CUSTOMERS                              |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | EXPRESSION               | customers.c_name                       |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | DATA_TYPE                | VARCHAR(25)                            |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | SYNONYMS                 | ["customer name"]                      |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | COMMENT                  | Name of the customer                   |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_NAME          | LINEITEM                               |
| TABLE        | LINE_ITEMS                   | NULL          | PRIMARY_KEY              | ["L_ORDERKEY","L_LINENUMBER"]          |
| TABLE        | LINE_ITEMS                   | NULL          | COMMENT                  | Line items in orders                   |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | REF_TABLE                | ORDERS                                 |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | FOREIGN_KEY              | ["L_ORDERKEY"]                         |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | REF_KEY                  | ["O_ORDERKEY"]                         |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | EXPRESSION               | l_extendedprice * (1 - l_discount)     |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | DATA_TYPE                | NUMBER(25,4)                           |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | COMMENT                  | Extended price after discount          |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | EXPRESSION               | CONCAT(l_orderkey, '-', l_linenumber)  |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | DATA_TYPE                | VARCHAR(134217728)                     |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_NAME          | ORDERS                                 |
| TABLE        | ORDERS                       | NULL          | SYNONYMS                 | ["sales orders"]                       |
| TABLE        | ORDERS                       | NULL          | PRIMARY_KEY              | ["O_ORDERKEY"]                         |
| TABLE        | ORDERS                       | NULL          | COMMENT                  | All orders table for the sales domain  |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | TABLE                    | ORDERS                                 |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | REF_TABLE                | CUSTOMERS                              |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | FOREIGN_KEY              | ["O_CUSTKEY"]                          |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | REF_KEY                  | ["C_CUSTKEY"]                          |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | TABLE                    | ORDERS                                 |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | EXPRESSION               | AVG(orders.count_line_items)           |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | DATA_TYPE                | NUMBER(36,6)                           |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | COMMENT                  | Average number of line items per order |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | TABLE                    | ORDERS                                 |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | EXPRESSION               | COUNT(line_items.line_item_id)         |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | DATA_TYPE                | NUMBER(18,0)                           |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | TABLE                    | ORDERS                                 |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | EXPRESSION               | AVG(orders.o_totalprice)               |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | DATA_TYPE                | NUMBER(30,8)                           |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | COMMENT                  | Average order value across all orders  |
| DIMENSION    | ORDER_DATE                   | ORDERS        | TABLE                    | ORDERS                                 |
| DIMENSION    | ORDER_DATE                   | ORDERS        | EXPRESSION               | o_orderdate                            |
| DIMENSION    | ORDER_DATE                   | ORDERS        | DATA_TYPE                | DATE                                   |
| DIMENSION    | ORDER_DATE                   | ORDERS        | COMMENT                  | Date when the order was placed         |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | TABLE                    | ORDERS                                 |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | EXPRESSION               | YEAR(o_orderdate)                      |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | DATA_TYPE                | NUMBER(4,0)                            |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | COMMENT                  | Year when the order was placed         |
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
```

## Getting the SQL statement for a semantic view

You can call the [GET_DDL](../../sql-reference/functions/get_ddl.md) function to retrieve the DDL statement that created a semantic view.

> **Note:**
>
> To call this function for a semantic view, you must use a role that has been
> granted the REFERENCES or OWNERSHIP privilege on the semantic view.

When calling GET_DDL, pass in `'SEMANTIC_VIEW'` as the object type. For example:

```sqlexample
SELECT GET_DDL('SEMANTIC_VIEW', 'tpch_rev_analysis', TRUE);
```

```output
+-----------------------------------------------------------------------------------+
| GET_DDL('SEMANTIC_VIEW', 'TPCH_REV_ANALYSIS', TRUE)                               |
|-----------------------------------------------------------------------------------|
| create or replace semantic view DYOSHINAGA_DB.DYOSHINAGA_SCHEMA.TPCH_REV_ANALYSIS |
|     tables (                                                                                                                                                                       |
|             ORDERS primary key (O_ORDERKEY) with synonyms=('sales orders') comment='All orders table for the sales domain',                                                                                                                                                                       |
|             CUSTOMERS as CUSTOMER primary key (C_CUSTKEY) comment='Main table for customer data',                                                                                                                                                                       |
|             LINE_ITEMS as LINEITEM primary key (L_ORDERKEY,L_LINENUMBER) comment='Line items in orders'                                                                                                                                                                       |
|     )                                                                                                                                                                       |
|     relationships (                                                                                                                                                                       |
|             ORDERS_TO_CUSTOMERS as ORDERS(O_CUSTKEY) references CUSTOMERS(C_CUSTKEY),                                                                                                                                                                       |
|             LINE_ITEM_TO_ORDERS as LINE_ITEMS(L_ORDERKEY) references ORDERS(O_ORDERKEY)                                                                                                                                                                       |
|     )                                                                                                                                                                       |
|     facts (                                                                                                                                                                       |
|             ORDERS.COUNT_LINE_ITEMS as COUNT(line_items.line_item_id),                                                                                                                                                                       |
|             LINE_ITEMS.DISCOUNTED_PRICE as l_extendedprice * (1 - l_discount) comment='Extended price after discount',                                                                                                                                                                       |
|             LINE_ITEMS.LINE_ITEM_ID as CONCAT(l_orderkey, '-', l_linenumber)                                                                                                                                                                       |
|     )                                                                                                                                                                       |
|     dimensions (                                                                                                                                                                       |
|             ORDERS.ORDER_DATE as o_orderdate comment='Date when the order was placed',                                                                                                                                                                       |
|             ORDERS.ORDER_YEAR as YEAR(o_orderdate) comment='Year when the order was placed',                                                                                                                                                                       |
|             CUSTOMERS.CUSTOMER_NAME as customers.c_name with synonyms=('customer name') comment='Name of the customer'                                                                                                                                                                       |
|     )                                                                                                                                                                       |
|     metrics (                                                                                                                                                                       |
|             ORDERS.AVERAGE_LINE_ITEMS_PER_ORDER as AVG(orders.count_line_items) comment='Average number of line items per order',                                                                                                                                                                       |
|             ORDERS.ORDER_AVERAGE_VALUE as AVG(orders.o_totalprice) comment='Average order value across all orders'                                                                                                                                                                       |
|     );                                                                                                                                                                       |
+-----------------------------------------------------------------------------------+
```

The return value includes private facts and metrics (facts and metrics that are marked with
the PRIVATE keyword).

## Getting the YAML specification for a semantic view

To get the [YAML specification](semantic-view-yaml-spec.md) for a semantic view, call the
[SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_read_yaml_from_semantic_view.md) function.

The following example returns the YAML specification for the semantic view named `tpch_analysis` in the database `my_db` and
schema `my_schema`:

```sqlexample
SELECT SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW(
  'my_db.my_schema.tpch_rev_analysis'
);
```

```output
+-------------------------------------------------------------+
| READ_YAML_FROM_SEMANTIC_VIEW                                |
|-------------------------------------------------------------|
| name: TPCH_REV_ANALYSIS                                     |
| description: Semantic view for revenue analysis             |
| tables:                                                     |
|   - name: CUSTOMERS                                         |
|     description: Main table for customer data               |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: CUSTOMER                                       |
|     primary_key:                                            |
|       columns:                                              |
|         - C_CUSTKEY                                         |
|     dimensions:                                             |
|       - name: CUSTOMER_NAME                                 |
|         synonyms:                                           |
|           - customer name                                   |
|         description: Name of the customer                   |
|         expr: customers.c_name                              |
|         data_type: VARCHAR(25)                              |
|       - name: C_CUSTKEY                                     |
|         expr: C_CUSTKEY                                     |
|         data_type: VARCHAR(134217728)                       |
|   - name: LINE_ITEMS                                        |
|     description: Line items in orders                       |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: LINEITEM                                       |
|     primary_key:                                            |
|       columns:                                              |
|         - L_ORDERKEY                                        |
|         - L_LINENUMBER                                      |
|     dimensions:                                             |
|       - name: L_ORDERKEY                                    |
|         expr: L_ORDERKEY                                    |
|         data_type: VARCHAR(134217728)                       |
|       - name: L_LINENUMBER                                  |
|         expr: L_LINENUMBER                                  |
|         data_type: VARCHAR(134217728)                       |
|     facts:                                                  |
|       - name: DISCOUNTED_PRICE                              |
|         description: Extended price after discount          |
|         expr: l_extendedprice * (1 - l_discount)            |
|         data_type: "NUMBER(25,4)"                           |
|       - name: LINE_ITEM_ID                                  |
|         expr: "CONCAT(l_orderkey, '-', l_linenumber)"       |
|         data_type: VARCHAR(134217728)                       |
|   - name: ORDERS                                            |
|     synonyms:                                               |
|       - sales orders                                        |
|     description: All orders table for the sales domain      |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: ORDERS                                         |
|     primary_key:                                            |
|       columns:                                              |
|         - O_ORDERKEY                                        |
|     dimensions:                                             |
|       - name: ORDER_DATE                                    |
|         description: Date when the order was placed         |
|         expr: o_orderdate                                   |
|         data_type: DATE                                     |
|       - name: ORDER_YEAR                                    |
|         description: Year when the order was placed         |
|         expr: YEAR(o_orderdate)                             |
|         data_type: "NUMBER(4,0)"                            |
|       - name: O_ORDERKEY                                    |
|         expr: O_ORDERKEY                                    |
|         data_type: VARCHAR(134217728)                       |
|       - name: O_CUSTKEY                                     |
|         expr: O_CUSTKEY                                     |
|         data_type: VARCHAR(134217728)                       |
|     facts:                                                  |
|       - name: COUNT_LINE_ITEMS                              |
|         expr: COUNT(line_items.line_item_id)                |
|         data_type: "NUMBER(18,0)"                           |
|     metrics:                                                |
|       - name: AVERAGE_LINE_ITEMS_PER_ORDER                  |
|         description: Average number of line items per order |
|         expr: AVG(orders.count_line_items)                  |
|       - name: ORDER_AVERAGE_VALUE                           |
|         description: Average order value across all orders  |
|         expr: AVG(orders.o_totalprice)                      |
| relationships:                                              |
|   - name: LINE_ITEM_TO_ORDERS                               |
|     left_table: LINE_ITEMS                                  |
|     right_table: ORDERS                                     |
|     relationship_columns:                                   |
|       - left_column: L_ORDERKEY                             |
|         right_column: O_ORDERKEY                            |
|   - name: ORDERS_TO_CUSTOMERS                               |
|     left_table: ORDERS                                      |
|     right_table: CUSTOMERS                                  |
|     relationship_columns:                                   |
|       - left_column: O_CUSTKEY                              |
|         right_column: C_CUSTKEY                             |
|                                                             |
+-------------------------------------------------------------+
```

## Exporting a semantic view to a Tableau Data Source (TDS) file

To export a semantic view to a
[Tableau Data Source (TDS) file](https://help.tableau.com/current/pro/desktop/en-us/export_connection.htm#options-for-saving-a-local-data-source),
call the [SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_export_tds_from_semantic_view.md) function.

The following example returns the TDS file content for the semantic view `my_sv_for_export`:

```sqlexample
SELECT SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW('my_sv_for_export');
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW('MY_SV_FOR_EXPORT')                                                                                                                                                                                                              |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <?xml version="1.0" encoding="UTF-8"?>                                                                                                                                                                                                                                |
| <!--Tableau compatibility notice:                                                                                                                                                                                                                                     |
| - Generated TDS schema version 18.1 is validated against Tableau Desktop 2025.2                                                                                                                                                                                       |
| - Connection customization schema version 1 enables CAP_* settings to take effect.                                                                                                                                                                                    |
| - Update these versions if your Tableau client requires a different schema.-->                                                                                                                                                                                        |
| <!--Dimensions and measures with duplicated names [DUPLICATE_DIM] are not shown in the TDS file-->                                                                                                                                                                    |
| <datasource xmlns:user="http://www.tableausoftware.com/xml/user" formatted-name="federated.0484db64fcbd48d89e8af86a62" inline="true" version="18.1">                                                                                                                  |
|   <document-format-change-manifest>                                                                                                                                                                                                                                   |
| ...                                                                                                                                 |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

Copy the XML to a `.tds` file and open the file in Tableau Desktop.

Tableau Desktop displays a folder for each logical table in the list of folders on the left. The names of the folders use spaces
instead of underscores, and each word starts with an uppercase letter. For example, the folder name for the `date_dim` logical
table is `Date Dim`.

Each folder contains Tableau dimensions and measures that correspond to the dimensions, facts, and metrics in the semantic view.

The next sections provide more detail and the limitations of the conversion process:

### About the conversion

The function converts dimensions, facts, and metrics in the semantic view to the following equivalents in the Tableau TDS file:

| Element in the semantic view | Tableau equivalent (dimension or measure) | How the data is aggregated |
| --- | --- | --- |
| Dimension | Dimension | * For values of numeric dimensions, SUM is used. * Date dimensions are aggregated by year. * For dimensions of other types, COUNT is used. |
| Numeric fact | Measure | SUM |
| Non-numeric fact | Dimension | * Date dimensions are aggregated by year. * For dimensions of other types, COUNT is used. |
| Numeric metric | Measure | The TDS file uses a calculated field in place of the metric. The calculated field passes the value of the metric to the Snowflake [AGG](../../sql-reference/functions/agg.md) function. |
| Non-numeric metric | Dimension | * Date dimensions are aggregated by year. * For dimensions of other types, COUNT is used. |
| Numeric derived metric | Measure | The TDS file uses a calculated field in place of the metric. The calculated field passes the value of the metric to the Snowflake [AGG](../../sql-reference/functions/agg.md) function. |
| Non-numeric derived metric | Dimension | * Date dimensions are aggregated by year. * For dimensions of other types, COUNT is used. |

The following [Snowflake data types](../../sql-reference-data-types.md) are mapped to corresponding Tableau TDS data types:

| Snowflake data type | Equivalent Tableau data type |
| --- | --- |
| NUMBER/FIXED (if the scale is greater than 0) | real |
| NUMBER/FIXED (if the scale is 0 or null) | integer |
| FLOAT or DECFLOAT | real |
| STRING or BINARY | string |
| BOOLEAN | boolean |
| TIME | time |
| DATE | date |
| DATETIME or TIMESTAMP | datetime |
| GEOGRAPHY | spatial |
| Semi-structured (VARIANT, OBJECT, ARRAY), structured (ARRAY, OBJECT, MAP), unstructured (FILE), GEOMETRY, UUID, VECTOR | string |

The TDS file has the following [capabilities](https://help.tableau.com/current/pro/desktop/en-us/odbc_capabilities.htm)
customized for the connection to Snowflake:

| Customization name | Value | Effect of the customization |
| --- | --- | --- |
| `CAP_ODBC_METADATA_SUPPRESS_EXECUTED_QUERY` | `yes` | Prevents Tableau from actually running a query like `SELECT * FROM table WHERE 1=0` to see column names. |
| `CAP_ODBC_METADATA_SUPPRESS_PREPARED_QUERY` | `yes` | Prevents Tableau from “preparing” a statement (sending it to Snowflake to be parsed without executing) to learn about types. |
| `CAP_ODBC_METADATA_SUPPRESS_SELECT_STAR` | `yes` | Prevents Tableau from using a `SELECT *` query to read metadata. |
| `CAP_ODBC_METADATA_SUPPRESS_SQLCOLUMNS_API` | `no` | Forces Tableau to enable and use the standard ODBC `SQLColumns` function to return column information about the semantic view. This column information includes the names, data types, and precision of columns. |
| `CAP_DISABLE_ESCAPE_UNDERSCORE_IN_CATALOG` | `yes` | Prevents Tableau from escaping underscores when searching for the database name. |

### Limitations when using a semantic view in Tableau Desktop

The following limitations apply to semantic views in Tableau Desktop:

* You cannot create an extract from a semantic view.

  If you change your connection from Live to Extract, Tableau Desktop fails with the following error:

  ```none
  SQL compilation error:
  Requested semantic expression 'XXX' in FACTS clause must be one of the following types: (DIMENSION, FACT).
  Unable to create extract
  ```
* You cannot use the Measure Values field in a semantic view.

  If you select the Measure Values field in a semantic view, Tableau Desktop reports the following error:

  ```none
  Unable to complete action

  Error Code: B9F09DDB
  SQL compilation error: error line 1 at position 7
  Invalid metric expression 'SUM(1)'.
  ```
* You cannot select the Count field in a semantic view.

  If you select SemanticViewName(Count), Tableau Desktop reports the following error:

  ```none
  Unable to complete action

  Error Code: B9F09DDB
  SQL compilation error: error line 1 at position 7
  Invalid metric expression 'SUM(1)'.
  ```

  Tableau Desktop cannot report the number of rows in the semantic view because the number of rows can vary, depending on the
  dimensions, facts, and metrics that are specified in the query.
* You cannot drag a measure by itself.

  If you drag a measure, Tableau Desktop reports the following error:

  ```none
  Unable to complete action

  Error Code: B9F09DDB
  SQL compilation error: error line 3 at position 8
  Invalid metric expression 'COUNT(1)'.
  ```
* You cannot directly use a non-numeric metric.

  SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW converts non-numeric metrics to dimensions in Tableau. If you attempt to use one of these
  dimensions, Tableau Desktop reports the following error:

  ```none
  Unable to complete action

  Error Code: B9F09DDB
  SQL compilation error:
  Requested semantic expression 'CUSTOMER.MIN_NAME' in DIMENSIONS clause must be one of the following types: (DIMENSION, FACT).
  ```

  To work around this, convert the dimension to a measure:

  1. Right-click on the dimension, and select Convert to Measure.

     This converts the dimension to a measure, using the default aggregation Count (Distinct).
  2. To use a different aggregation, right-click on the converted measure, select Default Properties »
     Aggregations, and select the aggregation that you want to use.

## Renaming a semantic view

To rename a semantic view, run [ALTER SEMANTIC VIEW … RENAME TO …](../../sql-reference/sql/alter-semantic-view.md). For
example:

```sqlexample
ALTER SEMANTIC VIEW sv RENAME TO sv_new_name;
```

## Removing a semantic view

To remove a semantic view, run the [DROP SEMANTIC VIEW](../../sql-reference/sql/drop-semantic-view.md) command. For example:

```sqlexample
DROP SEMANTIC VIEW tpch_rev_analysis;
```

## Granting privileges on semantic views

[Semantic view privileges](../security-access-control-privileges.md) lists the privileges that you can grant on a semantic view.

The following privileges on a semantic view are required to work with the view:

* Any privilege (for example, MONITOR, REFERENCES, or SELECT) on a view is required to run the
  [DESCRIBE SEMANTIC VIEW](../../sql-reference/sql/desc-semantic-view.md) command on that view.
* Any privilege on a view is required to display that view in the output of the [SHOW SEMANTIC VIEWS](../../sql-reference/sql/show-semantic-views.md)
  command.
* SELECT is required to query the semantic view.

> **Note:**
>
> To query a semantic view, you don’t need the SELECT privilege on the tables used in the semantic view. You only need the
> SELECT privilege on the semantic view itself.
>
> This behavior is consistent with [the privileges required to query standard views](../views-introduction.md).

To use a semantic view that you do not own in [Cortex Analyst](../snowflake-cortex/cortex-analyst.md), you must use a
role that has the REFERENCES and SELECT privileges on that view.

To grant the REFERENCES and SELECT privileges on a semantic view, use the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md)
command. For example, to grant the REFERENCES and SELECT privileges on the semantic view named `my_semantic_view` to the role
`my_analyst_role`, you can run the following statement:

```sqlexample
GRANT REFERENCES, SELECT ON SEMANTIC VIEW my_semantic_view TO ROLE my_analyst_role;
```

If you have a schema containing semantic views that you want to share with Cortex Analyst users, you can use
[future grants](../security-access-control-configure.md) to grant the privileges on any semantic view that you create
in that schema. For example:

```sqlexample
GRANT REFERENCES, SELECT ON FUTURE SEMANTIC VIEWS IN SCHEMA my_schema TO ROLE my_analyst_role;
```

---
title: Using SQL to automatically generate object descriptions
source: https://docs.snowflake.com/en/user-guide/sql-cortex-descriptions.md
section: User Guide
---

# Using SQL to automatically generate object descriptions

The Cortex Powered Object Descriptions feature allows you to use the
[Snowflake Cortex COMPLETE function](../sql-reference/functions/complete-snowflake-cortex.md) to automatically generate descriptions for
tables, views, and columns. This feature leverages Snowflake-hosted large language models (LLMs) to evaluate object metadata and, if
desired, sample data to generate a description.

This topic describes how to use a stored procedure to generate descriptions programmatically. For
information about using Snowsight to generate the descriptions, see [Generate descriptions with Snowflake Cortex](ui-snowsight-cortex-descriptions.md).

## Generating a description

The [AI_GENERATE_TABLE_DESC](../sql-reference/stored-procedures/ai_generate_table_desc.md) stored procedure automatically generates a description for a table and
view. It can also generate descriptions for the columns of that table or view.

The AI_GENERATE_TABLE_DESC stored procedure accepts two arguments:

* The name of the table or view that you want to generate a description for.
* An optional configuration object that allows you to do the following:

  + Generate descriptions for the columns of the specified table or view.
  + Use sample data from the table or view to potentially improve the accuracy of the column descriptions.

Example: Generate a table description
:   ```sqlexample
    CALL AI_GENERATE_TABLE_DESC( 'my_table');
    ```

Example: Generate table and column descriptions without using sample data
:   ```sqlexample
    CALL AI_GENERATE_TABLE_DESC(
      'mydb.sch1.hr_data',
      {
        'describe_columns': true,
        'use_table_data': false
      });
    ```

Example: Generate view and column descriptions using sample data to improve accuracy
:   ```sqlexample
    CALL AI_GENERATE_TABLE_DESC(
      'mydb.sch1.v1',
      {
        'describe_columns': true,
        'use_table_data': true
      });
    ```

For the complete syntax of the stored procedure, see [AI_GENERATE_TABLE_DESC](../sql-reference/stored-procedures/ai_generate_table_desc.md).

## Working with the response

The AI_GENERATE_TABLE_DESC stored procedure returns a JSON object that contains the generated descriptions along with general
information about the table and columns. Within this object, the `description` field contains the generated description.

Suppose you created the following table:

```sqlexample
CREATE OR REPLACE TABLE mydb.sch1.hr_data (fname VARCHAR, age INTEGER);

INSERT INTO hr_data (fname, age)
    VALUES
        ('Thomas',    44),
        ('Katherine', 29),
        ('Lisa',      29);
```

Given this table, the following is an example of the JSON object returned by AI_GENERATE_TABLE_DESC:

```output
{
  "COLUMNS": [
    {
      "database_name": "mydb",
      "description": "The first name of the employee.",
      "name": "FNAME",
      "schema_name": "sch1",
      "table_name": "hr_data"
    }
    {
      "database_name": "mydb",
      "description": "A column holding data of type DecimalType representing age values.",
      "name": "AGE",
      "schema_name": "sch1",
      "table_name": "hr_data"
    },
  ],
  "TABLE": [
    {
      "database_name": "mydb",
      "description": " The table contains records of employee data, specifically demographic information. Each record includes an employee's age and name.",
      "name": "hr_data",
      "schema_name": "sch1"
    }
  ]
}
```

For more information about each JSON field, see [Returns](../sql-reference/stored-procedures/ai_generate_table_desc.md).

## Set generated descriptions as comments

To set a generated description as a comment on a table, view, or column, you must manually execute a SQL statement that includes the
SET COMMENT parameter. For example, to save a generated description for a table `t1`, execute
`ALTER TABLE t1 SET COMMENT = 'ai generated description';`.

You can write custom code to automatically generate and save descriptions. For examples of stored procedures that do this, see
Examples.

## Access control requirements

Users must have the following privileges and roles to call the AI_GENERATE_TABLE_DESC stored procedure:

* SELECT privilege on the table or view.
* SNOWFLAKE.CORTEX_USER database role.

## Availability of the feature

Your region must support the LLM used by Snowflake Cortex (like Mistral-7b and Llama 3.1-8b) to generate the descriptions. Check the
[availability of the COMPLETE function](snowflake-cortex/aisql.md). If the COMPLETE function is not supported in your region, you
must enable [cross-region inference](snowflake-cortex/cross-region-inference.md) to use the feature.

## Using sample data

When generating a description for a column, you can rely only on metadata, or you can choose to use sample data to
improve the Snowflake Cortex Powered Description. Sample data refers to data within a particular column that is evaluated when you
use Snowflake Cortex to generate descriptions. If you choose to use sample data, Snowflake uses a portion of the sample data to generate the
description, which leads to more accurate descriptions. Sample data is not stored by Snowflake as Usage Data.

## Cost considerations

Generating descriptions incurs the following costs:

* Credits consumed by the warehouse in use.
* Credits charged for the use of Snowflake Cortex with smaller LLMs like Mistral-7b and Llama 3.1-8b. These charges appear on a bill as
  AI-Services, which includes all uses of Snowflake Cortex.

## Limitations

You cannot generate column descriptions for objects with more than 5,000 columns.

## Legal Notices

This feature relies on the COMPLETE function to generate a recommended object description. When the user initiates the description
generation, Usage Data may be collected through the COMPLETE function.

The generated description is not retained by Snowflake until it is saved by the user.

For additional information about the use of AI, see [Snowflake AI and ML](../guides-overview-ai-features.md).

## Examples

The following examples create and call a stored procedure to generate object descriptions:

### Example: Generate descriptions and set them as comments

**Step 1: Create a stored procedure**

The following stored procedure does the following:

* Automatically generates descriptions for all tables (and their columns) in a schema.
* Sets these descriptions as comments on the tables and columns.

```sqlexample-python
CREATE OR REPLACE PROCEDURE DESCRIBE_TABLES_SET_COMMENT (database_name STRING, schema_name STRING,
  set_table_comment BOOLEAN,
  set_column_comment BOOLEAN)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.10'
  PACKAGES=('snowflake-snowpark-python','joblib')
  HANDLER = 'main'
AS
$$
import json
from joblib import Parallel, delayed
import multiprocessing

def generate_descr(session, database_name, schema_name, table, set_table_comment, set_column_comment):
  table_name =  table['TABLE_NAME']
  async_job = session.sql(f"CALL AI_GENERATE_TABLE_DESC( '{database_name}.{schema_name}.{table_name}',{{'describe_columns': true, 'use_table_data': true}})").collect_nowait()
  result = async_job.result()
  output = json.loads(result[0][0])
  columns_ret = output["COLUMNS"]
  table_ret = output["TABLE"][0]

  table_description = table_ret["description"]
  table_name = table_ret["name"]
  database_name = table_ret["database_name"]
  schema_name = table_ret["schema_name"]

  if (set_table_comment):
      table_description = table_description.replace("'", "\\'")
      session.sql(f"""ALTER TABLE {database_name}.{schema_name}.{table_name} SET COMMENT = '{table_description}'""").collect()

  for column in columns_ret:
      column_description = column["description"];
      column_name = column["name"];
      if not column_name.isupper():
        column_name = '"' + column_name + '"'

      if (set_column_comment):
          column_description = column_description.replace("'", "\\'")
          session.sql(f"""ALTER TABLE  {database_name}.{schema_name}.{table_name} MODIFY COLUMN {column_name}  COMMENT '{column_description}'""").collect()

  return 'Success';

def main(session, database_name, schema_name, set_table_comment, set_column_comment):

    schema_name = schema_name.upper()
    database_name = database_name.upper()
    tablenames = session.sql(f"""SELECT table_name
                      FROM {database_name}.information_schema.tables
                      WHERE table_schema = '{schema_name}'
                      AND table_type = 'BASE TABLE'""").collect()
    try:
        Parallel(n_jobs=multiprocessing.cpu_count(), backend="threading")(
                delayed(generate_descr)(
                    session,
                    database_name,
                    schema_name,
                    table,
                    set_table_comment,
                    set_column_comment,
                ) for table in tablenames
            )
        return 'Success'
    except Exception as e:
        # Catch and return the error message
        return f"An error occurred: {str(e)}"
$$;
```

**Step 2: Call the stored procedure**

Assuming your schema is named `my_db.sch1`, call the stored procedure as follows to generate descriptions for both tables and columns:

```sqlexample
CALL describe_tables_set_comment('my_db', 'sch1', true, true);
```

You can run a DESC TABLE command to verify that the generated descriptions were set as comments on a table.

### Example: Generate descriptions and save them to a catalog table

**Step 1: Create a stored procedure**

The following stored procedure does the following:

* Automatically generates descriptions for all tables (and their columns) in a schema.
* Populates a catalog table, where each row represents a table or column with its generated description.

```sqlexample-python
CREATE OR REPLACE PROCEDURE DESCRIBE_TABLES_SET_CATALOG (database_name string, schema_name string, catalog_table string)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.10'
  PACKAGES=('snowflake-snowpark-python','joblib')
  HANDLER = 'main'
AS
$$
import json
from joblib import Parallel, delayed
import multiprocessing

def generate_descr(session, database_name, schema_name, table, catalog_table):
    table_name =  table['TABLE_NAME']
    async_job = session.sql(f"CALL AI_GENERATE_TABLE_DESC( '{database_name}.{schema_name}.{table_name}',{{'describe_columns': true, 'use_table_data': true}})").collect_nowait()
    result = async_job.result()
    output = json.loads(result[0][0])
    columns_ret = output["COLUMNS"]
    table_ret = output["TABLE"][0]

    table_description = table_ret["description"]
    table_description = table_description.replace("'", "\\'")
    table_name = table_ret["name"]
    database_name = table_ret["database_name"]
    schema_name = table_ret["schema_name"]

    session.sql(f"""INSERT INTO {catalog_table} (domain, description, name, database_name, schema_name, table_name)
                          VALUES ('TABLE', '{table_description}', '{table_name}', '{database_name}', '{schema_name}', null)""").collect()

    for column in columns_ret:
        column_description = column["description"];
        column_description = column_description.replace("'", "\\'")
        column_name = column["name"];
        if not column_name.isupper():
            column_name = '"' + column_name + '"'
        session.sql(f"""INSERT INTO {catalog_table} (domain, description, name, database_name, schema_name, table_name)
                          VALUES ('COLUMN', '{column_description}', '{column_name}', '{database_name}', '{schema_name}', '{table_name}')""").collect()

    return 'Success';

def main(session, database_name, schema_name, catalog_table):

    schema_name = schema_name.upper()
    database_name = database_name.upper()
    catalog_table_upper = catalog_table.upper()
    tablenames = session.sql(f"""SELECT table_name
                      FROM {database_name}.information_schema.tables
                      WHERE table_schema = '{schema_name}'
                      AND table_type = 'BASE TABLE'
                      AND table_name !='{catalog_table_upper}'""").collect()
    try:
        Parallel(n_jobs=multiprocessing.cpu_count(), backend="threading")(
                delayed(generate_descr)(
                    session,
                    database_name,
                    schema_name,
                    table,
                    catalog_table,
                ) for table in tablenames
            )
        return 'Success'
    except Exception as e:
        # Catch and return the error message
        return f"An error occurred: {str(e)}"
$$;
```

**Step 2: Create the catalog table to populate**

Use the following code to create the catalog table where table and column descriptions are stored.

```sqlexample
CREATE OR REPLACE TABLE catalog_table (
  domain VARCHAR,
  description VARCHAR,
  name VARCHAR,
  database_name VARCHAR,
  schema_name VARCHAR,
  table_name VARCHAR
  );
```

**Step 3: Call the stored procedures**

Assuming your schema is named `my_db.sch1`, call the stored procedure as follows:

```sqlexample
CALL describe_tables_set_catalog('my_db', 'sch1', 'catalog_table');
```

---
title: Using synthetic data in Snowflake
source: https://docs.snowflake.com/en/user-guide/synthetic-data.md
section: User Guide
---

# Using synthetic data in Snowflake

This release introduces a new stored procedure, [GENERATE_SYNTHETIC_DATA](../sql-reference/stored-procedures/generate_synthetic_data.md), to generate synthetic data.

## Overview

Snowflake can generate *synthetic data* from a source table, producing a table with the same number of columns as the source
table, but with statistically similar artificial data. You can use synthetic data to share or test data that is too sensitive,
confidential, or otherwise restricted to share with others. The synthetic data set has the same characteristics as the source data set,
such as name, number, and data type of columns, and the same or fewer number of rows. You can use synthetic data to test and validate
workloads in Snowflake, particularly when the original data is sensitive and should’t be accessible to unauthorized users. Synthetic data
appears in the [data lineage graph](ui-snowsight-lineage.md).

### Benefits

Statistical consistency:
:   A synthetic data set represents the statistical properties of the original data set, which helps data engineers to understand the
    statistical properties of the real data set. Subsequently, the data engineer can test and validate solutions that are based on the real
    data set.

Production validation:
:   A synthetic data set similar to a production data set enables production engineers to test and validate their production
    environment. The result is a more robust production environment.

### About the synthetic data algorithm

Snowflake uses an algorithm to generate synthetic data that is similar to the original data set. The algorithm uses the original data set
to generate synthetic data that has the same statistical properties as the original data set. Once this distribution is captured, the
synthetic data resembles the original data statistically but does not have a direct reference or link to any row from the original data.

## Generating synthetic data

Call [GENERATE_SYNTHETIC_DATA](../sql-reference/stored-procedures/generate_synthetic_data.md) to generate synthetic data from one or more tables. Snowflake creates
synthetic data tables with ownership granted to the role that calls the stored procedure. The output tables have the same number of columns
as the input tables, with the same column names and data types. The output generally has the same number of rows, unless you enable the
privacy filter, in which case the output tables might have fewer rows.

### Generated data values

Snowflake generates synthetic data for non-join-key columns according to the source data type:

* **Statistical data:** Data of type number, boolean, date, time, or timestamp. Generated data is the same type, with similar values to
  the source data.
* **Categorical string:** A string column with *few* unique values‡. Generated data uses actual values from the source data.
* **Non-categorical string:** A string column with *many* unique values‡. Redacted in the output unless you specify an output
  format with the `replace` option in GENERATE_SYNTHETIC_DATA.

You can explicitly designate a non-join-key string column as categorical or non-categorical by providing a `categorical` value to
GENERATE_SYNTHETIC_DATA. Join key columns must be non-categorical strings or statistical.

Generated data in each table maintains the approximate distributions and correlations present in the original table.

Columns designated as join keys can be of any data type, and will result in synthetic data of the same type and consistent, but artificial,
values.

‡ *Few unique values* means that the number of unique values is less than half the row count. *Many unique values* means that the
number of unique values is more than half the row count.

### Maintaining join key consistency in synthetic data

If you plan to run join queries on your synthetic data, designate every column that you will join on as a *join key*. You can designate
any numeric, boolean, or non-categorical column as a join key by assigning the `join_key` value in GENERATE_SYNTHETIC_DATA. A consistent
synthetic value is generated in the output data for the same value in the source data for all join keys in all tables during a single run.
This enables you to run join queries and get similar results as you would when running the same query against the source data.

To maintain join consistency between tables, be sure that the same join key column in each table has the same arguments. That is,
if you expect `cust_id` to be joinable across tables, provide the same set of arguments and values in the `columns` description in each
dataset object:

```sqlexample
'datasets':[
  {
    'input_table': 'd.s.orders',
    'output_table': 'd.s.orders_synth',
    'columns': {'cust_id': {'join_key': True, 'replace': 'uuid'}, ...}
  },
  {
    'input_table': 'd.s.customers',
    'output_table': 'd.s.customers_synth',
    'columns' : {'cust_id': {'join_key': True, 'replace':'uuid'}, ...}

  }
]
```

If you provide a [symmetric string secret](../sql-reference/sql/create-secret.md) to `consistency_secret` in GENERATE_SYNTHETIC_DATA, join key
values will be consistent across tables and multiple runs. If you do not specify a secret, then the join key values will be consistent
across all tables in a single run, but not across multiple runs. Multi-run consistency is supported only for string columns.

> **Note:**
>
> If you use provide a SECRET object to GENERATE_SYNTHETIC_DATA, you need the READ or OWNERSHIP privilege on that SECRET.

**Example: Single-run join key consistency**

```sqlexample
CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
  'datasets':[
      {
        'input_table': 'CLINICAL_DB.PUBLIC.PATIENTS1',
        'output_table': 'MY_DB.PUBLIC.PATIENTS1',
        'columns': { 'patient_id': {'join_key': TRUE}, 'age':{'join_key': TRUE}}
      },
      {
        'input_table': 'CLINICAL_DB.PUBLIC.PATIENTS2',
        'output_table': 'MY_DB.PUBLIC.PATIENTS2',
        'columns': { 'patient_id': {'join_key': TRUE}, 'age':{'join_key': TRUE}}
      }
    ],
    'replace_output_tables': TRUE
});
```

**Example: Multi-run join key consistency**

```sqlexample
-- Generate consistent join keys across multiple runs by
-- providing a symmetric key secret.
CREATE OR REPLACE SECRET my_db.public.my_consistency_secret
  TYPE=SYMMETRIC_KEY
  ALGORITHM=GENERIC;

CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
  'datasets':[
      {
        'input_table': 'CLINICAL_DB.PUBLIC.BASE_TABLE',
        'output_table': 'MY_DB.PUBLIC.PATIENTS1',
        'columns': { 'patient_id': {'join_key': TRUE}}
      }
    ],
    'consistency_secret': SYSTEM$REFERENCE('SECRET', 'MY_CONSISTENCY_SECRET', 'SESSION', 'READ')::STRING,
    'replace_output_tables': TRUE
});

CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
  'datasets':[
      {
        'input_table': 'CLINICAL_DB.PUBLIC.SECOND_TABLE',
        'output_table': 'MY_DB.PUBLIC.PATIENTS2',
        'columns': { 'patient_id': {'join_key': TRUE}}
      }
    ],
    'consistency_secret': SYSTEM$REFERENCE('SECRET', 'MY_CONSISTENCY_SECRET', 'SESSION', 'READ')::STRING,
    'replace_output_tables': TRUE
});
```

### Enhancing privacy

When you call the GENERATE_SYNTHETIC_DATA stored procedure, you can optionally set the `'similarity_filter': True` configuration
option to apply a privacy filter to the output table. The privacy filter removes rows from the output table if the rows are too similar to
the input data set. The privacy threshold uses the nearest neighbor distance ratio (NNDR) and distance to closest record (DCR) values to
determine whether a row should be removed from the output table.

When using a similarity filter, all non-string columns must have values for all rows. A NULL value in a non-string column will cause the
procedure to fail.

## Requirements

### Input table requirements

Both tables and views are supported as source data. You can specify up to five input tables per procedure call.

To generate synthetic data, *each* input table or view must meet the following requirements:

* Minimum 20 distinct rows
* Maximum 100 columns
* Maximum 14M rows
* The following input table types are supported:

  + Regular, temporary, dynamic, and transient tables
  + Regular, materialized, secure, and secure materialized views
* The following input table types are not supported:

  + External, Apache Iceberg™, and hybrid tables
  + Streams
* The following column types are supported. Columns of an unsupported data type return NULL for all values in
  the column.

  + All numeric types (NUMBER, DECIMAL, FLOAT, INTEGER, and so on)
  + BOOLEAN
  + All date and time types (DATE, DATETIME, TIME, TIMESTAMP, and so on) except TIMESTAMP_TZ.
    However, timestamps earlier than `1677-09-21 00:12:43.145224193` or later than `2262-04-11 23:47:16.854775807` in the source
    data are coerced to `1677-09-21 00:12:43.145224193` or `2262-04-11 23:47:16.854775807` respectively when generating synthetic data.
  + STRING, VARCHAR, CHAR, CHARACTER, TEXT

    If more than half of the values in a STRING column are unique values, Snowflake replaces the
    value with a redacted value in the output table due to privacy concerns.

### Access control requirements

To generate synthetic data, you must use a role with each the following grants:

* USAGE on the warehouse that you want to use for queries.
* SELECT on the input table from which you want to generate synthetic data.
* USAGE on the database and schema that contain the input table, and on the database that contains the output table.
* CREATE TABLE on the schema that contains the output table.
* OWNERSHIP on the output tables. The simplest way to do this is by granting OWNERSHIP to the schema where the output table is
  generated. (However, if someone has applied a FUTURE GRANT on this schema, table ownership will be silently overridden – that is,
  `GRANT OWNERSHIP ON FUTURE TABLES IN SCHEMA db.my_schema TO ROLE some_role` will automatically grant OWNERSHIP to `some_role` on any
  new tables created in schema `my_schema`.)

All users can access the SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA stored procedure. Access is made available using the
SNOWFLAKE.CORE_VIEWER database role, which is granted to the PUBLIC role.

### Other requirements

You must [accept the Anaconda terms and conditions](../developer-guide/udf/python/udf-python-packages.md) in your Snowflake account in order to enable this
feature.

## Recommendations

* Use a medium [Snowpark-optimized warehouse](warehouses-snowpark-optimized.md).
* While `GENERATE_SYNTHETIC_DATA` is running, do not run any other queries in that warehouse.

## Example: Synthetic data from multiple tables

This example uses the [Snowflake Sample Data database SNOWFLAKE_SAMPLE_DATA](sample-data-using.md). If you don’t see it in
your account, you can copy it with the following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE or REPLACE DATABASE SNOWFLAKE_SAMPLE_DATA from share SFC_SAMPLES.SAMPLE_DATA;
```

Follow these steps to generate synthetic data from multiple input table:

1. Create and configure the access control for the `data_engineer` role to allow them to create all of the necessary objects:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE OR REPLACE ROLE data_engineer;
   CREATE OR REPLACE DATABASE syndata_db;
   CREATE OR REPLACE WAREHOUSE syndata_wh;

   GRANT OWNERSHIP ON DATABASE syndata_db TO ROLE data_engineer;
   GRANT USAGE ON WAREHOUSE syndata_wh TO ROLE data_engineer;
   GRANT ROLE data_engineer TO USER jsmith; -- Or whoever you want to run this example. Or skip this line to run it yourself.
   ```
2. Create two views from the Snowflake Sample Data database:

   ```sqlexample
   - Sign in as user with data_engineer role. Then...
   CREATE SCHEMA syndata_db.sch;
   CREATE OR REPLACE VIEW syndata_db.sch.TPC_ORDERS_5K as (
       SELECT * from SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
       LIMIT 5000
   );
   CREATE OR REPLACE VIEW syndata_db.sch.TPC_CUSTOMERS_5K as (
       SELECT * from SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.CUSTOMER
       LIMIT 5000
   );
   ```
3. Query the input tables to view the data and confirm that each table has 5,000 rows:

   ```sqlexample
   USE WAREHOUSE syndata_wh;
   SELECT TOP 20 * FROM syndata_db.sch.TPC_ORDERS_5K;
   SELECT COUNT(*) FROM syndata_db.sch.TPC_ORDERS_5K;
   select count(distinct o_clerk), count(*) from syndata_db.sch.TPC_ORDERS_5K;

   SELECT TOP 20 * FROM syndata_db.sch.TPC_CUSTOMERS_5K;
   SELECT COUNT(*) FROM syndata_db.sch.TPC_CUSTOMERS_5K;
   ```
4. Call the [GENERATE_SYNTHETIC_DATA](../sql-reference/stored-procedures/generate_synthetic_data.md) stored procedure to generate the synthetic data into two output
   tables. Designate join keys, because you will join on those keys later.

   ```sqlexample
   CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
       'datasets':[
           {
             'input_table': 'syndata_db.sch.TPC_ORDERS_5K',
             'output_table': 'syndata_db.sch.TPC_ORDERS_5K_SYNTHETIC',
             'columns': {'O_CUSTKEY': {'join_key': True}}
           },
           {
             'input_table': 'syndata_db.sch.TPC_CUSTOMERS_5K',
             'output_table': 'syndata_db.sch.TPC_CUSTOMERS_5K_SYNTHETIC',
             'columns' : {'C_CUSTKEY': {'join_key': True}}

           }
         ],
         'replace_output_tables':True
     });
   ```
5. Query the output table to view the synthetic data:

   ```sqlexample
   SELECT TOP 20 * FROM syndata_db.sch.TPC_ORDERS_5K_SYNTHETIC;
   SELECT COUNT(*) FROM syndata_db.sch.TPC_ORDERS_5K_SYNTHETIC;

   SELECT TOP 20 * FROM syndata_db.sch.TPC_CUSTOMERS_5K_SYNTHETIC;
   SELECT COUNT(*) FROM syndata_db.sch.TPC_CUSTOMERS_5K_SYNTHETIC;
   ```
6. Clean up all the objects

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   DROP DATABASE syndata_db;
   DROP ROLE data_engineer;
   DROP WAREHOUSE syndata_wh;
   ```

---
title: Using SYSTEM$SEND_EMAIL to send email notifications
source: https://docs.snowflake.com/en/user-guide/notifications/email-stored-procedures.md
section: User Guide
---

# Using SYSTEM$SEND_EMAIL to send email notifications

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

This topic explains how to use the built-in [SYSTEM$SEND_EMAIL](../../sql-reference/stored-procedures/system_send_email.md) stored procedure to send
email notifications.

## Introduction

This feature uses the [notification integration](email-notifications.md) object, which is a Snowflake
object that provides an interface between Snowflake and third-party services (e.g. cloud message queues, email, etc.).

## Sending an email notification

Before you send a notification, you must have a notification integration that you will use to send the notification. You must also
validate the email addresses of the recipients. For details, see [Notifications in Snowflake](about-notifications.md).

To send the email notification, call the [SYSTEM$SEND_EMAIL](../../sql-reference/stored-procedures/system_send_email.md) stored procedure.

For example, to use the notification integration `my_email_int` to send an email message with the subject line
“Email Alert: Task A has finished.” to `first.last@example.com` and `first2.last2@example.com`, execute the following statement:

```sqlexample
CALL SYSTEM$SEND_EMAIL(
    'my_email_int',
    'first.last@example.com, first2.last2@example.com',
    'Email Alert: Task A has finished.',
    'Task A has successfully finished.\nStart Time: 10:10:32\nEnd Time: 12:15:45\nTotal Records Processed: 115678'
);
```

> **Note:**
>
> If you set the ALLOWED_RECIPIENTS property of the notification integration, and any email address in the recipient list is not
> on that list, no email notifications are sent.

If you are on the Amazon Web Services (AWS) cloud platform, then the email notification message is sent from
`no-reply@snowflake.net`. If you are on the Google Cloud Platform (GCP) or Microsoft Azure (Azure)
cloud platform, the email notification message is sent from `do-not-reply@snowflake.net`.

---
title: Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications
source: https://docs.snowflake.com/en/user-guide/notifications/snowflake-notifications.md
section: User Guide
---

# Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

If you need to send notifications to an email address, webhook, or a queue provided by a cloud service (Amazon SNS, Google Cloud
PubSub, or Azure Event Grid), use the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure.

With a single call to this stored procedure, you can:

* Send a message to multiple types of destinations (email addresses, webhooks, and queues).
* Send a message to multiple email addresses, webhooks, and queues.
* Send a message in a specified format, according to the type of notification integration (plain text or HTML for email, JSON
  for queues).

For example, with a single call, you can send messages in plain text, HTML, and JSON formats to multiple email addresses and
multiple SNS, PubSub, and Event Grid topics.

You can use multiple notification integrations to send the notification to different queues. You can also create multiple email
notification integrations that have different sets of email addresses and subject lines, making it easier to configure email
messages for different recipients.

## Send a notification

Before you send a notification, you must have a notification integration that you will use to send the notification. If you are
sending an email notification, you must also validate the email addresses of the recipients. For details, see
[Notifications in Snowflake](about-notifications.md).

To send a notification to email addresses or queues, call the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure, specifying the messages and the
notification integrations to use.

The following is an example of a call to this stored procedure:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
   -- Message type and content.
  '{ "text/html": "<p>This is a message.</p>" }',
  -- Integration used to send the notification and values used for the subject and recipients.
  -- These values override the defaults specified in the integration.
  '{
    "my_email_int": {
      "subject": "Status update",
      "toAddress": ["person_a@example.com", "person_b@example.com"],
      "ccAddress": ["person_c@example.com"],
      "bccAddress": ["person_d@example.com"]
    }
  }'
);
```

As shown in the example above, you pass in JSON-formatted strings as arguments to specify the message to send and the
notification integration to use.

For the syntax for these strings, see [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md).

To construct these JSON-formatted strings, you can call helper functions like [TEXT_HTML](../../sql-reference/functions/text_html.md) to specify
the message and [EMAIL_INTEGRATION_CONFIG](../../sql-reference/functions/email_integration_config.md) to specify the notification integration, subject line,
and email addresses. For example:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_HTML('<p>a message</p>'),
  SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
    'my_email_int',
    'Status update',
    ARRAY_CONSTRUCT('person_a@example.com', 'person_b@example.com'),
    ARRAY_CONSTRUCT('person_c@example.com'),
    ARRAY_CONSTRUCT('person_d@example.com')
  )
);
```

For the list of helper functions that you can use, see [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md).

## Override the default values in the email notification integration

To use a different set of recipients or a different subject line from
[the default specified in the email notification integration](email-notifications.md), set the
following properties of the integration configuration object that you pass to
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md):

* `subject` (this cannot exceed 256 characters in length)
* `toAddress`
* `ccAddress`
* `bccAddress`

For example, to use the email notification integration `my_email_int` and override the subject line, “To:” line, “Cc:” line,
and “Bcc:” line:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  '{ "text/html": "<p>This is a message.</p>" }',
  '{
    "my_email_int": {
      "subject": "Status update",
      "toAddress": ["person_a@example.com", "person_b@example.com"],
      "ccAddress": ["person_c@example.com"],
      "bccAddress": ["person_d@example.com"]
    }
  }'
);
```

To construct the JSON-formatted string for the integration configuration, you can call the
[EMAIL_INTEGRATION_CONFIG](../../sql-reference/functions/email_integration_config.md) helper function.

For example, to send the email message to [oncall-a@snowflake.com](mailto:oncall-a%40snowflake.com) and [oncall-b@snowflake.com](mailto:oncall-b%40snowflake.com) with the subject line “Service down”, execute the following statement:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('Your message'),
  SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
    'my_email_int,
    'Service down',
    ARRAY_CONSTRUCT('oncall-a@example.com', 'oncall-b@example.com')
  )
);
```

To include the “Cc:” and “Bcc:” lines in the email message, pass in additional arguments with arrays of email addresses for those
lines:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('Your message'),
  SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
    'my_email_int,
    'Service down',
    ARRAY_CONSTRUCT('oncall-a@example.com', 'oncall-b@example.com'),
    ARRAY_CONSTRUCT('cc-a@example.com', 'cc-b@example.com'),
    ARRAY_CONSTRUCT('bcc-a@example.com', 'bcc-b@example.com')
  )
);
```

If you only want to set the “Cc:” or “Bcc:” line (not both), pass in an empty array or NULL for the corresponding arguments.
If you are constructing the JSON object without using the helper function, omit the `ccAddress` or `bccAddress`
property from the JSON object.

## Send HTML, plain text, and JSON messages

To send a message in HTML, plain text, or JSON, pass in a JSON object that contains the message type as the name of the property
and the message as the value of the property:

```json
'{ "<message_type>": "<message>" }'
```

`"message_type"` can be one of the following values:

* `"text/html"`
* `"text/plain"`
* `"application/json"`

For example:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  '{ "text/html": "<p>This is a message.</p>" }',
  '{ "my_email_int": {} }'
);
```

To construct the JSON object for the message, you can use the following helper functions:

* For an HTML message, call [TEXT_HTML](../../sql-reference/functions/text_html.md).
* For a plain text message, call [TEXT_HTML](../../sql-reference/functions/text_html.md).
* For a JSON message, call [APPLICATION_JSON](../../sql-reference/functions/application_json.md).

The following example sends an HTML message, using the `my_email_int` email notification integration:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_HTML('<p>a message</p>'),
  SNOWFLAKE.NOTIFICATION.INTEGRATION('my_email_int')
);
```

The following example sends a plain text message, using the same integration:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('A message'),
  SNOWFLAKE.NOTIFICATION.INTEGRATION('my_email_int')
);
```

The following example sends a JSON message to the queue specified by the `my_queue_int` notification integration. For
instructions on creating a notification integration for a queue, see the following topics:

* [Creating a notification integration to send notifications to an Amazon SNS topic](creating-notification-integration-amazon-sns.md)
* [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](creating-notification-integration-azure-event-grid.md)
* [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](creating-notification-integration-google-pubsub.md)

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{ "name": "value" }'),
  SNOWFLAKE.NOTIFICATION.INTEGRATION('my_sns_int')
);
```

## Send a notification using multiple integrations

You can use multiple integrations to send messages when:

* You want to send a message in email and to a topic in the same function call.
* You want to send a message to different email addresses specified by different email notification integrations.

To use multiple integrations, call the [ARRAY_CONSTRUCT](../../sql-reference/functions/array_construct.md) function to construct an array of
integration configurations, and pass the array as the second argument of the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../sql-reference/stored-procedures/system_send_snowflake_notification.md) stored procedure.

For example, to send a plain text message to a queue and email addresses configured in different notification integrations:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  '{"text/plain":"A message"}',
  ARRAY_CONSTRUCT(
    '{"my_sns_int":{}}',
    '{"my_email_int":{}}',
 )
);
```

> **Note:**
>
> The array cannot contain more than one object for the same notification integration.

If you prefer to use the helper functions to construct the integration configurations, you can pass the values returned by the
helper functions to the ARRAY_CONSTRUCT function. For example:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('A message'),
  ARRAY_CONSTRUCT(
    SNOWFLAKE.NOTIFICATION.INTEGRATION('my_sns_int'),
    SNOWFLAKE.NOTIFICATION.INTEGRATION('my_email_int')
  )
);
```

The following example sends messages in different formats to a queue and email addresses:

```sqlexample
CALL SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  ARRAY_CONSTRUCT(
    SNOWFLAKE.NOTIFICATION.TEXT_PLAIN('A message'),
    SNOWFLAKE.NOTIFICATION.TEXT_HTML('<p>A message</p>'),
    SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{ "name": "value" }')
  ),
  ARRAY_CONSTRUCT(
    SNOWFLAKE.NOTIFICATION.INTEGRATION('my_sns_int'),
    SNOWFLAKE.NOTIFICATION.INTEGRATION('my_email_int')
  )
);
```

---
title: Using the Query Acceleration Service (QAS)
source: https://docs.snowflake.com/en/user-guide/query-acceleration-service.md
section: User Guide
---

# Using the Query Acceleration Service (QAS)

The query acceleration service (QAS) can accelerate parts of the query workload in a warehouse. When it is enabled for a warehouse,
it can improve overall warehouse performance by reducing the impact of outlier queries, which are queries that use more resources than the
typical query. The query acceleration service does this by offloading portions of the query processing work to shared compute resources that
are provided by the service.

Examples of the types of workloads that might benefit from the query acceleration service include:

* Ad hoc analytics.
* Workloads with unpredictable data volume per query.
* Queries with large scans and selective filters.

The query acceleration service can handle these types of workloads more efficiently by performing more work in parallel and reducing the
wall-clock time spent in scanning and filtering.

> **Note:**
>
> The query acceleration service depends on server availability. Therefore, performance improvements might fluctuate over time.

## SQL commands that QAS can accelerate

The query acceleration service supports the following SQL commands:

> * SELECT
> * INSERT
> * CREATE TABLE AS SELECT (CTAS)
> * COPY INTO <table>

Within a supported SQL command, QAS might accelerate an entire query, or a subquery or clause within the query,
if the command is eligible for acceleration.

## Enabling query acceleration

To enable the query acceleration service, specify the clause ENABLE_QUERY_ACCELERATION = TRUE with the
[CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) or [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command.

### Examples

The following example enables the query acceleration service for a new warehouse named `my_wh`,
and for a warehouse named `my_other_wh` that’s initially created with QAS turned off:

```sqlexample
CREATE WAREHOUSE my_wh WITH ENABLE_QUERY_ACCELERATION = true;

CREATE WAREHOUSE my_other_wh;
ALTER WAREHOUSE my_other_wh SET ENABLE_QUERY_ACCELERATION = true;
```

Run the [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) command to display details about the `my_wh` warehouse.
The following query uses the [pipe operator](../sql-reference/operators-flow.md) (`->>`) to return information about
just the columns from the SHOW output that are relevant for QAS processing:

```sqlexample
SHOW WAREHOUSES LIKE 'my_wh'
  ->> SELECT "name",
             "enable_query_acceleration",
             "query_acceleration_max_scale_factor"
        FROM $1;
```

```output
+-------+---------------------------+-------------------------------------+
| name  | enable_query_acceleration | query_acceleration_max_scale_factor |
|-------+---------------------------+-------------------------------------|
| MY_WH | true                      |                                   8 |
+-------+---------------------------+-------------------------------------+
```

The query acceleration service might increase the credit consumption rate of a warehouse.
The maximum scale factor can help limit the consumption rate.
See Adjusting the scale factor to learn how to specify the
[QUERY_ACCELERATION_MAX_SCALE_FACTOR](../sql-reference/sql/create-warehouse.md) property.
You do so using the [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) and [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md)
commands.

The QUERY_ACCELERATION_ELIGIBLE view and the SYSTEM$ESTIMATE_QUERY_ACCELERATION function might be useful
in determining an appropriate scale factor for a warehouse.
See Identifying queries and warehouses that might benefit from query acceleration (in this topic) for details.

## Identifying queries and warehouses that might benefit from query acceleration

To identify the queries or warehouses that might benefit from the query acceleration service, you can
query the [QUERY_ACCELERATION_ELIGIBLE](../sql-reference/account-usage/query_acceleration_eligible.md) view.
You can also use the [SYSTEM$ESTIMATE_QUERY_ACCELERATION](../sql-reference/functions/system_estimate_query_acceleration.md) function to assess
whether a specific query is eligible for acceleration.

### Eligible queries

In general, queries are eligible because they have a portion of the query plan that can be run in parallel using QAS
compute resources. These queries fall into one of two patterns:

* Large scans with an aggregation or selective filter.
* Large scans that insert or copy many new rows (for example, INSERT and COPY commands).

Snowflake doesn’t have a specific cutoff for what constitutes a “large enough” scan to be eligible.
The threshold for eligibility depends on a variety of factors, including the query plan and
warehouse size. Snowflake only marks a query as eligible if there is high confidence that the query
would be accelerated if QAS was enabled. Over time, Snowflake is expanding the query patterns that
are eligible for acceleration. For example, formerly QAS didn’t accelerate queries with a LIMIT
clause and no ORDER BY clause, but now Snowflake automatically determines whether such queries can
benefit from QAS.

### Common reasons that queries are ineligible

Some queries are ineligible for query acceleration. The following are common reasons why a query cannot be accelerated:

* There aren’t enough partitions in the scan. If there aren’t enough partitions to scan, the benefits of query acceleration are offset by
  the latency in acquiring resources for the query acceleration service.
* Even if a query has a filter, the filters might not be selective enough. Alternatively, if the query has an aggregation with GROUP BY,
  the cardinality of the GROUP BY expression might be too high for eligibility.
* The query includes a LIMIT clause that prevents acceleration. QAS automatically determines which
  queries with LIMIT clauses (including those without ORDER BY) can be accelerated.
* The query includes functions that return nondeterministic results (for example, [SEQ](../sql-reference/functions/seq1.md) or [RANDOM](../sql-reference/functions/random.md)).

### Identifying queries with the SYSTEM$ESTIMATE_QUERY_ACCELERATION function

The [SYSTEM$ESTIMATE_QUERY_ACCELERATION](../sql-reference/functions/system_estimate_query_acceleration.md) function can help determine if a previously executed query might
benefit from the query acceleration service. If the query is eligible for query acceleration, the function returns the estimated query
execution time for different query acceleration [scale factors](../sql-reference/sql/create-warehouse.md).

#### Example

Execute the following statement to help determine if query acceleration might benefit a specific query:

```sqlexample
SELECT PARSE_JSON(SYSTEM$ESTIMATE_QUERY_ACCELERATION('8cd54bf0-1651-5b1c-ac9c-6a9582ebd20f'));
```

In this example, the query is eligible for the query acceleration service. The result value includes estimated
query times using the service. The `ineligibleReason` property is empty.

```sqljson
{
  "estimatedQueryTimes": {
    "1": 171,
    "10": 115,
    "2": 152,
    "4": 133,
    "8": 120
  },
  "ineligibleReason": null,
  "originalQueryTime": 300.291,
  "queryUUID": "8cd54bf0-1651-5b1c-ac9c-6a9582ebd20f",
  "status": "eligible",
  "upperLimitScaleFactor": 10
}
```

The following example shows the results for a query that is not eligible for query acceleration service:

```sqlexample
SELECT PARSE_JSON(SYSTEM$ESTIMATE_QUERY_ACCELERATION('cf23522b-3b91-cf14-9fe0-988a292a4bfa'));
```

The statement above produces the following output. The estimated query times are blank.
The `ineligibleReason` property reports why the query didn’t use QAS.

```sqljson
{
  "estimatedQueryTimes": {},
  "ineligibleReason": "NO_LARGE_ENOUGH_SCAN",
  "originalQueryTime": 20.291,
  "queryUUID": "cf23522b-3b91-cf14-9fe0-988a292a4bfa",
  "status": "ineligible",
  "upperLimitScaleFactor": 0
}
```

### Identifying queries and warehouses with the QUERY_ACCELERATION_ELIGIBLE view

Query the [QUERY_ACCELERATION_ELIGIBLE](../sql-reference/account-usage/query_acceleration_eligible.md) view
to identify the queries and warehouses that might benefit the most from the query acceleration
service. For each query, the view includes the amount of query execution time that is eligible for
the query acceleration service.

#### Examples

> **Note:**
>
> These examples assume the ACCOUNTADMIN role (or a [role granted IMPORTED PRIVILEGES](../sql-reference/account-usage.md) on the
> shared SNOWFLAKE database) is in use. If it is not in use, execute the following command before running the queries in the examples:
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> ```

Identify the queries in the past week that might benefit the most from the service by the longest amount of query execution time that is
eligible for acceleration:

```sqlexample
SELECT query_id, eligible_query_acceleration_time
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY eligible_query_acceleration_time DESC;
```

Identify the queries in the past week that might benefit the most from the service in a specific warehouse `mywh`:

```sqlexample
SELECT query_id, eligible_query_acceleration_time
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE warehouse_name = 'MYWH'
  AND start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY eligible_query_acceleration_time DESC;
```

Identify the warehouses with the most queries, in the past week, eligible for the query acceleration service:

```sqlexample
SELECT warehouse_name, COUNT(query_id) AS num_eligible_queries
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  GROUP BY warehouse_name
  ORDER BY num_eligible_queries DESC;
```

Identify the warehouses with the most eligible time for the query acceleration service in the past week:

```sqlexample
SELECT warehouse_name, SUM(eligible_query_acceleration_time) AS total_eligible_time
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  GROUP BY warehouse_name
  ORDER BY total_eligible_time DESC;
```

Identify the upper limit [scale factor](../sql-reference/sql/create-warehouse.md) in the past week for the query acceleration
service for warehouse `mywh`:

```sqlexample
SELECT MAX(upper_limit_scale_factor)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE warehouse_name = 'MYWH'
  AND start_time > DATEADD('day', -7, CURRENT_TIMESTAMP());
```

Identify the distribution of scale factors in the past week for the query acceleration service for warehouse `mywh`:

```sqlexample
SELECT upper_limit_scale_factor, COUNT(upper_limit_scale_factor)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE warehouse_name = 'MYWH'
  AND start_time > DATEADD('day', -7, CURRENT_TIMESTAMP())
  GROUP BY 1 ORDER BY 1;
```

## Adjusting the scale factor

The scale factor is a cost control mechanism that allows you to set an upper bound on the amount of compute resources a warehouse can
lease for query acceleration. This value is used as a multiplier based on warehouse size and cost.

For example, suppose that you set the scale factor to 5 for a medium warehouse. This means that:

* The warehouse can lease compute resources up to 5 times the size of a medium warehouse.
* Because a medium warehouse costs [4 credits per hour](cost-understanding-compute.md), leasing these resources can cost up
  to an additional 20 credits per hour (4 credits per warehouse x 5 times its size).

> **Tip:**
>
> The scale factor applies to the entire warehouse, whether it’s a single-cluster or multi-cluster warehouse.
> If you use QAS for a multi-cluster warehouse, consider increasing the scale factor.
> That way, all the warehouse clusters can take advantage of the QAS optimizations.

The cost is the same no matter how many queries are using the query acceleration service at the same time.
The query acceleration service is billed by the second, only when the service is in use. These credits are billed separately from warehouse
usage.

Not all queries require the full set of resources that are made available by the scale factor. The amount of resources requested for the service
depends on how much of the query is eligible for acceleration and how much data will be processed to answer it. Regardless of the scale
factor value or the amount of resources requested, the amount of available compute resources for query acceleration is bound by the
availability of resources in the service and the number of other concurrent requests. The query acceleration service only uses as many
resources as it needs and that are available at the time the query is executed.

If the scale factor is not explicitly set, the default value is `8`. Setting the scale factor to `0` eliminates the upper bound limit
and allows queries to lease as many resources as necessary and as available to service the query.

### Example

The following example modifies the warehouse named `my_wh` to enable the query acceleration service with a maximum
scale factor of 0.

> ```sqlexample
> ALTER WAREHOUSE my_wh SET
>   ENABLE_QUERY_ACCELERATION = true
>   QUERY_ACCELERATION_MAX_SCALE_FACTOR = 0;
> ```

## Monitoring query acceleration service usage

This section describes how to monitor the usage of the query acceleration service. By monitoring, you can understand the performance
impact, identify which queries benefit most from acceleration, and assess the overall effectiveness of the feature. Doing so can
help you manage your costs and optimize your workloads.

### Using the web interface to monitor query acceleration usage

Once you enable the query acceleration service, you can view the Profile Overview panel in the
[Query Profile tab](ui-snowsight-activity.md) to see the effects of the query acceleration results.

The following screenshot displays an example of the statistics displayed for the query overall. If multiple operations in a query were
accelerated, the results are aggregated in this view so you can see the total amount of work done by the query acceleration service.

The Query Acceleration section of the Profile Overview panel includes the following statistics:

* *Partitions scanned by service* — number of files offloaded for scanning to the query acceleration service.
* *Scans selected for acceleration* — number of table scans being accelerated.

In the operator details (see [Statistics](ui-snowsight-activity.md)), click on the operator to see detailed information.
The following screenshot displays an example of the statistics displayed for a TableScan operation:

The Query Acceleration section of the TableScan details panel includes the following statistics:

* *Partitions scanned by service* — number of files offloaded for scanning to the query acceleration service.

### Using the Account Usage QUERY_HISTORY view to monitor query acceleration usage

To see the effects of query acceleration on a query, you can use the following columns in the
[QUERY_HISTORY view](../sql-reference/account-usage/query_history.md).

* QUERY_ACCELERATION_BYTES_SCANNED
* QUERY_ACCELERATION_PARTITIONS_SCANNED
* QUERY_ACCELERATION_UPPER_LIMIT_SCALE_FACTOR

You can use these columns to identify the queries that benefited from the query acceleration service. For each query, you can also
determine the total number of partitions and bytes scanned by the query acceleration service.

For descriptions of each of these columns, see [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md).

> **Note:**
>
> For a given query, the sum of the QUERY_ACCELERATION_BYTES_SCANNED and BYTES_SCANNED columns might be greater when the query
> acceleration service is used than when the service is not used. The same is true for the sum of the columns
> QUERY_ACCELERATION_PARTITIONS_SCANNED and PARTITIONS_SCANNED.
>
> The increase in the number of bytes and partitions is due to the intermediary results that are generated by the service to
> facilitate query acceleration.

For example, to find the queries with the most bytes scanned by the query acceleration service in the past 24 hours:

```sqlexample
SELECT query_id,
       query_text,
       warehouse_name,
       start_time,
       end_time,
       query_acceleration_bytes_scanned,
       query_acceleration_partitions_scanned,
       query_acceleration_upper_limit_scale_factor
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_acceleration_partitions_scanned > 0
  AND start_time >= DATEADD(hour, -24, CURRENT_TIMESTAMP())
  ORDER BY query_acceleration_bytes_scanned DESC;
```

To find the queries with the largest number of partitions scanned by the query acceleration service in the past 24 hours:

```sqlexample
SELECT query_id,
       query_text,
       warehouse_name,
       start_time,
       end_time,
       query_acceleration_bytes_scanned,
       query_acceleration_partitions_scanned,
       query_acceleration_upper_limit_scale_factor
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_acceleration_partitions_scanned > 0
  AND start_time >= DATEADD(hour, -24, CURRENT_TIMESTAMP())
  ORDER BY query_acceleration_partitions_scanned DESC;
```

## Query acceleration service cost

Query Acceleration consumes credits as it uses [serverless compute resources](cost-understanding-compute.md) to execute portions of
eligible queries.

Query Acceleration is billed like other serverless features in Snowflake in that you pay by the second for the compute resources used. To
learn how many credits per compute-hour are consumed by the Query Acceleration Service, refer to the “Serverless
Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Viewing billing information in Snowsight

If you have Query Acceleration enabled for your account, use the Cost Management page in Snowsight to view billing
information for the Query Acceleration Service.

To see Query Acceleration Service spending, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select the Consumption tab.
4. Select Query Acceleration from the Service Type drop-down.

   Snowsight displays the Query Acceleration Service spending for your account.

### Viewing billing using the Account Usage QUERY_ACCELERATION_HISTORY view

You can view billing data in the Account Usage [QUERY_ACCELERATION_HISTORY view](../sql-reference/account-usage/query_acceleration_history.md).

#### Example

This query returns the total number of credits used by each warehouse in your account for the query acceleration service
(month-to-date):

```sqlexample
SELECT warehouse_name,
       SUM(credits_used) AS total_credits_used
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
  WHERE start_time >= DATE_TRUNC(month, CURRENT_DATE)
  GROUP BY 1
  ORDER BY 2 DESC;
```

### Viewing billing using the Organization Usage QUERY_ACCELERATION_HISTORY view

You can view billing data for the query acceleration service for all accounts in your organization in the Organization Usage
[QUERY_ACCELERATION_HISTORY view](../sql-reference/organization-usage/query_acceleration_history.md).

#### Example

This query returns the total number of credits used by each warehouse in each account for the query acceleration service (month-to-date):

```sqlexample
SELECT account_name,
       warehouse_name,
       SUM(credits_used) AS total_credits_used
  FROM SNOWFLAKE.ORGANIZATION_USAGE.QUERY_ACCELERATION_HISTORY
  WHERE usage_date >= DATE_TRUNC(month, CURRENT_DATE)
  GROUP BY 1, 2
  ORDER BY 3 DESC;
```

### Viewing billing using the QUERY_ACCELERATION_HISTORY function

You can also view billing data using the Information Schema [QUERY_ACCELERATION_HISTORY](../sql-reference/functions/query_acceleration_history.md) function.

#### Example

The following example uses the QUERY_ACCELERATION_HISTORY function to return information about the queries accelerated by this service
within the past 12 hours:

> ```sqlexample
> SELECT start_time,
>        end_time,
>        credits_used,
>        warehouse_name,
>        num_files_scanned,
>        num_bytes_scanned
>   FROM TABLE(INFORMATION_SCHEMA.QUERY_ACCELERATION_HISTORY(
>     date_range_start=>DATEADD(H, -12, CURRENT_TIMESTAMP)));
> ```

## Evaluating cost and performance

This section includes example queries that might help you evaluate query performance and cost before and after enabling the query
acceleration service.

### Viewing warehouse and query acceleration service costs

The following query computes the costs of the warehouse and the query acceleration service for a specific warehouse. You can execute
this query after enabling the query acceleration service for a warehouse to compare costs before and after enabling query acceleration.
The date range for the query begins 8 weeks prior to the first credit usage for the query acceleration service to 8 weeks after the last
incurred cost for query acceleration service (or up to the current date).

> **Note:**
>
> * This query is most useful for evaluating the cost of the service if the warehouse properties and workload remain the same
>   before and after enabling the query acceleration service.
> * This query returns results only if there has been credit usage for accelerated queries in the warehouse.

This example query returns the warehouse and query acceleration service costs for `my_warehouse`:

```sqlexample
WITH credits AS (
  SELECT 'QC' AS credit_type,
         TO_DATE(end_time) AS credit_date,
         SUM(credits_used) AS num_credits
    FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
    WHERE warehouse_name = 'my_warehouse'
    AND credit_date BETWEEN
           DATEADD(WEEK, -8, (
             SELECT TO_DATE(MIN(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
               WHERE warehouse_name = 'my_warehouse'
           ))
           AND
           DATEADD(WEEK, +8, (
             SELECT TO_DATE(MAX(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
               WHERE warehouse_name = 'my_warehouse'
           ))
  GROUP BY credit_date
  UNION ALL
  SELECT 'WC' AS credit_type,
         TO_DATE(end_time) AS credit_date,
         SUM(credits_used) AS num_credits
    FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
    WHERE warehouse_name = 'my_warehouse'
    AND credit_date BETWEEN
           DATEADD(WEEK, -8, (
             SELECT TO_DATE(MIN(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
               WHERE warehouse_name = 'my_warehouse'
           ))
           AND
           DATEADD(WEEK, +8, (
             SELECT TO_DATE(MAX(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
               WHERE warehouse_name = 'my_warehouse'
           ))
  GROUP BY credit_date
)
SELECT credit_date,
       SUM(IFF(credit_type = 'QC', num_credits, 0)) AS qas_credits,
       SUM(IFF(credit_type = 'WC', num_credits, 0)) AS compute_credits,
       compute_credits + qas_credits AS total_credits,
       AVG(total_credits) OVER (
         PARTITION BY NULL ORDER BY credit_date ROWS BETWEEN 6 PRECEDING AND CURRENT ROW)
         AS avg_total_credits_7days
  FROM credits
  GROUP BY credit_date
  ORDER BY credit_date;
```

### Viewing query performance

This query returns the average execution time for query acceleration eligible queries for a given warehouse. The date range for the query
begins 8 weeks prior to the first credit usage for the query acceleration service to 8 weeks after the last incurred cost for query
acceleration service (or up to the current date). The results might help you evaluate how the average query performance changed after
enabling the query acceleration service.

> **Note:**
>
> * This query is most useful for evaluating query performance if the warehouse workload remains the same before and after enabling
>   the query acceleration service.
> * If the warehouse workload remains stable, the value in the `num_execs` column should remain consistent.
> * If the value in the `num_execs` column of the query results dramatically increases or decreases, the results of this query
>   will likely not be useful for query performance evaluation.

This example query returns the query execution time by day and computes the 7 day average for the week prior for queries that are
eligible for acceleration in the warehouse `my_warehouse`:

```sqlexample
WITH qas_eligible_or_accelerated AS (
  SELECT TO_DATE(qh.end_time) AS exec_date,
        COUNT(*) AS num_execs,
        SUM(qh.execution_time) AS exec_time,
        MAX(IFF(qh.query_acceleration_bytes_scanned > 0, 1, NULL)) AS qas_accel_flag
    FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY AS qh
    WHERE qh.warehouse_name = 'my_warehouse'
    AND TO_DATE(qh.end_time) BETWEEN
           DATEADD(WEEK, -8, (
             SELECT TO_DATE(MIN(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
              WHERE warehouse_name = 'my_warehouse'
           ))
           AND
           DATEADD(WEEK, +8, (
             SELECT TO_DATE(MAX(end_time))
               FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
              WHERE warehouse_name = 'my_warehouse'
           ))
    AND (qh.query_acceleration_bytes_scanned > 0
          OR
          EXISTS (
            SELECT 1
              FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE AS qae
               WHERE qae.query_id = qh.query_id
               AND qae.warehouse_name = qh.warehouse_name
          )
         )
    GROUP BY exec_date
)
SELECT exec_date,
       SUM(exec_time) OVER (
         PARTITION BY NULL ORDER BY exec_date ROWS BETWEEN 6 PRECEDING AND CURRENT ROW
       ) /
       NULLIFZERO(SUM(num_execs) OVER (
         PARTITION BY NULL ORDER BY exec_date ROWS BETWEEN 6 PRECEDING AND CURRENT ROW)
       ) AS avg_exec_time_7days,
      exec_time / NULLIFZERO(num_execs) AS avg_exec_time,
      qas_accel_flag,
      num_execs,
      exec_time
  FROM qas_eligible_or_accelerated;
```

The output from the statement includes the following columns:

| Column | Description |
| --- | --- |
| EXEC_DATE | Query execution date. |
| AVG_EXEC_TIME_7DAYS | The average execution time for the prior 7 days inclusive of EXEC_DATE. |
| AVG_EXEC_TIME | The average query execution time. |
| QAS_ACCEL_FLAG | 1 if any queries were accelerated; NULL if no queries were accelerated. |
| NUM_EXECS | Number of queries accelerated. |
| EXEC_TIME | Total execution time of all query acceleration eligible queries. |

> **Tip:**
>
> When the query acceleration service (QAS) is enabled, Snowflake writes a small amount of data to remote storage
> for each eligible query, even if QAS isn’t used for that query. Therefore, don’t be concerned by a nonzero
> value for `bytes_spilled_to_remote_storage` in the QUERY_HISTORY view when QAS is enabled.

## Compatibility with search optimization

Query acceleration and [search optimization](search-optimization-service.md) can work together to
optimize query performance. First, search optimization can prune the [micro-partitions](tables-clustering-micropartitions.md) not needed for a query. Then, for eligible queries, query acceleration can offload portions of the rest of the work to
shared compute resources provided by the service.

Performance of queries accelerated by both services varies depending on workload and available resources.

---
title: Using the Query Hash to Identify Patterns and Trends in Queries
source: https://docs.snowflake.com/en/user-guide/query-hash.md
section: User Guide
---

# Using the Query Hash to Identify Patterns and Trends in Queries

To identify, group, and analyze similar queries in the query history, you can use a hash of the query text. For example, you can:

* Group queries by the query hash to identify patterns in expensive queries.
* Determine the effects of performance improvements (for example, changes to clustering keys) on repeated queries.

In the following views and table functions, you can use the `query_hash` and `query_parameterized_hash` columns to get the
hash of the query text:

* ACCOUNT_USAGE views (1 year retention)

  + [QUERY_HISTORY view](../sql-reference/account-usage/query_history.md)
  + [QUERY_ACCELERATION_ELIGIBLE view](../sql-reference/account-usage/query_acceleration_eligible.md)
  + [TASK_HISTORY view](../sql-reference/account-usage/task_history.md)
* INFORMATION_SCHEMA table functions (7 days retention)

  + [QUERY_HISTORY](../sql-reference/functions/query_history.md) table function
  + [TASK_HISTORY](../sql-reference/functions/task_history.md) table function

You can use this hash to analyze repeated queries.

## Using the Hash of the Query (`query_hash`)

The `query_hash` column contains a hash value that is computed, based on the canonicalized text of the SQL statement. Repeated
queries that have exactly the same query text have the same `query_hash` values.

Repeated queries also have the same `query_hash` if their query text differs only in:

* Case insensitive identifier, session variable, and stage name

  Note that this does not include identifiers specified using IDENTIFIER() with bind variables. Bind variables with different
  values produce different query hashes.
* White space
* Comments

If any other part of the query text of two queries differ, those queries have different `query_hash` values.

For example, the following queries have the same `query_hash` value because they have exactly the same query text.

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'TIM'
```

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'TIM'
```

You can use the `query_hash` value to find patterns in query performance that might not be obvious otherwise. For example,
although a query might not be excessively expensive during any single execution, a frequently repeated query could lead to high
costs, based on the number of times it runs. You can use the `query_hash` value to identify the queries to focus on optimizing
first.

For example, the following query uses the `query_hash` value to identify the query IDs for the 100 longest-running queries:

```sqlexample
SELECT
    query_hash,
    COUNT(*),
    SUM(total_elapsed_time),
    ANY_VALUE(query_id)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE warehouse_name = 'MY_WAREHOUSE'
    AND DATE_TRUNC('day', start_time) >= CURRENT_DATE() - 7
  GROUP BY query_hash
  ORDER BY SUM(total_elapsed_time) DESC
  LIMIT 100;
```

## Using the Hash of the Parameterized Query (`query_parameterized_hash`)

`query_parameterized_hash` contains a hash value that is computed based on the parameterized query, which means the version of
the query after literals are parameterized. These literals must be used in the query predicate and must be used with one of the
following [comparison operators](../sql-reference/operators-comparison.md):

* `=` (equal to)
* `!=` (not equal to)
* `>=` (greater than or equal to)
* `<=` (less than or equal to)

Repeated queries (including those with different parameter values) have the same `query_parameterized_hash` value.

Repeated queries also have the same `query_parameterized_hash` if their query text differs only in:

* Case insensitive identifier, session variable, and stage name

  Note that this does not include identifiers specified using IDENTIFIER() with bind variables. Bind variables with different
  values produce different query hashes.
* White space
* Comments

Queries that have the same `query_hash` value also have the same `query_parameterized_hash` value, but not vice versa.

For example, the following queries have the same `query_parameterized_hash` value because the literal values are the
only difference between the queries:

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'TIM'
```

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'AIHUA'
```

As is the case with the `query_hash` value, you can use the `query_parameterized_hash` value to find patterns in query
performance that might not be obvious otherwise.

The following statement computes the average `total_elapsed_time` each day for all queries with a specific
`query_parameterized_hash` value (`cbd58379a88c37ed6cc0ecfebb053b03`):

```sqlexample
SELECT
    DATE_TRUNC('day', start_time),
    SUM(total_elapsed_time),
    ANY_VALUE(query_id)
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_parameterized_hash = 'cbd58379a88c37ed6cc0ecfebb053b03'
    AND DATE_TRUNC('day', start_time) >= CURRENT_DATE() - 30
  GROUP BY DATE_TRUNC('day', start_time);
```

## Checking the Version That Was Used to Generate the Hash

Over time, the logic used by Snowflake to generate the query hash can change. Changes to this logic can result in different
hashes produced for the same query. For example, for a given query, the hash generated by version 1 of the logic might differ
from the hash generated by version 2 of the logic.

The views and table function output that include the `query_hash` and `query_parameterized_hash` columns also include the
following columns that specify the version of the logic used to produce the hashes:

* `query_hash_version`
* `query_parameterized_hash_version`

The version number in these columns is a NUMBER (for example, `1` for the first version of the logic, `2` for the second
version of the logic, etc.).

If these columns contain different version numbers for different periods of time, you can use these version columns to identify
the different hashes for the same query. For example:

```sqlexample
...
WHERE (query_hash = 'hash_from_v1' AND query_hash_version = 1)
  OR (query_hash = 'hash_from_v2' AND query_hash_version = 2)
```

---
title: Using the Snowflake Connector for Kafka with Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/kafka-connector-iceberg.md
section: User Guide
---

# Using the Snowflake Connector for Kafka with Apache Iceberg™ tables

Beginning with version 3.0.0, the Snowflake Connector for Kafka can ingest data into a
Snowflake-managed [Apache Iceberg™ table](tables-iceberg.md).

## Requirements and limitations

Before you configure the Kafka connector for Iceberg table ingestion, note the following requirements and limitations:

* Iceberg table ingestion requires version 3.0.0 or later of the Kafka connector.
* Iceberg table ingestion is supported by the Kafka connector with Snowpipe Streaming. It’s not supported by the Kafka connector with Snowpipe.
* Iceberg table ingestion is not supported when `snowflake.streaming.enable.single.buffer` is set to `false`.
* You must create an Iceberg table before running the connector. For more information, see Configuration and setup in this topic.

### Schema evolution limitations

Schema evolution for Iceberg is fully supported for schematized data formats like AVRO or Protobuf.

For plain JSON without a schema, the connector considers the following message types invalid and sends them to
dead-letter queues (DLQ):

* Messages with a new column if the corresponding value is `null` or `[]`
* Messages with a new field in a structured object if the corresponding value is `null` or `[]`

To manually change the table schema so that the connector can ingest these message types, use an ALTER TABLE statement.

## Configuration and setup

To configure the Kafka connector for Iceberg table ingestion, you follow the
regular [setup steps for a Snowpipe Streaming-based connector](snowpipe-streaming/snowpipe-streaming-classic-kafka.md)
with a few differences noted in the following sections.

### Grant usage on an external volume

You must grant the USAGE privilege on the external volume associated with your Iceberg table to your role for the Kafka connector.

For example, if your Iceberg table uses the `kafka_external_volume` external volume
and the connector uses the role `kafka_connector_role`, run the following statement:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT USAGE ON EXTERNAL VOLUME kafka_external_volume TO ROLE kafka_connector_role;
```

### Create an Iceberg table for ingestion

Before you run the connector, you must create an Iceberg table.
The initial table schema depends on your connector `snowflake.enable.schematization` settings.

If you enable schematization, you can create a table with a column named `record_metadata`:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
    record_metadata OBJECT()
  )
  EXTERNAL_VOLUME = 'my_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_location/my_iceberg_table';
```

The connector automatically creates the columns for message fields and alters the `record_metadata` column schema.

If you don’t enable schematization, you can create a table with a column named `record_content` of a type that matches the actual Kafka message content.
The connector automatically creates the `record_metadata` column.

When you create an Iceberg table, you can use Iceberg data types or [compatible Snowflake types](tables-iceberg-data-types.md).
The semi-structured VARIANT type isn’t supported. Instead, use a [structured OBJECT or MAP](../sql-reference/data-types-structured.md).

For example, consider the following message:

```sqljson
{
    "id": 1,
    "name": "Steve",
    "body_temperature": 36.6,
    "approved_coffee_types": ["Espresso", "Doppio", "Ristretto", "Lungo"],
    "animals_possessed":
    {
        "dogs": true,
        "cats": false
    },
    "date_added": "2024-10-15"
}
```

To create an Iceberg table for the example message, use the following statement:

> ```sqlexample
> CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
>     record_content OBJECT(
>         id INT,
>         body_temperature FLOAT,
>         name STRING,
>         approved_coffee_types ARRAY(STRING),
>         animals_possessed OBJECT(dogs BOOLEAN, cats BOOLEAN),
>         date_added DATE
>     )
>   )
>   EXTERNAL_VOLUME = 'my_volume'
>   CATALOG = 'SNOWFLAKE'
>   BASE_LOCATION = 'my_location/my_iceberg_table';
> ```

> **Note:**
>
> Field names inside nested structures such as `dogs` or `cats` are case sensitive.

### Configuration properties

`snowflake.streaming.iceberg.enabled`
:   Specifies whether the connector ingests data into an Iceberg table. The connector fails if this property doesn’t match the actual table type.

    Values:

    * `true`
    * `false`

    Default:
    :   `false`

---
title: Using the Spark Connector
source: https://docs.snowflake.com/en/user-guide/spark-connector-use.md
section: User Guide
---

# Using the Spark Connector

The connector adheres to the standard Spark API, but with the addition of Snowflake-specific options, which are described in this topic.

In this topic, the term COPY refers to both:

* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) (used to transfer data from an internal or external stage into a table).
* [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) (used to transfer data from a table into an internal or external stage).

## Verifying the Network Connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

## Pushdown

The Spark Connector applies predicate and query pushdown by capturing and analyzing the Spark logical plans for SQL operations. When the data source is Snowflake, the operations are translated into a SQL query and then executed in Snowflake to improve performance.

However, because this translation requires almost a one-to-one translation of Spark SQL operators to Snowflake expressions, not all of Spark SQL operators can be pushed down. When pushdown fails, the connector falls back to a less-optimized execution plan. The unsupported operations are instead performed in Spark.

> **Note:**
>
> If you need pushdown for all operations, consider writing your code to use [Snowpark API](../developer-guide/snowpark/index.md) instead.

Below is a list of supported operations for pushdown (all functions below use their Spark names). If a function is not in this list, a Spark plan that utilizes it might be executed on Spark rather than pushed down into Snowflake.

* Aggregation Functions

  + Average
  + Corr
  + CovPopulation
  + CovSample
  + Count
  + Max
  + Min
  + StddevPop
  + StddevSamp
  + Sum
  + VariancePop
  + VarianceSamp
* Boolean Operators

  + And
  + Between
  + Contains
  + EndsWith
  + EqualTo
  + GreaterThan
  + GreaterThanOrEqual
  + In
  + IsNull
  + IsNotNull
  + LessThan
  + LessThanOrEqual
  + Not
  + Or
  + StartsWith
* Date, Time, and Timestamp Functions

  + DateAdd
  + DateSub
  + Month
  + Quarter
  + TruncDate
  + TruncTimestamp
  + Year
* Mathematical Functions

  + Arithmetic operators ‘+’ (addition), ‘-’ (subtraction), ‘\*’ (multiplication), ‘/’ (division), and ‘-’ (unary negation).
  + Abs
  + Acos
  + Asin
  + Atan
  + Ceil
  + CheckOverflow
  + Cos
  + Cosh
  + Exp
  + Floor
  + Greatest
  + Least
  + Log
  + Pi
  + Pow
  + PromotePrecision
  + Rand
  + Round
  + Sin
  + Sinh
  + Sqrt
  + Tan
  + Tanh
* Miscellaneous Operators

  + Alias (AS expressions)
  + BitwiseAnd
  + BitwiseNot
  + BitwiseOr
  + BitwiseXor
  + CaseWhen
  + Cast(child, t, _)
  + Coalesce
  + If
  + MakeDecimal
  + ScalarSubquery
  + ShiftLeft
  + ShiftRight
  + SortOrder
  + UnscaledValue
* Relational Operators

  + Aggregate functions and group-by clauses
  + Distinct
  + Filters
  + In
  + InSet
  + Joins
  + Limits
  + Projections
  + Sorts (ORDER BY)
  + Union and Union All
  + Window functions and windowing-clauses
* String Functions

  + Ascii
  + Concat(children)
  + Length
  + Like
  + Lower
  + StringLPad
  + StringRPad
  + StringTranslate
  + StringTrim
  + StringTrimLeft
  + StringTrimRight
  + Substring
  + Upper
* Window Functions (note: these do not work with Spark 2.2)

  + DenseRank
  + Rank
  + RowNumber

## Using the Connector in Scala

### Specifying the Data Source Class Name

To use Snowflake as a data source in Spark, use the `.format` option to provide the Snowflake connector class name that defines the data source.

> `net.snowflake.spark.snowflake`

To ensure a compile-time check of the class name, Snowflake highly recommends defining a variable for the class name. For example:

```scala
val SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"
```

Also, for convenience, the `Utils` class provides the variable, which can be imported as follows:

```scala
import net.snowflake.spark.snowflake.Utils.SNOWFLAKE_SOURCE_NAME
```

> **Note:**
>
> All examples in this topic use `SNOWFLAKE_SOURCE_NAME` as the class definition.

### Enabling/Disabling Pushdown in a Session

Version 2.1.0 (and higher) of the connector supports query pushdown, which can significantly improve performance by pushing query processing to Snowflake when Snowflake is the Spark data source.

By default, pushdown is enabled.

To disable pushdown within a Spark session for a given DataFrame:

1. After instantiating a `SparkSession` object, call the `SnowflakeConnectorUtils.disablePushdownSession` static
   method, passing in the `SparkSession` object. For example:

   > ```scala
   > SnowflakeConnectorUtils.disablePushdownSession(spark)
   > ```

   Where `spark` is your `SparkSession` object.
2. Create a DataFrame with the autopushdown option set to `off`. For example:

   > ```scala
   > val df = sparkSession.read.format(SNOWFLAKE_SOURCE_NAME)
   >   .options(sfOptions)
   >   .option("query", query)
   >   .option("autopushdown", "off")
   >   .load()
   > ```

   Note that you can also set the `autopushdown` option in a `Map` that you pass to the `options` method (e.g.
   in `sfOptions` in the example above).

To enable pushdown again after disabling it, call the `SnowflakeConnectorUtils.enablePushdownSession` static method
(passing in the `SparkSession` object), and create a DataFrame with `autopushdown` enabled.

### Moving Data from Snowflake to Spark

> **Note:**
>
> When using DataFrames, the Snowflake connector supports SELECT queries only.

To read data from Snowflake into a Spark DataFrame:

1. Use the `read()` method of the `SqlContext` object to construct a `DataFrameReader`.
2. Specify `SNOWFLAKE_SOURCE_NAME` using the `format()` method. For the definition, see Specifying the Data Source Class Name (in this topic).
3. Specify the connector options using either the `option()` or `options()` method. For more information, see Setting Configuration Options for the Connector (in this topic).
4. Specify one of the following options for the table data to be read:

   * `dbtable`: The name of the table to be read. All columns and records are retrieved (i.e. it is equivalent to `SELECT * FROM db_table`).
   * `query`: The exact query (SELECT statement) to run.

#### Usage Notes

* Currently, the connector does not support other types of queries (e.g. SHOW or DESC, or DML statements) when using DataFrames.
* There is an upper limit to the size of an individual row. For more details, see [Limits on Query Text Size](query-size-limits.md).

#### Performance Considerations

When transferring data between Snowflake and Spark, use the following methods to analyze/improve performance:

* Use the `net.snowflake.spark.snowflake.Utils.getLastSelect()` method to see the actual query issued when moving data from Snowflake to Spark.
* If you use the `filter` or `where` functionality of the Spark DataFrame, check that the respective filters are present in the issued SQL query. The Snowflake connector tries to translate all the
  filters requested by Spark to SQL.

  However, there are forms of filters that the Spark infrastructure today does not pass to the Snowflake connector. As a result, in some situations, a large number of unnecessary records are requested from
  Snowflake.
* If you need only a subset of columns, make sure the reflect the subset in the SQL query.
* In general, if the SQL query issued does not match what you expect based on the DataFrame operations, use the `query` option to provide the exact SQL syntax you want.

#### Examples

Read an entire table:

> ```scala
> val df: DataFrame = sqlContext.read
>     .format(SNOWFLAKE_SOURCE_NAME)
>     .options(sfOptions)
>     .option("dbtable", "t1")
>     .load()
> ```

Read the results of a query:

> ```scala
> val df: DataFrame = sqlContext.read
>     .format(SNOWFLAKE_SOURCE_NAME)
>     .options(sfOptions)
>     .option("query", "SELECT DEPT, SUM(SALARY) AS SUM_SALARY FROM T1")
>     .load()
> ```

### Moving Data from Spark to Snowflake

The steps for saving the contents of a DataFrame to a Snowflake table are similar to writing from Snowflake to Spark:

1. Use the `write()` method of the `DataFrame` to construct a `DataFrameWriter`.
2. Specify `SNOWFLAKE_SOURCE_NAME` using the `format()` method. For the definition, see Specifying the Data Source Class Name (in this topic).
3. Specify the connector options using either the `option()` or `options()` method. For more information, see Setting Configuration Options for the Connector (in this topic).
4. Use the `dbtable` option to specify the table to which data is written.
5. Use the `mode()` method to specify the save mode for the content.

   For more information, see [SaveMode](https://spark.apache.org/docs/1.6.0/api/java/org/apache/spark/sql/SaveMode.html) (Spark documentation).

#### Examples

> ```scala
> df.write
>     .format(SNOWFLAKE_SOURCE_NAME)
>     .options(sfOptions)
>     .option("dbtable", "t2")
>     .mode(SaveMode.Overwrite)
>     .save()
> ```

### Exporting JSON from Spark to Snowflake

Spark DataFrames can contain JSON objects, serialized as strings. The following code provides an example of converting a regular DataFrame to a DataFrame containing JSON data:

> ```scala
> val rdd = myDataFrame.toJSON
> val schema = new StructType(Array(StructField("JSON", StringType)))
> val jsonDataFrame = sqlContext.createDataFrame(
>             rdd.map(s => Row(s)), schema)
> ```

Note that the resulting `jsonDataFrame` contains a single column of type `StringType`. As a result, when this DataFrame is exported to Snowflake with the common `SaveMode.Overwrite` mode, a new table in Snowflake is created with a single column of type `VARCHAR`.

To load `jsonDataFrame` into a `VARIANT` column:

1. Create a Snowflake table (connecting to Snowflake in Java using the Snowflake JDBC Driver). For explanations of the
   connection parameters used in the example, see [JDBC Driver connection parameter reference](../developer-guide/jdbc/jdbc-parameters.md).

   > ```java
   > import java.sql.Connection;
   > import java.sql.DriverManager;
   > import java.sql.ResultSet;
   > import java.sql.ResultSetMetaData;
   > import java.sql.SQLException;
   > import java.sql.Statement;
   > import java.util.Properties;
   > public class SnowflakeJDBCExample {
   >   public static void main(String[] args) throws Exception {
   >     String jdbcUrl = "jdbc:snowflake://myorganization-myaccount.snowflakecomputing.com/";
   >
   >     Properties properties = new Properties();
   >     properties.put("user", "peter");
   >     properties.put("password", "test");
   >     properties.put("account", "myorganization-myaccount");
   >     properties.put("warehouse", "mywh");
   >     properties.put("db", "mydb");
   >     properties.put("schema", "public");
   >
   >     // get connection
   >     System.out.println("Create JDBC connection");
   >     Connection connection = DriverManager.getConnection(jdbcUrl, properties);
   >     System.out.println("Done creating JDBC connection\n");
   >     // create statement
   >     System.out.println("Create JDBC statement");
   >     Statement statement = connection.createStatement();
   >     System.out.println("Done creating JDBC statement\n");
   >     // create a table
   >     System.out.println("Create my_variant_table table");
   >     statement.executeUpdate("create or replace table my_variant_table(json VARIANT)");
   >     statement.close();
   >     System.out.println("Done creating demo table\n");
   >
   >     connection.close();
   >     System.out.println("Close connection\n");
   >   }
   > }
   > ```
2. Instead of using `SaveMode.Overwrite`, use `SaveMode.Append`, to reuse the existing table. When the string value representing JSON is loaded into Snowflake, because the target column is of type VARIANT, it is parsed as JSON. For example:

   > ```scala
   > df.write
   >     .format(SNOWFLAKE_SOURCE_NAME)
   >     .options(sfOptions)
   >     .option("dbtable", "my_variant_table")
   >     .mode(SaveMode.Append)
   >     .save()
   > ```

### Executing DDL/DML SQL Statements

Use the `runQuery()` method of the `Utils` object to execute DDL/DML SQL statements, in addition to queries, for example:

```scala
var sfOptions = Map(
    "sfURL" -> "<account_identifier>.snowflakecomputing.com",
    "sfUser" -> "<user_name>",
    "sfPassword" -> "<password>",
    "sfDatabase" -> "<database>",
    "sfSchema" -> "<schema>",
    "sfWarehouse" -> "<warehouse>"
    )
Utils.runQuery(sfOptions, "CREATE TABLE MY_TABLE(A INTEGER)")
```

where `sfOptions` is the parameters map used to read/write DataFrames.

The `runQuery` method returns only TRUE or FALSE. It is intended for statements that do not return a result set,
for example DDL statements like `CREATE TABLE` and DML statements like `INSERT`, `UPDATE`, and `DELETE`.
It is not useful for statements that return a result set, such as `SELECT` or `SHOW`.

### Working with Timestamps and Time Zones

Spark provides only one type of timestamp, equivalent to the Scala/Java Timestamp type. It is almost identical in behavior to the TIMESTAMP_LTZ (local time zone) data type in Snowflake. As such, when transferring data between Spark and Snowflake, Snowflake recommends using the following approaches to preserve time correctly, relative to time zones:

* Use only the TIMESTAMP_LTZ data type in Snowflake.

  > **Note:**
  >
  > The default timestamp data type mapping is TIMESTAMP_NTZ (no time zone), so you must explicitly set the [TIMESTAMP_TYPE_MAPPING](../sql-reference/parameters.md) parameter to use TIMESTAMP_LTZ.
* Set the Spark time zone to `UTC` and use this time zone in Snowflake (i.e. don’t set the `sfTimezone` option for the connector, and don’t explicitly set a time zone in Snowflake). In this scenario, TIMESTAMP_LTZ and TIMESTAMP_NTZ are effectively equivalent.

  To set the time zone, add the following line to your Spark code:

  ```bash
  java.util.TimeZone.setDefault(java.util.TimeZone.getTimeZone("UTC"))
  ```

If you don’t implement either of these approaches, undesired time modifications might occur. For example, consider the following scenario:

* The time zone in Spark is set to `America/New_York`.
* The time zone in Snowflake is set to `Europe/Warsaw`, which can happen by either:

  + Setting `sfTimezone` to `Europe/Warsaw` for the connector.
  + Setting `sfTimezone` to `snowflake` for the connector and setting the [TIMEZONE](../sql-reference/parameters.md) session
    parameter in Snowflake to `Europe/Warsaw`.
* Both TIMESTAMP_NTZ and TIMESTAMP_LTZ are in use in Snowflake.

In this scenario:

1. If a value representing `12:00:00` in a TIMESTAMP_NTZ column in Snowflake is sent to Spark, this value doesn’t carry any time zone information. Spark treats the value as `12:00:00` in New York.
2. If Spark sends this value `12:00:00` (in New York) back to Snowflake to be loaded into a TIMESTAMP_LTZ column, it is automatically converted and loaded as `18:00:00` (for the Warsaw time zone).
3. If this value is then converted to TIMESTAMP_NTZ in Snowflake, the user sees `18:00:00`, which is different from the original value, `12:00:00`.

To summarize, Snowflake recommends strictly following at least one of these rules:

* Use the same time zone, ideally `UTC`, for both Spark and Snowflake.
* Use only the TIMESTAMP_LTZ data type for transferring data between Spark and Snowflake.

### Sample Scala Program

> **Important:**
>
> This sample program assumes you are using version 2.2.0 (or higher) of the connector, which uses a Snowflake internal stage for storing temporary data and, therefore, does not require an S3 location for
> storing temporary data. If you are using an earlier version, you must have an existing S3 location and include values for `tempdir`, `awsAccessKey`, `awsSecretKey` for `sfOptions`.
> For more details, see AWS Options for External Data Transfer (in this topic).

The following Scala program provides a full use case for the Snowflake Connector for Spark. Before using the code, replace the following strings with the appropriate values, as described in
Setting Configuration Options for the Connector (in this topic):

* `<account_identifier>`: Your [account identifier](gen-conn-config.md).
* `<user_name>` , `<password>`: Login credentials for the Snowflake user.
* `<database>` , `<schema>` , `<warehouse>`: Defaults for the Snowflake session.

The sample Scala program uses basic authentication (i.e. username and password). If you wish to authenticate with OAuth, see Using External OAuth (in this topic).

```scala
import org.apache.spark.sql._

//
// Configure your Snowflake environment
//
var sfOptions = Map(
    "sfURL" -> "<account_identifier>.snowflakecomputing.com",
    "sfUser" -> "<user_name>",
    "sfPassword" -> "<password>",
    "sfDatabase" -> "<database>",
    "sfSchema" -> "<schema>",
    "sfWarehouse" -> "<warehouse>"
)

//
// Create a DataFrame from a Snowflake table
//
val df: DataFrame = sqlContext.read
    .format(SNOWFLAKE_SOURCE_NAME)
    .options(sfOptions)
    .option("dbtable", "t1")
    .load()

//
// DataFrames can also be populated via a SQL query
//
val df: DataFrame = sqlContext.read
    .format(SNOWFLAKE_SOURCE_NAME)
    .options(sfOptions)
    .option("query", "select c1, count(*) from t1 group by c1")
    .load()

//
// Join, augment, aggregate, etc. the data in Spark and then use the
// Data Source API to write the data back to a table in Snowflake
//
df.write
    .format(SNOWFLAKE_SOURCE_NAME)
    .options(sfOptions)
    .option("dbtable", "t2")
    .mode(SaveMode.Overwrite)
    .save()
```

## Using the Connector with Python

Using the connector with Python is very similar to the Scala usage.

We recommend using the `bin/pyspark` script included in the Spark distribution.

### Configuring the `pyspark` Script

The `pyspark` script must be configured similarly to the `spark-shell` script, using the `--packages` or `--jars` options. For example:

> ```bash
> bin/pyspark --packages net.snowflake:snowflake-jdbc:3.13.22,net.snowflake:spark-snowflake_2.12:2.11.0-spark_3.3
> ```

Don’t forget to include the Snowflake Spark Connector and JDBC Connector .jar
files in your CLASSPATH environment variable.

For more information about configuring the `spark-shell` script, see [Step 4: Configure the Local Spark Cluster or Amazon EMR-hosted Spark Environment](spark-connector-install.md).

### Enabling/Disabling Pushdown in a Session

Version 2.1.0 (and higher) of the connector supports query pushdown, which can significantly improve performance by pushing query processing to Snowflake when Snowflake is the Spark data source.

By default, pushdown is enabled.

To disable pushdown within a Spark session for a given DataFrame:

1. After instantiating a `SparkSession` object, call the `SnowflakeConnectorUtils.disablePushdownSession` static
   method, passing in the `SparkSession` object. For example:

   > ```python
   > sc._jvm.net.snowflake.spark.snowflake.SnowflakeConnectorUtils.disablePushdownSession(sc._jvm.org.apache.spark.sql.SparkSession.builder().getOrCreate())
   > ```
2. Create a DataFrame with the autopushdown option set to `off`. For example:

   > ```python
   > df = spark.read.format(SNOWFLAKE_SOURCE_NAME) \
   >   .options(**sfOptions) \
   >   .option("query",  query) \
   >   .option("autopushdown", "off") \
   >   .load()
   > ```

   Note that you can also set the `autopushdown` option in a `Dictionary` that you pass to the `options` method
   (e.g. in `sfOptions` in the example above).

To enable pushdown again after disabling it, call the `SnowflakeConnectorUtils.enablePushdownSession` static method
(passing in the `SparkSession` object), and create a DataFrame with `autopushdown` enabled.

### Sample Python Script

> **Important:**
>
> This sample script assumes you are using version 2.2.0 (or higher) of the connector, which uses a Snowflake internal stage for storing temporary data and, therefore, does not require an S3 location for
> storing this data. If you are using an earlier version, you must have an existing S3 location and include values for `tempdir`, `awsAccessKey`, `awsSecretKey` for `sfOptions`. For more
> details, see AWS Options for External Data Transfer (in this topic).

Once the `pyspark` script has been configured, you can perform SQL queries and other operations. Here’s an example Python script that performs a simple SQL query. This script illustrates basic connector
usage. Most of the Scala examples in this document can be adapted with minimal effort/changes for use with Python.

The sample Python script uses basic authentication (i.e. username and password). If you wish to authenticate with OAuth, see Using External OAuth (in this topic).

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.master("local").appName("Simple App").getOrCreate()

# You might need to set these
sc._jsc.hadoopConfiguration().set("fs.s3n.awsAccessKeyId", "<AWS_KEY>")
sc._jsc.hadoopConfiguration().set("fs.s3n.awsSecretAccessKey", "<AWS_SECRET>")

# Set options below
sfOptions = {
  "sfURL" : "<account_identifier>.snowflakecomputing.com",
  "sfUser" : "<user_name>",
  "sfPassword" : "<password>",
  "sfDatabase" : "<database>",
  "sfSchema" : "<schema>",
  "sfWarehouse" : "<warehouse>",
  "sfRole" : "Accountadmin"
}

SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"

df = spark.read.format(SNOWFLAKE_SOURCE_NAME) \
  .options(**sfOptions) \
  .option("query",  "select 1 as my_num union all select 2 as my_num") \
  .load()

df.show()
```

> **Tip:**
>
> Note the usage of `sfOptions` and `SNOWFLAKE_SOURCE_NAME`. This simplifies the code and reduces the chance of errors.
>
> For details about the supported options for `sfOptions`, see Setting Configuration Options for the Connector (in this topic).

## Data Type Mappings

The Spark Connector supports converting between many common data types.

### From Spark SQL to Snowflake

> | Spark Data Type | Snowflake Data Type |
> | --- | --- |
> | `ArrayType` | VARIANT |
> | `BinaryType` | Not supported |
> | `BooleanType` | BOOLEAN |
> | `ByteType` | INTEGER. Snowflake does not support the BYTE type. |
> | `DateType` | DATE |
> | `DecimalType` | DECIMAL |
> | `DoubleType` | DOUBLE |
> | `FloatType` | FLOAT |
> | `IntegerType` | INTEGER |
> | `LongType` | INTEGER |
> | `MapType` | VARIANT |
> | `ShortType` | INTEGER |
> | `StringType` | If length is specified, VARCHAR(N); otherwise, VARCHAR |
> | `StructType` | VARIANT |
> | `TimestampType` | TIMESTAMP |

### From Snowflake to Spark SQL

> | Snowflake Data Type | Spark Data Type |
> | --- | --- |
> | ARRAY | `StringType` |
> | BIGINT | `DecimalType(38, 0)` |
> | BINARY | Not supported |
> | BLOB | Not supported |
> | BOOLEAN | `BooleanType` |
> | CHAR | `StringType` |
> | CLOB | `StringType` |
> | DATE | `DateType` |
> | DECIMAL | `DecimalType` |
> | DOUBLE | `DoubleType` |
> | FLOAT | `DoubleType` |
> | INTEGER | `DecimalType(38, 0)` |
> | OBJECT | `StringType` |
> | TIMESTAMP | `TimestampType` |
> | TIME | `StringType` (Spark Connector Version 2.4.14 or later) |
> | VARIANT | `StringType` |

## Calling the DataFrame.show Method

If you are calling the `DataFrame.show` method and passing in a number that is less than the number of rows in the
DataFrame, construct a DataFrame that just contains the rows to show in a sorted order.

To do this:

1. Call the `sort` method first to return a DataFrame that contains sorted rows.
2. Call the `limit` method on that DataFrame to return a DataFrame that just contains the rows that you want to show.
3. Call the `show` method on the returned DataFrame.

For example, if you want to show 5 rows and want the results sorted by the column `my_col`:

> ```scala
> val dfWithRowsToShow = originalDf.sort("my_col").limit(5)
> dfWithRowsToShow.show(5)
> ```

Otherwise, if you call `show` to display a subset of rows in the DataFrame, different executions of the code might result in
different rows being shown.

## Setting Configuration Options for the Connector

The following sections list the options that you set to configure the behavior of the connector:

* Required Connection Options
* Required Context Options
* Additional Context Options
* Proxy Options
* Additional Options

To set these options, call the `.option(<key>, <value>)` or `.options(<map>)` method of the
[Spark DataframeReader](https://spark.apache.org/docs/1.6.0/api/java/org/apache/spark/sql/DataFrameReader.html) class.

> **Tip:**
>
> To facilitate using the options, Snowflake recommends specifying the options in a single `Map` object and calling
> `.options(<map>)` to set the options.

### Required Connection Options

The following options are required for connecting to Snowflake:

`sfUrl`
:   Specifies the *hostname* for your account in the following format:

    `account_identifier.snowflakecomputing.com`

    `account_identifier` is your [account identifier](gen-conn-config.md).

`sfUser`
:   Login name for the Snowflake user.

You must also use one of the following options to authenticate:

* `sfPassword`

  Password for the Snowflake user.
* `pem_private_key`

  Private key (in PEM format) for key pair authentication. For instructions, see [Key-pair authentication and key-pair rotation](key-pair-auth.md).
* `sfAuthenticator`

  Specifies using [External OAuth](oauth-ext-overview.md) to authenticate to Snowflake. Set the value to `oauth`.

  Using External OAuth requires setting the `sfToken` parameter.

`sfToken`
:   (Required if using External OAuth) Set the value to your External OAuth access token.

    This connection parameter requires setting the `sfAuthenticator` parameter value to `oauth`.

    Default is none.

### Required Context Options

The following options are required for setting the database and schema context for the session:

`sfDatabase`
:   The database to use for the session after connecting.

`sfSchema`
:   The schema to use for the session after connecting.

### Additional Context Options

The options listed in this section are not required.

`sfAccount`
:   Account identifier (e.g. `myorganization-myaccount`). This option is no longer required because the account identifier is specified in `sfUrl`. It is documented here only for backward compatibility.

`sfWarehouse`
:   The default virtual warehouse to use for the session after connecting.

`sfRole`
:   The default security role to use for the session after connecting.

### Proxy Options

The options listed in this section are not required.

`use_proxy`
:   Specifies whether the connector should use a proxy:

    * `true` specifies that the connector should use a proxy.
    * `false` specifies that the connector should not use a proxy.

    The default value is `false`.

`proxy_host`
:   (Required if `use_proxy` is `true`) Specifies the hostname of the proxy server to use.

`proxy_port`
:   (Required if `use_proxy` is `true`) Specifies the port number of the proxy server to use.

`proxy_protocol`
:   Specifies the protocol used to connect to the proxy server. You can specify one of the following values:

    * `http`
    * `https`

    The default value is `http`.

    This is only supported for Snowflake on AWS.

    This option was added in version 2.11.1 of the Spark Connector.

`proxy_user`
:   Specifies the user name for authenticating to the proxy server. Set this if the proxy server requires authentication.

    This is only supported for Snowflake on AWS.

`proxy_password`
:   Specifies the password of `proxy_user` for authenticating to the proxy server. Set this if the proxy server requires
    authentication.

    This is only supported for Snowflake on AWS.

`non_proxy_hosts`
:   Specifies the list of hosts that the connector should connect to directly, bypassing the proxy server.

    Separate the hostnames with a URL-escaped pipe symbol (`%7C`). You can also use an asterisk (`*`) as a wildcard

    This is only supported for Snowflake on AWS.

### Additional Options

The options listed in this section are not required.

`sfTimezone`
:   The time zone to be used by Snowflake when working with Spark. Note that the parameter only sets the time zone in Snowflake; the Spark environment remains unmodified. The supported values are:

    * `spark`: Use the time zone from Spark (default).
    * `snowflake`: Use the current time zone for Snowflake.
    * `sf_default`: Use the default time zone for the Snowflake user who is connecting.
    * `time_zone`: Use a specific time zone (e.g. `America/New_York`), if valid.

      For more information about the impact of setting this option, see Working with Timestamps and Time Zones (in this topic).

`sfCompress`
:   If set to `on` (default), the data passed between Snowflake and Spark is compressed.

`s3MaxFileSize`
:   The size of the file used when moving data from Snowflake to Spark. The default is 10MB.

`preactions`
:   A semicolon-separated list of SQL commands that are executed before data is transferred between Spark and Snowflake.

    If a SQL command contains `%s`, the `%s` is replaced with the table name referenced for the operation.

`postactions`
:   A semicolon-separated list of SQL commands that are executed after data is transferred between Spark and Snowflake.

    If a SQL command contains `%s`, it is replaced with the table name referenced for the operation.

`truncate_columns`
:   If set to `on` (default), a COPY command automatically truncates text strings that exceed the target column length. If set to `off`, the command produces an error if a loaded string exceeds the target column length.

`truncate_table`
:   This parameter controls whether Snowflake retains the schema of a Snowflake
    target table when overwriting that table.

    By default, when a target table in Snowflake is overwritten, the schema of
    that target table is also overwritten; the new schema is based on the schema
    of the source table (the Spark dataframe).

    However, sometimes the schema of the source is not ideal. For example,
    a user might want a Snowflake target table to be able to store FLOAT
    values in the future even though the data type of the initial source column
    is INTEGER. In that case, the Snowflake table’s schema should not be
    overwritten; the Snowflake table should merely be truncated and then
    reused with its current schema.

    The possible values of this parameter are:

    * `on`
    * `off`

    If this parameter is `on`, the original schema of the target table is kept.
    If this parameter is `off`, then the old schema of the table is ignored,
    and a new schema is generated based on the schema of the source.

    This parameter is optional.

    The default value of this parameter is `off` (i.e. by default
    the original table schema is overwritten).

    For details about mapping Spark data types to Snowflake data types (and
    vice versa), see: Data Type Mappings (in this topic).

`continue_on_error`
:   This variable controls whether the COPY command aborts if the user enters
    invalid data (for example, invalid JSON format for a variant data type column).

    The possible values are:

    * `on`
    * `off`

    The value `on` means continue even if an error occurs. The value `off`
    means abort if an error is hit.

    This parameter is optional.

    The default value of this parameter is `off`.

    Turning this option on is not recommended. If any errors are reported
    while COPYing into Snowflake with the Spark connector, then this is likely
    to result in missing data.

    > **Note:**
    >
    > If rows are rejected or missing, and those rows are not clearly faulty
    > in the input source, please report it to Snowflake.

`usestagingtable`
:   This parameter controls whether data loading uses a staging table.

    A staging table is a normal table (with a temporary name) that is created
    by the connector; if the data loading operation is successful, the original
    target table is dropped and the staging table is renamed to the original
    target table’s name. If the data loading operation fails, the staging table
    is dropped and the target table is left with the data that it had
    immediately prior to the operation. Thus the staging table allows the original
    target table data to be retained if the operation fails. For safety, Snowflake
    strongly recommends using a staging table in most circumstances.

    In order for the connector to create a staging table, the user executing the
    COPY via the Spark connector must have sufficient privileges to
    create a table. Direct loading (i.e. loading without using a staging table)
    is useful if the user does not have permission to create a table.

    The possible values of this parameter are:

    * `on`
    * `off`

    If the parameter is `on`, a staging table is used. If this parameter
    is `off`, then the data is loaded directly into the target table.

    This parameter is optional.

    The default value of this parameter is `on` (i.e. use a staging table).

`autopushdown`
:   This parameter controls whether automatic query pushdown is enabled.

    If pushdown is enabled, then when a query is run on Spark, if part of the
    query can be “pushed down” to the Snowflake server, it is pushed down.
    This improves performance of some queries.

    This parameter is optional.

    The default value is `on` if the connector is plugged into a compatible
    version of Spark. Otherwise, the default value is `off`.

    If the connector is plugged into a different version of Spark than the
    connector is intended for (e.g. if version 3.2 of the connector is
    plugged into version 3.3 of Spark), then auto-pushdown is disabled
    even if this parameter is set to `on`.

`purge`
:   If this is set to `on`, then the connector deletes temporary files created
    when transferring from Spark to Snowflake via external data transfer.
    If this parameter is set to `off`, then those files are not automatically
    deleted by the connector.

    Purging works only for transfers from Spark to Snowflake, not for transfers
    from Snowflake to Spark.

    The possible values are

    * `on`
    * `off`

    The default value is `off`.

`columnmap`
:   This parameter is useful when writing data from Spark to Snowflake
    and the column names in the Snowflake table do not match the column names
    in the Spark table. You can create a map that indicates which Spark
    source column corresponds to each Snowflake destination column.

    The parameter is a single string literal, in the form of:

    > `"Map(col_2 -> col_b, col_3 -> col_a)"`

    For example, consider the following scenario:

    * A Dataframe named `df` in Spark has three columns:

      > `col_1` , `col_2` , `col_3`
    * A table named `tb` in Snowflake has two columns:

      > `col_a` , `col_b`
    * You wish to copy the following values:

      + From `df.col_2` to `tb.col_b`.
      + From `df.col_3` to `tb.col_a`.

    The value of the `columnmap` parameter would be:

    > `Map(col_2 -> col_b, col_3 -> col_a)`

    You can generate this value by executing the following Scala code:

    > `Map("col_2"->"col_b","col_3"->"col_a").toString()`

    The default value of this parameter is null. In other words, by default,
    column names in the source and destination tables should match.

    This parameter is used only when writing from Spark to Snowflake;
    it does not apply when writing from Snowflake to Spark.

`keep_column_case`
:   When writing a table from Spark to Snowflake, the Spark connector defaults
    to shifting the letters in column names to uppercase, unless the column
    names are in double quotes.

    When writing a table from Snowflake to Spark, the Spark connector defaults
    to adding double quotes around any column name that contains any characters
    except uppercase letters, underscores, and digits.

    If you set keep_column_case to `on`, then the Spark connector will not
    make these changes.

    The possible values are:

    * `on`
    * `off`

    The default value is `off`.

`column_mapping`
:   The connector must map columns from the Spark data frame to the Snowflake table. This can be done based on column
    names (regardless of order), or based on column order (i.e. the first column in the data frame is mapped to the first
    column in the table, regardless of column name).

    By default, the mapping is done based on order. You can override that by setting this parameter to `name`,
    which tells the connector to map columns based on column names. (The name mapping is case-insensitive.)

    The possible values of this parameter are:

    * `order`
    * `name`

    The default value is `order`.

`column_mismatch_behavior`
:   This parameter applies only when the `column_mapping` parameter is set to `name`.

    If the column names in the Spark data frame and the Snowflake table do not match, then:

    * If `column_mismatch_behavior` is `error`, then the Spark Connector reports an error.
    * If `column_mismatch_behavior` is `ignore`, then the Spark Connector ignores the error.

      + The driver discards any column in the Spark data frame that does not have a corresponding column in the
        Snowflake table.
      + The driver inserts NULLs into any column in the Snowflake table that does not have a corresponding column
        in the Spark data frame.

    Potential errors include:

    * The Spark data frame could contain columns that are identical except for case (uppercase/lowercase). Because
      column name mapping is case-insensitive, it is not possible to determine the correct mapping from the data frame
      to the table.
    * The Snowflake table could contain columns that are identical except for case (uppercase/lowercase). Because
      column name mapping is case-insensitive, it is not possible to determine the correct mapping from the data frame
      to the table.
    * The Spark data frame and the Snowflake table might have no column names in common. In theory, the Spark Connector
      could insert NULLs into every column of every row, but this is usually pointless, so the connector throws an
      error even if the `column_mismatch_behavior` is set to `ignore`.

    The possible values of this parameter are:

    * `error`
    * `ignore`

    The default value is `error`.

`time_output_format`
:   This parameter allows the user to specify the format for `TIME` data returned.

    The possible values of this parameter are the possible values for time formats specified at [Time formats](../sql-reference/date-time-input-output.md).

    This parameter affects only output, not input.

`timestamp_ntz_output_format`, . `timestamp_ltz_output_format`, . `timestamp_tz_output_format`
:   These options specify the output format for timestamp values. The default values of these options are:

    | Configuration Option | Default Value |
    | --- | --- |
    | `timestamp_ntz_output_format` | `"YYYY-MM-DD HH24:MI:SS.FF3"` |
    | `timestamp_ltz_output_format` | `"TZHTZM YYYY-MM-DD HH24:MI:SS.FF3"` |
    | `timestamp_tz_output_format` | `"TZHTZM YYYY-MM-DD HH24:MI:SS.FF3"` |

    If these options are set to `"sf_current"`, the connector uses the formats specified for the session.

`partition_size_in_mb`
:   This parameter is used when the query result set is very large and needs to be split into multiple DataFrame
    partitions. This parameter specifies the recommended uncompressed size for each DataFrame partition. To reduce the
    number of partitions, make this size larger.

    This size is used as a recommended size; the actual size of partitions could be smaller or larger.

    This option applies only when the use_copy_unload parameter is FALSE.

    This parameter is optional.

    The default value is `100` (MB).

`use_copy_unload`
:   If this is `FALSE`, Snowflake uses the Arrow data format when SELECTing data. If this is set to `TRUE`,
    then Snowflake reverts to the old behavior of using the `COPY UNLOAD` command to transmit selected data.

    This parameter is optional.

    The default value is `FALSE`.

`treat_decimal_as_long`
:   If `TRUE`, configures the Spark Connector to return `Long` values (rather than `BigDecimal` values) for queries
    that return the type `Decimal(precision, 0)`.

    The default value is `FALSE`.

    This option was added in version 2.11.1 of the Spark Connector.

`s3_stage_vpce_dns_name`
:   Specifies the DNS name of your VPC Endpoint for access to internal stages.

    This option was added in version 2.11.1 of the Spark Connector.

`support_share_connection`
:   If `FALSE`, configures the Spark Connector to create a new JDBC connection for each job or action that uses the same Spark
    Connector options to access Snowflake.

    The default value is `TRUE`, which means that the different jobs and actions share the same JDBC connection if they use the
    same Spark Connector options to access Snowflake.

    If you need to enable or disable this setting programmatically, use the following global static functions:

    * `SparkConnectorContext.disableSharedConnection()`
    * `SparkConnectorContext.enableSharingJDBCConnection()`

    > **Note:**
    >
    > In the following special cases, the Spark Connector does not use a shared JDBC connection:
    >
    > * If preactions or postactions are set, and those preactions or postactions are not CREATE TABLE, DROP TABLE, or MERGE INTO,
    >   the Spark Connector does not use the shared connection.
    > * Utility functions in Utils such as `Utils.runQuery()` and `Utils.getJDBCConnection()` do not use the shared
    >   connection.

    This option was added in version 2.11.2 of the Spark Connector.

`force_skip_pre_post_action_check_for_shared_session`
:   If `TRUE`, configures the Spark Connector to disable the validation of preactions and postactions for session sharing.

    The default value is `FALSE`.

    > **Important:**
    >
    > Before setting this option, make sure that the queries in preactions and postactions don’t affect the session settings.
    > Otherwise, you may encounter issues with results.

    This option was added in version 2.11.3 of the Spark Connector.

### Using Key Pair Authentication & Key Pair Rotation

The Spark connector supports key pair authentication and key rotation.

1. To start, complete the initial configuration for key pair authentication as shown in [Key-pair authentication and key-pair rotation](key-pair-auth.md).
2. Send an unencrypted copy of the private key using the `pem_private_key` connection option.

> **Attention:**
>
> For security reasons, rather than hard-coding the `pem_private_key` in your application, you should set the parameter dynamically after reading the key from a secure source. If the key is encrypted, then decrypt it and send the decrypted version.

In the Python example, note that the `pem_private_key` file, `rsa_key.p8`, is:

* Being read directly from a password-protected file, using the environment variable `PRIVATE_KEY_PASSPHRASE`.
* Using the expression `pkb` in the `sfOptions` string.

To connect, you can save the Python example to a file (i.e. `<file.py>`) and then execute the following command:

```bash
spark-submit --packages net.snowflake:snowflake-jdbc:3.13.22,net.snowflake:spark-snowflake_2.12:2.11.0-spark_3.3 <file.py>
```

**Python**

```python
from pyspark.sql import SQLContext
from pyspark import SparkConf, SparkContext
from cryptography.hazmat.backends import default_backend
from cryptography.hazmat.primitives import serialization
import re
import os

with open("<path>/rsa_key.p8", "rb") as key_file:
  p_key = serialization.load_pem_private_key(
    key_file.read(),
    password = os.environ['PRIVATE_KEY_PASSPHRASE'].encode(),
    backend = default_backend()
    )

pkb = p_key.private_bytes(
  encoding = serialization.Encoding.PEM,
  format = serialization.PrivateFormat.PKCS8,
  encryption_algorithm = serialization.NoEncryption()
  )

pkb = pkb.decode("UTF-8")
pkb = re.sub("-*(BEGIN|END) PRIVATE KEY-*\n","",pkb).replace("\n","")

sc = SparkContext("local", "Simple App")
spark = SQLContext(sc)
spark_conf = SparkConf().setMaster('local').setAppName('Simple App')

sfOptions = {
  "sfURL" : "<account_identifier>.snowflakecomputing.com",
  "sfUser" : "<user_name>",
  "pem_private_key" : pkb,
  "sfDatabase" : "<database>",
  "sfSchema" : "schema",
  "sfWarehouse" : "<warehouse>"
}

SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"

df = spark.read.format(SNOWFLAKE_SOURCE_NAME) \
    .options(**sfOptions) \
    .option("query", "COLORS") \
    .load()

df.show()
```

### Using External OAuth

Starting with Spark Connector version 2.7.0, you can use [External OAuth](oauth-ext-overview.md) to authenticate to Snowflake using either the sample Scala program or the sample Python script.

Before using External OAuth and the Spark Connector to authenticate to Snowflake, configure an External OAuth security integration for one of the supported External OAuth authorization servers or an External OAuth [custom client](oauth-ext-custom.md).

In the Scala and Python examples, note the replacement of the `sfPassword` parameter with the `sfAuthenticator` and `sfToken` parameters.

**Scala:**

> ```scala
> // spark connector version
>
> val SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"
> import net.snowflake.spark.snowflake2.Utils.SNOWFLAKE_SOURCE_NAME
> import org.apache.spark.sql.DataFrame
>
> var sfOptions = Map(
>     "sfURL" -> "<account_identifier>.snowflakecomputing.com",
>     "sfUser" -> "<username>",
>     "sfAuthenticator" -> "oauth",
>     "sfToken" -> "<external_oauth_access_token>",
>     "sfDatabase" -> "<database>",
>     "sfSchema" -> "<schema>",
>     "sfWarehouse" -> "<warehouse>"
> )
>
> //
> // Create a DataFrame from a Snowflake table
> //
> val df: DataFrame = sqlContext.read
>     .format(SNOWFLAKE_SOURCE_NAME)
>     .options(sfOptions)
>     .option("dbtable", "region")
>     .load()
>
> //
> // Join, augment, aggregate, etc. the data in Spark and then use the
> // Data Source API to write the data back to a table in Snowflake
> //
> df.write
>     .format(SNOWFLAKE_SOURCE_NAME)
>     .options(sfOptions)
>     .option("dbtable", "t2")
>     .mode(SaveMode.Overwrite)
>     .save()
> ```

**Python:**

> ```python
> from pyspark import SparkConf, SparkContext
> from pyspark.sql import SQLContext
> from pyspark.sql.types import *
>
> sc = SparkContext("local", "Simple App")
> spark = SQLContext(sc)
> spark_conf = SparkConf().setMaster('local').setAppName('<APP_NAME>')
>
> # You might need to set these
> sc._jsc.hadoopConfiguration().set("fs.s3n.awsAccessKeyId", "<AWS_KEY>")
> sc._jsc.hadoopConfiguration().set("fs.s3n.awsSecretAccessKey", "<AWS_SECRET>")
>
> # Set options below
> sfOptions = {
>   "sfURL" : "<account_identifier>.snowflakecomputing.com",
>   "sfUser" : "<user_name>",
>   "sfAuthenticator" : "oauth",
>   "sfToken" : "<external_oauth_access_token>",
>   "sfDatabase" : "<database>",
>   "sfSchema" : "<schema>",
>   "sfWarehouse" : "<warehouse>"
> }
>
> SNOWFLAKE_SOURCE_NAME = "net.snowflake.spark.snowflake"
>
> df = spark.read.format(SNOWFLAKE_SOURCE_NAME) \
>   .options(**sfOptions) \
>   .option("query",  "select 1 as my_num union all select 2 as my_num") \
>   .load()
>
> df.show()
> ```

### AWS Options for External Data Transfer

These options are used to specify the Amazon S3 location where temporary data is stored and provide authentication details for accessing the location. They are required only if you are doing an external data transfer. External data transfers are required if either of the following is true:

* You are using version 2.1.x or lower of the Spark Connector (which
  does not support internal transfers), or
* Your transfer is likely to take 36 hours or more (internal transfers
  use temporary credentials that expire after 36 hours).

`tempDir`
:   The S3 location where intermediate data is stored (e.g. `s3n://xy12345-bucket/spark-snowflake-tmp/`).

    If `tempDir` is specified, you must also specify either:

    * `awsAccessKey` , `awsSecretKey` . or
    * `temporary_aws_access_key_id` , `temporary_aws_secret_access_key`, `temporary_aws_session_token`

`awsAccessKey` , `awsSecretKey`
:   These are standard AWS credentials that allow access to the location
    specified in `tempDir`. Note that both of these options must be set
    together.

    If they are set, they can be retrieved from the existing `SparkContext` object.

    If you specify these variables, you must also specify `tempDir`.

    These credentials should also be set for the Hadoop cluster.

`temporary_aws_access_key_id` , `temporary_aws_secret_access_key`, `temporary_aws_session_token`
:   These are temporary AWS credentials that allow access to the location
    specified in `tempDir`. Note that all three of these options must be
    set together.

    Also, if these options are set, they take precedence over the `awsAccessKey` and `awsSecretKey` options.

    If you specify `temporary_aws_access_key_id` , `temporary_aws_secret_access_key`, and `temporary_aws_session_token` , you must also specify `tempDir`. Otherwise, these parameters are ignored.

`check_bucket_configuration`
:   If set to `on` (default), the connector checks if the bucket used for data transfer has a lifecycle policy configured (see [Preparing an AWS External S3 Bucket](spark-connector-install.md) for more information). If there is no lifecycle
    policy present, a warning is logged.

    Disabling this option (by setting to `off`) skips this check. This can be useful if a user can access the bucket data operations, but not the bucket lifecycle policies. Disabling the option can also
    speed up query execution times slightly.

For details, see Authenticating S3 for Data Exchange (in this topic).

### Azure Options for External Data Transfer

This section describes the parameters that apply to Azure Blob storage when
doing external data transfers. External data transfers are required if either
of the following is true:

* You are using version 2.1.x or lower of the Spark Connector (which
  does not support internal transfers), or
* Your transfer is likely to take 36 hours or more (internal transfers
  use temporary credentials that expire after 36 hours).

When using an external transfer with Azure Blob storage, you specify the location of the
Azure container and the SAS (shared-access signature) for that container using
the parameters described below.

`tempDir`
:   The Azure Blob storage container where intermediate data is stored.
    This is in the form of a URL, for example:

    > `wasb://<azure_container>@<azure_account>.<azure_endpoint>/`

`temporary_azure_sas_token`
:   Specify the SAS token for Azure Blob storage.

For details, see Authenticating Azure for Data Exchange (in this topic).

#### Specifying Azure Information for Temporary Storage in Spark

When using Azure Blob storage to provide temporary storage to transfer data
between Spark and Snowflake, you must provide Spark, as well as the Snowflake Spark Connector,
with the location and credentials for the temporary storage.

To provide Spark with the temporary storage location, execute commands similar to the following
on your Spark cluster:

> ```scala
> sc.hadoopConfiguration.set("fs.azure", "org.apache.hadoop.fs.azure.NativeAzureFileSystem")
> sc.hadoopConfiguration.set("fs.AbstractFileSystem.wasb.impl", "org.apache.hadoop.fs.azure.Wasb")
> sc.hadoopConfiguration.set("fs.azure.sas.<container>.<account>.<azure_endpoint>", <azure_sas>)
> ```

Note that the last command contains the following variables:

* `<container>` and `<account>`: These are the container and account name for your Azure deployment.
* `<azure_endpoint>`: This is the endpoint for your Azure deployment location. For example, if you are using an Azure US deployment, the endpoint is likely to be `blob.core.windows.net`.
* `<azure_sas>`: This is the Shared Access Signature security token.

Replace each of these variables with the proper information for your Azure Blob Storage account.

## Passing Snowflake Session Parameters as Options for the Connector

The Snowflake Connector for Spark supports sending arbitrary session-level parameters to Snowflake (see [Session parameters](../sql-reference/parameters.md) for more info). This can be achieved by adding a
`("<key>" -> "<value>")` pair to the `options` object, where `<key>` is the session parameter name and `<value>` is the value.

> **Note:**
>
> The `<value>` should be a string enclosed in double quotes, even for parameters that accept numbers or Boolean values (e.g. `"1"` or `"true"`).

For example, the following code sample passes the [USE_CACHED_RESULT](../sql-reference/parameters.md) session parameter with a value of `"false"`, which disables using the results of previously-executed queries:

```scala
// ... assuming sfOptions contains Snowflake connector options

// Add to the options request to keep connection alive
sfOptions += ("USE_CACHED_RESULT" -> "false")

// ... now use sfOptions with the .options() method
```

## Security Considerations

Customers should ensure that in a multi-node Spark system, communications between the nodes are secure. The Spark
master sends Snowflake credentials to Spark workers so that those workers can access Snowflake stages. If
communications between the Spark master and Spark workers are not secure, the credentials could be read by an
unauthorized third party.

## Authenticating S3 for Data Exchange

This section describes how to authenticate when using S3 for data exchange.

This task is required only in either of the following circumstances:

* The Snowflake Connector for Spark version is 2.1.x (or lower). Starting with v2.2.0, the connector uses a Snowflake internal temporary stage for data exchange. If you are not currently using version 2.2.0 (or higher) of the connector, Snowflake strongly recommends upgrading to the latest version.
* The Snowflake Connector for Spark version is 2.2.0 (or higher), but your jobs regularly exceed 36 hours in length. This is the maximum duration for the AWS token used by the connector to access the internal stage for data exchange.

If you are using an older version of the connector, you need to prepare an S3 location that the connector can use to exchange data between Snowflake and Spark.

To allow access to the S3 bucket/directory used to exchange data between Spark and Snowflake (as specified for `tempDir`), two authentication methods are supported:

* Permanent AWS credentials (also used to configure Hadoop/Spark authentication for accessing S3)
* Temporary AWS credentials

### Using Permanent AWS Credentials

This is the standard AWS authentication method. It requires a pair of `awsAccessKey` and `awsSecretKey` values.

> **Note:**
>
> These values should also be used to configure Hadoop/Spark for accessing S3. For more information, including examples, see Authenticating Hadoop/Spark Using S3A or S3N (in this topic).

For example:

> ```scala
> sc.hadoopConfiguration.set("fs.s3n.awsAccessKeyId", "<access_key>")
> sc.hadoopConfiguration.set("fs.s3n.awsSecretAccessKey", "<secret_key>")
>
> // Then, configure your Snowflake environment
> //
> var sfOptions = Map(
>     "sfURL" -> "<account_identifier>.snowflakecomputing.com",
>     "sfUser" -> "<user_name>",
>     "sfPassword" -> "<password>",
>     "sfDatabase" -> "<database>",
>     "sfSchema" -> "<schema>",
>     "sfWarehouse" -> "<warehouse>",
>     "awsAccessKey" -> sc.hadoopConfiguration.get("fs.s3n.awsAccessKeyId"),
>     "awsSecretKey" -> sc.hadoopConfiguration.get("fs.s3n.awsSecretAccessKey"),
>     "tempdir" -> "s3n://<temp-bucket-name>"
> )
> ```

For details about the options supported by `sfOptions`, see AWS Options for External Data Transfer (in this topic).

#### Authenticating Hadoop/Spark Using S3A or S3N

Hadoop/Spark ecosystems support 2 URI schemes for [accessing S3](https://wiki.apache.org/hadoop/AmazonS3/):

`s3a://`
:   **New, recommended method (for Hadoop 2.7 and higher)**

    To use this method, modify the Scala examples in this topic to add the following Hadoop configuration options:

    > ```scala
    > val hadoopConf = sc.hadoopConfiguration
    > hadoopConf.set("fs.s3a.access.key", <accessKey>)
    > hadoopConf.set("fs.s3a.secret.key", <secretKey>)
    > ```

    Make sure the `tempdir` option uses `s3a://` as well.

`s3n://`
:   **Older method (for Hadoop 2.6 and lower)**

    In some systems, it is necessary to specify it explicitly as shown in the following Scala example:

    > ```scala
    > val hadoopConf = sc.hadoopConfiguration
    > hadoopConf.set("fs.s3.impl", "org.apache.hadoop.fs.s3native.NativeS3FileSystem")
    > hadoopConf.set("fs.s3.awsAccessKeyId", <accessKey>)
    > hadoopConf.set("fs.s3.awsSecretAccessKey", <secretKey>)
    > ```

### Using Temporary AWS Credentials

This method uses the `temporary_aws_access_key_id`, `temporary_aws_secret_access_key`, and `temporary_aws_session_token` configuration options for the connector.

This method allows additional security by providing Snowflake with only temporary access to the S3 bucket/directory used for data exchange.

> **Note:**
>
> Temporary credentials can only be used to configure the S3 authentication for the connector; they cannot be used to configure Hadoop/Spark authentication.
>
> Also, if you provide temporary credentials, they take precedence over any permanent credentials that have been provided.

The following Scala code sample provides an example of authenticating using temporary credentials:

> ```scala
> import com.amazonaws.services.securitytoken.AWSSecurityTokenServiceClient
> import com.amazonaws.services.securitytoken.model.GetSessionTokenRequest
>
> import net.snowflake.spark.snowflake.Parameters
>
> // ...
>
> val sts_client = new AWSSecurityTokenServiceClient()
> val session_token_request = new GetSessionTokenRequest()
>
> // Set the token duration to 2 hours.
>
> session_token_request.setDurationSeconds(7200)
> val session_token_result = sts_client.getSessionToken(session_token_request)
> val session_creds = session_token_result.getCredentials()
>
> // Create a new set of Snowflake connector options, based on the existing
> // sfOptions definition, with additional temporary credential options that override
> // the credential options in sfOptions.
> // Note that constants from Parameters are used to guarantee correct
> // key names, but literal values, such as temporary_aws_access_key_id are, of course,
> // also allowed.
>
> var sfOptions2 = collection.mutable.Map[String, String]() ++= sfOptions
> sfOptions2 += (Parameters.PARAM_TEMP_KEY_ID -> session_creds.getAccessKeyId())
> sfOptions2 += (Parameters.PARAM_TEMP_KEY_SECRET -> session_creds.getSecretAccessKey())
> sfOptions2 += (Parameters.PARAM_TEMP_SESSION_TOKEN -> session_creds.getSessionToken())
> ```

`sfOptions2` can now be used with the `options()` DataFrame method.

## Authenticating Azure for Data Exchange

This section describes how to authenticate when using Azure Blob storage for data exchange.

Authenticating this way is required only in either of the following circumstances:

* The Snowflake Connector for Spark version is 2.1.x (or lower). Starting with
  v2.2.0, the connector uses a Snowflake internal temporary stage for data
  exchange. If you are not currently using version 2.2.0 (or higher) of the
  connector, Snowflake strongly recommends upgrading to the latest version.
* The Snowflake Connector for Spark version is 2.2.0 (or higher), but your
  jobs regularly exceed 36 hours in length. This is the maximum duration for
  the Azure token used by the connector to access the internal stage for
  data exchange.

You need to prepare an Azure Blob storage container that the connector can use to
exchange data between Snowflake and Spark.

### Using Azure Credentials

This is the standard Azure Blob storage authentication method. It requires a pair of
values: `tempDir` (a URL) and `temporary_azure_sas_token` values.

> **Note:**
>
> These values should also be used to configure Hadoop/Spark for accessing Azure Blob storage. For more information, including examples, see Authenticating Hadoop/Spark Using Azure (in this topic).

For example:

> ```scala
> sc.hadoopConfiguration.set("fs.azure", "org.apache.hadoop.fs.azure.NativeAzureFileSystem")
> sc.hadoopConfiguration.set("fs.AbstractFileSystem.wasb.impl", "org.apache.hadoop.fs.azure.Wasb")
> sc.hadoopConfiguration.set("fs.azure.sas.<container>.<account>.<azure_endpoint>", <azure_sas>)
>
> // Then, configure your Snowflake environment
> //
> val sfOptions = Map(
>   "sfURL" -> "<account_identifier>.snowflakecomputing.com",
>   "sfUser" -> "<user_name>",
>   "sfPassword" -> "<password>",
>   "sfDatabase" -> "<database_name>",
>   "sfSchema" -> "<schema_name>",
>   "sfWarehouse" -> "<warehouse_name>",
>   "sfCompress" -> "on",
>   "sfSSL" -> "on",
>   "tempdir" -> "wasb://<azure_container>@<azure_account>.<Azure_endpoint>/",
>   "temporary_azure_sas_token" -> "<azure_sas>"
> )
> ```

For details about the options supported by `sfOptions`, see Azure Options for External Data Transfer (in this topic).

#### Authenticating Hadoop/Spark Using Azure

To use this method, modify the Scala examples in this topic to add the following Hadoop configuration options:

> ```scala
> val hadoopConf = sc.hadoopConfiguration
> sc.hadoopConfiguration.set("fs.azure", "org.apache.hadoop.fs.azure.NativeAzureFileSystem")
> sc.hadoopConfiguration.set("fs.AbstractFileSystem.wasb.impl", "org.apache.hadoop.fs.azure.Wasb")
> sc.hadoopConfiguration.set("fs.azure.sas.<container>.<account>.<azure_endpoint>", <azure_sas>)
> ```

Make sure the `tempdir` option uses `wasb://` as well.

## Authenticating Through a Browser is Not Supported

When using the Spark Connector, it is impractical to use any form of authentication that would open a browser
window to ask the user for credentials. The window would not necessarily appear on the client machine. Therefore, the
Spark Connector does not support any type of authentication, including MFA (Multi-Factor Authentication) or SSO
(Single Sign-On), that would invoke a browser window.

---
title: Using the Trust Center
source: https://docs.snowflake.com/en/user-guide/trust-center/using-the-trust-center.md
section: User Guide
---

# Using the Trust Center

This topic describes how to monitor Trust Center costs, and manage scanners, findings, and security risks by using the Trust Center
Snowsight interface.

## Monitoring cost

The Trust Center incurs [serverless compute cost](../cost-understanding-compute.md) when it scans your Snowflake environment for
security vulnerabilities.

You can use cost-related views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas to track the costs associated with the Trust Center. When
querying these views, filter on the `service_type` column to find `TRUST_CENTER` values.

| View | Schema | `service_type` | Roles with required privileges |
| --- | --- | --- | --- |
| [METERING_HISTORY](../../sql-reference/account-usage/metering_history.md) | ACCOUNT_USAGE | TRUST_CENTER | * ACCOUNTADMIN role * USAGE_VIEWER database role |
| [METERING_DAILY_HISTORY](../../sql-reference/account-usage/metering_daily_history.md) | ACCOUNT_USAGE | TRUST_CENTER | * ACCOUNTADMIN role * USAGE_VIEWER database role |
| [METERING_DAILY_HISTORY](../../sql-reference/organization-usage/metering_daily_history.md) | ORGANIZATION_USAGE | TRUST_CENTER | * ORGADMIN role * ORGANIZATION_USAGE_VIEWER database role |
| [USAGE_IN_CURRENCY_DAILY](../../sql-reference/organization-usage/usage_in_currency_daily.md) | ORGANIZATION_USAGE | TRUST_CENTER | * ORGADMIN role * ORGANIZATION_BILLING_VIEWER database role |

**Example:** View the total cost that the Trust Center incurred between December 1, 2024 and December 31, 2024.

```sqlexample
SELECT
   SUM(credits_used) AS total_credits
FROM snowflake.account_usage.metering_history
WHERE
   service_type = 'TRUST_CENTER' AND
   start_time >= '2024-12-01' AND
   end_time <= '2024-12-31';
```

**Example:** View the daily cost that the Trust Center incurred after December 1, 2024.

```sqlexample
SELECT
   usage_date AS date,
   credits_used AS credits
FROM snowflake.account_usage.metering_daily_history
WHERE
   service_type = 'TRUST_CENTER' AND
   date > '2024-12-01';
```

For information about how many credits are charged per Compute-Hour for the operation of the Trust Center, see Table 5 in the
[Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Use the Trust Center Snowsight interface

This preview introduces several changes to the Trust Center. The Trust Center Snowsight interface now has the following tabs:

* Overview - Displays a high-level summary of Trust Center findings for your account. Select the View option in each section of
  Overview to see more detailed information about a specific aspect of your account’s security posture.
* Violations - This tab was previously named the Findings tab. It shows violations, suggests remediation actions for them, and
  provides detailed information about them. For information about using this tab, go to the Violations tab, and then follow the instructions
  in Manage the violation findings lifecycle and Manage security risks.
* Detections - This tab shows the detections found by the scanners and provides information about them. For information about using
  this tab, see View Trust Center detection findings.
* Manage scanners - Now contains the Scanner packages tab. You can use it to view and manage scanner packages and individual
  scanners. The event-driven scanners added in this preview show Event driven in the SCHEDULE column. For information
  about using this tab, go to the Manage scanners tab, and then follow the instructions in Manage scanner packages
  and Managing scanners.
* Manage scanners - Now contains the Extensions tab. You can create Trust Center extensions by using the Snowflake Native App
  Framework. For more information, see, [Using Trust Center extensions](trust-center-extensions.md).

## Manage scanner packages

You can complete the following tasks to manage scanner packages in the Trust Center:

* View the list of scanners in a package
* Enable scanner packages.
* View available scanner packages.
* Change the schedule for a scanner package.
* Run a scanner package on demand.

### View the list of scanners in a package

To view the list of scanners provided in a scanner package, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. From the list, select a scanner package.

### Enable scanner packages

To enable a scanner package, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select Enable Package.

After you enable a scanner package, you can
enable or disable individual scanners in the scanner package.

### View available scanner packages

To view available scanner packages, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Optionally, select Provider, Status, or Search to filter the list of scanner packages available.

### Change the schedule for a scanner package

You can change the schedule for all scanner packages, except the [Security Essentials scanner package](overview.md).

> **Tip:**
>
> After a scanner package is enabled, you can
> change the schedule for individual scanners in the scanner package.

To change the schedule for a scanner package, follow these steps:

1. Ensure you’ve enabled the [CIS Benchmarks scanner package](overview.md).
2. Sign in to [Snowsight](../ui-snowsight-gs.md).
3. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
4. In the navigation menu, select Governance & security » Trust Center.
5. Select the Manage scanners tab.
6. Select a scanner package from the list.
7. Select the Settings tab.
8. Under Scanner Package Schedule, select  Edit.
9. Set your desired Frequency.
10. Select Continue.

### Run a scanner package on demand

To run a scanner package on demand, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Next to Search, select  Run Package.

## Managing scanners

You can complete the following tasks to manage scanners in the Trust Center:

* View details for a scanner.
* Enable or disable a scanner in a scanner package.
* Change the schedule for a scanner.
* Reset the schedule for a scanner to the scanner package schedule.
* Run a scanner on demand.

### View details for a scanner

To view details that describe what each scanner does, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. Select a scanner from the list of scanner names.

### Enable or disable a scanner in a scanner package

> **Attention:**
>
> Scanners provide valuable information about possible security risks at a minimal cost.
> Before disabling a scanner, we recommend evaluating the value of the information provided
> by the scanner in relation to the cost associated with running it. For more information about
> evaluating the cost associated with a scanner, see Monitoring cost.

If a scanner package is disabled, all of the scanners in the package are disabled, including
scanners that were enabled individually.

To enable or disable a scanner in a scanner package, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. Select a scanner package from the list.
6. In the scanner STATE, enable or disable the scanner.
7. In the confirmation box, select Confirm.

### Change the schedule for a scanner

You can change the schedule for schedule-based scanners. You can’t change the schedule for event-based scanners. You can only
enable or disable an event-driven scanner.

> **Note:**
>
> When a custom schedule is set for an individual scanner, that setting is used instead of its scanner package schedule,
> even if the scanner package schedule is changed.

To change the schedule for a scanner, follow these steps:

1. Ensure that you enabled the scanner.
2. Sign in to [Snowsight](../ui-snowsight-gs.md).
3. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
4. In the navigation menu, select Governance & security » Trust Center.
5. Select the Manage scanners tab.
6. Select a scanner package from the list.
7. Select  More for the scanner, and then select Edit schedule.
8. Set your desired Frequency.
9. Select Save.

### Reset the schedule for a scanner to the scanner package schedule

To change the schedule for a scanner to match its scanner package schedule, follow these steps:

1. Ensure that you enabled the scanner.
2. Sign in to [Snowsight](../ui-snowsight-gs.md).
3. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
4. In the navigation menu, select Governance & security » Trust Center.
5. Select the Manage scanners tab.
6. Select a scanner package from the list.
7. Select  More for the scanner, and then select Edit schedule.
8. Select Reset, and then select Reset to scanner package schedule.
9. Select Save.

### Run a scanner on demand

To run a scanner on demand, follow these steps:

1. Ensure that you enabled the scanner.
2. Sign in to [Snowsight](../ui-snowsight-gs.md).
3. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
4. In the navigation menu, select Governance & security » Trust Center.
5. Select the Manage scanners tab.
6. Select a scanner package from the list.
7. Select  More for the scanner, and then select Run scanner.

## Manage the violation findings lifecycle

Specific application roles allow you to view and manage violation findings by using the Violations tab. For more
information, see [Required roles](overview.md).

### View violations

To view and filter your violations data to see your current progress, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_VIEWER` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Violations tab.
5. To view the list of open, muted, or all violations, select an option from the Status drop-down menu.
6. To see a detailed pane with the violation’s summary, recommendations, and activity, select any violation.
7. In the violation bar, select Activity to see the comments history and the responsible users.
8. To see the scanner’s last run and when the violation was generated, select Scanned.
9. To see when the violation status was last changed, select Updated.

### Change the status of a violation finding

> **Attention:**
>
> Marking a violation as `Muted` is a way to triage the open violation so you can focus on the ones most important for your environment.
> Muting a violation also ceases the periodic email notifications for that violation. Scanners still run as scheduled irrespective of the
> violation status: `Open` or `Muted`. The scanner continues to run and detect violations if the configuration remains unchanged.

All new security violations are raised with an `Open` status. You can mute a violation for multiple reasons, such as not being applicable
to your account, being deferred for a future date, being in progress already, or another reason.

You can change the status of a violation for any reason, such as not being applicable to your account, deferred for a future date, being in
progress already, or another reason. To change the status of a violation, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Violations tab.
5. Select a violation that opens its detailed pane. By default, only violations with the Open status are shown.
6. Select the Mute notification button.
7. (Optional) To justify the resolution, add a comment.
8. Select Submit.

You can reopen a muted violation by selecting the Unmute button.

> **Note:**
>
> Manually muting a violation finding isn’t mandatory for customers. The Trust Center automatically removes violation findings from the
> Violations tab when a scanner run determines that any misconfiguration was corrected or remediation steps were
> followed correctly.

## Remediate violations with Cortex Code

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

You can use Cortex Code to get AI-guided remediation for Trust Center violations directly in Snowsight. When you select
Begin Remediation for a finding, Cortex Code opens a chat that explains the violation in the context of your account,
recommends remediation steps, and can execute remediation actions with your approval.

Cortex Code provides interactive, conversational remediation that is personalized to your account’s specific configuration. Unlike
the static remediation instructions on the Remediation tab, Cortex Code can tailor its guidance based on the entities and
configurations involved in the violation, answer follow-up questions, and generate SQL statements that you can review and run.

### Prerequisites

To use Cortex Code for violation remediation, the following conditions must be met:

* Cortex Code in Snowsight must be available for your account.
* Your role must have the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it. For more information about granting
  this role, see [Required roles](overview.md).
* Your role must have the `SNOWFLAKE.CORTEX_USER` database role granted to it.

### Remediate a violation with Cortex Code

To remediate a violation by using Cortex Code, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Violations tab.
5. Select a violation to open the findings detail panel, and select Begin Remediation, or select the Cortex icon in the list
   of violations.

   Cortex Code opens in a chat panel on the right side of the screen. The chat is pre-populated with the context of the selected
   violation, including the violation type, severity, affected entities, and scanner details.
6. Review the explanation and remediation steps that Cortex Code provides. You can:

   * Ask follow-up questions to understand the violation in more detail.
   * Request alternative remediation approaches.
   * Ask Cortex Code to generate SQL statements for the remediation.
   * Review and run SQL statements directly from the chat.
7. After you complete the remediation, wait for the next scheduled scanner run or
   run the scanner on demand to verify that the root cause of the violation
   has been remediated. The Trust Center automatically removes the violation from the Violations tab after the
   scanner confirms the remediation. A remediated finding may appear as an open finding in Snowsight for up to 3 hours after the violation is remediated at the scanner is re-run.

> **Note:**
>
> AI-guided remediation is available for violations only. Detections represent unique events that occurred in the past and don’t
> have direct remediation steps. However, you can use Cortex Code to investigate and plan a course of action for detection findings
> as well.

### Considerations

* Cortex Code generates remediation suggestions based on your account’s configuration and the details of the specific violation.
  Always review the suggested SQL statements before running them.
* Some violations require actions outside of Snowflake, such as coordinating an organization-wide MFA policy change or investigating
  whether a login from an unrecognized IP address is legitimate. In these cases, Cortex Code explains the required steps but cannot
  execute them on your behalf.
* After completing a remediation, you can verify the fix by running the scanner on demand rather than waiting for the next scheduled
  run. For more information, see Run a scanner package on demand.

## View Trust Center detection findings

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

The Detections tab displays information about the detection findings reported by the Trust Center and lets
you examine them:

> **Note:**
>
> Currently, you can’t manage the lifecycle — that is, mute or reopen — a detection finding. Detection findings aren’t currently aggregated
> into the Organization account.

### View detections

To view detections, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role that has either the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role or the `SNOWFLAKE.TRUST_CENTER_VIEWER`
   application role granted to it.

   For more information about granting these roles, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Detections tab.

   A chart displays information about detections in the specified time period. You can adjust filters
   to modify the detections displayed on the tab. See the next step for information about modifying filters.

   The detections bar displays information about each detection, such as the detection type, entity type,
   entity name, and additional information.
5. To analyze the detections displayed on the tab, adjust the filters:

   * Detection Type - Clear the filter to show detections of any type, or select a type to show only detections of that type; for
     example, Abnormal Account Activities, Insecure Login, or Privilege Escalation.
   * Severity - Clear the filter to show detections of any severity, or select a severity to show only detections of that severity;
     for example, Critical, High, Medium, or Low.
   * Entity Type - Clear the filter to show detections for any entity type, or select an entity type to show only detections for that
     entity type; for example, QUERY, ROLE, or USER.
   * Reported By - Clear the filter to show detections reported by all scanners in the **Security Essentials** and **Threat Intelligence**
     scanner packages, or select a scanner package to only show detections reported by scanners in that scanner package.
   * Time Range - Clear the filter to show all detections that were reported at any time or select a time range to view detections
     reported in the selected time range.
6. To see a detailed pane with the detection’s summary, remediation recommendations, and activity, select any detection.

   To open a worksheet with queries that you can run to get more information on the scanner output,
   on the Remediation tab, select Open a Worksheet.

## Manage security risks

You can complete the following tasks to manage security risks in the Trust Center:

* View security risks.
* Remediate security risks.

### View security risks

To view security risks, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role with the `SNOWFLAKE.TRUST_CENTER_VIEWER` or `SNOWFLAKE.TRUST_CENTER_ADMIN`
   application role granted to it.

   For more information about granting these roles, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Violations tab.
5. Select a recommendation from the list of violations to view details about the violation associated with the recommendation.
6. Optionally, select Severity, Violations, or Search to filter the list of recommendations shown.

### Remediate security risks

When viewing individual security risks, you can learn how to remediate the risks associated
with the recommendations that display, allowing you to harden the security of your account.

To remediate security risks, follow these steps:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. Switch to a role that has the `SNOWFLAKE.TRUST_CENTER_ADMIN` application role granted to it.

   For more information about granting these roles, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Violations tab.
5. From the list of violations, select a recommendation.
6. In the Remediation tab, follow the steps that are shown.

---
title: Using the tutorials
source: https://docs.snowflake.com/en/user-guide/data-load-tutorials-using.md
section: User Guide
---

# Using the tutorials

This topic provides general usage information for the data loading tutorials.

## Privileges for objects used in tutorials

### Practice databases and schemas

For convenience and to avoid mixing your data, we recommend that you create a separate database and/or schema for completing practice exercises, including Snowflake tutorials.

If you create a practice database/schema, make certain to grant the USAGE privilege on the database/schema to any roles for the users who will complete the tutorials. Also
grant the following schema privileges, which are required to create specific objects in the schema:

* CREATE FILE FORMAT
* CREATE STAGE
* CREATE TABLE

### Virtual warehouses

The tutorials also require an active warehouse to load data and execute queries. Grant the following virtual warehouse privileges to the same role:

* OPERATE
* USAGE

### Example

```sqlexample
grant usage on database mydb to role analyst;

grant usage, create file format, create stage, create table on schema mydb.public to role analyst;

grant operate, usage on warehouse mywh to role analyst;
```

## Credit usage

Each tutorial requires an estimated 30 minutes or less to complete, resulting in an average Snowflake credit usage of less than 1 credit (if an X-Small warehouse is used).

---
title: Using Trust Center extensions
source: https://docs.snowflake.com/en/user-guide/trust-center/trust-center-extensions.md
section: User Guide
---

# Using Trust Center extensions

To integrate solutions with the Trust Center, security partners can use the
[Snowflake Native App Framework](../../developer-guide/native-apps/native-apps-about.md) to create applications that
provide one or more additional scanner packages. These applications are called *Trust Center extensions*.

You can create Trust Center extensions to tailor security, privacy, governance, and compliance solutions to better
fit your requirements, and then share the extensions in your organization. You can also create extensions that can
be used more broadly and list them to specific Snowflake accounts or on the [Snowflake Marketplace](https://other-docs.snowflake.com/collaboration/collaboration-marketplace-about). For more information,
see Develop a Trust Center extension.

Users can discover, install, and manage third-party extensions that contain scanner packages. For more information,
see Install Trust Center extensions.

## Access control requirements

To create and manage Trust Center extensions, a user with the
[ACCOUNTADMIN role](../security-access-control-overview.md) must grant
the following privileges to your role:

* SNOWFLAKE.TRUST_CENTER_ADMIN [application role](../../developer-guide/native-apps/creating-setup-script.md)
* CREATE APPLICATION PACKAGE
* CREATE APPLICATION

## Develop a Trust Center extension

You can develop and deploy a Trust Center extension with scanner packages. You can version your Trust Center
extension by using [Native App versioning](../../developer-guide/native-apps/update-app-overview.md). Extensions
also use the [Native App privilege model](../../developer-guide/native-apps/requesting-about.md) to access any
data or metadata, such as tables within a customer account or Account Usage views.

### Prerequisites

Before you develop an extension with scanner packages, complete the following prerequisites:

* Understand how to develop a [Native App](../../developer-guide/native-apps/native-apps-about.md).
* Understand how to create and use Snowflake [stored procedures](../../developer-guide/extensibility.md).
* Create or identify a Snowflake account that can act as an extension provider account. Every Native App
  requires a provider account.

### Create a scanner package manifest and scanners

To create a scanner package manifest and scanners, complete the following steps:

1. Create an extension manifest file.
2. Create scanners.
3. Create an extension.
4. Grant privileges.
5. Register the extension.
6. Test the extension.

#### Create an extension manifest file

Create a manifest file that contains information and metadata about the various scanner packages and scanners:

1. Create a manifest file.

   The manifest file has the following requirements:

   * The name of the manifest file must be `tc_extension_manifest.yml`.
   * The `tc_extension_manifest.yml` file must exist at the root of the directory structure
     on the named stage where the Native App `manifest.yml` file resides.

   The manifest file lists the scanner package properties and all of the scanners that are included in the scanner package.

   Use the following definition for the manifest file:

   ```yaml
   manifest_version: '2.0'
   scanner_packages:
   - id: ''
     name: ''
     short_description: ''
     description: ''
     scanners:
       - id: ''
         name: ''
         short_description: ''
         description: ''
         type: 'VULNERABILITY'
         callback:
           schema: ''
           name: ''
           version: ''
   ```

   The manifest file has the following properties:

   | Property | Description | Maximum number of characters |
   | --- | --- | --- |
   | `manifest_version` | Currently, only `2.0` is valid. | Not applicable |
   | `scanner_packages.id` | A unique identifier for the scanner package, which the provider must maintain for the scanner package’s lifetime. Only ASCII alphanumeric and underscore characters are supported. All of the configurations that the customer applies to a scanner package are persisted in Trust Center using this ID. | 25 |
   | `scanner_packages.name` | The name of the scanner package. | 30 |
   | `scanner_packages.short_description` | The short description of the scanner package. | 150 |
   | `scanner_packages.description` | The description of the scanner package. | 700 |
   | `scanner_packages.scanners.id` | A unique identifier for the scanner, which the provider must maintain for the scanner’s lifetime. Only ASCII alphanumeric and underscore characters are supported. All of the configurations that customers apply to a scanner are persisted in Trust Center using this ID. | 25 |
   | `scanner_packages.scanners.name` | The name of the scanner. | 30 |
   | `scanner_packages.scanners.short_description` | The short description of the scanner. | 150 |
   | `scanner_packages.scanners.description` | The long description of the scanner. | 1,500 |
   | `scanner_packages.scanners.type` | The type of the scanner. Currently, only `VULNERABILITY` is supported. | — |
   | `scanner_packages.scanners.callback` | The callback section for the scanner. Every scanner must have a `callback` section that specifies its `schema`, `name`, and `version`. | Not applicable |
   | `scanner_packages.scanners.callback.schema` | The schema for the stored procedure. The schema must exist in the `setup_script.sql` file. For more information about this file, see Create an extension. | Not applicable |
   | `scanner_packages.scanners.callback.name` | The name of the stored procedure. The following requirements apply to the stored procedure:  * Currently, it must be named `scan`. * The stored procedure name that is defined here must exist in the `setup_script.sql` file under the schema   that is specified in `callback.schema`. | Not applicable |
   | `scanner_packages.scanners.callback.version` | The version of the stored procedure. Currently, only `1.0` is supported. | Not applicable |

The following example shows the contents of a manifest file:

```yaml
manifest_version: '2.0'
scanner_packages:
  - id: 'se_extension'
    name: 'Security Extension'
    short_description: 'Enhances security features and capabilities.'
    description: 'This extension provides additional security features and capabilities to the platform.'
    scanners:
      - id: 'es_check'
        name: 'NA event sharing check'
        short_description: 'Checks for NA event sharing configurations.'
        description: 'This scanner checks for event sharing configurations in the North America region.'
        type: 'VULNERABILITY'
        callback:
          schema: 'security_essentials_na_consumer_es_check'
          name: 'scan'
          version: '1.0'
      - id: 'se_mfa'
        name: 'MFA Required for Users'
        short_description: 'Ensures that MFA is required for all users.'
        description: 'This scanner checks that Multi-Factor Authentication (MFA) is enforced for all users in the system.'
        type: 'VULNERABILITY'
        callback:
          schema: 'security_essentials_mfa_required_for_users_check'
          name: 'scan'
          version: '1.0'
      - id: 'se_client'
        name: 'Client Security'
        short_description: 'Ensures that client security best practices are followed.'
        description: 'This scanner checks that client security best practices are enforced for all clients in the system.'
        type: 'VULNERABILITY'
        callback:
          schema: 'security_essentials_client_security'
          name: 'scan'
          version: '1.0'
      - id: 'cis_1_4'
        name: 'Extension CIS 1_4'
        short_description: 'Checks for compliance with CIS Benchmark 1.4.'
        description: 'This scanner checks for compliance with the CIS Benchmark 1.4, ensuring that security best practices are followed.'
        type: 'VULNERABILITY'
        callback:
          schema: 'security_essentials_cis1_4'
          name: 'scan'
          version: '1.0'
      - id: 'cis_3_1'
        name: 'Extension CIS 3_1'
        short_description: 'Checks for compliance with CIS Benchmark 3.1.'
        description: 'This scanner checks for compliance with the CIS Benchmark 3.1, ensuring that security best practices are followed.'
        type: 'VULNERABILITY'
        callback:
          schema: 'security_essentials_cis3_1'
          name: 'scan'
          version: '1.0'
```

#### Create scanners

Create a [versioned schema](../../developer-guide/native-apps/versioned-schema.md) and a stored procedure
that implements the scanner logic.

If the scanner package contains multiple scanners, then complete these steps for each scanner,
using a different versioned schema for each scanner:

1. Create a versioned schema to host the scanner logic.

   The name of the schema must be the same as the schema specified for the scanner in the
   extension manifest file.

   For example, the following SQL statement creates a versioned schema that is named
   `security_essentials_mfa_required_for_users`:

   ```sqlexample
   CREATE OR ALTER VERSIONED SCHEMA security_essentials_mfa_required_for_users;
   ```
2. Create a stored procedure that implements the scanner logic.

   The following example creates a stored procedure named `scan` in the `security_essentials_mfa_required_for_users`
   schema:

   ```sqlexample
   CREATE OR REPLACE PROCEDURE security_essentials_mfa_required_for_users.scan(
       run_id VARCHAR)
     RETURNS TABLE(
       risk_id VARCHAR,
       risk_name VARCHAR,
       total_at_risk_count NUMBER,
       scanner_type VARCHAR,
       risk_description VARCHAR,
       suggested_action VARCHAR,
       impact VARCHAR,
       severity VARCHAR,
       at_risk_entities ARRAY
     )
     LANGUAGE SQL
     AS
     $$
       -- Scanning logic --
     $$;
   ```

   Verify that the stored procedure returns exactly one row for each severity and risk ID combination.

   The returned table must have the following columns:

   | Column | Type | Description |
   | --- | --- | --- |
   | `risk_id` | VARCHAR | The identifier for the risk. |
   | `risk_name` | VARCHAR | The name of the risk. |
   | `total_at_risk_count` | NUMBER | Total number of entities at risk for a scanner. For scenarios where the scanner doesn’t detect any violations, the value is `0`. The maximum number of at-risk entities is 1,000, and the maximum combined size of all values in an [array](../../sql-reference/data-types-semistructured.md) is 128 MB. |
   | `scanner_type` | VARCHAR | Currently, only the `VULNERABILITY` scanner type is supported. |
   | `risk_description` | VARCHAR | The description of the risk. |
   | `suggested_action` | VARCHAR | Suggested action for remediation. |
   | `impact` | VARCHAR | Possible consequences of not addressing the risk. |
   | `severity` | VARCHAR | The severity level of the risk. The possible values are LOW, MEDIUM, HIGH, and CRITICAL. |
   | `at_risk_entities` | ARRAY of OBJECT values | The OBJECT values in the array have the following structure:  ```output [   {     "entity_id": <id>,     "entity_name": "<name>",     "entity_object_type": "<type>",     "entity_detail": {       ..., -- custom data     }   },   ... ] ```  The OBJECT values contain the following key-value pairs:  * `entity_id` - An optional field that corresponds to the ID of the entity at risk. * `entity_name` - A required field that corresponds to the name of the entity at risk. * `entity_object_type` - A required field that corresponds to the type of the entity at risk.   For example: `APPLICATION`, `TASK`, `NETWORK_POLICY`, `SECURITY_INTEGRATION`, `ROLE`,   `PROCEDURE`, `QUERY`, `DRIVER`, `PARAMETER`, `TABLE`, `STAGE`, `DATA_MASKING_POLICY`,   or `ROW_ACCESS_POLICY`. * `entity_detail` - Custom data that describes the entity. The maximum size of an array is 128 MB.  For scenarios where the scanner doesn’t detect any violations, the value is an empty list. |

#### Create an extension

An *extension* bundles scanner packages in a Native App, makes them accessible to the Trust Center, and
configures the privileges to allow the Trust Center to invoke the required stored procedures.

To create an extension, complete the following steps:

1. Create a `setup_script.sql` file for the extension by following the instructions in
   [Create the setup script](../../developer-guide/native-apps/creating-setup-script.md).

   In the `setup_script.sql` file, create an application role that will provide Snowflake with the privileges to use the schema and procedure that have the scanner logic. For example, you can create an application role named `trust_center_integration_role`.

   Then, grant the required privileges on
   the versioned schema and stored procedure to
   that application role.

   The following example shows how to create the application role `trust_center_integration_role`, and then
   grant the required privileges:

   ```sqlexample
   CREATE APPLICATION ROLE IF NOT EXISTS trust_center_integration_role;

   GRANT USAGE ON SCHEMA security_essentials_mfa_required_for_users
     TO APPLICATION ROLE trust_center_integration_role;

   GRANT USAGE ON PROCEDURE security_essentials_mfa_required_for_users.scan(VARCHAR)
     TO APPLICATION ROLE trust_center_integration_role;
   ```

   The privileges are required for every scanner in the package.
2. Create a `manifest.yml` file for the extension by following the instructions in
   [Create the manifest file for an app](../../developer-guide/native-apps/manifest-overview.md).

   The following example shows the contents of a `manifest.yml` file for a Trust Center extension:

   ```yaml
   manifest_version: 1
   artifacts:
     setup_script: setup_script.sql
     readme: README.md
   privileges:
     - IMPORTED PRIVILEGES ON SNOWFLAKE DB:
       description: "Required access to SNOWFLAKE.ACCOUNT_USAGE views to scan for vulnerabilities"
   ```
3. Create an application package for the extension by following the instructions in
   [Create and manage an application package](../../developer-guide/native-apps/creating-app-package.md).
4. Register a version of the application package by following the instructions in
   [Register a version](../../developer-guide/native-apps/release-channels.md).

   To confirm that the application package has registered versions, you can run the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md).
5. Create an application that is based on a registered version by following the instructions in
   [Create an app from a version or patch](../../developer-guide/native-apps/installing-testing-application.md).

   To confirm that the application object was created, you can run the [SHOW APPLICATIONS](../../sql-reference/sql/show-applications.md).

#### Grant privileges

After you install the extension, grant the required privileges by completing the following steps:

1. Grant the privileges requested by the extension by following the instructions in [Manage access requests using Snowsight](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs#manage-access-requests-using-snowsight).
2. To grant the `trust_center_integration_role` application role in the namespace of the extension to the SNOWFLAKE
   application, run the [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md) command:

   ```sqlexample
   GRANT APPLICATION ROLE <extension_name>.trust_center_integration_role
     TO APPLICATION snowflake;
   ```
3. To grant the application role in the namespace of the extension to the SNOWFLAKE
   application, run the [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md) command:

   ```sqlexample
   GRANT APPLICATION ROLE <extension_name>.<role_name>
     TO APPLICATION snowflake;
   ```

   For example, to grant the `tc_extension.trust_center_integration_role` application role (that you created earlier) to the SNOWFLAKE application,
   run the following command:

   ```sqlexample
   GRANT APPLICATION ROLE tc_extension.trust_center_integration_role
     TO APPLICATION snowflake;
   ```

#### Register the extension

You can register or deregister an extension by calling the following stored procedures:

* [SNOWFLAKE.TRUST_CENTER.REGISTER_EXTENSION](../../sql-reference/stored-procedures/register_extension.md)
* [SNOWFLAKE.TRUST_CENTER.DEREGISTER_EXTENSION](../../sql-reference/stored-procedures/deregister_extension.md)

To register an extension with the Trust Center, complete the following steps:

1. Switch to a role with the SNOWFLAKE.TRUST_CENTER_ADMIN application role granted to it.
2. Call the [SNOWFLAKE.TRUST_CENTER.REGISTER_EXTENSION](../../sql-reference/stored-procedures/register_extension.md)
   stored procedure.

   To view details about the extension, you can run the [SHOW APPLICATIONS](../../sql-reference/sql/show-applications.md) command. The
   application package or listing identifier is in the `source` column.

   For example, to register an extension named `tc_extension` that was installed from the application package named `my_tc_package`,
   call the stored procedure:

   ```sqlexample
   CALL SNOWFLAKE.TRUST_CENTER.REGISTER_EXTENSION(
     'APPLICATION PACKAGE',
     'my_tc_package',
     'tc_extension');
   ```

   You can display information about your registered extensions by querying the
   [EXTENSIONS view](../../sql-reference/trust_center/extensions.md).

   > **Note:**
   >
   > To deregister an extension, call the
   > [SNOWFLAKE.TRUST_CENTER.DEREGISTER_EXTENSION](../../sql-reference/stored-procedures/deregister_extension.md)
   > stored procedure.
3. Confirm that the scanner package provided by the extension is now in the list of Trust Center scanner packages by
   following the instructions in [View available scanner packages](using-the-trust-center.md).

#### Test the extension

After granting the privileges and enabling the scanner package, test the extension and examine the results generated
by the scanner by querying the SNOWFLAKE.TRUST_CENTER.FINDINGS view. If a scanner run has failed, you can check
the `ERROR_CODE` and `ERROR_MESSAGE` to debug the scanner failure.

You can also monitor telemetry data for Trust Center extensions by using the views in the
[DATA_SHARING_USAGE schema](../../sql-reference/data-sharing-usage.md). For example, you can find the number of installed
instances of the extension by querying the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md), and you can
monitor consumer usage of an extension by querying the [LISTING_ACCESS_HISTORY view](../../sql-reference/data-sharing-usage/listing-access-history.md).

## Install Trust Center extensions

You can discover, install, and manage third-party extensions that contain scanner packages.

### Install and manage third-party scanner packages

To install and manage a third-party scanner package, complete the following steps:

1. Discover and install extensions.
2. Manage the new scanner packages.

#### Discover and install extensions

You can discover and install a public Trust Center extension that was published to the [Snowflake Marketplace](https://other-docs.snowflake.com/collaboration/collaboration-marketplace-about) or a private one that was shared with you by using private listings. Trust Center extensions can contain one or more scanner packages.

To discover and install a public extension, follow these steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role that has been granted the SNOWFLAKE.TRUST_CENTER_ADMIN application role.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.
5. To view a list of public extensions that are available to your account, select Extensions.
6. Select the public extension that you want to install.

   The Snowflake Marketplace page for the extension opens.
7. To access the listing, select Get.
8. Optional: For Application name, enter a name.
9. To install the extension, select Get.

   Consumer registration of public extensions with the Trust Center happens automatically.

To discover and install a private extension, follow these steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role that has been granted the SNOWFLAKE.TRUST_CENTER_ADMIN application role.
3. Install the private extension, using [these instructions](../../developer-guide/native-apps/ui-consumer-installing.md).
4. Grant the TRUST_CENTER_INTEGRATION_ROLE to the Snowflake application.
5. Register the private extension with the Trust Center, as described in Register the extension.

> **Note:**
>
> A consumer must register private extensions with the Trust Center after installing them. The Trust Center does not generate notifications or
> send email after a private extension is installed.

When the installation of a public extension is complete, a Snowsight notification appears, and an email is sent to the email address
associated with your account.

For more information about installing Native Apps, see [Use and manage Snowflake Native Apps as a consumer](../../developer-guide/native-apps/ui-consumer-about.md).

#### Manage the new scanner packages

Installing a Trust Center extension adds one or more scanner packages to the Trust Center. To view the newly installed scanner
packages, complete the following steps:

1. [Sign in to Snowsight](../connecting.md).
2. Switch to a role that has been granted the SNOWFLAKE.TRUST_CENTER_ADMIN application role.

   For more information about granting this role, see [Required roles](overview.md).
3. In the navigation menu, select Governance & security » Trust Center.
4. Select the Manage scanners tab.

   In the list of scanner packages, the following information is displayed for each new scanner package:

   * NAME - The name of the new scanner package.
   * SOURCE - The name of the extension that you installed.
   * SCANNERS - The number of enabled and disabled scanners in the scanner package.
   * STATUS - The status of the scanner package. By default, newly installed scanner packages are disabled.
5. To enable a new scanner package, complete the following steps:

   1. In the list of scanner packages, select the scanner package.
   2. On the scanner package page, select Enable package.
   3. To grant the privileges required by the new scanner package, select Grant.
   4. Select Enable.

   Repeat these steps for each new scanner package that you want to enable.

You can manage the new scanner package in the same way that you manage other scanner packages in the Trust Center. For
example, you can schedule or disable the new scanner package. For more information, see
[Manage scanner packages](using-the-trust-center.md).

You can manage the scanners in the new scanner package in the same way that you manage other scanners. For example,
you can enable, disable, or schedule a scanner. For more information, see [Managing scanners](using-the-trust-center.md).

You can also monitor and manage the Native App associated with the extension directly. For more information, see
[Manage apps](../../developer-guide/native-apps/ui-consumer-managing-applications.md).

You can view the findings generated by the scanner packages that are installed with the extension by querying the
SNOWFLAKE.TRUST_CENTER.FINDINGS view. For example, the following query returns the findings for
the scanner packages that are installed with an extension that has a `extension_id` of `4486988721`:

```sqlexample
SELECT * FROM snowflake.trust_center.findings WHERE extension_id = 4486988721;
```

To find the identifiers for registered extensions, query the [EXTENSIONS view](../../sql-reference/trust_center/extensions.md).

For more information about Trust Center findings, see [Trust Center findings](overview.md)
and [View security risks](using-the-trust-center.md).

### Troubleshooting extension installation and registration

If a query on the SNOWFLAKE.TRUST_CENTER.FINDINGS view returns `FAILED` in the
`COMPLETION_STATUS` column, then scanner execution has failed. One possible reason for scanner
failure is that the extension wasn’t granted the required privileges. Ensure that the extension was
granted the privileges that are described in Grant privileges.

After you grant the required privileges, run the scanner package again to generate new findings.
If a query on the SNOWFLAKE.TRUST_CENTER.FINDINGS view still returns `FAILED` in the
`COMPLETION_STATUS` column, then contact Snowflake Support.

---
title: Versions for dbt project objects and files
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-versions.md
section: User Guide
---

# Versions for dbt project objects and files

Snowflake maintains immutable versions of dbt project objects and their source files. This versioning lets you track and manage changes
throughout the development lifecycle.

> **Note:**
>
> dbt project object versions are distinct from the dbt Core version used for execution. For more information, see [Supported dbt Core versions for dbt Projects on Snowflake](dbt-projects-on-snowflake-dbt-core-versions.md).

Snowflake identifies dbt project object versions in the dbt project stage, as shown in the following example.

`snow://dbt/my_db.my_schema.my_dbt_project_object/versions/version_id`

`version_id` can be any of the following identifiers:

| Identifier | Description |
| --- | --- |
| `VERSION$num` | Specifies a version identifier in the form `VERSION$num`, where `num` is a positive integer. For example, `VERSION$1`.  The version number begins at `1` when you create a dbt project object and increments by one with each new version of the dbt project object.  Snowflake increments the version identifier when you perform the following tasks:   * Redeploy dbt project from a workspace (runs the ALTER command with the ADD VERSION option). * Update the project by using the [ALTER DBT PROJECT](../../sql-reference/sql/alter-dbt-project.md) command. * Run the Snow CLI `snow dbt deploy` command with the `--force` option.   Snowflake resets the version identifier to `1` and removes all version aliases when you run the CREATE DBT PROJECT command with the `OR REPLACE` option. |
| `LAST` | Indicates the most recent version of the dbt project object. |
| `FIRST` | Indicates the oldest version of the dbt project object. |
| `version_name_alias` | Indicates a custom version name alias that you have created for a specific version of the dbt project object using the [ALTER DBT PROJECT](../../sql-reference/sql/alter-dbt-project.md) command with the ADD VERSION option. A version name alias always maps to a specific version identifier, such as `VERSION$3`. |

Project files stored in the dbt project stage are organized by version, with each version having its own subdirectory. For example, a dbt
project object named `my_dbt_project_object` with a version identifier of `VERSION$3` and a dbt project file named `dbt_project.yml`
can be referenced as `snow://dbt/my_db.my_schema.my_dbt_project_object/versions/VERSION$3/dbt_project.yml`.

---
title: View and manage information for existing dbt Projects
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-manage.md
section: User Guide
---

# View and manage information for existing dbt Projects

This topic covers how to explore the structure and metadata of an existing dbt project object. This includes viewing the project’s DAG,
inspecting model and source details, and running dbt projects.

## Browse the project DAG to see model lineage and dependencies

The Directed Acyclic Graph (DAG) shows how dbt models depend on each other, visualizing data lineage so you can:

* Verify where a model is built (database.schema), how it materializes, and which upsteam and downstream dependencies it has.
* Spot and improve inefficient model designs to support better performance and scalability.

To browse the project DAG in Snowsight, navigate to Databases » your database » your schema »
dbt Projects and select your project. The project details page displays the Graph of your models and their
relationships, along with a Description of your project, the dbt Project definition, and Privileges.

In the Graph, click a model node to inspect model, source, or test details (such as compiled SQL and configuration) directly from the DAG.

> **Tip:**
>
> If you’re working in a workspace, you can also reach the project details page by selecting Connect »
> View project from the workspace editor. For more information, see
> [Workspaces for dbt Projects on Snowflake](dbt-projects-on-snowflake-using-workspaces.md).

### Inspect model details from the DAG

When you select a model node in the DAG, the model details view opens, showing:

* The model’s type, file path, target object, row count, and column count.
* A description of the model (if one is defined in the dbt project).
* Model lineage, listing upstream and downstream dependencies with links to navigate between them.
* The source and compiled SQL for the model.

### Execute models from the DAG

You can run a subset of your dbt project directly from the DAG by selecting the … menu on a model node. The
following execution options are available:

| Menu option | What it executes | Equivalent `--select` flag |
| --- | --- | --- |
| Execute model | Only the selected model | `--select model_name` |
| Execute model+ | The model and all downstream dependents | `--select model_name+` |
| Execute +model | The model and all upstream parents | `--select +model_name` |
| Execute +model+ | The model, its parents, and its children | `--select +model_name+` |

Selecting any option opens the Execute dbt project dialog with the Additional flags field pre-filled with the corresponding
`--select` value. From the dialog, you can:

* Choose the operation, such as Run, Test, or Build.
* Choose the profile target (for example, dev or prod).
* Edit the flags before executing if you want to refine the selection.

You can use the same `--select` syntax with the `+` graph operators in SQL and the Snowflake CLI:

```sqlexample
EXECUTE DBT PROJECT my_dbt_project
  ARGS = 'build --select +stg_customers+ --target dev';
```

For more information about supported dbt commands and flags, see [Supported dbt commands and flags](dbt-projects-on-snowflake-supported-commands.md).

## View dbt project object properties

View the metadata Snowflake stores about a dbt project object to see what it’s called, who owns it, which version is the default, and where
its files live in Snowflake’s internal `snow://dbt/...` stage.

To view the properties (such as name, owner, comment) of a specific dbt project, use the DESCRIBE DBT PROJECT command, as shown in the
following example:

```sqlexample
DESCRIBE DBT PROJECT my_dbt_project;
```

The output shows the object’s name, owner, comment, versioning details, and external access integration. For more information, see
[DESCRIBE DBT PROJECT](../../sql-reference/sql/desc-dbt-project.md).

### View all dbt projects

Use SHOW DBT PROJECTS when you want to see all dbt project objects you can access, plus key metadata.

```sqlexample
SHOW DBT PROJECTS IN DATABASE mydb;
```

The output shows each object’s database, schema, owner, comment, when it was created and last updated, versioning details, and external access
integration. For more information, see [SHOW DBT PROJECTS](../../sql-reference/sql/show-dbt-projects.md).

Alternatively, use the [snow dbt list](../../developer-guide/snowflake-cli/command-reference/dbt-commands/list.md) command. For more information, see
[Listing all available dbt project objects](../../developer-guide/snowflake-cli/data-pipelines/dbt-projects.md).

---
title: View and track sensitive data classification results
source: https://docs.snowflake.com/en/user-guide/classify-results.md
section: User Guide
---

# View and track sensitive data classification results

This topic describes how you can view and track the results of sensitive data classification and how you can track classification tags to
monitor sensitive data.

## Use the Trust Center to view classification results

To view results of sensitive data classification in the Trust Center, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md) as a user with the [required privileges](classify-ui-trust-center.md).
2. In the navigation menu, select Governance & security » Trust Center.
3. Select the Data Security tab.
4. Do one of the following:

   * If you want to gain high-level insights into the security of your sensitive data, select the Dashboard tab. For more information,
     see Review the Dashboard page.
   * If you want to list all of the tables and views that have been classified as containing sensitive data, select the
     Sensitive objects tab.

     When the page opens, select a table to see which columns have sensitive data, the
     [semantic category](classify-intro.md) of those columns, and whether tags were applied to the columns.

### Review the Dashboard page

The Dashboard page provides high-level insights into the security of your sensitive data, such as how many databases and tables have been classified. The page contains the following tiles:

| Tile | Description |
| --- | --- |
| Objects by compliance category | Identifies the number of objects that contain data that might be subject to a regulation or other compliance standard, based on the type of information in the object.  **Note:** The mapping between a compliance category and semantic categories is not exhaustive. Only native semantic categories supported by Snowflake are mapped to a compliance category. For the complete mapping, see Compliance categories and their semantic categories.  You are solely responsible for determining which regulations or laws apply to your data and ensuring your compliance with the applicable regulations or laws. |
| Objects by semantic category | Identifies the most common semantic categories, and the number of objects that contain data that belong to those categories. |
| Databases monitored by auto-classification | Identifies which databases are currently monitored by sensitive data classification. A database is partially monitored if someone used SQL to set a classification profile directly on a schema in the database rather than setting the profile at the database level. |
| Classification status | Identifies whether all databases currently being monitored for sensitive data have been classified. |
| Sensitive data masking status | Identifies whether sensitive data is protected by a [masking policy](security-column-ddm-intro.md). The masking policy can be a tag-based policy or one that was manually applied to the column.  A table is *fully masked* if every column that contains sensitive data has a masking policy associated with it. A table is *partially masked* if only some columns containing sensitive data are associated with a masking policy. |

### Compliance categories and their semantic categories

> **Note:**
>
> You are solely responsible for determining which regulations or laws apply to your data and ensuring your compliance with the applicable
> regulations or laws. The compliance categories within sensitive data classification are designed to give you an out-of-the-box toolkit to
> aid your efforts, but are not exhaustive. Only [native semantic categories](classify-native.md) supported by Snowflake are
> mapped to a compliance category.

> **Warning:**
>
> HIPAA data requirements mandate that covered entities and business associates protect the confidentiality, integrity, and availability of
> Protected Health Information (PHI) through strict administrative, physical, and technical safeguards. Non-compliance with HIPAA can lead
> to significant penalties. Semantic categories related to PHI are included in [Sensitive information](classify-native.md).

Use the following table to understand the Objects by compliance category tile on the Dashboard page.

| Compliance category | Native semantic category | Locale |
| --- | --- | --- |
| Digital Personal Data Protection Act (DPDPA) | DATE_OF_BIRTH | n/a |
|  | DRIVERS_LICENSE | India (IN) |
|  | EMAIL | n/a |
|  | NAME | n/a |
|  | NATIONAL_IDENTIFIER | India (IN) |
|  | PHONE_NUMBER | n/a |
|  | STREET_ADDRESS | n/a |
|  | TAX_IDENTIFIER | India (IN) |
| General Data Protection Regulation (GDPR) | AGE | n/a |
|  | DRIVERS_LICENSE | Austria (AT), Belgium (BE), Bulgaria (BG), Croatia (HR), Cyprus (CY), Czech Republic (CZ), Denmark (DK), Estonia (EE), Finland (FI), France (FR), Germany (DE), Greece (GR), Hungary (HU), Ireland (IE), Italy (IT), Latvia (LV), Lithuania (LT), Luxembourg (LU), Malta (MT), Netherlands (NL), Poland (PL), Portugal (PT), Romania (RO), Slovakia (SK), Slovenia (SI), Spain (ES), Sweden (SE) |
|  | EMAIL | n/a |
|  | ETHNICITY | n/a |
|  | GENDER | n/a |
|  | IBAN | n/a |
|  | IMEI | n/a |
|  | IP_ADDRESS | n/a |
|  | NAME | n/a |
|  | NATIONAL_IDENTIFIER | Austria (AT), Belgium (BE), Bulgaria (BG), Croatia (HR), Cyprus (CY), Czech Republic (CZ), Denmark (DK), Estonia (EE), Finland (FI), France (FR), Germany (DE), Greece (GR), Hungary (HU), Ireland (IE), Latvia (LV), Lithuania (LT), Luxembourg (LU), Malta (MT), Netherlands (NL), Poland (PL), Portugal (PT), Romania (RO), Slovakia (SK), Slovenia (SI), Spain (ES), Sweden (SE), United Kingdom (UK) |
|  | PASSPORT | Austria (AT), Belgium (BE), Bulgaria (BG), Croatia (HR), Cyprus (CY), Czech Republic (CZ), Denmark (DK), Estonia (EE), Finland (FI), France (FR), Germany (DE), Greece (GR), Hungary (HU), Ireland (IE), Italy (IT), Latvia (LV), Lithuania (LT), Luxembourg (LU), Malta (MT), Netherlands (NL), Poland (PL), Portugal (PT), Romania (RO), Slovakia (SK), Slovenia (SI), Spain (ES), Sweden (SE) |
|  | PAYMENT_CARD | n/a |
|  | PHONE_NUMBER | n/a |
|  | SALARY | n/a |
|  | TAX_IDENTIFIER | Austria (AT), Cyprus (CY), France (FR), Germany (DE), Greece (GR), Hungary (HU), Italy (IT), Malta (MT), Netherlands (NL), Poland (PL), Portugal (PT), Slovenia (SI), Spain (ES), Sweden (SE) |
|  | VIN | n/a |
| Gramm-Leach-Bliley Act (GLBA) | BANK_ACCOUNT | United States (US) |
|  | DRIVERS_LICENSE | United States (US) |
|  | NAME | United States (US) |
|  | NATIONAL_IDENTIFIER | United States (US) |
|  | PASSPORT | United States (US) |
|  | PAYMENT_CARD | n/a |
|  | STREET_ADDRESS | United States (US) |
|  | TAX_IDENTIFIER | United States (US) |
| Health Insurance Portability and Accountability Act (HIPAA) | ADMINISTRATIVE_AREA_1 | United States (US) |
|  | ADMINISTRATIVE_AREA_2 | United States (US) |
|  | AGE | n/a |
|  | CITY | United States (US) |
|  | DATE_OF_BIRTH | n/a |
|  | EMAIL | n/a |
|  | ETHNICITY | n/a |
|  | IMEI | n/a |
|  | IP_ADDRESS | n/a |
|  | MEDICAL_DATA | n/a |
|  | MEDICAL_SPECIALTY | n/a |
|  | NAME | n/a |
|  | NATIONAL_IDENTIFIER | United States (US) |
|  | PHONE_NUMBER | United States (US) |
|  | POSTAL_CODE | United States (US) |
|  | STREET_ADDRESS | United States (US) |
|  | URL | n/a |
|  | VIN | n/a |
| Payment Card Industry (PCI) | PAYMENT_CARD | n/a |
| Personally identifiable information (PII) | DATE_OF_BIRTH | n/a |
|  | DRIVERS_LICENSE | n/a |
|  | EMAIL | n/a |
|  | NAME | n/a |
|  | NATIONAL_IDENTIFIER | n/a |
|  | PHONE_NUMBER | n/a |
|  | STREET_ADDRESS | n/a |
|  | TAX_IDENTIFIER | n/a |

## Use SQL to view classification results

You can use SQL to view the results of data classification by calling a system function or querying an Account Usage view.

### Retrieve classification results for a specific table

Call the [SYSTEM$GET_CLASSIFICATION_RESULT](../sql-reference/functions/system_get_classification_result.md) function to view results for a specific table.

```sqlexample
CALL SYSTEM$GET_CLASSIFICATION_RESULT('mydb.sch.t1');
```

Results aren’t available until the classification process is complete. The automatic classification process doesn’t start until one hour after setting the classification profile on the database.

### Query the latest classification results

To view the latest classification results, query the [DATA_CLASSIFICATION_LATEST](../sql-reference/account-usage/data_classification_latest.md) view. Classification results before the latest are not shown. For example,
you can use a role that has been granted the SNOWFLAKE.GOVERNANCE_VIEWER database role. Other privileges can also provide access,
such as using ACCOUNTADMIN or having IMPORTED PRIVILEGES on the SNOWFLAKE database.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_LATEST;
```

Results might not appear until three hours after classification is complete. To view previous classification results, see Query the classification history.

### Query the classification history

To view all classification events over the last 365 days, query the [DATA_CLASSIFICATION_HISTORY](../sql-reference/account-usage/data_classification_history.md) view. For example, you can use a role that has been granted the SNOWFLAKE.GOVERNANCE_VIEWER
database role. Other privileges can also provide access, such as using ACCOUNTADMIN or having IMPORTED PRIVILEGES on the SNOWFLAKE
database.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_HISTORY;
```

Use the following examples to query classification history:

#### Filter classification history by database, schema, and table name

The following example returns all classification events for a specific table by filtering on database name, schema name, and table name, ordered from most recent to oldest:

```sqlexample
SELECT
    database_id,
    database_name,
    schema_id,
    schema_name,
    table_id,
    table_name,
    trigger_type,
    classified_on,
    table_deleted_on,
    result
  FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_HISTORY
  WHERE database_name = 'MY_DB'
    AND schema_name = 'MY_SCHEMA'
    AND table_name = 'EMPLOYEES'
  ORDER BY classified_on DESC;
```

The output shows two classification events for the same EMPLOYEES table: a manual classification from February 2025 that identified the EMAIL column, and a later auto classification from March 2025 that identified both the EMAIL and SSN columns. The results are ordered from most recent to oldest, showing how classification results can evolve over time.

```output
+-------------+---------------+-----------+-------------+----------+------------+---------------------+---------------------------+----------------+--------------------------------+
| DATABASE_ID | DATABASE_NAME | SCHEMA_ID | SCHEMA_NAME | TABLE_ID | TABLE_NAME | TRIGGER_TYPE        | CLASSIFIED_ON             | TABLE_DELETED_ON | RESULT                         |
+-------------+---------------+-----------+-------------+----------+------------+---------------------+---------------------------+----------------+--------------------------------+
| 10          | MY_DB         | 100       | MY_SCHEMA   | 1234     | EMPLOYEES  | AUTO CLASSIFICATION | 2025-03-01 08:00:00 -0800 | NULL           | {"EMAIL": {...}, "SSN": {...}} |
| 10          | MY_DB         | 100       | MY_SCHEMA   | 1234     | EMPLOYEES  | MANUAL              | 2025-02-15 14:30:00 -0800 | NULL           | {"EMAIL": {...}}               |
+-------------+---------------+-----------+-------------+----------+------------+---------------------+---------------------------+----------------+--------------------------------+
```

#### Filter by table ID

The following example filters the classification history by the table ID to return all classification events for a specific table, ordered
by most recent first:

> **Note:**
>
> Filtering by the ID can be useful if the table was renamed after classification.

```sqlexample
SELECT
    database_id,
    database_name,
    schema_id,
    schema_name,
    table_id,
    table_name,
    trigger_type,
    classified_on,
    table_deleted_on,
    result
  FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_HISTORY
  WHERE table_id = 1234
  ORDER BY classified_on DESC;
```

The output shows two classification events for the same table (ID 1234), even though the table was renamed from EMPLOYEES to EMPLOYEES_NEW between events. Because the query filters by table ID rather than name, both events are returned regardless of the name change.

```output
+-------------+---------------+-----------+-------------+----------+---------------+---------------------+---------------------------+----------------+--------------------------------+
| DATABASE_ID | DATABASE_NAME | SCHEMA_ID | SCHEMA_NAME | TABLE_ID | TABLE_NAME    | TRIGGER_TYPE        | CLASSIFIED_ON             | TABLE_DELETED_ON | RESULT                         |
+-------------+---------------+-----------+-------------+----------+---------------+---------------------+---------------------------+----------------+--------------------------------+
| 10          | MY_DB         | 100       | MY_SCHEMA   | 1234     | EMPLOYEES_NEW | AUTO CLASSIFICATION | 2025-03-01 08:00:00 -0800 | NULL           | {"EMAIL": {...}, "SSN": {...}} |
| 10          | MY_DB         | 100       | MY_SCHEMA   | 1234     | EMPLOYEES     | MANUAL              | 2025-02-15 14:30:00 -0800 | NULL           | {"EMAIL": {...}}               |
+-------------+---------------+-----------+-------------+----------+---------------+---------------------+---------------------------+----------------+--------------------------------+
```

#### Count classification events in the last seven days

The following example shows the number of classification events in the last seven days:

```sqlexample
SELECT
    COUNT(*) AS classification_count
  FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_HISTORY
  WHERE classified_on >= DATEADD(DAY, -7, CURRENT_TIMESTAMP());
```

```output
+----------------------+
| CLASSIFICATION_COUNT |
+----------------------+
| 42                   |
+----------------------+
```

#### Compare classification runs for one table

The following example compares the two most recent classification runs for a table and returns only the columns whose
classification changed between the runs. Each row in the result includes a `change_type` column with one of the
following values:

* `ADDED`: The column was not classified in the previous run. The `PREV_*` columns are NULL.
* `REMOVED`: The column was classified in the previous run but not in the current run. The `CURR_*` columns are NULL.
* `CHANGED`: The column exists in both runs but its semantic or privacy category differs.

Columns whose classification was identical across both runs are excluded from the results.

```sqlexample
WITH ranked AS (
    SELECT
        table_id,
        database_id,
        schema_id,
        database_name,
        schema_name,
        table_name,
        classified_on,
        trigger_type,
        result,
        ROW_NUMBER() OVER (PARTITION BY table_id ORDER BY classified_on DESC) AS rn
      FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_HISTORY
      WHERE table_id = 1234
    ),
  curr_cols AS (
      SELECT r.table_id, r.database_id, r.schema_id,
          r.database_name, r.schema_name, r.table_name,
          r.classified_on, r.trigger_type,
          c.key AS column_name, c.value AS column_result
        FROM ranked r, LATERAL FLATTEN(input => r.result) c
        WHERE r.rn = 1
  ),
  prev_cols AS (
      SELECT r.table_id,
          r.classified_on, r.trigger_type,
          c.key AS column_name, c.value AS column_result
        FROM ranked r, LATERAL FLATTEN(input => r.result) c
        WHERE r.rn = 2
  )
  SELECT
      curr.database_id,
      curr.database_name,
      curr.schema_id,
      curr.schema_name,
      curr.table_id,
      curr.table_name,
      prev.classified_on AS previous_classified_on,
      curr.classified_on AS current_classified_on,
      COALESCE(curr.column_name, prev.column_name) AS column_name,
      CASE
        WHEN prev.column_name IS NULL THEN 'ADDED'
        WHEN curr.column_name IS NULL THEN 'REMOVED'
        ELSE 'CHANGED'
      END AS change_type,
      prev.column_result:recommendation.semantic_category::STRING AS prev_semantic_category,
      curr.column_result:recommendation.semantic_category::STRING AS curr_semantic_category,
      prev.column_result:recommendation.privacy_category::STRING AS prev_privacy_category,
      curr.column_result:recommendation.privacy_category::STRING AS curr_privacy_category
    FROM curr_cols curr
    FULL OUTER JOIN prev_cols prev
      ON curr.table_id = prev.table_id
      AND curr.column_name = prev.column_name
    WHERE prev.column_name IS NULL
      OR curr.column_name IS NULL
      OR curr.column_result:recommendation.semantic_category != prev.column_result:recommendation.semantic_category
      OR curr.column_result:recommendation.privacy_category != prev.column_result:recommendation.privacy_category
    ORDER BY column_name;
```

The output shows three columns whose classification changed between the two most recent runs: DATE_OF_BIRTH and SSN were newly identified (ADDED) in the current run, while PHONE was classified in the previous run but no longer appears in the current run (REMOVED). Columns whose classification remained the same across both runs, such as EMAIL, are excluded from the results.

```output
+-------+---------+-----------+-------------+----------+------------+---------------------+---------------------+---------------+-------------+---------------+---------------+--------------+------------------+
| DB_ID | DB_NAME | SCHEMA_ID | SCHEMA_NAME | TABLE_ID | TABLE_NAME | PREV_CLASSIFIED_ON  | CURR_CLASSIFIED_ON  | COLUMN_NAME   | CHANGE_TYPE | PREV_SEMANTIC | CURR_SEMANTIC | PREV_PRIVACY | CURR_PRIVACY     |
+-------+---------+-----------+-------------+----------+------------+---------------------+---------------------+---------------+-------------+---------------+---------------+--------------+------------------+
| 10    | MY_DB   | 100       | MY_SCHEMA   | 1234     | EMPLOYEES  | 2025-02-15 14:30:00 | 2025-03-01 08:00:00 | DATE_OF_BIRTH | ADDED       | NULL          | DATE_OF_BIRTH | NULL         | QUASI_IDENTIFIER |
| 10    | MY_DB   | 100       | MY_SCHEMA   | 1234     | EMPLOYEES  | 2025-02-15 14:30:00 | 2025-03-01 08:00:00 | PHONE         | REMOVED     | PHONE_NUMBER  | NULL          | IDENTIFIER   | NULL             |
| 10    | MY_DB   | 100       | MY_SCHEMA   | 1234     | EMPLOYEES  | 2025-02-15 14:30:00 | 2025-03-01 08:00:00 | SSN           | ADDED       | NULL          | US_SSN        | NULL         | IDENTIFIER       |
+-------+---------+-----------+-------------+----------+------------+---------------------+---------------------+---------------+-------------+---------------+---------------+--------------+------------------+
```

## View classification results for JSON columns

Snowflake can classify columns of type ARRAY, VARIANT, or OBJECT when the semi-structured data is in JSON format. The result of this
classification has the following characteristics:

* The results object contains a `object_path_results` field. This field lists objects, where each object corresponds to
  a field in the semi-structured data that was classified into a native semantic category.
* If a field in the semi-structured data contains sensitive data, then the semantic category of the *column* is `MULTIPLE`. To obtain
  the semantic category of fields in the semi-structured data, use the `object_path_results` field in the results.

As an example, suppose Snowflake classifies the following table:

```output
+-----------------------------------------------------------+---------------+-----------------------------------------------------+
| ARRAY_COL                                                 | FIRST_NAME    | OBJECT_COL                                          |
+-----------------------------------------------------------+---------------+-----------------------------------------------------+
| [ { "email": "alice@example.com" }, { "email": "b..." } ] | "Joe"         | { "email": "jane@domain.com", "phone": "206-..." }  |
+-----------------------------------------------------------+---------------+-----------------------------------------------------+
```

The classification result might look like the following:

```JSON
{
  "ARRAY_COL": {
    "object_path_results": {
      "ARRAY_COL:[$$].email": {
        "alternates": [],
        "recommendation": {
          "confidence": "HIGH",
          "coverage": 1,
          "details": [],
          "privacy_category": "IDENTIFIER",
          "semantic_category": "EMAIL"
        }
      }
    },
    "recommendation": {
      "confidence": "HIGH",
      "details": [],
      "privacy_category": "IDENTIFIER",
      "semantic_category": "MULTIPLE"
    },
    "valid_value_ratio": 1
  },
  "FIRST_NAME": {
    "alternates": [],
    "recommendation": {
      "confidence": "HIGH",
      "coverage": 1,
      "details": [],
      "privacy_category": "IDENTIFIER",
      "semantic_category": "NAME"
    },
    "valid_value_ratio": 1
  },
  "OBJECT_COL": {
    "object_path_results": {
      "OBJECT_COL:email": {
        "alternates": [],
        "recommendation": {
          "confidence": "HIGH",
          "coverage": 1,
          "details": [],
          "privacy_category": "IDENTIFIER",
          "semantic_category": "EMAIL"
        }
      },
      "OBJECT_COL:phone": {
        "alternates": [],
        "recommendation": {
          "confidence": "HIGH",
          "coverage": 1,
          "details": [
            {
              "coverage": 1,
              "semantic_category": "US_PHONE_NUMBER"
            },
            {
              "coverage": 1,
              "semantic_category": "JP_PHONE_NUMBER"
            }
          ],
          "privacy_category": "IDENTIFIER",
          "semantic_category": "PHONE_NUMBER"
        }
      }
    },
    "recommendation": {
      "confidence": "HIGH",
      "details": [],
      "privacy_category": "IDENTIFIER",
      "semantic_category": "MULTIPLE"
    },
    "valid_value_ratio": 1
  }
}
```

## Use tags to track sensitive data

When Snowflake classifies sensitive data, it suggests or automatically applies system-defined and user-defined tags to the columns that
contain sensitive data. Because columns with sensitive data are assigned these tags, you can monitor the sensitive data by running queries
and calling functions to track the tags.

For example, to list all of the columns that were classified and assigned a semantic category, you can run the following query:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TAG_REFERENCES
  WHERE TAG_NAME = 'SEMANTIC_CATEGORY'
  ORDER BY object_database, object_schema, object_name, column_name;
```

If you want to determine which semantic category was assigned to the `fname` column of the `hr_data` table, you can run the following
query to obtain the value of the SEMANTIC_CATEGORY tag:

```sqlexample
SELECT SYSTEM$GET_TAG(
    'SNOWFLAKE.CORE.SEMANTIC_CATEGORY',
    'hr_data.fname',
    'COLUMN'
    );
```

For information about the different ways that you can track tags, see [Monitor object tags](object-tagging/monitor.md).

---
title: View results of a data metric function
source: https://docs.snowflake.com/en/user-guide/data-quality-results.md
section: User Guide
---

# View results of a data metric function

You can access the results of a scheduled data metric function (DMF) in the following ways:

* Query the dedicated event table.
* Query the [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/local/data_quality_monitoring_results.md) view, which is a
  flattened version of the event table.
* Call the [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/functions/data_quality_monitoring_results.md) table function.

Each method of viewing results has its own access control requirements. For example, an application role that grants access to the table
function might not let you query the event table. For a description of these access control requirements, see
[Viewing data quality results](data-quality-access-control.md).

> **Note:**
>
> This topic describes how you can use SQL to view the results of a DMF. To interact with a user interface to see the results of a data quality check, see [Monitoring data quality checks in Snowsight](data-quality-ui-monitor.md). A DMF is a building block of a data quality check.

## Query the dedicated event table

This option gives you access to the raw data, and you have more freedom to post-process the data using derived objects, such as creating
views, table functions, or stored procedures based on how you want to analyze the results. Additionally, if you create these
derived objects, you can selectively grant access on these objects to different roles. For example, a data engineer can access the stored
procedures to maintain the approach to obtain the results, and a data analyst can access the view to analyze the results.

The event table is named `SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS_RAW`.

For information about the event table columns, see [Event table columns](../developer-guide/logging-tracing/event-table-columns.md).

For a representative example to query the event table, see the
[logging and tracing tutorial](../developer-guide/logging-tracing/tutorials/logging-tracing-getting-started.md).

## Query the DATA_QUALITY_MONITORING_RESULTS view

This option enables you to query the [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/local/data_quality_monitoring_results.md) view,
which flattens the raw data in the event table to enable easier access to the DMF results. Additionally, this option is best when data
post-processing is not needed and when you don’t want to grant access to the raw data.

The view exists in the LOCAL schema in the shared SNOWFLAKE database: `SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS`.

For information, see the [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/local/data_quality_monitoring_results.md) view.

> **Note:**
>
> The SNOWFLAKE.GOVERNANCE_VIEWER database role does not have access to query the DATA_QUALITY_MONITORING_RESULTS view.

## Call the DATA_QUALITY_MONITORING_RESULTS table function

This option enables you to call the [DATA_QUALITY_MONITORING_RESULTS](../sql-reference/functions/data_quality_monitoring_results.md) table function to view the DMF
results. The function returns the same columns as the DATA_QUALITY_MONITORING_RESULTS view. However, you can only specify a single table
when calling the function. This option is best when you want to limit data metric function results to a single table and not provide
access to the measurements of other tables or the event table.

---
title: View tasks and task graphs in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-tasks.md
section: User Guide
---

# View tasks and task graphs in Snowsight

Tasks let you schedule the execution of SQL code. A task is associated with a specific database and schema. You can use
Snowsight to view and manage your tasks and task graphs. Using Snowsight, you can also view the execution history for
tasks and tasks graphs and retry failed tasks.

## View and manage individual tasks

To view and manage a task in Snowsight, perform the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. For a specific database and schema, select Tasks and select the task you want to manage.

When viewing the task in Snowsight, you can perform the following steps:

* In the Details section, review the task ID, warehouse used by the task, schedule, state, parameters, and any conditions.
* Review the SQL statement used to create the task and any task graph configurations in the Task Definition section.
* Manage privileges on the task. For information, see [Manage object privileges with Snowsight](security-access-control-configure.md).
* To edit the task, clone the task, drop the task, or transfer ownership of the task to another role, select the … actions button.

When you edit a task in Snowsight, the task is automatically suspended, and then resumed when you finish editing the task. For more
information about suspending and resuming tasks, see [Versioning of task runs](tasks-intro.md).

## View and manage task graphs

Review a task graph to see a root task, its dependent tasks, and finalizer task in the format of a task graph. For more information about task graphs, see [Create a sequence of tasks with a task graph](tasks-graphs.md). When you review a task graph, you can perform the following steps in Snowsight:

* View task information.
* Examine the task graph.
* Select a task on the graph to view additional details, such as predecessor tasks, the warehouse used to run the task, and the role that
  owns the task.

You can also edit the root task to change parameters for the task graph. When you edit a task in Snowsight, the task is
automatically suspended and resumed when you finish editing the task. For more information about suspending and resuming tasks, see
[Versioning of task runs](tasks-intro.md).

To view a task graph for a specific database schema, perform the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. Use the object explorer to locate the database and schema that contain the tasks that you want to view.
4. For the selected schema, select Tasks.
5. Select a specific task.

   The task details appear, with additional Graph, and Run History tabs.
6. Select the Graph tab to view the task graph.

   The task graph appears, displaying a hierarchy of tasks.
7. Select a task to view details in the context of the graph.

> **Note:**
>
> Task history data is only available if the task has been executed in the last 7 days.

## View task history

To view task history, perform the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Tasks.

From the Tasks page, you can see task execution history. For example:

* Review all tasks that have run in your account to help identify critical tasks that failed to run, long-running tasks, or tasks that have increasing costs.
* Review specific tasks to gather more information about the task.
* Review task graphs to observe, monitor, and help identify problems with a specific task graph.

You can also review task history in SQL by using a table function [TASK_HISTORY](../sql-reference/functions/task_history.md) or a
view [TASK_HISTORY view](../sql-reference/account-usage/task_history.md).

### Required privileges for viewing task history

To view task history in Snowsight, you need the same privileges required to run the
[TASK_HISTORY](../sql-reference/functions/task_history.md) function. That is, you must use a role that includes one of the following roles or privileges on the task:

> * The ACCOUNTADMIN role.
> * The role with the OWNERSHIP privilege on the task (that is, the task owner).
> * The MONITOR or OPERATE privilege on the task.
> * The global MONITOR EXECUTION privilege.

The role that you use must be able to query the Account Usage [TASK_HISTORY](../sql-reference/account-usage/task_history.md) view. You can grant the USAGE_VIEWER database role in the shared SNOWFLAKE database to the role that you use.

For example, to view the history for a specific task `mytask`, you can grant OWNERSHIP privileges on the task and the USAGE_VIEWER database role on the shared Snowflake database by running the following SQL commands:

```sqlexample
GRANT OWNERSHIP ON TASK mytask TO ROLE myrole;
GRANT DATABASE ROLE USAGE_VIEWER TO ROLE myrole;
```

For details, see:

* [ACCOUNT_USAGE schema SNOWFLAKE database roles](../sql-reference/account-usage.md)
* [GRANT DATABASE ROLE](../sql-reference/sql/grant-database-role.md)

### Review the run history for a task

Task run history includes details about each execution of a given task. You
can view the scheduled run time, status, return value, duration of a task, and other information.

For each instance, you can view the following:

* Scheduled run time: When the scheduled task was run.
* Status: Status of the most recent attempt of the task run.
* Duration: Amount of time, in seconds, for the task run.

To view the run history:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the right pane, using the object explorer, navigate to a database and schema.
4. For the selected schema, select and expand Tasks.
5. Select a task. Task information is displayed, including Task Details, Graph, and Run History sub-tabs.
6. Select the Run History tab.

> **Note:**
>
> Task history data is only available if the task has been executed in the last 7 days.

### Review account-level task history

Review the account-level history for task runs to identify failing tasks, long-running tasks,
and other monitoring and debugging cases for an entire account, rather than for one specific task.

To view account-level history for tasks, perform the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Tasks.
3. To view individual task runs, select View » Task Runs from the filters.

After you select the history of task runs, you can filter the page to display relevant information.

* Select the Date Range filter to show task history from the last day through the last 12 months, or a custom range.
* Select the Task status filter to display task history for one or more status, such as Succeeded, Failed, Cancelled, or Skipped.
* Filter on the name of the task to see patterns in status or duration over time for specific tasks.
* Filter on the name of the database or schema that contain the tasks.

For example, to identify long-running tasks, select the Status filter to show only successful tasks,
and sort the Duration field in descending order. For advanced debugging, you can open the filtered and sorted table in worksheets
using the Open in worksheets button. You could then modify the SQL statement with [LIMIT / FETCH](../sql-reference/constructs/limit.md)
and [GROUP BY](../sql-reference/constructs/group-by.md) arguments to identify the databases and schemas with the top 25 most long-running tasks.

You can also select a specific task to drill down for more details.

## View task graph history

### Viewing the Tasks page

To identify failing tasks, long-running tasks, and other monitoring and debugging cases, review the history of your task graph runs on the Tasks page.

> **Note:**
>
> With the Tasks page, you can view the task graph runs based on your specific role privileges.

To view task graph runs, take the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Transformation » Tasks.

On the Task Graphs tab, you can perform the following:

* Hover over the Previous Runs counter to display the status of the most recent runs in chronological order.
* The Duration Trend graph visualizes task run durations over time (based on the selected date range) by highlighting a median line within a min-max range. This can help you quickly assess whether task durations are stable, fluctuating, or trending, and identify individual outliers.
* Use the ellipsis menu to manually run the graph, edit the root task (for example, modify the schedule or parameters), or suspend/resume the graph.

You can filter the page to display relevant information. It is recommended to filter by database and schema to reduce load times on large accounts.

* Select the Date Range filter to show task history from the last 7 days (the default setting). You can change it to 1 day. Note that the date range filter only applies to the previous runs counter and the runtime duration trends.
* Select the Last Run Status filter to display task graphs for one or more statuses, such as Succeeded, Failed, Canceled, or Skipped on the most recent run. This filter applies only to the latest completed run of a task graph.
* Filter on the name of the database or schema that contains the tasks.
* Use the search field to filter on the root task name.

You can also select a specific task graph to drill down for more details, which the following image demonstrates:

Selecting a task graph always opens the details of the most recent run. If you want to see details of a previous run, you can select Open previous runs on the specific graph run page.

### Accessing task graph from a task object page

With this preview, you can also access the Task Graph page from a specific run history page if the task is part of a task graph.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the right pane, using the object explorer, navigate to a database and schema.
4. For the selected schema, select and expand Tasks.
5. Select a task. Task information is displayed, including Task Details, Graph, and Run History sub-tabs.
6. Select the Run History tab.
7. If the task is associated with a task graph, there is a Task Graph tab next to Task, which the following image demonstrates.

   Select Task Graph to view the details of the task graph.

### Considerations and limitations

* To view a task graph within this function, you’ll need a role with at least one of the following privileges:

  + OWNERSHIP privilege on the task (that is, the task owner).
  + MONITOR or OPERATE privileges on the task.
  + The global MONITOR EXECUTION privilege.
  + The ACCOUNTADMIN role.

  The role must also have the USAGE privilege on the database and schema that store the task, otherwise the DATABASE_NAME and SCHEMA_NAME values in the output are NULL.

## Retry failed tasks

In Snowsight, you can see previous task run attempts and retry failed and canceled task graphs. You must have the OPERATE privilege on the task to retry failed and canceled tasks. To view previous task run attempts, you also need the same privileges as viewing task history.

This is particularly useful for ensuring that data workflows or pipelines are successfully completed without having to restart the entire process, saving time and resources.

Snowflake supports both auto-retry and manual retry mechanisms:

* Auto-Retry: Tasks that fail are automatically retried shortly after failure based on predefined parameters set at the root task level.
* Manual-Retry: If auto-retry doesn’t resolve the issue, you can manually retry failed or canceled tasks within 14 days of their latest graph runs.

Using retry attempts instead of new runs is particularly helpful for completing graphs that failed partway without re-executing tasks that have already been successfully completed or skipped. This ensures that only the failed tasks are retried, minimizing redundancy.

> **Note:**
>
> * A graph can only be retried if it hasn’t been recreated or altered since the last run.
> * You must have the OWNERSHIP or OPERATE privilege on the task to retry failed and canceled tasks. To view previous task run attempts, you also need the same privileges as viewing task history.

Take the following steps to manually retry failed and canceled tasks. The following steps work only when the Viewing task graphs preview feature is enabled.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the right pane, using the object explorer, navigate to a database and schema.
4. For the selected schema, select and expand Tasks.
5. Select a task. Task information is displayed, including Task Details, Graph, and Run History sub-tabs.
6. Select the Run History tab.
7. On the Run History page, select Task Graph.
8. On the task graph run details page, open a failed or canceled task graph.
9. Select Retry to manually retry the failed task graph run.

   The retry starts only failed and canceled tasks in a graph and does not rerun tasks that have already succeeded.
10. Select the refresh button to refresh the page. The failed attempts show up in the account-level task graph run details. Account-level task run history and task graph run history show the status of the most recent attempt with a 45-minute latency.
11. The failed attempts also show up in the object-level task history. Follow the steps to view the run history of a task. The latest attempt for a run is shown. All attempts to run the task have the same run ID.

Any previous failed or canceled attempts are shown next to the run status. You can select the task to see the scheduled timestamp, status, and error messages for each attempt.

> **Note:**
>
> The Retry action is disabled if any of the following is true,
>
> * A retry is already in progress.
> * The selected run is not the most recent run.
> * The task graph has been modified after the run.
> * The run is longer than 14 days.
>
> The Retry action is not available if no tasks in the graph failed or were canceled.

---
title: View the schema for a table in Snowflake Open Catalog
source: https://docs.snowflake.com/en/user-guide/opencatalog/view-table-schema.md
section: User Guide
---

# View the schema for a table in Snowflake Open Catalog

A service admin or catalog admin can use the Snowflake Open Catalog web interface to view the schema for a table, including its nested schemas, if applicable. You can view the schema for a table in an internal or external catalog.

1. Sign in to Snowflake Open Catalog.
2. From the menu on the left, select **Catalogs**.
3. From the catalog object explorer on the left, select the catalog containing the table whose schema you want to view.
4. From the catalog object explorer, expand the catalog’s namespaces to expose the table whose schema you want to view.
5. Select the table to display its schema in the center of the page.
6. If the data type for the **Type** column in the schema is **struct** or **list<struct >**, to view the nested schema for the column, select **View nested schema** .

---
title: View the Snowflake client version
source: https://docs.snowflake.com/en/user-guide/snowflake-client-version-check.md
section: User Guide
---

# View the Snowflake client version

To view the version of the Snowflake client used to execute SQL statements in Snowflake, use the Client Driver column on
the Query History page in Snowsight.

Use this information to determine if the client versions actively used by users in your account meet the
[minimum requirements](../release-notes/requirements.md). You can also use this information, if applicable, to identify the
client version when submitting cases to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

To view the versions of Snowflake clients used recently in your account:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
3. In the navigation menu, select Monitoring » Query History.
4. Locate the Client Driver column, containing the version of the client or driver that submitted the query:

   If the column isn’t visible, select Columns and choose Client Driver.
5. Note the client version in the row for each SQL statement.

   For clients and drivers, the column includes an icon that indicates if the client version is supported,
   unsupported, or nearing the end of support. You can hover over the icon to display a tooltip that indicates the
   current status of the client version.

   Snowflake updates the information on which versions are supported every three months. See [Client versions & support policy](../release-notes/requirements.md).

---
title: Viewing accounts in your organization
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-view.md
section: User Guide
---

# Viewing accounts in your organization

[As the organization administrator](organization-administrators.md), you can view a list of all accounts created in your
organization through the web interface or using SQL:

> [Snowsight](ui-snowsight-gs.md):
> :   In the navigation menu, select Admin » Accounts.
>     You can search and filter active and dropped accounts in your organization.
>
> SQL:
> :   Execute a [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md) command.

---
title: Views, materialized views, and dynamic tables
source: https://docs.snowflake.com/en/user-guide/overview-view-mview-dts.md
section: User Guide
---

# Views, materialized views, and dynamic tables

Snowflake provides a variety of structures to view, materialize, and otherwise transform data. Three of the most common mechanisms are:

* [Views](views-introduction.md): Snowflake provides what would be considered a traditional database view.
  In general, a view allows the result of a query to be accessed as if it were a table, including linking (or in database parlance, joining)
  two or more tables or other views into a single logical view. Once defined, views can be queried like any other table.
* [Materialized views](views-materialized.md): Materialized views differ from traditional views by providing the ability to
  pre-compute the dataset based on materialized view query.
  Because the result is pre-computed, querying a materialized view is faster than executing a query against the base table of the view.
  This performance difference can be significant when a query is run frequently or is sufficiently complex.
  As a result, materialized views can speed up expensive aggregation, projection, and selection operations, especially those that run
  frequently and that run on large data sets.
* [Dynamic Tables](dynamic-tables-about.md): Dynamic tables materialize the results of a specified query.
  Instead of creating a separate target table and writing code to transform and update the data in that table,
  you can define the target table as a dynamic table, and you can specify the SQL statement that performs the transformation.
  Background automation then keeps the dynamic table up to date based on the refresh criteria that you specify.

## Comparison of Views, materialized views, and dynamic tables

| Object type | Pros | Cons | Limitations and More information |
| --- | --- | --- | --- |
| View | Simple, easily defined, consumes no storage. | Inflexible, slow, requires compute to generate results. | See [Limitations on Views](views-introduction.md). |
| Materialized View | Fast results retrieval. Relatively simple definition. Somewhat flexible. Always up to date. | Incurs compute to keep up to date. Consumes storage. | For more information, including limitations in materialized views, see [Working with Materialized Views](views-materialized.md). |
| Dynamic Tables | Extremely fast results retrieval. Relatively simple definition. Very flexible. Fine control on refresh. Can provide complex transformations. | Incurs compute cost to be kept up to date. Consumes storage. Requires careful consideration as to how often to refresh, and when and how to refresh. | For more information, see [Dynamic tables](dynamic-tables-about.md). |

---
title: Virtual warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses.md
section: User Guide
---

# Virtual warehouses

A virtual warehouse, often referred to simply as a “warehouse”, is a cluster of compute resources in Snowflake. A virtual warehouse is
available in two types:

* Standard
* Snowpark-optimized

A warehouse provides the required resources, such as CPU, memory, and temporary storage, to
perform the following operations in a Snowflake session:

* Executing SQL [SELECT](../sql-reference/sql/select.md) statements that require compute resources (for example, retrieving rows from tables and views).
* Performing DML operations, such as:

  + Updating rows in tables ([DELETE](../sql-reference/sql/delete.md) , [INSERT](../sql-reference/sql/insert.md) , [UPDATE](../sql-reference/sql/update.md)).
  + Loading data into tables ([COPY INTO <table>](../sql-reference/sql/copy-into-table.md)).
  + Unloading data from tables ([COPY INTO <location>](../sql-reference/sql/copy-into-location.md)).

> **Note:**
>
> To perform these operations, a warehouse must be running and in use for the session. While a warehouse is running, it consumes Snowflake
> credits.

[Overview of warehouses](warehouses-overview.md)
:   Warehouses are required for queries, as well as all DML operations, including loading data into tables.
    In addition to being defined by its type as either Standard or Snowpark-optimized, a warehouse is defined by its size,
    as well as the other properties that can be set to help control and automate warehouse activity.

[Snowpark-optimized warehouses](warehouses-snowpark-optimized.md)
:   Snowpark workloads can be run on both Standard and Snowpark-optimized warehouses. Snowpark-optimized warehouses are recommended for workloads that have large memory requirements such as ML training use cases.

[Warehouse considerations](warehouses-considerations.md)
:   Best practices and general guidelines for using virtual warehouses in Snowflake to process queries.

[Multi-cluster warehouses](warehouses-multicluster.md)
:   Multi-cluster warehouses enable you to scale compute resources to manage your user and query concurrency needs as they change, such as during peak and off hours.

[Working with warehouses](warehouses-tasks.md)
:   Learn how to create, stop, start and otherwise manage Snowflake warehouses.

[Using the Query Acceleration Service (QAS)](query-acceleration-service.md)
:   The query acceleration service can accelerate parts of the query workload in a warehouse.
    When enabled for a warehouse, query acceleration can improve overall warehouse performance by reducing the impact of outlier queries
    (i.e. queries which use more resources then typical queries).

[Monitoring warehouse load](warehouses-load-monitoring.md)
:   Warehouse query load measures the average number of queries that were running or queued within a specific interval.

* [Overview of warehouses](warehouses-overview.md)
* [Snowpark-optimized warehouses](warehouses-snowpark-optimized.md)
* [Warehouse considerations](warehouses-considerations.md)
* [Multi-cluster warehouses](warehouses-multicluster.md)
* [Working with warehouses](warehouses-tasks.md)
* [Using the Query Acceleration Service (QAS)](query-acceleration-service.md)
* [Monitoring warehouse load](warehouses-load-monitoring.md)

---
title: Visualizing data with dashboards
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-dashboards.md
section: User Guide
---

# Visualizing data with dashboards

You can use dashboards to visualize and communicate query results using charts in [Snowsight](ui-snowsight-gs.md).
Dashboards are flexible collections of charts arranged as tiles. The charts are generated by query results and can be customized.

> **Important:**
>
> Legacy Dashboards will be removed from Snowflake on **June 22, 2026**. For migration options and the
> full deprecation timeline, see
> [Deprecation of Legacy Worksheets and Dashboards](../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

You can also create dashboard tiles from charts in worksheets. For more details, see [Visualizing worksheet data](ui-snowsight-visualizations.md).

## Create a dashboard

You can create an empty dashboard or create a dashboard directly from a worksheet.

### Create an empty dashboard

To create an empty dashboard, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Dashboards.
3. Select + Dashboard.
4. Enter a name for the dashboard, and then select Create Dashboard.

### Create a dashboard from an existing worksheet

You can also use an existing worksheet to create a dashboard.

When you use a worksheet to create a dashboard, the worksheet is removed from the list of worksheets and can only be accessed from
the dashboard. The worksheet query is stored in the dashboard and can be modified in that context.

To create a dashboard using an existing worksheet, complete the following steps:

1. [Open a worksheet](ui-snowsight-worksheets-gs.md).
2. Hover over the name of the worksheet and select , and then select Move to.
3. Select New dashboard.
4. Enter a name for the dashboard, and then select Create Dashboard.

   > The dashboard opens, displaying a tile based on the worksheet you used.

> **Note:**
>
> If the worksheet is shared with other users, those users lose access to the worksheet when you create a dashboard because the worksheet
> is removed when the dashboard is created. Permissions on the worksheet are revoked and links to the worksheet
> no longer function. For more details about sharing dashboards, see Share dashboards.

## About using dashboards

After you create a dashboard, you can manage, add tiles, filters, and share the dashboard with other users.

Tiles visualize data on your dashboards as charts and tables. Hover over charts to view details about each data point.

### Open a dashboard

To open a dashboard, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Dashboards.
3. Locate the dashboard that you want to open:

   * The Recent tab displays the dashboards you opened most recently.
   * The Shared With Me tab displays dashboards that your colleagues shared with you.
   * The My dashboards tab displays dashboards that you created and own.

You can also search the names and contents of worksheets and dashboards.

## Manage a dashboard

While viewing a dashboard, you can take the following actions on the dashboard:

* Select the dashboard name to rename, duplicate, or delete the dashboard.
* Select + to add a tile to the dashboard. See Add a tile to a dashboard.
* Select  to show, hide, and manage custom filters that you can use in queries on the dashboard.
  For more details on filters, see [Filter query results in dashboards and worksheets](ui-snowsight-filters.md).
* Use the context selector  to specify the role and warehouse to use for running the queries in the dashboard.
* Select Share to share the dashboard with other Snowsight users. See Share dashboards for details.
* Select Run to run all the queries for the dashboard tiles.

### Add a tile to a dashboard

To add a tile to a dashboard, complete the following steps:

1. Open a dashboard.
2. Select + ().
3. Select New Tile from Worksheet.

   A blank worksheet opens, overlaying the dashboard.
4. Use the worksheet to build your query.

   To learn more about queries and worksheets, see [Querying data using worksheets](ui-snowsight-query.md).
5. When you finish writing your query, select Return to <dashboard name> to save your worksheet and add it to the dashboard.

### Add an existing worksheet to a dashboard

To add an existing worksheet as a tile, complete the following steps:

1. [Open a worksheet](ui-snowsight-worksheets-gs.md).
2. On the worksheet tab, select , and then select Move to.
3. Select an existing dashboard.

   The worksheet is added to the dashboard and removed from the list of worksheets. A tile showing a chart for the worksheet
   displays on the dashboard.

### Rearrange the order of tiles

By default, tiles are added to the bottom of the dashboard.

To rearrange the tiles on a dashboard, drag a tile to a new position. As you drag the tile, a preview of the new position appears.

### Edit charts

To edit a chart that appears in a tile, complete the following steps:

1. From the tile menu (), select View Chart.

   The chart opens in a worksheet.
2. Make changes to the chart. To learn more about charts, see [Visualizing worksheet data](ui-snowsight-visualizations.md).
3. When you are finished editing the chart, select Return to <dashboard name> to save your changes and return to the dashboard.

### Edit queries

To edit the query used for a tile, complete the following steps:

1. From the tile menu (), select Edit query.

   The query opens in a worksheet window.
2. Make changes to the query. For more about editing queries in worksheets, see [Querying data using worksheets](ui-snowsight-query.md).
3. When you finish editing the query, select Return to <dashboard name> to save your changes and return to the dashboard.

### Configure a tile display

By default, when you move a worksheet to a dashboard, the corresponding tile displays a chart.

To change the tile from a chart to a table of the query results, complete the following steps:

1. Remove the tile.
2. Add a tile and drag the table version of your query to your dashboard.

If a tile displays a table and you want to add a chart tile based on the same query, edit the query of the tile
by completing the following steps:

1. From the tile menu (), select Edit Query.
2. Select Chart.
3. Select Return to <dashboard name> to save your changes and return to the dashboard.

   A new tile is added at the bottom of the dashboard with a chart view of the table. For details on making changes to the chart,
   see Edit charts.

### Duplicate a tile

To duplicate a tile, complete the following steps:

* From the tile menu (), select Duplicate Tile.

  A copy of the tile appears at the bottom of the dashboard.

### Remove a tile

When you want to remove a tile from your dashboard, but still preserve the underlying query, complete the following step:

* To remove a tile, on the tile menu (), select Unplace Tile.

  The tile is removed from the dashboard, but remains available to add to the dashboard from the Add tile menu.

### Delete a tile

> **Warning:**
>
> Deleting a tile from a dashboard also deletes the underlying queries. This action cannot be undone. If you want to remove the tile but
> preserve the query, see Remove a tile.
>
> If you delete a tile with a query that is used by another tile, such as a table and chart view of the same query results, both tiles
> are deleted.

To delete a tile, complete the following steps:

1. From the tile menu (), select Delete.
2. Select Delete to permanently delete the tile and its underlying query from the dashboard.

## Share dashboards

Editors and owners can share a dashboard with individual collaborators or by enabling and using link sharing.

The queries that drive dashboards in Snowsight use unique sessions with assigned roles and warehouses. To view shared dashboards,
the Snowflake user must use the same role as the session context for the queries that drive the dashboard.

To share a dashboard, complete the following steps:

1. Open a dashboard.
2. Select Share.
3. Enter the names or usernames of the Snowflake users you want to invite to use your dashboard. The list only shows users that have
   previously signed in to Snowsight. To share with someone who has not yet signed in to Snowsight, share a
   link instead (ensure that you have enabled link sharing).
4. Optionally, set how people with the link can interact with the dashboard. By default, people with the link cannot view the dashboard.
   For example, you can choose to allow people to view the results on the dashboard, but not run the underlying queries.
5. Optionally, select Get Link to get a link to your dashboard that you can share with others.
6. Select Done.

> **Note:**
>
> Dashboards run exclusively with the user’s primary role, regardless of the value set for DEFAULT_SECONDARY_ROLES. This behavior simplifies
> governance and enhances security by disabling secondary roles in dashboards.

For more details about sharing permissions for dashboards and worksheets, see [Permissions for shared worksheets](ui-snowsight-worksheets.md).
You cannot organize dashboards into folders.

---
title: Visualizing worksheet data
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-visualizations.md
section: User Guide
---

# Visualizing worksheet data

This topic describes how to visualize your SQL worksheet results using charts in [Snowsight](ui-snowsight-gs.md).
Charts transform your query results into visualizations that communicate logical relationships and lead to more informed decision making. Charts let you quickly identify and understand patterns and outliers in data.

Snowsight supports the following types of charts:

* Bar charts
* Line charts
* Scatterplots
* Heat grids
* Scorecards

You can also visualize your data [using dashboards](ui-snowsight-dashboards.md).

> **Note:**
>
> Chart generation and data transformations in worksheets can result in compute usage. For guidance on managing these costs, see [Optimizing cost](cost-optimize.md).

## Create a chart

When you run a query in a worksheet, you can display a chart based on the results.

1. [Open a worksheet](ui-snowsight-worksheets-gs.md).
2. Run the worksheet.
3. Above the results table for the query, select Chart.

## Modify a chart

When you select a chart to visualize your worksheet results, Snowsight automatically generates a chart for you based on the
query results. Each query supports one type of chart at a time.

Hover over the chart to view details about each data point. For example, you can view your results as a line chart:

You can modify the type of chart used to display your query results.

* Select the chart type to choose a different type, for example, Bar.

You can manage the columns in your chart with the Data section:

Select a column to modify the column attributes:

* Add or remove columns.
* Choose a different column in the query results to use in the chart.
* Modify how column data is represented in the chart. For example, change the bucketing for a time column from day to minutes.

  You can modify the column attributes to configure how data in that column is rendered in the chart. See Aggregate and bucket data
  for more details about managing aggregate data.

Style your chart in the Appearance section. The available settings depend on the type of chart. For example, for a heatgrid
chart:

The exact content of your charts depends on your query results. To generate the examples in this topic, use the following query based
on the Snowflake sample data:

```sqlexample
SELECT
  COUNT(O_ORDERDATE) as orders, O_ORDERDATE as date
FROM
  SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.ORDERS
WHERE
  O_ORDERDATE = :daterange
GROUP BY
  :datebucket(O_ORDERDATE), O_ORDERDATE
ORDER BY
  O_ORDERDATE
LIMIT 10;
```

## Charts and new query results

Your chart updates automatically as long as the columns used by the chart are available in the query results. If a column name changes, you
must update the chart to use the new column name. Charts indicate any columns that cannot be found.

## Aggregate and bucket data

Charts simplify grouping numbers, dates, and timestamps of more or less continuous values into various *buckets*. For example, suppose your
query retrieves per-day data over a period of time. Without modifying your query, you can easily select a different bucket of time (e.g.
weekly or monthly data) in the inspector panel to change the time dimension of the results displayed in the chart.

Charts can bucket by date, week, month, and year for date columns. For numeric columns, charts can bucket by integer values.

Charts use aggregation functions to determine a single value from multiple data points in a bucket. These functions are as follows:

* average
* count
* minimum
* maximum
* median
* mode
* sum

---
title: Warehouse considerations
source: https://docs.snowflake.com/en/user-guide/warehouses-considerations.md
section: User Guide
---

# Warehouse considerations

This topic provides general guidelines and best practices for using virtual warehouses in Snowflake to process queries. It does
not provide specific or absolute numbers, values, or recommendations because every query scenario is different and is affected by
numerous factors, including number of concurrent users/queries, number of tables being queried, and data size and composition, as well as
your specific requirements for warehouse availability, latency, and cost.

It also does not cover warehouse considerations for data loading, which are covered in another topic (see the sidebar).

The keys to using warehouses effectively and efficiently are:

1. Experiment with different types of queries and different warehouse sizes to determine the combinations that best meet your
   specific query needs and workload.
2. Don’t focus on warehouse size. Snowflake utilizes per-second billing, so you can run larger warehouses (Large, X-Large,
   2X-Large, etc.) and simply suspend them when not in use.

> **Note:**
>
> These guidelines and best practices apply to both single-cluster warehouses, which are standard for all accounts, and
> [multi-cluster warehouses](warehouses-multicluster.md), which are available in [Snowflake Enterprise Edition](intro-editions.md) (and higher).

## How are credits charged for warehouses?

Credit charges are calculated based on:

* The warehouse size.
* The number of clusters (if using [multi-cluster warehouses](warehouses-multicluster.md)).
* The length of time the compute resources in each cluster runs.

For example:

X-Small:
:   Bills 1 credit per full, continuous hour that each cluster runs; each successive size generally doubles the number of compute
    resources per warehouse.

4X-Large:
:   Bills 128 credits per full, continuous hour that each cluster runs.

Note the following:

* When compute resources are provisioned for a warehouse:

  + The minimum billing charge for provisioning compute resources is 1 minute (i.e. 60 seconds).
  + There is no benefit to stopping a warehouse before the first 60-second period is over because the credits have already
    been billed for that period.
  + After the first 60 seconds, all subsequent billing for a running warehouse is per-second (until all its compute resources are shut down).
    Three examples are provided below:

    - If a warehouse runs for 30 to 60 seconds, it is billed for 60 seconds.
    - If a warehouse runs for 61 seconds, it is billed for only 61 seconds.
    - If a warehouse runs for 61 seconds, shuts down, and then restarts and runs for less than 60 seconds, it is billed for 121 seconds (60 + 1 + 60).
* Resizing a warehouse provisions additional compute resources for each cluster in the warehouse:

  + This results in a corresponding increase in the number of credits billed for the warehouse (while the additional compute resources are
    running).
  + The additional compute resources are billed when they are provisioned (i.e. credits for the additional resources are billed relative
    to the time when the warehouse was resized).
  + Resizing between a 5XL or 6XL warehouse to a 4XL or smaller warehouse results in a brief period during which the customer is
    charged for both the new warehouse and the old warehouse while the old warehouse is quiesced.
  + Credit usage is displayed in hour increments. With per-second billing, you will see fractional amounts for credit usage/billing.

> **Tip:**
>
> For information about cost implications of changing the RESOURCE_CONSTRAINT property, see
> [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](warehouses-gen2.md).

## How does query composition impact warehouse processing?

The compute resources required to process a query depend on the size and complexity of the query. For the most part, queries scale
linearly with respect to warehouse size, particularly for larger, more complex queries. When considering factors that impact query
processing, consider the following:

* The overall size of the tables being queried has more impact than the number of rows.
* Query filtering using predicates has an impact on processing, as does the number of joins/tables in the query.

> **Tip:**
>
> To achieve the best results, try to execute relatively homogeneous queries (complexity, data sets, etc.) on the same warehouse;
> executing queries of widely varying complexity on the same warehouse makes it more difficult to analyze warehouse load,
> which can make it more difficult to select the best warehouse size to match the complexity, composition, and number of queries in your
> workload.

## How does warehouse caching impact queries?

Each warehouse, when running, maintains a cache of table data accessed as queries are processed by the warehouse. This enables improved
performance for subsequent queries if they are able to read from the cache instead of from the table(s) in the query. The size of the cache
is determined by the compute resources in the warehouse (that is, the larger the warehouse and, therefore, more compute resources in the
warehouse, the larger the cache).

This cache is dropped when the warehouse is suspended, which might result in slower initial performance for some queries after the warehouse
is resumed. As the resumed warehouse runs and processes more queries, the cache is rebuilt, and queries that are able to take advantage of
the cache will experience improved performance.

Keep this in mind when deciding whether to suspend a warehouse or leave it running. In other words, consider the trade-off between saving
credits by suspending a warehouse versus maintaining the cache of data from previous queries to help with performance.

## Creating a warehouse

When creating a warehouse, the two most critical factors to consider, from a cost and performance perspective, are:

* Warehouse size (that is, available compute resources)
* Manual vs automated management (for starting/resuming and suspending warehouses).

The number of clusters in a warehouse is also important if you are using [Snowflake Enterprise Edition](intro-editions.md) (or higher) and
[multi-cluster warehouses](warehouses-multicluster.md). For more details, see Scaling Up vs Scaling Out (in this topic).

### Selecting an initial warehouse size

The initial size you select for a warehouse depends on the task the warehouse is performing and the workload it processes. For example:

* For data loading, the warehouse size should match the number of files being loaded and the amount of data in each file. For more information,
  see [Planning a data load](data-load-considerations-plan.md).
* For queries in small-scale testing environments, smaller warehouses sizes (X-Small, Small, Medium) may be sufficient.
* For queries in large-scale production environments, larger warehouse sizes (Large, X-Large, 2X-Large, etc.) may be more cost effective.

However, note that per-second credit billing and auto-suspend give you the flexibility to start with larger sizes and then adjust the size
to match your workloads. You can decrease the size of a warehouse at any time.

Also, larger is not necessarily faster for smaller, more basic queries. Small/simple queries typically do not need an X-Large (or larger)
warehouse because they do not necessarily benefit from the additional resources, regardless of the number of queries being processed
concurrently. In general, you should try to match the size of the warehouse to the expected size and complexity of the queries to be
processed by the warehouse.

> **Tip:**
>
> Experiment by running the same queries against warehouses of multiple sizes (for example, X-Large, Large, Medium). The queries you experiment
> with should be of a size and complexity that you know will typically complete within 5 to 10 minutes (or less).

### Selecting a warehouse for Snowsight

Certain Snowsight pages, such as Task Run History or Data Preview, require a warehouse to run SQL queries in order to
display more than just metadata. On these pages, a warehouse selector indicates the warehouse where these UI queries are running. A green
dot indicates when the warehouse is active.

An X-Small warehouse is recommended and generally sufficient for most of these queries, however large accounts may see performance
improvements by using a larger warehouse.

In some cases, your account is not billed for client-generated statements. For example, [SHOW TABLES](../sql-reference/sql/show-tables.md) does not
require a warehouse to retrieve data, so no charges apply. For more information about warehouses in general, see [Overview of warehouses](warehouses-overview.md).

> **Note:**
>
> Snowsight performance can be affected if the warehouse is temporarily overloaded and UI queries are queued behind other active
> workloads. If you notice inconsistent Snowsight performance, Snowflake recommends that you review the selected warehouse for
> overload and consider using one with lower utilization. Large accounts with many active users might benefit from a dedicated X-Small
> warehouse for UI-related tasks.

You can view which Snowsight queries have been running on the currently selected warehouse and when they ran. To monitor these
queries, follow these steps:

1. In the navigation menu, select Monitoring » Query History.
2. Select the Filters drop-down list.
3. Select the Client-generated statements checkbox to view internal queries run by a client, driver, or library, including the web interface.
4. Select Apply Filters.

For information about cost governance, see [Exploring compute cost](cost-exploring-compute.md).

### Using the default warehouse for Notebook apps

Each account is provisioned with the SYSTEM$STREAMLIT_NOTEBOOK_WH warehouse that is specifically designed to run Notebook Python code. This multi-cluster X-Small warehouse helps reduce cluster fragmentation, optimize costs, and improve bin-packing efficiency. For more
details, see [Default warehouse for notebooks](warehouses-overview.md).

### Automating warehouse suspension

Warehouses can be set to automatically suspend when there’s no activity after a specified period of time. Auto-suspend is enabled by
specifying the time period (minutes, hours, etc.) of inactivity for the warehouse.

We recommend setting auto-suspend according to your workload and your requirements for warehouse availability:

* If you enable auto-suspend, we recommend setting it to a low value (for example, 5 or 10 minutes or less) because Snowflake utilizes per-second
  billing. This will help keep your warehouses from running (and consuming credits) when not in use.

  However, the value you set should match the gaps, if any, in your query workload. For example, if you have regular gaps of 2 or 3 minutes
  between incoming queries, it doesn’t make sense to set auto-suspend to 1 or 2 minutes because your warehouse will be in a continual state
  of suspending and resuming (if auto-resume is also enabled) and each time it resumes, you are billed for the minimum credit usage (that is, 60 seconds).
* You might want to consider disabling auto-suspend for a warehouse if:

  + You have a heavy, steady workload for the warehouse.
  + You require the warehouse to be available with no delay or lag time. Warehouse provisioning is generally very fast (e.g. 1 or 2
    seconds); however, depending on the size of the warehouse and the availability of compute resources to provision, it can take longer.

> **Important:**
>
> If you choose to disable auto-suspend, carefully consider the costs associated with running a warehouse continually, even when the
> warehouse is not processing queries. The costs can be significant, especially for larger warehouses (X-Large, 2X-Large, etc.).
>
> To disable auto-suspend, you must explicitly select Never in the web interface, or specify `0` or `NULL` in SQL.

### Automating warehouse resumption

Warehouses can be set to automatically resume when new queries are submitted.

We recommend enabling/disabling auto-resume depending on how much control you wish to exert over usage of a particular warehouse:

* If cost and access are not an issue, enable auto-resume to ensure that the warehouse starts whenever needed. Keep in mind that there
  might be a short delay in the resumption of the warehouse due to provisioning.
* If you wish to control costs and/or user access, leave auto-resume disabled and instead manually resume the warehouse only when needed.

## Scaling up vs scaling out

Snowflake supports two ways to scale warehouses:

* Scale up by resizing a warehouse.
* Scale out by adding clusters to a multi-cluster warehouse (requires [Snowflake Enterprise Edition](intro-editions.md) or
  higher).

### Warehouse resizing improves performance

Resizing a warehouse generally improves query performance, particularly for larger, more complex queries. It can also help reduce the
queuing that occurs if a warehouse does not have enough compute resources to process all the queries that are submitted concurrently. Note
that warehouse resizing is not intended for handling concurrency issues; instead, use additional warehouses to handle the workload or use a
multi-cluster warehouse (if this feature is available for your account).

Snowflake supports resizing a warehouse at any time, even while running. If a query is running slowly and you have additional queries of
similar size and complexity that you want to run on the same warehouse, you might choose to resize the warehouse while it is running; however,
note the following:

* As stated earlier about warehouse size, larger is not necessarily faster; for smaller, basic queries that are already executing quickly,
  you may not see any significant improvement after resizing.
* Resizing a running warehouse does not impact queries that are already being processed by the warehouse; the additional compute resources,
  once fully provisioned, are only used for queued and new queries.
* Resizing between a 5XL or 6XL warehouse to a 4XL or smaller warehouse results in a brief period during which the customer is charged
  for both the new warehouse and the old warehouse while the old warehouse is quiesced.

> **Tip:**
>
> Decreasing the size of a running warehouse removes compute resources from the warehouse. When the computer resources are removed, the
> cache associated with those resources is dropped, which can impact performance in the same way that suspending the warehouse can impact
> performance after it is resumed.
>
> Keep this in mind when choosing whether to decrease the size of a running warehouse or keep it at the current size. In other words, there
> is a trade-off with regards to saving credits versus maintaining the cache.

### Multi-cluster warehouses improve concurrency

[Multi-cluster warehouses](warehouses-multicluster.md) are designed specifically for handling queuing and performance issues
related to large numbers of concurrent users and/or
queries. In addition, multi-cluster warehouses can help automate this process if your number of users/queries tend to fluctuate.

When deciding whether to use multi-cluster warehouses and the number of clusters to use per multi-cluster warehouse, consider the
following:

* If you are using Snowflake Enterprise Edition (or a higher edition), all your warehouses should be configured as multi-cluster
  warehouses.
* Unless you have a specific requirement for running in Maximized mode, multi-cluster warehouses should be configured to run in Auto-scale
  mode, which enables Snowflake to automatically start and stop clusters as needed.
* When choosing the minimum and maximum number of clusters for a multi-cluster warehouse:

  Minimum:
  :   Keep the default value of `1`; this ensures that additional clusters are only started as needed. However, if
      high-availability of the warehouse is a concern, set the value higher than `1`. This helps ensure multi-cluster warehouse availability
      and continuity in the unlikely event that a cluster fails.

  Maximum:
  :   Set this value as large as possible, while being mindful of the warehouse size and corresponding credit costs. For example, an
      X-Large multi-cluster warehouse with maximum clusters = `10` will consume 160 credits in an hour if all 10 clusters run
      continuously for the hour.

---
title: Windows troubleshooting steps
source: https://docs.snowflake.com/en/user-guide/client-connectivity-troubleshooting/windows.md
section: User Guide
---

# Windows troubleshooting steps

Follow these steps to identify and confirm that you have a proxy and to gather the proxy host and port numbers that you need for further troubleshooting.

1. Check the proxy settings.

   1. Open the Settings menu.
   2. Search for “proxy”, and select Change proxy settings.
2. Check the manual proxy configuration.

   1. In Manual proxy setup, select the `Use a proxy server` option.

      * If it is `On`, a proxy is currently in use.
      * If it is `Off`, no proxy is being used.
3. Check the automatic proxy configuration.

   1. Under Automatic proxy setup, look for `Use setup script`. If it is `On`, a proxy might be configured via a script.
   2. To verify, enter the script URL in your browser. If a file is downloaded, it contains the proxy information.
4. Check the proxy using the Windows `PowerShell`, as follows:

   ```bash
   $proxyAddr = (Get-ItemProperty 'HKCU:\Software\Microsoft\Windows\CurrentVersion\Internet Settings').ProxyServer
   $proxyEnable = (Get-ItemProperty 'HKCU:\Software\Microsoft\Windows\CurrentVersion\Internet Settings').ProxyEnable

   # Output the values
   $proxyAddr
   $proxyEnable
   ```

   For example:

   * If the `proxyAddr` is `my.pro.xy:123` and `proxyEnable` is `0`, the proxy address is `my.pro.xy:123`.
   * If `proxyEnable` is `0`, the proxy is disabled; if it is `1`, the proxy is enabled.
5. Proceed based on the proxy test results:

   * **Proxy found**: Based on these environment variables settings, you can gather the proxy host and port that you will need for further testing.
   * **No proxy found**: If the test for the proxy is negative, continue with further testing.

## If you have a proxy

After identifying your proxy settings, or if you already know your proxy information, proceed to test the URL that is encountering issues. You should test all URLs in Snowflake’s allowlist thoroughly. At the very least, make sure to test the URL that is causing failures in your connector specifically.

In the Windows `Powershell`, run the following commands, making sure to update the URL in the commands to match the Snowflake URL that you are testing. Also, make sure to update your `PROXY_URL`.

```bash
[Net.ServicePointManager]::ServerCertificateValidationCallback = { $true }

$proxy = New-Object System.Net.WebProxy("http://<PROXY:PORT>")
$url = "https://<URL>/"

$req = [Net.HttpWebRequest]::Create($url)
$req.Proxy = $proxy
$req.GetResponse() | Out-Null
$output = [PSCustomObject]@{
  Proxy = $proxy
  URL = $url
  'Issuer' = $req.ServicePoint.Certificate.Issuer
  'Subject' = $req.ServicePoint.Certificate.Subject
}

$output|ConvertTo-Json

Sample expected output:

{
    "Proxy": {
                  "Address": "<IP ADDRESS>"",
                  "BypassProxyOnLocal": false,
                  "BypassList": [
                                ],
                  "Credentials": null,
                  "UseDefaultCredentials": false,
                  "BypassArrayList": [
                                      ]
              },
    "URL": "https://<account>.snowflakecomputing.com"",
    "Issuer": "CN=Amazon, OU=Server CA 1B, O=Amazon, C=US",
    "Subject": "CN=*.us-east-1.snowflakecomputing.com",
    "Cert Start Date": "5/23/2022 8:00:00 PM",
    "Cert End Date": "6/22/2023 7:59:59 PM"
}
```

Observe any references to the proxy in the test results to confirm that the proxy was used during this test. If the connection is successful, examine the issuer information provided in the output.

After completing these steps, continue with [follow-up actions](followup-actions.md).

## If you don’t have a proxy

You should test all URLs in the Snowflake allowlist thoroughly. At the very least, make sure to specifically test the URL that is causing failures in your connector.

1. Open `Powershell`.
2. Run the following commands in `Powershell`, updating the URL in the commands to match the URL that you are testing.

   ```bash
   [Net.ServicePointManager]::ServerCertificateValidationCallback = { $true }
   $url = "https://<URL>/""
   $req = [Net.HttpWebRequest]::Create($url)
   $req.GetResponse() | Out-Null
   $output = [PSCustomObject]@{
     URL = $url
     'Issuer' = $req.ServicePoint.Certificate.Issuer
     'Subject' = $req.ServicePoint.Certificate.Subject
   }
   $output|ConvertTo-Json
   ```

   Sample successful output:

   ```output
   {
       "URL": "https://<account>.snowflakecomputing.com"",
       "Issuer": "CN=Amazon, OU=Server CA 1B, O=Amazon, C=US",
       "Subject": "CN=*.us-east-1.snowflakecomputing.com"
   }
   ```

   Sample connection failure output:

   ```output
   Exception calling "GetResponse" with "0" argument(s): "Unable to connect to the remote server"
   At line:4 char:1

   + $req.GetResponse() | Out-Null
   + ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
       + CategoryInfo     : NotSpecified: (:) [], MethodInvocationException
       + FullyQualifiedErrorId : WebException
   ```

If the connection is successful, examine the issuer information provided in the output.

After completing these steps, continue with [follow-up actions](followup-actions.md).

---
title: Work with Amazon S3-compatible storage
source: https://docs.snowflake.com/en/user-guide/data-load-s3-compatible-storage.md
section: User Guide
---

# Work with Amazon S3-compatible storage

This topic provides information about accessing Amazon S3-compatible storage from Snowflake.

A storage application or device is Amazon S3-compatible if it provides an application programming interface (API)
that is compliant with the industry-standard [Amazon Simple Storage Service (S3) REST API](https://docs.aws.amazon.com/AmazonS3/latest/API/Welcome.html).
The Amazon S3 REST API enables CRUD operations and administrative actions on storage buckets and objects.

With Snowflake, you can use an external stage to connect to a growing number of S3-compatible storage solutions,
including on-premises storage and devices that exist outside of the public cloud.
The external stage stores an S3-compliant API endpoint, bucket name and path, and credentials.
To let users load and unload data from and to your storage locations, you grant privileges on the stage to roles.

You can use Snowflake support for Amazon S3-compatible storage to perform tasks such as:

* Querying data from an external stage without loading the data into Snowflake. For more information, see Extend your data lake using external tables.
* Reading and processing unstructured data. To learn more, see [Download from an internal stage](unstructured-intro.md).
* Copying files in S3-compatible storage from one location to another using the [COPY FILES](../sql-reference/sql/copy-files.md) command.

> **Note:**
>
> You can also create an external volume for S3-compatible storage to query an externally managed Apache Iceberg™ table. For more information, see
> [Configure an external volume for S3-compatible storage](tables-iceberg-s3-compatible.md).

## Cloud platform support

This feature is available to Snowflake accounts hosted on the following supported [cloud platforms](intro-cloud-platforms.md):

* Amazon Web Services
* Google Cloud
* Microsoft Azure

## Requirements for S3-compatible storage

An S3-compatible API endpoint for Snowflake must meet the following requirements:

* Highly compliant with the S3 API and able to pass our [public test suite](https://github.com/snowflakedb/snowflake-s3compat-api-test-suite) (in GitHub).
  If the endpoint does not behave like S3, it cannot work with Snowflake.
* Supported by your third-party storage provider as a Snowflake S3-compatible tested and compliant service. For a list of vendors that have tested
  at least some of their products and found them to work with Snowflake, see Vendor support for S3-compatible storage.
* Accessible from the public cloud where your Snowflake account is hosted.
* Highly available and performant to serve analytics needs.
* Configured to use virtual-hosted-style requests. For more information,
  see [Virtual hosting of buckets](https://docs.aws.amazon.com/AmazonS3/latest/userguide/VirtualHosting.html) in the Amazon S3 documentation.
* Configured to use HTTPS communication with a valid TLS certificate.
* Configured to use direct credentials.
* Does not contain a port number. For example, the endpoint mystorage.com:3000 is not supported.

> **Important:**
>
> Amazon S3-compatible endpoints are not automatically enabled for all accounts.
>
> Cloudflare endpoints that contain `r2.cloudfarestorage.com` are enabled by default. To enable other endpoints, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
> Before you send a request, verify the endpoints by using our [public test suite](https://github.com/snowflakedb/snowflake-s3compat-api-test-suite) (in GitHub).
>
> Provide the following information with your request:
>
> * Your Snowflake account name and cloud region deployment.
> * The endpoint URL (for example, `my-s3-endpoint.example.com`).
> * The software or hardware vendor that provides the endpoint.

## Create an external stage for S3-compatible storage

To create an external stage for S3-compatible storage, create a named [external stage](data-load-overview.md)
by using the [CREATE STAGE](../sql-reference/sql/create-stage.md) command.
You can use an external stage to perform actions such as listing files, loading data, and unloading files.

Optionally, add a [directory table](data-load-dirtables.md) to the external stage.
You can query a directory table to retrieve file URLs to access files in the referenced storage, as well as other metadata.

> **Note:**
>
> When you add a directory table, you must set the AUTO_REFRESH parameter to FALSE. The metadata for S3-compatible external stages cannot be refreshed automatically.

The following example creates an external stage named `my_s3_compat_stage` that points to the bucket and path named `my_bucket/files/` at the endpoint `mystorage.com`.
The AWS_KEY_ID and AWS_SECRET_KEY values used in this example are for illustration purposes only.

```sqlexample
CREATE STAGE my_s3compat_stage
  URL = 's3compat://my_bucket/files/'
  ENDPOINT = 'mystorage.com'
  CREDENTIALS = (AWS_KEY_ID = '1a2b3c...' AWS_SECRET_KEY = '4x5y6z...')
```

## Load and unload data

You can load and unload data using an external stage configured for S3-compatible storage. The following features work with S3-compatible storage:

* Bulk data loading using the [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) command.

  For example, load data into table `t1` from all files located in the `load` subpath in the bucket and path defined in a stage named `my_s3compat_stage`:

  ```sqlexample
  COPY INTO t1
    FROM @my_s3compat_stage/load/;
  ```
* [Calling Snowpipe REST endpoints to continuously load data](data-load-snowpipe-rest-overview.md).

  For sample programs, see [Option 1: Load data with the Snowpipe REST API](data-load-snowpipe-rest-load.md).
* Data unloading using the [COPY INTO <location>](../sql-reference/sql/copy-into-location.md) command.

  For example, unload data from table `t2` into files in the `unload` subpath in the bucket and path defined in a stage named `my_s3compat_stage`:

  ```sqlexample
  COPY INTO @my_s3compat_stage/unload/
    FROM t2;
  ```

## Extend your data lake using external tables

You can use external tables with S3-compatible storage to query data without first loading it into Snowflake.
This section briefly covers how to create and query an external table that references a location on an external stage configured for S3-compatible storage.

Start by creating an external table using [CREATE EXTERNAL TABLE](../sql-reference/sql/create-external-table.md) that reference an S3-compatible external stage.

> **Note:**
>
> The metadata for these external tables cannot be refreshed automatically. The `AUTO_REFRESH = TRUE` parameter setting is not supported.
> You must manually refresh the metadata by executing an [ALTER EXTERNAL TABLE … REFRESH](../sql-reference/sql/alter-external-table.md) command
> to register any added or removed files.

The following example creates an external table named `et` that references subpath `path1` in a stage named `my_s3compat_stage`.
The files in the `path1` subpath are in the Apache Parquet format.

```sqlexample
CREATE EXTERNAL TABLE et
 LOCATION=@my_s3compat_stage/path1/
 AUTO_REFRESH = FALSE
 REFRESH_ON_CREATE = TRUE
 FILE_FORMAT = (TYPE = PARQUET);
```

After you create an external table for S3-compatible storage, you can query it. For example, query the `value` column in the external table created previously:

```sqlexample
SELECT value FROM et;
```

Query performance varies depending on network and application or device performance.
If performance is critical, create a [materialized view](views-materialized.md) on the external table.
Materialized views are pre-computed, so querying a materialized view is faster than executing a query against the base table of the view.

## Vendor support for S3-compatible storage

You can use devices or applications that have an S3-compliant API with Snowflake. However, your storage service provider is responsible for ensuring compliance.

The following vendors have indicated to Snowflake that they have tested at least some of their products and found them to work with Snowflake:

* Akave
* Backblaze
* Cloudflare
* Cloudian
* Cohesity
* Dell
* Hitachi Content Platform
* IBM Storage Ceph
* IDrive e2
* MinIO
* NetApp (StorageGRID)
* Nutanix
* PureStorage
* Scality
* Wasabi

This list is provided for convenience only. Snowflake does not test external products to validate compatibility and cannot fix issues
in products sold by third-party vendors. If you have questions about whether or how your hardware or software with an S3 API works with Snowflake,
contact the vendor directly.

## Test your S3-compatible API

If you are a hardware or software developer who has created an S3-compatible API,
you can use our [public test suite](https://github.com/snowflakedb/snowflake-s3compat-api-test-suite) (in GitHub)
to test whether your S3 API works with Snowflake. The test suite looks for obvious mismatches between your implementation and what Snowflake expects from S3.
However, there might be cases where the tests don’t identify incompatibility.

If you’re a customer and you want to test your own devices, contact your vendor to run these tests.
You can also run these public tests on your devices to assess compatibility.

---
title: Work with object tags
source: https://docs.snowflake.com/en/user-guide/object-tagging/work.md
section: User Guide
---

# Work with object tags

This topic describes how to create a tag and assign it to a Snowflake object. It also contains instructions on how to delete a tag.

## Create a tag

Use the [CREATE TAG](../../sql-reference/sql/create-tag.md) command to create a new tag. For example, to create a basic tag named `cost_center` without
any optional parameters, execute the following:

```sqlexample
CREATE TAG cost_center;
```

### Set a list of allowed tag values

The `ALLOWED_VALUES` tag parameter lets you specify a list of the string values that can be assigned to the tag when the tag is set
on an [object](introduction.md). Users cannot assign a value to a tag unless the value is in the defined list.

The maximum number of possible string values for a single tag is 5,000. The string value for each tag can be up to 256 characters.

You can specify the list of allowed values when creating or replacing a tag with a [CREATE TAG](../../sql-reference/sql/create-tag.md) statement, or while
modifying an existing tag with an [ALTER TAG](../../sql-reference/sql/alter-tag.md) statement. Note that the ALTER TAG statement supports adding allowed
values for a tag and dropping existing values for a tag.

> **Note:**
>
> If a tag is configured to automatically propagate to target objects, the order of values in the allowed list can affect how conflicts are
> resolved. For more information, see [Tag propagation conflicts](propagation.md).

To determine the list of allowed values for a tag, call the [SYSTEM$GET_TAG_ALLOWED_VALUES](../../sql-reference/functions/system_get_tag_allowed_values.md) function.

#### Examples

Create a tag named `cost_center` with `finance` and `engineering` as the only two allowed string values:

> ```sqlexample
> CREATE TAG cost_center
>   ALLOWED_VALUES 'finance', 'engineering';
> ```

Verify the allowed values:

> ```sqlexample
> SELECT SYSTEM$GET_TAG_ALLOWED_VALUES('governance.tags.cost_center');
> ```

Modify the tag named `cost_center` to add `marketing` as an allowed string value:

> ```sqlexample
> ALTER TAG cost_center
>   ADD ALLOWED_VALUES 'marketing';
> ```

Modify the tag named `cost_center` to drop `engineering` as an allowed string value:

> ```sqlexample
> ALTER TAG cost_center
>   DROP ALLOWED_VALUES 'engineering';
> ```

### Define a tag that will automatically propagate

The PROPAGATE tag parameter lets you configure a tag so it is automatically propagated from a source object to target objects under
certain circumstances. This PROPAGATE parameter can be set to the following values:

* `PROPAGATE = ON_DEPENDENCY`: The tag is propagated to a target object when there is an
  [object dependency](propagation.md).
* `PROPAGATE = ON_DATA_MOVEMENT`: The tag is propagated to a target object when [data moves](propagation.md) from the
  source object to the target object.
* `PROPAGATE = ON_DEPENDENCY_AND_DATA_MOVEMENT`: The tag is propagated for both object dependencies and data movement.

For more information about propagation, see [Automatic tag propagation with user-defined tags](propagation.md).

#### Examples

Create a new tag that propagates automatically when there is an object dependency.

```sqlexample
CREATE TAG data_sensitivity PROPAGATE = ON_DEPENDENCY;
```

Update an existing tag to enable automatic propagation for both object dependency and data movement.

```sqlexample
ALTER TAG data_sensitivity SET PROPAGATE = ON_DEPENDENCY_AND_DATA_MOVEMENT;
```

Update an existing tag to disable propagation.

```sqlexample
ALTER TAG data_sensitivity UNSET PROPAGATE;
```

## Set a tag

You can set a tag on an object using the user interface or SQL.

When you set a tag on an object, you must set the value of the tag. This string value can be up to 256 characters.

The user who created a tag might have specified a list of allowed values, in which case you can only set a tag value that is in the list.
To obtain the list of allowed string values for a given tag, call the [SYSTEM$GET_TAG_ALLOWED_VALUES](../../sql-reference/functions/system_get_tag_allowed_values.md)
function. For example, assuming that the tag `cost_center` is stored in a database named `governance` and a schema named `tags`, you
can execute the following to determine that you can set the tag value to `finance` or `marketing`:

```sqlexample
SELECT SYSTEM$GET_TAG_ALLOWED_VALUES('governance.tags.cost_center');
```

```output
+--------------------------------------------------------------+
| SYSTEM$GET_TAG_ALLOWED_VALUES('GOVERNANCE.TAGS.COST_CENTER') |
|--------------------------------------------------------------|
| ["finance","marketing"]                                      |
+--------------------------------------------------------------+
```

### Use Snowsight to set tags

You can set a tag on existing tables, views, and columns using Snowsight.

There are several options to set a tag:

* In the navigation menu, select Catalog » Database Explorer, and then navigate to the desired table, view, or column using the object explorer.

  Select the More menu (that is, `...`) » Edit, and then select + Tag. Follow the prompts to manage the tag
  assignment.
* In the navigation menu, select Governance & security » Tags & policies, and do the following:

  + Select a tile, distribution percentage, and one of the most used tags or tables. When you select an item in the Dashboard,
    Snowsight redirects you to the Tagged Objects tab.
  + Modify the filters as needed. When you select an object or column, Snowsight redirects you to its location in the
    object explorer. Update the tag assignment as needed.
* Navigate to the Tagged Objects tab directly. Modify the filters, select an object or column, and manage the tag assignment.

> **Note:**
>
> To access the Tags & policies area, your Snowflake account must be [Enterprise Edition or higher](../intro-editions.md).
> In addition, you must have one of the following roles:
>
> * Use the ACCOUNTADMIN role.
> * Use a role that is granted the GOVERNANCE_VIEWER and OBJECT_VIEWER database roles.
>
>   For information about these database roles, see [SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md).

### Use SQL to set tags

You can use SQL commands to set a tag when creating a new object or to set a tag on an existing object.

To set a tag on a new object you’re creating, use a CREATE … WITH TAG command. For example, to assign a tag `cost_center` to a
warehouse that you’re creating, execute the following:

```sqlexample
CREATE WAREHOUSE mywarehouse WITH TAG (cost_center = 'sales');
```

To set a tag on an existing object, use an ALTER … SET TAG command. For example, to assign a tag `cost_center` to an existing warehouse,
execute the following:

```sqlexample
ALTER WAREHOUSE wh1 SET TAG cost_center = 'sales';
```

#### Extended example: Create and assign tags with SQL

The following is an extended example that provides a high-level overview on how to use SQL to implement object tagging. It shows you how to
do the following:

* Manage the access control privileges needed to work with tags.

  For simplicity, the workflow assumes a centralized management approach to tags, where the `tag_admin` custom role has both the CREATE
  TAG and the global APPLY TAG privileges. For alternative approaches, see Approaches assigning tagging privileges.
* Create a tag using a [CREATE TAG](../../sql-reference/sql/create-tag.md) statement.
* Assign a tag to a new Snowflake object using a [CREATE <object>](../../sql-reference/sql/create.md) command.
* Assign a tag to existing Snowflake objects using [ALTER <object>](../../sql-reference/sql/alter.md) commands.

1. Create a custom role and assign privileges.

   In a centralized management approach, the `tag_admin` custom role is responsible for creating and assigning tags to Snowflake objects.

   Note that this example uses the ACCOUNTADMIN system role. If using this higher-privileged role in a production environment is not
   desirable, verify that the role assigning privileges to the `tag_admin` custom role has the necessary privileges to qualify the
   `tag_admin` custom role. For more information, see Access control privileges (in this topic).

   ```sqlexample
   USE ROLE USERADMIN;
   CREATE ROLE tag_admin;
   USE ROLE ACCOUNTADMIN;
   GRANT CREATE TAG ON SCHEMA mydb.mysch TO ROLE tag_admin;
   GRANT APPLY TAG ON ACCOUNT TO ROLE tag_admin;
   ```
2. Grant the `tag_admin` custom role to a user serving as the tag administrator.

   ```sqlexample
   USE ROLE USERADMIN;
   GRANT ROLE tag_admin TO USER jsmith;
   ```
3. Execute a [CREATE TAG](../../sql-reference/sql/create-tag.md) statement to create a tag.

   ```sqlexample
   USE ROLE tag_admin;
   USE SCHEMA mydb.mysch;
   CREATE TAG cost_center;
   ```
4. Assign the tag to a new warehouse.

   ```sqlexample
   USE ROLE tag_admin;
   CREATE WAREHOUSE mywarehouse WITH TAG (cost_center = 'sales');
   ```
5. Assign the tag to an existing warehouse.

   ```sqlexample
   USE ROLE tag_admin;
   ALTER WAREHOUSE wh1 SET TAG cost_center = 'sales';
   ```
6. Assign the tag to a column of an existing table.

   ```sqlexample
   ALTER TABLE hr.tables.empl_info
     MODIFY COLUMN job_title
     SET TAG cost_center = 'marketing';
   ```

## Delete a tag

Use the [DROP TAG](../../sql-reference/sql/drop-tag.md) command to delete a tag. When you execute the command, there is a 24-hour grace period before
the tag is permanently deleted. During the grace period, you can execute the UNDROP TAG command to restore the tag, which also restores all
of the tag assignments (that is, references) between the tag and objects.

If you want to determine which objects have a tag before you delete it, query the
[TAG_REFERENCES](../../sql-reference/account-usage/tag_references.md) view (in Account Usage) to determine the tag assignments.

## Access control privileges

### Tag privileges

Snowflake supports the following privileges to determine whether users can create, set, and own tags.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

| Privilege | Usage |
| --- | --- |
| CREATE | Enables creating a new tag in a schema. |
| APPLY | Enables the set and unset operations for the tag on a Snowflake object. For syntax examples, see: Summary of DDL commands, operations, and privileges. |
| OWNERSHIP | Transfers ownership of the tag, which grants full control over the tag. Required to alter most properties of a tag. |

### Summary of DDL commands, operations, and privileges

The following table summarizes the relationship between tag privileges and DDL operations.

| Operation | Privilege required |
| --- | --- |
| Create tag. | A role with the CREATE TAG privilege in the same schema. |
| Create tag that propagates | A role with the APPLY TAG privilege on the account and the OWNERSHIP privilege on the tag. |
| Alter tag. | The role with the OWNERSHIP privilege on the tag. |
| Drop & Undrop tag. | A role with the OWNERSHIP privilege on the tag and the USAGE privilege on the database and schema in which the tag exists. |
| Show tags. | One of the following: . A role with the USAGE privilege on the schema in which the tags exist, or . A role with the APPLY TAG privilege on the account. |
| Set or unset a tag on an object. | For individual objects, a role with the APPLY TAG privilege on the account, or the APPLY TAG privilege on the tag and the OWNERSHIP privilege on the object on which the tag is set. See [Supported objects](introduction.md). |
| Set or unset a tag on a column. | A role with the APPLY TAG privilege on the account, or a role with the APPLY privilege on the tag and the OWNERSHIP privilege on the table or view. |
| Get tags on an object. | See [SYSTEM$GET_TAG](../../sql-reference/functions/system_get_tag.md), [TAG_REFERENCES](../../sql-reference/functions/tag_references.md), and [TAG_REFERENCES_WITH_LINEAGE](../../sql-reference/functions/tag_references_with_lineage.md). |

### Approaches assigning tagging privileges

This section describes different approaches to assigning the privileges needed to create and set tags.

1. For a centralized tag management approach in which the `tag_admin` custom role creates and sets tags on all objects/columns,
   the following privileges are necessary:

   ```sqlexample
   USE ROLE securityadmin;
   GRANT CREATE TAG ON SCHEMA <db_name.schema_name> TO ROLE tag_admin;
   GRANT APPLY TAG ON ACCOUNT TO ROLE tag_admin;
   ```
2. In a hybrid management approach, a single role has the CREATE TAG privilege to ensure tags are named consistently and individual
   teams or roles have the APPLY privilege for a specific tag.

   For example, the custom role `finance_role` can be granted the privilege to set the tag `cost_center` on tables and views
   the role owns (that is, the role has the OWNERSHIP privilege on the table or view):

   ```sqlexample
   USE ROLE securityadmin;
   GRANT CREATE TAG ON SCHEMA <db_name.schema_name> TO ROLE tag_admin;
   GRANT APPLY ON TAG cost_center TO ROLE finance_role;
   ```

---
title: Work with the account budget
source: https://docs.snowflake.com/en/user-guide/budgets/account-budget.md
section: User Guide
---

# Work with the account budget

The account budget monitors spending for all credit usage in the account.

## Activating the account budget

To start using budgets to monitor credit usage for your account, activate the account budget. After you activate the account
budget, you can set the spending limit for the account and configure how notifications are sent when credit usage is
expected to exceed the spending limit. Notifications begin when projected spending is more than 10% above the spending limit.

You can activate the account budget by using Snowsight or by executing SQL statements.

The next sections explain how to activate the account budget:

* Create a custom role to manage the account budget
* Use Snowsight to activate the account budget
* Use SQL commands to activate the account budget

### Create a custom role to manage the account budget

You can create a custom role to activate and modify the account budget. A user who is granted this role can administer the budget by taking
the following actions on the account budget:

* Activate and deactivate the account budget.
* Set the spending limit.
* Edit notification settings.
* Monitor credit usage for the account.

For a full list of roles and privileges required for the budget administrator role, see [Budgets roles and privileges](../budgets.md).

The following example creates a role named `account_budget_admin` and grants the role the ability to monitor and manage the
account budget:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE ROLE account_budget_admin;

GRANT APPLICATION ROLE SNOWFLAKE.BUDGET_ADMIN TO ROLE account_budget_admin;

GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE TO ROLE account_budget_admin;
```

### Use Snowsight to activate the account budget

> **Note:**
>
> Only a user with the ACCOUNTADMIN role or a role
> granted account budget admin privileges can activate and set up the account budget for a regular
> account.
>
> If you are activating the account budget for the [organization account](../organization-accounts.md), use the GLOBALORGADMIN
> role instead of the ACCOUNTADMIN role.

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. If prompted, select a warehouse.
5. In the dashboard, select Set up Account Budget.
6. Enter the target spending limit for the account.
7. Enter the email addresses to receive notification emails.

   > **Note:**
   >
   > Each email address added for budget notifications must be [verified](../notifications/email-notifications.md). The
   > notification email setup fails if any email address in the list is *not* verified.
8. Select Finish Setup.

### Use SQL commands to activate the account budget

> **Note:**
>
> Only a user with the ACCOUNTADMIN role or a role
> granted account budget admin privileges can activate and set up the account budget in a regular
> account.
>
> If you are activating the account budget for the [organization account](../organization-accounts.md), use the GLOBALORGADMIN
> role instead of the ACCOUNTADMIN role.

1. Activate the account budget by calling the [account_root_budget!ACTIVATE](../../sql-reference/classes/budget/methods/activate.md) method on the
   SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET object:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!ACTIVATE();
   ```
2. Set the spending limit calling the [<budget_name>!SET_SPENDING_LIMIT](../../sql-reference/classes/budget/methods/set_spending_limit.md) method:

   ```sqlexample
   CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_SPENDING_LIMIT(1000);
   ```
3. Set up notifications for the budget so that you receive notifications when your credit usage is expected to exceed your
   spending limits.

   See [Notifications for budgets](notifications.md).

## Deactivating the account budget

You can deactivate the account budget using Snowsight or SQL.

Deactivating the account budget resets the account budget to its state before activation:

* All historical account budget data is deleted.
* The background measurement task for the account budget is suspended.
* The account budget settings for spending limit and email notifications are reset.

Account budget deactivation does not affect custom budgets. To remove a custom budget from your account, use
the [DROP BUDGET](../../sql-reference/classes/budget/commands/drop-budget.md) command.

> **Note:**
>
> If the account budget is deactivated, you can’t create new custom budgets using Snowsight.
> However, you can continue to [create custom budgets using SQL](custom-budget.md).

### Use Snowsight to deactivate the account budget

You can deactivate the account budget using the Budgets page:

1. Sign in to [Snowsight](../ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management.
3. Select Budgets.
4. Select the  more menu.
5. Select Deactivate account budget.

### Use SQL commands to deactivate the account budget

You can use the [account_root_budget!DEACTIVATE](../../sql-reference/classes/budget/methods/deactivate.md) method to deactivate the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!DEACTIVATE();
```

---
title: Work with worksheets in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-worksheets.md
section: User Guide
---

# Work with worksheets in Snowsight

Worksheets provide a powerful and versatile method for running SQL queries or Python code within the Snowflake platform, as well as
performing other Snowflake data loading, definition, and manipulation tasks.

> **Important:**
>
> Legacy Worksheets will be removed from Snowsight on **June 22, 2026**.
> [Workspaces](ui-snowsight/workspaces.md) is the replacement
> SQL editing experience. For the full deprecation timeline and migration guidance, see
> [Deprecation of Legacy Worksheets and Dashboards](../release-notes/bcr-bundles/un-bundled/bcr-2260.md).

After you open a worksheet in Snowsight, you can do any of the following:

* [Write SQL statements](ui-snowsight-query.md) and [visualize the results](ui-snowsight-visualizations.md).
* [Write Python code](../developer-guide/snowpark/python/python-worksheets.md).
* Browse and open other worksheets.
* Change the context for the worksheet.
* Update and organize worksheets into folders.
* Share worksheets.
* Manage the history and versioning of worksheets.
* Perform tasks with keyboard shortcuts.
* Recover worksheets owned by a dropped user.

> **Note:**
>
> Executing SQL or Python code in worksheets consumes warehouse credits. To learn how to run code more cost-efficiently, see [Optimizing cost](cost-optimize.md).

## Browse and open worksheets

When you open a worksheet, you can view and manage other worksheets in the Worksheets explorer. The Worksheets explorer also
allows you to search for specific worksheets.

> **Note:**
>
> The search functionality is designed to index worksheet metadata, such as titles and object names. It does not index user-written values
> within queries (such as numbers, string literals, or personally identifiable information/PII) as this is by design to protect sensitive data.

### Preview worksheet contents

To preview the contents of a worksheet, you can hover over the name of the worksheet in the Worksheets explorer. The preview also
shows the role used to run the worksheet.

From the preview, you can also copy the contents of the worksheet. Hover over the worksheet contents preview and
select the Copy button that appears.

## Perform tasks with keyboard shortcuts

Snowsight provides keyboard shortcuts to help you quickly navigate and edit queries in worksheets. For example, you can move your
cursor within a worksheet, perform find and replace, copy lines, format queries, and more using hotkeys.

| Task | MacOS shortcut | Windows shortcut |
| --- | --- | --- |
| Show keyboard shortcuts | `⌘` + `Shift` + `/` | `Ctrl` + `Shift` + `/` |
| New query | `Ctrl` + `⌘` + `n` | `Ctrl` + `Alt` + `n` |
| Search schema or results | `⌘` + `Shift` + `f` | `Ctrl` + `Shift` + `f` |
| Clear selection | `Escape` | `Escape` |
| Make bottom pane larger | `Ctrl` + `` ` `` + `↑` | `Ctrl` + `` ` `` + `↑` |
| Make bottom pane smaller | `Ctrl` + `` ` `` + `↓` | `Ctrl` + `` ` `` + `↓` |
| Go right one pane tab | `Ctrl` + `` ` `` + `→` | `Ctrl` + `` ` `` + `→` |
| Go left one pane tab | `Ctrl` + `` ` `` + `←` | `Ctrl` + `` ` `` + `←` |
| Run selected | `⌘` + `Return` | `Ctrl` + `Enter` |
| Run all | `⌘` + `Shift` + `Return` | `Ctrl` + `Alt` + `Enter` |
| Format query | `⌘` + `Shift` + `o` | `Ctrl` + `Shift` + `o` |
| Indent line | `⌘` + `]` | `Ctrl` + `]` |
| Outdent line | `⌘` + `[` | `Ctrl` + `[` |
| Toggle comment | `⌘` + `/` | `Ctrl` + `/` |
| Search | `⌘` + `f` | `Ctrl` + `f` |
| Replace | `⌘` + `Shift` + `h` | `Ctrl` + `h` |
| Find next | `⌘` + `g` | `F3` |
| Find previous | `⌘` + `Shift` + `g` | `Shift` + `F3` |
| Move line up | `` ` `` + `↑` | `Alt` + `↑` |
| Move line down | `` ` `` + `↓` | `Alt` + `↓` |
| Copy line up | `` ` `` + `Shift` + `↑` | `Alt` + `Shift` + `↑` |
| Copy line down | `` ` `` + `Shift` + `↓` | `Alt` + `Shift` + `↓` |
| Delete line | `Ctrl` + `Shift` + `k` | `Ctrl` + `Shift` + `k` |
| Split pane horizontally | `Ctrl` + `\` | `Ctrl` + `\` |
| Split pane vertically | `Ctrl` + `Shift` + `\` | `Ctrl` + `Shift` + `\` |

To see all available keyboard shortcuts in Snowsight, open a worksheet and press `CMD` + `SHIFT` + `/` on a Mac keyboard or
`CTRL` + `SHIFT` + `/` on a Windows keyboard.

## Change the context for a worksheet

When you create a worksheet, you specify the role and warehouse used to execute the worksheet’s contents. This information is referred to
as **worksheet context**, is preserved for future sessions, and is shared with all users of the same worksheet.

> **Note:**
>
> The role selector lets you choose your primary role. To enable secondary roles in a SQL worksheet, run
> [USE SECONDARY ROLES](../sql-reference/sql/use-secondary-roles.md). To determine whether secondary roles are active in your current session, call the CURRENT_SECONDARY_ROLES function [CURRENT_SECONDARY_ROLES](../sql-reference/functions/current_secondary_roles.md).

The role context for a worksheet determines which operations can be performed on Snowflake objects based on the access control
privileges granted to the role.

To set the context for a worksheet, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Open the context selector.
5. Select a role to run the worksheet as.
6. Select a warehouse that the role has privileges to use.
7. Select anywhere outside the drop-down to close the context selector.

> **Note:**
>
> Each worksheet has a unique session and can use roles different from the role you select in the user menu (your *active role*).
> Changing your active role does not change the role assigned to the worksheet with the context selector.

### Resume or resize a warehouse

Before or after you run your worksheet, you might need to resume or resize your warehouse.

> **Note:**
>
> You must have MODIFY or OWNERSHIP privileges on the warehouse to alter warehouse details.

To view or adjust warehouse details using the context selector, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. Open the context selector.
5. Select the Show warehouse details icon.
6. For the Status option, select the status and choose Resume if the warehouse is suspended.
7. For the Size option, select the size and choose a different size.
8. Select anywhere outside the drop-down to close the context selector.

## Manage worksheets

You can manage worksheets in Snowsight from the worksheet tab or the Worksheets explorer. To access the worksheet tab menu,
open a worksheet, hover over the tab, and select the . To access the Worksheets explorer, hover over a
worksheet name and select the .

The actions available in each menu are based on your current role. Depending on your permissions, you can do the following:

* Rename, delete, or move a worksheet (requires Edit or Ownership permissions).
* Organize worksheets by moving them into folders or a dashboard.
* Import SQL from an external file.
* Format your queries.
* Search for other worksheets.
* Duplicate a worksheet (any role).

> **Tip:**
>
> You can hover over a worksheet to preview its contents.

You can identify which worksheets are open in tabs by referencing the worksheet icon. A solid icon indicates that the worksheet is
currently open. To access menu options, hover over a worksheet name and select the ellipsis visible.

## Share worksheets and folders

Sharing a worksheet or worksheet folder allows you to collaborate with colleagues. Recipients of a shared worksheet can edit or view its
contents, run queries, view results, or duplicate the shared worksheet.

You can share worksheets and folders of worksheets with other Snowflake users in your account. You can only share worksheets directly
with users who have previously signed in to Snowsight. To share with someone who has not yet signed in to
Snowsight, share a link instead (ensure that link sharing is enabled).

### Permissions for shared worksheets

When you share a worksheet with someone, you can manage access to the worksheet and its contents by choosing which permissions to grant to
the other user. These permissions are also used for [sharing dashboards](ui-snowsight-dashboards.md).
Worksheet owners have the same permissions as worksheet editors.

Each worksheet in Snowsight uses a unique session with a specific role and warehouse assigned in the context of the worksheet.
The *worksheet role* is the *primary role* last used to run the worksheet and is required to run the worksheet.

> **Note:**
>
> Users with Run permissions can also change the worksheet’s role using [USE ROLE](../sql-reference/sql/use-role.md).

To view the results of an earlier worksheet version, you need to have the primary role that was used to run the SQL statement that
generated the results. See Viewing results for past runs of a worksheet.

| Permissions Granted | Recipient Can: |
| --- | --- |
| Edit | * Edit the worksheet contents. * Run the worksheet, including using a different role. * View and manage past versions of the worksheet. * View and manage results from past worksheet versions, provided they have the role used to generate the results. * Share the worksheet with others. * Add the worksheet to a different folder. |
| View + Run | * Inherits all privileges from View Results (see below). * Run the worksheet, provided they have the worksheet role. * View the results of the most recent worksheet version. * Duplicate and run the worksheet using their own role. |
| View Results | * Inherits all privileges from Link with View Results (see below). * View the results of the most recent worksheet version, provided they have the worksheet role. * Duplicate and run the worksheet using their own role. |
| Link with View + Run | * Inherits all privileges from Link with View Results (see below). * Run the worksheet, provided they have the worksheet role. * View the results of the most recent worksheet version. * View the worksheet contents (but cannot duplicate or run the worksheet). |
| Link with View Results | * View the results of the most recent worksheet version, provided they have the worksheet role. * View the worksheet contents (but cannot duplicate or run the worksheet). |

The worksheet owner is the user who created the worksheet and has the same permissions as a worksheet editor. The worksheet owner changes
if the worksheet is added to a folder owned by another user.

> **Important:**
>
> If a worksheet owner is dropped from Snowflake, the dropped user will remain the owner of the worksheet; however, users with any share
> permissions can continue to access and use the worksheet. Any user with the worksheet link will still be able to access it if link
> sharing is enabled. To maintain worksheet access, Snowflake recommends having the user share their worksheets with Edit permissions
> (rather than View or View + Run) *before dropping the user* so others can continue to modify or delete the worksheet. To recover the
> worksheets owned by a dropped user including those that aren’t shared, see Recover worksheets owned by a dropped user.

#### Viewing results for past runs of a worksheet

When you run one or all queries in a worksheet, your query results are displayed as a table. You can navigate the query results with the
arrow keys, as you would with a spreadsheet. You can select columns, cells, rows, or ranges in
the results table. You can copy and paste any selection.

To view the results for past runs of a worksheet, the following must be true:

* The user must have the role used to run the SQL statement that generated the results.
* The results must still be stored in Snowflake. See Stored results for past worksheet versions.

Snowsight allows you to review generated statistics for up to 1 million rows of results. These statistics provide contextual
information for any selection,
as well as overall statistics. See [Automatic contextual statistics](ui-snowsight-query.md) for more details.

You can also:

* View your results as a chart by selecting Chart. For more details about charts, see [Visualizing worksheet data](ui-snowsight-visualizations.md).
* Review results of a past worksheet run by viewing the Query History for a worksheet. See [View query history](ui-snowsight-query.md).

### Share a worksheet

To share a worksheet, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a worksheet.
4. In the upper-right corner of the worksheet, select Share.
5. Enter the names or usernames of the Snowflake users to invite to use your worksheet. The list only shows users who have
   previously signed in to Snowsight. To share with someone who has not yet signed in to
   Snowsight, select Get Link to generate a link to share instead.
6. Choose the permissions to grant to the users with whom you share the worksheet.
7. Optionally, set permissions for what people with the link to the worksheet can access.
8. Select Done.

> **Note:**
>
> The worksheet that is shared is the most recently run version. If the worksheet has never been run, the contents will appear empty. Any edits
> that you make to your version of the worksheet, whether you’re an editor or an owner of the worksheet, do not appear for collaborators
> until you run part or all of the worksheet code.

Any worksheet that you share (either directly or through a link) with a collaborator can appear in their search results or worksheets
list. Worksheets shared directly appear immediately, while those shared through a link appear once they have been accessed. These
worksheets will continue to appear in the collaborator’s search results or lists unless they are deleted by a user with edit access, or
if the collaborator’s access permissions to the worksheets are removed.

### Share a folder of worksheets

To share a folder, including all worksheets in the folder, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open a folder.
4. Select Share.
5. Enter the names or usernames of the Snowflake users to invite them to your folder. The list only shows users who have previously
   signed in to Snowsight. If you want to share with someone who has not yet signed in to Snowsight, share a link instead.
6. Choose the permissions to grant to the users with whom you share the folder.
7. Optionally, set permissions for what people with the link to the folder can access.
8. Optionally, select Get Link to get a link to your folder that you can share with others.
9. Select Done.

If you add a worksheet to a shared folder, the worksheet inherits the sharing settings of the folder. If the folder is owned by someone
other than the worksheet owner, the folder owner becomes the worksheet owner, and the original worksheet owner inherits
the sharing permissions from the folder.

For example, if a worksheet owner adds a worksheet to a folder on which they have edit permissions, the worksheet updates
to be owned by the folder owner, and the original worksheet owner then has edit permissions on the worksheet.

Any folder shared (either directly or through a link) with a collaborator can appear in their search results or folders
list. Folders shared directly appear immediately, while those shared through a link appear once they have been accessed. These folders will
continue to appear in the collaborator’s search results or lists unless they are deleted by a user with edit access, or if the
collaborator’s access permissions to the worksheets are removed.

### Share worksheets across accounts

Worksheets cannot be replicated or shared across accounts. To share the contents of a worksheet
with users in another Snowflake account, copy the contents and share it with users in the account outside of Snowflake.

## Manage worksheet history and versions

Any local edits you make to a worksheet are automatically saved every three seconds but remain visible only to you. When you run a SQL
query or execute code in a worksheet, the latest version is updated and shared with all collaborators. You can also view past versions of a
worksheet and optionally copy details from any version. For more information, see Switch worksheet versions.

When making changes to worksheets and managing worksheet versions, consider the following:

* When you share a worksheet with other users, users with edit permissions can view past versions of the worksheet.
  All users that you share a worksheet with can view up to 10,000 rows of the results for the most recent version of the worksheet.
* Whenever someone with permissions runs a worksheet, a new version of the worksheet is saved.
* If you make changes to the worksheet and they seem to disappear, use the version history to open the saved draft with your changes.
* The most recently run version of the worksheet is the version visible to collaborators.
* If you make changes to the worksheet that you want to be visible to the users with whom you shared the worksheet, you must run the worksheet.
* If multiple users edit and run a shared worksheet at the same time, each run of the worksheet creates a new version. The most recently
  run version of the worksheet is the one visible when you open or refresh the worksheet.

### Switch worksheet versions

To view past versions of a worksheet, do the following:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open the worksheet.
4. Select Code Versions for the worksheet.
5. From the list of worksheet versions, select the timestamp of the version that you want to view.
6. Review and optionally copy the worksheet details for that version.
7. Select Close to return to the current version of the worksheet.

To view the results of a past worksheet run, view the Query History for the worksheet.
See [View query history](ui-snowsight-query.md).

### Stored results for past worksheet versions

> **Note:**
>
> Available to most accounts. Accounts in U.S. government regions, accounts using Virtual Private Snowflake (VPS), and accounts
> that use Private Connectivity to access Snowflake continue to see query results limited to 10,000 rows.

All results for queries executed in worksheets are available for up to 24 hours. After 24 hours, you must run your query again to view
results.

To support contextual statistics and sharing worksheet results, the 25 latest query results are cached for up to 90 days. This cache is
included in the data storage usage for your account.

## Recover worksheets owned by a dropped user

If you drop a user, you can recover up to 500 of the worksheets owned by that user. To recover the worksheets, do the following:

1. Download recovered worksheets owned by a dropped user.
2. [Create worksheets from a SQL file](ui-snowsight-worksheets-gs.md) to add the recovered worksheets back to Snowflake.

If you want to change ownership or retain access to worksheets before dropping a user, ask that user to share the worksheets.
See Share worksheets and folders.

### Download recovered worksheets owned by a dropped user

To recover worksheets owned by a dropped user, download a `.tar.gz` archive file of up to 500 worksheets owned by that user.

> **Note:**
>
> You must be granted the ACCOUNTADMIN role to recover worksheets of dropped users.

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Settings.
3. Select General.
4. Next to Recover worksheets from dropped users, select Recover worksheets.
5. In the dialog box, enter the username of a dropped user in your account.

   > **Important:**
   >
   > The case and spelling of the username must exactly match the username as it was stored in Snowflake.
6. Select Recover.

   Your web browser downloads a `.tar` file containing up to 500 worksheets. If the dropped user has more than 500 worksheets,
   only the 500 most recently modified worksheets are downloaded.

After downloading worksheets owned by a dropped user, add the recovered worksheets to Snowsight by creating worksheets from
the SQL files.

You must expand the downloaded `.tar` file into a folder of `.sql` files before you can add recovered worksheets to
Snowsight. You can only add one worksheet at a time to Snowsight, and the user who adds the recovered worksheets to
Snowsight becomes the new owner of the worksheets.

See [Create worksheets from a SQL file](ui-snowsight-worksheets-gs.md) for details.

### Considerations for recovering worksheets owned by dropped users

* Only the title and contents of the most recently executed version of a worksheet are recovered. Worksheet version history,
  sharing recipients and permissions, query results, and worksheet metadata are not recovered.
* A maximum of 500 worksheets are recovered. For dropped users with more than 500 worksheets, only the 500 most recently modified worksheets
  are recovered.
* Only worksheets in Snowsight are recovered. Worksheets in Classic Console owned by dropped users cannot be recovered with
  this method.
* If multiple dropped users have the same username, worksheets owned by all dropped users with that username are recovered.

If the worksheet recovery fails for unexpected reasons, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Internal Snowflake objects for worksheets

Snowflake creates the following internal objects to support using worksheets in Snowsight:

| Object Type | Name |
| --- | --- |
| Security integration | WORKSHEETS |
| Blobs | WORKSHEETS_APP |
| Database | WORKSHEETS_APP |
| User | WORKSHEETS_APP_USER |
| Roles | APPADMIN, WORKSHEETS_APP_RL |

These internal objects are used to cache query results in an internal stage in your account. This cached data is encrypted and protected by
the key hierarchy for the account.

The limited privileges granted to the internal role only allow Snowsight to access the internal stage to store those results. The
role cannot list objects in your account or access data in your tables.

The Snowsight user and role are returned when you query the [USERS](../sql-reference/account-usage/users.md) and
[ROLES](../sql-reference/account-usage/roles.md) views, respectively, in the [ACCOUNT_USAGE](../sql-reference/account-usage.md) schema
in the SNOWFLAKE shared database. [SHOW <objects>](../sql-reference/sql/show.md) statements do not return these internal objects.

---
title: Working with account editions
source: https://docs.snowflake.com/en/user-guide/organizations-manage-accounts-editions.md
section: User Guide
---

# Working with account editions

Each account in an organization has a specific [Snowflake edition](intro-editions.md) that determines its available features
and level of service.

[As the organization administrator](organization-administrators.md), you can check the account edition in
[Snowsight](ui-snowsight-gs.md) or using SQL:

> Snowsight:
> :   In the navigation menu, select Admin » Accounts.
>
> SQL:
> :   Execute the [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md) command.

If you would like to change an account’s edition, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Tip:**
>
> If you are not an organization administrator but want to view the edition of an account, do the following:
>
> 1. Open Snowsight, and then select your name to open the user menu.
> 2. Locate the account and select View Account Details.
> 3. View the edition and other details.

---
title: Working with CTEs (Common Table Expressions)
source: https://docs.snowflake.com/en/user-guide/queries-cte.md
section: User Guide
---

# Working with CTEs (Common Table Expressions)

See also:
:   [CONNECT BY](../sql-reference/constructs/connect-by.md) , [WITH](../sql-reference/constructs/with.md)

## What is a CTE?

A CTE (common table expression) is a named subquery defined in a [WITH](../sql-reference/constructs/with.md) clause. You can
think of the CTE as a temporary [view](views-introduction.md) for use in the statement that defines the
CTE. The CTE defines the temporary view’s name, an optional list of column names, and a query expression (i.e. a SELECT
statement). The result of the query expression is effectively a table. Each column of that table corresponds to a column
in the (optional) list of column names.

The following code is an example of a query that uses a CTE:

```sqlexample
WITH
    my_cte (cte_col_1, cte_col_2) AS (
        SELECT col_1, col_2
            FROM ...
    )
SELECT ... FROM my_cte;
```

In the example above, the CTE starts on the line containing `my_cte (cte_col_1, cte_col_2) AS (`, and ends on the line containing
`)`.

Avoid choosing CTE names that match the following:

* [SQL function names](../sql-reference/functions-all.md)
* Tables, views, or materialized views. If a query defines a CTE with a particular name, the CTE takes precedence over tables, etc.

A CTE can be recursive or non-recursive. A recursive CTE is a CTE that references itself. A recursive CTE can join a
table to itself as many times as necessary to process hierarchical data in the table.

CTEs increase modularity and simplify maintenance.

## Recursive CTEs and Hierarchical Data

Recursive CTEs enable you to process hierarchical data, such as a parts explosion (component, sub-components) or a
management hierarchy (manager, employees). For more information about hierarchical data, and other ways to query
hierarchical data, see [Querying Hierarchical Data](queries-hierarchical.md).

A recursive CTE allows you to join all the levels of a hierarchy without knowing in advance how many levels there are.

### Overview of Recursive CTE Syntax

This section provides an overview of the syntax and how the syntax relates to the way that the recursion works:

```sqlsyntax
WITH [ RECURSIVE ] <cte_name> AS
(
  <anchor_clause> UNION ALL <recursive_clause>
)
SELECT ... FROM ...;
```

Where:
:   `anchor_clause`
    :   Selects an initial row or set of rows that represent the top of the hierarchy. For
        example, if you are trying to display all the employees in a company, the anchor clause would select the President
        of the company.

        The anchor clause is a [SELECT](../sql-reference/sql/select.md) statement and can contain any supported SQL constructs.
        The anchor clause cannot reference the `cte_name`.

    `recursive_clause`
    :   Selects the next layer of the hierarchy based on the previous layer. In the first iteration, the previous layer is the
        result set from the anchor clause. In subsequent iterations, the previous layer is the most recent completed iteration.

        The `recursive_clause` is a [SELECT](../sql-reference/sql/select.md) statement; however, the statement is restricted
        to projections, joins, and filters. In addition, the following are not allowed in the statement:

        * Aggregate or window functions.
        * `GROUP BY`, `ORDER BY`, `LIMIT`, or `DISTINCT`.

        The recursive clause can reference the `cte_name` like a regular table or view.

For a more detailed description of the syntax, see [WITH](../sql-reference/constructs/with.md).

Logically, the recursive CTE is evaluated as follows:

1. The `anchor_clause` is evaluated and its result is written to both the final result set and to a working table.
   The `cte_name` is effectively an alias to the working table; in other words, a query referencing the
   `cte_name` reads from the working table.
2. While the working table is not empty:

   1. The `recursive_clause` is evaluated, using the current contents of the working table wherever `cte_name`
      is referenced.
   2. The result of `recursive_clause` is written to both the final result set and a temp table.
   3. The working table is overwritten by the content of the temp table.

Effectively, the output of the previous iteration is stored in a working table named `cte_name`, and that table is
then one of the inputs to the next iteration. The working table contains only the result of the most recent iteration.
The accumulated results from all iterations so far are stored elsewhere.

After the final iteration, the accumulated results are available to the main SELECT statement by referencing `cte_name`.

### Recursive CTE Considerations

#### Potential for Infinite Loops

Constructing a recursive CTE incorrectly can cause an infinite loop. In these cases, the query continues to run until the query
succeeds, the query times out (e.g. exceeds the number of seconds specified by the
[STATEMENT_TIMEOUT_IN_SECONDS](../sql-reference/parameters.md) parameter), or you [cancel the query](querying-cancel-statements.md).

For information on how infinite loops can occur and for guidelines on how to avoid this problem, see
Troubleshooting a Recursive CTE.

#### Non-Contiguous Hierarchies

This topic described hierarchies and how parent-child relationships can be used by recursive CTEs. In all of the examples
in this topic, the hierarchies are contiguous.

For information about non-contiguous hierarchies, see [Querying Hierarchical Data](queries-hierarchical.md).

## Examples

This section includes both non-recursive and recursive CTEs examples to contrast the two types.

### Non-Recursive, Two-Level, Self-joining CTE

This example uses a table of employees and managers:

> ```sqlexample
> CREATE OR REPLACE TABLE employees (title VARCHAR, employee_ID INTEGER, manager_ID INTEGER);
> ```
>
> ```sqlexample
> INSERT INTO employees (title, employee_ID, manager_ID) VALUES
>     ('President', 1, NULL),  -- The President has no manager.
>         ('Vice President Engineering', 10, 1),
>             ('Programmer', 100, 10),
>             ('QA Engineer', 101, 10),
>         ('Vice President HR', 20, 1),
>             ('Health Insurance Analyst', 200, 20);
> ```

A two-level self-join of this employee table looks like:

> ```sqlexample
> SELECT
>      emps.title,
>      emps.employee_ID,
>      mgrs.employee_ID AS MANAGER_ID,
>      mgrs.title AS "MANAGER TITLE"
>   FROM employees AS emps LEFT OUTER JOIN employees AS mgrs
>     ON emps.manager_ID = mgrs.employee_ID
>   ORDER BY mgrs.employee_ID NULLS FIRST, emps.employee_ID;
> +----------------------------+-------------+------------+----------------------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MANAGER TITLE              |
> |----------------------------+-------------+------------+----------------------------|
> | President                  |           1 |       NULL | NULL                       |
> | Vice President Engineering |          10 |          1 | President                  |
> | Vice President HR          |          20 |          1 | President                  |
> | Programmer                 |         100 |         10 | Vice President Engineering |
> | QA Engineer                |         101 |         10 | Vice President Engineering |
> | Health Insurance Analyst   |         200 |         20 | Vice President HR          |
> +----------------------------+-------------+------------+----------------------------+
> ```

The query above shows all the employees. Each manager’s employees appear near their manager in the report. However, the
report doesn’t visually show the hierarchy. Without looking carefully at the data, you don’t know how many levels there
are in the organization, and you need to read each row in order to see which employees are associated with a specific
manager.

A recursive CTE can display this hierarchical data as a sideways tree, as shown in the next section.

### Recursive CTE with Indented Output

Below are two examples of using a recursive CTE:

* The first uses indentation to show the different levels of the hierarchy. To simplify this example, the code does not
  produce the rows in a particular order.
* The second example uses indentation and shows each manager’s employees immediately below their manager.

#### Unordered Output

Here is the first example.

```sqlexample
WITH RECURSIVE managers                                     -- Line 1
    (indent, employee_ID, manager_ID, employee_title)       -- Line 2
  AS                                                        -- Line 3
    (                                                       -- Line 4
                                                            -- Line 5
      SELECT '' AS indent,                                  -- Line 6
             employee_ID,                                   -- Line 7
             manager_ID,                                    -- Line 8
             title AS employee_title                        -- Line 9
        FROM employees                                      -- Line 10
        WHERE title = 'President'                           -- Line 11
                                                            -- Line 12
        UNION ALL                                           -- Line 13
                                                            -- Line 14
        SELECT indent || '--- ',                            -- Line 15
               employees.employee_ID,                       -- Line 16
               employees.manager_ID,                        -- Line 17
               employees.title                              -- Line 18
          FROM employees JOIN managers                      -- Line 19
            ON employees.manager_ID = managers.employee_ID  -- Line 20
    )                                                       -- Line 21
                                                            -- Line 22
SELECT indent || employee_title AS Title,                   -- Line 23
       employee_ID,                                         -- Line 24
       manager_ID                                           -- Line 25
  FROM managers;                                            -- Line 26
```

The query includes the following sections:

* Line 2 contains the column names for the “view” (CTE).
* Lines 4 - 21 contain the CTE.
* Lines 6 - 11 contain the anchor clause of the CTE.
* Lines 15 - 21 contain the recursive clause of the CTE.
* Lines 23 - 26 contain the main SELECT that uses the CTE as a view. This SELECT references:

  + The CTE name (`managers`), defined in line 1.
  + The CTE’s columns (`indent`, `employee_id`, etc.) defined in line 2.

The CTE contains two SELECT statements:

* The SELECT statement in the anchor clause is executed once and provides the set of rows from the
  first (top) level of the hierarchy.
* The SELECT in the recursive clause can reference the CTE. You can think of the query as
  iterating, with each iteration building on the previous iterations’ query results.

In the manager/employee example, the anchor clause emits the first row, which is the row that describes the company
president.

In the next iteration of the recursive clause, the recursive clause finds all the rows whose manager is the company
president (i.e. it finds all of the vice presidents). The 3rd iteration finds all the employees whose manager is one
of the vice presidents. Iteration continues until there is an iteration in which all of the rows retrieved are rows of
leaf-level employees who do not manage anyone. The statement does one more iteration, looking for (but not finding)
any employees whose managers are leaf-level employees. That iteration produces 0 rows, and the iteration stops.

Throughout these iterations, the UNION ALL clause accumulates the results. The results of each iteration are added
to the results of the previous iterations. After the last iteration completes, the accumulated rows (like any rows
produced in a WITH clause) are made available to the main SELECT clause in the query. That main SELECT can then query
those rows.

This particular example query uses indentation to show the hierarchical nature of the data. If you look at the output,
you see that the lower the level of the employee, the further that employee’s data is indented.

The indentation is controlled by the column named `indent`. The indentation starts at 0 characters (an empty string
in the anchor clause), and increases by 4 characters (`---`) for each iteration (i.e. for each level in the hierarchy).

Not surprisingly, it is very important to construct the join(s) correctly, and to select the correct columns in the
recursive clause. The columns in the SELECT of the recursive clause must correspond correctly to the columns in
the anchor clause. Remember that the query starts with the President, then selects the Vice Presidents, and then
selects the people who report directly to the Vice Presidents, etc. Each iteration looks for employees whose
`manager_id` field corresponds to one of the `managers.employee_id` values produced in the previous iteration.

Expressed another way, the employee ID in the managers “view” is the manager ID for the next level of employees. The
employee IDs must progress downward through the hierarchy (President, Vice President, senior manager, junior manager, etc.)
during each iteration. If the employee IDs don’t progress, then the query can loop infinitely (if the same `manager_ID`
keeps appearing in the `managers.employee_ID` column in different iterations), or skip a level, or fail in other ways.

#### Ordered Output

The previous example had no ORDER BY clause, so even though each employee’s record is indented properly, each employee did
not necessarily appear directly underneath their manager. The example below generates output with correct indentation, and
with each manager’s employees directly underneath their manager.

The query’s ORDER BY clause uses an additional column, named `sort_key`. The sort key accumulates as the recursive clause
iterates; you can think of the sort key as a string that contains the entire chain of command above you (your manager, your
manager’s manager, etc.). The most senior person in that chain of command (the President) is at the beginning of the sort
key string. Although you normally wouldn’t display the sort key, the query below includes the sort key in the output so that
it is easier to understand the output.

Each iteration should increase the length of the sort key by the same amount (same number of characters), so the query uses
a UDF (user-defined function) named `skey`, with the following definition, to generate consistent-length segments of the
sort key:

> > ```sqlexample
> > CREATE OR REPLACE FUNCTION skey(ID VARCHAR)
> >   RETURNS VARCHAR
> >   AS
> >   $$
> >     SUBSTRING('0000' || ID::VARCHAR, -4) || ' '
> >   $$
> >   ;
> > ```
>
> Here is an example of output from the `SKEY` function:
>
> > ```sqlexample
> > SELECT skey(12);
> > +----------+
> > | SKEY(12) |
> > |----------|
> > | 0012     |
> > +----------+
> > ```

Here is the final version of the query. This puts each manager’s employees immediately underneath that manager, and indents based
on the “level” of the employee:

> ```sqlexample
> WITH RECURSIVE managers
>       -- Column list of the "view"
>       (indent, employee_ID, manager_ID, employee_title, sort_key)
>     AS
>       -- Common Table Expression
>       (
>         -- Anchor Clause
>         SELECT '' AS indent,
>             employee_ID, manager_ID, title AS employee_title, skey(employee_ID)
>           FROM employees
>           WHERE title = 'President'
>
>         UNION ALL
>
>         -- Recursive Clause
>         SELECT indent || '--- ',
>             employees.employee_ID, employees.manager_ID, employees.title,
>             sort_key || skey(employees.employee_ID)
>           FROM employees JOIN managers
>             ON employees.manager_ID = managers.employee_ID
>       )
>
>   -- This is the "main select".
>   SELECT
>          indent || employee_title AS Title, employee_ID,
>          manager_ID,
>          sort_key
>     FROM managers
>     ORDER BY sort_key
>   ;
> +----------------------------------+-------------+------------+-----------------+
> | TITLE                            | EMPLOYEE_ID | MANAGER_ID | SORT_KEY        |
> |----------------------------------+-------------+------------+-----------------|
> | President                        |           1 |       NULL | 0001            |
> | --- Vice President Engineering   |          10 |          1 | 0001 0010       |
> | --- --- Programmer               |         100 |         10 | 0001 0010 0100  |
> | --- --- QA Engineer              |         101 |         10 | 0001 0010 0101  |
> | --- Vice President HR            |          20 |          1 | 0001 0020       |
> | --- --- Health Insurance Analyst |         200 |         20 | 0001 0020 0200  |
> +----------------------------------+-------------+------------+-----------------+
> ```

The next query shows how to reference a field from the previous (higher) level in the hierarchy; pay particular attention to the
`mgr_title` column:

> ```sqlexample
> WITH RECURSIVE managers
>       -- Column names for the "view"/CTE
>       (employee_ID, manager_ID, employee_title, mgr_title)
>     AS
>       -- Common Table Expression
>       (
>
>         -- Anchor Clause
>         SELECT employee_ID, manager_ID, title AS employee_title, NULL AS mgr_title
>           FROM employees
>           WHERE title = 'President'
>
>         UNION ALL
>
>         -- Recursive Clause
>         SELECT
>             employees.employee_ID, employees.manager_ID, employees.title, managers.employee_title AS mgr_title
>           FROM employees JOIN managers
>             ON employees.manager_ID = managers.employee_ID
>       )
>
>   -- This is the "main select".
>   SELECT employee_title AS Title, employee_ID, manager_ID, mgr_title
>     FROM managers
>     ORDER BY manager_id NULLS FIRST, employee_ID
>   ;
> +----------------------------+-------------+------------+----------------------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MGR_TITLE                  |
> |----------------------------+-------------+------------+----------------------------|
> | President                  |           1 |       NULL | NULL                       |
> | Vice President Engineering |          10 |          1 | President                  |
> | Vice President HR          |          20 |          1 | President                  |
> | Programmer                 |         100 |         10 | Vice President Engineering |
> | QA Engineer                |         101 |         10 | Vice President Engineering |
> | Health Insurance Analyst   |         200 |         20 | Vice President HR          |
> +----------------------------+-------------+------------+----------------------------+
> ```

### Parts Explosion

Manager/employee hierarchies are not the only type of variable-depth hierarchies that you can store in a single table and process
with a recursive CTE. Another common example of hierarchical data is a “parts explosion”, in which each component can be listed with
its sub-components, each of which can be listed with its sub-sub-components.

For example, suppose that your table contains hierarchical data, such as the components of a car. Your car probably contains
components such as an engine, wheels, etc. Many of those components contain sub-components (e.g. an engine might contain a fuel pump).
The fuel pump might contain a motor, tubing, etc. You could list all the components and their sub-components using a recursive CTE.

For an example of a query that produces a parts explosion, see [WITH](../sql-reference/constructs/with.md).

## Troubleshooting a Recursive CTE

### Recursive CTE Query Runs Until It Succeeds or Times Out

This issue can be caused by two different scenarios:

* Your data hierarchy might have a cycle.
* You might have created an infinite loop.

#### Cause 1: Cyclic Data Hierarchy

If your data hierarchy contains a cycle (i.e. it is not a true tree), there are multiple possible solutions:

Solution 1.1:
:   If the data is not supposed to contain a cycle, correct the data.

Solution 1.2:
:   Limit the query in some way (e.g. limit the number of rows of output). For example:

    > ```sqlexample
    > WITH RECURSIVE t(n) AS
    >     (
    >     SELECT 1
    >     UNION ALL
    >     SELECT N + 1 FROM t
    >    )
    >  SELECT n FROM t LIMIT 10;
    > ```

Solution 1.3:
:   Do not use a query that contains a recursive CTE, which expects hierarchical data.

#### Cause 2: Infinite Loop

An infinite loop can happen if the projection clause in the `recursive_clause` outputs a value
from the “parent” (the previous iteration) instead of the “child” (the current iteration) and then the next
iteration uses that value in a join when it should use the current iteration’s value in the join.

The following pseudo-code shows an approximate example of this:

> ```sqlexample
> CREATE TABLE employees (employee_ID INT, manager_ID INT, ...);
> INSERT INTO employees (employee_ID, manager_ID) VALUES
>         (1, NULL),
>         (2, 1);
>
> WITH cte_name (employee_ID, manager_ID, ...) AS
>   (
>      -- Anchor Clause
>      SELECT employee_ID, manager_ID FROM table1
>      UNION ALL
>      SELECT manager_ID, employee_ID   -- <<< WRONG
>          FROM table1 JOIN cte_name
>            ON table1.manager_ID = cte_name.employee_ID
>   )
> SELECT ...
> ```

In this example, the recursive clause passes its parent value (`manager_id`) in the column that should have the
current/child value (`employee_id`). The parent will show up as the “current” value in the next iteration, and will
be passed again as the “current” value to the following generation, so the query never progresses down through the
levels; it keeps processing the same level each time.

Step 1:
:   Suppose that the anchor clause selects the values `employee_id = 1` and `manager_id = NULL`.

    CTE:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ---------
    >       1         NULL
    > ```

Step 2:
:   During the first iteration of the recursive clause, `employee_id = 2` and `manager_id = 1` in `table1`.

    CTE:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >        1         NULL
    > ```

    `table1`:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        2         1
    >  ...
    > ```

    Result of the join in the recursive clause:

    > ```sqlexample
    > table1.employee_ID  table1.manager_ID  cte.employee_ID  cte.manager_ID
    > -----------------   -----------------  ---------------  --------------
    >  ...
    >        2                   1                 1                NULL
    >  ...
    > ```

    Projection:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        2         1
    >  ...
    > ```

    However, because the `employee_id` and `manager_id` columns are reversed in the projection, the actual output of
    the query (and thus the content of the CTE at the start of the next iteration) is:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        1         2        -- Because manager and employee IDs reversed
    >  ...
    > ```

Step 3:
:   During the second iteration of the recursive clause:

    CTE:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >        1         2
    > ```

    `table1`:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        2         1
    >  ...
    > ```

    Result of join in recursive clause:

    > ```sqlexample
    > table1.employee_ID  table1.manager_ID  cte.employee_ID  cte.manager_ID
    > -----------------   -----------------  ---------------  --------------
    >  ...
    >        2                   1                 1                2
    >  ...
    > ```

    Projection:

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        2         1
    >  ...
    > ```

    Result of the query (contents of CTE at start of next iteration):

    > ```sqlexample
    > employee_ID  manager_ID
    > -----------  ----------
    >  ...
    >        1         2        -- Because manager and employee IDs reversed
    >  ...
    > ```

    As you can see, at the end of the second iteration, the row in the CTE is the same as it was at the start of the
    iteration:

    * `employee_id` is `1`.
    * `manager_id` is `2`.

    Thus, the result of the join during the next iteration will be the same as the result of the join during the current
    iteration, and the query loops infinitely.

If you have created an infinite loop:

Solution 2:
:   Make sure that the recursive clause passes the correct variables in the correct order.

    Also make sure that the JOIN condition in the recursive clause is correct. In a typical case, the parent of the
    “current” row should be joined to the child/current value of the parent row.

---
title: Working with joins
source: https://docs.snowflake.com/en/user-guide/querying-joins.md
section: User Guide
---

Categories:
:   [Query syntax](../sql-reference/constructs.md)

# Working with joins

A join combines rows from two tables to create a new combined row that can be used in the query.

## Introduction

Joins are useful when the data in the tables is related. For example, one table might hold information about projects,
and one table might hold information about employees working on those projects.

```sqlexample
CREATE TABLE projects (
  project_id INT,
  project_name VARCHAR);

INSERT INTO projects VALUES
  (1000, 'COVID-19 Vaccine'),
  (1001, 'Malaria Vaccine'),
  (1002, 'NewProject');

CREATE TABLE employees (
  employee_id INT,
  employee_name VARCHAR,
  project_id INT);

INSERT INTO employees VALUES
  (10000001, 'Terry Smith', 1000),
  (10000002, 'Maria Inverness', 1000),
  (10000003, 'Pat Wang', 1001),
  (10000004, 'NewEmployee', NULL);
```

Query the tables to view the data:

```sqlexample
SELECT * FROM projects ORDER BY project_ID;
```

```output
+------------+------------------+
| PROJECT_ID | PROJECT_NAME     |
|------------+------------------|
|       1000 | COVID-19 Vaccine |
|       1001 | Malaria Vaccine  |
|       1002 | NewProject       |
+------------+------------------+
```

```sqlexample
SELECT * FROM employees ORDER BY employee_ID;
```

```output
+-------------+-----------------+------------+
| EMPLOYEE_ID | EMPLOYEE_NAME   | PROJECT_ID |
|-------------+-----------------+------------|
|    10000001 | Terry Smith     |       1000 |
|    10000002 | Maria Inverness |       1000 |
|    10000003 | Pat Wang        |       1001 |
|    10000004 | NewEmployee     |       NULL |
+-------------+-----------------+------------+
```

The two joined tables usually contain one or more columns in common so that the rows
in one table can be associated with the corresponding rows in the other table.
For example, in these sample tables, each row in the projects table has a unique project ID
number, and each row in the employees table includes the ID number of
the project that the employee is currently assigned to.

The join operation specifies, explicitly or implicitly, how to relate rows
in one table to the corresponding rows in the other table, typically by
referencing one or more common columns, such as `project_id`. For example, the following
joins the `projects` and `employees` tables that were created previously:

```sqlexample
SELECT p.project_ID, project_name, employee_ID, employee_name, e.project_ID
  FROM projects AS p JOIN employees AS e
    ON e.project_ID = p.project_ID
  ORDER BY p.project_ID, e.employee_ID;
```

```output
+------------+------------------+-------------+-----------------+------------+
| PROJECT_ID | PROJECT_NAME     | EMPLOYEE_ID | EMPLOYEE_NAME   | PROJECT_ID |
|------------+------------------+-------------+-----------------+------------|
|       1000 | COVID-19 Vaccine |    10000001 | Terry Smith     |       1000 |
|       1000 | COVID-19 Vaccine |    10000002 | Maria Inverness |       1000 |
|       1001 | Malaria Vaccine  |    10000003 | Pat Wang        |       1001 |
+------------+------------------+-------------+-----------------+------------+
```

Although a single join operation can join only two tables, joins can be chained together. The result of a join is
a table-like object, and that table-like object can then be joined to another table-like object. Conceptually,
the idea is similar to the following; this isn’t the actual syntax:

```sqlexample
table1 JOIN (table2 JOIN table3)
```

In this pseudo-code, `table2` and `table3` are joined first. The table that results from that join is then joined with
`table1`.

Joins can be applied not only to tables, but also to other table-like objects. You can join:

* A table.
* A [view](views-introduction.md) (materialized or non-materialized).
* A [table literal](../sql-reference/literals-table.md).
* An expression that evaluates to the equivalent of a table (containing one or more columns and zero or more
  rows). For example:

  + The result set returned by a [table function](../sql-reference/functions-table.md).
  + The result set returned by a subquery that returns a table.

When this topic refers to joining a table, it generally means joining any table-like object.

> **Note:**
>
> Snowflake can improve performance by eliminating unnecessary joins. For more information, see
> [Understanding How Snowflake Can Eliminate Redundant Joins](join-elimination.md).

## Types of joins

Snowflake supports the following types of joins:

* Inner join
* Outer join
* Cross join
* Natural join

> **Note:**
>
> Snowflake also supports ASOF JOIN for analyzing time-series data. For more information,
> see [ASOF JOIN](../sql-reference/constructs/asof-join.md) and [Analyzing time-series data](querying-time-series-data.md).

### Inner join

An inner join pairs each row in one table with the matching rows in the other table.

The following example shows an inner join:

```sqlexample
SELECT p.project_ID, project_name, employee_ID, employee_name, e.project_ID
  FROM projects AS p INNER JOIN employees AS e
    ON e.project_id = p.project_id
  ORDER BY p.project_ID, e.employee_ID;
```

```output
+------------+------------------+-------------+-----------------+------------+
| PROJECT_ID | PROJECT_NAME     | EMPLOYEE_ID | EMPLOYEE_NAME   | PROJECT_ID |
|------------+------------------+-------------+-----------------+------------|
|       1000 | COVID-19 Vaccine |    10000001 | Terry Smith     |       1000 |
|       1000 | COVID-19 Vaccine |    10000002 | Maria Inverness |       1000 |
|       1001 | Malaria Vaccine  |    10000003 | Pat Wang        |       1001 |
+------------+------------------+-------------+-----------------+------------+
```

In this example, the output contains two columns named `PROJECT_ID`. One `PROJECT_ID` column is from
the `projects` table, and one is from the `employees` table. For each row in the output, the values
in the two `PROJECT_ID` columns match because the query specified `e.project_id = p.project_id`.

The output includes only valid pairs; that is, rows that match the join condition. In this example, there is
no row for the project named `NewProject`, which has no employees assigned yet, or the employee named
`NewEmployee`, who hasn’t been assigned to any projects yet.

### Outer join

An outer join lists all rows in the specified table, even if those rows have no match in the other table. For
example, a left outer join between projects and employees lists all projects, including projects that don’t
yet have any employee assigned.

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p LEFT OUTER JOIN employees AS e
    ON e.project_ID = p.project_ID
  ORDER BY p.project_name, e.employee_name;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Maria Inverness |
| COVID-19 Vaccine | Terry Smith     |
| Malaria Vaccine  | Pat Wang        |
| NewProject       | NULL            |
+------------------+-----------------+
```

The project named `NewProject` is included in this output, even though there is no matching row in the
`employees` table. Because there are no matching employee names for the project named `NewProject`, the
employee name is NULL.

A right outer join lists all employees (regardless of project).

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p RIGHT OUTER JOIN employees AS e
    ON e.project_ID = p.project_ID
  ORDER BY p.project_name, e.employee_name;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Maria Inverness |
| COVID-19 Vaccine | Terry Smith     |
| Malaria Vaccine  | Pat Wang        |
| NULL             | NewEmployee     |
+------------------+-----------------+
```

A full outer join lists all projects and all employees.

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p FULL OUTER JOIN employees AS e
    ON e.project_ID = p.project_ID
  ORDER BY p.project_name, e.employee_name;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Maria Inverness |
| COVID-19 Vaccine | Terry Smith     |
| Malaria Vaccine  | Pat Wang        |
| NewProject       | NULL            |
| NULL             | NewEmployee     |
+------------------+-----------------+
```

### Cross join

A cross join combines each row in the first table with each row in the second table, creating every possible
combination of rows, which is called a *Cartesian product*. Because most of the result rows contain parts of
rows that aren’t actually related, a cross join is rarely useful by itself. In fact, cross joins are usually
the result of accidentally omitting the join condition.

The result of a cross join can be very large and expensive. If the first table has N rows and the second table
has M rows, then the result is N x M rows. For example, if the first table has 100 rows and the second table
has 1000 rows, then the result set contains 100,000 rows.

The following query shows a cross join:

> **Note:**
>
> This query contains no `ON` clause and no filter.

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p CROSS JOIN employees AS e
  ORDER BY p.project_ID, e.employee_ID;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Terry Smith     |
| COVID-19 Vaccine | Maria Inverness |
| COVID-19 Vaccine | Pat Wang        |
| COVID-19 Vaccine | NewEmployee     |
| Malaria Vaccine  | Terry Smith     |
| Malaria Vaccine  | Maria Inverness |
| Malaria Vaccine  | Pat Wang        |
| Malaria Vaccine  | NewEmployee     |
| NewProject       | Terry Smith     |
| NewProject       | Maria Inverness |
| NewProject       | Pat Wang        |
| NewProject       | NewEmployee     |
+------------------+-----------------+
```

You can make the output of a cross join more useful by applying a filter in the `WHERE` clause:

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p CROSS JOIN employees AS e
  WHERE e.project_ID = p.project_ID
  ORDER BY p.project_ID, e.employee_ID;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Terry Smith     |
| COVID-19 Vaccine | Maria Inverness |
| Malaria Vaccine  | Pat Wang        |
+------------------+-----------------+
```

The result of this cross join and filter is the same as the result of the following inner join:

```sqlexample
SELECT p.project_name, e.employee_name
  FROM projects AS p INNER JOIN employees AS e
    ON e.project_ID = p.project_ID
  ORDER BY p.project_ID, e.employee_ID;
```

```output
+------------------+-----------------+
| PROJECT_NAME     | EMPLOYEE_NAME   |
|------------------+-----------------|
| COVID-19 Vaccine | Terry Smith     |
| COVID-19 Vaccine | Maria Inverness |
| Malaria Vaccine  | Pat Wang        |
+------------------+-----------------+
```

> **Important:**
>
> Although the two queries in this example produce the same output when they use the same condition
> (`e.project_id = p.project_id`) in different clauses (`WHERE` and `FROM ... ON ...`), it is possible to
> construct pairs of queries that use the same condition but that don’t produce the same output.
>
> The most common examples involve outer joins. If you run `table1 LEFT OUTER JOIN table2`, then for rows in
> `table1` that have no match, the columns that would have come from `table2` contain NULL. A filter
> such as `WHERE table2.ID = table1.ID` filters out rows in which either `table2.id` or `table1.id` contains a
> NULL, while an explicit outer join in the `FROM ... ON ...` clause doesn’t filter out rows with NULL values.
> In other words, an outer join with a filter might not act like an outer join.

### Natural join

A natural join joins two tables on columns that have the same names and compatible data types. Both the
`employees` and the `projects` table created previously, have a column named `project_ID`. A natural
join implicitly constructs the `ON` clause: `ON projects.project_ID = employees.project_ID`.

If two tables have multiple columns in common, then a natural join uses all of the common columns in the constructed
`ON` clause. For example, if two tables each have columns named `city` and `province`, then a natural join
constructs the following `ON` clause:

```sqlexample
ON table2.city = table1.city AND table2.province = table1.province
```

The output of a natural join includes only one copy of each of the shared columns. For example, the following query
produces a natural join that contains all of columns in the two tables, except that it omits all but one copy of the
redundant `project_id` columns:

```sqlexample
SELECT *
  FROM projects NATURAL JOIN employees
  ORDER BY employee_id;
```

```output
+------------+------------------+-------------+-----------------+
| PROJECT_ID | PROJECT_NAME     | EMPLOYEE_ID | EMPLOYEE_NAME   |
|------------+------------------+-------------+-----------------|
|       1000 | COVID-19 Vaccine |    10000001 | Terry Smith     |
|       1000 | COVID-19 Vaccine |    10000002 | Maria Inverness |
|       1001 | Malaria Vaccine  |    10000003 | Pat Wang        |
+------------+------------------+-------------+-----------------+
```

You can combine a natural join with an outer join.

You can’t combine a natural join `ON` clause because the join condition is already implied. However, you
can use a `WHERE` clause to filter the results of a natural join.

## Implementing joins

Syntactically, there are two ways to join tables:

* Use the [JOIN](../sql-reference/constructs/join.md) subclause in the `ON` subclause of the
  [FROM](../sql-reference/constructs/from.md) clause.
* Use the [WHERE](../sql-reference/constructs/where.md) clause with the [FROM](../sql-reference/constructs/from.md) clause.

Snowflake recommends using the `ON` subclause in the `FROM` clause because the syntax is more flexible.
Also, specifying the predicate in the `ON` subclause avoids the problem of accidentally filtering rows
with NULL values when using a `WHERE` clause to specify the join condition for an outer join.

In addition, you can use the `DIRECTED` keyword to enforce the join order of the tables. When you
specify this keyword, the first, or left, table is scanned before the second, or right, table. For example,
`o1 INNER DIRECTED JOIN o2` scans the `o1` table before the `o2` table. If the
`DIRECTED` keyword is added, the join type — for example, `INNER` or `OUTER` — is required.
For more information, see [JOIN](../sql-reference/constructs/join.md).

---
title: Working with Materialized Views
source: https://docs.snowflake.com/en/user-guide/views-materialized.md
section: User Guide
---

# Working with Materialized Views

A materialized view is a pre-computed data set derived from a query specification (the SELECT in the view definition) and stored for later use.
Because the data is pre-computed, querying a materialized view is faster than executing a query against the base table of the view.
This performance difference can be significant when a query is run frequently or is sufficiently complex. As a result,
materialized views can speed up expensive aggregation, projection, and selection operations, especially those that run frequently
and that run on large data sets.

> **Note:**
>
> Materialized views are designed to improve query performance for workloads composed of common, repeated query patterns. However, materializing
> intermediate results incurs
> additional costs. As such, before creating any materialized views, you
> should consider whether the costs are
> offset by the savings from re-using these results frequently enough.

## Deciding When to Create a Materialized View

Materialized views are particularly useful when:

* Query results contain a small number of rows and/or columns relative to the base table (the table on
  which the view is defined).
* Query results contain results that require significant processing, including:

  + Analysis of semi-structured data.
  + Aggregates that take a long time to calculate.
* The query is on an [external table](tables-external-intro.md), which might have slower
  performance compared to querying native database tables or [Apache Iceberg™ tables](tables-iceberg.md).
* The view’s base table does not change frequently.

### Advantages of Materialized Views

Snowflake’s implementation of materialized views provides a number of unique characteristics:

* Materialized views can improve the performance of queries that use the same subquery results repeatedly.
* Materialized views are automatically and transparently maintained by Snowflake. A background service updates the materialized view
  after changes are made to the base table. This is more efficient and less error-prone than manually maintaining the equivalent of a
  materialized view at the application level.
* Data accessed through materialized views is always current, regardless of the amount of DML that has been performed on the base table.
  If a query is run before the materialized view is up-to-date, Snowflake either updates the materialized view or uses the up-to-date
  portions of the materialized view and retrieves any required newer data from the base table.

> **Important:**
>
> The automatic maintenance of materialized views consumes credits. For more details, see
> Materialized Views Cost (in this topic).

### Deciding When to Create a Materialized View or a Regular View

In general, when deciding whether to create a materialized view or a regular view, use the following criteria:

* Create a materialized view when all of the following are true:

  + The query results from the view don’t change often. This almost always means that the underlying/base table
    for the view doesn’t change often, or at least that the subset of base
    table rows used in the materialized view don’t change often.
  + The results of the view are used often (typically significantly more often than the query results change).
  + The query consumes a lot of resources. Typically, this means that the query consumes a lot of processing
    time or credits, but it could also mean that the query consumes a lot of storage space for intermediate results.
* Create a regular view when any of the following are true:

  + The results of the view change often.
  + The results are not used often (relative to the rate at which the results change).
  + The query is not resource intensive so it is not costly to re-run it.

These criteria are just guidelines. A materialized view might provide benefits even if it is not used often — especially if the results change less frequently than the usage of the view.

Also, there are other factors to consider when deciding whether to use a regular view or a materialized view.

For example, the cost of storing the materialized view is a factor; if the results are not used very often (even
if they are used more often than they change), then the additional storage costs might not be worth the performance gain.

### Comparison with Tables, Regular Views, and Cached Results

Materialized views are similar to tables in some ways and similar to regular (i.e. non-materialized) views in other ways.
In addition, materialized views have some similarities with cached results, particularly because both enable storing
query results for future re-use.

This section describes some of the similarities and differences between these objects in specific areas, including:

* Query performance.
* Query security.
* Reduced query logic complexity.
* Data clustering (related to query performance).
* Storage and maintenance costs.

Snowflake caches query results for a short period of time after a query has been run. In some situations,
if the same query is re-run and if nothing has changed in the table(s) that the query accesses, then
Snowflake can simply return the same results without re-running the query. This is the fastest and most
efficient form of re-use, but also the least flexible. For more details, see
[Using Persisted Query Results](querying-persisted-results.md).

Both materialized views and cached query results provide query performance benefits:

* Materialized views are more flexible than, but typically slower than, cached results.
* Materialized views are faster than tables because of their “cache” (i.e. the query results for the view); in addition,
  if data has changed, they can use their “cache” for data that hasn’t changed and use the base table for any data that has changed.

Regular views do not cache data, and therefore cannot improve performance by caching. However, in some
cases, views help Snowflake generate a more efficient query plan. Also, both materialized views and
regular views enhance data security by allowing data to be exposed or hidden at the row level or column level.

The following table shows key similarities and differences between tables, regular views, cached query results, and materialized views:

|  | Performance Benefits | Security Benefits | Simplifies Query Logic | Supports Clustering | Uses Storage | Uses Credits for Maintenance | Notes |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Regular table |  |  |  | ✔ | ✔ |  |  |
| Regular view |  | ✔ | ✔ |  |  |  |  |
| Cached query result | ✔ |  |  |  |  |  | Used only if data has not changed and if query only uses deterministic functions (e.g. not CURRENT_DATE). |
| Materialized view | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | Storage and maintenance requirements typically result in increased costs. |
| External table |  |  |  |  |  |  | Data is maintained outside Snowflake and, therefore, does not incur any storage charges within Snowflake. |

### Examples of Use Cases For Materialized Views

This section describes some general usage scenarios that also provide a conceptual overview of materialized views:

* Suppose that, every day, you run a query `Q` that includes a subquery `S`. If `S` is resource-intensive and queries data that
  changes only once a week, then you could improve performance of the outer query `Q` by running `S` and caching the results in a table named `CT`:

  + You would update the table only once a week.
  + The rest of the time, when you run `Q`, it would reference the subquery results of `S` that were stored in the table.

  This would work well as long as the results of subquery `S` change predictably (e.g. at the same time every week).

  However, if the results of `S` change unpredictably then caching the results in a table is risky; sometimes your
  main query `Q` will return out-of-date results if the results of subquery `S` are out of date (and thus the results of cached
  table `CT` are out of date).

  Ideally, you’d like a special type of cache for results that change rarely, but for which the timing of the change is unpredictable. Looking
  at it another way, you’d like to force your subquery `S` to be re-run (and your cache table `CT` to be updated) when necessary.

  A materialized view implements an approximation of the best of both worlds. You define a query for your materialized view, and the results
  of the query are cached (as though they were stored in an internal table), but Snowflake updates the cache when the table that the materialized
  view is defined on is updated. Thus, your subquery results are readily available for fast performance.
* As a less abstract example, suppose that you run a small branch of a large pharmacy, and your branch stocks hundreds of medications out of a
  total of tens of thousands of FDA-approved medications.

  Suppose also that you have a complete list of all medications that each of your customers takes, and that almost all of those customers order
  only medicines that are in stock (i.e. special orders are rare).

  In this scenario, you could create a materialized view that lists only the interactions among medicines that you keep in stock. When a customer
  orders a medicine that she has never used before, if both that medicine and all of the other medicines that she takes are covered by your
  materialized view, then you don’t need to check the entire FDA database for drug interactions; you can just check the materialized view, so
  the search is faster.
* You can use a materialized view by itself, or you can use it in a join.

  Continuing with the pharmacy example, suppose that you have one table that lists all of the medicines that each of your customers takes; you can
  join that table to the materialized view of drug interactions to find out which of the customer’s current medications might interact with the
  new medication.

  You might use an outer join to make sure that you list all of the customer’s medicines, whether or not they are in your materialized view;
  if the outer join shows that any of the current medicines are not in the materialized view, you can re-run the query on the
  full drug interactions table.

### How the Query Optimizer Uses Materialized Views

You don’t need to specify a materialized view in a SQL statement in order for the view to be used. The query optimizer can
automatically rewrite queries against the base table or regular views to use the materialized view instead.

For example, suppose that a materialized view contains all of the rows and columns that are needed by a query against a base
table. The optimizer can decide to rewrite the query to use the materialized view, rather than the base table. This can
dramatically speed up a query, especially if the base table contains a large amount of historical data.

As another example, in a multi-table join, the optimizer might decide to use a materialized view instead of a table for one of the
tables in the join.

> **Note:**
>
> Even if a materialized view can replace the base table in a particular query, the optimizer might not use the materialized view.
> For example, if the base table is clustered by a field, the optimizer might choose to scan the base table (rather than the
> materialized view) because the optimizer can effectively prune out partitions and provide equivalent performance using the
> base table.

A materialized view can also be used as the data source for a subquery.

When the optimizer chooses to use a materialized view implicitly, the materialized view is listed in the EXPLAIN plan or the
Query Profile instead of the base table. You can use this information to experiment and understand which queries can benefit from
existing materialized views.

## About Materialized Views in Snowflake

The next sections explain how materialized views are represented in Snowflake.

### DDL Commands For Materialized Views

Materialized views are first-class database objects. Snowflake provides the following DDL commands for creating and maintaining materialized
views:

* [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md)
* [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md)
* [DROP MATERIALIZED VIEW](../sql-reference/sql/drop-materialized-view.md)
* [DESCRIBE MATERIALIZED VIEW](../sql-reference/sql/desc-materialized-view.md)
* [SHOW MATERIALIZED VIEWS](../sql-reference/sql/show-materialized-views.md)

### DML Operations on Materialized Views

Snowflake does not allow standard DML (e.g. INSERT, UPDATE, DELETE) on materialized views.
Snowflake does not allow users to truncate materialized views.

See Limitations on Working With Materialized Views (in this topic) for details.

### Access Control Privileges

There are three types of privileges that are related to materialized views:

* Privileges on the schema that contains the materialized view.
* Privileges directly on the materialized view itself.
* Privileges on the database objects (e.g. tables) that the materialized view accesses.

You can use the standard commands for granting and revoking privileges on materialized views:

* [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](../sql-reference/sql/revoke-privilege.md)

#### Privileges on a Materialized View’s Schema

Materialized views consume storage space. To create a materialized view, you need the CREATE MATERIALIZED VIEW
privilege on the schema that will contain the materialized view. You need to execute a statement similar to:

```sqlexample
GRANT CREATE MATERIALIZED VIEW ON SCHEMA <schema_name> TO ROLE <role_name>;
```

For more details about the GRANT statement, see [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md).

#### Privileges on a Materialized View

Materialized Views, like other database objects (tables, views, UDFs, etc.), are owned by a role and have privileges that can be granted
to other roles.

You can grant the following privileges on a materialized view:

* SELECT

As with non-materialized views, a materialized view does not automatically inherit the privileges of its base table.
You should explicitly grant privileges on the materialized view to the roles that should use that view.

> **Note:**
>
> The exception to this rule is when the query optimizer rewrites a query against the base table to use the materialized view
> (as explained in How the Query Optimizer Uses Materialized Views). In this case, the user does not need privileges to use the
> materialized view in order to access the results of the query.

#### Privileges on the Database Objects Accessed by the Materialized View

As with non-materialized views, a user who wishes to access a materialized view needs privileges only on the view, not on the underlying object(s)
that the view references.

### Secure Materialized Views

Materialized views can be secure views.

Most information about secure views applies to secure materialized views. There are a few cases where secure
materialized views are different from secure non-materialized views. The differences include:

* The command to find out whether a view is secure.

  + For non-materialized views, check the `IS_SECURE` column in the output of the `SHOW VIEWS` command.
  + For materialized views, check the `IS_SECURE` column in the output of the `SHOW MATERIALIZED VIEWS` command.

For more information about secure views, see [Working with Secure Views](views-secure.md).

The syntax to create secure materialized views is documented at
[CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md).

## Creating and Working With Materialized Views

This section provides information about creating and working with materialized views.

### Planning to Create a Materialized View

When deciding to create a materialized view, consider doing some analysis to determine the need for the view:

1. Examine the filters, projections, and aggregations of queries that are frequent or expensive.
2. Use the Query Profile and the EXPLAIN command to see whether existing materialized views are already
   being used by the automatic query rewrite feature. You might find that you do not need to create any new
   materialized views if there are existing views that fit the queries well.
3. Before adding any materialized views, record current query costs and performance so that you can
   evaluate the difference after creating the new materialized view.
4. If you find queries with very selective filters that do not benefit from clustering the table, then a materialized
   view containing the same filters can help the queries avoid scanning a lot of data.

   Similarly, if you find queries that use aggregation, or that contain expressions that are very expensive to
   evaluate (for example, expensive function calls, or expensive operations on semi-structured data), then
   a materialized views that uses the same expression(s) or aggregation(s) can provide a benefit.
5. Run the EXPLAIN command against the original queries, or run the queries and check the Query Profile, to see
   whether the new materialized view is being used.
6. Monitor the combined query and materialized view costs, and
   evaluate whether the performance or cost benefits justify the cost of the materialized view’s maintenance.

   Examine the query costs of the base table as well. In cases where the optimizer can rewrite the query to use a materialized
   view, query compilation can consume more time and resources. (The optimizer has a larger number of possibilities to consider.)
7. Remember that you can always reference materialized views directly if it simplifies your queries or you know that a
   materialized view will give you better performance. However, in most cases, you can simply query the base table and
   the automatic query rewrite feature will do that for you.

### Creating a Materialized View

Use the [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md) command to create a materialized view. For an example, see
Basic Example: Creating a Materialized View (in this topic).

> **Note:**
>
> The CREATE MATERIALIZED VIEW statement might take a substantial amount of time to complete.
>
> When a materialized view is first created, Snowflake performs the equivalent of a CTAS (CREATE TABLE … AS ….) operation.

When you create a materialized view, note the following:

* Handling Column Names That Are Not Allowed in Materialized Views
* Referring to the Base Table
* Specifying Filters for Query Optimization

#### Handling Column Names That Are Not Allowed in Materialized Views

The following column names are not allowed in a materialized view:

* Names that start with `SYSTEM$` or `METADATA$`
* Names that contain `$SYS_FACADE$`
* The column name `SYS_MV_SOURCE_PARTITION`

If you are defining a materialized view that selects a column with one of these names, you can define an alias for that column.
For example:

```sqlexample
CREATE OR REPLACE MATERIALIZED VIEW mv AS
  SELECT SYSTEM$ALPHA AS col1, ...
```

#### Referring to the Base Table

Whenever possible, use the fully-qualified name for the base table referenced in a materialized view. This insulates the view
from changes that can invalidate the view, such as moving the base table to a different schema from the view (or vice versa).

If the name of the base table is not qualified, and the table or view is moved to a different schema, the reference becomes
invalid.

In addition, if you are referring to the base table more than once in the view definition, use the same qualifier in all
references to the base table. For example, if you choose to use the fully-qualified name, make sure that all references to the
base table use the fully-qualified name.

#### Specifying Filters for Query Optimization

If you specify a filter when creating a materialized view (e.g. `WHERE column_1 BETWEEN Y and Z`), the optimizer can use
the materialized view for queries against the base table that have the same filter or a more restrictive filter. Here are some
examples:

* Here’s a simple example of range subsumption.

  In this example, the filter in the query does not match the filter in the materialized view. However, the filter in the query
  selects only rows that are in the materialized view, so the optimizer can choose to scan the materialized view rather than the
  entire table.

  ```sqlexample
  -- Example of a materialized view with a range filter
  create materialized view v1 as
    select * from table1 where column_1 between 100 and 400;
  ```

  ```sqlexample
  -- Example of a query that might be rewritten to use the materialized view
  select * from table1 where column_1 between 200 and 300;
  ```
* This example shows OR subsumption. The materialized view contains all the rows that the subsequent query needs.

  Define a materialized view that contains all rows that have either value X or value Y:

  ```sqlexample
  create materialized view mv1 as
    select * from tab1 where column_1 = X or column_1 = Y;
  ```

  Define a query that looks only for value Y (which is included in the materialized view):

  > ```sqlexample
  > select * from tab1 where column_1 = Y;
  > ```

  The query above can be rewritten internally as:

  > ```sqlexample
  > select * from mv1 where column_1 = Y;
  > ```
* This example is another example of OR subsumption. There’s no explicit OR in the materialized view definition. However, an IN
  clause is equivalent to a series of OR expressions, so the optimizer can re-write this query the same way as it re-wrote the
  OR subsumption example above:

  ```sqlexample
  create materialized view mv1 as
    select * from tab1 where column_1 in (X, Y);
  ```

> Define a query that looks only for value Y (which is included in the materialized view):
>
> ```sqlexample
> select * from tab1 where column_1 = Y;
> ```
>
> The query above can be rewritten internally as:
>
> > ```sqlexample
> > select * from mv1 where column_1 = Y;
> > ```

* This example uses AND subsumption:

  Create a materialized view that contains all rows where `column_1 = X`.

  ```sqlexample
  create materialized view mv2 as
    select * from table1 where column_1 = X;
  ```

  Create a query:

  ```sqlexample
  select column_1, column_2 from table1 where column_1 = X AND column_2 = Y;
  ```

  The query can be rewritten as:

  ```sqlexample
  select * from mv2 where column_2 = Y;
  ```

  The rewritten query does not even need to include the expression `column_1 = X` because the materialized view’s
  definition already requires that all rows match `column_1 = X`.
* The following example shows aggregate subsumption:

  The materialized view is defined below:

  ```sqlexample
  create materialized view mv4 as
    select column_1, column_2, sum(column_3) from table1 group by column_1, column_2;
  ```

  The following query can use the materialized view defined above:

  ```sqlexample
  select column_1, sum(column_3) from table1 group by column_1;
  ```

  The query can be rewritten as:

  ```sqlexample
  select column_1, sum(column_3) from mv4 group by column_1;
  ```

  The rewritten query does not take advantage of the additional grouping by column_2, but the rewritten query is not blocked by
  that additional grouping, either.

### Limitations on Creating Materialized Views

> **Note:**
>
> These are current limitations; some of them might be removed or changed in future versions.

The following limitations apply to creating materialized views:

* A materialized view can query only a single table.
* Joins, including self-joins, are not supported.
* A materialized view cannot query:

  + A materialized view.
  + A non-materialized view.
  + A hybrid table.
  + A dynamic table.
  + A UDTF (user-defined table function).
* A materialized view cannot include:

  + UDFs (this limitation applies to all types of user-defined functions, including external functions).
  + Window functions.
  + HAVING clauses.
  + ORDER BY clause.
  + LIMIT clause.
  + GROUP BY keys that are not within the SELECT list. All GROUP BY keys in a materialized view must be part of the SELECT list.
  + GROUP BY GROUPING SETS.
  + GROUP BY ROLLUP.
  + GROUP BY CUBE.
  + Nesting of subqueries within a materialized view.
  + The MINUS, EXCEPT, or INTERSECT [set operators](../sql-reference/operators-query.md).
* Many aggregate functions are not allowed in a materialized view definition.

  + The aggregate functions that are supported in materialized views are:

    - [APPROX_COUNT_DISTINCT (HLL)](../sql-reference/functions/approx_count_distinct.md).
    - [AVG](../sql-reference/functions/avg.md) (except when used in [PIVOT](../sql-reference/constructs/pivot.md)).
    - [BITAND_AGG](../sql-reference/functions/bitand_agg.md).
    - [BITOR_AGG](../sql-reference/functions/bitor_agg.md).
    - [BITXOR_AGG](../sql-reference/functions/bitxor_agg.md).
    - [COUNT](../sql-reference/functions/count.md).
    - [COUNT_IF](../sql-reference/functions/count_if.md).
    - [MAX](../sql-reference/functions/max.md).
    - [MIN](../sql-reference/functions/min.md).
    - [STDDEV, STDDEV_SAMP](../sql-reference/functions/stddev.md).
    - [STDDEV_POP](../sql-reference/functions/stddev_pop.md).
    - [SUM](../sql-reference/functions/sum.md).
    - [VARIANCE (VARIANCE_SAMP, VAR_SAMP)](../sql-reference/functions/variance.md).
    - [VARIANCE_POP (VAR_POP)](../sql-reference/functions/variance_pop.md).

    The other aggregate functions are not supported in materialized views.
  > **Note:**
  >
  > Aggregate functions that are allowed in materialized views still have some restrictions:
  >
  > + Aggregate functions cannot be nested. You cannot use an aggregate function in a subquery.
  >
  >   For example, the following is allowed:
  >
  >   ```sqlexample
  >   CREATE MATERIALIZED VIEW mv AS
  >     SELECT SUM(c1) AS sum_c1, c2 FROM t GROUP BY c2;
  >   ```
  >
  >   The following is not allowed:
  >
  >   ```sqlexample
  >   CREATE MATERIALIZED VIEW mv AS
  >     SELECT 100 * sum_c1 AS sigma FROM (
  >       SELECT SUM(c1) AS sum_c1, c2 FROM t GROUP BY c2;
  >     ) WHERE sum_c1 > 0;
  >   ```
  >
  >   If you need to nest an aggregate function, create a materialized view without the nested aggregation, and then create a view
  >   on top of that materialized view:
  >
  >   ```sqlexample
  >   CREATE MATERIALIZED VIEW mv AS
  >     SELECT SUM(c1) AS sum_c1, c2 FROM t GROUP BY c2;
  >
  >   CREATE VIEW view_on_mv AS
  >     SELECT 100 * sum_c1 AS sigma FROM mv WHERE sum_c1 > 0;
  >   ```
  > + DISTINCT cannot be combined with aggregate functions.
  > + In a materialized view, the aggregate functions AVG, COUNT, COUNT_IF, MIN, MAX, and SUM can be used as aggregate
  >   functions but not as window functions. In a materialized view, these functions cannot be used with the `OVER`
  >   clause:
  >
  >   > ```sqlexample
  >   > OVER ( [ PARTITION BY <expr1> ] [ ORDER BY <expr2> ] )
  >   > ```
* Functions used in a materialized view must be deterministic. For example, using [CURRENT_TIME](../sql-reference/functions/current_time.md) or
  [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md) is not permitted.
* A materialized view should not be defined using a function that produces different results for different settings
  of parameters, such as the session-level parameter TIMESTAMP_TYPE_MAPPING.

  For example, suppose that a view is defined as follows:

  ```sqlexample
  create materialized view bad_example (ts1) as
      select to_timestamp(n) from t1;
  ```

  The data type of the return value from `TO_TIMESTAMP(n)` depends upon the parameter TIMESTAMP_TYPE_MAPPING, so
  the contents of the materialized view depend upon the value of TIMESTAMP_TYPE_MAPPING at the time that the view was
  created.

  When a materialized view is created, the expression defining each of its columns is evaluated and stored. If a
  column definition depends upon a particular session variable, and the session variable changes, the expression is
  not re-evaluated, and the materialized view is not updated. If the materialized view depends upon a particular value
  of a session variable, and if the session variable’s value has changed, then queries on the materialized view fail.

  To avoid this problem, force the expression to a value that does not depend upon any session variables. The
  example below casts the output to a particular data type, independent of the TIMESTAMP_TYPE_MAPPING parameter:

  ```sqlexample
  create materialized view good_example (ts1) as
      select to_timestamp(n)::TIMESTAMP_NTZ from t1;
  ```

  This issue is specific to materialized views. Non-materialized views generate their output
  dynamically based on current parameter settings, so the results can’t be stale.
* In the definition of a materialized view, selecting the SEQ column from the output of the
  [FLATTEN](../sql-reference/functions/flatten.md) function is not supported.

  The values in the SEQ column are not guaranteed to be ordered in any way when selected from a materialized view. If you
  select this column in the materialized view definition, the output may be indeterministic.
* Materialized views cannot be created using the [Time Travel feature](data-time-travel.md).

### Basic Example: Creating a Materialized View

This section contains a basic example of creating and using a materialized view:

> ```sqlexample
> CREATE OR REPLACE MATERIALIZED VIEW mv1 AS
>   SELECT My_ResourceIntensive_Function(binary_col) FROM table1;
>
> SELECT * FROM mv1;
> ```

More detailed examples are provided in Examples (in this topic).

### Understanding How Materialized Views Are Maintained

After you create a materialized view, a background process automatically maintains the data in the materialized view. Note the
following:

* Maintenance of materialized views is performed by a background process, and the timing is optimally based on the workload on the
  base table and the materialized view.

  + This process updates the materialized view with changes made by DML operations to the base table (insertions, updates, and
    deletions).

    In addition, clustering on the base table can also result in refreshes of a materialized view. Refer to
    Best Practices for Clustering Materialized Views and their Base Tables.
  + When rows are inserted in the base table, the process performs a “refresh” operation to insert the new rows into the
    materialized view.
  + When rows are deleted in the base table, the process performs a “compaction” operation on the materialized view, deleting
    these rows from the materialized view.
* To see the last time that a materialized view was refreshed, execute the
  [SHOW MATERIALIZED VIEWS](../sql-reference/sql/show-materialized-views.md) command.

  Check the REFRESHED_ON and BEHIND_BY columns in the output:

  + The REFRESHED_ON and COMPACTED_ON columns show the timestamp of the last DML operation on the base table that was processed
    by the refresh and compaction operations, respectively.
  + The BEHIND_BY column indicates the amount of time that the updates to the materialized view are behind the updates to the base
    table.
* If maintenance falls behind, queries might run more slowly than when the views are up-to-date, but the results will always be
  up-to-date.

  If some micro-partitions of the materialized view are out of date, Snowflake skips those partitions and looks up the data from
  the base table.
* If the background process encounters certain user errors (for example, the query for the view results in a “division by zero”
  error), the process invalidates the materialized view.

  Querying an invalid materialized view results in an error. The error message includes the reason why the materialized view was
  invalidated. For example:

  ```output
  Failure during expansion of view 'MY_MV':
    SQL compilation error: Materialized View MY_MV is invalid.
    Invalidation reason: Division by zero
  ```

  If this occurs, address the problem described in the error message (for example, delete the rows that introduce the “divide by
  zero” error), and resume the materialized view by using the
  [ALTER MATERIALIZED VIEW … RESUME](../sql-reference/sql/alter-materialized-view.md) command.

### Suspending and Resuming Maintenance on a Materialized View

If you need to suspend the maintenance and use of a materialized view, execute the
[ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md) command with the SUSPEND parameter:

```sqlsyntax
ALTER MATERIALIZED VIEW <name> SUSPEND
```

If you suspend maintenance of a view, you cannot query the view until you resume maintenance.

To resume the maintenance and use of a materialized view, execute the ALTER MATERIALIZED VIEW command with the RESUME parameter:

```sqlsyntax
ALTER MATERIALIZED VIEW <name> RESUME
```

For an example, see Suspending Updates to a Materialized View.

### Displaying Information About Materialized Views

The following command and view provide information about materialized views:

* The [SHOW VIEWS](../sql-reference/sql/show-views.md) command returns information about both materialized and regular views.
* The [INFORMATION_SCHEMA.TABLES view](../sql-reference/info-schema/tables.md) shows materialized views. The `TABLE_TYPE`
  column shows “MATERIALIZED VIEW”. The `IS_INSERTABLE` column is always “NO”, because you cannot insert directly into a
  materialized view.

  > **Note:**
  >
  > The [INFORMATION_SCHEMA.VIEWS view](../sql-reference/info-schema/views.md) does not show materialized views. Materialized
  > views are shown by INFORMATION_SCHEMA.TABLES.

### Limitations on Working With Materialized Views

> **Note:**
>
> These are current limitations; some of them might be removed or changed in future versions.

The following limitations apply to using materialized views:

* To ensure that materialized views stay consistent with the base table on
  which they are defined, you cannot perform most DML operations on a
  materialized view itself. For example, you cannot insert rows directly
  into a materialized view (although of course you can insert rows into
  the base table). The prohibited DML operations include:

  + COPY
  + DELETE
  + INSERT
  + MERGE
  + UPDATE

  Truncating a materialized view is not supported.
* You cannot directly clone a materialized view by using the `CREATE MATERIALIZED VIEW ... CLONE...` command.
  However, if you clone a schema or a database that contains a materialized view, the materialized view will be cloned
  and included in the new schema or database.
* Snowflake does not support using the [Time Travel feature](data-time-travel.md) to query materialized views at
  [a point in the past](data-time-travel.md) (e.g. using the [AT clause](../sql-reference/constructs/at-before.md)
  when querying a materialized view).

  However, you can use Time Travel to
  [clone a database or schema containing a materialized view at a point in the past](data-time-travel.md). For
  details, see Materialized Views and Time Travel.
* Materialized Views are not monitored by Snowflake [Working with resource monitors](resource-monitors.md).

### Effects of Changes to Base Tables on Materialized Views

The following sections explain how materialized views are affected by changes to the base tables.

* Adding Columns to the Base Table
* Changing or Dropping Columns in the Base Table
* Renaming or Swapping the Base Table
* Dropping the Base Table

#### Adding Columns to the Base Table

If columns are added to the base table, those new columns are not propagated to the materialized view automatically.

This is true even if the materialized view was defined with `SELECT *` (e.g.
`CREATE MATERIALIZED VIEW AS SELECT * FROM table2 ...`). The columns of the materialized view are defined at the time that
the materialized view is defined. The `SELECT *` is not interpreted dynamically each time that the materialized view is
queried.

To avoid confusion, Snowflake recommends not using `SELECT *` in the definition of a materialized view.

> **Note:**
>
> Adding a column to the base table does not suspend a materialized view created on that base table.

#### Changing or Dropping Columns in the Base Table

If a base table is altered so that existing columns are changed or dropped, then all materialized views on that base table are
suspended; the materialized views cannot be used or maintained. (This is true even if the modified or dropped column was not part
of the materialized view.)

You cannot RESUME that materialized view. If you want to use it again, you must recreate it.

The simplest way to recreate a materialized view with the same privileges on the view is by running the command:

> ```sqlexample
> CREATE OR REPLACE MATERIALIZED VIEW <view_name> ... COPY GRANTS ...
> ```

This is more efficient than running separate commands to:

1. Drop the materialized view ([DROP MATERIALIZED VIEW](../sql-reference/sql/drop-materialized-view.md)).
2. Create the materialized view again ([CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md)).
3. Create the same privileges on the view ([GRANT](../sql-reference/sql/grant-privilege.md) and
   [REVOKE](../sql-reference/sql/revoke-privilege.md)).

#### Renaming or Swapping the Base Table

Renaming or swapping the base table (or the schema or database containing the base table) can result in the materialized view
pointing to a different base table than the base table used to create the materialized view. The following are examples of
situations in which this can occur:

* The base table is renamed (through [ALTER TABLE](../sql-reference/sql/alter-table.md) … RENAME), and another table is created with the
  original name of the base table.
* The base table of a materialized view is swapped with another table (through
  [ALTER TABLE](../sql-reference/sql/alter-table.md) … SWAP WITH).
* The schema or database containing the base table of the materialized view is moved through DROP, SWAP or RENAME.

In these cases, the materialized view is suspended. In most cases, you must recreate the materialized view in order to use the
view.

#### Dropping the Base Table

If a base table is dropped, the materialized view is suspended (but not automatically dropped).

In most cases, the materialized view must be dropped.

If for some reason you are recreating the base table and would also like to recreate the materialized view with the same
definition it had previously, then first recreate the base table and then replace the view by using
`CREATE OR REPLACE MATERIALIZED VIEW <view_name> ... COPY GRANTS ...`.

### Materialized Views in Cloned Schemas and Databases

If you clone a schema or a database that contains a materialized view, then the materialized view is cloned.

If you clone the materialized view and the corresponding base table at the same time (as part of the same
`CREATE SCHEMA ... CLONE` or `CREATE DATABASE ... CLONE` operation), then the cloned materialized view
refers to the cloned base table.

If you clone the materialized view without cloning the base table (e.g. if the table is in Database1.Schema1
and the view is in Database1.Schema2, and you clone only Schema2 rather than all of Database1), then the cloned view
will refer to the original base table.

## Materialized Views Cost

Materialized views impact your costs for both storage and compute resources:

* Storage: Each materialized view stores query results, which adds to the monthly storage usage for your account.
* Compute resources: In order to prevent materialized views from becoming out-of-date, Snowflake performs automatic background maintenance of
  materialized views. When a base table changes, all materialized views defined on the table are updated by a background service that uses compute
  resources provided by Snowflake.

  These updates can consume significant resources, resulting in increased credit usage. However, Snowflake ensures efficient credit
  usage by billing your account only for the actual resources used. Billing is calculated in 1-second increments.

To learn how many credits per compute-hour are consumed by materialized views, refer to the “Serverless
Feature Credit Table” in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

### Estimating and Controlling Costs

There are no tools to estimate the costs of maintaining materialized views.
In general, the costs are proportional to:

* The number of materialized views created on each base table, and the amount of data that changes in each of those
  materialized views when the base table changes. Any changes to micro-partitions in the base table require eventual
  materialized view maintenance, whether those changes are due to reclustering or DML statements run on the base table.
* The number of those materialized views that are clustered. Maintaining clustering (of either a table or a
  materialized view) adds costs.

  If a materialized view is clustered differently from the base table, the number of micro-partitions changed in the
  materialized view might be substantially larger than the number of micro-partitions changed in the base table.

  For example, consider the case where the base table is changed largely by inserting (appending) data, and is not
  clustered, so the base table is largely in the order that the rows were inserted into the table. Imagine that the
  materialized view is clustered by an independent column, for example, postal code. If 100 new rows are added to
  the base table, those might go into one or two new micro-partitions, leaving the other micro-partitions in the
  base table untouched. But those 100 rows might require re-writing 100 micro-partitions in the clustered
  materialized view.

  As another example, consider deletes. Deleting the oldest rows in an unclustered base table might delete only
  the oldest micro-partitions, but might require changes to a far larger number of micro-partitions in a
  materialized view that is not clustered by age.

  (For more details about clustering materialized views,
  see Materialized Views and Clustering.)

You can control the cost of maintaining materialized views by carefully choosing how many views to create, which tables to create them on, and each view’s definition (including the number of rows and columns in that view).

You can also control costs by suspending or resuming the materialized view; however, suspending maintenance typically only defers costs, rather
than reducing them. The longer that maintenance has been deferred, the more maintenance there is to do.

See also Best Practices for Maintaining Materialized Views.

> **Tip:**
>
> If you are concerned about the cost of maintaining materialized views, Snowflake recommends starting slowly with this
> feature (i.e. create only a few materialized views on selected tables) and monitor the costs over time.

### Viewing Costs

You can view the billing costs for maintaining materialized views using [Snowsight](ui-snowsight-gs.md) or SQL:

Snowsight:
:   As a user with the [proper privileges](cost-access-control.md), In the navigation menu, select Admin » Cost management, and then select Consumption.

    Use the All Service Types filter and select Materialized View.

    The credit costs are tracked in a Snowflake-provided virtual warehouse named  MATERIALIZED_VIEW_MAINTENANCE.

SQL:
:   Query either of the following:

    * [MATERIALIZED_VIEW_REFRESH_HISTORY](../sql-reference/functions/materialized_view_refresh_history.md) table function (in the [Snowflake Information Schema](../sql-reference/info-schema.md)).

      For example:

      ```sqlexample
      SELECT * FROM TABLE(INFORMATION_SCHEMA.MATERIALIZED_VIEW_REFRESH_HISTORY());
      ```
    * [MATERIALIZED_VIEW_REFRESH_HISTORY view](../sql-reference/account-usage/materialized_view_refresh_history.md) view (in [Account Usage](../sql-reference/account-usage.md)).

      The following queries can be executed against the MATERIALIZED_VIEW_REFRESH_HISTORY view:

      **Query: Materialized Views cost history (by day, by object)**

      This query provides a full list of materialized views and the volume of credits consumed via the service over the last 30 days, broken
      out by day. Any irregularities in the credit consumption or consistently high consumption are flags for additional investigation.

      ```sqlexample
      SELECT TO_DATE(start_time) AS date,
        database_name,
        schema_name,
        table_name,
        SUM(credits_used) AS credits_used
      FROM snowflake.account_usage.materialized_view_refresh_history
      WHERE start_time >= DATEADD(month,-1,CURRENT_TIMESTAMP())
      GROUP BY 1,2,3,4
      ORDER BY 5 DESC;
      ```

      **Query: Materialized Views History & m-day average**

      This query shows the average daily credits consumed by materialized views grouped by week over the last year. It can help identify
      anomalies in daily averages over the year so you can investigate spikes or unexpected changes in
      consumption.

      ```sqlexample
      WITH credits_by_day AS (
        SELECT TO_DATE(start_time) AS date,
          SUM(credits_used) AS credits_used
        FROM snowflake.account_usage.materialized_view_refresh_history
        WHERE start_time >= DATEADD(year,-1,CURRENT_TIMESTAMP())
        GROUP BY 1
        ORDER BY 2 DESC
      )

      SELECT DATE_TRUNC('week',date),
        AVG(credits_used) AS avg_daily_credits
      FROM credits_by_day
      GROUP BY 1
      ORDER BY 1;
      ```

> **Note:**
>
> [Resource monitors](resource-monitors.md) provide control over virtual warehouse credit usage; however, you cannot use them to control
> credit usage for the Snowflake-provided warehouses, including the  MATERIALIZED_VIEW_MAINTENANCE warehouse.

## Materialized Views and Clustering

Defining a clustering key on a materialized view is supported and can increase performance in many situations.
However, it also adds costs.

If you cluster both the materialized view(s) and the base table on which the
materialized view(s) are defined, you can cluster the materialized view(s) on
different columns from the columns used to cluster the base table.

In most cases, clustering a subset of the materialized views on a
table tends to be more cost-effective than clustering the table itself.
If the data in the base table is accessed (almost) exclusively through the
materialized views, and (almost) never directly through the base table,
then clustering the base table adds costs without adding benefit.

If you are considering clustering both the base table and the materialized
views, Snowflake recommends that you start by clustering only the materialized
views, and that you monitor performance and cost before and after adding
clustering to the base table.

If you plan to create a table, load it, and create a clustered materialized
view(s) on the table, then Snowflake recommends that you create the
materialized views last (after loading as much data as possible). This
can save money on the initial data load, because it avoids some extra effort
to maintain the clustering of the materialized view the first time that
the materialized view is loaded.

For more details about clustering, refer to:

* [Understanding Snowflake Table Structures](tables-micro-partitions.md)
* [Automatic Clustering](tables-auto-reclustering.md)

For more information about the costs of clustering materialized views, refer to:

* Materialized Views Cost
* Best Practices for Materialized Views

## Materialized Views and Time Travel

Currently, you cannot use [Time Travel](data-time-travel.md) to
[query historical data](data-time-travel.md) for materialized views.

However, note the following:

* You can use Time Travel to
  [clone a database or schema containing a materialized view at a specific point in the past](data-time-travel.md).
  Snowflake clones the materialized view at the specified point in time.
* To support cloning with Time Travel, Snowflake does maintain historical data for materialized views. You will be billed for
  the [storage costs for historical data](data-cdp-storage-costs.md) for materialized views.
* The storage costs depend on the [data retention period](data-time-travel.md) for the materialized
  views, which is determined by the [DATA_RETENTION_TIME_IN_DAYS](../sql-reference/parameters.md) parameter. Materialized
  views inherit this parameter from its parent schema or database.

## Best Practices for Materialized Views

The following sections summarize the best practices for working with materialized views:

* Best Practices for Creating Materialized Views
* Best Practices for Maintaining Materialized Views
* Best Practices for Clustering Materialized Views and their Base Tables

### Best Practices for Creating Materialized Views

* Most materialized views should do one or both of the following:

  + Filter data. You can do this by:

    - Filtering rows (e.g. defining the materialized view so that only very recent data is included).
      In some applications, the best data to store is the abnormal data. For example, if you are monitoring
      pressure in a gas pipeline to estimate when pipes might fail, you might store all pressure data in the base
      table, and store only unusually high pressure measurements in the materialized view. Similarly, if you are
      monitoring network traffic, your base table might store all monitoring information, while your materialized
      view might store only unusual and suspicious information (e.g. from IP addresses known to launch
      DOS (Denial Of Service) attacks).
    - Filtering columns (e.g. selecting specific columns rather than “SELECT \* …”).
      Using `SELECT * ...` to define a materialized view typically is expensive. It can also lead to future
      errors; if columns are added to the base table later (e.g. `ALTER TABLE ... ADD COLUMN ...`), the
      materialized view does not automatically incorporate the new columns.
  + Perform resource-intensive operations and store the results so that the resource intensive operations
    don’t need to be performed as often.
* You can create more than one materialized view for the same base table. For example, you can create one materialized
  view that contains just the most recent data, and another materialized view that stores unusual data. You can then
  create a non-materialized view that joins the two tables and shows recent data that matches unusual historical
  data so that you can quickly detect unusual situations, such as a DOS (denial of service) attack that is ramping up.

  Snowflake recommends materialized views for unusual data only when:

  > + The base table is not clustered, or the columns that contain the unusual data are not already part of the base
  >   table’s clustering key.
  > + The data is unusual enough that it is easy to isolate, but not so unusual that it is rarely used. (If the data
  >   is rarely used, the cost of maintaining the materialized view is likely to outweigh the performance benefit and
  >   cost savings from being able to access it quickly when it is used.)

### Best Practices for Maintaining Materialized Views

* Snowflake recommends batching DML operations on the base table:

  + `DELETE`: If tables store data for the most recent time period (e.g. the most recent day or week or month),
    then when you trim your base table by deleting old data, the changes to the base table are propagated to the
    materialized view. Depending upon how the data is distributed across the micro-partitions, this could cause you
    to pay more for background updates of the materialized views. In
    some cases, you might be able to reduce costs by deleting less frequently (e.g. daily rather than hourly, or
    hourly rather than every 10 minutes).

    If you do not need to keep a specific amount of old data, you should experiment to find the best balance between cost and
    functionality.
  + `INSERT`, `UPDATE`, and `MERGE`: Batching these types of DML statements on the
    base table can reduce the cost of maintaining the materialized views.

### Best Practices for Clustering Materialized Views and their Base Tables

* If you create a materialized view on a base table, and if the materialized views are accessed frequently and the
  base table is not accessed frequently, it is usually more efficient to avoid clustering the base table.

  If you create a materialized view on a clustered table, consider removing any clustering on the base table, because
  any change to the clustering of the base table will eventually require a refresh of the materialized view,
  which adds to the materialized view’s maintenance costs.
* Clustering materialized views, especially materialized views on base tables that change frequently, increases
  costs. Do not cluster more materialized views than you need to.
* Almost all information about clustering tables also applies to clustering materialized views.
  For more information about clustering tables, see [Strategies for Selecting Clustering Keys](tables-clustering-keys.md).

## Examples

This section contains additional examples of creating and using materialized views. For a simple, introductory example, see
Basic Example: Creating a Materialized View (in this topic).

### Simple Materialized View

This first example illustrates a simple materialized view and a simple query on the view.

> Create the table and load the data, and create the view:
>
> ```sqlexample
> CREATE TABLE inventory (product_ID INTEGER, wholesale_price FLOAT,
>   description VARCHAR);
>
> CREATE OR REPLACE MATERIALIZED VIEW mv1 AS
>   SELECT product_ID, wholesale_price FROM inventory;
>
> INSERT INTO inventory (product_ID, wholesale_price, description) VALUES
>     (1, 1.00, 'cog');
> ```
>
> Select data from the view:
>
> ```sqlexample
> SELECT product_ID, wholesale_price FROM mv1;
> +------------+-----------------+
> | PRODUCT_ID | WHOLESALE_PRICE |
> |------------+-----------------|
> |          1 |               1 |
> +------------+-----------------+
> ```

### Joining a Materialized View

You can join a materialized view with a table or another view. This example builds on the previous example by creating an additional
table, and then a non-materialized view that shows profits by joining the materialized view to a table:

> ```sqlexample
> CREATE TABLE sales (product_ID INTEGER, quantity INTEGER, price FLOAT);
>
> INSERT INTO sales (product_ID, quantity, price) VALUES
>    (1,  1, 1.99);
>
> CREATE or replace VIEW profits AS
>   SELECT m.product_ID, SUM(IFNULL(s.quantity, 0)) AS quantity,
>       SUM(IFNULL(quantity * (s.price - m.wholesale_price), 0)) AS profit
>     FROM mv1 AS m LEFT OUTER JOIN sales AS s ON s.product_ID = m.product_ID
>     GROUP BY m.product_ID;
> ```
>
> Select data from the view:
>
> ```sqlexample
> SELECT * FROM profits;
> +------------+----------+--------+
> | PRODUCT_ID | QUANTITY | PROFIT |
> |------------+----------+--------|
> |          1 |        1 |   0.99 |
> +------------+----------+--------+
> ```

### Suspending Updates to a Materialized View

The following example temporarily suspends the use (and maintenance)
of the `mv1` materialized view, and shows that queries on that view
generate an error message while the materialized view is suspended:

> ```sqlexample
> ALTER MATERIALIZED VIEW mv1 SUSPEND;
>
> INSERT INTO inventory (product_ID, wholesale_price, description) VALUES
>     (2, 2.00, 'sprocket');
>
> INSERT INTO sales (product_ID, quantity, price) VALUES
>    (2, 10, 2.99),
>    (2,  1, 2.99);
> ```
>
> Select data from the materialized view:
>
> ```sqlexample
> SELECT * FROM profits ORDER BY product_ID;
> ```
>
> Output:
>
> ```output
> 002037 (42601): SQL compilation error:
> Failure during expansion of view 'PROFITS': SQL compilation error:
> Failure during expansion of view 'MV1': SQL compilation error: Materialized View MV1 is invalid.
> ```
>
> Resume:
>
> ```sqlexample
> ALTER MATERIALIZED VIEW mv1 RESUME;
> ```
>
> Select data from the materialized view:
>
> ```sqlexample
> SELECT * FROM profits ORDER BY product_ID;
> +------------+----------+--------+
> | PRODUCT_ID | QUANTITY | PROFIT |
> |------------+----------+--------|
> |          1 |        1 |   0.99 |
> |          2 |       11 |  10.89 |
> +------------+----------+--------+
> ```

### Clustering a Materialized View

This example creates a materialized view and then later clusters it:

> These statements create two tables that track information about segments of a
> pipeline (e.g. for natural gas).
>
> The segments that are most likely to fail in the near future are often the segments that are oldest, or that are
> made of materials that corrode easily, or that had experienced periods of unusually high pressure, so
> this example tracks each pipe’s age, pressure, and material (iron, copper, PVC plastic, etc.).
>
> > ```sqlexample
> > CREATE TABLE pipeline_segments (
> >     segment_ID BIGINT,
> >     material VARCHAR, -- e.g. copper, cast iron, PVC.
> >     installation_year DATE,  -- older pipes are more likely to be corroded.
> >     rated_pressure FLOAT  -- maximum recommended pressure at installation time.
> >     );
> >
> > INSERT INTO pipeline_segments
> >     (segment_ID, material, installation_year, rated_pressure)
> >   VALUES
> >     (1, 'PVC', '1994-01-01'::DATE, 60),
> >     (2, 'cast iron', '1950-01-01'::DATE, 120)
> >     ;
> >
> > CREATE TABLE pipeline_pressures (
> >     segment_ID BIGINT,
> >     pressure_psi FLOAT,  -- pressure in Pounds per Square Inch
> >     measurement_timestamp TIMESTAMP
> >     );
> > INSERT INTO pipeline_pressures
> >    (segment_ID, pressure_psi, measurement_timestamp)
> >   VALUES
> >     (2, 10, '2018-09-01 00:01:00'),
> >     (2, 95, '2018-09-01 00:02:00')
> >     ;
> > ```
>
> The pipeline segments don’t change very frequently, and the oldest pipeline segments are the segments most
> likely to fail, so create a materialized view of the oldest segments.
>
> > ```sqlexample
> > CREATE MATERIALIZED VIEW vulnerable_pipes
> >   (segment_ID, installation_year, rated_pressure)
> >   AS
> >     SELECT segment_ID, installation_year, rated_pressure
> >         FROM pipeline_segments
> >         WHERE material = 'cast iron' AND installation_year < '1980'::DATE;
> > ```
>
> You can add clustering or change the clustering key. For example, to cluster on `installation_year`:
>
> > ```sqlexample
> > ALTER MATERIALIZED VIEW vulnerable_pipes CLUSTER BY (installation_year);
> > ```
>
> New pressure measurements arrive frequently (perhaps every 10
> seconds), so maintaining a materialized view on the pressure
> measurements would be expensive. Therefore, even though high
> performance (fast retrieval) of recent pressure data is important,
> the `pipeline_pressures` table starts without a materialized view.
>
> If performance is too slow, you can create a materialized view that contains only recent pressure
> data, or that contains data only about abnormal high-pressure events.
>
> Create a (non-materialized) view that combines information from the
> materialized view and the `pipeline_pressures` table:
>
> > ```sqlexample
> > CREATE VIEW high_risk AS
> >     SELECT seg.segment_ID, installation_year, measurement_timestamp::DATE AS measurement_date,
> >          DATEDIFF('YEAR', installation_year::DATE, measurement_timestamp::DATE) AS age,
> >          rated_pressure - age AS safe_pressure, pressure_psi AS actual_pressure
> >        FROM vulnerable_pipes AS seg INNER JOIN pipeline_pressures AS psi
> >            ON psi.segment_ID = seg.segment_ID
> >        WHERE pressure_psi > safe_pressure
> >        ;
> > ```
>
> Now list the high-risk pipeline segments:
>
> > > ```sqlexample
> > > SELECT * FROM high_risk;
> > > +------------+-------------------+------------------+-----+---------------+-----------------+
> > > | SEGMENT_ID | INSTALLATION_YEAR | MEASUREMENT_DATE | AGE | SAFE_PRESSURE | ACTUAL_PRESSURE |
> > > |------------+-------------------+------------------+-----+---------------+-----------------|
> > > |          2 | 1950-01-01        | 2018-09-01       |  68 |            52 |              95 |
> > > +------------+-------------------+------------------+-----+---------------+-----------------+
> > > ```
> >
> > This shows that the pipeline segment with `segment_id = 2`, which is made of a
> > material that corrodes, is old. This segment has never experienced pressure higher than
> > the maximum pressure rating at the time it was installed, but because of the potential
> > for corrosion, its “safe limit” has declined over time, and the highest pressure it has
> > experienced is higher than the pressure that was recommended for a pipe as old
> > as the pipe was at the time of the pressure measurement.

### Creating a Materialized View on Shared Data

You can create a materialized view on shared data.

Account1:

```sqlexample
create or replace table db1.schema1.table1(c1 int);
create or replace share sh1;
grant usage on database db1 to share sh1;
alter share sh1 add accounts = account2;
grant usage on schema db1.schema1 to share sh1;
grant select on table db1.schema1.table1 to share sh1;
```

Account2:

```sqlexample
create or replace database dbshared from share account1.sh1;
create or replace materialized view mv1 as select * from dbshared.schema1.table1 where c1 >= 50;
```

> **Note:**
>
> Remember that maintaining materialized views will consume credits. When you create a materialized view on
> someone else’s shared table, the changes to that shared table will result in charges to you as your
> materialized view is maintained.

### Sharing a Materialized View

You can use Snowflake’s data sharing feature to share a materialized view.

For more information about data sharing, see [Data sharing and collaboration in Snowflake](../guides-overview-sharing.md).

> **Note:**
>
> Remember that maintaining materialized views will consume credits. When someone else creates a materialized
> view on your shared data, any changes to your shared data can cause charges to the people who have materialized
> views on your shared data. The larger the number of materialized views on a shared base table, the more important
> it is to update that base table efficiently to minimize the costs of maintaining materialized views.

## Troubleshooting

### Compilation Error: `Failure during expansion of view '<name>': SQL compilation error: Materialized View <name> is invalid.`

Possible Causes:
:   * The materialized view has been suspended. For more information about suspending and resuming views, see
      [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md).
    * A change to the base table of the materialized view has invalidated the materialized view. For example, this error is
      returned if:

      + The base table is dropped.
      + A column in the base table column has been dropped.
    * The background process has encountered an error of a specific type (for example, a “division by zero” error) and has failed
      to refresh the materialized view.

Possible Solutions:
:   * If the view has been suspended:

      + Consider resuming the view by executing
        [ALTER MATERIALIZED VIEW … RESUME](../sql-reference/sql/alter-materialized-view.md).
      + Consider running the query against the base table. However, this is likely
        to consume more credits and take longer than running the query against the materialized view.
    * If the base table has been modified or dropped:

      + If the base table has been dropped, then drop the materialized view.
      + If the base table has been modified (e.g. has dropped a column referenced by the view), and if the materialized view would still
        be useful with the new version of the table, then consider dropping and re-creating the materialized view, using the
        columns that remain in the base table.
      + If no other cause of the error message is apparent, consider dropping and re-creating the materialized view.
      + Consider running the query against the base table. However, this is likely
        to consume more credits and take longer than running the query against the materialized view.
    * If the background process has failed to refresh the materialized view due to an error, the error message should include
      details about why the materialized view has been invalidated. For example:

      ```output
      Failure during expansion of view 'MY_MV':
        SQL compilation error: Materialized View MY_MV is invalid.
        Invalidation reason: Division by zero
      ```

    If this occurs, address the problem described in the error message, and resume the materialized view by using the
    [ALTER MATERIALIZED VIEW … RESUME](../sql-reference/sql/alter-materialized-view.md) command.

### SHOW MATERIALIZED VIEWS Command Shows Materialized Views That Are Not Updated

Possible Cause:
:   One possible cause is that the refresh failed because the [SELECT](../sql-reference/sql/select.md) statement in the view definition
    failed.

    Because the refresh is done performed by the background process, you will not see an error message at the time the refresh is
    attempted. Instead, you will see the error message when you query the materialized view or when you execute
    [SHOW MATERIALIZED VIEWS](../sql-reference/sql/show-materialized-views.md).

Possible Solution:
:   If the `invalid` column is `true`, check the `invalid_reason` column for the reason why the view was invalidated.

    In some cases, you might be able to debug the problem by manually running the SELECT statement in the materialized view’s
    definition, or by running a simpler (less expensive) SELECT statement on the table referenced in the materialized view’s
    definition.

    If you do not know the exact definition of the materialized view, you can find it in the output of
    [SHOW MATERIALIZED VIEWS](../sql-reference/sql/show-materialized-views.md) or by using the [GET_DDL](../sql-reference/functions/get_ddl.md) function.

---
title: Working with passwords
source: https://docs.snowflake.com/en/user-guide/password-authentication.md
section: User Guide
---

# Working with passwords

This topic describes how an administrator can configure password requirements and reset user passwords.

## Password policies

A password policy specifies the requirements that must be met to create and reset a password to authenticate to Snowflake.

Snowflake provides two options for password policies:

* A built-in password policy to facilitate the initial user provisioning process.
* A schema-level password policy object that can be set at the level of the Snowflake account, an individual user, or both depending on
  the use cases and needs of the user administrator.

### Best practices for password policies and passwords

Snowflake recommends the following best practices regarding passwords and password policies:

Create and enforce the custom password policy
:   The password policy object is enforced once the password policy is set on an account or user.

    Set these properties to values that meet your internal security needs. For details, see Step 4: Create a password policy
    (in this topic):

    * `PASSWORD_HISTORY` to ensure users cannot reuse passwords too frequently and to help prevent brute force attacks to determine the
      password for a user.
    * `PASSWORD_MIN_AGE_DAYS` to require the user to use the new password. A value of 0 is not recommended because the user can change the
      password to exhaust the password history and reuse the original password value too soon.

    To require the user to change their password to meet the password policy on their initial or next login to Snowflake, set the
    `MUST_CHANGE_PASSWORD` property on the user to `TRUE` using an [ALTER USER](../sql-reference/sql/alter-user.md) command.

    For details, see Step 6: Require a password change (in this topic).

Require strong passwords
:   Define an account-level password policy to require strong passwords.

    A strong password has at least 14 characters and includes a combination of uppercase and lowercase letters, special characters
    (e.g. `!` and `*`), and numbers.

MFA
:   Use [multi-factor authentication (MFA)](security-mfa.md) for additional security.

Using SCIM
:   You can set a password for the user to access Snowflake in a SCIM API request. SCIM administrators and user administrators should choose
    to manage the user password to access Snowflake in either your identity provider or using a password policy in Snowflake.

    Currently, users provisioned to Snowflake with SCIM are required to have their password meet the
    default Snowflake password policy. This requirement can be bypassed if you choose to use this
    password policy feature.

    To bypass the default password policy requirement, follow the instructions in the Using Password Policies section (in this topic).

Monitoring passwords
:   To monitor passwords:

    * Query the Snowflake Account Usage [USERS](../sql-reference/account-usage/users.md) view to determine whether the `HAS_PASSWORD`
      column value returns `TRUE` for a given user.
    * Query the Snowflake Account Usage [LOGIN_HISTORY](../sql-reference/account-usage/login_history.md) view and evaluate the
      `FIRST_AUTHENTICATION_FACTOR` column. If a user does not require a password to access Snowflake, execute an
      [ALTER USER](../sql-reference/sql/alter-user.md) command to set the `password` property to NULL.

### Setting an initial password for new users

During the initial user creation, it is possible to set a weak password for the user that does not meet the minimum requirements
of the password policy that is in effect. This gives administrators the option to use generic passwords for the user during the
creation process. If this pathway is chosen, Snowflake strongly recommends setting the `MUST_CHANGE_PASSWORD` property to
`TRUE` to require users to change their password on their next login, including the initial login. When a user resets
their password, they must chose one that conforms to the password policy in effect, whether it is a Snowflake-provided policy or a custom
one.

Additionally, Snowflake allows creating users without an initial password to support business processes in which new users are not allowed
to log into the system. If this occurs, the user’s `PASSWORD` property value will be `NULL`. However, as a general rule,
Snowflake expects that users are created with initial passwords.

### Snowflake-provided password policy

A password can be any case-sensitive string up to 256 characters, including blank spaces and special (that is, non-alphanumeric) characters,
such as exclamation points (`!`), percent signs (`%`), and asterisks (`*`).

In the context of resetting an existing password (e.g. change `'test12345'` to `'q@-*DaC2yjZoq3Re4JYX'`), Snowflake
enforces the following password policy as a minimum requirement while using the [ALTER USER](../sql-reference/sql/alter-user.md) command and the
web interface:

* Must be at least 14 characters long.
* Must contain at least 1 digit.
* Must contain at least 1 uppercase letter and 1 lowercase letter.

Snowflake strongly recommends the following guidelines for creating the strongest passwords possible:

* Create a unique password for Snowflake (i.e. do not reuse passwords from other systems or accounts).
* Include multiple, random mixed-case letters, numbers, and special characters, including blank spaces.
* Do not use easily-guessed common passwords, names, numbers, or dates.

### Custom password policy for the account and users

The custom password policy is a schema-level object that specifies the requirements that must be met to create and reset a password to
authenticate to Snowflake, including the number of attempts to enter the password successfully and the number of minutes before a password
can be retried (i.e. the “lockout” time).

The password policy requirements for a password include upper or lowercase letters, special characters, numbers, and password length to
meet security requirements for users and clients to authenticate to Snowflake. Password policies that require strong passwords help to meet
security guidelines and regulations.

Snowflake supports setting a password policy for your Snowflake account and for individual users. Only one password policy
can be set at any given time for your Snowflake account or a user. If a password policy exists for the Snowflake account and another
password policy is set for a user in the same Snowflake account, the user-level password policy takes precedence over the account-level
password policy.

The password policy applies to new passwords that are set in your Snowflake account. To ensure that users with existing passwords meet the
password policy requirements, require users to change their password during their next login to Snowflake as shown in
Step 6: Require a password change (in this topic).

> **Note:**
>
> Most password policy property changes take effect the next time a user changes their password. For example, if you change the
> `PASSWORD_MAX_LENGTH` property from `10` to `16` to require the user to use a longer password then the user must
> comply with the password policy change whenever they change their password. You can set the user property `MUST_CHANGE_PASSWORD`
> to `TRUE` with an [ALTER USER](../sql-reference/sql/alter-user.md) statement to require the user to change their password on their next login
> to Snowflake.
>
> However, some password policy property changes take effect during the next login because Snowflake does not force the user to change
> their password in their current session:
>
> * `PASSWORD_MAX_AGE_DAYS = integer`
> * `PASSWORD_MAX_RETRIES = integer`
> * `PASSWORD_LOCKOUT_TIME_MINS = integer`
>
> Any changes to these properties do not affect the current session. For example, a change to the value of the
> `PASSWORD_MAX_AGE_DAYS` property does not cause the user’s current password to expire. However, during the next login to
> Snowflake, the user must change their password.

#### Considerations

* [Future grants](../sql-reference/sql/grant-privilege.md) of privileges on password policies are not supported.

  As a workaround, grant the APPLY PASSWORD POLICY privilege to a custom role to allow that role to apply password policies on the user or
  the Snowflake account.
* The password policy can be managed with SQL using [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](snowsql.md) or a
  supported [driver or connector](../guides-overview-connecting.md), or using Snowsight.
* When you reset or change a password, Snowflake evaluates the password policy to ensure that the newly created password matches the
  password policy requirements.
* Tracking password policy usage:

  + Query the Account Usage [PASSWORD_POLICIES](../sql-reference/account-usage/password_policies.md) view to return a row for each
    password policy in your Snowflake account.
  + Use the Information Schema table function [POLICY_REFERENCES](../sql-reference/functions/policy_references.md) to return a row for each user that is
    assigned to the specified password policy and a row for the password policy assigned to the Snowflake account.

    Currently, only the following syntax is supported for password policies:

    > ```sqlsyntax
    > POLICY_REFERENCES( POLICY_NAME => '<password_policy_name>' )
    > ```

    Where `password_policy_name` is the fully qualified name of the password policy.

    For example, execute the following query to return a row for each user that is assigned the password policy named
    `password_policy_prod_1`, which is stored in the database named `my_db` and the schema named `my_schema`:

    > ```sqlexample
    > SELECT *
    > FROM TABLE(
    >     my_db.information_schema.policy_references(
    >       POLICY_NAME => 'my_db.my_schema.password_policy_prod_1'
    >   )
    > );
    > ```

### Access control for password policies

The following access control privileges allow users to work with password policies:

| Privilege | Object type | Usage |
| --- | --- | --- |
| CREATE PASSWORD POLICY | Schema | Enables creating a new password policy. |
| APPLY PASSWORD POLICY | Account, User | Enables applying a password policy at the account or user level. |
| OWNERSHIP | Password policy | Grants full control over the password policy. Required to alter most properties of a password policy. |

The following table summarizes the relationship between password policy DDL operations and their necessary privileges.

| Operation | Privilege required |
| --- | --- |
| Create password policy | A role with the CREATE PASSWORD POLICY privilege on the schema to store the password policy. |
| Alter password policy | A role with the OWNERSHIP privilege on the password policy. |
| Drop password policy | A role with the OWNERSHIP privilege on the password policy. |
| Describe password policy | A role with the OWNERSHIP privilege on the password policy or . the APPLY PASSWORD POLICY privilege on the account. |
| Show password policies | A role with the OWNERSHIP privilege on the password policy or . the APPLY PASSWORD POLICY privilege on the account. |
| Set & unset password policy | A role with the APPLY PASSWORD POLICY privilege on the account or the user. |

> **Note:**
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

### DDL commands

Snowflake provides the following DDL commands to manage password policy objects:

* [CREATE PASSWORD POLICY](../sql-reference/sql/create-password-policy.md)
* [ALTER PASSWORD POLICY](../sql-reference/sql/alter-password-policy.md)
* [DROP PASSWORD POLICY](../sql-reference/sql/drop-password-policy.md)
* [SHOW PASSWORD POLICIES](../sql-reference/sql/show-password-policies.md)
* [DESCRIBE PASSWORD POLICY](../sql-reference/sql/desc-password-policy.md)

### Using password policies

The following steps are a representative guide to define and set a password policy in Snowflake.

These steps assume a centralized management approach in which a custom role named `policy_admin` owns the password policy (i.e. has the
OWNERSHIP privilege on the password policy) and is responsible for setting the password policy on an account or user
(i.e. has the global APPLY PASSWORD POLICY privilege, as shown in step 2).

> **Note:**
>
> To set a policy on an account, the `policy_admin` custom role must also have the USAGE privilege on the database and schema that
> contain the password policy.
>
> For more information, see: [Access control privileges](security-access-control-privileges.md)

#### Step 1: Create the custom role

Create a custom role that allows creating and managing password policies. Throughout this topic, the example custom role is named
`policy_admin`, although the role could have any appropriate name.

If the custom role already exists, continue to the next step.

Otherwise, create the `policy_admin` custom role.

> ```sqlexample
> USE ROLE USERADMIN;
>
> CREATE ROLE policy_admin;
> ```

#### Step 2: Grant privileges to the custom role

If the `policy_admin` custom role does not already have the following privileges, grant these privileges as shown below:

* USAGE on the database and schema that will contain the password policy.
* CREATE PASSWORD POLICY on the schema that will store the password policy.
* APPLY PASSWORD POLICY on the account.

```sqlexample
USE ROLE SECURITYADMIN;

GRANT USAGE ON DATABASE security TO ROLE policy_admin;

GRANT USAGE ON SCHEMA security.policies TO ROLE policy_admin;

GRANT CREATE PASSWORD POLICY ON SCHEMA security.policies TO ROLE policy_admin;

GRANT APPLY PASSWORD POLICY ON ACCOUNT TO ROLE policy_admin;
```

If you decide to set a password policy on a user, grant the APPLY PASSWORD POLICY privilege on the user. For example, if the username is
`JSMITH`, execute the following command.

> ```sqlexample
> GRANT APPLY PASSWORD POLICY ON USER jsmith TO ROLE policy_admin;
> ```

For more information, see Access control for password policies.

#### Step 3: Grant the custom role to a user

Grant the `policy_admin` custom role to the users responsible for managing password policies.

```sqlexample
USE ROLE SECURITYADMIN;
GRANT ROLE policy_admin TO USER jsmith;
```

For more information, see [Configuring access control](security-access-control-configure.md)

#### Step 4: Create a password policy

Using the `policy_admin` custom role, create a password policy named `password_policy_prod_1`. For more information, see
[CREATE PASSWORD POLICY](../sql-reference/sql/create-password-policy.md).

> ```sqlexample
> USE ROLE policy_admin;
>
> USE SCHEMA security.policies;
>
> CREATE PASSWORD POLICY PASSWORD_POLICY_PROD_1
>     PASSWORD_MIN_LENGTH = 14
>     PASSWORD_MAX_LENGTH = 24
>     PASSWORD_MIN_UPPER_CASE_CHARS = 2
>     PASSWORD_MIN_LOWER_CASE_CHARS = 2
>     PASSWORD_MIN_NUMERIC_CHARS = 2
>     PASSWORD_MIN_SPECIAL_CHARS = 2
>     PASSWORD_MIN_AGE_DAYS = 1
>     PASSWORD_MAX_AGE_DAYS = 999
>     PASSWORD_MAX_RETRIES = 3
>     PASSWORD_LOCKOUT_TIME_MINS = 30
>     PASSWORD_HISTORY = 5
>     COMMENT = 'production account password policy';
> ```
>
> > **Note:**
> >
> > The property `PASSWORD_MAX_AGE_DAYS` is set to the largest value, 999. Choose a value that aligns with your internal
> > guidelines. For details, see [CREATE PASSWORD POLICY](../sql-reference/sql/create-password-policy.md).

#### Step 5: Set the password policy on the account or an individual user

Set the policy on an account with the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command:

> ```sqlexample
> ALTER ACCOUNT SET PASSWORD POLICY security.policies.password_policy_prod_1;
> ```

If you decide to create an additional password policy for one or more users, set the user-level password policy on a user with an
[ALTER USER](../sql-reference/sql/alter-user.md) command:

> ```sqlexample
> ALTER USER jsmith SET PASSWORD POLICY security.policies.password_policy_user;
> ```

> **Important:**
>
> To replace a password policy that is already set for an account or user, unset the password policy first and then set the new password
> policy for the account or user. For example:
>
> > ```sqlexample
> > ALTER ACCOUNT UNSET PASSWORD POLICY;
> >
> > ALTER ACCOUNT SET PASSWORD POLICY security.policies.password_policy_prod_2;
> > ```

#### Step 6: Require a password change

Set the `MUST_CHANGE_PASSWORD` property to `TRUE` for individual users using an [ALTER USER](../sql-reference/sql/alter-user.md) statement to
require the users to change their password to meet the password policy on their next login to Snowflake.

> ```sqlexample
> ALTER USER JSMITH SET MUST_CHANGE_PASSWORD = true;
> ```

## Resetting the password for a user

Administrators can change a user’s password through the following interfaces.

### Snowsight

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Governance & security » Users & roles.
3. Locate the user whose password you want to change and select  » Reset Password.
4. Enter a new password for the user and confirm the password.
5. Select Update.

### Using SQL

Use the [ALTER USER](../sql-reference/sql/alter-user.md) command to input a user’s password. For example:

> ```sqlexample
> ALTER USER janesmith SET PASSWORD = 'H8MZRqa8gEe/kvHzvJ+Giq94DuCYoQXmfbb$Xnt' MUST_CHANGE_PASSWORD = TRUE;
> ```

Alternatively, use the ALTER USER … RESET PASSWORD syntax to generate a URL to share with the user. The URL opens a web page on which the user can enter the new password. For example:

> ```sqlexample
> ALTER USER janesmith RESET PASSWORD;
> ```
>
> > **Note:**
> >
> > * The generated URL is valid for one use only and expires after 4 hours.
> > * Executing the ALTER USER … RESET PASSWORD statement does not invalidate the current password. The user can continue to use the old password until the new password is set.

### Using Python

The [UserResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.UserResource)
method in the Snowflake Python APIs currently does not support changing the `password` for an existing user. You can only set the password
using this method when creating a new user.

## Resetting the password for an administrator

An account administrator (i.e. a user with the ACCOUNTADMIN role) can reset their own password using the procedure described in
Resetting the Password for a User.

If an account administrator is locked out of their account, a different user with the ACCOUNTADMIN role
can reset the password for the locked-out administrator. In the event that the administrator is locked out and there is no other
administrator to change the password, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to reset the password.

---
title: Working with privacy budgets
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md
section: User Guide
---

# Working with privacy budgets

This topic describes how to manage privacy budgets in a privacy policy. For an
introduction to privacy budgets and how they help prevent queries from revealing sensitive information about an entity, see
[Limiting privacy loss](differential-privacy-overview.md).

A privacy budget is created automatically when you define a privacy budget name in the body of the privacy policy. You don’t create a
privacy budget independent of a [privacy policy](differential-privacy-admin-privacy-policies.md).

When a query would cause the cumulative privacy loss to exceed the privacy budget limit, the query fails until the privacy budget
refreshes.

To manage a privacy budget, you need OWNERSHIP privilege on the privacy policy that specifies the privacy budget.

## View a privacy budget

Each privacy budget is namespaced to a privacy policy. There can be multiple privacy budgets with the same name, but each is unique to a
privacy policy. Within a privacy policy, a privacy budget is further namespaced to the consumer account incurring
[privacy loss](differential-privacy-overview.md). As a result, multiple accounts can have a privacy budget with the same name and
limit on privacy loss, but Snowflake calculates the cumulative privacy loss for each account separately.

Privacy budget names must be unique within a privacy policy. Multiple accounts can have a privacy budget with the same name
and Snowflake tallies the cumulative privacy loss for each account separately.

View a privacy budget to see its limit on privacy loss as well as the cumulative privacy loss incurred by users associated
with the budget. You can use this information to determine whether the cumulative privacy loss is approaching the privacy budget’s limit.
[See what properties exist in a privacy budget object.](../../sql-reference/sql/create-privacy-policy.md)

> **Note:**
>
> The cumulative privacy loss associated with a privacy budget does not include privacy loss incurred in accounts outside
> of the data provider’s account.

You have the following two options for viewing privacy budgets. For both options, a privacy budget appears only if analysts associated with
the privacy budget have incurred privacy loss or if an administrator has reset the privacy budget.

* **To query all privacy budgets in the account,** use the PRIVACY_BUDGETS view in the Account Usage schema.
  The [PRIVACY_BUDGETS](../../sql-reference/account-usage/privacy_budgets.md) view in the ACCOUNT USAGE schema contains all privacy budgets
  in the account. You can use it to view privacy budgets associated with all of the privacy policies that you own, and can filter results to
  focus on specific privacy budgets by name. For example, to focus on a specific privacy budget associated with the `patients_policy`
  privacy policy, you might execute the following query:

  ```sqlexample
  SELECT * FROM snowflake.account_usage.privacy_budgets
    WHERE policy_name='patients_policy' AND budget_name='analyst_budget';
  ```
* **To view the privacy budgets associated with a particular privacy policy,** use the CUMULATIVE_PRIVACY_LOSSES table function.
  You can use the [CUMULATIVE_PRIVACY_LOSSES](../../sql-reference/functions/cumulative_privacy_losses.md) table function to retrieve privacy budgets associated with a
  particular privacy policy. Unlike the PRIVACY_BUDGETS view in the ACCOUNT USAGE schema, this function does not have a fixed amount of
  latency and will return the real-time values for the cumulative privacy losses. When calling the function, the name of the privacy policy
  must be fully qualified.

  For example, to view the privacy budgets that are specified in the `my_policy_privacy` policy, execute the following:

  ```sqlexample
  SELECT *
    FROM TABLE(SNOWFLAKE.DATA_PRIVACY.CUMULATIVE_PRIVACY_LOSSES(
      'my_policy_db.my_policy_schema.my_policy_privacy'));
  ```

## Set privacy settings for a privacy budget

Snowflake lets you adjust the privacy budget’s limit on privacy loss and the maximum amount of privacy budget spent per
aggregate (collectively known as the *epsilon* in differential privacy). You set these controls by specifying the following parameters in
the body of the privacy policy:

* `BUDGET_LIMIT` — Sets the privacy budget’s limit on cumulative privacy loss.
* `MAX_BUDGET_PER_AGGREGATE` – Sets the maximum amount of the privacy budget spend per aggregate (that is, the maximum privacy loss
  incurred by each aggregate function in a query).

For example, to use the [ALTER PRIVACY POLICY](../../sql-reference/sql/alter-privacy-policy.md) command to adjust the privacy controls of an existing privacy budget,
you might execute:

```sqlexample
ALTER PRIVACY POLICY users_policy SET BODY ->
  PRIVACY_BUDGET(BUDGET_NAME=>'analysts',
  BUDGET_LIMIT=>300,
  MAX_BUDGET_PER_AGGREGATE=>0.1);
```

You can also define these controls when executing the [CREATE PRIVACY POLICY](../../sql-reference/sql/create-privacy-policy.md) command to create the privacy policy.

> **Caution:**
>
> When changing the `BUDGET_LIMIT`, `MAX_BUDGET_PER_AGGREGATE`, or `BUDGET_WINDOW` parameter, any
> parameter not specified in your ALTER PRIVACY POLICY command reverts back to its default value. So in the previous example,
> the `BUDGET_WINDOW` parameter, which determines how often Snowflake resets the privacy budget, will revert to its default value.

For more information about setting privacy controls, see [Adjust privacy controls](differential-privacy-admin-adjust.md).

## Privacy budget refresh

### About the refresh period

Snowflake periodically resets the cumulative privacy loss of a privacy budget to 0 to let analysts run a new set of queries. This refresh
period is known as the budget window. This automatic refresh lets analysts access new data as it is added to a table. Theoretically,
the analyst hasn’t learned any information about this new data, so it’s appropriate to let them run more queries.

The default budget window is weekly.

### Modify the refresh period

To modify the privacy budget refresh period, update the `budget_window` value of the privacy policy’s `privacy_budget`. For example:

```sqlexample
ALTER PRIVACY POLICY users_policy SET BODY ->
  PRIVACY_BUDGET(BUDGET_NAME=>'analysts', BUDGET_WINDOW=>'daily');
```

> **Caution:**
>
> When changing the `BUDGET_LIMIT`, `MAX_BUDGET_PER_AGGREGATE`, or `BUDGET_WINDOW` parameter, any parameter not specified
> in your ALTER PRIVACY POLICY command reverts back to its default value. So in the previous example, `BUDGET_LIMIT` and
> `MAX_BUDGET_PER_AGGREGATE` will revert to default values.

## Reset cumulative privacy loss

As analysts execute queries on data protected by a policy, Snowflake tallies the cumulative privacy loss of those queries. You can call
the [RESET_PRIVACY_BUDGET](../../sql-reference/stored-procedures/reset_privacy_budget.md) stored procedure to reset the cumulative privacy loss to 0, letting the
analysts execute additional queries.

The RESET_PRIVACY_BUDGET stored procedure is intended to reset the budget when analysts inadvertently incur privacy loss and want to
start over. Remember that the privacy loss is automatically set to 0 when the privacy budget is refreshed.

Only the cumulative privacy loss associated with analysts in the specified account is reset to 0, even if the privacy budget is associated
with analysts in multiple accounts.

> **Note:**
>
> When calling RESET_PRIVACY_BUDGET, the cumulative privacy loss is not reset
> immediately. It is reset the next time a query incurs privacy loss. As a result,
> if you view the privacy budget after calling the function but before the first
> query incurs privacy loss, the cumulative privacy loss will not be 0.

**Example**

Here’s an example of zeroing out the privacy usage count for all users executing queries in the `companyorg.account_123` account:

```sqlexample
CALL SNOWFLAKE.DATA_PRIVACY.RESET_PRIVACY_BUDGET(
  'my_policy_db.my_policy_schema.my_policy',
  'analyst_budget',
  'companyorg',
  'account_123');
```

---
title: Working with privacy domains as an administrator
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-privacy-domains-admin.md
section: User Guide
---

# Working with privacy domains as an administrator

A *privacy domain* defines the possible values in a column, similar to a mathematical domain. Snowflake uses a privacy domain to determine
how much noise to introduce into results.

To gain a complete understanding of privacy domains before completing the tasks in this section, see [About privacy domains](differential-privacy-privacy-domains.md).

It is best practice for a data provider to set a privacy domain for all numerical and categorical columns that an analyst might want to act
upon *before* distributing data to them.

## Choosing a privacy domain

A privacy domain defines *possible* values in a column, not necessarily *actual* values. You can narrow or expand a privacy domain as needed
so that it doesn’t contain actual values. For example, you can do either of the following:

* Define a broader list to obscure exact values. Because an analyst can view the privacy domain, you might not want to expose the exact
  contents of a column. For example, suppose a column contains a subset of zip codes. You might want to expand the privacy domain to include
  all possible zip codes, thereby obscuring whether a particular zip code is in the dataset.
* Define a narrower range to obscure the presence of an outlier value. For example, if most values are between 1 and 50, you might not want a
  value of 100 to be included in an average because the analyst could deduce the presence of 100 because the average is unusually high.

For information about how values outside a privacy domain are treated, see [Values outside a privacy domain](differential-privacy-privacy-domains.md).

> **Important:**
>
> Anyone with privileges to query a privacy-protected table has the ability to view the privacy domain for a column in that table, so
> choose your privacy domains carefully.
>
> While most fields should have a privacy domain, there are important exceptions. For example, unique identifier fields like user ID, email,
> credit card numbers, and Social Security numbers should *not* have a privacy domain. Users can see the exact privacy domain, and you
> usually don’t want an analyst to know whether a particular identifier exists in the dataset.
>
> In contrast, privacy domains should contain the actual values for identifier fields when they’re not unique to an individual entity and
> whose possible values are publicly known, such as zip codes, ICD codes in healthcare data, and NAICS codes.

## Setting a privacy domain

You define a privacy domain as either a range of values with a minimum and maximum or as an enumerated list of values. In general, the type
of privacy domain is based on the data type of the column. You cannot set a privacy domain on a column if its data type is not in the
following list.

| Data type | Privacy domain type |
| --- | --- |
| [Numeric](../../sql-reference/data-types-numeric.md)  [Date & time](../../sql-reference/data-types-datetime.md) | Range |
| [Strings](../../sql-reference/data-types-text.md) | Enumerated list |

To set, alter, or drop a privacy domain, you need the OWNERSHIP privilege on a table. You can set a privacy domain
when doing the following:

* Creating a table.
* Adding a new column to an existing table.
* Modifying an existing column of a table. If the existing column already has a
  privacy domain, the new domain replaces the old one.

For each of these methods, the syntax of the new privacy domain is the same.

> **Note:**
>
> When a table is dropped, its privacy domains are also be dropped. This applies to a CREATE OR REPLACE command as well.

### Privacy domain syntax

The syntax of creating a privacy domain is:

```sqlsyntax
PRIVACY DOMAIN
  {
      [ BETWEEN ( <lo_value>, <hi_value> ) ]
    | [ IN ( '<value1>, '<value2>', ... ) ]
    | [ REFERENCES <table_name>( <col_name> ) ]
  }
```

#### Parameters

A single parameter must be specified.

`BETWEEN ( lo_value, hi_value )`
:   Creates a privacy domain that is the range of possible values in the column, where `lo_value` is the minimum value and
    `hi_value` is the maximum value.

`IN ( 'value1', 'value2', ... )`
:   Creates a privacy domain that is an enumerated list of the specified values.

    The `IN` parameter accepts a maximum of 50 values, each of which can contain a maximum of 100 characters. If you need to specify an
    enumerated list of greater than 50 values, use the `REFERENCES` parameter.

`REFERENCES table_name( col_name )`
:   Creates a privacy domain that is an enumerated list consisting of the values contained in the column of a table.

    The user making differentially private queries against a table with a REFERENCES privacy domain must have SELECT privileges to the table
    that contains the column referenced in the privacy domain. This means that if you share a privacy-protected table that references another,
    it is best to share the referenced table in the same share.

    The privacy domain can reference itself; however, you must be careful when using this capability. If the privacy domain references its own
    column, the enumerated list contains all *actual* values in the column, not all *possible* values in the column, which can expose private
    information. For example, if the privacy domain of a `zipcode` column references itself, then the analyst will know with absolute
    certainty whether a particular zip code is in the dataset when they view the privacy domain.

    > **Note:**
    >
    > You cannot define a privacy domain that references itself when creating the table for the first time. First create the table, then
    > set the privacy domain with a separate command.

    The column being referenced can contain 16,384 unique values.

#### Set a privacy domain when creating a new table

The syntax to set a privacy domain for a column when using the [CREATE TABLE](../../sql-reference/sql/create-table.md) command to create a table is:

```sqlsyntax
CREATE TABLE <table_name>
  ( <col_name> <col_type> PRIVACY DOMAIN
    {
        [ BETWEEN ( <lo_value>, <hi_value> ) ]
      | [ IN ( '<value1>', '<value2>', ... ) ]
      | [ REFERENCES <table_name>( <col_name> ) ]
    }
  )
```

For more information, see Privacy domain syntax.

### Set a privacy domain when adding a new column

The syntax to set a privacy domain when using the [ALTER TABLE](../../sql-reference/sql/alter-table.md) to add a new column to an existing table is:

```sqlsyntax
ALTER TABLE <table_name>
  ADD COLUMN <col_name> <col_type> PRIVACY DOMAIN
    {
        [ BETWEEN ( <lo_value>, <hi_value> ) ]
      | [ IN ( '<value1>', '<value2>', ... ) ]
      | [ REFERENCES <table_name>( <col_name> ) ]
    }
```

For more information, see Privacy domain syntax.

### Set a privacy domain by modifying a column

The syntax to set a privacy domain for an existing column of a table using the [ALTER TABLE … ALTER COLUMN](../../sql-reference/sql/alter-table-column.md) command is:

```sqlsyntax
ALTER TABLE <table_name>
  { ALTER | MODIFY } COLUMN <col1_name> SET PRIVACY DOMAIN
    {
        [ BETWEEN ( <lo_value>, <hi_value> ) ]
      | [ IN ( '<value1>', '<value2>', ... ) ]
      | [ REFERENCES <table_name>( <col_name> ) ]
    }
```

For more information, see Privacy domain syntax.

## Modify a privacy domain

The syntax for modifying an existing privacy domain is identical to creating a new privacy domain on an existing column. An ALTER TABLE .. ALTER COLUMN … SET PRIVACY DOMAIN command replaces the old privacy
domain with the new one.

## Remove a privacy domain

The syntax for using the [ALTER TABLE … ALTER COLUMN](../../sql-reference/sql/alter-table-column.md) command to remove a privacy domain from a column is:

```sqlsyntax
ALTER TABLE <table_name>
  { ALTER | MODIFY } COLUMN <col1_name> UNSET PRIVACY DOMAIN
```

## View a privacy domain

To view the privacy domains of a privacy-protected table or view, execute the [DESCRIBE TABLE](../../sql-reference/sql/desc-table.md) or
[DESCRIBE VIEW](../../sql-reference/sql/desc-view.md) command. The privacy domain for a column appears in the PRIVACY_DOMAIN column of the output.

You need the SELECT privilege on a privacy-protected table to view its privacy domains.

### Interpreting the privacy domain object

A privacy domain for a column is returned as a JSON object. The `domain_type` field of the JSON object indicates whether the privacy
domain is a range of values or an enumerated list. The remaining fields in the object are dependent on the value of the `domain_type`
field.

The JSON object for a privacy domain can include the following fields:

`domain_type`
:   Indicates the type of privacy domain.

    `BETWEEN`
    :   The privacy domain is a range of the possible values that might be in the column.

    `IN`
    :   The privacy domain is an enumerated list of the possible values that might be in the column.

    `REFERENCES`
    :   The privacy domain is an enumerated list of the possible values that might be in the column. This list comes
        from the column of the same table or another table. To view the enumerated list of the privacy domain, query the contents of the
        referenced column.

`low`
:   When `domain_type = BETWEEN`, specifies the minimum value in the range of possible values.

`high`
:   When `domain_type = BETWEEN`, specifies the maximum value in the range of possible values.

`values`
:   When `domain_type = IN`, specifies the enumerated list of possible values, structured as an array.

`database`
:   When `domain_type = REFERENCES`, specifies the database that contains the column that Snowflake references to build the enumerated
    list of possible values.

`schema`
:   When `domain_type = REFERENCES`, specifies the schema that contains the column that Snowflake references to build the enumerated
    list of possible values.

`table`
:   When `domain_type = REFERENCES`, specifies the table that contains the column that Snowflake references to build the enumerated
    list of possible values.

`column`
:   When `domain_type = REFERENCES`, specifies the column that Snowflake references to build the enumerated list of possible values.
    To view the enumerated list of the privacy domain, query the contents of this column.

---
title: Working with privacy domains as an analyst
source: https://docs.snowflake.com/en/user-guide/diff-privacy/differential-privacy-privacy-domains-analyst.md
section: User Guide
---

# Working with privacy domains as an analyst

A *privacy domain* defines the possible values in a column, similar to a mathematical domain. Snowflake uses a privacy domain to determine
how much noise to introduce into results.

To gain a complete understanding of privacy domains before completing the tasks in this section, see [About privacy domains](differential-privacy-privacy-domains.md).

If the data provider followed best practices, most numerical and categorical columns in a privacy-protected table have a privacy domain.
If the data provider didn’t set one on a column that you want to aggregate or use in a GROUP BY clause, you need to shape your query
so that it includes techniques that implicitly specify a privacy domain for that column.
Privacy domains that the data provider set on the table can also be lost based on operations done on the table. For example, if you aggregate
a field in a subquery with GROUP BY, the system might not be able to derive a privacy domain due to privacy constraints.

You can also write your query to narrow a privacy domain set by the data provider. This override can help improve the results of your
aggregation.

> **Note:**
>
> To meet the [requirements of joining](differential-privacy-analyst.md) with a privacy-protected table, an analyst might need to
> define a privacy domain for a column of their own table, even if it is not privacy-protected. These privacy domains are defined at the
> table level, and apply to all queries against the table. If you are an administrator for the analyst and need to specify a privacy domain
> for the column of one of your tables, see [Setting a privacy domain](differential-privacy-privacy-domains-admin.md).

## Viewing privacy domains

It’s useful to view the privacy domains of a privacy-protected table before querying the table. Checking the privacy domains for each
column can help in the following ways:

* Determine whether the data provider set a privacy domain for a column.
* Determine the possible values found in the column, which can help you improve your analysis. For example, if the privacy domain is a
  range of possible values found in the column, you can determine the minimum and maximum of the range.
* Investigate why you’re getting more [noise](differential-privacy-overview.md) in your results than expected. You can identify
  whether there are outlier values that aren’t important to your analysis, and remove those values from your aggregation to
  improve results.

To see whether a column has a privacy domain and, if it does, determine the type and possible values of the domain, see [View a privacy domain](differential-privacy-privacy-domains-admin.md).

## Specifying a privacy domain

This section describes the techniques an analyst can use to set a privacy domain for the duration of a query. It summarizes how the
structure of a query specifies a privacy domain for a column.

### Specify a privacy domain for string columns

Filtering on a string column using a WHERE clause specifies a privacy domain for it. The privacy domain consists of the values that match
the filter. For example, queries specify a privacy domain for the `product` column if they include the following clauses:

> ```sqlexample
> WHERE product = 'hackeysack' OR product = 'frisbee'
> ```
>
> ```sqlexample
> WHERE product IN ('hackeysack', 'frisbee')
> ```

The privacy domain is an enumerated list consisting of `hackeysack` and `frisbee`.

If the data provider already set a privacy domain on the `product` column, Snowflake uses the intersection of the two privacy domains for
the duration of the query. For information, see [Interaction between admin-specified and analyst-specified privacy domains](differential-privacy-privacy-domains.md).

Values outside of the privacy domain for string columns are [treated as NULL](differential-privacy-privacy-domains.md).

### Specify a privacy domain for numeric, date, and time columns

You can use filtering clauses or column transformations to specify a privacy domain for a numeric, date, or time column. These query
techniques specify a privacy domain that’s a range of possible values.

You can use the following techniques to specify a privacy domain for a numeric, date, or time column:

WHERE clause
:   For example:

    ```sqlexample
    WHERE a < 10 AND a >= 0
    ```

    The specified privacy domain of the column `a` is between 0 and 10.

    If the data provider already set a privacy domain on the `a` column, Snowflake uses the intersection of the two privacy domains
    for the duration of the query. For information, see [Interaction between admin-specified and analyst-specified privacy domains](differential-privacy-privacy-domains.md).

    Using a filter removes values that fall outside the privacy domain, meaning these values are ignored when calculating aggregations.
    For more information, see [Numeric, date, and time](differential-privacy-privacy-domains.md).

GREATEST and LEAST column transformations
:   For example:

    ```sqlexample
    GREATEST(LEAST(a, 100), 0) AS clamped_a
    ```

    The specified range of the privacy domain is between 0 and 100.

    If the data provider already set a privacy domain on the `a` column, Snowflake uses the intersection of the two privacy domains
    for the duration of the query. For information, see [Interaction between admin-specified and analyst-specified privacy domains](differential-privacy-privacy-domains.md).

    If you’re narrowing a privacy domain set by the data provider, you can use just one of the GREATEST or LEAST transformations to decrease
    the maximum or increase the minimum while keeping the other end of the range the same as the privacy domain defined by the data provider.

    Values in the column that are outside of the privacy domain are [clamped](differential-privacy-privacy-domains.md),
    meaning they are treated as if they are the nearest value in the domain (the minimum or maximum value).

#### Narrowing a privacy domain to improve results

Snowflake must introduce enough [noise](differential-privacy-overview.md) to hide exact values within a privacy domain. If the
privacy domain includes values that are outliers from most of the data in the column, Snowflake must increase the noise to obscure the
presence of those values. Overriding a privacy domain to narrow its range can reduce noise because Snowflake no longer needs to obscure
the presence of values that are not important to your analysis.

The technique you use to narrow a privacy domain affects how your aggregations are calculated. Your choice depends upon what is important
to your analysis.

* If you use a filter (WHERE clause) to narrow the privacy domain, values
  outside of the domain are ignored when calculating aggregations.

  Using a filter is the preferred technique when you think the outlier values of a privacy domain are due to data quality issues, or if
  these values are not relevant to your query. Excluding the outlier values from the privacy domain can retain the integrity of your
  analysis while significantly reducing the noise introduced into your results.
* If you use a column transformation, values in the column
  that are outside of the domain are [clamped](differential-privacy-privacy-domains.md), meaning they are treated as if
  they are the nearest value in the domain (the minimum or maximum value).

  Using a column transformation can improve your analysis even if you think the outlier values are not data quality issues. For example, if
  you are taking the average of values, clamping outlier values using a column transformation might improve your analysis.

> **Note:**
>
> If your query includes highly selective filters that target a limited number of records in a dataset, the relative amount of noise
> actually increases because Snowflake must ensure that you cannot use your results to identify an individual.

---
title: Working with resource monitors
source: https://docs.snowflake.com/en/user-guide/resource-monitors.md
section: User Guide
---

# Working with resource monitors

A *resource monitor* can help control costs and avoid unexpected credit usage caused by running warehouses. A
[virtual warehouse](warehouses-overview.md) consumes Snowflake credits while it runs. You can use a resource monitor
to monitor credit usage by virtual warehouses and the cloud services needed to support those warehouses. You can also set up a resource
monitor to suspend a user-managed virtual warehouse when it reaches a credit limit.

> **Important:**
>
> Resource monitors work for warehouses only. You can’t use a resource monitor to track spending associated with serverless
> features and AI services. To monitor credit consumption by these features, use a [budget](budgets.md) instead.

The number of credits consumed depends on the size of the warehouse and how long it runs. For more information on warehouse credit usage,
see [Virtual warehouse credit usage](cost-understanding-compute.md).

Credit usage limits can be set for a specified interval or date range. When a limit is reached and/or reaches a specified threshold,
the resource monitor can trigger various actions, such as sending alert notifications and/or suspending user-managed warehouses.

Only users with the ACCOUNTADMIN role can create a resource monitor, but an account administrator can grant privileges to other roles
to allow other users to view and modify resource monitors.

## Resource monitor properties

A resource monitor is a first-class object in Snowflake, consisting of the following properties:

### Credit quota

Credit quota specifies the number of Snowflake credits allocated to the monitor for the specified frequency interval. Any number can be specified.

In addition, Snowflake tracks the *used credits/quota* within the specified frequency interval by all warehouses assigned to the monitor.
At the specified interval, this number resets to `0`.

Credit quota accounts for credits consumed by both user-managed virtual warehouses and virtual warehouses used by cloud services.

For example, your resource monitor limit is set at 1000 credits. If your warehouse consumes 700 credits, and cloud services consume 300
credits within a specified interval or date range, an alert will be triggered.

> **Note:**
>
> Resource monitor limits do not take into account the daily 10% adjustment for cloud services. Snowflake uses all credit consumption by
> the cloud services layer to calculate whether a limit has been reached, even if that consumption is never billed. For more information
> about how cloud services credits and adjustments are calculated, see [Understanding billing for cloud services usage](cost-understanding-compute.md).
>
> For instructions on how to view your cloud services credit usage, see [Exploring compute cost](cost-exploring-compute.md).

### Monitor type

This property specifies whether the resource monitor is used to monitor your account or a specific set of individual warehouses:

* An *account monitor* monitors the credit usage of all the warehouses in the account. An account can only have one account monitor.
* A *warehouse monitor* monitors the credit usage of the warehouses assigned to the resource monitor. An account can have multiple
  warehouse monitors.

  A warehouse monitor can have one or more warehouses assigned to it, but each warehouse can only be assigned to one resource monitor.

If this property is not set, the resource monitor doesn’t monitor any credit usage. It simply remains dormant.

For more information, see Assignment of resource monitors.

### Schedule

The default schedule for a resource monitor specifies that it starts monitoring credit usage immediately and the used credits reset to
`0` at the beginning of each calendar month (i.e. the start of the standard Snowflake billing cycle).

However, you can optionally customize the schedule for a resource monitor using the following properties:

Frequency:
:   The interval at which the used credits reset relative to the specified start date.

    Supported values:

    * Daily
    * Weekly
    * Monthly
    * Yearly
    * Never (used credits never reset; assigned warehouses continue using credits until the credit quota is reached)

Start:
:   Date and time (i.e. timestamp) when the resource monitor starts monitoring the assigned warehouses.

    Supported values:

    * Immediately (i.e. current timestamp)
    * Later (i.e. any future timestamp)

    In addition, Snowflake uses this date to determine when to reset the used credits, based on the specified frequency. Note, however, that
    regardless of the time specified in the start date and time, resource monitors reset at 12:00 AM UTC. For example, if the start is
    15-July-2019 (Monday) at 8:00 AM:

    * Frequency = Monthly: Used credits reset at 12:00 AM UTC on the 15th of each following month.
    * Frequency = Weekly: Used credits reset at 12:00 AM UTC on each following Monday.

    Note that, if you specify the last day of a month as the start date, Snowflake resets the used credits on the last day of all following
    months, regardless of the number of days in the month. For example, if you set the start date to January 31, Snowflake subsequently resets
    the used credits for the resource monitor on February 28 (or February 29 in a leap year), March 31, April 30, and so on.

End:
:   Date and time (i.e. timestamp) when Snowflake suspends the warehouses associated with the resource monitor, regardless of whether the
    used credits reached any of the thresholds defined for the resource monitor’s actions (see the next section in this topic).

    Supported values: Any future timestamp.

    Note that this property is not commonly used.

> **Important:**
>
> If you choose to customize the schedule for a resource monitor, the frequency is relative to the specified start date, which is
> different than the default schedule.
>
> Also, if you specify a frequency, you must also specify a start date and time, and vice versa (i.e. you cannot set one without setting the
> other).

### Actions

Also referred to as *triggers*, each action specifies a threshold, as a percentage of the credit quota for the resource monitor, and the
action to perform when the threshold is reached within the specified interval. Note that actions support thresholds greater than `100`.

Resource monitors support the following actions:

Notify & Suspend:
:   Send a notification and suspend all assigned warehouses after all statements being executed by the warehouse(s) have completed.

Notify & Suspend Immediately:
:   Send a notification and suspend all assigned warehouses immediately, which cancels any statements being executed by the warehouses
    at the time.

Notify:
:   Perform no action on warehouses, but send a notification.

Notifications are sent to all account administrators with notifications enabled. Email notifications for resource monitors that monitor
warehouses are also sent to any non-administrator user that is enabled
to receive those notifications.

> **Note:**
>
> Non-administrator users can only receive email notifications for *warehouse* monitors.

Each resource monitor can have the following actions:

* One **Suspend** action.
* One **Suspend Immediate** action.
* Up to five **Notify** actions.

> **Note:**
>
> A resource monitor must have at least one action defined; if no actions have been defined, nothing happens when the used credits reach
> the threshold.

## Assignment of resource monitors

A single monitor can be set at the account level to control credit usage for all warehouses in your account.

In addition, a one or more warehouses can be assigned to a resource monitor, thereby controlling the credit usage for each assigned
warehouse. Note, however, that a warehouse can be assigned to only a single resource monitor below the account level.

The following diagram illustrates a scenario in which one resource monitor is set at the account level and individual warehouses are
assigned to two other resource monitors:

Based on this diagram:

* The credit quota for the entire account is 5000 for the interval (month, week, etc.), as controlled by Resource Monitor 1; if this quota
  is reached within the interval, the actions defined for the resource monitor (**Suspend**, **Suspend Immediate**, etc.) are enforced for
  all five warehouses.
* Warehouse 3 can consume a maximum of 1000 credits within the interval.
* Warehouse 4 and 5 can consume a maximum combined total of 2500 credits within the interval.

Note that the actual credits consumed by Warehouses 3, 4, and 5 may be less than their quotas if the quota for the account is reached first.

> **Important:**
>
> * An account-level resource monitor does not override resource monitor assignment for individual warehouses. If either the account
>   resource monitor or the warehouse resource monitor reaches its defined threshold and a suspend action has been defined, the warehouse is
>   suspended.
> * An account-level resource monitor does not control credit usage by the Snowflake-provided compute resources for serverless features (for
>   example, Snowpipe, automatic reclustering, and materialized views).

## Warehouse suspension and resumption

The used credits for a resource monitor reflect the sum of credits consumed by all assigned warehouses within the specified interval,
along with the cloud services used to support those warehouses during the same interval. If a monitor has a **Suspend** or **Suspend
Immediately** action defined and its used credits reach the threshold for the action, any warehouses assigned to the monitor are suspended
and cannot be resumed until one of the following conditions is met:

* The next interval, if any, starts, as dictated by the start date for the monitor.
* The credit quota for the monitor is increased.
* The credit threshold for the suspend action is increased.
* The warehouses are no longer assigned to the monitor.
* The monitor is dropped.

A warehouse-level resource monitor can monitor, but cannot suspend, credit usage by cloud services. After a virtual warehouse is
suspended, subsequent queries run against that warehouse can still result in additional cloud services costs. For more information
about credit usage for cloud services, see [Cloud service credit usage](cost-understanding-compute.md).

> **Tip:**
>
> Resource monitors are not intended for strictly controlling consumption on an hourly basis; they are intended for tracking and
> controlling credit consumption per interval (day, week, month, etc.). Also, they are not intended for setting precise limits on credit
> usage (i.e. down to the level of individual credits). For example, when credit quota thresholds are reached for a resource monitor, the
> assigned warehouses may take some time to suspend, even when the action is **Suspend Immediate**, thereby consuming additional credits.
>
> If you wish to strictly enforce your quotas, we recommend the following:
>
> * Utilize buffers in the quota thresholds for actions (e.g. set a threshold to 90% instead of 100%).
>
>   This will help ensure that your credit usage doesn’t exceed the quota.
> * To more strictly control credit usage for individual warehouses, assign only a single warehouse to each resource monitor.
>
>   When multiple warehouses are assigned to the same resource monitor, they share the same quota thresholds, which may result in credit
>   usage for one warehouse impacting the other assigned warehouses.

## Resource monitor notifications

When a resource monitor reaches the threshold for an action, it generates a notification
similar to the following notification:

```output
Resource Monitor MY_ACCOUNT_MONITOR has reached 50% of its MONTHLY
quota of 500 credits which has triggered a <action> action.
```

The `<action>` is one of the following actions:

* NOTIFY
* SUSPEND
* SUSPEND_IMMEDIATE

Notification behavior depends on the type of resource monitor and whether or not notifications
are enabled for an individual user. Notifications are sent as follows:

* For *warehouse* monitors, a notification is sent to the following users:

  + All account administrators who have enabled resource monitor notifications.
  + Non-administrator users in the notification list for the resource monitor.
* For *account* monitors, a notification is sent to the following users:

  + The account administrator with the OWNERSHIP privilege on the resource monitor if they have enabled notifications.
  + All account administrators who have enabled notifications using Snowsight.
  > **Note:**
  >
  > A non-administrator user can’t be added to the notification list for an *account* monitor.

> **Important:**
>
> Resource monitor notifications can be sent by email or in Snowsight, but are disabled by default. You must set up
> notifications before they are sent. Users that do not have the ACCOUNTADMIN role can only be sent email notifications.
>
> To enable notifications, see Enabling receipt of notifications.

## DDL for resource monitors

Snowflake provides the following DDL commands for creating and using/managing resource monitors:

* [CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md)
* [ALTER RESOURCE MONITOR](../sql-reference/sql/alter-resource-monitor.md)
* [SHOW RESOURCE MONITORS](../sql-reference/sql/show-resource-monitors.md)
* [DROP RESOURCE MONITOR](../sql-reference/sql/drop-resource-monitor.md)

In addition, the following DDL commands can be used to assign a resource monitor to a warehouse and determine whether a warehouse is assigned to
a monitor:

* [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) or [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md)
* [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md)

## Access control privileges for resource monitors

By default, resource monitors can only be created by account administrators and, therefore, can only be viewed and maintained by them.

However, roles that have been granted the following privileges on specific resource monitors can view and modify the resource monitor as
needed using SQL:

* MONITOR
* MODIFY

For more information, see [Access control privileges](security-access-control-privileges.md) and [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md).

Note that only account administrators can view and manage resource monitors via the Snowsight.

## Enabling receipt of notifications

Before you can receive any notifications from resource monitors, you must enable notifications in the web interface and verify your email.

Snowsight:
:   To enable notifications, follow these steps:

    1. Verify your email address if you haven’t already done so. For instructions on how to verify your email address, see
       [Verify your email address](ui-snowsight-profile.md).
    2. Select your username, then select Profile
    3. For Notifications, select Enable notifications from resource monitors.

       > **Note:**
       > * If you haven’t verified your email address, the Notifications option isn’t available until you verify your email.
       > * In Notifications, the Enable notifications from resource monitors option is only available to users with
       >   the ACCOUNTADMIN role in the following cases:
       >
       >   + If there is a resource monitor assigned to the account and the account administrator has the OWNERSHIP privilege
       >     on the account monitor.
       >   + If there are warehouse monitors in the account. All account administrators can receive notifications for warehouse monitors.

## Creating resource monitors

Resource monitors can be created through either the web interface or SQL; however, only account administrators (i.e. users with the
ACCOUNTADMIN role) can create resource monitors.

A resource monitor can’t be assigned more than 500 warehouses.

> **Important:**
>
> You must assign at least one warehouse to a resource monitor or set the monitor at the account level for it to begin
> monitoring/tracking credit usage:
>
> * In the web interface, you are required to do this at creation time.
> * In SQL, you must create the resource monitor first, then assign one or more warehouses to it by executing [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) as a separate,
>   additional step.
>
> Also, to receive notifications when resource monitor actions are triggered, you must
> enable notifications.

### Creating a resource monitor with a default schedule

You can create a resource monitor that uses the default schedule using the web interface or SQL.

> **Note:**
>
> Only users with the ACCOUNTADMIN role can create resource monitors.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Admin » Cost management.
    3. Select Resource Monitors, then select + Resource Monitor.
    4. For Name, enter a name for the resource monitor.
    5. For Credit Quota, enter the number of credits for each specified interval.
    6. Select the Monitor Type. Choose Account to create an account monitor or
       choose Warehouse to select the warehouses to monitor.
    7. For Actions, choose which notifications to enable by entering a threshold next to the option. You must select at
       least one option.

       Select Add to create additional notifications. You can specify up to five notify actions.

SQL:
:   In SQL, this task is performed in two steps:

    1. Execute a [CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md) command, but do not specify any scheduling properties.
    2. Execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) statement to assign warehouses to the resource monitor or
       an [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) statement to set the resource monitor for the account.

    For example:

    * To create a monitor that starts monitoring immediately, resets at the beginning of each month, and suspends the assigned warehouse
      when the used credits reach 100% of the credit quota:

      > ```sqlexample
      > USE ROLE ACCOUNTADMIN;
      >
      > CREATE OR REPLACE RESOURCE MONITOR limit1 WITH CREDIT_QUOTA=1000
      >   TRIGGERS ON 100 PERCENT DO SUSPEND;
      >
      > ALTER WAREHOUSE wh1 SET RESOURCE_MONITOR = limit1;
      > ```

      The SUSPEND action waits for currently executing queries to finish before suspending the warehouse. A query might start before the
      threshold is reached and complete after the SUSPEND action is triggered. In this case, the warehouse continues to consume credits
      even after the quota is reached.
    * To create a similar monitor that suspends at 90% and suspends immediately at 100% after the quota has been reached:

      > ```sqlexample
      > USE ROLE ACCOUNTADMIN;
      >
      > CREATE OR REPLACE RESOURCE MONITOR limit1 WITH CREDIT_QUOTA=1000
      >   TRIGGERS ON 90 PERCENT DO SUSPEND
      >            ON 100 PERCENT DO SUSPEND_IMMEDIATE;
      >
      > ALTER WAREHOUSE wh1 SET RESOURCE_MONITOR = limit1;
      > ```

      In this example, a notification is sent and the assigned warehouses are suspended when 90% of the credit quota is reached.
      Currently executing queries complete, but the resource monitor prevents the warehouses from executing any new queries.
      If the assigned warehouses reach 100% of the credit quota, a notification is sent and the warehouses are suspended immediately,
      canceling all currently executing queries. This prevents all warehouses in the account from consuming credits.
    * To create a monitor that is similar to the first example, but lets the assigned warehouse exceed the quota by 10% and also includes two
      notification actions to alert account administrators as the used credits reach the halfway and three-quarters points for the quota:

      > ```sqlexample
      > USE ROLE ACCOUNTADMIN;
      >
      > CREATE OR REPLACE RESOURCE MONITOR limit1 WITH CREDIT_QUOTA=1000
      >    TRIGGERS ON 50 PERCENT DO NOTIFY
      >             ON 75 PERCENT DO NOTIFY
      >             ON 100 PERCENT DO SUSPEND
      >             ON 110 PERCENT DO SUSPEND_IMMEDIATE;
      >
      > ALTER WAREHOUSE wh1 SET RESOURCE_MONITOR = limit1;
      > ```

      In this example:

      + When 50% and 75% usage is reached, an alert notification is sent to all account administrators who have enabled notifications, but no
        other actions are performed.
      + When 100% usage is reached, the assigned warehouse is suspended.
      + If the warehouse is still running when 110% usage is reached, it is suspended immediately.

### Creating a resource monitor with a custom schedule

You can create a resource monitor that uses a schedule other than the default using the web interface or SQL.

> **Note:**
>
> Only users with the ACCOUNTADMIN role can create resource monitors.

Complete the following steps to create a resource monitor with a custom schedule:

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Admin » Cost management.
    3. Select Resource Monitors, then select + Resource Monitor.
    4. For Name, enter a name for the resource monitor.
    5. For Credit Quota, enter the number of credits for each specified interval.
    6. Select the Monitor Type. Choose Account to create an account monitor or
       choose Warehouse to select the warehouses to monitor.
    7. Select Schedule » Customize to set a custom schedule for the specified interval. You can skip this step to use the
       default schedule.

       You can set a custom start and end date using Custom Start Date and Custom End Date, or specify a date range
       by using the Range tab.

       > **Note:**
       >
       > If you choose to end monitoring at a specified date and time, all assigned warehouses are suspended on that date and time even
       > if the credit quota has not been reached. A notification is sent when this occurs that states the resource monitor has reached
       > a percentage of its quota and has triggered a suspend immediate action. The percentage of the quota reflects the number
       > of used credits in the current interval up to the end date and might not be a threshold you specified.

       You can also customize the specified interval for monitoring. Select Resets and you can select from the following periodic
       intervals at which to reset the credit quota:

       * Monthly
       * Daily
       * Weekly
       * Yearly
       * Never
    8. For Actions, choose which notifications to enable by entering a threshold next to the option. You must select at
       least one option.

       Select Add to create additional notifications. You can specify up to five notify actions.
    9. For Actions and Notifications, enable a given action or notification by entering a threshold next to the option.
       You must select at least one option.

       Select +Add more notification thresholds to create additional notifications. You can specify up to five notify actions.

SQL:
:   Execute a [CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md) command, with one or more of the following scheduling properties:

    * FREQUENCY
    * START_TIMESTAMP
    * END_TIMESTAMP

    For example:

    * To create an account-level resource monitor that starts immediately (based on the current timestamp), resets monthly on the same day,
      has no end date or time, and suspends the assigned warehouse when the used credits reach 100% of the quota:

      > ```sqlexample
      > USE ROLE ACCOUNTADMIN;
      >
      > CREATE OR REPLACE RESOURCE MONITOR limit1 WITH CREDIT_QUOTA=1000
      >     FREQUENCY = MONTHLY
      >     START_TIMESTAMP = IMMEDIATELY
      >     TRIGGERS ON 100 PERCENT DO SUSPEND;
      >
      > ALTER WAREHOUSE wh1 SET RESOURCE_MONITOR = limit1;
      > ```
    * To create a resource monitor that starts at a specific date and time in the future, resets weekly on the same day, has no end date or
      time, and performs two different suspend actions at different thresholds on two assigned warehouses:

      > ```sqlexample
      > USE ROLE ACCOUNTADMIN;
      >
      > CREATE OR REPLACE RESOURCE MONITOR limit1 WITH CREDIT_QUOTA=2000
      >     FREQUENCY = WEEKLY
      >     START_TIMESTAMP = '2019-03-04 00:00 PST'
      >     TRIGGERS ON 80 PERCENT DO SUSPEND
      >              ON 100 PERCENT DO SUSPEND_IMMEDIATE;
      >
      > ALTER WAREHOUSE wh1 SET RESOURCE_MONITOR = limit1;
      >
      > ALTER WAREHOUSE wh2 SET RESOURCE_MONITOR = limit1;
      > ```

> **Note:**
>
> You cannot change the customized schedule for a resource monitor back to the default. You must drop the monitor and create a new monitor.

## Modifying a resource monitor

You can modify the following properties for an existing resource monitor:

* Increase or decrease the credit quota for the monitor.
* Customize the schedule (frequency, start timestamp, and end timestamp) for the monitor.
* Add or remove actions, or modify the threshold percentages for existing actions.
* If the monitor is monitoring your account, convert it to monitor individual warehouses.
* If the monitor is monitoring individual warehouses:

  + Add or remove warehouses from the list.
  + Convert it to monitor your account.

> **Note:**
>
> Changing any of these properties does not affect the used credits to-date for the monitor. All changes only affect used credits
> after the changes are saved.

Resource monitors can be modified through the web interface or SQL.

> **Note:**
>
> The following privileges are required to modify the properties of a resource monitor:
>
> * To modify the credit quota, schedule, or actions for a resource monitor, a user must use a role with the MODIFY
>   privilege on the resource monitor.
> * To modify the monitor type for a resource monitor from warehouse to account or vice versa, a user must use the
>   ACCOUNTADMIN role.
> * To modify the list of warehouses for a warehouse-level resource monitor, a user must use the ACCOUNTADMIN role.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Admin » Cost management.
    3. Select Resource Monitors, then select a resource monitor.
    4. Select the More menu (…) in top right corner. Select Edit.

SQL:
:   * To change the quota, customize the schedule, or add/remove/modify actions, execute an
      [ALTER RESOURCE MONITOR](../sql-reference/sql/alter-resource-monitor.md) statement.

      For example, to increase the credit quota for resource monitor `limit1` to `3000`, execute the following statement:

      > ```sqlexample
      > ALTER RESOURCE MONITOR limit1 SET CREDIT_QUOTA=3000;
      > ```

      For more examples, see [Examples](../sql-reference/sql/alter-resource-monitor.md).

    * To change the monitor type, execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) or [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) statement.

      For example, to change resource monitor `my_rm` which currently monitors warehouses to monitor
      the account `my_account`, execute the following steps:

      1. Find all the warehouses the resource monitor `my_rm` is monitoring. Check the `resource_monitor` column for
         `my_rm`:

         ```sqlexample
         SHOW WAREHOUSES;
         ```

         Returns the following results:

         ```output
         +--------+-----------+----------+---------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+------------------+---------+----------+--------+-----------+------------+--------+-----------------+
         | name   | state     | type     | size    | running | queued | is_default | is_current | auto_suspend | auto_resume | available | provisioning | quiescing | other | created_on                    | resumed_on                    | updated_on                    | owner        | comment | resource_monitor | actives | pendings | failed | suspended | uuid       | budget | owner_role_type |
         |--------+-----------+----------+---------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+------------------+---------+----------+--------+-----------+------------+--------+-----------------|
         | MY_WH1 | STARTED   | STANDARD | X-Small |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2024-01-17 14:37:36.223 -0800 | 2024-01-17 14:37:36.325 -0800 | 2024-01-17 14:47:49.854 -0800 | MY_ROLE      |         | null             |       0 |        0 |      0 |         1 | 1222706972 | NULL   | ROLE            |
         | MY_WH2 | SUSPENDED | STANDARD | X-Small |       0 |      0 | N          | Y          |          600 | true        |           |              |           |       | 2023-12-20 13:50:50.972 -0800 | 2024-01-17 14:28:39.170 -0800 | 2024-01-17 14:37:57.560 -0800 | ACCOUNTADMIN |         | MY_RM            |       0 |        0 |      0 |         1 | 1222706948 | NULL   | ROLE            |
         | MY_WH3 | SUSPENDED | STANDARD | Small   |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2024-01-17 14:26:26.911 -0800 | 2024-01-17 14:33:39.260 -0800 | 2024-01-17 14:38:31.192 -0800 | ACCOUNTADMIN |         | MY_RM            |       0 |        0 |      0 |         2 | 1222706960 | NULL   | ROLE            |
         +--------+-----------+----------+---------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+------------------+---------+----------+--------+-----------+------------+--------+-----------------+
         ```

         Resource monitor `my_rm` is monitoring two warehouses `my_wh2` and `my_wh3`.
      2. Remove the resource monitor for both warehouses by executing the following statements:

         ```sqlexample
         ALTER WAREHOUSE my_wh2 UNSET RESOURCE_MONITOR;

         ALTER WAREHOUSE my_wh3 UNSET RESOURCE_MONITOR;
         ```
      3. Change the resource monitor to monitor the account by executing the following statement:

         ```sqlexample
         ALTER ACCOUNT my_account SET RESOURCE_MONITOR = my_rm;
         ```

> **Note:**
>
> If a resource monitor has a customized schedule, you cannot change the schedule back to the default. You must drop the monitor and
> create a new monitor.

## Send resource monitor notifications to non-administrator users

Non-administrator users can only receive email notifications for warehouse monitors. Each non-administrator
user must have a verified email address. You can add up to five non-administrator users to a warehouse monitor using the CREATE RESOURCE
MONITOR or ALTER RESOURCE MONITOR command.

For example, to add users `user1` and `user2` to the warehouse monitor `my_warehouse_rm`, execute the following
statement:

```sqlexample
ALTER RESOURCE MONITOR my_warehouse_rm
  SET NOTIFY_USERS = (USER1, USER2);
```

> **Note:**
>
> If any user in the notification list does not have a verified email, the statement fails.

For more information, see the [NOTIFY_USERS parameter](../sql-reference/sql/alter-resource-monitor.md) and
[Usage notes](../sql-reference/sql/alter-resource-monitor.md) in the [ALTER RESOURCE MONITOR](../sql-reference/sql/alter-resource-monitor.md) topic.

To add non-administrator users to the notification list for a resource monitor when you create a resource monitor using SQL, see
[CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md).

## Setting a resource monitor for your account

A resource monitor can be set for your account through the web interface or SQL.

> **Note:**
>
> Only users with the ACCOUNTADMIN role can set a resource monitor to monitor the account.

Snowsight:
:   You can set the monitor type to account when you create a resource monitor. For more information,
    see Creating resource monitors.

SQL:
:   In SQL, this task is performed in two steps:

    1. Use the [CREATE RESOURCE MONITOR](../sql-reference/sql/create-resource-monitor.md) command to create the resource monitor (if it doesn’t already exist).

       If the resource monitor does exist, to change a warehouse level resource monitor to monitor an account, see the
       example in the Modifying a Resource Monitor section.
    2. Use the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the resource monitor you created as the monitor for your account.

       For example, to set the account resource monitor to `my_account_rm`, execute the following statements:

       > ```sqlexample
       > USE ROLE ACCOUNTADMIN;
       >
       > CREATE RESOURCE MONITOR my_account_rm WITH CREDIT_QUOTA=10000
       >   TRIGGERS ON 100 PERCENT DO SUSPEND;
       >
       > ALTER ACCOUNT SET RESOURCE_MONITOR = my_account_rm;
       > ```

    To change the monitor type of an existing resource monitor from a warehouse monitor to an account monitor, see the
    example in the Modifying a Resource Monitor section.

To view whether a resource monitor is set for your account, use the web interface or the [SHOW RESOURCE MONITORS](../sql-reference/sql/show-resource-monitors.md)
command. The `LEVEL` column for a resource monitor displays whether it is set for your account or individual warehouses.

> **Important:**
>
> * An account-level resource monitor only controls the virtual warehouses explicitly created in your account; it does not control credit
>   usage by the Snowflake-provided warehouses for serverless features (for example, [Snowpipe](data-load-snowpipe-intro.md),
>   [Automatic Clustering](tables-auto-reclustering.md), and [materialized views](views-materialized.md)).
> * A warehouse-level resource monitor can monitor, but cannot suspend, credit usage by cloud services. The monitor can only
>   suspend the user-managed virtual warehouses created in your account. After a user-managed virtual warehouse is suspended, subsequent
>   queries run against that warehouse can still result in additional cloud services costs. For more information about credit usage for cloud
>   services, see [Cloud service credit usage](cost-understanding-compute.md).

## Assigning warehouses to a resource monitor

Warehouses can be assigned to an existing resource monitor through the web interface or SQL.

> **Note:**
>
> Only users with the ACCOUNTADMIN role can assign warehouses to resource monitors.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Admin » Cost management.
    3. Select Resource Monitors, then select a resource monitor.
    4. Select the More menu (…) in top right corner. Select Edit.
    5. If Monitor Type is Account, select Warehouse.
    6. Select Warehouse to choose the warehouses to monitor.

SQL:
:   Use the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command to assign a warehouse to a resource monitor.

    For example, to monitor warehouse `my_wh` with the resource monitor `my_rm`, execute the following statement:

    > ```sqlexample
    > ALTER WAREHOUSE my_wh SET RESOURCE_MONITOR = my_rm;
    > ```

## Viewing resource monitors

Resource monitors can be viewed through the web interface or SQL.

Snowsight:
:   1. Sign in to [Snowsight](ui-snowsight-gs.md).
    2. In the navigation menu, select Admin » Cost management.
    3. Select Resource Monitors to see a list of resource monitors for which your role has been granted the MODIFY or MONITOR
       privilege. Account administrator users can see all resource monitors.
    4. Select a resource monitor to view detailed information about resource monitor settings, current credit usage, and a list of
       roles with privileges on the resource monitor.
    5. Select Account  » Resource Monitors.

SQL:
:   Using the ACCOUNTADMIN role or a role that has been granted the MONITOR or MODIFY privilege on the resource monitor, execute
    a [SHOW RESOURCE MONITORS](../sql-reference/sql/show-resource-monitors.md) statement.

    In addition, using any role, you can execute a [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) statement to view the warehouses owned
    by the role (or for which the role has been granted USAGE privilege). The output includes the resource monitor, if any, to which
    the warehouse is assigned.

> **Note:**
>
> For [provider accounts](data-sharing-gs.md) that have created reader accounts, Snowflake provides an additional view,
> [RESOURCE_MONITORS](../sql-reference/account-usage/resource_monitors.md). This view can be used for querying resource monitor usage
> in the provider’s reader accounts. For more information, see [Account Usage](../sql-reference/account-usage.md).

## Replicating resource monitors

Resource monitors can be replicated from a source account to target accounts using a
[replication or failover group](account-replication-intro.md). For more information, see
[Resource monitor replication](account-replication-intro.md).

---
title: Working with search-optimized tables
source: https://docs.snowflake.com/en/user-guide/search-optimization/working-with-tables.md
section: User Guide
---

# Working with search-optimized tables

Search optimization is generally transparent to users. Queries work the same; some are just faster. However, it is
important to be aware of possible effects of other table operations on the search optimization service, or the reverse.

## Modifying the table

A search access path becomes invalid if the default value of a column is changed.

To use search optimization again after a search access path has become invalid, you must
[drop the SEARCH OPTIMIZATION property](cost-estimation.md) and
[add the SEARCH OPTIMIZATION property](enabling.md) back to the table.

A search access path remains valid if you add, drop, or rename a column:

* If you enabled search optimization for an entire table without specifying specific columns, then when you add a column to a table, the
  new column is added to the search access path automatically. However, if you used the ON clause when enabling search optimization for a
  column, new columns are not added automatically.
* When you drop a column from a table, the dropped column is removed from the search access path automatically.
* Renaming a column doesn’t require any changes to the search access path.

If you drop a table, the SEARCH OPTIMIZATION property and search access paths are also dropped. Note that:

* Undropping the table immediately reestablishes search optimization as a property of the table.
* When you drop a table, the search access path has the same data retention period as the table.

If you [drop the SEARCH OPTIMIZATION property](cost-estimation.md) from the table, the search access
path is removed. When you
[add the SEARCH OPTIMIZATION property](enabling.md) back to the table,
the maintenance service needs to recreate the search access path. (There is no way to undrop the property.)

## Cloning the table, schema, or database

If you clone a table, schema, or database, the SEARCH OPTIMIZATION property and search access paths of each table are
also cloned. Cloning a table, schema, or database creates a [zero-copy clone](../tables-storage-considerations.md) of each table
and its corresponding search access paths. However, if the search access path for a table is out-of-date at the time the clone
is created, both the original table and the cloned table incur the maintenance costs for the search optimization service to
update the search access path.

The search access path might be out-of-date if a DML operation significantly modifies a table just before the clone operation. For example,
if an INSERT statement results in a large increase in the size of the original table, the search access path requires maintenance to
reflect this change.

A zero-copy clone isn’t created for search access paths of replicated cloned tables. For more information, see
Working with tables in a secondary database (database replication support).

To avoid or minimize the costs of search optimization maintenance tasks on the cloned table, follow one or both of these steps:

1. If you need to leave search optimization enabled on the cloned table, verify that the search access path is up-to-date *before*
   executing the CREATE TABLE … CLONE statement. Otherwise, skip to the next step.

   In most cases, you can execute a SHOW TABLES statement and check the value in the SEARCH_OPTIMIZATION_PROGRESS column. If the
   column’s value is `100`, the search access path is up-to-date. However, maintenance might be incurred if the search access
   path is being compacted to remove information pertaining to deleted source table data.
2. Disable the search optimization service on the cloned table immediately after the clone is created. For example, to disable
   the search optimization service on table `t1`, execute the following statement:

   ```sqlexample
   ALTER TABLE t1 DROP SEARCH OPTIMIZATION;
   ```

   For more information, see [Search optimization actions (searchOptimizationAction)](../../sql-reference/sql/alter-table.md) in the ALTER TABLE topic.

If you use CREATE TABLE … LIKE to create a new empty table with the same columns as the original table,
the SEARCH OPTIMIZATION property is not copied to the new table.

## Working with tables in a secondary database (database replication support)

If a table in the primary database has the SEARCH OPTIMIZATION property enabled, the property is replicated to the corresponding
table in the secondary database.

Search access paths in the secondary database aren’t replicated but are instead rebuilt automatically. This also applies to
replicated cloned tables. Replication doesn’t create [zero-copy clone](../tables-storage-considerations.md) for cloned search access
paths but fully rebuilds them in the secondary database automatically. Subsequent maintenance on the cloned search access
paths isn’t replicated from the primary database but is performed in the secondary database. This process incurs the same
kinds of costs described in [Search optimization cost estimation and management](cost-estimation.md).

## Sharing tables

Data providers can use [Secure Data Sharing](../data-sharing-intro.md) to share tables that have search optimization
enabled.

When querying shared tables, data consumers can benefit from any performance improvements made by the search optimization service.

## Masking policies and row access policies

The search optimization service is fully compatible with tables that use masking policies and row access policies.

However, when search optimization is enabled, a user who is prevented from seeing a value due to a masking policy or row
access policy might be able to deduce with greater certainty whether that value exists. With or without search
optimization, differences in query latency can provide hints about the presence or absence of data restricted by a
policy, which may constitute a security issue depending on the sensitivity of the data. This effect can be magnified by
search optimization since it can make a query that does not return results even faster.

For example, suppose that a row access policy prevents a user from accessing rows with `country = 'US'`, but the data
does not include rows with `country = 'US'`. Now suppose that search optimization is enabled for the `country` column
and that the user runs a query with `WHERE country = 'US'`. The query returns empty results as expected, but the query
might run faster with search optimization than without. In this case, the user can more easily infer that the data does not
contain any rows where `country = 'US'` based on the time taken to run the query.

---
title: Working with Secure Views
source: https://docs.snowflake.com/en/user-guide/views-secure.md
section: User Guide
---

# Working with Secure Views

This topic covers concepts and syntax for defining views and materialized views as secure.

## Overview of Secure Views

### Why Should I Use Secure Views?

* For a non-secure view, internal optimizations can indirectly expose data.

  Some of the internal optimizations for views require access to the underlying data in the base tables for the view. This access
  might allow data that is hidden from users of the view to be exposed through user code, such as user-defined functions, or other
  programmatic methods. Secure views do not utilize these optimizations, ensuring that users have no access to the underlying data.
* For a non-secure view, the view definition is visible to other users.

  By default, the query expression used to create a standard view, also known as the view definition or text, is visible to users
  in various commands and interfaces. For details, see Interacting with Secure Views (in this topic).

  For security or privacy reasons, you might not wish to expose the underlying tables or internal structural details for a view.
  With secure views, the view definition and details are visible only to authorized users (i.e. users who are granted the role that
  owns the view).

### When Should I Use a Secure View?

Views should be defined as secure when they are specifically designated for data privacy (i.e. to limit access to sensitive data that
should not be exposed to all users of the underlying table(s)).

Secure views should not be used for views that are defined solely for query convenience, such as views created to
simplify queries for which users do not need to understand the underlying data representation. Secure views can execute
more slowly than non-secure views.

> **Tip:**
>
> When deciding whether to use a secure view, you should consider the purpose of the view and weigh the trade-off between data
> privacy/security and query performance.

### How Might Data be Exposed by a Non-secure View?

Using the following widgets example, consider a user who has access to only the red widgets. Suppose the user wonders if any purple
widgets exist and issues the following query:

```sqlexample
SELECT *
    FROM widgets_view
    WHERE 1/iff(color = 'Purple', 0, 1) = 1;
```

If any purple widgets exist, then the IFF() expression returns 0. The division operation then fails due to a division-by-zero error,
which allows the user to infer that at least one purple widget exists.

## Creating Secure Views

Secure views are defined using the SECURE keyword with the standard DDL for views:

* To create a secure view, specify the SECURE keyword in the [CREATE VIEW](../sql-reference/sql/create-view.md) or
  [CREATE MATERIALIZED VIEW](../sql-reference/sql/create-materialized-view.md) command.
* To convert an existing view to a secure view and back to a regular view, set/unset the SECURE keyword in the
  [ALTER VIEW](../sql-reference/sql/alter-view.md) or [ALTER MATERIALIZED VIEW](../sql-reference/sql/alter-materialized-view.md) command.

> **Note:**
>
> In some cases, error messages related to secure views might be redacted. For more information, see
> [Secure objects: Redaction of information in error messages](../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

## Interacting with Secure Views

### Viewing the Definition for Secure Views

The definition of a secure view is only exposed to authorized users (i.e. users who have been granted the role that owns the view). If an
unauthorized user uses any of the following commands or interfaces, the view definition is not displayed:

* [SHOW VIEWS](../sql-reference/sql/show-views.md) and [SHOW MATERIALIZED VIEWS](../sql-reference/sql/show-materialized-views.md) commands.
* [GET_DDL](../sql-reference/functions/get_ddl.md) utility function.
* [VIEWS](../sql-reference/info-schema/views.md) Information Schema view.

However, users that have been granted IMPORTED PRIVILEGES privilege on the SNOWFLAKE database or another shared database have access to secure view definitions via the [VIEWS](../sql-reference/account-usage/views.md) Account Usage view.

Users granted the ACCOUNTADMIN role or the SNOWFLAKE.OBJECT_VIEWER database role can also see secure view definitions via this view. The preferred, least-privileged means of access is the SNOWFLAKE.OBJECT_VIEWER database role.

### Determining if a View is Secure

For non-materialized views, the `IS_SECURE` column in the Information Schema and Account Usage views identifies whether a view is secure.
For example, for aview named `MYVIEW` in the `mydb` database:

> Information Schema:
>
> > ```sqlexample
> > select table_catalog, table_schema, table_name, is_secure
> >     from mydb.information_schema.views
> >     where table_name = 'MYVIEW';
> > ```
>
> Account Usage:
>
> > ```sqlexample
> > select table_catalog, table_schema, table_name, is_secure
> >     from snowflake.account_usage.views
> >     where table_name = 'MYVIEW';
> > ```

(For general information about the differences between INFORMATION_SCHEMA views and ACCOUNT_USAGE views, see
[Differences between Account Usage and Information Schema](../sql-reference/account-usage.md).)

Alternatively, you can use the SHOW VIEWS command to view similar information (note that the view name is case-insensitive):

> ```sqlexample
> SHOW VIEWS LIKE 'myview';
> ```

For materialized views, use the SHOW MATERIALIZED VIEWS command to identify whether a view is secure. For example:

> ```sqlexample
> SHOW MATERIALIZED VIEWS LIKE 'my_mv';
> ```

### Viewing Secure View Details in Query Profile

The internals of a secure view are not exposed in [Query Profile](ui-snowsight-activity.md) (in the web interface). This is the
case even for the owner of the secure view, because non-owners might have access to an owner’s Query Profile.

## Using Secure Views with Snowflake Access Control

View security can be integrated with Snowflake users and roles using the [CURRENT_ROLE](../sql-reference/functions/current_role.md) and
[CURRENT_USER](../sql-reference/functions/current_user.md) context functions. The following example illustrates using roles to control access to the rows of
a table. In addition to the table that contains the data (`widgets`), the example uses an access table (`widget_access_rules`) to
track which roles have access to which rows in the data table:

```sqlexample
CREATE TABLE widgets (
    id NUMBER(38,0) DEFAULT widget_id_sequence.nextval,
    name VARCHAR,
    color VARCHAR,
    price NUMBER(38,0),
    created_on TIMESTAMP_LTZ(9));
CREATE TABLE widget_access_rules (
    widget_id NUMBER(38,0),
    role_name VARCHAR);
CREATE OR REPLACE SECURE VIEW widgets_view AS
    SELECT w.*
        FROM widgets AS w
        WHERE w.id IN (SELECT widget_id
                           FROM widget_access_rules AS a
                           WHERE upper(role_name) = CURRENT_ROLE()
                      )
    ;
```

The WHERE clause limits which widgets each role can see.

Suppose that a user who has access only to red widgets executes the query shown earlier:

```sqlexample
SELECT *
    FROM widgets_view
    WHERE 1/iff(color = 'Purple', 0, 1) = 1;
```

The secure view’s WHERE clause is executed before any WHERE clause in the user’s query. Because purple widgets are excluded
by the view, the user’s query never generates a division-by-zero error.

If the view were not secure, then the Snowflake optimizer could re-order the predicates in the WHERE clauses. This could allow
the predicate in the user’s query to execute first, which would allow the division-by-zero error to occur.

## Best Practices for Using Secure Views

Secure views prevent users from possibly being exposed to data from rows of tables that are filtered by the view. However, there are still
ways that a data owner might inadvertently expose information about the underlying data if views are not constructed carefully. This section
discusses some potential pitfalls to avoid.

To illustrate these pitfalls, this section uses the sample `widgets` tables and view defined in the earlier examples in this topic.

### Sequence-generated Columns

A common practice for generating surrogate keys is to use a sequence or auto-increment column. If these keys are exposed to users who do not
have access to all of the underlying data, then a user might be able to guess details of the underlying data distribution. For example,
`widgets_view` exposes the ID column. If ID is generated from a sequence, then a user of `widgets_view` could deduce the total
number of widgets created between the creation timestamps of two widgets that the user has access to. Consider the following query and result:

> ```sqlexample
> select * from widgets_view order by created_on;
>
> ------+-----------------------+-------+-------+-------------------------------+
>   ID  |         NAME          | COLOR | PRICE |          CREATED_ON           |
> ------+-----------------------+-------+-------+-------------------------------+
> ...
>  315  | Small round widget    | Red   | 1     | 2017-01-07 15:22:14.810 -0700 |
>  1455 | Small cylinder widget | Blue  | 2     | 2017-01-15 03:00:12.106 -0700 |
> ...
> ```

Based on the result, the user might suspect that 1139 widgets (1455 - 315) were created between January 7 and January 15. If this
information is too sensitive to expose to users of a view, you can use any of the following alternatives:

* Do not expose the sequence-generated column as part of the view.
* Use randomized identifiers (e.g. generated by [UUID_STRING](../sql-reference/functions/uuid_string.md)) instead of sequence-generated values.
* Programmatically obfuscate the identifiers.

### Scanned Data Size

For queries containing secure views, Snowflake does not expose the amount of data scanned (either in terms of bytes or micro-partitions)
or the total amount of data. This is to protect the information from users who only have access to a subset of the data. However, users
might still be able to make observations about the quantity of underlying data based on performance characteristics of queries. For example,
a query that runs twice as long might process twice as much data. While any such observations are approximate at best, in some cases it
might be undesirable for even this level of information to be exposed.

In such cases, it is best to materialize data per user/role instead of exposing views on the base data to users. In the case of the
`widgets` table, a table would be created for each role that has access to widgets, which contains only the widgets accessible by
that role, and a role would be granted access to its table. This is much more cumbersome than using a single view, but for extremely
high-security situations, this might be warranted.

### Secure Views and Data Sharing

When using secure views with [Secure Data Sharing](../guides-overview-sharing.md), use the [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) function to authorize users from a specific account to access rows in a base table.

> > **Note:**
> >
> > When using the [CURRENT_ROLE](../sql-reference/functions/current_role.md) and [CURRENT_USER](../sql-reference/functions/current_user.md) functions with secure
> > views that will be shared to other Snowflake accounts, Snowflake returns a NULL value for these functions. The reason is that the owner
> > of the data being shared does not typically control the users or roles in the account with which the view is being shared.

---
title: Working with Subqueries
source: https://docs.snowflake.com/en/user-guide/querying-subqueries.md
section: User Guide
---

# Working with Subqueries

A subquery is a query within another query. Subqueries in a [FROM](../sql-reference/constructs/from.md) or [WHERE](../sql-reference/constructs/where.md)
clause are used to provide data that will be used to limit or compare/evaluate the data returned by the containing query.

## Types of Subqueries

### Correlated vs. Uncorrelated Subqueries

Subqueries can be categorized as *correlated* or *uncorrelated*:

* A correlated subquery refers to one or more columns from outside of
  the subquery. (The columns are typically referenced inside the `WHERE`
  clause of the subquery.) A correlated subquery can be thought of as a filter
  on the table that it refers to, as if the subquery were evaluated on each
  row of the table in the outer query.
* An uncorrelated subquery has no such external column references. It
  is an independent query, the results of which are returned to and used by
  the outer query once (not per row).

For example:

> ```sqlexample
> -- Uncorrelated subquery:
> SELECT c1, c2
>   FROM table1 WHERE c1 = (SELECT MAX(x) FROM table2);
>
> -- Correlated subquery:
> SELECT c1, c2
>   FROM table1 WHERE c1 = (SELECT x FROM table2 WHERE y = table1.c2);
> ```

### Scalar vs. Non-scalar Subqueries

Subqueries can also be categorized as *scalar* or *non-scalar*:

* A scalar subquery returns a single value (one column of one row).
  If no rows qualify to be returned, the subquery returns NULL.
* A non-scalar subquery returns 0, 1, or multiple rows, each of which
  may contain 1 or multiple columns. For each column, if there is no value to
  return, the subquery returns NULL. If no rows qualify to be returned, the
  subquery returns 0 rows (not NULLs).

### Types Supported by Snowflake

Snowflake currently supports the following types of subqueries:

* Uncorrelated scalar subqueries in any place that a value expression can be used.
* Correlated scalar subqueries in [WHERE](../sql-reference/constructs/where.md) clauses.
* EXISTS, ANY / ALL, and IN subqueries in [WHERE](../sql-reference/constructs/where.md) clauses. These subqueries can be correlated or uncorrelated.

## Subquery Operators

[Subquery operators](../sql-reference/operators-subquery.md) operate on nested query expressions. They can be used to compute values that are:

* Returned in a [SELECT](../sql-reference/sql/select.md) list.
* Grouped in a [GROUP BY](../sql-reference/constructs/group-by.md) clause.
* Compared with other expressions in the [WHERE](../sql-reference/constructs/where.md) or [HAVING](../sql-reference/constructs/having.md) clause.

## Differences Between Correlated and Non-Correlated Subqueries

The following query demonstrates an uncorrelated subquery in a [WHERE](../sql-reference/constructs/where.md) clause.
The subquery gets the per capita GDP of Brazil, and the outer query
selects all the jobs (in any country) that pay less than the
per-capita GDP of Brazil. The subquery is uncorrelated because the value
that it returns does not depend upon any column of the outer query. The
subquery only needs to be called once during the entire execution of the
outer query.

> ```sqlexample
> SELECT p.name, p.annual_wage, p.country
>   FROM pay AS p
>   WHERE p.annual_wage < (SELECT per_capita_GDP
>                            FROM international_GDP
>                            WHERE name = 'Brazil');
> ```

The next query demonstrates a correlated subquery in a [WHERE](../sql-reference/constructs/where.md) clause.
The query lists jobs where the annual pay of the job is less than the
per-capita GDP in that country.
This subquery is correlated because it is called once for each row in the
outer query and is passed a value, `p.country` (country name), from the row.

> ```sqlexample
> SELECT p.name, p.annual_wage, p.country
>   FROM pay AS p
>   WHERE p.annual_wage < (SELECT MAX(per_capita_GDP)
>                            FROM international_GDP i
>                            WHERE p.country = i.name);
> ```

> **Note:**
>
> The [MAX](../sql-reference/functions/max.md) aggregate function is not logically necessary in this case because the
> `international_GDP` table has only one row per country; however, because the server doesn’t know that, and because the server
> requires that the subquery return no more than one row, the query uses the aggregate function to force the server to recognize that the
> subquery will return only one row each time that the subquery is executed.
>
> The functions [MIN](../sql-reference/functions/min.md) and [AVG](../sql-reference/functions/avg.md) also work because
> applying either of these to a single value returns that value unchanged.

## Scalar Subqueries

A scalar subquery is a subquery that returns at most one row. A scalar subquery can appear anywhere that a value expression can appear, including
the [SELECT](../sql-reference/sql/select.md) list, [GROUP BY](../sql-reference/constructs/group-by.md) clause, or as an argument to a function in a
[WHERE](../sql-reference/constructs/where.md) or [HAVING](../sql-reference/constructs/having.md) clause.

### Usage Notes

* A scalar subquery can contain only one item in the [SELECT](../sql-reference/sql/select.md) list.
* If a scalar subquery returns more than one row, a runtime error is generated.
* Correlated scalar subqueries are currently supported only if they can be statically determined to return one row (e.g. if the
  [SELECT](../sql-reference/sql/select.md) list contains an aggregate function with no [GROUP BY](../sql-reference/constructs/group-by.md)).
* Uncorrelated scalar subqueries are supported anywhere that a value expression is allowed.
* Subqueries with a correlation inside of [FLATTEN](../sql-reference/functions/flatten.md) are currently unsupported.
* The [LIMIT / FETCH](../sql-reference/constructs/limit.md) clause is allowed only in uncorrelated scalar subqueries.

### Examples

This example shows a basic uncorrelated subquery in a WHERE clause:

> ```sqlexample
> SELECT employee_id
> FROM employees
> WHERE salary = (SELECT max(salary) FROM employees);
> ```

This example shows an uncorrelated subquery in a FROM clause; this basic subquery
returns a subset of the information in the `international_GDP` table.
The overall query lists jobs in “high-wage” countries where the annual pay
of the job is the same as the per_capita_GDP in that country.

> ```sqlexample
> SELECT p.name, p.annual_wage, p.country
>   FROM pay AS p INNER JOIN (SELECT name, per_capita_GDP
>                               FROM international_GDP
>                               WHERE per_capita_GDP >= 10000.0) AS pcg
>     ON pcg.per_capita_GDP = p.annual_wage AND p.country = pcg.name;
> ```

## Limitations

Although subqueries can contain a wide range of SELECT statements, they have the following limitations:

* Some clauses are not allowed inside of ANY/ALL/NOT EXISTS subqueries.
* The only type of subquery that allows a
  [LIMIT / FETCH](../sql-reference/constructs/limit.md) clause is an uncorrelated scalar
  subquery. Also, because an uncorrelated scalar subquery returns only 1 row,
  the LIMIT clause has little or no practical value inside a subquery.

---
title: Working with tables in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-data-databases-table.md
section: User Guide
---

# Working with tables in Snowsight

You can work with [tables](../guides-overview-db.md) in SQL or using Snowsight.
For details about the available SQL commands for working with tables, see [Table, view, & sequence DDL](../sql-reference/ddl-table.md)

In Snowsight, in the navigation menu, select Catalog » Database Explorer, and then search for or browse to the table.
Select the table to do any of the following:

* Explore details about the table and the columns defined in the table.
* Preview the data in the table.
* [Load data into the table from files](data-load-web-ui.md).
* Monitor the data loading activity for the table using the [table-level Copy History](data-load-monitor.md).

> **Note:**
>
> To work with tables in Snowsight, you must use a role with the relevant [table privileges](security-access-control-privileges.md).

## Explore table details in Snowsight

After opening a table in Snowsight, you can review the table name and the database and schema that contain the table.

You can also review the following details:

* The type of table
* The owner role of the table
* When the table was created. Hover over the time to see the exact date and time.
* The number of rows in the table
* The size of the table. For example, 2.5 KB for a very small table.

You can review the SQL definition for the table on the Table Details tab in the Table definition section.
The Columns tab provides information about the columns in the table. Use the Search option to find columns by name.

Manage privileges for the table in the Privileges section of the Table Details tab.
See [Manage object privileges with Snowsight](security-access-control-configure.md).

## Manage a table in Snowsight

You can perform the following basic management tasks for a table in Snowsight:

* To edit the table name or add a comment, select  » Edit.
* To clone the table, select  » Clone.
* To drop the table, select  » Drop.
* To transfer ownership of the table to another role, select  » Transfer Ownership.
* To [load data into the table from files](data-load-web-ui.md), select Load Data.

## Preview data in a table

The Data Preview tab provides a preview of up to the first 100 rows of the table.

Select  to manipulate the preview data:

* Sort the data in ascending or descending order.
* Increase or decrease the decimal precision.
* Show thousands separators in numbers.
* Display the data in the column as percentages.

The options available to you depend on the type of data in the column.

> **Note:**
>
> The preview requires a warehouse. By default, Snowsight uses the default warehouse for your user profile, or you can select
> a different warehouse.

---
title: Working with Temporary and Transient Tables
source: https://docs.snowflake.com/en/user-guide/tables-temp-transient.md
section: User Guide
---

# Working with Temporary and Transient Tables

In addition to permanent tables, which is the default table type when creating tables, Snowflake supports defining tables as either temporary or
transient. These types of tables are especially useful for storing data that does not need to be maintained for extended periods of time
(i.e. transitory data).

> **Note:**
>
> You cannot create [hybrid tables](tables-hybrid.md) that are temporary or transient. In turn, you cannot create hybrid tables within transient schemas or databases.

## Temporary Tables

Snowflake supports creating temporary tables for storing non-permanent, transitory data (e.g. ETL data, session-specific data). Temporary tables
only exist within the session in which they were created and persist only for the remainder of the session. As such, they are not visible to other
users or sessions. Once the session ends, data stored in the table is purged completely from the system and, therefore, is not recoverable, either
by the user who created the table or Snowflake.

> **Note:**
>
> In addition to tables, Snowflake supports creating certain other database objects as temporary (e.g. stages). These objects follow the same
> semantics (i.e. they are session-based, persisting only for the remainder of the session).

### Data Storage Usage for Temporary Tables

For the duration of the existence of a temporary table, the data stored in the table contributes to the overall storage charges that Snowflake bills
your account. To prevent any unexpected storage changes, particularly if you create large temporary tables in sessions that you maintain for periods
longer than 24 hours, Snowflake recommends explicitly dropping these tables once they are no longer needed. You can also explicitly exit the session
in which the table was created to ensure no additional charges are accrued.

For more information, see Comparison of Table Types (in this topic).

### Maintenance of Temporary Tables

If your workload generates high volumes of temporary tables, you are very likely to experience degraded performance when you query the
[COLUMNS view](../sql-reference/info-schema/columns.md) or [TABLES view](../sql-reference/info-schema/tables.md) in the Information Schema.

Snowflake recommends the following best practices:

* [Drop temporary tables](../sql-reference/sql/drop-table.md) explicitly before sessions end.
* Make sure that users explicitly log out of sessions that are inactive. See [Snowflake sessions and session policies](session-policies.md).

Taking these actions consistently will help you avoid query performance degradation that is related to the presence of temporary tables.

### Potential Naming Conflicts with Other Table Types

Similar to the other table types (transient and permanent), temporary tables belong to a specified database and schema; however, because they are
session-based, they aren’t bound by the same uniqueness requirements. This means you can create temporary and non-temporary tables with the same name
within the same schema.

However, note that the temporary table takes precedence in the session over any other table with the same name in the same schema. This can lead to
potential conflicts and unexpected behavior, particularly when performing DDL on both temporary and non-temporary tables. For example:

* You can create a temporary table that has the same name as an existing table in the same schema, effectively hiding the existing table.
* You can create a table that has the same name as an existing temporary table in the same schema; however, the newly-created table is hidden by the
  temporary table.

Subsequently, all queries and other operations performed in the session on the table affect only the temporary table.

> **Important:**
>
> This behavior is particularly important to note when dropping a table in a session and then using Time Travel to restore the table. It is also
> important to note this behavior when using CREATE OR REPLACE to create a table because this essentially drops a table (if it exists) and creates a
> new table with the specified definition.

### Creating a Temporary Table

To create a temporary table, simply specify the TEMPORARY keyword (or TEMP abbreviation) in [CREATE TABLE](../sql-reference/sql/create-table.md).
You can also use the [TableCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.TableCollection) Python API.

Note that creating a temporary table does not require the CREATE TABLE privilege on the schema in which the object is created.

For example:

SQLPython

```sqlexample
CREATE TEMPORARY TABLE mytemptable (id NUMBER, creation_date DATE);
```

```python
from snowflake.core.table import Table, TableColumn

my_temp_table = Table(
  name="mytemptable",
  columns=[TableColumn(name="id", datatype="int"),
          TableColumn(name="creation_date", datatype="timestamp_tz")],
  kind="TEMPORARY"
)
root.databases["<database>"].schemas["<schema>"].tables.create(my_temp_table)
```

> **Note:**
>
> After creation, temporary tables cannot be converted to any other table type.

## Transient Tables

Snowflake supports creating transient tables that persist until explicitly dropped and are available to all users with the appropriate privileges.
Transient tables are similar to permanent tables with the key difference that they do not have a Fail-safe period. As a result, transient tables
are specifically designed for transitory data that needs to be maintained beyond each session (in contrast to temporary tables), but does not
need the same level of data protection and recovery provided by permanent tables.

### Data Storage Usage for Transient Tables

Similar to permanent tables, transient tables contribute to the overall storage charges that Snowflake bills your account; however, because
transient tables do not utilize Fail-safe, there are no Fail-safe costs (i.e. the costs associated with maintaining the data required for
Fail-safe disaster recovery).

For more information, see Comparison of Table Types (in this topic).

### Transient Tables Created as Clones of Permanent Tables

When you create a transient table as a clone of a permanent table, Snowflake creates a [zero-copy clone](tables-storage-considerations.md).
This means when the transient table is created, it utilizes no data storage because it shares all of the existing
[micro-partitions](tables-clustering-micropartitions.md) of the original permanent table.
When rows are added, deleted, or updated in the clone, it results in new micro-partitions that belong exclusively
to the clone (in this case, the transient table).

When a permanent table is deleted, it enters Fail-safe for a 7-day period. Fail-safe bytes incur
[storage costs](data-cdp-storage-costs.md). If a transient table is created as a clone of a permanent table, this
might delay the time between when the permanent table is deleted and when all of its bytes enter Fail-safe. If the transient
table clone shares any micro-partitions with the permanent table when it is deleted, those shared bytes will only enter
Fail-safe when the transient table is deleted.

### Transient Databases and Schemas

Snowflake also supports creating transient databases and schemas. All tables created in a transient schema, as well as all schemas created in
a transient database, are transient by definition.

### Creating a Transient Table, Schema, or Database

To create a transient table, schema, or database, simply specify the TRANSIENT keyword when creating the object:

SQLPython

* [CREATE TABLE](../sql-reference/sql/create-table.md)
* [CREATE SCHEMA](../sql-reference/sql/create-schema.md)
* [CREATE DATABASE](../sql-reference/sql/create-database.md)

* [DatabaseCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.database.DatabaseCollection)
* [SchemaCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.schema.SchemaCollection)
* [TableCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.TableCollection)

For example, to create a transient table:

SQLPython

```sqlexample
CREATE TRANSIENT TABLE mytranstable (id NUMBER, creation_date DATE);
```

```python
from snowflake.core.table import Table, TableColumn

my_trans_table = Table(
  name="mytranstable",
  columns=[TableColumn(name="id", datatype="int"),
          TableColumn(name="creation_date", datatype="timestamp_tz")],
  kind="TRANSIENT"
)
root.databases["<database>"].schemas["<schema>"].tables.create(my_trans_table)
```

> **Note:**
>
> After creation, transient tables cannot be converted to any other table type.

## Comparison of Table Types

The following table summarizes the differences between the three table types, particularly with regard to their impact on Time Travel and
Fail-safe:

| Type | Persistence | Cloning (source type => target type) | Time Travel Retention Period (Days) | Fail-safe Period (Days) |
| --- | --- | --- | --- | --- |
| Temporary | Remainder of session | Temporary => Temporary . . Temporary => Transient | 0 or 1 (default is 1) | 0 |
| Transient | Until explicitly dropped | Transient => Temporary . . Transient => Transient | 0 or 1 (default is 1) | 0 |
| Permanent ([Standard Edition](intro-editions.md)) | Until explicitly dropped | Permanent => Temporary . . Permanent => Transient . . Permanent => Permanent | 0 or 1 (default is 1) | 7 |
| Permanent ([Enterprise Edition and higher](intro-editions.md)) | Until explicitly dropped | Permanent => Temporary . . Permanent => Transient . . Permanent => Permanent | 0 to 90 (default is configurable) | 7 |

### Time Travel Notes

* The Time Travel retention period for a table can be specified when the table is created or any time afterwards. Within the retention period,
  all Time Travel operations can be performed on data in the table (e.g. queries) and the table itself (e.g. cloning and restoration).
* If the Time Travel retention period for a permanent table is set to 0, it will immediately enter the Fail-safe period when it is dropped.
* Temporary tables can have a Time Travel retention period of 1 day; however, a temporary table is purged once the session (in which the table
  was created) ends so the actual retention period is for 24 hours or the remainder of the session, whichever is shorter.
* A long-running Time Travel query will delay the purging of temporary and transient tables until the query completes.

### Fail-safe Notes

* The Fail-safe period is not configurable for any table type.
* Transient and temporary tables have no Fail-safe period. As a result, no additional data storage charges are incurred beyond the
  Time Travel retention period.

> **Important:**
>
> Because transient tables do not have a Fail-safe period, they provide a good option for managing the cost of very large tables used to store
> transitory data; however, the data in these tables cannot be recovered after the Time Travel retention period passes.
>
> For example, if a system failure occurs in which a transient table is dropped or lost, after 1 day, the data is not recoverable by you or
> Snowflake. As such, we recommend using transient tables only for data that does not need to be protected against failures or data that
> can be reconstructed outside of Snowflake.
>
> For more information, see [Data storage considerations](tables-storage-considerations.md).

---
title: Working with user-defined functions in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-data-databases-function.md
section: User Guide
---

# Working with user-defined functions in Snowsight

You can work with user-defined functions (UDFs) in SQL or in Snowsight.

In Snowsight, in the navigation menu, select Catalog » Database Explorer, and then search for or browse to the UDF.
Select the UDF to review details about and manage the function.

You must have the [relevant privileges](security-access-control-privileges.md) to access and manage the UDF in Snowsight.

## Explore UDF details in Snowsight

After opening a UDF in Snowsight, you can do the following:

* Identify when the function was created, and any comment on the function.
  You can hover over the time details to see the exact creation date and time.
* Review additional details about the UDF, including:

  + Arguments that the UDF takes, and their expected data types, if applicable.
  + The data type of the result of the function.
  + Whether the function is an aggregate function.
  + Whether the function is a secure function.
  + Whether the function is a table function.
  + The language in which the UDF is written. For example, Java.
* Review the SQL definition of the UDF in the Function definition section.
* Review the roles with privileges on the function in the Privileges section.

## Manage a UDF in Snowsight

You can perform the following basic management tasks for a UDF in Snowsight:

* To edit the name of the function or add a comment, select  » Edit.
* To drop the function, select  » Drop.
* To transfer ownership of the function to another role, select  » Transfer Ownership

---
title: Working with views in Snowsight
source: https://docs.snowflake.com/en/user-guide/ui-snowsight-data-databases-view.md
section: User Guide
---

# Working with views in Snowsight

You can work with [views](views-introduction.md), [materialized views](views-materialized.md), and
[semantic views](views-semantic/overview.md) in SQL or in Snowsight.
For details about the available SQL commands for working with views, see [Table, view, sequence, and user-defined type commands](../sql-reference/commands-table.md).

In Snowsight, in the navigation menu, select Catalog » Database Explorer, and then search for or browse to the view.
Select the view to explore details about the view, the columns defined in the view, and preview the data in the view.

You must have the relevant privileges to access and manage the [view](security-access-control-privileges.md),
[materialized view](security-access-control-privileges.md), or [semantic view](security-access-control-privileges.md) in
Snowsight.

## Explore view details in Snowsight

After opening a view in Snowsight, you can do the following:

* Identify the type of view and when the view was last created. Hover over the time details to see the exact creation date and
  time.
* Review the SQL definition of the view on the View Details or Semantic View Details tab.
* Manage privileges for the view in the Privileges section of the View Details or Semantic View Details tab.
  To manage privileges, see [Manage object privileges with Snowsight](security-access-control-configure.md).
* For views and materialized views, review the name, type, ordinal (order of the column in the view), tags, and masking policies
  applied to the view on the Columns tab.

  To add tags and masking policies to a view, see [Use Snowsight to set tags](object-tagging/work.md).
* For semantic views, you can view details about the logical tables, relationships, facts, dimensions, and metrics by selecting
  the Semantic Information tab.

## Manage a view in Snowsight

You can perform the following basic management tasks for a view in Snowsight:

* To edit the view name or add a comment, select  » Edit.
* To drop the view, select  » Drop.
* To transfer ownership of the view to another role, select  » Transfer Ownership.
* To edit a semantic view, select the Semantic Information tab, and select Edit in Cortex Analyst.

## Preview data in a view

For views and materialized views, you can preview up to the first 100 rows of a view on the Data Preview tab for the view.

> **Note:**
>
> If your view is complex, you might not see a data preview. Snowsight queries the view for the data preview and waits
> for up to 300 seconds for results to be returned. If the query takes longer than 300 seconds to complete, Snowsight
> cancels the query and displays no preview data.

Select  to manipulate the preview data:

* Sort the data in ascending or descending order.
* Increase or decrease the decimal precision.
* Show thousands separators in numbers.
* Display the data in the column as percentages.

The options available to you depend on the type of data in the column.

> **Note:**
>
> The preview requires a warehouse. By default, Snowsight uses the default warehouse for your user profile, or you can
> select a different warehouse.

---
title: Working with warehouses
source: https://docs.snowflake.com/en/user-guide/warehouses-tasks.md
section: User Guide
---

# Working with warehouses

All warehouse tasks can be performed from the Snowflake web interface or using the DDL commands for warehouses.

## Creating a warehouse

You can create a warehouse using the Snowsight or SQL:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses » Warehouse
>
> SQL:
> :   Execute a [CREATE WAREHOUSE](../sql-reference/sql/create-warehouse.md) command.
>
> Python:
> :   Use the [WarehouseCollection.create](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseCollection)
>     API ([Creating a warehouse](../developer-guide/snowflake-python-api/snowflake-python-managing-warehouses.md)).

When you create a warehouse, you can specify whether the warehouse is created initially in the “Started” (i.e. running) or “Suspended”
state. If you choose “Started”, the warehouse starts consuming credits once all the compute resources are provisioned for the warehouse.

> **Note:**
>
> If you choose to create a warehouse in the “Started” state, the warehouse may take some time to become fully available as Snowflake
> provisions all the compute resources for the warehouse.

## Starting or resuming a warehouse

A warehouse can be started at any time, including on initial creation. Once a warehouse is created, resuming a warehouse is the same as
starting a warehouse.

You can resume a suspended (that is, inactive) warehouse by using the following interfaces:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses » *<suspended_warehouse_name>* »  » Resume
>
> SQL:
> :   Execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command with the `RESUME` keyword.
>
> Python:
> :   Use the [WarehouseResource.resume](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseResource)
>     API ([Performing warehouse operations](../developer-guide/snowflake-python-api/snowflake-python-managing-warehouses.md)).

Starting a warehouse typically takes only a few seconds; however, in some rare instances, it can take longer as Snowflake provisions the
compute resources for the warehouse.

Warehouses consume credits while running:

* A warehouse begins to consume credits once all the compute resources are provisioned for the warehouse.

  + In a rare instance when some of the compute resources fail to provision, the warehouse only consumes credits for the provisioned
    compute resources.
  + Once the remaining compute resources are successfully provisioned, the warehouse starts consuming credits for all requested compute
    resources.
* While starting or resuming a warehouse often takes only a few seconds, in some instances, it can take longer as Snowflake provisions the
  compute resources for the warehouse.
* Snowflake does not begin executing SQL statements submitted to a warehouse until all of the compute resources for the warehouse are
  successfully provisioned, unless any of the resources fail to provision:

  + If any of the compute resources for the warehouse fail to provision during start-up, Snowflake attempts to repair the failed resources.
  + During the repair process, the warehouse starts processing SQL statements once 50% or more of the requested compute resources are
    successfully provisioned.

Credits are billed on a per-second basis while the warehouse is running, with a 1-minute minimum each time the warehouse is resumed; however,
credit consumption is reported in 60-minute (i.e. hourly) increments.

> **Note:**
>
> A warehouse must be running and the current warehouse for the session (i.e. [in use](../sql-reference/sql/use-warehouse.md)) to
> process SQL statements submitted in the session. For more information, refer to Using a Warehouse in this topic.

## Suspending a warehouse

A running warehouse can be suspended at any time, even while executing SQL statements. Suspending a warehouse stops the
warehouse from consuming credits once all the compute resources shut down.

You can suspend a warehouse by using the following interfaces:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses » *<started_warehouse_name>* »  » Suspend
>
> SQL:
> :   Execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command with the `SUSPEND` keyword.
>
> Python:
> :   Use the [WarehouseResource.suspend](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseResource)
>     API ([Performing warehouse operations](../developer-guide/snowflake-python-api/snowflake-python-managing-warehouses.md)).

When you suspend a warehouse, Snowflake immediately shuts down all idle compute resources for the warehouse, but allows any compute
resources that are executing statements to continue until the statements complete, at which time the resources are shut down and the status
of the warehouse changes to “Suspended”. Compute resources waiting to shut down are considered to be in “quiesce” mode.

## Resizing a warehouse

A warehouse can be resized up or down at any time, including while it is running and processing statements.

You can resize a warehouse by using the following interfaces:

> Snowsight:
> :   In the navigation menu, select Compute » Warehouses » *<warehouse_name>* »  » Edit
>
> SQL:
> :   Execute an [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command with `SET WAREHOUSE_SIZE = ...`.
>
> Python:
> :   Use the [WarehouseResource.create_or_alter](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseResource)
>     API ([Creating or altering a warehouse](../developer-guide/snowflake-python-api/snowflake-python-managing-warehouses.md)).

Resizing a warehouse to a larger size is useful when the operations being performed by the warehouse will benefit from more compute resources,
including:

* Improving the performance of large, complex queries against large data sets.
* Improving performance while loading and unloading significant amounts of data.

### Effects of resizing a running warehouse

Resizing a running warehouse adds or removes compute resources in each cluster in the warehouse. All the usage and credit
rules associated with starting or suspending a warehouse apply to resizing a started warehouse, such as:

* Compute resources added to a warehouse start using credits when they are provisioned; however, the additional compute resources don’t
  start executing statements until they are all provisioned, unless some of the resources fail to provision.
* Compute resources are removed from a warehouse only when they are no longer being used to execute any current statements.

Resizing a warehouse doesn’t have any impact on statements that are currently being executed by the warehouse. When resizing to a larger
size, the new compute resources, once fully provisioned, are used only to execute statements that are already in the warehouse queue, as
well as all future statements submitted to the warehouse.

> **Tip:**
>
> To verify the additional compute resources for your warehouse have been fully provisioned, add the `WAIT_FOR_COMPLETION` parameter
> to the [ALTER WAREHOUSE](../sql-reference/sql/alter-warehouse.md) command. You can also use [SHOW WAREHOUSES](../sql-reference/sql/show-warehouses.md) to check its
> `state`.

### Effects of resizing a suspended warehouse

Resizing a suspended warehouse does not provision any new compute resources for the warehouse. It simply instructs Snowflake to provision
the additional compute resources when the warehouse is next resumed, at which time all the usage and credit rules associated with starting a
warehouse apply.

## Using a warehouse

To execute a query or DML statement in Snowflake, a warehouse must be running and it must be specified as the current warehouse for the
session in which the query/statement is submitted.

A Snowflake session can only have one current warehouse at a time. The current warehouse for the session can be specified or changed at any
time through the [USE WAREHOUSE](../sql-reference/sql/use-warehouse.md) SQL command or the [WarehouseResource.use_warehouse](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.WarehouseResource) Python API.

Once a running warehouse has been set as the current warehouse for the session, queries and DML statements submitted within the session are
processed by the warehouse. In the Query History and Workspaces pages in Snowsight, you can view the warehouse used to
process each query/statement.

> **Note:**
>
> Some Snowsight features require a warehouse to run SQL queries for retrieving data, such as Task Run History or
> Data Preview for a table. An X-Small warehouse is recommended and generally sufficient for most of these queries. For information,
> see [Warehouse considerations](warehouses-considerations.md).

## Delegating warehouse management

By default, the ACCOUNTADMIN role is granted the ability to alter, suspend, describe, and perform other operations on all
warehouses in the account.

If you need to delegate these abilities to a custom role in your account, you can grant the MANAGE WAREHOUSES privilege to that role.
Granting the MANAGE WAREHOUSES privilege is equivalent to granting the
MODIFY, MONITOR, and OPERATE privileges on all warehouses in the account.

The following examples demonstrate how you can delegate the ability to manage warehouses to a custom role named
`manage_wh_role`. The example uses the `manage_wh_role` to make changes to the warehouse `test_wh`, which is owned by a
different role (`create_wh_role`).

Create a new role that will create and own a new warehouse, and grant the CREATE WAREHOUSE privilege to that role:

SQLPython

Using the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command:

```sqlexample
CREATE ROLE create_wh_role;
GRANT CREATE WAREHOUSE ON ACCOUNT TO ROLE create_wh_role;
GRANT ROLE create_wh_role TO ROLE SYSADMIN;
```

Using the [RoleResource.grant_privileges](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.role.RoleResource) API:

```python
from snowflake.core.role import Role, Securable

my_role = Role(name="create_wh_role")
my_role_res = root.roles.create(my_role)

my_role_res.grant_privileges(
  privileges=["CREATE WAREHOUSE"], securable_type="ACCOUNT"
)

root.roles['SYSADMIN'].grant_role(role_type="ROLE", role=Securable(name='create_wh_role'))
```

Create a second role that will manage all warehouses in the account, and grant the MANAGE WAREHOUSES privilege to that role:

SQLPython

```sqlexample
CREATE ROLE manage_wh_role;
GRANT MANAGE WAREHOUSES ON ACCOUNT TO ROLE manage_wh_role;
GRANT ROLE manage_wh_role TO ROLE SYSADMIN;
```

```python
from snowflake.core.role import Role, Securable

my_role = Role(name="manage_wh_role")
my_role_res = root.roles.create(my_role)

my_role_res.grant_privileges(
  privileges=["MANAGE WAREHOUSES"], securable_type="ACCOUNT"
)

root.roles['SYSADMIN'].grant_role(role_type="ROLE", role=Securable(name='manage_wh_role'))
```

Using the `create_wh_role` role, create a new warehouse:

SQLPython

```sqlexample
USE ROLE create_wh_role;
CREATE OR REPLACE WAREHOUSE test_wh
    WITH WAREHOUSE_SIZE= XSMALL;
```

```python
from snowflake.core import CreateMode
from snowflake.core.warehouse import Warehouse

root.session.use_role("create_wh_role")

my_wh = Warehouse(
  name="test_wh",
  warehouse_size="XSMALL"
)
root.warehouses.create(my_wh, mode=CreateMode.or_replace)
```

Change the current role to `manage_wh_role`:

SQLPython

```sqlexample
USE ROLE manage_wh_role;
```

```python
root.session.use_role("manage_wh_role")
```

Although the `manage_wh_role` does not own the `test_wh`, that role does have the MANAGE WAREHOUSES privilege, which means
that you can:

* Suspend and resume the warehouse:

  SQLPython

  ```sqlexample
  ALTER WAREHOUSE test_wh SUSPEND;
  ALTER WAREHOUSE test_wh RESUME;
  ```

  ```python
  my_wh_res = root.warehouses["test_wh"]
  my_wh_res.suspend()
  my_wh_res.resume()
  ```
* Change the size of the warehouse:

  SQLPython

  ```sqlexample
  ALTER WAREHOUSE test_wh SET WAREHOUSE_SIZE = SMALL;
  ```

  ```python
  my_wh = root.warehouses["test_wh"].fetch()
  my_wh.warehouse_size = "SMALL"
  root.warehouses["test_wh"].create_or_alter(my_wh)
  ```
* Describe the warehouse:

  SQLPython

  ```sqlexample
  DESC WAREHOUSE test_wh;
  ```

  ```python
  my_wh = root.warehouses["test_wh"].fetch()
  print(my_wh.to_dict())
  ```

## Review Warehouse Details in Snowsight

You must use the ACCOUNTADMIN role, or a role granted the relevant [warehouse privileges](security-access-control-privileges.md).

To review warehouses and manage warehouse details in Snowsight, complete the following steps:

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. In the navigation menu, select Compute » Warehouses.

You can then review the table of warehouses, search for warehouses, or filter the list of warehouses by status or size.

By default, you can see the following information about each warehouse:

* Name
* Status, such as Started, Resuming, or Suspended.
* Size
* Clusters, indicated by a bar in the column. You can hover over the value to see how many clusters are active.
* Running, for details on how many SQL statements are being executed by the warehouse.
* Queued, for details on how many SQL statements are queued for the warehouse.
* Owner, or the owning role for the warehouse.
* Resumed, to see how long ago the warehouse was resumed. Hover over the value to see the exact date and timestamp in your local time zone.

You can also add columns to see additional details for each warehouse in the table:

* QAS (Scale Factor), to see the scale factor of the warehouse used by Query Acceleration Service (QAS).
  See [Using the Query Acceleration Service (QAS)](query-acceleration-service.md).
* Scaling Policy, to see the scaling policy defined for the warehouse. See [Setting the scaling policy for a multi-cluster warehouse](warehouses-multicluster.md).
* Auto Resume, to see whether auto-resume is set up for the warehouse.
* Auto Suspend, to see the time period before auto-suspend occurs for the warehouse.
* Created, to see when the warehouse was created. Hover over the value to see the exact date and timestamp in your local time zone.

When you select a warehouse in the Warehouses table, you can see more details:

* The Warehouse Activity section provides a graph of warehouse load over a period of time, which can help you understand why a
  query might be running slowly. See [Monitoring warehouse load](warehouses-load-monitoring.md) for more details.
* The Details section provides additional information about your warehouse, including:

  + The status of the warehouse.
  + The size of the warehouse.
  + Maximum and minimum number of clusters the warehouse can use.
  + The scaling policy.
  + The number of tasks that are running and queued.
  + The period of no activity before the warehouse is automatically suspended.
  + If the warehouse is suspended, whether to automatically resume the warehouse when needed.
  + The last time the warehouse resumed operation.
* You can use the Privileges section to view, grant, and revoke privileges on the warehouse.

---
title: Workload identity federation
source: https://docs.snowflake.com/en/user-guide/workload-identity-federation.md
section: User Guide
---

# Workload identity federation

This document is for the following audiences:

* Developers of in-house cloud services.
* Administrators who manage integrations with internal and external services.
* Developers of multi-tenant SaaS applications who want to issue
  [OpenID Connect (OIDC) Federation](https://openid.net/developers/how-connect-works/) ID tokens to individual workloads that are running
  on their platform so that each customer workload can authenticate to Snowflake as a dedicated user.

Workload identity federation (WIF) is a service-to-service authentication method that lets workloads, such as applications, services, or
containers, authenticate with Snowflake using their cloud provider’s native identity system, such as AWS Identity and Access Management (AWS IAM) roles, Microsoft Entra ID, and
Google Cloud service accounts to get an attestation that Snowflake can use and validate.

WIF removes the need to manage and store long-lived credentials such as passwords, API keys, key pairs, and
programmatic access tokens for authenticating to Snowflake. WIF also reduces the complexity involved in getting
credentials, where other methods, such as [External OAuth](oauth-ext-overview.md) can require more effort to set up.
Applications, services, and containers that use Snowflake connectors automatically get short-lived credentials from their platform’s
identity provider (IdP) through each platform’s native mechanisms.

## Benefits

This section describes why you may want to use WIF for authentication:

* **Cost effective**: Using existing IdPs to manage service identities reduces the need for additional tools or licenses, which can be
  cost-effective.
* **Interoperability**: Popular cloud provider services, such as AWS IAM, Entra ID, and Google Cloud, support and
  encourage WIF as an authentication method for external workloads.
* **Convenient auditing and monitoring**:

  + Administrators can use existing cloud provider services, such as
    [AWS CloudTrail](https://docs.aws.amazon.com/awscloudtrail/latest/userguide/cloudtrail-user-guide.html) and [Azure Monitor](https://learn.microsoft.com/en-us/azure/azure-monitor/fundamentals/overview), to log and monitor activity.
  + Snowflake administrators can query the LOGIN_HISTORY and CREDENTIALS views in the
    [ACCOUNT_USAGE schema](../sql-reference/account-usage.md) to monitor and audit services that use WIF.

## Workflow for implementing workload identity federation

You can use WIF to authenticate a variety of workloads using different IdPs, but the basic workflow, as shown in
the following steps, remains the same:

1. As a workload administrator, configure your service to use a native identity provider so that the provider can issue an *attestation* of
   your workload’s identity. This attestation is often, but not always, a JSON Web Token (JWT).
2. As a Snowflake administrator, create a Snowflake service user for your workload. You set the properties of this user to values found in
   the attestation sent by the provider. For example, a user property might specify the name of an IAM role or the issuer URL of the
   provider.
3. As a workload developer, configure your workload to use a Snowflake driver. Drivers send the
   attestation to Snowflake for verification.

To view end-to-end examples of this workflow for different types of workloads and IdPs, see Use cases.

## Access control requirements

To configure WIF for a Snowflake service user — that is, a user with their TYPE property set to `SERVICE` —
you must grant your activated roles one of the following privileges:

* OWNERSHIP on the service user.
* MODIFY PROGRAMMATIC AUTHENTICATION METHODS on the service user.

## Supported Snowflake drivers

A workload uses a Snowflake driver to send an attestation when it connects to Snowflake. The following drivers support workload identity
federation:

| Driver | Minimum version |
| --- | --- |
| [Go](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Authenticator_values) | v1.16.0 |
| [JDBC](../developer-guide/jdbc/jdbc-configure.md) | v3.26.0 |
| [.NET](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/Connecting.md) | v4.8.0 |
| [Node.js](../developer-guide/node-js/nodejs-driver-authenticate.md) | v2.2.0 |
| [ODBC](../developer-guide/odbc/odbc-parameters.md) | v3.11.0 |
| [PHP PDO](https://github.com/snowflakedb/pdo_snowflake/blob/master/README.rst) | v3.6.0 |
| [Python](../developer-guide/python-connector/python-connector-connect.md) | v3.17.0 |

## Minimizing the number of Snowflake identities

Creating a dedicated Snowflake user for every WIF workload can be challenging at scale. It’s often better to consolidate so that multiple workloads authenticate with a well-defined, limited number of Snowflake service users. This approach reduces identity sprawl in Snowflake, simplifies user lifecycle management, and enables consistent access patterns without tightly coupling Snowflake users to individual workloads or infrastructure.

### Create a single user for multiple workloads

Some cloud providers allow the identity that is attached to a workload to impersonate another identity. For example, suppose a workload on
Google Cloud is attached to service account `A`. You can use
[service account impersonation](https://docs.cloud.google.com/iam/docs/service-account-impersonation) so that service account `A`
authenticates as service account `B`. That is, service account `A` impersonates service account `B` so that the workload can
authenticate to Snowflake as user `B`.

Impersonation is especially useful in an environment that has many workloads because creating a one-to-one mapping between each workload
and a Snowflake service user is operationally expensive and difficult to manage. By allowing multiple workloads to impersonate a shared
Snowflake identity, teams can centralize Snowflake access behind a small set of service users while enforcing access controls through the
cloud provider’s IAM.

**Prerequisite**

To use impersonation so that multiple workloads authenticate with a single Snowflake identity, the workload must be on Google Cloud or AWS.
Currently, Microsoft Azure doesn’t support impersonation.

**Workflow**

1. As the workload administrator, configure the workloads so that their attached identities impersonate another identity.
2. As the Snowflake administrator, create a service user that corresponds to the cloud provider identity that is authenticating to
   Snowflake. For example, if workloads are using service account `D` to authenticate, create a service user and set its SUBJECT parameter
   to the unique identifier of service account `D`.
3. As the workload developer, use a connection parameter of the driver to define the
   identity chain for the workloads that use impersonation. The parameter is set to a list of strings, where each string is a cloud
   provider identity (for example, a service account ID).

   The driver follows the identity chain defined in the list in order to obtain the token that is needed to authorize the next cloud
   provider identity. Each identity in the chain needs permissions to impersonate the next identity only. The final identity in the list
   obtains the Snowflake connection token that is used to connect to Snowflake.

   To obtain the syntax of the connection parameter for your driver, see the driver documentation.

**Example**

Suppose a Google Cloud workload is attached to service account `A` but impersonates service account `B`, which then
impersonates service account `D`. To set up the Python driver so that the workload authenticates with WIF using the identity of service
account `D`, define the connection parameter as follows:

```python
workload_identity_impersonation_path=['service_account_a@my-project.iam.gserviceaccount.com',
                                      'service_account_b@my-project.iam.gserviceaccount.com',
                                      'service_account_d@my-project.iam.gserviceaccount.com']
```

The Snowflake service user created for the workload should contain the identifier of the final identity in the identity chain. Given the
example above, create the service user with the following command:

```sqlexample
CREATE USER <username>
  WORKLOAD_IDENTITY = (
    TYPE = GCP
    SUBJECT = 'service_account_d@my-project.iam.gserviceaccount.com'
  )
  TYPE = SERVICE
  DEFAULT_ROLE = PUBLIC;
```

### Create a single user for multiple GitHub or GitLab environments

If you’re using GitHub actions or GitLab projects, you can use the tool’s OIDC Provider to use WIF to authenticate to Snowflake. By default,
the OIDC token for each GitHub action or GitLab project might have a different subject in the `sub` claim, which would require you to
have multiple Snowflake service users, one for each subject.

However, GitHub and GitLab let you customize the `sub` claim of its OIDC tokens. This lets you configure your tool so that the subject
of OIDC tokens is the same for all of your environments. When you create a Snowflake service user, you specify the subject of the OIDC
tokens it will be receiving from GitHub or GitLab. Because the subject in the tokens will always be the same (that is, the custom value),
you only need one service user for all of your environments.

To learn more about customizing the `sub` claim of a GitHub or GitLab OIDC token, see the following resources:

* **GitHub**: To customize the subject claim for an organization or repository, see the
  [GitHub documentation](https://docs.github.com/en/actions/reference/security/oidc#customizing-the-subject-claims-for-an-organization-or-repository).
* **GitLab**: To use the Project API to customize the `sub` claim of the GitLab OIDC token, see the
  [GitLab documentation](https://docs.gitlab.com/api/projects/). Currently, the claim is customized with the
  `ci_id_token_sub_claim_components` attribute.

After you’ve defined a custom `sub` claim that is the same for all of your GitHub or GitLab environments, configure the SUBJECT
parameter of your Snowflake service user to match the custom `sub` claim.

## Hardening your security posture

You can use an [authentication policy](authentication-policies.md) to control which Snowflake service users can authenticate
with WIF. You can also create and set the authentication policy so that a workload can authenticate only if it uses
a specified identity provider, or an account within that provider.

For example, the following authentication policy allows a workload to authenticate only if it uses Microsoft Entra ID as its provider and the
issuer of the attestation is a Microsoft Entra ID tenant with tenant ID `https://login.microsoftonline.com/9ebd1ec9-9a78-4429-8f53-5cf870a812d1/v2.0`:

> ```sqlexample
> CREATE AUTHENTICATION POLICY workload_policy
>   WORKLOAD_IDENTITY_POLICY=(
>     ALLOWED_PROVIDERS = (AZURE)
>     ALLOWED_AZURE_ISSUERS = (
>       'https://login.microsoftonline.com/9ebd1ec9-9a78-4429-8f53-5cf870a812d1/v2.0')
>   );
> ```

For more information about the `WORKLOAD_IDENTITY_POLICY` parameter, see [CREATE AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md).

For more information about setting an authentication policy so it is enforced, see [Setting an authentication policy on an account or user](authentication-policies.md).

## Use cases

The following use cases are examples of implementing WIF for a workload:

### Authenticate using AWS IAM roles and a Snowflake Python driver

Complete the following tasks to use WIF to authenticate to Snowflake from your AWS service:

#### Configure AWS

To configure your AWS service to use AWS IAM as its identity provider, attach an IAM role. For more information, see the AWS documentation
that corresponds to your workload.

* For Amazon EC2, see [Attach an IAM role to an instance](https://docs.aws.amazon.com/AWSEC2/latest/UserGuide/attach-iam-role.html).
* For AWS Lambda, see [Defining Lambda function permissions with an execution role](https://docs.aws.amazon.com/lambda/latest/dg/lambda-intro-execution-role.html).

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you must have the Amazon Resource Identifier (ARN) that uniquely identifies the AWS user or
> role associated with the instance authenticating to Snowflake. To obtain the ARN of a IAM role, complete the following steps:
>
> 1. Sign in to the AWS Console, and then navigate to the IAM Dashboard.
> 2. In the left-hand navigation, select Roles.
> 3. Select the name of the role that you attached to your AWS instance.
> 4. In the Summary section, find the ARN, and then select the Copy icon.
>
> Snowflake accepts the following forms of [IAM identifiers](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_identifiers.html):
>
> > * `arn:aws:iam::account:user/user_name_with_path`
> > * `arn:aws:iam::account:role/role_name_with_path`
> > * `arn:aws:sts::account:assumed-role/role_name/role_session_name`

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER <username>
     WORKLOAD_IDENTITY = (
       TYPE = AWS
       ARN = '<amazon_resource_identifier>'
     )
     TYPE = SERVICE
     DEFAULT_ROLE = PUBLIC;
   ```

   Where `ARN` is the value you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   import os
   import snowflake.connector

   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='AWS'
   )
   ```
3. Run your Python application. It authenticates to Snowflake using WIF.

### Authenticate using Microsoft Entra ID and a Snowflake Python driver

Complete the steps in each section listed below to use WIF to authenticate to Snowflake from Microsoft Entra ID:

* Configure Microsoft Entra ID
* Configure Microsoft Azure
* Configure Snowflake
* Configure your workload to use a Snowflake driver

#### Configure Microsoft Entra ID

A Microsoft Entra ID tenant administrator must complete the following steps to allow usage of Snowflake workload identity. These steps only
need to be performed once per Microsoft Entra ID tenant:

1. Log into Microsoft Azure portal.
2. Ensure you have Azure tenant admin privileges.
3. Consent to installing the multi-tenant Snowflake EntraID app by visiting [the consent URI](https://login.microsoftonline.com/common/oauth2/v2.0/authorize?client_id=fd3f753b-eed3-462c-b6a7-a4b5bb650aad&response_type=none&scope=openid&redirect_uri=https://www.snowflake.com/).

   The multi-tenant Snowflake EntraID app is publisher-verified, and represents Snowflake as a resource. The app is used as the audience for
   the access token when authenticating to Snowflake. This app only requires basic permissions and is non-privileged.
4. Select Accept to give permissions to the Snowflake EntraID app.

#### Configure Microsoft Azure

Complete the following steps to configure your Microsoft Azure service to use WIF:

1. Log into Microsoft Azure portal.
2. Select your workload, such as a [virtual machine](https://learn.microsoft.com/en-us/azure/virtual-machines/) or an [app service](https://learn.microsoft.com/en-us/azure/app-service).
3. In the sidebar, navigate to Security » Identity.
4. Enable a managed identity for an
   [Azure VM](https://learn.microsoft.com/en-us/entra/identity/managed-identities-azure-resources/how-to-configure-managed-identities?pivots=qs-configure-portal-windows-vm#system-assigned-managed-identity)
   or an [Azure Function](https://learn.microsoft.com/en-us/azure/app-service/overview-managed-identity?tabs=portal%2Chttp).
5. Save the Object (Principal) ID for a later step.

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you need the following information:
>
> * The case-sensitive Object ID (Principal ID) of the managed identity you enabled in the previous step.
>   You can use the Azure Portal to copy this identifier from the Identity page for your Azure VM or function.
> * Your Microsoft Entra tenant ID. You use this value to construct the Authority URL.
>
>   + To obtain the tenant ID by using the Microsoft Entra Console, see the [How to find your Microsoft Entra tenant ID](https://learn.microsoft.com/en-us/entra/fundamentals/how-to-find-tenant).
>   + To obtain the tenant ID by using PowerShell, run the following commands:
>
>     ```powershell
>     Connect-AzAccount
>     Get-AzTenant
>     ```

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER <username>
     WORKLOAD_IDENTITY = (
       TYPE = AZURE
       ISSUER = 'https://login.microsoftonline.com/<tenant_id>/v2.0'
       SUBJECT = '<managed_identity_object_id>'
     )
     TYPE = SERVICE
     DEFAULT_ROLE = PUBLIC;
   ```

   Where `ISSUER` and `SUBJECT` are the values that you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   import snowflake.connector

   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='AZURE'
   )
   ```
3. Run your Python application. It authenticates to Snowflake using WIF.

> **Note:**
>
> As the workload developer, you might need to set an environment variable related to the managed identity that your workload administrator
> enabled. If your administrator enabled a [user-assigned managed identity](https://learn.microsoft.com/en-us/entra/identity/managed-identities-azure-resources/how-managed-identities-work-vm#user-assigned-managed-identity) rather than a system-assigned one, you must set the
> MANAGED_IDENTITY_CLIENT_ID environment variable to the client ID of the managed identity that you want to use for authenticating to
> Snowflake.

### Authenticate using Google Cloud service accounts and a Snowflake Python driver

Complete the following tasks to use WIF to authenticate to Snowflake from your Google Cloud service:

#### Configure Google Cloud

To configure your service to use Google Cloud as its identity provider, [attach a service account to your GCE or Cloud Run instance](https://cloud.google.com/compute/docs/instances/change-service-account).

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you must have the value of the service account’s `uniqueId` property. To obtain this unique ID,
> use the Google Cloud CLI to run the following command:
>
> ```bash
> gcloud iam service-accounts describe "<SERVICE_ACCOUNT_EMAIL_ADDRESS>" --format="value(uniqueId)"
> ```

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER <username>
     WORKLOAD_IDENTITY = (
       TYPE = GCP
       SUBJECT = '<unique_id_of_service_account>'
     )
     TYPE = SERVICE
     DEFAULT_ROLE = PUBLIC;
   ```

   Where `SUBJECT` is the value that you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   import snowflake.connector

   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='GCP'
   )
   ```
3. Run your Python application. It authenticates to Snowflake using WIF.

### Authenticate using an OpenID Connect (OIDC) issuer from Elastic Kubernetes Service (EKS)

Complete the steps in each section listed below to use WIF to authenticate to Snowflake from Elastic Kubernetes Service (EKS):

* Configure EKS
* Configure Snowflake
* Configure your workload to use a Snowflake driver

#### Configure EKS

1. Configure EKS to issue ID tokens that are compatible with Snowflake.

   1. [Configure your pod deployment YAML to include a projected ServiceAccount token volume](https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/#launch-a-pod-using-service-account-token-projection).
   2. Configure the ID tokens to contain an audience claim with `snowflakecomputing.com`.

      The following is an example of a YAML configuration with the proper audience:

      ```yaml
      kind: Pod
      metadata:
        name: nginx
      spec:
        containers:
        - image: nginx
          name: nginx
          volumeMounts:
          - mountPath: /var/run/secrets/tokens
            name: snowflake-token
        serviceAccountName: build-robot
        volumes:
        - name: snowflake-token
          projected:
            sources:
            - serviceAccountToken:
                path: snowflake-token
                expirationSeconds: 7200
                audience: snowflakecomputing.com
      ```

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you need the following information:
>
> * The issuer URL of the OIDC provider that is generating the ID token for the Kubernetes service account. To obtain this issuer URL, you
>   can perform either of the following tasks:
>
>   + Navigate to the Overview tab of your cluster, and copy the value in the OpenID Connect provider URL field.
>   + Run the following command with access to the API server endpoint:
>
>     ```bash
>     aws eks describe-cluster --name <cluster_name> --query "cluster.identity.oidc.issuer" --output text
>     ```
> * The namespace and name of the Kubernetes service account. You use this information to construct the subject of the ID token issued by
>   the OIDC provider.

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER my_eks_service
     WORKLOAD_IDENTITY = (
       TYPE = OIDC
       ISSUER = 'https://oidc.eks.<region>.amazonaws.com/id/<issuer_id>'
       SUBJECT = 'system:serviceaccount:<service_account_namespace>:<service_account_name>'
     )
     TYPE = SERVICE;
   ```

   Where `ISSUER` and `SUBJECT` are the values that you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='OIDC',
     token_file_path='<service_account_token_path>'
   )
   ```

   Where `service_account_token_path` is the one you created in the Configure EKS step. Based
   on the YAML example in that step, the token path would be `/var/run/secrets/tokens/snowflake-token`.
3. Run your Python application. It authenticates to Snowflake using WIF.

### Authenticate using an OpenID Connect (OIDC) issuer from Azure Kubernetes Service (AKS)

Complete the steps in each section listed below to use WIF to authenticate to Snowflake from Azure Kubernetes Service (AKS):

* Configure AKS
* Configure Snowflake
* Configure your workload to use a Snowflake driver

#### Configure AKS

Configure AKS to issue ID tokens that are compatible with Snowflake:

1. [Enable the OIDC issuer on your AKS cluster](https://learn.microsoft.com/en-us/azure/aks/use-oidc-issuer).
2. Configure AKS to issue ID tokens that are compatible with Snowflake.

   1. [Configure your pod deployment YAML to include a projected ServiceAccount token volume](https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/#launch-a-pod-using-service-account-token-projection).
   2. Configure the ID tokens to contain an audience claim with `snowflakecomputing.com`.

      The following is an example of a YAML configuration with the proper audience:

      ```yaml
      kind: Pod
      metadata:
        name: nginx
      spec:
        containers:
        - image: nginx
          name: nginx
          volumeMounts:
          - mountPath: /var/run/secrets/tokens
            name: snowflake-token
        serviceAccountName: build-robot
        volumes:
        - name: snowflake-token
          projected:
            sources:
            - serviceAccountToken:
                path: snowflake-token
                expirationSeconds: 7200
                audience: snowflakecomputing.com
      ```

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you need the following information:
>
> * The issuer URL of the OIDC provider that is generating the ID token for the Kubernetes service account. To obtain this issuer URL, see
>   the [Microsoft documentation](https://learn.microsoft.com/en-us/azure/aks/use-oidc-issuer#get-the-oidc-issuer-url)
> * The namespace and name of the Kubernetes service account. You use this information to construct the subject of the ID token issued
>   by the OIDC provider.

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER my_aks_service
     WORKLOAD_IDENTITY = (
       TYPE = OIDC
       ISSUER = 'https://<region>.oic.prod-aks.azure.com/<tenant_id>/<uuid>/'
       SUBJECT = 'system:serviceaccount:<service_account_namespace>:<service_account_name>'
     )
     TYPE = SERVICE;
   ```

   Where `ISSUER` and `SUBJECT` are the values that you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='OIDC',
     token_file_path='<service_account_token_path>'
   )
   ```

   Where `service_account_token_path` is the one you created in the Configure AKS step.
   Based on the YAML example in that step, the token path would be `/var/run/secrets/tokens/snowflake-token`.
3. Run your Python application. It authenticates to Snowflake using WIF.

### Authenticate using an OpenID Connect (OIDC) issuer from Google Kubernetes Engine (GKE)

Complete the steps in each section listed below to use WIF to authenticate to Snowflake from Google Kubernetes Engine (GKE):

* Configure GKE
* Configure Snowflake
* Configure your workload to use a Snowflake driver

#### Configure GKE

1. Configure GKE to issue ID tokens that are compatible with Snowflake.

   1. [Configure your pod deployment YAML to include a projected ServiceAccount token volume](https://kubernetes.io/docs/tasks/configure-pod-container/configure-service-account/#launch-a-pod-using-service-account-token-projection).
   2. Configure the ID tokens to contain an audience claim with `snowflakecomputing.com`.

      The following is an example of a YAML configuration with the proper audience:

      ```yaml
      kind: Pod
      metadata:
        name: nginx
      spec:
        containers:
        - image: nginx
          name: nginx
          volumeMounts:
          - mountPath: /var/run/secrets/tokens
            name: snowflake-token
        serviceAccountName: build-robot
        volumes:
        - name: snowflake-token
          projected:
            sources:
            - serviceAccountToken:
                path: snowflake-token
                expirationSeconds: 7200
                audience: snowflakecomputing.com
      ```

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you need the following information:
>
> * The Google Cloud project ID, region of the cluster, and cluster name. You use this information to construct the OIDC issuer.
> * The namespace and name of the Kubernetes service account. You use this information to construct the subject of the ID token issued by
>   the OIDC provider.

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER my_gke_service
     WORKLOAD_IDENTITY = (
       TYPE = OIDC
       ISSUER = 'https://container.googleapis.com/v1/projects/<project_id>/locations/<region>/clusters/<cluster_name>'
       SUBJECT = 'system:serviceaccount:<service_account_namespace>:<service_account_name>'
     )
     TYPE = SERVICE;
   ```

   Where `ISSUER` and `SUBJECT` are the values that you obtained before starting these steps.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='OIDC',
     token_file_path='<service_account_token_path>'
   )
   ```

   Where `service_account_token_path` is the one you created in the Configure GKE step.
   Based on the YAML example in that step, the token path would be `/var/run/secrets/tokens/snowflake-token`.
3. Run your Python application. It authenticates to Snowflake using WIF.

### Authenticate using a custom OpenID Connect (OIDC) Provider

Complete the steps in each section listed below to use WIF to authenticate to Snowflake from a custom OIDC Provider:

* Configure your OIDC Provider
* Configure Snowflake
* Configure your workload to use a Snowflake driver

#### Configure your OIDC Provider

1. Ensure that your OIDC Provider supports the [OpenID Configuration](https://openid.net/specs/openid-connect-discovery-1_0.html#ProviderConfig)
   as specified within the Discovery specification. Both the configuration and the configuration’s `jwks_uri` endpoint must be publicly accessible.
2. Configure your OpenID Provider to issue ID tokens with an audience claim that is set to `snowflakecomputing.com` or a non-empty custom list.
   If you define a non-empty custom list, you need to specify it when you create a service user in Snowflake.

#### Configure Snowflake

To configure Snowflake, create a Snowflake service user — that is, a user of type `SERVICE` — that uses WIF
to authenticate with Snowflake.

> **Before you begin:**
>
> To successfully configure Snowflake, you need the following information:
>
> * The issuer URL of your OIDC Provider.
> * The subject claim associated with your workload.
>
> You can obtain both of these values by parsing out the `iss` and `sub` claims from an issued ID token for your workload. For example,
> if you have access to a Unix-like environment with `jq`, `cat`, and `echo`, you can save your ID token to a file and run the
> following commands.
>
> ```bash
> ID_TOKEN_PATH=<id_token_path>
>
> JWS_PAYLOAD=$(cat $ID_TOKEN_PATH | jq -R 'split(".") | .[1] | gsub("-";"+") | gsub("_";"/") | @base64d | fromjson')
> echo "ISSUER = '$(echo $JWS_PAYLOAD | jq -r .iss)'"
> echo "SUBJECT = '$(echo $JWS_PAYLOAD | jq -r .sub)'"
> ```
>
> To learn how to obtain an ID token, see the documentation for your OIDC provider.

**To create a service user for your workload:**

1. Sign in to [Snowsight](ui-snowsight-gs.md).
2. To open the list of worksheets, in the navigation menu, select Projects » Worksheets.
3. To open a new SQL worksheet, select +.
4. To create a service user that uses WIF to authenticate with Snowflake, run a
   [CREATE USER](../sql-reference/sql/create-user.md) statement in the worksheet:

   ```sqlexample
   CREATE USER my_custom_service
     WORKLOAD_IDENTITY = (
       TYPE = OIDC
       ISSUER = '<issuer>'
       SUBJECT = '<subject>'
       OIDC_AUDIENCE_LIST = ('<custom_audience>')
     )
     TYPE = SERVICE;
   ```

   Where:

   * `ISSUER` and `SUBJECT` are the values that you obtained before starting these steps.
   * `OIDC_AUDIENCE_LIST` is a non-empty superset of the ID token’s audience claim set in Configure your OIDC Provider.
     You don’t have to specify `OIDC_AUDIENCE_LIST` if the ID token’s audience claim is `snowflakecomputing.com`.

#### Configure your workload to use a Snowflake driver

> **Note:**
>
> You can configure your workload to use any Snowflake driver that supports WIF. For the complete list, see
> Supported Snowflake drivers.

If your workload needs a Python driver, complete the following steps:

1. [Install the Snowflake Connector for Python](../developer-guide/python-connector/python-connector-install.md).
2. In your Python application code, add the following source code:

   ```python
   conn = snowflake.connector.connect(
     account='<snowflake_account>',
     authenticator='WORKLOAD_IDENTITY',
     workload_identity_provider='OIDC',
     token='<id_token>'
   )
   ```

   Where `id_token` is an unexpired ID token received from your OIDC Provider for your workload. To learn how to obtain
   this token, see the documentation for your OIDC provider.
3. Run your Python application. It authenticates to Snowflake using WIF.

## View service user settings

Run the [SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS](../sql-reference/sql/show-user-workload-identity-authentication-methods.md) command to view the values of the WORKLOAD_IDENTITY
parameter for the service user. For example, to view the WIF settings that the service user `my_custom_service`
uses to authenticate to Snowflake, run the following command:

```sqlexample
SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS FOR USER my_custom_service;
```

## Limitations and considerations

* Azure workloads can’t be located in Azure sovereign clouds, such as Azure China and Azure US Gov. This limitation isn’t related to the
  Snowflake region of your account.

---
title: Workspaces for dbt Projects on Snowflake
source: https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-using-workspaces.md
section: User Guide
---

# Workspaces for dbt Projects on Snowflake

Workspaces in Snowsight offer a web-based integrated development environment (IDE) for dbt projects that can connect and sync to a Git repository. Each workspace for dbt Projects on Snowflake can represent a single dbt project or multiple dbt projects, depending on how you organize your files and folders.

You can use a workspace for dbt Projects on Snowflake to visualize, test, and run dbt projects directly in Snowflake. Workspaces provide a quick way to initialize (or scaffold) a new dbt project, creating the necessary files and directories (such as `dbt_project.yml`) or create a new dbt project from an existing git repo. You can also connect the workspace to a *dbt project object* in Snowflake, so you can create and update objects from within the workspace.

In addition to supporting dbt projects, Workspaces provide a unified editor for you to create, organize, and manage code across multiple file types and projects within Snowflake. For more information, see [Workspaces](../ui-snowsight/workspaces.md).

## Limitations, requirements, and considerations for using workspaces with dbt projects

The following requirements, considerations, and limitations apply to workspaces for dbt Projects on Snowflake:

* Each dbt project folder in your Snowflake workspace must contain a `profiles.yml` file that specifies a target `warehouse`, `database`, `schema`, and `role` in Snowflake for the project. The `type` must be set to `snowflake`. dbt requires an `account` and `user`, but unlike dbt Core, these can be removed or left with an empty or arbitrary string because the dbt project runs in Snowflake under the current account and user context.
* A dbt project in a workspace can’t have more than 20,000 files in its folder structure. This limit includes all files in the dbt project directory and subdirectories, including the `target/dbt_packages/logs` directories, which is where log files are saved when a dbt project runs from within the workspace.

### Personal database requirement

Workspaces are created within a personal database and cannot be shared with other users. Personal databases must be enabled at the account level, which requires ACCOUNTADMIN privileges. For more information, see [Manage access and behavior](../ui-snowsight/workspaces.md).

Shared workspaces are created within a specific database and schema, which grants access to multiple authenticated users. Users assigned specific roles can then contribute, edit, and modify code and files simultaneously within the environment. For more information, see [Shared workspaces](../ui-snowsight/workspaces-shared.md).

### Git repositories

For requirements, considerations, and limitations that apply when you connect a workspace for dbt Projects on Snowflake to a Git repository, see
[Git in Snowflake limitations](../../developer-guide/git/git-limitations.md).

Git repositories accessed through PrivateLink must be configured beforehand. For more information, see [Configure Snowflake for access over a public network](../../developer-guide/git/git-setting-up.md).

---
title: Write support for externally managed Apache Iceberg™ tables
source: https://docs.snowflake.com/en/user-guide/tables-iceberg-externally-managed-writes.md
section: User Guide
---

# Write support for externally managed Apache Iceberg™ tables

Write support for externally managed [Apache Iceberg™ tables](tables-iceberg.md) lets you perform write operations on tables managed by an external
Iceberg REST catalog. The Iceberg table in Snowflake is linked to a table in your remote catalog.
When you make changes to the table in Snowflake, Snowflake commits the same changes to your remote catalog.

This feature expands interoperability between Snowflake and third-party systems,
so that you can use Snowflake for data engineering workloads with Iceberg even when you use an external Iceberg catalog.

The following list shows some key use cases:

* **Building complex data engineering pipelines with Iceberg tables**: Writing to Iceberg tables in external catalogs from Snowflake
  allows you to use Snowpark or Snowflake SQL to build complex pipelines that ingest, transform, and process data for Iceberg tables.
  You can query the data by using Snowflake or other engines.
  Similarly, you can use Snowflake [partner tools](ecosystem-all.md) to build your Iceberg data engineering pipelines.
* **Making your data available to the Iceberg ecosystem**: The ability to write to Iceberg tables in external catalogs lets you make your
  data available to the Iceberg ecosystem. You can query data that’s already in Snowflake and write it to Iceberg tables.
  To keep your Iceberg tables in sync with your Snowflake tables, you can use operations like INSERT INTO … SELECT FROM to do the following:

  + Copy existing data from a standard Snowflake table into an Iceberg table.
  + Insert data with Snowflake streams.

## Workflow

Use the workflow in this section to get started with this feature:

1. Configure a catalog integration with vended credentials.
   For instructions, see
   [Use catalog-vended credentials for Apache Iceberg™ tables](tables-iceberg-configure-catalog-integration-vended-credentials.md).

   The following topics explain how to configure a catalog integration for specific catalogs:

   * [Databricks Unity Catalog](tables-iceberg-configure-catalog-integration-rest-unity.md)
   * [AWS Glue](tables-iceberg-configure-catalog-integration-rest-glue.md)
   * [Snowflake Open Catalog](tables-iceberg-configure-catalog-integration-open-catalog.md)
   > **Note:**
   >
   > If your remote Iceberg catalog doesn’t support credential vending, you must configure an
   > [external volume](tables-iceberg.md) and a
   > [catalog integration](tables-iceberg.md).
   >
   > First,
   > [configure an external volume for your cloud storage provider](tables-iceberg-configure-external-volume.md).
   > Ensure that both read and write operations are allowed for the external volume by setting the ALLOW_WRITES parameter to `TRUE`,
   > which is the default. For more information about the required permissions, see [Granting Snowflake access to your storage](tables-iceberg-storage.md).
   >
   > If your Databricks workspace is on Azure, you must use an external volume that is configured to connect to Data Lake Storage Gen2
   > for write operations. For more information, see
   > [Enable interoperability with remote catalogs that use Data Lake Storage](tables-iceberg-configure-external-volume-azure.md).
   >
   > Then,
   > [configure a Apache Iceberg™ REST catalog integration for your remote Iceberg catalog](tables-iceberg-configure-catalog-integration-rest.md).
   > Your remote catalog must comply with the
   > open source [Apache Iceberg REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml),
   > If you currently use a [catalog integration for AWS Glue](tables-iceberg-configure-catalog-integration-glue.md),
   > you must create a new REST catalog integration for the AWS Glue Iceberg REST endpoint.
2. Choose from the following options:

   * Create a catalog-linked database. With this option,
     you can write to auto-discovered Iceberg tables in your catalog, or use the catalog-linked database to create additional Iceberg tables.
   * Create an Iceberg table in a standard Snowflake database.
     With this option, you must first create a table in your remote catalog before you create an externally managed Iceberg table in Snowflake.

After you complete these steps, you can perform write operations
on your Iceberg tables.

## Create a catalog-linked database

Snowflake supports creating writable externally managed tables in a catalog-linked database, which
is a Snowflake database that you sync with an external Iceberg REST catalog. You can also write to Iceberg tables that Snowflake
automatically discovers in your remote catalog. For more information, see [Use a catalog-linked database for Apache Iceberg™ tables](tables-iceberg-catalog-linked-database.md).

> **Note:**
>
> Alternatively, you can create writable externally managed Iceberg tables in a standard Snowflake database.

The following example uses the [CREATE DATABASE (catalog-linked)](../sql-reference/sql/create-database-catalog-linked.md) command to
create a catalog-linked database that uses catalog-vended credentials. This works with any REST catalog, including
Databricks Unity Catalog, AWS Glue, and Snowflake Open Catalog.

```sqlexample
CREATE DATABASE my_catalog_linked_db
  LINKED_CATALOG = (
    CATALOG = 'my_rest_catalog_int'
  );
```

> **Note:**
>
> * If you’re using an external volume to connect to your remote catalog, you must specify the `EXTERNAL_VOLUME` parameter with the
>   CREATE DATABASE (catalog-linked) command.
> * For a complete tutorial on setting up a writable catalog-linked database with Unity Catalog, see
>   [Tutorial: Set up bidirectional access to Apache Iceberg™ tables in Databricks Unity Catalog](tutorials/tables-iceberg-set-up-bidirectional-access-to-unity-catalog.md).

### Use CREATE SCHEMA to create namespaces in your external catalog

To create a namespace for organizing Iceberg tables in your external catalog, you can use the
[CREATE SCHEMA](../sql-reference/sql/create-schema.md) command with a catalog-linked database.
The command creates a namespace in your linked Iceberg REST catalog and a corresponding schema in your Snowflake database.

```sqlsyntax
CREATE SCHEMA 'my_namespace';
```

> **Note:**
>
> Schema names must be alphanumeric and can’t include delimiters.

#### DROP SCHEMA

You can also use the [DROP SCHEMA](../sql-reference/sql/drop-schema.md) command to simultaneously drop a
schema from your catalog-linked database and its corresponding namespace from your remote catalog.

```sqlexample
DROP SCHEMA 'my_namespace';
```

## Create an Iceberg table

Creating an externally managed Iceberg table that you can write to from Snowflake varies, depending on the kind of database you use:

* If you use a catalog-linked database, you can use the [CREATE ICEBERG TABLE (catalog-linked database)](../sql-reference/sql/create-iceberg-table-rest.md) to create a table
  *and* register it in your remote catalog. For instructions, see Create an Iceberg table in a catalog-linked database.
* If you use a standard Snowflake database (not linked to a catalog), you must first create a
  a table in your remote catalog. Then, you can use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) syntax to create
  an Iceberg table in Snowflake and write to it. For instructions, see Create an Iceberg table in a standard Snowflake database.

### Create an Iceberg table in a catalog-linked database

To create a table in Snowflake and in your external catalog at the same time, use the [CREATE ICEBERG TABLE (catalog-linked database)](../sql-reference/sql/create-iceberg-table-rest.md) command.

The following example creates a writable
Iceberg table by using the previously created catalog integration for AWS Glue REST that is configured with catalog-vended credentials.
It also uses the value of a
column named `first_name` to partition the table. For more information, see [Iceberg partitioning](tables-iceberg-metadata.md).

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA my_namespace;

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
  first_name STRING,
  last_name STRING,
  amount INT,
  create_date DATE
)
TARGET_FILE_SIZE = '64MB'
PARTITION BY (first_name);
```

When you run the command, Snowflake creates a new Iceberg table in your remote catalog and a linked, writable, externally managed table
in Snowflake.

### Create an Iceberg table in a catalog-linked database with hierarchical path layout

To create a table in Snowflake and in your external catalog at the same time with hierarchical path layout for the data files,
use the [CREATE ICEBERG TABLE (catalog-linked database)](../sql-reference/sql/create-iceberg-table-rest.md) command.

The following example creates a writable
Iceberg table by using the previously created catalog integration for AWS Glue REST that is configured with catalog-vended credentials.
It also uses the value of a
column named `first_name` to partition the table. For more information, see [Partitioning with hierarchical paths](tables-iceberg-metadata.md).

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA my_namespace;

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
  first_name STRING,
  last_name STRING,
  amount INT,
  create_date DATE
)
TARGET_FILE_SIZE = '64MB'
PARTITION BY (first_name)
PATH_LAYOUT = HIERARCHICAL;
```

When you run the command, Snowflake creates a new Iceberg table in your remote catalog and a linked, writable, externally managed table
in Snowflake.

With this option, Snowflake writes data to partitioned Iceberg tables by using a hierarchical path layout
for files where partitioning information is included in the file paths. You might use this option when you need to use both Snowflake and
external engines to write to the same Iceberg table by using a hierarchical path layout for partitions.

### Create an Iceberg table in a standard Snowflake database

If using a standard Snowflake database, you must first create a table in your remote catalog. For example, you might use Spark to write
an Iceberg table to Open Catalog.

After you create the table in your remote catalog, use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) command to create an
Iceberg table object in Snowflake. For the CATALOG_TABLE_NAME, specify the name of the table as it appears in your remote catalog.

For example:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'my_rest_catalog_integration'
  CATALOG_TABLE_NAME = 'my_remote_table_name';
```

When you run the command, Snowflake creates a writable, externally managed table
in Snowflake that is linked to the existing table in your remote catalog.

> **Note:**
>
> If you create partition columns on the table outside of Snowflake,
> Snowflake infers the partitions from the table metadata.
> For more information about partitioning, see [Iceberg partitioning](tables-iceberg-metadata.md).

## Dropping an Iceberg table

You can simultaneously drop a writable externally managed Iceberg table from Snowflake and from your remote catalog by using the
[DROP ICEBERG TABLE](../sql-reference/sql/drop-iceberg-table.md) command.

```sqlexample
DROP ICEBERG TABLE my_iceberg_table;
```

Snowflake drops the table and also
makes a call to your remote Iceberg catalog, instructing it to drop the table and delete the table’s underlying data and metadata.

Snowflake only drops the table after confirming that the table has successfully been dropped from the remote catalog.

> **Note:**
>
> * If you use the AWS Glue Data Catalog as your external catalog, dropping an externally managed table through Snowflake does not delete
>   the underlying table files. This behavior is specific to the AWS Glue Data Catalog implementation.
> * Dropping an Amazon S3 Table isn’t currently supported.

## Writing to externally managed Iceberg tables

You can use the following DML commands for externally managed Iceberg tables:

* [INSERT](../sql-reference/sql/insert.md)

  + [Example: Multi-row insert using a query (INSERT INTO … SELECT FROM)](../sql-reference/sql/insert.md)
  + [Example: INSERT INTO … SELECT FROM (stream)](streams-examples.md)
* [UPDATE](../sql-reference/sql/update.md)
* [DELETE](../sql-reference/sql/delete.md)
* [MERGE](../sql-reference/sql/merge.md)
* [TRUNCATE TABLE](../sql-reference/sql/truncate-table.md)
* [COPY INTO <table>](../sql-reference/sql/copy-into-table.md). For more information, see [Load data into Apache Iceberg™ tables](tables-iceberg-load.md).

You can also use the [Snowpark API](../developer-guide/snowpark/index.md) to process Iceberg tables.

### Examples

You can use the following basic examples to get started with writing to Iceberg tables.

#### INSERT

Use [INSERT](../sql-reference/sql/insert.md) to insert values into an Iceberg table:

```sqlexample
INSERT INTO my_iceberg_table VALUES (1, 'a');
INSERT INTO my_iceberg_table VALUES (2, 'b');
INSERT INTO my_iceberg_table VALUES (3, 'c');
```

#### UPDATE

Use [UPDATE](../sql-reference/sql/update.md) to update the values in an Iceberg table:

```sqlexample
UPDATE my_iceberg_table
  SET a = 10
  WHERE b = 'b';
```

#### DELETE

Use [DELETE](../sql-reference/sql/delete.md) to remove values from an Iceberg table:

```sqlexample
DELETE my_iceberg_table
  WHERE b = 'a';
```

#### MERGE

Use [MERGE](../sql-reference/sql/merge.md) on an Iceberg table:

```sqlexample
MERGE INTO my_iceberg_table USING my_snowflake_table
  ON my_iceberg_table.a = my_snowflake_table.a
  WHEN MATCHED THEN
      UPDATE SET my_iceberg_table.b = my_snowflake_table.b
  WHEN NOT MATCHED THEN
      INSERT VALUES (my_snowflake_table.a, my_snowflake_table.b);
```

#### COPY INTO <table>

Use [COPY INTO <table>](../sql-reference/sql/copy-into-table.md) to load data into an Iceberg table.

```sqlexample
COPY INTO customer_iceberg_ingest
  FROM @my_parquet_stage
  FILE_FORMAT = 'my_parquet_format'
  MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
```

For more information, see [Load data into Apache Iceberg™ tables](tables-iceberg-load.md) for more information.

#### Change Data Capture using streams

A [table stream](streams-intro.md) tracks changes made to rows in a source table for Change Data Capture (CDC).
The source table can be a standard Snowflake table, a Snowflake-managed Iceberg table, or an externally managed Iceberg table.
You can insert the changes into an externally managed Iceberg table using the INSERT INTO… SELECT FROM… command.

> **Note:**
>
> If your source table is an externally managed Iceberg table, you must use INSERT_ONLY = TRUE when you create the stream.

```sqlexample
CREATE OR REPLACE STREAM my_stream ON TABLE my_snowflake_table;

//...

INSERT INTO my_iceberg_table(id,name)
  SELECT id, name
  FROM my_stream;
```

#### Using Snowpark

Create a function to copy data into an Iceberg table from a Snowflake table by using Snowpark Python.

```python
def copy_into_iceberg():

  try:
      df = session.table("my_snowflake_table")

      df.write.save_as_table("my_iceberg_table")

  except Exception as e:
      print(f"Error processing {table_name}: {e}")
```

## Troubleshooting

If an issue occurs when Snowflake attempts to commit table changes to your external catalog, Snowflake returns one of the
following error messages.

|  |  |
| --- | --- |
| Error | ```output 004185=SQL Execution Error: Failed while committing transaction to external catalog. Error:''{0}'' ```  Or:  ```output 004185=SQL Execution Error: Failed while committing transaction to external catalog with unresolvable commit conflicts. Error:''{0}'' ``` |
| Cause | A commit to the external catalog failed, where `{0}` is the exception returned by the external catalog (if available); otherwise, Snowflake reports `Exception unavailable` as the cause. The error message includes `with unresolvable commit conflicts` if Snowflake encountered an unresolvable commit conflict while attempting to commit a transaction to the external catalog. |

|  |  |
| --- | --- |
| Error | ```output 004500=SQL Execution Error: Cannot verify the status of transaction from external catalog. The statement ''{0}'' with transaction id {1} may or may not have committed to external catalog. Error:''{2}'' ``` |
| Cause | A commit to the external catalog resulted in no response from the external catalog. The message includes the exception returned by the external catalog (if available); otherwise, Snowflake reports `Exception unavailable` as the cause. |

|  |  |
| --- | --- |
| Error | ```output SQL Execution Error: An error occurred while interacting with the external catalog. Please check the external catalogs logs for more details: Entity size has exceeded the maximum allowed size. ``` |
| Cause | AWS Glue has limits on the size of metadata that can be stored for a table. When the table’s accumulated metadata — such as old snapshots and associated manifest files — exceeds this limit, Glue rejects the commit. |
| Solution | Reduce the table’s metadata size by expiring old snapshots. You can use an Apache Spark procedure to expire snapshots for the affected table. For example:  ```sqlexample CALL catalog_name.system.expire_snapshots('db_name.table_name'); ```  For more information, see [Expire Snapshots](https://iceberg.apache.org/docs/latest/spark-procedures/#expire_snapshots) in the Apache Iceberg™ documentation. |

## Considerations

Consider the following when you use write support for externally managed Iceberg tables:

* Snowflake supports externally managed writes for Iceberg tables that use version 2 of the
  [Iceberg table specification](https://iceberg.apache.org/spec/).
* Snowflake provides Data Definition Language (DDL) and Data Manipulation Language (DML) commands for externally managed tables. However,
  you configure metadata and data retention using your external catalog and the tools provided by your external storage provider.
  For more information, see [Tables that use an external catalog](tables-iceberg-metadata.md).

  For writes, Snowflake ensures that changes are committed to your remote catalog before updating the table in Snowflake.
* If you use a catalog-linked database, you can use the CREATE ICEBERG TABLE syntax with column definitions to create a table in Snowflake
  *and* in your remote catalog. If you use a standard Snowflake database (not linked to a catalog), you must first create a
  table in your remote catalog. After that, you can use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) syntax to create
  an Iceberg table in Snowflake and write to it.
* For the AWS Glue Data Catalog: Dropping an externally managed table through Snowflake doesn’t delete
  the underlying table files. This behavior is specific to the AWS Glue Data Catalog implementation.
* You can’t drop an Amazon S3 Table through Snowflake. The Amazon S3 Tables service requires
  the `purge` option to be specified with the DROP command, which Snowflake doesn’t currently support.
* Position [row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) are supported for tables stored on
  Amazon S3, Azure, or Google Cloud. Row-level deletes with equality delete files aren’t supported. For more information about row-level deletes,
  see [Use row-level deletes](tables-iceberg-manage.md). To turn off position deletes, which enable
  running the DML operations in copy-on-write mode, set the
  `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or database level.
* Writing to externally managed tables with the following Iceberg data types isn’t supported:

  + `uuid`
  + `fixed(L)`
* The following features aren’t currently supported when you use Snowflake to write to externally managed Iceberg tables:

  + Server-side encryption (SSE) for Azure external volumes.
  + Multi-statement transactions. Snowflake supports autocommit transactions only.
  + Conversion to Snowflake-managed tables.
  + External Iceberg catalogs that don’t conform to the Iceberg REST protocol.
  + Using the OR REPLACE option when creating a table.
  + Using the CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT syntax if you use one of the following catalogs as your remote catalog:

    - AWS Glue
    - Databricks Unity Catalog

    Alternatively, you can use the [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql-reference/sql/create-iceberg-table-rest.md) syntax to create an empty Iceberg table and then use
    an [INSERT INTO … SELECT](../sql-reference/sql/insert.md) statement to insert data into the empty table. However, this alternative
    uses two separate transactions, so it doesn’t guarantee atomicity.
* For creating schemas in a catalog-linked database, be aware of the following:

  + The CREATE SCHEMA command creates a corresponding namespace in your remote catalog only when you use a catalog-linked database.
  + The ALTER and CLONE options aren’t supported.
  + Delimiters aren’t supported for schema names. Only alphanumeric schema names are supported.

* You can set a target file size for a table’s Parquet files. For more information, see [Set a target file size](tables-iceberg-manage.md).
* For Azure cloud storage services: Snowflake only supports externally managed writes for Iceberg tables that use the following services for external storage:

  + Blob Storage
  + Data Lake Storage Gen2

    [Preview feature](../release-notes/preview-features.md) — Open

    Available to all accounts.

    Connecting Snowflake to Data Lake Storage Gen2 storage by using an external volume is in public preview. This configuration enables externally managed
    writes to catalogs that
    are only configured to use Data Lake Storage, such as Unity Catalog. For more information, see [Configure an external volume for Azure](tables-iceberg-configure-external-volume-azure.md)

    > **Note:**
    >
    > Connecting Snowflake to Data Lake Storage Gen2 storage by using catalog-vended credentials isn’t supported.
  + General-purpose v1
  + General-purpose v2
  + Microsoft Fabric OneLake
* Sharing:

  + Sharing with a listing isn’t currently supported.
  + Direct sharing isn’t currently supported.

---
title: YAML specification for semantic views
source: https://docs.snowflake.com/en/user-guide/views-semantic/semantic-view-yaml-spec.md
section: User Guide
---

# YAML specification for semantic views

Semantic views are schema-level objects that define business concepts over your data, making it easier for users to query and analyze data using business terminology. You can use the YAML specification to create a semantic view in Cortex Analyst or use the [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) stored procedure to create a semantic view from a YAML specification.

## Overview

**Semantic views are the recommended approach** for defining business semantics in Snowflake. They are schema-level objects
that integrate with Snowflake’s privilege system, sharing mechanisms, and metadata catalog.

> **Note:**
>
> Legacy semantic model YAML files (stored on stages) can still be used with Cortex Analyst for backward compatibility,
> but we recommend using semantic views for new implementations.

The benefits of semantic views over legacy semantic models are:

* **Native Snowflake integration**: Schema-level objects with full RBAC, sharing, and catalog support
* **Advanced features**: Support for derived metrics and access modifiers (public/private)
* **Better governance**: Integrated with Snowflake’s privilege and sharing systems
* **Simplified management**: No need to manage YAML files on stages

## YAML format

Semantic views can take a [YAML](https://yaml.org/) specification to define their behavior, allowing for readable, plain-text definitions.

The general syntax of a semantic view YAML specification is:

```yaml
# Name and description of the semantic view.
name: <name>
description: <string>

# Logical table-level concepts
# A semantic view can contain one or more logical tables.
tables:
  # A logical table on top of a base table.
  - name: <name>
    description: <string>
    # The fully qualified name of the base table.
    base_table:
      database: <database>
      schema: <schema>
      table: <base table name>

    # Dimension columns in the logical table.
    dimensions:
      - name: <name>
        synonyms: <array of strings>
        description: <string>
        expr: <SQL expression>
        data_type: <data type>
        unique: <boolean>
        cortex_search_service:
          service: <string>
          literal_column: <string>
          database: <string>
          schema: <string>
        is_enum: <boolean>
    - ...
    # Time dimension columns in the logical table.
    time_dimensions:
      - name: <name>
        synonyms: <array of strings>
        description: <string>
        expr: <SQL expression>
        data_type: <data type>
        unique: <boolean>

    # Fact columns in the logical table.
    facts:
      - name: <name>
        synonyms: <array of strings>
        description: <string>
        access_modifier: <public_access | private_access>  # Default is public_access.
        expr: <SQL expression>
        data_type: <data type>

    # Regular metrics scoped to the logical table.
    metrics:
      - name: <name>
        synonyms: <array of strings>
        description: <string>
        access_modifier: <public_access | private_access>  # Default is public_access.
        expr: <SQL expression>
        non_additive_dimensions:
        - table: <table name>
          dimension: <dimension name>
          sort_direction: <ascending | descending>
          null_order: <first | last>
        using_relationships:
        - <relationship_name>

    # Commonly used filters over the logical table.
    filters:
      - name: <name>
        synonyms: <array of strings>
        description: <string>
        expr: <SQL expression>

# View-level concepts
# Relationships between logical tables
relationships:
  - name: <string>
    left_table: <table>
    right_table: <table>
    relationship_columns:
      - left_column: <column>
        right_column: <column>
      - left_column: <column>
        right_column: <column>

# Derived metrics scoped to the semantic view.
# Derived metrics combine metrics from multiple tables.
metrics:
  - name: <name>
    synonyms: <array of strings>
    description: <string>
    access_modifier: <public_access | private_access>  # Default is public_access
    expr: <SQL expression>

# Additional context concepts
# Verified queries with example questions and queries that answer them
verified_queries:
  - name: <string>       # A descriptive name of the query.
    question: <string>   # The natural language question that this query answers.
    verified_at: <int>   # Optional: Time (in seconds since the UNIX epoch, January 1, 1970) when the query was verified.
    verified_by: <string> # Optional: Name of the person who verified the query.
    use_as_onboarding_question: <boolean>  # Optional: Marks this question as an onboarding question for the end user.
    sql: <string>        # The SQL query for answering the question
```

> **Important:**
>
> **Semantic views do not require** the `join_type` or `relationship_type` fields that were used in legacy semantic
> models. The relationship type is automatically inferred from the data.

## Key concepts

### Tables

Logical tables represent business entities (such as customers, orders, or products) and map to physical database tables.
Each logical table can define:

* **Base table**: The fully qualified name of the physical table
* **Primary key**: Columns that uniquely identify rows
* **Synonyms**: Alternative names for the table
* **Description**: Business-friendly explanation of what the table represents

### Dimensions

Dimensions represent categorical attributes that provide context for analysis. They answer “who,” “what,” “where,” and
“when” questions. Dimensions can be:

* **Regular dimensions**: Text, numeric, or other categorical values
* **Time dimensions**: Date or timestamp columns with special time-based handling

#### Properties of dimensions

* `expr`: SQL expression to calculate the dimension value
* `synonyms`: Alternative terms users might use
* `unique`: Whether values are unique across rows
* `is_enum`: Whether the dimension has a fixed set of values
* `cortex_search_service`: Optional Cortex Search service for semantic search

#### Optional properties for physical dimensions

These fields are optional, but recommended for producing higher-quality results from a semantic view search.

`synonyms`
:   A list of other terms/phrases used to refer to this dimension. Must be unique across all synonyms in this semantic model.

`description`
:   A brief description of this dimension. Include information that provides useful context, such as data this dimension represents.

`unique`
:   A boolean value that indicates this dimension has unique values.

`sample_values`
:   Sample values of this column, if any. Add any value that is likely to be referenced in the user questions.

`is_enum`
:   A Boolean value. If `True`, the values in the `sample_values` field are taken to be the full list of possible values,
    and the model only chooses from those values when filtering on that column.

`cortex_search_service`
:   Specifies the Cortex Search Service to use for this dimension. It has the following fields:

    * `service`: The name of the Cortex Search Service.
    * `literal_column`: (optional) The column in the Cortex Search Service that contains the literal values.
    * `database`: (optional) The database where the Cortex Search Service is located. Defaults to `base_table`’s database.
    * `schema`: (optional) The schema where the Cortex Search Service is located. Defaults to `base_table`’s schema.

    `cortex_search_service` replaces the `cortex_search_service_name` field, which could only specify the name. `cortex_search_service_name` has been deprecated.

#### Optional properties for time dimensions

These fields are optional, but recommended for producing higher-quality results from a semantic view search.

`synonyms`
:   A list of other terms/phrases used to refer to this time dimension. Must be unique across all synonyms in this semantic model.

`description`
:   A brief description of this dimension. Include information that provides useful context, such as the time zone that this dimension uses as a reference point.

`unique`:
:   A boolean value that indicates this column has unique values.

`sample_values`:
:   Sample values of this column, if any. Add any values that are likely to be referenced in the user questions.

### Facts

Facts are row-level quantitative attributes that represent specific business events or transactions. Facts capture
“how much” or “how many” at the most granular level, such as individual sales amounts, quantities purchased, or costs.

Facts typically function as “helper” concepts within the semantic view to help construct dimensions and metrics.

The properties of facts are:

* `expr`: SQL expression to calculate the fact value
* `access_modifier`: Set to `private_access` to hide from queries (useful for intermediate calculations)
* `data_type`: The data type of the fact

### Metrics

Metrics are quantifiable measures of business performance calculated by aggregating facts or other columns using functions
like SUM, AVG, and COUNT.

Two types of metrics:

1. **Table-level metrics**: Scoped to a specific logical table, aggregating data within that table
2. **Derived metrics**: View-level metrics that combine metrics from multiple tables

#### Properties of metrics

* `expr`: SQL expression with aggregation function
* `access_modifier`: Set to `private_access` to hide from queries (useful for intermediate calculations)
* `synonyms`: Alternative terms for the metric

#### Optional properties of metrics

* If you want to
  [specify the dimensions that should be non-additive for the metric](sql.md), use the
  following fields:

  `non_additive_dimensions`
  :   Specifies the dimensions that the metric should not be aggregated across.

      `table`
      :   Name of the logical table containing the dimension.

      `dimension`
      :   Name of the dimension.

      `sort_direction`
      :   [Sort order for the non-additive dimension](sql.md). You can specify one of
          the following values:

          + `ascending`: Sort the dimension values in ascending order.
          + `descending`: Sort the dimension values in descending order.

          Default: `ascending`

      `null_order`
      :   Specifies whether NULLs are
          [sorted before or after non-NULL values](sql.md).
          You can specify one of the following values:

          + `first`: NULLs are sorted before non-NULL values.
          + `last`: NULLs are sorted after non-NULL values.

          Default: Depends on the value in the `sort_direction` field (`ascending` or `descending`); see
          [the usage notes in the ORDER BY documentation](../../sql-reference/constructs/order-by.md).

      > **Note:**
      >
      > Because the rows are sorted by the non-additive dimensions, the order in which you specify the dimensions is important. This
      > is similar to the order in which you specify columns in the [ORDER BY](../../sql-reference/constructs/order-by.md) clause.

  The following example specifies that the `m_account_balance` metric cannot be aggregated by the `year_dim` and `month_dim`
  dimensions:

  ```yaml
  metrics:
    - name: m_account_balance
      ...
      non_additive_dimensions:
      - table: bank_accounts
        dimension: year_dim
        sort_direction: ascending
        null_order: last
      - table: bank_accounts
        dimension: month_dim
        sort_direction: descending
        null_order: first
  ```
* If there are multiple relationship paths between two specific logical tables in a semantic view, use the following field to
  [specify the relationship path to use](sql.md):

  `using_relationships`
  :   [Preview Feature](../../release-notes/preview-features.md) — Open

      Available to all accounts.

      Specifies the name of the relationship to use to join the logical tables when calculating the metric.

### Derived metrics

Derived metrics are view-level metrics not tied to a specific table. They can combine metrics from multiple tables
or perform calculations across the entire view.

Example of a derived metric:

```yaml
metrics:
  - name: total_profit_margin
    description: "Overall profit margin across all products"
    expr: (orders.total_revenue - orders.total_cost) / orders.total_revenue
    access_modifier: public_access
```

### Relationships

Relationships define how logical tables join together. Each relationship specifies:

* `left_table`: The table containing the foreign key
* `right_table`: The table being referenced
* `relationship_columns`: Pairs of columns to join on, as `left_column` and `right_column`

The relationship type (one-to-one, many-to-one) is automatically inferred from the data and primary key definitions.

> **Note:**
>
> Unlike legacy semantic models, semantic views do not require explicit `join_type` or `relationship_type`
> specifications. These are determined automatically.

### Filters

Filters define commonly used filtering conditions that can be referenced by name. This helps ensure consistent
filtering logic across queries.

Example:

```yaml
filters:
  - name: active_customers
    description: "Customers who have made a purchase in the last 12 months"
    expr: "customer_last_purchase_date >= DATEADD(month, -12, CURRENT_DATE())"
```

### Verified queries

Verified queries are example questions with their corresponding SQL queries. They help Cortex Analyst understand
how to answer similar questions and serve as documentation for users.

Properties:

* `question`: Natural language question
* `sql`: SQL query that answers the question
* `verified_by`: Optional person who verified the query is correct
* `verified_at`: Optional timestamp when verified
* `use_as_onboarding_question`: Optional flag to show this as a suggestion to users

## Access modifiers

Semantic views support access modifiers for facts and metrics, allowing you to control visibility:

* `public_access` (default): Visible and queryable by users
* `private_access`: Hidden from queries, used only for intermediate calculations

Example:

```yaml
facts:
  - name: internal_cost
    expr: unit_cost * quantity
    data_type: NUMBER
    access_modifier: private_access  # Not visible in queries

metrics:
  - name: total_revenue
    expr: SUM(sale_amount)
    access_modifier: public_access  # Visible in queries
```

## Custom instructions for Cortex Analyst

You can use SQL commands to provide custom instructions in the semantic view definition.
These instructions guide how the queries are generated and how questions are categorized. These instructions are not
part of the YAML specification but are set using the [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md) command.

For more information, see [Providing custom instructions for Cortex Analyst](sql.md).

## Example semantic view YAML

Here’s a complete example of a semantic view YAML specification:

```yaml
name: revenue_analysis
description: "Semantic view for analyzing revenue across products and customers"

tables:
  - name: customers
    description: "Customer information"
    base_table:
      database: sales_db
      schema: public
      table: customers
    dimensions:
      - name: customer_name
        synonyms: ["client name", "customer"]
        description: "Full name of the customer"
        expr: c_name
        data_type: VARCHAR
      - name: customer_segment
        synonyms: ["segment", "market segment"]
        description: "Customer market segment"
        expr: c_mktsegment
        data_type: VARCHAR
        is_enum: true

  - name: orders
    description: "Order information"
    base_table:
      database: sales_db
      schema: public
      table: orders
    dimensions:
      - name: order_date
        description: "Date when order was placed"
        expr: o_orderdate
        data_type: DATE
    time_dimensions:
      - name: order_year
        description: "Year when order was placed"
        expr: YEAR(o_orderdate)
        data_type: NUMBER
    facts:
      - name: order_total
        description: "Total order amount"
        expr: o_totalprice
        data_type: NUMBER
    metrics:
      - name: total_orders
        description: "Total number of orders"
        expr: COUNT(*)
      - name: average_order_value
        description: "Average order value"
        expr: AVG(o_totalprice)

relationships:
  - name: orders_to_customers
    left_table: orders
    right_table: customers
    relationship_columns:
      - left_column: o_custkey
        right_column: c_custkey

metrics:
  - name: revenue_per_customer
    description: "Average revenue per customer"
    expr: orders.total_revenue / customers.customer_count
    access_modifier: public_access

verified_queries:
  - name: top_customers_by_revenue
    question: "Who are the top 10 customers by revenue?"
    sql: |
      SELECT
        customer_name,
        SUM(order_total) as total_revenue
      FROM revenue_analysis
      GROUP BY customer_name
      ORDER BY total_revenue DESC
      LIMIT 10
    use_as_onboarding_question: true
```

## Creating a semantic view from YAML

To create a semantic view from a YAML specification, use the
[SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md) stored procedure.

For more information, see [Creating a semantic view from a YAML specification](sql.md).

## Getting YAML from a semantic view

To export a semantic view to YAML format, use the
[SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_read_yaml_from_semantic_view.md) function.

For more information, see [Getting the YAML specification for a semantic view](sql.md).

## Differences from legacy semantic models

If you’re migrating from legacy semantic model YAML files to semantic views, note these key differences:

| Feature | Legacy semantic models | Semantic views |
| --- | --- | --- |
| Storage | YAML files on stages | Schema-level objects in database |
| Privileges | Stage-based access control | Full Snowflake RBAC integration |
| Sharing | Manual file sharing | Native Snowflake sharing |
| Join types | Requires `join_type` and `relationship_type` | Automatically inferred |
| Derived metrics | Not supported | Fully supported |
| Access modifiers | Not supported | `public_access` / `private_access` |
| Custom instructions | In YAML file | Set via SQL commands |

When converting from a legacy semantic model to a semantic view:

1. Remove `join_type` and `relationship_type` from relationships
2. Consider using derived metrics for view-level calculations
3. Add `access_modifier` to facts/metrics you want to make private
4. Move custom instructions to SQL CREATE SEMANTIC VIEW command

## Snowflake CLI

Command-line interface for managing Snowflake objects and CI/CD workflows.

---
title: About project definition files
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/about.md
section: Snowflake CLI
---

# About project definition files

When developing Streamlit or Snowpark applications you often work with multiple files and objects, be it python file or stored procedures. Organizing this in a clear and concise way is very important for smooth development experience. That’s the reason why Snowflake CLI is using the concept of *project definition files*.

A project definition file (usually named `snowflake.yml`) is a file containing information about the Snowflake objects you are developing. The following `snowflake.yml` example shows a project with a Snowpark UDF and a stored procedure.

```yaml
definition_version: 2
entities:
  test_function:
    type: "function"
    stage: "dev_deployment"
    artifacts: ["app/"]
    handler: "functions.hello_function"
    signature: ""
    returns: string

  hello_procedure:
    type: "procedure"
    stage: "dev_deployment"
    artifacts: ["app/"]
    handler: "procedures.hello_procedure"
    signature:
      - name: "name"
        type: "string"
    returns: string
```

## Project definition properties

The following table describes the project definition properties used by all projects.

Common project definition properties

| Property | Definition |
| --- | --- |
| **definition_version**  *required*, *int* | Version of the project definition schema, which is currently 2. |
| **entities**  *optional*, *string* | List of entity definitions, such as procedures, functions, and so on. For more information, see [Specify entities](specify-entities.md). |
| **env**  *optional*, *string sequence* | List of default environment specifications to be used in project templates. For more information, see [Create project definition file templates](create-templates.md). |
| **mixins**  *optional*, *string sequence* | List of common values for entity properties. For more information, see [Project mixins](specify-entities.md). |

Each project requires specific information about what you are building. Snowflake CLI currently supports the following entity definitions from the following Snowflake domains:

* [Native App Framework](../native-apps/project-definitions.md)
* [Notebooks](../notebooks/use-notebooks.md)
* [Snowpark](../snowpark/create.md)
* Snowpark Container Services (SPCS)

  + [Compute pools](../services/manage-compute-pools.md)
  + [Image repositories](../services/manage-images.md)
  + [Services](../services/manage-services.md)
* [Streamlit](../streamlit-apps/manage-apps/initialize-app.md)
* [SQL](../sql/execute-sql.md)

> **Caution:**
>
> Files inside a project directory are processed by Snowflake CLI and could be uploaded to Snowflake when executing other `snow` commands. You should use caution when putting any sensitive information inside files in a project directory.

---
title: About Snowflake Native App projects
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/about-projects.md
section: Snowflake CLI
---

# About Snowflake Native App projects

From the point of view of Snowflake Native App, a project encompasses a codebase that can be added to an application package in a Snowflake account. It includes references to all the extension code that app functionality needs, references to external databases for shared content, as well as required files such as [manifest.yml](../../native-apps/manifest-overview.md), an [environment.yml](../../streamlit/app-development/dependency-management.md) (for a Streamlit app), and any code artifacts such as JAR files and images. It also includes a configuration to describe how the application package can be built from the files in the project folder.

A Snowflake Native App project is simply a set of files in a directory; like other code repositories, these files can be version-controlled using technologies like git and shared on platforms like Github.

To give you an idea of what a Snowflake Native App project should look like, Snowflake has created a few templates that are available for you to clone through Snowflake CLI commands. You can access these publicly available templates from the [Snowflake Git repository](https://github.com/snowflakedb/snowflake-cli-templates) and even create projects directly from them using Snowflake CLI. You can also create and share your own templates. For more information, see [Bootstrapping a project from a template](../bootstrap-project/bootstrap.md).

> **Caution:**
>
> Snowflake CLI processes the files inside a project directory. These files can be uploaded to Snowflake by other `snow app` commands, so you should use caution when putting any sensitive information inside files in a project directory.

---
title: Alter command behavior using templates
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/alter-with-templates.md
section: Snowflake CLI
---

# Alter command behavior using templates

You can use templates to alter the definition using environment variables. For example, the following project definition templates the schema for a Streamlit dashboard:

```yaml
definition_version: "1.1"
env:
  schema: "test"
streamlit:
  name: "MY_APP"
  schema: <% ctx.env.schema %>
```

This feature lets you to alter the behavior of the `snow streamlit deploy` command by setting a `schema` environment variable. Using this approach, you can deploy the same dashboard to multiple different schemas by entering the following commands to deploy different schemas:

```snowcli
schema="staging"; snow streamlit deploy
schema="prod"; snow streamlit deploy
```

> **Note:**
>
> The variables and environment variables are case-sensitive.

You can also use the template feature without defining variables in the `env` section. If a variable is not present in `env` section, Snowflake CLI looks for corresponding environment variables. For example, if you define a Streamlit application similar to the following, you can still alter the behavior of `snow streamlit deploy` by specifying a `schema` environment variable.

```yaml
definition_version: "1.1"
streamlit:
  name: "MY_APP"
  schema: <% ctx.env.schema %>
```

---
title: Bootstrapping a project from a template
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/bootstrap-project/bootstrap.md
section: Snowflake CLI
---

# Bootstrapping a project from a template

To make it easier for you to instantiate projects, Snowflake CLI implements project templating. You can
create your own project templates or use samples provided by Snowflake in the [Snowflake CLI templates](https://github.com/snowflakedb/snowflake-cli-templates/) public Git repository.

The [snow init](../command-reference/bootstrap-commands/init.md) command creates a project directory and populates it with file structure defined in the specified template.

* If you don’t provide the `--no-interactive` option, the command prompts for each variable specified by the template (`template.yml`) that you don’t provide with the `-D` (or `--variable`) option.
* If you do provide the `--no-interactive` option, the command uses the default values of variables (defined by the template). If the template does not define a default value for a variable and you don’t use the `-D` option to provide it, the command exits with an error.

The `snow init` command uses the following syntax:

```bash
snow init PATH [--template-source SOURCE] [--template NAME] [-D key1=value1 -D key2=value2...] [--no-interactive]
```

where:

* `PATH` is a new directory where the command initializes the project. If you specify an existing directory, the command exits with an error.
* `[--template-source SOURCE]` is one of the following:

  + A local file path of the template directory.
  + A valid Git URL to the directory containing the project template. If not specified, the command defaults to the [Snowflake CLI templates](https://github.com/snowflakedb/snowflake-cli-templates/) Git repository.
* `[--template NAME]` specifies which subdirectory of `SOURCE` to use as a template (useful for remote sources). If not provided, `SOURCE` is treated as a single template.
* `[-D key1=value1 -D key2=value2...]` is a list of one or more name-value pairs, providing values for variables defined in the template (in `template.yml`). The command does not prompt for variables you provide with this option.
* `[--no-interactive]` disables prompts for user input. If you use this option, you must provide all of the required values with the `[-D key1=value1 -D key2=value2...]` options; otherwise, the command exists with an error.

For more information, see the [snow init](../command-reference/bootstrap-commands/init.md) command reference.

## Examples

* Initialize project from `example_snowpark` template from default repository:

  ```snowcli
  snow init my_snowpark_test_app --template example_snowpark
  ```

  The command prompts for (default values are shown in square brackets):

  ```output
  Project identifier (used to determine artifacts stage path) [my_snowpark_project]:
  What stage should the procedures and functions be deployed to? [dev_deployment]: snowpark
  Initialized the new project in my_snowpark_test_app
  ```
* Initialize the project from the local template.

  ```snowcli
  snow init new_streamlit_project --template-source ../local_templates/example_streamlit -D query_warehouse=dev_wareshouse -D stage=testing
  ```

  In this example, `query_warehouse` and `stage` variables are specified with the `-D` option, so the command only prompts for the following:

  ```output
  Name of the streamlit app [streamlit_app]:
  Initialized the new project in new_streamlit_project
  ```

## Creating custom templates

### Template layout

A project template requires a `template.yml` file that contains data that explains how the `snow init` command should render the template. If the file is not present in the template’s root directory, `snow init` finishes with an error.
For more information, see template.yml syntax.

### Template syntax

Template variables and expressions should be enclosed in `<! ... !>`.
Snowflake CLI also supports basic jinja2 expressions and filters, for example:

> ```yaml
> some_file_spec:
>   filename: <! file_name !>
>   size: "<! [ max_file_size_mb, 4 ] | max !> MB"
> ```

Snowflake CLI project templates also support the following reserved variable and filter:

* `project_dir_name` variable, which automatically resolves to the root directory of the created project.

  For example, suppose your `snowflake.yml` file contains the following:

  ```yaml
  definition_version: "1.1"
  snowpark:
    project_name: <! project_dir_name !>
    ...
  ```

  If you then execute the following command to initialize the project from your custom template:

  ```snowcli
  snow init examples/new_snowpark_project --template-source my_example_template/
  ```

  The `snow init` command renders the `snowflake.yml` file as follows:

  ```yaml
  definition_version: "1.1"
  snowpark:
    project_name: new_snowpark_project
    ...
  ```
* `to_snowflake_identifier` filter, which formats user-provided strings into to correctly-formatted Snowflake identifiers.

  Snowflake strongly recommends using this filter when a variable references a Snowflake object.

  For example, suppose your `snowflake.yml` file contains the following:

  ```yaml
  definition_version: "1.1"
  streamlit:
    name: <! name | to_snowflake_identifier !>
    ...
  ```

  If you then execute the following command to initialize a project from your custom template:

  ```snowcli
  snow init examples/streamlit --template-source my_example_template2/ -D name='My test streamlit'
  ```

  The `snow init` command renders the `snowflake.yml` file as follows:

  ```yaml
  definition_version: "1.1"
  streamlit:
    name: My_test_streamlit
    ...
  ```

  If a string cannot be converted into a valid Snowflake identifier, the `snow init` command exits with an error, as shown:

  ```snowcli
  snow init examples/streamlit --template-source my_example_template2/ -D name=1234567890
  ```

  ```output
  ╭─ Error ────────────────────────────────────────────────────────────────────────╮
  │ Value '123456789' cannot be converted to valid Snowflake identifier.         │
  │ Consider enclosing it in double quotes: ""                                   │
  ╰────────────────────────────────────────────────────────────────────────────────╯
  ```

### About the `template.yml` project template file

The `template.yml` project template file stores all of the data needed to render the project. For example:

```yaml
minimum_cli_version: "2.7.0"
files_to_render:
  - snowflake.yml
variables:
  - name: name
    default: streamlit_app
    prompt: "Name of the streamlit app"
    type: string
  - name: stage
    default: my_streamlit_stage
    prompt: "What stage should the app be deployed to?"
    type: string
  - name: query_warehouse
    default: my_streamlit_warehouse
    prompt: "On which warehouse SQL queries issued by the application are run"
    type: string
```

The following table lists the properties in a `template.yml` project template file.

Template properties

| Property | Definition |
| --- | --- |
| `minimum_cli_version`  *optional*, *string* (default:None) | Minimum Snowflake CLI version. If specified, the `snow init` command checks the version of Snowflake CLI installed and exits with an error if the installed version is lower than the specified version. |
| `files_to_render`  *optional*, *string list* (default: `[]`) | List of files to be rendered by the `snow init` command. Each path should be relative to the templates root.  **Note:** Template files not included in this list are added to the new project, but their content remains unchanged. |
| `variables`  *optional*, *variable list* (default: `[]`) | List of template variables. It supports customizing prompts, providing default values for optional variables and basic type checking. See the **Variables property parameters** table below for more details. Variable values are determined in order from this list.  If you omit any variable used in the `snowflake.yml` file from this list, the `snow init` command exits with the following error.  ```output ╭─ Error ─────────────────────────────────────────────────────────╮ │ Cannot determine value of variable undefined_variable         │ ╰─────────────────────────────────────────────────────────────────╯ ``` |

The following table lists the parameters of a variable property.

Variable property parameters

| Property | Definition |
| --- | --- |
| `name`  *required*, *string* | Name of the variable. It is used in the template files, such as `<! name !>` and in `-D` option, such as `-D name=value`. |
| `prompt`  *optional*, *string* | Prompt to display to the user to get a value. If you don’t set this parameter, the command displays the name of the parameter as the prompt text.  If you define the prompt as follows:  ```yaml variables:   - name: project_id     prompt: The identifier for the project ```  `snow init` displays this prompt for the `project_id` variable.  ```output The identifier for the project: ``` |
| `default`  *optional*, *string/int/float* | Default value of the variable. If not provided, the variable is treated as required, so a user needs to provide the value after a prompt or by specifying it with the `-D` command-line option.  The following example defines two variables with default values:  ```yaml variables:   - name: max_file_size_mb     default: 16   - name: file_name     default: 'default_file_name.zip' ```  When executed, the `snow init` command displays the following prompts for these two variables:  ```bash file_name [default_file_name.zip]: max_file_size_mb [16]: 5 ```  In this example, the command uses the default value (`default_file_name.zip`) for the `file_name` variable has a default value, and sets `max_file_size_mb` to the value provided by the user (5). |
| `type`  *optional*, *string* | Data type of the variable. Valid values include: `string`, `int`, and `float`. If not specified, the command assumes the value is a `string`.  The following example defines a variable as an `int` data type:  ```yaml variables:   - name: max_file_size_mb     type: int ```  When executed, the snow init command displays the following errors if the user enters a value of the wrong data type:  ```output max_file_size_mb: not an int Error: 'not an int' is not a valid integer. max_file_size_mb: 14.5 Error: '14.5' is not a valid integer. max_file_size_mb: 6 Initialized the new project in example_dir ``` |

---
title: Build a Snowpark project
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/build.md
section: Snowflake CLI
---

# Build a Snowpark project

The `snow snowpark build` command builds the Snowpark project as one or more `.zip` archive files that can be used by the `deploy` command. The cp,,amd builds the archives using only the `src` directory specified in the project file.

```snowcli
snow snowpark build
```

```output
Resolving dependencies from requirements.txt
  No external dependencies.
Preparing artifacts for source code
  Creating: app.zip
Build done.
```

Additional options:

* `--allow-shared-libraries`: Allows shared (`.so`/`.dll`) libraries, when using packages installed through `pip`.
* `--ignore-anaconda`: Does not lookup packages on Snowflake Anaconda channel.
* `--index-url`: Specifies the base URL of the Python Package Index to use for package lookup. This URL should point to a repository compliant with PEP 503 (the simple repository API) or a local directory laid out in the same format.
* `--skip-version-check`: Skips comparing versions of dependencies between requirements and Anaconda.
* `--project [-p]`: Specifies the path where the Snowpark project resides. Defaults to the current working directory.

---
title: Configuring Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-cli.md
section: Snowflake CLI
---

# Configuring Snowflake CLI

Snowflake CLI uses a global configuration file called `config.toml` to configure connections and logs for Snowflake CLI.
If the file does not exist, running any `snow` command for the first time automatically creates an
empty `config.toml` file that you can then populate with the desired connections.
For more information about `toml` file formats, see [TOML (Tom’s Obvious Minimal Language)](https://toml.io/en/).
Snowflake Python libraries currently support TOML version 1.0.0.

The `config.toml` supports the following sections:

* `[connections]` for defining and managing connections
* `[logs]` for configuring which types of messages are saved to log files

A Snowflake CLI configuration file has the following structure:

```toml
default_connection_name = "myconnection"

[connections]
[connections.myconnection]
account = "myorganization-myaccount"
user = "jdoe"
...

[connections.testingconnection]
account = "myorganization-myaccount"
user = "jdoe"
...

[cli.logs]
save_logs = true
level = "info"
path = "/home/<username>/.snowflake/logs"
```

You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
[Configuring a client, driver, library, or third-party application to connect to Snowflake](../../../user-guide/gen-conn-config.md).

> **Note:**
>
> If a `connections.toml` file exists, Snowflake CLI uses the connections defined in it instead of those defined in the `config.toml` file.

## Location of the `.toml` configuration file

By default Snowflake CLI looks for the `config.toml` file in the `~/.snowflake` directory or, in case this directory does not exist, in a system-specific location, as listed below.
You can also specify which configuration file should be used using `--config-file` flag or `SNOWFLAKE_HOME` environment variable.

* If you specify the `--config-file` option (such as, `snow --config-file ./my-config-file-path`), Snowflake CLI uses the specified configuration file.
* If the `SNOWFLAKE_HOME` environment variable is set, Snowflake CLI uses the location specified by this variable.
* If a `~/.snowflake` directory exists on your machine, Snowflake CLI uses the `~/.snowflake/config.toml` file.
* Otherwise, Snowflake CLI uses the `config.toml` file in the one of the following locations, based on your operating system:

  > + Linux: `~/.config/snowflake/config.toml`, but you can update it with XDG vars
  > + Windows: `%USERPROFILE%\AppData\Local\snowflake\config.toml`
  > + Mac: `~/Library/Application Support/snowflake/config.toml`

> **Note:**
>
> For MacOS and Linux systems, Snowflake CLI requires the `config.toml` file to limit its file permissions to read and write for the file owner only. To
> set the file required file permissions execute the following commands:
>
> ```bash
> chown $USER config.toml
> chmod 0600 config.toml
> ```

### Choose a different configuration file

In some situations, such as a continuous integration and continuous deployment (CI/CD) environments, you might prefer to create dedicated configuration files for testing and deployment pipelines instead of defining all of the possible configurations in a single Snowflake default configuration file.

To use a different configuration file that your default file, you can use the `--config-file` option for the `snow` command, as shown:

```snowcli
snow --config-file="my_config.toml" connection test
```

### Support for system environment variables

Snowflake CLI supports using system environment variables to override parameter values defined in your `config.toml` file, using the following format:

```bash
SNOWFLAKE_<config-section>_<variable>=<value>
```

where:

* `<config_section>` is the name of a section in the configuration file with periods (`.`) replaced with underscores (`_`), such as `CLI_LOGS`.
* variable is the name of a variable defined in that section, such as `path`.

Some examples include:

* Override the `path` parameter in the `[cli.logs]` section in the `config.toml` file:

  ```bash
  export SNOWFLAKE_CLI_LOGS_PATH="/Users/jondoe/snowcli_logs"
  ```
* Set the password for the `myconnection` connection:

  ```bash
  export SNOWFLAKE_CONNECTIONS_MYCONNECTION_PASSWORD="*******"
  ```
* Set the default connection name:

  ```bash
  export SNOWFLAKE_DEFAULT_CONNECTION_NAME="myconnection"
  ```

## Add an authentication policy that limits access to Snowflake CLI only

Users can create an [authentication policy](../../../user-guide/authentication-policies.md) that limits access permission to drivers, as well as Snowflake CLI.
If you want to allow access to Snowflake CLI only (and exclude the drivers), you can do the following:

* Create a new authentication policy that limits access strictly to Snowflake CLI.
* Enable the policy in the `config.toml` file.

### Create an authentication policy limited to Snowflake CLI

To create a new authentication policy for only Snowflake CLI, follow these steps:

1. Execute the [CREATE AUTHENTICATION POLICY](../../../sql-reference/sql/create-authentication-policy.md) SQL command, setting the CLIENT_TYPES parameter to include `'SNOWFLAKE_CLI'`.

   ```sqlexample
   CREATE AUTHENTICATION POLICY snowflake_cli_only
     CLIENT_TYPES = ('SNOWFLAKE_CLI');
   ```
2. Add the policy to the user, as shown:

   ```sqlexample
   ALTER USER user1
     SET AUTHENTICATION POLICY snowflake_cli_only;
   ```

### Enable the policy in the Snowflake CLI configuration

The `enable_separate_authentication_policy_id` configuration parameter lets you enable access to Snowflake CLI separately from the drivers.
When this access is enabled, specified users can access Snowflake CLI but not the other Snowflake drivers.

> **Warning:**
>
> If you already have an authentication policy that allows access only to drivers and don’t have one that allows access to Snowflake CLI only, enabling the `enable_separate_authentication_policy_id` parameter will cause the users to lose access to Snowflake CLI if you don’t create the new policy first. Make sure to add SNOWFLAKE_CLI to your authentication policy before enabling the configuration parameter.

To enable the SNOWFLAKE_CLI policy, add the `enable_separate_authentication_policy_id` parameter to the `[cli.features]` section in the `config.toml` file, as shown:

```toml
[cli.features]
enable_separate_authentication_policy_id = true
```

> **Note:**
>
> Enabling this parameter affects all connections made by Snowflake CLI.

## Use a proxy server

To use a proxy server, configure the following environment variables:

* HTTP_PROXY
* HTTPS_PROXY
* NO_PROXY

For example:

Linux or macOS:
:   ```bash
    export HTTP_PROXY='http://username:password@proxyserver.example.com:80'
    export HTTPS_PROXY='http://username:password@proxyserver.example.com:80'
    ```

Windows:
:   ```bash
    set HTTP_PROXY=http://username:password@proxyserver.example.com:80
    set HTTPS_PROXY=http://username:password@proxyserver.example.com:80
    ```

> **Tip:**
>
> Snowflake’s security model does not allow Secure Sockets Layer (SSL) proxies (using an HTTPS certificate). Your proxy server must use a publicly-available Certificate Authority (CA), reducing potential security risks such as a MITM (Man In The Middle) attack through a compromised proxy.
>
> If you must use your SSL proxy, we strongly recommend that you update the server policy to pass through the Snowflake certificate such that no certificate is altered in the middle of
> communications.
>
> Optionally `NO_PROXY` can be used to bypass the proxy for specific communications. For example, access to Amazon S3 can bypass the proxy server by specifying `NO_PROXY=".amazonaws.com"`.
>
> `NO_PROXY` does not support wildcards. Each value specified should be one of the following:
>
> * The end of a hostname (or a complete hostname), for example:
>
>   + .amazonaws.com
>   + myorganization-myaccount.snowflakecomputing.com
> * An IP address, for example:
>
>   + 192.196.1.15
>
> If more than one value is specified, values should be separated by commas, for example:
>
> > ```none
> > localhost,.example.com,.snowflakecomputing.com,192.168.1.15,192.168.1.16
> > ```

## Configure logging

By default, Snowflake CLI automatically saves `INFO`, `WARNING`, and `ERROR` level messages to log files. To disable or customize logging, create a `[cli.logs]` section in your `config.toml` file:

```toml
[cli.logs]
save_logs = true
level = "info"
path = "/home/<username>/.snowflake/logs"
```

where:

* `save_logs` indicates whether to save logs to files. Default: `true`.
* `level` specifies which levels of messages to save to log files. Choose from the following levels, which includes all levels below the selected one:

  + `debug`

    > **Warning:**
    >
    > Switching to the `debug` logging level can expose sensitive information, such as executed SQL queries. Use caution when enabling this level.
  + `info`
  + `warning`
  + `error`

  Default: `info`
* `path` specifies the absolute path to save the log files. The format of the path varies based on your operating system, as shown:

  + Linux: `path = "/home/<your_username>/.config/snowflake/logs"`
  + MacOS: `path = "/Users/<your_username>/Library/Application Support/snowflake/logs"`
  + Windows: `path = "C:\\Users\\<your_username>\\AppData\\Local\\snowflake\\logs"`

  If not specified, the command creates a `logs` directory in the default `config.toml` file location.

If your `config.toml` was created automatically, the `config.toml` file contains the `|cli.logs|` section filled with default values.

Logs from a single day are appended to file `snowflake-cli.log`, which is later renamed to `snowflake-cli.log.YYYY-MM-DD`, as shown.

```bash
ls logs/
```

```output
snowflake-cli.log            snowflake-cli.log.2024-10-22
```

For troubleshooting purposes, you’ll typically also need to configure logging for the Snowflake Connector for Python by adding a `[log]` section to the `config.toml` file, as shown in the following example:

```toml
[log]
save_logs = true
path = "/home/<username>/.snowflake/logs"
level = "DEBUG"
```

For more information about logging for the Snowflake Connector for Python, see [Logging configuration file](../../python-connector/python-connector-example.md) in the Snowflake Connector for Python documentation.

## Suppress version update notifications

By default, Snowflake CLI checks for newer versions and displays a notification message when a newer version is available. You can suppress these notifications using either a configuration file setting or an environment variable, as follows:

* Add the `ignore_new_version_warning` setting to the `config.toml` file:

  ```toml
  [cli]
  ignore_new_version_warning = true
  ```
* Set the `SNOWFLAKE_CLI_IGNORE_NEW_VERSION_WARNING` environment variable:

  ```bash
  export SNOWFLAKE_CLI_IGNORE_NEW_VERSION_WARNING=true
  ```

---
title: Configuring Snowflake CLI and connecting to Snowflake
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/connect.md
section: Snowflake CLI
---

# Configuring Snowflake CLI and connecting to Snowflake

This section explains how to configure, test, and manage your Snowflake connections.

* [Configuring Snowflake CLI](configure-cli.md)
* [Managing Snowflake connections](configure-connections.md)

---
title: Copying files in Git
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/copy-files.md
section: Snowflake CLI
---

# Copying files in Git

The `snow git copy` command copies files from given state of the repository (specific branch, tag, or commit) into another stage or local file system.

```bash
snow git copy <REPO_PATH> <DEST_PATH> [--parallel INT]
```

where:

* `<REPO_PATH>` is a stage path with a specific scope where the value is the repository name followed by a suffix specifying which branch, tag, or commit to copy. The following lists some different types of values:

  + `@snowcli_git/branches/main/` refers to last commit of the “main” branch
  + `@snowcli_git/tags/v2.1.0/` refers to a commit tagged `v2.1.0`.
  + `@snowcli_git/commits/1e939d69ca6fd0f89074e7e97c9fd1/` refers to a specific commit. Commit hashes should be between 6 and 40 characters long.

  A repository path can also be a subdirectory or file in the repository, but still must be preceded with a scope prefix.
* `<DEST_PATH>` is a path to a local directory or to a remote directory on the Snowflake stage.
* `--parallel` specifies the number of threads to use when downloading files.

When `<DEST_PATH>` specifies a stage, the command operates differently based on its suffix format, as follows:

* If the source ends with a `/`, such as `@my_snow_git/branches/main/tests/plugin/`, the command copies the contents of the `plugin` directory into the destination.
* If the source does not end with a `/`, such as `@my_snow_git/branches/main/tests/plugin`, the command copies the entire `plugin` directory.

## Example: Copy files from a commit to a directory in a stage

This example creates a `snowcli2.0/` directory on stage `@public` and copies all files from the commit marked with tag `v2.0.0` into that directory:

```snowcli
snow git copy @my_snow_git/tags/v2.0.0/ @public/snowcli2.0/
```

## Example: Copy files from inside a directory to a directory in a stage

The following example creates a `plugin_tests` directory on the `test_stage` stage and copies the contents of the `tests/plugin/` directory into it.

```snowcli
snow git copy @my_snow_git/branches/main/tests/plugin/ @test_stage/plugin_tests/
```

## Example: Copy an entire directory to a directory in a stage

This example creates a `plugin_tests` directory on the `test_stage` stage and copies the entire `tests/plugin` directory into it. Because `tests/plugin` does note end with a /, the command copies all of the files to `@test_stage/plugin_tests/plugin`.

```snowcli
snow git copy @snowcli_git/branches/main/tests/plugin @test_stage/plugin_tests
```

## Example: Copy files from a directory in a stage to the local file system

The following example creates a `plugin_tests` directory in the local file system and downloads the contents of the `tests/plugin` directory into it.

```snowcli
snow git copy @snowcli_git/branches/main/tests/plugin plugin_tests/
```

---
title: Create a Snowpark project definition
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/create.md
section: Snowflake CLI
---

# Create a Snowpark project definition

The `snowflake.yml` file contains the functions and procedures declarations for a Snowpark project.

> **Note:**
>
> Currently, the Snowpark project definition file must be named `snowflake.yml`.

The following snippet shows a sample Snowpark project definition file: with two functions and two procedures. The `hello_function` function uses external capabilities of Snowpark.

```yaml
definition_version: '2'

mixins:
  snowpark_shared:
    artifacts:
      - dest: my_snowpark_project
        src: app/
    stage: dev_deployment

entities:

  hello_function:
    type: function
    identifier:
      name: hello_function
    handler: functions.hello_function
    signature:
      - name: name
        type: string
    returns: string
    external_access_integrations:
      - my_external_access
    secrets:
        cred: my_cred_name
    meta:
      use_mixins:
        - snowpark_shared

  hello_procedure:
    type: procedure
    identifier:
      name: hello_procedure
    handler: procedures.hello_procedure
    signature:
      - name: name
        type: string
    returns: string
    meta:
      use_mixins:
        - snowpark_shared

  test_procedure:
    type: procedure
    identifier:
      name: test_procedure
    handler: procedures.test_procedure
    signature: ''
    returns: string
    meta:
      use_mixins:
        - snowpark_shared
```

> **Caution:**
>
> Files inside a project directory are processed by Snowflake CLI and could be uploaded to Snowflake when executing other `snow snowpark` commands. You should use caution when putting any sensitive information inside files in a project directory.

## Function and procedure object properties

The following table describes the properties used by functions and procedures.

Function and procedure object properties

| Property | Definition |
| --- | --- |
| **identifier**  *optional*, *string* | Optional Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-snowpark-id   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (e.g. `’”My Snowpark Function”’`). * Object  ```yaml   identifier:     name: my-snowpark-id     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully-qualified name in the `name` property (such as `mydb.schema1.my-app`). |
| **type**  *optional*, *string* | Must be one of: `function` or `procedure`. |
| **artifact_repository**  *optional*, *string* | Name of the artifact repository. Snowflake has a default artifact repository called `snowflake.snowpark.pypi_shared_repository` that you use to connect and install PyPI packages within Snowpark UDFs and procedures. For more information, see [Artifact Repository overview](../../udf/python/udf-python-packages.md).  The `artifact_repository` and `packages` parameters let you use non-anaconda packages, similar to the following:   * In the project’s `app.py` file, you can define a function like the following:  ```python   from sklearn.datasets import load_iris   from sklearn.model_selection import train_test_split   from sklearn.ensemble import RandomForestClassifier    def udf():     X, y = load_iris(return_X_y=True)     X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)      model = RandomForestClassifier()     model.fit(X_train, y_train)     return model.score(X_test, y_test)   ``` * In the `snowflake.yml` file, you would then define it like the following:  ```yaml   test_function:     type: "function"     handler: "app.udf"     identifier:       - name: "udf"     stage: "dev_deployment"     signature: ""     returns: float     artifact_repository: snowflake.snowpark.pypi_shared_repository     packages:       - "scikit-learn"     artifacts: "app.py"   ```   For packages that depend on specific architectures, you can define them in the `resource_constraint` parameter as follows:  ```yaml test_function:    type: "function"    handler: "app.udf"    identifier:      - name: "udf"    stage: "dev_deployment"    signature: ""    returns: float    artifact_repository: snowflake.snowpark.pypi_shared_repository    packages:      - "scikit-learn"    artifacts: "app.py" ```  For more information, see [Packages built only for x86](../../udf/python/udf-python-packages.md). |
| **artifact_repository_packages**  *optional*, *string* | **Note:** This property has been deprecated in favor of the `packages` property. |
| **packages**  *optional*, *string* | List of packages to install from the artifact_repository. For example:  ```yaml artifact_repository: snowflake.snowpark.pypi_shared_repository packages:    - Faker   - rich   - pytest ``` |
| **artifacts**  *required*, *string sequence* | List of file source and destination pairs to add to the deploy root. You can use the following artifact properties:   * `src`: Path to the code source file or files * `dest`: Path to the directory to deploy the artifacts.  Destination paths that reference directories must end with a `/`. A glob pattern’s destination that does not end with a `/` results in an error. If omitted, `dest` defaults to the same string as `src`.  **Note:** Using glob patterns in Snowpark `snowflake.yml` files requires enabling the ENABLE_SNOWPARK_GLOB_SUPPORT feature flag.  You can also pass in a string for each item instead of a `dict`, in which case the value is treated as both `src` and `dest`.   If `src` refers to just one file (not a glob), `dest` can refer to a target `<path>` or a `<path/name>`.  You can also pass in a string for each item instead of a `dict`, which case, the value is treated as both `src` and `dest`. |
| **handler**  *required*, *string* | Function’s or procedure’s implementation of the object inside module defined in `snowpark.src`. For example `functions.hello_function` refers to function `hello_function` from file `<src>/functions.py`. |
| **returns**  *required*, *string* | SQL type of the result. Check the list of [available types](../../udf-stored-procedure-data-type-mapping.md). |
| **signature**  *required*, *sequence* | The `signature` parameter describes consecutive arguments passed to the object. Each should specify its name and type, for example:  ```yaml signature:   - name: "first_argument"     type: int   - name: "second_argument"     default: "default value"     type: string ```  If a function or procedure takes no arguments, set this value to an empty string (`signature: ""`).  Check the **SQL Type** column of [available types](../../udf-stored-procedure-data-type-mapping.md). To learn more about the syntax of named and optional arguments, see [Calling a UDF that has optional arguments](../../udf/udf-calling-sql.md). |
| **runtime**  *optional*, *string* | Python version to use when executing the procedure or function. Default: “3.12”. |
| **external_access_integrations**  *optional*, *string sequence* | Names of [external access integrations](../../../sql-reference/sql/create-external-access-integration.md) needed for this procedure’s handler code to access external networks. See the [EXTERNAL_ACCESS_INTEGRATIONS parameter in CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) for more details. |
| **secrets**  *optional*, *dictionary* | Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from secrets in handler code. See [the SECRETS parameter in CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) for more details. |
| **imports**  *optional*, *string sequence* | Stage and path to previously uploaded files you want to import. See [the IMPORT parameter in CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) for more details. |
| **execute_as_caller**  *optional*, *bool* | **Available only for procedures**. Determine whether the procedure is executed with the privileges of the owner (you) or with the privileges of the caller. Default: False (owner’s privileges). |

---
title: Create project definition file templates
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/create-templates.md
section: Snowflake CLI
---

# Create project definition file templates

In some situations, you might want to reference information already present in a project definition file in another place of the file. Snowflake CLI supports templating the project definition file.

Project definition file templates use the `<% … %>` syntax for specifying the templates. The following example uses the `env` section to define a name for a Streamlit application:

```yaml
definition_version: 2
env:
  name: "my-app"
entities:
  my_streamlit:
    type: "streamlit"
    identifier: <% ctx.env.name %>
```

The `<% ctx.env.name %>` syntax references a global context object that provides access to the project definition. The `ctx` object has the same structure as the project definition. You can access attributes of defined objects using dot notation. Example uses include:

* `<% ctx.entities.pkg.identifier %>` to access the name of a Native App package with ID `pkg`.
* `<% ctx.entities.function.stage_name %>` to access the stage name for a snowpark UDFs and procedures.
* `<% ctx.entities.my_streamlit.identifier %>` to access the Streamlit dashboard name.

You can override any variable defined in the `snowflake.yml` project definition file `env` section by setting a shell environment variable by the same case-sensitive name. For example, to override the name value defined in the example, you can execute the following shell command:

```yaml
export name="other"
```

## Access template defaults

Template defaults let you access default and automatically-generated fields from a project definition file, even if the fields are not explicitly defined. To illustrate, consider the following Snowflake Native App project definition file:

```yaml
definition_version: 2
entities:
  pkg:
    type: application package
    artifacts:
      - src: app/*
        dest: ./
  app:
    type: application
    from:
      target: pkg
```

This definition provides enough information to create a Snowflake Native App, so the default values for the application package and application instance are automatically generated when you create the application. You can then access these values using the following syntax:

> ```yaml
> <% ctx.entities.app.identifier %>
> <% ctx.entities.pkg.identifier %>
> ```

---
title: Creating a Snowflake Native App project
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/initiate-app.md
section: Snowflake CLI
---

# Creating a Snowflake Native App project

You can use the `snow init` command to bootstrap a Snowflake Native App project, and get the project up and running quickly.

To create a Snowflake Native App project from a Snowflake provided Snowflake Native App template:

* Enter a `snow init` command, similar to the following:

  ```snowcli
  snow init --template app_basic my_app
  ```

  When successful, the command returns a confirmation message similar to the following:

  ```output
  Initialized the new project in my_app
  ```

> **Caution:**
>
> Files inside a project directory are processed by Snowflake CLI and could be uploaded to Snowflake when executing other `snow app` commands. You should use caution when putting any sensitive information inside files in a project directory.

For more information about creating a Snowflake Native App project, see the snow init command as well as the [Snowflake CLI templates](https://github.com/snowflakedb/snowflake-cli-templates) repository.

---
title: Creating a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/manage-apps/initialize-app.md
section: Snowflake CLI
---

# Creating a Streamlit app

## Prerequisites

Before creating a Streamlit app with Snowflake CLI, you should meet the following prerequisites:

* Ensure that your account has the correct privileges as described in [Privileges required to create and use a Streamlit app](../../../streamlit/object-management/privileges.md).
* Ensure that you can create or have access to a named stage where you can upload your Streamlit app files.

## Bootstrap a Streamlit app

The `snow init` command creates a local directory with a sample set of files that help you get started creating a Streamlit app. When you execute this command, Snowflake CLI creates the following directory structure:

```output
example_streamlit/            - project name (default: example_streamlit)
  snowflake.yml               - configuration for snow streamlit commands
  environment.yml             - additional config for Streamlit, for example installing packages
  streamlit_app.py            - entrypoint file of the app
  pages/                      - directory name for Streamlit pages (default pages)
  common/                     - example “shared library”
```

To initialize a Streamlit app, enter the following command:

```snowcli
snow init new_streamlit_project --template example_streamlit -D query_warehouse=dev_warehouse -D stage=testing
```

> **Caution:**
>
> Files inside a project directory are processed by Snowflake CLI and could be uploaded to Snowflake when executing other `snow streamlit` commands. You should use caution when putting any sensitive information inside files in a project directory.

For more information about the file structure, see [Organize your Streamlit app files](../../../streamlit/app-development/file-organization.md).

## Create the project definition for a Streamlit app

Each Streamlit app in Snowflake must include a `snowflake.yml` project definition file. Streamlit is limited to one application per project definition file.

The following shows a sample `snowflake.yml` project definition file:

```yaml
definition_version: 2
entities:
  my_streamlit:
    type: streamlit
    identifier: streamlit_app
    stage: my_streamlit_stage
    query_warehouse: my_streamlit_warehouse
    main_file: streamlit_app.py
    pages_dir: pages/
    external_access_integrations:
      - test_egress
    secrets:
      dummy_secret: "db.schema.dummy_secret"
    imports:
      - "@my_stage/foo.py"
    artifacts:
      - common/hello.py
      - environment.yml
    grants:
      - privilege: USAGE
        role: streamlit_role
```

The following table describes the properties of a Streamlit project definition.

Streamlit project definition properties

| Property | Definition |
| --- | --- |
| **identifier**  *optional*, *string* | Optional Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-streamlit-id   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (e.g. `’”My Streamlit Application”’`). * Object  ```yaml   identifier:     name: my-streamlit-id     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully-qualified name in the `name` property (such as `mydb.schema1.my-app`). |
| **type**  *optional*, *string* | Must be `streamlit`. |
| **comment**  *optional*, *string* | Comment for the Streamlit dashboard. |
| **title**  *optional*, *string* | Human-readable title for the Streamlit dashboard. |
| **stage**  *optional*, *string* | Stage in which the app’s artifacts will be stored. Default: None. |
| **query_warehouse**  *required*, *string* | Snowflake warehouse to host the app. |
| **main_file**  *optional*, *string* | [Entrypoint file](https://docs.streamlit.io/get-started/tutorials/create-an-app) of the streamlit app. Default: “streamlit_app.py”. |
| **pages_dir**  *optional*, *string* | Streamlit [pages](https://docs.streamlit.io/get-started/tutorials/create-a-multipage-app). Default: “pages”. |
| **external_access_integrations**  *optional*, *string sequence* | Names of [external access integrations](../../../../sql-reference/sql/create-external-access-integration.md) needed for this Streamlit application code to access external networks. See [the optional parameters for CREATE STREAMLIT](../../../../sql-reference/sql/create-streamlit.md) for more details. |
| **secrets**  *optional*, *dictionary* | Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from secrets in application code. |
| **imports**  *optional*, *string sequence* | Stage and path to previously uploaded files you want to import. See [the optional parameters for CREATE STREAMLIT](../../../../sql-reference/sql/create-streamlit.md) for more details. |
| **artifacts**  *required*, *string sequence* | List of file source and destination pairs to add to the deploy root. You can use the following artifact properties:   * `src`: Path to the code source file or files. * `dest`: Path to the directory to deploy the artifacts.  Destination paths that reference directories must end with a `/`. A glob pattern’s destination that does not end with a `/` results in an error. If omitted, `dest` defaults to the same string as `src`.  You can also pass in a string for each item instead of a `dict`, in which case the value is treated as both `src` and `dest`.   If `src` refers to just one file (not a glob), `dest` can refer to a target `<path>` or a `<path/name>`.  You can also pass in a string for each item instead of a `dict`; in that case, the value is treated as both `src` and `dest`. |
| **grants**  *optional*, *grant sequence* | Grants that should be given for the Streamlit app. Each grant must specify the privilege and target role. For more details, see [the optional parameters for CREATE STREAMLIT](../../../../sql-reference/sql/create-streamlit.md). |

---
title: Creating an application package with a version (or patch)
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/create-app-package-version.md
section: Snowflake CLI
---

# Creating an application package with a version (or patch)

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your native app project.

## How to create an application package with a version (or patch)

The [snow app version create](../command-reference/native-apps-commands/version/app-version-create.md) command brings all the different code files together, creates an application package, uploads code to a Snowflake stage in this application package, and creates a version for that application package. If a version already exists, it adds a custom or an auto-incremented patch to it. This command uses the values specified in your resolved project definition to determine the stage to which it upload files, which files to upload, and the name of the application package to create.

To create an application package and create a version for it, do the following:

1. [Create a connection](../connecting/connect.md), if necessary.
2. Make relevant changes to your code files, including `snowflake.yml`, `manifest.yml`, addition any setup scripts and extension code files.
3. Execute the `snow app version create` command from within your project, similar to the following:

   ```snowcli
   snow app version create V1 --connection="dev"
   ```

   ```output
   Version V1 created for application package my_app_pkg.
   Version create is now complete.
   ```

> This command creates a version **V1** and a default patch **0** for application package `my_app_pkg`.

For more information about adding a version definition to an application package, see the [snow app version create](../command-reference/native-apps-commands/version/app-version-create.md) command.

---
title: Creating and installing your application
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/create-package.md
section: Snowflake CLI
---

# Creating and installing your application

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your native app project.

## How to create an application package and an application object together

The [snow app run](../command-reference/native-apps-commands/run-app.md) command brings all the different code files together, creates an application package, uploads code to a Snowflake stage in this application package, [validates the setup script SQL](../command-reference/native-apps-commands/validate-app.md), and also installs or upgrades an application in the same account from this application package. This command is driven by the values specified in your resolved project definition for determining which stage to upload files to, which files to upload, and the names of the objects to be created.

To create an application object, do the following:

1. [Create a connection](../connecting/connect.md), if necessary.
2. Make relevant changes to your code files, including `snowflake.yml`, `manifest.yml`, any setup scripts and extension code files.
3. Execute the `snow app run` command from within your project, similar to the following:

   ```snowcli
   snow app run --connection="dev"
   ```

> When successful, the command displays a message similar to the following:
>
> ```output
> Your application ("my_app_admin") is now live:
> https://app.snowflake.com/data_org/data_acct/#/apps/application/my_app_admin
> ```

Using the `snow app run --connection="dev"` command creates an application using the files on a named stage that is automatically managed by Snowflake CLI. You can also use the command to create or update your application even if your application package already exists. In this case, the command issues an UPGRADE on your application object, which will execute your setup script. For information about how to avoid re-running the setup script, see the next section.

To create an application using a version (and patch) of an existing application package, execute the following:

```snowcli
snow app run --version v1 --patch 12 --connection="dev"
```

Here, version `V1` and patch `12` are used as an example only.
For more information about creating Snowflake Native App objects, see the [snow app run](../command-reference/native-apps-commands/run-app.md) command.

## How to create an application package

The `snow app deploy` command performs a subset of the steps `snow app run` takes to deploy your
code to Snowflake. While it still brings all the different code files together, creates an application
package, and uploads code to a named stage in this application package, and [validates the setup script SQL](../command-reference/native-apps-commands/validate-app.md), the `snow app deploy` command does not attempt to create or upgrade an application object.

The `snow app deploy` command is particularly useful in the following situations:

* Deploying only the application package and stage files, for situations where an application object is not required (such as part of a Continuous Delivery pipeline).
* Updating the stage files linked to the application object. For example, if you only changed python code files, you do not need to re-create the PROCEDURE, FUNCTION, and STREAMLIT objects that point to it when using stage development mode. This approach saves time and reduces cost, as you do not need to use a warehouse to re-execute the setup script to use the updated python code.

To create an application package without a corresponding application object, do the following:

1. [Create a connection](../connecting/connect.md), if necessary.
2. Make relevant changes to your code files, including `snowflake.yml`, `manifest.yml`, any setup scripts, and extension code files.
3. Execute the `snow app deploy` command from within your project, similar to the following:

   ```snowcli
   snow app deploy --connection="dev"
   ```

> When successful, the command displays a message similar to the following:
>
> ```output
> Checking if stage exists, or creating a new one if none exists.
> Performing a diff between the Snowflake stage and your local deploy_root
> ...
> Deployed successfully. Application package and stage are up-to-date.
> ```

You can also use the `snow app deploy` command to restrict which files it synchronizes to
a stage by listing paths as positional arguments after the `snow app deploy` command. For more information about this and other advanced functionality, see the [snow app deploy](../command-reference/native-apps-commands/deploy-app.md) command.

---
title: Creating and managing Snowflake Native App objects
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/create-manage-apps.md
section: Snowflake CLI
---

# Creating and managing Snowflake Native App objects

You can perform the following operations when creating and managing Snowflake Native App objects:

* [Creating a Snowflake Native App project](initiate-app.md)
* [Preparing a local folder with configured Snowflake Native App artifacts](bundle-app.md)
* [Validating an application package](validate-app.md)
* [Creating and installing your application](create-package.md)
* [Creating an application package with a version (or patch)](create-app-package-version.md)
* [Listing all versions defined in an application package](list-app-package-version.md)
* [Opening an app in a browser](open-app.md)
* [Publishing a Snowflake Native App to customers](publish-app.md)
* [Dropping an existing version of an app in an application package](drop-app-package-version.md)
* [Dropping Snowflake Native App objects](drop-objects.md)

---
title: Deploy a Snowpark project
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/deploy.md
section: Snowflake CLI
---

# Deploy a Snowpark project

The `snow snowpark deploy` command uploads local files to the specified stage and creates procedure and function objects defined in the project. Deploying the project alters all objects defined in it. By default, if any of the objects exist already the commands fails unless you provide the `--replace` option. All deployed objects use the same artifact, which is uploaded only once.

```snowcli
snow snowpark deploy
```

```output
+-------------------------------------------------------------+
| object                       | type      | status           |
|------------------------------+-----------+------------------|
| hello_procedure(name string) | procedure | created          |
| test_procedure()             | procedure | packages updated |
| hello_function(name string)  | function  | created          |
+-------------------------------------------------------------+
```

When you run `snow snowpark deploy`, the command does the following:

1. Snowflake CLI checks whether any of the defined objects (functions or procedures) already exists.
2. If any exist and the `--replace` flag is not provided, the command exits. The reasoning behind this approach is to be “production-safe” by avoiding unintentional changes to existing objects.
3. If all objects don’t exist or `--replace` is provided, the command:

   * If the `--prune` flag is provided, all previous contents of the stages used by defined procedure and function objects are removed.
   * Uploads the new zip artifacts.
   * Updates definitions of every procedure.
   * Updates definitions of every function.

---
title: Deploying a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/manage-apps/deploy-app.md
section: Snowflake CLI
---

# Deploying a Streamlit app

The [snow streamlit deploy](../../command-reference/streamlit-commands/deploy.md) command creates a new Streamlit object
inside your chosen database and schema. By default, this command looks for a main file called `streamlit_app.py`
in your current working directory.

## Prerequisites

Before deploying a Streamlit app with Snowflake CLI, you should meet the following prerequisites:

* Ensure that you have a local Streamlit app with the correct directory structure and `snowflake.yml` project definition file must exist.
* Ensure that your account has the correct privileges as described in [Privileges required to create and use a Streamlit app](../../../streamlit/object-management/privileges.md).
* Ensure that you can create or have access to a named stage where you can upload your Streamlit app files.

## How to deploy a Streamlit app

> **Note:**
>
> With the release of Snowflake CLI 3.14.0, the `snow streamlit deploy` command now uses the updated CREATE STREAMLIT syntax (FROM *source_location*) instead of the deprecated syntax (ROOT_LOCATION = ‘<stage_path_and_root_directory>’). To continue using the deprecated syntax, you can use the `--legacy` option.

The `snow streamlit deploy` command uploads local files to a stage and creates a new Streamlit object inside your chosen database and schema. Your [project definition file](initialize-app.md) should specify the main Python file and query warehouse. You can also specify the following options:

* `--replace`: Replaces the specified Streamlit app, if it already exists.
* `--open`: Opens the Streamlit app in your default browser after deploying the app.
* `--prune`: Removes files that exist in the stage, but not files in the local filesystem (by default no files are removed).
* `--legacy`: Uses the deprecated SQLsyntax (ROOT_LOCATION = ‘<stage_path_and_root_directory>’).

By default the command automatically deploys the `environment.yml` file and the content of the `pages/`
directory, if any of those exists. You can use different files by using [command-line options](../../command-reference/streamlit-commands/deploy.md).

For more information about creating Streamlit apps, see the CLI [snow streamlit deploy](../../command-reference/streamlit-commands/deploy.md) and
SQL [CREATE STREAMLIT](../../../../sql-reference/sql/create-streamlit.md) commands.

---
title: Dropping an existing version of an app in an application package
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/drop-app-package-version.md
section: Snowflake CLI
---

# Dropping an existing version of an app in an application package

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.

## How to drop a version definition of an app

The [snow app version drop](../command-reference/native-apps-commands/version/app-version-drop.md) drops the specified app version of an application package, if it exists, and is not referenced by a release directive. If you want to drop a version referenced by a release directive, you must first set that release directive to a different version. This command uses the resolved project definition to determine the name of the application package version to drop.

This command does not allow dropping patches, because Snowflake does not currently support that functionality for a Snowflake Native App.

To drop a version of an existing application package, do the following:

1. [Create a connection](../connecting/connect.md), if necessary.
2. Set the release directive to a different version, if not already done.
3. Execute the `snow app version drop` command from within your project, as shown:

   ```snowcli
   snow app version drop v1 --connection="dev"
   ```

   ```output
   Version v1 of application package my_app_pkg dropped successfully.
   Version drop is now complete.
   ```

> **Note:**
>
> If the version of the application is replicated to other regions, the version won’t be dropped until the next replication is complete.
>
> For information about updating the replication schedule, see
> [Set the refresh schedule for a listing](../../../collaboration/provider-listings-auto-fulfillment-configure-cron-refresh-schedule.md).
>
> To start replication manually, use the [SYSTEM$TRIGGER_LISTING_REFRESH](../../../sql-reference/functions/system_trigger_listing_refresh.md) system function.

For more information about dropping a version of an application package, see the [snow app version drop](../command-reference/native-apps-commands/version/app-version-drop.md) command.

---
title: Dropping Snowflake Native App objects
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/drop-objects.md
section: Snowflake CLI
---

# Dropping Snowflake Native App objects

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.

## How to drop Snowflake Native App application packages and application objects

The `snow app teardown` drops both the application object and the application package defined in the resolved project definition.
This command succeeds even if one or both of these objects do not exist.

1. [Create a connection](../connecting/connect.md), if necessary.
2. Execute the `snow app teardown` command from within your project, similar to the following:

   > ```snowcli
   > snow app teardown --connection="dev"
   > ```
   >
   > When successful, the command returns the following message:
   >
   > ```output
   > Teardown is now complete.
   > ```

> **Note:**
>
> When dropping applications that own objects outside of the application object, such as compute pools, Snowflake CLI shows a list of these dependent objects and asks whether you would like to drop them in addition to the application object and package.
>
> > You can choose this option non-interactively by passing in the `--cascade` option.

If Snowflake CLI is unable to drop the application, it does note drop the application package either.
For more information about dropping Snowflake Native App objects, see the [snow app teardown](../command-reference/native-apps-commands/teardown-app.md) command.

---
title: Execute a Snowpark procedure or function
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/execute.md
section: Snowflake CLI
---

# Execute a Snowpark procedure or function

To execute a Snowpark procedure or function, use the `snow snowpark execute OBJECT_TYPE EXECUTION_IDENTIFIER` command, where:

* `OBJECT_TYPE` is one of `function` or `procedure`.
* `EXECUTION_IDENTIFIER` is function or procedure signature, with all arguments provided.

The following example calls a Snowpark function called `hello_function`:

```snowcli
snow snowpark execute function "hello_function('Olaf')"
```

```output
+--------------------------------------+
| key                    | value       |
|------------------------+-------------|
| HELLO_FUNCTION('Olaf') | Hello Olaf! |
+--------------------------------------+
```

---
title: Executing files from a repository
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/execute-sql.md
section: Snowflake CLI
---

# Executing files from a repository

> **Note:**
>
> Snowflake CLI does not support executing Python files for Python versions 3.12 and above.

You can use the `snow git execute` command for all `.sql` and `.py` files in a repository path. The command searches for all SQL and Python files and then executes the [EXECUTE IMMEDIATE](../../../sql-reference/sql/execute-immediate.md) command on each of them.

```bash
snow git execute <REPO_PATH> [--silent]
```

where:

* `<REPO_PATH>` can be any of the following:

  + A repository stage, such as `@snowcli_git/branches/main/`, to execute commands from all `.sql` files in the stage.
  + A glob-like pattern, such as `@snowcli_git/branches/main/scripts/*`, to execute commands from all `.sql` files in the `scripts` directory.
  + A specific `.sql` file, such as `@snowcli_git/branches/main/scripts/script.sql`, to execute commands contained only the `script.sql` file.
* `--silent` hides intermediate messages with file execution results.

> **Note:**
>
> The `snow git execute` command does not display the output of any of the SQL commands it processes.

The following example shows how to execute SQL commands in all files within the `project` directory that match a regular expression.

```snowcli
snow git execute "@git_test/branches/main/projects/script?.sql"
```

```output
SUCCESS - git_test/branches/main/projects/script1.sql
SUCCESS - git_test/branches/main/projects/script2.sql
SUCCESS - git_test/branches/main/projects/script3.sql
+---------------------------------------------------------------+
| File                                        | Status  | Error |
|---------------------------------------------+---------+-------|
| git_test/branches/main/projects/script1.sql | SUCCESS | None  |
| git_test/branches/main/projects/script2.sql | SUCCESS | None  |
| git_test/branches/main/projects/script3.sql | SUCCESS | None  |
+---------------------------------------------------------------+
```

Adding the `--silent` option to the same command hides the intermediate messages showing the progression of the files processed.

```snowcli
snow git execute "@git_test/branches/main/projects/script?.sql" --silent
```

```output
+---------------------------------------------------------------+
| File                                        | Status  | Error |
|---------------------------------------------+---------+-------|
| git_test/branches/main/projects/script1.sql | SUCCESS | None  |
| git_test/branches/main/projects/script2.sql | SUCCESS | None  |
| git_test/branches/main/projects/script3.sql | SUCCESS | None  |
+---------------------------------------------------------------+
```

---
title: Executing SQL statements
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/sql/execute-sql.md
section: Snowflake CLI
---

# Executing SQL statements

The [snow sql](../command-reference/sql-commands/sql.md) command lets you execute ad-hoc SQL queries or files containing SQL queries using the following options:

* To execute an ad-hoc query, use the `-q` command-line option. For example, to execute a simple SQL SELECT query, as shown in the following example:

  ```snowcli
  snow sql -q "SELECT * FROM FOO;"
  ```
* To execute a file containing a SQL query, use the `-f` command-line option to specify the path to the file. For example, to execute a file containing a SQL query, as shown in the following example:

  ```snowcli
  snow sql -f my_query.sql
  ```

The `snow sql` command also can execute multiple statements; in that case, multiple result sets are returned. For example running:

```snowcli
snow sql  -q "select 'a', 'b'; select 'c', 'd';"
```

results in the following output:

```output
select 'a', 'b';
+-----------+
| 'A' | 'B' |
|-----+-----|
| a   | b   |
+-----------+

select 'c', 'd';
+-----------+
| 'C' | 'D' |
|-----+-----|
| c   | d   |
+-----------+
```

You can also execute [scripting blocks](../../snowflake-scripting/running-examples.md) in Snowflake CLI with a caveat relating to the `$$` delimiter.

For example:

```sqlexample
EXECUTE IMMEDIATE $$
-- Snowflake Scripting code
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := pi() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
$$
;
```

Some operating systems interpret `$$`, such as a process ID (PID), instead of recognizing it as a scripting block delimiter. To address this limitation, you can use the following alternatives:

> * If you still want to specify the scripting block on the command line, you can escape the `$$` delimiters, as in `\$\$`.
> * You can also put the scripting block with the default `$$` delimiters into a separate file and call it with the `snow sql -f <filename>` command.

For more information, see the [snow sql](../command-reference/sql-commands/sql.md) command.

## Using variables for SQL templates

In certain situations, you might want to change your SQL queries based on the context. The `snow sql` command supports client-side variable substitution that lets you use variables in the command that are resolved locally before submitting the query. Variables in the SQL string take the form `<% variable_name %>`, and the `-D` (or `--variable`) option specifies the value of the variable.

> > **Note:**
> >
> > You can currently use the SnowSQL `&variable_name` and `<% variable_name %>` syntax for templates. However, Snowflake recommends using the `<% variable_name %>` syntax.

For example, to specify a database using a client-side variable, you can enter a command similar to the following:

```snowcli
snow sql -q "select * from <% database %>.logs" -D "database=dev"
```

When executed, the command substitutes the value `dev` in the `<% database %>` variable to create the `dev.logs` filename and then sends the `select * from dev.logs` SQL query to Snowflake for processing.

You can also specify multiple variable inputs, as shown:

```snowcli
snow sql \
-q "grant usage on database <% database %> to <% role %>" \
-D "database=dev" \
-D "role=eng_rl"
```

This example generates the following SQL query:

```bash
grant usage on database dev to eng_rl
```

The `--enable-templating` option lets you specify which templating syntaxes are resolved in a SQL query. Snowflake CLI supports the following syntaxes:

* `STANDARD`: Support the standard Snowflake CLI variable syntax (`<% variable_name %>`). Enabled by default.
* `LEGACY`: Support the SnowSQL variable syntax (`&{ variable_name }` or `&variable_name`). Enabled by default.
* `JINJA`: Support the jinja variable syntax (`{{ variable_name }}`). Disabled by default.
* `ALL`: Allow all supported syntaxes. Disabled by default.
* `NONE`: Do not support templating. Disabled by default.

The following examples illustrate different ways to support templating:

* Disable templating, so that neither of the query variables is resolved:

  ```snowcli
  snow sql --enable-templating NONE -q "select '<% not_resolved %> &not_resolved'"
  ```
* Allow JINJA and STANDARD templating, while disallowing LEGACY templating:

  ```snowcli
  snow sql --enable-templating JINJA --enable-templating STANDARD -q "select '<% resolved %> {{ resolved }} &not_resolved'"
  ```
* Enable all syntaxes, so the SQL query resolves all three syntaxes:

  ```snowcli
  snow sql --enable-templating ALL -q "select '<% resolved %> {{ resolved }}'"
  snow sql --enable-templating ALL -q "select '&resolved {{ resolved }}'"
  ```

> **Note:**
>
> JINJA variables, if enabled, are resolved after STANDARD and LEGACY variables.

## Storing variables in the `snowflake.yml` project definition file

Specifying variables as `snow sql` command-line options might not always be practical, or perhaps you might not want to specify sensitive values on the command line. In such cases, you can define variables and values in the `snowflake.yml` project definition file. Then you can just specify the variable names in the form `<% ctx.env.<variable_name> %>` instead of using the `-D "<variable> = <value>"` option.

Using the example from the previous section, you could store the database and role variables in `snowflake.yml` file and change the query to:

```snowcli
snow sql -q "grant usage on database <% ctx.env.database %> to <% ctx.env.role %>"
```

In this example, the `snow sql` command looks for the variable definitions in the project definition file and extracts the values without making them visible on the command line.
The `snowflake.yml` file should be located either in the current working directory or in the location specified with the `-p` option.

For more information about storing these values in the project definition file, see [Use variables in SQL](../project-definitions/use-sql-variables.md).

## Executing SQL queries asynchronously

Snowflake CLI lets you execute one or more SQL queries asynchronously. Instead of waiting for a result, the `snow sql` command schedules the queries at Snowflake and returns a query ID. After a query finishes, you can get the result using the !result query command or the SQL [RESULT_SCAN](../../../sql-reference/functions/result_scan.md) command.

To execute a SQL query asynchronously, end the query with `;>` instead of `;`, as shown:

```snowcli
snow sql -q 'select "My async query" ;>'
```

The following example executes a single query asynchronously:

```snowcli
snow sql -q "select 'This is async query';>"
```

```output
select 'This is async query'
+--------------------------------------+
| scheduled query ID                   |
|--------------------------------------|
| 01bc3011-080f-f2d7-0001-c1be14bae7c2 |
+--------------------------------------+
```

You can then use the returned query ID in the !result query command to display the query result:

```snowcli
snow sql -q '!result 01bc3011-080f-f2d7-0001-c1be14bae7c2'
```

```output
path-to-private-key-file
+-----------------------+
| 'THIS IS ASYNC QUERY' |
|-----------------------|
| This is async query   |
+-----------------------+
```

You can also execute multiple queries in the query string, both asynchronously and synchronously, as shown:

```snowcli
snow sql -q "select 'This is async query';> select 'Not an async query'; select 'Another async query';>"
```

```output
select 'This is async query'
+--------------------------------------+
| scheduled query ID                   |
|--------------------------------------|
| 01bc3b8c-0109-2e81-0000-0f2d0e5a4a32 |
+--------------------------------------+

select 'Not an async query';
+----------------------+
| 'NOT AN ASYNC QUERY' |
|----------------------|
| Not an async query   |
+----------------------+

select 'Another async query'
+--------------------------------------+
| scheduled query ID                   |
|--------------------------------------|
| 01bc3b8c-0109-2e81-0000-0f2d0e5a4a36 |
+--------------------------------------+
```

## Working with SQL query commands

Snowflake CLI provides the following commands that you can use inside your SQL queries:

* !source, which executes SQL in local files or URLs.
* !queries, which lists all SQL queries.
* !result, which displays the result of a SQL query.
* !abort, which aborts an active SQL query.
* !edit, which opens an external editor to modify and execute SQL commands.

> **Tip:**
>
> If you enclose your SQL query in double quotes (`""`) instead of single quotes (`''`), you might
> need to escape the exclamation point (`!`) based on which shell you use.

### Execute SQL in local files or URLs

You can use the `!source` query command in your SQL query to execute SQL in local files or a URL-based file. For example, the following command executes all SQL commands in a local file named `my_sql_code.sql`:

```snowcli
snow sql -q '!source my_sql_code.sql'
```

You can also nest `!source` commands in the SQL files, such as:

```bash
select emp_id FROM employees;
!source code_file_2.sql
```

In this example, the command executes the SELECT query and then executes the SQL commands in the `code_file_2.sql` file.

To execute multiple SQL files using `!source`, place each directive on a separate line in a wrapper file. For example, create a file named `run_all.sql` with the following contents:

```bash
!source script1.sql
!source script2.sql
!source script3.sql
```

Then execute the wrapper file:

```snowcli
snow sql -f run_all.sql
```

All three files are executed sequentially on a single connection. Alternatively, you can use multiple `-f` options to achieve the same result without a wrapper file, such as `snow sql -f script1.sql -f script2.sql -f script3.sql`.

Before executing `!source` queries, Snowflake CLI does the following:

* Evaluates variable substitutions and templates.
* Reads the contents of all nested files to ensure that no recursion occurs.

When the variables and templates are resolved and no recursion is detected, the command sends the code to Snowflake for execution.

> **Note:**
>
> If you use double quotes (`""`) instead of single quotes (`''`) around a `!source` query, you might need to escape the `!` (`\!`) depending on which shell you use.

The following examples illustrate different ways you can execute source files.

* Execute code in a local file.

  This example assumes you have a simple query in a local SQL file.

  ```bash
  cat code_to_execute.sql
  ```

  ```output
  select 73;
  ```

  To execute the code in the file, enter the following command:

  ```snowcli
  snow sql -q '!source code_to_execute.sql'
  ```

  ```output
  select 73;
  +----+
  | 73 |
  |----|
  | 73 |
  +----+
  ```
* Execute code in a URL-based file.

  This example assumes you have the same simple query in a SQL file at a URL.

  To execute the code in the file, enter the following command:

  ```snowcli
  snow sql -q '!source https://trusted-host/trusted-content.sql'
  ```

  ```output
  select 73;
  +----+
  | 73 |
  |----|
  | 73 |
  +----+
  ```
* Execute code that uses variable substitution and templating.

  This example assumes you have a query in a local SQL file that uses a template variable.

  ```bash
  cat code_with_variable.sql
  ```

  ```output
  select '<% ctx.env.Message %>';
  ```

  To execute the code in the file, enter the following command that defines the variable value:

  ```snowcli
  snow sql -q '!source code_&value.sql;' -D value=with_variable --env Message='Welcome !'
  ```

  ```output
  select 'Welcome !';
  +-------------+
  | 'WELCOME !' |
  |-------------|
  | Welcome !   |
  +-------------+
  ```

> **Note:**
>
> The `!source` command supports the legacy `!load` alias.

### List all SQL queries

The `!queries` query command lists all queries for an account. By default, the command lists the 25 most recent queries executed in the current session.

For example, the following `!queries` query command returns the three most recent queries for a specific user:

> ```snowcli
> snow sql -q '!queries user=user1 amount=3'
> ```
>
> ```output
> +-------------------------------------------------------------------------------------------------------------------------------------+
> | QUERY ID                             | SQL TEXT                                                           | STATUS    | DURATION_MS |
> |--------------------------------------+--------------------------------------------------------------------+-----------+-------------|
> | 01bc3040-080f-f4f9-0001-c1be14bb603a | select current_version();                                          | SUCCEEDED | 3858        |
> | 01bc303d-080f-f4e9-0001-c1be14bb1812 | SELECT SYSTEM$CANCEL_QUERY('01bc3011-080f-f2d7-0001-c1be14bae7c2') | SUCCEEDED | 564         |
> | 01bc3011-080f-f2d7-0001-c1be14bae7c2 | select 'This is async query'                                       | SUCCEEDED | 931         |
> +-------------------------------------------------------------------------------------------------------------------------------------+
> ```

You can use the following filters to narrow the list of returned queries:

| Filter | Default | Description |
| --- | --- | --- |
| amount (integer) | 25 | Number of recent queries to return (default: 25). |
| session (boolean) | N/A | If provided, return only queries executed in the current session. |
| warehouse (string) | None | Return queries executed only on the specified warehouse. |
| user (string) | None | Return queries executed only by the specified user. |
| duration (milliseconds) | 0 | Return only queries that took at least the specified number of milliseconds. |
| start_date (string) | None | Return only queries executed after the specified date. Date is expected to be provided in ISO format (for example `2025-01-01T09:00:00`) |
| end_date (string) | None | Return only queries executed before the specified date. Date is expected to be provided in ISO format (for example `2025-01-01T09:00:00`) |
| start (integer) | None | Return only queries executed after the specified Unix timestamp (in milliseconds). |
| end (integer) | None | Return only queries executed before the specified Unix timestamp (in milliseconds). |
| status (enum) | None | Return only queries in one of the following statuses:   * RUNNING * SUCCEEDED * FAILED * BLOCKED * QUEUED * ABORTED |
| type | None | Return only queries of one of the following types:   * SELECT * INSERT * UPDATE * DELETE * MERGE * MULTI_TABLE_INSERT * COPY * COMMIT * ROLLBACK * BEGIN_TRANSACTION * SHOW * GRANT * CREATE * ALTER |

The following examples return queries using different filters:

* Return the 25 most recent queries executed in the current session:

  ```snowcli
  snow sql -q 'select 42; select 15; !queries session'
  ```
* Return the 20 most recent queries executed in the account:

  ```snowcli
  snow sql -q '!queries amount=20'
  ```
* Return the 20 most recent queries executed in the account that took longer than 200 milliseconds to run:

  ```snowcli
  snow sql -q '!queries amount=20 duration=200'
  ```
* Return the 25 most recent queries executed in the specified warehouse:

  ```snowcli
  snow sql -q '!queries warehouse=mywh'
  ```

### Return a completed SQL query result

The `!result` query command returns the result of a completed query, given its query ID. You can obtain the query ID in the following ways:

* Check the [Query History page](../../../user-guide/ui-snowsight-activity.md) in Snowsight.
* Run the `!queries` SQL query command.
* Use the ID returned by an asynchronous query.

```snowcli
snow sql -q '!result 01bc3011-080f-f2d7-0001-c1be14bae7c2'
```

```output
+-----------------------+
| 'THIS IS ASYNC QUERY' |
|-----------------------|
| This is async query   |
+-----------------------+
```

### Abort an active SQL query

The `!abort` query command aborts an active query, given its query ID. You can obtain the query ID in the following ways:

* Check the [Query History page](../../../user-guide/ui-snowsight-activity.md) in Snowsight.
* Run the `!queries` SQL query command.
* Use the ID returned by an asynchronous query.

```snowcli
snow sql -q '!abort 01bc3011-080f-f2d7-0001-c1be14bae7c2'
```

```output
+-------------------------------------------------------------+
| SYSTEM$CANCEL_QUERY('01BC3011-080F-F2D7-0001-C1BE14BAE7C2') |
|-------------------------------------------------------------|
| Identified SQL statement is not currently executing.        |
+-------------------------------------------------------------+
```

### Open an external editor to modify and execute SQL commands

The `!edit` query command opens an external editor where you can modify SQL commands to execute when you exit the editor. The editor is specified in the `EDITOR` environment variable or, if the environment variable is not set, the default system editor is used.

To enter commands in an external editor, follow these steps:

1. If not already defined in your shell, set the `EDITOR` environment variable to your preferred text editor.
2. Enter the `snow sql` command:

   ```snowcli
   snow sql
   ```
3. At the `>` prompt, enter the `!edit` command:

   ```snowcli
   > !edit
   ```

   The command opens the specified text editor.
4. Enter your SQL commands in the editor, as shown:

   ```sqlexample
   SELECT current_user() ;
   ```
5. Save the file and exit the editor.

   The commands you entered are displayed, as shown:

   ```output
   ✓ Edited SQL loaded into prompt. Modify as needed or press Enter to execute.
   > select current_user();
   ```
6. To execute the commands, select `ENTER`.

   The command output is displayed, as shown:

   ```output
   +----------------+
   | CURRENT_USER() |
   |----------------|
   | USER1          |
   +----------------+
   ```

## Entering multiple commands in a single transaction

The `--single-transaction` option lets you enter multiple SQL commands to execute as an all-or-nothing set of commands.
By executing commands in a single transaction, you can ensure that all of the commands are completed successfully before committing any of the changes.
If any of the commands fail, none of the changes from the successful commands persist.

The following examples show successful and unsuccessful transactions:

* Successful command execution

  ```snowcli
  snow sql -q "insert into my_tbl values (123); insert into my_tbl values (124);" --single-transaction
  ```

  ```output
  BEGIN;
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+

  insert into my_tbl values (123);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  insert into my_tbl values (124);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  COMMIT
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+
  ```

  You can then verify that the commands were committed to the database:

  ```snowcli
  snow sql -q "select count(*) from my_tbl"
  ```

  ```output
  select count(*) from my_tbl
  +----------+
  | COUNT(*) |
  |----------|
  | 2        |
  +----------+
  ```
* Unsuccessful single transaction

  ```snowcli
  snow sql -c patcli -q "insert into my_tbl values (123); insert into my_tbl values (124); select BAD;" --single-transaction
  ```

  ```output
  BEGIN;
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+

  insert into my_tbl values (123);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  insert into my_tbl values (124);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  select BAD;
  ╭─ Error ───────────────────────────────────────────────────────────────────────────────╮
  │ 000904 (42000): 01bc3b84-0810-0247-0001-c1be14ee11ce: SQL compilation error: error    │
  │ line 1 at position 7                                                                  │
  │ invalid identifier 'BAD'                                                              │
  ╰───────────────────────────────────────────────────────────────────────────────────────╯
  ```

> You can then verify that the commands were not committed to the database:
>
> > ```snowcli
> > snow sql -q "select count(*) from my_tbl"
> > ```
> >
> > ```output
> > select count(*) from my_tbl
> > +----------+
> > | COUNT(*) |
> > |----------|
> > | 0        |
> > +----------+
> > ```

## Entering SQL commands in interactive mode

The `snow sql` command supports an interactive mode that lets you enter SQL commands one at a time. Interactive mode provides the following features:

* Syntax highlighting
* Code completion while typing
* Searchable history

  Pressing `CTRL-R`: lets you search your command history:
* Multi-line input

  Pressing `ENTER` on a line that does not end with a semicolon (`;`) moves the cursor to the next line for more commands until a statement ends with a semicolon.

To use interactive mode, enter the `snow sql` command followed by `ENTER`, as shown:

```snowcli
snow sql
```

The command opens a sub-shell with a `>` prompt where you can enter SQL commands interactively:

```output
$ snow sql
  ╭───────────────────────────────────────────────────────────────────────────────────╮
  │ Welcome to Snowflake-CLI REPL                                                     │
  │ Type 'exit' or 'quit' to leave                                                    │
  ╰───────────────────────────────────────────────────────────────────────────────────╯
  >
```

You can then enter SQL commands, as shown:

```snowcli
> create table my_table (c1 int);
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| Table MY_TABLE successfully created.|
+-------------------------------------+
```

> **Note:**
>
> You must end each SQL statement with a semicolon (`;`).

To exit interactive mode, enter `exit`, `quit`, or `CTRL-D`.

---
title: Initialize a Snowpark project
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/initialize.md
section: Snowflake CLI
---

# Initialize a Snowpark project

The first step when creating Snowpark projects is to create a project boilerplate. The `snow init` command creates a fully-functional boilerplate with the following structure:

```output
snowflake.yml      - project definition
requirements.txt   - project dependencies
app/               - code of functions and procedures
  __init__.py
  functions.py     - example functions
  procedures.py    - example procedures
  common.py        - example "shared library"
```

* The `snowflake.yml` file contains a [project definition](../project-definitions/about.md) that describes the project structure that the `snow snowpark` commands use.
* The `app` directory stores the project code. You can think about it as a Python module. All functions and procedures must reside in this directory.
* The `requirements.txt` file contains project dependencies. Snowflake CLI supports all requirement specifiers supported by `pip`, such as a package name, a URL for a package, or a local path.

  You can add more dependencies (such as previously deployed custom packages) as `imports` parameters in the function and procedure declarations in the [project definition](../project-definitions/about.md).

---
title: Installing Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation.md
section: Snowflake CLI
---

# Installing Snowflake CLI

This topic explains how to install Snowflake CLI on [supported platforms](../../../release-notes/requirements.md). Note that Snowflake CLI is not currently available for AIX systems.

Snowflake recommends using binary installation methods, such as package managers, to install Snowflake CLI on your system.
You can download the binary installers from the official [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html).

## Requirements

* Before using Snowflake CLI, you must have a valid Snowflake account.
* To run Streamlit in Snowflake using Snowflake CLI, you must have a Snowflake account with permission to use Streamlit.
* To run Snowpark Container Services in Snowflake using Snowflake CLI, you must have a Snowflake account with privileges to use Snowpark Container Services.

> **Tip:**
>
> If your Snowflake account requires MFA (multi-factor authentication), Snowflake CLI requires approval for every command. You can use MFA caching to require
> authentication only once every four hours. For more information, see [Use multi-factor authentication (MFA)](../connecting/configure-connections.md).

## Install Snowflake CLI using package managers

To install Snowflake CLI using platform-specific package managers, use one of the following procedures:

* Install using Linux package managers (rpm, deb).
* Install using MacOS installer.
* Install using Windows installer.
* Install using Homebrew.

### Install with Linux package managers

If you use a Linux operating system, you can install Snowflake CLI with package managers that support the following:

* `deb` packages,
* `rpm` packages.

To install Snowflake CLI using the `deb` package manager:

1. Download the Snowflake CLI `deb` from the [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html).
2. Install the package by running the following command:

   ```bash
   sudo dpkg -i snowflake-cli-<version>.deb
   ```

To install Snowflake CLI using the `rpm` package manager:

1. Download the Snowflake CLI `rpm` package from the [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html).
2. Install the package by running the following command:

   ```bash
   sudo rpm -i snowflake-cli-<version>.rpm
   ```
3. To verify that the software was installed successfully, run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
4. [Configure the Snowflake connection](../connecting/connect.md).

### Install with the MacOS package installer

To install Snowflake CLI on MacOS, do the following:

1. Download the Snowflake CLI installer from the [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html).
2. Run the installer and follow the instructions to install Snowflake CLI.
3. To verify that the software was installed successfully, open new terminal and run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
4. [Configure the Snowflake connection](../connecting/connect.md).

### Install with the Windows installer

To install Snowflake CLI on Windows, do the following:

1. Download the Snowflake CLI installer from the [Snowflake CLI repository](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html).
2. Run the installer and follow the instructions to install Snowflake CLI.
3. To verify that the software was installed successfully, open new terminal and run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
4. [Configure the Snowflake connection](../connecting/connect.md).

### Install with Homebrew

If you use a Mac operating system, you can install Snowflake CLI with [Homebrew](https://brew.sh/).

1. Install [Homebrew](https://brew.sh/), if necessary.
2. To give Homebrew access to the Snowflake CLI repository, run the following command:

   ```bash
   brew tap snowflakedb/snowflake-cli
   brew update
   ```
3. To install Snowflake CLI, run the following command:

   ```bash
   brew install snowflake-cli
   ```
4. To verify that the software was installed successfully, run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
5. [Configure the Snowflake connection](../connecting/connect.md).

## Advanced local installations

You can also install Snowflake CLI as a Python package using either of the following:

* pip (PyPi)
* pipx

Snowflake recommends installing as a Python package only for development purposes or when installing binaries isn’t possible in your environment.

### Install with pip (PyPi)

> **Note:**
>
> This method modifies the Python environment where you install Snowflake CLI. Consider using pipx instead to avoid dependency conflicts.

To install Snowflake CLI using `pip`, you must have [Python](https://python.org) version 3.10 or later installed.

1. Run the following shell command:

   ```bash
   pip install snowflake-cli
   ```
2. To verify that the software was installed successfully, run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
3. [Configure the Snowflake connection](../connecting/connect.md).

### Install with pipx

[pipx](https://github.com/pypa/pipx) provides an alternative to `pip` that installs and executes Python packages into isolated virtual environments. Installing Snowflake CLI with `pipx` does not, therefore, modify your current Python environment.

To install Snowflake CLI using `pipx`, you must have [pipx](https://github.com/pypa/pipx) installed.

1. Run the following shell command:

   ```bash
   pipx install snowflake-cli
   ```
2. To verify that the software was installed successfully, run the following command:

   ```bash
   snow --help
   ```

   ```output
   Usage: snow [OPTIONS] COMMAND [ARGS]...

   Snowflake CLI tool for developers.

   ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ --version                           Shows version of the Snowflake CLI                                                                   │
   │ --info                              Shows information about the Snowflake CLI                                                            │
   │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
   │ --install-completion                Install completion for the current shell.                                                            │
   │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
   │ --help                -h            Show this message and exit.                                                                          │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
   │ app          Manages a Snowflake Native App                                                                                              │
   │ connection   Manages connections to Snowflake.                                                                                           │
   │ cortex       Provides access to Snowflake Cortex.                                                                                        │
   │ git          Manages git repositories in Snowflake.                                                                                      │
   │ notebook     Manages notebooks in Snowflake.                                                                                             │
   │ object       Manages Snowflake objects like warehouses and stages                                                                        │
   │ snowpark     Manages procedures and functions.                                                                                           │
   │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
   │ sql          Executes Snowflake query.                                                                                                   │
   │ stage        Manages stages.                                                                                                             │
   │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
   ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
   ```
3. [Configure the Snowflake connection](../connecting/connect.md).

## Installing Snowflake CLI in FIPS-compliant environments

You can use a Docker image to install Snowflake CLI in an environment that is compliant with FIPS (Federal Information Processing Standards).

### Prerequisites

Before installing Snowflake CLI in a FIPS-compliant environment, ensure that you meet the following prerequisites:

* **FIPS-compliant Python**: Python must be preinstalled, built, and configured for FIPS compliance. This typically means Python is linked against a FIPS-enabled OpenSSL library.
* **FIPS-enabled OpenSSL**: The system’s OpenSSL libraries must be FIPS-compliant and available to Python at runtime.
* **Build tools**: Standard build tools (such as a C compiler and Python development headers) must be available, as dependencies will be built from source.
* **Network Access**: The environment must allow access to PyPI or your internal package index for downloading source distributions.

### Install Snowflake CLI in a FIPS-compliant Dockerfile

To install Snowflake CLI in a FIPS-compliant environment, follow these steps:

1. Create a Python virtual environment in the container, as shown in the following example:

   ```bash
   python -m venv .venv
   ```
2. Activate the Python virtual environment in the container, as shown in the following example:

   ```bash
   source ~/.venv/bin/activate
   ```
3. Upgrade `pip` and `setuptools` in the container, as shown in the following example:

   ```bash
   pip install -U setuptools pip
   ```
4. Install the cryptography, Python connector, and Snowflake CLI dependencies from source in the container, as shown in the following example. Note that all dependencies must be installed from source to ensure they are built against your FIPS-compliant libraries.

   ```bash
   pip install cryptography==44.0.3 --no-binary cryptography
   pip install -U snowflake-connector-python[secure-local-storage] --no-binary snowflake-connector-python[secure-local-storage]
   pip install -U snowflake-cli --no-binary snowflake-cli
   ```

   The `--no-binary` option forces installation from source, ensuring that the builds use FIPS-ready libraries.

### Validate the Docker image

To confirm that your Python environment uses a FIPS-enabled OpenSSL library, enter the following command in the running container:

```bash
python -c "import ssl; print(ssl.OPENSSL_VERSION)"
```

After installing Snowflake CLI and validating the Docker image, you can use Snowflake CLI in the container.

```bash
snow <your-command>
```

where <*your-command*> is any valid Snowflake CLI command, such as `snow --help`.

## Install command auto-completion functionality

Snowflake CLI supports standard shell tab completion functionality.

To install auto-completion into Snowflake CLI, perform the following steps:

1. Run the `snow --install-completion` command:

   ```snowcli
   snow --install-completion
   ```

   ```output
   zsh completion installed in <user home>/.zfunc/_snow
   Completion will take effect once you restart the terminal
   ```
2. Run the `snow --show-completion` command to generate the commands you need to add to your shell profile (`.bashrc`, `.bash_profile`, `.zshrc`, and others):

   ```bash
   snow --show-completion
   ```

   ```output
   _snow_completion() {
      local IFS=$'
   '
      COMPREPLY=( $( env COMP_WORDS="${COMP_WORDS[*]}" \
                     COMP_CWORD=$COMP_CWORD \
                     _SNOW_COMPLETE=complete_bash $1 ) )
      return 0
   }

   complete -o default -F _snow_completion snow
   ```
3. Select and copy the command output text.
4. Open your shell profile file, `.bashrc` in this example, and paste the copied text:

   ```output
   export SHELL=/bin/bash

   ...

   _snow_completion() {
      local IFS=$'
   '
      COMPREPLY=( $( env COMP_WORDS="${COMP_WORDS[*]}" \
                     COMP_CWORD=$COMP_CWORD \
                     _SNOW_COMPLETE=complete_bash $1 ) )
      return 0
   }

   complete -o default -F _snow_completion snow
   ```
5. Save the file.
6. To activate the tab-completion functionality, restart your shell or `source` your shell profile file, such as:

   ```bash
   source ~/.bashrc
   ```
7. To test the feature, enter a snow command followed by a `TAB`, as shown:

   ```bash
   snow app [TAB]
   ```

   ```output
   deploy    init      open      run       teardown  version
   ```

---
title: Integrating CI/CD with Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/cicd/integrate-ci-cd.md
section: Snowflake CLI
---

# Integrating CI/CD with Snowflake CLI

Snowflake CLI integrates popular CI/CD (continuous integration and continuous delivery) systems and frameworks, such as [GitHub Actions](https://github.com/features/actions), to efficiently automate your Snowflake workflows for SQL, Snowpark, Native Apps, or Notebooks.

The following illustration shows a typical CI/CD workflow in Snowflake CLI.

## CI/CD workflow steps

1. **Store:** Configure a remote Git repository to manage your Snowflake files securely.
2. **Code:** Develop your Snowflake code using an IDE or Snowsight, tailored to your preferences.
3. **Install:** [Install](../installation/installation.md) Snowflake CLI, and provision your preferred CI/CD provider, such as GitHub Actions.
4. **Deploy:** Automate deployment by combining the Snowflake CLI with your selected CI/CD tool.
5. **Monitor:** Track code and workflow performance in Snowflake using [Snowflake Trail](https://www.snowflake.com/en/product/features/snowflake-trail/) for real-time insights.
6. **Iterate:** Apply small, frequent updates to your project for continuous improvement; smaller changes simplify management and rollback, if necessary.

## CI/CD with GitHub Actions

A Snowflake CLI action is a GitHub action designed to integrate Snowflake CLI into CI/CD pipelines. You can use it to automate execution of Snowflake CLI commands within your GitHub workflows. For more information, see the [snowflake-cli-action](https://github.com/snowflakedb/snowflake-cli-action) repository.

## Using Snowflake CLI actions

Github Actions streamlines the process of installing and using Snowflake CLI in your CI/CD workflows. The CLI is installed in an
isolated way, ensuring that it won’t conflict with the dependencies of your project. It automatically sets up
the input configuration file within the `~/.snowflake/` directory.

The action enables automation of your Snowflake CLI tasks, such as deploying Snowflake Native Apps or running Snowpark scripts within your Snowflake environment.

### Input parameters

A Snowflake CLI action uses the following inputs from your Github workflow YAML file, such as `<repo-name>/.github/workflows/my-workflow.yaml`:

* `cli-version`: The specified Snowflake CLI version, such as `3.11.0`. If not provided, the latest version of the Snowflake CLI is used.
* `custom-github-ref`: The branch, tag, or commit in the Github repository that you want to install Snowflake CLI directly from.

  > **Note:**
  >
  > You cannot use both `cli-version` and `custom-github-ref` together; specify only one of these parameters.
* `default-config-file-path`: Path to the configuration file (`config.toml`) in your repository. The path must be relative to the root of the repository. The configuration file is not required when a temporary connection (`-x` option) is used. For more information, see [Managing Snowflake connections](../connecting/configure-connections.md).
* `use-oidc`: Boolean flag to enable OIDC authentication. When set to `true`, the action configures the CLI to use GitHub’s OIDC token for authentication with Snowflake, eliminating the need for storing private keys as secrets. Default is `false`.

### Install Snowflake CLI from a GitHub branch or tag

* To install Snowflake CLI from a specific branch, tag, or commit in the GitHub repository (for example, to test unreleased features or a fork), use the following configuration:

```yaml
- uses: snowflakedb/snowflake-cli-action@v2.0
  with:
    custom-github-ref: "feature/my-branch" # or a tag/commit hash
```

You can also include other input parameters.

This feature is available in snowflake-cli-action version 1.6 or later.

### Safely configure the action in your CI/CD workflow

You can safely configure the action in your CI/CD workflow by using either of the following methods:

* Use workload identity federation (WIF) OpenID Connect (OIDC) authentication
* Use private key authentication

#### Use workload identity federation (WIF) OpenID Connect (OIDC) authentication

> **Note:**
>
> WIF OIDC authentication requires Snowflake CLI version 3.11.0 or later.

WIF OIDC authentication provides a secure and modern way to authenticate with Snowflake without storing private keys as secrets. This approach uses GitHub’s OIDC (OpenID Connect) token to authenticate with Snowflake.

To set up WIF OIDC authentication, follow these steps:

1. Configure WIF OIDC by setting up a service user with the OIDC workload identity type:

   ```sqlexample
   CREATE USER <username>
   TYPE = SERVICE
   WORKLOAD_IDENTITY = (
     TYPE = OIDC
     ISSUER = 'https://token.actions.githubusercontent.com'
     SUBJECT = '<your_subject>'
   )
   ```

> **Note:**
> > By default, your subject should look like `repo:<repository-owner/repository-name>:environment:<environment>`.
>
> * To simplify generation of the subject, use `gh` command, where `<environment_name>` is the environment defined in your repository settings, as shown in the following example:
>
> > ```bash
> > gh repo view <repository-owner/repository-name> --json nameWithOwner | jq -r '"repo:\(.nameWithOwner):environment:<environment_name>"'
> > ```
> >
> > For more information about customizing your subject, see the [OpenID Connect](https://docs.github.com/en/actions/reference/security/oidc) reference on GitHub.

1. Store your Snowflake account identifier in GitHub secrets. For more information, see [GitHub Actions documentation](https://docs.github.com/en/actions/security-guides/using-secrets-in-github-actions#creating-secrets-for-a-repository).
2. Configure the Snowflake CLI action in your GitHub workflow YAML file, as shown:

   ```yaml
   name: Snowflake OIDC
   on: [push]

   permissions:
     id-token: write  # Required for OIDC token generation
     contents: read

   jobs:
     oidc-job:
       runs-on: ubuntu-latest
       environment: test-env # this should match the environment used in the subject
       steps:
         - uses: actions/checkout@v4
           with:
             persist-credentials: false
         - name: Set up Snowflake CLI
           uses: snowflakedb/snowflake-cli-action@v2.0
           with:
             use-oidc: true
             cli-version: "3.11"
         - name: test connection
           env:
             SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
           run: snow connection test -x
   ```

   For more information about setting up WIF OIDC authentication for your Snowflake account and configuring the GitHub OIDC provider, see [Workload identity federation](../../../user-guide/workload-identity-federation.md).

#### Use private key authentication

To use private key authentication, you need to store your Snowflake private key in GitHub secrets and configure the Snowflake CLI action to use it.

1. Store your Snowflake private key in GitHub secrets.

For more information, see [GitHub Actions documentation](https://docs.github.com/en/actions/security-guides/using-secrets-in-github-actions#creating-secrets-for-a-repository).

2. Configure the Snowflake CLI action in your GitHub workflow YAML file, as shown:

   ```yaml
   name: Snowflake Private Key
   on: [push]

   jobs:
     private-key-job:
       runs-on: ubuntu-latest
       steps:
         - uses: actions/checkout@v4
           with:
             persist-credentials: false
         - name: Set up Snowflake CLI
           uses: snowflakedb/snowflake-cli-action@v2.0
   ```

## Defining connections

You can define a GitHub action to connect to Snowflake with a temporary connection or with a connection defined in your configuration file. For more information about managing connections, see [Managing Snowflake connections](../connecting/configure-connections.md).

### Use a temporary connection

For more information about temporary connections, see [Use a temporary connection](../connecting/configure-connections.md).

To set up your Snowflake credentials for a temporary connection, follow these steps:

1. Map secrets to environment variables in your GitHub workflow, in the form `SNOWFLAKE_<key>=<value>`, as shown:

   ```yaml
   env:
     SNOWFLAKE_PRIVATE_KEY_RAW: ${{ secrets.SNOWFLAKE_PRIVATE_KEY_RAW }}
     SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
   ```
2. Configure the Snowflake CLI action.

   If you use the latest version of Snowflake CLI, you do not need to include the `cli-version` parameter. The following example instructs the action to use Snowflake CLI version 3.11.0 specifically:

   ```yaml
   - uses: snowflakedb/snowflake-cli-action@v2.0
     with:
       cli-version: "3.11.0"
   ```
3. Optional: If your private key is encrypted, to set up a passphrase, set the PRIVATE_KEY_PASSPHRASE environment variable to the private key passphrase. Snowflake uses this passphrase to decrypt the private key. For example:

   ```yaml
   - name: Execute Snowflake CLI command
     env:
       PRIVATE_KEY_PASSPHRASE: ${{ secrets.PASSPHARSE }}
   ```

   To use a password instead of a private key, unset the `SNOWFLAKE_AUTHENTICATOR` environment variable, and add the `SNOWFLAKE_PASSWORD` variable, as follows:

   ```yaml
   - name: Execute Snowflake CLI command
     env:
       SNOWFLAKE_USER: ${{ secrets.SNOWFLAKE_USER }}
       SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
       SNOWFLAKE_PASSWORD: ${{ secrets.SNOWFLAKE_PASSWORD }}
   ```

   > **Note:**
   >
   > To enhance your experience when using a password and MFA, Snowflake recommends that you [configure MFA caching](../connecting/configure-connections.md).

   For more information about setting Snowflake credentials in environment variables, see [Use environment variables for Snowflake credentials](../connecting/configure-connections.md), and for information about defining environment variables within your GitHub CI/CD workflow, see [Defining environment variables for a single workflow](https://docs.github.com/en/actions/learn-github-actions/variables#defining-environment-variables-for-a-single-workflow).
4. Add the `snow` commands you want to execute with the temporary connection, as shown:

   ```yaml
   run: |
     snow --version
     snow connection test --temporary-connection
   ```

The following example shows a completed sample `<repo-name>/.github/workflows/my-workflow.yaml` file:

```yaml
name: deploy
on: [push]

jobs:
  version:
    name: "Check Snowflake CLI version"
    runs-on: ubuntu-latest
    steps:
      # Snowflake CLI installation
      - uses: snowflakedb/snowflake-cli-action@v2.0

        # Use the CLI
      - name: Execute Snowflake CLI command
        env:
          SNOWFLAKE_AUTHENTICATOR: SNOWFLAKE_JWT
          SNOWFLAKE_USER: ${{ secrets.SNOWFLAKE_USER }}
          SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
          SNOWFLAKE_PRIVATE_KEY_RAW: ${{ secrets.SNOWFLAKE_PRIVATE_KEY_RAW }}
          PRIVATE_KEY_PASSPHRASE: ${{ secrets.PASSPHARSE }} # Passphrase is only necessary if private key is encrypted.
        run: |
          snow --help
          snow connection test -x
```

After verifying that your action can connect to Snowflake successfully, you can add more Snowflake CLI commands like `snow notebook create` or `snow git execute`. For information about supported commands, see [Snowflake CLI command reference](../command-reference/overview.md).

### Use a configuration file

For more information about defining connections, see [Define connections](../connecting/configure-connections.md).

To set up your Snowflake credentials for a specific connection, follow these steps:

1. Create a `config.toml` file at the root of your Git repository with an empty configuration connection, as shown:

   ```toml
   default_connection_name = "myconnection"

   [connections.myconnection]
   ```

   This file serves as a template and should not contain actual credentials.
2. Map secrets to environment variables in your GitHub workflow, in the form `SNOWFLAKE_<key>=<value>`, as shown:

   ```yaml
   env:
     SNOWFLAKE_CONNECTIONS_MYCONNECTION_PRIVATE_KEY_RAW: ${{ secrets.SNOWFLAKE_PRIVATE_KEY_RAW }}
     SNOWFLAKE_CONNECTIONS_MYCONNECTION_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
   ```
3. Configure the Snowflake CLI action.

   If you use the latest version of Snowflake CLI, you do not need to include the `cli-version` parameter. The following example specifies a desired version and the name of your default configuration file:

   ```yaml
   - uses: snowflakedb/snowflake-cli-action@v2.0
     with:
       cli-version: "3.11.0"
       default-config-file-path: "config.toml"
   ```
4. Optional: If your private key is encrypted, to set up a passphrase, set the PRIVATE_KEY_PASSPHRASE environment variable to the private key passphrase. Snowflake uses this passphrase to decrypt the private key. For example:

   ```yaml
   - name: Execute Snowflake CLI command
     env:
       PRIVATE_KEY_PASSPHRASE: ${{ secrets.PASSPHARSE }}
   ```

   To use a password instead of a private key, unset the `SNOWFLAKE_AUTHENTICATOR` environment variable, and add the `SNOWFLAKE_PASSWORD` variable, as follows:

   ```yaml
   - name: Execute Snowflake CLI command
     env:
       SNOWFLAKE_CONNECTIONS_MYCONNECTION_USER: ${{ secrets.SNOWFLAKE_USER }}
       SNOWFLAKE_CONNECTIONS_MYCONNECTION_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
       SNOWFLAKE_CONNECTIONS_MYCONNECTION_PASSWORD: ${{ secrets.SNOWFLAKE_PASSWORD }}
   ```

   > **Note:**
   >
   > To enhance your experience when using a password and MFA, Snowflake recommends that you [configure MFA caching](../connecting/configure-connections.md).
5. Add the `snow` commands you want to execute with a named connection, as shown:

   ```yaml
   run: |
     snow --version
     snow connection test
   ```

The following example shows a sample `config.toml` file in your Git repository and a completed sample `<repo-name>/.github/workflows/my-workflow.yaml` file:

* Sample `config.toml` file:

  ```toml
  default_connection_name = "myconnection"

  [connections.myconnection]
  ```
* Sample Git workflow file:

  ```yaml
  name: deploy
  on: [push]
  jobs:
    version:
      name: "Check Snowflake CLI version"
      runs-on: ubuntu-latest
      steps:
        # Checkout step is necessary if you want to use a config file from your repo
        - name: Checkout repo
          uses: actions/checkout@v4
          with:
            persist-credentials: false

          # Snowflake CLI installation
        - uses: snowflakedb/snowflake-cli-action@v2.0
          with:
            default-config-file-path: "config.toml"

          # Use the CLI
        - name: Execute Snowflake CLI command
          env:
            SNOWFLAKE_CONNECTIONS_MYCONNECTION_AUTHENTICATOR: SNOWFLAKE_JWT
            SNOWFLAKE_CONNECTIONS_MYCONNECTION_USER: ${{ secrets.SNOWFLAKE_USER }}
            SNOWFLAKE_CONNECTIONS_MYCONNECTION_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
            SNOWFLAKE_CONNECTIONS_MYCONNECTION_PRIVATE_KEY_RAW: ${{ secrets.SNOWFLAKE_PRIVATE_KEY_RAW }}
            PRIVATE_KEY_PASSPHRASE: ${{ secrets.PASSPHARSE }} #Passphrase is only necessary if private key is encrypted.
          run: |
            snow --help
            snow connection test
  ```

After verifying that your action can connect to Snowflake successfully, you can add more Snowflake CLI commands like `snow notebook create` or `snow git execute`. For information about supported commands, see [Snowflake CLI command reference](../command-reference/overview.md).

---
title: Introducing Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/introduction/introduction.md
section: Snowflake CLI
---

# Introducing Snowflake CLI

Snowflake CLI’s open-source nature means that developers can leverage the community’s collective knowledge and contributions to
improve and enhance the tool. By using Snowflake CLI, developers can expect a streamlined, efficient experience that empowers
them to work with Snowflake in new and innovative ways. Snowflake CLI is a powerful and flexible tool that helps developers
streamline their workflow and optimize their Snowflake experience.

As a command-line interface (CLI), Snowflake CLI provides several benefits for developers, such as:

* Speed and efficiency

  A CLI allows developers to perform tasks quickly and efficiently by executing commands from the
  terminal without needing a graphical user interface. This can save developers significant time and effort, especially
  when performing repetitive or complex tasks.
* Automation

  A CLI can automate tasks and workflows, such as building, testing, CI/CD, and deploying applications.
  CLI can help developers streamline their development process and reduce the risk of errors or inconsistencies.
* Portability

  A CLI is often platform-independent and can be used across different operating systems and environments.
  Developers can work more easily on multiple projects or collaborate with others who use different systems.
* Version control

  A CLI can be integrated with version control systems like Git to manage changes and track code
  history, which can help developers collaborate more effectively, resolve conflicts, and document changes appropriately.
* Customization

  A CLI can be customized and extended by using modules and scripts, so developers can tailor it to
  their needs and preferences. Automating common tasks and workflows can help developers work more efficiently
  and effectively.
* Accessibility

  CLI can be accessed remotely, so developers can work on servers and other remote
  systems without a graphical interface.

## How does Snowflake CLI differ from SnowSQL?

SnowSQL is the command-line client for connecting to Snowflake to execute SQL queries and perform all DDL and DML
operations, including loading data into and unloading data out of database tables.

The Snowflake CLI command-line client, in contrast, focuses primarily on managing workloads and applications that connect
to Snowflake. Snowflake CLI lets you locally
run and debug Snowflake apps, with the following benefits:

* You can search, create, and upload Python packages that might not be supported in Anaconda yet.
* Snowflake CLI supports Snowpark Python user-defined functions and stored procedures, warehouses, and Streamlit apps.
* You can define packages by using `requirements.txt`, with dependencies automatically added
  through integration with Anaconda at deployment time.
* Snowflake CLI can include packages that are identified in `requirements.txt`—but aren’t yet in
  Anaconda—in the application package deployed to Snowflake.
  (This feature only works with packages that don’t rely on native libraries).
* When you update existing applications, code and dependencies are automatically altered as needed.
* Deployment artifacts are automatically managed and uploaded to Snowflake stages.

Snowflake plans to continue enhancing Snowflake CLI to provide developers a robust tool for leveraging all of the SnowSQL capabilities in a new open source CLI.

---
title: Listing all versions defined in an application package
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/list-app-package-version.md
section: Snowflake CLI
---

# Listing all versions defined in an application package

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.

## How to list all versions defined in an application package

The [snow app version list](../command-reference/native-apps-commands/version/app-version-list.md) command lists all versions defined in an application package. This command uses the resolved project definition to determine the name of the application package for which the versions will be listed.

To list the versions of an application package, do the following:

1. [Create a connection](../connecting/connect.md), if necessary.
2. Execute the `snow app version list` command from within your project, similar to the following:

   ```snowcli
   snow app version list --connection="dev"
   ```

When successful, the command displays a list of existing versions in the default format or in the format specified by you through the command line.
For more information about listing the versions of an application package, see the [snow app version list](../command-reference/native-apps-commands/version/app-version-list.md) command.

---
title: Listing the contents of a repository
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/list-contents.md
section: Snowflake CLI
---

# Listing the contents of a repository

Snowflake CLI supports the following ways to list the contents of a Git repository:

* List branches in a repository
* List tags in a repository
* List files in a repository

## List branches in a repository

The `snow git list-branches` command lists all of the branches in a repository.

```bash
snow git list-branches <REPO_NAME>
```

where:

* `<REPO_NAME>` is the ID of the repository stage.

For example, to list all of the branches in a repository named `my_snow_git`, enter the following command:

```bash
snow git list-branches my_snow_git
```

```output
show git branches in my_snow_git
+--------------------------------------------------------------------------------------------------------------------------------------------+
| name                                     | path                                     | checkouts | commit_hash                              |
|------------------------------------------+------------------------------------------+-----------+------------------------------------------|
| SNOW-1011750-service-create-options      | /branches/SNOW-1011750-service-create-op |           | 729855df0104c8d0ef1c7a3e8f79fe50c6c8d2fa |
|                                          | tions                                    |           |                                          |
| SNOW-1011775-containers-to-spcs-int-test | /branches/SNOW-1011775-containers-to-spc |           | e81b00de6b0eb73a99a7baaa39b0afa5ea1202d0 |
| s                                        | s-int-tests                              |           |                                          |
| SNOW-1105629-git-integration-tests       | /branches/SNOW-1105629-git-integration-t |           | 712b07b5e692624c34caabe07d64801615ce5f0f |
+--------------------------------------------------------------------------------------------------------------------------------------------+
```

## List tags in a repository

The `snow git list-tabs` command lists all of the tags in a repository.

```bash
snow git list-tags <REPO_NAME>
```

where:

* `<REPO_NAME>` is the ID of the repository stage you want to create. Note that if the repository stage already exists, the command fails.

For example, to list all of the tags in a repository named `my_snow_git`, enter the following command:

```bash
snow git list-tags my_snow_git
```

```output
show git tags in my_snow_git
+--------------------------------------------------------------------------------------------------------------+
| name           | path                 | commit_hash                 | author                       | message |
|----------------+----------------------+-----------------------------+------------------------------+---------|
| v2.0.0rc3      | /tags/v2.0.0rc3      | 2b019d2841da823d8001f23c6f3 | None                         | None    |
|                |                      | 064e5899142a0               |                              |         |
| v2.1.0-rc0     | /tags/v2.1.0-rc0     | 829887b758b43b86959611dd612 | None                         | None    |
|                |                      | 7638da75cf871               |                              |         |
| v2.1.0-rc1     | /tags/v2.1.0-rc1     | b7efe1fe9c0925b95ba214e233b | None                         | None    |
|                |                      | 18924fa0404b3               |                              |         |
+--------------------------------------------------------------------------------------------------------------+
```

## List files in a repository

The `snow git list-files` command lists all of the files on a specified repository state (a specific branch, tag or commit).

```bash
snow git list-files <REPO_PATH>
```

where:

* `<REPO_PATH>` is a stage path with a specific scope where the value is the repository name is followed by a suffix specifying which branch, tag or commit. The following lists some different types of values:

  + `@snowcli_git/branches/main/` refers to last commit of the `main` branch.
  + `@snowcli_git/tags/v2.1.0/` refers to a commit tagged `v2.1.0`.
  + `@snowcli_git/commits/1e939d69ca6fd0f89074e7e97c9fd1/` refers to a specific commit. Commit hashes should be between 6 and 40 characters long.

  A repository path can also be a subdirectory or file in the repository, but still must be preceded with a scope prefix.

The following example lists all of the files in the `my_snow_git` repository marked with the `v2.0.0` tag:

```bash
snow git list-files @my_snow_git/tags/v2.0.0/
```

```output
ls @snowcli_git/tags/v2.0.0/
+---------------------------------------------------------------------------------------------------------------------------------+
| name                                    | size | md5  | sha1                                     | last_modified                |
|-----------------------------------------+------+------+------------------------------------------+------------------------------|
| snowcli_git/tags/v2.0.0/CONTRIBUTING.md | 5472 | None | 1cc437b88d20afe4d5751bd576114e3b20be27ea | Mon, 5 Feb 2024 13:16:25 GMT |
| snowcli_git/tags/v2.0.0/LEGAL.md        | 251  | None | 4453da50b7a2222006289ff977bfb23583657214 | Mon, 5 Feb 2024 13:16:25 GMT |
| snowcli_git/tags/v2.0.0/README.md       | 1258 | None | bdc918baae93467c258c6634c872ca6bd4ee1e9c | Mon, 5 Feb 2024 13:16:25 GMT |
| snowcli_git/tags/v2.0.0/SECURITY.md     | 308  | None | 27e7e1b2fd28a86943b3f4c0a35a931577422389 | Mon, 5 Feb 2024 13:16:25 GMT |
| ...
+---------------------------------------------------------------------------------------------------------------------------------+
```

The following example lists all of the files in the `tests/` directory of the `my_snow_git` repository marked with the `v2.0.0` tag:

```bash
snow git list-files @my_snow_git/tags/v2.0.0/tests --pattern ".*\.toml"
```

```output
ls @snowcli_git/tags/v2.0.0/tests pattern = '.*\.toml'
+-----------------------------------------------------------------------------------------------------------------------------------------+
| name                                            | size | md5  | sha1                                     | last_modified                |
|-------------------------------------------------+------+------+------------------------------------------+------------------------------|
| snowcli_git/tags/v2.0.0/tests/empty_config.toml | 0    | None | e69de29bb2d1d6434b8b29ae775ad8c2e48c5391 | Mon, 5 Feb 2024 13:16:25 GMT |
| snowcli_git/tags/v2.0.0/tests/test.toml         | 381  | None | 45f1c00f16eba1b7bc7b4ab2982afe95d0161e7f | Mon, 5 Feb 2024 13:16:25 GMT |
+-----------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: Manage your Snowpark functions and procedures
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/manage.md
section: Snowflake CLI
---

# Manage your Snowpark functions and procedures

> * Add and modify functions and procedures using the [Build a Snowpark project](build.md) and [Deploy a Snowpark project](deploy.md) processes.
> * List functions and procedures to which you have access using the `snow snowpark list functions` and `snow snowpark list procedures` commands. For more information, see [List all objects of a specific type](../objects/manage-objects.md).
> * View details of a function or procedure using the `snow snowpark describe function [IDENTIFIER]` and `snow snowpark describe procedure [IDENTIFIER]` commands. For more information, see [Display the description for an object of a specified type](../objects/manage-objects.md).
> * Delete function/procedure using the `snow snowpark drop function [IDENTIFIER]` and `snow snowpark drop procedure [IDENTIFIER]` commands. For more information, see [Delete an object of a specified type](../objects/manage-objects.md).
> * Execute functions and procedures using the `snow snowpark execute` command. For more information, see [Execute a Snowpark procedure or function](execute.md).

---
title: Managing compute pools
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/services/manage-compute-pools.md
section: Snowflake CLI
---

# Managing compute pools

A compute pool is a collection of one or more virtual machine (VM) nodes on which Snowflake runs your
Snowpark Container Services jobs and services.

For more information about compute pools, see [Snowpark Container Services: Working with compute pools](../../snowpark-container-services/working-with-compute-pool.md).

This topic shows how to do the following tasks with services:

* Create a compute pool
* Create a compute pool from a project definition
* Suspend and resume a compute pool
* Set and unset a compute pool’s properties or parameters
* Stop all services in a compute pool

For common operations, such as listing or dropping, Snowflake CLI uses `snow object` commands as described in [Managing Snowflake objects](../objects/manage-objects.md).

## Create a compute pool

To create a compute pool named “pool_1” composed of two CPUs with 4 GB of memory, enter a
[spcs pool create](../command-reference/spcs-commands/compute-pool-commands/create.md) command similar to the following:

```snowcli
snow spcs compute-pool create "pool_1" --min-nodes 2 --max-nodes 2 --family "CPU_X64_XS"
```

For more information about instance families, see the SQL `CREATE COMPUTE POOL` command.

## Create a compute pool from a project definition

You can create a compute pool from a `snowflake.yml` project definition file and then executing the `snow spcs compute-pool deploy` command.

The following shows a sample `snowflake.yml` project definition file:

```yaml
definition_version: 2
entities:
  my_compute_pool:
    type: compute-pool
    identifier:
      name: my_compute_pool
    min_nodes: 1
    max_nodes: 2
    instance_family: CPU_X64_XS
    auto_resume: true
    initially_suspended: true
    auto_suspend_seconds: 60
    comment: "My compute pool"
    tags:
      - name: my_tag
        value: tag_value
```

The following table describes the properties of a compute pool project definition.

Compute pool project definition properties

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `compute-pool`. |
| **identifier**  *optional*, *string* | Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-compute-pool   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (for example, `"My Compute Pool"`). * Object  ```yaml   identifier:     name: my-compute-pool     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully qualified name in the `name` property (such as `mydb.schema1.my-app`). |
| **instance_family**  *required*, *string* | Name of the instance family. For a list of available instance families, see the [CREATE COMPUTE POOL INSTANCE_FAMILY](../../../sql-reference/sql/create-compute-pool.md) parameter. |
| **min_nodes**  *optional*, *string* | Minimum number of nodes for the compute pool. This value must be greater than 0.  Default: `1` |
| **max_nodes**  *optional*, *int* | Maximum number of nodes for the compute pool. |
| **auto_resume**  *optional*, *boolean* | Whether to automatically resume a compute pool when a service or job is submitted to it.  Default: `True` |
| **initially_suspended**  *optional*, *boolean* | Whether the compute pool is created initially in the suspended state. If `true`, Snowflake doesn’t provision any nodes requested for the compute pool at the compute pool creation time.  Default: `False` |
| **auto_suspend_seconds**  *optional*, *int* | Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool.  Default: `3600` |
| **comment**  *optional*, *string* | Comments to associate with the compute pool. |
| **tags**  *optional*, *Tag sequence* | Tag names and values for the compute pool. For more information, see [Tag quotas](../../../user-guide/object-tagging/introduction.md) |

To create and deploy the compute pool to a stage, do the following:

1. Change your current directory to the directory containing the project definition file.
2. Run a `snow spcs compute-pool deploy` command similar to the following:

   ```snowcli
   snow spcs compute-pool deploy
   ```

   ```output
   +---------------------------------------------------------------------+
   | key    | value                                                      |
   |--------+------------------------------------------------------------|
   | status | Compute pool MY_COMPUTE_POOL successfully created.         |
   +---------------------------------------------------------------------+
   ```

## Suspend and resume a compute pool

> **Note:**
>
> The current role must have OPERATE privilege on the compute pool to suspend or resume it.

To suspend a compute pool, enter a command similar to the following:

```snowcli
snow spcs compute-pool suspend tutorial_compute_pool
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

To resume a suspended compute pool, enter a command similar to the following:

```snowcli
snow spcs compute-pool resume tutorial_compute_pool
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

## Set and unset a compute pool’s properties or parameters

> **Note:**
>
> The current role must have MODIFY privilege on the compute pool to set properties.

To set a property or parameter, enter a command similar to the following:

```snowcli
snow spcs compute-pool set tutorial_compute_pool --min-nodes 2 --max-nodes 4
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

To reset a property or parameter to its default value, enter a command similar to the following:

```snowcli
snow spcs compute-pool unset tutorial_compute_pool --auto-resume
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

## Stop all services in a compute pool

Stopping a compute pool deletes all of the services running on the compute pool; however, it does not stop the compute pool itself.

To stop a compute pool, enter a [spcs compute-pool stop-all](../command-reference/spcs-commands/compute-pool-commands/stop-all.md) command similar to the following:

```snowcli
snow spcs compute-pool stop-all "pool_1"
```

---
title: Managing data pipelines in Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/data-pipelines/data-pipelines.md
section: Snowflake CLI
---

# Managing data pipelines in Snowflake CLI

* [Managing dbt Projects on Snowflake using Snowflake CLI](dbt-projects.md)
* [Managing DCM projects using Snowflake CLI](dcm-projects.md)

---
title: Managing dbt Projects on Snowflake using Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/data-pipelines/dbt-projects.md
section: Snowflake CLI
---

# Managing dbt Projects on Snowflake using Snowflake CLI

> **Note:**
>
> The dbt Projects on Snowflake features in Snowflake CLI are available only in version 3.13.0 or later.

You can use Snowflake CLI to manage dbt projects with the following operations:

* Deploying a dbt project object
* Listing all available dbt project objects
* Executing a dbt project object command
* Describing a dbt project object
* Dropping a dbt project object

## Deploying a dbt project object

The [snow dbt deploy](../command-reference/dbt-commands/deploy.md) command uploads local files to a temporary stage and creates a new dbt project object, updates it by
making a new version, or completely recreates it. A valid dbt project must contain two files:

* `dbt_project.yml`: A standard dbt configuration file that specifies the profile to use.
* `profiles.yml`: A dbt connection profile definition referenced in `dbt_project.yml`. `profiles.yaml` must define the database, role, schema, and type.

  + By default, dbt Projects on Snowflake uses your target schema (`target.schema`) specified from your dbt environment or profile. Unlike dbt Core behavior, the target schema specified in the `profiles.yml`
    file must exist before you create your dbt Project in order for it to compile or execute successfully.

  ```yaml
  <profile_name>:
  target: dev
  outputs:
    dev:
      database: <database_name>
      role: <role_name>
      schema: <schema_name>
      type: snowflake
  ```

The following examples illustrate how to use the `snow dbt deploy` command:

* Deploy a dbt project object named `jaffle_shop`:

  ```snowcli
  snow dbt deploy jaffle_shop
  ```
* Deploy a project named `jaffle_shop` from a specified directory and create or add a new version depending on whether the dbt project object already exists:

  ```snowcli
  snow dbt deploy jaffle_shop --source /path/to/dbt/directory --profiles-dir ~/.dbt/ --force
  ```
* Deploy a project named `jaffle_shop` from a specified directory using a custom profiles directory, a specific dbt version, and enabling [external access integrations](../../external-network-access/creating-using-external-network-access.md):

  ```snowcli
  snow dbt deploy jaffle_shop --source /path/to/dbt/directory
  --profiles-dir ~/.dbt/
  --default-target prod
  --dbt-version 1.10.15
  --external-access-integration dbthub-integration
  --external-access-integration github-integration
  --force
  ```
* Deploy a project named `jaffle_shop` and set a specific version for the dbt project object:

  ```snowcli
  snow dbt deploy jaffle_shop --dbt-version '1.10.15'
  ```

## Listing all available dbt project objects

The [snow dbt list](../command-reference/dbt-commands/list.md) command lists all available dbt project objects on Snowflake.

The following examples illustrate how to use the `snow dbt list` command:

* List all available dbt project objects:

  ```snowcli
  snow dbt list
  ```
* List dbt project objects in the `product` database whose names begin with `JAFFLE`:

  ```snowcli
  snow dbt list --like JAFFLE% --in database product
  ```

## Executing a dbt project object command

The [snow dbt execute](../command-reference/dbt-commands/execute/overview.md) command executes one of the following [dbt commands](https://docs.getdbt.com/reference/dbt-commands) on a Snowflake dbt project object:

* [build](https://docs.getdbt.com/reference/commands/build)
* [compile](https://docs.getdbt.com/reference/commands/compile)
* [deps](https://docs.getdbt.com/reference/commands/deps)
* [list](https://docs.getdbt.com/reference/commands/list)
* [parse](https://docs.getdbt.com/reference/commands/parse)
* [retry](https://docs.getdbt.com/reference/commands/retry)
* [run](https://docs.getdbt.com/reference/commands/run)
* [run-operation](https://docs.getdbt.com/reference/commands/run-operation)
* [seed](https://docs.getdbt.com/reference/commands/seed)
* [show](https://docs.getdbt.com/reference/commands/show)
* [snapshot](https://docs.getdbt.com/reference/commands/snapshot)
* [test](https://docs.getdbt.com/reference/commands/test)

For more information about using dbt commands, see the [dbt Command reference](https://docs.getdbt.com/reference/dbt-commands).

The following examples illustrate how to use the `snow dbt execute` command:

* Execute the dbt `test` command:

  ```snowcli
  snow dbt execute jaffle_shop test
  ```
* Execute the `run` dbt command asynchronously:

  ```snowcli
  snow dbt execute --run-async jaffle_shop run --select @source:snowplow,tag:nightly models/export
  ```
* Execute the `run` dbt command with a specific dbt version:

  ```snowcli
  snow dbt execute jaffle_shop run --dbt-version '1.9.4'
  ```

## Describing a dbt project object

The [snow dbt describe](../command-reference/dbt-commands/describe.md) command describes a dbt project object on Snowflake.

The following example describes the dbt project object named `my_dbt_project` on Snowflake:

```console
snow dbt describe my_dbt_project
```

## Dropping a dbt project object

The [snow dbt drop](../command-reference/dbt-commands/drop.md) command deletes a dbt project object on Snowflake.

The following example deletes the dbt project object named `my_dbt_project` on Snowflake:

```console
snow dbt drop my_dbt_project
```

## Use `snow dbt` commands in a CI/CD workflow

> **Note:**
>
> When building CI/CD workflows, you only need your git server, such as Github, and Snowflake CLI. A Git repository object is not required.

You can run dbt commands with Snowflake CLI to build CI/CD pipelines. These pipelines are commonly used to test new code, such as new pull requests, or to update production applications whenever something is merged to the main branch.

To build a CI/CD workflow with `snow dbt` commands, follow these steps:

1. Prepare your dbt project:

   1. Download your dbt project or start a new one.

      * Ensure that the main project directory contains the `dbt_project.yml` and `profiles.yml` files.
      * Verify that the profile name referenced in `dbt_project.yml` is defined in `profiles.yml`.

        > **Note:**
        >
        > Snowflake’s dbt project objects don’t need passwords, so if `profiles.yml` contains any, deployment stops until
        > they are removed.
2. Set up Snowflake CLI GitHub Action.

   Follow the guidelines for [setting up GitHub Action for Snowflake CLI](../cicd/integrate-ci-cd.md) and [verify your connection](../connecting/configure-connections.md) to Snowflake.
3. Define your workflow.

   Determine which commands your workflow needs to run based on your organization’s needs. The following example illustrates a CI workflow that updates the version of the dbt project object named `product_pipeline` with new files, runs the transformations, and finally runs tests:

   ```yaml
   - name: Execute Snowflake CLI command
     run: |
       snow dbt deploy product_pipeline
       snow dbt execute product_pipeline run
       snow dbt execute product_pipeline test
   ```

---
title: Managing DCM projects using Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/data-pipelines/dcm-projects.md
section: Snowflake CLI
---

# Managing DCM projects using Snowflake CLI

You can manage a DCM project using Snowflake CLI. For more information about DCM projects, see [Snowflake DCM Projects](../../../user-guide/dcm-projects/dcm-projects-overview.md).

## Install Snowflake CLI with DCM project features

To use the `snow dcm` commands, you must install Snowflake CLI version 3.16.0 or later. For more information, see [Installing Snowflake CLI](../installation/installation.md).

## Verify you have a valid connection to Snowflake

Snowflake CLI requires a working connection to interact with Snowflake. For information about managing connections, see [Configuring Snowflake CLI](../connecting/configure-cli.md).

## Enable DCM commands in Snowflake CLI

To use DCM commands, you must enable the `SNOWFLAKE_CLI_FEATURES_ENABLE_SNOWFLAKE_PROJECTS` feature flag, using either of the following methods:

* Set the `SNOWFLAKE_CLI_FEATURES_ENABLE_SNOWFLAKE_PROJECTS` environment variable to `true` before running the command.
* Set the `enable_snowflake_projects` configuration option to `true` in the `config.toml` file, as shown in the following example:

  ```toml
  [cli.features]
  enable_snowflake_projects = true
  ```

## Initialize a DCM project from a template

* To initialize a DCM project from a template, use the `snow init` command:

  ```snowcli
  snow init <project_dir_name> --template dcm_project
  ```

  where `<project_dir_name>` is the directory with the DCM project files. This directory is created by the `snow init` command and is populated with the project files generated from the specified template.

  For example, the following command creates the project files in the `MY_PROJECT` directory:

  ```snowcli
  snow init MY_PROJECT --template dcm_project
  ```

## Snowflake CLI commands

To support DCM Projects, Snowflake CLI added the following commands:

* [snow dcm create](../command-reference/dcm-commands/create.md)
* [snow dcm deploy](../command-reference/dcm-commands/deploy.md)
* [snow dcm describe](../command-reference/dcm-commands/describe.md)
* [snow dcm drop](../command-reference/dcm-commands/drop.md)
* [snow dcm drop-deployment](../command-reference/dcm-commands/drop-deployment.md)
* [snow dcm list](../command-reference/dcm-commands/list.md)
* [snow dcm list-deployments](../command-reference/dcm-commands/list-deployments.md)
* [snow dcm plan](../command-reference/dcm-commands/plan.md)
* [snow dcm preview](../command-reference/dcm-commands/preview.md)
* [snow dcm refresh](../command-reference/dcm-commands/refresh.md)
* [snow dcm test](../command-reference/dcm-commands/test.md)

## Create and deploy DCM projects

This section describes how to create, validate, and deploy DCM projects using Snowflake CLI.

### Create a DCM project

Use the `snow dcm create` command to create a new DCM project in Snowflake. The project identifier can be specified directly as an argument or resolved from the `manifest.yml` file.

* Create a project using the identifier from the default target specified in the manifest :

  ```snowcli
  snow dcm create
  ```
* Create a project using the identifier from the `dev` target specified in the manifest:

  ```snowcli
  snow dcm create --target dev
  ```
* Create a project only if it does not already exist:

  ```snowcli
  snow dcm create --if-not-exists
  ```

For more information, see [snow dcm create](../command-reference/dcm-commands/create.md).

### Plan a DCM project

Use the `snow dcm plan` command to validate your project before deploying. This command shows what changes would be applied without actually making any modifications.

* Validate a project:

  ```snowcli
  snow dcm plan
  ```
* Validate with variable substitution:

  ```snowcli
  snow dcm plan -D "db_name=my_database" -D "schema_name=my_schema"
  ```
* Validate using a specific target profile and save the output:

  ```snowcli
  snow dcm plan --target dev --save-output
  ```

  When using `--save-output`, the command saves the response and artifacts to a local `out/` directory.

For more information, see [snow dcm plan](../command-reference/dcm-commands/plan.md).

### Deploy a DCM project

Use the `snow dcm deploy` command to apply changes defined in your DCM project to Snowflake..

* Deploy a project:

  ```snowcli
  snow dcm deploy
  ```
* Deploy with variable substitution:

  ```snowcli
  snow dcm deploy -D "table_name='MY_DB.PUBLIC.MY_TABLE'"
  ```
* Deploy with an alias for the deployment:

  ```snowcli
  snow dcm deploy --alias v1.0
  ```
* Deploy from a specific directory using a target profile:

  ```snowcli
  snow dcm deploy --from /path/to/project --target prod
  ```

For more information, see [snow dcm deploy](../command-reference/dcm-commands/deploy.md).

### Preview a DCM project

Use the `snow dcm preview` command to return rows from any table, view, or dynamic table defined in your project. This command is useful for testing your definitions before or after deployment.

* Preview data from a table:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_TABLE
  ```
* Preview with a row limit:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_VIEW --limit 10
  ```
* Preview with variable substitution:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_VIEW -D "filter_date='2024-01-01'"
  ```

For more information, see [snow dcm preview](../command-reference/dcm-commands/preview.md).

### Test a DCM project

Use the `snow dcm test` command to run all expectations (data metric functions) defined in your project. This command validates data quality rules and returns pass/fail results.

* Test a project:

  ```snowcli
  snow dcm test
  ```
* Test using a target profile:

  ```snowcli
  snow dcm test --target dev
  ```
* Test and save the results:

  ```snowcli
  snow dcm test --save-output
  ```

The command returns exit code 0 if all tests pass, or exit code 1 if any test fails.

For more information, see [snow dcm test](../command-reference/dcm-commands/test.md).

### Refresh a DCM project

Use the `snow dcm refresh` command to refresh all dynamic tables defined in your DCM project. This triggers an immediate refresh of the data.

* Refresh dynamic tables in a project:

  ```snowcli
  snow dcm refresh
  ```
* Refresh using a target profile:

  ```snowcli
  snow dcm refresh --target prod
  ```
* Refresh and save the output:

  ```snowcli
  snow dcm refresh --save-output
  ```

The command reports the status of each dynamic table, including the number of rows inserted and deleted.

For more information, see [snow dcm refresh](../command-reference/dcm-commands/refresh.md).

### Drop a DCM project

Use the `snow dcm drop` command to drop a DCM project. This command deletes the project and all its versions. The stage associated with the project is not deleted.

* Drop a project:

  ```snowcli
  snow dcm drop
  ```
* Drop a project only if it exists:

  ```snowcli
  snow dcm drop --if-exists
  ```

For more information, see [snow dcm drop](../command-reference/dcm-commands/drop.md).

## Manage deployed DCM projects

After deploying a DCM project, you can list and manage individual deployments.

### List deployed DCM projects

Use the `snow dcm list-deployments` command to list all deployments of a given DCM project.

* List deployments for a project:

  ```snowcli
  snow dcm list-deployments
  ```
* List deployments using a target profile:

  ```snowcli
  snow dcm list-deployments --target dev
  ```

The output shows the deployment name and alias (if set) for each deployment.

For more information, see [snow dcm list-deployments](../command-reference/dcm-commands/list-deployments.md).

### Drop deployed DCM projects

Use the `snow dcm drop-deployment` command to drop a specific deployment from a DCM project.

* Drop a deployment by name:

  ```snowcli
  snow dcm drop-deployment --deployment 'DEPLOYMENT$1'
  ```

  > **Note:**
  >
  > For deployment names containing `$`, use single quotes to prevent shell expansion.
* Drop a deployment by alias:

  ```snowcli
  snow dcm drop-deployment --deployment v1.0
  ```
* Drop a deployment only if it exists:

  ```snowcli
  snow dcm drop-deployment --deployment v1.0 --if-exists
  ```

For more information, see [snow dcm drop-deployment](../command-reference/dcm-commands/drop-deployment.md).

---
title: Managing Git repositories
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/overview.md
section: Snowflake CLI
---

# Managing Git repositories

You can integrate your remote Git repository with Snowflake so that files from the repository are synchronized to a special kind of stage called a *repository stage*. The repository stage acts as a local Git repository with a full clone of the remote repository, including branches, tags, and commits.

For more information, see [Using a Git repository in Snowflake](../../git/git-overview.md).

Snowflake CLI supports the following git operations:

* [Setting up a Git repository](setup-git.md)
* [Refreshing a repository](refresh-repo.md)
* [Listing the contents of a repository](list-contents.md)
* [Copying files in Git](copy-files.md)
* [Executing files from a repository](execute-sql.md)

---
title: Managing services
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/services/manage-services.md
section: Snowflake CLI
---

# Managing services

Snowpark Container Services enables you to easily deploy, manage, and scale containerized applications.
After you upload your application image to a repository in your account, you run your application containers
as a service or a job. This topic explains working with services.

A service is long-running, like a web service, and does not end on its own. Snowflake manages running services.
For example, if a service container exits, for whatever reason, Snowflake restarts that container so the service
runs uninterrupted. If your service needs more resources, such as more compute power, Snowflake provisions
additional nodes in the compute pool.

For more information about working with container services, see [Snowpark Container Services: Working with services](../../snowpark-container-services/working-with-services.md).

This topic shows how to do the following tasks with services:

* Create a Snowpark Container Services service
* Create and deploy a service from a project definition
* Suspend and resume a service
* Get status information about a service
* List the endpoints in a service
* Set and unset a service’s properties or parameters
* Display logs for a named service
* Upgrade a named service

For common operations, such as listing or dropping, Snowflake CLI uses `snow object` commands as described in [Managing Snowflake objects](../objects/manage-objects.md).

## Create a Snowpark Container Services service

A Snowpark container service requires the following:

* **A compute pool**: Snowflake runs your service in the specified compute pool.
* **A service specification file**: This specification gives Snowflake the information needed to configure and run
  your service.

To create a service, enter a [snow spcs service create](../command-reference/spcs-commands/service-commands/create.md) command similar to the following:

```snowcli
snow spcs service create "job_1" --compute-pool "pool_1" --spec-path "/some-dir/spec_file.yaml"
```

For more information, see [Managing Snowflake objects](../objects/manage-objects.md).

### Create and deploy a service from a project definition

You can create a service from a `snowflake.yml` project definition file and then executing the `snow spcs service deploy` command.

The following shows a sample `snowflake.yml` project definition file:

```yaml
definition_version: 2
entities:
  my_service:
    type: service
    identifier: my_service
    stage: my_stage
    compute_pool: my_compute_pool
    spec_file: spec.yml
    min_instances: 1
    max_instances: 2
    query_warehouse: my_warehouse
    auto_resume: true
    external_access_integrations:
      - my_external_access
    secrets:
        cred: my_cred_name
    artifacts:
      - spec.yml
    comment: "My service"
    tags:
      - name: test_tag
        value: test_value
```

The following table describes the properties of a compute pool project definition.

Compute pool project definition properties

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `service`. |
| **stage**  *required*, *string* | Stage where the service specification file is located. |
| **compute_pool**  *required*, *string* | Compute pool where the service runs. |
| **spec_file**  *required*, *string* | Path to service specification file on the stage. |
| **identifier**  *optional*, *string* | Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-service   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (for example, `’”My Image Repository"`). * Object  ```yaml   identifier:     name: my-service     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully qualified name in the `name` property (such as `mydb.schema1.my-app`). |
| **min_instances**  *optional*, *string* | Minimum number of service instances to run.  Default: `1` |
| **max_instances**  *optional*, *string* | Maximum number of service instances to run. |
| **query_warehouse**  *optional*, *string* | Warehouse to use if a service container connects to Snowflake to execute a query without explicitly specifying a warehouse to use. |
| **auto_resume**  *optional*, *string* | Whether to automatically resume when a service function or ingress is called.  Default: `True` |
| **external_access_integrations**  *optional*, *string sequence* | Names of external access integrations needed for this entity to access external networks. |
| **secrets**  *optional*, *dictionary* | Names and values of secrets variables so that you can use the variables to reference the secrets. |
| **artifacts**  *optional*, *string sequence* | List of file source and destination pairs to add to the deploy root. You can use the following artifact properties:   * `src`: Path to the code source file or files * `dest`: Path to the directory to deploy the artifacts.  Destination paths that reference directories must end with a `/`. A glob pattern’s destination that does not end with a `/` results in an error. If omitted, `dest` defaults to the same string as `src`.  You can also pass in a string for each item instead of a `dict`, in which case the value is treated as both `src` and `dest`.   If `src` refers to just one file (not a glob), `dest` can refer to a target `<path>` or a `<path/name>`.  You can also pass in a string for each item instead of a `dict`, in which case the value is treated as both `src` and `dest`. |
| **comment**  *optional*, *string* | Comments to associate with the compute pool. |
| **tags**  *optional*, *Tag sequence* | Tag names and values for the compute pool. For more information, see [Tag quotas](../../../user-guide/object-tagging/introduction.md) |

To create and deploy a service, do the following:

1. Change your current directory to the directory containing the project definition file.
2. Run a `snow spcs service deploy` command similar to the following:

   ```snowcli
   snow spcs service deploy
   ```

   ```output
   +---------------------------------------------------------------------+
   | key    | value                                                      |
   |--------+------------------------------------------------------------|
   | status | Service MY_SERVICE successfully created.                   |
   +---------------------------------------------------------------------+
   ```

## Suspend and resume a service

To suspend a named service, enter a [snow spcs service suspend](../command-reference/spcs-commands/service-commands/suspend.md) command similar to the following:

```snowcli
snow spcs service suspend echo_service
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

To resume a suspended service, enter a [snow spcs service resume](../command-reference/spcs-commands/service-commands/resume.md) command similar to the following:

```snowcli
snow spcs service resume echo_service
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

## Get status information about a service

> **Note:**
>
> The current role must have MONITOR privilege on the service to get its status.

### List all services

The [snow spcs service list](../command-reference/spcs-commands/service-commands/list.md) command returns an overview of all services, including the runtime state of the services, such as PENDING or RUNNING, and the upgrading status. To get the status of all services, enter a command similar to the following:

```snowcli
snow spcs service list
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|        |        |        |        |        |        |        |        |        |        |        |         | extern |         |        |         |        |         |        |        |         |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         | al_acc |         |        |         |        |         |        |        |         |        | managin | managi |
|        |        | databa |        |        |        |        | curren | target | min_in | max_in |         | ess_in |         |        |         |        | owner_r | query_ |        |         |        | g_objec | ng_obj |
|        |        | se_nam | schema |        | comput | dns_na | t_inst | _insta | stance | stance | auto_re | tegrat | created | update | resumed | commen | ole_typ | wareho |        | spec_di | is_upg | t_domai | ect_na |
| name   | status | e      | _name  | owner  | e_pool | me     | ances  | nces   | s      | s      | sume    | ions   | _on     | d_on   | _on     | t      | e       | use    | is_job | gest    | rading | n       | me     |
|--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+---------+--------+---------+--------+---------+--------+---------+--------+--------+---------+--------+---------+--------|
| ECHO_S | RUNNIN | TEST00 | TEST_S | SYSADM | TUTORI | echo-s | 1      | 1      | 1      | 1      | true    | None   | 2024-10 | 2024-1 | None    | This   | ROLE    | COMPUT | false  | 52e62d1 | false  | None    | None   |
| ERVICE | G      | _DB    | CHEMA  | IN     | AL_COM | ervice |        |        |        |        |         |        | -16     | 0-16   |         | is a   |         | E_WH   |        | f19c720 |        |         |        |
|        |        |        |        |        | PUTE_P | .imhd. |        |        |        |        |         |        | 15:09:3 | 15:09: |         | test   |         |        |        | 6b5f4ef |        |         |        |
|        |        |        |        |        | OOL    | svc.sp |        |        |        |        |         |        | 0.49300 | 31.905 |         | servic |         |        |        | c069557 |        |         |        |
|        |        |        |        |        |        | cs.int |        |        |        |        |         |        | 0-07:00 | 000-07 |         | e      |         |        |        | 8b6c2b3 |        |         |        |
|        |        |        |        |        |        | ernal  |        |        |        |        |         |        |         | :00    |         |        |         |        |        | 806ad76 |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 67d78cc |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | ce8b6ed |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 6501a8a |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 3       |        |         |        |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

### Get the status of a named service

To get the status of an individual service, enter a [snow spcs service describe](../command-reference/spcs-commands/service-commands/describe.md) command similar to the following:

```snowcli
snow spcs service describe echo_service
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|        |        |        |        |        |        |        |        |        |        |        |         | extern |         |        |         |        |         |        |        |         |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         | al_acc |         |        |         |        |         |        |        |         |        | managin | managi |
|        |        | databa |        |        |        |        | curren | target | min_in | max_in |         | ess_in |         |        |         |        | owner_r | query_ |        |         |        | g_objec | ng_obj |
|        |        | se_nam | schema |        | comput | dns_na | t_inst | _insta | stance | stance | auto_re | tegrat | created | update | resumed | commen | ole_typ | wareho |        | spec_di | is_upg | t_domai | ect_na |
| name   | status | e      | _name  | owner  | e_pool | me     | ances  | nces   | s      | s      | sume    | ions   | _on     | d_on   | _on     | t      | e       | use    | is_job | gest    | rading | n       | me     |
|--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+---------+--------+---------+--------+---------+--------+---------+--------+--------+---------+--------+---------+--------|
| ECHO_S | RUNNIN | TEST00 | TEST_S | SYSADM | TUTORI | echo-s | 1      | 1      | 1      | 1      | true    | None   | 2024-10 | 2024-1 | None    | This   | ROLE    | COMPUT | false  | 52e62d1 | false  | None    | None   |
| ERVICE | G      | _DB    | CHEMA  | IN     | AL_COM | ervice |        |        |        |        |         |        | -16     | 0-16   |         | is a   |         | E_WH   |        | f19c720 |        |         |        |
|        |        |        |        |        | PUTE_P | .imhd. |        |        |        |        |         |        | 15:09:3 | 15:09: |         | test   |         |        |        | 6b5f4ef |        |         |        |
|        |        |        |        |        | OOL    | svc.sp |        |        |        |        |         |        | 0.49300 | 31.905 |         | servic |         |        |        | c069557 |        |         |        |
|        |        |        |        |        |        | cs.int |        |        |        |        |         |        | 0-07:00 | 000-07 |         | e      |         |        |        | 8b6c2b3 |        |         |        |
|        |        |        |        |        |        | ernal  |        |        |        |        |         |        |         | :00    |         |        |         |        |        | 806ad76 |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 67d78cc |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | ce8b6ed |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 6501a8a |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 3       |        |         |        |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

### List instances and containers

You can list service’s instances and containers with the `snow spcs service list-instances` and `snow spcs service list-containers` commands, respectively.

To get the list of instances in the `echo_service` service, enter the following [snow spcs service list-instances](../command-reference/spcs-commands/service-commands/list-instances.md) command:

```snowcli
snow spcs service list-instances echo_service
```

```output
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| database_name | schema_name | service_name | instance_id | status | spec_digest                                                      | creation_time        | start_time           |
|---------------+-------------+--------------+-------------+--------+------------------------------------------------------------------+----------------------+----------------------|
| TEST00_DB     | TEST_SCHEMA | ECHO_SERVICE | 0           | READY  | 336c065739dd2b96e770f01804affdc7810e6df68a23b23052d851627abfbdf9 | 2024-10-10T06:06:30Z | 2024-10-10T06:06:30Z |
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

To get the list of containers in the `echo_service` service, enter the following [snow spcs service list-containers](../command-reference/spcs-commands/service-commands/list-containers.md) command:

```snowcli
snow spcs service list-containers echo_service
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| database_name | schema_name | service_name | instance_id | container_name | status | message | image_name                                | image_digest                              | restart_count | start_time           |
|---------------+-------------+--------------+-------------+----------------+--------+---------+-------------------------------------------+-------------------------------------------+---------------+----------------------|
| TEST00_DB     | TEST_SCHEMA | ECHO_SERVICE | 0           | main           | READY  | Running | org-test-account-00.registry.registry.sno | sha256:06c3d54edc24925abe398eda70d37eb6b8 | 0             | 2024-10-16T22:09:35Z |
|               |             |              |             |                |        |         | wflakecomputing.com/test00_db/test_schema | 7b1c4dd6211317592764e1e7d94498            |               |                      |
|               |             |              |             |                |        |         | /test00_repo/echo_service:latest          |                                           |               |                      |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

### List the endpoints in a service

To list the endpoints a named service, enter a [snow spcs service list-endpoints](../command-reference/spcs-commands/service-commands/list-endpoints.md) command similar to the following:

```snowcli
snow spcs service list-endpoints echo_service
```

```output
+--------------+------+----------+-----------------+-----------------------------------------+
| name         | port | protocol | ingress_enabled | ingress_url                             |
|--------------+------+----------+-----------------+-----------------------------------------|
| echoendpoint | 8000 | TCP      | true            | org-id-acct-id.snowflakecomputing.app   |
+--------------+------+----------+-----------------+-----------------------------------------+
```

### List the service roles associated with a service

You can manage access to individual endpoints exposed by a service by defining service roles and permissions in the service specification. For more information about how to use service roles, see [GRANT SERVICE ROLE](../../../sql-reference/sql/grant-service-role.md).

To get a list of service roles created for a service, use the [snow spcs service list-roles](../command-reference/spcs-commands/service-commands/list-roles.md) command, as shown:

```snowcli
snow spcs service list-roles my_service
```

```output
+------------------------------------------------------------------+
| created_on                       | name                | comment |
|----------------------------------+---------------------+---------|
| 2024-10-09 16:48:52.980000-07:00 | ALL_ENDPOINTS_USAGE | None    |
+------------------------------------------------------------------+
```

## Set and unset a service’s properties or parameters

> **Note:**
>
> The current role must have OPERATE privilege on the service to set properties.

To set a service’s property or parameter, enter a [snow spcs service set](../command-reference/spcs-commands/service-commands/set.md) command similar to the following:

```snowcli
snow spcs service set echo_service --min-instances 2 --max-instances 4
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

To reset a service’s property or parameter to its default value, enter a command similar to the following:

```snowcli
snow spcs compute-pool unset tutorial_compute_pool --auto-resume
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

## Display logs for a named service

> **Note:**
>
> The current role must have MONITOR privilege on the service to display logs.

To display local logs for a named service, enter a [snow spcs service logs](../command-reference/spcs-commands/service-commands/logs.md) command similar to the following:

```snowcli
snow spcs service logs "service_1" --container-name "container_1" --instance-id "0"
```

## Upgrade a named service

> **Note:**
>
> The current role must have OPERATE privilege on the service to upgrade it.

To upgrade a named service, enter a [snow spcs service upgrade](../command-reference/spcs-commands/service-commands/upgrade.md) command similar to the following:

```snowcli
snow spcs service upgrade echo_service --spec-path spec.yml
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: Managing Snowflake connections
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-connections.md
section: Snowflake CLI
---

# Managing Snowflake connections

Before you can use Snowflake CLI, you must define connections, which specify how Snowflake CLI connects to Snowflake. Snowflake CLI uses the following precedence hierarchy to determine which value to use when a connection parameter is defined in multiple locations:

* Command-line parameters
* Environment variables overriding specific `config.toml` parameters, such as `SNOWFLAKE_CONNECTIONS_MYCONNECTION_PASSWORD`
* Connections defined in `config.toml` file manually or using `snow connection add` command
* Generic environment variables, such as `SNOWFLAKE_USER`.

You can also use the `--temporary-connection` option, which does not require defining it in `config.toml`.

> **Caution:**
>
> For improved security, Snowflake strongly recommends using either `SNOWFLAKE_CONNECTIONS_<NAME>_PASSWORD` or `SNOWFLAKE_PASSWORD` environment variable.

## Define connections

Connection definitions are stored in the `[connections]` section of the `config.toml` file, similar to the following block of code:

```toml
[connections.myconnection]
account = "myaccount"
user = "jondoe"
password = "password"
warehouse = "my-wh"
database = "my_db"
schema = "my_schema"
```

Connection definitions support the same configuration options as the [Snowflake Connector for Python](../../python-connector/python-connector-api.md).
Additionally, you can specify a default connection in the `default_connection_name` variable at the top of the file. You cannot include it
within a connection definition. For example:

```toml
default_connection_name = "myconnection"

[connections.myconnection]
account = "myaccount"
...
```

> **Note:**
>
> For MacOS and Linux systems, Snowflake CLI requires the `config.toml` file to limit its file permissions to read and write for the file owner only. To set the file required file permissions execute the following commands:
>
> ```snowcli
> chown $USER config.toml
> chmod 0600 config.toml
> ```

### Alternative configuration file

> **Note:**
>
> For Snowflake CLI, Snowflake recommends that you use the `config.toml` file for configuration definitions. However, you can use the `connections.toml` file, if desired.

Snowflake CLI also supports the `connections.toml` configuration file. The file should be placed in the same directory as the `config.toml` file, and it should contain only connections.
Configurations in `connections.toml` require a different section name, without `connections`. For example, `[connections.myconnection]` would be just `[myconnection]`.

> **Note:**
>
> If both the `config.toml` and `connections.toml` configurations contain connections, Snowflake CLI uses only configurations from `connections.toml`.

## Manage or add your connections to Snowflake with the `snow connection` commands

The `snow connection` commands let you create, manage, and test Snowflake connections.

### Add a connection

> **Note:**
>
> If you need to add a connection for Snowflake Open Catalog, see [Create a Snowflake CLI connection for Open Catalog](https://other-docs.snowflake.com/opencatalog/sso-configure-open-catalog#create-a-snowflake-cli-connection-for-open-catalog) in the
> Open Catalog documentation. You might need to add this connection for tasks like configuring Open Catalog to use SSO.

To create a new connection and add it to the [configuration file](configure-cli.md), do the following:

1. Execute the `snow connection add` command:

   > ```snowcli
   > snow connection add
   > ```
2. When prompted, supply the required connection, account, and username parameters, as well as any other desired optional parameters. Note that additional parameters might be required depending on the authentication method you choose.

   > ```output
   > Enter connection name: <connection_name>
   > Enter account: <account>
   > Enter user: <user-name>
   > Enter password: <password>
   > Enter role: <role-name>
   > Enter warehouse: <warehouse-name>
   > Enter database: <database-name>
   > Enter schema: <schema-name>
   > Enter host: <host-name>
   > Enter port: <port-number>
   > Enter region: <region-name>
   > Enter authenticator: <authentication-method>
   > Enter workload identity provider: <workload-identity-provider>
   > Enter private key file: <path-to-private-key-file>
   > Enter token file path: <path-to-mfa-token>
   > Wrote new connection <connection-name> to config.toml
   > ```

You can also add values for specific parameters on the command line, as shown:

```snowcli
snow --config-file config.toml connection add -n myconnection2 --account myaccount2 --user jdoe2
```

> **Note:**
>
> If the command finishes with an error, such as if the `--private_key_file` option references a non-existing file, the connection is not saved in the `config.toml` configuration file.

By default, the `snow connection add` command prompts for optional parameters if they are not specified on the command line. If you want to add connections without specifying some optional parameter, like `account`, and skip the interactive prompts, you can use the `--no-interactive` option, as shown:

```snowcli
snow connection add -n myconnection2 --user jdoe2 --no-interactive
```

After adding a connection, you can test the connection to make sure it works correctly.

### List defined connections

To list the available connections, enter the `snow connection list` command, as shown:

```snowcli
snow connection list
```

```output
+-------------------------------------------------------------------------------------------------+
| connection_name | parameters                                                       | is_default |
|-----------------+------------------------------------------------------------------+------------|
| myconnection    | {'account': 'myaccount', 'user': 'jondoe', 'password': '****',   | False      |
|                 | 'database': 'my_db', 'schema': 'my_schema', 'warehouse':         |            |
|                 | 'my-wh'}                                                         |            |
| myconnection2   | {'account': 'myaccount2', 'user': 'jdoe2'}                       | False      |
+-------------------------------------------------------------------------------------------------+
```

### Test and diagnose a connection

To test whether a connection can successfully connect to Snowflake, enter the `snow connection test` command, similar to the following:

```snowcli
snow connection test -c myconnection2
```

```output
+--------------------------------------------------+
| key             | value                          |
|-----------------+--------------------------------|
| Connection name | myconnection2                  |
| Status          | OK                             |
| Host            | example.snowflakecomputing.com |
| Account         | myaccount2                     |
| User            | jdoe2                          |
| Role            | ACCOUNTADMIN                   |
| Database        | not set                        |
| Warehouse       | not set                        |
+--------------------------------------------------+
```

If you encounter connectivity issues, you can run diagnostics directly within Snowflake CLI. Snowflake Support might also request this information to help you with connectivity issues.

The diagnostics collection uses the following `snow connection test` command options:

* `--enable-diag` to generate a diagnostic report.
* `--diag-log-path` to specify the absolute path for the generated report.
* `--diag-allowlist-path` to specify the absolute path to a JSON file containing the output of the SYSTEM$ALLOWLIST() or SYSTEM$ALLOWLIST_PRIVATELINK() SQL commands. This option is required only if the user defined in the connection does not have permission to run the system allowlist functions or if connecting to the account URL fails.

The following example generates a diagnostic report for the `myconnection2` connection and stores in the `~/report/SnowflakeConnectionTestReport.txt` file:

```snowcli
snow connection test -c myconnection2 --enable-diag --diag-log-path $(HOME)/report
```

```output
+----------------------------------------------------------------------------+
| key                  | value                                               |
|----------------------+-----------------------------------------------------|
| Connection name      | myconnection2                                       |
| Status               | OK                                                  |
| Host                 | example.snowflakecomputing.com                      |
| Account              | myaccount2                                          |
| User                 | jdoe2                                               |
| Role                 | ACCOUNTADMIN                                        |
| Database             | not set                                             |
| Warehouse            | not set                                             |
| Diag Report Location | /Users/<username>/SnowflakeConnectionTestReport.txt |
+----------------------------------------------------------------------------+
```

You can review the report for any connectivity issues and discuss them with your network team. You can also provide the report to Snowflake Support for additional assistance.

### Remove a connection

You can use the `snow connection remove` command to delete a specific connection, similar to the following:

```snowcli
snow connection remove bad_connection
```

```output
Removed connection bad_connection from /Users/jdoe/.snowflake/config.toml.
```

### Set the default connection

You can use the `snow connection set-default` command to specify which configuration Snowflake CLI should use as the default, overriding the `default_connection_name`
configuration file and `SNOWFLAKE_DEFAULT_CONNECTION_NAME` variables, if set.

The following example sets the default connection to `myconnection2`:

```snowcli
snow connection set-default myconnection2
```

```output
Default connection set to: myconnection2
```

> **Note:**
>
> If both `connections.toml` and `config.toml` files are present, Snowflake CLI uses only connections defined in `connections.toml`.

### Use environment variables for Snowflake credentials

You can specify Snowflake credentials in system environment variables instead of
in configuration files. You can use the following generic environment variables only to specify connection parameters:

* `SNOWFLAKE_ACCOUNT`
* `SNOWFLAKE_USER`
* `SNOWFLAKE_PASSWORD`
* `SNOWFLAKE_DATABASE`
* `SNOWFLAKE_SCHEMA`
* `SNOWFLAKE_ROLE`
* `SNOWFLAKE_WAREHOUSE`
* `SNOWFLAKE_AUTHENTICATOR`
* `SNOWFLAKE_PRIVATE_KEY_PATH`
* `SNOWFLAKE_PRIVATE_KEY_RAW`
* `SNOWFLAKE_SESSION_TOKEN`
* `SNOWFLAKE_MASTER_TOKEN`
* `SNOWFLAKE_TOKEN`
* `SNOWFLAKE_TOKEN_FILE_PATH`
* `SNOWFLAKE_OAUTH_CLIENT_ID`
* `SNOWFLAKE_OAUTH_CLIENT_SECRET`
* `SNOWFLAKE_OAUTH_AUTHORIZATION_URL`
* `SNOWFLAKE_OAUTH_TOKEN_REQUEST_URL`
* `SNOWFLAKE_OAUTH_REDIRECT_URI`
* `SNOWFLAKE_OAUTH_SCOPE`
* `SNOWFLAKE_OAUTH_DISABLE_PKCE`
* `SNOWFLAKE_OAUTH_ENABLE_REFRESH_TOKENS`
* `SNOWFLAKE_OAUTH_ENABLE_SINGLE_USE_REFRESH_TOKENS`
* `SNOWFLAKE_CLIENT_STORE_TEMPORARY_CREDENTIAL`
* `SNOWFLAKE_WORKLOAD_IDENTITY_PROVIDER`

### Pass connection parameters to the `snow` command

You can pass connection parameters directly in every `snow` command that requires a connection. For a full list of connection configuration parameters, execute the `snow sql --help` command, as shown. Note that the output shows only the section with the connection configuration options.

```snowcli
snow sql --help
```

```output
╭─ Connection configuration ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
│ --connection,--environment             -c      TEXT     Name of the connection, as defined in your config.toml. Default: default.
│ --host                                         TEXT     Host address for the connection. Overrides the value specified for the connection.
│ --port                                         INTEGER  Port for the connection. Overrides the value specified for the connection.
│ --account,--accountname                        TEXT     Name assigned to your Snowflake account. Overrides the value specified for the connection.
│ --user,--username                              TEXT     Username to connect to Snowflake. Overrides the value specified for the connection.
│ --password                                     TEXT     Snowflake password. Overrides the value specified for the connection.
│ --authenticator                                TEXT     Snowflake authenticator. Overrides the value specified for the connection.
│ --private-key-file,--private-key-path          TEXT     Snowflake private key file path. Overrides the value specified for the connection.
│ --token                                        TEXT     OAuth token to use when connecting to Snowflake.
│ --token-file-path                              TEXT     Path to file with an OAuth token that should be used when connecting to Snowflake.
│ --database,--dbname                            TEXT     Database to use. Overrides the value specified for the connection.
│ --schema,--schemaname                          TEXT     Database schema to use. Overrides the value specified for the connection.
│ --role,--rolename                              TEXT     Role to use. Overrides the value specified for the connection.
│ --warehouse                                    TEXT     Warehouse to use. Overrides the value specified for the connection.
│ --temporary-connection                 -x               Uses connection defined with command-line parameters, instead of one defined in config.
│ --mfa-passcode                                 TEXT     Token to use for multi-factor authentication (MFA).
│ --oauth-client-id                              TEXT     Value of the client ID provided by the identity provider for Snowflake integration.
│ --oauth-client-secret                          TEXT     Value of the client secret provided by the identity provider for Snowflake integration.
│ --oauth-authorization-url                      TEXT     Identity provider endpoint supplying the authorization code to the driver.
│ --oauth-token-request-url                      TEXT     Identity provider endpoint supplying the access tokens to the driver.
│ --oauth-redirect-uri                           TEXT     URI to use for the authorization code.
│ --oauth-scope                                  TEXT     Scope requested in the identity provider authorization request.
│ --oauth-disable-pkce                                    Disables Proof Key for Code Exchange (PKCE). Default: False.
│ --oauth-enable-refresh-tokens                           Enables a silent re-authentication when the actual access token becomes outdated. Default: False.
│ --oauth-enable-single-use-refresh-tokens                Whether to opt in to single-use refresh token semantics. Default: False.
│ --client-store-temporary-credential                     Store the temporary credential.
│ --enable-diag                                           Run the python connector diagnostic test.
│ --diag-log-path                                TEXT     Diagnostic report path.
│ --diag-allowlist-path                          TEXT     Diagnostic report path to optional allowlist.
╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
```

> **Caution:**
>
> For improved security, Snowflake strongly recommends using either `SNOWFLAKE_CONNECTIONS_<NAME>_PASSWORD` or `SNOWFLAKE_PASSWORD` environment variable.

## Import connections from SnowSQL

Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in addition to SQL operations. Snowflake CLI is a more modern, robust, and efficient CLI client than legacy SnowSQL. In addition to executing SQL commands with Snowflake CLI, you can also execute commands for other Snowflake products like Streamlit in Snowflake, Snowpark Container Services, and Snowflake Native App Framework. Because new features and enhancements will be added only to Snowflake CLI, Snowflake recommends that you begin transitioning from SnowSQL to Snowflake CLI.

To import any existing connections defined in [SnowSQL](../../../user-guide/snowsql.md) into your Snowflake CLI `config.toml` configuration file, use the `snow helpers import-snowsql-connections` command.

To import SnowSQL connections, enter the `snow helpers import-snowsql-connections` command similar to the following code block that imports SnowSQL connections from the standard configuration file locations:

```snowcli
snow helpers import-snowsql-connections
```

As the command processes the SnowSQL configuration files, it shows the progress and prompts for confirmation when a connection with the same name is already defined in the Snowflake CLI `config.toml` file:

```output
SnowSQL config file [/etc/snowsql.cnf] does not exist. Skipping.
SnowSQL config file [/etc/snowflake/snowsql.cnf] does not exist. Skipping.
SnowSQL config file [/usr/local/etc/snowsql.cnf] does not exist. Skipping.
Trying to read connections from [/Users/<user>/.snowsql.cnf].
Reading SnowSQL's connection configuration [connections.connection1] from [/Users/<user>/.snowsql.cnf]
Trying to read connections from [/Users/<user>/.snowsql/config].
Reading SnowSQL's default connection configuration from [/Users/<user>/.snowsql/config]
Reading SnowSQL's connection configuration [connections.connection1] from [/Users/<user>/.snowsql/config]
Reading SnowSQL's connection configuration [connections.connection2] from [/Users/<user>/.snowsql/config]
Connection 'connection1' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: Y
Connection 'connection2' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: n
Connection 'default' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: n
Saving [connection1] connection in Snowflake CLI's config.
Connections successfully imported from SnowSQL to Snowflake CLI.
```

For more information about this command, see the [snow helpers import-snowsql-connections](../command-reference/helpers-commands/import-snowsql-connections.md) command reference.

For help with migrating from SnowSQL to Snowflake CLI, see [Migrating from SnowSQL to Snowflake CLI](../../../user-guide/snowsql-migrate.md).

## Use a temporary connection

You can also specify connection parameters from the command line using the `--temporary-connection [-x]` option. It ignores all definitions from the `config.toml`, using ones specified by command-line options instead. This approach can be helpful for CI/CD use cases when you don’t want to use a configuration file. When you use a temporary connection, Snowflake CLI ignores any connection variables defined in the `config.toml` file, but does still use any of the following environment variables you set:

* `SNOWFLAKE_ACCOUNT`
* `SNOWFLAKE_USER`
* `SNOWFLAKE_PASSWORD`
* `SNOWFLAKE_DATABASE`
* `SNOWFLAKE_SCHEMA`
* `SNOWFLAKE_ROLE`
* `SNOWFLAKE_WAREHOUSE`
* `SNOWFLAKE_AUTHENTICATOR`
* `SNOWFLAKE_PRIVATE_KEY_FILE`
* `SNOWFLAKE_PRIVATE_KEY_RAW`
* `SNOWFLAKE_PRIVATE_KEY_PATH`
* `SNOWFLAKE_SESSION_TOKEN`
* `SNOWFLAKE_MASTER_TOKEN`
* `SNOWFLAKE_TOKEN_FILE_PATH`
* `WORKLOAD_IDENTITY_PROVIDER`

The following example shows how to create a temporary connection using a username and password. This example assumes you stored the password in the `SNOWFLAKE_PASSWORD` environment variable.

```snowcli
snow sql -q "select 42;" --temporary-connection \
                           --account myaccount \
                           --user jdoe
```

```output
select 42;
+----+
| 42 |
|----|
| 42 |
+----+
```

> **Caution:**
>
> For improved security, Snowflake strongly recommends using either `SNOWFLAKE_CONNECTIONS_<NAME>_PASSWORD` or `SNOWFLAKE_PASSWORD` environment variable.

For additional security, you can use a private key file and store the path to your private key file in the `SNOWFLAKE_PRIVATE_KEY_FILE` environment variable, as shown:

```bash
SNOWFLAKE_ACCOUNT = "account"
SNOWFLAKE_USER = "user"
SNOWFLAKE_PRIVATE_KEY_FILE = "/path/to/key.p8"
```

You can then create a temporary connection without specifying the options, as shown:

```snowcli
snow sql -q "select 42" --temporary-connection
```

```output
select 42;
+----+
| 42 |
|----|
| 42 |
+----+
```

When using CI/CD pipelines with key pair authentication, you might not be able to access local private key files (`SNOWFLAKE_PRIVATE_KEY_FILE`). In this situation, you can store the private key in the `SNOWFLAKE_PRIVATE_KEY_RAW` environment variable, as shown:

```bash
SNOWFLAKE_ACCOUNT = "account"
SNOWFLAKE_USER = "user"
SNOWFLAKE_PRIVATE_KEY_RAW = "-----BEGIN PRIVATE KEY-----..."
```

You can then create a temporary connection without specifying the options, as shown:

```snowcli
snow sql -q "select 42" --temporary-connection
```

```output
select 42;
+----+
| 42 |
|----|
| 42 |
+----+
```

> **Note:**
>
> If you use the `SNOWFLAKE_PRIVATE_KEY_RAW` environment variable, you should not also define `SNOWFLAKE_PRIVATE_KEY_FILE`.

## Additional ways to authenticate your connection

You can also use the following methods to authenticate your connection to Snowflake:

* Use a private key file for authentication
* Use OAuth authentication
* Use the OAuth 2.0 Authorization Code flow
* Use the OAuth 2.0 Client Credentials flow
* Use multi-factor authentication (MFA)
* Use MFA caching
* Use SSO (single sign-on)
* Use an external browser
* Use PAT (Programmatic Access Token)
* Use workload identity federation (WIF)

### Use a private key file for authentication

To use private key file for authentication, your connection configuration requires you to set the `authenticator`
parameter to `SNOWFLAKE_JWT` and provide path to file with your private key similar to the following:

* Specify the `--private_key-file` option in the `snow connection add` command, as shown:

  > ```snowcli
  > snow connection add \
  >    --connection-name jwt \
  >    --authenticator SNOWFLAKE_JWT \
  >    --private-key-file ~/.ssh/sf_private_key.p8
  > ```
* Use the configuration file:

  > ```toml
  > [connections.jwt]
  > account = "my_account"
  > user = "jdoe"
  > authenticator = "SNOWFLAKE_JWT"
  > private_key_file = "~/sf_private_key.p8"
  > ```

For more details on configuring key pair authentication, see [Key-pair authentication and key-pair rotation](../../../user-guide/key-pair-auth.md).

Snowflake CLI looks for the private key in the connection parameters in the following order:

1. If `private_key_file` is specified, Snowflake CLI reads the key from the specified file path.
2. If `private_key_path` is specified, Snowflake CLI reads the key from the specified file path.
3. If `private_key_file` or `private_key_path` are not specified, Snowflake CLI reads the key directly from the `private_key_raw` parameter.

> **Caution:**
>
> If you specify your private key in the `private_key_raw` parameter,
> Snowflake recommends using either the `SNOWFLAKE_CONNECTIONS_<NAME>_PRIVATE_KEY_RAW`
> or the `SNOWFLAKE_PRIVATE_KEY_RAW` environment variables for improved security.

> **Note:**
>
> If your private key is passphrase-protected, set the `PRIVATE_KEY_PASSPHRASE` environment variable to that passphrase.

### Use OAuth authentication

To use connect using OAuth, you can do either of the following:

* Specify the `--token-file-path` option in the `snow connection add` command, as shown:

  ```snowcli
  snow connection add --token-file-path "my-token.txt"
  ```
* In the `config.toml` file, set `authenticator = "OAUTH"`, and add the `token_file_path` parameter to the connection definition, as shown:

  ```toml
  [connections.oauth]
  account = "my_account"
  user = "jdoe"
  authenticator = "OAUTH"
  token_file_path = "my-token.txt"
  ```

### Use the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials. For more information about this flow and its parameters, see [Enable the OAuth 2.0 Authorization Code flow](../../python-connector/python-connector-connect.md) in the Snowflake Connector for Python documentation.

To use the OAuth 2.0 Authorization Code flow, add a connection definition to your `config.toml` file similar to the following:

```toml
[connections.oauth]
authenticator = "OAUTH_AUTHORIZATION_CODE"
user = "user"
account = "account"
oauth_client_id = "client_id"
oauth_client_secret = "client_secret"
oauth_redirect_uri = "http://localhost:8001/snowflake/oauth-redirect"
oauth_scope = "session:role:PUBLIC"
```

### Use the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data. For more information about this flow and its parameters, see [Enable the OAuth 2.0 Client Credentials flow](../../python-connector/python-connector-connect.md) in the Snowflake Connector for Python documentation.

To use the OAuth 2.0 Client Credentials flow, add a connection definition to your `config.toml` file similar to the following:

```toml
[connections.oauth]
authenticator = "OAUTH_CLIENT_CREDENTIALS"
user = "user"
account = "account"
oauth_client_id = "client_id"
oauth_client_secret = "client_secret"
oauth_token_request_url = "http://identity.provider.com/token"
oauth_scope = "session:role:PUBLIC"
```

### Use multi-factor authentication (MFA)

To use MFA:

1. Set up [multi-factor authentication](../../../user-guide/security-mfa.md) in Snowflake and set the `authenticator` parameter to `SNOWFLAKE` (which is a default value).
2. If you want to use a Duo-generated passcode instead of the push mechanism, use either the `--mfa-passcode <passcode>` option or set `passcode_in_password = true` in the `config.toml` file and include the passcode in your password as described in [Using MFA with Python](../../../user-guide/security-mfa.md).

   > **Note:**
   >
   > If you want use the passcode in the password for authentication, after executing the first `snow` command, you can no longer provide the passcode as long as the token in valid. You must do the following:
   >
   > * Remove the passcode from the password.
   > * Remove or comment the `passcode_in_password = true` in the `config.toml` file.

### Use MFA caching

MFA caching is a security feature that reduces the frequency of Multi-Factor Authentication (MFA) prompts during logins. Frequent MFA prompts can disrupt workflow and decrease productivity. MFA caching addresses this issue by securely storing MFA session information for a specified period. Using MFA caching lets you authenticate without repeatedly entering MFA codes, as long as they are within the cached session’s timeframe.

To enable MFA caching:

1. For your account, set `ALLOW_CLIENT_MFA_CACHING = true`.
2. In your `config.toml` file, add `authenticator = "USERNAME_PASSWORD_MFA"` to your connection.

For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../../user-guide/security-mfa.md).

### Use SSO (single sign-on)

If you have [configured Snowflake to use single sign-on (SSO)](../../../user-guide/admin-security-fed-auth-overview.md), you can configure your client application to use SSO for authentication. See [Using SSO with client applications that connect to Snowflake](../../../user-guide/admin-security-fed-auth-use.md) for details and configure your connection using the instructions for Python.

### Use an external browser

You can use your browser to authenticate your Snowflake CLI connection with any SAML 2.0 compliant identity provider (IdP), such as Okta or Active Directory Federation Services.

> **Note:**
>
> The `externalbrowser` authenticator is only supported in terminal windows that have web browser access. For example, a terminal window on a remote machine accessed through a SSH (Secure Shell) session might require additional setup to open a web browser.
>
> If you don’t have access to a web browser, but your IdP is Okta, you can use native Okta by setting the authenticator to `https://<okta_account_name>.okta.com`.

To use external browser authentication, use one of the following methods:

* Use the `snow connection add --authenticator` command option:

  ```snowcli
  snow connection add --authenticator EXTERNALBROWSER
  ```
* Set `authenticator` to `EXTERNALBROWSER` in your `config.toml` file:

  ```toml
  [connections.externalbrowser]
  account = "my_account"
  user = "jdoe"
  authenticator = "EXTERNALBROWSER"
  ```

### Use PAT (Programmatic Access Token)

Programmatic Access Token (PAT) is a Snowflake-specific authentication method. The feature must be enabled for the account before usage (see the [Prerequisites](../../../user-guide/programmatic-access-tokens.md) for more information). Authentication with PAT doesn’t involve any human interaction.

To use PAT with the connection, set `authenticator` to `PROGRAMMATIC_ACCESS_TOKEN` and `token_file_path` to point the file with token, as shown:

```toml
[connections.externalbrowser]
account = "my_account"
user = "jdoe"
authenticator = "PROGRAMMATIC_ACCESS_TOKEN"
token_file_path = "path-to-pat-token"
```

For more information about PATs, see [Using programmatic access tokens for authentication](../../../user-guide/programmatic-access-tokens.md).

### Use workload identity federation (WIF)

Workload identity federation (WIF) is a feature that allows you to use your CI/CD environment’s identity to authenticate to Snowflake without the need for static credentials. This is particularly useful in automated workloads, where you want to minimize the risk of credential exposure.

For more information, see [Workload identity federation](../../../user-guide/workload-identity-federation.md).

#### Set up WIF connections

To set up a WIF connection, you need to create a service account in Snowflake using the following steps:

1. Create a service user in Snowflake with the proper WORKLOAD_IDENTITY:

> ```sqlexample
> CREATE USER <username>
> WORKLOAD_IDENTITY = (
>   TYPE = <WIF type>
>   // ...
> )
> TYPE = SERVICE
> DEFAULT_ROLE = PUBLIC;
> ```

1. Configure a connection in Snowflake CLI using either of the following methods

   * Add the connection to the `config.toml` file

     > ```toml
     > [connections.my_wif_conn]
     > account = "my_account"
     > authenticator = "WORKLOAD_IDENTITY"
     > workload_identity_provider = "<provider type>"
     > ```
   * Use the `snow connection add` command:

     ```snowcli
     snow connection add \
      --connection-name my_wif_conn \
      --account <account>
      --authenticatior WORKLOAD_IDENTITY \
      --workload-identity-provider <provider type>
     ```

where:

> `<provider type>` is one of the following:
>
> * AWS
> * AZURE
> * GCP
> * OIDC

> **Note:**
>
> When using OIDC as a provider, you need to retrieve the token from your environment and provide it to cli. You can provide retrieved token via
>
> * `--token` parameter
> * `SNOWFLAKE_TOKEN` environment variable
> * `SNOWFLAKE_CONNECTIONS_<connection_name>_TOKEN` environment variable
> * `token_file_path` in your `config.toml` file

For more information, see [Using Snowflake CLI actions](../cicd/integrate-ci-cd.md).

#### Connect to Snowflake using a temporary WIF connection

To connect to Snowflake using a temporary connection, you can use the following command:

```snowcli
snow sql -x \
--authenticator WORKLOAD_IDENTITY \
--workload-identity-provider AWS \
--account <my_account> \
-q 'select current_user()'

select current_user();
+----------------+
| CURRENT_USER() |
|----------------|
| <user name>    |
+----------------+
```

---
title: Managing Snowflake objects
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/objects/manage-objects.md
section: Snowflake CLI
---

# Managing Snowflake objects

The `snow object` commands provide you with a convenient way of managing most Snowflake objects, such as stages, Snowpark functions, or Streamlit apps. Instead of using separate commands for each type of object, you can use these commands to perform common tasks, including the following:

* Create an object of a specific type
* List available objects of a specified type.
* Display the description of an object.
* Delete an object.

To see a list of supported types use the `--help` option for any of the `snow object` commands, such as the following:

```snowcli
snow object list --help
```

```output
Usage: snow object list [OPTIONS] OBJECT_TYPE

Lists all available Snowflake objects of given type.
Supported types: compute-pool, database, function, image-repository, integration, network-rule,
procedure, role, schema, secret, service, stage, stream, streamlit, table, task,
user, view, warehouse

...
```

The object subcommands let you perform common operations, while leaving service-specific commands groups dedicated to service-specific operations.

## Create an object of a specific type

The `snow object create` command creates a specified object based on the definition provided, using the following syntax:

```bash
snow object create TYPE ([OBJECT_ATTRIBUTES]|[--json {OBJECT_DEFINITION}])
```

where:

* `TYPE` is a Snowflake object type:

  + `account`
  + `catalog-integration`
  + `compute-pool`
  + `database`
  + `database-role`
  + `dynamic-table`
  + `event-table`
  + `external-volume`
  + `function`
  + `image-repository`
  + `managed-account`
  + `network-policy`
  + `notebook`
  + `notification-integration`
  + `pipe`
  + `procedure`
  + `role`
  + `schema`
  + `service`
  + `stage`
  + `stream`
  + `table`
  + `task`
  + `user-defined-function`
  + `view`
  + `warehouse`

* `OBJECT_ATTRIBUTES` contains the object definition in the form of a list of `<key>=<value>` pairs, such as:

  ```snowcli
  snow object create database name=my_db comment="Created with Snowflake CLI"
  ```
* `--json {OBJECT_DEFINITION}` contains the object definition in JSON, such as:

  ```snowcli
  snow object create database --json '{"name":"my_db", "comment":"Created with Snowflake CLI"}'
  ```

> **Note:**
>
> The following object types require a database to be identified in the connection configuration, such as `config.toml`, or passed to the command using the `--database` option.
>
> * image-repository
> * schema
> * service
> * table
> * task

To create a database object using the `option-attributes` parameter:

```snowcli
snow object create database name=my_db comment='Created with Snowflake CLI'
```

To create a table object using the `option-attributes` parameter:

```snowcli
snow object create table name=my_table columns='[{"name":"col1","datatype":"number", "nullable":false}]' constraints='[{"name":"prim_key", "column_names":["col1"], "constraint_type":"PRIMARY KEY"}]' --database my_db --schema public
```

To create a database using the `--json object-definition` option:

```snowcli
snow object create database --json '{"name":"my_db", "comment":"Created with Snowflake CLI"}'
```

To create a table using the `--json object-definition` option:

```snowcli
snow object create table --json "$(cat table.json)" --database my_db
```

where `table.json` contains the following:

```json
{
  "name": "my_table",
  "columns": [
    {
      "name": "col1",
      "datatype": "number",
      "nullable": false
    }
  ],
  "constraints": [
    {
      "name": "prim_key",
      "column_names": ["col1"],
      "constraint_type": "PRIMARY KEY"
    }
  ]
}
```

## List all objects of a specific type

The `snow object list` command lists all objects of given type available with your permissions.

```bash
snow object list TYPE
```

where `TYPE` is the type of the object. Use `snow object list --help` for the full list of supported types.

To list all role objects, enter the following command:

```bash
snow object list role
```

```output
+--------------------------------------------------------------------------------------------------------------------------------+
|            |            |            |            | is_inherit | assigned_t | granted_to | granted_ro |            |           |
| created_on | name       | is_default | is_current | ed         | o_users    | _roles     | les        | owner      | comment   |
|------------+------------+------------+------------+------------+------------+------------+------------+------------+-----------|
| 2023-07-24 | ACCOUNTADM | N          | N          | N          | 2          | 0          | 2          |            | Account   |
| 06:05:49-0 | IN         |            |            |            |            |            |            |            | administr |
| 7:00       |            |            |            |            |            |            |            |            | ator can  |
|            |            |            |            |            |            |            |            |            | manage    |
|            |            |            |            |            |            |            |            |            | all       |
|            |            |            |            |            |            |            |            |            | aspects   |
|            |            |            |            |            |            |            |            |            | of the    |
|            |            |            |            |            |            |            |            |            | account.  |
| 2023-07-24 | PUBLIC     | N          | N          | Y          | 0          | 0          | 0          |            | Public    |
| 06:05:48.9 |            |            |            |            |            |            |            |            | role is   |
| 56000-07:0 |            |            |            |            |            |            |            |            | automatic |
| 0          |            |            |            |            |            |            |            |            | ally      |
|            |            |            |            |            |            |            |            |            | available |
|            |            |            |            |            |            |            |            |            | to every  |
|            |            |            |            |            |            |            |            |            | user in   |
|            |            |            |            |            |            |            |            |            | the       |
|            |            |            |            |            |            |            |            |            | account.  |
| 2023-07-24 | SYSADMIN   | N          | N          | N          | 0          | 1          | 0          |            | System    |
| 06:05:49.0 |            |            |            |            |            |            |            |            | administr |
| 33000-07:0 |            |            |            |            |            |            |            |            | ator can  |
| 0          |            |            |            |            |            |            |            |            | create    |
|            |            |            |            |            |            |            |            |            | and       |
|            |            |            |            |            |            |            |            |            | manage    |
|            |            |            |            |            |            |            |            |            | databases |
|            |            |            |            |            |            |            |            |            | and       |
|            |            |            |            |            |            |            |            |            | warehouse |
|            |            |            |            |            |            |            |            |            | s.        |
| 2023-07-24 | USERADMIN  | N          | N          | N          | 0          | 1          | 0          |            | User      |
| 06:05:49.0 |            |            |            |            |            |            |            |            | administr |
| 45000-07:0 |            |            |            |            |            |            |            |            | ator can  |
| 0          |            |            |            |            |            |            |            |            | create    |
|            |            |            |            |            |            |            |            |            | and       |
|            |            |            |            |            |            |            |            |            | manage    |
|            |            |            |            |            |            |            |            |            | users and |
|            |            |            |            |            |            |            |            |            | roles     |
+--------------------------------------------------------------------------------------------------------------------------------+
```

You can also use the `--like [-l] <pattern>` to filter objects by name using a SQL LIKE pattern. For example, `list function --like "my%"` lists all functions that begin with **my**. For more information about SQL patterns syntax, see [SQL LIKE Keyword](https://www.w3schools.com/sql/sql_ref_like.asp).

To list only role objects that begin with the string, **public**, enter the following command:

```snowcli
snow object list role --like public%
```

```output
show roles like 'public%'
+-------------------------------------------------------------------------------
| created_on                       | name        | is_default | is_current | ...
|----------------------------------+-------------+------------+------------+----
| 2023-02-01 15:25:04.105000-08:00 | PUBLIC      | N          | N          | ...
| 2024-01-15 12:55:05.840000-08:00 | PUBLIC_TEST | N          | N          | ...
+-------------------------------------------------------------------------------
```

## Display the description for an object of a specified type

The `snow object describe` command provides a description of an object of given type.

```bash
snow object describe TYPE IDENTIFIER
```

where:

* `TYPE` is the type of the object. Use `snow object describe --help` for the full list of supported types.
* `IDENTIFIER` is the name of the object. For procedures and functions, the identifier must specify arguments types, such as `"hello(int,string)"`.

To describe a function object, enter a command similar to the following:

```snowcli
snow object describe function "hello_function(string)"
```

```output
describe function hello_function(string)
+---------------------------------------------------------------------
| property           | value
|--------------------+------------------------------------------------
| signature          | (NAME VARCHAR)
| returns            | VARCHAR(16777216)
| language           | PYTHON
| null handling      | CALLED ON NULL INPUT
| volatility         | VOLATILE
| body               | None
| imports            |
| handler            | functions.hello_function
| runtime_version    | 3.9
| packages           | ['snowflake-snowpark-python']
| installed_packages | ['_libgcc_mutex==0.1','_openmp_mutex==5.1',...
+---------------------------------------------------------------------
```

## Delete an object of a specified type

The `snow object drop` command deletes a Snowflake object of given name and type.

```bash
snow object drop TYPE IDENTIFIER
```

where:

* `TYPE` is the type of the object. Use `snow object drop --help` for the full list of supported types.
* `IDENTIFIER` is the name of the object. For procedures and functions, the identifier must specify arguments types, such as `"hello(int,string)"`.

To drop a procedure, enter a commands similar to the following:

```snowcli
snow object drop procedure "test_procedure()"
```

```output
drop procedure test_procedure()
+--------------------------------------+
| status                               |
|--------------------------------------|
| TEST_PROCEDURE successfully dropped. |
+--------------------------------------+
```

---
title: Managing Snowflake stages
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/stages/manage-stages.md
section: Snowflake CLI
---

# Managing Snowflake stages

The `snow stage` commands let you perform additional stage-specific tasks:

* Create a named stage if it does not already exist.
* Copy all files from source to target directory.
* List the contents of a stage.
* Execute SQL files from a stage.
* Remove a file from a stage.

## Create a named stage

The `snow stage create` command creates a named stage if it does not already exist.

```bash
snow stage create <stage_name>
```

For example, to create a stage called `new_stage`, enter the following command:

```snowcli
snow stage create new_stage
```

```output
+-----------------------------------------------------+
| key    | value                                      |
|--------+--------------------------------------------|
| status | Stage area NEW_STAGE successfully created. |
+-----------------------------------------------------+
```

The following example shows what happens if you try to create a stage, `packages`, that already exists.

```snowcli
# stage that already exists
snow stage create packages
```

```output
+--------------------------------------------------------+
| key    | value                                         |
|--------+-----------------------------------------------|
| status | PACKAGES already exists, statement succeeded. |
+--------------------------------------------------------+
```

If you want to specify the type of encryption to use for all files stored on the stage, add the `--encryption` option to specify whether you want to full encryption (`SNOWFLAKE_FULL`) or only server-side encryption (`SNOWFLAKE_SSE`).

```snowcli
snow stage create new_stage --encryption SNOWFLAKE_FULL
```

```output
+-----------------------------------------------------+
| key    | value                                      |
|--------+--------------------------------------------|
| status | Stage area NEW_STAGE successfully created. |
+-----------------------------------------------------+
```

## Copy files to and from a stage

The `snow stage copy` command copies a file from the local machine to a stage, from a stage to a local machine, or between named stages.

```snowcli
snow stage copy <source_path> <destination_path>
```

Note the following guidelines:

* The stage path must start with `@`, as shown in the following examples.
* When you copy a single file, `<destination_path>` must identify a directory, not a file. If the specified directory does not exist, the command creates it.
* By default, when you copy a local directory to a stage, the local directory must contain only files. You can use the `--recursive` option to upload sub-directories in the local directory. You can use glob patterns with the `--recursive` option.
* When you copy a directory from a stage to a local filesystem, the command currently flattens its internal tree structure. To illustrate, assume your local directory contains the following:

  ```output
  test_case.py
  tests/abc.py
  tests/test1/x1.txt
  tests/test1/x2.txt
  ```

  After copying the directory from the stage, the local filesystem directory contains the following:

  ```output
  test_case.py
  abc.py
  x1.txt
  x2.txt
  ```

  > **Note:**
  >
  > If you want to maintain the file structure from the source directory, you can include the `--recursive` option.

### Copy files to a stage

* To copy files from the local machine to a stage, enter a command similar to the following:

  ```snowcli
  snow stage copy local_example_app @example_app_stage/app
  ```

  ```output
  put file:///.../local_example_app/* @example_app_stage/app4 auto_compress=false parallel=4 overwrite=False
  +--------------------------------------------------------------------------------------
  | source           | target           | source_size | target_size | source_compression...
  |------------------+------------------+-------------+-------------+--------------------
  | environment.yml  | environment.yml  | 62          | 0           | NONE             ...
  | snowflake.yml    | snowflake.yml    | 252         | 0           | NONE             ...
  | streamlit_app.py | streamlit_app.py | 109         | 0           | NONE             ...
  +--------------------------------------------------------------------------------------
  ```

You can use the `snow stage list-files` command to verify the command copied the files successfully:

```snowcli
snow stage list-files example_app_stage
```

```output
ls @example_app_stage
​+------------------------------------------------------------------------------------
| name                                   | size | md5                              | ...
|----------------------------------------+------+----------------------------------+-
| example_app_stage/app/environment.yml  | 64   | 45409c8da098125440bfb7ffbcd900f5 | ...
| example_app_stage/app/snowflake.yml    | 256  | a510b1d59fa04f451b679d43c703b6d4 | ...
| example_app_stage/app/streamlit_app.py | 112  | e6c2a89c5a164e34a0faf60b086bbdfc | ...
+------------------------------------------------------------------------------------
```

### Copy files from a stage

* To copy files from a stage to a directory on the local machine, enter a command similar to the following:

  ```bash
  mkdir local_app_backup
  snow stage copy @example_app_stage/app local_app_backup
  ```

  ```output
  get @example_app_stage/app file:///.../local_app_backup/ parallel=4
  +------------------------------------------------+
  | file             | size | status     | message |
  |------------------+------+------------+---------|
  | environment.yml  | 62   | DOWNLOADED |         |
  | snowflake.yml    | 252  | DOWNLOADED |         |
  | streamlit_app.py | 109  | DOWNLOADED |         |
  +------------------------------------------------+
  ```

You can list the directory contents to verify the command copied the files correctly:

```bash
ls local_app_backup
```

```output
environment.yml  snowflake.yml    streamlit_app.py
```

Note that the local directory must exist.

You can copy from a user stage (`@~`):

> ```bash
> snow stage copy "@~" . --recursive
> ```
>
> ```output
> +------------------------------------------------+
> | file             | size | status     | message |
> |------------------+------+------------+---------|
> | environment.yml  | 62   | DOWNLOADED |         |
> | snowflake.yml    | 252  | DOWNLOADED |         |
> | streamlit_app.py | 109  | DOWNLOADED |         |
> +------------------------------------------------+
> ```

### Copy files between stages

You can copy files directly between two named stages without downloading them to your local machine first. This can be useful for organizing files across different stages or creating backups.

* To copy files from one stage to another, use the following syntax:

  ```snowcli
  snow stage copy @source_stage @destination_stage
  ```

The following example copies all files from the `production_stage` to the `backup_stage`:

```snowcli
snow stage copy @production_stage @backup_stage
```

```output
+------------------------------------------------------------+
| file                                                       |
|------------------------------------------------------------|
| __init__.py                                                |
| main.py                                                    |
| procedure.py                                               |
+------------------------------------------------------------+
```

> **Note:**
>
> When you copy between stages, the destination cannot be a user stage (`@~`). You must specify named stages for both source and destination.

### Use glob patterns to specify files

You can specify multiple files matching a regular expression by using a glob pattern for the `source_path` argument. You must enclose the glob pattern in single or double quotes.

The following example copies all `.txt` files in a directory to a stage.

```bash
snow stage copy "testdir/*.txt" @TEST_STAGE_3
```

```output
put file:///.../testdir/*.txt @TEST_STAGE_3 auto_compress=false parallel=4 overwrite=False
+------------------------------------------------------------------------------------------------------------+
| source | target | source_size | target_size | source_compression | target_compression | status   | message |
|--------+--------+-------------+-------------+--------------------+--------------------+----------+---------|
| b1.txt | b1.txt | 3           | 16          | NONE               | NONE               | UPLOADED |         |
| b2.txt | b2.txt | 3           | 16          | NONE               | NONE               | UPLOADED |         |
+------------------------------------------------------------------------------------------------------------+
```

## List the contents of a stage

The `snow stage list-files` command lists the stage contents.

```bash
snow stage list-files <stage_path>
```

For example, to list the packages in a stage, enter the following command:

```bash
snow stage list-files packages
```

```output
ls @packages
+-------------------------------------------------------------------------------------
| name                 | size     | md5                              | last_modified
|----------------------+----------+----------------------------------+----------------
| packages/plp.Ada.zip | 824736   | 90639175a0ac7735e67525118b81047c | Tue, 16 Jan ...
| packages/samrand.zip | 13721024 | 648f0bae2f65fd4c9f178b17c23de7e5 | Tue, 16 Jan ...
+-------------------------------------------------------------------------------------
```

## Execute files from a stage

> **Note:**
>
> Snowflake CLI does not support executing Python files for Python versions 3.12 and above.

The `snow stage execute` command executes SQL or Python files from a stage.

```bash
snow stage execute <stage_path>
```

* For `.sql` files, the it performs an [EXECUTE IMMEDIATE FROM](../../../sql-reference/sql/execute-immediate-from.md) command on `.sql` files from a stage.
* For `.py` files, it executes a session-scoped [Snowpark Python procedure](../../snowpark/python/creating-sprocs.md).

  Snowflake CLI executes the procedure in Snowflake to guarantee a consistent execution environment. If your Python scripts require additional requirements, you should specify them in a `requirements.txt` file that resides in the same directory as the files on the stage. The `snow stage execute` command only supports packages from the Snowflake Anaconda channel.

  By default, the command looks for the `requirements.txt` file in the following precedence:

  + Stage path specified in the command’s `stage_path` parameter.
  + Parent directories of the specified stage path hierarchy, until it reaches the stage.
  + If you don’t specify a `requirements.txt` file, the command assumes no additional packages are necessary.

  For example, if you run `snow stage execute @my_stage/ml/app1/scripts`, the command looks for the file as follows:

  + `my_stage/ml/app1/scripts/requirements.txt`
  + `my_stage/ml/app1/requirements.txt`
  + `my_stage/ml/requirements.txt`
  + `my_stage/ml/requirements.txt`

The following examples illustrate ways to execute different sets of `.sql` files from a stage:

* Specify only a stage name to execute all `.sql` files in the stage:

  ```bash
  snow stage execute "@scripts"
  ```

  ```output
  SUCCESS - scripts/script1.sql
  SUCCESS - scripts/script2.sql
  SUCCESS - scripts/dir/script.sql
  +------------------------------------------+
  | File                   | Status  | Error |
  |------------------------+---------+-------|
  | scripts/script1.sql    | SUCCESS | None  |
  | scripts/script2.sql    | SUCCESS | None  |
  | scripts/dir/script.sql | SUCCESS | None  |
  +------------------------------------------+
  ```
* Specify a user stage (`@~`) to execute the `script.sql` files in the user stage:

  ```bash
  snow stage execute "@~/script1.sql"
  ```

  ```output
  SUCCESS - scripts/script1.sql
  +------------------------------------------+
  | File                   | Status  | Error |
  |------------------------+---------+-------|
  | @~/script.sql          | SUCCESS | None  |
  +------------------------------------------+
  ```

### Use glob patterns to select subsets of files

* Specify a glob-like pattern to execute all `.sql` files in the `dir` directory:

  ```bash
  snow stage execute "@scripts/dir/*"
  ```

  ```output
  SUCCESS - scripts/dir/script.sql
  +------------------------------------------+
  | File                   | Status  | Error |
  |------------------------+---------+-------|
  | scripts/dir/script.sql | SUCCESS | None  |
  +------------------------------------------+
  ```
* Specify a glob-like pattern to execute only `.sql` files in the `dir` directory that begin with “script”, followed by one character:

  ```bash
  snow stage execute "@scripts/script?.sql"
  ```

  ```output
  SUCCESS - scripts/script1.sql
  SUCCESS - scripts/script2.sql
  +---------------------------------------+
  | File                | Status  | Error |
  |---------------------+---------+-------|
  | scripts/script1.sql | SUCCESS | None  |
  | scripts/script2.sql | SUCCESS | None  |
  +---------------------------------------+
  ```
* Specify a direct file path with the `--silent` option:

  ```bash
  snow stage execute "@scripts/script1.sql" --silent
  ```

  ```output
  +---------------------------------------+
  | File                | Status  | Error |
  |---------------------+---------+-------|
  | scripts/script1.sql | SUCCESS | None  |
  +---------------------------------------+
  ```

## Remove a file from a stage

The `snow stage remove` command removes a file from a stage.

```bash
snow stage remove <stage_name> <file_name>
```

For example, to remove a file from a stage, enter a command similar to the following:

```bash
snow stage remove example_app_stage app/pages/my_page.py
```

```output
+-------------------------------------------------+
| key    | value                                  |
|--------+----------------------------------------|
| name   | example_app_stage/app/pages/my_page.py |
| result | removed                                |
+-------------------------------------------------+
```

---
title: Managing Snowpark Container Services in Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/services/overview.md
section: Snowflake CLI
---

# Managing Snowpark Container Services in Snowflake CLI

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowpark Container Services is a fully managed container offering that helps you easily deploy,
manage, and scale containerized applications without having to move data out of Snowflake.
As a fully managed service, it comes with Snowflake security, configuration, and operational best practices built in.

Snowpark Container Services is fully integrated with Snowflake. For example, your application can easily:

* Connect to Snowflake and run SQL in a Snowflake virtual warehouse.
* Access data files in a Snowflake stage.

Snowpark Container Services is also integrated with third-party tools. It allows you to use third-party clients
(such as Docker) to easily upload your application images to Snowflake. Seamless integration makes it easier for
teams to focus on building the data applications, not the environment.

This section describes the following topics:

* [Working with image registries and repositories](manage-images.md)
* [Managing compute pools](manage-compute-pools.md)
* [Managing services](manage-services.md)

---
title: Managing Streamlit apps
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/manage-apps/manage-app.md
section: Snowflake CLI
---

# Managing Streamlit apps

After you have created a Streamlit app, you can use the following commands to manage the app:

* To retrieve the URL of your Streamlit app, use the `snow streamlit get-url NAME` command. See [snow streamlit get-url](../../command-reference/streamlit-commands/get-url.md) for more information.
* To share your app to other roles, use the snow `streamlit share NAME TO_ROLE` command. See [snow streamlit share](../../command-reference/streamlit-commands/share.md) for more information.
* To list the Streamlit apps for which you have access, use the `snow streamlit list` command. See [snow streamlit list](../../command-reference/streamlit-commands/list.md) for more information.
* To display details about a Streamlit app, use the `snow streamlit describe NAME` command. See [snow streamlit describe](../../command-reference/streamlit-commands/describe.md) for more information.
* To delete a Streamlit app, use the `snow streamlit drop NAME` command. See [snow streamlit drop](../../command-reference/streamlit-commands/drop.md) for more information.

---
title: Managing Streamlit apps with Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/overview.md
section: Snowflake CLI
---

# Managing Streamlit apps with Snowflake CLI

For Streamlit developers who currently use a local IDE development flow and a Git-backed continuous integration and deployment (CI/CD) collaboration workflow, switching to in-browser editing for [Streamlit in Snowflake](../../streamlit/about-streamlit.md) can be difficult. Snowflake CLI gives developers critical and familiar tooling to integrate SiS into their current development flow.

Using Snowflake CLI, developers can now easily deploy apps from a CLI and perform operations efficiently without requiring any SQL knowledge. Without Snowflake CLI, Streamlit app developers had to deploy locally developed apps to the Snowflake infrastructure by executing SQL commands and copying local files to a stage. Now, these app developers can use whichever method they prefer.

You can perform the following operations when managing Streamlit apps:

* [Creating a Streamlit app](manage-apps/initialize-app.md)
* [Deploying a Streamlit app](manage-apps/deploy-app.md)
* [Retrieving the URL for a Streamlit app](manage-apps/get-url.md)
* [Share a Streamlit app](manage-apps/share-app.md)
* [Managing Streamlit apps](manage-apps/manage-app.md)

For more information about Streamlit apps in Native Apps, see [Add a Streamlit app](../../native-apps/adding-streamlit.md).

---
title: Migrating project definition files from version 1.x to 2.0
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/migrate-projects.md
section: Snowflake CLI
---

# Migrating project definition files from version 1.x to 2.0

To convert a version 1.x project definition file to the version 2 format, do the following:

1. Go to your project directory that contains the version 1.x `snowflake.yml` file.
2. Enter the `snow helpers v1-to-v2` command.

   * If the version 1.x file conversion succeeds, the command displays a message similar to the following:

     > ```snowcli
     > cd <project-directory>
     > snow helpers v1-to-v2
     > ```
     >
     > ```output
     > Project definition migrated to version 2.
     > ```
   * If your project definition file is already updated to version 2, the command displays the following message:

     > ```snowcli
     > cd <project-directory>
     > snow helpers v1-to-v2
     > ```
     >
     > ```output
     > Project definition is already at version 2.
     > ```

> * If you try to convert a project file that contains a `snowflake.local.yml` file, without using the `--[no]-migrate-local-overrides` option, the command generates an error similar to the following:
>
> > * If you try to convert a project file that contains templates, without using the `--accept-templates` option, the command generates an error similar to the following:
> >
> >   > ```snowcli
> >   > cd <project-directory>
> >   > snow helpers v1-to-v2
> >   > ```
> >   >
> >   > ```output
> >   > +- Error-------------------------------------------------------------------+
> >   > | snowflake.local.yml file detected, please specify                        |
> >   > | --migrate-local-overrides to include or --no-migrate-local-overrides to  |
> >   > | exclude its values.                                                      |
> >   > +--------------------------------------------------------------------------+
> >   > ```
> > * If you convert a project definition file that contains templates, and use the `--accept-templates` option, the command converts the file and displays a warning message similar to the following:
> >
> >   > ```snowcli
> >   > cd <project-directory>
> >   > snow helpers v1-to-v2
> >   > ```
> >   >
> >   > ```output
> >   > WARNING  snowflake.cli._plugins.workspace.commands:commands.py:60 Your V1 definition contains templates. We cannot guarantee the correctness of the migration.
> >   > Project definition migrated to version 2
> >   > ```

## Convert Native App projects

This section shows an example from a V1 to V2 conversion of a Snowflake Native App project, lists the changes in property names, and offers some tips to help with migration.

### Snowflake Native App conversion example

Native Apps project conversion example

| V1 project file | V2 project file |
| --- | --- |
| ```yaml definition_version: 1 native_app:   name: myapp   source_stage: app_src.stage   artifacts:     - src: app/*       dest: ./       processors:         - native app setup         - name: templates           properties:             foo: bar   package:     role: pkg_role     distribution: external   application:     name: myapp_app     warehouse: app_wh ``` | ```yaml definition_version: 2 entities:   pkg:     type: application package     meta:       role: pkg_role     identifier: <% fn.concat_ids('myapp', '_pkg_', fn.sanitize_id(fn.get_username('unknown_user')) | lower) %>     manifest: app/manifest.yml     artifacts:     - src: app/*       dest: ./       processors:       - name: native app setup       - name: templates         properties:           foo: bar     stage: app_src.stage   app:     meta:       warehouse: app_wh     identifier: myapp_app     type: application     from:       target: pkg ``` |

### Native App project definition V1 to V2 property changes

Native App project definition V1 to V2 property changes

| V1 property | V2 property |
| --- | --- |
| `native_app.name` | No equivalent. Use a template variable to port, if required. |
| `native_app.deploy_root` | `<package entity>.deploy_root` |
| `native_app.generated_root` | `<package entity>.generated_root` |
| `native_app.bundle_root` | `<package entity>.bundle_root` |
| `native_app.source_stage` | `<package entity>.source_stage` |
| `native_app.scratch_stage` | `<package entity>.scratch_stage` |
| `native_app.artifacts` | `<package entity>.artifacts` |
| `native_app.application.debug` | `<application entity>.debug` |
| `native_app.application.name` | `<application entity>.identifier` |
| `native_app.application.post_deploy` | `<application entity>.meta.post_deploy` (see above notes) |
| `native_app.application.role` | `<application entity>.meta.role` |
| `native_app.application.warehouse` | `<application entity>.meta.warehouse` |
| `native_app.package.distribution` | `<package entity>.distribution` |
| `native_app.package.name` | `<package entity>.identifier` |
| `native_app.package.post_deploy` | `<package entity>.meta.post_deploy` (see above notes) |
| `native_app.package.role` | `<package entity>.meta.role` |
| `native_app.package.scripts` | `<package entity>.meta.post_deploy` (see above notes) |
| `native_app.package.warehouse` | `<package entity>.meta.warehouse` |

### Migration tips

* When migrating Snowflake Native App package scripts, the `v1-to-v2` command converts them to `package post-deploy` hooks and replaces `{{package_name}}` in the package script file with the equivalent template expression.
* When migrating existing template expressions, `ctx.native_app`, `ctx.streamlit`, and `ctx.snowpark` variables are no longer
  accepted. The `v1-to-v2` command with equivalent template expressions that reference the specific entity name instead.
  For example, `ctx.native_app.package.name` could be replaced with `ctx.entities.pkg.identifier` if the package was migrated to an entity named `pkg` in the `snowflake.yml` file.

## Convert Streamlit projects

This section shows an example from a V1 to V2 conversion of a Streamlit project, lists the changes in property names, and offers some tips to help with migration.

### Streamlit conversion example

Streamlit project conversion example

| V1 project file | V2 project file |
| --- | --- |
| ```yaml definition_version: 1 streamlit:   name: test_streamlit   stage: streamlit   query_warehouse: test_warehouse   main_file: "streamlit_app.py"   title: "My Fancy Streamlit" ``` | ```yaml definition_version: 2 entities:   test_streamlit:     identifier:       name: test_streamlit     type: streamlit     title: My Fancy Streamlit     query_warehouse: test_warehouse     main_file: streamlit_app.py     pages_dir: None     stage: streamlit     artifacts:     - streamlit_app.py ``` |

### Streamlit project definition V1 to V2 property changes

Streamlit project definition V1 to V2 property changes

| V1 property | V2 property |
| --- | --- |
| `streamlit.name` | `<streamlit entity>.identifier.name` |
| `streamlit.schema` | `<streamlit entity>.identifier.schema` |
| `streamlit.database` | `<streamlit entity>.identifier.database` |
| `streamlit.comment` | `<streamlit entity>.comment` |
| `streamlit.title` | `<streamlit entity>.title` |
| `streamlit.query_warehouse` | `<streamlit entity>.query_warehouse` |
| `streamlit.main_file` | `<streamlit entity>.main_file` and `<streamlit entity>.artifacts` |
| `streamlit.stage` | `<streamlit entity>.stage` |
| `streamlit.env_file` | `<streamlit entity>.artifacts` |
| `streamlit.pages_dir` | `<streamlit entity>.pages_dir` and `<streamlit entity>.artifacts` |
| `streamlit.additional_source_files` | `<streamlit entity>.artifacts` |

### Streamlit migration tips

None.

## Convert Snowpark projects

This section shows an example from a V1 to V2 conversion of a Snowpark project, lists the changes in property names, and offers some tips to help with migration.

### Snowpark conversion example

Snowpark project conversion example

| V1 project file | V2 project file |
| --- | --- |
| ```yaml definition_version: 1 snowpark:   project_name: "my_snowpark_project"   stage_name: "dev_deployment"   src: "app/"   functions:     - name: func1       handler: "app.func1_handler"       signature:         - name: "a"           type: "string"           default: "default value"         - name: "b"           type: "variant"       returns: string       runtime: 3.10   procedures:     - name: procedureName       handler: "hello"       signature:         - name: "name"           type: "string"       returns: string ``` | ```yaml definition_version: 2 entities:   procedureName:     imports: []     external_access_integrations: []     secrets: {}     meta:       use_mixins:       - snowpark_shared     identifier:       name: procedureName     handler: hello     returns: string     signature:     - name: name       type: string     stage: dev_deployment     artifacts:     - src: app       dest: my_snowpark_project     type: procedure     execute_as_caller: false   func1:     imports: []     external_access_integrations: []     secrets: {}     meta:       use_mixins:       - snowpark_shared     identifier:       name: func1     handler: app.func1_handler     returns: string     signature:     - name: a       type: string       default: default value     - name: b       type: variant     runtime: '3.10'     stage: dev_deployment     artifacts:     - src: app       dest: my_snowpark_project     type: function mixins:   snowpark_shared:     stage: dev_deployment     artifacts:     - src: app/       dest: my_snowpark_project ``` |

### Snowpark project definition V1 to V2 property changes

Snowpark project definition V1 to V2 property changes

| V1 property | V2 property |
| --- | --- |
| `snowpark.project_name` | `<function or procedure entity>.artifacts.dest` for each function and/or procedure migrated from the project. See above notes regarding Snowpark migration. Each function or procedure should declare an artifact with `dest` defined as the `snowpark.project_name` value, and `src` defined as the `snowpark.src` value. Use of a mixin is recommended. |
| `snowpark.stage_name` | `<function or procedure entity>.stage` for each function and/or procedure migrated from the project. |
| `snowpark.src` | `<function or procedure entity>.artifacts.src` for each function and/or procedure migrated from the project. (see `snowpark.project_name above`) |
| `snowpark.functions` (list) | `<function entities> (top-level)` |
| `snowpark.procedures` (list) | `<procedure entities> (top-level)` |

Snowpark function and procedure definition V1 to V2 property changes

| V1 property | V2 property |
| --- | --- |
| `name` | `identifier.name` |
| `schema` | `identifier.schema` |
| `database` | `identifier.database` |
| `handler` | `handler` |
| `returns` | `returns` |
| `signature` | `signature` |
| `runtime` | `runtime` |
| `external_access_integrations` | `external_access_integrations` |
| `secrets` | `secrets` |
| `imports` | `imports` |
| `execute_as_caller` | `execute_as_caller` (only for procedures) |

### Snowpark migration tips

* When migrating Snowpark projects, each function (from the `snowpark.functions` array) or procedure (from the `snowpark.procedures` array) maps to a top-level entity.
* All top-level Snowpark project properties (e.g. `src`) are now defined for each function and procedure. To reduce duplication,
  Snowflake recommends that you declare a `mixin` and include it in each of the migrated function and procedure entities.

---
title: Opening an app in a browser
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/open-app.md
section: Snowflake CLI
---

# Opening an app in a browser

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.

## How to open a Snowflake Native App application in your default browser

The `snow app open` command opens the app specified in the resolved project definition of your Snowflake Native App project.

1. [Create a connection](../connecting/connect.md), if necessary.
2. Execute the `snow app open` command from within your project, similar to the following:

   > ```snowcli
   > snow app open --connection="dev"
   > ```

   When successful, the command returns the following message:

   > ```output
   > Application opened in browser.
   > ```

   If you have not yet installed an application as part of the `snow app run`, the following error message is displayed:

   > ```output
   > Application not yet deployed! Please run "snow app run" first.
   > ```

For more information about opening a Snowflake Native App in a browser, see the CLI [snow app open](../command-reference/native-apps-commands/open-app.md) command.

---
title: Preparing a local folder with configured Snowflake Native App artifacts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/bundle-app.md
section: Snowflake CLI
---

# Preparing a local folder with configured Snowflake Native App artifacts

## Create a local folder with configured artifacts

The `snow app bundle` command creates a local directory in your project, populates it with the file structure you specified in the project definition file, and generates CREATE FUNCTION or CREATE PROCEDURE declarations in Snowflake Native App setup scripts from Snowpark Python code that includes decorators (such as `@sproc` or `@udaf`). For more information, see the Snowpark Python documentation corresponding to your chosen function decorator, such as [snowflake.snowpark.functions.udaf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udaf).

The `snow app deploy` and `snow app run` commands already use this functionality. However, now with an explicit `snow app bundle` command at your disposal, you can explore this directory before it gets uploaded to the stage, to verify the artifacts were created as expected.

To create a local folder with the configured artifacts, do the following:

1. Create or verify your Snowflake `snowflake.yml` project definition file, such as:

   ```yaml
   definition_version: 2
   entities:
     codegen_nativeapp_pkg:
       type: application package
       manifest: root_files/_manifest.yml
       artifacts:
         - src: root_files/README.md
           dest: README.md
         - src: root_files/_manifest.yml
           dest: manifest.yml
         - src: root_files/setup_scripts/*
           dest: setup_scripts/
         - src: python/user_gen/echo.py
           dest: user_gen/echo.py
         - src: python/cli_gen/*
           dest: cli_gen/
           processors:
             - snowpark
     codegen_nativeapp:
       type: application
       from:
         target: codegen_nativeapp_pkg
   ```
2. From your project directory, run the `snow app bundle` command to create the temporary `output/deploy` directory that contains your configured artifacts.

   ```snowcli
   snow app bundle
   ```
3. Verify the contents of the output or deploy directory match the rules you specified in the snowflake.yml. file. If you invoked Snowpark annotation processing in your Python files, you can see the generated code in the amended setup script in the directory.

For more information, see the [snow app bundle](../command-reference/native-apps-commands/bundle-app.md) command.

## Generate SQL code using Snowpark annotation processing

As a Snowflake Native App developer with a limited SQL background, you might find it cumbersome to write and maintain [setup scripts](../../native-apps/creating-setup-script.md), which can get quite large and complicated over time. Setup scripts contain all the application logic that a customer can use with their data, and hence are a required part of developing a Snowflake Native App. One of the core components of setup scripts is your ability to use Snowpark Python extension functions for functions and stored procedures. In addition to writing Snowpark code in Python, Java, or other Snowpark supported languages, you need to write the corresponding portions of those functions and procedures using SQL in the setup script.

For example, you could create a basic function and stored procedure using Snowpark Python, as shown:

```Python
# Example python file "echo.py" that a developer writes

def echo_fn(data):
    return 'echo_fn: ' + data

def echo_proc(session, data):
    return 'echo_proc: ' + data
```

You would then need to upload the file to a stage and refer to it from the setup script SQL code, similar to the following:

```sqlexample
-- Sample setup_script.sql SQL file for a Snowflake Native App

CREATE APPLICATION ROLE IF NOT EXISTS app_instance_role;

CREATE OR ALTER VERSIONED SCHEMA ext_code_schema;
GRANT USAGE ON SCHEMA ext_code_schema TO APPLICATION ROLE app_instance_role;

CREATE OR REPLACE PROCEDURE ext_code_schema.py_echo_proc(DATA string)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES=('snowflake-snowpark-python')
  HANDLER='echo.echo_proc'
  IMPORTS=('/echo.py');

    GRANT USAGE ON PROCEDURE ext_code_schema.py_echo_proc(string)
      TO APPLICATION ROLE app_instance_role;

-- Wraps a function from a python file
CREATE OR REPLACE FUNCTION ext_code_schema.py_echo_fn(string)
RETURNS STRING
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES=('snowflake-snowpark-python')
HANDLER='echo.echo_fn'
IMPORTS=('/echo.py');

GRANT USAGE ON FUNCTION ext_code_schema.py_echo_fn(DATA string)
  TO APPLICATION ROLE app_instance_role;
```

### Automatic SQL code generation

> **Note:**
>
> To take advantage of automatic SQL code generation, you must use Snowpark Python version 1.15.0 and above.

To help alleviate this extra work, Snowflake CLI can automatically generate the necessary SQL for your setup scripts. Snowpark Python supports a feature called extension function decorators (`@udf`, `@sproc`, `@udtf`, and `@udaf`) that let you annotate your Python code, such as using the [snowflake.snowpark.functions.udf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udf) function decorator. Snowflake CLI can use these decorators to automatically create and validate the necessary SQL code for your setup scripts.

For example, you can use the `@udf` decorator for the function in the previous example:

```python
# some python file echo.py
@udf(name="echo_fn")
def echo_fn(data) -> str:
  return 'echo_fn: ' + str
```

Using the `@udf` decorator tells the Snowflake CLI [snow app bundle](../command-reference/native-apps-commands/bundle-app.md) (and other commands that internally invoke the `snow app bundle` command) to process the Snowpark Python decorators, generate the corresponding SQL commands, and include them in the setup script automatically, as shown. You can, therefore, minimize the amount of SQL code you need to write for your setup script.

```sqlexample
-- Sample setup_script.sql SQL file for a Snowflake Native App

-- User-written code
CREATE OR REPLACE APPLICATION ROLE app_instance_role;

CREATE OR ALTER VERSIONED SCHEMA ext_code_schema;
GRANT USAGE ON SCHEMA ext_code_schema TO APPLICATION ROLE app_instance_role;

-- Snowflake CLI generated code
CREATE OR REPLACE FUNCTION ext_code_schema.py_echo_fn(DATA string)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES=('snowflake-snowpark-python')
  HANDLER='echo.echo_fn'
  IMPORTS=('/echo.py');

  GRANT USAGE ON FUNCTION ext_code_schema.py_echo_fn(string)
    TO APPLICATION ROLE app_instance_role;
```

## Using the Snowpark Python decorators

While the Snowpark decorators in Snowflake CLI work the same as regular Snowpark Python decorators, you should be aware of the following differences when writing Python code files specifically for a Snowflake Native App:

* You can’t use any `Session` objects in these files, as Snowflake CLI executes these Python files in a sandbox environment with no connection to Snowflake. As a result, any reference of a Snowpark `Session` results in an error.
* You can use only the `@udf`, `@sproc`, `@udaf` and `@udtf` Snowpark Python decorators.
* You can’t use these decorators as regular functions to register your code as a Snowflake object. Only code explicitly annotated with the supported decorators is recognized. Therefore, the Python function must be a named function. Lambda functions are not supported.
* Snowflake CLI always generates CREATE OR REPLACE statements, as recommended by Snowflake, for creating functions and procedures in your setup scripts.

### More about decorator properties

The following table lists the Python decorator properties and explains how Snowflake CLI uses them.

Python decorator properties

| Property | Details |
| --- | --- |
| **name**  *Optional* | Name of the function or stored procedure Snowflake CLI uses to generate the SQL statements.  If you omit this property, Snowflake CLI reuses the Python function name to generate the SQL statements. |
| **input_types**  *Required* | Types for each input parameter for this function or stored procedure.  You must provide this information either in this decorator parameter or provide type annotations directly in your code. If this information is not available in either location, Snowflake CLI does not generate SQL statements for this function or stored procedure. |
| **return_type**  *Required* | Type for the return value for this function or stored procedure.  You must provide this information either in this decorator parameter or provide type annotation directly in your code. If this information is not available in either location, Snowflake CLI does not generate SQL statements for this function or stored procedure. |
| **packages**  *Optional* | List of packages. You can specify `snowflake-snowpark-python` with or without a version number. If you provide a version number for this package, Snowflake CLI does not use the version as part of its SQL generation, but does retain the version number for any other packages in the list.  If you omit this property, Snowflake CLI automatically adds `snowflake-snowpark-python` as the only package and reflects it in the generated SQL statements. |
| **imports**  *Optional* | List of files your Snowflake function or stored procedure needs to import from the stage. You can specify them either as a string or a tuple of strings. If you specify a tuple, Snowflake CLI only uses the string at the 0th index. For an example of using a tuple, see [Use external Python files](../../native-apps/adding-application-logic.md).  If you do not specify any imports, Snowflake CLI automatically adds an import for the Python file that contains the function or stored procedure for which it generates SQL. The path of the import is determined by the `dest` parameter of the Python file in the deploy root directory, based on the project definition file. |
| **execute_as**  *Optional* | Persona to use when executing a stored procedure. Values include: `caller` and `owner`. If unspecified, Snowflake CLI defaults to `owner`. Note that this property does not apply to functions. |
| **handler**  *N/A* | Handler for the function or stored procedure. Snowflake CLI automatically populates this field. |
| **replace**  *Unused* | Snowflake CLI assumes `true` for code generation. |
| **session**  *Required* | Must be `None`. If omitted, Snowflake CLI throws an error. |
| **is_permanent**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **stage_location**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **if_not_exists**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **strict**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **secure**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **immutable**  *Unused* | Snowflake CLI does not use this field for SQL generation. |
| **native_app_params**  *Optional* | (For a Snowflake Native App only)  Python dictionary containing the following Snowflake Native App parameters:   * `schema`: Name of the schema to contain the Snowpark function or stored procedure. This schema must already be defined in your setup script. Snowflake recommends setting the value to the name of a versioned schema in your setup script file. Snowflake CLI prefixes this value to the name of the Snowpark function or procedure name in the generated SQL statement. Note that Snowflake CLI does not create the schema for you. * `application_roles`: List of application roles to be granted USAGE privileges on the generated Snowpark function or procedure. Snowflake CLI does not create the application roles; it only creates SQL statements like `GRANT USAGE ON FUNCTION <schema_name.func_name> TO APPLICATION ROLE <app_role>` and adds them to the setup script.   While technically optional, not specifying the `native_app_params` property in your project definition file might result in an invalid setup script. |

When uploading your Python files to a destination stage, Snowflake CLI converts the decorators to comments so these UDFs and stored procedures are not created in your current session. The original source files are not changed so that the `snow app bundle` command remains idempotent. Only Python files in the deploy root directory are changed to contain the comments, as the deploy root is recreated every time you run the `snow app bundle` command. The following example illustrates how Snowflake CLI comments decorators.

```python
# output/deploy/dest_dir1/dest_dir2/echo.py
#: @sproc(
#:    return_type=IntegerType(),
#:    input_types=[IntegerType(), IntegerType()],
#:    packages=["snowflake-snowpark-python==1.15.0"],
#:    native_app_params={
#:        "schema": "ext_code_schema",
#:        "application_roles": ["app_instance_role"],
#:    },
#: )
def add_sp(session_, x, y):
    return x + y
```

Also, only the Python files with a `processors` property in the project definition file are affected.

---
title: Project definition files
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/project-definitions.md
section: Snowflake CLI
---

# Project definition files

A project definition file called `snowflake.yml` declares a directory as a Snowflake Native App project. It is a version-controlled file that resides at the root of a Snowflake Native App project directory and can either be created manually or by Snowflake CLI as part of project initialization. As long as you can provide this structured file in the directory but choose to use your own independent project structure, Snowflake CLI can discover the relevant files and carry out its functionality as usual.

For Native Apps, your `snowflake.yml` would look similar to the following:

```yaml
definition_version: 2
entities:
  pkg:
    type: application package
    identifier: <name_of_app_pkg>
    stage: app_src.stage
    manifest: app/manifest.yml
    artifacts:
      - src: app/*
        dest: ./
      - src: src/module-add/target/add-1.0-SNAPSHOT.jar
        dest: module-add/add-1.0-SNAPSHOT.jar
      - src: src/module-ui/src/*
        dest: streamlit/
    meta:
      role: <your_app_pkg_owner_role>
      warehouse: <your_app_pkg_warehouse>
      post_deploy:
        - sql_script: scripts/any-provider-setup.sql
        - sql_script: scripts/shared-content.sql
  app:
    type: application
    identifier: <name_of_app>
    from:
      target: pkg
    debug: <true|false>
    meta:
      role: <your_app_owner_role>
      warehouse: <your_app_warehouse>
```

## Common entity properties

The following table describes common properties available for project definition entities for Native Apps. See [Specify entities](../project-definitions/specify-entities.md) for more information on project definition entities.

Common entity properties

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | The type of entity to manage. For Snowflake Native App, valid values include:   * `application package`. For more information about application package properties, see Application package entity properties. * `application`. For more information about application properties, see Application entity properties. |
| **identifier**  *optional*, *string* | Optional Snowflake identifier for the entity, both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (e.g. `’”My Native Application Package”’`).  If not specified, the entity ID in the project definition is used as the identifier. |
| **meta.warehouse**  *optional*, *string* | Warehouse used to run the scripts provided as part of `meta.post_deploy`, if any SQL commands within these scripts require use of warehouse.  Default: Warehouse specified for the connection in the Snowflake CLI `config.toml` file.  **Note:** If you do not specify a warehouse, the application passes validation, but fails to install.  Typically, you specify this value in the `snowflake.local.yml` as described in Project definition overrides. |
| **meta.role**  *optional*, *string* | Role to use when creating the entity and provider-side objects.  **Note:** If you do not specify a role, Snowflake CLI attempts to use the default role assigned to your user in your Snowflake account.  Typically, you specify this value in the `snowflake.local.yml` as described in Project definition overrides.  Default: Role specified in the [Snowflake CLI connection](../connecting/connect.md) |
| **meta.post_deploy**  *optional*, *sequence* | List of SQL scripts to execute after the entity has been created. The following example shows how to define these scripts in the project definition file:  ```yaml definition_version: 2 entities:   myapp_pkg:     type: application package     ...     meta:       post_deploy:         - sql_script: scripts/post_deploy1.sql         - sql_script: scripts/post_deploy2.sql ```  These scripts are invoked by commands that create or update an entity. For example, running the `snow app deploy` command executes these scripts after creating or updating a package. They are also executed by `snow app run` if the application instance is not being directly installed from a version or release directive.  You can also use templates in the post-deploy SQL scripts as well, as shown in the following sample script content:  ```snowcli GRANT reference_usage on database provider_data to share in entity <% fn.str_to_id(ctx.entities.myapp_pkg.identifier) %> ``` |
| **meta.use_mixins**  *optional*, *sequence* | Names of mixins to apply to this entity. See [Project mixins](../project-definitions/specify-entities.md) for more information |

## Application package entity properties

The following table describes common properties available for application package entities for Native Apps. See [Specify entities](../project-definitions/specify-entities.md) for more information on project definition entities.

Properties for `application package` entities

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `application package`. |
| **manifest**  *optional*, *string* | The location of the Snowflake Native App `manifest.yml` file in your project.  **Note:** With version 3.2, this property switched from required to optional. |
| **deploy_root**  *optional*, *string* | Subdirectory at the root of your project where the build step copies the artifacts. Once copied to this location, you can deploy them to a Snowflake stage.  Default: `output/deploy` |
| **generated_root**  *optional*, *string* | Subdirectory of the deploy root where Snowflake CLI writes generated files.  Default: `__generated` |
| **stage**  *optional*, *string* | Identifier of the stage that stores the application artifacts. The value uses the form `<schema_name>.<stage_name>`. The stage lives within the Application Package object. You can change the name to avoid name collisions.  Default: `app_src.stage` |
| **artifacts**  *required*, *sequence* | List of file source and destination pairs to add to the deploy root, as well as an optional Snowpark annotation processor. You can use the following artifact properties:   * `src`: Path to the code source file or files * `dest`: Path to the directory to deploy the artifacts.  Destination paths that reference directories must end with a `/`. A glob pattern’s destination that does not end with a `/` results in an error. If omitted, `dest` defaults to the same string as `src`.  You can also pass in a string for each item instead of a `dict`, in which case the value is treated as both `src` and `dest`. * `processors`: Name of the processor to use to process the `src` code files. See More information about artifacts processors for more details.   If `src` refers to just one file (not a glob), `dest` can refer to a target `<path>` or a `<path/name>`.  You can also pass in a string for each item instead of a `dict`, which case, the value is treated as both `src` and `dest`.  Example without a processor:  ```yaml pkg:   artifacts:     - src: app/*       dest: ./     - src: streamlit/*       dest: streamlit/     - src: src/resources/images/snowflake.png       dest: streamlit/ ```  Example with a processor:  ```yaml pkg:   artifacts:     - src: qpp/*       dest: ./       processors:           - name: snowpark             properties:               env:                 type: conda                 name: <conda_name> ``` |
| **distribution**  *optional*, *string* | Distribution of the application package created by the Snowflake CLI. When running `snow app` commands, Snowflake CLI warns you if the application package you are working with has a different value for distribution than is set in your resolved project definition.  Default: `Internal` |
| **scratch_stage**  *optional*, *string* | Identifier of the stage that stores temporary scratch data used by Snowflake CLI. The value uses the form `<schema_name>.<stage_name>`. The stage lives within the Application Package object. You can change the name to avoid name collisions.  Default: `app_src.stage_snowflake_cli_scratch` |
| **stage_subdirectory**  *optional*, *string* | Name of the folder for Snowflake CLI to add as a subdirectory under the stage to hold the artifacts specified in this Application Package Entity. If none are specified, the artifacts are uploaded to the root of the stage.  Default: `""` (empty string) |
| **enable_release_channels**  *optional*, *bool* | Whether to enable publishing this application package in [release channels](publish-app.md).  Default: Unset |

## Application entity properties

The following table describes common properties available for application entities for Native Apps. See [Specify entities](../project-definitions/specify-entities.md) for more information on project definition entities.

Properties for `application` entities

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `application`. |
| **from.target**  *required*, *string* | Application package from which to create this application entity. In the following example, `target` defines the name of an entity in the `snowflake.yml` file.  ```yaml from:   target: my_pkg ``` |
| **telemetry.share_mandatory_events**  *optional*, *boolean* | Whether to enable event sharing at the application level. When this is set to `true`, all mandatory events are automatically shared with the application package provider.  ```yaml telemetry:   share_mandatory_events: true ``` |
| **telemetry.optional_shared_events**  *optional*, *sequence* | List of optional events to share with the provider in addition to the mandatory events. All events listed here must be declared in the `configuration.telemetry_event_definitions` section of the `manifest.yml` file. This field is supported only when `share_mandatory_events` is set to `true`.  ```yaml telemetry:   share_mandatory_events: true   optional_shared_events:     - DEBUG_LOGS ``` |
| **debug**  *optional*, *boolean* | Whether to enable debug mode when using a named stage to create an application.  Default: `True` |

### Sharing events with providers

> **Note:**
>
> Snowflake CLI supports event sharing only in `snowflake.yml` files based on definition version 2 or later. If you currently use an earlier version, see [Migrating project definition files from version 1.x to 2.0](../project-definitions/migrate-projects.md).

[Event sharing](../../native-apps/event-definition.md) allows applications to send telemetry events back to application package owners. When testing an application with an application package requiring event sharing, you must explicitly enable event sharing for the application installation to succeed.

To enable sharing of specific events, you must also have the events configured in the `configuration.telemetry_event_definitions` section in the `manifest.yml` file for the application package. You must also have the MANAGE EVENT SHARING global privilege to authorize event sharing for the application.

After event sharing is enabled in your application’s manifest, you must add a `telemetry` section to your `snowflake.yml` file that specifies the events you want to share from your application. The following code shows a sample `telemetry` section:

```yaml
definition_version: 2
entities:
  app:
    type: application
    from:
      target: pkg
    telemetry:
      share_mandatory_events: true
      optional_shared_events:
        - DEBUG_LOGS

...
```

The following examples illustrate different ways to share events in the `snowflake.yml` file. All of the examples are based on the following section in the application package’s `manifest.yml` file:

```yaml
configuration:
    telemetry_event_definitions:
        - type: ERRORS_AND_WARNINGS
          sharing: MANDATORY
        - type: DEBUG_LOGS
          sharing: OPTIONAL
```

* Authorize telemetry and share all mandatory events with the provider. In this case, only `ERRORS_AND_WARNINGS` events are shared.

  ```yaml
  definition_version: 2
  entities:
    app:
      type: application
      from:
        target: pkg
      telemetry:
        share_mandatory_events: true
  ```
* Share both `DEBUG_LOGS` and `ERRORS_AND_WARNINGS` events with the application package provider. Setting `share_mandatory_events` to `true` enables sharing of mandatory `ERRORS_AND_WARNINGS` events, while the `optional_shared_events` section enables optional events like `DEBUG_LOGS`.

  ```yaml
  definition_version: 2
  entities:
    app:
      type: application
      from:
        target: pkg
      telemetry:
        share_mandatory_events: true
        optional_shared_events:
          - DEBUG_LOGS
  ```

## More information about artifacts processors

If you include the `artifacts.processors` field in the project definition file, the `snow app bundle` command invokes custom processing for Python code files in the `src` directory or file.

This section covers a list of supported processors.

### Snowpark processor

> **Note:**
>
> The Snowpark processor has been deprecated and will be removed in a future release.

One of the processors supported by Snowflake CLI is `snowpark`, which applies Snowpark annotation processing to Python files. The following code examples show the basic structure and syntax for different processing environments:

* To execute code in a conda environment, use the following:

  ```yaml
  pkg:
    artifacts:
      - src: <some_src>
        dest: <some_dest>
        processors:
            - name: snowpark
              properties:
                env:
                  type: conda
                  name: <conda_name>
  ```

  where `<conda_name>` is the name of the conda environment containing the Python interpreter and the Snowpark library you want to use for Snowpark annotation processing.
* To execute code in a Python virtual environment, use the following:

  ```yaml
  pkg:
    artifacts:
      - src: <some_src>
        dest: <some_dest>
        processors:
            - name: snowpark
              properties:
                env:
                  type: venv
                  path: <venv_path>
  ```

  where `<venv_path>` is the path of the Python virtual environment containing the Python interpreter and the Snowpark library you want to use for Snowpark annotation processing. The path can be absolute or relative to the project directory.
* To execute code in the currently active environment, use any of the following equivalent definitions:

  ```yaml
  pkg:
    artifacts:
      - src: <some_src>
        dest: <some_dest>
        processors:
            - name: snowpark
              properties:
                env:
                  type: current
  ```

  or

  ```yaml
  pkg:
    artifacts:
      - src: <some_src>
        dest: <some_dest>
        processors:
            - name: snowpark
  ```

  or

  ```yaml
  pkg:
    artifacts:
      - src: <some_src>
        dest: <some_dest>
        processors:
            - snowpark
  ```

For more information about custom processing, see [Automatic SQL code generation](bundle-app.md) and the [snow app bundle](../command-reference/native-apps-commands/bundle-app.md) command.

### Templates processor

Snowflake Native App projects support templates in arbitrary files, which lets you expand templates in all files in an artifact’s `src` directory.
You can enable this feature by including a `templates` processor in the desired `artifacts` definition, as shown in the following example:

```yaml
definition_version: 2
entities:
  pkg:
    type: application package
    identifier: myapp_pkg
    artifacts:
      - src: app/*
        dest: ./
        processors:
          - templates
    manifest: app/manifest.yml
  app:
    type: application
    identifier: myapp_<% fn.get_username() %>
    from:
      target: pkg
```

When Snowflake CLI uploads the files to a stage, it automatically expands the templates before uploading them. For example, suppose your application contained an
`app/README.md` file with the following content that includes the `<% ctx.entities.pkg.identifier %>` template:

```markdown
This is a README file for application package <% ctx.entities.pkg.identifier %>.
```

The template is then expanded to the following before uploading the file to a stage:

```markdown
This is a README file for application package myapp_pkg.
```

## Project definition overrides

Though your project directory must have a `snowflake.yml` file, you can choose to customize the behavior of the Snowflake CLI by providing local overrides to `snowflake.yml`, such as a new role to test out your own application package. These overrides must be put in the `snowflake.local.yml` file that lives beside the base project definition. Snowflake suggests that you add it to your `.gitignore` file so it won’t be version-controlled by git. All templates provided by Snowflake already include it in the `.gitignore` file.

This overrides file must live in the same location as your `snowflake.yml` file.

The `snowflake.local.yml` file shares the exact schema as `snowflake.yml`, except that every value that was required is now optional, in additional to the already optional ones. The following shows a sample `snowflake.local.yml` file:

```yaml
entities:
  pkg:
    meta:
      role: <your_app_pkg_owner_role>
      name: <name_of_app_pkg>
      warehouse: <your_app_pkg_warehouse>
  app:
    debug: <true|false>
    meta:
      role: <your_app_owner_role>
      name: <name_of_app>
      warehouse: <your_app_warehouse>
```

Every `snow app` command prioritizes the parameters in this file over those set in base `snowflake.yml` configuration file. Sensible defaults already provide isolation between developers using the same Snowflake account to develop the same application project, so if you are just getting started we suggest not including an overrides file.

The final definition schema obtained after overriding `snowflake.yml` with `snowflake.local.yml` is called the resolved project definition.

### Limitations

Currently, Snowflake CLI does not support

* Multiple override files.
* A blank override file. Only create this file if you want to override a value from `snowflake.yml`.

---
title: Publishing a Snowflake Native App to customers
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/publish-app.md
section: Snowflake CLI
---

# Publishing a Snowflake Native App to customers

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.
* You must have an existing listing if you are publishing a Snowflake Native App to the [Snowflake Marketplace](../../../collaboration/collaboration-marketplace-about.md).

## How to publish a Snowflake Native App to customers

In Snowflake, publishing a Snowflake Native App to customers is done by setting release directives.
Release directives are a set of rules that determine which version and patch of the Snowflake Native App is available to which customers.

Release channels provide a way to manage separate release processes for different types of customers. For example, early access customers can use the ALPHA channel, the internal QA team can use the QA channel, and general customers can use the DEFAULT channel.

If release channels are enabled for an application package, the release directives are tied to the release channels; otherwise, the release directives are tied directly to the application package.

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

### Process with release channels enabled

To explicitly enable release channels, add `enable_release_channels=true` to the [application package definition](project-definitions.md) in your `snowflake.yml` file. You need to update or recreate your application package after enabling release channels.

> **Note:**
>
> After enabling, release channels cannot be disabled

To confirm that release channels have been enabled, run the [snow app release-channel list](../command-reference/native-apps-commands/release-channel/list.md) command. A list of release channels in the application package is then displayed:

```snowcli
snow app release-channel list
```

The simplest way to publish an existing version and patch to all customers on the default release channel is to use the [snow app publish](../command-reference/native-apps-commands/publish-app.md) command with the `--version` and `--patch` options:

```snowcli
snow app publish --version v1 --patch 1
```

To automatically create a new version and patch, use the `--create-version` option:

```snowcli
snow app publish --version v1 --create-version
```

To publish a Snowflake Native App to a non-default release channel, use the `--channel` option:

```snowcli
snow app publish --version v1 --patch 1 --channel ALPHA
```

To publish a Snowflake Native App to a custom release directive targeting specific customers, use the `--directive` option:

```snowcli
snow app publish --version v1 --patch 1 --channel ALPHA --directive customers_group_1
```

The `snow app publish` command adds the version to the release channel. If the release channel already has the maximum number of versions allowed, this command first attempts to remove from the channel one of the versions not referenced by any release directive.

After adding the version to the release channel, the command sets the default release directive of that release channel to the specified version and patch.

For more control over what is happening, replace the previous command with the following commands:

```snowcli
snow app release-channel add-version --version v1 ALPHA
snow app release-directive set customers_group_1 --version v1 --patch 1
```

For more information on managing release channels and release directives, see the [snow app release-channel](../command-reference/native-apps-commands/release-channel/overview.md) and [snow app release-directive](../command-reference/native-apps-commands/release-directive/overview.md) command references.

### Process with release channels disabled

If release channels are not enabled for an application package, the release directives are tied directly to the application package.

The simplest way to publish an existing version and patch to all customers is to use the [snow app publish](../command-reference/native-apps-commands/publish-app.md) command with the `--version` and `--patch` options.

```snowcli
snow app publish --version v1 --patch 1
```

This command sets the default release directive of the application package to the specified version and patch. In this case, release channels are not enabled, so no release channel is involved in this process.

If you want the publish command to automatically create a new version and patch, use the `--create-version` option:

```snowcli
snow app publish --version v1 --create-version
```

To publish a Snowflake Native App to a custom release directive targeting specific customers, use the `--directive` option:

```snowcli
snow app publish --version v1 --patch 1 --directive customers_group_1
```

These `snow app publish` commands continue to work even if release channels are enabled in the future. When release channels are enabled, the command starts using the default release channel.

---
title: Refreshing a repository
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/refresh-repo.md
section: Snowflake CLI
---

# Refreshing a repository

The `snow git fetch` command updates a repository stage with all branches, tags, and new commits from a remote repository.

To fetch changes in a repository, use the following command:

```bash
snow git fetch <REPO_NAME>
```

where:

* `<REPO_NAME>` is the ID of the repository stage.

The following example refreshes a repository named `my_snow_git`:

```snowcli
snow git fetch my_snow_git
```

```output
alter Git repository my_snow_git fetch
+-------------------------------------------------------------------+
| status                                                            |
|-------------------------------------------------------------------|
| Git Repository MY_SNOW_GIT is up to date. No change was fetched.. |
+-------------------------------------------------------------------+
```

---
title: Retrieving the URL for a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/manage-apps/get-url.md
section: Snowflake CLI
---

# Retrieving the URL for a Streamlit app

## Prerequisites

* The Streamlit app must already be uploaded to a stage in the connection you are currently using.
* Your current ROLE must have access to the app.

## How to get the URL for a deployed Streamlit app

The `snow streamlit get-url` command returns a URL for a deployed Streamlit app that you can then use to open the app in a browser.

To get an app URL, do the following:

1. Ensure your connection specifies the database and schema where your app is deployed.
2. Enter a command similar to the following:

   ```snowcli
   snow streamlit get-url my_streamlit_app
   ```

   ```output
   https://snowflake.com/provider-deduced-from-connection/#/streamlit-apps/DB.SCHEMA.MY_STREAMLIT_APP
   ```

You can use the command to return the URL and open the app automatically in your default browser by using the `--open` option, similar to the following:

```snowcli
snow streamlit get-url my_streamlit_app --open
```

## How to resolve common errors

* If the command fails because your ROLE does not have access to the Streamlit app, try the following:

  + Verify you are using the same ROLE in your browser that was used to deploy the app.
  + Switch to a ROLE that has access to the app. If you don’t have access to the ROLE used to create the app, the app developer can grant access to another ROLE with the `snow streamlit share` command.
* If the command fails because it could not find the Streamlit app, try the following:

  + Check the app name.
  + Verify you generated the URL using the same connection (host, account, database, and schema) that was used to deploy the app.
  + Ensure the database and schema are correct. If you specified the database and schema as a fully-qualified name, it overrides the values for them in the connection.

---
title: Setting up a Git repository
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/git/setup-git.md
section: Snowflake CLI
---

# Setting up a Git repository

You can integrate your remote Git repository with Snowflake so that files from the repository are synchronized to a special kind of stage called a *repository stage*. The repository stage acts as a local Git repository with a full clone of the remote repository, including branches, tags, and commits.

For more information, see [Using a Git repository in Snowflake](../../git/git-overview.md).

## Before you start

Before setting a Git repository, you need the following information:

* URL of the for the remote repository (also called the `origin` in Git).
* Optional credentials for connecting to Git, including a secret, username, and password.
* Optional API integration ID.
* Role or user with privileges to create API integrations, if you do not already have an API integration.

For more information, see [Setting up Snowflake to use Git](../../git/git-setting-up.md).

## Set up a Git repository

To clone a Git repository into Git repository stage, use the `snow git setup` command, as shown:

```bash
snow git setup <REPO_NAME>
```

where:

* `<REPO_NAME>` is the ID of the repository stage you want to create. Note that if the repository stage already exists, the command fails.

The `snow git setup` command provide a series of prompts to collect the necessary information, as shown in the following examples:

* Create a repository that requires a secret and credentials:

  ```bash
  $ snow git setup snowcli_git
  Origin url: https://github.com/snowflakedb/snowflake-cli.git
  Use secret for authentication? [y/N]: y
  Secret identifier (will be created if not exists) [snowcli_git_secret]: new_secret
  Secret 'new_secret' will be created
  username: john_doe
  password/token: ****
  API integration identifier (will be created if not exists) [snowcli_git_api_integration]:
  ```

  ```output
  Secret 'new_secret' successfully created.
  API integration snowcli_git_api_integration successfully created.
  +------------------------------------------------------+
  | status                                               |
  |------------------------------------------------------|
  | Git Repository SNOWCLI_GIT was successfully created. |
  +------------------------------------------------------+
  ```
* Create a repository without a secret and an existing API integration ID:

  ```bash
  $ snow git setup snowcli_git
  Origin url: https://github.com/snowflakedb/snowflake-cli.git
  Use secret for authentication [y/N]: n
  API integration identifier (will be created if not exists) [snowcli_git_api_integration]: EXISTING_INTEGRATION
  ```

  ```output
  Using existing API integration 'EXISTING_INTEGRATION'.
  +------------------------------------------------------+
  | status                                               |
  |------------------------------------------------------|
  | Git Repository SNOWCLI_GIT was successfully created. |
  +------------------------------------------------------+
  ```

If the role or user specified in your [connection](../connecting/configure-connections.md) has not been granted, executing this command generates an error similar to the following:

```bash
003001 (42501): 01b2f095-0508-c66d-0001-c1be009a66ee: SQL access control error: Insufficient privileges to operate on account XXX
```

In this situation, you should check your connection configuration or ask your account administrator to give you the necessary privileges or to create the integration for you. For more information, see [Setting up Snowflake to use Git](../../git/git-setting-up.md)

---
title: Share a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/streamlit-apps/manage-apps/share-app.md
section: Snowflake CLI
---

# Share a Streamlit app

## Prerequisites

Before sharing a Streamlit app with Snowflake CLI, you should meet the following prerequisites:

* Ensure that your account has the correct privileges as described in [Privileges required to create and use a Streamlit app](../../../streamlit/object-management/privileges.md).
* Ensure that the app is already deployed in your connection.
* Ensure that your connection has the right ROLE and that the connection uses the correct database and schema.

## How to share a Streamlit app

To share a Streamlit app from the stage, enter the following command:

```snowcli
snow streamlit share my-app some-role
```

For more information about sharing Streamlit apps, see the CLI [snow streamlit share](../../command-reference/streamlit-commands/share.md) command.

---
title: snow
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snow.md
section: Snowflake CLI
---

# snow

Snowflake CLI tool for developers.

## Syntax

```console
snow [<resource-commands>]
  --version
  --info
  --config-file <configuration_file>
  --install-completion
  --show-completion
  --help
```

## Arguments

`[resource-commands]`
:   Optional commands for managing Snowflake CLI resources.

## Options

`--version`
:   Shows the version of the Snowflake CLI.

`--info`
:   Shows information about the Snowflake CLI.

`--config-file configuration_file`
:   Specifies Snowflake CLI configuration file that should be used.

`--install-completion`
:   Install completion for the current shell.

`--show-completion`
:   Show completion for the current shell, to copy it or customize the installation.

`--help`
:   Displays the help text for this command.

## Usage notes

The **snow** command supports the following commands to manage Snowflake resources:

* [snow app commands](native-apps-commands/overview.md)
* [snow connection commands](connection-commands/overview.md)
* [snow git commands](git-commands/overview.md)
* [snow helpers commands](helpers-commands/overview.md)
* [snow notebook commands](notebook-commands/overview.md)
* [snow object commands](object-commands/overview.md)
* [snow snowpark commands](snowpark-commands/overview.md)
* [snow spcs service commands](spcs-commands/service-commands/overview.md)
* [snow sql commands](sql-commands/overview.md)
* [snow stage commands](stage-commands/overview.md)
* [snow streamlit commands](streamlit-commands/overview.md)

## Examples

* To display the Snowflake CLI version, run the following command:

  ```snowcli
  snow --version
  ```

  ```output
  Snowflake CLI version: 3.0.0
  ```
* To display information about Snowflake CLI, run the following command:

  ```snowcli
  snow --info
  ```

  ```output
  [
    {
        "key": "version",
        "value": "3.2.0"
    },
    {
        "key": "default_config_file_path",
        "value": "<user-home>/.snowflake/config.toml"
    },
    {
        "key": "python_version",
        "value": "3.11.6 (v3.11.6:8b6ee5ba3b, Oct  2 2023, 11:18:21) [Clang 13.0.0 (clang-1300.0.29.30)]"
    },
    {
        "key": "system_info",
        "value": "macOS-14.4.1-x86_64-i386-64bit"
    },
    {
        "key": "feature_flags",
        "value": {}
    },
    {
        "key": "SNOWFLAKE_HOME",
        "value": null
    }
  ]
  ```
* To display command-line help for the `snow` command, run the following command:

  ```bash
  snow --help
  ```

  ```output
  Usage: snow [OPTIONS] COMMAND [ARGS]...

  Snowflake CLI tool for developers.

  ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
  │ --version                           Shows version of the Snowflake CLI                                                                   │
  │ --info                              Shows information about the Snowflake CLI                                                            │
  │ --config-file                 FILE  Specifies Snowflake CLI configuration file that should be used [default: None]                       │
  │ --install-completion                Install completion for the current shell.                                                            │
  │ --show-completion                   Show completion for the current shell, to copy it or customize the installation.                     │
  │ --help                -h            Show this message and exit.                                                                          │
  ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
  ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
  │ app          Manages a Snowflake Native App                                                                                              │
  │ connection   Manages connections to Snowflake.                                                                                           │
  │ cortex       Provides access to Snowflake Cortex.                                                                                        │
  │ git          Manages git repositories in Snowflake.                                                                                      │
  │ notebook     Manages notebooks in Snowflake.                                                                                             │
  │ object       Manages Snowflake objects like warehouses and stages                                                                        │
  │ snowpark     Manages procedures and functions.                                                                                           │
  │ spcs         Manages Snowpark Container Services compute pools, services, image registries, and image repositories.                      │
  │ sql          Executes Snowflake query.                                                                                                   │
  │ stage        Manages stages.                                                                                                             │
  │ streamlit    Manages a Streamlit app in Snowflake.                                                                                       │
  ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
  ```
* To display command-line help resource commands, run a command similar to the following that displays help for the `snow spcs` commands:

  ```snowcli
  snow spcs --help
  ```

  ```output
  Usage: snow spcs [OPTIONS] COMMAND [ARGS]...

  Manages Snowpark Container Services compute pools, services, image registries, and image repositories.

  ╭─ Options ────────────────────────────────────────────────────────────────────────────────────────────────────────────╮
  │ --help  -h        Show this message and exit.                                                                        │
  ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
  ╭─ Commands ───────────────────────────────────────────────────────────────────────────────────────────────────────────╮
  │ compute-pool       Manages compute pools.                                                                            │
  │ image-registry     Manages image registries.                                                                         │
  │ image-repository   Manages image repositories.                                                                       │
  │ service            Manages services.                                                                                 │
  ╰──────────────────────────────────────────────────────────────────────────────────────────────────────────────────────╯
  ```

---
title: snow app bundle
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/bundle-app.md
section: Snowflake CLI
---

# snow app bundle

Prepares a local folder with configured app artifacts.

## Syntax

```console
snow app bundle
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app bundle` command creates a temporary local directory that contains all of the Snowflake Native App artifacts. It can also automatically generate SQL scripts from your Snowpark Python code. This command is called automatically by the [snow app deploy](deploy-app.md), [snow app run](run-app.md), and [snow app version create](version/app-version-create.md) commands. If, however, you want to see the setup script, artifacts, and generated SQL, before uploading them to a stage, you can run this command manually. For more information about generating SQL code, see [Preparing a local folder with configured Snowflake Native App artifacts](../../native-apps/bundle-app.md).

* The command uses the [project definition file](../../native-apps/project-definitions.md) to determine the name of the temporary folder to create within your project directory.

  + By default, it will be `<project_directory>/output/deploy`. This directory, which is also known as the deploy root, mirrors what the structure of the stage will be, once files are uploaded to stage in subsequent commands.
  + If you want Snowflake CLI to create a folder with a custom name instead of `output/deploy`, you can do so by providing the `deploy_root` field on the `application package` entity in the [project definition file](../../native-apps/project-definitions.md).

    > **Note:**
    >
    > You must provide a relative path for the deploy root; absolute paths are rejected. The deploy root path is created inside the project directory.
  + The deploy root is a temporary directory because it gets deleted and recreated every you `run snow app bundle` or another command invokes the `bundle` functionality.
* Because `snow app bundle` is automatically called as part of he [snow app deploy](deploy-app.md), [snow app run](run-app.md), and [snow app version create](version/app-version-create.md) commands, you should make changes to the source files only, outside the deploy root. If you modify files in the deploy root, the files are overwritten by the most recent state of your source files the next time you call one of these commands.
* If using a version control system such as `git`, you can choose to not to track the deploy root, as it can change frequently.
* `snow app bundle` does not build or compile your artifacts for you, such as creating jar files from your Java files. It only copies the artifacts specified in the project definition file and adds them to the deploy root to mimic the stage’s directory structure.
* `snow app bundle` does not need access to your Snowflake account; it only affects your local filesystem.
* The command has the following copying and symlinking behavior for any `artifacts` of the `application package` entity in the [project definition file](../../native-apps/project-definitions.md):

  + All directory names in a source path are also created in the deploy root.
  + All files in a source path are symlinked within these directories in the deploy root.
  + Some symlinked files in the deploy root can become hard links if you invoke SQL generation from those files. For more information, see [Preparing a local folder with configured Snowflake Native App artifacts](../../native-apps/bundle-app.md).

  Consider the following `artifacts` list example from a project definition file:

  ```yaml
  entities:
    pkg:
      type: application package
      ...
      artifacts:
        - src: dir1/dir2/*
          dest: dest_dir1/dest_dir2/
        - src: dir8/dir9/file.txt
          dest: dest_dir8/dest_file.txt
    ...
  ```

  where `dir1/dir2` in the project root could have other subdirectories, such as `dir3` and `dir4`, and some files, such as `file3.txt` and `file4.txt`.

  After running the `snow app bundle` command, your deploy root should look like the following:

  ```yaml
  -- deploy_root
        -- dest_dir1
              -- dest_dir2
                    -- dir3
                        -- ... <entire directory tree of dir3>
                    -- dir4
                        -- ... <entire directory tree of dir4>
                    -- file3.txt
                    -- file4.txt
        -- dest_dir8
              -- dest_file.txt
  ```

### Snowpark annotation processing

Beginning with Snowflake CLI version 2.5.0 and [Snowpark Python API](../../../snowpark/python/index.md) version 1.15.0, you can leverage the Snowpark annotation processing feature with the `snow app bundle` command. This feature lets you annotate your Python code files with Snowpark Python decorators, such as `@udf`, `@sproc`, `@udaf`, and `@udtf` to let Snowflake CLI automatically the corresponding CREATE FUNCTION or CREATE PROCEDURE SQL statements in setup script files in the project directory. For a better understanding of these decorators, please refer to corresponding Python decorators documentation.

Snowpark annotation processing involves the following:

* It reads all Python files you marked with a `processor` field in the project definition file.
* It creates a separate temporary sandbox Python environment using the environment information provided in the processor’s `properties` sub-field.
* It executes those Python files in the sandboxed environment.
* It collects all decorated functions from those files.
* With the collected information, Snowflake CLI generates the necessary SQL statements and adds them to the setup script whose location is specified in your `manifest.yaml` file.

You no longer need to repeat boilerplate SQL code for writing Snowpark extension functions for your Snowflake Native App apps.

For more information about enabling this feature in your project definition files, see [Using the Snowpark Python decorators](../../native-apps/bundle-app.md).

For more information about all supported artifact processors, see [More information about artifacts processors](../../native-apps/project-definitions.md).

## Examples

This example assumes you have made the necessary changes to your code files and added them to your `snowflake.yml` or `snowflake.local.yml` files, and also built or compiled any relevant artifacts.

```snowcli
cd my_app_project
snow app bundle
```

The command displays information about the various steps that occur while the command runs and creates a new directory a the location specified in your project definition file (default: `my_app_project/output/deploy`).

To see a simple use case in action, you can leverage the ready-to-use templates using the following commands:

```snowcli
snow init my_app_bundle_project --template app_basic
cd "my_app_bundle_project"
snow app bundle
ls my_app_bundle_project/output/deploy
```

---
title: snow app commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/overview.md
section: Snowflake CLI
---

# snow app commands

Snowflake CLI supports the following commands for managing Snowflake Native App apps:

> * [snow app bundle](bundle-app.md)
> * [snow app deploy](deploy-app.md)
> * [snow app events](retrieve-app-events.md)
> * [snow app open](open-app.md)
> * [snow app publish](publish-app.md)
> * [snow app release-channel commands](release-channel/overview.md)
> * [snow app release-directive commands](release-directive/overview.md)
> * [snow app run](run-app.md)
> * [snow app teardown](teardown-app.md)
> * [snow app validate](validate-app.md)
> * [snow app version commands](version/overview.md)

---
title: snow app deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/deploy-app.md
section: Snowflake CLI
---

# snow app deploy

Creates an application package in your Snowflake account and syncs the local changes to the stage without creating or updating the application. Running this command with no arguments at all, as in `snow app deploy`, is a shorthand for `snow app deploy --prune --recursive`.

## Syntax

```console
snow app deploy
  <paths>
  --prune / --no-prune
  --recursive / --no-recursive
  --interactive / --no-interactive
  --force
  --validate / --no-validate
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`paths...`
:   Paths, relative to the project root, of files or directories you want to upload to a stage. If a file is specified, it must match one of the artifacts src pattern entries in snowflake.yml. If a directory is specified, it will be searched for subfolders or files to deploy based on artifacts src pattern entries. If unspecified, the command syncs all local changes to the stage.

## Options

`--prune / --no-prune`
:   Whether to delete specified files from the stage if they don’t exist locally. If set, the command deletes files that exist in the stage, but not in the local filesystem. This option cannot be used when paths are specified.

`--recursive, -r / --no-recursive`
:   Whether to traverse and deploy files from subdirectories. If set, the command deploys all files and subdirectories; otherwise, only files in the current directory are deployed.

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--validate / --no-validate`
:   When enabled, this option triggers validation of a deployed Snowflake Native App’s setup script SQL. Default: True.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app deploy` command creates an application package in your Snowflake account, uploads code files to its stage, validates the setup script SQL, and runs any post-deploy hooks that are defined in `snowflake.yml`. Unlike the [snow app run](run-app.md) command, this command does not install or upgrade an application object.

For more information about deploying an app with release channels enable, see [Process with release channels enabled](../../native-apps/publish-app.md).

## Examples

If you want to create an application package using staged files, you can execute:

```bash
cd my_app_project
my_app_project_build_script.sh
snow app deploy --connection="dev"
```

---
title: snow app events
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/retrieve-app-events.md
section: Snowflake CLI
---

# snow app events

Fetches events for this app from the event table configured in Snowflake. By default, this command will fetch events generated by an app installed in the current connection’s account. To fetch events generated by an app installed in a consumer account, use the –consumer-org and –consumer-account options. This requires event sharing to be set up to route events to the provider account: <https://docs.snowflake.com/en/developer-guide/native-apps/setting-up-logging-and-events>

## Syntax

```console
snow app events
  --since <since>
  --until <until>
  --type <record_types>
  --scope <scopes>
  --consumer-org <consumer_org>
  --consumer-account <consumer_account>
  --consumer-app-hash <consumer_app_hash>
  --first <first>
  --last <last>
  --follow
  --follow-interval <follow_interval>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--since TEXT`
:   Fetch events that are newer than this time ago, in Snowflake interval syntax.

`--until TEXT`
:   Fetch events that are older than this time ago, in Snowflake interval syntax.

`--type [log|span|span_event]`
:   Restrict results to specific record type. Can be specified multiple times. Default: [].

`--scope TEXT`
:   Restrict results to a specific scope name. Can be specified multiple times. Default: [].

`--consumer-org TEXT`
:   The name of the consumer organization.

`--consumer-account TEXT`
:   The name of the consumer account in the organization.

`--consumer-app-hash TEXT`
:   The SHA-1 hash of the consumer application name.

`--first INTEGER`
:   Fetch only the first N events. Cannot be used with –last. Default: -1.

`--last INTEGER`
:   Fetch only the last N events. Cannot be used with –first. Default: -1.

`--follow, -f`
:   Continue polling for events. Implies –last 20 unless overridden or the –since flag is used. Default: False.

`--follow-interval INTEGER`
:   Polling interval in seconds when using the –follow flag. Default: 10.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> Before using this command, you must set up event handling for the provider Snowflake account. For information, see [Logging, tracing, and metrics](../../../logging-tracing/logging-tracing-overview.md).

The `snow app events` command retrieves events generated by an installed native application installed in the current connection’s account.

By default, this command will fetch events generated by a Snowflake Native App installed in the current connection’s account. To fetch events generated by a Snowflake Native App installed in a consumer account, use the `--consumer-org` and `--consumer-account` options. These options require event sharing to be [set up to route events](../../../native-apps/event-about.md) to the provider account.

## Examples

* Retrieve all events for an application installed in the provider account.

  ```snowcli
  snow app events
  ```
* Retrieve a subset of events for an application installed in the provider account.

  ```snowcli
  # Limiting the number of events
  snow app events --first 10
  snow app events --last 10

  # Narrowing the time range using interval syntax
  snow app events --since '5 minutes'
  snow app events --until '1 hour'

  # Filtering events
  snow app events --type log
  snow app events --scope com.myapp.MyClass1 --scope com.myapp.MyClass2
  ```
* Retrieve events for a consumer installation.

  ```snowcli
  snow app events --consumer-org <organization-name> --consumer-account <account-name>
  ```
* Retrieve events for a consumer application using the hashed application name.

  ```snowcli
  snow app events --consumer-org <organization-name> --consumer-account <account-name> --consumer-app-hash cafc10bf6a5deb574ada0e3a009b63bbbe9bdb84
  ```
* Retrieve events as JSON.

  ```snowcli
  snow app events --format json
  ```

---
title: snow app open
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/open-app.md
section: Snowflake CLI
---

# snow app open

Opens the Snowflake Native App inside of your browser, once it has been installed in your account.

## Syntax

```console
snow app open
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

The `snow app open` command opens the native application specified in `snowflake.yml` of your native apps project.

## Examples

Assuming the application specified in the resolved project definition exists, you can execute:

```snowcli
snow app open --connection="dev"
```

---
title: snow app publish
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/publish-app.md
section: Snowflake CLI
---

# snow app publish

Adds the version to the release channel and updates the release directive with the new version and patch.

## Syntax

```console
snow app publish
  --version <version>
  --patch <patch>
  --channel <channel>
  --directive <directive>
  --interactive / --no-interactive
  --force
  --create-version
  --from-stage
  --label <label>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--version TEXT`
:   The version to publish to the provided release channel and release directive. Version is required to exist unless `--create-version` flag is used.

`--patch INTEGER`
:   The patch number under the given version. This will be used when setting the release directive. Patch is required to exist unless `--create-version` flag is used.

`--channel TEXT`
:   The name of the release channel to publish to. If not provided, the default release channel is used. Default: DEFAULT.

`--directive TEXT`
:   The name of the release directive to update with the specified version and patch. If not provided, the default release directive is used. Default: DEFAULT.

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--create-version`
:   Create a new version or patch based on the provided `--version` and `--patch` values. Fallback to the manifest values if not provided. Default: False.

`--from-stage`
:   When enabled, the Snowflake CLI creates a version from the current application package stage without syncing to the stage first. Can only be used with `--create-version` flag. Default: False.

`--label TEXT`
:   A label for the version that is displayed to consumers. Can only be used with `--create-version` flag.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app publish` command lets you add Snowflake Native App versions to a release channel and then sets the selected release directive to use the provided version and patch.

For more information on release channels and release directives, see [Publishing a Snowflake Native App to customers](../../native-apps/publish-app.md).

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.
>
> If the release channel feature is not available, you can ignore the `--channel` parameter of this command.

This command adds the specified version to the release channel. If the release channel has reached its maximum number of versions, the oldest version not referenced by any release directive is removed from the release channel.
After the version is added to the release channel, the release directive within the release channel is updated to use the provided version and patch.

If release channels are not enabled for the application package, only the release directive is updated to use the provided version and patch.
When a release channel is not provided, or when using the default release channel, you can use the same commands whether release channels are enabled or not.

This command assumes that the version and patch already exist in the application package. If the version and patch do not exist, the command fails.

To create a new version or patch when using this command, use the `--create-version` option. By using this option, you can use options like `--from-stage` or `--label`. For more information, also see the [snow app version create](version/app-version-create.md) command.

The rules for creating a new version are the same rules as for the [snow app version create](version/app-version-create.md) command. In other words, Snowflake CLI uses the same fallback logic to the manifest file if the version field is missing.

## Examples

* Publish version v1 and patch 2 to the default release directive of the default release channel or to the default release directive in the package. In this example, release channels are not enabled:

  ```snowcli
  snow app publish --version v1 --patch 2
  ```
* Publish version v1 and patch 2 to the `customers_group_1` release directive of the ALPHA release channel:

  ```snowcli
  snow app publish --version v1 --patch 2 --channel ALPHA --directive customers_group_1
  ```
* Publish version v1 and patch 2 to the default release directive of the QA release channel:

  ```snowcli
  snow app publish --version v1 --patch 2 --channel QA
  ```
* Create a new version and publish it to the custom `early_adopters` release directive of the default release channel:

  ```snowcli
  snow app publish --version v2 --create-version --directive early_adopters
  ```
* Add a patch to an existing version and publish it to the default release directive of the default release channel. You must use `--create-version` and either provide the patch number or omit it to use the next available patch number:

  ```snowcli
  snow app publish --version v2 --create-version
  ```
* Create a new patch from the content of the stage without syncing files to the stage first, and publish it to the default release directive of the default release channel:

  ```snowcli
  snow app publish --version v2 --patch 11 --create-version --from-stage
  ```

---
title: snow app release-channel add-accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/add-accounts.md
section: Snowflake CLI
---

# snow app release-channel add-accounts

Adds accounts to a release channel.

## Syntax

```console
snow app release-channel add-accounts
  <channel>
  --target-accounts <target_accounts>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to add accounts to.

## Options

`--target-accounts TEXT`
:   The accounts to add to the release channel. Format must be `org1.account1,org2.account2`.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel add-accounts` command adds a list of accounts to an existing release channel for an application package.
The release channel must already exist, and release channels must be enabled for the application package. Only non-default release channels can have accounts associated with them.
To view the available release channels for the application package, use the [snow app release-channel list](list.md) command.
The specified accounts are provided in the format of ORGANIZATION_NAME.ACCOUNT_NAME and separated by comma.

## Examples

* Add accounts to the ALPHA release channel:

  ```snowcli
  snow app release-channel add-accounts ALPHA --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```

---
title: snow app release-channel add-version
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/add-version.md
section: Snowflake CLI
---

# snow app release-channel add-version

Adds a version to a release channel.

## Syntax

```console
snow app release-channel add-version
  <channel>
  --version <version>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to add a version to.

## Options

`--version TEXT`
:   The version to add to the release channel.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel add-version` command adds a version to an existing release channel for an application package.
The release channel must already exist, and release channels must be enabled for the application package.
To view the available release channels for the application package, use the [snow app release-channel list](list.md) command.
The specified version must already exist in the application package, and the version must not already be associated with the release channel.
If the maximum number of versions is already associated with the release channel, the command fails.

## Examples

* Add version v1 to the default release channel:

  ```snowcli
  snow app release-channel add-version --version v1 DEFAULT
  ```
* Add version v1 to a non-default release channel:

  ```snowcli
  snow app release-channel add-version --version v1 ALPHA
  ```

---
title: snow app release-channel commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/overview.md
section: Snowflake CLI
---

# snow app release-channel commands

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

Release channels help manage releases.
These commands fail if the feature is not enabled.
To enable release channels, contact your Snowflake representative.

For more information on release channels and release directives, see [Publishing a Snowflake Native App to customers](../../../native-apps/publish-app.md).

Snowflake CLI supports the following commands for managing release channels in Snowflake Native Apps:

> * [snow app release-channel list](list.md)
> * [snow app release-channel add-version](add-version.md)
> * [snow app release-channel remove-version](remove-version.md)
> * [snow app release-channel add-accounts](add-accounts.md)
> * [snow app release-channel remove-accounts](remove-accounts.md)
> * [snow app release-channel set-accounts](set-accounts.md)

---
title: snow app release-channel list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/list.md
section: Snowflake CLI
---

# snow app release-channel list

Lists the release channels available for an application package.

## Syntax

```console
snow app release-channel list
  <channel>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to list. If not provided, all release channels are listed.

## Options

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel list` lists all the release channels available in the current application package.
If release channels are not enabled in the application package, this command returns no results.

## Examples

* List all the release channels:

  ```snowcli
  snow app release-channel list
  ```
* To display the results in JSON format, add the `--format=json` option:

  ```snowcli
  snow app release-channel list --format=json
  ```

---
title: snow app release-channel remove-accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/remove-accounts.md
section: Snowflake CLI
---

# snow app release-channel remove-accounts

Removes accounts from a release channel.

## Syntax

```console
snow app release-channel remove-accounts
  <channel>
  --target-accounts <target_accounts>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to remove accounts from.

## Options

`--target-accounts TEXT`
:   The accounts to remove from the release channel. Format must be `org1.account1,org2.account2`.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel remove-accounts` command removes a list of accounts from an existing release channel for an application package.
The release channel must already exist, and release channels must be enabled for the application package. Only non-default release channels can have accounts associated with them.
To view the available release channels for the application package, use the [snow app release-channel list](list.md) command.
The specified accounts are provided in the format of ORGANIZATION_NAME.ACCOUNT_NAME and separated by comma.

## Examples

* Remove accounts from the ALPHA release channel:

  ```snowcli
  snow app release-channel remove-accounts ALPHA --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```

---
title: snow app release-channel remove-version
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/remove-version.md
section: Snowflake CLI
---

# snow app release-channel remove-version

Removes a version from a release channel.

## Syntax

```console
snow app release-channel remove-version
  <channel>
  --version <version>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to remove a version from.

## Options

`--version TEXT`
:   The version to remove from the release channel.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel remove-version` command removes a version from an existing release channel for an application package.
The release channel must already exist, and release channels must be enabled for the application package.
To view the available release channels for the application package, use the [snow app release-channel list](list.md) command.
The specified version must already exist in the application package, and the version must already be associated with the release channel.

## Examples

* Remove version v1 from the default release channel:

  ```snowcli
  snow app release-channel remove-version --version v1 DEFAULT
  ```
* Remove version v1 from a non-default release channel:

  ```snowcli
  snow app release-channel remove-version --version v1 ALPHA
  ```

---
title: snow app release-channel set-accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-channel/set-accounts.md
section: Snowflake CLI
---

# snow app release-channel set-accounts

Sets accounts for a release channel.

## Syntax

```console
snow app release-channel set-accounts
  <channel>
  --target-accounts <target_accounts>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`channel`
:   The release channel to set accounts for.

## Options

`--target-accounts TEXT`
:   The accounts to set for the release channel. Format must be `org1.account1,org2.account2`.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> The release channels feature might not be available in all regions. Please contact Snowflake Support for more information.

The `snow app release-channel set-accounts` command assigns a list of accounts to an existing release channel of an application package.
The release channel must already exist, and release channels must be enabled for the application package. Only non-default release channels can have accounts associated with them.

To specify the accounts, provide comma-separated ORGANIZATION_NAME.ACCOUNT_NAME values.

To view the available release channels for the application package, use the [snow app release-channel list](list.md) command.

## Examples

* Set accounts for the ALPHA release channel:

  ```snowcli
  snow app release-channel set-accounts ALPHA --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```

---
title: snow app release-directive add-accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/add-accounts.md
section: Snowflake CLI
---

# snow app release-directive add-accounts

Adds accounts to a release directive.

## Syntax

```console
snow app release-directive add-accounts
  <directive>
  --channel <channel>
  --target-accounts <target_accounts>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`directive`
:   Name of the release directive.

## Options

`--channel TEXT`
:   Name of the release channel to use. Default: DEFAULT.

`--target-accounts TEXT`
:   List of the accounts to add to the release directive. Format must be `org1.account1,org2.account2`.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app release-directive add-accounts` command adds a list of accounts to an existing custom release directive for an application package.
The custom release directive must already exist in the application package (or the release channel if enabled).

To specify the accounts, provide comma-separated values in the format ORGANIZATION_NAME.ACCOUNT_NAME.

To view the available release directives for the application package, use the [snow app release-directive list](list.md) command.

## Examples

* To add accounts to the `my_directive` custom release directive:

  ```snowcli
  snow app release-directive add-accounts my_directive --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```
* When release channels are enabled, release directives become part of a release channel. To add accounts to the `special_alpha_directive` custom release directive associated with release channel `ALPHA`:

  ```snowcli
  snow app release-directive add-accounts special_alpha_directive --channel ALPHA --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```

---
title: snow app release-directive commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/overview.md
section: Snowflake CLI
---

# snow app release-directive commands

For more information about release channels and release directives, see the [Publishing a Snowflake Native App to customers](../../../native-apps/publish-app.md) page.

Snowflake CLI supports the following commands for managing release directives of native apps:

> * [snow app release-directive add-accounts](add-accounts.md)
> * [snow app release-directive list](list.md)
> * [snow app release-directive remove-accounts](remove-accounts.md)
> * [snow app release-directive set](set.md)
> * [snow app release-directive unset](unset.md)

---
title: snow app release-directive list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/list.md
section: Snowflake CLI
---

# snow app release-directive list

Lists release directives in an application package. If no release channel is specified, release directives for all channels are listed. If a release channel is specified, only release directives for that channel are listed. If `--like` is provided, only release directives matching the SQL pattern are listed.

## Syntax

```console
snow app release-directive list
  --like <like>
  --channel <channel>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `snow app release-directive list --like='my%'` lists all release directives starting with ‘my’. Default: %%.

`--channel TEXT`
:   The release channel to use when listing release directives. If not provided, release directives from all release channels are listed.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app release-directive list` command lists all the release directives available in the current application package.
If no release channel is specified, release directives for all channels are listed. If a release channel is specified, only release directives for that channel are listed. If `--like` is provided, only release directives matching the SQL pattern are listed.

## Examples

* List all release directives associated with all release channels in an application package:

  ```snowcli
  snow app release-directive list
  ```
* List all release directives associated with a specific release channel in an application package:

  ```snowcli
  snow app release-directive list --channel ALPHA
  ```
* List all release directives starting with the word `vip`:

  ```snowcli
  snow app release-directive list --like vip%
  ```

---
title: snow app release-directive remove-accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/remove-accounts.md
section: Snowflake CLI
---

# snow app release-directive remove-accounts

Removes accounts from a release directive.

## Syntax

```console
snow app release-directive remove-accounts
  <directive>
  --channel <channel>
  --target-accounts <target_accounts>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`directive`
:   Name of the release directive.

## Options

`--channel TEXT`
:   Name of the release channel to use. Default: DEFAULT.

`--target-accounts TEXT`
:   List of the accounts to remove from the release directive. Format must be `org1.account1,org2.account2`.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app release-directive remove-accounts` command removes a list of accounts from an existing custom release directive for an application package.
The specified release directive must already exist in the application package (or the release channel if enabled).

To specify the accounts, provide comma-separated ORGANIZATION_NAME.ACCOUNT_NAME values.

To view the available release directives for the application package, use the [snow app release-directive list](list.md) command.

## Examples

* Remove accounts from the `my_directive` custom release directive:

  ```snowcli
  snow app release-directive remove-accounts my_directive --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```
* When release channels are enabled, release directives become part of a release channel. To remove accounts from the `special_alpha_directive` custom release directive associated with release channel `ALPHA`:

  ```snowcli
  snow app release-directive remove-accounts special_alpha_directive --channel ALPHA --target-accounts ORG1.ACCT1,ORG2.ACCT2
  ```

---
title: snow app release-directive set
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/set.md
section: Snowflake CLI
---

# snow app release-directive set

Sets a release directive. target_accounts cannot be specified for default release directives. target_accounts field is required when creating a new non-default release directive.

## Syntax

```console
snow app release-directive set
  <directive>
  --channel <channel>
  --target-accounts <target_accounts>
  --version <version>
  --patch <patch>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`directive`
:   Name of the release directive to set.

## Options

`--channel TEXT`
:   Name of the release channel to use. Default: DEFAULT.

`--target-accounts TEXT`
:   List of the accounts to apply the release directive to. Format must be `org1.account1,org2.account2`.

`--version TEXT`
:   Version of the application package to use.

`--patch INTEGER`
:   Patch number to use for the selected version.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app release-directive set` command sets the release directive for an application package.
There are two types of release directives: default and custom.

* When you set the default release directive, target accounts are not accepted.
* When you set a new custom release directive, the target accounts are required.
* When you update an existing custom release directive, the target accounts are optional.

Target accounts are provided in the format ORGANIZATION_NAME.ACCOUNT_NAME, separated by commas.

When release channels are enabled in the application package, the release directive is scoped to the specified release channel; otherwise, it is scoped to the application package.

Snowflake recommends using the [snow app publish](../publish-app.md) command to publish the application package and using the `snow app release-directive set` command for creating custom release directives.
See [Publishing a Snowflake Native App to customers](../../../native-apps/publish-app.md) for more information.`

## Examples

* Set the default release directive for an application package:

  > ```snowcli
  > snow app release-directive set DEFAULT --version v1 --patch 1
  > ```
* Set a custom release directive for an application package:

  > ```snowcli
  > snow app release-directive set CUSTOM_DIR --version v1 --patch 1 --target-accounts ORG1.ACCT1,ORG2.ACCT2
  > ```
* Update an existing custom release directive for an application package:

  > ```snowcli
  > snow app release-directive set CUSTOM_DIR --version v1 --patch 2
  > ```
* Set the default release directive of a release channel when the application package has release channels enabled:

  > ```snowcli
  > snow app release-directive set DEFAULT --version v1 --patch 1 --channel ALPHA
  > ```

---
title: snow app release-directive unset
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/release-directive/unset.md
section: Snowflake CLI
---

# snow app release-directive unset

Unsets a release directive.

## Syntax

```console
snow app release-directive unset
  <directive>
  --channel <channel>
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`directive`
:   Name of the release directive.

## Options

`--channel TEXT`
:   Name of the release channel to use. Default: DEFAULT.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow app release-directive unset` command removes a custom release directive from an application package.
The specified release directive must already exist in the application package.

## Examples

* Remove the custom release directive `my_directive` from the application package:

  ```snowcli
  snow app release-directive unset my_directive
  ```

  When release channels are enabled, release directives become part of a release channel.
* Remove the custom `special_alpha_directive` release directive associated with release channel `ALPHA`:

  ```snowcli
  snow app release-directive unset special_alpha_directive --channel ALPHA
  ```

---
title: snow app run
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/run-app.md
section: Snowflake CLI
---

# snow app run

Creates an application package in your Snowflake account, uploads code files to its stage, then creates or upgrades an application object from the application package.

## Syntax

```console
snow app run
  --version <version>
  --patch <patch>
  --from-release-directive
  --channel <channel>
  --interactive / --no-interactive
  --force
  --validate / --no-validate
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--version TEXT`
:   The version defined in an existing application package from which you want to create an application object. The application object and application package names are determined from the project definition file.

`--patch INTEGER`
:   The patch number under the given `--version` defined in an existing application package that should be used to create an application object. The application object and application package names are determined from the project definition file.

`--from-release-directive`
:   Creates or upgrades an application object to the version and patch specified by the release directive applicable to your Snowflake account. The command fails if no release directive exists for your Snowflake account for a given application package, which is determined from the project definition file. Default: unset. Default: False.

`--channel TEXT`
:   The name of the release channel to use when creating or upgrading an application instance from a release directive. Requires the `--from-release-directive` flag to be set. If unset, the default channel will be used.

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--validate / --no-validate`
:   When enabled, this option triggers validation of a deployed Snowflake Native App’s setup script SQL. Default: True.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

This command relies on the resolved project definition to determine the stage to which to upload files, which files to upload, and the name of the objects to create. For guidance on defaults, please refer to [About Snowflake Native App projects](../../native-apps/about-projects.md) and [snow init](../bootstrap-commands/init.md) usage notes. You can also change them to be according to your own preference, though it is your responsibility to check if there is any clash with existing objects in your account.

* Objects created by Snowflake CLI are tagged with a special comment `GENERATED_BY_SNOWCLI`.
* The role(s) used to create the application package and instance must have the proper account-level privileges to work with Snowflake Native Applications. See [Create and manage an application package](../../../native-apps/creating-app-package.md) and [Install and test an app locally](../../../native-apps/installing-testing-application.md) for more information.

By default, the `snow app run` command creates an application package in your Snowflake account, uploads code files to its stage, validates the setup script SQL, and then creates (or upgrades) a development-mode instance of that application. You should keep the following in mind when running the default command:

* All files specified under `nativeapp.project.artifacts` in the project definition file(s) are uploaded to the Snowflake stage. This artifact must include a `manifest.yml` file and its related setup script(s).
* All files specified under `nativeapp.project.artifacts` must have already been compiled and packaged separately, if needed, before calling `snow app run`. Snowflake CLI does not offer any feature to perform these intermediate tasks for you, so you have full control over your build process by executing it in your own scripts.
* Snowflake CLI uses default application package name, stage name, and application name when creating those objects.
* Subsequent runs of `snow app run` after the initial one compare the state of your uploaded files to the files in your local directory, and selectively upload only the modified files to save you time. If any files have changed, the application is upgraded based on the new contents of the stage.
* If the application package already exists and its distribution property is `INTERNAL`, the command checks if the package was created by the Snowflake CLI. If it was not, the command throws an error. If the distribution of the application package is `EXTERNAL`, no such check is performed.
* The command warns you if the application package you are working with has a different value for distribution than is set in your resolved project definition, but continues execution.
* The application instance is created or upgraded in [development mode](../../../native-apps/installing-testing-application.md). Specifically, it uses the [staged files](../../../native-apps/installing-testing-application.md).

If you specify a `--version`, `--patch` or `--from-release-directive` option, this command upgrades your existing application instance, or creates one if the application does not exist. It does not create an application package in this scenario.

* If Snowflake CLI is not able to update your application for any reason, such as trying to upgrade an application initially installed in loose files mode to use release directives instead, it attempts to drop the existing application and create a new one using the desired installation strategy. The command prompts you to confirm the drop before performing the action.
* If you do not want to interact with the command and instead force all actions, use the `--force` option to bypass all prompts, which proxies as a yes to all the inputs asking whether to proceed with destructive actions.
* Snowflake CLI tries to determine if you are running the commands in an interactive shell. If `--force` is not provided and you are executing commands in the interactive shell, it automatically chooses the interactive option for you.
* If you want to force Snowflake CLI to interact with you even if not in an interactive shell, use the `--interactive` option.

## Examples

These examples assume you have made the necessary changes to your code files and added them to your `snowflake.yml` or `snowflake.local.yml` files.

* If you want to create an application package and an application using staged files, you can execute:

  ```bash
  cd my_app_project
  my_app_project_build_script.sh
  snow app run --connection="dev"
  ```
* If you already have an application package with a version and a patch, want to create an application from this version and patch, and invoke the interactive mode, you can execute:

  ```snowcli
  snow app run --version V1 --patch 12 --interactive --connection="dev"
  ```

  Here, version `V1` and patch `12` are used as an example only.
* If you have an existing release directive set on an application package, want to create an application from it and bypass the interactive mode, you can execute:

  ```snowcli
  snow app run --from-release-directive --force --connection="dev"
  ```
* To create an application from the release directive of a non-default release channel, execute:

  ```snowcli
  snow app run --from-release-directive --channel ALPHA --connection="dev"
  ```
* This example shows how to pass in multiple environment variables using the `--env` option:

  ```snowcli
  snow app run --env source_folder="src/app" --env stage_name=mystage
  ```

---
title: snow app teardown
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/teardown-app.md
section: Snowflake CLI
---

# snow app teardown

Attempts to drop both the application object and application package as defined in the project definition file.

## Syntax

```console
snow app teardown
  --force
  --cascade / --no-cascade
  --interactive / --no-interactive
  --package-entity-id <package_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--cascade / --no-cascade`
:   Whether to drop all application objects owned by the application within the account. Default: false.

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

* When attempting to drop an application, the command checks if it was created by the Snowflake CLI. If it was not, the command prompts you if it should proceed. You can force the drop through the `--force` option.
* When attempting to drop an application package, if the distribution of the application package is `INTERNAL`, the command checks if the package was created by the Snowflake CLI. If it was not, the command prompts you if it should proceed. You can force the drop through the `--force` option.

  If the distribution of the application package is `EXTERNAL`, the command prompts you if it should succeed, regardless the process by which it was created.
* The command warns you if the application package you are working with has a different value for distribution than is set in your resolved project definition, but continues execution.
* The stage created inside the application package is also dropped. The command does not drop any side effect objects were created by your application or other scripts. You must manually drop them.
* This command succeeds even if one or both of these objects do not exist.

## Examples

If you want to attempt to drop objects specified in `snowflake.yml` or `snowflake.local.yml`, you can execute:

```snowcli
snow app teardown --connection="dev"
```

If you do not have an application instance but want to drop you application package specified in `snowflake.yml`, or vice versa, you can still execute the command above.

If you do not want to interact with the command and want to force dropping the objects, you can execute:

```snowcli
snow app teardown --force --connection="dev"
```

---
title: snow app validate
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/validate-app.md
section: Snowflake CLI
---

# snow app validate

Validates a deployed Snowflake Native App’s setup script.

## Syntax

```console
snow app validate
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

To avoid affecting files in the application’s source stage, this command uses the `scratch_stage` of the `application package` entity to create a blank stage, to upload project files to this stage, to run validation, and to drop the stage.

Validation returns errors and warnings separately. Errors cause the command to exit with an exit code of 1, but warnings do not.

## Examples

* The following example shows the output for a successful validation:

  ```snowcli
  snow app validate
  ```

  ```output
  Validating Snowflake Native App setup script.
    Checking if stage st_version_pkg_<user>.app_src.stage_snowflake_cli_scratch exists, or creating a new one if none exists.
    Performing a diff between the Snowflake stage and your local deploy_root ('<APP_PATH>/apps/st-version/output/deploy') directory.
    There are no new files that exist only on the stage.
  There are no existing files that have been modified, or their status is unknown.
  There are no existing files that are identical to the ones on the stage.
  New files only on your local:
  README.md
  manifest.yml
  setup_script.sql
  streamlit/environment.yml
  streamlit/ui.py
    Uploading diff-ed files from your local <APP_PATH>/apps/st-version/output/deploy directory to the Snowflake stage.
    Dropping stage st_version_pkg_<user>.app_src.stage_snowflake_cli_scratch.
  Snowflake Native App validation succeeded.
  ```
* The following example shows the output for a successful validation, using the JSON output format:

  ```snowcli
  snow app validate --format json
  ```

  ```output
  {
      "errors": [],
      "warnings": [],
      "status": "SUCCESS"
  }
  ```
* The following example shows the output when the validation encounters errors and warnings, using the JSON output format:

  ```snowcli
  snow app validate --format json
  ```

  ```output
  {
      "errors": [
          {
              "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/empty.sql': Empty SQL statement.",
              "cause": "Empty SQL statement.",
              "errorCode": "000900",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/empty.sql",
              "line": -1,
              "column": -1
          },
          {
              "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/second.sql': Unsupported feature 'CREATE VERSIONED SCHEMA without OR ALTER'.",
              "cause": "Unsupported feature 'CREATE VERSIONED SCHEMA without OR ALTER'.",
              "errorCode": "000002",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/second.sql",
              "line": -1,
              "column": -1
          },
          {
              "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql': File '/does-not-exist.sql' cannot be found in the same stage as the setup script is located.",
              "cause": "File '/does-not-exist.sql' cannot be found in the same stage as the setup script is located.",
              "errorCode": "093159",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
              "line": -1,
              "column": -1
          }
      ],
      "warnings": [
          {
              "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 11 at position 35: APPLICATION ROLE should be created with IF NOT EXISTS.",
              "cause": "APPLICATION ROLE should be created with IF NOT EXISTS.",
              "errorCode": "093352",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
              "line": 11,
              "column": 35
          },
          {
              "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 15 at position 13: CREATE Table statement in the setup script should have \"IF NOT EXISTS\", \"OR REPLACE\", or \"OR ALTER\".",
              "cause": "CREATE Table statement in the setup script should have \"IF NOT EXISTS\", \"OR REPLACE\", or \"OR ALTER\".",
              "errorCode": "093351",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
              "line": 15,
              "column": 13
          },
          {
              "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 15 at position 13: Table identifier 'MY_TABLE' should include its parent schema name.",
              "cause": "Table identifier 'MY_TABLE' should include its parent schema name.",
              "errorCode": "093353",
              "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
              "line": 15,
              "column": 13
          }
      ],
      "status": "FAIL"
  }
  ```

---
title: snow app version commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/version/overview.md
section: Snowflake CLI
---

# snow app version commands

Snowflake CLI supports the following commands for managing versioned Native Apps:

> * [snow app version create](app-version-create.md)
> * [snow app version drop](app-version-drop.md)
> * [snow app version list](app-version-list.md)

---
title: snow app version create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/version/app-version-create.md
section: Snowflake CLI
---

# snow app version create

Adds a new patch to the provided version defined in your application package. If the version does not exist, creates a version with patch 0.

## Syntax

```console
snow app version create
  <version>
  --patch <patch>
  --label <label>
  --skip-git-check
  --from-stage
  --interactive / --no-interactive
  --force
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`version`
:   Version to define in your application package. If the version already exists, an auto-incremented patch is added to the version instead. Defaults to the version specified in the `manifest.yml` file.

## Options

`--patch INTEGER`
:   The patch number you want to create for an existing version. Defaults to undefined if it is not set, which means the Snowflake CLI either uses the patch specified in the `manifest.yml` file or automatically generates a new patch number.

`--label TEXT`
:   A label for the version that is displayed to consumers. If unset, the version label specified in `manifest.yml` file is used.

`--skip-git-check`
:   When enabled, the Snowflake CLI skips checking if your project has any untracked or stages files in git. Default: unset. Default: False.

`--from-stage`
:   When enabled, the Snowflake CLI creates a version from the current application package stage without syncing to the stage first. Default: False.

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

This command creates an application package (if it does not exist) with a version and an optional patch.

* If you do not provide a version, the command uses the version specified in the `manifest.yml` file. If the version is not present in the `manifest.yml` file, the command throws an error.
* If you provide both the version argument and the `--patch` option, and the application package does not already exist, the command throws an error. You should only provide the version argument to create a new application package with the required version.
* If you provide both the version argument and the `--patch` option, and the version does not already exist, the command throws an error. You should only provide the version argument to create a new version with a predetermined patch 0.
* If you are working in a Git repository and execute this command, the command checks for local changes to your working copy. If it finds local changes, it prompts you to confirm whether it is safe to proceed. You can skip this check using `--skip-git-check` option.
* If the application package does not exist, a new one is created by the Snowflake CLI is tagged with a special comment `GENERATED_BY_SNOWCLI`. It also runs any post-deploy hooks and uploads code files to the stage.
* If the application package already exists and its distribution property is `INTERNAL`, the command checks if the package was created by the Snowflake CLI. If it was not, the command throws an error. If the distribution of the application package is `EXTERNAL`, no such check is performed.
* The command warns you if the application package you are working with has a different value for distribution than is set in your resolved project definition, but continues execution.
* If the version is referenced in a release directive for the application package, the command prompts you to confirm whether you want to create a patch on this version.
* If the version already exists and you do not provide a `--patch` option, the Native Apps Framework automatically increments the patch number for this existing version. Else, it creates a custom patch under the version provided by you.
* The `--label` option sets a label for the version or patch created with this command. If specified, this value overrides the label specified for the `version` defined in the application’s `manifest.yml` file.
* If you specify a named version, such as `snow app version create my_version`, the `version` field in the `manifest.yml` file is ignored.

## Examples

These examples assume you have made the necessary changes to your code files and added them to your `snowflake.yml` or `snowflake.local.yml` files.

If you want to create an application package and add a version **V1** to it, use the following command:

```snowcli
snow app version create V1 --connection="dev"
```

You can also use the command above to create a version **V1** on an existing application package.

If you want to add a patch to version **V1** using the auto-increment functionality and invoke the interactive mode, use the following command:

```snowcli
snow app version create V1 --interactive --connection="dev"
```

If you want to add a custom patch number to version `V1` and bypass the interactive mode, even if you are in an interactive shell, use the following command:

```snowcli
snow app version create V1 --patch 42 --force --connection="dev"
```

To create a new version from the current content of the stage without syncing files to the stage first, use the following command:

```snowcli
snow app version create V1 --from-stage --connection="dev"
```

---
title: snow app version drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/version/app-version-drop.md
section: Snowflake CLI
---

# snow app version drop

Drops a version defined in your application package. Versions can either be passed in as an argument to the command or read from the `manifest.yml` file. Dropping patches is not allowed.

## Syntax

```console
snow app version drop
  <version>
  --interactive / --no-interactive
  --force
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`version`
:   Version defined in an application package that you want to drop. Defaults to the version specified in the `manifest.yml` file.

## Options

`--interactive / --no-interactive`
:   When enabled, this option displays prompts even if the standard input and output are not terminal devices. Defaults to True in an interactive shell environment, and False otherwise.

`--force`
:   When enabled, this option causes the command to implicitly approve any prompts that arise. You should enable this option if interactive mode is not specified and if you want perform potentially destructive actions. Defaults to unset. Default: False.

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

* The command warns you if the application package you are working with has a different value for distribution than is set in your resolved project definition, but continues execution.
* If you do not provide a version, the command uses the version specified in the `manifest.yml` file. If the version is not present in the `manifest.yml` file, the command throws an error.
* If you want to drop a version that is referenced by a release directive, you must first set that release directive to a different version and then run this command.
* Because this action is destructive, the command prompts you to confirm dropping the version before it proceeds. Use `--force` option to bypass the prompt and drop the version.

## Examples

These examples assume you have valid `snowflake.yml` or `snowflake.local.yml` project definition file(s).

If you want to drop an existing version **V1** from your application package, use the following command:

```snowcli
snow app version drop V1 --connection="dev"
```

If you want to drop the version and invoke the interactive mode, use the following command:

```snowcli
snow app version drop V1 --interactive --connection="dev"
```

If you want to drop the version and bypass the interactive mode even if you are in an interactive shell, use the following command:

```snowcli
snow app version drop V1 --force --connection="dev"
```

---
title: snow app version list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/native-apps-commands/version/app-version-list.md
section: Snowflake CLI
---

# snow app version list

Lists all versions defined in an application package.

## Syntax

```console
snow app version list
  --package-entity-id <package_entity_id>
  --app-entity-id <app_entity_id>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--package-entity-id TEXT`
:   The ID of the package entity on which to operate when the definition_version is 2 or higher.

`--app-entity-id TEXT`
:   The ID of the application entity on which to operate when the definition_version is 2 or higher.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> This command does not accept a role or warehouse overrides to your `config.toml` file. Please add them to the native app definition in the `snowflake.yml` or `snowflake.local.yml` instead.

## Examples

This example assumes you have valid `snowflake.yml` or `snowflake.local.yml` project definition file(s).

If you want to list all existing versions of an application package specified in your resolved project definition, use the following command:

```snowcli
snow app version list --connection="dev" --format JSON
```

This command displays the results in JSON format instead of the default TABLE format.

---
title: snow auth oidc commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/auth-commands/overview.md
section: Snowflake CLI
---

# snow auth oidc commands

The `snow auth oidc` commands enable secure, password-less authentication to Snowflake. It leverages OpenID Connect (OIDC) tokens from CI/CD environments like GitHub Actions. This feature supports workload identity federation (WIF), enabling automated systems to access Snowflake without static credentials, which aligns with security best practices.

The following Snowflake CLI `snow auth oidc` commands let you manage authentication for your Snowflake projects:

* [snow auth oidc read-token](read-token.md)

Note the following:

* The `snow auth oidc` commands are currently limited to GitHub Actions as the provider.
* The OIDC token is only available when running inside a supported CI/CD environment, such as a GitHub Actions runner.
* Short-lived OIDC tokens are detected dynamically; Snowflake CLI does not store any OIDC tokens.

---
title: snow auth oidc read-token
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/auth-commands/read-token.md
section: Snowflake CLI
---

# snow auth oidc read-token

Reads OIDC token based on the specified type. Use ‘auto’ to auto-detect available providers.

## Syntax

```console
snow auth oidc read-token
  --type <_type>
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--type [auto|github]`
:   Type of OIDC provider to use. Default: auto.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow auth read-token` command displays the OIDC token, which can be used for authentication in Snowflake operations. This command is primarily for retrieving the authentication token and must run within the supported CI/CD runner.

## Examples

* Display the OIDC token in the current CI/CD environment:

  ```snowcli
  snow auth oidc read-token --type github
  ```

---
title: snow bootstrap commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/bootstrap-commands/overview.md
section: Snowflake CLI
---

# snow bootstrap commands

The Snowflake CLI bootstrap commands provide developers the ability to instantiate projects from templates.

* [snow init](init.md)

---
title: snow connection add
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/add-connection.md
section: Snowflake CLI
---

# snow connection add

Adds a connection to configuration file.

## Syntax

```console
snow connection add
  --connection-name <connection_name>
  --account <account>
  --user <user>
  --password <password>
  --role <role>
  --warehouse <warehouse>
  --database <database>
  --schema <schema>
  --host <host>
  --port <port>
  --region <region>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key <private_key_file>
  --token-file-path <token_file_path>
  --default
  --no-interactive
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--connection-name, -n TEXT`
:   Name of the new connection.

`-a, --account, --accountname TEXT`
:   Account name to use when authenticating with Snowflake.

`-u, --user, --username TEXT`
:   Username to connect to Snowflake.

`-p, --password TEXT`
:   Snowflake password.

`-r, --role, --rolename TEXT`
:   Role to use on Snowflake.

`-w, --warehouse TEXT`
:   Warehouse to use on Snowflake.

`-d, --database, --dbname TEXT`
:   Database to use on Snowflake.

`-s, --schema, --schemaname TEXT`
:   Schema to use on Snowflake.

`-h, --host TEXT`
:   Host name the connection attempts to connect to Snowflake.

`-P, --port INTEGER`
:   Port to communicate with on the host.

`--region, -R TEXT`
:   Region name if not the default Snowflake deployment.

`-A, --authenticator TEXT`
:   Chosen authenticator, if other than password-based.

`-W, --workload-identity-provider TEXT`
:   Workload identity provider type.

`--private-key, -k, --private-key-file, --private-key-path TEXT`
:   Path to file containing private key.

`-t, --token-file-path TEXT`
:   Path to file with an OAuth token that should be used when connecting to Snowflake.

`--default`
:   If provided the connection will be configured as default connection. Default: False.

`--no-interactive`
:   Disable prompting. Default: False.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow connection add` command adds the connection to your default `config.toml` file.
For more information, see [Configuring Snowflake CLI and connecting to Snowflake](../../connecting/connect.md).

## Examples

To add a connection, run the following:

```snowcli
snow connection add
Enter connection name: <connection_name>
Enter account: <account>
Enter user: <user-name>
Enter password: <password>
Enter role: <role-name>
Enter warehouse: <warehouse-name>
Enter database: <database-name>
Enter schema: <schema-name>
Enter host: <host-name>
Enter port: <port-number>
Enter region: <region-name>
Enter authenticator: <authentication-method>
Enter private key file: <path-to-private-key-file>
Enter token file path: <path-to-mfa-token>
Do you want to configure key pair authentication? [y/N]: y
Key length [2048]: <key-length>
Output path [~/.ssh]: <path-to-output-file>
Private key passphrase: <key-description>
Wrote new connection <connection-name> to config.toml
```

```output
Wrote new connection my_conn to <user-home>/.snowflake/config.toml
```

---
title: snow connection commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/overview.md
section: Snowflake CLI
---

# snow connection commands

Snowflake CLI supports the following commands for managing Snowflake connections:

> * [snow connection add](add-connection.md)
> * [snow connection generate-jwt](generate-jwt.md)
> * [snow connection list](list-connections.md)
> * [snow connection remove](remove-connection.md)
> * [snow connection set-default](set-default-connection.md)
> * [snow connection test](test-connection.md)

---
title: snow connection generate-jwt
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/generate-jwt.md
section: Snowflake CLI
---

# snow connection generate-jwt

Generate a JWT token, which will be printed out and displayed..

## Syntax

```console
snow connection generate-jwt
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow connection generate-jwt` command generates a JWT (JSON Web Token) that you can use for key-pair authentication when connecting to Snowflake. You can use the token to connect to Snowflake from any Snowflake application, such as the SQL REST API or the Snowflake REST APIs.

## Examples

This example generates a token for account `TEST` and user `JDOE`, using the private key from `rsa_key.p8`:

```snowcli
snow connection generate-jwt --user JDOE --account TEST --private-key-file=rsa_key.p8
```

The command prompts you for a private key passphrase to complete the connection. You can avoid the prompt by providing the passphrase in the `PRIVATE_KEY_PASSPHRASE` environment variable.

---
title: snow connection list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/list-connections.md
section: Snowflake CLI
---

# snow connection list

Lists configured connections.

## Syntax

```console
snow connection list
  --all
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--all, -a`
:   Include connections from all sources (environment variables, SnowSQL config). By default, only shows connections from configuration files. Default: False.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow connection list` command lists the connections in your default `config.toml` file.
For more information, see [Configuring Snowflake CLI and connecting to Snowflake](../../connecting/connect.md).

## Examples

```snowcli
snow connection list
```

```output
+--------------------------------------------------------------------------------------------------------------------------------+
| connection_name | parameters                                                                                                   |
|-----------------+--------------------------------------------------------------------------------------------------------------|
| my-prod         | {'account': 'po52878', 'user': 'JDOE', 'password': '****', 'role': 'integration_tests', 'database':          |
|                 | 'SNOWFLAKE'}                                                                                                 |
|-----------------+--------------------------------------------------------------------------------------------------------------|
| my-test         | {'account': 'po52878', 'user': 'SSMITH', 'password': '****', 'role': 'integration_tests', 'database':        |
|                 | 'SNOWFLAKE'}                                                                                                 |
+--------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow connection remove
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/remove-connection.md
section: Snowflake CLI
---

# snow connection remove

Removes a connection from configuration file.

## Syntax

```console
snow connection remove
  <connection_name>
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`connection_name`
:   Name of the connection to remove.

## Options

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

To remove a connection, execute a `snow connection remove` command similar to the following:

```snowcli
snow connection remove bad_connection
```

```output
Removed connection bad_connection from /Users/jdoe/.snowflake/config.toml.
```

---
title: snow connection set-default
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/set-default-connection.md
section: Snowflake CLI
---

# snow connection set-default

Changes default connection to provided value.

## Syntax

```console
snow connection set-default
  <name>
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Name of the connection, as defined in your `config.toml` file.

## Options

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

This command lets you change the default connection from the command line instead of changing the value of `default_connection_name` in the `config.toml` file each time. Using this command can simplify changing between multiple connections.

## Examples

```bash
snow connection set-default "my_test_connection"
```

```output
Default connection set to: my_test_connection
```

---
title: snow connection test
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/connection-commands/test-connection.md
section: Snowflake CLI
---

# snow connection test

Tests the connection to Snowflake.

## Syntax

```console
snow connection test
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow connection test` command test the connection in your default `config.toml` file.
For more information, see [Configuring Snowflake CLI and connecting to Snowflake](../../connecting/connect.md).

## Examples

To test particular connection you can run the following command:

```snowcli
$ snow connection test --connection conn2
```

```output
+---------------------------------+
| key             | value         |
|-----------------+---------------|
| Connection name | conn2         |
| Status          | OK            |
| Account         | foo           |
| User            | jdoe          |
| Role            | ACCOUNTADMIN  |
| Database        | not set       |
| Warehouse       | XSMALL        |
+---------------------------------+
```

---
title: snow cortex commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/overview.md
section: Snowflake CLI
---

# snow cortex commands

Snowflake CLI provides the following commands to access [Snowflake Cortex](../../../../user-guide/snowflake-cortex/aisql.md) features:

* [snow cortex complete](complete.md)
* [snow cortex extract-answer](extract-answer.md)
* [snow cortex sentiment](sentiment.md)
* [snow cortex summarize](summarize.md)
* [snow cortex translate](translate.md)

---
title: snow cortex complete
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/complete.md
section: Snowflake CLI
---

# snow cortex complete

Given a prompt, the command generates a response using your choice of language model. In the simplest use case, the prompt is a single string. You may also provide a JSON file with conversation history including multiple prompts and responses for interactive chat-style usage.

## Syntax

```console
snow cortex complete
  <text>
  --model <model>
  --backend <backend>
  --file <file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`text`
:   Prompt to be used to generate a completion. Cannot be combined with –file option.

## Options

`--model TEXT`
:   String specifying the model to be used. Default: llama3.1-70b.

`--backend [sql|rest]`
:   String specifying whether to use sql or rest backend. Default: rest.

`--file FILE`
:   JSON file containing conversation history to be used to generate a completion. Cannot be combined with TEXT argument.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

In the simplest use case, the prompt is a single string.
You can also provide a JSON file with conversation history, including multiple prompts and responses, for interactive chat-style conversation.

## Examples

* Ask a question using the default model.

  ```snowcli
  snow cortex complete "Is 5 more than 4? Please answer using one word without a period." -c snowhouse
  ```

  ```output
  Yes
  ```
* Ask a question using a specified model.

  ```snowcli
  snow cortex complete "Is 5 more than 4? Please answer using one word without a period." -c snowhouse --model deepseek-r1
  ```

  ```output
  Yes
  ```

---
title: snow cortex extract-answer
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/extract-answer.md
section: Snowflake CLI
---

# snow cortex extract-answer

Extracts an answer to a given question from a text document. The document may be a plain-English document or a string representation of a semi-structured (JSON) data object.

## Syntax

```console
snow cortex extract-answer
  <question>
  <source_document_text>
  --file <file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`question`
:   String containing the question to be answered.

`source_document_text`
:   String containing the plain-text or JSON document that contains the answer to the question. Cannot be combined with –file option.

## Options

`--file FILE`
:   File containing the plain-text or JSON document that contains the answer to the question. Cannot be combined with SOURCE_DOCUMENT_TEXT argument.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The document can be a plain-English document or a string representation of a semi-structured (JSON) data object.

## Examples

* Extract an answer to a question from text provided as a command-line argument.

  ```snowcli
  snow cortex extract-answer "what is snowflake?" "snowflake is a company" -c snowhouse
  ```

  ```output
  a company
  ```

---
title: snow cortex sentiment
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/sentiment.md
section: Snowflake CLI
---

# snow cortex sentiment

Returns sentiment as a score between -1 to 1 (with -1 being the most negative and 1 the most positive, with values around 0 neutral) for the given English-language input text.

## Syntax

```console
snow cortex sentiment
  <text>
  --file <file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`text`
:   String containing the text for which a sentiment score should be calculated. Cannot be combined with –file option.

## Options

`--file FILE`
:   File containing the text for which a sentiment score should be calculated. Cannot be combined with TEXT argument.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow cortex sentiment` command returns a sentiment for input text as a score between -1 to 1, where:

* -1 is the most negative.
* 0 is neutral.
* +1 is the most positive.

Currently, this command only supports English language text.

## Examples

The following example returns the sentiment score of “Mary had a little lamb”, which shows a slightly positive sentiment.

```snowcli
snow cortex sentiment "Mary had a little Lamb" -c "snowhouse"
```

```output
0.21522656
```

---
title: snow cortex summarize
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/summarize.md
section: Snowflake CLI
---

# snow cortex summarize

Summarizes the given English-language input text.

## Syntax

```console
snow cortex summarize
  <text>
  --file <file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`text`
:   String containing the English text from which a summary should be generated. Cannot be combined with –file option.

## Options

`--file FILE`
:   File containing the English text from which a summary should be generated. Cannot be combined with TEXT argument.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

* Summarize text supplied on the command line.

  ```snowcli
  snow cortex summarize "John has a car. John's car is blue. John's car is old and John is thinking about buying a new car. There are a lot of cars to choose from and John cannot sleep because it's an important decision for John."
  ```

  ```output
  John has an old blue car and is considering buying a new one due to the many options available, causing him sleepless nights.
  ```
* Summarize text stored in a file.

  For this example, assume that the file `about_cortex.txt` contains the following content:

  ```output
  Snowflake Cortex gives you instant access to industry-leading large language models (LLMs) trained by researchers at companies like Mistral, Reka, Meta, and Google, including Snowflake Arctic, an open enterprise-grade model developed by Snowflake.

  Since these LLMs are fully hosted and managed by Snowflake, using them requires no setup. Your data stays within Snowflake, giving you the performance, scalability, and governance you expect.

  Snowflake Cortex features are provided as SQL functions and are also available in Python. The available functions are summarized below.

  COMPLETE: Given a prompt, returns a response that completes the prompt. This function accepts either a single prompt or a conversation with multiple prompts and responses.
  EMBED_TEXT_768: Given a piece of text, returns a vector embedding that represents that text.
  EXTRACT_ANSWER: Given a question and unstructured data, returns the answer to the question if it can be found in the data.
  SENTIMENT: Returns a sentiment score, from -1 to 1, representing the detected positive or negative sentiment of the given text.
  SUMMARIZE: Returns a summary of the given text.
  TRANSLATE: Translates given text from any supported language to any other.
  ```

  Execute the snow cortex command by passing in the filename, as shown:

  ```bash
  snow cortex summarize --file about_cortex.txt
  ```

  ```output
  Snowflake Cortex offers instant access to industry-leading language models, including Snowflake Arctic, with SQL functions for completing prompts (COMPLETE), text embedding (EMBED_TEXT_768), extracting answers (EXTRACT_ANSWER), sentiment analysis (SENTIMENT), summarizing text (SUMMARIZE), and translating text (TRANSLATE).
  ```

---
title: snow cortex translate
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/cortex-commands/translate.md
section: Snowflake CLI
---

# snow cortex translate

Translates text from the indicated or detected source language to a target language.

## Syntax

```console
snow cortex translate
  <text>
  --from <from_language>
  --to <to_language>
  --file <file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`text`
:   String containing the text to be translated. Cannot be combined with –file option.

## Options

`--from TEXT`
:   String specifying the language code for the language the text is currently in. See Snowflake Cortex documentation for a list of supported language codes.

`--to TEXT`
:   String specifying the language code into which the text should be translated. See Snowflake Cortex documentation for a list of supported language codes.

`--file FILE`
:   File containing the text to be translated. Cannot be combined with TEXT argument.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* By default, the command assumes the text to translate is in the detected source language.
* When using the `--file` option, the file text must use the English language.

## Examples

* Translate the word “herb” in the source language (English, in this case) into Polish.

  ```snowcli
  snow cortex translate herb --to pl
  ```

  ```output
  ziołowy
  ```
* Translate the Polish word “herb” into English.

  ```snowcli
  snow cortex translate herb --from pl --to en
  ```

  ```output
  coat of arms
  ```

---
title: snow dbt commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/overview.md
section: Snowflake CLI
---

# snow dbt commands

Snowflake CLI supports the following commands for managing Snowflake dbt project objects:

* [snow dbt describe](describe.md)
* [snow dbt deploy](deploy.md)
* [snow dbt drop](drop.md)
* [snow dbt execute commands](execute/overview.md)
* [snow dbt list](list.md)

---
title: snow dbt deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/deploy.md
section: Snowflake CLI
---

# snow dbt deploy

Upload local dbt project files and create or update a DBT project object on Snowflake. Examples: snow dbt deploy PROJECT snow dbt deploy PROJECT –source=/Users/jdoe/project –force

## Syntax

```console
snow dbt deploy
  <name>
  --source <source>
  --profiles-dir <profiles_dir>
  --force / --no-force
  --default-target <default_target>
  --unset-default-target
  --external-access-integration <external_access_integrations>
  --install-local-deps
  --dbt-version <dbt_version>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the DBT Project; for example: my_pipeline.

## Options

`--source TEXT`
:   Path to directory containing dbt files to deploy. Defaults to current working directory.

`--profiles-dir TEXT`
:   Path to directory containing profiles.yml. Defaults to directory provided in –source or current working directory.

`--force / --no-force`
:   Overwrites conflicting files in the project, if any. Default: False.

`--default-target TEXT`
:   Default target for the dbt project. Mutually exclusive with –unset-default-target.

`--unset-default-target`
:   Unset the default target for the dbt project. Mutually exclusive with –default-target. Default: False.

`--external-access-integration TEXT`
:   External access integration to be used by the dbt object.

`--install-local-deps`
:   Installs local dependencies from project that don’t require external access. Default: False.

`--dbt-version TEXT`
:   dbt Core version to use for the project, for example ‘1.10.15’. Full list of supported versions can be found at <https://docs.snowflake.com/en/user-guide/data-engineering/dbt-projects-on-snowflake-dbt-core-versions>.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dbt deploy` command uploads local files to a temporary stage and either creates a new object or updates an existing dbt project object by making a new version. A valid dbt project object must contain two files:

* `dbt_project.yml`: A standard dbt configuration file that specifies the profile to use.
* `profiles.yml`: A dbt connection profile definition referenced in `dbt_project.yml`. `profiles.yaml` must define the database, role, schema, and type.

  ```yaml
  <profile_name>:
  target: dev
  outputs:
    dev:
      database: <database_name>
      role: <role_name>
      schema: <schema_name>
      warehouse: <warehouse_name>
      type: snowflake
  ```

> **Note:**
>
> To use the `--dbt-version` option in Snowflake CLI version 3.15.0, you must enable the `SNOWFLAKE_CLI_FEATURES_ENABLE_DBT_VERSION` feature flag, using either of the following methods:
>
> * Set the `SNOWFLAKE_CLI_FEATURES_ENABLE_DBT_VERSION` environment variable to `true` before running the command.
> * Set the `enable_dbt_version` configuration option to `true` in the `config.toml` file, as shown in the following example:
>
>   ```toml
>   [features]
>   enable_dbt_version = true
>   ```

## Examples

* Deploy a dbt project named `jaffle_shop`:

  ```snowcli
  snow dbt deploy jaffle_shop
  ```
* Deploy a project named `jaffle_shop` from a specified directory and overwrite the dbt project object if it already exists:

  ```snowcli
  snow dbt deploy jaffle_shop --force --source /path/to/dbt/directory --profiles-dir ~/.dbt/
  ```
* Deploy a project named `jaffle_shop` from a specified directory using a custom profiles directory and enabling [external access integrations](../../../external-network-access/creating-using-external-network-access.md):

  ```snowcli
  snow dbt deploy jaffle_shop --force --source /path/to/dbt/directory
  --profiles-dir ~/.dbt/ --default-target dev
  --external-access-integration dbthub-integration
  --external-access-integration github-integration
  ```

---
title: snow dbt describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/describe.md
section: Snowflake CLI
---

# snow dbt describe

Provides description of dbt project.

## Syntax

```console
snow dbt describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the DBT Project; for example: my_pipeline.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dbt describe` command describes a dbt project object on Snowflake. It returns information such as the project name, owner, dbt version and dbt snowflake version, default versions with their names and aliases, and [external access integrations](../../../external-network-access/creating-using-external-network-access.md).

For more information, see [DESCRIBE DBT PROJECT](../../../../sql-reference/sql/desc-dbt-project.md).

## Examples

The following example describes the dbt project object named `my_dbt_project` in Snowflake:

```console
snow dbt describe my_dbt_project
```

---
title: snow dbt drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/drop.md
section: Snowflake CLI
---

# snow dbt drop

Drops dbt project with given name.

## Syntax

```console
snow dbt drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the DBT Project; for example: my_pipeline.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dbt drop` command deletes a dbt project object in Snowflake.

## Examples

The following example deletes the dbt project object named `my_dbt_project` in Snowflake:

> ```console
> snow dbt drop my_dbt_project
> ```

---
title: snow dbt execute commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/execute/overview.md
section: Snowflake CLI
---

# snow dbt execute commands

The `snow dbt execute` command executes one of the following [dbt commands](https://docs.getdbt.com/reference/dbt-commands) on Snowflake:

* [build](https://docs.getdbt.com/reference/commands/build)
* [compile](https://docs.getdbt.com/reference/commands/compile)
* [list](https://docs.getdbt.com/reference/commands/list)
* [parse](https://docs.getdbt.com/reference/commands/parse)
* [retry](https://docs.getdbt.com/reference/commands/retry)
* [run](https://docs.getdbt.com/reference/commands/run)
* [run-operation](https://docs.getdbt.com/reference/commands/run-operation)
* [seed](https://docs.getdbt.com/reference/commands/seed)
* [show](https://docs.getdbt.com/reference/commands/show)
* [snapshot](https://docs.getdbt.com/reference/commands/snapshot)
* [test](https://docs.getdbt.com/reference/commands/test)

For more information about using dbt commands, see the [dbt Command reference](https://docs.getdbt.com/reference/dbt-commands).

> **Note:**
>
> To use the `--dbt-version` option in Snowflake CLI version 3.15.0, you must enable the `SNOWFLAKE_CLI_FEATURES_ENABLE_DBT_VERSION` feature flag, using either of the following methods:
>
> * Set the `SNOWFLAKE_CLI_FEATURES_ENABLE_DBT_VERSION` environment variable to `true` before running the command.
> * Set the `enable_dbt_version` configuration option to `true` in the `config.toml` file, as shown in the following example:
>
>   ```toml
>   [features]
>   enable_dbt_version = true
>   ```

---
title: snow dbt list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dbt-commands/list.md
section: Snowflake CLI
---

# snow dbt list

Lists all available dbt projects.

## Syntax

```console
snow dbt list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all dbt projects that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dbt list` command lists all available dbt project objects on Snowflake.

## Examples

* List all available dbt project objects:

  ```snowcli
  snow dbt list
  ```
* List dbt project objects in the `product` database whose names begin with `JAFFLE`:

  ```snowcli
  snow dbt list --like JAFFLE% --in database product
  ```

---
title: snow dcm commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/overview.md
section: Snowflake CLI
---

# snow dcm commands

> **Note:**
>
> To use DCM commands, you must enable the `SNOWFLAKE_CLI_FEATURES_ENABLE_SNOWFLAKE_PROJECTS` feature flag, using either of the following methods:
>
> * Set the `SNOWFLAKE_CLI_FEATURES_ENABLE_SNOWFLAKE_PROJECTS` environment variable to `true` before running the command.
> * Set the `enable_snowflake_projects` configuration option to `true` in the `config.toml` file, as shown in the following example:
>
>   ```toml
>   [cli.features]
>   enable_snowflake_projects = true
>   ```

Snowflake CLI supports the following commands for managing Snowflake DCM project objects:

* [snow dcm create](create.md)
* [snow dcm deploy](deploy.md)
* [snow dcm describe](describe.md)
* [snow dcm drop](drop.md)
* [snow dcm drop-deployment](drop-deployment.md)
* [snow dcm list](list.md)
* [snow dcm list-deployments](list-deployments.md)
* [snow dcm plan](plan.md)
* [snow dcm preview](preview.md)
* [snow dcm refresh](refresh.md)
* [snow dcm test](test.md)

## Project configuration (manifest.yml)

DCM projects use a `manifest.yml` file to define project configuration. For more details, see [DCM Projects files and templates](../../../../user-guide/dcm-projects/dcm-projects-files.md).

### Project identifier resolution

Most DCM commands accept an optional project identifier argument and a `--target` option. The project name is resolved as follows:

1. If a project identifier is provided as an argument, it is used directly.
2. If `--target` is specified, the `project_name` from that target in `manifest.yml` is used.
3. If neither is provided, the `default_target` from `manifest.yml` is used.

**Examples:**

```snowcli
# Use default_target from manifest.yml
snow dcm deploy

# Use target from manifest.yml
snow dcm deploy --target DEV

# Explicit project name with fully qualified identifier
snow dcm deploy MY_DB.MY_SCHEMA.MY_PROJECT
```

The `--from` option specifies the directory containing the `manifest.yml` and project source files. If omitted, the current directory is used.

> **Note:**
>
> Project identifiers can be specified as a fully qualified name (`MY_DB.MY_SCHEMA.MY_PROJECT`) or as a simple name (`MY_PROJECT`). When using a simple name, the database and schema are derived from the active connection context. Using fully qualified names is recommended to avoid ambiguity.

---
title: snow dcm create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/create.md
section: Snowflake CLI
---

# snow dcm create

Creates a DCM Project in Snowflake.

## Syntax

```console
snow dcm create
  <identifier>
  --if-not-exists
  --from <from_location>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--if-not-exists`
:   Do nothing if the project already exists. Default: False.

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm create` command creates a DCM project object in Snowflake if one does not exist. The DCM project object is created in the current session’s database and schema, or in those specified with `snow dcm` command options.

## Examples

* Create a DCM project object in Snowflake where the project name is specified in the `default_target` property in the manifest:

  ```snowcli
  snow dcm create
  ```
* Create a DCM project object in Snowflake where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm create --target DEV
  ```
* Create a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm create MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Create a DCM project object in Snowflake only if it does not already exist:

  ```snowcli
  snow dcm create --if-not-exists
  ```

---
title: snow dcm deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/deploy.md
section: Snowflake CLI
---

# snow dcm deploy

Deploys local project changes to Snowflake by creating, altering, or dropping objects to match your definition files.

## Syntax

```console
snow dcm deploy
  <identifier>
  --from <from_location>
  --variable <variables>
  --alias <alias>
  --target <target>
  --save-output
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--variable, -D TEXT`
:   Variables for the execution context; for example: `-D "<key>=<value>"`.

`--alias TEXT`
:   Alias for the deployment.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--save-output`
:   Save command response and artifacts to local ‘out/’ directory. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm deploy` command deploys local project changes to Snowflake by creating, altering, or dropping objects to match definition files.

When you deploy a DCM project, the following actions are performed:

* Objects that are defined but don’t exist yet are created.
* Objects that already exist but differ from the current definition are altered.
* Objects that already exist and there are no differences between their state and definition stay unchanged.
* Objects that already exist but are no longer defined are dropped.
* Objects that existed before, and their definitions were recently added into DCM project, are added to objects managed by this DCM project.

> **Note:**
>
> This command automatically uploads local source SQL files to a temporary stage in Snowflake so their content impacts the final result of the operation.

Use the `--save-output` option to save the deployment results to a local `out/deploy.json` file.

For more information about the deployment process, see [Deploy a DCM project](../../../../user-guide/dcm-projects/dcm-projects-use.md).

## Examples

* Deploy a DCM project object with the default options, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm deploy
  ```
* Deploy a DCM project object where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm deploy --target DEV
  ```
* Deploy a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm deploy MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Deploy a DCM project project where the project name is specified in the `DEV` target in the manifest, specify the value for the `db_name` variable, and set the deployment alias to `v3`:

  ```snowcli
  snow dcm deploy --target DEV --variable "db_name='jdoe'" --alias 'v3'
  ```
* Deploy a DCM project object from a specific directory and save output:

  ```snowcli
  snow dcm deploy --from /path/to/project --save-output
  ```

---
title: snow dcm describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/describe.md
section: Snowflake CLI
---

# snow dcm describe

Provides description of a DCM Project.

## Syntax

```console
snow dcm describe
  <identifier>
  --from <from_location>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm describe` command describes a single DCM project.

## Examples

* Describe a DCM project object with the default options, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm describe
  ```
* Describe a DCM project project where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm describe --target DEV
  ```
* Describe a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm describe MY_DB.MY_SCHEMA.MY_PROJECT
  ```

---
title: snow dcm drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/drop.md
section: Snowflake CLI
---

# snow dcm drop

Drops a DCM Project. All the objects deployed and managed by this project won’t be dropped.

## Syntax

```console
snow dcm drop
  <identifier>
  --if-exists
  --from <from_location>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--if-exists`
:   Do nothing if the project does not exist. Default: False.

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm drop` command drops a DCM project object. This command deletes the DCM project object and its deployment history. Objects deployed by this project are not dropped along with it.

## Examples

* Drop a DCM project object, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm drop
  ```
* Drop a DCM project object where the name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm drop --target DEV
  ```
* Drop a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm drop MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Drop a DCM project object only if it exists:

  ```snowcli
  snow dcm drop --if-exists
  ```

---
title: snow dcm drop-deployment
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/drop-deployment.md
section: Snowflake CLI
---

# snow dcm drop-deployment

Drops a deployment from the DCM Project.

## Syntax

```console
snow dcm drop-deployment
  <identifier>
  --deployment <deployment>
  --if-exists
  --from <from_location>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--deployment TEXT`
:   Name or alias of the deployment to drop. For names containing ‘$’, use single quotes to prevent shell expansion (e.g., ‘DEPLOYMENT$1’). If both the deployment name and the alias match two different deployments, the deployment name match has higher precedence.

`--if-exists`
:   Do nothing if the deployment does not exist. Default: False.

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm drop-deployment` command drops a specified deployment of a DCM project.

## Examples

* Drop a deployment with alias `MY_DEPLOYMENT` from DCM project, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm drop-deployment --deployment MY_DEPLOYMENT
  ```
* Drop a deployment with alias `MY_DEPLOYMENT` from DCM project, where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm drop-deployment --target DEV --deployment MY_DEPLOYMENT
  ```
* Drop a deployment with alias `MY_DEPLOYMENT` from the `MY_PROJECT` DCM project object:

  ```snowcli
  snow dcm drop-deployment MY_DB.MY_SCHEMA.MY_PROJECT --deployment MY_DEPLOYMENT
  ```
* Drop a deployment named `MY_DEPLOYMENT` from DCM project if it exists (note: use single quotes to prevent shell expansion of `$`):

  ```snowcli
  snow dcm drop-deployment --deployment 'DEPLOYMENT$1' --if-exists
  ```

---
title: snow dcm list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/list.md
section: Snowflake CLI
---

# snow dcm list

Lists all available DCM Projects.

## Syntax

```console
snow dcm list
  --like <like>
  --in <scope>
  --in-account
  --terse
  --limit <limit>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all DCM Projects that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--terse`
:   Returns only a subset of output columns. Default: False.

`--limit INTEGER`
:   Limits the maximum number of rows returned.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm list` command lists all available DCM project objects.

## Examples

* List all available DCM project objects:

  ```snowcli
  snow dcm list
  ```
* List DCM project objects whose names match a pattern:

  ```snowcli
  snow dcm list --like "MY_PROJECT%"
  ```
* List DCM project objects in a specific database:

  ```snowcli
  snow dcm list --in database MY_DB
  ```

---
title: snow dcm list-deployments
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/list-deployments.md
section: Snowflake CLI
---

# snow dcm list-deployments

Lists deployments of given DCM Project.

## Syntax

```console
snow dcm list-deployments
  <identifier>
  --from <from_location>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm list-deployments` command lists all deployments of a given DCM project. Each deployment has a name (for example, `DEPLOYMENT$1`) and optionally an alias that was specified during deployment.

## Examples

* List all deployments for a DCM project object, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm list-deployments
  ```
* List deployments for a DCM project object, where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm list-deployments --target DEV
  ```
* List deployments of the `MY_PROJECT` DCM project object:

  ```snowcli
  snow dcm list-deployments MY_DB.MY_SCHEMA.MY_PROJECT
  ```

---
title: snow dcm plan
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/plan.md
section: Snowflake CLI
---

# snow dcm plan

Shows what objects would be created, altered, or dropped by the `deploy` command, without applying any changes.

## Syntax

```console
snow dcm plan
  <identifier>
  --from <from_location>
  --variable <variables>
  --target <target>
  --save-output
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--variable, -D TEXT`
:   Variables for the execution context; for example: `-D "<key>=<value>"`.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--save-output`
:   Save command response and artifacts to local ‘out/’ directory. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm plan` command validates a DCM project object and simulates what would happen if the `deploy` command were executed, printing the computed changeset as a result. No Snowflake objects are created, altered, or dropped when you run this command.

> **Note:**
>
> This command automatically uploads local source SQL files to a temporary stage in Snowflake so their content impacts the final result of the operation.

Use the `--save-output` option to save the plan results to a local `out/plan.json` file.

## Examples

* Plan a DCM project object with the default options, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm plan
  ```
* Plan a DCM project project where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm plan --target DEV
  ```
* Plan a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm plan MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Plan a DCM project project using local files, where the project name is specified in the `DEV` target in the manifest, set the value for the `db_name` variable, and set the deployment alias to `v3`:

  ```snowcli
  snow dcm plan --target DEV --variable db_name=jdoe --alias v3
  ```
* Plan a DCM project object and save the plan output locally:

  ```snowcli
  snow dcm plan --save-output
  ```

---
title: snow dcm preview
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/preview.md
section: Snowflake CLI
---

# snow dcm preview

Returns rows from any table, view, dynamic table.

## Syntax

```console
snow dcm preview
  <identifier>
  --object <object_identifier>
  --from <from_location>
  --variable <variables>
  --limit <limit>
  --target <target>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--object TEXT`
:   FQN of table/view/dynamic table to be previewed.

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--variable, -D TEXT`
:   Variables for the execution context; for example: `-D "<key>=<value>"`.

`--limit INTEGER`
:   The maximum number of rows to be returned.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm preview` command returns rows from any table, view, or dynamic table defined in your DCM project object. This command is useful for:

* Testing your definitions before deployment
* Verifying data after deployment
* Previewing views that reference templated table names

> **Note:**
>
> This command automatically uploads local source SQL files to a temporary stage in Snowflake so their content impacts the final result of the operation.

The `--object` option is required and specifies the fully qualified name of the DCM project object to preview. You can use the `--limit` option to restrict the number of rows returned.

## Examples

* Preview data from the table named `MY_DB.PUBLIC.MY_TABLE` for a DCM project object, where the project name is specified in the `default_target` property in the manifest:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_TABLE
  ```
* Preview data where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm preview --target DEV --object MY_DB.PUBLIC.MY_TABLE
  ```
* Preview data from a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm preview MY_DB.MY_SCHEMA.MY_PROJECT --object MY_DB.PUBLIC.MY_TABLE
  ```
* Preview with a row limit:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_VIEW --limit 10
  ```
* Preview with variable substitution:

  ```snowcli
  snow dcm preview --object MY_DB.PUBLIC.MY_VIEW -D "source_table='MY_DB.PUBLIC.SOURCE'"
  ```

---
title: snow dcm refresh
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/refresh.md
section: Snowflake CLI
---

# snow dcm refresh

Refreshes dynamic tables defined in DCM project. It applies only to deployed objects.

## Syntax

```console
snow dcm refresh
  <identifier>
  --from <from_location>
  --target <target>
  --save-output
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--save-output`
:   Save command response and artifacts to local ‘out/’ directory. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm refresh` command triggers an immediate refresh of all dynamic tables defined in a DCM project object. This is useful when you need to update data before the scheduled refresh time.

The command reports the status of each dynamic table:

* **Refreshed**: The table was refreshed with new data, showing inserted and deleted row counts
* **Up-to-date**: The table already contains the latest data

If no dynamic tables are defined in the project, the command reports that no dynamic tables were found.

Use the `--save-output` option to save the refresh results to a local `out/refresh.json` file.

## Examples

* Refresh all dynamic tables in a DCM project object, where the project name is specified in the target identified by the `default_target` property in the manifest:

  ```snowcli
  snow dcm refresh
  ```
* Refresh all dynamic tables in a DCM project project where the project name is specified in the `PROD` target in the manifest:

  ```snowcli
  snow dcm refresh --target PROD
  ```
* Refresh all dynamic tables in a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm refresh MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Refresh and save the results to the `out/` directory:

  ```snowcli
  snow dcm refresh --save-output
  ```

---
title: snow dcm test
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/dcm-commands/test.md
section: Snowflake CLI
---

# snow dcm test

Tests all expectations defined in DCM project. It applies only to deployed objects.

## Syntax

```console
snow dcm test
  <identifier>
  --from <from_location>
  --target <target>
  --save-output
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of DCM Project. Example: MY_DB.MY_SCHEMA.MY_PROJECT. Supports fully qualified (recommended) or simple names. If unqualified, it defaults to the connection’s database and schema. Optional if `--target` or `default_target` is defined in the manifest.

## Options

`--from PATH`
:   Local directory path containing DCM project files. Omit to use current directory.

`--target TEXT`
:   Target profile from `manifest.yml` to use. Uses `default_target` if not specified.

`--save-output`
:   Save command response and artifacts to local ‘out/’ directory. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow dcm test` command runs all expectations (data metric functions) defined in your DCM project object. Expectations are data quality rules that validate conditions on your tables.

The command returns:

* Exit code `0` if all tests pass
* Exit code `1` if any test fails

Use the `--save-output` option to save the test results to a local `out/test.json` file.

## Examples

* Test all expectations in a DCM project object, where the project name is specified in the `default_target` property in the manifest:

  ```snowcli
  snow dcm test
  ```
* Test all expectations in a DCM project project where the project name is specified in the `DEV` target in the manifest:

  ```snowcli
  snow dcm test --target DEV
  ```
* Test all expectations in a DCM project object with an explicit fully qualified name:

  ```snowcli
  snow dcm test MY_DB.MY_SCHEMA.MY_PROJECT
  ```
* Test and save the results to the `out/` directory:

  ```snowcli
  snow dcm test --save-output
  ```

---
title: snow git commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/overview.md
section: Snowflake CLI
---

# snow git commands

Snowflake CLI supports the following commands to support Git integration:

* [snow git copy](copy.md)
* [snow git describe](describe.md)
* [snow git drop](drop.md)
* [snow git execute](execute.md)
* [snow git fetch](fetch.md)
* [snow git list](list.md)
* [snow git list-branches](list-branches.md)
* [snow git list-files](list-files.md)
* [snow git list-tags](list-tags.md)
* [snow git setup](setup.md)

---
title: snow git copy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/copy.md
section: Snowflake CLI
---

# snow git copy

Copies all files from given state of repository to local directory or stage. If the source path ends with ‘/’, the command copies contents of specified directory. Otherwise, it creates a new directory or file in the destination directory.

## Syntax

```console
snow git copy
  <repository_path>
  <destination_path>
  --parallel <parallel>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_path`
:   Path to git repository stage with scope provided. Path to the repository root must end with ‘/’. For example: @my_repo/branches/main/.

`destination_path`
:   Target path for copy operation. Should be a path to a directory on remote stage or local file system.

## Options

`--parallel INTEGER`
:   Number of parallel threads to use when downloading files. Default: 4.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

* This example creates a `snowcli2.0/` directory on stage `@public` and copies all files from the commit marked with tag `v2.0.0` into that directory:

  ```snowcli
  snow git copy @my_snow_git/tags/v2.0.0/ @public/snowcli2.0/
  ```
* The following example creates a `plugin_tests` directory in the local file system and downloads the contents of the `tests/plugin` directory into it.-

  ```snowcli
  snow git copy @snowcli_git/branches/main/tests/plugin plugin_tests/
  ```

---
title: snow git describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/describe.md
section: Snowflake CLI
---

# snow git describe

Provides description of git repository.

## Syntax

```console
snow git describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow git drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/drop.md
section: Snowflake CLI
---

# snow git drop

Drops git repository with given name.

## Syntax

```console
snow git drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow git execute
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/execute.md
section: Snowflake CLI
---

# snow git execute

Execute immediate all files from the repository path. Files can be filtered with a glob-like pattern, e.g. `@my_repo/branches/main/*.sql`, `@my_repo/branches/main/dev/*`. Only files with `.sql` or `.py` extension will be executed.

## Syntax

```console
snow git execute
  <repository_path>
  --on-error <on_error>
  --variable <variables>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_path`
:   Path to git repository stage with scope provided. Path to the repository root must end with ‘/’. For example: @my_repo/branches/main/.

## Options

`--on-error [break|continue]`
:   What to do when an error occurs. Defaults to break. Default: break.

`--variable, -D TEXT`
:   Variables for the execution context; for example: `-D "<key>=<value>"`. For SQL files, variables are used to expand the template, and any unknown variable will cause an error (consider embedding quoting in the file).For Python files, variables are used to update the os.environ dictionary. Provided keys are capitalized to adhere to best practices. In case of SQL files string values must be quoted in `''` (consider embedding quoting in the file).

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> Snowflake CLI does not support executing Python files for Python versions 3.12 and above.

You can use glob-like patterns to filter the files, such as `@my_repo/branches/main/*.sql` and `@my_repo/branches/main/dev/*`. The command only executes files with a `.sql`
extension.

When using Jinja templates for the SQL files, you can pass template variables using `-D` or (`--variable)` option, such as `-D "<key>=<value>"`. You must enclose string values in single quotes (`''`).

## Examples

The following example shows how to execute SQL commands in all files within the `project` directory that match a regular expression.

```snowcli
snow git execute "@git_test/branches/main/projects/script*.sql"
```

```output
SUCCESS - git_test/branches/main/projects/script1.sql
SUCCESS - git_test/branches/main/projects/script2.sql
SUCCESS - git_test/branches/main/projects/script3.sql
+---------------------------------------------------------------+
| File                                        | Status  | Error |
|---------------------------------------------+---------+-------|
| git_test/branches/main/projects/script1.sql | SUCCESS | None  |
| git_test/branches/main/projects/script2.sql | SUCCESS | None  |
| git_test/branches/main/projects/script3.sql | SUCCESS | None  |
+---------------------------------------------------------------+
```

---
title: snow git fetch
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/fetch.md
section: Snowflake CLI
---

# snow git fetch

Fetch changes from origin to Snowflake repository.

## Syntax

```console
snow git fetch
  <repository_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example refreshes a repository named `my_snow_git`:

```snowcli
snow git fetch my_snow_git
```

```output
alter Git repository my_snow_git fetch
+-------------------------------------------------------------------+
| status                                                            |
|-------------------------------------------------------------------|
| Git Repository MY_SNOW_GIT is up to date. No change was fetched.. |
+-------------------------------------------------------------------+
```

---
title: snow git list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/list.md
section: Snowflake CLI
---

# snow git list

Lists all available git repositories.

## Syntax

```console
snow git list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all git repositories with name that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow git list-branches
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/list-branches.md
section: Snowflake CLI
---

# snow git list-branches

List all branches in the repository.

## Syntax

```console
snow git list-branches
  <repository_name>
  --like <like>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list-branches --like "%_test"` lists all branches that end with “_test”. Default: %%.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

For example, to list all of the branches in a repository named `my_snow_git`, enter the following command:

```snowcli
snow git list-branches my_snow_git
```

```output
show git branches in my_snow_git
+--------------------------------------------------------------------------------------------------------------------------------------------+
| name                                     | path                                     | checkouts | commit_hash                              |
|------------------------------------------+------------------------------------------+-----------+------------------------------------------|
| SNOW-1011750-service-create-options      | /branches/SNOW-1011750-service-create-op |           | 729855df0104c8d0ef1c7a3e8f79fe50c6c8d2fa |
|                                          | tions                                    |           |                                          |
| SNOW-1011775-containers-to-spcs-int-test | /branches/SNOW-1011775-containers-to-spc |           | e81b00de6b0eb73a99a7baaa39b0afa5ea1202d0 |
| s                                        | s-int-tests                              |           |                                          |
| SNOW-1105629-git-integration-tests       | /branches/SNOW-1105629-git-integration-t |           | 712b07b5e692624c34caabe07d64801615ce5f0f |
+--------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow git list-files
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/list-files.md
section: Snowflake CLI
---

# snow git list-files

List files from given state of git repository.

## Syntax

```console
snow git list-files
  <repository_path>
  --pattern <pattern>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_path`
:   Path to git repository stage with scope provided. Path to the repository root must end with ‘/’. For example: @my_repo/branches/main/.

## Options

`--pattern TEXT`
:   Regex pattern for filtering files by name. For example –pattern “.\*.txt” will filter only files with .txt extension.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example lists all of the files in the `tests/` directory of the `my_snow_git` repository marked with the `v2.0.0` tag:

```snowcli
snow git list-files @my_snow_git/tags/v2.0.0/tests --pattern ".*\.toml"
```

```output
ls @snowcli_git/tags/v2.0.0/tests pattern = '.*\.toml'
+-----------------------------------------------------------------------------------------------------------------------------------------+
| name                                            | size | md5  | sha1                                     | last_modified                |
|-------------------------------------------------+------+------+------------------------------------------+------------------------------|
| snowcli_git/tags/v2.0.0/tests/empty_config.toml | 0    | None | e69de29bb2d1d6434b8b29ae775ad8c2e48c5391 | Mon, 5 Feb 2024 13:16:25 GMT |
| snowcli_git/tags/v2.0.0/tests/test.toml         | 381  | None | 45f1c00f16eba1b7bc7b4ab2982afe95d0161e7f | Mon, 5 Feb 2024 13:16:25 GMT |
+-----------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow git list-tags
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/list-tags.md
section: Snowflake CLI
---

# snow git list-tags

List all tags in the repository.

## Syntax

```console
snow git list-tags
  <repository_name>
  --like <like>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list-tags --like "v2.0%"` lists all tags that start with “v2.0”. Default: %%.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

For example, to list all of the tags in a repository named `my_snow_git`, enter the following command:

```snowcli
snow git list-tags my_snow_git
```

```output
show git tags in my_snow_git
+--------------------------------------------------------------------------------------------------------------+
| name           | path                 | commit_hash                 | author                       | message |
|----------------+----------------------+-----------------------------+------------------------------+---------|
| v2.0.0rc3      | /tags/v2.0.0rc3      | 2b019d2841da823d8001f23c6f3 | None                         | None    |
|                |                      | 064e5899142a0               |                              |         |
| v2.1.0-rc0     | /tags/v2.1.0-rc0     | 829887b758b43b86959611dd612 | None                         | None    |
|                |                      | 7638da75cf871               |                              |         |
| v2.1.0-rc1     | /tags/v2.1.0-rc1     | b7efe1fe9c0925b95ba214e233b | None                         | None    |
|                |                      | 18924fa0404b3               |                              |         |
+--------------------------------------------------------------------------------------------------------------+
```

---
title: snow git setup
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/git-commands/setup.md
section: Snowflake CLI
---

# snow git setup

Sets up a git repository object.

## Syntax

```console
snow git setup
  <repository_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`repository_name`
:   Identifier of the git repository; for example: my_repo.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow git setup` command prompts for the following information:

* **URL**: address of repository to use for `git clone` operation.
* **Secret**: Snowflake secret containing authentication credentials. Not needed if origin repository does not require authentication for read-only operations, such as clone and fetch.
* **API integration**: object allowing Snowflake to interact with a Git repository.

If the role or user specified in your [connection](../../connecting/configure-connections.md) has not been granted, executing this command generates an error similar to the following:

```output
003001 (42501): 01b2f095-0508-c66d-0001-c1be009a66ee: SQL access control error: Insufficient privileges to operate on account XXX
```

In this situation, you should check your connection configuration or ask your account administrator to give you the necessary privileges or to create the integration for you. For more information, see [Setting up Snowflake to use Git](../../../git/git-setting-up.md).

## Examples

* Create a repository that requires a secret and credentials:

  ```snowcli
  $ snow git setup snowcli_git
  Origin url: https://github.com/snowflakedb/snowflake-cli.git
  Use secret for authentication? [y/N]: y
  Secret identifier (will be created if not exists) [snowcli_git_secret]: new_secret
  Secret 'new_secret' will be created
  username: john_doe
  password/token: ****
  API integration identifier (will be created if not exists) [snowcli_git_api_integration]:
  ```

  ```output
  Secret 'new_secret' successfully created.
  API integration snowcli_git_api_integration successfully created.
  +------------------------------------------------------+
  | status                                               |
  |------------------------------------------------------|
  | Git Repository SNOWCLI_GIT was successfully created. |
  +------------------------------------------------------+
  ```
* Create a repository without a secret and an existing API integration ID:

  ```snowcli
  $ snow git setup snowcli_git
  Origin url: https://github.com/snowflakedb/snowflake-cli.git
  Use secret for authentication [y/N]: n
  API integration identifier (will be created if not exists) [snowcli_git_api_integration]: EXISTING_INTEGRATION
  ```

  ```output
  Using existing API integration 'EXISTING_INTEGRATION'.
  +------------------------------------------------------+
  | status                                               |
  |------------------------------------------------------|
  | Git Repository SNOWCLI_GIT was successfully created. |
  +------------------------------------------------------+
  ```

---
title: snow helpers check-snowsql-env-vars
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/helpers-commands/check-snowsql-env-vars.md
section: Snowflake CLI
---

# snow helpers check-snowsql-env-vars

Check if there are any SnowSQL environment variables set.

## Syntax

```console
snow helpers check-snowsql-env-vars
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

This command helps you migrate from SnowSQL to Snowflake CLI by identifying your SnowSQL environment variables and mapping them to the corresponding Snowflake CLI environment variables. It displays information with suggested changes and links to documentation.

## Examples

This example assumes a user has defined the following environment variables:

* `SNOWSQL_USER`: Username for the connection.
* `SNOWSQL_ROLE`: Role for the connection.
* `SNOWSQL_UNUSED`: Variable not used in Snowflake CLI.

```snowcli
snow helpers check-snowsql-env-vars
```

```output
+--------------------------------------------------------------------------------------------------------------------------------------------+
| Found        | Suggested      | Additional info                                                                                            |
|--------------+----------------+------------------------------------------------------------------------------------------------------------|
| SNOWSQL_USER | SNOWFLAKE_USER | https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-connections#use-environme |
|              |                | nt-variables-for-snowflake-credentials                                                                     |
| SNOWSQL_ROLE | SNOWFLAKE_ROLE | https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-connections#use-environme |
|              |                | nt-variables-for-snowflake-credentials                                                                     |
+--------------------------------------------------------------------------------------------------------------------------------------------+

+----------------------------------------------+
| Found          | Suggested | Additional info |
|----------------+-----------+-----------------|
| SNOWSQL_UNUSED | n/a       | Unused variable |
+----------------------------------------------+

Found 3 SnowSQL environment variables, 2 with replacements, 1 unused.
```

---
title: snow helpers commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/helpers-commands/overview.md
section: Snowflake CLI
---

# snow helpers commands

Snowflake CLI supports the following workspace commands:

* [snow helpers check-snowsql-env-vars](check-snowsql-env-vars.md)
* [snow helpers import-snowsql-connections](import-snowsql-connections.md)
* [snow helpers v1-to-v2](v1-to-v2.md)

---
title: snow helpers import-snowsql-connections
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/helpers-commands/import-snowsql-connections.md
section: Snowflake CLI
---

# snow helpers import-snowsql-connections

Import your existing connections from your SnowSQL configuration.

## Syntax

```console
snow helpers import-snowsql-connections
  --snowsql-config-file <custom_snowsql_config_files>
  --default-connection-name <default_cli_connection_name>
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--snowsql-config-file FILE`
:   Specifies file paths to custom SnowSQL configuration. The option can be used multiple times to specify more than 1 file.

`--default-connection-name TEXT`
:   Specifies the name which will be given in Snowflake CLI to the default connection imported from SnowSQL. Default: default.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow helpers import-snowsql-connections` command imports existing connection definitions from SnowSQL into your `config.toml` configuration file.

By default, the command reads the SnowSQL configuration files in the order described in the [Configuring SnowSQL](../../../../user-guide/snowsql-config.md) topic.
If more than one of these configurations define the same connection, this command overwrites the previously imported connection definition with the most recent one.
To illustrate, assume the same `[connections.example]` connection is defined with different parameters in the following locations:

| Location of the configuration file | Connection definition |
| --- | --- |
| `/etc/snowsql.cnf` | ```yaml [connections]  [connections.example] username=user1 ``` |
| `<HOME_DIR>/.snowsql/config` | ```yaml [connections]  [connections.example] username=user2 password=<my-pwd> ``` |

After you run the command, your Snowflake CLI `config.toml` file contains the following `[connections.example]` definition (from the file with the higher precedence):

```yaml
[connections]

[connections.example]
username=user2
password=<my-pwd>
```

You can use the `--snowsql-config-file` option to override this default behavior and import from one or more specific SnowSQL configuration files instead.

The `snow helpers import-snowsql-connections` command also imports the default connection from SnowSQL, which is not a named connection.
It is defined directly in the `[connections]` section of the configuration file.
Because Snowflake CLI requires all connections to be named, the command defines a connection named `[default]`.
If you want to use another name for the default connection, you can specify it with the `--default-connection-name` option.

If a SnowSQL connection matches the name of an existing Snowflake CLI connection, the command prompt asks whether you want to overwrite the existing connection or skip importing that SnowSQL connection.

## Examples

The following example imports SnowSQL connections from the standard configuration file locations:

```snowcli
snow helpers import-snowsql-connections
```

As the command processes the SnowSQL configuration files, it shows the progress and prompts for confirmation when a connection with the same name is already defined in the Snowflake CLI `config.toml` file.

```output
SnowSQL config file [/etc/snowsql.cnf] does not exist. Skipping.
SnowSQL config file [/etc/snowflake/snowsql.cnf] does not exist. Skipping.
SnowSQL config file [/usr/local/etc/snowsql.cnf] does not exist. Skipping.
Trying to read connections from [/Users/<user>/.snowsql.cnf].
Reading SnowSQL's connection configuration [connections.connection1] from [/Users/<user>/.snowsql.cnf]
Trying to read connections from [/Users/<user>/.snowsql/config].
Reading SnowSQL's default connection configuration from [/Users/<user>/.snowsql/config]
Reading SnowSQL's connection configuration [connections.connection1] from [/Users/<user>/.snowsql/config]
Reading SnowSQL's connection configuration [connections.connection2] from [/Users/<user>/.snowsql/config]
Connection 'connection1' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: Y
Connection 'connection2' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: n
Connection 'default' already exists in Snowflake CLI, do you want to use SnowSQL definition and override existing connection in Snowflake CLI? [y/N]: n
Saving [connection1] connection in Snowflake CLI's config.
Connections successfully imported from SnowSQL to Snowflake CLI.
```

---
title: snow helpers v1-to-v2
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/helpers-commands/v1-to-v2.md
section: Snowflake CLI
---

# snow helpers v1-to-v2

Migrates the Snowpark, Streamlit, and Native App project definition files from V1 to V2.

## Syntax

```console
snow helpers v1-to-v2
  --accept-templates
  --migrate-local-overrides / --no-migrate-local-overrides
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`-t, --accept-templates`
:   Allows the migration of templates. Default: False.

`-l, --migrate-local-overrides / --no-migrate-local-overrides`
:   Merge values in snowflake.local.yml into the main project definition. The snowflake.local.yml file will not be migrated, instead its values will be reflected in the output snowflake.yml file. If unset and snowflake.local.yml is present, an error will be raised.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

Snowflake CLI 3.0 introduced support for V2 project definition files. If you have existing V1.x project definition files, you can use the `snow helpers v1-to-v2` command to convert the files to the V2 version. The command preserves the original version in a `snowflake_V1.yml` file.

You must run this command in the same directory as the `snowflake.yml` file.

> **Attention:**
>
> With the change in how Snowflake CLI 3.0 handles project definition templates, Snowflake cannot guarantee that project definition files using
> [templates](../../project-definitions/create-templates.md) will work correctly after conversion. By default, this command generates an error if you try convert a 1.x file that contains templates. You can force the command to convert these types of files by using the `--accept-templates` option. Then you
> must manually update any templates to their V2 equivalents.

## Examples

* Convert a version 1.x project definition file.

  ```snowcli
  cd <project-directory>
  snow helpers v1-to-v2
  ```

  ```output
  Project definition migrated to version 2.
  ```
* Convert a version 2 project definition file.

  ```snowcli
  cd <project-directory>
  snow helpers v1-to-v2
  ```

  ```output
  Project definition is already at version 2.
  ```
* Convert a version 1 project definition that contains templates without the `--accept-templates` option.

  ```snowcli
  cd <project-directory>
  snow helpers v1-to-v2
  ```

  ```output
  +- Error---------------------------------------------------------------------+
  | Project definition contains templates. They may not be migrated correctly, |
  | and require manual migration.You can try again with --accept-templates     |
  | option, to attempt automatic migration.                                    |
  +----------------------------------------------------------------------------+
  ```
* Convert a version 1 project definition with the `--accept-templates` option.

  ```snowcli
  cd <project-directory>
  snow helpers v1-to-v2
  ```

  ```output
  WARNING  snowflake.cli._plugins.workspace.commands:commands.py:60 Your V1 definition contains templates. We cannot guarantee the correctness of the migration.
  Project definition migrated to version 2
  ```

---
title: snow init
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/bootstrap-commands/init.md
section: Snowflake CLI
---

# snow init

Creates project directory from template.

## Syntax

```console
snow init
  <path>
  --template <template>
  --template-source <template_source>
  --variable <variables>
  --no-interactive
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`path`
:   Directory to be initialized with the project. This directory must not already exist.

## Options

`--template TEXT`
:   which template (subdirectory of –template-source) should be used. If not provided, whole source will be used as the template.

`--template-source TEXT`
:   local path to template directory or URL to git repository with templates. Default: <https://github.com/snowflakedb/snowflake-cli-templates>.

`--variable, -D TEXT`
:   String in `key=value` format. Provided variables will not be prompted for.

`--no-interactive`
:   Disable prompting. Default: False.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow init` command initializes a directory specified in the `<path>` parameter with a chosen template. It renders all files mentioned in the `files_to_render` list in the `template.yml`, resolving all variables enclosed in `<! … !>`. If a `template.yml` file is not present in the template’s root directory, the command finishes with an error. For information about creating project templates, see [Bootstrapping a project from a template](../../bootstrap-project/bootstrap.md).

By default, the command interactively prompts you for each parameter defined in the `template.yml` file. You can bypass the interactive prompts in the following ways:

* Use the `-D` option to specify the values for each parameter contained in the project template.
* Use the `--no-interactive` option to use default values, if defined, for each template parameter in the `template.yml` file.
* Use a combination of the `-D` and `--no-interactive` options to define values for some parameters and use the specified default values for the template.

  > **Note:**
  >
  > If you do not provide a value using the `-D` option that does not have a corresponding default value defined, the snow init command terminates with an error.

## Examples

* Bootstrap a Snowpark project that prompts for the parameters specified in the `example_snowpark` template contained in the [snowflake-cli-templates Git repository](https://github.com/snowflakedb/snowflake-cli-templates/).

  ```snowcli
  snow init new_snowpark_project --template example_snowpark

    Project identifier (used to determine artifacts stage path) [my_snowpark_project]:
    What stage should the procedures and functions be deployed to? [dev_deployment]: snowpark
  ```

  ```output
  Initialized the new project in new_snowpark_project
  ```
* Bootstrap a Streamlit project by using the `-D` option to provide the values for some of the parameters specified in the local `../local_templates/example_streamlit` template and prompt for others.

  ```snowcli
  snow init new_streamlit_project --template-source ../local_templates/example_streamlit -D query_warehouse=dev_wareshouse -D stage=testing

    Name of the streamlit app [streamlit_app]: My streamlit
  ```

  ```output
  Initialized the new project in new_streamlit_project
  ```

---
title: snow logs
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/logs-commands/logs.md
section: Snowflake CLI
---

# snow logs

Retrieves logs for a given object.

## Syntax

```console
snow logs
  <object_type>
  <object_name>
  --from <from_>
  --to <to>
  --refresh <refresh_time>
  --table <event_table>
  --log-level <log_level>
  --partial
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type`
:   Type of object. For example table, database, compute-pool.

`object_name`
:   Name of the object.

## Options

`--from TEXT`
:   The start time of the logs to retrieve. Accepts all ISO8061 formats.

`--to TEXT`
:   The end time of the logs to retrieve. Accepts all ISO8061 formats.

`--refresh INTEGER`
:   If set, the logs will be streamed with the given refresh time in seconds.

`--table TEXT`
:   The table to query for logs. If not provided, the default table will be used.

`--log-level TEXT`
:   The log level to filter by. If not provided, INFO will be used. Default: INFO.

`--partial`
:   Enable partial, case-insensitive matching for object names. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow logs` command accesses an event table and retrieves [logs](../../../logging-tracing/logging.md) for a specified entity. By default, the command looks for
the logs in the default event table, which is SNOWFLAKE.TELEMETRY.EVENTS; however, you can select a different table with the
`--table` option. For more information about event tables and default values, see [Create an event table](../../../logging-tracing/event-table-setting-up.md).

You can use the `--from` and `-to` options to filter the period during which to retrieve the logs.
You can use one or both of these option, but if you use both, the `--from` time must be earlier than the `--to` time.
The values for times you provide must comply with the [ISO 8601 standard](https://www.iso.org/iso-8601-date-and-time-format.html).
For more information, you can also check the Python [datetime.fromisoformat()](https://docs.python.org/3/library/datetime.html#datetime.datetime.fromisoformat) method documentation.

The `--log-level` option lets you filter message by [severity level](../../../logging-tracing/event-table-columns.md).
Some logs do not include a severity level. In those cases, messages are display for all `--log-level` values.

The `--partial` option lets you retrieve logs that contain a specific string using a case-insensitive match. For example, if you searched for logs containing *myDb* with this option, the results would include logs for databases named *mydb*, *MYDB*, and *MyDb*. Without this option, it would return only logs for databases named exactly *myDb*.

If you want continuous updates for the logs, you can use the `--refresh` option and provide the number of seconds between retrievals.
You cannot use both the `--refresh` and `--to` options together.
To stop streaming the logs, use your system’s default `Keyboardinterrupt` key, such as `CTRL-c` in a Mac Terminal.

## Examples

* Display the compute pool logs for a period from a specified starting time to now:

  ```snowcli
  snow logs compute_pool MY_COMPUTE_POOL --from '2025-04-01 09:00:31'
  ```

  ```output
  10.12.71.201 - - [01/Apr/2025 09:46:07] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:09] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:14] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:19] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:24] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:29] "GET /healthcheck HTTP/1.1" 200 -
  10.12.71.201 - - [01/Apr/2025 09:46:34] "GET /healthcheck HTTP/1.1" 200 -
  ```
* Display the logs for a specific event table:

  ```snowcli
  snow logs compute_pool SNOWCLI_COMPUTE_POOL --table "my_db.my_schema.my_events"
  ```
* Display the logs for all databases that contain `myDb` using a case-insensitive partial match:

  ```snowcli
  snow logs database myDb --partial
  ```
* Display the logs for a time range where the from time is later than the to time, which causes an error:

  ```snowcli
  snow logs compute_pool SNOWCLI_COMPUTE_POOL --from '2025-03-24 12:00:31' --to "2024-01-03 00:00:00"
  ```

  ```output
  ╭─ Error ─────────────────────────────────────────────────────────
  │ From_time cannot be later than to_time. Please check the values
  ╰─────────────────────────────────────────────────────────────────
  ```

---
title: snow logs commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/logs-commands/overview.md
section: Snowflake CLI
---

# snow logs commands

Snowflake CLI supports the following commands for accessing logs for various entities:

* [snow logs](logs.md)

---
title: snow notebook commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/overview.md
section: Snowflake CLI
---

# snow notebook commands

Snowflake CLI supports the following commands for managing Snowflake notebooks:

* [snow notebook create](create.md)
* [snow notebook deploy](deploy.md)
* [snow notebook execute](execute.md)
* [snow notebook get-url](get-url.md)
* [snow notebook open](open.md)

You can get a list of all available notebooks by running the `snow object list notebook` command. For more information, see [snow object list](../object-commands/list.md).

---
title: snow notebook create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/create.md
section: Snowflake CLI
---

# snow notebook create

> **Note:**
>
> Beginning with version 3.4.0, Snowflake CLI added the [snow notebook deploy](deploy.md) command to replace the `snow notebook create` command. To support backward compatibility, you can still create a notebook using this command, but Snowflake recommends that you begin using the new [Deploy and create a notebook](../../notebooks/use-notebooks.md) procedure.

Creates notebook from stage.

## Syntax

```console
snow notebook create
  <identifier>
  --notebook-file <notebook_file>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of the notebook; for example: MY_NOTEBOOK.

## Options

`--notebook-file, -f TEXT`
:   Stage path with notebook file. For example `@stage/path/to/notebook.ipynb`.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

By default, the command creates notebooks using the default warehouse provided in the connection. You can use the `--warehouse` parameter to specify a different warehouse or to specify one if the connection does not include a warehouse.

## Examples

The following example creates `MY_NOTEBOOK` from the staged `@MY_STAGE/path/to/notebook.ipynb` notebook:

```snowcli
snow notebook create MY_NOTEBOOK -f @MY_STAGE/path/to/notebook.ipynb
```

---
title: snow notebook deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/deploy.md
section: Snowflake CLI
---

# snow notebook deploy

Uploads a notebook and required files to a stage and creates a Snowflake notebook.

## Syntax

```console
snow notebook deploy
  <entity_id>
  --replace
  --prune / --no-prune
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`entity_id`
:   ID of notebook entity.

## Options

`--replace`
:   Replace notebook object if it already exists. It only uploads new and overwrites existing files, but does not remove any files already on the stage. Default: False.

`--prune / --no-prune`
:   Delete files that exist in the stage, but not in the local filesystem. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow notebook deploy` command uploads local files to a stage and creates a new Notebook object inside your chosen database and schema. Your [project definition file](../../notebooks/use-notebooks.md) should specify the main notebook file and query warehouse. The `--replace` option replaces the specified Notebook object if it already exists.

## Examples

The following example uploads the files specified in your project definition file and creates a new notebook named `my_notebook`:

```snowcli
snow notebook deploy my_notebook
```

```output
Uploading artifacts to @notebooks/my_notebook
  Creating stage notebooks if not exists
  Uploading artifacts
Creating notebook my_notebook
Notebook successfully deployed and available under https://snowflake.com/provider-deduced-from-connection/#/notebooks/DB.SCHEMA.MY_NOTEBOOK
```

---
title: snow notebook execute
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/execute.md
section: Snowflake CLI
---

# snow notebook execute

Executes a notebook in a headless mode.

## Syntax

```console
snow notebook execute
  <identifier>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of the notebook; for example: MY_NOTEBOOK.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow notebook execute` command executes a notebook in headless mode. Currently, the command only returns a message indicating whether the notebook executed successfully. It doesn’t return any result data from the notebook.

## Examples

The following example executes the `MY_NOTEBOOK` notebook:

```snowcli
snow notebook execute MY_NOTEBOOK
```

```output
Notebook MY_NOTEBOOK executed.
```

---
title: snow notebook get-url
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/get-url.md
section: Snowflake CLI
---

# snow notebook get-url

Return a url to a notebook.

## Syntax

```console
snow notebook get-url
  <identifier>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of the notebook; for example: MY_NOTEBOOK.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The notebook get-url command returns a url link to an existing notebooks. Note the following requirements:

* The notebook must already be deployed.
* If your notebook is running under a different database and schema than specified in the connection, you must provide them in name as a fully-qualified name, such as `database.schema.name`.

## Examples

This example gets a URL for an notebook using a fully-qualified database and schema name:

```snowcli
snow notebook get-url database.schema.my_notebook
```

---
title: snow notebook open
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/notebook-commands/open.md
section: Snowflake CLI
---

# snow notebook open

Opens a notebook in default browser

## Syntax

```console
snow notebook open
  <identifier>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`identifier`
:   Identifier of the notebook; for example: MY_NOTEBOOK.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The notebook open command opens existing notebooks in your default browser. Note the following requirements:

* The notebook must already be deployed.
* If your notebook is running under a different database and schema than specified in the connection, you must provide them in name as a fully-qualified name, such as `database.schema.name`.

## Examples

This example opens a notebook using a fully-qualified database and schema name:

```snowcli
snow notebook open database.schema.my_notebook
```

---
title: snow object commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/object-commands/overview.md
section: Snowflake CLI
---

# snow object commands

Snowflake CLI supports the following commands to support Snowflake objects:

* [snow object create](create.md)
* [snow object describe](describe.md)
* [snow object drop](drop.md)
* [snow object list](list.md)

---
title: snow object create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/object-commands/create.md
section: Snowflake CLI
---

# snow object create

Create an object of a given type. Check documentation for the list of supported objects and parameters.

## Syntax

```console
snow object create
  <object_type>
  <object_attributes>
  --json <object_json>
  --if-not-exists
  --replace
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type`
:   Type of object. For example table, database, compute-pool.

`object_attributes...`
:   Object attributes provided as a list of key=value pairs, for example name=my_db comment=’created with Snowflake CLI’. Check documentation for the full list of available parameters for every object. .

## Options

`--json TEXT`
:   Object definition in JSON format, for example ‘{“name”: “my_db”, “comment”: “created with Snowflake CLI”}’. Check documentation for the full list of available parameters for every object.

`--if-not-exists`
:   Only apply this operation if the specified object does not already exist. Default: False.

`--replace`
:   Replace this object if it already exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow object create` command creates one of the following types Snowflake objects, based on the provided object attributes or definitions:

* `account`
* `catalog-integration`
* `compute-pool`
* `database`
* `database-role`
* `dynamic-table`
* `event-table`
* `external-volume`
* `function`
* `image-repository`
* `managed-account`
* `network-policy`
* `notebook`
* `notification-integration`
* `pipe`
* `procedure`
* `role`
* `schema`
* `service`
* `stage`
* `stream`
* `table`
* `task`
* `user-defined-function`
* `view`
* `warehouse`

For each object, you must specify the appropriate object details using either the object attributes or the object definitions.

* Use the `object_attributes` parameter specifies the object details as a series of `<key>=<value>` pairs, such as:

  ```snowcli
  snow object create database name=my_db comment="Created with Snowflake CLI"
  ```
* Use the `--json object_definition` option to specify the object details as JSON, such as:

  ```snowcli
  snow object create table name=my_table columns='[{"name":"col1","datatype":"number", "nullable":false}]' constraints='[{"name":"prim_key", "column_names":["col1"], "constraint_type":"PRIMARY KEY"}]' --database my_db --schema public
  ```
* See Examples for more examples.

> **Note:**
>
> The following object types require a database to be identified in the connection configuration, such as `config.toml`, or passed to the command using the `--database` option.
>
> * image-repository
> * schema
> * service
> * table
> * task

The following sections describe the attributes that Snowflake CLI supports for selected object types.

* compute-pool
* database
* image-repository
* schema
* service
* table
* task
* warehouse

You can find attributes for other types of objects by checking their corresponding SQL CREATE command references, such as [CREATE ACCOUNT](../../../../sql-reference/sql/create-account.md).

### Compute pool object attributes

Compute pool attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **min_nodes**  *required*, *integer* | Minimum number of nodes for the compute pool. |
| **max_nodes**  *required*, *integer* | Maximum number of nodes for the compute pool. |
| **instance_family**  *required*, *string* | Name of the instance family. For more information about instance families, refer to the SQL CREATE COMPUTE POOL command. |
| **auto_resume**  *optional*, *string* | Whether to resume the compute pool automatically when any statement that requires the compute pool is submitted. |
| **comment**  *optional*, *string* | Comment describing the compute pool. |
| **auto_suspend_secs**  *optional*, *string* | Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. |

### Database object attributes

Database attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **comment**  *optional*, *string* | Comment describing the database. |
| **data_retention_time_in_days**  *optional*, *integer* | Number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the schema, as well as the default Time Travel retention time for all tables created in the schema. |
| **default_ddl_collation**  *optional*, *string* | Default collation specification for all schemas and tables added to the database. You can override this default at the schema and individual table level. |
| **max_data_extension_time_in_days**  *optional*, *integer* | Maximum number of days for which Snowflake can extend the data retention period for tables in the database to prevent streams on the tables from becoming stale. |
| **suspend_task_after_num_failures**  *optional*, *integer* | Number of consecutive failed task runs after which the current task is suspended automatically. |
| **user_task_managed_initial_warehouse_size**  *optional*, *integer* | Size of the compute resources to provision for the first run of the task, before a task history is available for Snowflake to determine an ideal size. Possible values include: XSMALL, SMALL, MEDIUM, LARGE, and XLARGE. |
| **user_task_timeout_ms**  *optional*, *integer* | Time limit, in milliseconds, for a single run of the task before it times out. For information, see [USER_TASK_TIMEOUT_MS](../../../../sql-reference/parameters.md). |

### Image repository object attributes

Image repository attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |

### Schema object attributes

Schema attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **comment**  *optional*, *string* | Comment describing the schema. |
| **data_retention_time_in_days**  *optional*, *integer* | Number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the schema, as well as the default Time Travel retention time for all tables created in the schema. |
| **default_ddl_collation**  *optional*, *string* | Default collation specification for all schemas and tables added to the database. You can override this default at the schema and individual table level. |
| **max_data_extension_time_in_days**  *optional*, *integer* | Maximum number of days for which Snowflake can extend the data retention period for tables in the database to prevent streams on the tables from becoming stale. |
| **suspend_task_after_num_failures**  *optional*, *integer* | Number of consecutive failed task runs after which the current task is suspended automatically. |
| **user_task_managed_initial_warehouse_size**  *optional*, *integer* | Size of the compute resources to provision for the first run of the task, before a task history is available for Snowflake to determine an ideal size. |
| **user_task_timeout_ms**  *optional*, *integer* | Time limit, in milliseconds, for a single run of the task before it times out. For information, see [USER_TASK_TIMEOUT_MS](../../../../sql-reference/parameters.md). |

### Service object attributes

Service attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **compute_pool**  *required*, *string* | Name of the compute pool in your account on which to run the service. |
| **spec**  *required*, *object* | Service specification. See service specification table for details. |
| **external_access_integrations**  *optional*, *string list* | Names of the external access integrations that allow your service to access external sites. |
| **auto_resume**  *optional*, *boolean* | Whether to automatically resume a service when a service function or ingress is called. |
| **min_instances**  *optional*, *integer* | Minimum number of service instances to run. |
| **max_instances**  *optional*, *integer* | Maximum number of service instances to run. |
| **query_warehouse**  *optional*, *string* | Warehouse to use if a service container connects to Snowflake to execute a query but does not explicitly specify a warehouse to use. |
| **comment**  *optional*, *string* | Comment for the service. |

**Service specification attributes**

Service specification attributes

| Attribute | Description |
| --- | --- |
| **spec_type**  *required*, *string* | Type of the service specification. Possible values include `from_file` or `from_inline`. |
| **spec_text**  *required*, *string* | (Valid only for `spec_type="from_inline"`)  Service specification. You can use a pair of dollar signs ($$) to delimit the beginning and ending of the specification string. |
| **stage**  *required*, *string* | (Valid only for `spec_type="from_inline"`)  Snowflake internal stage where the specification file is stored, such as `@tutorial_stage`. |
| **name**  *required*, *string* | (Valid only for `spec_type="from_inline"`)  Path to the service specification file on the stage, such as `some-dir/echo_spec.yaml`. |

### Table object attributes

Table attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. The name must be unique for the schema in which the table is created. |
| **kind**  *optional*, *string* | Table type. Possible values include: TABLE for permanent tables, TEMPORARY, and TRANSIENT. |
| **comment**  *optional*, *string* | Description of the table. |
| **cluster_by[]**  *optional*, *string list* | List of one or more columns or column expressions in the table as the clustering key. |
| **enable_schema_evolution**  *optional*, *boolean* | Whether to enable or disable schema evolution for the table. |
| **change_tracking**  *optional*, *boolean* | Whether to enable or disable change tracking for the table. |
| **data_retention_time_in_days**  *optional*, *integer* | Retention period, in days, for the table so that Time Travel actions SELECT, CLONE, UNDROP can be performed on historical data in the table. |
| **max_data_extension_time_in_days**  *optional*, *integer* | Maximum number of days Snowflake can extend the data retention period to prevent streams on the table from becoming stale. |
| **default_ddl_collation**  *optional*, *string* | Default collation specification for the columns in the table, including columns added to the table in the future. |
| **columns**  *required*, *column list* | List of column definitions. See Column definition attributes. |
| **constraints**  *optional*, *constraint list* | List of constraint definitions. See Constrain definition attributes. |

**Column definition attributes**

Column definition attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Column name. |
| **datatype**  *required*, *string* | Type of data contained in the column. |
| **nullable**  *optional*, *boolean* | Whether the column allows NULL values. |
| **collate**  *optional*, *string* | Collation to use for column operations such as string comparison. |
| **default**  *optional*, *string* | Whether to automatically insert a default value in the column if a value is not explicitly specified with an INSERT or CREATE TABLE AS SELECT statement. |
| **autoincrement**  *optional*, *boolean* | Whether to automatically increment and include the number in successive columns. |
| **autoincrement_start**  *optional*, *integer* | Staring value for the column. |
| **autoincrement_increment**  *optional*, *integer* | Increment for determining the next auto-incremented number. |
| **comment**  *optional*, *string* | Column description. |

**Constraint definition attributes**

Constraint definition attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Constraint name. |
| **column_names**  *required*, *string list* | Names of columns to apply the constraint. |
| **constraint_type**  *required*, *string* | Type of the constraint. Possible values include: UNIQUE, PRIMARY KEY and FOREIGN KEY. |
| **referenced_table_name**  *required*, *string* | (Valid only for `constraint_type="FOREIGN KEY"`)  Name of table referenced by foreign key |
| **referenced_column_names**  *optional*, *string* | (Valid only for `constraint_type="FOREIGN KEY"`)  Names of columns referenced by foreign key |

### Task attributes

Task attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **definition**  *required*, *string* | SQL definition for the task. It can be a single SQL statement, a call to a stored procedure, or procedural logic using Snowflake scripting. |
| **warehouse**  *optional*, *string* | Virtual warehouse that provides compute resources for task runs. |
| **schedule**  *optional*, *string* | Schedule for periodically running the task. See Task schedule attributes for details. |
| **comment**  *optional*, *string* | Comment description for the task. |
| **predecessors**  *optional*, *string list* | One or more predecessor tasks for the current task. |
| **user_task_managed_initial_warehouse_size**  *optional*, *string* | Size of the compute resources to provision for the first run of the task. |
| **user_task_timeout_ms**  *optional*, *string* | Time limit, in milliseconds, on a single run of the task before it times out. For information, see [USER_TASK_TIMEOUT_MS](../../../../sql-reference/parameters.md). |
| **suspend_task_after_num_failures**  *optional*, *integer* | Number of consecutive failed task runs after which the current task is suspended automatically. |
| **condition**  *optional*, *string* | Boolean SQL expression condition; multiple conditions joined with AND/OR are supported. |
| **allow_overlapping_execution**  *optional*, *boolean* | Whether to allow multiple instances of the DAG to run concurrently. |

**Task schedule attributes**

Task schedule attributes

| Attribute | Description |
| --- | --- |
| **schedule_type**  *optional*, *string* | Type of the schedule. Possible values include `CRON_TYPE` or `MINUTES_TYPE`. |
| **cron_expr**  *optional*, *string* | (Valid only for `schedule_type="CRON_TYPE"`)  A cron expression for the task execution, such as `“* * * * ? *”`. |
| **timezone**  *optional*, *string* | (Valid only for `schedule_type="CRON_TYPE"`)  Time zone for the schedule, for example `"america/los_angeles"`. |
| **minutes**  *optional*, *string* | (Valid only for `schedule_type="MINUTES_TYPE"`)  Number of minutes between each task run. |

### Warehouse attributes

Warehouse attributes

| Attribute | Description |
| --- | --- |
| **name**  *required*, *string* | Snowflake object identifier. |
| **comment**  *optional*, *string* | Description of the warehouse. |
| **warehouse_type**  *optional*, *string* | Type of warehouse. Possible values include: STANDARD and SNOWPARK-OPTIMIZED. |
| **warehouse_size**  *optional*, *string* | Size of warehouse. Possible values include: XSMALL, SMALL, MEDIUM, LARGE, XLARGE, XXLARGE, XXXLARGE, X4LARGE, X5LARGE, and X6LARGE. |
| **auto_suspend**  *optional*, *string* | Time, in seconds, before the warehouse automatically suspends itself. |
| **auto_resume**  *optional*, *string* | Whether to automatically resume a warehouse when a SQL statement is submitted to it. Possible values include: “true” and “false”. |
| **max_concurrency_level**  *optional*, *integer* | Concurrency level for SQL statements executed by a warehouse cluster. |
| **statement_queued_timeout_in_seconds**  *optional*, *integer* | Time, in seconds, a SQL statement can be queued on a warehouse before it is canceled by the system. |
| **statement_timeout_in_seconds**  *optional*, *integer* | Time, in seconds, after which a running SQL statement is canceled by the system. |
| **resource_monitor**  *optional*, *string* | Name of a resource monitor that is explicitly assigned to the warehouse. When a resource monitor is explicitly assigned to a warehouse, the monitor controls the monthly credits used by the warehouse. |

## Examples

* Create a database object using the `option-attributes` parameter:

  ```snowcli
  snow object create database name=my_db comment='Created with Snowflake CLI'
  ```
* Create a table object using the `option-attributes` parameter:

  ```snowcli
  snow object create table name=my_table columns='[{"name":"col1","datatype":"number", "nullable":false}]' constraints='[{"name":"prim_key", "column_names":["col1"], "constraint_type":"PRIMARY KEY"}]' --database my_db --schema public
  ```
* Create a database using the `--json object-definition` option:

  ```snowcli
  snow object create database --json '{"name":"my_db", "comment":"Created with Snowflake CLI"}'
  ```
* Create a table using the `--json object-definition` option:

  ```snowcli
  snow object create table --json "$(cat table.json)" --database my_db
  ```

  where `table.json` contains the following:

  ```json
  {
    "name": "my_table",
    "columns": [
      {
        "name": "col1",
        "datatype": "number",
        "nullable": false
      }
    ],
    "constraints": [
      {
        "name": "prim_key",
        "column_names": ["col1"],
        "constraint_type": "PRIMARY KEY"
      }
    ]
  }
  ```

---
title: snow object describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/object-commands/describe.md
section: Snowflake CLI
---

# snow object describe

Provides description of an object of given type. Supported types: compute-pool, database, external-access-integration, function, git-repository, integration, network-rule, notebook, procedure, role, schema, secret, service, stage, stream, streamlit, table, task, user, view, warehouse

## Syntax

```console
snow object describe
  <object_type>
  <object_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type`
:   Type of object. For example table, database, compute-pool.

`object_name`
:   Name of the object.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `IDENTIFIER` for procedures and functions must specify argument types, such as `"hello(int,string)"`.

## Examples

To describe a function, run a command similar to the following:

```snowcli
snow object describe function "hello_function(string)"
```

```output
describe function hello_function(string)
+---------------------------------------------------------------------
| property           | value
|--------------------+------------------------------------------------
| signature          | (NAME VARCHAR)
| returns            | VARCHAR(16777216)
| language           | PYTHON
| null handling      | CALLED ON NULL INPUT
| volatility         | VOLATILE
| body               | None
| imports            |
| handler            | functions.hello_function
| runtime_version    | 3.12
| packages           | ['snowflake-snowpark-python']
| installed_packages | ['_libgcc_mutex==0.1','_openmp_mutex==5.1',...
+---------------------------------------------------------------------
```

---
title: snow object drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/object-commands/drop.md
section: Snowflake CLI
---

# snow object drop

Drops Snowflake object of given name and type. Supported types: compute-pool, database, external-access-integration, function, git-repository, image-repository, integration, network-rule, notebook, procedure, role, schema, secret, service, stage, stream, streamlit, table, task, user, view, warehouse

## Syntax

```console
snow object drop
  <object_type>
  <object_name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type`
:   Type of object. For example table, database, compute-pool.

`object_name`
:   Name of the object.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `IDENTIFIER` for procedures and functions must specify argument types, such as `"hello(int,string)"`.

## Examples

To delete a procedure, run a command similar to the following:

```snowcli
snow object drop procedure "test_procedure()"
```

```output
drop procedure test_procedure()
+--------------------------------------+
| status                               |
|--------------------------------------|
| TEST_PROCEDURE successfully dropped. |
+--------------------------------------+
```

---
title: snow object list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/object-commands/list.md
section: Snowflake CLI
---

# snow object list

Lists all available Snowflake objects of given type. Supported types: compute-pool, database, external-access-integration, function, git-repository, image-repository, integration, network-rule, notebook, procedure, role, schema, secret, service, stage, stream, streamlit, table, task, user, view, warehouse

## Syntax

```console
snow object list
  <object_type>
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type`
:   Type of object. For example table, database, compute-pool.

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list function --like "my%"` lists all functions that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list table --in database my_db`. Some object types have specialized scopes (e.g. `list service --in compute-pool my_pool`). Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `--like [-l] <pattern>` option lets you specify a SQL LIKE pattern for filtering objects by name. For example, `snow object list function --like "my%"` lists all functions
that begin with **my**. For more information about SQL patterns syntax, see [SQL LIKE Keyword](https://www.w3schools.com/sql/sql_ref_like.asp).

## Examples

The following example lists all roles beginning with **public**. The `--like` option

```snowcli
snow object list role --like public%
```

```output
show roles like 'public%'
+-------------------------------------------------------------------------------
| created_on                       | name        | is_default | is_current | ...
|----------------------------------+-------------+------------+------------+----
| 2023-02-01 15:25:04.105000-08:00 | PUBLIC      | N          | N          | ...
| 2024-01-15 12:55:05.840000-08:00 | PUBLIC_TEST | N          | N          | ...
+-------------------------------------------------------------------------------
```

---
title: snow package create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/package-commands/create.md
section: Snowflake CLI
---

# snow package create

Creates a Python package as a zip file that can be uploaded to a stage and imported for a Snowpark Python app.

## Syntax

```console
snow snowpark package create
  <name>
  --ignore-anaconda
  --index-url <index_url>
  --skip-version-check
  --allow-shared-libraries
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Name of the package to create.

## Options

`--ignore-anaconda`
:   Does not lookup packages on Snowflake Anaconda channel. Default: False.

`--index-url TEXT`
:   Base URL of the Python Package Index to use for package lookup. This should point to a repository compliant with PEP 503 (the simple repository API) or a local directory laid out in the same format.

`--skip-version-check`
:   Skip comparing versions of dependencies between requirements and Anaconda. Default: False.

`--allow-shared-libraries`
:   Allows shared (.so) libraries, when using packages installed through PIP. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snowpark package create` command does the following:

* Creates an artifact ready to upload to a stage.
* Checks for native libraries and asks if you want to continue. If the native libraries are present in the downloaded packages, this command works the same as the `snowpark package build` command.

## Examples

* This example creates a Python package as a zip file that can be uploaded to a stage and later imported by a Snowpark Python app. Dependencies for the “july” package are found on the Anaconda channel, so they were excluded from the `.zip` file. The command displays the packages you would need to include in `requirements.txt` of your Snowpark project.

  ```snowcli
  snow snowpark package create july==0.1
  ```

  ```output
  Package july.zip created. You can now upload it to a stage using
  snow snowpark package upload -f july.zip -s <stage-name>`
  and reference it in your procedure or function.
  Remember to add it to imports in the procedure or function definition.

  The package july is successfully created, but depends on the following
  Anaconda libraries. They need to be included in project requirements,
  as their are not included in .zip.
  matplotlib
  contourpy >=1.0.1
  numpy>=1.20
  bokeh
  selenium
  mypy==1.8.0
  Pillow
  pytest-xdist
  wurlitzer
  cycler >=0.10
  fonttools >=4.22.0
  kiwisolver >=1.3.1
  pyparsing >=2.3.1
  jinja2
  python-dateutil >=2.7
  six >=1.5
  importlib-resources >=3.2.0
  ```
* This example creates the `july.zip` package that you can use in your Snowpark project without needing to add any dependencies to the `requirements.txt` file. The error messages indicate that some packages contain shared libraries, which might not work, such as when creating a package using Windows.

  ```snowcli
  snow snowpark package create july==0.1 --ignore-anaconda --allow-shared-libraries
  ```

  ```output
  2024-04-11 16:24:56 ERROR Following dependencies utilise shared libraries, not supported by Conda:
  2024-04-11 16:24:56 ERROR numpy
  contourpy
  fonttools
  kiwisolver
  matplotlib
  pillow
  2024-04-11 16:24:56 ERROR You may still try to create your package with --allow-shared-libraries, but the might not work.
  2024-04-11 16:24:56 ERROR You may also request adding the package to Snowflake Conda channel
  2024-04-11 16:24:56 ERROR at https://support.anaconda.com/

  Package july.zip created. You can now upload it to a stage using
  snow snowpark package upload -f july.zip -s <stage-name>`
  and reference it in your procedure or function.
  Remember to add it to imports in the procedure or function definition.
  ```
* This example fails to create the package because it already exists. You can still forcibly create the package by using the `--ignore-anaconda` option.

  ```snowcli
  snow snowpark package create matplotlib
  ```

  ```output
  Package matplotlib is already available in Snowflake Anaconda Channel.
  ```

---
title: snow package lookup
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/package-commands/lookup.md
section: Snowflake CLI
---

# snow package lookup

Checks if a package is available on the Snowflake Anaconda channel.

## Syntax

```console
snow snowpark package lookup
  <package_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`package_name`
:   Name of the package.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow snowpark lookup` command checks to see whether a package is available on the Snowflake Anaconda channel.

## Examples

The following example illustrates looking up a package that is already available on the Snowflake Anaconda channel:

```snowcli
snow snowpark package lookup numpy
```

```output
Package `numpy` is available in Anaconda. Latest available version: 1.26.4.
```

If a package is not available on the Snowflake Anaconda channel, you can get a message similar to the following:

```snowcli
snow snowpark package lookup july
```

```output
Package `july` is not available in Anaconda. To prepare Snowpark compatible package run:

  snow snowpark package create july
```

---
title: snow package upload
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/package-commands/upload.md
section: Snowflake CLI
---

# snow package upload

Uploads a Python package zip file to a Snowflake stage so it can be referenced in the imports of a procedure or function.

## Syntax

```console
snow snowpark package upload
  --file <file>
  --stage <stage>
  --overwrite
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--file, -f PATH`
:   Path to the file to upload.

`--stage, -s TEXT`
:   Name of the stage in which to upload the file, not including the @ symbol.

`--overwrite, -o`
:   Overwrites the file if it already exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

If you specify a stage that does not exist, the command creates it automatically.

## Examples

* Upload a package to a stage:

  ```snowcli
  snow snowpark package upload -f my_package.zip -s deployments
  ```

  ```output
  Package my_package.zip UPLOADED to Snowflake @deployments/my_package.zip.
  ```
* Upload a package to a stage that already contains a package with that name:

  ```snowcli
  snow snowpark package upload -f my_package.zip -s deployments
  ```

  ```output
  Package already exists on stage. Consider using --overwrite to overwrite the file.
  ```

---
title: snow snowpark build
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/build.md
section: Snowflake CLI
---

# snow snowpark build

Builds artifacts required for the Snowpark project. The artifacts can be used by `deploy` command. For each directory in artifacts a .zip file is created. All non-anaconda dependencies are packaged in dependencies.zip file.

## Syntax

```console
snow snowpark build
  --ignore-anaconda
  --allow-shared-libraries
  --index-url <index_url>
  --skip-version-check
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--ignore-anaconda`
:   Does not lookup packages on Snowflake Anaconda channel. Default: False.

`--allow-shared-libraries`
:   Allows shared (.so) libraries, when using packages installed through PIP. Default: False.

`--index-url TEXT`
:   Base URL of the Python Package Index to use for package lookup. This should point to a repository compliant with PEP 503 (the simple repository API) or a local directory laid out in the same format.

`--skip-version-check`
:   Skip comparing versions of dependencies between requirements and Anaconda. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* The `app.zip` contains everything needed to run the functions and procedures in the project, apart from packages available through [Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/), which you can call directly from Snowflake.-
* The command parses `requirements.txt` for packages available on Conda channel. This process creates the `requirements.snowflake.txt` file that contains project dependencies available on the Conda channel, which is later used by the `snow snowpark deploy` command.
* By default, the command looks for the `snowflake.yml` file in the current directory. Alternatively, you can specify a different path with the `--project` option.
* This command automatically downloads dependencies and adds them to a file called `app.zip`, together with project source code (specified by the `src` field in the `snowflake.yml` file.
* To use different Python Package Index than PyPi, specify one using the `--index-url` option.
* You can use `--skip-version-check` option to skip version requirements between project dependencies and the Anaconda Channel.
* You can use the `--ignore-anaconda` option to include all the required dependencies in the `app.zip` file, even those available in Snowflake Anaconda channel. The dependencies aren’t downloaded from Anaconda, but from PyPi.
* The `--allow-shared-libraries` option checks whether any of the packages downloaded from PyPi are using native dependencies, which can cause problems as Snowpark currently supports only native dependencies for packages taken from Conda channel

## Examples

* Build a project located in the current directory:

  ```snowcli
  snow snowpark build
  ```

  ```output
  Resolving dependencies from requirements.txt
    No external dependencies.
  Preparing artifacts for source code
    Creating: app.zip
  Build done.
  ```
* Build a project located in a different directory:

  ```bash
  ls
  ```

  ```output
  project_dir    some_other_dir    some_file.txt
  ```

  ```snowcli
  snow snowpark build -p project_dir
  ```

  ```output
  Resolving dependencies from requirements.txt
    No external dependencies.
  Preparing artifacts for source code
    Creating: app.zip
  Build done.
  ```
* Build a project in a directory with no `snowflake.yml` project definition:

  ```bash
  ls
  ```

  ```output
  project_dir    some_other_dir    some_file.txt
  ```

  ```snowcli
  snow snowpark build
  ```

  ```output
  ╭─ Error ──────────────────────────────────────────────────────────╮
    Cannot find project definition (snowflake.yml). Please provide
    a path to the project or run this command in a valid
    project directory.
  ╰──────────────────────────────────────────────────────────────────╯
  ```
* Build a project with native libraries:

  ```snowcli
  snow snowpark build --ignore-anaconda --allow-shared-libraries
  ```

  ```output
  2024-04-16 16:05:52 ERROR Following dependencies utilise shared libraries, not supported by Conda:
  2024-04-16 16:05:52 ERROR contourpy
  pillow
  numpy
  kiwisolver
  fonttools
  matplotlib
  2024-04-16 16:05:52 ERROR You may still try to create your package with --allow-shared-libraries, but the might not work.
  2024-04-16 16:05:52 ERROR You may also request adding the package to Snowflake Conda channel
  2024-04-16 16:05:52 ERROR at https://support.anaconda.com/
  Build done. Artifact path: /Path/to/current/dir/project_dir/app.zip
  ```
* Build a project and include all dependencies:

  ```snowcli
  snow snowpark build --ignore-anaconda
  ```

  ```output
  Resolving dependencies from requirements.txt
    No external dependencies.
  Preparing artifacts for source code
    Creating: app.zip
  Build done.
  ```

---
title: snow snowpark commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/overview.md
section: Snowflake CLI
---

# snow snowpark commands

Snowflake CLI supports the following Snowpark commands:

* [snow snowpark build](build.md)
* [snow snowpark deploy](deploy.md)
* [snow snowpark describe](describe.md)
* [snow snowpark drop](drop.md)
* [snow snowpark execute](execute.md)
* [snow snowpark list](list.md)
* [snow snowpark package commands](package-commands/overview.md)

---
title: snow snowpark deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/deploy.md
section: Snowflake CLI
---

# snow snowpark deploy

Deploys procedures and functions defined in project. Deploying the project alters all objects defined in it. By default, if any of the objects exist already the commands will fail unless `--replace` flag is provided. Required artifacts are deployed before creating functions or procedures. Dependencies are deployed once to every stage specified in definitions.

## Syntax

```console
snow snowpark deploy
  --replace
  --force-replace
  --prune / --no-prune
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--replace`
:   Replaces procedure or function if there were changes in the definition. It only uploads new and overwrites existing files, but does not remove any files already on the stage. Default: False.

`--force-replace`
:   Replace this object, even if the state didn’t change. Default: False.

`--prune / --no-prune`
:   Remove contents of the stage before uploading artifacts. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow snowpark deploy` command does the following:

* Checks to see whether the objects listed for deployment already exist. If the objects exist, you must use the `--replace` option.

  > **Note:**
  >
  > If you want to update objects and files, even if they did not change, you can use the `--force-replace` option.
* Creates a stage in the database specified for your connection. If no stage is defined, the command creates a stage named `deployments`.
* If the `--prune` option was specified, removes existing content from the stage used by defined procedures and function objects.
* Uploads the new artifacts.
* Creates the objects specified then `snowflake.yml` file by executing the SQL CREATE PROCEDURE or CREATE FUNCTION queries.

The command deploys the source code and dependencies from the most recent build. If you modified the code or added any requirements since the last build, you must run the [snow snowpark build](build.md) command again before deploying the new version.

> > **Note:**
> >
> > When deploying a Snowpark stored procedure, Snowflake CLI lets you upload artifacts to a folder within a stage. This makes it possible to deploy several procedures to a single stage.
> >
> > If you are deploying to a different Snowflake account, you must run the [snow snowpark build](build.md) command again before deploying.

## Examples

The following example shows how to deploy functions and procedures in the current directory.

```snowcli
snow snowpark deploy
```

```output
+-----------------------------------------------------------------------------------+
| object                                             | type      | status           |
|----------------------------------------------------+-----------+------------------|
| MY_DATABASE.PUBLIC.HELLO_PROCEDURE(name string)    | procedure | packages updated |
| MY_DATABASE.PUBLIC.TEST_PROCEDURE()                | procedure | created          |
| MY_DATABASE.PUBLIC.HELLO_FUNCTION(name string)     | function  | packages updated |
+-----------------------------------------------------------------------------------+
```

The following example shows what happens when objects already exist and you deploy without specifying the `--replace` option.

```snowcli
snow snowpark deploy
```

```output
╭─ Error ──────────────────────────────────────────────────────────╮
│ Following objects already exists. Consider using --replace.      |
│ function: MY_DATABASE.PUBLIC.HELLO_FUNCTION(string)              |
│ procedure: MY_DATABASE.PUBLIC.HELLO_PROCEDURE(string)            |
│ procedure: MY_DATABASE.PUBLIC.TEST_PROCEDURE()                   |
╰──────────────────────────────────────────────────────────────────╯
```

---
title: snow snowpark describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/describe.md
section: Snowflake CLI
---

# snow snowpark describe

Provides description of a procedure or function.

## Syntax

```console
snow snowpark describe
  <object_type>
  <identifier>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type:{procedure|function}`
:   Type of Snowpark object.

`identifier`
:   Identifier of the function/procedure; for example: hello(int, string).

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow snowpark drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/drop.md
section: Snowflake CLI
---

# snow snowpark drop

Drop procedure or function.

## Syntax

```console
snow snowpark drop
  <object_type>
  <identifier>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type:{procedure|function}`
:   Type of Snowpark object.

`identifier`
:   Identifier of the function/procedure; for example: hello(int, string).

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow snowpark execute
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/execute.md
section: Snowflake CLI
---

# snow snowpark execute

Executes a procedure or function in a specified environment.

## Syntax

```console
snow snowpark execute
  <object_type>
  <execution_identifier>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type:{procedure|function}`
:   Type of Snowpark object.

`execution_identifier`
:   Execution identifier of the procedure/function. For example: hello(1, ‘world’).

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow snowpark execute` command executes a function or procedure stored in Snowflake. It uses the database defined for the connection.

Based on which command shell you use, you might need to wrap the `execution_identifier` argument in quotes, as illustrated in the **Examples** section.

## Examples

The following example calls a Snowpark function called `hello_function`:

```snowcli
snow snowpark execute function "hello_function('Olaf')"
```

```output
+--------------------------------------+
| key                    | value       |
|------------------------+-------------|
| HELLO_FUNCTION('Olaf') | Hello Olaf! |
+--------------------------------------+
```

---
title: snow snowpark list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/list.md
section: Snowflake CLI
---

# snow snowpark list

Lists all available procedures or functions.

## Syntax

```console
snow snowpark list
  <object_type>
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`object_type:{procedure|function}`
:   Type of Snowpark object.

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list function --like "my%"` lists all functions that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list function --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow snowpark package commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/snowpark-commands/package-commands/overview.md
section: Snowflake CLI
---

# snow snowpark package commands

Snowflake CLI supports the following commands to support Snowpark packages:

* [snow package create](create.md)
* [snow package lookup](lookup.md)
* [snow package upload](upload.md)

---
title: snow spcs compute-pool commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/overview.md
section: Snowflake CLI
---

# snow spcs compute-pool commands

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowflake CLI supports the following commands to support compute pools:

* [snow spcs compute-pool create](create.md)
* [snow spcs compute-pool deploy](deploy.md)
* [snow spcs compute-pool describe](describe.md)
* [snow spcs compute-pool drop](drop.md)
* [snow spcs compute-pool resume](resume.md)
* [snow spcs compute-pool set](set.md)
* [snow spcs compute-pool status](status.md)
* [snow spcs compute-pool stop-all](stop-all.md)
* [snow spcs compute-pool suspend](suspend.md)
* [snow spcs compute-pool unset](unset.md)

---
title: snow spcs compute-pool create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/create.md
section: Snowflake CLI
---

# snow spcs compute-pool create

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Creates a new compute pool.

## Syntax

```console
snow spcs compute-pool create
  <name>
  --family <instance_family>
  --min-nodes <min_nodes>
  --max-nodes <max_nodes>
  --auto-resume
  --no-auto-resume
  --init-suspend / --no-init-suspend
  --auto-suspend-secs <auto_suspend_secs>
  --tag <tags>
  --comment <comment>
  --if-not-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--family TEXT`
:   Name of the instance family. For more information about instance families, refer to the SQL CREATE COMPUTE POOL command.

`--min-nodes INTEGER RANGE`
:   Minimum number of nodes for the compute pool. Default: 1.

`--max-nodes INTEGER RANGE`
:   Maximum number of nodes for the compute pool.

`--auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it. Default: False.

`--no-auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it. Default: False.

`--init-suspend / --no-init-suspend`
:   Starts the compute pool in a suspended state. Default: False.

`--auto-suspend-secs INTEGER RANGE`
:   Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. Default: 3600.

`--tag NAME=VALUE`
:   Tag for the compute pool.

`--comment TEXT`
:   Comment for the compute pool.

`--if-not-exists`
:   Only apply this operation if the specified object does not already exist. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example creates a compute pool named “pool_1” using the minimal CPU_X64_XS family, which comprises two
CPUs with 4GB of memory.

```snowcli
snow spcs compute-pool create "pool_1" --min-nodes 2 --max-nodes 2 --family "CPU_X64_XS"
```

---
title: snow spcs compute-pool deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/deploy.md
section: Snowflake CLI
---

# snow spcs compute-pool deploy

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Deploys a compute pool from the project definition file.

## Syntax

```console
snow spcs compute-pool deploy
  <entity_id>
  --upgrade
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`entity_id`
:   ID of compute-pool entity.

## Options

`--upgrade`
:   Updates the existing compute pool. Can update min_nodes, max_nodes, auto_resume, auto_suspend_seconds and comment. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow spcs compute pool deploy` command reads a `snowflake.yml` project definition file that defines a compute pool.
If your project definition has precisely one compute pool entity, you can omit the `<entity_id>` argument. However, if your project definition has multiple compute pool entities, you must specify the compute pool name in the `<entity_id>` argument.
For more information, see [Compute pools project definition](../../../services/manage-compute-pools.md).

The `--upgrade` option updates an existing service. You can update only the following project definition parameters:

* `min_instances`
* `max_instances`
* `query_warehouse`
* `auto_resume`
* `external_access_integrations`
* `comment`

## Examples

The following example creates and deploys a compute pool defined in the `snowflake.yml` file in the current directory.

```snowcli
snow spcs compute-pool deploy
```

```output
+---------------------------------------------------------------------+
| key    | value                                                      |
|--------+------------------------------------------------------------|
| status | Compute pool MY_COMPUTE_POOL successfully created.         |
+---------------------------------------------------------------------+
```

---
title: snow spcs compute-pool describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/describe.md
section: Snowflake CLI
---

# snow spcs compute-pool describe

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Provides description of compute pool.

## Syntax

```console
snow spcs compute-pool describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example creates a compute pool named “pool_1” using the minimal CPU_X64_XS family, which comprises two
CPUs with 4GB of memory.

```snowcli
snow spcs compute-pool create "pool_1" --min-nodes 2 --max-nodes 2 --family "CPU_X64_XS"
```

---
title: snow spcs compute-pool drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/drop.md
section: Snowflake CLI
---

# snow spcs compute-pool drop

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Creates a new compute pool.

## Syntax

```console
snow spcs compute-pool create
  <name>
  --family <instance_family>
  --min-nodes <min_nodes>
  --max-nodes <max_nodes>
  --auto-resume
  --no-auto-resume
  --init-suspend / --no-init-suspend
  --auto-suspend-secs <auto_suspend_secs>
  --tag <tags>
  --comment <comment>
  --if-not-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--family TEXT`
:   Name of the instance family. For more information about instance families, refer to the SQL CREATE COMPUTE POOL command.

`--min-nodes INTEGER RANGE`
:   Minimum number of nodes for the compute pool. Default: 1.

`--max-nodes INTEGER RANGE`
:   Maximum number of nodes for the compute pool.

`--auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it. Default: False.

`--no-auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it. Default: False.

`--init-suspend / --no-init-suspend`
:   Starts the compute pool in a suspended state. Default: False.

`--auto-suspend-secs INTEGER RANGE`
:   Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. Default: 3600.

`--tag NAME=VALUE`
:   Tag for the compute pool.

`--comment TEXT`
:   Comment for the compute pool.

`--if-not-exists`
:   Only apply this operation if the specified object does not already exist. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example creates a compute pool named “pool_1” using the minimal CPU_X64_XS family, which comprises two
CPUs with 4GB of memory.

```snowcli
snow spcs compute-pool create "pool_1" --min-nodes 2 --max-nodes 2 --family "CPU_X64_XS"
```

---
title: snow spcs compute-pool list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/list.md
section: Snowflake CLI
---

# snow spcs compute-pool list

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists all available compute pools.

## Syntax

```console
snow spcs compute-pool list
  --like <like>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all compute pools that begin with “my”.. Default: %%.

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs compute-pool resume
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/resume.md
section: Snowflake CLI
---

# snow spcs compute-pool resume

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Resumes the compute pool from a SUSPENDED state.

## Syntax

```console
snow spcs compute-pool resume
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the compute pool to resume it.

## Examples

```snowcli
snow spcs compute-pool resume tutorial_compute_pool
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs compute-pool set
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/set.md
section: Snowflake CLI
---

# snow spcs compute-pool set

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Sets one or more properties for the compute pool.

## Syntax

```console
snow spcs compute-pool set
  <name>
  --min-nodes <min_nodes>
  --max-nodes <max_nodes>
  --auto-resume
  --no-auto-resume
  --auto-suspend-secs <auto_suspend_secs>
  --comment <comment>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--min-nodes INTEGER RANGE`
:   Minimum number of nodes for the compute pool.

`--max-nodes INTEGER RANGE`
:   Maximum number of nodes for the compute pool.

`--auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it.

`--no-auto-resume`
:   The compute pool will automatically resume when a service or job is submitted to it.

`--auto-suspend-secs INTEGER RANGE`
:   Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool.

`--comment TEXT`
:   Comment for the compute pool.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have MODIFY privilege on the compute pool to set properties.

## Examples

```snowcli
snow spcs compute-pool set tutorial_compute_pool --min-nodes 2 --max-nodes 4
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs compute-pool status
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/status.md
section: Snowflake CLI
---

# snow spcs compute-pool status

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Retrieves the status of a compute pool along with a relevant message, if one exists.

## Syntax

```console
snow spcs compute-pool status
  <pool_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`pool_name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs compute-pool stop-all
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/stop-all.md
section: Snowflake CLI
---

# snow spcs compute-pool stop-all

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Deletes all services running on the compute pool.

## Syntax

```console
snow spcs compute-pool stop-all
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example stops a compute pool named “pool1” and deletes all services running on it:

```snowcli
snow spcs compute-pool stop-all "pool1"
```

---
title: snow spcs compute-pool suspend
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/suspend.md
section: Snowflake CLI
---

# snow spcs compute-pool suspend

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Suspends the compute pool by suspending all currently running services and then releasing compute pool nodes.

## Syntax

```console
snow spcs compute-pool suspend
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the compute pool to suspend it.

## Examples

```snowcli
snow spcs compute-pool suspend tutorial_compute_pool
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs compute-pool unset
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/compute-pool-commands/unset.md
section: Snowflake CLI
---

# snow spcs compute-pool unset

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Resets one or more properties for the compute pool to their default value(s).

## Syntax

```console
snow spcs compute-pool unset
  <name>
  --auto-resume
  --auto-suspend-secs
  --comment
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the compute pool; for example: my_compute_pool.

## Options

`--auto-resume`
:   Reset the AUTO_RESUME property - The compute pool will automatically resume when a service or job is submitted to it. Default: False.

`--auto-suspend-secs`
:   Reset the AUTO_SUSPEND_SECS property - Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. Default: False.

`--comment`
:   Reset the COMMENT property - Comment for the compute pool. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have MODIFY privilege on the compute pool to reset properties.

## Examples

```snowcli
snow spcs compute-pool unset tutorial_compute_pool --auto-resume
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs image-registry commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-registry-commands/overview.md
section: Snowflake CLI
---

# snow spcs image-registry commands

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowflake CLI provides the following commands for managing image registries:

* [snow spcs image-registry login](login.md)
* [snow spcs image-registry token](token.md)
* [snow spcs image-registry url](url.md)

---
title: snow spcs image-registry login
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-registry-commands/login.md
section: Snowflake CLI
---

# snow spcs image-registry login

Logs in to the account image registry with the current user’s credentials through Docker. Must be called from a role that can view at least one image repository in the image registry.

## Syntax

```console
snow spcs image-registry login
  --private-link
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--private-link`
:   Get the private link URL instead of the public URL. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* [Docker Desktop](https://www.docker.com/products/docker-desktop/) must be installed because the command uses docker to log in to Snowflake.
* The current role must have READ privileges for the image repository in the account to get the registry URL.

## Examples

```snowcli
snow spcs image-registry login
```

```output
Login Succeeded
```

---
title: snow spcs image-registry token
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-registry-commands/token.md
section: Snowflake CLI
---

# snow spcs image-registry token

Retrieves a registry authentication token based on your current connection. Note that this token is specific to your current user and will not grant access to any repositories that your current user cannot access.

## Syntax

```console
snow spcs image-registry token
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example shows how to return the token associated with the specified
connection that you can use to authenticate with the registry.

```snowcli
snow spcs image-registry token --connection mytest
```

```output
+----------------------------------------------------------------------------------------------------------------------+
| key        | value                                                                                                   |
|------------+---------------------------------------------------------------------------------------------------------|
| token      | ****************************************************************************************************    |
|            | ****************************************************************************************************    |
| expires_in | 3600                                                                                                    |
+----------------------------------------------------------------------------------------------------------------------+
```

Example usage with docker:

```snowcli
snow spcs image-registry token --format=JSON | docker login YOUR_HOST -u 0sessiontoken --password-stdin
```

---
title: snow spcs image-registry url
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-registry-commands/url.md
section: Snowflake CLI
---

# snow spcs image-registry url

Gets the image registry URL for the current account. Must be called from a role that can view at least one image repository in the image registry.

## Syntax

```console
snow spcs image-registry url
  --private-link
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--private-link`
:   Get the private link URL instead of the public URL. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have READ privileges for the image repository in the account to get the registry URL.

## Examples

```snowcli
snow spcs image-registry url
```

```output
<orgname-acctname>.registry.snowflakecomputing.com
```

---
title: snow spcs image-repository commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/overview.md
section: Snowflake CLI
---

# snow spcs image-repository commands

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowflake CLI provides the following commands to manage image repositories:

* [snow spcs image-repository create](create.md)
* [snow spcs image-repository deploy](deploy.md)
* [snow spcs image-repository drop](drop.md)
* [snow spcs image-repository list](list.md)
* [snow spcs image-repository list-images](list-images.md)
* [snow spcs image-repository list-tags](list-tags.md)
* [snow spcs image-repository url](url.md)

---
title: snow spcs image-repository create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/create.md
section: Snowflake CLI
---

# snow spcs image-repository create

Creates a new image repository in the current schema.

## Syntax

```console
snow spcs image-repository create
  <name>
  --replace
  --if-not-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the image repository; for example: my_repository.

## Options

`--replace`
:   Replace this object if it already exists. Default: False.

`--if-not-exists`
:   Only apply this operation if the specified object does not already exist. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

```snowcli
snow spcs image-repository create tutorial_repository
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs image-repository deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/deploy.md
section: Snowflake CLI
---

# snow spcs image-repository deploy

Deploys a new image repository from snowflake.yml file.

## Syntax

```console
snow spcs image-repository deploy
  <entity_id>
  --replace
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`entity_id`
:   ID of image-repository entity.

## Options

`--replace`
:   Replace the image repository if it already exists. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow spcs image repository deploy` command creates an image repository from its definition in a `snowflake.yml` project definition file. For more information, see [Image repository project definition file](../../../services/manage-images.md).

## Examples

The following example creates an image repository defined in the `snowflake.yml` file in the current directory.

```snowcli
snow spcs image-repository deploy
```

```output
+---------------------------------------------------------------------+
| key    | value                                                      |
|--------+------------------------------------------------------------|
| status | Image Repository MY_IMAGE_REPOSITORY successfully created. |
+---------------------------------------------------------------------+
```

---
title: snow spcs image-repository drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/drop.md
section: Snowflake CLI
---

# snow spcs image-repository drop

Drops image repository with given name.

## Syntax

```console
snow spcs image-repository drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the image repository; for example: my_repository.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs image-repository list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/list.md
section: Snowflake CLI
---

# snow spcs image-repository list

Lists all available image repositories.

## Syntax

```console
snow spcs image-repository list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `--like "my%"` lists all image repositories that begin with “my”.. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs image-repository list-images
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/list-images.md
section: Snowflake CLI
---

# snow spcs image-repository list-images

Lists images in the given repository.

## Syntax

```console
snow spcs image-repository list-images
  <name>
  --like <like_option>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the image repository; for example: my_repository.

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `--like "my%"` lists all image repositories that begin with “my”.. Default: %%.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example lists the images and tags in a repository named `images` in the `my_db` database:

```snowcli
snow spcs image-repository list-images images --database my_db
```

```output
+--------------------------------------------------------------------------------------------------------------------------------------------------------+
| created_on                | image_name            | tags   | digest                                         | image_path                               |
|---------------------------+-----------------------+--------+------------------------------------------------+------------------------------------------|
| 2024-10-11 14:23:49-07:00 | echo_service          | latest | sha256:a8a001fef406fdb3125ce8e8bf9970c35af7084 | my_db/test_schema/images/echo_service:   |
|                           |                       |        | fc33b0886d7a8915d3082c781                      | latest                                   |
| 2024-10-14 22:21:14-07:00 | test_counter          | latest | sha256:8cae96dac29a4a05f54bb5520003f964baf67fc | my_db/test_schema/images/test_counter:   |
|                           |                       |        | 38dcad3d2c85d6c5aa7381174                      | latest                                   |
+--------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow spcs image-repository list-tags
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/list-tags.md
section: Snowflake CLI
---

# snow spcs image-repository list-tags

Lists tags for the given image in a repository. This command is deprecated and will be removed in a future release. Use `list-images` instead.

## Syntax

```console
snow spcs image-repository list-tags
  <name>
  --image-name <image_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the image repository; for example: my_repository.

## Options

`--image-name, --image_name, -i TEXT`
:   Fully qualified name of the image as shown in the output of list-images.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example lists the tags associated with the registry named `MY_DB/PUBLIC/images/cp-schema-registry`.

```snowcli
snow spcs image-repository list-tags images --image_name "MY_DB/PUBLIC/images/cp-schema-registry" --database my_db
```

```output
+----------------------------------------------------+
| tag                                                |
|----------------------------------------------------|
| /MY_DB/PUBLIC/images/cp-schema-registry:7.3.0      |
+----------------------------------------------------+
```

---
title: snow spcs image-repository url
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/image-repository-commands/url.md
section: Snowflake CLI
---

# snow spcs image-repository url

Returns the URL for the given repository.

## Syntax

```console
snow spcs image-repository url
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the image repository; for example: my_repository.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* The current role must have READ privileges for the image repository in the account to get the registry URL.
* The URL is returned as a text string, so you can store it in an environment variable for convenience. For example:

  ```snowcli
  export REPO_URL = $(snow spcs image-repository url <name>)
  ```

## Examples

```snowcli
snow spcs image-repository url tutorial_repository
```

```output
<orgname-acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
```

---
title: snow spcs service commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/overview.md
section: Snowflake CLI
---

# snow spcs service commands

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowflake CLI supports the following commands for managing services:

> * [snow spcs service create](create.md)
> * [snow spcs service deploy](deploy.md)
> * [snow spcs service describe](describe.md)
> * [snow spcs service drop](drop.md)
> * [snow spcs service events](events.md)
> * [snow spcs service execute-job](execute-job.md)
> * [snow spcs service list](list.md)
> * [snow spcs service list-containers](list-containers.md)
> * [snow spcs service list-endpoints](list-endpoints.md)
> * [snow spcs service list-instances](list-instances.md)
> * [snow spcs service list-roles](list-roles.md)
> * [snow spcs service logs](logs.md)
> * [snow spcs service metrics](metrics.md)
> * [snow spcs service resume](resume.md)
> * [snow spcs service set](set.md)
> * [snow spcs service status](status.md)
> * [snow spcs service suspend](suspend.md)
> * [snow spcs service unset](unset.md)
> * [snow spcs service upgrade](upgrade.md)

---
title: snow spcs service create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/create.md
section: Snowflake CLI
---

# snow spcs service create

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Creates a new service in the current schema.

## Syntax

```console
snow spcs service create
  <name>
  --compute-pool <compute_pool>
  --spec-path <spec_path>
  --min-instances <min_instances>
  --max-instances <max_instances>
  --auto-resume / --no-auto-resume
  --eai-name <external_access_integrations>
  --query-warehouse <query_warehouse>
  --tag <tags>
  --comment <comment>
  --if-not-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--compute-pool TEXT`
:   Compute pool to run the service on.

`--spec-path FILE`
:   Path to service specification file.

`--min-instances INTEGER RANGE`
:   Minimum number of service instances to run. Default: 1.

`--max-instances INTEGER RANGE`
:   Maximum number of service instances to run.

`--auto-resume / --no-auto-resume`
:   The service will automatically resume when a service function or ingress is called. Default: True.

`--eai-name TEXT`
:   Identifies external access integrations (EAI) that the service can access. This option may be specified multiple times for multiple EAIs.

`--query-warehouse TEXT`
:   Warehouse to use if a service container connects to Snowflake to execute a query without explicitly specifying a warehouse to use.

`--tag NAME=VALUE`
:   Tag for the service.

`--comment TEXT`
:   Comment for the service.

`--if-not-exists`
:   Only apply this operation if the specified object does not already exist. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

You can optionally choose to run more than one instance of your service. Each service instance is a collection of
containers, as defined in the service specification file, that run together on a node in your compute pool. If you
choose to run multiple instances of a service, a load balancer manages incoming traffic.

## Examples

```snowcli
snow spcs service create "my-service" --compute-pool "pool_1" --spec-path "/some-dir/echo-speck.yaml"
```

---
title: snow spcs service deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/deploy.md
section: Snowflake CLI
---

# snow spcs service deploy

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Deploys a service defined in the project definition file.

## Syntax

```console
snow spcs service deploy
  <entity_id>
  --upgrade
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`entity_id`
:   ID of service entity.

## Options

`--upgrade`
:   Updates the existing service. Can update min_instances, max_instances, query_warehouse, auto_resume, auto_suspend_secs, external_access_integrations and comment. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `snow spcs service deploy` command reads a `snowflake.yml` project definition file that defines a service, then creates and deploys the compute pool to a stage named in the `snowflake.yml` file.
If your project definition has precisely one service entity, you can omit the `<entity_id>` argument. However, if your project definition has multiple service entities, you must specify the service name in the `<entity_id>` argument.
For more information, see [Services project definition](../../../services/manage-services.md).

You can optionally choose to run more than one instance of your service. Each service instance is a collection of
containers, as defined in the service specification file, that run together on a node in your compute pool. If you
choose to run multiple instances of a service, a load balancer manages incoming traffic.

The `--upgrade` option updates an existing service. You can update only the following project definition parameters:

* `min_instances`
* `max_instances`
* `query_warehouse`
* `auto_resume`
* `external_access_integrations`
* `comment`

## Examples

The following example creates and deploys a service defined in the `snowflake.yml` file in the current directory.

```snowcli
snow spcs service deploy
```

```output
+---------------------------------------------------------------------+
| key    | value                                                      |
|--------+------------------------------------------------------------|
| status | Service MY_SERVICE successfully created.                   |
+---------------------------------------------------------------------+
```

---
title: snow spcs service describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/describe.md
section: Snowflake CLI
---

# snow spcs service describe

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Provides description of service.

## Syntax

```console
snow spcs service describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs service drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/drop.md
section: Snowflake CLI
---

# snow spcs service drop

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Drops service with given name.

## Syntax

```console
snow spcs service drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs service events
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/events.md
section: Snowflake CLI
---

# snow spcs service events

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Retrieve platform events for a service container.

## Syntax

```console
snow spcs service events
  <name>
  --container-name <container_name>
  --instance-id <instance_id>
  --since <since>
  --until <until>
  --first <first>
  --last <last>
  --all
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --private-key-file <private_key_file>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --format <format>
  --verbose
  --debug
  --silent
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--container-name TEXT`
:   Name of the container.

`--instance-id TEXT`
:   ID of the service instance, starting with 0.

`--since TEXT`
:   Fetch events that are newer than this time ago, in Snowflake interval syntax.

`--until TEXT`
:   Fetch events that are older than this time ago, in Snowflake interval syntax.

`--first INTEGER`
:   Fetch only the first N events. Cannot be used with –last.

`--last INTEGER`
:   Fetch only the last N events. Cannot be used with –first.

`--all`
:   Fetch all columns. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token-file-path TEXT`
:   Path to file with an OAuth token that should be used when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Run Python connector diagnostic test. Default: False.

`--diag-log-path TEXT`
:   Diagnostic report path. Default: <temporary_directory>.

`--diag-allowlist-path TEXT`
:   Diagnostic report path to optional allowlist.

`--format [TABLE|JSON]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> To use this command, you must enable the `enable_spcs_service_events` feature in your `config.toml` file, as shown:
>
> ```toml
> [cli.features]
> enable_spcs_service_events = true
> ```

* The following parameters are required:

  + `name`
  + `--container-name <name>`
  + `--instance-id <ID>`
* You can use the `--since` and `--until` time-based filters to return events for a specified period of time. You can specify the time as a relative time, such as `1h` (hour) or `2d` (days).
* You can use the `--first` and `--last` options to return only a specified number of events. Note that these options are mutually exclusive.

## Examples

* Retrieve all events for a specific service:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0
  ```
* Retrieve a subset of events for a specific service:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --first 5
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --last 5
  ```
* Fetch events newer than the last five minutes:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --since '5 minutes'
  ```
* Fetch events older than one hour:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --until '1 hour'
  ```
* Retrieve all events with all columns displayed:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --all --last 1
  ```

  ```output
  TIMESTAMP | DATABASE NAME | SCHEMA NAME | SERVICE NAME | INSTANCE NAME | CONTAINER NAME | SEVERITY | EVENT NAME | EVENT VALUE
  -- | -- | -- | -- | -- | -- | -- | -- | --
  2024-12-13 10:01:52.808692 | TESTDB | PUBLIC | LOG_EVENT | 0 | log-printer | INFO | CONTAINER.STATUS_CHANGE | { "message": "Running", "status": "READY" }
  2024-12-14 22:27:25.420489 | TESTDB | PUBLIC | LOG_EVENT | 0 | log-printer | INFO | CONTAINER.STATUS_CHANGE | { "message": "Running", "status": "READY" }
  ```
* Retrieve events formatted for JSON output:

  ```snowcli
  snow spcs service events LOG_EVENT --container-name log-printer --instance-id 0 --last 1 --format json
  ```

  ```output
  [
       {
           "TIMESTAMP": "2024-12-14T22:27:25.420489",
           "DATABASE NAME": "TESTDB",
           "SCHEMA NAME": "PUBLIC",
           "SERVICE NAME": "LOG_EVENT",
           "INSTANCE NAME": "0",
           "CONTAINER NAME": "log-printer",
           "SEVERITY": "INFO",
           "EVENT NAME": "CONTAINER.STATUS_CHANGE",
           "EVENT VALUE": "{\n  \"message\": \"Running\",\n  \"status\": \"READY\"\n}"
       }
   ]
  ```

---
title: snow spcs service execute-job
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/execute-job.md
section: Snowflake CLI
---

# snow spcs service execute-job

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Creates and executes a job service in the current schema.

## Syntax

```console
snow spcs service execute-job
  <name>
  --compute-pool <compute_pool>
  --spec-path <spec_path>
  --eai-name <external_access_integrations>
  --query-warehouse <query_warehouse>
  --comment <comment>
  --async
  --replicas <replicas>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--compute-pool TEXT`
:   Compute pool to run the job service on.

`--spec-path FILE`
:   Path to service specification file.

`--eai-name TEXT`
:   Identifies external access integrations (EAI) that the job service can access. This option may be specified multiple times for multiple EAIs.

`--query-warehouse TEXT`
:   Warehouse to use if a service container connects to Snowflake to execute a query without explicitly specifying a warehouse to use.

`--comment TEXT`
:   Comment for the service.

`--async`
:   Execute the job asynchronously without waiting for completion. Default: False.

`--replicas INTEGER RANGE`
:   Number of job replicas to run.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow spcs service list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/list.md
section: Snowflake CLI
---

# snow spcs service list

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists all available services.

## Syntax

```console
snow spcs service list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all services that begin with “my”.. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in compute-pool my_pool`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following command lists the services and their statuses:

```snowcli
snow spcs service list
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|        |        |        |        |        |        |        |        |        |        |        |         | extern |         |        |         |        |         |        |        |         |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         | al_acc |         |        |         |        |         |        |        |         |        | managin | managi |
|        |        | databa |        |        |        |        | curren | target | min_in | max_in |         | ess_in |         |        |         |        | owner_r | query_ |        |         |        | g_objec | ng_obj |
|        |        | se_nam | schema |        | comput | dns_na | t_inst | _insta | stance | stance | auto_re | tegrat | created | update | resumed | commen | ole_typ | wareho |        | spec_di | is_upg | t_domai | ect_na |
| name   | status | e      | _name  | owner  | e_pool | me     | ances  | nces   | s      | s      | sume    | ions   | _on     | d_on   | _on     | t      | e       | use    | is_job | gest    | rading | n       | me     |
|--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+--------+---------+--------+---------+--------+---------+--------+---------+--------+--------+---------+--------+---------+--------|
| ECHO_S | RUNNIN | TEST00 | TEST_S | SYSADM | TUTORI | echo-s | 1      | 1      | 1      | 1      | true    | None   | 2024-10 | 2024-1 | None    | This   | ROLE    | COMPUT | false  | 52e62d1 | false  | None    | None   |
| ERVICE | G      | _DB    | CHEMA  | IN     | AL_COM | ervice |        |        |        |        |         |        | -16     | 0-16   |         | is a   |         | E_WH   |        | f19c720 |        |         |        |
|        |        |        |        |        | PUTE_P | .imhd. |        |        |        |        |         |        | 15:09:3 | 15:09: |         | test   |         |        |        | 6b5f4ef |        |         |        |
|        |        |        |        |        | OOL    | svc.sp |        |        |        |        |         |        | 0.49300 | 31.905 |         | servic |         |        |        | c069557 |        |         |        |
|        |        |        |        |        |        | cs.int |        |        |        |        |         |        | 0-07:00 | 000-07 |         | e      |         |        |        | 8b6c2b3 |        |         |        |
|        |        |        |        |        |        | ernal  |        |        |        |        |         |        |         | :00    |         |        |         |        |        | 806ad76 |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 67d78cc |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | ce8b6ed |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 6501a8a |        |         |        |
|        |        |        |        |        |        |        |        |        |        |        |         |        |         |        |         |        |         |        |        | 3       |        |         |        |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow spcs service list-containers
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/list-containers.md
section: Snowflake CLI
---

# snow spcs service list-containers

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists all service containers in a service.

## Syntax

```console
snow spcs service list-containers
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

This example lists containers in the `echo_service` service:

```snowcli
snow spcs service list-containers echo_service
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| database_name | schema_name | service_name | instance_id | container_name | status | message | image_name                                | image_digest                              | restart_count | start_time           |
|---------------+-------------+--------------+-------------+----------------+--------+---------+-------------------------------------------+-------------------------------------------+---------------+----------------------|
| TEST00_DB     | TEST_SCHEMA | ECHO_SERVICE | 0           | main           | READY  | Running | org-test-account-00.registry.registry.sno | sha256:06c3d54edc24925abe398eda70d37eb6b8 | 0             | 2024-10-16T22:09:35Z |
|               |             |              |             |                |        |         | wflakecomputing.com/test00_db/test_schema | 7b1c4dd6211317592764e1e7d94498            |               |                      |
|               |             |              |             |                |        |         | /test00_repo/echo_service:latest          |                                           |               |                      |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow spcs service list-endpoints
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/list-endpoints.md
section: Snowflake CLI
---

# snow spcs service list-endpoints

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists the endpoints in a service.

## Syntax

```console
snow spcs service list-endpoints
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

```snowcli
snow spcs service list-endpoints echo_service
```

```output
+--------------+------+----------+-----------------+-----------------------------------------+
| name         | port | protocol | ingress_enabled | ingress_url                             |
|--------------+------+----------+-----------------+-----------------------------------------|
| echoendpoint | 8000 | TCP      | true            | org-id-acct-id.snowflakecomputing.app   |
+--------------+------+----------+-----------------+-----------------------------------------+
```

---
title: snow spcs service list-instances
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/list-instances.md
section: Snowflake CLI
---

# snow spcs service list-instances

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists all service instances in a service.

## Syntax

```console
snow spcs service list-instances
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

This example lists the instances in the `echo_service` service:

```snowcli
snow spcs service list-instances echo_service
```

```output
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| database_name | schema_name | service_name | instance_id | status | spec_digest                                                      | creation_time        | start_time           |
|---------------+-------------+--------------+-------------+--------+------------------------------------------------------------------+----------------------+----------------------|
| TEST00_DB     | TEST_SCHEMA | ECHO_SERVICE | 0           | READY  | 336c065739dd2b96e770f01804affdc7810e6df68a23b23052d851627abfbdf9 | 2024-10-10T06:06:30Z | 2024-10-10T06:06:30Z |
+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: snow spcs service list-roles
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/list-roles.md
section: Snowflake CLI
---

# snow spcs service list-roles

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Lists all service roles in a service.

## Syntax

```console
snow spcs service list-roles
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example gets a list of service roles created for a service:

```snowcli
snow spcs service list-roles my_service
```

```output
+------------------------------------------------------------------+
| created_on                       | name                | comment |
|----------------------------------+---------------------+---------|
| 2024-10-09 16:48:52.980000-07:00 | ALL_ENDPOINTS_USAGE | None    |
+------------------------------------------------------------------+
```

---
title: snow spcs service logs
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/logs.md
section: Snowflake CLI
---

# snow spcs service logs

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Retrieves local logs from a service container.

## Syntax

```console
snow spcs service logs
  <name>
  --container-name <container_name>
  --instance-id <instance_id>
  --num-lines <num_lines>
  --previous-logs
  --since <since_timestamp>
  --include-timestamps
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--container-name TEXT`
:   Name of the container.

`--instance-id TEXT`
:   ID of the service instance, starting with 0.

`--num-lines INTEGER`
:   Number of lines to retrieve. Default: 500.

`--previous-logs`
:   Retrieve logs from the last terminated container. Default: False.

`--since TEXT`
:   Start log retrieval from a specified UTC timestamp.

`--include-timestamps`
:   Include timestamps in logs. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* The current role must have the MONITOR privilege on the service to access the container logs.
* The function returns a container log as a string.
* When using the `--follow` option for real-time log streaming, the `--num-lines` and `--previous-logs` options are not supported.

## Examples

* The following example displays the last three lines of the `echo_service` logs:

  ```snowcli
  snow spcs service logs echo_service --container-name echo --instance-id 0 --num-lines 3
  ```

  ```output
  10.18.94.31 - - [22/Nov/2024 09:16:47] "GET /healthcheck HTTP/1.1" 200 -
  10.18.94.31 - - [22/Nov/2024 09:16:52] "GET /healthcheck HTTP/1.1" 200 -
  10.18.94.31 - - [22/Nov/2024 09:16:57] "GET /healthcheck HTTP/1.1" 200 -
  ```
* This example streams the logs for the `echo_service` service and updates them every 10 seconds:

  ```snowcli
  snow spcs service logs echo_service --container-name echo --instance-id 0 --follow --follow-interval 10
  ```
* The following example displays the log entries since 9:30 UTC, 21 Nov 2024:

  ```snowcli
  snow spcs service logs echo_service --container-name echo --instance-id 0 --since 2024-11-21T09:30:00Z
  ```
* The following example retrieves logs from the last-terminated container:

  ```snowcli
  snow spcs service logs example_job_service --container-name main --instance-id 0 --previous-logs
  ```

---
title: snow spcs service metrics
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/metrics.md
section: Snowflake CLI
---

# snow spcs service metrics

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Retrieve platform metrics for a service container.

## Syntax

```console
snow spcs service metrics
  <name>
  --container-name <container_name>
  --instance-id <instance_id>
  --since <since>
  --until <until>
  --all
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--container-name TEXT`
:   Name of the container.

`--instance-id TEXT`
:   ID of the service instance, starting with 0.

`--since TEXT`
:   Fetch events that are newer than this time ago, in Snowflake interval syntax.

`--until TEXT`
:   Fetch events that are older than this time ago, in Snowflake interval syntax.

`--all`
:   Fetch all columns. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* The following parameters are required:

  + `name`
  + `--container-name <name>`
  + `--instance-id <ID>`
* You can use the `--since` and `--until` time-based filters to return metrics for a specified period of time. You can specify the time as a relative time, such as `1h` (hour) or `2d` (days).

## Examples

* Retrieve metrics for a specific service:

  ```snowcli
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0
  ```
* Retrieve a subset of metrics for a specific service:

  ```snowcli
      snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0
  ```
* Fetch metrics older than the last two hours:

  ```snowcli
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0 --until '2 hours'
  ```
* Fetch metrics newer than one hour:

  ```snowcli
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0 --since '1hour'
  ```
* Retrieve metrics with all columns:

  ```snowcli
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0 --all
  ```

  ```snowcli
  | TIMESTAMP                  | DATABASE NAME | SCHEMA NAME | SERVICE NAME | INSTANCE NAME | CONTAINER NAME | METRIC NAME                | METRIC VALUE          |
  |----------------------------|---------------|-------------|--------------|---------------|----------------|----------------------------|-----------------------|
  | 2024-12-18 18:10:25.202000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.cpu.limit        | 1                     |
  | 2024-12-18 18:10:25.202000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.memory.requested | 536870912             |
  | 2024-12-18 18:10:25.202000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.memory.limit     | 6442450944            |
  | 2024-12-18 18:10:25.202000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.cpu.requested    | 0.5                   |
  | 2024-12-18 18:10:08.957000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.cpu.usage        | 0.0004400012665396536 |
  | 2024-12-18 18:10:08.957000 | TESTDB        | PUBLIC      | LOG_EVENT    | 0             | log-printer    | container.memory.usage     | 1323008               |
  ```
* Retrieve metrics formatted for JSON output:

  ```snowcli
  snow spcs service metrics LOG_EVENT --container-name log-printer --instance-id 0 --format json
  ```

  ```output
  [
      {
          "TIMESTAMP": "2024-12-14T22:27:25.420489",
          "SERVICE NAME": "LOG_EVENT",
          "INSTANCE NAME": "0",
          "CONTAINER NAME": "log-printer",
          "METRIC TYPE": "CPU_UTILIZATION",
          "VALUE": "75.4"
      }
  ]
  ```

---
title: snow spcs service resume
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/resume.md
section: Snowflake CLI
---

# snow spcs service resume

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Resumes the service from a SUSPENDED state.

## Syntax

```console
snow spcs service resume
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the service to resume a service.

## Examples

```snowcli
snow spcs service resume echo_service
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs service set
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/set.md
section: Snowflake CLI
---

# snow spcs service set

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Sets one or more properties for the service.

## Syntax

```console
snow spcs service set
  <name>
  --min-instances <min_instances>
  --max-instances <max_instances>
  --query-warehouse <query_warehouse>
  --auto-resume / --no-auto-resume
  --auto-suspend-secs <auto_suspend_secs>
  --eai-name <external_access_integrations>
  --comment <comment>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--min-instances INTEGER RANGE`
:   Minimum number of service instances to run.

`--max-instances INTEGER RANGE`
:   Maximum number of service instances to run.

`--query-warehouse TEXT`
:   Warehouse to use if a service container connects to Snowflake to execute a query without explicitly specifying a warehouse to use.

`--auto-resume / --no-auto-resume`
:   The service will automatically resume when a service function or ingress is called.

`--auto-suspend-secs INTEGER RANGE`
:   Number of seconds of inactivity after which the service will be automatically suspended.

`--eai-name TEXT`
:   Identifies external access integrations (EAI) that the service can access. This option may be specified multiple times for multiple EAIs.

`--comment TEXT`
:   Comment for the service.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the service to set properties.

## Examples

```snowcli
snow spcs service set echo_service --min-instances 2 --max-instances 4
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs service status
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/status.md
section: Snowflake CLI
---

# snow spcs service status

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Retrieves the status of a service. This command is deprecated and will be removed in a future release. Use `describe` instead to get service status and use `list-instances` and `list-containers` to get more detailed information about service instances and containers.

## Syntax

```console
snow spcs service status
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have the MONITOR privilege on the service to get the status information.

## Examples

None.

---
title: snow spcs service suspend
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/suspend.md
section: Snowflake CLI
---

# snow spcs service suspend

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Suspends the service, shutting down and deleting all its containers.

## Syntax

```console
snow spcs service suspend
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the service to suspend a service.

## Examples

```snowcli
snow spcs service suspend echo_service
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs service unset
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/unset.md
section: Snowflake CLI
---

# snow spcs service unset

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Resets one or more properties for the service to their default value(s).

## Syntax

```console
snow spcs service unset
  <name>
  --min-instances
  --max-instances
  --query-warehouse
  --auto-resume
  --auto-suspend-secs
  --comment
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--min-instances`
:   Reset the MIN_INSTANCES property - Minimum number of service instances to run. Default: False.

`--max-instances`
:   Reset the MAX_INSTANCES property - Maximum number of service instances to run. Default: False.

`--query-warehouse`
:   Reset the QUERY_WAREHOUSE property - Warehouse to use if a service container connects to Snowflake to execute a query without explicitly specifying a warehouse to use. Default: False.

`--auto-resume`
:   Reset the AUTO_RESUME property - The service will automatically resume when a service function or ingress is called. Default: False.

`--auto-suspend-secs`
:   Reset the AUTO_SUSPEND_SECS property - Number of seconds of inactivity after which the service will be automatically suspended. Default: False.

`--comment`
:   Reset the COMMENT property - Comment for the service. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the service to reset properties.

## Examples

```snowcli
snow spcs service unset echo_service --min-instances --max-instances --auto-resume
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow spcs service upgrade
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/service-commands/upgrade.md
section: Snowflake CLI
---

# snow spcs service upgrade

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Updates an existing service with a new specification file.

## Syntax

```console
snow spcs service upgrade
  <name>
  --spec-path <spec_path>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the service; for example: my_service.

## Options

`--spec-path FILE`
:   Path to service specification file.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The current role must have OPERATE privilege on the service to upgrade a service.

## Examples

```snowcli
snow spcs service upgrade echo_service --spec-path spec.yml
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

---
title: snow sql
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/sql-commands/sql.md
section: Snowflake CLI
---

# snow sql

Executes Snowflake query. Use either query, filename or input option. Query to execute can be specified using query option, filename option (all queries from file will be executed) or via stdin by piping output from other command. For example `cat my.sql | snow sql -i`. The command supports variable substitution that happens on client-side.

## Syntax

```console
snow sql
  --query <query>
  --filename <files>
  --stdin
  --variable <data_override>
  --retain-comments
  --single-transaction / --no-single-transaction
  --enable-templating <enabled_templating>
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--query, -q TEXT`
:   Query to execute.

`--filename, -f FILE`
:   File to execute. Default: [].

`--stdin, -i`
:   Read the query from standard input. Use it when piping input to this command. Default: False.

`--variable, -D TEXT`
:   String in format of key=value. If provided the SQL content will be treated as template and rendered using provided data.

`--retain-comments`
:   Retains comments in queries passed to Snowflake. Default: False.

`--single-transaction / --no-single-transaction`
:   Connects with autocommit disabled. Wraps BEGIN/COMMIT around statements to execute them as a single transaction, ensuring all commands complete successfully or no change is applied. Default: False.

`--enable-templating [LEGACY|STANDARD|JINJA|ALL|NONE]`
:   Syntax used to resolve variables before passing queries to Snowflake. Default: [<_EnabledTemplating.LEGACY: ‘LEGACY’>, <_EnabledTemplating.STANDARD: ‘STANDARD’>].

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

You can specify the SQL query to execute using one of the following options:

* Specify the query string using the `--query` option.
* Use the `--filename` option to execute one or more files containing a SQL query or queries. When you specify multiple files, all files are executed sequentially on a single connection. For example:

  + `snow sql -f myfile.sql`
  + `snow sql -f file1.sql -f file2.sql -f file3.sql`
* Specify the query as `stdin` and pipe it to the `snow sql` command, such as `cat my.sql | snow sql`.
* If your query contains special characters, such as the dollar sign in [SYSTEM functions](../../../../sql-reference/functions-system.md), that you do not want the shell to interpret, you can do either of the following:

  + Enclose the query in single quotes instead of double quotes, as in:

    `snow sql -q 'SELECT SYSTEM$CLIENT_VERSION_INFO()'`
  + Escape the special character, as in:

    `snow sql -q "SELECT SYSTEM\$CLIENT_VERSION_INFO()"`
* Use variables for templating SQL queries with a combination of a `<% variable_name %>` placeholder in your SQL queries and a `-D` command-line option, in the form:

  ```snowcli
  snow sql -q "select * from my-database order by <% column_name %>" -D "column_name=Country"
  ```

  > **Note:**
  >
  > You can currently use the SnowSQL `&variable_name` and `<% variable_name %>` syntax for templates. However, Snowflake recommends using the `<% variable_name %>` syntax.
* Specify a scripting block in queries. For example:

  ```sqlexample
  EXECUTE IMMEDIATE $$
  -- Snowflake Scripting code
  DECLARE
    radius_of_circle FLOAT;
    area_of_circle FLOAT;
  BEGIN
    radius_of_circle := 3;
    area_of_circle := pi() * radius_of_circle * radius_of_circle;
    RETURN area_of_circle;
  END;
  $$
  ;
  ```

  > **Note:**
  >
  > When specifying the scripting block directly on the Snowflake CLI command line, the `$$` delimiters might not work for some shells because they interpret that delimiter as something else. For example, the bash and zsh shells interpret it as the process ID (PID). To address this limitation, you can use the following alternatives:
  >
  > + If you still want to specify the scripting block on the command line, you can escape the `$$` delimiters, as in `\$\$`.
  > + You can also put the scripting block with the default `$$` delimiters into a separate file and call it with the `snow sql -f <filename>` command.

### Formatting JSON output

The `--format` option provides two ways to display JSON:

* `JSON`: Returns JSON as quoted strings, similar to the following:

  ```snowcli
  snow sql --format json -q "SELECT PARSE_JSON('{"name": "Alice", "age": 30}') as json_col"
  ```

  ```output
  [
    {
        "JSON_COL": "{\"name\": \"Alice\", \"age\": 30}"
    }
  ]
  ```
* `JSON_EXT`: Returns JSON as JSON objects, similar to the following:

  ```snowcli
  snow sql --format JSON_EXT -q "SELECT PARSE_JSON('{"name": "Alice", "age": 30}') as json_col"
  ```

  ```output
  [
    {
      "JSON_COL": {
      "name": "Alice",
      "age": 30
    }
  ]
  ```

### Enhanced error codes

The `--enhanced-exit-codes` option provides information that helps identify whether problems result from query execution or from invalid command options. With this option, the `snow sql` command provides the following return codes:

* `0`: Successful execution
* `2`: Command parameter issues
* `5`: Query execution issues
* `1`: Other types of issues

After the command executes, you can use the `echo $?` shell command to see the return code.

In this example, the command contains both a query parameter (`-q 'select 1'`) and a query file parameter (`-f my.query`), which is an invalid parameter combination:

```snowcli
snow sql --enhanced-exit-codes -q 'select 1' -f my.query

echo $?
```

```output
2
```

The following examples show the effect of the `--enhanced-exit-codes` option when the command contains an invalid query (slect is misspelled):

* With the `--enhanced-exit-codes` option, the command returns a `5` exit code to indicate a query error:

  ```snowcli
  snow sql --enhanced-exit-codes -q 'slect 1'

  echo $?
  ```

  ```output
  5
  ```
* Without the `--enhanced-exit-codes` option, the command returns a `1` exit code to indicate a generic (other) error:

  ```snowcli
  snow sql --enhanced-exit-codes -q 'slect 1'

  echo $?
  ```

  ```output
  1
  ```

Alternatively, you can set the `SNOWFLAKE_ENHANCED_EXIT_CODES` environment variable to `1` to send the enhanced return codes for all `snow sql` commands.

### Interactive mode

The `snow sql` command supports an interactive mode that lets you enter SQL commands one at a time. Interactive mode provides the following features:

* Syntax highlighting
* Code completion while typing
* Searchable history

  To search your command history, press `CTRL-R`:
* Multi-line input

  Pressing `ENTER` on a line that does not end with a semicolon (`;`) moves the cursor to the next line for more commands until a statement ends with a semi-colon.

To use interactive mode, enter the `snow sql` command followed by `ENTER`, as shown:

```snowcli
snow sql
```

The command opens a sub-shell with a `>` prompt where you can enter SQL commands interactively:

```output
$ snow sql
  ╭───────────────────────────────────────────────────────────────────────────────────╮
  │ Welcome to Snowflake-CLI REPL                                                   │
  │ Type 'exit' or 'quit' to leave                                                  │
  ╰───────────────────────────────────────────────────────────────────────────────────╯
  >
```

You can then enter SQL commands, as shown:

```snowcli
> create table my_table (c1 int);
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| Table MY_TABLE successfully created.|
+-------------------------------------+
```

> **Note:**
>
> You must end each SQL statement with a semicolon (`;`).

To exit interactive mode, enter `exit`, `quit`, or `CTRL-D`.

### Multiple commands in a single transaction

The `--single-transaction` option lets you enter multiple SQL commands to execute as an all-or-nothing set of commands.
By executing commands in a single transaction, you can ensure that all of the commands complete successfully before committing any of the changes.
If any of the commands fail, none of the changes from the successful commands persist.

The following examples show successful and unsuccessful transactions:

* Successful command execution

  ```snowcli
  snow sql -q "insert into my_tbl values (123); insert into my_tbl values (124);" --single-transaction
  ```

  ```output
  BEGIN;
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+

  insert into my_tbl values (123);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  insert into my_tbl values (124);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  COMMIT
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+
  ```

  You can then verify that the commands were committed to the database:

  ```snowcli
  snow sql -q "select count(*) from my_tbl"
  ```

  ```output
  select count(*) from my_tbl
  +----------+
  | COUNT(*) |
  |----------|
  | 2        |
  +----------+
  ```
* Unsuccessful single transaction

  ```snowcli
  snow sql -q "insert into my_tbl values (123); insert into my_tbl values (124); select BAD;" --single-transaction
  ```

  ```output
  BEGIN;
  +----------------------------------+
  | status                           |
  |----------------------------------|
  | Statement executed successfully. |
  +----------------------------------+

  insert into my_tbl values (123);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  insert into my_tbl values (124);
  +-------------------------+
  | number of rows inserted |
  |-------------------------|
  | 1                       |
  +-------------------------+

  select BAD;
  ╭─ Error ───────────────────────────────────────────────────────────────────────────────╮
  │ 000904 (42000): 01bc3b84-0810-0247-0001-c1be14ee11ce: SQL compilation error: error    │
  │ line 1 at position 7                                                                  │
  │ invalid identifier 'BAD'                                                              │
  ╰───────────────────────────────────────────────────────────────────────────────────────╯
  ```

> You can then verify that the commands were not committed to the database:
>
> > ```snowcli
> > snow sql -q "select count(*) from my_tbl"
> > ```
> >
> > ```output
> > select count(*) from my_tbl
> > +----------+
> > | COUNT(*) |
> > |----------|
> > | 0        |
> > +----------+
> > ```

## Examples

* The following example uses the SQL [SYSTEM$CLIENT_VERSION_INFO](../../../../sql-reference/functions/system_client_version_info.md) system function to return version information about the clients and drivers.

  ```snowcli
  snow sql --query 'SELECT SYSTEM$CLIENT_VERSION_INFO();'
  ```

  ```output
  select current_version();
  +-------------------+
  | CURRENT_VERSION() |
  |-------------------|
  | 8.25.1            |
  +-------------------+
  ```
* The following example shows how you can specify a database using a client-side variable:

  ```snowcli
  snow sql -q "select * from <% database %>.logs" -D "database=dev"
  ```

  When executed, the command substitutes the value `dev` in the `<% database %>` variable to create the `dev.logs` identifier and then sends the `select * from dev.logs` SQL query to Snowflake for processing.

  > **Note:**
  >
  > You can currently use the SnowSQL `&variable_name` and &``{ variable_name }`` syntax for templates. However, Snowflake recommends using the `<% variable_name %>` syntax.
* This example shows how to pass in environment variables using the `--env` option:

  ```snowcli
  snow sql -q "select '<% ctx.env.test %>'" --env test=value_from_cli
  ```
* By default, Snowflake CLI removes comments in SQL query from the output. The following example uses the `--retain-comments` option to include the comments in the query results.

  Assume the `example.sql` file contains the following statements and comment:

  ```sqlexample
  select 'column1';
  -- My comment
  select 'column2';
  ```

  When you execute the following command, `-- My comment` appears in the query results.

  ```snowcli
  snow sql -f example.sql --retain-comments
  ```

  ```snowcli
  select 'column1';
  +-----------+
  | 'COLUMN1' |
  |-----------|
  | ABC       |
  +-----------+

  -- My comment
  select 'bar';
  +-----------+
  | 'COLUMN2' |
  |-----------|
  | 123       |
  +-----------+
  ```

---
title: snow sql commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/sql-commands/overview.md
section: Snowflake CLI
---

# snow sql commands

SQL commands provide developers the ability to execute SQL queries with Snowflake CLI.

* [snow sql](sql.md)

---
title: snow stage commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/overview.md
section: Snowflake CLI
---

# snow stage commands

Snowflake CLI supports the following commands to support Snowflake stage objects:

* [snow stage copy](copy.md)
* [snow stage create](create.md)
* [snow stage describe](describe.md)
* [snow stage drop](drop.md)
* [snow stage execute](execute.md)
* [snow stage list](list.md)
* [snow stage list-files](list-files.md)
* [snow stage remove](remove.md)

---
title: snow stage copy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/copy.md
section: Snowflake CLI
---

# snow stage copy

Copies all files from source path to target directory. This works for uploading to and downloading files from the stage, and copying between named stages.

## Syntax

```console
snow stage copy
  <source_path>
  <destination_path>
  --overwrite / --no-overwrite
  --parallel <parallel>
  --recursive / --no-recursive
  --auto-compress / --no-auto-compress
  --refresh / --no-refresh
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`source_path`
:   Source path for copy operation. Can be either stage path or local. You can use a glob pattern for local files but the pattern has to be enclosed in quotes.

`destination_path`
:   Target directory path for copy operation.

## Options

`--overwrite / --no-overwrite`
:   Overwrites existing files in the target path. Default: False.

`--parallel INTEGER`
:   Number of parallel threads to use when uploading files. Default: 4.

`--recursive / --no-recursive`
:   Copy files recursively with directory structure. Default: False.

`--auto-compress / --no-auto-compress`
:   Specifies whether Snowflake uses gzip to compress files during upload. Ignored when downloading. Default: False.

`--refresh / --no-refresh`
:   Specifies whether ALTER STAGE {name} REFRESH should be executed after uploading. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* One of `SOURCE_PATH` or `DESTINATION_PATH` must be a local directory, while the other should be a directory in the Snowflake stage. The stage path must start with “@”. For example:

  + `snow stage copy @my_stage dir/` - copies files from `my_stage` stage to the local `dir` directory.
  + `snow stage copy dir/ @my_stage` - copies files from the local `dir` directory to `my_stage`.
* You can specify multiple files matching a regular expression by using a glob pattern for the `source_path` argument. You must enclose the glob pattern in single or double quotes.

## Examples

* To copy files from the local machine to a stage, use a command similar to the following:

  ```snowcli
  snow stage copy local_example_app @example_app_stage/app
  ```

  ```output
  put file:///.../local_example_app/* @example_app_stage/app4 auto_compress=false parallel=4 overwrite=False
  +--------------------------------------------------------------------------------------
  | source           | target           | source_size | target_size | source_compression...
  |------------------+------------------+-------------+-------------+--------------------
  | environment.yml  | environment.yml  | 62          | 0           | NONE             ...
  | snowflake.yml    | snowflake.yml    | 252         | 0           | NONE             ...
  | streamlit_app.py | streamlit_app.py | 109         | 0           | NONE             ...
  +--------------------------------------------------------------------------------------
  ```
* To download files from a stage to a local directory, use a command similar to the following:

  ```snowcli
  mkdir local_app_backup
  snow stage copy @example_app_stage/app local_app_backup
  ```

  ```output
  get @example_app_stage/app file:///.../local_app_backup/ parallel=4
  +------------------------------------------------+
  | file             | size | status     | message |
  |------------------+------+------------+---------|
  | environment.yml  | 62   | DOWNLOADED |         |
  | snowflake.yml    | 252  | DOWNLOADED |         |
  | streamlit_app.py | 109  | DOWNLOADED |         |
  +------------------------------------------------+
  ```
* The following example copies all `.txt` files in a directory to a stage.

  ```snowcli
  snow stage copy "testdir/*.txt" @TEST_STAGE_3
  ```

  ```output
  put file:///.../testdir/*.txt @TEST_STAGE_3 auto_compress=false parallel=4 overwrite=False
  +------------------------------------------------------------------------------------------------------------+
  | source | target | source_size | target_size | source_compression | target_compression | status   | message |
  |--------+--------+-------------+-------------+--------------------+--------------------+----------+---------|
  | b1.txt | b1.txt | 3           | 16          | NONE               | NONE               | UPLOADED |         |
  | b2.txt | b2.txt | 3           | 16          | NONE               | NONE               | UPLOADED |         |
  +------------------------------------------------------------------------------------------------------------+
  ```

---
title: snow stage create
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/create.md
section: Snowflake CLI
---

# snow stage create

Creates a named stage if it does not already exist.

## Syntax

```console
snow stage create
  <stage_name>
  --encryption <encryption>
  --enable-directory
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`stage_name`
:   Identifier of the stage; for example: @my_stage.

## Options

`--encryption [SNOWFLAKE_FULL|SNOWFLAKE_SSE]`
:   Type of encryption supported for all files stored on the stage. Default: SNOWFLAKE_FULL.

`--enable-directory`
:   Specifies whether directory support is enabled for the stage. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `stage create` command creates a named stage if it does not already exist. The stage name can be a fully qualified name
or just a stage name. In the later case, the stage is created in the database and schema specified in
the connection details.

## Examples

The following example creates a stage called `new_stage` in the `bar` database:

```snowcli
snow stage create new_stage --database=bar --schema=public
```

```output
+-----------------------------------------------------+
| key    | value                                      |
|--------+--------------------------------------------|
| status | Stage area NEW_STAGE successfully created. |
+-----------------------------------------------------+
```

---
title: snow stage describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/describe.md
section: Snowflake CLI
---

# snow stage describe

Provides description of stage.

## Syntax

```console
snow stage describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the stage; for example: @my_stage.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow stage drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/drop.md
section: Snowflake CLI
---

# snow stage drop

Drops stage with given name.

## Syntax

```console
snow stage drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the stage; for example: @my_stage.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow stage execute
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/execute.md
section: Snowflake CLI
---

# snow stage execute

Execute immediate all files from the stage path. Files can be filtered with a glob-like pattern, e.g. `@stage/*.sql`, `@stage/dev/*`. Only files with `.sql` extension will be executed.

## Syntax

```console
snow stage execute
  <stage_path>
  --on-error <on_error>
  --variable <variables>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`stage_path`
:   Stage path with files to be execute. For example `@stage/dev/*`.

## Options

`--on-error [break|continue]`
:   What to do when an error occurs. Defaults to break. Default: break.

`--variable, -D TEXT`
:   Variables for the execution context; for example: `-D "<key>=<value>"`. For SQL files, variables are used to expand the template, and any unknown variable will cause an error (consider embedding quoting in the file).For Python files, variables are used to update the os.environ dictionary. Provided keys are capitalized to adhere to best practices. In case of SQL files string values must be quoted in `''` (consider embedding quoting in the file).

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

> **Note:**
>
> Snowflake CLI does not support executing Python files for Python versions 3.12 and above.

* The command searches for files with a `.sql` extension in the specified `STAGE_PATH` and executes `EXECUTE IMMEDIATE` on each of them. `STAGE_PATH` can be:

  + Only a stage name, such as `@scripts`, which executes all `.sql` files from the stage.
  + Glob-like pattern, such as `@scripts/dir/*`, which executes `.sql` files from the `dir` directory.
  + Direct file path, such as `@scripts/script.sql`, which executes only the `script.sql` file from the `scripts`.

The `--silent` options hides intermediate messages with file execution results.

When using Jinja templates for the SQL files, you can pass template variables using `-D` (or `--variable`) option, such as `-D "<key>=<value>"`. You must enclose string values in single quotes (`''`).

## Examples

* Specify only a stage name to execute all `.sql` files in the stage:

  ```snowcli
  snow stage execute "@scripts"
  ```

  ```output
  SUCCESS - scripts/script1.sql
  SUCCESS - scripts/script2.sql
  SUCCESS - scripts/dir/script.sql
  +------------------------------------------+
  | File                   | Status  | Error |
  |------------------------+---------+-------|
  | scripts/script1.sql    | SUCCESS | None  |
  | scripts/script2.sql    | SUCCESS | None  |
  | scripts/dir/script.sql | SUCCESS | None  |
  +------------------------------------------+
  ```
* Specify a glob-like pattern to execute all `.sql` files in the `dir` directory:

  ```snowcli
  snow stage execute "@scripts/dir/*"
  ```

  ```output
  SUCCESS - scripts/dir/script.sql
  +------------------------------------------+
  | File                   | Status  | Error |
  |------------------------+---------+-------|
  | scripts/dir/script.sql | SUCCESS | None  |
  +------------------------------------------+
  ```
* Specify a glob-like pattern to execute only `.sql` files in the `dir` directory that begin with “script”, followed by one character:

  ```snowcli
  snow stage execute "@scripts/script?.sql"
  ```

  ```output
  SUCCESS - scripts/script1.sql
  SUCCESS - scripts/script2.sql
  +---------------------------------------+
  | File                | Status  | Error |
  |---------------------+---------+-------|
  | scripts/script1.sql | SUCCESS | None  |
  | scripts/script2.sql | SUCCESS | None  |
  +---------------------------------------+
  ```
* Specify a direct file path with the `--silent` option:

  ```snowcli
  snow stage execute "@scripts/script1.sql" --silent
  ```

  ```output
  +---------------------------------------+
  | File                | Status  | Error |
  |---------------------+---------+-------|
  | scripts/script1.sql | SUCCESS | None  |
  +---------------------------------------+
  ```

---
title: snow stage list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/list.md
section: Snowflake CLI
---

# snow stage list

Lists all available stages.

## Syntax

```console
snow stage list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all stages that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow stage list-files
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/list-files.md
section: Snowflake CLI
---

# snow stage list-files

Lists the stage contents.

## Syntax

```console
snow stage list-files
  <stage_name>
  --pattern <pattern>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`stage_name`
:   Identifier of the stage; for example: @my_stage/path.

## Options

`--pattern TEXT`
:   Regex pattern for filtering files by name. For example –pattern “.\*.txt” will filter only files with .txt extension.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example lists the contents of the `jdoe.public.test` stage:

```snowcli
snow stage list-files jdoe.public.test
```

```output
ls @jdoe.public.test
+------------------------------------------------------------------------------+
| name            | size    | md5              | last_modified                 |
|-----------------+---------+------------------+-------------------------------|
| test/file.csv   | 195424  | 4fc596b5e00681d8 | Mon, 11 Mar 2024 17:09:01 GMT |
| test/data.csv   | 133248  | c0ddc25c1d3745d6 | Mon, 11 Mar 2024 17:08:57 GMT |
+------------------------------------------------------------------------------+
```

---
title: snow stage remove
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/stage-commands/remove.md
section: Snowflake CLI
---

# snow stage remove

Removes a file from a stage.

## Syntax

```console
snow stage remove
  <stage_name>
  <file_name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`stage_name`
:   Identifier of the stage; for example: @my_stage.

`file_name`
:   Name of the file to remove.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example removes the `app/pages/my_page.py` file from a stage:

```snowcli
snow stage remove example_app_stage app/pages/my_page.py
```

```output
+-------------------------------------------------+
| key    | value                                  |
|--------+----------------------------------------|
| name   | example_app_stage/app/pages/my_page.py |
| result | removed                                |
+-------------------------------------------------+
```

---
title: snow streamlit commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/overview.md
section: Snowflake CLI
---

# snow streamlit commands

Snowflake CLI supports the following commands for managing Streamlit apps:

* [snow streamlit deploy](deploy.md)
* [snow streamlit describe](describe.md)
* [snow streamlit drop](drop.md)
* [snow streamlit execute](execute.md)
* [snow streamlit get-url](get-url.md)
* [snow streamlit list](list.md)
* [snow streamlit share](share.md)

For more information about Streamlit apps, refer to [About Streamlit in Snowflake](../../../streamlit/about-streamlit.md) and [Add a Streamlit app](../../../native-apps/adding-streamlit.md).

---
title: snow streamlit deploy
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/deploy.md
section: Snowflake CLI
---

# snow streamlit deploy

Deploys a Streamlit app defined in the project definition file (snowflake.yml). By default, the command uploads environment.yml and any other pages or folders, if present. If you don’t specify a stage name, the `streamlit` stage is used. If the specified stage does not exist, the command creates it. If multiple Streamlits are defined in snowflake.yml and no entity_id is provided then command will raise an error.

## Syntax

```console
snow streamlit deploy
  <entity_id>
  --replace
  --prune / --no-prune
  --open
  --legacy
  --project <project_definition>
  --env <env_overrides>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`entity_id`
:   ID of streamlit entity.

## Options

`--replace`
:   Replaces the Streamlit app if it already exists. It only uploads new and overwrites existing files, but does not remove any files already on the stage. Default: False.

`--prune / --no-prune`
:   Delete files that exist in the stage, but not in the local filesystem. Default: False.

`--open`
:   Whether to open the Streamlit app in a browser. Default: False.

`--legacy`
:   Use legacy ROOT_LOCATION SQL syntax. Default: False.

`-p, --project TEXT`
:   Path where the Snowflake project is stored. Defaults to the current working directory.

`--env TEXT`
:   String in the format key=value. Overrides variables from the env section used for templates. Default: [].

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

This command creates a Streamlit app object in the database and a schema configured in the specified `connection`.

The command uploads local files to a specified stage and creates a Streamlit app using those files. You must
specify the main Python file and query warehouse. By default, the command uploads the `environment.yml` and `pages/` folder if present.
The Streamlit app is created in the database and schema configured in the specified `connection`.

If you don’t specify a stage name, the `streamlit` stage is used. If the specified stage does not exist, the command
creates it. You can modify the behavior by using command-line options.

If you specify the `--replace` option, the command uploads new files and overwrites existing files. It does not remove any files already on the stage.

If you specify the `--prune` option, the command removes files that exist in the stage, but not files in the local filesystem.

## Examples

```snowcli
snow streamlit deploy demo_app --replace
```

```output
Streamlit successfully deployed and available under https://app.snowflake.com/myorg/myacc/#/streamlit-apps/JDOE.PUBLIC.DEMO_APP
```

---
title: snow streamlit describe
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/describe.md
section: Snowflake CLI
---

# snow streamlit describe

Provides description of streamlit.

## Syntax

```console
snow streamlit describe
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the Streamlit app; for example: my_streamlit.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow streamlit drop
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/drop.md
section: Snowflake CLI
---

# snow streamlit drop

Drops streamlit with given name.

## Syntax

```console
snow streamlit drop
  <name>
  --if-exists
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the Streamlit app; for example: my_streamlit.

## Options

`--if-exists`
:   Only apply this operation if the specified object exists. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow streamlit execute
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/execute.md
section: Snowflake CLI
---

# snow streamlit execute

Executes a streamlit in a headless mode.

## Syntax

```console
snow streamlit execute
  <name>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the Streamlit app; for example: my_streamlit.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

* The command allows a Streamlit app to be executed without user interaction, such as for batch processing or automation tasks.
* Before executing this command, the following requirements must be met:

  + You must have a valid Snowflake connection.
  + The app must already be deployed in the Snowflake environment.
  + A valid configuration `snowflake.yml` file must exist with the `query_warehouse` and `stage` settings defined.
* The application logic, such as calculations and file processing, runs as if the app were displayed, but does not render any user-visible output.
* You must ensure that your Snowflake account, database, schema, and warehouse are properly configured before running the command.
* If an error, such as an invalid database configuration or missing files, occurs during execution, the command displays an error message in the terminal.

## Examples

* Execute the `my_streamlit_app` app in the current process without displaying any output.

  ```snowcli
  snow streamlit execute my_streamlit_app
  ```
* Retrieve the URL for the application after execution and open it in your default web browser.

  ```snowcli
  snow streamlit get-url my_streamlit_app --open
  ```

---
title: snow streamlit get-url
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/get-url.md
section: Snowflake CLI
---

# snow streamlit get-url

Returns a URL to the specified Streamlit app

## Syntax

```console
snow streamlit get-url
  <name>
  --open
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the Streamlit app; for example: my_streamlit.

## Options

`--open`
:   Whether to open the Streamlit app in a browser. Default: False.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

The `streamlit get-url` command returns a url link to an existing Streamlit application. You can also use the `--open` option to automatically
open the Streamlit in a new tab in your browser.

Note the following requirements:

* The app must already be deployed.
* You must use the same connection that was used to deploy the app.
* If your app is running under different database and schema than specified in the connection, you must provide them in name as a fully-qualified name, such as `database.schema.name`.

## Examples

* Get a URL for an app using the database and schema specified in the default connection and opens it in your browser:

  ```snowcli
  snow streamlit get-url my_streamlit_app --open
  ```

  ```output
  https://snowflake.com/provider-deduced-from-connection/#/streamlit-apps/DB.PUBLIC.MY_STREAMLIT_APP
  ```
* Get a URL for an app using a fully-qualified database and schema name:

  ```snowcli
  snow streamlit get-url database.schema.my_streamlit_app
  ```

  ```output
  https://snowflake.com/provider-deduced-from-connection/#/streamlit-apps/DATABASE.SCHEMA.MY_STREAMLIT_APP
  ```

---
title: snow streamlit list
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/list.md
section: Snowflake CLI
---

# snow streamlit list

Lists all available streamlits.

## Syntax

```console
snow streamlit list
  --like <like>
  --in <scope>
  --in-account
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

None

## Options

`--like, -l TEXT`
:   SQL LIKE pattern for filtering objects by name. For example, `list --like "my%"` lists all streamlit apps that begin with “my”. Default: %%.

`--in <TEXT TEXT>...`
:   Specifies the scope of this command using ‘–in <scope> <name>’, for example `list --in database my_db`. Default: (None, None).

`--in-account`
:   Lists objects across the entire account.

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

None.

---
title: snow streamlit share
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/streamlit-commands/share.md
section: Snowflake CLI
---

# snow streamlit share

Shares a Streamlit app with another role.

## Syntax

```console
snow streamlit share
  <name>
  <to_role>
  --connection <connection>
  --host <host>
  --port <port>
  --account <account>
  --user <user>
  --password <password>
  --authenticator <authenticator>
  --workload-identity-provider <workload_identity_provider>
  --private-key-file <private_key_file>
  --token <token>
  --token-file-path <token_file_path>
  --database <database>
  --schema <schema>
  --role <role>
  --warehouse <warehouse>
  --temporary-connection
  --mfa-passcode <mfa_passcode>
  --enable-diag
  --diag-log-path <diag_log_path>
  --diag-allowlist-path <diag_allowlist_path>
  --oauth-client-id <oauth_client_id>
  --oauth-client-secret <oauth_client_secret>
  --oauth-authorization-url <oauth_authorization_url>
  --oauth-token-request-url <oauth_token_request_url>
  --oauth-redirect-uri <oauth_redirect_uri>
  --oauth-scope <oauth_scope>
  --oauth-disable-pkce
  --oauth-enable-refresh-tokens
  --oauth-enable-single-use-refresh-tokens
  --client-store-temporary-credential
  --format <format>
  --verbose
  --debug
  --silent
  --enhanced-exit-codes
  --decimal-precision <decimal_precision>
```

## Arguments

`name`
:   Identifier of the Streamlit app; for example: my_streamlit.

`to_role`
:   Role with which to share the Streamlit app.

## Options

`--connection, -c, --environment TEXT`
:   Name of the connection, as defined in your `config.toml` file. Default: `default`.

`--host TEXT`
:   Host address for the connection. Overrides the value specified for the connection.

`--port INTEGER`
:   Port for the connection. Overrides the value specified for the connection.

`--account, --accountname TEXT`
:   Name assigned to your Snowflake account. Overrides the value specified for the connection.

`--user, --username TEXT`
:   Username to connect to Snowflake. Overrides the value specified for the connection.

`--password TEXT`
:   Snowflake password. Overrides the value specified for the connection.

`--authenticator TEXT`
:   Snowflake authenticator. Overrides the value specified for the connection.

`--workload-identity-provider TEXT`
:   Workload identity provider (AWS, AZURE, GCP, OIDC). Overrides the value specified for the connection.

`--private-key-file, --private-key-path TEXT`
:   Snowflake private key file path. Overrides the value specified for the connection.

`--token TEXT`
:   OAuth token to use when connecting to Snowflake.

`--token-file-path TEXT`
:   Path to file with an OAuth token to use when connecting to Snowflake.

`--database, --dbname TEXT`
:   Database to use. Overrides the value specified for the connection.

`--schema, --schemaname TEXT`
:   Database schema to use. Overrides the value specified for the connection.

`--role, --rolename TEXT`
:   Role to use. Overrides the value specified for the connection.

`--warehouse TEXT`
:   Warehouse to use. Overrides the value specified for the connection.

`--temporary-connection, -x`
:   Uses a connection defined with command-line parameters, instead of one defined in config. Default: False.

`--mfa-passcode TEXT`
:   Token to use for multi-factor authentication (MFA).

`--enable-diag`
:   Whether to generate a connection diagnostic report. Default: False.

`--diag-log-path TEXT`
:   Path for the generated report. Defaults to system temporary directory. Default: <system_temporary_directory>.

`--diag-allowlist-path TEXT`
:   Path to a JSON file that contains allowlist parameters.

`--oauth-client-id TEXT`
:   Value of client id provided by the Identity Provider for Snowflake integration.

`--oauth-client-secret TEXT`
:   Value of the client secret provided by the Identity Provider for Snowflake integration.

`--oauth-authorization-url TEXT`
:   Identity Provider endpoint supplying the authorization code to the driver.

`--oauth-token-request-url TEXT`
:   Identity Provider endpoint supplying the access tokens to the driver.

`--oauth-redirect-uri TEXT`
:   URI to use for authorization code redirection.

`--oauth-scope TEXT`
:   Scope requested in the Identity Provider authorization request.

`--oauth-disable-pkce`
:   Disables Proof Key for Code Exchange (PKCE). Default: `False`.

`--oauth-enable-refresh-tokens`
:   Enables a silent re-authentication when the actual access token becomes outdated. Default: `False`.

`--oauth-enable-single-use-refresh-tokens`
:   Whether to opt-in to single-use refresh token semantics. Default: `False`.

`--client-store-temporary-credential`
:   Store the temporary credential.

`--format [TABLE|JSON|JSON_EXT|CSV]`
:   Specifies the output format. Default: TABLE.

`--verbose, -v`
:   Displays log entries for log levels `info` and higher. Default: False.

`--debug`
:   Displays log entries for log levels `debug` and higher; debug logs contain additional information. Default: False.

`--silent`
:   Turns off intermediate output to console. Default: False.

`--enhanced-exit-codes`
:   Differentiate exit error codes based on failure type. Default: False.

`--decimal-precision INTEGER`
:   Number of decimal places to display for decimal values. Uses Python’s default precision if not specified. [env var: SNOWFLAKE_DECIMAL_PRECISION].

`--help`
:   Displays the help text for this command.

## Usage notes

None.

## Examples

The following example shares `my-app` with the custom `analyst` role:

```snowcli
snow streamlit share my-app analyst
```

---
title: Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/index.md
section: Snowflake CLI
---

# Snowflake CLI

## What is Snowflake CLI?

Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric workloads in
addition to SQL operations. It is a flexible and extensible tool that can accommodate modern development practices and
technologies.

With Snowflake CLI, developers can create, manage, update, and view apps running on Snowflake across workloads such as
Streamlit in Snowflake, the Snowflake Native App Framework, Snowpark Container Services, and Snowpark. It supports a range of Snowflake features,
including user-defined functions, stored procedures, Streamlit in Snowflake, and SQL execution.

## What’s in this guide?

This guide introduces and explains how to install and use Snowflake CLI. It includes the following sections:

* [Introducing Snowflake CLI](introduction/introduction.md)
* [Installing Snowflake CLI](installation/installation.md)
* [Configuring Snowflake CLI and connecting to Snowflake](connecting/connect.md)
* [Bootstrapping a project from a template](bootstrap-project/bootstrap.md)
* [About project definition files](project-definitions/about.md)
* [Managing Snowflake objects](objects/manage-objects.md)
* [Managing Snowflake stages](stages/manage-stages.md)
* [Managing Snowpark Container Services in Snowflake CLI](services/overview.md)
* [Using Snowpark in Snowflake CLI](snowpark/overview.md)
* [Using Snowflake Notebooks](notebooks/use-notebooks.md)
* [Managing Streamlit apps with Snowflake CLI](streamlit-apps/overview.md)
* [Using Snowflake Native App in Snowflake CLI](native-apps/overview.md)
* [Executing SQL statements](sql/execute-sql.md)
* [Managing Git repositories](git/overview.md)
* [Snowflake CLI command reference](command-reference/overview.md)

For more information about supported Snowflake products, see the following:

* [Snowflake Cortex](../../user-guide/snowflake-cortex/aisql.md) documentation
* [Native App Framework](../native-apps/native-apps-about.md) documentation
* [Snowflake notebooks](../../user-guide/ui-snowsight/notebooks.md) documentation
* [Snowpark Container Services](../snowpark-container-services/overview.md) documentation
* [Snowpark](../snowpark/index.md) documentation
* [SQL](../../reference.md) documentation
* [Git](../git/git-overview.md) documentation
* [Streamlit](../streamlit/about-streamlit.md) documentation

To see what changed in this release, see the [Snowflake CLI release notes](../../release-notes/clients-drivers/snowflake-cli.md).

Snowflake CLI is an open-source project available in the [Snowflake CLI Git repository](https://github.com/snowflakedb/snowflake-cli).

---
title: Snowflake CLI command reference
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/overview.md
section: Snowflake CLI
---

# Snowflake CLI command reference

Snowflake CLI supports commands for the following objects and activities:

* [snow](snow.md) command
* Bootstrap commands

  + [snow bootstrap commands](bootstrap-commands/overview.md)
* Connection commands

  + [snow connection commands](connection-commands/overview.md)
* Cortex commands

  + [snow cortex commands](cortex-commands/overview.md)
* dbt commands

  + [snow dbt commands](dbt-commands/overview.md)
* dcm commands

  + [snow dcm commands](dcm-commands/overview.md)
* Git commands

  + [snow git commands](git-commands/overview.md)
* Helpers commands

  + [snow helpers commands](helpers-commands/overview.md)
* Logs commands

  + [snow logs commands](logs-commands/overview.md)
* Notebook commands

  + [snow notebook commands](notebook-commands/overview.md)
* Snowflake Native App Framework commands

  + [snow app commands](native-apps-commands/overview.md)
* Snowflake objects commands

  + [snow object commands](object-commands/overview.md)
* Snowpark commands

  + [snow snowpark commands](snowpark-commands/overview.md)
  + [snow snowpark package commands](snowpark-commands/package-commands/overview.md)
* Snowpark Container Services (spcs) commands

  + [snow spcs image-registry commands](spcs-commands/image-registry-commands/overview.md)
  + [snow spcs image-repository commands](spcs-commands/image-repository-commands/overview.md)
  + [snow spcs compute-pool commands](spcs-commands/compute-pool-commands/overview.md)
  + [snow spcs service commands](spcs-commands/service-commands/overview.md)
* SQL commands

  + [snow sql commands](sql-commands/overview.md)
* Stage commands

  + [snow stage commands](stage-commands/overview.md)
* Streamlit commands

  + [snow streamlit commands](streamlit-commands/overview.md)

---
title: Snowpark Container Services (spcs) commands
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/command-reference/spcs-commands/overview.md
section: Snowflake CLI
---

# Snowpark Container Services (`spcs`) commands

> **Note:**
>
> You can use Snowpark Container Services from Snowflake CLI only if you have the necessary permissions to use Snowpark Container Services.

Snowflake CLI supports the following commands to support Snowpark Container Services:

> * [snow spcs image-registry commands](image-registry-commands/overview.md)
> * [snow spcs image-repository commands](image-repository-commands/overview.md)
> * [snow spcs compute-pool commands](compute-pool-commands/overview.md)
> * [snow spcs service commands](service-commands/overview.md)

---
title: Specify entities
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/specify-entities.md
section: Snowflake CLI
---

# Specify entities

In the `snowflake.yml` definition file, you can specify multiple entities. Each entity is identified by a unique key. The example below specifies two entities with the `entity_a` and `entity_b` keys:

```yaml
entities:
  entity_a:
    ...
  entity_b:
    ...
```

Each entity has to specify a type. Currently supported types include:

* [function](about.md)
* [procedure](about.md)
* [streamlit](../streamlit-apps/manage-apps/initialize-app.md)
* [application package](../native-apps/project-definitions.md)
* [application](../native-apps/project-definitions.md)

## Entity identifiers

You can specify multiple entities of the same type in the `snowflake.yml` file. You can name entities in the following ways:

* Use a unique key in the entities list.

  The following example shows using `entity_a` and `entity_b` as the unique keys:

  ```yaml
  entities:
    entity_a:
      ...
    entity_b:
      ...
  ```
* Specify an `identifier` name to each entity.

  The following example adds identifier names to the `entity_a` and `entity_b` entities:

  ```yaml
  entities:
    entity_a:
      identifier: entity_a_name
      ...
    entity_b:
      identifier:
        name: entity_a_name
  ```
* Add an `identifier` object to each entity.

  Using identifier objects allow to specify a name, database, and schema for each entity, as shown in the following example:

  ```yaml
  entities:
    entity_b:
      identifier:
        name: entity_a_name
        schema: public
        database: DEV
  ```

If you don’t specify an identifier, the entity key is used as the name of the object, without any database or schema qualification.

## Project mixins

In many cases you might find it useful to define project-wide default values. Mixins provide a way to extract common attributes out of individual entities. You can specify multiple mixins. You need to declare which mixins should be used by each entity using `meta.use_mixins` property.

When using mixins with an entity, you must ensure that all properties of a mixin can be applied to that entity. Applying a property that is not available on an entity causes an error. Consequently, in some cases you might need to use multiple mixins.

> **Note:**
>
> Mixin values are overridden by explicitly-declared entity attributes.

The following example includes two mixins: `stage_mixin` and `snowpark_shared`. The `my_dashboard` entity uses only `stage_mixin`, while the `my_function` entity uses both of the mixins.

```yaml
definition_version: 2
mixins:
  stage_mixin:
    stage: "my_stage"
  snowpark_shared:
    artifacts: ["app/"]
    imports: ["@package_stage/package.zip"]

entities:
  my_function:
    type: "function"
    ...
    meta:
      use_mixins:
        - "stage_mixin"
        - "snowpark_shared"
  my_dashboard:
    type: "dashboard"
    ...
    meta:
      use_mixins:
        - "stage_mixin"
```

If an entity uses multiple mixins that specify the same property, the entity uses the value of later mixin. In the following example, the value of key on the `foo` entity will be `mixin_2_value`.

```yaml
mixins:
  mixin_1:
    key: mixin_1_value
  mixin_2:
    key: mixin_2_value

entities:
  foo:
    meta:
      use_mixin:
      - mixin_1
      - mixin_2
```

The behavior of applying mixins values depends on value type. For scalar values (strings, numbers, Booleans) values are overridden.

| Mixin notation | Explicit result |
| --- | --- |
| ```yaml definition_version: 2 mixins:   mix1:     stage: A    mix2:     stage: B  entities:   test_procedure:     stage: C     meta:       use_mixins:         - mix1         - mix2 ``` | ```yaml definition_version: 2 entities:   test_procedure:     stage: C ``` |

In case of sequences, values are merged to create a new sequence. This implementation avoids creating duplicate entries in the sequence.

| Mixin notation | Explicit result |
| --- | --- |
| ```yaml definition_version: 2 mixins:   mix1:     artifacts:     - a.py    mix2:     artifacts:     - b.py  entities:   test_procedure:     artifacts:       - app/     meta:       use_mixins:         - mix1         - mix2 ``` | ```yaml definition_version: 2 entities:   test_procedure:     artifacts:       - a.py       - b.py       - app/ ``` |

For mapping values new keys are being added and existing values are updated. The update is recursive.

| Mixin notation | Explicit result |
| --- | --- |
| ```yaml definition_version: 2 mixins:   mix1:     secrets:       secret1: v1    mix2:     secrets:       secret2: v2  entities:   test_procedure:     secrets:       secret3: v3     meta:       use_mixins:         - mix1         - mix2 ``` | ```yaml definition_version: 2 entities:   test_procedure:     secrets:       secret1: v1       secret2: v2       secret3: v3 ``` |
| ```yaml definition_version: 2 mixins:   mix1:     secrets:       secret_name: v1    mix2:     secrets:       secret_name: v2  entities:   test_procedure:     secrets:       secret_name: v3     meta:       use_mixins:         - mix1         - mix2 ``` | ```yaml definition_version: 2 entities:   test_procedure:     secrets:       secret_name: v3 ``` |
| ```yaml definition_version: 2 mixins:   shared:     identifier:       schema: foo  entities:   sproc1:     identifier:       name: sproc     meta:       use_mixins: ["shared"]   sproc2:     identifier:       name: sproc       schema: from_entity     meta:       use_mixins: ["shared"] ``` | ```yaml definition_version: 2 entities:   sproc1:     identifier:       name: sproc       schema: foo   sproc2:     identifier:       name: sproc       schema: from_entity ``` |

---
title: Upload an existing Python package
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/upload.md
section: Snowflake CLI
---

# Upload an existing Python package

Snowflake CLI allows you to add existing Python packages to Snowpark imports using the `snow snowpark package` commands. You can use already implemented packages, such as those from PyPi, in your functions and procedures.

To add a Python package to Snowpark imports, do the following:

1. Check whether a package is already available.
2. Download a package and create a Snowflake artifact.
3. Upload the package to a Snowflake stage.
4. Use the package in Snowpark procedures and functions.

## Check whether a package is already available

To check whether a package is not already available use the `snow snowpark package lookup` command.

The following example illustrates looking up a package that is already available on the Snowflake Anaconda channel:

```bash
snow snowpark package lookup numpy
```

```output
Package `numpy` is available in Anaconda. Latest available version: 1.26.4.
```

If a package is not available on the Snowflake Anaconda channel, you can get a message similar to the following:

```bash
snow snowpark package lookup july
```

```output
Package `july` is not available in Anaconda. To prepare Snowpark compatible package run:

  snow snowpark package create july
```

For more information, see the [snowpark package lookup](../command-reference/snowpark-commands/package-commands/lookup.md) command.

## Download a package and create a Snowflake artifact

To download a package and create a Snowflake artifact to upload use the `snow snowpark package create` command.

```bash
snow snowpark package create <name>
```

where:

* `<name>` can be any requirement specifier supported by `pip`, such as a package name, an URL for a package, or a local file path.

Additional options:

* `--allow-shared-libraries`: Allows shared (`.so`/`.dll`) libraries, when using packages installed through `pip`.
* `--ignore-anaconda`: Does not lookup packages on Snowflake Anaconda channel.
* `--index-url`: Specifies the base URL of the Python Package Index to use for package lookup. This URL should point to a repository compliant with PEP 503 (the simple repository API) or a local directory laid out in the same format.
* `--skip-version-check`: Skips comparing versions of dependencies between requirements and Anaconda.

The following examples illustrate some different situations for creating Snowflake artifacts:

* Example: create a package with Anaconda dependencies
* Example: create a package using the --ignore-anaconda option
* Example: create a package already available in the Snowflake Anaconda channel

### Example: create a package with Anaconda dependencies

This example creates a Python package as a zip file that can be uploaded to a stage and later imported by a Snowpark Python app. Dependencies for “july” package are found on the Anaconda channel, so they were excluded from the `.zip` file. The command displays the packages you would need to include in `requirements.txt` of your Snowpark project.

```bash
snow snowpark package create july==0.1
```

```output
Package july.zip created. You can now upload it to a stage using
snow snowpark package upload -f july.zip -s <stage-name>`
and reference it in your procedure or function.
Remember to add it to imports in the procedure or function definition.

The package july==0.1 is successfully created, but depends on the following
Anaconda libraries. They need to be included in project requirements,
as their are not included in .zip.
matplotlib
numpy
```

### Example: create a package using the `--ignore-anaconda` option

This example creates the `july.zip` package that you can use in your Snowpark project without needing to add any dependencies to the `requirements.txt` file. The error messages indicate that some packages contain shared libraries, which might not work, such as when creating a package using Windows.

```bash
snow snowpark package create july==0.1 --ignore-anaconda --allow-shared-libraries
```

```output
2024-05-09 15:34:02 ERROR Following dependencies utilise shared libraries, not supported by Conda:
2024-05-09 15:34:02 ERROR contourpy
numpy
pillow
kiwisolver
matplotlib
fonttools
2024-05-09 15:34:02 ERROR You may still try to create your package with --allow-shared-libraries, but the might not work.
2024-05-09 15:34:02 ERROR You may also request adding the package to Snowflake Conda channel
2024-05-09 15:34:02 ERROR at https://support.anaconda.com/

Package july.zip created. You can now upload it to a stage using
snow snowpark package upload -f july.zip -s <stage-name>`
and reference it in your procedure or function.
Remember to add it to imports in the procedure or function definition.
```

### Example: create a package already available in the Snowflake Anaconda channel

This example fails to create the package because it already exists. You can still forcibly create the package by using the `--ignore-anaconda` option.

```bash
snow snowpark package create matplotlib
```

```output
Package matplotlib is already available in Snowflake Anaconda Channel.
```

For more information about creating a package, see the [snowpark package create](../command-reference/snowpark-commands/package-commands/create.md) command.

## Upload the package to a Snowflake stage

To upload your package, use the `snow snowpark package upload` command.

This command uploads a Python package zip file to a Snowflake stage so it can be referenced in the imports of a procedure or function.

```bash
snow snowpark package upload --file="july.zip" --stage="packages"
```

```output
Package july.zip UPLOADED to Snowflake @packages/july.zip.
```

## Use the package in Snowpark procedures and functions

To use the package in procedures or functions, add it to the `imports` parameter of [Snowpark definition](create.md) section in `snowflake.yml`.

> ```yaml
> get_custom_package_version:
>   handler: "functions.get_custom_package_version"
>   signature: ""
>   returns: string
>   type: function
>   imports:
>     - "@packages/july.zip"
>   meta:
>     use_mixins:
>       - snowpark_shared
> ```
>
> Then import your package in the function handler.
>
> ```python
> # functions.py
> import july
>
> def get_custom_package_version():
>   return july.__VERSION__
> ```

---
title: Use template functions
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/use-template-functions.md
section: Snowflake CLI
---

# Use template functions

To enable the concatenation of SQL identifiers such as database names and schema names, and to provide flexibility in using quoted or unquoted identifiers in different contexts, Snowflake CLI provides the following set of utility functions you can use in project template definition templates:

* fn.concat_ids()
* fn.str_to_id()
* fn.id_to_str()
* fn.get_username()
* fn.sanitize_id()

## `fn.concat_ids()`

* Input: one or more string arguments (SQL ID or plain String)
* Output: a valid SQL ID (quoted or unquoted)

The `fn.concat_ids()` function concatenates multiple string arguments into a single string representing a SQL ID (quoted or unquoted). If any of the input strings is a valid quoted identifier, it will be unescaped before the concatenation. The resulting string is then escaped and quoted if it contains non-SQL safe characters or if any of the input strings was a valid quoted identifier.

Examples:

* Calling `fn.concat_ids('id1_', '"quoted_id2"')` outputs `"id1_quoted_id2"` because one of the input values is a quoted identifier.
* Calling `fn.concat_ids('id1_', 'id2')` outputs `id1_id2` because none of the input values is a quoted identifier and none of the input values contains non SQL safe characters.

## `fn.str_to_id()`

* Input: one or more string arguments (SQL ID or plain String)
* Output: a valid SQL ID (quoted or unquoted)

The `fn.str_to_id()` function returns a string as a an ID. If the input string contains a valid quoted or unquoted identifier, the function returns it as is. However, if the input string contains unsafe SQL characters that are not properly quoted, the function returns a quoted ID that escapes the unsafe characters.

Examples:

* Calling `fn.str_to_id('id1')` returns `id1` because it is a valid unquoted identifier.
* Calling `fn.str_to_id('unsafe"id')` returns `"unsafe""id"` because it contains unsafe SQL characters.

## `fn.id_to_str()`

* Input: one string argument (SQL ID or plain String)
* Output: a plain string

If the input is a valid SQL ID, the function returns an unescaped plain String. Otherwise, the function returns the input string as is.

Examples:

* Calling `fn.id_to_str('id1')`, returns `id1` because it is already unquoted.
* Calling `fn.id_to_str('"quoted""id.example"')` returns `quoted"id.example`.

## `fn.get_username()`

* Input: one optional string containing the fallback value
* Output: current username detected from the Operating System

Returns the current username from the operating system environment variables. If the current username is not found or is empty, it will either return an empty value or use the fallback value if one is provided.

Examples:

* `fn.get_username('default_user')` returns the current username if found, otherwise, it returns `default_user`.

## `fn.sanitize_id()`

* Input: one string argument
* Output: a valid non-quoted SQL ID

The function `fn.sanitize_id()` removes any unsafe SQL characters from the input and returns it as a valid unquoted SQL ID. If the result does not start with a letter or an underscore, it appends an underscore to it. For very long strings, the function truncates the string to 255 characters.

Examples:

* When using `fn.sanitize_id('Some.id"With_Special_Chars')` the output is `SomeidWith_Special_Chars`.
* When using `fn.sanitize_id('1abc')` the output is `_1abc`.

## Sample use case

The following example shows how to use these functions in `snowflake.yml` project definition files:

```yaml
definition_version: 2
entities:
  pkg:
    type: application package
    identifier: <% fn.concat_ids(ctx.env.app_name, ctx.env.pkg_suffix) %>
    artifacts:
      - src: app/*
        dest: ./
  app:
    type: application
    identifier: <% fn.concat_ids(ctx.env.app_name, ctx.env.app_suffix) %>

env:
  app_name: myapp_base_name_<% fn.sanitize_id(fn.get_username()) %>
  app_suffix: _app_instance
  pkg_suffix: _pkg
```

The following example illustrates how to use the functions in a SQL file:

```snowcli
DESC APPLICATION <% fn.str_to_id(ctx.entities.app.identifier) %>;
DESC APPLICATION PACKAGE <% fn.str_to_id(ctx.entities.pkg.identifier) %>;
```

---
title: Use variables in SQL
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/project-definitions/use-sql-variables.md
section: Snowflake CLI
---

# Use variables in SQL

> **Note:**
>
> Support for variables requires project definition version 1.1.

You can also use project files to define variables that other commands, such as `snow sql`, can use. The `env` section in the project definition file(typically, `snowflake.yml`) lets you define variables as shown:

```yaml
definition_version: 2
env:
  database: "dev"
  role: "eng_rl"
```

After adding the `env` section to the project definition file, you can pass the variables to the `snow sql` command instead of specifying the variable and value on the command line.

Instead specifying the database and role on the command line with the `--variable` option, as shown:

```bash
snow sql \
-q "grant usage on database <% database %> to <% role %>" \
-D "database=dev" \
-D "role=eng_rl"
```

you can specify the variables defined in the `env` section as shown:

```bash
snow sql -q "grant usage on database <% ctx.env.database %> to <% ctx.env.role %>"
```

You can include the `env` section in addition to any other sections you include in the project definition file.

```yaml
definition_version: 2
entities:
  test_function:
    type: "function"
    stage: "dev_deployment"
    artifacts: ["app/"]
    handler: "functions.hello_function"
    signature: ""
    returns: string

  hello_procedure:
    type: "procedure"
    stage: "dev_deployment"
    artifacts: ["app/"]
    handler: "procedures.hello_procedure"
    signature:
      - name: "name"
        type: "string"
    returns: string

env:
  database: "dev"
  role: "eng_rl"
```

> **Note:**
>
> If your current project definition file uses `definition_version: 1`, you must update it to `definition_version: 1.1` if you want to take advantage of the variables feature. If you do not change the value, Snowflake CLI ignores the `env` section, but the other types of projects (`snowpark`, in this example) still work as expected.

You can override any variable defined the in `snowflake.yml` project definition file by setting a shell environment variable by the same name (case-sensitive). For example, to override the `database` value defined in the example, you can execute the following shell command:

```bash
export database="other"
```

For more information about using `env` variables, see [Storing variables in the snowflake.yml project definition file](../sql/execute-sql.md).

---
title: Using Snowflake Native App in Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/overview.md
section: Snowflake CLI
---

# Using Snowflake Native App in Snowflake CLI

Snowflake Native App developers can now initiate their own Snowflake Native App code repository from an existing git template, create and deploy their application package and application instance within minutes, and drop these objects once they are done verifying this behavior — all through the Snowflake CLI without requiring any SQL knowledge.

Developers no longer need to keep track of different platforms for performing uploads to stage or creating Snowflake objects, and will have an easier time with their local development of a Snowflake Native App.

Before you can get started with the CLI commands, here are a few new concepts you will find useful:

* [About Snowflake Native App projects](about-projects.md)
* [Project definition files](project-definitions.md)
* [Creating and managing Snowflake Native App objects](create-manage-apps.md)

---
title: Using Snowflake Notebooks
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/notebooks/use-notebooks.md
section: Snowflake CLI
---

# Using Snowflake Notebooks

Snowflake CLI includes the following `snow notebook` commands that let you create and execute [Snowflake notebooks](../../../user-guide/ui-snowsight/notebooks.md) from the command line:

* [snow notebook create](../command-reference/notebook-commands/create.md)
* [snow notebook deploy](../command-reference/notebook-commands/deploy.md)
* [snow notebook execute](../command-reference/notebook-commands/execute.md)
* [snow notebook get-url](../command-reference/notebook-commands/get-url.md)
* [snow notebook open](../command-reference/notebook-commands/open.md)
* [Snowflake notebooks](../../../user-guide/ui-snowsight/notebooks.md)

## Create a notebook

> **Note:**
>
> Beginning with version 3.4.0, Snowflake CLI added the `snow notebook deploy` command to replace the `snow notebook create` command. To support backward compatibility, you can still create a notebook using the `snow notebook create` command, but Snowflake recommends that you begin using the new Deploy and create a notebook procedure.

The `snow notebook create` command creates a notebook from an existing notebook on stage. The command returns a link to the new
notebook. The following example creates the MY_NOTEBOOK notebook from the specified staged notebook:

```snowcli
snow notebook create MY_NOTEBOOK -f @MY_STAGE/path/to/notebook.ipynb
```

The command creates the notebook in the default warehouse defined for the connection. You can use the `--warehouse` option to specify an
alternative warehouse or to specify one if the connection doesn’t define a default warehouse.

## Deploy and create a notebook

The `snow notebook deploy` command uploads local files to a stage and creates a new Notebook object inside your chosen database and schema. Your project definition file should specify the main notebook file and query warehouse. The `--replace` option replaces the specified Notebook object if it already exists.

Each notebook in Snowflake must include a `snowflake.yml` project definition file.

The following example shows a sample `snowflake.yml` notebook project definition file:

```yaml
definition_version: 2
entities:
  my_notebook:
    type: notebook
    query_warehouse: xsmall
    notebook_file: notebook.ipynb
    runtime_environment_version: "2025.07"
    artifacts:
    - notebook.ipynb
    - data.csv
```

The following table describes the properties of a notebook [project definition](../project-definitions/about.md):

Notebook project definition properties

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `notebook`. |
| **query_warehouse**  *required*, *string* | Snowflake warehouse to host the notebook. |
| **notebook_file**  *required*, *string* | Path to the notebook file. |
| **artifacts**  *required*, *string sequence* | List of files uploaded to the stage. Notebook file should be included in this list. |
| **stage_path**  *optional*, *string* | Path to the stage where the artifacts will be stored. Default: `notebooks/<notebook_id>`. |
| **compute_pool**  *optional*, *string* | Compute pool for a [containerized notebook](../../snowflake-ml/notebooks-on-spcs.md) to use.  **Note:** Containerized notebooks are currently in PuPr. |
| **runtime_name**  *optional*, *string* | Name of the Container Runtime for a [containerized notebook](../../snowflake-ml/notebooks-on-spcs.md) to use. The following values are valid:   * `SYSTEM$BASIC_RUNTIME` for CPU runtime * `SYSTEM$GPU_RUNTIME` for GPU runtime   **Note:** Containerized notebooks are currently in PuPr. |
| **runtime_environment_version**  *optional*, *string* | Runtime environment version for a notebook entity in your project definition file.  Notebook entity deployments will be rejected if both `compute_pool` and `runtime_environment_version` are specified in the configuration, leading to a validation failure.  **Note:** This field currently applies only to notebooks running on standard Snowflake warehouses, not those using compute pools (containerized notebooks). |
| **identifier**  *optional*, *string* | Optional Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-notebook-id   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (e.g., `’”My Notebook”’`). * Object  ```yaml   identifier:     name: my-notebook-id     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully qualified name in the `name` property (such as `mydb.schema1.my-notebook`). |

The following example uploads the files specified in your project definition file and creates a new notebook named `my_notebook`:

```snowcli
snow notebook deploy my_notebook
```

```output
Uploading artifacts to @notebooks/my_notebook
  Creating stage notebooks if not exists
  Uploading artifacts
Creating notebook my_notebook
Notebook successfully deployed and available under https://snowflake.com/provider-deduced-from-connection/#/notebooks/DB.SCHEMA.MY_NOTEBOOK
```

## Execute a notebook

The snow notebook execute command executes a notebook in headless mode. Currently, the command only returns a message indicating whether
the notebook executed successfully.

```snowcli
snow notebook execute MY_NOTEBOOK
```

```output
Notebook MY_NOTEBOOK executed.
```

---
title: Using Snowpark in Snowflake CLI
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/snowpark/overview.md
section: Snowflake CLI
---

# Using Snowpark in Snowflake CLI

The [Snowpark API](../../snowpark/index.md) provides an intuitive library for querying and processing data at scale
in Snowflake, without using SQL. Using a library for any of three languages, you can build applications that process data in Snowflake—without moving
data to the system where your application code runs—and process at scale as part of the elastic and serverless Snowflake engine.

Snowflake CLI gives developers convenient tooling for developing and managing their Snowpark functions and procedures.
To create and maintain Snowpark functions and procedures, use the following process:

* [Initialize](initialize.md) — create a boilerplate

  The `snow init <project-name> --template example_snowpark` command creates a boilerplate project that you can customize.
* [Create](create.md) — create a project definition

  You edit the `snowflake.yml` file with the project details.
* [Build](build.md) — create artifacts

  The `snow snowpark build` command builds the Snowpark project as a `.zip` archive that can be used by the `snow snowpark deploy` command. The archive is built using only the `src` directory specified in the `snowflake.yml` file.
* [Deploy](deploy.md) — create Snowflake objects

  The `snow snowpark deploy` command uploads local files to the specified stage and creates procedure and function objects defined in the project.
* [Execute](execute.md) — use deployed procedures and functions

  The `snow snowpark execute` command executes deployed procedures and functions.
* [Upload](upload.md) — upload already implemented Snowpark functions, procedures, and custom packages, such as from PyPi, in your projects.

  The `snow snowpark package` commands let you reuse existing packages.
* [Manage](manage.md) — manage your Snowpark functions and procedures

  The `snow snowpark` and `snow object` commands let you create, list, execute, and delete Snowpark functions and procedures.

---
title: Validating an application package
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/native-apps/validate-app.md
section: Snowflake CLI
---

# Validating an application package

## Prerequisites

* You must have an existing connection in your `config.toml` file.
* You must have a `snowflake.yml` file in your Snowflake Native App project.

Snowflake CLI automatically validates a Snowflake Native App setup script, along with any other SQL files included using the SQL [EXECUTE IMMEDIATE](../../../sql-reference/sql/execute-immediate.md) statement, when you run (`snow app run`) or deploy (`snow app deploy`) an application. It uses the script most recently uploaded by one of the commands. Validation checks for SQL syntax errors, invalid object references, and best practices. If the script validation fails, the executed command aborts but does not automatically roll back the staged files.

For more information about Snowflake Native App setup scripts, see [Create the setup script](../../native-apps/creating-setup-script.md) in the Snowflake Native App Framework documentation.

## How to validate a setup script manually

Occasionally, you might want to validate a setup script before deploying an application to avoid potential impacts that could occur if validation fails during the deployment process. The `snow app validate` command validates a setup script without needing to run or deploy an application. It uploads source files to a separate scratch stage that drops automatically after the command completes to avoid disturbing files in the application’s source stage.

1. [Create a connection](../connecting/connect.md), if necessary.
2. Execute the `snow app validate` command from within your project, similar to the following:

   > ```snowcli
   > snow app validate --connection="dev"
   > ```

   When successful, the command returns the following message:

   > ```output
   > Snowflake Native App validation succeeded.
   > ```

   If validation fails, the following error message, along with any error messages, is displayed:

   > ```output
   > Snowflake Native App setup script failed validation.
   > ```

If you want to see the raw validation output as JSON, you can execute `snow app validate --format json`, as shown:

```snowcli
snow app validate --format json
```

```output
{
    "errors": [],
    "warnings": [],
    "status": "SUCCESS"
}
```

where:

* **errors** shows a list of errors, if they exist. Errors cause validation to fail.
* **warnings** shows a list of warnings, if they exist.
* **status** shows the result of the validation: SUCCESS or FAILURE.

If validation encounters any errors, which cause validation to fail, or warnings, which lets is succeed, the command displays the following information for the error or warning:

* `message`: Human-readable message for the error or warning.
* `cause`: Reason why the SQL construct is considered an error or warning.
* `errorCode`: Numeric code associated with the error or warning.
* `fileName`: Name of the file containing the error or warning, relative the stage root.
* `line`: Line number in the file identifying the location of the error or warning.
* `column`: Column number in the line where the error or warning occurred.

The following example shows a failed validation that contained both warnings and errors:

```snowcli
snow app validate --format json
```

```output
{
    "errors": [
        {
            "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/empty.sql': Empty SQL statement.",
            "cause": "Empty SQL statement.",
            "errorCode": "000900",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/empty.sql",
            "line": -1,
            "column": -1
        },
        {
            "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/second.sql': Unsupported feature 'CREATE VERSIONED SCHEMA without OR ALTER'.",
            "cause": "Unsupported feature 'CREATE VERSIONED SCHEMA without OR ALTER'.",
            "errorCode": "000002",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/second.sql",
            "line": -1,
            "column": -1
        },
        {
            "message": "Error in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql': File '/does-not-exist.sql' cannot be found in the same stage as the setup script is located.",
            "cause": "File '/does-not-exist.sql' cannot be found in the same stage as the setup script is located.",
            "errorCode": "093159",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
            "line": -1,
            "column": -1
        }
    ],
    "warnings": [
        {
            "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 11 at position 35: APPLICATION ROLE should be created with IF NOT EXISTS.",
            "cause": "APPLICATION ROLE should be created with IF NOT EXISTS.",
            "errorCode": "093352",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
            "line": 11,
            "column": 35
        },
        {
            "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 15 at position 13: CREATE Table statement in the setup script should have \"IF NOT EXISTS\", \"OR REPLACE\", or \"OR ALTER\".",
            "cause": "CREATE Table statement in the setup script should have \"IF NOT EXISTS\", \"OR REPLACE\", or \"OR ALTER\".",
            "errorCode": "093351",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
            "line": 15,
            "column": 13
        },
        {
            "message": "Warning in file '@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql' on line 15 at position 13: Table identifier 'MY_TABLE' should include its parent schema name.",
            "cause": "Table identifier 'MY_TABLE' should include its parent schema name.",
            "errorCode": "093353",
            "fileName": "@STAGE_SNOWFLAKE_CLI_SCRATCH/setup_script.sql",
            "line": 15,
            "column": 13
        }
    ],
    "status": "FAIL"
}
```

---
title: Working with image registries and repositories
source: https://docs.snowflake.com/en/developer-guide/snowflake-cli/services/manage-images.md
section: Snowflake CLI
---

# Working with image registries and repositories

Snowpark Container Services provides an [OCIv2](https://github.com/opencontainers/distribution-spec/blob/main/spec.md)-compliant image registry service and a storage unit call repository to store images. You can use the following Snowflake CLI commands to manage Snowpark Container Services image registries and repositories:

* Manage image registries
* Manage image repositories

For more information about Snowpark Container Services image registries and repositories, see [Snowpark Container Services: Working with an image registry and repository](../../snowpark-container-services/working-with-registry-repository.md).

## Manage image registries

Snowflake CLI lets you perform the following tasks with Snowpark Container Services image repositories:

* Get environment tokens for registry authentication
* Log in to an image registry
* Retrieve the URL for an image registry

For common operations, such as listing or dropping, Snowflake CLI uses `snow object` commands as described in [Managing Snowflake objects](../objects/manage-objects.md).

### Get environment tokens for registry authentication

You can use the [snow spcs image-registry token](../command-reference/spcs-commands/image-registry-commands/token.md) command to return the token associated with the specified
connection that you can use to authenticate with the registry.

```snowcli
snow spcs image-registry token --connection mytest
```

```output
+----------------------------------------------------------------------------------------------------------------------+
| key        | value                                                                                                   |
|------------+---------------------------------------------------------------------------------------------------------|
| token      | ****************************************************************************************************    |
|            | ****************************************************************************************************    |
| expires_in | 3600                                                                                                    |
+----------------------------------------------------------------------------------------------------------------------+
```

You can then use that token to log in to a Docker container by piping it to the `docker login` command, similar to the following:

```snowcli
snow spcs image-registry token --format=JSON | docker login <org>-<account>.registry.snowflakecomputing.com -u 0sessiontoken --password-stdin
```

### Log in to an image registry

The [snow spcs image-registry login](../command-reference/spcs-commands/image-registry-commands/login.md) logs you into an image repository with the credentials specified for your connection. Before logging in, you must meet the following prerequisites:

* [Docker Desktop](https://www.docker.com/products/docker-desktop/) must be installed because the command uses docker to log in to Snowflake.
* The current role must have READ privileges for the image repository in the account to get the registry URL.

To log in to an image registry with your account credentials, use the following:

```snowcli
snow spcs image-registry login
```

```output
Login Succeeded
```

### Retrieve the URL for an image registry

The [snow spcs image-registry url](../command-reference/spcs-commands/image-registry-commands/url.md) command returns a URL for an image repository. The current role must have READ privileges for the image repository in the account to get the registry URL.

To get the URL for a repository, do the following:

```snowcli
snow spcs image-registry url
```

```output
<orgname-acctname>.registry.snowflakecomputing.com
```

## Manage image repositories

Snowflake CLI lets you perform the following tasks with Snowpark Container Services image repositories:

* Create an image repository
* Create and deploy an image repository from a project definition
* Retrieve the URL for an image repository
* List tags and images in an image repository

For common operations, such as listing or dropping, Snowflake CLI uses `snow object` commands as described in [Managing Snowflake objects](../objects/manage-objects.md).

### Create an image repository

The [snow spcs image-repository create](../command-reference/spcs-commands/image-repository-commands/create.md) command creates a new image repository in the current schema.

To create an image repository, enter a command similar to the following:

```snowcli
snow spcs image-repository create tutorial_repository
```

```output
+-------------------------------------------+
| key    | value                            |
|--------+----------------------------------|
| status | Statement executed successfully. |
+-------------------------------------------+
```

### Create and deploy an image repository from a project definition

You can deploy an image repository to a stage by creating a `snowflake.yml` project definition file and executing the `snow spcs image-repository deploy` command.

The following shows a sample `snowflake.yml` project definition file:

```yaml
definition_version: 2
entities:
  my_image_repository:
    type: image-repository
    identifier: my_image_repository
```

The following table describes the properties of a compute pool project definition.

Image repository project definition properties

| Property | Definition |
| --- | --- |
| **type**  *required*, *string* | Must be `image-repository`. |
| **identifier**  *optional*, *string* | Snowflake identifier for the entity. The value can have the following forms:   * String identifier text  ```yaml   identifier: my-image-repository   ```  Both unquoted and quoted identifiers are supported. To use quoted identifiers, include the surrounding quotes in the YAML value (for example, `"My Image Repository"`). * Object  ```yaml   identifier:     name: my-image-repository     schema: my-schema # optional     database: my-db # optional   ```  **Note:** An error occurs if you specify a `schema` or `database` and use a fully qualified name in the `name` property (such as `mydb.schema1.my-app`). |

To create and deploy the image repository, do the following:

1. Change your current directory to the directory containing the project definition file.
2. Run a `snow spcs image-repository deploy` command similar to the following:

   ```snowcli
   snow spcs image-repository deploy
   ```

   ```output
   +---------------------------------------------------------------------+
   | key    | value                                                      |
   |--------+------------------------------------------------------------|
   | status | Image Repository MY_IMAGE_REPOSITORY successfully created. |
   +---------------------------------------------------------------------+
   ```

### Retrieve the URL for an image repository

The [snow spcs image-repository url](../command-reference/spcs-commands/image-repository-commands/url.md) command gets the URL for an image repository.

To get the URL, enter a command similar to the following:

```snowcli
snow spcs image-repository url tutorial_repository
```

```output
<orgname-acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
```

### List tags and images in an image repository

The [snow spcs image-repository list-images](../command-reference/spcs-commands/image-repository-commands/list-images.md) command lets you get the images and tags for an image repository.

To list the images and tags in a repository, enter a command similar to the following, which lists the images in a repository named `images` in the `my_db` database:

```snowcli
snow spcs image-repository list-images images --database my_db
```

```output
+----------------------------+---------------+---------+-------------------------------------------------+-----------------------------------------+
| created_on                 | image_name    | tags    | digest                                          | image_path                              |
|----------------------------+---------------+---------+-------------------------------------------------+-----------------------------------------|
| 2024-10-11 14:23:49-07:00  | echo_service  | latest  | sha256:a8a001fef406fdb3125ce8e8bf9970c35af7084  | my_db/test_schema/images/echo_service:  |
|                            |               |         | fc33b0886d7a8915d3082c781                       | latest                                  |
| 2024-10-14 22:21:14-07:00  | test_counter  | latest  | sha256:8cae96dac29a4a05f54bb5520003f964baf67fc  | my_db/test_schema/images/test_counter:  |
|                            |               |         | 38dcad3d2c85d6c5aa7381174                       | latest                                  |
+----------------------------+---------------+---------+-------------------------------------------------+-----------------------------------------+
```

## Native Apps Framework

Build and distribute data applications using the Snowflake Native App Framework.

---
title: About release channels, versions, and patches
source: https://docs.snowflake.com/en/developer-guide/native-apps/release-channels-versions.md
section: Native Apps Framework
---

# About release channels, versions, and patches

This topic provides a general overview of release channels and how they are used to manage
updates to an app, including versions and patches.

## About release channels

Release channels allow providers to publish apps at different stages of the app development lifecycle. For
example, a provider can use release channels to perform the following tasks:

* Test an app locally in the provider account.
* Publish an app to consumers as a preview or for user acceptance testing (UAT).
* Publish the app to a production environment.

Release channels also allow providers to manage versions and patches of an app. By using release channels,
providers can create and publish multiple versions and patches of an app at the same time.

Using release channels, a provider can create more than two simultaneous versions of an app.

> **Note:**
>
> The two-version limit applies to each release channel instead of per application package.

Providers enable release channels on the application package. By default, when you create an application package, release channels are enabled. However, if you create an application package with release channels
enabled, you cannot disable them later.

## Supported release channels

Release channels allow providers to publish an app at different stages of the development
life cycle. The specific release channel a provider uses depends on whether the app is
in development or ready for production. The Snowflake Native App Framework supports the following release channels:

QA:
:   Versions and patches of an app assigned to this release channel are only available to consumers within
    the provider’s organization. Apps published using this release channel must be targeted to one or more specific
    accounts within that organization; they are not available to all accounts in the organization by default.

    Providers can use this release channel for testing. Apps published using the QA release channel are not
    required to run the [automated security scan](security-run-scan.md).

ALPHA:
:   Versions and patches of an app assigned to this release channel can be published to consumers
    outside the provider’s organization. When an app is assigned to this release channel, the automated
    security scan is performed.

    While the security scan is in progress, the provider can set the release directive for this version, and
    consumers can install it in their accounts. However, if a version assigned to this release channel fails
    the security scan, it can no longer be used.

    Providers can use this channel to collaborate with consumers during the development of an app.

DEFAULT:
:   Versions and patches of an app assigned to this release channel are available to all consumers
    who have access to the app version or patch. Apps assigned to this release channel must pass the automated
    security scan.

    This release channel is the production release channel. All apps assigned to this release channel must
    conform to the security requirements and guidelines for publishing an app. For more information, see
    [Security requirements and guidelines for a Snowflake Native App](security-overview.md).

## About versions and patches of an app

Snowflake Native Apps allow providers to create versions and patches of an app. Versions and patches allow providers
to release new functionality and updates to consumers.

Versions
:   Generally contain major updates to a Snowflake Native App. Versions generally introduce new features and changed
    functionality for an app.

Patches
:   Generally contain smaller updates to a Snowflake Native App. Unlike versions, patches should only contain small
    updates such as security fixes.

> **Note:**
>
> Each version and patch must have its own manifest file and setup script.

### Number of available versions per release channel

Versions and patches are defined in the release channel. Providers can create multiple versions and patches of an app. However, each release channel only allows two versions of an app at a time. To add a new
version to a release channel that currently has two versions defined, providers must remove one of the versions that are currently in the release channel.

To remove a version, a provider must perform the following steps:

1. Ensure that all consumers have upgraded off the version to be removed.
2. Remove the version from the release channel.
3. Create a new version.
4. Upgrade the app.

For information about upgrading an app, see [Upgrade an app using release channels](release-channels-upgrade.md).

### Number of available patches per version

Although a release channel can only contain two versions at one time, a single version can have multiple patches. Patches cannot be dropped. When a provider adds a new version to a release channel, the new version is automatically assigned patch 0 by default. When a provider adds a new patch to a version, they can manually specify the identifier for the patch. If no patch number is provided, Snowflake automatically increments the patch version by 1.

---
title: About the Snowflake Native App Framework
source: https://docs.snowflake.com/en/developer-guide/native-apps/native-apps-about.md
section: Native Apps Framework
---

# About the Snowflake Native App Framework

This topic provides general information about the Snowflake Native App Framework.

## Introduction to the Snowflake Native App Framework

The Snowflake Native App Framework allows you to create data applications that leverage core Snowflake functionality.
The Snowflake Native App Framework allows you to:

* Expand the capabilities of other Snowflake features by sharing data and related
  business logic with other Snowflake accounts. The business logic of an application can include a Streamlit app,
  stored procedures, and functions written using [Snowpark API](../snowpark/index.md),
  JavaScript, and SQL.
* Share an application with consumers through listings. A listing can be either free or paid.
  You can distribute and monetize your apps in the Snowflake Marketplace or distribute them to
  specific consumers using private listings.
* Include rich visualizations in your application using Streamlit.

The Snowflake Native App Framework also supports an enhanced development experience that provides:

* A streamlined testing environment where you can test your applications from a single account.
* A robust developer workflow. While your data and related database objects remain within Snowflake,
  you can manage supporting code files and resources within source control using your preferred
  developer tools.
* The ability to release versions and patches for your application that allows you, as a provider,
  to change and evolve the logic of your applications and release them incrementally to consumers.
* Support for logging of structured and unstructured events so that you can troubleshoot and monitor
  your applications.

## Components of the Snowflake Native App Framework

The following diagram shows a high-level view of the Snowflake Native App Framework.

The Snowflake Native App Framework is built around the concept of provider and consumer used by other
Snowflake features, including
[Snowflake Collaboration](../../collaboration/collaboration-listings-about.md)
and [Secure Data Sharing](../../user-guide/data-sharing-gs.md)

Provider
:   A Snowflake user who wants to share data content and application logic with other Snowflake users.

Consumer
:   A Snowflake user who wants to access the data content and application logic shared by providers.

### Develop and Test an Application Package

To share data content and application logic with a consumer, providers create an application package.

Application package
:   An application package encapsulates the data content, application logic,
    metadata, and setup script required by an application. An application package also contains
    information about versions and patch levels defined for the application. See
    [Create and manage an application package](creating-app-package.md) for details.

An application package can include references to data content and external code files that a provider
wants to include in the application. An application package requires a manifest file and a setup script.

Manifest file
:   Defines the configuration and setup properties required by the application, including the location of
    the setup script, versions, etc. See [Create the manifest file for an app](manifest-overview.md) for details.

Setup script
:   Contains SQL statements that are run when the consumer installs or upgrades an application or when
    a provider installs or upgrades an application for testing. The location of the setup script is
    specified in the manifest file. See [Create the setup script](creating-setup-script.md)
    for details.

### Publish an Application Package

After developing and testing an application package, a provider can share an application with consumers by
publishing a listing containing the application package as the data product of a listing. The listing can be a Snowflake Marketplace
listing or a private listing.

Snowflake Marketplace listing
:   Allows providers to market applications across the Snowflake Data Cloud. Offering a listing on the Snowflake Marketplace
    lets providers share applications with many consumers simultaneously, rather than maintain
    sharing relationships with each individual consumer.

Private listing
:   Allows providers to take advantage of the capabilities of listings to share applications directly with another
    Snowflake account in any [Snowflake region supported](limitations.md)
    by the Snowflake Native App Framework.

See [About listings](../../collaboration/collaboration-listings-about.md) for details.

### Install and Manage an Application

After a provider publishes a listing containing an application package, consumers can discover the listing and
install the application.

Snowflake Native App
:   A Snowflake Native App is the database object installed in the consumer account. When a consumer installs the Snowflake Native App,
    Snowflake creates the application and runs the setup script to create the required objects within the application.
    See [Install and test an app locally](installing-testing-application.md) for details.

After installing the application, consumers can perform additional tasks, including:

* [Enable logging and event sharing](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging)
  to help providers troubleshoot the application.
* [Grant privileges required by the application](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs).

See [Working with Applications as a Consumer](https://other-docs.snowflake.com/en/native-apps/consumer-about)
for details on how consumers install and manage an application.

## About Snowflake Native Apps with Snowpark Container Services

A Snowflake Native App with Snowpark Container Services (app with containers) is a Snowflake Native App that runs container workloads in Snowflake.
Container apps can run any containerized service supported by Snowpark Container Services.

Apps with containers leverage all of the features of the Snowflake Native App Framework, including provider IP protection, security
and governance, data sharing, monetization, and integration with compute resources.

Like any Snowflake Native App, an app with containers is comprised of an application package and application
object. However, there are some differences as shown in the following image:

Application package:
:   To manage containers, the application package must have access to a services specification file on a
    stage. Within this file, there are references to the container images required by the app. These images
    must be stored in an image repository in the provider account.

Application object:
:   When a consumer installs an app with containers, the application object that is created contains a
    compute pool that stores the containers required by the app.

Compute pool:
:   A compute pool is a collection of one or more virtual machine (VM) nodes on which Snowflake runs your
    Snowpark Container Services jobs and services. When a consumer installs an app with containers, they can
    grant the CREATE COMPUTE POOL privilege to the app or they can create the compute pools manually.

## Protect provider intellectual property in an app with containers

When an app with containers is installed in the consumer account, the query history of the services
is available in the consumer account. To protect a provider’s confidential information, the Snowflake Native App Framework redacts
the following information:

* The query text is hidden from the [QUERY_HISTORY view](../../sql-reference/account-usage/query_history.md).
* All information in the [ACCESS_HISTORY view](../../sql-reference/account-usage/access_history.md) is hidden.
* The [Query Profile](../../user-guide/ui-snowsight-activity.md) graph for the service’s query is collapsed
  into a single empty node instead of displaying the full query profile tree.

## Multi-factor requirements for users in a provider account

Depending on the type of user, Snowflake requires different types of authentication for
users in the provider account.

### Non-service users

Snowflake recommends that users in a provider account enroll in
[multi-factor authentication (MFA)](../../user-guide/security-mfa.md) if they do not have the
[TYPE](../../sql-reference/sql/create-user.md) property set to SERVICE. In a future update, multi-factor
authentication will be mandatory for these types of users. Non-service users who use
[federated authentication](../../user-guide/admin-security-fed-auth-overview.md) and single sign-on (SSO)
must have MFA enabled as part of their authentication process.

### Service users

Users who have the TYPE parameter set to SERVICE must use
[key-pair authentication](../../user-guide/key-pair-auth.md) or
[OAuth](../../user-guide/oauth-intro.md).

---
title: Add a compute pool to an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-compute-pool.md
section: Native Apps Framework
---

# Add a compute pool to an app with containers

The topic describes how to use compute pools in a Snowflake Native Apps with Snowpark Container Services.

## About compute pools in apps with containers

A [compute pool](../snowpark-container-services/working-with-compute-pool.md) is a
collection of one or more virtual machine (VM) nodes on which Snowflake runs Snowpark Container Services.
Apps with containers uses a compute pool in the consumer account to manage the container images required by
the app.

An app can create multiple compute pools and each compute pool is exclusive to the app. Compute pools
used by the app cannot be used for other purposes.

Containers within an app can directly access each other, even if they are in different compute pools.

However, using different compute pools allows providers to separate types of services. For example, a
provider can separate their frontend services from backend services.

Compute pools are account-level objects, meaning the name of each compute pool must be unique within
the consumer account.

## Best practices for using compute pools in an app with containers

Providers should consider the following best practices when creating compute pools a consumer account:

* Compute pools have cost implications. It is important to set values for the `min_nodes`,
  `max_nodes`, and `instance_family` properties to consume the correct amount of resources.
  Providers should also set the AUTO_SUSPEND_SECS property to automatically suspend inactive compute pools.

  See [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) for more information.
* Compute pools are account-level objects, as such their pool names must be unique within a consumer account.
  Consider using the application name as a prefix of the compute pool name to ensure uniqueness.
* When adding a compute pool to an app with containers that is installed on different cloud service
  providers, the code used to create the compute pool must account for differences in the instance
  families across different cloud service providers. For example, the `HIGHMEM_X64_L` instance family
  has a different configuration for each cloud service provider.

  See [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) for more information on available
  instance families. See Choose different instance families for each provider for an example of
  how to set the instance family for different cloud service providers.
* Set the `uses_gpu` property to TRUE only if the app with containers uses a GPU as the
  instance family of the compute pool. See Set the uses_gpu property in the manifest file for more
  information.

## Create a compute pool for an app

There are two ways to create a compute pool for an app with containers:

* The app creates the compute pools required during installation. This requires that the
  consumer grants the CREATE COMPUTE POOL privilege on the compute pool to the app. A
  provider can configure the app to request these privileges using Snowsight.

  See Configure an app to request the CREATE COMPUTE POOL privilege for more information.
* The consumer manually creates the compute pools required by the app. The consumer
  must run the [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) to create the compute pool, then
  manually grant the CREATE COMPUTE POOL privilege on the compute pool to the app.

## Set the `uses_gpu` property in the manifest file

If an app with containers specifies a GPU as the instance family for the Compute Pool, providers
must set the `uses_gpu` flag to `true` in the manifest. The following example
shows how to set this flag in the `artifacts` block:

```yaml
artifacts:
  readme: readme.md
  setup_script: scripts/setup.sql
  container_services:
    uses_gpu: true|false
    images:
    - /provider_db/provider_schema/provider_repo/server:prod
    - /provider_db/provider_schema/provider_repo/web:1.0
```

The automated security scan uses this flag security scanning framework to validate behavior during the app version scanning process.

> **Caution:**
>
> To publish an app with containers on the Snowflake Marketplace, the app must create
> the required compute pools during installation. See [Enforced requirements](publish-guidelines.md)
> for the Snowflake Marketplace publication requirements.

## Configure an app to request the CREATE COMPUTE POOL privilege

Providers can configure an app to request the CREATE COMPUTE POOL privilege. They can
also create the compute pool from the setup script when the app is installed or upgraded.

> **Note:**
>
> An app can create a maximum of five compute pools in a consumer account. Contact Snowflake support
> if your app needs to create additional compute pools.

### Request the CREATE COMPUTE POOL privilege

An app can request the CREATE COMPUTE POOL privileges from a consumer. This privilege allows the app
to create a compute pool in the consumer account.
See [Request global privileges from consumers](requesting-privs.md)
for general information about requesting global privileges from the consumer.

To request the CREATE COMPUTE POOL privilege from a consumer, add the CREATE COMPUTE POOL privilege to
the manifest file as shown in the following example:

```yaml
...
privileges:
 - CREATE COMPUTE POOL
   description: "Enable application to create one to five compute pools"
 ...
```

See [Create the manifest file for an app](manifest-overview.md) for more information on creating the manifest file for an app with containers.

> **Note:**
>
> The behavior for the CREATE COMPUTE POOL privilege request within a container
> app is different than other privilege requests. When you add this privilege to the manifest file, Snowsight displays an interface that allows a consumer
> to grant the required privileges.

### Add the CREATE COMPUTE POOL command to the setup script

To create a compute pool in the consumer account add the
[CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) command to the
setup script of the app.

The following example shows how to create a compute pool within a stored procedure
in the setup script:

```sqlexample
CREATE COMPUTE POOL IF NOT EXISTS app_compute_pool
  MIN_NODES = 1
  MAX_NODES = 1
  INSTANCE_FAMILY = standard_1
  AUTO_RESUME = true;
```

> **Note:**
>
> When creating a compute pool within the app, providers should check that the provider
> has granted the CREATE COMPUTE POOL privilege before creating the compute pool.

Compute pools that an app creates are owned exclusively by that app. They cannot be used by other
applications or by the consumer directly.

In general, users in the consumer account can only see compute pools created by the app in the
following situations:

* The user has been granted the MANAGE GRANTS privilege.
* The app grants access to the compute pool using application roles.

Application developers can allow users with active roles specific privileges on applications owned by compute pools. In addition, administrators with the ACCOUNTADMIN role can grant themselves the privileges necessary to control the applications owned by compute pools. For more information about compute pool access requirements, see [ALTER COMPUTE POOL](../../sql-reference/sql/alter-compute-pool.md).

### Prefix the compute pool within the setup script

Because compute pools are account-level objects, compute pool names must be unique within
the consumer account. The following example shows how to use the application name as a
prefix of the compute pool name:

```sqlexample
LET POOL_NAME := (select current_database()) || '_app_pool';
CREATE COMPUTE POOL IF NOT EXISTS identifier(:pool_name)
  MIN_NODES = 1
  MAX_NODES = 1
  INSTANCE_FAMILY = STANDARD_2;
```

## Choose different instance families for each provider

When creating a compute pool for an app that is published across multiple cloud
service providers, the code that creates the setup script must be written to account
for differences in how instance families are configured.

The following example shows how to write a stored procedure to create a compute pool
based on the cloud service provider where the app is being installed:

```sqlexample
 CREATE OR REPLACE PROCEDURE public.create_cp()
 RETURNS VARCHAR
 LANGUAGE SQL
 EXECUTE AS OWNER
 AS $$
  BEGIN
      LET POOL_NAME := (select current_database()) || '_app_pool';
      LET INSTANCE_FAMILY := IFF( CONTAINS(current_region(), 'AZURE') , 'GPU_NV_XS' , 'GPU_NV_S' );
      CREATE COMPUTE POOL IF NOT EXISTS identifier(:pool_name)
          MIN_NODES = 1
          MAX_NODES = 1
          INSTANCE_FAMILY = :instance_family;
      RETURN 'Compute Pool Created';
  END;
$$;
```

## Uninstall an app that creates a compute pool or warehouse

To drop an app with containers that creates a compute pool or warehouse, the
consumer must drop or transfer ownership of the compute pool or warehouse before
uninstalling the app.

For more information, see [Uninstall an app in Snowsight](ui-consumer-managing-applications.md).

---
title: Add a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/native-apps/adding-streamlit.md
section: Native Apps Framework
---

# Add a Streamlit app

This topic describes how to include a [Streamlit](https://streamlit.io/) app within a Snowflake Native App.

## About Streamlit and the Snowflake Native App Framework

[Streamlit](https://streamlit.io/) is an open-source Python library that makes it easy to create
and share custom web apps for machine learning and data science. By using Streamlit you can quickly
build and deploy powerful data applications.

For information about the open-source library, see the [Streamlit Library documentation](https://docs.streamlit.io/).

Within the Snowflake Native App Framework you can use Streamlit to perform the following:

* Create a front-end web app that enables consumers to visualize the data provided by your Snowflake Native App.
* Create a user interface that allows consumers to grant privileges and create references to objects within
  their account that are used by the Snowflake Native App.

  See [Create and access objects in a consumer account](requesting-about.md) for more information.

> **Note:**
>
> See Unsupported Streamlit Features for information on unsupported Streamlit features.

### Considerations for warehouses when using Streamlit in a Snowflake Native App

Streamlit apps in a Snowflake Native App run using a Snowflake warehouse. The same warehouse considerations apply to both
Streamlit in Snowflake and Streamlit in a Snowflake Native App. See [Guidelines for selecting resources in Streamlit in Snowflake](../streamlit/app-development/runtime-environments.md) for more information.

> **Note:**
>
> Streamlit apps in a Snowflake Native App support the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) command. However, references to warehouses
> are not supported.

## Supported versions of the Streamlit library

The Snowflake Native App Framework supports the same versions of the Streamlit library as Streamlit in Snowflake. For more
information, see [Supported versions of the Streamlit library in warehouse runtimes](../streamlit/app-development/dependency-management.md).

Support for newer versions of the Streamlit library will be included as they are released.

See Set the Streamlit version for an app for information on how to set the version for a Streamlit app.

## Supported external packages

By default, a Streamlit app that is included within a Snowflake Native App includes the `python`, `streamlit`,
and `snowflake-snowpark-python` packages pre-installed in the consumer environment. The consumer environment
also has access to the dependencies required by these packages.

## Unsupported Streamlit Features

The following Streamlit features are not currently supported when using Streamlit in a
Snowflake Native App:

* Custom components are not supported.
* Using [Azure Private Link](../../user-guide/privatelink-azure.md) and
  [Google Cloud Private Service Connect](../../user-guide/private-service-connect-google.md) to access a Streamlit app is
  not supported.

* [st.bokeh_chart](https://docs.streamlit.io/library/api-reference/charts/st.bokeh_chart)
* [st.cache_data](https://docs.streamlit.io/library/api-reference/performance/st.cache_data)
* [st.cache_resource](https://docs.streamlit.io/library/api-reference/performance/st.cache_resource)
* [st.camera_input](https://docs.streamlit.io/library/api-reference/widgets/st.camera_input)
* [st.download_button](https://docs.streamlit.io/library/api-reference/widgets/st.download_button) (only supported in Streamlit version 1.26 or later)
* [st.file_uploader](https://docs.streamlit.io/library/api-reference/widgets/st.file_uploader)
* [st.image](https://docs.streamlit.io/library/api-reference/media/st.image)
* [st.pyplot](https://docs.streamlit.io/library/api-reference/charts/st.pyplot)
* [st.scatter_chart](https://docs.streamlit.io/library/api-reference/charts/st.scatter_chart)
* [st.set_page_config](https://docs.streamlit.io/library/api-reference/utilities/st.set_page_config)

  > The `page_title` and `page_icon` properties of the
  > [st.set_page_config](https://docs.streamlit.io/library/api-reference/utilities/st.set_page_config)
  > command are not supported.
* [st.video](https://docs.streamlit.io/library/api-reference/media/st.video)
* [Custom Components](https://docs.streamlit.io/library/components), including:

  > + [component.html()](https://docs.streamlit.io/library/components/components-api#stcomponentsv1html)
  > + [component.iframe()](https://docs.streamlit.io/library/components/components-api#stcomponentsv1iframe)
* [Configuration files](https://docs.streamlit.io/library/advanced-features/configuration)
* The following experimental features:

  > + [st.experimental_set_query_params](https://docs.streamlit.io/library/api-reference/utilities/st.experimental_set_query_params)
  > + [st.experimental_get_query_params](https://docs.streamlit.io/library/api-reference/utilities/st.experimental_get_query_params)
* Network access via the internet
* Anchor links

## Workflow to add a Streamlit app to a Snowflake Native App

The following workflow describes how to add a Streamlit app to a Snowflake Native App:

1. Develop your native app.

   This includes adding the data content that you want consumers to access using Streamlit. See
   [Snowflake Native App Framework workflow](native-apps-workflow.md) for more information.
2. Review the following sections to understand the supported version of the Streamlit library and
   unsupported features:

   * Supported versions of the Streamlit library
   * Unsupported Streamlit Features
   * Supported external packages
3. Develop a Streamlit app.

   See the [Streamlit Library documentation](https://docs.streamlit.io/) for information on using the
   Streamlit open-source library.
4. Create a local directory structure for the Streamlit app.

   See Example directory structure for a Streamlit app for recommendations on how to organize your Streamlit
   files within the structure of your app.
5. Add a CREATE STREAMLIT statement to the setup script.

   When running the [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command, the setup script runs
   the [CREATE STREAMLIT](../../sql-reference/sql/create-streamlit.md) statement to create a Streamlit object. This object
   contains the schema and Python files required by the Streamlit app.
6. Configure the `environment.yml` file to include additional libraries in your Streamlit app.

   See Add additional packages to a Streamlit app for more information.
7. Optional: Add the Streamlit object name as an entry in the manifest
   file to display the Streamlit
   app as the default app in [Snowsight](../../user-guide/ui-snowsight-gs.md).

   See Add a Streamlit app to the manifest file for more information.
8. Upload the Streamlit files, `environment.yml` file, setup
   script, and manifest file.
   files to a named stage. To include Streamlit code files in an application package, the files must be
   uploaded to a named stage.
9. Test the application package.

   After creating the files required by the application package and Streamlit app, create an application
   object to test the setup script and manifest file.

   See Test the application package containing the Streamlit app for more information.
10. View the Streamlit app in Snowsight.

    To test the Streamlit app, view the app in [Snowsight](../../user-guide/ui-snowsight-gs.md). See
    Test the Streamlit app in Snowsight.

## Example directory structure for a Streamlit app

Like other Python modules, to add a Streamlit app to an application package you must upload
your Streamlit code files to a named stage. See [PUT](../../sql-reference/sql/put.md) for information
on how to upload files to a stage.

To account for multiple versions of a Snowflake Native App, consider using a directory structure similar to the following
to maintain your Streamlit apps and related application files:

```none
@test.schema1.stage1:
└── /
    ├── manifest.yml
    ├── readme.md
    ├── scripts/setup_script.sql
    └── code_artifacts/
        └── streamlit/
            └── environment.yml
            └── streamlit_app.py
```

Note that the directory structure you create depends on the requirements of your app and
development environment.

> **Note:**
>
> The `environment.yml` file must be at the same level as your main file of your Streamlit app.

See [Reference external code files](adding-application-logic.md) for more information on relative paths.

## Create the Streamlit object in the setup script

The following example shows how to use [CREATE STREAMLIT](../../sql-reference/sql/create-streamlit.md) within the setup
script of an app.

```sqlexample
CREATE OR REPLACE STREAMLIT app_schema.my_test_app_na
     FROM '/code_artifacts/streamlit'
     MAIN_FILE = '/streamlit_app.py';

GRANT USAGE ON SCHEMA APP_SCHEMA TO APPLICATION ROLE app_public;
GRANT USAGE ON STREAMLIT APP_SCHEMA.MY_TEST_APP_NA TO APPLICATION ROLE app_public;
```

This example creates a Streamlit object within a schema named `app_schema`.
The [CREATE STREAMLIT](../../sql-reference/sql/create-streamlit.md) command uses the Streamlit app specified by the
MAIN_FILE clause. The directory location is specified by the value of the FROM clause.

See Example directory structure for a Streamlit app for information on creating the directory
structure for a Streamlit app within an application package.

This example also grants the required privileges on the schema and Streamlit object to an
application role.

## Add additional packages to a Streamlit app

Use the `environment.yml` file to add additional Python packages to a Streamlit app. For
example, to add the `scikit-learn` library to a Streamlit app, add the following to the
`environment.yml` file:

```yaml
name: sf_env
channels:
- snowflake
dependencies:
- scikit-learn
```

The `name` and `channels` properties are both required.

Also, the `- snowflake` key is required under the `channels` property. This indicates the
[Snowflake Anaconda Channel](https://repo.anaconda.com/pkgs/snowflake/).

> **Note:**
>
> You can only install packages listed in the
> [Snowflake Anaconda Channel](https://repo.anaconda.com/pkgs/snowflake/).
> Snowflake does not support external Anaconda channels in Streamlit.

## Set the Streamlit version for an app

The Snowflake Native App Framework supports multiple versions of the Streamlit library. To set the Streamlit version within
a Snowflake Native App add `streamlit` to the `dependencies` section of the `environment.yml` file
as shown in the following example:

```yaml
name: sf_env
channels:
- snowflake
dependencies:
- streamlit=1.35.0
```

Snowflake recommends explicitly setting the Streamlit version for your app. However, currently, if you
do not explicitly set the version of the Streamlit library, Streamlit version 1.22.0 is set as the default.

## Add a Streamlit app to the manifest file

To specify the default Streamlit app launched by your app, add the following entries in the manifest file:

```yaml
artifacts:
  ...
  default_streamlit: app_schema.streamlit_app_na
  ...
```

The `default_streamlit: app_schema.streamlit_app_na` entry specifies the location of the
schema containing your Streamlit app.

## Test the application package containing the Streamlit app

To test the application package containing the Streamlit app, create an application object using
the files on a named stage by running the [CREATE APPLICATION](../../sql-reference/sql/create-application.md) as shown
in the following example:

> ```sqlexample
> CREATE APPLICATION hello_snowflake_app
>   FROM APPLICATION PACKAGE hello_snowflake_package
>   USING '@hello_snowflake_code.core.hello_snowflake_stage';
> ```

Depending on what you need to test, you can create the application object using other
forms of the [CREATE APPLICATION](../../sql-reference/sql/create-application.md). For example, you may want to
test the Streamlit app as part of a version or upgrade. See
[Install and test an app locally](installing-testing-application.md).

## Test the Streamlit app in Snowsight

To test the Streamlit app, view the app in [Snowsight](../../user-guide/ui-snowsight-gs.md) by doing the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the Streamlit app you want to view.

   The main Streamlit app opens in the Snowsight.
4. Optional: If you are viewing a multipage Streamlit app, select a tab to view additional pages.

## Troubleshoot a Streamlit app in the Snowflake Native App Framework

If the app displays an unknown error, make sure you have tried the solutions described in the following
sections.

### Acknowledge the Terms of Service

To use Streamlit and packages provided by Anaconda in Snowflake, you must acknowledge the
[External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).
To learn more, see [Using third-party packages from Anaconda](../udf/python/udf-python-packages.md).

### Firewall allowlisting

Each Streamlit app uses a unique subdomain. If you use strict firewalls, add \*.snowflake.app
to your firewall allowlist. Adding this entry to your allowlist allows your apps to communicate with
Snowflake servers without any restrictions.

---
title: Add application logic to an application package
source: https://docs.snowflake.com/en/developer-guide/native-apps/adding-application-logic.md
section: Native Apps Framework
---

# Add application logic to an application package

This topic describes how to add application logic to the setup script of an application package. It
also describes how to use external code files in an application package.

See [Add a Streamlit app](adding-streamlit.md) for information about including a Streamlit in Snowflake app in an application package.

## Considerations for using stored procedures and functions

The Snowflake Native App Framework allows you to include stored procedures, user-defined functions
(UDFs), and external functions in an application package. These can be written in any of the
[languages Snowflake supports](../stored-procedures-vs-udfs.md).

If you plan to publish your Snowflake Native App to the Snowflake Marketplace as a limited trial listing and want to limit the functionality
of your application that is available to those trial consumers, see [Preparing to offer a limited trial listing](https://other-docs.snowflake.com/collaboration/provider-listings-preparing#label-prepare-limited-trial-listing).

### Add application code securely

All stored procedures and UDFs within a Snowflake Native App run as the application and
have access to all objects within the installed Snowflake Native App. This can lead to SQL
injection attacks.

When developing procedures and functions for use within a Snowflake Native App, Snowflake recommends
that all SQL commands requiring input from users be run using bound parameters. This includes
input provided through procedure arguments.

See [Creating a stored procedure](../stored-procedure/stored-procedures-creating.md) for more information.

### About caller’s rights and owner’s rights

All procedures created by the setup script or that run
within the installed Snowflake Native App must be run with the rights of the owner (EXECUTE AS OWNER).

This restriction exists because if the Snowflake Native App were to run with caller’s rights (EXECUTE AS CALLER) in
a procedure that the Snowflake Native App does not own, the procedure would run as the Snowflake Native App
itself and allow a consumer to create code to view or modify the contents of the Snowflake Native App and shared data content.

See [Understanding caller’s rights and owner’s rights stored procedures](../stored-procedure/stored-procedures-rights.md) for more information.

### Limitations when calling context functions from the setup script

[Context functions](../../sql-reference/functions-context.md) provide information about the context
in which a statement is run. Within the context of the Snowflake Native App Framework, some context
functions are not available. Context functions that are not available are either blocked and
return an error or always return a null value.

Use caution when using context functions in policies applied to shared data
content within a Snowflake Native App. Some functions, for example CURRENT_IP_ADDRESS, behave differently
in the context of a Snowflake Native App.

When using context functions that depend on the namespace within the client organization
there may be conflicts with functions in other namespaces. For example, a row access policy
using CURRENT_USER should be aware that the same username can exist in multiple accounts.

When using a [Streamlit](https://streamlit.io/) app within a Snowflake Native App, context functions
have additional constraints. For example, CURRENT_USER returns NULL when invoked from Streamlit in Snowflake.

The following table lists the context functions that are not supported by the Snowflake Native App Framework:

| Context Function | Blocked in shared content (returns null) | Blocked in setup scripts and stored procedure and UDFs owned by the Snowflake Native App (throws an exception) |
| --- | --- | --- |
| CURRENT_ROLE | ✔ |  |
| CURRENT_ROLE_TYPE | ✔ |  |
| CURRENT_USER | ✔ |  |
| CURRENT_SESSION | ✔ |  |
| IS_ROLE_IN_SESSION | ✔ |  |
| CURRENT_IP_ADDRESS | ✔ | ✔ |
| CURRENT_AVAILABLE_ROLES | ✔ | ✔ |
| CURRENT_SECONDARY_ROLES | ✔ | ✔ |
| ALL_USER_NAMES |  | ✔ |
| GET_USERS_FOR_COLLABORATION |  | ✔ |
| CURRENT_WAREHOUSE |  | ✔ |
| SYSTEM$ALLOWLIST |  | ✔ |

> **Note:**
>
> CURRENT_USER and CURRENT_SESSION return NULL when invoked from Streamlit in Snowflake within a Snowflake Native App unless
> permission is granted to the app with GRANT READ SESSION ON ACCOUNT TO APPLICATION.

## Supported versions of Python in a Snowflake Native App

For information on the versions of Python that Snowflake supports,
see [Snowflake Python Runtime Support](../python-runtime-support-policy.md).

> **Caution:**
>
> Snowflake Native Apps do not support decommissioned versions of Python.

As a provider, you must ensure that your app uses supported versions of Python. Apps cannot create functions
that use decommissioned versions of Python. Additionally, you cannot create or publish new versions of an app that
attempt to create functions that use decommissioned versions of Python.

Existing published versions of an app that use decommissioned versions of Python cannot be installed.

## Use Snowpark functions and procedures in an app

The Snowflake Native App Framework supports the Snowpark libraries for creating stored procedures in Java, Scala, and Python.

### Reference external code files

There are two types of code files that you can include in an application package:

* Referenced files: include binaries, libraries and other code files. These files are specific
  to a version defined in the application package. These files must be located in the root directory of the stage
  when creating or adding a version.

  Referenced files are different from user-defined functions and stored procedures because they are not
  defined in the setup script of an application package. These files are referenced by import statements within the
  stored procedures and UDFs that are defined in the setup script.
* Resource files: include semi-structured data, structured data, and binaries, for example, a machine
  learning model. These files must be uploaded to a named stage that is accessible to
  the application package.

A stored procedure, user-defined function, or external function that references these types of
code files must be created within a versioned schema in the setup script. When creating stored
procedures or functions within a versioned schema, you must reference a code file relative
to the root directory of the named stage.

For example, if the root directory of the named stage is `/app_files/dev`, this directory would
contain the following files and directories:

* A manifest file.
* A directory containing the setup script, for example `scripts/setup_version.sql`.
* Referenced files that are imported when creating a stored procedure, UDF, or external function
  within the setup script, for example:

  + `libraries/jars/lookup.jar`
  + `libraries/jars/log4j.jar`
  + `libraries/python/evaluate.py`

In this scenario, the directory structure would be as follows:

```none
@DEV_DB.DEV_SCHEMA.DEV_STAGE/V1:
└── app_files/
    └── dev
        ├── manifest.yml
        └── scripts/
            ├── setup_script.sql
            └── libraries/
                └── jars/
                    ├── lookup.jar
                    └── log4j.jar
            └── python
                └── evaluation.py
```

To access the JAR files in this directory structure, a stored procedure defined in the setup
script would reference these files as shown in the following example:

```sqlexample-java
CREATE PROCEDURE PROGRAMS.LOOKUP(...)
  RETURNS STRING
  LANGUAGE JAVA
  PACKAGES = ('com.snowflake:snowpark:latest')
  IMPORTS = ('/scripts/libraries/jar/lookup.jar',
             '/scripts/libraries/jar/log4j.jar')
  HANDLER = 'com.acme.programs.Lookup';
```

In this example, the IMPORTS statement has a path relative to the root directory used to create the
version, for example, the location of the manifest file.

### Include Java and Scala code in an application package

The Snowflake Native App Framework supports using Java and Scala in stored procedures and in external
code files.

#### Create Java and Scala UDFs inline

The Snowflake Native App Framework supports creating stored procedures containing
[Java](../stored-procedure/java/procedure-java-overview.md) and
[Scala](../stored-procedure/scala/procedure-scala-overview.md). The code that defines the
stored procedure must be added to the setup script.

The following example shows a stored procedure containing a Java function:

```sqlexample-java
CREATE OR ALTER VERSIONED SCHEMA app_code;
CREATE STAGE app_code.app_jars;

CREATE FUNCTION app_code.add(x INT, y INT)
  RETURNS INTEGER
  LANGUAGE JAVA
  HANDLER = 'TestAddFunc.add'
  TARGET_PATH = '@app_code.app_jars/TestAddFunc.jar'
  AS
  $$
  class TestAddFunc {
    public static int add(int x, int y) {
      Return x + y;
    }
  }
  $$;
```

#### Import external Java and Scala UDFs

The syntax for creating pre-compiled UDFs requires that imported JARs be included as part
of a set of versioned artifacts. To refer to pre-compiled JARs, use the relative path instead
of specifying the full stage location in the IMPORT clause.

The path must be relative to the root directory containing the version starting with a single
forward slash, for example `IMPORTS = ('/path/to/JARs/from/version/root')`. See
Reference external code files for
more information on relative paths.

The following shows an example directory structure for the code files.

```none
@DEV_DB.DEV_SCHEMA.DEV_STAGE/V1:
└── V1/
    ├── manifest.yml
    ├── setup_script.sql
    └── JARs/
        ├── Java/
        │   └── TestAddFunc.jar
        └── Scala/
            └── TestMulFunc.jar
```

The following example shows how to create a Java function using a JAR file:

```sqlexample
CREATE FUNCTION app_code.add(x INTEGER, y INTEGER)
  RETURNS INTEGER
  LANGUAGE JAVA
  HANDLER = 'TestAddFunc.add'
  IMPORTS = ('/JARs/Java/TestAddFunc.jar');
```

#### Restrictions on Java and Scala UDFs

The Snowflake Native App Framework imposes the following restrictions when using Java and Scala:

* Imports are only allowed for UDFs created in a versioned schema.
* Imports can only access the version artifacts using a relative path.
* UDFs created outside of versioned schemas can only be created inline.
* Relative paths are not supported for TARGET_PATH.

### Add Python code to an application package

The Snowflake Native App Framework supports using Python in stored procedures and in external code
files.

#### Define a Python function in the setup script

The Snowflake Native App Framework supports creating stored procedures in
[Python](../stored-procedure/python/procedure-python-overview.md).

The following example shows a stored procedure containing a Python function:

```sqlexample-python
CREATE FUNCTION app_code.py_echo_func(str STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  HANDLER = 'echo'
AS
$$
def echo(str):
  return "ECHO: " + str
$$;
```

#### Use external Python files

The following example shows how to include an external Python file in an application package:

```sqlexample-python
CREATE FUNCTION PY_PROCESS_DATA_FUNC()
  RETURNS STRING
  LANGUAGE PYTHON
  HANDLER = 'TestPythonFunc.process'
  IMPORTS = ('/python_modules/TestPythonFunc.py',
    '/python_modules/data.csv')
```

See to Reference external code files for more information on relative paths.

#### Restrictions on Python UDFs

Snowflake Native App Framework imposes the following restrictions on Python UDFs:

* Imports are only allowed for UDFs created in a versioned schema.
* Imports can only access the version artifacts using a relative path.
* UDFs created outside of versioned schemas can only be created inline.

## Add JavaScript functions and procedures to an application package

The Snowflake Native App Framework supports using JavaScript in stored procedures and user-defined
functions using the [JavaScript API](../stored-procedure/stored-procedures-javascript.md).

### Handle JavaScript errors

When using JavaScript within an application package, Snowflake recommends that you catch and
handle errors. If not, the error message and stack trace that the
error returns are visible to the consumer. To ensure that data content and application logic
are kept private, use try/catch blocks in situations where sensitive objects or data is
being accessed.

The following example shows a JavaScript stored procedure that catches an error and returns
a message:

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE APP_SCHEMA.ERROR_CATCH()
  RETURNS STRING
  LANGUAGE JAVASCRIPT
  EXECUTE AS OWNER
  AS $$
    try {
      let x = y.length;
    }
    catch(err){
      return "There is an error.";
    }
    return "Done";
  $$;
```

This example creates a JavaScript stored procedure that contains a try/catch block. If the
stored procedure encounters an error when running the statement in the `try` block, it
returns the message “There is an error” which is visible to the consumer.

Without the try/catch block, the stored procedure would return the original error message
and the full stack trace which would be visible to the consumer.

> **Note:**
>
> Other languages supported by the Snowflake Native App Framework return redact error messages that occur in a Snowflake Native App.

## Add external functions to an application package

[External functions](../../sql-reference/sql/create-external-function.md) allow a Snowflake Native App
to make calls to application code that is hosted outside of Snowflake. External functions
require you to create an API Integration object.

Because API integrations allow connectivity outside of the consumer environment, the consumer
must provide the method of integration to the Snowflake Native App.

The following example shows a stored procedure created by the setup script that accepts the integration
and creates an external function. This example shows how to create an external function in the setup script of the
application package:

```sqlexample
CREATE OR REPLACE PROCEDURE calculator.create_external_function(integration_name STRING)
  RETURNS STRING
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  DECLARE
    CREATE_STATEMENT VARCHAR;
  BEGIN
    CREATE_STATEMENT := 'CREATE OR REPLACE EXTERNAL FUNCTION EXTERNAL_ADD(NUM1 FLOAT, NUM2 FLOAT)
        RETURNS FLOAT API_INTEGRATION = ? AS ''https://xyz.execute-api.us-west-2.amazonaws.com/production/sum'';' ;
    EXECUTE IMMEDIATE :CREATE_STATEMENT USING (INTEGRATION_NAME);
    RETURN 'EXTERNAL FUNCTION CREATED';
  END;

GRANT USAGE ON PROCEDURE calculator.create_external_function(string) TO APPLICATION ROLE app_public;
```

This example defines a stored procedure, written in SQL, and [creates an external function](../../sql-reference/sql/create-external-function.md)
that references an application hosted on a system outside of Snowflake. The external function returns an
API integration.

This example also grants USAGE on the stored procedure to an application role. The consumer must grant this
privilege to the Snowflake Native App before invoking this procedure in the setup script.

---
title: Add billable events to an application package
source: https://docs.snowflake.com/en/developer-guide/native-apps/adding-custom-event-billing.md
section: Native Apps Framework
---

# Add billable events to an application package

When you use Custom Event Billing for a Snowflake Native App, you can charge for specific types of application usage in addition to the existing
usage-based pricing plans. To set it up, you must perform two high-level steps:

1. Set up your application package to emit billable events by following the steps in this topic.
2. [Select a usage-based pricing plan with billable events](../../collaboration/provider-listings-pricing-model.md)
   for the listing you use to publish your Snowflake Native App to consumers.

This topic describes how to set up your application package to emit billable events using the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) and [SYSTEM$CREATE_BILLING_EVENTS](../../sql-reference/functions/system_create_billing_events.md) system functions.

## Overview of billable events in an application package

You can set up your application package to emit billable events in response to specific usage events so that you can charge consumers based on
how much they use your Snowflake Native App.

For example, you can add a billable event to charge a consumer a specific amount for each call to a stored procedure in your Snowflake Native App.

To add billable events to an application package, do the following:

1. Create stored procedures to define which usage events trigger calls to the
   [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) and [SYSTEM$CREATE_BILLING_EVENTS](../../sql-reference/functions/system_create_billing_events.md) system functions.

   > **Note:**
   >
   > You cannot test the output of the system function at this stage. This system function can only be called from a Snowflake Native App
   > installed in a consumer account.
2. Add those stored procedures to the setup script of the application package.

> **Important:**
>
> Snowflake supports billable events that are emitted by calling the system function within a stored procedure in the application,
> as outlined by the examples in this topic.
>
> Snowflake does not support other methods of calculating the base charge for billable events, such as methods that use the output of a
> table or user-defined function that outputs consumer activity or methods that use telemetry logged in an event table.
>
> If you’re uncertain whether a proposed implementation will be supported, contact your Snowflake account representative.

## Billable event examples

The examples in this section show how to create stored procedures to emit billable events for common billing
scenarios. Each of these examples calls the `createBillingEvent` function.

### Call the SYSTEM$CREATE_BILLING_EVENT system function

The following example shows how to create a wrapper function in a stored procedure to call the
[SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

> **Note:**
>
> You can call this system function in a stored procedure written in JavaScript, Python, or Java.

This example creates a JavaScript stored procedure named `custom_event_billing` in the schema version that is accessible to the procedures that emit billing. The stored procedure creates a helper function called `createBillingEvent` which takes arguments that correspond to the typed parameters expected by the SYSTEM$CREATE_BILLING_EVENT system function.

For more details about the parameters and the required types, see [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md).

```sqlexample-javascript
 CREATE OR REPLACE PROCEDURE <schema_name>.custom_event_billing()
 RETURNS NULL
 LANGUAGE JAVASCRIPT
 AS
 $$
   /**
    * Helper method to add a billable event
    * Format timestamps as Unix timestamps in milliseconds
    */

   function createBillingEvent(className, subclassName, startTimestampVal, timestampVal, baseCharge, objects, additionalInfo) {
        try {
            var res = snowflake.createStatement({
            sqlText: `SELECT SYSTEM$CREATE_BILLING_EVENT('${className}',
                                                      '${subclassName}',
                                                      ${startTimestampVal},
                                                      ${timestampVal},
                                                      ${baseCharge},
                                                      '${objects}',
                                                      '${additionalInfo}')`
            }).execute();

            res.next();

            return res.getColumnValue(1);
        } catch(err) {
            return err.message;
        }
    }
$$;
```

The examples in this topic call this helper function.

### Batch multiple billing events with the SYSTEM$CREATE_BILLING_EVENTS system function

The following example stored procedure shows how to batch multiple Snowflake Native App billing events with the SYSTEM$CREATE_BILLING_EVENTS system function. By using batches, you save time, reduce the likelihood of exceeding call limits, and ensure your billing events are set up correctly.

For more details about the parameters and the required types, see [SYSTEM$CREATE_BILLING_EVENTS](../../sql-reference/functions/system_create_billing_events.md).

```sqlexample-javascript
 CREATE OR REPLACE PROCEDURE <app_provider_db_1><app_provider_schema_1>.external_proc_batch()
 RETURNS STRING
 LANGUAGE JAVASCRIPT
 EXECUTE AS OWNER
 AS
 $$
   function createBillingEventsBulk(events) {
     try {
       var res = snowflake.execute({
                    sqlText: `call SYSTEM$CREATE_BILLING_EVENTS('${events}')`
                 });
       res.next();
       return res.getColumnValueAsString(1);
     } catch (err) {
       return err.message;
     }
   }

   return createBillingEventsBulk(`
                                   [
                                     {
                                       "class": "class_1",
                                       "subclass": "subclass_1",
                                       "start_timestamp": ${Date.now()},
                                       "timestamp": ${Date.now()},
                                       "base_charge": 6.1,
                                       "objects": "obj1",
                                       "additional_info": "info1"
                                     },
                                     {
                                       "class": "class_2",
                                       "subclass": "subclass_2",
                                       "start_timestamp": ${Date.now()},
                                       "timestamp": ${Date.now()},
                                       "base_charge": 9.1,
                                       "objects": "obj2",
                                       "additional_info": "info2"
                                     }
                                   ]
                                 `);
$$;
```

### Example: Billing based on calls to a stored procedure

The following example shows how to create a stored procedure to emit a billable event when a consumer calls
that stored procedure in a Snowflake Native App.

Add this example code to your setup script in the same stored procedure that defines the helper function:

```sqlexample-javascript
...
  //
  // Send a billable event when a stored procedure is called.
  //
  var event_ts = Date.now();
  var billing_quantity = 1.0;
  var base_charge = billing_quantity;
  var objects = "[ \"db_1.public.procedure_1\" ]";
  var retVal = createBillingEvent("PROCEDURE_CALL", "", event_ts, event_ts, base_charge, objects, "");
  // Run the rest of the procedure ...
$$;
```

This example code creates a stored procedure that calls the `createBillingEvent` function to emit a billable event
with the class name `PROCEDURE_CALL` and a base charge of `1.0`.

> **Note:**
>
> The types of the arguments passed to the `createBillingEvent` function must correspond to the typed parameters
> expected by the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

### Example: Billing based on rows consumed by a Snowflake Native App

The following example shows how to create a stored procedure to emit a billable event based on the number of
rows consumed within a table in the consumer account.

Add this example code to your setup script in the same stored procedure that defines the helper function:

```sqlexample-javascript
...
  // Run a query and get the number of rows in the result
  var select_query = "select i from db_1.public.t1";
  res = snowflake.execute ({sqlText: select_query});
  res.next();
  //
  // Send a billable event for rows returned from the select query
  //
  var event_ts = Date.now();
  var billing_quantity = 2.5;
  var base_charge = res.getRowcount() * billing_quantity;
  var objects = "[ \"db_1.public.t1\" ]";
  createBillingEvent("ROWS_CONSUMED", "", event_ts, event_ts, base_charge, objects, "");
  // Run the rest of the procedure ...
$$;
```

This example code creates a stored procedure that calls the `createBillingEvent` function to emit a billable event
with the class name `ROWS_CONSUMED` and a calculated base charge of `2.5` multiplied by the number of rows in the
`db_1.public.t1` table in the consumer account.

> **Note:**
>
> The types of the arguments passed to the `createBillingEvent` function must correspond to the typed parameters
> expected by the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

### Example: Billing based on the number of rows ingested

The following example shows how to create a stored procedure to emit a billable event based on the number of rows
ingested into a table.

Add this example code to your setup script in the same stored procedure that defines the helper function:

```sqlexample-javascript
...
    // Run the merge query
    var merge_query = "MERGE INTO target_table USING source_table ON target_table.i = source_table.i
        WHEN MATCHED THEN UPDATE SET target_table.j = source_table.j
        WHEN NOT MATCHED
        THEN INSERT (i, j)
        VALUES (source_table.i, source_table.j)";
    res = snowflake.execute ({sqlText: merge_query});
    res.next();
    // rows ingested = rows inserted + rows updated
    var numRowsIngested = res.getColumnValue(1) + res.getColumnValue(2);

    //
    // Send a billable event for rows changed by the merge query
    //
    var event_ts = Date.now();
    var billing_quantity = 2.5;
    var base_charge = numRowsIngested * billing_quantity;
    var objects = "[ \"db_1.public.target_table\" ]";
    createBillingEvent("ROWS_CHANGED", "", event_ts, event_ts, base_charge, objects, "");
    // Run the rest of the procedure ...
$$;
```

This example code creates a stored procedure that calls the `createBillingEvent` function to emit a billable event
with the class name `ROWS_CHANGED` and a calculated base charge of `2.5` multiplied by the number of rows
ingested in the `db_1.target_table` table.

> **Note:**
>
> The types of the arguments passed to the `createBillingEvent` function must correspond to the typed parameters
> expected by the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

### Example: Billing based on monthly active rows

Monthly active rows are the number of rows inserted or updated for the first time within a calendar month. Some
providers use this metric to only charge consumers for unique rows updated in a month. You can modify this example to instead
count unique users, or identify a unique data load location to determine a base charge.

The following example shows how to create a stored procedure to emit a billable event based on the number of
monthly active rows. Add this example code to your setup script in the same stored procedure that defines the helper function:

```sqlexample-javascript
...
    //
    // Get monthly active rows
    //
    var monthly_active_rows_query = "
     SELECT
         count(*)
     FROM
         source_table
     WHERE
         source_table.i not in
         (
           SELECT
             i
           FROM
             target_table
           WHERE
             updated_on >= DATE_TRUNC('MONTH', CURRENT_TIMESTAMP)
         )";
    res = snowflake.execute ({sqlText: monthly_active_rows_query});
    res.next();
    var monthlyActiveRows = parseInt(res.getColumnValue(1));
    //
    // Run the merge query and update the updated_on values for the rows
    //
    var merge_query = "
        MERGE INTO
            target_table
        USING
            source_table
        ON
            target_table.i = source_table.i
        WHEN MATCHED THEN
         UPDATE SET target_table.j = source_table.j
                    ,target_table.updated_on = current_timestamp
        WHEN NOT MATCHED THEN
            INSERT (i, j, updated_on) VALUES (source_table.i, source_table.j, current_timestamp)";
    res = snowflake.execute ({sqlText: merge_query});
    res.next();
    //
    // Emit a billable event for monthly active rows changed by the merge query
    //
    var event_ts = Date.now();
    var billing_quantity = 0.02
    var base_charge = monthlyActiveRows * billing_quantity;
    var objects = "[ \"db_1.public.target_table\" ]";
    createBillingEvent("MONTHLY_ACTIVE_ROWS", "", event_ts, event_ts, base_charge, objects, "");
    // Run the rest of the procedure ...
$$;
```

This example code creates a stored procedure that determines the number of monthly active rows using a merge query to identify unique
rows. The example then calculates the base charge using the value of the `monthlyActiveRows` variable and the `billing_quantity`.
The base charge is then passed to the `createBillingEvent` function.

> **Note:**
>
> The types of the arguments passed to the `createBillingEvent` function must correspond to the typed parameters
> expected by the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

In your setup script, add this stored procedure after the stored procedure that calls the SYSTEM$CREATE_BILLING_EVENT system function.

### Snowpark Python example: Billing based on rows consumed

To write your stored procedure in Snowpark Python to bill based on rows consumed by your Snowflake Native App, use the following example:

```sqlexample-python
CREATE OR REPLACE PROCEDURE app_schema.billing_event_rows()
   RETURNS STRING
   LANGUAGE PYTHON
   RUNTIME_VERSION = '3.11'
   PACKAGES = ('snowflake-snowpark-python')
   HANDLER = 'run'
   EXECUTE AS OWNER
   AS $$
import time

# Helper method that calls the system function for billing
def createBillingEvent(session, class_name, subclass_name, start_timestamp, timestamp, base_charge, objects, additional_info):
   session.sql(f"SELECT SYSTEM$CREATE_BILLING_EVENT('{class_name}', '{subclass_name}', {start_timestamp}, {timestamp}, {base_charge}, '{objects}', '{additional_info}')").collect()
   return "Success"

# Handler function for the stored procedure
def run(session):
   # insert code to identify monthly active rows and calculate a charge
   try:

      # Run a query to select rows from a table
      query =  "select i from db_1.public.t1"
      res = session.sql(query).collect()

      # Define the price to charge per row
      billing_quantity = 2.5

      # Calculate the base charge based on number of rows in the result
      charge = len(res) * billing_quantity

      # Current time in Unix timestamp (epoch) time in milliseconds
      current_time_epoch = int(time.time() * 1000)

      return createBillingEvent(session, 'ROWS_CONSUMED', '', current_time_epoch, current_time_epoch, charge, '["billing_event_rows"]', '')
   except Exception as ex:
      return "Error " + ex
$$;
```

This example code creates a stored procedure that defines a helper method that calls the SYSTEM$CREATE_BILLING_EVENT system function,
as well as a method that calls that helper method, `createBillingEvent`, to emit a billable event
with the class name `ROWS_CONSUMED` and a base charge calculated by multiplying a price of `2.5` US dollars by the number of rows in
the `db_1.public.t1` table in the consumer account.

> **Note:**
>
> The types of the arguments passed to the `createBillingEvent` function must correspond to the typed parameters
> expected by the [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md) system function.

## Test custom event billing

To make sure that you set up Custom Event Billing properly and that billable events are emitted for usage events as you expect,
do the following:

1. Update your application package:

   1. Update your setup script to include the stored procedures that emit billable events.
   2. Update your application package with the new setup script.
   3. Update the version and release directive for your application package.
2. Share the application package with a consumer account in your organization that you have access to:

   1. [Create a private listing](../../collaboration/provider-listings-creating-publishing.md).
   2. Add [Custom Event Billing as the pricing plan](../../collaboration/provider-listings-pricing-model.md) for the listing.
   3. Share it with the consumer account.
   4. Sign in to the consumer account using Snowsight.
   5. Install the Snowflake Native App.
3. Confirm that the stored procedures successfully emit billable events.
4. Confirm that the listing is set up properly.

> **Note:**
>
> When you test Custom Event Billing, you must
> [set up a payment method](../../collaboration/consumer-listings-paying.md)
> but you will not be charged for usage within your organization.

### Validate whether the stored procedures emit billable events

While signed in to the consumer account with which you shared your listing, call the stored procedures that you added to your Snowflake Native App.

For example, to test the stored procedure created for billing based on monthly active rows, do the following:

1. Sign in to the consumer account in Snowsight.
2. Open a worksheet and set the context to `db_1.public`.
3. Run the following SQL statement:

   ```sqlexample
   CALL merge_procedure()
   ```

   If the stored procedure returns `Success`, your code is working.

> **Note:**
>
> If you run these SQL commands in the provider account that you used to create the application package, you see an error.

### Validate the custom event billing pricing plan

To validate the consumer experience of a Snowflake Native App and confirm that the listing and application package are set up properly, you can query
the [MARKETPLACE_PAID_USAGE_DAILY View](../../collaboration/views/marketplace-paid-usage-daily-ds.md) in the DATA_SHARING_USAGE schema of the shared SNOWFLAKE database.

> **Note:**
>
> Due to latency in the view, run these queries at least two days after first using the Snowflake Native App.

To confirm that billable events are successfully generated by a Snowflake Native App and listing,
run the following SQL statement in the consumer account that you shared the listing with:

> **Note:**
>
> Replace the PROVIDER_ACCOUNT_NAME and PROVIDER_ORGANIZATION_NAME values with those of the provider account.

```sqlexample
SELECT listing_global_name,
   listing_display_name,
   charge_type,
   charge
FROM SNOWFLAKE.DATA_SHARING_USAGE.MARKETPLACE_PAID_USAGE_DAILY
WHERE charge_type='MONETIZABLE_BILLING_EVENTS'
      AND PROVIDER_ACCOUNT_NAME = <account_name>
      AND PROVIDER_ORGANIZATION_NAME= <organization_name>;
```

```output
+---------------------+------------------------+----------------------------+--------+
| LISTING_GLOBAL_NAME |  LISTING_DISPLAY_NAME  |        CHARGE_TYPE         | CHARGE |
+---------------------+------------------------+----------------------------+--------+
| AAAA0BBB1CC         | Snowy Mountain Listing | MONETIZABLE_BILLING_EVENTS |   18.6 |
+---------------------+------------------------+----------------------------+--------+
```

---
title: Add job services to an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-services-job.md
section: Native Apps Framework
---

# Add job services to an app

This topic describes how to create and manage job services within a Snowflake Native App with Snowpark Container Services. For information
on using services in an app, see Add job services to an app.

A Snowflake Native App with Snowpark Container Services can run a Snowpark Container Services job service.

A service created using [CREATE SERVICE](../../sql-reference/sql/create-service.md) is long-running. An app must
explicitly stop the service when it is no longer needed. In contrast, a job service created using
[EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) is a service that terminates when the code of the service
exits, similar to a stored procedure. When all containers exit, the job is done.

Job services run synchronously. The [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) command completes after
all containers exit.

## Execute a job service in an app

To execute a job service in an app, add the [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) command
to the setup script.

The following example shows how to execute a job service in the context of a Snowflake Native App with Snowpark Container Services:

```sqlexample
EXECUTE JOB SERVICE
  IN COMPUTE POOL consumer_compute_pool
  FROM SPECIFICATION_FILE = 'job_service.yml'
  NAME = 'services_schema.job_service'

GRANT MONITOR ON SERVICE services.job_service TO APPLICATION ROLE app_public;
```

> **Note:**
>
> Note that the command parameters must be specified in the order shown in this example.

When called from the setup script, the [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) command
creates a job in a compute pool in the consumer account.

If the consumer creates the compute pool manually, they must grant the USAGE privilege on the compute
pool to the app before this command will succeed. Therefore, providers must include logic in a stored
procedure that tests if the correct privileges have been granted before running the
[EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md).

The `FROM SPECIFICATION_FILE =` clause specifies the relative path to the service specification
file on a stage. See [Create the service specification file](container-containers.md) for more information.

The `NAME =` clause specifies the identifier for the job service. The name of this job service
must be unique within the schema where it is located.

> **Note:**
>
> Job services cannot be executed within a version schema.

The `NAME =` clause should use the schema and name of the job within the application.
For , `services_schema.job_service` If the schema name is not specified the job service
is created in the schema of the stored procedure or function that is executing the job service.

## Monitor a job service in an app

To monitor the status of a job service within an app, use the
[SYSTEM$GET_SERVICE_STATUS — Deprecated](../../sql-reference/functions/system_get_service_status.md) command as shown in the following
example:

```sqlexample
CALL SYSTEM$GET_SERVICE_STATUS('schema.job_name')
```

This system function returns a JSON object that contains information about the specified job service
within the app. Providers can call this system function from within the app to determine if the services
has started or failed.

Consumers can also call this system function to determine the status of a service. This requires
that providers grant the MONITOR privilege on the service an application role. See
Execute a job service in an app for more information.

## Accessing local container logs

To obtain the system logs for a job service within an app, use the
[SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md) system function as shown in the following
example:

```sqlexample
CALL SYSTEM$GET_SERVICE_LOGS('schema.job_name', 'instance_id', 'container_name'[, 10])
```

Providers can call this system function from within an app. In this context, the provider does not
have to specify the `app_name` as part of the fully qualified job name.

Consumers can also run this system command. This requires that providers grant the MONITOR privilege
on the service to an application role. See Execute a job service in an app for more
information.

---
title: Add services to an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-services.md
section: Native Apps Framework
---

# Add services to an app

The topic describes how to configure and use services in a Snowflake Native App with Snowpark Container Services.
For information on using a job service in an app, see [Add job services to an app](container-services-job.md).

## Privileges required to create a service in the consumer account

In order for an app to create a service in the consumer account, the consumer
must first grant the following privileges:

* CREATE COMPUTE POOL

  This privileges is required for all services. One or more compute pools are required
  to create a service in the consumer account.
* BIND SERVICE ENDPOINT

  This privilege is required for any service that exposes endpoints. If a service needs
  to make connections to URLs outside of Snowflake, this privilege is required for the
  app to create the required external access integration.

## Considerations when creating services within an app

The following considerations apply when creating a service within a Snowflake Native App with Snowpark Container Services:

* References to warehouses. See Best practices when using services within an app for
  information on using in a Snowflake Native App with Snowpark Container Services.
* Quoted names for a service within an app are not supported.
* Services cannot not be created in a versioned schema.
* Services may not be created outside of the application using a container image
  created within the app.

## Best practices when using services within an app

The following are best practices and considerations when using services within
a Snowflake Native App with Snowpark Container Services:

* Create a Streamlit app or stored procedures that allows consumers to interact with
  a service.

  In some situations, a consumer may need to create, start, stop, restart, and
  manage the services provided by the app.
* Use a single stored procedure to verify that the consumer has granted all the
  required privileges.

  A service may require that the consumer grants multiple privileges to the app.
  For example, a service may require the CREATE COMPUTE POOL, CREATE WAREHOUSE,
  BIND SERVICE ENDPOINT and other privileges. An app may also require reference to
  existing objects in the consumer account.

  In this context, Snowflake recommends using a single stored procedure to verify
  that all prerequisites have been met. After all prerequisites are verified,
  this stored procedure would then create the service.
* If a service requires a warehouse to execute queries, the app should
  create the warehouse directly in the consumer account. This requires that the
  consumer grant the CREATE WAREHOUSE global privilege to the app. See
  [Request global privileges from consumers](requesting-privs.md) for more information.
* When creating a service using a specification template, store the arguments provided by the consumer inside
  your application instance. This allows them to be passed as arguments when upgrading
  a service.

## Create a service in an app

To create a service in an app, use the [CREATE SERVICE](../../sql-reference/sql/create-service.md) command in the setup
script. Providers should always consider calling this command from within a stored procedure instead
of running it directly.

Within an app with containers, services can be created using specification file or by using a
[specification template](../snowpark-container-services/working-with-services.md).

### Create a service from a specification file

To create a service a service from a specification file, use the [CREATE SERVICE](../../sql-reference/sql/create-service.md)
command and include a reference to the service specification file:

```sqlexample
CREATE SERVICE IF NOT EXISTS app_service
  IN COMPUTE POOL app_compute_pool
  FROM SPECIFICATION_FILE = '/containers/service1_spec.yaml';
```

This example shows how to create the service using the FROM SPECIFICATION_FILE clause which uses a relative
path to the file. The FROM SPECIFICATION_FILE clause points to the service specification file that is specific
to a version of the app. This path is relative to the app root directory.

However, you can also use a specification file on a stage. See [CREATE SERVICE](../../sql-reference/sql/create-service.md)
for more information.

### Create a service with a specification template

To create a service using a [specification template](../snowpark-container-services/working-with-services.md),
use the FROM SPECIFICATION_TEMPLATE_FILE clause of the [CREATE SERVICE](../../sql-reference/sql/create-service.md) command as shown
in the following example:

```sqlexample
CREATE SERVICE IF NOT EXISTS app_service
  IN COMPUTE POOL app_compute_pool
  FROM SPECIFICATION_TEMPLATE_FILE = '/containers/service1_spec.yaml';
```

See [specification template](../snowpark-container-services/working-with-services.md) for more information.

## Add the CREATE SERVICE command to a stored procedure

A Snowflake Native Apps with Snowpark Container Services supports multiple ways of creating a service within a stored procedure.

* Create a service by using the grant_callback property
* Create a service based on a reference definition
* Create a service using a stored procedure

A provider can use any combination of these methods to create services in the consumer
account.

### Create a service by using the `grant_callback` property

`grant_callback` is a property in the manifest file that allows providers to
specify a callback function. The callback function is a stored procedure that can
create compute pools, services and perform other setup tasks required by the application.

> **Note:**
>
> Using the `grant_callback` property to specify the callback function is only
> supported by Snowflake Native Apps with Snowpark Container Services.

The advantage of specifying a callback function with `grant_callback` is that
the stored procedure is not called until the consumer grants the required privileges
to the app. This ensures that the app has the privileges required to create services
and other objects in the consumer account.

To use `grant_callback`, add it to the `configuration` section of the manifest file:

```yaml
configuration:
  log_level: INFO
  trace_level: ALWAYS
  metric_level: ALL
  log_event_level: INFO
  grant_callback: core.grant_callback
```

Then, in the setup script, define a call back function as shown in the following example:

```sqlexample
 CREATE SCHEMA core;
 CREATE APPLICATION ROLE app_public;

 CREATE OR REPLACE PROCEDURE core.grant_callback(privileges array)
 RETURNS STRING
 AS $$
 BEGIN
   IF (ARRAY_CONTAINS('CREATE COMPUTE POOL'::VARIANT, privileges)) THEN
      CREATE COMPUTE POOL IF NOT EXISTS app_compute_pool
          MIN_NODES = 1
          MAX_NODES = 3
          INSTANCE_FAMILY = GPU_NV_M;
   END IF;
   IF (ARRAY_CONTAINS('BIND SERVICE ENDPOINT'::VARIANT, privileges)) THEN
      CREATE SERVICE IF NOT EXISTS core.app_service
       IN COMPUTE POOL my_compute_pool
       FROM SPECIFICATION_FILE = '/containers/service1_spec.yaml';
   END IF;
   RETURN 'DONE';
 END;
 $$;

GRANT USAGE ON PROCEDURE core.grant_callback(array) TO APPLICATION ROLE app_public;
```

This example creates a `grant_callback` procedure that does the following:

* Tests whether the consumer has granted the CREATE COMPUTE POOL privilege to the app. If the consumer
  has granted this privilege, the `grant_callback` procedure creates the compute pool.
* Tests whether the consumer has granted the BIND SERVICE ENDPOINT privilege to the app. If the consumer
  has granted this privilege, the `grant_callback` procedure creates the service.

This example shows a pattern for creating services and a compute pool in an app with
containers. In this example, the app first tests whether the consumer has granted the required privileges
and then creates the service or compute pool.

### Create a service based on a reference definition

An app can create services using a reference definition by using the
`register_callback` property in the manifest file. This property specifies a
stored procedure used to bind an object in the consumer account to the reference definition.

For more information on using references in an app, see
[Request references and object-level privileges from consumers](requesting-refs.md)

An app can use the `register_callback` of the reference to create a service after all the
required references are bound. If a service is created before all the references to an external access
integrations or secret is allowed, the service creation fails.

### Create a service using a stored procedure

An app can create a service directly within a stored procedure. As with other stored procedures,
providers can define them in the application setup script. This stored procedure would use
the [CREATE SERVICE](../../sql-reference/sql/create-service.md) command to create the service, then grant the necessary privileges
on the stored procedure to an application role.

The consumer would call this stored procedure to create the service in their account
after they have given the app the required privileges and references.

## Determine the status of a service

To determine the status of a service, an app can call the
[SYSTEM$GET_SERVICE_STATUS — Deprecated](../../sql-reference/functions/system_get_service_status.md) system function from the setup script.

This system function returns a JSON object for each container in each service instance.

---
title: Allow access to a consumer account
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-granting-privs.md
section: Native Apps Framework
---

# Allow access to a consumer account

This topic describes how a consumer can allow a Snowflake Native App to create and access objects in
their account. This includes granting the privileges requested by an app or enabling access to existing
objects by using references. It also describes how to allow an app to use external and Apache Iceberg™
tables that a provider shares in the app.

## Privileges and references requested by an app

In a simple Snowflake Native App, all of the objects required by the app are created inside the
application object when the setup script runs during installation. All of the objects
required by the app are created in and accessed within the installed app.
The consumer does not need to perform any actions in their account.

However, some apps might ask the consumer to perform the following types of actions in their account:

* Create a database or warehouse.
* Execute tasks.
* Access existing objects such as tables.

There are two types of access that a Snowflake Native App can request:

* Privileges that allow the app to perform some account-level operations. An app can request the
  following global privileges:

  + EXECUTE TASK
  + EXECUTE MANAGED TASK
  + CREATE WAREHOUSE
  + MANAGE WAREHOUSES
  + CREATE DATABASE
  + CREATE COMPUTE POOL
  + BIND SERVICE ENDPOINT
  + READ SESSION

  Some apps might also request the IMPORTED PRIVILEGES privilege on the SNOWFLAKE database.
  See Grant the IMPORTED PRIVILEGES privilege on the SNOWFLAKE database.
* References that allow the app to access objects that already exist in the consumer
  account and are outside the application object. A provider defines the references required by the
  app in the manifest file.

  After installing the app, the consumer can authorize access on an object by creating a
  [reference](../../sql-reference/references.md) that associates the object to the app.

  An app can request access to the following types of objects and their corresponding privileges:

  | Object Type | Privileges Allowed |
  | --- | --- |
  | TABLE | SELECT, INSERT, UPDATE, DELETE, TRUNCATE, REFERENCES |
  | VIEW | SELECT, REFERENCES |
  | EXTERNAL TABLE | SELECT, REFERENCES |
  | FUNCTION | USAGE |
  | PROCEDURE | USAGE |
  | WAREHOUSE | MODIFY, MONITOR, USAGE, OPERATE |
  | API INTEGRATION | USAGE |

A consumer can approve these requests using [Snowsight](../../user-guide/ui-snowsight-gs.md) or by running the SQL commands.

> **Note:**
>
> If you do not grant the requested privileges or associate references on the requested object to the
> app, parts of the app may not function correctly.

## Manage access requests using Snowsight

If a provider implements a user interface in a Snowflake Native App, a consumer may perform the following using
[Snowsight](../../user-guide/ui-snowsight-gs.md).

* View and grant global privileges.
* Authorize access to existing objects in the consumer account.

### Grant global privileges

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Select the Privileges tab.

   The account level permissions requested by the app appear under
   Account level privileges
6. In the Account level privileges pane, select the Edit icon and then move the slider for each privilege that you want to grant.
7. Select Update Privileges.

### Authorize access to specific objects

If a provider implements a user interface for a Snowflake Native App, a consumer can use [Snowsight](../../user-guide/ui-snowsight-gs.md)
to authorize access on objects in their account.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Select the Privileges tab.
6. In the Object access privileges pane, select Add next to the object to which
   you want to authorize access.
7. Select Select Data and choose the data product to which you want to authorize access.
8. Select Save.

### Revoke privileges and access to objects

Revoking privileges or removing access from objects can cause the application to become unstable or stop working.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Select the Privileges tab.
6. In the Account level privileges pane, select the Edit icon and then move the slider for the privilege you want to revoke.
7. Select Update Privileges.

## Manage privileges for an app by using SQL commands

If your provider does not implement an interface for granting privileges, you must
use SQL commands to manage application access requests.

### View the privileges requested by an app

When a provider specifies the privileges required by the app, the privilege request is
included as part of the installed app. You can view these privileges after installing
the app.

To view the privileges required by an app, run the
[SHOW PRIVILEGES](../../sql-reference/sql/show-privileges.md) command as shown
in the following example:

```sqlexample
SHOW PRIVILEGES IN APPLICATION hello_snowflake_app;
```

### Grant privileges to a Snowflake Native App

After a consumer determines the privileges requested by an app, they can grant those privileges to
the app.

For example, to grant the EXECUTE TASK privilege to an app, run the
[GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command as shown
in the following example:

```sqlexample
GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION hello_snowflake_app;
```

### Grant the MANAGE WAREHOUSES privilege to a Snowflake Native App

The [MANAGE WAREHOUSES privilege](../../user-guide/warehouses-tasks.md)
allows an app to create, modify, and use warehouses within the consumer account. To grant the
MANAGE WAREHOUSES privilege to an app, use the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md)
as shown in the following example:

```sqlexample
GRANT MANAGE WAREHOUSES ON ACCOUNT TO APPLICATION hello_snowflake_app;
```

### Grant the IMPORTED PRIVILEGES privilege on the SNOWFLAKE database

Some apps might request that a consumer grants the IMPORTED PRIVILEGES privilege on the
SNOWFLAKE database in their account. This privilege can only be granted using SQL commands. It cannot
be granted using [Snowsight](../../user-guide/ui-snowsight-gs.md). If an app requires this privilege, the provider should
communicate this requirement to the consumer, for example, in the README file of the app.

To grant the IMPORT privilege on the SNOWFLAKE database, run the following command:

```sqlexample
GRANT IMPORTED PRIVILEGES ON DATABASE SNOWFLAKE TO APPLICATION hello_snowflake_app;
```

> **Note:**
>
> The IMPORTED PRIVILEGES privilege allows the app to access information about usage and costs associated with
> the consumer account. A consumer should ensure that they want to share this information with the
> app before granting this privilege.

## Manually authorize access to objects

When a provider defines a reference to an object in the manifest file, this reference
definition is included as part of the installed app. A consumer can create a reference to an object
in their account to authorize the app to access the object. If the provider did not create a user
interface for allowing access to objects in the consumer account, the consumer can authorize access
manually.

The consumer can create a reference for an object to associate with the app if they have the requested
privileges on the object. For example, if SELECT and INSERT privileges are required for an object, for
example a table, the consumer must create the reference using a role that has the SELECT and INSERT privileges
on the table. To view the object types and the specific required privilege grants for each object,
see View the References Requested by an App.

> **Note:**
>
> A reference does not grant any privileges on the object. If the role used to create the reference loses
> privileges on the object, the reference is no longer valid. The consumer must do one of the following:
>
> * Restore the required privileges to the role that created the reference.
> * Recreate the reference using a role with the required privileges on the object.

### View the references requested by an app

A consumer can view the references requested by an app by running the
[SHOW REFERENCES](../../sql-reference/sql/show-references.md) command as shown in
the following example:

```sqlexample
SHOW REFERENCES IN APPLICATION hello_snowflake_app;
```

This command displays a list of all the references defined in the app. It also displays the privileges
that the consumer role must have on the object in order to create the reference.

### Create the reference and associate the reference to the app

After viewing the references requested by the app,
a consumer can create the reference by running the [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) system
function as shown in the following example:

```sqlexample
SELECT SYSTEM$REFERENCE('table', 'db1.schema1.table1', 'persistent', 'select', 'insert');
```

This command creates the reference and returns an identifier for the object. The identifier looks
similar to the following example:

```output
ENT_REF_TABLE_16617302895522_2CDD20F5C047A5B87B2CE36F6837715786AF9F2D
```

The consumer passes this identifier to a callback stored procedure to associate the reference to the app.

> **Note:**
>
> The consumer must run this command for each reference requested by the app.

To associate a reference to an app, the consumer must pass the identifier returned by calling
the [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md)
system function to a callback stored procedure. A callback procedure is a stored procedure that the
provider creates in the Snowflake Native App to associates a reference to the app.

To use a callback procedure, run the following command:

```sqlexample
CALL app.config.register_single_reference(
  'consumer_table', 'ADD', 'ENT_REF_TABLE_16617302895522_2CDD20F5C047A5B87B2CE36F6837715786AF9F2D');
```

In this example, the `register_single_reference()` stored procedure associates the reference with the
identifier `ENT_REF_TABLE_16617302895522_2CDD20F5C047A5B87B2CE36F6837715786AF9F2D` to the app.

> **Note:**
>
> A provider can include different callback procedures in an app. These should be specified in the
> README file of the app.

### Create and associate the reference to the app in a single step

After viewing the references requested by the application,
a consumer can create the reference and associate it to the app by passing the SYSTEM$REFERENCE
system function as an argument to a callback stored procedure.

The following example shows the syntax for passing the SYSTEM$REFERENCE system function as an argument to
a callback stored procedure:

```sqlexample
CALL app.config.register_single_reference(
 'consumer_table', 'ADD', SYSTEM$REFERENCE('table', 'db1.schema1.table1',
 'PERSISTENT', 'SELECT', 'INSERT'));
```

This example creates the reference and passes the identifier to the callback function to associate the
reference to the app.

## Enable external and Apache Iceberg™ tables

The Snowflake Native App Framework allows providers to share external and Apache Iceberg™ tables in the provider shares with consumers
in the app. However, consumers must give the app permission to access these tables.

### Security and cost considerations

When allowing an app to accesses an external or Iceberg table, consumers
should be aware of the following:

* External and Iceberg tables may pose data exfiltration risks to the consumer. For example, if an
  app exposes a view that contains an external table, a provider may be able to determine the types of
  queries the consumer makes by using their cloud provider access logs.
* External and Iceberg tables may incur additional costs related to egress and ingress usage if the
  object store containing the table is not in the same region where the app is published.

### Enable external and Iceberg tables using Snowsight

Providers can configure the app to display a dialog to all consumers to allow an app to access
an external or Iceberg tables.

To allow an app to access to an external or Iceberg table:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app.
4. In the toolbar, select Settings.
5. Select the Privileges tab.
6. Under External data access, select Review.
7. Select Enable.

### Enable external and Iceberg tables using SQL

To enable access to external and Iceberg tables by using SQL use the
SET_APPLICATION_RESTRICTED_FEATURE_ACCESS system function as shown in the following
example:

```sqlexample
SELECT SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS(hello_snowflake_app, 'external_data', '{"allowed_cloud_providers" : "all"}');
```

This command allows the `hello_snowflake_app` app to access the external or Iceberg tables in the
that the app uses.

To determine if external and Iceberg tables have been enabled for an app, use the
LIST_APPLICATION_RESTRICTED_FEATURES system function as shown in the following example:

```sqlexample
SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES('hello_snowflake_app')
```

This system function returns a JSON object that indicates if external and Iceberg tables are allowed
the for the `hello_snowflake_app`.

---
title: Allow an app to create resources in the consumer account
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-auto-privs.md
section: Native Apps Framework
---

# Allow an app to create resources in the consumer account

This topic describes how consumers can use automated granting of privileges to allow a Snowflake Native App to
create objects in the consumer account.

## Overview of automated granting of privileges

Often, an app needs to create or access objects or perform other actions in a consumer
account. This requires the consumer to grant the required privileges that allow the app
to perform these actions.

Auto privileges allow providers to specify the required privileges in the manifest file of
an app. When the consumer installs or upgrades an app, the privileges specified in the manifest
are automatically granted to the app by Snowflake.

## Security considerations when using automated granting of privileges

When a provider configures an app to use
`manifest_version: 2` in the manifest file, automated granting of
privileges is enabled. By default this allows Snowflake to automatically
grant certain privileges to the app. For information on the privileges
that can be automatically granted to the app, see
[Privileges granted by automated granting of privileges](requesting-auto-privs.md).

During installation, Snowsight displays a notification about
the privileges requested by the app. When a consumer installs an app
that uses automated granting of privileges, they agree that the app may
be granted these privileges during upgrades without requiring additional
consent.

Consumers can create feature policies that restrict the objects an app
can create. For more information on creating feature policies, see
[Use feature policies to limit the objects an app can create](ui-consumer-feature-policies.md).

## Privileges granted by automated granting of privileges

When using automated granting of privileges, a provider can add the following privileges to the manifest
file of the app:

* EXECUTE TASK
* EXECUTE MANAGED TASK
* CREATE WAREHOUSE
* CREATE COMPUTE POOL
* BIND SERVICE ENDPOINT
* CREATE DATABASE
* CREATE EXTERNAL ACCESS INTEGRATION
* CREATE SECURITY INTEGRATION

> **Note:**
>
> For restrictions on the CREATE EXTERNAL ACCESS INTEGRATION privilege, see
> Restrictions on the CREATE EXTERNAL ACCESS INTEGRATION and CREATE SECURITY INTEGRATION.

## Restrictions on the CREATE EXTERNAL ACCESS INTEGRATION and CREATE SECURITY INTEGRATION

The CREATE EXTERNAL ACCESS INTEGRATION and CREATE SECURITY INTEGRATION privileges allows an app
to create the objects in the consumer account that are required to connect to an external endpoint.
However, to allow connections to an external endpoint, consumers must also approve the app specification
which allows the app to connect to external hosts. If a consumer does not approve the app specification,
the external connection remains disabled.

For more information, see [Approve app specifications](ui-consumer-app-spec.md).

---
title: App config SQL reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/app_config_reference.md
section: Native Apps Framework
---

# App config SQL reference

File: `configuration/app_config.sql`

## Database objects and procedures

### STATE.APP_CONFIG

An internal table to store all the connector configurations. This table follows the following structure:

| KEY | VALUE | UPDATED_AT |
| --- | --- | --- |
| connector_configuration | {warehouse: “wh”, destination_db: “db”, destination_schema: “s”} | TIMESTAMP_NTZ_1 |
| custom_configuration | {journal_table: “j_table_name”} | TIMESTAMP_NTZ_2 |
| connection_configuration | {secret_name: “secret_db.schema.the_secret”} | TIMESTAMP_NTZ_3 |
| … | {…} | … |

### PUBLIC.CONNECTOR_CONFIGURATION

A view that retrieves and maps the data from the `APP_CONFIG` internal table
The mapping is as follows:

1. KEY (col) → CONFIG_GROUP (col);
2. JSON keys from VALUE column (JSON key) → CONFIG_KEY (col)
3. JSON values from VALUE column (JSON value) → VALUE (col)
4. UPDATED_AT (col) → UPDATED_AT (col)

Example CONNECTOR_CONFIGURATION view created on example APP_CONFIG:

| CONFIG_GROUP | CONFIG_KEY | VALUE | UPDATED_AT |
| --- | --- | --- | --- |
| connector_configuration | warehouse | wh | <timestamp_ntz> |
| connector_configuration | destination_db | db | <timestamp_ntz> |
| custom_configuration | journal_table | j_table_name | <timestamp_ntz> |
| connection_configuration | secret_name | secret_db.schema.the_secret | <timestamp_ntz> |
| … | … | … | … |

## Related Java objects

The following Java objects are tightly connected with the `APP_STATE` table:

* `ConnectorConfigurationService`
* `ConfigurationRepository`
* `ConfigurationMap`
* `KeyValueTable`

---
title: Appeal a failed security review
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-appeal.md
section: Native Apps Framework
---

# Appeal a failed security review

This topic describes how to file an appeal for an app that failed has the security review process.

## Guidelines for appealing a failed security review

The following guidelines apply to the appeals process for a failed security review:

* Before submitting an appeal for a failed security review, review the security policies
  outlined in the following topics:

  + [Security requirements and guidelines for a Snowflake Native App](security-overview.md)
  + [Run the automated security scan](security-run-scan.md)
  + [Security requirements and best practices for a Snowflake Native App](security-app-requirements.md)
  + [Secure a Snowflake Native App with Snowpark Container Services](security-na-spcs.md)
* Snowflake does not provide information about the details of the security scan of an app.
* Snowflake allows one appeal per patch of an app. You must provide all information about
  your appeal in a single support case. Snowflake rejects subsequent appeals for the same
  patch of an app.
* If you do not include all required information in the support case for your appeal, the
  appeal may be rejected without review.

## Submit an appeal for a failed security review

To submit an appeal, you must [file a support case](../../user-guide/contacting-support.md) that
includes the following information for each field:

* Summary: Use the following format in the summary of the issue:

  ```text
  Appeal <App Name>, <Version>, <Patch>
  ```
* Description: Provide the following information in the description of the issue:

  + Application information: App Name, Version, Patch
  + Rejection reason(s): paste in the rejection reason & code located in Projects » App packages
  + Information required to appeal the rejection(s): Identify your rejection reason and include all information under “information required to appeal the rejection.”
* Category: Select General Administration
* Subcategory: Select Other
* Severity: 4

  All appeals have a turnaround time of 3-5 business days (Monday to Friday).
  Cases submitted with a higher severity may be downgraded to Severity 4.

> **Warning:**
>
> If you do not provide the information outlined above, your appeal may be rejected without
> review.

## Rejection reasons and information required for appeals

There are multiple reasons why an app may fail the security review. Before filing an appeal,
ensure that you have reviewed the following topics:

> * [Security requirements and guidelines for a Snowflake Native App](security-overview.md)
> * [Run the automated security scan](security-run-scan.md)
> * [Security requirements and best practices for a Snowflake Native App](security-app-requirements.md)
> * [Secure a Snowflake Native App with Snowpark Container Services](security-na-spcs.md)

Possible reasons for rejection include:

* All app code must be defined in the application package.
* All app code must be un-obfuscated.
* Dependencies and libraries must not contain critical or high CVEs.
* An app cannot store or require customer secrets to be in plain text
* Apps must not contain functionality harmful to Snowflake, customers, or 3rd parties.
* Apps must communicate required privileges to the consumer.
* Apps must only request the minimum set of privileges possible.

### App code must be defined in the application package

Snowflake security policies require that all the application code, including all library dependencies
and setup code, must be included in the app version defined in the application package.

**Reason for the rejection**

Your app is using code that is not available for review in the application package. This may be from
code that exists in a source that is outside the application package.

**How to fix this issue**

Update the app to include all the code required by the app in the application package.

Additional context is provided in the rejection reason.

**Information required to appeal the rejection**

If your app imports data from outside the application package, this can cause the app to be rejected.
This can be from tables not in the consumer account or through other external integrations.

Please provide a list of all the data imported by the app and the details about the use of the data.

### All app code must be un-obfuscated

Snowflake security policy requires all application code to be un-obfuscated, meaning that the code must be
human readable. This requirement includes minified JavaScript code.

**Reason for the rejection**

Your application includes obfuscated code that could not be reviewed by Snowflake. This could be
due to minified javascript or other forms of obfuscation like encryption or encoding. Please update
the app to remove all obfuscated code.

Additional context is provided in the rejection reason.

**Information required to appeal the rejection**

Appeals are only allowed for minified JavaScript. Please provide the location of the corresponding
source map file to the minified JavaScript.

### Dependencies and libraries must not contain critical or high CVEs.

Snowflake security policy requires all dependencies or libraries with critical or high Common
Vulnerabilities and Exposures (CVE) to be updated to a secure version, if available. See
[Common Vulnerabilities and Exposures (CVE) considerations](security-cve.md) for more information on identifying CVEs in a Snowflake Native App.

**Reason for the rejection**

An app may be rejected if you are using components that have known CVEs that can be harmful to
consumers if exploited. The specific CVEs in your app are provided in the rejection reason.

Different tools can detect different results based on their configuration, internal policies and depth
of scanning. Snowflake’s tools are configured to enforce the Snowflake Marketplace policies. Snowflake
may identify CVEs that you do not find in your own CVE scanning.

**Information required to appeal the rejection**

To appeal this rejection, you must provide the following information:

* Justification for why the CVE cannot be exploited in your app.
* A reachability analysis report, if available.
* A plan for an update to the fixed version.
* If there are no plans for update, provide a detailed explanation for why a vulnerable version
  cannot be updated to a fixed version.

### An app must not store or require plain text customer secrets

Snowflake security policy requires that apps do not store or require any customer secrets
to be in plain text.

**Reason for the rejection**

This result indicates that some customer secrets are stored in plain text.

Additional context is provided in the rejection reason.

**Information required to appeal the rejection**

If your app stores customer secrets, you must provide details of the secrets stored and their
uses. Also, provide details about how the secrets are stored.

> **Caution:**
>
> Do not include the secrets in your support ticket.

### Apps must not contain functionality harmful to Snowflake, customers, or 3rd parties

Snowflake’s security policy requires that applications do not contain any functionality that
could result in harm to Snowflake, its customers, or third parties.

**Reason for the rejection**

Your app contains functionality that Snowflake deems harmful.

**Information required to appeal the rejection**

Rejections due to this reason cannot be appealed.

### Apps must communicate required privileges to the consumer

Snowflake security policy requires that apps must provide all privileges required by the
app on all objects and all API integrations.

**Reason for the rejection**

This rejection may occur when the app requests that a consumer grants privileges on an
object without communicating the required privileges to the consumer in advance.

**How to resolve this issue**

To resolve this issue, you must provide information about the permissions required by the
application and objects created by the application before asking the consumer to grant
privileges.

**Information required to appeal the rejection**

To appeal this rejection, provide the following information in your support ticket:

* A list of all the permissions required by the application.
* A list of all the objects created by the application.
* The location in the application where the privileges are disclosed to the consumer
  before asking the consumer to grant the privileges.

### Apps must only request the minimum set of privileges possible

Snowflake security policy requires that applications should only ask for the minimum set of
privileges needed for the application to function.

**Reason for the rejection**

The app is requesting broad permissions in the consumer account. For example, the app is
requesting ownership on a database when usage permissions might be sufficient.

**How to resolve this issue**

To resolve this issue, modify your app to request only the minimum required privileges
for the application to function.

**Information required to appeal the rejection**

To appeal this rejection, provide the following information in your support ticket:

* A list of all the permissions required by the application.
* A list of all the objects created by the application.
* A detailed explanation for the use of any account-level privileges, ownership grants
  or admin role requests/grants.

---
title: Application configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/app-configuration.md
section: Native Apps Framework
---

# Application configuration

This topic describes how a Snowflake Native App can use application configuration objects to request
input from the consumer.

## Application configuration: Overview

An application configuration is a key-value pair that provides a coordination mechanism between
a Snowflake Native App and the consumer. When a Snowflake Native App requires input from the consumer, it
defines a configuration key along with a description explaining the purpose of the configuration.
The consumer then provides the value for that key.

Application configuration supports the following types:

`APPLICATION_NAME`
:   The consumer provides the name of an installed app in the consumer account. This type
    is used for [inter-app communication](inter-app-communication.md).

`STRING`
:   The consumer provides an arbitrary string value. This type can be used for a variety of
    use cases, such as providing external URLs, account identifiers, or other app-specific settings.

The application configuration workflow involves the following steps:

1. The app creates a configuration definition using `ALTER APPLICATION SET CONFIGURATION DEFINITION`,
   specifying the type of information needed and the app roles that have access to the configuration.
2. The consumer views incoming configuration requests using `SHOW CONFIGURATIONS` or Snowsight.
3. The consumer provides the requested value using `ALTER APPLICATION SET CONFIGURATION VALUE` or Snowsight.
4. The app retrieves the value and uses it to perform further operations,
   such as creating an application specification for a connection.

The Snowflake Native App Framework provides callbacks to notify the app when a configuration value is set or changed.
For more information, see [Configuration callbacks](callbacks.md).

## Terminology

Application configuration uses the following terms:

Configuration definition
:   An object that the app creates to request a specific piece of information from the consumer.
    The configuration definition specifies the type of information requested, a label, a description,
    and the app roles that have access to the configuration.

Configuration value
:   The value that the consumer provides in response to a configuration definition request.

## Using configurations

This section describes how to create, display, and manage configurations.

* Create a configuration request
* View configuration requests
* Provide the configuration value
* Update the value of a configuration
* Unset the value of a configuration
* Retrieve the value of a configuration

### Create a configuration request

To request a configuration value from the consumer, the app creates a configuration definition
in the setup script or at runtime. The configuration definition specifies the type of value
expected, a label and description that are displayed to the consumer, and the app roles that
can view the configuration and edit the value.

The following example shows how to create a configuration definition of type `STRING` that requests a
company URL from the consumer:

```sqlexample
ALTER APPLICATION SET CONFIGURATION DEFINITION company_url
  TYPE = STRING
  LABEL = 'Company URL'
  DESCRIPTION = 'Provide the company website URL'
  APPLICATION_ROLES = (app_user)
  SENSITIVE = FALSE;
```

The following properties control how the configuration is displayed and managed:

* `LABEL`: The name displayed to the consumer in Snowsight.
* `DESCRIPTION`: A description that helps the consumer understand the purpose of the
  configuration.
* `APPLICATION_ROLES`: The app roles that can view and set the value for this configuration.
  Consumer roles that are granted one of the specified app roles can view the configuration and edit its value.
* `SENSITIVE`: Specifies whether the configuration value should be treated as sensitive. When
  set to `TRUE`, the value is not displayed in the output of `SHOW CONFIGURATIONS`. For more information, see Sensitive configurations.

### View configuration requests

After an app creates a configuration request, the consumer can view pending requests
using SQL or Snowsight.

SQLSnowsight

To view the configuration requests and details of a configuration definition using SQL, use the [SHOW CONFIGURATIONS](../../sql-reference/sql/show-configurations.md) and [DESCRIBE CONFIGURATION](../../sql-reference/sql/desc-configuration.md) commands:

```sqlexample
SHOW CONFIGURATIONS IN APPLICATION example_app;

DESCRIBE CONFIGURATION company_url IN APPLICATION example_app;
```

To view the configuration requests and details of a configuration definition using Snowsight, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app.
4. Open the Security tab. The Configurations section displays the string configurations in the Other configurations section. Each string configuration shows the following:

   * The label of the configuration.
   * A description of what the configuration is for.
   * A Review button.

### Provide the configuration value

After the app creates a configuration request, the consumer provides the requested value using SQL or Snowsight.

SQLSnowsight

To provide a value for the configuration using SQL, use the [ALTER APPLICATION SET CONFIGURATION VALUE](../../sql-reference/sql/alter-application-set-configuration-value.md) command:

```sqlexample
ALTER APPLICATION <app> SET CONFIGURATION <config> VALUE = '<value>';
```

To provide a value for the configuration using Snowsight, do the following:

1. In the configuration’s details page in Snowsight, click the Review button. The configuration details page displays the following:

   * The label of the configuration.
   * A description of what the configuration is for.
   * If the configuration is sensitive, a Sensitive data protection banner is displayed. For more information, see Sensitive configurations.
2. Provide the value for the configuration in the Value field.
3. Click the Save button to submit the value for the configuration. Configuration updated successfully is displayed. The configuration list is refreshed to display the new value.

### Update the value of a configuration

You can update the value of a configuration using SQL or Snowsight.

SQLSnowsight

To update a configuration value, use the same syntax as setting the initial value:

```sqlexample
ALTER APPLICATION <app> SET CONFIGURATION <config> VALUE = '<value>';
```

To update the value of a configuration using Snowsight, do the following:

1. In the configuration’s details page in Snowsight, if a configuration has a value set, the following information displays:

   * A Configured banner.
   * If the configuration is not sensitive, the value is displayed.
   * If the configuration is sensitive, the value is masked.
   * An Update button.
   * A Clear value button.
2. Click the Edit button to update the value for the configuration.
3. Provide the new value for the configuration in the Value field.
4. Click the Save button to submit the new value for the configuration. Configuration updated successfully is displayed. The configuration list is refreshed to display the new value.

### Unset the value of a configuration

You can unset the value of a configuration using SQL or Snowsight.

SQLSnowsight

To unset the value of a configuration using SQL, use the [ALTER APPLICATION UNSET CONFIGURATION](../../sql-reference/sql/alter-application-unset-configuration.md) command:

```sqlexample
ALTER APPLICATION <app> UNSET CONFIGURATION <config>;
```

To unset the value of a configuration using Snowsight, do the following:

1. In the configuration’s details page in Snowsight, click the Clear value button.
2. Confirm the action. The configuration list is refreshed to display the unset configuration.

### Retrieve the value of a configuration

In addition to SHOW CONFIGURATIONS or DESCRIBE CONFIGURATION, an app can retrieve the value of a configuration that the consumer provided using the [get_configuration_value](../../sql-reference/functions/get_configuration_value.md) function. The following example shows how to retrieve the value of a configuration:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'GET_CONFIGURATION_VALUE' , '<config_name>')
```

> **Note:**
>
> Only the app can retrieve the configuration value from the system context. To view the configuration value as a consumer, you can view the configuration details either using SQL or Snowsight. For more information, see View configuration requests.

## Sensitive configurations

When an app creates a configuration, it can mark the configuration as sensitive by setting
`SENSITIVE = TRUE`. This is useful when the app needs to request sensitive information
from the consumer, such as a personal access token or an API key.

> **Note:**
>
> The `SENSITIVE` property is only supported for configurations of type `STRING`.

When a configuration is sensitive, the consumer-provided value is protected from other consumer
users and roles. The Snowflake Native App Framework applies protections similar to those used for
[SECRET objects](../../sql-reference/sql/create-secret.md) in Snowflake:

* After the consumer sets a value, the query history for the
  `ALTER APPLICATION SET CONFIGURATION VALUE` command redacts the value so that it is not
  exposed to other consumer roles or users.
* The value is not displayed in the output of `SHOW CONFIGURATIONS`,
  `DESCRIBE CONFIGURATION`, INFORMATION_SCHEMA views, or ACCOUNT_USAGE views.

The app that creates the configuration can always retrieve the consumer-provided value,
even when the configuration is sensitive. This is by design, because the purpose of an
application configuration is for the consumer to provide a value to the app.

### Changing the SENSITIVE property

An app cannot change the `SENSITIVE` property while the configuration has a value set
(that is, when the configuration is not in a `PENDING` state). This restriction prevents the
consumer’s value from being accidentally exposed. If the app attempts to change the
`SENSITIVE` property while a value is set, the command completes without error but has
no effect.

To change the `SENSITIVE` property, the consumer must first unset the configuration value
using `ALTER APPLICATION UNSET CONFIGURATION`.

## SQL reference

The following SQL commands, functions, and views are used to manage application configurations.

### SQL commands

* [ALTER APPLICATION SET CONFIGURATION DEFINITION](../../sql-reference/sql/alter-application-set-configuration-definition.md): Creates or updates an application configuration definition that requests a value from the consumer.
* [ALTER APPLICATION DROP CONFIGURATION DEFINITION](../../sql-reference/sql/alter-application-drop-configuration-definition.md): Deletes an application configuration definition.
* [ALTER APPLICATION SET CONFIGURATION VALUE](../../sql-reference/sql/alter-application-set-configuration-value.md): Sets a value in an application configuration.
* [ALTER APPLICATION UNSET CONFIGURATION](../../sql-reference/sql/alter-application-unset-configuration.md): Unsets the value of an application configuration.
* [SHOW CONFIGURATIONS](../../sql-reference/sql/show-configurations.md): Lists all of the application configurations in an app.
* [DESCRIBE CONFIGURATION](../../sql-reference/sql/desc-configuration.md): Describes the details of an application configuration.

### SQL functions

* [IS_CONFIGURATION_SET (SYS_CONTEXT function)](../../sql-reference/functions/is_configuration_set.md): Returns whether or not the configuration has a value set.
* [GET_CONFIGURATION_VALUE (SYS_CONTEXT function)](../../sql-reference/functions/get_configuration_value.md): Returns the current value of a configuration.

### Information schema views and functions

* [APPLICATION_CONFIGURATIONS view](../../sql-reference/info-schema/application_configurations.md): This Information Schema view displays a row for each application configuration currently defined in the specified or current database where the INFORMATION_SCHEMA is located.
* [APPLICATION_CONFIGURATION_VALUE_HISTORY](../../sql-reference/functions/application_configuration_value_history.md): Returns the history of values for a configuration.

### Account usage schema views

* [APPLICATION_CONFIGURATIONS view](../../sql-reference/account-usage/application_configurations.md): This Account Usage view displays a row for each application configuration in the account.
* [APPLICATION_CONFIGURATION_VALUE_HISTORY view](../../sql-reference/account-usage/application_configuration_value_history.md): This Account Usage view displays the history of values for a configuration.

## Callbacks

When a configuration value changes, the Snowflake Native App Framework can invoke lifecycle callbacks registered in the
app’s [manifest](manifest-reference.md) file. These callbacks let the app validate, prepare for, or react to
configuration changes. For example, when configuring inter-app communication, a common use case is to use the
[before_configuration_change](callbacks.md)
callback to automatically create or update a connection specification when the consumer sets
the server app name. This avoids requiring the consumer to perform additional manual steps
after setting the configuration value. For more information about inter-app communication, see [Inter-app Communication](inter-app-communication.md).

The following configuration callbacks are available:

[validate_configuration_change](callbacks.md)
:   A synchronous callback called as part of the `ALTER APPLICATION SET CONFIGURATION VALUE`
    command. Lets the app perform custom validation on the provided value. If the callback
    returns an error, the command fails and the new value is not set.

[before_configuration_change](callbacks.md)
:   A synchronous callback called as part of the `ALTER APPLICATION SET CONFIGURATION VALUE`
    and `ALTER APPLICATION UNSET CONFIGURATION` commands. Lets the app perform operations
    based on the configuration value before it is saved.

[after_configuration_change](callbacks.md)
:   An asynchronous callback called after the `ALTER APPLICATION SET CONFIGURATION VALUE`
    or `ALTER APPLICATION UNSET CONFIGURATION` commands complete. Lets the app react to
    the change, for example for notification or tracking purposes.

For complete callback signatures and return values, see
[Callbacks](callbacks.md).

---
title: Approve app specifications
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-app-spec.md
section: Native Apps Framework
---

# Approve app specifications

This topic describes how consumers can use app specifications to approve
requests for external connections, data sharing, and other controlled operations for a Snowflake Native App.

## About app specifications

App specifications allow providers to specify the external (outside of Snowflake) endpoints and
resources that an app requires. Consumers can view the end points that the app is requesting and
approve or decline them as appropriate.

After a consumer approves the app specification, the app has permissions to connect to these endpoints.
App specifications only allow a consumer to approve connections to external resources.

The app can also request privileges to create objects, including external access integrations. For
more information, see [Allow an app to create resources in the consumer account](ui-consumer-auto-privs.md).

### Status of an app specification

An app specification has a status that indicates whether a consumer has approved or declined it.
The possible statuses are:

* `PENDING` The consumer has not approved or declined the app specification.
  This is the default status.
* `APPROVED` The consumer has approved the app specification.
* `DECLINED` The consumer has declined the app specification.

For information on determining the status of an app specification, see View the external end points required by the app.

### Sequence numbers of an app specification

Sequence numbers are used to uniquely identify a version of the app specification. Sequence numbers
are automatically incremented when a provider changes the definition of the app specification.
The definition of an app specification includes configuration and other required information. Fields
that are not part of the definition, such as `description`, do not trigger an update to the
sequence number.

Sequence numbers allow providers and consumers to know the current status of the app specification and
which external endpoints have been enabled.

## View the app specifications of an app

To view the external endpoints requested by an app, consumers can use the
[SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md) command as shown in the following example:

```sqlexample
SHOW SPECIFICATIONS IN APPLICATION hello_snowflake_app;
```

This command lists information about the app specifications of the app named
`hello_snowflake_app`.

The `status` column shows whether the app specification has been approved, declined, or is
still pending. See Status of an app specification for more information.

## View the external end points required by the app

To view the external endpoints required by the app, consumers can view the details of
the app specification by using the [DESCRIBE SPECIFICATION](../../sql-reference/sql/desc-specification.md) or
[SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md) commands as shown in the following examples:

```sqlexample
DESC SPECIFICATION my_app_specification IN APPLICATION hello_snowflake_app;
SHOW SPECIFICATIONS IN APPLICATION hello_snowflake_app;
```

For each sequence number, this command displays the properties of the app specification and
their values.

The `definition` field contains a list of the external hosts ports that the app is requesting.
See Sequence numbers of an app specification.

## Approve an app specification by using Snowsight

Using Snowsight, consumers can approve or deny an app specification.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app.
4. Select the Settings icon in the toolbar.
5. Select Connections.
6. Next to the connection you want to approve, expand Details.

   Snowsight displays the external access integrations, network rules, and requested
   endpoints for the app.
7. Approve or deny the requested endpoints:

   * To approve the endpoints, select …, then select Approve.
   * To deny the endpoints, select …, then select Deny.

## Approve or decline an app specification by using SQL

Consumers can approve or decline an app specification to allow the app to connect to
external endpoints.

### Privileges required to approve or decline an app specification

To approve or decline an app specification, a role must have the
MANAGE APPLICATION SPECIFICATIONS privilege on the account. This privilege is granted by default to
the SECURITYADMIN role. Users with the SECURITYADMIN role can grant this privilege to
other roles as required.

> **Note:**
>
> Because approving an app specification allows an app to access endpoints outside Snowflake,
> a role must have the MANAGE APPLICATION SPECIFICATIONS privilege on the account as
> delegated by the security administrator of the consumer account. Being
> the owner of the app does not grant the necessary privileges.

### Approve an app specification by using SQL

To approve an app specification, consumers can run the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md)
command as shown in the following example:

```sqlexample
ALTER APPLICATION hello-snowflake-app APPROVE SPECIFICATION
  my-app-spec SEQUENCE_NUMBER = 2;
```

This command approves the app specification named `my-app-spec` for the app named
`hello-snowflake-app`.

Consumers can obtain the value for `SEQUENCE_NUMBER` by running the
[DESCRIBE SPECIFICATION](../../sql-reference/sql/desc-specification.md) or [SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md) command.

### Decline an app specification by using SQL

To decline an app specification, consumers can run the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command
as shown in the following example:

```sqlexample
ALTER APPLICATION hello-snowflake-app DECLINE SPECIFICATION
  my-app-spec SEQUENCE_NUMBER = 2;
```

---
title: Callbacks
source: https://docs.snowflake.com/en/developer-guide/native-apps/callbacks.md
section: Native Apps Framework
---

# Callbacks

This topic describes the callbacks that are available for Snowflake Native Apps.

The Snowflake Native App Framework provides callbacks to help manage the app lifecycle. You can
use these callbacks to enhance your app’s functionality and workflow.

To use callbacks, add them to the `lifecycle_callbacks` section of the manifest file, as
in the following example:

```yaml
lifecycle_callbacks:
    before_configuration_change: app_schema.before_config_change_callback
```

## Types of callbacks

The Snowflake Native App Framework provides synchronous and asynchronous callbacks.

### Synchronous callbacks

Synchronous callbacks are called as part of the triggering SQL command. Synchronous callbacks block the calling SQL command. If the callback returns an error, the command will return an error,
and the callback’s error message is returned as part of the SQL error message of the command.

Synchronous callbacks run in a warehouse, so the calling procedure must have a session warehouse set.

### Asynchronous callbacks

Asynchronous callbacks run in the background, after the calling SQL command completes. Asynchronous callbacks do not block the calling SQL command, and errors in asynchronous callbacks are not returned by the calling command.

To ensure that an asynchronous callback has the most current information, the callback signature doesn’t provide state or status information. Instead, the callback should retrieve the most current information using the appropriate SQL commands, such as [SHOW CONFIGURATIONS](../../sql-reference/sql/show-configurations.md) or [SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md). See the description for each asynchronous callback for more information.

The return value from asynchronous callbacks is ignored.

> **Caution:**
>
> The execution order of asynchronous callbacks is not guaranteed.
> Your app should not rely on the order of asynchronous callbacks to perform its operations.

### Permissions

The callback procedures listed in this topic are not required to be granted to any application role. The procedure can be internal to the app, and does not need to be runnable by the consumer. The Snowflake Native App Framework triggers the callback.

### Specification vs. connection callbacks

Both after_specification_change and
after_server_connection_change are run when
a specification is approved or refused. The differences between the two callbacks are
as follows:

* after_specification_change is part of the
  application specification framework.
  It is only triggered when the consumer approves or refuses a specification request.
* after_server_connection_change is part of the
  inter-app communication framework.
  It is triggered by any operation that directly or indirectly impacts the connection state of application specification, including the following:

  + Approving a specification
  + Refusing an approved specification
  + Dropping an approved specification
  + Dropping the server app

Use after_server_connection_change when your app needs to respond to changes in the connection itself, such as a connection being established, lost, or the server app being deleted. This callback provides better connection tracking because it covers a broader range of events than specification approval alone.

Use after_specification_change when your app only needs to respond to the approval or refusal of a specification request, or when handling application specification types other than `CONNECTION`.

## Callback reference

The following categories of callbacks are provided for Snowflake Native Apps:

* Configuration callbacks
* Connection callbacks
* Specification callbacks

### Configuration callbacks

These callbacks are triggered when a [configuration](app-configuration.md) changes.

* validate_configuration_change
* before_configuration_change
* after_configuration_change

#### validate_configuration_change

This callback is a synchronous callback.

This callback is called as part of `ALTER APPLICATION SET CONFIGURATION VALUE` command.
This callback lets the app perform additional custom validation on the value provided by the
server app. If the callback fails, such as with a syntax error, or if the callback returns and error, the set command fails and the new value is not set.

##### Signature

```sqlexample
validate_configuration_change(configuration_name, configuration_value)
```

##### Parameters

* `configuration_name`: The name of the configuration object.
* `configuration_value`: The value provided by the server app.

##### Return value

The callback must return a string in the following JSON format to indicate a validation success or error.

```json
{
  "type": "SUCCESS | ERROR",
  "payload":{
      "error_message": "Error message indicating the validation failure"
  }
}
```

If the function returns a `type` of `ERROR`, the error message is returned as part of the SQL
error message of the `SET` command. If the function returns a `type` of `SUCCESS`, the
error message is ignored.

#### before_configuration_change

This callback is a synchronous callback.
This callback is called as part of `ALTER APPLICATION SET CONFIGURATION VALUE`
and `ALTER APPLICATION UNSET CONFIGURATION` commands. This callback lets the app
perform further operations based on the configuration value set. The value passed into
the callback is null for the `ALTER APPLICATION UNSET CONFIGURATION` command.

##### Signature

```sqlexample
before_configuration_change(configuration_name, configuration_value)
```

##### Parameters

* `configuration_name`: The name of the configuration object.
* `configuration_value`: The value provided by the server app.

##### Return value

The return value of the callback is ignored.

#### after_configuration_change

This callback is an asynchronous callback.
This callback is called after the `ALTER APPLICATION SET CONFIGURATION VALUE`
and `ALTER APPLICATION UNSET CONFIGURATION` commands complete. This callback lets the
client app be notified when a value is provided by the server app.

##### Signature

```sqlexample
after_configuration_change(configuration_name)
```

##### Parameters

* `configuration_name`: The name of the configuration object.

##### Retrieving the latest state

In the callback, the following code snippet can be used to retrieve the current status and value of the configuration:

```python
session.sql(f"""
  SHOW CONFIGURATIONS ->>
      SELECT "status", "value"
      FROM $1
      WHERE "name" = '{configuration_name}';
  """);
```

### Connection callbacks

These callbacks are triggered when a connection’s status changes.

* after_server_connection_change
* after_client_connection_change
* after_server_version_change

#### after_server_connection_change

This callback is an asynchronous callback.
This callback is triggered by any operation that directly or indirectly impacts the connection state of application specification, including the following:

* Approving a specification
* Refusing an approved specification
* Dropping an approved specification
* Dropping the server app

##### Signature

```sqlexample
after_server_connection_change(server_name)
```

##### Parameters

* `server_name`: The name of the server app for which the connection has been changed.

##### Retrieving the latest state

In the callback, the following code snippet retrieves the current
connection status to the server app:

```python
session.sql(f"""
  SHOW SPECIFICATIONS ->>
  SELECT "status"
  FROM $1
  WHERE PARSE_JSON("definition"):"SERVER_APPLICATION"::STRING = '{server_name}';
  """);
```

#### after_client_connection_change

This callback is an asynchronous callback.
This callback is triggered by any operation that directly or indirectly impacts the connection state of application specification, including the following:

* Approving a specification
* Refusing an approved specification
* Dropping an approved specification
* Dropping the client app

##### Signature

```sqlexample
after_client_connection_change(client_name)
```

##### Parameters

* `client_name`: The name of the client app for which the connection has been changed.

##### Retrieving the latest state

In the callback, the following code snippet retrieves what roles, if any,
have been granted to the client app:

```python
session.sql(f"""
  SHOW GRANTS TO APPLICATION {client_name} ->>
  SELECT "name"
  FROM $1
  WHERE "granted_on" = 'APPLICATION_ROLE'
      AND STARTSWITH("name", CURRENT_DATABASE())
  """);
```

#### after_server_version_change

This callback is an asynchronous callback.
This callback is called after the server app’s version or patch number changes.
This lets the client app react to the upgrade or downgrade.

##### Signature

```sqlexample
after_server_version_change(server_name)
```

##### Parameters

* `server_name`: The name of the server app for which the version has changed.

##### Retrieving the latest state

In the callback, the following code snippet can be used to retrieve the current
version of the server app:

```python
session.sql(f"""
  SHOW APPLICATIONS ->>
  SELECT "version", "patch"
  FROM $1
  WHERE "name" = {server_name}
  """);
```

### Specification callbacks

The callback is triggered when a specification of any type has a status change

* `after_specification_change`

#### after_specification_change

This callback is an asynchronous callback.
This callback is called after the `ALTER APPLICATION APPROVE SPECIFICATION` or `ALTER APPLICATION DECLINE SPECIFICATION` commands complete. This callback
lets the app be notified when its specification status is changed.

This callback replaces the functionality of the `specification_action` callback.
You can only specify one of `after_specification_change` or `specification_action` in the manifest file.
For information about the `specification_action` callback, see [Using callback functions with app specifications](requesting-app-specs.md).

##### Signature

```sqlexample
after_specification_change(spec_name)
```

##### Parameters

* `spec_name`: The name of the application specification that has been approved or refused.

##### Retrieving the latest state

In the callback, the following code snippet can be used to retrieve the current
status of the specification:

```python
session.sql(f"""
  SHOW SPECIFICATIONS ->>
      SELECT "status"
      FROM $1
      WHERE "name" = '{spec_name}';
  """);
```

## Callback history

Use the following SQL function and Account Usage view to monitor callback invocations for your Snowflake Native Apps:

* [APPLICATION_CALLBACK_HISTORY](../../sql-reference/functions/application_callback_history.md) (Information Schema table function): Returns the history of callback invocations for applications in the current account. Use this table function to query callback history for a specific application or callback.
* [APPLICATION_CALLBACK_HISTORY view](../../sql-reference/account-usage/application_callback_history.md) (Account Usage view): Provides a history of callback invocations for applications in the account. Use this view to analyze callback activity across all applications in the account.

---
title: Choosing SDK components
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/choosing_components.md
section: Native Apps Framework
---

# Choosing SDK components

The connectors native SDK consists of multiple components, some of them are independent and some of them depend on each other to work.
This section explains how to customize which components will be turned on in the connector.
Additionally, each component will be shortly described and their dependencies will be mentioned.

## Enabling/disabling components

Components are enabled and disabled on the Snowflake database objects level. This means that the executed `setup.sql`
file is the source of truth on what was enabled or disabled. For the first time users it is recommended to use the `all.sql` file provided by the SDK.
This file includes all of the basic features from the SDK (except Task Reactor).

To do so simply put the following line in the `setup.sql` file of the connector:

```sqlexample
EXECUTE IMMEDIATE FROM 'native-connectors-sdk-components/all.sql';
```

For more experienced users it is possible to customize enabled and disabled features.
To do so add and remove `EXECUTE IMMEDIATE` statements as needed.
Keep in mind that excluding a file which is required by the feature will break it.

```sqlexample
-- Core connector objects
EXECUTE IMMEDIATE FROM 'core.sql';

-- Connector configuration prerequisites
EXECUTE IMMEDIATE FROM 'prerequisites.sql';

-- Connector configuration flow
EXECUTE IMMEDIATE FROM 'configuration/app_config.sql';
EXECUTE IMMEDIATE FROM 'configuration/connector_configuration.sql';
```

## Components

The sections below contain a list of the connectors native SDK components
with short descriptions and a list of required other components for each of them.
For more information, see [The Snowflake Native SDK for Connectors reference](../reference/overview.md).

### Core component

The core component is responsible for creating basic objects for the connector like schemas,
roles and persistence layer for the internal status of the application.

#### Dependencies

This component has no dependencies to other components.

### Application configuration component

The application configuration component is a persistence layer for storing and reading
the internal configuration of the application.

#### Dependencies

This component has no dependencies to other components.

### Prerequisites component

Prerequisites are an optional part of the wizard.
It supports informing the end user about configurations and initial setup that needs to be satisfied,
usually outside of the connector itself.

#### Dependencies

* Core component

### Connector configuration component

The connector configuration is a wizard step responsible for configuring common
connector properties like: sink database, data owner role, warehouse etc.

#### Dependencies

* Core component
* Application configuration component

### Connection configuration component

The connection configuration is a wizard step responsible for configuring
the properties related to the communication with the external source system for the connector,
for example authentication and authorization properties and methods.

#### Dependencies

* Core component
* Application configuration component

### Finalize configuration component

The finalize connector is a wizard step responsible for performing final connection checks to the external source system and connector specific configurations.

#### Dependencies

* Core component
* Recommended: Application configuration component

### Pause/resume component

The pause/resume component provides the option of pausing and resuming the connector whenever desired to stop the credit consumption.

#### Dependencies

* Core component
* Recommended: Application configuration component
* Recommended: Finalize configuration component

### Ingestion component

The ingestion component provides abstraction and persistence to define the data that will be put into Snowflake from the external source system.

#### Dependencies

This component has no dependencies to other component, however requires multiple sql files to be executed.

### Scheduler component

The scheduler component allows provides a mechanism of triggering tasks inside a connector
according to the configuration using Snowflake tasks underneath.

#### Dependencies

* Core component
* Application configuration component
* Connector configuration component

### Connector stats component

The connectors stats component provides useful views to see the metadata from the performed ingestion tasks.
It is useful to monitor how much data is flowing through the connector.

#### Dependencies

* Ingestion component

### Sync status component

The sync status component provides a view to quickly check when was the last data sync.

#### Dependencies

* Ingestion component
* Connector stats component

### Task reactor component

The task reactor is a component that provides a mechanism to queue work items and spread them between a number of worker tasks.
The number of workers can be changed to allow for more of them when there are huge workloads.

#### Dependencies

This component has no dependencies to other components.

---
title: Common Vulnerabilities and Exposures (CVE) considerations
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-cve.md
section: Native Apps Framework
---

# Common Vulnerabilities and Exposures (CVE) considerations

This topic describes how Snowflake applies Common Vulnerabilities and Exposures (CVE) criteria to
a Snowflake Native App.

## About the CVE for a Snowflake Native App

Common Vulnerabilities and Exposures (CVEs) are publicly disclosed information about security
vulnerabilities in software applications and systems. These vulnerabilities can potentially be
exploited, compromising the security of affected applications.

In the context of a Snowflake Native App, providers must address CVEs to ensure the secure execution of
these applications within Snowflake’s data cloud environment. This is necessary to protect the data
and operations of Snowflake customers. During the security review of a Snowflake Native App, Snowflake scans
all incoming apps for known CVEs.

> **Warning:**
>
> It is possible that not all CVEs are detected. Also, CVEs may not present the same level of risk
> or may not be actionable by Snowflake.

The purpose of Snowflake’s CVE Evaluation Criteria is to establish a set of clear and objective criteria
for evaluating and addressing known CVEs in an app submitted to Snowflake. By defining these criteria, Snowflake
prioritizes and mitigates critical security risks, while accounting for the effort required to address less
severe vulnerabilities. This policy guides the process for accepting or rejecting an app based on the CVE risk
profile.

This CVE policy applies to all incoming apps that undergo Snowflake’s security review process. It covers how
CVEs that are identified in the packages and dependencies of an app are evaluated and addressed. This policy
is enforced during the security review process, as documented in [Run the automated security scan](security-run-scan.md).

This process ensures that only apps that meet the defined criteria are approved for publishing and distribution to
consumers within Snowflake’s data cloud environment.

## CVE Evaluation Criteria

Snowflake uses the following three criteria to evaluate known vulnerabilities (CVEs) in
a Snowflake Native App and review each CVE:

* The CVE has a confirmed fix
* The CVE has a high integrity impact
* The CVE has an EPSS score of 10 percent or higher

By considering these three criteria, Snowflake decides which CVEs pose the most significant risks and
require immediate fixes within an app. An app is rejected if it contains any packages with a CVE that meets the criteria below or is not
appropriately remediated.

### The CVE has a confirmed fix

Snowflake provides actionable information and reports only on CVEs that have a confirmed fix according
to the National Vulnerability Database (NVD). This ensures that the identified vulnerabilities have a
known and available solution, enabling developers to address them effectively.

### The CVE has a high integrity impact

Snowflake focuses on CVEs with a high integrity impact, as defined by the Common Vulnerability Scoring
System (CVSS). A high integrity impact indicates a total loss of integrity or complete loss of protection,
allowing unauthorized modifications of data and/or data tampering without any constraints. Providers must
address these CVEs to ensure the security and reliability of our data cloud environment.

### The CVE has an EPSS score of 10 percent or higher

The Exploit Prediction Scoring System (EPSS) provides an estimate of how likely a software
vulnerability will be exploited based on factors including the age, complexity, and potential
impact of the vulnerability.

Snowflake rejects an app if it has an EPSS score of ten percent or higher. This threshold is determined
based on the analysis of data from current apps. This threshold allows Snowflake to prioritize and address
vulnerabilities that have a higher probability of being exploited, while maintaining a reasonable level of
risk tolerance.

## Additional information

The following links provide more information about the processes and policies Snowflake uses
when evaluating the CVE vulnerabilities for an app:

* [NVD Vulnerabilities](https://nvd.nist.gov/vuln)
* [Vulnerability metrics](https://nvd.nist.gov/vuln-metrics/cvss)
* [Exploited Protection Scoring System](https://nvd.nist.gov/vuln-metrics/cvss)
* [Enhancing Vulnerability Prioritization](https://arxiv.org/abs/2302.14172)

---
title: Configure event definitions for an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/event-definition.md
section: Native Apps Framework
---

# Configure event definitions for an app

This topic describes how to define event definitions in the manifest file of an app. Event definitions
define which log messages and trace events are shared with a provider.

## About event definitions

Event definitions specify how an app shares log messages and trace events with the provider.
Event definitions act as filters on the log message and trace event levels set by the provider.
A provider specifies the event definitions for an app when a new app version or patch is published.

Event definitions are filters that act on the log messages and trace events. They determine what
information is inserted in the provider event table when event sharing is enabled.

Event definitions are optional. If a provider does not specify event definitions for an app,
consumers can only enable or disable event sharing for all events when the provider enables event tracing.

> **Caution:**
>
> Event definitions differ from the log and tracing levels set by the provider. Log and
> tracing levels determine the information that is inserted into the consumer event table. If neither
> the log nor tracing levels are set, then the app does not emit any events.
>
> The log and trace levels for an app can change based on the event definitions enabled by the consumer.
> Snowflake uses the most verbose log and trace levels allowed by the event definitions
> the consumer has enabled.

## Mandatory and optional event definitions

Providers can set an event definition to be required or optional:

* Required event definitions are enabled automatically when the app is installed.

  After installing an app with required event definitions, consumers cannot disable event sharing or
  the required event definitions. When an app is being upgraded, providers can use system functions or
  the Python Permission SDK to check if the consumer has enabled all required event definitions.
* Optional event definitions can be enabled or disabled by the consumer as necessary.

## Supported event definitions

The following table lists currently supported event definitions.

| Type | Name | Description | Filter |
| --- | --- | --- | --- |
| All | SNOWFLAKE$ALL | Shares all log messages and trace events that the app emits. | `*` |
| Events | SNOWFLAKE$ALL_EVENTS | Shares all events from the application. | `RECORD_TYPE='EVENT'` |
| Errors and warnings | SNOWFLAKE$ERRORS_AND_WARNINGS | Shares logs related to errors, warnings, and fatal events. | `RECORD_TYPE = ‘LOG’ AND RECORD:severity_text in (‘FATAL’, ‘ERROR’, ‘WARN’)` |
| Traces | SNOWFLAKE$TRACES | Shares detailed traces of user activities and journeys in the application. | `RECORD_TYPE in (‘SPAN’, ‘SPAN_EVENT’)` |
| Usage logs | SNOWFLAKE$USAGE_LOGS | Shares high-level logs related to user actions and app events. | `RECORD_TYPE = LOG AND RECORD:severity_text = ‘INFO’` |
| Debug logs | SNOWFLAKE$DEBUG_LOGS | Shares technical logs used to troubleshoot the app. | `RECORD_TYPE = ‘LOG’ AND RECORD:severity_text in (‘DEBUG’, ‘TRACE’)` |
| Metrics | SNOWFLAKE$METRICS | Enable consumers to share metrics with providers. | `RECORD_TYPE  in (‘METRIC’)` |

> **Note:**
>
> Snowsight only displays the all event All type to the consumer if the provider has not configured the app to
> use event definitions.

## Limitations of event definitions in apps with containers

Snowflake Native Apps with Snowpark Container Services currently only supports the `ALL` event definition. Support for additional
event definitions will be added in a future release.

## Set the log and trace levels for an app

To allow an app to use event tracing, a provider must configure the log and trace levels
in the manifest file.

To set the log and trace levels for an app, add a `configuration` block in the manifest file as shown in the following example:

```yaml
configuration:
  ...
  log_level: INFO
  trace_level: ALWAYS
  metric_level: ALL
  log_event_level: INFO
  ...
```

This example sets the log and trace levels for the app as follows:

* The `log_level` property is set to `INFO`.
* The `trace_level` property is set to `ALWAYS`.
* The `metric_level` property is set to `ALL`.
* The `log_event_level` property is set to `INFO`.

See [LOG_LEVEL](../../sql-reference/parameters.md), [TRACE_LEVEL](../../sql-reference/parameters.md), [METRIC_LEVEL](../../sql-reference/parameters.md), and
[LOG_EVENT_LEVEL](../../sql-reference/parameters.md) for information on the valid values for these parameters.

> **Caution:**
>
> After you publish an app, the log and trace levels cannot be changed. If the log and trace levels
> are not set in the manifest file, the app does not emit any information.

When the log and trace levels are set for an app, consumers must set up an event table in their account
to see the log messages and trace events that the app emits.

To allow the provider to see the log messages and trace events that an app generates, consumers must
enable event sharing. See [Enable event sharing for an app](event-about.md)
for more information.

## Add an event definition to the manifest file

To specify an event definition, a provider adds an entry to the
`configuration.telemetry_event_definitions` block of the manifest file as shown in the
following example:

```yaml
configuration:
  telemetry_event_definitions:
    - type: ERRORS_AND_WARNINGS
      sharing: MANDATORY
    - type: DEBUG_LOGS
      sharing: OPTIONAL
```

This example specifies the following event definitions:

* A required event definition with type `ERRORS_AND_WARNINGS`.
* An optional event definition with type `DEBUG_LOGS`.

See Supported event definitions for more information.

After a consumer installs an app, the event definitions appears in the Events and logs tab on the
Security page of the app. See
[Enable logging and event sharing for an app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging)
for more information.

## Set the log, trace, and metric levels for specific objects

Providers may fine-tune the log, trace, and metric levels for specific objects within an app. This
gives providers more control over the telemetry data emitted by the app.

Providers can set the log, trace, and metric levels for the following objects within an app:

* Schemas
* Versioned schemas
* Stored procedures
* User-defined functions

The following table lists the SQL commands used to set the log, trace, and event levels for
these objects:

| Object | Command |
| --- | --- |
| Schemas | [ALTER SCHEMA](../../sql-reference/sql/alter-schema.md) |
| Versioned schema | [CREATE OR ALTER VERSIONED SCHEMA](../../sql-reference/sql/create-versioned-schema.md) |
| Stored procedures | [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) |
| User-defined functions | [ALTER FUNCTION](../../sql-reference/sql/alter-function.md) |

For schemas, stored procedures, and user-defined functions, providers can use the `SET` clause
of the ALTER commands to set the following properties:

* LOG_LEVEL
* TRACE_LEVEL
* METRIC_LEVEL

For versioned schemas providers can set these properties using
[CREATE OR ALTER VERSIONED SCHEMA](../../sql-reference/sql/create-versioned-schema.md) in the setup script.

## Order of precedence for log, trace, and metric levels

Within an app, the log, trace, and metric levels can be configured in different ways
for components of the app. To determine the events that are emitted, the Snowflake Native App Framework uses the
following order of precedence:

* Stored procedures and user-defined functions

  If an override is set for the specific stored procedure or user-defined function,
  it takes precedence.
* Schemas and version schemas

  If no overrides are set for stored procedures or user-defined functions, overrides
  for schemas and versioned schemas take precedence.
* App-level settings

  If no object-level overrides are found, the app-level telemetry configuration, typically defined
  in the manifest file, is used.

---
title: Configure the privileges required by an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-auto-privs.md
section: Native Apps Framework
---

# Configure the privileges required by an app

This topic describes how to use automated granting of privileges to request the privileges
from a consumer when installing or upgrading a Snowflake Native App.

## Overview of automated granting of privileges

Often, an app needs to create objects or perform other actions in the consumer
account. This requires the consumer to grant the required privileges that allow the app
to perform these actions. For example, apps must have privileges to perform the following
types of tasks:

* Create and start warehouses and compute pools.
* Access data in the consumer account.
* Connect to external endpoints outside of Snowflake.

By using automated granting of privileges, providers can specify the required privileges in the
manifest file of an app. When the consumer installs or upgrades an app, the privileges specified
in the manifest are automatically granted to the app.

> **Caution:**
>
> The provider must communicate these privileges and their potential impact so that they are
> visible to the consumer when evaluating and installing the app. After privileges are
> automatically granted during installation or upgrade, these privileges cannot be revoked.

## Security considerations when using automated granting of privileges

When a provider configures an app to use
`manifest_version: 2` in the manifest file, automated granting of
privileges is enabled. By default this allows Snowflake to automatically
grant certain privileges to the app. For information on the privileges
that can be automatically granted to the app, see
Privileges granted by automated granting of privileges.

During installation, Snowsight displays a notification about
the privileges requested by the app. When a consumer installs an app
that uses automated granting of privileges, they agree that the app may
be granted these privileges during upgrades without requiring additional
consent.

Consumers can create feature policies that restrict the objects an app
can create. For more information on creating feature policies, see
[Use feature policies to limit the objects an app can create](ui-consumer-feature-policies.md).

## Request privileges for an app using automated granting of privileges

Providers can use automated granting of privileges to specify the privileges an app needs
to create and use objects in the consumer account. Automated granting of privileges grants the
required privileges to the app when the consumer installs or upgrades the app.

### Set the version of the manifest file

To enable automated granting of privileges for an app, set the version at the beginning of the
manifest file as shown in the following example:

```yaml
manifest_version: 2
```

### Specify the privileges in the manifest file

To specify the privileges required by the app, providers must declare them in the manifest file
of the app.

> **Note:**
>
> To use automated granting of privileges, providers must specify `manifest_version: 2`.

The following example shows how to specify the CREATE WAREHOUSE privilege in the manifest file:

```yaml
manifest_version: 2
...
privileges:
  - CREATE WAREHOUSE:
    description: "Allows the app to create warehouses in the consumer account"
```

When a consumer installs the app, the CREATE WAREHOUSE privilege is automatically granted to the app.

> **Caution:**
>
> If a provider changes the `manifest_version` property of the manifest file from `2` to `1`,
> all automatic privileges are revoked from the app during upgrade. If the consumer has explicitly
> granted privileges to the app, those privileges remain unchanged.

> **Note:**
>
> Providers can only change the `manifest_version` property during major upgrades to a new
> version of the app. The `manifest_version` cannot be changed in a patch release.

### Create the required objects in the setup script

Using automated granting of privileges, providers can add the SQL commands to the setup script to create and access objects in the consumer account.

The following example shows how to create a warehouse in the consumer account:

```sqlexample
CREATE OR REPLACE WAREHOUSE application_wh;
```

This command creates a warehouse named `application_wh` in the consumer account. The
automated granting of privileges feature allows the app to create the warehouse directly. The
provider does not have to add additional logic to check whether the consumer has granted the
required privileges.

## Privileges granted by automated granting of privileges

The following privileges are supported by automated granting of privileges:

* EXECUTE TASK
* EXECUTE MANAGED TASK
* CREATE WAREHOUSE
* CREATE COMPUTE POOL
* BIND SERVICE ENDPOINT
* CREATE DATABASE
* CREATE EXTERNAL ACCESS INTEGRATION
* CREATE SECURITY INTEGRATION
* CREATE SHARE
* CREATE LISTING

When a provider adds these privileges to the manifest file, they are automatically granted to
the app during installation and upgrade.

### Restrictions on privileges gated by app specifications

The following privileges allow apps to create objects, but require additional app specification approval:

CREATE EXTERNAL ACCESS INTEGRATION
:   Allows an app to create an external access integration in the consumer account. However, to allow
    connections to an external endpoint, consumers must also approve the app specification that allows
    the app to connect to external hosts.

CREATE SECURITY INTEGRATION
:   Allows an app to create a security integration in the consumer account. However, to enable OAuth
    authentication, consumers must also approve the app specification that defines the OAuth endpoints
    and scopes.

CREATE SHARE and CREATE LISTING
:   Allow an app to create shares and listings in the consumer account. However, to share data with
    target accounts, consumers must also approve the app specification that specifies the target accounts
    and auto-fulfillment settings.

For more information about app specifications, see [Overview of app specifications](requesting-app-specs.md).

### Privileges not granted by automated granting of privileges

Some privileges are not automatically granted to the app. Consumers must manually grant these
privileges when installing or upgrading an app. For example, the following privileges aren’t automatically granted to the app:

* MANAGE WAREHOUSES
* IMPORTED PRIVILEGES ON SNOWFLAKE DB
* READ SESSION
* EXECUTE ALERT

## Using automated granting of privileges during upgrades

When publishing a new version of an app, you might need to add or remove the privileges
required by the app. The setup script of the new version or patch runs with both the new auto
privileges specified in the manifest and the privileges required by the previous version. Any
excess privileges that are removed in the new version are revoked when the app upgrade is complete.

To ensure stability during upgrades, when the version of the manifest file is set to `2`, the
list of requested privileges in the manifest file cannot be modified as part of a patch. This
prevents providers from unintentionally breaking apps by removing required privileges in a patch.

---
title: Connection configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/connection_configuration.md
section: Native Apps Framework
---

# Connection configuration

Connection configuration is a wizard step that comes directly after the connector configuration. This step allows the user
to specify properties required for establishing a connection with the source system to start ingesting data into Snowflake.
The procedure called `PUBLIC.SET_CONNECTION_CONFIGURATION(connection_configuration VARIANT)` is the entry point responsible
for this wizard phase. This procedure can be called by the UI or from the worksheet. When overwriting with custom logic,
this procedure needs to be replaced, to specify the custom Java handler.

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The connection configuration step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Status validation
2. Input validation
3. Configuration update
4. Internal callback
5. Connection validation
6. Status update

## Requirements

Connection configuration requires at least the following SQL files to be executed during native app installation:

* `core.sql` (See: [Core SQL reference](../reference/core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](../reference/app_config_reference.md))
* `configuration/connection_configuration.sql` (See: [Connection configuration reference](../reference/connection_configuration_reference.md))

In addition there is a requirement dependent on the SDK user:

* custom implementation of `PUBLIC.TEST_CONNECTION()` procedure

### Status validation

To perform connection configuration the internal status of the connector needs to be `CONFIGURING`, with configuration status: `CONFIGURED` or `CONNECTED`.
The first of the configuration statuses will be set directly after the connector configuration step,
the latter one will be present if for some reason Connection Configuration has to be updated during later steps.

This validation cannot be overwritten by using `ConnectionConfigurationHandlerBuilder` nor by overwriting a stored procedure.
However, it is possible to implement a custom handler, which will not have this kind of validation.

### Input validation

Input needs to be a `variant` containing a map of properties, however this might not work for all cases. For that reason the SDK provides
an internal stored procedure called: `PUBLIC.SET_CONNECTION_CONFIG_VALIDATE(config VARIANT)`. By default,
this procedure just returns `'responseCode': 'OK'`, overwriting it can update the provided config during validation.
This feature enables for custom logic. For example, trimming the input or conversion to upper/lower case.
To return config transformed in any way the response needs to contain an additional `"config"` property in the response `Variant`,
this property should contain the updated config as `Variant`.
The procedure can be customized by overwriting through the SQL or by using `ConnectionConfigurationHandlerBuilder` and providing custom implementation of the
`ConnectionConfigurationInputValidator` interface.

The following is a valid response from the custom implementation with transformation:

```json
{
    "response_code" : "OK",
    "config": {
        "key1": "value1",
        "key2": "value2"
    }
}
```

### Configuration update

Once the validations are passed successfully, configuration will be saved to the internal `APP_CONFIG` table.
The service responsible for this saves the provided `Variant` under the `connection_configuration` key.
This configuration does not follow any additional requirements when saving,
the set of provided properties is up to the user.

### Internal callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.SET_CONNECTION_CONFIGURATION_INTERNAL(connection_configuration VARIANT)`,
which returns `'response_code': 'OK'`. For example, it can be used to alter other procedures by granting them external access integration.
It can be overwritten through the SQL script or by using a `ConnectionConfigurationHandlerBuilder` to provide custom implementation of the `ConnectionConfigurationCallback` interface.

### Connection validation

This step triggers a `PUBLIC.TEST_CONNECTION` procedure. This procedures tries to query the source system for the data.
This procedure is not implemented by default and needs to be provided by the SDK user. Additionally, `ConnectionValidator` interface
implementation can be provided to the `ConnectionConfigurationHandlerBuilder` to customize this phase, in this case,
there is no need to implement a stored procedure. The recommendation is
to perform just a minimal connectivity check in this procedure to ensure that external
access capabilities of Snowflake were configured correctly
and the Connector has all required privileges to use them.

### Status update

When all the above phases are completed successfully the internal status of the connector will be updated to:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "CONNECTED"
}
```

For the whole diagram of state transitions, see [Connector flow](overview.md).

### Viewing the configuration

There is a `PUBLIC.GET_CONNECTION_CONFIGURATION()` procedure available to the `ADMIN` and `VIEWER` users that
returns current connection configuration from the internal table.

### Response

#### Successful response

If the procedure finishes successfully it will return a response from `TEST_CONNECTION` procedure. We recommend using the following format:

> ```json
> {
>   "response_code": "OK"
> }
> ```

#### Error response

In case of an error the response will follow the below format:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - Invalid connector status. Expected status: `[CONFIGURING]`
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - Invalid connector configuration status. Expected status: `CONFIGURED`
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive
* `PROCEDURE_NOT_FOUND` - Procedure which was called does not exist. In this case it’s about `TEST_CONNECTION` procedure mostly
* `UNKNOWN_SQL_ERROR` - This error occurs when something unexpected happen when calling internal procedures
* `INVALID_RESPONSE` - This error occurs when response received from internal procedure does not contain `response_code` or an error response does not contain `message`, but contains `response_code`
* `UNKNOWN_ERROR` - It means that something unexpected went wrong - message of thrown exception is forwarded
* Custom error codes received from `TEST_CONNECTION()` procedure - defined by connector developer
* Custom error codes received from `SET_CONNECTION_CONFIGURATION_INTERNAL()` procedure - defined by connector developer

---
title: Connection configuration reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/connection_configuration_reference.md
section: Native Apps Framework
---

# Connection configuration reference

## Database objects and procedures

The following database objects are created through the file `configuration/connection_configuration.sql`.

### PUBLIC.SET_CONNECTION_CONFIGURATION (connection_configuration VARIANT)

Entry point procedure available to `ADMIN` role. This procedure invokes the Java function [ConnectionConfigurationHandler.setConnectionConfiguration()](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandler.md).

### PUBLIC.SET_CONNECTION_CONFIGURATION_VALIDATE (connection_configuration VARIANT)

Procedure used for Connector specific validation of the configuration. It can also be used to transform some parts of the configuration.
Transformed configuration needs to be returned as additional `"config"` property. By default, it returns `'response_code': 'OK'`.
It is invoked by the `DefaultConnectionConfigurationInputValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.SET_CONNECTION_CONFIGURATION_INTERNAL (connection_configuration VARIANT)

Procedure used for Connector specific additional connection configuration, for example adding external access integration to other procedures.
By default, it returns `'response_code': 'OK'`. It is invoked by the `InternalConnectionConfigurationCallback`. Can be overwritten both in SQL and Java.

### PUBLIC.GET_CONNECTION_CONFIGURATION()

A procedure to retrieve current connection configuration from the internal table. It is available to `ADMIN` and `VIEWER` users.

## Related tables and views

Connector configuration is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))

### PUBLIC.TEST_CONNECTION()

This procedure is not provided by default in any file, but is necessary for the `Connection Configuration` feature.
This procedure will be used as a light weight way to check access to the external source system.

## Related Java objects

The following Java objects from the [com.snowflake.connectors.application.configuration.connector](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/package-summary.md) package and some common components are tightly connected with the above procedures:

* [ConnectionConfigurationHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandler.md)
* [ConnectionConfigurationInputValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationInputValidator.md)
* [ConnectionValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionValidator.md)
* [ConnectorConfigurationService](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConnectorConfigurationService.md)
* [ConnectionConfigurationHandlerBuilder](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandlerBuilder.md)
* [ConnectorErrorHelper](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/common/exception/helper/ConnectorErrorHelper.md)

## Custom handler

Handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of the [ConnectionConfigurationHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandler.md) the PUBLIC.SET_CONNECTION_CONFIGURATION procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.SET_CONNECTION_CONFIGURATION(config VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomConnectionConfigurationHandler.setConnectionConfiguration';

GRANT USAGE ON PROCEDURE PUBLIC.CONFIGURE_CONNECTOR(VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal `VALIDATE` and `INTERNAL` procedures can be also customized through the SQL. They can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.SET_CONNECTION_CONFIGURATION_INTERNAL(config VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK', '"config"', '"transformed config variant"');
  END;

CREATE OR REPLACE PROCEDURE PUBLIC.SET_CONNECTION_CONFIGURATION_VALIDATE(config VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomConnectionConfigurationValidateHandler.setConnectionConfiguration';
```

### Builder approach

[ConnectionConfigurationHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandler.md) can be customized using [ConnectionConfigurationHandlerBuilder](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationHandlerBuilder.md). This builder allows user to provide custom implementations of the following interfaces:

* [ConnectionConfigurationInputValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationInputValidator.md)
* [ConnectionConfigurationCallback](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionConfigurationCallback.md)
* [ConnectionValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connection/ConnectionValidator.md)
* [ConnectorErrorHelper](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/common/exception/helper/ConnectorErrorHelper.md)

In case one of them is not provided the default implementation provided by the SDK will be used.

```java
class CustomConnectionConfigurationInputValidator implements ConnectionConfigurationInputValidator {
  @Override
  public ConnectorResponse validate(Variant config) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.SET_CONNECTION_CONFIGURATION procedure using SQL
  public static Variant configureConnection(Session session, Variant configuration) {
    //Using builder
    var handler = ConnectionConfigurationHandler.builder(session)
      .withInputValidator(new CustomConnectionConfigurationInputValidator())
      .build();
    return handler.connectionConfiguration(configuration).toVariant();
  }
}
```

---
title: Connector configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/connector_configuration.md
section: Native Apps Framework
---

# Connector configuration

Connector configuration is the first required step of the wizard phase. It ensures that the connector
has the configuration of the objects common between all connector types, regardless of the actual
source system and domain. The procedure called `PUBLIC.CONFIGURE_CONNECTOR(config VARIANT)`
is the entry point from the UI or worksheet to do so. When overwriting with custom logic, keep in mind that,
this procedure needs to be replaced, because it points to the `ConfigureConnectorHandler.configureConnector` static method in Java as a handler.

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The connector configuration step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Status validation
2. Fields validation
3. Input validation
4. Configuration update
5. Internal callback
6. Status update

## Requirements

The connector configuration requires at least the following SQL files to be executed during native app installation:

* `core.sql`
* `configuration/app_config.sql`
* `configuration/connector_configuration.sql`

### Status validation

To perform the connector configuration, the internal status of the connector needs to be `CONFIGURING`.
This validation cannot be overwritten by using `ConfigureConnectorHandlerBuilder` nor by overwriting a stored procedure. However,
it is possible to implement a custom handler, which will not have this kind of validation.

### Fields validation

The connector configuration needs to contain a set of specific fields. All of them are optional, but any other field causes an exception to be thrown.
The allowed keys are:

* `warehouse`
* `operational_warehouse`
* `cortex_warehouse`
* `destination_database`
* `destination_schema`
* `global_schedule`
* `data_owner_role`
* `cortex_user_role`
* `agent_username`
* `agent_role`

#### Warehouse

Warehouse is used by the Connector to run the scheduler, execute tasks and run queries.

#### Operational_warehouse

Occasionally, the connector has a need to use a separate warehouse for performing ingestion work.
A separate warehouse will allow the connector to split the ingestion operations from the main warehouse,
which is used for internal connector operations.

#### Cortex_warehouse

Occasionally, the connector has a need to use the Cortex AI. That use may require a separate warehouse
to split the operations from the main warehouse, which is used for internal connector operations.

#### Destination_database

The destination database is used to store the data ingested by the connector. This database should be outside
of the connector. It can be an existing database, however the connector needs to have write privileges on it.
It can be also a newly created database, however, this won’t happen automatically and has to be implemented as a part of the
internal callback during connector configuration or configuration finalization.

#### Destination_schema

The destination schema will be the schema used in the destination_database above.

#### Global_schedule

This property defines the running schedule for the scheduler task. Currently, the scheduler will only process resources with their own `scheduleType=GLOBAL`.
The value for this property should be similar to the one below:

```json
"global_schedule": {
    "scheduleType": "CRON",
    "scheduleDefinition": "*/10 * * * *"
}
```

#### Data_owner_role

Role that can be used to give ownership of the sync database for retaining the data upon connector un-installation.

#### Cortex_user_role

Role that can access the Cortex features available in the connector.

#### Agent_username

Username used by the push based connector’s agent when connecting with Snowflake.

#### Agent_role

Role used by the push based connector’s agent when connecting with Snowflake.

## Input validation

Input needs to be a valid `Variant`, In addition, the SDK provides
an internal stored procedure called: `PUBLIC.CONFIGURE_CONNECTOR_VALIDATE(config VARIANT)`. By default,
this procedure just returns `'response_code': 'OK'`, however it can be changed by overwriting this stored procedure.
Alternatively it can be customized using `ConfigureConnectorHandlerBuilder` and providing a custom implementation of the
`ConfigureConnectorValidator` interface.

## Configuration update

Once the validations are passed successfully configuration is saved to the internal `APP_CONFIG` table.
The service responsible for this saves the provided `Variant` under the `connector_configuration` key.

## Internal callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.CONFIGURE_CONNECTOR_INTERNAL(config VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `ConfigureConnectorHandlerBuilder` to provide custom implementation of the `ConfigureConnectorCallback` interface.

## Status update

When all the above phases are completed successfully the internal status of the Connector will be updated to:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "CONFIGURED"
}
```

For a diagram of state transitions, see [Connector flow](overview.md).

### Response

#### Successful response

If the procedure finishes successfully it will return a response in the following format:

> ```json
> {
>   "response_code": "OK",
> }
> ```

#### Error response

In case of an error the response will follow the below format:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - The procedure was called on already configured connector
* `CONNECTOR_CONFIGURATION_PARSING_ERROR` - Given configuration is not a valid JSON
* `CONNECTOR_STATUS_NOT_FOUND` - Connector status record does not exist in database
* `CONNECTOR_STATUS_PARSING_ERROR` - Value stored in table `APP_STATE` under `connector_status` key has incorrect format and cannot be parsed by the application

---
title: Connector configuration reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/connector_configuration_reference.md
section: Native Apps Framework
---

# Connector configuration reference

## Database objects and procedures

The following database objects are created through the file `configuration/connector_configuration.sql`.

### PUBLIC.CONFIGURE_CONNECTOR (config VARIANT)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function [ConfigureConnectorHandler.configureConnector](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandler.md).

### PUBLIC.CONFIGURE_CONNECTOR_VALIDATE (config VARIANT)

Procedure used for connector specific validation of the configuration. By default, it returns `'response_code': 'OK'`.
It is invoked by the `DefaultConfigureConnectorInputValidator` function. Can be overwritten both in SQL and Java.

### PUBLIC.CONFIGURE_CONNECTOR_INTERNAL (config VARIANT)

Procedure used for connector specific additional configuration. By default, it returns `'response_code': 'OK'`.
It is invoked by the `InternalConfigureConnectorCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Connector configuration is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.configuration.connector` package and some common components are tightly connected with the above procedures:

* [ConfigureConnectorHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandler.md)
* [ConfigureConnectorInputValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorInputValidator.md)
* [ConfigureConnectorCallback](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorCallback.md)
* [ConnectorConfigurationService](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConnectorConfigurationService.md)
* [ConfigureConnectorHandlerBuilder](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandlerBuilder.md)
* [ConnectorErrorHelper](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/common/exception/helper/ConnectorErrorHelper.md)

## Custom handler

Handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of the [ConfigureConnectorHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandler.md) the PUBLIC.CONFIGURE_CONNECTOR procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CONFIGURE_CONNECTOR(config VARIANT)
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.custom.handler.CustomConfigureConnectorHandler.configureConnector';

GRANT USAGE ON PROCEDURE PUBLIC.CONFIGURE_CONNECTOR(VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal `VALIDATE` and `INTERNAL` procedures can be also customized through the SQL. They can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CONFIGURE_CONNECTOR_INTERNAL(config VARIANT)
RETURNS VARIANT
LANGUAGE SQL
EXECUTE AS OWNER
AS
BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
END;

CREATE OR REPLACE PROCEDURE PUBLIC.CONFIGURE_CONNECTOR_VALIDATE(config VARIANT)
    RETURNS VARIANT
    LANGUAGE JAVA
    RUNTIME_VERSION = '11'
    PACKAGES = ('com.snowflake:snowpark:1.11.0')
    IMPORTS = ('/connectors-native-sdk.jar')
    HANDLER = 'com.custom.handler.CustomConfigureConnectorInternalHandler.configureConnector';
```

### Builder approach

[ConfigureConnectorHandler](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandler.md) can be customized using [ConfigureConnectorHandlerBuilder](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorHandlerBuilder.md). This builder allows user to provide custom implementations of the following interfaces:

* [ConfigureConnectorInputValidator](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorInputValidator.md)
* [ConfigureConnectorCallback](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/configuration/connector/ConfigureConnectorCallback.md)
* [ConnectorErrorHelper](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/common/exception/helper/ConnectorErrorHelper.md)

In case one of them is not provided the default implementation provided by the SDK will be used.

```java
class CustomConfigureConnectorInputValidator implements ConfigureConnectorInputValidator {
    @Override
    public ConnectorResponse validate(Variant config) {
        // CUSTOM LOGIC
        return ConnectorResponse.success();
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the PUBLIC.CONFIGURE_CONNECTOR procedure using SQL
    public static Variant configureConnector(Session session, Variant configuration) {
            //Using builder
        var handler = ConfigureConnectorHandler.builder(session)
            .withInputValidator(new CustomConfigureConnectorInputValidator())
            .build();
        return handler.configureConnector(configuration).toVariant();
    }
}
```

---
title: Connector flow
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/overview.md
section: Native Apps Framework
---

# Connector flow

This section describes how the lifecycle of a native connector application is organized from the user perspective.

The connector needs to store its internal state and configuration to be able to work properly.
That information is stored in the tables that are internal to the application.
To prevent accidental manual changes, they are hidden from the end user.
However, the native SDK for connectors provides several views that show the values stored inside them.

The most important tables internally are `STATE.APP_STATE` and `STATE.APP_CONFIG`.
Those tables are described in the below sections.

## Connector internal status

Connector status consists of the two parts called `status` and `configurationStatus`.
`Status` is the global high-level state that the connector is in, while `configurationStatus` can be considered a sub-status during the wizard phase.
The transitions between states are shown in the below diagram:

The left part of the above diagram shows the transitions of the `status`. The right part shows the transition of the `configurationStatus` during the wizard phase.
For every `configurationStatus`, the value of `status` is `Configuring`.

If at any point the connector *gets stuck* in a particular state - you may need to use the
[PUBLIC.RECOVER_CONNECTOR_STATE()](../reference/core_reference.md) procedure
or reinstall the connector to fix the issue.

## Wizard

The wizard phase is the initial phase after installing a connector. It guides the end user through all the needed configurations that need to be finished for the connector to be able to perform ingestion.
Steps of the wizard phase are represented by the `configurationStatus` and are shown in the above diagram in its right part.
This process consists of the following steps:

### Prerequisites

Optional step to ensure that all configurations outside the Connector itself are done. For example authentication in the source system.

More on [Prerequisites](prerequisites.md).

### Connector configuration

Configuration of the most crucial properties of the connector application, such as the warehouse, sink database, etc.

More on [Connector Configuration](connector_configuration.md).

### Connection configuration

Configuration of the properties required to connect to the external source system, for example authentication method, credentials etc.

More on [Connection Configuration](connection_configuration.md).

### Configuration finalization

Finalization is the last step of the wizard. It provides the means to perform any additional configurations custom to the connector.

More on [Configuration Finalization](finalize_configuration.md).

## Daily use

After the wizard phase is completed, the connector is ready to start ingesting data. Lifecycle operations are enabled.
The following options become available:

### Ingestion management

Ingestion management is the most important part of the connector. It defines what data should be ingested from the source system.

For more information, see [Ingestion management](ingestion-management/overview.md).

### Pausing and Resuming Connector

Pausing and resuming connector allows end user to completely stop and restart all of the connector operations.
Paused connector does not ingest data, but the costs of maintaining it are also minimized.

For more information, see [Pausing Connector](pause_connector.md) and [Resuming Connector](resume_connector.md).

### Viewing Statistics

Statistics collected during the ingestion allow end user to see how much data connector ingests on the hourly basis and can help to notice anomalies in the connector behavior.

For more information, see [Viewing Statistics](../reference/connector_stats_reference.md).

## Configuration update

When the wizard phase is complete, the Connector is fully configured and ready to start ingesting data.
To make changes to the configuration after the wizard phase, use one of the following options:

### Updating the connection configuration

To change the configuration which was set in the [Connection Configuration](connection_configuration.md) wizard step.

For more information, see [Update Connection Configuration](update_connection_configuration.md).

### Updating the warehouse

To change the warehouse which was set in [Connector Configuration](connector_configuration.md) wizard step.

For more information, see [Update Warehouse](update_warehouse.md).

---
title: Connector stats reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/connector_stats_reference.md
section: Native Apps Framework
---

# Connector stats reference

## Database objects and procedures

The following database objects are created through the file `observability/connector_stats.sql`.

### PUBLIC.GENERIC_CONNECTOR_STATS

View not available for any role, access via view `CONNECTOR_STATS`. View providing the data about ongoing and finished
ingestion runs. A view that retrieves and maps the data from the union of [`STATE.INGESTION_RUN` /
`STATE.RESOURCE_INGESTION_DEFINITION` / `STATE.INGESTION_PROCESS`] internal tables.

View structure with mapping is as follows:

1. ID (col) → RUN_ID (col);
2. RESOURCE_INGESTION_DEFINITION_ID (col)
3. INGESTION_CONFIGURATION_ID (col)
4. INGESTION_PROCESS_ID (col)
5. NAME (col)
6. STARTED_AT (col)
7. UPDATED_AT (col)
8. COMPLETED_AT (col)
9. STATUS (col)
10. INGESTED_ROWS (col)
11. DATEDIFF(second from STARTED_AT and COMPLETED_AT) (col) → DURATION_S (col);
12. INGESTED_ROWS (col) / DURATION_S (col) → THROUGHPUT_RPS (col);
13. METADATA (col)

### PUBLIC.AGGREGATED_CONNECTOR_STATS

This view is exposed to the `ADMIN` and `VIEWER` roles. It returns aggregated data from the above view and allows
access for the defined user. The rows will be grouped by truncated hours and displayed with summed updated rows.
View providing the aggregated data about daily ingestion runs.

A view that retrieves and maps the data from the `GENERIC_CONNECTOR_STATS` internal table
The mapping is as follows:

1. GROUPED BY(hours from STARTED_AT (col)) → RUN_DATE (col);
2. SUM(INGESTED_ROWS (col)) → UPDATED_ROWS (col);

Example `AGGREGATED_CONNECTOR_STATS` view created on example GENERIC_CONNECTOR_STATS:

| RUN_DATE | UPDATED_ROWS |
| --- | --- |
| <timestamp_ntz> | 20 |
| <timestamp_ntz> | 40 |
| … | … |

Overwriting this view is not recommended.

### PUBLIC.CONNECTOR_STATS

This view is exposed to the `ADMIN` role. It returns data from the connector stats view and allows access for the defined user.
In the default implementation this view exists only as an additional layer above `GENERIC_CONNECTOR_STATS`.
This implementation should be overwritten if some additional custom data needs to be added.

## Related tables and views

Connector stats are related to and dependent on the objects from the following files:

* `ingestion/ingestion_run.sql` (See: [STATE.INGESTION_RUN](resource_definition_and_ingestion_processes_reference.md))
* `ingestion/resource_ingestion_definition.sql` (See: [STATE.RESOURCE_INGESTION_DEFINITION](resource_definition_and_ingestion_processes_reference.md))
* `ingestion/ingestion_process.sql` (See: [STATE.INGESTION_PROCESS](resource_definition_and_ingestion_processes_reference.md))

---
title: Consumer security best practices for an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-security.md
section: Native Apps Framework
---

# Consumer security best practices for an app with containers

This topic provides security guidelines for consumers when running a Snowflake Native App.

## Consumer security best practices

Consumers have a shared responsibility for ensuring the security of their data and the safe
use of a Snowflake Native App with Snowpark Container Services. The following best practices help to ensure the security of an app:

* Review the app’s listing, documentation, and security certification before installing the app.
* Grant the minimum privileges required for an app.
* Grant access only to the specific tables and views that the app requires to function correctly.
* Periodically review and modify the privileges to ensure that the minimum required privileges
  are granted to the app.
* Immediately report any suspected security incidents to Snowflake and the app provider.
* Ensure the secure configuration of the consumer network environment and access controls.
* Review [network controls](security-na-spcs.md),
  [network configurations](../snowpark-container-services/service-network-communications.md),
  and [limitations of Snowpark Container Services](../snowpark-container-services/spcs-guidelines-and-limitations.md)
  and ensure only trusted endpoints are accessible to an app.
* Regularly update and patch systems and software to maintain a secure posture.
* Educate users on the secure use of apps with containers and the importance of data protection.

## Mitigate data exfiltration risks

Despite the security features of a Snowflake Native App, Snowflake cannot guarantee the security of
a third-party app. Consumers are responsible for ensuring the security of their data.

Snowflake recommends the following policies as part of a comprehensive security program to mitigate
data exfiltration and other security risks. This is important when using an app with containers
provided by third parties that implement services and ingress.

* Carefully review app listings, documentation, and security certifications before deploying apps.
* Ensure security controls outlined in Snowflake documentation are appropriate for your use case and
  environment.
* Configure app privileges and access controls to use only the minimum privileges required
  by an app.
* Regularly review and adjust privileges to maintain this principle and disable unused features.
* Use a modern, up-to-date browser with security features, for example the latest versions of
  Chrome, Firefox, or Safari.
* Ensure that your browser settings are configured to block pop-ups and protect against potential
  vulnerabilities.
* Ensure that network rule configurations conform to the expected behavior of the app. When a Snowflake Account URL is
  added to an external access integration, communication is not limited to a specific account. Traffic can be routed to different
  accounts based on access methods. Consumers should avoid adding account URLs if possible communication with other Snowflake accounts
  is unacceptable.

---
title: Consumer-controlled maintenance policies
source: https://docs.snowflake.com/en/developer-guide/native-apps/consumer-maintenance-policies.md
section: Native Apps Framework
---

# Consumer-controlled maintenance policies

With Snowflake Native Apps, consumers can set a maintenance policy for an upgrade so that apps don’t
update during specific time periods. When an upgrade is ready and a new release
directive is set, the upgrade begins. However, if the consumer has set a
maintenance policy, the upgrade is delayed until the start date and time
specified in the maintenance policy.

To create and set a maintenance policy, the consumer uses the following SQL commands:

* [CREATE MAINTENANCE POLICY](../../sql-reference/sql/create-maintenance-policy.md): Creates a new maintenance policy. The customer sets a schedule for the maintenance policy to allow upgrades to begin at a specific time.

To view and manage maintenance policies, the consumer uses the following SQL commands:

* [ALTER MAINTENANCE POLICY](../../sql-reference/sql/alter-maintenance-policy.md): Modifies an existing maintenance policy.
* [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md): Applies or removes a maintenance policy for all apps in the account.
* [ALTER APPLICATION](../../sql-reference/sql/alter-application.md): Applies or removes a maintenance policy for a specific app.
* [SHOW MAINTENANCE POLICIES](../../sql-reference/sql/show-maintenance-policies.md): Lists the maintenance policies for the specified account or app.
* [DESCRIBE MAINTENANCE POLICY](../../sql-reference/sql/desc-maintenance-policy.md): Shows the details of a maintenance policy.
* [DROP MAINTENANCE POLICY](../../sql-reference/sql/drop-maintenance-policy.md): Removes a maintenance policy from the current or specified schema.

Note the following details about consumer-controlled maintenance policies:

* If a consumer does not set a maintenance policy, the upgrade begins when the default upgrade time is reached. For more information, see
  [Maintenance window](../snowpark-container-services/working-with-compute-pool.md).
* Only the start time for a maintenance policy can be specified; not the end time or the duration of the maintenance policy.
* Each app or account can only have one maintenance policy set.
* The provider can set a maintenance deadline for an upgrade, so that the consumer can’t postpone the upgrade indefinitely. As a consumer,
  you should schedule your upgrades as soon as possible during a time when you can be available to test the upgrade and make any necessary adjustments, so that you can avoid having your app become unexpectedly unavailable during an upgrade.

For information about how providers enable maintenance window upgrades, see
[Consumer-controlled maintenance policies: Provider guide](consumer-maintenance-policies-provider.md).

## Creating a maintenance policy

To create a maintenance policy, a consumer uses the [CREATE MAINTENANCE POLICY](../../sql-reference/sql/create-maintenance-policy.md) command.

```sqlexample
CREATE MAINTENANCE POLICY my_maintenance_policy
  SCHEDULE = 'USING CRON 0 2 * * SAT UTC'
  COMMENT = 'Weekly Saturday maintenance policy';
```

Once the maintenance policy is created, it can be applied to an account or app using the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) or [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) commands.

```sqlexample
ALTER ACCOUNT SET MAINTENANCE POLICY my_maintenance_policy FOR ALL APPLICATIONS;

ALTER APPLICATION my_app SET MAINTENANCE POLICY my_maintenance_policy;
```

## Privileges

Use the following privileges to manage consumer-controlled maintenance policies.

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE MAINTENANCE POLICY | Schema | Required to create a new maintenance policy. |
| APPLY MAINTENANCE POLICY | Account | Required to apply a maintenance policy to an account or app. |
| APPLY or OWNERSHIP | Maintenance policy | Allows users access to apply or view a maintenance policy. |

## SQL reference

The following SQL commands are used to manage consumer-controlled maintenance policies:

* [CREATE MAINTENANCE POLICY](../../sql-reference/sql/create-maintenance-policy.md)
* [ALTER MAINTENANCE POLICY](../../sql-reference/sql/alter-maintenance-policy.md)
* [DROP MAINTENANCE POLICY](../../sql-reference/sql/drop-maintenance-policy.md)
* [SHOW MAINTENANCE POLICIES](../../sql-reference/sql/show-maintenance-policies.md)
* [DESCRIBE MAINTENANCE POLICY](../../sql-reference/sql/desc-maintenance-policy.md)

---
title: Consumer-controlled maintenance policies: Provider guide
source: https://docs.snowflake.com/en/developer-guide/native-apps/consumer-maintenance-policies-provider.md
section: Native Apps Framework
---

# Consumer-controlled maintenance policies: Provider guide

With consumer-controlled maintenance policies, consumers can define when Snowflake Native App upgrades happen in their
accounts. Instead of upgrades happening immediately when you release a new version, consumers can delay upgrades
to a maintenance window that works for their operations. For information about how consumers create and manage
maintenance policies, see [Consumer-controlled maintenance policies](consumer-maintenance-policies.md).

As a provider, you need to:

* Enable maintenance window upgrades on your release directives.
* Set an upgrade deadline so that consumers can’t postpone upgrades indefinitely.
* Optionally, align Snowpark Container Services compute pool node maintenance with the consumer’s maintenance window.
  Such an alignment is recommended because it minimizes disruptions for the consumers.

## Enabling maintenance window upgrades

When setting a release directive, you can specify that upgrades should respect consumer maintenance policies by
setting the UPGRADE_IN_MAINTENANCE_WINDOW parameter to TRUE. You must also set the UPGRADE_DEADLINE
parameter, which defines the latest date and time by which the upgrade must be completed. After this deadline,
the upgrade proceeds regardless of the consumer’s maintenance policy.

To enable maintenance window upgrades, use the [ALTER APPLICATION PACKAGE … MODIFY RELEASE CHANNEL](../../sql-reference/sql/alter-application-package-release-channel.md) command
as shown in the following example:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL DEFAULT
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = v1_0
  PATCH = 2
  UPGRADE_IN_MAINTENANCE_WINDOW = TRUE
  UPGRADE_DEADLINE = '2026-2-10T10:00:00Z';
```

This command configures the release directive so that consumers with a maintenance policy can delay the upgrade
until their next maintenance window, up to a maximum of February 10, 2026 at 10:00 AM.

> **Note:**
>
> * The UPGRADE_DEADLINE parameter is required when UPGRADE_IN_MAINTENANCE_WINDOW is set to TRUE.
>   Set the deadline to a date and time that allows sufficient time for consumers to complete the upgrade
>   within their maintenance windows.
> * You can’t set the UPGRADE_AFTER and UPGRADE_IN_MAINTENANCE_WINDOW parameters at the same time.
>   If you try to set both, the command fails with an error.

## Enabling automatic compute pool maintenance

If your app uses Snowpark Container Services, you can align compute pool node software upgrades with the consumer’s maintenance
window. Without this setting, application upgrades and compute pool node maintenance are separate concerns
that can happen at different times. By enabling automatic application maintenance, both are coordinated into
the consumer’s chosen maintenance window.

To enable this, set the AUTOMATIC_APPLICATION_MAINTENANCE property on the application package:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  SET AUTOMATIC_APPLICATION_MAINTENANCE = TRUE;
```

With this enabled, Snowpark Container Services compute pool node software upgrades are scheduled to occur during the consumer’s
maintenance window. The application upgrades first, then any compute pool node maintenance follows.

## What happens when a consumer has a maintenance policy

When you release an update with UPGRADE_IN_MAINTENANCE_WINDOW set to TRUE, the following occurs:

* If the consumer has set a maintenance policy, the upgrade is delayed until the next maintenance window
  defined by the consumer’s policy, or until the upgrade deadline is reached, whichever comes first.
* If the consumer has not set a maintenance policy, the upgrade happens during the default system maintenance
  window.
* If AUTOMATIC_APPLICATION_MAINTENANCE is enabled, the application code upgrades first, followed by
  any Snowpark Container Services compute pool node maintenance, all within the same maintenance window.

---
title: Core SQL reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/core_reference.md
section: Native Apps Framework
---

# Core SQL reference

File: `core.sql`

## Database objects and procedures

### STATE SCHEMA

An un-versioned schema containing the internal state of the Connector. This schema is persisted between different versions of the application.

### STATE.APP_STATE

Table to store the current status of the connector. This table is only accessible internally.
The table contains the following columns:

1. key STRING
2. value VARIANT
3. updated_at TIMESTAMP_NTZ

The following status is set as a default value during the installation:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "INSTALLED"
}
```

To retrieve the status use the `GET_CONNECTOR_STATUS` procedure below.

### PUBLIC.GET_CONNECTOR_STATUS()

This procedure retrieves the current status from the `APP_STATE` table. An exception will be thrown when status does not exist in the table.

### PUBLIC.RECOVER_CONNECTOR_STATE(NEW_CONNECTOR_STATUS STRING)

This procedure allows the user to force a change of the connector status. It should only be used as
a last resort, when all other means of fixing the connector have failed and the connector is ‘stuck’
in an unchangeable state.

The procedure can only be used by a user with the `ADMIN` role, to force connector status change
from `STARTING`, `PAUSING` or `ERROR` status into `STARTED` or `PAUSED` status.

## Roles

The `core.sql` file introduces the following roles into the application:

* `ADMIN` - has access to all publicly exposed procedures and views
* `VIEWER` - has access to all read only procedures and views
* `DATA_READER` - no access to anything by default. Should be used to access sink database only

## Related Java objects

The following Java objects are tightly connected with the `APP_STATE` table:

* `ConnectorStatusService`
* `ConnectorStatusRepository`
* `KeyValueTable`
* `FullConnectorStatus`
* `ConnectorStatus`

---
title: Costs associated with apps with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-cost-governance.md
section: Native Apps Framework
---

# Costs associated with apps with containers

This topic describes the costs associated with developing, publishing and using a
Snowflake Native App with Snowpark Container Services. It contains information for both providers and consumers.

## Costs to consumers

A Snowflake Native App may incur costs in the consumer account. The total cost of running
a Snowflake Native App with Snowpark Container Services is determined by the following:

* Costs determined by the provider
* Infrastructure costs

### Costs determined by the provider

A provider may monetize a Snowflake Native App using any of the
[paid listing pricing models](../../collaboration/provider-becoming.md)
that are available in the Snowflake Marketplace. These models include subscription based and usage based plans.

This cost to the consumer is determined by the provider. Consumers pay for provider software via the Snowflake Marketplace in addition to costs associated with running Snowflake
infrastructure, including warehouses and compute pools.

### Infrastructure costs

All infrastructure costs, including those related to compute pools, warehouse compute, storage, and
data transfer are the responsibility of the consumer of a Snowflake Native App.

A consumer can use the IN ACCOUNT clause of the
[SHOW COMPUTE POOLS](../../sql-reference/sql/show-compute-pools.md) command to see all compute pools in their account
and the current state of the compute pool. Costs are not incurred when a compute pool is suspended.

A Snowflake Native App with Snowpark Container Services requires at least one compute pool and might require multiple compute pools to run as
intended. A consumer has full control over the compute resources that the app requires, and may suspend a
compute pool or drop an application at any time.

Separate charges for compute pool compute related to the Snowflake Native App with Snowpark Container Services appear on the customer billing
statement. A consumer can determine the compute pool billing charges for a Snowflake Native App with Snowpark Container Services using the
[ACCOUNT USAGE views](../snowpark-container-services/accounts-orgs-usage-views.md) provided by
Snowpark Container Services.

For more details, such as the consumption table for compute pools, contact your account representative.

## Costs to providers

Providers can also incur costs when developing and maintaining a Snowflake Native App with Snowpark Container Services, including the
following:

* Providers incur Snowpark Container Services compute costs associated with both initial development and
  ongoing testing and support for their Snowflake Native App. The compute cost may be controlled through
  orchestration of compute pools during provider-side development and testing.
* The storage of container images can incur costs when a provider creates a new version or patch of
  a Snowflake Native App with Snowpark Container Services. In this context, the Docker images that the app requires are copied into an image
  repository that is not directly accessible or observable by the provider or the consumer.

  Services in the consumer account are created from the versioned images that are stored in this
  repository. Providers are responsible for the storage costs for the images in this stage, which
  appear on their Snowflake bill. These costs are aggregated with other storage costs that their account incurs.

---
title: Create a user interface to request privileges and references
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-ui.md
section: Native Apps Framework
---

# Create a user interface to request privileges and references

This topic describes how you, as a provider, create a user interface using Streamlit and the
Snowsight to allow consumers to grant privileges and create references for an installed
Snowflake Native App. To access Snowflake privileges and references from a Streamlit program, the Snowflake Native App Framework
provides the Python Permission SDK.

See [Python Permission SDK reference](requesting-permission-sdk-ref.md) for information on the methods in the Python Permission SDK.

## About privileges and references

For general information on requesting privileges and references from the consumer using the
Snowflake Native App Framework, refer to [Create and access objects in a consumer account](requesting-about.md).

## About the Python Permission SDK

The Snowflake Native App Framework provides the Python Permission SDK which allows a provider to do the following within a
Snowflake Native App:

* Check for account level privileges.
* Request global privileges that are listed in the manifest file.
* Request references to objects and their corresponding object level privileges as defined
  in the manifest file.
* Request privileged actions, for example creating an API integration or creating a share.

Using the Python Permission SDK, Snowsight displays the access requests in the
Security tab of the installed Snowflake Native App.

See [Python Permission SDK reference](requesting-permission-sdk-ref.md) for information on the methods in the Python Permission SDK.

## Workflow for creating an interface to approve privileges and bind references

The following general workflow outlines the steps required to implement a Streamlit app to
request grants for privileges and references from the consumer.

1. Create an application package.
2. In the manifest file, specify the privileges and define the references required for the Snowflake Native App.
3. Add a Streamlit app to your application package.
4. Add an `environment.yml` file to your application package.

   > **Note:**
   >
   > The `environment.yml` file must be in the same directory as main Streamlit file
   > used to implement the Snowsight interface.
5. Add the `snowflake-native-apps-permission` library as a dependency.
6. Import the `snowflake.permissions` library in your Streamlit app.
7. Add functions to your Streamlit app that call the functions provided by the SDK.

## Add the Python Permission SDK to your Streamlit environment

To use the Python Permission SDK in a Streamlit app, add the `snowflake-native-apps-permission`
package as a dependency in your `environment.yml` file as shown in the following example:

```yaml
name: sf_env
channels:
- snowflake
dependencies:
- snowflake-native-apps-permission
```

## Import the Python Permission SDK in a Streamlit app

To import the Python Permission SDK into your Streamlit app, include the following import statement in
your app:

```python
import snowflake.permissions as permissions
```

## Request privileges from the consumer

The following examples show how to perform different tasks using the Python Permission SDK.

### Check Account Level Privileges

This example shows how to use the
[get_held_account_privileges()](requesting-permission-sdk-ref.md) method of the
Python Permission SDK to check if permissions declared in the manifest file are granted to the installed Snowflake Native App.

For example, if a Snowflake Native App needs to create a database outside of the APPLICATION object, a provider
can define the reference in the manifest file as follows:

```yaml
privileges:
- CREATE DATABASE:
    description: "Creation of ingestion (required) and audit databases"
```

Using the Python Permission SDK, you can use the [get_held_account_privileges()](requesting-permission-sdk-ref.md)
method to obtain a list of privileges that have been granted to the Snowflake Native App.

```python
import streamlit as st
import snowflake.permissions as permissions
...
if not permissions.get_held_account_privileges(["CREATE DATABASE"]):
    st.error("The app needs CREATE DB privilege to replicate data")
```

This example calls the [get_held_account_privileges()](requesting-permission-sdk-ref.md) function, passing the
CREATE DATABASE permission as a parameter. A provider can use the [get_held_account_privileges()](requesting-permission-sdk-ref.md)
function to wait until the consumer grants the required privileges to the app.

> **Note:**
>
> Only privileges defined in the manifest file are valid arguments to
> [get_held_account_privileges()](requesting-permission-sdk-ref.md). Passing other arguments results in an error.

## Request privileged actions from the consumer

Providers can use the Python Permission SDK to request privileged actions required by the Snowflake Native App.

For example, to request an API integration that allows the Snowflake Native App to connect to a
ServiceNow instance, a provider would define the API integration in the manifest file:

```yaml
references:
- servicenow_api_integration:
  label: "API INTEGRATION for ServiceNow communication"
  description: "An integration required in order to support extraction and visualization of ServiceNow data."
  privileges:
    - USAGE
  object_type: API Integration
  register_callback: config.register_reference
```

Next, in the Streamlit app, the provider calls the [request_reference()](requesting-permission-sdk-ref.md) method
to request the USAGE privilege on the API integration as shown in the following example:

```python
permissions.request_reference("servicenow_api_integration")
```

---
title: Create and access objects in a consumer account
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-about.md
section: Native Apps Framework
---

# Create and access objects in a consumer account

This topic describes how providers can develop a Snowflake Native App to create objects in the
consumer account or access existing objects.

## Overview of creating and accessing objects in a consumer account

Snowflake Native Apps often need to create or access objects in a consumer account. For example,
even a basic app that allows the consumer to query shared data would require the app
to create and use a warehouse in the consumer account. An app may also need to connect to external
services that are outside Snowflake.

The Snowflake Native App Framework provides two ways of requesting privileges to create objects in the consumer account.

### Automatically granting privileges to an app

The Snowflake Native App Framework allows providers to request certain safe privileges and then grants them automatically. Providers add these privileges to the manifest file of the app. During installation or upgrade of the app, Snowflake automatically grants these privileges to the app.

For more information on configuring an app to have privileges granted automatically, see [Configure the privileges required by an app](requesting-auto-privs.md).

From the provider perspective, automated granting of privileges streamlines app development because the
app does not have to determine if a consumer has granted the requested privileges or created the required
objects in their account.

From the consumer perspective, automated granting of privileges simplifies app installation and configuration. However, it also gives the consumer less control by default over what an app can do in their account. To allow consumers more control over what an app can do, the Snowflake Native App Framework provides the following features:

App specifications:
:   Allow consumers to control the external endpoints that an app can connect to. To access services outside
    Snowflake, an app may need to create an external access integration or service integration depending on
    the type of service. Using automated granting of privileges, the app can create these objects in the
    consumer account. However, an account admin of the consumer account must approve the app specification
    that allows the app to perform the external connection.

    For information on developing an app to use app specifications, see [Overview of app specifications](requesting-app-specs.md).
    For information on how the consumer approves an app specification, see [Approve app specifications](ui-consumer-app-spec.md).

Feature policies:
:   Feature policies allow consumers to override the automatic granting of privileges. Before installing or
    upgrading an app, consumers can create a feature policy to prohibit the app from creating specific types
    of objects. For example, a consumer may need to configure a feature policy to prohibit an app from creating a
    warehouse. If an app attempts to create a warehouse during installation or upgrade, the installation fails.

    For information on how a consumer creates feature policies, see [Use feature policies to limit the objects an app can create](ui-consumer-feature-policies.md).

### Manually granting privileges to an app

For apps that were created before the release of
[automated granting of privileges](requesting-auto-privs.md), for example an app that was installed and does not
have the necessary privileges to create objects in the consumer account, consumers must manually grant
privileges to the app using SQL or Snowsight, depending on how the app is configured. For more
information, see [Request global privileges from consumers](requesting-privs.md).

### Accessing existing objects in the consumer account

In some contexts an app needs to access existing objects in a consumer account that exist outside the app. For example, an app might need to access existing tables in a consumer database. To allow the app to create objects, the Snowflake Native App Framework uses references that enable the customer to specify the name and schema for an object and enable access to the object.

For more information, see [Request references and object-level privileges from consumers](requesting-refs.md).

## Comparison of automatic and manual privileges

| App requirement | Automatically granting privileges | Manually granting of privileges |
| --- | --- | --- |
| Privileges to create objects | Apps have privileges to create objects, with some exceptions. | Consumers must explicitly grant privileges to the app using Snowsight or SQL. |
| Access external services | Apps can create network rules and external access integrations.  Consumers must approve external access using app specifications. | Consumers must manually create the required network rules and external access integrations and bind the integration using references. |
| Access external identity providers | Apps can create security integrations for external API Authentication.  Consumers must approve the external connection using app specification. | Consumers must manually create the required security integrations and bind the integration with references |
| Access to existing objects | Providers must use references to access existing objects.  Consumers approve access to the references. | Providers must use references to access existing objects.  Consumers approve access to the references. |
| App development | Providers do not have to write code to determine if the consumer has granted a certain privilege. | Providers must write code that checks if the consumer has granted a certain privilege. |
| App installation | Consumers do not have to manually create objects or grant privileges. | Consumers must manually create objects in their account or explicitly grant privileges to the app using Snowsight or SQL. |

## Security considerations when using auto privileges with app specifications

App specifications only control communications to endpoints outside
Snowflake. Consumers can approve or decline app specifications to
allow or prevent the app making connection to these endpoints.

App specifications do not prevent the app from creating
Snowflake objects that control external connections: network rules,
external access integrations, and security integrations. Privileges
to create these objects are granted using automated granting of
privileges.

App specifications do not provide data validation. In addition,
they do not place any restrictions on secrets or tokens referenced
by an external access integration or security integration.

For example, if a provider configures an external access integration
of an app to use ALLOWED_AUTHENTICATION_SECRETS and the consumer
approves the app specification for that integration, the app can later
modify the secrets and tokens that it uses.

However, if a provider modifies the app to use a different endpoint,
the sequence number of the app specification would change and the
consumer would need to re-approve or decline the new version.

---
title: Create and manage an application package
source: https://docs.snowflake.com/en/developer-guide/native-apps/creating-app-package.md
section: Native Apps Framework
---

# Create and manage an application package

This topic describes how providers can create an application package to develop a Snowflake Native App.

## About the application package

An application package is a container that encapsulates the data content and application logic used by a Snowflake Native App. An application package also contains
information about versions and patches defined for an app.

Each version of an app requires its own version of the manifest and setup script:

manifest file:
:   The manifest file contains information that the application package
    requires to create and manage a Snowflake Native App. This includes the location
    of the setup script, version definitions, and configuration information
    for the app.

    For more information, see [Create the manifest file for an app](manifest-overview.md).

setup script:
:   The setup script contains SQL statements that are run when the app is
    installed, either in the consumer account or locally during development and
    testing.

    For more information, see [Create the setup script](creating-setup-script.md).

> **Note:**
>
> You can create an application package without creating the manifest file or
> setup script. However, to develop or test an app, you must upload these
> files to a stage so they are accessible to the application package.

## About release channels

Release channels manage the release lifecycle of Snowflake Native Apps. They allow providers to create
and manage versions of an app and to publish the app at different stages of development to all consumers or specific groups of consumers.

> **Caution:**
>
> When you create an application package, release channels are enabled by default.
> After release channels have been enabled for an application package, they can’t
> be disabled.

For more information on using release channels to manage the release lifecycle
of an app, see [Publish an app using release channels](release-channels.md).

To use the previous process for managing versions and patches you must explicitly disable release
channels when creating the application package. However, for new app development Snowflake recommends
using release channels to manage the release lifecycle of your apps.

For information on using the older features for managing versions and patches, see
[Develop a new version of an app (Legacy)](update-app-develop.md).

## Privileges required to create an application package

To create an application package you must have the global CREATE APPLICATION PACKAGE privilege granted to your role.

## Create an application package

You can create an application package using one of the following methods:

* Snowsight
* The [CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md) command.
* The [Snowflake CLI](../snowflake-cli/index.md)

### Create an application package using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App packages.
3. Select Create and then click App Package in the right pane.
4. Enter a name for your application package.
5. Select the intended consumer for the application package:

   * Select Distribute to accounts outside of your organization to make
     the application package available outside your organization. Selecting this
     option initiates an [automated security scan](security-overview.md) for each version and patch defined in your application package.
   * Select Distribute to accounts in your organization to make the
     application package available within your organization. The automated
     security scan is not initiated.
6. (Optional) Enter comments for the application package. These comments are not visible to the consumer.
7. Select Add.

### Create an application package using SQL commands

To create an application package using SQL, use the
[CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md)
command as shown in the following example:

```sqlexample
CREATE APPLICATION PACKAGE my_application_package;
```

This command creates an application package named `my_application_package` in your Snowflake account.
By default, [release channels](release-channels.md) are enabled for the application package.

After creating an application package, use the
[SHOW APPLICATION PACKAGES](../../sql-reference/sql/show-application-packages.md) command to view the list
of available application packages.

### Create an application package using the Snowflake CLI

If you are using the Snowflake CLI to develop an app, the application package is
created when you run the
[snow app run](../snowflake-cli/command-reference/native-apps-commands/run-app.md)
command. This command creates an application package in your Snowflake account, uploads code
files to a stage, then creates or upgrades an app from the application package.

## Grant the required privileges on an application package

Some tasks related to creating or using an application package require specific privileges on the application package. The following table describes the privileges required to perform these tasks:

| Privilege | Task |
| --- | --- |
| ATTACH LISTING | Add an application package to a listing. |
| DEVELOP | Create an APPLICATION object in development mode from the application package. |
| INSTALL | Create an APPLICATION object based on the application package. |
| MANAGE RELEASES | Specify a release directive, view the version and patch level. |
| MANAGE VERSIONS | Add a version and patch level to an application package. |
| OWNERSHIP | Perform all of the tasks above. |

### Grant privileges on an application package using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App packages.
3. Select the application package, then select the Settings tab.
4. In the Privileges section, select the edit icon next to the privilege you want to
   grant.
5. Select Add Role, then select the role to which you want to grant the privilege.
6. Select Save.

The role appears next to the privilege.

### Grant privileges on an application package using SQL commands

To grant a privilege on the application package to a role using SQL, use the
[GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command as shown in the following example:

```sqlexample
GRANT MANAGE RELEASES ON APPLICATION PACKAGE hello_snowflake_package TO ROLE app_release_mgr;
```

This command grants the MANAGE RELEASES privilege to the `app_release_mgr` role. You can
use the same command to grant the other privileges available on an application package.

## Set the default release directive for an application package

A release directive determines the version and patch of an app that is available
to a consumer when they install the app or when an installed app is automatically
upgraded. For information on setting the release directive, see
[Set the release directive for an app (Legacy)](update-app-release-directive.md)

## Allow consumers to install multiple instances of an app

Providers can configure an application package to allow consumers to install
multiple instances of an app.

To enable multiple instances of an app, use the `MULTIPLE_INSTANCES = TRUE` clause of the
[CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md) or
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) commands.

If multiple instances are allowed for an app, consumers can install a maximum of 30 instances of the app in their account.

You cannot set this property for an application package that is included in a trial listing. An app installed from a paid listing
can’t have multiple instances using the DEFAULT release channel.

> **Caution:**
>
> After setting the `MULTIPLE_INSTANCES` property to `TRUE`, it cannot be unset or set to `FALSE`.

## Transfer ownership of an application package

After creating an application package, you can transfer ownership of the application package to
another account-level role.

### Transfer ownership using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App packages.
3. Select … next to the application package you want to transfer ownership, then select
   Transfer Ownership.
4. Under Transfer to, select the new account-level role.
5. Select Transfer.

### Transfer ownership using SQL Commands

To transfer ownership of an application package to a different account-level role using SQL, use the
[GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md) command as shown in the following example:

```sqlexample
GRANT OWNERSHIP ON APPLICATION PACKAGE hello_snowflake_package TO ROLE native_app_dev;
```

## Delete an application package

Providers with the OWNERSHIP privilege on an application package can remove it
from an account. However, providers cannot remove an application package that is
currently associated with a listing.

After removing an application package, it is no longer available in the provider account.

> **Caution:**
>
> After removing a listing and the attached application package, the consumer
> can view but not access the Snowflake Native App created from the application package.
> If a consumer tries to access the Snowflake Native App, they receive an error
> indicating the application package has been removed.

### Delete an application package using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App packages.
3. Select … next to the application package you want to remove, then select Drop.

### Delete an application package using SQL commands

To remove an application package using SQL, run the [DROP APPLICATION PACKAGE](../../sql-reference/sql/drop-application-package.md) command
as shown in the following example:

```sqlexample
DROP APPLICATION PACKAGE hello_snowflake_package;
```

---
title: Create resource
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/ingestion-management/create_resource.md
section: Native Apps Framework
---

# Create resource

Creating a resource is required in order to define and schedule data ingestion from a source system.
`PUBLIC.CREATE_RESOURCE` procedure is the entry point from the UI or worksheet to create a new resource.

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The resource creation process consists of several phases. Several of which are customizable but include reasonable defaults.
Phases are:

1. Initial validation
2. Custom validation
3. Custom logic before a resource is created
4. Creation of resource ingestion definition and ingestion processes
5. Custom logic after a resource is created

## Initial validation

Initial validation is performed at the very beginning of the resource creation process, and checks:

* whether given input data represent a valid resource ingestion definition object
* whether a resource with given `id` and `resourceId` does not exist

## Custom validation

Custom validation is executes after initial validation and is designed to support customized connector-specific logic.
For example, it can be used to verify that a given resource exists in a source system.

By default, it invokes `PUBLIC.CREATE_RESOURCE_VALIDATE(resource VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten using a SQL script or by using
a `CreateResourceHandlerBuilder` to provide custom implementation of the `CreateResourceValidator` interface.

If the custom validation returns an error, the following steps will not be executed and the error response will be returned from `CREATE_RESOURCE` procedure.

## Custom logic before a resource is created

You can implement custom logic before a resource is created and scheduled.
For example, it can be used to create a new destination table where ingestion data will be saved.

By default, it invokes `PUBLIC.PRE_CREATE_RESOURCE(resource VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `CreateResourceHandlerBuilder` to provide custom implementation of the `PreCreateResourceCallback` interface.

If custom logic returns an error, the follow on steps will not be executed and provided error response will be returned from `CREATE_RESOURCE` procedure.

## Creation of resource ingestion definition and ingestion processes

During this step a new record is added to `STATE.RESOURCE_INGESTION_DEFINITION` table.
Additionally, when a resource should be initially enabled (`enabled` parameter equals `true`),
a new ingestion process is added for each provided ingestion configuration.
Ingestion processes are created with `SCHEDULED` status which means that the ingestion will begin later.
If the `enabled` flag is set to `false`, no ingestion process is created and in the `ENABLE_RESOURCE` procedure
must be called to enable ingestion.

## Custom logic after a resource is created

Custom logic can be specified after a resource is created and scheduled.
For example, it can be used to create a new destination table where ingestion data will be saved.

By default, it invokes `PUBLIC.POST_CREATE_RESOURCE(id VARCHAR)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `CreateResourceHandlerBuilder` to provide custom implementation of the `PostCreateResourceCallback` interface.

If custom logic returns an error, given error response will be returned from `CREATE_RESOURCE` procedure but the creation of resource ingestion definition and ingestion processes will not be rolled back.

## Response

### Successful response

When successful, the procedure returns a response resembling:

> ```json
> {
>   "response_code": "OK",
>   "id": "<new resource ingestion definition id>"
> }
> ```

The `id` returned in the response is an id of resource ingestion definition and can be used to enable, disable or update the resource afterwards.

### Error response

On error the, the procedure returns a response resembling:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_INPUT` - Provided procedure’s arguments are invalid and it is not possible to create a valid resource object or a resource with given id already exists.
* `CREATE_RESOURCE_ERROR` - Something unexpected happened when creating the new resource ingestion definition or when creating ingestion processes. All changes are rolled back.

---
title: Create resource reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/create_resource_reference.md
section: Native Apps Framework
---

# Create resource reference

## Database objects and procedures

The following database objects are created when the file `ingestion/resource_management.sql` is executed.

### PUBLIC.CREATE_RESOURCE(name VARCHAR,resource_id VARIANT,ingestion_configurations VARIANT,id VARCHAR,enabled BOOLEAN,resource_metadata VARIANT)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `CreateResourceHandler.createResource`.

### PUBLIC.CREATE_RESOURCE_VALIDATE(resource VARIANT)

Procedure used for connector specific validation of create process. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultCreateResourceValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.PRE_CREATE_RESOURCE(resource VARIANT)

Procedure used for adding connector specific logic which is invoked before a resource is created.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPreCreateResourceCallback`. Can be overwritten both in SQL and Java.

### PUBLIC.POST_CREATE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Procedure used for adding connector specific logic which is invoked after a resource is created and scheduled.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPostCreateResourceCallback`. Can be overwritten both in SQL and Java.

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.ingestion.create` package and some common components are tightly connected with the above procedures:

* `CreateResourceHandler`
* `CreateResourceHandlerBuilder`
* `CreateResourceValidator`
* `PreCreateResourceCallback`
* `PostCreateResourceCallback`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of `CreateResourceHandler`, the `PUBLIC.CREATE_RESOURCE` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CREATE_RESOURCE(name VARCHAR,resource_id VARIANT,ingestion_configurations VARIANT,id VARCHAR,enabled BOOLEAN,resource_metadata VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomCreateResourceHandler.createResource';

  GRANT USAGE ON PROCEDURE PUBLIC.CREATE_RESOURCE(VARCHAR, VARIANT, VARIANT, VARCHAR, BOOLEAN, VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal procedures `CREATE_RESOURCE_VALIDATE`, `PRE_CREATE_RESOURCE` and `POST_CREATE_RESOURCE` can be also customized through the SQL. They can also invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CREATE_RESOURCE_VALIDATE(resource VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
      -- SOME CUSTOM LOGIC BEGIN
      SELECT sysdate();
      -- SOME CUSTOM LOGIC END

      RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

  CREATE OR REPLACE PROCEDURE PUBLIC.CREATE_RESOURCE_VALIDATE(resource VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomHandler.createResourceValidate';
```

### Builder approach

`CreateResourceHandler` can be customized using `CreateResourceHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `CreateResourceValidator`
* `PreCreateResourceCallback`
* `PostCreateResourceCallback`
* `ConnectorErrorHelper`

In case a function is not provided the default implementation provided by the SDK will be used.

```java
class CustomPreCreateResourceCallback implements PreCreateResourceCallback {
  @Override
  public ConnectorResponse execute(String resourceIngestionDefinitionId) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.CREATE_RESOURCE procedure using SQL
  public static Variant createResource(
      Session session,
      String name,
      Variant resourceId,
      Variant ingestionConfigurations,
      String id,
      boolean enabled,
      Variant resourceMetadata) {
    //Using builder
    var handler = CreateResourceHandlerBuilder.builder(session)
      .withPreCreateResourceCallback(new CustomPreCreateResourceCallback())
      .build();
    return handler.createResource(resourceIngestionDefinitionId).toVariant();
  }
}
```

---
title: Create the manifest file for an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/manifest-overview.md
section: Native Apps Framework
---

# Create the manifest file for an app

This topic describes how to create the manifest file for an app.

## About the manifest file

The manifest file contains information that the application package
requires to create and manage a Snowflake Native App. This includes the location
of the setup script, version definitions, and configuration information
for the app.

The manifest file has the following requirements:

* The name of the manifest file must be `manifest.yml`.
* The manifest file must be uploaded to a named stage so that it is
  accessible to the application package.
* The manifest file must exist at the root of the directory structure on
  the named stage where other application files are stored.

## Version 1 and version 2 of the manifest file

Snowflake Native Apps supports two versions of the manifest file. The version is
specified using the `manifest_version` field.

`manifest_version: 1`
:   This version of the manifest file supports the current and
    legacy functionality of Snowflake Native Apps.

`manifest_version: 2`
:   This version of the manifest file provides support for additional
    features, including automated granting of privileges.

## Security considerations when using version 2 of the manifest file

When using version 2 of the manifest file, consider the following security implications:

When a provider configures an app to use
`manifest_version: 2` in the manifest file, automated granting of
privileges is enabled. By default this allows Snowflake to automatically
grant certain privileges to the app. For information on the privileges
that can be automatically granted to the app, see
[Privileges granted by automated granting of privileges](requesting-auto-privs.md).

During installation, Snowsight displays a notification about
the privileges requested by the app. When a consumer installs an app
that uses automated granting of privileges, they agree that the app may
be granted these privileges during upgrades without requiring additional
consent.

Consumers can create feature policies that restrict the objects an app
can create. For more information on creating feature policies, see
[Use feature policies to limit the objects an app can create](ui-consumer-feature-policies.md).

## Specify the privileges required by an app with containers

Like other apps, the `privileges` field of the manifest file
specifies the privileges that an app with containers requests from
consumers.

The following privileges are specific to an app with containers:

* CREATE COMPUTE POOL

  This privilege is required to allow the app to create a compute pool in the consumer account. It is not required if the consumer creates
  the compute pool manually.
* BIND SERVICE ENDPOINT

  This privilege is required to allow an endpoint to be accessible outside of Snowflake.

The following example shows how to add these privileges to the `privileges` block:

```yaml
privileges:
- CREATE COMPUTE POOL:
  description: 'Required to allow the app to create a compute pool in the consumer account.'
- BIND SERVICE ENDPOINT:
  description: 'Required to allow endpoints to be externally accessible.'
```

## Specify the container images used by an app with containers

To specify the location of the container images used by the app with
containers, add the `images` property to the `artifacts.container_services` block.

You must include an entry for each image. The path specified includes
the name of the database, schema, and image repository. This path has
the following form:

```yaml
/<database>/<schema>/<image_repository>/<image_name>:tag
```

The following example shows how to specify the `images` property:

```yaml
artifacts
...
  container_services
    ...
    images
      - /dev_db/dev_schema/dev_repo/image1
      - /dev_db/dev_schema/dev_repo/image2
```

## Specify the user interface endpoint for an app with containers

To specify the endpoint for the user interface of the app with
containers, add the `default_web_endpoint` property to the
`artifacts` block.

The `default_web_endpoint` property is optional. If this property
is specified, the endpoint must also be defined in the service
specification file.

> **Note:**
>
> Only one of the `default_web_endpoint` and `default_streamlit` can be specified.

This entry in the manifest file has two additional properties:

* `service`
  :   Specifies the name of the service of the user interface.
* `endpoint`
  :   Specifies the name of the endpoint.

The following example shows how to specify the `default_web_endpoint` property.

```yaml
default_web_endpoint:
  service: ux_schema.ux_service
  endpoint: ui
```

## Example manifest files

The following examples show typical manifest files for different types of use cases.

### Example manifest file for using automated granting of privileges

The following manifest file shows how to configure an app to use
automated granting of privileges. This example uses version 2 of the
manifest file. The `privileges` block specifies the privileges that the app requires.

```yaml
manifest_version: 2
version:
  name: v1
artifacts:
  readme: readme.md
  setup_script: setup.sql
privileges:
  - CREATE TABLE:
    description: "Allows the app to create tables in the consumer account"
  - CREATE WAREHOUSE:
    description: "Allows the app to create warehouses in the consumer account"
```

When the app is installed, Snowflake automatically grants the CREATE TABLE and CREATE WAREHOUSE privileges to the app.

### Example manifest file files for an app with containers

Snowflake Native Apps supports entries in the manifest file that are specific to
an app with containers. The following example manifest file
shows a typical manifest file for an app with containers:

```yaml
manifest_version: 2
version:
  name: v1
artifacts:
  readme: readme.md
  setup_script: setup.sql
  container_services:
    images:
      - /dev_db/dev_schema/dev_repo/image1
      - /dev_db/dev_schema/dev_repo/image2
  default_web_endpoint:
    service: ux_schema.ux_service
    endpoint: ui
privileges:
 - CREATE COMPUTE POOL:
   description: "..."
 - BIND SERVICE ENDPOINT:
   description: "...”
```

---
title: Create the setup script
source: https://docs.snowflake.com/en/developer-guide/native-apps/creating-setup-script.md
section: Native Apps Framework
---

# Create the setup script

This topic describes how to use the setup script to create objects in the app when
running the [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command.

It also describes application roles and how they are used within the setup script.

## About the setup script

The setup script contains SQL statements that are run when the [CREATE APPLICATION](../../sql-reference/sql/create-application.md)
command is run in one of following contexts:

* A consumer installs or upgrades a Snowflake Native App.
* A provider creates or upgrades an app when testing the application package.

> **Note:**
>
> The setup script only support using SQL commands. Other languages are not supported.

The SQL statements in the setup script create objects within the app that are
required by the app. This includes database objects, stored procedures, views, and
application roles.

The manifest file specifies the filename and relative path to the setup script. The setup
script must exist on a named stage and be accessible by the app package.

## Restrictions on the setup script

The following cannot be performed within a setup script:

* [USE DATABASE](../../sql-reference/sql/use-database.md)
* [USE SCHEMA](../../sql-reference/sql/use-schema.md)
* [USE ROLE](../../sql-reference/sql/use-role.md)
* [USE SECONDARY ROLES](../../sql-reference/sql/use-secondary-roles.md)
* Only certain object types support setting the LOG_LEVEL, TRACE_LEVEL, METRIC_LEVEL, and
  LOG_EVENT_LEVEL properties using the [ALTER <object>](../../sql-reference/sql/alter.md) command. For a list of
  supported object types, see [Set the log and trace levels for an app](event-definition.md).
* Creating or invoking procedures that are EXECUTE AS CALLER.
* Creating Snowpark user-defined functions (UDFs) or procedures that use IMPORT to include files
  on a named stage.
* Calling procedures, functions or anonymous code blocks that refer to code not included in the
  application package.
* Importing code files from a named stage when using the [CREATE FUNCTION](../../sql-reference/sql/create-function.md)
  command.
* Using [CALL](../../sql-reference/sql/call.md) to call a procedure that runs as EXECUTE AS CALLER.

There are additional restrictions on objects created within a versioned schema.

## Visibility of objects created in the setup script

The setup script can create most types of database-level objects. Database objects created by
the setup script are internal to the app. When a consumer installs an app, by default,
these objects are invisible and inaccessible to the consumer account directly.

> **Note:**
>
> Providers can access objects created by the setup script by using development mode when
> testing an application package. See [Use development, debug, and session debug modes to test an app](installing-testing-application.md) for more information.

A provider can make these objects visible to the consumer using application roles. An application
role created within the setup script is automatically granted to the role owning the app. Application roles
granted by the setup script cannot be revoked.

Users that have been granted a role that owns the application object can grant application roles to other
roles within their account. For example, the setup script can define an application
role, such as APP_ADMIN, and this role can grant permission to access objects within the app.

## Set the log level for messages output by the setup script

A provider can specify the log level for messages generated when the setup script runs. See
[Logging messages in Snowflake Scripting](../logging-tracing/logging-snowflake-scripting.md) for additional information.

To configure the log level for the setup script, use one of the following system function:

* [SYSTEM$LOG](../../sql-reference/functions/system_log.md)
* [SYSTEM$LOG_<level>](../../sql-reference/functions/system_log.md)

For example, to configure the setup script to log error messages, add the following command at
the beginning of the setup script:

```sqlexample
SYSTEM$LOG('error', 'Error message');
```

## Create modular setup scripts

The setup script of a typical app can be large and complex. To make the setup script more modular
and easier to maintain, a provider can create a primary setup script that calls multiple secondary
setup scripts.

For example, a provider can create different setup scripts to handle different types of tasks, for
example, creating objects, creating views, creating stored procedures, etc.

When the [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command runs, it runs the main setup script
specified in the manifest file. To run additional setup scripts from the main setup script,
use the [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) command.

Setup scripts included in the primary setup script are run in the order they are
encountered. These secondary setup scripts can also include instances of the
[EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) command.

### Add multiple setup scripts to an app

1. Add the location of the primary setup script to the manifest file.

   ```yaml
   artifacts:
     ...
     setup_script: scripts/setup.sql
     ...
   ```
2. Create the primary setup script.

   The following example shows a typical directory structure for an app:

   ```none
   @test.schema1.stage1:
   └── /
       ├── manifest.yml
       ├── readme.md
       ├── scripts/setup_script.sql
   ```

   Where `setup_script.sql` is the primary setup script.
3. Create the secondary setup scripts.

   The following example shows a typical directory structure for an app containing
   multiple setup scripts:

   ```none
   @test.schema1.stage1:
   └── /
       ├── manifest.yml
       ├── readme.md
       ├── scripts/setup_script.sql
       ├── scripts/secondary_script.sql
       ├── scripts/procs/setup_procs.sql
       ├── scripts/views/setup_views.sql
       ├── scripts/data/setup_data.sql
   ```
4. Within the primary setup script, use the [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md)
   command to specify a relative path to each secondary setup script:

   ```sqlexample
   ...
   EXECUTE IMMEDIATE FROM 'secondary_script.sql';
   EXECUTE IMMEDIATE FROM './procs/setup_procs.sql';
   EXECUTE IMMEDIATE FROM '../scripts/views/setup_views.sql';
   EXECUTE IMMEDIATE FROM '/scripts/data/setup_data.sql';
   ...
   ```

   The path provided to the [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) command
   is case-sensitive and it can be used with any setup script. Use a forward slash (`/`) to
   indicate the relative path of the app root directory, use a period and a forward slash (`./`)
   to indicate the current directory for the setup script, and use two periods and a forward slash (`../`)
   to indicate the parent directory for the setup script.

   A primary setup script is the script defined in the manifest. The [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md)
   command can be used with any setup script.

### Limitations on using EXECUTE IMMEDIATE FROM in a setup script

The following limitations apply when using [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) within
a setup script:

* Event logging is not supported in setup scripts called using [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).
* Accessing files stored on encrypted external stages in the consumer account is not supported.
* During app runtime, only the relative path format with a forward slash (`/`) is permitted. For example,
  `EXECUTE IMMEDIATE FROM '/scripts/data/setup_data.sql'`.

## Best practices when creating the setup script

Snowflake recommends the following best practices when creating the setup script for an app.

### Use idempotent forms of CREATE statements

When using a CREATE command to create objects within the setup script, Snowflake recommends using the
following versions of these commands:

* CREATE OR REPLACE
* CREATE IF NOT EXISTS

The setup script can be run multiple times during installation and upgrade. In cases where an error occurs,
these objects might already exist, especially if they are created within a versioned schema.

### Include the target schema when creating objects

The [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) command does not change the session context. Objects must be
qualified with the target schema when they are created. For example, to create a schema within the setup
script, use the following commands:

```sqlexample
CREATE SCHEMA IF NOT EXISTS app_config;
CREATE TABLE IF NOT EXISTS app_config.params(...);
```

### Do not refer to objects in the app from outside the app

Do not create objects outside the app that refer to objects within the app. Although the Snowflake Native App Framework
does not prohibit creating these objects, it can lead to problems when a consumer installs the Snowflake Native App.

For example, consider the context where a setup script creates a database, schema, and view outside
of the app and the view refers to a table within the app. In this context, the view in the database breaks when the
consumer takes ownership of the database and drops the app.

This best practice applies to tables, stored procedures, user-defined functions and references created
by the setup script.

### Account for possible failures when using versioned or non-versioned schemas

Objects in a versioned schema can refer to objects in a non-versioned schema and vice versa. The setup
script must account for what might happen in case of failure during installation or upgrade. For example,
a provider must account for what happens if the setup script automatically runs again if the initial
run fails.

For example, consider creating objects using the following:

```sqlexample
CREATE OR REPLACE PROCEDURE app_state.proc()...;
GRANT USAGE ON PROCEDURE app_state.proc()
  TO APPLICATION ROLE app_user;
```

In this example, the CREATE OR REPLACE statement replaces an existing procedure, which implicitly
removes privileges that had been previously granted to that procedure. Although the grants might be
restored later in the script, if the script fails when it is run, consumers might lose the ability to access
the procedure.

If a setup script fails due to an issue that cannot be resolved by a retrying, for example a
syntax error, the consumer cannot access the procedure until the app is upgraded to a new version or patch
and the grant is restored.

> **Caution:**
>
> The guidance in this section does not apply to [tags](../../user-guide/object-tagging/introduction.md),
> [masking policies](../../user-guide/security-column-intro.md), and [row access policies](../../user-guide/security-row-intro.md) outside the
> context of the Snowflake Native App Framework.
>
> Tag and policy assignments do not propagate to incremental versions of a versioned schema. These scenarios trigger an error message
> (using a tag as an example):
>
> * Create a tag in the versioned schema and assign the tag to an object in a different schema.
> * Create a tag in a non-versioned schema and assign the tag to an object in a versioned schema.
> * Create tables or views in the versioned schema and assign a tag to the tables or views when the tag exists in a non-versioned schema.
> * Create tables or views in a non-versioned schema and assign a tag to the tables or views when the tag exists in a versioned schema.
>
> The error message is:
>
> ```output
> A TAG in a versioned schema can only be assigned to the objects in the same schema. An object in a versioned schema can only have a TAG assigned that is defined in the same schema.
> ```
>
> If the policy assignment triggers the error message, the error message specifies `POLICY` instead of `TAG`.
>
> To prevent the error message:
>
> * The Snowflake Native App provider should update the setup script to ensure that tags (or policies) are set on objects within the
>   same schema as the tag when a versioned schema contains either the tag or the object on which the tag is set. If a non-versioned
>   schema contains either the tag or the object on which the tag is set, it is not necessary to update the setup script.
> * If the Snowflake Native App consumer sees this error message when installing an app, the consumer should ask the provider to update
>   their setup script. Additionally, the consumer should not assign any tag that exists in a versioned schema to any object in their
>   account, such as a warehouse, or assign a policy that exists in a versioned schema to a table or column, or assign a policy or tag to
>   an object that exists in a versioned schema inside the Snowflake Native App. If so, Snowflake returns the same error message.

### Define views within a versioned schema

Always define views on shared content in a versioned schema to ensure that any code
accessing the view during an upgrade uses a consistent view. You should also use a versioned
schema when adding or removing new columns or other attributes.

### Ensure time-consuming operations are compatible

If the setup script must perform very long-running operations, such as upgrading large state tables,
ensure that these updates are compatible with existing running code from the previous version.

## About application roles

By default the consumer has no privileges on objects created within the app. Even the ACCOUNTADMIN role
cannot view the objects **within** an app. Objects that the app creates outside itself,
such as a database, are visible only to the ACCOUNTADMIN role of the consumer account.

Application roles are similar to database roles, but may only be created within the app. Unlike database
roles, application roles can be granted privileges on objects that exist outside of the app.

Application roles should be created by the setup script when the app is installed and are automatically granted
to the app owner’s role, who then can grant appropriate application roles to other roles in the consumer account.

> **Note:**
>
> Application roles are the only type of role that can be created within an app. Database
> roles, for example, are not permitted within the app.
>
> Likewise, application roles can only be created in an app and not, for example, in a normal
> database or at the account level.

Any privileges granted to application roles is passed to the app owner, which is the role used to install
the app. The owner may further delegate application roles to other roles within the consumer
account.

The setup script can also define an application role (e.g. USER). Using this role, consumers
are granted access to use the functionality provided by the app. The setup script
can define an application role, such as READ_ONLY, to provide restricted access to select
areas of data within the app.

Unlike database roles, application roles may also be granted privileges on objects outside
of the installed app. They may therefore be used to grant privileges on objects
outside of the app. However, the application role itself must be created within the app.

## Supported SQL commands for working with application roles

The Snowflake Native App Framework provides the following SQL commands for working with application roles:

* [ALTER APPLICATION ROLE](../../sql-reference/sql/alter-application-role.md)
* [CREATE APPLICATION ROLE](../../sql-reference/sql/create-application-role.md)
* [DROP APPLICATION ROLE](../../sql-reference/sql/drop-application-role.md)
* [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md)
* [REVOKE APPLICATION ROLE](../../sql-reference/sql/revoke-application-role.md)
* [SHOW APPLICATION ROLES](../../sql-reference/sql/show-application-roles.md)

## Using application roles in the setup script

Application roles defined in the setup script are automatically granted to the role owning
the app instance. When the app is installed, the role used to install the app is the owner of the app.
However, the app owner can grant privileges to other account roles in the consumer account.

Application roles allow privileges on objects within the app to be granted to the consumer. For example:

```sqlexample
CREATE APPLICATION ROLE admin;
CREATE APPLICATION ROLE user;
GRANT APPLICATION ROLE user TO APPLICATION ROLE admin;

CREATE OR ALTER VERSIONED SCHEMA app_code;
GRANT USAGE ON SCHEMA app_code TO APPLICATION ROLE admin;
GRANT USAGE ON SCHEMA app_code TO APPLICATION ROLE user;
CREATE OR REPLACE PROCEDURE app_code.config_app(...)
GRANT USAGE ON PROCEDURE app_code.config_app(..)
  TO APPLICATION ROLE admin;

CREATE OR REPLACE FUNCTION app_code.add(x INT, y INT)
GRANT USAGE ON FUNCTION app_code.add(INT, INT)
  TO APPLICATION ROLE admin;
GRANT USAGE ON FUNCTION app_code.add(INT, INT)
  TO APPLICATION ROLE user;
```

In this example, the setup script creates application roles named `admin` and a `user`. The setup
script then grants both application roles access to the schema containing the app code. It also grants
access to the `add` function within the schema. The `admin` role is also granted access to the
`config_app` procedure.

## Application roles and versions

Application roles are not versioned. This means that dropping an application role or revoking a
permission from an object that is not in a versioned schema can impact the current version of an
application or the version being upgraded. Application roles may only be safely dropped when you have
dropped all versions of the app that use those roles.

> **Note:**
>
> Application roles cannot be granted ownership of objects. This means that an application role
> defined in the setup script should only be used to allow consumers to access objects within the installed
> Snowflake Native App.

---
title: Create versions and patches for an app (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app-versions.md
section: Native Apps Framework
---

# Create versions and patches for an app (Legacy)

This topic describes how to add versions and patches to an application package.

For general information about versions and patches and how they are used to update and
upgrade an app, see [About app versions and patches](update-app-overview.md).

## Add a version or patch to an application package

The version and patches of an app are defined in the application package.

After adding a version or patch to an application package, providers can test the changes locally
by creating an app based on the version or patch.

See [Create an app from a version or patch](installing-testing-application.md) for more information.

### Privileges required to add or remove versions and patches

To specify a version or patch for an application package, you must have one of the following privileges
granted on the application package to your role:

* OWNERSHIP
* MANAGE VERSIONS

For example, to grant the MANAGE VERSION privilege on the application package to the
`release_mgr` role, use the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command as shown in the following
example:

```sqlexample
GRANT MANAGE VERSIONS ON APPLICATION PACKAGE hello_snowflake_package
  TO ROLE release_mgr;
```

### Add a version to an application package

To add a version to the application package by using SQL, run the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md)
command:

```sqlexample
ALTER APPLICATION PACKAGE MyAppPackage
  ADD VERSION v1
  USING '@dev_stage/v1'
  LABEL = 'MyApp Version 1.0';
```

In this example, `v1` is an identifier for the version. This identifier is not visible to consumers when they install
the application. The consumer sees version information as defined in the LABEL clause.

> **Caution:**
>
> Only two versions of an application can exist at the same time. See [About app versions and patches](update-app-overview.md)
> for more information.

You can define the version name and label in the manifest file or specify them directly with the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command. If you define them in the manifest
file as well as with the SQL command, the values specified in the SQL command take precedence over the
values specified in the manifest file.

### Add a patch to an application package

In addition to creating versions for an app you can also create patches for a specific version. Like
versions, app patches also have their own application files.

To create a new patch for an application package, use the ADD PATCH FOR VERSION clause of the
[ALTER APPLICATION PACKAGE … VERSION](../../sql-reference/sql/alter-application-package-version.md) command, as shown in the following
example:

```sqlexample
ALTER APPLICATION PACKAGE MyAppPackage
 ADD PATCH FOR VERSION V1_0
 USING '@dev_stage/v1_0_p1';
```

In the example, no patch number is provided to the ADD PATCH FOR VERSION V1_0 clause. In this case
Snowflake automatically increments the patch number by 1.

To create a new patch for an application with a custom patch number, provide a patch number to the
ADD PATCH FOR VERSION clause of the [ALTER APPLICATION PACKAGE … VERSION](../../sql-reference/sql/alter-application-package-version.md)
command, as shown in the following example:

```sqlexample
ALTER APPLICATION PACKAGE MyAppPackage
 ADD PATCH 3
 FOR VERSION V1_0
 USING '@dev_stage/v1_p1';
```

### View the versions and patches in an application package

As a provider, you can view the versions and patches defined for an application by running the
[SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md) command on the application package.

The following command displays the versions and patches that have been defined for an application
package named `hello_snowflake_package`:

```sqlexample
SHOW VERSIONS IN APPLICATION PACKAGE hello_snowflake_package;
```

### Remove a version from an application package

To remove a version from an application package, you must verify that there are no
[release directives](update-app-release-directive.md) currently
pointing that the version you want to remove.

See [View the release directives for an application package](update-app-release-directive.md) for information on viewing the release directives.

To remove a version from an application package, use the DROP VERSION clause of the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command as shown in the following example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  DROP VERSION v1_0;
```

After running this command, the version is not dropped until all installed instances of the app are dropped.
To verify the status of the drop command, use the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md) as shown in
the following example:

```sqlexample
SHOW VERSIONS IN APPLICATION PACKAGE hello_snowflake_package;
```

The `dropped_on` column lists the timestamp when the drop was initiated.

> **Note:**
>
> The dropped version only appears in the output of this command while the status is `DROPPED`.
> When all installed instances of the app are dropped, the dropped version no longer appears.

When a version is dropped, consumers can no longer install new instances of that version of the app.

Depending on how the application is published to consumers, it can take different amounts of time
for the version to be dropped:

* If the application package has not been published to consumers, the version is dropped immediately.
* If the application package has been published as a public or private listing within a single region,
  the version is dropped immediately.
* If the application package is published as the data product of a listing shared within the same
  region as the application package, the version is dropped within a few hours.
* If the application package is published as the data product of a listing using
  [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md),
  it may take longer for the version to be dropped across all regions.

---
title: Create, train and use a Snowflake ML model in an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/snowflake-ml-na-no-model.md
section: Native Apps Framework
---

# Create, train and use a Snowflake ML model in an app

This topic provides an example of how to train a Snowflake ML model within a Snowflake Native App
using the `scikit-learn` Python package. The example in this topic can be used to
train models on data in the consumer or provider accounts.

## Create a versioned schema to hold the stored procedures

Within the setup script create a versioned schema that contains the stored
procedure as shown in the following example:

1. Create a versioned schema for the stored procedure

   ```sqlexample
   CREATE OR ALTER VERSIONED SCHEMA core;
   GRANT USAGE ON SCHEMA core TO APPLICATION ROLE app_public;
   ```

## Create a stored procedure to create and train a model

1. Create a stored procedure for the Python function you are using to train a
   model as shown in the following example:

> ```sqlexample
> CREATE OR REPLACE PROCEDURE core.py_log_model(db STRING, schema STRING, mname STRING, mvname STRING)
> RETURNS STRING
> LANGUAGE python
> RUNTIME_VERSION = 3.11
> HANDLER = 'log_model'
> PACKAGES = ('snowflake-snowpark-python','scikit-learn', 'snowflake-ml-python >=1.6.2', 'pandas', 'numpy')
> AS '
>   -- <body of the stored procedure>
> ';
> ```
>
> This example creates a stored procedure named `py_log_model` and declares the Python packages required to train a model using `scikit-learn`:
>
> * snowflake-snowpark-python
> * scikit-learn
> * snowflake-ml-python
> * pandas
> * numpy
> * xgboost
>
> After creating a stored procedure, add the following code to the body
> of the stored procedure:

1. Add Python code to the body of the stored procedure

> ```python
> import _snowflake
> from snowflake.ml.registry import Registry
> import pandas as pd
> import numpy as np
> from sklearn import datasets
> from snowflake.ml.modeling.xgboost import XGBClassifier
>
> def log_model(sp_session, mname, mvname):
>     reg = Registry(session=sp_session, schema_name=''stateful_schema'')
>
>     iris = datasets.load_iris()
>     df = pd.DataFrame(data=np.c_[iris["data"], iris["target"]], columns=iris["feature_names"] + ["target"])
>     df.columns = [s.replace(" (CM)", "").replace(" ", "") for s in df.columns.str.upper()]
>     input_cols = ["SEPALLENGTH", "SEPALWIDTH", "PETALLENGTH", "PETALWIDTH"]
>     label_cols = "TARGET"
>     output_cols = "PREDICTED_TARGET"
>
>     clf_xgb = XGBClassifier(
>         input_cols=input_cols, output_cols=output_cols, label_cols=label_cols, drop_input_cols=True
>     )
>     clf_xgb.fit(df)
>     model_ref = reg.log_model(
>         clf_xgb,
>         model_name=f"{mname}",
>         version_name=f"{mvname}",
>         options={"enable_explainability": False},
>     )
>     return "success"
> ```
>
> The `log_model` function performs the following:
>
> * Uses `pandas` and `numpy` to create a DataFrame to serve a the
>   training data for the model.
> * Creates an instance of the XGBoost to serve as the training algorithm for the data.
> * Calls the `fit()` function of XGBoost to create a model and train it on the dataset.
> * Calls the `log_model()` function of Snowflake Model Registry to add the model to the
>   model registry.
>
> > **Note:**
> >
> > Models created by an app must be stored in a model registry. Apps cannot access models that are
> > stored on a stage.

1. Optional: To allow consumers to run the stored procedure to train the model, grant the USAGE privilege
   on the stored procedure:

   ```sqlexample
   GRANT USAGE ON PROCEDURE core.py_log_model(STRING, STRING) TO APPLICATION ROLE app_public;
   ```

## Create a stored procedure to run a model

1. Create a stored procedure for the Python function you use to call the model.

```sqlexample
  CREATE OR REPLACE PROCEDURE core.py_call_predict(mname STRING, mvname STRING)
  RETURNS TABLE()
  LANGUAGE python
  RUNTIME_VERSION = 3.11
  HANDLER = 'run_model'
  PACKAGES = ('snowflake-snowpark-python','scikit-learn', 'snowflake-ml-python>=1.6.2', 'pandas', 'xgboost')
  AS
'
-- <body of the stored procedure>
';
```

1. Add the Python code you use to call the model

> ```python
> import _snowflake
> from snowflake.ml.registry import Registry
> import pandas as pd
> from sklearn import datasets
>
> def run_model(sp_session, mname, mvname):
>   iris = datasets.load_iris()
>   df = pd.DataFrame(data=iris["data"], columns=iris["feature_names"])
>   df.columns = [s.replace(" (CM)", "").replace(" ", "") for s in df.columns.str.upper()]
>
>   reg = Registry(session=sp_session, schema_name="stateful_schema")
>   mv = reg.get_model(mname).version(mvname)
>
>   pred = mv.run(df.head(10), function_name="predict")
>   return sp_session.create_dataframe(pred)
> ```
>
> The `run_model` function does the following:
>
> * Runs the `load_iris()` function to
>   [load the iris machine learning dataset](https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_iris.html).
> * Uses `pandas` to create a DataFrame based on the iris data set.
> * Runs the `get_model()` function to get the model registry.
> * Runs the predict function on the model.
> * Returns the result.

1. Optional: To allow consumers to run the stored procedure to train the model, grant the USAGE privilege
   on the stored procedure:

   ```sqlexample
   GRANT USAGE ON PROCEDURE core.py_call_predict(STRING, STRING) TO APPLICATION ROLE app_public;
   ```

## Run the stored procedures

If the app grants the USAGE privilege on these stored procedures to an application role, consumers can
call the stored procedures to train and run the models as shown in the following examples:

```sqlexample
CALL my_app.core.py_log_model('md1', 'V1');
```

This command calls the `py_log_model` stored procedure to train the model.

```sqlexample
CALL my_app.core.py_call_predict('md1', 'V1');
```

This command calls the `py_call_predict` stored procedure to call the predict function on the model.

---
title: Determine information about event sharing in the consumer account
source: https://docs.snowflake.com/en/developer-guide/native-apps/event-develop.md
section: Native Apps Framework
---

# Determine information about event sharing in the consumer account

This topic describes how a provider can set up an app to determine if a consumer has enabled
event sharing in their account.

## Verify event definitions by using system functions

To determine if event sharing is enabled in a consumer account, providers can call the following
system functions within the setup script:

* SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING()

  Returns TRUE if the AUTHORIZE_TELEMETRY_EVENT_SHARING property is set, which indicates that
  event sharing is allowed in the consumer account. Otherwise, this system function returns FALSE.
* SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED()

  Returns TRUE if all required event definitions have been enabled in the consumer account.
  Otherwise, this system function returns FALSE.

The following example shows a stored procedure that performs a calculation only if both
IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING and
IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED are set to TRUE.

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA app_schema;
CREATE OR REPLACE PROCEDURE app_schema.sum(num1 float, num2 float)
RETURNS STRING
LANGUAGE SQL
EXECUTE AS OWNER
AS $$
    BEGIN
      IF (SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING() and SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED()) THEN
        RETURN num1 + num2;
      ELSE
        -- notify consumers that they need to enable event sharing
        RETURN 'Sorry you can\'t access the API, please enable event sharing.';
      END IF;
    END;
$$;
```

## Verify event definitions by using the Permissions SDK

The Python Permission SDK provides the following functions to determine if even sharing is enabled
in a consumer account:

* `is_application_authorized_for_telemetry_event_sharing()`

  Returns `true` if the AUTHORIZE_TELEMETRY_EVENT_SHARING property is `true`. Returns `false`, otherwise.

  See [is_application_authorized_for_telemetry_event_sharing()](requesting-permission-sdk-ref.md) for more information.
* `is_application_all_mandatory_telemetry_event_definitions_enabled()`

  Returns `true` if all mandatory event definitions have been enabled in the consumer account.

  See [is_application_all_mandatory _telemetry_event_definitions_enabled()](requesting-permission-sdk-ref.md) for more information.

The following example shows how to use the `is_application_authorized_for_telemetry_event_sharing()`
and `is_application_all_mandatory_telemetry_event_definitions_enabled()` functions of the
Python Permission SDK to verify that event sharing is enabled in the consumer account and that mandatory
events have been enabled.

```python
import streamlit as st
import snowflake.permissions as permissions

def critical_feature_that_requires_event_sharing():
  st.write("critical_feature_that_requires_event_sharing")

def main():
  if permissions.is_application_authorized_for_telemetry_event_sharing() and permissions.is_application_all_mandatory_telemetry_event_definitions_enabled():
     critical_feature_that_requires_event_sharing()
  else:
     permissions.request_event_sharing()

if __name__ == "__main__":
  main()
```

---
title: Develop a new version of an app (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app-develop.md
section: Native Apps Framework
---

# Develop a new version of an app (Legacy)

This topic provides information and best practices when updating an app to a new version or patch.

## Best practices when developing a new version or patch

Providers should consider the following best practices when developing a new version or patch for an app.

### Fully test an app before initiating the automated security scan

The following actions can initiate the automated security scan:

* Setting the DISTRIBUTION property of the application package to EXTERNAL if a version of
  the app exists
* Adding a new version or patch to an application package that has the DISTRIBUTION property
  set to EXTERNAL

Snowflake recommends that you fully test a new version or patch of your app locally before
initiating the security scan to avoid delays and multiple iterations of the scan in case of failure.

### Ensure compatibility between versions

Providers must ensure that a new version is compatible with the previous version of an app. For example,
if an app has versions v1 and v2, v2 must be compatible with v1. When version v3 is added, it must be compatible with
version v2. However, because only two versions of an app can exist at one time, version v3 does not have to be
compatible with version v1.

Code running in the previous version must handle state changes introduced in the new version. To handle
stateless objects, providers should use versioned schemas to ensure that upgrades are handled correctly. See
[Use versioned schema to manage app objects across versions](versioned-schema.md) for more information.

### Minimize state changes in patches

Providers must ensure that new patches do not introduce state changes that are different from previous
patches of the same version. Providers must minimize state changes such as adding or altering tables or
columns when developing a patch. Tables and columns must remain compatible across all versions and patches.
Patches should focus on bug fixes or minor feature additions without involving state modifications.

State changes should only be made when updating the version of an app.

### Use safe practices when creating objects from the setup script

When creating objects from the setup script, consider the following best practices:

* Use CREATE IF NOT EXISTS:

  You should always use CREATE OR REPLACE, CREATE IF NOT EXISTS or CREATE OR ALTER, whichever is applicable,
  when creating database objects such as tables, views, functions, or procedures. This prevents errors when
  trying to create objects that already exist during upgrade.

  Snowflake recommends using CREATE OR REPLACE only for stateless objects, such as functions or procedures,
  but not for stateful objects, such as tables.
* Ensure that the setup script of each app is self-contained

  Each version of the app must be complete and independent. For example, if a table was created in version v2.0
  using the CREATE TABLE IF NOT EXISTS a(int c) and version v3.0 includes ALTER TABLE A(…), ensure both the
  CREATE TABLE and ALTER TABLE statements are present in version v3.0. This ensures users installing the app from a
  later version have all necessary schema and objects.
* Use only idempotent changes in the setup script

  > Structure CREATE and ALTER statements to be idempotent, so that they can run multiple times without errors or
  > unintended side effects. If the setup script fails during installation, Snowflake reruns the setup script from
  > the beginning. If a versioned schema has already been created for this version it is not recreated or deleted.
  > For this reason, providers should use the CREATE IF NOT EXISTS version of the CREATE commands.
  >
  > For example:
  >
  > + Use ALTER TABLE ADD COLUMN IF NOT EXISTS to ensure columns are added only if they do not already exist.
  > + When inserting rows, implement safeguards to prevent duplicate rows if unintended, as upgrades may be retried
  >   multiple times.

### Use caution when creating or dropping application roles

Use caution when creating or dropping application roles in a version or patch. Application roles are not versioned.
Dropping an application role or revoking a grant on an object from one version to another can cause the app to stop working
or prevent consumers from accessing the app.

Avoid using CREATE OR REPLACE APPLICATION ROLE. Instead, use CREATE APPLICATION ROLE IF NOT EXISTS. The OR REPLACE clause will
drop and recreate roles, causing permission issues as account-level roles granted to the application role in previous versions
would need to be re-granted.

## Best practices when developing a new patch or version of an app with containers

Providers should consider the following best practices when developing a new version or patch for an app with containers:

* Use caution when setting the timeout value for the [SYSTEM$WAIT_FOR_SERVICES](../../sql-reference/functions/system_wait_for_services.md) system function.

  Setting this value to value that is too long may cause other part of the app to fail if they are expecting a service to be
  available. See Pause setup script execution for more information.
* Snowflake recommends creating the version initializer stored procedure within a versioned schema. If the version initializer
  is not created within a versioned schema, the version initializer may not exist from one version to the next.
* If an app specifies a version initializer, Snowflake recommends that the app attempts to start or upgrade services within
  the version initializer instead of the setup script. This ensures that the correct version of the service is running if an
  upgrade attempt fails.
* The version initializer does not need to be granted to an application role.

See Update an app with containers for additional information on updating an app with containers.

## Update an app with containers

Updating an app with containers to a new version adds additional considerations during upgrade.
The process of upgrading an app with containers has two main stages:

* Upgrade the services in the containers managed by the app.

  Like other Snowpark Container Services, container apps use the
  [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command to modify a service
  based on a service specification file for the new version. This command
  runs asynchronously.
* Upgrade other objects in the app.

  After the services are successfully upgraded, other object within the app are
  upgraded. This is similar to the normal Snowflake Native App upgrade process. See
  [About app upgrades](update-app-overview.md) for more information.

The Snowflake Native App Framework allows users to continue using an app even during major version upgrades, ensuring no downtime for
a normal app. However, for apps with containers, as both [CREATE SERVICE](../../sql-reference/sql/create-service.md) and
[ALTER SERVICE](../../sql-reference/sql/alter-service.md) are asynchronous. This means that even after the upgrade finishes,
the new version of the service may not be immediately available.

The potential issue when upgrading an app with containers is that the
[ALTER SERVICE](../../sql-reference/sql/alter-service.md) command runs asynchronously. If this command adds
the [ALTER SERVICE](../../sql-reference/sql/alter-service.md) directly to the setup script, the setup script continues to run while the
service upgrade is in progress.

Providers should write their setup script assuming that service upgrades may not yet be complete or
they should use [SYSTEM$WAIT_FOR_SERVICES](../../sql-reference/functions/system_wait_for_services.md) and
Use a version initializer to manage service upgrades to guarantee the correct version of the service is
ready for use.

To handle service upgrades correctly, the Snowflake Native App Framework provides features that allow the app to:

* Pause the execution of the setup script until the services upgrade successfully or
  fail. Providers should ensure that the setup script can handle possible situations. See
  Pause setup script execution for more information.
* Use the version initializer function to rollback service upgrades to the previous
  version if the upgrade fails. See Considerations when upgrading services
  for more information.

### Pause setup script execution

To minimize downtime and ensure services are ready, use the [SYSTEM$WAIT_FOR_SERVICES](../../sql-reference/functions/system_wait_for_services.md)
system function in the setup script after creating or altering a service:

```sqlsyntax
SELECT SYSTEM$WAIT_FOR_SERVICES(600, 'services.web_ui', 'services.worker', 'services.aggregation');
```

This command causes the setup script to pause until one of the following occurs:

* All named services passed to the system function have READY status.
* Any of the named services has the FAILED status.
* 600 seconds has passed.

This system function ensures the app installation or upgrade waits until the service is available or until
a failure occurs, ensuring that the service state is in sync with the version upgrade.

### Considerations when upgrading services

The Snowflake Native App Framework provides the version initializer callback function that allows providers to synchronize upgrading
services with the rest of the upgrade procedure.

During the upgrade of a basic app, the setup script upgrades to the new version
of the app by modifying objects within a versioned schema. If an error occurs during
upgrade, the objects within the versioned schema revert back to the previous version of the
app.

In the case of an app with containers, services that are created or modified by running the
[CREATE SERVICE](../../sql-reference/sql/create-service.md) or [ALTER SERVICE](../../sql-reference/sql/alter-service.md) commands in the setup
script use a service specification file for the new version.

Because services are not created within versioned schemas, a service is upgraded as soon as
the [CREATE SERVICE](../../sql-reference/sql/create-service.md) or [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command run successfully.
If there is a failure later in the setup script, for example, the objects in versioned schemas are reverted back to the
previous version, but the modified services are the services of the new version.

### Use a version initializer to manage service upgrades

The Snowflake Native App Framework provides a version initializer that is used to start or upgrade services or other
related processes, for example tasks. The version initializer is a callback stored procedure
that is specified in the manifest file.

The version initializer is invoked in the following contexts:

* During installation, the version initializer is called as soon as the setup script of
  the app finishes without errors.
* During upgrade, there are two possible scenarios where the version initializer is called:

  + If the setup script of the new version succeeds, then the
    version initializer of the new version of the app is called.
  + If the setup script or the version initializer of the new version fails, then
    the version initializer of the previous version of the app is called. This allows the version
    initializer of the previous version to use the [ALTER SERVICE](../../sql-reference/sql/alter-service.md)
    to revert the services to the previous version.

### Add the version initializer to an app

To specify the stored procedure used as the version initializer, add the following to
the manifest file:

```yaml
lifecycle_callbacks:
  version_initializer: callback.version_init
```

In this example, the `version_initializer` property is set to a stored procedure named
`version_init` within a schema named `callback`.

Within the setup script, a provider can define this procedure within a versioned schema
as shown in the following example:

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA callback;

CREATE OR REPLACE PROCEDURE callback.version_init()
  ...
  -- body of the version_init() procedure
  ...
```

---
title: Disable resource
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/ingestion-management/disable_resource.md
section: Native Apps Framework
---

# Disable resource

Disabling a resource is used to stop ingesting data for a given resource.
`PUBLIC.DISABLE_RESOURCE` procedure is the entry point from the UI or worksheet to disable a resource.

Calling this procedure requires the user to have the been assigned the `ADMIN` application role.

The disable resource process consists of several phases. Several of which are customizable but include reasonable defaults.
Phases are:

1. Initial validation
2. Custom logic before a resource is disabled
3. Finishing active ingestion processes and marking resource ingestion definition as disabled
4. Custom logic after a resource is created

## Initial validation

Initial validation is performed at the very beginning of the disable resource process, and checks:

* whether a resource with given id exists
* whether a resource with given id is already disabled

When a resource was previously disabled, nothing is done and success response is returned.

## Custom logic before a resource is disabled

This step can be used to implement custom logic before a resource is disabled.

By default, it invokes `PUBLIC.PRE_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `DisableResourceHandlerBuilder` to provide custom implementation of the `PreDisableResourceCallback` interface.

If custom logic returns error, the next steps will not be executed and given error response will be returned from `DISABLE_RESOURCE` procedure.

## Finishing active ingestion processes and marking resource ingestion definition as disabled

Within this step all ingestion processes with state `SCHEDULED` or `IN_PROGRESS` are completed so and the next iteration of ingestion will not be executed for a given resource.
Then the resource ingestion definition’s `enabled` flag is changed to `false`.

> **Note:**
>
> The implementation of disable resource process does not stop currently executing ingestion. It only prevents executing
> next iteration of ingestion. If stopping an ongoing ingestion is required, you must implement Custom logic after a resource is disabled.

## Custom logic after a resource is disabled

It can be used to implement custom logic after a resource is disabled. For example, it can be used to stop ongoing ingestion for a given resource.

By default, it invokes `PUBLIC.POST_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `DisableResourceHandlerBuilder` to provide custom implementation of the `PostDisableResourceCallback` interface.

If custom logic returns an error, the following steps will not be executed and the given error response will be returned from `DISABLE_RESOURCE` procedure.

## Response

### Successful response

On procedure success, a response resembling below is returned:

> ```json
> {
>   "response_code": "OK"
> }
> ```

### Error response

On procedure error, a response resembling below is returned:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_INPUT` - Resource with given resource ingestion definition id does not exist.
* `DISABLE_RESOURCE_ERROR` - Something unexpected happened when updating the resource ingestion definition or when finishing ingestion processes. All changes are rolled back.

---
title: Disable resource reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/disable_resource_reference.md
section: Native Apps Framework
---

# Disable resource reference

## Database objects and procedures

The following database objects are created when the file `ingestion/resource_management.sql` is executed.

### PUBLIC.DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `DisableResourceHandler.disableResource`.

### PUBLIC.PRE_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Procedure used for adding connector specific logic which is invoked before a resource is disabled.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPreDisableResourceCallback`. Can be overwritten both in SQL and Java.

### PUBLIC.POST_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Procedure used for adding connector specific logic which is invoked after a resource is disabled.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPostDisableResourceCallback`. Can be overwritten both in SQL and Java.

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.ingestion.disable` package and some common components are tightly connected with the above procedures:

* `DisableResourceHandler`
* `DisableResourceHandlerBuilder`
* `PreDisableResourceCallback`
* `PostDisableResourceCallback`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide a custom implementation of `DisableResourceHandler`, replace the `PUBLIC.DISABLE_RESOURCE` procedure.

For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomDisableResourceHandler.disableResource';

GRANT USAGE ON PROCEDURE PUBLIC.DISABLE_RESOURCE(VARCHAR) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal procedures `PRE_DISABLE_RESOURCE` and `POST_DISABLE_RESOURCE` can be also customized through the SQL. These procedures can also invoke other Java handlers:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.PRE_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
      -- SOME CUSTOM LOGIC BEGIN
      SELECT sysdate();
      -- SOME CUSTOM LOGIC END

      RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

CREATE OR REPLACE PROCEDURE PUBLIC.PRE_DISABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomHandler.disableResourceValidate';
```

### Builder approach

`DisableResourceHandler` can be customized using `DisableResourceHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `PreDisableResourceCallback`
* `PostDisableResourceCallback`
* `ConnectorErrorHelper`

When a function is not provided the default implementation provided by the SDK is used.

```java
class CustomPreDisableResourceCallback implements PreDisableResourceCallback {
  @Override
  public ConnectorResponse execute(String resourceIngestionDefinitionId) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.DISABLE_RESOURCE procedure using SQL
  public static Variant disableResource(Session session, String resourceIngestionDefinitionId) {
    //Using builder
    var handler = DisableResourceHandlerBuilder.builder(session)
      .withPreDisableResourceCallback(new CustomPreDisableResourceCallback())
      .build();
    return handler.disableResource(resourceIngestionDefinitionId).toVariant();
  }
}
```

---
title: Enable resource
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/ingestion-management/enable_resource.md
section: Native Apps Framework
---

# Enable resource

Enabling a resource is used to start ingesting data for a given resource.
`PUBLIC.ENABLE_RESOURCE` procedure is the entry point from the UI or worksheet to enable a resource.
It can be used after a resource was disabled or when a resource was created as disabled and now it is needed to enable it.

Calling this procedure requires the user has been assigned the `ADMIN` application role.
Phases are:

1. Initial validation
2. Custom validation
3. Custom logic before a resource is enabled
4. Marking a resource ingestion definition as enabled and creating new ingestion processes
5. Custom logic after a resource is created

## Initial validation

Initial validation is performed at the very beginning of the enable resource process, and checks:

* whether a resource with given id exists
* whether a resource with given id is already enabled

When a resource is already enabled, nothing is done and success response is returned.

## Custom validation

Custom validation is executed after initial validation, and is designed to be support customized connector-specific logic.
For example, it can be used to verify that a given resource still exists in a source system.

By default, it invokes `PUBLIC.ENABLE_RESOURCE_VALIDATE(resource_ingestion_definition_id)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `EnableResourceHandlerBuilder` to provide custom implementation of the `EnableResourceValidator` interface.

If the custom validation returns error, the next steps will not be executed and given error response will be returned from `ENABLE_RESOURCE` procedure.

## Custom logic before a resource is enabled

Custom logic can be specified and executed before a resource is enabled.

By default, it invokes `PUBLIC.PRE_ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `EnableResourceHandlerBuilder` to provide custom implementation of the `PreEnableResourceCallback` interface.

If custom logic returns error, following steps will not be executed and an error response will be returned from `ENABLE_RESOURCE` procedure.

## Marking a resource ingestion definition as enabled and creating new ingestion processes

Within this step the resource ingestion definition’s `enabled` flag is changed to `true` and then
a new ingestion process is created for each ingestion configuration.
Ingestion processes are created with `SCHEDULED` status which means that the ingestion will start a while later.
When a new ingestion process is being created, the metadata column is inherited from the last finished process with given ingestion configuration id.

## Custom logic after a resource is enabled

Custom logic can be defined to be executed after a resource is enabled.

By default, it invokes `PUBLIC.POST_ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `EnableResourceHandlerBuilder` to provide custom implementation of the `PostEnableResourceCallback` interface.

If custom logic returns error, given error response will be returned from `ENABLE_RESOURCE` procedure but
the process marking a resource ingestion definition as enabled and creating new ingestion processes will not be rolled back.

## Response

### Successful response

On procedure success, a response resembling the one below is returned:

> ```json
> {
>   "response_code": "OK"
> }
> ```

### Error response

On procedure error, a response resembling the one below is returned:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_INPUT` - Resource with given resource ingestion definition id does not exist.
* `ENABLE_RESOURCE_ERROR` - Something unexpected happened when updating the resource ingestion definition or when creating ingestion processes. All changes are rolled back.

---
title: Enable resource reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/enable_resource_reference.md
section: Native Apps Framework
---

# Enable resource reference

## Database objects and procedures

The following database objects are created when the file `ingestion/resource_management.sql` is executed.

### PUBLIC.ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `EnableResourceHandler.enableResource`.

### PUBLIC.ENABLE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR)

Procedure used for connector specific validation of enable process. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultEnableResourceValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.PRE_ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Procedure used for adding connector specific logic which is invoked before a resource is enabled.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPreEnableResourceCallback`. Can be overwritten both in SQL and Java.

### PUBLIC.POST_ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)

Procedure used for adding connector specific logic which is invoked after a resource is enabled.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPostEnableResourceCallback`. Can be overwritten both in SQL and Java.

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.ingestion.enable` package and some common components are tightly connected with the above procedures:

* `EnableResourceHandler`
* `EnableResourceHandlerBuilder`
* `EnableResourceValidator`
* `PreEnableResourceCallback`
* `PostEnableResourceCallback`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of `EnableResourceHandler`, the `PUBLIC.ENABLE_RESOURCE` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.ENABLE_RESOURCE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomEnableResourceHandler.enableResource';

  GRANT USAGE ON PROCEDURE PUBLIC.ENABLE_RESOURCE(VARCHAR) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal procedures `ENABLE_RESOURCE_VALIDATE`, `PRE_ENABLE_RESOURCE` and `POST_ENABLE_RESOURCE` can be also customized through the SQL. These procedures can also invoke other Java handlers:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.ENABLE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

CREATE OR REPLACE PROCEDURE PUBLIC.ENABLE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomHandler.enableResourceValidate';
```

### Builder approach

`EnableResourceHandler` can be customized using `EnableResourceHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `EnableResourceValidator`
* `PreEnableResourceCallback`
* `PostEnableResourceCallback`
* `ConnectorErrorHelper`

In case a function is not provided the default implementation provided by the SDK will be used.

```java
class CustomPreEnableResourceCallback implements PreEnableResourceCallback {
  @Override
  public ConnectorResponse execute(String resourceIngestionDefinitionId) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.ENABLE_RESOURCE procedure using SQL
  public static Variant enableResource(Session session, String resourceIngestionDefinitionId) {
    //Using builder
    var handler = EnableResourceHandlerBuilder.builder(session)
      .withPreEnableResourceCallback(new CustomPreEnableResourceCallback())
      .build();
    return handler.enableResource(resourceIngestionDefinitionId).toVariant();
  }
}
```

---
title: Example - Configure external access for services in an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-eai-example.md
section: Native Apps Framework
---

# Example - Configure external access for services in an app with containers

This topic describes how to grant access to an endpoint that is external to Snowflake in an
app with containers. This example uses external access integrations and secrets to allow access to
the endpoint.

To grant access to an external endpoint in an app with containers, providers must define reference to the
following objects:

* [EXTERNAL ACCESS INTEGRATION](../external-network-access/creating-using-external-network-access.md)

  Defines a list of network rules that specify the domain names of external endpoints. An external access
  integration can also specify a list of secrets that store the credentials used to access these endpoints.
  Secrets are optional and can be set to NONE or ALL.

  In the context of an app with containers, external access integrations require the USAGE privilege.

  > **Note:**
  >
  > The `multi_valued` property cannot be set to TRUE. Only single-valued references are supported.
* [SECRET](../external-network-access/creating-using-external-network-access.md)

  > Contains the credentials required to use the external access integration to connect to an
  > external endpoint.
  >
  > In the context of an app with containers, secrets support the USAGE and READ privileges. At least one
  > of these privileges must be specified. The READ privilege must be specified if the secret is used with
  > a service or is attached to a stored procedure or user-defined function.

## Add an external access integration reference to the manifest file

The following example shows how a provider defines an external access integration in the manifest file:

```yaml
references:
  ...
  - my_external_access:
      label: "Default External Access Integration"
      description: "This EAI is required to access xyz.com"
      privileges:
        - USAGE
      object_type: EXTERNAL ACCESS INTEGRATION
      required_at_setup: true
      register_callback: config.REGISTER_EAI_CALLBACK
      configuration_callback: config.get_config_for_ref
```

This example specifies the following properties, among others, under `references`:

* `my_external_access`: Specifies the name of the external reference.

  + `privileges`: Lists the privileges required by the external access
    integration. In this example, the USAGE privilege is required.
  + `object_type: EXTERNAL ACCESS INTEGRATION`: Indicates a reference to an external access integration.
  + `required_at_setup`: Indicates that the consumer must
    authorize access on the object before the app can create the object when set to `true`.
  + `register_callback`: Specifies the callback stored procedure used to register the reference
    with the app.
  + `configuration_callback`: Specifies the configuration callback function for the secret. See
    Add the configuration_callback function to the setup script for more information.

## Add a secret reference to the manifest file.

The following example shows how a provider defines a secret in the manifest file:

```yaml
references:
 ...
 - consumer_secret:
     label: "Consumer secret"
     description: "Needed to authenticate with an external endpoint"
     privileges:
       - READ
     object_type: SECRET
     register_callback: config.register_my_secret
     configuration_callback: config.get_config_for_ref
```

This example specifies the following properties, among others, under `references`:

* `consumer_secret`: Specifies the name of the reference.

  + `privileges`: Lists the privileges required by the secret. In this example, the READ
    privilege is specified.
  + `object_type: SECRET`: Indicates that the reference is a secret.
  + `register_callback`: Specifies the callback stored procedure used to register the reference
    with the app.
  + `configuration_callback`: Specifies the configuration callback function for the secret. See
    Add the configuration_callback function to the setup script for more information.

## Add the configuration_callback function to the setup script

After adding references for the secret and external access integration, you must add the
`configuration_callback` function to the setup script. To create an external access integration
or secret, the app must be able to determine values for the host port, secret type, the authorization and
token endpoint for OAuth, and so on. The `configuration_callback` function provides this information
from the consumer account to the app.

```sqlexample
CREATE OR REPLACE PROCEDURE CONFIG.GET_CONFIG_FOR_REFERENCE(ref_name STRING)
RETURNS STRING
LANGUAGE SQL
AS
$$
BEGIN
 CASE (UPPER(ref_name))
   WHEN 'my_external_access' THEN
     RETURN '{
       "type": "CONFIGURATION",
       "payload":{
         "host_ports":["google.com"],
         "allowed_secrets" : "LIST",
         "secret_references":["CONSUMER_SECRET"]}}';
   WHEN 'consumer_secret' THEN
     RETURN '{
       "type": "CONFIGURATION",
       "payload":{
         "type" : "OAUTH2",
         "security_integration": {
           "oauth_scopes": ["https://www.googleapis.com/auth/analytics.readonly"],
           "oauth_token_endpoint": "https://oauth2.googleapis.com/token",
           "oauth_authorization_endpoint":
               "https://accounts.google.com/o/oauth2/auth"}}}';
  END CASE;
  RETURN '';
END;
$$;
```

Snowsight runs this callback procedure to populate the configuration dialog that prompts the user to configure the
required objects.

> **Note:**
>
> The `configuration_callback` function is only supported for external access integration and secret objects.

The procedure needs to be granted to an app role for execution as shown in the following example:

```sqlexample
GRANT USAGE ON PROCEDURE CONFIG.GET_CONFIG_FOR_REFERENCE(STRING)
  TO APPLICATION ROLE app_admin;
```

## Best practices when using external access integrations in an app with containers

Snowflake recommends the following best practices when using external access integrations in an app with
containers:

* Any reference to external access integrations that are specified in a [CREATE SERVICE](../../sql-reference/sql/create-service.md) or
  [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command must be bound before the commands are run in the setup script. These
  commands fail when the reference is not bound.
* Any references to secrets that are specified in the service specification must also be bound before the
  [CREATE SERVICE](../../sql-reference/sql/create-service.md) or [ALTER SERVICE](../../sql-reference/sql/alter-service.md) commands are run in the setup script.
  These commands fail when the reference is not bound.
* If returning a payload of type ERROR in `configuration_callback` function, providers should return an informative error
  message that helps the consumer understand the cause of the error and how to resolve it. For example:

  + If there is an error in the app
  + If the reference is not required yet
  + If the reference is not ready to be allowed.
* If the `configuration_callback` function contain references with the `required_at_setup` property set to
  TRUE, the `configuration_callback` function must succeed at setup time. In this context, the `configuration_callback` function can’t depend on
  information from the consumer.
* When using a reference to an external access integration with a service, consider creating the service using
  ALLOWED_AUTHENTICATION_SECRETS = ALL if the app requires secrets provided by the consumer. This simplifies handling a
  secret within an external access integration.
* If an app only needs to reach specific endpoints and does not require any secrets, use ALLOWED_AUTHENTICATION_SECRETS = NONE.
  NONE is the default value. See [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md) for more information.
* If the app needs to update a reference, first, unbind the reference, then prompt the consumer to create and bind a new
  object to the reference. A consumer can choose to edit and bind an existing object.
  See [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md).

---
title: Example - External access using OAuth and references
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-example-oauth.md
section: Native Apps Framework
---

# Example - External access using OAuth and references

This topic provides an example that describes how to use references to allow providers to grant access to
an endpoint that is external to Snowflake. This example uses a OAuth2 secret and an external access
integration to allow access.

> **Important:**
>
> This example shows the manual method using references where consumers must create integrations
> themselves. For new apps, Snowflake recommends using [automated granting of privileges](requesting-auto-privs.md)
> with [app specifications](requesting-app-specs.md) instead. See [Request external access integrations (EAIs) with app specifications](requesting-app-specs-eai.md)
> for external access integrations and [Request security integrations with app specifications](requesting-app-specs-sec-integ.md) for security integrations.

## Add references to the manifest file

To enable access to an external endpoint using OAuth, a provider can add the following entries in the manifest file:

* EXTERNAL ACCESS INTEGRATION reference with the USAGE privilege
* SECRET reference with the READ privilege

The following example manifest file shows how to define these references:

```yaml
manifest_version: 1
configuration:
  log_level: warn
  trace_level: off
...
references:
  - consumer_secret:
      label: "Consumer's Secret"
      description: "Needed to authenticate with xyz.com"
      privileges:
        - READ
      object_type: SECRET
      register_callback: config.register_my_secret
      configuration_callback: config.get_config_for_ref
  - consumer_external_access:
      label: "Default External Access Integration"
      description: "This is required to access xyz.com"
      privileges:
        - USAGE
      object_type: EXTERNAL ACCESS INTEGRATION
      register_callback: config.register_reference
      configuration_callback: config.get_config_for_ref
      required_at_setup: true
```

> **Note:**
>
> These references cannot have the `multi_valued` property set to true.

References to secrets and external access objects also require a `configuration_callback` function
in the setup script. See Add the configuration_callback function to the setup script for more information.

## Add the configuration_callback function to the setup script

After adding references for the secret and external access integration, you must add the
`configuration_callback` function to the setup script. To create an external access integration or
secret, the application must be able to determine values for host port, secret type, the authorization
and token endpoint for OAuth, etc. The `configuration_callback` provides this information from
the consumer account to the application.

Snowsight runs this callback procedure to populate the configuration dialog that prompts the
user to configure the objects. The procedure needs to be granted to an app role for execution.

> **Note:**
>
> The configuration_callback is only supported for external access integration and secret
> objects.

The callback function has the following requirements:

* The callback function must accept an argument containing a reference name. This allows the same
  callback function to handle multiple references.
* The callback function must return a well-formed JSON object. The JSON object contains the following
  properties:

  + `type`

    Indicates the type of message. Valid values are:

    > - `CONFIGURATION`: Returns a payload with the configuration values for the object based on object type.
    > - `ERROR`: Returns an error with the associated message that is displayed in Snowsight.
  + `payload`

    Contains the content of the response based on the value of the `type` property and the object type being configured.

The signature for the configuration callback is:

```sqlexample
CREATE OR REPLACE PROCEDURE configuration_callback_name(ref_name string)
RETURNS STRING
language <language>
as
$$
  ...
$$
```

Within the setup script, you must grant the USAGE privilege to the application roles that are used
for configuring the app so that they have permission to call the stored procedure. The following
example shows how to grant the USAGE privilege on a stored procedure:

```sqlexample
GRANT USAGE ON PROCEDURE configuration_callback_name(string)
  TO APPLICATION ROLE app_role;
```

The callback function returns a JSON object. See [JSON format for the configuration callback response](requesting-refs.md) for
more information.

The following example shows a typical callback function for handling external access and secret references.

This function does the following:

* For a reference to an external access integration, the procedure returns a JSON object containing the
  required configuration information. See [JSON format for external access integration](requesting-refs.md) for more
  information.
* For a reference to a secret, the procedure returns a JSON object containing a secret configuration of
  type OAuth2. See [JSON format for secret references](requesting-refs.md) for more information.

```sqlexample
  CREATE OR REPLACE PROCEDURE config.get_config_for_ref(ref_name STRING)
    RETURNS STRING
    LANGUAGE SQL
    AS
    $$
    BEGIN
      CASE (ref_name)
        WHEN 'CONSUMER_EXTERNAL_ACCESS' THEN
          RETURN '{
            "type": "CONFIGURATION",
            "payload":{
              "host_ports":["google.com"],
              "allowed_secrets" : "LIST",
              "secret_references":["CONSUMER_SECRET"]}}';
        WHEN 'CONSUMER_SECRET' THEN
          RETURN '{
            "type": "CONFIGURATION",
            "payload":{
              "type" : "OAUTH2",
              "security_integration": {
                "oauth_scopes": ["https://www.googleapis.com/auth/analytics.readonly"],
                "oauth_token_endpoint": "https://oauth2.googleapis.com/token",
                "oauth_authorization_endpoint":
                    "https://accounts.google.com/o/oauth2/auth"}}}';
  END CASE;
  RETURN '';
  END;
  $$;

GRANT USAGE ON PROCEDURE config.get_config_for_ref(string)
  TO APPLICATION ROLE app_admin;
```

## Using the Python Permission SDK for secrets and external access integrations

Python Permission SDK supports secret and external access integration objects.
However, the behavior is slightly different for these objects.

When a provider calls `permission.request_reference()`
and passes the name of a reference with an `object_type` value of `SECRET` or
`EXTERNAL ACCESS INTEGRATION`, Snowsight automatically performs
the following:

* Calls the `configuration_callback` function in the setup script.
* Validates the values returned by the `configuration_callback` function.
* Displays the configuration dialog to the consumer.

> **Note:**
>
> If a provider configures an external access integration with the
> `payload.allow_secrets` property set to `LIST`, it is not necessary to
> make a separate call to request a reference for the secret. The secret configuration
> is implicitly included as part of the external access integration configuration.

---
title: External integration setup reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/setup_external_integration.md
section: Native Apps Framework
---

# External integration setup reference

The following database objects are created through the file `setup_external_integration.sql`.

## PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_NAMES()

The procedure alters other procedures or functions, whose signatures are passed as procedure argument in an array, with
an `EXTERNAL ACCESS INTEGRATION` and a `SECRET` objects names that are stored in the connection configuration under the
following keys:

> * `external_access_configuration` for an `EXTERNAL ACCESS INTEGRATION` object identifier.
> * `secret` for a `SECRET` object identifier.

Secret is attached to altered procedure/function with the `credentials` key. By default, the procedure is not available for any
of application user roles.

### Procedure signature

> ```sqlexample
> CREATE OR REPLACE PROCEDURE PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_NAMES(methods ARRAY)
>     RETURNS VARIANT
>     LANGUAGE SQL
>     [...]
> ```

Where:

* `methods ARRAY` stand for an array of procedure/function signatures as varchar, e.g. `ARRAY_CONSTRUCT('PUBLIC.PROC_1(VARIANT)', 'PUBLIC.PROC_2()')`.

### Returned values

The procedure always returns a Variant with a standard connector response structure.

In case of a successful procedure execution:

> ```json
> {
>   "response_code": "OK",
>   "message": "Successfully set up <number> method(s)."
> }
> ```
>
> > **Note:**
> >
> > The procedure execution finishes successfully even when procedure/function signatures passed as arguments do not
> > represent existing objects or an application does not have an access to these objects. The altering process of this
> > kind of procedure/function is skipped and the general process continues.

In case of a failure:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>",
>   "SQLCODE": "<code of a thrown exception>",
>   "SQLERRM": "<error message of a thrown exception>",
>   "SQLCODE": "<sql code of a thrown exception>"
> }
> ```
>
> > **Attention:**
> >
> > The procedure does not throw any error if an error occurs during the execution. Each error is wrapped into the
> > connector response, and mapped to appropriate `response_code` which allows validating the procedure result and
> > using it safely in the `setup.sql` during the application installation (otherwise any unhandled error could
> > interrupt and terminate the application installation process).

### Possible errors

* `EAI_UNAVAILABLE` - an `EXTERNAL ACCESS INTEGRATION` object does not exist or an application does not have a `USAGE` privilege on it.
* `SECRET_UNAVAILABLE` - a `SECRET` object does not exist or an application does not have at least a `READ` privilege on it.
* `INTERNAL ERROR` - this response code is returned in case of unexpected errors occurrences.

### Example usage

> ```sqlexample
> CALL PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_NAMES(ARRAY_CONSTRUCT(
>     'PUBLIC.TEST_CONNECTION()',
>     'PUBLIC.FINALIZE_CONFIGURATION(VARIANT)',
>     'PUBLIC.TEMPLATE_WORKER(NUMBER, STRING)')
> );
> ```

## PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_REFERENCES()

The procedure alters other procedures or functions, whose signatures are passed as procedure argument in an array, with
an `EXTERNAL ACCESS INTEGRATION` and a `SECRET` objects that are assigned to application references. When using this
procedure, it’s required to have references registered with the following names:

* `EAI_REFERENCE` - for a reference to an `EXTERNAL ACCESS INTEGRATION` object.
* `SECRET_REFERENCE` - for a reference to a `SECRET` object.

Secret is attached to altered procedure/function with the `credentials` key. By default, the procedure is not available for any
of application user roles.

### Procedure signature

> ```sqlexample
> CREATE OR REPLACE PROCEDURE PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_REFERENCES(methods ARRAY)
>     RETURNS VARIANT
>     LANGUAGE SQL
>     [...]
> ```

Where:

* `methods ARRAY` stand for an array of procedure/function signatures as varchar, e.g. `ARRAY_CONSTRUCT('PUBLIC.PROC_1(VARIANT)', 'PUBLIC.PROC_2()')`.

### Returned values

The procedure always returns a Variant with a standard connector response structure.

In case of a successful procedure execution:

> ```json
> {
>   "response_code": "OK",
>   "message": "Successfully set up <number> method(s)."
> }
> ```
>
> > **Note:**
> >
> > The procedure execution finishes successfully even when procedure/function signatures passed as arguments do not
> > represent existing objects or an application does not have an access to these objects. The altering process of this
> > kind of procedure/function is skipped and the general process continues.

In case of a failure:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>",
>   "SQLCODE": "<code of a thrown exception>",
>   "SQLERRM": "<error message of a thrown exception>",
>   "SQLCODE": "<sql code of a thrown exception>"
> }
> ```
>
> > **Attention:**
> >
> > The procedure does not throw any error if an error occurs during the execution. Each error is wrapped into the
> > connector response, and mapped to appropriate `response_code` which allows validating the procedure result and
> > using it safely in the `setup.sql` during the application installation (otherwise any unhandled error could
> > interrupt and terminate the application installation process).

### Possible errors

* `EAI_UNAVAILABLE` - an `EXTERNAL ACCESS INTEGRATION` object does not exist or an application does not have a `USAGE` privilege on it.
* `SECRET_UNAVAILABLE` - a `SECRET` object does not exist or an application does not have at least a `READ` privilege on it.
* `INTERNAL ERROR` - this response code is returned in case of unexpected errors occurrences.

### Example usage

> ```sqlexample
> CALL PUBLIC.SETUP_EXTERNAL_INTEGRATION_WITH_REFERENCES(ARRAY_CONSTRUCT(
>     'PUBLIC.TEST_CONNECTION()',
>     'PUBLIC.FINALIZE_CONFIGURATION(VARIANT)',
>     'PUBLIC.TEMPLATE_WORKER(NUMBER, STRING)')
> );
> ```

## PUBLIC.SETUP_EXTERNAL_INTEGRATION()

This is a raw version of procedures described above which is also used by them. The procedure alters other procedures or
functions, whose signatures are passed as procedure argument in an array, with an `EXTERNAL ACCESS INTEGRATION` and
a `SECRET` object names that are also passed as procedure arguments. This procedure gives developer a freedom to decide
how to provide information about external access related objects to the procedure.

Secret is attached to altered procedure/function with the `credentials` key. By default, the procedure is not available for any
of application user roles.

Using this procedure is recommended only when there is no possibility of using procedures described above, that use references
with predefined names or object names stored under predefined keys in connection configuration.

### Procedure signature

> ```sqlexample
> CREATE OR REPLACE PROCEDURE PUBLIC.SETUP_EXTERNAL_INTEGRATION(eai_idf VARCHAR, secret_idf VARCHAR, methods ARRAY)
>     RETURNS VARIANT
>     LANGUAGE SQL
>     [...]
> ```

Where:

* `eai_idf VARCHAR` - stands for an identifier of an `EXTERNAL_ACCESS_INTEGRATION` object. If you want to pass there a reference name, you need to wrap it as follows: `'reference(\'<reference_name>\')'`
* `secret_idf VARCHAR` - stands for an identifier of aa `SECRET` object. If you want to pass there a reference name, you need to wrap it as follows: `'reference(\'<reference_name>\')'`
* `methods ARRAY` stand for an array of procedure/function signatures as varchar, e.g. `ARRAY_CONSTRUCT('PUBLIC.PROC_1(VARIANT)', 'PUBLIC.PROC_2()')`.

### Returned values

The procedure always returns a Variant with a standard connector response structure.

In case of a successful procedure execution:

> ```json
> {
>   "response_code": "OK",
>   "message": "Successfully set up <number> method(s)."
> }
> ```
>
> > **Note:**
> >
> > The procedure execution finishes successfully even when procedure/function signatures passed as arguments do not
> > represent existing objects or an application does not have an access to these objects. The altering process of this
> > kind of procedure/function is skipped and the general process continues.

In case of a failure:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>",
>   "SQLCODE": "<code of a thrown exception>",
>   "SQLERRM": "<error message of a thrown exception>",
>   "SQLCODE": "<sql code of a thrown exception>"
> }
> ```
>
> > **Attention:**
> >
> > The procedure does not throw any error if an error occurs during the execution. Each error is wrapped into the
> > connector response, and mapped to appropriate `response_code` which allows validating the procedure result and
> > using it safely in the `setup.sql` during the application installation (otherwise any unhandled error could
> > interrupt and terminate the application installation process).

### Possible errors

* `EAI_UNAVAILABLE` - an `EXTERNAL ACCESS INTEGRATION` object does not exist or an application does not have a `USAGE` privilege on it.
* `SECRET_UNAVAILABLE` - a `SECRET` object does not exist or an application does not have at least a `READ` privilege on it.
* `INTERNAL ERROR` - this response code is returned in case of unexpected errors occurrences.

### Example usage

> ```sqlexample
> CALL PUBLIC.SETUP_EXTERNAL_INTEGRATION(
>     'EXAMPLE_EAI_IDF',
>     'reference(\'CUSTOM_REFERENCE_NAME\')',
>     ARRAY_CONSTRUCT('PUBLIC.TEST_CONNECTION()',
>     'PUBLIC.FINALIZE_CONFIGURATION(VARIANT)',
>     'PUBLIC.TEMPLATE_WORKER(NUMBER, STRING)')
> );
> ```

When you want to use this procedure in the `setup.sql` script and names of a `SECRET` and an `EXTERNAL ACCESS INTEGRATION`
objects are stored in a different way from the one which is recommended by the Native SDK for Connectors, you need to
retrieve these values somehow. In this case, you can use the `EXECUTE IMMEDIATE` mechanism:

> ```sqlexample
> EXECUTE IMMEDIATE $$
>     DECLARE
>         eai_idf VARCHAR;
>         secret_idf VARCHAR;
>     BEGIN
>         -- retrieve name of an EXTERNAL ACCESS INTEGRATION object
>         :eai_idf = <eai_object_name>;
>
>         -- retrieve name of a SECRET object
>         :secret_idf = <secret_object_name>;
>
>         CALL PUBLIC.SETUP_EXTERNAL_INTEGRATION(
>             :eai_idf,
>             :secret_idf,
>             ARRAY_CONSTRUCT('PUBLIC.TEST_CONNECTION()',
>             'PUBLIC.FINALIZE_CONFIGURATION(VARIANT)',
>             'PUBLIC.TEMPLATE_WORKER(NUMBER, STRING)')
>         );
>     END;
> $$
> ;
> ```

---
title: Finalize configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/finalize_configuration.md
section: Native Apps Framework
---

# Finalize configuration

Finalize configuration is the last step of the Wizard, it comes directly after `connection configuration`.
This step allows the user to provide any custom configuration that was not included during the previous steps of the configuration.
Furthermore, it can be used to do some final touches when it comes to configuration, like creating the sink database, starting task reactor etc.
The entry point for this phase is a procedure called `PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION(CUSTOM_CONFIGURATION VARIANT)`.
It can be customized by replacing it in SQL or by using `FinalizeConnectorHandlerBuilder`.
By default, the provided `custom_configuration` is NOT persisted in the database,
so if it’s required by the design, the configuration must be saved in one of the extension methods
(most likely in the `FINALIZE_CONNECTOR_CONFIGURATION_INTERNAL`).

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The finalize configuration step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Status validation
2. Input validation
3. Source validation
4. Internal callback
5. Status update

## Requirements

Finalize configuration requires at least the following sql files to be executed during native app installation:

* `core.sql`
* `configuration/finalize_configuration.sql`
* Recommended: `configuration/app_config.sql`

## Status validation

To perform connector finalization the internal status of the connector needs to be `CONFIGURING`, with configuration status `CONNECTED`.

This validation cannot be overwritten by using `FinalizeConnectorHandlerBuilder` nor by overwriting stored procedures.
However, it is possible to implement a custom handler, which will not have this kind of validation.

## Input validation

Input needs to be a valid `Variant`. IN addition, there are custom validations that need to be satisfied. One stored procedure,
`PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_VALIDATE(CUSTOM_CONFIGURATION VARIANT)` stored can be customized by the user.
By default, this procedure just returns `'response_code': 'OK'`.
Customize it by overwriting the SQL or by using `FinalizeConnectorHandlerBuilder` and providing a custom implementation of the
`FinalizeConnectorValidator` interface.

## Source validation

Once the validations are passed, the procedure `PUBLIC.VALIDATE_SOURCE(CUSTOM_CONFIGURATION VARIANT)` connects to an external source.
In some cases this procedure can be the same as the `TEST_CONNECTION` procedure that was executed during connection configuration.
However, `TEST_CONNECTION` is designed to just check some basic connectivity, while `VALIDATE_SOURCE` is a procedure
that can require some additional configuration. For example, checking permissions to a specific resource in the source system.
The default implementation of `VALIDATE_SOURCE` returns `'response_code': 'OK'`. This default implementation can be overwritten with
SQL or by implementing the `SourceValidator` interface using `FinalizeConnectorHandlerBuilder`.

## Internal callback

Internal callback is a customizable step that invokes `PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_INTERNAL(CUSTOM_CONFIGURATION VARIANT)`,
which returns `'response_code': 'OK'` by default. This procedure allows the user to perform any additional configurations needed by the connector.
For example, saving the provided `custom_configuration` in the `STATE.CONNECTOR_CONFIGURATION` table.
It can be overwritten through the SQL script or by using a `FinalizeConnectorHandlerBuilder` to provide custom implementation of the `FinalizeConnectorCallback` interface.

## Status update

When all the above phases are completed successfully the internal status of the Connector will be updated to:

```json
{
    "status": "STARTED",
    "configurationStatus": "FINALIZED"
}
```

For the whole diagram of state transitions, see [Connector flow](overview.md).

### Response

#### Successful response

If the procedure finishes successfully it will return a response from `FINALIZE_CONNECTOR_CONFIGURATION_INTERNAL` procedure. We recommend using the following format:

> ```json
> {
>   "response_code": "OK"
> }
> ```

#### Error response

In case of an error the response will follow the below format:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes:

* `INVALID_CONNECTOR_STATUS` - The procedure was called on already configured connector
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - The procedure was called when the `CONFIGURATION_STATUS` was different from `CONNECTED`
* `CONNECTOR_STATUS_NOT_FOUND` - Connector status record does not exist in database (independent of user’s input at this stage - an internal error)
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive

---
title: Finalize configuration reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/finalize_configuration_reference.md
section: Native Apps Framework
---

# Finalize configuration reference

## Database objects and procedures

The following database objects are created through the file `configuration/finalize_configuration.sql`.

### PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION (CUSTOM_CONFIGURATION VARIANT)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `FinalizeConnectorHandler.finalizeConnectorConfiguration`.

### PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_VALIDATE (CUSTOM_CONFIGURATION VARIANT)

Procedure used for connector specific validation of the custom configuration. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultFinalizeConnectorValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.VALIDATE_SOURCE (CUSTOM_CONFIGURATION VARIANT)

Procedure checking the connection to the source system with additional configuration specific to the connector. In some cases
it might be the same as the `TEST_CONNECTION` procedure, but sometimes it will be performing checks in a more detailed way.
By default, it returns `'response_code': 'OK'`. It is invoked by `InternalSourceValidator`.

### PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_INTERNAL (CUSTOM_CONFIGURATION VARIANT)

Procedure used to perform any additional custom configurations. By default, it returns `'response_code': 'OK'`.
It is invoked by `InternalFinalizeConnectorCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Connector configuration is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))

## Related Java objects

The following Java objects are from the `com.snowflake.connectors.application.configuration.finalization` package and some common components are tightly connected with the above procedures:

* `FinalizeConnectorHandler`
* `FinalizeConnectorValidator`
* `SourceValidator`
* `FinalizeConnectorCallback`
* `ConnectorStatusService`
* `ConnectorErrorHandler`

## Custom handler

The handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of `FinalizeConnectorHandler` the `PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION(CUSTOM_CONFIGURATION VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomFinalizeConnectorHandler.finalizeConnectorConfiguration';

GRANT USAGE ON PROCEDURE PUBLIC.CONFIGURE_CONNECTOR(VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal `VALIDATE`, `INTERNAL` and `VALIDATE_SOURCE` procedures can be also customized through the SQL. They can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_INTERNAL(config VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

CREATE OR REPLACE PROCEDURE PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION_VALIDATE (config VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomFinalizeConnectorConfigurationValidateHandler.validate';
```

### Builder approach

`FinalizeConnectorHandler` can be customized using `FinalizeConnectorHandlerBuilder`. This builder allows the user to provide custom implementations of the following interfaces:

* `FinalizeConnectorValidator`
* `SourceValidator`
* `FinalizeConnectorCallback`
* `ConnectorErrorHelper`

In case one of them is not provided the default implementation provided by the SDK will be used.

```java
class CustomFinalizeConnectorValidator implements FinalizeConnectorValidator {
  @Override
  public ConnectorResponse validate(Variant config) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION procedure using SQL
  public static Variant finalizeConnector(Session session, Variant configuration) {
    //Using builder
    var handler = FinalizeConnectorHandler.builder(session)
      .withValidator(new CustomFinalizeConnectorValidator())
      .build();
    return handler.finalizeConnector(configuration).toVariant();
  }
}
```

---
title: Getting started with the Snowflake Native SDK for Connectors
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/getting_started.md
section: Native Apps Framework
---

# Getting started with the Snowflake Native SDK for Connectors

The Snowflake Native SDK for Connectors is a library that provides universal components that can be used to build a Snowflake native app
that ingests the data from an external data source into Snowflake. The provided components define the recommended flow
of the connector application, allow customization, and provide building blocks for building ingestion logic.

The Snowflake Native SDK for Connectors is distributed as code to be pulled and build locally.
Below you can find some information to get you familiarized with the structure of the SDK, how to use it in your project,
how to deploy and install an application and how to use it during development.

## Project structure

The Snowflake Native SDK for Connectors consists of multiple parts, which will be described below.

* `connectors-native-sdk`

This directory contains the actual Snowflake Native SDK for Connectors source code and tests.
Because of the nature of the native app inside Snowflake source code is not only Java code,
but also bundled SQL code with database objects definitions.

The Java code is inside `src/main` directory as for any regular Java library. The same for the unit tests located inside `src/test`.
Additionally, inside `src/` directory you can find `intTest` and `appTest` directories.
Those are respectively integration and application tests. Both of those test types require connection to a Snowflake account.
The former ones test the SDK components using standalone database objects, while the latter deploy an actual application and run tests using it.

SQL source files are contained inside `src/main/resources` directory. They are included inside jar archive when building the Snowflake Native SDK for Connectors.
To use them they need to be extracted from the jar and put inside a build target directory that will be copied to Snowflake stage inside application package.

* `connectors-native-sdk-test`

This directory contains helper library designed to enable easier unit testing of the Connector based on the Snowflake Native SDK for Connectors.
It provides mock implementations for some of the database objects,
specially designed test builders that allow overwriting parts of the code not available for customization and some custom assertions based on the [AssertJ library](https://assertj.github.io/doc/).

## SDK installation and usage

Currently, installation and usage of the Snowflake Native SDK for Connectors requires the developer to perform some manual actions.

The Snowflake Native SDK for Connectors library is available via Maven Central

```output
repositories {
    mavenCentral()
}

dependencies {
    compileOnly 'com.snowflake:connectors-native-sdk:2.0.0'
    testImplementation 'com.snowflake:connectors-native-sdk-test:2.0.0'
}
```

To access provided SQL files they need to be extracted to the target directory for the native app. To achieve this use the following gradle task definition (for now it has to be manually copied into the `build.gradle` file).

```javascript
String defaultBuildDir = './sf_build'
String defaultSrcDir = './app'
String libraryName = 'connectors-native-sdk'

project.tasks.register('copySdkComponents') {
    it.group = 'Snowflake'
    it.description = "Copies .sql files from ${sdkComponentsDirName} directory to the connector build file."
    doLast {
        copySdkComponents(libraryName, defaultBuildDir, sdkComponentsDirName)
    }
}

private void copySdkComponents(String libraryName, String defaultBuildDir, String sdkComponentsDirName) {
    TaskLogger.info("Starting 'copySdkComponents' task...")
    def targetDir = getCommandArgument('targetDir', {defaultBuildDir})

    try {
        project.copy {
            TaskLogger.info("Copying [${sdkComponentsDirName}] directory with .sql files to '${targetDir}'")
            from project.zipTree(project.configurations.compileClasspath.find {
                it.name.startsWith(libraryName)})
            into targetDir
            include "${sdkComponentsDirName}/**"
        }
    } catch (IllegalArgumentException e) {
        Utils.exitWithErrorLog("Unable to find [${libraryName}] in the compile classpath. Make sure that the library is " +
                "published to Maven local repository and the proper dependency is added to the build.gradle file.")
    }
    project.copy {
        TaskLogger.info("Copying [${libraryName}] jar file to [${targetDir}]")
        from configurations.runtimeClasspath.find {
            it.name.startsWith(libraryName)
        }
        into targetDir
        rename ("^.*${libraryName}.*\$", "${libraryName}.jar")
    }
    TaskLogger.success("Copying sdk components finished successfully.")
}
```

To then run this task:

```bash
./gradlew copySdkComponents
```

The extracted SQL files can be then executed during the execution of the `setup.sql` for the Native App.

## Deployment and installation

The Snowflake Native SDK for Connectors is designed to be used with the Snowflake Native App Framework. This means that deployment and installation is happening the same way
as it does for any other native app. This mean that first the Application Package needs to be created and all the files need to be uploaded into stage,
recommendation is to create stage inside the Application Package. If the above example script was used then all the required files from
the Snowflake Native SDK for Connectors should be already present in the target build directory of the Connector.
This means that its up to the developer to make sure that the custom code of the Connector and any Streamlit files are also there.
For more information check [Create and manage an application package](../creating-app-package.md).

Once the Application Package is created and files are uploaded to stage, then a `version` of the application can be created. This step is optional during development,
because Application Instance can be created directly from files in stage instead of using registered version.
For more information check [Install and test an app locally](../installing-testing-application.md).

## Development

The Snowflake Native SDK for Connectors provides objects and procedures that handle common use cases for each Connector application.
This includes things like configuration, lifecycle, ingestion definition, and so on. To review the full list of features, see the SDK reference.
Some parts of the predefined features can be customized, for more information on that check Stored procedures and handlers customization.

## Testing

As mentioned before Connectors Native SDK contains different types of tests.
This includes unit tests, integration tests and so called application tests.
The unit tests use features provided in the aforementioned `connectors-native-sdk-test`.
As for integration and application tests they require connection to Snowflake.
Connection details can be defined using the `.env/snowflake_credentials` file.
Application tests directory also contains an empty connector application in resource.
That application is deployed during the test suite execution

Additional resources:

> * [Snowflake Native SDK for Connectors Java API Reference](/developer-guide/native-apps/connector-sdk/java.md)
> * [Snowflake Native SDK for Connectors Java API TEST Reference](/developer-guide/native-apps/connector-sdk/test.md)

For hands-on experience on using and implementing your own connector, try our tutorials:

> * [Tutorial: Snowflake Native SDK for Connectors example Java connector](tutorials/native_sdk_example_connector_tutorial.md)
> * [Tutorial: Snowflake Native SDK for Connectors Java connector template](tutorials/native_sdk_template_connector_tutorial.md)

---
title: Grant restricted caller’s rights to an executable in an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-restricted-callers-rights.md
section: Native Apps Framework
---

# Grant restricted caller’s rights to an executable in an app

This topic describes how a consumer can grant caller grants to an executable
in a Snowflake Native App.

## About owner’s rights and restricted caller’s rights in an app

In the context of an app, the following
types of executables are supported:

* Stored procedures owned by the app
* Services available in apps with containers

Each of these types of executables can be configured to use either owner’s rights or restricted caller’s rights.

Owner’s rights:
:   By default, executables within an app use owner’s rights, which means that they run with the privileges granted to the owner of the executable, which is the app itself.

    > For example, owner’s rights allow an executable to access data in the provider account
    > and present that data to the consumer. However, they do not allow the consumer to access
    > the data directly.
    >
    > For example, the [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) command creates a
    > stored procedure that uses owner’s rights by default. Consumers can call the
    > stored procedure if they have been granted access using application roles. If the
    > app has the privileges to perform an operation, then the stored procedure can perform that
    > operation.
    >
    > For general information on owner’s rights, see
    > [Understanding caller’s rights and owner’s rights stored procedures](../stored-procedure/stored-procedures-rights.md).

Restricted caller’s rights:
:   Restricted caller’s rights allow an executable to run with caller’s rights, but restrict
    which of the caller’s privileges the executable runs with. With restricted caller’s rights,
    an executable owned by an app cannot run with a specific privilege unless an administrator
    in the consumer account explicitly allows it by using the [GRANT CALLER](../../sql-reference/sql/grant-caller.md)
    command.

    > **Note:**
    >
    > To guarantee that executables in an app are secure, Snowflake Native Apps do not support unrestricted
    > caller’s rights.

    For general information on restricted caller’s rights, see
    [Restricted caller’s rights](../restricted-callers-rights.md).

### Scope of restricted caller’s rights in an app

Snowflake recommends that consumers grant caller grants at a container level and not on specific objects in their account.

Schema level:
:   Grants caller rights to the schema, but does not grant any rights to objects
    in the schema. For example, granting the CALLER USAGE caller grant on a schema only
    grants the USAGE privilege on the schema. To grant access to a specific object, for
    example a function, use GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN SCHEMA.

Database level:
:   Granting caller grants at the database level only allows an executable to
    access the database and all schemas in the database. For example, granting the
    CALLER USAGE caller grant grants the USAGE privilege on the database. However, to
    grant access to a specific object, you must use the following command:

    ```sqlexample
    GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN DATABASE;
    ```

Account level:
:   Granting caller grants at the account level allows an executable to perform account-level operations.
    Granting the CALLER USAGE caller grant only allows the executable to access the account,
    it does not provide access to objects within the account.

    To allow access to specific objects, you grant access to specific types of object in the account.
    For example, granting the CREATE DATABASE caller grant allows an executable to create databases in
    the consumer account as shown in the following example:

    ```sqlexample
    GRANT CALLER CREATE DATABASE ON ACCOUNT TO my_app;
    ```

### Account-level caller grants that can be granted to an app

Providers can configure an executable in an app to use the following account-level caller grants:

* CREATE DATABASE
* EXECUTE ALERT
* EXECUTE MANAGED TASK
* EXECUTE TASK
* READ SESSION
* VIEW LINEAGE

> **Note:**
>
> Consumers should use caution when granting account-level caller grants to an app.

## Privileges required to grant restricted caller’s rights to an app

To grant caller grants to an app as a consumer, you must use the ACCOUNTADMIN role or use
a role that has the MANAGE CALLER GRANTS privilege. For more information, see
[GRANT CALLER](../../sql-reference/sql/grant-caller.md).

## Grant caller grants to an executable in an app using Snowsight

Using Snowsight, you can grant caller grants to an app on objects in the consumer account.

> **Note:**
>
> To perform other tasks, including revoking caller grants from an app, granting caller grants to a
> specific table, or granting account-level caller’s rights, you must use the appropriate SQL commands.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Click the Settings icon in the toolbar, then select the
   Privileges tab.

   If the app supports restricted caller’s rights, the Restricted caller’s rights section
   is displayed in the Privileges tab.

   > **Note:**
   >
   > You can only grant caller grants from Snowsight if the provider has configured the app
   > to display the restricted caller’s rights UI.
5. Click Add grants.
6. Select an Access scope.

   This determines whether the caller’s rights apply to a schema, a database, or at the account level.
   You should select the option with the least amount of scope possible to avoid granting unnecessary
   rights to the app.

   > **Caution:**
   >
   > Use caution when selecting account level scope, which can grant caller’s rights to the app on all
   > supported object types.
7. If you selected schema or database scope, select the schema or
   database as required.

> > **Note:**
> >
> > You can select multiple schemas or databases. You can also select schemas in different databases.

1. Click Next.
2. Select the type of objects to which caller’s rights will be granted.

   Use search to find an object type. The list of object types depends on the scope you chose above.

   When you select an object type, the object’s entry in the list expands to available privileges for
   each object type.
3. Select the privileges you want to grant.

   You can select multiple privileges for each object type. You can also select privileges for
   other object types.

   > **Note:**
   >
   > Snowflake automatically grants the USAGE privilege on any objects you select.
4. Click Next.
5. Select Grant summary to verify the scope, object types, and privileges that you select.

   > **Note:**
   >
   > Any objects of the selected type that are created in the future will be created with the same
   > privileges using the scope and object types selected.
6. Select SQL to view the [GRANT CALLER](../../sql-reference/sql/grant-caller.md) commands the Snowsight will run.

   > **Note:**
   >
   > If required, you can copy these commands and run them manually in a worksheet.
7. Click Save

> The scope, objects, and privileges you selected are displayed in the Restricted caller’s rights section.

To modify the privileges you selected, click Edit and select or deselect privileges as required.

## Grant caller grants to an executable in an app using SQL

When configuring an app that requests restricted caller’s rights, perform the
following tasks to grant caller grants to the app:

1. Check the listing of the app to verify if the provider has communicated that
   the app has RCR executables.
2. Grant the caller grants as mentioned in the listing. The following example shows how to use
   the [GRANT CALLER](../../sql-reference/sql/grant-caller.md) command to grant the SELECT privilege on all tables
   in a specific database and schema:

   ```sqlexample
   GRANT CALLER USAGE ON DATABASE db1
     TO APPLICATION hello_snowflake_app;
   GRANT CALLER USAGE ON SCHEMA db1.sch1
     TO APPLICATION hello_snowflake_app;
   GRANT INHERITED CALLER SELECT ON ALL TABLES IN SCHEMA db.sch1
     TO APPLICATION hello_snowflake_app;
   ```

   This command allows an executable with restricted caller’s rights to access run queries on
   all tables with the `db.sch1` database and schema. In addition to granting the SELECT privilege
   on all tables, you must also grant USAGE on the database and schema.

---
title: Guidelines for publishing an app to the Snowflake Marketplace
source: https://docs.snowflake.com/en/developer-guide/native-apps/publish-guidelines.md
section: Native Apps Framework
---

# Guidelines for publishing an app to the Snowflake Marketplace

This topic describes the criteria for publishing a Snowflake Native App to the Snowflake Marketplace.

## Publish an app in the Snowflake Marketplace

When your application package is ready to be published on the Snowflake Marketplace, you must submit it
to Snowflake for approval.

> **Note:**
>
> The approval process required to publish an app on the Snowflake Marketplace is in addition to the
> [automated security scan](security-overview.md) that is run when
> the DISTRIBUTION property of an application package is set to EXTERNAL.

Before creating a listing, verify that you understand the
enforced requirements and ensure that your
application package follows each requirement. If an application package does not follow these requirements,
your submission may be rejected.

If you receive a rejection notification for the application package you submitted, make the recommended
changes and resubmit your application package for approval.

## Standards for Snowflake Native Apps on the Snowflake Marketplace

Snowflake provides a platform that allows providers to build, distribute, and monetize apps.

The Snowflake review process ensures the quality of the apps that are published to the Snowflake
Marketplace. To ensure a streamlined review process, Snowflake provides the following requirements and
guidelines for apps that are published to the Snowflake Marketplace.

Immediate utility:
:   The app functionality must be provided within the consumer account and the app must be operational once installed.

Standalone:
:   Apps must deliver product experience on Snowflake and facilitate external requirements through Snowflake functionality.

Data-centric:
:   Apps should be based on data-centric use cases that leverage data stored in Snowflake.

Transparent and simple:
:   Apps must use Snowflake features to disclose the app’s resource and access requirements and simplify the configuration process
    for the consumer.

## Enforced requirements

Snowflake uses the following guidelines to determine if a Snowflake Native App meets the
requirements for publication on the Snowflake Marketplace. These requirements are verified when
you submit a listing with an attached application package to the Snowflake Marketplace.

1. Immediate utility

   1. Apps must not be shell apps that advertise functionality. Apps must deliver the advertised functionality.
   2. Apps must include a clear framework and instruction for utilizing app functionality.
   3. Apps should not crash, freeze, or otherwise function abnormally.
   4. Apps must list all required credentials and providers must share required credentials with
      Snowflake at submission for testing.
   5. If apps are not immediately actionable, they must document the expected workflow for a consumer,
      allowing consumers to fully install and configure the app.
2. Standalone

   1. Apps must not be pass-through. For example, they must not redirect consumers to an external
      service to enable the app’s core functionality.
   2. App interfaces must be accessible after installation directly from Snowflake.
   3. Apps cannot require consumers to create users or roles that provide access to an external service
      in the Snowflake consumer’s account.
   4. Apps cannot use the Snowflake Marketplace as a distribution platform for cross-selling external
      applications or services.
3. Data-centric

   1. Apps must leverage Snowflake data in one of the following ways:

      1. Share data from the app provider’s account.
      2. Use datasets from the Snowflake Marketplace.
      3. Access data in the consumer account.
4. Transparent

   1. All account-level privileges and references that the app requires must be listed in the
      application package manifest file.
   2. All resource requirements for the Snowflake Native App must be listed in the
      [marketplace.yml](marketplace-file.md)
      file of the app. The app must create these resources as part of installation and setup.
   3. All account-level privileges and references listed in the application package manifest file must be requested
      from the consumer through Snowsight or the Python Permission SDK.
   4. Apps must provide a readme file. Apps that do not include a Streamlit or custom user interface must include the
      following information in the readme file:

      1. A description of what the app does.
      2. The steps the consumer must perform to configure the app after it is installed.
      3. The stored procedures and user-defined functions the app uses.
      4. The privileges the app requires.
      5. Example SQL commands that show consumers how to use the app.
   5. All required SQL commands must be delivered using Snowflake and formatted as code blocks.
   6. If the app provides sample data, you must include procedures on how to use the sample data.
   7. If an application package contains a Streamlit app but does not contain a `readme` file,
      you must [configure a default Streamlit app](adding-streamlit.md).

## Best practices when publishing a Snowflake Native App

In addition to the requirements for submitting an application package to the Snowflake Marketplace,
Snowflake also recommends the following best practices when publishing a Snowflake Native App:

* Ensure that all required files are uploaded to the named stage for the version of the app you are
  submitting, including:

  + The manifest file.
  + The setup script.
  + The README file.
  + Any external stored procedures or user-defined functions required by the application package.
  + Any Streamlit files required by the application package.
  + Any external source code, including Python, Java, etc.
* Ensure that the version of the app you are developing passes the
  [automated security scan](security-overview.md).
* Test the new version of your application package by creating the application object locally by using the
  [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command.

  + Do not add a new version to your application package or set the DISTRIBUTION property to EXTERNAL
    while you are developing and testing an app. These actions trigger the
    [automated security scan](security-overview.md) which
    delays the development cycle.

    Instead, create the application object using
    [files on a named stage](installing-testing-application.md).
  + If your app includes a Streamlit app, test the application in Snowsight to ensure
    the Streamlit app works as expected.
  + Verify that interactions between the Streamlit app and Snowflake Worksheets are seamless
    and that the consumer does not have to navigate excessively between the two.
* Review all parts of a listing before submitting it for approval.
* Ensure that there are no typos or other textual errors in the listing, `readme` file, and
  Streamlit app.

### Recommendations for trial listings

When an app trial listing expires, Snowflake automatically suspends the app to avoid consumers incurring
extra compute costs to the consumer. Snowflake only suspends the objects owned by the app that are currently
active. Snowflake does not modify the status of objects that are already suspended.

When a trial listing is converted to a full or paid listing, Snowflake attempts to re-enable the
app by resuming tasks, containers, and compute pools. Snowflake only resumes services and compute pools
that have the `auto_resume` property set to false.

### Recommendations for apps with containers

* Compute pools should be set to automatically suspend in combination with Snowpark Container Services
  jobs to avoid idle compute nodes.
* For higher availability during upgrades and to reduce cold start latency, Snowflake recommends that you
  set the `MIN_NODES` parameter greater than 1.
* If connections across different services are required in the same app, use the DNS name of the service
  instead of configuring an external access integration.

### Recommendations for event sharing

* Providers should configure an app to emit log messages and trace events that conform
  to [supported event definitions](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#about-event-sharing)
  to ensure that consumers understand what information is collected.
* Mandatory event definitions should be limited to the log messages and trace events required by the app. Excessive or unnecessary
  mandatory event definitions should be avoided.
* Adding new mandatory event definitions in a version upgrade must require the consumer re-enable event definitions
  for the app.
* Use the Python Permission SDK to allow consumers to share optional events.

---
title: Implementing connector applications using SDK
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/overview.md
section: Native Apps Framework
---

# Implementing connector applications using SDK

This section describes what you need to know to implement your own Native Application using the SDK library.

We recommend going through the
[tutorial](../tutorials/native_sdk_template_connector_tutorial.md) as a first step.

* [Choosing SDK components](choosing_components.md)
* [Responses and error handling](response_and_error_handling.md)
* [Stored procedures and handlers customization](sproc_and_handlers_customization.md)
* [Troubleshooting](troubleshooting.md)
* [Ingestion scheduler](scheduler.md)
* [Task reactor](task_reactor.md)

---
title: Include a trained model in an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/snowflake-ml-na-with-model.md
section: Native Apps Framework
---

# Include a trained model in an app

This topic describes how to include a previously trained mode in a Snowflake Native App

## Workflow - Add a model to an app

The following procedure outlines the typical workflow a provider follows create and add
a Snowflake ML model to an app:

1. The provider develops a Snowflake ML model and logs it in the
   [Snowflake Model Registry](../snowflake-ml/model-registry/overview.md).
2. The provider exports the model artifacts from the Snowflake Model
   Registry and uploads them to
   a stage so that they are accessible to the application package.
3. The provider creates the model in the setup script of the app.
4. The app creates a model from these artifacts in the consumer account during installation
   or after an upgrade. Optionally, the app can grant access on the model to an application role.
5. The consumer uses the machine learning model if the provider configures the app to grant access to it.

> **Note:**
>
> A provider is not required to grant access on the model to the consumer. The model can be created as an
> object that the app uses internally, but is not accessible to the consumer.

## Develop a machine learning model

Providers can develop new machine learning models or include existing models in an app.

* For information about developing models, see [Snowflake ML Model Development](../snowflake-ml/modeling.md).
* For information about managing models in a Snowflake Model Registry, see
  [Snowflake Model Registry](../snowflake-ml/model-registry/overview.md).

## Export the model artifacts and upload to a stage

To include a model in an app, providers must export the model artifacts and upload them to
a stage where they are accessible to the application package.

### Manually export the model artifacts and upload them to a stage

1. Download the model artifacts. See [Snowflake Model Registry](../snowflake-ml/model-registry/overview.md)
2. Use one of the following methods to upload the machine learning artifacts to the stage where your app resources are located:

   * To upload the files using Snowsight, see [Staging files using Snowsight](../../user-guide/data-load-local-file-system-stage-ui.md).
   * To upload the files using the Snowflake CLI, use the `snow app deploy` command. See
     [How to create an application package and an application object together](../snowflake-cli/native-apps/create-package.md).
   * To upload the files using SQL, see [Staging data files from a local file system](../../user-guide/data-load-local-file-system-stage.md).

### Use a stored procedure to export the model artifacts and upload them to a stage

Providers can use the following stored procedure example as a template for automating the process of
downloading the model artifacts and uploading them to a stage:

```sqlexample
CREATE OR REPLACE PROCEDURE copy_model_artifacts_to_stage(src_registry_schema_fqn string, src_model string, src_model_version string, dst string)
  RETURNS STRING
  LANGUAGE python
  runtime_version = 3.11
  handler = 'copy_model_artifacts_to_stage'
  packages = ('snowflake-snowpark-python')
  execute as caller
as
$$

def copy_model_artifacts_to_stage(session, src_registry_schema_fqn, src_model, src_model_version, dst):
  session.use_schema(src_registry_schema_fqn)
  list_files = session.sql(f"list 'snow://model/{src_model}/versions/{src_model_version}/'")
  list_files.collect()
  for row in list_files.toLocalIterator():
     parts = row["name"].rsplit('/', 1)
     directory = parts[0]
     filename = parts[1]
     session.file.get(f"snow://model/{src_model}/{directory}/{filename}", f"/tmp/{directory}")
     session.file.put(f"/tmp/{directory}/{filename}", f"{dst}/{src_model}/{directory}", auto_compress=False, overwrite=True, source_compression="NONE")

  return f"Copied [snow://model/{src_model}/versions/{src_model_version}/*] to [{dst}/{src_model}/{directory}/]"
$$;

CALL copy_model_artifacts_to_stage('my_db.my_model_registry, 'my_model', 'V1', '@my_app_pkg.source_schema.source_stage/models');
```

## Create the model objects in the consumer account

To create the model objects in the consumer account, the provider adds the necessary
SQL commands to the setup script as shown in the following example:

```sqlexample
CREATE APPLICATION ROLE IF NOT EXISTS app_user;

CREATE OR ALTER VERSIONED SCHEMA app_code;
GRANT USAGE ON SCHEMA app_code TO APPLICATION ROLE app_user;

CREATE OR REPLACE MODEL app_code.my_model FROM '/models/my_model/versions/V1;
```

Optionally, providers can grant access on the model to consumers by granting the
USAGE privilege on the model to an application role:

```sqlexample
GRANT USAGE ON MODEL app_code.my_model TO APPLICATION ROLE app_user;
```

## Access the model within the app

To use the model internally as part of the app, providers add a SELECT statement
to the setup script as shown in the following example:

```sqlexample
SELECT app_code.my_model!predict(...);
```

## Use the model as a consumer

If a provider grants privilege on the model to a consumer, the consumer can run the
following command to access the model:

```sqlexample
SELECT app_code.my_model!predict(...);
```

To run this command, consumers must use a role that has one of the following:

* The USAGE privilege granted on the model.
* The OWNERSHIP privilege on the application object.

---
title: Ingestion management
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/ingestion-management/overview.md
section: Native Apps Framework
---

# Ingestion management

After the connector is configured it can start ingesting the data.
However, usually some more information is needed before it can ingest the data from the source system.
Most of the systems persist the data with at least some granularity, be it tables, repositories, files, or reports.
The Snowflake Native SDK for Connectors uses a term `resource` regardless of the name in the original system.
To identify resources and customize settings for their ingestion, `resource_ingestion_definitions` are being used. Additionally,
the actual process of ingestion is organized into `ingestion_processes`, which consist of multiple `ingestion_runs`.
This abstraction makes it easier to track, schedule and differentiate processes.

## Requirements

This section requires at least the following SQL files to be executed during native app installation:

* `ingestion/ingestion_management.sql`
* `ingestion/ingestion_definitions_view.sql`
* `ingestion/ingestion_process.sql`
* `ingestion/ingestion_run.sql`
* `ingestion/resource_ingestion_definition.sql`

## Resource ingestion definition

Resource ingestion definition is a generic entity that contains the definition of the source data in the source system.
To keep it as generic as possible the system specific options are persisted as `variants`
in the underlying `STATE.RESOURCE_INGESTION_DEFINITION` table. However, the Java definition of the repository `ResourceIngestionDefinitionRepository`
is a generic interface to have better control over typing.

Since most of the resource ingestion definition can be customized by during the implementation,
then it is up to the developer to decide how to use the generic fields and then make use of them during ingestion.

The most important customizable properties of the resource ingestion definition are:

* `parent_id`

This optional parameter allows linking resource definitions with each other, for example, to inherit a part of the configuration.

* `resource_id`

This `variant` should allow the identification of a resource in the source system, it should be unique.

* `ingestion_configurations`

This property actual configuration of the ingestion, each definition can have multiple configurations,
for example if for some reason the same resource should be ingested at two different schedules or saved into multiple sink tables.
This property has some required fields inside of it, but still allows some flexibility when it comes to defining custom configuration
and destination of the data.

* `resource_metadata`

This property should contain any additional information that is needed, but does not fit into above mentioned fields.

## Ingestion process

Ingestion process is an entity representing enabled process of ingesting a defined resource. It is created once a resource is added or enabled and should be completed
once it’s deleted or disabled. In a way it is kind of like a background process in the operating system, it can be alive but not necessarily doing any work at the particular moment.
Whenever the ingestion is actually running it can be transitioned to `IN_PROGRESS` state, otherwise it can remain in `SCHEDULED` state.
When dispatching work `scheduler` retrieves all the `SCHEDULED` processes and runs ingestion for them.

The ingestion process can be also used to define different types of ingestion, for example, say that on a daily basis connector loads some data,
but for some reason some old data is corrupted and needs to be reloaded. If that’s the case then a new process `type` can be introduced, for example `RELOAD`.
Then `scheduler` can have custom logic to perform different operations for different types of processes.

## Ingestion run

Ingestion run is another entity to store information about the past and ongoing ingestion. However, this data is more granular than the `ingestion_process` itself.
First of all, ingestion run should be considered as a log data. Secondly, `ingestion_run` is an entry describing just a single invocation during a long running process.
So if a resource is ingested once a day, then every day there should be a new ingestion run entry. All of those entries will be linked with the single process.

## Ingestion management operations

### Creating new resource

Resource creation process is used to define and schedule an ingestion of data from a source system.
It creates a resource ingestion definition record and corresponding ingestion processes if a given resource should be initially enabled.

For more information, see [Create Resource](create_resource.md).

### Viewing resources

Configured resources definitions can be examined in the `PUBLIC.INGESTION_DEFINITIONS` view. However, this view only returns basic information about each resource.
All the custom configurations are not visible to the end user, especially because some of them can be generated internally by the connector’s logic.

### Disabling a resource

The disabling a resource step is used to stop ingesting data for a given resource.
It finishes active ingestion processes and marks a resource ingestion definition as disabled.

For more information, see [Disable Resource](disable_resource.md).

### Enabling a resource

Enabling a resource is used to start ingesting data for a given resource.
It creates new ingestion processes and marks a resource ingestion definition as enabled.

For more information, see [Enable Resource](enable_resource.md).

### Updating a resource

Updating a resource is used to change a configuration of ingestion for a given resource.
It modifies a resource ingestion definition and finishes or creates new ingestion processes.

For more information, see [Update Resource](update_resource.md).

---
title: Ingestion scheduler
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/scheduler.md
section: Native Apps Framework
---

# Ingestion scheduler

Library which provides common elements and features that are used in all Snowflake connectors.

## Requirements

Default implementation of the scheduler requires the following files to be executed during the connector installation:

* `core.sql` (See: [Core SQL reference](../reference/core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](../reference/app_config_reference.md))
* `configuration/connector_configuration.sql` (See: [Connector configuration reference](../reference/connector_configuration_reference.md))
* `scheduler/scheduler.sql` (See: [Ingestion scheduler reference](../reference/scheduler_reference.md))

## Overview

The scheduler task takes care of triggering the ingestion of resources at appropriate times according to their configuration.
This task is not started by the SDK itself and needs to be created and resumed, for example, during finalize configuration step.
There are two ways of achieving this: using the procedure called [PUBLIC.CREATE_SCHEDULER()](../reference/scheduler_reference.md) from SQL
or by calling [SchedulerCreator#createScheduler()](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/scheduler/SchedulerCreator.md) directly from the Java code.

The default implementation will create the scheduler task using the expression provided in `connector_configuration`, under the
`global_schedule` key. When the default scheduler task is executed it searches for all the enabled resource ingestion definitions that
have their [ScheduleType](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/ScheduleType.md) in configuration set to `GLOBAL` and their corresponding ingestion processes.
Each of the processes is then updated to `IN_PROGRESS` status. This status will be updated again to `SCHEDULED` after ingestion iteration is finished.
Then for each of them [OnIngestionScheduledCallback](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/scheduler/OnIngestionScheduledCallback.md) is executed.
This callback can be completely custom and can be implemented using SQL or Java. The default implementation of this callback does nothing,
however the SDK also provides an implementation of this callback using the [Task reactor](task_reactor.md) module. This implementation retrieves
the data about resources from the database and puts a work item containing this data in the Task Reactor queue.

When the work item is finished another callback called [OnIngestionFinishedCallback](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/taskreactor/OnIngestionFinishedCallback.md) is executed.
This callback changes the process state back to `SCHEDULED` once the ingestion is done.

---
title: Ingestion scheduler reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/scheduler_reference.md
section: Native Apps Framework
---

# Ingestion scheduler reference

## Database objects and procedures

The following procedures are created by the file `scheduler/scheduler.sql`.

### PUBLIC.CREATE_SCHEDULER()

This procedure acts as the entry point between SQL and Java. It will create a task running according to the schedule available in `APP_STATE` table.
This task will execute below `PUBLIC.RUN_SCHEDULER_ITERATION()` procedure when executed.

### PUBLIC.RUN_SCHEDULER_ITERATION()

This procedure is an entry point to the Java implementation of the actual scheduling task. It will
be invoked whenever the scheduler task is executed.

It needs `com.snowflake:telemetry` package in order to emit metrics to event table.

### PUBLIC.ON_INGESTION_SCHEDULED (process_id VARCHAR)

This procedure defines the ingestion flow for a single process that was taken by the scheduler for execution. The default implementation does nothing.
We recommend implementing this in Java using the `OnIngestionScheduledCallback` interface.

#### Related features

Other related features:

* `Task Reactor`
* `Ingestion`

### Related Java objects

Java implementations and related classes:

* `CreateSchedulerHandler`
* `RunSchedulerIterationHandler`
* `RunSchedulerIterationHandlerBuilder`
* `OnIngestionScheduledCallback`
* `OnIngestionFinishedCallback`

### Custom handler

Ingestion scheduler feature consists of two different handlers acting as entry point from SQL to Java:

* `CreateSchedulerHandler`
* `RunSchedulerIterationHandler`

We recommend customizing only the latter one.

### Builder approach

`RunSchedulerIterationHandler` can be customized using `RunSchedulerIterationHandlerBuilder`.
This helper objects allows for custom implementations of the underlying interfaces:

* `ConnectorErrorHelper`
* `OnIngestionScheduledCallback`

In case they are not provided the default implementations will be used.

```java
class CustomOnIngestionScheduledCallback implements OnIngestionScheduledCallback {
    @Override
    public void onIngestionScheduled(String processId) {
        // CUSTOM LOGIC
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the PUBLIC.RUN_SCHEDULER_ITERATION procedure using SQL
    public static Variant runIteration(Session session) {
        return RunSchedulerIterationHandler.builder(session)
            .withOnIngestionScheduledCallback(new CustomOnIngestionScheduledCallback())
            .build()
            .runIteration()
            .toVariant();
    }
}
```

---
title: Install an app from a listing
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-installing.md
section: Native Apps Framework
---

# Install an app from a listing

This topic describes how to use [Snowsight](../../user-guide/ui-snowsight-gs.md) to install apps created using the Snowflake Native App Framework.

## Workflow for installing an app from a listing

To find and install a listing for an app:

1. Set up the privileges required to install a listing.
2. Install the app from the listing.

   * If you are installing a privately shared listing, see Install an app from a privately shared listing
   * If you are installing a listing shared on the Snowflake Marketplace, see
     [Working with Snowflake Marketplace listings for an app](ui-consumer-installing-container.md).
   * If a provider has published multiple version of an app, see Install an app using release channels.
3. [View the installed listing](ui-consumer-managing-applications.md).

   See [Allow access to a consumer account](ui-consumer-granting-privs.md) for information on tasks related to managing an app.
   See [Set up event tracing for an app](ui-consumer-enable-logging.md) for information on setting up event sharing.

## Set up required privileges

To access a listing, you must use the ACCOUNTADMIN role or another role with the IMPORT SHARE and
CREATE DATABASE privileges.

After an app is installed, the app owner can grant access to the app
using application roles. See [Grant application roles to account roles](ui-consumer-managing-applications.md) for details.

> **Note:**
>
> To pay for an app, your role must also have the PURCHASE DATA EXCHANGE LISTING privilege and you must meet additional
> criteria. Refer to [Pay for listings](../../collaboration/consumer-listings-paying.md).

## Install an app from a privately shared listing

> **Note:**
>
> As a provider, you can test your app by creating a private listing, sharing it with another account in your organization
> that you have access to, signing in to that account, and following these steps to install the app.

To install an app from a private listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. In Recently shared with you, select the tile for the listing.
4. Select Security to view the privileges and logging requests for the app, including:

   * [Account level privileges](ui-consumer-granting-privs.md)
   * [Privileges on objects](ui-consumer-granting-privs.md)
   * Connections
   * [App events](ui-consumer-enable-logging.md)
5. Select Get, or for a monetized app, select Buy.

   > **Note:**
   >
   > If the provider includes required
   > [event definitions](ui-consumer-enable-logging.md)
   > in the app, the consumer must set up an event table before installing the app. Even sharing
   > and the required event definitions are enabled during installation and cannot be disabled later.
6. Enter a name for the app.
7. Select the warehouse that you want to use to install the app.
8. Select Get.
9. Select Open to view the app or Done to finish.

## Install an app from a Snowflake Marketplace listing

To install an app from a Snowflake Marketplace listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search or browse to the listing you want to access.
4. Select the listing, then view the privileges and logging requests for the app,
   including:

   * [Account level privileges](ui-consumer-granting-privs.md)
   * [Privileges on objects](ui-consumer-granting-privs.md)
   * [Events and logs](ui-consumer-enable-logging.md)
5. Select Get to access the listing.

   > **Note:**
   >
   > If the provider includes required
   > [event definitions](ui-consumer-enable-logging.md)
   > in the app, the consumer must set up an event table before installing the app. Even sharing
   > and the required event definitions are enabled during installation and cannot be disabled later.
6. Select the warehouse that you want to use to install the app.
7. (Optional) Enter a name for Application name.
8. Select Get.
9. Select Open to view the app, or select Done to finish.

## Install an app using release channels

Release channels allow providers to publish multiple versions of an app. Possible
versions are:

QA:
:   Allows providers to publish a test version of an app. Apps installed from the QA release channel
    have not been reviewed or tested.

Alpha:
:   Allows providers to share apps with consumers to obtain feedback. Apps installed from the Alpha
    release channel may contain versions that have not passed the security review.

Default:
:   This is the production version of an app. Default versions have passed the Snowflake and functional review.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. In Recently shared with you, select the tile for the listing.
4. Select Security to view the privileges and logging requests for the app, including:

   * [Account level privileges](ui-consumer-granting-privs.md)
   * [Privileges on objects](ui-consumer-granting-privs.md)
   * Connections
   * [App events](ui-consumer-enable-logging.md)
5. Select Get to access the listing.
6. Select the version of the app you want to install.

   Installing different versions of the app allows you to test each version independently.
7. Select the warehouse that you want to use to install the app.
8. Optional: For Application name, enter a name.
9. Select Get.
10. Select Open to view the app or Done to finish.

## Install multiple instances of an app

Providers can configure an app so that multiple instances of an app can be installed at the same time.

> **Note:**
>
> Apps installed from a trial listing or a monetized listings cannot have multiple instances.

If an app is configured to allow multiple installs, consumers can install additional instances after
installing the app from a private listing or from
the Snowflake Marketplace.

If multiple instances are enabled for an app, you can install a maximum of 10 instances in your account.

To install a new instance of an app, perform the following tasks:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app of which you want to install another instance.

   If multiple instances of the app are already installed, Snowsight displays
   a panel showing all of the instances of the app.
4. Select Add instance

   > **Caution:**
   >
   > Add instance only appears if the provider has configured the app to allow multiple instances.
5. Enter a name for the instance, then select the warehouse to use for this instance.
6. Select Get.

   > The app installs and Snowflake sends a notification email to the app admin.
7. Select Done to complete the installation.

After installing the app instance, you can
[set up event tracing for an app](ui-consumer-enable-logging.md),
[configure privileges](ui-consumer-granting-privs.md) for the app, and perform other
[management tasks](ui-consumer-managing-applications.md).

---
title: Install and manage an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-installing-container.md
section: Native Apps Framework
---

# Install and manage an app with containers

This topic describes how to use [Snowsight](../../user-guide/ui-snowsight-gs.md) to install a Snowflake Native App with Snowpark Container Services.

## Workflow for installing an app with containers from a listing

To find and install a listing for a Snowflake Native App with Snowpark Container Services:

1. Set up the privileges required to install a listing.
2. Install the listing.

   * If you are installing a privately shared listing, refer to Install an app with containers from a privately shared listing
   * If you are installing a listing shared on the Snowflake Marketplace, refer to
     Working with Snowflake Marketplace Listings for an app.
3. [View the installed listing](ui-consumer-managing-applications.md).
4. Refer to [Allow access to a consumer account](ui-consumer-granting-privs.md) for information on tasks related to managing an app.

## Set up required privileges

To access a listing, you must use the ACCOUNTADMIN role or another role with the IMPORT SHARE and
CREATE privileges on the app.

After an app is installed, the app owner can grant access to the app
using application roles. Refer to [Grant application roles to account roles](ui-consumer-managing-applications.md) for details.

> **Note:**
>
> To pay for an app, your role must also have the PURCHASE DATA EXCHANGE LISTING privilege and you must meet additional
> criteria. For more information, see
> [Pay for listings](../../collaboration/consumer-listings-paying.md).

## Install an app with containers from a privately shared listing

> **Note:**
>
> As a provider, you can test your app by creating a private listing, sharing it with another account in your organization
> that you have access to, signing in to that account, and following these steps to install the app.

To install an app with containers from a private listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. In Recently shared with you, select the tile for the listing.
4. Select Get, or for a monetized app, select Buy.
5. Select Options, then enter a name for the app.
6. Select the warehouse where you want to install the app.
7. Select Get.

   The Installing app dialog displays. It may take some time to install the app.
   After the app is installed, the dialog displays Successfully Installed.
8. Select Configure.

   This displays a list of the privileges and references to objects the app requires.
9. Click Grant to grant the privileges required by the app.

   Apps with containers frequently require the following privileges:

   * CREATE COMPUTE POOL allows the app to create a compute pool in your account.
   * BIND SERVICE ENDPOINT allows services in the app to connect to each other.
10. Click Activate.

    The app begins activation. Depending on the complexity of the app, this may take some time.

    After activation, the Settings page is displayed.
11. After the activation is complete, select Launch App.

## Install an app from a Snowflake Marketplace listing

To install an app from a Snowflake Marketplace listing, perform the following steps:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search or browse to the listing you want to access.
4. Select the tile for the listing.
5. Select Get, or for a monetized app, select Buy.
6. Select Options, then enter a name for the app.
7. Select the warehouse where you want to install the app.
8. Select Get.

   The Installing app dialog displays. It may take some time to install the app.
   After the app is installed, the dialog displays Successfully Installed.
9. Select Configure.

   This displays a list of the privileges and references to objects the app requires.
10. Click Grant to grant the privileges required by the app.

    Apps with containers frequently require the following privileges:

    * CREATE COMPUTE POOL allows the app to create a compute pool in your account.
    * BIND SERVICE ENDPOINT allows services in the app to connect to each other.
11. Click Activate.

    The app begins activation. Depending on the complexity of the app, this may take some time.

    After activation, the Settings page is displayed.
12. After the activation is complete, select Launch App.

## View the compute pools used by an app with containers

An app with containers provides a Compute tab that allows you to view information about
the compute pools used by an app. For information about managing other components of an app,
see [Manage apps](ui-consumer-managing-applications.md).

To view the compute pools used by an app:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app whose compute pools you want to view.
4. Select the Compute tab.

This tab displays the following information for each compute pool:

* The name of the compute pool and its status.
* The number of jobs running in the compute pool.
* The number of services running in the compute pool.
* The number of nodes currently assigned to the compute pool.
* The minimum number of nodes the compute pool can contain.
* The maximum number of nodes the compute pool can contain.
* The instance family of the compute pool.

For more information on these properties, see
[CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md)

---
title: Install and test an app locally
source: https://docs.snowflake.com/en/developer-guide/native-apps/installing-testing-application.md
section: Native Apps Framework
---

# Install and test an app locally

This topic describes how providers can create and test a Snowflake Native App locally.

## About creating and testing apps

With the Snowflake Native App Framework, providers can create an app within the same account as the
application package, so they can test the app before publishing it to consumers.

Providers can also test the app in a single account without having to
alternate between provider and consumer accounts.

## Privileges required to create and test an app

To create an app locally from an application package, you must have the following privileges
granted to your role:

* The CREATE APPLICATION account-level privilege granted to your role.
* The INSTALL object-level privilege granted on the application package.

The following examples show how to use the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command
to grant these privileges to an account:

```sqlexample
GRANT CREATE APPLICATION ON ACCOUNT TO ROLE provider_role;
GRANT INSTALL ON APPLICATION PACKAGE hello_snowflake_package
  TO ROLE provider_role;
```

### Use the DEVELOP privilege

By default, the role used to create an application package has permissions to use the
[CREATE APPLICATION](../../sql-reference/sql/create-application.md) command to create an app based on the
application package.

However, in some development environments you may need to allow users with other roles to
create and test an application package. To do this, grant the DEVELOP object-level privilege
on the application package to a role.

The DEVELOP privilege grants the privileges required to create and test an
app based on an application package. This privilege allows a user to perform
the following tasks using the application package on which they have been granted access:

* Create an app based on a version or patch specified in the application package.
* Upgrade to a different version of an app using the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command.
* Create or upgrade an app using files on a named stage.
* Enable debug mode on an app created in development mode.

To grant the DEVELOP privilege to a role, use the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command as
shown in the following example:

```sqlexample
GRANT DEVELOP ON APPLICATION PACKAGE hello_snowflake_package TO ROLE other_dev_role;
```

> **Note:**
>
> The DEVELOP object-level privilege is specific to a single application package. You must run
> [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) for each application package you want
> to assign the DEVELOP privilege for.

## Workflow for creating and testing an app

The Snowflake Native App Framework provides different ways of creating an app from an application
package. This allows you to test a Snowflake Native App before publishing it to consumers. The method you
use depends on what parts of the app you want to test.

The following steps outline a typical workflow for testing an app:

1. Create the app.

   You can create an app locally based on the following:

   * Files on a stage

     This allows you to quickly test a new version of a setup script or application code files.
     See Create an app using staged files for more information.
   * A version or patch defined in the application package

     After defining a version or patch for an application package, you can test this version by
     creating an app based on it. For
     more information, see Create an app from a version or patch.
2. Upgrade an app.

   After verifying that an app is working correctly, you can upgrade it to a new version in one of two ways:

   * From a file on a stage
   * From a version or patch defined in the application package
3. Create an app based on a release directive.

   After testing an app using specific files or a version or patch, you can create an app
   based on the release directive defined for the application package. Using the release directive,
   you do not need to specify a stage or version of the app.

   For more information, see Create an app using staged files.
4. Install an app from a listing.

   After testing that the application package and app are working correctly in your local account, you
   can add the application package to a listing and test the installation using Snowsight.

   For more information, see Create an app using staged files.

## Create an app

You can install an app directly in your account to test its functionality and privileges before
sharing it with customers. The [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command supports different
syntaxes for creating an app.

> **Note:**
>
> The following sections assume that you have created an application package,
> the required manifest file, and a setup script.

### Create an app using staged files

You can create an app using a manifest file and setup script uploaded to a named
stage. This allows you to test changes to these files without having to
add a new version to an application package.

Use the [CREATE APPLICATION](../../sql-reference/sql/create-application.md) command to create an app
using staged files as shown in the following example:

```sqlexample
CREATE APPLICATION hello_snowflake_app FROM APPLICATION PACKAGE hello_snowflake_package
  USING '@hello_snowflake_code.core.hello_snowflake_stage';
```

### Create an app from a version or patch

After defining a version or patch in an application package, you can create an app
based on that version or patch.

To create a an app from a specific version, use the [CREATE APPLICATION](../../sql-reference/sql/create-application.md)
command as shown in the following example:

```sqlexample
CREATE APPLICATION hello_snowflake_app
  FROM APPLICATION PACKAGE hello_snowflake_package
  USING VERSION v1_0;
```

To create an app from a specific patch, use the
[CREATE APPLICATION](../../sql-reference/sql/create-application.md) command as shown in the following example:

```sqlexample
CREATE APPLICATION hello_snowflake_app
  FROM APPLICATION PACKAGE hello_snowflake_package
  USING VERSION v1_0 PATCH 2;
```

### Create an app based on a release directive

After specifying a release directive — either custom or default — in an application package, you can create an app based on that release directive.

To create an app based on a release directive, use the
[CREATE APPLICATION](../../sql-reference/sql/create-application.md) command as shown in the following example:

```sqlexample
CREATE APPLICATION hello_snowflake_app FROM APPLICATION PACKAGE hello_snowflake_package;
```

### Upgrade an app using a stage

To upgrade an app using files on a named stage, use the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md)
command, as shown in the following example:

```sqlexample
ALTER APPLICATION HelloSnowflake
  UPGRADE USING @CODEDATABASE.CODESCHEMA.AppCodeStage;
```

### Upgrade an app from a version or patch

To upgrade an app that was created using a specific a version or patch, use the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command as shown in the following example:

```sqlexample
ALTER APPLICATION HelloSnowflake
 UPGRADE USING VERSION "v1_1";
```

## Set an app as the active context

To set an app as the active context for a session, run the USE APPLICATION command, as shown in the following example:

```sqlexample
USE APPlICATION hello_snowflake_app;
```

> **Note:**
>
> To run this command, you must have the USAGE privilege granted on the app to your role.

## View the app in your account

To see a list of apps available to your account, use the [SHOW APPLICATIONS](../../sql-reference/sql/show-applications.md) command, as shown in
the following example:

```sqlexample
SHOW APPLICATIONS;
```

## View information about an app

To view details of an app, run the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command, as shown in
the following example:

```sqlexample
DESC APPLICATION hello_snowflake_app;
```

In development mode, this command displays the schemas allowed
by the consumer’s application roles.

In debug mode, this command displays all schemas in
an application package.

## Use development, debug, and session debug modes to test an app

With the Snowflake Native App Framework, providers can use the following modes to create an app and test its functionality:

Development mode
:   The provider can test the app from the consumer perspective. This means that
    the provider can only access objects to which the consumer has been granted privileges.

Debug mode
:   The provider can access all the objects within the app. In debug mode, the session’s primary
    role is used when modifying the state in the app.

Session debug mode
:   The provider can access objects within the app using either the privileges granted to the app
    or the setup script.

### About development mode

When you create an app locally from an application package by
specifying a version or
application files on a named stage,
the app is considered to be in development mode.

Use development mode to test and troubleshoot an app within a single account.
In development mode you can create and test an app based on a specific version of an
application package. You can also create and test an app using application files on a stage.
This enables you to quickly test changes to the setup script or application logic.

Development mode provides an additional debug mode that
you can use to view and test all of the objects within an app that a consumer would not be able to view.

In development mode, for example, running the SHOW or DESC commands on objects within the app will only
display those objects that the consumer has been granted permissions to view. However in DEBUG mode, you
can see all objects within the app.

### About debug mode

In debug mode, you can view and modify all of the objects within an app. Objects that are not visible to a
consumer, for example, objects not granted to a database role or shared content objects, are visible while in
this mode.

> **Note:**
>
> When you create objects, such as a table, in debug mode, the object will not have the same ownership
> as the app. If you need to create new objects while testing an app, use
> session debug mode.

Testing an app in debug mode requires the following:

* The app must be created in development mode, meaning it must be based on a specific version or
  files on a stage.
* You must explicitly enable debug mode on the app.

> **Note:**
>
> Debug mode can only be toggled on and off for an app created in development mode within
> the same account containing the application package.

To enable debug mode on an app, use the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md)
command as shown in the following example:

```sqlexample
ALTER APPLICATION hello_snowflake_app SET DEBUG_MODE = TRUE;
```

This command turns on debug mode for an app named `hello_snowflake_app`.
Similarly, to turn off debug mode, use the same command, as shown in the following example:

```sqlexample
ALTER APPLICATION hello_snowflake_app SET DEBUG_MODE = FALSE;
```

This command turns off debug mode for the app named `hello_snowflake_app`.

> **Note:**
>
> To run this command, you must have the OWNERSHIP privilege on the app.
> You must also have the DEVELOP privilege on the application package.
>
> Additionally, the app must be created in development mode and in the same account
> as the application package.

## Session debug mode

Session debug mode allows providers to view and modify all of the objects within the app and
execute statements using the same privileges that the app has when installed in the consumer account.
Objects that are not visible to a consumer, for example, objects that are not granted to an application
role, are also visible in session debug mode.

Unlike debug mode, session debug mode only applies to the current session to reduce security risks. You
must enable session debug mode for an app each time you
start a new session. Session debug mode also differs from debug mode in that it allows you to test an app
using the same privileges as the app or the setup script. To use these privileges, you can specify one of the
following when enabling session debug mode. For more information, see Enable session debug mode for an app.

* `AS_APPLICATION`: all statements are executed using the same privileges as the app has when it is created
  in the consumer account.
* `AS_SETUP_SCRIPT`: all statements are executed using the same privileges as the setup script has when it is
  run in the consumer account when an app is created or upgraded.

When a provider creates objects, such as a table, using session debug mode, the object is created with
the same privileges as the app.

### Privileges required to use session debug mode

Using session debug mode to view objects in an app has the following requirements:

* The app must be created in development mode, which requires the
  app to be created based on a specific version or based on files located on a stage.
* The app must be in the same account as the application package on which the app is based.
* You must have the OWNERSHIP privilege on the app.
* You must have the DEVELOP privilege on the application package.

> **Note:**
>
> Session debug mode can only be used in the session in which debug mode is set. For example if you enter
> debug mode in a worksheet, then open the app in a second worksheet, the app in the second worksheet
> is not in session debug mode.

### Enable session debug mode for an app

To enable session debug mode on an app in the current session, use the
[SYSTEM$BEGIN_DEBUG_APPLICATION](../../sql-reference/functions/system_begin_debug_application.md)
system function as shown in the following example:

```sqlexample
SELECT SYSTEM$BEGIN_DEBUG_APPLICATION('hello_snowflake_app');
```

This function enables session debug mode for the app named `hello_snowflake_app`.

You can also enable session debugging by specifying the execution mode for the app
as shown in the following example:

```sqlexample
SYSTEM$BEGIN_DEBUG_APPLICATION( 'hello_snowflake_app', execution_mode ='AS_APPLICATION')
```

This function sets the execution mode of the `hello_snowflake_app` app to `AS_APPLICATION`.
This mode executes all statements using the same privileges as the app has when created in the
consumer account.

### View the session debug status for an app in the current session

To view the session debug status in the current session, use the
[SYSTEM$GET_DEBUG_STATUS](../../sql-reference/functions/system_get_debug_status.md) system function, as
shown in the following example:

```sqlexample
SELECT SYSTEM$GET_DEBUG_STATUS();
```

### Disable session debug mode for an app

To disable session debug mode for an app in the current session, use the
[SYSTEM$END_DEBUG_APPLICATION](../../sql-reference/functions/system_end_debug_application.md) system
function, as shown in the following example:

```sqlexample
SELECT SYSTEM$END_DEBUG_APPLICATION();
```

## Disable redaction of provider data when testing an app

Within an app, information is redacted from the query profile and query history to hide implementation details
about the app from the consumer. See [Protect provider intellectual property](redacted-content.md).

When testing an app locally, you can disable redaction of provider data from the query
profile and query history.

> **Note:**
>
> When session debug mode is used, all objects and data within an app are visible to the provider, even if the information is
> redacted for the consumer. For example, information returned by the [SHOW APPLICATIONS](../../sql-reference/sql/show-applications.md) and
> [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) commands is not redacted when session debug mode is used.

### Privileges required to disable redaction of provider data when testing an app

Disabling redaction of provider data for an app requires the following privileges:

* The app must be created in development mode, meaning it must be based on a specific version or files on a stage.
* The app must be created within the same account containing the application package.
* You must have the OWNERSHIP privilege on the app.
* You must have the DEVELOP privilege on the application package.

### Disable information redaction of provider data

To disable information for an app, use the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command as shown in the following example:

```sqlexample
ALTER APPLICATION hello_snowflake_app SET DISABLE_APPLICATION_REDACTION = TRUE;
```

This command disables redaction of provider data for an app named `hello_snowflake_app`.

To enable redaction of provider data, use the same command as shown in the following example:

```sqlexample
ALTER APPLICATION hello_snowflake_app SET DISABLE_APPLICATION_REDACTION = FALSE;
```

## Test event sharing in development mode

Providers use development mode to install and test an app that uses
[logging and event tracing](event-about.md).
Providers can set up an event table locally in their development account,
install the app in development mode, and view the events and logs that the app emits and those
that are shared back with the provider.

> **Note:**
>
> To test event sharing in development mode, the app must
> [define event definitions](event-definition.md) in
> the manifest file.

### Differences in development mode

In development mode, apps are created based on one of the following:

* Files uploaded to a stage.
* Versions or patches defined in the application package.

When testing event sharing locally in development mode, there are differences in
behavior from apps created from a listing.

* The MANAGE EVENT SHARING global privilege is not required to enable event sharing.
* Shared events are collected in local event tables. In the local event table, providers can
  see two entries for one event:

  + The event that the app emits on the consumer side when the app is installed.
  + The event that is shared with the provider.

### Test event sharing in development mode

1. Configure the app to [use logging and event tracing](event-about.md).
2. [Set up an event table](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#label-nativeapps-consumer-logging-setting-up)
   in the local development account.
3. Create the app locally by running one of the following commands:

   ```sqlexample
   CREATE APPLICATION hello_snowflake_app
     FROM APPLICATION PACKAGE hello_snowflake_package
     USING @path_to_staged_files
     AUTHORIZE_TELEMETRY_EVENT_SHARING = TRUE;

   CREATE APPLICATION hello_snowflake_app
     FROM APPLICATION PACKAGE hello_snowflake_package
     USING VERSION v1_0
     PATCH 0
     AUTHORIZE_TELEMETRY_EVENT_SHARING = TRUE;
   ```
4. [View the log messages and trace events in the event table](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#view-the-log-messages-and-trace-events-in-the-event-table).

---
title: Inter-app Communication
source: https://docs.snowflake.com/en/developer-guide/native-apps/inter-app-communication.md
section: Native Apps Framework
---

# Inter-app Communication

This topic describes how one Snowflake Native App can communicate with another Snowflake Native App
using inter-app communication (IAC).

## Inter-app Communication: Overview

Inter-app communication (IAC) allows a Snowflake Native App to provide additional
functionality to other Snowflake Native Apps in the same consumer account by providing access to functions and procedures
that other apps can call.

For example, a Snowflake Native App that resolves customer IDs can help other Snowflake Native Apps
enhance customer data by joining data sets from different vendors.

IAC provides the infrastructure for two or more independent apps to communicate
with each other while respecting their needs for management and security.
App developers enable IAC for their app by doing the following:

* Creating interfaces
* Using app roles to control access to the interfaces.
* Choosing synchronous or asynchronous interaction. Synchronous interaction uses
  stored procedures or functions that other apps can call directly,
  while asynchronous interaction provides access to request results that are stored in tables or views,
  which other apps can poll to check for results.

## Terminology

IAC uses the following terms:

Client
:   The app that initiates the connection request and calls the server app’s functions and procedures.

Server
:   The app that provides access to its functions and procedures using app roles.

Consumer
:   The user who installs the client and server apps.

Application configuration
:   A SQL object that the client app uses to request the name of the server app. IAC uses an application configuration of type `APPLICATION_NAME` to store the server app name.

Application specification
:   A SQL object that the client app creates to request a connection to the server app. IAC uses an application specification of type `CONNECTION`.
    For information about app specifications, see [Overview of app specifications](requesting-app-specs.md).

## Workflow for Inter-app communication

Establishing and using a connection involves a handshake process between the client app and the server app.

1. **Obtain app role names from the server app provider**: The client app provider coordinates with the server app provider outside of Snowflake to determine which server app roles to request in the connection specification.
2. Identify the target app: The client app creates a configuration definition object to request the name of the server app. The consumer detects incoming requests, and provides the server app name to the client app through the configuration object.
3. Request and approve a connection: The client app creates an application specification to request a connection to the server app, and the consumer approves the connection request.
4. Communicate with the Server App: The client app calls the server app’s procedures or functions.

### Identify the target app

Before a client app can communicate with a server app, it must first identify the exact name of the app. Because the consumer can choose a custom name for an app during installation, the client app must first identify the exact name of the server app.

The client app’s setup script creates a `CONFIGURATION DEFINITION` object to request this information.

The following example shows how the client app’s setup script creates a `CONFIGURATION DEFINITION` object to request the name of the server app:

```sqlexample
ALTER APPLICATION
  SET CONFIGURATION DEFINITION my_server_app_name_configuration
    TYPE = APPLICATION_NAME
    LABEL = 'Server App'
    DESCRIPTION = 'Request for an app that will provide access to server procedures and functions. The server app version must be greater than or equal to 3.2.'
    APPLICATION_ROLES = (my_server_app_role);
```

The following example shows how the consumer checks for incoming configuration definition requests:

```sqlexample
SHOW CONFIGURATIONS IN APPLICATION my_server_app_name;
```

This command returns results similar to the following:

```output
name                             | created_on              | updated_on              | type               | ...
my_server_app_name_configuration | 2026-02-09 10:00:00.000 | 2026-02-09 10:00:00.000 | APPLICATION_NAME   | ...
```

The consumer then uses the following command to provide the server app name:

```sqlexample
ALTER APPLICATION my_client_app_name
  SET CONFIGURATION my_server_app_name_configuration
  VALUE = MY_SERVER_APP_NAME;
```

### Request and approve a connection

Once the client app has the name of the server app, it creates an `APPLICATION SPECIFICATION` to request a connection to the server app. Note that the application role names are obtained through offline communication outside of snowflake.

The following example shows how to create an `APPLICATION SPECIFICATION` for a connection to the server app named `my_server_app_name`:

```sqlexample
ALTER APPLICATION SET SPECIFICATION my_server_app_name_connection_specification
  TYPE = CONNECTION
  LABEL = 'Server App'
  DESCRIPTION = 'Request for an app that will provide access to server procedures and functions. The server app version must be greater than or equal to 3.2.'
  SERVER_APPLICATION = MY_SERVER_APP_NAME -- server name obtained from Step 1
  SERVER_APPLICATION_ROLES = (my_server_app_role);
```

By creating the application specification, the client app is requesting to be granted the server app roles specified in the app specification.

> **Note:**
>
> The values given for `LABEL` and `DESCRIPTION` in the app specification must match the values given for `LABEL` and `DESCRIPTION` in the `CONFIGURATION DEFINITION` object created in Step 1. If the values do not match, the connection won’t display properly in Snowsight.

To create an efficient connection workflow, we recommend that the client app create the application specification in the [before_configuration_change](callbacks.md) synchronous callback. This callback is run when the `ALTER APPLICATION SET CONFIGURATION VALUE` command is run. For information about callbacks, see Callbacks. For an example setup script that creates the application specification in the [before_configuration_change](callbacks.md) synchronous callback, see Examples.

Once the client app has created the app specification, the consumer can review and approve or refuse the connection request.

#### Approving the connection request using SQL

The following example shows how the consumer approves the connection request using SQL:

```sqlexample
ALTER APPLICATION my_server_app_name
  APPROVE SPECIFICATION my_server_app_name_connection_specification
  SEQUENCE_NUMBER = 1;
```

#### Approving the connection request using Snowsight

To view and approve connection requests in Snowsight, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. Select the app. A section titled Application connections appears under Configurations. Each pending connection shows the name or label for the connection, a brief description of the connection, and a Review button.
3. Click the Review button. The details of the connection request appear.
4. Select the target app from Select from your apps.
5. Click Next. The following information appears:

   * A diagram showing that the client app will connect to the server app, and what roles the apps will use.
   * The details of the connection.
   * A subset of the server permissions that will be granted to the client app. For information about security considerations for IAC, see Security considerations.
   * An Approve Connection toggle switch. The switch is set to On.
6. To approve the connection, leave the toggle switch set to On, and click Save. The updated connection list appears showing the status of the connection.
7. To refuse the connection, switch the toggle switch to Off.
8. To exit the review page without approving or refusing the connection, click the Cancel button.

#### Post-approval

When the consumer approves the connection request, the Snowflake Native App Framework grants the requested server app roles to the client app. The approval
also grants USAGE on the client app to the server app. This allows the server app to be aware of what client apps are connected to it.

When the consumer approves the connection request, the following callbacks are triggered in the client and server apps, respectively:

* [after_server_connection_change](callbacks.md) is triggered in the client app
* [after_client_connection_change](callbacks.md) is triggered in the server app

These callbacks allow the server and client apps to perform additional actions when the connection is established.

For more information about approving application specifications, see the following topics:

* [ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION](../../sql-reference/sql/alter-application-sequence-number.md)
* [Approve app specifications](ui-consumer-app-spec.md)

### Communicate with the server app

Once the connection is established and the client app is granted the requested server app roles, the client app can communicate with the server app.

> **Note:**
>
> Before calling server app methods, the client app should retrieve the server app’s name at runtime from the approved application specification, to ensure it uses the correct name in case the server app is renamed. The following example shows how to retrieve the server app’s name at runtime:

```sqlexample
SHOW APPROVED SPECIFICATIONS ->>
  SELECT PARSE_JSON("definition"):"SERVER_APPLICATION"::STRING
  FROM $1
  WHERE "name" = 'MY_SERVER_APP_NAME_CONNECTION_SPECIFICATION';
```

The client app can communicate with the server app synchronously or asynchronously.

* Synchronous communication involves invoking the server app’s procedures or functions directly.
* Asynchronous communication involves using a queue stored in a data object, such as a table. For
  example, the server app can provide a procedure to insert records into a table as requests, which the server app then processes periodically. The client app can then use a different server-provided procedure to check the table for results.

The following example of a synchronous operation shows a client app calling a server app’s procedure using Python:

```python
session.call("server_app_name.customer_schema.get_customer_data", customer_id);
```

The following example of an asynchronous operation shows a client app calling a server app’s procedure using Python. The client app calls the server app’s procedure, which creates a request in a table, which the server app then processes. The client app can poll the table to check for updated records for results.

```python
session.call("server_app_name.customer_schema.request_customer_data_async", customer_id);
```

The client app can then poll the table to check for updated records for results:

```python
session.call("server_app_name.customer_schema.check_customer_data_requests_async", customer_id);
```

## Managing connections

To view existing connections in Snowsight, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app. All app connections are shown in a section titled Configurations. Below that section, there is a sub-section titled Application connections.
4. To modify a connection, click the pencil icon for the connection. You can change the following:

   * Which app is connected to the app
   * The approval status of the connection
5. To view the connected app, click the View app button.
6. To change security settings for the connection, click the gear icon.

## Security considerations

When approving a specification request, consumers should be aware that granting server app access to a client app can elevate the privileges the client app has. For example, if a server app has external access, the client app might gain indirect access to the Internet or other external resources through the server app. If the
server app is a client app of another server app, the client app may be able to access the resources of the other
server app through the first server app.

Consumers should inspect the capabilities and privileges of the server app before approving a connection.
Use an admin role (for example, `ACCOUNTADMIN`) to inspect the capabilities of the server.
Inspecting the server with a lower privileged role won’t reveal all of the server’s capabilities and privileges.
Consumers should note that the server app code is not visible to the consumer, and the server app permissions and capabilities can be changed after the consumer approves the connection.

Example SQL commands to inspect the capabilities and privileges of the server app include, but are not limited to the following:

* [SHOW GRANTS TO APPLICATION](../../sql-reference/sql/show-grants.md): This command lists what grants on the client app have
  been granted to the server app.
* [SHOW PRIVILEGES IN APPLICATION](../../sql-reference/sql/show-privileges.md): This command lists what
  potential account-level privileges could be granted to the client app.
* [SHOW REFERENCES IN APPLICATION](../../sql-reference/sql/show-references.md): This command lists
  references that the client app could potentially use without using grants.
* [SHOW SPECIFICATIONS IN APPLICATION](../../sql-reference/sql/show-specifications.md): This command lists the application
  specifications that the consumer has approved, including external access integrations (EAIs),
  security integrations, shares, listings, and connections.

## SQL Reference

The following SQL commands are used to manage inter-app communication.

* [ALTER APPLICATION SET SPECIFICATION](../../sql-reference/sql/alter-application-set-app-spec.md): Creates an app specification that the server app uses to grant access to its functions and procedures to the client app.
* [ALTER APPLICATION DROP SPECIFICATION](../../sql-reference/sql/alter-application-drop-app-spec.md): Deletes an app specification.
* [ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION](../../sql-reference/sql/alter-application-sequence-number.md): Approves or refuses an app specification request.
* [SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md): Lists all of the application specifications in an app.
* [DESCRIBE SPECIFICATION](../../sql-reference/sql/desc-specification.md): Describes the app specifications for an app.
* [ALTER APPLICATION SET CONFIGURATION DEFINITION](../../sql-reference/sql/alter-application-set-configuration-definition.md): Creates or updates an application configuration (a key-value pair) that requests the name of another application from the consumer.
* [ALTER APPLICATION DROP CONFIGURATION DEFINITION](../../sql-reference/sql/alter-application-drop-configuration-definition.md): Deletes an application configuration.
* [ALTER APPLICATION SET CONFIGURATION VALUE](../../sql-reference/sql/alter-application-set-configuration-value.md): Sets a value in an application configuration.
* [ALTER APPLICATION UNSET CONFIGURATION](../../sql-reference/sql/alter-application-unset-configuration.md): Unsets the value to the specified application configuration.
* [SHOW CONFIGURATIONS](../../sql-reference/sql/show-configurations.md): Lists all of the application configurations in an app.
* [DESCRIBE CONFIGURATION](../../sql-reference/sql/desc-configuration.md): Describes the details of an application configuration.
* [IS_CONFIGURATION_SET (SYS_CONTEXT function)](../../sql-reference/functions/is_configuration_set.md): Returns whether or not the configuration has a value set.
* [GET_CONFIGURATION_VALUE (SYS_CONTEXT function)](../../sql-reference/functions/get_configuration_value.md): Returns the current value of the configuration.
* [SHOW GRANTS TO APPLICATION](../../sql-reference/sql/show-grants.md): Lists all the privileges and database/application roles granted to the specified app.
* [SHOW GRANTS TO APPLICATION ROLE](../../sql-reference/sql/show-grants.md): Lists all the permissions that the application role has.
* [SHOW GRANTS OF APPLICATION ROLE](../../sql-reference/sql/show-grants.md): Lists all the roles and applications to whom the specified application role is granted.

## Callbacks

The Snowflake Native App Framework provides lifecycle callbacks to help manage the inter-app communication
workflow. These callbacks let an app react to changes in configurations, connections,
and specifications. To use callbacks, register them in the `lifecycle_callbacks`
section of the app’s manifest file.

For general information about callbacks, see [Callbacks](callbacks.md).

### Configuration callbacks

These callbacks are triggered when a configuration value is set or unset. A common use
case is to use the [before_configuration_change](callbacks.md)
callback to automatically create a connection specification when the consumer provides
the server app name.

[validate_configuration_change](callbacks.md)
:   A synchronous callback called as part of the `ALTER APPLICATION SET CONFIGURATION VALUE`
    command. Lets the app perform custom validation on the provided value. If the callback
    returns an error, the command fails and the new value is not set.

[before_configuration_change](callbacks.md)
:   A synchronous callback called as part of the `ALTER APPLICATION SET CONFIGURATION VALUE`
    and `ALTER APPLICATION UNSET CONFIGURATION` commands. Lets the app perform operations
    based on the configuration value before it is saved.

[after_configuration_change](callbacks.md)
:   An asynchronous callback called after the `ALTER APPLICATION SET CONFIGURATION VALUE`
    or `ALTER APPLICATION UNSET CONFIGURATION` commands complete. Lets the app react to
    the change, for example for notification or tracking purposes.

### Connection callbacks

These callbacks are triggered when a connection’s status changes, such as when a
connection is established, refused, dropped, or when the connected app is deleted.

[after_server_connection_change](callbacks.md)
:   An asynchronous callback triggered in the client app by any operation that impacts
    the connection state, including approving, refusing, or dropping a specification,
    or dropping the server app.

[after_client_connection_change](callbacks.md)
:   An asynchronous callback triggered in the server app by any operation that impacts
    the connection state, including approving, refusing, or dropping a specification,
    or dropping the client app.

[after_server_version_change](callbacks.md)
:   An asynchronous callback called in the client app after the server app’s version or
    patch number changes. Lets the client app react to an upgrade or downgrade.

## Examples

The following examples show how to configure an app to use inter-app communication.

* Example: Setup script and manifest files
* Example: Asynchronous communication between apps

### Example: Setup script and manifest files

The following example shows a client app’s setup script (`setup.sql`):

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA app_schema;

-- create a callback that creates the connection request before the config value of the server name is saved
CREATE OR REPLACE PROCEDURE app_schema.before_config_change_callback(config_name STRING, config_value STRING)
RETURNS STRING
LANGUAGE SQL
AS
$$
DECLARE
    spec_name VARCHAR;
    existing_target VARCHAR;
BEGIN
    IF (config_value IS NOT NULL AND config_name = 'MY_SERVER_APP_NAME_CONFIGURATION') THEN
        SHOW SPECIFICATIONS;
        SELECT PARSE_JSON("definition"):SERVER_APPLICATION::STRING
            INTO existing_target
            FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));

        IF(existing_target IS NOT NULL AND UPPER(existing_target) != UPPER(config_value)) THEN
            EXECUTE IMMEDIATE 'ALTER APPLICATION DROP SPECIFICATION CONNECTION_' || UPPER(existing_target);
        END IF;

        spec_name := 'CONNECTION_' || UPPER(config_value);
        EXECUTE IMMEDIATE
        'ALTER APPLICATION SET SPECIFICATION ' || spec_name || '
            TYPE = CONNECTION
            LABEL = ''Server App''
            DESCRIPTION = ''Request for an app that will provide access to server procedures and functions. The server app version must be greater than or equal to 3.2.''
            SERVER_APPLICATION = ' || config_value || '
            SERVER_APPLICATION_ROLES = (my_server_app_role)';
    END IF;
RETURN 'success';
END;
$$;

CREATE APPLICATION ROLE IF NOT EXISTS client_app_user;
GRANT USAGE ON SCHEMA app_schema TO APPLICATION ROLE client_app_user;
ALTER APPLICATION SET CONFIGURATION DEFINITION my_server_app_name_configuration
    TYPE = APPLICATION_NAME
    LABEL = 'Server App'
    DESCRIPTION = 'Request for an application that will provide access to server procedures and functions. The server app version must be greater than or equal to 3.2'
    APPLICATION_ROLES = (client_app_user);
```

The following example shows a client app’s manifest file (`manifest.yml`):

```yaml
manifest_version: 2

artifacts:
  setup_script: setup.sql

lifecycle_callbacks:
    before_configuration_change: app_schema.before_config_change_callback
```

Note the following about the preceding code example:

* In the [before_configuration_change](callbacks.md) callback, the app checks for an existing connection
  specification matching the configuration’s previous value, and drops it if it exists. The callback
  then creates a new connection specification for the newly provided server app name. Creating a new connection when the server name is set prevents duplicate connection specifications from being created.

### Example: Asynchronous communication between apps

The following example shows how to create procedures in a server app’s setup script (`setup.sql`) for asynchronous communication. The server app creates a processing queue table, and provides two procedures to client apps through an app role: `submit_request` to add a request to the queue and `fetch_response` to retrieve the result of a completed request. The server app periodically uses the `process_requests` procedure to process all pending requests.

```sqlexample
CREATE TABLE IF NOT EXISTS app_schema.processing_queue (
  request_id NUMBER AUTOINCREMENT,
  operation STRING,
  input STRING,
  status STRING DEFAULT 'PENDING',
  response STRING DEFAULT ''
);

CREATE OR REPLACE PROCEDURE app_schema.submit_request(operation STRING, input STRING)
RETURNS STRING
LANGUAGE SQL
EXECUTE AS OWNER
AS
$$
BEGIN
    INSERT INTO app_schema.processing_queue (operation, input) VALUES (:operation, :input);
    RETURN 'Request submitted successfully';
END;
$$;

CREATE OR REPLACE PROCEDURE app_schema.process_requests()
RETURNS STRING
LANGUAGE SQL
EXECUTE AS OWNER
AS
$$
DECLARE
    -- Cursor to find all PENDING requests
    c1 CURSOR FOR SELECT * FROM app_schema.processing_queue WHERE status = 'PENDING';
    result STRING;
BEGIN
    FOR request IN c1 DO

        IF (request.operation = 'OPERATION_X') THEN
            -- assuming there is a UDF func_x(input) to perform operation_x
            result := (SELECT func_x(:request.input));
        END IF;

        -- update the processing queue with the result
        LET stmt STRING :=
            'UPDATE app_schema.processing_queue SET status = 'DONE', response = ' ||
            result ||
            ' WHERE request_id = ' ||
            request.request_id;
        EXECUTE IMMEDIATE (:stmt);

    END FOR;

    RETURN 'Processed pending requests.';
END;
$$;

CREATE OR REPLACE PROCEDURE app_schema.fetch_response(operation STRING, input STRING)
RETURNS STRING
LANGUAGE SQL
EXECUTE AS OWNER
AS
$$
BEGIN
    LET res STRING := (SELECT response FROM app_schema.processing_queue WHERE operation = :operation AND input = :input);
    RETURN res;
END;
$$;

CREATE APPLICATION ROLE IF NOT EXISTS my_server_app_role;
GRANT USAGE ON SCHEMA app_schema TO APPLICATION ROLE my_server_app_role;
GRANT USAGE ON PROCEDURE app_schema.submit_request(string, string) TO APPLICATION ROLE my_server_app_role;
GRANT USAGE ON PROCEDURE app_schema.process_requests() TO APPLICATION ROLE my_server_app_role;
GRANT USAGE ON PROCEDURE app_schema.fetch_response(string, string) TO APPLICATION ROLE my_server_app_role;
```

---
title: Manage apps
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-managing-applications.md
section: Native Apps Framework
---

# Manage apps

This topic describes how to manage a Snowflake Native App after it is installed in a consumer
account.

## View installed Snowflake Native Apps and Streamlit apps

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.

   A list of installed applications and Streamlit apps appears in the Installed Apps list.

## View the readme file for an app

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Select the About the app tab.

## Grant application roles to account roles

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Select the Access management tab.
6. In the Account roles with access pane select Add.
7. Select a role in the Account roles list.
8. Select Close.

## Use a SQL command to grant application roles to account roles

To grant an application role to an account role in the consumer account using SQL commands,
use GRANT APPLICATION ROLE of the GRANT DATABASE ROLE command as
shown in the following example:

```sqlsyntax
GRANT APPLICATION ROLE hello_snowflake_app.app_public TO ROLE data_manager;
```

## Launch an app

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select an app.
4. Select the Settings icon in the toolbar.
5. Click Launch App.

## Use custom budgets to monitor credit usage for an app

[Budgets](../../user-guide/budgets.md) allow you to define a monthly spending limit on the
[compute costs](../../user-guide/cost-understanding-compute.md) for an app. You can create
and configure a custom budget to monitor the credit usage for the objects owned by the app that consume credits.

When you add an app to a custom budget, the objects that are owned by the app and that consume credits are added to the
custom budget automatically. These include the warehouses and compute pools that are owned by the app.

Warehouses and compute pools that are **shared** are not tracked by the custom budget automatically, although
you can add these to the custom budget manually. When you create a custom budget for an app, you cannot add objects created
and owned by an app to a separate custom budget. However, you can add warehouses and compute pools that are shared to a separate
custom budget.

### Set up the required role to create a custom budget for an app

To create or edit a custom budget for an app, you must use a role that has the correct privileges. See
[Custom budgets](../../user-guide/budgets/custom-budget.md).

### Create a custom budget for an app in Snowsight

You can create or edit a custom budget for an app directly from the app configuration page. You can also do it from the Budgets tab
in Snowsight (see [Use Snowsight to create a custom budget](../../user-guide/budgets/custom-budget.md)).

To create a custom budget for an app from the app configuration page, follow these steps:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app whose custom budget you want to view.
4. Select the Cost management tab.
5. Select Create Budget.
6. Select Budget.
7. Enter a Budget name.
8. Select the database and schema in which to create your budget.
9. Enter the Spending limit.
10. Enter the email addresses to receive notifications.

    > **Note:**
    >
    > Each email address added for custom budget notifications must be
    > [verified](../../user-guide/notifications/email-notifications.md). The
    > notification email setup fails if any email address in the list is *not* verified.
11. Select Resources to monitor.
12. Select the app to add to the custom budget.

    * To add an app, expand Native Apps to select an app.
    * To add a database, expand Databases to select a database.
    * To add objects in a schema, expand the schema to list available objects. Expand the object category
      (for example, Tables or Tasks) to select objects.
    * To add a warehouse, expand Warehouses to select a warehouse.
    * To add a compute pool, expand Compute Pools to select a compute pool.
    > **Note:**
    > * When you select a database or schema, all [supported objects](../../user-guide/budgets/custom-budget.md)
    >   (for example, tables) contained within the database or schema are also added to the custom budget.
    > * You can only add an object to one custom budget. If an object is currently included in one custom
    >   budget and you add that object to a second custom budget, Snowflake removes the object from the first
    >   custom budget without issuing a warning.

### Create a custom budget for an app by using SQL

To create a custom budget for an app by using SQL, see
[Use SQL commands to create a custom budget](../../user-guide/budgets/custom-budget.md).

## Monitor an app

By default, an app owner can use different SQL commands to view information about an app
in the consumer account. To allow other roles in the consumer account to use these commands,
you can delegate the MONITOR privilege to another role.

```sqlsyntax
GRANT MONITOR ON APPLICATION hello_snowflake_app TO ROLE data_analyst;
```

You can also grant the MONITOR privilege on the app to another app as shown in the following example:

```sqlsyntax
GRANT MONITOR ON APPLICATION hello_snowflake_app TO APPLICATION another_app;
```

The MONITOR privilege allows the role to run the following commands:

* [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md)
* [SHOW REFERENCES](../../sql-reference/sql/show-references.md)
* [SHOW OBJECTS OWNED BY APPLICATION](../../sql-reference/sql/show-objects-owned-by-application.md)
* [SHOW SPECIFICATIONS](../../sql-reference/sql/show-specifications.md)
* [DESCRIBE SPECIFICATION](../../sql-reference/sql/desc-specification.md)

## What to do if an app is unavailable

To check the status of an app, run the
[SHOW APPLICATIONS](../../sql-reference/sql/show-applications.md) command and determine the
`upgrade_status` value. When an app is unavailable, the
[DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command
fails and the error message provides information about why the app is unavailable.

The following table lists the reasons an app is unavailable and methods for resolving
the issue:

| Reason | Possible resolution |
| --- | --- |
| Snowflake disabled the app. | Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) |
| The consumer account is inactive after being locked or suspended. | The app is re-enabled when the account is restored. |
| The version of the app was dropped from the application package in the provider account. | In this situation the app is no longer usable and must be uninstalled and reinstalled from a current listing. |
| The consumer exceeded the usage limit for a [usage based trial](../../collaboration/consumer-listings-exploring.md). | See [Trial a listing](../../collaboration/consumer-listings-exploring.md) for possible options. |
| The app was installed from a paid listing, but payment information was not provided or is not current. | Pay for the listing. See [Pay for listings](../../collaboration/consumer-listings-paying.md) for more information. |
| The trial duration of the listing has exceeded. | Contact the app provider. |

## Uninstall a Snowflake Native App

You can uninstall an app using Snowsight or by running SQL commands.

To uninstall an app, you must use a role that has the OWNERSHIP privilege on the
app. See [GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md).

To transfer ownership of objects owned by the app that exist outside the app, you
must use a role that has the MANAGE GRANTS privilege on the objects. See
[Access control considerations](../../user-guide/security-access-control-considerations.md).

### Uninstall an app in Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Next to the app, select …, then select Uninstall.

   If the app created objects that exist outside the app, a dialog appears showing
   a list of the objects and their types.

   1. Select one of the following:

      * Yes, transfer selected objects to a role.

        If you select this option, choose a role from the list. This role becomes the
        new owner of the object.

        > **Caution:**
        >
        > When using Snowsight, only the following objects owned by the Snowflake Native App
        > can be transferred to a different role:
        >
        > + Database
        > + Schema
        > + Table
        > + Views
      * No, delete all objects created outside the app.

        If you select this option, the objects will be deleted when the app is
        uninstalled.
4. Select Uninstall.

### Use SQL commands to uninstall an app

1. Use the `SHOW OBJECTS OWNED BY APPLICATION` command to view the objects owned by
   the Snowflake Native App that exist outside the app as shown in the following example:

   ```sqlexample
   SHOW OBJECTS OWNED BY APPLICATION hello_snowflake_app;
   ```

   This command shows a list of objects and their types.
2. Optionally, to transfer ownership of an object to a different role, use the
   [GRANT OWNERSHIP](../../sql-reference/sql/grant-ownership.md) command as shown in the following example.

   ```sqlexample
   GRANT OWNERSHIP ON DATABASE na_external_db TO ROLE consumer_role;
   ```
3. To delete the app, run the
   [DROP APPLICATION](../../sql-reference/sql/drop-application.md) command as shown in the
   following example:

   ```sqlexample
   DROP APPLICATION hello_snowflake_app CASCADE;
   ```

   > **Note:**
   >
   > If you do not transfer ownership of the objects owned by the app to a different role, you must used the `CASCADE` option. If objects owned
   > by the app still exist you can’t drop the app without using the `CASCADE` option.

---
title: Overview of app specifications
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-app-specs.md
section: Native Apps Framework
---

# Overview of app specifications

This topic describes how a provider can configure a Snowflake Native App to use
app specifications to request controlled access from consumers. App specifications
allow consumers to review and approve or decline requests for the following
actions:

* Connections to external endpoints outside of Snowflake
* Authentication with third-party services
* Data sharing with other Snowflake accounts

## Types of controlled access for Snowflake Native Apps

Snowflake Native Apps often need to interact with resources beyond the consumer’s Snowflake account.
These interactions can include connecting to external services, authenticating with third-party
providers, or sharing data with other Snowflake accounts.

To access external services and share data, Snowflake provides the following objects:

External access integrations:
:   Allow secure access to external network endpoints within a user-defined function or stored
    procedure. External access integrations use network rules to restrict access to specific external
    network locations.

Security integrations:
:   Allow secure access to third-party authentication providers such as OAuth. Security integrations
    provide secure authentication and access control.

Shares and listings:
:   Allow apps to share data back to providers or third-party Snowflake accounts.
    Shares contain database objects to be shared, and listings provide the
    mechanism to share data across accounts and regions.

When using [automated granting of privileges](requesting-auto-privs.md), an
app has the required privileges to create these objects when running the setup
script. However, because these objects enable external connections or data
sharing, consumers must approve these operations when configuring the app.

Using automated granting of privileges with app specifications has the following benefits:

* Consumers do not have to manually create integrations, shares, or listings required by the app
  and approve access to them using references.
* Providers do not have to write code that checks for the existence of the required privileges and
  objects during installation or upgrade.
* Consumers have clear visibility and control over external connections and data sharing requests.

## Use app specifications for consumer approval

App specifications allow you to specify what controlled access the app
requires. After the consumer installs the app, they review the app specification and
approve or decline each request as necessary. This includes requests for external connections,
authentication integrations, and data sharing permissions.

* For information about using app specifications to request access to external
  endpoint access, see [Request external access integrations (EAIs) with app specifications](requesting-app-specs-eai.md).
* For information about
  using app specifications to request access to OAuth integrations, see
  [Request security integrations with app specifications](requesting-app-specs-sec-integ.md).
* For information about using app specifications to share data through listings, see
  [Request data sharing with app specifications](requesting-app-specs-listing.md).

## App specification definition

An app specification definition contains the properties that are required for the app to perform
controlled operations such as external connections or data sharing. These properties are displayed
to the consumer for approval. The app specification definition contains a subset of the metadata and
properties specific to each type of operation: external access integration, security integration, or listing.

For information about the app specification definition for security integrations, see
[App specification definition for security integrations](requesting-app-specs-sec-integ.md).

For information about the app specification definition for external access integrations, see [App specification definition for an EAI](requesting-app-specs-eai.md).

For information about the app specification definition for listings, see [Create an app specification for a listing](requesting-app-specs-listing.md).

## Sequence numbers of an app specification

The sequence number is similar to a version number for the app specification. Sequence numbers
are automatically incremented when a provider changes the definition of the app specification.
The definition of an app specification includes configuration details and other required information.
Fields that are not part of the definition, such as `description`, do not trigger an update to the
sequence number.

Sequence numbers allow providers and consumers to identify different versions of an app specification.
For example, if a provider adds a new configuration detail to the app specification definition,
the sequence number is incremented. When the consumer views the app specification, they can see that
the sequence number has changed, and they can review the updated app specification.

## Best practices when using app specifications

[Automated granting of privileges](requesting-auto-privs.md) ensures that the app has the required
privileges to create objects like external access integrations, security integrations, or listings. However,
consumers can choose to decline the app specification that enables external connections or data sharing.
When developing an app, you must account for situations where app specifications might not be approved.

Consider the following scenarios:

* An app might request multiple network ports for an external access integration, but the consumer might
  allow only one. The app should include logic to handle errors that occur if a network port is not available.
* A data sharing request might be declined or only partially approved for some target accounts but not others.
  The app should gracefully handle these cases.
* Authentication integrations might be rejected, requiring the app to use
  alternative methods.

As a best practice, always include proper error handling and provide clear feedback to consumers about
which features require approved specifications to function.

## Using callback functions with app specifications

In some contexts, an app might need to know when the consumer has approved or declined an
app specification. For example:

* The app might need to wait until an external access specification is approved before making API calls.
* Data population might need to start only after a listing specification is approved.
* OAuth flows might need to be initialized after security integration approval.

To handle this situation, the Snowflake Native App Framework provides a mechanism that allows provider to define a callback
stored procedure that runs when the consumer approves or declines an app specification.

Providers can add a stored procedure to the manifest file as shown in the following example:

```yaml
lifecycle_callbacks:
  specification_action: callbacks.on_spec_update
```

This example shows how to add a stored procedure named `callbacks.on_spec_update` to the manifest
file. In the setup script, providers can add a stored procedure as shown in
the following example:

```sqlexample
CREATE OR REPLACE PROCEDURE callbacks.on_spec_update (
  name STRING,
  status STRING,
  payload STRING)
  ...
```

This example shows the signature of a stored procedure called `callbacks.on_spec_update`.
You include the code in the body of this procedure to check the status of the app specification, create objects, and perform actions as required.

---
title: Overview of app versions and upgrades (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app-overview.md
section: Native Apps Framework
---

# Overview of app versions and upgrades (Legacy)

This topic provides information about how versions, patches and upgrades work in the
Snowflake Native App Framework.

## About app versions and patches

The Snowflake Native App Framework allows providers to create versions and patches of an app. Versions and patches allow providers to
release new functionality and updates to consumers.

Version
:   Generally contains major updates to a Snowflake Native App. Versions generally introduce new features and changed
    functionality for an app.

Patch
:   Generally contains smaller updates to a Snowflake Native App. Unlike versions, patches should only contain small
    updates such as security fixes.

The versions and patches of an app are specified in the application package.

> **Caution:**
>
> An app can only have two active versions at one time. Each version of an app can have up to 130 patches.

To add a new version to an application package that currently has two versions defined, providers must remove one of
the existing versions. To remove a version, a provider must:

1. Ensure that all consumers have upgraded off the version to be removed.
2. Remove the version from the application package.
3. Create a new version.
4. Upgrade the app.

> **Caution:**
>
> Although an app might be upgraded in the consumer account, the previous version of the app might still have code that is
> running. Providers cannot remove the previous of the app from the application package until all running code from the
> previous version has completed. This applies to all installed versions of the app across all consumer accounts. If a single
> upgrade fails, providers must fix the reason for the upgrade failure before they can remove the version.

Although an application package can only contain two active versions at one time, a single version can have multiple patches.
The Snowflake Native App Framework does not support dropping patches. When a provider adds a new version to an application package, the new version is
automatically assigned patch 0 by default. This cannot be changed.

When a provider adds a new patch to a version, they can manually specify the identifier for the patch. If no patch number is
provided, Snowflake automatically increments the patch version by 1.

> **Note:**
>
> Each version and patch must have its own setup script and application files versions.

### Upgrading versions and patches

When a provider publishes a new version of an app, the Snowflake Native App Framework ensures that only the previous version of the
app is active. For example, if a provider has published versions v1 and v2 of an app, the Snowflake Native App Framework ensures that only v2 is
currently installed in a consumer account before upgrading to v3. This requires that all installed apps using version
v1 are migrated to version v2.

This ensures that the setup script of the app only has to account for differences between v2 and v3. The setup script is
only backwards compatible with the most recent version of the app. If a provider makes a state change to the app, for example
creating a new table or adding columns to a table, providers only have to ensure that there are no compatibility issues between
two versions.

In contrast, when a provider creates a new patch for a version of an app, the Snowflake Native App Framework does not enforce any
restrictions on the number of active patches running. Providers must avoid making changes to the state of
an app in a patch to avoid incompatibility across multiple patches.

## Stateful and stateless objects

For information on stateful and stateless objects in a Snowflake Native App, see
[Use versioned schema to manage app objects across versions](versioned-schema.md).

## About versioned schemas

For information on using versioned schema in a Snowflake Native App, see
[Use versioned schema to manage app objects across versions](versioned-schema.md).

## About app upgrades

The Snowflake Native App Framework allows providers to upgrade an app to a new version or patch. To see how
upgrades fit in the overall workflow for developing a new version or patch of an app, see
[Workflow for updating an app](update-app.md).

Providers can initiate an upgrade of an app to a new version or patch by setting a release directive
on the application package. When the release directive is modified, Snowflake automatically upgrades
all installed instances of the current version of the app to the version specified by the release directive.

When the provider initiates an upgrade, Snowflake adds each app to be upgraded to a queue. Each
app is upgraded as resources are available. The upgrade process can take a while to complete across all
installed versions of the app. To expedite the upgrade process, consumers can also manually initiate an upgrade
of an app when a new version or patch is available.

> **Note:**
>
> After the upgrade process begins for their app, consumers can no longer manually upgrade the app.

For more information, see [Upgrade an app (Legacy)](update-app-upgrade.md).

### Upgrades across regions

See [Upgrade an app across regions](release-channels-upgrade.md) for information on upgrading an app installed
across regions using Cross-Cloud Auto-Fulfillment.

## Lifecycle of app version and patches

To understand how app versions and patches work together, consider a scenario where a provider
has published an initial version, v1, of an app and consumer A and consumer B have installed that
version of the app in their accounts.

This scenario is shown in the following sections.

### Version v1.0 is installed in the consumer account

Figure 1 shows version `v1.0` of an app that a provider published and two consumers have
installed the app in their accounts:

This figure shows the following:

* The application files for `v1.0` are stored in a stage.
* The release directive of the application package is set to `v1.0`.
* Consumers have installed `v1.0` in their account.
* The provider has begun development of version v2.0 in their account.

### Add version v2.0 to the application package

Figure 2 shows that the provider has uploaded version `v2.0` and created a new
version in the application package:

This figures shows the following:

* After testing version `v2.0` of the app locally, the provider uploads the `v2.0` file to the stage
* The provider creates a new version for the app in the application package.
* The release directive continues to point to version `v1.0` of the app.
* Consumers continue to have version `v1.0` installed in their account.

### Upgrade the app from version v1.0 to version v2.0

To perform an upgrade from version `v1.0` to version `v2.0` of the app, the provider sets the
[release directive](../../sql-reference/sql/alter-application-package-release-directive.md) of the application
package to version `v2.0`. This starts the process of upgrading the app in the consumer
accounts.

After the upgrade completes, both consumers A and B have version v2.0 installed in their accounts as shown in the
Figure 3 diagram.

Also, in this scenario the provider has begun developing and testing version v3.0 in their local development environment.

### Drop version v1.0 to be able to create v3.0

When testing is complete, the provider uploads version `v3.0` to the stage. When the provider wants to begin the upgrade to
version `v3.0`, they must first ensure that all consumers have migrated off of version `v1.0`.

In the scenario shown in the previous section, all consumers are currently on `v2.0`.

The provider must drop version `v1.0` from the application package as shown in Figure 4:

### Add version `v3.0` to the application package

After dropping version `v1.0`, the provider can then add version `v3.0` to the application package. In this context, the
release directive is still pointing to `v2.0` and consumers have `v2.0` installed in their account.

### Upgrade to version `v3.0`

To upgrade to `v3.0`, the provider updates the release directive to point to `v3.0`. This begins the upgrade. When the upgrade
is complete, consumers are upgraded to version `v3.0` as shown in the following figure:

---
title: Pause connector
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/pause_connector.md
section: Native Apps Framework
---

# Pause connector

Pausing the connector is available after the wizard. It can be executed after the `Finalize Configuration` step. This
step allows user to manipulate the status of the connector after it is launched. The entry point for this phase is a procedure
called `PUBLIC.PAUSE_CONNECTOR()`. It can be customized by replacing it in SQL or by using `PauseConnectorHandlerBuilder`.
The reverse process of pausing the connector, allowing user to restart it, is [Resume connector](resume_connector.md).

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The pause connector step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Privileges validation
2. Status validation
3. State validation
4. Status update (PAUSING)
5. Internal callback
6. Pausing of Task Reactor (if Task Reactor is enabled)
7. Status update (PAUSED)

## Requirements

Pause connector requires at least the following SQL files to be executed during native app installation:

* `core.sql`
* `configuration/app_config.sql`
* `lifecycle/pause.sql`
* Recommended: `configuration/finalize_configuration.sql`

## Privileges validation

To pause the connector, the `EXECUTE TASK` privilege must be granted to the application.

This validation cannot be overwritten by using `PauseConnectorHandlerBuilder` nor by overwriting a stored procedure.
However, it is possible to implement a custom handler.

## Status validation

To pause the connector the internal status of the connector needs to be `STARTED`.

This validation cannot be overwritten by using `PauseConnectorHandlerBuilder` nor by overwriting stored procedure.
However, it is possible to implement a custom handler.

## State validation

In case there are some additional custom validations that need to be satisfied there is a `PUBLIC.PAUSE_CONNECTOR_VALIDATE()`
stored procedure, which can be customized by the user. By default, this procedure just returns `'response_code': 'OK'`.
The procedure can be customized by overwriting through the SQL or by using `PauseConnectorHandlerBuilder` and providing custom implementation of the
`PauseConnectorStateValidator` interface.

## Internal callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.PAUSE_CONNECTOR_INTERNAL()`, which returns `'response_code': 'OK'`.
This procedure allows the user to perform any additional duties needed when pausing the connector. For example, pausing additional connector specific tasks.
It can be overwritten through the SQL script or by using a `PauseConnectorHandlerBuilder` to provide custom implementation of the `PauseConnectorCallback` interface.

## Status update

When all the above phases are completed successfully the internal status of the Connector will be updated to:

```json
{
    "status": "PAUSED",
    "configurationStatus": "FINALIZED"
}
```

For the whole diagram of state transitions, see [Connector flow](overview.md).

### Response

#### Successful response

When the procedure successfully pauses all tasks in the background and changes its status to PAUSED, then the `Connector successfully paused.`
message will be returned directly from the `PauseConnectorHandler` method body. It is recommended to use the following format:

> ```json
> {
>   "response_code": "OK"
> }
> ```

#### Error response

In case of an error the response will follow the below format:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "error message"
> }
> ```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - The procedure was called on connector with state different than `[STARTED, PAUSING]`
* `CONNECTOR_STATUS_NOT_FOUND` - Connector status record does not exist in database (independent of user’s input at this stage - an internal error)
* `ROLLBACK_CODE` - An error occurred, but the changes were successfully reverted.
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive
* `UNKNOWN_ERROR_CODE` - An unknown error occurred and the connector is now in an unspecified state

---
title: Pause connector reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/pause_connector_reference.md
section: Native Apps Framework
---

# Pause connector reference

## Database objects and procedures

The following database objects are created through the file `lifecycle/pause.sql`.

### PUBLIC.PAUSE_CONNECTOR()

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `PauseConnectorHandler.pauseConnector`.

### PUBLIC.PAUSE_CONNECTOR_VALIDATE()

Procedure used for connector specific validation of pausing process. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPauseConnectorStateValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.PAUSE_CONNECTOR_INTERNAL()

Procedure used for connector specific additional pausing duties. By default, it returns `'response_code': 'OK'`.
It is invoked by `InternalPauseConnectorCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

The pause connector is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))
* `configuration/finalize_configuration.sql` (See [Finalize configuration reference](finalize_configuration_reference.md))

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.lifecycle` package and some common components are tightly connected with the above procedures:

* `PauseConnectorHandler`
* `PauseConnectorStateValidator`
* `PauseConnectorCallback`
* `ConnectorStatusService`
* `LifecycleService`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of `PauseConnectorHandler`, the `PUBLIC.PAUSE_CONNECTOR` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.PAUSE_CONNECTOR()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.custom.handler.CustomPauseConnectorHandler.pauseConnector';

GRANT USAGE ON PROCEDURE PUBLIC.PAUSE_CONNECTOR() TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal `VALIDATE` and `INTERNAL` procedures can be also customized through the SQL. They can also invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.PAUSE_CONNECTOR_INTERNAL()
RETURNS VARIANT
LANGUAGE SQL
EXECUTE AS OWNER
AS
BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
END;

CREATE OR REPLACE PROCEDURE PUBLIC.PAUSE_CONNECTOR_VALIDATE()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.custom.handler.CustomPauseConnectorInternalHandler.pauseConnector';
```

### Builder approach

`PauseConnectorHandler` can be customized using `PauseConnectorHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `PauseConnectorStateValidator`
* `PauseConnectorCallback`
* `ConnectorErrorHelper`

In case a function is not provided the default implementation provided by the SDK will be used.

```java
class CustomPauseConnectorStateValidator implements PauseConnectorStateValidator {
    @Override
    public ConnectorResponse validate() {
        // CUSTOM LOGIC
        return ConnectorResponse.success();
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the PUBLIC.PAUSE_CONNECTOR procedure using SQL
    public static Variant pauseConnector(Session session) {
            //Using builder
        var handler = PauseConnectorHandlerBuilder.builder(session)
            .withStateValidator(new CustomPauseConnectorStateValidator())
            .build();
        return handler.pauseConnector().toVariant();
    }
}
```

---
title: Prerequisites
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/prerequisites.md
section: Native Apps Framework
---

# Prerequisites

The prerequisites step is the first step of the wizard phase of the connector. This step is completely optional,
but it is recommended, if the end user needs to perform some additional setup outside of the native app or even whole Snowflake context.
An example of this could be setting up authentication and authorization in the source system from which the data will be ingested.

To setup prerequisites they must be inserted to the `STATE.PREREQUISITES` table during the connector installation.
Most of the columns in that table should be self-explanatory. The URL columns should be used to provide
the end user with more information on the required setups. In case there is a need to provide something more
custom in the prerequisites the `custom_properties` column should be used.

The prerequisites phase consists of 2 steps:

1. Marking prerequisites as done
2. Completing the step

## Requirements

Prerequisites require at least the following sql files to be executed during native app installation:

* `core.sql`
* `configuration/prerequisites.sql`

### Marking prerequisites as done

This step can be achieved in two different ways. Either prerequisites can be marked one by one as completed or all of them together.
The end result is the same, each of the prerequisites has its `is_completed` value set to `true`.
This step is handled by the following procedures:

* PUBLIC.MARK_ALL_PREREQUISITES_AS_DONE()
* PUBLIC.UPDATE_PREREQUISITE(ID VARCHAR, IS_COMPLETED BOOLEAN)

Both of those procedures require the connector to be in the `CONFIGURING` status and the configuration status to not be `FINALIZED`.

## Completing the step

To complete the prerequisites step call `PUBLIC.COMPLETE_PREREQUISITES_STEP()` procedure.
This procedure has no effect unless the connector is in the `CONFIGURING` status with configuration status `INSTALLED`.

If that’s the case then the status will be updated to the following value:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "PREREQUISITES_DONE"
}
```

This procedure requires the connector to be in the `CONFIGURING` status and the configuration status to not be `FINALIZED`.

---
title: Prerequisites SQL Reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/prerequisites_reference.md
section: Native Apps Framework
---

# Prerequisites SQL Reference

## Database objects and procedures

The following database objects are created through the file `prerequisites.sql`.

### STATE.PREREQUISITES

An internal table to persist the data about the prerequisites. This table is not accessible from outside the app. To read the data use the `PUBLIC.PREREQUISITES` view below.
The table contains the following columns:

* `id STRING`
* `title VARCHAR`
* `description VARCHAR`
* `learnmore_url VARCHAR`
* `documentation_url VARCHAR`
* `guide_url VARCHAR`
* `custom_properties VARIANT`
* `is_completed BOOLEAN`
* `position INTEGER`

### PUBLIC.PREREQUISITES

This view is exposed to the `ADMIN` and `VIEWER` roles. It returns the data from the above table. Rows will be sorted ascending by the `position` column.
Inserting prerequisites happens inside `setup.sql`. However, it has to be done in a way that will skip the insert during the update.
For example:

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
    prerequisites_exist NUMBER;
BEGIN
    SELECT COUNT (*) INTO :prerequisites_exist FROM state.prerequisites;
    IF (:prerequisites_exist = 0) THEN
        INSERT INTO STATE.PREREQUISITES (ID, TITLE, DESCRIPTION, DOCUMENTATION_URL, POSITION)
            VALUES
                ('1', '<Prerequisite name>', '<Prerequisite description>', 'Prerequisite url', 1)
    END IF;
END;
$$;
```

Another approach is to use a merge statement instead and do nothing (or update) on match.

### PUBLIC.COMPLETE_PREREQUISITES_STEP()

A procedure available to `ADMIN` users. The successful execution of this procedure does not require all prerequisites to be completed.
If the configuration status of the connector is `INSTALLED` it sets the status of the Connector to:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "PREREQUISITES_DONE"
}
```

Otherwise, there is no effect.

This procedure requires the connector to be in the `CONFIGURING` status and configuration status other than `FINALIZED`. Otherwise an exception is thrown.

Possible errors include:

* `INVALID_CONNECTOR_STATUS` - connector_status is not `[CONFIGURING]`.
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - configuration_status is `FINALIZED`.
* `UNKNOWN_ERROR` - Something unexpected went wrong - message of thrown exception is forwarded.

### PUBLIC.UPDATE_PREREQUISITE (ID VARCHAR, IS_COMPLETED BOOLEAN)

This procedure sets a status of the given prerequisite to the provided value. It is only available to `ADMIN` users.
The validations are similar to the `COMPLETE_PREREQUISITES_STEP()` procedure.

Possible errors include:

* `INVALID_CONNECTOR_STATUS` - Connector status is not `[CONFIGURING]`.
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - Connector configuration status is `FINALIZED`.
* `PREREQUISITE_NOT_FOUND` - Prerequisite with given ID not found.

### PUBLIC.MARK_ALL_PREREQUISITES_AS_DONE()

This procedures sets the `is_completed` column for all the prerequisites to `true`. The validations are similar to the `COMPLETE_PREREQUISITES_STEP()` procedure.

Possible errors include:

* `INVALID_CONNECTOR_STATUS` - Connector status is not `[CONFIGURING]`.
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - Connector configuration status is `FINALIZED`.

---
title: Previous logging and event sharing functionality — Deprecated
source: https://docs.snowflake.com/en/developer-guide/native-apps/event-old.md
section: Native Apps Framework
---

# Previous logging and event sharing functionality — *Deprecated*

This topic describes the deprecated method for setting up logging and event sharing before the
introduction of event definitions.

Providers who are setting up logging and event sharing should use the method described in [Use logging and event tracing for an app](event-about.md).
See [Considerations when migrating from the previous event sharing functionality](event-about.md) for information about migrating from the deprecated
to the new logging and event sharing functionality.

> **Warning:**
>
> The process for setting up logging and event sharing described in this topic will be deprecated
> in a future release.

## Previous logging and event sharing functionality

This topic provides information on setting up logging and event sharing as a provider. Refer to
[Enabling Logging and Event Sharing for an app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging)
for the consumer requirements for configuring this feature.

Logging and trace events allow you to collect information about an app to troubleshoot errors. Using logging and trace events,
you can also get a better idea of how your app runs and improve your app later.

## Workflow for setting up logging and event sharing as a provider

As a provider, you can set up logging and event sharing for an app by performing the following:

1. Review the considerations for using logging and event sharing.
2. Configure logging and trace events for functions and stored procedures.
3. Set the log and trace level in the manifest file.
4. Configure an account to store shared events.

After the consumer installs an app and enables logging and event sharing, you can view logging and event information shared by the application:

* View the log and trace level.
* [View the logs and events in the event table](event-manage-provider.md).

## Considerations for using logging and event sharing

Before using logging and event sharing for an app, providers must consider the following:

* Providers are responsible for all costs associated with event sharing on the provider side, including data
  ingestion and storage.
* Providers must have [an account to store shared events](event-manage-provider.md)
  in each region where you want to support event sharing.
* Providers must define the default log level and trace level for an app in the manifest file.

> **Note:**
>
> Event sharing cannot be enabled for an app that is installed in the same account as the application package
> it is based on. To test event sharing for an app, a provider must use multiple accounts.

## Configure logging and trace events in functions and procedures

The Native Apps Framework requires an event table to store log messages and trace events generated from
functions and stored procedures in an app.

> **Note:**
>
> If the consumer of an app does not set up an event table and make it the active table
> before installing the app, event and logging data are discarded.

An account can have multiple event tables, but only one of them can be set as the active event table for a
Snowflake account at a time. Without an active event table, log messages and trace events generated by the
app are not captured. This is true even if the functions and procedures in an app call the logging
and trace event APIs.

To create an event table, use the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command. For more information, see
[Event table overview](../logging-tracing/event-table-setting-up.md).

After code has recorded log messages and trace events, a provider can query recorded data.

For information about recording and querying log and trace data, see the following:

* [Logging messages from functions and procedures](../logging-tracing/logging.md).
* [Trace events for functions and procedures](../logging-tracing/tracing.md).

## Set the log and trace level in the manifest file

To set the default log and trace event levels for a version of an app, set the
`log_level`, `trace_level`, and `metric_level` parameters in the manifest file as shown in the following example:

```yaml
artifacts:
  setup_script: setup.sql
configuration:
  trace_level: OFF
  log_level: DEBUG
```

When a provider enables tracing, a Snowflake Native App automatically captures the start and end
times for all queries and stored procedure calls.

> **Note:**
>
> Publishing a Snowflake Native App with the `trace_level` property set to a value other than `OFF`
> might expose calls to hidden stored procedures to any user in the consumer account who can view
> the event table.

For information on supported values for `trace_level`, `log_level`, `metric_level`, and `log_event_level`, see
[Setting levels for logging, metrics, and tracing](../logging-tracing/telemetry-levels.md).

When the Snowflake Native App is initially installed, it uses the log levels defined in the manifest file. If the log
level is changed in a subsequent upgrade, the new log level takes effect after the upgrade process completes.

The log and trace level can only be set within the manifest file. The consumer is not allowed to modify the log level using the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) or [ALTER DATABASE](../../sql-reference/sql/alter-database.md) commands.

Similarly, any session level settings for the logging level are ignored by the app.

## Configure an account to store shared events

To store shared logs and events, a provider must select an account to hold an event table. This can be any
account that a provider can access. However, if an organization has multiple providers publishing
app packages, consider using a Snowflake account that is dedicated to storing shared events from the
consumer.

The following restrictions apply to accounts used to store shared events:

* You must use an [organization administrator role](../../user-guide/organization-administrators.md) to set an account as the account used
  to store events.
* The account must have an active event table.
* The specified account cannot be any of the following:

  + A locked or suspended account.
  + A reader account.
  + A trial account.
  + A Snowflake managed account.

> **Note:**
>
> A provider can collect logs and shared events only in the same region where a consumer installs an app.
> Providers must set up an account to store shared events in every region where consumers configure event
> sharing for an app.

### Set an account as the events account

To set an account to be the events account for a region, call the SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION system function:

```sqlsyntax
CALL SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION('<snowflake_region>', '<region_group>', '<account_name>')
```

Where:

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2, AWS_US_EAST_1`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`. Refer to
    [Region groups](../../user-guide/admin-account-identifier.md) for details.

`account_name`
:   Specifies the account name. If another account is already set as the events account in the
    specified region, running this command changes the events account to be the account
    specified here. Always use the account name and not the Snowflake account identifier.

> **Note:**
>
> You must use an [organization administrator role](../../user-guide/organization-administrators.md) to call this function.

### Un-set an account as the events account

To unset an account to be the events account for a region, call the SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION system function:

```sqlsyntax
CALL SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION('<snowflake_region>', '<region_group>', '<account_name>')
```

Where:

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2, AWS_US_EAST_1`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`.

`account_name`
:   Specifies the account name. Always use the account name and not the Snowflake account identifier.

> **Note:**
>
> You must use an [organization administrator role](../../user-guide/organization-administrators.md) to call this function.

### View event accounts in the provider’s organization

To show events accounts in a provider’s organization, call the SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS system function:

```sqlexample
CALL SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS()
```

> **Note:**
>
> You must use an [organization administrator role](../../user-guide/organization-administrators.md) to call this function.

This system function returns a string in JSON format containing a list of event accounts within the organization.
Because the metadata takes some time to propagate to all regions, this function might experience some delay when
showing latest events account after the user set/unset an events account for the organization.

## View the logging and trace event levels defined for an application package

Use the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command to view the logging level of an app, as shown in the
following command:

```sqlexample
DESC APPLICATION HelloSnowflake;
```

Use the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md) command to view the logging level the app versions defined in an application
package, as shown in the following example:

```sqlexample
SHOW VERSIONS
  IN APPLICATION PACKAGE HelloSnowflake;
```

## View the logs and events in the event table

To view the logs and events stored in the event table, use the [SELECT](../../sql-reference/sql/select.md) command as shown
in the following example:

```sqlexample
SELECT * FROM EVENT_DB.EVENT_SCHEMA.MY_EVENT_TABLE
```

## Shared event information available to the provider

The following sections describe the information that the Native Apps Framework shares with providers.

### App event context shared with the provider

To help providers easily identify the source of the shared events, the following fields are populated into the
`RESOURCE_ATTRIBUTES` column of the event table when they are shared with the provider:

* snow.application.package.name
* snow.application.consumer.organization
* snow.application.consumer.name
* snow.listing.name
* snow.listing.global_name

### Fields that are not shared with the provider

To protect consumer information, the following fields from the `RESOURCE_ATTRIBUTES` column are
not shared with provider:

* snow.database.id
* snow.database.name
* snow.schema.id
* snow.executable.id
* snow.owner.name
* snow.owner.id
* snow.warehouse.name
* snow.warehouse.id
* snow.query.id
* snow.session.id
* snow.session.role.primary.name
* snow.session.role.primary.id
* snow.user.name
* snow.user.id
* db.user

Instead of directly sharing the `snow.database.name` and `snow.query.id` fields with the provider, Snowflake
shares the hash values (SHA-1) of these two fields as the following fields:

* snow.database.hash
* snow.query.hash

Snowflake provides the [SHA-1 function](../../sql-reference/functions/sha1.md) used to mask these attributes.
Consumers can calculate the hash values for the database name and query id, and use them as reference values when
contacting the provider.

## Determine if event sharing is enabled in the consumer account

In some contexts, a provider may need to determine if event sharing has been enabled in a consumer
account. For example, a provider may need to disable app functionality if the event table is not
available.

To determine if event sharing is enabled in a consumer account, providers can call the following
system functions when defining the app logic:

* IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER

  Returns TRUE if the app enables event sharing and an active event table is available in the consumer
  account. Returns FALSE, otherwise.
* IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT

  Returns TRUE if the app was installed in the same account as the application package it is based on.
  Returns FALSE otherwise.

> **Note:**
>
> These system functions can only be called from within an app. See
> Determine if event sharing is enabled using the Python Permission SDK and
> Determine if event sharing is enabled using SQL

### Determine if event sharing is enabled using the Python Permission SDK

The Python Permission SDK provides the following functions to determine if even sharing is enabled in
a consumer account:

* `is_event_sharing_enabled()`

  Returns TRUE if the SHARE_EVENTS_WITH_PROVIDER property is true and the consumer account has
  an active event table configured. Returns FALSE, otherwise.
* `is_application_local_to_package()`

  Returns TRUE if the app is in the same account as the application package. Returns FALSE,
  otherwise.

### Determine if event sharing is enabled using SQL

The following example shows how to call a stored procedure when event sharing is
enabled in the consumer account.

Consider the following SQL stored procedure that creates a function to calculate the sum of
two numbers:

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA app_schema;

CREATE OR REPLACE PROCEDURE app_schema.hidden_sum(num1 float, num2 float)
RETURNS FLOAT
LANGUAGE SQL
EXECUTE AS OWNER
AS $$
  DECLARE
    SUM FLOAT;
  BEGIN
    SYSTEM$LOG('INFO', 'CALCULATE THE SUM OF TWO NUMBERS');
    SUM := :NUM1 + :NUM2;
    RETURN SUM;
  END;
$$;
```

When added to the setup script of the app, these SQL commands create the `hidden_sum` stored procedure
in the consumer account when the app is installed. However, this stored procedure is not visible to consumers
because the USAGE privilege is not granted on the stored procedure to an application role.

The following example shows how you can use the values returned by the IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER
and IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT system functions to call the `hidden_sum` stored procedure.

```sqlexample
CREATE OR REPLACE PROCEDURE app_schema.sum(num1 float, num2 float)
RETURNS STRING
LANGUAGE SQL
EXECUTE AS OWNER
AS $$
    BEGIN
      IF (SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT() or SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER()) THEN
        CALL APP_SCHEMA.HIDDEN_SUM(:NUM1, :NUM2);
      ELSE
        -- notify consumers that they need to enable event sharing
        RETURN 'Sorry you can\'t access the API, please enable event sharing.';
      END IF;
    END;
$$;

CREATE APPLICATION ROLE IF NOT EXISTS ADMIN_ROLE;
GRANT USAGE ON SCHEMA APP_SCHEMA TO APPLICATION ROLE ADMIN_ROLE;
```

In this example, the `sum` stored procedure tests the values of the
IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER and IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT
stored procedures. If one of their values is `true`, the `sum` stored procedure calls
the `hidden_sum` stored procedure.

## Request the event sharing from consumers using the Python Permission SDK

A provider can use the Python Permission SDK to create a Streamlit app to prompt consumers to enable
event sharing in their account.

The SDK provides the `request_event_sharing()` method that displays a dialog in Snowsight
that prompts the consumer to enable event sharing in their account. If the event table does not
exist in the consumer account, the dialog allows the consumer to set the event table if they are using
the ACCOUNTADMIN role.

## Example: Using the Python Permission SDK with event tables

The following Streamlit example shows how to use the Python Permission SDK to do the following:

* Determine if event sharing is enabled.
* If event sharing is enabled, call the `critical_feature_that_requires_event_sharing()` function.
* If event sharing is not enabled, call the `request_event_sharing()` function to display a dialog in
  Snowsight that prompts the consumer to enable event sharing.

```python
import streamlit as st
import snowflake.permissions as permissions

def critical_feature_that_requires_event_sharing():
  st.write("critical_feature_that_requires_event_sharing")

def main():
  if permissions.is_event_sharing_enabled() or permissions.is_application_local_to_package():
     critical_feature_that_requires_event_sharing()
  else:
     permissions.request_event_sharing()

if __name__ == "__main__":
  main()
```

In this example, the `critical_feature_that_requires_event_sharing()` method is only called if
one of the following is true:

* Event sharing is enabled and the event table exists.
* The Snowflake Native App is installed in the same account as the application package.

If neither condition is true, the Streamlit app calls the `request_event_sharing()` method which
prompts the consumer to select an event table.

For more information, see Determine if event sharing is enabled in the consumer account.

---
title: Protect provider intellectual property
source: https://docs.snowflake.com/en/developer-guide/native-apps/redacted-content.md
section: Native Apps Framework
---

# Protect provider intellectual property

This topic describes how the Snowflake Native App Framework protects provider data by redacting or removing information
about objects shared by a Snowflake Native App.

## About intellectual property protection in the Snowflake Native App Framework

When a consumer installs a Snowflake Native App, they are not allowed to view the objects within the
application object unless a provider grants permissions on the objects using application roles.

In general, when a consumer queries object metadata using a schema, view, or uses Snowsight
to view the Query Profile or Query History for those queries, the Snowflake Native App Framework redacts information
about the objects within the application object.

## Information redacted from Query Profile

The Snowflake Native App Framework redacts information from the [query profile](../../user-guide/ui-snowsight-activity.md) in the
following contexts:

* Queries that are run when the app is installed or upgraded.
* Queries that originate from a stored procedure owned by the app.
* Queries containing a non-secure view or function owned by the app.

For each of these types of queries, Snowsight collapses the query profile data into a single
empty node instead of displaying the full query profile tree.

## Information redacted from Query History

For queries related to a Snowflake Native App, the `query_text` and `error_message` fields are redacted
from the [query history](../../user-guide/ui-snowsight-activity.md) in the following contexts:

* Queries run when the app is installed or upgraded.
* Queries that originate from a child job of a stored procedure owned by the app.

In each of these situations, the cell of the query history in Snowsight appears blank.

## Information redacted from SQL commands and views

When a consumer uses the a SHOW or DESCRIBE command to view information about the application object
or the objects owned by the app, information about implementation details is redacted. For example,
function definitions and the function body are redacted from the output of these commands.

Information about implementation details is redacted from the [ACCESS_HISTORY](../../sql-reference/account-usage/access_history.md) view
in the following contexts:

* Queries generated when the app is installed or upgraded.
* Queries generated by stored procedures and user-defined functions owned by the app.

Additionally, for views owned by the app, information about the base table is redacted.

## Considerations when granting the MONITOR or OPERATE privilege on dynamic tables

Providers should use caution when granting the MONITOR or OPERATE privilege on dynamic tables to an
application role. These privileges allow the consumer to view a dynamic table’s metadata, which might
expose the implementation details of the app. See [Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md) for
more information on what actions the consumer can perform.

## Blocked context functions

To protect information related to objects within an application object, the Snowflake Native App Framework blocks the following
context functions:

| Context Function | Blocked in shared content (returns null) | Blocked in setup scripts and stored procedure and UDFs owned by the Snowflake Native App (throws an exception) |
| --- | --- | --- |
| CURRENT_ROLE | ✔ |  |
| CURRENT_ROLE_TYPE | ✔ |  |
| CURRENT_USER | ✔ |  |
| CURRENT_SESSION | ✔ |  |
| IS_ROLE_IN_SESSION | ✔ |  |
| CURRENT_IP_ADDRESS | ✔ | ✔ |
| CURRENT_AVAILABLE_ROLES | ✔ | ✔ |
| CURRENT_SECONDARY_ROLES | ✔ | ✔ |
| ALL_USER_NAMES |  | ✔ |
| GET_USERS_FOR_COLLABORATION |  | ✔ |
| CURRENT_WAREHOUSE |  | ✔ |
| SYSTEM$ALLOWLIST |  | ✔ |

## Protect shared content

To protect the privacy and integrity of a provider’s data content, the Snowflake Native App Framework
implements the following restriction:

* Shared objects are read-only for an application object and installed Snowflake Native App.
* Shared objects are not directly exposed to consumers. Objects are only exposed through a
  view that is installed when the setup script runs during the Snowflake Native App installation or upgrade.
* Only the provider can update the shared content.
* Only the following objects can be shared with an application object or installed Snowflake Native App. These
  object must have certain privileges:

  + Schemas: Only the USAGE privilege can be granted to the shared content of an application package.
  + Tables: Only the SELECT privilege can be granted to the shared content of an application package.
    Tables with defined policies (row access, masking, tag based, etc.) cannot be shared. Policies can be
    defined on the objects when they are exposed to consumers.
  + Views: Only the SELECT privilege can be granted to the shared content of an application package. Views
    with defined policies, including row access, masking, tag based, etc., cannot be shared.

> **Note:**
>
> A view, or any views from which a view is composed, cannot contain JavaScript, Java, Python, or Scala
> functions.

Refer to [Allow consumers to access shared objects in an app](preparing-data-content.md) for more information.

---
title: Publish an app to consumers
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-provider-publishing-app-package.md
section: Native Apps Framework
---

# Publish an app to consumers

After developing and testing the application package containing your app, you can
publish the app to consumers using
[listings](../../collaboration/collaboration-listings-about.md).

As a provider, you add an application package as the data content of an listing. The consumer
installs the app in their account from the listing.

## Set up roles and privileges

When you create a listing, you create it from the account that has the data or application package in it. The role that attaches a data
product to a listing and publishes the listing must be the same role that created, and therefore owns, the application package or share.
You cannot transfer the OWNERSHIP privilege for a share.

If you use a different role to create and manage the listing, grant the MODIFY privilege on the listing to the role
that owns the application package or share. For example:

Share or application package owner role:
:   OWNERSHIP privilege on the share or application package.
    MODIFY privilege on the listing.

Listing owner role:
:   OWNERSHIP privilege on the listing.

    Global CREATE LISTING privilege.

Within the provider account, you can use one of the following to create and manage listings:

ACCOUNTADMIN:
:   If you use the ACCOUNTADMIN role to create and manage listings, the ORGADMIN role must first
    [Delegate privileges to set up auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

Custom role:
:   If you use a custom role, the ORGADMIN role must first [Delegate privileges to set up auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md)
    to the ACCOUNTADMIN role, which can then be used to grant the relevant privileges to the custom role.

For more information about granting sharing privileges, see [Granting Privileges to Other Roles:](../../user-guide/data-exchange-marketplace-privileges.md).

## Prerequisites for publishing a listing for an application package

Before creating a listing for an application package, ensure that you have completed the following:

* Create and test your application package.

  Before publishing your application package, ensure that it is working correctly and that the roles and privileges
  are set properly.
* Become a Provider of Listings

  Becoming a provider of listings in Snowflake makes it easier to manage sharing apps from your account to
  other Snowflake accounts. For more information, see [Use listings as a provider](../../collaboration/provider-becoming.md).

  Creating a provider profile is not required for private listings.

## Workflow for publishing an application package

To publish an application package:

1. Ensure that you have completed the prerequisites for publishing
   a listing for an application package.
2. Set the default release directive.
3. Initiate the automated security scan.
4. Create a listing.
5. (Optional) Add a pricing plan to get paid for your application.
6. Submit your listing for approval.

   You only need to approve listings published to the Snowflake Marketplace.
7. Publish your listing.

## Set the default release directive

Before creating a listing for an application package, you must specify the default release directive that
points to the version or patch of the app you are publishing.

If you are using release channels to manage the versions of your app, you can set custome release directive for each release channel. You must set the default release directive on the default release channel.

For more information, see [Set the release directive using a release channel](release-channels.md)

If you are publishing your app using the legacy versioning method, you can set the default release directive on the application package. For more information, see
[Set the release directive for an app (Legacy)](update-app-release-directive.md)

## Initiate the automated security scan for an application package

To publish a listing for an application package to an account outside of your organization, your application package must pass an automated security scan.

The automated security scan is initiated when you set the DISTRIBUTION property of the application package to `EXTERNAL` or when you add a new version or patch to an application package that has the DISTRIBUTION property set to `EXTERNAL`. For more information, see
[Security requirements and guidelines for a Snowflake Native App](security-overview.md).

## Create a listing for an app

To share your app with consumers, create a listing and add the application package as the data product
of the listing.

### Create a private listing for an app

To publish your app to specific consumers, create a listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing. The Create Listing window opens.
4. Enter a name for your listing.
5. In the Who can discover the listing section, select Only specified consumers to privately share the
   listing with specific accounts.
6. Click + Select to select the application package to be in the listing.
7. Enter a description for your listing.
8. (Optional) If you have multiple provider profiles, select which provider profile to use to publish this listing.
9. In the Add consumer accounts section, add the account identifiers for the consumers with whom you want to
   share the listing.
10. If the consumer accounts are located in another region, set up auto-fulfillment:

    1. Review the refresh frequency configured at the account level. If you need to use a different
       refresh frequency, see [Set the account-level refresh interval](../../collaboration/provider-listings-auto-fulfillment-set-refresh-interval.md).
    2. Optional: Select a warehouse to use to set up auto-fulfillment.
11. Select Publish to publish the listing to the selected consumers, or select Save Draft
    to save it as a draft.

To monetize your app, add a pricing plan.

### Create a listing for an app published to the Snowflake Marketplace

To publish your app on the Snowflake Marketplace, create a listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing. The Create Listing window opens.
4. Enter a name for your listing.
5. In the Who can discover the listing section, select Anyone on the Marketplace to publish the listing on
   the Snowflake Marketplace.
6. In the How will consumers access the data product? section, select Free or Paid.
7. Select Next. A draft listing is created.

Before publishing your draft listing, you must configure additional required and optional capabilities.

#### Configure a Snowflake Marketplace listing for an application package

After you create a listing for the Snowflake Marketplace, you must configure additional information for your listing so that you can submit it for approval and publish it.

To configure a listing:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the draft listing you want to configure.
4. Select Add next to each section that appears on the page and provide the required information.

   As you provide information for each section, refer to
   [Configure listings](../../collaboration/provider-listings-reference.md)
   for information on each field. The specific properties available to edit depend on the type of listing
   that you create.

   If you want to monetize your Snowflake Native App, add a pricing plan to get paid for your Snowflake Native App.

## Submit a listing for approval

Before you can publish a listing to the Snowflake Marketplace, you must submit the listing to Snowflake for
approval.

If you want to submit your listing for approval, but the option to Submit for Approval is disabled, check the
following:

* You completed the steps to configure the listing.
* You are the ACCOUNTADMIN or have the OWNERSHIP privilege for the data product attached to the listing.
* All sample SQL queries attached to the listing pass validation.

To submit a listing for approval:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the draft listing you want to submit for approval.
4. Select Submit for Approval.

   After the listing is reviewed by Snowflake, the state changes to Approved or Denied.

   If the listing has been denied, update the listing based on the feedback provided in comments, and resubmit it for approval.

   When a listing is approved or denied, an email notification is sent to both the Business Contact and Technical Contact email addresses
   in the provider profile associated with the listing.

## Publish a listing for an app

To publish an approved listing on the Snowflake Marketplace:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the listing you want to publish.
4. Select Publish.

After publishing your Snowflake Marketplace listing, you can define a
[referral link](../../collaboration/provider-listings-referral-link.md)
to share a direct link to your listing with consumers.

---
title: Publish an app using release channels
source: https://docs.snowflake.com/en/developer-guide/native-apps/release-channels.md
section: Native Apps Framework
---

# Publish an app using release channels

This topic describes how to use release channels to manage and publish multiple versions of a Snowflake Native App.

## Privileges required to use release channels

To use release channels, you must use a role that has been granted the MANAGE RELEASES privilege. This privilege allows you to:

* Enable release channels on the application package.
* Enable the QA and ALPHA release channels.
* Register and deregister versions and patches.
* Add versions and patches.
* Set the release directive.

## Enable release channels for an application package

If you disabled release channels when you created an application package, you can enable them using
the ENABLE_RELEASE_CHANNELS clause of the application package as shown in the following commands:

```sqlexample
CREATE APPLICATION PACKAGE my_app_package ENABLE_RELEASE_CHANNELS=TRUE;
ALTER APPLICATION PACKAGE my_app_package SET ENABLE_RELEASE_CHANNELS=TRUE;
```

> **Warning:**
>
> After release channels are enabled for an application package, they cannot be disabled.

## Enable the ALPHA and QA release channels

By default, only the DEFAULT release channel is available to all consumers and enables them to install
an app from a listing to which they have access.

To use the QA and ALPHA release channels providers must explicitly enable them on the application package for
specific accounts. For these channels, the application package maintains a list of accounts that have been
added to each channel.

Providers can add an account to a release channel using the MODIFY RELEASE CHANNEL clause of the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command.

To add the ORG1.ACCOUNT1 account to the ALPHA release channel:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package MODIFY RELEASE CHANNEL ALPHA ADD ACCOUNTS=(ORG1.ACCOUNT1);
```

To remove the ORG1.ACCOUNT1 account from the ALPHA release channel:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package MODIFY RELEASE CHANNEL ALPHA REMOVE ACCOUNTS=(ORG1.ACCOUNT1);
```

To overwrite the current list of accounts added to a relase channel, a provider can use the SET clause of the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command as shown in the following example:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package MODIFY RELEASE CHANNEL ALPHA SET ACCOUNTS=(ORG1.ACCOUNT2);
```

This command removes all current accounts from the ALPHA release channel and adds the ORG1.ACCOUNT2 account.

> **Note:**
>
> Apps installed from the QA or ALPHA release channels are meant for testing. Only apps installed from the DEFAULT release channel are meant for production. For limited trial and paid (with trial) listings, the following limitations apply:
>
> * **For an app installed from a listing using the QA or ALPHA release channels**: These apps are free, and will be disabled after the trial period ends. If these apps are installed from a paid listing, they will still be available to the consumer after the trial period ends. If the provider would like to revoke access to these apps after the trial period ends, they can do so by removing the consumer from the active targets of the release channel.
> * **For an app installed from a listing using the DEFAULT release channel**: These apps are disabled after the trial period ends. If the consumer wants to continue using the app, they must accept an offer, and select the app from the default release channel.

## Monitoring release channels

### View the release channels defined in an application package or listing

To view the release channels defined in the application package, use the SHOW RELEASE CHANNELS command as shown
in the following example:

```sqlexample
SHOW RELEASE CHANNELS IN APPLICATION PACKAGE my_app_package;
```

To view the release channels defined for a listing, use the SHOW RELEASE CHANNELS command as shown in the
following example:

```sqlexample
SHOW RELEASE CHANNELS IN LISTING <listing_id>;
```

### View the release channel of an installed app

To view the release channels for all installed instances of an app, view the `current_release_channel_name` column of
the SNOWFLAKE.DATA_SHARING_USAGE.APPLICATION_STATE view.

## Manage versions and patches using release channels

Providers must add a version or patch to a specific release channel before they can be used by release directives
inside a release channel. After a version has been added to a release channel, subsequent patches for that version are also
bound to that release channel to be used.

> **Note:**
>
> The `ADD VERSION USING ‘@stage/path’` clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command is not supported for application packages that have release
> channels enabled. Providers must register and deregister a version in the application package.

### Register a version

Before adding a new version of an app to an application package with release channels, providers must
register the version in the release channel by using the REGISTER clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md)
command:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package REGISTER VERSION V1 USING '@stage/path';
```

This command creates version V1 of the app and also creates patch 0. This version is not assigned to any release channel. There
is a maximum of two unassigned versions (not added to any release channels) allowed in the application package.

### Deregister a version

To create a new version in an application package that already has two versions, providers must deregister an old version.

To deregister a version and its associated patches, use the DEREGISTER clause of the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md). The following command shows how to deregister version `v1` from
the application package:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package DEREGISTER VERSION v1;
```

> **Note:**
>
> When using release channels, you do not have to drop the existing version from the application package.

### Add a version to a release channel

After registering a new version of an app in the application package, you must explicitly add the version to a release channel
to set the release directive for the app.

To add a version to a release channel, use the ADD VERSION clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md)
command:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package MODIFY RELEASE CHANNEL QA ADD VERSION V1;
```

> **Note:**
>
> A release channel can only contain two simultaneous versions.

### Remove a version from a release channel

To remove a version from a release channel, use the DROP clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md):

```sqlexample
ALTER APPLICATION PACKAGE my_app_package MODIFY RELEASE CHANNEL QA DROP VERSION V1;
```

Dropping a version from a release channel is asynchronous and will only be truly dropped once all
consumers have been upgraded off that version.

## Set the release directive using a release channel

When release channels are enabled for an application package, each channel has its own release directive.

To set the default release directive for a release channel:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL ALPHA
  SET DEFAULT RELEASE DIRECTIVE VERSION=v1 PATCH=10;
```

To set a custom release directive for a release channel:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL ALPHA
  SET RELEASE DIRECTIVE my_custom_release_directive
  VERSION=V1 PATCH=11 ACCOUNTS=(ORG1.ACCOUNT1);
```

## Enable multiple instances using release channels

You can allow consumers to create multiple instances of an app in their account. Providers can
also create multiple instances of an app in their test account.

To enable multiple instances use the MULTIPLE_INSTANCES property of the application package as shown in the
following commands:

```sqlexample
CREATE APPLICATION PACKAGE <name> MULTIPLE_INSTANCES=TRUE;
ALTER APPLICATION PACKAGE <name> SET MULTIPLE_INSTANCES=TRUE;
```

> **Note:**
>
> Enabling multiple instances for the application package applies to all release channels within the application
> package.

> **Note:**
>
> A consumer can have multiple instances of an app using the QA or ALPHA release channels. If your app package is deployed to a paid listing, a consumer can only have one instance of an app using the DEFAULT release channel. Consumers can still install multiple app instances using the DEFAULT release channel from app packages in a free listing.

## Monetization and release channels

All app instances installed using the DEFAULT release channel use the pricing plan configured for the listing.

App instances installed from ALPHA and QA release channels are free and do not use the pricing plan configured for
the listing.

## Install an app using release channels

Providers can use [CREATE APPLICATION](../../sql-reference/sql/create-application.md) to create an app from a release channel in their
test environment.

> **Note:**
>
> To install an app from the QA or ALPHA release channels, you must use a role that has been granted the
> CREATE PREVIEW APPLICATION privilege.

To install an app from an application package in the same account, run the following command:

```sqlexample
CREATE APPLICATION my_app
  FROM APPLICATION PACKAGE my_app_package
  USING RELEASE CHANNEL QA;
```

If you do not explicitly use the `USING RELEASE CHANNEL` clause, the DEFAULT release channel is used.

* To install an app in another account from a listing, run the following command:

```sqlexample
CREATE APPLICATION my_app
  FROM LISTING
  USING RELEASE CHANNEL QA;
```

If you do not explicitly use the `USING RELEASE CHANNEL` clause, the DEFAULT release channel is used.

---
title: Python Permission SDK reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-permission-sdk-ref.md
section: Native Apps Framework
---

# Python Permission SDK reference

This topic provides reference information for the functions supported by the `snowflake.permissions`
module of the Python Permission SDK. For information on using the Python Permission SDK to request privileges in the consumer account, see [Create a user interface to request privileges and references](requesting-ui.md).

## get_application_specifications()

Returns all app specifications defined for the app.

Signature:
:   ```python
    get_application_specifications()
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   An array of dictionaries, where each dictionary contains the following key/value pairs:

    ```json
    {
      "name": "<value>",
      "requested_on": "<value>",
      "type": "<value>",
      "sequence_number": "<value>",
      "status": "<value>",
      "status_upgraded_on": "<value>",
      "label": "<value>",
      "description": "<value>",
      "definition": "<value>",
    }
    ```

    Where:

    * `name`: The name of the app specification.
    * `requested_on`: Timestamp when the app specification was requested.
    * `type`: Type of app specification. Supported values are: EXTERNAL ACCESS and SECURITY INTEGRATION.
    * `sequence_number`: ID for a version of an app specification. This value is incremented each time a provider
      changes the [app specification definition](requesting-app-specs.md).
    * `status`: Specifies the current status of the app specification. Possible values are:

      + `APPROVED`: The consumer approved the app specification.
      + `DECLINED`: The app specification is waiting for the consumer to approve or decline.
      + `DECLINED`: The consumer declined the app specification.
      + `PENDING`: The app specification is waiting for the consumer to approve or decline.
    * `status_updated_on`: Timestamp of the last status change.
    * `label`: Name of the app specification that is displayed to the consumer in Snowsight.
    * `description`: Description of the app specification that is displayed to the consumer in Snowsight.
    * `definition`: Values that are part of the
      [app specification definition](requesting-app-specs.md). The values of
      this column depend on the type of app specification.

## get_detailed_reference_associations()

Provides detailed information about a reference to an object in the consumer account.

Signature:
:   ```python
    get_detailed_reference_associations(reference_name: str) -> List[dict]
    ```

Arguments:
:   A string value containing the name of a reference.

Returns:
:   Returns a JSON object representing an array of dictionaries. Each dictionary contains the following
    key/value pairs:

    ```json
    {
      "alias": "<value>",
      "database": "<value>",
      "schema": "<value>",
      "name": "<value>"
    }
    ```

    Where:

    * `alias`: The system-generated alias for the reference.
    * `database`: The parent database name of the consumer object, if the object resides in a database.
      Otherwise, null.
    * `schema`: The parent schema of the consumer object, if the object resides in a schema. Otherwise,
      null.
    * `name`: The name of the consumer object.

## get_held_account_privileges()

Returns the privileges that have been granted to the app.

Signature:
:   ```python
    get_held_account_privileges(privilege_names: [str]) -> [str]
    ```

Arguments:
:   An array of string values containing the names of privileges to check.

Returns:
:   Returns an array containing the privileges that have been granted to the app
    based on the array of privileges passed to the function.

    Returns an array containing the privileges that have been granted to the Snowflake Native App
    based on the array of privileges passed to the function.

## get_missing_account_privileges()

Returns the privileges that have not been granted to the app.

Signature:
:   ```python
    get_missing_account_privileges(privilege_names: [str]) -> [str]
    ```

Arguments:
:   An array of string values containing the names of privileges to check.

Returns:
:   Returns a string array containing the privileges that have **not** been granted to the app
    based on the array of privileges passed to the function.

## get_reference_associations()

Determines the objects in the consumer account that are associated with a reference.

To get more detailed information about references to objects in the consumer account,
use get_detailed_reference_associations().

Signature:
:   ```python
    get_reference_associations(reference_name: str) -> [str]
    ```

Arguments:
:   A string value containing the name of a reference.

Returns:
:   Returns an array containing Snowflake-generated aliases of objects in the consumer account
    that are bound to the reference.

## is_application_all_mandatory _telemetry_event_definitions_enabled()

Checks if all mandatory telemetry event definitions are enabled for the app.

For more information on telemetry event sharing, see
[Verify event definitions by using the Permissions SDK](event-develop.md).

Signature:
:   ```python
    is_application_all_mandatory_telemetry_event_definitions_enabled() -> bool
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   Returns TRUE if all mandatory telemetry event definitions are enabled for the app.
    Returns FALSE, otherwise.

## is_application_authorized_for_telemetry_event_sharing()

Checks if the current application is authorized for telemetry event sharing.

For more information on telemetry event sharing, see [Verify event definitions by using the Permissions SDK](event-develop.md).

Signature:
:   ```python
    is_application_authorized_for_telemetry_event_sharing() -> bool
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   Returns TRUE if the application is authorized for telemetry event sharing.
    Returns FALSE, otherwise.

## is_application_local_to_package()

Checks if the app is installed in the same account as the application package.

Signature:
:   ```python
    is_application_local_to_package() -> bool
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   Returns TRUE if the app is installed in the same account as the application package.
    Returns FALSE, otherwise.

## is_event_sharing_enabled()

Checks if event sharing is enabled for the app.

Signature:
:   ```python
    is_event_sharing_enabled() -> bool
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   Returns TRUE if the SHARE_EVENTS_WITH_PROVIDER property is true and the consumer account has an
    active event table configured. Returns FALSE, otherwise.

## is_external_data_enabled()

Checks if the current application is enabled to use external and iceberg tables.

Signature:
:   ```python
    is_external_data_enabled() -> bool
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   Returns TRUE if the app is enabled to use external and iceberg tables.
    Returns FALSE, otherwise.

## request_application_specification_review()

Opens a dialog in a Streamlit app that allows the consumer to review an app specification then
approve, decline, or take no action. A consumer can only decline an app specification if it is optional.

Signature:
:   ```python
    request_application_specification_review(spec_names: [str] = None)
    ```

Arguments:
:   An optional array of string values containing the names of the app specifications to review. If this parameter is
    not specified, the dialog will show all app specifications defined for the app.

Returns:
:   This method does not return a value.

## request_aws_api_integration()

Requests an API integration from the consumer for the Amazon API Gateway.

You must define the API integration in the manifest file. For more information,
see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md) for information on other parameters.

Signature:
:   ```python
    request_aws_api_integration(id: str, allowed_prefixes: [str], gateway: AwsGateway, aws_role_arn: str, api_key: str = None, name: str = None, comment: str = None)
    ```

Arguments:
:   * `id`: The name of the API integration defined in the manifest file.
    * `allowed_prefixes`: An array of string values containing the allowed prefixes for the API integration.
    * `gateway`: The type of API Gateway to use. This parameter must be one of the following values:

      + permissions.AwsGateway.API_GATEWAY
      + permissions.AwsGateway.PRIVATE_API_GATEWAY
      + permissions.AwsGateway.GOV_API_GATEWAY
      + permissions.AwsGateway.GOV_PRIVATE_API_GATEWAY
    * `aws_role_arn`: The Amazon Resource Name (ARN) of the IAM role that the API Gateway uses to access the
      consumer account.
    * `api_key`: An optional API key for the API Gateway.
    * `name`: An optional name for the API integration.
    * `comment`: An optional comment for the API integration.

    See [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md) for information on other possible parameters.

Returns:
:   A string value containing the name of a reference.

## request_azure_api_integration()

Requests an API integration from the consumer for Azure API Management.

You must define the API integration in the manifest file. For more information,
see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md) for information on other parameters.

Signature:
:   ```python
    request_azure_api_integration(id: str, allowed_prefixes: [str], tenant_id: str, application_id: str, api_key: str = None, name: str = None, comment: str = None)
    ```

Arguments:
:   * `id`: The name of the API integration defined in the manifest file.
    * `allowed_prefixes`: An array of string values containing the allowed prefixes for the API integration.
    * `tenant_id`: The tenant ID for the Azure API Management.
    * `application_id`: The application ID for the Azure API Management.
    * `api_key`: An optional API key for the Azure API Management.
    * `name`: An optional name for the API integration.
    * `comment`: An optional comment for the API integration.

Returns:
:   This method does not return a value.

## request_event_sharing()

Opens a dialog in a Streamlit app that allows the consumer to share events with the app.

Signature:
:   ```python
    request_event_sharing()
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   This method does not return a value.

## request_external_data()

Requests consent from the consumer to use external and iceberg tables.

Signature:
:   ```python
    request_external_data()
    ```

Arguments:
:   This function does not take any arguments.

Returns:
:   This method does not return a value.

## request_google_api_integration()

Requests an API integration from the consumer for Google Cloud API Gateway.

You must define the API integration in the manifest file. For more information,
see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md) for information on other parameters.

Signature:
:   ```python
    request_google_api_integration(id: str, allowed_prefixes: [str], audience: str, name: str = None, comment: str = None, api_key: str = None
    ```

Arguments:
:   * `id`: The name of the API integration defined in the manifest file.
    * `allowed_prefixes`: An array of string values containing the allowed prefixes for the API integration.
    * `audience`: The audience for the Google Cloud API Gateway.
    * `name`: An optional name for the API integration.
    * `comment`: An optional comment for the API integration.
    * `api_key`: An optional API key for the Google Cloud API Gateway.

Returns:
:   This method does not return a value.

## request_account_privileges()

Requests privileges from the consumer specified by a string array passed to the function that
contains the privileges. The specified privileges must be listed in the manifest file.

Signature:
:   ```python
    request_account_privileges(privileges: [str])
    ```

Arguments:
:   An string array containing a list of privileges the app is requesting.

Returns:
:   This method does not return a value.

## request_reference()

Requests a reference from the consumer specified by the string passed to the function. The
reference passed to the function must be defined in the manifest file.

See [Object types and privileges that a reference can contain](requesting-refs.md) for a list of the objects that can be
included in a reference and their supported privileges.

Signature:
:   ```python
    request_reference(reference: str)
    ```

Arguments:
:   A string value containing the name of a reference to request.

Returns:
:   This method does not return a value.

---
title: Request access to external and Apache Iceberg™ tables
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-external-tables.md
section: Native Apps Framework
---

# Request access to external and Apache Iceberg™ tables

This topic describes how a provider can configure an app to request that a consumer allow the app to
access external and Apache Iceberg™ tables that a provider shares in an app.

## About external and Iceberg tables in a Snowflake Native App

The Snowflake Native App Framework allows providers to share [external tables](../../user-guide/tables-external-intro.md) and
[Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) with consumers. For general information,
see [Support for external and Iceberg tables](preparing-data-content.md).

To include an external or Iceberg table in an app:

1. Add the table to the app. See [Share data content in a Snowflake Native App](preparing-data-content.md).
2. Add an entry for external and Iceberg tables to the manifest.
3. Request permissions to access external and Iceberg tables.

> **Caution:**
>
> Before an app can access a shared external or Iceberg table,
> the consumer must explicitly give the app permission to use the table. For more information, see [Enable external and Apache Iceberg™ tables](ui-consumer-granting-privs.md).

## Add an entry for external and Iceberg tables to the manifest

To include external or Iceberg tables in an app, providers must add
an entry in the manifest file as shown in the following example:

```yaml
restricted_features:
  - external_data:
     description: “The reason for enabling an external or Iceberg table.”
```

## Request permissions to access external and Iceberg tables

For security and cost considerations, consumers must explicitly give an app permissions
to use an external or Iceberg table.

> **Note:**
>
> If an app attempts to resolve an external or Iceberg table directly in setup script
> the setup script fails if the consumer has not yet given permission to the app. To access external
> data, for example to create a view from an external table, providers should create the view
> in a stored procedure in the setup script. The app can then call the stored procedure after the
> consumer gives the app permission.

To allow a custom Streamlit app to access external and Iceberg tables, the Python Permission SDK provides the following functions:

`request_external_data() -> None`
:   Causes Snowsight to display a dialog that prompts the consumer to allow the app to
    access the external or Iceberg tables required by the app.

`is_external_data_enabled() -> boolean`
:   Determines if the consumer has allowed the app to use external or Iceberg tables. Returns
    `True` if allowed. Returns `False`, otherwise.

Alternatively, a consumer can run the [SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS](../../sql-reference/functions/system_set_application_restricted_feature_access.md) system function to allow an app access to external and Iceberg tables.

---
title: Request access to objects and privileges in a consumer account
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-objects-privs.md
section: Native Apps Framework
---

# Request access to objects and privileges in a consumer account

This topic provides general information about how a provider can develop an app to request
privileges or access to objects in the consumer account when a Snowflake Native App is installed or upgraded.
It provides information on developing an app so that the consumer must manually grant
privileges to the app using Snowsight or SQL. For information on developing an app to use
automated granting of privileges, see [Configure the privileges required by an app](requesting-auto-privs.md).

## About privileges and references in a Snowflake Native App

In a simple Snowflake Native App, all of the required objects are created inside the
APPLICATION object when the setup script runs during installation. In this context, all of the objects
required by the app are created and accessible within the APPLICATION object. The consumer is not required
to perform any actions. All of the necessary privileges required are managed by the app using
application roles.

However, a more complex Snowflake Native App may need to create new objects or access objects
in the consumer account that are outside the APPLICATION object. In this case, the consumer must
grant the necessary privileges or authorize access to allow the Snowflake Native App to create or access these
objects.

The Snowflake Native App Framework allows providers to do the following:

* Check for account-level privileges in the consumer account.
* Request account-level privileges to perform tasks, for example creating a database.
* Use [references](../../sql-reference/references.md) to access existing objects in the consumer account.

Providers request access to a consumer account by requesting the following:

Global privileges
:   Global privileges allow the Snowflake Native App to perform actions in the consumer account.
    Refer to [Privileges the provider can request from the consumer](requesting-privs.md) for details.

References
:   [References](../../sql-reference/references.md) allow the app to access existing objects in the
    consumer account. A provider defines the references that the app requests in the manifest file.

    After installation, the consumer allows access to the object by providing a reference that is created
    with the [fully qualified name](../../sql-reference/name-resolution.md) of the object.

    References allow the app to access objects using a logical name. A reference allows a provider
    to create the app without having to know the specific name of the object or its parent database and
    schema.

    See [references](../../sql-reference/references.md) for more information.

## How a consumer allows access to a Snowflake Native App

For each request for access that a provider defines in the app, the consumer must allow access
to the app. How a consumer allows access is different for global privileges and references.

### Grant global privileges to a Snowflake Native App

When a provider configures an app to request specific privileges or access to specific objects,
there are two ways a consumer can grant these privileges to the app:

* If a provider implements a user interface using the Python Permission SDK, the consumer uses
  Snowsight to grant the permissions that are requested by the app. The Python Permission SDK
  automatically runs the required GRANT statements in the consumer account.
* If a provider does not implement a user interface, the provider must communicate to the consumer
  what privileges the app requires. For example, the provider must communicate to the consumer information
  about the SQL statements that the consumer must run to grant the necessary privileges to the app.

  Snowflake recommends including this information in the README file of the app, which the consumer
  can view as part of listing for the Snowflake Native App.

### Authorize access on objects

When a provider defines a reference to an object in the consumer account that is outside of the
APPLICATION object, there are two ways a consumer can create references on these objects and associate
them to the application.

* If a provider implements a user interface with the Python Permission SDK, the consumer uses Snowsight to
  associate the references to the objects required by the app. See
  [Managing Access Requests using Snowsight](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs#managing-access-requests-using-snowsight).
* If a provider does not implement a user interface, the consumer must manually create the reference, then
  associate it with the Snowflake Native App.

---
title: Request data sharing with app specifications
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-app-specs-listing.md
section: Native Apps Framework
---

# Request data sharing with app specifications

This topic describes how to configure the specifications of a Snowflake Native App
to request permission to share data with providers or third parties through
listings. This enables use cases such as compliance reporting, telemetry
sharing, and data preprocessing.

## Share data from an app with providers or third parties

Some Snowflake Native Apps need to share data back with the provider or with third-party Snowflake accounts for
various business purposes. Common use cases include the following:

* **Compliance reporting:** Sharing audit logs or compliance data with regulatory accounts
* **Telemetry and analytics:** Sending usage metrics back to the provider for product improvement
* **Data preprocessing:** Sharing transformed data with partner accounts
* **Support and troubleshooting:** Providing diagnostic data to support teams

To enable data sharing from an app, the app needs to provide both shares and
listings. A share contains the database objects to be shared, and a listing
provides the mechanism to share data across accounts and regions.

For more information about data shares, see [About Secure Data Sharing](../../user-guide/data-sharing-intro.md).

To configure an app to share data using listings, follow these steps:

1. Use [automated granting of privileges](requesting-auto-privs.md) to request privileges from
   the consumer to create shares and listings.
2. Create a share and grant database objects to it.
3. Create an external listing attached to the share.
4. To request permission from the consumer to share data with specific target
   accounts, use [application specifications](requesting-app-specs.md).

> **Note:**
>
> Unlike other app specification types, each LISTING specification is associated with exactly one
> listing object. An app cannot create multiple app specifications for the same listing.

## App specification workflow for sharing data

Configuring an app to share data by using listings follows this general workflow:

1. Providers configure [automated granting of privileges](requesting-auto-privs.md) for the app.
   This grants the app privileges to create shares and listings.

   > > **Note:**
   > >
   > > App specifications require `manifest_version: 2` to be set in the manifest file.
2. Providers add the
   CREATE SHARE and CREATE LISTING privileges to the
   manifest file.
3. Providers add SQL statements to the setup script to create the following objects as required:

   * [Share](../../sql-reference/sql/create-share.md)
   * [External listing](../../sql-reference/sql/create-listing.md)
   * [App specification](../../sql-reference/sql/alter-application-set-app-spec.md)

   The setup script creates the share and listing when the app is installed or
   upgraded. The app specification can be created during setup or at runtime
   through a stored procedure.
4. When configuring the app, consumers review and approve the target accounts
   and auto-fulfillment settings on the listing.
   Auto-fulfillment settings are only applicable for cross-region sharing.
   For more information on how consumers view and approve app specifications, see [Approve app specifications](ui-consumer-app-spec.md).

## App specification definition for sharing data

For an app specification of type LISTING, the app specification definition contains the following entries:

* `TARGET_ACCOUNTS`: A comma-separated list of target accounts to share
  data with, enclosed in single quotes. Each account must be specified in the
  format `OrgName.AccountName`; for example:
  `'ProviderOrg.ProviderAccount,PartnerOrg.PartnerAccount'`.
* `LISTING`: The identifier of the listing object created by the app.
* `AUTO_FULFILLMENT_REFRESH_SCHEDULE`: Optional. The refresh schedule for cross-region data
  sharing. Can be specified as `<num> MINUTE` or `USING CRON <expression>`.

> **Note:**
>
> The listing name in the app specification must match an existing listing created by the app.
> After this is set, the listing name cannot be changed.

## Set the version of the manifest file

To enable automated granting of privileges for an app, set the version at the
beginning of the manifest file, as shown in the following example:

```yaml
manifest_version: 2
```

## Add the CREATE SHARE and CREATE LISTING privileges to the manifest file

The CREATE SHARE and CREATE LISTING privileges allow the app to create shares
and listings during installation or upgrade.

* To configure an app to request
  these privileges, add the following code to the `privileges` section of
  the manifest file:

  > ```yaml
  > manifest_version: 2
  > ...
  > privileges:
  >   - CREATE SHARE:
  >       description: "Create a share for sharing compliance data with provider"
  >   - CREATE LISTING:
  >       description: "Create a listing for cross-region sharing of compliance data"
  > ...
  > ```

If you set the `manifest_version` to 2 in the manifest file, Snowflake automatically grants
the CREATE SHARE and CREATE LISTING privileges to the app during installation or upgrade.

## Create a share and grant objects to it

1. To create a share for data sharing, add the
   [CREATE SHARE](../../sql-reference/sql/create-share.md) command to the setup script, as shown
   in the following example:

```sqlexample
CREATE SHARE IF NOT EXISTS compliance_share;
```

1. Grant the database objects you want to share, as shown in the following
   example:

> ```sqlexample
> GRANT USAGE ON DATABASE app_created_db TO SHARE compliance_share;
> GRANT USAGE ON SCHEMA app_created_db.reporting TO SHARE compliance_share;
> GRANT SELECT ON TABLE app_created_db.reporting.metrics TO SHARE compliance_share;
> ```

> **Note:**
>
> Apps can only share data from the following sources:
>
> * Databases created by the app: The app must be the owner of these databases.
>
> Apps can choose to directly grant privileges on an object to a share or grant a database
> role to share. For more information, see [How to share database objects](../../user-guide/data-sharing-gs.md).
> Apps cannot directly add target accounts to the share. This is controlled through the app specification.

## Create an external listing

1. To create an external listing attached to the share, add the
   [CREATE LISTING](../../sql-reference/sql/create-listing.md) command to the setup script as shown in the following
   example:

   ```sqlexample
   CREATE EXTERNAL LISTING IF NOT EXISTS compliance_listing
   SHARE compliance_share
     AS
     $$
       title: "Compliance Data Share"
       subtitle: "Regulatory compliance reporting data"
       description: "Share compliance and audit data with authorized accounts"
       listing_terms:
         type: "OFFLINE"
     $$
     PUBLISH = FALSE
     REVIEW = FALSE;
   ```

> **Note:**
>
> * Apps can only attach shares, not application packages, to a listing.
> * Apps cannot directly add target accounts or auto-fulfillment configuration
>   to the listing.
> * The listing manifest can only include the following properties: title,
>   subtitle, description, and listing_terms.
> * All new listings must be created in an unpublished state, with both PUBLISH
>   and REVIEW set to FALSE.
> * The listing title and description can be customized based on the consumer
>   info, allowing providers to distinguish data sources.

## Create an app specification for a listing

1. To create an app specification for a listing, follow this example:

   ```sqlexample
   ALTER APPLICATION SET SPECIFICATION shareback_spec
     TYPE = LISTING
     LABEL = 'Compliance Data Sharing'
     DESCRIPTION = 'Share compliance data with provider for regulatory reporting'
     TARGET_ACCOUNTS = 'ProviderOrg.ProviderAccount,AuditorOrg.AuditorAccount'
     LISTING = compliance_listing
     AUTO_FULFILLMENT_REFRESH_SCHEDULE = '720 MINUTE';
   ```

   This command creates an app specification named `shareback_spec` that requests permission to
   share data with the specified target accounts.
2. For cross-region sharing, the `AUTO_FULFILLMENT_REFRESH_SCHEDULE` parameter is required.
   You can set it to one of the following values:

   * `'<num> MINUTE'`: Number of minutes, with a minimum of 10 minutes,
   * and a maximum of 8 days or 11520 minutes (eight days)
   * `'USING CRON <expression> <time_zone>'`: Cron expression with time
     zone

> **Note:**
>
> * The app should only create the app specification after the listing and share objects exist.
> * Each listing can only have one associated app specification.
> * Updating the target accounts creates a new pending request for consumer approval.

## Consumer approval workflow

For more information on how consumers view and approve app specifications, see [Approve app specifications](ui-consumer-app-spec.md).

Consumer approval of a LISTING app specification triggers this workflow:

* Snowflake automatically adds the target accounts to the listing.
* If specified, Snowflake configures the auto-fulfillment refresh schedule.
* The listing becomes visible to the target accounts.
* Data attached to the listing can be queried from the approved accounts.

Consumer rejection of a LISTING app specification triggers this workflow:

* Auto-fulfillment is disabled.
* The listing remains published, allowing the consumer to continue viewing the shared data.
* All target accounts are removed from the listing, with the exception of the current account where the app is installed.
* Auto-fulfillment is disabled.
* Data attached to the listing can no longer be queried by target accounts other than the current account.

## Validating the listing configuration

Apps can validate that the listing has been properly configured after approval by running the following commands:

```sqlexample
-- Check if the app specification is approved:
SHOW APPROVED SPECIFICATIONS IN APPLICATION;

-- Validate the listing configuration:
DESC LISTING compliance_listing;
```

## Best practices for LISTING app specifications

When implementing data sharing through app specification, consider the following
best practices:

* **Share integrity:** Snowflake does not prevent consumers from modifying
  shares created by an application. As a result, the provider is responsible
  for implementing measures to protect the integrity of the underlying shared
  data.
* **Error handling:** Implement proper error handling for cases where the app
  specification is declined or not yet approved.
* **Cross-region considerations:** The app provider is responsible for setting
  refresh schedules that balance data freshness requirements with cost
  considerations. Although Listing Auto Fulfillment costs are billed to the app
  consumer, the provider’s choice of schedule should be cost-aware to minimize
  the unnecessary cost for the app consumer.
* **Compliance:** Document clearly what data you are sharing, and why you are
  sharing it, in the app specification description.

## Using callback functions with LISTING app specifications

Apps can use lifecycle callbacks to respond when consumers approve or decline
listing specifications by adding the following code to the manifest file:

```yaml
lifecycle_callbacks:
  specification_action: callbacks.on_spec_update
```

In the setup script, add the following callback stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE callbacks.on_spec_update (
  name STRING,
  status STRING,
  payload STRING)
RETURNS STRING
LANGUAGE SQL
AS
$$
BEGIN
  IF (name = 'SHAREBACK_SPEC' AND status = 'APPROVED') THEN
    -- Start populating shared tables
    CALL populate_compliance_data();
  ELSEIF (name = 'SHAREBACK_SPEC' AND status = 'DECLINED') THEN
    -- Clean up or notify provider
    CALL cleanup_share_data();
  END IF;
  RETURN 'Processed specification update';
END;
$$;
```

The procedure allows the app to react appropriately to consumer decisions about app’s data sharing request.

## Viewing data shared with the provider

Consumers can view the data that has been shared with the provider using either Snowsight
or by querying the data via SQL or the Snowflake CLI.

### To view data shared with the provider using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. Navigate to the app’s page.
3. In the application page, select the Permissions tab.
4. Under Data sharing, click Review for a requested data share. The following details are displayed:

   * The name of the data share listing
   * The reason that the app is requesting to share data
   * The accounts that have access to the data share
   * The replication schedule for the data share
5. To view the data shared with the provider, click the Preview data button. The shared data is displayed in
   table format, grouped by schema. Note that the shared data is not editable.
6. To view shared data for other schemas or tables, use the drop-down menus.
7. To view data shared with the provider using a worksheet, click the Open in workspace button.

### To view data shared with the provider using a worksheet

The Uniform Listing Locator (ULL) is a unique identifier for app-created listings that allows consumers to query datasets directly. To find the ULL for a specific listing, check the `uniform_listing_locator` column in the output of the `SHOW LISTINGS` or `DESC LISTINGS` commands.

To view data shared with the provider, reference the objects using the ULL which contains a `NATIVEAPP$` prefix following the listing SQL name:

```sqlexample
SHOW SCHEMAS IN DATABASE NATIVEAPP$MY_LISTING;
SHOW OBJECTS IN SCHEMA NATIVEAPP$MY_LISTING.MY_SCHEMA;
SELECT * FROM NATIVEAPP$MY_LISTING.MY_SCHEMA.MY_TABLE;
```

## Limitations

This section describes limitations when using app specifications.

Auditing
:   Snowflake doesn’t offer built-in auditing for data that an app shares
    back to the provider. If a consumer has compliance or regulatory requirements
    that include an audit trail, they must coordinate directly with the provider
    to implement their own separate monitoring solutions.

Sharing from within the application
:   Snowflake does not recommend data sharing with the provider directly from
    within the application because Listing Auto Fulfillment is not currently
    supported for data shared in this manner.

---
title: Request external access integrations (EAIs) with app specifications
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-app-specs-eai.md
section: Native Apps Framework
---

# Request external access integrations (EAIs) with app specifications

This topic describes how to configure a Snowflake Native App to use app specifications
to request access to an external access integration (EAI) in the consumer
account. An EAI allows an app to connect to an endpoint that is external to
Snowflake.

## Access to external endpoints from an app

To access an external endpoint, an app must create a network rule and an
EAI, which uses network rules to restrict access to specific
external network locations. Network rules define the external endpoints that an
app can access.

To configure an app to use an EAI, follow these steps:

* To request privileges from the consumer to create an EAI, use
  [automated granting of privileges](requesting-auto-privs.md).
* Add an EAI to an app.
* Use [application specifications](requesting-app-specs.md) to request permissions from the consumer to connect to
  an external endpoint.

> **Note:**
>
> A single app specification applies to all of the EAIs created by the app.
> Providers can create multiple app specifications for an app; however, this is not required.

## App specification workflow for an EAI

1. Providers configure [automated granting of privileges](requesting-auto-privs.md) for the app.
   This allows consumers to give permission to an app to create the EAI.

   > > **Note:**
   > >
   > > App specifications require that `manifest_version: 2` be set in the manifest file.
2. Providers add the
   CREATE EXTERNAL ACCESS INTEGRATION privilege to the
   manifest file.
3. Providers add SQL statements to the setup script to create the following objects:

   * [Network rule](../../sql-reference/sql/create-network-rule.md)
   * [External access integration](../../sql-reference/sql/create-external-access-integration.md)
   * [App specification](../../sql-reference/sql/alter-application-set-app-spec.md)

   The setup script creates the app specification and other objects when the app is installed or
   upgraded or at runtime.
4. When configuring the app, consumers review and approve the host ports and external services. For more
   information on how consumers view and approve app specifications, see [Approve app specifications](ui-consumer-app-spec.md).

## App specification definition for an EAI

The app specification definition for an EAI contains the following entries:

* `HOST_PORTS`: A list of host ports defined in the network rule that the app requires.
* `PRIVATE_HOST_PORTS`: A list of private host ports that allow private connectivity to
  resources outside Snowflake.

> **Note:**
>
> These values must match the values the app uses to
> [create the network rule](../../sql-reference/sql/create-network-rule.md).

## Set the version of the manifest file

To enable automated granting of privileges for an app, set the version at the beginning of the
manifest file as shown in the following example:

```yaml
manifest_version: 2
```

## Add the CREATE EXTERNAL ACCESS INTEGRATION privilege to the manifest file

The CREATE EXTERNAL ACCESS INTEGRATION privilege allows the app to create an external
access integration during installation or upgrade.

* To configure an app to request the
  CREATE EXTERNAL ACCESS INTEGRATION privilege, add the following code to the
  `privileges` section of the manifest file:

  > ```yaml
  > manifest_version: 2
  > ...
  > privileges:
  >   - CREATE EXTERNAL ACCESS INTEGRATION:
  >       description: "Allows the app to create an EAI to connect to an external service."
  > ...
  > ```

If you set the `manifest_version` to 2 in the manifest file, Snowflake
automatically grants the CREATE EXTERNAL ACCESS INTEGRATION privilege to the app
during installation or upgrade.

## Add a network rule and an EAI to the setup script

EAIs are the Snowflake objects that enable access to specific external network
locations and contain a list of network rules that specify the external
locations that an app can access.

* To create a network rule for an app, add the
  [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) command to the setup script as
  shown in the following example:

  ```sqlexample
  CREATE OR REPLACE NETWORK RULE setup.my_network_rule
  TYPE = HOST_PORT
  VALUE_LIST = ( 'example.com' )
  MODE = EGRESS;
  ```

The `HOST_PORT` and `VALUE_LIST` properties indicate that the network rule must point to a
valid domain, port, or range of ports. When an app is installed or upgraded,
consumers grant permission for the app to use these domains or ports.

## Create an EAI

* To create an EAI for an app, add the
  [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md) command to the
  setup script, as shown in the following example:

  > ```sqlexample
  > CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION my_app_prefix_eai_rule
  >   ALLOWED_NETWORK_RULES = (setup.my_network_rule)
  >   ENABLED = TRUE;
  > ```

> **Note:**
>
> This command creates an EAI in the consumer account. However, it is not usable
> until the consumer approves the app specifications that allow external access
> to the requested host ports.
>
> For more information, see [Approve app specifications](ui-consumer-app-spec.md).

## Creating a user-defined function to access the external endpoint

After the EAI is created, the setup script can create user-defined
functions and stored procedures that use it to connect to the endpoints defined in the
network rule.

The following example shows a user-defined function that uses the
`my_app_prefix_eai_rule` EAI:

```sqlexample
CREATE OR REPLACE FUNCTION setup.EXTERNAL_ACCESS_UDF(hostname STRING)
  RETURNS STRING
  LANGUAGE JAVA
  HANDLER='TestHostNameLookup.compute'
  EXTERNAL_ACCESS_INTEGRATIONS = (my_app_prefix_eai_rule)
  AS
  '
      import java.net.InetAddress;
      import java.net.UnknownHostException;
      class TestHostNameLookup {{
          public static String compute(String hostname) throws Exception {{
              InetAddress addr = null;
              try {
                  addr = InetAddress.getByName(hostname);
              } catch(UnknownHostException ex) {
                  return "Hostname lookup failed";
              }
              return "Hostname lookup successful";
          }
      }
';
GRANT USAGE ON FUNCTION setup.EXTERNAL_ACCESS_UDF(STRING)
  TO APPLICATION ROLE app_public;
```

This function sets the value of the EXTERNAL_ACCESS_INTEGRATIONS to the EAI
created previously.

This function uses the `InetAddress` Java package to look up the hostname passed to
the procedure. The hostname provided must match one of the values provided in the `VALUE_LIST`
property of the network rules used by the EAI.

## Creating an app specification for an EAI

The following example shows how to create an app specification for an EAI:

```sqlexample
ALTER APPLICATION SET SPECIFICATION eai_app_spec
  TYPE = EXTERNAL_ACCESS
  LABEL = 'Connection to an external API'
  DESCRIPTION = 'Access an API that exists outside Snowflake'
  HOST_PORTS = ('example.com')
```

This command creates an app specification named `eai_app_spec`.

## Approve the app specification in the consumer account

After the provider configures the app to create the network rule, EAI, and
app specification, consumers can view the app specification and approve or
decline it as appropriate when configuring the app. For more information, see
[Approve app specifications](ui-consumer-app-spec.md).

---
title: Request global privileges from consumers
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-privs.md
section: Native Apps Framework
---

# Request global privileges from consumers

This topic describes how providers can configure a Snowflake Native App to request global
privileges from a consumer after the consumer installs the app. These privileges allow
the Snowflake Native App to perform tasks in the consumer account, for example creating a warehouse or
a database.

If an app needs to perform actions or create objects outside the context of the Snowflake Native App,
the consumer must grant the privileges to allow the application to do so.

## Workflow for requesting global privileges from the consumer

> **Note:**
>
> Refer to [Create a user interface to request privileges and references](requesting-ui.md) for information on creating a user interface that
> allows consumers to grant privileges using Snowsight.

To configure a Snowflake Native App to request global privileges providers use the following workflow:

1. Determine the privileges required by the app.

   For example, if an app needs to create a database in the consumer account, the provider must
   request that the consumer grant the CREATE DATABASE global privilege to the application.

   Refer to Privileges the provider can request from the consumer for details on the global privileges
   an app can request.
2. Add the required privileges to the manifest file. See
   Add a privilege request to the manifest file for details.

After installing the Snowflake Native App, the consumer performs the following:

1. Review the global privileges required by the application. See
   View the privileges requested by a Snowflake Native App for more information.
2. Grant the global privileges on the application. See Grant privileges to an application for more
   information.

## Privileges the provider can request from the consumer

The Snowflake Native App Framework allows providers to request the following
[global privileges](../../user-guide/security-access-control-privileges.md) in the consumer account:

* BIND SERVICE ENDPOINT
* CREATE COMPUTE POOL
* CREATE DATABASE
* CREATE WAREHOUSE
* EXECUTE ALERT
* EXECUTE TASK
* EXECUTE MANAGED TASK
* IMPORTED PRIVILEGES ON SNOWFLAKE DB
* MANAGE WAREHOUSES
* READ SESSION

> **Note:**
>
> Granting IMPORTED PRIVILEGES ON SNOWFLAKE DB allows the Snowflake Native App to see information about usage and costs
> associated with the consumer account. You should ensure that consumers are aware of this
> when publishing your Snowflake Native App.

## Add a privilege request to the manifest file

The following example shows how to add the EXECUTE TASK privilege to the manifest file:

```yaml
privileges:
  - EXECUTE TASK:
    description: "Privilege to run tasks within the consumer account"
```

A provider can add any of the supported privileges
in the same manner.

## View the privileges requested by a Snowflake Native App

When a provider specifies a privilege in the manifest file, the privilege requests are
included as part of the installed Snowflake Native App. The consumer can view the privilege requests
after installing the app.

To view the global privileges required by an app, run the [SHOW PRIVILEGES](../../sql-reference/sql/show-privileges.md)
command as shown in the following example:

```sqlexample
SHOW PRIVILEGES IN APPLICATION hello_snowflake_app;
```

## Grant privileges to an application

After determining the privileges required by a Snowflake Native App, the consumer must then grant
these privileges to the app.

To grant the global privilege request in the example above, the consumer runs the
[GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command as shown in the following example:

```sqlexample
GRANT CREATE DATABASE ON ACCOUNT TO APPLICATION hello_snowflake_app;
```

To grant the IMPORT privilege on the MYDATABASE database, run the following command:

```sqlexample
GRANT IMPORTED PRIVILEGES ON DATABASE MYDATABASE TO APPLICATION hello_snowflake_app;
```

---
title: Request references and object-level privileges from consumers
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-refs.md
section: Native Apps Framework
---

# Request references and object-level privileges from consumers

This topic describes how providers can configure a Snowflake Native App to request access to objects in
the consumer account that exist outside the app.

## About references

In some contexts an installed Snowflake Native App needs to access existing objects in
the consumer account that exist outside the app. For example, an app
might need to access existing tables in a consumer database.

In this context, it’s not sufficient for the consumer to grant access on an object to
the app because the app cannot determine the name of the schema and object in
the consumer account.

To allow the app to access existing objects outside the app, the Snowflake Native App Framework
uses references that enable the customer to specify the name and schema for an object
and enable access to the object.

### Workflow for defining references in the consumer account

To request a reference and object-level privilege, the provider performs the following when
developing and publishing a Snowflake Native App:

1. Determine which objects require references and their corresponding privileges.
2. [Define the references in the manifest file](requesting-privs.md).
3. Add a stored procedure in the setup script to handle the callback for each reference
   defined in the manifest file.

After installing the Snowflake Native App, the consumer performs the following:

1. View the references required by the Snowflake Native App.
2. Create the reference by calling the SYSTEM$REFERENCE system function.
3. Run the callback stored procedure passing the id of the reference.

After the consumer runs the callback stored procedure, the Snowflake Native App can access the
requested object.

This workflow outlines the process where the consumer creates the reference
manually. Refer to [Create a user interface to request privileges and references](requesting-ui.md) for information on creating
a user interface to allow consumers to create references and grant privileges using Snowsight.

### Object types and privileges that a reference can contain

The following table lists the object types that a reference can include and the privileges
allowed for each object:

| Object Type | Privileges Allowed |
| --- | --- |
| TABLE | SELECT, INSERT, UPDATE, DELETE, TRUNCATE, REFERENCES |
| VIEW | SELECT, REFERENCES |
| EXTERNAL TABLE | SELECT, REFERENCES |
| FUNCTION | USAGE |
| PROCEDURE | USAGE |
| WAREHOUSE | MODIFY, MONITOR, USAGE, OPERATE |
| API INTEGRATION | USAGE |
| EXTERNAL ACCESS INTEGRATION | USAGE |
| SECRET | USAGE, READ |

## Define a reference in the manifest file

The following example shows how to define a reference in the manifest file for a table in the consumer account that exists outside the
APPLICATION object:

```yaml
references:
  - consumer_table:
      label: "Consumer table"
      description: "A table in the consumer account that exists outside the APPLICATION object."
      privileges:
        - INSERT
        - SELECT
      object_type: TABLE
      multi_valued: false
      register_callback: config.register_single_reference
```

This example defines an reference named `consumer_table` that requires the INSERT and SELECT
privileges on a table in the consumer account. The `register_callback` property specifies a stored
procedure used to bind a consumer table to this reference definition.

Use `multi_valued` to bind multiple consumer objects to the same reference. When this property is specified,
the same operations are performed on objects with a single value reference. The property can also be used with
objects with multi-valued references. See Supported reference functions to learn more about Snowflake Native App Framework reference operations.

### Remove a reference definition

> **Note:**
>
> Snowflake recommends against removing a reference definition from the manifest file in a new version of an app. If you need to
> remove a defined reference, update any code that uses the removed reference in the same version release and notify the consumer
> in the README file.

If an app defines a reference then later deletes the reference definition from a subsequent version of the app, calling any function or
procedure that still uses the deleted reference results in an error for consumers. For example, the manifest file for version V1 of app
`my_app` includes a reference definition for REF_TO_TABLE and a stored procedure CREATE_VIEW_FROM_TABLE that uses the table reference
REF_TO_TABLE to create a view VIEW_SELECT_FROM_DEFINED_REF.

In version V2 of `my_app`, the reference definition for REF_TO_TABLE is removed from the manifest file. When a consumer upgrades
their installed app `my_app` to version V2, calling the CREATE_VIEW_FROM_TABLE procedure results in the following error:

```output
Reference definition '<REF_DEF_NAME>' cannot be found in the current version of the application '<APP_NAME>'
```

## Create a callback stored procedure for a reference

After defining a reference in the manifest file, a provider must add a
stored procedure to the setup script to register the callback for the
reference.

The following example shows a stored procedure used to handle a callback for the reference
shown in Define a reference in the manifest file:

```sqlexample
CREATE APPLICATION ROLE app_admin;

CREATE OR ALTER VERSIONED SCHEMA config;
GRANT USAGE ON SCHEMA config TO APPLICATION ROLE app_admin;

CREATE PROCEDURE CONFIG.REGISTER_SINGLE_REFERENCE(ref_name STRING, operation STRING, ref_or_alias STRING)
  RETURNS STRING
  LANGUAGE SQL
  AS $$
    BEGIN
      CASE (operation)
        WHEN 'ADD' THEN
          SELECT SYSTEM$SET_REFERENCE(:ref_name, :ref_or_alias);
        WHEN 'REMOVE' THEN
          SELECT SYSTEM$REMOVE_REFERENCE(:ref_name, :ref_or_alias);
        WHEN 'CLEAR' THEN
          SELECT SYSTEM$REMOVE_ALL_REFERENCES(:ref_name);
      ELSE
        RETURN 'unknown operation: ' || operation;
      END CASE;
      RETURN NULL;
    END;
  $$;

GRANT USAGE ON PROCEDURE CONFIG.REGISTER_SINGLE_REFERENCE(STRING, STRING, STRING)
  TO APPLICATION ROLE app_admin;
```

This example creates a stored procedure named `REGISTER_SINGLE_REFERENCE` that calls
a system function to perform a specific operation on a reference that is passed as an
argument to the stored procedure.

> **Note:**
>
> Because the stored procedure uses the SYSTEM$SET_REFERENCE system function, the stored procedure
> only works for a reference with a single value in the description. To associate a reference with
> multiple values, use the SYSTEM$ADD_REFERENCE system function.

## Create a callback stored procedure for requesting object configuration

For some object types, a provider must add a stored procedure to the setup script to provide additional
configuration. This callback is used when consumers allow references using Snowsight.

The following example shows how to define a configuration callback stored procedure for the reference
shown in Define a reference in the manifest file:

```sqlexample
CREATE OR REPLACE CONFIG.GET_CONFIGURATION_FOR_REFERENCE(ref_name STRING)
  RETURNS STRING
  LANGUAGE SQL
  AS
  $$
  BEGIN
    CASE (ref_name)
      WHEN 'CONSUMER_EXTERNAL_ACCESS' THEN
        RETURN '{
          "type": "CONFIGURATION",
          "payload":{
            "host_ports":["google.com"],
            "allowed_secrets" : "LIST",
            "secret_references":["CONSUMER_SECRET"]}}';
      WHEN 'CONSUMER_SECRET' THEN
        RETURN '{
          "type": "CONFIGURATION",
          "payload":{
            "type" : "OAUTH2",
            "security_integration": {
              "oauth_scopes": ["https://www.googleapis.com/auth/analytics.readonly"],
              "oauth_token_endpoint": "https://oauth2.googleapis.com/token",
              "oauth_authorization_endpoint":
                "https://accounts.google.com/o/oauth2/auth"}}}';
     END CASE;
     RETURN '';
   END;
   $$;

GRANT USAGE ON PROCEDURE CONFIG.GET_CONFIGURATION_FOR_REFERENCE(STRING)
  TO APPLICATION ROLE app_admin;
```

This example creates a stored procedure named `GET_CONFIGURATION_FOR_REFERENCE` that returns
a JSON-formatted configuration that is used to build a reference of type EXTERNAL ACCESS INTEGRATION or
SECRET reference. The entries in the CASE statement should map to the reference names in the manifest file.

> **Note:**
>
> This callback function is required by references of type EXTERNAL ACCESS INTEGRATION and SECRET.
> It is only applicable to these types of references.

## View the references defined in an application

When a provider defines references in the manifest file, the references are included as part of the installed Snowflake Native App.

To view the references defined for a Snowflake Native App, run the [SHOW REFERENCES](../../sql-reference/sql/show-references.md)
command as shown in the following example:

```sqlexample
SHOW REFERENCES IN APPLICATION hello_snowflake_app;
```

## Bind an object to the application

After viewing the reference definition for a Snowflake Native App, the consumer creates a reference by running
the SYSTEM$REFERENCE system function as shown in the following example:

```sqlexample
SELECT SYSTEM$REFERENCE('table', 'db1.schema1.table1', 'persistent', 'select', 'insert');
```

This command returns an identifier for the reference. The consumer can pass the identifier to the
callback stored procedure for the reference as shown in the following example:

```sqlexample
CALL app.config.register_single_reference(
  'consumer_table' , 'ADD', SYSTEM$REFERENCE('TABLE', 'db1.schema1.table1', 'PERSISTENT', 'SELECT', 'INSERT'));
```

In this example, `consumer_table` is the name of the reference defined in the manifest file.
After the consumer runs the stored procedure that associates the reference, the Snowflake Native App can access the
table in the consumer account.

The callback stored procedure in the previous section
calls the SYSTEM$SET_REFERENCE system function as shown in the following example:

```sqlexample
SELECT SYSTEM$SET_REFERENCE(:ref_name, :ref_or_alias);
```

Refer to Supported reference functions for other system functions related
to references.

## Considerations when using references

Snowflake recommends that you do not modify reference definitions across versions.
To update a reference definition in a new version, for example, to change the privileges
to SELECT, INSERT from SELECT, you must define a new reference definition with a different name
The updated Snowflake Native App can use this new reference in the new version of the app.

To embed a reference within another object, for example to assign a reference to a variable,
the reference must already be bound to an object in the consumer account. For example, you
cannot create a task unless you first bind the reference to the consumer warehouse.

## Examples of using references in a Snowflake Native App

The following sections provide examples of using references in different contexts.

> **Note:**
>
> The `reference()` functions in the following examples can only be called in a stored procedure
> in the APPLICATION object.

### Run queries using a reference

The following examples show how to run queries using references:

```sqlexample
SELECT * FROM reference('consumer_table');
```

```sqlexample
SELECT reference('encrypt_func')(t.c1) FROM consumer_table t;
```

### Call a stored procedure using a reference

The following example shows how to call a stored procedure using a reference:

```sqlexample
CALL reference('consumer_proc')(11, 'hello world');
```

### Run DML commands using a reference

The following examples show how to modify data in a table using references:

```sqlexample
INSERT INTO reference('data_export')(C1, C2)
  SELECT T.C1, T.C2 FROM reference('other_table')
```

```sqlexample
COPY INTO reference('the_table') ...
```

### Run the DESCRIBE command using a reference

The following example shows how to run the DESCRIBE operation using a reference:

```sqlexample
DESCRIBE TABLE reference('the_table')
```

### Use references in a task

```sqlexample
CREATE TASK app_task
  WAREHOUSE = reference('consumer_warehouse')
  ...;

ALTER TASK app_task SET WAREHOUSE = reference('consumer_warehouse');
```

### Use references in a view definition

```sqlexample
CREATE VIEW app_view
  AS SELECT reference('function')(T.C1) FROM reference('table') AS T;
```

### Use references in a function body

```sqlexample
CREATE FUNCTION app.func(x INT)
  RETURNS STRING
  AS $$ select reference('consumer_func')(x) $$;
```

### Use references in an external function

```sqlexample
CREATE EXTERNAL FUNCTION app.func(x INT)
  RETURNS STRING
  ...
  API_INTEGRATION = reference('app_integration');
```

### Use references in a function or procedure

```sqlexample
CREATE FUNCTION app.func(x INT)
  RETURNS STRING
  ...
  EXTERNAL_ACCESS_INTEGRATIONS = (reference('consumer_external_access_integration'), ...);
  SECRETS = ('cred1' = reference('consumer_secret'), ...);
```

> **Note:**
>
> Consumers cannot directly call functions or stored procedures that use references
> to external access integrations or secrets.
>
> However, other components of the app, including Streamlit apps, tasks, and other functions and stored procedures, can use these references.

To allow consumers to call a function or stored procedure that uses references to external
access integrations or secrets, providers can do the following:

1. In the setup script, create a function or stored procedure that uses a reference, for
   example: `function_with_eai_secret_reference` as shown in the following example:

   ```sqlexample
   CREATE FUNCTION app_schema.function_with_eai_secret_reference(arg1 STRING)
     RETURNS string
     LANGUAGE python
     RUNTIME_VERSION = 3.11
     HANDLER = 'my_handler'
     EXTERNAL_ACCESS_INTEGRATIONS = (reference('eai_ref'))
     PACKAGES = ('snowflake-snowpark-python','requests')
     SECRETS = ('cred' = reference('secret_ref') )
     ...
   AS
   $$
   ```
2. In the setup script, create a wrapper stored procedure named `my_wrapper_procedure`.

   ```sqlexample
   CREATE OR REPLACE PROCEDURE app_schema.my_wrapper_procedure(arg1 STRING)
     RETURNS STRING
     LANGUAGE SQL
     AS
     $$
       BEGIN
           ...
       END;
     $$;
   ```

> > **Note:**
> >
> > The wrapper must be a stored procedure, not a function.

1. Within `my_wrapper_procedure`, add a call to
   `function_with_eai_secret_reference` as shown in the following example:

   > ```sqlexample
   > BEGIN
   >   RETURN app_schema.function_with_eai_secret_reference(arg1);
   > END;
   > ```
2. Grant `my_wrapper_procedure` to an application role to allow consumers to call the
   procedure as shown in the following example:

   ```sqlexample
   GRANT USAGE ON PROCEDURE app_schema.my_wrapper_procedure(STRING)
     TO APPLICATION ROLE app_role;
   ```

After the app is installed, consumers can call `my_wrapper_procedure` which then calls
`function_with_eai_secret_reference`.

### Use references in a policy

```sqlexample
CREATE ROW ACCESS POLICY app_policy
  AS (sales_region varchar) RETURNS BOOLEAN ->
  'sales_executive_role' = reference('get_sales_team')
    or exists (
      select 1 from reference('sales_table')
        where sales_manager = reference('get_sales_team')()
        and region = sales_region
      );
```

## JSON format for the configuration callback response

The configuration callback function returns a response in JSON format. The JSON
format returned is different for external access integration and secret references.

### JSON format for external access integration

For EXTERNAL ACCESS INTEGRATION references, the expected structure of the JSON response is:

```sqlexample
{
  "type": "CONFIGURATION",
  "payload": {
    "host_ports": ["host_port_1", ...],
    "allowed_secrets": "NONE|ALL|LIST",
    "secret_references": ["ref_name_1", ...]
  }
}
```

* `host_ports`

  > An array of strings. Each value must be a valid domain.
  >
  > Optionally, it can also include a port. The valid port range is 1 to 65535, inclusive.
  > If you do not specify a port, it defaults to 443. If an external network location supports
  > dynamic ports, you need to specify all possible ports.
  >
  > + To allow access to all ports, specify the port as 0; for example, `example.com:0`.
  >
  > These values are used to create an egress network rule for the external access integration.
  > See [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) for more information.
* `allowed_secrets`

  > Specifies the secrets allowed by the EXTERNAL ACCESS INTEGRATION reference. Valid values are:
  >
  > + `NONE`: Secrets are not allowed.
  > + `ALL`: Allows any existing secret.
  > + `LIST`: Allows a specific set of secrets as specified in the `secret_references`
  >   property.
  >
  > The values of the `allowed_secrets` are used to create the external access integration.
  > See [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md) for more information.
* `secret_references`:

  > Specifies a list of secret references that are allowed by the external access integration.
  >
  > The values specified here must be the same as the secret references defined in the manifest.
  >
  > This property is only applicable if the `allowed_secrets` is set to `LIST`. In this
  > context, `secret_references` is required.

### JSON format for secret references

For secret references, the expected structure of the JSON response is:

```json
{
  "type": "CONFIGURATION",
    "payload": {
            "type": "<payload_type>",
            "security_integration": {
                    "oauth_scopes": ["scope_1", "scope_2"],
                    "oauth_token_endpoint" : "token_endpoint",
                    "oauth_authorization_endpoint" : "auth_endpoint"
            }
    }
}
```

* `payload.type`
  :   The type of secret. Valid values are:

      + `OAUTH2`: Specifies the secret to use with the OAuth2 grant flow.
      + `GENERIC_STRING`: Specifies a generic string secret.
      + `PASSWORD`: Specifies a password secret.

      See [CREATE SECRET](../../sql-reference/sql/create-secret.md) for more information.
* `payload.security_integration`
  :   Specifies the values required to configure the
      [API Authentication](../../sql-reference/sql/create-security-integration-api-auth.md) for an OAuth secret.

### JSON format error responses

In case of errors or if the reference is not yet ready for configuration, the expected structure of
the error response is:

```json
{
  "type": "ERROR",
  "payload":{
    "message": "The reference is not available for configuration ..."
 }
}
```

* `message`:
  The error message from the application that is displayed in Snowsight.

## Supported reference functions

The Snowflake Native App Framework supports the following functions to perform different operations related to references:

* [SYSTEM$ADD_REFERENCE](../../sql-reference/functions/system_add_reference.md)
* [SYSTEM$GET_ALL_REFERENCES](../../sql-reference/functions/system_get_all_references.md)
* [SYSTEM$GET_REFERENCED_OBJECT_ID_HASH](../../sql-reference/functions/system_get_referenced_object_id_hash.md)
* [SYSTEM$REMOVE_ALL_REFERENCES](../../sql-reference/functions/system_remove_all_references.md)
* [SYSTEM$REMOVE_REFERENCE](../../sql-reference/functions/system_remove_reference.md)
* [SYSTEM$SET_REFERENCE](../../sql-reference/functions/system_set_reference.md)

---
title: Request security integrations with app specifications
source: https://docs.snowflake.com/en/developer-guide/native-apps/requesting-app-specs-sec-integ.md
section: Native Apps Framework
---

# Request security integrations with app specifications

This topic describes how to configure a Snowflake Native App to use app specifications to request
access to security integrations in the consumer account. Security integrations allow
an app to connect to third-party authentication providers such as OAuth.

## Access third-party authentication providers from an app

To implement a third-party authentication service, Snowflake provides security integrations. A
security integration allows an app to connect to a third-party authentication service such as OAuth.

> > **Note:**
> >
> > Snowflake Native Apps only support security integrations of type `API_AUTHENTICATION`. For more
> > information, see [CREATE SECURITY INTEGRATION (External API Authentication)](../../sql-reference/sql/create-security-integration-api-auth.md).

## App specification workflow for security integrations

The general workflow for configuring an app to use a security integration is as follows:

1. Providers configure [automated granting of privileges](requesting-auto-privs.md) for the app.
   This grants the app privileges to create a security integration.

   > > **Note:**
   > >
   > > App specifications require that `manifest_version: 2` be set in the manifest file.
2. Providers add the
   CREATE SECURITY INTEGRATION privilege to the
   manifest file.
3. Providers add SQL statements to the setup script to create the following objects as required:

   * [Security integration](../../sql-reference/sql/create-security-integration.md)
   * [App specification](../../sql-reference/sql/alter-application-set-app-spec.md)

   Providers can add these commands directly in the setup script, which causes these objects to
   be created when the app is installed. Alternatively, these commands can be added to a stored
   procedure that is called at runtime to create these objects.
4. Consumers approve information related to OAuth integration when configuring the app. For more
   information on how consumers view and approve app specifications, see [Approve app specifications](ui-consumer-app-spec.md).

## App specification definition for security integrations

For a security integration, the [app specification definition](requesting-app-specs.md) contains the properties required to connect
to a third-party provider. For OAuth, the app specification definition depends on the
OAuth type. The following table lists the app specification definition for each type:

| Security integration type | Values defined in the app specification |
| --- | --- |
| `CLIENT_CREDENTIALS` | * `OAUTH_TOKEN_ENDPOINT` (required) * `OAUTH_ALLOWED_SCOPES` (required) |
| `AUTHORIZATION_CODE_GRANT` | * `OAUTH_TOKEN_ENDPOINT` (required) * `OAUTH_AUTHORIZATION_ENDPOINT` (optional) |
| `JWT` | * `OAUTH_TOKEN_ENDPOINT` (required) * `OAUTH_AUTHORIZATION_ENDPOINT` (optional) |

## Set the version of the manifest file

To enable automated granting of privileges for an app, set the version at the beginning of the
manifest file as shown in the following example:

```yaml
manifest_version: 2
```

## Add the CREATE SECURITY INTEGRATION privilege to the manifest file

* To allow an app to create a security integration, add the
  `CREATE SECURITY INTEGRATION` privilege
  to the manifest file, as shown in the following example:

  > ```yaml
  > manifest_version: 2
  > ...
  > privileges:
  >   - CREATE SECURITY INTEGRATION:
  >       description: "Allows the app to create security integrations to access external auth providers"
  > ...
  > ```

If you set the `manifest_version` to 2 in the manifest file, Snowflake automatically grants
the CREATE SECURITY INTEGRATION privilege to the app during installation or upgrade.

## Add a security integration to the setup script

Security integrations allow an app to connect to a third-party authentication service
like OAuth. To create a security integration for an app, add the
[CREATE SECURITY INTEGRATION (External API Authentication)](../../sql-reference/sql/create-security-integration-api-auth.md) command to the
setup script as shown in the following example:

```sqlexample
CREATE SECURITY INTEGRATION external_oauth_provider
  TYPE = API_AUTHENTICATION
  AUTH_TYPE = OAUTH2
  OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
  OAUTH_CLIENT_ID = 'YOUR_CLIENT_ID'
  OAUTH_CLIENT_SECRET = 'YOUR_CLIENT_SECRET'
  OAUTH_GRANT = 'CLIENT_CREDENTIALS'
  OAUTH_TOKEN_ENDPOINT = 'https://login.microsoftonline.com/YOUR_TENANT_ID/oauth2/v2.0/token'
  OAUTH_ALLOWED_SCOPES = ('https://graph.microsoft.com/.default')
  ENABLED = TRUE;
```

This example shows how to create a security integration to connect to Microsoft Graph using
OAuth with client credentials. For other supported methods of connecting to an OAuth provider, see
[CREATE SECURITY INTEGRATION (External API Authentication)](../../sql-reference/sql/create-security-integration-api-auth.md).

## Create an app specification for a security integration

The following example shows how to create an app specification for a security integration
using the CLIENT_CREDENTIALS OAuth type:

```sqlexample
ALTER APPLICATION SET SPECIFICATION oauth_app_spec
  TYPE = SECURITY_INTEGRATION
  LABEL = 'Connection to an external OAuth provider'
  DESCRIPTION = 'Integrates an external identity provider in the app'
  OAUTH_TYPE = 'CLIENT_CREDENTIALS'
  OAUTH_TOKEN_ENDPOINT = 'https://login.microsoftonline.com/YOUR_TENANT_ID/oauth2/v2.0/token'
  OAUTH_ALLOWED_SCOPES = ('https://graph.microsoft.com/.default');
```

> **Note:**
>
> The values you provide when creating the app specification must be the same as those
> you use when creating the security integration
> in the setup script.

For information on using other OAuth types, see [ALTER APPLICATION SET SPECIFICATION](../../sql-reference/sql/alter-application-set-app-spec.md).

## Approve the app specification in the consumer account

After the provider configures the app to create the security integration and app specification, consumers can view the app specification and approve or decline it as necessary when configuring the app. For more information, see [Approve app specifications](ui-consumer-app-spec.md).

---
title: Reset configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/reset_configuration.md
section: Native Apps Framework
---

# Reset configuration

Resetting the configuration is possible only in the wizard phase, and it can be done by calling the `PUBLIC.RESET_CONFIGURATION()` procedure.
This procedure resets all prerequisites as not completed, deletes previously saved configuration, and sets the connector status to `INSTALLED`.
It can be used if there is a need to reconfigure the connector (that is, go through the [Prerequisites](prerequisites.md), [Connection configuration](connection_configuration.md), or [Connector configuration](connector_configuration.md) step again).
Connectors can only be reconfigured if you have not completed [Finalize configuration](finalize_configuration.md) step.
Connector reconfiguration can be customized using SQL or `ResetConfigurationHandlerBuilder`.

Only users assigned the `ADMIN` application role can call the `PUBLIC.RESET_CONFIGURATION()` procedure.

Resetting configuration consists of the following configurable phases, which by default have no effect:

1. Status validation
2. State validation
3. Internal callback
4. SDK callback
5. Status update

## Requirements

Configuration reset requires executing the following SQL files during native app installation:

* `core.sql`
* `configuration/app_config.sql`
* `configuration/prerequisites.sql`
* `configuration/connector_configuration.sql`
* `configuration/connection_configuration.sql`
* `configuration/reset_configuration.sql`

## Status validation

To reset the configuration, the connector needs to be in the `CONFIGURING` status. The configuration status needs to be equal to one of the following:
`INSTALLED`, `PREREQUISITES_DONE`, `CONFIGURED`, or `CONNECTED`. For the complete diagram of status transitions, see [Connector flow](overview.md).

Validation cannot be overwritten using `ResetConfigurationHandlerBuilder` or overwriting stored procedure.
However, it is possible to implement a custom handler, which will not have this kind of validation.

## State validation

The state validation phase is customizable and, by default, executes the `PUBLIC.RESET_CONFIGURATION_VALIDATE()` procedure, which returns `'response_code': 'OK'`.
This procedure can be customized by replacing the procedure using SQL or by implementing the `ResetConfigurationValidator` interface.

## Internal callback

The internal callback phase is customizable and, by default, executes the `PUBLIC.RESET_CONFIGURATION_INTERNAL()` procedure, which returns `'response_code': 'OK'`.
This procedure supports executing custom logic required when resetting the configuration. For example, deleting custom configuration.
This procedure can be customized by replacing the procedure using SQL or by implementing the `ResetConfigurationCallback` interface.

## SDK callback

SDK callback is used to update the SDK-controlled components.
This step consists of the following processes, which are executed as a single transaction:

1. Set all prerequisites as not completed
2. Delete connector configuration
3. Delete connections configuration

### Set all prerequisites as not completed

During this step the `IS_COMPLETED` column is set to false for all records in the internal `PREREQUISITES` table.

### Delete connector configuration

During this step, `connector_configuration` is deleted from the internal `APP_CONFIG` table.

### Delete connector configuration

During this step, `connection_configuration` is deleted from the internal `APP_CONFIG` table.

The SDK callback cannot be overwritten using the `ResetConfigurationHandlerBuilder` or overwriting the stored procedure.
It is possible to implement a custom handler, which will not have this callback.

> **Note:**
>
> The `PUBLIC.CONNECTOR_CONFIGURATION` view returns the current configuration from the internal `APP_CONFIG` table.
> The `PUBLIC.PREREQUISITES` view returns prerequisites from the internal `PREREQUISITES` table. Both views are available to the `ADMIN` and `VIEWER` application roles.

## Status update

When complete, this step sets the internal status of the connector to:

```json
{
    "status": "CONFIGURING",
    "configurationStatus": "INSTALLED"
}
```

## Response

### Successful response

If the procedure completes successfully it returns an `OK` response code as shown below:

```json
{
  "response_code": "OK"
}
```

### Error response

On error, the following response is returned:

```json
{
  "response_code": "<ERROR_CODE>",
  "message": "<error message>"
}
```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - Invalid connector status. Expected status: `[CONFIGURING]`.
* `INVALID_CONNECTOR_CONFIGURATION_STATUS` - Invalid connector status. Expected statuses: `[INSTALLED, PREREQUISITES_DONE, CONFIGURED, CONNECTED]`.
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive.
* `PROCEDURE_NOT_FOUND` - The procedure that was called does not exist.
* `UNKNOWN_SQL_ERROR` - This error occurs when something unexpected happen when calling internal procedures.
* `INVALID_RESPONSE` - This error occurs when response received from internal procedure does not contain `response_code` or an error response does not contain `message`, but contains `response_code`.
* `UNKNOWN_ERROR` - It means that something unexpected went wrong (message of thrown exception is forwarded).
* Custom error codes received from `RESET_CONFIGURATION_INTERNAL` and `RESET_CONFIGURATION_VALIDATE` procedures - defined by the connector developer.

---
title: Reset configuration reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/reset_configuration_reference.md
section: Native Apps Framework
---

# Reset configuration reference

Details about objects and procedures associated with the reset configuration feature.

## Database objects and procedures

The following database objects are created using the `configuration/reset_configuration.sql`.

### PUBLIC.RESET_CONFIGURATION()

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java `ResetConfigurationHandler.resetConfiguration` handler.

### PUBLIC.RESET_CONFIGURATION_VALIDATE()

Used to provide additional connector specific validation. By default returns `'response_code': 'OK'`.
It is invoked by the default `ResetConfigurationValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.RESET_CONFIGURATION_INTERNAL()

Used to provide additional connector specific logic. By default returns `'response_code': 'OK'`.
It is invoked by the default `ResetConfigurationCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Configuration reset is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `prerequisites.sql` (See [Prerequisites SQL Reference](prerequisites_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))
* `configuration/connector_configuration.sql` (See: [Connector configuration reference](connector_configuration_reference.md))

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.configuration.reset` package and some common components are tightly connected with the above procedures:

* `ResetConfigurationHandler`
* `ResetConfigurationValidator`
* `ResetConfigurationCallback`
* `ResetConfigurationSdkCallback`
* `ResetConfigurationHandlerBuilder`
* `ConnectorStatusService`
* `ConfigurationRepository`
* `PrerequisitesRepository`
* `TransactionManager`
* `ConnectorErrorHandler`

## Custom handler

Handlers can be customized by being completely replaced using SQL or by implementing Java interfaces.

### Replacing using SQL

The following components can be replaced using SQL.

#### Handler

To provide a custom implementation of `ResetConfigurationHandler` the `PUBLIC.RESET_CONFIGURATION` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.RESET_CONFIGURATION()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.snowflake.connectors.application.configuration.reset.CustomResetConfigurationHandler.resetConfiguration';

GRANT USAGE ON PROCEDURE PUBLIC.RESET_CONFIGURATION() TO APPLICATION ROLE ADMIN;
```

#### Internal procedure

The `INTERNAL` procedure can also be customized through SQL.

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.RESET_CONFIGURATION_INTERNAL()
    RETURNS VARIANT
LANGUAGE SQL
EXECUTE AS OWNER
AS
BEGIN
    -- SOME CUSTOM LOGIC

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
END;
```

It can also invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.RESET_CONFIGURATION_INTERNAL()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.snowflake.connectors.application.configuration.reset.CustomResetConfigurationCallback.resetConfiguration';
```

### Builder approach

`ResetConfigurationHandler` can be customized using `ResetConfigurationHandlerBuilder`. This builder allows the developer to provide custom implementations of the following interfaces:

* `ResetConfigurationValidator`
* `ResetConfigurationCallback`
* `ConnectorErrorHelper`

Not all interfaces need to be implemented, in which case the default implementation provided by the SDK is used.

The following example shows how `ResetConfigurationValidator` can be customized.

```java
class CustomResetConfigurationValidator implements ResetConfigurationValidator {

    @Override
    public ConnectorResponse validate() {
        // CUSTOM VALIDATION LOGIC
        return ConnectorResponse.success();
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the SQL definition of the PUBLIC.RESET_CONFIGURATION procedure
    public static Variant resetConfiguration(Session session) {
        // Using the builder
        var handler = ResetConfigurationHandler.builder(session)
            .withValidator(new CustomResetConfigurationValidator())
            .build();
        return handler.resetConfiguration().toVariant();
    }
}
```

---
title: Resource definition and ingestion SQL reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/resource_definition_and_ingestion_processes_reference.md
section: Native Apps Framework
---

# Resource definition and ingestion SQL reference

## STATE.RESOURCE_INGESTION_DEFINITION

This table is used to persist the data about configured resources. The data consists mostly of semi-structured variants.
The definition can be found in the file `ingestion/resource_ingestion_definition.sql`.

The table contains the following columns:

| Column name | Description |
| --- | --- |
| `id` | Id of Resource Ingestion Definition. |
| `name` | Name of the Resource Ingestion Definition that can be shown on UI. |
| `enabled` | Information whether the ingestion is enabled. |
| `parent_id` | Id of parent’s Resource Ingestion Definition, it allows to create resource hierarchy which can be ingested |
| `resource_id` | Set of properties that are needed to define a resource in a specific connector. They identify a resource in a source system. They are set by a user. |
| `resource_metadata` | Set of additional properties that describe a resource. They can be fetched automatically or calculated by a connector. Optional. |
| `ingestion_configurations` | Set of configuration properties that describe how the resource should be ingested from the source system. Structure of this field is described in the next table. |
| `updated_at` | UTC timestamp representing recent update. |

The `ingestion_configuration` property should follow the below schema:

| Field name | Description |
| --- | --- |
| `id` | Id of Ingestion Configuration. Unique for given Resource Ingestion Definition |
| `ingestion_strategy` | Strategy of given ingestion. Values: snapshot, incremental |
| `custom_configuration` | Set of connector-specific ingestion properties |
| `schedule_type` | Type of schedule. Values: interval, cron |
| `schedule_definition` | String defining a schedule. e.g. 30m, 4h, 1d for interval. Cron expression in case of cron. |
| `destination` | Set of properties that describe where ingested data for a given resource should be stored. |

### Related Java objects

To interact with the `RESOURCE_INGESTION_DEFINITION` table the following Java objects are useful:

* [ResourceIngestionDefinitionRepository](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/ResourceIngestionDefinitionRepository.md)
* [ResourceIngestionDefinition](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/ResourceIngestionDefinition.md)
* [IngestionConfiguration](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/IngestionConfiguration.md)
* [IngestionStrategy](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/IngestionStrategy.md)
* [ScheduleType](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/definition/ScheduleType.md)

## PUBLIC.INGESTION_DEFINITIONS

File: `ingestion/ingestion_definitions_view.sql`

This view available to `ADMIN` and `VIEWER` users returns the data from the `STATE.RESOURCE_INGESTION_DEFINITION` table. The returned data is simplified and contains only some of the columns:

* id
* resource_id
* name
* enabled

## STATE.INGESTION_PROCESS

File: `ingestion/ingestion_run.sql`

This table is used to persist the data about process. It is not available to any role apart from the connector itself.
It contains the following columns:

> | Column | Type |
> | --- | --- |
> | `id` | `STRING` |
> | `resource_ingestion_definition_id` | `STRING` |
> | `ingestion_configuration_id` | `STRING` |
> | `type` | `STRING` |
> | `status` | `STRING` |
> | `created_at` | `TIMESTAMP_NTZ` |
> | `finished_at` | `TIMESTAMP_NTZ` |
> | `updated_at` | `TIMESTAMP_NTZ` |

### Related Java objects

The following Java classes are related to this table:

* [IngestionProcessRepository](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/process/IngestionProcessRepository.md)
* [CrudIngestionProcessRepository](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/process/CrudIngestionProcessRepository.md)
* [IngestionProcess](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/ingestion/process/IngestionProcess.md)

## STATE.INGESTION_RUN

File: `ingestion/ingestion_run.sql`

A table used to store log data about past and current ingestion triggered by the scheduler. It is not available to any role apart from the connector itself.

It contains the following columns:

> | Column | Type |
> | --- | --- |
> | `id` | `STRING` |
> | `resource_ingestion_definition_id` | `STRING` |
> | `ingestion_configuration_id` | `STRING` |
> | `process_id` | `STRING` |
> | `started_at` | `TIMESTAMP_NTZ` |
> | `completed_at` | `TIMESTAMP_NTZ` |
> | `status` | `STRING` |
> | `ingested_rows` | `NUMBER` |
> | `metadata` | `VARIANT` |

### Related Java objects

The following Java classes are related to this table:

* [IngestionRunRepository](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/observability/IngestionRunRepository.md)
* [CrudIngestionRunRepository](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/observability/CrudIngestionRunRepository.md)
* [IngestionRun](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/observability/IngestionRun.md)
* [IngestionRun.IngestionStatus](/developer-guide/native-apps/connector-sdk/java/com/snowflake/connectors/application/observability/IngestionRun.IngestionStatus.md)

---
title: Responses and error handling
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/response_and_error_handling.md
section: Native Apps Framework
---

# Responses and error handling

The Snowflake Native SDK for Connectors uses certain standard responses, especially for procedures exposed and designed to be used from the UI.
Additionally it provides a way to ensure that exceptions are mapped to valid responses and logged into the `EVENT TABLE`.

## Responses

The SDK procedures, both high-level ones and internal ones, use `variant` of a certain structure to pass information.
The requirement for such a `variant` is that it has to contain a `response_code` field,
and in some cases the response code is different than `OK`, in the required `message` field.
Any additional field can be included, but it requires further custom handling. THe response format is:

```json
{
    "response_code": "<response code>",
    "message": "<message>"
}
```

It is recommended to use this format when replacing default implementations of the procedures and objects.

## Error handling

The Snowflake Native SDK for Connectors provides a useful default mechanism to handle exceptions that can occur during runtime.
The class responsible for this is called `ConnectorErrorHelper` and its default implementation is `DefaultConnectorErrorHelper`.
This feature provides 2 customizable callbacks. The first one, `ExceptionMapper`, is responsible for wrapping all unexpected
errors into the `ConnectorException` format. This feature is used mainly to ensure responses are compliant with the format mentioned above.

The second callback, called `ExceptionLogger`, ensures that the error is logged.
This is important because all standard log entries are then saved in the `EVENT TABLE`
by Snowflake, which helps when resolving problems with the applications.

### How to use the helper

The helper exposes 2 methods:

* `withExceptionWrapping(Supplier<ConnectorResponse> action)`
* `withExceptionLogging(Supplier<T> action)`

Those methods respectively use `mapper` and `logger` mentioned above. There is also a default
implementation of a helper method which mixes those approaches together:

```java
default ConnectorResponse withExceptionLoggingAndWrapping(Supplier<ConnectorResponse> action) {
    return withExceptionWrapping(() -> withExceptionLogging(action));
}
```

It is recommended to use this wrapping at the highest possible level when invoking a
method from a `handler`. For example in ConnectionConfigurationHandler it is used like this:

```java
public static Variant setConnectionConfiguration(Session session, Variant configuration) {
    var handler = ConnectionConfigurationHandler.builder(session).build();
    return handler.setConnectionConfiguration(configuration).toVariant();
}

public ConnectorResponse setConnectionConfiguration(Variant configuration) {
    return errorHelper.withExceptionLoggingAndWrapping(
        () -> setConnectionConfigurationBody(configuration)
    );
}
```

The SDK also exposes a builder to customize this behavior, called `ConnectorErrorHelperBuilder`.
This builder allows the developer to customize the behavior of the `mapper` and `logger` callbacks.
Once customized the new `helper` can be passed to the `handler` classes in their respective `builders`.
For example:

```java
class CustomUnknownExceptionMapper implements ExceptionMapper<Exception> {

    @Override
    public ConnectorException map(Exception exception) {
        return new CustomConnectorException(exception);
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the PUBLIC.SET_CONNECTION_CONFIGURATION procedure using SQL
    public static Variant configureConnection(Session session, Variant configuration) {
            //Using builder
        var errorHelper = new ConnectorErrorHelperBuilder()
            .withConnectorExceptionLogger(new CustomUnknownExceptionMapper())
            .build();

        var handler = ConnectionConfigurationHandler.builder(session)
            .withErrorHelper(errorHelper)
            .build();

        return handler.connectionConfiguration(configuration).toVariant();
    }
}
```

---
title: Resume connector
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/resume_connector.md
section: Native Apps Framework
---

# Resume connector

Resuming the connector is available after the wizard. It can be executed after the `Finalize Configuration` additionally with `Pause Connector`.
This step allows user to manipulate the status of the connector after it is launched. The entry point for this phase is a procedure
called `PUBLIC.RESUME_CONNECTOR()`. It can be customized by replacing it in SQL or by using `ResumeConnectorHandlerBuilder`.
The reverse process of resuming the connector, allowing user to suspend it, is [Pause connector](pause_connector.md).

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The resume connector step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Privileges validation
2. Status validation
3. State validation
4. Status update (STARTING)
5. Internal callback
6. Resuming of Task Reactor (if Task Reactor is enabled)
7. Status update (STARTED)

## Requirements

Resume connector requires at least the following sql files to be executed during native app installation:

* `core.sql`
* `configuration/app_config.sql`
* `lifecycle/resume.sql`
* Recommended: `lifecycle/pause.sql`
* Recommended: `configuration/finalize_configuration.sql`

## Privileges validation

To resume connector the `EXECUTE TASK` privilege must be granted to the application.

This validation cannot be overwritten by using `ResumeConnectorHandlerBuilder` nor by overwriting stored procedure.
However, it is possible to implement a custom handler.

## Status validation

To resume the connector the internal status of the connector needs to be `PAUSED`.

This validation cannot be overwritten by using `ResumeConnectorHandlerBuilder` nor by overwriting stored procedure.
However, it is possible to implement a custom handler.

## State validation

In case there are some additional custom validations that need to be satisfied there is a `PUBLIC.RESUME_CONNECTOR_VALIDATE()`
stored procedure, which can be customized by the user. By default, this procedure just returns `'response_code': 'OK'`.
The procedure can be customized by overwriting through the SQL or by using `ResumeConnectorHandlerBuilder` and providing a custom implementation of the
`ResumeConnectorStateValidator` interface.

## Internal callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.RESUME_CONNECTOR_INTERNAL()`, which returns `'response_code': 'OK'`.
This procedure allows the user to perform any additional duties needed when resuming connector. For example resuming additional connector specific tasks.
It can be overwritten through the SQL script or by using a `ResumeConnectorHandlerBuilder` to provide custom implementation of the `ResumeConnectorCallback` interface.

## Status update

When all the above phases are completed successfully the internal status of the Connector will be updated to:

```json
{
    "status": "STARTED",
    "configurationStatus": "FINALIZED"
}
```

For the whole diagram of state transitions, see [Connector flow](overview.md).

### Response

#### Successful response

When the procedure successfully resumes all tasks in the background and changes status tocSTARTED, then the `Connector successfully resumed.`
message will be returned directly from ResumeConnectorHandler method body. It is recommended to use the following format:

> ```json
> {
>   "response_code": "OK"
> }
> ```

#### Error response

In case of an error the response will follow the below format:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "error message"
> }
> ```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - The procedure was called on connector with state different than `[PAUSED, STARTING]`
* `CONNECTOR_STATUS_NOT_FOUND` - Connector status record does not exist in database (independent of user’s input at this stage - an internal error)
* `ROLLBACK_CODE` - An error occurred, but the changes were successfully reverted.
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive
* `UNKNOWN_ERROR_CODE` - An unknown error occurred and the connector is now in an unspecified state

---
title: Resume connector reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/resume_connector_reference.md
section: Native Apps Framework
---

# Resume connector reference

## Database objects and procedures

The following database objects are created through the file `lifecycle/resume.sql`.

### PUBLIC.RESUME_CONNECTOR()

The entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `ResumeConnectorHandler.resumeConnector`

### PUBLIC.RESUME_CONNECTOR_VALIDATE()

The procedure used for connector specific validation of pausing process. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultResumeConnectorStateValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.RESUME_CONNECTOR_INTERNAL()

The procedure used for connector specific additional pausing duties. By default, it returns `'response_code': 'OK'`.
It is invoked by the `InternalResumeConnectorCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Resume connector is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))
* `configuration/finalize_configuration.sql` (See [Finalize configuration reference](finalize_configuration_reference.md))
* `lifecycle/pause.sql` (See [Pause connector reference](pause_connector_reference.md))

### Related Java objects

The following Java objects from the `com.snowflake.connectors.application.lifecycle` package and some common components are tightly connected with the above procedures:

* `ResumeConnectorHandler`
* `ResumeConnectorStateValidator`
* `ResumeConnectorCallback`
* `ConnectorStatusService`
* `LifecycleService`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of the `ResumeConnectorHandler` the `PUBLIC.RESUME_CONNECTOR` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.RESUME_CONNECTOR()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.custom.handler.CustomResumeConnectorHandler.resumeConnector';

GRANT USAGE ON PROCEDURE PUBLIC.RESUME_CONNECTOR() TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

The internal `VALIDATE` and `INTERNAL` procedures can be also customized through the SQL. They can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.RESUME_CONNECTOR_INTERNAL()
RETURNS VARIANT
LANGUAGE SQL
EXECUTE AS OWNER
AS
BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
END;

CREATE OR REPLACE PROCEDURE PUBLIC.RESUME_CONNECTOR_VALIDATE()
RETURNS VARIANT
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:1.11.0')
IMPORTS = ('/connectors-native-sdk.jar')
HANDLER = 'com.custom.handler.CustomResumeConnectorInternalHandler.resumeConnector';
```

### Builder approach

`ResumeConnectorHandler` can be customized using `ResumeConnectorHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `ResumeConnectorStateValidator`
* `ResumeConnectorCallback`
* `ConnectorErrorHelper`

In case one is not provided, the default implementation provided by the SDK will be used.

```java
class CustomResumeConnectorStateValidator implements ResumeConnectorStateValidator {
    @Override
    public ConnectorResponse validate() {
        // CUSTOM LOGIC
        return ConnectorResponse.success();
    }
}

class CustomHandler {

    // Path to this method needs to be specified in the PUBLIC.RESUME_CONNECTOR procedure using SQL
    public static Variant resumeConnector(Session session) {
            //Using builder
        var handler = ResumeConnectorHandlerBuilder.builder(session)
            .withStateValidator(new CustomResumeConnectorStateValidator())
            .build();
        return handler.resumeConnector().toVariant();
    }
}
```

---
title: Run the automated security scan
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-run-scan.md
section: Native Apps Framework
---

# Run the automated security scan

This topic describes how to initiate the automated security scan and view the current status.

## Security scan workflow

The following diagram shows how the security scan fits within the workflow for developing and
publishing a Snowflake Native App:

This workflow includes the following steps:

1. Create an application package.
2. Update the application code and related files.

   Before running the automated security scan, ensure that the app conforms to the security
   requirements and best practices outline in [Security requirements and best practices for a Snowflake Native App](security-app-requirements.md). If the app is
   a Snowflake Native App with Snowpark Container Services, review the additional security requirements outlined in [Secure a Snowflake Native App with Snowpark Container Services](security-na-spcs.md).
3. Add a version or patch to the application package.
4. Run the automated security scan. The automated security scan starts when the provider does one of the following:

   * Adds a new version or patch to the application package when the DISTRIBUTION property is set to
     `EXTERNAL`. The new version is scanned automatically.
   * Sets the DISTRIBUTION property to “EXTERNAL” on an application package that already has a version
     defined. The ten most recent versions of the application package are scanned automatically. All
     patches for these version are also scanned.
5. Await the results of the scan.

   If the scan is approved, the provider can continue with the process of publishing the app.

   If the scan is rejected, the provider must update the application code based on the results of the scan.
   Alternatively, the provider can appeal the rejection.
6. Create or modify the release directive for the app.
7. Create a listing for the app.
8. Submit the listing to Snowflake for approval.

   If the listing is approved, the provider can publish the listing on the Snowflake Marketplace.

   If the listing is rejected, the provider must update the listing and resubmit for approval.
9. Publish the listing.

## Set the DISTRIBUTION property on an application package

The DISTRIBUTION property of an application package indicates the type of listing a provider can
create when using the application package as the data product of a listing. This property has the
following values:

* `INTERNAL` indicates that a provider can only create a private listing within the same organization
  where the application package was created. The automated security scan is not performed when
  the DISTRIBUTION property is set to `INTERNAL`.
* `EXTERNAL` indicates that a provider can create listings outside the same organization where the
  application package was created. This includes the following:

  + Private listings outside the provider’s organization.
  + Public listings.
  + Marketplace listings.

A provider can set the DISTRIBUTION property when creating the application package or afterwards.

To set the DISTRIBUTION property when creating an application package, run the
[CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md) as shown in the following example:

```sqlexample
CREATE APPLICATION PACKAGE hello_snowflake_package
  DISTRIBUTION = EXTERNAL;
```

When a provider sets the DISTRIBUTION property when creating the application package, any versions or
patches added to the application package later are scanned immediately.

To set the DISTRIBUTION property for an existing application package run the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) as shown in the following example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  SET DISTRIBUTION = EXTERNAL;
```

When a provider sets the DISTRIBUTION property for an existing application package, the automated
security scan is automatically run on the ten most recent versions of the app. All patches for these
versions are also scanned.

## View the status of the security scan

After initiating the security scan for a version or patch, providers can view that status
in the application package. The possible statuses are:

* `NOT_REVIEWED` indicates that the automated security scan has not been performed on this application
  package.
* `IN_PROGRESS` indicates that the automated security scan is currently in progress.
* `APPROVED` indicates that the automated security scan completed and the application package
  has been approved. The provider can set the release directive for the application package.
* `REJECTED` indicates that the automated security scan completed, but the application package
  was not approved.

> **Note:**
>
> When an automated security scan fails, the Snowflake manually reviews the application package.
> After the manual review is complete, the status is updated to `APPROVED` or `REJECTED`.

### View the status of the security scan using SQL

To view the status of the security scan, run the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md)
command as shown in the following example:

```sqlexample
SHOW VERSIONS IN APPLICATION PACKAGE hello_snowflake_package;
```

The `review_status` column displays the status of the automated review scan.

### View the status of the security scan using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App packages.
3. Select the application package whose status you want to view.

   The Security Scan Status column shows the current status of the review of
   each version and patch associated with the application package.
4. If the status is Rejected, select the app package to see the reason for
   the rejection.

## Appealing a rejection

If critical vulnerabilities or policy violations are found after Snowflake performs a manual review,
the application package is rejected and the reason for the rejection can be
reviewed in the application package.

A provider can appeal the rejection by opening a severity 4 support ticket. When appealing a CVE-based
rejection, providers must submit detailed documentation explaining the following:

* Why the CVE is not exploitable in the application
* Reachability analysis report, if available
* A plan for updating to the fixed version
* If there are no plans for an update, a detailed explanation of why a vulnerable version cannot be updated

The Snowflake Security team reviews and issues decisions for all appeals.

For additional information on the appeal process, see [Appeal a failed security review](security-appeal.md).

## Ongoing security monitoring and remediation

After an app is approved and published on the Snowflake Marketplace, it undergoes continuous security
monitoring to ensure ongoing safety and compliance. This includes:

* Periodic image security analysis to detect new vulnerabilities or policy violations.
* If issues are discovered, the provider is notified and given 30 business days to patch the app or
  can request an exception within 15 days.

---
title: Secure a Snowflake Native App with Snowpark Container Services
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-na-spcs.md
section: Native Apps Framework
---

# Secure a Snowflake Native App with Snowpark Container Services

This topic describes the security considerations for a Snowflake Native App with Snowpark Container Services. In addition to the general
security requirements for all apps, apps with containers have specific security implications
and considerations. The security review process for apps with containers includes a thorough
examination of the container images they contain.

Snowflake uses container image scanning tools to detect known vulnerabilities and
security best practice violations.

## Network isolation and egress control

Apps with containers use strict network isolation and egress control measures to help prevent unauthorized
data exfiltration and to protect consumer data. Each app with containers runs in its own isolated network
environment, with controlled access to external systems and services.

Snowflake uses network monitoring and filtering mechanisms to detect and block suspicious egress traffic
patterns. App providers are required to explicitly declare all external end points in the application
manifest, which undergoes a security review.

Consumer data is protected using the following:

* Secure data access patterns.
* Encryption in transit and at rest.
* Fine-grained access controls.

The Snowflake Native App Framework ensures that app with containers can only access the specific data and resources to which
an app has been granted access. This minimizes the risk of data exfiltration.

## Additional approval requirements for apps with containers

Snowflake implements an additional approval process for an app with containers. The approval is
mandatory before an app with containers can be published to the Snowflake Marketplace. Before a provider
can create a public or private listing for an app with containers, they must be approved by the
Snowflake Product Security team.

Providers who successfully pass this approval process are authorized to publish a public listing
for an app with containers. This allows the app to be discoverable and accessible to Snowflake customers.

If a provider does not pass the approval process, they may not publish a listing for an
app with containers.

### Initiate the provider approval process

When a provider sets the DISTRIBUTION=EXTERNAL property for an application package of an app with
containers, Snowflake returns the following error if the provider has not been approved to publish an app with
containers:

```output
Error Code: 093197 Account is not allowed to create application package versions or patches with
Snowpark Container Services for EXTERNAL distribution
```

If you receive this error, you must submit a
[security questionnaire](https://docs.google.com/forms/d/1XLjbcSrp689kXEvVELa6KbEUOPfsJIirSTG5pGQDMZE/edit?ts=65fb4866)
to begin the approval process.

The security questionnaire assesses the following:

* The provider’s security practices.
* The provider’s compliance readiness.

Submitting the security questionnaire begins the provider approval process.

### Evaluation of the security questionnaire

After a provider submits the security questionnaire, Snowflake’s Security and Compliance
team evaluates each response and the documentation included by the provider. Responses are
evaluated to ensure alignment with industry best practices and standards.

In some cases, providers may be asked to provide additional information or undergo a more in-depth
review to clarify any potential concerns or risks.

After reviewing the questionnaire, Snowflake makes a decision to allow the provider to publish an
app with containers. If a provider is not approved, they must wait until Snowflake Native App with Snowpark Container Services is generally
available.

The provider receives an email from Snowflake indicating if they are approved or wait listed
until general availability.

### Scanning an app with containers

After a provider is approved, the app with containers undergoes the automated security scan. This
scan includes a normal app security scan and a scan of the container images included in the app.

The guidelines for how long the security scan takes to complete are:

| App size | Approximate time to complete scan |
| --- | --- |
| Five images or fewer / smaller than 40 GB | Less than 8 hours |
| Ten images or fewer / smaller than 70 GB | Less than 24 hours |
| Ten images or more / larger than 70 GB | 2 business days or more |

> **Caution:**
>
> The time frames provided are for information only. They do not constitute formal
> service-level agreements (SLAs).

---
title: Security requirements and best practices for a Snowflake Native App
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-app-requirements.md
section: Native Apps Framework
---

# Security requirements and best practices for a Snowflake Native App

This topic describes the security requirements and best practices that providers must
follow when developing a Snowflake Native App. All apps that meet the conditions described in
[Automated security reviews](security-overview.md) must conform to the security requirements
outlined in the following sections:

* Security requirements for application code
* Security requirements for app functionality
* Security requirements for app permissions

> **Note:**
>
> Security requirements are subject to change as Snowflake continues to monitor new potential risks.

## Security requirements for application code

App code included within an application package must conform to the following security requirements:

1. Your app must not load or execute any code from outside the application package except
   Snowflake-provided libraries. All the app code, including all library dependencies and setup code,
   must be included in the app version defined in the application package.
2. All app code must be un-obfuscated, meaning that the code must be human readable. This requirement
   includes minified JavaScript code.

   > **Note:**
   >
   > If an app needs to use minified JavaScript code, it must include a corresponding source map file
   > that can be used to recover the un-minified code.
3. All dependencies or libraries with critical or high common vulnerabilities and exposures (CVE) must be
   updated to a secure version, if available.

## Security requirements for app functionality

The following security requirements apply to the functionality of your app:

1. All apps must provide the following information to customers as part of a listing:

   1. All app functionality and features.
   2. All Internet endpoints and URLs that the app connects to.
   3. All external functions in the app.
   4. Any consumer data logged, collected, or stored by the app.

      1. Apps should prohibit all non-essential cookies.
      2. Apps should communicate all essential cookies to consumers
2. Apps should function as advertised in the app listing.
3. All app installation and setup instructions must be included in the app listing.
4. Apps must not store or require any plain text customer secrets.
5. Any communication between the app and the Internet should be over an HTTPS connection with a valid
   TLS certificate.
6. Apps must not have any functionality that could result in harm to Snowflake, its customers, or third
   parties. Harm includes but is not limited to:

   1. Data leakage and/or loss;
   2. Restricting consumer access to their data unless explicitly designed as part of the app
      functionality, for example, data masking for data access policies.
   3. Excessive resource consumption.
   4. Arbitrary code injection/execution.
7. All connections to an app, including web-based user interfaces and APIs, must first authenticate using a
   Snowflake-provided method of authentication. Any app-specific authentication must be presented to users
   after Snowflake authentication has succeeded.
8. Apps should not create any public endpoints that allow connections to the app without a successful
   authentication through Snowflake first.

## Security requirements for app permissions

The following security requirements apply to the privileges set by your app:

1. All apps must provide the following information in the manifest file:

   1. All privileges required by the app on all objects.
   2. All API integrations.
2. Apps should only ask for the minimum set of privileges needed for the app to function.

## Recommended security best practices

In addition to the security requirements imposed by the automated security scan, Snowflake recommends the
following best practices when developing a Snowflake Native App. Following these best practices helps reduce
the likelihood of an app being blocked during security review.

* Follow secure Software Development Life Cycle (SDLC) practices.

  + Review app code for vulnerabilities during the development lifecycle and fix them before creating
    an app version.
  + Review third-party libraries for vulnerabilities and update them to the latest secure version.
  + Review and update all third-party libraries in the app at least once a quarter.
* Follow Snowflake security best practices as described in the following:

  + [Security Practices for UDFs and Procedures](../udf-stored-procedure-security-practices.md)
  + [Securing an external function](../../sql-reference/external-functions-security.md)

## Recommended security best practices for an app with containers

In addition to the security best practices for a core Snowflake Native App outlined in
Recommended security best practices, the following
security best practices apply to an app with containers:

* Limit the use of external dependencies and libraries to minimize the attack surface of an app and
  reduce the risk of supply chain vulnerabilities.
* Follow container image hardening requirements, such as the use of minimal base images, removal of
  unnecessary packages, and secure configuration of runtime environments.
* Use secure communication protocols and encryption for all inter-container and external communication.
* Generate comprehensive logging and auditing of container activities and data access patterns.
* Update and patch container images regularly to address known vulnerabilities and security issues.
* Implement only required privileges to minimizing the attack surface of containerized apps.
* Managing secrets and sensitive data securely, using appropriate encryption and access controls.
* Conduct thorough security testing and vulnerability assessments before submitting apps for review.
* Respond promptly to security incidents and collaborate with Snowflake during incident response.
* Provide clear and accurate documentation of app functionality, dependencies, and security controls.
* Educate and guide consumers on the secure use and configuration of their apps.

## Best practices for developing and publishing an application package

To streamline the development and publishing process for a Snowflake Native App, Snowflake recommends creating
two separate application packages:

* Development application package

  The development application package is intended for rapid iteration and testing purposes. It should
  have its DISTRIBUTION property set to `INTERNAL`. This ensures that the application package remains
  internal and is not distributed to external consumers or to Snowflake scanning and approval.

  By keeping this package separate from the production package, developers can quickly make changes and
  test new features without triggering the security review process for each iteration.
* Production application package

  The production application package is intended for publishing an application package and distributing it
  to Snowflake for scanning and approval and to external consumers. The production application package should have
  its DISTRIBUTION property set to `EXTERNAL`.

  Only versions that have passed the provider’s security review should be added to this package, ensuring
  that the app meets the required security standards before being made available to consumers.

By following the best practice of having separate development and production packages, developers can maintain an efficient
development lifecycle while ensuring that only secure and approved versions of the app are published and
distributed to external consumers.

---
title: Security requirements and guidelines for a Snowflake Native App
source: https://docs.snowflake.com/en/developer-guide/native-apps/security-overview.md
section: Native Apps Framework
---

# Security requirements and guidelines for a Snowflake Native App

This topic provides an overview of the security requirements and guidelines
when developing a Snowflake Native App. It also provides general information about automated
security scan and review process when publishing an app to consumers.

> **Caution:**
>
> It is your responsibility to ensure that no personal data, sensitive data,
> export-controlled data, or other regulated data is entered into any files included
> in your application package.

## Overview of Snowflake Native App security requirements

The Snowflake Native App Framework provides security requirements and best practices that providers must follow when
developing a Snowflake Native App. For security requirements and best practices for an app, see
[Security requirements and best practices for a Snowflake Native App](security-app-requirements.md). For security requirements for an app with containers, see
[Secure a Snowflake Native App with Snowpark Container Services](security-na-spcs.md).

To publish an app to consumers, either as a private listing or on Snowflake Marketplace,
Snowflake implements a security review process that requires a security scan of the components
of an app. If an app does not pass the automated security review, a manual review occurs.

All apps that are published to consumers must pass this security review.

## Potential security risks

The following are some of the possible security risks that can occur when running an app:

* Data exfiltration:

  Malicious apps could copy consumer data to external functions or logs.
* Compute abuse:

  Apps could perform unauthorized tasks, such as cryptomining, at the consumer’s expense.
* Ransomware

  Apps could encrypt or corrupt consumer data, demanding payment for restoration.
* Privilege escalation:

  Apps could attempt to gain unauthorized permissions within the consumer’s account.

To mitigate these and other possible security risks, the Snowflake Native App Framework uses a security review to evaluate
an app for security risks and to ensure security best practices.

## Automated security reviews

To mitigate potential security risks, Snowflake uses the Native App Anti-Abuse Pipeline Service (NAAAPS).
This service automatically scans all new app versions using various tools to determine if an app can
be distributed to consumers.

This automated security review occurs when a new version or patch of an app is created. This review
performs the following:

* Copies the app to a dedicated Snowflake account used to scan apps.
* Scans the files associated with the app and updates the security review status.
* Auto-approves the app or initiates a manual review of the app.

During the manual review process, an app can be approved or rejected. Snowflake does not send a notification if an
app is rejected. Providers can [view the status of the review](security-run-scan.md) in
Snowsight.

## Scanners and tools used during a security review

The automated security review uses the following scanners and tools to perform the
following to analyze different components of an app:

* Scan code for bugs, anti-patterns, and security vulnerabilities in code.
* Scan code for malware.
* Identify vulnerabilities in app dependencies.

The processes help detect various security issues, such as data exfiltration, ransomware, compute
abuse, privilege escalation, and dynamic code execution.

## Security requirements and best practices for an app

All apps must conform to the security requirements outlined in the [Security requirements and best practices for a Snowflake Native App](security-app-requirements.md).

> **Note:**
>
> Security requirements are subject to change as Snowflake continues to monitor new potential risks.

## Security considerations for a Snowflake Native App with Snowpark Container Services

For information about additional security requirements for a Snowflake Native App with Snowpark Container Services see
[Secure a Snowflake Native App with Snowpark Container Services](security-na-spcs.md).

## Guidelines for publishing an app to Snowflake Marketplace

When publishing an app to Snowflake Marketplace, providers must consider additional requirements
and best practices. See [Guidelines and requirements for listing Apps on Snowflake Marketplace](../../collaboration/guidelines-reqs-for-listing-apps.md).

## CVE evaluation criteria for an app

Snowflake’s approach to addressing Common Vulnerabilities and Exposures (CVEs) in a Snowflake Native App
is based on our CVE Evaluation Criteria, a policy that establishes clear and objective criteria
for evaluating and prioritizing CVEs based on their risk profile.

The policy aims to balance the mitigation of critical security risks with the effort required to
address less severe vulnerabilities. It applies to all apps undergoing security review
and is enforced to ensure only apps meeting the defined criteria are approved for publishing
in Snowflake’s data cloud environment.

See [Common Vulnerabilities and Exposures (CVE) considerations](security-cve.md) for additional information.

## Scanning Regions

When configuring a Snowflake Native App to be shared externally, providers automatically share the code in app
with Snowflake for scanning. The following table maps the NAAAPS scanning regions to the corresponding
provider regions:

| Cloud provider | Provider region | Scanning region |
| --- | --- | --- |
| AWS | US West (Oregon) | US West (Oregon) |
| AWS | US East (Ohio) | US East (Ohio) |
| AWS | US East (N. Virginia) | US East (N. Virginia) |
| AWS | Canada (Central) | Canada (Central) |
| AWS | South America (São Paulo) | South America (São Paulo) |
| AWS | EU (Ireland) | EU (Ireland) |
| AWS | Europe (London) | Europe (London) |
| AWS | EU (Paris) | EU (Paris) |
| AWS | EU (Frankfurt) | EU (Frankfurt) |
| AWS | EU (Zurich) | EU (Zurich) |
| AWS | EU (Stockholm) | EU (Stockholm) |
| AWS | Asia Pacific (Tokyo) | Asia Pacific (Tokyo) |
| AWS | Asia Pacific (Osaka) | Asia Pacific (Osaka) |
| AWS | Asia Pacific (Seoul) | Asia Pacific (Seoul) |
| AWS | Asia Pacific (Mumbai) | Asia Pacific (Mumbai) |
| AWS | Asia Pacific (Singapore) | Asia Pacific (Singapore) |
| AWS | Asia Pacific (Sydney) | Asia Pacific (Sydney) |
| AWS | Asia Pacific (Jakarta) | Asia Pacific (Jakarta) |
| Azure | * West US 2 (Washington) * Central US (Iowa) * South Central US (Texas) * East US 2 (Virginia) * Canada Central (Toronto) | Azure East US 2 (Virginia) |
| Azure | * UK South (London) * North Europe (Ireland) * West Europe (Netherlands) * Switzerland North (Zurich) * UAE North (Dubai) | Azure West Europe (Netherlands) |
| Azure | * Central India (Pune) * Japan East (Tokyo) * Southeast Asia (Singapore) * Australia East (New South Wales) | Azure Australia East (New South Wales) |
| GCP | * US Central1 (Iowa) * US East4 (N. Virginia) * Europe West2 (London) * Europe West4 (Netherlands) | AWS US West (Oregon) |

---
title: Set the release directive for an app (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app-release-directive.md
section: Native Apps Framework
---

# Set the release directive for an app (Legacy)

This topic describes how to set the release directive for an application package.

## About release directives

Release directives determine the version of the app that is available to a consumer when they
install or upgrade an app. Release directives are defined in the application package using the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command.

There are two types of release directives:

Custom release directive
:   Allows a provider to specify the version of an application that specific
    Snowflake accounts can install. See
    add a custom release directive for more information.

Default release directive
:   Specifies the version and patch that is applicable to all consumers when
    installing a Snowflake Native App. If a provider creates versions V1 and V2 of an application, setting the
    default release directive to V2 ensures that when a consumer installs the Snowflake Native App, they install.

    See set a default release directive for
    more information.

If a provider creates version V2 and V3 of an application, they can assign V2 to be the default release and
create a custom release directive to share V3 only with specific accounts. A provider may also
share version V3 of the application with a test account before publishing that version.

> **Note:**
>
> If you specify both a default and custom release directive, the custom release directive always
> takes precedence. In the example above, consumer accounts specified in the custom release directive
> would only be able to install V3 of the application.

You must define a release directive in an application package before you can perform the following tasks:

* Create a public listing with the application package as the data content.
* Install a Snowflake Native App in a consumer account.

## Privileges required to set the release directive

To set a release directive, a provider must have the MANAGE RELEASES privilege or ownership of the application
package.

```sqlexample
GRANT MANAGE RELEASES ON APPLICATION PACKAGE hello_snowflake_package
  TO ROLE release_mgr;
```

## Set the default release directive

Use the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with SET DEFAULT RELEASE
DIRECTIVE to set the default release directive as shown in the following
example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = v1_0
  PATCH = 2;
```

To update the default release directive for an application package, run the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with SET DEFAULT RELEASE DIRECTIVE again, specifying
new values for VERSION or PATCH, as appropriate.

## Set and update a custom release directive

### Set a custom release directive

To add a custom release directive, use the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with
SET RELEASE DIRECTIVE. Use the ACCOUNTS clause to specify the accounts to which this
release directive applies. For example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  SET RELEASE DIRECTIVE hello_snowflake_package_custom
  ACCOUNTS = (CONSUMER_ORG.CONSUMER_ACCOUNT)
  VERSION = v1_0
  PATCH = 0;
```

### Update a custom release directive

To update the version or patch for a custom release directive, use the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with MODIFY RELEASE DIRECTIVE as shown
in the following example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  MODIFY RELEASE DIRECTIVE hello_snowflake_package_custom
  VERSION = v1_0
  PATCH = 0;
```

However, you cannot modify the accounts associated with the release directive. To change the
organization and account associated with a release directive do the following:

1. Remove the release directive from the application package by running the
   [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with UNSET RELEASE DIRECTIVE.
2. Add the release directive back to the application package by running the
   [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with SET RELEASE DIRECTIVE and
   using the ACCOUNTS clause to specify the list of accounts.

> **Note:**
>
> When you change the organization and account associated with the release directive, add
> the new release directive immediately after you remove the old one. If you don’t, the installed
> apps for the accounts assigned to the custom release directive revert to default release directive.

### Remove a custom release directive

To remove a custom release directive from an application package, use the
[ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command with UNSET RELEASE DIRECTIVE as shown in the following
example:

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  UNSET RELEASE DIRECTIVE hello_snowflake_package_custom;
```

## Test an app based on a release directive

When installing an app from an application package in development mode, the version and
patch are explicitly specified. However, when the application is installed using the following
command:

```sqlexample
CREATE APPLICATION hello_snowflake
  FROM APPLICATION PACKAGE hello_snowflake_package
```

The release directive determines the version that is installed when running this command.

## View the release directives for an application package

To view the release directives by using SQL, run the
[SHOW RELEASE DIRECTIVES](../../sql-reference/sql/show-release-directives.md) command as shown in the following example:

```sqlexample
SHOW RELEASE DIRECTIVES IN APPLICATION PACKAGE hello_snowflake_package;
```

---
title: Set up and manage an event table in the provider account
source: https://docs.snowflake.com/en/developer-guide/native-apps/event-manage-provider.md
section: Native Apps Framework
---

# Set up and manage an event table in the provider account

This topic describes how providers can set up an event table and manage event sharing for an app.

## Set up an event table in the provider organization in every region

To collect the log messages and trace events that a consumer shares, a provider must set up an event
table by performing the following:

1. Set an account as the event account.
2. Create an event table in the event account.
3. Set the event table as the active event table in event account.

> **Important:**
>
> If a provider does not have an event account and active event table withing the region where the app is installed
> before the consumer installs an app, trace events and log messages are discarded.

### Set an account as the events account

To store shared logs and events, a provider must select an account to hold an event table. This can be any
account that a provider can access. However, if an organization has multiple providers publishing
application packages, consider using a Snowflake account that is dedicated to storing shared events from the
consumer.

The following restrictions apply to accounts used to store shared events:

* You must use an [organization administrator role](../../user-guide/organization-administrators.md) to set an account as the account used
  to store events.
* The account must have an active event table.
* The specified account cannot be any of the following:

  + A locked or suspended account.
  + A reader account.
  + A trial account.
  + A Snowflake managed account.

> **Note:**
>
> A provider can collect logs and shared events only in the same region where a consumer installs an app.
> Providers must set up an event account to store shared events in every region where consumers configure event
> sharing for an app.

To set an account to be the events account for a region, call the [SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION](../../sql-reference/functions/system_set_event_sharing_account_for_region.md)
system function as shown in the following example:

```sqlsyntax
SELECT SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION('<snowflake_region>', '<region_group>', '<account_name>')
```

Where:

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2, AWS_US_EAST_1`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`. Refer to
    [Region groups](../../user-guide/admin-account-identifier.md) for details.

`account_name`
:   Specifies the account name. If another account is already set as the events account in the
    specified region, running this command changes the events account to be the account
    specified here.

### Create an event table in the event account

To create an event table, run the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command as shown in the
following example:

```sqlexample
CREATE EVENT TABLE event_db.event_schema.my_event_table;
```

This command specifies the database and schema that contain the event table.

## Set the event table as the active event table

An account can have multiple event tables, but only one can be set as the active event table in a
Snowflake account at a time. Without an active event table, log messages and trace events that the consumer
shares are discarded.

After creating the event table, use [ALTER ACCOUNT … SET EVENT_TABLE](../../sql-reference/sql/alter-account.md)
to specify that the event table is the active table for the account:

```sqlexample
ALTER ACCOUNT SET EVENT_TABLE=event_db.event_schema.my_event_table;
```

## Unset an account as the events account

To unset an account to be the events account for a region, call the
[SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION](../../sql-reference/functions/system_unset_event_sharing_account_for_region.md) system function:

```sqlsyntax
SELECT SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION('<snowflake_region>', '<region_group>', '<account_name>')
```

Where:

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`.

`account_name`
:   Specifies the account name.

## View the event accounts in an organization

To show events accounts in a provider’s organization, call the
[SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS](../../sql-reference/functions/system_show_event_sharing_accounts.md) system function:

```sqlexample
SELECT SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS()
```

> **Note:**
>
> You must use an [organization administrator role](../../user-guide/organization-administrators.md) to call this function.

This system function returns a string in JSON format containing a list of event accounts within the organization.
Because the metadata takes some time to propagate to all regions, this function might have a short delay before
showing the most current events account after the user sets or unsets the event account for the organization.

## View the logging and trace event levels defined in an application package

Use the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md) command to view the logging level of the app versions
defined in an application package, as shown in the following example:

```sqlexample
SHOW VERSIONS
  IN APPLICATION PACKAGE HelloSnowflake;
```

## View the logs and events in the event table

To view the logs and events stored in the event table, use the [SELECT](../../sql-reference/sql/select.md) command as shown
in the following example:

```sqlexample
SELECT * FROM EVENT_DB.EVENT_SCHEMA.MY_EVENT_TABLE
```

For more information on querying the event table, see the following:

* [Viewing log messages](../logging-tracing/logging-accessing-messages.md)
* [Viewing trace data](../logging-tracing/tracing-accessing-events.md)

See [Event table columns](../logging-tracing/event-table-columns.md) for information on the columns
in the event table.

## Shared event information available to the provider

The following sections describe the information that the Native Apps Framework shares with providers.

### App event context shared with the provider

To help providers easily identify the source of the shared events, the following fields are populated into the
`RESOURCE_ATTRIBUTES` column of the event table when they are shared with the provider:

* `snow.application.package.name`
* `snow.application.consumer.organization`
* `snow.application.consumer.name`
* `snow.listing.name`
* `snow.listing.global_name`

### Fields that are not shared with the provider

To protect consumer information, the following fields from the `RESOURCE_ATTRIBUTES` column are
not shared with provider:

* `snow.database.id`
* `snow.database.name`
* `snow.schema.id`
* `snow.executable.id`
* `snow.owner.name`
* `snow.owner.id`
* `snow.warehouse.name`
* `snow.warehouse.id`
* `snow.query.id`
* `snow.session.id`
* `snow.session.role.primary.name`
* `snow.session.role.primary.id`
* `snow.user.name`
* `snow.user.id`
* `db.user`

Instead of directly sharing the `snow.database.name` and `snow.query.id` fields with the provider, Snowflake
shares the hash values (SHA-1) of these two fields as the following fields:

* `snow.database.hash`
* `snow.query.hash`

Snowflake provides the [SHA-1 function](../../sql-reference/functions/sha1.md) used to mask these attributes.
Consumers can calculate the hash values for the database name and query id, and use them as reference values when
contacting the provider.

---
title: Set up event tracing for an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-enable-logging.md
section: Native Apps Framework
---

# Set up event tracing for an app

This topic describes how to set up use event tracing to capture the log messages and trace events
emitted by an app. It also describes how to enable event sharing to share log messages and trace events
with providers.

## About event tracing in the Snowflake Native App Framework

Event tracing allows an app to emit information related to its performance and behavior. The Snowflake Native App Framework
supports using the Snowflake
[logging and tracing](../logging-tracing/logging-tracing-overview.md).
functionality to gather this information. An app can emit the following:

* Log messages that are independent, detailed messages with information about the state of a specific
  feature within the app.
* Trace events with structured data you can use to get information spanning and grouping multiple
  parts of an app.
* Metrics data that includes the CPU and memory metrics that Snowflake generates.

### View the log messages, trace events, and metrics for an app

To view the log messages and trace events emitted by the app, consumers must set up an event table in
their account to collect this information. See
Set up an event table for more information.

## About event sharing

Consumers can also enable event sharing to share event data with providers. When a provider enables
event sharing, the log messages and trace events that are inserted into the event table in
the consumer account are also inserted into an event table in provider account.

Event sharing allows the provider to collect information about the app’s performance and behavior. See
About event sharing for an app for more information.

### About event definitions

Event definitions specify how an app shares log messages and trace events with the provider.
Event definitions act as filters on the log message and trace event levels set by the provider.
A provider specifies the event definitions for an app when a new version or patch is published.

> **Note:**
>
> Event definitions are not required. If a provider does not specify event definitions for an app
> consumers can enable or disable event sharing as required.

Providers can set an event definition to be required or optional:

* Required event definitions are enabled automatically when the app is installed. To collect the
  event definitions emitted by an app, a consumer should create an event table and set it as
  the active event table for their account.
* Optional event definitions can be enabled or disable by the consumer as necessary. Optional event
  definitions require an active event table, but they are not required to install or use the app.

> **Caution:**
>
> Event definitions are not the same as the log and tracing levels set by the provider. The log and
> tracing levels determine the information that is inserted into the consumer event table.
>
> Event definitions are filters that act on the log messages and trace events. They determine what
> information is inserted in the provider event table when event sharing is enabled.

### Supported event definitions

The following table lists the event definitions that are currently supported:

> | Type | Name | Description | Filter |
> | --- | --- | --- | --- |
> | All | SNOWFLAKE$ALL | Shares all log messages and trace events that the app emits. | `*` |
> | Events | SNOWFLAKE$ALL_EVENTS | Shares all events from the application. | `RECORD_TYPE='EVENT'` |
> | Errors and warnings | SNOWFLAKE$ERRORS_AND_WARNINGS | Shares logs related to errors, warnings, and fatal events. | `RECORD_TYPE = ‘LOG’ AND RECORD:severity_text in (‘FATAL’, ‘ERROR’, ‘WARN’)` |
> | Metrics | SNOWFLAKE$METRICS | Shares the CPU and memory metrics that Snowflake generates. | `RECORD_TYPE = in ('METRIC')` |
> | Traces | SNOWFLAKE$TRACES | Shares detailed traces of user activities and journeys in the application. | `RECORD_TYPE in (‘SPAN’, ‘SPAN_EVENT’)` |
> | Usage logs | SNOWFLAKE$USAGE_LOGS | Shares high-level logs related to user actions and app events. | `RECORD_TYPE = LOG AND RECORD:severity_text = ‘INFO’` |
> | Debug logs | SNOWFLAKE$DEBUG_LOGS | Shares technical logs used to troubleshoot the app. | `RECORD_TYPE = ‘LOG’ AND RECORD:severity_text in (‘DEBUG’, ‘TRACE’)` |

> **Note:**
>
> If a provider does not configure the app to use event definitions, Snowsight displays only the
> All type.

### Considerations for consumers when using event definitions

Consumers can continue to use the existing SHARE_EVENTS_WITH_PROVIDER property, however there
are limitations:

* If an app only uses the OPTIONAL ALL event definition, setting the SHARE_EVENTS_WITH_PROVIDER property
  to `true` enables event sharing and setting it to `false` disables event sharing.

  This is applicable when a provider explicitly adds the OPTIONAL ALL event definition to the manifest
  file or an app was migrated from the existing event sharing functionality.
* If a provider adds mandatory and optional event definitions to the manifest file, setting the
  SHARE_EVENTS_WITH_PROVIDER property to `true` enables all event definitions. In contrast, the
  SHARE_EVENTS_WITH_PROVIDER property can only be set to `false` if the provider adds only
  optional event definitions.

  SHARE_EVENTS_WITH_PROVIDER is TRUE only when all event definitions are enabled, otherwise it is FALSE.

## Workflow to set up event tracing for an app

The following workflow describes how to set up event tracing for an app:

1. Review the considerations for using logging and event tracing.
2. Set up an event table.
3. View the logging and trace event levels configured for the app.
4. View the events in the event table.
5. Enable event sharing on an app.

## Considerations when using event tracing

Before setting up event tracing for an app, you must consider the following:

* This feature requires you to set up an event table in
  your account.
* After you enable event sharing, a masked and redacted
  copy of the trace events and logs messages is automatically inserted in the event table of the designated
  provider account.
* Snowflake does not charge you to enable event sharing. However, you are responsible for the
  cost of ingesting trace events and log message in the event table as well as storage
  costs for the event table.
* After enabling event sharing with a provider, you cannot revoke access to shared
  trace events and log messages.
* You cannot share historical events using event sharing.
* Snowflake sends the shared events to a designated provider account within the same region as your account.
  This feature does not share data across different regions.
* You cannot change the logging or tracing levels for an app. The app provider sets these levels
  when publishing the app.
* Snowflake recommends reviewing the trace events and log messages in the event table before enabling
  event sharing.
* Snowflake recommends disabling event sharing if you do not need to troubleshoot the app.

## Set up an event table

To collect the log messages and trace events emitted by the app, consumers must create an event table to
store the information.

> **Note:**
>
> IF the consumer does not set up an event table and make it the active event table before installing the
> app, trace event and log data is discarded.
>
> If a provider includes required event definitions in the app, they are enabled by default during
> installation. However, if the consumer does not have an active event table, the log messages and
> trace events emitted by the app are discarded.

An account can have multiple event tables, but only one of them can be set as the active event table in a
Snowflake account at a time. Without an active event table, log messages and trace events that the app emits
are not captured. This is true even if the functions and procedures in an app call the logging and trace
event APIs directly.

To create an event table, run the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command as shown in the following example:

```sqlexample
CREATE EVENT TABLE event_db.event_schema.my_event_table;
```

Note that this command specifies the database and schema that contain the event table.

After creating the event table, use the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command to
specify that the event table is the active table for the account:

```sqlexample
ALTER ACCOUNT SET EVENT_TABLE=event_db.event_schema.my_event_table;
```

## Enable event sharing for an app

The Snowflake Native App Framework supports sharing log messages and trace events stored in the consumer event table with the
app provider. To share logs and event information with a provider, the consumer must enable event
sharing for an app.

### Prerequisites for enabling event sharing for an app

The following prerequisites must be met to enable event sharing for an app instance:

* Use a role with the MANAGE EVENT SHARING global privilege. The ACCOUNTADMIN role has this privilege by
  default and can grant it to other roles.
* Set up an event table in the consumer account.

### Enable event sharing using Snowsight

> **Note:**
>
> If the provider includes required
> event definitions
> in the app, event sharing and the required event definitions are enabled during installation and
> cannot be disabled later.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app.
4. Select the Settings icon in the toolbar.
5. Select the Events and logs tab.
6. Under the Events and logs sharing area, move the slider for the events you want to capture.
7. If the provider has defined event definitions for the app:

   1. Use the slider to enable optional event definitions. By default, all event types are enabled.
   2. Select Save.
8. If no event table is currently selected, select the event table from the list
   under Event table location.

   > **Caution:**
   >
   > Use caution when changing the event table in Snowsight. Each Snowflake account
   > uses a single event table for all events generated within the account. Changing the event
   > table causes all events generated in the account to be stored in the new location.

### Enable event sharing by using SQL

1. Use the
   [SHOW TELEMETRY EVENT DEFINITIONS](../../sql-reference/sql/show-telemetry-event-definitions.md)
   command to determine the event definitions for the app:

   ```sqlexample
   SHOW TELEMETRY EVENT DEFINITIONS IN APPLICATION hello_snowflake;
   ```

   If the provider did not configure the app to use event definitions, the `type` column
   displays `ALL`. Otherwise, this command lists the optional event definitions specified
   for the app.
2. If the app contains required event definitions, use the
   [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command to
   enable them:

   ```sqlexample
   ALTER APPLICATION hello_snowflake SET AUTHORIZE_TELEMETRY_EVENT_SHARING=true
   ```

   This command enables all of the require event definitions, but does not enable optional event
   definitions.

   > **Note:**
   >
   > After enabling the required event definitions for an app, event sharing cannot be disabled.
3. If the app contains options event definitions, use the use the
   [ALTER APPLICATION](../../sql-reference/sql/alter-application.md)
   to enable them as shown in the following example:

   ```sqlexample
   ALTER APPLICATION hello_snowflake SET SHARED TELEMETRY EVENTS ('SNOWFLAKE$TRACES', 'SNOWFLAKE$DEBUG_LOGS');
   ```

   This example enables the `SNOWFLAKE$TRACES` and `SNOWFLAKE$DEBUG_LOGS` based on the output of the
   [SHOW TELEMETRY EVENT DEFINITIONS](../../sql-reference/sql/show-telemetry-event-definitions.md)
   command.
4. To verify that event tracing and logging is enabled, use the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md)
   command:

   ```sqlexample
   DESC APPLICATION hello_snowflake;
   ```

   The `authorize_telemetry_event_sharing` and `share_events_with_provider` rows of the output
   indicate if event sharing is enabled.

### Enable event sharing using SQL (deprecated functionality)

> **Caution:**
>
> The method of enabling event sharing using SQL described in this section will be deprecated
> in a future release. Snowflake recommends using the method described in
> Enable log and event sharing using SQL
> to enable event sharing using SQL.

To enable event sharing for an app, run the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command to set
SHARE_EVENTS_WITH_PROVIDER to `TRUE`. For example:

```sqlexample
ALTER APPLICATION HelloSnowflake SET SHARE_EVENTS_WITH_PROVIDER = TRUE;
```

To show the event sharing status for an app, use the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command as shown in the following example:

```sqlexample
DESC APPLICATION HelloSnowflake;
```

`SHARE_EVENTS_WITH_PROVIDER` shows the status of event sharing for the app.

## Enable event definitions during upgrades

During upgrades, event definitions behave as follows:

> | Change to event definition | Behavior during upgrade |
> | --- | --- |
> | No change to an event definition | The event definition retains the same status as the previous version or patch. |
> | A new event definition | Not enabled automatically. This is true for both required and optional event definitions. The consumer must manually enable new event definitions. |
> | Changes from required to optional or optional to required | The event definition retains the same status as the previous version or patch. |
> | Deleted event definition | Event sharing stops after upgrade for log messages or trace events filter in the previous version or patch. |

During upgrade, consumers are prompted to review the changes to event definitions from the previous patch or
version.

## View the log messages and trace events in the event table

When an event table is enabled, consumers can query the event table to see the log messages and
trace events emitted by the app.

### View the event log messages and trace events by using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app.
4. In the toolbar, select Settings.
5. Select the App events tab.
6. In the Event logging section, select View logs.

This opens a new worksheet with a pre-populated
[SELECT](../../sql-reference/sql/select.md) statement that displays the log messages
and trace events for the app.

### View the event log messages and trace events by using SQL

Use the [SELECT](../../sql-reference/sql/select.md) command to query the
event log messages and trace events, as shown in the following example:

```sqlexample
SELECT
  TIMESTAMP as time,
  RESOURCE_ATTRIBUTES['snow.executable.name'] as executable,
  RECORD['severity_text'] as severity,
  VALUE as message
FROM
  "EVENT_LOG"."PUBLIC"."CONSUMER_EVENT_TABLE"
WHERE RESOURCE_ATTRIBUTES['snow.application.name'] = 'YOUR_APP_NAME'
```

This command returns all of the log messages and trace events stored in the event table `CONSUMER_EVENT_TABLE`
for an app named `YOUR_APP_NAME`.

### Determine if a log message or trace event is shared with the provider

The RECORD_ATTRIBUTES column contains the `snow.application.shared` field. If the value of
this field is TRUE, the log message or trace event is shared with the provider. Otherwise,
the log message or event is not shared.

## View the log and trace levels for an app

The log and trace level of an app are defined by the provider before publishing an app.
Consumers cannot change the log and trace levels for an app.

However, before setting up event tracing or enabling event sharing for an app, Snowflake
recommends verifying the log level to understand the type of information that collected
and shared with the provider.

To view the log and trace level of an app, run the following command:

```sqlexample
DESC APPLICATION HelloSnowflake;
```

This command displays information about the `HelloSnowflake` app, including the following information
about the log and trace level set for the app:

* log_level: The log level set by the provider.
* trace_level: The trace level set by the provider.
* metric_level: The metric data level set by the provider.
* log_event_level: The log event level set by the provider.
* effective_log_level: The log level set for the app.
* effective_trace_level: The trace level enabled for the app.

The effective log and trace levels are determined by the event definitions the consumer enables for the app.

For example, if the provider defines the log level as OFF, but consumer enables the ERROR_AND_WARNING event
definition, the app dynamically changes log level to WARN so that ERROR_AND_WARNING events can be collected. The app
emits events that are less or equally as verbose as WARN and shares those error and warning events with the provider.
The values of `log_level` would be OFF and the value of `effective_log_level` would be WARN.

In contrast, if the provider defines the log level as TRACE, but the consumer enables the ERROR_AND_WARNING
event definition, the app emits events that are less or equally as verbose as trace, but only error and warning
messages are shared with the provider. The value of both log_level and effective_log_level would be TRACE.

---
title: Set up the containers and services managed by an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-containers.md
section: Native Apps Framework
---

# Set up the containers and services managed by an app

The topic describes how to set up the containers and services for a Snowflake Native App with Snowpark Container Services.

## Create an image repository

To manage containers with a Snowflake Native App, providers must create an
[image repository](../snowpark-container-services/working-with-registry-repository.md)
in the provider account to store the images required by the app.

The image repository must exist within a database and schema. The following example shows how to
create an image repository using the [CREATE IMAGE REPOSITORY](../../sql-reference/sql/create-image-repository.md) command.

```sqlexample
CREATE DATABASE provider_db;
CREATE SCHEMA provider_schema;
CREATE IMAGE REPOSITORY provider_repo;
```

> **Note:**
>
> Snowflake recommends that providers create the image repository outside the application package.
>
> If the application package is attached to a listing and the listing is configured to use
> [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md)
> an image repository within the application package would be replicated and additional costs incurred.

The images uploaded to this repository are accessible to the application package when adding a version
definition. The app has can only access the images in this repository that are specified in the manifest
file of the application package.

The following consideration apply to image repositories in the context of an app with containers:

* External image repositories are not supported. Image repositories that are outside Snowflake cannot be
  referenced by any services within the container. This is applicable to services that exist in or outside
  of the app.
* Providers cannot directly share an image repository with an app. For example, providers cannot use the
  GRANT TO SHARE IN APPLICATION PACKAGE command.
* Providers can store multiple container images in an image repository. However, container images that are not
  explicitly listed in the manifest are not accessible by the app in the consumer’s account.
* When a provider adds a version definition to an application package the container images included in that
  version cannot be modified. The images for that version are immutable and persist
  throughout the life cycle of the version. To alter the containers within an app, providers must use a new
  version.

## Upload container images to the image repository

After creating an image repository, providers use Docker commands to upload the container images required
by the app to the image repository. The specific commands required depend on the provider’s environment.
However, the general workflow is:

1. docker login
2. docker build
3. docker tag
4. docker push

The following shows a typical example of how to use these commands:

```bash
$ docker login org-provider-account.registry.snowflakecomputing.com
$ docker build --rm --platform linux/amd64 -t service:1.0 .
$ docker tag service:1.0 org-provider-account.registry.snowflakecomputing.com/provider_db/provider_schema/provider_repo/service:1.0
$ docker push org-provider-account.registry.snowflakecomputing.comprovider_db/provider_schema/provider_repo/service:1.0
```

## Create the service specification file

The service specification is a YAML file that Snowpark Container Services uses to configure and
run a service. See [Service specification reference](../snowpark-container-services/specification-reference.md) for
general information on the syntax of this file. See [Create a service from a specification file](container-services.md)
for an example of creating a service in the setup script.

The following example shows the fields in the service specification file that are required by
an app with containers.

```yaml
spec:
  containers:
  - image: /provider_db/provider_schema/provider_repo/server:prod
    name: server
    ...
  - image: /provider_db/provider_schema/provider_repo/web:1.0
    name: web
    ...
  endpoints:
  - name: invoke
    port: 8000
  - name: ui
    port: 5000
    public: true
```

> **Note:**
>
> The service specification file references container images using the original database, schema and
> image repository names. During installation or upgrade, a service is created from the service
> specification file..
>
> Explicit registry URLs, for example `org-provider.registry.snowflakecomputing.com/db/schema/repo/img:123`
> are not supported and result in an error. The location of the image must always a full-qualified
> name in the provider account.

## Use a specification template

Providers can also use a [specification template](../snowpark-container-services/working-with-services.md)
by adding a reference to a template in the service specification file:

```yaml
spec:
  containers:
  - image: /provider_db/provider_schema/provider_repo/server:prod
    name: my_app_container
  endpoints:
  - name: invoke
    port: 8000
  - name: ui
    port: 5000
    public: true
```

See [Create a service with a specification template](container-services.md) for an example of creating a service in an app using a
specification template.

---
title: Set up the Snowflake CLI to develop an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/installing-snow-cli-na.md
section: Native Apps Framework
---

# Set up the Snowflake CLI to develop an app

This topic describes how providers install and use the Snowflake CLI for developing and managing
Snowflake Native Apps. Snowflake CLI is an open-source command-line tool explicitly designed for developer-centric
workloads in addition to SQL operations.

## Install and configure the Snowflake CLI

To install and configure the Snowflake CLI, follow these steps:

1. Install the Snowflake CLI for your operating system.

   For more information, see [Installing Snowflake CLI](../snowflake-cli/installation/installation.md).
2. Set up a connection to your Snowflake account.

   For more information, see [Configuring Snowflake CLI](../snowflake-cli/connecting/configure-cli.md).

## About Snowflake CLI projects and app templates

When using the Snowflake CLI to develop your Snowflake Native App, you work within a project. A project is a directory
that contains all the files and directories required for your Snowflake Native App. Like other code repositories,
these files can be version-controlled using technologies like Git and shared on platforms like Github.

Snowflake provides app templates that you can use to set up your project. These templates are available
in the [snowflake-cli-templates GitHub repository](https://github.com/snowflakedb/snowflake-cli-templates).

The available templates are:

| **Template** | **Description** |
| --- | --- |
| app_basic | A basic template that includes the essential files and directories required for your app. |
| app_streamlit_java | A template that includes the essential files and directories required for your app, Java extension code, and sample Streamlit app. |
| app_streamlit_js | A template that includes the essential files and directories required for your app, JavaScript extension code, and sample Streamlit app. |
| app_streamlit_python | A template that includes the essential files and directories required for your app, Python extension code, and sample Streamlit app. |

## Set up a new project for your Snowflake Native App

To set up a new project for your Snowflake Native App, follow these steps:

1. Run the `init` command to create a new project:

   ```bash
   snow init --template <template_name> <project_name>
   ```
2. Enter a value for the project identifier.

   This value is used as a base name for the app components that Snowflake CLI creates, including the
   application package. You can modify or override this value later in the project definition file.

After running this command, a new directory named `<project_name>` is created in the directory
where you ran the command. This directory contains the files and directories required for your Snowflake Native App
with the following directory structure:

```text
<project_name>/
├── app/
 ├── manifest.yml
 ├── README.md
 ├── setup_script.sql
├── README.md
├── snowflake.yml
```

The folders and files in this directory are described in the following table:

| **File/Directory** | **Description** |
| --- | --- |
| `app/` | This directory contains the application code and resources for your app. You can modify or add files in this directory as needed. |
| `app/manifest.yml` | This file defines the metadata and configuration for your app, including the app name, version, description, and resources. See [Create the manifest file for an app](manifest-overview.md) for more information. |
| `app/README.md` | This file provides an overview of your app and instructions for using it. This is the README file that is displayed in the Snowflake Marketplace |
| `app/setup_script.sql` | This SQL script is executed when the app is installed. You can modify this script to include any setup steps required for your app. For more information, see [Create the setup script](creating-setup-script.md) |
| `README.md` | This file provides an overview of the project and instructions for using the Snowflake CLI with your app. |
| `snowflake.yml` | This is the project definition file that describes the objects that can be deployed to Snowflake. You must modify this file to define the resources that are part of your app. |

## Create the project definition file

Snowflake CLI uses a project definition file to describe objects that can be deployed to Snowflake. This
file must be named `snowflake.yml`. This file determines the name of the application package and
specifies the resources that are part of your Snowflake Native App.

For more information about the project definition file, see
[Project definition files](../snowflake-cli/native-apps/project-definitions.md).

The following is an example of a simple `snowflake.yml` file used for a Snowflake Native App:

```yaml
definition_version: 2
entities:
  hello_snowflake_package:
    type: application package
    stage: stage_content.hello_snowflake_stage
    manifest: app/manifest.yml
    identifier: hello_snowflake_package
    artifacts:
       - src: app/*
         dest: ./
  hello_snowflake_app:
    type: application
    from:
       target: hello_snowflake_package
    debug: false
```

The following table describes the fields in this example:

| **Field** | **Description** |
| --- | --- |
| `definition_version` | The version of the project definition file format. The current version is 2. |
| `entities` | A list of entities that are part of the project. Each entity has a unique identifier and a type. |
| `hello_snowflake_package` | The identifier for the application package entity. This name is used as a base name for the app components that Snowflake CLI creates, including the application package. |
| `type` | The type of entity. In this case, it is an `application package`. |
| `stage` | The stage where the application package is stored. |
| `manifest` | The path to the manifest file that defines the metadata and configuration for your app. |
| `identifier` | The identifier for the application package entity. This name is used as a base name for the app components that Snowflake CLI creates, including the application package. |
| `artifacts` | A list of files and directories that are included in the application package. Each artifact has a source (`src`) and a destination (`dest`). |
| `hello_snowflake_app` | The identifier for the application entity. |
| `from` | Specifies the source of the application. In this case, it is created from the `hello_snowflake_package` application package. |
| `debug` | A boolean value that indicates whether debug mode is enabled for the app. |

## Develop your Snowflake Native App

After setting up your project and creating the project definition file, you can start developing
your Snowflake Native App by creating the application package and modifying the manifest file and setup script.

For more information, see the following topics:

* [Create and manage an application package](creating-app-package.md)
* [Create the manifest file for an app](manifest-overview.md)
* [Create the setup script](creating-setup-script.md)

---
title: Share data content in a Snowflake Native App
source: https://docs.snowflake.com/en/developer-guide/native-apps/preparing-data-content.md
section: Native Apps Framework
---

# Share data content in a Snowflake Native App

This topic describes how providers can add shared data content to a Snowflake Native App.

> **Note:**
>
> Providers can publish a Snowflake Native App to the Snowflake Marketplace as a limited trial listing. To publish
> an app as a trial listing, see [Preparing to offer a limited trial listing](https://other-docs.snowflake.com/collaboration/provider-listings-preparing#label-prepare-limited-trial-listing).

## About shared data in a Snowflake Native App

The Snowflake Native App Framework allows providers to add shared data content to an app. This data content is shared with
the consumers when they install and use the app. To share data content, providers must grant privileges
on the shared data to the application package. Data content that providers share with an application
package is shared across all installed instances of the app.

> **Caution:**
>
> Shared data content is not versioned, which means that all versions of an app use the same data.

Consumers cannot access shared content directly. Instead, a provider creates a secure view in the setup
script of an application package and grants the consumer access to the secure view.
For details, see Allow consumers to access shared objects in an app.

### Database objects that can be shared

The Snowflake Native App Framework allows providers to add the following database objects to an application package:

* Schemas
* Tables, including external and Apache Iceberg™ tables
* Views

> **Note:**
>
> When sharing database objects such as tables and views, you must also share the
> schema that contains them.

The following restrictions apply to tables and views shared in an application package:

* Tables cannot have virtual columns, including policies containing Java, Python, or JavaScript code.
* View definitions or virtual columns associated with them such as policies cannot
  contain calls to Java, Python, or JavaScript.
* Shared tables cannot be temporary, volatile, or transient tables.
* Cross-Cloud Auto-Fulfillment is not supported for apps that contain external or Apache Iceberg™ tables.

### Restrictions on sharing data content that contains policies

Some functions that are commonly referenced by policies, such as CURRENT_USER, behave
differently in the context of an app installed in a consumer account. Although a policy
defined by a provider might work correctly in the provider account, it might not work correctly in the consumer account after the consumer
installs the app.

Snowflake recommends that providers define policies on the proxy views that are specified in
the setup script. Defining policies on the proxy views ensures that the definition of the policies
cannot be changed after an app is installed. Defining policies on proxy views also ensures
that during upgrades running code continues to use the policies that were applied when the version was
created.

## Grant privileges on shared content to an application package

To include shared data content in an app, providers must grant privileges on the object to be shared
with the application package. When providers add an object to an application
package, by default, the object is private to the application package and not visible when the
app is installed.

To make objects visible to an app installed from the application package, use the
GRANT … TO SHARE IN APPLICATION PACKAGE command as shown in the following example:

```sqlexample
CREATE APPLICATION PACKAGE app_package;

GRANT USAGE ON SCHEMA app_package.shared_schema
  TO SHARE IN APPLICATION PACKAGE app_package;
GRANT SELECT ON TABLE app_package.shared_schema.shared_table
  TO SHARE IN APPLICATION PACKAGE app_package;
```

In this example, the first command grants the USAGE privilege on the `shared_schema` schema to the
application package. This command allows the schema to be shared with consumers. The second command
grants the SELECT privilege on the `shared_table` table within `shared_schema` to the application
package. This command allows consumers to query the table.

After a consumer installs an app from the application package `app_package`, they can access the
`shared_schema` and query the `shared_table`.

> **Note:**
>
> When adding a shared object to an application package, you must also share the schema that contains
> the object.

You can also share views with an app by using similar SQL commands.

## Grant privileges on objects outside the application package

To share a database object that exists outside the application package, providers must create
views in the application package that allow access the object. You cannot share objects
outside the application package directly with app installed in the consumer account.

For example, to share a database that is outside the application package, grant the REFERENCE_USAGE
privilege on the database to the application package as shown in the following example:

```sqlexample
GRANT REFERENCE_USAGE ON DATABASE other_db
  TO SHARE IN APPLICATION PACKAGE app_pkg;
```

After granting the REFERENCE_USAGE on the external database, a provider must create a view within the
application to references the shared objects as shown in the following example:

```sqlexample
CREATE VIEW app_pkg.shared_schema.shared_view
  AS SELECT c1, c2, c3, c4
  FROM other_db.other_schema.other_table;
```

This command creates a view in the application package that references the database, table,
and schema that are outside the application package.

After creating the view, you must grant privileges on the schema and view to the application
as shown in the following example:

```sqlexample
GRANT USAGE ON SCHEMA app_pkg.shared_schema
  TO SHARE IN APPLICATION PACKAGE app_pkg;
GRANT SELECT ON VIEW app_pkg.shared_schema.shared_view
  TO SHARE IN APPLICATION PACKAGE app_pkg;
```

## Allow consumers to access shared objects in an app

Database objects that are created in the setup script are directly accessible to the app
after installation. These include tables, functions, procedures, and new view definitions,

However, by default, database objects shared with an application package are not visible to consumers.
To allow consumers to view and access data content, the application package must create a secure view
and grant the appropriate privileges.

This approach offers the following advantages:

* Creating the view in the setup script ensures that changes made directly to the
  shared objects, such as new columns, are not visible to the version of the Snowflake Native App being
  installed by the setup script. To allow access to changes to the object, providers must create a new
  version or patch for the app.
* Creating the view in a versioned schema ensures that each version of the Snowflake Native App contains
  only the definition of the view for that version. This is important for upgrade scenarios.

To expose shared objects to a consumer, the setup script must include commands to:

* Install views within a versioned schema in the application package.
* Grant access to those views to the consumer using an application role.

The following example describes how to use application roles to grant access to shared objects within the
setup script of the application package:

```sqlexample
CREATE APPLICATION ROLE app_user;

CREATE OR ALTER VERSIONED SCHEMA inst_schema;
GRANT USAGE ON SCHEMA inst_schema
  TO APPLICATION ROLE app_user;

CREATE VIEW IF NOT EXISTS inst_schema.shared_view
  AS SELECT c1, c2, c3, c4
  FROM shared_schema.shared_table;

GRANT SELECT ON VIEW inst_schema.shared_view
  TO APPLICATION ROLE app_user;
```

In this example, the view accesses content in the `shared_schema.shared_view` and shares it with the
application package.

> **Note:**
>
> If a provider attempts to define a view that directly accesses shared data content, such as a
> database object external to the application package, Snowflake returns an error.

## Define policies on proxy views

Snowflake recommends that you create proxy views and define policies to protect them within the
setup script. Defining policies to protect the proxy view ensures that the definition
of policies cannot be changed after the Snowflake Native App is installed. Defining policies on proxy
views also ensures that during upgrades running code continues to use the policies that are applied
when the upgraded version is created.

## Support for external and Apache Iceberg™ tables

The Snowflake Native App Framework allows providers to share [external tables](../../user-guide/tables-external-intro.md) and
[Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) with consumers. The method for sharing these types
of tables is similar to normal tables in that providers add views to the setup script and grant
privileges on these views to an application role.

> **Caution:**
>
> External and Apache Iceberg™ tables might incur additional egress or ingress costs to the provider or consumer
> if the backing object store is not in the same region as the app listing.

The following restrictions and requirements apply when sharing an external or Iceberg table:

* External and Iceberg tables and views that access them are read-only for the app.
* Cross-Cloud Auto-Fulfillment is not supported for apps
  that share external or Iceberg tables.
* Consumers must allow the app to use an external or Iceberg table in the provider account before it
  is available to the app.

### Cross-Cloud Auto-Fulfillment is not supported

External and Iceberg tables are not supported for apps that have listings that enable
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md).

> **Note:**
>
> To publish an app that uses external or Iceberg tables to multiple Snowflake regions, providers
> must publish the listing to each region.

If a provider creates a listing for an app that includes an external or Iceberg table,
the ability to configure Cross-Cloud Auto-Fulfillment is disabled in Snowsight.

If a provider attempts to add a version or patch to an application package that is the
data product of a listing that has Cross-Cloud Auto-Fulfillment configured, Snowflake
returns an error.

### Add an external or Iceberg table to an app

In addition to adding an external or Iceberg table to the application package, providers must perform
the following:

* Add an entry to the manifest file to enable external and Iceberg tables. See
  [Add an entry for external and Iceberg tables to the manifest](requesting-external-tables.md)
  for more information.
* Providers can use the Python Permission SDK to allow consumers to use Snowsight to enable
  the app to access an external or Iceberg table. Alternately, providers can ask consumers to
  enable the external or Iceberg table manually.
  See [Request permissions to access external and Iceberg tables](requesting-external-tables.md)
  for more information.

## Use caution when revoking privileges on or dropping shared objects

Use caution when revoking privileges from shared objects in an application package or when dropping
shared objects. If an installed version of a Snowflake Native App still requires access to those objects,
the Snowflake Native App might become unstable or fail.

## Considerations when granting the MONITOR or OPERATE privilege on dynamic tables

Providers should use caution when granting the MONITOR or OPERATE privilege on dynamic tables to an
application role. These privileges allow the consumer to view a dynamic table’s metadata, which might
expose the implementation details of the app. See [Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md) for
more information on what actions the consumer can perform.

## Revoke and drop permissions on shared objects

Use caution when revoking permissions on shared objects from an application package or when dropping shared
objects. If an installed version of the Snowflake Native App still requires access to those objects,
the Snowflake Native App might become unstable or fail.

---
title: Snowflake Native App Framework workflow
source: https://docs.snowflake.com/en/developer-guide/native-apps/native-apps-workflow.md
section: Native Apps Framework
---

# Snowflake Native App Framework workflow

This topic describes the workflows for developing, publishing, and installing a Snowflake Native App.

## Development workflow

The following workflow outlines the general tasks for developing and testing Snowflake Native App:

> **Note:**
>
> Developing an app is an iterative process. You might perform many of these tasks multiple
> times or in a different order depending on the requirements of your app and environment.

1. Set up your development environment.

   To develop a Snowflake Native App, you need to set up your development environment. This includes:

   * Install the Snowflake CLI. See [Set up the Snowflake CLI to develop an app](installing-snow-cli-na.md).
   * Create a stage to upload your application files.

     > **Note:**
     >
     > If you are using Snowflake CLI you do not need to create a stage manually because Snowflake CLI
     > automatically creates a temporary stage to upload your application files during development.

     For information on creating a stage using SQL, see [CREATE STAGE](../../sql-reference/sql/create-stage.md). For information on creating a stage using Snowsight, see
     [Staging files using Snowsight](../../user-guide/data-load-local-file-system-stage-ui.md).
2. [Create an application package](creating-app-package.md).

   An application package is a container that encapsulates the data content, application logic,
   metadata, and setup script required by an app.
3. [Create the setup script](creating-setup-script.md) for your
   app.

   The setup script contains the SQL statements that define the components created
   when a consumer installs your app.
4. [Create the manifest file](manifest-overview.md) for your
   app.

   The manifest file defines the configuration and setup properties required by the app,
   including the location of the setup script and versions.
5. Upload the application files to a stage.

   The setup script, the manifest file, and other resources that your app requires
   must be uploaded to a named stage so that these files are available as you develop your app.
6. Add versions and patches for your app.

   See [About release channels, versions, and patches](release-channels-versions.md) for more information.
7. Add shared data content to your app.

   You can securely share your data content with consumers as part of your app. For more information,
   see [Share data content in a Snowflake Native App](preparing-data-content.md)
8. Add features to your app.

   You can add various features to your app to provide additional functionality, including the following
   features:

   * [Add application logic to an application package](adding-application-logic.md)
   * [Extending Snowflake with Functions and Procedures](../extensibility.md).
   * [Snowpark API](../snowpark/index.md).
   * [Introduction to external functions](../../sql-reference/external-functions-introduction.md).
9. [Set up logging and event handling to troubleshoot your app.](event-about.md)

   To troubleshoot an app, you can set up logging and event handling.
   Consumers can set up logging and event handling in their account and share them with providers.
10. Set the release directive for your app.

    A release directive determines which version and patch level are available to consumers. You can set the release directive for each release channel of your application package. For more information, see
    [Set the release directive using a release channel](release-channels.md).
11. Test your app.

    You can test an app in your account before publishing it to consumers. For more information, see
    [Install and test an app locally](installing-testing-application.md).

    Snowflake provides [development mode](installing-testing-application.md) and
    [debug mode](installing-testing-application.md) to test different aspects of your app.
12. [Run the automated security scan](security-overview.md).

    Before you can share an app with consumers outside your organization, the app must pass an
    automated security scan to ensure that it is secure and stable.

## Publishing workflow

After developing and testing your app, providers can publish the app to share it
with consumers.

1. [Become a provider](../../collaboration/provider-becoming.md).

   Becoming a provider allows you to create and manage listings to share your app with consumers.
2. Create a listing.

   You can create a private listing or a Snowflake Marketplace listing to share your app with consumers.
   For more information, see [Create a listing for an app](ui-provider-publishing-app-package.md).
3. Submit your listing for approval.

   Before you can publish a listing to the Snowflake Marketplace, you must submit the listing to
   Snowflake for approval. For more information, see [Submit a listing for approval](ui-provider-publishing-app-package.md)
4. Publish your listing.

   After your listing is approved, you can publish the listing to make it available to consumers.
   For more information, see [Publish a listing for an app](ui-provider-publishing-app-package.md).

## Consumer workflow

Consumers can discover the app and install it from a listing. After installing the
app, consumers can configure, use, and monitor the app. See
[Working with apps as a consumer](https://other-docs.snowflake.com/en/native-apps/consumer-about).

1. [Become a Snowflake consumer](../../collaboration/consumer-becoming.md).

   Becoming a Snowflake consumer allows you to access listings shared privately or on the
   Snowflake Marketplace. You can also access data shared as part of direct shares or data exchanges, which
   offer more limited data sharing capabilities.
2. [Install the app](https://other-docs.snowflake.com/en/native-apps/consumer-installing).

   Consumers can install an app from a listing.
3. [Grant the privileges required by the app](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs).

   Some apps might ask the consumer to grant global and object-level privileges to
   the app.
4. [Enable logging and event sharing to troubleshoot the app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

   A provider can set up an app to emit logging and event data. A consumer can set up an events table
   to share this data with providers. Logs and event data are useful when troubleshooting an app.
5. [Manage an app](https://other-docs.snowflake.com/en/native-apps/consumer-managing-applications).

   After installing and configuring the app, a consumer can perform additional tasks to
   use and monitor the app.

---
title: Snowflake Native App manifest file reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/manifest-reference.md
section: Native Apps Framework
---

# Snowflake Native App manifest file reference

This topic describes the structure and fields of a Snowflake Native App manifest file.

## `manifest_version` field

Specifies the version of the manifest file format.

This field is required.

### `manifest_version: 1`

This version of the manifest file supports the current and legacy
functionality of Snowflake Native Apps.

Example: `manifest_version: 1`

### `manifest_version: 2`

This version of the manifest file provides support for additional
features, including automated granting of privileges.

> **Caution:**
>
> Before using version 2 of the manifest file, consider the security
> implications described in [About the manifest file](manifest-overview.md).

Example: `manifest_version: 2`

### `manifest_version` example

```yaml
manifest_version: 2
```

## `version` field

Defines a block containing fields related to the version of an app.
For more information about versions and patches, see [Update an app (Legacy)](update-app.md).

> **Note:**
>
> Versions and patches defined using the
> [CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md) or
> [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) commands take
> precedence over those defined in the manifest file.

This field is optional.

### `name`

Specifies the name of the version. The version name can
only contain alphanumeric characters, underscores (_), hyphens (-), dollar signs ($), periods (.), and spaces.

This field is optional.

Example: `name: v1`

### `patch`

Specifies the default patch number.

This field is optional.

Example: `patch: 1`

### `label`

Specifies a name for the version that is displayed to consumers.

This field is optional.

Example: `label: "Initial Release"`

### `comment`

Specifies a comment for the version. This comment is visible in Snowsight or when the provider runs the [SHOW VERSIONS IN APPLICATION PACKAGE](../../sql-reference/sql/show-versions.md) command.

This field is optional.

Example: `comment: "This is the initial release of the app."`

### `version:` example

```yaml
version:
  name: v1
  patch: 1
  label: "Initial Release"
  comment: "This is the initial release of the app."
```

## `artifacts:` field

Defines a block that specifies resources used the app.

This field is required.

### `setup_script:`

Specifies the path and filename of the setup script that is run when
the Snowflake Native App is installed or upgraded. If you do not specify a
value, the app uses the default value is `setup.sql` in the same
directory as the manifest file. The setup script name and path
can only contain alphanumeric characters, underscores (_), hyphens (-), periods (.), backslashes (), and forward slashes (/).

Example: `setup_script: scripts/setup.sh`

### `readme:`

Specifies a path to a Markdown readme file that provides an overview of the app and its functionality.

In the case of a Streamlit app, if no value is specified for the `default_streamlit` property, the contents of this file is displayed to consumers when viewing the installed Snowflake Native App.

The location of this file is specified relative to the location of the manifest file.

This field is optional, however Snowflake recommends that you include a readme file with your app.

Example: `readme: docs/README.md`

### `default_streamlit_app:`

If the Snowflake Native App includes a Streamlit app, this property specifies
the schema and name of the default Streamlit app available to consumers.

This field is required if the app includes a Streamlit app.

### `extension_code:`

Enables or disables the use of extension code languages, including Java, Python, and Scala.

Example: `extension_code: true`

### `container_services:`

Specifies the location of the container images used by an app with containers. See [Specify the container images used by an app with containers](manifest-overview.md)
for more information.

This field is required for an app with containers.

#### `uses_gpu:`

Indicates that the app with containers uses a GPU.

This field is required for an app with containers.

Example: `uses_gpu: true`

#### `images:`

Specifies the path to each of the container images used by an app with
containers.

This field is required for an app with containers.

Example:

```yaml
images:
- /spcs_app/napp/img_repo/eap_frontend
- /spcs_app/napp/img_repo/eap_backend
- /spcs_app/napp/img_repo/eap_router
```

### `artifacts` example

```yaml
artifacts:
  setup_script: scripts/setup.sql
  readme: docs/README.md
  default_streamlit_app: apps/main.py
  extension_code: true
  container_services:
      uses_gpu: true
      images:
        - /spcs_app/napp/img_repo/eap_frontend
        - /spcs_app/napp/img_repo/eap_backend
```

## `configuration` field

Specifies a block containing configuration properties for an app.

This field is optional.

### `log_level:`

Specifies the logging level to use for the app Snowflake Native App.

If you do not set a value for this property, the default log data is
not captured.

For information about supported values, see
[Setting levels for logging, metrics, and tracing](../logging-tracing/telemetry-levels.md).

### `trace_level:`

Specifies the trace event level to use for the app. When a provider
enables tracing, the app automatically captures the start and end times
for all queries and stored procedure calls.

> **Caution:**
>
> Publishing an app with the `trace_level` property set to a
> value other than `OFF` might expose calls to hidden stored procedures to any user in the consumer account who can view
> the event table.

If you do not set a value for this property, trace events are not captured.

For the supported values of the `trace_level` property, see [Setting levels for logging, metrics, and tracing](../logging-tracing/telemetry-levels.md).

### `metric_level:`

Specifies the metric level to use for the app. When a provider enables
metrics the app automatically emits auto-instrumented resource metrics
data points to the event table.

See [Set the log and trace levels for an app](event-definition.md) for more information.

For the supported values of the `metric_level` property, see
[Setting levels for logging, metrics, and tracing](../logging-tracing/telemetry-levels.md).

### `log_event_level:`

Specifies the event logging level to use for the Snowflake Native App.

If you do not set a value for this property, log events are not captured.

For the supported values of the `log_event_level` property, see
[LOG_EVENT_LEVEL](../../sql-reference/parameters.md).

### `grant_callback:`

Specifies the schema and name of the callback function for app an with
containers. The callback function is a stored procedure that can create
compute pools, services, and perform other setup tasks required by the
application.

This field is required for an app with containers.

For more information, see [Create a service by using the grant_callback property](container-services.md).

Example: `grant_callback: my_schema.my_grant_callback`

### `configuration` example

```yaml
configuration:
  log_level: INFO
  trace_level: OFF
  metric_level: BASIC
  log_event_level: INFO
  grant_callback: my_schema.my_grant_callback
```

## `lifecycle_callbacks:` field

Specifies a block containing the lifecycle callbacks for the app.

For more information, see [Callbacks](callbacks.md).

This field is optional.

### `<callback_name>:`

Specifies the name of a lifecycle callback for the app.

This field is required if the `lifecycle_callbacks` property is specified.

## `privileges:` field

Defines a block containing the privileges that the app requests in a consumer account.

This field is required if the app requests privileges in the consumer account.

### `<privilege_name>:`

Specifies the name of a privilege that the app is requests in a consumer account.

This field is required if the `privileges` property is specified.

#### `description:`

Provides a description of the privilege being requested. The text
specified in `description` is displayed to the consumer when the
privilege is displayed in Snowsight using the
Python Permission SDK, or when the
[SHOW PRIVILEGES](../../sql-reference/sql/show-privileges.md) command is run.

As a provider, you should include as much information as possible about why the Snowflake Native App needs this privilege and if the privilege is required or optional.

This field is required if the `privileges` field is specified.

### `privileges:` example

```yaml
privileges:
- CREATE TABLE:
  description: 'Required to create tables in the consumer account.'
- CREATE COMPUTE POOL:
  description: 'Required to allow the app to create a compute pool in the consumer account.'
- BIND SERVICE ENDPOINT:
  description: 'Required to allow endpoints to be externally accessible.'
```

## `references:` field

Defines a block containing the references that the app is requesting in
a consumer account. The consumer must bind these references to objects
within their account.

This field is required if the app requests references in the consumer
account.

### `- <reference_name>:`

Specifies the name of a reference that the app is requesting in a
consumer account.

This field is required if the `references` property is specified.

#### `label:`

Specifies a label for the reference that is displayed to consumers.

This field is required if the `references` property is specified.

Example: `label: "Orders table"`

#### `description:`

Provides a description of the reference being requested. The text
specified in `description` is displayed to the consumer when the
reference is displayed in Snowsight using the
Python Permission SDK, or when the
[SHOW REFERENCES](../../sql-reference/sql/show-references.md) command is run.

This field is required if the `references` property is specified.

#### `privileges:`

Specifies a list of privileges that the app requires on the object to
which the reference is bound in the consumer account.

This field is required if the `references` property is specified.

Example:

```yaml
privileges:
  - SELECT
  - INSERT
```

#### `object_type`

Specifies the type of object associated with the reference, such as a schema and table, or an API integration.

This field is required if the `references` field is specified.

Example: `object_type: TABLE`

For more information, see
[Object types and privileges that a reference can contain](requesting-refs.md).

#### `multi_valued:`

Allows more than one object to be associated with the reference.
Use this property to bind multiple consumer objects to the same reference. When this property is specified, the same operations are
performed on objects with a single value reference. The property
can also be used with objects with multi-valued references.

This field is optional. The default value is `false`.

For more information, see
[Request references and object-level privileges from consumers](requesting-refs.md)

Example: `multi_valued: true`

#### `register_callback`

Specifies the schema and name of the callback function that is run
when the consumer binds the reference to an object in their account.

This field is required if the `references` property is specified.

Example: `register_callback: my_schema.my_register_callback`

#### `configuration_callback`

Specifies the name of the callback function that provides the desired configuration for the object to bind to this reference.

This property is required if `object_type` is
`EXTERNAL ACCESS INTEGRATION` or `SECRET`. This property is not applicable to other types of objects.

#### `required_at_setup`

Indicates that references must be bound when the app is installed.

Example: `required_at_setup: true`

### `references` example

```yaml
references:
- ORDERS_TABLE:
    label: "Orders table"
    description: "Orders table in TPC-H samples"
    privileges:
      - SELECT
    object_type: VIEW
    multi_valued: false
    register_callback: v1.register_single_callback

- EXTERNAL_ENDPOINT_EAI:
    label: "Allows egress to an external API"
    description: "EAI for Egress from NA+SPCS"
    privileges: [USAGE]
    object_type: EXTERNAL_ACCESS_INTEGRATION
    register_callback: v1.register_single_callback
    configuration_callback: v1.get_configuration
    required_at_setup: true
```

## `restricted_callers_rights:` field

Specifies configuration properties related to restricted caller’s
rights.

This field is required if the app creates stored procedures or Snowpark Container Services
services that run with restricted caller’s rights.

For more information, see
[Use owner’s rights and restricted caller’s rights in an app](restricted-callers-rights.md).

### `enabled:`

Specifies whether the app is allowed to create executables with restricted caller’s rights.

Providers must set this property to `true` if the app creates stored procedures or Snowpark Container Services services that run with restricted caller’s rights.

### `description:`

Provides a description of why the app needs to create executables with
restricted caller’s rights.

### `restricted_callers_rights:` example

```yaml
restricted_callers_rights:
  enabled: true
  description: "Required to create stored procedures that run with restricted caller's rights."
```

## `restricted_features:` field

Specifies configuration properties related to features that require consumer approval to enable.

### `external_data`

If present, specifies that the app shares external tables or Iceberg tables. For more information, see
[Request access to external and Apache Iceberg™ tables](requesting-external-tables.md).

### `restricted_features:` example

```yaml
restricted_features:
  - external_data:
     description: “The reason for enabling an external or Iceberg table.”
```

---
title: Snowflake Native SDK for Connectors
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/about-connector-sdk.md
section: Native Apps Framework
---

# Snowflake Native SDK for Connectors

The Snowflake Native SDK for Connectors is a library that provides a skeleton of the Snowflake native app whose purpose is to ingest data from external data source into Snowflake.
We call such an app a native connector.

The Snowflake Native SDK for Connectors is a set of application examples, templates, and tutorials which show how to
build, deploy, configure, and use a Snowflake Native App that ingests data from an external data
source into Snowflake. These resources cover both pull-based and push-based data integration patterns.

These templates do not restrict or limit developers. Instead, the templates provide examples of how
to use core Snowflake features to ingest data and encapsulate application code within a
Snowflake Native App.

The Snowflake Native App Framework allows providers to publish and monetize a Snowflake Native App on the Snowflake Marketplace.
Snowflake Native App developers can clone the template repository, modify the boilerplate code, and create
their own Snowflake Connectors.

## What is a native connector?

A connector is an application that allows data flow from an external source system into Snowflake.
A native connector is a connector application built and deployed using the Snowflake Native App Framework.
There are different types of connectors:

* pull-based connectors
* push-based connectors

The Snowflake Native SDK for Connectors currently supports only the pull-based pattern.

### Pull-based connectors

Pull-based patterns are effective when the source data provider does not manage customer data in Snowflake and is not
willing to incur COGS for a continuous data share in Snowflake. These patterns are also effective when a source data
provider has well-documented APIs that customers can use to replicate and consume data.

### How to use a pull-based pattern

By using a pull-based connector pattern, providers (Snowflake, or a third-party ETL provider) can publish and
distribute a native connector based on a Snowflake Native App using the Snowflake Marketplace.
A native connector uses direct external access to connect with the source application. It performs outbound authentication, fetches data from the source directly into a customer’s Snowflake account,
processes and persists the data based on a user-specified configuration.

### Push-based connectors

Using a push-based pattern is effective when inbound access to the source application through a customer
firewall is not feasible because of security, performance or governance limitations.
This pattern uses an agent and a Snowflake Native App to allow customers to ingest data changes into Snowflake from behind a firewall .

### How to use a push-based pattern

An agent is a standalone application, distributed as a Docker image, that is deployed in a customer environment and is
responsible for sending initial and incremental loads to Snowflake by reading data changes from a source CDC stream.

A Snowflake Native App runs within Snowflake and coordinates the integration. It is primarily responsible for managing
the replication process, controlling the agent state and creating required objects, including the target databases.

## What is the native SDK for connectors?

The Snowflake Native SDK for Connectors is a library that provides universal components that can be used to build a custom Snowflake native app
that ingests the data from an external data source into Snowflake. The provided components define the recommended flow

of the connector application and allow for customization and exclusion of some features.
As of now the Snowflake Native SDK for Connectors is provided as code to be built locally and only in Java.
Additionally, a second library containing useful helper and utility classes for writing unit tests is provided.
Those libraries can be found in the maven central repository:

* [Native SDK for Connectors library](https://central.sonatype.com/artifact/com.snowflake/connectors-native-sdk)
* [Native SDK for Connectors Test library](https://central.sonatype.com/artifact/com.snowflake/connectors-native-sdk-test/overview)

The provided examples using those libraries also include example scripts
that can be used to deploy and create instance of the application inside Snowflake.

The Snowflake Native SDK for Connectors is designed to be used when building applications based on
the Snowflake Native App Framework and then publish and monetize them using Snowflake Marketplace.
To use the Snowflake Native SDK for Connectors, clone it from a template or example application.

The Snowflake Native SDK for Connectors leverages the following features of Snowflake:

* [Native App Framework](../native-apps-about.md)
* [External network access overview](../../external-network-access/external-network-access-overview.md)
* [Stored procedures](../../stored-procedure/stored-procedures-overview.md) and [UDFs](../../udf/udf-overview.md)
* [Streamlit in Snowflake](../../streamlit/about-streamlit.md)

### Additional information

For more information about the Snowflake Native SDK for Connectors, examples, template, and tutorials see:

* [Snowflake Native SDK for Connectors GitHub repository](https://github.com/snowflakedb/connectors-native-sdk)
* [Tutorial: Snowflake Native SDK for Connectors example Java connector](tutorials/native_sdk_example_connector_tutorial.md)
* [Tutorial: Snowflake Native SDK for Connectors Java connector template](tutorials/native_sdk_template_connector_tutorial.md)

## Learn more

For more information about implementing connectors, see [Getting started with the Snowflake Native SDK for Connectors](getting_started.md)

---
title: Snowflake Native SDK for Connectors tutorials
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/tutorials_overview.md
section: Native Apps Framework
---

# Snowflake Native SDK for Connectors tutorials

To help with understanding how to build, deploy, configure, and use connector native applications, we
have created example connectors and a connector template.

For an example connector for GitHub, built using the Snowflake Native SDK for Connectors, see
[Tutorial: Snowflake Native SDK for Connectors example Java connector](tutorials/native_sdk_example_connector_tutorial.md).

For a connector template, which will provide you with a skeleton of a native app based connector, and
serve as a convenient starting point for your next custom connector, see
[Tutorial: Snowflake Native SDK for Connectors Java connector template](tutorials/native_sdk_template_connector_tutorial.md).

You can also take a look at our older example connectors, which did not use the Snowflake Native SDK for Connectors, but
may provide you with knowledge not present in our more up-to-date tutorials (e.g. on using a push-based
approach):

* [Quickstart: Pull-based Python connector](https://quickstarts.snowflake.com/guide/connectors_github_python)
* [Quickstart: Pull-based Java connector](https://quickstarts.snowflake.com/guide/connectors_github_java)
* [Quickstart: Push-based Java connector](https://quickstarts.snowflake.com/guide/connectors_example_push_based_java)

---
title: Specify the resources required by an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/marketplace-file.md
section: Native Apps Framework
---

# Specify the resources required by an app

This topic describes how to use the `marketplace.yml` file to declare the resource
requirements for an Snowflake Native App.

The `marketplace.yml` is a configuration file similar to the manifest file of an app. Snowflake uses this file in the following contexts:

* The objects specified in `required_compute_pools` and `connections` properties
  appear in the listing in Snowsight. This allows the consumer to see the resources the app
  may require.
* This file can help avoid creating or using unnecessary resources, for example replicating an application
  package to a regions where it cannot be installed by a consumer. Before consumer requests the listing in a
  remote region, Snowflake ensures that the consumer meets the resource requirements declared in the
  `marketplace.yml` file. This helps prevent unnecessary replication costs.
* Before installing and upgrading the application, Snowsight ensures the requirements are satisfied,
  to prevent installing a broken/unusable application or upgrading a working application into a unusable state.

This optional file must be at the root directory of an app at the same level as the manifest file. If this file is not present, no action is taken.

## Specify the compute pools required by an app

The following example shows how to specify the compute pool resources required for a
specific version of an app:

```yaml
required_compute_pools:
  - HIGH_MEM_POOL_1:
      label: "High memory pool"
      description: "A compute pool for computational tasks."
      compatible_instance_families:
        - HIGHMEM_X64_M
        - HIGHMEM_X64_L
```

In this example, the `required_compute_pools` a compute pool named `HIGH_MEM_POOL_1`.

The `compatible_instance_families` property specifies the type of machine to provision
for the compute pool. You must specify at least one value declared for each compute pool.
See [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) for more information.

> **Note:**
>
> If the `compatible_instance_families` property is missing or the values are invalid,
> version creation fails.

## Specify the external endpoints required by an app

The following example shows how to declare the external endpoints required by an app:

```yaml
connections:
  - LAUNCH_DARKLY:
     label: "Launch Darkly"
     description: "Feature flag and configuration"
     required: true
     endpoints:
       - "mobile.launchdarkly.com"
       - "stream.launchdarkly.com"
  - OPEN_AI:
     label: "OpenAPI"
     description: "LLM Connection"
     required: false
     endpoints:
       - "openai.com"
```

In this example, the `connection` property specifies two external endpoints,
`LAUNCH_DARKLY` and `OPEN_AI`. The `required` property indicates to the
consumer in Snowsight that the connection is required.

If you specify the `connection` in this file, the `endpoints`, and `required` properties are
required. If these properties are not present, version creation fails. The `endpoints` property
requires at least one URL.

---
title: Stored procedures and handlers customization
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/sproc_and_handlers_customization.md
section: Native Apps Framework
---

# Stored procedures and handlers customization

The Snowflake Native SDK for Connectors provides the general structure of the connector,
however, it allows for some customizations, depending on the source system and the actual needs of the developer.
For that reason, some features have empty basic implementations and it is possible to overwrite them with custom logic.
Furthermore, the components can be enabled and disabled according to specific needs, more on this in the choosing components section.

## Stored procedures

Stored procedures provided by the SDK can be split into two categories:

1. High-level entry points to the logic implemented in Java
2. Internal procedures with smaller scope

Because they have different responsibilities, the customization process is also different.

### High level procedures customization

High level procedures are used only as an entry point to the actual logic implemented in Java.
So to change the underlying logic a path to the new handler needs to be specified when recreating the stored procedure.
This procedure needs to be added as custom code in the `setup.sql` script.
This requires the new Java implementation to be provided, it can be done from scratch or using the provided in the SDK `builders`,
which are described below:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CONFIGURE_CONNECTOR(input VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/jar_with_custom_code.jar')
  HANDLER = 'com.custom.handler.CustomHandler.configure';
```

### Smaller scope procedures customization

Some of the procedures provided by the Snowflake Native SDK for Connectors have so little logic that they can be easily written using only SQL.
For those procedures it is possible to replace the default implementations using SQL only. For example some procedures with `_VALIDATE` or `_INTERNAL` suffixes can be reimplemented this way.
All those procedures can be also customized using Java only approach through `Builders`. This approach is explained below.
There is also a possibility to replace a procedure that was using only plain SQL to use handler instead. In this case it will be
the same as for the high level stored procedures above.

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.CONFIGURE_CONNECTOR_INTERNAL(config VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- input some custom logic here
    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;
```

## Handlers

The Snowflake Native SDK for Connectors defines default handlers for the stored procedures. They can be used as they are, customized or completely replaced.
For the latter case, the whole custom implementation does not need to follow standards defined by
the SDK and the custom implementation needs to be specified in SQL as it was mentioned above for customizing high level procedures.

However, if you wish to follow the flow of the connector defined by the SDK there is a way to customize only some parts of the flow.
Each existing handler is using multiple underlying objects, in the most cases those are:
`validator`, `callback` or `helper` classes. Each of them satisfies some interface and it’s
possible to replace default implementations with the custom implementations of the interface.

### Builders

To retain the SDK-defined flow in the connector during customization helper objects called `builders` are provided.
Each `handler` class has its own `builder` bundled. Those allow the user to provide a custom implementation of the underlying Java objects.
This way the developer does not need to touch the connector internal flow and can focus on customizing just the needed parts.
There is a small catch when using `builders`, this approach also requires the developer to
specify the new entry point method that will be referenced in the stored procedure.

For example, a `handler` using a customized `validator` using the `builder` looks like this:

```java
class CustomConnectionConfigurationInputValidator implements ConnectionConfigurationInputValidator {
  @Override
  public ConnectorResponse validate(Variant config) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.SET_CONNECTION_CONFIGURATION procedure using SQL
  public static Variant configureConnection(Session session, Variant configuration) {
    //Using builder
    var handler = ConnectionConfigurationHandler.builder(session)
      .withInputValidator(new CustomConnectionConfigurationInputValidator())
      .build();
    return handler.connectionConfiguration(configuration).toVariant();
  }
}
```

Then the entry point method in SQL needs to be specified like this:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.SET_CONNECTION_CONFIGURATION(input VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/jar_with_custom_code.jar')
  HANDLER = 'com.custom.handler.CustomHandler.configureConnection';
```

---
title: Sync status reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/sync_status_reference.md
section: Native Apps Framework
---

# Sync status reference

## Database objects and procedures

The following database objects are created through the file `observability/sync_status.sql`.

### PUBLIC.SYNC_STATUS

View exposed to the `ADMIN` and `VIEWER` roles and providing information about the synchronization status of the connector.
The main functionalities are based on the tables `APP_STATE`, `GENERIC_CONNECTOR_STATS`, `INGESTION_DEFINITIONS`,
be careful when overwriting so that the view is still usable.

The view contains the following columns:

1. `status` `STRING`
2. `last_synced_at` `TIMESTAMP_NTZ`

With the following statuses available:

* `PAUSED` when the connector is paused.
* `LAST_SYNCED` when at least one run ended with COMPLETED status.
* `SYNCING_DATA` when there is an enabled resource but no runs ended with COMPLETED status.
* `NOT_SYNCING` when no runs were started and all resources are disabled.
* `DISCONNECTED` this state is not supported yet.

## Related tables and views

Sync Status is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `observability/connector_stats.sql` (See [Connector stats reference](connector_stats_reference.md))
* `ingestion/ingestion_definitions_view.sql` (See [Resource definition and ingestion SQL reference](resource_definition_and_ingestion_processes_reference.md))

---
title: Task reactor
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/task_reactor.md
section: Native Apps Framework
---

# Task reactor

Library which provides common elements and features that are used in all Snowflake connectors.

## Requirements

The task reactor requires at least the following sql files to be executed during Native App installation:

* `task_reactor.sql` (See: [Task reactor SQL reference](../reference/task_reactor_reference.md))

## Overview

Task reactor is a separate module that provides an orchestration mechanism for work chunks stored inside a queue with a limited set of tasks.
Task reactors’ queue and dispatcher is based on
[Snowflake Streams](../../../../user-guide/streams-intro.md) with [Snowflake Tasks](../../../../user-guide/tasks-intro.md) and will be triggered
every one minute due to the
refresh time limitation. The task reactor will be active only when there is data in the input queue, to allow the warehouse to save some credits.

The task Reactor consists of three main components - queue, dispatcher and workers:

1. Your connector application adds QueueItems to the queue.
2. Every minute the dispatcher (a Snowflake task) fetches awaiting QueueItems from the queue and passes them to the workers.
3. Every minute the workers (Snowflake tasks) work in parallel on the assigned QueueItems.

Once the connector configuration is finalized, the task reactor configuration is limited to 3 steps:

1. Creating All Components of Task Reactor
2. Initializing Instance
3. (optional) Changing workers number

### Creating all Components of task reactor

To create an instance object, the user first has to create `worker`, `selector` and optionally `expired selector` implementations and then integrate them using
the [TASK_REACTOR.CREATE_INSTANCE_OBJECTS](../reference/task_reactor_reference.md) procedure.

#### Worker Implementation

The worker is responsible for performing a task assigned by the dispatcher, such as pulling and ingesting certain data.
The only mandatory part is to have a specific worker method that initiates the job. This method must be callable from the
Snowpark procedure, return a String and contain the following parameters:

* `session` - Snowpark session object
* `worker_id` - number, unique worker id
* `task_reactor_schema` - Schema name where task reactor objects are created. It can be used as a name of Task Reactor instance.

The worker is responsible for executing the task assigned by the dispatcher, e.g. pulling and
ingesting specific data. We recommend using the (`com.snowflake.connectors.sdk.taskreactor.worker.IngestionWorker`
and `com.snowflake.connectors.sdk.taskreactor.ingestion.Ingestion`) Java classes or for simpler tasks
(`com.snowflake.connectors.sdk.taskreactor.worker.SimpleTaskWorker` and `com.snowflake.connectors.sdk.taskreactor.ingestion.SimpleTask`),
however your worker can be created in any programming language supported for writing stored procedures handlers.

Example of Java worker method:

```java
public static String executeWork(Session session, int workerId, String taskReactorSchema) {
  FakeIngestion fakeIngestion = new FakeIngestion(session, workerId);
  WorkerId workerIdentifier = new WorkerId(workerId);
  Identifier schemaIdentifier = Identifier.fromWithAutoQuoting(taskReactorSchema);
  try {
    IngestionWorker.from(session, fakeIngestion, workerIdentifier, schemaIdentifier).run();
  } catch (WorkerException e) {
    // handle the exception...
    throw new RuntimeException(e);
  }
  return "Worker procedure executed.";
}
```

With an already created worker method, the user has to integrate it into `CONNECTOR.WORKER_PROCEDURE`. The procedure should call its
own worker method. It must be created in your application schema, return a STRING and contain the following parameters:

* `worker_id` - number
* `task_reactor_schema` - string

An example procedure, calling the Java implementation of the worker:

```sqlexample
CREATE OR REPLACE PROCEDURE CONNECTOR.WORKER_PROCEDURE(worker_id number, task_reactor_schema string)
  RETURNS STRING
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0', 'com.snowflake:telemetry:0.0.1')
  IMPORTS = ('@jars/myconnector-1.0.0.jar')
  HANDLER = 'com.snowflake.myconnector.WorkerImpl.executeWork';
```

The telemetry library is required to collect metrics which are logged to Event Table.

#### Selector Implementation

The selector’s job is to decide which queued tasks should be handled by the task reactor. Similar to the worker implementation -
it can be created in any language supported by Snowpark. The task selector can be implemented as a database procedure or a database view.
The selector (procedure or view) must be passed as an argument in the `TASK_REACTOR.CREATE_NEW_INSTANCE` procedure.

The procedure must be callable from a Snowpark procedure, return a string and contain the following parameters:

* `session` - Snowpark Session
* `queueItems` - String[] (an array of individual JSON Strings, each describing a single QueueItem)

Example of Java selector method:

```java
public static String selectWork(Session session, String[] queueItems) {
  Variant[] sorted =
    Arrays.stream(queueItems)
      .map(Variant::new)
      .filter(
        queueItem ->
          !queueItem.asMap().get("resourceId").asString().equals("filter-out-resource"))
      .sorted(comparing(queueItem -> queueItem.asMap().get("resourceId").asString()))
      .toArray(Variant[]::new);
  return new Variant(sorted).asJsonString();
}
```

Instead of the selector method, it is still possible to create a view that will filter and sort tasks from the existing queue.
The dispatcher can retrieve new tasks from the newly created view using an example query:

```sqlexample
CREATE VIEW CONNECTOR_SCHEMA.WORK_SELECTOR_VIEW AS SELECT * FROM TASK_REACTOR.QUEUE ORDER BY RESOURCE_ID;
```

With already created selector method, user has to integrate it into `CONNECTOR.WORK_SELECTOR`. The procedure should call
your obligatory work selector method. It must be created in your application schema, return an ARRAY, and contain the following parameter:

* `work_items - array`

An example procedure, calling the Java implementation of the work selector:

```sqlexample
CREATE OR REPLACE PROCEDURE CONNECTOR.WORK_SELECTOR(work_items array)
  RETURNS ARRAY
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('@jars/myconnector-1.0.0.jar')
  HANDLER = 'com.snowflake.myconnector.WorkSelector.selectWork';
```

#### Expired Selector Implementation

The expired selector’s job is to decide which queued items should be removed from the task reactor’s queue.
Items can be needed to be removed because the selector can never reach some items and these items would stay in the queue for ever.
Besides, some items that are waiting in the queue can be created long time before and it makes no sense to process them any more.
The expired selector can be implemented as a database view.
The selector view must be passed as an argument in the `TASK_REACTOR.CREATE_NEW_INSTANCE` procedure.
If there is no need to remove items from the queue, the default implementation can be used `TASK_REACTOR.EMPTY_EXPIRED_WORK_SELECTOR`.

Using the following query it is possible to create an expired selector view which selects the items that were created more than 3 days ago:

```sqlexample
CREATE VIEW CONNECTOR_SCHEMA.EXPIRED_WORK_SELECTOR_VIEW
  AS SELECT * FROM TASK_REACTOR.QUEUE q
  WHERE DATEDIFF(day, q.timestamp, sysdate()) > 3;
```

#### Integrate instance objects

The [TASK_REACTOR.CREATE_INSTANCE_OBJECTS](../reference/task_reactor_reference.md) lets user configure all instances together before initializing created instances.
The procedure can be executed only once per schema, so any future calls will not effect any changes. We recommend to put
initialization call to the `setup.sql` file, to prevent the procedure from being executed multiple times or not being called at all.

Required parameters:

* `instance_schema_name VARCHAR` - One per instance unique schema which stores database objects that the instance works on.
* `worker_procedure_name VARCHAR` - Name of the worker procedure described in the `Worker Implementation` part.
* `work_selector_type VARCHAR` - Values specifying whether new tasks should use view or procedure. Possible values: VIEW, PROCEDURE.
* `work_selector_name VARCHAR` - Name of the selector procedure/view described in the `Selector Implementation` part.

Optional parameters:

* `expired_work_selector_name VARCHAR` - Name of the expired selector view described in `Expired Selector Implementation` part. If the value is not provided, `TASK_REACTOR.EMPTY_EXPIRED_WORK_SELECTOR` is used as a default implementation which returns nothing.

### Initializing Instance

To initialize and run all configurations in task reactor user has to call `INITIALIZE_INSTANCE`.
The procedure takes the following parameters as input:

* `instance_schema_name` - (required) Name of schema which stores database objects that the instance works on.
* `warehouse_name` (required) Name of warehouse on which the instance will run.
* `dt_should_be_started` (optional) - default: `TRUE`. Dispatcher task should start when creating a new instance or not.
* `dt_task_schedule` (optional) - default: `1 MINUTE`. Frequency of running the dispatcher task.
* `dt_allow_overlapping_execution` (optional) - default: `FALSE`. Allows the DAG to run concurrently.
* `dt_user_task_timeout_ms` (optional) - the time limit on a single run of the task before it times out (in milliseconds).

> **Note:**
>
> If the worker procedure takes longer than the timeout set on the workers task
> ([USER_TASK_TIMEOUT_MS](../../../../sql-reference/parameters.md)), the procedure
> will abort with a timeout error. It is important to schedule tasks to not exceed the timeout of the
> Snowflake task.

After providing the minimum number of required parameters, the `Task Reactor` is initialized with the provided configuration
and dispatches workers using the `TASK_REACTOR.DISPATCHER` procedure.

### Setting Number of Workers

Number of workers can be changed manually by calling [TASK_REACTOR.SET_WORKERS_NUMBER](../reference/task_reactor_reference.md) procedure with following parameters:

* `WORKERS_NUMBER` - new number of workers.
* `TR_INSTANCE_SCHEMA_NAME` - name of instance schema

### Metrics

Task Reactor contains a metrics mechanism. It is based on
[Snowflake Trace Events](../../../logging-tracing/tracing.md).
The metrics are logged into the Event Table, so the Event Table has to be enabled in order to make metrics work.

Currently, the following metrics are introduced:

* `worker working time` (`TASK_REACTOR_WORKER_WORKING_TIME`) - It shows the time when a worker was actually processing resources. The timer starts when a worker task begins and ends when the worker task finishes.
* `worker idle time` (`TASK_REACTOR_WORKER_IDLE_TIME`) - It is the opposite to the `worker working time`. It shows the time when a worker was asleep: either waiting for a new work or waiting for the next schedule of its task. The timer begins when a worker finishes its task and ends when the worker task starts again.
* `worker item waiting time` (`TASK_REACTOR_WORK_ITEM_WAITING_IN_QUEUE_TIME`) - It shows the time of work item waiting in the dispatcher queue. The timer starts when a work item is inserted to the dispatcher queue and ends when the work item is removed from the dispatcher queue and is inserted to a worker queue.
* `worker item number in queue` (`TASK_REACTOR_WORK_ITEMS_NUMBER_IN_QUEUE`) - It shows the number of work items present in the dispatcher queue.
* `worker statuses` (`TASK_REACTOR_WORKER_STATUS`) - It shows the number of workers in each worker status and the total number of workers.

In order to see all logged metrics events, the following query can be used:

```sqlexample
SET EVENT_TABLE = 'TOOLS.PUBLIC.EVENTS';
SET APP_NAME = 'YOUR_APP_NAME';

SELECT
    event.record:name::string AS EVENT_NAME,
    span.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    span.record_attributes:worker_id AS WORKER_ID,
    event.record_attributes AS PAYLOAD
  FROM IDENTIFIER($EVENT_TABLE) event
  JOIN IDENTIFIER($EVENT_TABLE) span ON event.trace:span_id = span.trace:span_id AND event.record_type = 'SPAN_EVENT' AND span.record_type = 'SPAN'
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
  ORDER BY event.timestamp DESC;
```

In order to select only one type of metrics, add `event.record:name = <metric name>` to the `where` clause of the query.
The following queries can be used to load individual metrics:

Worker working time (`TASK_REACTOR_WORKER_WORKING_TIME`)

```sqlexample
SELECT
  event.record:name::string AS EVENT_NAME,
    span.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    span.record_attributes:worker_id AS WORKER_ID,
    event.record_attributes:value AS DURATION
  FROM IDENTIFIER($EVENT_TABLE) event
  JOIN IDENTIFIER($EVENT_TABLE) span ON event.trace:span_id = span.trace:span_id AND event.record_type = 'SPAN_EVENT' AND span.record_type = 'SPAN'
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
      AND event.record:name = 'TASK_REACTOR_WORKER_WORKING_TIME'
  ORDER BY event.timestamp DESC;
```

Worker idle time (`TASK_REACTOR_WORKER_IDLE_TIME`)

```sqlexample
SELECT
    event.record:name::string AS EVENT_NAME,
    span.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    span.record_attributes:worker_id AS WORKER_ID,
    event.record_attributes:value AS DURATION
  FROM IDENTIFIER($EVENT_TABLE) event
  JOIN IDENTIFIER($EVENT_TABLE) span ON event.trace:span_id = span.trace:span_id AND event.record_type = 'SPAN_EVENT' AND span.record_type = 'SPAN'
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
        AND event.record:name = 'TASK_REACTOR_WORKER_IDLE_TIME'
  ORDER BY event.timestamp DESC;
```

Worker item waiting time (`TASK_REACTOR_WORK_ITEM_WAITING_IN_QUEUE_TIME`)

```sqlexample
SELECT
    event.record:name::string AS EVENT_NAME,
    span.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    event.record_attributes:value AS DURATION,
    event.timestamp
  FROM IDENTIFIER($EVENT_TABLE) event
  JOIN IDENTIFIER($EVENT_TABLE) span ON event.trace:span_id = span.trace:span_id AND event.record_type = 'SPAN_EVENT' AND span.record_type = 'SPAN'
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
      AND event.record:name = 'TASK_REACTOR_WORK_ITEM_WAITING_IN_QUEUE_TIME'
  ORDER BY event.timestamp DESC;
```

Worker item number in queue (`TASK_REACTOR_WORK_ITEMS_NUMBER_IN_QUEUE`)

```sqlexample
SELECT
    event.record:name::string AS EVENT_NAME,
    event.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    event.record_attributes:value AS WORK_ITEMS_NUMBER,
    event.timestamp
  FROM IDENTIFIER($EVENT_TABLE) event
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
      AND event.record:name = 'TASK_REACTOR_WORK_ITEMS_NUMBER_IN_QUEUE'
  ORDER BY event.timestamp DESC;
```

Worker statuses (`TASK_REACTOR_WORKER_STATUS`)

```sqlexample
SELECT
    event.record:name::string AS EVENT_NAME,
    span.record_attributes:task_reactor_instance::string AS INSTANCE_NAME,
    event.record_attributes:TOTAL AS WORKERS_TOTAL,
    IFNULL(event.record_attributes:AVAILABLE, 0) AS WORKERS_AVAILABLE,
    IFNULL(event.record_attributes:WORK_ASSIGNED, 0) AS WORKERS_WORK_ASSIGNED,
    IFNULL(event.record_attributes:IN_PROGRESS, 0) AS WORKERS_IN_PROGRESS,
    IFNULL(event.record_attributes:SCHEDULED_FOR_CANCELLATION, 0) AS WORKERS_SCHEDULED_FOR_CANCELLATION,
    event.timestamp
  FROM IDENTIFIER($EVENT_TABLE) event
  JOIN IDENTIFIER($EVENT_TABLE) span ON event.trace:span_id = span.trace:span_id AND event.record_type = 'SPAN_EVENT' AND span.record_type = 'SPAN'
  WHERE
    event.resource_attributes:"snow.database.name" = $APP_NAME
      AND event.record:name = 'TASK_REACTOR_WORKER_STATUS'
  ORDER BY event.timestamp DESC;
```

---
title: Task reactor SQL reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/task_reactor_reference.md
section: Native Apps Framework
---

# Task reactor SQL reference

## Database objects and procedures

The following database objects are created through the file `task_reactor.sql`.

### TASK_REACTOR SCHEMA

Versioned schema containing some database object of task reactor in the connector.

### TASK_REACTOR_INSTANCES SCHEMA

Non-Versioned schema containing some instance database object of task reactor in the connector.

### TASK_REACTOR_INSTANCES.INSTANCE_REGISTRY

This table is created to store the data about Task Reactor instances in order to give the ability to track and manage
existing instances during the application runtime. The table is created in the `TASK_REACTOR_INSTANCES` schema.

* `instance_name` `VARCHAR`
* `is_initialized` `BOOLEAN`
* `is_active` `BOOLEAN`

### TASK_REACTOR.DISPATCHER(INSTANCE_SCHEMA_NAME VARCHAR)

This procedure invokes the Java `DispatcherHandler.dispatchWorkItems` and allows to dispatch work items.

### TASK_REACTOR.SET_WORKERS_NUMBER (WORKERS_NUMBER NUMBER, INSTANCE_SCHEMA_NAME VARCHAR)

This procedure invokes the Java `SetWorkersNumberHandler.setWorkersNumber` and allows to set number of workers.

### TASK_REACTOR.CREATE_INSTANCE_OBJECTS

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `WORKER_PROCEDURE_NAME` `VARCHAR`
* `WORK_SELECTOR_TYPE` `VARCHAR`
* `WORK_SELECTOR_NAME` `VARCHAR`
* `EXPIRED_WORK_SELECTOR_NAME` `VARCHAR`

Procedure creates all of instance objects required for accurate `Task reactor` flow and validates the ones who should
not be already initialized. At the end of process it insert new instance registry record to the table.

Possible errors include:

* `INSTANCE_NOT_FOUND` - Instance with this name does not exists.
* `INSTANCE_ALREADY_INITIALIZED` - Instance with this name is already initialized.
* `DEFAULT_PROCEDURE_VALIDATION_EXCEPTION` - Procedure not found.
* `SCHEMA_WITH_THE_SAME_NAME_ALREADY_EXISTS` - Schema with the same name already exists.
* `CREATING_TR_INSTANCE_EXCEPTION` - Something unexpected went wrong while creating a new instance of task reactor. No instance has been created.

### TASK_REACTOR.INITIALIZE_INSTANCE

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `WAREHOUSE_NAME` `VARCHAR`
* `DT_SHOULD_BE_STARTED` `BOOLEAN`
* `DT_TASK_SCHEDULE` `VARCHAR`
* `DT_ALLOW_OVERLAPPING_EXECUTION` `BOOLEAN`
* `DT_USER_TASK_TIMEOUT_MS` `VARCHAR`

Procedure starts all non initialized instances within the same database instance. It consist of checking instance exists,
or whether it is not already initialized and then creates dispatcher tasks and starts this task if was required.

Procedure ends successfully with:

```json
{
    "response_code": "OK",
    "message": "Instance has been initialized successfully."
}
```

Possible errors include:

* `INSTANCE_NOT_FOUND` - Instance does not exist.
* `INSTANCE_ALREADY_INITIALIZED` - Instance with this name is already initialized.

### TASK_REACTOR.PAUSE_INSTANCE

Input parameters:

* `INSTANCE_SCHEMA` `VARCHAR`

Procedure starts the process of pausing a given instance of Task Reactor and returns OK response.
It starts a job which asynchronously stops all worker tasks and the dispatcher task.
In case when a worker task was already performing an ingestion, the task is not being stopped right away, but the task will be stopped after the ingestion is finished.

> **Note:**
>
> The logic of this procedure is already used in [Pause Connector](pause_connector_reference.md), so it’s not needed to use this procedure as a part of stopping the whole connector.

Procedure ends successfully with:

```json
{
    "response_code": "OK"
}
```

### TASK_REACTOR.RESUME_INSTANCE()

Input parameters:

* `INSTANCE_SCHEMA` `VARCHAR`

Procedure starts the process of resuming a given instance of Task Reactor and returns OK response.
It resumes the dispatcher task and starts a job which asynchronously resumes all worker tasks that have already assigned work.

> **Note:**
>
> The logic of this procedure is already used in [Resume Connector](resume_connector_reference.md), so it’s not needed to use this procedure as a part of resuming the whole connector.

Procedure ends successfully with:

```json
{
    "response_code": "OK"
}
```

### TASK_REACTOR.REMOVE_INSTANCE()

Input parameters:

* `INSTANCE_SCHEMA` `VARCHAR`

Removes a given instance of Task Reactor from the instance registry and returns OK response.
If no instances exist with the provided name - no action is performed.

Procedure ends successfully with:

```json
{
    "response_code": "OK"
}
```

### TASK_REACTOR.UPDATE_WAREHOUSE_INSTANCE

Input parameters:

* `WAREHOUSE_NAME` `VARCHAR`
* `INSTANCE_SCHEMA` `VARCHAR`

Procedure starts the process of changing the warehouse for a given instance of Task Reactor.
It changes the warehouse of the dispatcher task and then starts a job which asynchronously changes the warehouse of all worker tasks.

> **Note:**
>
> The logic of this procedure is already used in [Update Warehouse](update_warehouse_reference.md), so it’s not needed to use this procedure as a part of updating the warehouse for the whole connector.

Procedure ends successfully with:

```json
{
    "response_code": "OK"
}
```

Possible errors include:

* `INSTANCE_NOT_FOUND` - Given instance does not exist.
* `TASK_REACTOR_INSTANCE_IS_ACTIVE` - Given Task Reactor instance has not been paused before using this procedure.

### Internal procedures

All of below procedures are used only for internal use in `task_reactor` setup script and should not be used externally.

#### TASK_REACTOR.CREATE_INSTANCE_SCHEMA (INSTANCE_SCHEMA_NAME VARCHAR)

This procedure creates new schema with identifier named `instance_schema_name`, and then throws
a new exception if the schema could not be created.

Possible errors include:

* `SCHEMA_WITH_THE_SAME_NAME_ALREADY_EXISTS` - Schema with the same name already exists.

#### TASK_REACTOR.VALIDATE_PROCEDURE_EXISTENCE

Input parameters:

* `PROCEDURE_NAME` `VARCHAR`
* `PROCEDURE_TYPE` `VARCHAR`

This procedure validates whether defined procedures does not exists and then throws new exception.

Possible errors include:

* `WORKER_PROCEDURE_NOT_FOUND_EXCEPTION` - Worker procedure not found.
* `WORK_SELECTOR_PROCEDURE_NOT_FOUND_EXCEPTION` - Work selector procedure not found.
* `DEFAULT_PROCEDURE_VALIDATION_EXCEPTION` - Procedure not found.

#### TASK_REACTOR.CREATE_QUEUE

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `TABLE_NAME` `VARCHAR`
* `STREAM_NAME` `VARCHAR`

The helper method for `Task Reactor`, it offers creating queue table with the name `instance_schema_name.table_name`
and the following columns:

* `ID` `STRING`
* `RESOURCE_ID` `STRING`
* `DISPATCHER_OPTIONS` `VARIANT`
* `WORKER_PAYLOAD` `VARIANT`
* `TIMESTAMP` `DATETIME`

Then it creates stream with name `instance_schema_name.stream_name` if it does not exist yet.

#### TASK_REACTOR.CREATE_WORKER_REGISTRY_SEQUENCE

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `SEQUENCE_NAME` `VARCHAR`

The helper method for `Task Reactor`, which offers creating sequence for worker registry with the
`instance_schema_name.sequence_name` sequence name.

#### TASK_REACTOR.CREATE_WORKER_REGISTRY

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `TABLE_NAME` `VARCHAR`
* `SEQUENCE_NAME` `VARCHAR`

The helper method for `Task Reactor`, which offers creating worker registries consists of a table with the name
`instance_schema_name.table_name` and columns:

* `WORKER_ID NUMBER` with default `instance_schema_name.sequence_name` sequence
* `CREATED_AT` `DATETIME`
* `UPDATED_AT` `DATETIME`
* `STATUS` `STRING`

#### TASK_REACTOR.CREATE_WORKER_STATUS_TABLE

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `TABLE_NAME` `VARCHAR`

A helper method for `Task Reactor`, which offers creating a status table for a worker with the name `instance_schema_name.table_name` and columns:

* `WORKER_ID` `NUMBER`
* `TIMESTAMP` `DATETIME`
* `STATUS` `STRING`

#### TASK_REACTOR.CREATE_CONFIG_TABLE

Input parameters:

* `INSTANCE_SCHEMA_NAME` `VARCHAR`
* `TABLE_NAME` `VARCHAR`
* `WORKER_PROCEDURE_NAME` `VARCHAR`
* `WORK_SELECTOR_TYPE` `VARCHAR`
* `WORK_SELECTOR_NAME` `VARCHAR`
* `EXPIRED_WORK_SELECTOR_NAME` `VARCHAR`
* `IS_INSTANCE_REGISTERED` `BOOLEAN`

The helper method for `Task Reactor`, offers to create a configuration table named `instance_schema_name.table_name`
with key and value columns. It then inserts the configuration data into the table if it is not already registered with the following values:

* `WORKER_PROCEDURE`
* `WORK_SELECTOR_TYPE`
* `WORK_SELECTOR`
* `SCHEMA`

### Related features

Other related features:

* `Scheduler`
* `Ingestion`

### Related Java objects

Java implementations and related classes:

* `Dispatcher`
* `SetWorkersNumber`
* `Worker`

---
title: Testing native connectors
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/testing.md
section: Native Apps Framework
---

# Testing native connectors

Native SDK for connectors uses tests on 3 different levels:

* unit tests
* integration tests
* application tests

These unit tests are not different from unit tests for any other application. Parts of the system are mocked up to easily manipulate their returned values.
The `connectors-native-sdk-test` library, which as a library bundled with the SDK provides useful `InMemory` mockups of some classes responsible for communication with Snowflake.
It enables the developer to write and run unit tests on the local environment without a need to use any Snowflake connection.

Integration tests require a connection to Snowflake and run queries against Snowflake objects. However, the used database objects are stand-alone in these kinds of tests.
Keep in mind some Snowflake features may be unavailable in the context of Native Apps.

Application tests are tests that run on a native app deployed into Snowflake.
These kinds of tests are quite time-consuming and might be costly, so the recommendation is to test only some main scenarios this way.

Overall, the recommendation is to test as many cases as possible using unit tests and minimize the number of integration and application tests to the most critical paths.

## Testing library

As mentioned above the Native SDK has a `connectors-native-sdk-test` library, which provides various features useful in tests. The main features are:

* custom assertions
* in-memory implementations
* test builders

### Assertions

Assertions provided in the library are based on the [AssertJ fluent assertions](https://assertj.github.io/doc/).
All the assertions provided have a fabrication method implemented inside the `NativeSdkAssertions` class, furthermore,
this class inherits all of the original AssertJ fabrication methods, so only one import is needed to use both custom and base assertions.

The list of provided assertions:

* `TestConfigAssert`
* `IngestionProcessAssert`
* `IngestionRunAssert`
* `ConnectorResponseAssert`
* `FullConnectorStatusAssert`
* `ResourceIngestionDefinitionAssert`
* `ResponseMapAssert`
* `TestStateAssert`
* `TaskAssert`
* `TaskPropertiesAssert`
* `VariantAssert`

### Mockups

The mockups used inside the library are using an in-memory map under the hood to mock up the data stored inside database table.
They can be used like in the example below:

```java
var customResourceRepository = new InMemoryDefaultResourceIngestionDefinitionRepository();
var key = "test_key";

var resource = createResource(key);
customResourceRepository.save(resource);

var result = customResourceRepository.fetch(key);
```

```java
var table = new InMemoryDefaultKeyValueTable();
var repository = new DefaultConfigurationRepository(table);
var connectorService = new DefaultConnectorConfigurationService(repository);
```

List of provided in memory objects:

* `InMemoryResourceIngestionDefinitionRepository`
* `InMemoryIngestionProcessRepository`
* `InMemoryAppendOnlyKeyValueTable`
* `InMemoryDefaultKeyValueTable`
* `InMemoryReadOnlyKeyValueTable`
* `InMemoryConnectorErrorHelper`
* `InMemoryTaskRef`
* `InMemoryTaskRepository`

List of provided in memory task reactor objects:

* `InMemoryCommandsQueueRepository`
* `InMemoryConfigRepository`
* `InMemoryWorkSelector`
* `InMemoryExpiredWorkSelector`
* `InMemoryWorkItemQueue`
* `InMemoryInstanceRegistryRepository`
* `InMemoryWorkerQueue`
* `InMemoryWorkerQueueManager`
* `InMemoryWorkerRegistry`
* `InMemoryWorkerStatusRepository`
* `InMemoryWorkerCombinedView`
* `InMemoryConfiguredTaskReactorExistenceVerifier`
* `InMemoryNotConfiguredTaskReactorExistenceVerifier`
* `InMemoryTaskReactorInstanceComponentProvider`

### Test builders

Test builders are helper objects similar to the `Builders` used when customizing SDK components.
However, they expose all of the internal services to override.
For more information on using `Builders` check customization documentation.

```java
new ConfigureConnectorHandlerTestBuilder()
                .withErrorHelper(mock(ConnectorErrorHelper.class))
                .withInputValidator(mock(ConfigureConnectorInputValidator.class))
                .withCallback(mock(ConfigureConnectorCallback.class))
                .withConfigurationService(mock(ConnectorConfigurationService.class))
                .withStatusService(mock(ConnectorStatusService.class))
                .build();
```

---
title: The Snowflake Native SDK for Connectors reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/overview.md
section: Native Apps Framework
---

# The Snowflake Native SDK for Connectors reference

Detailed description each of the components of the SDK.

* [App config SQL reference](app_config_reference.md)
* [Connection configuration reference](connection_configuration_reference.md)
* [Connector configuration reference](connector_configuration_reference.md)
* [Connector stats reference](connector_stats_reference.md)
* [Core SQL reference](core_reference.md)
* [Create resource reference](create_resource_reference.md)
* [Disable resource reference](disable_resource_reference.md)
* [Enable resource reference](enable_resource_reference.md)
* [Finalize configuration reference](finalize_configuration_reference.md)
* [Pause connector reference](pause_connector_reference.md)
* [Prerequisites SQL Reference](prerequisites_reference.md)
* [Reset configuration reference](reset_configuration_reference.md)
* [Resource definition and ingestion SQL reference](resource_definition_and_ingestion_processes_reference.md)
* [Resume connector reference](resume_connector_reference.md)
* [Ingestion scheduler reference](scheduler_reference.md)
* [External integration setup reference](setup_external_integration.md)
* [Sync status reference](sync_status_reference.md)
* [Task reactor SQL reference](task_reactor_reference.md)
* [Update connection configuration reference](update_connection_configuration_reference.md)
* [Update resource reference](update_resource_reference.md)
* [Update warehouse reference](update_warehouse_reference.md)

---
title: Troubleshooting
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/using/troubleshooting.md
section: Native Apps Framework
---

# Troubleshooting

This topic provides guidelines for troubleshooting issues with the Snowflake Native SDK for Connectors. If you want to
discover the cause of a specific error, you need appropriate tools
that make the troubleshooting process easier. For that reason the Snowflake Native SDK for Connectors provides a couple of
procedures, views, and other methods to troubleshoot the connector effectively.

## Procedure responses

Usually when something wrong happens with the connector, or even when the user cannot
execute the particular procedure successfully because of the state of the Connector, the first source
of the troubleshooting data should be the procedure response. In the Snowflake Native SDK for Connectors, the error response
from the procedure is standardized. The response is returned as a `VARIANT` with
two fields, that are always present:

* `response_code` - the value of this field, in case of an error response, is an error code, e.g. `INVALID_CONNECTOR_STATUS`
* `message` - the value of this field is a message that provides more information regarding the occurred error

Structure of an error response:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

## Database objects

The Snowflake Native SDK for Connectors provides a couple of views and procedures that help in checking the actual state of the Connector.

Views:

* `PUBLIC.CONNECTOR_CONFIGURATION` (See [Connection configuration reference](../reference/connection_configuration_reference.md))
* `PUBLIC.SYNC_STATUS` (See [Sync status reference](../reference/sync_status_reference.md))
* `PUBLIC.CONNECTOR_STATS` (See [Connector stats reference](../reference/connector_stats_reference.md))
* `PUBLIC.AGGREGATED_CONNECTOR_STATS` (See [Connector stats reference](../reference/connector_stats_reference.md))

Procedures:

* `PUBLIC.GET_CONNECTOR_STATUS()` (See [Core SQL reference](../reference/core_reference.md))
* `PUBLIC.GET_CONNECTION_CONFIGURATION()` (See [Connection configuration reference](../reference/connection_configuration_reference.md))

## Event table

By default, procedures provided by the SDK that are implemented in Java use
`ConnectorErrorHelper` to wrap thrown exceptions during the particular procedure execution. Apart from
wrapping the thrown exception and mapping it to `ConnectorResponse` with an error
code, the default implementation of `ConnectorErrorHelper` logs events basing on the thrown
exceptions to the Event Table if it’s set up in the customer’s account. If you
want to learn more about using Event Table, see the
[Logging, tracing, and metrics](../../../logging-tracing/logging-tracing-overview.md).

There is a possibility to filter out logs generated by the Snowflake Native SDK for Connectors from the event table
knowing only the application instance name. In addition, there is also a possibility
to filter out logs of errors that occurred in the most common parts of the Connector.

Example query:

```sqlexample
SELECT * FROM PLATFORM_CI_TOOLS.PUBLIC.EVENTS
    WHERE RESOURCE_ATTRIBUTES:"snow.database.name" LIKE '<INSTANCE_NAME>'
    [AND SCOPE:"name" LIKE '<ERROR_CODE>']
    [ORDER BY timestamp DESC];
```

Possible error codes:

* `CONFIGURE_CONNECTOR_FAILED` - occurs when Configure Connector Wizard step failed
* `SET_CONNECTION_CONFIGURATION_FAILED` - occurs when Set Connection Configuration Wizard step failed
* `FINALIZE_CONNECTOR_CONFIGURATION_FAILED` - occurs when Finalize Connector Configuration Wizard step failed
* `PAUSE_CONNECTOR_FAILED` - occurs when Pause Connector process failed
* `RESUME_CONNECTOR_FAILED` - occurs when Resume Connector process failed

## Connector state recovery

If an uncaught error occurs during pause or resume process, which causes the connector to not
rollback the process and fail unexpectedly - an `ERROR` connector status may be set. Also, if
for any reason these processes get suddenly terminated - the connector may be ‘stuck’ in a
`STARTING` or `PAUSING` connector status.

Such problems should be diagnosed using the above mentioned methods and a suitable repair should be
attempted. In many cases a complete reinstallation of the connector may be required, but if the
repair was successful and the only remaining issue is the ‘stuck’ connector status - there may a
couple possible solutions:

* If the connector is ‘stuck’ in the `STARTING` status - another call of the `PUBLIC.RESUME_CONNECTOR()`
  procedure may fix the status issue
* If the connector is ‘stuck’ in the `PAUSING` status - another call of the `PUBLIC.PAUSE_CONNECTOR()`
  procedure may fix the status issue
* If the aforementioned methods failed, or the connector is in the `ERROR` status - the
  `PUBLIC.RECOVER_CONNECTOR_STATE(STRING)` procedure may be used to force the status change (see
  [Core SQL reference](../reference/core_reference.md)). It is advised to force the change into `PAUSED` status and
  try restarting the connector using the `PUBLIC.RESUME_CONNECTOR()` procedure

---
title: Tutorial 1: Create a basic Snowflake Native App
source: https://docs.snowflake.com/en/developer-guide/native-apps/tutorials/getting-started-tutorial.md
section: Native Apps Framework
---

App Development

# Tutorial 1: Create a basic Snowflake Native App

## Introduction

The Snowflake Native App Framework allows providers to build, sell, and distribute a Snowflake Native App within the
Snowflake Data Cloud. Providers can create apps that leverage core Snowflake functionality to share data
and application logic with consumers. The logic of Snowflake Native App can include features such as stored procedures,
and user-defined functions (UDFs). Providers can share their applications with consumers through listings in the
Snowflake Marketplace or through private listings.

This tutorial describes how to use the Snowflake Native App Framework to create a basic Snowflake Native App to share data and related business logic
with other Snowflake accounts.

> **Note:**
>
> The tutorial uses both Snowflake CLI and the Snowsight web interface.

### What you learn in this tutorial

In this tutorial, you learn how to:

* Create an application package that contains the data and business logic of your app.
* Share data with an application package.
* Add business logic to an application package.
* Test the app locally.
* View and test the app in Snowsight.
* Publish your app by creating a private listing.
* Install the app from a private listing.
* Use Snowflake CLI to perform many of the steps above.

### About providers and consumers

Within the context of the Snowflake Native App Framework, providers are the roles and
organizations who have data and business logic that they want to share with
other Snowflake users, who are the consumers. A consumer can be another account
within your organization, a different organization within your company, or a
Snowflake user in another company.

Within the context of this tutorial, most of the tasks you perform are
those typically performed by providers, but these include tasks that may be performed
by multiple roles within your organization including application developers and database
administrators.

In this tutorial, you also perform tasks that mimic the actions performed
by consumers to install an app.

### Prerequisites

* You must have [Snowflake CLI](../../snowflake-cli/index.md) version 3.0.0 or greater installed on your machine.
* You must run all of the SQL commands in the same SQL command session because the session
  context is required.

  To do this in Snowsight, for example, paste all of your code into the same worksheet as
  you go along. As you progress from section to section, each section builds on the previous.
* You must be able to use the ACCOUNTADMIN role to perform the following tasks:

  > + Create the role used in this tutorial, which is the `tutorial1_role` role.
  > + Grant the required privileges to the `tutorial1_role` role.
  > + Create a listing for your app.

  In this tutorial, you perform the steps to create your basic Snowflake Native App by using the `tutorial1_role` role. In general
  practice, however, you would use roles with privileges specifically defined for the action you’re performing.
  For example, you might have separate roles for the following users:

  > + Developers who create UDFs and stored procedures
  > + Database administrators who manage roles and permissions
  > + Administrators who [manage listings](../../../collaboration/collaboration-listings-about.md)
  >   using Snowflake Collaboration
* To install your app from a private listing, you must have access to a second Snowflake account.
  You use this account to mimic how consumers would install an app.

  > **Note:**
  >
  > Although the Snowflake Native App Framework supports sharing apps with accounts in different
  > organizations, for the purposes of this tutorial, both accounts must be in the same organization.
* You must set a current warehouse. See [USE WAREHOUSE](../../../sql-reference/sql/use-warehouse.md).

## Set up a role for this tutorial

To create and set up the `tutorial1_role` role, follow these steps:

1. Create the `tutorial1_role` role:

   ```sqlexample
   CREATE ROLE tutorial1_role;
   ```
2. Grant the `tutorial1_role` to the Snowflake user who performs the tutorial:

   ```sqlexample
   GRANT ROLE tutorial1_role TO USER <user_name>;
   ```

   Where:

   > `user_name`
   > :   Specifies the name of the user who performs the tutorial.
3. Grant the privileges required to create a basic Snowflake Native App and Snowflake objects:

   ```sqlexample
   GRANT ALL PRIVILEGES ON warehouse <warehouse_name> TO ROLE tutorial1_role;
   GRANT CREATE APPLICATION PACKAGE ON ACCOUNT TO ROLE tutorial1_role;
   GRANT CREATE APPLICATION ON ACCOUNT TO ROLE tutorial1_role;
   ```

   Where:

   > `warehouse_name`
   > :   Specifies the name of the warehouse that is currently set.

After performing the tasks in this section, the user that has the `tutorial1_role` role granted
to their account has the permissions to create all of the Snowflake objects required to
create a basic Snowflake Native App.

In this section, you set up the `tutorial1_role` role, which you’ll use in this tutorial. In the next section, you’ll create a Snowflake
CLI connection for the tutorial.

## Create a Snowflake CLI connection for the tutorial

To run the Snowflake CLI commands in this tutorial, you must setup a Snowflake CLI connection for the tutorial.

To create a connection:

1. From the terminal, run the following command:

   ```snowcli
   snow connection add
   ```
2. Enter `tut1-connection` for the name of the connection.
3. Enter additional information for the Snowflake CLI connection.

   The specific values you use depend on your Snowflake account. However, you must use the following
   values for the role and warehouse properties:

   | Parameter | Required value |
   | --- | --- |
   | Role for the connection | tutorial1_role |
   | Warehouse for the connection | Specify the name of any warehouse that you have access to. |
4. Verify the connection by running the following command:

   ```snowcli
   snow connection test -c tut1-connection
   ```

   The output of this command should look similar to the following:

   ```output
   +----------------------------------------------------------------------------------+
   | key             | value                                                          |
   |-----------------+----------------------------------------------------------------|
   | Connection name | tut1-connection                                                |
   | Status          | OK                                                             |
   | Host            | USER_ACCOUNT.snowflakecomputing.com                            |
   | Account         | USER_ACCOUNT                                                   |
   | User            | tutorial_user                                                  |
   | Role            | TUTORIAL1_ROLE                                                 |
   | Database        | not set                                                        |
   | Warehouse       | WAREHOUSE_NAME                                                 |
   +----------------------------------------------------------------------------------+
   ```

> **Caution:**
>
> If you do not create the `tut1-connection` connection, you must use a connection that
> specifies the correct values for the role, database, and warehouse connection properties.

In this section, you set up a Snowflake CLI connection for the tutorial. In the next section, you’ll create the application files.

## Create the application files

In this section, you create a setup script, a manifest file and a project definition file.
The first two of these files are required by the Snowflake Native App Framework.

Setup script
:   An SQL script that runs automatically when a consumer installs an app in
    their account.

Manifest file
:   A YAML file that contains basic configuration information about the app.

Project definition file
:   A YAML file that contains information about the Snowflake objects that you want to create.

You learn more about these files, and their contents, throughout this tutorial. You
also create a readme file that is useful when viewing and publishing your app in
later sections of this tutorial.

### Initialize a new project folder

You use Snowflake CLI to initialize a new Snowflake Native App project in your local filesystem.

To do this:

1. Execute the following command:

   ```snowcli
   snow init --template app_basic tutorial
   ```
2. Enter a value for the project identifier.

   This value is used as a base name for the entities that snow app commands will generate. For example, if you enter `foo`, the application
   package is `foo_pkg` and the application entity is `foo`. However, in this getting started tutorial, you will replace the contents of
   the project definition file (snowflake.yml), which overrides the value that you specify for the project identifier.

This command creates a folder named `tutorial` inside the current working directory and
populates it with a basic Snowflake Native App project based on a basic template. This is
the root directory for all of your application files.

> **Note:**
>
> You modify and add files and subfolders to this folder in later sections.

> **Note:**
>
> There are other templates available to help you quickly get up-and-running with the
> Snowflake Native App Framework. Please consult `snow init --help` for more information.

### Create the setup script

Modify or replace the contents of the `app/setup_script.sql` file as shown in the following
example:

```sqlexample
-- Setup script for the Hello Snowflake! app.
```

This line is a placeholder because the setup script cannot be empty.

> **Note:**
>
> This tutorial refers a particular structure and filename for the setup
> script. However, when building your own app you can choose your
> own name and directory structure for this file.

### Create a README file for your app

A readme file provides a description of what your application does. You see the
readme when you view your app in Snowsight.

Modify or replace the contents of `app/README.md` with the following:

```text
This is the readme file for the Hello Snowflake app.
```

### Create the manifest file

The Snowflake Native App Framework requires a manifest file for each app. The manifest file
contains metadata and configuration parameters for an app and influences the
run-time behavior of your app.

> **Note:**
>
> This file must be named `manifest.yml`. Paths to other files,
> including the setup script, are relative to the location of this file.

Modify or replace the contents of the `app/manifest.yml` with the following:

```yaml
manifest_version: 1
artifacts:
   setup_script: setup_script.sql
   readme: README.md
```

The `setup_script` property specifies the location of the setup script
relative to the location of the manifest file. The path and file name
specified here must be the same as the relative location of the setup
script you modified above. The `readme` property follows the same rules.

> **Note:**
>
> The `manifest_version`, `artifacts`, and `setup_script` properties are required.
> The `readme` property is optional.

### Create the project definition file

Snowflake CLI uses a project definition file to describe objects that can be deployed to Snowflake.
This file must be named `snowflake.yml`. This file controls the name of the deployed application
package and object, as well as which files are uploaded to the project stage.

> **Note:**
>
> This file must be named `snowflake.yml` and it must exist at the
> root level of your project. Paths to other files, such as the
> manifest file and the setup script, are relative to the location of this file.

Modify or replace the contents of the `snowflake.yml` with the following:

```yaml
definition_version: 2
entities:
   hello_snowflake_package:
      type: application package
      stage: stage_content.hello_snowflake_stage
      manifest: app/manifest.yml
      identifier: hello_snowflake_package
      artifacts:
         - src: app/*
           dest: ./
   hello_snowflake_app:
      type: application
      from:
         target: hello_snowflake_package
      debug: false
```

The next section of this tutorial describes how to use each of these properties.

### Review what you learned in this section

After performing the steps in this section, you should now have a directory structure that
looks like the following:

```text
/tutorial
  snowflake.yml
  README.md
  /app/
    manifest.yml
    README.md
    setup_script.sql
```

In this section you learned how to create the setup script and manifest files that are required
by the Snowflake Native App Framework and the project definition file that is required by the Snowflake CLI.

Although the content you added to both the setup script and manifest file is basic, all apps
must have these files.

You also added a readme file that is displayed when viewing your app in Snowsight
or when publishing your app as a listing.

## Understanding the project definition file

In this section you learn about the contents of the
[project definition](../../snowflake-cli/native-apps/project-definitions.md) file (`snowflake.yml`) you created
in the previous section. You also perform additional setup tasks for your provider account. The project definition file (`snowflake.yml`)
defines the names of objects that are created in your Snowflake account:

* The application package (`hello_snowflake_package`)
* The application object (`hello_snowflake_app`) that is created from the application package
* The stage that holds application files (`stage_content.hello_snowflake_stage`)

At its core, an application package is a Snowflake database that is extended to include additional
information about an app. In that sense, it is a container for an app that includes:

* Shared data content
* Application files

Note that the name of the stage is specified as a schema-qualified name. This schema is created inside
the application package. This named stage is used to store the files required by the
Snowflake Native App Framework. This stage must include any files you want available to the setup script of your app setup
script or at runtime.

There is also a section called `artifacts` in the project definition file which
is a list of rules that specify which files are copied to the named stage.

The rule specifies that anything in the `app/` subfolder is copied to the root of the stage. This
means the following:

* `tutorial/app/manifest.yml` is uploaded to the root of `@hello_snowflake_package.stage_content.hello_snowflake_stage`.
* `tutorial/app/README.md` is uploaded to the root of `@hello_snowflake_package.stage_content.hello_snowflake_stage`.
* `tutorial/app/setup_script.sql` is uploaded to the root of `@hello_snowflake_package.stage_content.hello_snowflake_stage`.

You are not yet creating the application package or executing any SQL commands that perform
these tasks. In a later section, you run the Snowflake CLI command to perform these tasks.

Finally, you set `debug: false` inside of the app definition. For applications deployed using
the Snowflake CLI, debug mode is enabled by default.

In this section you learned that an application package is a container for the resources used by an
app. You also learned the how to set the fields in the project definition file.

## Add application logic and install your first app

In this section, you add code to the application package and install your first app. To do
this, you perform the following tasks:

* Add a stored procedure to the setup script.
* Install and test the app in stage dev mode.

### Add a stored procedure to the setup script

In this section, you add a stored procedure to the app by adding the code for the
stored procedure to the setup script on your local file system.

To add a stored procedure to the setup script:

1. Add the following SQL statements at the end of the `setup_script.sql` file that you created
   in an earlier section of this tutorial:

   ```sqlexample
   CREATE APPLICATION ROLE IF NOT EXISTS app_public;
   CREATE SCHEMA IF NOT EXISTS core;
   GRANT USAGE ON SCHEMA core TO APPLICATION ROLE app_public;
   ```

   When the setup script runs during app installation, these statements create an application
   role named `app_public`. Application roles are similar to database roles, but they can
   only be used within the context of an app. They are used to grant access to objects
   within the application object that is created in the consumer account.

   This example also creates a schema to contain the stored procedure and grants the USAGE
   privilege on the schema to the application role. Creating an application role and granting
   privileges on an object, for example a schema, to the application role is a common pattern
   within the setup script.
2. Add the code for the stored procedure at the end of the `setup_script.sql` file:

   ```sqlexample
   CREATE OR REPLACE PROCEDURE CORE.HELLO()
     RETURNS STRING
     LANGUAGE SQL
     EXECUTE AS OWNER
     AS
     BEGIN
       RETURN 'Hello Snowflake!';
     END;
   ```

   This example creates a stored procedure that outputs the string “Hello Snowflake!”.
3. Add the following statement to the end of the `setup_script.sql` file:

   ```sqlexample
   GRANT USAGE ON PROCEDURE core.hello() TO APPLICATION ROLE app_public;
   ```

   This example grants the USAGE privilege on the stored procedure to the application role.

In this section you added a stored procedure to the setup script. You also created an
application role and granted the USAGE privilege to this role. This allows the setup script
to create the stored procedure when the app is installed. It also gives the app
permission to run the stored procedure.

### Install and test the app in stage development mode

You are now ready to create the application package, the app and all the other entities you
specified in the project definition file.

To perform these tasks:

1. In a terminal, change to the `tutorial` folder.
2. Run the following Snowflake CLI command:

   ```snowcli
   snow app run -c tut1-connection
   ```

This command performs the following tasks:

1. Create an application package name `hello_snowflake_package` with schema `stage_content` and
   stage `hello_snowflake_stage`.
2. Upload all required files to the named stage.
3. Create or upgrade the app `hello_snowflake_app` using files from this stage.

If the command runs successfully, it outputs a URL where you can see your app in
Snowsight.

To run the `HELLO` stored procedure that you added to `setup_script.sql`
in a previous section, run the following Snowflake CLI command:

```snowcli
snow sql -q "call hello_snowflake_app.core.hello()" -c tut1-connection
```

You should see the following output after running this command:

```text
+------------------+
| HELLO            |
|------------------|
| Hello Snowflake! |
+------------------+
```

### Review what you learned in this section

Congratulations! You have created, installed, and tested your first Snowflake Native App using the Snowflake Native App Framework!
Although the app only has basic functionality, the components you used to build the app are the same
for more complex apps.

In this section you completed the following:

* Added a stored procedure to the setup script. The setup script specifies how your app is
  installed in the consumer account. In later sections you add data content and other types of
  application logic to your app.
* Deployed your app for the first time using Snowflake CLI.
* Tested your installed app by running a stored procedure.

In later sections you learn about other ways to view and test your app.

## Add data content to your app

In the previous section you created an app that contains a stored procedure that demonstrates
how you would add application logic to an app.

In this section you include data content in your app by creating a database within
the `HELLO_SNOWFLAKE_PACKAGE` application package and granting privileges to share this
database with the app.

### Create a table to share with an app

In this section you learn how to share data content with an app. Specifically,
you share a table in the provider account by granting privileges on the schema and
table to the application package.

1. To create a table and insert the sample data in the application package,
   create a folder `tutorial/scripts`, then a file `shared_content.sql` inside
   the folder. Add the following contents to this file:

   ```sqlexample
   USE APPLICATION PACKAGE <% ctx.entities.hello_snowflake_package.identifier %>;

   CREATE SCHEMA IF NOT EXISTS shared_data;
   USE SCHEMA shared_data;
   CREATE TABLE IF NOT EXISTS accounts (ID INT, NAME VARCHAR, VALUE VARCHAR);
   TRUNCATE TABLE accounts;
   INSERT INTO accounts VALUES
     (1, 'Joe', 'Snowflake'),
     (2, 'Nima', 'Snowflake'),
     (3, 'Sally', 'Snowflake'),
     (4, 'Juan', 'Acme');
   -- grant usage on the ``ACCOUNTS`` table
   GRANT USAGE ON SCHEMA shared_data TO SHARE IN APPLICATION PACKAGE <% ctx.entities.hello_snowflake_package.identifier %>;
   GRANT SELECT ON TABLE accounts TO SHARE IN APPLICATION PACKAGE <% ctx.entities.hello_snowflake_package.identifier %>;
   ```

   In this example, `<% ctx.entities.hello_snowflake_package.identifier %>` is a template that is replaced by the resolved identifier
   of your `application package` from the `snowflake.yml` file when you execute a Snowflake CLI command.

   Granting these privileges on the objects within the application package makes the `shared_data.accounts`
   table available to all objects created from this application package. This sharing
   takes place due to the privileges GRANT TO SHARE command at the end of the script.

   > **Note:**
   >
   > You must grant the USAGE privilege on each schema to an application package for each schema
   > you want to share with a consumer in an app. You must then grant the SELECT privilege
   > on the objects within the schema that you want to share.
2. Add an entry to the project definition file to ensure that this script runs when you
   update your application package. The final project definition file (snowflake.yml) should be:

   ```yaml
   definition_version: 2
   entities:
      hello_snowflake_package:
         type: application package
         stage: stage_content.hello_snowflake_stage
         manifest: app/manifest.yml
         identifier: hello_snowflake_package
         artifacts:
            - src: app/*
              dest: ./
         meta:
            post_deploy:
               - sql_script: app/scripts/shared_content.sql
      hello_snowflake_app:
         type: application
         from:
            target: hello_snowflake_package
         debug: false
   ```

> **Note:**
>
> Because the script is executed directly from your local machine, it is not necessary (nor recommended)
> to add post-deploy hooks to the `artifacts` section of your project definition file.

> **Note:**
>
> Because post-deploy hooks are executed every time you deploy an app, they must be written in an
> idempotent manner.

### Add a view to access data content

In this section you update the setup script to add a view that allows the consumer who installed
the app to access the data in the `ACCOUNTS` table that you created in the previous section.

To add a view to access data content:

1. To create a schema for the view, add the following to the setup script:

   ```sqlexample
   CREATE OR ALTER VERSIONED SCHEMA code_schema;
   GRANT USAGE ON SCHEMA code_schema TO APPLICATION ROLE app_public;
   ```

   These statements create a versioned schema to contain the view and grant the USAGE privilege on
   the schema. The Snowflake Native App Framework uses versioned schema to handle different versions of
   stored procedures and functions.
2. To create the view, add the following to the setup script:

   ```sqlexample
   CREATE VIEW IF NOT EXISTS code_schema.accounts_view
     AS SELECT ID, NAME, VALUE
     FROM shared_data.accounts;
   GRANT SELECT ON VIEW code_schema.accounts_view TO APPLICATION ROLE app_public;
   ```

   These statements create the view in the `code_schema` schema and grant the required privilege
   on the view to the application role.

   This updated setup script is also uploaded to the stage the next time you deploy your app
   using Snowflake CLI.

### Test the updated app

In this subsection, you upgrade the app and query the example table using the view within the
installed app.

To test the updated app, follow these steps:

1. To update the application package and the application object installed in the consumer account,
   run the following command:

   ```snowcli
   snow app run -c tut1-connection
   ```

   This uploads all the edited files to the stage, runs the `scripts/shared_content.sql` script,
   and upgrade the app using those files on the stage.
2. To verify that the view works correctly, run the following command:

   ```snowcli
   snow sql -q "SELECT * FROM hello_snowflake_app.code_schema.accounts_view" -c tut1-connection
   ```

   The output of this command should be:

   ```output
   +----+----------+-----------+
   | ID | NAME     | VALUE     |
   |----+----------+-----------|
   |  1 | Joe      | Snowflake |
   |  2 | Nima     | Snowflake |
   |  3 | Sally    | Snowflake |
   |  4 | Juan     | Acme      |
   +----+----------+-----------+
   ```

### Review what you learned in this section

In this section you learned how to include shared data content in your app by
performing the following tasks:

* Created the `ACCOUNTS` table within the application package and inserted data into the table.
* Granted reference usage on the `ACCOUNTS` table to the application package.
* Created a schema and view that references the `ACCOUNTS` table in the application package.
* Granted usage on the schema to the application role.
* Granted select on the view to the application role.

You also updated the setup script to perform the following when the application is installed:

* Created a schema and view that the app uses to access the example data.
* Granted usage on the schema to the application role.
* Granted select on the view to the application role.

## Add python code to your app

In this section, you expand the functionality of your app by adding
Python code to enhance the application logic. In this section you include Python
code as the following:

* An inline Python UDF that is a self-contained function in the setup script.
* A Python UDF that references a Python file outside the setup script.

> **Note:**
>
> Although this section introduces examples using Python, the same techniques are
> applicable to Java and JavaScript.

### Add an inline python function as a user-defined function (UDF)

In this section you add a Python function as a UDF.

To include a Python UDF in your app, add the following code to your setup script (setup_script.sql).

```sqlexample
CREATE OR REPLACE FUNCTION code_schema.addone(i INT)
  RETURNS INT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.11'
  HANDLER = 'addone_py'
AS
$$
def addone_py(i):
    return i+1
$$;

GRANT USAGE ON FUNCTION code_schema.addone(int) TO APPLICATION ROLE app_public;
```

These commands perform the following tasks when the app is installed:

* Create a versioned schema named `code_schema`.
* Grant the usage privilege on the schema to the `APP_PUBLIC` application role.
* Create the `ADDONE()` UDF in the `code_schema` schema.
* Grant the usage privilege on the function to the `APP_PUBLIC` application role.

Note that the schema created in the code example above is a versioned schema. User-defined functions
and stored procedures must be defined in a versioned schema instead of a normal schema. This prevents
app upgrades from interfering with concurrent code execution.

### Add an external python module

To add an external python module to your app:

1. Add the following Python function to your setup script (setup_script.sql):

   ```sqlexample
   CREATE or REPLACE FUNCTION code_schema.multiply(num1 float, num2 float)
     RETURNS float
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.11
     IMPORTS = ('/python/hello_python.py')
     HANDLER='hello_python.multiply';

   GRANT USAGE ON FUNCTION code_schema.multiply(FLOAT, FLOAT) TO APPLICATION ROLE app_public;
   ```

   Similar to the previous example, these statement create a Python UDF in a schema and grants
   privileges on the function to the application role. However, this example contains an
   IMPORTS clause that refers to an external Python file that you create and include in
   your named stage.
2. In the `tutorial` folder create a subfolder named `python`.
3. In the `python` subfolder, create a file named `hello_python.py`.
4. Add the following to the `hello_python.py` file:

   ```python
   def multiply(num1, num2):
     return num1*num2
   ```

   The function defined in this external file matches the inline function defined
   in the setup script.
5. Add the following to the existing `artifacts` section of the project definition file (snowflake.yml):

   ```yaml
   - python/hello_python.py
   ```

In this section, you added a Python UDF to your app. This UDF refers to an external Python
module that can be referenced by your application package.

### Install and test the updated app

To install and test an app:

1. To update the application package and the app, run the following command:

   ```snowcli
   snow app run -c tut1-connection
   ```

   This command uploads the edited and new files to the stage and upgrades your app using the files
   on the stage.
2. To test the Python stored procedure, run the following command:

   ```snowcli
   snow sql -q "SELECT hello_snowflake_app.code_schema.addone(1)" -c tut1-connection
   ```
3. To test the referenced Python function, run the following command:

   ```snowcli
   snow sql -q "SELECT hello_snowflake_app.code_schema.multiply(1,2)" -c tut1-connection
   ```

### Review what you learned in this section

In this section, you added the following new functionality to your app:

* A Python function defined as an inline UDF.
* A Python function defined as a UDF that references external code.

You also tested each of these examples by installing an updated version of your
app and running each of the functions.

## Add a streamlit app to your app

In this section, you complete your Snowflake Native App by adding a Streamlit user interface.
Streamlit is an open source Python framework for developing data science and machine learning
applications. You can include Streamlit apps within an app to add user interaction and
data visualization.

### Create the streamlit app file

To create a Streamlit app, follow these steps:

1. In the `tutorial` folder, create a subfolder named `streamlit`.
2. In the `streamlit` folder, create a file named `hello_snowflake.py`.
3. Add the following code to this file:

   ```python
   # Import python packages
   import streamlit as st
   from snowflake.snowpark import Session

   # Write directly to the app
   st.title("Hello Snowflake - Streamlit Edition")
   st.write(
      """The following data is from the accounts table in the application package.
         However, the Streamlit app queries this data from a view called
         code_schema.accounts_view.
      """
   )

   # Get the current credentials
   session = Session.builder.getOrCreate()

   #  Create an example data frame
   data_frame = session.sql("SELECT * FROM code_schema.accounts_view")

   # Execute the query and convert it into a Pandas data frame
   queried_data = data_frame.to_pandas()

   # Display the Pandas data frame as a Streamlit data frame.
   st.dataframe(queried_data, use_container_width=True)
   ```
4. Add the following to the existing `artifacts` section of the project definition file (snowflake.yml):

   ```yaml
   - streamlit/hello_snowflake.py
   ```

### Add the streamlit object to the setup script

To create the Streamlit object in the app, follow these steps:

1. Add the following statement at the end of the `setup_script.sql` file to create the Streamlit object:

   ```sqlexample
   CREATE STREAMLIT IF NOT EXISTS code_schema.hello_snowflake_streamlit
     FROM '/streamlit'
     MAIN_FILE = '/hello_snowflake.py'
   ;
   ```

   This statement creates a STREAMLIT object in the core schema.
2. Add the following statement at the end of the `setup_script.sql` file to allow the APP_PUBLIC role to
   access the Streamlit object:

   ```sqlexample
   GRANT USAGE ON STREAMLIT code_schema.hello_snowflake_streamlit TO APPLICATION ROLE app_public;
   ```

### Install the updated app

1. To update the application package and the app, run the following command:

   ```snowcli
   snow app run -c tut1-connection
   ```

   This command uploads the edited and new files to the stage and upgrades your app using those files
   on the stage. You can then navigate to the URL this command prints out to see your new Streamlit in
   action; once you are there, click on the tab named HELLO_SNOWFLAKE_STREAMLIT that appears beside
   the name of your application.

### Review what you learned in this section

In this section you added a Streamlit app to your Snowflake Native App by doing the following:

* Created a python file that uses the Streamlit library to render a user interface.
* Created a Streamlit app in your Snowflake Native App that displays shared data.

## Add a version to your app

In previous sections, you have been using a “stage development” mode to push changes.
The stage development mode allows you to quickly iterate app development without having to create
new versions or patches. However, you must create a version of the app to list your application package
and share it with other Snowflake users.

In this section, you add a version to your app that includes all of the functionality you have
added in this tutorial.

1. To add a version to the `HELLO_SNOWFLAKE_PACKAGE` application package, run the following command:

   ```snowcli
   snow app version create V1_0 -c tut1-connection
   ```

   In this command, you modified your application package to add a version based on the
   application files that you uploaded to the named stage in an earlier section.

   > **Note:**
   >
   > The value specified for VERSION is a label, not a numerical value or string.

   > **Note:**
   >
   > The patch number for the new version you added is automatically created at `0`. As you add
   > additional patches for a version, these are automatically incremented. However, when
   > you create a new version, for example `V1_1`, the patch number for that version is reset
   > to `0`.
2. To verify that the version was added to the application package, run the following command:

   ```snowcli
   snow app version list -c tut1-connection
   ```

   This command shows additional information about the version as shown in the following output:

   ```text
   +---------+-------+-------+---------+-------------------------------+------------+-----------+-------------+-------+---------------+
   | version | patch | label | comment | created_on                    | dropped_on | log_level | trace_level | state | review_status |
   |---------+-------+-------+---------+-------------------------------+------------+-----------+-------------+-------+---------------|
   | V1_0    | 0     | NULL  | NULL    | 2024-05-09 10:33:39.768 -0700 | NULL       | OFF       | OFF         | READY | NOT_REVIEWED  |
   +---------+-------+-------+---------+-------------------------------+------------+-----------+-------------+-------+---------------+
   ```
3. To install the app based on a version, run the following command:

   ```snowcli
   snow app run --version V1_0 -c tut1-connection
   ```

   Because the existing app was created using files on the named stage, upgrading the app
   using a version requires the existing app to be dropped and recreated with this version.
   Answer yes to the prompt accordingly.

In this section, you modified the application package to include a version for your
app and re-created the application object using versioned development mode.

## View your app in Snowsight

In this section, you view your app in Snowsight. In previous sections, you used
SQL statements to test or find information about your app. However, you can also view information
about your app in Snowsight. You can also view your deployed Streamlit app.

To view your app in Snowsight, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. Switch to the TUTORIAL1_ROLE role you created previously:

   1. In the navigation menu, select your username to open the account menu.
   2. Select the active role. For example, PUBLIC.

      The role selector appears.
   3. Select the TUTORIAL1_ROLE role.
3. In the navigation menu, select Catalog » Apps.
4. Select `HELLO_SNOWFLAKE_APP`.

   The About the app tab displays the content you added to the `app/README.md` file in an earlier section.
5. To view your Streamlit app, select HELLO_SNOWFLAKE_STREAMLIT.
6. If needed, select a warehouse to proceed.

   The content of the `HELLO_SNOWFLAKE_DATA` database displays in a Streamlit data frame.
7. To open the app in a worksheet, in the navigation menu, select Projects » Worksheets.
8. [Create a new SQL worksheet](../../../user-guide/ui-snowsight-worksheets-gs.md) named HELLO_SNOWFLAKE_APP.
9. If necessary, select the warehouse where you installed the app.
10. Select the `tutorial1_role` role you created:

    ```sqlexample
    USE ROLE tutorial1_role;
    ```
11. Select the `hello_snowflake_app` application object you created:

    ```sqlexample
    USE APPLICATION hello_snowflake_app;
    ```
12. Grant the ACCOUNTADMIN role with the privilege to attach a listing for the HELLO_SNOWFLAKE_PACKAGE application package:
    following command:

    ```sqlexample
    GRANT ATTACH LISTING ON APPLICATION PACKAGE HELLO_SNOWFLAKE_PACKAGE TO ROLE ACCOUNTADMIN;
    ```

    This grant is needed to allow you to publish your app as the account administrator, which you’ll do in the next section.

From the Snowflake worksheet you can test your app using SQL commands. For example, you can
re-run the commands you ran in previous sections to test the features you added to your application:

```sqlexample
LIST @hello_snowflake_package.stage_content.hello_snowflake_stage;
CALL core.hello();
SELECT * FROM code_schema.accounts_view;
SELECT code_schema.addone(10);
SELECT code_schema.multiply(2,3);
```

> **Note:**
>
> You can also directly view your app’s user interface by using the `snow app open`
> command in Snowflake CLI. This command opens the appropriate URL in your
> system-configured web browser.

## Publish and install your app

In this section, you publish your app by creating a private listing
that uses the application package as the data content. After creating the listing,
you login to another account to install the listing.

### Set the release channel

Before you can create a listing for your application package, you must
set the release channel. A release channel is a version management tool that
allows providers to publish apps at different stages of the app development lifecycle.

For information about release channels, see
[About release channels, versions, and patches](../release-channels-versions.md).

In this tutorial you set the release channel using the version you
added in a previous section.

To set the release channel on the application package, follow these steps:

1. To view the versions and patches defined for your application package, run the
   following command:

   ```snowcli
   snow app version list -c tut1-connection
   ```

   This command displays the versions and patches defined for the application package.
2. To attach the previously created version to the `default` release channel , run the following command:

   ```snowcli
   snow app release-channel add-version --version V1_0 default -c tut1-connection
   ```

   The output of this command is shown in the following example:

   ```text
   Successfully added version V1_0 to the release channel.
   ```
3. To publish the app using version `V1_0` and patch `0`, run the following command:

   ```snowcli
   snow app publish --version V1_0 --patch 0 --channel DEFAULT -c tut1-connection
   ```

   The output of this command is shown in the following example:

   ```text
   Version V1_0 and patch 0 published to release directive DEFAULT of release channel DEFAULT.
   ```
4. (Optional) This step is only necessary if you want to share your app with consumers outside your organization. Note that this step runs
   a security scan, which may take up to 24 hours to complete. Your application listing won’t be available to consumers until the security scan is complete.

   To share your app with consumers outside your organization, run the following command:

   ```snowcli
   snow sql  -c tut1-connection -q "ALTER APPLICATION PACKAGE hello_snowflake_package SET DISTRIBUTION = EXTERNAL;"
   ```

   The output of this command is shown in the following example:

   ```text
   +----------------------------------+
   | status                           |
   |----------------------------------|
   | Statement executed successfully. |
   +----------------------------------+
   ```

In this section, you verified what versions and patches exist in your application package.
Using this information, you configured the release channel for the application package
and published the app.

### Create a listing for your application

Now that you have specified a release directive for your application package, you
create a listing and add the application package as the data content of the listing. This
allows you to share your app with other Snowflake users and allows them to install
and use the app in their account.

To create a listing for your app:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing and then Specified consumers to privately share the listing with specific accounts.
4. From the Select role drop-down, select ACCOUNTADMIN.
5. Enter a name for your listing.
6. Select Next.
7. Click + Add data product and then `+ Select` to select the application package for the listing.
8. Enter a description for your listing.
9. In the Add consumer accounts section, add the account identifier for the account
   you are using to test the consumer experience of installing the app from a listing.
10. Select Publish.

In this section you created a private listing containing your application package as the
shared data content.

### Install the app in a consumer account

In this section you install the app associated with the listing you created
in the previous section. You install the listing in a different account which mimics
how a consumer would install the app in their account.

To install your app from the listing, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the tile for the listing under Recently shared with you.
4. Select Get.
5. Select Options and enter a customer-facing name for the app. For this tutorial, use “HelloSnowflakeApp”.
6. Select the warehouse where you want to install the app.
7. Select Get.
8. Select Open to view your listing or Done to finish.

In this section you learned how to publish and install a listing that allows you to share
your app with other Snowflake users.

## Learn more

Congratulations! Not only have you finished this tutorial, but you have worked through
development and publishing life cycle of an app using the Snowflake Native App Framework.

Along the way, you:

* Used Snowsight and Snowflake CLI to build an app using the
  Snowflake Native App Framework.

  + For more information about Snowsight, refer to
    [Getting started with worksheets](../../../user-guide/ui-snowsight-worksheets-gs.md) and
    [Work with worksheets in Snowsight](../../../user-guide/ui-snowsight-worksheets.md).
  + For more information about Snowflake Native App in Snowflake CLI, refer to
    [Using Snowflake Native App in Snowflake CLI](../../snowflake-cli/native-apps/overview.md).
* Created the manifest and setup script that are required by all apps.

  + Refer to [Create the manifest file for an app](../manifest-overview.md) and
    [Create the setup script](../creating-setup-script.md) for details.
* Created an application package that works as a container for the application logic and
  data content of your app.

  + Refer to [Create and manage an application package](../creating-app-package.md) for details.
* Added logic to your app using stored procedures and UDFs written in Python.

  + Refer to [Add application logic to an application package](../adding-application-logic.md) for information
    on using stored procedures, UDFs, and external function in the Snowflake Native App Framework.
  + Refer to [Snowpark API](../../snowpark/index.md), [Extending Snowflake with Functions and Procedures](../../extensibility.md)
    and [Writing external functions](../../../sql-reference/external-functions.md) for general information on each type of
    procedure and function.
* Added shared data content to your app.

  + Refer to [Share data content in a Snowflake Native App](../preparing-data-content.md) for additional information.
* Included a Streamlit app in your app.

  Refer to [Add a Streamlit app](../adding-streamlit.md) for additional information.
* Viewed your app in Snowsight.

  + Refer to [Working with Apps as a Consumer](https://other-docs.snowflake.com/en/native-apps/consumer-about)
* Created a private listing for your app and installed the app in a
  separate Snowflake account.

  + Refer to [Sharing an App with Consumers](https://other-docs.snowflake.com/en/native-apps/provider-publishing-app-package)
    for information on publish a listing containing an application package.
  + Refer to [Installing an App from a Listing](https://other-docs.snowflake.com/en/native-apps/consumer-installing)
    for information on how consumers install an app from a listing.

---
title: Tutorial 2: Create an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/tutorials/na-spcs-tutorial.md
section: Native Apps Framework
---

App Development

# Tutorial 2: Create an app with containers

## Introduction

The Snowflake Native App Framework allows providers to build, sell, and distribute apps within the
Snowflake Data Cloud. Providers can create apps that leverage core Snowflake functionality to share data
and application logic with consumers. The logic of a Snowflake Native App can include features such as stored procedures and user-defined functions (UDFs). Providers can share their applications with consumers through listings in the
Snowflake Marketplace or through private listings.

A Snowflake Native App can implement Snowpark Container Services to facilitate the deployment, management, and scaling of
containerized apps within the Snowflake ecosystem. This tutorial describes how to create a Snowflake Native App with Snowpark Container Services, which is
a Snowflake Native App that runs container workloads in Snowflake. Snowflake Native Apps with Snowpark Container Services can run any containerized services, while
leveraging all of the features of the Snowflake Native App Framework, including security, logging, shared data content and application logic.

> **Note:**
>
> This tutorial uses both Snowflake CLI and Snowsight to perform the required tasks.

### What you learn in this tutorial

In this tutorial, you learn how to:

* Use Snowflake CLI to initialize a Snowflake Native App with Snowpark Container Services project.
* Build a Docker image for an app.
* Create the application package and required application files for a Snowflake Native App with Snowpark Container Services.
* Test a Snowflake Native App with Snowpark Container Services by calling the service function within the container.

### Set up your Snowflake environment

To perform this tutorial, you must meet the following prerequisites:

* Access to a Snowflake account that supports Snowpark Container Services.
* You must be able to use the ACCOUNTADMIN role to create the role used in this
  tutorial and grant the required privileges to that role.
* You must have [Snowflake CLI](../../snowflake-cli/index.md)
  version `3.0.0` or greater installed on your local machine.
* You must have Docker Desktop installed on your local machine.

## Set up a role for this tutorial

This tutorial walks you through the process of creating a Snowflake Native App with Snowpark Container Services using the
`tutorial_role` role. Before working through this tutorial, a Snowflake user with the
ACCOUNTADMIN role must perform the following steps to configure this role.

To create and set up the `tutorial_role` role, follow these steps:

1. To create the `tutorial_role` role, run the following command:

   ```sqlexample
   CREATE ROLE tutorial_role;
   ```
2. To grant the `tutorial_role` to the Snowflake user who performs the tutorial, run the
   following command:

   ```sqlsyntax
   GRANT ROLE tutorial_role TO USER <user_name>;
   ```

   Where:

   > `user_name`
   > :   Specifies the name of the user who performs the tutorial.
3. To grant the privileges required to create and use the Snowflake objects required by a container
   app, run the following commands:

   ```sqlsyntax
   GRANT CREATE INTEGRATION ON ACCOUNT TO ROLE tutorial_role;
   GRANT CREATE WAREHOUSE ON ACCOUNT TO ROLE tutorial_role;
   GRANT CREATE DATABASE ON ACCOUNT TO ROLE tutorial_role;
   GRANT CREATE APPLICATION PACKAGE ON ACCOUNT TO ROLE tutorial_role;
   GRANT CREATE APPLICATION ON ACCOUNT TO ROLE tutorial_role;
   GRANT CREATE COMPUTE POOL ON ACCOUNT TO ROLE tutorial_role WITH GRANT OPTION;
   GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE tutorial_role WITH GRANT OPTION;
   ```

After performing the tasks in this section, the user that has the `tutorial_role` role granted
to their account has the permissions to create all of the Snowflake objects required to
create a Snowflake Native App with Snowpark Container Services.

You use this role through the rest of this tutorial.

In a real-world situation, a provider may need similar privileges or access to
existing objects to develop an app with containers, including a compute pool, warehouse, and database.

## Create the required objects in your account

In this section, you create the Snowflake objects required by an app with containers.

### Create a warehouse and image repository

To create the required objects, perform the following either through Snowsight or Snowflake CLI.

1. To set the current context in Snowsight to use the `tutorial_role` role, run the following command:

   ```sqlexample
   USE ROLE tutorial_role;
   ```

   If you are using Snowflake CLI, you can use `--role tutorial_role` instead.
2. To create a warehouse for the Snowflake Native App with Snowpark Container Services, run the following command:

   ```sqlexample
   CREATE OR REPLACE WAREHOUSE tutorial_warehouse WITH
     WAREHOUSE_SIZE = 'X-SMALL'
     AUTO_SUSPEND = 180
     AUTO_RESUME = true
     INITIALLY_SUSPENDED = false;
   ```

   A warehouse is required by the Snowflake Native App to run SQL commands and stored procedures.
3. To create the image repository used to store the container, run the following command:

   ```sqlexample
   CREATE DATABASE tutorial_image_database;
   CREATE SCHEMA tutorial_image_schema;
   CREATE IMAGE REPOSITORY tutorial_image_repo;
   ```

In this section, you created a warehouse that is used to execute queries for the app
you create, as well as an image repository to host container images.

In the next section you create an image for the container and upload it to the
image repository you created above.

## Create a Snowflake CLI connection for the tutorial

To run the Snowflake CLI commands in this tutorial, you must setup a Snowflake CLI connection for the tutorial.

To create a connection, perform the following tasks:

1. From the terminal, run the following command:

   ```snowcli
   snow connection add
   ```
2. Enter `tut-connection` for the name of the connection.
3. Enter additional information for the Snowflake CLI connection.

   The specific values you use depend on your Snowflake account. However, you must use the following
   values for the role, warehouse, database, and schema properties:

   | Parameter | Required value |
   | --- | --- |
   | Role for the connection | tutorial_role |
   | Warehouse for the connection | tutorial_warehouse |
   | Database for the connection | tutorial_image_database |
   | Schema for the connection | tutorial_image_schema |
4. Verify the connection by running the following command:

   ```snowcli
   snow connection test -c tut-connection
   ```

   The output of this command should look similar to the following:

   ```snowcli
   +----------------------------------------------------------------------------------+
   | key             | value                                                          |
   |-----------------+----------------------------------------------------------------|
   | Connection name | tut-connection                                                 |
   | Status          | OK                                                             |
   | Host            | USER_ACCOUNT.snowflakecomputing.com                            |
   | Account         | USER_ACCOUNT                                                   |
   | User            | tutorial_user                                                  |
   | Role            | TUTORIAL_ROLE                                                  |
   | Database        | TUTORIAL_IMAGE_DATABASE                                        |
   | Warehouse       | TUTORIAL_WAREHOUSE                                             |
   +----------------------------------------------------------------------------------+
   ```

> **Caution:**
>
> If you do not create the `tut-connection` connection, you must use a connection that
> specifies the correct values for the role, database, and warehouse connection properties.

## Setup a project for the app

In the previous section, you set up a Snowflake CLI connection for the tutorial.

In this section, you use Snowflake CLI to create a project for your app. A project contains
all of the assets required by an app. These files are stored on your local file system and can be
managed by a version control system as part of your development workflow.

### Create a project file using the Snowflake CLI

1. To create a project file, run the following command:

   ```bash
   snow init --template app_basic na-spcs-tutorial
   ```
2. Enter a value for the project identifier.

   > You add additional files and subfolders to this folder and edit the files
   > this command created in later subsections.

This command creates a folder named `na-spcs-tutorial` using the `app_basic` project template.

Within the `na-spcs-tutorial` folder, this command creates the following files and folders:

```text
├── README.md
├── app
    └── manifest.yml
    └── README.md
    └── setup_script.sql
├── snowflake.yml
```

In later sections you modify these files and add additional resources to your app.

### Add the service files to the app project

In the previous section you created a project which includes the default application
files required by your app. In this section, you add the files required to create the container for your app.

1. Create a folder called `service` inside the `na-spcs-tutorial` folder.

   This folder contains the source code for the container-based service we are about to build and publish to Snowflake.
2. To obtain the Docker files required for the tutorial, download the
   [`na_spcs_tutorial.zip`](../../../_downloads/af07363a9484a0281a50e9a0556f78cb/na-spcs-tutorial.zip) file to your local file system.
3. Unzip the contents of the zip file into the `na-spcs-tutorial/service` folder. This folder
   should contain the following files:

   * `echo_service.py`
   * `Dockerfile`
   * `templates/basic_ui.html`
   * `echo_spec.yaml`

### Verify the directory structure of your project

After create the project for your app and adding the files for the service and Docker container, the project
should have the following structure within the `na-spcs-tutorial` folder:

```text
├── app
      └── manifest.yml
      └── README.md
      └── setup_script.sql
├── README.md
├── service
      └── echo_service.py
      ├── echo_spec.yaml
      ├── Dockerfile
      └── templates
         └── basic_ui.html
├── snowflake.yml
```

## Build an image for a Snowpark Container Services service

In this section, you build a Docker image and upload it to the image repository you
created in the previous section.

### Build a Docker image and upload it to the image repository

To build a Docker image and upload it to the image repository, follow these steps:

1. From a terminal window, change to the `na-spcs-tutorial/service` folder.
2. Run the following Docker CLI command. Note that you must specify the current working
   directory (.) in the command:

   ```bash
   docker build --rm --platform linux/amd64 -t my_echo_service_image:tutorial .
   ```

   This command performs the following:

   * Builds a Docker image using the Docker file included in the zip file that you downloaded
   * Names the image `my_echo_service_image`
   * Applies the `tutorial` tag to the image.
3. To identify the URL of the image repository you created in a previous section, run the following
   command:

   ```bash
   REPO_URL=$(snow spcs image-repository url tutorial_image_database.tutorial_image_schema.tutorial_image_repo -c tut-connection)
   echo $REPO_URL
   ```

   The URL of the image repository is captured in the `$REPO_URL` variable, then printed to the console.
   You use this value in the next step.
4. To create a tag for the image that includes the image URL, run the following Docker CLI command:

   ```bash
   docker tag <image_name> <image_url>/<image_name>
   ```

   This command requires two parameters:

   * `<image_name>`
     Specifies the name of the image and tag.
   * `<image_url>/<image_name>`
     Specifies the URL of the image repository where the image is uploaded and the image name
     and tag where it should be stored in the remote repository.

   For this tutorial, use `$REPO_URL` and `my_echo_service_image:tutorial`:

   ```bash
   docker tag my_echo_service_image:tutorial $REPO_URL/my_echo_service_image:tutorial
   ```
5. To authenticate with the Snowflake registry, run the following Snowflake CLI command:

   ```bash
   snow spcs image-registry login -c tut-connection
   ```

   This command loads necessary credentials required for the Docker CLI to use the image repositories
   in your Snowflake account. You must specify the connection name, if you are not using the default.

   The message `Login Succeeded` displays if everything was successful.
6. To upload the Docker image to the image repository, run the following `docker push` command:

   ```bash
   docker push $REPO_URL/<image_name>
   ```

   Using the same value as `<image_name>` from previous steps, this command is:

   ```bash
   docker push $REPO_URL/my_echo_service_image:tutorial
   ```
7. Confirm the image was uploaded successfully by running the following command:

   ```snowcli
   snow spcs image-repository list-images tutorial_image_database.tutorial_image_schema.tutorial_image_repo -c tut-connection
   ```

In this section, you created a Docker image containing the echo service and pushed it to the
`tutorial_repository` image repository you created earlier in the tutorial.

In the next section, you create an application package that uses this image.

## Develop your Snowflake Native App

In a previous section, you used the Snowflake CLI to create a project file based on a project template. This template
created default versions of the files required by the app.

In this section, you update these default files for your app:

Project Definition file
:   A YAML file that contains information about the Snowflake object(s) that you want to create.
    This file is called `snowflake.yml` and is used by Snowflake CLI to deploy the application
    package and object into your account.

Manifest file
:   A YAML file that contains basic configuration and callback information about the application.
    This file is called `manifest.yml`.

Setup Script
:   An SQL script that runs automatically when a consumer installs an application in
    their account. This file can be called whatever you like, as long as it is referenced
    by your manifest.

The first file is used by Snowflake CLI, while the latter two are required by the Snowflake Native App Framework.

You learn more about these files, and their contents, throughout this tutorial.

In this section, you also create a readme file that is useful when viewing and publishing your
app.

### Modify the defaul manifest file

To modify the manifest file for the app, follow these steps:

1. Modify `na-spcs-tutorial/app/manifest.yml` to look like the following:

   ```yaml
   manifest_version: 1

   artifacts:
      setup_script: setup_script.sql
      readme: README.md
      container_services:
         images:
         - /tutorial_image_database/tutorial_image_schema/tutorial_image_repo/my_echo_service_image:tutorial

   privileges:
   - BIND SERVICE ENDPOINT:
        description: "A service that can respond to requests from public endpoints."
   - CREATE COMPUTE POOL:
        description: "Permission to create compute pools for running services"
   ```

   This example includes the following:

   * The `artifacts` property specifies the locations of resources required by an app
     with containers, including the location of the Docker image you created in a previous step,
     as well as the project README that is visible in Snowsight.
   * The `privileges` property allows a service to respond to public requests as well
     as to create its own compute pool. These properties are required for instantiating our service
     in the next step of the tutorial.

### Modify the default setup script

To modify the default setup script for the application package, follow these steps:

1. Modify the `na-spcs-tutorial/app/setup_script.sql` file to include the following:

   ```sqlexample
   CREATE APPLICATION ROLE IF NOT EXISTS app_user;

   CREATE SCHEMA IF NOT EXISTS core;
   GRANT USAGE ON SCHEMA core TO APPLICATION ROLE app_user;

   CREATE OR ALTER VERSIONED SCHEMA app_public;
   GRANT USAGE ON SCHEMA app_public TO APPLICATION ROLE app_user;

   CREATE OR REPLACE PROCEDURE app_public.start_app()
      RETURNS string
      LANGUAGE sql
      AS
   $$
   BEGIN
      -- account-level compute pool object prefixed with app name to prevent clashes
      LET pool_name := (SELECT CURRENT_DATABASE()) || '_compute_pool';

      CREATE COMPUTE POOL IF NOT EXISTS IDENTIFIER(:pool_name)
         MIN_NODES = 1
         MAX_NODES = 1
         INSTANCE_FAMILY = CPU_X64_XS
         AUTO_RESUME = true;

      CREATE SERVICE IF NOT EXISTS core.echo_service
         IN COMPUTE POOL identifier(:pool_name)
         FROM spec='service/echo_spec.yaml';

      CREATE OR REPLACE FUNCTION core.my_echo_udf (TEXT VARCHAR)
         RETURNS varchar
         SERVICE=core.echo_service
         ENDPOINT=echoendpoint
         AS '/echo';

      GRANT USAGE ON FUNCTION core.my_echo_udf (varchar) TO APPLICATION ROLE app_user;

      RETURN 'Service successfully created';
   END;
   $$;

   GRANT USAGE ON PROCEDURE app_public.start_app() TO APPLICATION ROLE app_user;

   CREATE OR REPLACE PROCEDURE app_public.service_status()
   RETURNS TABLE ()
   LANGUAGE SQL
   EXECUTE AS OWNER
   AS $$
      BEGIN
            LET stmt VARCHAR := 'SHOW SERVICE CONTAINERS IN SERVICE core.echo_service';
            LET res RESULTSET := (EXECUTE IMMEDIATE :stmt);
            RETURN TABLE(res);
      END;
   $$;

   GRANT USAGE ON PROCEDURE app_public.service_status() TO APPLICATION ROLE app_user;
   ```

### Modify the default README

To modify the README file for the app, follow these steps:

1. Modify `na-spcs-tutorial/app/README.md` to look like the following:

   ```text
   Welcome to your first app with containers!
   ```

This README file is visible to consumers after they install your app.

### Modify the default project definition file

In this section, you modify the project definition file used by the Snowflake CLI.

1. Modify `na-spcs-tutorial/snowflake.yml` to look like the following:

   ```yaml
   definition_version: 2
   entities:
      na_spcs_tutorial_pkg:
         type: application package
         manifest: app/manifest.yml
         artifacts:
            - src: app/*
              dest: ./
            - service/echo_spec.yaml
         meta:
            role: tutorial_role
            warehouse: tutorial_warehouse
      na_spcs_tutorial_app:
         type: application
         from:
            target: na_spcs_tutorial_pkg
         debug: false
         meta:
            role: tutorial_role
            warehouse: tutorial_warehouse
   ```

In this section, you defined a local file structure that can be deployed to a Snowflake account
as a Snowflake Native App with Snowpark Container Services. In the next section, you perform this deployment using Snowflake CLI.

## Create and test the app

After defining the manifest file, setup script, and service specification for your Snowflake Native App with Snowpark Container Services,
you can test the app by deploying it to your account using Snowflake CLI.

### Upload files to the stage and create the application object

To create an app in development mode, follow these steps:

1. In a terminal, change to the `na-spcs-tutorial` folder.
2. Create the application package and object in your account by running the following command:

   ```bash
   snow app run -c tut-connection
   ```

   This command displays a confirmation that an application package called
   `na_spcs_tutorial_pkg` and an application object called `na_spcs_tutorial_app`
   have been created in your account. These names correspond to the names in the
   `snowflake.yml` project definition you modified in a previous section.

You can use the URL output to the console to view the application. However,
you must first ensure it has all necessary privileges to create its container-based service.

### Grant the privileges and test the app

In this section, you grant the required privileges to the app and test the app by
calling the services in the container.

You can run SQL commands using either Snowsight or the Snowflake CLI.

To grant the privileges and test the app, perform the following steps from a Snowflake worksheet:

1. Grant the `CREATE COMPUTE POOL` privilege to the app by running the following:

   ```sqlexample
   GRANT CREATE COMPUTE POOL ON ACCOUNT TO APPLICATION na_spcs_tutorial_app;
   GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO APPLICATION na_spcs_tutorial_app;
   ```
2. Run the `app_public.start_app` procedure we defined in the `setup_script.sql` file.

   ```sqlexample
   CALL na_spcs_tutorial_app.app_public.start_app();
   ```

   This procedure creates the compute pool, instantiate the service, and creates the service function.
3. Confirm the function was created by running the following:

   ```sqlexample
   SHOW FUNCTIONS LIKE '%my_echo_udf%' IN APPLICATION na_spcs_tutorial_app;
   ```

   > **Note:**
   >
   > Consumers cannot see the running service because it runs as part of the Snowflake Native App.
   > For example, running `SHOW SERVICES IN APPLICATION na_spcs_tutorial_app;` does not
   > return anything.
4. To verify that the service has been created and healthy, run the following command:

   ```sqlexample
   CALL na_spcs_tutorial_app.app_public.service_status();
   ```

   This statement calls the `app_public.service_status` procedure that you defined in the setup script. The procedure returns information about the containers for this service.

   If the value in the `status` column is not `READY`, execute the statement again, until the status of the service container is `READY`.
5. To call the service function to send a request to the service and verify the response, run
   the following command:

   ```sqlexample
   SELECT na_spcs_tutorial_app.core.my_echo_udf('hello');
   ```

   You see the following message from the service you configured in an earlier section:

   ```text
   ``Bob said hello``
   ```

## Teardown the app and objects created in the tutorial

> **Caution:**
>
> If you plan to perform the [Tutorial 3: Upgrade an app with containers](na-upgrade-tutorial.md) after completing this tutorial,
> do not perform the steps in this section. The app with containers you created in this tutorial
> is a prerequisite for the upgrade tutorial.

Because the app uses a compute pool, it accrues credits in your account
and costs money to run. To stop the app from consuming resources, you must tear down
both the application object and any of the account-level objects it created, for example the
compute pool.

1. To confirm that the compute pool is currently running, run the following command:

   ```snowcli
   snow object list compute-pool -l "na_spcs_tutorial_app_%"
   ```

   If the compute pool is running, a row with an `ACTIVE` compute pool that was created by the
   application object is displayed.
2. Run the following Snowflake CLI command to tear down the app:

   ```snowcli
   snow app teardown --cascade --force -c tut-connection
   ```

   This command removes all of the Snowflake objects created by the app. Without the `--force` option,
   this command does not drop the application package because it contains versions.
3. To confirm that the compute pool was dropped run the following command again:

   ```snowcli
   snow object list compute-pool -l "na_spcs_tutorial_app_%"
   ```

   This command returns `no data` if the compute pool has been dropped successfully.

> **Note:**
>
> The `snow app teardown` command drops both the application package and application object.
> Therefore, any stateful data is lost.

## Learn more

Congratulations! Not only have you finished this tutorial, but you have worked through the
development and publishing life cycle of a Snowflake Native App with Snowpark Container Services.

Along the way, you:

* Used Snowsight and Snowflake CLI to build an application using the
  Snowflake Native App Framework.

  + See [Configuring Snowflake CLI and connecting to Snowflake](../../snowflake-cli/connecting/connect.md) for more information
    on how to configure the connections used by Snowflake CLI.
  + For more information about Snowsight, refer to
    [Getting started with worksheets](../../../user-guide/ui-snowsight-worksheets-gs.md) and
    [Work with worksheets in Snowsight](../../../user-guide/ui-snowsight-worksheets.md).
  + For more information about Native Apps in Snowflake CLI, refer to
    [Using Snowflake Native App in Snowflake CLI](../../snowflake-cli/native-apps/overview.md).
* Created the manifest and setup script that are required by all applications.

  + Refer to [Create the manifest file for an app](../manifest-overview.md) and
    [Create the setup script](../creating-setup-script.md) for details.
* Created an application package that works as a container for the application logic and
  data content of your application.

  + Refer to [Create and manage an application package](../creating-app-package.md) for details.
* Used Docker CLI and Snowflake CLI to build and upload a container to Snowflake.
* Used Snowpark Container Services to create a `COMPUTE POOL` and instantiate the
  container inside of a Snowflake Native App.

---
title: Tutorial 3: Upgrade an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/tutorials/na-upgrade-tutorial.md
section: Native Apps Framework
---

App Development

# Tutorial 3: Upgrade an app with containers

## Introduction

The Snowflake Native App Framework allows providers to build, sell, and distribute apps within the Snowflake Data Cloud. Providers can
create apps that leverage core Snowflake functionality to share data and application logic with consumers. Apps
can also implement Snowpark Container Services to facilitate the deployment, management, and scaling of
containerized apps within the Snowflake ecosystem.

The Snowflake Native App Framework allows providers to make updates to an app and publish new version or patch to consumers. This tutorial
describes how to perform the following tasks:

* Add a version initializer to the app.
* Create versions and patches for changes made to the app.
* Upgrade the app in the consumer account.

### Prerequisite tutorials

This tutorial assumes that you know how to develop a basic Snowflake Native App and can create
a Snowflake Native App with Snowpark Container Services. This tutorial builds on the knowledge gained from completing the following tutorials:

* [Tutorial 1: Create a basic Snowflake Native App](getting-started-tutorial.md)
* [Tutorial 2: Create an app with containers](na-spcs-tutorial.md)

Before following this tutorial to upgrade an app with containers, ensure that you have completed both
of these tutorials.

> **Caution:**
>
> This tutorial builds on the app you created in [Tutorial 2: Create an app with containers](na-spcs-tutorial.md). If
> you do not have the application files and Snowflake objects in your account, you must work through that tutorial
> again before starting this tutorial. See Verify the app from the previous tutorial exists in your account for more information.

### What you learn in this tutorial

This tutorial expands the app with containers you created in
[Tutorial 2: Create an app with containers](na-spcs-tutorial.md). In this tutorial you learn how to:

* Use the version initializer callback function to handle service upgrades and failures.
* Create version definitions for an app.
* Upgrade an app.
* Simulate upgrade failure for an app.
* Create a patch for the app to fix the failure.

## Verify the app from the previous tutorial exists in your account

To verify that the app with containers you created in [Tutorial 2: Create an app with containers](na-spcs-tutorial.md)
is still available in your account, perform the following tasks:

> **Caution:**
>
> If any of the following tasks does not complete successfully, you will need to perform
> [Tutorial 2: Create an app with containers](na-spcs-tutorial.md) again.

1. To verify that Snow CLI is configured correctly, run the following command:

   ```snowcli
   snow connection test -c tut-connection
   ```

   The output of this command should be similar to the following:

   ```text
   +----------------------------------------------------------------------------------+
   | key             | value                                                          |
   |-----------------+----------------------------------------------------------------|
   | Connection name | tut-connection                                                 |
   | Status          | OK                                                             |
   | Host            | USER_ACCOUNT.snowflakecomputing.com                            |
   | Account         | USER_ACCOUNT                                                   |
   | User            | tutorial_user                                                  |
   | Role            | TUTORIAL_ROLE                                                  |
   | Database        | TUTORIAL_IMAGE_DATABASE                                        |
   | Warehouse       | TUTORIAL_WAREHOUSE                                             |
   +----------------------------------------------------------------------------------+
   ```

   This command verifies the following requirements:

   * The Snow CLI connection is working.
   * The TUTORIAL_ROLE exists.
   * The TUTORIAL_WAREHOUSE exists.
2. To verify that the other required Snowflake objects exist, run the following commands from a worksheet:

   ```sqlsyntax
   USE ROLE tutorial_role;
   ```

   ```sqlsyntax
   SHOW DATABASES LIKE 'tutorial_image_database';
   ```

   ```sqlsyntax
   SHOW SCHEMAS LIKE 'tutorial_image_schema';
   ```

   ```sqlsyntax
   SHOW IMAGE REPOSITORIES LIKE 'tutorial_image_repo';
   ```

   Each of these commands should return the name of each Snowflake object.
3. To verify that the service is still running, run the following command from a worksheet:

   ```sqlsyntax
   CALL na_spcs_tutorial_app.app_public.service_status();
   ```
4. Ensure that your local directory structure looks like the following example:

   ```text
   ├── app
       └── manifest.yml
       └── README.md
       └── setup_script.sql
   ├── README.md
   ├── service
       └── echo_service.py
       ├── echo_spec.yaml
       ├── Dockerfile
       └── templates
           └── basic_ui.html
   ├── snowflake.yml
   ```

   You may also see a folder called `output` that contains the app files generated by the `snow app run` command.

### Drop the application object

If the app you created when working through [Tutorial 2: Create an app with containers](na-spcs-tutorial.md)
still exists in your account, you must drop the application object before proceeding with this tutorial.

> **Note:**
>
> You must drop the existing app because an app created in development mode directly from staged files cannot be upgraded.

1. To determine whether the app from the previous tutorial (`na_spcs_tutorial_app`) exists in your account, run
   the following command from a worksheet:

   ```sqlexample
   SHOW APPLICATIONS LIKE 'na_spcs_tutorial_app';
   ```
2. If the `na_spcs_tutorial_app` app appears in the output of this command, drop the app by running the following commands
   from a worksheet:

   ```sqlexample
   USE ROLE tutorial_role;
   DROP APPLICATION IF EXISTS na_spcs_tutorial_app CASCADE;
   ```

### What you completed in this section

In this section, you verified that the application files and Snowflake objects from the previous tutorial are still
working in your account.

In the next section, you will learn more about versions and upgrades in the Snowflake Native App Framework.

## Understand versions, patches and upgrades

The section introduces you to the concepts covered in this tutorial, including:

* Versions and patches
* Upgrades
* The version initializer

### About versions and patches

Versions in the Snowflake Native App Framework are combinations of version and patch numbers. These are defined in the application package.

. rst-class:: bulleted-definition-list

Version
:   Generally contains major updates to a Snowflake Native App. Versions are defined in an application package.

Patch
:   Generally contains smaller updates to a Snowflake Native App. Like versions, patches are defined in the
    application package.

> **Note:**
>
> An application package can only have two active versions at one time. A single version of an app can have up to 130 patches.

### About upgrades

Within the context of the Snowflake Native App Framework, upgrades are updates to a version or patch of a Snowflake Native App that is
installed in the consumer account. The Snowflake Native App Framework supports two types of upgrades:

Automated upgrades
:   Automated upgrades are upgrades that are initiated by the provider. When a new version or patch is
    available, the provider modifies the release directive on the application package. This triggers an
    automatic upgrade of all installed instances of the app specified by the release directive.

Manual upgrades
:   Manual upgrades are upgrades that are initiated by the consumer in response to communication from
    the provider. Manual upgrades are useful when a provider needs to quickly release an update, such as a bug fix, to a consumer.

    > **Note:**
    >
    > This tutorials describes how to perform a manual upgrade for an app with containers.

When a new version or patch is available, the provider modifies the release directive on the
application package and then notifies the consumer that a new version is available.

The consumer performs the upgrade by running the [ALTER APPLICATION](../../../sql-reference/sql/alter-application.md)
command in their account to perform the upgrade. In general, manual upgrades allow the consumer to upgrade their
installed app faster than automated upgrades.

### About the version initializer

A version initializer is used to start or upgrade services or other related processes. The version initializer
is a callback stored procedure defined in the manifest file and implemented in the setup script. The version
initializer callback function is invoked in the following contexts:

* During installation, the version initializer is called as soon as the setup script of the app finishes without
  errors.
* During upgrade, there are two possible scenarios where the version initializer is called:

  + If the setup script of the new version succeeds, then the new version of the version initializer is called.
  + If the setup script or the version initializer of the new version fails, then the version initializer of the
    previous version is called. This allows the version initializer of the previous version to use the
    [ALTER SERVICE](../../../sql-reference/sql/alter-service.md) command to revert the services to the previous version.

## Add a version initializer to the app

In the previous tutorial, you created a basic app with containers. In this section you update this app
to add a version initializer to the app. You also add a version to the application package.

### Add the version initializer to the manifest file

The version initializer is defined in the manifest file of the app. To define the version initializer, add
the following code to the end of the manifest file:

```yaml
lifecycle_callbacks:
  version_initializer: app_public.version_init
```

This specifies the schema and name of the stored procedure used as the version initializer. In the next section,
you implement the `version_init` stored procedure.

### Add the version initializer as a stored procedure to the setup script

In the previous section, you added the name of the version initializer to the manifest file. In this section,
you add the code for the stored procedure to the setup script.

1. Add the following code at the end of the `setup_script.sql` file:

```sqlexample
CREATE OR REPLACE PROCEDURE app_public.version_init()
RETURNS STRING
LANGUAGE SQL
AS
$$
DECLARE
can_create_compute_pool BOOLEAN;  -- Flag to check if 'CREATE COMPUTE POOL' privilege is held
BEGIN
-- Check if the account holds the 'CREATE COMPUTE POOL' privilege
   SELECT SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT('CREATE COMPUTE POOL')
      INTO can_create_compute_pool;

   ALTER SERVICE IF EXISTS core.echo_service
      FROM SPECIFICATION_FILE = 'service/echo_spec.yaml';
   IF (can_create_compute_pool) THEN
      -- When installing app, the app has no 'CREATE COMPUTE POOL' privilege at that time,
      -- so it will not execute the code below

      -- Since the ALTER SERVICE is an async process, wait for the service to be ready
      SELECT SYSTEM$WAIT_FOR_SERVICES(120, 'core.echo_service');
   END IF;
   RETURN 'DONE';
END;
$$;
```

### Upload the changed files and create a version

After modifying the setup script, upload the modified files to the stage and create a version
by performing the following procedure:

1. Run the following command to upload the files and create a version:

   ```snowcli
   snow app version create v1 -c tut-connection
   ```

The `snow app version` command uploads the updated files to the stage. If the application
package and files already exist, this command only uploads files that have changed.

This command creates a new version of the app called v1 with the default patch set to 0.

### Set the default release directive for the app

In the previous section, you uploaded the changed files and created version `v1` of the app. In this
section, you set the default release directive to use version `v1`.

To update the default release directive run the following command from a worksheet:

```sqlexample
ALTER APPLICATION PACKAGE na_spcs_tutorial_pkg
  SET DEFAULT RELEASE DIRECTIVE VERSION=v1 PATCH=0;
```

When you set the default release directive for an app, consumers automatically install that
version when they install the app in their account. In the next section, you create the app
in your local account based on the release directive.

### Create and test the app

Now that you have added a version and set the default release directive, you can create the app
and grant the required privileges:

1. Create the app from the release directive by running the following command:

   ```snowcli
   snow app run --from-release-directive -c tut-connection
   ```

   This command creates the app using the release directive you defined in the previous section.
2. After creating the app, grant the required privileges to the app to be able to run it by running
   the following commands from a worksheet.

   ```sqlexample
   GRANT CREATE COMPUTE POOL ON ACCOUNT TO APPLICATION na_spcs_tutorial_app;
   GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO APPLICATION na_spcs_tutorial_app;
   ```
3. Call the `app_public.start_app` procedure that you defined in the `setup_script.sql`
   file: by running the following command from a worksheet:

   ```sqlexample
   CALL na_spcs_tutorial_app.app_public.start_app();
   ```
4. Confirm the function was created by running the following command from a worksheet:

   ```sqlexample
   SHOW FUNCTIONS LIKE '%my_echo_udf%' IN APPLICATION na_spcs_tutorial_app;
   ```
5. To verify that the service has been created and healthy, run the following command from a worksheet:

   ```sqlexample
   CALL na_spcs_tutorial_app.app_public.service_status();
   ```
6. To call the service function to send a request to the service and verify the response,
   run the following command from a worksheet:

   ```sqlexample
   SELECT na_spcs_tutorial_app.core.my_echo_udf('hello');
   ```
7. To view information about the app, run the following command from a worksheet:

   ```sqlexample
   DESC APPLICATION na_spcs_tutorial_app;
   ```

### Review what you learned in this section

In this section, you completed the following tasks:

* Learned about the version initializer and how you can add it to the manifest file and
  the setup script.
* Learned the basics of versions and patches in the Snowflake Native App Framework.
* Set the default release directive to point to a specific version of an app.
* Installed the app based on the release directive.
* Tested the app by calling a stored procedure and used the [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md)
  command to view the status of the app.

> **Note:**
>
> In this tutorial you created the application object in your local account and used
> the [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) command. This mimics the behavior of the app in the
> consumer account.

## Update the app and upgrade to a new version

In the previous section, you modified the original app by adding the version initializer
as a stored procedure. You also created a new version of the app, version v1, based on the
default release directive.

In this section, you make another change to the app, create version v2, update the default
release directive, and upgrade the installed app from version v1 to version v2.

### Add a new table to the app

To simulate adding a new feature to your app, add a new table to the setup script.

1. Add the following commands to the end of the `setup_script.yml`

   ```sqlexample
   CREATE TABLE IF NOT EXISTS core.setup_script_run(run_at TIMESTAMP);
   GRANT SELECT ON TABLE core.setup_script_run to APPLICATION ROLE app_user;
   INSERT INTO core.setup_script_run(run_at) values(current_timestamp());
   ```

### Create a version of the app

To upload the modified setup script to the stage and create version v2 of the app:

1. Run the following command inside the `na-spcs-tutorial` folder:

```snowcli
snow app version create v2 -c tut-connection
```

This command creates a new version of the app called v2 with the default patch set to 0.

The `snow app version` command uploads the updated files to the stage. If the application package
and files already exist, this command only uploads files that have changed.

### Set the default release directive for the app

After creating version `v2` of the app, set the release directive for the application
package by running the following command from a worksheet:

```sqlexample
ALTER APPLICATION PACKAGE na_spcs_tutorial_pkg
  SET DEFAULT RELEASE DIRECTIVE VERSION=v2 PATCH=0;
```

This command sets the release directive to version `v2` and patch `0`.

### Upgrade the app from v1 to v2

Now that you have updated the release directive to point to the new version, upgrade the app
by running the following command from a worksheet:

```snowcli
snow app run --from-release-directive -c tut-connection
```

### Test the upgraded app

After upgrading the app, test the app by running the following command from a worksheet:

```sqlexample
SELECT na_spcs_tutorial_app.core.my_echo_udf('hello');
```

### Review what you learned in this section

Congratulations! You successfully upgraded the app from version `v1` to version `v2`.

In this section, you completed the following tasks:

* Updated the app to include a table.
* Created a new version for the app based on this update.
* Updated the default release directive to point to the new version.
* Manually upgraded the app.

In the next section, you upgrade the service of the app and simulate an error in
the upgrade process by intentionally adding an error in the setup script.

## Simulate an upgrade error

In the previous section, you added a new table to the app, created a new version, and
upgraded the app.

In this section you update the service specification to simulate an update to the service.
You also add an intentional error to the setup script to simulate an upgrade failure, which
shows you how the version initializer handles service upgrades when the upgrade fails.

### Update the service specification file

In this section, you update the service specification of the app to simulate a change to
the service.

1. In the `service/echo_spec.yaml` file, change the value of `CHARACTER_NAME` from `Bob` to `Tom`.

   This change causes the service to return the following message:

   ```text
   `Tom said hello.`
   ```

The purpose of this change is to allow you to see which version of the service is running after attempting
an upgrade in the following sections.

### Update the setup script to include an intentional error

To simulate an error during the upgrade process, introduce an intentional error in the setup
script by adding a SELECT statement for a table that does not exist.

Add the following statement to the end of the `app_public.version_init()` procedure in the `setup_script.sql`.

```sqlexample
SELECT * FROM table_does_not_exist;
```

This statement is syntactically correct, but refers to a table that does not exist. This causes an error when the setup
script runs during upgrade.

After making this change, the `app_public.version_init()` function should look like the following example:

```sqlexample
GRANT USAGE ON PROCEDURE app_public.service_status() TO APPLICATION ROLE app_user;

CREATE OR REPLACE PROCEDURE app_public.version_init()
RETURNS STRING
LANGUAGE SQL
AS
$$
DECLARE
  -- Flag to check if 'CREATE COMPUTE POOL' privilege is held
  can_create_compute_pool BOOLEAN;
BEGIN
   -- Check if the account holds the 'CREATE COMPUTE POOL' privilege
   SELECT SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT('CREATE COMPUTE POOL')
     INTO can_create_compute_pool;

   ALTER SERVICE IF EXISTS core.echo_service
     FROM SPECIFICATION_FILE = 'service/echo_spec.yaml';
   IF (can_create_compute_pool) THEN
     -- When installing app, the app has no 'CREATE COMPUTE POOL' privilege at that time,
     -- so it will not execute the code below

     -- Since the ALTER SERVICE is an async process, wait for the service to be ready
     SELECT SYSTEM$WAIT_FOR_SERVICES(120, 'core.echo_service');
   END IF;

   -- trigger an error. The upgrade fails
   SELECT * FROM non_exist_table;

   RETURN 'DONE';
END;
$$;
```

### Upload the revised files and create a new patch

In previous sections, you updated the service specification and setup script of the app.

To upload the files and create a new patch for the app, perform the following tasks:

1. Run the following command to add a patch to the application package.

```snowcli
snow app version create v2 --patch 1 -c tut-connection
```

1. When prompted, enter `y` to add a new patch to the application package.

### Set the default release directive for the app

In the previous section, you uploaded the files and created a patch for the updates.
To set the default release directive for the patch, run the following command from a worksheet:

```sqlexample
ALTER APPLICATION PACKAGE na_spcs_tutorial_pkg
  SET DEFAULT RELEASE DIRECTIVE VERSION=v2 PATCH=1;
```

This command sets that patch for the app to patch `1`.

### Upgrade the app

In the previous sections, you made updates to the app and created a new patch. In
this section, you upgrade the app with the expectation that it fails due to the error
you introduced in previous sections.

To upgrade the app, run the following command:

```snowcli
snow app run --from-release-directive -c tut-connection
```

To view the upgrade state of the app, run the following command from a worksheet:

```sqlexample
DESC APPLICATION na_spcs_tutorial_app;
```

This command displays information about the app including the upgrade state, the number of upgrade attempts,
and the reason for an upgrade failure.

After the upgrade fails, Snowflake CLI returns the following message:

```text
Object 'TABLE_DOES_NOT_EXIST' does not exist or not authorized.'
```

Also, after the upgrade fails, the DESC APPLICATION command displays the following properties
related to upgrades:

> | Property | Value |
> | --- | --- |
> | upgrade_state | FAILED |
> | upgrade_failure_reason | upgrade_failure_reason[ErrorCode 2003] Uncaught exception of type ‘STATEMENT_ERROR’ on line 89 at position 0 : Uncaught exception of type ‘STATEMENT_ERROR’ on line 19 at position 3 : SQL compilation error: Object ‘TABLE_DOES_NOT_EXIST’ does not exist or not authorized. |

### Run app service to see which version of the service is running

In the previous section, you simulated a failure when upgrading from version v2, patch 0 to version v2, patch 1.

To determine which version of the service is currently running, run the following command from a worksheet.

```sqlexample
SELECT na_spcs_tutorial_app.core.my_echo_udf('hello');
```

This command returns the following string:

```text
Bob said hello
```

Here, you see that since the upgrade failed, the app continues to run the service from v2, patch 0.

However, if you did not include a version initializer in the app, the upgrade process would have upgraded
the service to v2, patch 1 although the app upgrade failed. If an app upgrade fails, the version initializer
ensures that the version of the service does not upgrade and continues to be in sync with the app.

### Review what you learned in this section

In this section, you completed the following tasks:

* Introduced an error in the setup script to simulate an error in the upgrade process.
* Verified the version of both the app and service after the failure.
* Learned how the version initializer ensures that the version of a service is in synch with the version of the app
  when an upgrade fails.

## Create a patch to fix the upgrade error

In the previous section, you introduced an error in the setup script of the app. When you upgraded
the app, you were able to verify that both the app and the service were continuing to run using
version v2 patch 0.

In this section, you modify the setup script of the app to fix the error, create a patch for
the update, and upgrade the app.

### Modify the setup script

To fix the intentional error you introduced in a previous section, remove the following
statement from the `setup_script.yaml` file:

```sqlexample
SELECT * FROM table_does_not_exist;
```

### Upload the updated files and create a new patch

To upload the modified setup script to a stage and create a new patch, perform the following tasks:

1. Run the following command to create a new patch for the app:

   ```snowcli
   snow app version create v2 --patch 2 -c tut-connection
   ```
2. When prompted, enter `y` to add a new patch to the application package.

### Update the default release directive

In the previous section, you created patch `2` for the app. To set the default
release directive for the patch, run the following command from a worksheet:

```sqlexample
ALTER APPLICATION PACKAGE na_spcs_tutorial_pkg
  SET DEFAULT RELEASE DIRECTIVE VERSION=v2 PATCH=2;
```

### Upgrade the app and verify the version of the service

After creating a new version and setting the default release directive, upgrade the
app and test the service by performing the following tasks:

1. To upgrade the app from version `v2` patch `0` to version `v2` patch `2`,
   run the following command:

   ```snowcli
   snow app run --from-release-directive -c tut-connection
   ```
2. To verify the version of the service that is currently running, run the following
   command from a worksheet:

   ```sqlexample
   SELECT na_spcs_tutorial_app.core.my_echo_udf('hello');
   ```
3. To view the status of the app, including the version that is currently installed,
   run the following command:

   ```sqlexample
   DESC APPLICATION na_spcs_tutorial_app;
   ```

   In the output the `version` property is `v2` and the patch property is `2`.

### Review what you learned in this section

Congratulations! You successfully upgraded the app after the upgrade failure.

In this section, you completed the following tasks:

* Fixed the error in the setup script.
* Created a new patch, `p2`, to update the app.
* Upgraded the app to the new patch.

## Tear down the app and objects created in the tutorial

Because the app uses a compute pool, it uses credits in your account
and costs money to run. To stop the app from consuming resources, you must tear down
both the application object and any of the account-level objects it created, such as the
compute pool.

1. To confirm that the compute pool is currently running, run the following command:

   ```snowcli
   snow object list compute-pool -l "na_spcs_tutorial_app_%"
   ```

   If the compute pool is running, a row with an `ACTIVE` compute pool that was created by the
   application object is displayed.
2. Run the following Snowflake CLI command to tear down the app:

   ```snowcli
   snow app teardown --cascade --force -c tut-connection
   ```

   This command removes all of the Snowflake objects created by the app. Without the `--force` option,
   this command does not drop the application package because it contains versions.
3. To confirm that the compute pool was dropped run the following command again:

   ```snowcli
   snow object list compute-pool -l "na_spcs_tutorial_app_%"
   ```

   This command returns `no data` if the compute pool has been dropped successfully.

> **Note:**
>
> The `snow app teardown` command drops both the application package and application object.
> Therefore, any stateful data is lost.

## Learn more

Congratulations! In this tutorial, you learned how to manually upgrade an app with containers.

### Summary

In this tutorial, you completed the following tasks:

* Added a version initializer stored procedure to handle services during upgrades
  and failures.
* Created a new version definition of the app in the application package. Version
  definitions specify the version number and patch of the app.
* Set the default release directive for an app. Release directives determine which
  version and patch is installed when a consumer installs or upgrades an app.
* Upgraded an app and verified what happens during upgrade failure.

---
title: Tutorial: Snowflake Native SDK for Connectors example Java connector
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/tutorials/native_sdk_example_connector_tutorial.md
section: Native Apps Framework
---

Snowflake

Connector

Native SDK

# Tutorial: Snowflake Native SDK for Connectors example Java connector

## Introduction

Welcome to our tutorial on an example connector built using Snowflake Native SDK for Connectors. This guide will show you
how to build, deploy, install, and configure an example connector.

Provided example application ingests GitHub issues data, by connecting to the GitHub API to pull
information about issues from the specified repositories.

In this tutorial you will learn how to:

* Build an example connector from sources
* Deploy a new application package and version
* Install a new application instance
* Configure the connector instance to ingest data

### Prerequisites

Before getting started please make sure that you meet the following requirements:

* Basic knowledge of Java
* Basic knowledge of [Snowflake Native Apps](../../native-apps-about.md)
* Basic knowledge of [Streamlit UI](https://docs.streamlit.io/)
* Access to a Snowflake account with an `ACCOUNTADMIN` role
* GitHub account, which can create [GitHub apps](https://docs.github.com/en/apps/creating-github-apps/about-creating-github-apps/about-creating-github-apps)
* MacOS or Linux machine to build the project and run deployment scripts

## Prepare your local environment

Before proceeding you need to make sure all necessary software is installed on your machine and
clone the example connector repository.

### Java installation

Snowflake Native SDK for Connectors requires Java LTS (Long-Term Support) version 11 or higher. If
the minimum required version of Java is not installed on your machine, you must install either
Oracle Java or OpenJDK.

#### Oracle Java

The latest LTS release of the JDK is free to download and use, at no cost,
under the Oracle NFTC. For download and installation instructions go to the [Oracle page](https://www.oracle.com/java/technologies/downloads/).

#### OpenJDK

OpenJDK is an open-source implementation of Java. For download and installation instructions go to
[openjdk.org](https://openjdk.org/install/) and [jdk.java.net](https://jdk.java.net/).

You may also use a 3rd party OpenJDK version, such as [Eclipse Temurin](https://adoptium.net/temurin/releases/)
or [Amazon Corretto](https://aws.amazon.com/corretto/).

### Repository cloning

Clone the [connectors-native-sdk](https://github.com/snowflakedb/connectors-native-sdk) repository to your machine.

### Snowflake CLI configuration

The [Snowflake CLI](../../../snowflake-cli/index.md) tool is
required to build, deploy, and install the connector. If you do not have Snowflake CLI on your
machine, install it as per instructions available in [Installing Snowflake CLI](../../../snowflake-cli/installation/installation.md).

After the tool is installed, you need to configure a connection to Snowflake in your
[configuration file](../../../snowflake-cli/connecting/configure-cli.md).

If you do not have any connections configured, create a new one named `native_sdk_connection`. You
can find an example connection in the `deployment/snowflake.toml` file.

If you already have a connection configured and would like to use it with the connector, use its
name instead of `native_sdk_connection` whenever this connection is used in this tutorial.

## Project structure

The Snowflake Native SDK for Connectors project consists of a couple main elements.

### Connectors Native SDK Java

The `connectors-native-sdk-java` directory contains all the Snowflake Native SDK for Connectors Java code, with unit
and integration tests for the SDK components. Because of the nature of Native Apps inside Snowflake,
this means not only Java code, but also SQL code, which is necessary to create a working application.
The definitions of the database objects can be found inside `src/main/resources` directory. Those
files are used while creating an application to customize which objects will be available inside the
application. In the example connector we use the `all.sql` file, which creates objects for all
available features. This file will be executed during the installation process of the application
instance.

### Connectors Native SDK Test Java

The `connectors-native-sdk-test-java` directory contains source code of a helper library used in
unit tests, e.g. objects used to mock particular components, custom assertions etc. Those files are
not a part of the connector application.

### Example Java GitHub connector

The actual example connector is located inside `examples/connectors-native-sdk-example-java-github-connector`
directory. The `app/` directory contains all the files needed to run the Native App:

* The `app/streamlit/` directory contains source files necessary to run the Streamlit UI of the connector.
* The `setup.sql` file is run during the application installation and is responsible for creating the necessary database objects.
  This file will execute the `all.sql` file mentioned before, as well as some custom sql code.
* The `manifest.yml` file is the manifest of the Native App. It is required to create an application
  package and then the application instance itself. This file specifies application properties, as
  well as permissions needed by the application.

Additionally the `examples/connectors-native-sdk-example-java-github-connector` directory contains
the `src/` subdirectory, which contains custom connector logic, such as implementation of the
required classes and customizations of the default SDK components.

### Connectors Native SDK Template

A template Gradle Java project which uses Snowflake Native SDK for Connectors as a dependency to help you quickly build a
new connector. You can read more about in [Tutorial: Snowflake Native SDK for Connectors Java connector template](native_sdk_template_connector_tutorial.md).

## Build, deployment, and installation

The following sections will show you how to build, deploy, and install the example connector.

### Build the connector

Building a connector created using Snowflake Native SDK for Connectors is a bit different from building a typical
Java application. There are some things which must be done besides just building the .jar archives
from the sources. Building the application consists of the following steps:

1. Copying custom internal components to the build directory
2. Copying SDK components to the build directory

#### Copy internal components

This step builds the connector .jar file and then copies it (along with the UI, manifest and setup files)
to the `sf_build` directory.

To run this step execute the command: `./gradlew copyInternalComponents`.

#### Copy SDK components

This step copies the SDK .jar file (added as a dependency to the connector Gradle module) to the
`sf_build` directory and extracts bundled .sql files from the .jar archive.

Those .sql files allow the customization of which provided objects will be created during the
application installation. For the first time users customization is not recommended, because omitting
objects may cause some features to fail if done incorrectly. The example connector application uses
the `all.sql` file, which creates all recommended SDK objects.

To run this step execute the command: `./gradlew copySdkComponents`.

### Deploy the connector

To deploy a Native App an application package needs to be created inside Snowflake. After that all
the files from the `sf_build` directory need to be uploaded to Snowflake.

Please note that for development purposes, version creation is optional, an application instance can be
created directly from staged files. This approach allows you to see changes in most of the connector
files without recreating the version and application instance.

The following operations will be performed:

1. Create a new application package, if it does not already exist
2. Create a schema and file stage inside the package
3. Upload files from the `sf_build` directory to the stage (this step may take some time)

To deploy the connector execute the command: `snow app deploy --connection=native_sdk_connection`.

For more information about the `snow app deploy` command see [snow app deploy](../../../snowflake-cli/command-reference/native-apps-commands/deploy-app.md).

The created application package will now be visible in the `App packages` tab, in the
`Data products` category, in the Snowflake UI of your account.

### Install the connector

Installation of the application is the last step of the process. It creates an application from the
application package created previously.

To install the connector execute the command: `snow app run --connection=native_sdk_connection`.

For more information about the `snow app run` command see [snow app run](../../../snowflake-cli/command-reference/native-apps-commands/run-app.md).

The installed application will now be visible in the `Installed apps` tab, in the
`Data products` category, in the Snowflake UI of your account.

### Update connector files

If at any point you wish to modify any of the connector files, you can easily upload the modified
files into the application package stage. The upload command depends on which files were updated.

Before any of the update commands are run, you have to copy the new files of your connector to the
`sf_build` directory by running: `./gradlew copyInternalComponents`

#### UI .py files or connector .java files

Use the `snow app deploy --connection=native_sdk_connection` command, the current application
instance will use the new files without reinstallation.

#### setup.sql or manifest.yml files

Use the `snow app run --connection=native_sdk_connection` command, the current application
instance will be reinstalled after the new files are uploaded to stage.

## Connector flow

Before we move to configuring the connector and ingesting the data, we should have a quick look at
how the connector actually works. Below you can see all the steps that will be completed in the
next steps of this tutorial. The starting point will be completing the prerequisites and going
through the Wizard.

The Wizard stage of the connector guides the users through all the required configurations needed
by the connector. The Daily Use stage allows user to view statistics, configure repositories for
ingestion and pause/resume the connector.

## Configuration Wizard

After opening the application the Wizard UI page will be opened. The connector needs some information
provided by the user before it can start ingesting data. The Wizard will guide you through all the
required steps in the application itself, but also on the whole Snowflake account and sometimes even
the external system that will be the source of the ingested data, in this case GitHub. After all
these steps are finished, the connector will be ready to start ingesting the data.

## Prerequisites

The first step of the Wizard are the prerequisites. This step will provide you a list of things which
should be prepared before configuring the connector. Completing the prerequisites is not required,
but it is recommended to ensure smoother configuration process later.

In the case of the example GitHub connector there are two things that need to be taken care of before
going further:

1. Preparing a GitHub account
2. Confirming access to the GitHub repositories you want to ingest

## Connector Configuration

Next step of the Wizard is connector configuration. This step allows the user to:

* Grant application privileges, which are requested using the
  [Native Apps Permission SDK](../../requesting-ui.md)
* Choose a warehouse which will be referenced when scheduling ingestion tasks
* Choose destination database and schema for the data that will be ingested

### Privileges

Application requires two account level permissions to operate: `CREATE DATABASE` and `EXECUTE TASK`.

The first privilege is needed to create a destination database for the ingested data. This database
should be created outside the application, so that the ingested data can be left intact if the application
is uninstalled. However, this example does not support this feature, a new database is always created.

The second privilege is needed to schedule periodic tasks that will fetch the data from GitHub and
save it in the destination database.

Granting those privileges can be done using the security tab or by pressing the `Grant privileges`
button in the connector configuration screen. The latter one will result in a popup appearing on the
screen.

### Warehouse reference

The connector requires a warehouse to run and schedule ingestion tasks. Application will use the
warehouse through a [reference](../../requesting-refs.md).
Warehouse reference is defined in the `manifest.yml` file:

```yaml
references:
  - WAREHOUSE_REFERENCE:
      label: "Warehouse used for ingestion"
      description: "Warehouse which will be used to schedule ingestion tasks"
      privileges: [USAGE]
      object_type: WAREHOUSE
      register_callback: PUBLIC.REGISTER_REFERENCE
```

The reference can set using the security tab, the same as the privileges above, or by pressing the
`Choose warehouse` button.

### Destination database and schema

As mentioned before, the connector requires a database to store the ingested data. This database will
be created, during a later step, with the schema specified by the user. Name of the database is up
to the user, as long as the provided database does not already exist.

The completed connector configuration screen will look similar to this one:

## Connection Configuration

Next step of the Wizard is connection configuration. This step allows user to set up the connection
to an external data source. We recommend using OAuth2 authentication whenever possible, instead of
using user/password or plaintext tokens.

GitHub currently supports two ways of OAuth2 authentication: OAuth apps and GitHub apps. OAuth apps
are a bit easier to set up and use, however they do not provide the same level of permission control
granularity. We recommend using a GitHub app for this example; however if you wish to use an OAuth
app, the connector will still work as intended.

### Permission SDK setup

OAuth2 authentication requires a security integration, secret and external access integration to be
created in the user’s account. Our connector uses the
[Native Apps Permission SDK](../../requesting-ui.md)
to request the creation of those objects.

References for the external access integration and secret, which are needed by the connector, are
defined in the `manifest.yml` file:

```yaml
references:
  - GITHUB_EAI_REFERENCE:
      label: "GitHub API access integration"
      description: "External access integration that will enable connection to the GitHub API using OAuth2"
      privileges: [USAGE]
      object_type: "EXTERNAL ACCESS INTEGRATION"
      register_callback: PUBLIC.REGISTER_REFERENCE
      configuration_callback: PUBLIC.GET_REFERENCE_CONFIG
  - GITHUB_SECRET_REFERENCE:
      label: "GitHub API secret"
      description: "Secret that will enable connection to the GitHub API using OAuth2"
      privileges: [READ]
      object_type: SECRET
      register_callback: PUBLIC.REGISTER_REFERENCE
      configuration_callback: PUBLIC.GET_REFERENCE_CONFIG
```

In addition, a special procedure needs to be added in the `setup.sql` file. It is referenced in the
`configuration_callback` property for each of the references presented above:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.GET_REFERENCE_CONFIG(ref_name STRING)
    RETURNS STRING
    LANGUAGE SQL
    AS
        BEGIN
            CASE (ref_name)
                WHEN 'GITHUB_EAI_REFERENCE' THEN
                    RETURN OBJECT_CONSTRUCT(
                        'type', 'CONFIGURATION',
                        'payload', OBJECT_CONSTRUCT(
                            'host_ports', ARRAY_CONSTRUCT('api.github.com'),
                            'allowed_secrets', 'LIST',
                            'secret_references', ARRAY_CONSTRUCT('GITHUB_SECRET_REFERENCE')
                        )
                    )::STRING;
                WHEN 'GITHUB_SECRET_REFERENCE' THEN
                    RETURN OBJECT_CONSTRUCT(
                        'type', 'CONFIGURATION',
                        'payload', OBJECT_CONSTRUCT(
                            'type', 'OAUTH2',
                            'security_integration', OBJECT_CONSTRUCT(
                                'oauth_scopes', ARRAY_CONSTRUCT('repo'),
                                'oauth_token_endpoint', 'https://github.com/login/oauth/access_token',
                                'oauth_authorization_endpoint', 'https://github.com/login/oauth/authorize'
                            )
                        )
                    )::STRING;
                ELSE
                    RETURN '';
            END CASE;
        END;
```

For the external access integration reference the procedure provides:

* `host_ports` - hostnames of the external data source, which will be accessed during ingestion
* `secret_references` - array of names of references to OAuth secrets
* `allowed_secrets` - `LIST`, telling the Permission SDK to use secrets specified in the
  `secret_references` field

For the secret reference the procedure provides:

* `type` - `OAUTH2` in case of our secret
* `security_integration` - properties of the created security integration:

  + `oauth_scopes` - a list of OAuth scopes requested by the connector (if using a GitHub app -
    this field is optional)
  + `oauth_token_endpoint` - endpoint from which the refresh and access token will be acquired
  + `oauth_authorization_endpoint` - endpoint to which the authorization request will be sent

### GitHub app setup

The next step is the setup of a GitHub app in the user’s account. This app will be used to grant
limited access to the account, so that data can be ingested.

The first step is to press the `Request access` button in the connector UI.

The first screen allows you to review the endpoints, for which external connectivity will be allowed.

After pressing `Next`, you will see a second screen. Select `OAuth2` to create a new integration
and secret, and copy the provided redirect URL, it will contain your organization name and the
region of your account.

Next go to your GitHub account settings page, then into `Developer settings > GitHub Apps` and
press the `New GitHub App` button:

1. Enter the name and homepage URL of your app
2. Paste the redirect URL you copied into the `Callback URL` field
3. Make sure the `Expire user authorization tokens` option is selected
4. Make sure the `Request user authorization (OAuth) during installation` is not selected
5. If you do not need it, deselect the `Active` option in the `Webhook` section
6. Select the permissions needed for the connector to work:

   1. `Repository permissions > Issues` with the `Read-only` access
   2. `Repository permissions > Metadata` with the `Read-only` access
7. If the app will only be used by you, with this example connector, it is best to select
   `Only on this account` in the installation access section

After the app is created, press the `Install app` option in the left sidebar and install the
application in your account. You can choose which repositories the app (and by extension the
connector) will be able to access. Without this installation, the connector will only be able to
access public repositories.

### OAuth integration setup

After installation return to your GitHub app and generate a new client secret. Make sure to copy it
immediately, as it will not be shown again. Paste the client secret in the OAuth configuration popup
of your connector. Finally, copy the client ID (not app ID) of your application and also paste it in
the OAuth configuration popup of your connector.

After pressing `Connect` a GitHub window will pop up, asking you for app authorization on your
GitHub account. After authorizing, you will be automatically redirected back to the connector UI.
After successful authorization (it may take a couple seconds to finish and close the popup) the
page will be populated with the IDs of external access integration and secret references.

Pressing the `Connect` button will trigger the `TEST_CONNECTION` procedure inside the connector.
This procedure will try to access the [GitHub API octocat endpoint](https://api.github.com/octocat),
to check if external connectivity was configured correctly, and the OAuth access token obtained
correctly.

When the test succeeds, application will proceed into the finalization step.

## Configuration Finalization

Finalization is the last step of the Wizard. In this step you will be asked to provide an organisation
and a repository name. This repository must be accessible with the OAuth token obtained during the
connection configuration step. The provided repository will be used only for connection validation
purposes.

This is a bit different from the previous step, because the `TEST_CONNECTION` procedure only checks
if the GitHub API is accessible and the provided token is valid.

Finalization step ensures that repository provided by user is accessible with the provided GitHub
API token. It will fail if the token does not have required permissions to access the repository.
If you would like to ingest data from private repositories, we recommend finalizing the configuration
using a private repository, just to make sure they work correctly.

Additionally, during this step the database and schema specified in connector configuration phase
will finally be created in your account.

## Daily Use

After the Configuration Wizard is completed successfully you can now start using your example GitHub
connector.

Next steps will explain:

* How to configure resources to ingest the data
* How the ingestion process works
* How to view statistics of the ingested records
* How to view ingested data
* How to pause and resume connector

## Configuring resources

To configure resources go to the `Data Sync` tab. This tab displays a list of the repositories
already configured for ingestion. When opened for the first time the list will be empty.

To configure a resource enter the organisation and repository names in the designated fields, then
press the `Queue ingestion` button. For example:

The definition for a new resource will be saved, and it will be picked up by the scheduler according
to the global schedule. **It will take some time before the data is ingested and visible in the sink
table.** It will be visible in the table below:

## Ingestion schedule and status

At the top of the `Data Sync` tab there is a section containing general information about the
ingestion. This section allows the user to see the global schedule with which the configured resources
will be ingested. The label at the bottom right corner shows the current ingestion status. At first
it will show the `NOT SYNCING` state, until the first resource is defined. After that it will
transition to `SYNCING`, and finally when at least one resource ingestion is successfully
finished, it will show the finish date of that ingestion.

## Ingestion process

Ingestion process is handled using a `Scheduler Task` and `Task Reactor` components. The scheduler
picks up the defined resources according to the global schedule and submits them as `Work Items` to
a queue. Then task reactor component called a `Dispatcher` picks them up and splits between the
defined number of workers. Each worker performs the actual ingestion for every item from the queue
that it picks up.

Singular ingestion of a resource consists of fetching the data from the endpoints in the GitHub API
and then saving them in the designated tables in the sink database. For this example purposes all
the data is fetched in every run, which results in new records being added to the table and old records
being updated. Additionally, execution of each `Work Item` includes logging data like the start and
end date, number of ingested rows, status etc. to internal connector tables, which are then used for
statistics purposes.

## Viewing statistics

The `Home` screen contains statistics from the past ingestion runs. The data is based on the
`PUBLIC.AGGREGATED_CONNECTOR_STATS` view. The view aggregates the number of ingested rows based
on the hour of the day when it was ingested. The data from this view can be retrieved using `SELECT`
queries run in a worksheet, that way it can also be aggregated by a time window bigger than an hour.

There is another view named `PUBLIC.CONNECTOR_STATS` that is available through the worksheet. Using this
data, you can see the status, start and end date, average rows ingested per seconds and some other
information regarding data ingestion.

Example ingestion statistics chart:

## Viewing ingested data

Ingested data is not visible in the UI, but can be viewed by querying data from specific tables, by
users with `ADMIN` or `DATA_READER` roles. To view the data you must to go to a SQL worksheet and just
select the destination database. The destination database uses name and schema defined during the
connector configuration step. You can `SELECT` data from:

1. The `ISSUES` table, it contains the following columns:

   * ORGANIZATION
   * REPOSITORY
   * RAW_DATA
2. The `ISSUES_VIEW` view, it contains the following columns:

   * ID
   * ORGANIZATION
   * REPOSITORY
   * STATE
   * TITLE
   * CREATED_AT
   * UPDATED_AT
   * ASSIGNEE

Data visible in the `ISSUES_VIEW` view is extracted from the `raw_data` column found in the
`ISSUES` table. To see the data you can use one of the following queries:

```sqlexample
SELECT * FROM DEST_DATABASE.DEST_SCHEMA.ISSUES;

SELECT * FROM DEST_DATABASE.DEST_SCHEMA.ISSUES_VIEW;
```

## Pausing and resuming

The connector can be paused and resumed, whenever desired. To do so just press the `Pause` button
in the `Data Sync` tab. When pausing is triggered the underlying scheduling and work execution mechanism
is disabled. However, any active ingestion work will finish before the connector actually goes into
the `PAUSED` state. Because of that, it can take up to a couple minutes for the connector to
fully pause.

To resume the connector, just have to press `Resume` button, which will be displayed in place of
the `Pause` button. This will resume the scheduling task which will start queueing new `Work Items`.

## Connector settings

After configuration is finished one more tab called `Settings` becomes available. This tab allows
the user to see current connector and connection configurations. The data from this tab is extracted
from the underlying `APP_CONFIG` configuration table and is read only.

## Troubleshooting

If the connector encounters any problems, they will be visible in the `event table` logs, if the
table is created and set in the account.

More on the enabling and using the `event table`, event logging, and event sharing in Native Apps
can be found in the documentation:

* [Event table overview](../../../logging-tracing/event-table-setting-up.md)
* [Use logging and event tracing for an app](../../event-about.md)

## Cleanup

After the tutorial is completed you can either pause the connector as explained in the Daily Use
section or completely remove it from your account using the command:

`snow app teardown --connection=native_sdk_connection --cascade --force`

The `--cascade` option is needed to remove the destination database without transferring the ownership
to the account admin. In real connectors the database should not be removed to preserve the ingested
data, it should be either owned by the account admin or ownership should be transferred before
uninstallation.

**If the cleanup part is skipped, the example connector will consume credits until it is paused or
removed, even if no repositories were configured for ingestion!**

## Customization

This tutorial has shown you an example connector built using Snowflake Native SDK for Connectors. To learn more about
how to customize the connector, or build your own from scratch, see:

* [Snowflake Native SDK for Connectors](../about-connector-sdk.md)
* [Tutorial: Snowflake Native SDK for Connectors Java connector template](native_sdk_template_connector_tutorial.md)

---
title: Tutorial: Snowflake Native SDK for Connectors Java connector template
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/tutorials/native_sdk_template_connector_tutorial.md
section: Native Apps Framework
---

Snowflake

Connector

Native SDK

# Tutorial: Snowflake Native SDK for Connectors Java connector template

## Introduction

Welcome to our tutorial on using a connector template utilizing Snowflake Native SDK for Connectors. This guide will help you setup a simple Connector Native Application.

In this tutorial you will learn how to:

* Deploy a Connector Native Application
* Configure a template connector to ingest data
* Customize a template connector to your own needs

The template contains various helpful comments in the code to make it easier for you to find specific files that need to be modified.
Look for the comments with the following keywords, they will guide you and help implement your own connector:

* `TODO`
* `TODO: HINT`
* `TODO: IMPLEMENT ME`

Before you begin this tutorial, you should prepare yourself by reviewing the following recommended content:

* [Snowflake Native SDK for Connectors](../about-connector-sdk.md)
* [Tutorial: Snowflake Native SDK for Connectors example Java connector](native_sdk_example_connector_tutorial.md)

### Prerequisites

Before getting started please make sure that you meet the following requirements:

* Access to a Snowflake account with an `ACCOUNTADMIN` role
* Review [Snowflake Native SDK for Connectors](../about-connector-sdk.md) and keep it open while following this tutorial
* Review [Tutorial: Snowflake Native SDK for Connectors example Java connector](native_sdk_example_connector_tutorial.md)

  + That tutorial uses an example connector based on this template and it can be referenced to check
    out example implementations of various components.

## Prepare your local environment

Before proceeding you need to make sure all necessary software is installed on your machine and
clone the connector template.

### Java installation

Snowflake Native SDK for Connectors requires Java LTS (Long-Term Support) version 11 or higher. If
the minimum required version of Java is not installed on your machine, you must install either
Oracle Java or OpenJDK.

#### Oracle Java

The latest LTS release of the JDK is free to download and use, at no cost,
under the Oracle NFTC. For download and installation instructions go to the [Oracle page](https://www.oracle.com/java/technologies/downloads/).

#### OpenJDK

OpenJDK is an open-source implementation of Java. For download and installation instructions go to
[openjdk.org](https://openjdk.org/install/) and [jdk.java.net](https://jdk.java.net/).

You may also use a 3rd party OpenJDK version, such as [Eclipse Temurin](https://adoptium.net/temurin/releases/)
or [Amazon Corretto](https://aws.amazon.com/corretto/).

### Snowflake CLI configuration

The [Snowflake CLI](../../../snowflake-cli/index.md) tool is
required to build, deploy and install the connector. If you do not have Snowflake CLI on your
machine - install it as per instructions available in [Installing Snowflake CLI](../../../snowflake-cli/installation/installation.md).

After the tool is installed - you need to configure a connection to Snowflake in your
[configuration file](../../../snowflake-cli/connecting/configure-cli.md).

If you do not have any connections configured - create a new one named `native_sdk_connection`. You
can find an example connection in the `deployment/snowflake.toml` file.

If you already have a connection configured and would like to use it with the connector - use its
name instead of `native_sdk_connection` whenever this connection is used in this tutorial.

### Template cloning

To clone the connector template use the following command:

```output
snow init <project_dir> \
  --template-source https://github.com/snowflakedb/connectors-native-sdk \
  --template templates/connectors-native-sdk-template
```

In place of `<project_dir>` enter the name of the directory (it must not exist) in which the Java
project of your connector will be created.

After executing the command you will be asked to provide additional information for application instance
and stage name configuration. You may provide any names, as long as they are valid unquoted Snowflake
identifiers, or click enter to use the default values, which are shown in the square brackets.

An example command execution, providing custom application and stage names:

```output
$ snow init my_connector \
    --template-source https://github.com/snowflakedb/connectors-native-sdk \
    --template templates/connectors-native-sdk-template

Name of the application instance which will be created in Snowflake [connectors-native-sdk-template]: MY_CONNECTOR
Name of the schema in which the connector files stage will be created [TEST_SCHEMA]:
Name of the stage used to store connector files in the application package [TEST_STAGE]: CUSTOM_STAGE_NAME
Initialized the new project in my_connector
```

## Connector build, deployment, and cleanup

The template can be deployed out of the box, even before any modification. The following sections will
show you how to build, deploy and install the connector.

### Build the connector

Building a connector created using Snowflake Native SDK for Connectors is a bit different from building a typical
Java application. There are some things which must be done besides just building the .jar archives
from the sources. Building the application consists of the following steps:

1. Copying custom internal components to the build directory
2. Copying SDK components to the build directory

#### Copy internal components

This step builds the connector .jar file and then copies it (along with the UI, manifest and setup
files) to the `sf_build` directory.

To run this step execute the command: `./gradlew copyInternalComponents`.

#### Copy SDK components

This step copies the SDK .jar file (added as a dependency to the connector Gradle module) to the
`sf_build` directory and extracts bundled .sql files from the .jar archive.

Those .sql files allow the customization of which provided objects will be created during the
application installation. For the first time users customization is not recommended, because omitting
objects may cause some features to fail if done incorrectly. The template connector application uses
the `all.sql` file, which creates all recommended SDK objects.

To run this step execute the command: `./gradlew copySdkComponents`.

### Deploy the connector

To deploy a Native App an application package needs to be created inside Snowflake. After that all
the files from the `sf_build` directory need to be uploaded to Snowflake.

Please note - for development purposes version creation is optional, an application instance can be
created directly from staged files. This approach allows you to see changes in most of the connector
files without recreating the version and application instance.

The following operations will be performed:

1. Create a new application package, if it does not already exist
2. Create a schema and file stage inside the package
3. Upload files from the `sf_build` directory to the stage (this step may take some time)

To deploy the connector execute the command: `snow app deploy --connection=native_sdk_connection`.

For more information about the `snow app deploy` command see [snow app deploy](../../../snowflake-cli/command-reference/native-apps-commands/deploy-app.md).

The created application package will now be visible in the `App packages` tab, in the
`Data products` category, in the Snowflake UI of your account.

### Install the connector

Installation of the application is the last step of the process. It creates an application from the
application package created previously.

To install the connector execute the command: `snow app run --connection=native_sdk_connection`.

For more information about the `snow app run` command see [snow app run](../../../snowflake-cli/command-reference/native-apps-commands/run-app.md).

The installed application will now be visible in the `Installed apps` tab, in the
`Data products` category, in the Snowflake UI of your account.

### Update connector files

If at any point you wish to modify any of the connector files - you can easily upload the modified
files into the application package stage. The upload command depends on which files were updated.

Before any of the update commands are run - you have to copy the new files of your connector to the
`sf_build` directory by running: `./gradlew copyInternalComponents`

#### UI .py files or connector .java files

Use the `snow app deploy --connection=native_sdk_connection` command, the current application
instance will use the new files without reinstallation.

#### setup.sql or manifest.yml files

Use the `snow app run --connection=native_sdk_connection` command, the current application
instance will be reinstalled after the new files are uploaded to stage.

### Cleanup

After the tutorial is completed, or if for any reason you want to remove the application and its
package, you can completely remove them from your account using the command:

`snow app teardown --connection=native_sdk_connection --cascade --force`

The `--cascade` option is needed to remove the destination database without transferring the ownership
to the account admin. In real connectors the database should not be removed to preserve the ingested
data, it should be either owned by the account admin or ownership should be transferred before
uninstallation.

**Please note - the connector will consume credits until it is paused or removed, even if no
ingestion was configured!**

## Prerequisites step

Right after installation the Connector is in its Wizard phase. This phase consists of a few steps
that guide the end user through all the necessary configurations.

The first step is the Prerequisites step. It is optional and might not be necessary for every connector.
Prerequisites are usually actions required from the user outside of the application, e.g. running
queries in the SQL worksheet, doing configuration on the source system side, etc.

Read more about prerequisites: [Prerequisites](../flow/prerequisites.md)

The contents of each prerequisite are retrieved directly from the `STATE.PREREQUISITES` table,
located inside the connector. They can be customized through the `setup.sql` script. However, keep
in mind that the `setup.sql` script is executed on every installation, upgrade and downgrade of the
application. The inserts must be idempotent, because of this it is recommended to use a merge query
as in the example below:

```sqlexample
MERGE INTO STATE.PREREQUISITES AS dest
USING (SELECT * FROM VALUES
           ('1',
            'Sample prerequisite',
            'Prerequisites can be used to notice the end user of the connector about external configurations. Read more in the SDK documentation below. This content can be modified inside `setup.sql` script',
            'https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/flow/prerequisites',
            NULL,
            NULL,
            1
           )
) AS src (id, title, description, documentation_url, learnmore_url, guide_url, position)
ON dest.id = src.id
WHEN NOT MATCHED THEN
    INSERT (id, title, description, documentation_url, learnmore_url, guide_url, position)
    VALUES (src.id, src.title, src.description, src.documentation_url, src.learnmore_url, src.guide_url, src.position);
```

## Connector configuration step

The next step of the Wizard Phase is the connector configuration step. During this step you can
configure database objects and permissions required by the connector. This step allows for the
following configuration properties to be specified:

* `warehouse`
* `operational_warehouse`
* `cortex_warehouse`
* `destination_database`
* `destination_schema`
* `global_schedule`
* `data_owner_role`
* `cortex_user_role`
* `agent_username`
* `agent_role`

If you need any other custom properties, they can be configured in one of the next steps of the Wizard
phase. For more information on each of the properties see: [Connector configuration](../flow/connector_configuration.md)

Additionally, the Streamlit component (`streamlit/wizard/connector_config.py`) provided in the
template shows how to trigger the [Native Apps Permission SDK](../../requesting-ui.md)
and requests privilege grants from the end-user. As long as the available properties satisfy the
needs of the connector then there is no need to overwrite any of the backend classes, although this
is still possible the same way as for the components in the further steps of the configuration.

For more information on internal procedures and Java objects see: [Connector configuration reference](../reference/connector_configuration_reference.md)

The provided Streamlit example allows for requesting account level privileges configured in the
`manifest.yml` file - `CREATE DATABASE` and `EXECUTE TASKS`. It also allows the user to specify
a warehouse reference through the Permission SDK popup.

In the template, the user is asked to only provide the `destination_database` and `destination_schema`.
However, a `TODO` comment in `streamlit/wizard/connector_configuration.py` contains commented
code that can be reused to display more input fields in the Streamlit UI.

```python
# TODO: Here you can add additional fields in connector configuration.
# For example:
st.subheader("Operational warehouse")
input_col, _ = st.columns([2, 1])
with input_col:
    st.text_input("", key="operational_warehouse", label_visibility="collapsed")
st.caption("Name of the operational warehouse to be used")
```

## Connection configuration step

The next step of the Wizard Phase is the connection configuration step. This step allows the end-user
to configure external connectivity parameters for the connector. This configuration may include
identifiers of objects like secrets, integrations, etc.

Because this information varies depending on the source system for the data ingested by the connector,
this is the first place where bigger customizations have to be made in the source code.

For more information on connection configuration see:

* [Connection configuration](../flow/connection_configuration.md)
* [Connection configuration reference](../reference/connection_configuration_reference.md)

Starting with the Streamlit UI side (`streamlit/wizard/connection_config.py`) you need to add text
inputs for all needed parameters. An example text input is implemented for you and if you search the
code in this file, you can find a `TODO` with commented code for a new field.

```python
# TODO: Additional configuration properties can be added to the UI like this:
st.subheader("Additional connection parameter")
input_col, _ = st.columns([2, 1])
with input_col:
    st.text_input("", key="additional_connection_property", label_visibility="collapsed")
st.caption("Some description of the additional property")
```

After the properties are added to the form, they need to be passed to the backend layer of the connector.
To do so, two additional places must be modified in the Streamlit files. The first one is the
`finish_config` function in the `streamlit/wizard/connection_config.py` file. The state of the
newly added text inputs must be read here. Additionally, it can be validated if needed, and then passed
to the `set_connection_configuration` function.

For example if `additional_connection_property` was added it would look like this after the edits:

```python
def finish_config():
try:
    # TODO: If some additional properties were specified they need to be passed to the set_connection_configuration function.
    # The properties can also be validated, for example, check whether they are not blank strings etc.
    response = set_connection_configuration(
        custom_connection_property=st.session_state["custom_connection_property"],
        additional_connection_property=st.session_state["additional_connection_property"],
    )

# rest of the method without changes
```

Then the `set_connection_configuration` function must be edited, it can be found in the
`streamlit/native_sdk_api/connection_config.py` file. This function is a proxy between Streamlit UI
and the underlying SQL procedure, which is an entry points to the backend of the connector.

```python
def set_connection_configuration(custom_connection_property: str, additional_connection_property: str):
    # TODO: this part of the code sends the config to the backend so all custom properties need to be added here
    config = {
        "custom_connection_property": escape_identifier(custom_connection_property),
        "additional_connection_property": escape_identifier(additional_connection_property),
    }

    return call_procedure(
        "PUBLIC.SET_CONNECTION_CONFIGURATION",
        [variant_argument(config)]
    )
```

After doing this, the new property is saved in the internal connector table, which contains the
configuration. However, this is not the end of the possible customizations. Some backend components
can be customized too, look for the following comments in the code to find them:

* `TODO: IMPLEMENT ME connection configuration validate`
* `TODO: IMPLEMENT ME connection callback`
* `TODO: IMPLEMENT ME test connection`

The validate part allows for any additional validation on the data received from the UI. It can also
transform the data, e.g. change the character case, trim the provided data, or check if objects with
provided names actually exist inside Snowflake.

Connection callback is a part that lets you perform any additional operation based on the config, e.g.
alter procedures that need to use external access integrations, using a solution described in
[External integration setup reference](../reference/setup_external_integration.md).

Test connection is the final component of the connection configuration, it checks whether the connection
can be established between the connector and the source system.

For more information on those internal components see:

* [Connection configuration](../flow/connection_configuration.md)
* [Connection configuration reference](../reference/connection_configuration_reference.md)

Example implementations might look like this:

```java
public class TemplateConfigurationInputValidator implements ConnectionConfigurationInputValidator {

    private static final String ERROR_CODE = "INVALID_CONNECTION_CONFIGURATION";

    @Override
    public ConnectorResponse validate(Variant config) {
      // TODO: IMPLEMENT ME connection configuration validate: If the connection configuration input
      // requires some additional validation this is the place to implement this logic.
      // See more in docs:
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/reference/connection_configuration_reference
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/flow/connection_configuration
      var integrationCheck = checkParameter(config, INTEGRATION_PARAM, false);
      if (!integrationCheck.isOk()) {
        return integrationCheck;
      }

      var secretCheck = checkParameter(config, SECRET_PARAM, true);
      if (!secretCheck.isOk()) {
        return ConnectorResponse.error(ERROR_CODE);
      }

      return ConnectorResponse.success();
    }
}
```

```java
public class TemplateConnectionConfigurationCallback implements ConnectionConfigurationCallback {

    private static final String[] EXTERNAL_SOURCE_PROCEDURE_SIGNATURES = {
        asVarchar(format("%s.%s()", PUBLIC_SCHEMA, TEST_CONNECTION_PROCEDURE)),
        asVarchar(format("%s.%s(VARIANT)", PUBLIC_SCHEMA, FINALIZE_CONNECTOR_CONFIGURATION_PROCEDURE)),
        asVarchar(format("%s.%s(NUMBER, STRING)", PUBLIC_SCHEMA, WORKER_PROCEDURE))
      };

    private final Session session;

    public TemplateConnectionConfigurationCallback(Session session) {
      this.session = session;
    }

    @Override
    public ConnectorResponse execute(Variant config) {
      // TODO: If you need to alter some procedures with external access you can use
      // configureProcedure method or implement a similar method on your own.
      // TODO: IMPLEMENT ME connection callback: Implement the custom logic of changes in application
      // to be done after connection configuration, like altering procedures with external access.
      // See more in docs:
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/reference/connection_configuration_reference
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/flow/connection_configuration
      var response = configureProceduresWithReferences();
      if (response.isNotOk()) {
         return response;
      }
      return ConnectorResponse.success();
    }

    private ConnectorResponse configureProceduresWithReferences() {
      return callProcedure(
        session,
        PUBLIC_SCHEMA,
        SETUP_EXTERNAL_INTEGRATION_WITH_NAMES_PROCEDURE,
        EXTERNAL_SOURCE_PROCEDURE_SIGNATURES);
    }
}
```

```java
public class TemplateConnectionValidator {

    private static final String ERROR_CODE = "TEST_CONNECTION_FAILED";

    public static Variant testConnection(Session session) {
      // TODO: IMPLEMENT ME test connection: Implement the custom logic of testing the connection to
      // the source system here. This usually requires connection to some webservice or other external
      // system. It is suggested to perform only the basic connectivity validation here.
      // If that's the case then this procedure must be altered in TemplateConnectionConfigurationCallback first.
      // See more in docs:
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/reference/connection_configuration_reference
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/flow/connection_configuration
      return test().toVariant();
    }

    private static ConnectorResponse test() {
      try {
        var response = SourceSystemHttpHelper.testEndpoint();

        if (isSuccessful(response.statusCode())) {
          return ConnectorResponse.success();
        } else {
          return ConnectorResponse.error(ERROR_CODE, "Connection to source system failed");
        }
      } catch (Exception exception) {
        return ConnectorResponse.error(ERROR_CODE, "Test connection failed");
      }
    }
}
```

## Finalize configuration step

The finalize connector configuration step is the final step of the Wizard Phase. This step has multiple
responsibilities:

1. Allows the user to specify any additional configuration needed by the connector
2. Creates the sink database, schema and additional tables and views for the ingested data if needed
3. Initializes internal components such as the scheduler and task reactor

For more information on configuration finalization see:

* [Finalize configuration](../flow/finalize_configuration.md)
* [Finalize configuration reference](../reference/finalize_configuration_reference.md)

For more information on task reactor and scheduling see:

* [Task reactor](../using/task_reactor.md)
* [Task reactor SQL reference](../reference/task_reactor_reference.md)
* [Ingestion scheduler](../using/scheduler.md)
* [Ingestion scheduler reference](../reference/scheduler_reference.md)

Similarly to the connection configuration step, customization can be started with the Streamlit UI.
The `streamlit/wizard/finalize_config.py` file contains a form with an example property. More
properties can be added according to the connector needs. To add another property look for a `TODO`
comment, that contains example code of adding a new property in the mentioned file.

```python
# TODO: Here you can add additional fields in finalize connector configuration.
# For example:
st.subheader("Some additional property")
input_col, _ = st.columns([2, 1])
with input_col:
    st.text_input("", key="some_additional_property", label_visibility="collapsed")
st.caption("Description of some new additional property")
```

After adding the text input for a new property it needs to be passed to the backend side. To do so,
modify the `finalize_configuration` function in the same file:

```python
def finalize_configuration():
    try:
        st.session_state["show_main_error"] = False
        # TODO: If some additional properties were introduced, they need to be passed to the finalize_connector_configuration function.
        response = finalize_connector_configuration(
            st.session_state.get("custom_property"),
            st.session_state.get("some_additional_property")
        )
```

Next, open the `streamlit/native_sdk_api/finalize_config.py` file and add the new property to the
following function:

```python
def finalize_connector_configuration(custom_property: str, some_additional_property: str):
    # TODO: If some custom properties were configured, then they need to be specified here and passed to the FINALIZE_CONNECTOR_CONFIGURATION procedure.
    config = {
        "custom_property": custom_property,
        "some_additional_property": some_additional_property,
    }
    return call_procedure(
        "PUBLIC.FINALIZE_CONNECTOR_CONFIGURATION",
        [variant_argument(config)]
    )
```

Again, similarly to the connection configuration step, this step also allows for the customization of
various backend components, they can be found using the following comments in the source code:

* `TODO: IMPLEMENT ME validate source`
* `TODO: IMPLEMENT ME finalize internal`

The validate source part is responsible for performing more sophisticated validations on the source
systems. If the previous test connection only checked that a connection can be established, then validate
source could check access to specific data in the system, e.g. extracting a single record of data.

Finalize internal is an internal procedure responsible for initializing task reactor and scheduler,
creating a sink database and any necessary nested objects. It can also be used to save the configuration
provided during the finalize step (this configuration is not saved by default).

More information on the internal components can be found in:

* [Finalize configuration](../flow/finalize_configuration.md)
* [Finalize configuration reference](../reference/finalize_configuration_reference.md)

Additionally, input can be validated using the `FinalizeConnectorInputValidator` interface and
providing it to the finalize handler - check the `TemplateFinalizeConnectorConfigurationCustomHandler` file.
More information on using builders can be found in: [Stored procedures and handlers customization](../using/sproc_and_handlers_customization.md).

Example implementation of the validate source might look like this:

```java
public class SourceSystemAccessValidator implements SourceValidator {

    @Override
    public ConnectorResponse validate(Variant variant) {
      // TODO: IMPLEMENT ME validate source: Implement the custom logic of validating the source
      // system. In some cases this can be the same validation that happened in
      // TemplateConnectionValidator.
      // However, it is suggested to perform more complex validations, like specific access rights to
      // some specific resources here.
      // See more in docs:
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/reference/finalize_configuration_reference
      // https://docs.snowflake.com/developer-guide/native-apps/connector-sdk/flow/finalize_configuration
      var finalizeProperties = Configuration.fromCustomConfig(variant);

      var httpResponse = SourceSystemHttpHelper.validateSource(finalizeProperties.get("custom_property"));
      return prepareConnectorResponse(httpResponse.statusCode());
    }

    private ConnectorResponse prepareConnectorResponse(int statusCode) {
      switch (statusCode) {
        case 200:
          return ConnectorResponse.success();
        case 401:
          return ConnectorResponse.error("Unauthorized error");
        case 404:
          return ConnectorResponse.error("Not found error");
        default:
          return ConnectorResponse.error("Unknown error");
      }
    }
}
```

## Create resources

After the Wizard Phase is completed, the connector is ready to start ingesting data. But first,
the resources must be implemented and configured. A resource is an abstraction describing a specific
set of data in the source system, e.g. a table, an endpoint, a file, etc.

Different source systems might need different information about a resource - for that reason a resource
definition needs to be customized according to the specific needs. To do so, go to the `streamlit/daily_use/data_sync_page.py`
file. There you can find a `TODO` comment about adding text inputs for resource parameters. The
resource parameters should allow for the identification and retrieval of data from the source system.
Those parameters can be then extracted during the ingestion.

```python
# TODO: specify all the properties needed to define a resource in the source system. A subset of those properties should allow for a identification of a single resource, be it a table, endpoint, repository or some other data storage abstraction
st.text_input(
    "Resource name",
    key="resource_name",
)
st.text_input(
    "Some resource parameter",
    key="some_resource_parameter"
)
```

Once all necessary properties are added to the form, they can be passed to the backend side.
First, the state of the text fields has to be extracted and passed to the API level `queue_resource`
method in the `streamlit/daily_use/data_sync_page.py` file:

```python
def queue_resource():
    # TODO: add additional properties here and pass them to create_resource function
    resource_name = st.session_state.get("resource_name")
    some_resource_parameter = st.session_state.get("some_resource_parameter")

    if not resource_name:
        st.error("Resource name cannot be empty")
        return

    result = create_resource(resource_name, some_resource_parameter)
    if result.is_ok():
        st.success("Resource created")
    else:
        st.error(result.get_message())
```

Then the `create_resource` function from the `streamlit/native_sdk_api/resource_management.py` file
needs to be updated:

```python
def create_resource(resource_name, some_resource_parameter):
    ingestion_config = [{
        "id": "ingestionConfig",
        "ingestionStrategy": "INCREMENTAL",
        # TODO: HINT: scheduleType and scheduleDefinition are currently not supported out of the box, due to globalSchedule being used. However, a custom implementation of the scheduler can use those fields. They need to be provided because they are mandatory in the resourceDefinition.
        "scheduleType": "INTERVAL",
        "scheduleDefinition": "60m"
    }]
    # TODO: HINT: resource_id should allow identification of a table, endpoint etc. in the source system. It should be unique.
    resource_id = {
        "resource_name": resource_name,
    }
    id = f"{resource_name}_{random_suffix()}"

    # TODO: if you specified some additional resource parameters then you need to put them inside resource metadata:
    resource_metadata = {
        "some_resource_parameter": some_resource_parameter
    }

    return call_procedure("PUBLIC.CREATE_RESOURCE",
                          [
                              varchar_argument(id),
                              variant_argument(resource_id),
                              variant_list_argument(ingestion_config),
                              varchar_argument(id),
                              "true",
                              variant_argument(resource_metadata)
                          ])
```

### Customizing CREATE_RESOURCE() procedure logic

The `PUBLIC.CREATE_RESOURCE()` procedure allows the developer to customize its execution by implementing
custom logic that is plugged into several places of the main execution flow. The SDK allows the developer to:

1. Validate the resource before it’s created. The logic should be implemented in the
   `PUBLIC.CREATE_RESOURCE_VALIDATE()` procedure.
2. Perform custom operations before the resource is created. The logic should be implemented in the
   `PUBLIC.PRE_CREATE_RESOURCE()` procedure.
3. Perform custom operations after the resource is created. The logic should be implemented in the
   `PUBLIC.POST_CREATE_RESOURCE()` procedure.

More information about `PUBLIC.CREATE_RESOURCE()` procedure customization can be found here:

* [Create resource](../flow/ingestion-management/create_resource.md)
* [Create resource reference](../reference/create_resource_reference.md)

#### TemplateCreateResourceHandler.java

This class is a handler for the `PUBLIC.CREATE_RESOURCE()` procedure. Here, you can inject the Java
implementations of handlers for callback procedures mentioned before. By default the template provides
mocked Java implementations of callback handlers in order to get rid of calling SQL procedures, which
would extend the procedure execution time - Java implementations make the execution faster. These
mocked implementations do nothing apart from returning a success response. You can either provide the
custom implementation to the callback classes prepared by the template or create these callbacks
from scratch and inject them to the main procedure execution flow in the handler builder.

In order to implement the custom logic of callback methods that are called by default, look for the
following comments in the code:

* `TODO: IMPLEMENT ME create resource validate`
* `TODO: IMPLEMENT ME pre create resource callback`
* `TODO: IMPLEMENT ME post create resource callback`

## Ingestion

To perform ingestion of data you need to implement a class that will handle the connection with the
source system and retrieve data based on the resource configuration. Scheduler and Task Reactor modules
will take care of triggering and queueing of the ingestion tasks.

Ingestion logic is invoked from the `TemplateIngestion` class. Look for the `TODO: IMPLEMENT ME ingestion`
comment in the code and replace the random data generation with the data retrieval from the source system.
If you added custom properties to the resource definition, they can be fetched from the internal
connectors tables using the `ResourceIngestionDefinitionRepository` and properties available in the
`TemplateWorkItem`:

* `resourceIngestionDefinitionId`
* `ingestionConfigurationId`

Example of retrieving data from a webservice **might** look like this:

```java
public final class SourceSystemHttpHelper {

  private static final String DATA_URL = "https://source_system.com/data/%s";
  private static final SourceSystemHttpClient sourceSystemClient = new SourceSystemHttpClient();
  private static final ObjectMapper objectMapper = new ObjectMapper();

  private static List<Variant> fetchData(String resourceId) {
    var response = sourceSystemClient.get(String.format(url, resourceId));
    var body = response.body();

    try {
        return Arrays.stream(objectMapper.readValue(body, Map[].class))
              .map(Variant::new)
              .collect(Collectors.toList());
    } catch (JsonProcessingException e) {
      throw new RuntimeException("Cannot parse json", e);
    }
  }
}
```

```java
public class SourceSystemHttpClient {

  private static final Duration REQUEST_TIMEOUT = Duration.ofSeconds(15);

  private final HttpClient client;
  private final String secret;

  public SourceSystemHttpClient() {
    this.client = HttpClient.newHttpClient();
    this.secret =
        SnowflakeSecrets.newInstance()
            .getGenericSecretString(ConnectionConfiguration.TOKEN_NAME);
  }

  public HttpResponse<String> get(String url) {
    var request =
        HttpRequest.newBuilder()
            .uri(URI.create(url))
            .GET()
            .header("Authorization", format("Bearer %s", secret))
            .header("Content-Type", "application/json")
            .timeout(REQUEST_TIMEOUT)
            .build();

    try {
      return client.send(request, HttpResponse.BodyHandlers.ofString());
    } catch (IOException | InterruptedException ex) {
      throw new RuntimeException(format("HttpRequest failed: %s", ex.getMessage()), ex);
    }
  }
}
```

## Manage resources lifecycle

Once the logic of creating resources and the their ingestion is implemented, you can manage their
lifecycle by calling the following procedures:

1. `PUBLIC.ENABLE_RESOURCE()` enables a particular resource, meaning that it will be scheduled for ingestion
2. `PUBLIC.DISABLE_RESOURCE()` disables a particular resource, meaning that its ingestion scheduling will be stopped
3. `PUBLIC.UPDATE_RESOURCE()` allows you to update the ingestion configurations of a particular resource.
   It isn’t implemented in the Streamlit UI by default because sometimes it may be undesirable for the
   developer to allow the connector user to customize the ingestion configuration (revoke grants on
   this procedure to application role `ADMIN` in order to disallow its usage completely).

All these procedures have Java handlers and are extended with callbacks that allow you to customize
their execution. You can inject custom implementations of callbacks using the builders for these
handlers. By default the template provides mocked Java implementations of callback handlers.
These mocked implementations do nothing apart from returning a success response. You can either
provide the custom implementation to the callback classes prepared by the template or create these
callbacks from scratch and inject them to the main procedure execution flow in the handler builders.

### TemplateEnableResourceHandler.java

This class is a handler for the `PUBLIC.ENABLE_RESOURCE()` procedure, which can be extended with
the callbacks that are dedicated to:

1. Validate the resource before it’s enabled. Look for the `TODO: IMPLEMENT ME enable resource validate`
   comment in the code to provide the custom implementation.
2. Perform custom operations before the resource is enabled. Look for the `TODO: IMPLEMENT ME pre enable resource`
   comment in the code to provide the custom implementation.
3. Perform custom operations after the resource is enabled. Look for the `TODO: IMPLEMENT ME post enable resource`
   comment in the code to provide the custom implementation.

Learn more from the `PUBLIC.ENABLE_RESOURCE()` procedure detailed documentations:

* [Enable resource](../flow/ingestion-management/enable_resource.md)
* [Enable resource reference](../reference/enable_resource_reference.md)

### TemplateDisableResourceHandler.java

This class is a handler for the `PUBLIC.DISABLE_RESOURCE()` procedure, which can be extended with the callbacks that are
dedicated to:

1. Validate the resource before it’s disabled. Look for the `TODO: IMPLEMENT ME disable resource validate`
   comment in the code to provide the custom implementation.
2. Perform custom operations before the resource is disabled. Look for the `TODO: IMPLEMENT ME pre disable resource`
   comment in the code in order to provide the custom implementation.

Learn more from the `PUBLIC.DISABLE_RESOURCE()` procedure detailed documentations:

* [Disable resource](../flow/ingestion-management/disable_resource.md)
* [Disable resource reference](../reference/disable_resource_reference.md)

### TemplateUpdateResourceHandler.java

This class is a handler for the `PUBLIC.UPDATE_RESOURCE()` procedure, which can be extended with
the callbacks that are dedicated to:

1. Validate the resource before it’s updated. Look for the `TODO: IMPLEMENT ME update resource validate`
   comment in the code to provide the custom implementation.
2. Perform custom operations before the resource is updated. Look for the `TODO: IMPLEMENT ME pre update resource`
   comment in the code to provide the custom implementation.
3. Perform custom operations after the resource is updated. Look for the `TODO: IMPLEMENT ME post update resource`
   comment in the code to provide the custom implementation.

Learn more from the `PUBLIC.UPDATE_RESOURCE()` procedure detailed documentations:

* [Update resource](../flow/ingestion-management/update_resource.md)
* [Update resource reference](../reference/update_resource_reference.md)

## Settings

The template contains a settings tab that lets you view all the configuration made before.
However, if configuration properties were customized, then this view also needs some customizations.
Settings tab code can be found in the `streamlit/daily_use/settings_page.py` file.

To customize it, simply extract the values from the configuration for the keys that were added in
the respective configurations. For example, if earlier `additional_connection_property` was added
in the connection configuration step, then it could be added in the settings view like this:

```python
def connection_config_page():
    current_config = get_connection_configuration()

    # TODO: implement the display for all the custom properties defined in the connection configuration step
    custom_property = current_config.get("custom_connection_property", "")
    additional_connection_property = current_config.get("additional_connection_property", "")

    st.header("Connector configuration")
    st.caption("Here you can see the connector connection configuration saved during the connection configuration step "
               "of the Wizard. If some new property was introduced it has to be added here to display.")
    st.divider()

    st.text_input(
        "Custom connection property:",
        value=custom_property,
        disabled=True
    )
    st.text_input(
        "Additional connection property:",
        value=additional_connection_property,
        disabled=True
    )
    st.divider()
```

---
title: Understand limitations in the Snowflake Native App Framework
source: https://docs.snowflake.com/en/developer-guide/native-apps/limitations.md
section: Native Apps Framework
---

# Understand limitations in the Snowflake Native App Framework

This topic provides information about the limitations of the Snowflake Native Apps.

## Known limitations

Snowflake Native Apps have the following known limitations:

* Temporary tables or stages are not supported.
* Some Streamlit features are not supported. See [Unsupported Streamlit Features](adding-streamlit.md)
  for details.
* Snowflake Native Apps do not support failover for business continuity. For example, adding an application
  package to a replication group or failover group is not supported.
* [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md) aren’t
  supported in Snowflake Native Apps.
* [Snowflake ML functions](../../guides-overview-ml-functions.md) such as
  [Top Insights](../../user-guide/ml-functions/top-insights.md) aren’t supported
  in Snowflake Native Apps.

## Known limitations in Snowflake Native Apps with Snowpark Container Services

Snowflake Native Apps with Snowpark Container Services have the following limitations:

* Apps with containers are only supported on specific AWS, Azure, and Google Cloud commercial regions.
  See Support for private connectivity, VPS, and government regions for information on support for private connectivity, VPS, and government regions.
* Sessions used in connections from containers, for example using the Python connector, are limited
  to the application owner role. See
  [Snowpark Container Services: Additional considerations for services and jobs](../snowpark-container-services/spcs-execute-sql.md)
  for additional information.
* A maximum of 15 compute pools per application is allowed.
* [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) has the following
  limitation:

  + There is a 100GB limit for each file within the image repository.
* Using the LOG_LEVEL, TRACE_LEVEL, METRIC_LEVEL, and LOG_EVENT_LEVEL properties in the
  manifest file to set the logging and trace level for
  a container is not supported. Instead, use the `spec.logExporters` property in the service specification file.

  See [spec.logExporters field (optional)](../snowpark-container-services/specification-reference.md) for more information.

## Support for private connectivity, VPS, and government regions

The following tables list Snowflake Native App support for private connectivity, Virtual Private Snowflake (VPS),
and government regions on the [cloud platform](../../user-guide/intro-cloud-platforms.md) that
Snowflake supports:

**Amazon Web Services**

> |  | Amazon Web Services | AWS PrivateLink | Virtual Private Snowflake | Government regions |
> | --- | --- | --- | --- | --- |
> | Snowflake Native App Framework (without containers) | Generally available | Generally available | Generally available | Generally available |
> | Snowflake Native App Framework (with containers) | Generally available | Generally available | Not yet supported | Generally available |

**Microsoft Azure**

> |  | Microsoft Azure | Microsoft Azure Private Link | Virtual Private Snowflake | Government regions |
> | --- | --- | --- | --- | --- |
> | Snowflake Native App Framework (without containers) | Generally available | Generally available | Not yet supported | Generally available |
> | Snowflake Native App Framework (with containers) | Generally available | Generally available | Not yet supported | Not yet supported |

**Google Cloud**

> |  | Google Cloud | Google Cloud Private Service Connect | Virtual Private Snowflake |
> | --- | --- | --- | --- |
> | Snowflake Native App Framework (without containers) | Generally available | Not yet supported | Not yet supported |
> | Snowflake Native App Framework (with containers) | Generally available | Not yet supported | Not yet supported |

## Known issue with AWS and Azure PrivateLink

Links in email notifications from apps do not correctly link into a private link accounts.

## Limitations on Snowflake Native App with Snowpark Container Services

## Limitations on Snowflake Native Apps in government regions

The following limitations apply to Snowflake Native App support for government regions:

* Providers publishing apps from government regions can
  only share listings within the same organization.

### Limitations on AWS government regions

Snowflake Native App support all government regions except Department of Defense (DoD) regions.

### Limitations on Azure GovCloud

Azure GovCloud is supported only in the following regions:

* US East (N. Virginia)

### Limitations on apps with containers in government regions

For apps with containers published to government regions, the following limitations apply:

* Apps with containers are only supported on AWS government regions.
* Only FedRAMP Moderate on `awsuseast1gov` is supported.

## Limitations on Virtual Private Snowflake (VPS)

The following limitations apply to Snowflake Native App support for Virtual Private Snowflake (VPS):

* Snowflake Native Apps and Streamlit are not enabled by default in Virtual Private Snowflake. To use
  Snowflake Native Apps or Streamlit in VPS, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* If Streamlit is not enabled in the VPS deployment, consumers cannot use the Python Permission SDK
  to manage privileges and references.
* Sharing an app from a VPS account to an account outside the VPS
  is only supported within the same organization. To share an app outside the current organization, contact
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* Only private listings are supported for applications published inside the VPS.
* Consumers in the VPS can
  [enable event sharing](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#enable-event-sharing-for-an-app)
  for an app. However, log messages and trace events are not shared unless the provider has
  an event table within the VPS.
* Because the Snowflake Marketplace interface is not available in VPS, providers and consumers must
  manage listings by using SQL. For additional information, see [About managing listings using SQL](../../progaccess/listing-progaccess-about.md).

---
title: Update an app (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app.md
section: Native Apps Framework
---

# Update an app (Legacy)

The Snowflake Native App Framework enables providers to update a Snowflake Native App to add new functionality,
fix bugs, and make other changes. Providers can create new versions or patches of
and app and upgrade the app in the consumer account.

## Workflow for updating an app

1. Understand the version and upgrade process for an app.

   Before developing a new version or patch of an app, providers should understand the version
   lifecycle for an app and how the upgrade process works. See [Overview of app versions and upgrades (Legacy)](update-app-overview.md) for
   information.
2. Develop and test the updated app local.

   Providers develop and test new versions or patches locally before publishing them to consumers. See
   [Develop a new version of an app (Legacy)](update-app-develop.md) for guidelines on how to develop a new version or patch. See [Use versioned schema to manage app objects across versions](versioned-schema.md)
   for information on how to handle objects during the upgrade.
3. Add the version or patch to the application package.

   After developing and testing a new version or patch locally, providers create a new version or patch
   for the app. Version and patch information are stored in the application package. See [Create versions and patches for an app (Legacy)](update-app-versions.md) for
   information on creating versions and patches.

   > **Note:**
   >
   > If an application package already has two versions for an app defined in the
   > application package, providers must drop one of the versions before adding a new version.
4. Wait for the results of the automated security scan.

   If the DISTRIBUTION property of the application package is set to EXTERNAL, creating a new version
   or patch initiates the automated security scan. The app must pass the security scan before it can be
   published to the Snowflake Marketplace.

   For information on setting the DISTRIBUTION property and the automated security scan, see
   [Run the automated security scan](security-run-scan.md).
5. Upgrade the app.

   Upgrades are initiated when the provider updates the
   [release directive](update-app-release-directive.md) of the application package.

   This initiates the upgrade process for all installed apps that are on the previous
   version. However, a provider can ask a consumer to perform a manual upgrade
   if the consumer needs to upgrade their app before the automated upgrade is complete.
6. Monitor the upgrade.

   After the upgrade begins, providers can monitor the upgrade in their account by querying the
   [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md).

   See [Monitor the state of an upgrade](update-app-upgrade.md) for information on monitoring an app upgrade and the
   possible upgrade statuses.
7. Update the listing for the app.

   After an app passes the security scan and the provider sets the release directive,
   Snowflake automatically updates the version and patch for the listing. However, providers
   may still need to update the listing to describe new functionality to the consumer.

   For more information, see [Modify published listings](../../collaboration/provider-listings-modifying.md).

---
title: Update connection configuration reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/update_connection_configuration_reference.md
section: Native Apps Framework
---

# Update connection configuration reference

## Database objects and procedures

The following database objects are created through the `configuration/update_connection_configuration.sql`

### PUBLIC.UPDATE_CONNECTION_CONFIGURATION( connection_configuration VARIANT)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java `UpdateConnectionConfigurationHandler.updateConnectionConfiguration` handler.

### PUBLIC.UPDATE_CONNECTION_CONFIGURATION_VALIDATE( connection_configuration VARIANT)

Procedure used for providing additional connector specific validation logic. By default, it returns `'response_code': 'OK'`.
It is invoked by the default `ConnectionConfigurationInputValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.DRAFT_CONNECTION_CONFIGURATION_INTERNAL( connection_configuration VARIANT)

Procedure used for providing additional connector specific logic. By default, it returns `'response_code': 'OK'`.
It is invoked by the default `ConnectionConfigurationCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Connection configuration update is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))
* `configuration/connection_configuration.sql` (See: [Connection configuration reference](connection_configuration_reference.md))

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.configuration.connection` package and some common components are tightly connected with the above procedures:

* `UpdateConnectionConfigurationHandler`
* `ConnectionConfigurationInputValidator`
* `ConnectionConfigurationCallback`
* `DraftConnectionValidator`
* `ConnectionValidator`
* `UpdateConnectionConfigurationHandlerBuilder`
* `ConnectorStatusService`
* `ConnectorConfigurationService`
* `ConnectorErrorHandler`

## Custom handler

Handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide a custom implementation of `UpdateConnectionConfigurationHandler` the `PUBLIC.UPDATE_CONNECTION_CONFIGURATION` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_CONNECTION_CONFIGURATION(connection_configuration VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomUpdateConnectionConfigurationHandler.updateConnectionConfiguration';

GRANT USAGE ON PROCEDURE PUBLIC.UPDATE_CONNECTION_CONFIGURATION(VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

The `VALIDATE` and `INTERNAL` procedures can also be customized through SQL. It can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.DRAFT_CONNECTION_CONFIGURATION_INTERNAL(connection_configuration VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

  CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_CONNECTION_CONFIGURATION_VALIDATE(connection_configuration VARIANT)
    RETURNS VARIANT
    LANGUAGE JAVA
    RUNTIME_VERSION = '11'
    PACKAGES = ('com.snowflake:snowpark:1.11.0')
    IMPORTS = ('/connectors-native-sdk.jar')
    HANDLER = 'com.custom.handler.CustomConnectionConfigurationInputValidator.validate';
```

### Builder approach

`UpdateConnectionConfigurationHandler` can be customized using `UpdateConnectionConfigurationHandlerBuilder`. This builder allows the developer to provide custom implementations of the following interfaces:

* `ConnectionConfigurationInputValidator`
* `ConnectionConfigurationCallback`
* `DraftConnectionValidator`
* `ConnectionConfigurationCallback`
* `ConnectionValidator`
* `ConnectorErrorHelper`

In case one of them is not provided - the default implementation provided by the SDK will be used.

```java
class CustomConnectionConfigurationInputValidator implements ConnectionConfigurationInputValidator {

  @Override
  public ConnectorResponse validate(Variant configuration) {
    // CUSTOM VALIDATION LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.UPDATE_CONNECTION_CONFIGURATION procedure using SQL
  public static Variant updateConnectionConfiguration(Session session, Variant configuration) {
    // Using the builder
    var handler = UpdateConnectionConfigurationHandler.builder(session)
      .withInputValidator(new CustomConnectionConfigurationInputValidator())
      .build();
    return handler.updateConnectionConfiguration(configuration).toVariant();
  }
}
```

---
title: Update resource
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/ingestion-management/update_resource.md
section: Native Apps Framework
---

# Update resource

Updating a resource is used to redefine ingestion configurations for a particular resource.
`PUBLIC.UPDATE_RESOURCE` procedure is the entry point from the UI or worksheet to update a resource.

Calling this procedure requires the user has been assigned the `ADMIN` application role.

The resource updating process consists of several phases. Several of which are customizable but include reasonable defaults.
Phases are:

1. Initial validation
2. Custom validation
3. Custom logic before a resource is updated
4. Update of ingestion configurations
5. Finishing ingestion processes for removed ingestion configurations
6. Scheduling ingestion processes for new ingestion configuration
7. Custom logic after a resource is updated and ingestion processes are managed

## Initial validation

Initial validation is performed at the very beginning of the resource update process. It checks:

* whether given input data represent a valid resource ingestion configuration object
* whether a resource with given `id` and `resourceId` exists

## Custom validation

Custom validation is executed just after initial validation.
It is a part of the process which is designed to be customized with the connector-specific logic.

By default, it invokes `PUBLIC.UPDATE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `UpdateResourceHandlerBuilder` to provide custom implementation of the `UpdateResourceValidator` interface.

If the custom validation returns error, the next steps will not be executed and given error response will be returned from `UPDATE_RESOURCE` procedure.

## Custom logic before a resource is updated

Custom logic can be defined and executed before a resource is updated and rescheduled.

By default, it invokes `PUBLIC.PRE_UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `UpdateResourceHandlerBuilder` to provide custom implementation of the `PreUpdateResourceCallback` interface.

If custom logic returns error, the next steps will not be executed and given error response will be returned from `UPDATE_RESOURCE` procedure.

## Update of resource ingestion configurations

Within this step a new ingestion configurations are saved to `STATE.RESOURCE_INGESTION_DEFINITION` table for the resource
with a given `resource_ingestion_definition_id`.

## Finishing ingestion processes for removed ingestion configurations

In this step, when a resource is enabled (`enabled` parameter equals `true`) all active ingestion processes (with
statuses `SCHEDULED` or `IN_PROGRESS`) with ingestion configurations that ids aren’t included in the set of updated ingestion
configurations are finished, which means that their status is switched to `FINISHED`.

## Scheduling ingestion processes for new ingestion configuration

In this step, when a resource is enabled (`enabled` parameter equals `true`) new ingestion processes are created for
updated ingestion configurations that didn’t exist in a previous ingestion configurations state for a given resource.

## Custom logic after a resource is updated

Custom logic can be implemented and executed after resource ingestion configurations are updated.

By default, it invokes `PUBLIC.POST_UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)`,
which returns `'response_code': 'OK'`. It can be overwritten through the SQL script or by using
a `UpdateResourceHandlerBuilder` to provide custom implementation of the `PostUpdateResourceCallback` interface.

If custom logic returns an error, the given error response will be returned from `UPDATE_RESOURCE` procedure but the
update of resource ingestion definition and ingestion processes will not be rolled back so if required, it should be
handled by the custom implementation.

## Response

### Successful response

On success the procedure returns a result resembling:

> ```json
> {
>   "response_code": "OK",
>   "message": "Resource successfully updated."
> }
> ```

### Error response

On error a response resembling the following is returned:

> ```json
> {
>   "response_code": "<ERROR_CODE>",
>   "message": "<error message>"
> }
> ```

Possible error codes include:

* `INVALID_INPUT` - Provided procedure’s arguments are invalid and it is not possible to update resource ingestion configurations or a resource with given does not exists.
* `UPDATE_RESOURCE_ERROR` - Something unexpected happened when updating the resource ingestion definition with new ingestion configurations or when managing ingestion processes. All changes are rolled back.

---
title: Update resource reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/update_resource_reference.md
section: Native Apps Framework
---

# Update resource reference

## Database objects and procedures

The following database objects are created when the file `ingestion/resource_management.sql` is executed.

### PUBLIC.UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java function `UpdateResourceHandler.updateResource`.

### PUBLIC.UPDATE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)

Procedure used for connector specific validation of update process. By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultUpdateResourceValidator`. Can be overwritten both in SQL and Java.

### PUBLIC.PRE_UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)

Procedure used for adding connector specific logic which is invoked before a resource is updated.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPreUpdateResourceCallback`. Can be overwritten both in SQL and Java.

### PUBLIC.POST_UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)

Procedure used for adding connector specific logic which is invoked after a resource is updated.
By default, it returns `'response_code': 'OK'`.
It is invoked by `DefaultPostUpdateResourceCallback`. Can be overwritten both in SQL and Java.

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.ingestion.update` package and some common components are tightly connected with the above procedures:

* `UpdateResourceHandler`
* `UpdateResourceHandlerBuilder`
* `UpdateResourceValidator`
* `PreUpdateResourceCallback`
* `PostUpdateResourceCallback`
* `ConnectorErrorHelper`

## Custom handler

The handler and its internals can be customized using the following approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide whole custom implementation of `UpdateResourceHandler`, the `PUBLIC.UPDATE_RESOURCE` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_RESOURCE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomUpdateResourceHandler.updateResource';

GRANT USAGE ON PROCEDURE PUBLIC.UPDATE_RESOURCE(VARCHAR, VARIANT) TO APPLICATION ROLE ADMIN;
```

#### Internal procedures

Internal procedures `UPDATE_RESOURCE_VALIDATE`, `PRE_UPDATE_RESOURCE` and `POST_UPDATE_RESOURCE` can be also customized through the SQL. These procedures can also invoke other Java handlers:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC BEGIN
    SELECT sysdate();
    -- SOME CUSTOM LOGIC END

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;

CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_RESOURCE_VALIDATE(resource_ingestion_definition_id VARCHAR, ingestion_configurations VARIANT)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomHandler.updateResourceValidate';
```

### Builder approach

`UpdateResourceHandler` can be customized using `UpdateResourceHandlerBuilder`. This builder allows user to provide custom implementations of the following interfaces:

* `UpdateResourceValidator`
* `PreUpdateResourceCallback`
* `PostUpdateResourceCallback`
* `ConnectorErrorHelper`

In case a function is not provided the default implementation provided by the SDK will be used.

```java
class CustomPreUpdateResourceCallback implements PreUpdateResourceCallback {
  @Override
  public ConnectorResponse execute(String resourceIngestionDefinitionId, Variant updatedIngestionConfigurations) {
    // CUSTOM LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.UPDATE_RESOURCE procedure using SQL
  public static Variant updateResource(Session session, String resourceIngestionDefinitionId, Variant updatedIngestionConfigurations) {
    //Using builder
    var handler = UpdateResourceHandlerBuilder.builder(session)
      .withPreUpdateResourceCallback(new CustomPreUpdateResourceCallback())
      .build();
    return handler.updateResource(resourceIngestionDefinitionId).toVariant();
  }
}
```

---
title: Update the connection configuration
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/update_connection_configuration.md
section: Native Apps Framework
---

# Update the connection configuration

Updating the connection configuration is a step that can be called directly after [pausing the connector](pause_connector.md). This step allows the user
to update properties required to establish a connection with the source system to start ingesting data into Snowflake.
When overwriting with custom logic, this procedure needs to be replaced, to specify a custom Java handler.

Calling this procedure requires the user to have the `ADMIN` application role assigned.

The connection configuration step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Status validation
2. Input validation
3. Draft callback
4. Draft connection validation
5. Configuration update
6. Internal callback
7. Connection validation
8. Status update

## Requirements

Connection configuration requires at least the following sql files to be executed during Native App installation:

* `core.sql`
* `configuration/app_config.sql`
* `configuration/connection_configuration.sql`
* `configuration/update_connection_configuration.sql`

In case of this feature there is additional requirement dependent on the SDK user:

* Custom implementation of `PUBLIC.TEST_DRAFT_CONNECTION()` and `PUBLIC.TEST_CONNECTION()` procedures

## Status validation

To perform connection configuration update the internal status of the connector needs to be `PAUSED`.

This validation cannot be overwritten by using `UpdateConnectionConfigurationHandlerBuilder` nor by overwriting stored procedure.
However, it is possible to implement a custom handler, which will not have this kind of validation.

## Input validation

Input needs to be a `variant` containing a map of properties, however this is not enough sometimes. For that reason the SDK provides
an internal stored procedure called: `PUBLIC.UPDATE_CONNECTION_CONFIG_VALIDATE(config VARIANT)`. By default,
this procedure just returns `'response_code': 'OK'`, but when overwriting it can update the provided config during validation.
This feature enables custom logic like for example trimming the input, conversion to upper/lower case etc.
To return config transformed in any way the response needs to contain additional `"config"` property in the response `Variant`,
this property should contain the updated config as `Variant`.
The procedure can be customized by overwriting through the SQL or by using `UpdateConnectionConfigurationHandlerBuilder` and providing custom implementation of the
`ConnectionConfigurationInputValidator` interface.

The valid response from the custom implementation with transformation looks like this:

```json
{
    "response_code" : "OK",
    "config": {
        "key1": "value1",
        "key2": "value2"
    }
}
```

## Configuration update

Once the validations are passed successfully, configuration will be saved to the internal `APP_CONFIG` table.
Service responsible for this will save the provided `Variant` under the `connection_configuration` key.
This configuration has to be successfully validated by internal draft callback and draft connection validation to be updated,
the set of provided properties is completely up to the user.

## Internal draft callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.DRAFT_CONNECTION_CONFIGURATION_INTERNAL(connection_configuration VARIANT)`,
which returns `'response_code': 'OK'`. For example it can be used to alter other procedures by granting them external access integration.
It can be overwritten through the sql script or by using a `ConnectionConfigurationHandlerBuilder` to provide custom implementation of the `ConnectionConfigurationCallback` interface.

## Draft connection validation

This step will trigger a `PUBLIC.TEST_DRAFT_CONNECTION(connection_configuration VARIANT)` procedure. This procedure tries to query the source system
for the data using data from input parameter as connection configuration. This procedure is not implemented by default and needs to be provided by the
SDK user. Additionally, a `ConnectionValidator` interface implementation can be provided to the `UpdateConnectionConfigurationHandlerBuilder` to
customize this phase. In this case, there is no need to implement stored procedure. The recommendation is to perform just a minimal connectivity check
in this procedure to ensure that external access capabilities of Snowflake were configured correctly and the Connector has all required privileges to use them.

## Internal callback

Internal callback is another customizable step. By default, it invokes `PUBLIC.SET_CONNECTION_CONFIGURATION_INTERNAL(connection_configuration VARIANT)`,
which returns `'response_code': 'OK'`. For example it can be used to alter other procedures by granting them external access integration.
It can be overwritten through the sql script or by using a `ConnectionConfigurationHandlerBuilder` to provide custom implementation of the `ConnectionConfigurationCallback` interface.

## Connection validation

This step will trigger a `PUBLIC.TEST_CONNECTION` procedure. This procedure has twinning action to the `PUBLIC.TEST_DRAFT_CONNECTION(connection_configuration VARIANT)`
but has no input parameter and should be used for testing the official connection using a configuration saved in the database.

## Viewing the configuration

There is a `PUBLIC.GET_CONNECTION_CONFIGURATION()` procedure available to the `ADMIN` and `VIEWER` users that
returns a current connection configuration from the internal table.

## Response

### Successful response

If the procedure finishes successfully it returns a response from `TEST_CONNECTION` procedure. We recommend using the following format:

```json
{
  "response_code": "OK"
}
```

### Error response

In case of an error the response follows the below format:

```json
{
  "response_code": "<ERROR_CODE>",
  "message": "<error message>"
}
```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - Invalid connector status. Expected status: `[PAUSED]`
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive
* `PROCEDURE_NOT_FOUND` - Procedure which was called does not exist. In this case it’s about `TEST_CONNECTION` and `TEST_DRAFT_CONNECTION` procedure mostly
* `UNKNOWN_SQL_ERROR` - This error occurs when something unexpected happen when calling internal procedures
* `INVALID_RESPONSE` - This error occurs when response received from internal procedure does not contain `response_code` or an error response does not contain `message`, but contains `response_code`
* `UNKNOWN_ERROR` - It means that something unexpected went wrong - message of thrown exception is forwarded
* Custom error codes received from `TEST_DRAFT_CONNECTION()` procedure - defined by connector developer
* Custom error codes received from `DRAFT_CONNECTION_CONFIGURATION_INTERNAL()` procedure - defined by connector developer
* Custom error codes received from `TEST_CONNECTION()` procedure - defined by connector developer
* Custom error codes received from `SET_CONNECTION_CONFIGURATION_INTERNAL()` procedure - defined by connector developer

---
title: Update the warehouse
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/flow/update_warehouse.md
section: Native Apps Framework
---

# Update the warehouse

Warehouse update is a step which can be called directly after the [Pause connector](pause_connector.md) step. This step allows the user
to update the warehouse, set up during `Connector Configuration`, which is used for running SDK-controlled tasks.
When overwriting with custom logic, this procedure needs to be replaced, to specify a custom Java handler.

Calling this procedure requires the user to have the `ADMIN` application role assigned.

Warehouse update step internally consists of several phases. Some of them are fully customizable and by default,
don’t do anything. The phases are as follows:

1. Status validation
2. Input validation
3. Internal callback
4. SDK callback
5. Configuration update

## Requirements

Warehouse configuration requires at least the following sql files to be executed during native app installation:

* `core.sql`
* `configuration/app_config.sql`
* `configuration/connector_configuration.sql`
* `configuration/update_warehouse.sql`

## Status validation

To perform the warehouse update the internal status of the connector needs to be `PAUSED`.

This validation cannot be overwritten neither using the `UpdateWarehouseHandlerBuilder`, nor by overwriting the stored procedure.
However, it is possible to implement a custom handler, which will not have this kind of validation.

## Input validation

Input needs to be a `String` containing the new warehouse. This provided warehouse is then validated using an implementation of `UpdateWarehouseInputValidator`.
By default the following validations are performed, each throwing an exception if the required criteria are not met:

1. Validating if the provided warehouse is a valid Snowflake Identifier.
2. Validating if the new warehouse is different than the already configured one.
3. Validating if the application instance can access the new warehouse (by using the `SHOW WAREHOUSES` query).

This input validation step can only be customized by using the `UpdateWarehouseHandlerBuilder` and building a new, custom handler instance.

## Internal callback

Internal callback is also a customizable step.
By default it invokes the `PUBLIC.UPDATE_WAREHOUSE_INTERNAL` procedure, whose default implementation returns `'response_code': 'OK'`.
This step can be used to provide custom logic for the warehouse update process, e.g. altering the tasks created by the application developer.
The callback can be overwritten through the sql script or by using the `UpdateWarehouseHandlerBuilder` to provide a custom implementation of the `UpdateWarehouseCallback` interface.

## SDK callback

SDK callback is similar to the internal callback phase. Its purpose is to update the SDK-controlled components, e.g. tasks created by the Task Reactor.

This validation cannot be overwritten by using the `UpdateWarehouseHandlerBuilder`, nor by overwriting the stored procedure.
It is possible to implement a custom handler, which will not have this kind of validation, however it is highly discouraged.

## Configuration update

Once the validations and callbacks have passed successfully, the new warehouse will be saved to the internal `APP_CONFIG` table.
Service responsible for this will save the provided warehouse under the `connector_configuration` key, replacing the previously configured value.

## Viewing the configuration

There is a `PUBLIC.CONNECTOR_CONFIGURATION` view available to the `ADMIN` and `VIEWER` application roles, which
returns current configuration from the internal `APP_CONFIG` table.

## Response

### Successful response

If the procedure finishes successfully it returns a response with the `OK` response code:

```json
{
  "response_code": "OK"
}
```

### Error response

In case of an error the response has the following format:

```json
{
  "response_code": "<ERROR_CODE>",
  "message": "<error message>"
}
```

Possible error codes include:

* `INVALID_CONNECTOR_STATUS` - Invalid connector status. Expected status: `[PAUSED]`
* `INTERNAL_ERROR` - Something went wrong internally, the message should be descriptive
* `PROCEDURE_NOT_FOUND` - Procedure which was called does not exist
* `UNKNOWN_SQL_ERROR` - This error occurs when something unexpected happen when calling internal procedures
* `INVALID_RESPONSE` - This error occurs when response received from internal procedure does not contain `response_code` or an error response does not contain `message`, but contains `response_code`
* `UNKNOWN_ERROR` - It means that something unexpected went wrong (message of thrown exception is forwarded)
* `EMPTY_IDENTIFIER` - Provided identifier is a NULL value or an empty String
* `INVALID_IDENTIFIER` - Provided warehouse identifier is not valid
* `WAREHOUSE_ALREADY_USED` - Provided warehouse is already used by the application
* `INACCESSIBLE_WAREHOUSE` - Provided warehouse cannot be used access by the application instance
* Custom error codes received from `UPDATE_WAREHOUSE_INTERNAL` procedure - defined by the connector developer

---
title: Update warehouse reference
source: https://docs.snowflake.com/en/developer-guide/native-apps/connector-sdk/reference/update_warehouse_reference.md
section: Native Apps Framework
---

# Update warehouse reference

## Database objects and procedures

The following database objects are created through the `configuration/update_warehouse.sql`.

### PUBLIC.UPDATE_WAREHOUSE(warehouse_name STRING)

Entry point procedure available to the `ADMIN` role. This procedure invokes the Java `UpdateWarehouseHandler.updateWarehouse` handler.

### PUBLIC.UPDATE_WAREHOUSE_INTERNAL(warehouse_name STRING)

Procedure used for providing additional connector specific logic. By default, it returns `'response_code': 'OK'`.
It is invoked by the default `UpdateWarehouseCallback`. Can be overwritten both in SQL and Java.

## Related tables and views

Warehouse update is related to and dependent on the objects from the following files:

* `core.sql` (See [Core SQL reference](core_reference.md))
* `configuration/app_config.sql` (See: [App config SQL reference](app_config_reference.md))
* `configuration/connector_configuration.sql` (See: [Connector configuration reference](connector_configuration_reference.md))

## Related Java objects

The following Java objects from the `com.snowflake.connectors.application.configuration.warehouse` package and some common components are tightly connected with the above procedures:

* `UpdateWarehouseHandler`
* `UpdateWarehouseInputValidator`
* `UpdateWarehouseCallback`
* `UpdateWarehouseSdkCallback`
* `UpdateWarehouseHandlerBuilder`
* `ConnectorStatusService`
* `ConnectorConfigurationService`
* `ConnectorErrorHandler`

## Custom handler

Handler and its internals can be customized using the following two approaches.

### Procedure replacement approach

The following components can be replaced using SQL.

#### Handler

To provide a custom implementation of `UpdateWarehouseHandler` the `PUBLIC.UPDATE_WAREHOUSE` procedure must be replaced. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_WAREHOUSE(warehouse_name STRING)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomUpdateWarehouseHandler.updateWarehouse';

GRANT USAGE ON PROCEDURE PUBLIC.UPDATE_WAREHOUSE(STRING) TO APPLICATION ROLE ADMIN;
```

#### Internal procedure

The `INTERNAL` procedure can also be customized through SQL. It can even invoke another Java handler:

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_WAREHOUSE_INTERNAL(warehouse_name STRING)
  RETURNS VARIANT
  LANGUAGE SQL
  EXECUTE AS OWNER
  AS
  BEGIN
    -- SOME CUSTOM LOGIC

    RETURN OBJECT_CONSTRUCT('response_code', 'OK');
  END;
```

```sqlexample
CREATE OR REPLACE PROCEDURE PUBLIC.UPDATE_WAREHOUSE_INTERNAL(warehouse_name STRING)
  RETURNS VARIANT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:1.11.0')
  IMPORTS = ('/connectors-native-sdk.jar')
  HANDLER = 'com.custom.handler.CustomUpdateWarehouseCallback.execute';
```

### Builder approach

`UpdateWarehouseHandler` can be customized using `UpdateWarehouseHandlerBuilder`. This builder allows the developer to provide custom implementations of the following interfaces:

* `UpdateWarehouseInputValidator`
* `UpdateWarehouseCallback`
* `ConnectorErrorHelper`

In case one of them is not provided - the default implementation provided by the SDK will be used.

```java
class CustomUpdateWarehouseInputValidator implements UpdateWarehouseInputValidator {

  @Override
  public ConnectorResponse validate(Identifier warehouse) {
    // CUSTOM VALIDATION LOGIC
    return ConnectorResponse.success();
  }
}

class CustomHandler {

  // Path to this method needs to be specified in the PUBLIC.UPDATE_WAREHOUSE procedure using SQL
  public static Variant updateWarehouse(Session session, String warehouseName) {
    // Using the builder
    var handler = UpdateWarehouseHandler.builder(session)
      .withInputValidator(new CustomUpdateWarehouseInputValidator())
      .build();
    return handler.updateWarehouse(warehouseName).toVariant();
  }
}
```

---
title: Upgrade an app (Legacy)
source: https://docs.snowflake.com/en/developer-guide/native-apps/update-app-upgrade.md
section: Native Apps Framework
---

# Upgrade an app (Legacy)

This topic provides information on upgrading a Snowflake Native App.

## About upgrades

The Snowflake Native App Framework allows providers to upgrade an app to a new version or patch. To see how
upgrades fit in the overall workflow for developing a new version or patch of an app, see
[Workflow for updating an app](update-app.md).

Providers can initiate an upgrade of an app to a new version or patch by setting a release directive
on the application package. When the release directive is modified, Snowflake automatically upgrades
all installed instances of the current version of the app to the version specified by the release directive.

When the provider initiates an upgrade, Snowflake adds each app to be upgraded to a queue. Each
app is upgraded as resources are available. The upgrade process can take a while to complete across all
installed versions of the app. To expedite the upgrade process, consumers can also manually initiate an upgrade
of an app when a new version or patch is available.

> **Note:**
>
> After the upgrade process begins for their app, consumers can no longer manually upgrade the app.

## Upgrade workflow

A provider upgrades an installed app by using the following workflow:

1. Update the app to include any new features.
2. If you are creating a new version of the app and there are two versions currently defined
   for the app:

   1. Ensure that no consumers are currently running the version.
   2. Drop the version of the app you are replacing.
3. Create a new version or patch for the changes in the application package.

   If the DISTRIBUTION property of the application package is set to `EXTERNAL`, the
   [automated security scan](security-overview.md) is initiated.
   The security scan must pass before the upgrade can occur.
4. Test the new version by creating installing the app in your test account.
5. Update the release directive for the version or patch.

   This initiates an automated upgrade that will update all installed instances of the previous version.
   A provider can notify the consumer that an upgrade is available and ask them to
   manually upgrade the app.

## Set a start date and time for an upgrade

Providers can set a date and time that specifies when an automatic upgrade should begin.
This date and time is set in the release directive (default or custom) using the
`UPGRADE_AFTER` clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command.
Both default and custom release directives are supported.

The upgrade date and time can be any valid [date and time type](../../sql-reference/data-types-datetime.md).
If the timestamp has already passed the upgrade is immediately scheduled. This is the same behavior as not setting the
`UPGRADE_AFTER` clause.

You can only use the `UPGRADE_AFTER` clause if you are setting the version and patch. This clause can’t be used
to modify only the upgrade time and date.

### Set the upgrade date and time for the default release directive

1. To set the upgrade date and time for the default release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     SET DEFAULT RELEASE DIRECTIVE
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command sets the upgrade date and time to April 6, 2025 at 11:00 am.

### Set the upgrade date and time for a custom release directive

1. To set the upgrade date and time for a custom release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     SET DEFAULT RELEASE DIRECTIVE
     ACCOUNTS = ( USER_ACCOUNT.snowflakecomputing.com )
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command sets the upgrade date and time to April 6, 2025 at 11:00am.

### Change the upgrade date and time for a custom release directive

1. To change the upgrade date and time for a default release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     SET DEFAULT RELEASE DIRECTIVE
     ACCOUNTS = ( USER_ACCOUNT.snowflakecomputing.com )
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command sets the upgrade date and time to April 6, 2025 at 11:00am.

## Start an upgrade

The upgrade process starts automatically when a provider updates the release directive (default or custom) of the
application package to point to a new version or patch. Use the
[ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](../../sql-reference/sql/alter-application-package-release-directive.md) command to set the release directive as shown
in the following examples:

```sqlexample
ALTER APPLICATION PACKAGE my_application_package SET DEFAULT RELEASE DIRECTIVE
  VERSION = v2
  PATCH = 0;
```

This command sets the default release directive to version `v2` and patch `0`.

```sqlexample
ALTER APPLICATION PACKAGE my_application_package
  SET RELEASE DIRECTIVE my_custom_release_directive
  ACCOUNTS = ( USER_ACCOUNT.snowflakecomputing.com )
  VERSION = v2
  PATCH = 0;
```

This command sets the custom release directive named `my_custom_release_directive` to version `v2` and patch `0`
for the account USER_ACCOUNT.snowflakecomputing.com.

See [Set the release directive for an app (Legacy)](update-app-release-directive.md) for more information.

## Manually upgrade an app

Manual upgrades allow a consumer to upgrade their installed app faster than automated upgrades.
When a new version or patch is available, a provider can ask the consumer to perform
a manual upgrade.

The consumer performs a manual upgrade by running the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md).
This command initiate the upgrade of an installed version or patch of an app using the release directive
specified in the application package.

To upgrade an installed Snowflake Native App to the latest available version, a consumer can use the UPGRADE clause of
the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command to modify the app:

```sqlexample
ALTER APPLICATION <name> UPGRADE
```

## Upgrade an app across regions

After upgrading an app, changes to the installed Snowflake Native App in the consumer account
might not be visible until the refresh to remote regions is performed.

You can use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema to
monitor the state. If the upgrade is not complete more than one day after the first refresh following the upgrade, there might be an issue
with the refresh process. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

If a provider publishes a Snowflake Native App using
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md),
automated upgrades may take a while to upgrade depending on multiple factors, including:

* The value of the refresh schedule.
* The number of installed instances of the app.
* The number of regions where the app is deployed.

If the upgrade contains an urgent fix that needs to be upgraded in a remote region, the provider can reduce the refresh
frequency of the listing to a smaller value. See
[Managing and monitoring auto-fulfillment settings](../../collaboration/provider-listings-auto-fulfillment-monitor.md)
for information on setting the account-level refresh frequency.

> > **Caution:**
> >
> > Reducing the refresh frequency can increase the costs associated with replication.

## Upgrade states

During the upgrade process, the app passes through different states. The following diagram shows the possible
states when upgrading from the previous version, v1, to a new version, v2.

> **Note:**
>
> Although this diagram shows an upgrade for a version, it also applies to patch upgrades.

The following table shows each stage of the upgrade process for an app within the same region that the application
package is located:

|  | Stage | Description |
| --- | --- | --- |
| 1 | App is disabled? | If the app is disabled, no upgrade is possible. |
| 2 | Set release directive to v2.0 | The provider sets the release directive to v2.0. |
| 3 | Eligible to upgrade | Snowflake performs checks to verify that the app is eligible to upgrade. These checks include verifying that the app is not disabled, that application package is available, that the version and patch is valid for upgrade, the consumer account is valid, etc. |
| 4 | Obtain upgrade slot? | Depending on the number of apps being upgraded, the number of consumer accounts, etc. they may have to wait to begin the upgrade process. |
| 5 | Setup script run successfully? | When the upgrade begins, Snowflake runs setup script. If any uncaught errors occur, the setup script execution stops. Snowflake queues the app for upgrade again based on the number of retries configured. |
| 6 | Is version updated? | Snowflake checks to see if the upgrade is for a version or patch. If the upgrade is for a version, Snowflake performs additional checks and waits until all jobs from the older version of the app have completed. |

The following table shows the upgrade process for apps that are deployed to remote regions:

|  | Stage | Description |
| --- | --- | --- |
| 7 | Release directive v2.0 replicated in remote region | When a provider sets the release directive for an app that is deployed to a remote region, the release directive is propagated to the application package deployed in the remote region. |
| 8 | Active region for v2.0? | When most of the apps in the primary region have been upgraded, Snowflake sends messages to the remote region to begin the app upgrade. |
| 9 | Begin upgrade process | Begin upgrade process for the app as described in the previous table. |

The following table describes each of the possible states of the upgrade process:

| State | Description |
| --- | --- |
| DISABLED | The app is disabled and not eligible for upgrade. |
| QUEUED | The app is in the queue to be upgraded based on the number of apps and consumer accounts. |
| UPGRADING | The app is in the process of being upgraded. |
| COMPLETED | The app has upgraded successfully. |
| QUEUED_RETRY | The setup script or other check failed and the app is returned to the upgrade queue. |
| FAILED | The app upgrade failed. Upgrades can fail on the provider side, for example due to an error in the setup script. Upgrades can also fail on the consumer side if the app is disabled, the consumer account is inactive, etc. |

## Monitor the state of an upgrade

To view the upgrade state of an app, use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md).

For example, in a situation where you updated the default release directive and want to see if all apps have
reached the target version. To find application instances that have not yet finished the upgrade, use the query in
the following example:

```sqlexample
SELECT * FROM snowflake.data_sharing_usage.APPLICATION_STATE
```

This view includes columns that are specific to upgrades, including the upgrade state and the region where
the app is deployed. For information on upgrade states see [Upgrade states](release-channels-upgrade.md).

## Troubleshoot upgrade problems

The Snowflake Native App Framework provides several ways to troubleshoot the upgrade:

### Identify upgrade errors

Consumers can use the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command to view error messages related to
failed upgrades. This command provides insight into the errors that occurred during the upgrade process.

Providers can use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) to view error messages for failed upgrades.
Using this view, providers can diagnose issues with specific applications. See [Monitor the state of an upgrade](release-channels-upgrade.md)
for more information.

### Use logging and event tracing

If [logging and event tracing](event-about.md) is configured for the
app, providers can query the event table to diagnose problems with the app upgrade.

See [View the logs and events in the event table](event-manage-provider.md) for more information.

### Monitor the state of an app’s services

To view information about the status of a compute pool or service within an app,
consumers can use the following system functions:

* [DESCRIBE COMPUTE POOL](../../sql-reference/sql/desc-compute-pool.md)
* [SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md)

Consumers can share this information back to providers. Providers can also configure event sharing
to return this information.

## Disabled apps

When an app installed in the consumer account is disabled, it is no longer usable.
An app installed in a consumer account can become disabled for multiple reasons, including:

* Problems with the application package
* Problems with the installed application
* Problems with the consumer account

Both providers and consumers should avoid situations where an app remains disabled for an extended
period of time. Disabled apps can become unusable and must be reinstalled

### Upgrade a disabled app

Disabled apps are not part of the normal upgrade process and cannot be upgraded. If a disabled app becomes reenabled,
it is automatically upgrade to the version and patch of the release directive. However, if the version or patch is no
longer available the app cannot be upgraded and must be reinstalled.

For example, if a disabled app is on version `v1`, but the current and previous versions in the application package
are `v2` and `v3`, the app cannot be upgraded and is unusable.

### Reasons an app can become disabled

You can view the DISABLEMENT_REASONS column of the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md)
to see the reasons an app is disabled. The following table lists the possible values for the DISABLEMENT_REASONS
column:

| Value | Status description | Is recoverable? |
| --- | --- | --- |
| MANUALLY_DISABLED | The app is disabled by Snowflake | Yes. To re-enable the app, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_INACTIVE | The account becomes inactive by being locked or suspended causing the app to be unavailable. In this state a consumer cannot execute any SQL queries in their account and the app cannot be upgraded. | Yes. The app is automatically re-enabled if the account lock or suspension is removed |
| PACKAGE_VERSION_IS_MISSING | The application package version for the app was dropped by the provider. | Possibly. This can be caused by a temporary platform outage, in which case the app may recover automatically. Otherwise, the provider can work with Snowflake Support to attempt version recovery. Contact the application provider for more details. |
| CMK_ACCESS_DENIED | The consumer manages the encryption key themselves (ENCRYPT_USE_CMK_KMS is enabled) and Snowflake doesn’t have access to this key. | Yes. To re-enable the app, ensure that the cloud provider configuration to retrieve the CMK is correct and that Snowflake has access to the key. |
| LISTING_ACCESS_REVOKED | The listing used to create the app is no longer available. Possible reasons for this status include:   * The provider deleted the listing * The provider manually removed access to the private listing from the consumer account | Possibly. Recoverability depends on the reason why access was revoked.  For example, if the listing was deleted it is not recoverable. If a consumer account was manually removed from the private listing, access to the listing and app can be restored. |
| LISTING_TRIAL_USAGE_EXCEEDED | The application has exceeded the usage limit for a usage-based trial listing. | No |
| LISTING_PAYMENT_REQUIRED | The listing used to install the app is a paid listing and requires payment for further usage. | Yes. The consumer must correctly set up payment for the app. |
| LISTING_TRIAL_TIME_EXCEEDED | The application exceeded the trial duration. | No |
| APPLICATION_PACKAGE_NOT_AVAILABLE | The application package used to create the app no longer exists. The provider may have dropped the corresponding application package. | No |
| APPLICATION_PACKAGE_DISABLED | The application package used to create the app is disabled by the Snowflake. | Yes. The app is re-enabled, if Snowflake re-enables the application package. |
| APPLICATION_SUSPENDED | The app resources for example, tasks, services, and compute pools, are suspended due to the app being disabled.  The suspended objects remain suspended until the app is re-enabled and there are no other reasons the app was disabled. | Yes |
| APPLICATION_SUSPEND_RESUME_IN_PROGRESS | The app resources, for example tasks, services, and compute pools, are currently resuming. | Yes |

---
title: Upgrade an app as a consumer
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-upgrade.md
section: Native Apps Framework
---

# Upgrade an app as a consumer

This topic describes how a consumer can upgrade a Snowflake Native App that is installed in
their account.

## Overview of upgrades

In general, when a provider publishes an updated version or patch for an app, the
app is automatically upgraded in the consumer account. The automated upgrade occurs
for all instances of the app across all Snowflake accounts where the app is installed.

It may take some time for all app instances to be upgraded. However, consumers can
manually upgrade an app after a new version of patch has been published as long
as the upgrade of the instance installed in their account has not started. Also,
providers can specify a date and time when an automated upgrade occurs.

## Manually upgrade an app

Manual upgrades allow a consumer to upgrade an app when a provider publishes a new version
or patch.

To upgrade an installed Snowflake Native App to the latest available version, a consumer can use the UPGRADE clause of the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command to modify the app:

```sqlexample
ALTER APPLICATION hello_snowflake_app UPGRADE;
```

This command initiates the upgrade of an installed version or patch of an app using the
release directive specified in the application package.

## Upgrade an app at a specific date and time

When publishing a new version of an app, providers can specify a date and time when the app
will be upgraded. This date and time specifies the earliest date when the app can be
upgraded.

Consumers can [create a task](../../sql-reference/sql/create-task.md) to upgrade the app at a specific time.

```sqlexample
CREATE OR REPLACE TASK APP_UPGRADE_TASK
 SCHEDULE = 'USING CRON 0 9-17 * * SUN America/Los_Angeles'
 WAREHOUSE = 'WH'
 AS
   ALTER APPLICATION hello_snowflake_app UPGRADE;

ALTER TASK APP_UPGRADE_TASK RESUME;
```

This example attempts to upgrade the app every hour starting at 9:00 am and ending at
5:00 pm on Sundays (America/Los_Angeles time zone).

## Monitor the status of an upgrade

Consumers can use the
[DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md)
command to view the status of an upgrade. The following columns provide information specific
to upgrades:

> | Column | Description |
> | --- | --- |
> | `upgrade_after` | Indicates that the provider has scheduled an upgrade to begin at this time. However, the app may be upgraded before this date and time. For more information, see Manually upgrade an app. |
> | `upgrade_state` | The current state of the background installation or upgrade of the application object. Valid values are:   * `INSTALLING`: The application object is in the process of being created. * `INSTALL_FAILED`: The creation of the application object failed. The application object   remains in the `INSTALL_FAILED` state until it is dropped. See the `UPGRADE_FAILURE_REASON`   column of the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command for information about why the   installation or upgrade failed. * `COMPLETE`: The setup script successfully completed and the application object was created   or upgraded. * `QUEUED`: The application object is queued for upgrade. * `UPGRADING`: The application object is in the process of being upgraded. * `FAILED`: All upgrade attempts failed. The reason for the failure is listed in the   `UPGRADE_FAILURE_REASON` column, if present. The instance remains in the `FAILED` state until   a release directive is updated to point to a different version than the one that the upgrade was   targeting, as defined in the `TARGET_UPGRADE_VERSION` column. * `QUEUED_DELAYED`: The application object is queued for an upgrade that is scheduled for a future time. * `QUEUED_RETRY`: The instance failed one or more upgrade attempts. The reason for the failure   is indicated in `UPGRADE_FAILURE_REASON`: The instance is queued to perform another upgrade attempt. * `DISABLED`: The application object and its upgrades were disabled. In this state the instance will be   inaccessible for consumers, it will not be considered for upgrades and will not block application package   version drop. The reason for the failure is listed in the `UPGRADE_FAILURE_REASON` column, if present. |
> | `upgrade_target_version` | The version identifier of the version to which the app is being upgraded. |
> | `upgrade_target_patch` | The patch to which the application object is being upgraded. |

---
title: Upgrade an app using release channels
source: https://docs.snowflake.com/en/developer-guide/native-apps/release-channels-upgrade.md
section: Native Apps Framework
---

# Upgrade an app using release channels

This topic provides information on how to upgrade a Snowflake Native App using release channels.

## About app upgrades

The Snowflake Native App Framework allows providers to upgrade an app to a new version or patch.

Providers can initiate an upgrade of an app to a new version or patch by setting the release directive
on a release channel. When the release directive is modified, Snowflake automatically upgrades all
installed instances of the current version of the app to the version specified by the release directive.

When the provider initiates an upgrade, Snowflake adds each app to be upgraded to a queue. Each app is
upgraded as resources are available. The upgrade process can take a while to complete across all installed
versions of the app. To expedite the upgrade process, consumers can also manually initiate an upgrade of
an app when a new version or patch is available.

> **Note:**
>
> After the upgrade process begins on an app, consumers can no longer manually upgrade the app.

### Considerations when upgrading an app with multiple versions

When a provider publishes a new version of an app, Snowflake ensures that only the previous version of
the app is active. For example, if a provider has published versions v1 and v2 of an app, Snowflake ensures
that only v2 is currently installed in a consumer account before upgrading to v3. This requires that all
installed apps using version v1 are migrated to version v2.

This ensures that the setup script of the app only has to account for differences between v2 and v3. The
setup script is only backwards compatible with the most recent version of the app. If a provider makes a
state change to the app, for example creating a new table or adding columns to a table, providers only have
to ensure that there are no compatibility issues between two versions.

In contrast, when a provider creates a new patch for a version of an app, the Snowflake Native App Framework does not enforce any
restrictions on the number of active patches running. Providers must avoid making changes to the state of
an app in a patch to avoid incompatibility across multiple patches.

### Considerations when removing a version of an app after an upgrade

Although an app might be upgraded in the consumer account, the previous version of the app might still have code that is running. Providers cannot remove the previous version of the app from the release channel until all running code from the previous version has completed. This applies to all installed versions of the app across all consumer accounts. If a single upgrade fails, providers must fix the reason for the upgrade failure before they can remove the version.

### Upgrades across regions

For information on upgrading an app across multiple regions, see Upgrade an app across regions.

## Workflow for upgrading an app

A provider upgrades an app by using the following workflow:

1. Update the app to include any new features.
2. If you are creating a new version of the app and there are two versions currently defined for the app:

> 1. Ensure that no consumers are currently running the version.
> 2. Drop the version of the app you are replacing.

1. Create a new version or patch for the changes in the release channel.

   If the DISTRIBUTION property of the application package is set to EXTERNAL, the automated security scan is initiated. The security scan must pass before the upgrade can occur.
2. Test the new version by installing the app in your test account.
3. Update the release directive for the version or patch.

   This initiates an automated upgrade that will update all installed instances of the previous version. A provider can notify the consumer that an upgrade is available and ask them to manually upgrade the app.

## Set a start date and time for an upgrade

Providers can set a date and time that specifies when an automatic upgrade should begin.
This date and time is set in the release directive (default or custom) using the
`UPGRADE_AFTER` clause of the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command.
Both default and custom release directives are supported.

The upgrade date and time can be any valid [date and time type](../../sql-reference/data-types-datetime.md).
If the timestamp has already passed the upgrade is immediately scheduled. This is the same behavior as not setting the
`UPGRADE_AFTER` clause.

You can only use the `UPGRADE_AFTER` clause if you are setting the version and patch. This clause can’t be used
to modify only the upgrade time and date.

### Set the upgrade date and time for the default release directive

1. To set the upgrade date and time for the default release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     MODIFY RELEASE CHANNEL DEFAULT
     SET DEFAULT RELEASE DIRECTIVE
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command sets the upgrade date and time for patch `2` or version `v1_0` of the DEFAULT
release channel to April 6, 2025 at 11:00 am.

> **Note:**
>
> This is the earliest date and time when the upgrade can begin. The actual upgrade might occur later
> depending on the number of apps being upgraded, the number of consumer accounts, etc.

### Set the upgrade date and time for a custom release directive

1. To set the upgrade date and time for a default release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     MODIFY RELEASE CHANNEL DEFAULT
     SET DEFAULT RELEASE DIRECTIVE
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command sets the upgrade date and time for patch `2` or version `v1_0` of the DEFAULT
release channel to April 6, 2025 at 11:00 am.

> **Note:**
>
> This is the earliest date and time when the upgrade can begin. The actual upgrade might occur later
> depending on the number of apps being upgraded, the number of consumer accounts, etc.

### Change the upgrade date and time for a custom release directive

1. To change the upgrade date and time for a default release directive:

   ```sqlexample
   ALTER APPLICATION PACKAGE hello_snowflake_package
     MODIFY RELEASE CHANNEL DEFAULT
     SET DEFAULT RELEASE DIRECTIVE
     ACCOUNTS = ( USER_ACCOUNT.snowflakecomputing.com )
     VERSION = 'v1_0'
     PATCH = '2'
     UPGRADE_AFTER = '2025-04-06T11:00:00Z'
   ```

This command changes the upgrade date and time for patch `2` or version `v1_0` of the DEFAULT
release channel to April 6, 2025 at 11:00 am.

> **Note:**
>
> This is the earliest date and time when the upgrade can begin. The actual upgrade might occur later
> depending on the number of apps being upgraded, the number of consumer accounts, etc.

## Start an upgrade

The upgrade process starts automatically when a provider updates the release directive (default or custom) of the release channel to point to a new version or patch. Use the
[ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](../../sql-reference/sql/alter-application-package-release-directive.md) command to set the release directive
as shown in the following examples:

```sqlexample
ALTER APPLICATION PACKAGE my_application_package
  MODIFY RELEASE CHANNEL DEFAULT
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = v2
  PATCH = 0;
```

This command sets the default release directive to version `v2` and patch `0` for the default
release channel.

```sqlexample
ALTER APPLICATION PACKAGE my_application_package
  MODIFY RELEASE CHANNEL DEFAULT
  SET RELEASE DIRECTIVE my_custom_release_directive
  ACCOUNTS = ( USER_ACCOUNT.snowflakecomputing.com )
  VERSION = v2
  PATCH = 0;
```

This command sets the custom release directive named `my_custom_release_directive` to version `v2`
and patch `0` for the account USER_ACCOUNT.snowflakecomputing.com.

See [Set the release directive for an app (Legacy)](update-app-release-directive.md) for more information.

## Manually upgrade an app

Manual upgrades allow a consumer to upgrade their installed app faster than automated upgrades.
When a new version or patch is available, a provider can ask the consumer to perform
a manual upgrade.

The consumer performs a manual upgrade by running the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md).
This command initiate the upgrade of an installed version or patch of an app using the release directive
specified in the release channel.

To upgrade an installed Snowflake Native App to the latest available version, a consumer can use the UPGRADE clause of
the [ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command to modify the app:

```sqlexample
ALTER APPLICATION <name> UPGRADE
```

## Upgrade an app across regions

After upgrading an app, changes to the installed Snowflake Native App in the consumer account
might not be visible until the refresh to remote regions is performed.

You can use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) in the
[Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema to monitor the state. If the upgrade is not
complete more than one day after the first refresh following the upgrade, there might be an issue
with the refresh process. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

If a provider publishes a Snowflake Native App using
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md),
automated upgrades may take a while to upgrade depending on multiple factors, including:

* The value of the refresh schedule.
* The number of installed instances of the app.
* The number of regions where the app is deployed.

If the upgrade contains an urgent fix that needs to be upgraded in a remote region, the provider can reduce the refresh
frequency of the listing to a smaller value. See
[Managing and monitoring auto-fulfillment settings](../../collaboration/provider-listings-auto-fulfillment-monitor.md)
for information on setting the account-level refresh frequency.

> > **Caution:**
> >
> > Reducing the refresh frequency can increase the costs associated with replication.

## Upgrade states

During the upgrade process, the app passes through different states. The following diagram shows the possible
states when upgrading from the previous version, v1, to a new version, v2.

> **Note:**
>
> Although this diagram shows an upgrade for a version, it also applies to patch upgrades.

The following table shows each stage of the upgrade process for an app within the same region that the application
package is located:

|  | Stage | Description |
| --- | --- | --- |
| 1 | App is disabled? | If the app is disabled, no upgrade is possible. |
| 2 | Set release directive to v2.0 | The provider sets the release directive to v2.0. |
| 3 | Eligible to upgrade | Snowflake performs checks to verify that the app is eligible to upgrade. These checks include verifying that the app is not disabled, that application package is available, that the version and patch is valid for upgrade, the consumer account is valid, etc. |
| 4 | Obtain upgrade slot? | Depending on the number of apps being upgraded, the number of consumer accounts, etc. they may have to wait to begin the upgrade process. |
| 5 | Setup script run successfully? | When the upgrade begins, Snowflake runs setup script. If any uncaught errors occur, the setup script execution stops. Snowflake queues the app for upgrade again based on the number of retries configured. |
| 6 | Is version updated? | Snowflake checks to see if the upgrade is for a version or patch. If the upgrade is for a version, Snowflake performs additional checks and waits until all jobs from the older version of the app have completed. |

The following table shows the upgrade process for apps that are deployed to remote regions:

|  | Stage | Description |
| --- | --- | --- |
| 7 | Release directive v2.0 replicated in remote region | When a provider sets the release directive for an app that is deployed to a remote region, the release directive is propagated to the application package deployed in the remote region. |
| 8 | Active region for v2.0? | When most of the apps in the primary region have been upgraded, Snowflake sends messages to the remote region to begin the app upgrade. |
| 9 | Begin upgrade process | Begin upgrade process for the app as described in the previous table. |

The following table describes each of the possible states of the upgrade process:

| State | Description |
| --- | --- |
| DISABLED | The app is disabled and not eligible for upgrade. |
| QUEUED | The app is in the queue to be upgraded based on the number of apps and consumer accounts. |
| UPGRADING | The app is in the process of being upgraded. |
| COMPLETED | The app has upgraded successfully. |
| QUEUED_RETRY | The setup script or other check failed and the app is returned to the upgrade queue. |
| FAILED | The app upgrade failed. Upgrades can fail on the provider side, for example due to an error in the setup script. Upgrades can also fail on the consumer side if the app is disabled, the consumer account is inactive, etc. |

## Monitor the state of an upgrade

To view the upgrade state of an app, use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md).

For example, in a situation where you updated the default release directive and want to see if all apps have
reached the target version. To find application instances that have not yet finished the upgrade, use the query in
the following example:

```sqlexample
SELECT * FROM snowflake.data_sharing_usage.APPLICATION_STATE
```

This view includes columns that are specific to upgrades, including the upgrade state and the region where
the app is deployed. For information on upgrade states see Upgrade states.

## Troubleshoot upgrade problems

The Snowflake Native App Framework provides several ways to troubleshoot the upgrade as described in the following sections:

### Identify upgrade errors

Consumers can use the [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command to view error messages related to
failed upgrades. This command provides insight into the errors that occurred during the upgrade process.

Providers can use the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) to view error messages for failed upgrades.
Using this view, providers can diagnose issues with specific applications. See Monitor the state of an upgrade
for more information.

### Use logging and event tracing

If [logging and event tracing](event-about.md) is configured for the
app, providers can query the event table to diagnose problems with the app upgrade.

See [View the logs and events in the event table](event-manage-provider.md) for more information.

### Monitor the state of an app’s services

To view information about the status of a compute pool or service within an app,
consumers can use the following system functions:

* [DESCRIBE COMPUTE POOL](../../sql-reference/sql/desc-compute-pool.md)
* [SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md)

Consumers can share this information back to providers. Providers can also configure event sharing
to return this information.

## Disabled apps

When an app installed in the consumer account is disabled, it is no longer usable.
An app installed in a consumer account can become disabled for multiple reasons, including:

* Problems with the application package
* Problems with the installed application
* Problems with the consumer account

Both providers and consumers should avoid situations where an app remains disabled for an extended
period of time. Disabled apps can become unusable and must be reinstalled

### Upgrade a disabled app

Disabled apps are not part of the normal upgrade process and cannot be upgraded. If a disabled app becomes
reenabled, it is automatically upgraded to the version and patch of the release directive. However, if the
version or patch is no longer available, the app cannot be upgraded and must be reinstalled.

For example, if a disabled app is on version `v1`, but the current and previous versions in the release channel are `v2` and `v3`, the app cannot be upgraded and is unusable.

### Reasons an app can become disabled

You can view the DISABLEMENT_REASONS column of the
[APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) to see the reasons an app is disabled. The
following table lists the possible values for the DISABLEMENT_REASONS column:

| Value | Status description | Is recoverable? |
| --- | --- | --- |
| MANUALLY_DISABLED | The app is disabled by Snowflake | Yes. To re-enable the app, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_INACTIVE | The account becomes inactive by being locked or suspended causing the app to be unavailable. In this state a consumer cannot execute any SQL queries in their account and the app cannot be upgraded. | Yes. The app is automatically re-enabled if the account lock or suspension is removed |
| PACKAGE_VERSION_IS_MISSING | The application package version for the app was dropped by the provider. | Possibly. This can be caused by a temporary platform outage, in which case the app may recover automatically. Otherwise, the provider can work with Snowflake Support to attempt version recovery. Contact the application provider for more details. |
| CMK_ACCESS_DENIED | The consumer manages the encryption key themselves (ENCRYPT_USE_CMK_KMS is enabled) and Snowflake doesn’t have access to this key. | Yes. To re-enable the app, ensure that the cloud provider configuration to retrieve the CMK is correct and that Snowflake has access to the key. |
| LISTING_ACCESS_REVOKED | The listing used to create the app is no longer available. Possible reasons for this status include:   * The provider deleted the listing * The provider manually removed access to the private listing from the consumer account | Possibly. Recoverability depends on the reason why access was revoked.  For example, if the listing was deleted it is not recoverable. If a consumer account was manually removed from the private listing, access to the listing and app can be restored. |
| LISTING_TRIAL_USAGE_EXCEEDED | The application has exceeded the usage limit for a usage-based trial listing. | No |
| LISTING_PAYMENT_REQUIRED | The listing used to install the app is a paid listing and requires payment for further usage. | Yes. The consumer must correctly set up payment for the app. |
| LISTING_TRIAL_TIME_EXCEEDED | The application exceeded the trial duration. | No |
| APPLICATION_PACKAGE_NOT_AVAILABLE | The application package used to create the app no longer exists. The provider may have dropped the corresponding application package. | No |
| APPLICATION_PACKAGE_DISABLED | The application package used to create the app is disabled by the Snowflake. | Yes. The app is re-enabled, if Snowflake re-enables the application package. |
| APPLICATION_SUSPENDED | The app resources for example, tasks, services, and compute pools, are suspended due to the app being disabled.  The suspended objects remain suspended until the app is re-enabled and there are no other reasons the app was disabled. | Yes |
| APPLICATION_SUSPEND_RESUME_IN_PROGRESS | The app resources, for example tasks, services, and compute pools, are currently resuming. | Yes |

---
title: Use and manage Snowflake Native Apps as a consumer
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-about.md
section: Native Apps Framework
---

# Use and manage Snowflake Native Apps as a consumer

Consumers can discover and install apps published to the Snowflake Marketplace or shared
using private listings.

The Native Apps Framework allows consumers to perform the following:

* Use the app by accessing data via Snowflake worksheets.
* View Streamlit apps created by the provider.
* Grant privileges on the app object to users in your organization.
* Associate references that allow access to object required by the app.
* Share event and logging information with the provider.

## Consumer workflow for working with a Snowflake Native App

The following workflow is what consumers typically do when working with apps:

1. [Become a consumer](../../collaboration/consumer-becoming.md).
2. [Install an app from a listing](ui-consumer-installing.md).
3. [Review the access requests from the app](ui-consumer-granting-privs.md).

   This includes granting the privileges and creating references required by the app.
4. (Optional) [Set up an event table](ui-consumer-enable-logging.md) to enable logging and
   event sharing for an app.

## Apps installed from trial listings

When the trial period ends for an app installed from a trial listing, Snowflake
automatically suspends the app unless the consumer converts the app to a full listing.
When the trial period is about to expire, Snowflake sends an email notification before the app is
suspended.

Snowflake recommends that consumers convert the trial listing into a full listing before the
trial period expires. After the app is suspended it may not be possible to resume the app. For example,
if the provider removes the current version of the app or there are unresolved state changes, the
app cannot be resumed.

When an app installed from a trial listing is suspended, all data written inside the app is retained
as long as the consumer does not delete the app.

If the app installed from a trial listing creates objects in the consumer account outside
the application object, consumers can retain these objects after the app is uninstalled. However, they
must transfer ownership of the objects before uninstalling the app. See [Uninstall a Snowflake Native App](ui-consumer-managing-applications.md)

---
title: Use feature policies to limit the objects an app can create
source: https://docs.snowflake.com/en/developer-guide/native-apps/ui-consumer-feature-policies.md
section: Native Apps Framework
---

# Use feature policies to limit the objects an app can create

This topic describes how to use feature policies to limit the objects that a Snowflake Native App
can create.

## About feature policies

If an app is configured to use
[automated granting of privileges](requesting-auto-privs.md), the app can request to use
the following privileges:

* EXECUTE TASK
* EXECUTE MANAGED TASK
* CREATE WAREHOUSE
* CREATE COMPUTE POOL
* BIND SERVICE ENDPOINT
* CREATE DATABASE
* CREATE EXTERNAL ACCESS INTEGRATION

If the app is configured to use these privileges, a consumer cannot directly revoke these privileges
after the app is installed. However, consumer administrators can use feature policies
to limit the objects an app can create in the consumer account.

For example, if a consumer does not want an app to create warehouses or compute pools, a consumer account
administrator can create a feature policy that prohibits a particular app or all apps from
creating warehouses or compute pools.

Feature policies allow consumers to restrict an app from creating or using the following
objects:

* COMPUTE POOLS
* DATABASES
* TASKS
* WAREHOUSES

> **Note:**
>
> External access integrations can’t be blocked using feature policies. Instead, consumers
> can choose to approve or decline the endpoints for an app using app specifications.

## Workflow

The general workflow for using feature policies to limit the objects an app can create is:

1. View the listing for the app to determine the privileges the app is
   requesting.
2. If there are any objects you want to restrict, create a feature policy to block these objects.

   For more information, see Create a new feature policy.
3. Apply the feature policy to the account or to a specific object.

   For more information, see Assign a feature policy at the account level and
   Apply a feature policy to an app.

## Replication considerations when using feature policies

Feature policy references at the account-level are replicated when specifying the database containing policy, for example, by setting `ALLOWED_DATABASES = policy_db` in a replication group or failover
group.

If the account has already been replicated to a target account, a consumer account administrator must
do the following:

1. Update the replication or failover group in the source account to include the databases and object types
   required to successfully replicate the feature policy.
2. Execute a refresh operation to update the target account.

> **Note:**
>
> The feature policy must be in the same account as the account-level policy assignment.

If you have a feature policy set on the account and you do not update the replication or failover group to include the policy_db containing the policy, this creates a dangling reference in the target account. This means that Snowflake cannot locate the policy in the target account because the fully-qualified name of the policy points to the database in the source account. The result is that the target account or users in the
target account are not required to comply with the feature policy.

To successfully replicate a feature policy, verify that the replication or failover group includes the object types and databases required to prevent a dangling reference.

For more information, see [Replication considerations](../../user-guide/account-replication-considerations.md).

## Feature policy precedence

Consumers can apply a feature policy to all applications in an account or a specified application.
If there are feature policies applied to more than one of these, the most specific feature policy
overrides more general feature policies. The following summarizes the order of precedence:

Account:
:   Feature policies applied to an account are the most general feature policies. They are overridden
    by feature policies applied to a specific object, for example, an application

Object:
:   Feature policies applied to a specific object override feature policies applied to the account.

Consumers can use this precedence to fine-tune the objects an app can create in their account.
For example, a consumer can apply an account-level feature policy that prohibits all apps in the account from creating a
database. If an app tries to create a database during installation, the installation fails.

However, consumers can also create a feature policy with no restrictions and apply tha feature policy
to a specific app. That app would be allowed to create a database.

For more information, see Create a new feature policy.

## Privileges required to use feature policies

The following table describes the privileges require to create and use feature policies:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FEATURE POLICY | SCHEMA | Required to create a feature policy. This privilege must be granted on the schema containing the feature policy. |
| APPLY FEATURE POLICY | ACCOUNT |  |
| APPLY or OWNERSHIP | FEATURE POLICY |  |

## Working with feature policies

Consumers can use Snowsight or SQL to manage the lifecycle of a feature policy.

### Create a new feature policy

Consumers can create feature policies to prohibit an app from creating certain
types of objects. The following example shows how to create a feature policy to
prohibit an app from creating a database:

```sqlexample
CREATE DATABASE feature_policy_db;
CREATE SCHEMA sch;
CREATE FEATURE POLICY block_create_db_policy
  BLOCKED_OBJECT_TYPES_FOR_CREATION = (DATABASE);
```

> **Note:**
>
> Feature policies must be created within a schema.

Consumers can also create a feature policy that doesn’t restrict creating objects,
as shown in the following example:

```sqlexample
CREATE FEATURE POLICY block_nothing_policy
  BLOCKED_OBJECT_TYPES_FOR_CREATION = ();
```

### Assign a feature policy at the account level

Consumers can apply a feature policy at the account level by using the
[ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command, as shown in the following example:

```sqlexample
ALTER ACCOUNT
  SET FEATURE POLICY feature_policy_db.sch.block_create_db_policy
  FOR ALL APPLICATIONS;
```

This command applies the `block_create_db_policy` policy for any app that is installed
in the account. After applying this policy, apps can no longer create databases.

### Apply a feature policy to an app

To apply a feature policy when creating an app manually, use the WITH FEATURE POLICY clauase of the
[CREATE APPLICATION](../../sql-reference/sql/create-application.md) command,
as shown in the following example:

```sqlexample
CREATE APPLICATION hello_snowflake_app
  WITH FEATURE POLICY = feature_policy_db.block_create_db_policy;
```

To app a feature policy to an app, use the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command,
as shown in the following example:

```sqlexample
ALTER APPLICATION hello_snowflake_app
  SET FEATURE POLICY feature_policy_db.block_create_db_policy;
```

### Unapply a feature policy

To unapply a feature policy at the account level, use the
[ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command,
as shown in the following example:

```sqlexample
ALTER ACCOUNT UNSET FEATURE POLICY FOR ALL APPLICATIONS;
```

To unapply a feature policy for a specific app, use the
[ALTER APPLICATION](../../sql-reference/sql/alter-application.md) command,
as shown in the following example:

```sqlexample
ALTER APPLICATION FEATURE_POLICY_TEST_APP UNSET FEATURE POLICY;
```

### Delete a feature policy

To delete a feature policy, use the [DROP FEATURE POLICY](../../sql-reference/sql/drop-feature-policy.md)
command, as shown in the following example:

```sqlexample
DROP FEATURE POLICY block_create_db_policy;
```

## View information about feature policies

To view the feature policies in an account for which you have access privileges, use the
[SHOW FEATURE POLICIES](../../sql-reference/sql/show-feature-policies.md)
command:

```sqlexample
SHOW FEATURE POLICIES ON ACCOUNT;
```

To view the feature policies applied to an app, use the following command:

```sqlexample
SHOW FEATURE POLICIES ON APPLICATION hello_snowflake_app;
```

To see information about a specific feature policy, use the
[DESCRIBE FEATURE POLICY](../../sql-reference/sql/desc-feature-policy.md),
as shown in the following example:

```sqlexample
DESCRIBE FEATURE POLICY feature_policy_db.block_create_db_policy;
```

---
title: Use logging and event tracing for an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/event-about.md
section: Native Apps Framework
---

# Use logging and event tracing for an app

This topic describes how providers can configure a Snowflake Native App to record log messages and trace events.

## About log messages and trace events in an app

The Snowflake Native App Framework supports using the Snowflake [logging and tracing](../logging-tracing/logging-tracing-overview.md)
functionality to gather information about an app. Providers can configure an app to record and analyze the following:

* [Log messages](../logging-tracing/logging.md) — Independent, detailed messages with information about
  the state of a specific piece app code.
* [Trace events](../logging-tracing/tracing.md) — Structured data that providers can
  use to get information spanning and grouping multiple parts of your code. Trace events allows an app to emit information related
  to its performance and behavior.
* [Metrics](../logging-tracing/metrics.md) - Information about stored procedure and UDF resource consumption
  based on the CPU and memory metrics that Snowflake generates.

To configure an app to emit log messages and trace events, providers set the log and trace levels in the manifest file.
See [Set the log and trace levels for an app](event-definition.md).

Providers can also configure an app to use event sharing to allow the consumer to share the log messages
and trace events with the provider. See About event sharing for
more information.

## About application lifecycle events

Snowflake records events that provide visibility into the status and history of a
Snowflake Native App. These Snowflake-provided events are referred to as
*application lifecycle events*.

For example, if a consumer’s app instance transitions to a failed state due to an
error during an upgrade, you can use application lifecycle events to view this
historical event.

Snowflake logs these application lifecycle events in the
[event table](../logging-tracing/event-table-setting-up.md) in the
account. By default, application lifecycle events are not logged.

If [BCR bundle 2026_02](../../release-notes/bcr-bundles/2026_02/bcr-2232.md) is enabled, the
[LOG_EVENT_LEVEL](../../sql-reference/parameters.md) property in the manifest file controls which
application lifecycle events are recorded in the event table. To enable logging of application
lifecycle events, set the `log_event_level` property in the
[manifest file](manifest-reference.md). See
[Set the log and trace levels for an app](event-definition.md) for more information.

If BCR bundle 2026_02 is disabled, the `log_level` property in the manifest file controls
which application lifecycle events are recorded instead.

The value of the `log_event_level` (or `log_level` if the BCR bundle is disabled)
property in the manifest file determines the severity of events recorded in the event table.
Application lifecycle events support the following severity levels:

* `TRACE`
* `DEBUG`
* `INFO`
* `WARN`
* `ERROR`
* `FATAL`
* `OFF`

> **Note:**
>
> Each logging level includes records from all lower levels. For example, setting the log level
> to `WARN` also records `ERROR` and `FATAL` events.

### Query application lifecycle events

After you configure the log level for your app, Snowflake records application
lifecycle events in the active event table in your Snowflake account. You can query
the event table to view these events.

The following SELECT statement retrieves application lifecycle events for a specific
app recorded in the past hour:

```sqlexample
SELECT TIMESTAMP, RESOURCE_ATTRIBUTES, RECORD, VALUE
  FROM <your_event_table>
  WHERE TIMESTAMP > DATEADD(hour, -1, CURRENT_TIMESTAMP())
    AND RESOURCE_ATTRIBUTES:"snow.application.name" = '<your_app_name>'
    AND RECORD_TYPE = 'EVENT'
  ORDER BY TIMESTAMP DESC
  LIMIT 10;
```

## About event sharing

Event sharing allows the provider to collect information about an app’s performance and behavior.
A provider can configure an app to request that the consumers share the log messages
and trace events with the provider. Event sharing requires that the provider and consumer configure an
event table in their account to store the log messages and trace events emitted by the app.

When event sharing is enabled, the log messages and trace events that are inserted into the event table in the
consumer account are also inserted into the event table in provider account.

> **Note:**
>
> The only events with a `RECORD_TYPE` of `EVENT` that support event
> sharing are Snowflake Native Apps application lifecycle events and Snowpark Container Services platform events.

## Considerations when using event sharing

Before configuring logging and event sharing for an app, providers must consider the following:

* Providers are responsible for all costs associated with event sharing on the provider side, including data
  ingestion and storage.
* Providers must have [an account to store shared events](event-manage-provider.md)
  in each region where you want to support event sharing.
* Providers must define the default log level and trace level for an app in the manifest file.

## Considerations when migrating from the previous event sharing functionality

When migrating from the existing event sharing functionality to use event definitions, providers
should consider the following.

* The previous event sharing functionality is equivalent to the OPTIONAL ALL event definition.
* Published versions and patches of an app that used the previous functionality will have the
  OPTIONAL ALL event definition by default. Providers do not need to add this event definition
  to the manifest file.

To begin using event definitions, providers can add supported event definitions to the manifest
file. This is applicable to new apps as well as new versions and patches of existing apps.

> **Note:**
>
> To being begin requesting more granular log and event sharing, providers only have to add
> event definitions to the manifest file. No other actions are required for providers.

## Workflow - Set up event sharing for an app

Event sharing allows consumers to share log messages and trace events with the provider.

The following workflow shows how to set up and enable event sharing for an app:

1. The provider [sets the log and trace levels](event-definition.md)
   for the app.
2. The provider [adds event definitions](event-definition.md) to the manifest file.

   Event definitions act as filters on the log messages and trace events emitted by the app.
   Providers can configure event definitions to be required or optional.
3. The provider [sets up an event table](event-manage-provider.md)
   in their organization.
4. The provider publishes the app.

When a consumer installs an app, they can set up an event table and enable event sharing.
See [Enable logging and event sharing for an app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging)
for more information on the consumer requirements for event sharing.

## Monitor consumer application health

You can use the `LAST_HEALTH_STATUS` and `LAST_HEALTH_STATUS_UPDATED_ON` columns
of the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md) to monitor the health of consumer instances of your
app. The `LAST_HEALTH_STATUS` column has the following possible values:

* `OK`: The consumer instance is healthy.
* `FAILED`: The consumer instance is in an error state.
* `PAUSED`: The consumer manually paused the app.

The following code sample demonstrates using the `APPLICATION_STATE` view
to retrieve the health status of all consumer instances of your app:

```sqlexample
SELECT
    CONSUMER_ORGANIZATION_NAME,
    CONSUMER_ACCOUNT_NAME,
    LAST_HEALTH_STATUS,
    LAST_HEALTH_STATUS_UPDATE_TIME
FROM
    SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_STATE
WHERE
    PROVIDER_ORG_NAME = '<your_provider_org_name>'
    AND APPLICATION_NAME = '<your_app_name>'
ORDER BY
    LAST_HEALTH_STATUS_UPDATE_TIME DESC;
```

The preceding query may return results similar to the following:

```output
CONSUMER_ORG_NAME    CONSUMER_ACCOUNT_NAME    LAST_HEALTH_STATUS    LAST_HEALTH_STATUS_UPDATE_TIME
------------------   ---------------------    ------------------    -------------------------------
consumer_org_1      consumer_account_1       OK                    2024-01-15 10:30:00.000
consumer_org_2      consumer_account_2       FAILED                2024-01-15 09:45:00.000
consumer_org_3      consumer_account_3       PAUSED                2024-01-14 16:20:00.000
```

---
title: Use monitoring for an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/monitoring.md
section: Native Apps Framework
---

# Use monitoring for an app

This topic describes how providers can monitor consumer app health for a Snowflake Native App.

## Monitor consumer application health

Your application can report its health status to Snowflake, which allows you to
monitor the health of consumer instances of your app.

To report health status, your app uses the system-defined
`SYSTEM$REPORT_HEALTH_STATUS(VARCHAR)` function, passing in the health status
as an enum value:

* `OK`: The consumer instance is healthy.
* `FAILED`: The consumer instance is in an error state.
* `PAUSED`: The consumer manually paused the app.

You can use the `LAST_HEALTH_STATUS` and `LAST_HEALTH_STATUS_UPDATED_ON` fields
of the [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md)
to monitor the health of consumer instances of your app. The `LAST_HEALTH_STATUS`
field has the most recent value passed in by the app running in the consumer account.

The following code sample demonstrates using the APPLICATION_STATE view
to retrieve the health status of all consumer instances of your app:

```sqlexample
SELECT
    CONSUMER_ORGANIZATION_NAME,
    CONSUMER_ACCOUNT_NAME,
    LAST_HEALTH_STATUS,
    LAST_HEALTH_STATUS_UPDATE_TIME
FROM
    SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_STATE
WHERE
    PROVIDER_ORG_NAME = '<your_provider_org_name>'
    AND APPLICATION_NAME = '<your_app_name>'
ORDER BY
    LAST_HEALTH_STATUS_UPDATE_TIME DESC;
```

The preceding query may return results similar to the following:

```output
CONSUMER_ORG_NAME    CONSUMER_ACCOUNT_NAME    LAST_HEALTH_STATUS    LAST_HEALTH_STATUS_UPDATE_TIME
------------------   ---------------------    ------------------    -------------------------------
consumer_org_1       consumer_account_1       OK                    2024-01-15 10:30:00.000
consumer_org_2       consumer_account_2       FAILED                2024-01-15 09:45:00.000
consumer_org_3       consumer_account_3       PAUSED                2024-01-14 16:20:00.000
```

---
title: Use owner’s rights and restricted caller’s rights in an app
source: https://docs.snowflake.com/en/developer-guide/native-apps/restricted-callers-rights.md
section: Native Apps Framework
---

# Use owner’s rights and restricted caller’s rights in an app

This topic describes how to configure an app to use owner’s rights and
restricted caller’s rights.

## About owners right’s and restricted caller’s rights in an app

In the context of an app, the following
types of executables are supported:

* Stored procedures owned by the app
* Services available in apps with containers

Each of these types of executables can be configured to use either owner’s rights or restricted caller’s rights.

Owner’s rights:
:   By default, executables within an app use owner’s rights, which means that they run with the privileges granted to the owner of the executable, which is the app itself.

    > For example, owner’s rights allow an executable to access data in the provider account
    > and present that data to the consumer. However, they do not allow the consumer to access
    > the data directly.
    >
    > For example, the [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) command creates a
    > stored procedure that uses owner’s rights by default. Consumers can call the
    > stored procedure if they have been granted access using application roles. If the
    > app has the privileges to perform an operation, then the stored procedure can perform that
    > operation.
    >
    > For general information on owner’s rights, see
    > [Understanding caller’s rights and owner’s rights stored procedures](../stored-procedure/stored-procedures-rights.md).

Restricted caller’s rights:
:   Restricted caller’s rights allow an executable to run with caller’s rights, but restrict
    which of the caller’s privileges the executable runs with. With restricted caller’s rights,
    an executable owned by an app cannot run with a specific privilege unless an administrator
    in the consumer account explicitly allows it by using the [GRANT CALLER](../../sql-reference/sql/grant-caller.md)
    command.

    > **Note:**
    >
    > To guarantee that executables in an app are secure, Snowflake Native Apps do not support unrestricted
    > caller’s rights.

    For general information on restricted caller’s rights, see
    [Restricted caller’s rights](../restricted-callers-rights.md).

### Scope of restricted caller’s rights in an app

Snowflake recommends that consumers grant caller grants at a container level and not on specific objects in their account.

Schema level:
:   Grants caller rights to the schema, but does not grant any rights to objects
    in the schema. For example, granting the CALLER USAGE caller grant on a schema only
    grants the USAGE privilege on the schema. To grant access to a specific object, for
    example a function, use GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN SCHEMA.

Database level:
:   Granting caller grants at the database level only allows an executable to
    access the database and all schemas in the database. For example, granting the
    CALLER USAGE caller grant grants the USAGE privilege on the database. However, to
    grant access to a specific object, you must use the following command:

    ```sqlexample
    GRANT INHERITED CALLER USAGE ON ALL FUNCTIONS IN DATABASE;
    ```

Account level:
:   Granting caller grants at the account level allows an executable to perform account-level operations.
    Granting the CALLER USAGE caller grant only allows the executable to access the account,
    it does not provide access to objects within the account.

    To allow access to specific objects, you grant access to specific types of object in the account.
    For example, granting the CREATE DATABASE caller grant allows an executable to create databases in
    the consumer account as shown in the following example:

    ```sqlexample
    GRANT CALLER CREATE DATABASE ON ACCOUNT TO my_app;
    ```

### Account-level caller grants that can be granted to an app

Providers can configure an executable in an app to use the following account-level caller grants:

* CREATE DATABASE
* EXECUTE ALERT
* EXECUTE MANAGED TASK
* EXECUTE TASK
* READ SESSION
* VIEW LINEAGE

> **Note:**
>
> Consumers should use caution when granting account-level caller grants to an app.

## Determine the access requirements of an app

Snowflake Native Apps give providers flexibility in how they configure access to the data and executables
managed by the app. The following table provides guidelines for which mechanism to use depending on
the access requirements of the app:

| Access required | How to get access |
| --- | --- |
| Data or functions owned by the app | Use owner’s rights by default. Providers do not need to request access from the consumer to create or access objects owned by the app. |
| Specific tables, views, or functions in the consumer account | Request references from the consumer |
| Tables, views, functions, and row policies owned by another user or role. | Use restricted caller’s rights, which allow the consumer to enable access to these objects. |
| Broad access to consumer-owned databases | Use database role grants |
| Queries that access a combination of consumer and provider data | Use references and owner’s rights together |
| Account-level objects | Provide custom scripts that contain GRANT commands to grant privileges on specific objects. |
| Perform account-level operations such as creating databases or executing tasks. | Use restricted caller’s rights, which allow the consumer to enable access to perform these actions. |

## Add the `restricted_callers_rights` property to the manifest

As a provider, if you configure an executable in an app to use restricted caller’s rights,
Snowflake recommends that you add the `restricted_callers_rights` to the manifest
file as shown in the following example:

```yaml
restricted_callers_rights:
  enabled: true
  description: This app includes stored procedure that uses restricted caller's rights.
```

Although `restricted_callers_rights` is not required, if it is present and `enabled`
is set to `true,` Snowsight includes a section named Restricted caller’s rights
in the app’s listing.

## Configure a procedure or service to use restricted caller’s rights

To create a stored procedure that uses restricted caller’s rights, use the
EXECUTE AS RESTRICTED CALLER clause when the app creates an executable as shown
in the following example:

```sqlexample
CREATE OR REPLACE PROCEDURE CORE.HELLO()
RETURNS STRING
LANGUAGE SQL
EXECUTE AS RESTRICTED CALLER
AS
BEGIN
  RETURN 'Hello Snowflake!';
END;
```

For more information, see [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md).

## Using a restricted caller’s rights service in an app with containers

For more information on configuring a service to use restricted caller’s rights, see
[About using Snowflake-provided caller credentials (caller’s rights)](../snowpark-container-services/spcs-execute-sql.md).

In an app with containers, a Snowpark Container Services service can run with privileges of the app (owner’s rights)
or with the privileges of the caller of the service (restricted caller’s rights). However,
the service _owner_ role, instead of the service _user_ role, is the app. When granting caller
grants, a consumer specifies the app by using the TO APPLICATION clause as shown in the
following example:

```sqlexample
GRANT CALLER USAGE ON DATABASE consumer_db TO APPLICATION hello_snowflake_app;
```

This example allows the service to run with restricted caller’s rights to
use the caller’s role to access the `consumer_db` database.

## Limitations on restricted caller’s rights in an app

The following limitations apply when using restricted caller’s rights within an app.

### General limitations on restricted caller’s rights

For general limitations on restricted caller’s rights on executables, see
[Limitations of an executable with restricted caller’s rights](../restricted-callers-rights.md). The limitations listed there also apply to
apps.

### Additional limitations for apps

In addition to the general limitations on restricted caller’s rights, apps have
the following limitations:

* *Unrestricted* caller’s rights are not supported for executables in an app.
* Cannot execute the following commands:

  + SHOW ROLES
  + SHOW USERS
  + SHOW [CALLER] GRANTS
  + SHOW AVAILABLE LISTINGS
* Cannot execute the following functions:

  + ALL_USER_NAMES
  + GET_USERS_FOR_COLLABORATION
  + CURRENT_IP_ADDRESS
  + CURRENT_AVAILABLE_ROLES
  + CURRENT_SECONDARY_ROLES
  + SYSTEM$ALLOWLIST (or the deprecated SYSTEM$WHITELIST)
* Persistent reference functions are not supported.
* Relative paths to objects on a stage are not supported.
* Restricted caller’s rights executables cannot access the app’s internal objects.
* Executables with owner’s rights cannot call other executables that use unrestricted
  caller’s rights.

  Although executables with owner’s rights owned by the application can invoke executables
  with restricted caller’s rights, these executables must also be owned by the app. For example,
  an app’s stored procedure with owner’s rights cannot call a consumer’s stored procedure with
  restricted caller’s rights.

## Calling billing functions from a restricted caller’s rights procedure in an app

Billing functions such as [SYSTEM$CREATE_BILLING_EVENT](../../sql-reference/functions/system_create_billing_event.md)
or [SYSTEM$CREATE_BILLING_EVENTS](../../sql-reference/functions/system_create_billing_events.md) cannot be called from an
RCR executable. To call these functions from an
RCR executable in an app, include these functions in an owner’s rights procedure
and call the procedure from the restricted caller’s rights procedure.

## Intellectual property protection and restricted caller’s rights

Apps that include owner’s rights stored procedures redact information about the internal
implementation of the app. For more information, see
[Protect provider intellectual property](redacted-content.md).

Stored procedures with restricted caller’s rights also redact internal information about the app.
In particular, this applies to the following commands, views, and functions where much of the information
is redacted:

* The [DESCRIBE PROCEDURE](../../sql-reference/sql/desc-procedure.md) command
* The [PROCEDURES view](../../sql-reference/info-schema/procedures.md) of the Information Schema
* The [PROCEDURES view](../../sql-reference/account-usage/procedures.md) of the Account Information Schema
* The [GET_DDL](../../sql-reference/functions/get_ddl.md) function

However, unlike stored procedures with owner’s rights, procedures using restricted caller’s rights
do not redact information from the query history and query profiles.

## Developing apps that use restricted caller’s rights

In general, only roles that have been granted the MANAGE CALLER GRANTS privilege can
grant caller grants to an executable. However, during the development lifecycle of a
Snowflake Native App, providers may need to create and drop the app frequently. Snowflake Native Apps
provide a mechanism for creating caller grants for an
[app in development mode](installing-testing-application.md).

Providers can use a role without the MANAGE CALLER GRANTS to grant caller grants to an app.
However, this requires all of the following:

* The app must be created in [development mode](installing-testing-application.md).
* The current role has the app owner role in the role hierarchy.
* The app owner role has a superset of the caller grants that are being granted.

  > **Note:**
  >
  > The ALL CALLER PRIVILEGES privilege is a superset of any set of caller privileges.

The superset requirement cannot be met through INHERITED CALLER grants, which normally
cover all current and future objects of a particular type under a specific container.
For more information, see [GRANT CALLER](../../sql-reference/sql/grant-caller.md).

To meet this requirement, the caller grants being granted must be explicitly covered by the
caller grants that are granted to the app owner role.

### Enable app developers to grant caller rights to an app in dev mode

To allow an app owner role to grant a certain set of caller grants to an app in development
mode, a user with the ACCOUNTADMIN role or a role that has been granted the MANAGE CALLER GRANTS
privilege must first grant caller grants that cover the set of grants to the app owner role.

The following example shows how to grant the required caller privileges on the databases in an
account:

```sqlexample
GRANT ALL INHERITED CALLER PRIVILEGES
  ON ALL DATABASES IN ACCOUNT
  TO ROLE app_dev_role;
```

This command grants all caller grants on all databases to the `app_dev_role` role.

> **Note:**
>
> This command only grants caller grants. It does not allow the app developer to access
> an object that they are not already allowed to access.

### Additional security measures for apps in development mode

Allowing app developers owner roles without the MANAGE CALLER GRANTS privilege streamlines
the app development process, but it also introduces potential security risks. To minimize
these risks, Snowflake verifies that the app owner continues to have required caller grants
for the app. If the app owner loses the required caller grants, the app loses them also.

---
title: Use Snowflake machine learning models in a Snowflake Native App
source: https://docs.snowflake.com/en/developer-guide/native-apps/snowflake-ml-na-about.md
section: Native Apps Framework
---

# Use Snowflake machine learning models in a Snowflake Native App

This topic describes how to use a [Snowflake ML](../snowflake-ml/overview.md)
model in a Snowflake Native App. It also describes how to call
[Snowflake Cortex](../../user-guide/snowflake-cortex/aisql.md) functions from an app.

## Overview of using Snowpark ML in a Snowflake Native App

Snowflake ML is an integrated set of capabilities for end-to-end machine learning
in a single platform on top of your governed data. You can this functionality within
a Snowflake Native App.

The Snowflake Native App Framework supports the following use cases:

* Providers include a training algorithm in the app, but the trained model is not included.
  Providers include the source code for the model, for example linear regression or logistical
  regression, in the app.

  After the app is installed, training occurs on data in the consumer account, for example by calling the
  model’s `fit()` method.

  For more information, see [Create, train and use a Snowflake ML model in an app](snowflake-ml-na-no-model.md).
* Providers share data with the consumer and include a training algorithm in the app. After installation,
  the app trains the model based on data in the consumer account that has been shared with the app

  For more information, see [Create, train and use a Snowflake ML model in an app](snowflake-ml-na-no-model.md).
* Providers train a model based on data in their account and include these models in the app. When the app
  is installed, consumers can use the model directly, for example by calling the model’s
  :predict() method.

  For more information, see [Include a trained model in an app](snowflake-ml-na-with-model.md).

## Limitations when using Snowflake ML in an app

The following limitations apply when using Snowflake ML in an app:

* Only models based on warehouses are currently supported.
* Providers must use the Snowflake Model Registry to share models with consumers. Snowpark
  ML functions like `fit()` store results in a temporary stage which is not supported
  for Snowflake Native Apps.
* There are limitations on machine learning algorithms that are runnable in a Snowpark sandbox
  within a warehouse. More complex machine learning frameworks like TensorFlow or PyTorch are
  not runnable in these sandboxes.
* Training performed on a provider’s dataset may not yield a model sufficiently effective for
  a consumer’s data. Training a model on consumer data may provide better results.

## Calling Snowflake Cortex functions from an app

To call a [Snowflake Cortex function](../../user-guide/snowflake-cortex/aisql.md) from
an app, *consumers* must first grant the CORTEX_USER database role to the app as shown in the following
example:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.CORTEX_USER TO APPLICATION my_app;
```

> **Note:**
>
> Providers should mention in the listing of an app that consumers must grant the CORTEX_USER database role.

The CORTEX_USER database role in the SNOWFLAKE database includes the privileges that allow users to
call Snowflake Cortex LLM functions. See [Snowflake Cortex AI Functions (including LLM functions)](../../user-guide/snowflake-cortex/aisql.md) for more information.

After consumers this role to the app, the app can call Snowflake Cortex functions as shown in the following
example:

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRANSLATE('La plateforme unique de Snowflake élimine les silos de données!','fr','en');
```

---
title: Use versioned schema to manage app objects across versions
source: https://docs.snowflake.com/en/developer-guide/native-apps/versioned-schema.md
section: Native Apps Framework
---

# Use versioned schema to manage app objects across versions

This topic describes how to use versioned schema to manage app state when installing and upgrading a Snowflake Native App.

## About versioned schemas

Versioned schemas are special types of database schema that are designed to handle stateless objects from one version
to another.

A versioned schema contains metadata about the objects in an app that are associated with a specific version.
Version pinning is a feature of versioned schemas that allows an app to know what job, queries, etc. are associated with
these objects.

When an object in a versioned schema runs a query, for example, that query is “pinned” to the version of the app running the query.

Version pinning is important when upgrading an app to a new version. Consider the context where V1 of an app runs a
complex query that takes a long time to complete.

If an upgrade occurs while this query is still running, the upgrade state of the app changes to `COMPLETE` and
the app is upgraded to `v2`. The previous version state changes to `FINALIZING` until all jobs from version
`v1` have completed.

See [Upgrade states](release-channels-upgrade.md) for more information on the upgrade states for an app.

## Stateful and stateless objects

When developing a new version of an app, providers must consider if the components they are modifying need to preserve
their state from one version or patch to another. A typical app contains two types of components:

Stateless objects
:   Stateless objects are recreated for each new version or patch of the app. Stateless objects only need to be available
    for the lifetime of the version and can be recreated as necessary. Stateless objects are typically the code of the app,
    including stored procedures, user-defined functions, Streamlit apps, and similar content.

    Stateless objects should be created in a versioned schema.

Stateful objects
:   Stateful objects are shared from one version or patch of the app to another. Stateful components are intended to have a
    lifetime across multiple versions of the app. For example, if an app uses a table to store configuration information within
    the consumer account, the contents of this table would need to be preserved during upgrade.

    Stateful objects should be created using a regular schema.

## About versioned schemas

When writing the setup script for the new version of the app, providers must account for stateless and stateful components. To
handle stateless objects the Snowflake Native App Framework provides a special type of database schema referred to as versioned schemas. A versioned schema is
similar to a regular database schema with added functionality to handle multiple versions of objects created by different app
versions.

## Restrictions on versioned schemas

* Snowpark Container Services is not supported in versioned schemas.
* Versioned schema are only available within the context of an application object. They are created only within the setup script.
  Each version of an app has its own setup script and contains versioned schema that are specific to that version.
* A versioned schema can only be used within the setup script of an application package. They can only be created within the
  context of an application object.
* Tasks are not supported on versioned schemas. For example, providers cannot include tags when creating or altering a
  versioned schema. However, providers can use tags inside a versioned schema, as long as they only apply those tags objects
  within a versioned schema in the same app.
* Tags and masking policies are not supported in versioned schemas.
* Grants and future grants are not supported in versioned schemas.
* Versioned schemas cannot be used as either the source or destination of a clone operation.
* Dropping a versioned schema is not supported. Dropping a versioned schema would drop all versions of the objects it contains and
  would impact queries running against older versions or patches of the app.

> **Note:**
>
> To use application roles, tasks, tags, masking policies and Snowpark Container Services within the setup script of an app, you
> must create them in a normal schema.

## Internal implementation of versioned schemas

Internally, versioned schemas contain subschema that correspond to each version of the app.

However, these subschema are not directly accessible to the consumer within the application object. A consumer will only see
objects within the versioned schema that correspond to the version of the app they have installed in their account.

For example, if a consumer uses the [SHOW OBJECTS](../../sql-reference/sql/show-objects.md) command to view the objects
in a versioned schema, they will only see the objects for the version they are currently using.

When writing the setup script, providers must recreate objects in a versioned schema using CREATE OR REPLACE or CREATE IF NOT EXISTS.
This is important because internally, each version of the app has its own objects within the subschema of the versioned schema.

## Using versioned and non-versioned schemas in the setup script

To manage the state of an app during upgrades, the Snowflake Native App Framework uses versioned schemas. A versioned schema
is similar to regular database schema with added functionality to handle multiple versions of objects
created by different application versions.

Versioned schema are only available within the context of an application object. They are created
only within the setup script. Each version of an app has its own setup script and contains versioned
schema that are specific to that version.

When developing a new version of an app, providers must account for changes to the objects that the
app creates using the setup script.

The following example shows a common situation where both versioned and normal schema can be
used in a setup script to create stateless and stateful components:

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA stateless_objects;
CREATE OR REPLACE PROCEDURE stateless_object.py_echo_proc(STR string)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION=3.12
  PACKAGES=('snowflake-snowpark-python')
  HANDLER='echo.echo_proc'
  IMPORTS=('/libraries/echo.py');

CREATE OR ALTER SCHEMA stateful_object;
CREATE TABLE stateful_object.config_props
  prop_name STRING;
  prop_value STRING;
  time_stamp TIMESTAMP;
```

## Create a versioned schema

To create a versioned schema with the setup script, use the
[CREATE OR ALTER VERSIONED SCHEMA](../../sql-reference/sql/create-versioned-schema.md) command as shown in the following example:

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA version_schema;
```

> **Note:**
>
> You should always include the CREATE OR ALTER version of this command to ensure that
> versioned schemas are compatible across versions and patches.

## Use non-versioned schemas for stateful objects

Objects within an app may need to preserve state across versions. For example, configuration
data or data collected while the app has been running may be to be preserved.

These types of objects must reside in a normal database schema and they should be created to persist
during initial installation and upgrades.

The following example shows how to create a stateful object in the setup script:

```sqlexample
CREATE SCHEMA IF NOT EXISTS stateful_object;

CREATE TABLE IF NOT EXISTS stateful_object.config (
  config_param STRING,
  config_value STRING,
  default_value STRING,
  modified_on  TIMESTAMP);

ALTER TABLE stateful_object.config
  ADD COLUMN IF NOT EXISTS modified_on TIMESTAMP;
```

In this example, the setup script defines a configuration table that would persist from one version
of an app to another. If the previous version of the application did not have a `modified_on` column,
the setup script first attempts to fully create the table (in the case of initial installation) or modify
the existing table by adding the column (in the case of an upgrade).

## Use versioned schemas for stateless objects

Some objects within an app do not persist state between versions of an application. For example, code that
defines the application logic, including stored procedures, functions, etc. can be fully recreated in the
setup script without losing any user data or state.

Snowflake recommends that these objects be contained within a versioned schema.

The following example shows how to create a UDF within a versioned schema.

```sqlexample
CREATE OR ALTER VERSIONED SCHEMA stateless_object;
CREATE FUNCTION IF NOT EXISTS stateless_object.add(x int, y int)
  RETURNS INT
  LANGUAGE SQL
  AS $$ x + y $$;
```

## Version pinning

A versioned schema contains metadata about which objects in a Snowflake Native App are associated
with a specific version. Version pinning is a feature of versioned schemas that allows an
app to know what job, queries, etc. are associated with these objects.

When an object in a versioned schema runs a query, for example, that query is “pinned”
to the version of the app running the query.

Version pinning is important when upgrading an app to a new version. Consider the context
where V1 of an app runs a complex query that takes a long time to complete. If an
upgrade occurs while this query is still running, the app will not upgrade until the query
is complete.

---
title: Workflow: Develop an app with containers
source: https://docs.snowflake.com/en/developer-guide/native-apps/container-workflow.md
section: Native Apps Framework
---

# Workflow: Develop an app with containers

This topic describes the general workflow for creating a Snowflake Native App with Snowpark Container Services.

## Understand Snowpark Container Services and the Snowflake Native App Framework

Before beginning to develop a Snowflake Native App with Snowpark Container Services

1. Ensure that you are familiar with [Snowpark Container Services](../snowpark-container-services/overview.md)
   and the [Snowflake Native App Framework](native-apps-about.md).

   The following tutorials are available for these Snowflake products:

   * [Common Setup for Snowpark Container Services Tutorials](../snowpark-container-services/tutorials/common-setup.md)
   * [Create a Snowpark Container Services service](../snowpark-container-services/tutorials/tutorial-1.md)
   * [Create a Snowpark Container Services job service](../snowpark-container-services/tutorials/tutorial-2.md)
   * [Develop an app with the Snowflake Native App Framework](tutorials/getting-started-tutorial.md)
   * [Create a Snowflake Native App with Snowpark Container Services](tutorials/na-spcs-tutorial.md)
2. Review [About Snowflake Native Apps with Snowpark Container Services](native-apps-about.md) to understand how Snowflake Native App with Snowpark Container Services works.
3. Review [Costs associated with apps with containers](container-cost-governance.md) to understand the
   costs associated with developing, publishing, and using an app with containers.

## Create the containers and services to be managed by an app.

The first step in developing an app with containers is to set up the required containers and services using
[Snowpark Container Services](../snowpark-container-services/overview.md).

The basic workflow for using Snowpark Container Services is:

1. Create a repository to store container images.

   This repository exists in the provider account and maintains the container images required by the
   app. See [Create an image repository](container-containers.md)
2. Copy the container images to the image repository.

   After creating the image repository, providers must upload the container images used by the application.
   Snowpark Container Services support using Docker commands to perform the upload.

   See [Upload container images to the image repository](container-containers.md) for
   more information.
3. Create a service specification file.

   The service specification file is a YAML file used to configure and run services within
   Snowpark Container Services. Snowflake Native App with Snowpark Container Services includes this file within the application package.

   See [Create the service specification file](container-containers.md) for more information.
4. Configure block storage and snapshots.

   If the services in your app require using block storage, create a `spec.volumes` in your
   service specification file.

   See [Using block storage volumes with services](../snowpark-container-services/block-storage-volume.md) for more information.
5. Upload the required files to a stage.

   To make the service specification file accessible to the application package,
   providers must upload it to the stage used to store other files required by the application package.

   See [Staging data files from a local file system](../../user-guide/data-load-local-file-system-stage.md) and
   [Staging files using Snowsight](../../user-guide/data-load-local-file-system-stage-ui.md) for more information on uploading files
   to a stage.

   > **Note:**
   >
   > If you are using the Snowflake CLI, you are not required to upload the files to a stage.

## Develop and publish a Snowflake Native App with Snowpark Container Services

The workflow for developing and publishing an app with containers is similar to the
workflow for any Snowflake Native App. However, within each stage of the workflow there are
differences.

The following is a typical workflow for developing and publishing an app with containers:

1. Create the manifest file for the app.

   The manifest file for an app with containers includes configuration information about the
   containers included in the app. See [Create the manifest file for an app](manifest-overview.md) for more information.
2. Create the setup script for the app.

   The specific contents of the setup script depend on the requirements of the app. For
   general information on creating the setup script for an app, see [Create the setup script](creating-setup-script.md).

   Within the setup script you can create the following objects that are specific to
   a Snowflake Native App with Snowpark Container Services:

   * [Add a compute pool to an app with containers](container-compute-pool.md)
   * [Add services to an app](container-services.md)
   * [Add job services to an app](container-services-job.md)

   You can also add other objects that are part of any Snowflake Native App, including:

   > * Warehouses
   > * External access integrations
   > * Secrets
3. Create the application package.

   The process of creating an application package for an app with containers is the
   same as other apps. See [Create and manage an application package](creating-app-package.md) for more
   information.
4. Publish the app

   Publishing an app as a private listing or on the Snowflake Marketplace is the same
   as other apps. See
   [Share an app with consumers](https://other-docs.snowflake.com/en/native-apps/provider-publishing-app-package)
   for more information.

## Snowpark

Snowpark API for Python, Java, and Scala — process data at scale inside Snowflake.

---
title: A Simple Example of Using Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/example.md
section: Snowpark
---

# A Simple Example of Using Snowpark Java

The following example prints the count and names of tables in the current database.
Replace the `<placeholders>` with values that you use to connect to Snowflake.

```java
import com.snowflake.snowpark_java.*;
import java.util.HashMap;
import java.util.Map;

public class SnowparkExample {
  public static void main(String[] args) {
    // Create a Session, specifying the properties used to
    // connect to the Snowflake database.
    Map<String, String> properties = new HashMap<>();
    properties.put("URL", "https://<account_identifier>.snowflakecomputing.com");
    properties.put("USER", "<username>");
    properties.put("PASSWORD", "<password>");
    properties.put("ROLE", "<role_name_with_access_to_public_schema>");
    properties.put("WAREHOUSE", "<warehouse_name>");
    properties.put("DB", "<database_name>");
    properties.put("SCHEMA", "<schema_name>");
    Session session = Session.builder().configs(properties).create();

    // Get the number of tables in the PUBLIC schema.
    DataFrame dfTables = session.table("INFORMATION_SCHEMA.TABLES")
      .filter(Functions.col("TABLE_SCHEMA").equal_to(Functions.lit("PUBLIC")));
    long tableCount = dfTables.count();
    String currentDb = session.getCurrentDatabase().orElse("<no current database>");
    System.out.println("Number of tables in the PUBLIC schema in " + currentDb + " database: " + tableCount);

    // Get the list of table names in the PUBLIC schema.
    DataFrame dfPublicSchemaTables = dfTables.select(Functions.col("TABLE_NAME"));
    dfPublicSchemaTables.show();
  }
}
```

This prints out the number of tables and the list of tables in the schema:

```none
Number of tables in the PUBLIC schema in the "MY_DB" database: 8
...
---------------------
|"TABLE_NAME"       |
---------------------
|A_TABLE            |
...
```

---
title: A Simple Example of Using Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/example.md
section: Snowpark
---

# A Simple Example of Using Snowpark Scala

The following example prints the count and names of tables in the current database.
Replace the `<placeholders>` with values that you use to connect to Snowflake.

```scala
import com.snowflake.snowpark._
import com.snowflake.snowpark.functions._

object Main {
  def main(args: Array[String]) {

    // Create a Session, specifying the properties used to
    // connect to the Snowflake database.
    val builder = Session.builder.configs(Map(
      "URL" -> "https://<account_identifier>.snowflakecomputing.com",
      "USER" -> "<username>",
      "PASSWORD" -> "<password>",
      "ROLE" -> "<role_name_with_access_to_public_schema>",
      "WAREHOUSE" -> "<warehouse_name>",
      "DB" -> "<database_name>",
      "SCHEMA" -> "<schema_name>"
    ))
    val session = builder.create

    // Get the number of tables in the PUBLIC schema.
    var dfTables = session.table("INFORMATION_SCHEMA.TABLES").filter(col("TABLE_SCHEMA") === "PUBLIC")
    var tableCount = dfTables.count()
    var currentDb = session.getCurrentDatabase.getOrElse("<no current database>")
    println(s"Number of tables in the PUBLIC schema in the $currentDb database: $tableCount")

    // Get the list of table names in the PUBLIC schema.
    var dfPublicSchemaTables = dfTables.select(col("TABLE_NAME"))
    dfPublicSchemaTables.show()
  }
}
```

This prints out the number of tables and the list of tables in the schema:

```none
Number of tables in the PUBLIC schema in the "MY_DB" database: 8
...
---------------------
|"TABLE_NAME"       |
---------------------
|A_TABLE            |
...
```

---
title: Analyzing queries and troubleshooting with Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/troubleshooting.md
section: Snowpark
---

# Analyzing queries and troubleshooting with Snowpark Java

This topic provides some guidelines on analyzing queries and troubleshooting problems when working with the Snowpark library.

## Viewing the execution plan for a query in Snowpark

To inspect the evaluation plan of a DataFrame, call the `explain` method of the DataFrame. This prints the SQL statements
used to evaluate the DataFrame. If there is only one SQL statement, the method also prints the logical plan for the statement.

```none
----------DATAFRAME EXECUTION PLAN----------
Query List:
0.
SELECT
  "_1" AS "col %",
  "_2" AS "col *"
FROM
  (
    SELECT
      *
    FROM
      (
        VALUES
          (1 :: int, 2 :: int),
          (3 :: int, 4 :: int) AS SN_TEMP_OBJECT_639016133("_1", "_2")
      )
  )
Logical Execution Plan:
 GlobalStats:
    partitionsTotal=0
    partitionsAssigned=0
    bytesAssigned=0
Operations:
1:0     ->Result  SN_TEMP_OBJECT_639016133.COLUMN1, SN_TEMP_OBJECT_639016133.COLUMN2
1:1          ->ValuesClause  (1, 2), (3, 4)

--------------------------------------------
```

After the execution of a DataFrame has been triggered, you can check on the progress of the query in the Query History
page in Snowsight.

In the Query Tag column, you can find the name of the function and the line number in your code that triggered this query.

## Changing the logging settings

By default, the Snowpark library logs `INFO` level messages to stdout. To change the logging settings, create a
`simplelogger.properties` file, and configure the logger properties in that file. For example, to set the log level to
`DEBUG`:

```none
# simplelogger.properties file (a text file)
# Set the default log level for the SimpleLogger to DEBUG.
org.slf4j.simpleLogger.defaultLogLevel=debug
```

Put this file in your classpath. If you are using a Maven directory layout, put the file in the `src/main/resources/`
directory.

## java.lang.OutOfMemoryError exceptions

If a `java.lang.OutOfMemoryError` exception is thrown, increase the maximum heap size for the JVM (e.g. through the
`-J-Xmxmaximum_size` flag).

## Unnamed module error on Java 17

When executing a Snowpark Java or Scala client on Java 17, you might see the following error:

```output
java.base does not "opens java.nio" to unnamed module
```

This is because Snowpark uses the [Apache Arrow connector](https://arrow.apache.org/docs/java/install.html#id3), which depends on
internal Java APIs that are not exposed by default after Java 9.

To work around this error, set the following parameter either as a command-line argument when running your application or in your
system’s environment variables.

```none
--add-opens=java.base/java.nio=ALL-UNNAMED
```

> **Note:**
>
> The Snowpark API supports the following versions of Java:
>
> * 11.x
> * 17.x

### Setting the argument when running the application

You can set this argument from the command line when running your application.

For example, when calling the `java` command, you can add `--add-opens=java.base/java.nio=ALL-UNNAMED`, as in the following:

```none
java --add-opens=java.base/java.nio=ALL-UNNAMED -jar my-snowpark-app.jar.
```

If you are also using RSA private key authentication, you will also need to allow `sun.security.util`, as in the following example:

```none
java --add-opens=java.base/java.nio=ALL-UNNAMED --add-opens=java.base/sun.security.util=ALL-UNNAMED -jar my-snowpark-app.jar
```

### Setting the parameter as an environment variable

You can set the parameter in your system’s environment variables. Refer to your operating system’s documentation for
instructions on setting environment variables.

Create or update a `JDK_JAVA_OPTIONS` environment variable, as in the following Unix-based example:

```none
export JDK_JAVA_OPTIONS="--add-opens=java.base/java.nio=ALL-UNNAMED"
```

If you are also using RSA private key authentication, you will also need to allow `sun.security.util`, as in the following example:

```none
export JDK_JAVA_OPTIONS="--add-opens=java.base/java.nio=ALL-UNNAMED --add-opens=java.base/sun.security.util=ALL-UNNAMED"
```

---
title: Analyzing Queries and Troubleshooting with Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/troubleshooting.md
section: Snowpark
---

# Analyzing Queries and Troubleshooting with Snowpark Scala

This topic provides some guidelines on analyzing queries and troubleshooting problems when working with the Snowpark library.

## Viewing the Execution Plan for a Query in Snowpark

To inspect the evaluation plan of a DataFrame, call the `explain` method of the DataFrame. This prints the SQL statements
used to evaluate the DataFrame. If there is only one SQL statement, the method also prints the logical plan for the statement.

```none
----------DATAFRAME EXECUTION PLAN----------
Query List:
0.
SELECT
  "_1" AS "col %",
  "_2" AS "col *"
FROM
  (
    SELECT
      *
    FROM
      (
        VALUES
          (1 :: int, 2 :: int),
          (3 :: int, 4 :: int) AS SN_TEMP_OBJECT_639016133("_1", "_2")
      )
  )
Logical Execution Plan:
 GlobalStats:
    partitionsTotal=0
    partitionsAssigned=0
    bytesAssigned=0
Operations:
1:0     ->Result  SN_TEMP_OBJECT_639016133.COLUMN1, SN_TEMP_OBJECT_639016133.COLUMN2
1:1          ->ValuesClause  (1, 2), (3, 4)

--------------------------------------------
```

After the execution of a DataFrame has been triggered, you can check on the progress of the query in the Query History
page in Snowsight.

In the Query Tag column, you can find the name of the function and the line number in your code that triggered this query.

## Troubleshooting

### Changing the Logging Settings

By default, the Snowpark library logs `INFO` level messages to stdout. To change the logging settings, create a
`simplelogger.properties` file, and configure the logger properties in that file. For example, to set the log level to
`DEBUG`:

```none
# simplelogger.properties file (a text file)
# Set the default log level for the SimpleLogger to DEBUG.
org.slf4j.simpleLogger.defaultLogLevel=debug
```

Put this file in your classpath. If you are using a Maven directory layout, put the file in the `src/main/resources/`
directory.

### java.lang.OutOfMemoryError Exceptions

If a `java.lang.OutOfMemoryError` exception is thrown, increase the maximum heap size for the JVM.

If you are using the Scala REPL and you need to increase the maximum heap size, edit the `run.sh` shell script (provided in
the archive file) and add the `-J-Xmxmaximum_size` flag to the `scala` command. The following example increases
the maximum heap size to 4 GB:

> ```bash
> scala -J-Xmx4G ...
> ```

---
title: Calling functions and stored procedures in Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/calling-functions.md
section: Snowpark
---

# Calling functions and stored procedures in Snowpark Java

To process data in a DataFrame, you can call system-defined SQL functions, user-defined functions, and stored procedures. This
topic explains how to call these in Snowpark.

## Calling system-defined functions

If you need to call [system-defined SQL functions](../../../sql-reference-functions.md), use the equivalent static methods in the
[Functions class](../reference/java/com/snowflake/snowpark_java/Functions.md).

The following example calls the `upper` static method in the `Functions` class (the equivalent of the system-defined
[UPPER](../../../sql-reference/functions/upper.md) function) to return the values in the name column with the letters in uppercase:

```java
DataFrame df = session.table("sample_product_data");
df.select(Functions.upper(Functions.col("name"))).show();
```

If a system-defined SQL function is not available in the `Functions` class, you can use the `Functions.callUDF`
static method to call the system-defined function.

For `callUDF`, pass the name of the system-defined function as the first argument. If you need
to pass the values of columns to the system-defined function, define and pass
[Column](working-with-dataframes.md) objects as additional arguments to the `callUDF` method.

The following example calls the system-defined function [RADIANS](../../../sql-reference/functions/radians.md), passing in the value from the
column `degrees`:

```java
// Call the system-defined function RADIANS() on degrees.
DataFrame dfDegrees = session.range(0, 360, 45).rename("degrees", Functions.col("id"));
dfDegrees.select(Functions.col("degrees"), Functions.callUDF("radians", Functions.col("degrees"))).show();
```

The `callUDF` method returns a `Column`, which you can pass to the
[DataFrame transformation methods](working-with-dataframes.md) (e.g. filter, select, etc.).

## Calling scalar user-defined functions (UDFs)

The method for calling a UDF depends on how the UDF was created:

* To call [an anonymous UDF](creating-udfs.md), call the `apply` method of the
  [UserDefinedFunction](../reference/java/com/snowflake/snowpark_java/UserDefinedFunction.md) object that was returned when you created the UDF.

  The arguments that you pass to a UDF must be [Column](working-with-dataframes.md) objects. If you
  need to pass in a literal, use `Functions.lit()`, as explained in [Using Literals as Column Objects](working-with-dataframes.md).
* To call UDFs that you [registered by name](creating-udfs.md) and UDFs that you created by executing
  [CREATE FUNCTION](../../../sql-reference/sql/create-function.md), use the `Functions.callUDF` static method.

  Pass the name of the UDF as the first argument and any UDF parameters as additional arguments.

Calling a UDF returns a `Column` object containing the return value of the UDF.

The following example calls the UDF function `doubleUdf`, passing in the value from the columns `quantity`. The
example passes the return value from `doubleUdf` to the `select` method of the DataFrame.

```java
import com.snowflake.snowpark_java.types.*;
...
// Create and register a temporary named UDF
// that takes in an integer argument and returns an integer value.
UserDefinedFunction doubleUdf =
  session
    .udf()
    .registerTemporary(
      "doubleUdf",
      (Integer x) -> x + x,
      DataTypes.IntegerType,
      DataTypes.IntegerType);
// Call the named UDF, passing in the "quantity" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleQuantity".
DataFrame df = session.table("sample_product_data");
DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", doubleUdf.apply(Functions.col("quantity")));
dfWithDoubleQuantity.show();
```

## Calling table functions (system functions and UDTFs)

To call a [table function](../../../sql-reference/functions-table.md) or a
[user-defined table function (UDTF)](../../udf/udf-overview.md):

1. Construct a [TableFunction](../reference/java/com/snowflake/snowpark_java/TableFunction.md) object, passing in the name of the table function.
2. Call the [tableFunction method of the Session object](../reference/java/com/snowflake/snowpark_java/Session.md), passing in the `TableFunction` object and a `Map` of input
   argument names and values.

`table?Function` returns a DataFrame that contains the output of the table function.

For example, suppose that you executed the following command to create a SQL UDTF:

```sqlexample
CREATE OR REPLACE FUNCTION product_by_category_id(cat_id INT)
  RETURNS TABLE(id INT, name VARCHAR)
  AS
  $$
    SELECT id, name
      FROM sample_product_data
      WHERE category_id = cat_id
  $$
  ;
```

The following code calls this UDTF and creates a DataFrame for the output of the UDTF. The example prints the first 10 rows of
output to the console.

```java
import java.util.HashMap;
import java.util.Map;
...

Map<String, Column> arguments = new HashMap<>();
arguments.put("cat_id", Functions.lit(10));
DataFrame dfTableFunctionOutput = session.tableFunction(new TableFunction("product_by_category_id"), arguments);
dfTableFunctionOutput.show();
```

If you need to join the output of a table function with a DataFrame, call the [join method that passes in a TableFunction](../reference/java/com/snowflake/snowpark_java/DataFrame.md).

## Calling stored procedures

You can execute a procedure either on the server side (in the Snowflake environment) or locally. Keep in mind that as the two environments
are different, the conditions and results of procedure execution may differ between them.

You can call a procedure with the Snowpark API in either of the following ways:

* Execute a function locally for testing and debugging using the [SProcRegistration.runLocally](../reference/java/com/snowflake/snowpark_java/SProcRegistration.md) method.
* Execute a procedure in the server-side Snowflake environment using one of the [Session.storedProcedure](../reference/java/com/snowflake/snowpark_java/Session.md) methods. This includes a procedure
  scoped to the current session or a permanent procedure stored on Snowflake.

You can also call a permanent stored procedure you create with the Snowpark API from SQL code. For more information, refer
to [Calling a stored procedure](../../stored-procedure/stored-procedures-calling.md).

For more on creating procedures with the Snowpark API, refer to [Creating stored procedures for DataFrames in Java](creating-sprocs.md).

### Executing a procedure’s logic locally

You can execute the lambda function for your procedure in your local environment using the `SProcRegistration.runLocally` method.
The method executes the function and returns its result as the type returned by the function.

For example, you can call a lambda function that you intend to use in a procedure before registering a procedure from it on Snowflake. You
begin by assigning the lambda code as a value to a variable whose type is one of the `com.snowflake.snowpark_java.sproc.JavaSProc`
interfaces. Using that variable, you can test call the function with the `SProcRegistration.runLocally` method. You can also use
the variable to represent the function when registering the procedure.

Code in the following example initializes a `JavaSProc` variable from the lambda function that will be the procedure’s logic. It then
tests the function by passing the variable to the `SProcRegistration.runLocally` method with the function’s argument. The variable
is also used to register the function.

```java
Session session = Session.builder().configFile("my_config.properties").create();

// Assign the lambda function to a variable.
JavaSProc1<Integer, Integer> func =
  (Session session, Integer num) -> num + 1;

// Execute the function locally.
int result = (Integer)session.sproc().runLocally(func, 1);
System.out.println("\nResult: " + result);

// Register the procedure.
StoredProcedure sp =
  session.sproc().registerTemporary(
    func,
    DataTypes.IntegerType,
    DataTypes.IntegerType
  );

// Execute the procedure on the server.
session.storedProcedure(sp, 1).show();
```

### Executing a procedure on the server

To execute a procedure in the Snowflake environment on the server, use the `Session.storedProcedure` method. The method returns a
`DataFrame` object.

For example, you can execute:

* A temporary or permanent procedure you [create using the Snowpark API](creating-sprocs.md).
* A procedure [created using a CREATE PROCEDURE statement](../../stored-procedure/stored-procedures-creating.md).

Code in the following example creates a temporary procedure designed to execute on the server, but only last for as long as the current
Snowpark session. It then executes the procedure using both the procedure’s name and the [com.snowflake.snowpark_java.StoredProcedure](../reference/java/com/snowflake/snowpark_java/StoredProcedure.md)
variable representing it.

```java
Session session = Session.builder().configFile("my_config.properties").create();

String incrementProc = "increment";

// Register the procedure.
StoredProcedure tempSP =
  session.sproc().registerTemporary(
    incrementProc,
    (Session session, Integer num) -> num + 1,
    DataTypes.IntegerType,
    DataTypes.IntegerType
  );

// Execute the procedure on the server by passing the procedure's name.
session.storedProcedure(incrementProc, 1).show();

// Execute the procedure on the server by passing a variable
// representing the procedure.
session.storedProcedure(tempSP, 1).show();
```

---
title: Calling Functions and Stored Procedures in Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/calling-functions.md
section: Snowpark
---

# Calling Functions and Stored Procedures in Snowpark Python

To process data in a DataFrame, you can call system-defined SQL functions, user-defined functions, and stored procedures. This
topic explains how to call these in Snowpark.

To process data in a DataFrame, you can call system-defined SQL functions, user-defined functions,
and stored procedures.

## Calling System-Defined Functions

If you need to call [system-defined SQL functions](../../../sql-reference-functions.md), use the equivalent functions in the
`snowflake.snowpark.functions` module.

The following example calls the `upper` function in the `functions` module (the equivalent of the system-defined
[UPPER](../../../sql-reference/functions/upper.md) function) to return the values in the name column of the
[sample_product_data](working-with-dataframes.md) table with the letters in uppercase:

```python
# Import the upper function from the functions module.
from snowflake.snowpark.functions import upper, col
session.table("sample_product_data").select(upper(col("name")).alias("upper_name")).collect()
```

```output
[Row(UPPER_NAME='PRODUCT 1'), Row(UPPER_NAME='PRODUCT 1A'), Row(UPPER_NAME='PRODUCT 1B'), Row(UPPER_NAME='PRODUCT 2'),
Row(UPPER_NAME='PRODUCT 2A'), Row(UPPER_NAME='PRODUCT 2B'), Row(UPPER_NAME='PRODUCT 3'), Row(UPPER_NAME='PRODUCT 3A'),
Row(UPPER_NAME='PRODUCT 3B'), Row(UPPER_NAME='PRODUCT 4'), Row(UPPER_NAME='PRODUCT 4A'), Row(UPPER_NAME='PRODUCT 4B')]
```

If a system-defined SQL function is not available in the functions module, you can use one of the following approaches:

* Use the `call_function` function to call the system-defined function.
* Use the `function` function to create a function object that you can use to call the system-defined function.

`call_function` and `function` are defined in the `snowflake.snowpark.functions` module.

For `call_function`, pass the name of the system-defined function as the first argument. If you need
to pass the values of columns to the system-defined function, define and pass
[Column](working-with-dataframes.md) objects as additional arguments to the `call_function` function.

The following example calls the system-defined function [RADIANS](../../../sql-reference/functions/radians.md), passing in the value from the
column `col1`:

```python
# Import the call_function function from the functions module.
from snowflake.snowpark.functions import call_function
df = session.create_dataframe([[1, 2], [3, 4]], schema=["col1", "col2"])
# Call the system-defined function RADIANS() on col1.
df.select(call_function("radians", col("col1"))).collect()
```

```output
[Row(RADIANS("COL1")=0.017453292519943295), Row(RADIANS("COL1")=0.05235987755982988)]
```

The `call_function` function returns a `Column`, which you can pass to the
[DataFrame transformation methods](working-with-dataframes.md) (e.g. `filter`, `select`, etc.).

For `function`, pass the name of the system-defined function, and use the returned function object to call the
system-defined function. For example:

```python
# Import the call_function function from the functions module.
from snowflake.snowpark.functions import function

# Create a function object for the system-defined function RADIANS().
radians = function("radians")
df = session.create_dataframe([[1, 2], [3, 4]], schema=["col1", "col2"])
# Call the system-defined function RADIANS() on col1.
df.select(radians(col("col1"))).collect()
```

```output
[Row(RADIANS("COL1")=0.017453292519943295), Row(RADIANS("COL1")=0.05235987755982988)]
```

## Calling User-Defined Functions (UDFs)

To call UDFs that you [registered by name](creating-udfs.md) and UDFs that you created by executing CREATE
FUNCTION, use the `call_udf` function in the `snowflake.snowpark.functions` module. Pass the name of the UDF as the
first argument and any UDF parameters as additional arguments.

The following example calls the UDF function `minus_one`, passing in the values from the columns `col1` and `col2`. The
example passes the return value from `minus_one` to the `select` method of the DataFrame.

```python
# Import the call_udf function from the functions module.
from snowflake.snowpark.functions import call_udf

# Runs the scalar function 'minus_one' on col1 of df.
df = session.create_dataframe([[1, 2], [3, 4]], schema=["col1", "col2"])
df.select(call_udf("minus_one", col("col1"))).collect()
```

```output
[Row(MINUS_ONE("COL1")=0), Row(MINUS_ONE("COL1")=2)]
```

## Calling User-Defined Table Functions (UDTFs)

To call UDTFs that you registered by name and UDTFs that you created by executing CREATE FUNCTION, use one of the functions listed below.
Both return a `DataFrame` representing a lazily-evaluated relational dataset.

Note that you can use these to also call other table functions, including the
[system-defined table functions](../../../sql-reference/functions-table.md).

For more information on registering a UDTF, see [Registering a UDTF](creating-udtfs.md).

* To call the UDTF without specifying a lateral join, call the `table_function` function in the `snowflake.snowpark.Session` class.

  For the function reference and examples, see
  [Session.table_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.Session.table_function).

  Code in the following example uses `table_function` to call the `generator_udtf` function registered with the `udtf`
  function.

  ```python
  from snowflake.snowpark.types import IntegerType, StructField, StructType
  from snowflake.snowpark.functions import udtf, lit
  class GeneratorUDTF:
      def process(self, n):
          for i in range(n):
              yield (i, )
  generator_udtf = udtf(GeneratorUDTF, output_schema=StructType([StructField("number", IntegerType())]), input_types=[IntegerType()])
  session.table_function(generator_udtf(lit(3))).collect()
  ```

  ```output
  [Row(NUMBER=0), Row(NUMBER=1), Row(NUMBER=2)]
  ```
* To make a call to the UDTF in which your call specifies a lateral join, use the `join_table_function` function in the
  `snowflake.snowpark.DataFrame` class.

  When you lateral join a UDTF, you can specify the PARTITION BY and ORDER BY clauses.

  For the function reference and examples, see
  [DataFrame.join_table_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.DataFrame.join_table_function.html#snowflake.snowpark.DataFrame.join_table_function).

  Code in the following example performs a lateral join, specifying the `partition_by` and `order_by` parameters. Code in this
  example first calls the `snowflake.snowpark.functions.table_function` function to create a function object representing the system-defined
  `SPLIT_TO_TABLE` function. It is this function object that `join_table_function` then calls.

  For the `snowflake.snowpark.functions.table_function` function reference, see
  [table_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.functions.table_function.html#snowflake.snowpark.functions.table_function).
  For the `SPLIT_TO_TABLE` function reference, see [SPLIT_TO_TABLE](../../../sql-reference/functions/split_to_table.md).

  ```python
  from snowflake.snowpark.functions import table_function
  split_to_table = table_function("split_to_table")
  df = session.create_dataframe([
    ["John", "James", "address1 address2 address3"],
    ["Mike", "James", "address4 address5 address6"],
    ["Cathy", "Stone", "address4 address5 address6"],
  ],
  schema=["first_name", "last_name", "addresses"])
  df.join_table_function(split_to_table(df["addresses"], lit(" ")).over(partition_by="last_name", order_by="first_name")).show()
  ```

  ```output
  ----------------------------------------------------------------------------------------
  |"FIRST_NAME"  |"LAST_NAME"  |"ADDRESSES"                 |"SEQ"  |"INDEX"  |"VALUE"   |
  ----------------------------------------------------------------------------------------
  |John          |James        |address1 address2 address3  |1      |1        |address1  |
  |John          |James        |address1 address2 address3  |1      |2        |address2  |
  |John          |James        |address1 address2 address3  |1      |3        |address3  |
  |Mike          |James        |address4 address5 address6  |2      |1        |address4  |
  |Mike          |James        |address4 address5 address6  |2      |2        |address5  |
  |Mike          |James        |address4 address5 address6  |2      |3        |address6  |
  |Cathy         |Stone        |address4 address5 address6  |3      |1        |address4  |
  |Cathy         |Stone        |address4 address5 address6  |3      |2        |address5  |
  |Cathy         |Stone        |address4 address5 address6  |3      |3        |address6  |
  ----------------------------------------------------------------------------------------
  ```

## Calling Stored Procedures

To call a stored procedure, use the call method of the `Session` class.

```python
session.call("your_proc_name", 1)
```

```output
0
```

---
title: Calling functions and stored procedures in Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/calling-functions.md
section: Snowpark
---

# Calling functions and stored procedures in Snowpark Scala

To process data in a DataFrame, you can call system-defined SQL functions, user-defined functions, and stored procedures. This
topic explains how to call these in Snowpark.

## Calling system-defined functions

If you need to call [system-defined SQL functions](../../../sql-reference-functions.md), use the equivalent functions in the
[com.snowflake.snowpark.functions object](../reference/scala/com/snowflake/snowpark/functions$.md).

The following example calls the `upper` function in the `functions` object (the equivalent of the system-defined
[UPPER](../../../sql-reference/functions/upper.md) function) to return the values in the name column with the letters in uppercase:

```scala
// Import the upper function from the functions object.
import com.snowflake.snowpark.functions._
...
session.table("products").select(upper(col("name"))).show()
```

If a system-defined SQL function is not available in the functions object, you can use one of the following approaches:

* Use the `callBuiltin` function to call the system-defined function.
* Use the `builtin` function to create a function object that you can use to call the system-defined function.

`callBuiltin` and `builtin` are defined in the `com.snowflake.snowpark.functions` object.

For `callBuiltin`, pass the name of the system-defined function as the first argument. If you need
to pass the values of columns to the system-defined function, define and pass
[Column](working-with-dataframes.md) objects as additional arguments to the `callBuiltin` function.

The following example calls the system-defined function [RADIANS](../../../sql-reference/functions/radians.md), passing in the value from the
column `col1`:

```scala
// Import the callBuiltin function from the functions object.
import com.snowflake.snowpark.functions._
...
// Call the system-defined function RADIANS() on col1.
val result = df.select(callBuiltin("radians", col("col1"))).collect()
```

The `callBuiltin` function returns a `Column`, which you can pass to the
[DataFrame transformation methods](working-with-dataframes.md) (e.g. filter, select, etc.).

For `builtin`, pass the name of the system-defined function, and use the returned function object to call the
system-defined function. For example:

```scala
// Import the callBuiltin function from the functions object.
import com.snowflake.snowpark.functions._
...
// Create a function object for the system-defined function RADIANS().
val radians = builtin("radians")
// Call the system-defined function RADIANS() on col1.
val result = df.select(radians(col("col1"))).collect()
```

## Calling scalar user-defined functions (UDFs)

The method for calling a UDF depends on how the UDF was created:

* To call [an anonymous UDF](creating-udfs.md), call the `apply` method of the
  [UserDefinedFunction](../reference/scala/com/snowflake/snowpark/UserDefinedFunction.md) object that was returned when you created the UDF.

  The arguments that you pass to a UDF must be [Column](working-with-dataframes.md) objects. If you need
  to pass in a literal, use `lit()`, as explained in [Using Literals as Column Objects](working-with-dataframes.md).
* To call UDFs that you [registered by name](creating-udfs.md) and UDFs that you created by executing
  [CREATE FUNCTION](../../../sql-reference/sql/create-function.md), use the `callUDF` function in the `com.snowflake.snowpark.functions`
  object.

  Pass the name of the UDF as the first argument and any UDF parameters as additional arguments.

Calling a UDF returns a `Column` object containing the return value of the UDF.

The following example calls the UDF function `myFunction`, passing in the values from the columns `col1` and `col2`. The
example passes the return value from `myFunction` to the `select` method of the DataFrame.

```scala
// Import the callUDF function from the functions object.
import com.snowflake.snowpark.functions._
...
// Runs the scalar function 'myFunction' on col1 and col2 of df.
val result =
    df.select(
        callUDF("myDB.schema.myFunction", col("col1"), col("col2"))
    ).collect()
```

## Calling table functions (system functions and UDTFs)

To call a [table function](../../../sql-reference/functions-table.md) or a
[user-defined table function (UDTF)](../../udf/udf-overview.md):

1. Construct a [TableFunction](../reference/scala/com/snowflake/snowpark/TableFunction.md) object, passing in the name of the table function.

   If you are creating a UDTF in Snowpark, you can just use the `TableFunction` object returned by the
   `UDTFRegistration.registerTemporary` or `UDTFRegistration.registerPermanent` method. See
   [Creating User-Defined Table Functions (UDTFs)](creating-udfs.md).
2. Call [session.tableFunction](../reference/scala/com/snowflake/snowpark/Session.md), passing in the `TableFunction` object and a `Map` of input argument names and
   values.

`table?Function` returns a DataFrame that contains the output of the table function.

For example, suppose that you executed the following command to create a SQL UDTF:

```sqlexample
CREATE OR REPLACE FUNCTION product_by_category_id(cat_id INT)
  RETURNS TABLE(id INT, name VARCHAR)
  AS
  $$
    SELECT id, name
      FROM sample_product_data
      WHERE category_id = cat_id
  $$
  ;
```

The following code calls this UDTF and creates a DataFrame for the output of the UDTF. The example prints the first 10 rows of
output to the console.

```scala
val dfTableFunctionOutput = session.tableFunction(TableFunction("product_by_category_id"), Map("cat_id" -> lit(10)))
dfTableFunctionOutput.show()
```

If you need to join the output of a table function with a DataFrame, call the
[DataFrame.join method that passes in a TableFunction](../reference/scala/com/snowflake/snowpark/DataFrame.md).

## Calling stored procedures

You can execute a procedure either on the server side (in the Snowflake environment) or locally. Keep in mind that as the two environments
are different, the conditions and results of procedure execution may differ between them.

You can call a procedure with the Snowpark API in either of the following ways:

* Execute a function locally for testing and debugging using the `SProcRegistration.runLocally` method.
* Execute a procedure in the server-side Snowflake environment using the `Session.storedProcedure` method. This includes a procedure
  scoped to the current session or a permanent procedure stored on Snowflake.

You can also call a permanent stored procedure you create with the Snowpark API from a Snowflake worksheet. For more information, refer
to [Calling a stored procedure](../../stored-procedure/stored-procedures-calling.md).

For more on creating procedures with the Snowpark API, refer to [Creating stored procedures for DataFrames in Scala](creating-sprocs.md).

### Executing a procedure’s logic locally

You can execute the lambda function for your procedure in your local environment using the `SProcRegistration.runLocally` method.
The method executes the function and returns its result as the type returned by the function.

For example, you can locally call (on the client side) a lambda function that you intend to use in a procedure before registering a
procedure from it on Snowflake. You begin by assigning the lambda code as a value to a variable. You pass that variable to the
`SProcRegistration.runLocally` method to run it on the client side. You can also use the variable to represent the function when
registering the procedure.

Code in the following example assigns the function to the `func` variable. It then tests the function locally by passing the
variable to the `SProcRegistration.runLocally` method with the function’s argument value. The variable is also used to register the
procedure.

```scala
val session = Session.builder.configFile("my_config.properties").create

// Assign the lambda function.
val func = (session: Session, num: Int) => num + 1

// Execute the function locally.
val result = session.sproc.runLocally(func, 1)
print("\nResult: " + result)
```

### Executing a procedure on the server

To execute a procedure in the Snowflake environment on the server, use the `Session.storedProcedure` method. The method returns a
`DataFrame` object.

For example, you can execute:

* A temporary or permanent procedure you [create using the Snowpark API](../java/creating-sprocs.md).
* A procedure [created using a CREATE PROCEDURE statement](../../stored-procedure/stored-procedures-creating.md).

Code in the following example creates a temporary procedure designed to execute on the server, but only last for as long as the current
Snowpark session. It then executes the procedure using both the procedure’s name and the `StoredProcedure` variable representing it.

```scala
val session = Session.builder.configFile("my_config.properties").create

val name: String = "add_two"

val tempSP: StoredProcedure =
  session.sproc.registerTemporary(
    name,
    (session: Session, num: Int) => num + 2
  )

session.storedProcedure(name, 1).show()

// Execute the procedure on the server by passing the procedure's name.
session.storedProcedure(incrementProc, 1).show();

// Execute the procedure on the server by passing a variable
// representing the procedure.
session.storedProcedure(tempSP, 1).show();
```

---
title: Checkpoints in Databricks
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-databricks.md
section: Snowpark
---

# Checkpoints in Databricks

Snowpark Checkpoints writes files about collected results and reads these same files to validate DataFrames. Some of these files are generated using PySpark; others using Python packages such as `os` or `glob`. This type of file handling behavior can lead to inconsistencies in a Databricks environment, where the file system differs from traditional environments. Therefore, you must adapt the package to ensure correct file reading and writing.

The following section demonstrates how to configure Snowpark Checkpoints to work seamlessly in a Databricks environment, thus enabling efficient DataFrame validation.

## Prerequisites

Before using Snowpark Checkpoints in Databricks, ensure that your environment meets the following requirements:

* `PySpark:` Version 3.5.0 or higher.
* `Python:` Version 3.9, 3.10 and 3.11

The Databricks Runtime versions that satisfy these requirements are:

* `Databricks Runtime 14.3 LTS`
* `Databricks Runtime 15.4 LTS`

## Input/output (I/O) strategies

To ensure that Snowpark Checkpoints works correctly across various environments, you can use the interface `EnvStrategy` and its implementation classes for file read and write operations. This allows I/O operations to be adaptable and customizable.

* With Snowpark Checkpoints, you can implement your own custom input/output methods by creating a class that implements the `EnvStrategy` interface. You can then tailor operations to your specific execution environment and expectations.
* Internally, the package uses a default class (`IODefaultStrategy`) that implements the `EnvStrategy` interface and provides a basic implementation of I/O operations. You can replace this default strategy with a custom implementation suited to your environment’s specific needs.

> **Important:**
>
> Each Snowpark Checkpoints package (`snowpark-checkpoints-collectors`, `snowpark-checkpoints-validators`, `snowpark-checkpoints-hypothesis`) includes its own copy of the file handling classes. Therefore, any changes to file configurations must be applied to each package separately. Be sure to import the configuration from the package you are using.

## I/O functions

These file read and write methods can be customized:

* `mkdir`: Creates a folder.
* `folder_exists`: Checks whether a folder exists.
* `file_exists`: Checks whether a file exists.
* `write`: Writes content to a file.
* `read`: Reads content from a file.
* `read_bytes`: Reads binary content from a file.
* `ls`: Lists the contents of a directory.
* `getcwd`: Gets the current working directory.
* `remove_dir`: Removes a directory and its contents. This function is exclusively used in the `snowpark-checkpoints-collectors` module.
* `telemetry_path_files`: Gets the path to the telemetry files.

## Databricks strategy

The Databricks strategy is a configuration that knows how to work with DBFS file paths. It uses the `normalize_dbfs_path` function to ensure that all paths begin with `/dbfs/`.

## How to use it

To use the Databricks strategy, you must explicitly configure it in the code. Here’s how:

1. Import the necessary classes:

   ```python
   from typing import Optional, BinaryIO
   from pathlib import Path
   from snowflake.snowpark_checkpoints_collector.io_utils import EnvStrategy, IODefaultStrategy
   from snowflake.snowpark_checkpoints_collector.io_utils.io_file_manager import get_io_file_manager
   ```
2. Define the Databricks strategy:

   ```python
   class IODatabricksStrategy(EnvStrategy):

     def __init__(self):
         self.default_strategy = IODefaultStrategy()

     def mkdir(self, path: str, exist_ok: bool = False) -> None:
         path = normalize_dbfs_path(path)
         self.default_strategy.mkdir(path, exist_ok=exist_ok)

     def folder_exists(self, path: str) -> bool:
         path = normalize_dbfs_path(path)
         return self.default_strategy.folder_exists(path)

     def file_exists(self, path: str) -> bool:
         path = normalize_dbfs_path(path)
         return self.default_strategy.file_exists(path)

     def write(self, file_path: str, file_content: str, overwrite: bool = True) -> None:
         file_path = normalize_dbfs_path(file_path)
         self.default_strategy.write(file_path, file_content, overwrite=overwrite)

     def read(
         self, file_path: str, mode: str = "r", encoding: Optional[str] = None
     ) -> str:
         file_path = normalize_dbfs_path(file_path)
         return self.default_strategy.read(file_path, mode=mode, encoding=encoding)

     def read_bytes(self, file_path: str) -> bytes:
         file_path = normalize_dbfs_path(file_path)
         return self.default_strategy.read_bytes(file_path)

     def ls(self, path: str, recursive: bool = False) -> list[str]:
         file_path = normalize_dbfs_path(path)
         list_of_files = self.default_strategy.ls(file_path, recursive=recursive)
         return [content.replace("/dbfs","") for content in list_of_files]

     def getcwd(self) -> str:
         try:
             parent_folder = "/snowpark_checkpoints"
             self.mkdir(parent_folder, exist_ok=True)
             return parent_folder
         except Exception:
             return ""

     def remove_dir(self, path:str) -> None:
         path = normalize_dbfs_path(path)
         self.default_strategy.remove_dir(path)

     def telemetry_path_files(self, path:str) -> Path:
         path = normalize_dbfs_path(path)
         return self.default_strategy.telemetry_path_files(path)

   def normalize_dbfs_path(path: str) -> str:
       if isinstance(path, Path):
           path = str(path)
       if not path.startswith("/"):
           path = "/" + path
       if not path.startswith("/dbfs/"):
           path = f'/dbfs{path}'
       return path
   ```
3. Configure the Databricks strategy:

   ```python
   get_io_file_manager().set_strategy(IODatabricksStrategy())
   ```

Executing this code at the start of your Databricks script or notebook configures Snowpark Checkpoints to use the defined I/O strategy for correct file handling in DBFS.

## Optional customization

For more specialized input/output operations, a custom strategy can be designed and implemented. This approach offers complete control and flexibility over the I/O behavior. It allows developers to tailor the strategy precisely to their specific requirements and constraints, potentially optimizing performance, resource utilization, or other relevant factors.

> **Important:**
>
> When using custom strategies, it is your responsibility to ensure that I/O operations function correctly.

---
title: Configure the Snowpark Migration Accelerator (SMA)
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-setup-sma.md
section: Snowpark
---

# Configure the Snowpark Migration Accelerator (SMA)

To configure the SMA for Checkpoints generation, see [Snowpark Migration Accelerator Documentation](https://docs.snowconvert.com/sma).

---
title: Creating a Session for Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/creating-session.md
section: Snowpark
---

# Creating a Session for Snowpark Java

To use Snowpark in your application, you need to create a session. For convenience in writing code, you can also import the names
of packages and objects.

## Importing Names from Packages for Snowpark

The Snowpark API provides a number of classes in different packages. For convenience, you can import these packages to avoid
having to use qualified names for classes.

For example:

* The [com.snowflake.snowpark_java package](../reference/java/com/snowflake/snowpark_java/package-summary.md) contains the main classes for the Snowpark API. To import the names in this package:

  ```java
  import com.snowflake.snowpark_java.*;
  ```
* The [com.snowflake.snowpark_java.types package](../reference/java/com/snowflake/snowpark_java/types/package-summary.md) defines classes that you can use to define schemas for semi-structured data.

  ```java
  import com.snowflake.snowpark_java.types.*;
  ```

## Creating a Session for Snowpark

The first step in using the library is establishing a session with the Snowflake database. To create a session, use the methods in
the `SessionBuilder` class. You can access a `SessionBuilder` object by calling the static `builder` method in
the `Session` class:

```java
import com.snowflake.snowpark_java.*;

...
// Get a SessionBuilder object.
SessionBuilder builder = Session.builder();
```

To provide the details to establish a session with a Snowflake database (for example, the account identifier, user name, etc.),
either create a properties file (a text file) or programmatically build a `Map` containing the properties.

In the properties file or `Map`, set the following properties:

* `URL`: Set this to the URL for your account in the form `https://account_identifier.snowflakecomputing.com`.

  See [Account identifiers](../../../user-guide/admin-account-identifier.md).

  If the account identifier contains underscores (`_`), replace those underscores with hyphens (`-`).
* Any additional JDBC parameters (see [JDBC Driver connection parameter reference](../../jdbc/jdbc-parameters.md) in the JDBC driver documentation) needed to connect
  to Snowflake (e.g. `USER`, `ROLE`, `WAREHOUSE`, `DB`, `SCHEMA`, etc.).

  > **Note:**
  >
  > To change the logging level (e.g. from `INFO` to `DEBUG`), see [Changing the logging settings](troubleshooting.md).
* (Optional) `snowpark_request_timeout_in_seconds`: Set this to the maximum number of seconds that the Snowpark library
  should wait in the following cases:

  + Waiting for [dependencies to be uploaded to a stage](creating-udfs.md).
  + Waiting for an [asynchronous action](working-with-dataframes.md) to complete.

  The default value of this property is 86400 seconds (1 day).

  > **Note:**
  >
  > This property was introduced in Snowpark 0.10.0.

To authenticate, you can use the same mechanisms that the JDBC Driver supports. For example, you can use:

* password-based authentication (by setting the `PASSWORD` property)
* [key-pair authentication](../../jdbc/jdbc-configure.md)
* [single sign-on (SSO)](../../jdbc/jdbc-configure.md)

For key-pair authentication, you can either:

* Set the `PRIVATE_KEY_FILE` property to the path to the private key file.

  If the private key is encrypted, set the `PRIVATE_KEY_FILE_PWD` property to the passphrase for decrypting the key.
* Set the `PRIVATEKEY` property to the string value of the unencrypted private key from the private key file.
  (If the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY` property.)

To create the session:

1. Set the properties in the `SessionBuilder` object.

   * If you created a properties file, pass the path to the properties file to the `configFile` method of the
     `SessionBuilder` object.
   * If you programmatically built a `Map` of the properties, pass the `Map` to the `configs` method of the
     `SessionBuilder` object.

   Both methods return a `SessionBuilder` object that has these properties.
2. Call the `create` method of the `SessionBuilder` object to establish the session.

The following is an example of a properties file that sets the basic parameters for connecting to a Snowflake database. The
example is set up to use key-pair authentication. Set `PRIVATE_KEY_FILE` to the path to the private key file. In addition,
if the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key:

```none
# profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATE_KEY_FILE = </path/to/private_key_file.p8>
PRIVATE_KEY_FILE_PWD = <if the private key is encrypted, set this to the passphrase for decrypting the key>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

As an alternative, you can set the `PRIVATEKEY` property to the unencrypted private key from the private key file.

```none
# profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATEKEY = <unencrypted_private_key_from_the_private_key_file>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

The following example uses this properties file to create a new session:

```java
// Create a new session, using the connection properties
// specified in a file.
Session session = Session.builder().configFile("/path/to/properties/file").create();
```

The following example uses a Map to set the properties:

```java
import com.snowflake.snowpark_java.*;
import java.util.HashMap;
import java.util.Map;
...
// Create a new session, using the connection properties
// specified in a Map.
// Replace the <placeholders> below.
Map<String, String> properties = new HashMap<>();
properties.put("URL", "https://<account_identifier>.snowflakecomputing.com:443");
properties.put("USER", "<user name>");
properties.put("PRIVATE_KEY_FILE", "</path/to/private_key_file.p8>");
properties.put("PRIVATE_KEY_FILE_PWD", "<if the private key is encrypted, set this to the passphrase for decrypting the key>");
properties.put("ROLE", "<role name>");
properties.put("WAREHOUSE", "<warehouse name>");
properties.put("DB", "<database name>");
properties.put("SCHEMA", "<schema name>");
Session session = Session.builder().configs(properties).create();
```

## Closing a Session

If you no longer need to use a session for executing queries and you want to cancel any queries that are currently running, call
`close` method of the `Session` object. For example:

```java
// Close the session, cancelling any queries that are currently running, and
// preventing the use of this Session object for performing any subsequent queries.
session.close();
```

---
title: Creating a Session for Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-session.md
section: Snowpark
---

# Creating a Session for Snowpark Python

To use Snowpark in your application, you need to create a session. For convenience in writing code, you can also import the names
of packages and objects.

## Creating a Session

The first step in using the library is establishing a session with the Snowflake database.

Import the Session class.

```python
from snowflake.snowpark import Session
```

To authenticate, you use the same mechanisms that the [Snowflake Connector for Python](../../python-connector/python-connector-example.md) supports.

Establish a session with a Snowflake database using the same parameters (for example, the account name, user name, etc.) that you use in the `connect` function in the Snowflake
Connector for Python. For more information, see the [parameters for the connect function](../../python-connector/python-connector-api.md) in the Python Connector API documentation.

## Connect by using the `connections.toml` file

To add credentials in a connections configuration file:

1. In a text editor, open the `connections.toml` file for editing. For example, to open the file in the Linux **vi** editor:

   ```bash
   vi connections.toml
   ```
2. Add a new Snowflake connection definition.

   You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
   [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../../user-guide/gen-conn-config.md).

   For example, to add a Snowflake connection called `myconnection` with the account `myaccount`,
   user `johndoe`, and password credentials, as well as database information,
   add the following lines to the configuration file:

   ```bash
   [myconnection]
   account = "myaccount"
   user = "jdoe"
   password = "******"
   warehouse = "my-wh"
   database = "my_db"
   schema = "my_schema"
   ```

   Connection definitions support the same configuration options available in the
   [snowflake.connector.connect](../../python-connector/python-connector-api.md) method.
3. Optional: Add more connections, as shown:

   ```bash
   [myconnection_test]
   account = "myaccount"
   user = "jdoe-test"
   password = "******"
   warehouse = "my-test_wh"
   database = "my_test_db"
   schema = "my_schema"
   ```
4. Save changes to the file.
5. In your Python code, supply connection name to `snowflake.connector.connect` and then add it to `session`, similar to the following:

   ```python
   session = Session.builder.config("connection_name", "myconnection").create()
   ```

For more information, see [configuration file](../../python-connector/python-connector-connect.md).

## Connect by specifying connection parameters

Construct a dictionary (`dict`) containing the names and values of these parameters
(e.g. `account`, `user`, `role`, `warehouse`, `database`, `schema`, etc.).

To create the session:

1. Create a Python dictionary (`dict`) containing the names and values of the parameters for connecting to Snowflake.
2. Pass this dictionary to the `Session.builder.configs` method to return a builder object that has these connection parameters.
3. Call the `create` method of the `builder` to establish the session.

The following example uses a `dict` containing connection parameters to create a new session:

```python
connection_parameters = {
  "account": "<your snowflake account>",
  "user": "<your snowflake user>",
  "password": "<your snowflake password>",
  "role": "<your snowflake role>",  # optional
  "warehouse": "<your snowflake warehouse>",  # optional
  "database": "<your snowflake database>",  # optional
  "schema": "<your snowflake schema>",  # optional
}

new_session = Session.builder.configs(connection_parameters).create()
```

For the `account` parameter, use your [account identifier](../../../user-guide/admin-account-identifier.md).
Note that the account identifier does not include the snowflakecomputing.com suffix.

> **Note:**
>
> This example shows you one way to create a session but there are several other ways that you can connect, including:
> the default authenticator, single sign-on (SSO), multi-factor authentication (MFA), key pair authentication,
> using a proxy server, and OAuth. For more information, see [Connecting to Snowflake with the Python Connector](../../python-connector/python-connector-connect.md).

## Using single sign-on (SSO) through a web browser

If you have [configured Snowflake to use single sign-on (SSO)](../../../user-guide/admin-security-fed-auth-overview.md), you can configure
your client application to use browser-based SSO for authentication.

Construct a dictionary (`dict`) containing the names and values of these parameters
(e.g. `account`, `user`, `role`, `warehouse`, `database`, `authenticator`, etc.).

To create the session:

1. Create a Python dictionary (`dict`) containing the names and values of the parameters for connecting to Snowflake.
2. Pass this dictionary to the `Session.builder.configs` method to return a builder object that has these connection parameters.
3. Call the `create` method of the `builder` to establish the session.

The following example uses a `dict` containing connection parameters to create a new session. Set the `authenticator` option to `externalbrowser`.

```python
from snowflake.snowpark import Session
connection_parameters = {
  "account": "<your snowflake account>",
  "user": "<your snowflake user>",
  "role":"<your snowflake role>",
  "database":"<your snowflake database>",
  "schema":"<your snowflake schema",
  "warehouse":"<your snowflake warehouse>",
  "authenticator":"externalbrowser"
}
session = Session.builder.configs(connection_parameters).create()
```

## Closing a Session

If you no longer need to use a session for executing queries and you want to
cancel any queries that are currently running, call the close method of the Session object.
For example:

```python
new_session.close()
```

---
title: Creating a Session for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-session.md
section: Snowpark
---

# Creating a Session for Snowpark Scala

To use Snowpark in your application, you need to create a session. For convenience in writing code, you can also import the names
of packages and objects.

## Importing Names from Packages and Objects for Snowpark

The Snowpark API provides a number of classes, objects, and functions that are available in different packages. For convenience,
you can import the class, object, and function names from packages and objects to avoid having to use qualified names.

For example:

* The [com.snowflake.snowpark package](../reference/scala/com/snowflake/snowpark/index.md) contains the main classes for the Snowpark API. To import the names in this package, use:

  ```scala
  import com.snowflake.snowpark._
  ```
* The [com.snowflake.snowpark.functions object](../reference/scala/com/snowflake/snowpark/functions$.md) defines utility functions (including
  [system-defined functions](../../../sql-reference-functions.md)). To import the function names from this object, use:

  ```scala
  import com.snowflake.snowpark.functions._
  ```
* The [com.snowflake.snowpark.types package](../reference/scala/com/snowflake/snowpark/types/index.md) defines classes and objects that you can use to define schemas for semi-structured
  data.

  ```scala
  import com.snowflake.snowpark.types._
  ```

> **Note:**
>
> If you used the `run.sh` script to [start the Scala REPL](quickstart-scala-repl.md), the script already imports names
> `com.snowflake.snowpark` and `com.snowflake.snowpark.functions`.

## Creating a Session for Snowpark

The first step in using the library is establishing a session with the Snowflake database. To create a session, use the
`SessionBuilder` object. You can access the `SessionBuilder` object through the `builder` field in the
`Session` companion object:

```scala
import com.snowflake.snowpark._

...
// Get a SessionBuilder object.
val builder = Session.builder
```

To provide the details to establish a session with a Snowflake database (for example, the account identifier, user name, etc.),
either create a properties file (a text file) or programmatically build a `Map` containing the properties.

In the properties file or `Map`, set the following properties:

* `URL`: Set this to the URL for your account in the form `https://account_identifier.snowflakecomputing.com`.

  See [Account identifiers](../../../user-guide/admin-account-identifier.md).

  If the account identifier contains underscores (`_`), replace those underscores with hyphens (`-`).
* Any additional JDBC parameters (see [JDBC Driver connection parameter reference](../../jdbc/jdbc-parameters.md) in the JDBC driver documentation) needed to connect
  to Snowflake (e.g. `USER`, `ROLE`, `WAREHOUSE`, `DB`, `SCHEMA`, etc.).

  > **Note:**
  >
  > To change the logging level (e.g. from `INFO` to `DEBUG`), see [Changing the Logging Settings](troubleshooting.md).
* (Optional) `snowpark_request_timeout_in_seconds`: Set this to the maximum number of seconds that the Snowpark library
  should wait in the following cases:

  + Waiting for [dependencies to be uploaded to a stage](creating-udfs.md).
  + Waiting for an [asynchronous action](working-with-dataframes.md) to complete.

  The default value of this property is 86400 seconds (1 day).

  > **Note:**
  >
  > This property was introduced in Snowpark 0.10.0.

To authenticate, you can use the same mechanisms that the JDBC Driver supports. For example, you can use:

* password-based authentication (by setting the `PASSWORD` property)
* [key-pair authentication](../../jdbc/jdbc-configure.md)
* [single sign-on (SSO)](../../jdbc/jdbc-configure.md)

For key-pair authentication, you can either:

* Set the `PRIVATE_KEY_FILE` property to the path to the private key file.

  If the private key is encrypted, set the `PRIVATE_KEY_FILE_PWD` property to the passphrase for decrypting the key.
* Set the `PRIVATEKEY` property to the string value of the unencrypted private key from the private key file.
  (If the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY` property.)

To create the session:

1. Set the properties in the `Session.builder` object.

   * If you created a properties file, pass the path to the properties file to the `Session.builder.configFile` method.
   * If you programmatically built a `Map` of the properties, pass the `Map` to the
     `Session.builder.configs` method.

   Both methods return a `builder` object that has these properties.
2. Call the `create` method of the `builder` object to establish the session.

The following is an example of a properties file that sets the basic parameters for connecting to a Snowflake database. The
example is set up to use key-pair authentication. Set `PRIVATE_KEY_FILE` to the path to the private key file. In addition,
if the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key:

```none
# profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATE_KEY_FILE = </path/to/private_key_file.p8>
PRIVATE_KEY_FILE_PWD = <if the private key is encrypted, set this to the passphrase for decrypting the key>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

As an alternative, you can set the `PRIVATEKEY` property to the unencrypted private key from the private key file.

```none
# profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATEKEY = <unencrypted_private_key_from_the_private_key_file>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

The following example uses this properties file to create a new session:

```scala
// Create a new session, using the connection properties
// specified in a file.
val session = Session.builder.configFile("/path/to/properties/file").create
```

The following example uses a Map to set the properties:

```scala
// Create a new session, using the connection properties
// specified in a Map.
val session = Session.builder.configs(Map(
    "URL" -> "https://<account_identifier>.snowflakecomputing.com",
    "USER" -> "<username>",
    "PRIVATE_KEY_FILE" -> "</path/to/private_key_file.p8>",
    "PRIVATE_KEY_FILE_PWD" -> "<if the private key is encrypted, set this to the passphrase for decrypting the key>",
    "ROLE" -> "<role_name>",
    "WAREHOUSE" -> "<warehouse_name>",
    "DB" -> "<database_name>",
    "SCHEMA" -> "<schema_name>"
)).create
```

## Closing a Session

If you no longer need to use a session for executing queries and you want to cancel any queries that are currently running, call
`close` method of the `Session` object. For example:

```scala
// Close the session, cancelling any queries that are currently running, and
// preventing the use of this Session object for performing any subsequent queries.
session.close();
```

---
title: Creating stored procedures for DataFrames in Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/creating-sprocs.md
section: Snowpark
---

# Creating stored procedures for DataFrames in Java

Using the Snowpark API, you can create stored procedures for your custom lambda expression in Java. You can call these stored procedures
to process the data in your `DataFrame`.

You can create:

* Temporary stored procedures that exist only within the current session.
* Permanent stored procedures that you can use in other sessions, including
  from a Snowflake worksheet.

## Creating a temporary stored procedure

You can create a temporary procedure that will last for the current session only. The temporary procedure can be:

* An anonymous procedure that you can call by reference, such as by passing a variable of type [com.snowflake.snowpark_java.StoredProcedure](../reference/java/com/snowflake/snowpark_java/StoredProcedure.md)
  representing it to code that calls the procedure.
* A named procedure with a name you assign. You can call the procedure by name from other code within the session.

To create a temporary procedure, you register it with one of the `registerTemporary` methods of
[com.snowflake.snowpark_java.SProcRegistration](../reference/java/com/snowflake/snowpark_java/SProcRegistration.md). The method is overloaded multiple times to support different numbers of procedure
arguments. To get an `SProcRegistration` instance, call the [sproc](../reference/java/com/snowflake/snowpark_java/Session.md) method of the [com.snowflake.snowpark_java.Session](../reference/java/com/snowflake/snowpark_java/Session.md) class.

When calling `registerTemporary`, you can pass as arguments the following:

* The procedure’s name (when it is a named procedure).
* The procedure itself as a lambda expression.
* Parameter data types as a single or array of `com.snowflake.snowpark_java.types.DataType` class. Omit this argument when the
  procedure you’re creating has no parameters.

  These should correspond to the parameter types defined in the procedure.
* Return data type as a `com.snowflake.snowpark_java.types.DataType` class.

You can call the procedure with a `storedProcedure` method of the [com.snowflake.snowpark_java.Session](../reference/java/com/snowflake/snowpark_java/Session.md) class.

### Creating an anonymous temporary procedure

To create an anonymous temporary procedure, you register it as a temporary procedure without specifying a name. Snowflake will create a
hidden name for its own use.

Code in the following example calls the `SProcRegistration.registerTemporary` method to create an anonymous procedure from a lambda
expression. The procedure takes a `Session` object and an integer as arguments. The method registers a `DataTypes.IntegerType`
as the single parameter type and a `DataTypes.IntegerType` as the return type.

The procedure itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```java
Session session = Session.builder().configFile("my_config.properties").create();

StoredProcedure sp =
  session.sproc().registerTemporary(
    (Session spSession, Integer num) -> num + 1,
    DataTypes.IntegerType,
    DataTypes.IntegerType
  );
```

Code in the following example calls the anonymous procedure, passing the `sp` variable and `1` as its arguments.
Note that the `Session` object is an implicit argument that you needn’t pass when you call the procedure.

```java
session.storedProcedure(sp, 1).show();
```

### Creating a named temporary procedure

To create a named temporary procedure, you register it as a temporary procedure, passing its name as one of the arguments.

Code in the following example calls the `registerTemporary` method to create a named temporary procedure called
`increment` from a lambda expression, passing the procedure’s name as an argument. The method registers a `DataTypes.IntegerType` as
the single parameter type and a `DataTypes.IntegerType` as the return type.

The procedure itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```java
Session session = Session.builder().configFile("my_config.properties").create();

String procName = "increment";

StoredProcedure tempSP =
  session.sproc().registerTemporary(
    procName,
    (Session session, Integer num) -> num + 1,
    DataTypes.IntegerType,
    DataTypes.IntegerType
  );
```

Code in the following example calls the `increment` procedure, passing the procedure name and `1` as its
arguments. Note that the `Session` object is an implicit argument that you needn’t pass when you call the procedure.

```java
session.storedProcedure(procName, 1).show();
```

## Creating a permanent stored procedure

You can create a permanent stored procedure that you can call from any session, including
[from within a Snowflake worksheet](../../stored-procedure/stored-procedures-calling.md).

To create a permanent procedure, you register it with a `registerPermanent` method of the [com.snowflake.snowpark_java.SProcRegistration](../reference/java/com/snowflake/snowpark_java/SProcRegistration.md)
class. The method is overloaded multiple times to support different numbers of procedure arguments.

When calling `registerPermanent`, you pass as arguments the following:

* The procedure’s name.
* The procedure itself as a lambda expression.
* Parameter data types as a single or array of `com.snowflake.snowpark_java.types.DataType` class. Omit this argument when the
  procedure you’re creating has no parameters.

  These should correspond to the parameter types defined in the procedure.
* The return data type as a `com.snowflake.snowpark_java.types.DataType` class.
* An existing stage to which Snowflake should copy files resulting from compiling the procedure.

  Snowflake will copy all related data, including dependencies and lambda functions. This must be a permanent stage (not session temporary)
  because this stored procedure can be invoked outside of the current session. If the procedure is later dropped, you must manually
  remove related files from the stage.
* A boolean value indicating whether this procedure should execute with caller’s rights.

  For more about caller’s rights and owner’s rights, refer to [Understanding caller’s rights and owner’s rights stored procedures](../../stored-procedure/stored-procedures-rights.md).

Code in the following example calls the `registerPermanent` method to create a permanent procedure called
`add_hundred` from a lambda expression.

The method registers a `DataTypes.IntegerType` as the single parameter type and a `DataTypes.IntegerType` as the return type.
It specifies a stage called `sproc_libs` for the procedure and its dependencies. It also specifies that the procedure should be
executed with caller’s rights.

The procedure itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```java
Session session = Session.builder().configFile("my_config.properties").create();

String procName = "add_hundred";
String stageName = "sproc_libs";

StoredProcedure sp =
    session.sproc().registerPermanent(
        procName,
        (Session session, Integer num) -> num + 100,
        DataTypes.IntegerType,
        DataTypes.IntegerType,
        stageName,
        true
    );
```

Code in the following example calls the `add_hundred` procedure using a `storedProcedure` method of the
[com.snowflake.snowpark_java.Session](../reference/java/com/snowflake/snowpark_java/Session.md) class. The call passes the procedure name and `1` as its arguments. Note that the `Session`
object used in the handler as an argument is an implicit argument that you needn’t pass when you call the procedure.

```java
session.storedProcedure(procName, 1).show();
```

---
title: Creating Stored Procedures for DataFrames in Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-sprocs.md
section: Snowpark
---

# Creating Stored Procedures for DataFrames in Python

The Snowpark API provides methods that you can use to create a stored procedure in Python.
This topic explains how to create stored procedures.

## Introduction

With Snowpark, you can create stored procedures for your custom lambdas and functions, and you can call these
stored procedures to process the data in your DataFrame.

You can create stored procedures that only exist within the current session (temporary stored procedures)
as well as stored procedures that you can use in other sessions (permanent stored procedures).

## Using Artifact Repository packages in a Stored Procedure

For more information, see [Artifact Repository overview](../../udf/python/udf-python-packages.md).

## Using Third-Party Packages from Anaconda in a Stored Procedure

You can specify Anaconda packages to install when you create Python stored procedures.
When calling the Python stored procedure inside a Snowflake warehouse, Anaconda packages
are installed seamlessly and cached on the virtual warehouse on your behalf.
For more information about best practices, how to view the available packages, and how to
set up a local development environment, see [Using third-party packages](../../udf/python/udf-python-packages.md).

Use `session.add_packages` to add packages at the session level.

This code example shows how to import packages and return their versions.

```python
import pandas as pd
import snowflake.snowpark
import xgboost as xgb
from snowflake.snowpark.functions import sproc

session.add_packages("snowflake-snowpark-python", "pandas", "xgboost==1.5.0")

@sproc
def compute(session: snowflake.snowpark.Session) -> list:
  return [pd.__version__, xgb.__version__]
```

You can also use `session.add_requirements` to specify packages with a
[requirements file](https://pip.pypa.io/en/stable/user_guide/#requirements-files).

```python
session.add_requirements("mydir/requirements.txt")
```

You can add the stored-procedure-level packages to overwrite the session-level packages you might have added previously.

```python
import pandas as pd
import snowflake.snowpark
import xgboost as xgb
from snowflake.snowpark.functions import sproc

@sproc(packages=["snowflake-snowpark-python", "pandas", "xgboost==1.5.0"])
def compute(session: snowflake.snowpark.Session) -> list:
  return [pd.__version__, xgb.__version__]
```

> **Important:**
>
> If you don’t specify a package version, Snowflake will use the latest version when resolving dependencies.
> When deploying the stored procedure to production, however, you may want to ensure that your code always
> uses the same dependency versions. You can do that for both permanent and temporary stored procedures.
>
> * When you create a permanent stored procedure, the stored procedure is created and registered only once.
>   This resolves dependencies once and the selected version is used for production workloads. When the stored procedure executes,
>   it will always use the same dependency versions.
> * When you create a temporary stored procedure, specify dependency versions as part of the version spec.
>   That way, when the stored procedure is registered, package resolution will use the specified version.
>   If you don’t specify the version, the dependency might be updated when a new version becomes available.

## Creating an Anonymous Stored Procedure

To create an anonymous stored procedure, you can either:

* Call the `sproc` function in the `snowflake.snowpark.functions` module, passing in the definition of the anonymous function.
* Call the `register` method in the `StoredProcedureRegistration` class, passing in the definition of the anonymous function.
  To access an attribute or method of the `StoredProcedureRegistration` class, call the `sproc` property of the `Session` class.

Here is an example of an anonymous stored procedure:

```python
from snowflake.snowpark.functions import sproc
from snowflake.snowpark.types import IntegerType

add_one = sproc(lambda session, x: session.sql(f"select {x} + 1").collect()[0][0], return_type=IntegerType(), input_types=[IntegerType()], packages=["snowflake-snowpark-python"])
```

> **Note:**
>
> When writing code that might execute in multiple sessions, use the `register` method to register stored procedures,
> rather than using the `sproc` function. This can prevent errors in which the default Snowflake `Session` object
> cannot be found.

## Creating and registering a named stored procedure

If you want to call a stored procedure by name (e.g. by using the `call` function in the `Session` object),
you can create and register a named stored procedure. To do this, you can either:

* Call the `sproc` function in the `snowflake.snowpark.functions` module, passing in the `name` argument
  and the definition of the anonymous function.
* Call the `register` method in the `StoredProcedureRegistration` class, passing in the `name` argument
  and the definition of the anonymous function.
  To access an attribute or method of the `StoredProcedureRegistration` class, call the `sproc` property of the `Session` class.

Calling `register` or `sproc` will create a temporary stored procedure that you can use in the current session.

To create a permanent stored procedure, call the `register` method or the `sproc` function and set
the `is_permanent` argument to `True`. When you create a permanent stored procedure, you must also set the `stage_location`
argument to the stage location where the Python connector used by Snowpark uploads the Python file for the stored procedure and its dependencies.

Here is an example of how to register a named temporary stored procedure:

```python
from snowflake.snowpark.functions import sproc
from snowflake.snowpark.types import IntegerType

add_one = sproc(lambda session, x: session.sql(f"select {x} + 1").collect()[0][0], return_type=IntegerType(), input_types=[IntegerType()], name="my_sproc", replace=True, packages=["snowflake-snowpark-python"])
```

Here is an example of how to register a named permanent stored procedure by setting the `is_permanent` argument to `True`:

```python
import snowflake.snowpark
from snowflake.snowpark.functions import sproc

@sproc(name="minus_one", is_permanent=True, stage_location="@my_stage", replace=True, packages=["snowflake-snowpark-python"])
def minus_one(session: snowflake.snowpark.Session, x: int) -> int:
  return session.sql(f"select {x} - 1").collect()[0][0]
```

Here is an example of these stored procedures being called:

```python
add_one(1)
```

```output
2
```

```python
session.call("minus_one", 1)
```

```output
0
```

```python
session.sql("call minus_one(1)").collect()
```

```output
[Row(MINUS_ONE(1)=0)]
```

## Reading Files with a Stored Procedure

To read the contents of a file with a stored procedure, you can:

* Read a statically-specified file by importing a file and then reading it from the stored procedure’s home directory.
* Read a dynamically-specified file with SnowflakeFile. You might do this if you need to access a file during computation.

### Reading Statically-Specified Files

1. Specify that the file is a dependency, which uploads the file to the server. This is done the same way as for UDFs.
   For more information, see [Specifying Dependencies for a UDF](creating-udfs.md).

   For example:

   ```python
   # Import a file from your local machine as a dependency.
   session.add_import("/<path>/my_file.txt")

   # Or import a file that you uploaded to a stage as a dependency.
   session.add_import("@my_stage/<path>/my_file.txt")
   ```
2. In the stored procedure, read the file.

   ```python
   def read_file(name: str) -> str:
     import sys
     IMPORT_DIRECTORY_NAME = "snowflake_import_directory"
     import_dir = sys._xoptions[IMPORT_DIRECTORY_NAME]

     with open(import_dir + 'my_file.txt', 'r') as file:
     return file.read()
   ```

### Reading Dynamically-Specified Files with `SnowflakeFile`

You can read a file from a stage using the `SnowflakeFile` class in the Snowpark `snowflake.snowpark.files` module.
The `SnowflakeFile` class provides dynamic file access, which lets you stream files of any size. Dynamic file access is also useful when you want to iterate over multiple files. For example, see [Processing multiple files](../../udf/python/udf-python-examples.md).

For more information about and examples of reading files using `SnowflakeFile`, see [Reading a File Using the SnowflakeFile Class in a Python UDF Handler](../../udf/python/udf-python-examples.md).

The following example creates a permanent stored procedure that reads a file from a stage using `SnowflakeFile` and returns the file length.

Create the stored procedure:

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark.functions import sproc
from snowflake.snowpark.files import SnowflakeFile
from snowflake.snowpark.types import StringType, IntegerType

@sproc(name="calc_size", is_permanent=True, stage_location="@my_procedures", replace=True, packages=["snowflake-snowpark-python"])
def calc_size(ignored_session: snowpark.Session, file_path: str) -> int:
  with SnowflakeFile.open(file_path) as f:
    s = f.read()
  return len(s);
```

Call the stored procedure:

```python
file_size = session.sql("call calc_size(build_scoped_file_url('@my_stage', 'my_file.csv'))")
```

---
title: Creating stored procedures for DataFrames in Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-sprocs.md
section: Snowpark
---

# Creating stored procedures for DataFrames in Scala

Using the Snowpark API, you can create stored procedures for your custom lambdas and functions. You can call these
stored procedures to process the data in your DataFrame.

You can create:

* Temporary stored procedures that exist only within the current session.
* Permanent stored procedures that you can use in other sessions, including
  from a Snowflake worksheet.

## Creating a temporary stored procedure

You can create a temporary procedure that will last for the current session only. The temporary procedure can be:

* An anonymous procedure that you can call by reference, such as by passing a [com.snowflake.snowpark.StoredProcedure](../reference/scala/com/snowflake/snowpark/StoredProcedure.md) variable
  representing it to code that calls the procedure.
* A named procedure with a name you assign. You can call the procedure by name from other code within the session.

To create a temporary procedure, you register it with one of the `registerTemporary` methods of [com.snowflake.snowpark.SProcRegistration](../reference/scala/com/snowflake/snowpark/SProcRegistration.md).
The method is overloaded multiple times to support different numbers of procedure arguments.

When calling `registerTemporary`, you can pass as arguments the following:

* The procedure’s name (when it is a named procedure).
* The procedure itself as a lambda expression.

### Creating an anonymous temporary procedure

To create an anonymous temporary procedure, you register it as a temporary procedure without specifying a name. Snowflake will create a
hidden name for its own use.

Code in the following example calls the `SProcRegistration.registerTemporary` method to create an anonymous procedure from a lambda
function. The function itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```scala
val session = Session.builder.configFile("my_config.properties").create

val sp = session.sproc.registerTemporary(
  (session: Session, num: Int) => num + 1
)
```

Code in the following example calls the anonymous procedure using the `storedProcedure` method of the
[com.snowflake.snowpark.Session](../reference/scala/com/snowflake/snowpark/Session.md) class, passing the `sp` variable and `1` as its arguments. Note that the `Session` object
is an implicit argument that you needn’t pass when you call the procedure.

```scala
session.storedProcedure(sp, 1).show()
```

### Creating a named temporary procedure

To create a named temporary procedure, you register it as a temporary procedure, passing its name as one of the arguments.

Code in the following example calls the `registerTemporary` method to create a named temporary procedure called
`add_two` from a lambda expression, passing the procedure’s name as an argument. The method registers an `Int` as
the single parameter type.

The procedure itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```scala
val session = Session.builder.configFile("my_config.properties").create

val procName: String = "add_two"
val tempSP: StoredProcedure =
  session.sproc.registerTemporary(
    procName,
    (session: Session, num: Int) => num + 2)
```

Code in the following example calls the `add_two` procedure using a `storedProcedure` method of the
[com.snowflake.snowpark.Session](../reference/scala/com/snowflake/snowpark/Session.md) class, passing the procedure name and `1` as its arguments. Note that the `Session` object is
an implicit argument that you needn’t pass when you call the procedure.

```scala
session.storedProcedure(procName, 1).show()
```

## Creating a permanent stored procedure

You can create a permanent stored procedure that you can call from any session, including
[from within a Snowflake worksheet](../../stored-procedure/stored-procedures-calling.md).

To create a permanent procedure, you register it with a `registerPermanent` method of the [com.snowflake.snowpark.SProcRegistration](../reference/scala/com/snowflake/snowpark/SProcRegistration.md)
class. The method is overloaded multiple times to support a variety of procedure requirements.

When calling `registerPermanent`, you pass as arguments the following:

* The procedure’s name.
* The procedure itself as a lambda expression.
* An existing stage to which Snowflake should copy files resulting from compiling the procedure.

  Snowflake will copy all related data, including dependencies and lambda functions. This must be a permanent stage (not session temporary)
  because this stored procedure can be invoked outside of the current session. If the procedure is later dropped, you must manually
  remove related files from the stage.
* A boolean value indicating whether this procedure should execute with caller’s rights.

  For more about caller’s rights and owner’s rights, refer to [Understanding caller’s rights and owner’s rights stored procedures](../../stored-procedure/stored-procedures-rights.md).

Code in the following example calls the `registerPermanent` method to create a permanent procedure called
`add_hundred` from a lambda expression.

The method specifies a stage called `sproc_libs` for the procedure and its dependencies. It also specifies that the procedure should
be executed with caller’s rights.

The procedure itself will take a `Session` object and an integer as arguments. The `Session` argument represents an implicit
parameter that callers needn’t pass as an argument.

```scala
val session = Session.builder.configFile("my_config.properties").create

val procName: String = "add_hundred"
val stageName: String = "sproc_libs"

val sp: StoredProcedure =
  session.sproc.registerPermanent(
    procName,
    (session: Session, num: Int) => num + 100,
    stageName,
    true
  )
```

Code in the following example calls the `add_hundred` procedure using a `storedProcedure` method of the
[com.snowflake.snowpark.Session](../reference/scala/com/snowflake/snowpark/Session.md) class. The call passes the procedure name and `1` as its arguments. Note that the `Session`
object used in the handler as an argument is an implicit argument that you needn’t pass when you call the procedure.

```scala
session.storedProcedure(procName, 1).show()
```

---
title: Creating User-Defined Aggregate Functions (UDAFs) for DataFrames in Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-udafs.md
section: Snowpark
---

# Creating User-Defined Aggregate Functions (UDAFs) for DataFrames in Python

You can use Snowpark Python APIs to create and call user-defined aggregate functions (UDAFs). A UDAF takes one or more rows as input and
produces a single row of output. It operates on values across multiple rows to perform mathematical calculations such as sum, average,
counting, finding minimum or maximum values, standard deviation, and estimation, as well as some non-mathematical operations.

To create and register a UDAF with Snowpark, you need to:

* Implement a UDAF handler.

  The handler contains the UDAF’s logic. A UDAF handler must implement functions that Snowflake will invoke at runtime when the UDAF is
  called. For more information, see Implementing a handler.
* Register the UDAF and its handler in the Snowflake database.

  Once you’ve registered the UDAF, you can call it from SQL or by using the Snowpark API. You can use the Snowpark API to register the
  UDAF and its handler. For more information about registering, see Registering a UDAF.

You can also create your own UDAFs using SQL as described in [Python user-defined aggregate functions](../../udf/python/udf-python-aggregate-functions.md).

## Implementing a handler

As described in [Interface for aggregate function handler](../../udf/python/udf-python-aggregate-functions.md), a UDAF handler class must implement methods that Snowflake invokes
when the UDAF is called. You can use the class you write as a handler whether you’re registering the UDAF with the Snowpark API or
[creating it with SQL using the CREATE FUNCTION statement](../../udf/python/udf-python-aggregate-functions.md).

Your UDAF handler class implements methods listed in the following table, which Snowflake invokes at run time. See
examples in this topic.

| Method | Requirement | Description |
| --- | --- | --- |
| `__init__` | Required | Initializes the internal state of an aggregate. |
| `aggregate_state` | Required | Returns the internal state of an aggregate.   * The method must have a [@property decorator](https://docs.python.org/3.12/library/functions.html#property). * An aggregate state object can be any Python data type serializable by the   [Python pickle library](https://docs.python.org/3/library/pickle.html#what-can-be-pickled-and-unpickled). * For simple aggregate states, use a primitive Python data type. For more complex aggregate states, use   [Python data classes](https://docs.python.org/3/library/dataclasses.html). |
| `accumulate` | Required | Accumulates the state of the aggregate based on the new input row. |
| `merge` | Required | Combines two intermediate aggregated states. |
| `finish` | Required | Produces the final result based on the aggregated state. |

## Registering a UDAF

Once you’ve implemented a UDAF handler, you can use the Snowpark API to register the UDAF on the Snowflake database. Registering the UDAF
creates the UDAF so that it can be called.

You can register the UDAF as a named or anonymous function, as you can for a scalar UDF. For related information about registering a scalar
UDF, see [Creating an Anonymous UDF](creating-udfs.md) and [Creating and Registering a Named UDF](creating-udfs.md). When you register a UDAF,
you specify parameter values that Snowflake needs to create the UDAF.

You can register the function using the following functions and methods:

* Use the `register` method or `udaf` function, specifying the name of your handler class, along with arguments to define the
  function. You can also use `udaf` as a `@udaf` decorator on the handler class.

  For reference information on these, see the following:

  + [snowflake.snowpark.functions.udaf](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udaf)
  + [snowflake.snowpark.udaf.UDAFRegistration.register](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.udaf.UDAFRegistration)
* Use the `register_from_file` function, pointing to a Python file or zip file containing Python source code.

  For the function reference, see [snowflake.snowpark.udaf.UDAFRegistration.register_from_file](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.udaf.UDAFRegistration.register_from_file#snowflake.snowpark.udaf.UDAFRegistration.register_from_file).

## Examples

### Create a UDAF with a return value and a single parameter

Python code in the following handler example supports a `sum_int` UDAF that receives a single integer argument, adds the value
across rows and returns the result.

#### Register the function

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark.types import IntegerType
from snowflake.snowpark.functions import udaf
def main(session: snowpark.Session):
class PythonSumUDAF:
  def __init__(self):
    # This aggregate state is a primitive Python data type.
    self._partial_sum = 0

  @property
  def aggregate_state(self):
    return self._partial_sum

  def accumulate(self, input_value):
    self._partial_sum += input_value

  def merge(self, other_partial_sum):
    self._partial_sum += other_partial_sum

  def finish(self):
    return self._partial_sum
sum_udaf = udaf(PythonSumUDAF, name="sum_int", replace=True, return_type=IntegerType(), input_types=[IntegerType()])
```

#### Call the function

Python code in the following example invokes the `sum_int` UDAF with a DataFrame.

```python
df = session.create_dataframe([[1, 3], [1, 4], [2, 5], [2, 6]]).to_df("a", "b")
result = df.agg(sum_udaf("a")).collect()
print(result.collect())
```

### Create a UDAF with a return value and two parameters

#### Register the function

Python code in the following handler example supports a `sum_int` UDAF that receives two integer arguments, adds the argument
values together across rows and returns the result.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark.types import IntegerType
from snowflake.snowpark.functions import udaf
def main(session: snowpark.Session):
  class PythonSumUDAF:
    def __init__(self):
      self._partial_sum = 0

    @property
  def aggregate_state(self):
    return self._partial_sum

  def accumulate(self, input_value, input_value2):
    self._partial_sum += input_value + input_value2

  def merge(self, other_partial_sum):
    self._partial_sum += other_partial_sum

  def finish(self):
    return self._partial_sum
sum_udaf = udaf(PythonSumUDAF, name="sum_int", replace=True, return_type=IntegerType(), input_types=[IntegerType(), IntegerType()])
```

#### Call the function

Python code in the following example invokes the `sum_int` UDAF with a DataFrame.

```python
df = session.create_dataframe([[1, 3], [1, 4], [2, 5], [2, 6]]).to_df("a", "b")
result = df.agg(sum_udaf("a", "b"))
print(result.collect())
```

---
title: Creating User-Defined Functions (UDFs) for DataFrames in Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/creating-udfs.md
section: Snowpark
---

# Creating User-Defined Functions (UDFs) for DataFrames in Java

The Snowpark API provides methods that you can use to create a user-defined function from a lambda expression in Java.
This topic explains how to create these types of functions.

## Introduction

You can call Snowpark APIs to create user-defined functions (UDFs) for lambda expressions in Java, and you can call
these UDFs to process the data in your DataFrame.

When you use the Snowpark API to create a UDF, the Snowpark library serializes and uploads the code for your UDF to a stage. When you call the UDF, the Snowpark library executes your function on the server, where the data is located. As a result,
the data doesn’t need to be transferred to the client in order for the function to process the data.

In your custom code, you can also call code that is packaged in JAR files (for example, Java classes for a third-party library).

You can create a UDF for your custom code in one of two ways:

* You can create an anonymous UDF and assign the function to a variable. As long
  as this variable is in scope, you can use this variable to call the UDF.

  > ```java
  > import com.snowflake.snowpark_java.types.*;
  > ...
  >
  > // Create and register an anonymous UDF (doubleUdf)
  > // that takes in an integer argument and returns an integer value.
  > UserDefinedFunction doubleUdf =
  >   Functions.udf((Integer x) -> x + x, DataTypes.IntegerType, DataTypes.IntegerType);
  > // Call the anonymous UDF.
  > DataFrame df = session.table("sample_product_data");
  > DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", doubleUdf.apply(Functions.col("quantity")));
  > dfWithDoubleQuantity.show();
  > ```
* You can create a named UDF and call the UDF by name. You can use this if, for
  example, you need to call a UDF by name or use the UDF in a subsequent session.

  > ```java
  > import com.snowflake.snowpark_java.types.*;
  > ...
  >
  > // Create and register a permanent named UDF ("doubleUdf")
  > // that takes in an integer argument and returns an integer value.
  > UserDefinedFunction doubleUdf =
  >   session
  >     .udf()
  >     .registerPermanent(
  >       "doubleUdf",
  >       (Integer x) -> x + x,
  >       DataTypes.IntegerType,
  >       DataTypes.IntegerType,
  >       "mystage");
  > // Call the named UDF.
  > DataFrame df = session.table("sample_product_data");
  > DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", Functions.callUDF("doubleUdf", Functions.col("quantity")));
  > dfWithDoubleQuantity.show();
  > ```

The rest of this topic explains how to create UDFs.

> **Note:**
>
> If you defined a UDF by running the `CREATE FUNCTION` command, you can call that UDF in Snowpark.
>
> For details, see [Calling scalar user-defined functions (UDFs)](calling-functions.md).

## Data Types Supported for Arguments and Return Values

In order to create a UDF for a Java lambda, you must use the supported data types listed below for the arguments and return value
of your method:

| SQL Data Type | Java Data Type | Notes |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | The following types are supported:   * `Integer` * `Long` * `java.math.BigDecimal` or `java.math.BigInteger` |  |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `Float` |  |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `Double` |  |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `String` |  |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `Boolean` |  |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` |  |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` |  |
| [BINARY](../../../sql-reference/data-types-text.md) | `Byte[]` |  |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark_java.types.Variant](../reference/java/com/snowflake/snowpark_java/types/Variant.md) |  |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `String[]` or `Variant[]` |  |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map<String, String>` or `Map<String, Variant>` |  |
| [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) | [com.snowflake.snowpark_java.types.Geography](../reference/java/com/snowflake/snowpark_java/types/Geography.md) |  |

## Specifying Dependencies for a UDF

In order to define a UDF through the Snowpark API, you must call `Session.addDependency()` for any files that contain any
classes and resources that your UDF depends on (e.g. JAR files, resource files, etc.). (For details on reading resources from a
UDF, see Reading Files from a UDF.)

The Snowpark library uploads these files to an internal stage and adds the files to the classpath when executing your UDF.

> **Tip:**
>
> If you don’t want the library to upload the file every time you run your application, upload the file to a stage. When calling
> `addDependency`, pass the path to the file in the stage.

The following example demonstrates how to add a JAR file in a stage as a dependency:

```java
// Add a JAR file that you uploaded to a stage.
session.addDependency("@my_stage/<path>/my-library.jar");
```

The following examples demonstrate how to add dependencies for JAR files and resource files:

```java
// Add a JAR file on your local machine.
session.addDependency("/<path>/my-library.jar");

// Add a directory of resource files.
session.addDependency("/<path>/my-resource-dir/");

// Add a resource file.
session.addDependency("/<path>/my-resource.xml");
```

You should not need to specify the following dependencies:

> * **Your Java runtime libraries.**
>
>   These libraries are already available in the runtime environment on the server where your UDFs are executed.
> * **The Snowpark JAR file.**
>
>   The Snowpark library automatically attempts to detect and upload the Snowpark JAR file to the server.
>
>   To prevent the library from repeatedly uploading the Snowpark JAR file to the server:
>
>   1. Upload the Snowpark JAR file to a stage.
>
>      For example, the following command uploads the Snowpark JAR file to the stage `@mystage`. The PUT command compresses the
>      JAR file and names the resulting file snowpark_2.12-1.18.0.jar.gz.
>
>      ```
>      -- Put the Snowpark JAR file in a stage.
>      PUT file:///<path>/snowpark_2.12-1.18.0.jar @mystage
>      ```
>   2. Call `addDependency` to add the Snowpark JAR file in the stage as a dependency.
>
>      For example, to add the Snowpark JAR file uploaded by the previous command:
>
>      ```
>      // Add the Snowpark JAR file that you uploaded to a stage.
>      session.addDependency("@mystage/snowpark_2.12-1.18.0.jar.gz");
>      ```
>
>      Note that the specified path to the JAR file includes the `.gz` filename extension, which was added by the PUT command.
> * **The JAR file or directory with the currently running application.**
>
>   The Snowpark library automatically attempts to detect and upload these dependencies.
>
>   If the Snowpark library is unable to detect these dependencies automatically, the library reports an error, and you must call
>   `addDependency` to add these dependencies manually.

If it takes too long for the dependencies to be uploaded to the stage, the Snowpark library reports a timeout exception. To
configure the maximum amount of time that the Snowpark library should wait, set the
[snowpark_request_timeout_in_seconds](creating-session.md) property when creating the session.

## Creating an Anonymous UDF

To create an anonymous UDF, you can either:

* Call the `Functions.udf` static method, passing in the lambda expression and the [DataTypes](../reference/java/com/snowflake/snowpark_java/types/DataTypes.md) fields (or objects
  constructed by the methods of that class) representing the data types of the inputs and output.
* Call the `registerTemporary` method in the `UDFRegistration` class, passing in the lambda expression and the
  [DataTypes](../reference/java/com/snowflake/snowpark_java/types/DataTypes.md) fields (or objects constructed by the methods of that class) representing the data types of the inputs and output.

  You can access an instance of the `UDFRegistration` class by calling the `udf` method of the `Session` object.

  When calling `registerTemporary`, use a method signature that does not have a `name` parameter. (Because you are
  creating an anonymous UDF, you do not specify a name for the UDF.)

> **Note:**
>
> When writing multi-threaded code (e.g. when using parallel collections), use the `registerTemporary` method to register
> UDFs, rather than using the `udf` method. This can prevent errors in which the default Snowflake `Session` object
> cannot be found.

These methods return a `UserDefinedFunction` object, which you can use to call the UDF. (See
[Calling scalar user-defined functions (UDFs)](calling-functions.md).)

The following example creates an anonymous UDF:

```java
import com.snowflake.snowpark_java.types.*;
...

// Create and register an anonymous UDF
// that takes in an integer argument and returns an integer value.
UserDefinedFunction doubleUdf =
  Functions.udf((Integer x) -> x + x, DataTypes.IntegerType, DataTypes.IntegerType);
// Call the anonymous UDF, passing in the "quantity" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleQuantity".
DataFrame df = session.table("sample_product_data");
DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", doubleUdf.apply(Functions.col("quantity")));
dfWithDoubleQuantity.show();
```

The following example creates an anonymous UDF that uses a custom class (`LanguageDetector`, which detects the language used
in text). The example calls the anonymous UDF to detect the language in the `text_data` column in a DataFrame and creates a new
DataFrame that includes an additional `lang` column with the language used.

```java
import com.snowflake.snowpark_java.types.*;

// Import the package for your custom code.
// The custom code in this example detects the language of textual data.
import com.mycompany.LanguageDetector;

// If the custom code is packaged in a JAR file, add that JAR file as
// a dependency.
session.addDependency("$HOME/language-detector.jar");

// Create a detector
LanguageDetector detector = new LanguageDetector();

// Create an anonymous UDF that takes a string of text and returns the language used in that string.
// Note that this captures the detector object created above.
// Assign the UDF to the langUdf variable, which will be used to call the UDF.
UserDefinedFunction langUdf =
  Functions.udf(
    (String s) -> Option(detector.detect(s)).getOrElse("UNKNOWN"),
    DataTypes.StringType,
    DataTypes.StringType);

// Create a new DataFrame that contains an additional "lang" column that contains the language
// detected by the UDF.
DataFrame dfEmailsWithLangCol =
    dfEmails.withColumn("lang", langUdf(Functions.col("text_data")));
```

## Creating and Registering a Named UDF

If you want to call a UDF by name (e.g. by using the `Functions.callUDF` static method) or if you need to use a UDF in
subsequent sessions, you can create and register a named UDF. To do this, use one of the following methods in the
`UDFRegistration` class:

* `registerTemporary`, if you just plan to use the UDF in the current session
* `registerPermanent`, if you plan to use the UDF in subsequent sessions

To access an object of the `UDFRegistration` class, call the `udf` method of the `Session` object.

When calling `registerTemporary` or `registerPermanent` method, pass in the lambda expression and the [DataTypes](../reference/java/com/snowflake/snowpark_java/types/DataTypes.md)
fields (or objects constructed by the methods of that class) representing the data types of the inputs and output.

For example:

```java
import com.snowflake.snowpark_java.types.*;
...
// Create and register a temporary named UDF
// that takes in an integer argument and returns an integer value.
UserDefinedFunction doubleUdf =
  session
    .udf()
    .registerTemporary(
      "doubleUdf",
      (Integer x) -> x + x,
      DataTypes.IntegerType,
      DataTypes.IntegerType);
// Call the named UDF, passing in the "quantity" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleQuantity".
DataFrame df = session.table("sample_product_data");
DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", Functions.callUDF("doubleUdf", Functions.col("quantity")));
dfWithDoubleQuantity.show();
```

`registerPermanent` creates a UDF that you can use in the current and subsequent sessions. When you call
`registerPermanent`, you must also specify a location in an internal stage location where the JAR files for the UDF and its
dependencies will be uploaded.

> **Note:**
>
> `registerPermanent` does not support external stages.

For example:

```java
import com.snowflake.snowpark_java.types.*;
...

// Create and register a permanent named UDF
// that takes in an integer argument and returns an integer value.
// Specify that the UDF and dependent JAR files should be uploaded to
// the internal stage named mystage.
UserDefinedFunction doubleUdf =
  session
    .udf()
    .registerPermanent(
      "doubleUdf",
      (Integer x) -> x + x,
      DataTypes.IntegerType,
      DataTypes.IntegerType,
      "mystage");
// Call the named UDF, passing in the "quantity" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleQuantity".
DataFrame df = session.table("sample_product_data");
DataFrame dfWithDoubleQuantity = df.withColumn("doubleQuantity", Functions.callUDF("doubleUdf", Functions.col("quantity")));
dfWithDoubleQuantity.show();
```

## Using Objects That Are Not Serializable

When you create a UDF for a lambda expression, the Snowpark library serializes the lambda closure and sends it to the server for
execution.

If an object captured by the lambda closure is not serializable, the Snowpark library throws an
`java.io.NotSerializableException` exception.

```none
Exception in thread "main" java.io.NotSerializableException: <YourObjectName>
```

If this occurs, you must make the object serializable.

## Writing Initialization Code for a UDF

If your UDF requires initialization code or context, you can provide this through values captured as part of the UDF closure.

The following example uses a separate class to initialize the context needed by two UDFs.

* The first UDF creates a new instance of the class within the lambda, so the initialization is performed every time the UDF is
  invoked.
* The second UDF captures an instance of the class generated in your client program. The context generated on the client is
  serialized and is used by the UDF. Note that the context class must be serializable for this approach to work.

```java
import com.snowflake.snowpark_java.*;
import com.snowflake.snowpark_java.types.*;
import java.io.Serializable;

// Context needed for a UDF.
class Context {
  double randomInt = Math.random();
}

// Serializable context needed for the UDF.
class SerContext implements Serializable {
  double randomInt = Math.random();
}

class TestUdf {
  public static void main(String[] args) {
    // Create the session.
    Session session = Session.builder().configFile("/<path>/profile.properties").create();
    session.range(1, 10, 2).show();

    // Create a DataFrame with two columns ("c" and "d").
    DataFrame dummy =
      session.createDataFrame(
        new Row[]{
          Row.create(1, 1),
          Row.create(2, 2),
          Row.create(3, 3)
        },
        StructType.create(
          new StructField("c", DataTypes.IntegerType),
          new StructField("d", DataTypes.IntegerType))
        );
    dummy.show();

    // Initialize the context once per invocation.
    UserDefinedFunction udfRepeatedInit =
      Functions.udf(
        (Integer i) -> new Context().randomInt,
        DataTypes.IntegerType,
        DataTypes.DoubleType
      );
    dummy.select(udfRepeatedInit.apply(dummy.col("c"))).show();

    // Initialize the serializable context only once,
    // regardless of the number of times that the UDF is invoked.
    SerContext sC = new SerContext();
    UserDefinedFunction udfOnceInit =
      Functions.udf(
        (Integer i) -> sC.randomInt,
        DataTypes.IntegerType,
        DataTypes.DoubleType
      );
    dummy.select(udfOnceInit.apply(dummy.col("c"))).show();
    UserDefinedFunction udfOnceInit = udf((i: Int) => sC.randomInt);
  }
}
```

## Reading Files from a UDF

As mentioned earlier, the Snowpark library uploads and executes UDFs on the server. If your UDF needs to read data from a file,
you must ensure that the file is uploaded with the UDF.

In addition, if the content of the file remains the same between calls to the UDF, you can write your code to load the file once
during the first call and not on subsequent calls. This can improve the performance of your UDF calls.

To set up a UDF to read a file:

1. Add the file to a JAR file.

   For example, if your UDF needs to use a file in a `data/` subdirectory (`data/hello.txt`), run the `jar` command to
   add this file to a JAR file:

   ```bash
   # Create a new JAR file containing data/hello.txt.
   $ jar cvf <path>/myJar.jar data/hello.txt
   ```
2. Specify that the JAR file is a dependency, which uploads the file to the server and adds the file to the classpath. See
   Specifying Dependencies for a UDF.

   For example:

   ```java
   // Specify that myJar.jar contains files that your UDF depends on.
   session.addDependency("<path>/myJar.jar");
   ```
3. In the UDF, call `Class.forName().getResourceAsStream()` to find the file in the classpath and read the file.

   To avoid adding a dependency on `this`, you can use `Class.forName("com.snowflake.snowpark_java.DataFrame")`
   (rather than `getClass()`) to get the `Class` object.

   For example, to read the `data/hello.txt` file:

   ```java
   // Read data/hello.txt from myJar.jar.
   String resourceName = "/data/hello.txt";
   InputStream inputStream = Class.forName("com.snowflake.snowpark_java.DataFrame").getResourceAsStream(resourceName);
   ```

   In this example, the resource name starts with a `/`, which indicates that this is the full path of the file in the JAR file.
   (In this case, the location of the file is not relative to the package of the class.)

> > **Note:**
> >
> > If you don’t expect the content of the file to change between UDF calls, read the file into a static field of your class,
> > and read the file only if the field is not set.

The following example defines an object (`UDFCode`) with a function that will be used as a UDF (`readFileFunc`). The function
reads the file `data/hello.txt`, which is expected to contain the string `hello,`. The function prepends this string to the
string passed in as an argument.

```java
import java.io.InputStream;
import java.nio.charset.StandardCharsets;

// Create a function class that reads a file.
class UDFCode {
  private static String fileContent = null;
  // The code in this block reads the file. To prevent this code from executing each time that the UDF is called,
  // The file content is cached in 'fileContent'.
  public static String readFile() {
    if (fileContent == null) {
      try {
        String resourceName = "/data/hello.txt";
        InputStream inputStream = Class.forName("com.snowflake.snowpark_java.DataFrame")
          .getResourceAsStream(resourceName);
        fileContent = new String(inputStream.readAllBytes(), StandardCharsets.UTF_8);
      } catch (Exception e) {
        fileContent = "Error while reading file";
      }
    }
    return fileContent;
  }
}
```

The next part of the example registers the function as an anonymous UDF. The example calls the UDF on the `NAME` column in a
DataFrame. The example assumes that the `data/hello.txt` file is packaged in the JAR file `myJar.jar`.

```java
import com.snowflake.snowpark_java.types.*;

// Add the JAR file as a dependency.
session.addDependency("<path>/myJar.jar");

// Create a new DataFrame with one column (NAME)
// that contains the name "Raymond".
DataFrame myDf = session.sql("select 'Raymond' NAME");

// Register the function that you defined earlier as an anonymous UDF.
UserDefinedFunction readFileUdf = session.udf().registerTemporary(
  (String s) -> UDFCode.readFile() + " : " + s, DataTypes.StringType, DataTypes.StringType);

// Call UDF for the values in the NAME column of the DataFrame.
myDf.withColumn("CONCAT", readFileUdf.apply(Functions.col("NAME"))).show();
```

## Creating User-Defined Table Functions (UDTFs)

To create and register a UDTF in Snowpark, you must:

* Define a class for the UDTF.
* Create an instance of that class, and register that instance as a UDTF.

The next sections describe these steps in more detail.

For information on calling a UDTF, see Calling a UDTF.

### Defining the UDTF Class

Define a class that implements one of the `JavaUDTFn` interfaces (e.g. `JavaUDTF0`, `JavaUDTF1`, etc.) in the
[com.snowflake.snowpark_java.udtf package](../reference/java/com/snowflake/snowpark_java/udtf/package-summary.md), where `n` specifies the number of input arguments for your UDTF. For example,
if your UDTF passes in 2 input arguments, implement the `JavaUDTF2` interface.

In your class, implement the following methods:

* outputSchema(), which returns a `types.StructType` object
  that describes the names and types of the fields in the returned rows (the “schema” of the output).
* process(), which is called once for each row in the
  input partition (see the note below).
* inputSchema(), which returns a `types.StructType` object that
  describes the types of the input parameters.

  If your `process()` method passes in `Map` arguments, you must implement the `inputSchema()` method.
  Otherwise, implementing this method is optional.
* endPartition(), which is called once for each partition after all
  rows have been passed to `process()`.

When a UDTF is called, the rows are grouped into partitions before they are passed to the UDTF:

* If the statement that calls the UDTF specifies the PARTITION clause (explicit partitions), that clause determines how the rows
  are partitioned.
* If the statement does not specify the PARTITION clause (implicit partitions), Snowflake determines how best to partition the
  rows.

For an explanation of partitions, see [Table functions and partitions](../../udf/udf-calling-sql.md).

For an example of a UDTF class, see Example of a UDTF Class.

#### Implementing the outputSchema() Method

Implement the `outputSchema()` method to define the names and data types of the fields (the “output schema”) of the rows
returned by the `process()` and `endPartition()` methods.

> ```java
> public StructType outputSchema()
> ```

In this method, construct and return a [StructType](../reference/java/com/snowflake/snowpark_java/types/StructType.md) object that contains [StructField](../reference/java/com/snowflake/snowpark_java/types/StructField.md) objects representing the Snowflake data
type of each field in a returned row. Snowflake supports the following type objects for the output schema for a UDTF:

| SQL Data Type | Java Type | `com.snowflake.snowpark_java.types` Type |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Short` | `ShortType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Integer` | `IntType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Long` | `LongType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.math.BigDecimal` | `DecimalType` |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `java.lang.Float` | `FloatType` |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `java.lang.Double` | `DoubleType` |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `java.lang.String` | `StringType` |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `java.lang.Boolean` | `BooleanType` |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` | `DateType` |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` | `TimestampType` |
| [BINARY](../../../sql-reference/data-types-text.md) | `byte[]` | `BinaryType` |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark_java.types.Variant](../reference/java/com/snowflake/snowpark_java/types/Variant.md) | `VariantType` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `String[]` | `ArrayType(StringType)` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Variant[]` | `ArrayType(VariantType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `java.util.Map<String, String>` | `MapType(StringType, StringType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `java.util.Map<String, Variant>` | `MapType(StringType, VariantType)` |

For example, if your UDTF returns a row with a single integer field:

> ```java
> public StructType outputSchema() {
>   return StructType.create(new StructField("C1", DataTypes.IntegerType));
> }
> ```

#### Implementing the process() Method

In your UDTF class, implement the `process()` method:

> ```java
> Stream<Row> process(A0 arg0, ... A<n> arg<n>)
> ```

where `n` is the number of arguments passed to your UDTF.

The number of arguments in the signature corresponds to the interface that you implemented. For example, if your UDTF passes in 2
input arguments and you are implementing the `JavaUDTF2` interface, the `process()` method has this signature:

> ```java
> Stream<Row> process(A0 arg0, A1 arg1)
> ```

This method is invoked once for each row in the input partition.

##### Choosing the Types of the Arguments

For the type of each argument in the `process()` method, use the Java type that corresponds to the Snowflake data type of
the argument passed to the UDTF.

Snowflake supports the following data types for the arguments for a UDTF:

| SQL Data Type | Java Data Type | Notes |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | The following types are supported:   * `java.lang.Short` * `java.lang.Integer` * `java.lang.Long` * `java.math.BigDecimal` |  |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `java.lang.Float` |  |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `java.lang.Double` |  |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `java.lang.String` |  |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `java.lang.Boolean` |  |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` |  |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` |  |
| [BINARY](../../../sql-reference/data-types-text.md) | `byte[]` |  |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark_java.types.Variant](../reference/java/com/snowflake/snowpark_java/types/Variant.md) |  |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `String[]` or `Variant[]` |  |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map<String, String>` or `Map<String, Variant>` |  |

> **Note:**
>
> If you pass in `java.util.Map` arguments, you must implement the `inputSchema` method to describe the types of those
> arguments. See Implementing the inputSchema() Method.

##### Returning Rows

In the `process()` method, build and return a [java.util.stream.Stream](https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/util/stream/Stream.html) of `Row` objects that contain the data to be
returned by the UDTF for the given input values. The fields in the row must use the types that you specified in the
`outputSchema` method. (See Implementing the outputSchema() Method.)

For example, if your UDTF generates rows, construct and return an `Iterable` of `Row` objects for the generated rows:

> ```java
> import java.util.stream.Stream;
> ...
>
> public Stream<Row> process(Integer start, Integer count) {
>   Stream.Builder<Row> builder = Stream.builder();
>   for (int i = start; i < start + count ; i++) {
>     builder.add(Row.create(i));
>   }
>   return builder.build();
> }
> ```

#### Implementing the inputSchema() Method

If the process() method passes in a `java.util.Map` argument, you
must implement the `inputSchema()` method to describe the types of the input arguments.

> **Note:**
>
> If the `process()` method does not pass in `Map` arguments, you do not need to implement the `inputSchema()`
> method.

In this method, construct and return a [StructType](../reference/java/com/snowflake/snowpark_java/types/StructType.md) object that contains [StructField](../reference/java/com/snowflake/snowpark_java/types/StructField.md) objects representing the Snowflake data
type of each argument passed in to the `process()` method. Snowflake supports the following type objects for the input
schema for a UDTF:

| SQL Data Type | Java Type | `com.snowflake.snowpark_java.types` Type |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Short` | `ShortType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Integer` | `IntType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.lang.Long` | `LongType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.math.BigDecimal` | `DecimalType` |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `java.lang.Float` | `FloatType` |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `java.lang.Double` | `DoubleType` |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `java.lang.String` | `StringType` |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `java.lang.Boolean` | `BooleanType` |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` | `DateType` |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` | `TimestampType` |
| [BINARY](../../../sql-reference/data-types-text.md) | `byte[]` | `BinaryType` |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark_java.types.Variant](../reference/java/com/snowflake/snowpark_java/types/Variant.md) | `VariantType` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `String[]` | `ArrayType(StringType)` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Variant[]` | `ArrayType(VariantType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `java.util.Map<String, String>` | `MapType(StringType, StringType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `java.util.Map<String, Variant>` | `MapType(StringType, VariantType)` |

For example, suppose that your `process()` method passes in a `Map<String, String>` argument and a
`Map<String, Variant>` argument:

```java
import java.util.Map;
import com.snowflake.snowpark_java.*;
import com.snowflake.snowpark_java.types.*;
...

public Stream<Row> process(Map<String, String> stringMap, Map<String, Variant> varMap) {
  ...
}
```

You must implement the `inputSchema()` method to return a `StructType` object that describes the types of these input
arguments:

```java
import java.util.Map;
import com.snowflake.snowpark_java.types.*;
...

public StructType inputSchema() {
  return StructType.create(
      new StructField(
          "string_map",
          DataTypes.createMapType(DataTypes.StringType, DataTypes.StringType)),
      new StructField(
          "variant_map",
          DataTypes.createMapType(DataTypes.StringType, DataTypes.VariantType)));
}
```

#### Implementing the endPartition() Method

Implement the `endPartition` method and add code that should be executed after all rows in the input partition have been
passed to the `process` method. The `endPartition` method is invoked once for each input partition.

> ```java
> public Stream<Row> endPartition()
> ```

You can use this method if you need to perform any work after all of the rows in the partition have been processed. For example,
you can:

* Return rows based on state information that you capture in each `process` method call.
* Return rows that are not tied to a specific input row.
* Return rows that summarize the output rows that have been generated by the `process` method.

The fields in the rows that you return must match the types that you specified in the `outputSchema` method. (See
Implementing the outputSchema() Method.)

If you do not need to return additional rows at the end of each partition, return an empty `Stream`. For example:

> ```java
> public Stream<Row> endPartition() {
>   return Stream.empty();
> }
> ```

> **Note:**
>
> While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
> processing to time out (such as when `endPartition` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
> timeout threshold adjusted for specific usage scenarios.

#### Example of a UDTF Class

The following is an example of a UDTF class that generates a range of rows.

* Because the UDTF passes in 2 arguments, the class implements `JavaUDTF2`.
* The arguments `start` and `count` specify the starting number for the row and the number of rows to generate.

```java
import java.util.stream.Stream;
import com.snowflake.snowpark_java.types.*;
import com.snowflake.snowpark_java.udtf.*;

class MyRangeUdtf implements JavaUDTF2<Integer, Integer> {
  public StructType outputSchema() {
    return StructType.create(new StructField("C1", DataTypes.IntegerType));
  }

  // Because the process() method in this example does not pass in Map arguments,
  // implementing the inputSchema() method is optional.
  public StructType inputSchema() {
    return StructType.create(
            new StructField("start_value", DataTypes.IntegerType),
            new StructField("value_count", DataTypes.IntegerType));
  }

  public Stream<Row> endPartition() {
    return Stream.empty();
  }

  public Stream<Row> process(Integer start, Integer count) {
    Stream.Builder<Row> builder = Stream.builder();
    for (int i = start; i < start + count ; i++) {
      builder.add(Row.create(i));
    }
    return builder.build();
  }
}
```

### Registering the UDTF

Next, create an instance of the new class, and register the class by calling one of the [UDTFRegistration](../reference/java/com/snowflake/snowpark_java/UDFRegistration.md) methods. You can
register a temporary or
permanent UDTF.

#### Registering a Temporary UDTF

To register a temporary UDTF, call `UDTFRegistration.registerTemporary`:

* If you do not need to call the UDTF by name, you can register an anonymous UDTF by passing in an instance of the class:

  > ```java
  > // Register the MyRangeUdtf class that was defined in the previous example.
  > TableFunction tableFunction = session.udtf().registerTemporary(new MyRangeUdtf());
  > // Use the returned TableFunction object to call the UDTF.
  > session.tableFunction(tableFunction, Functions.lit(10), Functions.lit(5)).show();
  > ```
* If you need to call the UDTF by name, pass in a name of the UDTF as well:

  > ```java
  > // Register the MyRangeUdtf class that was defined in the previous example.
  > TableFunction tableFunction = session.udtf().registerTemporary("myUdtf", new MyRangeUdtf());
  > // Call the UDTF by name.
  > session.tableFunction(new TableFunction("myUdtf"), Functions.lit(10), Functions.lit(5)).show();
  > ```

#### Registering a Permanent UDTF

If you need to use the UDTF in subsequent sessions, call `UDTFRegistration.registerPermanent` to register a permanent UDTF.

When registering a permanent UDTF, you must specify a stage where the registration method will upload the JAR files for the UDTF
and its dependencies. For example:

> ```java
> // Register the MyRangeUdtf class that was defined in the previous example.
> TableFunction tableFunction = session.udtf().registerPermanent("myUdtf", new MyRangeUdtf(), "@myStage");
> // Call the UDTF by name.
> session.tableFunction(new TableFunction("myUdtf"), Functions.lit(10), Functions.lit(5)).show();
> ```

### Calling a UDTF

After registering the UDTF, you can call the UDTF by passing the returned `TableFunction` object to the
`tableFunction` method of the `Session` object:

> ```java
> // Register the MyRangeUdtf class that was defined in the previous example.
> TableFunction tableFunction = session.udtf().registerTemporary(new MyRangeUdtf());
> // Use the returned TableFunction object to call the UDTF.
> session.tableFunction(tableFunction, Functions.lit(10), Functions.lit(5)).show();
> ```

To call a UDTF by name, construct a `TableFunction` object with that name, and pass that to the `tableFunction`
method:

> ```java
> // Register the MyRangeUdtf class that was defined in the previous example.
> TableFunction tableFunction = session.udtf().registerTemporary("myUdtf", new MyRangeUdtf());
> // Call the UDTF by name.
> session.tableFunction(new TableFunction("myUdtf"), Functions.lit(10), Functions.lit(5)).show();
> ```

You can also call a UDTF through a SELECT statement directly:

> ```java
> session.sql("select * from table(myUdtf(10, 5))");
> ```

---
title: Creating User-Defined Functions (UDFs) for DataFrames in Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-udfs.md
section: Snowpark
---

# Creating User-Defined Functions (UDFs) for DataFrames in Python

The Snowpark API provides methods that you can use to create a user-defined function from a lambda or function in Python.
This topic explains how to create these types of functions.

## Introduction

With Snowpark, you can create user-defined functions (UDFs) for your custom lambdas and functions, and you can call these
UDFs to process the data in your DataFrame.

When you use the Snowpark API to create a UDF, the Snowpark library uploads the code for your function to an internal stage.
When you call the UDF, the Snowpark library executes your function on the server, where the data is. As a result, the data
doesn’t need to be transferred to the client in order for the function to process the data.

In your custom code, you can also import modules from Python files or third-party packages.

You can create a UDF for your custom code in one of two ways:

* You can create an anonymous UDF and assign the function to a variable. As long as
  this variable is in scope, you can use this variable to call the UDF.
* You can create a named UDF and call the UDF by name. You can use this if, for example,
  you need to call a UDF by name or use the UDF in a subsequent session.

The next sections explain how to create these UDFs using a local development environment or using a
[Python worksheet](python-worksheets.md).

Note that if you defined a UDF by running the `CREATE FUNCTION` command, you can call that UDF in Snowpark. For details, see
[Calling User-Defined Functions (UDFs)](calling-functions.md).

> **Note:**
>
> Vectorized Python UDFs let you define Python functions that receive batches of input rows as Pandas DataFrames.
> This results in much better performance with machine learning inference scenarios.
> For more information, see Using Vectorized UDFs.

> **Note:**
>
> If you are working with a Python worksheet, use these examples within the handler function:
>
> ```python
> import snowflake.snowpark as snowpark
> from snowflake.snowpark.functions import col
>
> def main(session: snowpark.Session):
>   df_table = session.table("sample_product_data")
> ```
>
> If the examples return something other than a DataFrame, such as a `list` of `Row` objects,
> [change the return type](python-worksheets.md) to match the return type of the example.
>
> After you run a code example, use the Results tab to view any output returned.
> Refer to [Running Python Worksheets](python-worksheets.md) for more details.

## Specifying Dependencies for a UDF

To define a UDF using the Snowpark API, you must import the files that contain any modules that your UDF depends on, such as Python files,
zip files, resource files, etc.

* To do this using Python worksheets, refer to [Add a Python File from a Stage to a Worksheet](python-worksheets.md).
* To do this using your local development environment, you must call `Session.add_import()` in your code.

You can also specify a directory and the Snowpark library automatically compresses the directory and uploads it as a zip file.
(For details on reading resources from a UDF, see Reading Files with a UDF.)

When you call `Session.add_import()`, the Snowpark library uploads the specified files to an internal stage and imports the
files when executing your UDF.

The following example demonstrates how to add a zip file in a stage as a dependency to your code:

```python
# Add a zip file that you uploaded to a stage.
session.add_import("@my_stage/<path>/my_library.zip")
```

The following examples demonstrate how to add a Python file from your local machine:

```python
# Import a Python file from your local machine.
session.add_import("/<path>/my_module.py")

# Import a Python file from your local machine and specify a relative Python import path.
session.add_import("/<path>/my_module.py", import_path="my_dir.my_module")
```

The following examples demonstrate how to add other types of dependencies:

```python
# Add a directory of resource files.
session.add_import("/<path>/my-resource-dir/")

# Add a resource file.
session.add_import("/<path>/my-resource.xml")
```

> **Note:**
>
> The Python Snowpark library is not uploaded automatically.

You do not need to specify the following dependencies:

* **Your Python built-in libraries.**

  These libraries are already available in the runtime environment on the server where your UDFs are executed.

## Using Artifact Repository packages in a UDF

For more information, see [Artifact Repository overview](../../udf/python/udf-python-packages.md).

## Using Third-Party Packages from Anaconda in a UDF

You can use third-party packages from the Snowflake Anaconda channel in a UDF.

* If you create a Python UDF in a Python worksheet, the Anaconda packages are already available to your worksheet.
  Refer to [Add a Python File from a Stage to a Worksheet](python-worksheets.md).
* If you create a Python UDF in your local development environment, you can specify which Anaconda packages to install.

When queries that call Python UDFs are executed inside a Snowflake warehouse, Anaconda packages
are installed seamlessly and cached on the virtual warehouse on your behalf.

For more information about best practices, how to view the available packages, and how to
set up a local development environment, see [Using third-party packages](../../udf/python/udf-python-packages.md).

If you write a Python UDF in your local development environment, use `session.add_packages` to add packages at the session level.

This code example shows how to import packages and return their versions.

```python
import numpy as np
import pandas as pd
import xgboost as xgb
from snowflake.snowpark.functions import udf

session.add_packages("numpy", "pandas", "xgboost==1.5.0")

@udf
def compute() -> list:
  return [np.__version__, pd.__version__, xgb.__version__]
```

You can also use `session.add_requirements` to specify packages with a
[requirements file](https://pip.pypa.io/en/stable/user_guide/#requirements-files).

```python
session.add_requirements("mydir/requirements.txt")
```

You can add the UDF-level packages to overwrite the session-level packages you might have added previously.

```python
import numpy as np
import pandas as pd
import xgboost as xgb
from snowflake.snowpark.functions import udf

@udf(packages=["numpy", "pandas", "xgboost==1.5.0"])
def compute() -> list:
  return [np.__version__, pd.__version__, xgb.__version__]
```

> **Important:**
>
> If you don’t specify a package version, Snowflake uses the latest version when resolving dependencies. When you deploy the UDF to
> production, you might want to ensure that your code always uses the same dependency versions. You can do that for both permanent
> and temporary UDFs.
>
> * When you create a permanent UDF, the UDF is created and registered only once. This resolves dependencies once and the selected version
>   is used for production workloads. When the UDF executes, it always uses the same dependency versions.
> * When you create a temporary UDF, specify dependency versions as part of the version spec. That way, when the UDF is registered, package
>   resolution uses the specified version. If you don’t specify the version, the dependency might be updated when a new version becomes
>   available.

## Creating an Anonymous UDF

To create an anonymous UDF, you can either:

* Call the `udf` function in the `snowflake.snowpark.functions` module, passing in the definition of the anonymous
  function.
* Call the `register` method in the `UDFRegistration` class, passing in the definition of the anonymous
  function.

Here is an example of an anonymous UDF:

```python
from snowflake.snowpark.types import IntegerType
from snowflake.snowpark.functions import udf

add_one = udf(lambda x: x+1, return_type=IntegerType(), input_types=[IntegerType()])
```

> **Note:**
>
> When writing code that might execute in multiple sessions, use the `register` method to register
> UDFs, rather than using the `udf` function. This can prevent errors in which the default Snowflake `Session` object
> cannot be found.

## Creating and Registering a Named UDF

If you want to call a UDF by name (e.g. by using the `call_udf` function in the `functions` module), you can create and register a named UDF. To do this, use one of the following:

* The `register` method, in the `UDFRegistration` class, with the `name` argument.
* The `udf` function, in the `snowflake.snowpark.functions` module, with the `name` argument.

To access an attribute or method of the `UDFRegistration` class, call the `udf` property of the `Session` class.

Calling `register` or `udf` will create a temporary UDF that you can use in the current session.

To create a permanent UDF, call the `register` method or the `udf` function and set
the `is_permanent` argument to `True`. When you create a permanent UDF, you must also set the `stage_location`
argument to the stage location where the Python file for the UDF and its dependencies are uploaded.

Here is an example of how to register a named temporary UDF:

```python
from snowflake.snowpark.types import IntegerType
from snowflake.snowpark.functions import udf

add_one = udf(lambda x: x+1, return_type=IntegerType(), input_types=[IntegerType()], name="my_udf", replace=True)
```

Here is an example of how to register a named permanent UDF by setting the `is_permanent` argument to `True`:

```python
@udf(name="minus_one", is_permanent=True, stage_location="@my_stage", replace=True)
def minus_one(x: int) -> int:
  return x-1
```

Here is an example of these UDFs being called:

```python
df = session.create_dataframe([[1, 2], [3, 4]]).to_df("a", "b")
df.select(add_one("a"), minus_one("b")).collect()
```

```output
[Row(MY_UDF("A")=2, MINUS_ONE("B")=1), Row(MY_UDF("A")=4, MINUS_ONE("B")=3)]
```

You can also call the UDF using SQL:

```python
session.sql("select minus_one(1)").collect()
```

```output
[Row(MINUS_ONE(1)=0)]
```

## Creating a UDF from a Python source file

If you create your UDF in your local development environment, you can define your UDF handler in a Python file and then use the
`register_from_file` method in the `UDFRegistration` class to create a UDF.

> **Note:**
>
> You cannot use this method in a Python worksheet.

Here are examples of using `register_from_file`.

Suppose you have a Python file `test_udf_file.py` that contains:

```python
def mod5(x: int) -> int:
  return x % 5
```

Then you can create a UDF from this function of file `test_udf_file.py`.

```python
# mod5() in that file has type hints
mod5_udf = session.udf.register_from_file(
  file_path="tests/resources/test_udf_dir/test_udf_file.py",
  func_name="mod5",
)
session.range(1, 8, 2).select(mod5_udf("id")).to_df("col1").collect()
```

```output
[Row(COL1=1), Row(COL1=3), Row(COL1=0), Row(COL1=2)]
```

You can also upload the file to a stage location, then use it to create the UDF.

```python
from snowflake.snowpark.types import IntegerType
# suppose you have uploaded test_udf_file.py to stage location @mystage.
mod5_udf = session.udf.register_from_file(
  file_path="@mystage/test_udf_file.py",
  func_name="mod5",
  return_type=IntegerType(),
  input_types=[IntegerType()],
)
session.range(1, 8, 2).select(mod5_udf("id")).to_df("col1").collect()
```

```output
[Row(COL1=1), Row(COL1=3), Row(COL1=0), Row(COL1=2)]
```

## Reading Files with a UDF

To read the contents of a file, your Python code can:

* Read a statically-specified file by importing a file and then reading it from the UDF’s home directory.
* Read a dynamically-specified file with SnowflakeFile. You might do this if you need to access a file during computation.

### Reading Statically-Specified Files

The Snowpark library uploads and executes UDFs on the server. If your UDF needs to read data from a file,
you must ensure that the file is uploaded with the UDF.

> **Note:**
>
> If you write your UDF in a Python worksheet, the UDF can only read files from a stage.

To set up a UDF to read a file:

1. Specify that the file is a dependency, which uploads the file to the server. For more information, see
   Specifying Dependencies for a UDF.

   For example:

   ```python
   # Import a file from your local machine as a dependency.
   session.add_import("/<path>/my_file.txt")

   # Or import a file that you uploaded to a stage as a dependency.
   session.add_import("@my_stage/<path>/my_file.txt")
   ```
2. In the UDF, read the file. In the following example, the file will only be read once during UDF creation, and will not
   be read again during UDF execution. This is achieved with a third-party library
   [cachetools](https://pypi.org/project/cachetools/).

   ```python
   import sys
   import os
   import cachetools
   from snowflake.snowpark.types import StringType
   @cachetools.cached(cache={})
   def read_file(filename):
      import_dir = sys._xoptions.get("snowflake_import_directory")
         if import_dir:
            with open(os.path.join(import_dir, filename), "r") as f:
               return f.read()

      # create a temporary text file for test
   temp_file_name = "/tmp/temp.txt"
   with open(temp_file_name, "w") as t:
      _ = t.write("snowpark")
   session.add_import(temp_file_name)
   session.add_packages("cachetools")

   def add_suffix(s):
      return f"{read_file(os.path.basename(temp_file_name))}-{s}"

   concat_file_content_with_str_udf = session.udf.register(
         add_suffix,
         return_type=StringType(),
         input_types=[StringType()]
      )

   df = session.create_dataframe(["snowflake", "python"], schema=["a"])
   df.select(concat_file_content_with_str_udf("a")).to_df("col1").collect()
   ```

   ```output
   [Row(COL1='snowpark-snowflake'), Row(COL1='snowpark-python')]
   ```

   ```python
   os.remove(temp_file_name)
   session.clear_imports()
   ```

### Reading Dynamically-Specified Files with `SnowflakeFile`

You can read a file from a stage using the `SnowflakeFile` class in the Snowpark `snowflake.snowpark.files` module.
The `SnowflakeFile` class provides dynamic file access, which lets you stream files of any size. Dynamic file access is also useful when you want to iterate over multiple files. For example, see [Processing multiple files](../../udf/python/udf-python-examples.md).

For more information about and examples of reading files using `SnowflakeFile`, see [Reading a File Using the SnowflakeFile Class in a Python UDF Handler](../../udf/python/udf-python-examples.md).

The following example registers a temporary UDF that reads a text file from a stage using `SnowflakeFile` and returns the file length.

Register the UDF:

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark.functions import udf
from snowflake.snowpark.files import SnowflakeFile
from snowflake.snowpark.types import StringType, IntegerType

@udf(name="get_file_length", replace=True, input_types=[StringType()], return_type=IntegerType(), packages=['snowflake-snowpark-python'])
def get_file_length(file_path):
  with SnowflakeFile.open(file_path) as f:
    s = f.read()
  return len(s);
```

Call the UDF:

```python
session.sql("select get_file_length(build_scoped_file_url(@my_stage, 'example-file.txt'));")
```

## Writing files from Snowpark Python UDFs and UDTFs

With Snowpark Python, you can write files to stages with user-defined functions (UDFs), vectorized UDFs, user-defined table functions
(UDTFs), and vectorized UDTFs. In the function handler, you use the
[SnowflakeFile API](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.files.SnowflakeFile#snowflake.snowpark.files.SnowflakeFile)
to open and write files. When you return the file from the function, the file is written alongside the query results.

A simple UDF to write a file might look like this:

```sqlexample-python
CREATE OR REPLACE FUNCTION write_file()
RETURNS STRING
LANGUAGE PYTHON
VOLATILE
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'write_file'
AS
$$
from snowflake.snowpark.files import SnowflakeFile

def write_file():
  file = SnowflakeFile.open_new_result("w") # Open a new result file
  file.write("Hello world")                 # Write data
  return file               # File must be returned
$$;
```

Executing this UDF will then give you a scoped URL referencing the result file.

### Accessing the result files

A file handler is returned as a scoped URL to the query calling the UDF. You can use this particular scoped URL to access files from within
Snowflake (through another UDF or the COPY FILES command), but not from outside of Snowflake as a pre-signed URL. The scoped URL is valid
for 24 hours.

After a file is returned by a UDF, you can access it using any of the following storage tools, depending on your use case:

* [COPY FILES](../../../sql-reference/sql/copy-files.md): Copy the file to another stage location. After the file is copied, you can use it like a typical
  staged file, such as by using the following tools:

  + [Directory tables](../../../user-guide/data-load-dirtables.md): Query a list of files on a stage using a WHERE clause to filter if necessary.
  + [GET_PRESIGNED_URL](../../../sql-reference/functions/get_presigned_url.md): Generate a URL to the @stage/file.
  + [External stages](../../../user-guide/data-load-overview.md): Access the file outside of Snowflake through cloud provider APIs.
* [UDF](../../udf/udf-overview.md): Read the file in another query.

For more information, see [COPY FILES](../../../sql-reference/sql/copy-files.md).

### Limitations

* This feature is not available for Java or Scala.
* Stored procedures also support file writes, but cannot be easily chained with a COPY FILES command. Therefore, for file writes using
  stored procedures, we recommend using the file staging [PUT](../../../sql-reference/sql/put.md) command.

### Examples

This section includes code examples that show how to write files to stages for different use cases.

> * File transformation
> * Create a PDF from a partition of table data and copy it to a final location
> * Split files and unload them into multiple tables

#### File transformation

The following is a UDF handler example that transforms a file. You can modify this example to do different types of file
transformation, such as:

* Convert from one file format to another format.
* Re-size an image.
* Transform files into a “golden state” in a time-stamped format folder in the same or different bucket.

```sqlexample-python
CREATE OR REPLACE FUNCTION convert_to_foo(filename string)
RETURNS STRING
LANGUAGE PYTHON
VOLATILE
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'convert_to_foo'
AS
$$
from snowflake.snowpark.files import SnowflakeFile

def convert_to_foo(filename):
  input_file = SnowflakeFile.open(filename, "r")
  converted_file = SnowflakeFile.open_new_result("w")

  # Foo-type is just adding foo at the end of every line
  for line in input_file.readlines():
    converted_file.write(line[:-1] + 'foo' + '\n')
  return converted_file
$$;
```

You can call this UDF in a query and then access the `converted_file` result file written by the UDF.

The following SQL examples show what you can do with result files returned by UDFs, such as copying them to a stage or consuming them in a
subsequent query or another UDF. These basic SQL patterns are applicable to any UDF file write examples included in this topic. For example,
you can use the pre-signed URL query for any of the following UDF examples by using it in place of another SQL statement.

##### Example 1: Convert a single file and copy it to a final stage

```sqlexample
COPY FILES INTO @output_stage FROM
  (SELECT convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, 'in.txt')), 'out.foo.txt');
```

##### Example 2: Convert a table of files and copy them to a final stage

```sqlexample
CREATE TABLE files_to_convert(file string);
-- Populate files_to_convert with input files:
INSERT INTO files_to_convert VALUES ('file1.txt');
INSERT INTO files_to_convert VALUES ('file2.txt');

COPY FILES INTO @output_stage FROM
  (SELECT convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, file)),
      REPLACE(file, '.txt', '.foo.txt') FROM files_to_convert);
```

##### Example 3: Convert all files in a stage and copy them to a final stage

```sqlexample
COPY FILES INTO @output_stage FROM
  (SELECT convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, RELATIVE_PATH)),
      REPLACE(RELATIVE_PATH, 'format1', 'format2') FROM DIRECTORY(@input_stage));
```

##### Example 4: Convert all files from a table and read them without copying

```sqlexample-python
-- A basic UDF to read a file:
CREATE OR REPLACE FUNCTION read_udf(filename string)
RETURNS STRING
LANGUAGE PYTHON
VOLATILE
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'read'
AS
'
from snowflake.snowpark.files import SnowflakeFile

def read(filename):
  return SnowflakeFile.open(filename, "r").read()
';
```

```sqlexample
-- Create files_to_convert as in Example 2.

SELECT convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, file)) as new_file
  FROM files_to_convert;
-- The following query must be run within 24 hours from the prior one
SELECT read_udf(new_file) FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

##### Example 5: Convert all files from a table and read them immediately via a UDF

```sqlexample
-- Set up files_to_convert as in Example 2.
-- Set up read_udf as in Example 4.

SELECT read_udf(
    convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, file))) FROM files_to_convert;
```

##### Example 6: Read using pre-signed URLs

This example is only for stages with server-side encryption. Internal stages have client-side encryption by default.

```sqlexample
COPY FILES INTO @output_stage FROM
  (SELECT convert_to_foo(BUILD_SCOPED_FILE_URL(@input_stage, file)) FROM files_to_convert);

-- Refresh the directory to get new files in output_stage.
ALTER STAGE output_stage REFRESH;
```

#### Create a PDF from a partition of table data and copy it to a final location

The following UDF handler example partitions input data and writes a PDF report for each partition of the data. This example partitions
reports by the `location` string.

You can also modify this example to write other types of files such as ML models and other custom formats for each partition.

```sqlexample-python
-- Create a stage that includes the font (for PDF creation)

CREATE OR REPLACE STAGE fonts
URL = 's3://sfquickstarts/misc/';

-- UDF to write the data
CREATE OR REPLACE FUNCTION create_report_pdf(data string)
RETURNS TABLE (file string)
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
HANDLER='CreateReport'
PACKAGES = ('snowflake-snowpark-python', 'fpdf')
IMPORTS  = ('@fonts/DejaVuSans.ttf')
AS $$
from snowflake.snowpark.files import SnowflakeFile
from fpdf import FPDF
import shutil
import sys
import uuid
import_dir = sys._xoptions["snowflake_import_directory"]

class CreateReport:
  def __init__(self):
      self.pdf = None

  def process(self, data):
      if self.pdf == None:
        # PDF library edits this file, make sure it's unique.
        font_file = f'/tmp/DejaVuSans-{uuid.uuid4()}.ttf'
        shutil.copy(f'{import_dir}/DejaVuSans.ttf', font_file)
        self.pdf = FPDF()
        self.pdf.add_page()
        self.pdf.add_font('DejaVu', '', font_file, uni=True)
        self.pdf.set_font('DejaVu', '', 14)
      self.pdf.write(8, data)
      self.pdf.ln(8)

  def end_partition(self):
      f = SnowflakeFile.open_new_result("wb")
      f.write(self.pdf.output(dest='S').encode('latin-1'))
      yield f,
$$;
```

The following SQL example first sets up the `reportData` table with fictitious data and creates the `output_stage` stage. Then it calls
the `create_report_pdf` UDF to create a PDF file using data that it queries from the `reportData` table. In the same SQL statement, the
COPY FILES command copies the result file from the UDF to `output_stage`.

> **Note:**
>
> We use a server-side-encrypted (SSE) output stage because the pre-signed URL
> to a file on an SSE stage will download an unencrypted file. In general, we
> recommend the default stage encryption on stages as the file is client-side
> encrypted and it’s more secure. Files on normal stages can still be read
> through UDFs or GET/PUT - just not via pre-signed URLs. Ensure you understand
> the security implications before using an SSE stage in a production environment.

```sqlexample
 -- Fictitious data
 CREATE OR REPLACE TABLE reportData(location string, item string);
 INSERT INTO reportData VALUES ('SanMateo' ,'Item A');
 INSERT INTO reportData VALUES ('SanMateo' ,'Item Z');
 INSERT INTO reportData VALUES ('SanMateo' ,'Item X');
 INSERT INTO reportData VALUES ('Bellevue' ,'Item B');
 INSERT INTO reportData VALUES ('Bellevue' ,'Item Q');

 -- Presigned URLs only work with SSE stages, see note above.
 CREATE OR REPLACE STAGE output_stage ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');

 COPY FILES INTO @output_stage
   FROM (SELECT reports.file, location || '.pdf'
           FROM reportData, TABLE(create_report_pdf(item)
           OVER (partition BY location)) AS reports);

 -- Check the results
LIST @output_stage;
SELECT GET_PRESIGNED_URL(@output_stage, 'SanMateo.pdf');
```

#### Split files and unload them into multiple tables

The following UDF handler example splits a CSV file by line based on the first character of each line. The UDF then unloads the split files
into multiple tables.

```sqlexample-python
CREATE OR REPLACE FUNCTION split_file(path string)
RETURNS TABLE(file string, name string)
LANGUAGE PYTHON
VOLATILE
PACKAGES = ('snowflake-snowpark-python')
RUNTIME_VERSION = 3.12
HANDLER = 'SplitCsvFile'
AS $$
import csv
from snowflake.snowpark.files import SnowflakeFile

class SplitCsvFile:
    def process(self, file):
      toTable1 = SnowflakeFile.open_new_result("w")
      toTable1Csv = csv.writer(toTable1)
      toTable2 = SnowflakeFile.open_new_result("w")
      toTable2Csv = csv.writer(toTable2)
      toTable3 = SnowflakeFile.open_new_result("w")
      toTable3Csv = csv.writer(toTable3)
      with SnowflakeFile.open(file, 'r') as file:
        # File is of the format 1:itemA \n 2:itemB \n [...]
        for line in file.readlines():
          forTable = line[0]
          if (forTable == "1"):
            toTable1Csv.writerow([line[2:-1]])
          if (forTable == "2"):
            toTable2Csv.writerow([line[2:-1]])
          if (forTable == "3"):
            toTable3Csv.writerow([line[2:-1]])
      yield toTable1, 'table1.csv'
      yield toTable2, 'table2.csv'
      yield toTable3, 'table3.csv'
$$;
-- Create a stage with access to an import file.
CREATE OR REPLACE STAGE staged_files url="s3://sfquickstarts/misc/";

-- Add the files to be split into a table - we just add one.
CREATE OR REPLACE TABLE filesToSplit(path string);
INSERT INTO filesToSplit VALUES ( 'items.txt');

-- Create output tables
CREATE OR REPLACE TABLE table1(item string);
CREATE OR REPLACE TABLE table2(item string);
CREATE OR REPLACE TABLE table3(item string);

-- Create output stage
CREATE OR REPLACE stage output_stage;

-- Creates files named path-tableX.csv
COPY FILES INTO @output_stage FROM
  (SELECT file, path || '-' || name FROM filesToSplit, TABLE(split_file(build_scoped_file_url(@staged_files, path))));

-- We use pattern and COPY INTO (not COPY FILES INTO) to upload to a final table.
COPY INTO table1 FROM @output_stage PATTERN = '.*.table1.csv';
COPY INTO table2 FROM @output_stage PATTERN = '.*.table2.csv';
COPY INTO table3 FROM @output_stage PATTERN = '.*.table3.csv';

-- See results
SELECT * from table1;
SELECT * from table2;
SELECT * from table3;
```

## Using Vectorized UDFs

Vectorized Python UDFs let you define Python functions that receive batches of input rows
as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and
return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
or [Series](https://pandas.pydata.org/docs/reference/series.html).
The column in the Snowpark `dataframe` will be vectorized as a Pandas Series inside the UDF.

Here is an example of how to use the batch interface:

```python
from sklearn.linear_model import LinearRegression
model = LinearRegression()
model.fit(X, y)

@udf(packages=['pandas', 'scikit-learn','xgboost'])
def predict(df: PandasDataFrame[float, float, float, float]) -> PandasSeries[float]:
    # The input pandas DataFrame doesn't include column names. Specify the column names explicitly when needed.
    df.columns = ["col1", "col2", "col3", "col4"]
    return model.predict(df)
```

You call vectorized Python UDFs the same way you call other Python UDFs.
For more information, see [Vectorized Python UDFs](../../udf/python/udf-python-batch.md), which explains how to create a vectorized UDF by using a SQL statement.
For example, you can use the `vectorized` decorator when you specify the Python code in the SQL statement.
By using the Snowpark Python API described in this document, you don’t use a SQL statement to create a vectorized UDF.
So you don’t use the `vectorized` decorator.

It is possible to limit the number of rows per batch. For more information, see [Setting a target batch size](../../udf/python/udf-python-batch.md).

For more explanations and examples of using the Snowpark Python API to create vectorized UDFs, refer to
the [UDFs section of the Snowpark API Reference](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udf.UDFRegistration).

---
title: Creating User-Defined Functions (UDFs) for DataFrames in Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-udfs.md
section: Snowpark
---

# Creating User-Defined Functions (UDFs) for DataFrames in Scala

The Snowpark API provides methods that you can use to create a user-defined function from a lambda or function in Scala. This
topic explains how to create these types of functions.

## Introduction

You can call Snowpark APIs to create user-defined functions (UDFs) for your custom lambdas and functions in Scala, and you can
call these UDFs to process the data in your DataFrame.

When you use the Snowpark API to create an UDF, the Snowpark library serializes and uploads the code for your UDF to an internal
stage. When you call the UDF, the Snowpark library executes your function on the server, where the data is located. As a result,
the data doesn’t need to be transferred to the client in order for the function to process the data.

In your custom code, you can also call code that is packaged in JAR files (for example, Java classes for a third-party library).

You can create a UDF for your custom code in one of two ways:

* You can create an anonymous UDF and assign the function to a variable. As long as
  this variable is in scope, you can use this variable to call the UDF.

  > ```scala
  > // Create and register an anonymous UDF (doubleUdf).
  > val doubleUdf = udf((x: Int) => x + x)
  > // Call the anonymous UDF.
  > val dfWithDoubleNum = df.withColumn("doubleNum", doubleUdf(col("num")))
  > ```
* You can create a named UDF and call the UDF by name. You can use this if, for example,
  you need to call a UDF by name or use the UDF in a subsequent session.

  > ```scala
  > // Create and register a permanent named UDF ("doubleUdf").
  > session.udf.registerPermanent("doubleUdf", (x: Int) => x + x, "mystage")
  > // Call the named UDF.
  > val dfWithDoubleNum = df.withColumn("doubleNum", callUDF("doubleUdf", col("num")))
  > ```

The next sections provide important information about creating UDFs in Snowpark:

* Data Types Supported for Arguments and Return Values
* Caveat About Creating UDFs in an Object With the App Trait

The rest of this topic explains how to create UDFs.

> **Note:**
>
> If you defined a UDF by running the `CREATE FUNCTION` command, you can call that UDF in Snowpark.
>
> For details, see [Calling scalar user-defined functions (UDFs)](calling-functions.md).

### Data Types Supported for Arguments and Return Values

In order to create a UDF for a Scala function or lambda, you must use the supported data types listed below for the arguments and
return value of your function or lambda:

| SQL Data Type | Scala Data Type | Notes |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | The following types are supported:   * `Short` or `Option[Short]` * `Int` or `Option[Int]` * `Long` or `Option[Long]` * `java.math.BigDecimal` |  |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `Float` or `Option[Float]` |  |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `Double` or `Option[Double]` |  |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `String` or `java.lang.String` |  |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `Boolean` or `Option[Boolean]` |  |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` |  |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` |  |
| [BINARY](../../../sql-reference/data-types-text.md) | `Array[Byte]` |  |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark.types.Variant](../reference/scala/com/snowflake/snowpark/types/Variant.md) |  |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Array[String]` or `Array[Variant]` |  |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map[String, String]` or `Map[String, Variant]` | Mutable maps of the following types are supported:   * `scala.collection.mutable.Map[String, String]` * `scala.collection.mutable.Map[String, Variant]` |
| [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) | [com.snowflake.snowpark.types.Geography](../reference/scala/com/snowflake/snowpark/types/Geography.md) |  |

### Caveat About Creating UDFs in an Object With the App Trait

Scala provides an [App](https://www.scala-lang.org/api/2.12.11/scala/App.html) trait that you can extend in order to turn your
Scala object into an executable program. The `App` trait provides a `main` method that automatically executes all the
code in the body of your object definition. (The code in your object definition effectively becomes the `main` method.)

One effect of extending the `App` trait is that the fields in your object won’t be initialized until the `main` method
is called. If your object extends `App` and you define a UDF that uses an object field that you initialized earlier, the UDF
definition uploaded to the server won’t include the initialized value of the object field.

For example, suppose that you define and initialize a field named `myConst` in the object and use that field in a UDF:

```scala
object Main extends App {
  ...
  // Initialize a field.
  val myConst = "Prefix "
  // Use the field in a UDF.
  // Because the App trait delays the initialization of the object fields,
  // myConst in the UDF definition resolves to null.
  val myUdf = udf((s : String) =>  myConst + s )
  ...
}
```

When Snowpark serializes and uploads the UDF definition to Snowflake, `myConst` is not initialized and resolves to `null`.
As a result, calling the UDF returns `null` for `myConst`.

To work around this, change your object so that it does not extend the `App` trait, and implement a separate `main`
method for your code:

```scala
object Main {
  ...
  def main(args: Array[String]): Unit = {
    ... // Your code ...
  }
  ...
}
```

## Specifying Dependencies for a UDF

In order to define a UDF through the Snowpark API, you must call `Session.addDependency()` for any files that contain any
classes and resources that your UDF depends on (e.g. JAR files, resource files, etc.). (For details on reading resources from a
UDF, see Reading Files from a UDF.)

The Snowpark library uploads these files to an internal stage and adds the files to the classpath when executing your UDF.

> **Tip:**
>
> If you don’t want the library to upload the file every time you run your application, upload the file to a stage. When calling
> `addDependency`, pass the path to the file in the stage.

If you are using the Scala REPL, you must add the
[directory of classes generated by the REPL](quickstart-scala-repl.md) as a dependency. For example, if you used the
`run.sh` script to start the REPL, call the following method, which adds the `repl_classes` directory created by the
script:

```scala
// If you used the run.sh script to start the Scala REPL, call this to add the REPL classes directory as a dependency.
session.addDependency("<path_to_directory_where_you_ran_run.sh>/repl_classes/")
```

The following example demonstrates how to add a JAR file in a stage as a dependency:

```scala
// Add a JAR file that you uploaded to a stage.
session.addDependency("@my_stage/<path>/my-library.jar")
```

The following examples demonstrate how to add dependencies for JAR files and resource files:

```scala
// Add a JAR file on your local machine.
session.addDependency("/<path>/my-library.jar")

// Add a directory of resource files.
session.addDependency("/<path>/my-resource-dir/")

// Add a resource file.
session.addDependency("/<path>/my-resource.xml")
```

You should not need to specify the following dependencies:

> * **Your Scala runtime libraries.**
>
>   These libraries are already available in the runtime environment on the server where your UDFs are executed.
> * **The Snowpark JAR file.**
>
>   The Snowpark library automatically attempts to detect and upload the Snowpark JAR file to the server.
>
>   To prevent the library from repeatedly uploading the Snowpark JAR file to the server:
>
>   1. Upload the Snowpark JAR file to a stage.
>
>      For example, the following command uploads the Snowpark JAR file to the stage `@mystage`. The PUT command compresses the
>      JAR file and names the resulting file snowpark_2.12-1.18.0.jar.gz.
>
>      ```
>      -- Put the Snowpark JAR file in a stage.
>      PUT file:///<path>/snowpark_2.12-1.18.0.jar @mystage
>      ```
>   2. Call `addDependency` to add the Snowpark JAR file in the stage as a dependency.
>
>      For example, to add the Snowpark JAR file uploaded by the previous command:
>
>      ```
>      // Add the Snowpark JAR file that you uploaded to a stage.
>      session.addDependency("@mystage/snowpark_2.12-1.18.0.jar.gz")
>      ```
>
>      Note that the specified path to the JAR file includes the `.gz` filename extension, which was added by the PUT command.
> * **The JAR file or directory with the currently running application.**
>
>   The Snowpark library automatically attempts to detect and upload these dependencies.
>
>   If the Snowpark library is unable to detect these dependencies automatically, the library reports an error, and you must call
>   `addDependency` to add these dependencies manually.

If it takes too long for the dependencies to be uploaded to the stage, the Snowpark library reports a timeout exception. To
configure the maximum amount of time that the Snowpark library should wait, set the
[snowpark_request_timeout_in_seconds](creating-session.md) property when creating the session.

## Creating an Anonymous UDF

To create an anonymous UDF, you can either:

* Call the `udf` function in the `com.snowflake.snowpark.functions` object, passing in the definition of the anonymous
  function.
* Call the `registerTemporary` method in the `UDFRegistration` class, passing in the definition of the anonymous
  function. Because you are registering an anonymous UDF, you must use the method signatures that don’t have a `name`
  parameter.

> **Note:**
>
> When writing multi-threaded code (e.g. when using parallel collections), use the `registerTemporary` method to register
> UDFs, rather than using the `udf` function. This can prevent errors in which the default Snowflake `Session` object
> cannot be found.

These methods return a `UserDefinedFunction` object, which you can use to call the UDF. (See
[Calling scalar user-defined functions (UDFs)](calling-functions.md).)

The following example creates an anonymous UDF:

```scala
// Create and register an anonymous UDF.
val doubleUdf = udf((x: Int) => x + x)
// Call the anonymous UDF, passing in the "num" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleNum".
val dfWithDoubleNum = df.withColumn("doubleNum", doubleUdf(col("num")))
```

> **Note:**
>
> If you are creating a UDF in a Jupyter notebook, you must set up the notebook to work with Snowpark (see
> [Setting Up a Jupyter Notebook for Snowpark Scala](quickstart-jupyter.md)) and follow the guidelines for writing UDFs in a notebook (see
> Creating UDFs in Jupyter Notebooks).

The following example creates an anonymous UDF that passes in an `Array` of `String` values and appends the string
`x` to each value:

```scala
// Create and register an anonymous UDF.
val appendUdf = udf((x: Array[String]) => x.map(a => a + " x"))
// Call the anonymous UDF, passing in the "a" column, which holds an ARRAY.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "appended".
val dfWithXAppended = df.withColumn("appended", appendUdf(col("a")))
```

The following example creates an anonymous UDF that uses a custom class (`LanguageDetector`, which detects the language used
in text). The example calls the anonymous UDF to detect the language in the `text_data` column in a DataFrame and creates a new
DataFrame that includes an additional `lang` column with the language used.

```scala
// Import the udf function from the functions object.
import com.snowflake.snowpark.functions._

// Import the package for your custom code.
// The custom code in this example detects the language of textual data.
import com.mycompany.LanguageDetector

// If the custom code is packaged in a JAR file, add that JAR file as
// a dependency.
session.addDependency("$HOME/language-detector.jar")

// Create a detector
val detector = new LanguageDetector()

// Create an anonymous UDF that takes a string of text and returns the language used in that string.
// Note that this captures the detector object created above.
// Assign the UDF to the langUdf variable, which will be used to call the UDF.
val langUdf = udf((s: String) =>
     Option(detector.detect(s)).getOrElse("UNKNOWN"))

// Create a new DataFrame that contains an additional "lang" column that contains the language
// detected by the UDF.
val dfEmailsWithLangCol =
    dfEmails.withColumn("lang", langUdf(col("text_data")))
```

## Creating and Registering a Named UDF

If you want to call a UDF by name (e.g. by using the `callUDF` function in the `functions` object) or if you need to
use a UDF in subsequent sessions, you can create and register a named UDF. To do this, use one of the following methods in the
`UDFRegistration` class:

* `registerTemporary`, if you just plan to use the UDF in the current session
* `registerPermanent`, if you plan to use the UDF in subsequent sessions

To access an object of the `UDFRegistration` class, call the `udf` method of the `Session` class.

`registerTemporary` creates a temporary UDF that you can use in the current session.

```scala
// Create and register a temporary named UDF.
session.udf.registerTemporary("doubleUdf", (x: Int) => x + x)
// Call the named UDF, passing in the "num" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleNum".
val dfWithDoubleNum = df.withColumn("doubleNum", callUDF("doubleUdf", col("num")))
```

`registerPermanent` creates a UDF that you can use in the current and subsequent sessions. When you call
`registerPermanent`, you must also specify a location in an internal stage location where the JAR files for the UDF and its
dependencies will be uploaded.

> **Note:**
>
> `registerPermanent` does not support external stages.

For example:

```scala
// Create and register a permanent named UDF.
// Specify that the UDF and dependent JAR files should be uploaded to
// the internal stage named mystage.
session.udf.registerPermanent("doubleUdf", (x: Int) => x + x, "mystage")
// Call the named UDF, passing in the "num" column.
// The example uses withColumn to return a DataFrame containing
// the UDF result in a new column named "doubleNum".
val dfWithDoubleNum = df.withColumn("doubleNum", callUDF("doubleUdf", col("num")))
```

> **Note:**
>
> If you are creating a UDF in a Jupyter notebook, you must set up the notebook to work with Snowpark (see
> [Setting Up a Jupyter Notebook for Snowpark Scala](quickstart-jupyter.md)) and follow the guidelines for writing UDFs in a notebook (see
> Creating UDFs in Jupyter Notebooks).

## Creating UDFs in Jupyter Notebooks

If you are creating UDFs in a [Jupyter notebook](https://jupyter-notebook.readthedocs.io/en/stable/notebook.html), you must
follow these additional steps:

* [Setting Up a Jupyter Notebook for Snowpark Scala](quickstart-jupyter.md) (if you haven’t already set up the notebook to work with Snowpark)
* Writing the Implementation of a UDF
* Accessing a Variable Defined in Another Cell

### Writing the Implementation of a UDF

Define the implementation of your function in a class that extends `Serializable`. For example:

```scala
// Class containing a function that implements your UDF.
class MyUDFCode( ... ) extends Serializable {
  val myUserDefinedFunc = (s: String) => {
    ...
  }
}
val myUdf = udf((new MyUDFCode(resourceName)).myUserDefinedFunc)
```

### Accessing a Variable Defined in Another Cell

If you need to use a variable defined in another cell in your UDF, you must pass the variable as an argument to the class
constructor. For example, suppose that in cell 1, you’ve defined a variable:

```none
In [1]:
```

```scala
val prefix = "Hello"
```

and you want to use that variable in a UDF that you’ve defined in cell 2. In the class constructor for the UDF, add an argument
for this variable. Then, when calling the class constructor to create the UDF, pass in the variable defined in cell 1:

```none
In [2]:
```

```scala
// resourceName is the argument for the variable defined in another cell.
class UDFCode(var prefix: String) extends Serializable {
  val prependPrefixFunc = (s: String) => {
    s"$prefix $s"
  }
}

// When constructing UDFCode, pass in the variable (resourceName) that is defined in another cell.
val prependPrefixUdf = udf((new UDFCode(prefix)).prependPrefixFunc)
val myDf = session.sql("select 'Raymond' NAME")
myDf.withColumn("CONCAT", prependPrefixUdf(col("NAME"))).show()
```

## Using Objects That Are Not Serializable

When you create a UDF for a lambda or function, the Snowpark library serializes the lambda closure and sends it to the server for
execution.

If an object captured by the lambda closure is not serializable, the Snowpark library throws an
`java.io.NotSerializableException` exception.

```none
Exception in thread "main" java.io.NotSerializableException: <YourObjectName>
```

If this occurs, you can either:

* Make the object serializable, or
* Declare the object as a `lazy val` or use the `@transient` annotation to avoid serializing the object.

  For example:

  ```scala
  // Declare the detector object as lazy.
  lazy val detector = new LanguageDetector("en")
  // The detector object is not serialized but is instead reconstructed on the server.
  val langUdf = udf((s: String) =>
       Option(detector.detect(s)).getOrElse("UNKNOWN"))
  ```

## Writing Initialization Code for a UDF

If your UDF requires initialization code or context, you can provide this through values captured as part of the UDF closure.

The following example uses a separate class to initialize the context needed by three UDFs.

* The first UDF creates a new instance of the class within the lambda, so the initialization is performed every time the UDF is
  invoked.
* The second UDF captures an instance of the class generated in your client program. The context generated on the client is
  serialized and is used by the UDF. Note that the context class must be serializable for this approach to work.
* The third UDF captures a `lazy val`, so the context is instantiated lazily on the first UDF invocation and is reused in
  subsequent invocations. This approach works even when the context is not serializable. However, there is no guarantee that
  ALL UDF invocations within a dataframe will use the same lazily generated context.

```scala
import com.snowflake.snowpark._
import com.snowflake.snowpark.functions._
import scala.util.Random

// Context needed for a UDF.
class Context {
  val randomInt = Random.nextInt
}

// Serializable context needed for the UDF.
class SerContext extends Serializable {
  val randomInt = Random.nextInt
}

object TestUdf {
  def main(args: Array[String]): Unit = {
    // Create the session.
    val session = Session.builder.configFile("/<path>/profile.properties").create
    import session.implicits._
    session.range(1, 10, 2).show()

    // Create a DataFrame with two columns ("c" and "d").
    val dummy = session.createDataFrame(Seq((1, 1), (2, 2), (3, 3))).toDF("c", "d")
    dummy.show()

    // Initialize the context once per invocation.
    val udfRepeatedInit = udf((i: Int) => (new Context).randomInt)
    dummy.select(udfRepeatedInit('c)).show()

    // Initialize the serializable context only once,
    // regardless of the number of times that the UDF is invoked.
    val sC = new SerContext
    val udfOnceInit = udf((i: Int) => sC.randomInt)
    dummy.select(udfOnceInit('c)).show()

    // Initialize the non-serializable context only once,
    // regardless of the number of times that the UDF is invoked.
    lazy val unserC = new Context
    val udfOnceInitU = udf((i: Int) => unserC.randomInt)
    dummy.select(udfOnceInitU('c)).show()
  }
}
```

## Reading Files from a UDF

As mentioned earlier, the Snowpark library uploads and executes UDFs on the server. If your UDF needs to read data from a file,
you must ensure that the file is uploaded with the UDF.

In addition, if the content of the file remains the same between calls to the UDF, you can write your code to load the file once
during the first call and not on subsequent calls. This can improve the performance of your UDF calls.

To set up a UDF to read a file:

1. Add the file to a JAR file.

   For example, if your UDF needs to use a file in a `data/` subdirectory (`data/hello.txt`), run the `jar` command to
   add this file to a JAR file:

   ```bash
   # Create a new JAR file containing data/hello.txt.
   $ jar cvf <path>/myJar.jar data/hello.txt
   ```
2. Specify that the JAR file is a dependency, which uploads the file to the server and adds the file to the classpath. See
   Specifying Dependencies for a UDF.

   For example:

   ```scala
   // Specify that myJar.jar contains files that your UDF depends on.
   session.addDependency("<path>/myJar.jar")
   ```
3. In the UDF, call `Class.getResourceAsStream` to find the file in the classpath and read the file.

   To avoid adding a dependency on `this`, you can use `classOf[com.snowflake.snowpark.DataFrame]` (rather than
   `getClass`) to get the `Class` object.

   For example, to read the `data/hello.txt` file:

   ```scala
   // Read data/hello.txt from myJar.jar.
   val resourceName = "/data/hello.txt"
   val inputStream = classOf[com.snowflake.snowpark.DataFrame].getResourceAsStream(resourceName)
   ```

   In this example, the resource name starts with a `/`, which indicates that this is the full path of the file in the JAR file.
   (In this case, the location of the file is not relative to the package of the class.)

> > **Note:**
> >
> > If you don’t expect the content of the file to change between UDF calls, read the file into a `lazy val`. This causes
> > the file loading code to execute only on the first call to the UDF and not on subsequent calls.

The following example defines an object (`UDFCode`) with a function that will be used as a UDF (`readFileFunc`). The function
reads the file `data/hello.txt`, which is expected to contain the string `hello,`. The function prepends this string to the
string passed in as an argument.

```scala
// Create a function object that reads a file.
object UDFCode extends Serializable {

  // The code in this block reads the file. To prevent this code from executing each time that the UDF is called,
  // the code is used in the definition of a lazy val. The code for a lazy val is executed only once when the variable is
  // first accessed.
  lazy val prefix = {
    import java.io._
    val resourceName = "/data/hello.txt"
    val inputStream = classOf[com.snowflake.snowpark.DataFrame]
      .getResourceAsStream(resourceName)
    if (inputStream == null) {
      throw new Exception("Can't find file " + resourceName)
    }
    scala.io.Source.fromInputStream(inputStream).mkString
  }

  val readFileFunc = (s: String) => prefix + " : " + s
}
```

The next part of the example registers the function as an anonymous UDF. The example calls the UDF on the `NAME` column in a
DataFrame. The example assumes that the `data/hello.txt` file is packaged in the JAR file `myJar.jar`.

```scala
// Add the JAR file as a dependency.
session.addDependency("<path>/myJar.jar")

// Create a new DataFrame with one column (NAME)
// that contains the name "Raymond".
val myDf = session.sql("select 'Raymond' NAME")

// Register the function that you defined earlier as an anonymous UDF.
val readFileUdf = udf(UDFCode.readFileFunc)

// Call UDF for the values in the NAME column of the DataFrame.
myDf.withColumn("CONCAT", readFileUdf(col("NAME"))).show()
```

## Creating User-Defined Table Functions (UDTFs)

To create and register a UDTF in Snowpark, you must:

* Define a class for the UDTF.
* Create an instance of that class, and register that instance as a UDTF.

The next sections describe these steps in more detail.

For information on calling a UDTF, see Calling a UDTF.

### Defining the UDTF Class

Define a class that inherits from one of the `UDTFn` classes (e.g. `UDTF0`, `UDTF1`, etc.) in the
[com.snowflake.snowpark.udtf package](../reference/scala/com/snowflake/snowpark/udtf/index.md), where `n` specifies the number of input arguments for your UDTF. For example, if
your UDTF passes in 2 input arguments, extend the `UDTF2` class.

In your class, override the following methods:

* outputSchema(), which returns a `types.StructType` object that
  describes the names and types of the fields in the returned rows (the “schema” of the output).
* process(), which is called once for each row in the input partition
  (see the note below).
* endPartition(), which is called once for each partition after all rows
  have been passed to `process()`.

When a UDTF is called, the rows are grouped into partitions before they are passed to the UDTF:

* If the statement that calls the UDTF specifies the PARTITION clause (explicit partitions), that clause determines how the rows
  are partitioned.
* If the statement does not specify the PARTITION clause (implicit partitions), Snowflake determines how best to partition the
  rows.

For an explanation of partitions, see [Table functions and partitions](../../udf/udf-calling-sql.md).

For an example of a UDTF class, see Example of a UDTF Class.

#### Overriding the outputSchema() Method

Override the `outputSchema()` method to define the names and data types of the fields (the “output schema”) of the rows
returned by the `process()` and `endPartition()` methods.

> ```scala
> def outputSchema(): StructType
> ```

In this method, construct and return a [StructType](../reference/scala/com/snowflake/snowpark/types/StructType$.md) object that uses an `Array` of [StructField](../reference/scala/com/snowflake/snowpark/types/StructField$.md) objects to specify the
Snowflake data type of each field in a returned row. Snowflake supports the following type objects for the output schema for a
UDTF:

| SQL Data Type | Scala Type | `com.snowflake.snowpark.types` Type |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `Short` or `Option[Short]` | `ShortType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `Int` or `Option[Int]` | `IntType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `Long` or `Option[Long]` | `LongType` |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | `java.math.BigDecimal` | `DecimalType` |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `Float` or `Option[Float]` | `FloatType` |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `Double` or `Option[Double]` | `DoubleType` |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `String` or `java.lang.String` | `StringType` |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `Boolean` or `Option[Boolean]` | `BooleanType` |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` | `DateType` |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` | `TimestampType` |
| [BINARY](../../../sql-reference/data-types-text.md) | `Array[Byte]` | `BinaryType` |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark.types.Variant](../reference/scala/com/snowflake/snowpark/types/Variant.md) | `VariantType` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Array[String]` | `ArrayType(StringType)` |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Array[Variant]` | `ArrayType(VariantType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map[String, String]` | `MapType(StringType, StringType)` |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map[String, Variant]` | `MapType(StringType, VariantType)` |

For example, if your UDTF returns a row with a single integer field:

> ```scala
> override def outputSchema(): StructType = StructType(StructField("C1", IntegerType))
> ```

#### Overriding the process() Method

In your UDTF class, override the `process()` method:

> ```scala
> def process(arg0: A0, ... arg<n> A<n>): Iterable[Row]
> ```

where `n` is the number of arguments passed to your UDTF.

The number of arguments in the signature corresponds to the class that you extended. For example, if your UDTF passes in 2 input
arguments and you are extending the `UDTF2` class, the `process()` method has this signature:

> ```scala
> def process(arg0: A0, arg1: A1): Iterable[Row]
> ```

This method is invoked once for each row in the input partition.

##### Choosing the Types of the Arguments

For the type of each argument in the `process()` method, use the Scala type that corresponds to the Snowflake data type of
the argument passed to the UDTF.

Snowflake supports the following data types for the arguments for a UDTF:

| SQL Data Type | Scala Data Type | Notes |
| --- | --- | --- |
| [NUMBER](../../../sql-reference/data-types-numeric.md) | The following types are supported:   * `Short` or `Option[Short]` * `Int` or `Option[Int]` * `Long` or `Option[Long]` * `java.math.BigDecimal` |  |
| [FLOAT](../../../sql-reference/data-types-numeric.md) | `Float` or `Option[Float]` |  |
| [DOUBLE](../../../sql-reference/data-types-numeric.md) | `Double` or `Option[Double]` |  |
| [VARCHAR](../../../sql-reference/data-types-text.md) | `String` or `java.lang.String` |  |
| [BOOLEAN](../../../sql-reference/data-types-logical.md) | `Boolean` or `Option[Boolean]` |  |
| [DATE](../../../sql-reference/data-types-datetime.md) | `java.sql.Date` |  |
| [TIMESTAMP](../../../sql-reference/data-types-datetime.md) | `java.sql.Timestamp` |  |
| [BINARY](../../../sql-reference/data-types-text.md) | `Array[Byte]` |  |
| [VARIANT](../../../sql-reference/data-types-semistructured.md) | [com.snowflake.snowpark.types.Variant](../reference/scala/com/snowflake/snowpark/types/Variant.md) |  |
| [ARRAY](../../../sql-reference/data-types-semistructured.md) | `Array[String]` or `Array[Variant]` |  |
| [OBJECT](../../../sql-reference/data-types-semistructured.md) | `Map[String, String]` or `Map[String, Variant]` | Mutable maps of the following types are supported:   * `scala.collection.mutable.Map[String, String]` * `scala.collection.mutable.Map[String, Variant]` |

##### Returning Rows

In the `process()` method, construct and return an `Iterable` of `Row` objects that contain the data to be
returned by the UDTF for the given input values. The fields in the row must use the types that you specified in the
`outputSchema` method. (See Overriding the outputSchema() Method.)

For example, if your UDTF generates rows, construct and return an `Iterable` of `Row` objects for the generated rows:

> ```scala
> override def process(start: Int, count: Int): Iterable[Row] =
>     (start until (start + count)).map(Row(_))
> ```

#### Overriding the endPartition() Method

Override the `endPartition` method and add code that should be executed after all rows in the input partition have been
passed to the `process` method. The `endPartition` method is invoked once for each input partition.

> ```scala
> def endPartition(): Iterable[Row]
> ```

You can use this method if you need to perform any work after all of the rows in the partition have been processed. For example,
you can:

* Return rows based on state information that you capture in each `process` method call.
* Return rows that are not tied to a specific input row.
* Return rows that summarize the output rows that have been generated by the `process` method.

The fields in the rows that you return must match the types that you specified in the `outputSchema` method. (See
Overriding the outputSchema() Method.)

If you do not need to return additional rows at the end of each partition, return an empty `Iterable` of `Row`
objects. for example:

> ```scala
> override def endPartition(): Iterable[Row] = Array.empty[Row]
> ```

> **Note:**
>
> While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
> processing to time out (such as when `endPartition` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
> timeout threshold adjusted for specific usage scenarios.

#### Example of a UDTF Class

The following is an example of a UDTF class that generates a range of rows.

* Because the UDTF passes in 2 arguments, the class extends `UDTF2`.
* The arguments `start` and `count` specify the starting number for the row and the number of rows to generate.

```scala
class MyRangeUdtf extends UDTF2[Int, Int] {
  override def process(start: Int, count: Int): Iterable[Row] =
    (start until (start + count)).map(Row(_))
  override def endPartition(): Iterable[Row] = Array.empty[Row]
  override def outputSchema(): StructType = StructType(StructField("C1", IntegerType))
}
```

### Registering the UDTF

Next, create an instance of the new class, and register the class by calling one of the [UDTFRegistration](../reference/scala/com/snowflake/snowpark/UDTFRegistration.md) methods. You can
register a temporary or
permanent UDTF.

#### Registering a Temporary UDTF

To register a temporary UDTF, call `UDTFRegistration.registerTemporary`:

* If you do not need to call the UDTF by name, you can register an anonymous UDTF by passing in an instance of the class:

  > ```scala
  > // Register the MyRangeUdtf class that was defined in the previous example.
  > val tableFunction = session.udtf.registerTemporary(new MyRangeUdtf())
  > // Use the returned TableFunction object to call the UDTF.
  > session.tableFunction(tableFunction, lit(10), lit(5)).show
  > ```
* If you need to call the UDTF by name, pass in a name of the UDTF as well:

  > ```scala
  > // Register the MyRangeUdtf class that was defined in the previous example.
  > val tableFunction = session.udtf.registerTemporary("myUdtf", new MyRangeUdtf())
  > // Call the UDTF by name.
  > session.tableFunction(TableFunction("myUdtf"), lit(10), lit(5)).show()
  > ```

#### Registering a Permanent UDTF

If you need to use the UDTF in subsequent sessions, call `UDTFRegistration.registerPermanent` to register a permanent UDTF.

When registering a permanent UDTF, you must specify a stage where the registration method will upload the JAR files for the UDTF
and its dependencies. For example:

> ```scala
> // Register the MyRangeUdtf class that was defined in the previous example.
> val tableFunction = session.udtf.registerPermanent("myUdtf", new MyRangeUdtf(), "@mystage")
> // Call the UDTF by name.
> session.tableFunction(TableFunction("myUdtf"), lit(10), lit(5)).show()
> ```

### Calling a UDTF

After registering the UDTF, you can call the UDTF by passing the returned `TableFunction` object to the
`tableFunction` method of the `Session` object:

> ```scala
> // Register the MyRangeUdtf class that was defined in the previous example.
> val tableFunction = session.udtf.registerTemporary(new MyRangeUdtf())
> // Use the returned TableFunction object to call the UDTF.
> session.tableFunction(tableFunction, lit(10), lit(5)).show()
> ```

To call a UDTF by name, construct a `TableFunction` object with that name, and pass that to the `tableFunction`
method:

> ```scala
> // Register the MyRangeUdtf class that was defined in the previous example.
> val tableFunction = session.udtf.registerTemporary("myUdtf", new MyRangeUdtf())
> // Call the UDTF by name.
> session.tableFunction(TableFunction("myUdtf"), lit(10), lit(5)).show()
> ```

You can also call a UDTF through a SELECT statement directly:

> ```scala
> session.sql("select * from table(myUdtf(10, 5))")
> ```

---
title: Creating User-Defined Table Functions (UDTFs) for DataFrames in Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-udtfs.md
section: Snowpark
---

# Creating User-Defined Table Functions (UDTFs) for DataFrames in Python

The Snowpark API provides methods that you can use to create a user-defined table function with a handler written in Python.
This topic explains how to create these types of functions.

## Introduction

You can create a user-defined table function (UDTF) using the Snowpark API.

You do this in a way similar to creating a scalar user-defined function (UDF) with the API, as described in
[Creating User-Defined Functions (UDFs) for DataFrames in Python](creating-udfs.md). Key differences include UDF handler requirements and parameter values required when registering
the UDTF.

To create and register a UDTF with Snowpark, you must:

* Implement a UDTF handler.

  The handler contains the UDTF’s logic. A UDTF handler must implement functions that Snowflake will invoke at runtime when the UDTF is
  called. For more information, see Implementing a UDTF Handler.
* Register the UDTF and its handler in the Snowflake database.

  You can use the Snowpark API to register the UDTF and its handler. Once you’ve registered the UDTF, you can call it from SQL or by using
  the Snowpark API. For more information about registering, see Registering a UDTF.

For information on calling a UDTF, see [Calling User-Defined Table Functions (UDTFs)](calling-functions.md).

## Implementing a UDTF Handler

As described in detail in [Writing a UDTF in Python](../../udf/python/udf-python-tabular-functions.md), a UDTF handler class must implement methods that
Snowflake invokes when the UDTF is called. You can use the class you write as a handler whether you’re registering the UDTF with the
Snowpark API or creating it with SQL using the CREATE FUNCTION statement.

Methods of a handler class are designed to process rows and partitions received by the UDTF.

A UDTF handler class implements the following, which Snowflake invokes at run time:

* An `__init__` method. Optional. Invoked to initialize stateful processing of input partitions.
* A `process` method. Required. Invoked for each input row. The method returns a tabular value as tuples.
* An `end_partition` method. Optional. Invoked to finalize processing of input partitions.

  While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
  processing to time out (such as when `end_partition` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
  timeout threshold adjusted for specific usage scenarios.

For handler details and examples, see [Writing a UDTF in Python](../../udf/python/udf-python-tabular-functions.md).

## Registering a UDTF

Once you’ve implemented a UDTF handler, you can use the Snowpark API to register the UDTF on the Snowflake database. Registering the UDTF
creates the UDTF so that it can be called.

You can register the UDTF as a named or anonymous function, as you can for a scalar UDF. For related information about registering a scalar
UDF, see [Creating an Anonymous UDF](creating-udfs.md) and [Creating and Registering a Named UDF](creating-udfs.md).

When you register a UDTF, you specify parameter values that Snowflake needs to create the UDTF. (Many of these parameters correspond
functionally to clauses of the CREATE FUNCTION statement in SQL. For more information, see [CREATE FUNCTION](../../../sql-reference/sql/create-function.md).)

Most of these parameters are the same as those you specify when you create a scalar UDF (for more information,
see [Creating User-Defined Functions (UDFs) for DataFrames in Python](creating-udfs.md)). The primary differences are due to the fact that a UDTF returns a tabular
value and the fact that its handler is a class, rather than a function. For a complete list of parameters, see the documentation for the
APIs linked below.

To register a UDTF with Snowpark, you use one of the following, specifying parameter values required to create the UDTF in the
database. For information that differentiates these options, see
[UDFRegistration](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udf.UDFRegistration),
which describes similar options for registering a scalar UDF.

* Use the `register` or `udtf` function, pointing to a runtime Python function. You can also use the `udtf` function as
  a decorator on the handler class.

  For reference on these functions, see:

  + [snowflake.snowpark.functions.udtf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.functions.udtf)
  + [snowflake.snowpark.udtf.UDTFRegistration.register](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udtf.UDTFRegistration)
* Use the `register_from_file` function, pointing to a Python file or zip file containing Python source code.

  For the function reference, see [snowflake.snowpark.udtf.UDTFRegistration.register_from_file](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udtf.UDTFRegistration.register_from_file).

### Defining a UDTF’s Input Types and Output Schema

When you register a UDTF, you specify details about the function’s parameters and output value. You do this so that the function itself
declares types that accurately correspond to those for the function’s underlying handler.

For examples, see Examples in this topic and in the
[snowflake.snowpark.udtf.UDTFRegistration](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udtf.UDTFRegistration)
reference.

You specify the following for the UDTF when registering it:

* Types of its input parameters as a value of the registering function’s `input_types` parameter. The `input_types` parameter is
  optional if you provide type hints in the `process` method’s declaration.

  Specify this value as a list of types based on
  [snowflake.snowpark.types.DataType](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/types).
  For example, you might specify `input_types=[StringType(), IntegerType()]`.
* Schema of its tabular output as a value of the registering function’s `output_schema` parameter.

  The `output_schema` value can be one of the following:

  + A list of the names for columns in the UDTF’s return value.

    The list will include column names only, so you must also provide type hints in the `process` method’s declaration.
  + A [StructType](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.StructType)
    that represents the output table’s column names *and* types.

    Code in the following example assigns a schema as a value to an `output` variable, then uses the variable when registering the UDTF.

    ```python
    from snowflake.snowpark.types import StructField, StructType, StringType, IntegerType, FloatType
    from snowflake.snowpark.functions import udtf, table_function
    schema = StructType([
      StructField("symbol", StringType())
      StructField("cost", IntegerType()),
    ])
    @udtf(output_schema=schema,input_types=[StringType(), IntegerType(), FloatType()],stage_location="straut_udf",is_permanent=True,name="test_udtf",replace=True)
    class StockSale:
      def process(self, symbol, quantity, price):
        cost = quantity * price
        yield (symbol, cost)
    ```

## Examples

The following is a brief list of examples. For more examples, see [snowflake.snowpark.udtf.UDTFRegistration](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.udtf.UDTFRegistration).

**Registering a UDTF with the udtf Function**

Register the function.

```python
from snowflake.snowpark.types import IntegerType, StructField, StructType
from snowflake.snowpark.functions import udtf, lit
class GeneratorUDTF:
    def process(self, n):
        for i in range(n):
            yield (i, )
generator_udtf = udtf(GeneratorUDTF, output_schema=StructType([StructField("number", IntegerType())]), input_types=[IntegerType()])
```

Call the function.

```python
session.table_function(generator_udtf(lit(3))).collect()  # Query it by calling it
```

```output
[Row(NUMBER=0), Row(NUMBER=1), Row(NUMBER=2)]
```

```python
session.table_function(generator_udtf.name, lit(3)).collect()  # Query it by using the name
```

```output
[Row(NUMBER=0), Row(NUMBER=1), Row(NUMBER=2)]
```

**Registering a UDTF with the register Function**

Register the function.

```python
from collections import Counter
from typing import Iterable, Tuple
from snowflake.snowpark.functions import lit
class MyWordCount:
      def __init__(self):
          self._total_per_partition = 0

      def process(self, s1: str) -> Iterable[Tuple[str, int]]:
        words = s1.split()
        self._total_per_partition = len(words)
        counter = Counter(words)
        yield from counter.items()

    def end_partition(self):
        yield ("partition_total", self._total_per_partition)
udtf_name = "word_count_udtf"
word_count_udtf = session.udtf.register(
    MyWordCount, ["word", "count"], name=udtf_name, is_permanent=False, replace=True
)
```

Call the function.

```python
# Call it by its name
df1 = session.table_function(udtf_name, lit("w1 w2 w2 w3 w3 w3"))
df1.show()
```

```output
-----------------------------
|"WORD"           |"COUNT"  |
-----------------------------
|w1               |1        |
|w2               |2        |
|w3               |3        |
|partition_total  |6        |
-----------------------------
```

**Registering a UDTF with the register_from_file Function**

Register the function.

```python
from snowflake.snowpark.types import IntegerType, StructField, StructType
from snowflake.snowpark.functions import udtf, lit
_ = session.sql("create or replace temp stage mystage").collect()
_ = session.file.put("tests/resources/test_udtf_dir/test_udtf_file.py", "@mystage", auto_compress=False)
generator_udtf = session.udtf.register_from_file(
    file_path="@mystage/test_udtf_file.py",
    handler_name="GeneratorUDTF",
    output_schema=StructType([StructField("number", IntegerType())]),
    input_types=[IntegerType()]
  )
```

Call the function.

```python
session.table_function(generator_udtf(lit(3))).collect()
```

```output
[Row(NUMBER=0), Row(NUMBER=1), Row(NUMBER=2)]
```

---
title: Install Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-installation.md
section: Snowpark
---

# Install Snowpark Checkpoints

To install the Snowpark Checkpoints library into a Python virtual environment, use [conda](https://anaconda.org/anaconda/conda) or [pip](https://pypi.org/project/pip/).

* Using conda:

  ```bash
  conda install snowpark-checkpoints
  ```
* Using pip:

  ```bash
  pip install snowpark-checkpoints
  ```

You can also install the packages individually:

* **snowpark-checkpoints-collectors:** Use this package to collect information about PySpark DataFrames.

  + Using conda:

    ```bash
    conda install snowpark-checkpoints-collectors
    ```
  + Using pip:

    ```bash
    pip install snowpark-checkpoints-collectors
    ```
* **snowpark-checkpoints-hypothesis:** Use this package to create unit tests for your Snowpark code based on synthetic data automatically generated, following the DataFrame schemas collected from the original PySpark code.

  + Using conda:

    > ```bash
    > conda install snowpark-checkpoints-hypothesis
    > ```
  + Using pip:

    ```bash
    pip install snowpark-checkpoints-hypothesis
    ```
* **snowpark-checkpoints-validators:** Use this package to validate your converted Snowpark DataFrames against the collected schemas or exported DataFrames generated by the collector functionality.

  + Using conda:

    ```bash
    conda install snowpark-checkpoints-validators
    ```
  + Using pip:

    ```bash
    pip install snowpark-checkpoints-validators
    ```
* **snowpark-checkpoints-configuration:** Use this package to allow `snowpark-checkpoints-collectors` and `snowpark-checkpoints-validators` to automatically load the configuration of the checkpoints.

  + Using conda:

    ```bash
    conda install snowpark-checkpoints-configuration
    ```
  + Using pip:

    ```bash
    pip install snowpark-checkpoints-configuration
    ```

---
title: Install the Snowflake extension for Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowflake-extension-checkpoints.md
section: Snowpark
---

# Install the Snowflake extension for Snowpark Checkpoints

## Prerequisites

Before you use the Checkpoints feature in the Snowflake extension, you must [install the Snowpark Checkpoints library](checkpoints-installation.md).

## Install the extension

* To use Snowpark Checkpoints from the VS Code Snowflake extension, install the extension from the Visual Studio Code Marketplace:

---
title: Local testing framework
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/testing-locally.md
section: Snowpark
---

# Local testing framework

This topic explains how to test your code locally when working with the Snowpark Python library.

The Snowpark Python local testing framework is an emulator that allows you to create and operate on Snowpark Python DataFrames locally without
connecting to a Snowflake account. You can use the local testing framework to test your DataFrame operations on your development
machine or in a CI (continuous integration) pipeline before deploying code changes to your account. The API is the same,
so you can run your tests either locally or against a Snowflake account without making code changes.

## Prerequisites

To use the local testing framework:

You must use version 1.18.0 or later of the Snowpark Python library with the optional dependency `localtest`. The supported versions of Python are:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

### Install the Snowpark Python library

* To install the library with the optional dependency, run the following command:

  ```bash
  pip install "snowflake-snowpark-python[localtest]"
  ```

## Create a session and enable local testing

1. Create a Snowpark `Session` and set the local testing configuration to `True`:

   ```python
   from snowflake.snowpark import Session

   session = Session.builder.config('local_testing', True).create()
   ```
2. Use the session to create and operate on DataFrames:

   ```python
   df = session.create_dataframe([[1,2],[3,4]],['a','b'])
   df.with_column('c', df['a']+df['b']).show()
   ```

## Loading data

You can create Snowpark DataFrames from Python primitives, files, and pandas DataFrames.
This is useful for specifying the input and expected output of test cases. With this method,
the data is in source control, which makes it easier to keep the test data in sync with the test cases.

### Load CSV data

* To load CSV files into a Snowpark DataFrame, first call `Session.file.put()` to load the file to the in-memory stage, and then use `Session.read()` to read the contents.

**Example**

Assume there is a file, `data.csv`, with the following contents:

```none
col1,col2,col3,col4
1,a,true,1.23
2,b,false,4.56
```

You can use the following code to load `data.csv` into a Snowpark DataFrame.
You need to put the file onto a stage first; If you do not, you will receive a “file cannot be found” error.

```python
from snowflake.snowpark.types import StructType, StructField, IntegerType, BooleanType, StringType, DoubleType

# Put file onto stage
session.file.put("data.csv", "@mystage", auto_compress=False)
schema = StructType(
    [
        StructField("col1", IntegerType()),
        StructField("col2", StringType()),
        StructField("col3", BooleanType()),
        StructField("col4", DoubleType()),
    ]
)

# with option SKIP_HEADER set to 1, the header will be skipped when the csv file is loaded
dataframe = session.read.schema(schema).option("SKIP_HEADER", 1).csv("@mystage/data.csv")
dataframe.show()
```

Expected output:

```output
-------------------------------------
|"COL1"  |"COL2"  |"COL3"  |"COL4"  |
-------------------------------------
|1       |a       |True    |1.23    |
|2       |b       |False   |4.56    |
-------------------------------------
```

### Load pandas data

* To create a Snowpark Python DataFrame from a pandas DataFrame, call the `create_dataframe` method and pass the data as a pandas DataFrame.

**Example**

```python
import pandas as pd

pandas_df = pd.DataFrame(
    data={
        "col1": pd.Series(["value1", "value2"]),
        "col2": pd.Series([1.23, 4.56]),
        "col3": pd.Series([123, 456]),
        "col4": pd.Series([True, False]),
    }
)

dataframe = session.create_dataframe(data=pandas_df)
dataframe.show()
```

Expected output:

```output
-------------------------------------
|"col1"  |"col2"  |"col3"  |"col4"  |
-------------------------------------
|value1  |1.23    |123     |True    |
|value2  |4.56    |456     |False   |
-------------------------------------
```

* To convert a Snowpark Python DataFrame to a pandas DataFrame, call the `to_pandas` method on the DataFrame.

**Example**

```python
from snowflake.snowpark.types import StructType, StructField, StringType, DoubleType, LongType, BooleanType

dataframe = session.create_dataframe(
    data=[
        ["value1", 1.23, 123, True],
        ["value2", 4.56, 456, False],
    ],
    schema=StructType([
        StructField("col1", StringType()),
        StructField("col2", DoubleType()),
        StructField("col3", LongType()),
        StructField("col4", BooleanType()),
    ])
)

pandas_dataframe = dataframe.to_pandas()
print(pandas_dataframe.to_string())
```

Expected output:

```output
    COL1  COL2  COL3   COL4
0  value1  1.23   123   True
1  value2  4.56   456  False
```

## Create a PyTest Fixture for a session

[PyTest fixtures](https://docs.pytest.org/en/6.2.x/fixture.html) are functions that are executed before a test (or module of tests),
typically to provide data or connections to tests. In this procedure, you create a fixture that returns a Snowpark `Session` object.

1. If you do not already have a `test` directory, create one.
2. In the `test` directory, create a file named `conftest.py` with the following contents, where `connection_parameters` is a dictionary with your Snowflake account credentials:

   ```python
   # test/conftest.py
   import pytest
   from snowflake.snowpark.session import Session

   def pytest_addoption(parser):
       parser.addoption("--snowflake-session", action="store", default="live")

   @pytest.fixture(scope='module')
   def session(request) -> Session:
       if request.config.getoption('--snowflake-session') == 'local':
           return Session.builder.config('local_testing', True).create()
       else:
           return Session.builder.configs(CONNECTION_PARAMETERS).create()
   ```

For more information about the dictionary format, see [Creating a Session](creating-session.md).

The call to `pytest_addoption` adds a command-line option named `snowflake-session` to the `pytest` command.
The `Session` fixture checks this command-line option and creates a local or live `Session`, depending on its value.
This lets you easily switch between local and live modes for testing, as shown in the following command-line examples:

```python
# Using local mode:
pytest --snowflake-session local

# Using live mode
pytest
```

## SQL operations

`Session.sql(...)` is not supported in the local testing framework. Use Snowpark’s DataFrame APIs whenever possible,
and in cases where you must use `Session.sql(...)`, you can mock the tabular return value by using Python’s
`unittest.mock.patch` to patch the expected response from a given `Session.sql()` call.

In the following example, `mock_sql()` maps the SQL query text to the desired DataFrame response.
The conditional statement checks whether the current session is using local testing, and if so, applies the patch to the `Session.sql()` method.

```python
from unittest import mock
from functools import partial

def test_something(pytestconfig, session):

    def mock_sql(session, sql_string):  # patch for SQL operations
        if sql_string == "select 1,2,3":
            return session.create_dataframe([[1,2,3]])
        else:
            raise RuntimeError(f"Unexpected query execution: {sql_string}")

    if pytestconfig.getoption('--snowflake-session') == 'local':
        with mock.patch.object(session, 'sql', wraps=partial(mock_sql, session)): # apply patch for SQL operations
            assert session.sql("select 1,2,3").collect() == [Row(1,2,3)]
    else:
        assert session.sql("select 1,2,3").collect() == [Row(1,2,3)]
```

When local testing is enabled, all tables created by `DataFrame.save_as_table()` are saved as temporary tables in memory and can be
retrieved using `Session.table()`. You can use the supported DataFrame operations on the table as usual.

## Patching built-in functions

Some of the built-in functions under `snowflake.snowpark.functions` are not supported in the local testing framework.
If you use a function that is not supported, you can use the `@patch` decorator from `snowflake.snowpark.mock` to create a patch.

For the patched function to be defined and implemented, the signature (parameter list) must align with the built-in function’s parameters. The local testing framework passes parameters to the patched function using the following rules:

* For parameters of type `ColumnOrName` in the signature of built-in functions, `ColumnEmulator` is passed as the parameter of the patched functions.
  `ColumnEmulator` is similar to a `pandas.Series` object that contains the column data.
* For parameters of type `LiteralType` in the signature of built-in functions, the literal value is passed as the parameter of the patched functions.
* Otherwise, the raw value is passed as the parameter of the patched functions.

As for the returning type of the patched functions, returning an instance of `ColumnEmulator` is expected in correspondence with the returning type of `Column` of built-in functions.

For example, the built-in function `to_timestamp()` could be patched like this:

```python
import datetime
from snowflake.snowpark.mock import patch, ColumnEmulator, ColumnType
from snowflake.snowpark.functions import to_timestamp
from snowflake.snowpark.types import TimestampType

@patch(to_timestamp)
def mock_to_timestamp(column: ColumnEmulator, format = None) -> ColumnEmulator:
    ret_column = ColumnEmulator(data=[datetime.datetime.strptime(row, '%Y-%m-%dT%H:%M:%S%z') for row in column])
    ret_column.sf_type = ColumnType(TimestampType(), True)
    return ret_column
```

## Skipping test cases

If your PyTest test suite contains a test case that is not well supported by local testing, you can skip those cases by using PyTest’s `mark.skipif` decorator.
The following example assumes that you configured your session and parameters as described earlier. The condition checks whether the `local_testing_mode` is set to `local`; if so, the test case is skipped with an explanatory message.

```python
import pytest

@pytest.mark.skipif(
    condition="config.getvalue('local_testing_mode') == 'local'",
reason="Test case disabled for local testing"
)
def test_case(session):
    ...
```

## Registering UDFs and stored procedures

You can create and call user-defined functions (UDFs) and stored procedures in the local testing framework. To create the objects, you can
use the following syntax options:

| Syntax | UDF | Stored procedure |
| --- | --- | --- |
| Decorators | `@udf` | `@sproc` |
| Register methods | `udf.register()` | `sproc.register()` |
| Register-from-file methods | `udf.register_from_file()` | `sproc.register_from_file()` |

**Example**

The following code example creates a UDF and stored procedure using the decorators, and then calls both by name:

```python
from snowflake.snowpark.session import Session
from snowflake.snowpark.dataframe import col, DataFrame
from snowflake.snowpark.functions import udf, sproc, call_udf
from snowflake.snowpark.types import IntegerType, StringType

# Create local session
session = Session.builder.config('local_testing', True).create()

# Create local table
table = 'example'
session.create_dataframe([[1,2],[3,4]],['a','b']).write.save_as_table(table)

# Register a UDF, which is called from the stored procedure
@udf(name='example_udf', return_type=IntegerType(), input_types=[IntegerType(), IntegerType()])
def example_udf(a, b):
    return a + b

# Register stored procedure
@sproc(name='example_proc', return_type=IntegerType(), input_types=[StringType()])
def example_proc(session, table_name):
    return session.table(table_name)\
        .with_column('c', call_udf('example_udf', col('a'), col('b')))\
        .count()

# Call the stored procedure by name
output = session.call('example_proc', table)

print(output)
```

## Limitations

The following list contains the known limitations and behavior gaps in the local testing framework. **Snowflake currently has no plans to address these
items.**

* Raw SQL strings and operations that require parsing SQL strings, such as `session.sql` and `DataFrame.filter("col1 > 12")`,
  are not supported.
* Asynchronous operations are not supported.
* Database objects such as tables, stored procedures, and UDFs are not persisted beyond the session level, and all operations are performed
  in memory. For example, permanent stored procedures registered in one mock session are not visible to other mock sessions.
* [String collation](../../../sql-reference/collation.md) related features, such as `Column.collate`, are not supported.
* `Variant`, `Array`, and `Object` data types are only supported with standard JSON encoding and decoding. Expressions
  like [1,2,,3,] are considered valid JSON in Snowflake but not in local testing, where Python’s built-in JSON functionalities are used. You
  can specify the module-level variables `snowflake.snowpark.mock.CUSTOM_JSON_ENCODER` and
  `snowflake.snowpark.mock.CUSTOM_JSON_DECODER` to override the default settings.
* Only a subset of Snowflake’s functions (including window functions) are implemented. To learn how to inject your own function definition,
  see Patching built-in functions.

  + Patching rank-related functions is currently not supported.
* [SQL format models](../../../sql-reference/sql-format-models.md) are not supported. For example, the mock implementation of `to_decimal` does not handle the
  optional parameter `format`.
* The Snowpark Python library does not have a built-in Python API to create or drop stages, so the local testing framework assumes that every
  incoming stage has already been created.
* The current implementation of UDFs and stored procedures does not perform any package validation. All packages referenced in your code
  need to be installed before the program is executed.
* Query tags are not supported.
* Query history is not supported.
* Lineage is not supported.
* When a UDF or stored procedure is registered, optional parameters such as `parallel`, `execute_as`, `statement_params`,
  `source_code_display`, `external_access_integrations`, `secrets`, and `comment` are ignored.
* For `Table.sample`, SYSTEM or BLOCK sampling is the same as ROW sampling.
* Snowflake does not officially support running the local testing framework inside stored procedures. Sessions of local testing mode inside
  stored procedures might encounter or trigger unexpected errors.

## Unsupported features

The following is a list of features that are currently not implemented in the local testing framework. **Snowflake is actively working to address
these items.**

In general, any reference to these functionalities should raise a `NotImplementedError`:

* UDTFs (user-defined table functions)
* UDAFs (user-defined aggregate functions)
* Vectorized UDFs and UDTFs
* Built-in table functions
* Table stored procedures
* `Geometry`, `Geography`, and `Vector` data types
* Interval expressions
* Read file formats other than JSON and CSV

  + For a supported file format, not all read options are supported. For example, `infer_schema` is not supported for the CSV format.

For any features not listed here as unsupported or as a known limitation, check the latest list of [feature requests for local testing](https://github.com/snowflakedb/snowpark-python/issues?q=is%3Aopen+label%3A%22local+testing%22+label%3A%22feature%22+), or
[create a feature request](https://github.com/snowflakedb/snowpark-python/issues/new/choose) in the `snowpark-python` GitHub repository.

## Known issues

The following is a list of known issues or behavior gaps that exist in the local testing framework. **Snowflake is actively planning to address
these issues.**

* Using window functions inside `DataFrame.groupby` or other aggregation operations is not supported.

  ```python
  # Selecting window function expressions is supported
  df.select("key", "value", sum_("value").over(), avg("value").over())

  # Aggregating window function expressions is NOT supported
  df.group_by("key").agg([sum_("value"), sum_(sum_("value")).over(window) - sum_("value")])
  ```
* Selecting columns with the same name will only return one column. As a workaround, use
  `Column.alias` to rename the columns to have distinct names.

  ```python
  df.select(lit(1), lit(1)).show() # col("a"), col("a")
  #---------
  #|"'1'"  |
  #---------
  #|1      |
  #|...    |
  #---------

  # Workaround: Column.alias
  DataFrame.select(lit(1).alias("col1_1"), lit(1).alias("col1_2"))
  # "col1_1", "col1_2"
  ```
* For `Table.merge` and `Table.update`, the session parameters `ERROR_ON_NONDETERMINISTIC_UPDATE` and
  `ERROR_ON_NONDETERMINISTIC_MERGE` must be set to `False`. This means that for multi-joins, one of the matched rows is updated.
* Fully qualified stage names in GET and PUT file operations are not supported. Database and schema names are treated as part of the stage name.
* The `mock_to_char` implementation only supports timestamps in a format that has separators between different time parts.
* `DataFrame.pivot` has a parameter called `values` that allows a pivot to be limited to specific values. Only statistically
  defined values can be used at this time. Values that are provided using a subquery will raise an error.
* Creating a `DataFrame` from a pandas `DataFrame` that contains a timestamp with timezone information is not supported.

For any issues not mentioned in this list, check the [latest list of open issues](https://github.com/snowflakedb/snowpark-python/issues?q=is%3Aopen+is%3Aissue+label%3A%22local+testing%22), or
[create a bug report](https://github.com/snowflakedb/snowpark-python/issues/new/choose) in the `snowpark-python` GitHub repository.

---
title: pandas on Snowflake
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/pandas-on-snowflake.md
section: Snowpark
---

# pandas on Snowflake

pandas on Snowflake lets you run your pandas code directly on your data in Snowflake.
By simply changing the import statement and a few lines of code, you can get the familiar
pandas experience to develop robust pipelines, while seamlessly benefiting from Snowflake’s performance and scalability as your pipelines scale.

pandas on Snowflake intelligently determines whether to run pandas code locally or use the Snowflake engine to scale and enhance performance through Hybrid execution. When working with large datasets in Snowflake, it runs workloads natively in Snowflake through transpilation to SQL, enabling it to take advantage of parallelization and the data governance and security benefits of Snowflake.

pandas on Snowflake is delivered through the Snowpark pandas API as part of the [Snowpark Python library](index.md), which enables scalable data processing of Python code within the Snowflake platform.

## Benefits of using pandas on Snowflake

* **Meeting Python developers where they are:** pandas on Snowflake offers a familiar interface to Python developers by providing a
  pandas-compatible layer that can run natively in Snowflake.
* **Scalable distributed pandas:** pandas on Snowflake bridges the convenience of pandas with the scalability of Snowflake by leveraging existing query optimization techniques in Snowflake. Minimal code rewrites are required, simplifying the migration journey, so you can seamlessly move from prototype to production.
* **No additional compute infrastructure to manage and tune:** pandas on Snowflake leverages the Snowflake’s powerful compute engine, so you do not need to set
  up or manage any additional compute infrastructure.

## Getting started with pandas on Snowflake

> **Note:**
>
> For a hands-on example of how to use pandas on Snowflake, check out this [Notebook](https://github.com/Snowflake-Labs/snowflake-python-recipes/blob/main/pandas%20on%20Snowflake%20101/pandas%20on%20Snowflake%20101.ipynb) and watch this [video](https://www.youtube.com/watch?v=p9eX0QQGiZE).

To install pandas on Snowflake, you can use conda or pip to install the package. For detailed instructions, see Installation.

```bash
pip install "snowflake-snowpark-python[modin]"
```

Once pandas on Snowflake is installed, instead of importing pandas as `import pandas as pd`, use the following two lines:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
```

Here is an example of how you can start using pandas on Snowflake through the pandas on Snowpark Python library with Modin:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin

# Create a Snowpark session with a default connection.
from snowflake.snowpark.session import Session
session = Session.builder.create()

# Create a Snowpark pandas DataFrame from existing Snowflake table
df = pd.read_snowflake('SNOWFALL')

# Inspect the DataFrame
df
```

```output
      DAY  LOCATION  SNOWFALL
0       1  Big Bear       8.0
1       2  Big Bear      10.0
2       3  Big Bear       NaN
3       1     Tahoe       3.0
4       2     Tahoe       NaN
5       3     Tahoe      13.0
6       1  Whistler       NaN
7  Friday  Whistler      40.0
8       3  Whistler      25.0
```

```python
# In-place point update to fix data error.
df.loc[df["DAY"]=="Friday","DAY"]=2

# Inspect the columns after update.
# Note how the data type is updated automatically after transformation.
df["DAY"]
```

```output
0    1
1    2
2    3
3    1
4    2
5    3
6    1
7    2
8    3
Name: DAY, dtype: int64
```

```python
# Drop rows with null values.
df.dropna()
```

```output
  DAY  LOCATION  SNOWFALL
0   1  Big Bear       8.0
1   2  Big Bear      10.0
3   1     Tahoe       3.0
5   3     Tahoe      13.0
7   2  Whistler      40.0
8   3  Whistler      25.0
```

```python
# Compute the average daily snowfall across locations.
df.groupby("LOCATION").mean()["SNOWFALL"]
```

```output
LOCATION
Big Bear     9.0
Tahoe        8.0
Whistler    32.5
Name: SNOWFALL, dtype: float64
```

`read_snowflake` supports reading from Snowflake views, dynamic tables, Iceberg tables, and more. You can also pass in a SQL query directly and get back a pandas on Snowflake DataFrame, making it easy to move seamlessly between SQL and pandas on Snowflake.

```Python
summary_df = pd.read_snowflake("SELECT LOCATION, AVG(SNOWFALL) AS avg_snowfall FROM SNOWFALL GROUP BY LOCATION")
summary_df
```

## How hybrid execution works

> **Note:**
>
> Starting with Snowpark Python version 1.40.0, hybrid execution is enabled by default when using pandas on Snowflake.

pandas on Snowflake uses hybrid execution to determine whether to run pandas code locally or use the Snowflake engine to scale and enhance performance. This allows you to continue writing familiar pandas code to develop robust pipelines, without having to think about the most optimal and efficient way to run your code, while seamlessly benefiting from Snowflake’s performance and scalability as their pipelines scale.

**Example 1**: Create a small, 11-row DataFrame inline. With hybrid execution, Snowflake selects local, in-memory pandas backend for executing the operation:

```python
# Create a basic dataframe with 11 rows
df = pd.DataFrame([
    ("New Year's Day", "2025-01-01"),
    ("Martin Luther King Jr. Day", "2025-01-20"),
    ("Presidents' Day", "2025-02-17"),
    ("Memorial Day", "2025-05-26"),
    ("Juneteenth National Independence Day", "2025-06-19"),
    ("Independence Day", "2025-07-04"),
    ("Labor Day", "2025-09-01"),
    ("Columbus Day", "2025-10-13"),
    ("Veterans Day", "2025-11-11"),
    ("Thanksgiving Day", "2025-11-27"),
    ("Christmas Day", "2025-12-25")
], columns=["Holiday", "Date"])
# Print out the backend used for this dataframe
df.get_backend()
# >> Output: 'Pandas'
```

**Example 2**: Seed a table with 10 million rows of transactions

```python
# Create a 10M row table in Snowflake and populate with sythentic data
session.sql('''CREATE OR REPLACE TABLE revenue_transactions (Transaction_ID STRING, Date DATE, Revenue FLOAT);''').collect()
session.sql('''SET num_days = (SELECT DATEDIFF(DAY, '2024-01-01', CURRENT_DATE));''').collect()
session.sql('''INSERT INTO revenue_transactions (Transaction_ID, Date, Revenue) SELECT UUID_STRING() AS Transaction_ID, DATEADD(DAY, UNIFORM(0, $num_days, RANDOM()), '2024-01-01') AS Date, UNIFORM(10, 1000, RANDOM()) AS Revenue FROM TABLE(GENERATOR(ROWCOUNT => 10000000));''').collect()

# Read Snowflake table as Snowpark pandas dataframe
df_transactions = pd.read_snowflake("REVENUE_TRANSACTIONS")
```

You can see that the table leverages Snowflake as the backend since this is a large table that resides in Snowflake.

```python
print(f"The dataset size is {len(df_transactions)} and the data is located in {df_transactions.get_backend()}.")
# >> Output: The dataset size is 10000000 and the data is located in Snowflake.

#Perform some operations on 10M rows with Snowflake
df_transactions["DATE"] = pd.to_datetime(df_transactions["DATE"])
df_transactions.groupby("DATE").sum()["REVENUE"]
```

**Example 3**: Filter data and perform a `groupby` aggregation resulting in 7 rows of data.

When data is filtered, Snowflake implicitly recognizes the backend choice of engine changes from Snowflake to pandas, since the output is only 7 rows of data.

```python
# Filter to data in last 7 days
df_transactions_filter1 = df_transactions[(df_transactions["DATE"] >= pd.Timestamp.today().date() - pd.Timedelta('7 days')) & (df_transactions["DATE"] < pd.Timestamp.today().date())]

# Since filter is not yet evaluated, data stays in Snowflake
assert df_transactions_filter1.get_backend() == "Snowflake"
# After groupby operation, result is transfered from Snowflake to Pandas
df_transactions_filter1.groupby("DATE").sum()["REVENUE"]
```

## Notes and limitations

* The DataFrame type will always be `modin.pandas.DataFrame/Series/etc` even when the backend changes, to ensure interoperability/compatibility with downstream code.
* To determine what backend to use, Snowflake sometimes uses an estimate of the row size instead of computing the exact length of the DataFrame at each step. This means that Snowflake may not always switch to the optimal backend immediately after an operation when the dataset gets larger/smaller (e.g. filter, aggregation).
* When there is an operation that combines two or more DataFrames across different backends, Snowflake determines where to move the data based on the lowest data transfer cost.
* Filter operations may not result in the movement of data, because Snowflake may not be able to estimate the size of the underlying filtered data.
* Any DataFrames comprised of in-memory Python data will use the pandas backend, such as the following:

  ```python
  pd.DataFrame([1])
  ```

  ```python
  pd.DataFrame(pandas.DataFrame([1]))
  ```

  ```python
  pd.Series({'a': [4]})
  ```

  ```python
  An empty DataFrame: pd.DataFrame()
  ```
* DataFrames will automatically move from the Snowflake engine to the pandas engine on a limited set of operations. These operations include `df.apply`, `df.plot`, `df.iterrows`, `df.itertuples`, `series.items`, and in reduction operations where the size of data is guaranteed to be smaller. Not all operations are supported points where data migration can occur.
* Hybrid execution does not automatically move a DataFrame from the pandas engine back to Snowflake, except in cases where an operation like `pd.concat` acts on multiple DataFrames.
* Snowflake does not automatically move a DataFrame from the pandas engine back to Snowflake unless an operation like `pd.concat` acts on multiple DataFrames.

## When you should use pandas on Snowflake

You should use pandas on Snowflake if any of the following is true:

* You are familiar with the pandas API and the broader PyData ecosystem.
* You work on a team with others who are familiar with pandas and want to collaborate on the same codebase.
* You have existing code written in pandas.
* You prefer more accurate code completion from AI-based copilot tools.

For more information, see [Snowpark DataFrames vs Snowpark pandas DataFrame: Which should I choose?](working-with-dataframes.md)

## Using pandas on Snowflake with Snowpark DataFrames

The pandas on Snowflake and DataFrame API is highly interoperable, so you can build a pipeline that leverages both APIs. For more information, see [Snowpark DataFrames vs Snowpark pandas DataFrame: Which should I choose?](working-with-dataframes.md)

You can use the following operations to do conversions between Snowpark DataFrames and Snowpark pandas DataFrames:

| Operation | Input | Output |
| --- | --- | --- |
| [to_snowpark_pandas](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrame.to_snowpark_pandas) | Snowpark DataFrame | Snowpark pandas DataFrame |
| [to_snowpark](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.to_snowpark) | Snowpark pandas DataFrame or Snowpark pandas Series | Snowpark DataFrame |

## How pandas on Snowflake compares to native pandas

pandas on Snowflake and native pandas have similar DataFrame APIs with matching signatures and similar semantics.
pandas on Snowflake provides the same API signature as native pandas and provides scalable computation with Snowflake.
pandas on Snowflake respects the semantics described in the native pandas documentation as much as possible, but it uses the Snowflake
computation and type system. However, when native pandas executes on a client machine, it uses the Python computation and type system.
For information about the type mapping between pandas on Snowflake and Snowflake, see
Data types.

Starting with Snowpark Python 1.40.0, pandas on Snowflake is best used with data which is already in Snowflake. To convert between native pandas and pandas on Snowflake type, use the following operations:

| Operation | Input | Output |
| --- | --- | --- |
| [to_pandas](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/snowflake.snowpark.modin.pandas.to_pandas) | Snowpark pandas DataFrame | Native pandas DataFrame - Materialize all data to the local environment. If the dataset is large, this may result in an out-of-memory error. |
| [pd.DataFrame(…)](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/snowflake.snowpark.modin.pandas.DataFrame) | Native pandas DataFrame, raw data, Snowpark pandas object | Snowpark pandas DataFrame |

### Execution environment

* `pandas`: Operates on a single machine and processes in-memory data.
* `pandas on Snowflake`: Integrates with Snowflake, which allows for distributed computing across a cluster of machines for large datasets, while leveraging in memory pandas for processing small datasets. This integration enables handling of much larger datasets that exceed the memory capacity of a single machine. Note that using the Snowpark
  pandas API requires a connection to Snowflake.

### Lazy versus eager evaluation

* `pandas`: Executes operations immediately and materializes results fully in memory after each operation. This eager
  evaluation of operations might lead to increased memory pressure because data must be moved extensively within a machine.
* `pandas on Snowflake`: Provides the same API experience as pandas. It mimics the eager evaluation model of pandas, but internally
  builds a lazily-evaluated query graph to enable optimization across operations.

  Fusing and transpiling operations through a query graph enables additional optimization opportunities for the underlying distributed
  Snowflake compute engine, which decreases both cost and end-to-end pipeline runtime compared to running pandas directly within Snowflake.

  > **Note:**
  >
  > I/O-related APIs and APIs whose return value is not a Snowpark pandas object (that is, `DataFrame`, `Series` or `Index`) always evaluate eagerly. For example:
  >
  > + `read_snowflake`
  > + `to_snowflake`
  > + `to_pandas`
  > + `to_dict`
  > + `to_list`
  > + `__repr__`
  > + The dunder method, `__array__` which can be called automatically by some third-party libraries such as scikit-learn.
  >   Calls to this method will materialize results to the local machine.

### Data source and storage

* `pandas`: Supports the various readers and writers listed in the pandas documentation in
  [IO tools (text, CSV, HDF5, …)](https://pandas.pydata.org/docs/user_guide/io.html).
* `pandas on Snowflake`: Can read and write from Snowflake tables and read local or staged CSV, JSON, or Parquet files.
  For more information, see IO (Read and Write).

### Data types

* `pandas`: Has a rich set of data types, such as integers, floats, strings, `datetime` types, and categorical types. It also
  supports user-defined data types. Data types in pandas are typically derived from the underlying data and are enforced strictly.
* `pandas on Snowflake`: Is constrained by Snowflake type system, which maps pandas objects to SQL by translating the pandas data types to the SQL types in Snowflake. A majority
  of pandas types have a natural equivalent in Snowflake, but the mapping is not always one to one. In some cases, multiple pandas types
  are mapped to the same SQL type.

The following table lists the type mappings between pandas and Snowflake SQL:

| pandas type | Snowflake type |
| --- | --- |
| All signed/unsigned integer types, including pandas extended integer types | NUMBER(38, 0) |
| All float types, including pandas extended float data types | FLOAT |
| `bool`, `BooleanDtype` | BOOLEAN |
| `str`, `StringDtype` | STRING |
| `datetime.time` | TIME |
| `datetime.date` | DATE |
| All timezone-naive `datetime` types | TIMESTAMP_NTZ |
| All timezone-aware `datetime` types | TIMESTAMP_TZ |
| `list`, `tuple`, `array` | ARRAY |
| `dict`, `json` | MAP |
| Object column with mixed data types | VARIANT |
| Timedelta64[ns] | NUMBER(38, 0) |

> **Note:**
>
> Categorical, period, interval, sparse, and user-defined data types are not supported. Timedelta is only supported on the pandas on Snowpark client today. When writing Timedelta back to Snowflake, it will be stored as Number type.

The following table provides the mapping of Snowflake SQL types back to pandas on Snowflake types using `df.dtypes`:

| Snowflake type | pandas on Snowflake type (`df.dtypes`) |
| --- | --- |
| NUMBER (`scale = 0`) | `int64` |
| NUMBER (`scale > 0`), REAL | `float64` |
| BOOLEAN | `bool` |
| STRING, TEXT | `object (str)` |
| VARIANT, BINARY, GEOMETRY, GEOGRAPHY | `object` |
| ARRAY | `object (list)` |
| OBJECT | `object (dict)` |
| TIME | `object (datetime.time)` |
| TIMESTAMP, TIMESTAMP_NTZ, TIMESTAMP_LTZ, TIMESTAMP_TZ | `datetime64[ns]` |
| DATE | `object (datetime.date)` |

When you convert from the Snowpark pandas DataFrame to the native pandas DataFrame with `to_pandas()`, the native pandas DataFrame will
have refined data types compared to the pandas on Snowflake types, which are compatible with the [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md) for
functions and procedures.

### Casting and type inference

* `pandas`: Relies on [NumPy](https://numpy.org/) and by default follows the NumPy and Python type system for implicit type casting
  and inference. For example, it treats booleans as integer types, so `1 + True` returns `2`.
* `pandas on Snowflake`: Maps NumPy and Python types to Snowflake types according to the preceding table, and uses the underlying
  Snowflake type system for implicit [type casting and inference](../../../sql-reference/data-type-conversion.md). For example, in accordance
  with the [Logical data types](../../../sql-reference/data-types-logical.md), it does not implicitly convert booleans to integer types, so `1 + True` results in a
  type conversion error.

### Null value handling

* `pandas`: In pandas versions 1.x, pandas was flexible when
  [handling missing data](https://pandas.pydata.org/docs/user_guide/missing_data.html#values-considered-missing), so it treated all of
  Python `None`, `np.nan`, `pd.NaN`, `pd.NA`, and `pd.NaT` as missing values.
  In later versions of pandas (2.2.x) these values are treated as different values.
* `pandas on Snowflake`: Adopts a similar approach to earlier pandas versions that treats all of the preceding values listed as missing values.
  Snowpark reuses `NaN`, `NA`, and `NaT` from pandas. But note that all these missing values are treated interchangeably and stored as SQL NULL in the Snowflake table.

### Offset/frequency aliases

* `pandas`: Date offsets in pandas changed in version 2.2.1. The single-letter aliases `'M'`, `'Q'`, `'Y'`, and others have been deprecated in favor of two-letter offsets.
* `pandas on Snowflake`: Exclusively uses the new offsets described in the [pandas time series documentation](https://pandas.pydata.org/pandas-docs/stable/user_guide/timeseries.html#dateoffset-objects).

## Install the pandas on Snowflake library

**Prerequisites**

The following package versions are required:

* Python 3.9 (deprecated), 3.10, 3.11, 3.12 or 3.13
* Modin version 0.32.0
* pandas version 2.2.\*

> **Tip:**
>
> To use pandas on Snowflake in Snowflake Notebooks, see the setup instructions in [pandas on Snowflake in notebooks](../../../user-guide/ui-snowsight/notebooks-use-with-snowflake.md).

To install pandas on Snowflake in your development environment, follow these steps:

1. Change to your project directory and activate your Python virtual environment.

   > **Note:**
   >
   > The API is under active development, so we recommend installing it in a Python virtual environment instead of
   > system-wide. This practice allows each project you create to use a specific version, which insulates you from changes
   > in future versions.

   You can create a Python virtual environment for a particular Python version by using tools like
   [Anaconda](https://www.anaconda.com/),
   [Miniconda](https://docs.conda.io/en/latest/miniconda.html), or
   [virtualenv](https://docs.python.org/3/tutorial/venv.html).

   For example, to use conda to create a Python 3.12 virtual environment, run these commands:

   ```bash
   conda create --name snowpark_pandas python=3.12
   conda activate snowpark_pandas
   ```

   > **Note:**
   >
   > If you previously installed an older version of pandas on Snowflake using Python 3.9 and pandas 1.5.3, you will need to upgrade your Python and pandas
   > versions as described above. Follow the steps to create a new environment with Python 3.10 to 3.13.
2. Install the Snowpark Python library with Modin:

   ```bash
   pip install "snowflake-snowpark-python[modin]"
   ```

   or

   ```bash
   conda install snowflake-snowpark-python modin==0.28.1
   ```

> > **Note:**
> >
> > Confirm that `snowflake-snowpark-python` version 1.17.0 or later is installed.

## Authenticating to Snowflake

Before using pandas on Snowflake, you must establish a session with the Snowflake database.
You can use a config file to choose the connection parameters for your session, or you can enumerate them in your code.
For more information, see [Creating a Session for Snowpark Python](creating-session.md).
If a unique active Snowpark Python session exists, pandas on Snowflake will automatically use it. For example:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
from snowflake.snowpark import Session

CONNECTION_PARAMETERS = {
    'account': '<myaccount>',
    'user': '<myuser>',
    'password': '<mypassword>',
    'role': '<myrole>',
    'database': '<mydatabase>',
    'schema': '<myschema>',
    'warehouse': '<mywarehouse>',
}
session = Session.builder.configs(CONNECTION_PARAMETERS).create()

# pandas on Snowflake will automatically pick up the Snowpark session created above.
# It will use that session to create new DataFrames.
df = pd.DataFrame([1, 2])
df2 = pd.read_snowflake('CUSTOMER')
```

The `pd.session` is a Snowpark session, so you can do anything with it that you can do with any other Snowpark session. For example, you can use it to execute an arbitrary SQL query,
which results in a Snowpark DataFrame as per the [Session API](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Session.sql), but note that
the result is a Snowpark DataFrame, not a Snowpark pandas DataFrame.

```python
# pd.session is the session that pandas on Snowflake is using for new DataFrames.
# In this case it is the same as the Snowpark session that we've created.
assert pd.session is session

# Run SQL query with returned result as Snowpark DataFrame
snowpark_df = pd.session.sql('select * from customer')
snowpark_df.show()
```

Alternatively, you can configure your Snowpark connection parameters in a [configuration file](../../python-connector/python-connector-connect.md).
This eliminates the need to enumerate connection parameters in your code, which allows you to write your pandas on Snowflake code almost as you would normally write pandas code.

1. Create a configuration file located at `~/.snowflake/connections.toml` that looks something like this:

   ```bash
   default_connection_name = "default"

   [default]
   account = "<myaccount>"
   user = "<myuser>"
   password = "<mypassword>"
   role="<myrole>"
   database = "<mydatabase>"
   schema = "<myschema>"
   warehouse = "<mywarehouse>"
   ```
2. To create a session using these credentials, use `snowflake.snowpark.Session.builder.create()`:

   ```python
   import modin.pandas as pd
   import snowflake.snowpark.modin.plugin
   from snowflake.snowpark import Session

   # Session.builder.create() will create a default Snowflake connection.
   Session.builder.create()
   # create a DataFrame.
   df = pd.DataFrame([[1, 2], [3, 4]])
   ```

You can also create multiple Snowpark sessions, then assign one of them to pandas on Snowflake. pandas on Snowflake only uses one session, so you have to explicitly assign one
of the sessions to pandas on Snowflake with `pd.session = pandas_session`:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
from snowflake.snowpark import Session

pandas_session = Session.builder.configs({"user": "<user>", "password": "<password>", "account": "<account1>").create()
other_session = Session.builder.configs({"user": "<user>", "password": "<password>", "account": "<account2>").create()
pd.session = pandas_session
df = pd.DataFrame([1, 2, 3])
```

The following example shows that trying to use pandas on Snowflake when there is no active Snowpark session will raise a `SnowparkSessionException` with an
error like “pandas on Snowflake requires an active snowpark session, but there is none.” After you create a session, you can use pandas on Snowflake. For example:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin

df = pd.DataFrame([1, 2, 3])
```

The following example shows that trying to use pandas on Snowflake when there are multiple active Snowpark sessions will cause
a `SnowparkSessionException` with a message like, “There are multiple active snowpark sessions, but you need to choose one for pandas on Snowflake.”

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
from snowflake.snowpark import Session

pandas_session = Session.builder.configs({"user": "<user>", "password": "<password>", "account": "<account1>"}).create()
other_session = Session.builder.configs({"user": "<user>", "password": "<password>", "account": "<account2>"}).create()
df = pd.DataFrame([1, 2, 3])
```

> **Note:**
>
> You must set the session used for a new pandas on Snowflake DataFrame or Series via `modin.pandas.session`.
> However, joining or merging DataFrames created with different sessions is not supported, so you should avoid repeatedly setting different sessions
> and creating DataFrames with different sessions in a workflow.

## API reference

See [the pandas on Snowflake API reference](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/index) for the full list of currently implemented APIs and methods available.

For a full list of supported operations, see the following tables in pandas on Snowflake reference:

* [pandas general utilities supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/general_supported)
* [Series supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/series_supported)
* [DataFrame supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/dataframe_supported)
* [Index supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index_supported)
* [Windows supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/window_supported)
* [GroupBy supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/groupby_supported)
* [Resampler supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/resampling_supported)
* [DatetimeProperties supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/series_dt_supported)
* [StringMethods supported APIs](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/series_str_supported)

## APIs and configuration parameter for hybrid execution

Hybrid execution uses a combination of the dataset size estimate and the operations being applied to the DataFrame to determine the choice of backend. In general, datasets under 100k rows will tend to use local pandas; those over 100k rows will tend to use Snowflake, unless the dataset is loaded from local files.

### Configuring transfer costs

To change the default switching threshold to another row limit value, you can modify the environment variable before initializing a DataFrame:

```python
# Change row transfer threshold to 500k
from modin.config.envvars import SnowflakePandasTransferThreshold
SnowflakePandasTransferThreshold.put(500_000)
```

Setting this value will penalize transferring rows out of Snowflake.

### Configuring local execution limits

Once a DataFrame is local it will generally stay local unless there is a need to move it back to Snowflake for a merge, but there is an upper bound considered for the maximum size of data than can be processed locally. Currently this boundary is 10M rows.

### Checking and setting backend

To check the current backend of choice, you can use the [df.getbackend()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.DataFrame.get_backend#modin.pandas.DataFrame.get_backend) command, which returns `Pandas` for local execution, or `Snowflake` for pushdown execution.

To set the current backend of choice with either `set_backend` or its alias `move_to`:

```python
df_local = df.set_backend('Pandas')
```

```python
df_local = df.move_to('Pandas')
```

```python
df_snow = df.set_backend('Snowflake')
```

You can also set the backend in place:

```python
df.set_backend('Pandas', inplace=True)
```

To inspect and display information about *why* data was moved:

```python
pd.explain_switch()
```

### Manual override backend selection by pinning backend

By default, Snowflake automatically chooses the best backend for a given DataFrame and operation. If you would like to override the automatic engine selection, you can disable automatic switching on an object and all resulting data produced by it, using the [pin_backend()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.DataFrame.pin_backend#modin.pandas.DataFrame.pin_backend) method:

```python
pinned_df_snow = df.move_to('Snowflake').pin_backend()
```

To re-enable automatic backend switching, call [unpin_backend()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.DataFrame.unpin_backend#modin.pandas.DataFrame.unpin_backend):

```python
unpinned_df_snow = pinned_df_snow.unpin_backend()
```

## Using Snowpark pandas in Snowflake notebooks

To use pandas on Snowflake in Snowflake notebooks, see [pandas on Snowflake in notebooks](../../../user-guide/ui-snowsight/notebooks-use-with-snowflake.md).

## Using Snowpark pandas in Python Worksheets

To use Snowpark pandas, you need to install Modin by selecting `modin` from Packages in the Python Worksheet environment.

You can select the Return type of the Python function under Settings > Return type. By default, this is set as a Snowpark table. To display the Snowpark pandas DataFrame as a result, you can convert a Snowpark pandas DataFrame to a Snowpark DataFrame by calling [to_snowpark()](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.to_snowpark). No I/O cost will be incurred in this conversion.

Here is an example of using Snowpark pandas with Python Worksheets:

```python
import snowflake.snowpark as snowpark

def main(session: snowpark.Session):
  import modin.pandas as pd
  import snowflake.snowpark.modin.plugin

  df = pd.DataFrame([[1, 'Big Bear', 8],[2, 'Big Bear', 10],[3, 'Big Bear', None],
                  [1, 'Tahoe', 3],[2, 'Tahoe', None],[3, 'Tahoe', 13],
                  [1, 'Whistler', None],['Friday', 'Whistler', 40],[3, 'Whistler', 25]],
                  columns=["DAY", "LOCATION", "SNOWFALL"])

  # Print a sample of the dataframe to standard output.
  print(df)

  snowpark_df = df.to_snowpark(index=None)
  # Return value will appear in the Results tab.
  return snowpark_df
```

## Using pandas on Snowflake in stored procedures

You can use pandas on Snowflake in a [stored procedure](../../stored-procedure/python/procedure-python-overview.md) to build a data pipeline and schedule the execution of the stored procedure with [tasks](../../../user-guide/tasks-intro.md).

Here is how you can create a stored procedure using SQL:

```sqlexample-python
CREATE OR REPLACE PROCEDURE run_data_transformation_pipeline_sp()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python','modin')
HANDLER='data_transformation_pipeline'
AS $$
def data_transformation_pipeline(session):
  import modin.pandas as pd
  import snowflake.snowpark.modin.plugin
  from datetime import datetime
  # Create a Snowpark pandas DataFrame with sample data.
  df = pd.DataFrame([[1, 'Big Bear', 8],[2, 'Big Bear', 10],[3, 'Big Bear', None],
                    [1, 'Tahoe', 3],[2, 'Tahoe', None],[3, 'Tahoe', 13],
                    [1, 'Whistler', None],['Friday', 'Whistler', 40],[3, 'Whistler', 25]],
                      columns=["DAY", "LOCATION", "SNOWFALL"])
  # Drop rows with null values.
  df = df.dropna()
  # In-place point update to fix data error.
  df.loc[df["DAY"]=="Friday","DAY"]=2
  # Save Results as a Snowflake Table
  timestamp = datetime.now().strftime("%Y_%m_%d_%H_%M")
  save_path = f"OUTPUT_{timestamp}"
  df.to_snowflake(name=save_path, if_exists="replace", index=False)
  return f'Transformed DataFrame saved to {save_path}.'
$$;
```

Here is how you can create a stored procedure using the [Snowflake Python API](../../snowflake-python-api/snowflake-python-managing-functions-procedures.md):

```python
from snowflake.snowpark.context import get_active_session
session = get_active_session()

from snowflake.snowpark import Session

def data_transformation_pipeline(session: Session) -> str:
  import modin.pandas as pd
  import snowflake.snowpark.modin.plugin
  from datetime import datetime
  # Create a Snowpark pandas DataFrame with sample data.
  df = pd.DataFrame([[1, 'Big Bear', 8],[2, 'Big Bear', 10],[3, 'Big Bear', None],
                     [1, 'Tahoe', 3],[2, 'Tahoe', None],[3, 'Tahoe', 13],
                     [1, 'Whistler', None],['Friday', 'Whistler', 40],[3, 'Whistler', 25]],
                      columns=["DAY", "LOCATION", "SNOWFALL"])
  # Drop rows with null values.
  df = df.dropna()
  # In-place point update to fix data error.
  df.loc[df["DAY"]=="Friday","DAY"]=2
  # Save Results as a Snowflake Table
  timestamp = datetime.now().strftime("%Y_%m_%d_%H_%M")
  save_path = f"OUTPUT_{timestamp}"
  df.to_snowflake(name=save_path, if_exists="replace", index=False)
  return f'Transformed DataFrame saved to {save_path}.'

dt_pipeline_sproc = session.sproc.register(name="run_data_transformation_pipeline_sp",
                             func=data_transformation_pipeline,
                             replace=True,
                             packages=['modin', 'snowflake-snowpark-python'])
```

To call the stored procedure, you can run `dt_pipeline_sproc()` in Python or `CALL run_data_transformation_pipeline_sp()` in SQL.

## Using pandas on Snowflake with third-party libraries

pandas is commonly used with third-party library APIs for visualization and machine learning applications. pandas on Snowflake is interoperable with most of these libraries, so they can be used without converting to pandas DataFrames explicitly. However, note that distributed execution is not often supported in most third-party libraries except in limited use cases. Therefore, this can lead to slower performance on large datasets.

### Supported third-party libraries

The libraries listed below accept pandas on Snowflake DataFrames as input, but not all their methods have been tested. For an in-depth interoperability status on an API level, see [Interoperability with third party libraries](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/interoperability).

* Plotly
* Altair
* Seaborn
* Matplotlib
* Numpy
* Scikit-learn
* XGBoost
* NLTK
* Streamlit

pandas on Snowflake currently has limited compatibility for certain [NumPy](https://numpy.org/) and [Matplotlib](https://matplotlib.org/) APIs, such as distributed implementation for `np.where` and interoperability with `df.plot`. Converting Snowpark pandas DataFrames via `to_pandas()` when working with these third-party libraries will avoid multiple I/O calls.

Here is an example with [Altair](https://altair-viz.github.io/) for visualization and [scikit-learn](https://scikit-learn.org/stable/) for machine learning.

```python
# Create a Snowpark session with a default connection.
session = Session.builder.create()

train = pd.read_snowflake('TITANIC')

train[['Pclass', 'Parch', 'Sex', 'Survived']].head()
```

```output
    Pclass  Parch     Sex       Survived
0       3      0     male               0
1       1      0   female               1
2       3      0   female               1
3       1      0   female               1
4       3      0     male               0
```

```python
import altair as alt

survived_per_age_plot = alt.Chart(train).mark_bar(
).encode(
    x=alt.X('Age', bin=alt.Bin(maxbins=25)),
    y='count()',
    column='Survived:N',
    color='Survived:N',
).properties(
    width=300,
    height=300
).configure_axis(
    grid=False
)
```

You can also analyze survival based on gender.

```python
# Perform groupby aggregation with Snowpark pandas
survived_per_gender = train.groupby(['Sex','Survived']).agg(count_survived=('Survived', 'count')).reset_index()

survived_per_gender_pandas = survived_per_gender
survived_per_gender_plot = alt.Chart(survived_per_gender).mark_bar().encode(
   x='Survived:N',
   y='Survived_Count',
   color='Sex',
   column='Sex'
).properties(
   width=200,
   height=200
).configure_axis(
   grid=False
)
```

You can now use scikit-learn to train a simple model.

```python
feature_cols = ['Pclass', 'Parch']
X_pandas = train.loc[:, feature_cols]
y_pandas = train["Survived"]

from sklearn.linear_model import LogisticRegression

logreg = LogisticRegression()
logreg.fit(X_pandas, y_pandas)
y_pred_pandas = logreg.predict(X_pandas)
acc_eval = accuracy_score(y_pandas, y_pred_pandas)
```

> **Note:**
>
> For greater performance, we recommend converting to pandas DataFrames via [to_pandas()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.21.0/modin/pandas_api/snowflake.snowpark.modin.pandas.to_pandas), particularly when using machine learning libraries such as scikit-learn. The `to_pandas()` function collects all rows, however, so it may be better to reduce the dataframe size first with `sample(frac=0.1)` or `head(10)`.

### Unsupported libraries

When using unsupported third-party libraries with a pandas on Snowflake DataFrame, we recommend converting the pandas on Snowflake DataFrame to a pandas DataFrame by calling [to_pandas()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.21.0/modin/pandas_api/snowflake.snowpark.modin.pandas.to_pandas) before passing the DataFrame to the third-party library method.

> **Note:**
>
> Calling [to_pandas()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.21.0/modin/pandas_api/snowflake.snowpark.modin.pandas.to_pandas) pulls your data out of Snowflake and into memory, so consider that for large datasets and sensitive use cases.

## Using Snowflake Cortex LLM functions with Snowpark pandas

You can use [Snowflake Cortex LLM functions](../../../user-guide/snowflake-cortex/aisql.md) via the [Snowpark pandas apply function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.DataFrame.apply).

You apply the function with special keyword arguments. Currently, the following Cortex functions are supported:

* [AI_SENTIMENT](../../../sql-reference/functions/ai_sentiment.md)
* [SUMMARIZE (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/summarize-snowflake-cortex.md)
* [TRANSLATE](../../../sql-reference/functions/translate.md)
* [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/classify_text-snowflake-cortex.md)
* [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/extract_answer-snowflake-cortex.md)

The following example uses the [TRANSLATE](../../../sql-reference/functions/translate.md) function across multiple records in a Snowpark pandas DataFrame:

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin

from snowflake.cortex import Translate
content_df = pd.DataFrame(["good morning","hello", "goodbye"], columns=["content"])
result = content_df.apply(Translate, from_language="en", to_language="de")
result["content"]
```

Output:

```output
Guten Morgen
Hallo
Auf Wiedersehen
Name: content, dtype: object
```

The following example uses the [SENTIMENT (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/sentiment-snowflake-cortex.md) function on a Snowflake table named `reviews`:

```python
from snowflake.cortex import Sentiment

s = pd.read_snowflake("reviews")["content"]
result = s.apply(Sentiment)
result
```

The following example uses the [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/extract_answer-snowflake-cortex.md) to answer a question:

```python
from snowflake.cortex import ExtractAnswer
content = "The Snowflake company was co-founded by Thierry Cruanes, Marcin Zukowski, and Benoit Dageville in 2012 and is headquartered in Bozeman, Montana."

df = pd.DataFrame([content])
result = df.apply(ExtractAnswer, question="When was Snowflake founded?")
result[0][0][0]["answer"]
```

Output:

```output
'2012'
```

> **Note:**
>
> The [snowflake-ml-python](https://pypi.org/project/snowflake-ml-python/) package must be installed to use Cortex LLM functions.

## Limitations

pandas on Snowflake has the following limitations:

* pandas on Snowflake provides no guarantee of compatibility with OSS third-party libraries. Starting with version 1.14.0a1, however, Snowpark
  pandas introduces limited compatibility for NumPy, specifically for `np.where` usage. For more information, see
  [NumPy Interoperability](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/numpy).

  When you call third-party library APIs with a Snowpark pandas DataFrame,
  Snowflake recommends that you convert the Snowpark pandas DataFrame to a pandas DataFrame by calling `to_pandas()` before passing the DataFrame to
  the third-party library call. For more information, see Using pandas on Snowflake with third-party libraries.
* pandas on Snowflake is not integrated with [Snowpark ML](../../snowflake-ml/overview.md). When you use Snowpark ML, we recommend that you convert the Snowpark
  pandas DataFrame to a Snowpark DataFrame using [to_snowpark()](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.to_snowpark) before calling Snowpark ML.
* Lazy `MultiIndex` objects are not supported. When `MultiIndex` is used, it returns a native pandas `MultiIndex` object,
  which requires pulling all data to the client side.
* Not all pandas APIs have a distributed implementation in pandas on Snowflake, although some are being added. For unsupported APIs,
  `NotImplementedError` is thrown.
  For information about supported APIs, see the API reference documentation.
* pandas on Snowflake provides compatibility with any patch version of pandas 2.2.
* Snowpark pandas cannot be referenced within Snowpark pandas `apply` function. You can only use native pandas inside `apply`.

  > + Following is an example:
  >
  >   ```python
  >   import modin.pandas as pd
  >   import pandas
  >
  >   df.apply(lambda row: pandas.to_datetime(f"{row.date} {row.time}"), axis=1)
  >   ```

## Troubleshooting

This section describes troubleshooting tips for using pandas on Snowflake.

* When troubleshooting, try running the same operation on a native pandas DataFrame (or a sample) to see whether the same error persists.
  This approach might provide hints on how to fix your query. For example:

  ```python
  df = pd.DataFrame({"a": [1,2,3], "b": ["x", "y", "z"]})
  # Running this in Snowpark pandas throws an error
  df["A"].sum()
  # Convert a small sample of 10 rows to pandas DataFrame for testing
  pandas_df = df.head(10).to_pandas()
  # Run the same operation. KeyError indicates that the column reference is incorrect
  pandas_df["A"].sum()
  # Fix the column reference to get the Snowpark pandas query working
  df["a"].sum()
  ```
* If you have a long-running notebook opened, note that by default Snowflake sessions time out after the session is idle for 240 minutes (4 hours).
  When the session expires, if you run additional pandas on Snowflake queries, the following message appears: “Authentication token has expired. The user must authenticate again.”
  At this point, you must re-establish the connection to Snowflake. This might cause the loss of any unpersisted session variables.
  For more information about how to configure the session idle timeout parameter, see [Session policies](../../../user-guide/session-policies.md).

## Best practices

This section describes best practices to follow when using pandas on Snowflake.

* Avoid using iterative code patterns, such as `for` loops, `iterrows`, and `iteritems`. Iterative code patterns quickly increase
  the generated query complexity. Let pandas on Snowflake, not the client code, perform the data distribution and computation parallelization. With regard to iterative code patterns, look for operations that can be performed on the whole DataFrame, and use the corresponding operations instead.

```python
for i in np.arange(0, 50):
  if i % 2 == 0:
    data = pd.concat([data, pd.DataFrame({'A': i, 'B': i + 1}, index=[0])], ignore_index=True)
  else:
    data = pd.concat([data, pd.DataFrame({'A': i}, index=[0])], ignore_index=True)

# Instead of creating one DataFrame per row and concatenating them,
# try to directly create the DataFrame out of the data, like this:

data = pd.DataFrame(
      {
          "A": range(0, 50),
          "B": [i + 1 if i % 2 == 0 else None for i in range(50)],
      },
)
```

* Avoid calling `apply`, `applymap`, and `transform`, which are eventually implemented with
  [UDFs](../../udf/python/udf-python-introduction.md) or
  [UDTFs](../../udf/python/udf-python-tabular-functions.md), which might not be as performant as
  regular SQL queries. If the function applied has an equivalent DataFrame or series operation, use that operation instead.
  For example, instead of `df.groupby('col1').apply('sum')`, directly call `df.groupby('col1').sum()`.
* Call `to_pandas()` before passing the DataFrame or series to a third-party library call. pandas on Snowflake does not provide a
  compatibility guarantee with third-party libraries.
* Use a materialized regular Snowflake table to avoid extra I/O overhead. pandas on Snowflake works on top of a data snapshot that
  only works for regular tables. For other types, including external tables, views, and Apache Iceberg™ tables, a temporary table is
  created before the snapshot is taken, which introduces extra materialization overhead.
* pandas on Snowflake provides fast and zero copy clone capability while creating DataFrames from Snowflake tables using `read_snowflake`.
* Double check the result type before proceeding to other operations, and do explicit type casting with `astype` if needed.

  Due to limited type inference capability, if no type hint is given, `df.apply` will return results of object (variant) type even if the result contains all
  integer values. If other operations require the `dtype` to be `int`, you can do an explicit type casting by calling the
  `astype` method to correct the column type before you continue.
* Avoid calling APIs that require evaluation and materialization unless necessary.

  APIs that don’t return `Series` or `Dataframe` require eager evaluation and materialization to produce the result in the
  correct type. Same for plotting methods. Reduce calls to those APIs to minimize unnecessary evaluations and materialization.
* Avoid calling `np.where(<cond>, <scalar>, n)` on large datasets. The `<scalar>` will be broadcast to a DataFrame
  the size of `<cond>`, which may be slow.
* When working with iteratively built queries, `df.cache_result` can be used to materialize intermediate
  results to reduce the repeated evaluation and improve the latency and reduce complexity of the overall query. For example:

  ```python
  df = pd.read_snowflake('pandas_test')
  df2 = pd.pivot_table(df, index='index_col', columns='pivot_col') # expensive operation
  df3 = df.merge(df2)
  df4 = df3.where(df2 == True)
  ```

  In the example above, the query to produce `df2` is expensive to compute and is reused in the creation of both `df3` and `df4`.
  Materializing `df2` into a temporary table (making subsequent operations involving `df2` a table scan instead of a pivot) can reduce the
  overall latency of the code block:

  ```python
  df = pd.read_snowflake('pandas_test')
  df2 = pd.pivot_table(df, index='index_col', columns='pivot_col') # expensive operation
  df2.cache_result(inplace=True)
  df3 = df.merge(df2)
  df4 = df3.where(df2 == True)
  ```

## Examples

Here is a code example with pandas operations. We start with a Snowpark pandas DataFrame named `pandas_test`, which contains three
columns: `COL_STR`, `COL_FLOAT`, and `COL_INT`. To view the notebook associated with these examples, see the
[pandas on Snowflake examples in the Snowflake-Labs repository](https://github.com/Snowflake-Labs/sf-samples/blob/main/samples/snowpark-pandas/api-examples/api_examples.ipynb).

```python
import modin.pandas as pd
import snowflake.snowpark.modin.plugin

from snowflake.snowpark import Session

CONNECTION_PARAMETERS = {
    'account': '<myaccount>',
    'user': '<myuser>',
    'password': '<mypassword>',
    'role': '<myrole>',
    'database': '<mydatabase>',
    'schema': '<myschema>',
    'warehouse': '<mywarehouse>',
}
session = Session.builder.configs(CONNECTION_PARAMETERS).create()

df = pd.DataFrame([['a', 2.1, 1],['b', 4.2, 2],['c', 6.3, None]], columns=["COL_STR", "COL_FLOAT", "COL_INT"])

df
```

```output
  COL_STR    COL_FLOAT    COL_INT
0       a          2.1        1.0
1       b          4.2        2.0
2       c          6.3        NaN
```

We save the DataFrame as a Snowflake table named `pandas_test`, which we will use throughout our examples.

```python
df.to_snowflake("pandas_test", if_exists='replace',index=False)
```

Next, we create a DataFrame from the Snowflake table. We drop the column `COL_INT` and then
save the result back to Snowflake with a column named `row_position`.

```python
# Create a DataFrame out of a Snowflake table.
df = pd.read_snowflake('pandas_test')

df.shape
```

```output
(3, 3)
```

```python
df.head(2)
```

```output
    COL_STR  COL_FLOAT  COL_INT
0         a        2.1        1
1         b        4.2        2
```

```python
df.dropna(subset=["COL_FLOAT"], inplace=True)

df
```

```output
    COL_STR  COL_FLOAT  COL_INT
0         a        2.1        1
1         c        6.3        2
```

```python
df.shape
```

```output
(2, 3)
```

```python
df.dtypes
```

```output
COL_STR       object
COL_FLOAT    float64
COL_INT        int64
dtype: object
```

```python
# Save the result back to Snowflake with a row_pos column.
df.reset_index(drop=True).to_snowflake('pandas_test2', if_exists='replace', index=True, index_label=['row_pos'])
```

The result is a new table, `pandas_test2`, which looks like this:

```output
     row_pos  COL_STR  COL_FLOAT  COL_INT
0          1         a       2.0        1
1          2         b       4.0        2
```

### IO (Read and Write)

```python
# Reading and writing to Snowflake
df = pd.DataFrame({"fruit": ["apple", "orange"], "size": [3.4, 5.4], "weight": [1.4, 3.2]})
df.to_snowflake("test_table", if_exists="replace", index=False )

df_table = pd.read_snowflake("test_table")

# Generate sample CSV file
with open("data.csv", "w") as f:
    f.write('fruit,size,weight\napple,3.4,1.4\norange,5.4,3.2')
# Read from local CSV file
df_csv = pd.read_csv("data.csv")

# Generate sample JSON file
with open("data.json", "w") as f:
    f.write('{"fruit":"apple", "size":3.4, "weight":1.4},{"fruit":"orange", "size":5.4, "weight":3.2}')
# Read from local JSON file
df_json = pd.read_json('data.json')

# Upload data.json and data.csv to Snowflake stage named @TEST_STAGE
# Read CSV and JSON file from stage
df_csv = pd.read_csv('@TEST_STAGE/data.csv')
df_json = pd.read_json('@TEST_STAGE/data.json')
```

For more information, see [Input/Output](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/io).

### Indexing

```python
df = pd.DataFrame({"a": [1,2,3], "b": ["x", "y", "z"]})
df.columns
```

```output
Index(['a', 'b'], dtype='object')
```

```python
df.index
```

```output
Index([0, 1, 2], dtype='int8')
```

```python
df["a"]
```

```output
0    1
1    2
2    3
Name: a, dtype: int8
```

```python
df["b"]
```

```output
0    x
1    y
2    z
Name: b, dtype: object
```

```python
df.iloc[0,1]
```

```output
'x'
```

```python
df.loc[df["a"] > 2]
```

```output
a  b
2  3  z
```

```python
df.columns = ["c", "d"]
df
```

```output
     c  d
0    1  x
1    2  y
2    3  z
```

```python
df = df.set_index("c")
df
```

```output
   d
c
1  x
2  y
3  z
```

```python
df.rename(columns={"d": "renamed"})
```

```output
    renamed
c
1       x
2       y
3       z
```

### Missing values

```python
import numpy as np
df = pd.DataFrame([[np.nan, 2, np.nan, 0],
                [3, 4, np.nan, 1],
                [np.nan, np.nan, np.nan, np.nan],
                [np.nan, 3, np.nan, 4]],
                columns=list("ABCD"))
df
```

```output
     A    B   C    D
0  NaN  2.0 NaN  0.0
1  3.0  4.0 NaN  1.0
2  NaN  NaN NaN  NaN
3  NaN  3.0 NaN  4.0
```

```python
df.isna()
```

```output
       A      B     C      D
0   True  False  True  False
1  False  False  True  False
2   True   True  True   True
3   True  False  True  False
```

```python
df.fillna(0)
```

```output
     A    B    C    D
0   0.0  2.0  0.0  0.0
1   3.0  4.0  0.0  1.0
2   0.0  0.0  0.0  0.0
3   0.0  3.0  0.0  4.0
```

```python
df.dropna(how="all")
```

```output
     A    B   C    D
0   NaN  2.0 NaN  0.0
1   3.0  4.0 NaN  1.0
3   NaN  3.0 NaN  4.0
```

### Type conversion

```python
df = pd.DataFrame({"int": [1,2,3], "str": ["4", "5", "6"]})
df
```

```output
   int str
0    1   4
1    2   5
2    3   6
```

```python
df_float = df.astype(float)
df_float
```

```output
   int  str
0  1.0  4.0
1  2.0  5.0
2  3.0  6.0
```

```python
df_float.dtypes
```

```output
int    float64
str    float64
dtype: object
```

```python
pd.to_numeric(df.str)
```

```output
0    4.0
1    5.0
2    6.0
Name: str, dtype: float64
```

```python
df = pd.DataFrame({'year': [2015, 2016],
                'month': [2, 3],
                'day': [4, 5]})
pd.to_datetime(df)
```

```output
0   2015-02-04
1   2016-03-05
dtype: datetime64[ns]
```

### Binary operations

```python
df_1 = pd.DataFrame([[1,2,3],[4,5,6]])
df_2 = pd.DataFrame([[6,7,8]])
df_1.add(df_2)
```

```output
    0    1     2
0  7.0  9.0  11.0
1  NaN  NaN   NaN
```

```python
s1 = pd.Series([1, 2, 3])
s2 = pd.Series([2, 2, 2])
s1 + s2
```

```output
0    3
1    4
2    5
dtype: int64
```

```python
df = pd.DataFrame({"A": [1,2,3], "B": [4,5,6]})
df["A+B"] = df["A"] + df["B"]
df
```

```output
   A  B  A+B
0  1  4    5
1  2  5    7
2  3  6    9
```

### Aggregation

```python
df = pd.DataFrame([[1, 2, 3],
                [4, 5, 6],
                [7, 8, 9],
                [np.nan, np.nan, np.nan]],
                columns=['A', 'B', 'C'])
df.agg(['sum', 'min'])
```

```output
        A     B     C
sum  12.0  15.0  18.0
min   1.0   2.0   3.0
```

```python
df.median()
```

```output
A    4.0
B    5.0
C    6.0
dtype: float64
```

### Merge

```python
df1 = pd.DataFrame({'lkey': ['foo', 'bar', 'baz', 'foo'],
                    'value': [1, 2, 3, 5]})
df1
```

```output
  lkey  value
0  foo      1
1  bar      2
2  baz      3
3  foo      5
```

```python
df2 = pd.DataFrame({'rkey': ['foo', 'bar', 'baz', 'foo'],
                    'value': [5, 6, 7, 8]})
df2
```

```output
  rkey  value
0  foo      5
1  bar      6
2  baz      7
3  foo      8
```

```python
df1.merge(df2, left_on='lkey', right_on='rkey')
```

```output
  lkey  value_x rkey  value_y
0  foo        1  foo        5
1  foo        1  foo        8
2  bar        2  bar        6
3  baz        3  baz        7
4  foo        5  foo        5
5  foo        5  foo        8
```

```python
df = pd.DataFrame({'key': ['K0', 'K1', 'K2', 'K3', 'K4', 'K5'],
                'A': ['A0', 'A1', 'A2', 'A3', 'A4', 'A5']})
df
```

```output
  key   A
0  K0  A0
1  K1  A1
2  K2  A2
3  K3  A3
4  K4  A4
5  K5  A5
```

```python
other = pd.DataFrame({'key': ['K0', 'K1', 'K2'],
                    'B': ['B0', 'B1', 'B2']})
df.join(other, lsuffix='_caller', rsuffix='_other')
```

```output
  key_caller   A key_other     B
0         K0  A0        K0    B0
1         K1  A1        K1    B1
2         K2  A2        K2    B2
3         K3  A3      None  None
4         K4  A4      None  None
5         K5  A5      None  None
```

### Groupby

```python
df = pd.DataFrame({'Animal': ['Falcon', 'Falcon','Parrot', 'Parrot'],
               'Max Speed': [380., 370., 24., 26.]})

df
```

```output
   Animal  Max Speed
0  Falcon      380.0
1  Falcon      370.0
2  Parrot       24.0
3  Parrot       26.0
```

```python
df.groupby(['Animal']).mean()
```

```output
        Max Speed
Animal
Falcon      375.0
Parrot       25.0
```

For more information, see [GroupBy](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/groupby).

### Pivot

```python
df = pd.DataFrame({"A": ["foo", "foo", "foo", "foo", "foo",
                        "bar", "bar", "bar", "bar"],
                "B": ["one", "one", "one", "two", "two",
                        "one", "one", "two", "two"],
                "C": ["small", "large", "large", "small",
                        "small", "large", "small", "small",
                        "large"],
                "D": [1, 2, 2, 3, 3, 4, 5, 6, 7],
                "E": [2, 4, 5, 5, 6, 6, 8, 9, 9]})
df
```

```output
     A    B      C  D  E
0  foo  one  small  1  2
1  foo  one  large  2  4
2  foo  one  large  2  5
3  foo  two  small  3  5
4  foo  two  small  3  6
5  bar  one  large  4  6
6  bar  one  small  5  8
7  bar  two  small  6  9
8  bar  two  large  7  9
```

```python
pd.pivot_table(df, values='D', index=['A', 'B'],
                   columns=['C'], aggfunc="sum")
```

```output
    C    large  small
A   B
bar one    4.0      5
    two    7.0      6
foo one    4.0      1
    two    NaN      6
```

```python
df = pd.DataFrame({'foo': ['one', 'one', 'one', 'two', 'two', 'two'],
                'bar': ['A', 'B', 'C', 'A', 'B', 'C'],
                'baz': [1, 2, 3, 4, 5, 6],
                'zoo': ['x', 'y', 'z', 'q', 'w', 't']})
df
```

```output
   foo bar  baz zoo
0  one   A    1   x
1  one   B    2   y
2  one   C    3   z
3  two   A    4   q
4  two   B    5   w
5  two   C    6   t
```

## Resources

* [Snowpark pandas API](/developer-guide/snowpark/reference/python/latest/modin/index)
* [Quickstart: Getting Started with pandas on Snowflake](https://quickstarts.snowflake.com/guide/getting_started_with_pandas_on_snowflake/index.html)
* [Quickstart: Data Engineering Pipelines with Snowpark Python](https://quickstarts.snowflake.com/guide/data_engineering_pipelines_with_snowpark_pandas/#0)

---
title: Prerequisites for Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/prerequisites.md
section: Snowpark
---

# Prerequisites for Snowpark Java

## Java Virtual Machine (JVM)

The Snowpark API supports the following versions of Java:

* 11.x
* 17.x

---
title: Prerequisites for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/prerequisites.md
section: Snowpark
---

# Prerequisites for Snowpark Scala

## Scala

The Snowpark API is supported with the following versions of Scala:

[Preview Feature](../../../release-notes/preview-features.md) — Open

Support for version 2.13 is in preview. Available to all accounts.

* 2.13
* 2.12

For more information, see [Writing code to support different Scala versions](../../scala-version-differences.md).

## Java Virtual Machine (JVM) for Scala

For the JVM used with Scala, the Snowpark API supports the following versions of Java:

* 11.x
* 17.x

---
title: Profiling Snowpark Python stored procedure handlers
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/profiling-procedure-handlers.md
section: Snowpark
---

# Profiling Snowpark Python stored procedure handlers

You can discover how much time or memory was spent executing your handler code by using the built-in code profiler. The profiler generates
information describing how much time or memory was spent executing each line of the procedure handler.

Using the profiler, you can generate reports that focus on one of the following at a time:

* **Amount of time per line**, in which the report shows the number of times a line was executed, how long the execution took, and so on.
* **Amount of memory usage per line**, in which the report shows the amount of memory consumed per line.

The profiler saves the generated report to the Snowflake [internal user stage](../../../user-guide/data-load-overview.md) you specify.
You can read the profiler output using the `StoredProcedureProfiler.get_output`
function.

> **Note:**
>
> Profiling introduces performance overhead on Python execution and can affect the performance of the query.
> It’s intended for development and testing and should not be enabled on continuous production workloads.

## Required privileges

When a stored procedure is executed after the `StoredProcedureProfiler.set_active_profiler` function is called, Snowflake checks
the following privileges for the user executing the procedure:

* You must have read write privileges on the profiling output stage.
* If the profiled stored procedure is a [caller’s rights stored procedure](../../stored-procedure/stored-procedures-rights.md),
  you must use a role with USAGE privilege on the stored procedure.
* If the profiled stored procedure is an [owner’s rights stored procedure](../../stored-procedure/stored-procedures-rights.md),
  you must use a role with OWNERSHIP privilege on the stored procedure.

## Limitations

* Only stored procedures are supported. UDFs support is not available yet.
* Recursive profiling is not supported. Only top-level functions of the specified modules are profiled, while functions defined inside
  functions are not.
* Profiling stored procedures created on the client-side via the `snowflake.snowpark` API is not supported.
* Python functions running in parallel through `joblib` are not profiled.
* System defined stored procedures cannot be profiled. They produce no output.
* The profiling API must be used in the same thread as the procedure was called from.

## Usage

Once you’ve set up the profiler for use, you can use it simply by calling the stored procedure to generate profiler output. After the
procedure finishes executing, the profiler’s output is written to a file on the stage you specify. You can fetch the profiler output
using a system function, as described below.

Follow these steps in your code to set up and use the profiler:

1. Acquire a profiler object from the `Session` object.
2. Specify the Snowflake stage where profile output should be written.
3. Enable the profiler and set what the profile report should focus on.
4. Call the stored procedure.
5. View profiling output.

### Acquire profiler object

In Python, create a variable of type `StoredProcedureProfiler` with which to configure and run the profiler.

```python
# Create your session
session = Session.builder.configs(CONNECTION_PARAMETERS).create()

# Acquire profiler object
profiler = session.stored_procedure_profiler
```

### Specify the Snowflake stage where profile output should be written

Before running the profiler, you must specify a stage in which to save the output. To specify the stage, call
`StoredProcedureProfiler.set_target_stage`, specifying the fully-qualified name of an internal
[Snowflake stage](../../../user-guide/data-load-overview.md) to which the report should be written.

Keep in mind the following:

* The stage name must be a fully-qualified name.
* If the stage you put into this function does not exist, Snowflake creates a temporary stage with that name.
* If you want to preserve the profiler output outside of the scope of the session, create a permanent stage before executing
  `set_target_stage` and specify that permanent stage’s name in the function call.
* If you do not set a target stage with `set_target_stage`, Snowflake sets the current session’s temporary stage as the target
  stage. To discover that temporary stage, call `Session.get_session_stage`.

Code in the following example creates a temporary `profiler_output` stage to receive the profiler output.

```python
profiler.set_target_stage("mydb.myschema.profiler_output")
```

### Enable the profiler by specifying its focus

Use the `StoredProcedureProfiler.set_active_profiler` function, specifying a value indicating which kind of profile report you want
to generate.

* To have the profiler report on line use activity, set the parameter to the `LINE` value (case insensitive), as shown below:

  ```python
  profiler.set_active_profiler("LINE")
  ```
* To have the profiler report on memory use activity, set the parameter to the `MEMORY` value (case insensitive), as shown below:

  ```python
  profiler.set_active_profiler("MEMORY")
  ```

To disable the profiler, use the `StoredProcedureProfiler.disable` function.

### Call the stored procedure

After the profiler is enabled, [call your stored procedure](calling-functions.md).

```python
session.call("my_stored_procedure")
```

### View profiling output

At the end of execution, you can access the output using the `StoredProcedureProfiler.get_output` function.

```python
profiler.get_output()
```

## Including additional modules for profiling

When profiling, you can include modules that aren’t included by default.

By default, methods defined in the your module are profiled. These methods include the following:

* The handler method
* Methods defined in the module
* Methods imported from packages or other modules

To include additional modules for profiling, use the `StoredProcedureProfiler.register_modules` function, specifying the modules
you want to include.

Code in the following example registers modules module_A and module_B for profiling.

```python
profiler.register_modules(["module_A", "module_B"])
```

To unregister registered modules, use `register_modules` with no arguments, as in the following example.

```python
profiler.register_modules()
```

## Example

The following examples illustrate how to use the profiler to generate and retrieve a report of line usage.

Code in this example creates a procedure `profiler_test_proc`.

```sqlexample-python
CREATE OR REPLACE PROCEDURE profiler_test_proc()
RETURNS NUMBER
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'main'
AS
$$
from snowflake.snowpark.functions import col, udf

def main(session):
  df = session.sql("select 1")
  return df.collect()[0][0]
$$;
```

Code in the following example sets up a profiler, then profiles the `profiler_test_proc` procedure.

```python
profiler = profiler_session.stored_procedure_profiler
profiler.register_modules(["profiler_test_proc"])
profiler.set_target_stage(
  f"{db_parameters['database']}.{db_parameters['schema']}.{tmp_stage_name}"
)

profiler.set_active_profiler("LINE")

profiler_session.call("profiler_test_proc")
res = profiler.get_output()
print(res)

profiler.disable()
profiler.register_modules([])
```

The generated line profiler output looks like this:

```output
Handler Name: main
Python Runtime Version: 3.12
Modules Profiled: ['main_module']
Timer Unit: 0.001 s

Total Time: 0.0619571 s
File: _udf_code.py
Function: main at line 4

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
     4                                           def main(session):
     5         1          0.4      0.4      0.6      df = session.sql("select 1")
     6         1         61.6     61.6     99.4      return df.collect()[0][0]
```

---
title: Quick reference: Snowpark Java APIs for SQL commands
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/sql-to-snowpark.md
section: Snowpark
---

# Quick reference: Snowpark Java APIs for SQL commands

This topic provides a quick reference of some of the Snowpark APIs that correspond to SQL commands.

(Note that this is not a complete list of the APIs that correspond to SQL commands.)

## Performing queries

### Selecting columns

To select specific columns, use [select](../reference/java/com/snowflake/snowpark_java/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT id, name FROM sample_product_data; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfSelectedCols = df.select(Functions.col("id"), Functions.col("name"));  dfSelectedCols.show(); ``` |

### Renaming columns

To rename a column, use [as](../reference/java/com/snowflake/snowpark_java/Column.md) or [alias](../reference/java/com/snowflake/snowpark_java/Column.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT id AS item_id FROM sample_product_data; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfRenamedCol = df.select(Functions.col("id").as("item_id"));  dfRenamedCol.show(); ``` |
|  | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfRenamedCol = df.select(Functions.col("id").alias("item_id"));  dfRenamedCol.show(); ``` |

### Filtering data

To filter data, use [filter](../reference/java/com/snowflake/snowpark_java/DataFrame.md) or [where](../reference/java/com/snowflake/snowpark_java/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data WHERE id = 1; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfFilteredRows = df.filter(Functions.col("id").equal_to(Functions.lit(1)));  dfFilteredRows.show(); ``` |
|  | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfFilteredRows = df.where(Functions.col("id").equal_to(Functions.lit(1)));  dfFilteredRows.show(); ``` |

### Sorting data

To sort data, use [sort](../reference/java/com/snowflake/snowpark_java/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data ORDER BY category_id; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfSorted = df.sort(Functions.col("category_id"));  dfSorted.show(); ``` |

### Limiting the number of rows returned

To limit the number of rows returned, use [limit](../reference/java/com/snowflake/snowpark_java/DataFrame.md). See [Limiting the Number of Rows in a DataFrame](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data   ORDER BY category_id LIMIT 2; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfSorted = df.sort(Functions.col("category_id")).limit(2);  Row[] arrayRows = dfSorted.collect(); ``` |

### Performing joins

To perform a join, use [join](../reference/java/com/snowflake/snowpark_java/DataFrame.md) or [naturalJoin](../reference/java/com/snowflake/snowpark_java/DataFrame.md). See [Joining DataFrames](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_a   INNER JOIN sample_b   on sample_a.id_a = sample_b.id_a; ``` | ```java DataFrame dfLhs = session.table("sample_a");  DataFrame dfRhs = session.table("sample_b");  DataFrame dfJoined =   dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a")));  dfJoined.show(); ``` |
| ```sqlexample SELECT * FROM sample_a NATURAL JOIN sample_b; ``` | ```java DataFrame dfLhs = session.table("sample_a");  DataFrame dfRhs = session.table("sample_b");  DataFrame dfJoined = dfLhs.naturalJoin(dfRhs);  dfJoined.show(); ``` |

### Querying semi-structured data

To traverse semi-structured data, use [subField(“<field_name>”)](../reference/java/com/snowflake/snowpark_java/Column.md) and [subField(<index>)](../reference/java/com/snowflake/snowpark_java/Column.md). See
[Working with Semi-Structured Data](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT src:salesperson.name FROM car_sales; ``` | ```java DataFrame df = session.table("car_sales");  DataFrame dfJsonField =   df.select(Functions.col("src").subField("salesperson").subField("name"));  dfJsonField.show(); ``` |

### Grouping and aggregating data

To group data, use [groupBy](../reference/java/com/snowflake/snowpark_java/DataFrame.md). This returns a [RelationalGroupedDataFrame](../reference/java/com/snowflake/snowpark_java/RelationalGroupedDataFrame.md) object, which you can use to perform the aggregations.

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT category_id, count(*)   FROM sample_product_data GROUP BY category_id; ``` | ```java DataFrame df = session.table("sample_product_data");  DataFrame dfCountPerCategory = df.groupBy(Functions.col("category_id")).count();  dfCountPerCategory.show(); ``` |

### Calling window functions

To call a [window function](../../../user-guide/functions-window-using.md), use the [Window](../reference/java/com/snowflake/snowpark_java/Window.md) object methods to build a [WindowSpec](../reference/java/com/snowflake/snowpark_java/WindowSpec.md)
object, which in turn you can use for windowing functions (similar to using ‘<function> OVER … PARTITION BY … ORDER BY’).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT category_id, id, SUM(amount) OVER   (PARTITION BY category_id ORDER BY product_date)   FROM sample_product_data ORDER BY product_date; ``` | ```java WindowSpec window = Window.partitionBy(   Functions.col("category_id")).orderBy(Functions.col("product_date"));  DataFrame df = session.table("sample_product_data");  DataFrame dfCumulativePrices = df.select(   Functions.col("category_id"), Functions.col("product_date"),   Functions.sum(Functions.col("amount")).over(window)).sort(Functions.col("product_date"));  dfCumulativePrices.show(); ``` |

## Updating, deleting, and merging rows

To update, delete, and merge rows in a table, use [Updatable](../reference/java/com/snowflake/snowpark_java/Updatable.md). See [Updating, Deleting, and Merging Rows in a Table](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample UPDATE sample_product_data   SET serial_number = 'xyz' WHERE id = 12; ``` | ```java import java.util.HashMap; import java.util.Map; ...  Map<Column, Column> assignments = new HashMap<>();  assignments.put(Functions.col("serial_number"), Functions.lit("xyz"));  Updatable updatableDf = session.table("sample_product_data");  UpdateResult updateResult =   updatableDf.update(     assignments,     Functions.col("id").equal_to(Functions.lit(12)));  System.out.println("Number of rows updated: " + updateResult.getRowsUpdated()); ``` |
| ```sqlexample DELETE FROM sample_product_data   WHERE category_id = 50; ``` | ```java Updatable updatableDf = session.table("sample_product_data");  DeleteResult deleteResult =   updatableDf.delete(updatableDf.col("category_id").equal_to(Functions.lit(50)));  System.out.println("Number of rows deleted: " + deleteResult.getRowsDeleted()); ``` |
| ```sqlexample MERGE  INTO target_table USING source_table   ON target_table.id = source_table.id   WHEN MATCHED THEN     UPDATE SET target_table.description =       source_table.description; ``` | ```java import java.util.HashMap; import java.util.Map;  Map<String, Column> assignments = new HashMap<>(); assignments.put("description", source.col("description")); MergeResult mergeResult =   target.merge(source, target.col("id").equal_to(source.col("id")))   .whenMatched.updateColumn(assignments)   .collect(); ``` |

## Working with stages

For more information on working with stages, see [Working With Files in a Stage](working-with-dataframes.md).

### Uploading and Downloading Files from a Stage

To upload and download files from a stage, use [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md). See [Uploading and Downloading Files in a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample PUT file:///tmp/*.csv @myStage OVERWRITE = TRUE; ``` | ```java import java.util.HashMap; import java.util.Map; ... Map<String, String> putOptions = new HashMap<>();  putOptions.put("OVERWRITE", "TRUE");  PutResult[] putResults = session.file().put(   "file:///tmp/*.csv", "@myStage", putOptions);  for (PutResult result : putResults) {   System.out.println(result.getSourceFileName() + ": " + result.getStatus()); } ``` |
| ```sqlexample GET @myStage file:///tmp PATTERN = '.*.csv.gz'; ``` | ```java import java.util.HashMap; import java.util.Map; ... Map<String, String> getOptions = new HashMap<>();  getOptions.put("PATTERN", "'.*.csv.gz'");  GetResult[] getResults = session.file().get( "@myStage", "file:///tmp", getOptions);  for (GetResult result : getResults) {   System.out.println(result.getFileName() + ": " + result.getStatus()); } ``` |

### Reading data from files in a stage

To read data from files in a stage, use [DataFrameReader](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) to create a DataFrame for the data. See
[Setting Up a DataFrame for Files in a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE FILE FORMAT snowpark_temp_format TYPE = JSON;  SELECT "$1"[0]['salesperson']['name'] FROM (   SELECT $1::VARIANT AS "$1" FROM @mystage/car_sales.json(     FILE_FORMAT => 'snowpark_temp_format')) LIMIT 10;  DROP FILE FORMAT snowpark_temp_format; ``` | ```java DataFrame df = session.read().json(   "@mystage/car_sales.json").select(     Functions.col("$1").subField(0).subField("salesperson").subField("name"));  df.show(); ``` |

### Copying data from files in a stage to a table

To copy data from files in a stage to a table, use [DataFrameReader](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) to create a [CopyableDataFrame](../reference/java/com/snowflake/snowpark_java/CopyableDataFrame.md) for the data, and use the
[copyInto](../reference/java/com/snowflake/snowpark_java/CopyableDataFrame.md) method to copy the data to the table. See [Copying Data from Files into a Table](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample COPY INTO new_car_sales   FROM @mystage/car_sales.json   FILE_FORMAT = (TYPE = JSON); ``` | ```java CopyableDataFrame dfCopyableDf = session.read().json("@mystage/car_sales.json"); dfCopyableDf.copyInto("new_car_sales"); ``` |

### Saving a DataFrame to files on a stage

To save a DataFrame to files on a stage, use the [DataFrameWriter](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method named after the format of the files that you want to
use. See [Saving a DataFrame to Files on a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample COPY INTO @mystage/saved_data.json   FROM (  SELECT  *  FROM (car_sales) )   FILE_FORMAT = ( TYPE = JSON COMPRESSION = 'none' )   OVERWRITE = TRUE   DETAILED_OUTPUT = TRUE ``` | ```java DataFrame df = session.table("car_sales");  WriteFileResult writeFileResult = df.write().mode(   SaveMode.Overwrite).option(   "DETAILED_OUTPUT", "TRUE").option(   "compression", "none").json(   "@mystage/saved_data.json"); ``` |

## Creating and calling user-defined functions (UDFs)

To create an anonymous UDF, use [Functions.udf](../reference/java/com/snowflake/snowpark_java/Functions.md).

To create a temporary or permanent UDF that you can call by name, use [UDFRegistration.registerTemporary](../reference/java/com/snowflake/snowpark_java/UDFRegistration.md) or
[UDFRegistration.registerPermanent](../reference/java/com/snowflake/snowpark_java/UDFRegistration.md).

To call a permanent UDF by name, use [Functions.callUDF](../reference/java/com/snowflake/snowpark_java/Functions.md).

For details, see [Creating User-Defined Functions (UDFs) for DataFrames in Java](creating-udfs.md) and [Calling scalar user-defined functions (UDFs)](calling-functions.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE FUNCTION <temp_function_name>   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   <temp_function_name>(quantity) AS doublenum   FROM sample_product_data; ``` | ```java UserDefinedFunction doubleUdf =   Functions.udf(     (Integer x) -> x + x,     DataTypes.IntegerType,     DataTypes.IntegerType);  DataFrame df = session.table("sample_product_data");  DataFrame dfWithDoubleNum =   df.withColumn("doubleNum",     doubleUdf.apply(Functions.col("quantity")));  dfWithDoubleNum.show(); ``` |
| ```sqlexample CREATE FUNCTION <temp_function_name>   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   <temp_function_name>(quantity) AS doublenum   FROM sample_product_data; ``` | ```java UserDefinedFunction doubleUdf =   session     .udf()     .registerTemporary(       "doubleUdf",       (Integer x) -> x + x,       DataTypes.IntegerType,       DataTypes.IntegerType);  DataFrame df = session.table("sample_product_data");  DataFrame dfWithDoubleNum =   df.withColumn("doubleNum",     Functions.callUDF("doubleUdf", Functions.col("quantity"))); dfWithDoubleNum.show(); ``` |
| ```sqlexample CREATE FUNCTION doubleUdf(arg1 INT)   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   doubleUdf(quantity) AS doublenum   FROM sample_product_data; ``` | ```java UserDefinedFunction doubleUdf =   session     .udf()     .registerPermanent(       "doubleUdf",       (Integer x) -> x + x,       DataTypes.IntegerType,       DataTypes.IntegerType,       "mystage");  DataFrame df = session.table("sample_product_data");  DataFrame dfWithDoubleNum =   df.withColumn("doubleNum",     Functions.callUDF("doubleUdf", Functions.col("quantity"))); dfWithDoubleNum.show(); ``` |

## Creating and calling stored procedures

For a guide on creating stored procedures with Snowpark, see [Creating stored procedures for DataFrames in Java](creating-sprocs.md).

* To create an anonymous or named temporary procedure, use a `registerTemporary` method of [com.snowflake.snowpark_java.SProcRegistration](../reference/java/com/snowflake/snowpark_java/SProcRegistration.md).
* To create a named permanent procedure, use a `registerPermanent` method of the [com.snowflake.snowpark_java.SProcRegistration](../reference/java/com/snowflake/snowpark_java/SProcRegistration.md) class.
* To call a procedure, use the `storedProcedure` method of the [com.snowflake.snowpark_java.Session](../reference/java/com/snowflake/snowpark_java/Session.md) class.

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE PROCEDURE <temp_procedure_name>(x INTEGER, y INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN     RETURN x + y;   END   $$   ;  CALL <temp_procedure_name>(2, 3); ``` | ```java StoredProcedure sp =   session.sproc().registerTemporary((Session session, Integer x, Integer y) -> x + y,     new DataType[] {DataTypes.IntegerType, DataTypes.IntegerType},     DataTypes.IntegerType);    session.storedProcedure(sp, 2, 3).show(); ``` |
| ```sqlexample CREATE PROCEDURE sproc(x INTEGER, y INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN    RETURN x + y;   END   $$   ;  CALL sproc(2, 3); ``` | ```java String name = "sproc";  StoredProcedure sp =   session.sproc().registerTemporary(name,     (Session session, Integer x, Integer y) -> x + y,     new DataType[] {DataTypes.IntegerType, DataTypes.IntegerType},     DataTypes.IntegerType);    session.storedProcedure(name, 2, 3).show(); ``` |
| ```sqlexample CREATE PROCEDURE add_hundred(x INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN    RETURN x + 100;   END   $$   ;  CALL add_hundred(3); ``` | ```java String name = "add_hundred"; String stageName = "sproc_libs";  StoredProcedure sp =   session.sproc().registerPermanent(     name,     (Session session, Integer x) -> x + 100,     DataTypes.IntegerType,     DataTypes.IntegerType,     stageName,     true);    session.storedProcedure(name, 3).show(); ``` |

---
title: Quick reference: Snowpark Scala APIs for SQL commands
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/sql-to-snowpark.md
section: Snowpark
---

# Quick reference: Snowpark Scala APIs for SQL commands

This topic provides a quick reference of some of the Snowpark APIs that correspond to SQL commands.

(Note that this is not a complete list of the APIs that correspond to SQL commands.)

## Performing queries

### Selecting columns

To select specific columns, use [DataFrame.select](../reference/scala/com/snowflake/snowpark/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT id, name FROM sample_product_data; ``` | ```scala val dfSelectedCols = df.select(col("id"), col("name")) dfSelectedCols.show() ``` |

### Renaming columns

To rename a column, use [Column.as](../reference/scala/com/snowflake/snowpark/Column.md), [Column.alias](../reference/scala/com/snowflake/snowpark/Column.md), or [Column.name](../reference/scala/com/snowflake/snowpark/Column.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT id AS item_id FROM sample_product_data; ``` | ```scala val dfRenamedCol = df.select(col("id").as("item_id")) dfRenamedCol.show() ``` |
|  | ```scala val dfRenamedCol = df.select(col("id").alias("item_id")) dfRenamedCol.show() ``` |
|  | ```scala val dfRenamedCol = df.select(col("id").name("item_id")) dfRenamedCol.show() ``` |

### Filtering data

To filter data, use [DataFrame.filter](../reference/scala/com/snowflake/snowpark/DataFrame.md) or [DataFrame.where](../reference/scala/com/snowflake/snowpark/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data WHERE id = 1; ``` | ```scala val dfFilteredRows = df.filter((col("id") === 1)) dfFilteredRows.show() ``` |
|  | ```scala val dfFilteredRows = df.where((col("id") === 1)) dfFilteredRows.show() ``` |

### Sorting data

To sort data, use [DataFrame.sort](../reference/scala/com/snowflake/snowpark/DataFrame.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data ORDER BY category_id; ``` | ```scala val dfSorted = df.sort(col("category_id")) dfSorted.show() ``` |

### Limiting the number of rows returned

To limit the number of rows returned, use [DataFrame.limit](../reference/scala/com/snowflake/snowpark/DataFrame.md). See [Limiting the Number of Rows in a DataFrame](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_product_data   ORDER BY category_id LIMIT 2; ``` | ```scala val dfSorted = df.sort(col("category_id")).limit(2); val arrayRows = dfSorted.collect() ``` |

### Performing joins

To perform a join, use [DataFrame.join](../reference/scala/com/snowflake/snowpark/DataFrame.md) or [DataFrame.naturalJoin](../reference/scala/com/snowflake/snowpark/DataFrame.md). See [Joining DataFrames](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT * FROM sample_a   INNER JOIN sample_b   on sample_a.id_a = sample_b.id_a; ``` | ```scala val dfJoined =   dfLhs.join(dfRhs, dfLhs.col("id_a") === dfRhs.col("id_a")) dfJoined.show() ``` |
| ```sqlexample SELECT * FROM sample_a NATURAL JOIN sample_b; ``` | ```scala val dfJoined = dfLhs.naturalJoin(dfRhs) dfJoined.show() ``` |

### Querying semi-structured data

To traverse semi-structured data, use [Column.apply(“<field_name>”)](../reference/scala/com/snowflake/snowpark/Column.md) and [Column.apply(<index>)](../reference/scala/com/snowflake/snowpark/Column.md). See
[Working with Semi-Structured Data](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT src:salesperson.name FROM car_sales; ``` | ```scala dfJsonField =   df.select(col("src")("salesperson")("name")) dfJsonField.show() ``` |

### Grouping and aggregating data

To group data, use [DataFrame.groupBy](../reference/scala/com/snowflake/snowpark/DataFrame.md). This returns a [RelationalGroupedDataFrame](../reference/scala/com/snowflake/snowpark/RelationalGroupedDataFrame.md) object, which you can use to perform the
aggregations.

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT category_id, count(*)   FROM sample_product_data GROUP BY category_id; ``` | ```scala val dfCountPerCategory = df.groupBy(col("category")).count() dfCountPerCategory.show() ``` |

### Calling window functions

To call a [window function](../../../user-guide/functions-window-using.md), use the [Window](../reference/scala/com/snowflake/snowpark/Window$.md) object methods to build a [WindowSpec](../reference/scala/com/snowflake/snowpark/WindowSpec.md)
object, which in turn you can use for windowing functions (similar to using ‘<function> OVER … PARTITION BY … ORDER BY’).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample SELECT category_id, price_date, SUM(amount) OVER   (PARTITION BY category_id ORDER BY price_date)   FROM prices ORDER BY price_date; ``` | ```scala val window = Window.partitionBy(   col("category")).orderBy(col("price_date")) val dfCumulativePrices = dfPrices.select(   col("category"), col("price_date"),   sum(col("amount")).over(window)).sort(col("price_date")) dfCumulativePrices.show() ``` |

## Updating, deleting, and merging rows

To update, delete, and merge rows in a table, use [Updatable](../reference/scala/com/snowflake/snowpark/Updatable.md). See [Updating, Deleting, and Merging Rows in a Table](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample UPDATE sample_product_data   SET serial_number = 'xyz' WHERE id = 12; ``` | ```scala val updateResult =   updatableDf.update(     Map("serial_number" -> lit("xyz")),     col("id") === 12) ``` |
| ```sqlexample DELETE FROM sample_product_data   WHERE category_id = 50; ``` | ```scala val deleteResult =   updatableDf.delete(updatableDf("category_id") === 50) ``` |
| ```sqlexample MERGE  INTO target_table USING source_table   ON target_table.id = source_table.id   WHEN MATCHED THEN     UPDATE SET target_table.description =       source_table.description; ``` | ```scala val mergeResult =    target.merge(source, target("id") === source("id"))   .whenMatched.update(Map("description" -> source("description")))   .collect() ``` |

## Working with stages

For more information on working with stages, see [Working With Files in a Stage](working-with-dataframes.md).

### Uploading and downloading files from a stage

To upload and download files from a stage, use [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md). See [Uploading and Downloading Files in a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample PUT file:///tmp/*.csv @myStage OVERWRITE = TRUE; ``` | ```scala val putOptions = Map("OVERWRITE" -> "TRUE") val putResults = session.file.put(   "file:///tmp/*.csv", "@myStage", putOptions) ``` |
| ```sqlexample GET @myStage file:///tmp PATTERN = '.*.csv.gz'; ``` | ```scala val getOptions = Map("PATTERN" -> s"'.*.csv.gz'") val getResults = session.file.get(  "@myStage", "file:///tmp", getOptions) ``` |

### Reading data from files in a stage

To read data from files in a stage, use [DataFrameReader](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) to create a DataFrame for the data. See
[Setting Up a DataFrame for Files in a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE FILE FORMAT snowpark_temp_format TYPE = JSON; SELECT "$1"[0]['salesperson']['name'] FROM (   SELECT $1::VARIANT AS "$1" FROM @mystage/car_sales.json(     FILE_FORMAT => 'snowpark_temp_format')) LIMIT 10; DROP FILE FORMAT snowpark_temp_format; ``` | ```scala val df = session.read.json(   "@mystage/car_sales.json").select(     col("$1")(0)("salesperson")("name")) df.show(); ``` |

### Copying data from files in a stage to a table

To copy data from files in a stage to a table, use [DataFrameReader](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) to create a [CopyableDataFrame](../reference/scala/com/snowflake/snowpark/CopyableDataFrame.md) for the data, and use the
[CopyableDataFrame.copyInto](../reference/scala/com/snowflake/snowpark/CopyableDataFrame.md) method to copy the data to the table. See [Copying Data from Files into a Table](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample COPY INTO new_car_sales   FROM @mystage/car_sales.json   FILE_FORMAT = (TYPE = JSON); ``` | ```scala val dfCopyableDf = session.read.json("@mystage/car_sales.json") dfCopyableDf.copyInto("new_car_sales") ``` |

### Saving a DataFrame to files on a stage

To save a DataFrame to files on a stage, use the [DataFrameWriter](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method named after the format of the files that you want to
use. See [Saving a DataFrame to Files on a Stage](working-with-dataframes.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample COPY INTO @mystage/saved_data.json   FROM (  SELECT  *  FROM (car_sales) )   FILE_FORMAT = ( TYPE = JSON COMPRESSION = 'none' )   OVERWRITE = TRUE   DETAILED_OUTPUT = TRUE ``` | ```scala val df = session.table("car_sales") val writeFileResult = df.write.mode(   SaveMode.Overwrite).option(   "DETAILED_OUTPUT", "TRUE").option(   "compression", "none").json(   "@mystage/saved_data.json") ``` |

## Creating and calling user-defined functions (UDFs)

To create a Scala function that serves as a UDF (an anonymous UDF), use [udf](../reference/scala/com/snowflake/snowpark/functions$.md).

To create a temporary or permanent UDF that you can call by name, use [UDFRegistration.registerTemporary](../reference/scala/com/snowflake/snowpark/UDFRegistration.md) or
[UDFRegistration.registerPermanent](../reference/scala/com/snowflake/snowpark/UDFRegistration.md).

To call a permanent UDF by name, use [callUDF](../reference/scala/com/snowflake/snowpark/functions$.md).

For details, see [Creating User-Defined Functions (UDFs) for DataFrames in Scala](creating-udfs.md) and [Calling scalar user-defined functions (UDFs)](calling-functions.md).

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE FUNCTION <temp_function_name>   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   <temp_function_name>(amount) AS doublenum   FROM sample_product_data; ``` | ```scala val doubleUdf = udf((x: Int) => x + x) val dfWithDoubleNum = df.withColumn(  "doubleNum", doubleUdf(col("amount"))) dfWithDoubleNum.show() ``` |
| ```sqlexample CREATE FUNCTION <temp_function_name>   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   <temp_function_name>(amount) AS doublenum   FROM sample_product_data; ``` | ```scala session.udf.registerTemporary(   "doubleUdf", (x: Int) => x + x) val dfWithDoubleNum = df.withColumn(  "doubleNum", callUDF("doubleUdf", (col("amount")))) dfWithDoubleNum.show() ``` |
| ```sqlexample CREATE FUNCTION doubleUdf(arg1 INT)   RETURNS INT   LANGUAGE JAVA   ...   AS   ...;  SELECT ...,   doubleUdf(amount) AS doublenum   FROM sample_product_data; ``` | ```scala session.udf.registerPermanent(   "doubleUdf", (x: Int) => x + x, "mystage") val dfWithDoubleNum = df.withColumn(  "doubleNum", callUDF("doubleUdf", (col("amount")))) dfWithDoubleNum.show() ``` |

## Creating and calling stored procedures

For a guide on creating stored procedures with Snowpark, see [Creating stored procedures for DataFrames in Scala](creating-sprocs.md).

* To create an anonymous or named temporary procedure, use a `registerTemporary` methods of [com.snowflake.snowpark.SProcRegistration](../reference/scala/com/snowflake/snowpark/SProcRegistration.md).
* To create a named permanent procedure, use a `registerPermanent` method of the [com.snowflake.snowpark.SProcRegistration](../reference/scala/com/snowflake/snowpark/SProcRegistration.md) class.
* To call a procedure, use the `storedProcedure` method of the [com.snowflake.snowpark.Session](../reference/scala/com/snowflake/snowpark/Session.md) class.

| Example of a SQL Statement | Example of Snowpark Code |
| --- | --- |
| ```sqlexample CREATE PROCEDURE <temp_procedure_name>(x INTEGER, y INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN    RETURN x + y;   END   $$   ;  CALL <temp_procedure_name>(2, 3); ``` | ```scala StoredProcedure sp =   session.sproc().registerTemporary((Session session, Integer x, Integer y) -> x + y,     new DataType[] {DataTypes.IntegerType, DataTypes.IntegerType},     DataTypes.IntegerType);    session.storedProcedure(sp, 2, 3).show(); ``` |
| ```sqlexample CREATE PROCEDURE sproc(x INTEGER, y INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN    RETURN x + y;   END   $$   ;  CALL sproc(2, 3); ``` | ```scala String name = "sproc"; StoredProcedure sp =   session.sproc().registerTemporary(name,     (Session session, Integer x, Integer y) -> x + y,     new DataType[] {DataTypes.IntegerType, DataTypes.IntegerType},     DataTypes.IntegerType);    session.storedProcedure(name, 2, 3).show(); ``` |
| ```sqlexample CREATE PROCEDURE add_hundred(x INTEGER)   RETURNS INTEGER   LANGUAGE JAVA   ...   AS   $$   BEGIN    RETURN x + 100;   END   $$   ;  CALL add_hundred(3); ``` | ```scala val name: String = "add_hundred" val stageName: String = "sproc_libs"  val sp: StoredProcedure =   session.sproc.registerPermanent(     name,     (session: Session, x: Int) => x + 100,     stageName,     true   )  session.storedProcedure(name, 3).show ``` |

---
title: Setting Up a Jupyter Notebook for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/quickstart-jupyter.md
section: Snowpark
---

# Setting Up a Jupyter Notebook for Snowpark Scala

This topic explains how to set up a Jupyter notebook for Snowpark.

## Setting Up Jupyter Notebooks for Scala Development

Make sure that Jupyter is set up to use Scala. For example, you can
[install the Almond kernel](https://almond.sh/docs/quick-start-install).

> **Note:**
>
> When using `coursier` to install the Almond kernel, specify a
> [supported version of Scala](prerequisites.md).

## Creating a New Notebook in a New Folder

The Snowpark library requires access to the directory that contains classes generated by the Scala REPL. If you are planning to
use multiple notebooks, you must use a separate REPL class directory for each notebook.

To make it easier to set up a separate REPL class directory for each notebook, create a separate folder for each notebook:

1. In the [Notebook Dashboard](https://jupyter-notebook.readthedocs.io/en/latest/ui_components.html#notebook-dashboard),
   click New » Folder to create a new folder for a notebook.
2. Select the checkbox next to the folder, click Rename, and assign a new name for the folder.
3. Click the link for the folder to navigate into the folder.
4. Click New » Scala to create a new notebook in that folder.

## Configuring the Jupyter Notebook for Snowpark

Next, configure the Jupyter notebook for Snowpark.

1. In a new cell, run the following commands to define a variable for a directory:

   ```scala
   val replClassPathObj = os.Path("replClasses", os.pwd)
   if (!os.exists(replClassPathObj)) os.makeDir(replClassPathObj)
   val replClassPath = replClassPathObj.toString()
   ```

   This does the following:

   * Defines a [os.Path](https://github.com/com-lihaoyi/os-lib#ospath) variable and a `String` variable for a directory
     for classes generated by the Scala REPL.
   * Creates that directory, if that directory does not already exist.

   The Scala REPL generates classes for the Scala code that you write, including your code that defines UDFs. The Snowpark
   library uses this directory to find and upload the classes for your UDFs that are generated by the REPL.

   > **Note:**
   >
   > If you are using multiple notebooks, you’ll need to create and configure a separate REPL class directory for each notebook.
   > For simplicity, you can just put each notebook in a separate folder, as explained in
   > Creating a New Notebook in a New Folder.
2. Run the following commands in a cell to configure the compiler for the Scala REPL:

   ```scala
   interp.configureCompiler(_.settings.outputDirs.setSingleOutput(replClassPath))
   interp.configureCompiler(_.settings.Yreplclassbased)
   interp.load.cp(replClassPathObj)
   ```

   This does the following:

   * Configures the compiler to generate classes for the REPL in the directory that you created earlier.
   * Configures the compiler to wrap code entered in the REPL in classes, rather than in objects.
   * Adds the directory that you created earlier as a dependency of the REPL interpreter.
3. [Create a new session in Snowpark](creating-session.md), and add the REPL class directory that you created
   earlier as a dependency. For example:

   ```
   // Import the Snowpark library from Maven.
   import $ivy.`com.snowflake:snowpark_2.12:1.18.0`

   import com.snowflake.snowpark._
   import com.snowflake.snowpark.functions._

   val session = Session.builder.configs(Map(
       "URL" -> "https://<account_identifier>.snowflakecomputing.com",
       "USER" -> "<username>",
       "PASSWORD" -> "<password>",
       "ROLE" -> "<role_name>",
       "WAREHOUSE" -> "<warehouse_name>",
       "DB" -> "<database_name>",
       "SCHEMA" -> "<schema_name>"
   )).create

   // Add the directory for REPL classes that you created earlier.
   session.addDependency(replClassPath)
   ```

   See [Creating a Session for Snowpark Scala](creating-session.md) for an explanation of the `Map` keys.
4. Run the following commands in a cell to add the Ammonite kernel classes as
   [dependencies for your UDF](creating-udfs.md):

   ```scala
   def addClass(session: Session, className: String): String = {
     var cls1 = Class.forName(className)
     val resourceName = "/" + cls1.getName().replace(".", "/") + ".class"
     val url = cls1.getResource(resourceName)
     val path = url.getPath().split(":").last.split("!").head
     session.addDependency(path)
     path
   }
   addClass(session, "ammonite.repl.ReplBridge$")
   addClass(session, "ammonite.interp.api.APIHolder")
   addClass(session, "pprint.TPrintColors")
   ```

   > **Note:**
   >
   > If you plan to create UDFs that have dependencies that are available through Maven, you can use the `addClass` method
   > defined above to add those dependencies:
   >
   > ```scala
   > addClass(session, "<dependency_package>.<dependency_class>")
   > ```
   >
   > If you need to specify a dependency in a JAR file, call `interp.load.cp` to load the JAR file for the REPL interpreter,
   > and call `session.addDependency` to add the JAR file as a dependency for your UDFs:
   >
   > ```scala
   > interp.load.cp(os.Path(<path to jar file>/<jar file>))
   > addDependency(<path to jar file>/<jar file>)
   > ```

## Verifying Your Jupyter Notebook Configuration

Run the following commands in a cell to verify that you can define and call an anonymous
user-defined function (UDF):

```scala
class UDFCode extends Serializable {
  val appendLastNameFunc = (s: String) => {
    s"$s Johnson"
  }
}
// Define an anonymous UDF.
val appendLastNameUdf = udf((new UDFCode).appendLastNameFunc)
// Create a DataFrame that has a column NAME with a single row with the value "Raymond".
val df = session.sql("select 'Raymond' NAME")
// Call the UDF, passing in the values in the NAME column.
// Return a new DataFrame that has an additional column "Full Name" that contains the value returned by the UDF.
df.withColumn("Full Name", appendLastNameUdf(col("NAME"))).show()
```

## Troubleshooting

### value res<n> is not a member of ammonite.$sess.cmd<n>.wrapper.Helper

If the following error occurs:

```none
value res<n> is not a member of ammonite.$sess.cmd<n>.wrapper.Helper
```

Delete the contents of the directory containing the REPL classes (the directory with the path specified by the `replClassPath`
variable), and restart the notebook server.

---
title: Setting up a Python development environment
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-setup-python-environment.md
section: Snowpark
---

# Setting up a Python development environment

To use Snowpark Checkpoints, set up a Python development environment with one of these supported versions:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

> **Note:**
>
> * Python 3.9 (deprecated) depends on Snowpark client version 1.5.0.
> * Python 3.10 depends on Snowpark client version 1.5.1.
> * Python 3.11 depends on Snowpark client version 1.9.0.

You can create a Python virtual environment for a particular Python version using tools like
[Anaconda](https://www.anaconda.com/),
[Miniconda](https://docs.conda.io/en/latest/miniconda.html), or
[virtualenv](https://docs.python.org/3/tutorial/venv.html).

---
title: Setting up an IDE for using Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-ide.md
section: Snowpark
---

# Setting up an IDE for using Snowpark Checkpoints

The [Snowflake Extension for Visual Studio Code](../../../user-guide/vscode-ext.md) offers support for the Snowpark Checkpoints library to enhance the experience of using the framework. It gives you fine-grained control over the `collect` and `validate` statements inserted into your code and also reviews the status of the behavioral-equivalence assertions of your converted code.

## Enable Snowpark Checkpoints

* To enable Snowpark Checkpoints, in the Snowflake extension settings, select Snowpark Checkpoints: Enabled:

### View

Setting the Snowpark Checkpoints property to Enabled opens a new tab in the extension called SNOWPARK CHECKPOINTS that displays all checkpoints in the workspace and enables multiple actions to be performed, such as enabling/disabling all or individual checkpoints or clearing all checkpoints from files. Double-clicking each checkpoint navigates to the file and line of code where it is defined.

### Toggle all checkpoints

* To enable or disable all checkpoints, select this control in the upper-right corner of the Snowpark Checkpoints tab:

Enabled checkpoints:

Disabled checkpoints are skipped at runtime:

### Remove all checkpoints

* To remove checkpoints from all Python files, including Jupyter notebooks, in your workspace, select this control in the upper-right corner of the Snowpark Checkpoints tab:

The control does not remove the checkpoints from the contract and panel. They can be restored by using the command `Snowflake: Restore All Checkpoints`.

### Insert a checkpoint in a file

* To insert a checkpoint in a file, right-click inside a file, and on the Snowpark Checkpoints menu, select Add Collection Checkpoint or Add Validation Checkpoint.

Snowpark Checkpoints menu and options:

Collector/Validator added:

### Run a single checkpoint

* To run a single checkpoint, select the code lens option displayed above the checkpoint:

Running the checkpoint will open an output console that displays the progress and then the results view.
Even if the checkpoint is disabled in the contract file, it will be enabled for its execution.

If an entry point is not declared in the contract file, the following error message is displayed: *Entry point not found for the checkpoint.*

### Run all enabled Snowpark Checkpoints in a file

* To run all the enabled Checkpoints in a file, in the upper-right corner of the file, select Run all checkpoints from the current file:

An output channel displays the progress:

### Timeline view

Displays a timeline of the checkpoints execution results.

### Commands

The following commands are available for Snowpark Checkpoints. To use them, enter `Snowflake: [command name]` into the command palette.

| Command | Description |
| --- | --- |
| Snowflake: Toggle Checkpoints | Toggles the enabled property of all checkpoints. |
| Snowflake: Snowpark Checkpoints Project Initialization | Triggers project initialization, creating a contract file if it doesn’t exist. If the file exists, a display asks whether you want to load the checkpoint into the contract file. |
| Snowflake: Clear All Checkpoints | Deletes all checkpoints from all files in the workspace. |
| Snowflake: Restore All Checkpoints | Restore checkpoints previously deleted from files that are still present in the contract file. |
| Snowflake: Add Validation/Collection Checkpoint | Adds a validator or collector with its mandatory parameters at the cursor position. |
| Snowflake: Focus on Snowpark Checkpoints View | Shifts focus to the panel SNOWPARK CHECKPOINTS. |
| Snowflake: Open Checkpoints Timeline | Displays a timeline of Checkpoints executions. |
| Snowflake: Run all Checkpoints from the current file | Runs all enabled checkpoints in the current file. |
| Snowflake: Run all Checkpoints in the workspace | Runs all enabled checkpoints from the workspace. |
| Snowflake: Show All Snowpark Checkpoints Result | Displays a tab with all checkpoints results. |

### Warnings

* **Duplicate:** In a *collection* project, if two checkpoints have the same name, this warning is displayed: *“Another checkpoint with an identical name has been detected and will be overwritten.”*

  *Validation* projects can have multiple checkpoints with the same name, so no warning is displayed.
* **Wrong type:** If you add a checkpoint with a different type than the project type, it is underlined, and this error message is displayed: *“Please make sure you are using the correct Snowpark-Checkpoints statement. This particular checkpoint statement is different from the others used in this project, statements that don’t match the project type will be ignored when executed.”*
* **Invalid checkpoint name:** There are invalid ways to add a checkpoint name parameter. If this happens, a warning is displayed: *“Invalid checkpoint name. Checkpoint names must start with a letter and can only contain letters, numbers, hyphens, and underscores”*.

---
title: Setting Up IntelliJ IDEA CE for Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/quickstart-intellij.md
section: Snowpark
---

# Setting Up IntelliJ IDEA CE for Snowpark Java

This topic explains how to set up IntelliJ IDEA CE for Snowpark.

## Creating a New Maven Project in IntelliJ IDEA

Create a new Maven project for Snowpark.

1. Choose File » New » Project.
2. From the Project SDK menu, select 11 (for Java version 11).

   Note that you don’t need to select an archetype. You can just leave the Create from archetype box unchecked.
3. Click Next.
4. Enter a name and location for your project (e.g. `hello-snowpark`).
5. Click Finish to create the new project.

## Configuring the IntelliJ IDEA Project for Snowpark

Next, configure the project for Snowpark.

1. Open the `pom.xml` file for the project.
2. In the `<project>` tag, add the tags to specify a dependency on the Snowpark library:

   ```
   <dependencies>
     ...
     <dependency>
       <groupId>com.snowflake</groupId>
       <artifactId>snowpark_2.12</artifactId>
       <version>1.18.0</version>
     </dependency>
     ...
   </dependencies>
   ```
3. Save the changes to the `pom.xml` file.
4. Update your Maven repositories.

   See [Update Maven repositories](https://www.jetbrains.com/help/idea/troubleshooting-common-maven-issues.html#5e1bf655).

## Verifying Your IntelliJ IDEA Project Configuration

To verify that you have configured your project to use Snowpark, run a simple example of Snowpark code.

1. In the Project tool window on the
   [left](https://www.jetbrains.com/help/idea/2020.3/guided-tour-around-the-user-interface.html),
   expand your project, expand the `src/main` folders, and select the `java` folder.
2. Right-click on the folder, and choose New » Java class.
3. In the New Java Class dialog box, enter the name “HelloSnowpark”, select Class, and press the Enter key.
4. In the `HelloSnowpark.java` file, replace the contents with the code below:

   ```java
   import com.snowflake.snowpark_java.*;
   import java.util.HashMap;
   import java.util.Map;

   public class HelloSnowpark {
     public static void main(String[] args) {
       // Replace the <placeholders> below.
       Map<String, String> properties = new HashMap<>();
       properties.put("URL", "https://<account_identifier>.snowflakecomputing.com:443");
       properties.put("USER", "<user name>");
       properties.put("PASSWORD", "<password>");
       properties.put("ROLE", "<role name>");
       properties.put("WAREHOUSE", "<warehouse name>");
       properties.put("DB", "<database name>");
       properties.put("SCHEMA", "<schema name>");
       Session session = Session.builder().configs(properties).create();
       session.sql("show tables").show();
     }
   }
   ```

   Note the following:

   * Replace the `<placeholders>` with values that you use to connect to Snowflake.
   * For `<account_identifier>`, specify your [account identifier](../../../user-guide/admin-account-identifier.md).
   * If you prefer to use [key pair authentication](../../../user-guide/key-pair-auth.md):

     + Replace `PASSWORD` with `PRIVATE_KEY_FILE`, and set it to the path to your private key file.
     + If the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key.

     As an alternative to setting `PRIVATE_KEY_FILE` and `PRIVATE_KEY_FILE_PWD`, you can set the `PRIVATEKEY`
     property to the string value of the unencrypted private key from the private key file.

     + For example, if your private key file is unencrypted, set this to the value of the key in the file (without the
       `-----BEGIN PRIVATE KEY-----` and `-----END PRIVATE KEY-----` header and footer and without the line endings).
     + Note that if the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY`
       property.
5. Click the green arrow next to the `Class` line, and choose Run HelloSnowpark.main() to run the example.

---
title: Setting Up IntelliJ IDEA CE for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/quickstart-intellij.md
section: Snowpark
---

# Setting Up IntelliJ IDEA CE for Snowpark Scala

This topic explains how to set up IntelliJ IDEA CE for Snowpark.

## Setting Up IntelliJ IDEA CE for Scala Development

To use Scala in IntelliJ IDEA CE, you need to install the Scala plugin. See the
[Installation section](https://docs.scala-lang.org/getting-started/intellij-track/getting-started-with-scala-in-intellij.html#installation)
of the tutorial
[Getting Started with Scala in IntelliJ IDEA](https://docs.scala-lang.org/getting-started/intellij-track/getting-started-with-scala-in-intellij.html).

## Creating a New Scala Project in IntelliJ IDEA

Next, create a new Scala project for Snowpark.

1. Choose File » New » Project.

   1. In the list on the left, select Scala.
   2. In the list on the right, select sbt.
   3. Click Next.
2. Fill in the details for your new project.

   For the JDK and Scala SDK, select the [JDK and Scala versions supported for use with Snowpark](prerequisites.md).
3. Click Finish to create the new project.

## Configuring the IntelliJ IDEA Project for Snowpark

Next, configure the project for Snowpark.

1. In the Project tool window on the
   [left](https://www.jetbrains.com/help/idea/2020.3/guided-tour-around-the-user-interface.html), double-click on the
   `build.sbt` file for your project.

   In the `build.sbt` file for your project, make the following changes:

   1. If the `scalaVersion` setting does not match the version that you plan to use, update the setting. For example:

      ```scala
      scalaVersion := "2.12.20"
      ```

      Note that you must use a
      [Scala version that is supported for use with the Snowpark library](prerequisites.md).
   2. Add the Snowpark library to the list of dependencies. For example:

      ```
      libraryDependencies += "com.snowflake" % "snowpark_2.12" % "1.18.0"
      ```
2. Save the changes to the `build.sbt` file.
3. Update your Maven repositories.

   See [Update Maven repositories](https://www.jetbrains.com/help/idea/troubleshooting-common-maven-issues.html#5e1bf655).
4. Reload the SBT project:

   1. Choose View » Tool Windows » sbt to display the sbt Tool window.
   2. Right-click on the project name, and choose Reload sbt Project.

   This causes IntelliJ IDEA CE to download the Snowpark library and makes the API available for use in your code.

## Verifying Your IntelliJ IDEA Project Configuration

To verify that you have configured your project to use Snowpark, run a simple example of Snowpark code.

1. In the Project tool window on the
   [left](https://www.jetbrains.com/help/idea/2020.3/guided-tour-around-the-user-interface.html),
   expand your project, expand the `src/main` folders, and select the `scala` folder.
2. Right-click on the folder, and choose New » Scala class.
3. In the Create New Scala Class dialog box, enter the name “Main”, select Object, and press the Enter key.
4. In the `Main.scala` file, replace the contents with the code below:

   ```scala
   import com.snowflake.snowpark._
   import com.snowflake.snowpark.functions._

   object Main {
     def main(args: Array[String]): Unit = {
       // Replace the <placeholders> below.
       val configs = Map (
         "URL" -> "https://<account_identifier>.snowflakecomputing.com:443",
         "USER" -> "<user name>",
         "PASSWORD" -> "<password>",
         "ROLE" -> "<role name>",
         "WAREHOUSE" -> "<warehouse name>",
         "DB" -> "<database name>",
         "SCHEMA" -> "<schema name>"
       )
       val session = Session.builder.configs(configs).create
       session.sql("show tables").show()
     }
   }
   ```

   Note the following:

   * Replace the `<placeholders>` with values that you use to connect to Snowflake.
   * For `<account_identifier>`, specify your [account identifier](../../../user-guide/admin-account-identifier.md).
   * If you prefer to use [key pair authentication](../../../user-guide/key-pair-auth.md):

     + Replace `PASSWORD` with `PRIVATE_KEY_FILE`, and set it to the path to your private key file.
     + If the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key.

     As an alternative to setting `PRIVATE_KEY_FILE` and `PRIVATE_KEY_FILE_PWD`, you can set the `PRIVATEKEY`
     property to the string value of the unencrypted private key from the private key file.

     + For example, if your private key file is unencrypted, set this to the value of the key in the file (without the
       `-----BEGIN PRIVATE KEY-----` and `-----END PRIVATE KEY-----` header and footer and without the line endings).
     + Note that if the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY`
       property.
   * If you plan to create UDFs:

     + Don’t set up your `object` to extend the `App` trait. For details, see
       [Caveat About Creating UDFs in an Object With the App Trait](creating-udfs.md).
     + Don’t set up your `object` to extend a class or trait that is not serializable.
5. Click the green arrow next to the `Object` line, and choose Run Main to run the example.

---
title: Setting Up Other Development Environments for Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/setup-other-environments.md
section: Snowpark
---

# Setting Up Other Development Environments for Snowpark Java

If you are using a development environment not covered earlier (see [Setting Up Your Development Environment for Snowpark Java](setup.md)), see the instructions in this topic for
configuring your environment to use Snowpark.

## Using the Snowpark Library in a Maven Project

To integrate the Snowpark library into a Maven project, add the library as a dependency to your `pom.xml` file. For example:

```
<dependencies>
  ...
  <dependency>
    <groupId>com.snowflake</groupId>
    <artifactId>snowpark_2.12</artifactId>
    <version>1.18.0</version>
  </dependency>
  ...
</dependencies>
```

Set the `<version>` tag to the version of the library that you want to use. Note that version 1.18.0 is
used in this example for illustration purposes only. The latest available version of the driver may be higher.

## Downloading the Snowpark Library and its Dependencies

If you are not using Maven to manage the dependencies for your application and you need a copy of the Snowpark library and
its dependencies, you can download a TAR archive file or a zip file that contains the JAR files for the library and all of
its dependencies. The TAR/ZIP archive includes the API reference documentation in javadoc format.

To download the Snowpark library:

1. Go to the [Snowpark Client Download](https://developers.snowflake.com/snowpark/) page, and find the version that you want to use.
2. Browse to the directory for the version that you want to use.

   The rest of the steps use 1.18.0 as an example.
3. Download the snowpark_2.12-1.18.0-bundle.tar.gz (or .zip) file.

   > **Note:**
   >
   > As of Snowpark 0.9.0, rather than downloading an archive file that contains the Snowpark library and its dependencies in
   > separate JAR files, you can choose to download a single JAR file that contains the Snowpark library and its dependencies.
   > This JAR file is named snowpark_2.12-1.18.0-with-dependencies.jar.
   >
   > If you download this JAR file, skip the rest of the steps. (The steps apply to the archive file.)
4. If you want to verify the signature of the file:

   1. Download the snowpark_2.12-1.18.0-bundle.tar.gz.asc file.
   2. From the public keyserver, download and import the Snowflake GPG public key for the version of the library that you are
      using:

      * For version 1.17.0 and higher:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
        ```
      * For version 1.15.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 5A125630709DD64B
        ```
      * For version 1.6.1 through 1.14.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
        ```
      * For version 0.6.0 through 1.6.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 37C7086698CB005C
        ```
      > **Note:**
      >
      > If this command fails with the following error:
      >
      > > ```none
      > > gpg: keyserver receive failed: Server indicated a failure
      > > ```
      >
      > then specify that you want to use port 80 for the keyserver:
      >
      > > ```bash
      > > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
      > > ```
   3. Run the `gpg --verify` command to verify the signature. For example:

      ```
      gpg --verify snowpark_2.12-1.18.0-bundle.tar.gz.asc snowpark_2.12-1.18.0-bundle.tar.gz
      ```

      The output of the command should indicate that the archive file was signed with this key.

      > **Note:**
      >
      > Verifying the signature produces a warning similar to the following:
      >
      > > ```none
      > > gpg: Signature made Mon 24 Sep 2018 03:03:45 AM UTC using RSA key ID <gpg_key_id>
      > > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>" unknown
      > > gpg: WARNING: This key is not certified with a trusted signature!
      > > gpg: There is no indication that the signature belongs to the owner.
      > > ```
      >
      > To avoid the warning, you can grant the Snowflake GPG public key implicit trust.
5. Extract the contents of the archive file.

   The README.txt file in the archive file describes the contents of each directory.
6. Add the following extracted file and directory to the classpath for building and running your application:

   * The snowpark_2.12-1.18.0.jar file
   * The lib directory

---
title: Setting Up Other Development Environments for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/setup-other-environments.md
section: Snowpark
---

# Setting Up Other Development Environments for Snowpark Scala

If you are using a development environment not covered earlier (see [Setting Up Your Development Environment for Snowpark Scala](setup.md)), see the instructions in this topic for
configuring your environment to use Snowpark.

## Using the Snowpark Library in an sbt Build File

To integrate the Snowpark library into a project that uses an sbt build file, add the library as a dependency.

In the `build.sbt` file for your project, make the following changes:

1. If the `scalaVersion` setting does not match the version that you plan to use, update the setting. For example:

   ```scala
   scalaVersion := "2.12.20"
   ```

   Note that you must use a
   [Scala version that is supported for use with the Snowpark library](prerequisites.md).
2. Add the Snowpark library to the list of dependencies. For example:

   ```
   libraryDependencies += "com.snowflake" % "snowpark_2.12" % "1.18.0"
   ```

## Using the Snowpark Library in a Maven Project

To integrate the Snowpark library into a Maven project, add the library as a dependency to your `pom.xml` file. For example:

```
<dependencies>
  ...
  <dependency>
    <groupId>com.snowflake</groupId>
    <artifactId>snowpark_2.12</artifactId>
    <version>1.18.0</version>
  </dependency>
  ...
</dependencies>
```

Set the `<version>` tag to the version of the library that you want to use. Note that version 1.18.0 is
used in this example for illustration purposes only. The latest available version of the driver may be higher.

## Downloading the Snowpark Library and its Dependencies

If you are not using sbt or Maven to manage the dependencies for your application and you need a copy of the Snowpark library and
its dependencies, you can download a TAR archive file or a zip file that contains the JAR files for the library and all of
its dependencies. The TAR/ZIP archive includes the API reference documentation in scaladoc format.

To download the Snowpark library:

1. Go to the [Snowpark Client Download](https://developers.snowflake.com/snowpark/) page, and find the version that you want to use.
2. Browse to the directory for the version that you want to use.

   The rest of the steps use 1.18.0 as an example.
3. Download the snowpark_2.12-1.18.0-bundle.tar.gz (or .zip) file.

   > **Note:**
   >
   > As of Snowpark 0.9.0, rather than downloading an archive file that contains the Snowpark library and its dependencies in
   > separate JAR files, you can choose to download a single JAR file that contains the Snowpark library and its dependencies.
   > This JAR file is named snowpark_2.12-1.18.0-with-dependencies.jar.
   >
   > If you download this JAR file, skip the rest of the steps. (The steps apply to the archive file.)
4. If you want to verify the signature of the file:

   1. Download the snowpark_2.12-1.18.0-bundle.tar.gz.asc file.
   2. From the public keyserver, download and import the Snowflake GPG public key for the version of the library that you are
      using:

      * For version 1.17.0 and higher:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
        ```
      * For version 1.15.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 5A125630709DD64B
        ```
      * For version 1.6.1 through 1.14.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
        ```
      * For version 0.6.0 through 1.6.0:

        ```
        $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 37C7086698CB005C
        ```
      > **Note:**
      >
      > If this command fails with the following error:
      >
      > > ```none
      > > gpg: keyserver receive failed: Server indicated a failure
      > > ```
      >
      > then specify that you want to use port 80 for the keyserver:
      >
      > > ```bash
      > > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
      > > ```
   3. Run the `gpg --verify` command to verify the signature. For example:

      ```
      gpg --verify snowpark_2.12-1.18.0-bundle.tar.gz.asc snowpark_2.12-1.18.0-bundle.tar.gz
      ```

      The output of the command should indicate that the archive file was signed with this key.

      > **Note:**
      >
      > Verifying the signature produces a warning similar to the following:
      >
      > > ```none
      > > gpg: Signature made Mon 24 Sep 2018 03:03:45 AM UTC using RSA key ID <gpg_key_id>
      > > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>" unknown
      > > gpg: WARNING: This key is not certified with a trusted signature!
      > > gpg: There is no indication that the signature belongs to the owner.
      > > ```
      >
      > To avoid the warning, you can grant the Snowflake GPG public key implicit trust.
5. Extract the contents of the archive file.

   The README.txt file in the archive file describes the contents of each directory.
6. Add the following extracted file and directory to the classpath for building and running your application:

   * The snowpark_2.12-1.18.0.jar file
   * The lib directory

---
title: Setting up Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-setup-snowpark-checkpoints.md
section: Snowpark
---

# Setting up Snowpark Checkpoints

Before running Snowpark Checkpoints, complete these tasks:

* Set up a [Python development environment](checkpoints-setup-python-environment.md).
* Install the [Snowpark Checkpoints library](https://github.com/snowflakedb/snowpark-checkpoints).
* Install and enable the [Snowflake VS Code Extension for Snowpark Checkpoints](https://github.com/snowflake-eng/snowflake-vscode-extension).
* Optionally, integrate the Snowpark Migration Accelerator (SMA). For more information, see [Snowpark Migration Accelerator Documentation](https://docs.snowconvert.com/sma).)

These components work together to enable accurate validation of Snowpark workloads against their original PySpark implementations.

---
title: Setting Up the SBT REPL for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/quickstart-sbt.md
section: Snowpark
---

# Setting Up the SBT REPL for Snowpark Scala

This topic explains how to set up the SBT REPL for Snowpark.

## Creating a New Scala Project in sbt

Next, create a new Scala project for Snowpark.

1. Create a new directory for your project, and change to that directory.

   ```bash
   mkdir snowpark_project
   cd snowpark_project
   ```
2. Run the `sbt new` command, and specify the [template](https://www.scala-sbt.org/1.x/docs/sbt-new-and-Templates.html)
   that you want to use to create the new project. For example:

   ```bash
   sbt new scala/hello-world.g8
   ```

   Enter a name for your project. This creates a project directory with that name.

## Configuring the sbt Project for Snowpark

Next, configure the project for Snowpark.

In the `build.sbt` file for your project, make the following changes:

1. If the `scalaVersion` setting does not match the version that you plan to use, update the setting. For example:

   ```scala
   scalaVersion := "2.12.20"
   ```

   Note that you must use a
   [Scala version that is supported for use with the Snowpark library](prerequisites.md).
2. Add the Snowpark library to the list of dependencies. For example:

   ```
   libraryDependencies += "com.snowflake" % "snowpark_2.12" % "1.18.0"
   ```

1. Add the following lines to configure the REPL:

   ```scala
   Compile/console/scalacOptions += "-Yrepl-class-based"
   Compile/console/scalacOptions += "-Yrepl-outdir"
   Compile/console/scalacOptions += "repl_classes"
   ```

## Verifying Your sbt Project Configuration

To verify that you have configured your project to use Snowpark, run a simple example of Snowpark code.

1. In the `src/main/scala/Main.scala` file, replace the contents with the code below:

   ```scala
   import com.snowflake.snowpark._
   import com.snowflake.snowpark.functions._

   object Main {
     def main(args: Array[String]): Unit = {
       // Replace the <placeholders> below.
       val configs = Map (
         "URL" -> "https://<account_identifier>.snowflakecomputing.com:443",
         "USER" -> "<user name>",
         "PASSWORD" -> "<password>",
         "ROLE" -> "<role name>",
         "WAREHOUSE" -> "<warehouse name>",
         "DB" -> "<database name>",
         "SCHEMA" -> "<schema name>"
       )
       val session = Session.builder.configs(configs).create
       session.sql("show tables").show()
     }
   }
   ```

   Note the following:

   * Replace the `<placeholders>` with values that you use to connect to Snowflake.
   * For `<account_identifier>`, specify your [account identifier](../../../user-guide/admin-account-identifier.md).
   * If you prefer to use [key pair authentication](../../../user-guide/key-pair-auth.md):

     + Replace `PASSWORD` with `PRIVATE_KEY_FILE`, and set it to the path to your private key file.
     + If the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key.

     As an alternative to setting `PRIVATE_KEY_FILE` and `PRIVATE_KEY_FILE_PWD`, you can set the `PRIVATEKEY`
     property to the string value of the unencrypted private key from the private key file.

     + For example, if your private key file is unencrypted, set this to the value of the key in the file (without the
       `-----BEGIN PRIVATE KEY-----` and `-----END PRIVATE KEY-----` header and footer and without the line endings).
     + Note that if the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY`
       property.
   * If you plan to create UDFs:

     + Don’t set up your `object` to extend the `App` trait. For details, see
       [Caveat About Creating UDFs in an Object With the App Trait](creating-udfs.md).
     + Don’t set up your `object` to extend a class or trait that is not serializable.
2. Change to your project directory, and run the following command to run the sample code:

   ```bash
   sbt "runMain Main"
   ```

---
title: Setting Up the Scala REPL for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/quickstart-scala-repl.md
section: Snowpark
---

# Setting Up the Scala REPL for Snowpark Scala

This topic explains how to set up the Scala REPL for Snowpark.

## Installing the Scala REPL

The [Scala REPL](https://docs.scala-lang.org/overviews/repl/overview.html)
([read-eval-print loop](https://en.wikipedia.org/wiki/Read%E2%80%93eval%E2%80%93print_loop)) is provided with the Scala build
tool. To install the
[supported version of the Scala build tool](prerequisites.md),
[find the version that you plan to use](https://www.scala-lang.org/download/all.html), and follow the installation instructions.

## Running the Scala REPL

To use the Snowpark library in the Scala REPL:

1. If you have not already done so,
   [download the Snowpark library archive file and extract the contents of the file](setup-other-environments.md).
2. Start the REPL by running the `run.sh` shell script provided in the archive file:

> ```
> cd <path>/snowpark-1.18.0
> ./run.sh
> ```

The `run.sh` script does the following:

* Adds the Snowpark library and dependencies to the classpath.
* Creates a <path>/snowpark-1.18.0/repl_classes/ directory for the classes generated by the Scala REPL.
* Preloads the `preload.scala` file, which imports the `com.snowflake.snowpark` package and the
  `com.snowflake.snowpark.functions` object.

If you are using a different REPL for Scala:

1. Add the Snowpark library JAR file and dependencies to the classpath.

   * The Snowpark library JAR file is in the top level directory of the extracted TAR/ZIP archive file.
   * The dependencies are in the `lib` directory of the extracted TAR/ZIP archive file.
2. Create a temporary directory for the classes generated by the REPL, and configure the REPL to generate classes in that
   directory.

Later, when defining inline user-defined functions (UDFs), you’ll need to
[specify the directory for the REPL classes as a dependency](creating-udfs.md).

## Verifying Your Scala REPL Configuration

To verify that you have configured your project to use Snowpark, run a simple example of Snowpark code.

1. In the directory containing the files extracted from the `.zip` / `.tar.gz` file (i.e. the directory containing
   the `run.sh` script), create a `Main.scala` file that contains the code below:

   ```scala
   import com.snowflake.snowpark._
   import com.snowflake.snowpark.functions._

   object Main {
     def main(args: Array[String]): Unit = {
       // Replace the <placeholders> below.
       val configs = Map (
         "URL" -> "https://<account_identifier>.snowflakecomputing.com:443",
         "USER" -> "<user name>",
         "PASSWORD" -> "<password>",
         "ROLE" -> "<role name>",
         "WAREHOUSE" -> "<warehouse name>",
         "DB" -> "<database name>",
         "SCHEMA" -> "<schema name>"
       )
       val session = Session.builder.configs(configs).create
       session.sql("show tables").show()
     }
   }
   ```

   Note the following:

   * Replace the `<placeholders>` with values that you use to connect to Snowflake.
   * For `<account_identifier>`, specify your [account identifier](../../../user-guide/admin-account-identifier.md).
   * If you prefer to use [key pair authentication](../../../user-guide/key-pair-auth.md):

     + Replace `PASSWORD` with `PRIVATE_KEY_FILE`, and set it to the path to your private key file.
     + If the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key.

     As an alternative to setting `PRIVATE_KEY_FILE` and `PRIVATE_KEY_FILE_PWD`, you can set the `PRIVATEKEY`
     property to the string value of the unencrypted private key from the private key file.

     + For example, if your private key file is unencrypted, set this to the value of the key in the file (without the
       `-----BEGIN PRIVATE KEY-----` and `-----END PRIVATE KEY-----` header and footer and without the line endings).
     + Note that if the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY`
       property.
   * If you plan to create UDFs:

     + Don’t set up your `object` to extend the `App` trait. For details, see
       [Caveat About Creating UDFs in an Object With the App Trait](creating-udfs.md).
     + Don’t set up your `object` to extend a class or trait that is not serializable.
2. From within the directory, run the `run.sh` script to start the Scala REPL with the settings needed for the Snowpark
   library:

   ```bash
   ./run.sh
   ```
3. In the Scala REPL shell, enter the following command to load the sample file that you just created:

   ```none
   :load Main.scala
   ```
4. Run the following statement to execute the `main` method of the class that you loaded:

   ```scala
   Main.main(Array[String]())
   ```

   This runs the `SHOW TABLES` command and prints out the first 10 rows of the results.

---
title: Setting Up Visual Studio Code for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/quickstart-vscode.md
section: Snowpark
---

# Setting Up Visual Studio Code for Snowpark Scala

This topic explains how to set up Visual Studio Code for Snowpark.

## Setting Up Visual Studio Code for Scala Development

For convenience when writing Scala code in Visual Studio Code, install the
[Metals extension](https://scalameta.org/metals/docs/editors/vscode.html). The Metals extension provides code completion,
parameter hints, and information about types and methods.

To install the Metals extension:

1. In the Activity Bar on the [left side of the window](https://code.visualstudio.com/docs/getstarted/userinterface), click the Extensions icon.

   (If the Activity Bar isn’t displayed, make sure that View » Appearance »
   Show Activity Bar is checked.)

   This displays the Extensions view, which allows you to browse and install extensions from the Extensions Marketplace.
2. In the search box for Search Extensions in Marketplace, search for the term:

   ```none
   metals
   ```
3. In the search results, find the Scala (Metals) extension, and click Install.

For more information about the Scala (Metals) extension, see
[Visual Studio Code](https://scalameta.org/metals/docs/editors/vscode.html) in the
[Metals documentation](https://scalameta.org/metals/docs/editors/overview.html).

## Creating a New Scala Project in Visual Studio Code

Next, create a new Scala project for Snowpark.

1. Create a workspace directory for your projects. For example:

   ```bash
   mkdir snowpark_projects
   ```

   This directory will contain subdirectories for the projects that you create.
2. In Visual Studio Code, choose File » Open, select the directory that you created, and click Open.
3. In the Activity Bar on the left, click the Metals icon.
4. Under Packages in the Side Bar ([to the right](https://code.visualstudio.com/docs/getstarted/userinterface) of the Activity Bar), click the
   New Scala Project button.
5. Select a template to use for the new project (e.g. `scala/hello-world.g8`).
6. Select the workspace directory that you created earlier (`snowpark_projects`), and click Ok.
7. Enter a name for the new project (e.g. `hello_snowpark`).
8. When prompted by the dialog box in the lower right corner of the window, click Yes to open the new project in a new
   window.
9. When prompted by the dialog box in the lower right corner of the window, click Import build to
   [import the build](https://scalameta.org/metals/docs/editors/vscode.html#importing-a-build).

## Configuring the Visual Studio Code Project for Snowpark

Next, configure the project for Snowpark.

1. In the Activity Bar on the [left side of the window](https://code.visualstudio.com/docs/getstarted/userinterface), make sure that the Explorer icon (the first icon at
   the top) is selected.
2. Under Explorer in the Side Bar ([to the right](https://code.visualstudio.com/docs/getstarted/userinterface) of the Activity Bar), under your project, select
   the `build.sbt` file for editing.

   In the `build.sbt` file for your project, make the following changes:

   1. If the `scalaVersion` setting does not match the version that you plan to use, update the setting. For example:

      ```scala
      scalaVersion := "2.12.20"
      ```

      Note that you must use a
      [Scala version that is supported for use with the Snowpark library](prerequisites.md).
   2. Add the Snowpark library to the list of dependencies. For example:

      ```
      libraryDependencies += "com.snowflake" % "snowpark_2.12" % "1.18.0"
      ```
3. After making those changes, choose File » Save to save your changes.
4. When prompted by the dialog box in the lower right corner of the window, click Import changes to
   [re-import the file](https://scalameta.org/metals/docs/editors/vscode.html#importing-changes).

## Verifying Your Visual Studio Code Project Configuration

To verify that you have configured your project to use Snowpark, run a simple example of Snowpark code.

1. In the Activity Bar on the [left side of the window](https://code.visualstudio.com/docs/getstarted/userinterface), make sure that the Explorer icon (the first icon at
   the top) is selected.
2. Under Explorer in the Side Bar, under your project, expand the `src/main/scala` folder, and select and open
   the `Main.scala` file.
3. In the `Main.scala` file, replace the contents with the code below:

   ```scala
   import com.snowflake.snowpark._
   import com.snowflake.snowpark.functions._

   object Main {
     def main(args: Array[String]): Unit = {
       // Replace the <placeholders> below.
       val configs = Map (
         "URL" -> "https://<account_identifier>.snowflakecomputing.com:443",
         "USER" -> "<user name>",
         "PASSWORD" -> "<password>",
         "ROLE" -> "<role name>",
         "WAREHOUSE" -> "<warehouse name>",
         "DB" -> "<database name>",
         "SCHEMA" -> "<schema name>"
       )
       val session = Session.builder.configs(configs).create
       session.sql("show tables").show()
     }
   }
   ```

   Note the following:

   * Replace the `<placeholders>` with values that you use to connect to Snowflake.
   * For `<account_identifier>`, specify your [account identifier](../../../user-guide/admin-account-identifier.md).
   * If you prefer to use [key pair authentication](../../../user-guide/key-pair-auth.md):

     + Replace `PASSWORD` with `PRIVATE_KEY_FILE`, and set it to the path to your private key file.
     + If the private key is encrypted, you must set `PRIVATE_KEY_FILE_PWD` to the passphrase for decrypting the private key.

     As an alternative to setting `PRIVATE_KEY_FILE` and `PRIVATE_KEY_FILE_PWD`, you can set the `PRIVATEKEY`
     property to the string value of the unencrypted private key from the private key file.

     + For example, if your private key file is unencrypted, set this to the value of the key in the file (without the
       `-----BEGIN PRIVATE KEY-----` and `-----END PRIVATE KEY-----` header and footer and without the line endings).
     + Note that if the private key is encrypted, you must decrypt the key before setting it as the value of the `PRIVATEKEY`
       property.
   * If you plan to create UDFs:

     + Don’t set up your `object` to extend the `App` trait. For details, see
       [Caveat About Creating UDFs in an Object With the App Trait](creating-udfs.md).
     + Don’t set up your `object` to extend a class or trait that is not serializable.
4. Click run above the `Object` line to run the example.

If the following error message appears:

```none
Run session not started
```

check the Problems tab in the bottom of the window. If this tab does not appear in the bottom of the window, select the
View > Problems item from the menu.

---
title: Setting Up Your Development Environment for Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/setup.md
section: Snowpark
---

# Setting Up Your Development Environment for Snowpark Java

The Snowpark library is distributed as a JAR file through Maven. You can integrate Snowpark as a dependency in your own Maven
projects. As an alternative, you can download the JAR file for use in your application.

You can use this library to develop applications in various development tools (e.g. IntelliJ IDEA Community Edition).

This set of topics provides instructions for setting up different types of application development environments to use the
Snowpark library.

**Next Topics:**

* [Prerequisites for Snowpark Java](prerequisites.md)
* [Setting Up IntelliJ IDEA CE for Snowpark Java](quickstart-intellij.md)
* [Setting Up Other Development Environments for Snowpark Java](setup-other-environments.md)

---
title: Setting up your development environment for Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/setup.md
section: Snowpark
---

# Setting up your development environment for Snowpark Python

Set up your preferred local development environment to build client applications with Snowpark Python.

If you are writing a stored procedure with Snowpark Python, consider setting up a
[Python worksheet](python-worksheets.md) instead.

## Prerequisites

Use the following SQL query to see which Python versions are enabled in your Snowflake account:

```sqlexample
SELECT DISTINCT runtime_version
FROM SNOWFLAKE.INFORMATION_SCHEMA.PACKAGES
WHERE language = 'python'
AND runtime_version IS NOT NULL
ORDER BY runtime_version;
```

This query returns all versions including those that are deprecated.

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

> **Note:**
>
> * Python 3.9 (deprecated) depends on Snowpark client version 1.5.0.
> * Python 3.10 depends on Snowpark client version 1.5.1.
> * Python versions 3.11, 3.12, and 3.13 depend on Snowpark client version 1.9.0.

You can create a Python virtual environment for a particular Python version using tools like
[Anaconda](https://www.anaconda.com/),
[Miniconda](https://docs.conda.io/en/latest/miniconda.html), or
[virtualenv](https://docs.python.org/3/tutorial/venv.html).

For example, to use conda to create a Python 3.12 virtual environment, add the Snowflake conda channel,
and install the numpy and pandas packages, type:

```bash
conda create --name py12_env --override-channels -c https://repo.anaconda.com/pkgs/snowflake python=3.12 numpy pandas pyarrow
```

Creating a new conda environment locally with the Snowflake channel is recommended
in order to have the best experience when using UDFs. For more information, see
[Local development and testing](../../udf/python/udf-python-packages.md).

> **Note:**
>
> There is a known issue with running Snowpark Python on Apple M1 chips due to memory handling in pyOpenSSL.
> The error message displayed is, “Cannot allocate write+execute memory for ffi.callback()”.
>
> As a workaround, set up a virtual environment that uses x86 Python using these commands:
>
> ```bash
> CONDA_SUBDIR=osx-64 conda create -n snowpark python=3.12 numpy pandas pyarrow --override-channels -c https://repo.anaconda.com/pkgs/snowflake
> conda activate snowpark
> conda config --env --set subdir osx-64
> ```
>
> Then, install Snowpark within this environment as described in the next section.

### Prerequisites for using Pandas DataFrames

The Snowpark API provides methods for writing data to and from Pandas DataFrames.
[Pandas](https://pandas.pydata.org/) is a library for data analysis.
With Pandas, you use a data structure called a DataFrame to analyze and manipulate two-dimensional data.

These methods require the following libraries:

* Pandas 1.0.0 (or higher).
* [PyArrow library](https://arrow.apache.org/docs/python/) version 8.0.0.

> **Note:**
>
> If you have already installed any version of the PyArrow library other than the recommended
> version listed above, uninstall PyArrow before installing Snowpark.
>
> Installing Snowpark using pip automatically installs the appropriate version of PyArrow.
> If you use conda to install Snowpark, you must specify `pyarrow` in the list of packages.
>
> Do not re-install a different version of PyArrow after installing Snowpark.

## Installation instructions

> **Note:**
>
> Before running the commands in this section, make sure you are in a Python environment for a supported Python version.
> You can check this by typing the command `python -V`. If the version displayed is not a supported version,
> refer to the previous section.

Install the Snowpark Python package into the Python virtual environment by using `conda` or `pip`.

```bash
conda install snowflake-snowpark-python
```

-or-

```bash
pip install snowflake-snowpark-python
```

Optionally, specify packages that you want to install in the environment such as,
for example, the Pandas data analysis package:

```bash
conda install snowflake-snowpark-python pandas pyarrow
```

-or-

```bash
pip install "snowflake-snowpark-python[pandas]"
```

You can view the Snowpark Python project description on
[the Python Package Index (PyPi) repository](https://pypi.org/project/snowflake-snowpark-python/).

## Setting up Snowflake Notebooks for Snowpark

You can use the Snowflake Notebooks development environment to perform data science and data engineering workflows with Python. Snowflake Notebooks
comes preinstalled with Snowpark for Python.

For more information about getting started with Snowflake Notebooks, see [Getting started with Legacy Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-get-started.md).

For information about setting up Snowflake Notebooks, see [Set up Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-setup.md).

For more information about using the Snowpark library in Snowflake Notebooks, see [Snowpark Python in notebooks](../../../user-guide/ui-snowsight/notebooks-use-with-snowflake.md).

## Setting up a Jupyter notebook for Snowpark

To get started using Snowpark with Jupyter Notebooks, do the following:

1. Install Jupyter Notebooks:

   ```bash
   pip install notebook
   ```
2. Start a Jupyter Notebook:

   ```bash
   jupyter notebook
   ```
3. In the top-right corner of the web page that opened, select New » Python 3 Notebook.
4. In a cell, create a session. For more information, see [Creating a Session](creating-session.md).

## Setting up an IDE for Snowpark

You can use Snowpark with an integrated development environment (IDE).

To use Snowpark with Microsoft Visual Studio Code,
[install the Python extension and then specify the Python environment to use](https://code.visualstudio.com/docs/languages/python).

To use features for authoring and debugging Snowpark Python stored procedures in VS Code, install the [Snowflake Extension for Visual Studio Code](../../../user-guide/vscode-ext.md). The
extension enables you to connect to Snowflake and execute SQL statements directly in VS Code.

> **Important:**
>
> You must manually select the Python environment that you created when you set up your development environment.
> To do this, use the `Python: Select Interpreter` command from the `Command Palette`.
> For more information, see [Using Python environments in VS Code](https://code.visualstudio.com/docs/python/environments)
> in the Microsoft Visual Studio documentation.

## Importing modules

The main classes for the Snowpark API are in the `snowflake.snowpark` module.

To import particular names from a module, specify the names. For example:

```python
>>> from snowflake.snowpark.functions import avg
```

---
title: Setting Up Your Development Environment for Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/setup.md
section: Snowpark
---

# Setting Up Your Development Environment for Snowpark Scala

The Snowpark library is distributed as a JAR file through Maven. You can integrate Snowpark as a dependency in your own Maven projects or SBT build files. As an alternative, you can download the JAR file for use in your application.

You can use this library to develop applications in various environments, including:

* application development tools (like Visual Studio Code or IntelliJ IDEA Community Edition)
* [read-eval-print loop (REPL)](https://en.wikipedia.org/wiki/Read%E2%80%93eval%E2%80%93print_loop) interactive shells (like
  the sbt REPL or the Scala REPL)
* Jupyter notebooks

This set of topics provides instructions for setting up different types of application development environments to use the
Snowpark library.

**Next Topics:**

* [Prerequisites for Snowpark Scala](prerequisites.md)
* [Setting Up Visual Studio Code for Snowpark Scala](quickstart-vscode.md)
* [Setting Up IntelliJ IDEA CE for Snowpark Scala](quickstart-intellij.md)
* [Setting Up the SBT REPL for Snowpark Scala](quickstart-sbt.md)
* [Setting Up the Scala REPL for Snowpark Scala](quickstart-scala-repl.md)
* [Setting Up a Jupyter Notebook for Snowpark Scala](quickstart-jupyter.md)
* [Setting Up Other Development Environments for Snowpark Scala](setup-other-environments.md)

---
title: Snowpark API
source: https://docs.snowflake.com/en/developer-guide/snowpark/index.md
section: Snowpark
---

# Snowpark API

The Snowpark API provides an intuitive library for querying and processing data at scale in Snowflake. Using a library for any of three
languages, you can build applications that process data in Snowflake without moving data to the system where your application code runs,
and process at scale as part of the elastic and serverless Snowflake engine.

Snowflake currently provides Snowpark libraries for three languages: Java, Python, and Scala.

## Quickstarts

You can use the following Quickstarts to get a hands-on introduction to Snowpark.

* [Machine Learning with Snowpark Python](https://quickstarts.snowflake.com/guide/getting_started_snowpark_machine_learning/index.html)
* [Data Engineering Pipelines with Snowpark Python](https://quickstarts.snowflake.com/guide/data_engineering_pipelines_with_snowpark_python/index.html)
* [Getting Started With Snowpark for Python and Streamlit](https://quickstarts.snowflake.com/guide/getting_started_with_snowpark_for_python_streamlit/index.html)
* [An Image Recognition App in Snowflake using Snowpark Python, PyTorch, Streamlit and OpenAI](https://quickstarts.snowflake.com/guide/image_recognition_snowpark_pytorch_streamlit_openai/index.html)
* [Getting Started With Snowpark Scala](https://quickstarts.snowflake.com/guide/getting_started_with_snowpark_scala/index.html)

## Developer Guides

You can use Snowpark libraries for the languages listed in the following table:

| Language | Developer Guide | API Reference |
| --- | --- | --- |
| Java | [Snowpark Developer Guide for Java](java/index.md) | [Snowpark Library for Java API Reference](/developer-guide/snowpark/reference/java/index.md) |
| Python | [Snowpark Developer Guide for Python](python/index.md) | [Snowpark Library for Python API Reference](/developer-guide/snowpark/reference/python/latest/index) |
| Scala | [Snowpark Developer Guide for Scala](scala/index.md) | [Snowpark Library for Scala API Reference](/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/index.md) |

## Download

You can download the Snowpark library for any of the three supported languages. For downloads, see
[Snowpark Client Download](https://developers.snowflake.com/snowpark/) (Snowflake Developer Center).

## Key Features

Snowpark has several features that distinguish it from other client libraries, as described in the following sections.

### Benefits When Compared with the Spark Connector

In comparison to using the [Snowflake Connector for Spark](../../user-guide/spark-connector.md), developing with Snowpark includes the following benefits:

* Support for interacting with data within Snowflake using libraries and patterns purpose built for different languages without compromising
  on performance or functionality.
* Support for authoring Snowpark code using local tools such as Jupyter, VS Code, or IntelliJ.
* Support for pushdown for all operations, including Snowflake UDFs. This means Snowpark pushes down all data transformation and
  heavy lifting to the Snowflake data cloud, enabling you to efficiently work with data of any size.
* No requirement for a separate cluster outside of Snowflake for computations. All of the computations are done within
  Snowflake. Scale and compute management are handled by Snowflake.

### Ability to Build SQL Statements with Native Constructs

The Snowpark API provides programming language constructs for building SQL statements. For example, the API provides a
`select` method that you can use to specify the column names to return, rather than writing
`'select column_name'` as a string.

Although you can still use a string to specify the SQL statement to execute, you benefit from features like
[intelligent code completion](https://en.wikipedia.org/wiki/Intelligent_code_completion) and type checking when you use the
native language constructs provided by Snowpark.

#### Example

Python code in the following example performs a select operation on the `sample_product_data` table, specifying the columns
`id`, `name`, and `serial_number`.

```python
>>> # Import the col function from the functions module.
>>> from snowflake.snowpark.functions import col

>>> # Create a DataFrame that contains the id, name, and serial_number
>>> # columns in the "sample_product_data" table.
>>> df = session.table("sample_product_data").select(col("id"), col("name"), col("serial_number"))
>>> df.show()
```

### Reduced Data Transfer

Snowpark operations are executed lazily on the server, meaning that you can use the library to delay running data transformation until as
late in the pipeline as possible while batching up many operations into a single operation. This reduces the amount of data transferred
between your client and the Snowflake database. It also improves performance.

The core abstraction in Snowpark is the DataFrame, which represents a set of data and provides methods to operate on that data.
In your client code, you construct a DataFrame object and set it up to retrieve the data that you want to use (for example, the
columns containing the data, the filter to apply to rows, etc.).

The data isn’t retrieved when you construct the DataFrame object. Instead, when you are ready to retrieve the data,
you can perform an action that evaluates the DataFrame objects and sends the corresponding SQL statements to the Snowflake
database for execution.

#### Example

Python code in the following example sets up a query against a table. It calls the `collect` method to execute the query and retrieve
results.

```python
>>> # Create a DataFrame with the "id" and "name" columns from the "sample_product_data" table.
>>> # This does not execute the query.
>>> df = session.table("sample_product_data").select(col("id"), col("name"))

>>> # Send the query to the server for execution and
>>> # return a list of Rows containing the results.
>>> results = df.collect()
```

### Ability to Create UDFs Inline

You can create user-defined functions (UDFs) inline in a Snowpark app. Snowpark can push your code to the server, where the code can
operate on the data at scale. This is useful for looping or batch functionality where creating as a UDF will allow Snowflake to parallelize
and apply the codeful logic at scale within Snowflake.

You can write functions in the same language that you use to write your client code (for example, by using anonymous functions
in Scala or by using lambda functions in Python). To use these functions to process data in the Snowflake database, you define
and call user-defined functions (UDFs) in your custom code.

Snowpark automatically pushes the custom code for UDFs to the Snowflake engine. When you call the UDF in your client code,
your custom code is executed on the server (where the data is). You don’t need to transfer the data to your client in order to
execute the function on the data.

#### Example

Python code in the following example creates a UDF called `my_udf` and assigns it to the `add_one` variable.

```python
>>> from snowflake.snowpark.types import IntegerType
>>> add_one = udf(lambda x: x+1, return_type=IntegerType(), input_types=[IntegerType()], name="my_udf", replace=True)
```

---
title: Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowpark-checkpoints-library.md
section: Snowpark
---

# Snowpark Checkpoints

Snowpark Checkpoints is a testing library that validates code migrated from [Apache PySpark](https://spark.apache.org/) to Snowpark Python. It compares the outputs of DataFrame operations across both platforms, ensuring that Snowpark implementations produce results that are functionally equivalent to their PySpark counterparts. It strives to maintain data integrity and analytical consistency throughout the migration process.

---
title: Snowpark Checkpoints library
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowpark-checkpoints-library-details.md
section: Snowpark
---

# Snowpark Checkpoints library

The Snowpark Checkpoints Python package provides a range of functionalities to support the validation of migrated workloads. The following sections outline the key features and capabilities included in the package, along with guidance on how to use them effectively.

* [Collectors](checkpoints-collectors.md)
* [Validators](checkpoints-validators.md)
* [Hypothesis](checkpoints-hypothesis.md)
* [Logging](checkpoints-logging.md)
* [Using Databricks](checkpoints-databricks.md)

---
title: Snowpark Checkpoints library: Collectors
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-collectors.md
section: Snowpark
---

# Snowpark Checkpoints library: Collectors

The `snowpark-checkpoints-collectors` package offers a function for extracting information from the PySpark DataFrames. You can then use that data to validate against the converted Snowpark DataFrames to ensure behavioral equivalence.

* To insert a new checkpoint collection point, use the following function:

  **Function signature:**

  ```python
  def collect_dataframe_checkpoint(df: SparkDataFrame,
    checkpoint_name: str,
    sample: Optional[float],
    mode: Optional[CheckpointMode],
    output_path: Optional[str]) -> None:
  ```

  **Function parameters:**

  + **df:** The PySpark DataFrame
  + **checkpoint_name:** The name of the checkpoint

    Starts with a letter (A-Z, a-z) or an underscore (_) and contains only letters, underscores, and numbers (0-9).
  + **sample:** (Optional) The sample size

    The default value is 1.0 (entire PySpark DataFrame) in a range from 0 to 1.0.
  + **mode:** (Optional) The execution mode

    The options are `SCHEMA` (default) and `DATAFRAME`.
  + **output_path:** (Optional) The path to the file where the checkpoint is saved

    The default value is the current working directory.

The collection process generates a JSON output file, called `checkpoint_collection_result.json`, that contains the following information about the result for each collection point:

* A timestamp for when the collection point started
* The relative path of the file where the collection point is
* The line of code of the file where the collection point is
* The name of the collection point checkpoint
* The result of the collection point (fail or pass)

## Schema inference collected data mode (Schema)

This is the default mode, which leverages Pandera schema inference to obtain the metadata and checks that will be evaluated for the specified DataFrame. This mode also collects custom data from columns of the DataFrame based on the PySpark type.

The column data and checks are collected based on the PySpark type of the column (see the following tables). For any column, no matter its type, the custom data collected includes the name of the column, the type of the column, nullable, the count of rows, the count of not null rows, and the count of null rows.

Custom data is collected based on the PySpark type of the column

| Column type | Custom data collected |
| --- | --- |
| Numeric (`byte`, `short`, `integer`, `long`, `float` and `double`) | Minimum value; maximum value; mean value; decimal precision (in case of integer type, the value is zero); standard deviation |
| Date | Minimum value; maximum value; date format (*%Y-%m-%d*) |
| DayTimeIntervalType and YearMonthIntervalType | Minimum value; maximum value |
| Timestamp | Minimum value; maximum value; date format (*%Y-%m-%dH:%M:%S*) |
| Timestamp ntz | Minimum value; maximum value; date format (*%Y-%m-%dT%H:%M:%S%z*) |
| String | Minimum length value; maximum length value |
| Char | PySpark handles any literal as a string type; therefore, *char* is not a valid type. |
| Varchar | PySpark handles any literal as a string type; therefore, *Varchar* is not a valid type. |
| Decimal | Minimum value; maximum value; mean value; decimal precision |
| Array | Type of the value; if allowed, null as an element; proportion of null values; maximum array size; minimum array size; mean size of arrays; whether all arrays have the same size |
| Binary | Maximum size; minimum size; mean size; whether all elements have the same size |
| Map | Type of the key; type of the value; if allowed, null as a value; proportion of null values; maximum map size; minimum map size; mean map size; whether all maps have the same size |
| Null | NullType represents None because the type data cannot be determined; therefore, it is not possible to get information from this type. |
| Struct | The metadata of the struct for each structField: `name`, `type`, `nullable`, `rows count`, `rows not null count` and `rows null count`. It is an array. |

It also defines a set of predefined validation checks for each data type detailed in the following table:

Checks are collected based on the type of the column

| Type | Pandera checks | Additional checks |
| --- | --- | --- |
| Boolean | Each value is True or False. | The count of True and False values |
| Numeric (`byte`, `short`, `integer`, `long`, `float` and `double`) | Each value is in the range of min value and max value. | Decimal precision; mean value; standard deviation |
| Date | N/A | Minimum and maximum values |
| Timestamp | Each value is in the range of min value and max value. | The format of the value |
| Timestamp ntz | Each value is in the range of min value and max value. | The format of the value |
| String | Each value length is in the range of min and max length. | None |
| Char | PySpark handles any literal as a string type; therefore, `char` is not a valid type. |  |
| Varchar | PySpark handles any literal as a string type; therefore, `Varchar` is not a valid type. |  |
| Decimal | N/A | N/A |
| Array | N/A | None |
| Binary | N/A | None |
| Map | N/A | None |
| Null | N/A | N/A |
| Struct | N/A | None |

This mode allows the user to define a sample of a DataFrame to collect, but it is optional. By default, the collection works with the entire DataFrame. The size of the sample must represent the population statistically.

Pandera can only infer the schema of a pandas DataFrame, which implies that the PySpark DataFrame must be converted into a pandas DataFrame, which can affect the columns’ type resolutions. In particular, pandera infers the following PySpark types as object types: `string`, `array`, `map`, `null`, `struct`, and `binary`.

The output of this mode is a JSON file for each collected DataFrame, and the name of the file is the same as the checkpoint. This file contains information related to the schema and has two sections:

* The **Pandera schema** section contains the data inferred by pandera such as name, type (pandas), whether the column allows null values, and other information for each column, and checks whether the column is based on the PySpark type. It is a `DataFrameSchema` object of pandera.
* The **custom data** section is an array of the custom data collected by each column based on the PySpark type.

> **Note:**
>
> The collection package might have memory issues when processing large PySpark DataFrames. To work with a subset of the data instead of the entire PySpark DataFrame, you can set the sample parameter in the collection function to a value between 0.0 and 1.0.

## DataFrame collected data mode (DataFrame)

This mode collects the data of the PySpark DataFrame. In this case, the mechanism saves all data of the given DataFrame in parquet format. Using the default user Snowflake connection, it tries to upload the parquet files into the Snowflake temporal stage and create a table based on the information in the stage. The name of the file and the table are the same as the checkpoint.

The output of this mode is a parquet file result of the DataFrame saved and a table with the DataFrame data in the default Snowflake configuration connection.

---
title: Snowpark Checkpoints library: Hypothesis
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-hypothesis.md
section: Snowpark
---

# Snowpark Checkpoints library: Hypothesis

## Hypothesis unit testing

Hypothesis is a powerful testing library for Python that is designed to enhance traditional unit testing by generating a wide range of input data automatically. It uses property-based testing, where instead of specifying individual test cases, you can describe the expected behavior of your code with properties or conditions, and Hypothesis generates examples to test those properties thoroughly. This approach helps uncover edge cases and unexpected behaviors, making it especially effective for complex functions. For more information, see [Hypothesis](https://hypothesis.readthedocs.io/en/latest/).

The `snowpark-checkpoints-hypothesis` package extends the Hypothesis library to generate synthetic Snowpark DataFrames for testing purposes. By leveraging the ability of Hypothesis to generate diverse and randomized test data, you can create Snowpark DataFrames with varying schemas and values to simulate real-world scenarios, thus ensuring robust code and verifying the correctness of complex transformations.

The Hypothesis strategy for Snowpark relies on pandera for generating synthetic data. The `dataframe_strategy` function uses the specified schema to generate a pandas DataFrame that conforms to it and then converts it into a Snowpark DataFrame.

**Function signature:**

```python
def dataframe_strategy(
  schema: Union[str, DataFrameSchema],
  session: Session,
  size: Optional[int] = None
) -> SearchStrategy[DataFrame]
```

**Function parameters:**

* `schema`: The schema that defines the columns, data types, and checks that the generated Snowpark dataframe should match

  The schema can be:

  + A path to a JSON schema file generated by the `collect_dataframe_checkpoint` function of the `snowpark-checkpoints-collectors` package
  + An instance of [pandera.api.pandas.container.DataFrameSchema](https://pandera.readthedocs.io/en/stable/reference/generated/pandera.api.pandas.container.DataFrameSchema.html)
* `session`: An instance of [snowflake.snowpark.Session](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Session) that will be used for creating the Snowpark DataFrames
* `size`: The number of rows to generate for each Snowpark DataFrame

  If this parameter is not provided, the strategy will generate DataFrames of different sizes.

**Function output:**

Returns a [Hypothesis SearchStrategy](https://github.com/HypothesisWorks/hypothesis/blob/904bdd967ca9ff23475aa6abe860a30925149da7/hypothesis-python/src/hypothesis/strategies/_internal/strategies.py#L221) that generates Snowpark DataFrames

## Supported and unsupported data types

The `dataframe_strategy` function supports the generation of Snowpark DataFrames with different data types, which vary depending on the type of the schema argument passed to the function. Note that the strategy will raise an exception if it finds an unsupported data type.

The following table shows the supported and unsupported PySpark data types by the `dataframe_strategy` function when a JSON file is passed as the `schema` argument:

| PySpark data type | Supported |
| --- | --- |
| [Array](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.ArrayType.html) | Yes |
| [Boolean](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.BooleanType.html) | Yes |
| [Char](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.CharType.html) | No |
| [Date](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.DateType.html) | Yes |
| [DayTimeIntervalType](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.DayTimeIntervalType.html) | No |
| [Decimal](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.DecimalType.html) | No |
| [Map](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.MapType.html) | No |
| [Null](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.NullType.html) | No |
| [Byte](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.ByteType.html), [Short](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.ShortType.html), [Integer](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.IntegerType.html), [Long](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.LongType.html), [Float](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.FloatType.html), [Double](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.DoubleType.html) | Yes |
| [String](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.StringType.html) | Yes |
| [Struct](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.StructType.html) | No |
| [Timestamp](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.TimestampType.html) | Yes |
| [TimestampNTZ](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.TimestampNTZType.html) | Yes |
| [Varchar](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.VarcharType.html) | No |
| [YearMonthIntervalType](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.YearMonthIntervalType.html) | No |

The following table shows the pandera data types supported by the `dataframe_strategy` function when a DataFrameSchema object is passed as the `schema` argument and the Snowpark data types they are mapped to:

| Pandera data type | Snowpark data type |
| --- | --- |
| int8 | [ByteType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.ByteType.html#snowflake.snowpark.types.ByteType) |
| int16 | [ShortType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.ShortType.html#snowflake.snowpark.types.ShortType) |
| int32 | [IntegerType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.IntegerType.html#snowflake.snowpark.types.IntegerType) |
| int64 | [LongType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.LongType.html#snowflake.snowpark.types.LongType) |
| float32 | [FloatType](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.types.FloatType#snowflake.snowpark.types.FloatType) |
| float64 | [DoubleType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.DoubleType.html#snowflake.snowpark.types.DoubleType) |
| string | [StringType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.StringType.html#snowflake.snowpark.types.StringType) |
| bool | [BooleanType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.BooleanType.html#snowflake.snowpark.types.BooleanType) |
| datetime64[ns, tz] | [TimestampType(TZ)](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.types.TZ) |
| datetime64[ns] | [TimestampType(NTZ)](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.types.NTZ) |
| date | [DateType](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.types.DateType.html#snowflake.snowpark.types.DateType) |

## Examples

The following procedure presents the typical workflow for using the Hypothesis library to generate Snowpark DataFrames:

1. Create a standard Python test function with the different assertions or conditions your code should satisfy for all inputs.
2. Add the Hypothesis `@given` decorator to your test function, and pass the `dataframe_strategy` function as an argument.

   For more information about the `@given` decorator, see [hypothesis.given](https://hypothesis.readthedocs.io/en/latest/details.html#hypothesis.given).
3. Run the test function.

   Hypothesis automatically provides the generated inputs as arguments to the test.

**Example 1: Generate Snowpark DataFrames from a JSON file**

In this example, Snowpark DataFrames are generated from a JSON schema file generated by the `collect_dataframe_checkpoint` function of the `snowpark-checkpoints-collectors` package:

```python
from hypothesis import given

from snowflake.hypothesis_snowpark import dataframe_strategy
from snowflake.snowpark import DataFrame, Session

@given(
    df=dataframe_strategy(
        schema="path/to/file.json",
        session=Session.builder.getOrCreate(),
        size=10,
    )
)
def test_my_function_from_json_file(df: DataFrame):
    # Test a particular function using the generated Snowpark DataFrame
    ...
```

**Example 2: Generate a Snowpark DataFrame from a pandera DataFrameSchema object**

In this example, Snowpark DataFrames are generated from an instance of a pandera DataFrameSchema:

```python
import pandera as pa

from hypothesis import given

from snowflake.hypothesis_snowpark import dataframe_strategy
from snowflake.snowpark import DataFrame, Session

@given(
    df=dataframe_strategy(
        schema=pa.DataFrameSchema(
            {
                "boolean_column": pa.Column(bool),
                "integer_column": pa.Column("int64", pa.Check.in_range(0, 9)),
                "float_column": pa.Column(pa.Float32, pa.Check.in_range(10.5, 20.5)),
            }
        ),
        session=Session.builder.getOrCreate(),
        size=10,
    )
)
def test_my_function_from_dataframeschema_object(df: DataFrame):
    # Test a particular function using the generated Snowpark DataFrame
    ...
```

For more information, see [Pandera DataFrameSchema](https://pandera.readthedocs.io/en/latest/dataframe_schemas.html).

**Example 3: Customize the Hypothesis behavior**

You can also customize the behavior of your test with the Hypothesis `@settings` decorator. This decorator allows you to customize various configuration parameters to tailor test behavior to your needs.

By using the `@settings` decorator, you can control aspects like the maximum number of test cases, the deadline for each test execution, and verbosity levels:

```python
from datetime import timedelta

from hypothesis import given, settings
from snowflake.snowpark import DataFrame, Session

from snowflake.hypothesis_snowpark import dataframe_strategy

@given(
    df=dataframe_strategy(
        schema="path/to/file.json",
        session=Session.builder.getOrCreate(),
    )
)
@settings(
    deadline=timedelta(milliseconds=800),
    max_examples=25,
)
def test_my_function(df: DataFrame):
    # Test a particular function using the generated Snowpark DataFrame
    ...
```

For more information, see [Hypothesis settings](https://hypothesis.readthedocs.io/en/latest/settings.html).

---
title: Snowpark Checkpoints library: Logging
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-logging.md
section: Snowpark
---

# Snowpark Checkpoints library: Logging

Snowpark Checkpoints uses Python’s built-in [logging](https://docs.python.org/3/library/logging.html) module to provide log messages about its internal operations. The library emits log messages at different [log levels](https://docs.python.org/3/library/logging.html#logging-levels) that can be used to understand the behavior of the library and to diagnose issues.

## Logging structure

Snowpark Checkpoints follows a module-level logging approach, where each Python module that needs to log messages defines its own logger, and the logger’s name matches the module’s fully qualified name.

Each Snowpark Checkpoints package defines a top-level logger, which is named after the package itself and acts as the parent for all module-level loggers within that package. The top-level logger is initialized with a [NullHandler](https://docs.python.org/3/library/logging.handlers.html#nullhandler), ensuring that the logger does not produce output that it wasn’t explicitly configured to produce. Any logging configuration applied to the top-level logger automatically applies to all the module loggers within that package.

Top-level logger names of Snowpark Checkpoints:

| Package name | Top-level logger name |
| --- | --- |
| `snowpark-checkpoints-collectors` | `snowflake.snowpark_checkpoints_collector` |
| `snowpark-checkpoints-validators` | `snowflake.snowpark_checkpoints` |
| `snowpark-checkpoints-configuration` | `snowflake.snowpark_checkpoints_configuration` |
| `snowpark-checkpoints-hypothesis` | `snowflake.hypothesis_snowpark` |

This module-level approach allows for fine-grained control over logging output and ensures that logs inherit settings from a higher-level logger while emitting precise information about their origin.

## Logging configuration

Snowpark Checkpoints does not provide a default logging configuration. You must explicitly configure logging in your application to see the log messages.

If your application already has a logging configuration using Python’s built-in logging module, then you should be able to see the log messages emitted by Snowpark Checkpoints without any additional configuration. If you do not have a logging configuration, you can set up logging using the [basicConfig](https://docs.python.org/3/library/logging.html#logging.basicConfig) function or by creating a custom configuration.

It is advisable to configure logging once at the entry point of your application, such as in the main script or the module that initializes your application. This ensures that logging is set up before any library components are used. If the library is used within a standalone script, logging should be set up at the beginning of that script. Below are some examples to help you get started:

## Basic logging configuration

The simplest and quickest way to enable logging is by using the [basicConfig](https://docs.python.org/3/library/logging.html#logging.basicConfig) function. This function allows you to configure the root logger, which is the ancestor of all loggers in the logging module hierarchy.

The following example demonstrates how to set up the root logger to capture log messages at the specified log level and above and print them to the console:

```python
import logging

logging.basicConfig(
  level=logging.DEBUG, # Adjust the log level as needed
  format="%(asctime)s - %(name)s - %(levelname)s - %(message)s"
)
```

## Advanced logging configuration

For more advanced logging configurations, you can use the [logging.config](https://docs.python.org/3/library/logging.config.html) module to set up logging. This approach allows you to define custom loggers, handlers, and formatters, and configure them using a dictionary.

The following example demonstrates how to set up the root logger using a custom configuration that logs messages to the console and to a file:

```python
import logging.config
from datetime import datetime

LOGGING_CONFIG = {
  "version": 1,
  "disable_existing_loggers": False,
  "formatters": {
      "standard": {
          "format": "{asctime} - {name} - {levelname} - {message}",
          "style": "{",
          "datefmt": "%Y-%m-%d %H:%M:%S",
      },
  },
  "handlers": {
      "console": {
          "class": "logging.StreamHandler",
          "formatter": "standard",
          "level": "DEBUG",  # Adjust the log level as needed
      },
      "file": {
          "class": "logging.FileHandler",
          "formatter": "standard",
          "filename": f"app_{datetime.now().strftime('%Y-%m-%d_%H-%M-%S')}.log",
          "level": "DEBUG",  # Adjust the log level as needed
          "encoding": "utf-8",
      },
  },
  "root": {
      "handlers": ["console", "file"],
      "level": "DEBUG",  # Adjust the log level as needed
  },
}

logging.config.dictConfig(LOGGING_CONFIG)
```

## Enable logging for specific packages

To configure logging for a specific package of Snowpark Checkpoints without affecting other loggers, you can use the top-level logger name for that package and apply any custom handlers and formatters according to your needs. Applying the configuration to the top-level logger ensures that all module-level loggers inherit that configuration.

The following example demonstrates how to configure logging just for the following packages:

* `snowpark-checkpoints-collectors`
* `snowpark-checkpoints-configuration`
* `snowpark-checkpoints-validators`
* `snowpark-checkpoints-hypothesis`

```python
import logging.config
from datetime import datetime

LOGGING_CONFIG = {
  "version": 1,
  "disable_existing_loggers": False,
  "formatters": {
      "standard": {
          "format": "{asctime} - {name} - {levelname} - {message}",
          "style": "{",
          "datefmt": "%Y-%m-%d %H:%M:%S",
      },
  },
  "handlers": {
      "console": {
          "class": "logging.StreamHandler",
          "formatter": "standard",
          "level": "DEBUG",  # Adjust the log level as needed
      },
      "file": {
          "class": "logging.FileHandler",
          "formatter": "standard",
          "filename": f"app_{datetime.now().strftime('%Y-%m-%d_%H-%M-%S')}.log",
          "level": "DEBUG",  # Adjust the log level as needed
          "encoding": "utf-8",
      },
  },
  "loggers": {
      "snowflake.snowpark_checkpoints_collector": {
          "handlers": ["console", "file"],
          "level": "DEBUG",  # Adjust the log level as needed
          "propagate": False,
      },
      "snowflake.snowpark_checkpoints": {
          "handlers": ["console", "file"],
          "level": "DEBUG",  # Adjust the log level as needed
          "propagate": False,
      },
      "snowflake.snowpark_checkpoints_configuration": {
          "handlers": ["console", "file"],
          "level": "DEBUG",  # Adjust the log level as needed
          "propagate": False,
      },
      "snowflake.hypothesis_snowpark": {
          "handlers": ["console", "file"],
          "level": "DEBUG",  # Adjust the log level as needed
          "propagate": False,
      },
  },
}

logging.config.dictConfig(LOGGING_CONFIG)
```

For more details on Python’s logging module, see the [Python logging documentation](https://docs.python.org/3/library/logging.html).

---
title: Snowpark Checkpoints library: Validators
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-validators.md
section: Snowpark
---

# Snowpark Checkpoints library: Validators

The Snowpark Checkpoints package offers a set of validations that can be applied to the Snowpark code to ensure behavioral equivalence against the PySpark code.

## Functions provided by the framework

* check_with_spark: A decorator that will convert any Snowpark DataFrame arguments to a function or sample and then to PySpark DataFrames. The check will then execute a provided spark function that mirrors the functionality of the new Snowpark function and compares the outputs between the two implementations. Assuming the spark function and Snowpark functions are semantically identical, this decorator verifies those functions on real, sampled data.

  Parameters:
  :   + `job_context` (SnowparkJobContext): The job context that contains configuration and details for the validation
      + `spark_function` (fn): The equivalent PySpark function to compare against the Snowpark implementation
      + `checkpoint_name` (str): A name for the checkpoint; defaults to None
      + `sample_number` (Optional[int], optional): The number of rows for validation; defaults to 100
      + `sampling_strategy` (Optional[SamplingStrategy], optional): The strategy used for sampling data; defaults to `SamplingStrategy.RANDOM_SAMPLE`
      + `output_path` (Optional[str], optional): The path to the file where the validation results are stored; defaults to None

  Example:

  ```python
   def original_spark_code_I_dont_understand(df):
   from pyspark.sql.functions import col, when

   ret = df.withColumn(
       "life_stage",
       when(col("byte") < 4, "child")
       .when(col("byte").between(4, 10), "teenager")
       .otherwise("adult"),
   )
   return ret

  @check_with_spark(
   job_context=job_context, spark_function=original_spark_code_I_dont_understand
  )
  def new_snowpark_code_I_do_understand(df):
    from snowflake.snowpark.functions import col, lit, when

    ref = df.with_column(
        "life_stage",
        when(col("byte") < 4, lit("child"))
        .when(col("byte").between(4, 10), lit("teenager"))
        .otherwise(lit("adult")),
   )
   return ref

   df1 = new_snowpark_code_I_do_understand(df)
  ```
* validate_dataframe_checkpoint: This function validates a Snowpark Dataframe against a specific checkpoint schema file or imported Dataframe according to the argument mode. It ensures that the information collected for that DataFrame and the DataFrame that is passed to the function are equivalent.

  Parameters:
  :   + `df` (SnowparkDataFrame): The DataFrame to validate
      + `checkpoint_name` (str): The name of the checkpoint to validate against
      + `job_context` (SnowparkJobContext, optional) (str): The job context for the validation; required for PARQUET mode
      + `mode` (CheckpointMode): The mode of validation (e.g., SCHEMA, PARQUET); defaults to SCHEMA
      + `custom_checks` (Optional[dict[Any, Any]], optional): Custom checks to apply during validation
      + `skip_checks` (Optional[dict[Any, Any]], optional): Checks to skip during validation
      + `sample_frac` (Optional[float], optional): Fraction of the DataFrame to sample for validation; defaults to 0.1
      + `sample_number` (Optional[int], optional): Number of rows to sample for validation
      + `sampling_strategy` (Optional[SamplingStrategy], optional): Strategy to use for sampling
      + `output_path` (Optional[str], optional): The path to the file where the validation results are stored

  Example:

  ```python
  # Check a schema/stats here!
  validate_dataframe_checkpoint(
      df1,
      "demo_add_a_column_dataframe",
      job_context=job_context,
      mode=CheckpointMode.DATAFRAME, # CheckpointMode.Schema)
  )
  ```

  Depending on the mode selected, the validation will use either the collected schema file or a Parquet-loaded Dataframe in Snowflake to verify the equivalence against the PySpark version.
* check-output_schema: This decorator validates the schema of a Snowpark function’s output and ensures that the output DataFrame conforms to a specified Pandera schema. It is particularly useful for enforcing data integrity and consistency in Snowpark pipelines. This decorator takes several parameters, including the Pandera schema to validate against, the checkpoint name, sampling parameters, and an optional job context. It wraps the Snowpark function and performs schema validation on the output DataFrame before returning the result.

  Example:

  ```python
  from pandas import DataFrame as PandasDataFrame
  from pandera import DataFrameSchema, Column, Check
  from snowflake.snowpark import Session
  from snowflake.snowpark import DataFrame as SnowparkDataFrame
  from snowflake.snowpark_checkpoints.checkpoint import check_output_schema
  from numpy import int8

  # Define the Pandera schema
  out_schema = DataFrameSchema(
  {
      "COLUMN1": Column(int8, Check.between(0, 10, include_max=True, include_min=True)),
      "COLUMN2": Column(float, Check.less_than_or_equal_to(-1.2)),
      "COLUMN3": Column(float, Check.less_than(10)),
  }
  )

  # Define the Snowpark function and apply the decorator
  @check_output_schema(out_schema, "output_schema_checkpoint")
  def preprocessor(dataframe: SnowparkDataFrame):
   return dataframe.with_column(
      "COLUMN3", dataframe["COLUMN1"] + dataframe["COLUMN2"]
  )

  # Create a Snowpark session and DataFrame
  session = Session.builder.getOrCreate()
  df = PandasDataFrame(
  {
      "COLUMN1": [1, 4, 0, 10, 9],
      "COLUMN2": [-1.3, -1.4, -2.9, -10.1, -20.4],
  }
  )

  sp_dataframe = session.create_dataframe(df)

  # Apply the preprocessor function
  preprocessed_dataframe = preprocessor(sp_dataframe)
  ```
* check_input_schema: This decorator validates the schema of a Snowpark function’s input arguments. This decorator ensures that the input DataFrame conforms to a specified Pandera schema before the function is executed. It is particularly useful for enforcing data integrity and consistency in Snowpark pipelines. This decorator takes several parameters, including the Pandera schema to validate against, the checkpoint name, sampling parameters, and an optional job context. It wraps the Snowpark function and performs schema validation on the input DataFrame before executing the function.

  Example:

  ```python
  from pandas import DataFrame as PandasDataFrame
  from pandera import DataFrameSchema, Column, Check
  from snowflake.snowpark import Session
  from snowflake.snowpark import DataFrame as SnowparkDataFrame
  from snowflake.snowpark_checkpoints.checkpoint import check_input_schema
  from numpy import int8

  # Define the Pandera schema
  input_schema = DataFrameSchema(
  {
      "COLUMN1": Column(int8, Check.between(0, 10, include_max=True, include_min=True)),
      "COLUMN2": Column(float, Check.less_than_or_equal_to(-1.2)),
  }
  )

  # Define the Snowpark function and apply the decorator
  @check_input_schema(input_schema, "input_schema_checkpoint")
  def process_dataframe(dataframe: SnowparkDataFrame):
  return dataframe.with_column(
      "COLUMN3", dataframe["COLUMN1"] + dataframe["COLUMN2"]
  )

  # Create a Snowpark session and DataFrame
  session = Session.builder.getOrCreate()
  df = PandasDataFrame(
  {
      "COLUMN1": [1, 4, 0, 10, 9],
      "COLUMN2": [-1.3, -1.4, -2.9, -10.1, -20.4],
  }
  )
  sp_dataframe = session.create_dataframe(df)

  # Apply the process_dataframe function
  processed_dataframe = process_dataframe(sp_dataframe)
  ```

## Statistics checks

Statistics validations are applied to the specific column type by default when the validation is run in `Schema` mode; these checks can be skipped with `skip_checks`.

| Column type | Default check |
| --- | --- |
| Numeric: `byte`, `short`, `integer`, `long`, `float`, and `double` | between: Validate whether the value is between the min or the max, including the min and max.  decimal_precision: If the value is decimal, this will check the decimal precision.  mean: Validate whether the mean of the columns falls within a specific range. |
| Boolean | isin: Validate whether the value is True or False.  True_proportion: Validate whether the proportion of the True values falls within a specific range.  False_proportion: Validate whether the proportion of the False values falls within a specific range. |
| Date: `date`, `timestamp`, and `timestamp_ntz` | between: Validate whether the value is between the min or the max, including the min and max. |
| Nullable: All supported types | Null_proportion: Validate the null proportion accordingly. |

## Skip checks

With this granular control for checks, you can skip column validation or specific checks for a column. With the parameter `skip_checks`, you can specify the particular column and which validation type you want to skip. The name of the check used to skip is the one associated with the check.

* `str_contains`
* `str_endswith`
* `str_length`
* `str_matches`
* `str_startswith`
* `in_range`
* `​​equal_to`
* `greater_than_or_equal_to`
* `greater_than`
* `less_than_or_equal_to`
* `less_than`
* `not_equal_to`
* `notin`
* `isin`

Example:

```python
df = pd.DataFrame(
{
      "COLUMN1": [1, 4, 0, 10, 9],
      "COLUMN2": [-1.3, -1.4, -2.9, -10.1, -20.4],
}
)

schema = DataFrameSchema(
{
      "COLUMN1": Column(int8, Check.between(0, 10, element_wise=True)),
      "COLUMN2": Column(
          float,
          [
              Check.greater_than(-20.5),
              Check.less_than(-1.0),
              Check(lambda x: x < -1.2),
          ],
      ),
}
)

session = Session.builder.getOrCreate()
sp_df = session.create_dataframe(df)
check_dataframe_schema(
  sp_df,
  schema,
  skip_checks={"COLUMN1": [SKIP_ALL], "COLUMN2": ["greater_than", "less_than"]},
)
```

## Custom checks

You can add additional checks to the schema generated from the JSON file with the `custom_checks` property. This adds the check to the pandera schema.

Example:

```python
df = pd.DataFrame(
  {
        "COLUMN1": [1, 4, 0, 10, 9],
        "COLUMN2": [-1.3, -1.4, -2.9, -10.1, -20.4],
  }
)

session = Session.builder.getOrCreate()
sp_df = session.create_dataframe(df)

# Those check will be added to the schema generate from the JSON file
result = validate_dataframe_checkpoint(
  sp_df,
  "checkpoint-name",
  custom_checks={
        "COLUMN1": [
            Check(lambda x: x.shape[0] == 5),
            Check(lambda x: x.shape[1] == 2),
    ],
    "COLUMN2": [Check(lambda x: x.shape[0] == 5)],
  },
)
```

## Sampling strategies

The provided code’s sampling process is designed to efficiently validate large DataFrames by taking a representative sample of the data. This approach helps perform schema validation without the need to process the entire dataset, which can be computationally expensive and time-consuming.

Parameters:
:   * `sample_frac`: This parameter specifies the fraction of the DataFrame to sample. For example, if `sample_frac` is set to 0.1, then 10 percent of the DataFrame rows will be sampled. This is useful when you want to validate a subset of the data to save on computational resources.
    * `sample_number`: This parameter specifies the exact number of rows to sample from the DataFrame. For example, if `sample_number` is set to 100, then 100 rows will be sampled from the DataFrame. This is useful when you want to validate a fixed number of rows regardless of the DataFrame size.

## Validation result

After any type of validation is executed, the result, whether it passes or fails, is saved into `checkpoint_validation_results.json`. This file is primarily used for the functionalities of the VSCode extension. It contains information about the status of the validation, timestamp, checkpoint name, number of the line where the execution of the function occurs, and the file.

It also logs the result into the default Snowflake account in a table called *SNOWPARK_CHECKPOINTS_REPORT*, which contains information about the validation result.

* `DATE`: Execute timestamp of the validation
* `JOB`: Name of the SnowparkJobContext
* `STATUS`: Status of the validation
* `CHECKPOINT`: Name of the checkpoint validated
* `MESSAGE`: Error message
* `DATA`: Data from the validation execution
* `EXECUTION_MODE`: Validation mode executed

---
title: Snowpark Developer Guide for Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/index.md
section: Snowpark
---

# Snowpark Developer Guide for Java

The Snowpark library provides an intuitive API for querying and processing data in a data pipeline. Using the Snowpark library, you can
build applications that process data in Snowflake without moving data to the system where your application code runs.

For an introduction to Snowpark, see [Snowpark API](../index.md).

## Get Started

[Setting Up Your Development Environment for Snowpark Java](setup.md)
:   Set up to build Snowpark apps using any of several development environments.

## Developer Guides

[Creating a Session for Snowpark Java](creating-session.md)
:   Establish a session with which you interact with the Snowflake database.

[Working with DataFrames in Snowpark Java](working-with-dataframes.md)
:   Query and process data with a `DataFrame` object.

[Creating User-Defined Functions (UDFs) for DataFrames in Java](creating-udfs.md)
:   Create user-defined functions (UDFs) using the Snowpark API.

[Creating stored procedures for DataFrames in Java](creating-sprocs.md)
:   Create stored procedures using the Snowpark API.

[Calling functions and stored procedures in Snowpark Java](calling-functions.md)
:   Use the Snowpark API to call system-defined functions, UDFs, and stored procedures.

[A Simple Example of Using Snowpark Java](example.md)
:   See example code for an application that prints information about tables in Snowflake.

[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md)
:   Record log messages and trace events in an event table for analysis later.

[Analyzing queries and troubleshooting with Snowpark Java](troubleshooting.md)
:   Troubleshoot your code with logging and by viewing underlying SQL.

## Reference

[Quick reference: Snowpark Java APIs for SQL commands](sql-to-snowpark.md)
:   Learn how SQL statements map to Snowpark APIs for common operations.

[Snowpark Library for Java API Reference](/developer-guide/snowpark/reference/java/index.md)
:   Read details about the classes and methods in the Snowpark API.

---
title: Snowpark Developer Guide for Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/index.md
section: Snowpark
---

# Snowpark Developer Guide for Python

The [Snowpark library](../index.md) provides an intuitive API for querying and processing data in a data pipeline.
Using the Snowpark library, you can build applications that process data in Snowflake without moving data to the system where your
application code runs. You can also automate data transformation and processing by writing stored procedures and scheduling those
procedures as tasks in Snowflake.

## Get Started

You can write Snowpark Python code in a local development environment or in a Python worksheet in Snowsight.

If you need to write a client application, set up a local development environment by doing the following:

1. Set up your preferred development environment to build Snowpark apps. See [Setting up your development environment for Snowpark Python](setup.md).
2. Establish a session to interact with the Snowflake database. See [Creating a Session for Snowpark Python](creating-session.md).

If you want to write a stored procedure to automate tasks in Snowflake, use Python worksheets in Snowsight.
See [Writing Snowpark Code in Python Worksheets](python-worksheets.md).

## Write Snowpark Python Code

You can query, process, and transform data in a variety of ways using Snowpark Python.

* Query and process data with a `DataFrame` object. See [Working with DataFrames in Snowpark Python](working-with-dataframes.md).
* Run your pandas code directly on your data in Snowflake. See [pandas on Snowflake](pandas-on-snowflake.md).
* Convert custom lambdas and functions to user-defined functions (UDFs) that you can call to process data.
  See [Creating User-Defined Functions (UDFs) for DataFrames in Python](creating-udfs.md).
* Write a user-defined tabular function (UDTF) that processes data and returns data in a set of rows with one or more columns.
  See [Creating User-Defined Table Functions (UDTFs) for DataFrames in Python](creating-udtfs.md).
* Write a stored procedure that you can call to process data, or automate with a task to build a data pipeline.
  See [Creating Stored Procedures for DataFrames in Python](creating-sprocs.md).

### Perform Machine Learning Tasks

You can use Snowpark Python to perform machine learning tasks like training models:

* Train machine learning models by writing stored procedures. See [Training Machine Learning Models with Snowpark Python](python-snowpark-training-ml.md).
* Train, score, and tune machine learning models using Snowpark Python stored procedures and deploy the trained models with user-defined functions.
  See [Machine Learning with Snowpark Python - Credit Card Approval Prediction](https://quickstarts.snowflake.com/guide/getting_started_snowpark_machine_learning/index.html) (Snowflake Quickstarts).

### Troubleshoot Snowpark Python Code

Troubleshoot your code with logging statements and by viewing the underlying SQL. See [Troubleshooting with Snowpark Python](troubleshooting.md).

### Record and Analyze Data About Code Execution

You can record log messages and trace events in an event table for later analysis. For more information, see
[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

## API Reference

The Snowpark for Python API reference contains extensive details about the available classes and methods.
See [Snowpark Library for Python API Reference](/developer-guide/snowpark/reference/python/latest/index).

The pandas on Snowflake API reference contains extensive details about the available classes and methods. See [Snowpark pandas API](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/index) .

For the list of changes to the API between versions, see [Snowpark Library for Python release notes](../../../release-notes/clients-drivers/snowpark-python.md).

---
title: Snowpark Developer Guide for Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/index.md
section: Snowpark
---

# Snowpark Developer Guide for Scala

The Snowpark library provides an intuitive API for querying and processing data in a data pipeline. Using the Snowpark library, you can
build applications that process data in Snowflake without moving data to the system where your application code runs.

For an introduction to Snowpark, see [Snowpark API](../index.md).

## Get Started

[Setting Up Your Development Environment for Snowpark Scala](setup.md)
:   Set up to build Snowpark apps using any of several development environments.

[Snowpark Library for Scala and Java release notes](../../../release-notes/clients-drivers/snowpark-scala-java.md)
:   Get the latest release notes.

### Quickstarts

[Getting Started With Snowpark in Scala](https://quickstarts.snowflake.com/guide/getting_started_with_snowpark_scala/index.html) (Snowflake Quickstarts)
:   Use a tutorial to learn the basics of Snowpark with Scala.

## Developer Guides

[Creating a Session for Snowpark Scala](creating-session.md)
:   Establish a session with which you interact with the Snowflake database.

[Working with DataFrames in Snowpark Scala](working-with-dataframes.md)
:   Query and process data with a `DataFrame` object.

[Creating User-Defined Functions (UDFs) for DataFrames in Scala](creating-udfs.md)
:   Create user-defined functions (UDFs) using the Snowpark API.

[Creating stored procedures for DataFrames in Scala](creating-sprocs.md)
:   Create stored procedures using the Snowpark API.

[Calling functions and stored procedures in Snowpark Scala](calling-functions.md)
:   Use the Snowpark API to call system-defined functions, UDFs, and stored procedures.

[A Simple Example of Using Snowpark Scala](example.md)
:   See example code for an application that prints information about tables in Snowflake.

[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md)
:   Record log messages and trace events in an event table for analysis later.

[Analyzing Queries and Troubleshooting with Snowpark Scala](troubleshooting.md)
:   Troubleshoot your code with logging and by viewing underlying SQL.

## Reference

[Quick reference: Snowpark Scala APIs for SQL commands](sql-to-snowpark.md)
:   Learn how SQL statements map to Snowpark APIs for common operations.

[Snowpark Library for Scala API Reference](/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/index.md)
:   Read details about the classes and methods in the Snowpark API.

---
title: Snowpark Migration Accelerator
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowpark-checkpoints-using-sma.md
section: Snowpark
---

# Snowpark Migration Accelerator

Snowpark Migration Accelerator (SMA) automatically generates the `checkpoint.json` file based on your workload, covering both input and output folders. This allows you to run Snowpark Checkpoints from the Snowflake Extension using the SMA-generated checkpoint files and results. For more information, see [Snowpark Migration Accelerator Documentation](https://docs.snowconvert.com/sma).

To configure SMA for Checkpoints generation, see [Snowpark Migration Accelerator Documentation](https://docs.snowconvert.com/sma).

---
title: Training Machine Learning Models with Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/python-snowpark-training-ml.md
section: Snowpark
---

# Training Machine Learning Models with Snowpark Python

This topic explains how to train machine learning (ML) models with Snowpark.

> **Note:**
>
> [Snowpark ML](../../snowflake-ml/overview.md) is a companion to Snowpark Python built specifically for
> machine learning in Snowflake. This topic still contains useful general information about machine learning with
> Snowpark Python, particularly if you prefer to write your own stored procedures for machine learning.

## Snowpark-Optimized Warehouses

Training machine learning (ML) models can sometimes be very resource intensive.
Snowpark-optimized warehouses are a type of Snowflake virtual warehouse that can be used for workloads
that require a large amount of memory and compute resources. For example, you can use them to train
an ML model using custom code on a single node.

These optimized warehouses can also benefit some UDF and UDTF scenarios.

For more information about how to create a Snowpark-optimized warehouse, see [Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md).

## Using Snowpark Python Stored Procedures for ML Training

[Snowpark Python stored procedures](../../stored-procedure/python/procedure-python-overview.md) can be used to run custom code using a Snowflake warehouse.
Snowpark-optimized warehouses make it possible to use Snowpark stored procedures to run
single-node ML training workloads directly in Snowflake.

A Python stored procedure can run nested queries, using the [Snowpark API for Python](index.md), to load and
transform the dataset, which is then loaded into the stored procedure memory to perform
pre-processing and ML training.
The trained model can be uploaded into a Snowflake stage, and can be used to create UDFs to perform inference.

While Snowpark-optimized warehouses can be used to execute pre-processing and training logic, it
may be necessary to execute nested queries in a separate warehouse to achieve better performance
and resource utilization. A separate query warehouse can be tuned and scaled independently based
on the dataset size.

### Guidelines

Follow these guidelines to perform single-node ML training workloads:

* Set WAREHOUSE_SIZE = MEDIUM to ensure that the Snowpark-optimized warehouse consists of 1 Snowpark-optimized node.
* Consider setting up the warehouse as [multi-cluster warehouse](../../../user-guide/warehouses-multicluster.md) to support the desired concurrency
  if needed.
* Consider using a separate warehouse for executing nested queries from the stored procedure:

  + Use the [session.use_warehouse()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.Session.use_warehouse) API
    to select the warehouse for the query inside the stored procedure.
* Don’t mix other workloads on the Snowpark-optimized warehouse that is used to run ML training stored procedures.

#### Example

The following example creates and uses a Snowpark-optimized warehouse. The example then creates a stored procedure that trains a linear regression model.
The stored procedure uses data in a table named `MARKETING_BUDGETS_FEATURES` (not shown here).

```sqlexample-python
CREATE OR REPLACE WAREHOUSE snowpark_opt_wh WITH
  WAREHOUSE_SIZE = 'MEDIUM'
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  MAX_CONCURRENCY_LEVEL = 1;

CREATE OR REPLACE PROCEDURE train()
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('snowflake-snowpark-python', 'scikit-learn', 'joblib')
  HANDLER = 'main'
AS $$
import os
from sklearn.compose import ColumnTransformer
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split, GridSearchCV
from joblib import dump

def main(session):
 # Load features
 df = session.table('MARKETING_BUDGETS_FEATURES').to_pandas()
 X = df.drop('REVENUE', axis = 1)
 y = df['REVENUE']

 # Split dataset into training and test
 X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state = 42)

 # Preprocess numeric columns
 numeric_features = ['SEARCH_ENGINE','SOCIAL_MEDIA','VIDEO','EMAIL']
 numeric_transformer = Pipeline(steps=[('poly',PolynomialFeatures(degree = 2)),('scaler', StandardScaler())])
 preprocessor = ColumnTransformer(transformers=[('num', numeric_transformer, numeric_features)])

 # Create pipeline and train
 pipeline = Pipeline(steps=[('preprocessor', preprocessor),('classifier', LinearRegression(n_jobs=-1))])
 model = GridSearchCV(pipeline, param_grid={}, n_jobs=-1, cv=10)
 model.fit(X_train, y_train)

 # Upload trained model to a stage
 model_file = os.path.join('/tmp', 'model.joblib')
 dump(model, model_file)
 session.file.put(model_file, "@ml_models",overwrite=True)

 # Return model R2 score on train and test data
 return {"R2 score on Train": model.score(X_train, y_train),"R2 score on Test": model.score(X_test, y_test)}
$$;
```

To call the stored procedure, execute the following command:

```sqlexample
CALL train();
```

> **Note:**
>
> Various other Snowpark Python demos are available in the
> [Snowflake-Labs GitHub repository](https://github.com/Snowflake-Labs/snowpark-python-demos).
> The [Advertising Spend and ROI Prediction](https://github.com/Snowflake-Labs/snowpark-python-demos/blob/main/Advertising-Spend-ROI-Prediction/Snowpark_For_Python.ipynb)
> example demonstrates how to create a stored procedure that trains a linear regression model.

---
title: Troubleshooting with Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/troubleshooting.md
section: Snowpark
---

# Troubleshooting with Snowpark Python

This topic provides some guidelines on troubleshooting problems when working with the Snowpark library.

## Changing the Logging Settings

By default, the Snowpark library logs `INFO` level messages to stdout. If you want to change these logging
settings, you can change the level to one of the
[supported levels](https://docs.python.org/3/library/logging.html#levels).

For example:

```output
>>> import logging
>>> import sys

>>> logging.basicConfig(stream=sys.stdout, level=logging.DEBUG)
```

---
title: Tutorial: Testing Python Snowpark
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/tutorials/testing-tutorial.md
section: Snowpark
---

App Development

# Tutorial: Testing Python Snowpark

## Introduction

This tutorial introduces the basics of testing your Snowpark Python code.

### What You Will Learn

In this tutorial, you will learn how to:

* Test your Snowpark code while connected to Snowflake.

  You can use standard testing utilities, like PyTest, to test your Snowpark Python UDFs, DataFrame transformations, and stored procedures.
* Test your Snowpark Python DataFrames locally without connecting to a Snowflake account by using the local testing framework.

  You can use the local testing framework to test locally, on your development machine, before deploying code changes.

### Prerequisites

To use the local testing framework:

You must use version 1.11.1 or higher of the Snowpark Python library. The supported versions of Python are:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

## Set Up the Project

In this section, you’ll clone the project repository and set up the environment you’ll need for the tutorial.

1. Clone the project repository.

   > ```bash
   > git clone https://github.com/Snowflake-Labs/sftutorial-snowpark-testing
   > ```
   >
   > If you do not have git installed, go to the repository page and download the contents by clicking Code » Download Contents.
2. Set environment variables with your account credentials. The Snowpark API will use these to authenticate to your Snowflake account.

   > ```bash
   > # Linux/MacOS
   > export SNOWSQL_ACCOUNT=<replace with your account identifier>
   > export SNOWSQL_USER=<replace with your username>
   > export SNOWSQL_ROLE=<replace with your role>
   > export SNOWSQL_PWD=<replace with your password>
   > export SNOWSQL_DATABASE=<replace with your database>
   > export SNOWSQL_SCHEMA=<replace with your schema>
   > export SNOWSQL_WAREHOUSE=<replace with your warehouse>
   > ```
   >
   > ```bash
   > # Windows/PowerShell
   > $env:SNOWSQL_ACCOUNT = "<replace with your account identifier>"
   > $env:SNOWSQL_USER = "<replace with your username>"
   > $env:SNOWSQL_ROLE = "<replace with your role>"
   > $env:SNOWSQL_PWD = "<replace with your password>"
   > $env:SNOWSQL_DATABASE = "<replace with your database>"
   > $env:SNOWSQL_SCHEMA = "<replace with your schema>"
   > $env:SNOWSQL_WAREHOUSE = "<replace with your warehouse>"
   > ```
   >
   > Optional: You can set this env var permanently by editing your bash profile (on Linux/MacOS) or using the System Properties menu (on Windows).
3. Create and activate a conda environment using Anaconda:

   > ```bash
   > conda env create --file environment.yml
   > conda activate snowpark-testing
   > ```
4. Create the sample table in your account by running `setup/create_table.py`. This Python script will create a database called CITIBIKE, a schema called PUBLIC, and a small table called TRIPS.

   > ```bash
   > python setup/create_table.py
   > ```

You’re now ready to move to the next section. In this section you:

* Cloned the tutorial repository.
* Created environment variables with your account information.
* Created a conda environment for the project.
* Connected to Snowflake using the Snowpark API and created a sample database, schema, and table.

## Try the Stored Procedure

The sample project includes a stored procedure handler (`sproc.py`) and three DataFrames transformer methods (`transformers.py`).
The stored procedure handler uses the UDF and DataFrame transformers to read from the source table, `CITIBIKE.PUBLIC.TRIPS`, and creates
two fact tables: `MONTH_FACTS` and `BIKE_FACTS`.

You can execute the stored procedure from the command line by running this command.

```bash
python project/sproc.py
```

Now that you’ve familiarized yourself with the project, in the next section you will set up the test directory and create a PyTest Fixture for the Snowflake session.

## Create a PyTest Fixture for the Snowflake Session

[PyTest fixtures](https://docs.pytest.org/en/6.2.x/fixture.html) are functions which are executed before a test (or module of tests), typically to provide data or connections to tests.
For this project, you will create a PyTest fixture which returns a Snowpark `Session` object. Your test cases will use this session to connect to Snowflake.

1. Create a `test` directory under the project root directory.

   > ```bash
   > mkdir test
   > ```
2. Under the `test` directory, create a new Python file named `conftest.py`. Within `conftest.py`, create a PyTest fixture for the `Session` object:

   > ```python
   > import pytest
   > from project.utils import get_env_var_config
   > from snowflake.snowpark.session import Session
   >
   > @pytest.fixture
   > def session() -> Session:
   >     return Session.builder.configs(get_env_var_config()).create()
   > ```

## Add Unit Tests for DataFrame Transformers

1. In the `test` directory, create a new Python file named `test_transformers.py`.
2. In the `test_transformers.py` file, import the transformer methods.

   > ```python
   > # test/test_transformers.py
   >
   > from project.transformers import add_rider_age, calc_bike_facts, calc_month_facts
   > ```
3. Next, create unit tests for these transformers. The typical convention is to create a method for each test with the name `test_<name of method>`. In our case, the tests will be:

   > ```python
   > # test/test_transformers.py
   > from project.transformers import add_rider_age, calc_bike_facts, calc_month_facts
   > def test_add_rider_age(session):
   >     ...
   >
   > def test_calc_bike_facts(session):
   >     ...
   >
   >
   > def test_calc_month_facts(session):
   >     ...
   > ```
   >
   > The `session` parameter in each test case refers to the PyTest fixture that you created in the previous section.
4. Now implement the test cases for each transformer. Use the following pattern.

   > 1. Create an input DataFrame.
   > 2. Create the expected output DataFrame.
   > 3. Pass the input DataFrame from step 1 into the transformer method.
   > 4. Compare the output of step 3 to the expected output from step 2.
   >
   > ```python
   > # test/test_transformers.py
   > from project.transformers import add_rider_age, calc_bike_facts, calc_month_facts
   > from snowflake.snowpark.types import StructType, StructField, IntegerType, FloatType
   >
   > def test_add_rider_age(session: Session):
   >     input = session.create_dataframe(
   >         [
   >             [1980],
   >             [1995],
   >             [2000]
   >         ],
   >         schema=StructType([StructField("BIRTH_YEAR", IntegerType())])
   >     )
   >
   >     expected = session.create_dataframe(
   >         [
   >             [1980, 43],
   >             [1995, 28],
   >             [2000, 23]
   >         ],
   >         schema=StructType([StructField("BIRTH_YEAR", IntegerType()), StructField("RIDER_AGE", IntegerType())])
   >     )
   >
   >     actual = add_rider_age(input)
   >     assert expected.collect() == actual.collect()
   >
   >
   > def test_calc_bike_facts(session: Session):
   >     input = session.create_dataframe([
   >             [1, 10, 20],
   >             [1, 5, 30],
   >             [2, 20, 50],
   >             [2, 10, 60]
   >         ],
   >         schema=StructType([
   >             StructField("BIKEID", IntegerType()),
   >             StructField("TRIPDURATION", IntegerType()),
   >             StructField("RIDER_AGE", IntegerType())
   >         ])
   >     )
   >
   >     expected = session.create_dataframe([
   >             [1, 2, 7.5, 25.0],
   >             [2, 2, 15.0, 55.0],
   >         ],
   >         schema=StructType([
   >             StructField("BIKEID", IntegerType()),
   >             StructField("COUNT", IntegerType()),
   >             StructField("AVG_TRIPDURATION", FloatType()),
   >             StructField("AVG_RIDER_AGE", FloatType())
   >         ])
   >     )
   >
   >     actual = calc_bike_facts(input)
   >     assert expected.collect() == actual.collect()
   >
   >
   > def test_calc_month_facts(session: Session):
   >     from patches import patch_to_timestamp
   >
   >     input = session.create_dataframe(
   >         data=[
   >             ['2018-03-01 09:47:00.000 +0000', 1, 10,  15],
   >             ['2018-03-01 09:47:14.000 +0000', 2, 20, 12],
   >             ['2018-04-01 09:47:04.000 +0000', 3, 6,  30]
   >         ],
   >         schema=['STARTTIME', 'BIKE_ID', 'TRIPDURATION', 'RIDER_AGE']
   >     )
   >
   >     expected = session.create_dataframe(
   >         data=[
   >             ['Mar', 2, 15, 13.5],
   >             ['Apr', 1, 6, 30.0]
   >         ],
   >         schema=['MONTH', 'COUNT', 'AVG_TRIPDURATION', 'AVG_RIDER_AGE']
   >     )
   >
   >     actual = calc_month_facts(input)
   >
   >     assert expected.collect() == actual.collect()
   > ```
5. You can now run PyTest to run all of the unit tests.

   > ```bash
   > pytest test/test_transformers.py
   > ```

## Add Integration Tests for Stored Procedures

Now that we have unit tests for the DataFrame transformer methods, let’s add an integration test for the stored procedure.
The test case will follow this pattern:

1. Create a table representing the input data to the stored procedure.
2. Create two DataFrames with the expected contents of the stored procedure’s two output tables.
3. Call the stored procedure.
4. Compare the actual output tables to the DataFrames from step 2.
5. Clean up: delete the input table from step 1 and the output tables from step 3.

Create a Python file named `test_sproc.py` under the `test` directory.

Import the stored procedure handler from the project directory and create a test case.

```python
# test/test_sproc.py
from project.sproc import create_fact_tables

def test_create_fact_tables(session):
    ...
```

Implement the test case, starting with the creation of the input table.

```python
# test/test_sproc.py
from project.sproc import create_fact_tables
from snowflake.snowpark.types import *

def test_create_fact_tables(session):
    DB = 'CITIBIKE'
    SCHEMA = 'TEST'

    # Set up source table
    tbl = session.create_dataframe(
        data=[
            [1983, '2018-03-01 09:47:00.000 +0000', 551, 30958],
            [1988, '2018-03-01 09:47:01.000 +0000', 242, 19278],
            [1992, '2018-03-01 09:47:01.000 +0000', 768, 18461],
            [1980, '2018-03-01 09:47:03.000 +0000', 690, 15533],
            [1991, '2018-03-01 09:47:03.000 +0000', 490, 32449],
            [1959, '2018-03-01 09:47:04.000 +0000', 457, 29411],
            [1971, '2018-03-01 09:47:08.000 +0000', 279, 28015],
            [1964, '2018-03-01 09:47:09.000 +0000', 546, 15148],
            [1983, '2018-03-01 09:47:11.000 +0000', 358, 16967],
            [1985, '2018-03-01 09:47:12.000 +0000', 848, 20644],
            [1984, '2018-03-01 09:47:14.000 +0000', 295, 16365]
        ],
        schema=['BIRTH_YEAR', 'STARTTIME', 'TRIPDURATION',    'BIKEID'],
    )

    tbl.write.mode('overwrite').save_as_table([DB, SCHEMA, 'TRIPS_TEST'], mode='overwrite')
```

Next, create DataFrames for the expected output tables.

```python
# test/test_sproc.py
from project.sproc import create_fact_tables
from snowflake.snowpark.types import *

def test_create_fact_tables(session):
    DB = 'CITIBIKE'
    SCHEMA = 'TEST'

    # Set up source table
    tbl = session.create_dataframe(
        data=[
            [1983, '2018-03-01 09:47:00.000 +0000', 551, 30958],
            [1988, '2018-03-01 09:47:01.000 +0000', 242, 19278],
            [1992, '2018-03-01 09:47:01.000 +0000', 768, 18461],
            [1980, '2018-03-01 09:47:03.000 +0000', 690, 15533],
            [1991, '2018-03-01 09:47:03.000 +0000', 490, 32449],
            [1959, '2018-03-01 09:47:04.000 +0000', 457, 29411],
            [1971, '2018-03-01 09:47:08.000 +0000', 279, 28015],
            [1964, '2018-03-01 09:47:09.000 +0000', 546, 15148],
            [1983, '2018-03-01 09:47:11.000 +0000', 358, 16967],
            [1985, '2018-03-01 09:47:12.000 +0000', 848, 20644],
            [1984, '2018-03-01 09:47:14.000 +0000', 295, 16365]
        ],
        schema=['BIRTH_YEAR', 'STARTTIME', 'TRIPDURATION',    'BIKEID'],
    )

    tbl.write.mode('overwrite').save_as_table([DB, SCHEMA, 'TRIPS_TEST'], mode='overwrite')

    # Expected values
    n_rows_expected = 12
    bike_facts_expected = session.create_dataframe(
        data=[
            [30958, 1, 551.0, 40.0],
            [19278, 1, 242.0, 35.0],
            [18461, 1, 768.0, 31.0],
            [15533, 1, 690.0, 43.0],
            [32449, 1, 490.0, 32.0],
            [29411, 1, 457.0, 64.0],
            [28015, 1, 279.0, 52.0],
            [15148, 1, 546.0, 59.0],
            [16967, 1, 358.0, 40.0],
            [20644, 1, 848.0, 38.0],
            [16365, 1, 295.0, 39.0]
        ],
        schema=StructType([
            StructField("BIKEID", IntegerType()),
            StructField("COUNT", IntegerType()),
            StructField("AVG_TRIPDURATION", FloatType()),
            StructField("AVG_RIDER_AGE", FloatType())
        ])
    ).collect()

    month_facts_expected = session.create_dataframe(
        data=[['Mar', 11, 502.18182, 43.00000]],
        schema=StructType([
            StructField("MONTH", StringType()),
            StructField("COUNT", IntegerType()),
            StructField("AVG_TRIPDURATION", DecimalType()),
            StructField("AVG_RIDER_AGE", DecimalType())
        ])
    ).collect()
```

And finally, call the stored procedure and read the output tables. Compare the actual tables against the DataFrame contents.

```python
# test/test_sproc.py
from project.sproc import create_fact_tables
from snowflake.snowpark.types import *

def test_create_fact_tables(session):
    DB = 'CITIBIKE'
    SCHEMA = 'TEST'

    # Set up source table
    tbl = session.create_dataframe(
        data=[
            [1983, '2018-03-01 09:47:00.000 +0000', 551, 30958],
            [1988, '2018-03-01 09:47:01.000 +0000', 242, 19278],
            [1992, '2018-03-01 09:47:01.000 +0000', 768, 18461],
            [1980, '2018-03-01 09:47:03.000 +0000', 690, 15533],
            [1991, '2018-03-01 09:47:03.000 +0000', 490, 32449],
            [1959, '2018-03-01 09:47:04.000 +0000', 457, 29411],
            [1971, '2018-03-01 09:47:08.000 +0000', 279, 28015],
            [1964, '2018-03-01 09:47:09.000 +0000', 546, 15148],
            [1983, '2018-03-01 09:47:11.000 +0000', 358, 16967],
            [1985, '2018-03-01 09:47:12.000 +0000', 848, 20644],
            [1984, '2018-03-01 09:47:14.000 +0000', 295, 16365]
        ],
        schema=['BIRTH_YEAR', 'STARTTIME', 'TRIPDURATION',    'BIKEID'],
    )

    tbl.write.mode('overwrite').save_as_table([DB, SCHEMA, 'TRIPS_TEST'], mode='overwrite')

    # Expected values
    n_rows_expected = 12
    bike_facts_expected = session.create_dataframe(
        data=[
            [30958, 1, 551.0, 40.0],
            [19278, 1, 242.0, 35.0],
            [18461, 1, 768.0, 31.0],
            [15533, 1, 690.0, 43.0],
            [32449, 1, 490.0, 32.0],
            [29411, 1, 457.0, 64.0],
            [28015, 1, 279.0, 52.0],
            [15148, 1, 546.0, 59.0],
            [16967, 1, 358.0, 40.0],
            [20644, 1, 848.0, 38.0],
            [16365, 1, 295.0, 39.0]
        ],
        schema=StructType([
            StructField("BIKEID", IntegerType()),
            StructField("COUNT", IntegerType()),
            StructField("AVG_TRIPDURATION", FloatType()),
            StructField("AVG_RIDER_AGE", FloatType())
        ])
    ).collect()

    month_facts_expected = session.create_dataframe(
        data=[['Mar', 11, 502.18182, 43.00000]],
        schema=StructType([
            StructField("MONTH", StringType()),
            StructField("COUNT", IntegerType()),
            StructField("AVG_TRIPDURATION", DecimalType()),
            StructField("AVG_RIDER_AGE", DecimalType())
        ])
    ).collect()

    # Call sproc, get actual values
    n_rows_actual = create_fact_tables(session, 'TRIPS_TEST')
    bike_facts_actual = session.table([DB, SCHEMA, 'bike_facts']).collect()
    month_facts_actual = session.table([DB, SCHEMA, 'month_facts']).collect()

    # Comparisons
    assert n_rows_expected == n_rows_actual
    assert bike_facts_expected == bike_facts_actual
    assert month_facts_expected ==  month_facts_actual
```

To run the test case, run `pytest` from the terminal.

```bash
pytest test/test_sproc.py
```

To run all the tests in the project, run `pytest` without any other options.

```bash
pytest
```

## Configure Local Testing

At this point you have a PyTest test suite for your DataFrame transformers and stored procedure.
In each test case, the `Session` fixture is used to connect to your Snowflake account, send the SQL from the Snowpark Python API, and retrieve the response.

Alternatively, you can use the local testing framework to run the transformations locally without a connection to Snowflake.
In large test suites, this can add up to significantly faster test execution. This section shows how to update the test suite to use the local testing framework functionality.

1. Begin by updating the PyTest `Session` fixture. We will add a command-line option to PyTest to switch between local and live testing modes.

   > ```python
   > # test/conftest.py
   >
   > import pytest
   > from project.utils import get_env_var_config
   > from snowflake.snowpark.session import Session
   >
   > def pytest_addoption(parser):
   >     parser.addoption("--snowflake-session", action="store", default="live")
   >
   > @pytest.fixture(scope='module')
   > def session(request) -> Session:
   >     if request.config.getoption('--snowflake-session') == 'local':
   >         return Session.builder.configs({'local_testing': True}).create()
   >     else:
   >         return Session.builder.configs(get_env_var_config()).create()
   > ```
2. We must first patch this method because not all built-in functions are supported with the local testing framework, for example the `monthname()` function used in the `calc_month_facts()` transformer.
   Create a file named `patches.py` under the tests directory. In this file, paste the following code.

   > ```python
   > from snowflake.snowpark.mock.functions import patch
   > from snowflake.snowpark.functions import monthname
   > from snowflake.snowpark.mock.snowflake_data_type import ColumnEmulator, ColumnType
   > from snowflake.snowpark.types import StringType
   > import datetime
   > import calendar
   >
   > @patch(monthname)
   > def patch_monthname(column: ColumnEmulator) -> ColumnEmulator:
   >     ret_column = ColumnEmulator(data=[
   >         calendar.month_abbr[datetime.datetime.strptime(row, '%Y-%m-%d %H:%M:%S.%f %z').month]
   >         for row in column])
   >     ret_column.sf_type = ColumnType(StringType(), True)
   >     return ret_column
   > ```
   >
   > The patch above accepts a single parameter, `column`, which is a `pandas.Series`-like object containing the rows of data within the column.
   > We then use a combination of methods from the Python modules `datetime` and `calendar` to emulate the functionality of the built-in `monthname()` column.
   > Finally, we set the return type to `String`, as the built-in method returns strings corresponding to the months (“Jan”, “Feb”, “Mar”, etc.).
3. Next, import this method into the tests for the DataFrame transformer and the stored procedure.

   > ```python
   > # test/test_transformers.py
   >
   > # No changes to the other unit test methods
   >
   > def test_calc_month_facts(request, session):
   >     # Add conditional to include the patch if local testing is being used
   >     if request.config.getoption('--snowflake-session') == 'local':
   >         from patches import patch_monthname
   >
   >     # No other changes
   > ```
4. Rerun `pytest` with the local flag.

   > ```bash
   > pytest test/test_transformers.py --snowflake-session local
   > ```
5. Now apply the same patch to the stored procedure test.

   > ```python
   > #test/test_sproc.py
   >
   > def test_create_fact_tables(request, session):
   >     # Add conditional to include the patch if local testing is being used
   >     if request.config.getoption('--snowflake-session') == 'local':
   >         from patches import patch_monthname
   >
   >     # No other changes required
   > ```
6. Re-run pytest with the local flag.

   > ```bash
   > pytest test/test_sproc.py --snowflake-session local
   > ```
7. To wrap things up, let’s compare the time taken to run the full test suite locally versus with a live connection.
   We will use the `time` command to measure the time taken for both commands. Let’s start with the live connection.

   > ```python
   > time pytest
   > ```
   >
   > In this case, the test suite took 7.89 seconds to run. (Your exact time may differ depending on your computer, network connection, and other factors.)
   >
   > ```output
   > =================================== test session starts ==========================
   > platform darwin -- Python 3.12.18, pytest-7.4.3, pluggy-1.3.0
   > rootdir: /Users/jfreeberg/Desktop/snowpark-testing-tutorial
   > configfile: pytest.ini
   > collected 4 items
   >
   > test/test_sproc.py .                                                             [ 25%]
   > test/test_transformers.py ...                                                    [100%]
   >
   > =================================== 4 passed in 6.86s =================================
   > pytest  1.63s user 1.86s system 44% cpu 7.893 total
   > ```
   >
   > Now let’s try with the local testing framework:
   >
   > ```bash
   > time pytest --snowflake-session local
   > ```
   >
   > With the local testing framework the test suite, execution only took 1 second!
   >
   > ```output
   > ================================== test session starts ================================
   > platform darwin -- Python 3.12.18, pytest-7.4.3, pluggy-1.3.0
   > rootdir: /Users/jfreeberg/Desktop/snowpark-testing-tutorial
   > configfile: pytest.ini
   > collected 4 items
   >
   > test/test_sproc.py .                                                             [ 25%]
   > test/test_transformers.py ...                                                    [100%]
   >
   > =================================== 4 passed in 0.10s ==================================
   > pytest --snowflake-session local  1.37s user 1.70s system 281% cpu 1.093 total
   > ```

## Learn More

You finished! Nicely done.

In this tutorial, you got an end-to-end view of how you can test your Python Snowpark code.
Along the way, you:

* **Created a PyTest fixture and added unit tests and integration tests.**

  + For more information, see [Writing Tests for Snowpark Python](../testing-python-snowpark.md).
* **Configured local testing**

  + For more information, see [Local testing framework](../testing-locally.md).

---
title: Using Snowpark Checkpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/checkpoints-using-checkpoints.md
section: Snowpark
---

# Using Snowpark Checkpoints

When setup is complete, you can execute checkpoints either directly by interacting with the Checkpoints library or through the Snowflake extension, which provides additional integration and automation capabilities.

The Snowpark Checkpoints Python package provides a range of functionalities to support the validation of migrated workloads. The following sections outline the key features and capabilities included in the package, along with guidance on how to use them effectively:

* [Collectors](checkpoints-collectors.md)
* [Validators](checkpoints-validators.md)
* [Hypothesis](checkpoints-hypothesis.md)
* [Logging](checkpoints-logging.md)
* [Using Databricks](checkpoints-databricks.md)

---
title: Using Snowpark to read data
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/reading-data.md
section: Snowpark
---

# Using Snowpark to read data

Whether your data sits in operational databases or arrives as files, Snowpark gives you a simple, Python-first way to pull it in, convert it to a DataFrame, and view it in Snowflake tables, so you can model, transform, and analyze without context switching.

## Reading data from external sources using Snowpark Python DB-API

Use standard Python DB-API 2.0 drivers to pull data from external databases (SQL Server, Oracle, PostgreSQL, MySQL, Databricks) directly into a Snowpark DataFrame. Snowpark Python DB-API can run from your client (*local* mode) or inside Snowflake using stored procedures or notebooks (with external access integration). The result behaves like any other DataFrame you can use to join, transform, and write to Snowflake tables. For more information, see [Using the Snowpark Python DB-API](reading-data-from-external-sources.md).

## Reading data from external sources using Snowpark Python JDBC

Use standard JDBC drivers provided by you to pull data from external databases directly into a Snowpark DataFrame. Snowpark Python JDBC can run from your client or inside Snowflake using stored procedures or notebooks. A UDTF is created to ingest your target data. The result behaves like any other DataFrame you can use to join, transform, and write to Snowflake tables. For more information, see [Using the Snowpark Python JDBC](snowpark-jdbc.md).

> **Note:**
>
> To use this feature, upload the JDBC driver to a stage, configure an external access integration, and ensure Snowflake can reach the source endpoint.

## Reading data from XML files using Snowpark XML RowTag Reader

Use Snowpark XML to read large staged XML files efficiently: the reader splits the file on `rowTag`, loads each match as one row, and maps child elements into `VARIANT` columns (preserving the nested structure) for immediate querying with Snowpark or SQL. You can also validate each row against an XSD with `PERMISSIVE` (quarantine invalid rows in `_corrupt_record`) or `FAILFAST` behavior. The output is a standard DataFrame you can transform and save to tables. For more information, see [Using the Snowpark XML RowTag Reader](snowpark-xml-rowtag-reader.md).

---
title: Using the Snowpark Python DB-API
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/reading-data-from-external-sources.md
section: Snowpark
---

# Using the Snowpark Python DB-API

With the Snowpark Python DB-API, Snowpark Python users can programmatically pull data from external databases into Snowflake. The DB-API includes:

* **Python DB-API support:** Connect to external databases using Python’s standard DB-API 2.0 drivers.
* **Streamlined setup:** Use `pip` to install the necessary drivers, with no need to manage additional dependencies.

With these APIs, you can seamlessly pull data into Snowflake tables and transform it using [Snowpark DataFrames](working-with-dataframes.md) for advanced analytics.

The [DB-API](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.dbapi) can be used in a similar way as the [Spark JDBC API](https://spark.apache.org/docs/3.5.4/sql-data-sources-jdbc.html). Most parameters are designed to be identical or similar for better parity. At the same time, Snowpark emphasizes a Python-first design with intuitive naming conventions that avoid JDBC-specific configurations. This provides Python developers with a familiar experience. For more information that compares the Snowpark Python DB-API with the Spark JDBC API, see the following table:

## DB-API parameters

| Parameter | Snowpark Python DB-API |
| --- | --- |
| `create_connection` | Function to create a Python DB-API connection |
| `table` | Specifies the table in the source database |
| `query` | SQL query wrapped as a subquery for reading data |
| `column` | Partitioning column for parallel reads |
| `lower_bound` | Lower bound for partitioning |
| `upper_bound` | Upper bound for partitioning |
| `num_partitions` | Number of partitions for parallelism |
| `query_timeout` | Timeout for SQL execution (in seconds) |
| `fetch_size` | Number of rows fetched per round trip |
| `custom_schema` | Custom schema for pulling data from external databases |
| `max_workers` | Number of workers for parallel fetching and pulling data from external databases |
| `predicates` | List of conditions for WHERE clause partitions |
| `session_init_statement` | Executes a SQL or PL/SQL statement upon session initialization |
| `udtf_configs` | Executes the workload using a Snowflake UDTF for better performance |
| `fetch_merge_count` | Number of fetched batches to be merged into a single Parquet file before it is uploaded |

## Understanding parallelism

The Snowpark Python DB-API has two underlying forms of ingestion mechanisms:

Local ingestion
:   In local ingestion, Snowpark first fetches data from external sources to your local environment, where the `dbapi()` function is called and
    converts them to Parquet files. Next, Snowpark uploads these Parquet files to a temporary Snowflake stage and copies them into a temporary
    table from the stage.

UDTF ingestion
:   In UDTF ingestion, all workloads run on the Snowflake server. Snowpark first creates a UDTF and executes it, and the UDTF directly
    ingests data into Snowflake and stores it in a temporary table.

The Snowpark Python DB-API also has two ways to parallelize and accelerate ingestion:

Partition column
:   This method divides source data into multiple partitions based on four parameters when users call `dbapi()`:

    * `column`
    * `lower_bound`
    * `upper_bound`
    * `num_partitions`

    These four parameters have to be set at the same time and `column` must be numeric or date type.

Predicates
:   This method divides source data into partitions based on parameter predicates, which are a list of expressions suitable for inclusion
    in `WHERE` clauses, where each expression defines a partition. Predicates provide a more flexible way of dividing partitions; for example,
    you can divide partitions on Boolean or non-numeric columns.

The Snowpark Python DB-API also allows the adjustment of parallelism level within a partition:

Fetch_size
:   Within a partition, the API fetches rows in chunks defined by fetch_size. These rows are written to Snowflake in parallel as they are
    fetched, which allows reading and writing to overlap and maximizes throughput.

By combining the listed methods of ingestion and parallelism, Snowflake has four ways of ingestion:

* **Local ingestion with partition column**

  ```python
  df_local_par_column = session.read.dbapi(
      create_connection,
      table="target_table",
      fetch_size=100000,
      num_partitions=4,
      column="ID",  # Swap with the column you want your partition based on
      upper_bound=10000,
      lower_bound=0
  )
  ```
* **Local ingestion with predicates**

  ```python
  df_local_predicates = session.read.dbapi(
      create_connection,
      table="target_table",
      fetch_size=100000,
      predicates=[
          "ID < 3",
          "ID >= 3"
      ]
  )
  ```
* **UDTF ingestion with partition column**

  ```python
  udtf_configs = {
      "external_access_integration": "<your external access integration>"
  }
  df_udtf_par_column = session.read.dbapi(
      create_connection,
      table="target_table",
      udtf_configs=udtf_configs,
      fetch_size=100000,
      num_partitions=4,
      column="ID",  # Swap with the column you want your partition based on
      upper_bound=10000,
      lower_bound=0
  )
  ```
* **UDTF ingestion with predicates**

  ```python
  udtf_configs = {
      "external_access_integration": "<your external access integration>"
  }

  df_udtf_predicates = session.read.dbapi(
      create_dbx_connection,
      table="target_table",
      udtf_configs=udtf_configs,
      fetch_size=100000,
      predicates=[
          "ID < 3",
          "ID >= 3"
      ]
  )
  ```

## SQL Server

To connect to SQL Server from Snowpark, you need the following three packages:

* Snowpark: [snowflake-snowpark-python[pandas]](https://pypi.org/project/snowflake-snowpark-python/)
* SQL Server ODBC Driver: [Microsoft ODBC Driver for SQL Server](https://learn.microsoft.com/en-us/sql/connect/odbc/microsoft-odbc-driver-for-sql-server)

  By installing the driver, you agree to Microsoft’s EULA.
* The open source pyodbc library: [pyodbc](https://pypi.org/project/pyodbc/)

The following code examples show how to connect to SQL Server from a Snowpark client and a stored procedure.

### Use the DB-API to connect to SQL Server from a Snowpark client

1. Install the Python SQL Driver:

   ```bash
   /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/master/install.sh)"
   brew tap microsoft/mssql-release https://github.com/Microsoft/homebrew-mssql-release
   brew update
   HOMEBREW_ACCEPT_EULA=Y brew install msodbcsql mssql-tools
   ```
2. Install `snowflake-snowpark-python[pandas]` and `pyodbc`:

   ```bash
   pip install snowflake-snowpark-python[pandas]
   pip install pyodbc
   ```
3. Define the factory method for creating a connection to SQL Server:

   ```python
   def create_sql_server_connection():
       import pyodbc
       SERVER = "<your host name>"
       PORT = <your port>
       UID = "<your user name>"
       PWD = "<your password>"
       DATABASE = "<your database name>"
       connection_str = (
           f"DRIVER={{ODBC Driver 18 for SQL Server}};"
           f"SERVER={SERVER}:{PORT};"
           f"UID={UID};"
           f"PWD={PWD};"
           f"DATABASE={DATABASE};"
           "TrustServerCertificate=yes"
           "Encrypt=yes"
           # Optional to identify source of queries
           "APP=snowflake-snowpark-python;"
       )
       connection = pyodbc.connect(connection_str)
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_sql_server_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_sql_server_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_sql_server_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # Swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_sql_server_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )
   ```

### Use the DB-API to connect to SQL Server from a stored procedure

1. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
2. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   -- Configure a secret to allow egress to the source endpoint

   CREATE OR REPLACE SECRET mssql_secret
   TYPE = PASSWORD
   USERNAME = 'mssql_username'
   PASSWORD = 'mssql_password';

   -- Configure a network rule to allow egress to the source endpoint

   CREATE OR REPLACE NETWORK RULE mssql_network_rule
   MODE = EGRESS
   TYPE = HOST_PORT
   VALUE_LIST = ('mssql_host:mssql_port');

   -- Configure an external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION mssql_access_integration
   ALLOWED_NETWORK_RULES = (mssql_network_rule)
   ALLOWED_AUTHENTICATION_SECRETS = (mssql_secret)
   ENABLED = true;
   ```
3. Use the DB-API to pull data from SQL Server in a Python stored procedure:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE sp_mssql_dbapi()
       RETURNS TABLE()
       LANGUAGE PYTHON
       RUNTIME_VERSION='3.11'
       HANDLER='run'
       PACKAGES=('snowflake-snowpark-python', 'pyodbc', 'msodbcsql')
       EXTERNAL_ACCESS_INTEGRATIONS = (mssql_access_integration)
       SECRETS = ('cred' = mssql_secret )
   AS $$

   # Get user name and password from mssql_secret

   import _snowflake
   username_password_object = _snowflake.get_username_password('cred')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   # Define a method to connect to SQL Server_hostname
   from snowflake.snowpark import Session
   def create_sql_server_connection():
       import pyodbc

       host = "<your host>"
       port = <your port>
       username = USER
       password = PASSWORD
       database = "<your database name>"
       connection_str = (
           f"DRIVER={{ODBC Driver 18 for SQL Server}};"
           f"SERVER={host},{port};"
           f"DATABASE={database};"
           f"UID={username};"
           f"PWD={password};"
           "TrustServerCertificate=yes"
           "Encrypt=yes"
           # Optional to identify source of queries
           "APP=snowflake-snowpark-python;"
       )

       connection = pyodbc.connect(connection_str)
       return connection

   def run(session: Session):
       # Feel free to combine local/udtf ingestion and partition column/predicates
       # as stated in the understanding parallelism section

       # Call dbapi to pull data from target table

       df = session.read.dbapi(
           create_sql_server_connection,
           table="target_table"
       )

       # Call dbapi to pull data from target query

       df_query = session.read.dbapi(
           create_sql_server_connection,
           query="select * from target_table"
       )

       # Pull data from target table with parallelism using partition column

       df_local_par_column = session.read.dbapi(
           create_sql_server_connection,
           table="target_table",
           fetch_size=100000,
           num_partitions=4,
           column="ID",  # swap with the column you want your partition based on
           upper_bound=10000,
           lower_bound=0
       )

       udtf_configs = {
           "external_access_integration": "<your external access integration>"
       }

       # Pull data from target table with udtf ingestion with parallelism using predicates

       df_udtf_predicates = session.read.dbapi(
           create_sql_server_connection,
           table="target_table",
           udtf_configs=udtf_configs,
           fetch_size=100000,
           predicates=[
               "ID < 3",
               "ID >= 3"
           ]
       )

       return df
   $$;

   CALL sp_mssql_dbapi();
   ```

### Use the DB-API to connect to SQL Server from a Snowflake notebook

1. From [Snowflake Notebook packages](../../../user-guide/ui-snowsight/notebooks-import-packages.md), select `snowflake-snowpark-python` and `pyodbc`.
2. In the Files pane, open the file `environment.yml`, and under Dependencies, add the following line of code after other entries:

   ```yaml
   - msodbcsql
   ```
3. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   -- Configure a secret to allow egress to the source endpoint

   CREATE OR REPLACE SECRET mssql_secret
   TYPE = PASSWORD
   USERNAME = 'mssql_username'
   PASSWORD = 'mssql_password';

   ALTER NOTEBOOK mynotebook SET SECRETS = ('snowflake-secret-object' = mssql_secret);

   -- Configure a network rule to allow egress to the source endpoint

   CREATE OR REPLACE NETWORK RULE mssql_network_rule
   MODE = EGRESS
   TYPE = HOST_PORT
   VALUE_LIST = ('mssql_host:mssql_port');

   -- Configure an external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION mssql_access_integration
   ALLOWED_NETWORK_RULES = (mssql_network_rule)
   ALLOWED_AUTHENTICATION_SECRETS = (mssql_secret)
   ENABLED = true;
   ```
4. [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md), and then restart the notebook session.
5. Use the DB-API to pull data from SQL Server in a Python cell of a Snowflake notebook:

   ```python
   # Get user name and password from mssql_secret

   import _snowflake
   username_password_object = _snowflake.get_username_password('snowflake-secret-object')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   import snowflake.snowpark.context
   session = snowflake.snowpark.context.get_active_session()

   def create_sql_server_connection():
       import pyodbc
       SERVER = SQL_SERVER_CONNECTION_PARAMETERS["SERVER"]
       UID = SQL_SERVER_CONNECTION_PARAMETERS["UID"]
       PWD = SQL_SERVER_CONNECTION_PARAMETERS["PWD"]
       DATABASE = "test_query_history"
       connection_str = (
           f"DRIVER={{ODBC Driver 18 for SQL Server}};"
           f"SERVER={SERVER};"
           f"UID={UID};"
           f"PWD={PWD};"
           f"DATABASE={DATABASE};"
           "TrustServerCertificate=yes;"
           "Encrypt=yes;"
           # Optional to identify source of queries
           "APP=snowflake-snowpark-python;"
       )
       connection = pyodbc.connect(connection_str)
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_sql_server_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_sql_server_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_sql_server_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_sql_server_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )

   # Save data into sf_table
   df.write.mode("overwrite").save_as_table('sf_table')
   ```

### Source tracing when using the DB-API to connect to SQL Server

1. Include a tag of Snowpark in your create connection function:

   ```python
   def create_sql_server_connection():
       import pyodbc
       SERVER = "<your host name>"
       PORT = <your port>
       UID = "<your user name>"
       PWD = "<your password>"
       DATABASE = "<your database name>"
       connection_str = (
           f"DRIVER={{ODBC Driver 18 for SQL Server}};"
           f"SERVER={SERVER}:{PORT};"
           f"UID={UID};"
           f"PWD={PWD};"
           f"DATABASE={DATABASE};"
           "TrustServerCertificate=yes"
           "Encrypt=yes"
           # include this parameter for source tracing
           "APP=snowflake-snowpark-python;"
       )
       connection = pyodbc.connect(connection_str)
       return connection
   ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   ```sqlexample
   SELECT
       s.session_id,
       s.program_name,
       r.status,
       t.text AS sql_text
   FROM sys.dm_exec_sessions s
   JOIN sys.dm_exec_requests r ON s.session_id = r.session_id
   CROSS APPLY sys.dm_exec_sql_text(r.sql_handle) AS t
   WHERE s.program_name = 'snowflake-snowpark-python';
   ```

## Oracle

To connect to Oracle from Snowpark, you need the following two packages:

* Snowpark: [snowflake-snowpark-python[pandas]](https://pypi.org/project/snowflake-snowpark-python/)
* The open source oracledb library: [oracledb](https://pypi.org/project/oracledb/)

The following code examples show how to connect to Oracle from a Snowpark client, stored procedures, and a Snowflake notebook.

### Use the DB-API to connect to Oracle from a Snowpark client

1. Install `snowflake-snowpark-python[pandas]` and `oracledb`:

   ```bash
   pip install snowflake-snowpark-python[pandas]
   pip install oracledb
   ```
2. Use the DB-API to pull data from Oracle and define the factory method for creating a connection to Oracle:

   ```python
   def create_oracle_db_connection():
       import oracledb
       HOST = "<your host>"
       PORT = <your port>
       SERVICE_NAME = "<your service name>"
       USER = "<your user name>"
       PASSWORD = "your password"
       DSN = f"{HOST}:{PORT}/{SERVICE_NAME}"
       connection = oracledb.connect(
           user=USER,
           password=PASSWORD,
           dsn=DSN
       )
       # Optional: include this parameter for source tracing
       connection.clientinfo = "snowflake-snowpark-python"
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_oracle_db_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )
   ```

### Use the DB-API to connect to Oracle from a stored procedure

1. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
2. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   -- Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   CREATE OR REPLACE SECRET ora_secret
   TYPE = PASSWORD
   USERNAME = 'ora_username'
   PASSWORD = 'ora_password';

   -- configure a network rule to allow egress to the source endpoint

   CREATE OR REPLACE NETWORK RULE ora_network_rule
   MODE = EGRESS
   TYPE = HOST_PORT
   VALUE_LIST = ('ora_host:ora_port');

   -- configure an external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION ora_access_integration
   ALLOWED_NETWORK_RULES = (ora_network_rule)
   ALLOWED_AUTHENTICATION_SECRETS = (ora_secret)
   ENABLED = true;
   ```
3. Use the Snowpark Python DB-API to pull data from Oracle in a Python stored procedure:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE sp_ora_dbapi()
       RETURNS TABLE()
       LANGUAGE PYTHON
       RUNTIME_VERSION='3.11'
       HANDLER='run'
       PACKAGES=('snowflake-snowpark-python', 'oracledb')
       EXTERNAL_ACCESS_INTEGRATIONS = (ora_access_integration)
       SECRETS = ('cred' = ora_secret )
   AS $$

   # Get user name and password from ora_secret
   import _snowflake
   username_password_object = _snowflake.get_username_password('cred')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   # Define the factory method for creating a connection to Oracle

   from snowflake.snowpark import Session

   def create_oracle_db_connection():
       import oracledb
       host = "ora_host"
       port = "ora_port"
       service_name = "ora_service"
       user = USER
       password = PASSWORD
       DSN = f"{host}:{port}/{service_name}"
       connection = oracledb.connect(
           user=USER,
           password=PASSWORD,
           dsn=DSN
       )
       # Optional: include this parameter for source tracing
       connection.clientinfo = "snowflake-snowpark-python"
       return connection

   def run(session: Session):
       # Feel free to combine local/udtf ingestion and partition column/predicates
       # as stated in the understanding parallelism section

       # Call dbapi to pull data from target table

       df = session.read.dbapi(
           create_oracle_db_connection,
           table="target_table"
       )

       # Call dbapi to pull data from target query

       df_query = session.read.dbapi(
           create_oracle_db_connection,
           query="select * from target_table"
       )

       # Pull data from target table with parallelism using partition column

       df_local_par_column = session.read.dbapi(
           create_oracle_db_connection,
           table="target_table",
           fetch_size=100000,
           num_partitions=4,
           column="ID",  # swap with the column you want your partition based on
           upper_bound=10000,
           lower_bound=0
       )

       udtf_configs = {
           "external_access_integration": "<your external access integration>"
       }

       # Pull data from target table with udtf ingestion with parallelism using predicates

       df_udtf_predicates = session.read.dbapi(
           create_oracle_db_connection,
           table="target_table",
           udtf_configs=udtf_configs,
           fetch_size=100000,
           predicates=[
               "ID < 3",
               "ID >= 3"
           ]
       )
       return df
   $$;

   CALL sp_ora_dbapi();
   ```

### Use the DB-API to connect to Oracle from a Snowflake notebook

1. From [Snowflake Notebook packages](../../../user-guide/ui-snowsight/notebooks-import-packages.md), select `snowflake-snowpark-python` and `oracledb`.
2. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
3. Configure the secret, a network rule, and EAI to allow egress to the source endpoint:

   ```sqlexample
   -- Configure the secret, a network rule to allow egress to the source endpoint, and EAI:
   CREATE OR REPLACE SECRET mysql_secret
       TYPE = PASSWORD
       USERNAME = 'mysql_username'
       PASSWORD = 'mysql_password';
   ALTER NOTEBOOK mynotebook SET SECRETS = ('snowflake-secret-object' = mysql_secret);

   -- configure a network rule to allow egress to the source endpoint

   CREATE OR REPLACE NETWORK RULE mysql_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('mysql_host:mysql_port');

   -- configure an external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION mysql_access_integration
       ALLOWED_NETWORK_RULES = (mysql_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (mysql_secret)
       ENABLED = true;
   ```
4. [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md), and then restart the notebook session.
5. Use the DB-API to pull data from Oracle in a Python cell of a Snowflake notebook:

   ```python
   # Get user name and password from ora_secret

   import _snowflake
   username_password_object = _snowflake.get_username_password('snowflake-secret-object')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   import snowflake.snowpark.context
   session = snowflake.snowpark.context.get_active_session()

   # Define the factory method for creating a connection to Oracle

   def create_oracle_db_connection():
       import oracledb
       host = "ora_host"
       port = "ora_port"
       service_name = "ora_service"
       user = USER
       password = PASSWORD
       DSN = f"{host}:{port}/{service_name}"
       connection = oracledb.connect(
           user=USER,
           password=PASSWORD,
           dsn=DSN,
       )
       # Optional: include this parameter for source tracing
       connection.clientinfo = "snowflake-snowpark-python"
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_oracle_db_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_oracle_db_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )

   # Save data into sf_table

   df_ora.write.mode("overwrite").save_as_table('sf_table')
   ```

### Source tracing when using the DB-API to connect to Oracle

1. Include a tag of Snowpark in your create connection function:

   ```python
   def create_oracle_db_connection():
       import oracledb
       HOST = "myhost"
       PORT = "myport"
       SERVICE_NAME = "myservice"
       USER = "myuser"
       PASSWORD = "mypassword"
       DSN = f"{HOST}:{PORT}/{SERVICE_NAME}"
       connection = oracledb.connect(
           user=USER,
           password=PASSWORD,
           dsn=DSN,
       )
       # include this parameter for source tracing
       connection.clientinfo = "snowflake-snowpark-python"
       return connection
   ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   ```sqlexample
   SELECT
       s.sid,
       s.serial#,
       s.username,
       s.module,
       q.sql_id,
       q.sql_text,
       q.last_active_time
   FROM
       v$session s
       JOIN v$sql q ON s.sql_id = q.sql_id
   WHERE
       s.client_info = 'snowflake-snowpark-python'
   ```

## PostgreSQL

To connect to PostgreSQL from Snowpark, you need the following two packages:

* Snowpark: [snowflake-snowpark-python[pandas]](https://pypi.org/project/snowflake-snowpark-python/)
* The open source psycopg2 library: [psycopg2](https://pypi.org/project/psycopg2/)

The following code examples show how to connect to PostgreSQL from a Snowpark client, stored procedures, and a Snowflake notebook.

### Use the DB-API to connect to PostgreSQL from a Snowpark client

1. Install `psycopg2`:

   ```bash
   pip install psycopg2
   ```
2. Define the factory method for creating a connection to PostgreSQL:

   ```python
   def create_pg_connection():
       import psycopg2
       connection = psycopg2.connect(
           host="pg_host",
           port=pg_port,
           dbname="pg_dbname",
           user="pg_user",
           password="pg_password",
           # Optional: include this parameter for source tracing
           application_name="snowflake-snowpark-python"
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_pg_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_pg_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_pg_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # Swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_pg_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )
   ```

### Use the DB-API to connect to PostgreSQL from a stored procedure

1. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
2. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   -- configure a secret

   CREATE OR REPLACE SECRET pg_secret
       TYPE = PASSWORD
       USERNAME = 'pg_username'
       PASSWORD = 'pg_password';

   -- configure a network rule.

   CREATE OR REPLACE NETWORK RULE pg_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('pg_host:pg_port');

   -- configure an external access integration.

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION pg_access_integration
       ALLOWED_NETWORK_RULES = (pg_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (pg_secret)
       ENABLED = true;
   ```
3. Use the Snowpark Python DB-API to pull data from PostgreSQL in a Python stored procedure:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE sp_pg_dbapi()
       RETURNS TABLE()
       LANGUAGE PYTHON
       RUNTIME_VERSION='3.11'
       HANDLER='run'
       PACKAGES=('snowflake-snowpark-python', 'psycopg2')
       EXTERNAL_ACCESS_INTEGRATIONS = (pg_access_integration)
       SECRETS = ('cred' = pg_secret )
   AS $$

   # Get user name and password from pg_secret

   import _snowflake
   username_password_object = _snowflake.get_username_password('cred')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   # Define the factory method for creating a connection to PostgreSQL

   from snowflake.snowpark import Session

   def create_pg_connection():
       import psycopg2
       connection = psycopg2.connect(
           host="pg_host",
           port=pg_port,
           dbname="pg_dbname",
           user=USER,
           password=PASSWORD,
           # Optional: include this parameter for source tracing
           application_name="snowflake-snowpark-python"
       )
       return connection

   def run(session: Session):

       # Feel free to combine local/udtf ingestion and partition column/predicates
       # as stated in the understanding parallelism section

       # Call dbapi to pull data from target table

       df = session.read.dbapi(
           create_pg_connection,
           table="target_table"
       )

       # Call dbapi to pull data from target query

       df_query = session.read.dbapi(
           create_pg_connection,
           query="select * from target_table"
       )

       # Pull data from target table with parallelism using partition column

       df_local_par_column = session.read.dbapi(
           create_pg_connection,
           table="target_table",
           fetch_size=100000,
           num_partitions=4,
           column="ID",  # swap with the column you want your partition based on
           upper_bound=10000,
           lower_bound=0
       )

       udtf_configs = {
           "external_access_integration": "<your external access integration>"
       }

       # Pull data from target table with udtf ingestion with parallelism using predicates

       df_udtf_predicates = session.read.dbapi(
           create_pg_connection,
           table="target_table",
           udtf_configs=udtf_configs,
           fetch_size=100000,
           predicates=[
               "ID < 3",
               "ID >= 3"
           ]
       )
       return df

   $$;
   CALL sp_pg_dbapi();
   ```

### Use the DB-API to connect to PostgreSQL from a Snowflake notebook

1. From [Snowflake Notebook packages](../../../user-guide/ui-snowsight/notebooks-import-packages.md), select `snowflake-snowpark-python` and `psycopg2`.
2. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
3. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   -- Configure the secret

   CREATE OR REPLACE SECRET pg_secret
       TYPE = PASSWORD
       USERNAME = 'pg_username'
       PASSWORD = 'pg_password';

   ALTER NOTEBOOK pg_notebook SET SECRETS = ('snowflake-secret-object' = pg_secret);

   -- Configure the network rule to allow egress to the source endpoint

   CREATE OR REPLACE NETWORK RULE pg_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('pg_host:pg_port');

   -- Configure external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION pg_access_integration
       ALLOWED_NETWORK_RULES = (pg_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (pg_secret)
       ENABLED = true;
   ```
4. [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md), and then restart the notebook session.
5. Use the DB-API to pull data from PostgreSQL in a Python cell of a Snowflake notebook:

   ```python
   # Get the user name and password from :code:`pg_secret`

   import _snowflake
   username_password_object = _snowflake.get_username_password('snowflake-secret-object')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   import snowflake.snowpark.context
   session = snowflake.snowpark.context.get_active_session()

   # Define the factory method for creating a connection to PostgreSQL

   def create_pg_connection():
       import psycopg2
       connection = psycopg2.connect(
           host="pg_host",
           port=pg_port,
           dbname="pg_dbname",
           user=USER,
           password=PASSWORD,
           # Optional: include this parameter for source tracing
           application_name="snowflake-snowpark-python"
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_pg_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_pg_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_pg_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_pg_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )

   # Save data into sf_table

   df.write.mode("overwrite").save_as_table('sf_table')
   # Get the user name and password from :code:`pg_secret`
   ```

### Source tracing when using the DB-API to connect to PostgreSQL

1. Include a tag of Snowpark in your create connection function:

   ```python
   def create_pg_connection():
       import psycopg2
       connection = psycopg2.connect(
           host="pg_host",
           port=pg_port,
           dbname="pg_dbname",
           user="pg_user",
           password="pg_password",
           # Include this parameter for source tracing
           application_name="snowflake-snowpark-python"
       )
       return connection
   ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   ```sqlexample
   SELECT
       pid,
       usename AS username,
       datname AS database,
       application_name,
       client_addr,
       state,
       query_start,
       query
   FROM
       pg_stat_activity
   WHERE
       application_name = 'snowflake-snowpark-python';
   ```

## MySQL

To connect to MySQL from Snowpark, you need the following two packages:

* Snowpark: [snowflake-snowpark-python[pandas]](https://pypi.org/project/snowflake-snowpark-python/)
* The open source pymysql library: [PyMySQL](https://pypi.org/project/PyMySQL/)

The following code examples show how to connect to MySQL from a Snowpark client, stored procedures, and a Snowflake notebook.

### Use the DB-API to connect to MySQL from a Snowpark client

1. Install pymysql:

   ```bash
   pip install snowflake-snowpark-python[pandas]
   pip install pymysql
   ```
2. Define the factory method for creating a connection to MySQL:

   ```python
   def create_mysql_connection():
       import pymysql
       connection = pymysql.connect(
           host="mysql_host",
           port=mysql_port,
           database="mysql_db",
           user="mysql_user",
           password="mysql_password",
           # Optional: include this parameter for source tracing
           init_command="SET @program_name='snowflake-snowpark-python';"
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_mysql_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_mysql_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_mysql_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_mysql_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )
   ```

### Use the DB-API to connect to MySQL from a stored procedure

1. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
2. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   CREATE OR REPLACE SECRET mysql_secret
       TYPE = PASSWORD
       USERNAME = 'mysql_username'
       PASSWORD = 'mysql_password';

   -- configure a network rule.

   CREATE OR REPLACE NETWORK RULE mysql_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('mysql_host:mysql_port');

   -- configure an external access integration

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION mysql_access_integration
       ALLOWED_NETWORK_RULES = (mysql_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (mysql_secret)
           ENABLED = true;
   ```
3. Use the Snowpark Python DB-API to pull data from MySQL in a Python stored procedure:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE sp_mysql_dbapi()
       RETURNS TABLE()
       LANGUAGE PYTHON
       RUNTIME_VERSION='3.11'
       HANDLER='run'
       PACKAGES=('snowflake-snowpark-python', 'pymysql')
       EXTERNAL_ACCESS_INTEGRATIONS = (mysql_access_integration)
       SECRETS = ('cred' = mysql_secret )
   AS $$

   # Get user name and password from mysql_secret

   import _snowflake
       username_password_object = _snowflake.get_username_password('cred')
       USER = username_password_object.username
       PASSWORD = username_password_object.password

   # Define the factory method for creating a connection to MySQL

   from snowflake.snowpark import session

   def create_mysql_connection():
       import pymysql
       connection = pymysql.connect(
           host="mysql_host",
           port=mysql_port,
           dbname="mysql_dbname",
           user=USER,
           password=PASSWORD,
           # Optional: include this parameter for source tracing
           init_command="SET @program_name='snowflake-snowpark-python';"
       )
       return connection

   # Using Snowpark Python DB-API to pull data from MySQL in a Python stored procedure.

   def run(session: Session):
       # Feel free to combine local/udtf ingestion and partition column/predicates
       # as stated in the understanding parallelism section

       # Call dbapi to pull data from target table

       df = session.read.dbapi(
           create_mysql_connection,
           table="target_table"
       )

       # Call dbapi to pull data from target query

       df_query = session.read.dbapi(
           create_mysql_connection,
           query="select * from target_table"
       )

       # Pull data from target table with parallelism using partition column

       df_local_par_column = session.read.dbapi(
           create_mysql_connection,
           table="target_table",
           fetch_size=100000,
           num_partitions=4,
           column="ID",  # swap with the column you want your partition based on
           upper_bound=10000,
           lower_bound=0
       )

       udtf_configs = {
           "external_access_integration": "<your external access integration>"
       }

       # Pull data from target table with udtf ingestion with parallelism using predicates

       df_udtf_predicates = session.read.dbapi(
           create_mysql_connection,
           table="target_table",
           udtf_configs=udtf_configs,
           fetch_size=100000,
           predicates=[
               "ID < 3",
               "ID >= 3"
           ]
       )
       return df
   $$;

   CALL sp_mysql_dbapi();
   ```

### Use the DB-API to connect to MySQL from a Snowflake notebook

1. From [Snowflake Notebook packages](../../../user-guide/ui-snowsight/notebooks-import-packages.md), select `snowflake-snowpark-python` and `pymysql`.
2. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
3. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   CREATE OR REPLACE SECRET mysql_secret
       TYPE = PASSWORD
       USERNAME = 'mysql_username'
       PASSWORD = 'mysql_password';

   ALTER NOTEBOOK mynotebook SET SECRETS = ('snowflake-secret-object' = mysql_secret);

   -- configure a network rule.
   CREATE OR REPLACE NETWORK RULE mysql_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('mysql_host:mysql_port');

   -- configure an EAI
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION mysql_access_integration
       ALLOWED_NETWORK_RULES = (mysql_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (mysql_secret)
       ENABLED = true;
   ```
4. [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md), and then restart the notebook session.
5. Use the DB-API to pull data from MySQL in a Python cell of a Snowflake notebook:

   ```python
   # Get user name and password from mysql_secret
   import _snowflake
   username_password_object = _snowflake.get_username_password('snowflake-secret-object')
   USER = username_password_object.username
   PASSWORD = username_password_object.password

   import snowflake.snowpark.context
   session = snowflake.snowpark.context.get_active_session()

   # Define the factory method for creating a connection to MySQL

   def create_mysql_connection():
       import pymysql
       connection = pymysql.connect(
           host="mysql_host",
           port=mysql_port,
           dbname="mysql_dbname",
           user=USER,
           password=PASSWORD,
           # Optional: include this parameter for source tracing
           init_command="SET @program_name='snowflake-snowpark-python';"
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_mysql_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_mysql_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_mysql_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_mysql_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )

   # Save data into sf_table

   df.write.mode("overwrite").save_as_table('sf_table')
   ```

### Source tracing when using the DB-API to connect to MySQL

1. Include a tag of Snowpark in your create connection function:

   ```python
   def create_mysql_connection():
       import pymysql
       connection = pymysql.connect(
           host="mysql_host",
           port=mysql_port,
           database="mysql_db",
           user="mysql_user",
           password="mysql_password",
           # include this parameter for source tracing
           init_command="SET @program_name='snowflake-snowpark-python';"
       )
       return connection
   ```
2. Run the following SQL in your data source to capture queries from Snowpark:

   ```sqlexample
   SELECT *
   FROM performance_schema.events_statements_history_long
   WHERE THREAD_ID = (
       SELECT THREAD_ID
       FROM performance_schema.events_statements_history_long
       WHERE SQL_TEXT = "SET @program_name='snowflake-snowpark-python'"
       ORDER BY EVENT_ID DESC
       LIMIT 1
   )
   ```

## Databricks

To connect to Databricks from Snowpark, you need the following two packages:

* Snowpark: [snowflake-snowpark-python[pandas]](https://pypi.org/project/snowflake-snowpark-python/)
* The open source psycopg2 library: [databricks-sql-connector](https://pypi.org/project/databricks-sql-connector/)

The following code examples show how to connect to Databricks from a Snowpark client, stored procedures, and a Snowflake notebook.

### Use the DB-API to connect to Databricks from a Snowpark client

1. Install databricks-sql-connector:

   ```bash
   pip install snowflake-snowpark-python[pandas]
   pip install databricks-sql-connector
   ```
2. Define the factory method for creating a connection to Databricks:

   ```python
   def create_dbx_connection():
       import databricks.sql
       connection = databricks.sql.connect(
           server_hostname=HOST,
           http_path=PATH,
           access_token=ACCESS_TOKEN
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_dbx_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_dbx_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_dbx_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_dbx_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )
   ```

### Use the DB-API to connect to Databricks from a stored procedure

1. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
2. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   CREATE OR REPLACE SECRET dbx_secret
       TYPE = GENERIC_STRING
       SECRET_STRING = 'dbx_access_token';

   CREATE OR REPLACE NETWORK RULE dbx_network_rule
       MODE = EGRESS
       TYPE = HOST_PORT
       VALUE_LIST = ('dbx_host:dbx_port');

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION dbx_access_integration
       ALLOWED_NETWORK_RULES = (dbx_network_rule)
       ALLOWED_AUTHENTICATION_SECRETS = (dbx_secret)
       ENABLED = true;
   ```
3. Use the Snowpark Python DB-API to pull data from Databricks in a Python stored procedure:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE sp_dbx_dbapi()
       RETURNS TABLE()
       LANGUAGE PYTHON
       RUNTIME_VERSION='3.11'
       HANDLER='run'
       PACKAGES=('snowflake-snowpark-python', 'databricks-sql-connector')
       EXTERNAL_ACCESS_INTEGRATIONS = (dbx_access_integration)
       SECRETS = ('cred' = dbx_secret )
   AS $$

   # Get user name and password from dbx_secret

   import _snowflake
   ACCESS_TOKEN = _snowflake.get_generic_secret_string('cred')

   from snowflake.snowpark import Session

   # Define the method for creating a connection to Databricks
   def create_dbx_connection():
       import databricks.sql
       connection = databricks.sql.connect(
           server_hostname="dbx_host",
           http_path="dbx_path",
           access_token=ACCESS_TOKEN,
       )
       return connection

   # Using Snowpark Python DB-API to pull data from DataBricks in a Python stored procedure.

   def run(session: Session):
       # Feel free to combine local/udtf ingestion and partition column/predicates
       # as stated in the understanding parallelism section

       # Call dbapi to pull data from target table

       df = session.read.dbapi(
           create_dbx_connection,
           table="target_table"
       )

       # Call dbapi to pull data from target query

       df_query = session.read.dbapi(
           create_dbx_connection,
           query="select * from target_table"
       )

       # Pull data from target table with parallelism using partition column

       df_local_par_column = session.read.dbapi(
           create_dbx_connection,
           table="target_table",
           fetch_size=100000,
           num_partitions=4,
           column="ID",  # swap with the column you want your partition based on
           upper_bound=10000,
           lower_bound=0
       )

       udtf_configs = {
           "external_access_integration": "<your external access integration>"
       }

       # Pull data from target table with udtf ingestion with parallelism using predicates

       df_udtf_predicates = session.read.dbapi(
           create_dbx_connection,
           table="target_table",
           udtf_configs=udtf_configs,
           fetch_size=100000,
           predicates=[
               "ID < 3",
               "ID >= 3"
           ]
       )
       return df

   $$;

   CALL sp_dbx_dbapi();
   ```

### Use the DB-API to connect to Databricks from a Snowflake notebook

1. From [Snowflake Notebook packages](../../../user-guide/ui-snowsight/notebooks-import-packages.md), select `snowflake-snowpark-python` and `databricks-sql-connector`.
2. Configure an external access integration (EAI), which is required to allow Snowflake to connect to the source endpoint.

   > **Note:**
   >
   > [PrivateLink](../../../user-guide/admin-security-privatelink.md) is recommended for secure data transfer, especially when you’re dealing with
   > sensitive information. Ensure that your Snowflake account has the necessary PrivateLink privileges enabled and that the
   > PrivateLink feature is configured and active in your Snowflake Notebook environment.
3. Configure the secret, a network rule to allow egress to the source endpoint, and EAI:

   ```sqlexample
   CREATE OR REPLACE SECRET dbx_secret
   TYPE = GENERIC_STRING
   SECRET_STRING = 'dbx_access_token';

   ALTER NOTEBOOK mynotebook SET SECRETS = ('snowflake-secret-object' = dbx_secret);

   CREATE OR REPLACE NETWORK RULE dbx_network_rule
   MODE = EGRESS
   TYPE = HOST_PORT
   VALUE_LIST = ('dbx_host:dbx_port');

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION dbx_access_integration
   ALLOWED_NETWORK_RULES = (dbx_network_rule)
   ALLOWED_AUTHENTICATION_SECRETS = (dbx_secret)
   ENABLED = true;
   ```
4. [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md), and then restart the notebook session.
5. Use the DB-API to pull data from Databricks in a Python cell of a Snowflake notebook:

   ```python
   # Get user name and password from dbx_secret

   import _snowflake
   ACCESS_TOKEN = _snowflake.get_generic_secret_string('cred')

   import snowflake.snowpark.context
   session = snowflake.snowpark.context.get_active_session()

   # Define the factory method for creating a connection to Databricks

   def create_dbx_connection():
       import databricks.sql
       connection = databricks.sql.connect(
           server_hostname="dbx_host",
           http_path="dbx_path",
           access_token=ACCESS_TOKEN,
       )
       return connection

   # Feel free to combine local/udtf ingestion and partition column/predicates as
   # stated in the understanding parallelism section

   # Call dbapi to pull data from target table

   df = session.read.dbapi(
       create_dbx_connection,
       table="target_table"
   )

   # Call dbapi to pull data from target query

   df_query = session.read.dbapi(
       create_dbx_connection,
       query="select * from target_table"
   )

   # Pull data from target table with parallelism using partition column

   df_local_par_column = session.read.dbapi(
       create_dbx_connection,
       table="target_table",
       fetch_size=100000,
       num_partitions=4,
       column="ID",  # swap with the column you want your partition based on
       upper_bound=10000,
       lower_bound=0
   )

   udtf_configs = {
       "external_access_integration": "<your external access integration>"
   }

   # Pull data from target table with udtf ingestion with parallelism using predicates

   df_udtf_predicates = session.read.dbapi(
       create_dbx_connection,
       table="target_table",
       udtf_configs=udtf_configs,
       fetch_size=100000,
       predicates=[
           "ID < 3",
           "ID >= 3"
       ]
   )

   # Save data into sf_table

   df.write.mode("overwrite").save_as_table('sf_table')
   ```

### Source tracing when using the DB-API to connect to Databricks

1. Include a tag of Snowpark in your create connection function:

   ```python
   def create_dbx_connection():
       import databricks.sql
       connection = databricks.sql.connect(
           server_hostname=HOST,
           http_path=PATH,
           access_token=ACCESS_TOKEN,
           # include this parameter for source tracing
           user_agent_entry="snowflake-snowpark-python"
       )
       return connection
   ```
2. Navigate to query history on the DataBricks console and search for the query whose source is `snowflake-snowpark-python`.

## Limitations

The Snowpark Python DB-API supports only Python DB-API 2.0–compliant drivers (for example, `pyodbc` or `oracledb`). JDBC drivers are not supported in this release.

---
title: Using the Snowpark Python JDBC
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowpark-jdbc.md
section: Snowpark
---

# Using the Snowpark Python JDBC

With the Snowpark Python JDBC, Snowpark Python users can programmatically pull data from external databases into Snowflake. This allows you to connect to external databases using JDBC drivers.

With these APIs, you can seamlessly pull data into Snowflake tables and transform it using [Snowpark DataFrames](working-with-dataframes.md) for advanced analytics.

The [JDBC](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.jdbc) can be used in a similar way as the Spark JDBC API. Most parameters are designed to be identical or similar for better parity. For more information that compares the Snowpark Python JDBC with the Spark JDBC API, see the following table:

## Snowpark JDBC parameters

| Parameter | Snowpark Python JDBC |
| --- | --- |
| `url` | A connection string used to connect to the external data source via the JDBC driver |
| `udtf_configs` | A dictionary containing the necessary configurations for the UDTF creation |
| `properties` | A dictionary containing the key-value pair that is needed during establishing JDBC connection |
| `table` | Table in the source database |
| `query` | SQL query wrapped as a subquery for reading data |
| `column` | Partitioning column for parallel reads |
| `lower_bound` | Lower bound for partitioning |
| `upper_bound` | Upper bound for partitioning |
| `num_partitions` | Number of partitions for parallelism |
| `query_timeout` | The timeout duration for SQL execution, measured in seconds. |
| `fetch_size` | Number of rows fetched per round trip |
| `custom_schema` | Custom schema for pulling data from external databases |
| `predicates` | List of conditions for WHERE clause partitions |
| `session_init_statement` | Executes a SQL or PL/SQL statement upon session initialization |

## Understanding parallelism

Snowpark Python JDBC currently has one form of underlying ingestion mechanism:

UDTF ingestion
:   All workloads run on the Snowflake server. Snowpark creates a Java UDTF and invoke it in parallel to ingest data into a Snowflake temporary table. Thus the `udtf_configs` parameter is required for this feature.

The Snowpark Python JDBC has two ways to parallelize and accelerate ingestion:

Partition column
:   This method divides source data into a number of partitions based on four parameters when users call `jdbc()`:

    * `column`
    * `lower_bound`
    * `upper_bound`
    * `num_partitions`

    These four parameters have to be set at the same time and the `column` must be numeric or date type.

Predicates
:   This method divides source data into partitions based on parameter predicates, which are a list of expressions suitable for inclusion in `WHERE` clauses, where each expression defines a partition. Predicates provide a more flexible way of dividing partitions; for example, you can divide partitions on boolean or non-numeric columns.

The Snowpark Python JDBC also allows adjusting parallelism level within a partition:

Fetch_size
:   Within a partition, the API fetches rows in chunks defined by `fetch_size`. These rows are written to Snowflake in parallel as they are fetched, which allows reading and writing to overlap and maximizes throughput.

## Using JDBC to ingest data from external data source

### Using JDBC to ingest data from a Snowpark client

1. Upload the JDBC driver jar file to a Snowflake stage using Snowpark or Snowsight

   > * Upload using Snowpark.
   >
   >   > In Snowpark, after creating a session, run the following code:
   >   >
   >   > > ```Python
   >   > > session.file.put("<your directory>/<your file name>", "@<your stage name>/<stage path>")
   >   > > ```
   > * Upload using Snowsight as described in the following steps.
   >
   >   > 1. In Snowsight, click on Catalog -> Database Explorer.
   >   > 2. In the left search bar of databases, click on [your database name] -> [your schema name] -> stages -> [your stage name].
   >   > 3. Click the “+File” button on the top right corner of the stage page.
2. Configure the secret, network rule, and external access integration.

   > ```SQL
   > -- Configure a secret to allow egress to the source endpoint
   > CREATE OR REPLACE SECRET <your secret>
   >     TYPE = PASSWORD
   >     USERNAME = '<your username>'
   >     PASSWORD = '<your password>';
   >
   > -- Configure a network rule to allow egress to the source endpoint
   > CREATE OR REPLACE NETWORK RULE <your network rule>
   >   TYPE = HOST_PORT
   >   MODE = EGRESS
   >   VALUE_LIST = ('<your host>:<your port>');
   >
   > -- Configure an external access integration
   > CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION <your integration>
   >   ALLOWED_NETWORK_RULES = (<your network rule>)
   >   ALLOWED_AUTHENTICATION_SECRETS = (<your secret>)
   >   ENABLED = true;
   > ```
3. Pull data from the target using Snowpark JDBC from a Snowpark client.

   > ```Python
   > connection_str=f"jdbc:<your dbms>://<your host>:<your port>/<your db>"
   > udtf_configs = {
   >     "external_access_integration": "<your integration>",
   >     "secret": "<your secret>",
   >     "imports": ["<your stage path to jdbc jar file>"]
   > }
   >
   > # Call jdbc to pull data from target table
   > df_table = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >     )
   >
   > # Call jdbc to pull data from target query
   > df_query = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         query="select * from <your table>",
   >     )
   >
   > # Pull data from target table with parallelism using partition column
   > df_table_partition_column = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >         fetch_size=100000,
   >         num_partitions=4,
   >         column="ID",
   >         upper_bound=10000,
   >         lower_bound=0
   >     )
   >
   > # Pull data from target table with parallelism using predicates
   > df_table_predicates = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >         fetch_size=100000,
   >         predicates = [
   >             "ID < 3",
   >             "ID >= 3"
   >         ]
   >     )
   > ```

### Using JDBC to ingest data from a stored procedure

1. Upload JDBC driver jar file to Snowflake stage using Snowsight

   > * In Snowsight, click on Catalog -> Database Explorer
   > * In the left search bar of databases, click [your database name] -> [your schema name] -> stages -> [your stage name].
   > * Click the “+File” button on the top right corner of the stage page.
2. Configure secret, network rule, and external access integration.

   > ```SQL
   > -- Configure a secret to allow egress to the source endpoint
   > CREATE OR REPLACE SECRET <your secret>
   >     TYPE = PASSWORD
   >     USERNAME = '<your username>'
   >     PASSWORD = '<your password>';
   >
   > -- Configure a network rule to allow egress to the source endpoint
   > CREATE OR REPLACE NETWORK RULE <your network rule>
   >   TYPE = HOST_PORT
   >   MODE = EGRESS
   >   VALUE_LIST = ('<your host>:<your port>');
   >
   > -- Configure an external access integration
   > CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION <your integration>
   >   ALLOWED_NETWORK_RULES = (<your network rule>)
   >   ALLOWED_AUTHENTICATION_SECRETS = (<your secret>)
   >   ENABLED = true;
   > ```
3. Pull data from target using Snowpark JDBC from a stored procedure.

   > ```sqlexample-python
   > CREATE OR REPLACE PROCEDURE sp_jdbc()
   > RETURNS STRING
   > LANGUAGE PYTHON
   > RUNTIME_VERSION = '3.10'
   > PACKAGES = ('snowflake-snowpark-python')
   > HANDLER = 'run'
   > AS
   > $$
   > import time
   > def run(session):
   >     connection_str=f"jdbc:<your dbms>://<your host>:<your port>/<your db>"
   >     udtf_configs = {
   >         "external_access_integration": "<your integration>",
   >         "secret": "<your secret>",
   >         "imports": ["<your stage path to jdbc jar file>"]
   >     }
   >
   >     # Call jdbc to pull data from target table
   >     df_table = session.read.jdbc(
   >             url=connection_str,
   >             udtf_configs=udtf_configs,
   >             table="<your table>",
   >         )
   >
   >     # Call jdbc to pull data from target query
   >     df_query = session.read.jdbc(
   >             url=connection_str,
   >             udtf_configs=udtf_configs,
   >             query="select * from <your table>",
   >         )
   >
   >     # Pull data from target table with parallelism using partition column
   >     df_table_partition_column = session.read.jdbc(
   >             url=connection_str,
   >             udtf_configs=udtf_configs,
   >             table="<your table>",
   >             fetch_size=100000,
   >             num_partitions=4,
   >             column="ID",
   >             upper_bound=10000,
   >             lower_bound=0
   >         )
   >
   >     # Pull data from target table with parallelism using predicates
   >     df_table_predicates = session.read.jdbc(
   >             url=connection_str,
   >             udtf_configs=udtf_configs,
   >             table="<your table>",
   >             fetch_size=100000,
   >             predicates = [
   >                 "ID < 3",
   >                 "ID >= 3"
   >             ]
   >         )
   >     df_table.write.save_as_table("snowflake_table", mode="overwrite")
   >     return f"success"
   >
   > $$
   > ;
   >
   > call sp_jdbc();
   > select * from snowflake_table ;
   > ```

### Using JDBC to ingest data from a Snowflake notebook

1. Upload JDBC driver jar file to Snowflake stage using Snowsight

   > * In Snowsight, click on Catalog -> Database Explorer
   > * In the left search bar of databases, click [your database name] -> [your schema name] -> stages -> [your stage name].
   > * Click the “+File” button on the top right corner of the stage page.
2. Configure secret, network rule, and external access integration.

   > ```SQL
   > -- Configure a secret to allow egress to the source endpoint
   > CREATE OR REPLACE SECRET <your secret>
   >     TYPE = PASSWORD
   >     USERNAME = '<your username>'
   >     PASSWORD = '<your password>';
   >
   > -- Configure a network rule to allow egress to the source endpoint
   > CREATE OR REPLACE NETWORK RULE <your network rule>
   >   TYPE = HOST_PORT
   >   MODE = EGRESS
   >   VALUE_LIST = ('<your host>:<your port>');
   >
   > -- Configure an external access integration
   > CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION <your integration>
   >   ALLOWED_NETWORK_RULES = (<your network rule>)
   >   ALLOWED_AUTHENTICATION_SECRETS = (<your secret>)
   >   ENABLED = true;
   > ```
3. Pull data from target using Snowpark JDBC from a Snowflake notebook.

   > ```Python
   > import snowflake.snowpark.context
   > session = snowflake.snowpark.context.get_active_session()
   > connection_str=f"jdbc:<your dbms>://<your host>:<your port>/<your db>"
   > udtf_configs = {
   >         "external_access_integration": "<your integration>",
   >         "secret": "<your secret>",
   >         "imports": ["<your stage path to jdbc jar file>"]
   >     }
   >
   > # Call jdbc to pull data from target table
   > df_table = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >     )
   >
   > # Call jdbc to pull data from target query
   > df_query = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         query="select * from <your table>",
   >     )
   >
   > # Pull data from target table with parallelism using partition column
   > df_table_partition_column = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >         fetch_size=100000,
   >         num_partitions=4,
   >         column="ID",
   >         upper_bound=10000,
   >         lower_bound=0
   >     )
   >
   > # Pull data from target table with parallelism using predicates
   > df_table_predicates = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >         fetch_size=100000,
   >         predicates = [
   >             "ID < 3",
   >             "ID >= 3"
   >         ]
   >     )
   > ```

## Source tracing

### Source tracing when using Snowpark JDBC connect to MySQL

1. Include a tag of Snowpark in your create connection function:

   > ```Python
   > connection_str="jdbc:mysql://<your host>:<your port>/<your db>?applicationName=snowflake-snowpark-python"
   > udtf_configs = {
   >     "external_access_integration": "<your integration>",
   >     "secret": "<your secret>",
   >     "imports": ["<your stage path to jdbc jar file>"]
   > }
   >
   > # Call dbapi to pull data from target table
   > df_table = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >     )
   > ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   > ```SQL
   > SELECT *
   > FROM performance_schema.events_statements_history_long
   > WHERE THREAD_ID = (
   >   SELECT THREAD_ID, NAME FROM performance_schema.threads WHERE NAME LIKE '%snowflake-snowpark-python%';
   > )
   > ```

### Source tracing when using Snowpark JDBC to connect to SQL Server

1. Include a tag of Snowpark in your create connection function:

   > ```Python
   > connection_str="jdbc:mssql://<your host>:<your port>/<your db>?applicationName=snowflake-snowpark-python"
   > udtf_configs = {
   > "external_access_integration": "<your integration>",
   > "secret": "<your secret>",
   > "imports": ["<your stage path to jdbc jar file>"]
   > }
   > # Call dbapi to pull data from target table
   > df_table = session.read.jdbc(
   >       url=connection_str,
   >       udtf_configs=udtf_configs,
   >       table="<your table>",
   >   )
   > ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   > ```SQL
   > SELECT
   >   s.session_id,
   >   s.program_name,
   >   r.status,
   >   t.text AS sql_text
   > FROM sys.dm_exec_sessions s
   > JOIN sys.dm_exec_requests r ON s.session_id = r.session_id
   > CROSS APPLY sys.dm_exec_sql_text(r.sql_handle) AS t
   > WHERE s.program_name = 'snowflake-snowpark-python';
   > ```

### Source tracing when using Snowpark JDBC to connect to PostgresSQL

1. Include a tag of Snowpark in your create connection function:

   > ```Python
   > connection_str="jdbc:postgres://<your host>:<your port>/<your db>?applicationName=snowflake-snowpark-python"
   > udtf_configs = {
   > "external_access_integration": "<your integration>",
   > "secret": "<your secret>",
   > "imports": ["<your stage path to jdbc jar file>"]
   > }
   >
   > # Call dbapi to pull data from target table
   > df_table = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >     )
   > ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   > ```SQL
   > SELECT
   >   pid,
   >   usename AS username,
   >   datname AS database,
   >   application_name,
   >   client_addr,
   >   state,
   >   query_start,
   >   query
   > FROM
   >   pg_stat_activity
   > WHERE
   >   application_name = 'snowflake-snowpark-python';
   > ```

### Source tracing when using Snowpark JDBC to connect to Oracle

1. Include a tag of Snowpark in your create connection function:

   > ```Python
   > connection_str="jdbc:oracle://<your host>:<your port>/<your db>?applicationName=snowflake-snowpark-python"
   > udtf_configs = {
   > "external_access_integration": "<your integration>",
   > "secret": "<your secret>",
   > "imports": ["<your stage path to jdbc jar file>"]
   > }
   > # Call dbapi to pull data from target table
   > df_table = session.read.jdbc(
   >         url=connection_str,
   >         udtf_configs=udtf_configs,
   >         table="<your table>",
   >     )
   > ```
2. Run the following SQL in your data source to capture queries from Snowpark that are still live:

   > ```SQL
   > SELECT
   >   sid,
   >   serial#,
   >   username,
   >   program,
   >   module,
   >   action,
   >   client_identifier,
   >   client_info,
   >   osuser,
   >   machine
   > FROM v$session
   > WHERE program = 'snowflake-snowpark-python';
   > ```

## Common DBMS and Type Support

The following is a certified list of data types of different DBMS systems. If your source data involves other data types, Snowpark Python JDBC will try to map them to best-effort Snowflake data types, or fall back to strings.

### Oracle

* INTEGER
* NUMBER
* BINARY_FLOAT
* BINARY_DOUBLE
* VARCHAR2
* CHAR
* CLOB
* NCHAR
* NVARCHAR2
* NCLOB
* DATE
* TIMESTAMP
* TIMESTAMP WITH TIME ZONE
* TIMESTAMP WITH LOCAL TIME ZONE
* RAW

### PostgresSQL

* BIGINT
* BIGSERIAL
* BIT
* BIT VARYING
* BOOLEAN
* BOX
* BYTEA
* CHAR
* VARCHAR
* CIDR
* CIRCLE
* DATE
* DOUBLE PRECISION
* INET
* INTEGER
* INTERVAL
* JSON
* JSONB
* LINE
* LSEG
* MACADDR
* POINT
* POLYGON
* REAL
* SMALLINT
* SMALLSERIAL
* SERIAL
* TEXT
* TIME
* TIMESTAMP
* TIMESTAMPTZ
* TSQUERY
* TSVECTOR
* TXID_SNAPSHOT
* UUID
* XML

### MySQL

* INT
* DECIMAL
* INT
* TINYINT
* SMALLINT
* MEDIUMINT
* BIGINT
* YEAR
* FLOAT
* DOUBLE
* CHAR
* VARCHAR
* TINYTEXT
* TEXT
* MEDIUMTEXT
* LONGTEXT
* ENUM
* SET
* BIT
* BINARY
* VARBINARY
* TINYBLOB
* BLOB
* MEDIUMBLOB
* LONGBLOB
* DATE
* DATETIME
* TIMESTAMP
* TIME
* JSON

### SQL Server

* INT
* BIGINT
* INT
* SMALLINT
* TINYINT
* BIT
* DECIMAL
* NUMERIC
* MONEY
* SMALLMONEY
* FLOAT
* REAL
* DATE
* TIME
* DATETIME
* DATETIME2
* SMALLDATETIME
* CHAR
* VARCHAR
* VARCHAR(MAX)
* TEXT
* NCHAR
* NVARCHAR
* NVARCHAR(MAX)
* NTEXT
* BINARY
* VARBINARY
* VARBINARY(MAX)
* IMAGE
* UNIQUEIDENTIFIER
* TIMESTAMP

### Databricks

Connecting to Databricks using Snowpark Python JDBC is currently not supported.

---
title: Using the Snowpark XML RowTag Reader
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/snowpark-xml-rowtag-reader.md
section: Snowpark
---

# Using the Snowpark XML RowTag Reader

You can activate the Snowpark XML RowTag Reader by specifying `.option("rowTag", "<rowtag>")` in `session.read.option("rowTag", "<rowtag>").xml()`. Instead of loading the entire document as a single object, this mode splits the file based on the specified `rowTag`, loads each matching element as a separate row, and splits each row into multiple columns in a Snowpark DataFrame. The Reader is especially useful for processing only selective elements in XML files or ingesting large XML files in a scalable, Snowpark-native way.

## Example

This sample XML is an example:

```xml
<library>
    <book id="1">
        <title>The Art of Snowflake</title>
        <author>Jane Doe</author>
        <price>29.99</price>
        <reviews>
            <review>
                <user>tech_guru_87</user>
                <rating>5</rating>
                <comment>Very insightful and practical.</comment>
            </review>
            <review>
                <user>datawizard</user>
                <rating>4</rating>
                <comment>Great read for data engineers.</comment>
            </review>
        </reviews>
        <editions>
            <edition year="2023" format="Hardcover"/>
            <edition year="2024" format="eBook"/>
        </editions>
    </book>

    <book id="2">
        <title>XML for Data Engineers</title>
        <author>John Smith</author>
        <price>35.50</price>
        <reviews>
            <review>
                <user>xml_master</user>
                <rating>5</rating>
                <comment>Perfect for mastering XML parsing.</comment>
            </review>
        </reviews>
        <editions>
            <edition year="2022" format="Paperback"/>
        </editions>
    </book>
</library>
```

### Snowpark script

```Python
df = session.read.option("rowTag", "book").xml("@mystage/books.xml")
```

This loads each `<book>` element from the XML file into its own row, with child elements (for example, `<title>` and `<author>`) automatically extracted as columns of type `VARIANT`.

### Output

| `_id` | `author` | `editions` | `price` | `reviews` | `title` |
| --- | --- | --- | --- | --- | --- |
| “2” | “John Smith” | `{ "edition": { "_format": "Paperback", "_year": "2022" } }` | “35.50” | `{ "review": { "comment": "Perfect for mastering XML parsing.", "rating": "5", "user": "xml_master" } }` | “XML for Data Engineers” |
| “1” | “Jane Doe” | `{ "edition": [ { "_format": "Hardcover", "_year": "2023" }, { "_format": "eBook", "_year": "2024" } ] }` | “29.99” | `{ "review": [ { "comment": "Very insightful and practical.", "rating": "5", "user": "tech_guru_87" }, { "comment": "Great read for data engineers.", "rating": "4", "user": "datawizard" } ] }` | “The Art of Snowflake” |

* Each XML element identified by `rowTag` becomes one row.
* Each sub-element within that tag becomes a column, stored as a `VARIANT`. Nested elements are captured as nested `VARIANT` data.
* The resulting DataFrame is flattened and columnized and behaves like any other Snowpark DataFrame.

## Getting started

1. Install the Snowpark Python package:

   ```shell
   pip install snowflake-snowpark-python
   ```
2. Upload your XML file to a Snowflake stage:

   ```sqlexample
   PUT file:///path/to/books.xml @mystage;
   ```
3. Use Snowpark to read the XML file:

   ```python
   df = session.read.option("rowTag", "book").xml("@mystage/books.xml")
   ```
4. Use DataFrame methods to transform or save:

   ```python
   df.select(col("`title`"), col("`author`")).show()
   df.write.save_as_table("books_table")
   ```

## Supported options

* `rowTag` (Required): The name of the XML element to extract as a row.
* `rowValidationXSDPath` (Optional): Stage path to an XSD used to validate each rowTag fragment during load.
* `mode` (Optional): Default behavior loads without validation. When `rowValidationXSDPath` is set:

  > + `PERMISSIVE`: Quarantines invalid rows in `_corrupt_record`; loads the rest.
  > + `FAILFAST`: Stops at the first invalid row and raises an error.

For more information about XML options, see [snowflake.snowpark.DataFrameReader.xml](/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.xml).

## Validate XML using XSD

* To validate each `rowTag` fragment against an XSD during load, set the XSD path and choose a validation mode:

  > ```python
  > df = (
  > session.read
  >     .option("rowTag", "book")
  >     .option("rowValidationXSDPath", "@mystage/schema.xsd")  # validates each row element
  >     .option("mode", "PERMISSIVE")                         # or "FAILFAST"
  >     .xml("@mystage/books.xml")
  > )
  > ```

`PERMISSIVE`: Invalid rows are quarantined in a special `_corrupt_record` column; valid rows load normally.

* To persist the result, write the DataFrame to a table with `df.write.save_as_table("<table_name>")`. The table will include all parsed columns plus an extra `_corrupt_record` column: it is `NULL` for valid rows and contains the full XML records for invalid rows (with the other columns showing `NULL`).

  > ```output
  > +-------------------+
  > | _corrupt_record   |
  > | <book id="1"> ... |
  > | <book id="2"> ... |
  > +-------------------+
  > ```

`FAILFAST`: The read stops at the first offending row and returns an error.

## Limitations

Snowpark XML RowTag Reader has the following limitations:

* Doesn’t infer schema, and the output columns are all of type `VARIANT`.
* Only supports files stored in Snowflake stages; local files are not supported.
* Is available only in the Snowpark Python library.

---
title: Working with DataFrames in Snowpark Java
source: https://docs.snowflake.com/en/developer-guide/snowpark/java/working-with-dataframes.md
section: Snowpark
---

# Working with DataFrames in Snowpark Java

In Snowpark, the main way in which you query and process data is through a DataFrame. This topic explains how to work with
DataFrames.

To retrieve and manipulate data, you use the [DataFrame](../reference/java/com/snowflake/snowpark_java/DataFrame.md) class. A DataFrame represents a relational dataset that is evaluated
lazily: it only executes when a specific action is triggered. In a sense, a DataFrame is like a query that needs to be evaluated
in order to retrieve data.

To retrieve data into a DataFrame:

1. Construct a DataFrame, specifying the source of the data for the dataset.

   For example, you can create a DataFrame to hold data from a table, an external CSV file, or the execution of a SQL statement.
2. Specify how the dataset in the DataFrame should be transformed.

   For example, you can specify which columns should be selected, how the rows should be filtered, how the results should be
   sorted and grouped, etc.
3. Execute the statement to retrieve the data into the DataFrame.

   In order to retrieve the data into the DataFrame, you must invoke a method that performs an action (for example, the
   `collect()` method).

The next sections explain these steps in more detail.

## Setting up the Examples for this Section

Some of the examples of this section use a DataFrame to query a table named `sample_product_data`. If you want to run these
examples, you can create this table and fill the table with some data by executing the following SQL statements:

```sqlexample
CREATE OR REPLACE TABLE sample_product_data (id INT, parent_id INT, category_id INT, name VARCHAR, serial_number VARCHAR, key INT, "3rd" INT, amount NUMBER(12, 2), quantity INT, product_date DATE);
INSERT INTO sample_product_data VALUES
    (1, 0, 5, 'Product 1', 'prod-1', 1, 10, 1.00, 15, TO_DATE('2021.01.01', 'YYYY.MM.DD')),
    (2, 1, 5, 'Product 1A', 'prod-1-A', 1, 20, 2.00, 30, TO_DATE('2021.02.01', 'YYYY.MM.DD')),
    (3, 1, 5, 'Product 1B', 'prod-1-B', 1, 30, 3.00, 45, TO_DATE('2021.03.01', 'YYYY.MM.DD')),
    (4, 0, 10, 'Product 2', 'prod-2', 2, 40, 4.00, 60, TO_DATE('2021.04.01', 'YYYY.MM.DD')),
    (5, 4, 10, 'Product 2A', 'prod-2-A', 2, 50, 5.00, 75, TO_DATE('2021.05.01', 'YYYY.MM.DD')),
    (6, 4, 10, 'Product 2B', 'prod-2-B', 2, 60, 6.00, 90, TO_DATE('2021.06.01', 'YYYY.MM.DD')),
    (7, 0, 20, 'Product 3', 'prod-3', 3, 70, 7.00, 105, TO_DATE('2021.07.01', 'YYYY.MM.DD')),
    (8, 7, 20, 'Product 3A', 'prod-3-A', 3, 80, 7.25, 120, TO_DATE('2021.08.01', 'YYYY.MM.DD')),
    (9, 7, 20, 'Product 3B', 'prod-3-B', 3, 90, 7.50, 135, TO_DATE('2021.09.01', 'YYYY.MM.DD')),
    (10, 0, 50, 'Product 4', 'prod-4', 4, 100, 7.75, 150, TO_DATE('2021.10.01', 'YYYY.MM.DD')),
    (11, 10, 50, 'Product 4A', 'prod-4-A', 4, 100, 8.00, 165, TO_DATE('2021.11.01', 'YYYY.MM.DD')),
    (12, 10, 50, 'Product 4B', 'prod-4-B', 4, 100, 8.50, 180, TO_DATE('2021.12.01', 'YYYY.MM.DD'));
```

To verify that the table was created, run:

```sqlexample
SELECT * FROM sample_product_data;
```

## Constructing a DataFrame

To construct a DataFrame, you can use methods in the `Session` class. Each of the following methods constructs a DataFrame
from a different type of data source:

* To create a DataFrame from data in a table, view, or stream, call the `table` method:

  ```java
  // Create a DataFrame from the data in the "sample_product_data" table.
  DataFrame dfTable = session.table("sample_product_data");

  // Print out the first 10 rows.
  dfTable.show();
  ```

  > **Note:**
  >
  > The `table` method returns an `Updatable` object. `Updatable` extends `DataFrame` and provides
  > additional methods for working with data in the table (e.g. methods for updating and deleting data). See
  > Updating, Deleting, and Merging Rows in a Table.
* To create a DataFrame from specified values:

  1. Construct an array of `Row` objects that contain the values.
  2. Construct a `StructType` object that describes the data types of those values.
  3. Call the `createDataFrame` method, passing in the array and `StructType` object.

  ```java
   // Import name from the types package, which contains StructType and StructField.
  import com.snowflake.snowpark_java.types.*;
  ...

   // Create a DataFrame containing specified values.
   Row[] data = {Row.create(1, "a"), Row.create(2, "b")};
   StructType schema =
     StructType.create(
       new StructField("num", DataTypes.IntegerType),
       new StructField("str", DataTypes.StringType));
   DataFrame df = session.createDataFrame(data, schema);

   // Print the contents of the DataFrame.
   df.show();
  ```

  > **Note:**
  >
  > Words reserved by Snowflake are not valid as column names when constructing a DataFrame. For a list of reserved words, refer to
  > [Reserved & limited keywords](../../../sql-reference/reserved-keywords.md).
* To create a DataFrame containing a range of values, call the `range` method:

  ```java
  // Create a DataFrame from a range
  DataFrame dfRange = session.range(1, 10, 2);

  // Print the contents of the DataFrame.
  dfRange.show();
  ```
* To create a DataFrame for a file in a stage, call `read` to get a
  `DataFrameReader` object. In the `DataFrameReader` object, call the method corresponding to the format of the data
  in the file:

  ```java
  // Create a DataFrame from data in a stage.
  DataFrame dfJson = session.read().json("@mystage2/data1.json");

  // Print the contents of the DataFrame.
  dfJson.show();
  ```
* To create a DataFrame to hold the results of a SQL query, call the `sql` method:

  ```java
  // Create a DataFrame from a SQL query
  DataFrame dfSql = session.sql("SELECT name from sample_product_data");

  // Print the contents of the DataFrame.
  dfSql.show();
  ```

  Note: Although you can use this method to execute SELECT statements that retrieve data from tables and staged files, you should
  use the `table` and `read` methods instead. Methods like `table` and `read` can provide better syntax
  highlighting, error highlighting, and intelligent code completion in development tools.

## Specifying How the Dataset Should Be Transformed

To specify which columns should be selected and how the results should be filtered, sorted, grouped, etc., call the DataFrame
methods that transform the dataset. To identify columns in these methods, use the `Functions.col` static method or an
expression that evaluates to a column. (See Specifying Columns and Expressions.)

For example:

* To specify which rows should be returned, call the `filter` method:

  ```java
  // Create a DataFrame for the rows with the ID 1
  // in the "sample_product_data" table.
  DataFrame df = session.table("sample_product_data").filter(
    Functions.col("id").equal_to(Functions.lit(1)));
  df.show();
  ```
* To specify the columns that should be selected, call the `select` method:

  ```java
  // Create a DataFrame that contains the id, name, and serial_number
  // columns in te "sample_product_data" table.
  DataFrame df = session.table("sample_product_data").select(
    Functions.col("id"), Functions.col("name"), Functions.col("serial_number"));
  df.show();
  ```

Each method returns a new DataFrame object that has been transformed. (The method does not affect the original DataFrame object.)
This means that if you want to apply multiple transformations, you can
chain method calls, calling each subsequent transformation method
on the new DataFrame object returned by the previous method call.

Note that these transformation methods do not retrieve data from the Snowflake database. (The action methods described in
Performing an Action to Evaluate a DataFrame perform the data retrieval.) The transformation methods simply specify how
the SQL statement should be constructed.

### Specifying Columns and Expressions

When calling these transformation methods, you might need to specify columns or expressions that use columns. For example, when
calling the `select` method, you need to specify the columns that should be selected.

To refer to a column, create a [Column](../reference/java/com/snowflake/snowpark_java/Column.md) object by calling the [Functions.col](../reference/java/com/snowflake/snowpark_java/Functions.md) static method.

```java
DataFrame dfProductInfo = session.table("sample_product_data").select(Functions.col("id"), Functions.col("name"));
dfProductInfo.show();
```

> **Note:**
>
> To create a `Column` object for a literal, see Using Literals as Column Objects.

When specifying a filter, projection, join condition, etc., you can use `Column` objects in an expression. For example:

* You can use `Column` objects with the `filter` method to specify a filter condition:

  ```java
  // Specify the equivalent of "WHERE id = 12"
  // in an SQL SELECT statement.
  DataFrame df = session.table("sample_product_data");
  df.filter(Functions.col("id").equal_to(Functions.lit(12))).show();
  ```

  ```java
  // Specify the equivalent of "WHERE key + category_id < 10"
  // in an SQL SELECT statement.
  DataFrame df2 = session.table("sample_product_data");
  df2.filter(Functions.col("key").plus(Functions.col("category_id")).lt(Functions.lit(10))).show();
  ```
* You can use `Column` objects with the `select` method to define an alias:

  ```java
  // Specify the equivalent of "SELECT key * 10 AS c"
  // in an SQL SELECT statement.
  DataFrame df3 = session.table("sample_product_data");
  df3.select(Functions.col("key").multiply(Functions.lit(10)).as("c")).show();
  ```
* You can use `Column` objects with the `join` method to define a join condition:

  ```java
  // Specify the equivalent of "sample_a JOIN sample_b on sample_a.id_a = sample_b.id_a"
  // in an SQL SELECT statement.
  DataFrame dfLhs = session.table("sample_a");
  DataFrame dfRhs = session.table("sample_b");
  DataFrame dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a")));
  dfJoined.show();
  ```

#### Referring to Columns in Different DataFrames

When referring to columns in two different DataFrame objects that have the same name (for example, joining the DataFrames on that
column), you can use the `col` method in each DataFrame object to refer to a column in that object (for example,
`df1.col("name")` and `df2.col("name")`).

The following example demonstrates how to use the `col` method to refer to a column in a specific DataFrame. The example
joins two DataFrame objects that both have a column named `value`. The example uses the `as` method of the `Column`
object to change the names of the columns in the newly created DataFrame.

```java
// Create a DataFrame that joins two other DataFrames (dfLhs and dfRhs).
// Use the DataFrame.col method to refer to the columns used in the join.
DataFrame dfLhs = session.table("sample_a");
DataFrame dfRhs = session.table("sample_b");
DataFrame dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a"))).select(dfLhs.col("value").as("L"), dfRhs.col("value").as("R"));
dfJoined.show();
```

### Using Double Quotes Around Object Identifiers (Table Names, Column Names, etc.)

The names of databases, schemas, tables, and stages that you specify must conform to the
[Snowflake identifier requirements](../../../sql-reference/identifiers-syntax.md). When you specify a name, Snowflake considers the
name to be in upper case. For example, the following calls are equivalent:

```java
// The following calls are equivalent:
df.select(Functions.col("id123"));
df.select(Functions.col("ID123"));
```

If the name does not conform to the identifier requirements, you must use double quotes (`"`) around the name. Use a backslash
(`\`) to escape the double quote character within a Scala string literal. For example, the following table name does not start
with a letter or an underscore, so you must use double quotes around the name:

```java
DataFrame df = session.table("\"10tablename\"");
```

Note that when specifying the name of a column, you don’t need to use double quotes around the name. The Snowpark library
automatically encloses the column name in double quotes for you if the name does not comply with the identifier requirements:.

```java
// The following calls are equivalent:
df.select(Functions.col("3rdID"));
df.select(Functions.col("\"3rdID\""));

// The following calls are equivalent:
df.select(Functions.col("id with space"));
df.select(Functions.col("\"id with space\""));
```

If you have already added double quotes around a column name, the library does not insert additional double quotes around the
name.

In some cases, the column name might contain double quote characters:

```sqlexample
describe table quoted;
+------------------------+ ...
| name                   | ...
|------------------------+ ...
| name_with_"air"_quotes | ...
| "column_name_quoted"   | ...
+------------------------+ ...
```

As explained in [Identifier requirements](../../../sql-reference/identifiers-syntax.md), for each double quote character within a double-quoted identifier, you
must use two double quote characters (e.g. `"name_with_""air""_quotes"` and `"""column_name_quoted"""`):

```java
DataFrame dfTable = session.table("quoted");
dfTable.select("\"name_with_\"\"air\"\"_quotes\"");
dfTable.select("\"\"\"column_name_quoted\"\"\"");
```

Keep in mind that when an identifier is enclosed in double quotes (whether you explicitly added the quotes or the library added
the quotes for you), [Snowflake treats the identifier as case-sensitive](../../../sql-reference/identifiers-syntax.md):

```java
// The following calls are NOT equivalent!
// The Snowpark library adds double quotes around the column name,
// which makes Snowflake treat the column name as case-sensitive.
df.select(Functions.col("id with space"));
df.select(Functions.col("ID WITH SPACE"));
```

### Using Literals as Column Objects

To use a literal in a method that passes in a `Column` object, create a `Column` object for the literal by passing
the literal to the `lit` static method in the `Functions` class. For example:

```java
// Show the first 10 rows in which category_id is greater than 5.
// Use `Functions.lit(5)` to create a Column object for the literal 5.
DataFrame df = session.table("sample_product_data");
df.filter(Functions.col("category_id").gt(Functions.lit(5))).show();
```

If the literal is a floating point or double value in Java (e.g. `0.05` is treated as a Double by default), the Snowpark library
generates SQL that implicitly casts the value to the corresponding Snowpark data type (e.g. `0.05::DOUBLE`). This can produce
an approximate value that differs from the exact number specified.

For example, the following code displays no matching rows, even though the filter (that matches values greater than or equal to
`0.05`) should match the rows in the DataFrame:

```java
// Create a DataFrame that contains the value 0.05.
DataFrame df = session.sql("select 0.05 :: Numeric(5, 2) as a");

// Applying this filter results in no matching rows in the DataFrame.
df.filter(Functions.col("a").leq(Functions.lit(0.06).minus(Functions.lit(0.01)))).show();
```

The problem is that `Functions.lit(0.06)` and `Functions.lit(0.01)` produce approximate values for `0.06` and `0.01`,
not the exact values.

To avoid this problem, cast the literal to the Snowpark type that you want to
use. For example, to use a [NUMBER](../../../sql-reference/data-types-numeric.md) with a precision of 5 and a scale of 2:

```java
import com.snowflake.snowpark_java.types.*;
...

df.filter(Functions.col("a").leq(Functions.lit(0.06).cast(DataTypes.createDecimalType(5, 2)).minus(Functions.lit(0.01).cast(DataTypes.createDecimalType(5, 2))))).show();
```

### Casting a Column Object to a Specific Type

To cast a `Column` object to a specific type, call the [cast](../reference/java/com/snowflake/snowpark_java/Column.md) method, and pass in a type object from the
[com.snowflake.snowpark_java.types package](../reference/java/com/snowflake/snowpark_java/types/package-summary.md). For example, to cast a literal as a [NUMBER](../../../sql-reference/data-types-numeric.md) with a precision
of 5 and a scale of 2:

```java
// Import for the DecimalType class..
import com.snowflake.snowpark_java.types.*;

Column decimalValue = Functions.lit(0.05).cast(DataTypes.createDecimalType(5,2));
```

### Chaining Method Calls

Because each method that transforms a DataFrame object returns a new DataFrame
object that has the transformation applied, you can [chain method calls](https://en.wikipedia.org/wiki/Method_chaining) to
produce a new DataFrame that is transformed in additional ways.

The following example returns a DataFrame that is configured to:

* Query the `sample_product_data` table.
* Return the row with `id = 1`.
* Select the `name` and `serial_number` columns.

```java
DataFrame dfProductInfo = session.table("sample_product_data").filter(Functions.col("id").equal_to(Functions.lit(1))).select(Functions.col("name"), Functions.col("serial_number"));
dfProductInfo.show();
```

In this example:

* `session.table("sample_product_data")` returns a DataFrame for the `sample_product_data` table.

  Although the DataFrame does not yet contain the data from the table, the object does contain the definitions of the columns in
  the table.
* `filter(Functions.col("id").equal_to(Functions.lit(1)))` returns a DataFrame for the `sample_product_data` table that is
  set up to return the row with `id = 1`.

  Note again that the DataFrame does not yet contain the matching row from the table. The matching row is not retrieved until you
  call an action method.
* `select(Functions.col("name"), Functions.col("serial_number"))` returns a DataFrame that contains the `name` and
  `serial_number` columns for the row in the `sample_product_data` table that has `id = 1`.

When you chain method calls, keep in mind that the order of calls is important. Each method call returns a DataFrame that has been
transformed. Make sure that subsequent calls work with the transformed DataFrame.

For example, in the code below, the `select` method returns a DataFrame that just contains two columns: `name` and
`serial_number`. The `filter` method call on this DataFrame fails because it uses the `id` column, which is not in the
transformed DataFrame.

```java
// This fails with the error "invalid identifier 'ID'."
DataFrame dfProductInfo = session.table("sample_product_data").select(Functions.col("name"), Functions.col("serial_number")).filter(Functions.col("id").equal_to(Functions.lit(1)));
dfProductInfo.show();
```

In contrast, the following code executes successfully because the `filter()` method is called on a DataFrame that contains
all of the columns in the `sample_product_data` table (including the `id` column):

```java
// This succeeds because the DataFrame returned by the table() method
// includes the "id" column.
DataFrame dfProductInfo = session.table("sample_product_data").filter(Functions.col("id").equal_to(Functions.lit(1))).select(Functions.col("name"), Functions.col("serial_number"));
dfProductInfo.show();
```

Keep in mind that you might need to make the `select` and `filter` method calls in a different order than you would
use the equivalent keywords (SELECT and WHERE) in a SQL statement.

### Limiting the Number of Rows in a DataFrame

To limit the number of rows in a DataFrame, you can use the [limit](../reference/java/com/snowflake/snowpark_java/DataFrame.md) transformation method.

The Snowpark API also provides action methods for retrieving and printing out a limited number of rows:

* the [first](../reference/java/com/snowflake/snowpark_java/DataFrame.md) action method (to execute the query and return the first `n` rows)
* the [show](../reference/java/com/snowflake/snowpark_java/DataFrame.md) action method (to execute the query and print the first `n` rows)

These methods effectively add a [LIMIT](../../../sql-reference/constructs/limit.md) clause to the SQL statement that is executed.

As explained in the [usage notes for LIMIT](../../../sql-reference/constructs/limit.md), the results are non-deterministic unless you
specify a sort order (ORDER BY) in conjunction with LIMIT.

To keep the ORDER BY clause with the LIMIT clause (e.g. so that ORDER BY is not in a separate subquery), you must call the method
that limits results on the DataFrame returned by the `sort` method.

For example, if you are chaining method calls:

```java
DataFrame df = session.table("sample_product_data");

// Limit the number of rows to 5, sorted by parent_id.
DataFrame dfSubset = df.sort(Functions.col("parent_id")).limit(5);

// Return the first 5 rows, sorted by parent_id.
Row[] arrayOfRows = df.sort(Functions.col("parent_id")).first(5);

// Print the first 5 rows, sorted by parent_id.
df.sort(Functions.col("parent_id")).show(5);
```

### Retrieving Column Definitions

To retrieve the definition of the columns in the dataset for the DataFrame, call the `schema` method. This method returns
a `StructType` object that contains an `Array` of `StructField` objects. Each `StructField` object
contains the definition of a column.

```java
import com.snowflake.snowpark_java.types.*;
...

// Get the StructType object that describes the columns in the
// underlying rowset.
StructType tableSchema = session.table("sample_product_data").schema();
System.out.println("Schema for sample_product_data: " + tableSchema);
```

In the returned `StructType` object, the column names are always normalized. Unquoted identifiers are returned in uppercase,
and quoted identifiers are returned in the exact case in which they were defined.

The following example creates a DataFrame containing the columns named `ID` and `3rd`. For the column name `3rd`, the
Snowpark library automatically encloses the name in double quotes (`"3rd"`) because
the name does not comply with the requirements for an identifier.

The example calls the `schema` method and then calls the `names` method on the returned `StructType` object to
get an array of column names. The names are normalized in the `StructType` returned by the `schema` method.

```java
import java.util.Arrays;
...

// Create a DataFrame containing the "id" and "3rd" columns.
DataFrame dfSelectedColumns = session.table("sample_product_data").select(Functions.col("id"), Functions.col("3rd"));
// Print out the names of the columns in the schema.
System.out.println(Arrays.toString(dfSelectedColumns.schema().names()));
```

### Joining DataFrames

To join DataFrame objects, call the [join](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method.

The following sections explain how to use DataFrames to perform a join:

* Setting up the Sample Data for the Joins
* Specifying the Columns for the Join
* Performing a Natural Join
* Specifying the Type of Join
* Joining Multiple Tables
* Performing a Self-Join

#### Setting up the Sample Data for the Joins

The examples in the next sections use sample data that you can set up by executing the following SQL statements:

```sqlexample
CREATE OR REPLACE TABLE sample_a (
  id_a INTEGER,
  name_a VARCHAR,
  value INTEGER
);
INSERT INTO sample_a (id_a, name_a, value) VALUES
  (10, 'A1', 5),
  (40, 'A2', 10),
  (80, 'A3', 15),
  (90, 'A4', 20)
;
CREATE OR REPLACE TABLE sample_b (
  id_b INTEGER,
  name_b VARCHAR,
  id_a INTEGER,
  value INTEGER
);
INSERT INTO sample_b (id_b, name_b, id_a, value) VALUES
  (4000, 'B1', 40, 10),
  (4001, 'B2', 10, 5),
  (9000, 'B3', 80, 15),
  (9099, 'B4', NULL, 200)
;
CREATE OR REPLACE TABLE sample_c (
  id_c INTEGER,
  name_c VARCHAR,
  id_a INTEGER,
  id_b INTEGER
);
INSERT INTO sample_c (id_c, name_c, id_a, id_b) VALUES
  (1012, 'C1', 10, NULL),
  (1040, 'C2', 40, 4000),
  (1041, 'C3', 40, 4001)
;
```

#### Specifying the Columns for the Join

With the `DataFrame.join` method, you can specify the columns to use in one of the following ways:

* Specify a Column expression that describes the join condition.
* Specify one or more columns that should be used as the common columns in the join.

The following example performs an inner join on the column named `id_a`:

```java
// Create a DataFrame that joins the DataFrames for the tables
// "sample_a" and "sample_b" on the column named "id_a".
DataFrame dfLhs = session.table("sample_a");
DataFrame dfRhs = session.table("sample_b");
DataFrame dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a")));
dfJoined.show();
```

Note that the example uses the `DataFrame.col` method to specify the condition to use for the join. See
Specifying Columns and Expressions for more about this method.

This prints the following output:

```none
----------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |
----------------------------------------------------------------------
|10      |A1        |5        |4001    |B2        |10      |5        |
|40      |A2        |10       |4000    |B1        |40      |10       |
|80      |A3        |15       |9000    |B3        |80      |15       |
----------------------------------------------------------------------
```

##### Identical Column Names Duplicated in the Join Result

In the DataFrame resulting from a join, the Snowpark library uses the column names found in the tables that were joined even when the
column names are identical across tables. When this happens, these column names are duplicated in the DataFrame resulting from the join.
To access a duplicated column by name, call the `col` method on the DataFrame representing the column’s original table. (For more
information about specifying columns, see Referring to Columns in Different DataFrames.)

Code in the following example joins two DataFrames, then calls the `select` method on the joined DataFrame. It specifies the columns
to select by calling the `col` method from the variable representing the respective DataFrame objects: `dfRhs` and
`dfLhs`. It uses the `as` method to give the columns new names in the DataFrame that the `select` method creates.

```java
DataFrame dfLhs = session.table("sample_a");
DataFrame dfRhs = session.table("sample_b");
DataFrame dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a")));
DataFrame dfSelected = dfJoined.select(dfLhs.col("value").as("LeftValue"), dfRhs.col("value").as("RightValue"));
dfSelected.show();
```

This prints the following output:

```none
------------------------------
|"LEFTVALUE"  |"RIGHTVALUE"  |
------------------------------
|5            |5             |
|10           |10            |
|15           |15            |
------------------------------
```

##### Deduplicate Columns Before Saving or Caching

Note that when a DataFrame resulting from a join includes duplicate column names, you must deduplicate or rename columns to remove
duplication in the DataFrame before you save the result to a table or cache the DataFrame. For duplicate column names in a DataFrame that
you save to a table or cache, the Snowpark library will replace duplicate column names with aliases so that they’re no longer duplicated.

The following example illustrates how the output of a cached DataFrame might appear if column names `ID_A` and `VALUE` were
duplicated in a join from two tables, then not deduplicated or renamed prior to caching the result.

```none
--------------------------------------------------------------------------------------------------
|"l_ZSz7_ID_A"  |"NAME_A"  |"l_ZSz7_VALUE"  |"ID_B"  |"NAME_B"  |"r_heec_ID_A"  |"r_heec_VALUE"  |
--------------------------------------------------------------------------------------------------
|10             |A1        |5               |4001    |B2        |10             |5               |
|40             |A2        |10              |4000    |B1        |40             |10              |
|80             |A3        |15              |9000    |B3        |80             |15              |
--------------------------------------------------------------------------------------------------
```

#### Performing a Natural Join

To perform a [natural join](../../../user-guide/querying-joins.md) (where DataFrames are joined on columns that have the same name),
call the [naturalJoin](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method.

The following example joins the DataFrames for the tables `sample_a` and `sample_b` on their common columns (the column
`id_a`):

```java
DataFrame dfLhs = session.table("sample_a");
DataFrame dfRhs = session.table("sample_b");
DataFrame dfJoined = dfLhs.naturalJoin(dfRhs);
dfJoined.show();
```

This prints the following output:

```none
---------------------------------------------------
|"ID_A"  |"VALUE"  |"NAME_A"  |"ID_B"  |"NAME_B"  |
---------------------------------------------------
|10      |5        |A1        |4001    |B2        |
|40      |10       |A2        |4000    |B1        |
|80      |15       |A3        |9000    |B3        |
---------------------------------------------------
```

#### Specifying the Type of Join

By default, the `DataFrame.join` method creates an inner join. To specify a different type of join, set the
`joinType` argument to one of the following values:

| Type of Join | `joinType` |
| --- | --- |
| Inner join | `inner` (default) |
| Cross join | `cross` |
| Full outer join | `full` |
| Left outer join | `left` |
| Left anti join | `leftanti` |
| Left semi join | `leftsemi` |
| Right outer join | `right` |

For example:

```java
// Create a DataFrame that performs a left outer join on
// "sample_a" and "sample_b" on the column named "id_a".
DataFrame dfLhs = session.table("sample_a");
DataFrame dfRhs = session.table("sample_b");
DataFrame dfLeftOuterJoin = dfLhs.join(dfRhs, dfLhs.col("id_a").equal_to(dfRhs.col("id_a")), "left");
dfLeftOuterJoin.show();
```

This prints the following output:

```none
----------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |
----------------------------------------------------------------------
|40      |A2        |10       |4000    |B1        |40      |10       |
|10      |A1        |5        |4001    |B2        |10      |5        |
|80      |A3        |15       |9000    |B3        |80      |15       |
|90      |A4        |20       |NULL    |NULL      |NULL    |NULL     |
----------------------------------------------------------------------
```

#### Joining Multiple Tables

To join multiple tables:

1. Create a DataFrame for each table.
2. Call the `DataFrame.join` method on the first DataFrame, passing in the second DataFrame.
3. Using the DataFrame returned by the `join` method, call the `join` method, passing in the third DataFrame.

You can chain the `join` calls as shown below:

```java
DataFrame dfFirst = session.table("sample_a");
DataFrame dfSecond  = session.table("sample_b");
DataFrame dfThird = session.table("sample_c");
DataFrame dfJoinThreeTables = dfFirst.join(dfSecond, dfFirst.col("id_a").equal_to(dfSecond.col("id_a"))).join(dfThird, dfFirst.col("id_a").equal_to(dfThird.col("id_a")));
dfJoinThreeTables.show();
```

This prints the following output:

```none
------------------------------------------------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |"ID_C"  |"NAME_C"  |"ID_A"  |"ID_B"  |
------------------------------------------------------------------------------------------------------------
|10      |A1        |5        |4001    |B2        |10      |5        |1012    |C1        |10      |NULL    |
|40      |A2        |10       |4000    |B1        |40      |10       |1040    |C2        |40      |4000    |
|40      |A2        |10       |4000    |B1        |40      |10       |1041    |C3        |40      |4001    |
------------------------------------------------------------------------------------------------------------
```

#### Performing a Self-Join

If you need to join a table with itself on different columns, you cannot perform the self-join with a single DataFrame. The
following examples that use a single DataFrame to perform a self-join fail because the column expressions for `"id"` are
present in the left and right sides of the join:

```java
// This fails because columns named "id" and "parent_id"
// are in the left and right DataFrames in the join.
DataFrame df = session.table("sample_product_data");
DataFrame dfJoined = df.join(df, Functions.col("id").equal_to(Functions.col("parent_id")));
```

```java
// This fails because columns named "id" and "parent_id"
// are in the left and right DataFrames in the join.
DataFrame df = session.table("sample_product_data");
DataFrame dfJoined = df.join(df, df.col("id").equal_to(df.col("parent_id")));
```

Both of these examples fail with the following exception:

```none
Exception in thread "main" com.snowflake.snowpark_java.SnowparkClientException:
  Joining a DataFrame to itself can lead to incorrect results due to ambiguity of column references.
  Instead, join this DataFrame to a clone() of itself.
```

Instead, use the [clone](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method to create a clone of the DataFrame object, and use the two DataFrame objects to perform the join:

```java
// Create a DataFrame object for the "sample_product_data" table for the left-hand side of the join.
DataFrame dfLhs = session.table("sample_product_data");
// Clone the DataFrame object to use as the right-hand side of the join.
DataFrame dfRhs = dfLhs.clone();

// Create a DataFrame that joins the two DataFrames
// for the "sample_product_data" table on the
// "id" and "parent_id" columns.
DataFrame dfJoined = dfLhs.join(dfRhs, dfLhs.col("id").equal_to(dfRhs.col("parent_id")));
dfJoined.show();
```

If you want to perform a self-join on the same column, call the `join` method that passes in the name of the column (or an
array of column names) for the `USING` clause:

```java
// Create a DataFrame that performs a self-join on a DataFrame
// using the column named "key".
DataFrame df = session.table("sample_product_data");
DataFrame dfJoined = df.join(df, "key");
```

## Performing an Action to Evaluate a DataFrame

As mentioned earlier, the DataFrame is lazily evaluated, which means the SQL statement isn’t sent to the server for execution
until you perform an action. An action causes the DataFrame to be evaluated and sends the corresponding SQL statement to the
server for execution.

The following sections explain how to perform an action synchronously and asynchronously on a DataFrame:

* Performing an Action Synchronously
* Performing an Action Asynchronously

### Performing an Action Synchronously

To perform an action synchronously, call one of the following action methods:

| Method to Perform an Action Synchronously | Description |
| --- | --- |
| `DataFrame.collect()` | Evaluates the DataFrame and returns the resulting dataset as an `Array` of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects. See Returning All Rows. |
| `DataFrame.toLocalIterator()` | Evaluates the DataFrame and returns an `Iterator` of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects. If the result set is large, use this method to avoid loading all the results into memory at once. See Returning an Iterator for the Rows. |
| `DataFrame.count()` | Evaluates the DataFrame and returns the number of rows. |
| `DataFrame.show()` | Evaluates the DataFrame and prints the rows to the console. Note that this method limits the number of rows to 10 (by default). See Printing the Rows in a DataFrame. |
| `DataFrame.cacheResult()` | Executes the query, creates a temporary table, and puts the results into the table. The method returns a `HasCachedResult` object that you can use to access the data in this temporary table. See Caching a DataFrame. |
| `DataFrame.write().saveAsTable()` | Saves the data in the DataFrame to the specified table. See Saving Data to a Table. |
| `DataFrame.read().fileformat().copyInto('tableName')` | Copies the data in the DataFrame to the specified table. See Copying Data from Files into a Table. |
| `Session.table('tableName').delete()` | Deletes rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').update()`, `Session.table('tableName').updateColumn()` | Updates rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').merge().methods.collect()` | Merges rows into the specified table. See Updating, Deleting, and Merging Rows in a Table. |

For example, to execute the query and return the number of results, call the `count` method:

```java
// Create a DataFrame for the "sample_product_data" table.
DataFrame dfProducts = session.table("sample_product_data");

// Send the query to the server for execution and
// print the count of rows in the table.
System.out.println("Rows returned: " + dfProducts.count());
```

You can also call action methods to:

* Execute a query against a table and return the results.
* Execute a query and print the results to the console.

Note: If you are calling the `schema` method to get the definitions of the columns in the DataFrame, you do not need to
call an action method.

### Performing an Action Asynchronously

> **Note:**
>
> This feature was introduced in Snowpark 0.11.0.

To perform an action asynchronously, call the `async` method to return an “async actor” object (e.g.
`DataFrameAsyncActor`), and call an asynchronous action method in that object.

These action methods of an async actor object return a `TypedAsyncJob` object, which you can use to check
the status of the asynchronous action and retrieve the results of the action.

The next sections explain how to perform actions asynchronously and check the results.

* Understanding the Basic Flow of Asynchronous Actions
* Specifying the Maximum Number of Seconds to Wait
* Accessing an Asynchronous Query by ID

#### Understanding the Basic Flow of Asynchronous Actions

You can use the following methods to perform an action asynchronously:

| Method to Perform an Action Asynchronously | Description |
| --- | --- |
| `DataFrame.async().collect()` | Asynchronously evaluates the DataFrame to retrieve the resulting dataset as an `Array` of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects. See Returning All Rows. |
| `DataFrame.async.toLocalIterator` | Asynchronously evaluates the DataFrame to retrieve an `Iterator` of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects. If the result set is large, use this method to avoid loading all the results into memory at once. See Returning an Iterator for the Rows. |
| `DataFrame.async().count()` | Asynchronously evaluates the DataFrame to retrieve the number of rows. |
| `DataFrame.write().async().saveAsTable()` | Asynchronously saves the data in the DataFrame to the specified table. See Saving Data to a Table. |
| `DataFrame.read().fileformat().async().copyInto('tableName')` | Asynchronously copies the data in the DataFrame to the specified table. See Copying Data from Files into a Table. |
| `Session.table('tableName').async().delete()` | Asynchronously deletes rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').async().update()` and `Session.table('tableName').async().updateColumn()` | Asynchronously updates rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |

From the returned [TypedAsyncJob](../reference/java/com/snowflake/snowpark_java/TypedAsyncJob.md) object, you can do the following:

* To determine if the action has completed, call the `isDone` method.
* To get the query ID that corresponds to the action, call the `getQueryId` method.
* To return the results of the action (e.g. the `Array` of `Row` objects for the `collect` method or the count
  of rows for the `count` method), call the `getResult` method.

  Note that `getResult` is a blocking call.
* To cancel the action, call the `cancel` method.

For example, to execute a query asynchronously and retrieve the results as an `Array` of `Row` objects, call
`async().collect()`:

```java
import java.util.Arrays;

// Create a DataFrame with the "id" and "name" columns from the "sample_product_data" table.
// This does not execute the query.
DataFrame df = session.table("sample_product_data").select(Functions.col("id"), Functions.col("name"));

// Execute the query asynchronously.
// This call does not block.
TypedAsyncJob<Row[]> asyncJob = df.async().collect();
// Check if the query has completed execution.
System.out.println("Is query " + asyncJob.getQueryId() + " done? " + asyncJob.isDone());
// Get an Array of Rows containing the results, and print the results.
// Note that getResult is a blocking call.
Row[] results = asyncJob.getResult();
System.out.println(Arrays.toString(results));
```

To execute the query asynchronously and retrieve the number of results, call `async().count()`:

```java
// Create a DataFrame for the "sample_product_data" table.
DataFrame dfProducts = session.table("sample_product_data");

// Execute the query asynchronously.
// This call does not block.
TypedAsyncJob<Long> asyncJob = dfProducts.async().count();
// Check if the query has completed execution.
System.out.println("Is query " + asyncJob.getQueryId() + " done? " + asyncJob.isDone());
// Print the count of rows in the table.
// Note that getResult is a blocking call.
System.out.println("Rows returned: " + asyncJob.getResult());
```

#### Specifying the Maximum Number of Seconds to Wait

When calling the `getResult` method, you can use the `maxWaitTimeInSeconds` argument to specify the maximum number of
seconds to wait for the query to complete before attempting to retrieve the results. For example:

```java
// Wait a maximum of 10 seconds for the query to complete before retrieving the results.
Row[] results = asyncJob.getResult(10);
```

If you omit this argument, the method waits for the maximum number of seconds specified by the
[snowpark_request_timeout_in_seconds](creating-session.md) configuration property. (This is a
property that you can set when [creating the Session object](creating-session.md).)

#### Accessing an Asynchronous Query by ID

If you have the query ID of an asynchronous query that you submitted earlier, you can call `Session.createAsyncJob` method
to create an [AsyncJob](../reference/java/com/snowflake/snowpark_java/AsyncJob.md) object that you can use to check the status of the query, retrieve the query results, or cancel the
query.

Note that unlike `TypedAsyncJob`, `AsyncJob` does not provide a `getResult` method for retrieving the results.
If you need to retrieve the results, call the `getRows` or `getIterator` method instead.

For example:

```java
import java.util.Arrays;
...

AsyncJob asyncJob = session.createAsyncJob(myQueryId);
// Check if the query has completed execution.
System.out.println("Is query " + asyncJob.getQueryId() + " done? " + asyncJob.isDone());
// If you need to retrieve the results, call getRows to return an Array of Rows containing the results.
// Note that getRows is a blocking call.
Row[] rows = asyncJob.getRows();
System.out.println(Arrays.toString(rows));
```

## Retrieving Rows into a DataFrame

After you specify how the DataFrame should be transformed, you can
call an action method to execute a query and return the results. You can
return all of the rows in an `Array`, or you can return an `Iterator` that allows you to iterate over the results,
row by row. In the latter case, if the amount of data is large, the rows are loaded into memory by chunk to avoid loading a large
amount of data into memory.

* Returning All Rows
* Returning an Iterator for the Rows
* Returning the First n Rows

### Returning All Rows

To return all rows at once, call the [collect](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method. This method returns an Array of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects. To retrieve the values
from the row, call the `getType` method (e.g. `getString`, `getInt`, etc.).

For example:

```java
Row[] rows = session.table("sample_product_data").select(Functions.col("name"), Functions.col("category_id")).sort(Functions.col("name")).collect();
for (Row row : rows) {
  System.out.println("Name: " + row.getString(0) + "; Category ID: " + row.getInt(1));
}
```

### Returning an Iterator for the Rows

If you want to use an `Iterator` to iterate over the [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects in the results, call [toLocalIterator](../reference/java/com/snowflake/snowpark_java/DataFrame.md). If the amount
of data in the results is large, the method loads the rows by chunk to avoid loading all rows into memory at once.

For example:

```java
import java.util.Iterator;

Iterator<Row> rowIterator = session.table("sample_product_data").select(Functions.col("name"), Functions.col("category_id")).sort(Functions.col("name")).toLocalIterator();
while (rowIterator.hasNext()) {
  Row row = rowIterator.next();
  System.out.println("Name: " + row.getString(0) + "; Category ID: " + row.getInt(1));
}
```

### Returning the First `n` Rows

To return the first `n` rows, call the [first](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method, passing in the number of rows to return.

As explained in Limiting the Number of Rows in a DataFrame, the results are non-deterministic. If you want the results to be
deterministic, call this method on a sorted DataFrame (`df.sort().first()`).

For example:

```java
import java.util.Arrays;
...

DataFrame df = session.table("sample_product_data");
Row[] rows = df.sort(Functions.col("name")).first(5);
System.out.println(Arrays.toString(rows));
```

## Printing the Rows in a DataFrame

To print the first 10 rows in the DataFrame to the console, call the [show](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method. To print out a different number of rows, pass
in the number of rows to print.

As explained in Limiting the Number of Rows in a DataFrame, the results are non-deterministic. If you want the results to be
deterministic, call this method on a sorted DataFrame (`df.sort().show()`).

For example:

```java
DataFrame df = session.table("sample_product_data");
df.sort(Functions.col("name")).show();
```

## Updating, Deleting, and Merging Rows in a Table

> **Note:**
>
> This feature was introduced in Snowpark 0.7.0.

When you call `Session.table` to create a `DataFrame` object for a table, the method returns an `Updatable`
object, which extends `DataFrame` with additional methods for updating and deleting data in the table. (See [Updatable](../reference/java/com/snowflake/snowpark_java/Updatable.md).)

If you need to update or delete rows in a table, you can use the following methods of the `Updatable` class:

* Call `update` or `updateColumn` to update existing rows in the table. See
  Updating Rows in a Table.
* Call `delete` to delete rows from a table. See Deleting Rows in a Table.
* Call `merge` to insert, update, and delete rows in one table, based on data in a second table or subquery. (This is the
  equivalent of the [MERGE](../../../sql-reference/sql/merge.md) command in SQL.) See Merging Rows into a Table.

### Updating Rows in a Table

To update the rows in a table, call the `update` or `updateColumn` method, passing in a `Map` that associates
the columns to update and the corresponding values to assign to those columns:

* To specify the column names as strings in the `Map`, call `updateColumn`.
* To specify `Column` objects in the `Map`, call `update`.

Both methods return an `UpdateResult` object, which contains the number of rows that were updated. (See [UpdateResult](../reference/java/com/snowflake/snowpark_java/UpdateResult.md).)

> **Note:**
>
> Both methods are action methods, which means that calling the method
> sends SQL statements to the server for execution.

For example, to replace the values in the column named `count` with the value `1`, and you want to use a `Map` that
associates the column name (a `String`) with the corresponding value, call `updateColumn`:

```java
import java.util.HashMap;
import java.util.Map;
...

Map<String, Column> assignments = new HashMap<>();
assignments.put("3rd", Functions.lit(1));
Updatable updatableDf = session.table("sample_product_data");
UpdateResult updateResult = updatableDf.updateColumn(assignments);
System.out.println("Number of rows updated: " + updateResult.getRowsUpdated());
```

If you want to use a `Column` object in the `Map` to identify the column to update, call `update`:

```java
import java.util.HashMap;
import java.util.Map;
...

Map<Column, Column> assignments = new HashMap<>();
assignments.put(Functions.col("3rd"), Functions.lit(1));
Updatable updatableDf = session.table("sample_product_data");
UpdateResult updateResult = updatableDf.update(assignments);
System.out.println("Number of rows updated: " + updateResult.getRowsUpdated());
```

If the update should be made only when a condition is met, you can specify that condition as an argument. For example, to replace
the values in the column named `count` with `2` for rows in which the `category_id` column has the value `20`:

```java
import java.util.HashMap;
import java.util.Map;
...
Map<Column, Column> assignments = new HashMap<>();
assignments.put(Functions.col("3rd"), Functions.lit(2));
Updatable updatableDf = session.table("sample_product_data");
UpdateResult updateResult = updatableDf.update(assignments, Functions.col("category_id").equal_to(Functions.lit(20)));
System.out.println("Number of rows updated: " + updateResult.getRowsUpdated());
```

If you need to base the condition on a join with a different `DataFrame` object, you can pass that `DataFrame` in as
an argument and use that `DataFrame` in the condition. For example, to replace the values in the column named `count` with
`3` for rows in which the `category_id` column matches the `category_id` in the `DataFrame` `dfParts`:

```java
import java.util.HashMap;
import java.util.Map;
...
Map<Column, Column> assignments = new HashMap<>();
assignments.put(Functions.col("3rd"), Functions.lit(3));
Updatable updatableDf = session.table("sample_product_data");
DataFrame dfParts = session.table("parts");
UpdateResult updateResult = updatableDf.update(assignments, updatableDf.col("category_id").equal_to(dfParts.col("category_id")), dfParts);
System.out.println("Number of rows updated: " + updateResult.getRowsUpdated());
```

### Deleting Rows in a Table

For the `delete` method, you can specify a condition that identifies the rows to delete, and you can base that condition on
a join with another DataFrame. `delete` returns a `DeleteResult` object, which contains the
number of rows that were deleted. (See [DeleteResult](../reference/java/com/snowflake/snowpark_java/DeleteResult.md).)

> **Note:**
>
> `delete` is an action method, which means that calling the method sends
> SQL statements to the server for execution.

For example, to delete the rows that have the value `1` in the `category_id` column:

```java
Updatable updatableDf = session.table("sample_product_data");
DeleteResult deleteResult = updatableDf.delete(updatableDf.col("category_id").equal_to(Functions.lit(1)));
System.out.println("Number of rows deleted: " + deleteResult.getRowsDeleted());
```

If the condition refers to columns in a different DataFrame, pass that DataFrame in as the second argument. For example, to delete
the rows in which the `category_id` column matches the `category_id` in the `DataFrame` `dfParts`, pass in `dfParts`
as the second argument:

```java
Updatable updatableDf = session.table("sample_product_data");
DeleteResult deleteResult = updatableDf.delete(updatableDf.col("category_id").equal_to(dfParts.col("category_id")), dfParts);
System.out.println("Number of rows deleted: " + deleteResult.getRowsDeleted());
```

### Merging Rows into a Table

To insert, update, and deletes rows in one table based on values in a second table or a subquery (the equivalent of the
[MERGE](../../../sql-reference/sql/merge.md) command in SQL), do the following:

1. In the `Updatable` object for the table where you want the data merged in, call the `merge` method, passing in
   the `DataFrame` object for the other table and the column expression for the join condition.

   This returns a `MergeBuilder` object that you can use to specify the actions to take (e.g. insert, update, or delete) on
   the rows that match and the rows that don’t match. (See [MergeBuilder](../reference/java/com/snowflake/snowpark_java/MergeBuilder.md).)
2. Using the `MergeBuilder` object:

   * To specify the update or deletion that should be performed on matching rows, call the `whenMatched` method.

     If you need to specify an additional condition whe rows should be updated or deleted, you can pass in a column expression for
     that condition.

     This method returns a `MatchedClauseBuilder` object that you can use to specify the action to perform. (See
     [MatchedClauseBuilder](../reference/java/com/snowflake/snowpark_java/MatchedClauseBuilder.md).)

     Call the `update` or `delete` method in the `MatchedClauseBuilder` object to specify the update or delete
     action that should be performed on matching rows. These methods return a `MergeBuilder` object that you can use to
     specify additional clauses.
   * To specify the insert that should be performed when rows do not match, call the `whenNotMatched` method.

     If you need to specify an additional condition when rows should be inserted, you can pass in a column expression for that
     condition.

     This method returns a `NotMatchedClauseBuilder` object that you can use to specify the action to perform. (See
     [NotMatchedClauseBuilder](../reference/java/com/snowflake/snowpark_java/NotMatchedClauseBuilder.md).)

     Call the `insert` method in the `NotMatchedClauseBuilder` object to specify the insert action that should be
     performed when rows do not match. These methods return a `MergeBuilder` object that you can use to specify additional
     clauses.
3. When you are done specifying the inserts, updates, and deletions that should be performed, call the `collect` method of
   the `MergeBuilder` object to perform the specified inserts, updates, and deletions on the table.

   `collect` returns a `MergeResult` object, which contains the number of rows that were inserted, updated, and
   deleted. (See [MergeResult](../reference/java/com/snowflake/snowpark_java/MergeResult.md).)

The following example inserts a row with the `id` and `value` columns from the `source` table into the `target` table if
the `target` table does not contain a row with a matching ID:

```java
MergeResult mergeResult = target.merge(source, target.col("id").equal_to(source.col("id")))
                    .whenNotMatched().insert([source.col("id"), source.col("value")])
                    .collect();
```

The following example updates a row in the `target` table with the value of the `value` column from the row in the `source`
table that has the same ID:

```java
import java.util.HashMap;
import java.util.Map;
...
Map<String, Column> assignments = new HashMap<>();
assignments.put("value", source.col("value"));
MergeResult mergeResult = target.merge(source, target.col("id").equal_to(source.col("id")))
                    .whenMatched().update(assignments)
                    .collect();
```

## Saving Data to a Table

You can save the contents of a DataFrame to a new or existing table. In order to do this, you must have the following privileges:

* CREATE TABLE privileges on the schema, if the table does not exist.
* INSERT privileges on the table.

To save the contents of a DataFrame to a table:

1. Call the [write](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method of the DataFrame to get a [DataFrameWriter](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) object.
2. Call the [mode](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method of the `DataFrameWriter` object, passing in a [SaveMode](../reference/java/com/snowflake/snowpark_java/SaveMode.md) object that specifies your preferences
   for writing to the table:

   * To insert rows, pass in `SaveMode.Append`.
   * To overwrite the existing table, pass in `SaveMode.Overwrite`.

   This method returns the same `DataFrameWriter` object configured with the specified mode.
3. If you are inserting rows into an existing table (`SaveMode.Append`) and the column names in the DataFrame match the
   column names in the table, call the [DataFrameWriter.option](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md), passing in `"columnOrder"` and `"name"` as
   arguments.

   > **Note:**
   >
   > This method was introduced in Snowpark 1.4.0.

   By default, the `columnOrder` option is set to `"index"`, which means that the `DataFrameWriter` inserts the
   values in the order that the columns appear. For example, the `DataFrameWriter` inserts the value from the first column
   from the DataFrame in the first column in the table, the second column from the DataFrame in the second column in the table,
   etc.

   This method returns the same `DataFrameWriter` object configured with the specified option.
4. Call the [saveAsTable](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method of the `DataFrameWriter` object to save the contents of the DataFrame to a specified
   table.

   You do not need to call a separate method (e.g. `collect`) to execute the SQL statement that saves the data to the table.
   `saveAsTable` is an action method that executes the SQL statement.

The following example overwrites an existing table (identified by the `tableName` variable) with the contents of the DataFrame
`df`:

```java
df.write().mode(SaveMode.Overwrite).saveAsTable(tableName);
```

The following example inserts rows from the DataFrame `df` into an existing table (identified by the `tableName` variable).
In this example, the table and the DataFrame both contain the columns `c1` and `c2`.

The example demonstrates the difference between setting the `columnOrder` option to `"name"` (which inserts values
into the table columns with the same names as the DataFrame columns) and using the default `columnOrder` option (which
inserts values into the table columns based on the order of the columns in the DataFrame).

```java
DataFrame df = session.sql("SELECT 1 AS c2, 2 as c1");
// With the columnOrder option set to "name", the DataFrameWriter uses the column names
// and inserts a row with the values (2, 1).
df.write().mode(SaveMode.Append).option("columnOrder", "name").saveAsTable(tableName);
// With the default value of the columnOrder option ("index"), the DataFrameWriter uses the column positions
// and inserts a row with the values (1, 2).
df.write().mode(SaveMode.Append).saveAsTable(tableName);
```

## Creating a View From a DataFrame

To create a view from a DataFrame, call the [createOrReplaceView](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method:

```java
df.createOrReplaceView("db.schema.viewName");
```

Note that calling `createOrReplaceView` immediately creates the new view. More importantly, it does not
cause the DataFrame to be evaluated. (The DataFrame itself is not evaluated until you
perform an action.)

Views that you create by calling `createOrReplaceView` are persistent. If you no longer need that view, you can
[drop the view manually](../../../sql-reference/sql/drop-view.md).

If you need to create a temporary view just for the session, call the [createOrReplaceTempView](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method instead:

```java
df.createOrReplaceTempView("db.schema.viewName");
```

## Caching a DataFrame

In some cases, you may need to perform a complex query and keep the results for use in subsequent operations (rather than
executing the same query again). In these cases, you can cache the contents of a DataFrame by calling the [cacheResult](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method.

This method:

* Runs the query.

  You do not need to call a separate action method to retrieve the results
  before calling `cacheResult`. `cacheResult` is an action method that executes the query.
* Saves the results in a temporary table

  Because `cacheResult` creates a temporary table, you must have the CREATE TABLE privilege on the schema that is in use.
* Returns a [HasCachedResult](../reference/java/com/snowflake/snowpark_java/HasCachedResult.md) object, which provides access to the results in the temporary table.

  Because `HasCachedResult` extends `DataFrame`, you can perform some of the same operations on this cached data as
  you can perform on a DataFrame.

> **Note:**
>
> Because `cacheResult` executes the query and saves the results to a table, the method can result in increased compute and
> storage costs.

For example:

```java
// Set up a DataFrame to query a table.
DataFrame df = session.table("sample_product_data").filter(Functions.col("category_id").gt(Functions.lit(10)));
// Retrieve the results and cache the data.
HasCachedResult cachedDf = df.cacheResult();
// Create a DataFrame containing a subset of the cached data.
DataFrame dfSubset = cachedDf.filter(Functions.col("category_id").equal_to(Functions.lit(20))).select(Functions.col("name"), Functions.col("category_id"));
dfSubset.show();
```

Note that the original DataFrame is not affected when you call this method. For example, suppose that `dfTable` is a DataFrame
for the table `sample_product_data`:

```scala
HasCachedResult dfTempTable = dfTable.cacheResult();
```

After you call `cacheResult`, `dfTable` still points to the `sample_product_data` table, and you can continue to use
`dfTable` to query and update that table.

To use the cached data in the temporary table, you use `dfTempTable` (the `HasCachedResult` object returned by
`cacheResult`).

## Working With Files in a Stage

The Snowpark library provides classes and methods that you can use to [load data into Snowflake](../../../guides-overview-loading-data.md) and
[unload data from Snowflake](../../../user-guide/data-unload-overview.md) by using files in stages.

> **Note:**
>
> In order to use these classes and methods on a stage, you must have the required
> [privileges for working with the stage](../../../user-guide/security-access-control-privileges.md).

The next sections explain how to use these classes and methods:

* Uploading and Downloading Files in a Stage
* Using Input Streams to Upload and Download Data in a Stage
* Setting Up a DataFrame for Files in a Stage
* Loading Data from Files into a DataFrame
* Copying Data from Files into a Table
* Saving a DataFrame to Files on a Stage

### Uploading and Downloading Files in a Stage

To upload and download files in a stage, use the `put` and `get` methods of the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object:

* Uploading Files to a Stage
* Downloading Files from a Stage

#### Uploading Files to a Stage

To upload files to a stage:

1. Verify that you have the [privileges to upload files to the stage](../../../user-guide/security-access-control-privileges.md).
2. Use the [file](../reference/java/com/snowflake/snowpark_java/Session.md) method of the `Session` object to access the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object for the session.
3. Call the [put](../reference/java/com/snowflake/snowpark_java/FileOperation.md) method of the `FileOperation` object to upload the files to a stage.

   This method executes a SQL [PUT](../../../sql-reference/sql/put.md) command.

   * To specify any [optional parameters](../../../sql-reference/sql/put.md) for the PUT command, create a `Map` of the
     parameters and values, and pass in the `Map` as the `options` argument. For example:

     ```java
     import java.util.HashMap;
     import java.util.Map;
     ...
     // Upload a file to a stage without compressing the file.
     Map<String, String> putOptions = new HashMap<>();
     putOptions.put("AUTO_COMPRESS", "FALSE");
     PutResult[] putResults = session.file().put("file:///tmp/myfile.csv", "@myStage", putOptions);
     ```
   * In the `localFileName` argument, you can use wildcards (`*` and `?`) to identify a set of files to upload. For
     example:

     ```java
     // Upload the CSV files in /tmp with names that start with "file".
     // You can use the wildcard characters "*" and "?" to match multiple files.
     PutResult[] putResults = session.file().put("file:///tmp/file*.csv", "@myStage/prefix2")
     ```
4. Check the `Array` of [PutResult](../reference/java/com/snowflake/snowpark_java/PutResult.md) objects returned by the `put` method to determine if the files were successfully
   uploaded. For example, to print the filename and the status of the PUT operation for that file:

   ```java
   // Print the filename and the status of the PUT operation.
   for (PutResult result : putResults) {
     System.out.println(result.getSourceFileName() + ": " + result.getStatus());
   }
   ```

#### Downloading Files from a Stage

To download files from a stage:

1. Verify that you have the [privileges to download files from the stage](../../../user-guide/security-access-control-privileges.md).
2. Use the [file](../reference/java/com/snowflake/snowpark_java/Session.md) method of the `Session` object to access the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object for the session.
3. Call the [get](../reference/java/com/snowflake/snowpark_java/FileOperation.md) method of the `FileOperation` object to download the files from a stage.

   This method executes a SQL [GET](../../../sql-reference/sql/get.md) command.

   To specify any [optional parameters](../../../sql-reference/sql/get.md) for the GET command, create a `Map` of the
   parameters and values, and pass in the `Map` as the `options` argument. For example:

   ```java
   import java.util.HashMap;
   import java.util.Map;
   ...
   // Upload a file to a stage without compressing the file.
   // Download files with names that match a regular expression pattern.
   Map<String, String> getOptions = new HashMap<>();
   getOptions.put("PATTERN", "'.*file_.*.csv.gz'");
   GetResult[] getResults = session.file().get("@myStage", "file:///tmp", getOptions);
   ```
4. Check the `Array` of [GetResult](../reference/java/com/snowflake/snowpark_java/GetResult.md) objects returned by the `get` method to determine if the files were successfully
   downloaded. For example, to print the filename and the status of the GET operation for that file:

   ```java
   // Print the filename and the status of the GET operation.
   for (GetResult result : getResults) {
     System.out.println(result.getFileName() + ": " + result.getStatus());
   }
   ```

### Using Input Streams to Upload and Download Data in a Stage

> **Note:**
>
> This feature was introduced in Snowpark 1.4.0.

To use input streams to upload data to a file on a stage and download data from a file on a stage, use the `uploadStream`
and `downloadStream` methods of the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object:

* Using an Input Stream to Upload Data to a File on a Stage
* Using an Input Stream to Download Data from a File on a Stage

#### Using an Input Stream to Upload Data to a File on a Stage

To upload the data from a [java.io.InputStream](https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/io/InputStream.html) object to a file on a stage:

1. Verify that you have the [privileges to upload files to the stage](../../../user-guide/security-access-control-privileges.md).
2. Use the [file](../reference/java/com/snowflake/snowpark_java/Session.md) method of the `Session` object to access the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object for the session.
3. Call the [uploadStream](../reference/java/com/snowflake/snowpark_java/FileOperation.md) method of the `FileOperation` object.

   Pass in the complete path to the file on the stage where the data should be written and the `InputStream` object. In
   addition, use the `compress` argument to specify whether or not the data should be compressed before it is uploaded.

For example:

```java
import java.io.InputStream;
...
boolean compressData = true;
String pathToFileOnStage = "@myStage/path/file";
session.file().uploadStream(pathToFileOnStage, new ByteArrayInputStream(fileContent.getBytes()), compressData);
```

#### Using an Input Stream to Download Data from a File on a Stage

To download data from a file on a stage to a [java.io.InputStream](https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/io/InputStream.html) object:

1. Verify that you have the [privileges to download files from the stage](../../../user-guide/security-access-control-privileges.md).
2. Use the [file](../reference/java/com/snowflake/snowpark_java/Session.md) method of the `Session` object to access the [FileOperation](../reference/java/com/snowflake/snowpark_java/FileOperation.md) object for the session.
3. Call the [downloadStream](../reference/java/com/snowflake/snowpark_java/FileOperation.md) method of the `FileOperation` object.

   Pass in the complete path to the file on the stage containing the data to download. Use the `decompress` argument to
   specify whether or not the data in the file is compressed.

For example:

```java
import java.io.InputStream;
...
boolean isDataCompressed = true;
String pathToFileOnStage = "@myStage/path/file";
InputStream is = session.file().downloadStream(pathToFileOnStage, isDataCompressed);
```

### Setting Up a DataFrame for Files in a Stage

This section explains how to set up a DataFrame for files in a Snowflake stage. Once you create this DataFrame, you can use the
DataFrame to:

* retrieve data from the files
* copy data from the files into a table

To set up a DataFrame for files in a Snowflake stage, use the `DataFrameReader` class:

1. Verify that you have the following privileges:

   * [Privileges to access files in the stage](../../../user-guide/security-access-control-privileges.md).
   * One of the following:

     + CREATE TABLE privileges on the schema, if you plan to specify
       copy options that determine how data is copied from the staged files.
     + CREATE FILE FORMAT privileges on the schema, otherwise.
2. Call the `read` method in the `Session` class to access a `DataFrameReader` object.
3. If the files are in CSV format, describe the fields in the file. To do this:

   1. Create a [StructType](../reference/java/com/snowflake/snowpark_java/types/StructType.md) object that consists of an array of [StructField](../reference/java/com/snowflake/snowpark_java/types/StructField.md) objects that describe the fields in the file.
   2. For each `StructField` object, specify the following:

      * The name of the field.
      * The data type of the field (specified as an object in the `com.snowflake.snowpark_java.types` package).
      * Whether or not the field is nullable.

      For example:

      ```java
      import com.snowflake.snowpark_java.types.*;
      ...

      StructType schemaForDataFile = StructType.create(
        new StructField("id", DataTypes.StringType, true),
        new StructField("name", DataTypes.StringType, true));
      ```
   3. Call the `schema` method in the `DataFrameReader` object, passing in the `StructType` object.

      For example:

      ```java
      DataFrameReader dfReader = session.read().schema(schemaForDataFile);
      ```

      The `schema` method returns a `DataFrameReader` object that is configured to read files containing the specified
      fields.

      Note that you do not need to do this for files in other formats (such as JSON). For those files, the
      `DataFrameReader` treats the data as a single field of the VARIANT type with the field name `$1`.
4. If you need to specify additional information about how the data should be read (for example, that the data is compressed or
   that a CSV file uses a semicolon instead of a comma to delimit fields), call the [DataFrameReader.option](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) method or the
   [DataFrameReader.options](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) method.

   Pass in the name and value of the option that you want to set. You can set the following types of options:

   * The [file format options](../../../sql-reference/sql/create-file-format.md) described in the
     [documentation on CREATE FILE FORMAT](../../../sql-reference/sql/create-file-format.md).
   * The [copy options](../../../sql-reference/sql/copy-into-table.md) described in the
     [COPY INTO TABLE documentation](../../../sql-reference/sql/copy-into-table.md).

     Note that setting copy options can result in a more expensive execution strategy when you
     retrieve the data into the DataFrame.

   The following example sets up the `DataFrameReader` object to query data in a CSV file that is not compressed and that
   uses a semicolon for the field delimiter.

   ```java
   dfReader = dfReader.option("field_delimiter", ";").option("COMPRESSION", "NONE");
   ```

   The `option` method returns a `DataFrameReader` object that is configured with the specified options.

   To set multiple options, you can either
   chain calls to the `option` method (as shown in the example
   above) or call the [DataFrameReader.options](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) method, passing in a `Map` of the names and values of the options.
5. Call the method corresponding to the format of the files. You can call one of the following methods:

   * [DataFrameReader.avro](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)
   * [DataFrameReader.csv](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)
   * [DataFrameReader.json](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)
   * [DataFrameReader.orc](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)
   * [DataFrameReader.parquet](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)
   * [DataFrameReader.xml](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md)

   When calling these methods, pass in the stage location of the files to be read. For example:

   ```java
   DataFrame df = dfReader.csv("@mystage/myfile.csv");
   ```

   To specify multiple files that start with the same prefix, specify the prefix after the stage name. For example, to load files
   that have the prefix `csv_` from the stage `@mystage`:

   ```java
   DataFrame df = dfReader.csv("@mystage/csv_");
   ```

   The methods corresponding to the format of a file return a [CopyableDataFrame](../reference/java/com/snowflake/snowpark_java/CopyableDataFrame.md) object for that file. `CopyableDataFrame`
   extends `DataFrame` and provides additional methods for working the data in staged files.
6. Call an action method to:

   * retrieve data from the files, or
   * copy data from the files into a table

   As is the case with DataFrames for tables, the data is not retrieved into the DataFrame until you call
   an action method.

### Loading Data from Files into a DataFrame

After you set up a DataFrame for files in a stage, you can load data from the
files into the DataFrame:

1. Use the DataFrame object methods to perform any transformations needed on the
   dataset (for example, selecting specific fields, filtering rows, etc.).

   For example, to extract the `color` element from a JSON file named `data.json` in the stage named `mystage`:

   ```java
   DataFrame df = session.read().json("@mystage/data.json").select(Functions.col("$1").subField("color"));
   ```

   As explained earlier, for files in formats other than CSV (e.g. JSON), the `DataFrameReader` treats the data in the file
   as a single VARIANT column with the name `$1`.
2. Call the `DataFrame.collect` method to load the data. For example:

   ```java
   Row[] results = df.collect();
   ```

### Copying Data from Files into a Table

After you set up a DataFrame for files in a stage, you can call the
[copyInto](../reference/java/com/snowflake/snowpark_java/CopyableDataFrame.md) method to copy the data into a table. This method executes the [COPY INTO <table>](../../../sql-reference/sql/copy-into-table.md) command.

> **Note:**
>
> You do not need to call the `collect` method before calling `copyInto`. The data from the files does not need to
> be in the DataFrame before you call `copyInto`.

For example, the following code loads data from the CSV file specified by `myFileStage` into the table `mytable`. Because the
data is in a CSV file, the code must also describe the fields in the file. The example does this by calling the [schema](../reference/java/com/snowflake/snowpark_java/DataFrameReader.md) method of the
`DataFrameReader` object and passing in a [StructType](../reference/java/com/snowflake/snowpark_java/types/StructType.md) object (`schemaForDataFile`) containing an array of
[StructField](../reference/java/com/snowflake/snowpark_java/types/StructField.md) objects that describe the fields.

```java
CopyableDataFrame copyableDf = session.read().schema(schemaForDataFile).csv("@mystage/myfile.csv");
copyableDf.copyInto("mytable");
```

### Saving a DataFrame to Files on a Stage

> **Note:**
>
> This feature was introduced in Snowpark 1.5.0.

If you need to save a DataFrame to files on a stage, you can call the [DataFrameWriter](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method corresponding to the format of
the file (e.g. the `csv` method to write to a CSV file), passing in the stage location where the files should be saved.
These `DataFrameWriter` methods execute the [COPY INTO <location>](../../../sql-reference/sql/copy-into-location.md) command.

> **Note:**
>
> You do not need to call the `collect` method before calling these `DataFrameWriter` methods. The data from the file
> does not need to be in the DataFrame before you call these methods.

To save the contents of a DataFrame to files on a stage:

1. Call the [write](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method of the DataFrame object to get a [DataFrameWriter](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) object. For example, to get the
   `DataFrameWriter` object for a DataFrame that represents the table named `sample_product_data`:

   ```java
   DataFrameWriter dfWriter = session.table("sample_product_data").write();
   ```
2. If you want to overwrite the contents of the file (if the file exists), call the [mode](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method of the `DataFrameWriter`
   object, passing in `SaveMode.Overwrite`.

   Otherwise, by default, the `DataFrameWriter` reports an error if the specified file on the stage already exists.

   The `mode` method returns the same `DataFrameWriter` object configured with the specified mode.

   For example, to specify that the `DataFrameWriter` should overwrite the file on the stage:

   ```java
   dfWriter = dfWriter.mode(SaveMode.Overwrite);
   ```
3. If you need to specify additional information about how the data should be saved (for example, that the data should be
   compressed or that you want to use a semicolon to delimit fields in a CSV file), call the [DataFrameWriter.option](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method
   or the [DataFrameWriter.options](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method.

   Pass in the name and value of the option that you want to set. You can set the following types of options:

   * The [file format options](../../../sql-reference/sql/copy-into-location.md) described in the
     [documentation on COPY INTO <location>](../../../sql-reference/sql/copy-into-location.md).
   * The [copy options](../../../sql-reference/sql/copy-into-location.md) described in the
     documentation on COPY INTO <location>.
   * [PARTITION BY or HEADER](../../../sql-reference/sql/copy-into-location.md).

   Note that you cannot use the `option` method to set the following options:

   * The TYPE format type option.
   * The OVERWRITE copy option. To set this option, call the `mode` method instead (as mentioned in the previous step).

   The following example sets up the `DataFrameWriter` object to save data to a CSV file in uncompressed form, using a
   semicolon (rather than a comma) as the field delimiter.

   ```java
   dfWriter = dfWriter.option("field_delimiter", ";").option("COMPRESSION", "NONE");
   ```

   The `option` method returns a `DataFrameWriter` object that is configured with the specified option.

   To set multiple options, you can
   chain calls to the `option` method (as shown in the example
   above) or call the [DataFrameWriter.options](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md) method, passing in a `Map` of the names and values of the options.
4. To return details about each file that was saved, set the `DETAILED_OUTPUT`
   [copy option](../../../sql-reference/sql/copy-into-location.md) to `TRUE`.

   By default, `DETAILED_OUTPUT` is `FALSE`, which means that the method returns a single row of output containing the
   fields `"rows_unloaded"`, `"input_bytes"`, and `"output_bytes"`.

   When you set `DETAILED_OUTPUT` to `TRUE`, the method returns a row of output for each file saved. Each row contains
   the fields `FILE_NAME`, `FILE_SIZE`, and `ROW_COUNT`.
5. Call the method corresponding to the format of the file to save the data to the file. You can call one of the following
   methods:

   * [DataFrameWriter.csv](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md)
   * [DataFrameWriter.json](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md)
   * [DataFrameWriter.parquet](../reference/java/com/snowflake/snowpark_java/DataFrameWriter.md)

   When calling these methods, pass in the stage location of the file where the data should be written (e.g. `@mystage`).

   By default, the method saves the data to filenames with the prefix `data_` (e.g. `@mystage/data_0_0_0.csv`). If you want
   the files to be named with a different prefix, specify the prefix after the stage name. For example:

   ```java
   WriteFileResult writeFileResult = dfWriter.csv("@mystage/saved_data");
   ```

   This example saves the contents of the DataFrame to files that begin with the prefix `saved_data` (e.g.
   `@mystage/saved_data_0_0_0.csv`).
6. Check the [WriteFileResult](../reference/java/com/snowflake/snowpark_java/WriteFileResult.md) object returned for information about the amount of data written to the file.

   From the `WriteFileResult` object, you can access the output produced by the COPY INTO <location> command:

   * To access the rows of output as an array of [Row](../reference/java/com/snowflake/snowpark_java/Row.md) objects, call the `getRows` method.
   * To determine which fields are present in the rows, call the `getSchema` method, which returns a [StructType](../reference/java/com/snowflake/snowpark_java/types/StructType.md) that
     describes the fields in the row.

   For example, to print out the names of the fields and values in the output rows:

   ```java
   WriteFileResult writeFileResult = dfWriter.csv("@mystage/saved_data");
   Row[] rows = writeFileResult.getRows();
   StructType schema = writeFileResult.getSchema();
   for (int i = 0 ; i < rows.length ; i++) {
     System.out.println("Row:" + i);
     Row row = rows[i];
     for (int j = 0; j < schema.size(); j++) {
       System.out.println(schema.get(j).name() + ": " + row.get(j));
     }
   }
   ```

The following example uses a DataFrame to save the contents of the table named `car_sales` to JSON files with the prefix
`saved_data` on the stage `@mystage` (e.g. `@mystage/saved_data_0_0_0.json`). The sample code:

* Overwrites the file, if the file already exists on the stage.
* Returns detailed output about the save operation.
* Saves the data uncompressed.

Finally, the sample code prints out each field and value in the output rows returned:

```java
DataFrame df = session.table("car_sales");
WriteFileResult writeFileResult = df.write().mode(SaveMode.Overwrite).option("DETAILED_OUTPUT", "TRUE").option("compression", "none").json("@mystage/saved_data");
Row[] rows = writeFileResult.getRows();
StructType schema = writeFileResult.getSchema();
for (int i = 0 ; i < rows.length ; i++) {
  System.out.println("Row:" + i);
  Row row = rows[i];
  for (int j = 0; j < schema.size(); j++) {
    System.out.println(schema.get(j).name() + ": " + row.get(j));
  }
}
```

## Working with Semi-Structured Data

Using a DataFrame, you can query and access [semi-structured data](../../../user-guide/semistructured-intro.md) (e.g JSON data). The
next sections explain how to work with semi-structured data in a DataFrame.

* Traversing Semi-Structured Data
* Explicitly Casting Values in Semi-Structured Data
* Flattening an Array of Objects into Rows

> **Note:**
>
> The examples in these sections use the sample data in [Sample Data Used in Examples](../../../user-guide/querying-semistructured.md).

### Traversing Semi-Structured Data

To refer to a specific field or element in semi-structured data, use the following methods of the [Column](../reference/java/com/snowflake/snowpark_java/Column.md) object:

* Use [subField(“<field_name>”)](../reference/java/com/snowflake/snowpark_java/Column.md) to return a `Column` object for a field in an OBJECT (or a VARIANT that contains an
  OBJECT).
* Use [subField(<index>)](../reference/java/com/snowflake/snowpark_java/Column.md) to return a `Column` object for an element in an ARRAY (or a VARIANT that contains an ARRAY).

> **Note:**
>
> If the field name or elements in the path are irregular and make it difficult to use the `Column.apply` methods, you can
> use [Functions.get](../reference/java/com/snowflake/snowpark_java/Functions.md), [Functions.get_ignore_case](../reference/java/com/snowflake/snowpark_java/Functions.md), or [Functions.get_path](../reference/java/com/snowflake/snowpark_java/Functions.md) as an alternative.

For example, the following code selects the `dealership` field in objects in the `src` column of the
[sample data](../../../user-guide/querying-semistructured.md):

```java
DataFrame df = session.table("car_sales");
df.select(Functions.col("src").subField("dealership")).show();
```

The code prints the following output:

```none
----------------------------
|"""SRC""['DEALERSHIP']"   |
----------------------------
|"Valley View Auto Sales"  |
|"Tindel Toyota"           |
----------------------------
```

> **Note:**
>
> The values in the DataFrame are surrounded by double quotes because these values are returned as string literals. To cast these
> values to a specific type, see Explicitly Casting Values in Semi-Structured Data.

You can also chain method calls to traverse a path to a specific
field or element.

For example, the following code selects the `name` field in the `salesperson` object:

```java
DataFrame df = session.table("car_sales");
df.select(Functions.col("src").subField("salesperson").subField("name")).show();
```

The code prints the following output:

```none
------------------------------------
|"""SRC""['SALESPERSON']['NAME']"  |
------------------------------------
|"Frank Beasley"                   |
|"Greg Northrup"                   |
------------------------------------
```

As another example, the following code selects the first element of `vehicle` field, which holds an array of vehicles. The
example also selects the `price` field from the first element.

```java
DataFrame df = session.table("car_sales");
df.select(Functions.col("src").subField("vehicle").subField(0)).show();
df.select(Functions.col("src").subField("vehicle").subField(0).subField("price")).show();
```

The code prints the following output:

```none
---------------------------
|"""SRC""['VEHICLE'][0]"  |
---------------------------
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "paint protection"   |
|  ],                     |
|  "make": "Honda",       |
|  "model": "Civic",      |
|  "price": "20275",      |
|  "year": "2017"         |
|}                        |
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "rust proofing",     |
|    "fabric protection"  |
|  ],                     |
|  "make": "Toyota",      |
|  "model": "Camry",      |
|  "price": "23500",      |
|  "year": "2017"         |
|}                        |
---------------------------

------------------------------------
|"""SRC""['VEHICLE'][0]['PRICE']"  |
------------------------------------
|"20275"                           |
|"23500"                           |
------------------------------------
```

As an alternative to the `apply` method, you can use [Functions.get](../reference/java/com/snowflake/snowpark_java/Functions.md), [Functions.get_ignore_case](../reference/java/com/snowflake/snowpark_java/Functions.md), or
[Functions.get_path](../reference/java/com/snowflake/snowpark_java/Functions.md) functions if the field name or elements in the path are irregular and make it difficult to use the
`Column.subField` methods.

For example, the following lines of code both print the value of a specified field in an object:

```java
df.select(Functions.get(Functions.col("src"), Functions.lit("dealership"))).show();
df.select(Functions.col("src").subField("dealership")).show();
```

Similarly, the following lines of code both print the value of a field at a specified path in an object:

```java
df.select(Functions.get_path(Functions.col("src"), Functions.lit("vehicle[0].make"))).show();
df.select(Functions.col("src").subField("vehicle").subField(0).subField("make")).show();
```

### Explicitly Casting Values in Semi-Structured Data

By default, the values of fields and elements are returned as string literals (including the double quotes), as shown in the
examples above.

To avoid unexpected results, call the cast method to cast the value to a specific
type. For example, the following code prints out the values without and with casting:

```java
// Import the objects for the data types, including StringType.
import com.snowflake.snowpark_java.types.*;
...
DataFrame df = session.table("car_sales");
df.select(Functions.col("src").subField("salesperson").subField("id")).show();
df.select(Functions.col("src").subField("salesperson").subField("id").cast(DataTypes.StringType)).show();
```

The code prints the following output:

```none
----------------------------------
|"""SRC""['SALESPERSON']['ID']"  |
----------------------------------
|"55"                            |
|"274"                           |
----------------------------------

---------------------------------------------------
|"CAST (""SRC""['SALESPERSON']['ID'] AS STRING)"  |
---------------------------------------------------
|55                                               |
|274                                              |
---------------------------------------------------
```

### Flattening an Array of Objects into Rows

If you need to “flatten” semi-structured data into a DataFrame (e.g. producing a row for every object in an array), call the
[flatten](../reference/java/com/snowflake/snowpark_java/DataFrame.md) method. This method is equivalent to the [FLATTEN](../../../sql-reference/functions/flatten.md) SQL function. If you pass in
a path to an object or array, the method returns a DataFrame that contains a row for each field or element in the object or array.

For example, in the [sample data](../../../user-guide/querying-semistructured.md), `src:customer` is an array of objects that
contain information about a customer. Each object contains a `name` and `address` field.

If you pass this path to the `flatten` function:

```java
DataFrame df = session.table("car_sales");
df.flatten(Functions.col("src").subField("customer")).show();
```

the method returns a DataFrame:

```none
----------------------------------------------------------------------------------------------------------------------------------------------------------
|"SRC"                                      |"SEQ"  |"KEY"  |"PATH"  |"INDEX"  |"VALUE"                            |"THIS"                               |
----------------------------------------------------------------------------------------------------------------------------------------------------------
|{                                          |1      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "San Francisco, CA",  |  {                                  |
|    {                                      |       |       |        |         |  "name": "Joyce Ridgely",         |    "address": "San Francisco, CA",  |
|      "address": "San Francisco, CA",      |       |       |        |         |  "phone": "16504378889"           |    "name": "Joyce Ridgely",         |
|      "name": "Joyce Ridgely",             |       |       |        |         |}                                  |    "phone": "16504378889"           |
|      "phone": "16504378889"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Valley View Auto Sales",  |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "55",                            |       |       |        |         |                                   |                                     |
|    "name": "Frank Beasley"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "paint protection"                 |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Honda",                     |       |       |        |         |                                   |                                     |
|      "model": "Civic",                    |       |       |        |         |                                   |                                     |
|      "price": "20275",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
|{                                          |2      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "New York, NY",       |  {                                  |
|    {                                      |       |       |        |         |  "name": "Bradley Greenbloom",    |    "address": "New York, NY",       |
|      "address": "New York, NY",           |       |       |        |         |  "phone": "12127593751"           |    "name": "Bradley Greenbloom",    |
|      "name": "Bradley Greenbloom",        |       |       |        |         |}                                  |    "phone": "12127593751"           |
|      "phone": "12127593751"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Tindel Toyota",           |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "274",                           |       |       |        |         |                                   |                                     |
|    "name": "Greg Northrup"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "rust proofing",                   |       |       |        |         |                                   |                                     |
|        "fabric protection"                |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Toyota",                    |       |       |        |         |                                   |                                     |
|      "model": "Camry",                    |       |       |        |         |                                   |                                     |
|      "price": "23500",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
----------------------------------------------------------------------------------------------------------------------------------------------------------
```

From this DataFrame, you can select the `name` and `address` fields from each object in the `VALUE` field:

```java
df.flatten(Functions.col("src").subField("customer")).select(Functions.col("value").subField("name"), Functions.col("value").subField("address")).show();
```

```none
-------------------------------------------------
|"""VALUE""['NAME']"   |"""VALUE""['ADDRESS']"  |
-------------------------------------------------
|"Joyce Ridgely"       |"San Francisco, CA"     |
|"Bradley Greenbloom"  |"New York, NY"          |
-------------------------------------------------
```

The following code adds to the previous example by
casting the values to a specific type and changing the names of the columns:

```java
df.flatten(Functions.col("src").subField("customer")).select(Functions.col("value").subField("name").cast(DataTypes.StringType).as("Customer Name"), Functions.col("value").subField("address").cast(DataTypes.StringType).as("Customer Address")).show();
```

```none
-------------------------------------------
|"Customer Name"     |"Customer Address"  |
-------------------------------------------
|Joyce Ridgely       |San Francisco, CA   |
|Bradley Greenbloom  |New York, NY        |
-------------------------------------------
```

## Executing SQL Statements

To execute a SQL statement that you specify, call the `sql` method in the `Session` class, and pass in the statement
to be executed. The method returns a DataFrame.

Note that the SQL statement won’t be executed until you call an action method.

```java
import java.util.Arrays;

// Get the list of the files in a stage.
// The collect() method causes this SQL statement to be executed.
DataFrame dfStageFiles  = session.sql("ls @myStage");
Row[] files = dfStageFiles.collect();
System.out.println(Arrays.toString(files));

// Resume the operation of a warehouse.
// Note that you must call the collect method in order to execute
// the SQL statement.
session.sql("alter warehouse if exists myWarehouse resume if suspended").collect();

DataFrame tableDf = session.table("sample_product_data").select(Functions.col("id"), Functions.col("name"));
// Get the count of rows from the table.
long numRows = tableDf.count();
System.out.println("Count: " + numRows);
```

If you want to call methods to transform the DataFrame (e.g. filter, select,
etc.), note that these methods work only if the underlying SQL statement is a SELECT statement. The transformation methods are not
supported for other kinds of SQL statements.

```java
import java.util.Arrays;

DataFrame df = session.sql("select id, category_id, name from sample_product_data where id > 10");
// Because the underlying SQL statement for the DataFrame is a SELECT statement,
// you can call the filter method to transform this DataFrame.
Row[] results = df.filter(Functions.col("category_id").lt(Functions.lit(10))).select(Functions.col("id")).collect();
System.out.println(Arrays.toString(results));

// In this example, the underlying SQL statement is not a SELECT statement.
DataFrame dfStageFiles = session.sql("ls @myStage");
// Calling the filter method results in an error.
dfStageFiles.filter(...);
```

---
title: Working with DataFrames in Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/working-with-dataframes.md
section: Snowpark
---

# Working with DataFrames in Snowpark Python

In Snowpark, the main way in which you query and process data is through a DataFrame. This topic explains how to work with
DataFrames.

To retrieve and manipulate data, you use the `DataFrame` class. A
DataFrame represents a relational dataset that is evaluated lazily: it only executes when a specific action is triggered. In a
sense, a DataFrame is like a query that needs to be evaluated in order to retrieve data.

To retrieve data into a DataFrame:

1. Construct a DataFrame, specifying the source of the data for the dataset.

   For example, you can create a DataFrame to hold data from a table, an external CSV file, from local data, or the execution of a SQL statement.
2. Specify how the dataset in the DataFrame should be transformed.

   For example, you can specify which columns should be selected, how the rows should be filtered, how the results should be
   sorted and grouped, etc.
3. Execute the statement to retrieve the data into the DataFrame.

   In order to retrieve the data into the DataFrame, you must invoke a method that performs an action (for example, the
   `collect()` method).

The next sections explain these steps in more detail.

## Setting up the Examples for this Section

Some of the examples of this section use a DataFrame to query a table named `sample_product_data`. If you want to run these
examples, you can create this table and fill the table with some data by executing the following SQL statements.

You can run the SQL statements using Snowpark Python:

```python
session.sql('CREATE OR REPLACE TABLE sample_product_data (id INT, parent_id INT, category_id INT, name VARCHAR, serial_number VARCHAR, key INT, "3rd" INT)').collect()
```

```output
[Row(status='Table SAMPLE_PRODUCT_DATA successfully created.')]
```

```python
session.sql("""
INSERT INTO sample_product_data VALUES
(1, 0, 5, 'Product 1', 'prod-1', 1, 10),
(2, 1, 5, 'Product 1A', 'prod-1-A', 1, 20),
(3, 1, 5, 'Product 1B', 'prod-1-B', 1, 30),
(4, 0, 10, 'Product 2', 'prod-2', 2, 40),
(5, 4, 10, 'Product 2A', 'prod-2-A', 2, 50),
(6, 4, 10, 'Product 2B', 'prod-2-B', 2, 60),
(7, 0, 20, 'Product 3', 'prod-3', 3, 70),
(8, 7, 20, 'Product 3A', 'prod-3-A', 3, 80),
(9, 7, 20, 'Product 3B', 'prod-3-B', 3, 90),
(10, 0, 50, 'Product 4', 'prod-4', 4, 100),
(11, 10, 50, 'Product 4A', 'prod-4-A', 4, 100),
(12, 10, 50, 'Product 4B', 'prod-4-B', 4, 100)
""").collect()
```

```output
[Row(number of rows inserted=12)]
```

To verify that the table was created, run:

```python
session.sql("SELECT count(*) FROM sample_product_data").collect()
```

```output
[Row(COUNT(*)=12)]
```

### Setting up the Examples in a Python Worksheet

To set up and run these examples in a [Python worksheet](python-worksheets.md),
create the sample table and set up your Python worksheet.

1. Create a SQL worksheet and run the following:

> ```sqlexample
> CREATE OR REPLACE TABLE sample_product_data
>   (id INT, parent_id INT, category_id INT, name VARCHAR, serial_number VARCHAR, key INT, "3rd" INT);
>
> INSERT INTO sample_product_data VALUES
>   (1, 0, 5, 'Product 1', 'prod-1', 1, 10),
>   (2, 1, 5, 'Product 1A', 'prod-1-A', 1, 20),
>   (3, 1, 5, 'Product 1B', 'prod-1-B', 1, 30),
>   (4, 0, 10, 'Product 2', 'prod-2', 2, 40),
>   (5, 4, 10, 'Product 2A', 'prod-2-A', 2, 50),
>   (6, 4, 10, 'Product 2B', 'prod-2-B', 2, 60),
>   (7, 0, 20, 'Product 3', 'prod-3', 3, 70),
>   (8, 7, 20, 'Product 3A', 'prod-3-A', 3, 80),
>   (9, 7, 20, 'Product 3B', 'prod-3-B', 3, 90),
>   (10, 0, 50, 'Product 4', 'prod-4', 4, 100),
>   (11, 10, 50, 'Product 4A', 'prod-4-A', 4, 100),
>   (12, 10, 50, 'Product 4B', 'prod-4-B', 4, 100);
>
> SELECT count(*) FROM sample_product_data;
> ```

2. [Create a Python worksheet](python-worksheets.md), setting the same database and schema context as the
   SQL worksheet that you used to create the `sample_product_data` table.

If you want to use the examples in this topic in a Python worksheet, use the example within the handler function (e.g. `main`),
and use the `Session` object that is passed into the function to create DataFrames.

For example, call the `table` method of the `session` object to create a DataFrame for a table:

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark.functions import col

def main(session: snowpark.Session):
  df_table = session.table("sample_product_data")
```

To review the output produced by the function, such as by calling the `show` method of the DataFrame object, use the Output tab.

To examine the value returned by the function, choose the data type of the return value from Settings » Return type,
and use the Results tab:

* If your function returns a DataFrame, use the default return type of Table.
* If your function returns the `list` of `Row` from the `collect` method of a DataFrame object,
  use Variant for the return type.
* If your function returns any other value that can be cast to a string, or if your function does not return a value, use String
  as the return type.

Refer to [Running Python Worksheets](python-worksheets.md) for more details.

## Constructing a DataFrame

To construct a DataFrame, you can use the methods and properties of the `Session` class. Each of the following
methods constructs a DataFrame from a different type of data source.

You can run these examples in your local development environment
or call them within the `main` function defined in a [Python worksheet](python-worksheets.md).

* To create a DataFrame from data in a table, view, or stream, call the `table` method:

  ```python
  # Create a DataFrame from the data in the "sample_product_data" table.
  df_table = session.table("sample_product_data")

  # To print out the first 10 rows, call df_table.show()
  ```
* To create a DataFrame from specified values, call the `create_dataframe` method:

  ```python
  # Create a DataFrame with one column named a from specified values.
  df1 = session.create_dataframe([1, 2, 3, 4]).to_df("a")
  df1.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df1
  ```

  ```output
  -------
  |"A"  |
  -------
  |1    |
  |2    |
  |3    |
  |4    |
  -------
  ```

  Create a DataFrame with 4 columns, “a”, “b”, “c” and “d”:

  ```python
  # Create a DataFrame with 4 columns, "a", "b", "c" and "d".
  df2 = session.create_dataframe([[1, 2, 3, 4]], schema=["a", "b", "c", "d"])
  df2.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df2
  ```

  ```output
  -------------------------
  |"A"  |"B"  |"C"  |"D"  |
  -------------------------
  |1    |2    |3    |4    |
  -------------------------
  ```

  Create another DataFrame with 4 columns, “a”, “b”, “c” and “d”:

  ```python
  # Create another DataFrame with 4 columns, "a", "b", "c" and "d".
  from snowflake.snowpark import Row
  df3 = session.create_dataframe([Row(a=1, b=2, c=3, d=4)])
  df3.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df3
  ```

  ```output
  -------------------------
  |"A"  |"B"  |"C"  |"D"  |
  -------------------------
  |1    |2    |3    |4    |
  -------------------------
  ```

  Create a DataFrame and specify a schema:

  ```python
  # Create a DataFrame and specify a schema
  from snowflake.snowpark.types import IntegerType, StringType, StructType, StructField
  schema = StructType([StructField("a", IntegerType()), StructField("b", StringType())])
  df4 = session.create_dataframe([[1, "snow"], [3, "flake"]], schema)
  df4.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df4
  ```

  ```output
  ---------------
  |"A"  |"B"    |
  ---------------
  |1    |snow   |
  |3    |flake  |
  ---------------
  ```
* To create a DataFrame containing a range of values, call the `range` method:

  ```python
  # Create a DataFrame from a range
  # The DataFrame contains rows with values 1, 3, 5, 7, and 9 respectively.
  df_range = session.range(1, 10, 2).to_df("a")
  df_range.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df_range
  ```

  ```output
  -------
  |"A"  |
  -------
  |1    |
  |3    |
  |5    |
  |7    |
  |9    |
  -------
  ```
* To create a DataFrame to hold the data from a file in a stage, use the `read` property to get a
  `DataFrameReader` object. In the `DataFrameReader` object, call the method corresponding to the
  format of the data in the file:

  ```python
  from snowflake.snowpark.types import StructType, StructField, StringType, IntegerType

  # Create DataFrames from data in a stage.
  df_json = session.read.json("@my_stage2/data1.json")
  df_catalog = session.read.schema(StructType([StructField("name", StringType()), StructField("age", IntegerType())])).csv("@stage/some_dir")
  ```
* To create a DataFrame to hold the results of a SQL query, call the `sql` method:

  ```python
  # Create a DataFrame from a SQL query
  df_sql = session.sql("SELECT name from sample_product_data")
  df_sql.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  # return df_sql
  ```

  ```output
  --------------
  |"NAME"      |
  --------------
  |Product 1   |
  |Product 1A  |
  |Product 1B  |
  |Product 2   |
  |Product 2A  |
  |Product 2B  |
  |Product 3   |
  |Product 3A  |
  |Product 3B  |
  |Product 4   |
  --------------
  ```

It is possible to use the `sql` method to execute SELECT statements that retrieve data from tables and staged files,
but using the `table` method and `read` property offer better syntax highlighting, error highlighting, and
intelligent code completion in development tools.

## Specifying How the Dataset Should Be Transformed

To specify which columns to select and how to filter, sort, group, etc. results, call the DataFrame methods that transform the dataset.
To identify columns in these methods, use the `col` function or an expression that
evaluates to a column. Refer to Specifying Columns and Expressions.

For example:

* To specify which rows should be returned, call the `filter` method:

  ```python
  # Import the col function from the functions module.
  # Python worksheets import this function by default
  from snowflake.snowpark.functions import col

  # Create a DataFrame for the rows with the ID 1
  # in the "sample_product_data" table.

  # This example uses the == operator of the Column object to perform an
  # equality check.
  df = session.table("sample_product_data").filter(col("id") == 1)
  df.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df
  ```

  ```output
  ------------------------------------------------------------------------------------
  |"ID"  |"PARENT_ID"  |"CATEGORY_ID"  |"NAME"     |"SERIAL_NUMBER"  |"KEY"  |"3rd"  |
  ------------------------------------------------------------------------------------
  |1     |0            |5              |Product 1  |prod-1           |1      |10     |
  ------------------------------------------------------------------------------------
  ```
* To specify the columns that should be selected, call the `select` method:

  ```python
  # Import the col function from the functions module.
  from snowflake.snowpark.functions import col

  # Create a DataFrame that contains the id, name, and serial_number
  # columns in the "sample_product_data" table.
  df = session.table("sample_product_data").select(col("id"), col("name"), col("serial_number"))
  df.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df
  ```

  ```output
  ---------------------------------------
  |"ID"  |"NAME"      |"SERIAL_NUMBER"  |
  ---------------------------------------
  |1     |Product 1   |prod-1           |
  |2     |Product 1A  |prod-1-A         |
  |3     |Product 1B  |prod-1-B         |
  |4     |Product 2   |prod-2           |
  |5     |Product 2A  |prod-2-A         |
  |6     |Product 2B  |prod-2-B         |
  |7     |Product 3   |prod-3           |
  |8     |Product 3A  |prod-3-A         |
  |9     |Product 3B  |prod-3-B         |
  |10    |Product 4   |prod-4           |
  ---------------------------------------
  ```
* You can also reference columns like this:

  ```python
  # Import the col function from the functions module.
  from snowflake.snowpark.functions import col

  df_product_info = session.table("sample_product_data")
  df1 = df_product_info.select(df_product_info["id"], df_product_info["name"], df_product_info["serial_number"])
  df2 = df_product_info.select(df_product_info.id, df_product_info.name, df_product_info.serial_number)
  df3 = df_product_info.select("id", "name", "serial_number")
  ```

Each method returns a new DataFrame object that has been transformed. The method does not affect the original DataFrame object.
If you want to apply multiple transformations, you can chain method calls,
calling each subsequent transformation method on the new DataFrame object returned by the previous method call.

These transformation methods specify how to construct the SQL statement and do not retrieve data from the Snowflake database.
The action methods described in Performing an Action to Evaluate a DataFrame perform the data retrieval.

### Joining DataFrames

To join DataFrame objects, call the `join` method:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
# Create a DataFrame that joins the two DataFrames
# on the column named "key".
df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).select(df_lhs["key"].as_("key"), "value1", "value2").show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).select(df_lhs["key"].as_("key"), "value1", "value2")
```

```output
-------------------------------
|"KEY"  |"VALUE1"  |"VALUE2"  |
-------------------------------
|a      |1         |3         |
|b      |2         |4         |
-------------------------------
```

If both DataFrames have the same column to join on, you can use the following example syntax:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
# If both dataframes have the same column "key", the following is more convenient.
df_lhs.join(df_rhs, ["key"]).show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_lhs.join(df_rhs, ["key"])
```

```output
-------------------------------
|"KEY"  |"VALUE1"  |"VALUE2"  |
-------------------------------
|a      |1         |3         |
|b      |2         |4         |
-------------------------------
```

You can also use the & operator to connect join expressions:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
# Use & operator connect join expression. '|' and ~ are similar.
df_joined_multi_column = df_lhs.join(df_rhs, (df_lhs.col("key") == df_rhs.col("key")) & (df_lhs.col("value1") < df_rhs.col("value2"))).select(df_lhs["key"].as_("key"), "value1", "value2")
df_joined_multi_column.show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_joined_multi_column
```

```output
-------------------------------
|"KEY"  |"VALUE1"  |"VALUE2"  |
-------------------------------
|a      |1         |3         |
|b      |2         |4         |
-------------------------------
```

If you want to perform a self-join, you must copy the DataFrame:

```python
# copy the DataFrame if you want to do a self-join
from copy import copy

# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
df_lhs_copied = copy(df_lhs)
df_self_joined = df_lhs.join(df_lhs_copied, (df_lhs.col("key") == df_lhs_copied.col("key")) & (df_lhs.col("value1") == df_lhs_copied.col("value1")))
```

When there are overlapping columns in the DataFrames, Snowpark prepends a randomly generated prefix to the columns in the join result:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key"))
```

```output
-----------------------------------------------------
|"l_av5t_KEY"  |"VALUE1"  |"r_1p6k_KEY"  |"VALUE2"  |
-----------------------------------------------------
|a             |1         |a             |3         |
|b             |2         |b             |4         |
-----------------------------------------------------
```

You can rename the overlapping columns using `Column.alias`:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).select(df_lhs["key"].alias("key1"), df_rhs["key"].alias("key2"), "value1", "value2").show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).select(df_lhs["key"].alias("key1"), df_rhs["key"].alias("key2"), "value1", "value2")
```

```output
-----------------------------------------
|"KEY1"  |"KEY2"  |"VALUE1"  |"VALUE2"  |
-----------------------------------------
|a       |a       |1         |3         |
|b       |b       |2         |4         |
-----------------------------------------
```

To avoid random prefixes, you can also specify a suffix to append to the overlapping columns:

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value1"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value2"])
df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key"), lsuffix="_left", rsuffix="_right").show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key"), lsuffix="_left", rsuffix="_right")
```

```output
--------------------------------------------------
|"KEY_LEFT"  |"VALUE1"  |"KEY_RIGHT"  |"VALUE2"  |
--------------------------------------------------
|a           |1         |a            |3         |
|b           |2         |b            |4         |
--------------------------------------------------
```

These examples use `DataFrame.col` to specify the columns to use in the join.
Refer to Specifying Columns and Expressions for more ways to specify columns.

If you need to join a table with itself on different columns, you cannot perform the self-join with a single DataFrame. The
following examples use a single DataFrame to perform a self-join, which fails because the column expressions for `"id"` are
present in the left and right sides of the join:

```python
from snowflake.snowpark.exceptions import SnowparkJoinException

df = session.table("sample_product_data")
# This fails because columns named "id" and "parent_id"
# are in the left and right DataFrames in the join.
try:
  df_joined = df.join(df, col("id") == col("parent_id")) # fails
except SnowparkJoinException as e:
  print(e.message)
```

```output
You cannot join a DataFrame with itself because the column references cannot be resolved correctly. Instead, create a copy of the DataFrame with copy.copy(), and join the DataFrame with this copy.
```

```python
# This fails because columns named "id" and "parent_id"
# are in the left and right DataFrames in the join.
try:
  df_joined = df.join(df, df["id"] == df["parent_id"])   # fails
except SnowparkJoinException as e:
  print(e.message)
```

```output
You cannot join a DataFrame with itself because the column references cannot be resolved correctly. Instead, create a copy of the DataFrame with copy.copy(), and join the DataFrame with this copy.
```

Instead, use Python’s builtin `copy()` method to create a clone of the DataFrame object, and use the two DataFrame
objects to perform the join:

```python
from copy import copy

# Create a DataFrame object for the "sample_product_data" table for the left-hand side of the join.
df_lhs = session.table("sample_product_data")
# Clone the DataFrame object to use as the right-hand side of the join.
df_rhs = copy(df_lhs)

# Create a DataFrame that joins the two DataFrames
# for the "sample_product_data" table on the
# "id" and "parent_id" columns.
df_joined = df_lhs.join(df_rhs, df_lhs.col("id") == df_rhs.col("parent_id"))
df_joined.count()
```

### Specifying Columns and Expressions

When calling these transformation methods, you might need to specify columns or expressions that use columns. For example, when
calling the `select` method, you need to specify the columns to select.

To refer to a column, create a `Column` object by calling the `col` function in the
`snowflake.snowpark.functions` module.

```python
# Import the col function from the functions module.
from snowflake.snowpark.functions import col

df_product_info = session.table("sample_product_data").select(col("id"), col("name"))
df_product_info.show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_product_info
```

```output
---------------------
|"ID"  |"NAME"      |
---------------------
|1     |Product 1   |
|2     |Product 1A  |
|3     |Product 1B  |
|4     |Product 2   |
|5     |Product 2A  |
|6     |Product 2B  |
|7     |Product 3   |
|8     |Product 3A  |
|9     |Product 3B  |
|10    |Product 4   |
---------------------
```

> **Note:**
>
> To create a `Column` object for a literal, refer to Using Literals as Column Objects.

When specifying a filter, projection, join condition, etc., you can use `Column` objects in an expression. For example:

* You can use `Column` objects with the `filter` method to specify a filter condition:

  ```python
  # Specify the equivalent of "WHERE id = 20"
  # in a SQL SELECT statement.
  df_filtered = df.filter(col("id") == 20)
  ```

  ```python
  df = session.create_dataframe([[1, 3], [2, 10]], schema=["a", "b"])
  # Specify the equivalent of "WHERE a + b < 10"
  # in a SQL SELECT statement.
  df_filtered = df.filter((col("a") + col("b")) < 10)
  df_filtered.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df_filtered
  ```

  ```output
  -------------
  |"A"  |"B"  |
  -------------
  |1    |3    |
  -------------
  ```
* You can use `Column` objects with the `select` method to define an alias:

  ```python
  df = session.create_dataframe([[1, 3], [2, 10]], schema=["a", "b"])
  # Specify the equivalent of "SELECT b * 10 AS c"
  # in a SQL SELECT statement.
  df_selected = df.select((col("b") * 10).as_("c"))
  df_selected.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df_selected
  ```

  ```output
  -------
  |"C"  |
  -------
  |30   |
  |100  |
  -------
  ```
* You can use `Column` objects with the `join` method to define a join condition:

  ```python
  dfX = session.create_dataframe([[1], [2]], schema=["a_in_X"])
  dfY = session.create_dataframe([[1], [3]], schema=["b_in_Y"])
  # Specify the equivalent of "X JOIN Y on X.a_in_X = Y.b_in_Y"
  # in a SQL SELECT statement.
  df_joined = dfX.join(dfY, col("a_in_X") == col("b_in_Y")).select(dfX["a_in_X"].alias("the_joined_column"))
  df_joined.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df_joined
  ```

  ```output
  -----------------------
  |"THE_JOINED_COLUMN"  |
  -----------------------
  |1                    |
  -----------------------
  ```

When referring to columns in two different DataFrame objects that have the same name (for example, joining the DataFrames on that
column), you can use the `DataFrame.col` method in one DataFrame object to refer to a column in that object (for example,
`df1.col("name")` and `df2.col("name")`).

The following example demonstrates how to use the `DataFrame.col` method to refer to a column in a specific DataFrame. The
example joins two DataFrame objects that both have a column named `key`. The example uses the `Column.as` method to change
the names of the columns in the newly created DataFrame.

```python
# Create two DataFrames to join
df_lhs = session.create_dataframe([["a", 1], ["b", 2]], schema=["key", "value"])
df_rhs = session.create_dataframe([["a", 3], ["b", 4]], schema=["key", "value"])
# Create a DataFrame that joins two other DataFrames (df_lhs and df_rhs).
# Use the DataFrame.col method to refer to the columns used in the join.
df_joined = df_lhs.join(df_rhs, df_lhs.col("key") == df_rhs.col("key")).select(df_lhs.col("key").as_("key"), df_lhs.col("value").as_("L"), df_rhs.col("value").as_("R"))
df_joined.show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_joined
```

```output
---------------------
|"KEY"  |"L"  |"R"  |
---------------------
|a      |1    |3    |
|b      |2    |4    |
---------------------
```

#### Using Double Quotes Around Object Identifiers (Table Names, Column Names, etc.)

The names of databases, schemas, tables, and stages that you specify must conform to the
[Snowflake identifier requirements](../../../sql-reference/identifiers-syntax.md).

Create a table that has case-sensitive columns:

```python
session.sql("""
  create or replace temp table "10tablename"(
  id123 varchar, -- case insensitive because it's not quoted.
  "3rdID" varchar, -- case sensitive.
  "id with space" varchar -- case sensitive.
)""").collect()
# Add return to the statement to return the collect() results in a Python worksheet
```

```output
[Row(status='Table 10tablename successfully created.')]
```

Then add values to the table:

```python
session.sql("""insert into "10tablename" (id123, "3rdID", "id with space") values ('a', 'b', 'c')""").collect()
# Add return to the statement to return the collect() results in a Python worksheet
```

```output
[Row(number of rows inserted=1)]
```

Then create a DataFrame for the table and query the table:

```python
df = session.table('"10tablename"')
df.show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df
```

```output
---------------------------------------
|"ID123"  |"3rdID"  |"id with space"  |
---------------------------------------
|a        |b        |c                |
---------------------------------------
```

When you specify a name, Snowflake considers the
name to be in upper case. For example, the following calls are equivalent:

```python
df.select(col("id123")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(ID123='a')]
```

If the name does not conform to the identifier requirements, you must use double quotes (`"`) around the name. Use a backslash
(`\`) to escape the double quote character within a string literal. For example, the following table name does not start
with a letter or an underscore, so you must use double quotes around the name:

```python
df = session.table("\"10tablename\"")
```

Alternatively, you can use single quotes instead of backslashes to escape the double quote character within a string literal.

```python
df = session.table('"10tablename"')
```

Note that when specifying the name of a Column, you don’t need to use double quotes around the name. The Snowpark library
automatically encloses the column name in double quotes for you if the name does not comply with the identifier requirements:

```python
df.select(col("3rdID")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(3rdID='b')]
```

As another example, the following calls are equivalent:

```python
df.select(col("id with space")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(id with space='c')]
```

```python
df.select(col("\"id with space\"")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(id with space='c')]
```

If you have already added double quotes around a column name, the library does not insert additional double quotes around the
name.

In some cases, the column name might contain double quote characters:

```python
session.sql('''
  create or replace temp table quoted(
  "name_with_""air""_quotes" varchar,
  """column_name_quoted""" varchar
)''').collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(status='Table QUOTED successfully created.')]
```

```python
session.sql('''insert into quoted ("name_with_""air""_quotes", """column_name_quoted""") values ('a', 'b')''').collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(number of rows inserted=1)]
```

As explained in [Identifier requirements](../../../sql-reference/identifiers-syntax.md), for each double quote character within a double-quoted identifier, you
must use two double quote characters (e.g. `"name_with_""air""_quotes"` and `"""column_name_quoted"""`):

```python
df_table = session.table("quoted")
df_table.select("\"name_with_\"\"air\"\"_quotes\"").collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(name_with_"air"_quotes='a')]
```

```python
df_table.select("\"\"\"column_name_quoted\"\"\"").collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row("column_name_quoted"='b')]
```

When an identifier is enclosed in double quotes (whether you explicitly added the quotes or the library added
the quotes for you), [Snowflake treats the identifier as case-sensitive](../../../sql-reference/identifiers-syntax.md):

```python
# The following calls are NOT equivalent!
# The Snowpark library adds double quotes around the column name,
# which makes Snowflake treat the column name as case-sensitive.
df.select(col("id with space")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(id with space='c')]
```

Compared with this example:

```python
from snowflake.snowpark.exceptions import SnowparkSQLException
try:
  df.select(col("ID WITH SPACE")).collect()
except SnowparkSQLException as e:
  print(e.message)
```

```output
000904 (42000): SQL compilation error: error line 1 at position 7
invalid identifier '"ID WITH SPACE"'
```

### Using Literals as Column Objects

To use a literal in a method that takes a `Column` object as an argument, create a `Column` object for the literal by passing
the literal to the `lit` function in the `snowflake.snowpark.functions` module. For example:

```python
# Import for the lit and col functions.
from snowflake.snowpark.functions import col, lit

# Show the first 10 rows in which num_items is greater than 5.
# Use `lit(5)` to create a Column object for the literal 5.
df_filtered = df.filter(col("num_items") > lit(5))
```

### Casting a Column Object to a Specific Type

To cast a `Column` object to a specific type, call the `cast` method, and pass in a type object from the
`snowflake.snowpark.types` module. For example, to cast a literal
as a [NUMBER](../../../sql-reference/data-types-numeric.md) with a precision of 5 and a scale of 2:

```python
# Import for the lit function.
from snowflake.snowpark.functions import lit

 # Import for the DecimalType class.
from snowflake.snowpark.types import DecimalType

decimal_value = lit(0.05).cast(DecimalType(5,2))
```

### Chaining Method Calls

Because each method that transforms a DataFrame object returns a new DataFrame object
that has the transformation applied, you can [chain method calls](https://en.wikipedia.org/wiki/Method_chaining) to produce a
new DataFrame that is transformed in additional ways.

The following example returns a DataFrame that is configured to:

* Query the `sample_product_data` table.
* Return the row with `id = 1`.
* Select the `name` and `serial_number` columns.

  ```python
  df_product_info = session.table("sample_product_data").filter(col("id") == 1).select(col("name"), col("serial_number"))
  df_product_info.show()
  # To return the DataFrame as a table in a Python worksheet use return instead of show()
  return df_product_info
  ```

  ```output
  -------------------------------
  |"NAME"     |"SERIAL_NUMBER"  |
  -------------------------------
  |Product 1  |prod-1           |
  -------------------------------
  ```

In this example:

* `session.table("sample_product_data")` returns a DataFrame for the `sample_product_data` table.

  Although the DataFrame does not yet contain the data from the table, the object does contain the definitions of the columns in
  the table.
* `filter(col("id") == 1)` returns a DataFrame for the `sample_product_data` table that is set up to return the row with
  `id = 1`.

  Note that the DataFrame does not yet contain the matching row from the table. The matching row is not retrieved until you
  call an action method.
* `select(col("name"), col("serial_number"))` returns a DataFrame that contains the `name` and `serial_number` columns
  for the row in the `sample_product_data` table that has `id = 1`.

The order of calls is important when you chain method calls. Each method call returns a DataFrame that has been
transformed. Make sure that subsequent calls work with the transformed DataFrame.

When using Snowpark Python, you might need to make the `select` and `filter` method calls in a different order than you would
use the equivalent keywords (SELECT and WHERE) in a SQL statement.

### Retrieving Column Definitions

To retrieve the definition of the columns in the dataset for the DataFrame, call the `schema` property. This method returns
a `StructType` object that contains an `list` of `StructField` objects. Each `StructField` object
contains the definition of a column.

```python
# Import the StructType
from snowflake.snowpark.types import *
# Get the StructType object that describes the columns in the
# underlying rowset.
table_schema = session.table("sample_product_data").schema
table_schema
StructType([StructField('ID', LongType(), nullable=True), StructField('PARENT_ID', LongType(), nullable=True), StructField('CATEGORY_ID', LongType(), nullable=True), StructField('NAME', StringType(), nullable=True), StructField('SERIAL_NUMBER', StringType(), nullable=True), StructField('KEY', LongType(), nullable=True), StructField('"3rd"', LongType(), nullable=True)])
```

In the returned `StructType` object, the column names are always normalized. Unquoted identifiers are returned in uppercase,
and quoted identifiers are returned in the exact case in which they were defined.

The following example creates a DataFrame containing the columns named `ID` and `3rd`. For the column name `3rd`, the
Snowpark library automatically encloses the name in double quotes (`"3rd"`) because
the name does not comply with the requirements for an identifier.

The example calls the `schema` property and then calls the `names` property on the returned `StructType` object to
get a `list` of column names. The names are normalized in the `StructType` returned by the `schema` property.

```python
# Create a DataFrame containing the "id" and "3rd" columns.
df_selected_columns = session.table("sample_product_data").select(col("id"), col("3rd"))
# Print out the names of the columns in the schema.
# This prints List["ID", "\"3rd\""]
df_selected_columns.schema.names
```

```output
['ID', '"3rd"']
```

## Performing an Action to Evaluate a DataFrame

As mentioned earlier, the DataFrame is lazily evaluated, which means the SQL statement isn’t sent to the server for execution
until you perform an action. An action causes the DataFrame to be evaluated and sends the corresponding SQL statement to the
server for execution.

The following methods perform an action:

| Class | Method | Description |
| --- | --- | --- |
| `DataFrame` | `collect` | Evaluates the DataFrame and returns the resulting dataset as an `list` of `Row` objects. |
| `DataFrame` | `count` | Evaluates the DataFrame and returns the number of rows. |
| `DataFrame` | `show` | Evaluates the DataFrame and prints the rows to the console. This method limits the number of rows to 10 (by default). |
| `DataFrameWriter` | `save_as_table` | Saves the data in the DataFrame to the specified table. Refer to Saving Data to a Table. |

For example, to execute a query against a table and return the results, call the `collect` method:

```python
# Create a DataFrame with the "id" and "name" columns from the "sample_product_data" table.
# This does not execute the query.
df = session.table("sample_product_data").select(col("id"), col("name"))

# Send the query to the server for execution and
# return a list of Rows containing the results.
results = df.collect()
# Use a return statement to return the collect() results in a Python worksheet
# return results
```

To execute the query and return the number of results, call the `count` method:

```python
# Create a DataFrame for the "sample_product_data" table.
df_products = session.table("sample_product_data")

# Send the query to the server for execution and
# print the count of rows in the table.
print(df_products.count())
12
```

To execute a query and print the results to the console, call the `show` method:

```python
# Create a DataFrame for the "sample_product_data" table.
df_products = session.table("sample_product_data")

# Send the query to the server for execution and
# print the results to the console.
# The query limits the number of rows to 10 by default.
df_products.show()
# To return the DataFrame as a table in a Python worksheet use return instead of show()
return df_products
```

```output
-------------------------------------------------------------------------------------
|"ID"  |"PARENT_ID"  |"CATEGORY_ID"  |"NAME"      |"SERIAL_NUMBER"  |"KEY"  |"3rd"  |
-------------------------------------------------------------------------------------
|1     |0            |5              |Product 1   |prod-1           |1      |10     |
|2     |1            |5              |Product 1A  |prod-1-A         |1      |20     |
|3     |1            |5              |Product 1B  |prod-1-B         |1      |30     |
|4     |0            |10             |Product 2   |prod-2           |2      |40     |
|5     |4            |10             |Product 2A  |prod-2-A         |2      |50     |
|6     |4            |10             |Product 2B  |prod-2-B         |2      |60     |
|7     |0            |20             |Product 3   |prod-3           |3      |70     |
|8     |7            |20             |Product 3A  |prod-3-A         |3      |80     |
|9     |7            |20             |Product 3B  |prod-3-B         |3      |90     |
|10    |0            |50             |Product 4   |prod-4           |4      |100    |
-------------------------------------------------------------------------------------
```

To limit the number of rows to 20:

```python
# Create a DataFrame for the "sample_product_data" table.
df_products = session.table("sample_product_data")

# Limit the number of rows to 20, rather than 10.
df_products.show(20)
# All rows are returned when you use return in a Python worksheet to return the DataFrame as a table
return df_products
```

```output
-------------------------------------------------------------------------------------
|"ID"  |"PARENT_ID"  |"CATEGORY_ID"  |"NAME"      |"SERIAL_NUMBER"  |"KEY"  |"3rd"  |
-------------------------------------------------------------------------------------
|1     |0            |5              |Product 1   |prod-1           |1      |10     |
|2     |1            |5              |Product 1A  |prod-1-A         |1      |20     |
|3     |1            |5              |Product 1B  |prod-1-B         |1      |30     |
|4     |0            |10             |Product 2   |prod-2           |2      |40     |
|5     |4            |10             |Product 2A  |prod-2-A         |2      |50     |
|6     |4            |10             |Product 2B  |prod-2-B         |2      |60     |
|7     |0            |20             |Product 3   |prod-3           |3      |70     |
|8     |7            |20             |Product 3A  |prod-3-A         |3      |80     |
|9     |7            |20             |Product 3B  |prod-3-B         |3      |90     |
|10    |0            |50             |Product 4   |prod-4           |4      |100    |
|11    |10           |50             |Product 4A  |prod-4-A         |4      |100    |
|12    |10           |50             |Product 4B  |prod-4-B         |4      |100    |
-------------------------------------------------------------------------------------
```

> **Note:**
>
> If you call the `schema` property to get the definitions of the columns in the DataFrame, you do not need to
> call an action method.

## Saving Data to a Table

To save the contents of a DataFrame to a table:

1. Call the `write` property to get a `DataFrameWriter` object.
2. Call the `mode` method in the `DataFrameWriter` object and specify the mode.
   For more information, see [the API documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.DataFrameWriter.mode#snowflake.snowpark.DataFrameWriter.mode).
   This method returns a new `DataFrameWriter` object that is configured with the specified mode.
3. Call the `save_as_table` method in the `DataFrameWriter` object to save the contents of the DataFrame to a
   specified table.

Note that you do not need to call a separate method (e.g. `collect`) to execute the SQL statement that saves the data to the
table.

For example:

```python
df.write.mode("overwrite").save_as_table("table1")
```

## Creating a View From a DataFrame

To create a view from a DataFrame, call the `create_or_replace_view` method, which immediately creates the new view:

```python
import os
database = os.environ["snowflake_database"]  # use your own database and schema
schema = os.environ["snowflake_schema"]
view_name = "my_view"
df.create_or_replace_view(f"{database}.{schema}.{view_name}")
```

```output
[Row(status='View MY_VIEW successfully created.')]
```

In a Python worksheet, because you run the worksheet in the context of a database and schema, you can run the following to create a view:

```python
# Define a DataFrame
df_products = session.table("sample_product_data")
# Define a View name
view_name = "my_view"
# Create the view
df_products.create_or_replace_view(f"{view_name}")
# return the view name
return view_name + " successfully created"
my_view successfully created
```

Views that you create by calling `create_or_replace_view` are persistent. If you no longer need that view, you can
[drop the view manually](../../../sql-reference/sql/drop-view.md).

Alternatively, use the `create_or_replace_temp_view` method, which creates a temporary view.
The temporary view is only available in the session in which it is created.

## Working With Files in a Stage

This section explains how to query data in a file in a Snowflake stage. For other operations on files,
use SQL statements.

To query data in files in a Snowflake stage, use the `DataFrameReader` class:

1. Call the `read` method in the `Session` class to access a `DataFrameReader` object.
2. If the files are in CSV format, describe the fields in the file. To do this:

   1. Create a `StructType` object that consists of a `list` of `StructField` objects that describe the fields in
      the file.
   2. For each `StructField` object, specify the following:

      * The name of the field.
      * The data type of the field (specified as an object in the `snowflake.snowpark.types` module).
      * Whether or not the field is nullable.

      For example:

      > ```python
      > from snowflake.snowpark.types import *
      >
      > schema_for_data_file = StructType([
      >                           StructField("id", StringType()),
      >                           StructField("name", StringType())
      >                        ])
      > ```
   3. Call the `schema` property in the `DataFrameReader` object, passing in the `StructType` object.

      For example:

      ```python
      df_reader = session.read.schema(schema_for_data_file)
      ```

      The `schema` property returns a `DataFrameReader` object that is configured to read files containing the specified
      fields.

      Note that you do not need to do this for files in other formats (such as JSON). For those files, the
      `DataFrameReader` treats the data as a single field of the VARIANT type with the field name `$1`.
3. If you need to specify additional information about how the data should be read (for example, that the data is compressed or
   that a CSV file uses a semicolon instead of a comma to delimit fields), call the `option` or `options` methods of the
   `DataFrameReader` object.

   The `option` method takes a name and a value of the option that you want to set and lets you combine multiple chained calls
   whearas the `options` method takes a dictionary of the names of options and their corresponding values.

   For the names and values of the file format options, see the
   [documentation on CREATE FILE FORMAT](../../../sql-reference/sql/create-file-format.md).

   You can also set the copy options described in the [COPY INTO TABLE documentation](../../../sql-reference/sql/copy-into-table.md).
   Note that setting copy options can result in a more expensive execution strategy when you
   retrieve the data into the DataFrame.

   The following example sets up the `DataFrameReader` object to query data in a CSV file that is not compressed and that
   uses a semicolon for the field delimiter.

   > ```python
   > df_reader = df_reader.option("field_delimiter", ";").option("COMPRESSION", "NONE")
   > ```

   The `option` and `options` methods return a `DataFrameReader` object that is configured with the specified options.
4. Call the method corresponding to the format of the file (e.g. the `csv` method), passing in the location of the file.

   > ```python
   > df = df_reader.csv("@s3_ts_stage/emails/data_0_0_0.csv")
   > ```

   The methods corresponding to the format of a file return a DataFrame object that is configured to hold the data in that file.
5. Use the DataFrame object methods to perform any transformations needed on the
   dataset (for example, selecting specific fields, filtering rows, etc.).

   For example, to extract the `color` element from a JSON file in the stage named `my_stage`:

   > ```python
   > # Import the sql_expr function from the functions module.
   > from snowflake.snowpark.functions import sql_expr
   >
   > df = session.read.json("@my_stage").select(sql_expr("$1:color"))
   > ```

   As explained earlier, for files in formats other than CSV (e.g. JSON), the `DataFrameReader` treats the data in the file
   as a single VARIANT column with the name `$1`.

   This example uses the `sql_expr` function in the `snowflake.snowpark.functions` module to specify the path to
   the `color` element.

   Note that the `sql_expr` function does not interpret or modify the input argument. The function just allows you to
   construct expressions and snippets in SQL that are not yet supported by the Snowpark API.
6. Call an action method to query the data in the file.

   As is the case with DataFrames for tables, the data is not retrieved into the DataFrame until you call an action method.

## Working with Semi-Structured Data

Using a DataFrame, you can query and access [semi-structured data](../../../user-guide/semistructured-intro.md) (e.g JSON data). The
next sections explain how to work with semi-structured data in a DataFrame.

* Traversing Semi-Structured Data
* Explicitly Casting Values in Semi-Structured Data
* Flattening an Array of Objects into Rows

> **Note:**
>
> The examples in these sections use the sample data in [Sample Data Used in Examples](../../../user-guide/querying-semistructured.md).

### Traversing Semi-Structured Data

To refer to a specific field or element in semi-structured data, use the following methods of the `Column` object:

* Get attribute `col_object["<field_name>"]` to return a `Column` object for a field in an OBJECT (or a VARIANT that contains an
  OBJECT).
* Use `col_object[<index>]` to return a `Column` object for an element in an ARRAY (or a VARIANT that contains an ARRAY).

> **Note:**
>
> If the field name or elements in the path are irregular and make it difficult to use the indexing described above, you can
> use `get`, `get_ignore_case`, or `get_path` as an alternative.

For example, the following code selects the `dealership` field in objects in the `src` column of the
[sample data](../../../user-guide/querying-semistructured.md):

```python
from snowflake.snowpark.functions import col

df = session.table("car_sales")
df.select(col("src")["dealership"]).show()
```

The code prints the following output:

```output
----------------------------
|"""SRC""['DEALERSHIP']"   |
----------------------------
|"Valley View Auto Sales"  |
|"Tindel Toyota"           |
----------------------------
```

> **Note:**
>
> The values in the DataFrame are surrounded by double quotes because these values are returned as string literals. To cast these
> values to a specific type, see Explicitly Casting Values in Semi-Structured Data.

You can also chain method calls to traverse a path to a specific
field or element.

For example, the following code selects the `name` field in the `salesperson` object:

```python
df = session.table("car_sales")
df.select(df["src"]["salesperson"]["name"]).show()
```

The code prints the following output:

```output
------------------------------------
|"""SRC""['SALESPERSON']['NAME']"  |
------------------------------------
|"Frank Beasley"                   |
|"Greg Northrup"                   |
------------------------------------
```

As another example, the following code selects the first element of `vehicle` field, which holds an array of vehicles. The
example also selects the `price` field from the first element.

```python
df = session.table("car_sales")
df.select(df["src"]["vehicle"][0]).show()
df.select(df["src"]["vehicle"][0]["price"]).show()
```

The code prints the following output:

```output
---------------------------
|"""SRC""['VEHICLE'][0]"  |
---------------------------
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "paint protection"   |
|  ],                     |
|  "make": "Honda",       |
|  "model": "Civic",      |
|  "price": "20275",      |
|  "year": "2017"         |
|}                        |
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "rust proofing",     |
|    "fabric protection"  |
|  ],                     |
|  "make": "Toyota",      |
|  "model": "Camry",      |
|  "price": "23500",      |
|  "year": "2017"         |
|}                        |
---------------------------

------------------------------------
|"""SRC""['VEHICLE'][0]['PRICE']"  |
------------------------------------
|"20275"                           |
|"23500"                           |
------------------------------------
```

As an alternative to access fields in aforementioned way, you can use `get`, `get_ignore_case`, or
`get_path` functions if the field name or elements in the path are irregular.

For example, the following lines of code both print the value of a specified field in an object:

```python
from snowflake.snowpark.functions import get, get_path, lit

df.select(get(col("src"), lit("dealership"))).show()
df.select(col("src")["dealership"]).show()
```

Similarly, the following lines of code both print the value of a field at a specified path in an object:

```python
df.select(get_path(col("src"), lit("vehicle[0].make"))).show()
df.select(col("src")["vehicle"][0]["make"]).show()
```

### Explicitly Casting Values in Semi-Structured Data

By default, the values of fields and elements are returned as string literals (including the double quotes), as shown in the
examples above.

To avoid unexpected results, call the cast method to cast the value to a specific
type. For example, the following code prints out the values without and with casting:

```python
# Import the objects for the data types, including StringType.
from snowflake.snowpark.types import *

df = session.table("car_sales")
df.select(col("src")["salesperson"]["id"]).show()
df.select(col("src")["salesperson"]["id"].cast(StringType())).show()
```

The code prints the following output:

```output
----------------------------------
|"""SRC""['SALESPERSON']['ID']"  |
----------------------------------
|"55"                            |
|"274"                           |
----------------------------------

---------------------------------------------------
|"CAST (""SRC""['SALESPERSON']['ID'] AS STRING)"  |
---------------------------------------------------
|55                                               |
|274                                              |
---------------------------------------------------
```

### Flattening an Array of Objects into Rows

If you need to “flatten” semi-structured data into a DataFrame (e.g. producing a row for every object in an array), call the
`flatten` using the `join_table_function` method. This method is equivalent to the [FLATTEN](../../../sql-reference/functions/flatten.md) SQL function. If you pass in
a path to an object or array, the method returns a DataFrame that contains a row for each field or element in the object or array.

For example, in the [sample data](../../../user-guide/querying-semistructured.md), `src:customer` is an array of objects that
contain information about a customer. Each object contains a `name` and `address` field.

If you pass this path to the `flatten` function:

```python
df = session.table("car_sales")
df.join_table_function("flatten", col("src")["customer"]).show()
```

the method returns a DataFrame:

```output
----------------------------------------------------------------------------------------------------------------------------------------------------------
|"SRC"                                      |"SEQ"  |"KEY"  |"PATH"  |"INDEX"  |"VALUE"                            |"THIS"                               |
----------------------------------------------------------------------------------------------------------------------------------------------------------
|{                                          |1      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "San Francisco, CA",  |  {                                  |
|    {                                      |       |       |        |         |  "name": "Joyce Ridgely",         |    "address": "San Francisco, CA",  |
|      "address": "San Francisco, CA",      |       |       |        |         |  "phone": "16504378889"           |    "name": "Joyce Ridgely",         |
|      "name": "Joyce Ridgely",             |       |       |        |         |}                                  |    "phone": "16504378889"           |
|      "phone": "16504378889"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Valley View Auto Sales",  |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "55",                            |       |       |        |         |                                   |                                     |
|    "name": "Frank Beasley"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "paint protection"                 |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Honda",                     |       |       |        |         |                                   |                                     |
|      "model": "Civic",                    |       |       |        |         |                                   |                                     |
|      "price": "20275",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
|{                                          |2      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "New York, NY",       |  {                                  |
|    {                                      |       |       |        |         |  "name": "Bradley Greenbloom",    |    "address": "New York, NY",       |
|      "address": "New York, NY",           |       |       |        |         |  "phone": "12127593751"           |    "name": "Bradley Greenbloom",    |
|      "name": "Bradley Greenbloom",        |       |       |        |         |}                                  |    "phone": "12127593751"           |
|      "phone": "12127593751"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Tindel Toyota",           |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "274",                           |       |       |        |         |                                   |                                     |
|    "name": "Greg Northrup"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "rust proofing",                   |       |       |        |         |                                   |                                     |
|        "fabric protection"                |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Toyota",                    |       |       |        |         |                                   |                                     |
|      "model": "Camry",                    |       |       |        |         |                                   |                                     |
|      "price": "23500",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
----------------------------------------------------------------------------------------------------------------------------------------------------------
```

From this DataFrame, you can select the `name` and `address` fields from each object in the `VALUE` field:

```python
df.join_table_function("flatten", col("src")["customer"]).select(col("value")["name"], col("value")["address"]).show()
```

```output
-------------------------------------------------
|"""VALUE""['NAME']"   |"""VALUE""['ADDRESS']"  |
-------------------------------------------------
|"Joyce Ridgely"       |"San Francisco, CA"     |
|"Bradley Greenbloom"  |"New York, NY"          |
-------------------------------------------------
```

The following code adds to the previous example by
casting the values to a specific type and changing the names of the columns:

```python
df.join_table_function("flatten", col("src")["customer"]).select(col("value")["name"].cast(StringType()).as_("Customer Name"), col("value")["address"].cast(StringType()).as_("Customer Address")).show()
```

```output
-------------------------------------------
|"Customer Name"     |"Customer Address"  |
-------------------------------------------
|Joyce Ridgely       |San Francisco, CA   |
|Bradley Greenbloom  |New York, NY        |
-------------------------------------------
```

## Executing SQL Statements

To execute a SQL statement that you specify, call the `sql` method in the `Session` class, and pass in the statement
to be executed. The method returns a DataFrame.

Note that the SQL statement won’t be executed until you call an action method.

```python
# Get the list of the files in a stage.
# The collect() method causes this SQL statement to be executed.
session.sql("create or replace temp stage my_stage").collect()
```

```output
# Prepend a return statement to return the collect() results in a Python worksheet
[Row(status='Stage area MY_STAGE successfully created.')]
```

```python
stage_files_df = session.sql("ls @my_stage").collect()
# Prepend a return statement to return the collect() results in a Python worksheet
# Resume the operation of a warehouse.
# Note that you must call the collect method to execute
# the SQL statement.
session.sql("alter warehouse if exists my_warehouse resume if suspended").collect()
```

```output
# Prepend a return statement to return the collect() results in a Python worksheet
[Row(status='Statement executed successfully.')]
```

```python
# Set up a SQL statement to copy data from a stage to a table.
session.sql("copy into sample_product_data from @my_stage file_format=(type = csv)").collect()
# Prepend a return statement to return the collect() results in a Python worksheet
```

```output
[Row(status='Copy executed with 0 files processed.')]
```

If you want to call methods to transform the DataFrame
(e.g. `filter`, `select`, etc.),
note that these methods work only if the underlying SQL statement is a SELECT statement. The transformation methods are not
supported for other kinds of SQL statements.

```python
df = session.sql("select id, parent_id from sample_product_data where id < 10")
# Because the underlying SQL statement for the DataFrame is a SELECT statement,
# you can call the filter method to transform this DataFrame.
results = df.filter(col("id") < 3).select(col("id")).collect()
# Prepend a return statement to return the collect() results in a Python worksheet

# In this example, the underlying SQL statement is not a SELECT statement.
df = session.sql("ls @my_stage")
# Calling the filter method results in an error.
try:
  df.filter(col("size") > 50).collect()
except SnowparkSQLException as e:
  print(e.message)
```

```output
000904 (42000): SQL compilation error: error line 1 at position 104
invalid identifier 'SIZE'
```

## Submit Snowpark queries concurrently

> **Note:**
>
> This feature requires Snowpark Library for Python version of 1.24 or greater and server version 8.46 or greater.

Thread-safe session objects allow different parts of your Snowpark Python code to run concurrently while using the same session. This enables multiple operations - such as transformations on multiple DataFrames - to be executed concurrently. This is particularly useful when you’re working with queries that can be processed independently on the Snowflake server and it aligns with a more traditional multithreading approach.

The Global Interpreter Lock (GIL) in Python is a mutex that protects access to Python objects, preventing multiple native threads from executing Python bytecode simultaneously. While I/O-bound operations can still benefit from Python’s threading model due to the GIL being released during I/O operations, CPU-bound threads will not achieve true parallelism because only one thread can execute at a time.

Moreover, when used inside Snowflake (e.g. in a stored procedure), the Snowpark Python server manages the Global Interpreter Lock (GIL) by releasing it before submitting queries to Snowflake. This ensures that true concurrency can be achieved when enqueuing multiple queries from separate threads. With this management, Snowpark allows multiple threads to submit queries concurrently, ensuring optimal parallel execution.

### Benefits of Using Thread-Safe Session Objects in Snowpark

The ability to run multiple DataFrame operations concurrently can bring the following benefits to Snowpark users:

* Improved Performance: Thread-safe session objects allow you to run multiple Snowpark Python queries concurrently, reducing overall runtime. For example, if you need to process several tables independently, this feature significantly cuts down the time it takes to complete the job, as you no longer need to wait for each table’s processing to finish before starting the next one.
* Efficient Compute Utilization: Submitting queries concurrently ensures that Snowflake’s compute resources are used efficiently, reducing idle times.
* Usability: Thread-safe session objects integrate seamlessly with Python’s native multithreading APIs, which allows developers to leverage Python’s built-in tools to control thread behavior and optimize parallel execution.

Thread-safe session objects and async jobs can complement each other depending on your use case. Async jobs are useful when you don’t need to wait for your jobs to finish, allowing for non-blocking execution without thread pool management. Thread-safe session objects, on the other hand, are useful when you want to submit multiple queries concurrently from the client side. In some cases, the code blocks can also contain async jobs, allowing both methods to be used together effectively.

Following are some examples where thread-safe session objects can enhance your data pipeline.

#### Example 1: Concurrent Loading of Multiple Tables

This example demonstrates loading data from three different CSV files into three separate tables using three threads to run the `COPY INTO` command concurrently.

```python
import threading
from snowflake.snowpark import Session

# Define the list of tables
tables = ["customers", "orders", "products"]

# Function to copy data from stage to tables
def execute_copy(table_name):
    try:
        # Read data from the stage using DataFrameReader
        df = (
            session.read.option("SKIP_HEADER", 1)
            .option("PATTERN", f"{table_name}[.]csv")
            .option("FORCE", True)
            .csv(f"@my_stage")
        )

        # Copy data into the target table
        df.copy_into_table(
            table_name=table_name, target_columns=session.table(table_name).columns
        )

    except Exception as e:
        print(f"Failed to copy data into {table_name}, Error: {e}")

# Create an empty list of threads
threads = []

# Loop through and start a thread for each table
for table in tables:
    thread = threading.Thread(target=execute_copy, args=(table,))
    threads.append(thread)
    thread.start()

# Wait for all threads to finish
for thread in threads:
    thread.join()
```

#### Example 2: Concurrent Processing of Multiple Tables

This example demonstrates how you can use multiple threads to concurrently filter, aggregate, and insert data into a result table from each customer transaction table (transaction_customer1, transaction_customer2, and transaction_customer3).

```python
from concurrent.futures import ThreadPoolExecutor
from snowflake.snowpark import Session
from snowflake.snowpark.functions import col, month, sum, lit

# List of customers
customers = ["customer1", "customer2", "customer3"]

# Define a function to process each customer transaction table
def process_customer_table(customer_name):
    table_name = f"transaction_{customer_name}"

    try:
        # Load the customer transaction table
        df = session.table(table_name)
        print(f"Processing {table_name}...")

        # Filter data by positive values and non null categories
        df_filtered = df.filter((col("value") > 0) & col("category").is_not_null())

        # Perform aggregation: Sum of value by category and month
        df_aggregated = df_filtered.with_column("month", month(col("date"))).with_column("customer_name", lit(customer_name)).group_by(col("category"), col("month"), col("customer_name")).agg(sum("value").alias("total_value"))

        # Save the processed data into a new result table
        df_aggregated.show()
        df_aggregated.write.save_as_table("aggregate_customers", mode="append")
        print(f"Data from {table_name} processed and saved")

    except Exception as e:
        print(f"Error processing {table_name}: {e}")

# Using ThreadPoolExecutor to handle concurrency
with ThreadPoolExecutor(max_workers=3) as executor:
    # Submit tasks for each customer table
    executor.map(process_customer_table, customers)

# Display the results from the aggregate table
session.table("aggregate_customers").show()
```

#### Limitations of Using Thread-Safe Session Objects

* If you need to manage multiple transactions concurrently, it’s important to use multiple session objects because multiple threads of a single session do not support concurrent transactions.
* Changing session runtime configurations (including Snowflake session variables like database, schema, warehouse, and client side configurations like cte_optimization_enabled, sql_simplifier_enabled) while other threads are active can lead to unexpected behavior. To avoid conflicts, it’s best to use separate session objects if different threads require distinct configurations. For example, if you need to perform operations on different databases in parallel, ensure each thread has its own session object rather than sharing the same session.

## Return the Contents of a DataFrame as a Pandas DataFrame

To return the contents of a DataFrame as a Pandas DataFrame, use the `to_pandas` method.

For example:

```python
python_df = session.create_dataframe(["a", "b", "c"])
pandas_df = python_df.to_pandas()
```

## Snowpark DataFrames vs Snowpark pandas DataFrame: Which should I choose?

By installing the Snowpark Python library, you have the option of using the DataFrames API or [pandas on Snowflake](pandas-on-snowflake.md).

Snowpark DataFrames are modeled after PySpark, while Snowpark pandas is intended to extend the Snowpark DataFrame functionality and provide a familiar interface to pandas users to facilitate easy migration and adoption. We recommend using the different APIs depending on your use case and preference:

| Use Snowpark pandas if you …. | Use Snowpark DataFrames if you … |
| --- | --- |
| Prefer working with or have existing code written in pandas | Prefer working with or have existing code written in Spark |
| Have workflow that involves interactive analysis and iterative exploration | Have workflow that involves batch processing and limited iterative development |
| Are familiar with working with DataFrame operations that get executed immediately | Are familiar with working with DataFrame operations that are lazily evaluated |
| Prefer data being consistent and ordered during the operations | Are Ok with data not being ordered |
| Are Ok with slightly slower performance compared to Snowpark DataFrames in favor of easier to use API | Performance is more important to you than ease of use |

From an implementation perspective, Snowpark DataFrames and pandas DataFrames are semantically different. Since Snowpark DataFrames are modeled after PySpark, it operates on the original data source, gets the most recent updated data, so it does not maintain order for operations. Snowpark pandas are modeled after pandas, which operate on a snapshot of the data, maintain order during the operation, and allow for order-based positional indexing. Order maintainenace is useful for visual inspection of data in interactive data analysis.

For more information, see [Using pandas on Snowflake with Snowpark DataFrames](pandas-on-snowflake.md).

---
title: Working with DataFrames in Snowpark Scala
source: https://docs.snowflake.com/en/developer-guide/snowpark/scala/working-with-dataframes.md
section: Snowpark
---

# Working with DataFrames in Snowpark Scala

In Snowpark, the main way in which you query and process data is through a DataFrame. This topic explains how to work with
DataFrames.

To retrieve and manipulate data, you use the [DataFrame](../reference/scala/com/snowflake/snowpark/DataFrame.md) class. A DataFrame represents a relational dataset that is evaluated
lazily: it only executes when a specific action is triggered. In a sense, a DataFrame is like a query that needs to be evaluated
in order to retrieve data.

To retrieve data into a DataFrame:

1. Construct a DataFrame, specifying the source of the data for the dataset.

   For example, you can create a DataFrame to hold data from a table, an external CSV file, or the execution of a SQL statement.
2. Specify how the dataset in the DataFrame should be transformed.

   For example, you can specify which columns should be selected, how the rows should be filtered, how the results should be
   sorted and grouped, etc.
3. Execute the statement to retrieve the data into the DataFrame.

   In order to retrieve the data into the DataFrame, you must invoke a method that performs an action (for example, the
   `collect()` method).

The next sections explain these steps in more detail.

## Setting up the Examples for this Section

Some of the examples of this section use a DataFrame to query a table named `sample_product_data`. If you want to run these
examples, you can create this table and fill the table with some data by executing the following SQL statements:

```sqlexample
CREATE OR REPLACE TABLE sample_product_data (id INT, parent_id INT, category_id INT, name VARCHAR, serial_number VARCHAR, key INT, "3rd" INT);
INSERT INTO sample_product_data VALUES
    (1, 0, 5, 'Product 1', 'prod-1', 1, 10),
    (2, 1, 5, 'Product 1A', 'prod-1-A', 1, 20),
    (3, 1, 5, 'Product 1B', 'prod-1-B', 1, 30),
    (4, 0, 10, 'Product 2', 'prod-2', 2, 40),
    (5, 4, 10, 'Product 2A', 'prod-2-A', 2, 50),
    (6, 4, 10, 'Product 2B', 'prod-2-B', 2, 60),
    (7, 0, 20, 'Product 3', 'prod-3', 3, 70),
    (8, 7, 20, 'Product 3A', 'prod-3-A', 3, 80),
    (9, 7, 20, 'Product 3B', 'prod-3-B', 3, 90),
    (10, 0, 50, 'Product 4', 'prod-4', 4, 100),
    (11, 10, 50, 'Product 4A', 'prod-4-A', 4, 100),
    (12, 10, 50, 'Product 4B', 'prod-4-B', 4, 100);
```

To verify that the table was created, run:

```sqlexample
SELECT * FROM sample_product_data;
```

## Constructing a DataFrame

To construct a DataFrame, you can use methods in the `Session` class. Each of the following methods constructs a DataFrame
from a different type of data source:

* To create a DataFrame from data in a table, view, or stream, call the `table` method:

  ```scala
  // Create a DataFrame from the data in the "sample_product_data" table.
  val dfTable = session.table("sample_product_data")

  // To print out the first 10 rows, call:
  //   dfTable.show()
  ```

  > **Note:**
  >
  > The `session.table` method returns an `Updatable` object. `Updatable` extends `DataFrame` and provides
  > additional methods for working with data in the table (e.g. methods for updating and deleting data). See
  > Updating, Deleting, and Merging Rows in a Table.
* To create a DataFrame from a sequence of values, call the `createDataFrame` method:

  ```scala
  // Create a DataFrame containing a sequence of values.
  // In the DataFrame, name the columns "i" and "s".
  val dfSeq = session.createDataFrame(Seq((1, "one"), (2, "two"))).toDF("i", "s")
  ```

  > **Note:**
  >
  > Words reserved by Snowflake are not valid as column names when constructing a DataFrame. For a list of reserved words, refer to
  > [Reserved & limited keywords](../../../sql-reference/reserved-keywords.md).
* To create a DataFrame containing a range of values, call the `range` method:

  ```scala
  // Create a DataFrame from a range
  val dfRange = session.range(1, 10, 2)
  ```
* To create a DataFrame for a file in a stage, call `read` to get a
  `DataFrameReader` object. In the `DataFrameReader` object, call the method corresponding to the format of the data
  in the file:

  ```scala
  // Create a DataFrame from data in a stage.
  val dfJson = session.read.json("@mystage2/data1.json")
  ```
* To create a DataFrame to hold the results of a SQL query, call the `sql` method:

  ```scala
  // Create a DataFrame from a SQL query
  val dfSql = session.sql("SELECT name from products")
  ```

  Note: Although you can use this method to execute SELECT statements that retrieve data from tables and staged files, you should
  use the `table` and `read` methods instead. Methods like `table` and `read` can provide better syntax
  highlighting, error highlighting, and intelligent code completion in development tools.

## Specifying How the Dataset Should Be Transformed

To specify which columns should be selected and how the results should be filtered, sorted, grouped, etc., call the DataFrame
methods that transform the dataset. To identify columns in these methods, use the `col` function or an expression that
evaluates to a column. (See Specifying Columns and Expressions.)

For example:

* To specify which rows should be returned, call the `filter` method:

  ```scala
  // Import the col function from the functions object.
  import com.snowflake.snowpark.functions._

  // Create a DataFrame for the rows with the ID 1
  // in the "sample_product_data" table.
  //
  // This example uses the === operator of the Column object to perform an
  // equality check.
  val df = session.table("sample_product_data").filter(col("id") === 1)
  df.show()
  ```
* To specify the columns that should be selected, call the `select` method:

  ```scala
  // Import the col function from the functions object.
  import com.snowflake.snowpark.functions._

  // Create a DataFrame that contains the id, name, and serial_number
  // columns in te "sample_product_data" table.
  val df = session.table("sample_product_data").select(col("id"), col("name"), col("serial_number"))
  df.show()
  ```

Each method returns a new DataFrame object that has been transformed. (The method does not affect the original DataFrame object.)
This means that if you want to apply multiple transformations, you can
chain method calls, calling each subsequent transformation method on the
new DataFrame object returned by the previous method call.

Note that these transformation methods do not retrieve data from the Snowflake database. (The action methods described in
Performing an Action to Evaluate a DataFrame perform the data retrieval.) The transformation methods simply specify how the SQL
statement should be constructed.

### Specifying Columns and Expressions

When calling these transformation methods, you might need to specify columns or expressions that use columns. For example, when
calling the `select` method, you need to specify the columns that should be selected.

To refer to a column, create a [Column](../reference/scala/com/snowflake/snowpark/Column.md) object by calling the [col](../reference/scala/com/snowflake/snowpark/functions$.md) function in the `com.snowflake.snowpark.functions`
object.

```scala
// Import the col function from the functions object.
import com.snowflake.snowpark.functions._

val dfProductInfo = session.table("sample_product_data").select(col("id"), col("name"))
dfProductInfo.show()
```

> **Note:**
>
> To create a `Column` object for a literal, see Using Literals as Column Objects.

When specifying a filter, projection, join condition, etc., you can use `Column` objects in an expression. For example:

* You can use `Column` objects with the `filter` method to specify a filter condition:

  ```scala
  // Specify the equivalent of "WHERE id = 20"
  // in an SQL SELECT statement.
  df.filter(col("id") === 20)
  ```

  ```scala
  // Specify the equivalent of "WHERE a + b < 10"
  // in an SQL SELECT statement.
  df.filter((col("a") + col("b")) < 10)
  ```
* You can use `Column` objects with the `select` method to define an alias:

  ```scala
  // Specify the equivalent of "SELECT b * 10 AS c"
  // in an SQL SELECT statement.
  df.select((col("b") * 10) as "c")
  ```
* You can use `Column` objects with the `join` method to define a join condition:

  ```scala
  // Specify the equivalent of "X JOIN Y on X.a_in_X = Y.b_in_Y"
  // in an SQL SELECT statement.
  dfX.join(dfY, col("a_in_X") === col("b_in_Y"))
  ```

#### Referring to Columns in Different DataFrames

When referring to columns in two different DataFrame objects that have the same name (for example, joining the DataFrames on that
column), you can use the `DataFrame.col` method in one DataFrame object to refer to a column in that object (for example,
`df1.col("name")` and `df2.col("name")`).

The following example demonstrates how to use the `DataFrame.col` method to refer to a column in a specific DataFrame. The
example joins two DataFrame objects that both have a column named `key`. The example uses the `Column.as` method to change
the names of the columns in the newly created DataFrame.

```scala
// Create a DataFrame that joins two other DataFrames (dfLhs and dfRhs).
// Use the DataFrame.col method to refer to the columns used in the join.
val dfJoined = dfLhs.join(dfRhs, dfLhs.col("key") === dfRhs.col("key")).select(dfLhs.col("value").as("L"), dfRhs.col("value").as("R"))
```

#### Using the `apply` Method to Refer to a Column

As an alternative to the `DataFrame.col` method, you can use the `DataFrame.apply` method to refer to a column in a
specific DataFrame. Like the `DataFrame.col` method, the `DataFrame.apply` method accepts a column name as input and
returns a `Column` object.

Note that when an object has an `apply` method in Scala, you can call the `apply` method by calling the object as if
it were a function. For example, to call `df.apply("column_name")`, you can simply write `df("column_name")`. The
following calls are equivalent:

* `df.col("<column_name>")`
* `df.apply("<column_name>")`
* `df("<column_name>")`

The following example is the same as the previous example but uses the `DataFrame.apply` method to refer to the columns in
a join operation:

```scala
// Create a DataFrame that joins two other DataFrames (dfLhs and dfRhs).
// Use the DataFrame.apply method to refer to the columns used in the join.
// Note that dfLhs("key") is shorthand for dfLhs.apply("key").
val dfJoined = dfLhs.join(dfRhs, dfLhs("key") === dfRhs("key")).select(dfLhs("value").as("L"), dfRhs("value").as("R"))
```

### Using Shorthand For a Column Object

As an alternative to using the `col` function, you can refer to a column in one of these ways:

* Use a dollar sign in front of the quoted column name (`$"column_name"`).
* Use an apostrophe (a single quote) in front of the unquoted column name (`'column_name`).

To do this, import the names from the `implicits` object after you create a `Session` object:

```none
val session = Session.builder.configFile("/path/to/properties").create

// Import this after you create the session.
import session.implicits._

// Use the $ (dollar sign) shorthand.
val df = session.table("T").filter($"id" === 10).filter(($"a" + $"b") < 10)

// Use ' (apostrophe) shorthand.
val df = session.table("T").filter('id === 10).filter(('a + 'b) < 10).select('b * 10)
```

### Using Double Quotes Around Object Identifiers (Table Names, Column Names, etc.)

The names of databases, schemas, tables, and stages that you specify must conform to the
[Snowflake identifier requirements](../../../sql-reference/identifiers-syntax.md). When you specify a name, Snowflake considers the
name to be in upper case. For example, the following calls are equivalent:

```scala
// The following calls are equivalent:
df.select(col("id123"))
df.select(col("ID123"))
```

If the name does not conform to the identifier requirements, you must use double quotes (`"`) around the name. Use a backslash
(`\`) to escape the double quote character within a Scala string literal. For example, the following table name does not start
with a letter or an underscore, so you must use double quotes around the name:

```scala
val df = session.table("\"10tablename\"")
```

Note that when specifying the name of a column, you don’t need to use double quotes around the name. The Snowpark library
automatically encloses the column name in double quotes for you if the name does not comply with the identifier requirements:.

```scala
// The following calls are equivalent:
df.select(col("3rdID"))
df.select(col("\"3rdID\""))

// The following calls are equivalent:
df.select(col("id with space"))
df.select(col("\"id with space\""))
```

If you have already added double quotes around a column name, the library does not insert additional double quotes around the
name.

In some cases, the column name might contain double quote characters:

```sqlexample
describe table quoted;
+------------------------+ ...
| name                   | ...
|------------------------+ ...
| name_with_"air"_quotes | ...
| "column_name_quoted"   | ...
+------------------------+ ...
```

As explained in [Identifier requirements](../../../sql-reference/identifiers-syntax.md), for each double quote character within a double-quoted identifier, you
must use two double quote characters (e.g. `"name_with_""air""_quotes"` and `"""column_name_quoted"""`):

```scala
val dfTable = session.table("quoted")
dfTable.select("\"name_with_\"\"air\"\"_quotes\"").show()
dfTable.select("\"\"\"column_name_quoted\"\"\"").show()
```

Keep in mind that when an identifier is enclosed in double quotes (whether you explicitly added the quotes or the library added
the quotes for you), [Snowflake treats the identifier as case-sensitive](../../../sql-reference/identifiers-syntax.md):

```scala
// The following calls are NOT equivalent!
// The Snowpark library adds double quotes around the column name,
// which makes Snowflake treat the column name as case-sensitive.
df.select(col("id with space"))
df.select(col("ID WITH SPACE"))
```

### Using Literals as Column Objects

To use a literal in a method that passes in a `Column` object, create a `Column` object for the literal by passing
the literal to the `lit` function in the `com.snowflake.snowpark.functions` object. For example:

```scala
// Import for the lit and col functions.
import com.snowflake.snowpark.functions._

// Show the first 10 rows in which num_items is greater than 5.
// Use `lit(5)` to create a Column object for the literal 5.
df.filter(col("num_items").gt(lit(5))).show()
```

If the literal is a floating point or double value in Scala (e.g. `0.05` is
[treated as a Double by default](https://docs.scala-lang.org/overviews/scala-book/built-in-types.html)), the Snowpark library
generates SQL that implicitly casts the value to the corresponding Snowpark data type (e.g. `0.05::DOUBLE`). This can produce
an approximate value that differs from the exact number specified.

For example, the following code displays no matching rows, even though the filter (that matches values greater than or equal to
`0.05`) should match the rows in the DataFrame:

```scala
// Create a DataFrame that contains the value 0.05.
val df = session.sql("select 0.05 :: Numeric(5, 2) as a")

// Applying this filter results in no matching rows in the DataFrame.
df.filter(col("a") <= lit(0.06) - lit(0.01)).show()
```

The problem is that `lit(0.06)` and `lit(0.01)` produce approximate values for `0.06` and `0.01`, not the exact values.

To avoid this problem, you can use one of the following approaches:

* Option 1: Cast the literal to the Snowpark type that you want to use. For example,
  to use a [NUMBER](../../../sql-reference/data-types-numeric.md) with a precision of 5 and a scale of 2:

  ```scala
  df.filter(col("a") <= lit(0.06).cast(new DecimalType(5, 2)) - lit(0.01).cast(new DecimalType(5, 2))).show()
  ```
* Option 2: Cast the value to the type that you want to use before passing the value to the `lit` function. For example,
  if you want to use the
  [BigDecimal type](https://docs.scala-lang.org/overviews/scala-book/built-in-types.html#bigint-and-bigdecimal):

  ```scala
  df.filter(col("a") <= lit(BigDecimal(0.06)) - lit(BigDecimal(0.01))).show()
  ```

### Casting a Column Object to a Specific Type

To cast a `Column` object to a specific type, call the [Column.cast](../reference/scala/com/snowflake/snowpark/Column.md) method, and pass in a type object from the
[com.snowflake.snowpark.types package](../reference/scala/com/snowflake/snowpark/types/index.md). For example, to cast a literal as a [NUMBER](../../../sql-reference/data-types-numeric.md) with a precision of 5
and a scale of 2:

```scala
// Import for the lit function.
import com.snowflake.snowpark.functions._
// Import for the DecimalType class..
import com.snowflake.snowpark.types._

val decimalValue = lit(0.05).cast(new DecimalType(5,2))
```

### Chaining Method Calls

Because each method that transforms a DataFrame object returns a new DataFrame object
that has the transformation applied, you can [chain method calls](https://en.wikipedia.org/wiki/Method_chaining) to produce a
new DataFrame that is transformed in additional ways.

The following example returns a DataFrame that is configured to:

* Query the `sample_product_data` table.
* Return the row with `id = 1`.
* Select the `name` and `serial_number` columns.

```scala
val dfProductInfo = session.table("sample_product_data").filter(col("id") === 1).select(col("name"), col("serial_number"))
dfProductInfo.show()
```

In this example:

* `session.table("sample_product_data")` returns a DataFrame for the `sample_product_data` table.

  Although the DataFrame does not yet contain the data from the table, the object does contain the definitions of the columns in
  the table.
* `filter(col("id") === 1)` returns a DataFrame for the `sample_product_data` table that is set up to return the row with
  `id = 1`.

  Note again that the DataFrame does not yet contain the matching row from the table. The matching row is not retrieved until you
  call an action method.
* `select(col("name"), col("serial_number"))` returns a DataFrame that contains the `name` and `serial_number` columns
  for the row in the `sample_product_data` table that has `id = 1`.

When you chain method calls, keep in mind that the order of calls is important. Each method call returns a DataFrame that has been
transformed. Make sure that subsequent calls work with the transformed DataFrame.

For example, in the code below, the `select` method returns a DataFrame that just contains two columns: `name` and
`serial_number`. The `filter` method call on this DataFrame fails because it uses the `id` column, which is not in the
transformed DataFrame.

```scala
// This fails with the error "invalid identifier 'ID'."
val dfProductInfo = session.table("sample_product_data").select(col("name"), col("serial_number")).filter(col("id") === 1)
```

In contrast, the following code executes successfully because the `filter()` method is called on a DataFrame that contains
all of the columns in the `sample_product_data` table (including the `id` column):

```scala
// This succeeds because the DataFrame returned by the table() method
// includes the "id" column.
val dfProductInfo = session.table("sample_product_data").filter(col("id") === 1).select(col("name"), col("serial_number"))
dfProductInfo.show()
```

Keep in mind that you might need to make the `select` and `filter` method calls in a different order than you would
use the equivalent keywords (SELECT and WHERE) in a SQL statement.

### Limiting the Number of Rows in a DataFrame

To limit the number of rows in a DataFrame, you can use the [DataFrame.limit](../reference/scala/com/snowflake/snowpark/DataFrame.md) transformation method.

The Snowpark API also provides action methods for retrieving and printing out a limited number of rows:

* the [DataFrame.first](../reference/scala/com/snowflake/snowpark/DataFrame.md) action method (to execute the query and return the first `n` rows)
* the [DataFrame.show](../reference/scala/com/snowflake/snowpark/DataFrame.md) action method (to execute the query and print the first `n` rows)

These methods effectively add a [LIMIT](../../../sql-reference/constructs/limit.md) clause to the SQL statement that is executed.

As explained in the [usage notes for LIMIT](../../../sql-reference/constructs/limit.md), the results are non-deterministic unless you
specify a sort order (ORDER BY) in conjunction with LIMIT.

To keep the ORDER BY clause with the LIMIT clause (e.g. so that ORDER BY is not in a separate subquery), you must call the method
that limits results on the DataFrame returned by the `sort` method.

For example, if you are chaining method calls:

```scala
// Limit the number of rows to 5, sorted by parent_id.
var dfSubset = df.sort(col("parent_id")).limit(5);

// Return the first 5 rows, sorted by parent_id.
var arrayOfRows = df.sort(col("parent_id")).first(5)

// Print the first 5 rows, sorted by parent_id.
df.sort(col("parent_id")).show(5)
```

### Retrieving Column Definitions

To retrieve the definition of the columns in the dataset for the DataFrame, call the `schema` method. This method returns
a `StructType` object that contains an `Array` of `StructField` objects. Each `StructField` object
contains the definition of a column.

```scala
// Get the StructType object that describes the columns in the
// underlying rowset.
val tableSchema = session.table("sample_product_data").schema
println("Schema for sample_product_data: " + tableSchema);
```

In the returned `StructType` object, the column names are always normalized. Unquoted identifiers are returned in uppercase,
and quoted identifiers are returned in the exact case in which they were defined.

The following example creates a DataFrame containing the columns named `ID` and `3rd`. For the column name `3rd`, the
Snowpark library automatically encloses the name in double quotes (`"3rd"`) because
the name does not comply with the requirements for an identifier.

The example calls the `schema` method and then calls the `names` method on the returned `StructType` object to
get an `ArraySeq` of column names. The names are normalized in the `StructType` returned by the `schema` method.

```scala
// Create a DataFrame containing the "id" and "3rd" columns.
val dfSelectedColumns = session.table("sample_product_data").select(col("id"), col("3rd"))
// Print out the names of the columns in the schema. This prints out:
//   ArraySeq(ID, "3rd")
println(dfSelectedColumns.schema.names.toSeq)
```

### Joining DataFrames

To join DataFrame objects, call the [DataFrame.join](../reference/scala/com/snowflake/snowpark/DataFrame.md) method.

The following sections explain how to use DataFrames to perform a join:

* Setting up the Sample Data for the Joins
* Specifying the Columns for the Join
* Performing a Natural Join
* Specifying the Type of Join
* Joining Multiple Tables
* Performing a Self-Join

#### Setting up the Sample Data for the Joins

The examples in the next sections use sample data that you can set up by executing the following SQL statements:

```sqlexample
create or replace table sample_a (
  id_a integer,
  name_a varchar,
  value integer
);
insert into sample_a (id_a, name_a, value) values
  (10, 'A1', 5),
  (40, 'A2', 10),
  (80, 'A3', 15),
  (90, 'A4', 20)
;
create or replace table sample_b (
  id_b integer,
  name_b varchar,
  id_a integer,
  value integer
);
insert into sample_b (id_b, name_b, id_a, value) values
  (4000, 'B1', 40, 10),
  (4001, 'B2', 10, 5),
  (9000, 'B3', 80, 15),
  (9099, 'B4', null, 200)
;
create or replace table sample_c (
  id_c integer,
  name_c varchar,
  id_a integer,
  id_b integer
);
insert into sample_c (id_c, name_c, id_a, id_b) values
  (1012, 'C1', 10, null),
  (1040, 'C2', 40, 4000),
  (1041, 'C3', 40, 4001)
;
```

#### Specifying the Columns for the Join

With the `DataFrame.join` method, you can specify the columns to use in one of the following ways:

* Specify a Column expression that describes the join condition.
* Specify one or more columns that should be used as the common columns in the join.

The following example performs an inner join on the column named `id_a`:

```scala
// Create a DataFrame that joins the DataFrames for the tables
// "sample_a" and "sample_b" on the column named "id_a".
val dfLhs = session.table("sample_a")
val dfRhs = session.table("sample_b")
val dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a") === dfRhs.col("id_a"))
dfJoined.show()
```

Note that the example uses the `DataFrame.col` method to specify the condition to use for the join. See
Specifying Columns and Expressions for more about this method.

This prints the following output:

```none
----------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |
----------------------------------------------------------------------
|10      |A1        |5        |4001    |B2        |10      |5        |
|40      |A2        |10       |4000    |B1        |40      |10       |
|80      |A3        |15       |9000    |B3        |80      |15       |
----------------------------------------------------------------------
```

##### Identical Column Names Duplicated in the Join Result

In the DataFrame resulting from a join, the Snowpark library uses the column names found in the tables that were joined even when the
column names are identical across tables. When this happens, these column names are duplicated in the DataFrame resulting from the join.
To access a duplicated column by name, call the `col` method on the DataFrame representing the column’s original table. (For more
information about specifying columns, see Referring to Columns in Different DataFrames.)

Code in the following example joins two DataFrames, then calls the `select` method on the joined DataFrame. It specifies the columns
to select by calling the `col` method from the variable representing the respective DataFrame objects: `dfRhs` and
`dfLhs`. It uses the `as` method to give the columns new names in the DataFrame that the `select` method creates.

```scala
val dfLhs = session.table("sample_a")
val dfRhs = session.table("sample_b")
val dfJoined = dfLhs.join(dfRhs, dfLhs.col("id_a") === dfRhs.col("id_a"))
val dfSelected = dfJoined.select(dfLhs.col("value").as("LeftValue"), dfRhs.col("value").as("RightValue"))
dfSelected.show()
```

This prints the following output:

```none
------------------------------
|"LEFTVALUE"  |"RIGHTVALUE"  |
------------------------------
|5            |5             |
|10           |10            |
|15           |15            |
------------------------------
```

##### Deduplicate Columns Before Saving or Caching

Note that when a DataFrame resulting from a join includes duplicate column names, you must deduplicate or rename columns to remove
duplication in the DataFrame before you save the result to a table or cache the DataFrame. For duplicate column names in a DataFrame that
you save to a table or cache, the Snowpark library will replace duplicate column names with aliases so that they’re no longer duplicated.

The following example illustrates how the output of a cached DataFrame might appear if column names `ID_A` and `VALUE` were
duplicated in a join from two tables, then not deduplicated or renamed prior to caching the result.

```none
--------------------------------------------------------------------------------------------------
|"l_ZSz7_ID_A"  |"NAME_A"  |"l_ZSz7_VALUE"  |"ID_B"  |"NAME_B"  |"r_heec_ID_A"  |"r_heec_VALUE"  |
--------------------------------------------------------------------------------------------------
|10             |A1        |5               |4001    |B2        |10             |5               |
|40             |A2        |10              |4000    |B1        |40             |10              |
|80             |A3        |15              |9000    |B3        |80             |15              |
--------------------------------------------------------------------------------------------------
```

#### Performing a Natural Join

To perform a [natural join](../../../user-guide/querying-joins.md) (where DataFrames are joined on columns that have the same name),
call the [DataFrame.naturalJoin](../reference/scala/com/snowflake/snowpark/DataFrame.md) method.

The following example joins the DataFrames for the tables `sample_a` and `sample_b` on their common columns (the column
`id_a`):

```scala
val dfLhs = session.table("sample_a")
val dfRhs = session.table("sample_b")
val dfJoined = dfLhs.naturalJoin(dfRhs)
dfJoined.show()
```

This prints the following output:

```none
---------------------------------------------------
|"ID_A"  |"VALUE"  |"NAME_A"  |"ID_B"  |"NAME_B"  |
---------------------------------------------------
|10      |5        |A1        |4001    |B2        |
|40      |10       |A2        |4000    |B1        |
|80      |15       |A3        |9000    |B3        |
---------------------------------------------------
```

#### Specifying the Type of Join

By default, the `DataFrame.join` method creates an inner join. To specify a different type of join, set the
`joinType` argument to one of the following values:

| Type of Join | `joinType` |
| --- | --- |
| Inner join | `inner` (default) |
| Cross join | `cross` |
| Full outer join | `full` |
| Left outer join | `left` |
| Left anti join | `leftanti` |
| Left semi join | `leftsemi` |
| Right outer join | `right` |

For example:

```scala
// Create a DataFrame that performs a left outer join on
// "sample_a" and "sample_b" on the column named "id_a".
val dfLhs = session.table("sample_a")
val dfRhs = session.table("sample_b")
val dfLeftOuterJoin = dfLhs.join(dfRhs, dfLhs.col("id_a") === dfRhs.col("id_a"), "left")
dfLeftOuterJoin.show()
```

This prints the following output:

```none
----------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |
----------------------------------------------------------------------
|40      |A2        |10       |4000    |B1        |40      |10       |
|10      |A1        |5        |4001    |B2        |10      |5        |
|80      |A3        |15       |9000    |B3        |80      |15       |
|90      |A4        |20       |NULL    |NULL      |NULL    |NULL     |
----------------------------------------------------------------------
```

#### Joining Multiple Tables

To join multiple tables:

1. Create a DataFrame for each table.
2. Call the `DataFrame.join` method on the first DataFrame, passing in the second DataFrame.
3. Using the DataFrame returned by the `join` method, call the `join` method, passing in the third DataFrame.

You can chain the `join` calls as shown below:

```scala
val dfFirst = session.table("sample_a")
val dfSecond  = session.table("sample_b")
val dfThird = session.table("sample_c")
val dfJoinThreeTables = dfFirst.join(dfSecond, dfFirst.col("id_a") === dfSecond.col("id_a")).join(dfThird, dfFirst.col("id_a") === dfThird.col("id_a"))
dfJoinThreeTables.show()
```

This prints the following output:

```none
------------------------------------------------------------------------------------------------------------
|"ID_A"  |"NAME_A"  |"VALUE"  |"ID_B"  |"NAME_B"  |"ID_A"  |"VALUE"  |"ID_C"  |"NAME_C"  |"ID_A"  |"ID_B"  |
------------------------------------------------------------------------------------------------------------
|10      |A1        |5        |4001    |B2        |10      |5        |1012    |C1        |10      |NULL    |
|40      |A2        |10       |4000    |B1        |40      |10       |1040    |C2        |40      |4000    |
|40      |A2        |10       |4000    |B1        |40      |10       |1041    |C3        |40      |4001    |
------------------------------------------------------------------------------------------------------------
```

#### Performing a Self-Join

If you need to join a table with itself on different columns, you cannot perform the self-join with a single DataFrame. The
following examples that use a single DataFrame to perform a self-join fail because the column expressions for `"id"` are
present in the left and right sides of the join:

```scala
// This fails because columns named "id" and "parent_id"
// are in the left and right DataFrames in the join.
val df = session.table("sample_product_data");
val dfJoined = df.join(df, col("id") === col("parent_id"))
```

```scala
// This fails because columns named "id" and "parent_id"
// are in the left and right DataFrames in the join.
val df = session.table("sample_product_data");
val dfJoined = df.join(df, df("id") === df("parent_id"))
```

Both of these examples fail with the following exception:

```none
Exception in thread "main" com.snowflake.snowpark.SnowparkClientException:
  Joining a DataFrame to itself can lead to incorrect results due to ambiguity of column references.
  Instead, join this DataFrame to a clone() of itself.
```

Instead, use the [DataFrame.clone](../reference/scala/com/snowflake/snowpark/DataFrame.md) method to create a clone of the DataFrame object, and use the two DataFrame objects to
perform the join:

```scala
// Create a DataFrame object for the "sample_product_data" table for the left-hand side of the join.
val dfLhs = session.table("sample_product_data")
// Clone the DataFrame object to use as the right-hand side of the join.
val dfRhs = dfLhs.clone()

// Create a DataFrame that joins the two DataFrames
// for the "sample_product_data" table on the
// "id" and "parent_id" columns.
val dfJoined = dfLhs.join(dfRhs, dfLhs.col("id") === dfRhs.col("parent_id"))
dfJoined.show()
```

If you want to perform a self-join on the same column, call the `join` method that passes in a `Seq` of column
expressions for the `USING` clause:

```scala
// Create a DataFrame that performs a self-join on a DataFrame
// using the column named "key".
val df = session.table("sample_product_data");
val dfJoined = df.join(df, Seq("key"))
```

## Performing an Action to Evaluate a DataFrame

As mentioned earlier, the DataFrame is lazily evaluated, which means the SQL statement isn’t sent to the server for execution
until you perform an action. An action causes the DataFrame to be evaluated and sends the corresponding SQL statement to the
server for execution.

The following sections explain how to perform an action synchronously and asynchronously on a DataFrame:

* Performing an Action Synchronously
* Performing an Action Asynchronously

### Performing an Action Synchronously

To perform an action synchronously, call one of the following action methods:

| Method to Perform an Action Synchronously | Description |
| --- | --- |
| `DataFrame.collect` | Evaluates the DataFrame and returns the resulting dataset as an `Array` of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects. See Returning All Rows. |
| `DataFrame.toLocalIterator` | Evaluates the DataFrame and returns an [Iterator](https://docs.scala-lang.org/overviews/collections/iterators.html) of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects. If the result set is large, use this method to avoid loading all the results into memory at once. See Returning an Iterator for the Rows. |
| `DataFrame.count` | Evaluates the DataFrame and returns the number of rows. |
| `DataFrame.show` | Evaluates the DataFrame and prints the rows to the console. Note that this method limits the number of rows to 10 (by default). See Printing the Rows in a DataFrame. |
| `DataFrame.cacheResult` | Executes the query, creates a temporary table, and puts the results into the table. The method returns a `HasCachedResult` object that you can use to access the data in this temporary table. See Caching a DataFrame. |
| `DataFrame.write.saveAsTable` | Saves the data in the DataFrame to the specified table. See Saving Data to a Table. |
| `DataFrame.write.(csv|json|parquet)` | Saves a DataFrame to a specified file on a stage. See Saving a DataFrame to Files on a Stage. |
| `DataFrame.read.fileformat.copyInto('tableName')` | Copies the data in the DataFrame to the specified table. See Copying Data from Files into a Table. |
| `Session.table('tableName').delete` | Deletes rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').update` | Updates rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').merge.methods.collect` | Merges rows into the specified table. See Updating, Deleting, and Merging Rows in a Table. |

To execute the query and return the number of results, call the `count` method:

```scala
// Create a DataFrame for the "sample_product_data" table.
val dfProducts = session.table("sample_product_data")

// Send the query to the server for execution and
// print the count of rows in the table.
println("Rows returned: " + dfProducts.count())
```

You can also call action methods to:

* Execute a query against a table and return the results.
* Execute a query and print the results to the console.

Note: If you are calling the `schema` method to get the definitions of the columns in the DataFrame, you do not need to
call an action method.

### Performing an Action Asynchronously

> **Note:**
>
> This feature was introduced in Snowpark 0.11.0.

To perform an action asynchronously, call the `async` method to return an “async actor” object (e.g.
`DataFrameAsyncActor`), and call an asynchronous action method in that object.

These action methods of an async actor object return a `TypedAsyncJob` object, which you can use to check
the status of the asynchronous action and retrieve the results of the action.

The next sections explain how to perform actions asynchronously and check the results.

* Understanding the Basic Flow of Asynchronous Actions
* Specifying the Maximum Number of Seconds to Wait
* Accessing an Asynchronous Query by ID

#### Understanding the Basic Flow of Asynchronous Actions

You can use the following methods to perform an action asynchronously:

| Method to Perform an Action Asynchronously | Description |
| --- | --- |
| `DataFrame.async.collect` | Asynchronously evaluates the DataFrame to retrieve the resulting dataset as an `Array` of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects. See Returning All Rows. |
| `DataFrame.async.toLocalIterator` | Asynchronously evaluates the DataFrame to retrieve an [Iterator](https://docs.scala-lang.org/overviews/collections/iterators.html) of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects. If the result set is large, use this method to avoid loading all the results into memory at once. See Returning an Iterator for the Rows. |
| `DataFrame.async.count` | Asynchronously evaluates the DataFrame to retrieve the number of rows. |
| `DataFrame.write.async.saveAsTable` | Asynchronously saves the data in the DataFrame to the specified table. See Saving Data to a Table. |
| `DataFrame.write.async.(csv|json|parquet)` | Saves a DataFrame to a specified file on a stage. See Saving a DataFrame to Files on a Stage. |
| `DataFrame.read.fileformat.async.copyInto('tableName')` | Asynchronously copies the data in the DataFrame to the specified table. See Copying Data from Files into a Table. |
| `Session.table('tableName').async.delete` | Asynchronously deletes rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').async.update` | Asynchronously updates rows in the specified table. See Updating, Deleting, and Merging Rows in a Table. |
| `Session.table('tableName').merge.methods.async.collect` | Asynchronously merges rows into the specified table. Supported in version 1.3.0 or later. See Updating, Deleting, and Merging Rows in a Table. |

From the returned [TypedAsyncJob](../reference/scala/com/snowflake/snowpark/TypedAsyncJob.md) object, you can do the following:

* To determine if the action has completed, call the `isDone` method.
* To get the query ID that corresponds to the action, call the `getQueryId` method.
* To return the results of the action (e.g. the `Array` of `Row` objects for the `collect` method or the count
  of rows for the `count` method), call the `getResult` method.

  Note that `getResult` is a blocking call.
* To cancel the action, call the `cancel` method.

For example, to execute a query asynchronously and retrieve the results as an `Array` of `Row` objects, call
`DataFrame.async.collect`:

```scala
// Create a DataFrame with the "id" and "name" columns from the "sample_product_data" table.
// This does not execute the query.
val df = session.table("sample_product_data").select(col("id"), col("name"))

// Execute the query asynchronously.
// This call does not block.
val asyncJob = df.async.collect()
// Check if the query has completed execution.
println(s"Is query ${asyncJob.getQueryId()} done? ${asyncJob.isDone()}")
// Get an Array of Rows containing the results, and print the results.
// Note that getResult is a blocking call.
val results = asyncJob.getResult()
results.foreach(println)
```

To execute the query asynchronously and retrieve the number of results, call `DataFrame.async.count`:

```scala
// Create a DataFrame for the "sample_product_data" table.
val dfProducts = session.table("sample_product_data")

// Execute the query asynchronously.
// This call does not block.
val asyncJob = df.async.count()
// Check if the query has completed execution.
println(s"Is query ${asyncJob.getQueryId()} done? ${asyncJob.isDone()}")
// Print the count of rows in the table.
// Note that getResult is a blocking call.
println("Rows returned: " + asyncJob.getResult())
```

#### Specifying the Maximum Number of Seconds to Wait

When calling the `getResult` method, you can use the `maxWaitTimeInSeconds` argument to specify the maximum number of
seconds to wait for the query to complete before attempting to retrieve the results. For example:

```scala
// Wait a maximum of 10 seconds for the query to complete before retrieving the results.
val results = asyncJob.getResult(10)
```

If you omit this argument, the method waits for the maximum number of seconds specified by the
[snowpark_request_timeout_in_seconds](creating-session.md) configuration property. (This is a property
that you can set when [creating the Session object](creating-session.md).)

#### Accessing an Asynchronous Query by ID

If you have the query ID of an asynchronous query that you submitted earlier, you can call `Session.createAsyncJob` method
to create an [AsyncJob](../reference/scala/com/snowflake/snowpark/AsyncJob.md) object that you can use to check the status of the query, retrieve the query results, or cancel the
query.

Note that unlike `TypedAsyncJob`, `AsyncJob` does not provide a `getResult` method for retrieving the results.
If you need to retrieve the results, call the `getRows` or `getIterator` method instead.

For example:

```scala
val asyncJob = session.createAsyncJob(myQueryId)
// Check if the query has completed execution.
println(s"Is query ${asyncJob.getQueryId()} done? ${asyncJob.isDone()}")
// If you need to retrieve the results, call getRows to return an Array of Rows containing the results.
// Note that getRows is a blocking call.
val rows = asyncJob.getRows()
rows.foreach(println)
```

## Retrieving Rows into a DataFrame

After you specify how the DataFrame should be transformed, you can
call an action method to execute a query and return the results. You can return
all of the rows in an `Array`, or you can return an [Iterator](https://docs.scala-lang.org/overviews/collections/iterators.html) that allows you to iterate over the results, row by row. In
the latter case, if the amount of data is large, the rows are loaded into memory by chunk to avoid loading a large amount of data
into memory.

* Returning All Rows
* Returning an Iterator for the Rows
* Returning the First n Rows

### Returning All Rows

To return all rows at once, call the [DataFrame.collect](../reference/scala/com/snowflake/snowpark/DataFrame.md) method. This method returns an Array of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects. To retrieve the
values from the row, call the `getType` method (e.g. `getString`, `getInt`, etc.).

For example:

```scala
import com.snowflake.snowpark.functions_

val rows = session.table("sample_product_data").select(col("name"), col("category_id")).sort(col("name")).collect()
for (row <- rows) {
  println(s"Name: ${row.getString(0)}; Category ID: ${row.getInt(1)}")
}
```

### Returning an Iterator for the Rows

If you want to use an [Iterator](https://docs.scala-lang.org/overviews/collections/iterators.html) to iterate over the [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects in the results, call [DataFrame.toLocalIterator](../reference/scala/com/snowflake/snowpark/DataFrame.md). If the
amount of data in the results is large, the method loads the rows by chunk to avoid loading all rows into memory at once.

For example:

```scala
import com.snowflake.snowpark.functions_

while (rowIterator.hasNext) {
  val row = rowIterator.next()
  println(s"Name: ${row.getString(0)}; Category ID: ${row.getInt(1)}")
}
```

### Returning the First `n` Rows

To return the first `n` rows, call the [DataFrame.first](../reference/scala/com/snowflake/snowpark/DataFrame.md) method, passing in the number of rows to return.

As explained in Limiting the Number of Rows in a DataFrame, the results are non-deterministic. If you want the results to be
deterministic, call this method on a sorted DataFrame (`df.sort().first()`).

For example:

```scala
import com.snowflake.snowpark.functions_

val df = session.table("sample_product_data")
val rows = df.sort(col("name")).first(5)
rows.foreach(println)
```

## Printing the Rows in a DataFrame

To print the first 10 rows in the DataFrame to the console, call the [DataFrame.show](../reference/scala/com/snowflake/snowpark/DataFrame.md) method. To print out a different number of
rows, pass in the number of rows to print.

As explained in Limiting the Number of Rows in a DataFrame, the results are non-deterministic. If you want the results to be
deterministic, call this method on a sorted DataFrame (`df.sort().show()`).

For example:

```scala
import com.snowflake.snowpark.functions_

val df = session.table("sample_product_data")
df.sort(col("name")).show()
```

## Updating, Deleting, and Merging Rows in a Table

> **Note:**
>
> This feature was introduced in Snowpark 0.7.0.

When you call `Session.table` to create a `DataFrame` object for a table, the method returns an `Updatable`
object, which extends `DataFrame` with additional methods for updating and deleting data in the table. (See [Updatable](../reference/scala/com/snowflake/snowpark/Updatable.md).)

If you need to update or delete rows in a table, you can use the following methods of the `Updatable` class:

* Call `update` to update existing rows in the table. See Updating Rows in a Table.
* Call `delete` to delete rows from a table. See Deleting Rows in a Table.
* Call `merge` to insert, update, and delete rows in one table, based on data in a second table or subquery. (This is the
  equivalent of the [MERGE](../../../sql-reference/sql/merge.md) command in SQL.) See Merging Rows into a Table.

### Updating Rows in a Table

For the `update` method, pass in a `Map` that associates the columns to update and the corresponding values to assign
to those columns. `update` returns an `UpdateResult` object, which contains the number of rows that were updated. (See
[UpdateResult](../reference/scala/com/snowflake/snowpark/UpdateResult.md).)

> **Note:**
>
> `update` is an action method, which means that calling the method sends
> SQL statements to the server for execution.

For example, to replace the values in the column named `count` with the value `1`:

```scala
val updatableDf = session.table("sample_product_data")
val updateResult = updatableDf.update(Map("count" -> lit(1)))
println(s"Number of rows updated: ${updateResult.rowsUpdated}")
```

The example above uses the name of the column to identify the column. You can also use a column expression:

```scala
val updateResult = updatableDf.update(Map(col("count") -> lit(1)))
```

If the update should be made only when a condition is met, you can specify that condition as an argument. For example, to replace
the values in the column named `count` for rows in which the `category_id` column has the value `20`:

```scala
val updateResult = updatableDf.update(Map(col("count") -> lit(1)), col("category_id") === 20)
```

If you need to base the condition on a join with a different `DataFrame` object, you can pass that `DataFrame` in as
an argument and use that `DataFrame` in the condition. For example, to replace the values in the column named `count` for
rows in which the `category_id` column matches the `category_id` in the `DataFrame` `dfParts`:

```scala
val updatableDf = session.table("sample_product_data")
val dfParts = session.table("parts")
val updateResult = updatableDf.update(Map(col("count") -> lit(1)), updatableDf("category_id") === dfParts("category_id"), dfParts)
```

### Deleting Rows in a Table

For the `delete` method, you can specify a condition that identifies the rows to delete, and you can base that condition on
a join with another DataFrame. `delete` returns a `DeleteResult` object, which contains the
number of rows that were deleted. (See [DeleteResult](../reference/scala/com/snowflake/snowpark/DeleteResult.md).)

> **Note:**
>
> `delete` is an action method, which means that calling the method sends
> SQL statements to the server for execution.

For example, to delete the rows that have the value `1` in the `category_id` column:

```scala
val updatableDf = session.table("sample_product_data")
val deleteResult = updatableDf.delete(updatableDf("category_id") === 1)
println(s"Number of rows deleted: ${deleteResult.rowsDeleted}")
```

If the condition refers to columns in a different DataFrame, pass that DataFrame in as the second argument. For example, to delete
the rows in which the `category_id` column matches the `category_id` in the `DataFrame` `dfParts`, pass in `dfParts`
as the second argument:

```scala
val updatableDf = session.table("sample_product_data")
val deleteResult = updatableDf.delete(updatableDf("category_id") === dfParts("category_id"), dfParts)
println(s"Number of rows deleted: ${deleteResult.rowsDeleted}")
```

### Merging Rows into a Table

To insert, update, and deletes rows in one table based on values in a second table or a subquery (the equivalent of the
[MERGE](../../../sql-reference/sql/merge.md) command in SQL), do the following:

1. In the `Updatable` object for the table where you want the data merged in, call the `merge` method, passing in
   the `DataFrame` object for the other table and the column expression for the join condition.

   This returns a `MergeBuilder` object that you can use to specify the actions to take (e.g. insert, update, or delete) on
   the rows that match and the rows that don’t match. (See [MergeBuilder](../reference/scala/com/snowflake/snowpark/MergeBuilder.md).)
2. Using the `MergeBuilder` object:

   * To specify the update or deletion that should be performed on matching rows, call the `whenMatched` method.

     If you need to specify an additional condition whe rows should be updated or deleted, you can pass in a column expression for
     that condition.

     This method returns a `MatchedClauseBuilder` object that you can use to specify the action to perform. (See
     [MatchedClauseBuilder](../reference/scala/com/snowflake/snowpark/MatchedClauseBuilder.md).)

     Call the `update` or `delete` method in the `MatchedClauseBuilder` object to specify the update or delete
     action that should be performed on matching rows. These methods return a `MergeBuilder` object that you can use to
     specify additional clauses.
   * To specify the insert that should be performed when rows do not match, call the `whenNotMatched` method.

     If you need to specify an additional condition when rows should be inserted, you can pass in a column expression for that
     condition.

     This method returns a `NotMatchedClauseBuilder` object that you can use to specify the action to perform. (See
     [NotMatchedClauseBuilder](../reference/scala/com/snowflake/snowpark/NotMatchedClauseBuilder.md).)

     Call the `insert` method in the `NotMatchedClauseBuilder` object to specify the insert action that should be
     performed when rows do not match. These methods return a `MergeBuilder` object that you can use to specify additional
     clauses.
3. When you are done specifying the inserts, updates, and deletions that should be performed, call the `collect` method of
   the `MergeBuilder` object to perform the specified inserts, updates, and deletions on the table.

   `collect` returns a `MergeResult` object, which contains the number of rows that were inserted, updated, and
   deleted. (See [MergeResult](../reference/scala/com/snowflake/snowpark/MergeResult.md).)

The following example inserts a row with the `id` and `value` columns from the `source` table into the `target` table if
the `target` table does not contain a row with a matching ID:

```scala
val mergeResult = target.merge(source, target("id") === source("id"))
                      .whenNotMatched.insert(Seq(source("id"), source("value")))
                      .collect()
```

The following example updates a row in the `target` table with the value of the `value` column from the row in the `source`
table that has the same ID:

```scala
val mergeResult = target.merge(source, target("id") === source("id"))
                      .whenMatched.update(Map("value" -> source("value")))
                      .collect()
```

## Saving Data to a Table

You can save the contents of a DataFrame to a new or existing table. In order to do this, you must have the following privileges:

* CREATE TABLE privileges on the schema, if the table does not exist.
* INSERT privileges on the table.

To save the contents of a DataFrame to a table:

1. Call the [DataFrame.write](../reference/scala/com/snowflake/snowpark/DataFrame.md) method to get a [DataFrameWriter](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) object.
2. Call the [DataFrameWriter.mode](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method, passing in a [SaveMode](../reference/scala/com/snowflake/snowpark/SaveMode$.md) object that specifies your preferences for writing to the
   table:

   * To insert rows, pass in `SaveMode.Append`.
   * To overwrite the existing table, pass in `SaveMode.Overwrite`.

   This method returns the same `DataFrameWriter` object configured with the specified mode.
3. If you are inserting rows into an existing table (`SaveMode.Append`) and the column names in the DataFrame match the
   column names in the table, call the [DataFrameWriter.option](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method, passing in `"columnOrder"` and `"name"` as
   arguments.

   > **Note:**
   >
   > This method was introduced in Snowpark 1.4.0.

   By default, the `columnOrder` option is set to `"index"`, which means that the `DataFrameWriter` inserts the
   values in the order that the columns appear. For example, the `DataFrameWriter` inserts the value from the first column
   from the DataFrame in the first column in the table, the second column from the DataFrame in the second column in the table,
   etc.

   This method returns the same `DataFrameWriter` object configured with the specified option.
4. Call the [DataFrameWriter.saveAsTable](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) to save the contents of the DataFrame to a specified table.

   You do not need to call a separate method (e.g. `collect`) to execute the SQL statement that saves the data to the table.
   `saveAsTable` is an action method that executes the SQL statement.

The following example overwrites an existing table (identified by the `tableName` variable) with the contents of the DataFrame
`df`:

```scala
df.write.mode(SaveMode.Overwrite).saveAsTable(tableName)
```

The following example inserts rows from the DataFrame `df` into an existing table (identified by the `tableName` variable).
In this example, the table and the DataFrame both contain the columns `c1` and `c2`.

The example demonstrates the difference between setting the `columnOrder` option to `"name"` (which inserts values
into the table columns with the same names as the DataFrame columns) and using the default `columnOrder` option (which
inserts values into the table columns based on the order of the columns in the DataFrame).

```scala
val df = session.sql("SELECT 1 AS c2, 2 as c1")
// With the columnOrder option set to "name", the DataFrameWriter uses the column names
// and inserts a row with the values (2, 1).
df.write.mode(SaveMode.Append).option("columnOrder", "name").saveAsTable(tableName)
// With the default value of the columnOrder option ("index"), the DataFrameWriter the uses column positions
// and inserts a row with the values (1, 2).
df.write.mode(SaveMode.Append).saveAsTable(tableName)
```

## Creating a View From a DataFrame

To create a view from a DataFrame, call the [DataFrame.createOrReplaceView](../reference/scala/com/snowflake/snowpark/DataFrame.md) method:

```scala
df.createOrReplaceView("db.schema.viewName")
```

Note that calling `createOrReplaceView` immediately creates the new view. More importantly, it does not
cause the DataFrame to be evaluated. (The DataFrame itself is not evaluated until you
perform an action.)

Views that you create by calling `createOrReplaceView` are persistent. If you no longer need that view, you can
[drop the view manually](../../../sql-reference/sql/drop-view.md).

If you need to create a temporary view just for the session, call the [DataFrame.createOrReplaceTempView](../reference/scala/com/snowflake/snowpark/DataFrame.md) method instead:

```scala
df.createOrReplaceTempView("db.schema.viewName")
```

## Caching a DataFrame

In some cases, you may need to perform a complex query and keep the results for use in subsequent operations (rather than
executing the same query again). In these cases, you can cache the contents of a DataFrame by calling the
[DataFrame.cacheResult](../reference/scala/com/snowflake/snowpark/DataFrame.md) method.

This method:

* Runs the query.

  You do not need to call a separate action method to retrieve the results
  before calling `cacheResult`. `cacheResult` is an action method that executes the query.
* Saves the results in a temporary table

  Because `cacheResult` creates a temporary table, you must have the CREATE TABLE privilege on the schema that is in use.
* Returns a [HasCachedResult](../reference/scala/com/snowflake/snowpark/HasCachedResult.md) object, which provides access to the results in the temporary table.

  Because `HasCachedResult` extends `DataFrame`, you can perform some of the same operations on this cached data as
  you can perform on a DataFrame.

> **Note:**
>
> Because `cacheResult` executes the query and saves the results to a table, the method can result in increased compute and
> storage costs.

For example:

```scala
import com.snowflake.snowpark.functions_

// Set up a DataFrame to query a table.
val df = session.table("sample_product_data").filter(col("category_id") > 10)
// Retrieve the results and cache the data.
val cachedDf = df.cacheResult()
// Create a DataFrame containing a subset of the cached data.
val dfSubset = cachedDf.filter(col("category_id") === lit(20)).select(col("name"), col("category_id"))
dfSubset.show()
```

Note that the original DataFrame is not affected when you call this method. For example, suppose that `dfTable` is a DataFrame
for the table `sample_product_data`:

```scala
val dfTempTable = dfTable.cacheResult()
```

After you call `cacheResult`, `dfTable` still points to the `sample_product_data` table, and you can continue to use
`dfTable` to query and update that table.

To use the cached data in the temporary table, you use `dfTempTable` (the `HasCachedResult` object returned by
`cacheResult`).

## Working With Files in a Stage

The Snowpark library provides classes and methods that you can use to [load data into Snowflake](../../../guides-overview-loading-data.md) and
[unload data from Snowflake](../../../user-guide/data-unload-overview.md) by using files in stages.

> **Note:**
>
> In order to use these classes and methods on a stage, you must have the required
> [privileges for working with the stage](../../../user-guide/security-access-control-privileges.md).

The next sections explain how to use these classes and methods:

* Uploading and Downloading Files in a Stage
* Using Input Streams to Upload and Download Data in a Stage
* Setting Up a DataFrame for Files in a Stage
* Loading Data from Files into a DataFrame
* Copying Data from Files into a Table
* Saving a DataFrame to Files on a Stage

### Uploading and Downloading Files in a Stage

To upload and download files in a stage, use the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object:

* Uploading Files to a Stage
* Downloading Files from a Stage

#### Uploading Files to a Stage

To upload files to a stage:

1. Verify that you have the [privileges to upload files to the stage](../../../user-guide/security-access-control-privileges.md).
2. Use [Session.file](../reference/scala/com/snowflake/snowpark/Session.md) to access the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object for the session.
3. Call the [FileOperation.put](../reference/scala/com/snowflake/snowpark/FileOperation.md) method to upload the files to a stage.

   This method executes a SQL [PUT](../../../sql-reference/sql/put.md) command.

   * To specify any [optional parameters](../../../sql-reference/sql/put.md) for the PUT command, create a `Map` of the
     parameters and values, and pass in the `Map` as the `options` argument. For example:

     ```scala
     // Upload a file to a stage without compressing the file.
     val putOptions = Map("AUTO_COMPRESS" -> "FALSE")
     val putResults = session.file.put("file:///tmp/myfile.csv", "@myStage", putOptions)
     ```
   * In the `localFilePath` argument, you can use wildcards (`*` and `?`) to identify a set of files to upload. For
     example:

     ```scala
     // Upload the CSV files in /tmp with names that start with "file".
     // You can use the wildcard characters "*" and "?" to match multiple files.
     val putResults = session.file.put("file:///tmp/file*.csv", "@myStage/prefix2")
     ```
4. Check the `Array` of [PutResult](../reference/scala/com/snowflake/snowpark/PutResult.md) objects returned by the `put` method to determine if the files were successfully
   uploaded. For example, to print the filename and the status of the PUT operation for that file:

   ```scala
   // Print the filename and the status of the PUT operation.
   putResults.foreach(r => println(s"  ${r.sourceFileName}: ${r.status}"))
   ```

#### Downloading Files from a Stage

To download files from a stage:

1. Verify that you have the [privileges to download files from the stage](../../../user-guide/security-access-control-privileges.md).
2. Use [Session.file](../reference/scala/com/snowflake/snowpark/Session.md) to access the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object for the session.
3. Call the [FileOperation.get](../reference/scala/com/snowflake/snowpark/FileOperation.md) method to download the files from a stage.

   This method executes a SQL [GET](../../../sql-reference/sql/get.md) command.

   To specify any [optional parameters](../../../sql-reference/sql/get.md) for the GET command, create a `Map` of the
   parameters and values, and pass in the `Map` as the `options` argument. For example:

   ```scala
   // Download files with names that match a regular expression pattern.
   val getOptions = Map("PATTERN" -> s"'.*file_.*.csv.gz'")
   val getResults = session.file.get("@myStage", "file:///tmp", getOptions)
   ```
4. Check the `Array` of [GetResult](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/GetResult.html) objects returned by the `get` method to determine if the files were successfully
   downloaded. For example, to print the filename and the status of the GET operation for that file:

   ```scala
   // Print the filename and the status of the GET operation.
   getResults.foreach(r => println(s"  ${r.fileName}: ${r.status}"))
   ```

### Using Input Streams to Upload and Download Data in a Stage

> **Note:**
>
> This feature was introduced in Snowpark 1.4.0.

To use input streams to upload data to a file on a stage and download data from a file on a stage, use the `uploadStream`
and `downloadStream` methods of the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object:

* Using an Input Stream to Upload Data to a File on a Stage
* Using an Input Stream to Download Data from a File on a Stage

#### Using an Input Stream to Upload Data to a File on a Stage

To upload the data from a [java.io.InputStream](https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/io/InputStream.html) object to a file on a stage:

1. Verify that you have the [privileges to upload files to the stage](../../../user-guide/security-access-control-privileges.md).
2. Use [Session.file](../reference/scala/com/snowflake/snowpark/Session.md) to access the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object for the session.
3. Call the [FileOperation.uploadStream](../reference/scala/com/snowflake/snowpark/FileOperation.md) method.

   Pass in the complete path to the file on the stage where the data should be written and the `InputStream` object. In
   addition, use the `compress` argument to specify whether or not the data should be compressed before it is uploaded.

For example:

```scala
import java.io.InputStream
...
val compressData = true
val pathToFileOnStage = "@myStage/path/file"
session.file.uploadStream(pathToFileOnStage, new ByteArrayInputStream(fileContent.getBytes()), compressData)
```

#### Using an Input Stream to Download Data from a File on a Stage

To download data from a file on a stage to a [java.io.InputStream](https://docs.oracle.com/en/java/javase/11/docs/api/java.base/java/io/InputStream.html) object:

1. Verify that you have the [privileges to download files from the stage](../../../user-guide/security-access-control-privileges.md).
2. Use [Session.file](../reference/scala/com/snowflake/snowpark/Session.md) to access the [FileOperation](../reference/scala/com/snowflake/snowpark/FileOperation.md) object for the session.
3. Call the [FileOperation.downloadStream](../reference/scala/com/snowflake/snowpark/FileOperation.md) method.

   Pass in the complete path to the file on the stage containing the data to download. Use the `decompress` argument to
   specify whether or not the data in the file is compressed.

For example:

```scala
import java.io.InputStream
...
val isDataCompressed = true
val pathToFileOnStage = "@myStage/path/file"
val is = session.file.downloadStream(pathToFileOnStage, isDataCompressed)
```

### Setting Up a DataFrame for Files in a Stage

This section explains how to set up a DataFrame for files in a Snowflake stage. Once you create this DataFrame, you can use the
DataFrame to:

* retrieve data from the files
* copy data from the files into a table

To set up a DataFrame for files in a Snowflake stage, use the `DataFrameReader` class:

1. Verify that you have the following privileges:

   * [Privileges to access files in the stage](../../../user-guide/security-access-control-privileges.md).
   * One of the following:

     + CREATE TABLE privileges on the schema, if you plan to specify
       copy options that determine how data is copied from the staged files.
     + CREATE FILE FORMAT privileges on the schema, otherwise.
2. Call the `read` method in the `Session` class to access a `DataFrameReader` object.
3. If the files are in CSV format, describe the fields in the file. To do this:

   1. Create a [StructType](../reference/scala/com/snowflake/snowpark/types/StructType$.md) object that consists of a sequence of [StructField](../reference/scala/com/snowflake/snowpark/types/StructField$.md) objects that describe the fields in the file.
   2. For each `StructField` object, specify the following:

      * The name of the field.
      * The data type of the field (specified as an object in the `com.snowflake.snowpark.types` package).
      * Whether or not the field is nullable.

      For example:

      ```scala
      import com.snowflake.snowpark.types._

      val schemaForDataFile = StructType(
          Seq(
              StructField("id", StringType, true),
              StructField("name", StringType, true)))
      ```
   3. Call the `schema` method in the `DataFrameReader` object, passing in the `StructType` object.

      For example:

      ```scala
      var dfReader = session.read.schema(schemaForDataFile)
      ```

      The `schema` method returns a `DataFrameReader` object that is configured to read files containing the specified
      fields.

      Note that you do not need to do this for files in other formats (such as JSON). For those files, the
      `DataFrameReader` treats the data as a single field of the VARIANT type with the field name `$1`.
4. If you need to specify additional information about how the data should be read (for example, that the data is compressed or
   that a CSV file uses a semicolon instead of a comma to delimit fields), call the [DataFrameReader.option](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) method or the
   [DataFrameReader.options](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) method.

   Pass in the name and value of the option that you want to set. You can set the following types of options:

   * The [file format options](../../../sql-reference/sql/create-file-format.md) described in the
     [documentation on CREATE FILE FORMAT](../../../sql-reference/sql/create-file-format.md).
   * The [copy options](../../../sql-reference/sql/copy-into-table.md) described in the
     [COPY INTO TABLE documentation](../../../sql-reference/sql/copy-into-table.md).

     Note that setting copy options can result in a more expensive execution strategy when you
     retrieve the data into the DataFrame.

   The following example sets up the `DataFrameReader` object to query data in a CSV file that is not compressed and that
   uses a semicolon for the field delimiter.

   ```scala
   dfReader = dfReader.option("field_delimiter", ";").option("COMPRESSION", "NONE")
   ```

   The `option` method returns a `DataFrameReader` object that is configured with the specified option.

   To set multiple options, you can either
   chain calls to the `option` method (as shown in the example
   above) or call the [DataFrameReader.options](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) method, passing in a `Map` of the names and values of the options.
5. Call the method corresponding to the format of the files. You can call one of the following methods:

   * [DataFrameReader.avro](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)
   * [DataFrameReader.csv](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)
   * [DataFrameReader.json](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)
   * [DataFrameReader.orc](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)
   * [DataFrameReader.parquet](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)
   * [DataFrameReader.xml](../reference/scala/com/snowflake/snowpark/DataFrameReader.md)

   When calling these methods, pass in the stage location of the files to be read. For example:

   ```scala
   val df = dfReader.csv("@s3_ts_stage/emails/data_0_0_0.csv")
   ```

   To specify multiple files that start with the same prefix, specify the prefix after the stage name. For example, to load files
   that have the prefix `csv_` from the stage `@mystage`:

   ```scala
   val df = dfReader.csv("@mystage/csv_")
   ```

   The methods corresponding to the format of a file return a [CopyableDataFrame](../reference/scala/com/snowflake/snowpark/CopyableDataFrame.md) object for that file. `CopyableDataFrame`
   extends `DataFrame` and provides additional methods for working the data in staged files.
6. Call an action method to:

   * retrieve data from the files, or
   * copy data from the files into a table

   As is the case with DataFrames for tables, the data is not retrieved into the DataFrame until you call
   an action method.

### Loading Data from Files into a DataFrame

After you set up a DataFrame for files in a stage, you can load data from the files
into the DataFrame:

1. Use the DataFrame object methods to perform any transformations needed on the
   dataset (for example, selecting specific fields, filtering rows, etc.).

   For example, to extract the `color` element from a JSON file named `data.json` in the stage named `mystage`:

   ```scala
   val df = session.read.json("@mystage/data.json").select(col("$1")("color"))
   ```

   As explained earlier, for files in formats other than CSV (e.g. JSON), the `DataFrameReader` treats the data in the file
   as a single VARIANT column with the name `$1`.
2. Call the `DataFrame.collect` method to load the data. For example:

   ```scala
   val results = df.collect()
   ```

### Copying Data from Files into a Table

After you set up a DataFrame for files in a stage, you can call the
[CopyableDataFrame.copyInto](../reference/scala/com/snowflake/snowpark/CopyableDataFrame.md) method to copy the data into a table. This method executes the
[COPY INTO <table>](../../../sql-reference/sql/copy-into-table.md) command.

> **Note:**
>
> You do not need to call the `collect` method before calling `copyInto`. The data from the files does not need to
> be in the DataFrame before you call `copyInto`.

For example, the following code loads data from the CSV file specified by `myFileStage` into the table `mytable`. Because the
data is in a CSV file, the code must also describe the fields in the file. The
example does this by calling the [DataFrameReader.schema](../reference/scala/com/snowflake/snowpark/DataFrameReader.md) method and passing in a [StructType](../reference/scala/com/snowflake/snowpark/types/StructType$.md) object (`csvFileSchema`)
containing a sequence of [StructField](../reference/scala/com/snowflake/snowpark/types/StructField$.md) objects that describe the fields.

```scala
val df = session.read.schema(csvFileSchema).csv(myFileStage)
df.copyInto("mytable")
```

### Saving a DataFrame to Files on a Stage

> **Note:**
>
> This feature was introduced in Snowpark 1.5.0.

If you need to save a DataFrame to files on a stage, you can call the [DataFrameWriter](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method corresponding to the format of
the file (e.g. the `csv` method to write to a CSV file), passing in the stage location where the files should be saved.
These `DataFrameWriter` methods execute the [COPY INTO <location>](../../../sql-reference/sql/copy-into-location.md) command.

> **Note:**
>
> You do not need to call the `collect` method before calling these `DataFrameWriter` methods. The data from the file
> does not need to be in the DataFrame before you call these methods.

To save the contents of a DataFrame to files on a stage:

1. Call the [DataFrame.write](../reference/scala/com/snowflake/snowpark/DataFrame.md) method to get a [DataFrameWriter](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) object. For example, to get the `DataFrameWriter` object
   for a DataFrame that represents the table named `sample_product_data`:

   ```scala
   dfWriter = session.table("sample_product_data").write
   ```
2. If you want to overwrite the contents of the file (if the file exists), call the [DataFrameWriter.mode](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method, passing in
   `SaveMode.Overwrite`.

   Otherwise, by default, the `DataFrameWriter` reports an error if the specified file on the stage already exists.

   The `mode` method returns the same `DataFrameWriter` object configured with the specified mode.

   For example, to specify that the `DataFrameWriter` should overwrite the file on the stage:

   ```scala
   dfWriter = dfWriter.mode(SaveMode.Overwrite)
   ```
3. If you need to specify additional information about how the data should be saved (for example, that the data should be
   compressed or that you want to use a semicolon to delimit fields in a CSV file), call the [DataFrameWriter.option](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method
   or the [DataFrameWriter.options](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method.

   Pass in the name and value of the option that you want to set. You can set the following types of options:

   * The [file format options](../../../sql-reference/sql/copy-into-location.md) described in the
     [documentation on COPY INTO <location>](../../../sql-reference/sql/copy-into-location.md).
   * The [copy options](../../../sql-reference/sql/copy-into-location.md) described in the
     documentation on COPY INTO <location>.
   * [PARTITION BY or HEADER](../../../sql-reference/sql/copy-into-location.md).

   Note that you cannot use the `option` method to set the following options:

   * The TYPE format type option.
   * The OVERWRITE copy option. To set this option, call the `mode` method instead (as mentioned in the previous step).

   The following example sets up the `DataFrameWriter` object to save data to a CSV file in uncompressed form, using a
   semicolon (rather than a comma) as the field delimiter.

   ```scala
   dfWriter = dfWriter.option("field_delimiter", ";").option("COMPRESSION", "NONE")
   ```

   The `option` method returns a `DataFrameWriter` object that is configured with the specified option.

   To set multiple options, you can
   chain calls to the `option` method (as shown in the example
   above) or call the [DataFrameWriter.options](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md) method, passing in a `Map` of the names and values of the options.
4. To return details about each file that was saved, set the `DETAILED_OUTPUT`
   [copy option](../../../sql-reference/sql/copy-into-location.md) to `TRUE`.

   By default, `DETAILED_OUTPUT` is `FALSE`, which means that the method returns a single row of output containing the
   fields `"rows_unloaded"`, `"input_bytes"`, and `"output_bytes"`.

   When you set `DETAILED_OUTPUT` to `TRUE`, the method returns a row of output for each file saved. Each row contains
   the fields `FILE_NAME`, `FILE_SIZE`, and `ROW_COUNT`.
5. Call the method corresponding to the format of the file to save the data to the file. You can call one of the following
   methods:

   * [DataFrameWriter.csv](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md)
   * [DataFrameWriter.json](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md)
   * [DataFrameWriter.parquet](../reference/scala/com/snowflake/snowpark/DataFrameWriter.md)

   When calling these methods, pass in the stage location of the file where the data should be written (e.g. `@mystage`).

   By default, the method saves the data to filenames with the prefix `data_` (e.g. `@mystage/data_0_0_0.csv`). If you want
   the files to be named with a different prefix, specify the prefix after the stage name. For example:

   ```scala
   val writeFileResult = dfWriter.csv("@mystage/saved_data")
   ```

   This example saves the contents of the DataFrame to files that begin with the prefix `saved_data` (e.g.
   `@mystage/saved_data_0_0_0.csv`).
6. Check the [WriteFileResult](../reference/scala/com/snowflake/snowpark/WriteFileResult.md) object returned for information about the amount of data written to the file.

   From the `WriteFileResult` object, you can access the output produced by the COPY INTO <location> command:

   * To access the rows of output as an array of [Row](../reference/scala/com/snowflake/snowpark/Row.md) objects, use the `rows` value member.
   * To determine which fields are present in the rows, use the `schema` value member, which is a [StructType](../reference/scala/com/snowflake/snowpark/types/StructType$.md) that
     describes the fields in the row.

   For example, to print out the names of the fields and values in the output rows:

   ```scala
   val writeFileResult = dfWriter.csv("@mystage/saved_data")
   for ((row, index) <- writeFileResult.rows.zipWithIndex) {
     (writeFileResult.schema.fields, writeFileResult.rows(index).toSeq).zipped.foreach {
       (structField, element) => println(s"${structField.name}: $element")
     }
   }
   ```

The following example uses a DataFrame to save the contents of the table named `car_sales` to JSON files with the prefix
`saved_data` on the stage `@mystage` (e.g. `@mystage/saved_data_0_0_0.json`). The sample code:

* Overwrites the file, if the file already exists on the stage.
* Returns detailed output about the save operation.
* Saves the data uncompressed.

Finally, the sample code prints out each field and value in the output rows returned:

```scala
val df = session.table("car_sales")
val writeFileResult = df.write.mode(SaveMode.Overwrite).option("DETAILED_OUTPUT", "TRUE").option("compression", "none").json("@mystage/saved_data")
for ((row, index) <- writeFileResult.rows.zipWithIndex) {
  println(s"Row: $index")
  (writeFileResult.schema.fields, writeFileResult.rows(index).toSeq).zipped.foreach {
    (structField, element) => println(s"${structField.name}: $element")
  }
}
```

## Working with Semi-Structured Data

Using a DataFrame, you can query and access [semi-structured data](../../../user-guide/semistructured-intro.md) (e.g JSON data). The
next sections explain how to work with semi-structured data in a DataFrame.

* Traversing Semi-Structured Data
* Explicitly Casting Values in Semi-Structured Data
* Flattening an Array of Objects into Rows

> **Note:**
>
> The examples in these sections use the sample data in [Sample Data Used in Examples](../../../user-guide/querying-semistructured.md).

### Traversing Semi-Structured Data

To refer to a specific field or element in semi-structured data, use the following methods of the [Column](../reference/scala/com/snowflake/snowpark/Column.md) object:

* Use [Column.apply(“<field_name>”)](../reference/scala/com/snowflake/snowpark/Column.md) to return a `Column` object for a field in an OBJECT (or a VARIANT that contains an
  OBJECT).
* Use [Column.apply(<index>)](../reference/scala/com/snowflake/snowpark/Column.md) to return a `Column` object for an element in an ARRAY (or a VARIANT that contains an ARRAY).

> **Note:**
>
> If the field name or elements in the path are irregular and make it difficult to use the `Column.apply` methods, you can
> use the [get](../reference/scala/com/snowflake/snowpark/functions$.md), [get_ignore_case](../reference/scala/com/snowflake/snowpark/functions$.md), or [get_path](../reference/scala/com/snowflake/snowpark/functions$.md) functions as an alternative.

As mentioned in Using the apply Method to Refer to a Column, you can omit the method name `apply`:

```scala
col("column_name")("field_name")
col("column_name")(index)
```

For example, the following code selects the `dealership` field in objects in the `src` column of the
[sample data](../../../user-guide/querying-semistructured.md):

```scala
val df = session.table("car_sales")
df.select(col("src")("dealership")).show()
```

The code prints the following output:

```none
----------------------------
|"""SRC""['DEALERSHIP']"   |
----------------------------
|"Valley View Auto Sales"  |
|"Tindel Toyota"           |
----------------------------
```

> **Note:**
>
> The values in the DataFrame are surrounded by double quotes because these values are returned as string literals. To cast these
> values to a specific type, see Explicitly Casting Values in Semi-Structured Data.

You can also chain method calls to traverse a path to a specific field or
element.

For example, the following code selects the `name` field in the `salesperson` object:

```scala
val df = session.table("car_sales")
df.select(col("src")("salesperson")("name")).show()
```

The code prints the following output:

```none
------------------------------------
|"""SRC""['SALESPERSON']['NAME']"  |
------------------------------------
|"Frank Beasley"                   |
|"Greg Northrup"                   |
------------------------------------
```

As another example, the following code selects the first element of `vehicle` field, which holds an array of vehicles. The
example also selects the `price` field from the first element.

```scala
val df = session.table("car_sales")
df.select(col("src")("vehicle")(0)).show()
df.select(col("src")("vehicle")(0)("price")).show()
```

The code prints the following output:

```none
---------------------------
|"""SRC""['VEHICLE'][0]"  |
---------------------------
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "paint protection"   |
|  ],                     |
|  "make": "Honda",       |
|  "model": "Civic",      |
|  "price": "20275",      |
|  "year": "2017"         |
|}                        |
|{                        |
|  "extras": [            |
|    "ext warranty",      |
|    "rust proofing",     |
|    "fabric protection"  |
|  ],                     |
|  "make": "Toyota",      |
|  "model": "Camry",      |
|  "price": "23500",      |
|  "year": "2017"         |
|}                        |
---------------------------

------------------------------------
|"""SRC""['VEHICLE'][0]['PRICE']"  |
------------------------------------
|"20275"                           |
|"23500"                           |
------------------------------------
```

As an alternative to the `apply` method, you can use the [get](../reference/scala/com/snowflake/snowpark/functions$.md), [get_ignore_case](../reference/scala/com/snowflake/snowpark/functions$.md), or [get_path](../reference/scala/com/snowflake/snowpark/functions$.md) functions if the field
name or elements in the path are irregular and make it difficult to use the `Column.apply` methods.

For example, the following lines of code both print the value of a specified field in an object:

```scala
df.select(get(col("src"), lit("dealership"))).show()
df.select(col("src")("dealership")).show()
```

Similarly, the following lines of code both print the value of a field at a specified path in an object:

```scala
df.select(get_path(col("src"), lit("vehicle[0].make"))).show()
df.select(col("src")("vehicle")(0)("make")).show()
```

### Explicitly Casting Values in Semi-Structured Data

By default, the values of fields and elements are returned as string literals (including the double quotes), as shown in the
examples above.

To avoid unexpected results, call the cast method to cast the value to a specific
type. For example, the following code prints out the values without and with casting:

```scala
// Import the objects for the data types, including StringType.
import com.snowflake.snowpark.types._
...
val df = session.table("car_sales")
df.select(col("src")("salesperson")("id")).show()
df.select(col("src")("salesperson")("id").cast(StringType)).show()
```

The code prints the following output:

```none
----------------------------------
|"""SRC""['SALESPERSON']['ID']"  |
----------------------------------
|"55"                            |
|"274"                           |
----------------------------------

---------------------------------------------------
|"CAST (""SRC""['SALESPERSON']['ID'] AS STRING)"  |
---------------------------------------------------
|55                                               |
|274                                              |
---------------------------------------------------
```

### Flattening an Array of Objects into Rows

If you need to “flatten” semi-structured data into a DataFrame (e.g. producing a row for every object in an array), call the
[DataFrame.flatten](../reference/scala/com/snowflake/snowpark/DataFrame.md) method. This method is equivalent to the [FLATTEN](../../../sql-reference/functions/flatten.md) SQL function. If you pass in
a path to an object or array, the method returns a DataFrame that contains a row for each field or element in the object or array.

For example, in the [sample data](../../../user-guide/querying-semistructured.md), `src:customer` is an array of objects that
contain information about a customer. Each object contains a `name` and `address` field.

If you pass this path to the `flatten` function:

```scala
val df = session.table("car_sales")
df.flatten(col("src")("customer")).show()
```

the method returns a DataFrame:

```none
----------------------------------------------------------------------------------------------------------------------------------------------------------
|"SRC"                                      |"SEQ"  |"KEY"  |"PATH"  |"INDEX"  |"VALUE"                            |"THIS"                               |
----------------------------------------------------------------------------------------------------------------------------------------------------------
|{                                          |1      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "San Francisco, CA",  |  {                                  |
|    {                                      |       |       |        |         |  "name": "Joyce Ridgely",         |    "address": "San Francisco, CA",  |
|      "address": "San Francisco, CA",      |       |       |        |         |  "phone": "16504378889"           |    "name": "Joyce Ridgely",         |
|      "name": "Joyce Ridgely",             |       |       |        |         |}                                  |    "phone": "16504378889"           |
|      "phone": "16504378889"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Valley View Auto Sales",  |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "55",                            |       |       |        |         |                                   |                                     |
|    "name": "Frank Beasley"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "paint protection"                 |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Honda",                     |       |       |        |         |                                   |                                     |
|      "model": "Civic",                    |       |       |        |         |                                   |                                     |
|      "price": "20275",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
|{                                          |2      |NULL   |[0]     |0        |{                                  |[                                    |
|  "customer": [                            |       |       |        |         |  "address": "New York, NY",       |  {                                  |
|    {                                      |       |       |        |         |  "name": "Bradley Greenbloom",    |    "address": "New York, NY",       |
|      "address": "New York, NY",           |       |       |        |         |  "phone": "12127593751"           |    "name": "Bradley Greenbloom",    |
|      "name": "Bradley Greenbloom",        |       |       |        |         |}                                  |    "phone": "12127593751"           |
|      "phone": "12127593751"               |       |       |        |         |                                   |  }                                  |
|    }                                      |       |       |        |         |                                   |]                                    |
|  ],                                       |       |       |        |         |                                   |                                     |
|  "date": "2017-04-28",                    |       |       |        |         |                                   |                                     |
|  "dealership": "Tindel Toyota",           |       |       |        |         |                                   |                                     |
|  "salesperson": {                         |       |       |        |         |                                   |                                     |
|    "id": "274",                           |       |       |        |         |                                   |                                     |
|    "name": "Greg Northrup"                |       |       |        |         |                                   |                                     |
|  },                                       |       |       |        |         |                                   |                                     |
|  "vehicle": [                             |       |       |        |         |                                   |                                     |
|    {                                      |       |       |        |         |                                   |                                     |
|      "extras": [                          |       |       |        |         |                                   |                                     |
|        "ext warranty",                    |       |       |        |         |                                   |                                     |
|        "rust proofing",                   |       |       |        |         |                                   |                                     |
|        "fabric protection"                |       |       |        |         |                                   |                                     |
|      ],                                   |       |       |        |         |                                   |                                     |
|      "make": "Toyota",                    |       |       |        |         |                                   |                                     |
|      "model": "Camry",                    |       |       |        |         |                                   |                                     |
|      "price": "23500",                    |       |       |        |         |                                   |                                     |
|      "year": "2017"                       |       |       |        |         |                                   |                                     |
|    }                                      |       |       |        |         |                                   |                                     |
|  ]                                        |       |       |        |         |                                   |                                     |
|}                                          |       |       |        |         |                                   |                                     |
----------------------------------------------------------------------------------------------------------------------------------------------------------
```

From this DataFrame, you can select the `name` and `address` fields from each object in the `VALUE` field:

```scala
df.flatten(col("src")("customer")).select(col("value")("name"), col("value")("address")).show()
```

```none
-------------------------------------------------
|"""VALUE""['NAME']"   |"""VALUE""['ADDRESS']"  |
-------------------------------------------------
|"Joyce Ridgely"       |"San Francisco, CA"     |
|"Bradley Greenbloom"  |"New York, NY"          |
-------------------------------------------------
```

The following code adds to the previous example by
casting the values to a specific type and changing the names of the columns:

```scala
df.flatten(col("src")("customer")).select(col("value")("name").cast(StringType).as("Customer Name"), col("value")("address").cast(StringType).as("Customer Address")).show()
```

```none
-------------------------------------------
|"Customer Name"     |"Customer Address"  |
-------------------------------------------
|Joyce Ridgely       |San Francisco, CA   |
|Bradley Greenbloom  |New York, NY        |
-------------------------------------------
```

## Executing SQL Statements

To execute a SQL statement that you specify, call the `sql` method in the `Session` class, and pass in the statement
to be executed. The method returns a DataFrame.

Note that the SQL statement won’t be executed until you call an action method.

```scala
// Get the list of the files in a stage.
// The collect() method causes this SQL statement to be executed.
val dfStageFiles = session.sql("ls @myStage")
val files = dfStageFiles.collect()
files.foreach(println)

// Resume the operation of a warehouse.
// Note that you must call the collect method in order to execute
// the SQL statement.
session.sql("alter warehouse if exists myWarehouse resume if suspended").collect()

val tableDf = session.table("table").select(col("a"), col("b"))
// Get the count of rows from the table.
val numRows = tableDf.count()
println("Count: " + numRows);
```

If you want to call methods to transform the DataFrame (e.g. filter, select, etc.),
note that these methods work only if the underlying SQL statement is a SELECT statement. The transformation methods are not
supported for other kinds of SQL statements.

```scala
val df = session.sql("select id, category_id, name from sample_product_data where id > 10")
// Because the underlying SQL statement for the DataFrame is a SELECT statement,
// you can call the filter method to transform this DataFrame.
val results = df.filter(col("category_id") < 10).select(col("id")).collect()
results.foreach(println)

// In this example, the underlying SQL statement is not a SELECT statement.
val dfStageFiles = session.sql("ls @myStage")
// Calling the filter method results in an error.
dfStageFiles.filter(...)
```

---
title: Writing Snowpark Code in Python Worksheets
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/python-worksheets.md
section: Snowpark
---

# Writing Snowpark Code in Python Worksheets

Write [Snowpark](../index.md) code in Python worksheets to process data using Snowpark Python in Snowsight.
By writing code in Python worksheets, you can perform your development and testing in Snowflake without needing to install dependent libraries.

To develop with Python worksheets, do the following:

1. Prepare roles and packages in Snowflake.
2. Set up your worksheet for development.
3. Write Snowpark code in your Python worksheet.
4. Run your Python worksheet.

For example, you might write code in a Python worksheet that extracts data from stages or database objects in Snowflake, transforms the
data, and stores the transformed data in Snowflake. You could then
[deploy that code as a stored procedure](../../stored-procedure/python/procedure-python-create-worksheet.md) and build a
data pipeline, all without leaving Snowflake.

## About Python Worksheets

Python worksheets let you use Snowpark Python in Snowsight to perform data manipulations and transformations. You can use
[third-party packages listed in the Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/) or import your own Python files
from stages to use in scripts.

After running a Python worksheet, review the results and output returned by your script. The results display as a string, variant, or a
table, depending on your code. See Running Python Worksheets.

> **Note:**
>
> Because Python worksheets run inside Snowflake rather than in your local development environment, you cannot use `session.add_import`
> to add a file that your Python code depends on, or `session.add_packages` or `session.add_requirements` to add packages that you need
> to use in your Python code. Instead, you add those files to a stage and reference them in your code.
> See [Staging files using Snowsight](../../../user-guide/data-load-local-file-system-stage-ui.md).

Python worksheets have the following limitations:

* Log levels lower than WARN do not appear in the Output for a Python worksheet by default. To log lower level messages to the output,
  use a logging library such as the `logging` module to set the level of messages logged.
* No support for breakpoints or running only portions of the Python code in a worksheet.
* No support for images or webpages. Images or webpages generated by Python code cannot be displayed in Python worksheets.
* Python worksheets use Python 3.11 by default, but you can choose another supported version in Packages.

If you require support for any of these options, consider using your local development environment instead.
See [Setting up your development environment for Snowpark Python](setup.md).

## Prerequisites for Python Worksheets

To use Python worksheets, you must do the following:

* (Optional) Add Python files and packages that are not [included with Anaconda](https://repo.anaconda.com/pkgs/snowflake/) that you want
  to use in a Python worksheet to a named stage. See Add a Python File from a Stage to a Worksheet.
* Choose a warehouse to use for Python worksheets. Snowflake recommends using an X-Small warehouse for development.
  If you’re running a very large Snowpark workload, use a [Snowpark-optimized warehouse](../../../user-guide/warehouses-snowpark-optimized.md).
  See [Warehouse size](../../../user-guide/warehouses-overview.md) for additional details about warehouse sizes.

### Review and accept the Anaconda Terms of Service

Before you start using the packages provided by Anaconda inside Snowflake, you must acknowledge
the [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept the
> [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/) once for your Snowflake account. If you do not have
> access to the ORGADMIN role, see [Enabling the ORGADMIN role in an account](../../../user-guide/organization-administrators.md).

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Anaconda section, select Enable.
4. In the Anaconda Packages dialog, click the link to review the [External Offerings Terms page](https://www.snowflake.com/legal/external-offering-terms/).
5. If you agree to the terms, select Acknowledge & Continue.

If you encounter an error when attempting to accept the [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/),
it may be due to missing information in your user profile, such as a first name, last name, or email address. If you have administrator
privileges, see [Add user details to your user profile](../../../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
administrator to [update your account](../../../user-guide/admin-user-management.md).

### Add a Python File from a Stage to a Worksheet

Snowflake includes the Snowpark packages from the [Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/)
in Python worksheets.

If you want to use Python files or packages other than those included in Anaconda in your Python worksheet, you must upload the files
to a named stage in Snowflake and then add them to the list of installed packages for your Python worksheet.

To use a Python package in your worksheet, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Python Worksheet.
3. Select a database and schema.
4. Select Packages » Stage Packages.
5. Enter the path to the package in the stage:

   * If the selected database and schema for the worksheet contain the stage where the package is located, you can reference the stage using
     an unqualified name. For example, `@YourStage/path/to/example_package.py`.
   * To reference a stage in a different database and schema, fully qualify the name of the stage. For example,
     `@Database.Schema.Stage/path/to/other_package.py`.
6. Select Import to add your package to the list of installed packages.
7. In your code, use `import` statements to use the package in your Python worksheet.
   For example, after importing packages from the `example_package.py` and `other_package.py` files,
   write the following code to import a function called `function` from the `example_package`, and import the
   package `other_package` for use in your code:

   ```python
   from example_package import function
   import other_package
   ```

> **Note:**
>
> Packages that you add to a worksheet are available only to that worksheet. If you want to use the same package in a different Python
> worksheet, use this procedure to add the package to that worksheet.

For more details, see [Making dependencies available to your code](../../upload-dependencies.md).

## Start Developing with Python Worksheets

To open a worksheet and configure your development environment, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Python Worksheet.
3. Select a database and schema.
4. Select a warehouse to use to run the worksheet. If you have a default warehouse for your user, it is pre-selected.

   Python worksheets require a running warehouse to load Python packages and run Python code.
5. (Optional) Select Packages to install Python libraries.

   * The `snowflake-snowpark-python` package is required and always installed for Python worksheets.
   * Search for packages [listed in the Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/),
     such as numpy, pandas, requests, and urllib3. Select a package to install it for use in your worksheet, and optionally change
     the default package version in the list of Installed Packages.
   * Add your own packages and Python files by selecting Stage Packages and specifying the file path of the stage and package,
     then selecting Import. See Add a Python File from a Stage to a Worksheet.

   Packages installed by you appear under Installed Packages.
6. If you installed Python libraries for your worksheet, add `import` statements to your code to use the installed libraries.

   For example, if you install the package scikit-learn for your Python worksheet, add an `import` statement for that package
   at the beginning of your code

   ```python
   import scikit-learn
   ```
7. Run the sample Python code to validate your configuration.

Error messages or the return value from your code appears in the Results section. To view log messages, select Output.
See Running Python Worksheets.

## Writing Snowpark Code in Python Worksheets

After you follow the steps to start developing with Python worksheets, you can replace the
sample code with your own.

Write your Snowpark Python code inside the handler function:

```python
import snowflake.snowpark as snowpark

def main(session: snowpark.Session):
    # your code goes here
```

The default handler function is `main`, but you can change it in the Settings for the worksheet.
The active handler is highlighted in the worksheet.

Use the `session` object provided in the boilerplate code to access data in Snowflake with the Snowpark API libraries.
For example, you can create a [DataFrame](working-with-dataframes.md) for a table or execute a SQL
statement. See the [Snowpark Developer Guide for Python](index.md).

As you type, you see autocomplete for Python methods, defined variables, and more. You do not see autocomplete for
some third-party packages or files imported from a stage. Python worksheets also include syntax highlighting and guidance for
method parameters. You can configure linting and line wrapping in the Settings for the worksheet.

### Return Results of a Different Data Type

When you write your Python code, consider which type of data is returned by the `return` statement in your code and adjust how the
worksheet returns results. By default, a Python worksheet has a return type of Table() because the placeholder code returns a DataFrame.

Depending on what your Python code returns, you might want to change the worksheet settings to display the output differently:

* If your handler function returns a `DataFrame`, use the default return type of Table().
* If your handler function returns a list of `Row` objects, such as with the `collect` method, change the return type
  to Variant.
* If your handler function returns a string, such as `return "Hello Python"`, or a value that you want to cast as a string,
  change the return type to String.
* If your handler function returns an integer, such as with the `count` method, use a return type of Variant or String.

For details about the return type of some DataFrame methods, see [Performing an Action to Evaluate a DataFrame](working-with-dataframes.md).

To update the worksheet settings to return results of a different type, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open the Python worksheet for which you want to display the results as a table.
4. Select a warehouse to use to run the worksheet. If you have a default warehouse for your user, it is pre-selected. Make sure your warehouse is running.
5. Select Settings and for the Return type, select the type returned by the handler function.
6. Run your Python worksheet.
7. Review the results in the Results panel.

### Passing Additional Arguments to the Handler Function

With a Python worksheet, you can test a Python function that takes a single argument (a Snowpark `Session` object) by designating that
function as the handler for the worksheet. Every function defined in a Python worksheet needs to pass in the `session: snowpark.Session`
argument.

To test a function that passes in additional arguments, do the following:

1. Add the arguments to your function.
2. Define a separate, single-argument function that passes in a Snowpark `Session`. In this function, call the multi-argument function,
   passing in values for the additional arguments, then return the value of the function.

   For example, to write Snowpark Python code that filters a table of packages by the package language column, you can write the following
   code:

   ```python
   import snowflake.snowpark as snowpark
   from snowflake.snowpark.functions import col

   # Add parameters with optional type hints to the main handler function
   def main(session: snowpark.Session, language: str):
     # Your code goes here, inside the "main" handler.
     table_name = 'information_schema.packages'
     dataFrame = session.table(table_name).filter(col("language") == language)

     # Print a sample of the dataFrame to standard output
     dataFrame.show()

     # The return value appears in the Results tab
     return dataFrame

   # Add a second function to supply a value for the language parameter to validate that your main handler function runs.
   def test_language(session: snowpark.Session):
     return main(session, 'java')
   ```

   In this example, the `main` function is the multi-argument function and the `test_language` function is the single-argument
   function used to validate that your code runs with the passed argument values.
3. Set the single-argument function as the handler function to run the worksheet and validate that your code runs with the argument values.

   In this example, change the handler to the `test_language` function and then select Run. You can change the handler in the
   worksheet Settings, or select the Show actions lightbulb next to the handler function and select
   Set function “test_language” as handler.

When you [deploy your Python worksheet as a stored procedure](../../stored-procedure/python/procedure-python-create-worksheet.md),
you can choose the main handler function and review the arguments and the mapped types for your stored procedure.

## Running Python Worksheets

After you write your Python worksheet, select Run to run your Python worksheet.
Running your worksheet executes all of the code in your Python worksheet. Partial or incremental execution of code is not supported.

> **Note:**
>
> If you use a package [listed in the Snowflake Anaconda channel](https://repo.anaconda.com/pkgs/snowflake/)
> and have not yet accepted the Anaconda terms, you might see an error about missing packages. See [Using third-party packages from Anaconda](../../udf/python/udf-python-packages.md).

### Review Output Generated by Your Code

You can review standard output (stdout) or standard error (stderr) messages for your Python code in the Output panel for a
Python worksheet.

You can see the output from the following types of functions in the Output panel:

* Functions that write to the console, such as `print()`.
* Functions that print a DataFrame, such as the `show` method of the DataFrame class in Snowpark Python.

> **Note:**
>
> Output appears after all Python processes finish running, rather than appearing in a stream as the code runs.
>
> Log output is written to a temporary stage and is only captured if the following are true:
>
> * You select a database and schema for the worksheet.
> * The selected database was not created from a share.
> * You run the worksheet using a role that has USAGE privileges on the selected database and schema.

### Review the Query History for a Python Worksheet

When a Python worksheet runs in Snowsight, an anonymous stored procedure runs the code and generates queries
that execute the Snowpark commands in the code.

You can use the Query History page in Snowsight to review the queries that ran.
See [Review Query History in Snowsight](../../../user-guide/ui-snowsight-activity.md).

For example, after running a worksheet, you can review the queries that ran by doing the following:

1. Review the Results of the worksheet.
2. In the Query Details for the worksheet, select  » Copy Query ID
3. To return to the list of worksheets, in the navigation menu, select Projects » Worksheets.
4. In the navigation menu, select Monitoring » Query History.
5. On the Query History page, display only the queries from your Python worksheet:

   1. Select Filters, and enable the Query ID option.
   2. Enter the Query ID of your Python worksheet.
   3. Select Apply Filters.
6. Review the queries run for the worksheet.

## Example Code for Python Worksheets

When you write Python worksheets, you can perform data transformation and manipulation tasks,
including reading data from a named stage.

* Example: Write a Simple Snowpark Program
* Example: Transform Data in a Python Worksheet
* Example: Read Files from a Stage with Python Worksheets

You can review additional examples in [Working with DataFrames in Snowpark Python](working-with-dataframes.md).

### Example: Write a Simple Snowpark Program

In this example, write a Snowpark Python program that generates a small range of numbers and writes the range to a
table that your code creates, or overwrites if it already exists, in Snowflake. To run this code example, you must have the
CREATE TABLE privilege on the database schema to which you want to add the table.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Python Worksheet.
3. Select a database and schema that you want to add the table to.
4. Select a warehouse to use to run the worksheet. If you have a default warehouse for your user, it is pre-selected. Make sure your warehouse is running.
5. Write the Snowpark Python code as part of the `main` function:

   ```python
   import snowflake.snowpark as snowpark

   def main(session: snowpark.Session):
     tableName = "range_table"
     df_range = session.range(1, 10, 2).to_df('a')
     df_range.write.mode("overwrite").save_as_table(tableName)
     return tableName + " table successfully created"
   ```
6. Select Settings and for the Return type, select String for the type returned by the handler function.
7. Run the code.

### Example: Transform Data in a Python Worksheet

In this example, write Python code that aggregates the entries in the TASK_HISTORY view in the ACCOUNT_USAGE schema of the SNOWFLAKE
database by scheduled time and state and saves the aggregated output to a table, `aggregate_task_history`.

> **Note:**
>
> Because this example queries account usage data, you must use a role with:
>
> * Access to query the views in the ACCOUNT_USAGE schema. See [Enabling other roles to use schemas in the SNOWFLAKE database](../../../sql-reference/account-usage.md).
> * The CREATE TABLE privilege on the database schema to which you want to add the table.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Python Worksheet.
3. Select a database and schema that you want to add the table to.
4. Select a warehouse to use to run the worksheet. If you have a default warehouse for your user, it is pre-selected. Make sure your warehouse is running.
5. Write the Snowpark Python code as part of the `main` function:

   ```python
   import snowflake.snowpark as snowpark
   from snowflake.snowpark.functions import col
   from snowflake.snowpark.dataframe_reader import *
   from snowflake.snowpark.functions import *

   def main(session: snowpark.Session):

     inputTableName = "snowflake.account_usage.task_history"
     outputTableName = "aggregate_task_history"

     df = session.table(inputTableName)
     df.filter(col("STATE") != "SKIPPED")\
       .group_by(("SCHEDULED_TIME"), "STATE").count()\
       .write.mode("overwrite").save_as_table(outputTableName)
     return outputTableName + " table successfully written"
   ```
6. Select Settings and for the Return type, select String for the type returned by the handler function.
7. Run the code.

After you run your code in a Python worksheet, you can open a SQL worksheet and query the table. See [Querying data using worksheets](../../../user-guide/ui-snowsight-query.md).

### Example: Read Files from a Stage with Python Worksheets

Snowpark Python lets you read files from a stage and write the contents to a table or save them as a view in Snowflake.
In this example, the Python code reads the contents of a compressed CSV-formatted file containing employee data,
`data_0_0_0.csv.gz` from the `db1.public.files` named stage and writes the contents to a table called `employees`.

> **Note:**
>
> To run this code example, you must use a role that has:
>
> * The USAGE privilege on the stage, database, and schema used in the code.
> * The CREATE TABLE privilege on the database schema to which you want to add the table.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » Python Worksheet.
3. Select a database and schema that you want to add the table to.
4. Select a warehouse to use to run the worksheet. If you have a default warehouse for your user, it is pre-selected.
   Make sure your warehouse is running.
5. Write the Snowpark Python code as part of the `main` function:

   ```python
   import snowflake.snowpark as snowpark
   from snowflake.snowpark.types import *

   schema_for_file = StructType([
     StructField("name", StringType()),
     StructField("role", StringType())
   ])

   fileLocation = "@DB1.PUBLIC.FILES/data_0_0_0.csv.gz"
   outputTableName = "employees"

   def main(session: snowpark.Session):
     df_reader = session.read.schema(schema_for_file)
     df = df_reader.csv(fileLocation)
     df.write.mode("overwrite").save_as_table(outputTableName)

     return outputTableName + " table successfully written from stage"
   ```
6. Select Settings and for the Return type, select String for the type returned by the handler function.
7. Run the code.

After you run your code in a Python worksheet, you can open a SQL worksheet and query the table. See [Querying data using worksheets](../../../user-guide/ui-snowsight-query.md).

For more details about working with files in a stage using Snowpark, see [Working with DataFrames in Snowpark Python](working-with-dataframes.md).

---
title: Writing Tests for Snowpark Python
source: https://docs.snowflake.com/en/developer-guide/snowpark/python/testing-python-snowpark.md
section: Snowpark
---

# Writing Tests for Snowpark Python

This topic explains how to test your Snowpark code while connected to Snowflake.
You can use standard testing utilities, like PyTest, to test your Snowpark Python UDFs, DataFrame transformations, and stored procedures.

Thorough testing can help to prevent unintended breaking changes. Unit tests verify that a section of code works as expected.
Integration tests help ensure that components work together correctly for an end-to-end use case.

The examples in this document use PyTest, one of the most popular testing frameworks for Python.
For additional guidance and best practices, see the [PyTest documentation](https://docs.pytest.org/en/7.4.x/).

Alternatively, you can use the Snowpark Python local testing framework to create and operate on Snowpark Python DataFrames locally without
connecting to a Snowflake account. For more information, see [Local testing framework](testing-locally.md).

## Setting up your Tests

Install PyTest in your project, by running `pip install pytest` or `conda install pytest`.
You can also add it to your `requirements.txt` or conda environment file.

Create a `test` directory next to your source code directory and add your unit and integration tests to it.
To see an example, refer to the [Snowpark Python project template](https://github.com/Snowflake-Labs/snowpark-python-template/).

## Creating a PyTest Fixture for the Snowpark Session

PyTest fixtures are functions that are executed before a test (or module of tests) to provide data or connections to tests.
In this scenario, create a PyTest fixture that returns a Snowpark `Session` object.

1. Create a `test` directory if you do not already have one.
2. Create a `conftest.py` under `test` with the following contents, where `connection_parameters` is a dictionary with your Snowflake
   account credentials. For more information about the dictionary format, see [Creating a Session](creating-session.md).
3. Create the `Session` fixture as a module-scoped fixture instead of as a file-scoped fixture to prevent multiple sessions from being created
   and causing issues due to conflicting session objects.

```python
from snowflake.snowpark.session import Session

@pytest.fixture(scope='module')
def session(request) -> Session:
    connection_parameters = {}
    return Session.builder.configs(...).create()
```

## Unit Tests for UDFs

You can test your Python UDF logic by testing the UDF handler as a generic Python method.

1. Create a file under your `test` directory for the UDF unit tests. For example, name the file `test_functions.py`.
2. Import the Python methods to test.
3. For each test scenario, create a Python method named `test_<scenario_to_test>`.

For example, here is a Python UDF handler:

```python
def fahrenheit_to_celsius(temp_f: float) -> float:
    """
    Converts fahrenheit to celsius
    """
    return (float(temp_f) - 32) * (5/9)
```

You can import this method into the test file (`test/test_functions.py`) and test it as a generic Python method.

```python
import fahrenheit_to_celsius

def test_fahrenheit_to_celsius():
    expected = 0.0
    actual = fahrenheit_to_celsius(32)
    assert expected == actual
```

## Unit Tests for DataFrame Transformations

Adding unit tests for your DataFrame transformations helps to protect against unexpected bugs and regressions.
To make your DataFrame logic easily testable, encapsulate the transformations into a Python method that takes as
input the DataFrames to be transformed and returns the transformed DataFrames.

In the example below, `mf_df_transformer` contains the transformation logic. It can be imported into other
modules in the Python project and tested easily.

```python
from snowflake.snowpark.dataframe import DataFrame, col

def my_df_tranformer(df: DataFrame) -> DataFrame:
    return df \
        .with_column('c', df['a']+df['b']) \
        .filter(col('c') > 3)
```

To test this transformation, follow these steps:

1. Create a file for the DataFrame tests, `test_transformers.py`, under the `test` directory (`test/test_transformers.py`).
2. Create a test method for the transformer to be tested: `test_my_df_transformer(session)`. The `session` parameter here refers to the session fixture created in the earlier section.
3. Using the session fixture, create the input and expected output DataFrames within the test method.
4. Pass the input DataFrame to the transformer and compare the expected DataFrame to the actual DataFrame returned by the transformer.

```python
# test/test_transformers.py

import my_df_transformer

def test_my_df_transformer(session):
    input_df = session.create_dataframe([[1,2],[3,4]], ['a', 'b'])
    expected_df = session.create_dataframe([3,4,7], ['a','b','c'])
    actual_df = my_df_transformer(input_df)
    assert input_df.collect() == actual_df.collect()
```

## Integration Tests for Stored Procedures

To test your stored procedure handlers, use the session fixture to call the stored procedure handler.
If your stored procedure reads from tables, such as in an ETL pipeline, you can create those tables prior to calling the stored procedure handler,
as shown in the example below. This pattern ensures that your input data is tracked in source control and does not unexpectedly change between test executions.

```python
from project import my_sproc_handler  # import stored proc handler

def test_my_sproc_handler(session: Session):

    # Create input table
    input_tbl = session.create_dataframe(
        data=[...],
        schema=[...],
    )

    input_tbl.write.mode('overwrite').save_as_table(['DB', 'SCHEMA', 'INPUT_TBL'], mode='overwrite')

    # Create expected output dataframe
    expected_df = session.create_dataframe(
        data=[...],
        schema=[...],
    ).collect()

    # Call the stored procedure
    my_sproc_handler()

    # Get actual table
    actual_tbl = session.table(['DB', 'SCHEMA', 'OUTPUT_TBL']).collect()

    # Clean up tables
    session.table(['DB', 'SCHEMA', 'OUTPUT_TBL']).delete()
    session.table(['DB', 'SCHEMA', 'INPUT_TBL']).delete()

    # Compare the actual and expected tables
    assert expected_df == actual_tbl
```

## Snowflake ML

Snowflake ML APIs, feature store, model registry, and ML pipelines.

---
title: Access control requirements for ML Jobs
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/ml-jobs/access-control-requirements.md
section: Snowflake ML
---

# Access control requirements for ML Jobs

To use Snowflake ML Jobs, users need specific access privileges assigned to their roles. This page outlines the required privileges for both setting up new environments and using existing ones.

## Setting Up a New Environment

Creating a new environment for ML Jobs requires the following privileges:

* CREATE COMPUTE POOL privilege on the account: Required to create new compute pools. Alternatively, you may use any existing compute pool.
* CREATE SCHEMA privilege on the database (optional): Needed if you want to create a new schema for organizing ML Jobs and resources. We recommend this approach to easily clean up old jobs and payload stages.

## Using an Existing Environment

For users who will be executing ML Jobs in an existing environment, the following privileges are required:

Basic Access

* USAGE privilege on the database where the ML Jobs run
* USAGE privilege on the schema where ML Jobs run
* CREATE SERVICE privilege on the schema to create and manage ML Jobs
* USAGE privilege on the compute pool to allow it to be used for ML workloads
* USAGE privilege on a stage to upload ML Job payloads for execution

> **Note:**
>
> If you don’t have an existing stage, the ML Job creates one on your behalf using the specified `stage_name`.
> This requires the CREATE STAGE privilege on the schema.

## Additional Requirements

* Data Access Privileges: Users need appropriate privileges for any data tables, warehouses, or other resources their ML workloads will access

---
title: Autocapture inference logs for realtime inference
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/auto-capture-inference-logs.md
section: Snowflake ML
---

# Autocapture inference logs for realtime inference

Use Auto Capture in Snowflake ML to automatically log every request and response processed by a model service. Auto Capture provides immediate visibility into the request successes, request failures, and the inputs behind unexpected predictions.

Instead of piping request or response data into a table or view, you can automatically persist inference request and response data. Instead of needing to correctly create pipelines for data ingestion and monitoring, you can use Auto Capture.

With Auto Capture, you can do the following:

* **Rapidly debug**: Analyze historical inference data to diagnose edge cases and understand model behavior.
* **Continuously improve your models**: Use real-world production data to create high-quality datasets to train new models.
* **Test**: Use the data collected from the logs for A/B testing and shadow testing.

For each inference request, Auto Capture logs the following:

* Request payload
* Response payload
* Model version identifier
* Service identifier
* Gateway routing metadata
* Request/response timestamps
* Response code (such as 200).

> **Note:**
>
> Snowflake doesn’t capture data for the inputs and outputs using the vLLM inference engine.

This data is read-only and cannot be modified by users.

Snowflake only captures response data for successful requests. If a request fails, Snowflake doesn’t capture any data.

# Prerequisites and model version compatibility

The following sections describe the prerequisites and model version compatibility for Auto Capture.

## Access control requirements

To configure and access captured inference data, your role must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Model | Required to create a service with autocapture enabled and to read inference table data using the INFERENCE_TABLE function. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../../../sql-reference/sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). In a managed access schema, only the schema owner (for example, the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| OWNERSHIP | Service | Required to list whether a service has autocapture enabled in the list_service() function. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../../../sql-reference/sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). In a managed access schema, only the schema owner (for example, the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |
| USAGE | Model, Service, Version, Gateway | Required to resolve entities in the INFERENCE_TABLE function. |

## Model version compatibility

Each model exists as a model object. The model object has its own inference table, which contains the data and metadata for each inference request. Auto Capture logs the data from the model’s inference table. Each model service has its own inference table.

If you’re creating a new model, Auto Capture is automatically enabled.

Models created before January 23, 2026, don’t support Auto Capture. You must clone the model and enable Auto Capture for its service.

Use the following command to duplicate an existing model:

```sqlexample
CREATE [ OR REPLACE ] MODEL [ IF NOT EXISTS ] <name> [ WITH VERSION <version_name> ]
FROM MODEL <source_model_name> [ VERSION <source_version_or_alias_name> ]
```

The model created with the preceding command has an empty inference table. For information about enabling autocapture, see Activate Auto Capture.

You can also create a model version from an existing model version. For more information, see [Variant Syntax](../../../sql-reference/sql/create-model.md).

After duplicating the model, you can enable autocapture by following the steps in Activate Auto Capture.

# Activate Auto Capture

After you’ve created a new model or cloned an existing model, enable Auto Capture for the model service using the Python SDK. For more information about the model service, see [Deploy models for Real time Inference (REST API)](real-time-inference-rest-api.md).

Use the following Python code to enable Auto Capture:

```python
mv.create_service(
    service_name="my_service",
    service_compute_pool="my_compute_pool",
    autocapture=True
)
```

The `mv` variable is the model version object. You defined it when you logged the model to the model registry.

The default value for autocapture is `False`. Make sure you’re enabling autocapture for a model that you’ve created after January 23, 2026 and logged to the model registry. Otherwise, the service creation fails because the model doesn’t have an inference table.

> **Important:**
>
> The autocapture setting is immutable. You can’t enable or disable auto-capture on an existing model service. You must recreate the service to change this configuration. If you recreate the service, the endpoint changes unless you use a stable endpoint or gateway.

# Query inference data

To access your logs, use the INFERENCE_TABLE function. This function returns inference logs for a model and supports filtering by version, service, and gateway. Only model owners are able to see the data when they have USAGE privileges on the gateway and service.

## Basic example

The following example demonstrates how to retrieve all inference logs for a model using the INFERENCE_TABLE function. This query returns all captured request and response data for every inference request processed by the model’s services.

```sqlexample
-- Fetch all inference logs for a specific model
SELECT * FROM TABLE(INFERENCE_TABLE('my_model'));
```

## Advanced filtering example

You can filter by specific versions, services, or gateways directly within the INFERENCE_TABLE() function:

```sqlexample
SELECT * FROM TABLE(
INFERENCE_TABLE(
'MY_MODEL',
VERSION => 'V1',
SERVICE => 'MY_PREDICTION_SERVICE',
GATEWAY => 'MY_GATEWAY'
)
);
```

> **Important:**
>
> The service, version, and gateway arguments must exist at the time of the query. If created a new service, version, or gateway with the same name as one that had existed previously, the query only produces data from the current version.

You can use the following predicate clause to filter by function name:

```sqlexample
WHERE RECORD_ATTRIBUTES:"snow.model_serving.function.name" = 'predict'
```

> **Note:**
>
> For best performance, filter for a time range on the TIMESTAMP column.

## Querying historical data for deleted entities

The inference data is retained after you delete a service, version, or gateway. You can still query this historical data so long as the model still exists.

The following example returns all inference logs for a model:

```sqlexample
SELECT *
FROM TABLE(
  INFERENCE_TABLE('my_model')
);
```

The following example filters inference logs by model version:

```sqlexample
SELECT *
FROM TABLE(
  INFERENCE_TABLE(
    'my_model',
    MODEL_VERSION => 'v1'
  )
);
```

The following example filters inference logs by version and service:

```sqlexample
SELECT *
FROM TABLE(
  INFERENCE_TABLE(
    'my_model',
    MODEL_VERSION => 'v1',
    SERVICE => 'my_service'
  )
);
```

The following example filters inference logs by version and gateway:

```sqlexample
SELECT *
FROM TABLE(
  INFERENCE_TABLE(
    'my_model',
    MODEL_VERSION => 'v1',
    GATEWAY => 'my_gateway'
  )
);
```

# Data schema and metadata

Snowflake only captures response data for successful requests. If a request fails, Snowflake doesn’t capture any data.

The following are the record attributes that are captured:

| Field | Description |
| --- | --- |
| `snow.model_serving.request.data.<column>` | The input features sent to the model. |
| `snow.model_serving.response.data.<column>` | The inference output returned by the model. |
| `snow.model_serving.request.timestamp` | When the request hit the inference service. |
| `snow.model_serving.response.code` | HTTP status (such as 200 for success and 5xx for errors). |
| `snow.model_serving.truncation_policy` | Indicates if data exceeded size limits (NONE or TRUNCATED_DEFAULT). For more information, see Data truncation logic. |
| `snow.model_serving.last_hop_id` | Reflects the last gateway id from where the request landed to the inference service. |
| `snow.model_serving.hop_ids` | Reflects the list of gateway ids, depicting the path of traversal. Currently limited to only one gateway. |

# Data truncation logic

To maintain system performance, there’s a 1 MB limit for each inference event. If the request and response reaches the limit, Snowflake applies a multi-stage truncation process to preserve as much utility as possible.

The following table shows the truncation process:

| Stage | Trigger | Action taken |
| --- | --- | --- |
| 1: Soft Reduction | > 700 KB | Raw bytes removed; Strings > 2 KB truncated; JSON objects replaced with a `TRUNCATED` status. |
| 2: Aggressive | > 900 KB | All strings further truncated to 256 bytes. |
| 3: Removal | > 900 KB\* | If still over limit, the payload is dropped and replaced with a minimal metadata skeleton. |

\*Stage 3 occurs if metadata alone exceeds the threshold after content reduction.

# Limits

Keep in mind the following limitations and considerations when using auto capture:

* **LLM Support**: Auto capture isn’t supported for Large Language Models (LLMs).
* **Throughput**: Auto capture is designed for a system throughput of approximately 300-400 requests per second (or 10MB/s) per service.
* **Replication**: You can’t replicate inference tables. Replicated models will have no inference tables in the target account.
* **Retention**: Inference data persists even if the Service or Gateway is deleted.
* **Warning**: Deleting the Model object will permanently delete all associated inference data.
* **Ground Truth**: To perform drift analysis, maintain a separate ground truth table and join it with the INFERENCE_TABLE output using common request IDs.
* **Consumer Accounts**: Consumer accounts can’t create a service with autocapture enabled for shared models with inference tables.
* **Performance**: Autocapture is designed to not add latency to inference requests. However, it may drop some captures during periods of extremely high request volume.

# Schema

As part of this feature, the following values are added to the respective columns.

## RESOURCE_ATTRIBUTES

The following table describes the resource attribute schema fields:

| Field | Description |
| --- | --- |
| `snow.model.version.id` | Unique identifier for the model version. |
| `snow.model.version.name` | Name of the model version. |

## RECORD_ATTRIBUTES

The following table describes the record attribute schema fields:

| Field | Description |
| --- | --- |
| `snow.model_serving.function.name` | Name of the model function that was called. |
| `snow.model_serving.last_hop_id` | The ID of the last gateway that processed the request. |
| `snow.model_serving.hop_ids` | List of gateway IDs that processed the request. |
| `snow.model_serving.request.data.<column>` | Input fields where `<column>` represents specific input field names. |
| `snow.model_serving.request.timestamp` | Timestamp of when the request was captured by the inference service. |
| `snow.model_serving.response.data.<column>` | Response data where `<column>` contains the inference response fields. |
| `snow.model_serving.response.timestamp` | Timestamp of when the response was captured by the service. |
| `snow.model_serving.response.code` | Response code from the inference service (for example, 200, 5xx). |
| `snow.model_serving.truncation_policy` | Indicates whether data was truncated. Values are `NONE` or `TRUNCATED_DEFAULT`. |

---
title: Batch inference jobs
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/batch-inference-jobs.md
section: Snowflake ML
---

# Batch inference jobs

> **Note:**
>
> **Preview Feature — Public**
>
> Supported in public preview since snowflake-ml-python versions 1.26.0.

Use Snowflake Batch Inference to enable efficient, large-scale model inference on static or periodically updated datasets. The Batch Inference API uses Snowpark Container Services (SPCS) to provide a distributed compute layer optimized for massive throughput and cost-efficiency.

## When to use batch inference

Use the `run_batch` method for workloads to:

* Process images, audio, or video files or using multimodal models with unstructured data
* Execute inference over millions or billions of rows.
* Run inference as a discrete, asynchronous stage in a pipeline.
* Integrate inference as a step within an Airflow DAG or Snowflake Task.

## Limitations

* For the multi-modal use cases, encryption is only supported on the server side

## Get started

### Connect to Model Registry

Connect to the Snowflake Model Registry and retrieve the model reference as:

```python
from snowflake.ml.registry import Registry

registry = Registry(session=session, database_name=DATABASE, schema_name=REGISTRY_SCHEMA)
mv = registry.get_model('my_model').version('my_version')  # returns ModelVersion
```

### Execute batch job

This API uses Snowpark Container Services (SPCS) job to launch the inference workload. After running inference, the compute automatically winds down to prevent you from incurring additional charges. On a high level, this API looks like the following:

```python
from snowflake.ml.model.batch import OutputSpec

# how to run a batch job
job = mv.run_batch(
    compute_pool = "my_compute_pool",
    X = session.table("my_table"),
    output_spec = OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
)

job.wait() # Optional: Blocking until the job finishes
```

### Job management

You can get a list of jobs, cancel a job, get a job’s handle, or delete a job using the methods below:

```python
from snowflake.ml.jobs import list_jobs, delete_job, get_job

# view logs to troubleshoot
job.get_logs()

# cancel a job
job.cancel()

# list to see all jobs
list_jobs().show()

# get the handle of a job
job = get_job("my_db.my_schema.job_name")

# delete a job that you no longer wish to run
delete_job(job)
```

> **Note:**
>
> The `result` function in the ML Job APIs is not supported for batch inference jobs.

## Specify inference data

You can use structured data or unstructured data for batch inference. To use structured data for your workflow, you can either provide a SQL query or a dataframe to the run_batch method.

For unstructured data, you can reference your files from a Snowflake stage. To reference your files, create a dataframe with the file paths.

You provide your dataframe to the run_batch method. run_batch provides the content of the files to the model.

### Structured input

The following are examples illustrating the range of input possibilities:

```python
# providing input from a query
X = session.sql("SELECT id, feature_1, feature_2 FROM feature_table WHERE feature_1 > 100"),

# reading from parquet files
X = session.read.option("pattern",".*file.*\\.parquet")
    .parquet("@DB.SCHEMA.STAGE/some/path")
    .select(col("id1").alias("id"), col("feature_1"), col("feature_2"))).filter(col("feature_1") > 100)
```

### Unstructured input (multi-modal)

For unstructured data, the `run_batch` method can read the files from the fully qualified stage paths provided in the input dataframe. The following example shows you how to specify unstructured input data:

```python
# Process a list of files
# The file paths have to be in the form of a full stage path as below
data = [
    ["@DB.SCHEMA.STAGE/dataset/files/file1"],
    ["@DB.SCHEMA.STAGE/dataset/files/file2"],
    ["@DB.SCHEMA.STAGE/dataset/files/file3"],
]
column_names = ["image"]
X = session.create_dataframe(data, schema=column_names)
```

To automatically list all files in a stage as dataframe, use code like the following:

```python
from snowflake.ml.utils.stage_file import list_stage_files

# get all files under a path
X = list_stage_files(session, "@db.schema.my_stage/path")

# get all files under a path ending with ".jpg"
X = list_stage_files(session, "@db.schema.my_stage/path", pattern=".*\\.jpg")

# get all files under a path ending with ".jpg" and return the datafram with a column_name "IMAGES"
X = list_stage_files(session, "@db.schema.my_stage/path", pattern=".*\\.jpg", column_name="IMAGES")
```

### Expressing type of data

Run_batch automatically converts your files to the model compatible formats.

Your model can accept data in one of the following formats:

* RAW_BYTES
* BASE64

For example, if you have images stored in PNG format in your stage and your model accepts RAW_BYTES, you can use the `input_spec` argument to specify how Snowflake converts your data.

The following example code converts files in your stage to RAW_BYTES:

```python
mv.run_batch(
    X,
    input_spec=InputSpec(
 # we need to provide column_handling in the InputSpec to perform the necessary conversion
 # FULL_STAGE_PATH: fully qualified path (@db.schema.stage/path) to a file
 # RAW_BYTES: download and convert the file from the stage path to bytes
        column_handling={
            "path": {"input_format": InputFormat.FULL_STAGE_PATH, "convert_to": FileEncoding.RAW_BYTES}
        }
    ),
    ...
)
```

The `column_handling` argument tells the framework that the path column of X contains a full stage path, and calls the model with raw bytes from that file.

### Output (`output_spec`)

Specify a stage directory to store the file output, as shown here:

```python
mv.run_batch(
    ...
    output_spec = OutputSpec(stage_location="@db.schema.stage/path/"),
)
```

Snowflake currently supports models that output text and stores them as parquet files. You can convert the parquet files to a Snowpark data frame as follows:

```python
session.read.option("pattern", ".*\\.parquet").parquet("@db.schema.stage/output_path/")
```

### Passing parameters

If the model’s signature includes parameters defined with
[ParamSpec](../model-registry/model-signature.md), you can pass parameter values at
inference time using the `params` argument in `InputSpec`. Any parameter not included in the dictionary uses its
default value from the signature.

```python
from snowflake.ml.model.batch import InputSpec, OutputSpec

mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    input_spec=InputSpec(
        params={"temperature": 0.9, "max_tokens": 512}
    ),
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
)
```

### Partitioned models

> **Note:**
>
> This feature requires `snowflake-ml-python` version 1.33.0 or later.

You can run batch inference jobs with partitioned models by passing the `partition_column`
argument in `InputSpec`. Each partition is processed independently, which is useful for
models that train or predict per group.

```python
from snowflake.ml.model.batch import InputSpec, OutputSpec

job = model_version.run_batch(
    input_df,
    compute_pool="my_compute_pool",
    input_spec=InputSpec(partition_column="STORE_NUMBER"),
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/results/"),
)
```

For more information about partitioned models, see
[Using partitioned models](../model-registry/partitioned-models.md).

## Job specification

To configure job-level settings for your batch inference workload (such as the number of workers, resource allocation, and execution parameters,
pass a `JobSpec` instance as the `job_spec` arument of the `run_batch` method. An example is shown below:

```python
from snowflake.ml.model.batch import JobSpec, OutputSpec

job_spec = JobSpec(
    job_name="my_inference_job",
    cpu_requests="2",
    memory_requests="8GiB",
    max_batch_rows=2048,
    replicas=2,
)

job = mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
    job_spec=job_spec,
)
```

## Best practices

### Using a sentinel file

A job can fail midway for various reasons. The output directory can therefore end up having partial data. To mark completion of the job, run_batch writes a completion file _SUCCESS in the output directory.

To avoid having partial or incorrect output:

* Read output data only after the sentinel file is found.
* Provide an empty directory to begin with.
* Run run_batch with mode = SaveMode.ERROR.

## Examples

### Using a custom model

```python
from transformers import pipeline
from snowflake.ml.model import custom_model
from snowflake.ml.model import target_platform
from snowflake.ml.model.batch import InputSpec, OutputSpec, FileEncoding, InputFormat
from snowflake.ml.model.model_signature import core

# first we must define the schema, we'll expect audio file input as base64 string
signature = core.ModelSignature(
    inputs=[
        core.FeatureSpec(name="audio", dtype=core.DataType.STRING),
    ],
    outputs=[
        core.FeatureGroupSpec(
            name="outputs",
            specs=[
                core.FeatureSpec(name="text", dtype=core.DataType.STRING),
                core.FeatureGroupSpec(
                    name="chunks",
                    specs=[
                        core.FeatureSpec(
                            name="timestamp", dtype=core.DataType.DOUBLE, shape=(2,)
                        ),
                        core.FeatureSpec(name="text", dtype=core.DataType.STRING),
                    ],
                    shape=(-1,),
                ),
            ],
        ),
    ],
)

# defining the custom model, we decode the input from base64 to bytes and
# use whisper to perform the transcription
class CustomTranscriber(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)
        self.model = self.context.model_ref("my_model")

    @custom_model.inference_api
    def predict(self, df: pd.DataFrame) -> pd.DataFrame:
        import base64
        audio_b64_list = df["audio"].tolist()
        audio_bytes_list = [base64.b64decode(audio_b64) for audio_b64 in audio_b64_list]
        temp_res = [self.model(audio_bytes) for audio_bytes in audio_bytes_list]
        return pd.DataFrame({"outputs": temp_res})

# creating an instance of our transcriber for logging
transcriber = CustomTranscriber(
    custom_model.ModelContext(
        models={
            "my_model": pipeline(
                task="automatic-speech-recognition", model="openai/whisper-small"
            )
        }
    )
)

# log the model
mv = reg.log_model(
    transcriber,
    model_name="custom_transcriber",
    version_name="v1",
    signatures={"predict": signature},
)

# input dataframe
data = [
    ["@DB.SCHEMA.STAGE/dataset/audio/audio1.mp3"],
    ["@DB.SCHEMA.STAGE/dataset/audio/audio2.mp3"],
    ["@DB.SCHEMA.STAGE/dataset/audio/audio3.mp3"],
]
column_names = ["audio"] # This column was defined in the signature above
input_df = session.create_dataframe(data, schema=column_names)

job = mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
    input_spec=InputSpec(
# we need to provide column_handling in the InputSpec to perform the necessary conversion
# FULL_STAGE_PATH: fully qualified path (db.schema.stage/path) to a file
# BASE_64: download and convert the file from the stage path to base64 string
        column_handling={
            "audio": {"input_format": InputFormat.FULL_STAGE_PATH, "convert_to": FileEncoding.BASE64}
        }
    )
)
```

### Using Hugging Face Model

```python
from transformers import pipeline
from snowflake.ml.model import target_platform
from snowflake.ml.model.batch import InputSpec, OutputSpec, FileEncoding, InputFormat

# supported Hugging Face tasks will have their signatures auto-inferred
classifier = pipeline(task="image-classification", model="google/vit-base-patch16-224")

# log the model
mv = reg.log_model(
    classifier,
    model_name="image_classifier",
    version_name="v1",
    target_platforms=target_platform.SNOWPARK_CONTAINER_SERVICES_ONLY,
    pip_requirements=[
        "pillow" # dependency for image classification
    ],
)

# input dataframe
data = [
    ["@DB.SCHEMA.STAGE/dataset/image/image1.mp3"],
    ["@DB.SCHEMA.STAGE/dataset/image/image2.mp3"],
    ["@DB.SCHEMA.STAGE/dataset/image/image3.mp3"],
]
# this column was defined in the auto-inferred signature
# you can view the signature by calling 'mv.show_functions()'
column_names = ["images"]
input_df = session.create_dataframe(data, schema=column_names)

mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    output_spec=OutputSpec(stage_location=f"@my_db.my_schema.my_stage/path/"),
    input_spec=InputSpec(
# we need to provide column_handling in the InputSpec to perform the necessary conversion
# FULL_STAGE_PATH: fully qualified path (db.schema.stage/path) to a file
# RAW_BYTES: download and convert the file to bytes (matching the predefined signature)
        column_handling={
            "IMAGES": {"input_format": InputFormat.FULL_STAGE_PATH, "convert_to": FileEncoding.RAW_BYTES}
        }
    )
)
```

### Using Hugging Face Model with vLLM

#### Task: text generation

```python
import json

from snowflake.ml.model import target_platform
from snowflake.ml.model.batch import InputSpec, OutputSpec, FileEncoding, InputFormat

# it's a large model so we remotely log it
model = huggingface.TransformersPipeline(model="Qwen/Qwen2.5-0.5B-Instruct", task="text-generation")

mv = reg.log_model(
    model,
    model_name="qwenw_5",
    version_name="v1",
    options={"cuda_version": "12.4"},
    target_platforms=target_platform.SNOWPARK_CONTAINER_SERVICES_ONLY,
)

# constructing OpenAi chat/completions API compatible messages
messages = [[
    {"role": "system", "content": [{"type": "text", "text": "You are an expert on cats and kitchens."}]},
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "How many breeds of cats are there?"},
        ]
    }
]]
schema = ["messages"]
data = [(json.dumps(m)) for m in messages]
input_df = session.create_dataframe(data, schema=schema)

mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
    inference_engine_options={
 # set vLLM as the inference backend
        "engine": InferenceEngine.VLLM,
    },
)
```

#### Task: image text to text

```python
import json

from snowflake.ml.model import target_platform
from snowflake.ml.model.batch import InputSpec, OutputSpec

# it's a large model so we remotely log it
model = huggingface.TransformersPipeline(model="Qwen/Qwen2-VL-2B-Instruct", task="image-text-to-text")

mv = reg.log_model(
    model,
    model_name="qwen2_vl_2b",
    version_name="v1",
    options={"cuda_version": "12.4"},
    targets=target_platform.SNOWPARK_CONTAINER_SERVICES_ONLY,
)

# constructing OpenAi chat/completions API compatible messages
messages = [[
    {"role": "system", "content": [{"type": "text", "text": "You are an expert on cats and kitchens."}]},
    {
        "role": "user",
        "content": [
            {"type": "text", "text": "What breed of cat is this?"},
            {
                "type": "image_url",
                "image_url": {
                    # run_batch will downlaod and convert the file to the format that vLLM can handle
                    "url": f"@db.schema.stage/path/cat.jpeg",
                }
            }
     # you can also pass video and audio like below
            # {
            #     "type": "video_url",
            #     "video_url": {
            #         "url": "@db.schema.stage/path/video.avi",
            #     }
            # }
            # {
            #     "type": "input_audio",
            #     "input_audio": {
            #         "data": "@db.schema.stage/path/audio.mp3",
            #         "format": "mp3",
            #     }
            # }
        ]
    }
]]

schema = ["messages"]
data = [(json.dumps(m)) for m in messages]
input_df = session.create_dataframe(data, schema=schema)

mv.run_batch(
    X=input_df,
    compute_pool="my_compute_pool",
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/path/"),
    inference_engine_options={
 # set vLLM as the inference backend
        "engine": InferenceEngine.VLLM,
    },
)
```

### Sample notebooks

For end-to-end runnable examples, see the
[batch inference sample notebooks](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/ml/model_serving/batch_inference)
on GitHub.

---
title: Bring your own model types via serialized files
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/bring-your-own-model-types.md
section: Snowflake ML
---

# Bring your own model types via serialized files

The model registry supports logging [built-in model types](built-in-models/overview.md) directly in the registry.
We also provide a method of logging other model types with `snowflake.ml.model.custom_model.CustomModel`. Serializable models trained using external tools or obtained from open source repositories can be used with `CustomModel`.

This guide explains how to:

* Create a custom model.
* Create model context with files and model objects.
* Include additional code with your model using `code_paths`.
* Log the custom model to the Snowflake Model Registry.
* Deploy the model for inference.

> **Note:**
>
> [This quickstart](https://quickstarts.snowflake.com/guide/deploying_custom_models_to_snowflake_model_registry/) provides an example of logging a custom PyCaret model.

## Defining model context by keyword arguments

The `snowflake.ml.model.custom_model.ModelContext` can be instantiated with user-defined keyword arguments. The values can either be string file paths or instances of [supported model types](built-in-models/overview.md). The files and serialized models will be packaged with the model for use in the model inference logic.

### Using in-memory model objects

When working with [built-in model types](built-in-models/overview.md), the recommended approach is to pass in-memory model objects directly to the `ModelContext`. This allows Snowflake ML to handle serialization automatically.

```python
import pandas as pd
from snowflake.ml.model import custom_model

# Initialize ModelContext with an in-memory model object
# my_model can be any supported model type (e.g., sklearn, xgboost, lightgbm, and others)
model_context = custom_model.ModelContext(
    my_model=my_model,
)

# Define a custom model class that utilizes the context
class ExampleBringYourOwnModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        # Use the model with key 'my_model' from the context to make predictions
        model_output = self.context['my_model'].predict(input)
        return pd.DataFrame({'output': model_output})

# Instantiate the custom model with the model context. This instance can be logged in the model registry.
my_model = ExampleBringYourOwnModel(model_context)
```

> **Note:**
>
> In your custom model class, always access model objects through the model context. For example, use `self.model = self.context['my_model']`
> instead of directly assigning `self.model = model` (where `model` is an in-memory model object). Accessing the model
> directly captures a second copy of the model in a closure, which results in significantly larger model files during serialization.

### Using serialized files

For models or data that are stored in serialized files like Python pickles or JSON, you can provide file paths to your `ModelContext`. Files can be serialized models, configuration files, or files containing parameters. This is useful when working with pre-trained models saved to disk or configuration data.

```python
import pickle
import pandas as pd
from snowflake.ml.model import custom_model

# Initialize ModelContext with a file path
# my_file_path is a local pickle file path
model_context = custom_model.ModelContext(
    my_file_path='/path/to/file.pkl',
)

# Define a custom model class that loads the pickled object
class ExampleBringYourOwnModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

        # Use 'my_file_path' key from the context to load the pickled object
        with open(self.context['my_file_path'], 'rb') as f:
            self.obj = pickle.load(f)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        # Use the loaded object to make predictions
        model_output = self.obj.predict(input)
        return pd.DataFrame({'output': model_output})

# Instantiate the custom model with the model context. This instance can be logged in the model registry.
my_model = ExampleBringYourOwnModel(model_context)
```

> **Important:**
>
> When you combine a supported model type (such as XGBoost) with unsupported models or data, you don’t need to
> serialize the supported model yourself. Set the supported model object directly in the context (e.g., `base_model =
> my_xgb_model`) and it is serialized automatically.

> **Important:**
>
> Methods decorated with `@custom_model.inference_api` should always be written to work on multi-row dataframe.
> Don’t assume that the input DataFrame will always contain a single row. Because of server-side batching,
> specifically in real-time inference, even single-record requests from multiple sources can be batched together into one DataFrame.

## Defining inference parameters

Custom model inference methods can accept optional parameters that control inference behavior, such as a temperature
setting or maximum number of tokens. Define parameters as keyword-only arguments (after `*`) on the
`@inference_api` method, with type annotations and default values.

```python
import pandas as pd
from snowflake.ml.model import custom_model

class TextGenerationModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(
        self,
        input: pd.DataFrame,
        *,
        temperature: float = 0.7,
        max_tokens: int = 256,
    ) -> pd.DataFrame:
        # Use temperature and max_tokens to control generation behavior
        output = self.context['my_model'].generate(
            input["input_text"],
            temperature=temperature,
            max_tokens=max_tokens,
        )
        return pd.DataFrame({"output_text": output})
```

When this model is logged, the parameters are automatically included in the model signature. Callers can override
them at inference time, or omit them to use the defaults. For more information, see
[Specifying model signatures](model-signature.md).

The following requirements apply to inference parameters:

* They must be keyword-only (defined after `*` in the method signature).
* They must have a type annotation. Supported types are `int`, `float`, `str`, `bool`, `bytes`,
  `datetime.datetime`, and `list` with a supported element type (for example, `list[str]`,
  `list[list[int]]`).
* They must have a default value.

## Testing and logging a custom model

You can test a custom model by running it locally.

```python
my_model = ExampleBringYourOwnModel(model_context)
output_df = my_model.predict(input_df)
```

When the model works as intended, log it to the Snowflake Model Registry. As shown in the next code
example, provide `conda_dependencies` (or `pip_requirements`) to specify the libraries that the model class needs.
Provide `sample_input_data` (a pandas or Snowpark DataFrame) to infer the input signature for the model. Alternatively,
provide a [model signature](model-signature.md).

```python
reg = Registry(session=sp_session, database_name="ML", schema_name="REGISTRY")
mv = reg.log_model(my_model,
            model_name="my_custom_model",
            version_name="v1",
            conda_dependencies=["scikit-learn"],
            comment="My Custom ML Model",
            sample_input_data=train_features)
output_df = mv.run(input_df)
```

## Including additional code with code_paths

Use the `code_paths` parameter in [Registry.log_model](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/api/registry/snowflake.ml.registry.Registry#snowflake.ml.registry.Registry.log_model) to
package Python code, such as helper modules, utilities, and configuration files with your model. You can import this code just as you would locally.

You can either provide string paths to copy files or directories, or `CodePath` objects. The objects provide more control over which subdirectories or files are included, and the import paths that will be used by the model.

### Using string paths

Pass a list of string paths to include files or directories. The last component of each path becomes the
importable module name.

```python
mv = reg.log_model(
    my_model,
    model_name="my_model",
    version_name="v1",
    code_paths=["src/mymodule"],  # import with: import mymodule
)
```

### Using CodePath with filter

Use the `CodePath` class when you want to package only part of a directory tree
or control the import paths used by your model.

```python
from snowflake.ml.model import CodePath
```

A `CodePath` has two parameters:

* `root`: A directory or file path.
* `filter` (optional): A relative path under `root` that selects a subdirectory or file.

When `filter` is provided, the source is `root/filter`, and the `filter` value determines the import path.
For example, `filter="utils"` allows you to `import utils`, and `filter="pkg/subpkg"` allows you to
`import pkg.subpkg`.

**Example:** Given this project structure:

```text
my_project/src/
├── utils/
│   └── preprocessing.py
├── models/
│   └── classifier.py
└── tests/          # Not needed for inference
```

To package only `utils/` and `models/`, excluding `tests/`:

```python
mv = reg.log_model(
    my_model,
    model_name="my_model",
    version_name="v1",
    code_paths=[
        CodePath("my_project/src/", filter="utils/"),
        CodePath("my_project/src/", filter="models/"),
    ],
)
```

You can also filter a single file:

```python
code_paths=[
    CodePath("my_project/src/", filter="utils/preprocessing.py"),
]
# Import with: import utils.preprocessing
```

## Example: Logging a PyCaret model

The following example uses PyCaret to log a custom model type. PyCaret is a low-code, high-efficiency third-party package that Snowflake doesn’t support natively.
You can bring your own model types using similar methods.

### Step 1: Define the model context

Before you log your model, define the model context. The model context refers to your own custom model type.
The following example specifies the path to the serialized (pickled) model using the context’s `model_file` attribute. You can choose any
name for the attribute as long as the name is not used for anything else.

```python
pycaret_model_context = custom_model.ModelContext(
  model_file = 'pycaret_best_model.pkl',
)
```

### Step 2: Create a custom model class

Define a custom model class to log a model type without native support. In this example, a `PyCaretModel` class,
derived from `CustomModel`, is defined so the model can be logged in the registry.

```python
from pycaret.classification import load_model, predict_model

class PyCaretModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)
        model_dir = self.context["model_file"][:-4]  # Remove '.pkl' suffix
        self.model = load_model(model_dir, verbose=False)
        self.model.memory = '/tmp/'  # Update memory directory

    @custom_model.inference_api
    def predict(self, X: pd.DataFrame) -> pd.DataFrame:
        model_output = predict_model(self.model, data=X)
        return pd.DataFrame({
            "prediction_label": model_output['prediction_label'],
            "prediction_score": model_output['prediction_score']
        })
```

> **Note:**
>
> As shown, set the model’s memory directory to `/tmp/`. Snowflake’s warehouse nodes have restricted directory
> access. `/tmp` is always writeable and is a safe choice when the model needs a place to write files. This might
> not be necessary for other types of models.

### Step 3: Test the custom model

Test the PyCaret model locally using code like the following.

```python
test_data = [
    [1, 237, 1, 1.75, 1.99, 0.00, 0.00, 0, 0, 0.5, 1.99, 1.75, 0.24, 'No', 0.0, 0.0, 0.24, 1],
    # Additional test rows...
]
col_names = ['Id', 'WeekofPurchase', 'StoreID', 'PriceCH', 'PriceMM', 'DiscCH', 'DiscMM',
            'SpecialCH', 'SpecialMM', 'LoyalCH', 'SalePriceMM', 'SalePriceCH',
            'PriceDiff', 'Store7', 'PctDiscMM', 'PctDiscCH', 'ListPriceDiff', 'STORE']

test_df = pd.DataFrame(test_data, columns=col_names)

my_pycaret_model = PyCaretModel(pycaret_model_context)
output_df = my_pycaret_model.predict(test_df)
```

### Step 4: Define a model signature

In this example, use the sample data to infer a [model signature](model-signature.md) for input validation:

```python
predict_signature = model_signature.infer_signature(input_data=test_df, output_data=output_df)
```

### Step 5: Log the model

The following code logs (registers) the model in the Snowflake Model Registry.

```python
snowml_registry = Registry(session)

custom_mv = snowml_registry.log_model(
    my_pycaret_model,
    model_name="my_pycaret_best_model",
    version_name="version_1",
    conda_dependencies=["pycaret==3.0.2", "scipy==1.11.4", "joblib==1.2.0"],
    options={"relax_version": False},
    signatures={"predict": predict_signature},
    comment = 'My PyCaret classification experiment using the CustomModel API'
)
```

### Step 6: Verify the model in the registry

To verify that the model is available in the Model Registry, use `show_models` function.

```python
snowml_registry.show_models()
```

### Step 7: Make predictions with the registered model

Use the `run` function to call the model for prediction.

```python
snowpark_df = session.create_dataframe(test_data, schema=col_nms)

custom_mv.run(snowpark_df).show()
```

## Next Steps

After deploying a PyCaret model by way of the Snowflake Model Registry, you can view the model in Snowsight.
In the navigation menu, select AI & ML » Models. If you do not see it there, make sure you are using the ACCOUNTADMIN role or the
role you used to log the model.

To use the model from SQL, use SQL like the following:

```sqlexample
SELECT
    my_pycaret_model!predict(*) AS predict_dict,
    predict_dict['prediction_label']::text AS prediction_label,
    predict_dict['prediction_score']::double AS prediction_score
from pycaret_input_data;
```

---
title: CatBoost
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/catboost.md
section: Snowflake ML
---

# CatBoost

The Snowflake ML Model Registry supports models created using CatBoost (models derived from `catboost.CatBoost`, such as
`catboost.CatBoostClassifier`, `catboost.CatBoostRegressor`, and `catboost.CatBoostRanker`).

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. CatBoost models have the following target methods by default, assuming the method exists: `predict`, `predict_proba`. |
| `enable_explainability` | Whether to enable explainability for the model using SHAP. Defaults to `True`. When enabled, an `explain` method will be available on the logged model. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a CatBoost model so
that the registry knows the signatures of the target methods.

## Examples

These examples assume `reg` is an instance of `snowflake.ml.registry.Registry`.

### CatBoostClassifier

The following example demonstrates the key steps to train a CatBoost classifier, log it to the Snowflake ML Model Registry, and use the registered model for inference and explainability. The workflow includes:

* Trains a CatBoost classifier on a sample dataset.
* Logs the model to the Snowflake ML Model Registry.
* Makes predictions and retrieves prediction probabilities.
* Gets SHAP values for the model’s predictions.

```python
import catboost
from sklearn import datasets, model_selection

# Load dataset
cal_data = datasets.load_breast_cancer(as_frame=True)
cal_X = cal_data.data
cal_y = cal_data.target

# Normalize column names (replace spaces with underscores)
cal_X.columns = [col.replace(' ', '_') for col in cal_X.columns]

cal_X_train, cal_X_test, cal_y_train, cal_y_test = model_selection.train_test_split(
    cal_X, cal_y, test_size=0.2
)

# Train CatBoost Classifier
classifier = catboost.CatBoostClassifier(
    iterations=100,
    learning_rate=0.1,
    depth=6,
    verbose=False
)
classifier.fit(cal_X_train, cal_y_train)

# Log the model
model_ref = reg.log_model(
    model=classifier,
    model_name="my_catboost_classifier",
    version_name="v1",
    sample_input_data=cal_X_test,
)

# Make predictions
result_df = model_ref.run(cal_X_test[-10:], function_name="predict")

# Get prediction probabilities
proba_df = model_ref.run(cal_X_test[-10:], function_name="predict_proba")

# Get explanations (SHAP values)
explanations_df = model_ref.run(cal_X_test[-10:], function_name="explain")
```

### CatBoostRegressor

The following example demonstrates the key steps to train a CatBoost regressor, log it to the Snowflake ML Model Registry, and use the registered model for inference. The workflow includes:

* Trains a CatBoost regressor on a sample dataset.
* Logs the model to the Snowflake ML Model Registry.
* Makes predictions.

```python
import catboost
from sklearn import datasets, model_selection

# Load dataset
cal_data = datasets.load_diabetes(as_frame=True)
cal_X = cal_data.data
cal_y = cal_data.target

cal_X_train, cal_X_test, cal_y_train, cal_y_test = model_selection.train_test_split(
    cal_X, cal_y, test_size=0.2
)

# Train CatBoost Regressor
regressor = catboost.CatBoostRegressor(
    iterations=100,
    learning_rate=0.1,
    depth=6,
    verbose=False
)
regressor.fit(cal_X_train, cal_y_train)

# Log the model
model_ref = reg.log_model(
    model=regressor,
    model_name="my_catboost_regressor",
    version_name="v1",
    sample_input_data=cal_X_test,
)

# Make predictions
result_df = model_ref.run(cal_X_test[-10:], function_name="predict")
```

### Disabling Explainability

If you do not need explainability features, you can disable them during logging to reduce model size and dependencies:

```python
model_ref = reg.log_model(
    model=classifier,
    model_name="my_catboost_classifier_no_explain",
    version_name="v1",
    sample_input_data=cal_X_test,
    options={"enable_explainability": False},
)
```

---
title: Common feature and query patterns
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/examples.md
section: Snowflake ML
---

# Common feature and query patterns

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

The `FeatureView` class accepts a Snowpark DataFrame object containing the feature transformation logic. You can
therefore describe your features in any way supported by the Snowpark DataFrame API or by Snowflake SQL. You can pass
the DataFrame to the `FeatureView` constructor directly.

The Snowpark Python API provides
[analytics functions](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameAnalyticsFunctions)
for easily defining many common feature types, such as windowed aggregations. This topic contains some examples of these.

The open source [snowflake-ml-python](https://github.com/snowflakedb/snowflake-ml-python/tree/main/snowflake/ml/feature_store/examples)
on Github also contains some sample feature view and entity definitions using public datasets.

## Per-row features

In per-row features, functions are applied to each row of tabular data. For example, the following code fills null in
`foo` with zero, then computes a ZIP code from `lat` and `long`. There is one output row per input
row.

Python:

```python
def get_zipcode(df: snowpark.DataFrame) -> snowpark.DataFrame:
    df = df.fillna({"foo": 0})
    df = df.with_column(
        "zipcode",
        F.compute_zipcode(df["lat"], df["long"])
    )
    return df
```

Snowflake SQL:

```sqlexample
SELECT
    COALESCE(foo, 0) AS foo,
    compute_zipcode(lat, long) AS zipcode
FROM <source_table_name>;
```

## Per-group features

Per-group features aggregate values in a column within a group. For example, the sum of daily rainfall might be grouped
by city for weather forecasting. The output DataFrame has one row per group.

Python:

```python
def sum_rainfall(df: snowpark.DataFrame) -> snowpark.DataFrame:
    df = df.group_by(
        ["location", to_date(timestamp)]
    ).agg(
        sum("rain").alias("sum_rain"),
        avg("humidity").alias("avg_humidity")
    )
    return df
```

Snowflake SQL:

```sqlexample
SELECT
    location,
    TO_DATE(timestamp) AS date,
    SUM(rain) AS sum_rain,
    AVG(humidity) AS avg_humidity
FROM <source_table_name>
GROUP BY location, date;
```

## Row-based window features

Row-based window features aggregate values over a fixed window of rows; for example, summing the last three
transaction amounts. The output DataFrame has one row per window frame.

Python:

```python
def sum_past_3_transactions(df: snowpark.DataFrame) -> snowpark.DataFrame:
    window = Window.partition_by("id").order_by("ts").rows_between(2, Window.CURRENT_ROW)

    return df.select(
        sum("amount").over(window).alias("sum_past_3_transactions")
    )
```

Snowflake SQL:

```sqlexample
SELECT
    id,
    SUM(amount) OVER (PARTITION BY id ORDER BY ts ROWS BETWEEN 2 PRECEDING and 0 FOLLOWING)
        AS sum_past_3_transactions
FROM <source_table_name>;
```

## Moving aggregation features

Moving aggregation features calculate moving statistics, such as sum and average, within a specified window size.
This function dynamically computes these aggregates across different subsets of the DataFrame based on the defined
window sizes, order, and groupings. The output DataFrame has one row per window frame.

```python
new_df =  df.analytics.moving_agg(
    aggs={"SALESAMOUNT": ["SUM", "AVG"]},
    window_sizes=[2, 3],
    order_by=["ORDERDATE"],
    group_by=["PRODUCTKEY"]
)
```

## Cumulative aggregation features

Cumulative aggregation computes ongoing totals, minimums, maximums, and other cumulative statistics across a data
partition, which is sorted and grouped as specified. Unlike moving aggregates, these totals extend from the start of the
partition or to the end, depending on the direction specified, providing running totals that do not reset. The output
DataFrame has one row per input row.

```python
 new_df = df.analytics.cumulative_agg(
    aggs={"SALESAMOUNT": ["SUM", "MIN", "MAX"]},
    order_by=["ORDERDATE"],
    group_by=["PRODUCTKEY"],
    is_forward=True
)
```

## Lag features

Lag features introduce new columns containing values from prior rows within each partition, offset by a specified number
of rows. This function is critical for comparing current values against previous values in a dataset, thus assisting in
detecting trends or changes over time. The output DataFrame has one row per input row.

```python
new_df = df.analytics.compute_lag(
    cols=["SALESAMOUNT"],
    lags=[1, 2],
    order_by=["ORDERDATE"],
    group_by=["PRODUCTKEY"]
)
```

## Lead features

The inverse of lag features, lead features create new columns containing values from subsequent rows, shifting data
upward. This feature is essential for making predictions or assumptions based on future data points already present in a
dataset. The output DataFrame has one row per input row.

```python
new_df = df.analytics.compute_lead(
    cols=["SALESAMOUNT"],
    leads=[1, 2],
    order_by=["ORDERDATE"],
    group_by=["PRODUCTKEY"]
)
```

## Time-series features

Time-series features compute feature values based on a time window and a fixed position along the time axis. Examples
include the count of trips over the past week for rideshares or the sum of sales over the past three days. The output
DataFrame has one row per time window.

Recent versions of the Snowflake Feature Store include an experimental time series aggregation API. Using this API,
a time series feature can be created using code like the following:

Python:

```python
def custom_column_naming(input_col, agg, window):
    return f"{agg}_{input_col}_{window.replace('-', 'past_')}"

result_df = weather_df.analytics.time_series_agg(
    aggs={"rain": ["SUM"]},
    windows=["-3D", "-5D"],
    sliding_interval="1D",
    group_by=["location"],
    time_col="ts",
    col_formatter=custom_column_naming
)
```

You can also construct time-series features with RANGE BETWEEN syntax in SQL. for more details, see
[Snowflake Window functions](../../../sql-reference/functions-window.md).

Snowflake SQL:

```sqlexample
select
    TS,
    LOCATION,
    sum(RAIN) over (
        partition by LOCATION
        order by TS
        range between interval '3 days' preceding and current row
    ) SUM_RAIN_3D,
    sum(RAIN) over (
        partition by LOCATION
        order by TS
        range between interval '5 days' preceding and current row
    ) SUM_RAIN_5D
from <source_table_name>
```

## Using user-defined functions in feature pipelines

The Snowflake Feature Store supports user defined functions (UDFs) in feature pipeline definitions. However, only
deterministic functions (functions that always return the same result for the same input) can be incrementally
maintained. To enable incremental maintenance, mark your UDF as immutable when registering it.

```python
# In Python
@F.udf(
    name="MY_UDF",
    immutable=True,
    # ...
)
def my_udf(...):
    # ...
```

If your function is written in SQL, specify the IMMUTABLE keyword. See [this guide](../../../sql-reference/sql/create-function.md).

---
title: Container Runtime on multi-node clusters
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime-multi-node.md
section: Snowflake ML
---

# Container Runtime on multi-node clusters

In this preview, [Container Runtime](container-runtime-ml.md) allows you to run
ML workloads on multi-node clusters in Snowflake Notebooks. The `snowflake-ml-python` library includes APIs to set the
number of nodes in the compute pool available for ML workloads, allowing the resources available to a workload to be
scaled without resizing the compute pool. Another API retrieves a list of active nodes.

A multi-node cluster assigns one node to be the *head* node. Additional nodes are called *worker* nodes. The head node
orchestrates parallel operations in the cluster and also contributes its computing resources to running the workload. A
multi-node cluster with one active node has only a head node. A multi-node cluster with three active nodes has one head
node and two worker nodes, and all three nodes participate in running your workload.

## Prerequisites

To use multi-node clusters to run your ML workloads, you need:

* An active Snowflake account with access to notebooks. See [Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md).
* Privileges to create and manage notebooks that use the container runtime.
  See [Notebooks on Container Runtime](notebooks-on-spcs.md).

### Configure a compute pool

To use a multi-node setup, you need a compute pool with at least two nodes. You can either [create a new compute pool](../../sql-reference/sql/create-compute-pool.md)
or [alter an existing one](../../sql-reference/sql/alter-compute-pool.md). In either command, pass a MAX_NODES argument to set the pool’s maximum capacity.
It’s good practice to provision one or more extra nodes so you can easily scale up or down for larger or smaller workloads.

To see a compute pool’s capacity, use the [DESCRIBE COMPUTE POOL](../../sql-reference/sql/desc-compute-pool.md) command.
The capacity is in the MAX_NODES column of the returned table.

```sqlexample
DESCRIBE COMPUTE POOL my_pool;
```

To set a compute pool’s capacity, use the [ALTER COMPUTE POOL](../../sql-reference/sql/alter-compute-pool.md) command.

```sqlexample
ALTER COMPUTE POOL <compute_pool_name>
    SET MAX_NODES = <total_capacity>;
```

## Running a workload on a multi-node cluster

Choosing a multi-node compute pool for your notebook is the only action required to use multiple nodes in the compute
pool to run an ML workload.

In the notebook, set the number of active nodes using the `snowflake.ml.runtime_cluster.scale_cluster` Python API.
The number of active nodes in a compute pool is the number of nodes available to run a workload, up to the pool’s
MAX_NODES. The method takes the total number of active nodes required, including the head node and all worker nodes, as its primary parameter.

> **Note:**
>
> This function is blocking by default (that is, it waits until the scaling operation finishes) and has a 12-minute timeout.
> If the operation times out, it will automatically roll back to its previous state.

Scaling operations don’t persist across sessions. That is, if a notebook ends with a non-zero number of worker
nodes, it will not automatically scale up the next time the notebook is started. You must call the scaling API again to
set the number of worker nodes.

### Syntax

```python
snowflake.ml.runtime_cluster.scale_cluster(
    expected_cluster_size: int,
    *,
    notebook_name: Optional[str] = None,
    is_async: bool = False,
    options: Optional[Dict[str, Any]] = None
) -> bool
```

#### Arguments

* `expected_cluster_size` (int): The number of active nodes in the compute pool, up to the pool’s MAX_NODES.
  This includes the head node and all worker nodes.
* `notebook_name` (Optional[str]): The name of the notebook where the workload is run. The compute pool to be scaled is the
  pool that the specified notebook is running on. If not provided, it will be automatically determined from the current context.
  An exception is raised if the wrong notebook name is used.
* `is_async` (bool): Controls whether the function blocks waiting for scaling:

  + If False (default): The function blocks until the cluster is fully ready or the operation times out.
  + If True: The function returns immediately after confirming the scaling request has been accepted.
* `options` (Optional[Dict[str, Any]]): Advanced configuration options:

  + `rollback_after_seconds` (int): Maximum time before automatic rollback if scaling is not completed. The default is 720 seconds.
  + `block_until_min_cluster_size` (int): Minimum number of nodes that must be ready before the function returns.

#### Returns

`True` if the compute pool is successfully scaled to the specified number of active nodes. Otherwise, an exception
is raised.

### Example

```python
from snowflake.ml.runtime_cluster import scale_cluster

# Example 1: Scale up the cluster
scale_cluster(3) # Scales the cluster to 3 total nodes (1 head + 2 workers)

# Example 2: Scale down the cluster
scale_cluster(1) # Scales the cluster to 1 head + 0 workers

# Example 3: Asynchronous scaling - function returns immediately after request is accepted
scale_cluster(5, is_async=True)

# Example 4: Scaling with custom options - wait for at least 2 nodes to be ready
scale_cluster(5, options={"block_until_min_cluster_size": 2})
```

## Get the available number of nodes

Use the `get_nodes` API to get information about the active nodes in the cluster. The function takes no arguments.

### Syntax

```python
get_nodes() -> list
```

#### Returns

A list containing details of the active nodes in the cluster. Each element of the list is a dictionary with the following keys:

* `name` (str): The name of the node.
* `cpus` (int): The number of CPUs on the node.
* `gpus` (int): The number of GPUs on the node.

### Example

```python
from snowflake.ml.runtime_cluster import get_nodes

# Example: Get the active nodes in the cluster
nodes = get_nodes()
print(len(nodes), nodes)
```

The output of the example code is as follows:

```output
2 [{'name': "IP1", 'cpus': 4, 'gpus': 0}, {'name': "IP2", 'cpus': 8, 'gpus': 1}]
```

## Distributed training on multi-node clusters

The Container Runtime supports distributed training of LightGBM, XGBoost, and PyTorch models.
The distributed training APIs for LightGBMEstimator, XGBEstimator, and PyTorch are documented in detail in the
[API Reference](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/distributors).

### Scaling configuration

All models provide an optional scaling configuration parameter that allows you to specify the resource for the training
job. The scaling configuration is an instance of a model-specific class: `LightGBMScalingConfig`,
`XGBScalingConfig`, or `PyTorchScalingConfig` depending on the model type.

LightGBM and XGBoost scaling configuration objects have the following attributes:

* `num_workers`: The number of worker processes to use for training. The default is -1, which sets the number
  of worker processes automatically.
* `num_cpu_per_worker`: Number of CPUs allocated per worker process. The default is -1, which sets the number of CPUs
  per worker process automatically.
* `use_gpu`: Whether to use the GPU for training. The default is None, allowing the estimator to choose based on the environment.
  When using the GPU, be sure to also configure the model parameters to use the GPU.

> **Note:**
>
> Generally, leave `num_workers` and `num_cpu_per_worker` at their default values, so Container Services
> determines the best way to distribute these resources. The runtime assigns a worker for each node in the compute pool,
> and the necessary CPUs or GPUs for each worker to complete the task.

PyTorch scaling configuration objects have the following attributes:

* `num_cpus`: The number of CPU cores to reserve for each worker.
* `num_gpus`: The number of GPUs to reserve for each worker. The default is 0, indicating no GPUs are reserved.

### Distributed training of LightGBM/XGBoost models

Memory usage
:   Typically, a node with *n* GB of RAM can train a model on *n/4* to *n/3* of data without running out of memory. The
    maximum dataset size depends on the number of worker processes and the training algorithm used.

Compute performance
:   Performance of multi-node training depends on model parameters such as tree depth, number of trees, and maximum
    number of bins. Increasing these parameter values can increase the total training time on a dataset.

#### Example

The following example shows how to train an XGBoost model on a multi-node cluster. Training of LightGBM models is similar.

```python
from snowflake.ml.modeling.distributors.xgboost import XGBEstimator, XGBScalingConfig
from snowflake.ml.data.data_connector import DataConnector
from implementations.ray_data_ingester import RayDataIngester
table_name = "MULTINODE_SAMPLE_TRAIN_DS"

# Use code like the following to generate example data
"""
# Create a table in current database/schema and store data there
def generate_dataset_sql(db, schema, table_name) -> str:
    sql_script = f"CREATE TABLE IF NOT EXISTS {db}.{schema}.{table_name} AS \n"
    sql_script += f"select \n"
    for i in range(1, 10):
        sql_script += f"uniform(0::float, 10::float, random()) AS FT_{i}, \n"
    sql_script += f"FT_1 + FT_2 AS TARGET, \n"
    sql_script += f"from TABLE(generator(rowcount=>({10000})));"
    return sql_script
session.sql(generate_dataset_sql(session.get_current_database(), session.get_current_schema(), table_name)).collect()
"""

sample_train_df = session.table(table_name)
INPUT_COLS = list(sample_train_df.columns)
LABEL_COL = "TARGET"
INPUT_COLS.remove(LABEL_COL)

params = {
    "eta": 0.1,
    "max_depth": 8,
    "min_child_weight": 100,
    "tree_method": "hist",
}

scaling_config = XGBScalingConfig(
    use_gpu=False
)

estimator = XGBEstimator(
    n_estimators=50,
    objective="reg:squarederror",
    params=params,
    scaling_config=scaling_config,
)
data_connector = DataConnector.from_dataframe(
    sample_train_df, ingestor_class=RayDataIngester
)

xgb_model = estimator.fit(
    data_connector, input_cols=INPUT_COLS, label_col=LABEL_COL
)
```

### Distributed training of PyTorch models

PyTorch models are trained using a training function (`train_func`) that is called in each worker process.

#### Using the context APIs

During the execution of the training function, you can use context APIs to access essential metadata about the
training environment and for parameter forwarding from the caller to the training functions. See
[Related classes](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/distributors#id2) for
documentation of the PyTorch context class.

The context object exposes runtime metadata that you can use to customize the behavior of the training function. You can
retrieve these using the provided methods `get_node_rank`, `get_local_rank`, `get_world_size`, and others.

Tho following code is an example of retrieving the values `test` and `train` from the context object; these are
passed in a key called `dataset_map` (which you can see in the training function example later in this topic).
These values are used to create PyTorch dataset objects that are then passed to the model.

```python
dataset_map = context.get_dataset_map()
train_dataset = DecodedDataset(dataset_map["train"].get_shard().to_torch_dataset())
test_dataset = DecodedDataset(dataset_map["test"].to_torch_dataset())

hyper_parms = context.get_hyper_params()
num_epochs = int(hyper_parms['num_epochs'])
```

#### Metrics reporting

> Use the `metrics_reporter` method of the context object to send metrics from the training function to the
> controlling code. This enables real-time monitoring and debugging of the training process, as shown in the following
> example.
>
> ```python
> context.get_metrics_reporter().log_metrics({"train_func_train_time": int(now-start_time)})
> ```

#### Example

The following example is a training function for a PyTorch model.

```python
def train_func():
    import io
    import base64
    import time
    import torch
    import torch.nn as nn
    import torch.nn.functional as F
    import torch.optim as optim
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP
    from torchvision import transforms
    from torch.utils.data import IterableDataset
    from torch.optim.lr_scheduler import StepLR
    from PIL import Image
    from snowflake.ml.modeling.distributors.pytorch import get_context

    class Net(nn.Module):
        def __init__(self):
            super(Net, self).__init__()
            self.conv1 = nn.Conv2d(1, 32, 3, 1)
            self.conv2 = nn.Conv2d(32, 64, 3, 1)
            self.dropout1 = nn.Dropout(0.25)
            self.dropout2 = nn.Dropout(0.5)
            self.fc1 = nn.Linear(9216, 128)
            self.fc2 = nn.Linear(128, 10)

        def forward(self, x):
            x = self.conv1(x)
            x = F.relu(x)
            x = self.conv2(x)
            x = F.relu(x)
            x = F.max_pool2d(x, 2)
            x = self.dropout1(x)
            x = torch.flatten(x, 1)
            x = self.fc1(x)
            x = F.relu(x)
            x = self.dropout2(x)
            x = self.fc2(x)
            output = F.log_softmax(x, dim=1)
            return output

    class DecodedDataset(IterableDataset):
        def __init__(self, source_dataset):
            self.source_dataset = source_dataset
            self.transforms = transforms.ToTensor()  # Ensure we apply ToTensor transform

        def __iter__(self):
            for row in self.source_dataset:
                base64_image = row['IMAGE']
                image = Image.open(io.BytesIO(base64.b64decode(base64_image)))
                # Convert the image to a tensor
                image = self.transforms(image)  # Converts PIL image to tensor

                labels = row['LABEL']
                yield image, int(labels)

    def train(model, device, train_loader, optimizer, epoch):
        model.train()
        batch_idx = 1
        for data, target in train_loader:
            # print(f"data : {data} \n target: {target}")
            # raise RuntimeError("test")
            data, target = data.to(device), target.to(device)
            optimizer.zero_grad()
            output = model(data)
            loss = F.nll_loss(output, target)
            loss.backward()
            optimizer.step()
            if batch_idx % 100 == 0:
                print('Train Epoch: {} [Processed {} images]\tLoss: {:.6f}'.format(epoch, batch_idx * len(data), loss.item()))
            batch_idx += 1

    context = get_context()
    rank = context.get_local_rank()
    device = f"cuda:{rank}"
    is_distributed = context.get_world_size() > 1
    if is_distributed:
        dist.init_process_group(backend="nccl")
    print(f"Worker Rank : {context.get_rank()}, world_size: {context.get_world_size()}")

    dataset_map = context.get_dataset_map()
    train_dataset = DecodedDataset(dataset_map["train"].get_shard().to_torch_dataset())
    test_dataset = DecodedDataset(dataset_map["test"].to_torch_dataset())

    batch_size = 64
    train_loader = torch.utils.data.DataLoader(
        train_dataset,
        batch_size=batch_size,
        pin_memory=True,
        pin_memory_device=f"cuda:{rank}"
    )
    test_loader = torch.utils.data.DataLoader(
        test_dataset,
        batch_size=batch_size,
        pin_memory=True,
        pin_memory_device=f"cuda:{rank}"
    )

    model = Net().to(device)
    if is_distributed:
        model = DDP(model)
    optimizer = optim.Adadelta(model.parameters())
    scheduler = StepLR(optimizer, step_size=1)

    hyper_parms = context.get_hyper_params()
    num_epochs = int(hyper_parms['num_epochs'])
    start_time = time.time()
    for epoch in range(num_epochs):
        train(model, device, train_loader, optimizer, epoch+1)
        scheduler.step()
    now = time.time()
    context.get_metrics_reporter().log_metrics({"train_func_train_time": int(now-start_time)})
    test(model, device, test_loader, context)
```

The following code illustrates how to kick off distributed training given the preceding training function. The example
creates a PyTorch distributor object to run the training on multiple nodes, connects the training and test data to the
training function via a context object, and establishes the scaling configuration before running the trainer.

```python
# Set up PyTorchDistributor
from snowflake.ml.modeling.distributors.pytorch import PyTorchDistributor, PyTorchScalingConfig, WorkerResourceConfig
from snowflake.ml.data.sharded_data_connector import ShardedDataConnector
from snowflake.ml.data.data_connector import DataConnector

df = session.table("MNIST_60K")

train_df, test_df = df.random_split([0.99, 0.01], 0)

# Create data connectors for training and test data
train_data = ShardedDataConnector.from_dataframe(train_df)
test_data = DataConnector.from_dataframe(test_df)

pytorch_trainer = PyTorchDistributor(
    train_func=train_func,
    scaling_config=PyTorchScalingConfig(  # scaling configuration
        num_nodes=2,
        num_workers_per_node=1,
        resource_requirements_per_worker=WorkerResourceConfig(num_cpus=0, num_gpus=1),
    )
)

# Run the trainer.
results = pytorch_trainer.run(  # accepts context values as parameters
    dataset_map={"train": train_data, "test": test_data},
    hyper_params={"num_epochs": "1"}
)
```

## Known limitations and common issues

These limitations and issues are likely to be addressed before multi-node training on Container Runtime is generally available.

### Scaling operation times out

The scaling operation can fail because the new nodes are not ready within the 12-minute timeout. Possible causes include:

* *Insufficient pool capacity.* You have requested more nodes than the pool’s MAX_NODES. Increase the pool’s MAX_NODES.
* *Resource contention.* 12 minutes may not be enough time to warm the added nodes. Set the pool’s MIN_NODES
  to a larger number to keep some of the nodes warm, or increase the number of active nodes using more than one call to
  `scale_cluster` with a smaller increment. Another option is to use asynchronous mode to skip waitting for all the nodes to be ready:

  > + Use asynchronous mode for non-blocking operations:
  >
  > ```python
  > scale_cluster(3, is_async=True)
  > ```
  >
  > + Increase the timeout threshold:
  >
  > ```python
  > scale_cluster(3, options={"rollback_after_seconds": 1200})
  > ```

### Notebook Name Errors

If you see an error message like “Notebook <name> does not exist or not authorized”, this means the automatically
detected notebook name doesn’t match the current notebook. This can happen when:

* Your notebook name contains special characters like dots and spaces
* The automatic notebook name detection is not working correctly

Solution: Explicitly provide the notebook name parameter. Note that the notebook name needs double quotes to be treated
as an [identifier](../../sql-reference/identifiers-syntax.md):

```python
# Explicitly specifying the notebook name if naming auto detection doesn't work
try:
    scale_cluster(2)
except Exception as e:
    print(e)  # Output: "Notebook "WRONG_NOTEBOOK" does not exist or not authorized"
    scale_cluster(2, notebook_name='"MY_NOTEBOOK"')
```

### SPCS services are not cleaned up after failed scaling operation

When scaling operations fail, the system should clean up all resources created in the operation. However, if this fails,
one or more SPCS services may be left in PENDING or FAILED state. Services in the PENDING state might become ACTIVE
later, or if there is no capacity in the compute pool, stay PENDING forever.

To remove services in the PENDING or FAILED states, scale the cluster to have one node (zero worker nodes). To clean up
all launched services, end the current notebook session by clicking on “End Session” in the notebook interface.

---
title: Create and serve online features
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/create-and-serve-online-features-python.md
section: Snowflake ML
---

# Create and serve online features

Create and serve online features for latency-sensitive machine learning inference workflows. Enable online features on a feature view that you’re creating or update an existing feature view to enable online serving.

> **Important:**
>
> You must have Snowflake version 9.26 or later and `snowflake-ml-python` version 1.18.0 to use online feature serving.

Online feature serving provides the following benefits:

* Low-latency point lookups for real-time inference
* Automatic data synchronization from offline sources
* Fully managed infrastructure and maintenance
* Elastic scaling for demanding workloads

Online feature serving is backed by [online feature tables](../../../sql-reference/commands-feature-store.md).

## Data freshness

A feature view with online feature serving automatically synchronizes data from the offline store.

Use the `target_lag` parameter to configure how often data is synchronized to your online feature table. You can set this value from a minimum of 10 seconds to a maximum of 8 days.

The online feature tables are refreshed in the background using the value that you’ve specified. The online feature table is suspended if there are five consecutive refresh failures.
For information about troubleshooting the refresh failure, check your refresh history.

## Refresh modes

Snowflake uses the following refresh modes to update the data:

* Incremental Refresh: This is the preferred and most efficient mode. Snowflake tracks changes in the sources and merges only the new or updated rows into the online store. This minimizes compute and I/O costs.
* Full Refresh: This mode drops all existing data in the table and reloads everything from the source. It is more resource-intensive and is used when an incremental refresh is not possible.

You can explicitly set the refresh mode to INCREMENTAL or FULL, or set it to AUTO to let Snowflake determine the most efficient available refresh mode.

## Time series data handling

To ensure data consistency, you can specify a `timestamp_col`. When multiple rows with the same primary key are found in the source, Snowflake only ingests the version with the most recent timestamp. If you don’t specify a timestamp column, the most recently processed row takes precedence.

### Provide access to create and serve online features

Before you get started with using the online feature store, you must provide the necessary permissions to the relevant roles.

To provide permissions, use the access control script described in [Access control setup in SQL](rbac.md). After you’ve run the script, grant the following privileges:

```sqlexample
GRANT CREATE ONLINE FEATURE TABLE ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);

GRANT SELECT, MONITOR ON FUTURE ONLINE FEATURE TABLES IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);

GRANT SELECT, MONITOR ON ALL ONLINE FEATURE TABLES IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
```

### Manage and serve online features using the Python API

The following example shows how to configure online feature serving when creating a new feature view. You can use the `OnlineConfig` object to specify the online serving settings, such as the target data freshness lag.

```python
from snowflake.ml.feature_store import FeatureView
from snowflake.ml.feature_store.feature_view import OnlineConfig

online_config = OnlineConfig(enable=True, target_lag="30 seconds")

fv = FeatureView(
    name="MY_FV",
    entities=[entity],
    feature_df=my_df, # Snowpark DataFrame containing feature transformations
    timestamp_col="ts", # optional timestamp column name in the dataframe
    refresh_freq="5 minutes",
    refresh_mode="AUTO", # refresh mode of the feature data
    desc="my feature view", # optional description
    online_config=online_config
)

fv = fs.register_feature_view(feature_view=fv, version="v1")
```

The following are the `OnlineConfig` parameters:

| Parameter | Type | Description | Default |
| --- | --- | --- | --- |
| enable | Boolean, optional | Specifies whether online feature serving should be enabled for the feature view. | Default: False |
| target_lag | Str, optional | String in a “<num> (seconds|minutes|hours|days|s|m|h|d)” format specifying the target data freshness lag. | Default: 10 seconds |

> **Note:**
>
> `refresh_freq` and `OnlineConfig.target_lag` act independently.
> In the example above, the effective target data propagation lag from the source data represented by `my_df` to the online data store will be `refresh_freq + online_config.target_lag`.

## Update a feature view to enable/disable online feature serving

For existing feature views, you can update the online feature serving configuration using the `update_feature_view` method.
You can use this method to enable online feature serving for existing feature views.

Use the following code to enable online feature serving.

```python
# Enable online feature serving

fs.update_feature_view(
    name="<name>",
    version="<version>",
    online_config=OnlineConfig(enable=True, target_lag="5 minutes")
)
```

Use the following code to disable online feature serving.

```python
# Disable online feature serving

fs.update_feature_view(
    name="<name>",
    version="<version>",
    online_config=OnlineConfig(enable=False)
)
```

## Retrieve features from online storage

To retrieve feature values from online storage for a given sample, use the `read_feature_view` method and pass the list of feature names as well as the join keys of the sample:

```python
fs.read_feature_view(
    feature_view=fv,
    keys=[["<k_1>", "<k_2>"]],
    feature_names=["<feature1>", "<feature2>", "<feature3>"],
    store_type=StoreType.ONLINE
)
```

## Suspend/resume online data refresh

Use the following code to temporarily suspend data refresh.

```python
fs.suspend_feature_view(feature_view=fv)
```

Use the following code to resume data refresh.

```python
fs.resume_feature_view(feature_view=fv)
```

These operations suspend/resume both the offline feature view (dynamic table and associated task) and the online feature table (if it exists) to ensure consistent state across all storage types.

## Manually refresh feature view

```python
fs.refresh_feature_view(
    feature_view=fv,
    store_type=<store_type>
)
```

The `store_type` argument specifies whether to refresh offline (`StoreType.OFFLINE`) or online (`StoreType.ONLINE`) feature data.

## View refresh history

```python
fs.get_refresh_history(
    feature_view=fv,
    store_type=store_type
)
```

The `store_type` argument specifies whether to return the offline (`StoreType.OFFLINE`) or online (`StoreType.ONLINE`) store refresh history.

### Understanding costs

Online Feature Tables incur costs across the following consumption modes:

* **Virtual warehouse compute**: Both key lookups and data ingestion operations consume virtual warehouse credits at standard rates. For more information, see [Virtual warehouse credit usage](../../../user-guide/cost-understanding-compute.md).
* **Cloud Services Compute**: Required to identify changes in underlying base objects and determine when refresh operations are needed. For more information, see [Cloud service credit usage](../../../user-guide/cost-understanding-compute.md).
* **Hybrid Table Storage**: Storage costs based on flat monthly rate per GB. It’s more expensive than the cost for traditional Snowflake storage, but identical to the cost to store hybrid tables. For more information, see Table 3(b) in the [Credit Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
* **Hybrid Table Requests**: As of March 1, 2026, hybrid table requests are no longer billed, and metering was disabled soon after this pricing change took effect.

> **Tip:**
>
> Incremental refresh can help reduce costs. Incremental updates are generally more cost-efficient than full refresh, resulting in lower compute and data ingestion costs.

## Cost monitoring

To monitor costs, use these views:

```sqlexample
-- Hybrid table request credits (historical data only; no new events are recorded)
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.HYBRID_TABLE_USAGE_HISTORY;

-- Storage consumption
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.STORAGE_USAGE;
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.DATABASE_STORAGE_USAGE_HISTORY;

-- Overall costs
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.METERING_HISTORY;
```

---
title: Create pipelines and deploy them
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/create-pipelines-deploy.md
section: Snowflake ML
---

# Create pipelines and deploy them

## Overview

Machine learning (ML) workflows typically involve several key stages:

**Data Exploration and Preparation**: This initial phase involves understanding the raw data, cleaning it, handling missing values, and transforming it into a usable format.

**Data Engineering**: Here, raw data is transformed into features that better represent the underlying problem to the predictive models, often involving techniques like scaling, encoding, and creating new features from existing ones.

**Model Development**: In this stage, various ML models are selected, trained on the prepared data, and tuned to optimize their performance. Developed models are rigorously evaluated using appropriate metrics to assess their accuracy, fairness, and generalization capabilities.

**Model Deployment**: Production ready models are saved to a model registry and subsequently deployed for batch or real time predictions on new data.

Initial development of ML models often benefits from an agile, iterative approach, allowing data scientists to quickly experiment with different algorithms and features. However, as models mature and demonstrate value, the focus shifts to operationalization, where pipelines are hardened and automated with CI/CD (Continuous Integration/Continuous Delivery). This automation ensures that changes to code, data pipelines, or models are consistently built, tested, and deployed, leading to more reliable, efficient, and maintainable ML systems.

## Develop

Start with interactive development in a local IDE (e.g., VS Code) or an interactive notebook (Snowflake Notebook or Jupyter). Parameterize inputs (tables, stages, hyperparameters) and keep steps modular for portability. For instance, it can be helpful to have one cell/function for data preparation, another for feature engineering, another for model training, and so-on.

Snowflake provides the following tools for each stage of the machine learning lifecycle:

| Stage | Tool | Usage |
| --- | --- | --- |
| Data Exploration | Snowflake Notebooks | Develop in a managed, browser‑based notebook environment. Use Python and SQL in one place to profile datasets, visualize distributions, and iterate quickly. |
|  | Snowpark DataFrames | Work with familiar DataFrame APIs that push computation down to Snowflake. |
| Data Engineering | Snowpark DataFrames | Build reproducible transforms at warehouse scale using SQL/Python/Scala with pushdown optimization. |
|  | UDFs/UDTFs | Encapsulate custom Python logic as functions or table functions to reuse complex transforms across teams and pipelines. |
|  | Feature Store | Define, register, and serve features with point‑in‑time correctness and reuse across models. Supports consistent offline training sets and low‑latency online retrieval, reducing leakage and duplication. |
| Model Training | Snowflake Notebooks | Train ML models with familiar open source libraries like scikit-learn, XGBoost, and PyTorch in your Snowflake Notebooks. Leverage elastic scale, avoid data movement, and persist models and preprocessing in one place. |
|  | ML Jobs | Offload resource-intensive steps to specialized compute options like high-memory instances, GPU acceleration, and distributed processing from any environment, including local IDEs, notebooks, and externally hosted orchestrators. |
| Model Deployment | Model Registry | Register and version models with lineage and governance controls. Centralizes discovery and promotes safe promotion workflows, audits, and rollback. |
|  | Batch Inference | Serve registered models from Python or SQL, keeping inference close to governed data and simplifying ops with consistent registry-backed execution. |
|  | Real-time Inference | Deploy registered models to managed HTTPS endpoints with autoscaling. Eliminates serving infrastructure, offering simple, secure, low‑latency inference integrated with Snowflake auth and governance. |
|  | Model Monitoring | Create a monitor per model version to materialize inference logs and automatically refresh daily metrics, surfacing drift, performance, and statistical signals in Snowsight. Configure alerts and custom dashboards to compare versions and quickly diagnose data or pipeline issues |
| Workflow Orchestration | Scheduled Notebooks | Parameterize and configure Snowflake Notebooks to execute non-interactively on a schedule. |
|  | Task Graphs | Operationalize your ML pipeline into a Directed Acyclic Graph (DAG) and configure it to run on a schedule or by event based triggers. |
| Security and governance | RBAC, tags, masking, policies | Apply role‑based access, data classification, and masking/row policies to training data, features, and models. Ensures least‑privilege access and compliance throughout the ML lifecycle. |

## Prepare for production

### Prepare code

Before operationalizing your pipeline, prepare your code for productionization. If you started with notebooks, begin by restructuring your code into modular, reusable functions where each major step (data preparation, feature engineering, model training, evaluation) becomes a separate function with clear inputs and outputs. If you already have modular scripts, ensure each function has well-defined interfaces and responsibilities. Parameterize all configuration values like table names and hyperparameters to enable cross-environment deployment. We recommend also authoring an entrypoint script which executes the end-to-end pipeline locally for debugging and future development.

Example directory structure:

```text
ml_pipeline_project/
├── README.md
├── requirements.txt
├── config/
├── src/ml_pipeline/
│   ├── utils/                     # Common utilities
│   ├── data/                      # Data preparation
│   ├── features/                  # Feature engineering
│   ├── models/                    # Model training
│   └── inference/                 # Model inference
├── scripts/
│   ├── run_pipeline.py            # Main entry point
│   └── dag.py
├── tests/
└── notebooks/
```

Example run_pipeline.py script:

```python
import argparse
from ml_pipeline.utils.config_loader import load_config
from ml_pipeline.data.ingestion import load_raw_data
from ml_pipeline.data.validation import validate_data_quality
from ml_pipeline.features.transformers import create_features
from ml_pipeline.models.training import train_model
from ml_pipeline.models.evaluation import evaluate_model
from ml_pipeline.models.registry import register_model

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--config", required=True, help="Config file path")
    parser.add_argument("--env", default="dev", help="Environment (dev/prod)")
    args = parser.parse_args()

    # Load configuration
    config = load_config(args.config, args.env)

    # Execute pipeline stages
    raw_data = load_raw_data(config.data.source_table)
    validate_data_quality(raw_data, config.data.quality_checks)
    features = create_features(raw_data, config.features.transformations)
    model = train_model(features, config.model.hyperparameters)
    metrics = evaluate_model(model, features, config.model.eval_metrics)
    register_model(model, metrics, config.model.registry_name)

if __name__ == "__main__":
    main()
```

### Migrating from Notebooks to ML Jobs

Most code written in Snowflake Notebooks will work in ML Jobs with no code changes necessary. The few aspects to be aware of are:

**Runtime APIs**

Certain distributed ML APIs are only available inside the Container Runtime, and attempting to import them outside the Container Runtime environment will fail. These APIs are available inside ML Jobs, but need to be imported inside the ML Job payload.

```python
# Attempting to import distributed runtime APIs in local/external
# environments will fail!
from snowflake.ml.modeling.distributors.xgboost import XGBEstimator

from snowflake.ml.jobs import remote

@remote(...)
def my_remote_function(...):
  # Move imports *inside* your ML Job payloads
  from snowflake.ml.modeling.distributors.xgboost import XGBEstimator  # This works!
  ...

job = my_remote_function()  # Start ML Job
job.wait()  # Wait for job to complete
```

**Cluster Scaling**

The `scale_cluster()` API only works inside Notebooks and will not work inside ML Jobs. Instead, specify the desired cluster size at job submission time. See [Snowflake Multi-Node ML Jobs](ml-jobs/distributed-ml-jobs.md) for more information.

```python
from snowflake.ml.jobs import remote

@remote(..., target_instances=4)
def my_remote_function(...):
  # 4-node cluster will be provisioned for distributed processing
  # inside this job. The cluster will be automatically cleaned up on
  # job termination.
```

### Pipeline orchestration

Once you’ve prepared your end-to-end pipeline, operationalize your pipeline using an orchestrator like Snowflake Task Graphs, Scheduled Notebooks, or external orchestrators like Airflow. Using an orchestration framework provides several key advantages:

* Fault tolerance and reliability through automatic retries and failure isolation
* Observability with run history, real-time status, and alerts
* Scheduling and coordination for complex dependency graphs and various triggers
* Operational hygiene with version control integration and configuration management

Snowflake ML is compatible with most orchestration frameworks including Airflow, Dagster, and Prefect. If you already have an existing workflow/DAG setup, we recommend simply integrating your existing workflows with Snowflake ML features and offloading compute or data intensive steps to ML Jobs or UDFs. If you do not have an existing DAG setup, you can use Snowflake Task Graphs for a Snowflake native solution.

To set up orchestration with a DAG on Snowflake, follow these high-level steps:

1. Prepare your local pipeline code according to Prepare code
2. Create a new `dag.py` file (or any other name) to hold your DAG definition
3. Implement the DAG form of your pipeline according to this guide
4. Run the `dag.py` script to deploy the Task Graph into your Snowflake account

> **Tip:**
>
> Running a Task Graph script does not necessarily execute the graph; a basic Task Graph script simply defines and deploys the Task Graph. The Task Graph must be separately triggered to execute, either manually or on a schedule.

### Separating development and production

We recommend parameterizing your DAG script to support isolating your development (DEV) and production (PROD) environments. You can use Snowflake connection management, application specific configurations, or any combination of the two to achieve this. The level of isolation needed depends on your governance requirements, but generally we recommend using separate databases for DEV and PROD, where the PROD database is protected by RBAC policies which limit accessibility to administrators and specialized service accounts.

### CI/CD

You can automate validation and deployment of your pipelines using CI/CD pipelines such as Azure Pipelines and GitHub Actions. In general, we recommend testing in a DEV or STAGING environment before deploying to PROD. Best practice is to configure your source control repository with merge gates which validate code changes in DEV before merging into your production branch. Changes to the production branch can be deployed to PROD continuously (i.e. for every change) or on some regular cadence (daily/weekly). Best practice is to run a final validation of the production branch’s state in a DEV or STAGING environment before deploying changes to PROD. Use platform features like GitHub Actions Deployments and Environments to define and configure connections to each deployment environment. Configure your CI/CD pipeline to push your changes into the deployment environment, including:

* (Optional) Building libraries and modules as Python packages and pushing them into a proprietary package feed
* (Optional) Uploading files into a Snowflake Stage

  This is most commonly required when you utilize `snowflake.ml.jobs.submit_from_stage()` in your pipeline

  Alternatively, you can use Snowflake’s GitHub integration to directly track your GitHub repository as a Snowflake Stage
* Running `dag.py` to deploy the Task Graph in the configured environment
* (Optional) Trigger and monitor execution of the newly deployed Task Graph to verify validity

## Additional Resources

* [E2E Task Graph Quickstart](https://quickstarts.snowflake.com/guide/e2e-task-graph/)

---
title: Creating or connecting to a feature store
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/create.md
section: Snowflake ML
---

# Creating or connecting to a feature store

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

Create a feature store or connect to an existing feature store by using the `FeatureStore` constructor, providing a
Snowpark session, database name, feature store name, and default warehouse name. The `mode` parameter determines
whether the feature store is created if it does not already exist.

| Mode | Description |
| --- | --- |
| `CreationMode.FAIL_IF_NOT_EXIST` | Throws an exception if the specified feature store does not exist. Default. |
| `CreationMode.CREATE_IF_NOT_EXIST` | Creates the feature store if it does not exist. |

To create a feature store, use the `CreationMode.CREATE_IF_NOT_EXIST` mode when instantiating `FeatureStore`.
Creating a feature store creates a schema in the specified database with the specified feature store name. Generally,
an administrator role will create the feature store schema and corresponding roles.

You can subsequently connect to the existing feature store by using the default mode, `CreationMode.FAIL_IF_NOT_EXIST`.

The following Python code creates a feature store:

```python
from snowflake.ml.feature_store import FeatureStore, CreationMode

fs = FeatureStore(
        session=session,
        database="MY_DB",
        name="MY_FEATURE_STORE",
        default_warehouse="MY_WH",
        creation_mode=CreationMode.CREATE_IF_NOT_EXIST,
     )
```

> **Tip:**
>
> Storing your feature stores in a dedicated database will make it simpler to [replicate them](replication-sharing.md).

After you have created a feature store, use code like the following to access it:

```python
from snowflake.ml.feature_store import FeatureStore, CreationMode

fs = FeatureStore(
        session=session,
        database="MY_DB",
        name="MY_FEATURE_STORE",
        default_warehouse="MY_WH",
      )
```

---
title: CUDA-X Libraries in Snowflake ML
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/nvidia-cuda-x-libraries.md
section: Snowflake ML
---

# CUDA-X Libraries in Snowflake ML

Use Snowflake Container Runtime’s CUDA-X integrations to seamlessly scale data transformations and ML over GPUs without changing your code. Snowflake has integrated NVIDIA’s cuML and cuDF libraries into the runtime environment. With this integration, you can use libraries such as scikit-learn, umap-learn, or hdbscan with your GPUs. You don’t have to learn new frameworks or handle complex dependencies.

You can run complex processing such as topic modeling, genomics, and pattern recognition without compromising on data sizes or algorithmic
complexity. Reducing the processing time gives you the opportunity to further iterate on your models.

The integration with the CUDA-X libraries enables the GPU-accelerated processing of large datasets in the Snowflake ML Container Runtime. The processing speed can be orders of magnitude faster than using the Container Runtime exclusively.

## NVIDIA CUDA-X Libraries for Data Science

Open-source libraries like cuML and cuDF utilize GPUs for more efficient and scalable data workflows. You can use these libraries to process
data with billions of rows and millions of dimensions. For more information about these libraries, see [NVIDIA CUDA-X Data Science](https://developer.nvidia.com/topics/ai/data-science/cuda-x-data-science-libraries).

CUDA-X DS libraries combine the power of GPUs with commonly used Python libraries for data analytics, machine learning, and graph
analytics—delivering major speedups without requiring teams to rewrite their code. With CUDA-X DS, you can use the GPU speed increases, to
process datasets up to terabytes in size with a single GPU.

NVIDIA cuML can deliver the following performance improvements over CPU workflows:

* Up to 50x for scikit-learn
* Up to 60x for UMAP
* Up to 175x for HDBSCAN

## Use Cases

The integration of the CUDA-X libraries in the Snowflake ML Container Runtime uses GPUs with Scikit-learn and pandas for the following use cases:

### Large-Scale topic modeling

Topic modeling on large and feature-rich data sets requires:

* Using embedding models
* Applying dimensionality reduction at scale
* Using clustering and visualization to extract accurate and relevant topics

GPU parallelism can help you accomplish the preceding workflows more efficiently. By accelerating your processing with cuML, you can transform millions of product reviews from raw text to well-defined topic clusters that can be reduced from hours on CPU to minutes on GPU with no modifications to existing Python code. This highlights the seamless drop-in acceleration for UMAP and HDBSCAN libraries.

For more information about performing topic modeling over GPUs on Snowflake, see
<https://quickstarts.snowflake.com/guide/accelerate-topic-modeling-with-gpus-in-snowflake-ml/#0>

### Computational Genomics Workflows

Use Snowflake’s CUDA-X integrations to significantly accelerate the processing of biological sequences. You can convert DNA sequences into
feature vectors for scalable classification tasks, such as predicting gene families.

Executing pandas and scikit-learn code directly on GPUs with cuDF and cuML speeds up data loading, preprocessing, and ensemble
model training. This GPU acceleration for existing workflows, without code changes, allows researchers to prioritize biological insights and
model design over low-level GPU programming.

## Developing in Snowflake

Use the CUDA-X libraries to develop and deploy GPU-accelerated machine learning models within the Snowflake ML Container Runtime . This
section provides a step-by-step guide for integrating these tools into your Python workflows.

To get started, do the following:

1. Define your Python script in a Snowflake Notebook or an ML Job
2. Select the GPU runtime and a GPU compute pool for your Notebook or ML Job

After you’ve done the preceding steps, run the following code to configure the CUDA-X accelerators in your environment.

```python
#Install cuDF and cuML accelerators for zero code change acceleration

import cuml
cuml.accel.install()
import cudf.pandas
cudf.pandas.install()
```

Now you can run pandas operations directly over GPUs or fit the scikit-learn, umap, or hdbscan model (note that there is no code change
needed to run over GPUs). This example shows how to use `hdbscan` on large datasets:

```python
import hdbscan
from sklearn.datasets import make_blobs

# Generate some sample data with multiple clusters
data, _ = make_blobs(n_samples=500, centers=4, cluster_std=0.8, random_state=42)

# Initialize and fit HDBSCAN
# min_cluster_size: The minimum size of clusters; smaller clusters will be considered noise.
# min_samples: The number of samples in a neighborhood for a point to be considered as a core point.
hdbscan_model = hdbscan.HDBSCAN(min_cluster_size=15, min_samples=5, cluster_selection_epsilon=0.5)
hdbscan_model.fit(data)
```

### Applied Use Case: Topic Modeling at Scale

Computational efficiency is crucial for large scale text analysis and topic modeling. GPUs use parallel processing to reduce processing time
from hours to minutes. This section demonstrates how to accelerate ML models on a dataset of 200,000 beauty product reviews using GPU
acceleration with CUDA-X.

You can use CUDA-X to do the following:

* Transform raw text into numerical representations (embeddings) for machine learning.
* Accelerate dimensionality reduction

To utilize the CUDA libraries, add %load_ext cuml.accel at the beginning of your code. This reduces your processing time from hours to
minutes.

The following example code uses the `SentenceTransformer` class to create embeddings.

```python
from sentence_transformers import SentenceTransformer
model = SentenceTransformer('all-MiniLM-L6-v2')
embeddings = model.encode(texts, show_progress_bar=True)
```

The following example code uses HDBSCAN to reduce high-dimensional data. It retains the cluster topics.

```python
from umap import UMAP
from hdbscan import HDBSCAN
umap_model = UMAP(n_components=15, n_neighbors=15, min_dist=0.0)
hdbscan_model = HDBSCAN(min_cluster_size=100, gen_min_span_tree=True, prediction_data=True)
```

### Applied Use Case: Running complex genomics workflows

Gene family organization, which includes paralogs and orthologs, is crucial for understanding gene evolution, function, and conserved
biological processes.

With the CUDA-X libraries, you can create a classification model to predict gene families from DNA sequences. This model can accelerate
genomic annotation, identify novel gene functions, and provide insights into evolutionary pathways.

The [dataset](https://raw.githubusercontent.com/nageshsinghc4/DNA-Sequence-Machine-learning/master/human_data.txt) has a series of plain
text nucleotide sequences and their corresponding gene family class labels. The classes correspond to seven distinct human gene families.

The following code uses the **nucleotide transformer** from Hugging Face to convert the DNA sequences into vectors. The transformer
tokenizes and batches the sequences to transform each gene sequence into a 1280-feature vector.

```python
%load_ext cudf.pandas
%load_ext cuml.accel

from transformers import AutoTokenizer, AutoModelForMaskedLM
import torch

def get_dna_embeddings(sequences, classes):
    tokens_ids = tokenizer.batch_encode_plus(sequences, return_tensors="pt", padding="longest")["input_ids"].to('cuda:0')

    attention_mask = tokens_ids != tokenizer.pad_token_id
    try:
        torch_outs = model(
            tokens_ids,
            attention_mask=attention_mask,
            encoder_attention_mask=attention_mask,
            output_hidden_states=True
        )
    except:
        return []

    embeddings = torch_outs['hidden_states'][-1].detach()
    attention_mask = torch.unsqueeze(attention_mask, dim=-1)
    mean_sequence_embeddings = torch.sum(attention_mask*embeddings, axis=-2)/torch.sum(attention_mask, axis=1)
    return list(zip(mean_sequence_embeddings.numpy(), classes))

# Load the tokenizer and model
tokenizer = AutoTokenizer.from_pretrained("InstaDeepAI/nucleotide-transformer-500m-human-ref")
model = AutoModelForMaskedLM.from_pretrained("InstaDeepAI/nucleotide-transformer-500m-human-ref")

# Example of obtaining embeddings (simplified)
sequences = ["ATGCCCCAACTAAATACTACCGTATGGCCCACCATAATTACCCCCA", ...]
classes = [0, ...]

genes = []
batch_size=10

emb = get_dna_embeddings(human_genes[i], human_classes[i])
genes += emb
```

You can use the following code to evaluate two ensemble classification models:

* A Random Forest Classifier
* An XGBoost classifier

```python
%load_ext cudf.pandas
%load_ext cuml.accel

import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier

human_dna = pd.read_table(human_url) # This would now run on GPU

genes = []
batch_size=10

human_genes = human_dna['sequence'].tolist()
human_classes = human_dna['class'].tolist()

human_genes = [human_genes[i:i + batch_size] for i in range(0, len(human_genes), batch_size)]
human_classes = [human_classes[i:i + batch_size] for i in range(0, len(human_classes), batch_size)]

# Create the embeddings
for i in tqdm(range(len(human_genes)), desc='Producing embeddings...'):
    emb = get_dna_embeddings(human_genes[i], human_classes[i])
    genes += emb

genes_df = pd.DataFrame(genes, columns=['embeddings', 'class'])
genes_df[[f'emb_{i}' for i in range(1280)]] = pd.DataFrame(genes_df['embeddings'].tolist(), index=genes_df.index) # the embeddings generated above

X, y = genes_df[[f'emb_{i}' for i in range(1280)]], genes_df['class']
X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                   test_size = 0.20,
                                                   random_state=42)

classifier = RandomForestClassifier(n_estimators=200, max_depth=20, max_features=1.0, n_jobs=-1)
classifier.fit(X_train, y_train)
```

## See Also

* [Snowflake ML: End-to-End Machine Learning](overview.md)
* [Snowflake Container Runtime](container-runtime-ml.md)

---
title: Deploy models for Real time Inference (REST API)
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/real-time-inference-rest-api.md
section: Snowflake ML
---

# Deploy models for Real time Inference (REST API)

> **Note:**
>
> Generally available since snowflake-ml-python version 1.25.0.

Use real-time inference for interactive workflows that require low latency. You can deploy any model from the
[Snowflake Model Registry](../model-registry/overview.md) as a managed service with a dedicated HTTP endpoint.
Managed services feature autoscaling and are fully integrated within the Snowflake ecosystem, offering comprehensive observability.

Use online inference for your workflow when:

* Your application requires low latency for immediate responses
* Your model serves as a backend for a user-facing web or mobile application.
* The input to your model can fit within the HTTP payload of the request.
* The service must automatically scale horizontally to handle fluctuating request volumes.

## How It Works

Snowflake simplifies the deployment pipeline by hosting your model as an HTTP server within **Snowpark Container Services (SPCS)**. This
architecture enables you to:

* **Abstract Complexity:** Deploy sophisticated models without managing Docker images or Kubernetes clusters.
* **Scale Performance:** Run large-scale models on distributed GPU clusters for high-performance requirements.
* **Ensure Reliability:** Utilize built-in observability, traffic splitting, and shadow/canary deployments for seamless model upgrades.

### Prerequisites

Before you begin, make sure you have the following:

* A Snowflake account in any commercial AWS, Azure, or Google Cloud region. Government regions are not supported.
* Version 1.8.0 or later of the snowflake-ml-python Python package.
* A model logged into [Snowflake Model Registry](../model-registry/overview.md).
* An understanding of [compute pools](../../snowpark-container-services/working-with-compute-pool.md) and
  related privileges on SPCS.

### Required privileges

Model Serving runs on top of [Snowpark Container Services](../../snowpark-container-services/overview.md).
You need the following privileges to use Model Serving:

* USAGE or OWNERSHIP on the compute pool where the service runs.
  Alternatively, you can use the default System Compute Pools.
* BIND SERVICE ENDPOINT privilege on account to be able to create a public endpoint.
* OWNER or READ privilege on the Model

### Limitations

The following limitations apply to online model serving in Snowpark Container Services.

* Table functions aren’t supported. Your model must have a table function to be deployed to Snowflake.
* Models developed using Snowpark ML modeling classes
  can’t be deployed to environments that have a GPU. As a workaround, you can extract the native model and deploy that. For more information,
  see Deploy a model for online inference.

### Deploy a model for online inference

Snowflake ML uses a model version object to create a model service that handles inference requests. To create a model version object, you
can either log a new model version or
obtain a reference to an existing model version.
After you get your model version object, you can use the following Python code to create a model service and deploy that service to SPCS:

```python
# reg is a snowflake.ml.registry.Registry object
example_mv_object = reg.get_model("mymodel_name").version("version_name") # a snowflake.ml.model.ModelVersion object

example_mv_object.create_service(service_name="myservice",
                  service_compute_pool="my_compute_pool",
                  ingress_enabled=True,
                  gpu_requests=None)
```

`create_service` requires the following arguments:

* service_name: The name of the service that you’re creating. This name must be unique within your Snowflake account.
* service_compute_pool: The name of the compute pool that you’re using to run the model. The compute pool must already exist. If the model
  fits well in System Compute Pools, you can use them
  (`SYSTEM_COMPUTE_POOL_GPU` or `SYSTEM_COMPUTE_POOL_CPU`) too.
* ingress_enabled: This is required to be True to call online inference from outside of Snowflake.
* `gpu_requests`: A string specifying the number of GPUs. For a model that can be run on either a CPU or multiple GPUs, this argument determines whether the model will be run on the CPU or on the GPUs. If the model is of a known type that can only run on a CPU (such as scikit-learn models), the image build fails if you request GPUs. If you’re deploying a new model, it can take up to 10 minutes to create the service for CPU-powered models and 20 minutes for GPU-powered models. If the compute pool is idle or requires resizing, it might take longer to create the service.

The preceding example only shows the required and most commonly used arguments. For a complete list of arguments, see the
ModelVersion API reference.

## Default service configuration

The server running the model that you’ve deployed uses defaults that work for most use-cases:

* *Number of Worker Threads*: For a CPU-powered model, the number of processes that the server uses is twice the number of CPUs plus one.
  GPU-powered models use one worker process. You can override this using the num_workers argument in the create_service call. It is
  **recommended** to specify the smallest GPU node where the model fits into memory. Scale by increasing the number of instances. For example,
  if the model fits in the GPU_NV_S (GPU_NV_SM on Azure) instance type, use gpu_requests=1 and scale up by increasing max_instances. However
  if the smallest available node has 4 GPUs and you only need 2, use `num_workers=2` (that is, gpu available / gpus needed by the
  model).
* *Thread Safety*: Some models are not thread-safe. Therefore, the service loads a separate copy of the model for each worker process. This
  can result in resource depletion for large models.
* *Node Utilization*: By default, one inference server instance requests the whole node by requesting all the CPU and memory of the node it
  runs on. To customize resource allocation per instance, use arguments like cpu_requests, memory_requests, and gpu_requests.
* *Endpoint*: The inference endpoint is named inference and uses port 5000. These cannot be customized. For optimal resource utilization,
  specify the smallest GPU node that can fit the model into memory. Increase the number of instances to scale to your workload. For example,
  if the model fits in the GPU_NV_S (GPU_NV_SM on Azure) instance type, use gpu_requests=1 and scale up by increasing max_instances.

## Container image build behavior

**The Snowflake conda channel is available only in warehouses and is the only source for warehouse dependencies. By default, conda
dependencies for SPCS models obtain their dependencies from conda-forge.**

By default, Snowflake Model Serving builds the container image using the same compute pool that’s used to run the model. The compute pool is
likely overpowered for the process of building images (for example, GPUs are not used in building container images). For the most part,
this won’t significantly impact compute costs. However, if you’re concerned, you can specify a less powerful compute pool to build
images with the image_build_compute_pool argument.

Calling create_service() multiple times does not trigger a build every time you call it.

However, container images might be rebuilt if Snowflake made updates to the inference service, including fixes for vulnerabilities in
dependent packages. When this happens, create_service automatically triggers a rebuild of the image.

## User interface

You can manage deployed models in the Model Registry Snowsight UI. For more information, see [Model inference services](../model-registry/snowsight-ui.md).

### Invoking deployed model

## HTTP endpoints

Every service comes with its internal DNS name. Deploying a service with ingress_enabled also creates a public HTTP endpoint available
outside of Snowflake. Either endpoint can be used to call a service.

You can find the public HTTP endpoint of a service with ingress enabled using the [SHOW ENDPOINTS](../../../sql-reference/sql/show-endpoints.md) command.
The output contains an ingress_url column, which has an entry of the format *unique-service-id*-*account-id*.snowflakecomputing.app. This is the publicly available HTTP endpoint for your service. For private link users, use privatelink_ingress_url instead of ingress_url.

To get the internal DNS name on Snowflake, use the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) command.
The dns_name column of output from this command contains a service’s internal DNS name. To find your service’s port, use the SHOW ENDPOINTS
IN SERVICE command. The port or port_range column contains the port used by a service. You can make internal calls to your service through
the URL <http://>*dns_name*:*port*.

To call any particular methods of the model, use the method name as path to the URL (eg
`https://unique-service-id-account-id.snowflakecomputing.app/method-name` or <http://>*dns_name*:*port*/<method-name>). In a URL,
underscores (_) in the method name are replaced by dashes (-) in the URL. For example, the service name predict_prob is changed to
predict-proba in the URL.

To simplify things, in Python, list_services() API can be called on ModelVersion object:

```python
# mv: snowflake.ml.model.ModelVersion
mv.list_services()
```

It outputs both public endpoint (`inference_endpoint`) and internal endpoint (`internal_endpoint`).

## Authentication

Snowflake supports multiple authentication protocols.
Simplest of all is to use [Programmatic Access Tokens (PAT)](../../../user-guide/programmatic-access-tokens.md)
where token can be passed simply to the request header as `Authorization: Snowflake Token="your_pat_token"`

> **Note:**
>
> All authorization failures such as an incorrect token or lack of network route to the service result in a 404 error. As of today, there is
> no way to distinguish authentication errors from invalid URLs.

## Authorization

By default only service owners can use the endpoint. To allow another role to access the endpoint, service owners can
[grant the service role](../../../sql-reference/sql/grant-service-role.md) ALL_ENDPOINTS_USAGE.

## Request body (or protocol or data format)

Snowflake supports two types of data formats for REST requests. They are inspired by Pandas dataframe particularly because they are well
known in the industry and verifiable by customers using simple Python scripts with a Pandas Dataframe.

> **Tip:**
>
> **Method-to-URL Mapping:** When constructing your request URL, note that underscores (`_`) in your model’s method names are
> automatically replaced by dashes (`-`). For example, if your model method is `predict_proba`, the endpoint URL path becomes
> `/predict-proba`.

Here are the details about the formats

1. `dataframe_split` is a compact, index/columns/data representation.

> * A representation that mirrors `pandas_df.to_json(orient="split")`.

1. `dataframe_records` is a key/value (record-oriented) representation.

> * A representation that mirrors `df.to_json(orient="records")`.

It is **recommended** to use `dataframe_split` format. Since `dataframe_records` repeats column names for each row, it typically
produces a larger request body than `dataframe_split`. This can have a performance impact for large batches or frequent calls.

Model endpoints continue to return a **single output format**, regardless of which input format you use.

1. `dataframe_split` **format (Recommended)**

This matches the structure produced by the Pandas “split” orientation. The request body wraps the following structure under a
`dataframe_split` key:

* `index`: A list of row indices.
* `columns`: A list of column names.
* `data`: A list of rows, where each row is a list of values aligned with the columns.

Example `cURL` Request:

```bash
curl -X POST "<endpoint_url>" \
  -H 'Authorization: Snowflake Token="<pat_token>"' \
  -H 'Content-Type: application/json' \
  -w "\n\n=== RESULT ===\nHTTP Status: %{http_code}\nTotal Time: %{time_total}s\nConnect Time: %{time_connect}s\nServer Processing: %{time_starttransfer}s\nResponse Size: %{size_download} bytes\nRequest Size: %{size_upload} bytes\n" \
 -d '{
       "dataframe_split": {
         "index": [0, 1],
         "columns": ["customer_id", "age", "monthly_spend"],
         "data": [
            [101, 32, 85.5],
            [102, 45, 120.0],
         ]
       }
     }'
```

1. `dataframe_records` **format**

`dataframe_records` matches the structure produced by **Pandas records orientation**:

* A **list of records**, where each record is a dictionary mapping **column names** to **values**.

The request body wraps this list under the `dataframe_records` key:

Example `cURL` Request:

```bash
curl -X POST "<endpoint_url>" \
  -H 'Authorization: Snowflake Token="<pat_token>"' \
  -H 'Content-Type: application/json' \
  -w "\n\n=== RESULT ===\nHTTP Status: %{http_code}\nTotal Time: %{time_total}s\nConnect Time: %{time_connect}s\nServer Processing: %{time_starttransfer}s\nResponse Size: %{size_download} bytes\nRequest Size: %{size_upload} bytes\n" \
 -d '{
       "dataframe_records": [
          {
            "customer_id": 101,
            "age": 32,
            "monthly_spend": 85.5,
          },
          {
            "customer_id": 102,
            "age": 45,
            "monthly_spend": 120.0,
          },
        ]
     }'
```

## Passing parameters

If the model’s signature includes parameters defined with
[ParamSpec](../model-registry/model-signature.md), you can pass parameter values by
including a top-level `params` key in the JSON request body alongside `dataframe_split` or
`dataframe_records`. Only include the parameters you want to override; unspecified parameters use their default
values from the signature.

Example `cURL` request with parameters:

```bash
curl -X POST "<endpoint_url>/predict" \
  -H 'Authorization: Snowflake Token="<pat_token>"' \
  -H 'Content-Type: application/json' \
  -d '{
        "dataframe_split": {
            "index": [0],
            "columns": ["input_text"],
            "data": [["Hello, world!"]]
        },
        "params": {"temperature": 0.9, "max_tokens": 512}
      }'
```

The `params` key works the same way with the `dataframe_records` format:

```bash
curl -X POST "<endpoint_url>/predict" \
  -H 'Authorization: Snowflake Token="<pat_token>"' \
  -H 'Content-Type: application/json' \
  -d '{
        "dataframe_records": [
            {"input_text": "Hello, world!"}
        ],
        "params": {"temperature": 0.9, "max_tokens": 512}
      }'
```

## Python Examples

1. `dataframe_split` format

Snowflake recommends generating the payload by using **Pandas JSON serialization**, and then deserializing with `json.loads` before
sending the request. This ensures that data types are handled consistently.

```python
import json
import pandas as pd
import requests

# Example DataFrame
df = pd.DataFrame(
    {
        "customer_id": [101, 102],
        "age": [32, 45],
        "monthly_spend": [85.5, 120.0],
    }
)

ENDPOINT_URL = "<your endpoint URL>"
HEADERS = {
    "Authorization": f'Snowflake Token="{PAT}"',
    "Content-Type": "application/json"
}

# Use Pandas to generate the JSON, then load it back to a Python dict
split_obj = json.loads(df.to_json(orient="split"))

payload = {
    "dataframe_split": split_obj
}

response = requests.post(
    ENDPOINT_URL,
    headers=HEADERS,
    json=payload,
    timeout=30,
)

result = response.json()
```

Key points:

* Use pd.Dataframe.to_json (eg. `df.to_json(orient="split")` ) to correctly handle types such as timestamps, floats, nulls, categoricals
  etc. which the native json serializer is unfamiliar with.
* `json.loads(...)` converts the JSON string to a Python dictionary so we can properly construct the payload.
* `requests.post(..., json=payload)` serializes the dictionary back to JSON for the HTTP request.

To include parameters, add a `params` key to the payload dictionary:

```python
payload = {
    "dataframe_split": split_obj,
    "params": {"temperature": 0.9, "max_tokens": 512}
}
```

1. `dataframe_records` format

As with `dataframe_split`, use Pandas JSON serialization and `json.loads`:

```python
import json
import pandas as pd
import requests

df = pd.DataFrame(
    {
        "customer_id": [101, 102],
        "age": [32, 45],
        "monthly_spend": [85.5, 120.0],
    }
)

ENDPOINT_URL = "<your endpoint invoke URL>"
HEADERS = {
    "Authorization": "Bearer <your token>",
    "Content-Type": "application/json",
}

records_obj = json.loads(df.to_json(orient="records"))

payload = {
    "dataframe_records": records_obj
}

response = requests.post(
    ENDPOINT_URL,
    headers=HEADERS,
    json=payload,
    timeout=30,
)

response.raise_for_status()
result = response.json()
```

## Next Steps

Explore these detailed guides to optimize and manage your inference services:

* [Example workflows](real-time-inference-examples.md): See end-to-end code for XGBoost (CPU), Hugging Face (GPU), and PyTorch models.
* **Service Management & Scaling:** Learn about autoscaling, manual suspension, and hardware configuration.
* **Stable Endpoints & API Reference:** Deep dive into the Snowflake Gateway, authentication, and data protocols
  (`dataframe_split`).
* **Auto-capture Inference Logs:** Set up automated logging for model monitoring.
* **Troubleshooting:** Common fixes for package conflicts, OOM errors, and build failures.

---
title: Distributed training
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/distributed-training.md
section: Snowflake ML
---

# Distributed training

The Snowflake Container Runtime provides a flexible training environment that you can use to train models on Snowflake’s infrastructure. You can use open source packages, or use Snowflake ML distributed trainers for multi-node and multi-device training.

Distributed trainers automatically scale your machine learning workloads across multiple nodes and GPUs. Snowflake distributors intelligently manage cluster resources without requiring complex configuration, making distributed training accessible and efficient.

**Use standard open source libraries when you**

* Work with small datasets on single-node environments
* Rapidly prototype and experiment with models
* Lift and shift workflows without distributed requirements

**Use Snowflake Distributed Trainers To:**

* Train models on datasets that are larger than the memory of a single compute node
* Utilize multiple GPUs efficiently
* Automatically leverage all compute multi-node MLJobs or scaled notebook clusters

## Snowflake ML distributed training

Snowflake ML provides distributed trainers for popular machine learning frameworks, including XGBoost, LightGBM, and PyTorch. These trainers are optimized to run on Snowflake’s infrastructure and can automatically scale across multiple nodes and GPUs.

* **Automatic Resource Management** - Snowflake automatically discovers and uses all available cluster resources
* **Simplified Setup** - The Container Runtime environment is backed by a Ray cluster provided by Snowflake, with no user configuration required
* **Seamless Snowflake integration** - Direct compatibility with Snowflake data connectors and stages
* **Optional scaling configs** - Advanced users can fine-tune when needed

### Data loading

For both open source and Snowflake distributed trainers, the most performant way to ingest data is with the Snowflake Data Connector:

```python
from snowflake.ml.data.data_connector import DataConnector

# Load data
train_connector = DataConnector.from_dataframe(session.table('TRAINING_DATA'))
eval_connector = DataConnector.from_dataframe(session.table('EVAL_DATA'))
```

### Training methods

#### Open source training

Use standard open source libraries when you need maximum flexibility and control over your training process. With open source training, you directly use popular ML frameworks like XGBoost, LightGBM, and PyTorch with minimal modifications, while still benefiting from Snowflake’s infrastructure and data connectivity.

The following examples train a model with XGBoost and LightGBM.

XGBoostLightGBM

To train with open source XGBoost, after loading data with the data connector, convert it into a pandas dataframe and use the XGB library directly:

```python
import xgboost as xgb

train_df = train_connector.to_pandas()
eval_df = eval_connector.to_pandas()

# Create DMatrix
train_df = train_connector.to_pandas()
dtrain = xgb.DMatrix(train_df[INPUT_COLS], label=train_df[LABEL_COL])
deval = xgb.DMatrix(eval_df)

# Training parameters
params = {
   'objective': 'reg:squarederror',
   'max_depth': 6,
   'learning_rate': 0.1
}

# Train and evaluate model
evals_result = {}
model = xgb.train(
   params,
   dtrain,
   num_boost_round=100,
   evals=[(dtrain, 'train'), (deval, 'valid')],
   evals_result=evals_result
)

# Access the evaluation results
print(evals_result)
```

```python
from snowflake.ml.modeling.distributors.lightgbm import LightGBMEstimator, LightGBMScalingConfig

# Training parameters
params = {
   'objective': 'regression',
   'metric': 'rmse',
   'boosting_type': 'gbdt',
   'num_leaves': 31,
   'learning_rate': 0.05,
   'feature_fraction': 0.9
}

# Automatic scaling (recommended)
estimator = LightGBMEstimator(
   params=params
)

# Call with custom GPU scaling
gpu_estimator = LightGBMEstimator(
   params=params,
   scaling_config=LightGBMScalingConfig(use_gpu=True) # optional - available resources will be used automatically
)

# Train and evaluate
booster = estimator.fit(
   dataset=train_connector,
   input_cols=['age', 'income', 'credit_score'],
   label_col='default_risk',
   eval_set=eval_connector,
   verbose_eval=10
)

# Access results
booster = estimator.get_booster() # If you forgot to save the output of fit, get the booster from the estimator
feature_importance = booster.feature_importance(importance_type='gain')
```

#### Distributed training

The distributed `XGBEstimator` class has a similar API with a few key differences:

* The XGBoost training parameters are passed to the `XGBEstimator` during class initialization through the “params” parameter.
* The DataConnector object can be passed directly into the estimator’s `fit` function, along with the input columns defining the features and the label column defining the target.
* You can provide a scaling configuration when instantiating the `XGBEstimator` class. However, Snowflake defaults to using all available resources.

```python
from snowflake.ml.modeling.distributors.xgboost import XGBEstimator, XGBScalingConfig

# Training parameters
params = {
    'objective': 'reg:squarederror',
    'max_depth': 6,
    'learning_rate': 0.1
}

# Automatic scaling (recommended)
estimator = XGBEstimator(
    params=params
)

# Call with custom GPU scaling
gpu_estimator = XGBEstimator(
    params=params,
    scaling_config=XGBScalingConfig(use_gpu=True) # optional - available resources will be used automatically
)

# Train and evaluate
booster = estimator.fit(
    dataset=train_connector,
    input_cols=['age', 'income', 'credit_score'],
    label_col='default_risk',
    eval_set=eval_connector,
    verbose_eval=10
)

# Access results
booster = estimator.get_booster() # If you forgot to save the output of fit, get the booster from the estimator
feature_importance = booster.get_score(importance_type='gain')
```

#### Evaluating the model

Models can be evaluated by passing an `eval_set` and using `verbose_eval` to print the evaluation data to the console. Additionally, inference can be done as a second step. The distributed estimator offers a `predict` method for convenience, but it will not do inference in a distributed fashion. We recommend converting the fit model into an OSS xgboost estimator after training in order to do inference and to log to the model registry.

#### Registering the model

To register the model to the Snowflake model registry, use the open source booster provided by `estimator.get_booster` and returned from `estimator.fit`. For more information, see [XGBoost](model-registry/built-in-models/xgboost.md).

#### PyTorch

The Snowflake PyTorch Distributor natively supports Distributed Data Parallel models on the Snowflake backend. To use DDP on Snowflake, leverage open source PyTorch modules with a few Snowflake specific modifications:

* Load data using the `ShardedDataConnector` to automatically shard data into the number of partitions that matches the `world_size` of the distributed trainer. Call `get_shard` within a Snowflake training context to retrieve the shard associated with that worker process.
* Inside the training function, use the `context` object to get process specific information like rank, local rank, and the data required for training.
* Save the model using the context’s `get_model_dir` to find the location to store the model to. This will store the model locally for single node training, and sync the model to a Snowflake stage for distributed training. If no stage location is provided, your user stage will be used by default.

#### Load data

```python
# Create ShardedDataConnector for data ingestion
from snowflake.ml.data.sharded_data_connector import ShardedDataConnector

example_snowpark_dataframe = session.table("EXAMPLE_TRAINING_DATA")

data_connector = ShardedDataConnector.from_dataframe(example_snowpark_dataframe)
```

#### Train model

```python
# Import necessary PyTorch libraries
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader

# Define a simple neural network
class SimpleNet(nn.Module):
    def __init__(self, input_size, hidden_size, output_size):
        super(SimpleNet, self).__init__()
        self.fc1 = nn.Linear(input_size, hidden_size)
        self.relu = nn.ReLU()
        self.fc2 = nn.Linear(hidden_size, output_size)

    def forward(self, x):
        x = self.fc1(x)
        x = self.relu(x)
        x = self.fc2(x)
        return x

# Define the training function
def train_func():
    import torch.distributed as dist
    from torch.nn.parallel import DistributedDataParallel as DDP
    from snowflake.ml.modeling.distributors.pytorch import get_context

    # Use the Snowflake context to get the necessary methods to manage and retrieve information about the distributed training environment
    context = get_context()
    rank = context.get_rank()
    dist.init_process_group(backend='gloo')
    device = torch.device(f"cuda:{context.get_local_rank()}"
                         if torch.cuda.is_available() else "cpu")

    # Initialize model, loss function, and optimizer
    model = SimpleNet(input_size=len(input_cols), hidden_size=32, output_size=1).to(device)
    model = DDP(model)
    criterion = nn.MSELoss()
    optimizer = optim.Adam(model.parameters(), lr=0.001)

    # Retrieve training data
    dataset_map = context.get_dataset_map()
    torch_dataset = dataset_map['train'].get_shard().to_torch_dataset(batch_size=1024)
    dataloader = DataLoader(torch_dataset)

    # Training loop
    for epoch in range(10):
        for batch_dict in dataloader:
            features = torch.cat([batch_dict[col].T for col in input_cols], dim=1).float().to(device)
            labels = batch_dict[label_col].T.squeeze(0).float().to(device)
            output = model(features)
            loss = criterion(output, labels.unsqueeze(1))

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
        print(f'Epoch [{epoch+1}/10], Loss: {loss.item():.4f}')

    # Save the model to the model directory provided by the context
    if context.get_rank() == 0:
        torch.save(
            model.module.state_dict(), os.path.join(context.get_model_dir(), "model.pt")
        )

# Set up PyTorchDistributor for distributed training
from snowflake.ml.modeling.distributors.pytorch import PyTorchDistributor, PyTorchScalingConfig, WorkerResourceConfig

pytorch_trainer = PyTorchDistributor(
    train_func=train_func,
    # Optional Scaling Configuration, for single node multi-GPU training.
    scaling_config=PyTorchScalingConfig(
        num_nodes=1,
        num_workers_per_node=1,
        resource_requirements_per_worker=WorkerResourceConfig(num_cpus=0, num_gpus=4)
    )
)

# Run the training process
pytorch_trainer.run(dataset_map={'train': data_connector})
```

#### Retrieving the model

If you are using multi-node DDP, the model is automatically synchronized to a Snowflake stage as the shared persistent storage.

The following code gets the model from a stage. It uses the `artifact_stage_location` parameter to specify the location of the stage that stores the model artifact.

The function saved in the `stage_location` variable gets the location of the model in the stage after training completes. The model artifact is saved under `"DB_NAME.SCHEMA_NAME.STAGE_NAME/model/{request_id}"`.

```python
response = pytorch_trainer.run(
        dataset_map={'train': data_connector},
        artifact_stage_location="DB_NAME.SCHEMA_NAME.STAGE_NAME",
    )

stage_location = response.get_model_dir()
```

---
title: Engineer features
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/transform-data.md
section: Snowflake ML
---

# Engineer features

Snowflake ML allows you to transform your raw data into features, allowing for efficient use by machine learning models.
You can transform data using several approaches, each suited for different scales and requirements:

* **Open Source Software (OSS) preprocessors** - For small to medium datasets and quick prototyping, use familiar Python ML libraries that run locally or on single nodes within Container Runtime.
* **Snowflake ML Preprocessors** - For larger datasets, use Snowflake ML’s preprocessing APIs that execute natively on the Snowflake platform. These APIs distribute the processing across warehouse compute resources.
* **Ray map_batches** - For highly customizable large-scale processing, especially with unstructured data, use parallel, resource-managed execution across single-node or multi-node Container Runtime environments.

Choose the approach that best matches your data size, performance requirements, and need for custom transformation logic.

The following table shows detailed comparisons of three main approaches for feature engineering in Snowflake ML:

| Feature/Aspect | OSS (including scikit-learn) | Snowflake ML preprocessors | Ray `map_batches` |
| --- | --- | --- | --- |
| Scale | Small & medium datasets | Large/distributed data | Large/distributed data |
| Execution Environment | In memory | Pushdown to the default warehouse that you’re using to run SQL queries | Across nodes in a compute pool |
| Compute Resources | Snowpark Container Services (Compute Pool) | Warehouse | Snowpark Container Services (Compute Pool) |
| Integration | Standard Python ML ecosystem | Integrates natively with Snowflake ML | Both with Python ML and Snowflake |
| Performance | Fast for local, in-memory workloads; scale limited and non-distributed | Designed for scalable, distributed feature engineering | Highly parallel and resource-managed, excels on large/unstructured data |
| Use Case Suitability | Quickly prototyping and experimentation | Production workflows with large datasets | Large data workflows that require custom resource controls |

The following examples demonstrate how to implement feature transformations using each approach:

OSS scikit-learnSnowflake ML PreprocessorsRay map_batches

Use the following code to implement scikit-learn for your preprocessing workflows:

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler, OneHotEncoder
from sklearn.pipeline import Pipeline
from sklearn.compose import ColumnTransformer

# Load your data locally into a Pandas DataFrame
df = pd.DataFrame({
    'age': [34, 23, 54, 31],
    'city': ['SF', 'NY', 'SF', 'LA'],
    'income': [120000, 95000, 135000, 99000]
})

# Define preprocessing steps
numeric_features = ['age', 'income']
numeric_transformer = StandardScaler()

categorical_features = ['city']
categorical_transformer = OneHotEncoder()

preprocessor = ColumnTransformer(
    transformers=[
        ('num', numeric_transformer, numeric_features),
        ('cat', categorical_transformer, categorical_features)
    ]
)

pipeline = Pipeline(steps=[
    ('preprocessor', preprocessor)
])

# Preprocess the data
X_processed = pipeline.fit_transform(df)
print(X_processed)
```

Snowflake ML preprocessors handle distributed transformations directly within Snowflake. These preprocessors are pushed down to scale across warehouses.
Use Snowflake ML preprocessors for large datasets and production workloads.

> **Note:**
>
> The Snowflake ML preprocessors are a subset of the preprocessors available in sci-kit learn, but they cover the most common use cases.
> For information about the available preprocessors, see [Snowflake ML modeling preprocessing](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling#snowflake-ml-modeling-preprocessing).

The following code uses the `StandardScaler` and `OneHotEncoder` libraries.

```python
from snowflake.snowpark import Session
from snowflake.ml.modeling.preprocessing import StandardScaler, OneHotEncoder
from snowflake.ml.modeling.pipeline import Pipeline

# Assume your Snowflake connection details are configured
session = Session.builder.configs(...).create()

# Load your data from a Snowflake table as a DataFrame
df = session.table('CUSTOMER_DATA')

# Define Snowflake ML preprocessors
scaler = StandardScaler(input_cols=['AGE', 'INCOME'], output_cols=['AGE_SCALED', 'INCOME_SCALED'])
encoder = OneHotEncoder(input_cols=['CITY'], output_cols=['CITY_ENCODED'])

pipeline = Pipeline(steps=[
    ('scaling', scaler),
    ('encoding', encoder)
])

# Fit and transform data in Snowflake (distributed)
result = pipeline.fit_transform(df)
result.show()
```

Use Ray for distributed, parallel processing with custom transformations. Ray `map_batches` uses lazy execution, meaning processing won’t happen until you materialize the datasets, which helps reduce memory usage. This approach is ideal for large-scale data processing with custom logic:

```python
import ray
from snowflake.ml.ray.datasource.stage_parquet_file_datasource import SFStageParquetDataSource
from snowflake.ml.data.data_connector import DataConnector

# Example for data transform
def preprocess_batch(batch: pd.DataFrame) -> pd.DataFrame:
    batch['AGE_SCALED'] = (batch['age'] - batch['age'].mean()) / batch['age'].std()
    return batch

# Example of filtering
def filter_by_value(row):
    return row['city'] != 'LA'

# Build Ray dataset from provided datasources
ray_ds = ray.data.read_datasource(data_source)

# Setup filter operations, not executed yet
filtered_ds = ray_ds.filter(filter_by_value)

transformed_ds = filtered_ds.map_batches(example_transform_batch_function)

# Create DataConnector directly from ray dataset
data_connector = DataConnector.from_ray_dataset(transformed_ds)
```

---
title: Example workflows
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/real-time-inference-examples.md
section: Snowflake ML
---

# Example workflows

This page provides example workflows for deploying machine learning models for real-time inference using Snowpark Container Services (SPCS). Each example demonstrates the complete lifecycle from model registration to deployment and inference.

This includes:

* How to create services, make predictions, and access models via HTTP endpoints.
* How to use different model architectures (XGBoost, Hugging Face transformers, PyTorch) and compute options (CPU and GPU).

## Deploy an XGBoost model for CPU-powered inference

The following code:

> * Deploys an XGBoost model for inference in SPCS
> * Uses the deployed model for inference.

```python
from snowflake.ml.registry import registry
from snowflake.ml.utils.connection_params import SnowflakeLoginOptions
from snowflake.snowpark import Session

from xgboost import XGBRegressor

# your model training code here output of which is a trained xgb_model

# Open model registry
reg = registry.Registry(session=session, database_name='my_registry_db', schema_name='my_registry_schema')

# Log the model in Snowflake Model Registry
model_ref = reg.log_model(
    model_name="my_xgb_forecasting_model",
    version_name="v1",
    model=xgb_model,
    conda_dependencies=["scikit-learn","xgboost"],
    sample_input_data=pandas_test_df,
    comment="XGBoost model for forecasting customer demand"
)

# Deploy the model to SPCS
model_ref.create_service(
    service_name="forecast_model_service",
    service_compute_pool="my_cpu_pool",
    ingress_enabled=True)

# See all services running a model
model_ref.list_services()

# Run on SPCS
model_ref.run(pandas_test_df, function_name="predict", service_name="forecast_model_service")

# Delete the service
model_ref.delete_service("forecast_model_service")
```

### Calling via HTTP (External Application)

Since this model has ingress enabled (`ingress_enabled=True`), you can call its public HTTP endpoint. The following example uses a PAT stored in the environment variable `PAT_TOKEN` to authenticate with a public Snowflake endpoint:

```python
import os
import json
import numpy as np
from pprint import pprint
import requests

def get_headers(pat_token):
    headers = {'Authorization': f'Snowflake Token="{pat_token}"'}
    return headers

headers = get_headers(os.getenv("PAT_TOKEN"))

# Put the endpoint url with method name `predict`
# The endpoint url can be found with `show endpoints in service <service_name>`.
URL = 'https://<random_str>-<organization>-<account>.snowflakecomputing.app/predict'

# Prepare data to be sent
data = {"data": np.column_stack([range(pandas_test_df.shape[0]), pandas_test_df.values]).tolist()}

# Send over HTTP
def send_request(data: dict):
    output = requests.post(URL, json=data, headers=headers)
    assert (output.status_code == 200), f"Failed to get response from the service. Status code: {output.status_code}"
    return output.content

# Test
results = send_request(data=data)
print(json.loads(results))
```

## Deploy a Hugging Face sentence transformer for GPU-powered inference

The following code trains and deploys a Hugging Face sentence transformer, including an HTTP endpoint.

This example requires the `sentence-transformers` package, a GPU compute pool and an image repository.

```python
from snowflake.ml.registry import registry
from snowflake.ml.utils.connection_params import SnowflakeLoginOptions
from snowflake.snowpark import Session
from sentence_transformers import SentenceTransformer

session = Session.builder.configs(SnowflakeLoginOptions("connection_name")).create()
reg = registry.Registry(session=session, database_name='my_registry_db', schema_name='my_registry_schema')

# Take an example sentence transformer from HF
embed_model = SentenceTransformer('sentence-transformers/all-MiniLM-L6-v2')

# Have some sample input data
input_data = [
    "This is the first sentence.",
    "Here's another sentence for testing.",
    "The quick brown fox jumps over the lazy dog.",
    "I love coding and programming.",
    "Machine learning is an exciting field.",
    "Python is a popular programming language.",
    "I enjoy working with data.",
    "Deep learning models are powerful.",
    "Natural language processing is fascinating.",
    "I want to improve my NLP skills.",
]

# Log the model with pip dependencies
pip_model = reg.log_model(
    embed_model,
    model_name="sentence_transformer_minilm",
    version_name="pip",
    sample_input_data=input_data,  # Needed for determining signature of the model
    pip_requirements=["sentence-transformers", "torch", "transformers"], # If you want to run this model in the Warehouse, you can use conda_dependencies instead
)

# Force Snowflake to not try to check warehouse
conda_forge_model = reg.log_model(
    embed_model,
    model_name="sentence_transformer_minilm",
    version_name="conda_forge_force",
    sample_input_data=input_data,
    # setting any package from conda-forge is sufficient to know that it can't be run in warehouse
    conda_dependencies=["sentence-transformers", "conda-forge::pytorch", "transformers"]
)

# Deploy the model to SPCS
pip_model.create_service(
    service_name="my_minilm_service",
    service_compute_pool="my_gpu_pool",  # Using GPU_NV_S - smallest GPU node that can run the model
    ingress_enabled=True,
    gpu_requests="1", # Model fits in GPU memory; only needed for GPU pool
    max_instances=4, # 4 instances were able to run 10M inferences from an XS warehouse
)

# See all services running a model
pip_model.list_services()

# Run on SPCS
pip_model.run(input_data, function_name="encode", service_name="my_minilm_service")

# Delete the service
pip_model.delete_service("my_minilm_service")
```

In SQL, you can call the service function as follows:

```sqlexample
SELECT my_minilm_service!encode('This is a test sentence.');
```

Similarly, you can call its HTTP endpoint as follows.

```python
import json
from pprint import pprint
import requests

# Put the endpoint url with method name `encode`
URL='https://<random_str>-<account>.snowflakecomputing.app/encode'

# Prepare data to be sent
data = {
    'data': []
}
for idx, x in enumerate(input_data):
    data['data'].append([idx, x])

# Send over HTTP
def send_request(data: dict):
    output = requests.post(URL, json=data, headers=headers)
    assert (output.status_code == 200), f"Failed to get response from the service. Status code: {output.status_code}"
    return output.content

# Test
results = send_request(data=data)
pprint(json.loads(results))
```

## Deploy a PyTorch model for GPU-powered inference

For an example of training and deploying a PyTorch deep learning recommendation model (DLRM) to SPCS for GPU inference, see this [quickstart](https://quickstarts.snowflake.com/guide/snowpark-container-services-model-serving-guide/)

## Deploy a Snowpark ML modeling model

Models developed using Snowpark ML modeling classes cannot be deployed to environments that have a GPU. As a workaround, you can extract the native model and deploy that. For example:

```python
# Train a model using Snowpark ML
from snowflake.ml.modeling.xgboost import XGBRegressor
regressor = XGBRegressor(...)
regressor.fit(training_df)

# Extract the native model
xgb_model = regressor.to_xgboost()
# Test the model with pandas dataframe
pandas_test_df = test_df.select(['FEATURE1', 'FEATURE2', ...]).to_pandas()
xgb_model.predict(pandas_test_df)

# Log the model in Snowflake Model Registry
mv = reg.log_model(xgb_model,
                   model_name="my_native_xgb_model",
                   sample_input_data=pandas_test_df,
                   comment = 'A native XGB model trained from Snowflake Modeling API',
                   )
# Now we should be able to deploy to a GPU compute pool on SPCS
mv.create_service(
    service_name="my_service_gpu",
    service_compute_pool="my_gpu_pool",
    image_repo="my_repo",
    max_instances=1,
    gpu_requests="1",
)
```

---
title: Examples and Quickstarts
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/examples-and-quickstarts.md
section: Snowflake ML
---

# Examples and Quickstarts

This topic contains several examples and quickstarts for common use cases for model logging and model inference in
Snowflake ML. You can use these examples as a starting point for your own use case.

## Beginner Quickstart

Getting started with Snowflake ML: train an xgboost regression model, log to model registry, and run inference in a
Warehouse.

[Quickstart](https://quickstarts.snowflake.com/guide/intro_to_machine_learning_with_snowpark_ml_for_python/)

## xgboost model, CPU inference in Snowpark Container Services

This code illustrates the key steps in deploying an XGBoost model in Snowpark Container Services (SPCS), then using the deployed model for inference.

For more information, see [Deploy models for Real time Inference (REST API)](../inference/real-time-inference-rest-api.md).

## Log a pipeline with custom preprocessing and model training

This example illustrates how to:

* Perform feature engineering.
* Train a pipeline with custom preprocessing steps and an xgboost forecasting model.
* Run hyperparameter optimization.
* Log the optimum pipeline.
* Run inference in a warehouse or in Snowpark Container Services (SPCS).

[Example Notebook](https://github.com/rajshah4/snowflake-notebooks/blob/main/Forecasting_ChicagoBus/Snowpark_Forecasting_Bus_FeatureStore.ipynb)

## Getting Started with Model Serving in Snowpark Container Services

This example illustrates how to:

* Train, register, and version a model using the Snowflake Model Registry.
* Deploy a model as a service in Snowpark Container Services.
* Access the deployed model endpoint using REST API with both Key-Pair and Programmatic Access Token (PAT) authentication.

[Quickstart](https://quickstarts.snowflake.com/guide/snowpark-container-services-model-serving-guide/)

## Large scale open source embeddings model, GPU inference

This example uses Snowflake Notebooks on Container Runtime to train a large-scale embeddings model from the Hugging Face
`sentence_transformer` library and run large scale predictions using GPUs on Snowpark Container Services (SPCS).

[Quickstart](https://quickstarts.snowflake.com/guide/scale-embeddings-with-snowflake-notebooks-on-container-runtime/)

## Complete pipeline with distributed PyTorch recommender model, GPU inference

This example shows how to build an end-to-end distributed Pytorch recommender model using GPUs, deploying the model for GPU inference on Snowpark Container Services (SPCS).

[Quickstart](https://quickstarts.snowflake.com/guide/getting-started-with-running-distributed-pytorch-models-on-snowflake/)

## Bring an existing model trained externally (eg. AWS Sagemaker/Azure ML/GCP Vertex AI) to Snowflake

These examples show how to bring your existing model in AWS Sagemaker, Azure ML, or GCP Vertex AI to Snowflake (see [blog post](https://medium.com/snowflake/integrating-machine-learning-models-with-snowpark-ml-a-guide-for-azureml-and-sagemaker-users-735292843a7b) for more details).

* AWS and Azure ML [Quickstart](https://quickstarts.snowflake.com/guide/deploying_models_from_azureml_and_sagemaker_to_snowparkml/)
* GCP Vertex AI [Quickstart](https://quickstarts.snowflake.com/guide/getting_started_with_snowpark_for_machine_learning_on_vertexai/)

## Bring an MLFlow PyFunc model to Snowflake

This example shows how to log an MLFlow PyFunc model in the Snowflake Model Registry and run inference.

[Example](built-in-models/mlflow.md)

## Log a partitioned forecasting model for training and inference

This example shows how to log a forecasting model for running partitioned training and inference in Snowflake.

[Quickstart](https://quickstarts.snowflake.com/guide/partitioned-ml-model/)

## Log many models as a collection for running partitioned inference at scale

This example shows how to log thousands of models as a custom partitioned model for running distributed, partitioned inference.

[Quickstart](https://quickstarts.snowflake.com/guide/many-model-inference-in-snowflake/)

---
title: Find Feature Store objects
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/feature-store-ui.md
section: Snowflake ML
---

# Find Feature Store objects

After you create entities and feature views, you can use the Feature Store User Interface in Snowsight to find the objects you need. Use the search bar to search for Feature Store objects, such as the following:

* Feature views
* Feature column names
* Description names
* Entity names

When you search for an object, Snowflake does a universal search across all feature store object names and metadata.
Snowflake searches through all of the metadata to return the best possible result.

For example, you might have a feature view called `rider_features`, that has this comment: “demographic features for all users who are signed up as passengers of ride share services”. If your search query is “passenger features”, the search results will return the `rider_features` view even though the search query didn’t include “rider”.

For more information about the universal search, see [Search Snowflake objects and resources](../../../user-guide/ui-snowsight-universal-search.md).

> **Important:**
>
> For information about the privileges you need to access features within the feature store, see [Snowflake Feature Store access control model](rbac.md).

To access the Feature Store User Interface, do the following:

* Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
* In the navigation menu, select AI & ML » Features.

The landing page lists all the feature views within the feature store that you’ve selected. It also includes summary information about each feature view, such as the following:

* Number of versions
* Description
* Feature column names
* Entity name

The following image shows the feature views:

At the top of the page, select the Entities tab to see the Feature Views organized by entity.
The view also shows you the join keys used by the entity to get the features.

To see the details of a feature view, do the following:

* Select the feature view.
* Select the version you want to view from the dropdown in the top-right corner.

You can now see the details of the feature view, such as the following:

* Whether it’s a dynamic table or a view
* Feature column details

The following image shows where you can see whether the feature view is a dynamic table or a view:

> **Note:**
>
> You can use the Lineage tab to display the end to end lineage of the source data and the downstream objects from the feature view. Lineage tracking is a public preview feature.

For a dynamic table, to view information about its metrics and refresh history, select the table name.

To delete a feature view or refresh a dynamic table, select the … button.

The following image shows the lineage of a feature view along with the … UI element:

---
title: Force plots
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-explainability-visualization/force-plots.md
section: Snowflake ML
---

# Force plots

Use the `plot_force()` function to create a visualization that shows how each feature contributes to your model’s prediction.
A feature’s contribution is represented by an arrow that directs the model’s prediction higher or lower from the base value.

The size of the arrow in the force plot corresponds to the size of the magnitude.
In the preceding figure, `feature_5` has the largest positive influence, pushing the prediction higher, while `feature_4` has the largest negative influence, pulling the prediction lower. The final predicted value is approximately 4.

## Required arguments

| Argument | Description |
| --- | --- |
| `shap_row` | A pandas Series or Snowpark Row containing SHAP values for a specific instance. SHAP values represent how much each feature contributes to the prediction. |
| `features_row` | A pandas Series or Snowpark Row containing the actual feature values for the same instance. These values are shown alongside their contributions. |

## Optional arguments

| Argument | Description |
| --- | --- |
| `base_value` | The base value that represents the model’s average prediction. This defaults to 0.0 but should typically be set to the model’s mean prediction value. |
| `figsize` | A tuple of (width, height) that controls the size of the plot. Uses a default size of (1400, 500) if not specified. |
| `contribution_threshold` | A float between 0 and 1 that filters which features to display. Only features with absolute SHAP values greater than this threshold (as a percentage of total absolute SHAP values) will be shown. Defaults to 0.05 (5%). |

The function returns a chart that visualizes the following items:

1. The model’s prediction as a starting point
2. Positive contributions (pushing prediction higher) in red
3. Negative contributions (pushing prediction lower) in blue
4. Feature names, feature values, and influence values as annotations

The visualization can be helpful to understand the following data points:

* Which features have the strongest influence on a specific prediction
* Whether each feature pushes the prediction higher or lower
* The magnitude of each feature’s contribution
* How the features combine to arrive at the final prediction

> **Note:**
>
> If no features meet the contribution threshold, or if an invalid threshold is provided (not between 0 and 1), the function will raise a `SnowflakeMLException`.

---
title: Hugging Face pipeline
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/hugging-face.md
section: Snowflake ML
---

# Hugging Face pipeline

The Snowflake Model Registry supports any Hugging Face model defined as a
[transformer](https://huggingface.co/docs/transformers/index) that can be loaded with the [transformers.Pipeline](https://huggingface.co/docs/transformers/main_classes/pipelines#transformers.pipeline) method.

Use one of the following methods to log a Hugging Face model to the Model Registry:

1. Import and deploy a model from Hugging Face using Snowsight. See [Import and deploy models from an external service](../snowsight-ui.md) for instructions.
2. Create a `snowflake.ml.model.models.huggingface.TransformersPipeline` instance and call [`log_model()`](/developer-guide/snowpark-ml/reference/latest/api/registry/snowflake.ml.registry.Registry.md "(in Snowpark ML API Reference (Python))"):

   ```python
   # reg: snowflake.ml.registry.Registry

   from snowflake.ml.model.models import huggingface

   model = huggingface.TransformersPipeline(
       task="text-classification",
       model="ProsusAI/finbert",
       # compute_pool_for_log=... # Optional
   )

   mv = reg.log_model(model, model_name='finbert', version_name='v5')
   ```

   > **Important:**
   > * If you don’t specify a `compute_pool_for_log` argument, the model is logged using the default CPU compute pool.
   > * If you specify a `compute_pool_for_log` argument, the model is logged using the specified compute pool.
   > * If you specify `compute_pool_for_log` argument as None, the model files are downloaded locally and then uploaded to the model registry. This requires [huggingface-hub](https://pypi.org/project/huggingface-hub/) to be installed.
3. Load the model from Hugging Face in memory and log it to Model Registry:

   > ```python
   > # reg: snowflake.ml.registry.Registry
   >
   > lm_hf_model = transformers.pipeline(
   >     task="text-generation",
   >     model="bigscience/bloom-560m",
   >     token="...",  # Put your HuggingFace token here.
   >     return_full_text=False,
   >     max_new_tokens=100,
   > )
   >
   > lmv = reg.log_model(lm_hf_model, model_name='bloom', version_name='v560m')
   > ```

If you are using Snowflake Notebooks, in order to download the weights of the model, you need to have an external access integration attached to your notebook. This integration is required to allow egress to the following hosts:

* `huggingface.co`
* `hub-ci.huggingface.co`
* `cdn-lfs-us-1.hf.co`
* `cdn-lfs-eu-1.hf.co`
* `cdn-lfs.hf.co`
* `transfer.xethub.hf.co`
* `cas-server.xethub.hf.co`
* `cas-bridge.xethub.hf.c`

> **Note:**
>
> This list of hosts are only those required for accessing Hugging Face, and may change at any time. Your model may require artifacts from other sources, which should be added to the network rule as allowed for egress.

The following example creates a new external access integration `huggingface_network_rule` for use with a Notebook:

```sqlexample
CREATE NETWORK RULE huggingface_network_rule
TYPE = HOST_PORT
VALUE_LIST = (
    'huggingface.co',
    'hub-ci.huggingface.co',
    'cdn-lfs-us-1.hf.co',
    'cdn-lfs-eu-1.hf.co',
    'cdn-lfs.hf.co',
    'transfer.xethub.hf.co',
    'cas-server.xethub.hf.co',
    'cas-bridge.xethub.hf.co'
)
MODE = EGRESS
COMMENT = 'Network Rule for Hugging Face external access';

CREATE EXTERNAL ACCESS INTEGRATION huggingface_access_integration
ALLOWED_NETWORK_RULES = (huggingface_network_rule)
ENABLED = true;
```

See [Creating and using an external access integration](../../../external-network-access/creating-using-external-network-access.md) for more information.

Once your external access integration is created, attach it to your Notebook and have access to the Hugging Face model repository to download the weights and configurations of the model. See [Set up external access for Snowflake Notebooks](../../../../user-guide/ui-snowsight/notebooks-external-access.md) for more information.

## Model Registry API

When calling `log_model()`, the `options` dictionary supports the following keys:

| Option key | Description | Type |
| --- | --- | --- |
| `target_methods` | A list of methods available on the model object. Hugging Face models use the object’s `__call__` method by default, if it exists. | `list[str]` |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with a GPU. If set to `None`, the model can’t be deployed to a platform with a GPU. Defaults to `12.4`. | `Optional[str]` |

The model registry infers the `signatures` argument if the pipeline contains a task from the following list:

* fill-mask
* question-answering (single output, multiple outputs)
* summarization
* table-question-answering
* text2text-generation
* text-classification (single output, multiple outputs)
* sentiment-analysis (single output, multiple outputs)
* text-generation (with OpenAI-compatible settings)
* token-classification
* ner
* translation
* translation_xx_to_yy, where `xx` and `yy` are two-letter country codes defined in [ISO 3166-1 alpha-2](https://en.wikipedia.org/wiki/List_of_ISO_3166_country_codes#Current_ISO_3166_country_codes)
* zero-shot-classification

> **Note:**
>
> Task names are case-sensitive.

The `sample_input_data` argument to `log_model` is ignored for Hugging Face models. Specify the `signatures` argument
when logging a Hugging Face model that is not in the preceding list so that the registry knows the signatures of the target
methods.

To see the inferred signature, call the [`show_functions()`](/developer-guide/snowpark-ml/reference/latest/api/model/snowflake.ml.model.ModelVersion.md "(in Snowpark ML API Reference (Python))") method. This signature gives you the required types and column names for model function input, as well as the format of its output. The following example shows the signature for the model `bigscience/bloom-560m` with a task of `text-generation`:

```output
{'name': '__CALL__',
  'target_method': '__call__',
  'signature': ModelSignature(
                      inputs=[
                          FeatureSpec(dtype=DataType.STRING, name='inputs')
                      ],
                      outputs=[
                          FeatureSpec(dtype=DataType.STRING, name='outputs')
                      ]
                  )}]
```

The following example shows how to invoke a model using the previous signature:

```python
# model: snowflake.ml.model.ModelVersion

import pandas as pd

remote_prediction = model.run(pd.DataFrame(["Hello, how are you?"], columns=["inputs"]))
```

## Usage notes

* Many Hugging Face models are large and don’t fit in a standard warehouse. Use a Snowpark-optimized warehouse or choose
  a smaller version of the model. For example, an alternative to the `Llama-2-70b-chat-hf` model is `Llama-2-7b-chat-hf`.
* Snowflake warehouses do not have GPUs. Use only CPU-optimized Hugging Face models.
* Some Hugging Face transformers return an array of dictionaries per input row. The model registry converts this array of dictionaries to a
  string containing a JSON representation of the array. For example, multi-output Question Answering output looks similar to this:

  ```output
  '[{"score": 0.61094731092453, "start": 139, "end": 178, "answer": "learn more about the world of athletics"},
  {"score": 0.17750297486782074, "start": 139, "end": 180, "answer": "learn more about the world of athletics.\""}]'
  ```

## Example

```python
# Prepare model

import transformers
import pandas as pd

finbert_model = transformers.pipeline(
    task="text-classification",
    model="ProsusAI/finbert",
    top_k=2,
)

# Log the model
mv = registry.log_model(
    finbert_model,
    model_name="finbert",
    version_name="v1",
)

# Use the model
mv.run(pd.DataFrame(
        [
            ["I have a problem with my Snowflake that needs to be resolved asap!!", ""],
            ["I would like to have udon for today's dinner.", ""],
        ]
    )
)
```

Result:

```output
0  [{"label": "negative", "score": 0.8106237053871155}, {"label": "neutral", "score": 0.16587384045124054}]
1  [{"label": "neutral", "score": 0.9263970851898193}, {"label": "positive", "score": 0.05286872014403343}]
```

## Inferred signatures for Hugging Face pipelines

This section describes the inferred signatures for supported Hugging Face pipelines, including a description and example of the required inputs and expected outputs. All inputs and outputs are Snowpark DataFrames.

### Fill-mask pipeline

A pipeline whose task is “[fill-mask](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.FillMaskPipeline)”
has the following inputs and outputs.

#### Inputs

* `inputs`: A string where there is a mask to fill.

Example:

```output
--------------------------------------------------
|"inputs"                                        |
--------------------------------------------------
|LynYuu is the [MASK] of the Grand Duchy of Yu.  |
--------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of objects, each of which may contain keys such
  as `score`, `token`, `token_str`, or `sequence`. For details, see
  [FillMaskPipeline](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.FillMaskPipeline).

Example:

```output
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"outputs"                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|[{"score": 0.9066258072853088, "token": 3007, "token_str": "capital", "sequence": "lynyuu is the capital of the grand duchy of yu."}, {"score": 0.08162177354097366, "token": 2835, "token_str": "seat", "sequence": "lynyuu is the seat of the grand duchy of yu."}, {"score": 0.0012052370002493262, "token": 4075, "token_str": "headquarters", "sequence": "lynyuu is the headquarters of the grand duchy of yu."}, {"score": 0.0006560495239682496, "token": 2171, "token_str": "name", "sequence": "lynyuu is the name of the grand duchy of yu."}, {"score": 0.0005427763098850846, "token": 3200, "token_str"...  |
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="fill-mask",
    model="google-bert/bert-base-uncased",
)

mv = registry.log_model(
    model=model,
    model_name="GOOGLE_BERT_BASE_UNCASED",
)

input_df = pd.DataFrame([{"text": "LynYuu is the [MASK] of the Grand Duchy of Yu."}])
mv.run(
    input_df,
    # function_name="__call__", # Optional
)
```

### Token classification

A pipeline whose task is “ner” or
[token-classification](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TokenClassificationPipeline)
has the following inputs and outputs.

#### Inputs

* `inputs`: A string that contains the tokens to be classified.

Example:

```output
------------------------------------------------
|"inputs"                                      |
------------------------------------------------
|My name is Izumi and I live in Tokyo, Japan.  |
------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of result objects, each of which may contain keys such
  as `entity`, `score`, `index`, `word`, `name`, `start`, or `end`. For details, see
  [TokenClassificationPipeline](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TokenClassificationPipeline).

Example:

```output
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"outputs"                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|[{"entity": "PRON", "score": 0.9994392991065979, "index": 1, "word": "my", "start": 0, "end": 2}, {"entity": "NOUN", "score": 0.9968984127044678, "index": 2, "word": "name", "start": 3, "end": 7}, {"entity": "AUX", "score": 0.9937735199928284, "index": 3, "word": "is", "start": 8, "end": 10}, {"entity": "PROPN", "score": 0.9928083419799805, "index": 4, "word": "i", "start": 11, "end": 12}, {"entity": "PROPN", "score": 0.997334361076355, "index": 5, "word": "##zumi", "start": 12, "end": 16}, {"entity": "CCONJ", "score": 0.999173104763031, "index": 6, "word": "and", "start": 17, "end": 20}, {...  |
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="token-classification",
    model="dslim/bert-base-NER",
)

mv = registry.log_model(
    model=model,
    model_name="BERT_BASE_NER",
)

mv.run(
    pd.DataFrame([{"inputs": "My name is Izumi and I live in Tokyo, Japan."}]),
    # function_name="__call__", # Optional
)
```

### Question answering (single output)

A pipeline whose task is “[question-answering](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.QuestionAnsweringPipeline)”,
where `top_k` is either unset or set to 1, has the following inputs and outputs.

#### Inputs

* `question`: A string that contains the question to answer.
* `context`: A string that may contain the answer.

Example:

```output
-----------------------------------------------------------------------------------
|"question"                  |"context"                                           |
-----------------------------------------------------------------------------------
|What did Doris want to do?  |Doris is a cheerful mermaid from the ocean dept...  |
-----------------------------------------------------------------------------------
```

#### Outputs

* `score`: Floating-point confidence score from 0.0 to 1.0.
* `start`: Integer index of the first token of the answer in the context.
* `end`: Integer index of the last token of the answer in the original context.
* `answer`: A string that contains the found answer.

Example:

```output
--------------------------------------------------------------------------------
|"score"           |"start"  |"end"  |"answer"                                 |
--------------------------------------------------------------------------------
|0.61094731092453  |139      |178    |learn more about the world of athletics  |
--------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="question-answering",
    model="deepset/roberta-base-squad2",
)

QA_input = {
    "question": "Why is model conversion important?",
    "context": "The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks.",
}

mv = registry.log_model(
    model=model,
    model_name="ROBERTA_BASE_SQUAD2",
)

mv.run(
    pd.DataFrame.from_records([QA_input]),
    # function_name="__call__", # Optional
)
```

### Question answering (multiple outputs)

A pipeline whose task is “[question-answering](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.QuestionAnsweringPipeline)”,
where `top_k` is set and is larger than 1, has the following inputs and outputs.

#### Inputs

* `question`: A string that contains the question to answer.
* `context`: A string that may contain the answer.

Example:

```output
-----------------------------------------------------------------------------------
|"question"                  |"context"                                           |
-----------------------------------------------------------------------------------
|What did Doris want to do?  |Doris is a cheerful mermaid from the ocean dept...  |
-----------------------------------------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of result objects, each of which may contain keys such
  as `score`, `start`, `end`, or `answer`.

Example:

```output
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"outputs"                                                                                                                                                                                                                                                                                                                                        |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|[{"score": 0.61094731092453, "start": 139, "end": 178, "answer": "learn more about the world of athletics"}, {"score": 0.17750297486782074, "start": 139, "end": 180, "answer": "learn more about the world of athletics.\""}, {"score": 0.06438097357749939, "start": 138, "end": 178, "answer": "\"learn more about the world of athletics"}]  |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="question-answering",
    model="deepset/roberta-base-squad2",
    top_k=3,
)

QA_input = {
    "question": "Why is model conversion important?",
    "context": "The option to convert models between FARM and transformers gives freedom to the user and let people easily switch between frameworks.",
}

mv = registry.log_model(
    model=model,
    model_name="ROBERTA_BASE_SQUAD2",
)

mv.run(
    pd.DataFrame.from_records([QA_input]),
    # function_name="__call__", # Optional
)
```

### Summarization

A pipeline whose task is “[summarization](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.SummarizationPipeline)”,
where `return_tensors` is False or unset, has the following inputs and outputs.

#### Inputs

* `documents`: A string that contains text to summarize.

Example:

```output
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"documents"                                                                                                                                                                                               |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|Neuro-sama is a chatbot styled after a female VTuber that hosts live streams on the Twitch channel "vedal987". Her speech and personality are generated by an artificial intelligence (AI) system  wh...  |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Outputs

* `summary_text`: A string that contains the generated summary, or, if `num_return_sequences` is greater than 1,
  a string that contains a JSON representation of a list of results, each of which is a dictionary that contains fields, including `summary_text`.

Example:

```output
---------------------------------------------------------------------------------
|"summary_text"                                                                 |
---------------------------------------------------------------------------------
| Neuro-sama is a chatbot styled after a female VTuber that hosts live streams  |
---------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="summarization",
    model="facebook/bart-large-cnn",
)

text = "The transformers library is a great library for natural language processing which provides a unified interface for many different models and tasks."

mv = registry.log_model(
    model=model,
    model_name="BART_LARGE_CNN",
)

mv.run(
    pd.DataFrame.from_records([{"documents": text}]),
    # function_name="__call__", # Optional
)
```

### Table question answering

A pipeline whose task is
“[table-question-answering](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TableQuestionAnsweringPipeline)”
has the following inputs and outputs.

#### Inputs

* `query`: A string that contains the question to be answered.
* `table`: A string that contains a JSON-serialized dictionary in the form `{column -> [values]}` representing the table
  that may contain an answer.

Example:

```output
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"query"                                  |"table"                                                                                                                                                                                                                                                   |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|Which channel has the most subscribers?  |{"Channel": ["A.I.Channel", "Kaguya Luna", "Mirai Akari", "Siro"], "Subscribers": ["3,020,000", "872,000", "694,000", "660,000"], "Videos": ["1,200", "113", "639", "1,300"], "Created At": ["Jun 30 2016", "Dec 4 2017", "Feb 28 2014", "Jun 23 2017"]}  |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Outputs

* `answer`: A string that contains a possible answer.
* `coordinates`: A list of integers that represent the coordinates of the cells where the answer was located.
* `cells`: A list of strings that contain the content of the cells where the answer was located.
* `aggregator`: A string that contains the name of the aggregator used.

Example:

```output
----------------------------------------------------------------
|"answer"     |"coordinates"  |"cells"          |"aggregator"  |
----------------------------------------------------------------
|A.I.Channel  |[              |[                |NONE          |
|             |  [            |  "A.I.Channel"  |              |
|             |    0,         |]                |              |
|             |    0          |                 |              |
|             |  ]            |                 |              |
|             |]              |                 |              |
----------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd
import json

model = transformers.pipeline(
    task="table-question-answering",
    model="microsoft/tapex-base-finetuned-wikisql",
)

data = {
    "year": [1896, 1900, 1904, 2004, 2008, 2012],
    "city": ["athens", "paris", "st. louis", "athens", "beijing", "london"],
}
query = "What is the city of the year 2004?"

mv = registry.log_model(
    model=model,
    model_name="TAPEX_BASE_FINETUNED_WIKISQL",
)

mv.run(
    pd.DataFrame.from_records([{"query": query, "table": json.dumps(data)}]),
    # function_name="__call__", # Optional
)
```

### Text classification (single output)

A pipeline whose task is
“[text-classification](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TextClassificationPipeline)” or “sentiment-analysis”,
where `top_k` is not set or is None,
has the following inputs and outputs.

#### Inputs

* `text`: A string to classify.
* `text_pair`: A string to classify along with `text`, and which is used with models that compute text similarity. Leave empty if the model does not use it.

Example:

```output
----------------------------------
|"text"       |"text_pair"       |
----------------------------------
|I like you.  |I love you, too.  |
----------------------------------
```

#### Outputs

* `label`: A string that represents the classification label of the text.
* `score`: A floating-point confidence score from 0.0 to 1.0.

Example:

```output
--------------------------------
|"label"  |"score"             |
--------------------------------
|LABEL_0  |0.9760091304779053  |
--------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="text-classification",
    model="cardiffnlp/twitter-roberta-base-sentiment-latest",
)

text = "I'm happy today!"

mv = registry.log_model(
    model=model,
    model_name="TWITTER_ROBERTA_BASE_SENTIMENT_LATEST",
)

mv.run(
    pd.DataFrame.from_records([{"text": text}]),
    # function_name="__call__", # Optional
)
```

### Text classification (multiple output)

A pipeline whose task is
“[text-classification](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TextClassificationPipeline)” or “sentiment-analysis”,
where `top_k` is set to a number,
has the following inputs and outputs.

> **Note:**
>
> A text classification task is considered multiple-output if `top_k` is set to any number, even if that number is 1.
> To get a single output, use a `top_k` value of None.

#### Inputs

* `text`: A string to classify.
* `text_pair`: A string to classify along with `text`, which is used with models that compute text similarity. Leave empty if the model does not use it.

Example:

```output
--------------------------------------------------------------------
|"text"                                              |"text_pair"  |
--------------------------------------------------------------------
|I am wondering if I should have udon or rice fo...  |             |
--------------------------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of results, each of which contains fields that include `label` and `score`.

Example:

```output
--------------------------------------------------------
|"outputs"                                             |
--------------------------------------------------------
|[{"label": "NEGATIVE", "score": 0.9987024068832397}]  |
--------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="text-classification",
    model="cardiffnlp/twitter-roberta-base-sentiment-latest",
    top_k=3,
)

text = "I'm happy today!"

mv = registry.log_model(
    model=model,
    model_name="TWITTER_ROBERTA_BASE_SENTIMENT_LATEST",
)

mv.run(
    pd.DataFrame.from_records([{"text": text}]),
    # function_name="__call__", # Optional
)
```

### Text-to-text generation

A pipeline whose task is
“[text2text-generation](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.Text2TextGenerationPipeline)”,
where `return_tensors` is False or unset,
has the following inputs and outputs.

#### Inputs

* `inputs`: A string that contains a prompt.

Example:

```output
--------------------------------------------------------------------------------
|"inputs"                                                                      |
--------------------------------------------------------------------------------
|A descendant of the Lost City of Atlantis, who swam to Earth while saying, "  |
--------------------------------------------------------------------------------
```

#### Outputs

* generated_text : A string that contains the generated text if `num_return_sequences` is 1, or if num_return_sequences is
  greater than 1, a string representation
  of a JSON list of result dictionaries that contain fields including `generated_text` .

Example:

```output
----------------------------------------------------------------
|"generated_text"                                              |
----------------------------------------------------------------
|, said that he was a descendant of the Lost City of Atlantis  |
----------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="text2text-generation",
    model="google-t5/t5-small",
)

text = "Tell me a joke."

mv = registry.log_model(
    model=model,
    model_name="T5_SMALL",
)

mv.run(
    pd.DataFrame.from_records([{"inputs": text}]),
    # function_name="__call__", # Optional
)
```

> **Note:**
>
> Text-to-text generation pipelines where `return_tensors` is True are not supported.

### Translation generation

A pipeline whose task is
“[translation](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TranslationPipeline)”,
where `return_tensors` is False or unset,
has the following inputs and outputs.

> **Note:**
>
> Translation generation pipelines where `return_tensors` is True are not supported.

#### Inputs

* `inputs`: A string that contains text to translate.

Example:

```output
------------------------------------------------------------------------------------------------------
|"inputs"                                                                                            |
------------------------------------------------------------------------------------------------------
|Snowflake's Data Cloud is powered by an advanced data platform provided as a self-managed service.  |
------------------------------------------------------------------------------------------------------
```

#### Outputs

* `translation_text`: A string that represents generated translation if `num_return_sequences` is 1, or a string
  representation of a JSON list of result dictionaries, each containing fields that include `translation_text`.

Example:

```output
---------------------------------------------------------------------------------------------------------------------------------
|"translation_text"                                                                                                             |
---------------------------------------------------------------------------------------------------------------------------------
|Le Cloud de données de Snowflake est alimenté par une plate-forme de données avancée fournie sous forme de service autogérés.  |
---------------------------------------------------------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="translation",
    model="deepvk/kazRush-kk-ru",
)

text = "Иттерді кім шығарды?"

mv = registry.log_model(
    model=model,
    model_name="KAZRUSH_KK_RU",
)

mv.run(
    pd.DataFrame.from_records([{"inputs": text}]),
    # function_name="__call__", # Optional
)
```

### Zero-shot classification

A pipeline whose task is
“[zero-shot-classification](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.ZeroShotClassificationPipeline)”
has the following inputs and outputs.

#### Inputs

* `sequences`: A string that contains the text to be classified.
* `candidate_labels`: A list of strings that contain the labels to be applied to the text.

Example:

```output
-----------------------------------------------------------------------------------------
|"sequences"                                                       |"candidate_labels"  |
-----------------------------------------------------------------------------------------
|I have a problem with Snowflake that needs to be resolved asap!!  |[                   |
|                                                                  |  "urgent",         |
|                                                                  |  "not urgent"      |
|                                                                  |]                   |
|I have a problem with Snowflake that needs to be resolved asap!!  |[                   |
|                                                                  |  "English",        |
|                                                                  |  "Japanese"        |
|                                                                  |]                   |
-----------------------------------------------------------------------------------------
```

#### Outputs

* `sequence`: The input string.
* `labels`: A list of strings that represent the labels that were applied.
* `scores`: A list of floating-point confidence scores for each label.

Example:

```output
--------------------------------------------------------------------------------------------------------------
|"sequence"                                                        |"labels"        |"scores"                |
--------------------------------------------------------------------------------------------------------------
|I have a problem with Snowflake that needs to be resolved asap!!  |[               |[                       |
|                                                                  |  "urgent",     |  0.9952737092971802,   |
|                                                                  |  "not urgent"  |  0.004726255778223276  |
|                                                                  |]               |]                       |
|I have a problem with Snowflake that needs to be resolved asap!!  |[               |[                       |
|                                                                  |  "Japanese",   |  0.5790848135948181,   |
|                                                                  |  "English"     |  0.42091524600982666   |
|                                                                  |]               |]                       |
--------------------------------------------------------------------------------------------------------------
```

### Text generation

A pipeline whose task is
“[text-generation](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TextGenerationPipeline)”,
where `return_tensors` is False or unset,
has the following inputs and outputs.

> **Note:**
>
> Text generation pipelines where `return_tensors` is True are not supported.

#### Inputs

* `inputs`: A string that contains a prompt.

Example:

```output
--------------------------------------------------------------------------------
|"inputs"                                                                      |
--------------------------------------------------------------------------------
|A descendant of the Lost City of Atlantis, who swam to Earth while saying, "  |
--------------------------------------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of result objects, each of which contains fields that include `generated_text`.

Example:

```output
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|"outputs"                                                                                                                                                                                                 |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
|[{"generated_text": "A descendant of the Lost City of Atlantis, who swam to Earth while saying, \"For my life, I don't know if I'm gonna land upon Earth.\"\n\nIn \"The Misfits\", in a flashback, wh...  |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd

model = transformers.pipeline(
    task="text-generation",
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
)

mv = registry.log_model(
    model=model,
    model_name="TINYLLAMA",
)

text = "A descendant of the Lost City of Atlantis, who swam to Earth while saying,"
mv.run(
    pd.DataFrame.from_records([{"inputs": text}]),
    # function_name="__call__", # Optional
)
```

### Text generation (OpenAI-compatible)

A pipeline whose task is
“[text-generation](https://huggingface.co/docs/transformers/en/main_classes/pipelines#transformers.TextGenerationPipeline)”,
where `return_tensors` is False or unset,
has the following inputs and outputs.

By providing the `snowflake.ml.model.openai_signatures.OPENAI_CHAT_SIGNATURE` signature, while logging the model, the model will be compatible with the OpenAI API. This allows the users to pass `openai.client.ChatCompletion` style requests to the model.

> **Note:**
>
> Text generation pipelines where `return_tensors` is True are not supported.

#### Inputs

* `messages`: A list of dictionaries that contain the messages to be sent to the model.
* `max_completion_tokens`: The maximum number of tokens to generate.
* `temperature`: The temperature to use for the generation.
* `stop`: The stop sequence to use for the generation.
* `n`: The number of generations to produce.
* `stream`: Whether to stream the generation.
* `top_p`: The top p value to use for the generation.
* `frequency_penalty`: The frequency penalty to use for the generation.
* `presence_penalty`: The presence penalty to use for the generation.

Example:

```output
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| messages                                                                                                                                                                                          |   max_completion_tokens |   temperature | stop   |   n | stream   |   top_p |   frequency_penalty |  presence_penalty |
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| [{'role': 'system', 'content': 'Complete the sentence.'}, {'role': 'user', 'content': [{'type': 'text', 'text': 'A descendant of the Lost City of Atlantis, who swam to Earth while saying, '}]}] |                     250 |           0.9 |        |   3 | False    |       1 |                 0.1 |               0.2 |
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Outputs

* `outputs`: A string that contains a JSON representation of a list of result objects, each of which contains fields that include `generated_text`.

Example:

```output
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| id           | object          |     created | model                                      | choices                                                                                                                                      |  usage                                                               |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| chatcmpl-... | chat.completion | 1.76912e+09 | /shared/model/model/models/TINYLLAMA/model | [{'finish_reason': 'stop', 'index': 0, 'logprobs': None, 'message': {'content': 'The descendant is not actually ...', 'role': 'assistant'}}] | {'completion_tokens': 399, 'prompt_tokens': 52, 'total_tokens': 451} |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### Code Example

```python
import transformers
import pandas as pd
from snowflake.ml.model import openai_signatures

model = transformers.pipeline(
    task="text-generation",
    model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
)

mv = registry.log_model(
    model=model,
    model_name="TINYLLAMA",
    signatures=openai_signatures.OPENAI_CHAT_SIGNATURE,
)

# create a pd.DataFrame with openai.client.chat.completion arguments
x_df = pd.DataFrame.from_records(
    [
        {
            "messages": [
                {
                    "role": "system",
                    "content": [
                        {
                            "type": "text",
                            "text": "Complete the sentence.",
                        }
                    ],
                },
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "text",
                            "text": "A descendant of the Lost City of Atlantis, who swam to Earth while saying, ",
                        }
                    ],
                },
            ],
            "max_completion_tokens": 250,
            "temperature": 0.9,
            "stop": None,
            "n": 3,
            "stream": False,
            "top_p": 1.0,
            "frequency_penalty": 0.1,
            "presence_penalty": 0.2,
        }
    ],
)

# OpenAI Chat Completion compatible output
output_df = mv.run(X=x_df)
```

---
title: Influence sensitivity plots
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-explainability-visualization/influence-sensitivity-plots.md
section: Snowflake ML
---

# Influence sensitivity plots

Use the `plot_influence_sensitivity()` function to create a SHAP dependence scatter plot to visualize the relationship between feature values and their SHAP values.
This can help you understand how changes in feature values influence model predictions.

In the preceding example, the plot shows how the feature values influence the model’s prediction.

## Required arguments

| Argument | Description |
| --- | --- |
| `shap_values` | A pandas Series or 2D array containing the SHAP values for the same feature |
| `feature_values` | A pandas Series or 2D array containing the feature values for a specific feature |

## Optional arguments

| Argument | Description |
| --- | --- |
| `figsize` | A tuple of (width, height) that controls the size of the plot. Uses a default size of (1400, 500) if not specified. |

> **Note:**
>
> The feature of providing a 2D array of SHAP values and feature values is only available in Snowflake Notebooks.
> To select the feature for which you want to visualize the SHAP values, you can use the provided interactive dropdown selector.
> If you are using a local notebook, you must pass a single feature’s SHAP values and feature values as arguments.

The function returns a chart that visualizes the feature values along the x-axis and their corresponding SHAP values along the y-axis.

The visualization can be helpful to understand the following data points:

* Trends in how feature values influence predictions
* The strength and direction of influence for each feature
* Clusters or patterns in feature interactions

---
title: Keras
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/keras.md
section: Snowflake ML
---

# Keras

The Snowflake ML Model Registry supports Keras 3 models (`keras.Model` with Keras version >= 3.0.0).
Keras 3 is a multi-backend framework that supports TensorFlow, PyTorch, and JAX as backends.

> **Note:**
>
> For Keras version < 3.0.0, use the [TensorFlow](tensorflow.md) handler.

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. Keras models have `predict` as the default target method. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a Keras model so
that the registry knows the signatures of the target methods.

> **Note:**
>
> Keras models can only have one target method.

## Examples

These examples assume `reg` is an instance of `snowflake.ml.registry.Registry`.

### Sequential Model

The following example demonstrates training a Keras 3 sequential model, logging it to the Snowflake ML Model Registry, and running inference.

```python
import keras
from sklearn import datasets, model_selection

# Load dataset
iris = datasets.load_iris(as_frame=True)
X = iris.data
y = iris.target

# Rename columns for valid Snowflake identifiers
X.columns = [col.replace(' ', '_').replace('(', '').replace(')', '') for col in X.columns]

X_train, X_test, y_train, y_test = model_selection.train_test_split(X, y, test_size=0.2)

# Build Keras sequential model
model = keras.Sequential([
    keras.layers.Dense(64, activation='relu'),
    keras.layers.Dense(32, activation='relu'),
    keras.layers.Dense(3, activation='softmax')
])

model.compile(
    optimizer='adam',
    loss='sparse_categorical_crossentropy',
    metrics=['accuracy']
)

# Train the model
model.fit(X_train, y_train, epochs=50, verbose=0)

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_keras_classifier",
    version_name="v1",
    sample_input_data=X_test,
)

# Make predictions
result_df = model_ref.run(X_test[-10:], function_name="predict")
```

### Functional API Model

The following example demonstrates creating a model using the Keras Functional API.

```python
import keras
import numpy as np
import pandas as pd

# Create sample data
n_samples, n_features = 100, 10
X = pd.DataFrame(
    np.random.rand(n_samples, n_features),
    columns=[f"feature_{i}" for i in range(n_features)]
)
y = np.random.randint(0, 2, n_samples).astype(np.float32)

# Build model using Functional API
inputs = keras.Input(shape=(n_features,))
x = keras.layers.Dense(32, activation='relu')(inputs)
x = keras.layers.Dense(16, activation='relu')(x)
outputs = keras.layers.Dense(1, activation='sigmoid')(x)
model = keras.Model(inputs=inputs, outputs=outputs)

model.compile(
    optimizer=keras.optimizers.SGD(learning_rate=0.01),
    loss=keras.losses.MeanSquaredError()
)

# Train the model
model.fit(X, y, epochs=10, verbose=0)

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_functional_model",
    version_name="v1",
    sample_input_data=X,
)

# Make predictions
result_df = model_ref.run(X[-10:], function_name="predict")
```

### Custom Subclass Model

The following example demonstrates creating a custom model by subclassing `keras.Model`.

```python
import keras
import numpy as np
import pandas as pd

# Define custom model with serialization support
@keras.saving.register_keras_serializable()
class BinaryClassifier(keras.Model):
    def __init__(self, hidden_units: int, output_units: int) -> None:
        super().__init__()
        self.dense1 = keras.layers.Dense(hidden_units, activation="relu")
        self.dense2 = keras.layers.Dense(output_units, activation="sigmoid")

    def call(self, inputs):
        x = self.dense1(inputs)
        return self.dense2(x)

    def get_config(self):
        base_config = super().get_config()
        config = {
            "dense1": keras.saving.serialize_keras_object(self.dense1),
            "dense2": keras.saving.serialize_keras_object(self.dense2),
        }
        return {**base_config, **config}

    @classmethod
    def from_config(cls, config):
        dense1_config = config.pop("dense1")
        dense1 = keras.saving.deserialize_keras_object(dense1_config)
        dense2_config = config.pop("dense2")
        dense2 = keras.saving.deserialize_keras_object(dense2_config)
        obj = cls(1, 1)
        obj.dense1 = dense1
        obj.dense2 = dense2
        return obj

# Create sample data
n_samples, n_features = 100, 10
X = pd.DataFrame(
    np.random.rand(n_samples, n_features),
    columns=[f"feature_{i}" for i in range(n_features)]
)
y = np.random.randint(0, 2, n_samples).astype(np.float32)

# Create and train model
model = BinaryClassifier(hidden_units=32, output_units=1)
model.compile(
    optimizer=keras.optimizers.SGD(learning_rate=0.01),
    loss=keras.losses.MeanSquaredError()
)
model.fit(X, y, epochs=10, verbose=0)

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_custom_classifier",
    version_name="v1",
    sample_input_data=X,
)

# Make predictions
result_df = model_ref.run(X[-10:], function_name="predict")
```

---
title: LightGBM
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/lightgbm.md
section: Snowflake ML
---

# LightGBM

The Snowflake ML Model Registry supports models created using LightGBM (models derived from the scikit-learn API wrapper, e.g. `lightgbm.LGBMClassifier` or the native API, e.g. `lightgbm.Booster`).

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. Models derived from the scikit-learn API (e.g. `LGBMClassifier`) have the following target methods by default, assuming the method exists: `predict`, `predict_proba`. Models derived from the native API (e.g. `Booster`) have the `predict` method by default. |
| `enable_explainability` | Whether to enable explainability for the model using SHAP. Defaults to `True`. When enabled, an `explain` method will be available on the logged model. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a LightGBM model so
that the registry knows the signatures of the target methods.

## Examples

These examples assume `reg` is an instance of `snowflake.ml.registry.Registry`.

### Scikit-Learn API (LGBMClassifier)

The following example demonstrates the key steps to train a LightGBM classifier using the scikit-learn API, log it to the Snowflake ML Model Registry, and use the registered model for inference and explainability. The workflow includes:

* Trains a LightGBM classifier on a sample dataset.
* Logs the model to the Snowflake ML Model Registry.
* Makes predictions and retrieves prediction probabilities.
* Gets SHAP values for the model’s predictions.

```python
import lightgbm as lgb
from sklearn import datasets, model_selection

# Load dataset
cal_data = datasets.load_breast_cancer(as_frame=True)
cal_X = cal_data.data
cal_y = cal_data.target

# Normalize column names (replace spaces with underscores)
cal_X.columns = [col.replace(' ', '_') for col in cal_X.columns]

cal_X_train, cal_X_test, cal_y_train, cal_y_test = model_selection.train_test_split(
    cal_X, cal_y, test_size=0.2
)

# Train LightGBM Classifier
classifier = lgb.LGBMClassifier(
    n_estimators=100,
    learning_rate=0.05,
    num_leaves=31
)
classifier.fit(cal_X_train, cal_y_train)

# Log the model
model_ref = reg.log_model(
    model=classifier,
    model_name="my_lightgbm_classifier",
    version_name="v1",
    sample_input_data=cal_X_test,
)

# Make predictions
result_df = model_ref.run(cal_X_test[-10:], function_name="predict")

# Get prediction probabilities
proba_df = model_ref.run(cal_X_test[-10:], function_name="predict_proba")

# Get explanations (SHAP values)
explanations_df = model_ref.run(cal_X_test[-10:], function_name="explain")
```

### Native API (Booster)

The following example demonstrates the key steps to train a LightGBM model using the native Snowflake ML API, log it to the Snowflake ML Model Registry, and use the registered model for inference. The workflow does the following:

* Trains a LightGBM model on a sample dataset.
* Logs the model to the Snowflake ML Model Registry.
* Makes predictions.

```python
import lightgbm as lgb
import pandas as pd
from sklearn import datasets, model_selection

# Load dataset
cal_data = datasets.load_breast_cancer()
cal_X = pd.DataFrame(cal_data.data, columns=cal_data.feature_names)
cal_y = cal_data.target

# Normalize column names (replace spaces with underscores)
cal_X.columns = [col.replace(' ', '_') for col in cal_X.columns]

cal_X_train, cal_X_test, cal_y_train, cal_y_test = model_selection.train_test_split(
    cal_X, cal_y, test_size=0.2
)

# Prepare LightGBM Data Structure
lgb_train = lgb.Dataset(cal_X_train, cal_y_train)

# Define parameters and train the model
params = {
    'objective': 'binary',
    'metric': 'binary_logloss',
    'boosting_type': 'gbdt',
    'num_leaves': 31,
    'learning_rate': 0.05,
    'feature_fraction': 0.9,
}

num_round = 100
booster = lgb.train(
    params,
    lgb_train,
    num_round
)

# Log the model
model_ref = reg.log_model(
    model=booster,
    model_name="my_lightgbm_booster",
    version_name="v1",
    sample_input_data=cal_X_test,
)

# Make predictions
result_df = model_ref.run(cal_X_test[-10:], function_name="predict")
```

---
title: Load and write data
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/load-data.md
section: Snowflake ML
---

# Load and write data

Use Snowflake ML to efficiently load data from Snowflake tables and stages into your machine learning workflows. Snowflake ML provides optimized data loading capabilities that take advantage of Snowflake’s distributed processing to accelerate data ingestion for your training and inference workflows.

You can load and process data using:

* **Snowflake Notebooks**: Interactive development environment for exploring data and building ML models. For more information, see [Notebooks on Container Runtime](notebooks-on-spcs.md).
* **Snowflake ML Jobs**: Run your ML workloads asynchronously from any development environment. For more information, see [Snowflake ML Jobs](ml-jobs/overview.md).

Both Notebooks and ML Jobs run on the Container Runtime, which provides preconfigured environments optimized for machine learning workloads with distributed processing capabilities. The Container Runtime uses Ray, an open-source framework for distributed computing, to efficiently process data across multiple compute nodes. For more information about the Container Runtime, see [Snowflake Container Runtime](container-runtime-ml.md).

Snowflake ML provides different APIs for loading structured and unstructured data:

**Structured data (tables and datasets)**

* **DataConnector**: Load data from Snowflake tables and Snowflake Datasets. For more information, see Load structured data from Snowflake tables.
* **DataSink**: Write data back to Snowflake tables. For more information, see Write structured data back to Snowflake tables.

**Unstructured data (files in stages)**

* **DataSource APIs**: Load data from various file formats (CSV, Parquet, images, and more) from Snowflake stages. For more information, see Load unstructured data from Snowflake stages.

The following table can help you choose the right API for your use case:

Data Sources and APIs

| Data Type | Data Source | API for Loading | API for Writing |
| --- | --- | --- | --- |
| Structured | Snowflake Tables | DataConnector | DataSink |
| Structured | Snowflake Datasets | DataConnector | DataSink |
| Unstructured | CSV Files (Stage) | DataSource API | N/A |
| Unstructured | Parquet Files (Stage) | DataSource API | N/A |
| Unstructured | Other Staged Files | DataSource API | N/A |

## Load structured data from Snowflake tables

Use the Snowflake DataConnector to load structured data from Snowflake tables and Snowflake Datasets into a Snowflake Notebook or Snowflake ML Job. The DataConnector accelerates data loading by parallelizing the reads across multiple compute nodes.

The DataConnector works with either Snowpark DataFrames or Snowflake Datasets:

* **Snowpark DataFrames**: Provide direct access to the data in your Snowflake tables. Best used during development.
* **Snowflake Datasets**: Versioned schema-level objects. Best used for production workflows. For more information, see [Snowflake Datasets](dataset.md).

After parallelizing the reads, the DataConnector can convert the data into one of following data structures:

* pandas dataframe
* PyTorch dataset
* TensorFlow dataset

### Create a DataConnector

You can create a DataConnector from a Snowpark DataFrame or a Snowflake Dataset.

Use the following code to create a DataConnector from a Snowpark DataFrame:

```python
from snowflake.ml.data.data_connector import DataConnector
from snowflake.snowpark.context import get_active_session

session = get_active_session()

# Create DataConnector from a Snowflake table
data_connector = DataConnector.from_dataframe(session.table("example-table-name"))
```

Use the following code to create a DataConnector from a Snowflake Dataset:

```python
from snowflake.ml.data.data_connector import DataConnector

# Create DataConnector from a Snowflake Dataset
data_connector = DataConnector.from_dataset(snowflake_dataset)
```

### Convert DataConnector to other formats

After creating a DataConnector, you can convert it to different data structures for use with various ML frameworks.

pandas dataframePyTorch datasetTensorFlow dataset

You can convert a DataConnector to a pandas dataframe for use with scikit-learn and other pandas-compatible libraries.

The following example loads data from a Snowflake table into a pandas dataframe and trains an XGBoost classifier:

```python
from snowflake.ml.data.data_connector import DataConnector
from snowflake.snowpark.context import get_active_session
import xgboost as xgb

session = get_active_session()

# Specify training table location
table_name = "TRAINING_TABLE"

# Load table into DataConnector
data_connector = DataConnector.from_dataframe(session.table(table_name))

# Convert to pandas dataframe
pandas_df = data_connector.to_pandas()

# Prepare features and labels
label_column_name = 'TARGET'
X, y = pandas_df.drop(label_column_name, axis=1), pandas_df[label_column_name]

# Train classifier
clf = xgb.Classifier()
clf.fit(X, y)
```

You can convert a DataConnector to a PyTorch dataset for use with PyTorch models and data loaders.

The following example loads data from a Snowflake table into a PyTorch dataset:

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from snowflake.ml.data.data_connector import DataConnector

# Create DataConnector (see previous examples)
# data_connector = DataConnector.from_dataframe(...)

# Convert to PyTorch dataset
torch_dataset = data_connector.to_torch_dataset(batch_size=32)
dataloader = DataLoader(torch_dataset, batch_size=None)

label_col = 'TARGET'
feature_cols = ['FEATURE1', 'FEATURE2']

for batch_idx, batch in enumerate(dataloader):
    y = batch_data.pop(label_col).squeeze()
    X = torch.stack(
        [tensor.squeeze() for key, tensor in batch.items() if key in feature_cols]
    )
```

You can convert a DataConnector to a TensorFlow dataset for use with TensorFlow models. Data is loaded in a streaming fashion for maximum efficiency.

The following example converts a DataConnector to a TensorFlow dataset:

```python
from snowflake.ml.data.data_connector import DataConnector

# Create DataConnector (see previous examples)
# data_connector = DataConnector.from_dataframe(...)

# Convert to TensorFlow dataset
tf_ds = data_connector.to_tf_dataset(
    batch_size=4,
    shuffle=True,
    drop_last_batch=True
)

for batch in tf_ds:
    print(batch)
```

### Use with Snowflake’s distributed training APIs

For best performance, you can pass a DataConnector directly to Snowflake’s optimized distributed training APIs instead of converting to pandas, PyTorch, or TensorFlow datasets first.

The following example trains an XGBoost model using Snowflake’s distributed XGBoost estimator:

```python
from snowflake.ml.data.data_connector import DataConnector
from snowflake.ml.modeling.distributors.xgboost.xgboost_estimator import (
    XGBEstimator,
    XGBScalingConfig,
)
from snowflake.snowpark.context import get_active_session

session = get_active_session()

# Create DataConnector from a Snowpark dataframe
snowflake_df = session.table("TRAINING_TABLE")
data_connector = DataConnector.from_dataframe(snowflake_df)

# Create Snowflake XGBoost estimator
snowflake_est = XGBEstimator(
    n_estimators=1,
    objective="reg:squarederror",
    scaling_config=XGBScalingConfig(use_gpu=False),
)

# Train using the data connector
# When using a data connector, input_cols and label_col must be provided
fit_booster = snowflake_est.fit(
    data_connector,
    input_cols=NUMERICAL_COLS,
    label_col=LABEL_COL
)
```

### Use sharding with PyTorch distributor

You can use the ShardedDataConnector to shard your data across multiple nodes for distributed training with the Snowflake PyTorch distributor.

The following example trains a PyTorch model on the digits dataset using sharded data across multiple processes:

```python
from sklearn import datasets
from snowflake.ml.data.sharded_data_connector import ShardedDataConnector
from snowflake.ml.modeling.pytorch import (
    PyTorchTrainer,
    ScalingConfig,
    WorkerResourceConfig,
    getContext,
)
from torch import nn
from snowflake.snowpark.context import get_active_session

session = get_active_session()

# Create the Snowflake data from a Snowpark dataframe
digits = datasets.load_digits(as_frame=True).frame
digits_df = session.create_dataframe(digits)

# Create sharded data connector
sharded_data_connector = ShardedDataConnector.from_dataframe(digits_df)

# Define the PyTorch model
class DigitsModel(nn.Module):
    def __init__(self):
        super(DigitsModel, self).__init__()
        self.flatten = nn.Flatten()
        self.linear_relu_stack = nn.Sequential(
            nn.Linear(8 * 8, 512),
            nn.ReLU(),
            nn.Linear(512, 512),
            nn.ReLU(),
            nn.Linear(512, 10)
        )

    def forward(self, x):
        x = self.flatten(x)
        logits = self.linear_relu_stack(x)
        return logits

# Define training function that runs across multiple nodes or devices
# Each process receives a unique data shard
def train_func():
    import os
    import torch
    import torch.distributed as dist
    from torch.utils.data import DataLoader
    from torch import nn
    from torch.nn.parallel import DistributedDataParallel as DDP

    # Get context with data shards and model directory
    context = getContext()
    dataset_map = context.get_dataset_map()
    model_dir = context.get_model_dir()
    training_data = dataset_map["train"].get_shard().to_torch_dataset()
    train_dataloader = DataLoader(training_data, batch_size=batch_size, drop_last=True)

    dist.init_process_group()
    device = "cpu"
    label_col = '"target"'
    batch_size = 64

    model = DDP(DigitsModel())
    loss_fn = nn.CrossEntropyLoss()
    optimizer = torch.optim.SGD(model.parameters(), lr=1e-3)

    # Training loop
    for epoch in range(5):
        for batch, batch_data in enumerate(train_dataloader):
            y = batch_data.pop(label_col).flatten().type(torch.LongTensor).to(device)
            X = torch.concat(
                [tensor.to(torch.float32) for tensor in batch_data.values()],
                dim=-1,
            ).to(device)
            pred = model(X)
            loss = loss_fn(pred, y)

            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            if batch % 100 == 0:
                print(f"Epoch {epoch}, Batch {batch}, Loss: {loss.item()}")

    # Save the model
    if dist.get_rank() == 0:
        torch.save(model.state_dict(), os.path.join(model_dir, "digits_model.pth"))

# Create PyTorch trainer with scaling configuration
pytorch_trainer = PyTorchTrainer(
    train_func=train_func,
    scaling_config=ScalingConfig(
        num_nodes=1,
        num_workers_per_node=4,
        resource_requirements_per_worker=WorkerResourceConfig(num_cpus=1, num_gpus=0),
    ),
)

# Run distributed training
response = pytorch_trainer.run(
    dataset_map=dict(
        train=sharded_data_connector,
    )
)
```

## Load unstructured data from Snowflake stages

Use the Snowflake DataSource APIs to read unstructured data from Snowflake stages. Each file format has a corresponding datasource class that defines how to read the data.

The following shows the file formats and corresponding APIs that you use to load the data:

* **Binary files**: `SFStageBinaryFileDataSource`
* **Text files**: `SFStageTextDataSource`
* **CSV files**: `SFStageCSVDataSource`
* **Parquet files**: `SFStageParquetDataSource`
* **Image files**: `SFStageImageDataSource`

### Load and process data

When you create a Snowflake Datasource, you must provide the following:

* The name of the stage from which you’re reading the data
* The database that has the stage (defaults to current session)
* The schema that has the stage (defaults to current session)
* The pattern to the filter files being read from the datasource (optional)

The Data API or the Data Connector retrieves all files within the provided path that matches the file pattern.

After you define the Snowflake Datasource, you can load data into a Ray dataset. With the Ray dataset, you can do the following:

* Use the dataset with Ray APIs
* Pass the dataset to DataConnector
* Convert to pandas or PyTorch datasets if needed.

The following example does the following:

* Reads Parquet files from a Snowflake stage into a Ray dataset
* Converts the dataset to a DataConnector

```python
import ray
from snowflake.ml.ray.datasource.stage_parquet_file_datasource import SFStageParquetDataSource
from snowflake.ml.data.data_connector import DataConnector

data_source = SFStageParquetDataSource(
    stage_location="@stage/path/",
    database="DB_NAME", # optional
    schema="SCHEMA_NAME", # optional
    file_pattern='*.parquet', # optional
)

# Build Ray dataset from provided datasources
ray_ds = ray.data.read_datasource(data_source)

dc = DataConnector.from_ray_dataset(ray_ds)
```

## Write structured data back to Snowflake tables

Use the Snowflake DataSink API to write structured data from your Notebook or ML Job back to a Snowflake table. You can write transformed or prediction datasets to Snowflake for further analysis or storage.

To define a data sink, provide the following:

* Stage name
* Database name (defaults to current session)
* Schema name (defaults to current session)
* File pattern to match specific files (optional)

The following example defines a data sink:

```python
from snowflake.ml.ray.datasink import SnowflakeTableDatasink
datasink = SnowflakeTableDatasink(
    table_name="table_name",
    database="db_name",
    schema="schema_name",
    auto_create_table=True, # create table if not exists
    override=True # replace vs insert to table
)
```

After you define a data sink, you can use the following code to write the Ray dataset to a Snowflake table.

```python
import ray

# Get Ray dataset from sources
ray_ds = ray.data.read_datasource(data_source)

# Setup transform operations, not executed yet
transformed_ds = ray_ds.map_batches(example_transform_batch_function)

# Start writing to Snowflake distributedly
transformed_ds.write_datasink(datasink)
```

## Best Practices and Considerations

For optimal performance and resource utilization, consider the following best practices:

**Parallelism**: Design your data source implementations to leverage Ray’s distributed nature. Customize the parallelism and concurrency arguments to better suit your use case. You can manually define how many resources you’re allocating per task in each step.

**Partitioning**: By default, Ray’s internal logic will partition the dataset based on resources and data size. You can customize number of partitions to choose between large number of small tasks vs small number of big tasks based on use case with `ray_ds.repartition(X)`.

**Best practices**: Follow [Ray Data User Guide](https://docs.ray.io/en/latest/data/user-guide.html) for additional guidance.

**Ray API details**:

* [Ray Datasource](https://docs.ray.io/en/latest/data/api/doc/ray.data.read_datasource.html)
* [Ray Map Batches (batch transformation)](https://docs.ray.io/en/latest/data/api/doc/ray.data.Dataset.map_batches.html)

## Next steps

After loading your data, you can:

* [Transform and engineer features](transform-data.md)
* [Train models](modeling.md)
* [Use the Feature Store](feature-store/overview.md) for feature management

---
title: Manage packages in notebooks on Container Runtime
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime-package-management.md
section: Snowflake ML
---

# Manage packages in notebooks on Container Runtime

Snowflake Notebooks on Container Runtime currently support common `pip` commands and workflows for managing packages. This includes the following common workflows:

* Use a package spec, such as a `requirements.txt` file, to set up a notebook environment.
* View all packages installed in a notebook environment.
* Uninstall packages.
* Export a package spec that captures the current notebook environment.
* Update packages in the notebook environment.

In addition to these workflows, Notebooks on Container Runtime supports other `pip` workflows.

## Prerequisite

Ensure that an external access integration (EAI) for PyPI is set up in the notebook or that Artifact Repository is active in the Snowflake account. For more information about PyPI EAI, see [Enable external access integrations in Snowsight](../../user-guide/ui-snowsight/notebooks-external-access.md). For information about Artifact Repository, see [Artifact Repository overview](../udf/python/udf-python-packages.md).

## View all packages installed in a notebook environment

* To view a full list of the packages currently installed in the notebook environment and their respective versions, from a notebook cell, run the following command:

  > ```none
  > !pip freeze
  > ```

## Install individual packages in your notebook environment

You can modify your notebook’s Python environment by installing individual packages using inline `pip` commands in your notebook cells.

* To install a package, from a notebook cell, run the following command:

  > ```none
  > !pip install <package_name>
  > ```

## Install packages from a package spec to set up a notebook environment

You can modify your notebook’s Python environment using a package spec, such as a `requirements.txt` file, to install your desired packages. The following example shows how to install packages from a `requirements.txt` file stored locally. You can also install packages from a `requirements.txt` file stored in an internal or external stage.

1. Upload the `requirements.txt` file to the notebook.

   > For information about the `requirements.txt` file, see [Requirements File Format](https://pip.pypa.io/en/stable/reference/requirements-file-format/).
2. To install all of the packages, from a notebook cell, run the following command:

   > ```none
   > !pip install -r requirements.txt
   > ```

## Update package versions in the notebook environment

1. From a notebook cell, run one of the following commands that corresponds to the version of the package you want to update to:

   * Latest version:

     > ```none
     > !pip install <package_name> --upgrade
     > ```
   * Specific version:

     > ```none
     > !pip install <package_name> --<version>
     > ```
2. To confirm that the update is complete, when prompted, restart the notebook kernel.

## Uninstall packages from a notebook environment

Complete the following steps to uninstall all of the packages that you installed using a package spec in the notebook environment.

1. Verify that a `requirements.txt` file exists in the notebook environment.
2. From a cell in the notebook, run the following command:

   > ```none
   > !pip uninstall -r requirements.txt
   > ```
3. To confirm that the packages were uninstalled, when prompted, restart the notebook kernel.

## Export packages in your notebook environment as a package spec

You can export a package spec that captures the current state of the notebook environment. With this package spec, you can quickly replicate
the notebook environment.

1. From a cell in the notebook, run the following command:

   > ```none
   > !pip list --format=freeze <filename>.txt
   > ```
2. To upload the file to a stage, run the following command:

   > ```python
   > session.file.put("<path to file>/<filename>.txt", "@mystage/prefix1")
   > ```

For more information about storing files in a stage, see [Store files in a Snowflake stage](../../user-guide/ui-snowsight/notebooks-work-with-files.md).

---
title: Managing models with the Snowflake Model Registry
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-management.md
section: Snowflake ML
---

# Managing models with the Snowflake Model Registry

The Snowflake Model Registry simplifies the process of bringing a machine learning model from development to production.
A well-organized model registry serves as the central hub and single source for all models, their metrics, and their
metadata. Logging your model in the registry is the first and most significant step in your Snowflake ML Ops journey,
bringing your machine learning operations under the control, security, and governance Snowflake is known for.

The Snowflake Model Registry is flexible enough to address a wide range of ML model management use cases and scenarios.
This topic offers guidance on how best to use the registry to seamlessly manage models from development to production,
including:

* How to control access to models so that the right group of users or roles can perform various operations.
* How to query the metrics and other metadata of all your models.
* How to manage the lifecycle of a model from development to production.
* How to roll out a new version of a model without any changes to production code.

## Governance

Because [machine learning models](../../../sql-reference/commands-model.md) are first-class objects in Snowflake, you
can use all standard Snowflake governance capabilities with them, including role-based access control and the
information schema.

### Role-based access control

Model objects have three privileges: OWNERSHIP, USAGE, and READ.

| Privilege | Description |
| --- | --- |
| OWNERSHIP | Full control of the model, including managing model versions, accessing artifacts, and updating model metadata. Only one role can own the model, but you can [grant that role](../../../sql-reference/sql/grant-role.md) to multiple users or to other roles. |
| USAGE | Read-only access to the model, allowing warehouse inference (prediction) and use of the SHOW MODELS and SHOW VERSIONS IN MODEL commands. Roles with only USAGE cannot access the model code, weights, or other artifacts. |
| READ | Read-only access to the model, allowing SPCS inference (prediction), model files, metadata and use of the SHOW MODELS and SHOW VERSIONS IN MODEL commands. |

The owner of a model can grant access to any role as follows:

```sqlexample
GRANT USAGE ON MODEL my_model TO ROLE prod_role;
-- OR
GRANT READ ON MODEL my_model TO ROLE prod_role;
```

### Information schema queries

Like all Snowflake objects, models are represented in a view in the [Snowflake Information Schema](../../../sql-reference/info-schema.md). The view for
models and their versions is [INFORMATION_SCHEMA.MODEL_VERSIONS](../../../sql-reference/info-schema/model_versions.md).
Model version information is a superset of the information for models, so there is no separate MODEL view.

Through this view, you can query the registry itself. For example, assume that you maintain an accuracy metric, adding
it to each model version using SQL like the following.

```sqlexample
ALTER MODEL my_model MODIFY VERSION v1
    SET METADATA = '{"metric": {"accuracy": 0.769}}';
```

> **Note:**
>
> You can also [set metrics with the registry’s Python API](overview.md).
>
> ```python
> mv = reg.get_model("my_model").version("v1").set_metric("accuracy", 0.769)
> ```

After you have added this metric to all versions of your models, you can use a query like the one here to retrieve information
about all model objects and list them in order of highest accuracy to lowest.

```sqlexample
SELECT
    catalog_name,
    schema_name,
    model_name,
    model_version_name,
    metadata:metric:accuracy AS accuracy,
    comment,
    owner,
    functions,
    created_on,
    last_altered_on
FROM my_database.INFORMATION_SCHEMA.MODEL_VERSIONS
ORDER BY accuracy DESC;
```

You can create more complex queries that join to other information schema views or other tables for more detailed
analysis.

## Model lifecycle management

To meet the diverse needs of small and large enterprises, the Snowflake Model Registry provides four simple, yet
powerful, schemes for managing the lifecycle of a model from development to production. Choose the one that works
best for you based on the governance structure you prefer.

* Using the default version
* Using aliases
* Using tags
* Using multiple schemas

### Using the default version

Models are versioned, and one version is designated as the default version. You can treat the default version of a model
as the production version by convention; production code only ever calls the default version of the model.

In this scenario, you promote a model version to production simply by setting it as the default, perhaps after it meets
your model scoring or performance evaluation workflow requirements. This is the simplest way to control which version of
a model is used in production.

*Use this method when:*

* The owner of the model has the authority to decide which version to use in production.
* You don’t need to track any lifecycle stages besides development/production.

#### Initial setup

The model owner grants usage on the model to a production role.

```sqlexample
GRANT USAGE ON MODEL my_model TO ROLE prod_role;
```

When the model is initially logged, its sole version is the default, and that version is ready to be used.

> **Important:**
>
> A model must always have a default version. Under this scheme, then, you can’t designate a model as not yet having a
> production version. If you need to prevent models from being used before they’re ready, you might log an initial
> version that immediately throws an error. This version would remain the default until some other version is ready.

#### Promoting a model to production

When a new version, called `new_version` in the SQL below, has cleared the quality bar, designate it as the
default to mark it as the production version.

```sqlexample
ALTER MODEL my_model SET DEFAULT_VERSION = new_version;
```

#### Using the model in production

In production, call the model directly to use the default version.

```sqlexample
SELECT my_model!predict(...) ... ;
```

#### Development and testing

To use a pre-release version, call the desired model version by name:

```sqlexample
WITH my_version AS MODEL my_model VERSION new_version
    SELECT my_version!predict(...) ...;
```

### Using aliases

Many organizations manage model lifecycles using multiple stages, such as development, canary, staging, production, and
deprecation. Model versions can have [aliases](overview.md), user-defined
labels or tags that you can exclusively attach to any of a model’s versions. You can you use aliases to represent the
lifecycle stages your organization uses.

*Use this method when:*

* The model owner has the authority to make model lifecycle stage decisions.
* You want to track multiple lifecycle stages, not just development/production.

The example below uses two preproduction stages (`alpha` and `beta`) and one production stage (`production`).

#### Initial setup

The model owner grants usage on the model to a production role.

```sqlexample
GRANT USAGE ON MODEL my_model TO ROLE prod_role;
```

#### Promoting the initial version of the model

When you log the model, set the `production` alias to point to the first version, here named `v1`.

```sqlexample
ALTER MODEL my_model VERSION v1 SET ALIAS = production;
```

#### Managing preproduction versions

Initially, the model has no designated `alpha` or `beta` version. When you add a new version, initially designate it
`alpha`.

```sqlexample
ALTER MODEL my_model VERSION v2 SET ALIAS = alpha;
```

Later, to promote the new version to `beta`:

```sqlexample
ALTER MODEL my_model VERSION v2 UNSET ALIAS;
ALTER MODEL my_model VERSION v2 SET ALIAS = beta;
```

#### Promoting subsequent versions of the model

When a new version of a model has passed muster, remove the `production` alias from the current production version,
here `v1`, and apply it to the new version, here `v2`.

```sqlexample
ALTER MODEL my_model VERSION v1 UNSET ALIAS;
ALTER MODEL my_model VERSION v2 UNSET ALIAS;
ALTER MODEL my_model VERSION v2 SET ALIAS = production;
```

#### Using the model in production

Call the production version of the model through the `production` alias.

```sqlexample
WITH my_version AS MODEL my_model VERSION production
    SELECT my_version!predict(...) ...;
```

#### Development and testing

To use pre-release versions, call the model through the `alpha` or `beta` alias instead. For example, to test the
alpha version:

```sqlexample
WITH my_version AS MODEL my_model VERSION alpha
    SELECT my_version!predict(...) ...;
```

### Using tags

The default version and
alias lifecycle management schemes already described assume that the
model owner can manage model lifecycles. In many organizations, though, this responsibility rests with a separate production
engineering role, and data scientists don’t have the authority to promote model versions to production. Because models
are first-class Snowflake objects, you can apply [tags](../../../user-guide/object-tagging/introduction.md) to them for this purpose. Tags are
securable by role-based access control and are suitable for this separation of responsibility.

*Use this method when:*

* A role other than the model owner determines when to promote a model version from one lifecycle stage to the next.

#### Initial setup

The model owner grants usage on the model to a production role.

```sqlexample
GRANT USAGE ON MODEL my_model TO ROLE prod_role;
```

The production role also needs the ability to see the tags on a model and to read the tag values. Here, the
former is achieved by granting the broad APPLY TAG privilege on the account to the role. The latter is achieved
by granting the USAGE privilege on the schema.

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT APPLY TAG ON ACCOUNT TO ROLE prod_role;
GRANT USAGE ON SCHEMA model_database.model_schema TO ROLE prod_role;
```

To create tags, a role needs the CREATE TAG privilege on the schema.

Create a tag named `live_version` in a schema owned by `prod_role` to hold the name of the current production
version of the model.

```sqlexample
USE ROLE prod_role;
USE SCHEMA prod_db.prod_schema;

CREATE TAG live_version;
```

> **Note:**
>
> Here, the tag is created in the production schema, as it is managed by the production role. When using it in other
> schemas, use its fully-qualified name.

#### Promoting the initial version of the model

To make a model available in production, apply the `live_version` tag to the model, specifying the initial production
version as the value of the tag.

```sqlexample
USE ROLE prod_role;
USE SCHEMA prod_db.prod_schema;

ALTER MODEL model_database.model_schema.my_model
    SET TAG live_version = 'V1';
```

#### Promoting subsequent versions of the model

When a new version of the model is ready, update the `live_version` tag with the name of that version.

```sqlexample
USE ROLE prod_role;

ALTER MODEL model_database.model_schema.my_model
    SET TAG prod_db.prod_schema.live_version = 'V2';
```

#### Using the model in production

Call the production version of the model by retrieving the value of the `live_version` tag from the model using
[SYSTEM$GET_TAG](../../../sql-reference/functions/system_get_tag.md), then calling the model version that has that name. The following SQL
shows this two-step process.

> **Note:**
>
> The SQL domain of models, for use with SYSTEM$GET_TAG, is MODULE.

```sqlexample
-- get production model version from live_version tag
SET live_version = (SELECT
    SYSTEM$GET_TAG('prod_db.prod_schema.live_version', 'my_model', 'MODULE'));

-- call that version
WITH my_version AS MODEL my_model VERSION IDENTIFIER($live_version)
    SELECT my_version!predict(...) ... ;
```

#### Development and testing

For pre-release versions, you can use the same method with additional tags (such as `alpha_version` and
`beta_version`). In many organizations, however, only promotion to production is managed by engineering, and it’s
reasonable to manage pre-release stages using the simpler alias
method.

### Using multiple schemas

You can use multiple schemas to manage lifecycle stages. With this approach, code exclusively calls models in a
designated production schema, which holds only models being used in production. Models in other stages are stored
elsewhere. When a model version is ready for production, it is copied to the production schema. Because the production
models are separate objects with their own access control, you can protect them from accidental modification while
model developers have free rein over models in development stages.

*Use this method when:*

* A role other than the owner of the model promotes models to production.
* You want strong separation between development and production environments.

Note that the role that promotes models to production should have OWNERSHIP or READ privilege on the source model.

#### Initial setup

Create a role (called, for example, `ml_admin`) that has access to both the development and production schemas. In
this example, access to these two environments is encapsulated in existing roles named `model_owner` and
`prod_role`, which contain privileges like USAGE and CREATE MODEL on the development and production schemas,
respectively. The new `ml_admin` role gets the privileges it needs by being granted those roles.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE ml_admin;

USE ROLE model_owner;
GRANT ROLE model_owner TO ROLE ml_admin;

USE ROLE prod_role;
GRANT ROLE prod_role TO ROLE ml_admin;
```

#### Promoting the initial version of the model

Use the `ml_admin` role to copy model versions from the development schema to the production schema, performing the
initial copy using CREATE MODEL … FROM MODEL to copy just the desired version. You can use the same identifier for the
production version or establish a different numbering scheme for production. Here, development version `V12` becomes
production version `V1`.

```sqlexample
USE ROLE ml_admin;

CREATE MODEL prod_db.prod_schema.prod_model WITH VERSION V1
    FROM MODEL dev_db.dev_sch.dev_model VERSION V12;
```

After creating the initial production version of the model, grant USAGE or OWNERSHIP to the production
role based on need.

```sqlexample
USE ROLE ml_admin;

GRANT USAGE ON MODEL my_model TO ROLE prod_role;
```

#### Promoting subsequent versions of the model

When a new version of the model is ready for production, copy just the new model version to the production environment.
Here, development version `V24` becomes production version `V2`. `V2` is then set as the default version.

```sqlexample
USE ROLE ml_admin;

ALTER MODEL prod_db.prod_schema.prod_model ADD VERSION V2
    FROM MODEL dev_db.dev_schema.dev_model VERSION V24;

ALTER MODEL prod_db.prod_schema.prod_model
    SET DEFAULT_VERSION = V2;
```

> **Tip:**
>
> It’s a good idea to keep previous production versions in case you need to roll back, which you can do by
> setting the default version to a previous version, as shown below.
>
> ```sqlexample
> ALTER MODEL prod_db.prod_schema.prod_model SET DEFAULT_VERSION = V1;
> ```
>
> Establish a policy around how many old versions to keep and how long to keep them.

#### Using the model in production

In production, call the default version of the model.

```sqlexample
SELECT prod_model!predict(...) ... ;
```

#### Development and testing

To manage pre-release versions, you could use additional schemas, promoting versions from one stage to the next by
copying them from one schema to the next. If the owner of the model can manage pre-production stages, you could
use a simpler method such as aliases to manage these versions. Using
additional schemas still may be useful to segregate multiple pre-production environments, such as development and
testing, when one or more of these stages is managed by another role.

---
title: ML Lineage: Trace ML data flow
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/ml-lineage.md
section: Snowflake ML
---

# ML Lineage: Trace ML data flow

> **Note:**
>
> ML Lineage is available in the `snowflake-ml-python` package version 1.6.0 and later.

ML Lineage provides comprehensive tracing of data as it flows through your machine learning pipeline. This feature
enables you to track the lineage between various data artifacts, including source tables/views/stages, feature views,
datasets, registered models, and deployed model services. Additionally, ML Lineage captures the relationships between
cloned artifacts and artifacts of similar types, ensuring a complete view of data transformations and dependencies
within your pipeline. A possible pipeline is illustrated below:

The lineage relationships that can be tracked between the types of nodes in your pipeline are summarized in the table
below. Each row represents the source of the dependency, and each column represents the target. The intersection
of a row or column contains an icon indicating whether that relationship is captured by ML Lineage.

|  | Table/View/Stage | Feature View | Dataset | Model | Deployed Model Service |
| --- | --- | --- | --- | --- | --- |
| Table/View/Stage | ✔ | ✔ | ✔ | ✔ | - |
| Feature View | ✔ (only to table) | ✔ | ✔ | - | - |
| Dataset | ✔ | - | ✔ | ✔ | - |
| Model | ❌ | - | - | ✔ | ✔ |
| Deployed Model Service | ❌ | - | - | - | - |

* ✔: This relationship is captured by ML Lineage.
* ❌: This relationship is not yet captured by ML Lineage, but is on the roadmap.
* -: This combination of objects does not represent a relationship.

With ML Lineage, you can understand how machine learning artifacts relate to each other and can answer questions
like:

* Where did the data come from to train my model?
* What feature views does my dataset depend on?
* What models were trained on data from my dataset?
* Which services use my model?

Dive into the [Quick Start Notebook](https://github.com/Snowflake-Labs/sfguide-getting-started-with-snowflake-ml-lineage/blob/main/notebooks/0_start_here.ipynb)
to see how to use ML Lineage APIs. Follow up with more complete [end-to-end ML quickstart](https://quickstarts.snowflake.com/guide/develop-and-manage-ml-models-with-feature-store-and-model-registry/)
with Feature Store and Model Registry that incorporates ML Lineage in a full ML workflow.

## Limitations

* Tables and views created from model predictions do not currently capture the lineage relationship back to the model.
* Lineage information is not replicated at this time.

Snowflake intends to address these limitations in future releases of ML Lineage.

## Required privileges

Users need the VIEW LINEAGE privilege to explore lineage from Python APIs. This privilege is automatically granted to
the ACCOUNTADMIN role, which can then grant it to other roles at the account level. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT VIEW LINEAGE ON ACCOUNT TO ROLE test_role;
```

## Creating ML Lineage

Generally, Snowflake records lineage information when objects are created. Lineage for models is captured when the model
is logged to the Model Registry. Training a model using Snowpark ML automatically generates lineage records if the model
is trained from a Snowpark DataFrame.

Other scenarios, such as those listed below, can also generate lineage records with a little extra effort.

* Training a model using Snowpark MLfrom some other kind of data source (such as a pandas DataFrame).
* Training a model without using Snowpark ML or a Snowpark DataFrame.
* Training a model outside of Snowflake.

In these scenarios, you can still associate the source data object and the trained model by passing a Snowpark DataFrame
backed by the source data object as `sample_data` to the Model Registry’s `log_model` method, as shown below.

```python
registry.log_model(...,
          sample_input_data=df_backed_by_source_table)
```

> **Note:**
>
> Only objects created after the ML Lineage feature is enabled in your account contain lineage information.

## Querying ML Lineage

You can query the lineage of ML artifacts in several ways.

### Snowsight UI

Every artifact’s landing page has a Lineage tab. The default view displays upstream and downstream objects one step
away from the selected object. For a more detailed exploration of lineage within the Snowsight UI, see [Data Lineage](../../user-guide/ui-snowsight-lineage.md).

A sample of the Snowsight view of lineage data is shown below.

### Snowpark ML library

The Snowpark ML library (the `snowflake-ml-python` package) offers a user-friendly API on all Snowflake ML artifact
objects to explore lineage in both upstream and downstream directions. It returns connected artifact objects, and you
can chain API calls to further explore in the desired direction. This API works directly with Snowflake ML Python
objects. For more information, see Snowpark ML lineage API.

### Snowpark Python library

The Snowpark library provides a flexible API to explore data and ML lineage of supported Snowflake artifacts at greater depths
in the direction of your choice. It accepts domains and fully qualified names, returning a DataFrame with details of
connected artifacts. For more information, see
[snowflake.snowpark.lineage.Lineage.trace](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.16.0/api/snowflake.snowpark.lineage.Lineage.trace)
in the Snowpark Python API Reference.

### Snowflake SQL

The SQL function `SNOWFLAKE.CORE.GET_LINEAGE` can be used to query lineage information similar to the Snowpark
library. For more information, see [GET_LINEAGE (SNOWFLAKE.CORE)](../../sql-reference/functions/get_lineage-snowflake-core.md).

## Snowpark ML lineage API

The `lineage` method available on `FeatureView`, `ModelVersion`, and `Dataset` objects retrieves
lineage relationships for the current object, so you can trace the lineage of data objects retrieved from the Snowflake
Feature Store or Model Registry.

For all supported objects, the `lineage` method accepts the following arguments:

* `direction`, either `UPSTREAM` or `DOWNSTREAM`. `DOWNSTREAM` is the default.
* `domain_filter`, a list of target object types for which lineage will be retrieved. The default is to return all lineage relationships.
  The available domains are `"feature_view"`, `"dataset"`, `"model"`, `"table"`, and `"view"`.

The method returns a list of connected lineage nodes. These nodes can be instances of `Dataset`,
`FeatureView`, or `ModelVersion`, if you have imported these classes into your Python session. Otherwise,
each node is represented by a generic `LineageNode` instance.

### Examples

The following examples demonstrate how to answer common questions using the Snowpark ML lineage API.

* Given a model version, where did its training data come from?

  > ```python
  > model_version.lineage(direction="upstream")
  > ```
* Which feature views does a particular dataset depend on?

  > ```python
  > my_dataset.lineage(direction="upstream", domain_filter=["feature_view"])
  > ```
* Which models were trained on data from a given dataset?

  > ```python
  > my_dataset.lineage(direction="downstream", domain_filter=["model"])
  > ```

For more complete examples, see these resources:

* [ML Lineage Overview Notebook](https://github.com/Snowflake-Labs/snowflake-demo-notebooks/blob/main/ML%20Lineage%20Workflows/ML%20Lineage%20Workflows.ipynb)
* [End to end ML quickstart](https://quickstarts.snowflake.com/guide/develop-and-manage-ml-models-with-feature-store-and-model-registry/)

---
title: ML Observability: Monitoring model behavior over time
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-observability.md
section: Snowflake ML
---

# ML Observability: Monitoring model behavior over time

Model behavior can change over time due to input drift, stale training assumptions, and data pipeline issues, as well as
the usual factors, including changes to the underlying hardware and software and the fluid nature of traffic. ML
Observability allows you to track the quality of production models you have deployed via the Snowflake Model Registry
across multiple dimensions, such as performance, drift, and volume. Additionally, you can monitor model performance
across different segments of your data using string categorical columns.

Currently, the model monitor supports regression and binary classification models.

> **Note:**
>
> To dive in and start using ML Observability, see the [quickstart](https://quickstarts.snowflake.com/guide/getting-started-with-ml-observability-in-snowflake/).

## ML Observability workflow

When you use a model that has been logged in the Snowflake Model Registry for inference, you receive results in the form
of a Snowpark or pandas DataFrame, depending on the type of input DataFrame passed to the inference method. This data
typically originates in Snowflake. Even in cases where inference is run outside Snowflake, it is common to store the
results in Snowflake. ML Observability allows you to monitor your model’s performance in both of these scenarios by
working on the stored inference data. The typical workflow is shown below.

The monitoring logs store the inference data and the predictions so that the ML Observability feature can observe
changes in predictions over time. The monitoring logs are stored in a table that contains an ID, a timestamp, features,
predictions, and a ground truth label, which indicates whether a given row is a prediction or observed data. The basic
structure is shown below.

You must explicitly create a model monitor object for each model version you want to monitor. Each model version can
have exactly one monitor, and each monitor can monitor exactly one model version; they cannot be shared. The monitor
object automatically refreshes the monitor logs by querying source data and updates the monitoring reports based on the
logs.

Each monitor encapsulates the following information:

* The model version to monitor.
* The table in which the monitor logs are stored.
* The minimum time granularity at which data is stored (aggregation window), currently 1 day minimum.
* An optional baseline table for comparative metric operations such as drift.

## Prerequisites

Before you begin, make sure you have the following:

* A Snowflake account.
* Version 1.7.1 or later of the `snowflake-ml-python` Python package.
* Familiarity with the [Snowflake Model Registry](overview.md).

## Creating a model monitor

Create a model monitor using the CREATE MODEL MONITOR command. The model monitor must be created in the same schema as
the model version to be monitored. You must have the CREATE MODEL MONITOR privilege on the schema where the monitor is
created. You can create a maximum of 250 model monitors per account.

See [CREATE MODEL MONITOR](../../../sql-reference/sql/create-model-monitor.md) for more details on the CREATE MODEL MONITOR command.

> **Tip:**
>
> For details on other SQL commands that you can use with model monitors, see [Model monitor commands](../../../sql-reference/commands-model-monitor.md).

## Temporarily stopping and resuming monitoring

You can suspend (temporarily stop) a model monitor using ALTER MODEL MONITOR … SUSPEND. To resume monitoring,
issue ALTER MODEL MONITOR … RESUME.

### Automatic suspension on refresh failure

Model monitors automatically suspend refreshes when they encounter five consecutive refresh failures related to the
source tables. You can view the status and cause of refresh suspension using the
[DESCRIBE MODEL MONITOR](../../../sql-reference/sql/desc-model-monitor.md) command. The output includes the following columns, among others:

* `aggregation_status`: The value in this column is a JSON object. One or more of the values in this object will be SUSPENDED if the model monitor is suspended.
* `aggregation_last_error`: The value in this column is a JSON object that contains the specific SQL error that caused the suspension.

After resolving the root cause of the refresh failure, resume the monitor by issuing [ALTER MODEL MONITOR … RESUME](../../../sql-reference/sql/alter-model-monitor.md).

## Adding Segments to a model monitor

Model monitors support segmentation, which allows you to monitor model quality over time for specific subsets of your data in addition to monitoring the complete dataset.
Segments are used to group the data into logical units, such as different regions or different user groups.

### Creating monitors with segments

When creating a model monitor, you can specify segment columns using the SEGMENT_COLUMNS parameter. Segment columns must be string columns in your source data.

> **Important:**
>
> To create segments on numeric columns, bucket them into valid categories before you create the monitor. For example, you can transform a numeric `TEMPERATURE` column into categorical values like ‘COLD’ (< 32°F), ‘MODERATE’ (32-80°F), and ‘HOT’ (> 80°F) before using it as a segment column.

```sqlsyntax
CREATE [OR REPLACE] MODEL MONITOR [IF NOT EXISTS] <NAME> WITH
    --- all other existing parameters of CREATE MODEL MONITOR
    SEGMENT_COLUMNS = (<segment_column_name_array>)
```

For complete syntax and parameter details, see [CREATE MODEL MONITOR](../../../sql-reference/sql/create-model-monitor.md).

### Adding segments to existing or new monitors

You can add segment columns to existing monitors using the ALTER MODEL MONITOR command:

```sqlsyntax
ALTER MODEL MONITOR <NAME> ADD SEGMENT_COLUMN = <segment_column_name>
```

You can also remove segment columns from existing monitors:

```sqlsyntax
ALTER MODEL MONITOR <NAME> DROP SEGMENT_COLUMN = <segment_column_name>
```

For complete syntax and options, see [ALTER MODEL MONITOR](../../../sql-reference/sql/alter-model-monitor.md).

### Defining the segment in Monitoring Segments in the UI

You can configure and manage segments through the Monitoring Segments settings in the UI:

The segments settings interface allows you to define and configure which segments to monitor for your model.

### Choose the segment in the segments selector in the model monitor dashboard

In the model monitor dashboard, you can use the segments selector to view metrics for specific segments of your data:

### Performance considerations for segments

Performance depends on many factors, like number of features, number of segment columns, unique values per segment column, warehouse size, warehouse type, aggregation window, total rows, and rows per aggregation window.

* Performance impact of CREATE with SEGMENT_COLUMNS is directly proportional to the number of segment columns in the request
* If CREATE performance is slow with many segment columns, consider adding segment columns one at a time using the ALTER command
* Each segment column and value combination is independently queried, and there may be time differences in when data was last updated based on scheduling and other factors. But we try best to update all the data at the same time.

## Viewing monitoring reports

To view monitor reports, visit the ML Monitoring dashboard in Snowsight. In the navigation menu, select AI & ML » Models. The resulting list contains all the models in the Snowflake Model Registry in all the databases and schemas that your current role has access to.

Open a model’s details page by selecting the corresponding row in the Models list. The details page displays key
model information, including the model’s description, tags, versions, and monitors.

The Monitors list in the details page displays the list of model monitors, the model versions they are attached
to, their status, and when they were created.

Open a model monitor dashboard page by selecting the corresponding row in the Monitors list. The dashboard is populated
with graphs displaying key metrics of the model over time. The exact graphs displayed depend on the type of model the
monitor is based on (that is, binary classification or regression).

In the dashboard, you can take the following actions:

* Change the range of the graphs by clicking the time range selector.
* Change the graphs shown by clicking the Settings button. (Hover the mouse over a metric name to see more
  information about it.)
* Compare model monitors by clicking the Compare model selector drop-down.
* Display more information about the model monitor by selecting Display monitor details.

## Querying monitoring results

Each model monitor that you create has the following metrics:

* **Drift metrics**: Distribution changes or data shifts
* **Performance metrics**: Distribution changes or data shifts
* **Statistical metrics**: Counts or null values

To query the metrics computed by the monitor, use the [monitor metric functions](../../../sql-reference/functions-model-monitors.md). The metric functions get the metrics from the model monitor objects. You can use the results from the metric functions to create custom dashboards in Streamlit or other centralized monitoring tools.

> **Important:**
>
> You must have the following privileges to work with model monitor objects:
>
> | Command | Required privileges |
> | --- | --- |
> | CREATE MODEL MONITOR | * CREATE MODEL MONITOR privilege on the schema where you want to create the model * SELECT on data source (table or view) * USAGE on database, schema, warehouse, and model |
> | SHOW MODEL MONITORS | Any privilege on the model monitor |
> | DESCRIBE MODEL MONITOR | Any privilege on the model monitor |
> | ALTER MODEL MONITOR | MODIFY on the model monitor |
> | DROP MODEL MONITOR | OWNERSHIP on the model monitor |

Use the following SQL template to get the drift metric from your model monitor.

```sqlsyntax
SELECT *
FROM TABLE(MODEL_MONITOR_DRIFT_METRIC (
                                        <model_monitor_name>,
                                        <drift_metric_name>,
                                        <column_name>,
                                        <granularity>,
                                        <start_time>,
                                        <end_time>,
                                        <extra_args>
                                      )
          )
```

Use the following SQL template to get the performance metric from your model monitor.

```sqlsyntax
SELECT *
FROM TABLE(MODEL_MONITOR_PERFORMANCE_METRIC (
                                        <model_monitor_name>,
                                        <metric_name>,
                                        <granularity>,
                                        <start_time>,
                                        <end_time>,
                                        <extra_args>
                                      )
          )
```

Use the following SQL template to get the statistical metric from your model monitor.

```sqlsyntax
SELECT *
FROM TABLE(MODEL_MONITOR_STAT_METRIC (
                                        <model_monitor_name>,
                                        <metric_name>,
                                        <granularity>,
                                        <start_time>,
                                        <end_time>,
                                        <extra_args>
                                      )
          )
```

### Querying segment-specific metrics

To query metrics for specific segments, use the `<extra_args>` parameter with a JSON format that specifies the segment column and value. The `<extra_args>` parameter is optional - if not provided, the query returns metrics for all data (non-segment query).

> **Note:**
>
> Currently, segment queries support only 1 segment column:value pair per query. You cannot query multiple segments simultaneously in a single function call.

For segment queries, use this format for the `<extra_args>` parameter:

```sqlsyntax
'{"SEGMENTS": [{"column": "<segment_column_name>", "value": "<segment_value>"}]}'
```

For example, to get drift metrics for premium customers only:

```sqlsyntax
SELECT *
FROM TABLE(MODEL_MONITOR_DRIFT_METRIC (
                                        'my_customer_monitor',
                                        'PSI',
                                        'FEATURE_1',
                                        'DAY',
                                        '2024-01-01'::TIMESTAMP_NTZ,
                                        '2024-01-31'::TIMESTAMP_NTZ,
                                        '{"SEGMENTS": [{"column": "CUSTOMER_TIER", "value": "PREMIUM"}]}'
                                      )
          )
```

The result tables for segment queries include two additional columns:

* `SEGMENT_COLUMN`: Name of the segment column for which the metric is computed (or NULL for non-segment queries)
* `SEGMENT_VALUE`: Segment value for which the metric is computed (or NULL for non-segment queries)

For more information about segments, see Adding Segments to a model monitor.

You can set up alerts and notifications for your monitoring metrics. For more information, see [Alerts and Notifications](../../../guides-overview-alerts.md).

### Known limitations

The following limitations apply to model monitors:

* Monitors must reside in the same database and schema as the model version.
* Only single-output regression and binary classification models are supported.
* At least one prediction column (class or score) is required; actual columns are optional but needed for accuracy metrics.
* Drift calculation requires baseline data; without it, to add baseline data, you must drop the monitor and create it again.
* Each column can only be used once in the monitor. For example, you can’t use the same column as the ID column and the prediction column.
* Data can’t contain invalid values (nulls, NaNs, +/-Inf, probability scores outside 0-1, non-binary classes, or more than two classes in a PREDICTION_CLASS_COLUMNS column) to avoid monitor failure and suspension.
* Timestamp columns must be of type `TIMESTAMP_NTZ`; prediction and actual columns must be `NUMBER`.
* You must specify the aggregation windows in days.
* A maximum of 500 features can be monitored.
* Up to 250 monitors can be created.
* Segment columns must be string categorical columns only.
* A maximum of 5 segment columns per model monitor (hard limit).
* Each segment column should have fewer than 25 unique values (recommended limit).
* Segment values are case sensitive and special characters are not supported for segment queries.
* NULL filtering is not supported for segment queries.

### Cost considerations

Virtual warehouse compute:

> * Model monitors use a virtual warehouse, incurring costs during creation and each refresh.
> * Loading the Snowsight dashboard also uses a virtual warehouse, incurring additional charges.

Storage:

> * Model monitors materialize the source data into a table stored in your account.
> * Segment columns add additional materialized table stored in your account.

Cloud services compute:

> * Model monitors use cloud services compute to trigger refreshes when an underlying base object has changed. Cloud services compute cost is only billed if the daily cloud services cost is greater than 10% of the daily warehouse cost for the account.

---
title: MLFlow
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/mlflow.md
section: Snowflake ML
---

# MLFlow

You can use MLflow models that support PyFunc. If your MLFlow model has a signature, the `signature`
argument is inferred from the model. Otherwise, you must provide either `signature` or `sample_input_data`.

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `model_uri` | The URI of the artifacts of the MLFlow model. Must be provided if it is not available in the model’s metadata as `model.metadata.get_model_info().model_uri`. |
| `ignore_mlflow_metadata` | If `True`, the model’s metadata is not imported to the model object in the registry. Default: `False` |
| `ignore_mlflow_dependencies` | If `True`, the dependencies in the model’s metadata are ignored, which is useful due to package available limitations in Snowflake warehouses. Default: `False` |

## Example

```python
import mlflow
from sklearn import datasets, model_selection, ensemble

db = datasets.load_diabetes(as_frame=True)
X_train, X_test, y_train, y_test = model_selection.train_test_split(db.data, db.target)
with mlflow.start_run() as run:
    rf = ensemble.RandomForestRegressor(n_estimators=100, max_depth=6, max_features=3)
    rf.fit(X_train, y_train)

    # Use the model to make predictions on the test dataset.
    predictions = rf.predict(X_test)
    signature = mlflow.models.signature.infer_signature(X_test, predictions)
    mlflow.sklearn.log_model(
        rf,
        "model",
        signature=signature,
    )
    run_id = run.info.run_id

model_ref = registry.log_model(
    mlflow.pyfunc.load_model(f"runs:/{run_id}/model"),
    model_name="mlflowModel",
    version_name="v1",
    conda_dependencies=["mlflow<=2.4.0", "scikit-learn", "scipy"],
    options={"ignore_mlflow_dependencies": True}
)
model_ref.run(X_test)
```

---
title: Model Explainability
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-explainability.md
section: Snowflake ML
---

# Model Explainability

During the training process, machine learning models infer relationships between inputs and outputs, rather than
requiring that these relationships be stated explicitly up front. This allows ML techniques to tackle complicated
scenarios involving many variables without extensive setup, particularly where the causal factors of a particular
outcome are complex or unclear, but the resulting model can be something of a black box. If a model underperforms, it
can be difficult to understand why, and furthermore how to improve its performance. The black box model can also conceal
implicit biases and fail to establish clear reasons for decisions. Industries that have regulations around trustworthy
systems, like finance and healthcare, might require stronger evidence that the model is producing the correct results
for the right reasons.

To help address such concerns, the Snowflake Model Registry includes an explainability function based on
[Shapley values](https://towardsdatascience.com/the-shapley-value-for-ml-models-f1100bff78d1). Shapley values are a
way to attribute the output of a machine learning model to its input features. By considering all possible combinations of
features, Shapley values measure the average marginal contribution of each feature to the model’s prediction. This
approach ensures fairness in attributing importance and provides a solid foundation for understanding complex models.
While computationally intensive, the insights gained from Shapley values are invaluable for model interpretability and
debugging.

For example, assume we have a model for predicting the price of a house, which was trained on the homes’ size, location,
number of bedrooms, and whether pets are allowed. In this example, the average price of houses is $100,000, and the final
prediction of the model was $250,000 for a house that is 2000 square feet, beachside, three bedrooms, and doesn’t allow
pets. Each of these feature values might contribute to the final model prediction as shown in the following table.

| Feature | Value | Contribution vs. an average house |
| --- | --- | --- |
| Size | 2000 | +$50,000 |
| Location | Beachside | +$75,000 |
| Bedrooms | 3 | +$50,000 |
| Pets | No | -$25,000 |

Together, these contributions explain why this particular house is priced $150,000 higher than an average home. Shapley
values can affect the final outcome positively or negatively, adding up to a difference of outcomes compared to an
average. In this example, it is less desirable to live in a house where pets are not allowed, so that feature value’s
contribution is -$25,000.

The average value is calculated using background data, a representative sample of the entire dataset. For more information,
see Logging models with background data.

## Supported model types

This preview release supports the following Python-native model packages.

* XGBoost
* CatBoost
* LightGBM
* Scikit-learn

The following Snowpark ML modeling classes from `snowflake.ml.modeling` are supported.

* XGBoost
* LightGBM
* Scikit-learn

Explainability is available by default for the above models logged using Snowpark ML 1.6.2 and later. The
implementation uses the [SHAP library](https://pypi.org/project/shap/).

## Logging models with background data

Background data, typically a sample of representative data, is an important ingredient of Shapley value-based
explanations. Background data gives the Shapley algorithm an idea of what “average” inputs look like to which it can
compare individual explanations.

The Shapley value is computed by systematically perturbing input features and replacing them with the background data.
Because it reports deviation from background data, it is important to use consistent background data when comparing
Shapley values from multiple data sets.

Some tree-based models implicitly encode background data within their structure during training, and may not require
explicit background data. Most models, however, require background data to be provided separately for useful
explanations, and all models (including tree-based models) can be explained more accurately if you provide background
data.

You can provide up to 1,000 rows of background data when logging a model by passing it in the `sample_input_data`
parameter, as shown below.

> **Note:**
>
> If the model is a type that requires explicit background data to calculate Shapley values, explainability cannot be
> enabled without this data.

```python
mv = reg.log_model(
    catboost_model,
    model_name="diamond_catboost_explain_enabled",
    version_name="explain_v0",
    conda_dependencies=["snowflake-ml-python"],
    sample_input_data = xs, # xs will be used as background data
)
```

You can also provide background data while logging the model with a signature, as shown below.

```python
mv = reg.log_model(
    catboost_model,
    model_name="diamond_catboost_explain_enabled",
    version_name="explain_v0",
    conda_dependencies=["snowflake-ml-python"],
    signatures={"predict": predict_signature, "predict_proba": predict_proba_signature},
    sample_input_data = xs, # xs will be used as background data
    options= {"enable_explainability": True} # you will need to set this flag in order to pass both signatures and background data
)
```

## Retrieving explainability values

Models with explainability have a method named `explain` that returns the Shapley values for the model’s features.

Because Shapley values are explanations of predictions made from specific inputs, you must pass input data to `explain` to
generate the predictions to be explained.

The Snowflake model version object will have a method called `explain`, and you call it using `ModelVersion.run` in Python.

```python
reg = Registry(...)
mv = reg.get_model("Explainable_Catboost_Model").default
explanations = mv.run(input_data, function_name="explain")
```

The following is an example of retrieving the explanation in SQL.

```sqlexample
WITH MV_ALIAS AS MODEL DATABASE.SCHEMA.DIAMOND_CATBOOST_MODEL VERSION EXPLAIN_V0
SELECT *,
      FROM DATABASE.SCHEMA.DIAMOND_DATA,
          TABLE(MV_ALIAS!EXPLAIN(CUT, COLOR, CLARITY, CARAT, DEPTH, TABLE_PCT, X, Y, Z));
```

> **Important:**
>
> If you are using `snowflake-ml-python` prior to version 1.7.0, you may receive the error `UnicodeDecodeError: 'utf-8' codec can't decode byte` with XGBoost models.
> This is due to an incompatibility between version 0.42.1 of the [SHAP library](https://pypi.org/project/shap/) and the latest XGBoost version (2.1.1) supported by Snowflake.
> If you cannot upgrade `snowflake-ml-python` to version 1.7.0 or later, downgrade the XGBoost version to 2.0.3 and log the model with the `relax_version` option set to `False`,
> as shown in the following example.
>
> ```python
> mv_new = reg.log_model(
>     model,
>     model_name="model_with_explain_enabled",
>     version_name="explain_v0",
>     conda_dependencies=["snowflake-ml-python"],
>     sample_input_data = xs,
>     options={"relax_version": False}
> )
> ```

## Adding explainability to existing models

Models that were logged in the registry using a version of Snowpark ML older than 1.6.2 do not have the explainability
feature. Since model versions are immutable, you must create a new model version to add explainability to an existing
model. You can use `ModelVersion.load` to retrieve the Python object representing the model’s implementation, then log
that to the registry as a new model version. Be sure to pass your background data as `sample_input_data`. This
approach is shown below.

> **Important:**
>
> The Python environment into which you load the model must be exactly the same (that is, the same version of Python
> and of all libraries) as the environment where the model is deployed. For details, see
> [Loading a model version](overview.md).

```python
mv_old = reg.get_model("model_without_explain_enabled").default
model = mv_old.load()
mv_new = reg.log_model(
    model,
    model_name="model_with_explain_enabled",
    version_name="explain_v0",
    conda_dependencies=["snowflake-ml-python"],
    sample_input_data = xs
)
```

## Logging models without explainability

Explainability is enabled by default if the model supports it. To log a model version in the registry without
explainability, pass `False` for the `enable_explainability` option when logging the model, as shown here.

```python
mv = reg.log_model(
    catboost_model,
    model_name="diamond_catboost_explain_enabled",
    version_name="explain_v0",
    conda_dependencies=["snowflake-ml-python"],
    sample_input_data = xs,
    options= {"enable_explainability": False}
)
```

---
title: Model Inference in Snowflake
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/inference-overview.md
section: Snowflake ML
---

# Model Inference in Snowflake

Snowflake uses two distinct compute engines:

* The warehouse (SQL Engine)
* Snowpark Container Service

The Snowflake Model Registry provides a unified interface to both engines. The optimal environment for your use cases depends on your latency, data type, and scaling requirements. Snowflake offers the following approaches to your inference workflows:

**Real-time Inference (REST API):** Designed for low-latency and real-time use cases. Requests are facilitated via HTTP endpoints and are ideal for powering external applications.

**Snowflake Native Batch Inference (SQL):** Designed for batch workloads that require integration with the Snowflake SQL ecosystem. For example, batch workloads can integrate with Dynamic Tables, Snowpark, DBT, and User Tasks. You can use a SQL function, you can embed intelligence directly into your existing data pipelines without moving data or managing external infrastructure.

**Job-based Batch Inference:** This approach is designed for high-throughput, distributed processing where inference is treated as a standalone compute stage. By decoupling inference from the SQL engine, you can optimize for both price and performance. You can use Batch Inference to help you handle massive datasets or navigate complex computational requirements. This is ideal for processing files—such as images, video, and audio—directly from Snowflake Stages.

## When to choose

Use the following table to align your specific workload requirements with the correct compute pattern.

| Feature | Real-Time Inference (SPCS) | Native Batch Inference (SQL) | Job-Based Batch (SPCS) |
| --- | --- | --- | --- |
| Primary Goal | Interactive Responses: Low-latency, sub-second feedback for live users. | Inline Intelligence: Seamlessly embedding models into SQL data pipelines. | Standalone Processing: Large-scale, decoupled compute for unstructured data. |
| Best For… | • Web/Mobile app backends.  • Real-time user interactions.  • High-concurrency request spikes. | • Upstream pipelines (Dynamic Tables, Snowpark).  • SQL-first users (Analysts/DEs).  • Tools like dbt. | • Processing files (Images, Video, Audio).  • Large-scale historical backfills.  • Multi-modal data processing. |
| Data Source | Small inputs passed via HTTP payload. | Data residing in Snowflake Tables. | Data residing in Snowflake Stages (Files). |
| Scalability | Horizontal autoscaling to meet request volume. | Serverless scaling via Virtual Warehouses. | High-throughput distributed processing for bulk data. |
| Key Advantage | Zero-Ops Complexity: Snowflake handles container orchestration, ingress, and security patching automatically. | Zero Infrastructure: Treat your model like a native SQL function. | Cost Optimization: Significant efficiency for distinct, high-volume compute stages. |

---
title: Model Training and Inference
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/modeling.md
section: Snowflake ML
---

# Model Training and Inference

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

## Generating tables for training

You can generate a training data set with the feature store’s `generate_training_set` method, which enriches a
Snowpark DataFrame that contains the source data with the derived feature values. To select a subset of features from a
feature view, use `fv.slice`.

For time-series features, provide the timestamp column name to automate the point-in-time feature value lookup.

```python
training_set = fs.generate_training_set(
    spine_df=MySourceDataFrame,
    features=[registered_fv],
    save_as="data_20240101",                    # optional
    spine_timestamp_col="TS",                   # optional
    spine_label_cols=["LABEL1", "LABEL2"],      # optional
    include_feature_view_timestamp_col=False,   # optional
)
```

> **Note:**
>
> Here, the `spine_df` (`MySourceDataFrame`) is a DataFrame containing the entity IDs in source data, the time
> stamp, label columns, and additional columns containing training data. Requested features are retrieved for the list
> of entity IDs, with point-in-time correctness with respect to the provided time stamp.

Training sets are ephemeral by default; they exist only as Snowpark DataFrames and are not materialized. To materialize
the training set to a Table, specify the argument `save_as` with a valid, non-existing table name. The training set
is written to the newly created table.

Materialized tables currently don’t guarantee immutability and have limited metadata support. If you require these
features, consider using Snowflake Datasets instead.

> **Note:**
>
> The `generate_training_set` API is available in `snowflake-ml-python` version `1.5.4` or later.

## Generating Snowflake Datasets for training

You can generate a [Snowflake Dataset](../dataset.md) using the feature store’s
`generate_dataset` method. The method signature is similar to `generate_training_set`; the key
differences are the required `name` argument, optional `version` argument, and additional metadata fields.
`generate_dataset` always materializes the result.

Snowflake Datasets provide an immutable, file-based snapshot of data, which helps to ensure model reproducibility
and efficient data ingestion for large datasets and/or distributed training. Datasets also have expanded metadata
support for easier discoverability and consumption.

The following code illustrates the generation of a dataset from a feature view:

```python
dataset: Dataset = fs.generate_dataset(
    name="MY_DATASET",
    spine_df=MySourceDataFrame,
    features=[registered_fv],
    version="v1",                               # optional
    spine_timestamp_col="TS",                   # optional
    spine_label_cols=["LABEL1", "LABEL2"],      # optional
    include_feature_view_timestamp_col=False,   # optional
    desc="my new dataset",                      # optional
)
```

## Model training

After creating a training data set, you can pass it to your model when training as follows.

If you generated a Snowpark DataFrame, pass it directly to your model:

```python
my_model = train_my_model(training_set)
```

If you generated a Snowflake Dataset, convert it to a Snowpark DataFrame and pass it to your model:

```python
my_model = train_my_model(dataset.read.to_snowpark_dataframe())
```

Once trained, the model can be logged in the [Snowflake Model Registry](../model-registry/overview.md).

## Retrieving features and making predictions

If you created a model in your Python session, you can retrieve the feature view from the feature store and pass
it to your model for prediction, as shown here:

```python
prediction_df: snowpark.DataFrame = fs.retrieve_feature_values(
    spine_df=prediction_source_dataframe,
    features=[registered_fv],
    spine_timestamp_col="TS",
    exclude_columns=[],
)

# predict with your previously trained model
my_model.predict(prediction_df)
```

You can exclude specified columns using the `exclude_columns` argument, or include the timestamp column from the
feature view by setting `include_feature_view_timestamp_col`.

---
title: Notebooks on Container Runtime
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/notebooks-on-spcs.md
section: Snowflake ML
---

# Notebooks on Container Runtime

## Overview

You can run Snowflake Notebooks on Container Runtime. Container Runtime is powered by Snowpark Container Services,
giving you a flexible container infrastructure that supports building and
operationalizing a wide variety of workflows entirely within Snowflake. Container Runtime
provides software and hardware options to support advanced data science and machine learning workloads.
Compared to [virtual warehouses](../../user-guide/warehouses.md), Container Runtime provides a more flexible
compute environment where you can install packages from multiple sources and select compute resources, including GPU
machine types, while still running SQL queries on warehouses for optimal performance.

This document describes some considerations for using notebooks on [Snowflake Container Runtime](container-runtime-ml.md).
You can also try the
[Getting Started with Snowflake Notebook Container Runtime](https://quickstarts.snowflake.com/guide/notebook-container-runtime/)
quickstart to learn more about using the Container Runtime in your development.

## Prerequisites

Before you start using Snowflake Notebooks on Container Runtime, the ACCOUNTADMIN role must complete the notebook setup steps for creating
the necessary resources and granting privileges to those resources. For detailed steps, see [Administrator setup](../../user-guide/ui-snowsight/notebooks-setup.md).

## Create a notebook on Container Runtime

When you create a notebook on Container Runtime, you choose a warehouse, runtime, and compute pool to provide the
resources to run your notebook. The runtime you choose gives you access to different Python packages
based on your use case. Different warehouse sizes or compute pools have different cost and
performance implications. All of these settings can be changed later if needed.

> **Note:**
>
> A user with the ACCOUNTADMIN, ORGADMIN, or SECURITYADMIN roles cannot directly create or own a notebook on Container Runtime. Notebooks created or
> directly owned by these roles will fail to run. However, if a notebook is owned by a role that the ACCOUNTADMIN, ORGADMIN, or SECURITYADMIN
> roles inherit privileges from, such as the PUBLIC role, then you can use those roles to run that notebook.

To create a Snowflake Notebook to run on Container Runtime, follow these steps:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. Select + Notebook.
4. Enter a name for your notebook.
5. Select a database and schema in which to store your notebook. These cannot be changed after you create the notebook.

   > **Note:**
   >
   > The database and schema are only required for storing your notebooks. You can query any database and schema your role has access to
   > from within your notebook.
6. Select Run on container for the Runtime.
7. Select the Runtime version from the CPU or GPU options.
8. Select a Compute pool.
   :   Snowflake automatically provisions two compute pools in each account for running notebooks: SYSTEM_COMPUTE_POOL_CPU and SYSTEM_COMPUTE_POOL_GPU.
9. Change the selected warehouse to use to run SQL and Snowpark queries.
10. To create and open your notebook, select Create.

Runtime version:

> Two runtime version types are available: CPU and GPU. Each runtime image contains a base set of Python packages and versions verified and
> integrated by Snowflake. All runtime images support data analysis, modeling, and training with Snowpark Python, Snowflake ML, and Streamlit.
>
> To install additional packages from a public repo, you can use pip. An external access integration (EAI) is required for
> Snowflake Notebooks to install packages from external endpoints. To configure EAIs, see [Set up external access for Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks-external-access.md).
> However, if a package is already part of the base image, then you can’t change the version on the package by installing a different
> version with pip install. For a list of the pre-installed packages, run the following command from a cell in the notebook:
>
> ```none
> !pip freeze
> ```

Compute pool:

> A compute pool provides the compute resources for your notebook kernel and Python code. Use smaller, CPU-based compute pools to
> get started, and select higher-memory, GPU-based compute pools to optimize for intensive GPU usage scenarios like computer
> vision or LLMs/VLMs.
>
> Note that each compute node is limited to running one notebook per user at a time. You should set the MAX_NODES parameter to a
> value greater than one when creating compute pools for notebooks. For an example, see [Compute resources](../../user-guide/ui-snowsight/notebooks-setup.md). For
> more details on Snowpark Container Services compute pools, see [Snowpark Container Services: Working with compute pools](../snowpark-container-services/working-with-compute-pool.md).
>
> When a notebook is not being used, consider shutting it down to free up node resources. You can shut down a notebook by selecting
> End session from the connection dropdown.
>
> If a notebook runs on Container Runtime, the role needs the USAGE privilege on a compute pool instead of on the Notebook warehouse.
> Compute pools are CPU-based or GPU-based virtual machines managed by Snowflake. When creating a compute pool, set the MAX_NODES parameter to greater than one because each notebook will require one full node to run.
> For information, see [Snowpark Container Services: Working with compute pools](../snowpark-container-services/working-with-compute-pool.md).
>
> You can view your resource utilization. For more information, see [About Legacy Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md).

> **Note:**
>
> On AWS, notebooks running on GPU compute pools use high performance NVMe storage as the
> default boot device.

## Run a notebook on Container Runtime

After you create your notebook, you can start running code immediately by adding and running cells.
For information about adding cells, see [Develop and run code in Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks-develop-run.md).

### Importing more packages

In addition to pre-installed packages to get your notebook up and running, you can install packages from public sources
that you have external access set up for. You can also use packages stored in a stage or a private repository. You need to use the
ACCOUNTADMIN role or a role that can create external access integrations (EAIs) to set up and grant you access for visiting
specific external endpoints. Use the [ALTER NOTEBOOK](../../sql-reference/sql/alter-notebook.md) command to enable external access on your notebook. Once granted,
you will see the EAIs in Notebook settings. Toggle the EAIs before you start installing from external channels.
For instructions, see [Configure a notebook with external access and secrets](../../user-guide/ui-snowsight/notebooks-external-access.md).

The following example installs an external package using pip install in a code cell:

```none
!pip install transformers scipy ftfy accelerate
```

### Updating notebook settings

You can update settings, such as which compute pools or warehouse to use, any time in Notebook settings, which can be accessed
through the  **Notebook actions** menu at the top right.

One of the settings you can update in Notebook settings is the idle timeout setting. The default for idle timeout is 1 hour, and you
can set it for up to 72 hours. To set this in SQL, use the [CREATE NOTEBOOK](../../sql-reference/sql/create-notebook.md) or
[ALTER NOTEBOOK](../../sql-reference/sql/alter-notebook.md) command to set the IDLE_AUTO_SHUTDOWN_TIME_SECONDS property of the notebook.

### Installing private packages

Pip supports the installation of packages from private sources with [basic authentication](https://pip.pypa.io/en/stable/topics/authentication/#basic-http-authentication),
such as JFrog Artifactory. Configure the notebook for external access integration (EAI) so it can access the repository.

1. Create a network rule to specify the repository you want to access. For example, this network rule specifies a JFrog repository:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE jfrog_network_rule
     MODE = EGRESS
     TYPE = HOST_PORT
     VALUE_LIST = ('<your-repo>.jfrog.io');
   ```
2. Create a secret that represents credentials required to authenticate with the external network location.

   ```sqlexample
   CREATE OR REPLACE SECRET jfrog_token
     TYPE = GENERIC_STRING
     SECRET_STRING = '<your-jfrog-token>';
   ```
3. [Create an external access integration](../external-network-access/creating-using-external-network-access.md) that allows repository access:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION jfrog_integration
     ALLOWED_NETWORK_RULES = (jfrog_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (jfrog_token)
     ENABLED = TRUE;

   GRANT USAGE ON INTEGRATION jfrog_integration TO ROLE data_scientist;
   ```
4. Associate the external access integration and secret with the notebook.

   ```sqlexample
   ALTER NOTEBOOK my_notebook
     SET EXTERNAL_ACCESS_INTEGRATIONS = (jfrog_integration),
       SECRETS = ('jfrog_token' = jfrog_token);
   ```
5. To access the external access configuration, select the  (Notebook actions menu) on the top right of your notebook.
6. Select Notebook settings, and then select the External access tab.
7. Select the EAI to connect to the repository.

   The notebook restarts.
8. Once the notebook has restarted, you can install from the repository:

   ```none
   !pip install hello-jfrog --index-url https://<user>:<token>@<your-repo>.jfrog.io/artifactory/api/pypi/test-pypi/simple
   ```

### Installing private packages with private connectivity

If your private package repository requires private connectivity, follow these steps to configure your account. If you need assistance, you can coordinate with your account administrator to set up the network rule.

1. Follow the steps in [Network egress using private connectivity](../snowpark-container-services/service-network-communications.md) to set up network egress using private connectivity.
2. Create a secret that represents credentials required to authenticate with the external network location.

   ```sqlexample
   CREATE OR REPLACE SECRET jfrog_token
     TYPE = GENERIC_STRING
     SECRET_STRING = '<your-jfrog-token>';
   ```
3. Create an EAI with the network rule from step 1. For example:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION private_repo_integration
     ALLOWED_NETWORK_RULES = (PRIVATE_LINK_NETWORK_RULE)
     ALLOWED_AUTHENTICATION_SECRETS = (jfrog_token)
     ENABLED = TRUE;

   GRANT USAGE ON INTEGRATION private_repo_integration TO ROLE data_scientist;
   ```
4. Associate the external access integration and secret with the notebook.

   ```sqlexample
   ALTER NOTEBOOK my_notebook
     SET EXTERNAL_ACCESS_INTEGRATIONS = (jfrog_integration),
       SECRETS = ('jfrog_token' = jfrog_token);
   ```
5. To access the external access configuration, select the  (Notebook actions menu) on the top right of your notebook.
6. Select Notebook settings, and then select the External access tab.
7. Select the EAI to connect to your private repository.

   The notebook restarts.
8. After the notebook has restarted, you can provide the `--index-url` of your repository:

   ```none
   !pip install my_package --index-url https://my-private-repo-url.com/simple
   ```

## Running ML workloads

Notebooks on Container Runtime are well suited for running ML workloads such as model training and parameter tuning. Runtimes come
pre-installed with popular ML packages. With external integration access set up, you can install any other packages you need using `!pip install`.

For an optimal experience, use OSS libraries to develop model or to import notebooks that use OSS components. The Container Runtime has optimized APIs such as the following:

* `DataConnector` for faster data ingestion
* Distributed training APIs for scalable model fitting
* Distributed hyperparameter tuning APIs to efficiently utilize all available resources.

For more information, see [Snowflake Container Runtime](container-runtime-ml.md).

> **Note:**
>
> Because the runtime comes pre-installed with many packages, a change to any version requires a kernel restart.
> For more information, see [Explore Legacy Notebooks](../../user-guide/ui-snowsight/notebooks.md).

### Use OSS ML libraries

The following example uses an OSS ML library, `xgboost`, with an active Snowpark session to fetch data directly into memory for training:

```python
from snowflake.snowpark.context import get_active_session
import pandas as pd
import xgboost as xgb
from sklearn.model_selection import train_test_split

session = get_active_session()
df = session.table("my_dataset")
# Pull data into local memory
df_pd = df.to_pandas()
X = df_pd[['feature1', 'feature2']]
y = df_pd['label']
# Split data into test and train in memory
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15, random_state=34)
# Train in memory
model = xgb.XGBClassifier()
model.fit(X_train, y_train)
# Predict
y_pred = model.predict(X_test)
```

### Limitations

After a Container Runtime notebook session starts, it can run up to seven days without disruption. After seven days, it may be disrupted and shut down if there is a scheduled SPCS service maintenance event. The notebook idle time settings still apply. For details on SPCS service maintenance, see [Compute pool maintenance](../snowpark-container-services/working-with-compute-pool.md).

## Cost and billing considerations

When running notebooks on Container Runtime, you may incur both [warehouse compute](../../user-guide/cost-understanding-compute.md) and
[SPCS compute costs](../snowpark-container-services/accounts-orgs-usage-views.md). Warehouses are required not only for
executing queries but also for supporting certain frontend functionality in Snowflake Notebooks. For example, when using a compute pool for
Python execution, a warehouse may still be needed for rendering outputs or handling interactive components.

Snowflake Notebooks rely on virtual warehouses to efficiently run SQL and Snowpark queries. As a result, you may incur warehouse compute costs when
executing SQL cells or Snowpark push-down queries in Python cells.

The following diagram shows where compute happens for SQL, Snowpark, and Python cells within a notebook:

> **Note:**
>
> When you execute a notebook that uses a compute pool, the Python code runs on the compute pool. However, you might see activity in
> Query History showing that a warehouse was used to run the [EXECUTE NOTEBOOK](../../sql-reference/sql/execute-notebook.md) command. This is expected behavior.
> The warehouse is used briefly to initialize the execution environment but does not consume any warehouse credits. All code execution is handled
> by the compute pool.

For example, the following Python example uses the [xgboost](https://xgboost.readthedocs.io/en/stable/) library.
The data is pulled into the container and compute is handled entirely by Snowpark Container Services:

```python
from snowflake.snowpark.context import get_active_session
import pandas as pd
import xgboost as xgb
from sklearn.model_selection import train_test_split

session = get_active_session()
df = session.table("my_dataset")
# Pull data into local memory
df_pd = df.to_pandas()
X = df_pd[['feature1', 'feature2']]
y = df_pd['label']
# Split data into test and train in memory
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15, random_state=34)
```

To learn more about warehouse costs, see [Overview of warehouses](../../user-guide/warehouses-overview.md).

---
title: Parallel Hyperparameter Optimization (HPO) on Container Runtime
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-hpo.md
section: Snowflake ML
---

# Parallel Hyperparameter Optimization (HPO) on Container Runtime

The Snowflake ML Hyperparameter Optimization (HPO) API is a model-agnostic framework that enables efficient,
parallelized hyperparameter tuning of models.
You can use any open-source framework or algorithm. You can also use Snowflake ML APIs.

You can use HPO in a Snowflake Notebook that’s configured to use the Container Runtime on Snowpark
Container Services (SPCS). After you [create such a notebook](notebooks-on-spcs.md), you can:

* Train a model using any open source package, and use this API to distribute the hyperparameter tuning process
* Train a model using Snowflake ML distributed training APIs, and scale HPO while also scaling each of the training runs

The HPO workload that you initiate from your notebook executes inside Snowpark Container Services on either CPU or GPU
instances. The workload scales out to the CPU or GPU cores that are available on a single node in the SPCS compute pool.

The parallelized HPO API provides the following benefits:

* A single API that automatically handles all the complexities of distributing the training across multiple resources
* The ability to train with virtually any framework or algorithm using open-source ML frameworks or the Snowflake ML
  modeling APIs
* A selection of tuning and sampling options, including Bayesian and random search algorithms along with various
  continuous and non-continuous sampling functions
* Tight integration with the rest of Snowflake; for example efficient data ingestion via Snowflake Datasets or Dataframes
  and automatic ML lineage capture

> **Note:**
>
> You can scale the HPO run to use multiple nodes in the SPCS compute pool.
> For more information, see [Running a workload on a multi-node cluster](container-runtime-multi-node.md).

## Optimize a model’s hyperparameters

Use Snowflake ML HPO API to tune a model. The following steps illustrate the process:

1. Ingest the data.
2. Use the search algorithm to define the strategy used to optimize the hyperparameters.
3. Define how the hyperparameters are sampled.
4. Configure the tuner.
5. Get the hyperparameters and training metrics from each training job.
6. Initiate the training job.
7. Get the training job results.

The following sections walk through the preceding steps. For an example, see [Container Runtime HPO Example](https://github.com/Snowflake-Labs/sf-samples/blob/main/samples/ml/container_runtime_hpo/hpo_example.ipynb).

### Ingest the data

Use the `dataset_map` object to ingest the data into the HPO API. The `dataset_map` object is a dictionary that pairs
the training or test dataset with its corresponding Snowflake DataConnector object. The `dataset_map` object is passed to the
training function. The following is an example of a `dataset_map` object:

```python
dataset_map = {
  "train": DataConnector.from_dataframe(session.create_dataframe(X_train)),
  "test": DataConnector.from_dataframe(session.create_dataframe(X_test)),
  ),
}
```

### Define the search algorithm

Define the search algorithm used to explore the hyperparameter space. The algorithm uses the outcomes of previous trials to determine how to configure the hyperparameters.
You can use the following search algorithms:

* Grid search

  Explores a grid for hyperparamter values that you define. The HPO API evaluates every possible combination of hyperparameters.
  The following is an example of a hyperparameter grid:

  ```python
  search_space = {
      "n_estimators": [50, 51],
      "max_depth": [4, 5]),
      "learning_rate": [0.01, 0.3],
  }
  ```

  In the preceding example, each parameter has two possible values. There are 8 (2 \* 2 \* 2) possible combinations of hyperparameters.
* Bayesian optimization

  Uses a probabilistic model to determine the next set of hyperparameters to evaluate. The algorithm uses the outcomes of previous trials to determine how to configure the hyperparameters. For more information about Bayesian optimization, see [Bayesian Optimization](https://github.com/bayesian-optimization/BayesianOptimization).
* Random search

  Randomly samples the hyperparameter space. It’s a simple and effective approach that works particularly well with large or mixed (continuous or discrete) search spaces.

You can use the following code to define the search algorithm:

```python
from snowflake.ml.modeling.tune.search import BayesOpt, RandomSearch, GridSearch
search_alg = BayesOpt()
search_alg = RandomSearch()
search_alg = GridSearch()
```

### Define hyperparameter sampling

Use the search space functions to define the hyperparameter sampling method during each trial. Use them to describe the range and type of the values that the hyperparameters can take.

The following are the available sampling functions:

* uniform(`lower`, `upper`):
  Samples a continuous value uniformly between lower and upper. Useful for parameters like dropout rates or regularization strengths.
* loguniform(`lower`, `upper`):
  Samples a value in logarithmic space, ideal for parameters that span several orders of magnitude (e.g., learning rates).
* randint(`lower`, `upper`):
  Samples an integer uniformly between lower (inclusive) and upper (exclusive). Suitable for discrete parameters like the number of layers.
* choice(options):
  Randomly selects a value from a provided list. Often used for categorical parameters.

The following is an example of how you can define the search space with the uniform function:

```python
search_space = {
    "n_estimators": tune.uniform(50, 200),
    "max_depth": tune.uniform(3, 10),
    "learning_rate": tune.uniform(0.01, 0.3),
}
```

### Configure the tuner

Use the `TunerConfig` object to configure the tuner. Within the object, you specify the metric being optimized, the optimization mode, and the other execution parameters. The following are the available configuration options:

* Metric
  The performance metric, such as accuracy or loss that you’re optimizing.
* Mode
  Determines whether the objective is to maximize or minimize the metric (`"max"` or `"min"`).
* Search Algorithm
  Specifies the strategy for exploring the hyperparameter space.
* Number of Trials
  Sets the total number of hyperparameter configurations to evaluate.
* Concurrency
  Defines how many trials can run concurrently.

The following example code uses the Bayesian optimization library to maximize the accuracy of a model over five trials.

```python
from snowflake.ml.modeling import tune
tuner_config = tune.TunerConfig(
  metric="accuracy",
  mode="max",
  search_alg=search_algorithm.BayesOpt(
      utility_kwargs={"kind": "ucb", "kappa": 2.5, "xi": 0.0}
  ),
  num_trials=5,
  max_concurrent_trials=1,
)
```

### Get the hyperparameters and training metrics

The Snowflake ML HPO API requires the training metrics and hyperparameters from each training run to optimize the hyperparameters effectively. Use the `TunerContext` object to get the hyperparameters and training metrics. The following example creates a training function to get the hyperparameters and training metrics:

```python
def train_func():
  tuner_context = get_tuner_context()
  config = tuner_context.get_hyper_params()
  dm = tuner_context.get_dataset_map()
  ...
  tuner_context.report(metrics={"accuracy": accuracy}, model=model)
```

### Initiate the training job

Use the `Tuner` object to initiate the training job. The `Tuner` object takes the training function, search space, and tuner configuration as arguments. The following is an example of how to initiate the training job:

```python
from snowflake.ml.modeling import tune
tuner = tune.Tuner(train_func, search_space, tuner_config)
tuner_results = tuner.run(dataset_map=dataset_map)
```

The preceding code distributes the training function across the available resources. It collects and summarizes the trial outcomes and identifies the best performing
configuration.

### Get the training job results

> After all trials are completed, the `TunerResults` object consolidates the outcomes of each trial. It provides structured access to the performance metrics, the best configuration, and the best model.
>
> The following are its available attributes:

* results: A Pandas DataFrame containing metrics and configurations for every trial.
* best_result: A DataFrame row summarizing the trial with the best performance.
* best_model: The model instance associated with the best trial, if applicable.

The following code gets the results, the best model, and the best result:

```python
print(tuner_results.results)
print(tuner_results.best_model)
print(tuner_results.best_result)
```

## API reference

### Tuner

The following is the import statement for the Tuner module:

```python
from snowflake.ml.modeling.tune import Tuner
```

The Tuner class is the main interface for interacting with the container runtime HPO API. To run an HPO job, use the following code to initialize a Tuner object and call the run method with the Snowflake datasets.

```python
class Tuner:
  def __init__(
      self,
      train_func: Callable,
      search_space: SearchSpace,
      tuner_config: TunerConfig,
  )

  def run(
      self, dataset_map: Optional[Dict[str, DataConnector]] = None
  ) -> TunerResults
```

### SearchSpace

The following is the import statement for the search space:

```python
from snowflake.ml.modeling.tune import uniform, choice, loguniform, randint
```

The following code defines the search space functions:

```python
def uniform(lower: float, upper: float)
    """
    Sample a float value uniformly between lower and upper.

    Use for parameters where all values in range are equally likely to be optimal.
    Examples: dropout rates (0.1 to 0.5), batch normalization momentum (0.1 to 0.9).
    """

def loguniform(lower: float, upper: float) -> float:
    """
    Sample a float value uniformly in log space between lower and upper.

    Use for parameters spanning several orders of magnitude.
    Examples: learning rates (1e-5 to 1e-1), regularization strengths (1e-4 to 1e-1).
    """

def randint(lower: int, upper: int) -> int:
    """
    Sample an integer value uniformly between lower(inclusive) and upper(exclusive).

    Use for discrete parameters with a range of values.
    Examples: number of layers, number of epochs, number of estimators.
    """

def choice(options: List[Union[float, int, str]]) -> Union[float, int, str]:
    """
    Sample a value uniformly from the given options.

    Use for categorical parameters or discrete options.
    Examples: activation functions ['relu', 'tanh', 'sigmoid']
    """
```

### TunerConfig

The following is the import statement for the TunerConfig module:

```python
from snowflake.ml.modeling.tune import TunerConfig
```

Use the following code to define configuration class for the tuner:

```python
class TunerConfig:
  """
  Configuration class for the tuning process.

  Attributes:
    metric (str): The name of the metric to optimize. This should correspond
        to a key in the metrics dictionary reported by the training function.

    mode (str): The optimization mode for the metric. Must be either "min"
        for minimization or "max" for maximization.

    search_alg (SearchAlgorithm): The search algorithm to use for
        exploring the hyperparameter space. Defaults to random search.

    num_trials (int): The maximum number of parameter configurations to
        try. Defaults to 5

    max_concurrent_trials (Optional[int]): The maximum number of concurrently running trials per node. If   not specified, it defaults to the total number of nodes in the cluster. This value must be a positive
    integer if provided.

  Example:
      >>> from snowflake.ml.modeling.tune import  TunerConfig
      >>> config = TunerConfig(
      ...     metric="accuracy",
      ...     mode="max",
      ...     num_trials=5,
      ...     max_concurrent_trials=1
      ... )
  """
```

### SearchAlgorithm

The following is the import statement for the search algorithm:

```python
from snowflake.ml.modeling.tune.search import BayesOpt, RandomSearch, GridSearch
```

The following code creates a Bayesian optimization search algorithm object:

```python
@dataclass
class BayesOpt():
    """
    Bayesian Optimization class that encapsulates parameters for the acquisition function.

    This class is designed to facilitate Bayesian optimization by configuring
    the acquisition function through a dictionary of keyword arguments.

    Attributes:
        utility_kwargs (Optional[Dict[str, Any]]):
            A dictionary specifying parameters for the utility (acquisition) function.
            If not provided, it defaults to:
                {
                    'kind': 'ucb',   # Upper Confidence Bound acquisition strategy
                    'kappa': 2.576,  # Exploration parameter for UCB
                    'xi': 0.0      # Exploitation parameter
                }
    """
    utility_kwargs: Optional[Dict[str, Any]] = None
```

The following code creates a random search algorithm object:

```python
@dataclass
class RandomSearch():
    The default and most basic way to do hyperparameter search is via random search.

    Attributes:
Seed or NumPy random generator for reproducible results. If set to None (default), the global generator (np.random) is used.
    random_state: Optional[int] = None
```

### TunerResults

The following code creates a TunerResults object:

```python
@dataclass
class TunerResults:
    results: pd.DataFrame
    best_result: pd.DataFrame
    best_model: Optional[Any]
```

### get_tuner_context

The following is the import statement for the `get_tuner_context` module:

```python
from snowflake.ml.modeling.tune import get_tuner_context
```

This helper method is designed to be called within the training function. It returns a TunerContext object that encapsulates several useful fields for running the trial, including:

* Hyperparameters selected by the HPO framework for the current trial.
* The dataset required for training.
* A helper function to report metrics, guiding the HPO framework in suggesting the next set of hyperparameters

The following code creates a tuner context object:

```python
class TunerContext:
    """
    A centralized context class for managing trial configuration, reporting, and dataset information.
    """

    def get_hyper_params(self) -> Dict[str, Any]:
        """
        Retrieve the configuration dictionary.

        Returns:
            Dict[str, Any]: The configuration dictionary for the trial.
        """
        return self._hyper_params

    def report(self, metrics: Dict[str, Any], model: Optional[Any] = None) -> None:
    """
    Report metrics and optionally the model if provided.

    This method is used to report the performance metrics of a model and, if provided, the model itself.
    The reported metrics will be used to guide the next set of hyperparameters selection in the
    optimization process.

    Args:
        metrics (Dict[str, Any]): A dictionary containing the performance metrics of the model.
            The keys are metric names, and the values are the corresponding metric values.
        model (Optional[Any], optional): The trained model to be reported. Defaults to None.

    Returns:
        None: This method doesn't return anything.
    """

    def get_dataset_map(self) -> Optional[Dict[str, Type[DataConnector]]]:
        """
        Retrieve the dataset mapping.

        Returns:
            Optional[Dict[str, Type[DataConnector]]]: A mapping of dataset names to DataConnector types, if available.
        """
        return self._dataset_map
```

## Limitations

Bayesian optimization requires continuous search spaces and works only with the uniform sampling function. It is incompatible with discrete parameters.
sampled using the `tune.randint` or `tune.choice` methods. To work around this limitation, either use
`tune.uniform` and cast the parameter inside the training function, or switch to a sampling algorithm that handles
both discrete and continuous spaces, such as `tune.RandomSearch`.

## Troubleshooting

| Error message | Possible causes | Possible solutions |
| --- | --- | --- |
| Invalid search space configuration: BayesOpt requires all sampling functions to be of type ‘Uniform’. | Bayesian optimization works only with uniform sampling, not with discrete samples. (See Limitations above.) | * Use `tune.uniform` and cast the result in your training function. * Switch to `RandomSearch` algorithm, which accepts both discrete and non-discrete samples. |
| Insufficient CPU resources. Required: 16, Available: 8. The numbers of required and available resources may differ. | `max_concurrent_trials` is set to a value higher than the available cores. | Follow guidance provided by the error message. |
| Insufficient GPU resources. Required: 4, Available: 2. May refer to CPU or GPU. The numbers of required and available resources may differ. | `max_concurrent_trials` is set to a value higher than the available cores. | Follow the guidance provided by the error message. |

## Next steps

* [Container Runtime HPO Example](https://github.com/Snowflake-Labs/sf-samples/blob/main/samples/ml/container_runtime_hpo/hpo_example.ipynb)

---
title: Pre-processing and post-processing with models
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/custom-processing-with-models.md
section: Snowflake ML
---

# Pre-processing and post-processing with models

This topic explains how to create models, log them to the Snowflake Model Registry, and deploy them, using a number of
model types and scenarios as examples. These include:

* In-memory scikit-learn models and pipelines.
* Your own custom models.
* More than one model.

## In-memory scikit-learn models and pipelines

Snowflake ML allows seamless integration of in-memory `scikit-learn` models into the Model Registry by using keyword
arguments with `ModelContext` class. Below is an example of passing an in-memory `scikit-learn` model as a keyword
argument to model context and calling it in a custom model class.

```python
from sklearn import datasets, svm
import pandas as pd
from snowflake.ml.model import custom_model

# Step 1: Import the Iris dataset
iris_X, iris_y = datasets.load_iris(return_X_y=True)

# Step 2: Initialize a scikit-learn LinearSVC model and train it
svc = svm.LinearSVC()
svc.fit(iris_X, iris_y)

# Step 3: Initialize ModelContext with keyword arguments
mc = custom_model.ModelContext(
    my_model=svc,
)

# Step 4: Define a custom model class to utilize the context
class ExampleSklearnModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        # Use the model from the context for predictions
        model_output = self.context['my_model'].predict(input)
        # Return the predictions in a DataFrame
        return pd.DataFrame({'output': model_output})
```

## Using `scikit-learn` pipelines with Snowflake ML

Below is an example showing how to use scikit-learn pipelines within Snowflake ML. This involves preprocessing
steps such as scaling or imputing, followed by a predictive model, all managed within a custom model class using the
`ModelContext`.

```python
from sklearn import datasets
from sklearn.svm import SVC
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.impute import SimpleImputer
import pandas as pd
from snowflake.ml.model import custom_model

# Step 1: Load the Iris dataset
iris_X, iris_y = datasets.load_iris(return_X_y=True)

# Step 2: Create a scikit-learn pipeline
# The pipeline includes:
# - A SimpleImputer to handle missing values
# - A StandardScaler to standardize the data
# - A Support Vector Classifier (SVC) for predictions
pipeline = Pipeline([
    ('imputer', SimpleImputer(strategy='mean')),
    ('scaler', StandardScaler()),
    ('classifier', SVC(kernel='linear', probability=True))
])

# Step 3: Fit the pipeline to the dataset
pipeline.fit(iris_X, iris_y)

# Step 4: Initialize ModelContext with the pipeline
mc = custom_model.ModelContext(
    pipeline_model=pipeline,
)

# Step 5: Define a custom model class to utilize the pipeline
class ExamplePipelineModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        # Use the pipeline from the context to process input and make predictions
        predictions = self.context['pipeline_model'].predict(input)
        probabilities = self.context['pipeline_model'].predict_proba(input)

        # Return predictions and probabilities as a DataFrame
        return pd.DataFrame({
            'predictions': predictions,
            'probability_class_0': probabilities[:, 0],
            'probability_class_1': probabilities[:, 1]
        })

# Example usage:
# Convert new input data into a DataFrame
new_input = pd.DataFrame(iris_X[:5])  # Using the first 5 samples for demonstration

# Initialize the custom model and run predictions
custom_pipeline_model = ExamplePipelineModel(context=mc)
result = custom_pipeline_model.predict(new_input)

print(result)
```

## Using your own models

The following example uses your own model as a custom model.

```python
mc = custom_model.ModelContext(
    my_model=your_own_model,
)

from snowflake.ml.model import custom_model
import pandas as pd
import json

class ExampleYourOwnModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        model_output = self.context['my_model'].predict(features)
        return pd.DataFrame({'output': model_output})
```

## Using more than one model

Below is a custom model that combines multiple models and uses a configuration file to apply bias when generating
predictions.

```python
mc = custom_model.ModelContext(
    model1=model1,
    model2=model2,
    feature_preproc=preproc
    }
)
```

> **Note:**
>
> `model1` and `model2` are objects of any type of model natively supported by the registry. `feature_preproc`
> is a `scikit-learn pipeline` object.

```python
from snowflake.ml.model import custom_model
import pandas as pd
import json

class ExamplePipelineModel(custom_model.CustomModel):

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        ...
        return pd.DataFrame(...)

# Here is the fully-functional custom model that uses both model1 and model2
class ExamplePipelineModel(custom_model.CustomModel):
    def __init__(self, context: custom_model.ModelContext) -> None:
        super().__init__(context)

    @custom_model.inference_api
    def predict(self, input: pd.DataFrame) -> pd.DataFrame:
        features = self.context['feature_preproc'].transform(input)
        model_output = self.context['model1'].predict(
            self.context['model2'].predict(features)
        )
        return pd.DataFrame({'output': model_output})
```

---
title: Process data with custom logic across partitions
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/process-data-across-partitions.md
section: Snowflake ML
---

# Process data with custom logic across partitions

Use the Distributed Partition Function (DPF) to process data in parallel across one or more nodes in a compute pool.
DPF handles distributed orchestration, errors, observability, and artifact persistence automatically. You can run DPF in either a [Snowflake Notebook](../../user-guide/ui-snowsight/notebooks.md) or a [Snowflake ML Job](ml-jobs/overview.md).

DPF supports the following execution modes:

* **DataFrame mode** (`run()`): Partition a Snowpark DataFrame by column values and execute your function on each
  partition concurrently. Data is prefetched in parallel for optimal throughput.
* **Stage mode** (`run_from_stage()`): Process files from a Snowflake stage where each file becomes a partition.
  Ideal for large-scale file processing with predictable memory usage.

You can use DPF to process large datasets efficiently across different data segments.

This tool is ideal for scenarios such as the following:

* Analyzing sales data by region
* Processing customer data by geographic segments
* Training ML models on each data partition
* Performing data transformations where each data partition requires the same processing logic

DPF handles the distributed data processing automatically. You don’t need to manage the distributed computing infrastructure.

DPF lets you write custom Python code using open source libraries on containerized infrastructure with GPU access.

> **Important:**
>
> DPF automatically stores results and artifacts in Snowflake stages. Before you use DPF, make sure you have permissions to the stage where DPF stores results and artifacts.

## DataFrame mode: Process data by column partitions

Use DataFrame mode to partition a Snowpark DataFrame by a specified column and execute your Python function on each
partition in parallel. The following example demonstrates processing sales data by region.

1. Define the processing function
2. Initialize and run DPF
3. Monitor progress and wait for completion
4. Retrieve results from each partition
5. Handle errors
6. Restore results from a completed run

### Define the processing function

Import the classes required to run distributed processing:

```python
from snowflake.ml.modeling.distributors.distributed_partition_function.dpf import DPF
from snowflake.ml.modeling.distributors.distributed_partition_function.dpf_run import DPFRun
from snowflake.ml.modeling.distributors.distributed_partition_function.entities import (
    RunStatus, ExecutionOptions
)
```

Define a processing function that takes two arguments:

* `data_connector`: A [DataConnector](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector)
  that provides access to the partition’s data. Call `data_connector.to_pandas()` to load it as a pandas DataFrame,
  or use other methods like `to_torch_dataset()` or `to_ray_dataset()`.
* `context`: A PartitionContext object that provides the partition ID and methods for
  uploading and downloading artifacts.

```python
import json

def process_sales_data(data_connector, context):
    df = data_connector.to_pandas()
    print(f"Processing {len(df)} records for region: {context.partition_id}")

    # Perform region-specific analytics
    summary = {
        'region': context.partition_id,
        'total_sales': df['amount'].sum(),
        'avg_order_value': df['amount'].mean(),
        'customer_count': df['customer_id'].nunique(),
        'record_count': len(df)
    }

    # Store results in stage for subsequent access

    context.upload_to_stage(summary, "sales_summary.json",
        write_function=lambda obj, path: json.dump(obj, open(path, 'w')))
```

For each region, this function computes summary statistics and saves the results as a JSON file to the partition’s
dedicated stage directory.

### Initialize and run DPF

Create a `DPF` instance with your processing function and an output stage name, then call `run()` to start
distributed processing.

> **Important:**
>
> The Snowpark DataFrame that you provide must be created from a table. For information about creating a DataFrame from
> a table, see the [Constructing a DataFrame](../snowpark/python/working-with-dataframes.md).

```python
dpf = DPF(process_sales_data, "analytics_stage")
run = dpf.run(
    partition_by="region",
    snowpark_dataframe=sales_data,
    run_id="regional_analytics_2024"
)
```

The `run()` method accepts the following parameters:

* `partition_by` (str): Column name to partition the DataFrame by. Each unique value creates a separate partition.
* `snowpark_dataframe`: The Snowpark DataFrame to partition and process.
* `run_id` (str): Unique identifier for this run. Creates a dedicated directory `@{stage_name}/{run_id}/` for all
  artifacts. Use descriptive names like `experiment_2024_01_15` or `model_v1_retrain`.
* `on_existing_artifacts` (str, optional): Action when artifacts for the `run_id` already exist.
  `"error"` (default) raises an error; `"overwrite"` replaces existing artifacts.
* `execution_options` (ExecutionOptions, optional): Configuration for resource
  allocation and execution behavior.

### Monitor progress and wait for completion

Call `wait()` to block until the run completes. By default, it displays a progress bar.

```python
final_status = run.wait()  # Shows progress bar by default
print(f"Job completed with status: {final_status}")
```

The following is an example of the output:

```output
Progress: 100%|██████████| 4/4 [02:15<00:00, 33.75s/it]
Job completed with status: RunStatus.SUCCESS
```

You can also check the status and progress at any time without blocking:

```python
# Check overall status
current_status = run.status

# Get progress grouped by partition status
progress = run.get_progress()
```

### Retrieve results from each partition

After the run completes successfully, retrieve results from each partition using the `partition_details` property.
Each partition’s details include a `stage_artifacts_manager` for accessing saved artifacts.

```python
if final_status == RunStatus.SUCCESS:
    import json
    all_results = []
    for partition_id, details in run.partition_details.items():
        # List available artifacts for this partition
        files = details.stage_artifacts_manager.list()
        print(f"Partition {partition_id} artifacts: {files}")

        # Load an artifact using a custom deserializer
        summary = details.stage_artifacts_manager.get("sales_summary.json",
            read_function=lambda path: json.load(open(path, 'r')))
        all_results.append(summary)

    # Combine results across all regions
    total_sales = sum(r['total_sales'] for r in all_results)
    total_customers = sum(r['customer_count'] for r in all_results)
```

The `stage_artifacts_manager` provides three methods:

* `list()`: Returns a list of filenames saved in the partition’s stage directory.
* `get(filename, read_function=None)`: Downloads and deserializes an artifact. Uses `pickle` by default, or
  a custom `read_function` if provided.
* `download(filename, local_dir)`: Downloads a raw file to a local directory and returns the local file path.

### Handle errors

If the run does not succeed, inspect individual partition details to diagnose failures:

```python
if final_status != RunStatus.SUCCESS:
    for partition_id, details in run.partition_details.items():
        if details.status != PartitionStatus.DONE:
            print(f"Partition {partition_id} status: {details.status}")
            try:
                error_logs = details.logs
                print(error_logs)
            except RuntimeError:
                print("Logs not available for this partition")
```

The overall `RunStatus` reflects the aggregate outcome:

* `SUCCESS`: All partitions completed successfully.
* `PARTIAL_FAILURE`: Some partitions succeeded, but at least one failed.
* `FAILURE`: No partitions completed successfully.
* `CANCELLED`: The run was cancelled.
* `IN_PROGRESS`: The run is still executing.

Each partition has a `PartitionStatus`:

* `PENDING`: Waiting to be processed.
* `RUNNING`: Currently being processed.
* `DONE`: Completed successfully.
* `FAILED`: The user function raised an exception.
* `CANCELLED`: Cancelled by the user.
* `INTERNAL_ERROR`: An internal system error occurred (for example, out-of-memory).

To import these enums:

```python
from snowflake.ml.modeling.distributors.distributed_partition_function.entities import (
    RunStatus, PartitionStatus
)
```

To cancel a running job:

```python
run.cancel()
```

> **Note:**
>
> Partitions that have already completed are not affected by cancellation. Partial results, logs, and artifacts
> from completed partitions remain available.

### Restore results from a completed run

You can restore a completed run from its persisted state and access the same results without re-running the process:

```python
from snowflake.ml.modeling.distributors.distributed_partition_function.dpf_run import DPFRun

restored_run = DPFRun.restore_from(run_id="regional_analytics_2024", stage_name="analytics_stage")

# Access results just like the original run
for partition_id, details in restored_run.partition_details.items():
    print(f"{partition_id}: {details.status}")
```

> **Note:**
>
> Restored runs are read-only. You cannot call `wait()` or `cancel()` on a restored run.

## Stage mode: Process files from a stage

Use stage mode to process files from a Snowflake stage where each file becomes a partition. This is ideal for
large-scale file processing, such as processing a collection of Parquet files that have been staged.

### Define a processing function

The processing function signature is the same as DataFrame mode. The `data_connector` provides access to the file’s data,
and `context.partition_id` is the relative file path within the stage.

```python
def process_file(data_connector, context):
    df = data_connector.to_pandas()
    print(f"Processing file {context.partition_id}: {len(df)} rows")

    # Process data and save results
    result = {"row_count": len(df), "columns": list(df.columns)}
    import json
    context.upload_to_stage(result, "result.json",
        write_function=lambda obj, path: json.dump(obj, open(path, 'w')))
```

### Run DPF from stage

Call `run_from_stage()` instead of `run()`. Specify the input `stage_location` containing the source files
and optionally a `file_pattern` to filter which files to process.

```python
dpf = DPF(process_file, "output_stage")
run = dpf.run_from_stage(
    stage_location="@my_db.my_schema.input_stage/data/",
    run_id="file_processing_2024",
    file_pattern="*.parquet",
)
final_status = run.wait()
```

The `run_from_stage()` method accepts the following parameters:

* `stage_location` (str): Input stage path containing the source data files. Each file matching the `file_pattern`
  becomes a partition. Supports both simple and fully qualified stage names:

  + Simple: `"@my_stage/data/"`
  + Fully qualified: `"@my_db.my_schema.my_stage/data/"`
* `run_id` (str): Unique identifier for this run.
* `file_pattern` (str, optional): Glob pattern to filter files. Defaults to `"*.parquet"`.
* `on_existing_artifacts` (str, optional): `"error"` (default) or `"overwrite"`.
* `execution_options` (ExecutionOptions, optional): Configuration for resource
  allocation and execution behavior.

> **Note:**
>
> The `stage_location` is the *input* data source. The `stage_name` provided to `DPF()` is the *output*
> location for artifacts like logs and results. These can be different stages.

Monitoring, result retrieval, error handling, and run restoration work the same way as
DataFrame mode.

For I/O-bound file processing, set `num_cpus_per_worker=1` in `ExecutionOptions` to maximize parallelism
(one actor per CPU). For CPU-bound workloads, use the default or increase `num_cpus_per_worker`.

```python
from snowflake.ml.modeling.distributors.distributed_partition_function.entities import ExecutionOptions

run = dpf.run_from_stage(
    stage_location="@my_stage/data/",
    run_id="io_bound_processing",
    execution_options=ExecutionOptions(num_cpus_per_worker=1),
)
```

## Configure execution options

Use `ExecutionOptions` to control resource allocation and execution behavior, such as CPU/GPU allocation per worker,
retry count, and fail-fast behavior. All fields are optional with sensible defaults.

```python
from snowflake.ml.modeling.distributors.distributed_partition_function.entities import ExecutionOptions

options = ExecutionOptions(
    num_cpus_per_worker=4,
    num_gpus_per_worker=1,
    fail_fast=True,
)

run = dpf.run(
    partition_by="region",
    snowpark_dataframe=sales_data,
    run_id="my_run",
    execution_options=options,
)
```

For the full list of parameters and worker scaling behavior, see the
[ExecutionOptions API reference](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/container-runtime/distributors.distributed_partition_function).

## Work with artifacts using PartitionContext

The `PartitionContext` object is passed as the second argument to your processing function. It provides methods for
interacting with artifacts and Snowflake sessions during partition execution.

### Upload artifacts

Use `upload_to_stage()` to save results from within your processing function. By default, objects are serialized
using pickle. Provide a `write_function` for custom serialization.

```python
def my_function(data_connector, context):
    df = data_connector.to_pandas()

    # Save a pickle object (default serialization)
    results = {'total': df['amount'].sum(), 'count': len(df)}
    context.upload_to_stage(results, "summary.pkl")

    # Save JSON data with a custom write function
    import json
    context.upload_to_stage(
        results, "summary.json",
        write_function=lambda obj, path: json.dump(obj, open(path, 'w'))
    )

    # Save a CSV file
    df_processed = df.groupby('category').sum()
    context.upload_to_stage(
        df_processed, "aggregated.csv",
        write_function=lambda df, path: df.to_csv(path, index=False)
    )
```

### Download artifacts

Use `download_from_stage()` to load artifacts within your processing function. You can use this function to
access artifacts from a prior run. For example, you can use it to load a model for inference.

```python
def my_inference_function(data_connector, context):
    # Load a pickle object from the current partition's stage path
    model = context.download_from_stage("model.pkl")

    # Load from a different stage path (e.g., from a prior training run)
    model = context.download_from_stage(
        "model.pkl",
        stage_path="@db.schema.stage/training_run/partition_0"
    )

    # Load JSON with a custom deserializer
    import json
    config = context.download_from_stage(
        "config.json",
        read_function=lambda path: json.load(open(path, 'r'))
    )
```

### Use Snowflake sessions

Use `with_session()` to execute operations that require a Snowflake session, such as writing results to a table.
This method uses a bounded session pool to avoid hitting Snowflake’s session limits when running many partitions
concurrently.

```python
def my_function(data_connector, context):
    df = data_connector.to_pandas()

    # Load a model from a prior training run
    model = context.download_from_stage("model.pkl")

    predictions = model.predict(df[['X1', 'X2']])

    results = df.copy()
    results['predictions'] = predictions
    results['partition_id'] = context.partition_id

    # Write results to a Snowflake table using the bounded session pool
    context.with_session(lambda session:
        session.create_dataframe(results)
            .write.mode("append")
            .save_as_table("predictions_table")
    )
```

> **Note:**
>
> The function passed to `with_session()` is serialized using cloudpickle. Avoid capturing large objects or
> non-serializable resources in the closure.

## Scale across multiple nodes

To run DPF across multiple nodes, scale your cluster before starting the run:

```python
from snowflake.ml.runtime_cluster import scale_cluster

# Scale to 3 nodes for increased parallelism
scale_cluster(3)

dpf = DPF(process_sales_data, "analytics_stage")
run = dpf.run(
    partition_by="region",
    snowpark_dataframe=sales_data,
    run_id="multi_node_run",
    execution_options=ExecutionOptions(use_head_node=True),
)
final_status = run.wait()
```

When running on multiple nodes, set `use_head_node=False` if you want the head node to act solely as a coordinator
without executing user functions. This can improve reliability for long-running workloads because an out-of-memory
error on a worker node does not affect the coordinator.

## Limitations and constraints

* **One concurrent run**: Only one DPF run can execute at a time. Starting a new run while another is in progress
  raises an error. Cancel the previous run with `run.cancel()` before starting a new one.
* **DataFrame requirements**: In DataFrame mode, the Snowpark DataFrame must contain exactly one query and no post-actions.
* **Single-node restriction**: `use_head_node=False` is not supported on single-node clusters.
* **Artifact path structure**: Artifacts are stored at `@{stage_name}/{run_id}/{partition_id}/`. This path
  structure is fixed and cannot be customized.
* **Restored runs are read-only**: Runs restored with `DPFRun.restore_from()` cannot call `wait()` or
  `cancel()`.

## Next steps

* Explore [Train models across data partitions](train-models-across-partitions.md) to learn about training multiple ML models using DPF as the underlying
  infrastructure.
* For the complete API documentation, see the
  [Distributed Partition Function (DPF) API reference](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/container-runtime/distributors.distributed_partition_function).
* For end-to-end examples, see the
  [Snowflake ML sample notebooks](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/ml).

---
title: Prophet
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/prophet.md
section: Snowflake ML
---

# Prophet

The Snowflake ML Model Registry supports time series forecasting models created using Prophet (`prophet.Prophet`).

> **Note:**
>
> Prophet models can currently only be deployed in the Snowflake warehouse for inference. Model serving in
> Snowpark Container Services (SPCS) is not supported for Prophet models at this time.

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. The default target method is `predict`. |
| `date_column` | The name of the column containing datetime values in your input data. If specified, this column will be automatically mapped to Prophet’s required `ds` column. If not specified, your data must contain a column named `ds`. |
| `target_column` | The name of the column containing target values in your input data. If specified, this column will be automatically mapped to Prophet’s required `y` column. If not specified, your data must contain a column named `y`. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a Prophet model so
that the registry knows the signatures of the target methods.

## Data Format Requirements

Prophet models require input data in a specific format:

* A datetime column (named `ds` by default, or use `date_column` option to map a custom name)
* A target value column (named `y` by default, or use `target_column` option to map a custom name)
* Optional additional regressor columns (if the model was trained with regressors)

For forecasting future periods, provide a DataFrame with future dates in the `ds` column and `NaN` values
in the `y` column.

## Example

In the following examples, `reg` is an instance of `snowflake.ml.registry.Registry`. For information on
creating a registry object, see [Snowflake Model Registry](../overview.md).

### Basic Prophet Model

```python
import prophet
import pandas as pd
import numpy as np

# Create sample time series data
dates = pd.date_range(start="2020-01-01", periods=365, freq="D")
values = np.linspace(100, 200, 365) + 10 * np.sin(2 * np.pi * np.arange(365) / 365.25)

training_data = pd.DataFrame({
    "ds": dates,
    "y": values
})

# Train Prophet model
model = prophet.Prophet(
    daily_seasonality=True,
    weekly_seasonality=True,
    yearly_seasonality=True,
)
model.fit(training_data)

# Create future data for forecasting
last_date = training_data["ds"].max()
future_dates = pd.date_range(start=last_date + pd.Timedelta(days=1), periods=30, freq="D")
future_data = pd.DataFrame({
    "ds": future_dates,
    "y": [float("nan")] * 30  # NaN indicates periods to forecast
})

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_prophet_model",
    version_name="v1",
    sample_input_data=training_data[:10],
)

# Make predictions
result_df = model_ref.run(future_data, function_name="predict")
```

### Prophet Model with Custom Column Names

```python
import prophet
import pandas as pd
import numpy as np

# Create sample time series data with custom column names
dates = pd.date_range(start="2020-01-01", periods=365, freq="D")
values = np.linspace(100, 200, 365) + 10 * np.sin(2 * np.pi * np.arange(365) / 365.25)

training_data = pd.DataFrame({
    "date": dates,        # Custom date column name
    "sales": values       # Custom target column name
})

# Rename columns to Prophet format for training
prophet_training_data = training_data.rename(columns={"date": "ds", "sales": "y"})

# Train Prophet model
model = prophet.Prophet()
model.fit(prophet_training_data)

# Create future data with custom column names
last_date = training_data["date"].max()
future_dates = pd.date_range(start=last_date + pd.Timedelta(days=1), periods=30, freq="D")
future_data = pd.DataFrame({
    "date": future_dates,
    "sales": [float("nan")] * 30
})

# Log the model with column mapping options
model_ref = reg.log_model(
    model=model,
    model_name="my_prophet_model_custom_cols",
    version_name="v1",
    sample_input_data=training_data[:10],
    options={
        "date_column": "date",
        "target_column": "sales",
    },
)

# Make predictions using custom column names
result_df = model_ref.run(future_data, function_name="predict")
```

### Prophet Model with Regressors

```python
import prophet
import pandas as pd
import numpy as np

# Create sample time series data with additional regressors
dates = pd.date_range(start="2020-01-01", periods=365, freq="D")
values = np.linspace(100, 200, 365) + 10 * np.sin(2 * np.pi * np.arange(365) / 365.25)

training_data = pd.DataFrame({
    "ds": dates,
    "y": values,
    "holiday": (dates.dayofweek >= 5).astype(int),  # Weekend indicator
    "temperature": 20 + 5 * np.sin(2 * np.pi * np.arange(365) / 365.25)
})

# Train Prophet model with regressors
model = prophet.Prophet()
model.add_regressor("holiday")
model.add_regressor("temperature")
model.fit(training_data)

# Create future data with regressor values
last_date = training_data["ds"].max()
future_dates = pd.date_range(start=last_date + pd.Timedelta(days=1), periods=30, freq="D")
future_data = pd.DataFrame({
    "ds": future_dates,
    "y": [float("nan")] * 30,
    "holiday": (future_dates.dayofweek >= 5).astype(int),
    "temperature": [22.0] * 30  # Predicted future temperatures
})

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_prophet_model_regressors",
    version_name="v1",
    sample_input_data=training_data[:10],
)

# Make predictions
result_df = model_ref.run(future_data, function_name="predict")
```

## Prediction Output

The `predict` method returns a DataFrame with the following columns:

* `ds`: The datetime for each prediction
* `yhat`: The predicted value
* `yhat_lower`: Lower bound of the prediction interval
* `yhat_upper`: Upper bound of the prediction interval
* Additional columns for trend and seasonality components (e.g., `trend`, `weekly`, `yearly`)

---
title: Python APIs for Snowflake ML
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/snowpark-ml.md
section: Snowflake ML
---

# Python APIs for Snowflake ML

The `snowflake-ml-python` Python package provides Python APIs that connect to the various Snowflake ML workflow
components and also includes APIs for building and training your own models. You can use these APIs in your favorite
Python IDE on your own workstation, in Snowsight worksheets, or in Snowflake notebooks.

> **Tip:**
>
> See [Introduction to Machine Learning with Snowpark ML](https://quickstarts.snowflake.com/guide/intro_to_machine_learning_with_snowpark_ml_for_python/#0)
> for an example of an end-to-end workflow using this library.

## Using Snowflake ML in Snowflake Notebooks

[Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md) provide an easy-to-use notebook interface for your data
work, blending Python, SQL, and Markdown. To use Snowflake ML features in notebooks, choose the Anaconda package
`snowflake-ml-python` using the Packages menu at the top of the notebook.

Notebooks support both CPU and GPU runtime options. Many kinds of models require, or benefit from, having a GPU available.

> **Important:**
>
> The `snowflake-ml-python` package and its dependencies must be allowed by your organization’s
> [package policy](../udf/python/packages-policy.md).

## Using Snowflake ML in Snowsight Worksheets

[Snowsight Worksheets](../../user-guide/ui-snowsight-worksheets.md) provide a powerful and versatile method for running
Python code. To use Snowflake ML features in worksheets, choose the Anaconda package `snowflake-ml-python` using the Packages menu
at the top of the worksheet.

> **Important:**
>
> The `snowflake-ml-python` package and its dependencies must be allowed by your organization’s
> [package policy](../udf/python/packages-policy.md).

## Using Snowflake ML Locally

You must install the `snowflake-ml-python` package to develop on your own workstation or elsewhere outside Snowflake.
All Snowflake ML features are available in a single package, `snowflake-ml-python`. You can install the package
from the Snowflake conda channel using the `conda` command or from the Python Package Index (PyPI) using
`pip`. Conda is preferred.

* Installing from the Snowflake conda Channel
* Installing from PyPI

### Installing from the Snowflake conda Channel

1. Create the conda environment where you will install Snowflake ML. If you prefer to use an existing environment, skip this step.

   ```console
   conda create --name snowflake-ml
   ```
2. Activate the conda environment:

   ```console
   conda activate snowflake-ml
   ```
3. Install `snowflake-ml-python` from the Snowflake conda channel

   ```console
   conda install --override-channels --channel https://repo.anaconda.com/pkgs/snowflake/ snowflake-ml-python
   ```

> **Tip:**
>
> Install packages from the Snowflake conda channel whenever possible to ensure that you receive packages that have
> been validated with Snowflake ML.

### Installing from PyPI

You can install `snowflake-ml-python` from the Python Package Index (PyPI) by using the standard Python package manager,
`pip`.

> **Warning:**
>
> Do not use this installation procedure if you are using a conda environment. Use the
> conda instructions instead.

1. Create and activate your Python virtual environment:

   > ```console
   > python3 -m virtualenv venv
   > source venv/bin/activate
   > ```
2. Install the `snowflake-ml-python` package:

   > ```console
   > python -m pip install snowflake-ml-python
   > ```

### Installing Optional Dependencies

Some APIs require dependencies that are not installed as dependencies of `snowflake-ml-python`.
By default, scikit-learn is installed. Other packages such as lightgbm, xgboost, keras, pytorch, and others are
optional dependencies.

If you plan to use the `snowflake.ml.modeling.lightgbm` module, install lightgbm. Use the following
commands to activate your conda environment and install lightgbm from the Snowflake conda channel.

```console
conda activate snowflake-ml
conda install --override-channels --channel https://repo.anaconda.com/pkgs/snowflake/ lightgbm
```

Use the following commands to activate your virtual environment and install lightgbm using `pip`.

```console
source venv/bin/activate
python -m pip install 'snowflake-ml-python[lightgbm]'
```

### Setting Up Snowpark Python

Snowpark Python is a dependency of `snowflake-ml-python` and is installed automatically with it. If Snowpark
Python is not already set up on your system, you might need to perform additional configuration steps. See
[Setting up your development environment for Snowpark Python](../snowpark/python/setup.md) for Snowpark Python setup instructions.

## Connecting to Snowflake

Before using Snowflake ML features in Python, connect to Snowflake using a Snowpark `Session` object. Use the
`SnowflakeLoginOptions` function in the `snowflake.ml.utils.connection_params` module to get the
configuration settings to create the session. The function can read the connection settings from a named connection in
your [SnowSQL configuration file](../../user-guide/snowsql-config.md) or from environment variables that you set. It
returns a dictionary containing these parameters, which can be used to create a connection.

The following examples read the connection parameters from the named connection `myaccount` in the SnowSQL
configuration file. To create a Snowpark Python session, create a builder for the `Session` class, and pass the
connection information to the builder’s `configs` method:

```python
from snowflake.snowpark import Session
from snowflake.ml.utils import connection_params

params = connection_params.SnowflakeLoginOptions("myaccount")
sp_session = Session.builder.configs(params).create()
```

You can now pass the session to any that needs it.

> **Tip:**
>
> To create a Snowpark Python session from a Snowflake Connector for Python connection, pass the connection object to
> the session builder. Here, `connection` is the Snowflake Connector for Python connection.
>
> ```python
> session = Session.builder.configs({"connection": connection}).create()
> ```

### Specifying a Warehouse

Many Snowflake ML features, for example model training or inference, run code in a Snowflake warehouse. These operations
run in the warehouse specified by the session you use to connect. For example, if you create a session from a named
connection in your [SnowSQL configuration file](../../user-guide/snowsql-config.md), you can specify a warehouse using the
`warehousename` parameter in the named configuration.

You can add the warehouse setting when creating the `Session` object, as shown here, if it does not already
exist in the configuration.

```python
from snowflake.snowpark import Session
from snowflake.ml.utils import connection_params

# Get named connection from SnowSQL configuration file
params = connection_params.SnowflakeLoginOptions(connection_name="my_connection")

# Add warehouse name for model method calls if it's not already present
if "warehouse" not in params:
    params["warehouse"] = "mlwarehouse"
sp_session = Session.builder.configs(params).create()
```

If no warehouse is specified in the session, or if you want to use a different warehouse, call the session’s
`use_warehouse` method to specify a warehouse.

```python
sp_session.use_warehouse("mlwarehouse")
```

## API Reference

The [Snowflake ML API reference](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/index) includes documentation on
all publicly-released functionality. You can also obtain detailed API documentation for any API by using Python’s
`help` function in an interactive Python session. For example:

```python
from snowflake.ml.modeling.preprocessing import OneHotEncoder

help(OneHotEncoder)
```

---
title: PyTorch
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/pytorch.md
section: Snowflake ML
---

# PyTorch

The Snowflake ML Model Registry supports models created using PyTorch (models derived from `torch.nn.Module`).

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. PyTorch models have the following target method by default: `forward`. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |
| `multiple_inputs` | Whether the model expects multiple tensor inputs. Defaults to `False`. When `True`, the model will accept a list of tensors as input instead of a single tensor. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a PyTorch model so
that the registry knows the signatures of the target methods.

> **Note:**
>
> When using pandas DataFrames (which use float64 by default), ensure your PyTorch model layers are created
> with `dtype=torch.float64` to avoid dtype mismatch errors.

## Example

This example assumes `reg` is an instance of `snowflake.ml.registry.Registry`.

```python
import torch
import torch.nn as nn
from sklearn import datasets, model_selection

# Define a simple neural network for classification
class IrisClassifier(nn.Module):
    def __init__(self, input_dim: int, hidden_dim: int, output_dim: int):
        super().__init__()
        # Use float64 to match pandas DataFrame default dtype
        self.model = nn.Sequential(
            nn.Linear(input_dim, hidden_dim, dtype=torch.float64),
            nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim, dtype=torch.float64),
            nn.ReLU(),
            nn.Linear(hidden_dim, output_dim, dtype=torch.float64),
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.model(x)

# Load dataset
iris = datasets.load_iris(as_frame=True)
X = iris.data
y = iris.target

# Rename columns for valid Snowflake identifiers
X.columns = [col.replace(' ', '_').replace('(', '').replace(')', '') for col in X.columns]

X_train, X_test, y_train, y_test = model_selection.train_test_split(X, y, test_size=0.2)

# Create model
model = IrisClassifier(input_dim=4, hidden_dim=32, output_dim=3)

# Train the model
optimizer = torch.optim.Adam(model.parameters(), lr=0.01)
criterion = nn.CrossEntropyLoss()

X_train_tensor = torch.tensor(X_train.values)
y_train_tensor = torch.tensor(y_train.values, dtype=torch.long)

model.train()
for epoch in range(100):
    optimizer.zero_grad()
    outputs = model(X_train_tensor)
    loss = criterion(outputs, y_train_tensor)
    loss.backward()
    optimizer.step()

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_iris_classifier",
    version_name="v1",
    sample_input_data=X_test,
)

# Make predictions
result_df = model_ref.run(X_test[-10:])
```

---
title: Quickstarts
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/quickstart.md
section: Snowflake ML
---

# Quickstarts

Use the following quickstarts to help you get up to speed with Snowflake ML.

End to end examples

| Quickstart | Level | Description |
| --- | --- | --- |
| [Build an End-to-End ML Model in Snowflake](https://quickstarts.snowflake.com/guide/end-to-end-ml-workflow/index.html?index=..%2F..index#1) | Beginner | Build, deploy and manage an XGBoost model in production, including full intro of Snowflake’s MLOps capabilities |
| [Orchestrate ML Pipelines with ML Jobs and Task Graphs](https://www.snowflake.com/en/developers/guides/e2e-task-graph/?index=..%2F..index#1) | Intermediate | Create a complete ML pipeline with data preparation, distributed model training, evaluation, conditional promotion, and automated cleanup orchestrated through Task Graphs |
| [Scale Embeddings with Snowflake Notebooks on Container Runtime](https://www.snowflake.com/en/developers/solutions-center/scale-embeddings-gpus-snowflake-notebooks-container-runtime/) | Intermediate | Experiment with an open source embedding model and serve for large batch inference |
| [Defect Detection Using Distributed PyTorch with Snowflake Notebooks](https://www.snowflake.com/en/developers/solutions-center/computer-vision-defect-detection-distributed-pytorch-snowflake-notebooks/) | Intermediate | Detect defects with PyTorch-based computer vision models using GPUs |
| [Getting Started with Distributed PyTorch with Snowflake Notebooks](https://www.snowflake.com/en/developers/solutions-center/running-distributed-pytorch-models-on-snowflake-an-end-to-end-ml-solution/) | Intermediate | Build and deploy a recommendation model with PyTorch using GPUs |
| [Building ML Models to Crack the Code of Customer Conversions](https://quickstarts.snowflake.com/guide/build-ml-models-for-customer-conversions/) | Intermediate | Build a complete ML pipeline that classifies text data, performs sentiment analysis with gen AI, and predicts customer purchases using XGBoost |

Model development examples

| Quickstart | Level | Description |
| --- | --- | --- |
| [Getting Started with Snowflake Notebooks on Container Runtime](https://www.snowflake.com/en/developers/solutions-center/getting-started-with-snowflake-notebook-container-runtime/) | Beginner | Introductory quickstart covering the basics of using Snowflake Notebooks on Container Runtime |
| [Getting Started with ML Development in Snowflake](https://quickstarts.snowflake.com/guide/intro_to_machine_learning_with_snowpark_ml_for_python/) | Beginner | Develop a model in Snowflake Notebooks, including preprocessing, feature engineering and model training |
| [Train an XGBoost Model with GPUs using Snowflake Notebooks](https://www.snowflake.com/en/developers/solutions-center/harness-gpus-in-snowflake-notebooks-to-train-an-xgboost-model/) | Beginner | Train an XGBoost model on GPUs in Snowflake Notebooks |
| [Distributed Multi-Node and Multi GPU Audio Transcription with Snowflake ML](https://quickstarts.snowflake.com/guide/getting_started_with_distributed_multi_node_multi_gpu_audio_transcription_with_snowflake_ml_container_runtime/index.html?index=..%2F..index#5) | Intermediate | Perform multi-node, multi-GPU audio transcriptions using Container Runtime with OpenAI’s Whisper’s large-v3 on HuggingFace |

MLOps examples

| Quickstart | Level | Description |
| --- | --- | --- |
| [Introduction to Snowflake Feature Store with Snowflake Notebooks](https://quickstarts.snowflake.com/guide/intro-to-feature-store/) | Beginner | Introductory quickstart covering the basics of using Snowflake Feature Store |
| [Getting Started with Snowflake Feature Store API](https://quickstarts.snowflake.com/guide/overview-of-feature-store-api/) | Beginner | Introductory quickstart covering the basics of using APIs in Snowflake Feature Store |
| [Getting Started with ML Observability in Snowflake](https://www.snowflake.com/en/developers/solutions-center/monitoring-customer-churn-with-ml-observability-in-snowflake/) | Beginner | Introductory quickstart covering the basics of using ML Observability in Snowflake |
| [Develop and Manage ML Models with Feature Store and Model Registry](https://www.snowflake.com/en/developers/solutions-center/develop-and-manage-ml-models-with-feature-store-and-model-registry/) | Intermediate | Demonstrates an ML experiment cycle including feature creation, training data generation, model training and inference |

---
title: Replicating and sharing features
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/replication-sharing.md
section: Snowflake ML
---

# Replicating and sharing features

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

Because feature store objects are implemented as Snowflake objects, they support replication and sharing.

## Replicating a feature store

To replicate a feature store, replicate the database that contains its schema. Note that replicating the database
replicates all schemas in the database, not just feature stores. For more information on database replication, see
[Introduction to replication and failover across multiple accounts](../../../user-guide/account-replication-intro.md).

## Sharing a feature store

To share features across accounts, share the entire feature store by sharing the underlying schema. Because this shares
all feature views in the feature store, you might want to organize feature views into feature stores based on who they
will be shared with. For more information on sharing, see [About Secure Data Sharing](../../../user-guide/data-sharing-intro.md).

## Sharing feature views

It is also possible to share individual [feature views](feature-views.md). Doing so requires additional steps because
you must also share the associated tags, which the feature store uses internally. The steps below share a single
feature view.

1. Set the variables in the initial block, below, as follows:

   > * FS_SHARE: The name of the share with which the feature view will be shared.
   > * FS_DATABASE: The name of the database that contains the feature store.
   > * FS_SCHEMA: The name of the schema that contains the feature view.
   > * FV_NAME: The name and version of the feature view separated by `$`. For example, if the feature view’s name is
   >   `myfv` and its version is `v1`, this value is `myfv$v1`.
   > * ENTITY_NAME: The entity to which the feature view belongs.
   >
   > ```sqlexample
   > SET FS_SHARE = '<fs_share_name>';
   > SET FS_DATABASE = '<fs_database_name>';
   > SET FS_SCHEMA = '<fs_schema_name>';
   > SET FV_NAME = '<feature_view_name_with_version>';
   > SET ENTITY_NAME = '<entity_name>';
   > ```
2. Execute the following statements, which set some intermediate variables, then grant most of the necessary privileges.

   > ```sqlexample
   > SET SCHEMA_FQN = CONCAT($FS_DATABASE, '.', $FS_SCHEMA);
   > SET TAG_OBJECT_FQN = CONCAT($SCHEMA_FQN, '.', 'SNOWML_FEATURE_STORE_OBJECT');
   > SET TAG_METADATA_FQN = CONCAT($SCHEMA_FQN, '.', 'SNOWML_FEATURE_VIEW_METADATA');
   > SET FULL_ENTITY_NAME = CONCAT('SNOWML_FEATURE_STORE_ENTITY_', $ENTITY_NAME);
   > SET ENTITY_FQN = CONCAT($SCHEMA_FQN, '.', $FULL_ENTITY_NAME);
   > SET FV_FQN = CONCAT($SCHEMA_FQN, '.', $FV_NAME);
   >
   > -- Grant privileges to target share
   > GRANT USAGE ON DATABASE IDENTIFIER($FS_DATABASE) TO SHARE IDENTIFIER($FS_SHARE);
   > GRANT REFERENCE_USAGE ON DATABASE IDENTIFIER($FS_DATABASE) to SHARE IDENTIFIER($FS_SHARE);
   > GRANT USAGE ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   > GRANT READ ON TAG IDENTIFIER($TAG_OBJECT_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   > GRANT READ ON TAG IDENTIFIER($TAG_METADATA_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   > GRANT READ ON TAG IDENTIFIER($ENTITY_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   > ```
3. Finally, execute one of the two statements below depending on the type of feature view you are sharing.

   > * For a [Snowflake-managed feature view](feature-views.md):
   >
   >   > ```sqlexample
   >   > GRANT SELECT ON DYNAMIC TABLE IDENTIFIER($FV_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   >   > ```
   > * For an [external feature view](feature-views.md):
   >
   >   > ```sqlexample
   >   > GRANT SELECT ON VIEW IDENTIFIER($FV_FQN) TO SHARE IDENTIFIER($FS_SHARE);
   >   > ```

---
title: Run an experiment to compare and select models
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/experiments.md
section: Snowflake ML
---

# Run an experiment to compare and select models

With Snowflake ML Experiments, you can set up *experiments*, organized evaluations of the results of model training. This allows you to quickly compare the results of hyperparameter adjustment, different target metrics, and behavior of different model types in an organized fashion in order to select the best model for your needs. Each experiment consists of a series of *runs*, which are metadata and artifacts from your training. Snowflake is unopinionated about your run artifacts – you can submit anything that’s useful for your model evaluation process.

After you complete an experiment, the results are visible through Snowsight. You can also retrieve run artifacts at any time in Python or SQL.

> **Note:**
>
> Snowflake Experiments require `snowflake-ml-python` version 1.19.0 or later.

## Access control requirements

Creating an experiment requires the CREATE EXPERIMENT privilege on the schema where run artifacts are stored. Creating an experiment requires the USAGE privilege on the parent database and schema.

## Create an experiment

First, create an experiment. This requires an existing database and schema, used to store run information.

PythonSnowsight

Experiment support is available in the `snowflake.ml.experiment.ExperimentTracking` class. Use the `set_experiment(name: Optional[str])` method to both open an experiment with the given name and set it to the active experiment context for logs and artifacts. Experiments which don’t exist yet are created.

The following example shows how to create or open an experiment named `My_Experiment` in the active database and schema and set it as the active experiment, using an existing `session`:

```python
from snowflake.ml.experiment import ExperimentTracking

exp = ExperimentTracking(session=session)
exp.set_experiment("My_Experiment")
```

> **Important:**
>
> The `ExperimentTracking` class is a singleton, and only one run can be active at any time. Experiments and runs are not thread-safe and should only be modified from the thread where `ExperimentTracking` instance was configured.

1. In the navigation menu, select AI & ML » Experiments.
2. Select New Experiment.
3. Enter the Name of your experiment.
4. Select the database and schema to store your experiment’s run artifacts in.
5. Select Create to create the experiment, or Cancel to cancel.

## Start an experiment run

Each run in an experiment has its own set of metrics, parameters, and artifacts. This information is used in Snowsight to provide visualizations and data about your model training and its results.

Start a run with the `start_run(name: Optional[str])` method on your `ExperimentTracking` instance. This returns a new `Run`, which supports use in a `with` statement. Snowflake recommends that you use `with` statements, so that runs are cleanly completed and it’s easier to reason about run scope.

```python
with exp.start_run("my_run"):
  # .. Train your model and log artifacts
```

### Automatically log training information

You can autolog training information for XGBoost, LightGBM, or Keras models during model training. Autologging is performed by registering a callback which refers to your experiment and information about the model you’re training. Each time a method is called on your `Model` instance which adjusts a parameter or metric, it’s automatically logged to your experiment for the active run.

The following example shows how to configure your experiment’s callbacks for each supported model trainer and then start a basic training run to log artifacts.

XGBoostLightGBMKeras

```python
# exp: ExperimentTracking

from xgboost import XGBClassifier

from snowflake.ml.experiment.callback.xgboost import SnowflakeXgboostCallback
from snowflake.ml.model.model_signature import infer_signature

sig = infer_signature(X, y)
callback = SnowflakeXgboostCallback(
    exp, model_name="name", model_signature=sig
)
model = XGBClassifier(callbacks=[callback])
with exp.start_run("my_run"):
    model.fit(X, y, eval_set=[(X, y)])
```

```python
# exp: ExperimentTracking

from lightgbm import LGBMClassifier

from snowflake.ml.experiment.callback.lightgbm import SnowflakeLightgbmCallback
from snowflake.ml.model.model_signature import infer_signature

sig = infer_signature(X, y)
callback = SnowflakeLightgbmCallback(
    exp, model_name="name", model_signature=sig
)
model = LGBMClassifier()
with exp.start_run("my_run"):
    model.fit(X, y, eval_set=[(X, y)], callbacks=[callback])
```

```python
# exp: ExperimentTracking

import keras

from snowflake.ml.experiment.callback.keras import SnowflakeKerasCallback
from snowflake.ml.model.model_signature import infer_signature

sig = infer_signature(X, y)
callback = SnowflakeKerasCallback(
    exp, model_name="name", model_signature=sig
)
model = keras.Sequential()
model.add(keras.layers.Dense(1))
model.compile(
    optimizer=keras.optimizers.RMSprop(learning_rate=0.1),
    loss="mean_squared_error",
    metrics=["mean_absolute_error"],
)
with exp.start_run("my_run"):
    model.fit(X, y, validation_split=0.5, callbacks=[callback])
```

### Manually log training information and artifacts

For models which don’t support automatic logging or are pre-trained, you can manually log experiment information and upload artifacts in Python. Parameters are constant inputs to the training model, while metrics are evaluated at a model *step*. You can choose to represent a training epoch as a corresponding step. The following example shows how to log parameters, log metrics, and upload artifacts.

> **Note:**
>
> The default step value is `0`.

```python
# Logging requires an active run for the exp: ExperimentTracker instance.

# Log model parameters with the log_param(...) or log_params(...) methods
exp.log_param("learning_rate", 0.01)
exp.log_params({"optimizer": "adam", "batch_size": 64})

# Log model metrics with the log_metric(...) or log_metrics(...) methods
exp.log_metric("loss", 0.3, step=100)
exp.log_metrics({"loss": 0.4, "accuracy": 0.8}, step=200)

# Log your model to the experiment's model registry with the log_model(...) method.
exp.log_model(model, model_name="my_model", signatures={"predict": model_signature})
exp.log_model(model, model_name="my_model", sample_input_data=data)

# Log local artifacts to an experiment run with the log_artifact(...) method.
exp.log_artifact('/tmp/file.txt', artifact_path='artifacts')
```

### Log stdout and stderr output

When a run is active on a Snowflake Notebook or any other SPCS workload such as ML Jobs, you can log the stdout and stderr output as part of your run. To enable live logging, call the following method:

```python
experiment.set_live_logging_status(True)
```

When live logging is enabled, stdout and stderr output from the notebook is written to the Snowflake default [event table](../logging-tracing/event-table-setting-up.md). To view the captured output, go to the Experiments UI and select the run. The output is displayed in the Logs tab.

Please note that this feature does not work with legacy notebooks.

## Complete a run

Completing a run makes it immutable and presents it as finished in Snowsight.

If you started a run as part of a `with` statement, the run is automatically completed when exiting scope. Otherwise, you can end a run by calling your experiment’s `end_run(name: Optional[str])` method with the name of the run to complete:

```python
experiment.end_run("my_run")
```

## Compare runs within an experiment

Experiment evaluation is done through Snowsight. In the navigation menu, select AI & ML » Experiments and select your experiment to examine from the list.

The runs list displays Run name, Status, Created date, and a column for each metric. You can also toggle parameters as additional columns. View more details, such as artifacts, metric charts, and linked model versions from the run view and run comparison views.

> **Note:**
>
> Viewing linked model versions is part of the [Model Lineage feature](ml-lineage.md), which is only available for customers on Enterprise Edition and above.

You can select up to five runs in your experiment. To compare runs, select the Compare button. You’re presented with the comparison view, which displays run metadata, parameters, metrics, and model version information.

## Search and filter runs

You can programmatically search and filter runs using the `list_metrics` and `list_params` methods. Each method
returns a Snowpark DataFrame with one row per run: `list_metrics` includes a `run_name` column and one float column
per logged metric, while `list_params` includes a `run_name` column and one string column per logged parameter.

Because the results are Snowpark DataFrames, you can join, filter, and sort them using Snowpark expressions, or convert
them to pandas for local analysis.

Snowparkpandas

```python
from snowflake.ml.experiment import ExperimentTracking
from snowflake.snowpark import functions as F

exp = ExperimentTracking(session)
exp.set_experiment("my_experiment")

run_names = ["RUN_1", "RUN_2"]

metrics = exp.list_metrics()
params  = exp.list_params()
runs    = metrics.join(params, on='"run_name"')

results = runs.filter(
  (F.col('"run_name"').isin(run_names))
  & (F.col('"loss"') > 0.3)
  & (F.col('"f1 score"') < 0.5)
  & (F.col('"model"').like("GPT%"))
)

results.show()
```

```python
from snowflake.ml.experiment import ExperimentTracking

exp = ExperimentTracking(session)
exp.set_experiment("my_experiment")

run_names = ["RUN_1", "RUN_2"]

metrics_df = exp.list_metrics().to_pandas()
params_df  = exp.list_params().to_pandas()
runs_df    = metrics_df.merge(params_df, on="run_name")

results = runs_df[
  (runs_df["run_name"].isin(run_names))
  & (runs_df["loss"] > 0.3)
  & (runs_df["f1 score"] < 0.5)
  & (runs_df["model"].str.startswith("GPT"))
]

print(results)
```

You can also retrieve metrics or parameters for a single run by passing a `run_name` argument:

```python
single_run_metrics = exp.list_metrics(run_name="RUN_1")
```

## Retrieve artifacts from a run

At any time during or after a run, you can retrieve artifacts. The following example shows how to list a run’s available artifacts in the `logs` path, and download the `logs/log0.txt` artifact for the run `my_run` in the experiment `my_experiment` to the local directory `/tmp`:

PythonSQL

```python
# exp: ExperimentTracking
exp.set_experiment("my_experiment")

exp.list_artifacts("my_run", artifact_path="logs")
exp.download_artifacts("my_run", artifact_path="logs/log0.txt", target_path="/tmp")
```

```sqlexample
LIST snow://experiment/my_experiment/versions/my_run/logs;
GET snow://experiment/my_experiment/versions/my_run/logs/log0.txt file:///tmp;
```

## Delete runs and experiments

After finishing an experiment, you can remove it and all of its associated run artifacts. The following example removes the experiment `my_experiment`:

PythonSQL

```python
# exp: ExperimentTracking
exp.delete_experiment("my_experiment")
```

```sqlexample
DROP EXPERIMENT my_experiment;
```

You can also remove an individual run from an experiment. The following example removes the run `my_run` from the experiment `my_experiment`:

PythonSQL

```python
# exp: ExperimentTracking
exp.set_experiment("my_experiment")
exp.delete_run("my_run")
```

```sqlexample
ALTER EXPERIMENT my_experiment DROP RUN my_run;
```

## Limitations

Snowflake Experiments are subject to the following limitations:

* Each schema is limited to 500 experiments.
* Each experiment is limited to 500 runs.
* Runs are limited to 1000 unique parameters and 200 unique metrics.

## Cost considerations

There is no additional cost to use Snowflake Experiments. It incurs standard Snowflake consumption-based costs. These include the following:

* Cost of storing run artifacts. For general information about storage costs, see [Exploring storage cost](../../user-guide/cost-exploring-data-storage.md).
* Cost of visualizing data. The charts in the UI are powered by virtual warehouses. For more information, see [Viewing credit usage](../../user-guide/cost-exploring-compute.md).

---
title: Scale an application using Ray
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/scale-application-ray.md
section: Snowflake ML
---

# Scale an application using Ray

The Snowflake container runtime integrates with [Ray](https://docs.ray.io/), an open-source unified framework for scaling AI and Python applications. This integration allows you to use Ray’s distributed computing capabilities on Snowflake for your machine learning workloads.

Ray is pre-installed and runs as a background process within the Snowflake ML container runtime. You can access Ray from the Container Runtime in the following ways:

**Snowflake Notebooks**: An interactive environment where you can connect to Ray, define tasks, and scale your cluster dynamically for development and experimentation.

**Snowflake ML Jobs**: Submit your Ray applications as structured, repeatable jobs. You can specify the cluster size as part of the job configuration for production workloads.

When you run the container runtime within a Snowflake Notebook or ML Job, the Ray process is automatically initiated as part of that container.

Use the following Python code to connect to the cluster:

```python
import ray
# Connect to the pre-existing Ray cluster within the Snowflake environment
ray.init(address="auto", ignore_reinit_error=True)
print(f"Ray cluster resources: {ray.cluster_resources()}")
```

> **Important:**
>
> Make sure you always use the `"auto"` address when you’re connecting to the Ray cluster.
> Initializing with the `"auto"` address directs your application to the head node of the Ray cluster that Snowflake has provisioned for your session.

## Scaling your Ray cluster

After you connect to the Ray cluster, you can adjust its size to meet the computational demands of your workload.

Use the following approaches to scale your Ray cluster:

In a Snowflake NotebookIn a Snowflake ML Job

Within a notebook, you can dynamically scale your cluster up or down using the `scale_cluster` function. This is ideal for interactive workflows where resource needs might change.

When you specify `expected_cluster_size=5`, you get 1 head node and 4 worker nodes.

```python
from snowflake.ml.runtime_cluster import scale_cluster, get_nodes

# Check current cluster size
print(f"Current cluster size: {len(get_nodes())} nodes")

# Scale up to 4 nodes (1 head + 3 workers)
print("Scaling up cluster...")
scale_cluster(expected_cluster_size=4)
print(f"New cluster size: {len(get_nodes())} nodes")
```

For ML Jobs, you define the cluster size declaratively within your job definition. Specifying the cluster size in the job definition ensures that the required number of nodes is provisioned when the job starts.

For example, your job decorator might include:

```python
from snowflake.ml.jobs import remote

@remote(
  "MY_COMPUTE_POOL",
  stage_name="payload_stage",
  session=session,
  target_instances=5  # Specify the number of nodes
)
def distributed_ray():
  import ray
  ray.init(address="auto", ignore_reinit_error=True)
  print(f"Ray cluster resources: {ray.cluster_resources()}")

job = distributed_ray()
```

After you’ve finished using your cluster you can scale it down. For more information, see Cleaning up.

### Monitoring with the Ray Dashboard

If you’re running a job from a Snowflake Notebook, you can use the Ray Dashboard to monitor your cluster. The dashboard is a web interface that allows you to view the cluster’s resources, jobs, tasks, and performance.
Use the following code to get the dashboard URL:

```python
from snowflake.ml.runtime_cluster import get_ray_dashboard_url

# This function is available in Notebooks to retrieve the dashboard URL
dashboard_url = get_ray_dashboard_url()
print(f"Access the Ray Dashboard here: {dashboard_url}")
```

Open the URL in a new browser tab, log in with your Snowflake credentials.

## Advanced use cases

This section covers advanced Ray features for complex workloads and for migrating existing applications.

### Creating and operating distributed workloads with Ray

Ray provides components that enable you to create and operate distributed workloads. These include foundational components via Ray Core with essential primitives for building and scaling these workloads.

It also includes the following libraries that enable you build your own workflows for data preprocessing, ML training, hyperparameter tuning, and model inference:

* **Ray Data**: Scalable data processing and transformation
* **Ray Train**: Distributed training and fine-tuning of ML models
* **Ray Tune**: Hyperparameter optimization with advanced search algorithms
* **Ray Serve**: Model serving and inference

The following sections describe how you can use these libraries directly, while native Snowflake interfaces built over Ray provide additional tools to build, deploy, and operationalize Ray-based applications.

#### Ray Core: Tasks and Actors

Ray provides the following distributed computing primitives:

* **Tasks**: Stateless functions that run remotely and return values
* **Actors**: Stateful classes that can be instantiated remotely and called multiple times
* **Objects**: Immutable values stored in Ray’s distributed object store
* **Resources**: CPU, GPU, and custom resource requirements for tasks and actors

The following example demonstrates how to use a basic Ray Task and Actors to do linear regression:

```python
import ray
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

# Initialize Ray (automatically connects to cluster in Snowflake ML)
ray.init(address="auto", ignore_reinit_error=True)

# Create sample data
large_dataset = np.random.randn(1000, 10)
batch_data = pd.DataFrame(np.random.randn(100, 5), columns=[f'feature_{i}' for i in range(5)])

# Ray Tasks - stateless remote functions
@ray.remote
def compute_heavy_task(data):
    """CPU-intensive computation example"""
    # Simulate heavy computation (matrix operations)
    result = np.dot(data, data.T)
    return np.mean(result)

# Ray Actors - stateful remote classes
@ray.remote
class DataProcessor:
    def __init__(self):
        # Load a simple model
        self.model = LinearRegression()
        # Train on dummy data
        X_dummy = np.random.randn(100, 5)
        y_dummy = np.random.randn(100)
        self.model.fit(X_dummy, y_dummy)

    def process_batch(self, batch):
        # Convert to numpy if it's a DataFrame
        if isinstance(batch, pd.DataFrame):
            batch_array = batch.values
        else:
            batch_array = batch
        return self.model.predict(batch_array)

# Submit tasks and get object references
future = compute_heavy_task.remote(large_dataset)
result = ray.get(future)  # Blocks until task completes
print(f"Task result: {result}")

# Create and use actors
processor = DataProcessor.remote()
batch_result = ray.get(processor.process_batch.remote(batch_data))
print(f"Batch processing result shape: {batch_result.shape}")
```

#### Ray Train: Distributed Training

Ray Train is a library that enables distributed training and fine-tuning of models. You can run your training code on a single machine or an entire cluster.

You can use Ray Train for both single-node and multi-node execution.

For multi-node training, you must handle the following:

* Distributed storage for checkpoints (no shared filesystem across nodes)
* Custom data loading
* Manual resource configuration to coordinate between data ingestion and training resource usage

For a streamlined experience, use the Optimized Training functions for XGBoost, LightGBM, and PyTorch. On the same Ray cluster, these functions handle:

* Snowflake stage-based checkpointing
* Native Snowflake data ingestion
* Built-in resource allocation for data ingestion and training

#### Ray Data: Scalable Data Processing

Ray Data provides scalable, distributed data processing for ML workloads. It can handle datasets larger than cluster memory through streaming execution and lazy evaluation.

> **Note:**
>
> Snowflake offers a native integration to transform any Snowflake data source to Ray Data. For more information, see the Data Connector and Ray Data Ingestion pages.

Use Ray Data for:

* Processing large datasets that don’t fit in single-node memory
* Distributed data preprocessing and feature engineering
* Building data pipelines that integrate with other Ray libraries

```python
import ray
import ray.data as rd
import pandas as pd
import numpy as np
from snowflake.ml.runtime_cluster import scale_cluster

# Initialize Ray
ray.init(address="auto", ignore_reinit_error=True)

# Optional: Scale cluster for better performance with large datasets or CPU-intensive operations
# Scaling benefits Ray Data when:
# - Processing datasets larger than single-node memory (>10GB)
# - Performing CPU-intensive transformations (complex feature engineering, ML preprocessing)
# - Need faster processing through parallelization across multiple nodes
scale_cluster(expected_cluster_size=4)

# Create sample dataset
np.random.seed(42)
n_samples = 50000
n_features = 15

# Generate features with some correlation structure
base_features = np.random.randn(n_samples, 5)
derived_features = np.column_stack([
    base_features[:, 0] * base_features[:, 1],  # interaction
    np.sin(base_features[:, 2]),  # non-linear
    base_features[:, 3] ** 2,  # polynomial
    np.random.randn(n_samples, n_features - 8)  # additional random features
])

X = np.column_stack([base_features, derived_features])
y = (X[:, 0] + 0.5 * X[:, 1] - 0.3 * X[:, 2] + 0.1 * X[:, 5] + np.random.randn(n_samples) * 0.2 > 0).astype(int)

sample_data = pd.DataFrame(X, columns=[f'feature_{i}' for i in range(n_features)])
sample_data['target'] = y

print(f"Created dataset with {n_samples} samples and {n_features} features")

# Create Ray Dataset from pandas DataFrame
ray_dataset = rd.from_pandas(sample_data)

# Transform data with Ray Data operations
def preprocess_batch(batch):
    """Preprocess a batch of data"""
    # Get all feature columns
    feature_cols = [col for col in batch.columns if col.startswith('feature_')]

    # Normalize numerical features (first 3 for demo)
    for col in feature_cols[:3]:
        if col in batch.columns:
            batch[f'{col}_scaled'] = (batch[col] - batch[col].mean()) / batch[col].std()

    # Add derived features using actual column names
    if 'feature_0' in batch.columns and 'feature_1' in batch.columns:
        batch['feature_0_squared'] = batch['feature_0'] ** 2
        batch['feature_interaction'] = batch['feature_0'] * batch['feature_1']

    return batch

# Apply transformations lazily
processed_dataset = ray_dataset.map_batches(
    preprocess_batch,
    batch_format="pandas"
)

# Repartition for optimal performance across cluster nodes
processed_dataset = processed_dataset.repartition(num_blocks=8)

# Convert to different formats for downstream use
print("Converting to pandas...")
pandas_df = processed_dataset.to_pandas()  # Collect to pandas
print(f"Processed dataset shape: {pandas_df.shape}")
print(f"New columns: {list(pandas_df.columns)}")

# Iterate through batches for memory efficiency
print("Processing batches...")
batch_count = 0
for batch in processed_dataset.iter_batches(batch_size=1000, batch_format="pandas"):
    batch_count += 1
    print(f"Batch {batch_count}: {batch.shape}")
    if batch_count >= 3:  # Just show first 3 batches
        break

print(f"Total batches processed: {batch_count}")
```

#### Ray Tune: Distributed Hyperparameter Tuning

Ray Tune provides distributed hyperparameter optimization with advanced search algorithms and early stopping capabilities. For a more integrated and optimized experience when reading from Snowflake data sources, use the native Hyperparameter Optimization (HPO) API. For more information about using HPO optimization, see [Optimize a model’s hyperparameters](container-hpo.md).

If you’re looking for a more customizable approach to a distributed HPO implementation, use Ray Tune.

You can use Ray Tune for the following use cases:

* Hyperparameter optimization across multiple trials in parallel
* Advanced search algorithms (Bayesian optimization, population-based training)
* Large-scale hyperparameter sweeps requiring distributed execution

```python
import ray
from ray import tune
import pandas as pd
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score
from snowflake.ml.runtime_cluster import scale_cluster

# Initialize Ray
ray.init(address="auto", ignore_reinit_error=True)

# Optional: Scale cluster for hyperparameter tuning
# Scaling benefits Ray Tune when:
# - Running many trials in parallel
# - Each trial is computationally intensive
# - Need faster hyperparameter search
scale_cluster(expected_cluster_size=6)

# Create sample dataset
np.random.seed(42)
n_samples = 5000
n_features = 10

X = np.random.randn(n_samples, n_features)
y = ((X[:, 0] + X[:, 1] * X[:, 2] + np.sin(X[:, 3]) + np.random.randn(n_samples) * 0.3) > 0).astype(int)

# Split data
X_train, X_val, y_train, y_val = train_test_split(X, y, test_size=0.2, random_state=42)

def train_function(config):
    """Training function that gets hyperparameters from Ray Tune"""
    # Train model with current hyperparameters
    model = RandomForestClassifier(
        n_estimators=config["n_estimators"],
        max_depth=config["max_depth"],
        min_samples_split=config["min_samples_split"],
        random_state=42,
        n_jobs=-1
    )

    model.fit(X_train, y_train)

    # Evaluate and report results
    val_predictions = model.predict(X_val)
    accuracy = accuracy_score(y_val, val_predictions)

    # Report metrics back to Ray Tune
    return {"accuracy": accuracy}

# Define search space
search_space = {
    "n_estimators": tune.randint(50, 200),
    "max_depth": tune.randint(3, 15),
    "min_samples_split": tune.randint(2, 10)
}

# Configure and run hyperparameter optimization
tuner = tune.Tuner(
    tune.with_resources(
        train_function,
        resources={"CPU": 2}
    ),
    param_space=search_space,
    tune_config=tune.TuneConfig(
        metric="accuracy",
        mode="max",
        num_samples=20,  # Number of trials
        max_concurrent_trials=4
    )
)

print("Starting hyperparameter optimization...")
results = tuner.fit()

# Get best results
best_result = results.get_best_result()
print(f"✅ Hyperparameter tuning completed!")
print(f"   Best accuracy: {best_result.metrics['accuracy']:.4f}")
print(f"   Best parameters: {best_result.config}")

# Show results summary
df_results = results.get_dataframe()
print(f"\nTop 5 results:")
top_results = df_results.nlargest(5, 'accuracy')
for i, (_, row) in enumerate(top_results.iterrows(), 1):
    print(f"  {i}. Accuracy: {row['accuracy']:.4f}, n_estimators: {row['config/n_estimators']}, max_depth: {row['config/max_depth']}")
```

#### Model Serving

For model serving, you can use Snowflake’s native capabilities. For more information, see [Deploy models for Real time Inference (REST API)](inference/real-time-inference-rest-api.md).

#### Submit and manage distributed applications on Ray clusters

Use Ray Jobs to submit and manage distributed applications on Ray clusters with better resource isolation and lifecycle management. For all job-based executions that require access to a Ray Cluster, Snowflake recommends using an ML Job, where you can define the Ray application logic. For instances where you require direct access to the Ray Job interface, such as migrating an existing implementation, you could use the Ray Job primitive as is described in the [Ray documentation](https://docs.ray.io/en/latest/cluster/running-applications/job-submission/sdk.html).

Use Ray jobs for:

* Production ML pipelines and scheduled workflows
* Long-running workloads requiring fault tolerance
* Batch processing and large-scale data processing

```python
import ray
from ray.job_submission import JobSubmissionClient
import os

# Initialize Ray and get job client
ray.init(address="auto", ignore_reinit_error=True)

# Get Ray dashboard address for job submission
node_ip = os.getenv("NODE_IP_ADDRESS", "0.0.0.0")
dashboard_port = os.getenv("DASHBOARD_PORT", "9999")
dashboard_address = f"http://{node_ip}:{dashboard_port}"

client = JobSubmissionClient(dashboard_address)

# Simple job script
job_script = '''
import ray

@ray.remote
def compute_task(x):
    return x * x

# Submit tasks to Ray cluster
futures = [compute_task.remote(i) for i in range(5)]
results = ray.get(futures)
print(f"Results: {results}")
'''

# Submit job
job_id = client.submit_job(
    entrypoint=f"python -c '{job_script}'",
    runtime_env={"pip": ["numpy"]},
    submission_id="my-ray-job"
)

print(f"Submitted job: {job_id}")

# Monitor job status
status = client.get_job_status(job_id)
print(f"Job status: {status}")
```

### Scaling Ray Clusters with Options

From a Snowflake Notebook, you can scale your Ray clusters to precisely match computational demands. A cluster consists of a head node (coordinator) and worker nodes (for task execution).

```python
from snowflake.ml.runtime_cluster import scale_cluster, get_nodes

# Asynchronous scaling - returns immediately
scale_cluster(
    expected_cluster_size=2,
    is_async=True  # Don't wait for all nodes to be ready
)

# Scaling with custom options
scale_cluster(
    expected_cluster_size=3,
    options={
        "rollback_after_seconds": 300,  # Auto-rollback after 5 minutes
        "block_until_min_cluster_size": 2  # Return when at least 2 nodes ready
    }
)

# Scale down for cost efficiency
scale_cluster(expected_cluster_size=2)
```

#### Resource monitoring

```python
import ray
from snowflake.ml.runtime_cluster import get_nodes
from snowflake.ml.runtime_cluster.cluster_manager import (
    get_available_cpu, get_available_gpu, get_num_cpus_per_node
)

# Check available resources
available_cpus = get_available_cpu()
available_gpus = get_available_gpu()
cpus_per_node = get_num_cpus_per_node()

print(f"Available CPUs: {available_cpus}")
print(f"Available GPUs: {available_gpus}")
print(f"CPUs per node: {cpus_per_node}")

# Get Ray's view of resources
ray_resources = ray.available_resources()
print(f"Ray available resources: {ray_resources}")

# Calculate resource utilization
total_cpus = ray.cluster_resources().get('CPU', 0)
used_cpus = total_cpus - available_cpus
utilization = (used_cpus / total_cpus * 100) if total_cpus > 0 else 0
print(f"CPU Utilization: {utilization:.1f}%")
```

### Cleaning up

After you’re finished with the cluster, you can scale it down to avoid additional charges. Use the following code to scale it down:

```python
# Scale down when finished to conserve resources
print("Scaling down cluster...")
scale_cluster(expected_cluster_size=1)
print(f"Final cluster size: {len(get_nodes())} nodes")
```

---
title: scikit-learn
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/scikit-learn.md
section: Snowflake ML
---

# scikit-learn

The registry supports models created using scikit-learn (models derived from `sklearn.base.BaseEstimator` or
`sklearn.pipeline.Pipeline`).

The following additional options can be used in the `options` dictionary
when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. scikit-learn models have the following target methods by default, assuming the method exists: `predict`, `transform`, `predict_proba`, `predict_log_proba`, `decision_function`. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a scikit-learn model
so that the registry knows the signatures of the target methods.

## Example

In this example, a `RandomForestClassifier` and `Pipeline` are trained and logged to the model registry.

```python
from snowflake.ml.registry import Registry
from sklearn import datasets, ensemble

# create a session and set DATABASE and SCHEMA
# session = ...

registry = Registry(session=session, database_name=DATABASE, schema_name=SCHEMA)

iris_X, iris_y = datasets.load_iris(return_X_y=True, as_frame=True)

# Rename columns so they are valid Snowflake identifiers
column_name_map = {
        'sepal length (cm)': 'sepal_length',
        'sepal width (cm)': 'sepal_width',
        'petal length (cm)': 'petal_length',
        'petal width (cm)': 'petal_width'
}
iris_X = iris_X.rename(columns=column_name_map)

# Train the model
clf = ensemble.RandomForestClassifier(random_state=42)
clf.fit(iris_X, iris_y)

# Log the model in the registry
model_ref = registry.log_model(
    clf,
    model_name="RandomForestClassifier",
    version_name="v1",
    sample_input_data=iris_X,
    options={
        "method_options": {
            "predict": {"case_sensitive": True},
            "predict_proba": {"case_sensitive": True},
            "predict_log_proba": {"case_sensitive": True},
        }
    },
)

# Generate predictions
model_ref.run(iris_X[-10:], function_name='"predict_proba"')

# Pipelines can also be logged in the registry
from sklearn import pipeline, preprocessing

pipe = pipeline.Pipeline([
    ('scaler', preprocessing.StandardScaler()),
    ('classifier', ensemble.RandomForestClassifier(random_state=42)),
])
pipe.fit(iris_X, iris_y)

model_ref = registry.log_model(
    pipe,
    model_name="Pipeline",
    version_name="v1",
    sample_input_data=iris_X,
    options={
        "method_options": {
            "predict": {"case_sensitive": True},
            "predict_proba": {"case_sensitive": True},
            "predict_log_proba": {"case_sensitive": True},
        }
    },
)

# Generate predictions
model_ref.run(iris_X[-10:], function_name='"predict_proba"')
```

> **Note:**
>
> You can combine scikit-learn preprocessing with a XGBoost model as a scikit-learn pipeline.

---
title: Sentence Transformer
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/sentence-transformer.md
section: Snowflake ML
---

# Sentence Transformer

The Snowflake Model Registry supports models that use Sentence Transformers (`sentence_transformers.SentenceTransformer`).
For more information, see the [Sentence Transformers documentation](https://sbert.net/).

For the registry to know the signatures of the target methods, you must specify either sample input data or the signatures that define the input and output schema for the model’s methods.

For sample input data, specify a Snowpark DataFrame as the value for the `sample_input_data` parameter. For example you can specify a value such as `sample_input = pd.DataFrame(["This is a sample sentence."], columns=["TEXT"])`.

If you’re using the signatures parameter, specify a dictionary as the value for the `signatures` parameter. The dictionary defines the input and output methods for the model. For example, the following code defines the input and output schema for the model’s `encode` method:

```python
from snowflake.ml.model.model_signature import ModelSignature, FeatureSpec, DataType

  signatures = {
      "encode": ModelSignature(
          inputs=[FeatureSpec(dtype=DataType.STRING, name='TEXT')],
          outputs=[FeatureSpec(dtype=DataType.FLOAT, name='EMBEDDINGS', shape=(-1,))]
      )
  }
```

When you call `log_model`, you can use the following additional options in the `options` dictionary:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. Sentence Transformer models have the following target method by default, assuming the method exists: `encode`. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with a GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |

The following example:

* Loads a pre-trained Sentence Transformer model.
* Logs it to the Snowflake ML Model Registry
* Uses the logged model for inference.

> **Note:**
>
> In the example, `reg` is an instance of `snowflake.ml.registry.Registry`. For information on
> creating a registry object, see [Snowflake Model Registry](../overview.md).

```python
from sentence_transformers import SentenceTransformer
import pandas as pd

# 1. Initialize the model
# This example uses the 'all-MiniLM-L6-v2' model, which is a popular
# and efficient model for generating sentence embeddings.
model = SentenceTransformer('all-MiniLM-L6-v2')

# 2. Prepare sample input data
# Sentence Transformers expect a single column of text data for the 'encode' method.
sentences = ["This is an example sentence", "Each sentence is converted into a vector"]
sample_input = pd.DataFrame(sentences, columns=["TEXT"])

# 3. Log the model
# Provide the model object, a name, and a version.
# Including sample_input_data allows the registry to infer the input/output signatures.
model_ref = reg.log_model(
    model=model,
    model_name="my_sentence_transformer",
    version_name="v1",
    sample_input_data=sample_input,
)

# 4. Use the model for inference
# The 'run' method executes the default 'encode' function on the input data.
result_df = model_ref.run(sample_input, function_name="encode")

# The result is a DataFrame where the output column (usually named 'outputs')
# contains the embeddings as arrays of floats.
print(result_df)
```

---
title: Service Management & Scaling
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/service-management.md
section: Snowflake ML
---

# Service Management & Scaling

Once a model is deployed to Snowpark Container Services (SPCS), you must manage its lifecycle, resource consumption, and reliability. This page covers standard operations, observability, and configuring high availability for production workloads.

## Managing services

Snowpark Container Services offers a SQL interface for managing services. You can use the DESCRIBE SERVICE and ALTER SERVICE commands with SPCS services created by Snowflake Model Serving just as you would for managing any other SPCS service. For example, you can:

* Change MIN_INSTANCES and other properties of a service
* Drop (delete) a service
* Share a service to another account
* Change ownership of a service (the new owner must have READ access to the model)

> **Note:**
>
> If the owner of a service loses access to the underlying model for any reason, the service stops working after a restart. It will continue running until it is restarted.

To ensure reproducibility and debugability, you cannot change the specification of an existing inference service. You can, however, copy the specification, customize it, and use the customized specification to create your own service to host the model. However, this method does not protect the underlying model from being deleted. Furthermore, it does not track lineage. It is best to allow Snowflake Model Serving to create services.

## Scaling services

> **Note:**
>
> Starting with snowflake-ml-python 1.25.0, you can define the scaling boundaries for your inference service by setting min_instances and max_instances within the create_service method.

### How Autoscaling Works

The service initializes with the number of nodes specified in min_instances and dynamically scales within your defined range based on real-time traffic volume and hardware utilization.

**Scale-to-Zero (Auto-Suspend):** If min_instances is set to 0 (the default), the service will automatically suspend if no traffic is detected for 30 minutes.

**Scaling Latency:** Scaling triggers typically activate after one minute of meeting the required condition. Note that total spin-up time includes this trigger period plus the time required to provision and initialize new service instances.

### Configuration Best Practices

| Parameter | Recommended Strategy |
| --- | --- |
| min_instances | Set to 1 or more for production workloads to ensure immediate availability and avoid cold-start delays. |
| max_instances | Set to accommodate peak demand while maintaining a ceiling on resource consumption and cost. |

## Suspending services

The default min_instances=0 setting allows the service to auto-suspend after 30 minutes of inactivity. Incoming requests will trigger a resume, with the total delay determined by compute pool availability and the model’s loading time (startup delay).

To manually suspend or resume a service, use the ALTER SERVICE command.

```sqlexample
ALTER SERVICE my_service [ SUSPEND | RESUME ];
```

## Deleting models

You can manage models and model versions as usual with either the SQL interface or the Python API, with the restriction that a model or model version that is being used by a service (whether running or suspended) cannot be dropped (deleted). To drop a model or model version, drop the service first.

## Monitoring services

When running models in Snowpark Container Services, you can monitor service health and troubleshoot issues by accessing container logs and metrics. Model serving services generate logs that can help you understand service behavior, diagnose errors, and optimize performance.

For comprehensive information about monitoring SPCS services, including accessing metrics and logs, see [Monitoring services](../../snowpark-container-services/monitoring-services.md).

### In Snowsight

You can monitor model serving services in Snowsight:

1. In the navigation menu, select **Monitoring » Services & jobs**.
2. On the **Services** tab, select your service to view the service details page.
3. The **Overview** tab displays service information including the compute pool, endpoints, and instance count.
4. The **Logs**, **Metrics**, and **Events** tabs provide logs, performance metrics, and service events (such as instance provisioning and shutdowns). Filter results by instance and container name as needed.

### Accessing service logs

You can access logs for your model serving services using any of the following methods:

#### Using the service helper function

Model serving include a built-in helper function that retrieves logs from the event table for running or suspended services:

```sqlexample
-- Retrieve logs using the service helper function
SELECT * FROM TABLE(mydb.myschema.my_model_service!SPCS_GET_LOGS())
WHERE
timestamp > dateadd(hour, -1, current_timestamp())
AND instance_id = 0  -- choose all instances or one particular
AND container_name = 'model-inference';
```

#### Querying the event table directly

If you have an event table configured for your account, you can query it directly to retrieve service logs:

```sqlexample
-- Find the event table for your account
SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;

-- Query the event table for model service logs
SELECT TIMESTAMP, RESOURCE_ATTRIBUTES, RECORD_ATTRIBUTES, VALUE
FROM <current_event_table_for_your_account>
WHERE timestamp > dateadd(hour, -1, current_timestamp())
    AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<model_service_name>'
    AND RECORD_TYPE = 'LOG'
    AND RESOURCE_ATTRIBUTES:"snow.service.container.instance" = '0'  -- choose all instances or one particular
    AND RESOURCE_ATTRIBUTES:"snow.service.container.name" = 'model-inference'
ORDER BY timestamp DESC
LIMIT 10;
```

#### Using the system function (Running instances only)

For real-time debugging of active containers you can use the SYSTEM$GET_SERVICE_LOGS function:

```sqlexample
-- Retrieve logs from a specific service instance
SELECT SYSTEM$GET_SERVICE_LOGS('model_service_name', '0', 'model-inference', 10);
```

> **Note:**
>
> The container name for model inference services is model-inference. For troubleshooting image build issues, use model-build as the container name.

### Accessing service metrics

Model serving services emit performance and health metrics that can help you monitor resource utilization, request rates, latency, and other operational characteristics. These metrics are captured in the event table and can be queried to analyze service performance over time.

For more information about SPCS service metrics, see [Accessing event table service metrics](../../snowpark-container-services/monitoring-services.md).

#### Using the service helper function

Model serving services include a built-in helper function that retrieves metrics from the event table for running or suspended services:

```sqlexample
-- Retrieve metrics using the service helper function
SELECT *
FROM TABLE(mydb.myschema.my_model_service!SPCS_GET_METRICS())
WHERE
timestamp > dateadd(hour, -1, current_timestamp())
AND instance_id = 0  -- choose all instances or one particular
AND container_name = 'model-inference';
```

#### Querying the event table directly

You can query the event table directly to retrieve and filter specific metrics:

```sqlexample
-- Find the event table for your account
SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;

-- Query the event table for model service metrics
SELECT
    timestamp,
    RESOURCE_ATTRIBUTES:"snow.service.container.instance" as instance,
    RESOURCE_ATTRIBUTES:"snow.service.container.name" as container,
    RECORD:metric:"name" as metric,
    value
FROM my_event_table_db.my_event_table_schema.my_event_table
WHERE timestamp > DATEADD(hour, -1, CURRENT_TIMESTAMP())
    AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<model_service_name>'
    AND RECORD_TYPE = 'METRIC'
    AND RESOURCE_ATTRIBUTES:"snow.service.container.instance" = '0'  -- choose all instances or one particular
    AND RESOURCE_ATTRIBUTES:"snow.service.container.name" = 'model-inference'
ORDER BY timestamp DESC
LIMIT 100;
```

## Fault tolerance

In any distributed system, failures happen. For mission-critical workloads it is on users to configure the service to be resilient against node and zonal failures.

### Node Failure Resilience

To tolerate standard node failures, Snowflake recommends over-provisioning by 50% or maintaining a minimum of 3 instances (whichever is higher).

**Example:** If you need 4 instances to support peak traffic, you should provision 6 instances

### Zonal Failure Resilience

For mission-critical workloads that require resilience against a full zonal failure, you can use a distributed [compute pool](../../../sql-reference/sql/create-compute-pool.md) when creating a [service](../../../sql-reference/sql/create-service.md). Distributed compute pools are created with the PLACEMENT_GROUP parameter set to DISTRIBUTED. For more information about distributed compute pools, see [Compute pool placement](../../snowpark-container-services/working-with-compute-pool.md).

### Configuration Guide

#### Convert an Existing Pool

> **Warning:**
>
> You cannot change this setting on an active pool. You must suspend it first.

```sqlexample
ALTER COMPUTE POOL my_pool SUSPEND;

ALTER COMPUTE POOL my_pool
  SET PLACEMENT_GROUP = 'DISTRIBUTED';

ALTER COMPUTE POOL my_pool RESUME;
```

#### Revert an Existing Pool

> **Warning:**
>
> You cannot change this setting on an active pool. You must suspend it first.

```sqlexample
ALTER COMPUTE POOL my_pool SUSPEND;

ALTER COMPUTE POOL my_pool
  UNSET PLACEMENT_GROUP;

ALTER COMPUTE POOL my_pool RESUME;
```

#### Verification

To confirm your pool is correctly configured for HA, check the placement_group column:

```sqlexample
DESCRIBE COMPUTE POOL my_service_pool;
```

---
title: Snowflake Container Runtime
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime-ml.md
section: Snowflake ML
---

# Snowflake Container Runtime

## Overview

The Snowflake Container Runtime is a set of preconfigured customizable environments built for machine learning on Snowpark Container Services,
covering interactive experimentation and batch ML workloads such as model training, hyperparameter tuning, batch
inference and fine tuning. They include the most popular machine learning and deep learning frameworks. Used with
Snowflake notebooks, they provide an end-to-end ML experience.

## Execution environment

The Container Runtime provides an environment populated with packages and libraries that support a wide variety
of ML development tasks inside Snowflake. In addition to the pre-installed packages, you can import packages from
external sources like public PyPI repositories, or internally-hosted package repositories that provide a list of
packages approved for use inside your organization.

Executions of your custom Python ML workloads and supported training APIs occur within Snowpark Container Services, which offers the ability
to run on CPU or GPU compute pools. When using the Snowflake ML APIs, the Container Runtime distributes the processing
across available resources.

Container Runtimes are versioned, allowing you to select specific runtime environments, pin your workloads to a specific version,
and migrate to updated container runtime environments at your own pace.

## Distributed processing

The Snowflake ML modeling and data loading APIs are built on top of Snowflake ML’s distributed processing framework,
which maximizes resource utilization by fully leveraging the available compute power. By default, this framework uses
all GPUs on multi-GPU nodes, offering significant performance improvements compared to open-source packages and reduces
overall runtime.

Machine learning workloads, including data loading, are executed in a Snowflake-managed compute environment. The
framework allows dynamic scaling of resources based on the specific requirements of the task at hand, such as training
models or loading data. The number of resources, including GPU and memory allocation for each task, can be easily
configured through the provided APIs.

## Optimized data loading

The Container Runtime provides a set of data connector APIs that enable connecting Snowflake data sources (including
tables, DataFrames, and Datasets) to popular ML frameworks such as PyTorch and TensorFlow, taking full advantage of
multiple cores or GPUs. Once loaded, the data can be processed using open source packages, or any of the Snowflake ML
APIs, including the distributed versions that are described below. These APIs are found in the `snowflake.ml.data`
namespace.

The [`snowflake.ml.data.data_connector.DataConnector`](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector.md "(in Snowpark ML API Reference (Python))") class connects Snowpark DataFrames or Snowflake ML Datasets to
TensorFlow or PyTorch DataSets or Pandas DataFrames. Instantiate a connector using one of the following class methods:

> * [`DataConnector.from_dataframe`](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector.md "(in Snowpark ML API Reference (Python))"): Accepts a Snowpark DataFrame.
> * [`DataConnector.from_dataset`](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector.md "(in Snowpark ML API Reference (Python))"): Accepts a Snowflake ML Dataset, specified by name and version.
> * [`DataConnector.from_sources`](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector.md "(in Snowpark ML API Reference (Python))"): Accepts list of sources, each of which can be a DataFrame or a Dataset.

Once you have instantiated the connector (calling the instance, for example, `data_connector`), call the following
methods to produce the desired kind of output.

* `data_connector.to_tf_dataset`: Returns a TensorFlow Dataset suitable for use with TensorFlow.
* `data_connector.to_torch_dataset`: Returns a PyTorch Dataset suitable for use with PyTorch.

For more information on these APIs, see the [Snowflake ML API reference](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/data).

## Building with open source

With the foundational CPU and GPU images that come pre-populated with popular ML packages, and the flexibility to
install additional libraries using `pip`, users can employ familiar and innovative open source frameworks inside Snowflake
Notebooks, without moving data out of Snowflake. You can scale processing by using Snowflake’s distributed
APIs for data loading, training, and hyperparameter optimization, with the familiar APIs of popular OSS
packages, with small changes to the interface to allow for scaling configurations.

The following code illustrates creating an XGBoost classifier using these APIs:

```python
from snowflake.snowpark.context import get_active_session
from snowflake.ml.data.data_connector import DataConnector
import pandas as pd
import xgboost as xgb
from sklearn.model_selection import train_test_split

session = get_active_session()

# Use the DataConnector API to pull in large data efficiently
df = session.table("my_dataset")
pandas_df = DataConnector.from_dataframe(df).to_pandas()

# Build with open source

X = df_pd[['feature1', 'feature2']]
y = df_pd['label']

# Split data into test and train in memory
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.15, random_state=34)

# Train in memory
model = xgb.XGBClassifier()
model.fit(X_train, y_train)

# Predict
y_pred = model.predict(X_test)
```

The CPU container runtime has different packages than the GPU container runtime. The following sections list the packages available within each container runtime.

## Snowflake Container Runtime packages

The full list of available packages in Snowflake Container Runtime is maintained as part of the [Container Runtime Release Notes](container-runtime/releases.md).

## Optimized training

Container Runtime offers a set of distributed training APIs, including distributed versions of LightGBM, PyTorch,
and XGBoost, that take full advantage of the available resources in the container environment. These are found in the
`snowflake.ml.modeling.distributors` namespace. The APIs of the distributed classes are similar to those of the
standard versions.

For more information on these APIs, see the [API reference](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/container-runtime/index).

### XGBoost

The primary XGBoost class is `snowflake.ml.modeling.distributors.xgboost.XGBEstimator`. Related classes include:

* `snowflake.ml.modeling.distributors.xgboost.XGBScalingConfig`

For an example of working with this API, see the
[XGBoost on GPU](https://github.com/Snowflake-Labs/sfguide-getting-started-with-container-runtime-apis/blob/main/XGBoost_on_GPU_Quickstart.ipynb)
example notebook in the Snowflake Container Runtime GitHub repository.

### LightGBM

The primary LightGBM class is `snowflake.ml.modeling.distributors.lightgbm.LightGBMEstimator`. Related classes include:

* `snowflake.ml.modeling.distributors.lightgbm.LightGBMScalingConfig`

For an example of working with this API, see the
[LightGBM on GPU](https://github.com/Snowflake-Labs/sfguide-getting-started-with-container-runtime-apis/blob/main/LightGBM_on_GPU_Quickstart.ipynb)
example notebook in the Snowflake Container Runtime GitHub repository.

### PyTorch

The primary PyTorch class is `snowflake.ml.modeling.distributors.pytorch.PyTorchDistributor`. Related classes and functions include:

* `snowflake.ml.modeling.distributors.pytorch.WorkerResourceConfig`
* `snowflake.ml.modeling.distributors.pytorch.PyTorchScalingConfig`
* `snowflake.ml.modeling.distributors.pytorch.Context`
* `snowflake.ml.modeling.distributors.pytorch.get_context`

For an example of working with this API, see the
[PyTorch on GPU](https://github.com/Snowflake-Labs/sfguide-getting-started-with-container-runtime-apis/blob/main/PyTorch_on_GPU_Quickstart.ipynb)
example notebook in the Snowflake Container Runtime GitHub repository.

## Next steps

* To try a Snowflake Notebook using Container Runtime, see [Notebooks on Container Runtime](notebooks-on-spcs.md).

---
title: Snowflake Container Runtime CPU Version 2.3
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime/releases/cpu/2_3.md
section: Snowflake ML
---

# Snowflake Container Runtime CPU Version 2.3

The following lists the packages available for each Python version of CPU version `2.3`.

Python 3.10Python 3.11Python 3.12

CPU Container Runtime Python 3.10 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| async-timeout | 5.0.1 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.5.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.2 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| exceptiongroup | 1.3.1 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 1.4.0 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 8.38.0 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.4.2 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.63.1 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.3 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| shellingham | 1.5.4 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.22.2 |
| toml | 0.10.2 |
| tomli | 2.4.0 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 5.1.0 |
| typer-slim | 0.21.1 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2025.6.1 |
| xarray-einstats | 0.8.0 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zipp | 3.23.0 |

CPU Container Runtime Python 3.11 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 1.4.0 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.63.1 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.3 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| preliz | 0.19.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pymc-marketing | 0.15.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyprojroot | 0.3.0 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| shellingham | 1.5.4 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.22.2 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 5.1.0 |
| typer-slim | 0.21.1 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.1.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zipp | 3.23.0 |

CPU Container Runtime Python 3.12 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.63.1 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.3 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| preliz | 0.19.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pymc-marketing | 0.15.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyprojroot | 0.3.0 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.1.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zipp | 3.23.0 |

---
title: Snowflake Container Runtime CPU Version 2.4 (Latest)
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime/releases/cpu/2_4.md
section: Snowflake ML
---

# Snowflake Container Runtime CPU Version 2.4 (Latest)

The following lists the packages available for each Python version of CPU version `2.4`.

Python 3.10Python 3.11Python 3.12

CPU Container Runtime Python 3.10 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| async-timeout | 5.0.1 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.5.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.2 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| exceptiongroup | 1.3.1 |
| executing | 2.2.1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| gcsfs | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| google-auth-oauthlib | 1.3.0 |
| google-cloud-core | 2.5.0 |
| google-cloud-storage | 3.10.0 |
| google-crc32c | 1.8.0 |
| google-resumable-media | 2.8.0 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| grpcio-status | 1.71.2 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 1.7.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 8.38.0 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| jpype1 | 1.6.0 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.4.2 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.64.0 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.7 |
| oauthlib | 3.3.1 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| py4j | 0.10.9.7 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| requests-oauthlib | 2.0.0 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| setuptools | 82.0.1 |
| shap | 0.49.1 |
| shellingham | 1.5.4 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| snowpark-connect | 1.18.0 |
| snowpark-connect-deps-1 | 3.56.4 |
| snowpark-connect-deps-2 | 3.56.4 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlglot | 30.0.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.22.2 |
| toml | 0.10.2 |
| tomli | 2.4.0 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 5.3.0 |
| typer | 0.24.1 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2025.6.1 |
| xarray-einstats | 0.8.0 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

CPU Container Runtime Python 3.11 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.3.1 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.8.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| gcsfs | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| google-auth-oauthlib | 1.3.0 |
| google-cloud-core | 2.5.0 |
| google-cloud-storage | 3.10.0 |
| google-crc32c | 1.8.0 |
| google-resumable-media | 2.8.0 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| grpcio-status | 1.71.2 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 1.7.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| jpype1 | 1.6.0 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.64.0 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.7 |
| oauthlib | 3.3.1 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| py4j | 0.10.9.7 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| requests-oauthlib | 2.0.0 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| setuptools | 82.0.1 |
| shap | 0.49.1 |
| shellingham | 1.5.4 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| snowpark-connect | 1.18.0 |
| snowpark-connect-deps-1 | 3.56.4 |
| snowpark-connect-deps-2 | 3.56.4 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlglot | 30.0.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.22.2 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 5.3.0 |
| typer | 0.24.1 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.2.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

CPU Container Runtime Python 3.12 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.13.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.3.1 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.8.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| gcsfs | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| google-auth-oauthlib | 1.3.0 |
| google-cloud-core | 2.5.0 |
| google-cloud-storage | 3.10.0 |
| google-crc32c | 1.8.0 |
| google-resumable-media | 2.8.0 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| grpcio-status | 1.71.2 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.2 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 9.11.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| jpype1 | 1.6.0 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.46.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.64.0 |
| numpy | 1.26.4 |
| nvidia-nccl-cu12 | 2.29.7 |
| oauthlib | 3.3.1 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| py4j | 0.10.9.7 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 7.0.2 |
| pystan | 3.10.1 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| requests-oauthlib | 2.0.0 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 82.0.1 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| snowpark-connect | 1.18.0 |
| snowpark-connect-deps-1 | 3.56.4 |
| snowpark-connect-deps-2 | 3.56.4 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlglot | 30.0.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cpu |
| torchvision | 0.21.0+cpu |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.2.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

---
title: Snowflake Container Runtime GPU Version 2.3
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime/releases/gpu/2_3.md
section: Snowflake ML
---

# Snowflake Container Runtime GPU Version 2.3

The following lists the packages available for each Python version of GPU version `2.3`.

Python 3.10Python 3.11Python 3.12

GPU Container Runtime Python 3.10 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| async-timeout | 5.0.1 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.5.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.2 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.5 |
| cuda-pathfinder | 1.3.3 |
| cuda-python | 12.9.5 |
| cudf-cu12 | 25.6.0 |
| cuml-cu12 | 25.6.0 |
| cupy-cuda12x | 13.6.0 |
| cuvs-cu12 | 25.6.1 |
| cycler | 0.12.1 |
| dask | 2025.5.0 |
| dask-cuda | 25.6.0 |
| dask-cudf-cu12 | 25.6.0 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| distributed | 2025.5.0 |
| distributed-ucxx-cu12 | 0.44.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| exceptiongroup | 1.3.1 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| fastrlock | 0.8.3 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 8.38.0 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| libcudf-cu12 | 25.6.0 |
| libcuml-cu12 | 25.6.0 |
| libcuvs-cu12 | 25.6.1 |
| libkvikio-cu12 | 25.6.0 |
| libraft-cu12 | 25.6.0 |
| librmm-cu12 | 25.6.0 |
| libucx-cu12 | 1.18.1 |
| libucxx-cu12 | 0.44.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| locket | 1.0.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.4.2 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.11.0 |
| numpy | 1.26.4 |
| nvidia-cublas-cu12 | 12.6.4.1 |
| nvidia-cuda-cupti-cu12 | 12.6.80 |
| nvidia-cuda-nvcc-cu12 | 12.9.86 |
| nvidia-cuda-nvrtc-cu12 | 12.6.77 |
| nvidia-cuda-runtime-cu12 | 12.6.77 |
| nvidia-cudnn-cu12 | 9.5.1.17 |
| nvidia-cufft-cu12 | 11.3.0.4 |
| nvidia-curand-cu12 | 10.3.7.77 |
| nvidia-cusolver-cu12 | 11.7.1.2 |
| nvidia-cusparse-cu12 | 12.5.4.2 |
| nvidia-cusparselt-cu12 | 0.6.3 |
| nvidia-ml-py | 12.575.51 |
| nvidia-nccl-cu12 | 2.21.5 |
| nvidia-nvjitlink-cu12 | 12.6.85 |
| nvidia-nvtx-cu12 | 12.6.77 |
| nvtx | 0.2.14 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.2.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| partd | 1.4.2 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 25.6.0 |
| pylibraft-cu12 | 25.6.0 |
| pymc | 5.25.1 |
| pynvjitlink-cu12 | 0.7.0 |
| pynvml | 12.0.0 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| raft-dask-cu12 | 25.6.0 |
| rapids-dask-dependency | 25.6.0 |
| rapids-logger | 0.1.19 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 25.6.0 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tblib | 3.2.2 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomli | 2.4.0 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cu126 |
| torchvision | 0.21.0+cu126 |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.4.1 |
| triton | 3.2.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| ucx-py-cu12 | 0.44.0 |
| ucxx-cu12 | 0.44.0 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2025.6.1 |
| xarray-einstats | 0.8.0 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zict | 3.0.0 |
| zipp | 3.23.0 |

GPU Container Runtime Python 3.11 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.5 |
| cuda-pathfinder | 1.3.3 |
| cuda-python | 12.9.5 |
| cudf-cu12 | 25.6.0 |
| cuml-cu12 | 25.6.0 |
| cupy-cuda12x | 13.6.0 |
| cuvs-cu12 | 25.6.1 |
| cycler | 0.12.1 |
| dask | 2025.5.0 |
| dask-cuda | 25.6.0 |
| dask-cudf-cu12 | 25.6.0 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| distributed | 2025.5.0 |
| distributed-ucxx-cu12 | 0.44.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| fastrlock | 0.8.3 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| libcudf-cu12 | 25.6.0 |
| libcuml-cu12 | 25.6.0 |
| libcuvs-cu12 | 25.6.1 |
| libkvikio-cu12 | 25.6.0 |
| libraft-cu12 | 25.6.0 |
| librmm-cu12 | 25.6.0 |
| libucx-cu12 | 1.18.1 |
| libucxx-cu12 | 0.44.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| locket | 1.0.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.11.0 |
| numpy | 1.26.4 |
| nvidia-cublas-cu12 | 12.6.4.1 |
| nvidia-cuda-cupti-cu12 | 12.6.80 |
| nvidia-cuda-nvcc-cu12 | 12.9.86 |
| nvidia-cuda-nvrtc-cu12 | 12.6.77 |
| nvidia-cuda-runtime-cu12 | 12.6.77 |
| nvidia-cudnn-cu12 | 9.5.1.17 |
| nvidia-cufft-cu12 | 11.3.0.4 |
| nvidia-curand-cu12 | 10.3.7.77 |
| nvidia-cusolver-cu12 | 11.7.1.2 |
| nvidia-cusparse-cu12 | 12.5.4.2 |
| nvidia-cusparselt-cu12 | 0.6.3 |
| nvidia-ml-py | 12.575.51 |
| nvidia-nccl-cu12 | 2.21.5 |
| nvidia-nvjitlink-cu12 | 12.6.85 |
| nvidia-nvtx-cu12 | 12.6.77 |
| nvtx | 0.2.14 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.2.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| partd | 1.4.2 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| preliz | 0.19.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 25.6.0 |
| pylibraft-cu12 | 25.6.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pymc-marketing | 0.15.1 |
| pynvjitlink-cu12 | 0.7.0 |
| pynvml | 12.0.0 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyprojroot | 0.3.0 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| raft-dask-cu12 | 25.6.0 |
| rapids-dask-dependency | 25.6.0 |
| rapids-logger | 0.1.19 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 25.6.0 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tblib | 3.2.2 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cu126 |
| torchvision | 0.21.0+cu126 |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.4.1 |
| triton | 3.2.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| ucx-py-cu12 | 0.44.0 |
| ucxx-cu12 | 0.44.0 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.1.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zict | 3.0.0 |
| zipp | 3.23.0 |

GPU Container Runtime Python 3.12 Version `2.3` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.12.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.1.0 |
| attrs | 25.4.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.2.0 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.7.0 |
| certifi | 2026.1.4 |
| cffi | 1.17.1 |
| charset-normalizer | 3.4.4 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.5 |
| cuda-pathfinder | 1.3.3 |
| cuda-python | 12.9.5 |
| cudf-cu12 | 25.6.0 |
| cuml-cu12 | 25.6.0 |
| cupy-cuda12x | 13.6.0 |
| cuvs-cu12 | 25.6.1 |
| cycler | 0.12.1 |
| dask | 2025.5.0 |
| dask-cuda | 25.6.0 |
| dask-cudf-cu12 | 25.6.0 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.1 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| distributed | 2025.5.0 |
| distributed-ucxx-cu12 | 0.44.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| fastapi | 0.128.2 |
| fastjsonschema | 2.21.2 |
| fastrlock | 0.8.3 |
| filelock | 3.20.3 |
| flaml | 2.5.0 |
| flask | 3.1.2 |
| fonttools | 4.61.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.29.0 |
| google-auth | 2.48.0 |
| googleapis-common-protos | 1.72.0 |
| graphviz | 0.21 |
| grpcio | 1.76.0 |
| gunicorn | 25.0.1 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.15.1 |
| hf-xet | 1.2.0 |
| holidays | 0.90 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.1 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.1.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.3 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.4.9 |
| lark | 1.3.1 |
| libcudf-cu12 | 25.6.0 |
| libcuml-cu12 | 25.6.0 |
| libcuvs-cu12 | 25.6.1 |
| libkvikio-cu12 | 25.6.0 |
| libraft-cu12 | 25.6.0 |
| librmm-cu12 | 25.6.0 |
| libucx-cu12 | 1.18.1 |
| libucxx-cu12 | 0.44.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| locket | 1.0.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.2.18 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.16.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.2 |
| notebook | 7.5.3 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.11.0 |
| numpy | 1.26.4 |
| nvidia-cublas-cu12 | 12.6.4.1 |
| nvidia-cuda-cupti-cu12 | 12.6.80 |
| nvidia-cuda-nvcc-cu12 | 12.9.86 |
| nvidia-cuda-nvrtc-cu12 | 12.6.77 |
| nvidia-cuda-runtime-cu12 | 12.6.77 |
| nvidia-cudnn-cu12 | 9.5.1.17 |
| nvidia-cufft-cu12 | 11.3.0.4 |
| nvidia-curand-cu12 | 10.3.7.77 |
| nvidia-cusolver-cu12 | 11.7.1.2 |
| nvidia-cusparse-cu12 | 12.5.4.2 |
| nvidia-cusparselt-cu12 | 0.6.3 |
| nvidia-ml-py | 12.575.51 |
| nvidia-nccl-cu12 | 2.21.5 |
| nvidia-nvjitlink-cu12 | 12.6.85 |
| nvidia-nvtx-cu12 | 12.6.77 |
| nvtx | 0.2.14 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.39.1 |
| opentelemetry-exporter-prometheus | 0.60b1 |
| opentelemetry-proto | 1.27.0 |
| opentelemetry-sdk | 1.39.1 |
| opentelemetry-semantic-conventions | 0.60b1 |
| orderly-set | 5.5.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.2.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.5 |
| partd | 1.4.2 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.5.1 |
| plotly | 6.5.2 |
| polars | 1.38.0 |
| polars-runtime-32 | 1.38.0 |
| preliz | 0.19.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 4.25.8 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.2 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.11.0 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 25.6.0 |
| pylibraft-cu12 | 25.6.0 |
| pymc | 5.25.1 |
| pymc-extras | 0.3.1 |
| pymc-marketing | 0.15.1 |
| pynvjitlink-cu12 | 0.7.0 |
| pynvml | 12.0.0 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyprojroot | 0.3.0 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2025.2 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| raft-dask-cu12 | 25.6.0 |
| rapids-dask-dependency | 25.6.0 |
| rapids-logger | 0.1.19 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.1.15 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 25.6.0 |
| rpds-py | 0.30.0 |
| rsa | 4.9.1 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 80.10.2 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.0 |
| smmap | 5.0.2 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.11.0 |
| snowflake-connector-python | 3.18.0 |
| snowflake-core | 1.11.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.26.0 |
| snowflake-snowpark-python | 1.45.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.50.0 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.13.1 |
| tblib | 3.2.2 |
| tenacity | 9.1.3 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.6.0+cu126 |
| torchvision | 0.21.0+cu126 |
| tornado | 6.5.4 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.4.1 |
| triton | 3.2.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| ucx-py-cu12 | 0.44.0 |
| ucxx-cu12 | 0.44.0 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.40.0 |
| virtualenv | 20.36.1 |
| watchdog | 5.0.3 |
| wcwidth | 0.5.3 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.5 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.1.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.1.3 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.22.0 |
| zict | 3.0.0 |
| zipp | 3.23.0 |

---
title: Snowflake Container Runtime GPU Version 2.4 (Latest)
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/container-runtime/releases/gpu/2_4.md
section: Snowflake ML
---

# Snowflake Container Runtime GPU Version 2.4 (Latest)

The following lists the packages available for each Python version of GPU version `2.4`.

Python 3.10Python 3.11Python 3.12

GPU Container Runtime Python 3.10 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.13.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| async-timeout | 5.0.1 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.5.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.2 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.6 |
| cuda-core | 0.6.0 |
| cuda-pathfinder | 1.4.3 |
| cuda-python | 12.9.6 |
| cuda-toolkit | 12.8.1 |
| cudf-cu12 | 26.2.1 |
| cuml-cu12 | 26.2.0 |
| cupy-cuda12x | 14.0.1 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| exceptiongroup | 1.3.1 |
| executing | 2.2.1 |
| faiss-gpu-cu12 | 1.14.1.post1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.78.0 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.2 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 8.38.0 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| libcudf-cu12 | 26.2.1 |
| libcuml-cu12 | 26.2.0 |
| libkvikio-cu12 | 26.2.0 |
| libraft-cu12 | 26.2.0 |
| librmm-cu12 | 26.2.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.4.2 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.22.2 |
| numpy | 2.2.6 |
| nvidia-cublas-cu12 | 12.8.4.1 |
| nvidia-cuda-cccl-cu12 | 12.9.27 |
| nvidia-cuda-cupti-cu12 | 12.8.90 |
| nvidia-cuda-nvcc-cu12 | 12.8.93 |
| nvidia-cuda-nvrtc-cu12 | 12.8.93 |
| nvidia-cuda-runtime-cu12 | 12.8.90 |
| nvidia-cudnn-cu12 | 9.10.2.21 |
| nvidia-cufft-cu12 | 11.3.3.83 |
| nvidia-cufile-cu12 | 1.13.1.3 |
| nvidia-curand-cu12 | 10.3.9.90 |
| nvidia-cusolver-cu12 | 11.7.3.90 |
| nvidia-cusparse-cu12 | 12.5.8.93 |
| nvidia-cusparselt-cu12 | 0.7.1 |
| nvidia-libnvcomp-cu12 | 5.1.0.21 |
| nvidia-ml-py | 13.595.45 |
| nvidia-nccl-cu12 | 2.27.3 |
| nvidia-nvjitlink-cu12 | 12.8.93 |
| nvidia-nvtx-cu12 | 12.8.90 |
| nvtx | 0.2.15 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 26.2.1 |
| pylibraft-cu12 | 26.2.0 |
| pymc | 5.25.1 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.31.7 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| rapids-logger | 0.2.3 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 26.2.0 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 82.0.1 |
| shap | 0.49.1 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.14.0 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomli | 2.4.0 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.8.0+cu128 |
| torchvision | 0.23.0+cu128 |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.7.0 |
| triton | 3.4.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2025.6.1 |
| xarray-einstats | 0.8.0 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

GPU Container Runtime Python 3.11 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.13.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| arviz-stats | 0.8.0 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.3.1 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.8.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.6 |
| cuda-core | 0.6.0 |
| cuda-pathfinder | 1.4.3 |
| cuda-python | 12.9.6 |
| cuda-toolkit | 12.8.1 |
| cudf-cu12 | 26.2.1 |
| cuml-cu12 | 26.2.0 |
| cupy-cuda12x | 14.0.1 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| faiss-gpu-cu12 | 1.14.1.post1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.78.0 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.2 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 9.10.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| libcudf-cu12 | 26.2.1 |
| libcuml-cu12 | 26.2.0 |
| libkvikio-cu12 | 26.2.0 |
| libraft-cu12 | 26.2.0 |
| librmm-cu12 | 26.2.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.22.2 |
| numpy | 2.2.6 |
| nvidia-cublas-cu12 | 12.8.4.1 |
| nvidia-cuda-cccl-cu12 | 12.9.27 |
| nvidia-cuda-cupti-cu12 | 12.8.90 |
| nvidia-cuda-nvcc-cu12 | 12.8.93 |
| nvidia-cuda-nvrtc-cu12 | 12.8.93 |
| nvidia-cuda-runtime-cu12 | 12.8.90 |
| nvidia-cudnn-cu12 | 9.10.2.21 |
| nvidia-cufft-cu12 | 11.3.3.83 |
| nvidia-cufile-cu12 | 1.13.1.3 |
| nvidia-curand-cu12 | 10.3.9.90 |
| nvidia-cusolver-cu12 | 11.7.3.90 |
| nvidia-cusparse-cu12 | 12.5.8.93 |
| nvidia-cusparselt-cu12 | 0.7.1 |
| nvidia-libnvcomp-cu12 | 5.1.0.21 |
| nvidia-ml-py | 13.595.45 |
| nvidia-nccl-cu12 | 2.27.3 |
| nvidia-nvjitlink-cu12 | 12.8.93 |
| nvidia-nvtx-cu12 | 12.8.90 |
| nvtx | 0.2.15 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| overrides | 7.7.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| preliz | 0.22.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 26.2.1 |
| pylibraft-cu12 | 26.2.0 |
| pymc | 5.28.2 |
| pymc-extras | 0.10.0 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 6.0.2 |
| pystan | 3.10.0 |
| pytensor | 2.38.2 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| rapids-logger | 0.2.3 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 26.2.0 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 82.0.1 |
| shap | 0.51.0 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.14.0 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.8.0+cu128 |
| torchvision | 0.23.0+cu128 |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.7.0 |
| triton | 3.4.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.2.0 |
| xarray-einstats | 0.9.1 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

GPU Container Runtime Python 3.12 Version `2.4` has the following packages:

| Package | Version |
| --- | --- |
| absl-py | 2.4.0 |
| accelerate | 1.13.0 |
| aiobotocore | 2.26.0 |
| aiohappyeyeballs | 2.6.1 |
| aiohttp | 3.13.3 |
| aiohttp-cors | 0.8.1 |
| aioitertools | 0.13.0 |
| aiosignal | 1.4.0 |
| altair | 5.5.0 |
| annotated-doc | 0.0.4 |
| annotated-types | 0.7.0 |
| anyio | 4.12.1 |
| appdirs | 1.4.4 |
| argon2-cffi | 25.1.0 |
| argon2-cffi-bindings | 25.1.0 |
| arrow | 1.4.0 |
| arviz | 0.23.4 |
| arviz-stats | 1.0.0 |
| asn1crypto | 1.5.1 |
| asttokens | 3.0.1 |
| async-lru | 2.3.0 |
| attrs | 26.1.0 |
| babel | 2.18.0 |
| bayesian-optimization | 1.5.1 |
| beautifulsoup4 | 4.14.3 |
| better-optimize | 0.3.1 |
| bleach | 6.3.0 |
| blinker | 1.9.0 |
| boto3 | 1.41.5 |
| botocore | 1.41.5 |
| cachetools | 5.5.2 |
| causalpy | 0.8.0 |
| certifi | 2026.2.25 |
| cffi | 2.0.0 |
| charset-normalizer | 3.4.6 |
| click | 8.2.1 |
| clikit | 0.6.2 |
| cloudpickle | 3.1.1 |
| cmdstanpy | 1.3.0 |
| colorama | 0.4.6 |
| colorful | 0.5.8 |
| comm | 0.2.3 |
| cons | 0.4.7 |
| contourpy | 1.3.3 |
| crashtest | 0.3.1 |
| cryptography | 43.0.3 |
| cuda-bindings | 12.9.6 |
| cuda-core | 0.6.0 |
| cuda-pathfinder | 1.4.3 |
| cuda-python | 12.9.6 |
| cuda-toolkit | 12.8.1 |
| cudf-cu12 | 26.2.1 |
| cuml-cu12 | 26.2.0 |
| cupy-cuda12x | 14.0.1 |
| cycler | 0.12.1 |
| datasets | 4.0.0 |
| debugpy | 1.8.20 |
| decorator | 5.2.1 |
| deepdiff | 8.6.2 |
| defusedxml | 0.7.1 |
| dill | 0.3.8 |
| distlib | 0.4.0 |
| etuples | 0.3.10 |
| evaluate | 0.4.6 |
| executing | 2.2.1 |
| faiss-gpu-cu12 | 1.14.1.post1 |
| fastapi | 0.135.1 |
| fastjsonschema | 2.21.2 |
| filelock | 3.25.2 |
| flaml | 2.5.0 |
| flask | 3.1.3 |
| fonttools | 4.62.1 |
| fqdn | 1.5.1 |
| frozenlist | 1.8.0 |
| fsspec | 2025.3.0 |
| geojson | 3.2.0 |
| gitdb | 4.0.12 |
| gitpython | 3.1.46 |
| google-api-core | 2.30.0 |
| google-auth | 2.49.1 |
| googleapis-common-protos | 1.73.0 |
| graphviz | 0.21 |
| grpcio | 1.78.0 |
| gunicorn | 25.0.3 |
| h11 | 0.16.0 |
| h5netcdf | 1.8.1 |
| h5py | 3.16.0 |
| hf-xet | 1.4.2 |
| holidays | 0.93 |
| httpcore | 1.0.9 |
| httpstan | 4.13.0 |
| httpx | 0.28.1 |
| huggingface-hub | 0.36.2 |
| idna | 3.11 |
| importlib-metadata | 8.7.1 |
| importlib-resources | 6.5.2 |
| inflection | 0.5.1 |
| ipykernel | 7.2.0 |
| ipython | 9.11.0 |
| ipython-pygments-lexers | 1.1.1 |
| isoduration | 20.11.0 |
| itsdangerous | 2.2.0 |
| jedi | 0.19.2 |
| jinja2 | 3.1.6 |
| jmespath | 1.1.0 |
| joblib | 1.5.3 |
| json5 | 0.13.0 |
| jsonpointer | 3.0.0 |
| jsonschema | 4.26.0 |
| jsonschema-specifications | 2025.9.1 |
| jupyter-client | 8.8.0 |
| jupyter-core | 5.9.1 |
| jupyter-events | 0.12.0 |
| jupyter-lsp | 2.3.0 |
| jupyter-server | 2.17.0 |
| jupyter-server-terminals | 0.5.4 |
| jupyterlab | 4.5.6 |
| jupyterlab-pygments | 0.3.0 |
| jupyterlab-server | 2.28.0 |
| kiwisolver | 1.5.0 |
| lark | 1.3.1 |
| libcudf-cu12 | 26.2.1 |
| libcuml-cu12 | 26.2.0 |
| libkvikio-cu12 | 26.2.0 |
| libraft-cu12 | 26.2.0 |
| librmm-cu12 | 26.2.0 |
| lightgbm | 4.6.0 |
| lightgbm-ray | 0.1.9 |
| llvmlite | 0.44.0 |
| logical-unification | 0.4.7 |
| lxml | 6.0.2 |
| markdown-it-py | 4.0.0 |
| markupsafe | 3.0.3 |
| marshmallow | 3.26.2 |
| matplotlib | 3.10.8 |
| matplotlib-inline | 0.2.1 |
| mdurl | 0.1.2 |
| minikanren | 1.0.5 |
| mistune | 3.2.0 |
| mlruntimes-service | 2.5.7 |
| modin | 0.37.1 |
| mpmath | 1.3.0 |
| msgpack | 1.1.2 |
| multidict | 6.7.1 |
| multipledispatch | 1.0.0 |
| multiprocess | 0.70.16 |
| narwhals | 2.18.0 |
| nbclient | 0.10.4 |
| nbconvert | 7.17.0 |
| nbformat | 5.10.4 |
| nest-asyncio | 1.6.0 |
| networkx | 3.6.1 |
| nltk | 3.9.3 |
| nodeenv | 1.10.0 |
| nodejs-wheel-binaries | 24.14.0 |
| notebook | 7.5.5 |
| notebook-shim | 0.2.4 |
| numba | 0.61.2 |
| numba-cuda | 0.22.2 |
| numpy | 2.2.6 |
| nvidia-cublas-cu12 | 12.8.4.1 |
| nvidia-cuda-cccl-cu12 | 12.9.27 |
| nvidia-cuda-cupti-cu12 | 12.8.90 |
| nvidia-cuda-nvcc-cu12 | 12.8.93 |
| nvidia-cuda-nvrtc-cu12 | 12.8.93 |
| nvidia-cuda-runtime-cu12 | 12.8.90 |
| nvidia-cudnn-cu12 | 9.10.2.21 |
| nvidia-cufft-cu12 | 11.3.3.83 |
| nvidia-cufile-cu12 | 1.13.1.3 |
| nvidia-curand-cu12 | 10.3.9.90 |
| nvidia-cusolver-cu12 | 11.7.3.90 |
| nvidia-cusparse-cu12 | 12.5.8.93 |
| nvidia-cusparselt-cu12 | 0.7.1 |
| nvidia-libnvcomp-cu12 | 5.1.0.21 |
| nvidia-ml-py | 13.595.45 |
| nvidia-nccl-cu12 | 2.27.3 |
| nvidia-nvjitlink-cu12 | 12.8.93 |
| nvidia-nvtx-cu12 | 12.8.90 |
| nvtx | 0.2.15 |
| opencensus | 0.11.4 |
| opencensus-context | 0.1.3 |
| opentelemetry-api | 1.40.0 |
| opentelemetry-exporter-prometheus | 0.61b0 |
| opentelemetry-proto | 1.40.0 |
| opentelemetry-sdk | 1.40.0 |
| opentelemetry-semantic-conventions | 0.61b0 |
| orderly-set | 5.5.0 |
| packaging | 24.2 |
| pandapower | 3.1.2 |
| pandas | 2.3.3 |
| pandocfilters | 1.5.1 |
| parso | 0.8.6 |
| pastel | 0.2.1 |
| patsy | 1.0.2 |
| peft | 0.17.1 |
| pexpect | 4.9.0 |
| pillow | 10.4.0 |
| platformdirs | 4.9.4 |
| plotly | 6.6.0 |
| polars | 1.39.2 |
| polars-runtime-32 | 1.39.2 |
| preliz | 0.22.0 |
| prometheus-client | 0.22.1 |
| prompt-toolkit | 3.0.52 |
| propcache | 0.4.1 |
| prophet | 1.3.0 |
| proto-plus | 1.27.1 |
| protobuf | 5.29.6 |
| psutil | 7.2.2 |
| ptyprocess | 0.7.0 |
| pure-eval | 0.2.3 |
| py-spy | 0.4.1 |
| pyarrow | 18.1.0 |
| pyasn1 | 0.6.3 |
| pyasn1-modules | 0.4.2 |
| pycparser | 3.0 |
| pydantic | 2.12.5 |
| pydantic-core | 2.41.5 |
| pydeck | 0.9.1 |
| pygments | 2.19.2 |
| pyjwt | 2.12.1 |
| pylev | 1.4.0 |
| pylibcudf-cu12 | 26.2.1 |
| pylibraft-cu12 | 26.2.0 |
| pymc | 5.28.2 |
| pymc-extras | 0.10.0 |
| pyopenssl | 25.1.0 |
| pyparsing | 3.3.2 |
| pyright | 1.1.408 |
| pysimdjson | 7.0.2 |
| pystan | 3.10.1 |
| pytensor | 2.38.2 |
| python-dateutil | 2.9.0.post0 |
| python-discovery | 1.2.0 |
| python-json-logger | 4.0.0 |
| pytimeparse | 1.1.8 |
| pytz | 2026.1.post1 |
| pyyaml | 6.0.3 |
| pyzmq | 27.1.0 |
| rapids-logger | 0.2.3 |
| ray | 2.53.0 |
| referencing | 0.37.0 |
| regex | 2026.2.28 |
| requests | 2.32.5 |
| retrying | 1.4.2 |
| rfc3339-validator | 0.1.4 |
| rfc3986-validator | 0.1.1 |
| rfc3987-syntax | 1.1.0 |
| rich | 13.9.4 |
| rmm-cu12 | 26.2.0 |
| rpds-py | 0.30.0 |
| s3fs | 2025.3.0 |
| s3transfer | 0.15.0 |
| safetensors | 0.7.0 |
| scikit-learn | 1.7.2 |
| scipy | 1.15.3 |
| seaborn | 0.13.2 |
| send2trash | 2.1.0 |
| sentencepiece | 0.2.1 |
| setuptools | 82.0.1 |
| shap | 0.51.0 |
| six | 1.17.0 |
| slicer | 0.0.8 |
| smart-open | 7.5.1 |
| smmap | 5.0.3 |
| snowbooks | 1.76.10rc1 |
| snowflake | 1.12.0 |
| snowflake-connector-python | 4.0.0 |
| snowflake-core | 1.12.0 |
| snowflake-legacy | 1.0.2 |
| snowflake-ml-python | 1.31.0 |
| snowflake-snowpark-python | 1.47.0 |
| sortedcontainers | 2.4.0 |
| soupsieve | 2.8.3 |
| sqlparse | 0.5.5 |
| stack-data | 0.6.3 |
| stanio | 0.5.1 |
| starlette | 0.52.1 |
| statsmodels | 0.14.6 |
| streamlit | 1.39.1 |
| sympy | 1.14.0 |
| tenacity | 9.1.4 |
| terminado | 0.18.1 |
| threadpoolctl | 3.6.0 |
| tinycss2 | 1.4.0 |
| tokenizers | 0.21.4 |
| toml | 0.10.2 |
| tomlkit | 0.14.0 |
| toolz | 1.1.0 |
| torch | 2.8.0+cu128 |
| torchvision | 0.23.0+cu128 |
| tornado | 6.5.5 |
| tqdm | 4.67.3 |
| traitlets | 5.14.3 |
| transformers | 4.51.3 |
| treelite | 4.7.0 |
| triton | 3.4.0 |
| typing-extensions | 4.15.0 |
| typing-inspection | 0.4.2 |
| tzdata | 2025.3 |
| tzlocal | 5.3.1 |
| uri-template | 1.3.0 |
| urllib3 | 2.6.3 |
| uvicorn | 0.42.0 |
| virtualenv | 21.2.0 |
| watchdog | 5.0.3 |
| wcwidth | 0.6.0 |
| webargs | 8.7.1 |
| webcolors | 25.10.0 |
| webencodings | 0.5.1 |
| websocket-client | 1.9.0 |
| werkzeug | 3.1.6 |
| wheel | 0.46.3 |
| wrapt | 1.17.3 |
| xarray | 2026.2.0 |
| xarray-einstats | 0.10.0 |
| xgboost | 3.2.0 |
| xgboost-ray | 0.1.19 |
| xxhash | 3.6.0 |
| yarl | 1.23.0 |
| zipp | 3.23.0 |

---
title: Snowflake Datasets
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/dataset.md
section: Snowflake ML
---

# Snowflake Datasets

Datasets are new Snowflake schema-level objects specifically designed for machine learning workflows. Snowflake Datasets
hold collections of data organized into versions. Each version holds a materialized snapshot of your data with
guaranteed immutability, efficient data access, and interoperability with popular deep learning frameworks.

Use Snowflake Datasets in the following situations:

* You need to manage and version large datasets for reproducible machine learning model training and testing.
* You need fine-grained file-level access and/or data shuffling for distributed training or data streaming.
* You need to integrate with external machine learning frameworks and tools.
* You need to track the lineage used to create an ML model.

Datasets are materialized data objects. You can use either Snowflake ML or SQL commands to interact with them.
They don’t appear in the Snowsight database object explorer.

> **Note:**
>
> * Datasets incur storage costs. Delete unused datasets to minimize costs.
> * Datasets created before the general availability release on [March 20, 2025](../../release-notes/2025/other/2025-03-20-snowflake-ml-datasets.md), don’t support replication.
>   For more information, see [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md).

## Installation

The Dataset Python SDK is included in Snowpark ML (Python package `snowflake-ml-python`) starting in version 1.7.5.
For installation instructions, see [Using Snowflake ML Locally](snowpark-ml.md).

## Required privileges

Creating Datasets requires the CREATE DATASET schema-level privilege. Modifying Datasets, for example adding or deleting
dataset versions, requires OWNERSHIP on the Dataset. Reading from a Dataset requires only the USAGE privilege on the
Dataset (or OWNERSHIP). For more information about granting privileges in Snowflake, see [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md).

> **Tip:**
>
> Setting up privileges for the Snowflake Feature Store using either the `setup_feature_store` method or the
> [privilege setup SQL script](feature-store/rbac.md) also sets up Dataset privileges.
> If you have already set up feature store privileges by one of these methods, no further action is needed.

## Creating and using Datasets

You can create and manage datasets with either SQL or Python. For information about using the SQL commands, see SQL commands.
For information about using the Python API, see [snowflake.ml.dataset](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/dataset).

Create a Dataset by passing a Snowpark DataFrame to the `snowflake.ml.dataset.create_from_dataframe` function.

```python
from snowflake import snowpark
from snowflake.ml import dataset

# Create Snowpark Session
# See https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-session
session = snowpark.Session.builder.configs(connection_parameters).create()

# Create a Snowpark DataFrame to serve as a data source
# In this example, we generate a random table with 100 rows and 1 column
df = session.sql(
  "select uniform(0, 10, random(1)) as x, uniform(0, 10, random(2)) as y from table(generator(rowcount => 100))"
)

# Materialize DataFrame contents into a Dataset
ds1 = dataset.create_from_dataframe(
    session,
    "my_dataset",
    "version1",
    input_dataframe=df)
```

Datasets are versioned. Each version is an immutable, point-in-time snapshot of the data managed by the Dataset. The
Python API includes a `Dataset.selected_version` property that indicates whether a given dataset is selected for use.
This property is automatically set by the `dataset.create_from_dataframe` and `dataset.load_dataset` factory
methods, so creating a dataset automatically selects the created version. The `Dataset.select_version` and
`Dataset.create_version` methods can also be used to explicitly switch between versions. Reading from a Dataset
reads from the active selected version.

```python
# Inspect currently selected version
print(ds1.selected_version) # DatasetVersion(dataset='my_dataset', version='version1')
print(ds1.selected_version.created_on) # Prints creation timestamp

# List all versions in the Dataset
print(ds1.list_versions()) # ["version1"]

# Create a new version
ds2 = ds1.create_version("version2", df)
print(ds1.selected_version.name)  # "version1"
print(ds2.selected_version.name)  # "version2"
print(ds1.list_versions())        # ["version1", "version2"]

# selected_version is immutable, meaning switching versions with
# ds1.select_version() returns a new Dataset object without
# affecting ds1.selected_version
ds3 = ds1.select_version("version2")
print(ds1.selected_version.name)  # "version1"
print(ds3.selected_version.name)  # "version2"
```

## Reading data from Datasets

Dataset version data is stored as evenly sized files in the Apache Parquet format. The API is extensible to support custom framework
connectors.

Reading from a Dataset requires an active selected version.

### Connect to TensorFlow

Datasets can be converted to TensorFlow’s `tf.data.Dataset` and streamed in batches for efficient training and evaluation.

```python
import tensorflow as tf

# Convert Snowflake Dataset to TensorFlow Dataset
tf_dataset = ds1.read.to_tf_dataset(batch_size=32)

# Train a TensorFlow model
for batch in tf_dataset:
    # Extract and build tensors as needed
    input_tensor = tf.stack(list(batch.values()), axis=-1)

    # Forward pass (details not included for brevity)
    outputs = model(input_tensor)
```

### Connect to PyTorch

Datasets also support conversion to PyTorch DataPipes and can be streamed in batches for efficient training and
evaluation.

```python
import torch

# Convert Snowflake Dataset to PyTorch DataPipe
pt_datapipe = ds1.read.to_torch_datapipe(batch_size=32)

# Train a PyTorch model
for batch in pt_datapipe:
    # Extract and build tensors as needed
    input_tensor = torch.stack([torch.from_numpy(v) for v in batch.values()], dim=-1)

    # Forward pass (details not included for brevity)
    outputs = model(input_tensor)
```

### Connect to Snowpark ML

Datasets can also be converted back to Snowpark DataFrames for integration with Snowpark ML Modeling. The converted
Snowpark DataFrame is not the same as the DataFrame that was provided during Dataset creation, but instead points to the
materialized data in the Dataset version.

```python
from snowflake.ml.modeling.ensemble import random_forest_regressor

# Get a Snowpark DataFrame
ds_df = ds1.read.to_snowpark_dataframe()

# Note ds_df != df
ds_df.explain()
df.explain()

# Train a model in Snowpark ML
xgboost_model = random_forest_regressor.RandomForestRegressor(
    n_estimators=100,
    random_state=42,
    input_cols=["X"],
    label_cols=["Y"],
)
xgboost_model.fit(ds_df)
```

### Direct file access

The Dataset API also exposes an [fsspec](https://filesystem-spec.readthedocs.io/en/latest/) interface, which can be
used to build custom integrations with external libraries like PyArrow, Dask, or any other package that supports
`fsspec` and allows distributed and/or stream-based model training.

```python
print(ds1.read.files()) # ['snow://dataset/my_dataset/versions/version1/data_0_0_0.snappy.parquet']

import pyarrow.parquet as pq
pd_ds = pq.ParquetDataset(ds1.read.files(), filesystem=ds1.read.filesystem())

import dask.dataframe as dd
dd_df = dd.read_parquet(ds1.read.files(), filesystem=ds1.read.filesystem())
```

## Dataset, Feature Store, Model Registry, and ML Lineage

> Datasets are deeply integrated into the Snowflake ML ecosystem to provide a seamless end-to-end model development and
> MLOps experience inside Snowflake. Datasets can be produced from Snowflake Feature Store features by using the
> `FeatureStore.generate_dataset` API. Datasets can then be converted to Snowpark DataFrames and passed to Snowpark ML
> Modeling for model training. The trained model can then be logged to Snowflake Model Registry, automatically completing
> the ML Lineage graph linking source data, feature views, datasets, and models for full end-to-end governance.

### Use SQL to read from a dataset version

You can use standard Snowflake SQL commands to read data from a dataset version. You can use SQL commands to do the following operations:

* List files
* Infer schema
* Query data directly from stage.

> **Important:**
>
> You must have the USAGE or OWNERSHIP privilege on the dataset to read from it.

#### List files from a dataset version

Use the `LIST snow_url` command to list files in a dataset version. Use the following SQL syntax to list all files within a dataset version:

```sqlsyntax
LIST 'snow://dataset/<dataset_name>/versions/<dataset_version>'
```

#### Analyze files and get column definitions

Use the [INFER_SCHEMA](../../sql-reference/functions/infer_schema.md) function to analyze files in a dataset version and retrieve column definitions. Use the following SQL example to list all files within a dataset version:

```sqlsyntax
INFER_SCHEMA(
  LOCATION => 'snow://dataset/<dataset_name>/versions/<dataset_version>',
  FILE_FORMAT => '<file_format_name>'
)
```

You must use the pattern specified in the example to get the location of the dataset version.

For `FILE_FORMAT`, specify `PARQUET`.

The following example creates a file format and runs the INFER_SCHEMA function:

```sqlexample
CREATE FILE FORMAT my_parquet_format TYPE = PARQUET;

SELECT *
FROM TABLE(
    INFER_SCHEMA(
        FILE_FORMAT => 'snow://dataset/MYDS/versions/v1,
        FILE_FORMAT => 'my_parquet_format'
    )
);
```

#### Stage query

Query data directly from the files stored in a dataset version, in a similar manner to querying an external table. Use the following SQL example to help you get started:

```sqlsyntax
SELECT $1
FROM 'snow://dataset/foo/versions/V1'
( FILE_FORMAT => 'my_parquet_format',
PATTERN => '.*data.*' ) t;
```

## SQL commands

You can use SQL commands to create and manage datasets. For more information, see:

* [CREATE DATASET](../../sql-reference/sql/create-dataset.md)
* [ALTER DATASET](../../sql-reference/sql/alter-dataset.md)
* [SHOW DATASETS](../../sql-reference/sql/show-datasets.md)
* [SHOW VERSIONS IN DATASET](../../sql-reference/sql/show-versions-in-dataset.md)

## Current limitations and known issues

* Dataset names are SQL identifiers and subject to [Snowflake identifier requirements](../../sql-reference/identifiers-syntax.md).
* Dataset versions are strings and have a maximum length of 128 characters. Some characters are not permitted and will
  produce an error message.
* Certain query operations on Datasets with wide schemas (more than about 4,000 columns) are not fully optimized. This
  should improve in upcoming releases.

---
title: Snowflake Feature Store
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/overview.md
section: Snowflake ML
---

# Snowflake Feature Store

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

The Snowflake Feature Store lets data scientists and ML engineers create, maintain, and use ML features in data science
and ML workloads, all within Snowflake.

Generically, *features* are data elements used as inputs to a machine learning model. Many columns in a dataset, such as
temperature or attendance, can be used as features as-is. In other cases, a column can be made more useful for training
via preprocessing and transformation. For example, you might derive a day-of-week feature from a timestamp to allow the
model to detect weekly patterns. Other common feature transformations involve aggregating, differentiating, or
time-shifting data. *Feature engineering* is the process of deciding what features are needed by your models and
defining how they will be derived from the raw data.

A *feature store* lets you standardize commonly used feature transformations in a central repository, enabling reuse,
helping to reduce duplication of data and effort, and improving productivity. It also helps maintain features by
updating them on new source data, always providing correct, consistent, and fresh features in a single source of truth.
By cultivating consistency in how features are extracted from raw data, a feature store can also help to make your
production ML pipelines more robust.

The Snowflake Feature Store is designed to make creating, storing, and managing features for data science and machine
learning workloads easier and more efficient. Hosted natively inside Snowflake, the Snowflake Feature Store provides
the following advantages:

* Your data remains secure, completely under your control and governance, and never leaves Snowflake.
* The Snowsight Feature Store UI makes it easy to search for and discover features.
* Access is managed with fine-grained [role-based access control](../../../user-guide/security-access-control-overview.md).

Key benefits of the Snowflake Feature Store include support for:

* Both batch and streaming data, with efficient automatic updates as new data arrives
* Backfill and point-in-time correct features with [ASOF JOIN](../../../sql-reference/constructs/asof-join.md)
* Feature transformations authored in Python or SQL
* Automatic update and refresh of feature values from source data with Snowflake managed Feature Views
* Ability to use user-managed feature pipelines with external tools such as [dbt](https://www.getdbt.com/)

The Snowflake Feature Store is fully integrated with the
[Snowflake Model Registry](../model-registry/overview.md)
and other Snowflake ML features for end-to-end production ML.

> * Ability to trace data flow from source, to feature, to dataset, to trained model via ML Lineage. This also helps
>   make inference easier by enabling models to automatically retrieve the correct feature values at inference time
>   and so the user does not have to provide all the feature inputs. ML Lineage is automatically created when the feature
>   store is used.

The following illustration shows how the Snowflake Feature Store fits into a machine learning pipeline:

* Raw data can be obtained in batch from tables or views or from streaming data sources.
* The raw data is then transformed by features defined by data engineers, resulting in a feature table.
* The feature table can be used to generate training datasets used for training models in Snowpark ML, or to enrich test
  data used by the model to make predictions.

## How does it work?

> **Note:**
>
> A feature store in Snowflake is simply a schema. You can create a new schema to use as a feature store, or use an
> existing one.

A feature store contains [feature views](feature-views.md). A feature view encapsulates a Python or SQL pipeline
for transforming raw data into one or more related features. All features defined in a feature view are refreshed from
the source data at the same time.

> **Tip:**
>
> Users who have access to more than one feature store can combine feature views from multiple feature stores to
> create training and inference datasets.

The Snowflake Feature Store supports two kinds of feature views:

* *Snowflake-managed*: The Snowflake Feature Store refreshes the features in the feature view for you, incrementally
  and efficiently, on a schedule you specify.
* *External*: Some other process outside of the feature store maintains the features in the feature view. This type of
  feature view is intended for use with tools such as [dbt](https://www.getdbt.com/).

Feature views are organized in the feature store according to the [entities](entities.md) to which they apply. An entity is a
higher-level abstraction that represents the subject matter of a feature. For example, in a feature store for a movie streaming
service, the main entities might be users and movies. Raw movie data and user activity data can be converted into useful
features such as per-movie viewing time and user session length, and the feature views containing these features can be tagged
with relevant entities.

### Back-end data model

Feature store objects are implemented as Snowflake objects. All feature store objects are therefore subject to Snowflake
access control rules.

| Feature store object | Snowflake object |
| --- | --- |
| feature store | [schema](../../../sql-reference/ddl-database.md) |
| feature view | [dynamic table](../../../user-guide/dynamic-tables-about.md) or [view](../../../user-guide/views-introduction.md) |
| entity | [tag](../../../user-guide/object-tagging/introduction.md) |
| feature | column in a dynamic table or in a view |

Properties of feature views (such as name and entity) are implemented as tags on dynamic tables or views.

You can query or manipulate the Snowflake objects using SQL. Changes you make via SQL are reflected in the Python API
and vice versa.

> **Tip:**
>
> All objects of a Snowflake Feature Store are stored in the feature store’s schema. To completely delete
> a feature store, make sure the schema doesn’t contain any other resources, and then
> [drop the schema](../../../sql-reference/sql/drop-schema.md).

## Getting started

> **Note:**
>
> The Snowflake Feature Store Python API is part of the Snowpark ML Python package, `snowflake-ml-python`. You can use
> it on your local system in your preferred Python IDE or in a Snowsight worksheet or notebook. For details, see
> [Python APIs for Snowflake ML](../snowpark-ml.md).

Begin your journey with [Introduction to the Snowflake Feature Store](https://quickstarts.snowflake.com/guide/intro-to-feature-store/)
for an introduction to Snowflake Feature Store concepts. Then follow up with additional [Snowflake quickstarts](https://quickstarts.snowflake.com), including:

* [Develop and Manage ML Models with the Snowflake Feature Store and Model Registry](https://quickstarts.snowflake.com/guide/develop-and-manage-ml-models-with-feature-store-and-model-registry/). This is an end-to-end ML development cycle demo with the Feature Store and the Model Registry.
* [Getting Started with the Snowflake Feature Store API](https://quickstarts.snowflake.com/guide/overview-of-feature-store-api/). This is an overview of Feature Store Python APIs.
* [Advanced Guide to the Snowflake Feature Store](https://quickstarts.snowflake.com/guide/advanced_guide_to_snowflake_feature_store/). This is a more advanced example of Feature Store and pipelines.
* [Getting Started with Snowflake Feature Store and dbt](https://quickstarts.snowflake.com/guide/getting-started-with-feature-store-and-dbt/). This demonstrates how to register features from DBT pipeline into Snowflake Feature Store.

See [Common feature and query patterns](examples.md) for examples of specific types of feature transformations.

> **Note:**
>
> These quickstarts are only shown as examples. Following along with the example may require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Snowflake does not guarantee the accuracy of these examples or
> cover them under any Service Level Agreement.

---
title: Snowflake Feature Store access control model
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/rbac.md
section: Snowflake ML
---

# Snowflake Feature Store access control model

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

The privileges required by the Snowflake Feature Store depend on the type of user.

* *Producers* can create and operate on feature views.
* *Consumers* can read information about feature views and entities in the feature store.

Typically, each type of user will have their own [Snowflake database role](../../../sql-reference/snowflake-db-roles.md) with
the necessary privileges. Feature store roles are most naturally configured using a
[role hierarchy](../../../user-guide/security-access-control-overview.md).

Producers require the following privileges:

* CREATE DYNAMIC TABLE, CREATE TAG, and CREATE VIEW on the feature store schema.

  > > **Note:**
  > >
  > > For Snowflake-managed feature views (backed by a dynamic table) with incremental refresh, the source tables must have
  > > [change tracking enabled](../../../user-guide/dynamic-tables-create.md), or the user must have OWNERSHIP of these tables
  > > to automatically enable change tracking when the feature view is created.
* CREATE TABLE and CREATE DATASET on the feature store schema and/or the destination schema when generating datasets for
  training.
* OPERATE on the dynamic tables and tasks in the feature store schema to manage Feature View refresh settings.
* USAGE on the warehouse passed in to the feature store initializer.
* CREATE SCHEMA is optional if the feature store schema already exists and the producers have usage privileges on it.
* All consumer privileges listed below.

Consumers require the following privileges at minimum:

* USAGE on the feature store database and schema.
* SELECT on and MONITOR on DYNAMIC TABLES in the feature store schema.
* SELECT and REFERENCE on views in the feature store schema.
* USAGE on the warehouse passed to the feature store initializer.

Consumers can also have the following privileges to allow them to use feature store data:

* CREATE TABLE and CREATE DATASET on the feature store schema and/or the destination schema for generating datasets for training.
* SELECT and REFERENCE on tables in the feature store or any schemas containing generated datasets.
* USAGE on DATASETs in the feature store schema or any schemas containing generated datasets.

With multiple feature stores, you probably will have these two types of roles for each individual feature store,
or for logical groupings of feature stores.

> **Note:**
>
> A role with `MANAGE GRANTS`, `CREATE ROLE`, and `CREATE SCHEMA ON DATABASE <DB>`
> privileges is needed to configure the necessary Feature Store roles and privileges. You may use the
> [ACCOUNTADMIN](../../../user-guide/security-access-control-considerations.md) built-in role or use a custom role with these privileges.

## Access control setup in Python

`snowflake-ml-python` package version 1.6.3 and later include a `setup_feature_store` utility API for configuring a
new feature store with producer and consumer roles and privileges. In the following example, fill in the names of the
database, schema, warehouse, and producer and consumer role where indicated.

```python
from snowflake.ml.feature_store import setup_feature_store

setup_feature_store(
    session=session,
    database="<FS_DATABASE_NAME>",
    schema="<FS_SCHEMA_NAME>",
    warehouse="<FS_WAREHOUSE>",
    producer_role="<FS_PRODUCER_ROLE>",
    consumer_role="<FS_CONSUMER_ROLE>",
)
```

## Access control setup in SQL

You can manually configure the Feature Store roles and privileges using the following SQL commands. Note that in the
first block, there are several SET commands that tell the script the names you want to use for your producer and
consumer roles as well as the names of the database and schema where the feature views will be stored. All of these
objects are created if they do not exist.

```sqlexample
-- Initialize variables for usage in SQL scripts below
SET FS_ROLE_PRODUCER = '<FS_PRODUCER_ROLE>';
SET FS_ROLE_CONSUMER = '<FS_CONSUMER_ROLE>';
SET FS_DATABASE = '<FS_DATABASE_NAME>';
SET FS_SCHEMA = '<FS_SCHEMA_NAME>';
SET FS_WAREHOUSE = '<FS_WAREHOUSE>';

-- Create schema
SET SCHEMA_FQN = CONCAT($FS_DATABASE, '.', $FS_SCHEMA);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($SCHEMA_FQN);

-- Create roles
CREATE ROLE IF NOT EXISTS IDENTIFIER($FS_ROLE_PRODUCER);
CREATE ROLE IF NOT EXISTS IDENTIFIER($FS_ROLE_CONSUMER);

-- Build role hierarchy
GRANT ROLE IDENTIFIER($FS_ROLE_PRODUCER) TO ROLE SYSADMIN;
GRANT ROLE IDENTIFIER($FS_ROLE_CONSUMER) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);

-- Grant PRODUCER role privileges
GRANT CREATE DYNAMIC TABLE ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);
GRANT CREATE VIEW ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);
GRANT CREATE TAG ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);
GRANT CREATE DATASET ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);
GRANT CREATE TABLE ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_PRODUCER);

-- Grant CONSUMER role privileges
GRANT USAGE ON DATABASE IDENTIFIER($FS_DATABASE) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
GRANT USAGE ON SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);

GRANT SELECT, MONITOR ON FUTURE DYNAMIC TABLES IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
GRANT SELECT, MONITOR ON ALL DYNAMIC TABLES IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);

GRANT SELECT, REFERENCES ON FUTURE VIEWS IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
GRANT SELECT, REFERENCES ON ALL VIEWS IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);

GRANT USAGE ON FUTURE DATASETS IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
GRANT USAGE ON ALL DATASETS IN SCHEMA IDENTIFIER($SCHEMA_FQN) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);

-- Grant USAGE ON WAREHOUSE to CONSUMER
GRANT USAGE ON WAREHOUSE IDENTIFIER($FS_WAREHOUSE) TO ROLE IDENTIFIER($FS_ROLE_CONSUMER);
```

---
title: Snowflake ML Jobs
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/ml-jobs/overview.md
section: Snowflake ML
---

# Snowflake ML Jobs

Use Snowflake ML Jobs to run machine learning (ML) workflows inside Snowflake ML container runtimes.
You can run them from any development environment. You don’t need to run the code in a Snowflake worksheet or notebook. Use jobs to leverage Snowflake’s infrastructure to run resource-intensive tasks within your development workflow. For information about setting up Snowflake ML locally, see [Using Snowflake ML Locally](../snowpark-ml.md).

> **Important:**
>
> Snowflake ML Jobs are available in `snowflake-ml-python` version 1.26.0 and later.

Snowflake ML Jobs enable you to do the following:

* Run ML workloads on Snowflake Compute Pools, including GPU and high-memory CPU instances.
* Use your preferred development environment such as VS Code or Jupyter notebooks.
* Install and use custom Python packages within your runtime environment.
* Use Snowflake’s distributed APIs to optimize data loading, training, and hyperparameter tuning.
* Integrate with orchestration tools, such as Apache Airflow.
* Monitor and manage jobs through Snowflake’s APIs.

You can use these capabilities to do the following:

* Execute resource-intensive training on large datasets requiring GPU acceleration or significant compute resources.
* Productionize ML workflows by moving ML code from development to production with programmatic execution through pipelines.
* Retain your existing development environment while leveraging Snowflake’s compute resources.
* Lift and shift OSS ML workflows with minimal code changes.
* Work directly with large Snowflake datasets to reduce data movement and avoid expensive data transfers.

## Prerequisites

1. Install the Snowflake ML Python package.

   ```bash
   pip install snowflake-ml-python>=1.26.0
   ```
2. The default compute pool size uses the CPU_X64_S instance family. The minimum number of nodes is 1 and the maximum is 25. You can use the following SQL command to create a custom compute pool:

   ```sqlexample
   CREATE COMPUTE POOL IF NOT EXISTS MY_COMPUTE_POOL
     MIN_NODES = <MIN_NODES>
     MAX_NODES = <MAX_NODES>
     INSTANCE_FAMILY = <INSTANCE_FAMILY>;
   ```
3. Snowflake ML Jobs require a Snowpark Session. Use the following code to create it:

   ```python
   from snowflake.snowpark import Session
   from snowflake.ml.jobs import list_jobs

   ls = list_jobs() # This will fail! You must create a session first.

   # Requires valid ~/.snowflake/config.toml file
   session = Session.builder.getOrCreate()

   ls = list_jobs(session=session)
   ls = list_jobs() # Infers created session from context
   ```

   For information about creating a session, see [Creating a Session](../../snowpark/python/creating-session.md).

## Run a Snowflake ML job

You can run a Snowflake ML Job in one of the following ways:

* Using a function decorator within your code.
* Submitting entire files or directories using the Python API.

### Run a Python function as a Snowflake ML Job

Use Function Dispatch to run individual Python functions remotely on Snowflake’s compute resources with the `@remote` decorator.

Using `@remote`, you can:

* Serializate the function and its dependencies.
* Upload it to a specified Snowflake stage.
* Execute it within a specific Container Runtime.

The following example Python code uses the `@remote` decorator to submit a function call as a Snowflake ML Job:

```python
from snowflake.ml.jobs import remote

@remote("MY_COMPUTE_POOL", stage_name="payload_stage", session=session)
def train_model(data_table: str):
  # Provide your ML code here, including imports and function calls
  ...

job = train_model("my_training_data")
```

> **Note:**
>
> Submitting a job requires an existing Snowpark `Session`; See Prerequisites for details.

Invoking a `@remote` decorated function returns a Snowflake `MLJob` object that can be used to manage and monitor the job execution. For more information, see Ray Dashboard in ML Jobs.

### Run a Python file as a Snowflake ML Job

Run Python files or project directories on Snowflake compute resources. This is useful when:

* You have complex ML projects with multiple modules and dependencies.
* You want to maintain separation between local development and production code.
* You need to run scripts that use command-line arguments.
* You’re working with existing ML projects that weren’t specifically designed for execution on Snowflake compute.

The Snowflake Job API offers three main methods for submitting file-based payloads:

* `submit_file()`: For running single Python files
* `submit_directory()`: For running Python projects spanning multiple files and resources
* `submit_from_stage()`: For running Python projects saved on a Snowflake stage

Both methods support:

* Command-line argument passing
* Environment variable configuration
* Custom dependency specification
* Project asset management through Snowflake stages

File Dispatch is particularly useful for productionizing existing ML workflows and maintaining clear separation between development and execution environments.

The following Python code submits a file as a Snowflake ML Job:

```python
from snowflake.ml.jobs import submit_file

# Run a single file
job1 = submit_file(
  "train.py",
  "MY_COMPUTE_POOL",
  stage_name="payload_stage",
  args=["--data-table", "my_training_data"],
  session=session,
)
```

The following Python code submits a directory as a Snowflake ML Job:

```python
from snowflake.ml.jobs import submit_directory

# Run from a directory
job2 = submit_directory(
  "./ml_project/",
  "MY_COMPUTE_POOL",
  entrypoint="train.py",
  stage_name="payload_stage",
  session=session,
)
```

The following Python code submits a directory from a Snowflake Stage as a Snowflake ML Job:

```python
from snowflake.ml.jobs import submit_from_stage

# Run from a directory
job3 = submit_from_stage(
  "@source_stage/ml_project/"
  "MY_COMPUTE_POOL",
  entrypoint="@source_stage/ml_project/train.py",
  stage_name="payload_stage",
  session=session,
)

# Entrypoint may also be a relative path
job4 = submit_from_stage(
  "@source_stage/ml_project/",
  "MY_COMPUTE_POOL",
  entrypoint="train.py",  # Resolves to @source_stage/ml_project/train.py
  stage_name="payload_stage",
  session=session,
)
```

Submitting a file or directory returns a Snowflake `MLJob` object that can be used to manage and monitor the job execution. For more information, see Ray Dashboard in ML Jobs.

### Run a Snowflake ML Job on a specific container runtime

The `@remote` decorator, as well as the functions `submit_directory()`, `submit_from_stage()`, and `submit_file()` all support the `runtime_environment` keyword. When you don’t provide this keyword in your decorator or function call, Snowflake automatically uses the latest avialable version of the Snowflake Container Runtime on your compute pool.

To specify a container runtime for your ML Job, use the `runtime_environment` keyword with a string value of the Container Runtime version to use. See [Container Runtime releases](../container-runtime/releases.md) for the full list of available versions and what’s contained in these environments by default.

The following example shows how to pin a function with the `@remote` decorator to Snowflake Container Runtime version 2.3:

```python
from snowflake.ml.jobs import remote

@remote("MY_COMPUTE_POOL", stage_name="payload_stage", session=session, runtime_environment="2.3")
def train_model(data_table: str):
  # Provide your ML code here, including imports and function calls
  ...
```

### Supporting Additional Payloads in Submissions

When submitting a file, directory, or from a stage, additional payloads are supported for use during job execution.
The import path can be specified explicitly; otherwise, it will be inferred from the location of the additional payload.

> **Important:**
>
> You can only load single Python files from a stage.

```python
# Run from a file
 job1 = submit_file(
   "train.py",
   "MY_COMPUTE_POOL",
   stage_name="payload_stage",
   session=session,
   imports=[
     ("src/utils/", "utils"), # the import path is utils
   ],
 )

 # Run from a directory
 job2 = submit_directory(
   "./ml_project/",
   "MY_COMPUTE_POOL",
   entrypoint="train.py",
   stage_name="payload_stage",
   session=session,
   imports=[
     ("src/utils/"), # the import path is utils
   ],
 )

 # Run from a stage
 job3 = submit_from_stage(
   "@source_stage/ml_project/",
   "MY_COMPUTE_POOL",
   entrypoint="@source_stage/ml_project/train.py",
   stage_name="payload_stage",
   session=session,
   imports=[
     ("@source_stage/src/utils/sub_utils/", "utils.sub_utils"),
   ],
 )
```

### Accessing Snowpark Session in ML Jobs

When running ML Jobs on Snowflake, a Snowpark Session is automatically available in the execution context.
You can access the Session object from within your ML Job payload using the following approaches:

```python
from snowflake.ml.jobs import remote
from snowflake.snowpark import Session

@remote("MY_COMPUTE_POOL", stage_name="payload_stage")
def my_function():
  # This approach works for all payload types, including file and directory payloads
  session = Session.builder.getOrCreate()
  print(session.sql("SELECT CURRENT_VERSION()").collect())

@remote("MY_COMPUTE_POOL", stage_name="payload_stage")
def my_function_with_injected_session(session: Session):
  # This approach works only for function dispatch payloads
  # The session is injected automatically by the Snowflake ML Job API
  print(session.sql("SELECT CURRENT_VERSION()").collect())
```

The Snowpark Session can be used to access Snowflake tables, stages, and other database objects inside your ML Job.

### Returning results from ML Jobs

Snowflake ML Jobs support returning execution results back to the client environment.
This enables you to retrieve computed values, trained models, or any other artifacts produced by your job payloads.

For function dispatch, simply return a value from your decorated function.
The returned value will be serialized and made available through the `result()` method.

```python
from snowflake.ml.jobs import remote

@remote("MY_COMPUTE_POOL", stage_name="payload_stage")
def train_model(data_table: str):
  # Your ML code here
  model = XGBClassifier()
  model.fit(data_table)
  return model

job1 = train_model("my_training_data")
```

For file-based jobs, use the special `__return__` variable to specify the return value.

```python
# Example: /path/to/repo/my_script.py
def main():
    # Your ML code here
    model = XGBClassifier()
    model.fit(data_table)
    return model

if __name__ == "__main__":
    __return__ = main()
```

```python
from snowflake.ml.jobs import submit_file

job2 = submit_file(
    "/path/to/repo/my_script.py",
    "MY_COMPUTE_POOL",
    stage_name="payload_stage",
    session=session,
)
```

You can retrieve the job execution result using the `MLJob.result()` API.
The API blocks the calling thread until the job reaches a terminal state, then returns the payload’s return value or, if execution failed, raises an exception.
If the payload does not define a return value, the result will be `None` on success.

```python
# These will block until the respective job is done and return the trained model
model1 = job1.result()
model2 = job2.result()
```

## ML Job Definitions

An ML Job Definition captures the reusable components of an ML Job—payload location, compute pool, and related configuration.
This allows you to submit multiple jobs from the same payload with different arguments without re-uploading
the payload.

> **Note:**
>
> ML Job Definitions are available in `snowflake-ml-python` version 1.26 and later.

To create an ML Job Definition, use the `MLJobDefinition` class.
The API closely mirrors the job-creation APIs. All optional parameters supported for job creation are also supported when creating job definitions.

Use Function Dispatch to register individual Python functions with the ` @remote` decorator.

```python
from snowflake.ml.jobs import remote

compute_pool = "MY_COMPUTE_POOL"
@remote(compute_pool, stage_name="payload_stage")
def hello_world(name: str = "world"):
    from datetime import datetime

    print(f"{datetime.now()} Hello {name}!")

# this is a definition handle
definition = hello_world

job1 = hello_world()
```

Use `register()` to create job definitions from a local file, a local directory, or a stage directory.

```python
from snowflake.ml.jobs import MLJobDefinition

# create a job definition from a stage directory
job_definition1 = MLJobDefinition.register(
    entrypoint ='@tmp_stage/my_project/xgb.py',
    source = '@tmp_stage/my_project',
    stage_name = "payload_stage",
    compute_pool = compute_pool
)

# create a job definition from local file
job_definition2 = MLJobDefinition.register(
    source ='/path/to/script.py',
    stage_name = "payload_stage",
    compute_pool = compute_pool
)

# create a job definition from the directory
job_definition3 = MLJobDefinition.register(
    entrypoint ='/path/to/directory/script.py',
    source = '/path/to/directory',
    stage_name = "payload_stage",
    compute_pool = compute_pool
)
```

Create a job from a job definition, with support for passing different parameters to generate distinct jobs.

```python
from snowflake.ml.jobs import remote

# create a job definition using the remote decorator
compute_pool = "MY_COMPUTE_POOL"
@remote(compute_pool, stage_name="payload_stage")
def hello_world(name: str = "world"):
    from datetime import datetime

    print(f"{datetime.now()} Hello {name}!")

definition = hello_world

job1 = definition()

job2 = definition(name="ML Job Definition") # pass in the different parameter
```

The `register()` function takes `runtime_environment` as an optional keyword argument to select the container image that runs on your selected compute pool. By default, your job definition uses the latest available version of the Snowflake Container Runtime.

To specify a container runtime for your ML Job, use the `runtime_environment` keyword with a string value of the Container Runtime version to use. See [Container Runtime releases](../container-runtime/releases.md) for the full list of available versions and what’s contained in these environments by default.

Support integration with Tasks. Jobs executed from a Task do not run within a stored procedure. Refer to [ML Jobs Task Integration samples](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/ml/ml_jobs/e2e_task_graph) for examples of using Snowflake ML Job Definitions in Tasks.

```python
from snowflake.ml.jobs import remote

compute_pool = "MY_COMPUTE_POOL"
@remote(COMPUTE_POOL, stage_name="payload_stage")
def train_model(input_data: DataSource) -> Optional[str]:
    ...

train_model_task = DAGTask("TRAIN_MODEL", definition=train_model) # train_model is a job definition created by the @remote decorator
```

## Ray Dashboard in ML Jobs

ML Job now supports the ray dashboard for the running jobs in `snowflake-ml-python` version 1.30 and later.

> **Note:**
>
> The Ray Dashboard is not supported on the `CPU_X64_XS` compute pool instance family. The dashboard is only available while the job is running.

```python
from snowflake.ml.jobs import remote

@remote("MY_COMPUTE_POOL", stage_name="payload_stage", session=session)
def train_model(data_table: str):
  # Provide your ML code here, including imports and function calls
  ...

job = train_model("my_training_data")
ray_dashboard_url = job.get_ray_dashboard_url() # copy and paste this url in browser to log in then to see the ray dashboard
```

## Managing ML Jobs

When you submit a Snowflake ML Job, the API creates an `MLJob` instance. You can use it to do the following:

* Track job progress through status updates
* Debug issues using detailed execution logs
* Retrieve the execution result (if any)

You can use the `get_job()` API to retrieve an `MLJob` object by its ID. The following Python code shows how to retrieve an `MLJob` object:

```python
from snowflake.ml.jobs import MLJob, get_job, list_jobs, delete_job

# Get a list of the 10 most recent jobs as a Pandas DataFrame
jobs_df = list_jobs(limit=10)
print(jobs_df)  # Display list in table format

# Retrieve an existing job based on ID
job = get_job("<job_id>")  # job is an MLJob instance

# Retrieve status and logs for the retrieved job
print(job.status)  # PENDING, RUNNING, FAILED, DONE
print(job.get_logs())

# Clean up the job
delete_job(job)
```

## Managing dependencies

The Snowflake ML Job API runs payloads inside the [Snowflake Container Runtime](../container-runtime-ml.md) environment. The environment has the most commonly used Python packages for machine learning and data science.
Most use cases should work “out of the box” without additional configuration.
If you need custom dependencies, you can use `pip_requirements` to install them.

To install custom dependencies, you must enable external network access using an External Access Integration. You can use the following SQL example command to provide access:

```sqlexample
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION PYPI_EAI
  ALLOWED_NETWORK_RULES = (snowflake.external_access.pypi_rule)
  ENABLED = true;
```

For more information about external access integrations, see [Creating and using an external access integration](../../external-network-access/creating-using-external-network-access.md).

After you’ve provided external network access, you can use the `pip_requirements` and `external_access_integrations` parameters to configure custom dependencies. You can use packages that aren’t available in the container runtime environment or if you specific versions of the packages.

The following Python code shows how to specify custom dependencies to the `remote` decorator:

```python
@remote(
  "MY_COMPUTE_POOL",
  stage_name="payload_stage",
  pip_requirements=["custom-package"],
  external_access_integrations=["PYPI_EAI"],
  session=session,
)
def my_function():
  # Your code here
```

The following Python code shows how to specify custom dependencies for the `submit_file()` method:

```python
from snowflake.ml.jobs import submit_file

# Can include version specifier to specify version(s)
job = submit_file(
  "/path/to/repo/my_script.py",
  compute_pool,
  stage_name="payload_stage",
  pip_requirements=["custom-package==1.0.*"],
  external_access_integrations=["pypi_eai"],
  session=session,
)
```

### Private package feeds

Snowflake ML Jobs also support loading packages from private feeds such as JFrog Artifactory and Sonatype Nexus Repository. These feeds are commonly used to distribute internal and proprietary packages, maintain control over dependency versions, and ensure security/compliance.

To install packages from a private feed, you must do the following:

1. Create a Network Rule to allow access to the private feed’s URL.

   1. For sources which use basic authentication, you can simply create a network rule.

      ```sqlexample
      CREATE OR REPLACE NETWORK RULE private_feed_nr
      MODE = EGRESS
      TYPE = HOST_PORT
      VALUE_LIST = ('<your-repo>.jfrog.io');
      ```
   2. To configure access to a source using private connectivity (i.e. Private Link), follow the steps in [Network egress using private connectivity](../../snowpark-container-services/service-network-communications.md).
2. Create an External Access Integration using the network rule. Grant permission to use the EAI to the role that will be submitting jobs.

   > ```sqlexample
   > CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION private_feed_eai
   > ALLOWED_NETWORK_RULES = (PRIVATE_FEED_NR)
   > ENABLED = true;
   >
   > GRANT USAGE ON INTEGRATION private_feed_eai TO ROLE <role_name>;
   > ```
3. Specify the private feed URL, External Access Integration, and package(s) when submitting the job

   > ```python
   > # Option 1: Specify private feed URL in pip_requirements
   > job = submit_file(
   >   "/path/to/script.py",
   >   compute_pool="MY_COMPUTE_POOL",
   >   stage_name="payload_stage",
   >   pip_requirements=[
   >     "--index-url=https://your.private.feed.url",
   >     "internal-package==1.2.3"
   >   ],
   >   external_access_integrations=["PRIVATE_FEED_EAI"]
   > )
   > ```
   >
   > ```python
   > # Option 2: Specify private feed URL by environment variable
   > job = submit_directory(
   >   "/path/to/code/",
   >   compute_pool="MY_COMPUTE_POOL",
   >   entrypoint="script.py",
   >   stage_name="payload_stage",
   >   pip_requirements=["internal-package==1.2.3"],
   >   external_access_integrations=["PRIVATE_FEED_EAI"],
   >   env_vars={'PIP_INDEX_URL': 'https://your.private.feed.url'},
   > )
   > ```

If your private feed URL contains sensitive information like authentication tokens, manage the URL by creating a Snowflake Secret.
Use the [CREATE SECRET](../../../sql-reference/sql/create-secret.md) to create a secret. Configure secrets during job submission with the `spec_overrides` argument.

> **Note:**
>
> When using `spec_overrides`, Snowflake only supports and validates secrets in the `secrets` field within container definitions. Snowflake does not support or validate other fields, such as `args`, `volumes`, and `endpoints`.

```python
# Create secret for private feed URL with embedded auth token
feed_url = "<your-repo>.jfrog.io/artifactory/api/pypi/test-pypi/simple"
user = "<auth_user>"
token = "<auth_token>"
session.sql(f"""
CREATE SECRET IF NOT EXISTS PRIVATE_FEED_URL_SECRET
 TYPE = GENERIC_STRING
 SECRET_STRING = 'https://{auth_user}:{auth_token}@{feed_url}'
""").collect()

# Prepare service spec override for mounting secret into job execution
spec_overrides = {
 "spec": {
  "containers": [
    {
     "name": "main",  # Primary container name is always "main"
     "secrets": [
      {
        "snowflakeSecret": "PRIVATE_FEED_URL_SECRET",
        "envVarName": "PIP_INDEX_URL",
        "secretKeyRef": "secret_string"
      },
     ],
    }
  ]
 }
}

# Load private feed URL from secret (e.g. if URL includes auth token)
job = submit_file(
  "/path/to/script.py",
  compute_pool="MY_COMPUTE_POOL",
  stage_name="payload_stage",
  pip_requirements=[
    "internal-package==1.2.3"
  ],
  external_access_integrations=["PRIVATE_FEED_EAI"],
  spec_overrides=spec_overrides,
)
```

For more information about the `container.secrets`, see [containers.secrets field](../../snowpark-container-services/specification-reference.md).

## Examples

See [ML Jobs Code Samples](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/ml/ml_jobs) for examples of how to use Snowflake ML Jobs.

## Cost considerations

Snowflake ML Jobs run on Snowpark Container Services and are billed based on usage. For information about compute costs, see [Snowpark Container Services costs](../../snowpark-container-services/accounts-orgs-usage-views.md).

Job payloads are uploaded to the stage specified with the `stage_name` argument. To avoid additional charges, you must clean them up. For more information, see [Understanding storage cost](../../../user-guide/cost-understanding-data-storage.md) and [Exploring storage cost](../../../user-guide/cost-exploring-data-storage.md) to learn more about costs associated with stage storage.

---
title: Snowflake ML Model Development
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/modeling.md
section: Snowflake ML
---

# Snowflake ML Model Development

Snowflake ML is a highly flexible platform that lets you use your open-source (OSS) code to train machine learning models directly on your data.
This approach removes the need for complex data movement while allowing you to use your preferred ML libraries, tools, and development processes.

Snowflake ML integrates with Snowflake-compatible data sources to accelerate ML workflows using optimized data ingestion pipelines. Advanced distributed APIs enable efficient scaling of model training and tuning.
You can access all Snowflake ML features from a notebook environment with an ML runtime image, eliminating the need for you to manage packages and infrastructure.

## Train and tune your models

### Build with Notebooks on Container Runtime

[Snowflake Container Runtime](container-runtime-ml.md) provides a pre-built ML environment with popular packages included. You can securely add libraries from public or private PyPI repositories to customize your environment.
Its distributed APIs enable you to transform data and run AI/ML workflows at scale.

In addition to using Snowflake’s distributed APIs to scale your workflows, you can also use Ray. Ray is an open-source framework that provides a simple and flexible way to scale Python applications. It allows you to run your code in parallel across multiple nodes. For more information about using Ray with Snowflake ML, see the [Ray Getting Started Guide](https://docs.ray.io/en/latest/ray-overview/getting-started.html).

[Container Runtime Notebooks](notebooks-on-spcs.md) are Snowflake Notebooks integrated with the Container Runtime.
They provide features such as a pre-built ML runtime image, distributed processing, CPU compute pools, and GPU compute pools.
If you’re a data scientist or ML engineer, Container Runtime Notebooks can be particularly useful for your ML development tasks.

### Remote execution from any external IDE

You can also use your preferred external IDE, such as Visual Studio Code or a cloud-based Jupyter Notebook, and remotely execute ML workflows in the Container Runtime. To execute your workflows remotely, annotate your Python code, functions, or files and run it in a Container Runtime instance. For more information, see [Run a Python function as a Snowflake ML Job](ml-jobs/overview.md).

## Develop your code

### Ingest data directly into open source objects

Use the [Data Connector](load-data.md) for optimized data loading from your Snowflake tables and stages into open source objects such as pandas dataframes, PyTorch datasets, and TensorFlow datasets.
The Data Connector uses the Container Runtime’s distributed processing to speed up ingestion. After loading, you can use the data with any open-source library.

Using the Data Connector, you can load structured and unstructured data from multiple sources. In addition to its versatility, it provides improved performance over `to_pandas` for loading large datasets.

### Train with OSS frameworks

We recommend using your existing open source code or training models directly in Snowflake with open source libraries.

You can use the following features for your Snowflake ML workflows:

* Import features built and managed in the [Snowflake Feature Store](feature-store/overview.md).
* [Use Snowpark](../snowpark/python/creating-udfs.md) to scale your data preprocessing and transformation.
* Bring your data into memory with the [Data Connector APIs](load-data.md).
* Leverage the latest in OSS frameworks to engineer features, train models, and evaluate them.

### Scale workloads using distributed APIs

Training ML models on large datasets can exceed the resources of a single node. With Snowflake’s distributed APIs, you can scale feature engineering and training workflows across multiple nodes for improved performance.
With the distributed APIs, you can do the following:

* Leverage distributed preprocessing functions in [snowflake.ml.modeling.preprocessing](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling#snowflake-ml-modeling-preprocessing).
* Scale your model training out across one or more nodes using optimized training APIs in [Snowflake Container Runtime](container-runtime-ml.md).

### Tune hyperparameters with distributed HPO

Accelerate hyperparameter tuning with [Snowflake ML’s distributed HPO](container-hpo.md), optimized for data stored in Snowflake. You can also use open source libraries like hyperopt or optuna.

### Operationalize training workflows

[Snowflake ML Jobs](ml-jobs/overview.md) allow you to run Python-based ML workloads remotely, making it easy to operationalize work developed interactively in environments like Snowflake Notebooks. This ensures secure, reproducible ML training and scoring, and integrates seamlessly with CI/CD pipelines.

### Schedule ML jobs and pipelines to run periodically

Use [Introduction to tasks](../../user-guide/tasks-intro.md) to build complex DAGs to represent ML training pipelines, where each task corresponds to a phase in your workflow. These pipelines can run on a schedule or be triggered by events. You can allocate resources to each step as needed, optimizing your pipeline. Snowsight provides built-in tools to view, manage, and modify these pipelines.

With Snowflake’s built-in git integration, you can also configure git hooks to construct and trigger the ML pipelines that best fit your CI/CD configuration.

---
title: Snowflake ML: End-to-End Machine Learning
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/overview.md
section: Snowflake ML
---

# Snowflake ML: End-to-End Machine Learning

Snowflake ML is an integrated set of capabilities for end-to-end machine learning in a single platform on top of your
governed data.
This is a unified environment for ML development and productionization that is optimized for large-scale distributed feature engineering, model training and inference on CPU and GPU compute without manual tuning or configuration.

Scaling end-to-end ML workflows in Snowflake is seamless. You can do the following:

* Prepare data
* Create and use features with the Snowflake Feature Store
* Train models with CPUs or GPUs using any open-source package from Snowflake Notebooks on Container Runtime
* Create experiments to evaluate your trained models against set metrics
* Operationalize your pipelines using Snowflake ML Jobs
* Deploy your model for inference at scale with the Snowflake Model Registry
* Monitor your production models with ML Observability and Explainability
* Use ML Lineage to track the source data to features, datasets, and models throughout your ML pipeline

Snowflake ML is also flexible and modular. You can deploy the models that you’ve developed in Snowflake outside of Snowflake and externally-trained models can easily be brought into Snowflake for inference.

## Capabilties for data scientists and ML engineers

### Snowflake Notebooks on Container Runtime

[Snowflake Notebooks on Container Runtime](container-runtime-ml.md) provide a Jupyter-like environment for training and fine-tuning large-scale models in Snowflake, without infrastructure management. Start training with preinstalled packages such as PyTorch, XGBoost, or Scikit-learn, or install any package from open-source repositories like HuggingFace or PyPI.
Container Runtime is optimized to run on Snowflake’s infrastructure to provide you with highly efficient data loading, distributed model training, and hyperparameter tuning.

### Snowflake Feature Store

The [Snowflake Feature Store](feature-store/overview.md) is an integrated solution for defining, managing, storing and discovering ML features derived from your data. The Snowflake Feature Store supports automated, incremental refresh from batch and streaming data sources, so that feature pipelines need be defined only once to be continuously updated with new data.

### ML Jobs

Use [Snowflake ML Jobs](ml-jobs/overview.md) to develop and automate ML pipelines. ML Jobs also enable teams that prefer working from an external IDE (VS Code, PyCharm, SageMaker Notebooks) to dispatch functions, files or modules down to Snowflake’s Container Runtime.

### Experiments

Use [experiments](experiments.md) to record the results of your model training, and evaluate a collection of models in an organized way. Experiments help you select the best model for your use case to bring live to production. Training can either be logged in an experiment during model training on Snowflake, or you can upload your own metadata and artifacts from prior training. After concluding your training, view all of the results in Snowsight and pick the right model for your needs.

### Snowflake Model Registry and Model Serving

The [Snowflake Model Registry](model-registry/overview.md) allows for the logging and management of all your ML models, regardless of whether they’re trained on Snowflake or other platforms. You can use the models from the model registry to run inference at scale. You can use Model Serving to deploy the models to Snowpark Container Service for inference.

### ML Observability

[ML Observability](model-registry/model-observability.md) provides tools to monitor model performance metrics in Snowflake. You can track models in production, monitor performance and drift metrics, and set alerts for performance thresholds. Additionally, use the ML Explainability function to compute Shapley values for models in the Snowflake Model Registry, regardless of where they were trained.

### ML Lineage

[ML Lineage](ml-lineage.md) is a capability to trace end-to-end lineage of ML artifacts from source data to features, datasets, and models. This enables reproducibility, compliance, and debugging across the full lifecycle of ML assets.

### Snowflake Datasets

[Snowflake Datasets](dataset.md) provide an immutable, versioned snapshot of your data suitable for ingestion by your machine learning models.

## Capabilities for business analysts

For business analysts, use [ML Functions](../../guides-overview-ml-functions.md) to shorten development time for common scenarios such as forecasting and anomaly detection across your organization with SQL.

## Additional Resources

See the following resources to get started with Snowflake ML:

* [Train an end-to-end ML model](http://quickstarts.snowflake.com/guide/end-to-end-ml-workflow/)
* [Quickstarts](https://quickstarts.snowflake.com/guide/intro_to_machine_learning_with_snowpark_ml_for_python)
* [Snowflake ML webpage](https://www.snowflake.com/en/product/features/snowflake-ml/)

Contact your Snowflake representative for early access to documentation on other features currently under development.

---
title: Snowflake Model Registry
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/overview.md
section: Snowflake ML
---

# Snowflake Model Registry

> **Note:**
>
> The model registry API described in this topic is generally available as of `snowflake-ml-python` package version 1.5.0.

After training your model, operationalizing the model and running inference in Snowflake starts with logging
the model in the Snowflake Model Registry. The Model Registry lets you securely manage models and their metadata in Snowflake,
regardless of origin and type, and makes running inference easy.

> **Important:**
>
> The Snowflake Model Registry works with machine learning models developed in Python for the Snowflake ML
> ecosystem. Models trained using [Snowflake ML Functions](../../../guides-overview-ml-functions.md) (for example,
> [FORECAST](../../../sql-reference/classes/forecast.md)) do not appear in the model registry. Some model types,
> such as [Cortex Fine-Tuned LLMs](../../../user-guide/snowflake-cortex/cortex-finetuning.md), appear in the model registry’s
> [Snowsight UI](snowsight-ui.md), but are not managed by the model registry API.

The Snowflake Model Registry provides the following capabilities:

* Stores and manages model versions, model metrics, and model metadata.
* Serves models and runs distributed inference at scale using Python, SQL, or REST API endpoints.
* Manages model life cycle with flexible governance options and working with models from dev to prod environments.
* Monitors model performance and drift using Snowflake ML Observability.
* Securely manages model access with role based access control (RBAC).

The model registry stores machine learning models as first-class schema-level objects in Snowflake.

After you have logged a model, you can invoke its methods (equivalent to functions or stored procedures) to perform
model operations, such as [inference](../inference/native-batch-inference-sql.md)
, in a Snowflake [virtual warehouse](../../../user-guide/warehouses.md),
or serve the model in Snowpark Container Services for [GPU-based inference](../inference/real-time-inference-rest-api.md).

The Snowflake Model Registry has [built-in types](built-in-models/overview.md) support for the most common model
types, including [scikit-learn](built-in-models/scikit-learn.md),
[xgboost](built-in-models/xgboost.md),
[LightGBM](built-in-models/lightgbm.md),
[Prophet](built-in-models/prophet.md),
[CatBoost](built-in-models/catboost.md),
[PyTorch](built-in-models/pytorch.md),
[TensorFlow](built-in-models/tensorflow.md),
[Keras](built-in-models/keras.md),
[Hugging Face pipelines](built-in-models/hugging-face.md),
[Sentence Transformer](built-in-models/sentence-transformer.md),
and [MLFlow pyfunc models](built-in-models/mlflow.md).
The Model Registry is also flexible and powerful enough to support your own previously-trained models, as well as any custom processing code.

> **Tip:**
>
> See examples of these model types with end to end workflows in [Examples and Quickstarts](examples-and-quickstarts.md).

The main classes in the Snowflake Model Registry Python API are:

* [snowflake.ml.registry.Registry](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/api/registry/snowflake.ml.registry.Registry):
  Manages models within a schema.
* [snowflake.ml.model.Model](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/api/model/snowflake.ml.model.Model):
  Represents a model.
* [snowflake.ml.model.ModelVersion](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/api/model/snowflake.ml.model.ModelVersion):
  Represents a version of a model.

This topic describes how to perform registry operations in Python using the `snowflake-ml-python` library.
You can also perform many registry operations in SQL; see [Model Registry SQL](../../../sql-reference/commands-model.md).

## Required privileges

To create a model, you must either own the schema where the model is created or have the `CREATE MODEL` privilege on it.
To use a model, you must either own the model or have either USAGE or READ privilege on it.

1. The USAGE privilege allows grantees to use the model for warehouse inference without being able to see any of its internals.
2. The READ privilege allows grantees to use the model for SPCS inference and also see its metadata, such as its comments, tags, and metrics.

To give users USAGE access to all existing models in a schema, use
`GRANT USAGE ON ALL MODELS IN SCHEMA <schema> TO ROLE <role>;` You can also give users access to future models created in a schema
automatically via `GRANT USAGE ON FUTURE MODELS IN SCHEMA <schema> TO ROLE <role>;`. .

Similarly, you can give users READ access to all existing or future models in a schema by using the same syntax, but replacing
`USAGE` with `READ`.

If a user’s role has OWNER, USAGE or READ privilege on a model, it appears in the [Snowsight model registry page](snowsight-ui.md).
For details about how privileges work in Snowflake, see [Access control privileges](../../../user-guide/security-access-control-privileges.md).

## Current limitations

The following limits apply to models and model versions:

|  |  |
| --- | --- |
| Models | * Maximum of 1000 versions |
| Model versions | * Maximum of 10 methods * Maximum of 500 arguments per method * Maximum metadata (including metrics) of 100 KB * Maximum total model size of 15 GB (for warehouse deployed models) * Maximum config file size of 250 KB, including `conda.yml` and other manifest files that `log_model` generates internally.   (If a model has many functions and all of them have many arguments, for example, this limit might be exceeded.) |

## Opening the Snowflake Model Registry

Models are first-class Snowflake objects and can be organized within a database and schema along with other Snowflake
objects. The Snowflake Model Registry provides a Python class for managing models within a schema. Thus, any Snowflake schema
can be used as a registry. It is not necessary to initialize or otherwise prepare a schema for this purpose. Snowflake
recommends creating one or more dedicated schemas for this purpose, such as ML.REGISTRY. You can create the schema using
[CREATE SCHEMA](../../../sql-reference/sql/create-schema.md).

Before you can create or modify models in the registry, you must open the registry. Opening the registry returns a
reference to it, which you can then use to add new models and obtain references to existing models.

```python
from snowflake.ml.registry import Registry

reg = Registry(session=sp_session, database_name="ML", schema_name="REGISTRY")
```

## Registering models and versions

> **Note:**
>
> You can also import a model from an external provider to Snowflake. For more information, see [Import and deploy models from an external service](snowsight-ui.md).

Adding a model to the registry is called *logging* the model. Log a model by calling the registry’s `log_model`
method. This method serializes the model — a Python object — and creates a Snowflake model object from it. This method also adds metadata, such as a description, to the model as specified in the `log_model` call.

Each model can have unlimited versions. To log additional versions of the model, call `log_model` again with
the same `model_name` but a different `version_name`.

You cannot add tags to a model when it is added to the registry, because tags are attributes of the model, and
`log_model` adds a specific model version, only creating a model when adding its first version. You can update
the model’s tags after logging the first version of the model.

In the following example, `clf`, short for “classifier,” is the Python model object, which was already
created elsewhere in your code. You can add a comment at registration time, as shown here. The combination of
name and version must be unique in the schema. You may specify `conda_dependencies` lists; the
specified packages will be deployed with the model.

```python
from snowflake.ml.model import task, type_hints
mv = reg.log_model(clf,
                   model_name="my_model",
                   version_name="v1",
                   conda_dependencies=["scikit-learn"],
                   comment="My awesome ML model",
                   metrics={"score": 96},
                   sample_input_data=train_features,
                   task=task.Task.TABULAR_BINARY_CLASSIFICATION)
```

The arguments of `log_model` are described here.

**Required arguments**

| Argument | Description |
| --- | --- |
| `model` | The Python model object of a supported model type. Must be serializable (“pickleable”). |
| `model_name` | The model’s name, used with `version_name` to identify the model in the registry. The name cannot be changed after the model is logged. Must be a [valid Snowflake identifier](../../../sql-reference/identifiers-syntax.md). |

> **Note:**
>
> The combination of model name and version must be unique in the schema.

**Optional arguments**

| Argument | Description |
| --- | --- |
| `version_name` | String specifying the model’s version, used with `model_name` to identify the model in the registry. Must be a [valid Snowflake identifier](../../../sql-reference/identifiers-syntax.md). If missing, a human-readable version name is generated automatically. |
| `code_paths` | List of paths to directories of code to import when loading or deploying the model. |
| `comment` | Comment, for example a description of the model. |
| `conda_dependencies` | List of Conda packages required by your model. This argument specifies package names and optional versions in [Conda format](https://docs.conda.io/projects/conda/en/latest/user-guide/concepts/pkg-search.html), that is, `"[channel::]package [operator version]"`. If you do not specify a channel, the Snowflake channel is assumed when the model runs on a warehouse. conda-forge is assumed for models running on Snowpark Container Services (SPCS). |
| `ext_modules` | List of external modules to pickle with the model. Supported with scikit-learn, Snowpark ML, PyTorch, TorchScript, and custom models. |
| `metrics` | Dictionary that contains metrics linked to the model version. |
| `options` | Dictionary that contains options for model creation. The following options are available for all model types:   * `embed_local_ml_library`: whether to embed a copy of the local Snowpark ML library into the model. Default: `False`. * `relax_version`: whether to relax the version constraints of the dependencies. This replaces version specifiers like   `==x.y.z` with specifiers like `<=x.y, <(x+1)`. Default: `True`. * `save_location`: A string specifying the location (directory path) to save the model and metadata (e.g. `"/path/to/my/directory"`). * `function_type`: Sets the method function type globally to either “FUNCTION” or “TABLE_FUNCTION”. To set method function types   individually see `function_type` in `method_options`. * `volatility`: Set the volatility for all model methods. Custom models default to `VOLATILE` and all other models default to `IMMUTABLE`.   To set method volatility individually, see `volatility` in `method_options`.  **Note:** `VOLATILE` model methods require a full table refresh when used in Dynamic Tables. For more information, see [Supported queries for dynamic tables](../../../user-guide/dynamic-tables-supported-queries.md). * `method_options`: A dictionary of per-method options, where the key is the name of a method and the value is a dictionary   that contains one or more of the options described here. The available options are:    + `case_sensitive`: Indicates whether the method and its signature are case-sensitive. Case-sensitive methods must be double-quoted     when used in SQL. This option also allows non-alphabetic characters in method names. Default: `False`.   + `max_batch_size`: Maximum batch size that the method will accept when called in the warehouse. Default: `None` (the batch     size is automatically determined).   + `function_type`: Set the method function type to “FUNCTION” or “TABLE_FUNCTION”.   + `volatility`: Set the method volatility level to `IMMUTABLE` for deterministic functions or `VOLATILE` for non-deterministic functions. Deterministic functions always return the same result for the same input.   ```python from snowflake.ml.model.volatility import Volatility  options = {   "embed_local_ml_library": True,   "relax_version": True,   "save_location": "/path/to/my/directory",   "function_type": "TABLE_FUNCTION",   "volatility": Volatility.IMMUTABLE,   "method_options": {     "predict": {       "case_sensitive": False,       "max_batch_size": 100,       "function_type": "TABLE_FUNCTION",       "volatility": Volatility.VOLATILE,     },   } ```  Individual model types may support additional options. See [Using built-in model types](built-in-models/overview.md). |
| `pip_requirements` | List of package specs for PyPI packages required by your model. Models running in a warehouse must also specify a pip artifact repository (see `artifact_repository_map` argument, next). |
| `artifact_repository_map` | Dictionary mapping the artifact repository type (must be `"pip"`) to a repository name. For example, to use the built-in PyPI artifact repository, specify `{"pip": "snowflake.snowpark.pypi_shared_repository"}`.  When specified, pip requirements are installed via the artifact repository in warehouse environments. The following model is runnable in warehouse; scikit-learn is installed via the built-in `pypi_shared_repository` artifact repository.  ```python mv = reg.log_model(     clf,     model_name="my_model",     artifact_repository_map={         "pip": "snowflake.snowpark.pypi_shared_repository"     },     pip_requirements=['scikit-learn'],     sample_input_data=train_features, ) ``` |
| `resource_constraint` | Dictionary mapping of warehouse resource constraint keys and values, e.g. {“architecture”: “x86”}. This can be used to ensure the model runs in a warehouse with the necessary architecture. |
| `target_platforms` | List of target platforms to run the model. The only acceptable inputs are a combination of `"WAREHOUSE"` and `"SNOWPARK_CONTAINER_SERVICES"`, or a target platform constant. If `WAREHOUSE` is specified in `target_platforms`, and the model is not runnable in the warehouse (due to dependencies, gpu requirement, model size etc), `log_model()` fails. Default value in [Container Runtime](../container-runtime-ml.md) is `["SNOWPARK_CONTAINER_SERVICES"]` and both elsewhere. For partitioned models, the value must be `["WAREHOUSE"]` or `snowflake.ml.model.target_platform.WAREHOUSE_ONLY`. |
| `python_version` | The version of Python under which the model will run. Defaults to `None`, which designates the latest version available in the warehouse. |
| `sample_input_data` | A DataFrame that contains sample input data. The feature names required by the model and their types are extracted from this DataFrame. Either this argument or `signatures` must be provided for all models except Snowpark ML and MLFlow models and Hugging Face pipelines. |
| `signatures` | Model method signatures as a mapping from target method name to signatures of input and output. Either this argument or `sample_input_data` must be provided for all models except Snowpark ML and MLFlow models and Hugging Face pipelines. |
| `task` | The task defining the problem that the model is meant to solve. If left unspecified, Snowflake makes a best effort to infer the model task from the model class. If the model class can’t be inferred, the model task is set to `type_hints.Task.UNKNOWN`. You must set this parameter to use [ML Observability](model-observability.md).  It helps us identify which monitoring metrics are relevant to your model.  Valid values:   * `snowflake.ml.model.task.Task.TABULAR_BINARY_CLASSIFICATION` * `snowflake.ml.model.task.Task.TABULAR_REGRESSION` * `snowflake.ml.model.task.Task.TABULAR_MULTI_CLASSIFICATION` |
| `user_files` | A dictionary mapping stage subdirectory path to a list of local filepaths. Filepaths may use `?` and `*` wildcards. For example, `{"subdir": ["/path/to/my_file.json"]}` will upload `my_file.json` along with model files into the `subdir` stage subdirectory.  For snowflake-ml-python versions >=1.7.3 and <1.8.1, the user must set the following flag for user files to be included:  ```python from snowflake.ml.model._model_composer.model_manifest import (     model_manifest ) model_manifest.ModelManifest._ENABLE_USER_FILES = True ``` |

`log_model` returns a `snowflake.ml.model.ModelVersion` object, which represents the version of the model
that was added to the registry.

After registration, the model itself cannot be modified (although you can change its metadata). To delete a model and all
its versions, use the registry’s delete_model method.

## Working with dependencies and target platforms

| **target_platforms** | **Model Types** | **Default behavior of** `log_model()` | **Other options** |
| --- | --- | --- | --- |
| `[“SNOWPARK_CONTAINER_SERVICES”]`  `snowflake.ml.model.target_platform.SNOWPARK_CONTAINER_SERVICES_ONLY`  (default in Container runtime) | Built-in model type | * `pip_requirements` are automatically populated. * Package versions are picked up automatically from the   environment. * The model will not be runnable in `WAREHOUSE`. | * Users can override dependencies by specifying `conda_dependencies` and/or `pip_requirements`. |
| Custom Model | * The model will not be runnable in `WAREHOUSE`. * Users must provide all dependencies in   either of `conda_dependencies` and `pip_requirements`. |  |
| `[“WAREHOUSE”]`  `snowflake.ml.model.target_platform.WAREHOUSE_ONLY`  (partitioned models) | Built-in model type | * `conda_dependencies` are automatically populated. * Package versions are picked up automatically from the   environment. * If the model is not runnable in `WAREHOUSE`, `log_model()`   will fail. | * Users can override dependencies by specifying `conda_dependencies` and/or `pip_requirements`. * To use a PyPI repository in the warehouse, use Artifact Repository (currently a preview feature). See `artifact_repository_map` below. |
| Custom Model | * If the model is not runnable in `WAREHOUSE`, `log_model()`   will fail. * Users must provide all the dependencies in   `conda_dependencies` and/or `pip_requirements`. | * To use a PyPI repository in the warehouse, use Artifact Repository (currently a preview feature). See `artifact_repository_map` below. |
| `[“WAREHOUSE”, “SNOWPARK_CONTAINER_SERVICES”]`  `snowflake.ml.model.target_platform.BOTH_WAREHOUSE_AND_SNOWPARK_CONTAINER_SERVICES`  (default everywhere except in Container runtime) | Built-in model type | * `conda_dependencies` are automatically populated. * Package versions are picked up automatically from the   environment. * If the model is not runnable in `WAREHOUSE`, `log_model()`   will fail. | * Users can override dependencies by specifying `conda_dependencies` and/or `pip_requirements`. * To use a PyPI repository in the warehouse, use Artifact Repository (currently a preview feature). See `artifact_repository_map` below. |
| Custom Model | * If the model is not runnable in `WAREHOUSE`, `log_model()`   will fail. * Users must provide all the dependencies in   `conda_dependencies` and/or `pip_requirements`. | * To use a PyPI repository in the warehouse, use Artifact Repository (currently a preview feature). See `artifact_repository_map` below. |

## Working with model artifacts

After a model has been logged, its artifacts (the files backing the model, including its serialized Python objects and
various metadata files such as its manifest) are available on an internal stage. Artifacts cannot be modified, but you
can view or download the artifacts of models you own.

> **Note:**
>
> Having the USAGE privilege on a model does not allow you to access its artifacts; ownership is required.

You can access model artifacts from a stage using, for example, the [GET command](../../../sql-reference/sql/get.md)
or its equivalent in Snowpark Python,
[FileOperation.get](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.FileOperation.get).

However, you cannot address model artifacts using the usual stage path syntax. Instead, use a `snow://` URL, a more
general way to specify the location of objects in Snowflake. For example, a version inside a model can be specified by a
URL of the form `snow://model/<model_name>/versions/<version_name>/`.

Knowing the of name of the model and the version you want, you can use the
[LIST command](../../../sql-reference/sql/list.md) to view the artifacts of the model as follows:

```sqlexample
LIST 'snow://model/my_model/versions/V3/';
```

The output resembles:

```output
name                                      size                  md5                      last_modified
versions/V3/MANIFEST.yml           30639    2f6186fb8f7d06e737a4dfcdab8b1350        Thu, 18 Jan 2024 09:24:37 GMT
versions/V3/functions/apply.py      2249    e9df6db11894026ee137589a9b92c95d        Thu, 18 Jan 2024 09:24:37 GMT
versions/V3/functions/predict.py    2251    132699b4be39cc0863c6575b18127f26        Thu, 18 Jan 2024 09:24:37 GMT
versions/V3/model.zip             721663    e92814d653cecf576f97befd6836a3c6        Thu, 18 Jan 2024 09:24:37 GMT
versions/V3/model/env/conda.yml          332        1574be90b7673a8439711471d58ec746        Thu, 18 Jan 2024 09:24:37 GMT
versions/V3/model/model.yaml       25718    33e3d9007f749bb2e98f19af2a57a80b        Thu, 18 Jan 2024 09:24:37 GMT
```

To retrieve one of these artifacts, use the SQL GET command:

```sqlexample
GET 'snow://model/model_my_model/versions/V3/MANIFEST.yml' file::///tmp/my_model/
```

Or the equivalent with Snowpark Python:

```python
session.file.get('snow://model/my_model/versions/V3/MANIFEST.yml', 'model_artifacts')
```

> **Note:**
>
> The names and organization of a model’s artifacts can vary depending on the type of the model and might change.
> The preceding example artifact list is intended to be illustrative, not authoritative.

## Deleting models

Use the registry’s `delete_model` method to delete a model and all its versions:

```python
reg.delete_model("mymodel")
```

> **Tip:**
>
> You can also delete models in SQL using [DROP MODEL](../../../sql-reference/sql/drop-model.md).

## Getting models from the registry

To get information about each model, use the `show_models` method:

```python
model_df = reg.show_models()
```

> **Tip:**
>
> In SQL, use [SHOW MODELS](../../../sql-reference/sql/show-models.md) to get a list of models.

The result of `show_models` is a pandas DataFrame. The available columns are listed here:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the model was created. |
| `name` | Name of the model. |
| `database_name` | Database in which the model is stored. |
| `schema_name` | Schema in which the model is stored. |
| `owner` | Role that owns the model. |
| `comment` | Comment for the model. |
| `versions` | JSON array listing versions of the model. |
| `default_version_name` | Version of the model used when referring to the model without a version. |

To get a list of the models in the registry instead, each as a `Model` instance, use the `models` method:

```python
model_list = reg.models()
```

To get a reference to a specific model from the registry by name, use the registry’s `get_model` method:

```python
m = reg.get_model("MyModel")
```

> **Note:**
>
> `Model` instances are not copies of the original logged Python model object; they are references to the underlying
> model object in the registry.

After you have a reference to a model, either one from the list returned by the `models` method or one retrieved using
`get_model`, you can work with its metadata and
its versions.

## Viewing and updating a model’s metadata

You can view and update a model’s metadata attributes in the registry, including its name, comment, tags, and metrics.

### Retrieving and updating comments

Use the model’s `comment` attribute to retrieve and update the model’s comment:

```python
print(m.comment)
m.comment = "A better description than the one I provided originally"
```

> **Note:**
>
> The `description` attribute is a synonym for `comment`. The previous code can also be written this way:
>
> ```python
> print(m.description)
> m.description = "A better description than the one I provided originally"
> ```

> **Tip:**
>
> You can also set a model’s comment in SQL by using [ALTER MODEL](../../../sql-reference/sql/alter-model.md).

### Retrieving and updating tags

Tags are metadata used to record a model’s purpose, algorithm, training data set, lifecycle stage, or other information
you choose. You can set tags when the model is registered or at any time afterward. You can also update the values of
existing tags or remove tags entirely.

> **Note:**
>
> You must define the names of all tags (and potentially their possible values) first by using CREATE TAG. See
> [Introduction to object tagging](../../../user-guide/object-tagging/introduction.md).

To get all of a model’s tags as a Python dictionary, use `show_tags`:

```python
print(m.show_tags())
```

To add a new tag or change the value of an existing tag, use `set_tag`:

```python
m.set_tag("live_version", "v1")
```

To retrieve the value of a tag, use `get_tag`:

```python
m.get_tag("live_version")
```

To remove a tag, use `unset_tag`:

```python
m.unset_tag("live_version")
```

> **Tip:**
>
> You can also set a model’s comment in SQL by using [ALTER MODEL](../../../sql-reference/sql/alter-model.md).

### Renaming a model

Use the `rename` method to rename or move a model. Specify a fully qualified name as the new name to move the model to
a different database or schema.

```python
m.rename("MY_MODEL_TOO")
```

> **Tip:**
>
> You can also rename a model in SQL using [ALTER MODEL](../../../sql-reference/sql/alter-model.md).

## Working with model versions

A model can have unlimited versions, each identified by a string. You can use any version naming convention that you
like. Logging a model actually logs a *specific version* of the model. To log additional versions of a model, call
`log_model` again with the same `model_name` but a different `version_name`.

> **Tip:**
>
> In SQL, use [SHOW VERSIONS IN MODEL](../../../sql-reference/sql/show-versions-in-model.md) to see the versions of a model.

A version of a model is represented by an instance of the `snowflake.ml.model.ModelVersion` class.

To get a list of all the versions of a model, call the model object’s `versions` method. The result is a list of
`ModelVersion` instances:

```python
version_list = m.versions()
```

To get information about each model as a DataFrame instead, call the model’s `show_versions` method:

```python
version_df = m.show_versions()
```

The resulting DataFrame contains the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the model version was created. |
| `name` | Name of the version. |
| `database_name` | Database in which the version is stored. |
| `schema_name` | Schema in which the version is stored. |
| `model_name` | Name of the model that this version belongs to. |
| `is_default_version` | Boolean value indicating whether this version is the model’s default version. |
| `functions` | JSON array of the names of the functions available in this version. |
| `metadata` | JSON object containing metadata as key-value pairs (`{}` if no metadata is specified). |
| `user_data` | JSON object from the `user_data` section of the model definition manifest (`{}` if no user data is specified). |

### Deleting model versions

You can delete a model version by using the model’s `delete_version` method:

```python
m.delete_version("rc1")
```

> **Tip:**
>
> You can also delete a model version in SQL by using [ALTER MODEL … DROP VERSION](../../../sql-reference/sql/alter-model-drop-version.md).

### Default version

A version of a model can be designated as the default model. Retrieve or set the model’s `default` attribute to obtain
the current default version (as a `ModelVersion` object) or to change it (using a string):

```python
default_version = m.default
m.default = "v2"
```

> **Tip:**
>
> In SQL, use [ALTER MODEL](../../../sql-reference/sql/alter-model.md) to set the default version.

### Model version aliases

You can assign an alias to a model version by using the SQL [ALTER MODEL](../../../sql-reference/sql/alter-model.md) command.
You can use an alias wherever a version name is required, such as when getting a reference to a model version, in Python
or in SQL. A given alias can be assigned to only one model version at a time.

In addition to aliases you create, the following system aliases are available in all models:

* `DEFAULT` refers to the default version of the model.
* `FIRST` refers to the oldest version of the model by creation time.
* `LAST` refers to the newest version of the model by creation time.

Alias names you create must not be the same as any existing version name or alias in the model, including system aliases.

### Getting a reference to a model version

To get a reference to a specific version of a model as a `ModelVersion` instance, use the model’s `version` method.
Use the model’s `default` attribute to get the default version of the model:

```python
m = reg.get_model("MyModel")

mv = m.version("v1")
mv = m.default
```

After you have a reference to a specific version of a model (such as the variable `mv` in this example), you can
retrieve or update its comments or metrics and call the model’s methods (or functions) as shown in the following sections.

### Retrieving and updating comments

As with models, model versions can have comments, which can be accessed and set via the model version’s `comment` or
`description` attribute:

```python
print(mv.comment)
print(mv.description)

mv.comment = "A model version comment"
mv.description = "Same as setting the comment"
```

> **Tip:**
>
> You can also change a model version’s comment in SQL by using [ALTER MODEL … MODIFY VERSION](../../../sql-reference/sql/alter-model-modify-version.md).

### Retrieving and updating metrics

Metrics are key-value pairs used to track prediction accuracy and other model version characteristics. You can set
metrics when creating a model version or set them using the `set_metric` method. A metric value can be any Python
object that can be serialized to JSON, including numbers, strings, lists, and dictionaries. Unlike tags, metric names
and possible values do not need to be defined in advance.

A test accuracy metric might be generated using sklearn’s `accuracy_score`:

```python
from sklearn import metrics

test_accuracy = metrics.accuracy_score(test_labels, prediction)
```

The confusion matrix can be generated similarly using sklearn:

```python
test_confusion_matrix = metrics.confusion_matrix(test_labels, prediction)
```

Then you can set these values as metrics:

```python
# scalar metric
mv.set_metric("test_accuracy", test_accuracy)

# hierarchical (dictionary) metric
mv.set_metric("evaluation_info", {"dataset_used": "my_dataset", "accuracy": test_accuracy, "f1_score": f1_score})

# multivalent (matrix) metric
mv.set_metric("confusion_matrix", test_confusion_matrix)
```

To retrieve a model version’s metrics as a Python dictionary, use `show_metrics`:

```python
metrics = mv.show_metrics()
```

To delete a metric, call `delete_metric`:

```python
mv.delete_metric("test_accuracy")
```

> **Tip:**
>
> You can also modify a model version’s metrics (which are stored as metadata) in SQL by using
> [ALTER MODEL … MODIFY VERSION](../../../sql-reference/sql/alter-model-modify-version.md).

### Retrieving model explanations

The model registry can explain a model’s results, telling you which input features contribute most to predictions, by calculating
[Shapley values](https://towardsdatascience.com/the-shapley-value-for-ml-models-f1100bff78d1). This preview feature is available by default in all
model views created in Snowflake 8.31 and later through the underlying model’s `explain` method. You can call `explain` from SQL or via a model view’s
`run` method in Python.

For details on this feature, see [Model Explainability](model-explainability.md).

### Exporting a model version

Use `mv.export` to export a model’s files to a local directory; the directory is created if it does not exist:

```python
mv.export("~/mymodel/")
```

By default, the exported files include the code, the environment to load the model, and model weights. To also
export the files needed to run the model in a warehouse, specify `export_mode = ExportMode.FULL`:

```python
mv.export("~/mymodel/", export_mode=ExportMode.FULL)
```

### Loading a model version

Use `mv.load` to load the original Python model object that was originally added to the registry. You can then
use the model for inference just as though you had defined it in your Python code:

```python
clf = mv.load()
```

To ensure proper functionality of a model loaded from the registry, the target Python environment (that is, the
versions of the Python interpreter and of all libraries) should be identical to the environment from which the model
was logged. Specify `force=True` in the `load` call to force the model to be loaded even if the environment is
different.

> **Tip:**
>
> To make sure your environment is the same as the one where the model is hosted, download a copy of the conda environment
> from the model registry:
>
> ```python
> conda_env = session.file.get("snow://model/<modelName>/versions/<versionName>/runtimes/python_runtime/env/conda.yml", ".")
> open("~/conda.yml", "w").write(conda_env)
> ```
>
> Then create a new conda environment from this file:
>
> ```bash
> conda env create --name newenv --file=~/conda.yml
> conda activate newenv
> ```

The optional `options` argument is a dictionary of options for loading the model. Currently, the argument supports
only the `use_gpu` option.

| Option | Type | Description | Default |
| --- | --- | --- | --- |
| `use_gpu` | `bool` | Enables GPU-specific loading logic. | `False` |

The following example illustrates the use of the `options` argument:

```python
clf = mv.load(options={"use_gpu": True})
```

## Calling model methods

Model versions can have *methods,* which are attached functions that can be executed to perform inference or other model
operations. The versions of a model can have different methods, and the signatures of these methods can also differ.

To call a method of a model version, use `mv.run`, where `mv` is a `ModelVersion` object. Specify the name of the
function to be called and pass a Snowpark or pandas DataFrame that contains the inference data, along with any required
parameters. The method is executed in a Snowflake warehouse.

The return value of the method is a Snowpark or pandas DataFrame, matching the type of DataFrame passed in.
Snowpark DataFrames are evaluated lazily, so the method is run only when the DataFrame’s `collect`, `show`,
or `to_pandas` method is called.

> **Note:**
>
> Invoking a method runs it in the warehouse specified in the session you’re using to connect to the registry.
> See [Specifying a Warehouse](../snowpark-ml.md).

The following example illustrates running the `predict` method of a model. This model’s `predict` method does not
require any parameters besides the inference data (`test_features` here). If it did, they would be passed as
additional arguments after the inference data.

```python
remote_prediction = mv.run(test_features, function_name="predict")
remote_prediction.show()   # assuming test_features is Snowpark DataFrame
```

To see what methods can be called on a given model, call `mv.show_functions`. The return value of this method is a
list of `ModelFunctionInfo` objects. Each of these objects includes the following attributes:

* `name`: The name of the function that can be called from Python or SQL.
* `target_method`: The name of the Python method in the original logged model.

> **Tip:**
>
> You can also call model methods in SQL. See [Inference from SQL](../inference/native-batch-inference-sql.md).

## Sharing models

Models can both be shared and replicated. The following privileges are grantable to a shared model:

* USAGE: Allows the grantee to use the model for warehouse inference without being able to see any of its internals.
* READ: Allows the grantee to use the model for SPCS inference and also see its metadata. Metadata includes, but isn’t limited to, comments, tags, and metrics.

## Cost considerations

Using the Snowflake Model Registry incurs standard Snowflake consumption-based costs. These include:

* Cost of storing model artifacts, metadata, and functions. For general information about storage costs, see [Exploring storage cost](../../../user-guide/cost-exploring-data-storage.md).
* Cost of copying files between stages to Snowflake. See [COPY FILES](../../../sql-reference/sql/copy-files.md).
* Cost of serverless model object operations through the Snowsight UI or the SQL or Python interface, such as
  showing models and model versions and altering model comments, tags, and metrics.
* Warehouse compute costs, which vary depending on the type of model and the quantity of data used in inference.
  For general information about Snowflake compute costs, see [Understanding compute cost](../../../user-guide/cost-understanding-compute.md).
  Warehouse compute costs are incurred for:

  + Model and version creation operations
  + Invoking a model’s methods

---
title: Snowflake Model Registry user interface
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/snowsight-ui.md
section: Snowflake ML
---

# Snowflake Model Registry user interface

> **Note:**
>
> Model Registry Snowsight UI is generally available in all deployments.
>
> Inference Services UI for SPCS Model Serving is in preview in AWS, Azure and GCP commercial deployments only.

On the Models page in Snowsight, you can find all your machine learning models. You can also view their metadata and deployments.

## Model details

The Models page displays the models that you’ve created and logged into the Snowflake Model Registry or have access to the models via the USAGE or READ privilege.
These are both models that have been developed with Snowpark ML and externally sourced models (such as models from Hugging Face).
It also shows [Cortex Fine-tuned](../../../user-guide/snowflake-cortex/cortex-finetuning.md) models, and may include other model types that you can create in Snowflake in future releases.

To display the Models page, in the navigation menu, select AI & ML » Models. The resulting list contains all the models in the Snowflake Model Registry in all the databases and schemas that your current role has access to.

> **Note:**
>
> If you don’t see any models, make sure your role has the
> [required privileges](overview.md).

To open a model’s details page, select the corresponding row in the Models list. The details page displays
key model information, including the model’s description, tags, and versions.

To edit the model description or delete the model, select … in the top right corner.

To open the version’s details page, select a model version. This page displays model version metadata, such
as metrics, and a list of available methods that can be called from Python or SQL.

To view code that calls the model function, select the SQL or Python link next to it. You can copy this code
snippet into a Snowsight SQL worksheet or a Python notebook.

To add or modify metadata or delete the model version, select the … in top right corner.

The Files tab contains a list of the model version’s underlying artifacts. You can download individual files from this page. This page is only available if the user has either OWNERSHIP or READ privilege on the model.

The Lineage tab shows the full data flow lineage information for the model, including any datasets that were used to train the model, any feature views from Feature Store, and the source data tables.

## Deploy user models

You can deploy models to SPCS Model Serving directly from the Model Registry page.

> **Note:**
>
> Snowflake Model Registry only supports deploying user models to SPCS Model Serving.

To deploy a model, complete the following steps:

1. Select a model from the list of models.
2. From the model details page, navigate to the Versions section.
3. To open the version details page, select a model version from the list of versions.
4. From the version details page, select the Deploy button.
5. From the opened pane, enter a name for the service to be deployed.
6. Select whether to create a REST API endpoint for the deployed service.
7. Select a compute pool for the deployed service.
8. (Optional) To customize performance and resource usage, you can adjust details, such as the number of workers, CPU, and memory, from the advanced settings.
9. Select Deploy.

   The deployment process can take up to 15 minutes to create the service.

After the deployment is complete, the service is displayed on the Inference Services tab on the main Model Registry page.

## Import and deploy models from an external service

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

> **Note:**
>
> Currently, only [Hugging Face](https://huggingface.co/) is supported as a model provider.

You can import pre-trained models from an external provider and deploy them as Snowflake services for inference. To import an external model, follow these steps:

1. In the navigation menu, select AI & ML » Models.
2. Select Import model.

   > The Import model dialog opens.
3. In the Model handle field, enter the model handle from your provider, or select one from the list of Snowflake-verified models.
4. In the Task list, select the task that the model will perform.
5. Optional: To enable downloading custom Python code from the model repository, select the Trust remote code checkbox.

   > > **Warning:**
   > >
   > > Allowing models to download arbitrary code should be considered a security risk. Only allow remote code from models you’ve thoroughly evaluated and trust to run on Snowflake.
6. Optional: To import a gated model, enter the name of the Snowflake secret for your Hugging Face token in the Hugging Face token secret field.

   > Your Hugging Face token secret should be a generic text secret, with your Hugging Face token as a value. For information on how to create a generic text secret, see [CREATE SECRET](../../../sql-reference/sql/create-secret.md).
7. Optional: Expand Advanced settings:

   > 1. To perform input and output token conversion for your model, in the Tokenizer model field, enter a tokenizer model.
   > 2. To add a hyperparameter, select Add parameter, and then enter a name and value that are recognized by the model.
8. In the Model name field, enter a name for use in the Snowflake model registry.
9. In the Version name field, enter a version for registration.
10. In the Database and schema list, select a database to link this model to.
11. Optional: Expand Advanced settings:

    > 1. To add pip requirements to the model’s runtime environment, select Add Pip requirement, and then add a pip [requirement specifier](https://pip.pypa.io/en/stable/reference/requirement-specifiers/) for your package. Only packages served from PyPi are supported.
    > 2. In the Comment field, enter any useful information about the model.
12. Select Continue to deployment.

    > The Deploy (model handle) dialog opens.

To deploy your model, follow these steps:

1. In the Service name field, enter a name that the service will run under.

   > Snowflake provides a default based on the model name and version.
2. Optional: To change whether an API endpoint is automatically created for your model’s service, select or clear Create REST API endpoint.
3. In the Compute pool list, select an existing compute pool for the service to run on.
4. Optional: Adjust the number of instances in the compute pool that the service runs on.

   > The maximum is limited by the number of nodes in your compute pool.
5. Optional for CPU compute pools: To provide details for the service’s available resources in the compute pool, expand Advanced settings:

   > * Number of workers
   > * Max batch rows
   > * CPU: The number of virtual cores, in milli-units
   > * GPU: The number of physical GPUs (**Required** for GPU compute pools)
   > * Memory: The amount of maximum available memory
6. To import the model and create the service that users access your model through, select Deploy.

   > You can also cancel the model import or return to the model details.

Once deployment starts, a dialog displays a Query ID. This query creates the jobs to import the model and deploy your service; it is **not** a query to monitor either job.

1. Do one of the following:

   > * To dismiss the dialog, select Done.
   > * To monitor the query, select Open query monitoring.

Snowflake performs the following actions for your model and service deployment:

> * Downloads the required files from your provider.
> * Uploads and logs the model to your model registry.
> * Creates a model-specific container image for your service to run in.
> * Deploys the model image as a service.

> **Note:**
>
> The length of time that Snowflake takes to perform these operations is dependent on several factors, including the model size, available compute resources, and network setup.

If an error occurs in deployment, find the associated SQL query for more information. In the navigation menu, select Monitoring » Query History to find your deployment query, which contains a call to `SYSTEM$DEPLOY_MODEL`.

### Monitoring model and service deployment

When external models are loaded and prepared for deployment, Snowflake automatically starts registering the associated service. Monitor the deployment by following these steps:

1. In the navigation menu, select Monitoring » Services & jobs.
2. On the Jobs tab, select the job that matches your service’s location and compute pool, created at the time you started the import.

   > This job has a name in the form `MODEL_DEPLOY_IDENTIFIER`. Each service deployment performed by a model import creates a unique identifier for the associated jobs.
3. To monitor the model deployment, select the Logs tab.

   > When the model deployment is complete, Snowflake starts a job to build and deploy your service.
4. Return to the Jobs tab, and select the job named `MODEL_BUILD_IDENTIFIER`.

   > This identifier is the same as your model deployment job.
5. To monitor the service container build, return to the Logs tab.

   > When this job is complete, your service is deployed and ready.

## Model inference services

You can see the model inference services created with SPCS Model Serving in the Model Registry UI. The main model listing page shows the status of inference services created for any model.

If you select model name and a model version, you can use the Inference Services tab in the model version details page to see more details about the deployed inference service, as well as suspend the inference service. This also shows the list of functions that the service exposes. And you can see or copy the SQL or Python usage code snippet.

Select Open Details to display service parameters. To view more details about the deployed inference service, select Open Service Details from the service parameters pane.
You can also access the service details from the Inference Services tab on the main Model Registry page.

## Model monitoring

For any models that have Model Monitors attached to them, you can visualize model monitoring metrics using the Model Monitors in the model details page.

Select the desired model monitors to display the Monitoring dashboard:

Select Compare to view the menu of model version select a second model version to compare this model version against:

Monitoring supports a large number of model accuracy, model drift, and feature drift metrics.
To select the metrics that are computed and displayed, select Settings icon to choose the desired metrics:

---
title: Snowflake Multi-Node ML Jobs
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/ml-jobs/distributed-ml-jobs.md
section: Snowflake ML
---

# Snowflake Multi-Node ML Jobs

Use Snowflake Multi-Node ML Jobs to run distributed machine learning (ML) workflows inside Snowflake ML container runtimes across multiple compute nodes.
Distribute work across multiple nodes to process large datasets and complex models with improved performance. For information about Snowflake ML Jobs, see [Snowflake ML Jobs](overview.md).

Snowflake Multi-Node ML Jobs extend Snowflake ML Job capabilities by enabling distributed execution across multiple nodes. This brings you:

* **Scalable Performance**: Horizontally scale to process datasets too large to fit on a single node
* **Reduced Training Time**: Speed up complex model training through parallelization
* **Resource Efficiency**: Optimize resource utilization for data-intensive workloads
* **Framework Integration**: Seamlessly use distributed frameworks like [Distributed Modeling Classes](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling_distributors) and [Ray](https://www.ray.io/).

When you run a Snowflake ML Job with multiple nodes, the following occurs:

* One node serves as the head node (coordinator)
* Additional nodes serve as worker nodes (compute resources)
* Together, the nodes form a single logical ML job entity in Snowflake

A single-node ML Job only has a head node. A multi-node job with three active nodes has one head node and two worker nodes. All three nodes participate in running your workload.

## Prerequisites

The following prerequisites are required to use Snowflake Multi-Node ML Jobs.

To set up multi-node jobs, do the following:

1. Install the Snowflake ML Python package.

   ```bash
   pip install snowflake-ml-python>=1.9.2
   ```
2. Create a compute pool with enough nodes to support your multi-node job:

   ```sqlexample
   CREATE COMPUTE POOL IF NOT EXISTS MY_COMPUTE_POOL
     MIN_NODES = 1
     MAX_NODES = <NUM_INSTANCES>
     INSTANCE_FAMILY = <INSTANCE_FAMILY>;
   ```

   > **Important:**
   >
   > You must set MAX_NODES to be greater than or equal to the number of target instances that you’re using to run your training job.
   > If you request more nodes than you intend to use for your training job, it might fail or behave unpredictably.
   > For information about running a training job, see Running multi-node ML jobs.

## Writing code for multi-node jobs

For multi-node jobs, your code needs to be designed for distributed processing using
[Distributed Modeling Classes](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling_distributors)
or [Ray](https://www.ray.io/).

The following are key patterns and considerations when you use distributed modeling classes or Ray:

### Understanding node initialization and availability

In multi-node jobs, worker nodes can initialize asynchronously and at different times:

* Nodes might not all start simultaneously, especially if compute pool resources are limited
* Some nodes might start seconds or even minutes after others
* ML Jobs automatically wait for the specified `target_instances` to be available before executing your payload.
  The job fails with an error if the expected nodes aren’t available within the timeout period.
  For more information on customizing this behavior, see Advanced Configuration: Using min_instances.

You can check available nodes in your job through Ray:

```python
import ray
ray.init(address="auto", ignore_reinit_error=True)  # Ray is automatically initialized in multi-node jobs
nodes_info = ray.nodes()
print(f"Available nodes: {len(nodes_info)}")
```

### Distributed Processing Patterns

There are multiple patterns you can apply in the payload body of the multi-node job for distributed processing. These patterns leverage [Distributed Modeling Classes](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling_distributors) and [Ray](https://www.ray.io/):

#### Using Snowflake’s Distributed Training API

Snowflake provides optimized trainers for common ML frameworks:

```python
# Inside the ML Job payload body
from snowflake.ml.modeling.distributors.xgboost import XGBEstimator, XGBScalingConfig

# Configure scaling for distributed execution
scaling_config = XGBScalingConfig()

# Create distributed estimator
estimator = XGBEstimator(
    n_estimators=100,
    params={"objective": "reg:squarederror"},
    scaling_config=scaling_config
)

# Train using distributed resources
# NOTE: data_connector and feature_cols excluded for brevity
model = estimator.fit(data_connector, input_cols=feature_cols, label_col="target")
```

For more information about the available APIs, see [Distributed Modeling Classes](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/modeling_distributors) .

#### Using Native Ray Tasks

Another approach is to use Ray’s task-based programming model:

```python
# Inside the ML Job payload body
import ray

@ray.remote
def process_chunk(data_chunk):
    # Process a chunk of data
    return processed_result

# Distribute work across available workers
data_chunks = split_data(large_dataset)
futures = [process_chunk.remote(chunk) for chunk in data_chunks]
results = ray.get(futures)
```

For more information, see [Ray’s task programming documentation](https://docs.ray.io/en/latest/ray-core/tasks.html).

## Running multi-node ML jobs

You can run multi-node ML jobs using the same methods as single-node jobs, using the `target_instances` parameter:

### Using the Remote Decorator

```python
from snowflake.ml.jobs import remote

@remote(
    "MY_COMPUTE_POOL",
    stage_name="payload_stage",
    session=session,
    target_instances=3  # Specify the number of nodes
)
def distributed_training(data_table: str):

    from snowflake.ml.modeling.distributors.xgboost import XGBEstimator, XGBScalingConfig

    # Configure scaling for distributed execution
    scaling_config = XGBScalingConfig()

    # Create distributed estimator
    estimator = XGBEstimator(
        n_estimators=100,
        params={"objective": "reg:squarederror"},
        scaling_config=scaling_config
    )

    # Train using distributed resources
    # NOTE: data_connector and feature_cols excluded for brevity
    model = estimator.fit(data_connector, input_cols=feature_cols, label_col="target")

job = distributed_training("<my_training_data>")
```

### Running a Python File

```python
from snowflake.ml.jobs import submit_file

job = submit_file(
    "<script_path>",
    "MY_COMPUTE_POOL",
    stage_name="<payload_stage>",
    session=session,
    target_instances=<num_training_nodes>  # Specify the number of nodes
)
```

### Running a Directory

```python
from snowflake.ml.jobs import submit_directory

job = submit_directory(
    "<script_directory>",
    "MY_COMPUTE_POOL",
    entrypoint="<script_name>",
    stage_name="<payload_stage>",
    session=session,
    target_instances=<num_training_nodes>  # Specify the number of nodes
)
```

### Advanced Configuration: Using min_instances

For more flexible resource management, you can use the optional `min_instances` parameter to specify a minimum number of instances required for the job to proceed.
If `min_instances` is set, the job payload is executed as soon as the minimum number of nodes becomes available, even if that number is smaller than `target_instances`.

This is useful when you want to:

* Start training with fewer nodes if the full target isn’t immediately available
* Reduce wait times when compute pool resources are limited
* Implement fault-tolerant workflows that can adapt to varying resource availability

```python
from snowflake.ml.jobs import remote

@remote(
    "MY_COMPUTE_POOL",
    stage_name="payload_stage",
    session=session,
    target_instances=5,  # Prefer 5 nodes
    min_instances=3      # But start with at least 3 nodes
)
def flexible_distributed_training(data_table: str):
    import ray

    # Check how many nodes we actually got
    available_nodes = len(ray.nodes())
    print(f"Training with {available_nodes} nodes")

    # Adapt your training logic based on available resources
    from snowflake.ml.modeling.distributors.xgboost import XGBEstimator, XGBScalingConfig

    scaling_config = XGBScalingConfig(
        num_workers=available_nodes
    )

    estimator = XGBEstimator(
        n_estimators=100,
        params={"objective": "reg:squarederror"},
        scaling_config=scaling_config
    )

    # Train using available distributed resources
    model = estimator.fit(data_connector, input_cols=feature_cols, label_col="target")

job = flexible_distributed_training("<my_training_data>")
```

## Managing Multi-Node Jobs

### Monitoring Job Status

Job status monitoring is unchanged from single node jobs:

```python
from snowflake.ml.jobs import MLJob, get_job, list_jobs

# List all jobs
jobs = list_jobs()

# Retrieve an existing job based on ID
job = get_job("<job_id>")  # job is an MLJob instance

# Basic job information
print(f"Job ID: {job.id}")
print(f"Status: {job.status}")  # PENDING, RUNNING, FAILED, DONE

# Wait for completion
job.wait()
```

### Accessing Logs by Node

In multi-node jobs, you can access logs from specific instances:

```python
# Get logs from the default (head) instance
logs_default = job.get_logs()

# Get logs from specific instances by ID
logs_instance0 = job.get_logs(instance_id=0)
logs_instance1 = job.get_logs(instance_id=1)
logs_instance2 = job.get_logs(instance_id=2)

# Display logs in the notebook/console
job.show_logs()  # Default (head) instance logs
job.show_logs(instance_id=0)  # Instance 0 logs (not necessarily the head node)
```

## Common Issues and Limitations

Use the following information to address common issues that you might encounter.

* **Node Connection Failures**: If worker nodes fail to connect to the head node, it’s possible that the head node completes its task and then turns itself down before the worker finishes its job. To avoid connection failures, implement result collection logic in the job.
* **Memory Exhaustion**: If jobs fail due to memory issues, increase the node size or use more nodes with less data per node.
* **Node Availability Timeout**: If the required number of instances (either `target_instances` or `min_instances`) are not available within the predefined timeout, the job will fail. Ensure your compute pool has sufficient capacity or adjust your instance requirements.

---
title: Snowflake Native Batch Inference (SQL)
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/native-batch-inference-sql.md
section: Snowflake ML
---

# Snowflake Native Batch Inference (SQL)

Use the Snowflake Model Registry to execute batch inference calls to your models. You can integrate these batch inference calls into your Snowflake workflows. With Snowflake Native Batch Inference, you can do the following:

* Integrate into your Snowflake SQL, Snowpark Python, Streaming, and Dynamic Tables workflows.
* Use third party tools such as dbt with the results of your inference calls

## Where to run a model?

### Selecting a Runtime Environment

You can either host a model on a Virtual Warehouse or a SPCS compute pool. Use the following information to determine where you’re hosting your model.

| Runtime | Best For… | Avoid When… |
| --- | --- | --- |
| Virtual Warehouse | • In-Database Batch Inference: Executing models as native SQL functions.  • Zero-Ops Experience: Leveraging existing warehouses without managing compute pools.  • Small Models: CPU-runnable models (e.g., scikit-learn, XGBoost). | • Hardware Constraints: The model requires a GPU for execution.  • Memory Limits: Model size exceeds 15GB. (this limit is lower for smaller warehouse sizes). |
| Snowpark Container Services (SPCS) | • Large Models: Optimized for LLMs, deep learning models requiring high memory, or models requiring GPUs.  • Custom Environments: For specific pip packages or a custom OS-level environment not found in standard warehouses. | • Organizational Policy: SPCS is not yet approved or enabled in your account.  • Sufficient warehouse compute: If your warehouse can process your batch inference requests, you don’t need the additional compute from SPCS. |

### To run model on warehouse

To host your model on a warehouse, specify WAREHOUSE in the target_platforms argument to log your model. For more information, see working with dependencies and target platforms.

For information about whether an existing model is runnable in a warehouse, run SHOW VERSIONS IN MODEL. If the runnable_in column has WAREHOUSE as a value, you can run it.

### To run model on SPCS

To use your model in SPCS, you must deploy the model as a service. For more information about deploying a model to SPCS, see deploying the model for online inference. Make sure auto-suspend is active.

> **Note:**
>
> If your service is suspended, it automatically resumes when there’s an inference request. However, if service fails to resume within a specified timeframe, the query might fail. Queries might fail to resume if there are a lack of available nodes in the compute pool. To mitigate this risk, you can explicitly resume the service and wait for its availability.

Use an XSMALL or SMALL warehouse to route your inference requests to the SPCS compute pool. A warehouse can run multiple threads per node and send a large number of inference requests with each request. Consequently, the service operating within SPCS can be easily overwhelmed. Therefore, the recommendation is to utilize an XSMALL or SMALL warehouse when the model is deployed to SPCS.

## Inference from Python

If you’re using the Snowflake Python API to make inference requests, you must have the snowflake-ml-python package.

### Connect to Model Registry

Retrieve the model that you’re using for inference requests from the model registry. Use the following code to retrieve the model:

```python
from snowflake.ml.registry import Registry

registry = Registry(session=session, database_name=DATABASE, schema_name=REGISTRY_SCHEMA)
mv = registry.get_model('my_model').version('my_version')  # returns ModelVersion
```

### Run batch inference job

Use the run method of your model version object to run a batch inference job. Using the run method, you can:

* Run an inference job on either a warehouse or an SPCS compute pool.
* Provide a Snowpark or pandas dataframe with the inference data.

The run method returns a dataframe that matches the type of the dataframe that you’ve specified. For example, if you specify a pandas dataframe as the input, you get a pandas dataframe as the output.

> **Note:**
>
> Snowpark DataFrames undergo lazy evaluation. The execution only happens upon the DataFrame’s collect, show, or to_pandas method.

The following example runs a batch inference job on a warehouse:

```python
# Run inference on a warehouse
# mv: snowflake.ml.model.ModelVersion
remote_prediction = mv.run(input_features, function_name="predict")
remote_prediction.show()
```

To run inference on SPCS instead of a warehouse, add the `service_name` argument to the `run` call:

```python
# Run inference on SPCS
# mv: snowflake.ml.model.ModelVersion
remote_prediction = mv.run(input_features, function_name="predict", service_name="example_spcs_service")
remote_prediction.show()
```

To see the methods that you can call from a model, run mv.show_functions. The return value of this method is a list of ModelFunctionInfo objects. Each of these objects includes the following attributes:

* name: The name of the function that can be called from Python or SQL.
* target_method: The name of the Python method in the original logged model.

```python
# Get signature of the inference function in Python
# mv: snowflake.ml.model.ModelVersion
mv.show_functions()
```

### Passing parameters during inference

If the model’s signature includes parameters defined in the
[ParamSpec](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/api/model/snowflake.ml.model.model_signature.ParamSpec) object, you can pass parameter values at inference time using the `params` argument. Any parameter that isn’t included in the dictionary uses its default value from
the signature. The `params` argument works the same way whether you are running inference on a warehouse or on SPCS.

```python
# Pass parameters to override default values
# mv: snowflake.ml.model.ModelVersion
remote_prediction = mv.run(
    input_features,
    function_name="predict",
    params={"temperature": 0.9, "max_tokens": 512}
)
```

## Inference from SQL

Use the following command to understand the functions available and the signature for a model version:

```sqlexample
SHOW FUNCTION IN MODEL mymodel VERSION myversion;
```

### To run model on warehouse

Use the `MODEL(model_name)!method_name(...)` syntax to call or invoke methods of a model. The methods available on a model are determined by the underlying Python model class. For example, many types of models use a method named predict for inference.

To call a method of the default model, use the following syntax. Include any method arguments within the parentheses and specify the table containing the inference data in the FROM clause.

```sqlexample
SELECT MODEL(<model_name>)!<method_name>(...) FROM <table_name>;
```

To invoke a method from a specific version of a model, create an alias to the specific version of the model and call the method through the alias.

Use the following syntax to call a method from a specific version of a model.

```sqlexample
SELECT MODEL(<model_name>,<version_or_alias_name>)!<method_name>(...) FROM <table_name>;
```

The following example uses the LAST alias to call the latest version of a model.

```sqlexample
SELECT MODEL(my_model,LAST)!predict(...) FROM my_table;
```

### Passing parameters in SQL

If the model’s signature includes parameters defined with
[ParamSpec](../model-registry/model-signature.md), you can pass parameter values as
additional arguments after the input columns. Parameters can be specified by position or by name.

When using positional arguments, parameters can be omitted from the right-hand side, and the defaults from the
signature are used:

```sqlexample
-- Pass all parameters positionally (temperature, then max_tokens)
SELECT MODEL(my_model, v1)!predict(input_text, 0.9, 512) FROM my_table;

-- Omit max_tokens from the right; its default value from the signature is used
SELECT MODEL(my_model, v1)!predict(input_text, 0.9) FROM my_table;
```

When using named arguments, all arguments (including input columns) must be specified by name. This lets you pass
only specific parameters regardless of their position:

```sqlexample
SELECT MODEL(my_model, v1)!predict(
    input_text => input_text,
    max_tokens => 512
) FROM my_table;
```

> **Note:**
>
> You must specify all arguments by name or all by position. You cannot mix positional and named arguments in the
> same call.

### To run model on SPCS

Unlike running in a warehouse, functions can be called from a service by calling `service_name!method_name(...)`.

```sqlexample
SELECT <mservice_name>!<method_name>(...) FROM <table_name>;
```

Parameters are passed the same way as with warehouse functions, either all positionally or all by name:

```sqlexample
-- Positional
SELECT my_service!predict(input_text, 0.9, 512) FROM my_table;

-- Named arguments (all arguments must be named)
SELECT my_service!predict(input_text => input_text, max_tokens => 512) FROM my_table;
```

## Continuous Model Inference with Dynamic Tables

Snowflake’s dynamic tables establish a continuous transformation layer atop streaming data. By defining a dynamic table that applies the predictions of a machine learning model to incoming data, one can sustain an automated, continuously operating model inference pipeline on the data without the requirement for manual orchestration.

Consider, for instance, a stream of login events populating a table (LOGINS_RAW), which includes columns such as USER_ID, LOCATION, and a timestamp. This table is subsequently updated with the model’s predictions regarding the login risk for newly-arrived events. Crucially, only new rows are processed with the model’s predictions.

### SQL

Dynamic Tables offer a robust capability for Snowflake users to perform incremental inference on incoming data. Use SQL to define a dynamic table that references the model and applies it to new incoming rows in LOGINS_RAW:

```sqlexample
CREATE OR REPLACE DYNAMIC TABLE logins_with_predictions
    WAREHOUSE = my_wh
    TARGET_LAG = '20 minutes'
    REFRESH_MODE = INCREMENTAL
    INITIALIZE = on_create
    COMMENT = 'Dynamic table with continuously updated model predictions'
AS
SELECT
    login_id,
    user_id,
    location,
    event_time,
    MODEL(ml.registry.mymodel)!predict(l.user_id, l.location) AS prediction_result
FROM logins_raw;
```

### Snowpark Python

The Snowpark Python API allows you to access the model registry programmatically and run inference directly on DataFrames. This approach can be more flexible and maintainable, especially in code-driven environments.

```python
from snowflake.snowpark import Session
from snowflake.snowpark.functions import col
from snowflake.ml.registry import Registry

# Initialize the registry
reg = Registry(session=sp_session, database_name="ML", schema_name="REGISTRY")

# Retrieve the default model version from the registry
model = reg.get_model("MYMODEL")

# Load the source data
df_raw = sp_session.table("LOGINS_RAW")

# Run inference on the necessary features
predictions_df = model.run(df_raw.select("USER_ID", "LOCATION"))

# Join predictions back to the source data
joined_df = df_raw.join(predictions_df, on=["USER_ID", "LOCATION"])

# Create or replace a dynamic table from the joined DataFrame
joined_df.create_or_replace_dynamic_table(
    name="LOGINS_WITH_PREDICTIONS",
    warehouse="MY_WH",
    lag='20 minutes',
    refresh_mode='INCREMENTAL',
    initialize="ON_CREATE",
    comment="Dynamic table continuously updated with model predictions"
)
```

The code sample above will run inference using MYMODEL on new data in LOGINS_RAW every 20 minutes automatically.

### Immutable vs volatile

This incrementality is essential and necessitates that all functions invoked within the Dynamic Table definition be designated as IMMUTABLE. While functions within a standard model are typically IMMUTABLE, Custom models default to VOLATILE. If the underlying model is known to be immutable, it is crucial to ensure that the corresponding model function is explicitly marked as immutable when the model is logged.

---
title: Snowpark ML
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/snowpark-ml.md
section: Snowflake ML
---

# Snowpark ML

The registry supports models created using [Snowpark ML modeling APIs](../../modeling.md) (models derived from
`snowpark.ml.modeling.framework.base.BaseEstimator`).

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. Snowpark ML models have the following target methods by default, assuming the method exists: `predict`, `transform`, `predict_proba`, `predict_log_proba`, `decision_function`. |

You do not need to specify `sample_input_data` or `signatures` when logging a Snowpark ML model;
these are automatically inferred during fitting.

> **Note:**
>
> Snowpark ML pipelines require an estimator. You can’t register a transformer-only Snowpark ML pipeline. Use a scikit-learn pipeline to register your transformers.

## Example

```python
import pandas as pd
import numpy as np
from sklearn import datasets
from snowflake.ml.modeling.xgboost import XGBClassifier

iris = datasets.load_iris()
df = pd.DataFrame(data=np.c_[iris["data"], iris["target"]], columns=iris["feature_names"] + ["target"])
df.columns = [s.replace(" (CM)", "").replace(" ", "") for s in df.columns.str.upper()]

input_cols = ["SEPALLENGTH", "SEPALWIDTH", "PETALLENGTH", "PETALWIDTH"]
label_cols = "TARGET"
output_cols = "PREDICTED_TARGET"

clf_xgb = XGBClassifier(
        input_cols=input_cols, output_cols=output_cols, label_cols=label_cols, drop_input_cols=True
)
clf_xgb.fit(df)
model_ref = registry.log_model(
    clf_xgb,
    model_name="XGBClassifier",
    version_name="v1",
)
model_ref.run(df.drop(columns=label_cols).head(10), function_name='predict_proba')
```

---
title: Specifying model signatures
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-signature.md
section: Snowflake ML
---

# Specifying model signatures

To ensure a consistent experience no matter where a model is run, the Snowflake Model Registry needs to know the input
and output schema of the model’s inference methods: that is, the name and type of all columns in the input or output
DataFrame. This allows these columns to be mapped between Python and SQL data types when necessary. This schema is
referred to as a *signature* by analogy to the arguments of a function and their types. Signatures can also include
optional *parameters* that control inference behavior, such as a temperature setting.

For certain ML frameworks, the model registry can infer these schemas, either from data structures in the model itself
or from sample input data. However, models often accept or return objects that lack this information, such as NumPy
arrays. In these cases, Snowpark ML infers the input feature names as `input_feature_1`, `input_feature_2`, and so
on. Similarly, output features are named `output_feature_1`, `output_feature_2`, and so on.

To use more meaningful names in your custom models, you can use one of the following methods:

* Update `sample_input_data` with column names, usually by converting the dataset to a pandas or
  [Snowpark DataFrame](../../snowpark/python/working-with-dataframes.md).
* Explicitly pass signatures to `log_model`. When a model does not produce names in its output, explicit signatures
  might be the only option.

## Inferring a signature

Like the model registry itself, you can generate signatures automatically. Use
`snowflake.ml.model.model_signature.infer_signature` to infer a signature based on provided sample input, output, and
column names, and then apply that signature to the appropriate methods when logging the model, as in the following example:

```python
import pandas as pd
from sklearn import svm, datasets

from snowflake.ml.model import model_signature

digits = datasets.load_digits()
target_digit = 6

def one_vs_all(dataset, digit):
    return [x == digit for x in dataset]

train_features = digits.data[:10]
train_labels = one_vs_all(digits.target[:10], target_digit)
clf = svm.SVC(gamma=0.001, C=10.0, probability=True)
clf.fit(train_features, train_labels)

sig = model_signature.infer_signature(
    train_features,
    train_labels,
    input_feature_names=['column1', 'column2', ...],
    output_feature_names=['is_target_digit'])

# Supply a signature for every function the model exposes, in this case only `predict`.
mv = reg.log_model(
    clf,
    model_name='my_model',
    version_name='v1',
    signatures={"predict": sig}
)
```

This example applies the signature to only one method, but you can infer a signature for each method your model exposes.
You can use the same signature object (`sig` in the example) for all methods that have the same signature.

> **Note:**
>
> For Snowpark DataFrames, `infer_signature` must run the DataFrame’s query to obtain the data from which the
> signature is inferred. This can incur significant cost depending on the size of the dataset. Most training datasets
> are large enough to make this a consideration.
>
> To avoid such large queries, `infer_signature` considers only the first hundred rows of the data by adding LIMIT
> 100 to the query. However, if these rows are not representative of the data, the inferred signature might not be
> accurate. This commonly occurs when the dataset contains many NULL values and a column in the dataset has only NULL
> values in the first hundred rows. In this case, the inferred signature incorrectly omits that column. Provide the
> signature explicitly, as shown in the next section, to avoid this issue.

## Constructing a signature

You can also manually construct a signature by using `snowflake.ml.model.model_signature.ModelSignature`. Both scalar and
tensor types (including ragged tensors) are supported.

Example:

```python
from snowflake.ml.model.model_signature import ModelSignature, FeatureSpec, DataType

sig = ModelSignature(
    inputs=[
        FeatureSpec(dtype=DataType.DOUBLE, name=f_0),
        FeatureSpec(dtype=DataType.INT64, name=sparse_0_fixed_len, shape=(5, 5)),
        FeatureSpec(dtype=DataType.INT64, name=sparse_1_variable_len, shape=(-1,)),
    ],
    outputs=[
        FeatureSpec(dtype=DataType.FLOAT, name=output),
    ]
)
```

Then pass the signature object, `sig`, to `log_model` with the `signatures` argument as in the example above
for the methods to which it applies.

## Specifying parameters with ParamSpec

In addition to input and output features, model signatures can include *parameters*. Parameters define optional
configuration values that you can pass to model inference methods when you make an inference request.
Unlike input features, which specify the data being processed, parameters control inference behavior, such as the number of results to return or a temperature setting.

Use `ParamSpec` from [snowflake.ml.model.model_signature.ModelSignature](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/api/model/snowflake.ml.model.model_signature.ModelSignature) to define a parameter.

Each `ParamSpec` requires a name,
a data type, and a default value. The default value is used when the parameter is not explicitly provided at inference
time.

### Constructing a signature with parameters

The following example creates a model signature that includes both input/output features and parameters:

```python
from snowflake.ml.model.model_signature import ModelSignature, FeatureSpec, ParamSpec, DataType

sig = ModelSignature(
    inputs=[
        FeatureSpec(dtype=DataType.STRING, name="input_text"),
    ],
    outputs=[
        FeatureSpec(dtype=DataType.STRING, name="output_text"),
    ],
    params=[
        ParamSpec(name="temperature", dtype=DataType.DOUBLE, default_value=0.7),
        ParamSpec(name="max_tokens", dtype=DataType.INT32, default_value=256),
    ]
)

mv = reg.log_model(
    my_model,
    model_name='my_model',
    version_name='v1',
    signatures={"predict": sig}
)
```

You can also include parameters when inferring a signature with `infer_signature`:

```python
from snowflake.ml.model.model_signature import ParamSpec, DataType
from snowflake.ml.model import model_signature

params = [
    ParamSpec(name="top_k", dtype=DataType.INT32, default_value=10),
    ParamSpec(name="threshold", dtype=DataType.DOUBLE, default_value=0.5),
]

sig = model_signature.infer_signature(
    input_data,
    output_data,
    params=params
)
```

> **Note:**
>
> Parameter names must be unique within the signature and cannot share names with input features. If a parameter name
> conflicts with an input feature name, a `ValueError` is raised.

For a full list of `ParamSpec` arguments, see the
[API reference](https://docs.snowflake.com/developer-guide/snowpark-ml/reference/latest/model).

For details on passing parameter values at inference time, see
[Passing parameters during inference](../inference/native-batch-inference-sql.md) and
[Passing parameters in SQL](../inference/native-batch-inference-sql.md).

## Data type mappings

This section describes the equivalence of types in the Snowflake Model Registry for supported type systems.

### Column data types

The following table shows the equivalence of model signature type, pandas DataFrames (NumPy) type, and Snowpark
Python type.

| Model signature type | pandas DataFrame (NumPy) type | Snowpark Python type |
| --- | --- | --- |
| INT8 | `np.int8` | `ByteType` |
| INT16 | `np.int16` | `ShortType` |
| INT32 | `np.int32` | `IntegerType` |
| INT64 | `np.int64` | `LongType` |
| FLOAT | `np.float32` | `FloatType` |
| DOUBLE | `np.float64` | `DoubleType` |
| UINT8 | `np.uint8` | `ByteType` |
| UINT16 | `np.uint16` | `ShortType` |
| UINT32 | `np.uint32` | `IntegerType` |
| UINT64 | `np.uint64` | `LongType` |
| BOOL | `np.bool_` | `BooleanType` |
| STRING | `np.str_` | `StringType` |
| BYTES | `np.bytes_` | `BinaryType` |
| TIMESTAMP_NTZ | `np.datetime64` | `TimestampType` |

The representation of tensor features where the shape is specified uses `np.object_`.

### Missing values

If `sample_input_data` is used to infer model signature, it generally should not contain any NULL values.
The model registry attempts to infer signatures from the data provided, but it may not always be able to so
completely. It is good practice to prevent NULLs from being included in the sample data as early as possible,
for example at data input time, whenever possible.

### Conversion from NumPy

If the NumPy data type can be safely cast to a NumPy type shown in Column data types, it is
inferred as the corresponding data type.

### Conversion from PyTorch

| PyTorch type | Model signature type |
| --- | --- |
| `torch.uint8` | UINT8 |
| `torch.int8` | INT8 |
| `torch.int16` | INT16 |
| `torch.int32` | INT32 |
| `torch.int64` | INT64 |
| `torch.float32` | FLOAT |
| `torch.float64` | DOUBLE |
| `torch.bool` | BOOL |

### Conversion from Snowpark

In addition to the mappings shown in Column data types, the following conversions apply:

* `DecimalType` with scale of 0 maps to INT64.
* `DecimalType` with scale greater than 0 maps to DOUBLE.

---
title: Stable Endpoints & API Reference
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/stable-endpoints-api-reference.md
section: Snowflake ML
---

# Stable Endpoints & API Reference

This page provides the technical specifications for consuming your inference services externally and using the Snowflake Gateway to manage production model upgrades and high availability.

## Stable Endpoints with Snowflake Gateway

The standard SPCS ingress system has a tight coupling between a service and its hostname; when a service is recreated, the associated hostname is lost. The Snowflake Gateway resolves this by providing a permanent hostname allocated at creation that does not change for the lifetime of the gateway object.

### Key Capabilities

**Stable URL:** Maintain one permanent URL while pointing the gateway to different underlying services as your models evolve. Changes are typically reflected within one minute.

**Traffic Splitting:** Route requests to multiple endpoints based on assigned percentages, facilitating blue-green or canary deployments.

**Automatic Failover:** Automatically redirect traffic from an unavailable or non-operational endpoint to other healthy targets.

### Gateway Failover Behavior

The gateway respects the relative percentage of specified healthy endpoints and will automatically trigger a failover if:

* A service is suspended (and auto_resume is false) or its compute pool is suspended (until it comes back up).
* A service fails its readiness probe or is dropped entirely.
* The gateway owner loses USAGE or OWNERSHIP privileges on a target service endpoint.

> **Note:**
>
> Traffic is never failed over to an endpoint with a 0% split; a target must have at least 1% to be considered for failover.

## Managing Model Upgrades

### 1. Creating and Altering a Gateway

You can define how traffic is distributed between model versions using a YAML-based specification within a SQL command.

```sqlexample
-- Create a gateway to split traffic between V1 (90%) and V2 (10%)
CREATE OR REPLACE GATEWAY my_model_gateway
  FROM SPECIFICATION $$
    spec:
      type: traffic_split
      split_type: custom
      targets:
        - type: endpoint
          value: my_db.my_schema.model_v1_service!inference
          weight: 90
        - type: endpoint
          value: my_db.my_schema.model_v2_service!inference
          weight: 10
  $$;

-- Change the gateway to split traffic differently V1 (60%) and V2 (40%)
ALTER GATEWAY split_gateway
FROM SPECIFICATION $$
spec:
type: traffic_split
split_type: custom
targets:
- type: endpoint
value: my_db.my_schema.model_v1_service!inference
weight: 60
- type: endpoint
value: my_db.my_schema.model_v2_service!inference
weight: 40
$$;
```

**Rules for Specifications:** type must be traffic_split, split_type must be custom, and all target weights must sum to exactly 100. By default, a gateway can route to a maximum of 5 endpoints.

### 2. Handling Schema Evolution

When a new model version (V2) requires different input features than V1, follow this strategy to avoid request disruptions:

1. **Superset Update:** Update your client application to send all the features required by both V1 and V2. Snowflake model serving implicitly ignores unnecessary features.
2. **Gradual Split:** Deploy V2 and use ALTER GATEWAY to slowly shift traffic percentages from V1 to V2.
3. **Client Cleanup:** Once 100% of traffic is routed to V2, update the client to remove the now-obsolete V1 features.

> **Important:**
>
> Gateway routing with superset features is currently supported in dataframe_records format; support for dataframe_split is coming soon.

### 3. HTTP endpoint

Every gateway object comes with its endpoint name, which can be found by using following query:

```sqlexample
DESC GATEWAY split_gateway ->> select "ingress_url" as endpoint from $1
```

The endpoint of the gateway will be <https:/>/<endpoint>/. To call any particular methods to the model via gateway, use the method name as path to the URL (eg <https:/>/<endpoint>/<method-name> ). In a URL, underscores (_) in the method name are replaced by dashes (-) in the URL. For example, the service name predict_prob is changed to predict-proba in the URL.

For private link users, use privatelink_ingress_url instead of ingress_url.

## Authorization & Security

### Accessing the Endpoint

**Authentication:** Using Programmatic Access Tokens (PAT) in the header is the simplest: `Authorization: Snowflake Token="your_pat_token"`. Gateway supports all the protocols Service Endpoint supports.

**The 404 Behavior:** For security, Snowflake returns a 404 Not Found for all authorization failures (e.g., incorrect token or lack of network route). There is currently no way to distinguish authentication errors from invalid URLs.

### Required Privileges

To manage or use a Gateway, the owner role requires:

* **Gateway Management:** CREATE GATEWAY in the schema and USAGE, MODIFY, or OWNERSHIP on the gateway object.
* **Endpoint Usage:** USAGE on the database, schema, and target service endpoints (specifically the ALL_ENDPOINTS_USAGE service role on the deployed service).
* **Public Access:** BIND SERVICE ENDPOINT on the account to expose the gateway to the public internet.

## Request & Response Protocols

Gateway supports the same data format as described in the Real-time inference page.

### Passing Supplemental Metadata

In some scenarios, you may need to pass supplemental data (such as record IDs or primary keys) that are not part of the model’s input signature but are required for downstream logging or joining with ground-truth labels. To handle this, Snowflake supports an optional extra_columns top-level field.

#### Example

With dataframe_split you include extra_columns as a top-level field alongside the DataFrame payload:

```python
payload = {
    "dataframe_split": {
        "index": [0, 1],
        "columns": [
            "customer_id",
            "age",
            "monthly_spend",
            "primary_key",
        ],
        "data": [
            [101, 32, 85.5, "001"],
            [102, 45, 120.0, "002"],
        ]
    },
    "extra_columns": ["primary_key"]
}
```

or with dataframe_records:

```python
payload = {
    "dataframe_records": [
        {
            "customer_id": 101,
            "age": 32,
            "monthly_spend": 85.5,
            "primary_key": "001",
        },
        {
            "customer_id": 102,
            "age": 45,
            "monthly_spend": 120.0,
            "primary_key": "002",
        },
    ],
    "extra_columns": ["primary_key"]
}
```

#### Guidelines for extra_columns

**Optional:** You can omit extra_columns entirely if you do not need it.

**No collisions:** The column names listed in extra_columns must not collide with the columns that your model method expects as inputs. Keep model inputs and extra columns conceptually separate.

**Payload size limit:** The entire request payload (including extra_columns and all data rows) is limited to 1 MB. If you exceed this limit:

* Reduce the batch size (fewer rows per request), or
* Remove or shorten extra columns that are not strictly necessary.

---
title: TensorFlow
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/tensorflow.md
section: Snowflake ML
---

# TensorFlow

The Snowflake ML Model Registry supports models created using TensorFlow (models derived from `tensorflow.Module`)
and Keras v2 models (`keras.Model` with Keras version < 3.0.0).

> **Note:**
>
> For Keras 3.0.0 or later, use the [Keras](keras.md) handler.

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. TensorFlow models have `__call__` as the default target method. Keras v2 models have `predict` as the default target method. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |
| `multiple_inputs` | Whether the model expects multiple tensor inputs. Defaults to `False`. When `True`, the model will accept a list of tensors as input instead of a single tensor. |

You must specify either the `sample_input_data` or `signatures` parameter when logging a TensorFlow model so
that the registry knows the signatures of the target methods.

> **Note:**
>
> Keras v2 models can only have one target method.

> **Note:**
>
> When using pandas DataFrames (which use float64 by default), ensure your TensorFlow model uses `tf.float64`
> for variables and `tf.TensorSpec` input signatures to avoid dtype mismatch errors.

## Examples

These examples assume `reg` is an instance of `snowflake.ml.registry.Registry`.

### TensorFlow Module

The following example demonstrates creating a TensorFlow model by subclassing `tf.Module`, logging it to the Snowflake ML Model Registry, and running inference.

```python
import tensorflow as tf
import pandas as pd

# Define a simple TensorFlow module
class LinearModel(tf.Module):
    def __init__(self, name=None):
        super().__init__(name=name)
        self.weight = tf.Variable(2.0, dtype=tf.float64, name="weight")
        self.bias = tf.Variable(1.0, dtype=tf.float64, name="bias")

    @tf.function(input_signature=[tf.TensorSpec(shape=(None, 1), dtype=tf.float64)])
    def __call__(self, x):
        return self.weight * x + self.bias

# Create model instance
model = LinearModel(name="linear_model")

# Create sample input data as DataFrame
sample_df = pd.DataFrame({"input": [1.0, 2.0, 3.0, 4.0, 5.0]})

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_tf_linear_model",
    version_name="v1",
    sample_input_data=sample_df,
)

# Make predictions (default target method is __call__)
test_df = pd.DataFrame({"input": [6.0, 7.0, 8.0]})
result_df = model_ref.run(test_df)
```

### Keras v2 Sequential Model

The following example demonstrates training a Keras v2 sequential model, logging it to the Snowflake ML Model Registry, and running inference.

```python
import tf_keras as keras
from sklearn import datasets, model_selection

# Load dataset
iris = datasets.load_iris(as_frame=True)
X = iris.data
y = iris.target

# Rename columns for valid Snowflake identifiers
X.columns = [col.replace(' ', '_').replace('(', '').replace(')', '') for col in X.columns]

X_train, X_test, y_train, y_test = model_selection.train_test_split(X, y, test_size=0.2)

# Build Keras v2 model
model = keras.Sequential([
    keras.layers.Dense(64, activation='relu', input_shape=(X_train.shape[1],)),
    keras.layers.Dense(32, activation='relu'),
    keras.layers.Dense(3, activation='softmax')
])

model.compile(
    optimizer='adam',
    loss='sparse_categorical_crossentropy',
    metrics=['accuracy']
)

# Train the model
model.fit(X_train, y_train, epochs=50, verbose=0)

# Log the model
model_ref = reg.log_model(
    model=model,
    model_name="my_iris_classifier",
    version_name="v1",
    sample_input_data=X_test,
)

# Make predictions
result_df = model_ref.run(X_test[-10:], function_name="predict")
```

---
title: Train models
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/train-models.md
section: Snowflake ML
---

# Train models

Use Snowflake ML to develop machine learning and deep learning models with popular open-source frameworks.
Snowflake ML provides flexible development environments, efficient data access, and powerful compute resources without the management overhead.

You can train a model within either a Snowflake Notebook or a Snowflake ML Job.

Snowflake Notebooks are interactive environments that you can use for machine learning. For more information about using Snowflake Notebooks for machine learning workflows, see [Notebooks on Container Runtime](notebooks-on-spcs.md).

Snowflake ML Jobs allow you to run ML workflows from any environment. For more information about using Snowflake ML Jobs, see [Snowflake ML Jobs](ml-jobs/overview.md).

With Snowflake Experiments, you can compare your trained models in an organized manner. Use information logged during model training to evaluate the results and select the best model for your needs. For more information, see [Run an experiment to compare and select models](experiments.md).

## Train with open source

When you use a Snowflake Notebook or ML Job, you get access to the Container Runtime. The Container Runtime is an environment that has popular packages and frameworks that you can use to train your models.
The packages include scikit-learn, numpy, and scipy. For more information, see [Snowflake Container Runtime](container-runtime-ml.md).

The following example trains a logistic regression model using scikit-learn:

```python
import pandas as pd
from snowflake.ml.data.data_connector import DataConnector
from snowflake.snowpark.context import get_active_session
from sklearn.linear_model import LogisticRegression

# Get the active Snowpark session
session = get_active_session()

# Specify training table location
table_name = "TRAINING_TABLE"  # Replace with your actual Snowflake table name

# Load table into DataConnector
data_connector = DataConnector.from_dataframe(session.table(table_name))

# Convert to pandas DataFrame
pandas_df = data_connector.to_pandas()

# Assuming 'TARGET' is the label column in your Snowflake table
label_column_name = 'TARGET'

# Separate features (X) and target (y)
X, y = pandas_df.drop(label_column_name, axis=1), pandas_df[label_column_name]

# Initialize and fit a Logistic Regression model
logistic_regression_model = LogisticRegression(max_iter=1000)  # Increased max_iter for convergence
logistic_regression_model.fit(X, y)
```

In addition to scikit-learn, you can use the XGBoost and LightGBM libraries to develop powerful classification, regression, and ranking models.

The following example loads data from a Snowflake table using the Snowflake DataConnector, converts it to a pandas DataFrame, and trains an XGBoost model.
The DataConnector accelerates data loading and pandas dataframe conversion. For more information about the DataConnector, see [Load structured data from Snowflake tables](load-data.md)

```python
from snowflake.ml.data.data_connector import DataConnector
from snowflake.snowpark.context import get_active_session
import xgboost as xgb

session = get_active_session()

# Specify training table location
table_name = "TRAINING_TABLE"

# Load table into DataConnector
data_connector = DataConnector.from_dataframe(session.table(table_name))

pandas_df = data_connector.to_pandas()
label_column_name = 'TARGET'
X, y = pandas_df.drop(label_column_name, axis=1), pandas_df[label_column_name]

clf = xgb.Classifier()
clf.fit(X, y)
```

## Train deep learning models

You can use a GPU-powered container runtime image to train deep learning models with PyTorch, TensorFlow, and other frameworks. You can use the pre-installed libraries or you can extend the base image with packages from either public or private repositories.

You can get GPU compute on demand from your available compute pools. You only pay for the resources that you use.

With the GPU container runtime image, you can use features such as distributed training to accelerate the development of large-scale models.

For an example of efficient data loading with DataConnector and distributed training, see [Running Distributed PyTorch Models on Snowflake: An End-to-End ML Solution](https://www.snowflake.com/en/developers/solutions-center/running-distributed-pytorch-models-on-snowflake-an-end-to-end-ml-solution/).

The following example loads data efficiently:

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader
from snowflake.ml.data.data_connector import DataConnector

example_snowpark_dataframe = session.table("EXAMPLE_TRAINING_DATA")

# Connector from a Snowflake table
data_connector = DataConnector.from_dataframe(example_snowpark_dataframe)

# Load as a torch dataset
torch_dataset = data_connector.to_torch_dataset(batch_size=32)
train_loader = DataLoader(torch_dataset, batch_size=None)

label_col = 'TARGET'
feature_cols = ['FEATURE1', 'FEATURE2']

for batch_idx, batch in enumerate(dataloader):
    y = batch_data.pop(label_col).squeeze()
    X = torch.stack(
        [tensor.squeeze() for key, tensor in batch.items() if key in feature_cols]
    )
```

The following example trains a model:

```python
# ------------------------
# Tiny MLP for binary classification
# ------------------------
input_dim = X.shape[1]

class MLP(nn.Module):
    def __init__(self, d_in):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(d_in, 64), nn.ReLU(),
            nn.Linear(64, 32), nn.ReLU(),
            nn.Linear(32, 1)  # logits
        )

    def forward(self, x):
        return self.net(x).squeeze(1)

DEVICE = torch.device("cuda" if torch.cuda.is_available() else "cpu")
model = MLP(input_dim).to(DEVICE)
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.BCEWithLogitsLoss()

# ------------------------
# Train
# ------------------------
EPOCHS = 5

for epoch in range(1, EPOCHS + 1):
    model.train()
    for xb, yb in train_loader:
        xb, yb = xb.to(DEVICE), yb.to(DEVICE)
        logits = model(xb)
        loss = criterion(logits, yb)
        opt.zero_grad()
        loss.backward()
        opt.step()
    acc = evaluate(val_loader)
    print(f"epoch {epoch} val_acc={acc:.3f}")
```

## Handle complex training tasks

Training models on large datasets, complex model architectures and hyperparameters requires significant time, cost, and access to resources that facilitate such complex processing. With Snowflake ML, you can train such models in confidence.

### Fully managed training infrastructure

Snowflake ML provides fully managed training infrastructure through Notebooks and ML Jobs on Container Runtime. You don’t need to manage custom images or provision resources. You can bring your workload, select the appropriate compute nodes from the admin-determined list, and start training.

### Efficient and accelerated data movement

Loading large amounts of data into memory for processing with training packages can be slow, especially when you’re trying to read directly into an object such as a pandas dataframe. Snowflake ML makes data loading efficient by using the distributed processing of the underlying compute pools. Use the Data Connector to load from your Snowflake tables and stages into open source objects such as pandas dataframes, PyTorch datasets, and TensorFlow datasets.

### Distributed training and hyperparameter tuning

Training ML models on large datasets can exceed the resources of a single node. With Snowflake’s distributed APIs, you can scale feature engineering and training workflows across multiple nodes for improved performance. With the distributed APIs, you can do the following:

* Leverage distributed preprocessing functions in `snowflake.ml.modeling.preprocessing`.
* Scale your model training out across one or more nodes using optimized training APIs in [Snowflake Container Runtime](container-runtime-ml.md).
* Accelerate hyperparameter tuning with Snowflake ML’s [distributed HPO](container-hpo.md), optimized for data stored in Snowflake. You can also use open source libraries like `hyperopt` or `optuna`.

In addition to using Snowflake’s distributed APIs to scale your workflows, you can also use Ray. Ray is an open-source framework that provides a simple and flexible way to scale Python applications. It allows you to run your code in parallel across multiple nodes. For more information about using Ray with Snowflake ML, see the [Ray Getting Started Guide](https://docs.ray.io/en/latest/ray-overview/getting-started.html).

## Integrate with MLOps

Snowflake provides a fully integrated MLOps platform that you can access through Snowflake Notebooks and ML Jobs. This enables you to train models using production-ready features, manage experiments and models, and deploy trained models to production.

You can use the following features for your MLOps workflow:

* Create and manage features via the Feature Store
* Run feature pre-processing at scale with OSS and SnowflakeML APIs
* Manage experiments with built-in experiment tracking
* Register and manage the trained model
* Run inference pipelines against the registered model
* Monitor the deployed model for drift and accuracy

## Next steps

After training your models, you can:

* [Tune hyperparameters](container-hpo.md) to optimize performance
* [Train across partitions](train-models-across-partitions.md) for large-scale model training
* [Register models](model-registry/overview.md) in the Model Registry
* [Deploy models](inference/native-batch-inference-sql.md) for inference

---
title: Train models across data partitions
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/train-models-across-partitions.md
section: Snowflake ML
---

# Train models across data partitions

Use Many Model Training (MMT) to train multiple machine learning models efficiently across data partitions. It handles distributed orchestration, model storage, and artifact persistence automatically.

MMT partitions your Snowpark DataFrame by a specified column and trains separate models on each partition in parallel. Focus on your model training logic while MMT handles infrastructure complexity and scales automatically.

You can use MMT to train multiple models efficiently across different data segments. This tool is ideal for scenarios like training region-specific sales forecasting models, building personalized recommendation systems where each customer group requires its own model, or creating segment-specific predictive models. MMT handles the distributed model training automatically, eliminating the complexity of managing distributed computing infrastructure.

You can use MMT to train models using open source machine learning models and frameworks such as XGBoost, scikit-learn, PyTorch, and TensorFlow. MMT automatically serializes model artifacts, so that you can access them at the time of inference.

You can also implement the ModelSerde interface to train custom models or use unsupported ML frameworks. This allows you to integrate MMT with any machine learning framework or custom model architecture that you use.

> **Important:**
>
> Before you use MMT, make sure you have the following:
>
> * **Container Runtime Environment**: MMT requires a Snowflake ML container runtime environment.
> * **Stage Access Permissions**: MMT automatically stores model artifacts in Snowflake stages. Ensure you have appropriate permissions to access the specified named stage.
> * **ML Framework Support**: Built-in integrations are available for XGBoost, scikit-learn, PyTorch, and TensorFlow. For custom models, implement the ModelSerde interface.

The following section walks you through using MMT in an example workflow.

## Training a model with MMT

This section demonstrates the complete MMT workflow in five key steps:

1. **Import your data** - Load training data using Snowpark
2. **Define the training function** - Define the training function
3. **Train models across partitions** - Use MMT to train models on each partition in parallel
4. **Access trained models** - Retrieve and use the trained models for each partition
5. **Model persistence and retrieval** - Save models to stages and restore them later

The workflow automatically handles distributed training, model serialization, and artifact storage across your data partitions.

### Import your data

Use a Snowpark session to start importing your data. The Many Model Training function splits the data that you import into different partitions using the column that you specify.

Before you use MMT, create a Snowpark session. For more information, [Creating a Session for Snowpark Python](../snowpark/python/creating-session.md).

The following code uses a Snowpark session to import your training data.

```python
# Example: sales_data with columns: region, feature1, feature2, feature3, target
sales_data = session.table("SALES_TRAINING_DATA")
```

### Define the training function

After you get your data, you define the training function that MMT uses to train models across partitions. The training function receives a data connector and a context object that points it to the data partition on which it’s training. This section has examples defining a training function for training an XGBoost model in addition to examples that leverage TensorFlow and PyTorch.

Your training function must have this exact signature: `(data_connector, context)`.
For each data partition, MMT calls `train_xgboost_model` with the following arguments:

* `data_connector`: A data connector that provides access to the data that MMT partitions. `train_xgboost_model` converts that dataframe to pandas.
* `context`: An object that provides the `partition_id` to the `train_xgboost_model` function. This ID is the name of the column that you’re partitioning on.

You don’t call this function yourself. MMT handles the execution across all partitions.

Use the following code to define your training function. After you change the code to reflect the features in your data, you can pass it to the MMT function.

XGBoostPyTorchTensorFlowCustom model

Use XGBoost to train models across data partitions. XGBoost provides excellent performance for structured data and handles missing values automatically.

```python
def train_xgboost_model(data_connector, context):
    df = data_connector.to_pandas()
    print(f"Training model for partition: {context.partition_id}")

    # Prepare features and target
    X = df[['feature1', 'feature2', 'feature3']]
    y = df['target']

    # Train the model
    from xgboost import XGBRegressor
    model = XGBRegressor(
        n_estimators=100,
        max_depth=6,
        learning_rate=0.1,
        random_state=42
    )
    model.fit(X, y)
    return model

trainer = ManyModelTraining(train_xgboost_model, "model_stage")
```

Use PyTorch to train deep learning models across data partitions. PyTorch offers flexible neural network architectures and dynamic computation graphs.

```python
def train_pytorch_model(data_connector, context):
    import torch
    import torch.nn as nn

    df = data_connector.to_pandas()
    # ... prepare data for PyTorch ...

    model = nn.Sequential(nn.Linear(10, 1))
    # ... training logic ...
    return model  # Automatically saved as model.pth

from snowflake.ml.modeling.distributors.many_model import TorchSerde
trainer = ManyModelTraining(train_pytorch_model, "models_stage", serde=TorchSerde())
```

Use TensorFlow to train deep learning models across data partitions. TensorFlow provides comprehensive tools for both research and production deployment.

```python
def train_tf_model(data_connector, context):
    import tensorflow as tf

    df = data_connector.to_pandas()
    # ... prepare data for TensorFlow ...

    model = tf.keras.Sequential([tf.keras.layers.Dense(1)])
    # ... training logic ...
    return model  # Automatically saved as model.h5

from snowflake.ml.modeling.distributors.many_model import TensorFlowSerde
trainer = ManyModelTraining(train_tf_model, "models_stage", serde=TensorFlowSerde())
```

Use custom models or unsupported ML frameworks by implementing the ModelSerde interface. This example shows scikit-learn with custom metadata handling.

```python
from snowflake.ml.modeling.distributors.many_model import ModelSerde
import json

class ScikitLearnSerde(ModelSerde):
    '''Custom serializer for scikit-learn models with metadata'''

    @property
    def filename(self) -> str:
        return "sklearn_model.joblib"

    def write(self, model, file_path: str) -> None:
        import joblib
        # Save model with metadata
        model_data = {
            'model': model,
            'feature_names': getattr(model, 'feature_names_in_', None),
            'model_type': type(model).__name__
        }
        joblib.dump(model_data, file_path)

    def read(self, file_path: str):
        import joblib
        return joblib.load(file_path)

def train_sklearn_model(data_connector, context):
    from sklearn.ensemble import RandomForestRegressor
    df = data_connector.to_pandas()
    X, y = df[['feature1', 'feature2']], df['target']

    model = RandomForestRegressor()
    model.fit(X, y)
    return model  # Automatically saved with metadata

trainer = ManyModelTraining(train_sklearn_model, "models_stage", serde=ScikitLearnSerde())
```

### Train models across partitions

After you’ve defined your training function, you can use MMT to train models across partitions. Specify the column to partition by and the stage where the models are saved.

The following code partitions the data by the `region` column and uses the `train_xgboost_model` function to train separate models for each region in parallel.

For example, if the following were the possible values for the `region` column:

* North
* South
* East
* West
* Central

The `ManyModelTraining` function would create a separate data partition for each of the preceding regions and train a model on each partition.

```python
from snowflake.ml.modeling.distributors.many_model import ManyModelTraining

trainer = ManyModelTraining(train_xgboost_model, "model_stage") # Specify the stage to store the models
training_run = trainer.run(
    partition_by="region",  # Train separate models for each region
    snowpark_dataframe=sales_data,
    run_id="regional_models_v1" # Specify a unique ID for the training run
)

# Monitor training progress
final_status = training_run.wait()
print(f"Training completed with status: {final_status}")
```

Models are stored in the stage at `run_id/{partition_id}` where `partition_id` is the partition column value.

### Access trained models

After MMT finishes, you have trained models for each data partition stored in your specified stage. Each model is trained on data specific to its partition. For example, a “North” model is trained only on North region data.

The training run object provides methods to access these models and check training status for each partition.

The following code retrieves the checks the status of the training run and retrieves the trained models for each partition:

```python
if final_status == RunStatus.SUCCESS:
    # Access models for each partition
    for partition_id in training_run.partition_details:
        trained_model = training_run.get_model(partition_id)
        print(f"Model for {partition_id}: {trained_model}")

        # You can now use the model for predictions or further analysis
        # Example: model.predict(new_data)
else:
    # Handle training failures
    for partition_id, details in training_run.partition_details.items():
        if details.status != "DONE":
            print(f"Training failed for {partition_id}")
            error_logs = details.logs
```

### Model Persistence and Retrieval

MMT automatically persists trained models to your specified Snowflake stage during the training process. Each model is stored with a structured path that includes the run ID and partition identifier, making it easy to organize and retrieve models later.

The automatic persistence means you don’t need to manually save models. MMT handles serialization and storage for you, eliminating the risk of losing trained models due to session timeouts or connection issues.

You can restore previous training runs and access their models even after your original session has ended. This persistence mechanism enables you to:

* Resume work across different sessions
* Share trained models with team members
* Build model versioning workflows
* Integrate with downstream inference pipelines

Models are automatically saved to the specified stage and can be retrieved later:

```python
# Restore training run from stage
restored_run = ManyModelTraining.restore_from("regional_models_v1", "model_stage")

# Access models from restored run
north_model = restored_run.get_model("North")
south_model = restored_run.get_model("South")
```

## Training custom models

For custom models or unsupported ML frameworks, implement the ModelSerde interface. You can define your own serialization and deserialization logic for custom models. This allows you to integrate MMT with any machine learning framework or custom model architecture that you use.

```python
from snowflake.ml.modeling.distributors.many_model import ModelSerde

class CustomModelSerde(ModelSerde):
    def serialize(self, model, path):
        # Custom serialization logic
        pass

    def deserialize(self, path):
        # Custom deserialization logic
        pass

def train_custom_model(data_connector, context):
    # Your custom training logic
    model = your_custom_model_training(data_connector.to_pandas())
    return model

trainer = ManyModelTraining(
    train_custom_model,
    "custom_model_stage",
    model_serde=CustomModelSerde()
)
```

## Integrating with Model Registry

MMT can be integrated with Snowflake’s Model Registry for enhanced model management. The Model Registry provides centralized model versioning, metadata tracking, and deployment management across your organization. This integration is particularly valuable when training multiple models with MMT, as it helps you organize, track, and govern all the partition-specific models from a single location.

Using the Model Registry with MMT enables you to do the following:

* Track different iterations of your partition-specific models
* Store model performance metrics, training parameters, and lineage information
* Manage which model versions are deployed to production for each partition
* Share models across teams with proper access controls and documentation
* Implement approval workflows and compliance tracking for model deployments

```python
# Register trained models to Model Registry
for partition_id in training_run.partition_details:
    model = training_run.get_model(partition_id)

    # Register to Model Registry
    model_ref = registry.log_model(
        model,
        model_name=f"sales_model_{partition_id.lower()}",
        version_name="v1"
    )
```

---
title: Troubleshooting
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/inference/real-time-inference-troubleshooting.md
section: Snowflake ML
---

# Troubleshooting

This guide explains how to monitor your deployments in Snowpark Container Services (SPCS) and resolve common issues related to package dependencies, memory, and environment configurations.

## Monitor SPCS deployments

You can monitor deployment by inspecting the services being launched using the following SQL query.

```sqlexample
SHOW SERVICES IN COMPUTE POOL my_compute_pool;
```

Two jobs are launched:

* **MODEL_BUILD_xxxxx**: The final characters of the name are randomized to avoid name conflicts. This job builds the image and ends after the image has been built. If an image already exists, the job is skipped.

  The logs are useful for debugging issues such as conflicts in package dependencies. To see the logs from this job, run the SQL below, being sure to use the same final characters:

  ```sqlexample
  CALL SYSTEM$GET_SERVICE_LOGS('MODEL_BUILD_xxxxx', 0, 'model-build');
  ```
* **MYSERVICE**: The name of the service as specified in the call to `create_service`. This job is started if the MODEL_BUILD job is successful or skipped. To see the logs from this job, run the SQL below:

  ```sqlexample
  CALL SYSTEM$GET_SERVICE_LOGS('MYSERVICE', 0, 'model-inference');
  ```

If logs are not available via `SYSTEM$GET_SERVICE_LOG` because the build job or service has been deleted, you can check the event table (if enabled) to see the logs:

```sqlexample
SELECT RESOURCE_ATTRIBUTES, VALUE
FROM <EVENT_TABLE_NAME>
WHERE true
    AND timestamp > dateadd(day, -1, current_timestamp())  -- choose appropriate timestamp range
    AND RESOURCE_ATTRIBUTES:"snow.database.name" = '<db of the service>'
    AND RESOURCE_ATTRIBUTES:"snow.schema.name" = '<schema of the service>'
    AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<Job or Service name>'
    AND RESOURCE_ATTRIBUTES:"snow.service.container.instance" = '0'  -- choose all instances or one particular
    AND RESOURCE_ATTRIBUTES:"snow.service.container.name" != 'snowflake-ingress' --skip logs from internal sidecar
ORDER BY timestamp ASC;
```

## Package conflicts

Two systems dictate the packages installed in the service container: the model itself and the inference server. To minimize conflicts with your model’s dependencies, the inference server requires only the following packages:

* `gunicorn<24.0.0`
* `starlette<1.0.0`
* `uvicorn-standard<1.0.0`

Make sure your model dependencies, along with the above, are resolvable by pip or conda, whichever you use.

If a model has both `conda_dependencies` and `pip_requirements` set, these will be installed as follows via conda:

* **Channels:**

  + `conda-forge`
  + `nodefaults`
* **Dependencies:**

  + `all_conda_packages`
  + **pip:**

    - `all_pip_packages`

Snowflake gets Anacaonda packages from conda-forge when building container images because the Snowflake conda channel is available only in warehouses, and the defaults channel requires users to accept Anaconda terms of use, which isn’t possible during an automated build. To obtain packages from a different channel, such as defaults, specify each package with the channel name, as in `defaults::pkg_name`.

> **Note:**
>
> If you specify both `conda_dependencies` and `pip_requirements`, the container image builds successfully even if the two sets of dependencies are not compatible, which might cause the resulting container image not to work as you expect. Snowflake recommends using only `conda_dependencies` or only `pip_requirements`, not both.

## Service out of memory

Some models are not thread-safe, so Snowflake loads a separate copy of the model in memory for each worker process. This can cause out-of-memory conditions for large models with a higher number of workers. Try reducing `num_workers`.

## Unable to alter the service spec

The specifications of the model build and inference services cannot be changed using `ALTER SERVICE`. You can only change attributes such as `TAG`, `MIN_INSTANCES`, and so forth. Since the image is published in the image repo, however, you can copy the spec, modify it, and create a new service from it, which you can start manually.

## Package not found

Model deployment failed during the image building phase. model-build logs suggest that a requested package was not found. (This step uses conda-forge by default if the package is mentioned in `conda_dependencies`.)

Package installation can fail for any of the following reasons:

* The package name or version is invalid. Check the spelling and version of the package.
* The requested version of the package does not exist in conda-forge. You can try removing the version specification to get the latest version that is available in conda-forge, or use `pip_requirements` instead. You can browse all available packages here.
* Sometimes, you may need a package from a special channel (eg pytorch). Add a `channel_name::` prefix to the dependency, such as `pytorch::torch`.

## Huggingface Hub version mismatch

A Hugging Face model inference service can fail with the error message:

```text
ImportError: huggingface-hub>=0.30.0,<1.0 is required for a normal functioning of this module, but found huggingface-hub==0.25.2
```

This is because the `transformers` package does not specify the correct dependencies on `huggingface-hub` but instead checks in the code. To resolve this problem, log the model again, this time explicitly specifying the required version of `huggingface-hub` in the `conda_dependencies` or `pip_requirements`.

## Torch not compiled with CUDA enabled

The typical cause of this error is that you have specified both `conda_dependencies` and `pip_requirements`. As mentioned in Package conflicts section, conda is the package manager used for building the container image. Anaconda does not resolve packages from `conda_dependencies` and `pip_requirements` together and gives conda packages precedence. This can lead to a situation where the conda packages are not compatible with the pip packages. You might have specified `torch` in the `pip_requirements`, not in the `conda_dependencies`. Consider consolidating the dependencies into either `conda_dependencies` or `pip_requirements`. If that is not possible, prefer specifying the most important packages in `conda_dependencies`.

---
title: Use Snowflake online feature store in production
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/use-online-feature-store-in-production.md
section: Snowflake ML
---

# Use Snowflake online feature store in production

Snowflake ML Feature Store helps manage your features throughout the process of feature engineering.

For online applications that require low-latency inference, use the online feature store to serve your features.

The following sections go through productionizing the process of retrieving features within your Python application. These sections have
code examples that do the following:

1. Load the Iris dataset into Snowflake
2. Define the connection to Snowflake
3. Create the Feature Store and Feature Views
4. Retrieve the features and feature values
5. Generate predictions from your model

The code examples are written in Python. To go through this workflow for applications written in other languages, use a Snowflake driver
that’s specific to that language. For more information, see [Drivers](../../drivers.md).

## Prerequisites

To run online ML feature retrieval in Snowflake, you need the following:

* Data that you’ve already loaded into Snowflake
* A Snowflake feature store
* Feature views
* Online feature serving enabled for each feature view

You can use features from your own Snowflake feature store, but you can use the following code to load the Iris dataset into Snowflake if
you don’t already have a feature store.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
import pandas as pd

from snowflake.snowpark.context import get_active_session

sf_session = get_active_session()

### Download the Iris dataset.

iris = load_iris()
X = pd.DataFrame(iris.data, columns=iris.feature_names)
# rename the columns to fit the Snowflake feature naming requirements
X.rename(columns={
    'sepal length (cm)': 'SEPAL_LENGTH_CM',
    'sepal width (cm)': 'SEPAL_WIDTH_CM',
    'petal length (cm)': 'PETAL_LENGTH_CM',
    'petal width (cm)': 'PETAL_WIDTH_CM'
}, inplace=True)
y = iris.target

### Load the data into Snowflake.
X = X.reset_index().rename(columns={"index": "ID"})
sepal_df = sf_session.write_pandas(
    X[['ID', 'SEPAL_LENGTH_CM', 'SEPAL_WIDTH_CM']],
    table_name="SEPAL_DATA",
    auto_create_table=True,
    overwrite=True
)
petal_df = sf_session.write_pandas(
    X[['ID', 'PETAL_LENGTH_CM', 'PETAL_WIDTH_CM']],
    table_name="PETAL_DATA",
    auto_create_table=True,
    overwrite=True
)
```

After you have the data in your environment, you create the feature store. The following code creates a feature store and the
`id_entity` entity for the different samples from the Iris dataset.

```python
### Install Snowflake ML
%pip install snowflake-ml-python==1.18.0

from snowflake.ml.feature_store import (
    FeatureStore,
    FeatureView,
    Entity,
    CreationMode,
)
from snowflake.ml.feature_store.feature_view import OnlineConfig

### Create Snowflake feature store

feature_store = FeatureStore(
    session=sf_session,
    database=sf_session.get_current_database(),
    name="MY_FEATURE_STORE",
    default_warehouse=sf_session.get_current_warehouse(),
    creation_mode=CreationMode.OR_REPLACE
)
sf_session.use_schema("MY_FEATURE_STORE")

id_entity = Entity(
    name='SAMPLE_ID',
    join_keys=["ID"],
    desc='sample id'
)
feature_store.register_entity(id_entity)
```

> **Note:**
>
> Snowflake ML Feature Store has the concept of entities. Entities are keys that organize features between feature views. For more information
> about entities, see [Working with entities](entities.md).

After you’ve created the feature store, you define the feature views. The following code defines the sepal and petal feature views from the
Iris dataset.

```python
### Create feature views with Online Serving.
sepal_fv = FeatureView(
    name='SEPAL_FEATURES',
    entities=[id_entity],
    feature_df=sepal_df,
    desc='Sepal features',
    refresh_freq='10 minutes',
    online_config=OnlineConfig(enable=True)
)
petal_fv = FeatureView(
    name='PETAL_FEATURES',
    entities=[id_entity],
    feature_df=petal_df,
    desc='Petal features',
    refresh_freq='10 minutes',
    online_config=OnlineConfig(enable=True)
)
sepal_fv = feature_store.register_feature_view(
    sepal_fv, version="v1", overwrite=True)
petal_fv = feature_store.register_feature_view(
    petal_fv, version="v1", overwrite=True)
```

## Retrieve the feature values

After you’ve registered the feature views and enabled online feature serving for each feature view, you can have the feature values from
each feature view served to your application.

To retrieve the feature values, you do the following:

1. Set up a connection to Snowflake
2. Create the session and Snowflake Feature Store objects that initialize when the application starts
3. Retrieve the features from your feature views
4. Create a prediction endpoint and get predictions from that endpoint

> **Important:**
>
> You must install `snowflake-ml-python>=1.18.0` into your application’s environment to use the Feature Store API.

To connect to Snowflake from your application, you must set up either a [Programmatic Access Token (PAT)](../../../user-guide/programmatic-access-tokens.md) or
[key-pair authentication](../../../user-guide/key-pair-auth.md) as an authentication method.

### Configure the client

When you initialize your application, it must connect to Snowflake ML Feature Store API and create the required Feature Store Python
objects.

Use the following sections to configure your client’s connection to the Snowflake ML Feature Store API.

### Configure a Programmatic Access Token (PAT)

Programmatic Access Token (PAT)Key Pair Authentication

Specify the following connection parameters in the following code to connect to Snowflake from your application:

* `schema` - the name of the Snowflake feature store
* `database` - the database containing the schema or feature store
* `role` - the role required to read from the feature store. For more information, see
  [Provide access to create and serve online features](create-and-serve-online-features-python.md).
* `password` - your PAT.

```python
import os

### Define connection parameters using PAT authentication.
snowflake_connection_parameters = {
    "account": "<account_identifier>",
    "user": "<user>",
    "password": pat,
    "role": "<FS_CONSUMER_ROLE>",
    "host": "<host>",
    "warehouse": "<warehouse>",
    "database": "<database>",
    "schema": "MY_FEATURE_STORE",
}
```

Specify the following connection parameters in the following code to connect to Snowflake from your application:

* `schema` - the name of the Snowflake feature store
* `database` - the database containing the schema or feature store
* `role` - the role required to read from the feature store. For more information, see
  [Create and serve online features](create-and-serve-online-features-python.md).
* `private_key_file` - the private key file
* `private_key_file_pwd` - the password to the private key file

```python
import os

### Define connection parameters for key-pair authentication.
snowflake_connection_parameters = {
    "account": "<account_identifier>",
    "user": "<user>",
    "private_key_file": "<private key file>",
    "private_key_file_pwd": "<private key file pwd>",
    "role": "<FS_CONSUMER_ROLE>",
    "host": "<host>",
    "warehouse": "<warehouse>",
    "database": "<database>",
    "schema": "MY_FEATURE_STORE",
}
```

**Create the Session and Feature Store Objects**

After you’ve defined your connection parameters, you create the session and Feature Store objects that your application uses to connect to
Snowflake.

The following code:

1. Creates the Snowflake Session, the client that your application uses to communicate with Snowflake.
2. Configures a thread pool executor to enable feature retrieval parallelism.
3. Lists the features that we’re retrieving from the feature store.
4. Initializes the feature store reader client. This object wraps the Snowflake session. It’s the main way your application interacts with the
   feature store.
5. Initializes the feature views that you’ve defined. You can replace these with your own features.

```python
import os
from concurrent.futures import ThreadPoolExecutor

from snowflake.snowpark.session import Session
from snowflake.ml.feature_store import FeatureStore, CreationMode

# 1.Start a Snowflake session
sf_session = Session.builder.configs(snowflake_connection_parameters).create()

# 2. Create a thread pool executor for feature store requests
MAX_WORKERS=os.cpu_count() * 2
executor = ThreadPoolExecutor(max_workers=MAX_WORKERS)

# 3. List individual features we are going to retrieve for inference. In this
#    example, we are listing Iris features described above in the
#    "Prerequisites" section.
PETAL_FEATURE_LIST = ["PETAL_WIDTH_CM", "PETAL_LENGTH_CM"]
SEPAL_FEATURE_LIST = ["SEPAL_WIDTH_CM", "SEPAL_LENGTH_CM"]

# 4. Initialize feature store consumer client
feature_store = FeatureStore(
    session=sf_session,
    database=sf_session.get_current_database(),
    name="MY_FEATURE_STORE",
    default_warehouse="<warehouse>",
    creation_mode=CreationMode.FAIL_IF_NOT_EXIST
)

# 5. Initialize the feature views
sepal_fv = feature_store.get_feature_view("SEPAL_FEATURES", version="v1")
petal_fv = feature_store.get_feature_view("PETAL_FEATURES", version="v1")
```

## Retrieve the online features on the serving path

After you’ve defined how the application initializes, you can create a prediction endpoint.

There are different ways where you can define how your application handles requests. The following Python code:

* Defines the prediction endpoint in your application
* Takes the keys from the JSON request
* Uses the keys to retrieve the feature values from the feature views
* Passes those feature values to the model
* Gets the predictions from the model
* Returns the predictions in the response

```python
from snowflake.ml.feature_store.feature_view import StoreType
import json
import flask

def _retrieve_features(
    feature_view: FeatureView,
    keys: List[int],
    feature_names: List[str]):
    """Retrieve features from the given feature view"""

    return feature_store.read_feature_view(
        feature_view,
        keys=[keys],
        feature_names=feature_names,
        store_type=StoreType.ONLINE  # Query the ONLINE store
    ).collect()

@app.route("/prediction-endpoint", methods=["POST"])
def prediction():
    if flask.request.content_type == 'application/json':
        input_data = flask.request.data.decode("utf-8")
        input_data = json.loads(data)
    else:
        return flask.Response(
            response="This predictor only supports JSON data",
            status=415,
            mimetype="text/plain"
        )

    # Expect that input data is a single key
    keys = [int(input_data["key"])]

    # Retrieve features from two feature views in parallel.
    sepal_features = executor.submit(
        _retrieve_features, sepal_fv, keys, SEPAL_FEATURE_LIST)
    petal_features = executor.submit(
        _retrieve_features, petal_fv, keys, PETAL_FEATURE_LIST)

    sepal_features = sepal_features.result()
    petal_features = petal_features.result()

    predictions = []
    if len(sepal_features) != 0 and len(petal_features) != 0:
        # Compose the feature vector, excluding the join keys.
        feature_vector = (
            list(sepal_features[0][1:])
            + list(petal_features[0][1:])
        )

        # Using a hypothetical run_inference function.
        predictions = run_inference(feature_vector)

    result = json.dumps({"results": list(predictions)})
    return flask.Response(response=result, status=200,
        mimetype="application/json")
```

The preceding code calls a hypothetical `run_inference` function. Your own inference function could get predictions from your model
regardless of whether it’s hosted remotely or in application memory.

The prediction endpoint in the preceding code accepts a key and returns the prediction for that key. Your data might have multiple keys
characterizing a single sample. The preceding code is meant to be an example that you can adapt to your own use case.

## Related content

* [Create and serve online features](create-and-serve-online-features-python.md)
* [Snowflake Feature Store](overview.md)
* [Feature Store SQL commands](../../../sql-reference/commands-feature-store.md)
* [Online feature store Notebook examples](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/ml/feature_store)

---
title: Using built-in model types
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/overview.md
section: Snowflake ML
---

# Using built-in model types

The Snowflake Model Registry supports the following built-in model types:

* [Snowpark ML Modeling](snowpark-ml.md)
* [scikit-learn](scikit-learn.md)
* [XGBoost](xgboost.md)
* [LightGBM](lightgbm.md)
* [Prophet](prophet.md)
* [CatBoost](catboost.md)
* [PyTorch](pytorch.md)
* [TensorFlow](tensorflow.md)
* [Keras](keras.md)
* [MLFlow PyFunc](mlflow.md)
* [Sentence Transformer](sentence-transformer.md)
* [Hugging Face pipeline](hugging-face.md)

Other types of models are supported via the `snowflake.ml.model.CustomModel` class (see [Bring your own model types via serialized files](../bring-your-own-model-types.md))

---
title: Using partitioned models
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/partitioned-models.md
section: Snowflake ML
---

# Using partitioned models

Many datasets can be partitioned into multiple independent subsets. For example, a dataset containing sales data
for a chain of stores can be partitioned by store number. A separate model can then be trained for each partition.
Training and inference operations on the partitions can be parallelized, reducing the wall-clock time for these
operations. Furthermore, since individual stores likely differ significantly in how their features affect their
sales, this approach can lead to more accurate inference at the store level.

The Snowflake Model Registry supports distributed processing of training and inference of partitioned data when:

* The dataset contains a column that reliably identifies partitions in the data.
* The data in each individual partition is uncorrelated with the data in the other partitions and contains enough
  rows to train the model.

Models may be stateless (training is performed each time inference is called) or stateful (training is performed once
before inference and retained for use in multiple inference operations).

With the Snowflake Model Registry, implement partitioned training and inference using
[custom models](bring-your-own-model-types.md). During inference, the model
inference method partitions the dataset, generates predictions for each partition in parallel using all the nodes and
cores in your warehouse, and combines the results into a single dataset afterward.

> **Note:**
>
> For partitioned models, it’s important to distinguish the registered model from the individual models that
> are created by or compose the registered model. Where possible, we will refer to the individual underlying models
> as submodels.

## Defining and logging the model

The partitioned model class inherits from `snowflake.ml.model.custom_model.CustomModel`, and inference methods are
declared with the `@custom_model.partitioned_api` decorator. See
[Bring your own model types via serialized files](bring-your-own-model-types.md) for information on defining standard custom models.

```python
import pandas as pd

from snowflake.ml.model import custom_model

class ExamplePartitionedModel(custom_model.CustomModel):

  @custom_model.partitioned_api
  def predict(self, input: pd.DataFrame) -> pd.DataFrame:
      # All data in the partition will be loaded in the input dataframe.
      #… implement model logic here …
      return output_df

my_model = ExamplePartitionedModel()
```

When logging the model, provide a `function_type` of `TABLE_FUNCTION` in the `options` dictionary along with any
other [options](overview.md) your model requires.

```python
from snowflake.ml.registry import Registry

reg = Registry(session=sp_session, database_name="ML", schema_name="REGISTRY")
model_version = reg.log_model(my_model,
  model_name="my_model",
  version_name="v1",
  options={"function_type": "TABLE_FUNCTION"},    ###
  conda_dependencies=["scikit-learn"],
  sample_input_data=train_features
)
```

If your partitioned model also has regular (non-table) functions as methods, you can use the `method_options`
dictionary to specify the type of each method instead.

```python
model_version = reg.log_model(my_model,
    model_name="my_model",
    version_name="v1",
    options={
      "method_options": {                                 ###
        "METHOD1": {"function_type": "TABLE_FUNCTION"},   ###
        "METHOD2": {"function_type": "FUNCTION"}          ###
      }
    },
    conda_dependencies=["scikit-learn"],
    sample_input_data=train_features,
)
```

## Partitioned model inference

Use the `run` method of a Python `ModelVersion` object to invoke the table function methods in a partitioned
fashion, passing `partition_column` to specify the name of the column that contains a numeric or string value that
identifies the partition of each record. As usual, you may pass a Snowpark or pandas DataFrame (the latter is useful for
local testing). You will receive the same type of DataFrame as the result. In these examples, inference is partitioned
on a store number.

```python
model_version.run(
  input_df,
  function_name="PREDICT",
  partition_column="STORE_NUMBER"
)
```

You can also call the model table functions directly with SQL, as shown here.

```sqlexample
SELECT output1, output2, partition_column
  FROM input_table,
      TABLE(
          my_model!predict(input_table.input1, input_table.input2)
          OVER (PARTITION BY input_table.store_number)
      )
  ORDER BY input_table.store_number;
```

The input data is automatically split among the nodes and cores in your warehouse and the partitions are processed
in parallel.

For more information about table function syntax, see [Calling a UDF with SQL](../../udf/udf-calling-sql.md).

### Using parameters with partitioned models

Partitioned model methods decorated with `@partitioned_api` can accept optional inference parameters, the same way
as `@inference_api` methods. Define parameters as keyword-only arguments (after `*`), with type annotations and
default values:

```python
class PartitionedModelWithParams(custom_model.CustomModel):

  @custom_model.partitioned_api
  def predict(
      self,
      input_df: pd.DataFrame,
      *,
      n_estimators: int = 100,
      learning_rate: float = 0.1,
  ) -> pd.DataFrame:
      import xgboost
      training_data = ...

      my_model = xgboost.XGBRegressor(
          n_estimators=n_estimators,
          learning_rate=learning_rate,
      )
      my_model.fit(training_data)

      output_df = my_model.predict(...)
      return output_df
```

Pass parameters at inference time through `mv.run`:

```python
model_version.run(
    input_df,
    function_name="PREDICT",
    partition_column="STORE_NUMBER",
    params={"n_estimators": 200, "learning_rate": 0.05}
)
```

Or in SQL using positional or named arguments:

```sqlexample
-- Positional: input columns, then parameters
SELECT output1, output2, partition_column
  FROM input_table,
      TABLE(
          my_model!predict(input_table.input1, input_table.input2, 200, 0.05)
          OVER (PARTITION BY input_table.store_number)
      )
  ORDER BY input_table.store_number;

-- Named arguments (all arguments must be named)
SELECT output1, output2, partition_column
  FROM input_table,
      TABLE(
          my_model!predict(
              input1 => input_table.input1,
              input2 => input_table.input2,
              n_estimators => 200
          )
          OVER (PARTITION BY input_table.store_number)
      )
  ORDER BY input_table.store_number;
```

For more information about defining parameters, see
[Specifying model signatures](model-signature.md) and
[Defining inference parameters](bring-your-own-model-types.md).

### Using batch inference jobs with partitioned models

> **Note:**
>
> This feature requires `snowflake-ml-python` version 1.33.0 or later.

You can also use the `run_batch` method to run partitioned inference as a
[batch inference job](../inference/batch-inference-jobs.md) on
Snowpark Container Services (SPCS). This is useful for large-scale workloads that benefit from distributed
compute with configurable resources.

To partition the input data, pass the `partition_column` argument in `InputSpec`:

```python
from snowflake.ml.model.batch import InputSpec, OutputSpec

job = model_version.run_batch(
    input_df,
    compute_pool="my_compute_pool",
    input_spec=InputSpec(partition_column="STORE_NUMBER"),
    output_spec=OutputSpec(stage_location="@my_db.my_schema.my_stage/results/"),
)
```

For full details on batch inference jobs, see
[Batch inference jobs](../inference/batch-inference-jobs.md).

## Stateless partitioned models

In the simplest application of partitioned models, training and inference are both done when `predict` is
called. The model is fitted, inference is run, and the fitted model is discarded immediately afterward. This type
of model is called “stateless” because no fit state is stored. Here is an example in which each partition trains
an XGBoost model:

```python
class ExampleStatelessPartitionedModel(custom_model.CustomModel):

  @custom_model.partitioned_api
  def predict(self, input_df: pd.DataFrame) -> pd.DataFrame:
      import xgboost
      # All data in the partition will be loaded in the input dataframe.
      # Construct training data by transforming input_df.
      training_data = ...

      # Train the model.
      my_model = xgboost.XGBRegressor()
      my_model.fit(training_data)

      # Generate predictions.
      output_df = my_model.predict(...)

      return output_df

my_model = ExampleStatelessPartitionedModel()
```

See the [Partitioned Model Quickstart Guide](https://quickstarts.snowflake.com/guide/partitioned-ml-model/)
for an example of a stateless partitioned model, including sample data.

## Stateful partitioned models

It’s also possible to implement stateful partitioned models that load stored submodel fit state. You do this by providing
models in memory via the `snowflake.ml.model.custom_model.ModelContext` or by providing file paths pointing to fitted
model artifacts and loading them during inference.

The following example shows how to provide models in memory to the model context.

```python
from snowflake.ml.model import custom_model

# `models` is a dict with model ids as keys, and fitted xgboost models as values.
models = {
  "model1": models[0],
  "model2": models[1],
  ...
}

model_context = custom_model.ModelContext(
  models=models
)
my_stateful_model = MyStatefulCustomModel(model_context=model_context)
```

When logging `my_stateful_model`, the submodels provided in the context are stored along with all model files.
They can then be accessed in the inference method logic by retrieving them from context, as shown below:

```python
class ExampleStatefulModel(custom_model.CustomModel):

  @custom_model.inference_api
  def predict(self, input: pd.DataFrame) -> pd.DataFrame:
    model1 = self.context.model_ref("model1")
    # ... use model1 for inference
```

It’s also possible to access the models programmatically by partition ID in the `predict` method. If a partition column is
provided as an input feature, it can be used to access a model fitted for the partition. For example, if the partition column
is `MY_PARTITION_COLUMN`, the following model class can be defined:

```python
class ExampleStatefulModel(custom_model.CustomModel):

  @custom_model.inference_api
  def predict(self, input: pd.DataFrame) -> pd.DataFrame:
    model_id = input["MY_PARTITION_COLUMN"][0]
    model = self.context.model_ref(model_id)
    # ... use model for inference
```

Similarly, submodels can be stored as artifacts and loaded at runtime. This approach is useful when the models are too
large to fit into memory. Provide string file paths to the model context. The filepaths are accessible during inference
with `self.context.path(artifact_id)`. For more information, see [Defining model context by keyword arguments](bring-your-own-model-types.md).

## Example

See the [Partitioned Model Quickstart Guide](https://quickstarts.snowflake.com/guide/partitioned-ml-model/)
for an example, including sample data.

See the [Many Model Inference in Snowflake Quickstart Guide](https://quickstarts.snowflake.com/guide/many-model-inference-in-snowflake/)
for an example of a stateful partitioned custom model.

---
title: Violin Plots
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/model-explainability-visualization/violin-plots.md
section: Snowflake ML
---

# Violin Plots

Use the `plot_violin()` function to create a SHAP violin plot. This can be used to visualize the distribution and range of SHAP values for each feature.

## Required arguments

| Argument | Description |
| --- | --- |
| `shap_df` | 2D array containing SHAP values for multiple features |
| `feature_df` | 2D array containing the corresponding feature values |

## Optional arguments

| Argument | Description |
| --- | --- |
| `figsize` | A tuple of (width, height) that controls the size of the plot. Uses a default size of (1400, 100) if not specified. |

The function returns a chart that visualizes a violin plot for each feature. The violin plots are sorted
by the absolute mean SHAP value of each feature, with the features with the most significant
influence on the model’s predictions at the top.

---
title: Working with entities
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/entities.md
section: Snowflake ML
---

# Working with entities

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

Entities organize feature views by subject matter so that users can more easily find the feature views they need.
For example, a feature store for a video streaming service might define entities for users and movies. Each feature view in the feature store
is tagged as related to movies or to users, or to both, and you can retrieve a list of feature views related to these entities.

In addition to helping to organize feature views, entities store the names of the key columns you can use to join the
extracted features back to the original data.

## Creating an entity

To create a new entity and register it in a feature store, use the feature store’s `register_entity` method. Here,
`fs` is the feature store instance (see [Creating or connecting to a feature store](create.md)).

```python
from snowflake.ml.feature_store import Entity

entity = Entity(
    name="MY_ENTITY",
    join_keys=["UNIQUE_ID"],
    desc="my entity"
)
fs.register_entity(entity)
```

## Listing entities

To see the registered entities in your feature store, use the feature store’s `list_entities` method, which
returns a Snowpark DataFrame. (`fs` is the feature store instance; see [Creating or connecting to a feature store](create.md).)

```python
fs.list_entities().show()
```

## Retrieving an entity

You can retrieve a registered entity using the feature store’s `get_entity` method; for example, to obtain its join keys.

```python
entity = fs.get_entity(name="MY_ENTITY")
print(entity.join_keys)
```

## Modifying an entity

You can update an entity’s description using the feature store’s `update_entity` method:

```python
fs.update_entity(
    name="MY_ENTITY",
    desc="NEW DESCRIPTION"
)
```

Other aspects of the entity, such as its join keys, are immutable. To change these, create a new entity.

## Deleting an entity

You can delete an entity using the feature store’s `delete_entity` method.

```python
fs.delete_entity(name="MY_ENTITY")
```

Entities that are referenced by any feature view cannot be deleted.

## Known limitations

* Entities are implemented as tags and are subject to the limit of [10,000 tags per account](../../../sql-reference/sql/create-tag.md)
  and [50 unique tags per object](../../../user-guide/object-tagging/introduction.md).

---
title: Working with feature views
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/feature-store/feature-views.md
section: Snowflake ML
---

# Working with feature views

> **Note:**
>
> The Snowflake Feature Store API is available in the Snowpark ML Python package (`snowflake-ml-python`) v1.5.0 and later.

A *feature view* encapsulates the transformation of raw data into one or more related *features.* All features in a
feature view are refreshed on the same schedule. Feature stores are backed by a *feature table* that stores the features.

The Snowflake Feature Store supports two different kinds of feature views:

* Snowflake-managed feature view: The feature table is automatically
  refreshed from raw data by Snowflake on a schedule you specify. A feature view is considered Snowflake-managed if you
  provide a schedule for refreshing it.
* External feature view: If you don’t provide a schedule for
  refreshing the feature view, it’s considered external. You are responsible for maintaining the feature table,
  updating features from raw data as needed, for example using a tool such as [dbt](https://www.getdbt.com/).

The class `snowflake.ml.feature_store.FeatureView` is the Python API for interacting with feature views. The
`FeatureView` constructor accepts a Snowpark DataFrame that contains the feature generation logic. The provided
DataFrame must also contain the `join_keys` columns specified in the entities associated with the feature view. A
timestamp column name is required if your feature view includes time-series features.

See the [Feature Store API Reference](https://docs.snowflake.com/en/developer-guide/snowpark-ml/reference/latest/feature_store)
for full details of the Python API.

## Creating a Snowflake-managed feature view

A Snowflake-managed feature view uses a [dynamic table](../../../user-guide/dynamic-tables-about.md) as the feature table.
Features are extracted from the source data on a schedule you specify, handling new data efficiently and incrementally.
The illustration below shows the flow of data from its source, through feature transformations, into a feature table.

To create a Snowflake-managed feature view, use code like the following Python block, where `entity` is the
[entity](entities.md) that the features are associated with, and `my_df` is the Snowpark DataFrame that contains
your feature transformation logic based on your source data.

Setting the `refresh_freq` parameter designates the feature view as Snowflake-managed. The value can be a time delta
(minimum value `1 minute`), or it can be a `cron` expression with time zone (e.g. `* * * * * America/Los_Angeles`).

```python
from snowflake.ml.feature_store import FeatureView

managed_fv = FeatureView(
    name="MY_MANAGED_FV",
    entities=[entity],
    feature_df=my_df,                   # Snowpark DataFrame containing feature transformations
    timestamp_col="ts",                 # optional timestamp column name in the dataframe
    refresh_freq="5 minutes",           # how often feature data refreshes
    desc="my managed feature view"      # optional description
)
```

You can write feature transformations using Snowpark Python or in SQL. The Snowpark Python API provides
[utility functions](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameAnalyticsFunctions)
for defining common feature types such as windowed aggregations. See [Common feature and query patterns](examples.md) for examples of using
these functions.

To qualify for incremental refresh, each source table must have [change tracking enabled](../../../user-guide/dynamic-tables-create.md).
If change tracking is not already enabled on a source table, Snowflake attempts to enable it automatically when creating
the feature view’s dynamic table. This requires OWNERSHIP of the table. If you do not own the table, ask the owner to
enable change tracking, or create the feature view with `refresh_mode='FULL'`, which fully reads the source table
for each refresh.

## Creating an external feature view

Features generated outside of the Snowflake Feature Store can be registered by setting the `refresh_freq`
parameter to `None` when creating them. In this situation, you must create and maintain the feature table
yourself. The feature DataFrame is based on the feature table, not on the raw data source, and usually contains a simple
projection from this table, with no transformations.

> **Note:**
>
> You *can* perform feature transformations in the feature DataFrame; these calculations are carried out as needed when you
> retrieve data from the feature view. However, external feature views are primarily intended for use with tools such
> as [dbt](https://www.getdbt.com/) that you already use to perform feature transformations. Generally, you should use
> Snowflake-managed feature views if you want Snowflake to perform
> feature transformation.

The illustration below shows the flow of data from its source, through feature transformation by an external tool (here
dbt), into a feature table.

External feature views are implemented as [views](../../../user-guide/views-introduction.md) on your feature table, so they
incur no additional storage cost.

The code below shows how to create an external feature view.

```python
external_fv = FeatureView(
    name="MY_EXTERNAL_FV",
    entities=[entity],
    feature_df=my_df,                   # Snowpark DataFrame referencing the feature table
    timestamp_col="ts",                 # optional timestamp column name in the dataframe
    refresh_freq=None,                  # None means the feature view is external
    desc="my external feature view"     # optional description
)
```

## Making feature views more discoverable

Adding per-feature descriptions to the `FeatureView` makes it easier to find features using
[Snowsight Universal Search](../../../user-guide/ui-snowsight-universal-search.md). The following example uses a feature view’s
`attach_feature_desc` method to provide a short description of each included feature in a Python dictionary:

```python
external_fv = external_fv.attach_feature_desc(
    {
        "SENDERID": "Sender account ID for the transaction",
        "RECEIVERID": "Receiver account ID for the transaction",
        "IBAN": "International Bank Identifier for the receiver bank",
        "AMOUNT": "Amount of the transaction"
    }
)
```

Both kinds of feature views can be enriched with feature descriptions.

## Registering feature views

Once a feature view has been completely defined, you can register it in the feature store using the feature store’s
`register_feature_view` method, with a customized name and version. Incremental maintenance (for supported query
types) and automatic refresh occur based on the specified refresh frequency.

When the provided query cannot be maintained via incremental maintenance using a dynamic table, the table will be fully
refreshed from the query at the specified frequency. This may lead to greater lag in feature refresh and higher
maintenance costs. You can alter the query logic, breaking the query into multiple smaller queries that support
incremental maintenance, or provision a larger virtual warehouse for dynamic table maintenance. See
[General limitations](../../../user-guide/dynamic-tables-limitations.md) for the latest information on dynamic table limitations.

```python
registered_fv: FeatureView = fs.register_feature_view(
    feature_view=managed_fv,    # feature view created above, could also use external_fv
    version="1",
    block=True,         # whether function call blocks until initial data is available
    overwrite=False,    # whether to replace existing feature view with same name/version
)
```

A feature view pipeline definition is immutable after it has been registered, providing consistent feature computation as
long as the feature view exists.

## Retrieving feature views

Once a feature view has been registered with the feature store, you can retrieve it from there when you need it by using
the feature store’s `get_feature_view` method:

```python
retrieved_fv: FeatureView = fs.get_feature_view(
    name="MY_MANAGED_FV",
    version="1"
)
```

## Discovering feature views

You can list all registered feature views in the feature store, optionally filtering by entity name or feature view
name, using the `list_feature_views` method. Information about the matching features is returned as a Snowpark
DataFrame. The following code shows an example of getting a list of feature views; `fs` is a reference to the
feature store.

```python
fs.list_feature_views(
    entity_name="<entity_name>",                # optional
    feature_view_name="<feature_view_name>",    # optional
).show()
```

Features can also be discovered using the Snowsight Feature Store UI or Universal Search.

## Updating feature views

You can update some properties of a feature view you have registered in the feature store using the feature store’s
`update_feature_view` method. The updatable properties are:

* The feature view’s refresh frequency
* The warehouse where the feature transforms execute
* The description of the feature view

Feature definitions and columns cannot be modified. To change the features in a feature store, create a new version of
the feature view.

When you call `update_feature_view`, specify the feature view version to be updated by providing its name and
version. The additional parameters specify the properties to be updated; you can specify just the ones you want to
change. The following code shows an example of changing feature view properties; `fs` is a reference to the
feature store.

```python
fs.update_feature_view(
    name="<name>",
    version="<version>",
    refresh_freq="<new_fresh_freq>",    # optional
    warehouse="<new_warehouse>",        # optional
    desc="<new_description>",           # optional
)
```

## Deleting feature views

You can delete a feature view from the feature store with the feature store’s `delete_feature_view` method. The
following code shows an example of deleting a feature view; `fs` is a reference to the feature store.

```python
fs.delete_feature_view(
    feature_view="<name>",
    version="<version>",
)
```

> **Warning:**
>
> Deleting a feature view version breaks any pipelines that use it. Make sure the feature view version is not in use
> before deleting it.

## Cost considerations

Snowflake-managed feature views use Snowflake dynamic tables. See [Monitor dynamic tables](../../../user-guide/dynamic-tables-monitor.md) for information
on monitoring dynamic tables and [Understanding costs for dynamic tables](../../../user-guide/dynamic-tables-cost.md) for information on the costs of dynamic tables.
External feature views use views, which do not incur additional storage costs.

## Known limitations

* The maximum number of Snowflake-managed feature views and the feature transformation queries in feature views are subject to the
  [limitations of dynamic tables](../../../user-guide/dynamic-tables-limitations.md).
* Not all feature transformation queries are supported by dynamic incremental refresh.
  [See the limitations](../../../user-guide/dynamic-tables-limitations.md).
* Feature view names are SQL identifiers and subject to [Snowflake identifier requirements](../../../sql-reference/identifiers-syntax.md).
* Feature view versions are strings and have a maximum length of 128 characters. Some characters are not permitted and will
  produce an error message.

---
title: XGBoost
source: https://docs.snowflake.com/en/developer-guide/snowflake-ml/model-registry/built-in-models/xgboost.md
section: Snowflake ML
---

# XGBoost

The Snowflake ML Model Registry supports models created using XGBoost (models derived from `xgboost.XGBModel` or `xgboost.Booster`).

The following additional options can be used in the `options` dictionary when you call `log_model`:

| Option | Description |
| --- | --- |
| `target_methods` | A list of the names of the methods available on the model object. Models derived from `XGBModel` have the following target methods by default, assuming the method exists: `predict`, `predict_proba`. (Before v1.4.0, `apply` was also included.) Models derived from `Booster` have the `predict` method by default. |
| `cuda_version` | The version of the CUDA runtime to be used when deploying to a platform with GPU; defaults to 11.8. If manually set to `None`, the model cannot be deployed to a platform having a GPU. |

You must specify either the `sample_input_data` or `signatures` parameter when logging an XGBoost model so
that the registry knows the signatures of the target methods.

## Example

```python
import xgboost
from sklearn import datasets, model_selection

cal_X, cal_y = datasets.load_breast_cancer(as_frame=True, return_X_y=True)
cal_X_train, cal_X_test, cal_y_train, cal_y_test = model_selection.train_test_split(cal_X, cal_y)
params = dict(n_estimators=100, reg_lambda=1, gamma=0, max_depth=3, objective="binary:logistic")
regressor = xgboost.train(params, xgboost.DMatrix(data=cal_X_train, label=cal_y_train))
model_ref = registry.log_model(
    regressor,
    model_name="xgBooster",
    version_name="v1",
    sample_input_data=cal_X_test,
    options={
        "target_methods": ["predict"],
        "method_options": {
            "predict": {"case_sensitive": True},
        },
    },
)
model_ref.run(cal_X_test[-10:])
```

## Streamlit in Snowflake

Build and deploy interactive Streamlit apps directly inside Snowflake.

---
title: About Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/about-streamlit.md
section: Streamlit in Snowflake
---

# About Streamlit in Snowflake

This topic describes key features of Streamlit in Snowflake.

## What is Streamlit?

[Streamlit](https://streamlit.io/) is an open-source Python library that makes it easy to create
and share custom web apps for machine learning and data science. By using Streamlit you can quickly
build and deploy powerful data applications. For more information about the open-source library, see the
[Streamlit documentation](https://docs.streamlit.io/).

## Deploy Streamlit apps in Snowflake

Streamlit in Snowflake helps developers securely build, deploy, and share Streamlit apps on Snowflake’s data
cloud. Using Streamlit in Snowflake, you can build applications that process and use data in Snowflake without moving
data or application code to an external system.

### Key features of Streamlit in Snowflake

* Snowflake manages the underlying compute and storage for your Streamlit app.
* Snowflake stores your source code and environment configuration within a Snowflake object that uses [Role-based Access Control (RBAC)](../../user-guide/security-access-control-overview.md) to manage access to your Streamlit app.
* You can choose between a warehouse and container runtime.
* Streamlit in Snowflake works seamlessly with Snowpark, user-defined functions (UDFs), stored procedures, and Snowflake Native App Framework.
* When working in Snowsight, you can use the side-by-side editor and app preview to quickly modify your source code and environment.

## Use cases

For additional use cases on building dashboards, data tools, and ML/AI, see [Streamlit in Snowflake demos](https://github.com/Snowflake-Labs/snowflake-demo-streamlit).

> **Note:**
>
> These quickstarts are only shown as examples. Following along with the example may require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Snowflake does not guarantee the accuracy of these examples or
> cover them under any Service Level Agreement.

## Developer guides

The following guides explain working with Streamlit in Snowflake.

| Guide | Description |
| --- | --- |
| [Getting started with Streamlit in Snowflake](getting-started/overview.md) | Deploy your first app with sample code and learn the basics. |
| [Create your Streamlit app](app-development/creating-your-app.md) | Deploy a Streamlit app from your existing code using Snowsight, SQL, or Snowflake CLI. |
| [Runtime environments for Streamlit apps](app-development/runtime-environments.md) | Understand the container and warehouse runtime environments for Streamlit in Snowflake apps. |

---
title: Create your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/creating-your-app.md
section: Streamlit in Snowflake
---

# Create your Streamlit app

This topic describes how to deploy a Streamlit in Snowflake app from existing Streamlit app code. If you’re
new to Streamlit in Snowflake and want to try a starter app first, see
[Getting started with Streamlit in Snowflake](../getting-started/overview.md).

Before you begin:

* Ensure that you meet the required [prerequisites](../getting-started/overview.md).
* Choose a [runtime environment](runtime-environments.md) for your app (container or warehouse).
* Prepare your [dependencies](dependency-management.md) in a `requirements.txt`,
  `pyproject.toml`, or `environment.yml` file.
* Review the expected [file organization](file-organization.md) for your app’s source files.

## Deploy your app code

If you already have a Streamlit app on your local machine or on a Snowflake stage, use one
of the following methods to create a STREAMLIT object from your source files.

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. Enter a name for your app.
5. In the App location dropdown, select the database and schema for your app.
6. Configure the runtime for your app:

   To create a container-runtime app, make the following selections:

   * Select Run on container.
   * If a compute pool dropdown appears, select the compute pool to run your app on.
     If no dropdown appears, the app uses the account-level
     [DEFAULT_STREAMLIT_COMPUTE_POOL](../../../sql-reference/parameters.md) parameter. You can change the compute
     pool after the app is created. See [Change the compute pool](managing-your-app.md).
   * Select a query warehouse to run your app’s queries on.

   To create a warehouse-runtime app, make the following selections:

   * Select Run on warehouse.
   * Select a warehouse to run your app on.
7. Select Create.
8. In the editor, replace the starter code with your own app code. You can paste code
   directly or upload files:

   * To upload files, select + (Add) » Upload file, choose the files,
     and select Upload.
   * To create additional files (such as `pyproject.toml`), select + (Add)
     » Create new file.
9. Select Run.

1. Upload your app files to a named stage:

   ```sqlexample
   CREATE STAGE IF NOT EXISTS my_db.my_schema.my_stage;

   PUT file:///path/to/streamlit_app.py @my_db.my_schema.my_stage/app
      AUTO_COMPRESS = FALSE OVERWRITE = TRUE;
   PUT file:///path/to/pyproject.toml @my_db.my_schema.my_stage/app
      AUTO_COMPRESS = FALSE OVERWRITE = TRUE;
   ```

   You can also upload files through Snowsight as described in
   [Staging files using Snowsight](../../../user-guide/data-load-local-file-system-stage-ui.md).
2. Create the STREAMLIT object from your staged files:

   To create a container-runtime app, run the following command:

   ```sqlexample
   CREATE STREAMLIT my_app
      FROM '@my_db.my_schema.my_stage/app'
      MAIN_FILE = 'streamlit_app.py'
      RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
      COMPUTE_POOL = my_compute_pool
      QUERY_WAREHOUSE = my_warehouse;
   ```

   To create a warehouse-runtime app, omit the RUNTIME_NAME and COMPUTE_POOL parameters:

   ```sqlexample
   CREATE STREAMLIT my_app
      FROM '@my_db.my_schema.my_stage/app'
      MAIN_FILE = 'streamlit_app.py'
      QUERY_WAREHOUSE = my_warehouse;
   ```
3. Push your code to the live version:

   ```sqlexample
   ALTER STREAMLIT my_app ADD LIVE VERSION FROM LAST;
   ```

   You must run this command before users with only USAGE privilege on the Streamlit
   object can view it.

For the full parameter reference, see [CREATE STREAMLIT](../../../sql-reference/sql/create-streamlit.md).

> **Note:**
>
> [Snowflake CLI](../../snowflake-cli/installation/installation.md) version 3.14.0
> or later is required. Version 3.14+ uses the modern CREATE STREAMLIT syntax by default.

1. In your project directory, create a `snowflake.yml` file alongside your app code.

   To create a container-runtime app, use the following configuration:

   ```yaml
   definition_version: 2
   entities:
      my_streamlit:
         type: streamlit
         identifier: my_app
         query_warehouse: my_warehouse
         compute_pool: my_compute_pool
         runtime_name: SYSTEM$ST_CONTAINER_RUNTIME_PY3_11
         main_file: streamlit_app.py
         artifacts:
         - streamlit_app.py
         - pyproject.toml
   ```

   To create a warehouse-runtime app, omit `compute_pool` and `runtime_name`:

   ```yaml
   definition_version: 2
   entities:
      my_streamlit:
         type: streamlit
         identifier: my_app
         query_warehouse: my_warehouse
         main_file: streamlit_app.py
         artifacts:
         - streamlit_app.py
         - environment.yml
   ```

   List all files your app needs in the `artifacts` section.
2. Deploy the app:

   ```snowcli
   snow streamlit deploy --open
   ```

For more information, see the
[Creating a Streamlit app](../../snowflake-cli/streamlit-apps/manage-apps/initialize-app.md) and
[Deploying a Streamlit app](../../snowflake-cli/streamlit-apps/manage-apps/deploy-app.md) guides.

## View a Streamlit app

For information about the privileges required to view a Streamlit app, see
[Privileges required to view a Streamlit app](../object-management/privileges.md).

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app you want to view.

If you are viewing a multipage Streamlit app, select a tab to view additional pages.

To view information about a STREAMLIT object:

```sqlexample
DESC STREAMLIT my_app;
```

To view the app in a browser, sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md), then In the navigation menu, select Projects » Streamlit,
and select the app.

To get the URL for your deployed app:

```snowcli
snow streamlit get-url my_app
```

## Set up CI/CD with GitHub Actions

You can deploy Streamlit in Snowflake apps automatically from a Git repository using Snowflake CLI
and [GitHub Actions](https://docs.github.com/en/actions). You can use a similar
approach with other CI/CD providers.

### Prerequisites

* A GitHub repository containing your Streamlit app files and `snowflake.yml`.
* A `SNOWCLI_PW` secret configured in your GitHub repository settings.

### Example workflow

Create a `.github/workflows/deploy.yml` file in your repository:

```yaml
name: Deploy via Snowflake CLI

on:
  push:
    branches:
      - main

env:
  PYTHON_VERSION: '3.12'

jobs:
  build-and-deploy:
    runs-on: ubuntu-latest
    environment: dev
    steps:
      - name: 'Checkout GitHub Action'
        uses: actions/checkout@v3

      - name: Install Python
        uses: actions/setup-python@v4
        with:
          python-version: ${{ env.PYTHON_VERSION }}

      - name: 'Install Snowflake CLI'
        shell: bash
        run: |
          python -m pip install --upgrade pip
          pip install snowflake-cli

      - name: 'Create config'
        shell: bash
        env:
          SNOWFLAKE_PASSWORD: ${{ secrets.SNOWCLI_PW }}
        run: |
          mkdir -p ~/.snowflake
          cp config.toml ~/.snowflake/config.toml
          echo "password = \"$SNOWFLAKE_PASSWORD\"" >> ~/.snowflake/config.toml
          chmod 0600 ~/.snowflake/config.toml

      - name: 'Deploy the Streamlit app'
        shell: bash
        run: |
          snow streamlit deploy --replace
```

Commit and push the file to trigger the workflow.

For more information, see [GitHub Actions documentation](https://docs.github.com/en/actions).

---
title: Custom sleep timer for a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/sleep-timer.md
section: Streamlit in Snowflake
---

# Custom sleep timer for a Streamlit app

This topic describes how to set a custom sleep timer for a Streamlit app in Streamlit in Snowflake on warehouses.

## About sleep timers for Streamlit apps

Sleep timers only apply to Streamlit apps that use warehouse runtimes. Container runtimes
are intended for long-running services and don’t support sleep timers.

The sleep timer is configured using the Streamlit app’s `config.toml` configuration file.
If your app was created with the ROOT_LOCATION parameter, you must use SQL to PUT the configuration
file in the app’s stage location. Otherwise, you can use SQL or the Snowsight Streamlit in Snowflake editor.

## WebSocket timeout

When a viewer opens a Streamlit app, a WebSocket connection is established between the
viewer’s browser and the Streamlit server. If there is no custom sleep timer, the
app will automatically suspend after the WebSocket connection times out due to inactivity.
At the account level, the default WebSocket timeout is approximately 15 minutes. You can
change your account’s WebSocket timeout for all Streamlit apps by contacting Snowflake Support.

When you set a custom sleep timer, the timer attempts to keep an app awake until the specified
time limit is reached, and then attempts to close the connection gracefully. However, depending
on a user’s browser settings, the timing mechanism may be suspended or delayed by an
inactive browser tab. In such cases, the app is subject to the WebSocket timeout setting. Therefore,
if you set a custom sleep timer that is less than the WebSocket timeout, your app may not
automatically suspend as quickly as expected in some scenarios. For the best results, set your
WebSocket timeout to be equal to the smallest custom sleep timer used by your apps.

Additionally, any mouse movement over an app will reset both the WebSocket timeout
and the custom sleep timer.

## Set a custom sleep timer using Snowsight

If your Streamlit app is using a warehouse runtime, to reduce code warehouse costs,
you can set a custom sleep timer for a Streamlit app to auto-suspend. If your app was
created with the ROOT_LOCATION parameter, you must use the PUT command instead of Snowsight.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your Streamlit app.
3. In the upper-right corner, select Edit.
4. If `.streamlit/config.toml` doesn’t exist, in the file explorer on the left,
   select  » Create new file. Enter `.streamlit/config.toml`, and select Create.
5. In the file explorer on the left, navigate to `.streamlit/config.toml`.
6. In the file editor, set the value of `streamlitSleepTimeoutMinutes` in the `[snowflake.sleep]` table.

   For example, if you want the Streamlit app to auto-suspend after 8 minutes, add the following text to the `config.toml` file:

   ```toml
   [snowflake]
   [snowflake.sleep]
   streamlitSleepTimeoutMinutes = 8
   ```

## Set a custom sleep timer using the PUT command

If your Streamlit app was created with the ROOT_LOCATION parameter, you must use
the PUT command to modify your app’s configuration file. If your Streamlit app was created with the
FROM parameter, you can use either the PUT command or Snowsight to modify your app’s
configuration file.

1. Create or modify the `config.toml` file on your local machine to set
   `streamlitSleepTimeoutMinutes` in the `[snowflake.sleep]` table.

   For example, if you want the Streamlit app to auto-suspend after 8 minutes, include the following text in your `config.toml` file:

   ```toml
   [snowflake]
   [snowflake.sleep]
   streamlitSleepTimeoutMinutes = 8
   ```
2. Upload the `config.toml` file to your app’s stage location.

   > If your app was created with the ROOT_LOCATION parameter, execute the following command:
   >
   > ```sqlexample
   > PUT file:///<path_to_your_local_directory>/config.toml @streamlit_db.streamlit_schema.streamlit_stage/.streamlit/ overwrite=true auto_compress=false;
   > ```
   >
   > If your app was created with the FROM parameter, execute the following command:
   >
   > ```sqlexample
   > PUT file:///<path_to_your_local_directory>/config.toml snow://streamlit/streamlit_db.streamlit_schema.streamlit_stage/versions/live/.streamlit/ overwrite=true auto_compress=false;
   > ```

For more information about working with Streamlit files, see [Create your Streamlit app](../app-development/creating-your-app.md).

> **Note:**
>
> You can set the `streamlitSleepTimeoutMinutes` to any value between 5 to 240 minutes.
>
> If you do not create the configuration file to specify the timer, the default auto-suspend time is 15 minutes.

---
title: Delete your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/deleting-your-app.md
section: Streamlit in Snowflake
---

# Delete your Streamlit app

Deleting a Streamlit app permanently removes it from Snowflake. Any users with whom you
have shared the app will no longer be able to view or interact with it. Before deleting an
app, ensure that you have saved your application code outside of Snowflake.

## Delete a Streamlit app

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app you want to delete.
4. Select Edit.
5. Select the name of the app in the upper-left corner.
6. Select Delete, and then select Delete App.

Snowflake deletes the Streamlit app and displays the updated list of available apps.

Use the [DROP STREAMLIT](../../../sql-reference/sql/drop-streamlit.md) command:

```sqlexample
DROP STREAMLIT my_app;
```

Drop the Streamlit app:

```snowcli
snow streamlit drop my_app
```

---
title: Edit your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/editing-your-app.md
section: Streamlit in Snowflake
---

# Edit your Streamlit app

After deploying a Streamlit app in Snowsight, you can edit both the app code and
dependencies using Snowsight or SQL commands. The way your changes take effect
depends on the runtime environment and how the app was created.

> **Note:**
>
> Apps created with the ROOT_LOCATION parameter (legacy apps) have limited editing
> capabilities and should be converted to use the FROM parameter for full functionality.
> For more information, see [Understanding the different types of Streamlit objects](../migrations-and-upgrades/overview.md).
>
> This page only covers apps created with the FROM parameter.

Both container and warehouse runtime environments are subject to possible race conditions
when multiple people edit the same app simultaneously. See the Collaborative editing considerations
section below for details and best practices.

## Editing methods

You can edit your app through an in-browser editor in Snowsight or by uploading
files using SQL commands.

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your Streamlit app.
3. In the upper-right corner, select Edit.
4. In the file explorer, select or create a new file to edit:

   * To edit an existing file, select it from the file explorer.
   * To create a new file, select + (Add) » Create new file, enter
     the filename, and then select Create. You can include subdirectories in the filename,
     like `subdir/new_file.py`.
   * To upload a file from your local machine, select + (Add) » Upload file,
     choose the file to upload, modify the filename and path if needed, and then select
     Upload.
5. Make your changes in the editor pane.

   Changes are automatically saved to the app’s source location, after a few seconds.
6. Optional: Select Run.

   If you don’t want to wait a few seconds for the changes to be saved, you can select
   Run to copy the changes immediately.
7. If your app uses a warehouse runtime, viewers must select Run to copy the
   changes to their app instance. If your app uses a container runtime, changes are
   directly saved to the live app’s source and will be visible to all viewers the
   next time they interact with the app.

If you have your edited app files on a stage, you can CREATE OR REPLACE your
app with the following command:

```sqlexample
CREATE OR REPLACE STREAMLIT my_app
FROM '@my_stage/app_folder'
MAIN_FILE = 'streamlit_app.py'
QUERY_WAREHOUSE = my_warehouse
RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
COMPUTE_POOL = my_compute_pool
EXTERNAL_ACCESS_INTEGRATIONS = (pypi_access_integration);
```

Alternatively, if you want to update your app files in place or want to update only
a subset of your app files, you can use the following commands:

1. Get the URI of your app’s source location:

   ```sqlexample
   DESCRIBE STREAMLIT my_app;
   ```

   The `live_version_location_uri` value is the source location for your app. Copy
   this to use in the next step.
2. Upload one or more updated app files to the source location with PUT or COPY FILES.

   ```sqlexample
   COPY FILES INTO '<live_version_location_uri>' FROM @my_stage FILES = ('streamlit_app.py');;
   ```

> **Note:**
>
> [Snowflake CLI](../../snowflake-cli/installation/installation.md) version 3.14.0
> or later is required. Version 3.14+ uses the modern CREATE STREAMLIT syntax by default.

If you have a complete set of edited app files on your local machine (including its
`snowflake.yml` file for Snow CLI), you can redeploy your app with the following
command:

```snowcli
snow streamlit deploy --replace
```

## Runtime behavior differences

The way your edits take effect depends on your app’s [runtime type](runtime-environments.md).

### Container runtime

When you edit a container runtime app:

* Changes to your app’s source go directly to the live app.
* Current viewers see updates the next time they interact with the app and trigger a rerun. (The Streamlit
  [configuration option](https://docs.streamlit.io/develop/api-reference/configuration/config.toml#server)
  `server.runOnSave` is disabled by default.)
* The Run button is available to viewers but not required to propagate changes to a current viewing or editing session.
* All users see the same app instance with immediate changes.

Even though the live app is shared between viewers, the view of the source code in
Snowsight editors isn’t. Therefore, apps on container runtimes are still
subject to race conditions when multiple people edit the app simultaneously. See the
Collaborative editing considerations section below for details and best practices.

### Warehouse runtime

When you edit a warehouse runtime app:

* App source code is copied when each viewer’s instance starts.
* Current viewers must select Run to copy updates made to the source during their session.
* Even the person making edits must click Run to see changes in their preview pane.
* Each viewer gets their own isolated app instance.

## Collaborative editing considerations

When multiple people edit the same app, be aware of potential conflicts. Both
container and warehouse runtime apps are subject to the following race condition
if more than one person edits the app simultaneously.

### Race conditions

The Snowsight editor works as follows:

* The current source code is copied into the editor pane when you open it or use the file navigator to open a file.
* If you are viewing a file in the editor pane, it doesn’t update automatically when changes are made by others.
* If you make changes in your editor pane, the automatic save will overwrite any changes made by others after you opened the editor.
* There’s no automatic merging of conflicting edits.

For example, the following sequence can lead to lost changes:

1. Developer A opens the editor at 2:00 PM.
2. Developer B makes and saves changes at 2:15 PM.
3. Developer A saves changes at 2:30 PM.
4. Developer B’s changes are lost (overwritten by Developer A).

### Best practices for team editing

To avoid conflicts when working with a team:

* Communicate with your team members before making edits.
* Keep your source files in a Git repository and deploy your code from there.
* Use separate development apps for testing changes.
* Reload the Snowsight editor to get the latest version immediately before making changes.

---
title: Example: Build a form that writes to Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/getting-started/example-crud-app.md
section: Streamlit in Snowflake
---

# Example: Build a form that writes to Snowflake

This example walks you through building a Streamlit in Snowflake app that collects user input through a form
and writes it to a Snowflake table. The app also reads the data back to display all
submissions, and uses `st.user` to track who submitted each entry.

The app uses a container runtime. Before you begin, make sure you’ve completed the
[prerequisites](overview.md).

## Set up the target table

This example uses a database called `crud_demo`. You can substitute any database
and schema you have access to – just update the references in the SQL and app code to match.

Create a table to store form submissions. Run the following SQL in a worksheet or SQL session:

```sqlexample
CREATE OR REPLACE TABLE crud_demo.public.feedback (
   submitted_at TIMESTAMP_NTZ DEFAULT CURRENT_TIMESTAMP(),
   submitted_by VARCHAR,
   category VARCHAR,
   rating INTEGER,
   comments VARCHAR
);
```

## Write the app code

On your local machine, create a file named `streamlit_app.py` with the following code.
If you plan to use Snowsight, you can paste this code into the editor after creating
the app.

```python
import streamlit as st

st.title("Feedback Form")
st.write(f"Logged in as: {st.user.user_name}")

conn = st.connection("snowflake")
session = conn.session()

with st.form("feedback_form"):
    category = st.selectbox(
        "Category", ["Bug Report", "Feature Request", "General Feedback"]
    )
    rating = st.slider("Rating", 1, 5, 3)
    comments = st.text_area("Comments")
    submitted = st.form_submit_button("Submit")

if submitted:
    session.sql(
        """
        INSERT INTO crud_demo.public.feedback
            (submitted_by, category, rating, comments)
        VALUES (?, ?, ?, ?)
        """,
        params=[st.user.user_name, category, rating, comments],
    ).collect()
    st.success("Feedback submitted!")

st.subheader("Last 10 submissions")
data = session.sql(
    "SELECT * FROM crud_demo.public.feedback ORDER BY submitted_at DESC LIMIT 10"
).to_pandas()
st.dataframe(data, use_container_width=True)
```

This app uses:

* `st.form` to collect input before submitting, preventing re-runs on every widget
  interaction.
* `st.connection("snowflake").session()` to get a Snowpark session for writing data.
  For more information, see [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md).
* `session.sql()` instead of `conn.query()` to read back the submissions.
  `conn.query()` caches results by default, so new entries wouldn’t appear until the
  cache expires. `session.sql()` executes a fresh query on every rerun.
* `st.user.user_name` to record who submitted each entry. For more information,
  see [Personalize your Streamlit app with user information](../app-development/personalization.md).

## Declare dependencies

This app only uses `streamlit` and the built-in Snowflake connection, so no additional
dependencies are required.

For more information, see [Manage dependencies for your Streamlit app](../app-development/dependency-management.md).

## Deploy the app

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. Enter `feedback_app` as the app name.
5. Select a database and schema.
6. Select Run on container, then select a compute pool and query warehouse.
7. Select Create.
8. In the editor, replace the starter code with the app code above.
9. Select Run.

1. Stage your app files:

   ```sqlexample
   CREATE STAGE IF NOT EXISTS crud_demo.public.app_stage;

   PUT file:///path/to/streamlit_app.py @crud_demo.public.app_stage/feedback
      AUTO_COMPRESS = FALSE OVERWRITE = TRUE;
   ```
2. Create the Streamlit app:

   ```sqlexample
   CREATE STREAMLIT crud_demo.public.feedback_app
      FROM '@crud_demo.public.app_stage/feedback'
      MAIN_FILE = 'streamlit_app.py'
      RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
      COMPUTE_POOL = my_compute_pool
      QUERY_WAREHOUSE = my_warehouse;

   ALTER STREAMLIT crud_demo.public.feedback_app ADD LIVE VERSION FROM LAST;
   ```
3. To view your app, sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md), then In the navigation menu, select Projects » Streamlit, and select your app.

> **Note:**
>
> [Snowflake CLI](../../snowflake-cli/installation/installation.md) version 3.14.0
> or later is required. Version 3.14+ uses the modern CREATE STREAMLIT syntax by default.

1. Create a project directory with the following structure:

   ```none
   feedback_app/
   ├── snowflake.yml
   └── streamlit_app.py
   ```
2. Create a `snowflake.yml` file:

   ```yaml
   definition_version: 2
   entities:
      feedback_app:
         type: streamlit
         identifier: feedback_app
         query_warehouse: my_warehouse
         compute_pool: my_compute_pool
         runtime_name: SYSTEM$ST_CONTAINER_RUNTIME_PY3_11
         main_file: streamlit_app.py
         artifacts:
         - streamlit_app.py
   ```
3. Deploy the app:

   ```snowcli
   snow streamlit deploy --open
   ```

## Try the app

1. Open the app in your browser.
2. Fill in the form fields and select Submit.
3. The feedback table below the form updates to show your new submission, including your
   email address and a timestamp.
4. Submit a few more entries, then try filtering or sorting the data in the table.

## Extend the app

Try adding a delete button next to each row, or a chart that shows the average rating
by category. For example, add the following after the dataframe:

```python
import plotly.express as px

if not data.empty:
    avg_ratings = data.groupby("CATEGORY")["RATING"].mean().reset_index()
    fig = px.bar(avg_ratings, x="CATEGORY", y="RATING", title="Average Rating by Category")
    st.plotly_chart(fig, use_container_width=True)
```

If you add `plotly`, declare it in a `requirements.txt` file:

```text
plotly
```

For more complex dependency scenarios, you can use a `pyproject.toml` file instead.
For more information, see [Manage dependencies for your Streamlit app](../app-development/dependency-management.md).

## Clean up

To remove the resources created in this example, run the following SQL:

```sqlexample
DROP STREAMLIT IF EXISTS crud_demo.public.feedback_app;
DROP TABLE IF EXISTS crud_demo.public.feedback;
```

## What’s next?

* [Create your Streamlit app](../app-development/creating-your-app.md): Learn about all the options for creating apps.
* [Personalize your Streamlit app with user information](../app-development/personalization.md): Explore all the user attributes available
  through `st.user`.
* [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md): Access secrets and external services
  in your app.
* [Sharing Streamlit in Snowflake apps](../features/sharing-streamlit-apps.md): Share your app with other users.

---
title: Example: Build a personalized data dashboard
source: https://docs.snowflake.com/en/developer-guide/streamlit/getting-started/example-data-dashboard.md
section: Streamlit in Snowflake
---

# Example: Build a personalized data dashboard

This example walks you through building a Streamlit in Snowflake app that queries Snowflake data, adds a
third-party charting library, and personalizes the display for each viewer. By the end,
you’ll understand the core development cycle: create, deploy, edit, and redeploy.

The app uses a container runtime. Before you begin, make sure you’ve completed the
[prerequisites](overview.md).

## Set up sample data

This example uses a database called `dashboard_demo`. You can substitute any database
and schema you have access to – just update the references in the SQL and app code to match.

Create a table with sample revenue data. Run the following SQL in a worksheet or SQL session:

```sqlexample
CREATE OR REPLACE TABLE dashboard_demo.public.monthly_revenue (
   month DATE,
   region VARCHAR,
   revenue NUMBER(12, 2)
);

INSERT INTO dashboard_demo.public.monthly_revenue VALUES
   ('2026-01-01', 'North America', 125000.00),
   ('2026-01-01', 'Europe', 98000.00),
   ('2026-01-01', 'Asia Pacific', 87000.00),
   ('2026-02-01', 'North America', 132000.00),
   ('2026-02-01', 'Europe', 101000.00),
   ('2026-02-01', 'Asia Pacific', 93000.00),
   ('2026-03-01', 'North America', 141000.00),
   ('2026-03-01', 'Europe', 110000.00),
   ('2026-03-01', 'Asia Pacific', 99000.00);
```

## Write the app code

On your local machine, in a project directory of your choice, create a file named
`streamlit_app.py` with the following code. If you plan to use Snowsight,
you can paste this code into the editor after creating the app.

```python
import streamlit as st
import plotly.express as px

st.title("Revenue Dashboard")
st.write(f"Welcome, {st.user.user_name}!")

conn = st.connection("snowflake")

df = conn.query("""
    SELECT month, region, revenue
    FROM dashboard_demo.public.monthly_revenue
    ORDER BY month
""")

selected_regions = st.multiselect(
    "Filter by region",
    options=df["REGION"].unique(),
    default=df["REGION"].unique(),
)

filtered = df[df["REGION"].isin(selected_regions)]

fig = px.bar(
    filtered,
    x="MONTH",
    y="REVENUE",
    color="REGION",
    barmode="group",
    title="Monthly Revenue by Region",
)
st.plotly_chart(fig, use_container_width=True)

st.dataframe(filtered, use_container_width=True)
```

This app uses:

* `conn.query()` to query data from Snowflake. Results are cached automatically, so
  the query only runs once until the cache expires. For more information, see
  [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md).
* `st.user.user_name` to greet the current viewer. For more information, see
  [Personalize your Streamlit app with user information](../app-development/personalization.md).
* `plotly` for interactive charts, which is an external dependency that you declare in the
  next step.

## Declare dependencies

Container runtimes install packages listed in a `requirements.txt` file. Create a
`requirements.txt` file alongside your `streamlit_app.py`:

```text
plotly
streamlit
```

When the app starts, the container runtime automatically installs the declared packages.
For more complex dependency scenarios, you can use a `pyproject.toml` file instead.
For more information, see [Manage dependencies for your Streamlit app](../app-development/dependency-management.md).

## Deploy the app

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. Enter `revenue_dashboard` as the app name.
5. Select a database and schema.
6. Select Run on container, then select a compute pool and query warehouse.
7. Select Create.
8. In the editor, replace the starter code with the app code above.
9. Upload or create the `requirements.txt` file by selecting + (Add) »
   Create new file, entering `requirements.txt`, and pasting the contents.
10. Select Run.

1. Stage your app files:

   ```sqlexample
   CREATE STAGE IF NOT EXISTS dashboard_demo.public.app_stage;

   PUT file:///path/to/streamlit_app.py @dashboard_demo.public.app_stage/dashboard
      AUTO_COMPRESS = FALSE OVERWRITE = TRUE;
   PUT file:///path/to/requirements.txt @dashboard_demo.public.app_stage/dashboard
      AUTO_COMPRESS = FALSE OVERWRITE = TRUE;
   ```
2. Create the Streamlit app:

   ```sqlexample
   CREATE STREAMLIT dashboard_demo.public.revenue_dashboard
      FROM '@dashboard_demo.public.app_stage/dashboard'
      MAIN_FILE = 'streamlit_app.py'
      RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
      COMPUTE_POOL = my_compute_pool
      QUERY_WAREHOUSE = my_warehouse;

   ALTER STREAMLIT dashboard_demo.public.revenue_dashboard ADD LIVE VERSION FROM LAST;
   ```
3. To view your app, sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md), then In the navigation menu, select Projects » Streamlit, and select your app.

> **Note:**
>
> [Snowflake CLI](../../snowflake-cli/installation/installation.md) version 3.14.0
> or later is required. Version 3.14+ uses the modern CREATE STREAMLIT syntax by default.

1. Create a project directory with the following structure:

   ```none
   revenue_dashboard/
   ├── snowflake.yml
   ├── requirements.txt
   └── streamlit_app.py
   ```
2. Create a `snowflake.yml` file:

   ```yaml
   definition_version: 2
   entities:
      revenue_dashboard:
         type: streamlit
         identifier: revenue_dashboard
         query_warehouse: my_warehouse
         compute_pool: my_compute_pool
         runtime_name: SYSTEM$ST_CONTAINER_RUNTIME_PY3_11
         main_file: streamlit_app.py
         artifacts:
         - streamlit_app.py
         - requirements.txt
   ```
3. Deploy the app:

   ```snowcli
   snow streamlit deploy --open
   ```

## Make a change

Try editing your app to see the development cycle in action. Add a summary metric by
inserting the following two lines into `streamlit_app.py`, between the
`filtered = ...` line and the `fig = px.bar(...)` line:

```python
total = filtered["REVENUE"].sum()
st.metric("Total Revenue", f"${total:,.0f}")
```

SnowsightSQLSnowflake CLI

If you’re editing in the browser, paste the lines into the editor and select Run.

Stage the updated file, then copy it to your app’s live version location:

```sqlexample
PUT file:///path/to/streamlit_app.py @dashboard_demo.public.app_stage/dashboard
   AUTO_COMPRESS = FALSE OVERWRITE = TRUE;

DESCRIBE STREAMLIT dashboard_demo.public.revenue_dashboard;
-- Copy the live_version_location_uri value from the result.

COPY FILES INTO '<live_version_location_uri>'
   FROM @dashboard_demo.public.app_stage/dashboard
   FILES = ('streamlit_app.py');
```

Save the file locally and redeploy:

```snowcli
snow streamlit deploy --replace
```

For more information about the editing workflow, see
[Edit your Streamlit app](../app-development/editing-your-app.md).

## Clean up

To remove the resources created in this example, run the following SQL:

```sqlexample
DROP STREAMLIT IF EXISTS dashboard_demo.public.revenue_dashboard;
DROP TABLE IF EXISTS dashboard_demo.public.monthly_revenue;
```

## What’s next?

* [Example: Build a form that writes to Snowflake](example-crud-app.md): Build an app with a form that writes data back to Snowflake.
* [Personalize your Streamlit app with user information](../app-development/personalization.md): Learn more about personalizing apps with
  `st.user`.
* [External network access in Streamlit in Snowflake](../features/external-access.md): Connect your app to external APIs.

---
title: External network access in Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/external-access.md
section: Streamlit in Snowflake
---

# External network access in Streamlit in Snowflake

This topic describes how to create secure access to network locations external to Snowflake.

## External network access in Streamlit in Snowflake

You can create secure access to specific network locations external to Snowflake, and you can use that access from within the
Streamlit app code. You can enable access through an external access integration.

To enable a Streamlit app to use an external access integration (EAI), you can run the [CREATE STREAMLIT](../../../sql-reference/sql/create-streamlit.md)
or [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command and set the EXTERNAL_ACCESS_INTEGRATIONS parameter to include that EAI.

With an EAI, you can use Python libraries that access external locations, such as `requests` or `urllib`,
and use third-party libraries that require access to a network location.

For more information, see [External network access overview](../../external-network-access/external-network-access-overview.md).

### Example: Access the OpenAI API

The following example shows how to create an EAI for an outbound request to the OpenAI API:

1. To create a network rule that represents the external network’s location and restrictions for access, use the [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md)
   command:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE network_rules
     MODE = EGRESS
     TYPE = HOST_PORT
     VALUE_LIST = ('api.openai.com');
   ```

   For more information, see [Creating a network rule to represent the external network location](../../external-network-access/creating-using-external-network-access.md).
2. To create a secret that represents credentials required to authenticate with the external network location, use the
   [CREATE SECRET](../../../sql-reference/sql/create-secret.md) command:

   ```sqlexample
   CREATE OR REPLACE SECRET openai_key
     TYPE = GENERIC_STRING
     SECRET_STRING = '<any_string>';
   ```

   For more information, see [Creating a secret to represent credentials](../../external-network-access/creating-using-external-network-access.md).
3. To create an EAI, run the [CREATE EXTERNAL ACCESS INTEGRATION](../../../sql-reference/sql/create-external-access-integration.md) command, setting ALLOWED_NETWORK_RULES
   to the network rule you created and ALLOWED_AUTHENTICATION_SECRETS to the secret you created:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION openai_access_int
     ALLOWED_NETWORK_RULES = (network_rules)
     ALLOWED_AUTHENTICATION_SECRETS = (openai_key)
     ENABLED = TRUE;
   ```
4. To grant the required privileges to use the SECRET and INTEGRATION objects for external access to the Streamlit app creator, use the [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md)
   command:

   ```sqlexample
   GRANT READ ON SECRET openai_key TO ROLE streamlit_app_creator_role;
   GRANT USAGE ON INTEGRATION openai_access_int TO ROLE streamlit_app_creator_role;
   ```
5. To enable the Streamlit app to use the integration, run the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command, setting the
   EXTERNAL_ACCESS_INTEGRATIONS property to the integration:

   ```sqlexample
   USE ROLE streamlit_app_creator_role;

   ALTER STREAMLIT streamlit_db.streamlit_schema.streamlit_app
     SET EXTERNAL_ACCESS_INTEGRATIONS = (openai_access_int)
     SECRETS = ('my_openai_key' = streamlit_db.streamlit_schema.openai_key);
   ```

   > **Note:**
   >
   > You can also set up a new Streamlit object to use an external access integration by specifying the EXTERNAL_ACCESS_INTEGRATIONS parameter
   > when you run the [CREATE STREAMLIT](../../../sql-reference/sql/create-streamlit.md) command:
   >
   > ```sqlexample
   > CREATE STREAMLIT streamlit_db.streamlit_schema.streamlit_app
   >   ROOT_LOCATION = '<stage_path_and_root_directory>'
   >   MAIN_FILE = '<path_to_main_file_in_root_directory>'
   >   EXTERNAL_ACCESS_INTEGRATIONS = (openai_access_int)
   >   SECRETS = ('my_openai_key' = streamlit_db.streamlit_schema.openai_key);
   > ```
6. In your Streamlit app code, call the external API:

   ```python
   from openai import OpenAI
   import streamlit as st
   import _snowflake

   st.title(":speech_balloon: Simple chat app using an external LLM")
   st.write("This app shows how to call an external LLM to build a simple chat application.")

   # Use the _snowflake library to access secrets
   secret = _snowflake.get_generic_secret_string('my_openai_key')
   client = OpenAI(api_key=secret)

   # ...
   # code to use API
   # ...
   ```

---
title: Getting started with Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/getting-started/overview.md
section: Streamlit in Snowflake
---

# Getting started with Streamlit in Snowflake

This topic walks you through deploying your first Streamlit in Snowflake app in under five minutes using
a container runtime. After that, two hands-on examples show you how to build real apps
that query data, personalize the experience for each viewer, and write back to Snowflake.

## Prerequisites

Before you can create a Streamlit app, ensure that your administrator has completed the
[essential security setup](../object-management/security.md) for Streamlit apps.

Your role must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Database where you create the Streamlit app |  |
| CREATE STREAMLIT,  USAGE | Schema where you create the Streamlit app |  |
| USAGE | Compute pool that runs the Streamlit app | For all accounts, Snowflake configures a general-purpose compute pool that typical users will have access to. For more information, see [Configuring your own preferred compute pools for Streamlit apps](../../snowpark-container-services/working-with-compute-pool.md). |
| USAGE | Warehouse that runs queries in the Streamlit app |  |

For more information, see [Privileges required to create and use a Streamlit app](../object-management/privileges.md).

## Deploy your first Streamlit in Snowflake app

The fastest way to get started is to create a Streamlit app using the default starter code.
When you create an app without specifying source files, Snowflake provides example code
automatically.

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select + Streamlit App.
4. Enter a name for your app.
5. Select a database and schema to create your app in.
6. Select Run on container.
7. Select a compute pool and a query warehouse.
8. Select Create.

Snowsight redirects you to the app editor. Your app will be ready
within a few minutes. Then, you can view and edit it immediately.

Run the following SQL commands in a SQL session:

```sqlexample
CREATE STREAMLIT my_first_app
   RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
   COMPUTE_POOL = my_compute_pool
   QUERY_WAREHOUSE = my_warehouse;

ALTER STREAMLIT my_first_app ADD LIVE VERSION FROM LAST;
```

To view your app, sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md), then In the navigation menu, select Projects » Streamlit, and select your app.

> **Note:**
>
> [Snowflake CLI](../../snowflake-cli/installation/installation.md) version 3.14.0
> or later is required. Version 3.14+ uses the modern CREATE STREAMLIT syntax by default.

1. Initialize a new Streamlit project:

   ```snowcli
   snow init my_first_app --template example_streamlit
   ```
2. Navigate to the project directory:

   ```bash
   cd my_first_app
   ```
3. Edit the `snowflake.yml` file to use a container runtime:

   ```yaml
   definition_version: 2
   entities:
      my_streamlit:
         type: streamlit
         identifier: my_first_app
         query_warehouse: my_warehouse
         compute_pool: my_compute_pool
         runtime_name: SYSTEM$ST_CONTAINER_RUNTIME_PY3_11
         main_file: streamlit_app.py
         artifacts:
         - streamlit_app.py
   ```
4. Deploy the app and open it in your browser:

   ```snowcli
   snow streamlit deploy --open
   ```

### Edit your app

After deploying, you can edit the app code to customize it. For a quick test:

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your app.
3. Select Edit.
4. Modify the code in `streamlit_app.py`.
5. Select Run to see your changes.

1. Get your app’s source location:

   ```sqlexample
   DESCRIBE STREAMLIT my_first_app;
   ```
2. Copy an updated file to that location:

   ```sqlexample
   COPY FILES INTO '<live_version_location_uri>' FROM @my_stage FILES = ('streamlit_app.py');
   ```

1. Edit `streamlit_app.py` in your local project directory.
2. Redeploy:

   ```snowcli
   snow streamlit deploy --replace
   ```

For more information, see [Edit your Streamlit app](../app-development/editing-your-app.md).

## What’s next?

Now that you have a running app, try one of these hands-on examples:

* [Example: Build a personalized data dashboard](example-data-dashboard.md): Build a dashboard that queries Snowflake data and personalizes
  the display for each viewer using `st.connection` and `st.user`.
* [Example: Build a form that writes to Snowflake](example-crud-app.md): Build a form that writes user input back to a Snowflake table,
  demonstrating `st.form`, dependency management, and `st.user`.

To learn more about specific topics:

* [Create your Streamlit app](../app-development/creating-your-app.md): Detailed instructions
  for creating apps from Snowsight, SQL, or the CLI.
* [Manage dependencies for your Streamlit app](../app-development/dependency-management.md): Add Python packages
  to your app.
* [Runtime environments for Streamlit apps](../app-development/runtime-environments.md): Understand container
  and warehouse runtimes.
* [External network access in Streamlit in Snowflake](../features/external-access.md): Connect your app to external services.

---
title: Limitations and library changes
source: https://docs.snowflake.com/en/developer-guide/streamlit/limitations.md
section: Streamlit in Snowflake
---

# Limitations and library changes

This topic describes limitations and feature behavior changes when a Streamlit feature
works differently in Snowflake than in the open-source library.

To view release notes for each Streamlit version, see [Streamlit documentation](https://docs.streamlit.io/develop/quick-reference/release-notes).

## Limitations and changes for all runtimes

The following limitations apply to all Streamlit in Snowflake apps, regardless of runtime environment:

* Unsupported Streamlit features
* Loading external resources
* Custom components with external scripts
* Query parameters
* Using external stages isn’t supported.
* Replication isn’t supported.
* Using `.so` files isn’t supported.

## Limitations and changes that vary by runtime

The following table compares limitations that differ between warehouse runtimes and container runtimes.
For more information about runtime environments, see [Runtime environments for Streamlit apps](app-development/runtime-environments.md).

| Limitation | Warehouse runtime | Container runtime |
| --- | --- | --- |
| [Python versions](app-development/dependency-management.md) | 3.9, 3.10, 3.11 | 3.11 only |
| [Streamlit versions](app-development/dependency-management.md) | 1.22+ (limited selection) | 1.50+ (any version, including `streamlit-nightly` versions) |
| Package-based v2 components | Not supported | Supported |
| Displaying large amounts of data | 32 MB | Configurable |
| File uploads | 200 MB | Configurable |
| Mapbox and Carto | Requires acknowledgement of the External Offerings Terms. | Not subject to the External Offerings Terms. |
| Caching | Single-session caching. Cached values can’t be shared between sessions. | Fully supported. Cached values are shared across all viewer sessions unless you use session-scoping in the cache decorator. |
| [ROOT_LOCATION parameter](migrations-and-upgrades/overview.md) | Supported as a legacy parameter in CREATE STREAMLIT. | Not supported |
| Maintenance window | Not applicable | Subject to the Snowpark Container Services [maintenance window](../snowpark-container-services/working-with-compute-pool.md). |
| Static file serving | Not supported | Supported |

## Limitation details

### Unsupported Streamlit features

The following Streamlit features are not fully supported in Streamlit in Snowflake:

* [`st.set_page_config`](https://docs.streamlit.io/develop/api-reference/configuration/st.set_page_config):

  > The `page_title`, `page_icon`, and `menu_items` properties of the
  > `st.set_page_config` command aren’t supported.
* [`config.toml`](https://docs.streamlit.io/develop/api-reference/configuration/config.toml) file:

  > For a summary of supported and unsupported configuration options, see [Streamlit configuration](app-development/secrets-and-configuration.md).

### Loading external resources

All Streamlit in Snowflake apps run within a Content Security Policy (CSP) that restricts what resources can be
loaded. The CSP blocks loading code from external domains and embedding external content in iframes.
It also blocks front-end calls that are generally considered unsafe, like `eval()`.
For more information about the CSP, see [Content Security Policy](object-management/security.md).

For example, the following code runs without a Python error, but the script is not loaded or executed in the browser:

```python
# This won't work
st.html(
   "<script src="http://www.example.com/example.js"></script>",
   unsafe_allow_javascript=True
)
```

> **Note:**
>
> App developers are responsible for security checks and the software supply chain of Streamlit in Snowflake app code per the
> [Snowflake’s Shared Responsibility Model](https://www.snowflake.com/en/resources/report/snowflake-shared-responsibility-model/).

### Custom components

As a consequence of the CSP, custom components can’t load scripts from external domains
in warehouse and container runtimes. Because package-based components use an asset directory
to serve their static content, the following differences apply:

* In warehouse runtimes, package-based v2 components that utilize an asset directory aren’t supported.
* In container runtimes, package-based v2 components are fully supported.

To use a v2 custom component in a warehouse runtime, it must be defined with inline HTML, CSS,
and JavaScript.

> **Note:**
>
> Components imported from a third-party source are subject to the license attached to that component. You are responsible
> for ensuring that your use of a component is permitted by its license.
>
> Snowflake doesn’t build or maintain third-party components that you might import into Streamlit in Snowflake. Use of such components is at
> your own risk and is not subject to any warranties, service level agreements, or other similar guarantees by Snowflake.

### Query parameters

For [`st.query_params`](https://docs.streamlit.io/develop/api-reference/caching-and-state/st.query_params) in Streamlit in Snowflake, a `streamlit-` prefix is added to each query parameter key in the URL. This prefix
isn’t included when you use `st.query_params` to get or set a value.

For example, consider the following URL:

```output
https://app.snowflake.com/org/account_name/#/streamlit-apps/DB.SCHEMA.APP_NAME?streamlit-first_key=one&streamlit-second_key=two
```

The parameters in this URL are accessible in `st.query_params` as the following key-value pairs:

```python
{
   "first_key" : "one",
   "second_key" : "two"
}
```

### Displaying large amounts of data

Streamlit apps running in warehouse runtimes have a 32-MB limit on the size of messages exchanged between the backend and the
frontend. If you attempt to display more than 32 MB of data with a single Streamlit command, like `st.dataframe`, the following error occurs:

```output
MessageSizeError: Data Size exceeds message limit
```

To avoid this limit, design your Streamlit app to display data in increments smaller than 32 MB. There is no explicit limit on the size of
a query you can run or the amount of data you can have in memory.

In container runtimes, this limit defaults to 200 MB and can be changed by setting the Streamlit configuration option,
`server.maxMessageSize`. However, the message size can’t exceed the capacity of the container’s memory. Allowing larger messages
could exceed the container’s memory limit, especially if concurrent viewers are present.

### File uploads

The default file-size limit for [`st.file_uploader`](https://docs.streamlit.io/develop/api-reference/widgets/st.file_uploader) and [`st.chat_input`](https://docs.streamlit.io/develop/api-reference/chat/st.chat_input) is 200 MB.
In warehouse runtimes, this isn’t configurable. In container runtimes, this limit can be changed by setting the Streamlit configuration option,
`server.maxUploadSize`. However, the file size can’t exceed the capacity of the container’s memory. Allowing larger files
could exceed the container’s memory limit, especially if concurrent viewers are present.

For larger files, consider processing data in smaller batches or using alternative uploading methods.

### Mapbox and Carto

Mapbox and Carto provide map tiles when you use the [`st.map`](https://docs.streamlit.io/develop/api-reference/charts/st.map) or [`st.pydeck_chart`](https://docs.streamlit.io/develop/api-reference/charts/st.pydeck_chart) Streamlit commands.

In warehouse runtimes, which manage their packages with conda, Mapbox and Carto are third-party
applications that are subject to Snowflake’s
[External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).

To use these commands in warehouse runtimes, you must acknowledge the External Offerings Terms.
Container runtimes don’t require this acknowledgement.

### Caching

Caching is partially supported in warehouse runtimes and fully supported in container runtimes.

In warehouse runtimes, caching is limited to single-session caching. Cached values can’t be shared between sessions.
In container runtimes, caching is fully supported. Cached values are shared across all viewer sessions.

---
title: Logging and tracing for Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/logging-tracing.md
section: Streamlit in Snowflake
---

# Logging and tracing for Streamlit in Snowflake

Streamlit in Snowflake supports logging for both warehouse and container runtimes. Warehouse runtimes use the
Snowflake telemetry framework to capture log messages and trace events into an event table.
Container runtimes capture logs that your app emits to standard output and standard error,
store them in the account’s event table, and provide both live console logs and historical
log views in Snowsight.

Both runtimes store logs in the account-level event table. An account administrator must
set up and configure this event table before logs can be captured. For instructions, see
[Event table overview](../../logging-tracing/event-table-setting-up.md).

* To find the event table configured for your account, run:

  ```sqlexample
  SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
  ```

The following table compares logging and tracing support by runtime:

| Feature | Warehouse runtime | Container runtime |
| --- | --- | --- |
| Event table logging | Supported | Supported |
| Tracing | Supported | Not supported |
| Live console logs in Snowsight | Not supported | Supported |
| Historical logs in Snowsight | Not supported | Supported |

## Container runtime logging

Container-runtime Streamlit apps run inside a Snowpark Container Services container. Snowflake automatically
captures anything your app emits to standard output and standard error and stores it in
the account’s event table. You can view these logs in Snowsight or query them
with SQL.

### Python’s logging module

Use Python’s built-in `logging` module to emit log messages from your app. The following
example configures a logger that writes INFO-level and higher messages to standard output:

```python
import logging
import sys

logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
    stream=sys.stdout,
)

LOGGER = logging.getLogger("my_app")
```

In order from least to most severe, Python has the following logging levels:

* DEBUG
* INFO
* WARNING
* ERROR

Setting the level to INFO captures INFO, WARNING, and ERROR messages but not DEBUG messages.

> **Note:**
>
> By default, Python’s `logging` module writes to standard error (`sys.stderr`).
> Snowflake captures both standard output and standard error, so your logs are captured
> regardless of the stream you use. Setting the stream to `sys.stdout` is optional but
> recommended because standard error is conventionally reserved for error output.

After configuring the logger, you can use it to log messages throughout your app code.
It is common to define a logger in a separate module and then import it into your app code:

```none
source_directory/
├── my_logger.py
├── pyproject.toml
└── streamlit_app.py
```

```python
import streamlit as st
from my_logger import LOGGER

LOGGER.info("Home page loaded")
st.title("My App")

if st.button("Run analysis"):
    LOGGER.info("Analysis button clicked")
    try:
        result = run_analysis()
        LOGGER.info("Analysis completed successfully")
    except Exception as e:
        LOGGER.error("Analysis failed: %s", e)
        st.error("Analysis failed: %s", e)
```

### Live logs in Snowsight

When you edit a container-runtime app in Snowsight, a logs pane appears below
the editor. This pane streams log messages in real time as your app emits them. A short
history of the most recent logs is displayed when you first connect.

Each log entry shows the following information:

| Column | Description |
| --- | --- |
| `Source` | `APP` for logs from your Streamlit process and user-configured loggers, or `MANAGER` for logs from the system process that manages the container. |
| `Level` | The severity level of the log message (DEBUG, INFO, WARNING, ERROR). |
| `Message` | The log message content. |

### Available live-log actions

In the upper-right corner of the logs pane, you can search and filter the logs to help you find the
information you need. This includes text search, filtering by source, and filtering by severity level.
In the three-dot menu, you can download the current logs, navigate to the historical logs, or clear the
live-logging pane. When you clear the pane, the current logs are deleted from your current view but not
from the event table. Immediately reloading the page restores the most recent logs.

### Understanding log sources

Logs from container-runtime apps have one of two sources:

* `MANAGER`: The system process inside the container that prepares and runs your app.
  Manager logs include messages about downloading your app files from the stage,
  installing Python dependencies, and starting the Streamlit server process. If you
  update your app’s dependency files while the app is running, the manager process
  reinstalls dependencies and produces additional manager logs.
* `APP`: Logs from the running Streamlit server process. This includes messages from
  your user-configured Python loggers, Streamlit’s built-in logger, and any other output
  your app writes to standard output or standard error.

The boundary between sources is the `streamlit run` command. Everything the container does
before starting the Streamlit process produces `MANAGER` logs. After the Streamlit process
starts, output from that process produces `APP` logs.

### View historical logs in Snowsight

The following steps only apply to container-runtime apps. Warehouse runtime apps
don’t have a logs pane.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your app.
3. In the upper-right corner of the page, select Edit.
4. In the upper-right corner of the logs pane, select the three-dot menu (Other actions) » Historical logs.

This opens the Snowpark Container Services monitoring page for the service that runs behind your app. The logs
table shows the following columns:

| Column | Description |
| --- | --- |
| Timestamp | The timestamp of the log message. |
| Instance ID | The identifier for the container instance. This is always `0` for Streamlit apps. |
| Container | The identifier for the container instance. |
| Stream | Whether the log was emitted to standard output (`stdout`) or standard error (`stderr`). |
| Value | The JSON-formatted log message that includes `"level"`, `"message"`, `"source"`, and `"timestamp"` fields. |

For more information about the monitoring page, see
[Snowpark Container Services: Monitoring Services](../../snowpark-container-services/monitoring-services.md).

### Query logs with SQL

You can query the event table directly to analyze your container-runtime app’s logs.
The following query retrieves logs from a specific Streamlit app:

```sqlexample
SELECT
    TIMESTAMP,
    RECORD['severity_text']::VARCHAR AS level,
    VALUE::VARCHAR AS message,
    RESOURCE_ATTRIBUTES['snow.database.name']::VARCHAR AS database_name,
    RESOURCE_ATTRIBUTES['snow.schema.name']::VARCHAR AS schema_name,
    RESOURCE_ATTRIBUTES['snow.executable.name']::VARCHAR AS app_name,
    RECORD_ATTRIBUTES['log.iostream']::VARCHAR AS stream
FROM <event_table>
WHERE RESOURCE_ATTRIBUTES['snow.database.name'] = '<database_name>'
  AND RESOURCE_ATTRIBUTES['snow.schema.name'] = '<schema_name>'
  AND RESOURCE_ATTRIBUTES['snow.executable.name'] = '<app_name>'
  AND RECORD_TYPE = 'LOG'
  AND TIMESTAMP > DATEADD(hour, -1, CURRENT_TIMESTAMP())
ORDER BY TIMESTAMP DESC
LIMIT 100;
```

Replace `<event_table>` with the event table name returned by the SHOW PARAMETERS command,
and replace `<database_name>`, `<schema_name>`, and `<app_name>` with the values for
your Streamlit app.

> **Tip:**
>
> Include a TIMESTAMP filter in your event table queries to improve performance.
> Event tables can contain a large volume of data from various Snowflake components.

For more information about the event table columns, see
[Event table columns](../../logging-tracing/event-table-columns.md).

## Warehouse runtime logging

For Streamlit apps using warehouse runtimes, you can capture log messages and trace
events of your Streamlit app code as it runs and then analyze the results with SQL,
for example, to analyze errors. For more information, see [Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

Warehouse runtimes require log and trace levels to be set on the database containing your app:

```sqlexample
-- Set the log level for the database containing your app
ALTER DATABASE <database_name> SET LOG_LEVEL = INFO;

-- Set the trace level for the database containing your app
ALTER DATABASE <database_name> SET TRACE_LEVEL = ON_EVENT;
```

### Example: Logging from a warehouse-runtime app

```python
import logging
import streamlit as st

logger = logging.getLogger("simple_logger")

# Write directly to the app
st.title("Simple Logging Example")

# Get the current credentials
session = st.connection('snowflake').session()

def get_log_messages_query() -> str:
    return """
            SELECT
                TIMESTAMP,
                RECORD:"severity_text"::VARCHAR AS SEVERITY,
                RESOURCE_ATTRIBUTES:"db.user"::VARCHAR AS USER,
                VALUE::VARCHAR AS VALUE
            FROM
                SAMPLE_EVENTS
            WHERE
                SCOPE:"name" = 'simple_logger'
            ORDER BY
                TIMESTAMP DESC;
            """

button = st.button("Log a message")

if button:
    try:
        logger.info("Logging an info message through Streamlit App.")
        st.success('Logged a message')
    except Exception as e:
        logger.error("Logging an error message through Streamlit App: %s",e)
        st.error('Logged an error')

sql = get_log_messages_query()

df = session.sql(sql).to_pandas()

with st.expander("**Show All Messages**"):
     st.dataframe(df, use_container_width=True)
```

## Tracing (warehouse runtimes only)

Tracing is supported for warehouse runtimes only. You can emit trace events from your
Streamlit app and then query the event table to analyze them.

> **Note:**
>
> The following example requires installing the `snowflake-telemetry-python` package.
> For more information, see [Adding support for the telemetry package](../../logging-tracing/tracing-python.md).

```python
import streamlit as st
import time
import random
from snowflake import telemetry

def sleep_function() -> int:
    random_time = random.randint(1, 10)
    time.sleep(random_time)
    return random_time

def get_trace_messages_query() -> str:
    return """
            SELECT
                TIMESTAMP,
                RESOURCE_ATTRIBUTES :"db.user" :: VARCHAR AS USER,
                RECORD_TYPE,
                RECORD_ATTRIBUTES
            FROM
                SAMPLE_EVENTS
            WHERE
                RECORD :"name" :: VARCHAR = 'tracing_some_data'
                OR RECORD_ATTRIBUTES :"logging_demo.tracing" :: VARCHAR = 'begin_span'
            ORDER BY
                TIMESTAMP DESC;
            """

def trace_message() -> None:
    execution_time = sleep_function()
    telemetry.set_span_attribute("logging_demo.tracing", "begin_span")
    telemetry.add_event(
        "tracing_some_data",
        {"function_name": "sleep_function", "execution_time": execution_time},
    )

# Write directly to the app
st.title("Simple Tracing Example")

# Get the current credentials
session = st.connection('snowflake').session()

button = st.button("Add trace event")

if button:
    with st.spinner("Executing function..."):
        trace_message()
        st.toast("Successfully log a trace message!", icon="✅")

sql = get_trace_messages_query()

df = session.sql(sql).to_pandas()

with st.expander("**Show All Trace Messages**"):
     st.dataframe(df, use_container_width=True)
```

---
title: Manage dependencies for your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/dependency-management.md
section: Streamlit in Snowflake
---

# Manage dependencies for your Streamlit app

By default, Streamlit in Snowflake environments come with Python, Streamlit, and Snowflake Snowpark installed.
How you manage your app’s dependencies differs based on the runtime environment you choose:

* Container runtimes manage packages with [uv](https://docs.astral.sh/uv/). You can specify
  dependencies in a `pyproject.toml` (recommended) or `requirements.txt` file. By default, your
  app doesn’t have access to a package index like PyPI. Therefore, if you want to edit or specify the
  versions of your app’s dependencies, you must create an external access integration (EAI).
  Additionally, you can install packages from wheel files included in your project directory.
* Warehouse runtimes manage packages with [conda](https://docs.conda.io/en/latest/). You can
  specify dependencies using an `environment.yml` file or the built-in package picker
  in Snowsight. You can only install packages from the
  [Snowflake Anaconda Channel](https://repo.anaconda.com/pkgs/snowflake/).

To learn how to add or edit files in your deployed app, see
[Edit your Streamlit app](editing-your-app.md).

| Supported dependency sources | Warehouse runtime | Container runtime |
| --- | --- | --- |
| PyPI or other external [“simple”](https://peps.python.org/pep-0503/) package indexes | No | Yes (with EAI) |
| Snowflake Anaconda Channel | Yes, with limitations on Streamlit versions | No |
| Internal stage | No | Yes, but only via relative paths within the app’s source files |
| Snowflake Artifact Repository (`snowflake.snowpark.pypi_shared_repository`) | No | No |

## Supported versions of Python

Newly created Streamlit in Snowflake apps run in Python 3.11 by default.

* For container runtimes, Python 3.11 is the only currently supported version.
* For warehouse runtimes, you can choose Python 3.9, 3.10, or 3.11.

## Supported versions of Streamlit

Newly created Streamlit in Snowflake apps use the latest supported version of Streamlit available in
their runtime environment. When a new version of Streamlit is released, there
might be a delay before the new version becomes the default.

* For container runtimes, the minimum required version of Streamlit is 1.50. You can
  use any later version of Streamlit, including `streamlit-nightly` versions.

  > **Important:**
  >
  > `streamlit-nightly` versions are experimental. For more information, see
  > [Nightly releases](https://docs.streamlit.io/develop/quick-reference/prerelease#nightly-releases)
  > in the Streamlit documentation.

  You can immediately use the latest Streamlit version by installing it from a package index.
* For warehouse runtimes, you are limited to a subset of versions starting from 1.22.0.
  `streamlit-nightly` versions aren’t supported.

  It’s not possible to immediately use the latest Streamlit version in a warehouse runtime.

To prevent unexpected package upgrades, configure your app’s dependencies as
described on this page.

### Supported versions of the Streamlit library in warehouse runtimes

Streamlit in Snowflake supports the following versions of the Streamlit open-source library:

* 1.52.2
* 1.52.1
* 1.52.0
* 1.51.0
* 1.50.0
* 1.49.1
* 1.48.0
* 1.47.0
* 1.46.1
* 1.45.1
* 1.45.0
* 1.44.1
* 1.44.0
* 1.42.0
* 1.39.0
* 1.35.0
* 1.31.1
* 1.29.0
* 1.26.0
* 1.22.0

## Non-Python dependencies

Some Python packages require non-Python system libraries to be installed in the
runtime environment. For example, the `Pillow` package requires libraries for
handling different image formats.

* For non-Python dependencies in container runtimes, you can only use the pre-installed
  system libraries. Installing additional non-Python dependencies isn’t supported yet.
* For non-Python dependencies in warehouse runtimes, some system libraries are available
  in the Snowflake Anaconda Channel.

## Best practices for declaring dependencies

When declaring your app’s dependencies, consider the following best practices:

* Pin critical package versions.

  + For container runtimes, use the `==` operator in `pyproject.toml` or `requirements.txt` files.
  + For warehouse runtimes, use the `=` operator in `environment.yml` files.
* Use version ranges for flexibility.

  + For container runtimes, use the `<`, `<=`, `>=`, and `>` operators in `pyproject.toml` or `requirements.txt` files.
  + For warehouse runtimes, use `*` wildcard suffixes in `environment.yml` files.
* Keep dependency lists minimal to reduce build time.
* Test dependency changes in development before deploying.
* Ensure your dependencies are compatible with the Python version in your runtime.

When migrating between runtimes or changing your package manager,
review your dependency names. For example, some packages have different names between
Conda and PyPI:

| Package | Conda Name | PyPI Name |
| --- | --- | --- |
| Pillow | `pillow` | `Pillow` |
| OpenCV | `opencv` | `opencv-python` |
| PyYAML | `pyyaml` | `PyYAML` |

## Managing dependencies for container runtimes

Container-runtime apps require an external access integration (EAI) to install packages
from an external package index like PyPI. Without an EAI, you can only use packages
shipped with the runtime or included in your app’s source files.

Even if you only want to specify the version of Streamlit, you must include an EAI with
your app. Without an EAI, if you attempt to use version specifiers on pre-installed packages,
you might encounter an error when the runtime base image is updated. This is because your
version specifier might no longer be compatible with the pre-installed packages.

### External access integrations for container runtimes

For a general overview of external access integrations (EAIs), see [External network access overview](../../external-network-access/external-network-access-overview.md).

#### PyPI EAI

PyPI is the default package index used by uv to install Python packages in your container runtime.
Snowflake provides a managed network rule that simplifies creating an EAI for PyPI. Your account
administrator can use this rule to create a PyPI EAI and grant your role access to it. For setup
instructions, see [Set up a PyPI EAI for app developers](../object-management/security.md).

If you need to use a private or authenticated package repository such as JFrog Artifactory, your
administrator must create a custom EAI with the appropriate network rule and authentication secrets.
For an example, see [Example: Authenticate to a private JFrog Artifactory repository](secrets-and-configuration.md).

After your administrator has created a PyPI EAI and granted your role USAGE on it, you need to add
it to your Streamlit object. You can do this in Snowsight or with SQL:

SnowsightSQL

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your app.
3. In the upper-right corner, select  (more options) » App settings.
4. In the App settings dialog, select the External networks tab.
5. From the list of available EAIs, select the EAI for PyPI.
6. To save the change and close the dialog, select Save.

Replace `pypi_access_integration` with the name of your PyPI EAI and run the following SQL command:

```sqlexample
ALTER STREAMLIT my_app SET
  EXTERNAL_ACCESS_INTEGRATIONS = (pypi_access_integration);
```

### Dependency files

Container runtimes use uv for fast, reliable dependency resolution. uv works
like pip to install Python packages, but it’s more performant and customizable. For more information
about uv’s features, see the [Features](https://docs.astral.sh/uv/getting-started/features/) overview
in the uv documentation.

Container runtimes search for dependency files in the same directory as your app’s entrypoint file. If no dependency files
are found, the search continues up the directory tree until reaching the root of your app’s source location. The first
dependency file found is used to install your app’s dependencies.

When multiple dependency files exist in the same directory, they are used in the following order of precedence:

* `requirements.txt`: Lists the Python packages and versions required by your Streamlit app,
  including Streamlit itself. You can’t configure your Python version with `requirements.txt`.

  For more information about the format of `requirements.txt`, see
  [Requirements File Format](https://pip.pypa.io/en/stable/reference/requirements-file-format/) in the pip documentation.
* `pyproject.toml` (recommended): Manages your Python version and dependencies. Currently, only
  Python version 3.11 is supported. When you provide a `pyproject.toml` file, uv will generate
  a `uv.lock` file to lock your dependency versions. This lock file will be updated whenever you update
  your dependencies. You must use `pyproject.toml` if you want to use a different package index than PyPI.

  For more information about the format of `pyproject.toml`, see [Writing your pyproject.toml](https://packaging.python.org/en/latest/guides/writing-pyproject-toml/)
  in the Python documentation.

`requirements.txt` is the simplest way to declare your app’s dependencies
and is provided for the convenience of getting started. However, for more advanced
dependency management, Snowflake recommends using `pyproject.toml` instead.
For example, this lets you lock dependency versions to ensure that your builds are reproducible.

> **Tip:**
>
> * You can install a package from any URL if you have the necessary EAI assigned to your app. URLs requiring
>   authentication must support embedded credentials.
> * You can install a package from within your project directory by using a relative path from the
>   dependency file to a wheel file.
> * If you use version specifiers on pre-installed packages, you must have an EAI to a package index to avoid
>   errors when the runtime base image is updated.
> * In your local project directory with uv installed, you can run `uv init --bare` to generate a minimal
>   `pyproject.toml` file to edit.

Commonly, your entrypoint file and dependency file will be in the root of your project directory.
However, your entrypoint file can be in a subdirectory and your dependency file can be in the same
directory or any parent up to the root of your project.

For example, your project directory might have one of the following structures:

```none
source_directory/
├── requirements.txt
└── streamlit_app.py
```

```none
source_directory/
├── pyproject.toml
├── streamlit_app.py
└── uv.lock
```

```none
source_directory/
├── pyproject.toml
├── subdirectory/
│   └── streamlit_app.py
└── uv.lock
```

```none
source_directory/
└── subdirectory/
    ├── pyproject.toml
    ├── streamlit_app.py
    └── uv.lock
```

> **Note:**
>
> The container runtime will use the directory containing the dependency file as its working directory
> for uv. Therefore, if you use a relative path to install a package from among your app source files,
> the path should be relative to the dependency file location. For more information about declaring
> package sources, see [Dependency sources](https://docs.astral.sh/uv/concepts/projects/dependencies/#dependency-sources)
> in the uv documentation.

#### PyPI dependency file examples

Your `pyproject.toml` file must include a `name` and `version` to be in a valid format
for uv, but their values can be arbitrary. Use `requires-python` to set your Python
version, even though container runtimes only support Python 3.11 for now. Use `dependencies`
to list your Python packages for your container runtime.

> **Tip:**
>
> Install Streamlit as `streamlit[snowflake]` to include its Snowflake connector
> dependencies (`snowflake-snowpark-python`).

If you have an EAI for PyPI, the following `pyproject.toml` file declares
a minimum Python version of 3.11 and includes five Python packages which will be
installed from PyPI:

```toml
[project]
name = "my-streamlit-app"
version = "0.1.0"
requires-python = ">=3.11"
dependencies = [
    "streamlit[snowflake]==1.50.0",
    "pandas>=2.0.0",
    "plotly>5.0.0",
    "requests>2.0.0,<3.0.0"
]
```

As an alternative to `pyproject.toml`, you can use a `requirements.txt` file
to declare your app’s dependencies. The following `requirements.txt` contains the
same Python packages as the previous `pyproject.toml` example:

```text
streamlit[snowflake]==1.50.0
pandas>=2.0.0
plotly>5.0.0
requests>2.0.0,<3.0.0
```

> **Note:**
>
> To pin a version of a package, you must use the `==` operator. To specify a version range,
> you must use `<`, `<=`, `>=`, and `>` operators. For example, `pandas>=2.0.0,<3.0.0` will install
> any version between 2.0.0 and 2.99.99. For more information, see [Dependency specifiers](https://packaging.python.org/en/latest/specifications/dependency-specifiers/).

#### JFrog dependency file examples

For added security, your system administrator may require you to use a curated or private
package index like JFrog Artifactory. This is an exclusive feature for container runtimes.
With JFrog, you can create a public or private package index that proxies PyPI or hosts
custom packages. This allows you to control which packages and versions are available to
your Streamlit apps.

To specify a package index, you must use `pyproject.toml`. For more information, see
[Using alternative package indexes](https://docs.astral.sh/uv/guides/integration/alternative-indexes/)
in the uv documentation.

The following `pyproject.toml` file declares a minimum Python version of 3.11,
includes five Python packages, and specifies JFrog as the package index that proxies PyPI:

```toml
[project]
name = "my-streamlit-app"
version = "0.1.0"
requires-python = ">=3.11"
dependencies = [
    "streamlit[snowflake]==1.50.0",
    "pandas>=2.0.0",
    "plotly>=5.0.0",
    "requests>2.0.0,<3.0.0"
]

[[tool.uv.index]]
name = "jfrog"
url = "<server_name>.jfrog.io/artifactory/api/pypi/<repository_key>/simple"
default = true
```

If your JFrog repository requires authentication, generate a personal access
token or get a scoped token from your JFrog system administrator. Then, include the
token in the URL. Don’t use your JFrog password in the URL. In this case, the `[[tool.uv.index]]`
table in the previous example would be replaced with the following:

```toml
[[tool.uv.index]]
name = "jfrog"
url = "https://<username>:<access_token>@<server_name>.jfrog.io/artifactory/api/pypi/<repository_key>/simple"
default = true
```

## Managing dependencies for warehouse runtimes

Warehouse runtimes use conda to manage your app’s dependencies. You can declare
your dependencies using an `environment.yml` file or the built-in package picker
in Snowsight. Dependencies are installed from the
[Snowflake Anaconda Channel](https://repo.anaconda.com/pkgs/snowflake/),
which includes both Python packages and some non-Python system libraries.

The Snowflake Anaconda Channel contains more versions of Streamlit than are supported
in Streamlit in Snowflake warehouse runtimes. To avoid compatibility issues, only use versions of
Streamlit that are listed in Supported versions of the Streamlit library in warehouse runtimes. Otherwise,
you may install any other package available in the Snowflake Anaconda Channel.

### `environment.yml` file

To install dependencies in your warehouse runtime environment using an `environment.yml`
file, create or edit the file in the root of your app’s source location. If you don’t provide an
`environment.yml` file, Snowflake uses only the pre-installed packages for your selected
environment. For more information about the structure of `environment.yml`, see the
[conda documentation](https://docs.conda.io/projects/conda/en/latest/user-guide/tasks/manage-environments.html#creating-an-environment-file-manually).

The following limitations apply when using `environment.yml` files in Streamlit in Snowflake warehouse runtimes:

* You can only use the Snowflake Anaconda Channel to install packages.
* You can only use the Streamlit versions listed in Supported versions of the Streamlit library in warehouse runtimes.
* You can’t declare `pip` packages in the `dependencies` section, including relative paths to local packages.

The following `environment.yml` declares Python 3.11 and five Python packages:

```yaml
name: my-streamlit-app
channels:
  - snowflake
dependencies:
  - python=3.11
  - streamlit=1.50.0
  - pandas=2.*
  - plotly=5.0.*
  - requests
  - snowflake-snowpark-python
```

Snowflake recommends pinning a version of Streamlit to prevent the app from being upgraded when a new version
of Streamlit becomes available in the Snowflake Anaconda Channel.

> **Note:**
>
> To pin a version of a package, you must use the `=` operator. To specify a version range,
> you must use `*` wildcards. For example, `pandas=2.*` will install
> any version of pandas between 2.0.0 and 2.99.99.

### Local development with conda

When developing your warehouse-runtime app locally with conda, you must include
additional details in your `environment.yml` file to ensure the dependencies
are installed correctly.

* Identify the Snowflake Anaconda Channel by its URL: `https://repo.anaconda.com/pkgs/snowflake`.
* Block the default channel.

In your `environment.yml` file, use the following two channels:

```yaml
channels:
  - https://repo.anaconda.com/pkgs/snowflake
  - nodefaults
```

If `defaults` appears in your `~/.condarc` file, comment it out:

```yaml
channels:
  # - defaults
```

### Snowsight package picker

Besides editing the `environment.yml` file directly for your warehouse-runtime app, you can also
use the built-in package picker in Snowsight to add or remove packages from your
app’s environment. The package picker is only available for apps using warehouse runtimes.
Additionally, the package picker only displays packages compatible with the current Python
version of your app. Some system libraries that are independent of Python version might not
be shown in the package picker and must be added manually to `environment.yml`.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then select your Streamlit app.
3. In the upper-right corner, select Edit.
4. In the upper-left corner of the editor pane, select Packages.

   A drop-down pane appears with the Anaconda Packages tab selected.
5. Do any of the following actions:

> * To set the Python version, in the Python version selector, choose the desired version.
> * To add a package, use the search bar to find packages by name, then select the desired package.
> * To remove a package, in the Installed Packages section, select the x icon
>   to the right of the package version.
> * To set the version of an installed package, in the Installed Packages section,
>   use the version selector next to the package name.
>
> Snowflake updates your `environment.yml` file automatically and reboots your app.
> If you have the `environment.yml` file open in the editor, refresh the page to
> see the changes.

---
title: Manage secrets and configure your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/secrets-and-configuration.md
section: Streamlit in Snowflake
---

# Manage secrets and configure your Streamlit app

Streamlit apps often need to access sensitive information such as API keys, passwords, and other credentials. How you manage
secrets in your Streamlit app depends on the runtime environment you’re using. Streamlit in Snowflake provides secure, built-in mechanisms
for accessing secrets in both warehouse and container runtimes. For Streamlit configuration, each runtime has
different restrictions, too.

In the Streamlit library, apps use a `.streamlit/` directory to store configuration and secrets:

* `.streamlit/config.toml`: Customizes app settings such as theme, layout, and server behavior.
* `.streamlit/secrets.toml`: Stores sensitive information like API keys and credentials (in local development).

Streamlit in Snowflake supports these files with some limitations depending on your runtime environment. The following table
summarizes the support for these files in warehouse and container runtimes:

| Feature | Warehouse runtime | Container runtime |
| --- | --- | --- |
| `config.toml` support | Limited subset of configuration options | Broader subset of configuration options |
| `secrets.toml` support | Not supported | Supported, but only recommended for non-secret environment variables |

For `secrets.toml`, Streamlit in Snowflake provides a more secure, built-in secrets management system that is recommended
for managing sensitive information. The following sections describe how to use Snowflake secrets in your apps.

## Managing your connection to Snowflake

To manage your connection to Snowflake, you can use [`st.connection("snowflake")`](https://docs.streamlit.io/develop/api-reference/connections/st.connections.snowflakeconnection). This allows you to connect to Snowflake from both
your local development environment and your deployed app.

```python
import streamlit as st

conn = st.connection("snowflake")
session = conn.session()

session.sql("SELECT 1").collect()
```

In warehouse runtimes, you can also use Snowpark’s [`get_active_session()`](/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.context.get_active_session) function to get the active session.

```python
import streamlit as st
from snowflake.snowpark.context import get_active_session

# ONLY IN WAREHOUSE RUNTIMES
session = get_active_session()
session.sql("SELECT 1").collect()
```

> **Important:**
>
> `get_active_session()` isn’t thread-safe and can’t be used in container runtimes.

## Secrets in container runtimes

You can use [`st.secrets`](https://docs.streamlit.io/develop/api-reference/connections/st.secrets) to access Snowflake secrets in your container runtime Streamlit in Snowflake apps. This allows you to
securely store and retrieve sensitive information such as API keys, credentials, and other configuration values.
Just like Streamlit does for `.streamlit/secrets.toml` in local development, Streamlit in Snowflake populates
secrets to environment variables, too.

> **Note:**
>
> Container runtimes don’t have access to the `_snowflake` module. If you are migrating an older
> warehouse-runtime app that uses `_snowflake` secret functions, replace those calls with
> [`st.secrets`](https://docs.streamlit.io/develop/api-reference/connections/st.secrets) as described in this section.

### Access a secret in a container runtime

1. Stage the following Python file in `@my_stage/app_folder/streamlit_app.py`. For information about staging files,
   see [Staging files using Snowsight](../../../user-guide/data-load-local-file-system-stage-ui.md).

   ```python
   import streamlit as st

   secret_value = st.secrets["my_secret_name"]
   ```
2. Create a secret in your Snowflake account:

   ```sqlexample
   CREATE OR REPLACE SECRET my_secret
     TYPE = GENERIC_STRING
     SECRET_STRING = 'my_secret_value';
   ```

   For more information, see [CREATE SECRET](../../../sql-reference/sql/create-secret.md).
3. Create an external access integration (EAI), and assign the secret to it:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION my_eai
     ALLOWED_AUTHENTICATION_SECRETS = (my_secret)
     ENABLED = TRUE;
   ```
4. Create your Streamlit app to reference the secret using the SECRETS parameter:

   ```sqlexample
   CREATE STREAMLIT my_container_app
     FROM '@my_stage/app_folder'
     MAIN_FILE = 'streamlit_app.py'
     RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
     COMPUTE_POOL = my_compute_pool
     QUERY_WAREHOUSE = my_warehouse
     EXTERNAL_ACCESS_INTEGRATIONS = (my_eai)
     SECRETS = ('my_secret_name' = my_secret);

   ALTER STREAMLIT my_container_app ADD LIVE VERSION FROM LAST;
   ```

   > **Note:**
   >
   > You must assign both the EAI and the secret to the Streamlit object.
   > You can’t assign a secret to a Streamlit object by itself.

   Because the generic-string secret `my_secret` is associated to the string `"my_secret_name"` in the SECRETS parameter, you can access
   the secret in your Streamlit app code using `st.secrets["my_secret_name"]`.

### Supported secret types and environment variables

Container runtimes support generic string and basic authentication secrets. In addition to mapping
secrets to `st.secrets`, Streamlit in Snowflake also maps secrets to environment variables. Environment variable
names are case-sensitive. For basic authentication secrets, two environment variables are created:
one for the username (`_USERNAME` suffix), and one for the password (`_PASSWORD` suffix).

| Secret type | `st.secrets` access | Environment variable access |
| --- | --- | --- |
| Generic string | `st.secrets["my_secret_name"]` | `os.environ["my_secret_name"]` |
| Basic authentication (username) | `st.secrets["my_secret_name"]["username"]` | `os.environ["my_secret_name_USERNAME"]` |
| Basic authentication (password) | `st.secrets["my_secret_name"]["password"]` | `os.environ["my_secret_name_PASSWORD"]` |

> **Note:**
>
> Cloud provider, symmetric key, and OAuth secret types aren’t currently supported.

#### Generic string secrets

Generic string secrets are stored as top-level keys in `st.secrets`:

```sqlexample
ALTER STREAMLIT my_container_app
  SET SECRETS = ('my_generic_secret_name' = my_generic_secret);
```

You can access the secret using dictionary or attribute notation:

```python
import streamlit as st

api_key = st.secrets["my_generic_secret_name"]
api_key = st.secrets.my_generic_secret_name
```

#### Basic authentication secrets

Basic authentication secrets are stored as dict-like objects with `"username"` and `"password"` attributes:

```sqlexample
ALTER STREAMLIT my_container_app
  SET SECRETS = ('my_basic_auth_secret_name' = my_basic_auth_secret);
```

You can access the secret using dictionary or attribute notation:

```python
import streamlit as st

username = st.secrets["my_basic_auth_secret_name"]["username"]
password = st.secrets["my_basic_auth_secret_name"]["password"]

username = st.secrets.my_basic_auth_secret_name.username
password = st.secrets.my_basic_auth_secret_name.password
```

### Secrets for authenticated package repositories

Secrets are automatically exposed as environment variables. In particular, this enables authentication
with private package repositories like JFrog Artifactory.

For most authenticated package repositories, use a basic authentication secret. The secret is automatically
converted to environment variables with `_USERNAME` and `_PASSWORD` suffixes. If you need a different
naming convention, use generic string secrets, and set the name of each environment variable manually. For more
information about the environment variables that uv uses, see
[Package indexes](https://docs.astral.sh/uv/concepts/indexes/#providing-credentials-directly)
in the uv documentation.

#### Example: Authenticate to a private JFrog Artifactory repository

1. Stage your app’s source files in `@my_stage/app_folder`. Your app’s source files must include a
   `pyproject.toml` file that configures the private package index in the `[[tool.uv.index]]` table:

   ```toml
   [[tool.uv.index]]
   name = "my_jfrog_repo"
   url = "https://my-org.jfrog.io/artifactory/api/pypi/pypi-local/simple"
   ```

   For more information about declaring your app’s dependencies in a `pyproject.toml` file,
   see [Manage dependencies for your Streamlit app](dependency-management.md).
2. Create a basic authentication secret with your JFrog credentials:

   ```sqlexample
   CREATE OR REPLACE SECRET jfrog_creds
     TYPE = PASSWORD
     USERNAME = 'my_username'
     PASSWORD = 'my_api_token';
   ```
3. Create an external access integration for your private repository:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE jfrog_network_rule
     TYPE = HOST_PORT
     MODE = EGRESS
     VALUE_LIST = ('my-org.jfrog.io');

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION jfrog_eai
     ALLOWED_NETWORK_RULES = (jfrog_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (jfrog_creds)
     ENABLED = TRUE;
   ```

   > **Note:**
   >
   > To avoid a DNS error, you might need to include the cloud provider for your repository
   > in the network rule value list. For example, if your repository is on AWS, you might need
   > the following value list in your network rule:
   >
   > ```sqlexample
   > VALUE_LIST = ('my-org.jfrog.io', '<jfrog-server-name>.s3.amazonaws.com');
   > ```
4. Attach the EAI and secret to your Streamlit app:

   ```sqlexample
   CREATE STREAMLIT my_app
     FROM '@my_stage/app_folder'
     MAIN_FILE = 'streamlit_app.py'
     RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
     COMPUTE_POOL = my_compute_pool
     QUERY_WAREHOUSE = my_warehouse
     EXTERNAL_ACCESS_INTEGRATIONS = (jfrog_eai)
     SECRETS = ('UV_INDEX_MY_JFROG_REPO' = jfrog_creds);

   ALTER STREAMLIT my_app ADD LIVE VERSION FROM LAST;
   ```

   Because the basic authentication secret `jfrog_creds` is associated to the string `"UV_INDEX_MY_JFROG_REPO"` in the
   SECRETS parameter, the runtime automatically injects the `UV_INDEX_MY_JFROG_REPO_USERNAME` and `UV_INDEX_MY_JFROG_REPO_PASSWORD` environment variables
   as required by uv.

### Precedence of a local `.streamlit/secrets.toml` file

You can combine Snowflake-managed secrets with a local `.streamlit/secrets.toml` file in your app’s source directory.
When both are present, the Streamlit library merges them. The locally defined `.streamlit/secrets.toml` file takes precedence over
the Snowflake-managed secrets.

Because `.streamlit/secrets.toml` is stored as plain text in your staged files, it is not a security best practice
to store actual secrets in it. Use Snowflake’s built-in secrets management for sensitive credentials. Use the locally defined
`.streamlit/secrets.toml` file to store non-sensitive configuration values or environment-specific settings.

### Remove or change your Streamlit app’s secrets

* To remove all secrets from a Streamlit in Snowflake app, use the UNSET SECRETS clause with [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md):

  ```sqlexample
  ALTER STREAMLIT my_database.my_schema.my_app
    UNSET SECRETS;
  ```

  This removes all secret associations from the Streamlit in Snowflake app. The underlying secret objects remain in your Snowflake account
  and can be reassigned later. To also remove any EAI associations, unset the EXTERNAL_ACCESS_INTEGRATIONS property, too.
* To update or modify which secrets are attached, use [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) with SET SECRETS:

  ```sqlexample
  ALTER STREAMLIT my_database.my_schema.my_app
    SET SECRETS = ('new_secret' = my_new_secret);
  ```

### Example: Create a container-runtime Streamlit app with an authenticated external API

This example demonstrates creating a Streamlit in Snowflake app that calls an external API using a secret API key.

1. Stage the following Python file in `@my_stage/weather_app/streamlit_app.py`:

   ```python
   import streamlit as st
   import requests

   api_key = st.secrets["weather_api_name"]

   response = requests.get(
       "https://api.weather.com/v1/current",
       headers={"Authorization": f"Bearer {api_key}"}
   )

   st.write(response.json())
   ```

   Because `requests` is a dependency of `streamlit`, it is included in the runtime base image. Therefore,
   the runtime automatically installs it, even if you don’t include a dependencies file or configure a package index.
2. Create the secret, network rule, and EAI:

   ```sqlexample
   CREATE OR REPLACE SECRET weather_api_key
     TYPE = GENERIC_STRING
     SECRET_STRING = 'secret_value';

   CREATE OR REPLACE NETWORK RULE weather_api_rule
     TYPE = HOST_PORT
     MODE = EGRESS
     VALUE_LIST = ('api.weather.com');

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION weather_eai
     ALLOWED_NETWORK_RULES = (weather_api_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (weather_api_key)
     ENABLED = TRUE;
   ```
3. Create the Streamlit object:

   ```sqlexample
   CREATE STREAMLIT weather_app
     FROM '@my_stage/weather_app'
     MAIN_FILE = 'streamlit_app.py'
     RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
     COMPUTE_POOL = my_compute_pool
     QUERY_WAREHOUSE = my_warehouse
     EXTERNAL_ACCESS_INTEGRATIONS = (weather_eai)
     SECRETS = ('weather_api_name' = weather_api_key);

   ALTER STREAMLIT weather_app ADD LIVE VERSION FROM LAST;
   ```

### Calling a Cortex Agent in a container runtime

To call a Cortex Agent in a container-runtime app, read the session token from the
underlying Snowpark Container Services container and then use the `requests` library. This is the
recommended replacement for `_snowflake.send_snow_api_request()`.

```python
import requests
import json
import os

SNOWFLAKE_HOST = os.getenv("SNOWFLAKE_HOST")
SNOWFLAKE_ACCOUNT = os.getenv("SNOWFLAKE_ACCOUNT")
ANALYST_ENDPOINT = "/api/v2/cortex/analyst/message"
URL = "https://" + SNOWFLAKE_HOST + ANALYST_ENDPOINT

def get_token() -> str:
    """Read the oauth token embedded into SPCS container"""
    return open("/snowflake/session/token", "r").read()

def send_request(semantic_model_file, prompt):
    """Sends the prompt using the semantic model file """
    headers = {
        "Content-Type": "application/json",
        "accept": "application/json",
        "Authorization": f"Bearer {get_token()}",
        "X-Snowflake-Authorization-Token-Type": "OAUTH"
    }
    request_body = {
        "messages": [
            {
                "role": "user",
                "content": [{"type": "text", "text": prompt}],
            }
        ],
        "semantic_model_file": semantic_model_file,
    }
    return requests.post(URL, headers=headers, data=json.dumps(request_body))
```

## Secrets in warehouse runtimes

In warehouse runtimes, you can use the `_snowflake` module to access secrets directly in your Streamlit app code.
Warehouse runtimes inherit access to the `_snowflake` module from stored procedures, which allows you to retrieve
secrets that are referenced in the Streamlit object.

To use secrets in a warehouse runtime:

1. Create a secret object in Snowflake. For more information, see [CREATE SECRET](../../../sql-reference/sql/create-secret.md).

   ```sqlexample
   CREATE OR REPLACE SECRET my_secret
     TYPE = GENERIC_STRING
     SECRET_STRING = 'my_secret_value';
   ```
2. Create an external access integration and assign the secret to it.

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION my_eai
     ALLOWED_AUTHENTICATION_SECRETS = (my_secret)
     ENABLED = TRUE;
   ```
3. Reference the secret in your Streamlit object using the SECRETS parameter:

   ```sqlexample
   ALTER STREAMLIT my_warehouse_app
     SET EXTERNAL_ACCESS_INTEGRATIONS = (my_eai)
     SECRETS = ('my_secret_key' = my_secret);
   ```

   You must assign both the external access integration and the secret to the Streamlit object. You can’t assign a
   secret to a Streamlit object by itself.
4. In your Streamlit app code, import the `_snowflake` module and retrieve the secret:

   ```python
   import streamlit as st
   import _snowflake

   # Retrieve an API key from a generic string secret
   my_secret = _snowflake.get_generic_secret_string('my_secret_key')
   ```

For more information about accessing secrets with the `_snowflake` module, see [Python API for Secret Access](../../external-network-access/secret-api-reference.md).

## Streamlit configuration

Streamlit apps can include a configuration file (`.streamlit/config.toml`). This file allows
you to customize various aspects of your app, such as the theme, layout, and behavior. The configuration
file is written in TOML format. For more information about available configuration options, see the
Streamlit documentation on [`config.toml`](https://docs.streamlit.io/develop/api-reference/configuration/config.toml).

Support for configuration options varies by runtime environment. Container runtimes generally provide
broader support for configuration options than warehouse runtimes, particularly for static serving.
The following table shows which configuration sections are supported in warehouse and container runtimes:

| Configuration section | Warehouse runtime | Container runtime |
| --- | --- | --- |
| `[global]` | Not supported | Limited support (`disableWidgetStateDuplicationWarning`) |
| `[logger]` | Not supported | Not supported |
| `[client]` | Not supported | Limited support (`showErrorDetails`, `showSidebarNavigation`) |
| `[runner]` | Not supported | Supported |
| `[server]` | Not supported | Not supported |
| `[browser]` | Not supported | Not supported |
| `[mapbox]` | Not supported | Supported (deprecated, use environment variables instead) |
| `[theme]` | Supported | Supported |
| `[theme.sidebar]` | Supported | Supported |
| `[secrets]` | Not supported | Supported (but only recommended for non-secret environment variables) |
| `[snowflake.sleep]` | Supported | Not applicable |

For information about using the `[snowflake.sleep]` section to configure sleep timers in warehouse runtimes, see
[Custom sleep timer for a Streamlit app](../features/sleep-timer.md).

The following directory structure shows an example of a Streamlit app with a configuration file:

```none
source_directory/
├── .streamlit/
│   └── config.toml
├── pyproject.toml
├── streamlit_app.py
└── uv.lock
```

---
title: Manage your Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/managing-your-app.md
section: Streamlit in Snowflake
---

# Manage your Streamlit app

This topic describes how to view, rename, and modify properties of a deployed Streamlit in Snowflake app.

## Rename a Streamlit app

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app you want to rename.
4. Select Edit.
5. Select the name of the app in the upper-left corner.
6. Enter the new name in the text box.
7. Click outside the text box to commit the change.

Use the RENAME TO clause of the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command:

```sqlexample
ALTER STREAMLIT my_app RENAME TO my_new_app;
```

Snowflake CLI does not support renaming a deployed app directly. Use SQL or Snowsight
instead.

## Change the query warehouse

You might want to switch to a warehouse with more capacity to handle the queries run by
your app.

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app whose warehouse you want to change.
4. Select the name of the app in the upper-left corner.
5. Select the new warehouse from the dropdown list.

Use the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command to set the QUERY_WAREHOUSE
property:

```sqlexample
ALTER STREAMLIT my_app SET QUERY_WAREHOUSE = my_new_warehouse;
```

Update the `query_warehouse` value in your `snowflake.yml` file and redeploy:

```snowcli
snow streamlit deploy --replace
```

## Change the compute pool

You can change the compute pool for a container-runtime Streamlit app after it’s created.
This has no effect on warehouse-runtime apps.

SnowsightSQL

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app whose compute pool you want to change.
4. Select the three-dots button in the upper-right corner, then select App Settings.
5. Select a new compute pool from the dropdown.
6. Select Save.

Use the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command to set the COMPUTE_POOL
property:

```sqlexample
ALTER STREAMLIT my_app SET COMPUTE_POOL = my_new_pool;
```

## Change the stage or main file

SnowsightSQLSnowflake CLI

Changing the stage or main file is not available from Snowsight. Use SQL
instead.

Use the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command:

To change the stage:

```sqlexample
ALTER STREAMLIT my_app SET ROOT_LOCATION = '@my_db.my_schema.new_stage';
```

To change the main file:

```sqlexample
ALTER STREAMLIT my_app SET MAIN_FILE = 'new_main.py';
```

Update the `main_file` or artifact paths in your `snowflake.yml` and redeploy:

```snowcli
snow streamlit deploy --replace
```

## List Streamlit apps

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.

Snowsight displays all Streamlit apps available to your current role.

Use the [SHOW STREAMLITS](../../../sql-reference/sql/show-streamlits.md) command:

```sqlexample
SHOW STREAMLITS;
```

List deployed Streamlit apps:

```snowcli
snow streamlit list
```

## View app details

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app.

The app details panel shows the app’s database, schema, warehouse, and other properties.

Use the [DESCRIBE STREAMLIT](../../../sql-reference/sql/desc-streamlit.md) command:

```sqlexample
DESC STREAMLIT my_app;
```

Use the `describe` command:

```snowcli
snow streamlit describe my_app
```

## Share a Streamlit app

You can share your Streamlit app with other Snowflake users by granting USAGE privilege
to a role. For more information about sharing options, see
[Sharing Streamlit in Snowflake apps](../features/sharing-streamlit-apps.md).

SnowsightSQLSnowflake CLI

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app you want to share.
4. Select Share.
5. Begin typing the name of the role you want to share your app with, and then select it.
6. Optional: Select Copy to clipboard to copy the app URL.
7. Select Done.

Grant USAGE privilege on the Streamlit object:

```sqlexample
GRANT USAGE ON STREAMLIT my_app TO ROLE viewer_role;
```

Share the app with a role:

```snowcli
snow streamlit share my_app viewer_role
```

---
title: Managing costs for Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/object-management/billing.md
section: Streamlit in Snowflake
---

# Managing costs for Streamlit in Snowflake

This topic describes billing considerations for Streamlit in Snowflake.

## Billing considerations for Streamlit in Snowflake

Streamlit in Snowflake billing is based on the app’s runtime environment and query warehouse. The runtime environment
executes the Python code in your Streamlit app and can be either a container or warehouse. The query
warehouse executes any SQL queries within your app’s code.

### Query warehouse

When your app’s code executes SQL queries, those queries use the app’s query warehouse.
Snowflake automatically resumes and suspends the query warehouse according to its
own AUTO_RESUME and AUTO_SUSPEND values.

### Container runtime

If your Streamlit app uses a container runtime, you are billed for the usage of the
underlying Snowpark Container Services compute pool. In this case, the Streamlit app is a long-running
service. The Streamlit server runs continuously on a node of the compute pool, allowing
viewers to quickly access the app. Concurrent viewers connect to a single Streamlit
server. Only a single app can run on a node in the compute pool; a Streamlit app
takes an entire node.

After three days of viewer inactivity, the Streamlit server process ends and
Snowflake suspends the compute pool according to its own AUTO_SUSPEND value. If you
try to keep a session alive for three days without a new viewer connecting, the app
will probably be suspended. It’s a best practice to move long-running computations to another service.
For more information about compute pool billing, see [Understanding compute cost](../../../user-guide/cost-understanding-compute.md).

### Warehouse runtime

If your app uses a warehouse runtime, Snowflake resumes the app’s code warehouse
when someone visits the app. Each time a viewer connects to the app, a new Streamlit server
process starts in the code warehouse and a WebSocket connection is established. Concurrent
viewers each connect to their own Streamlit server running in the same code warehouse.

A WebSocket connection keeps the code warehouse active and expires approximately 15 minutes
after the associated viewer’s last activity. However, this can be affected by the viewer’s
browser settings and activity. Mouse movement over the app counts as activity and keeps
the WebSocket connection alive. You can change the WebSocket timeout value for your
account by contacting Snowflake Support.

The code warehouse is billed for the time it is active. To conserve credits, you
can do one of the following:

* Manually suspend the app from Snowsight.
* Close all browser tabs running the app, or navigate away from the app. This closes the WebSocket
  connection and allows the warehouse to auto-suspend.
* Set a custom sleep timer for the app. This automatically suspends the warehouse after a specified period
  of inactivity. For more information, see [Custom sleep timer for a Streamlit app](../features/sleep-timer.md).

For guidelines on selecting a warehouse, see [Guidelines for selecting resources in Streamlit in Snowflake](../app-development/runtime-environments.md).

---
title: Migrate your app from ROOT_LOCATION to FROM
source: https://docs.snowflake.com/en/developer-guide/streamlit/migrations-and-upgrades/root-location.md
section: Streamlit in Snowflake
---

# Migrate your app from ROOT_LOCATION to FROM

To convert your Streamlit object, use [CREATE OR REPLACE STREAMLIT](../../../sql-reference/sql/create-streamlit.md)
with the FROM parameter. For simplicity, this procedure assumes you will use a warehouse runtime.
If you want to upgrade to a container runtime, you will need to alter your app code
for compatibility. See the [Migrating between runtime environments](runtime-migration.md) page.

If your app code is compatible with a container runtime, you can modify this procedure by
adding the following parameters to your CREATE OR REPLACE STREAMLIT command:

```sqlexample
RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
COMPUTE_POOL = my_compute_pool
EXTERNAL_ACCESS_INTEGRATIONS = (pypi_access_integration)
```

To migrate your app from ROOT_LOCATION to FROM do the following steps:

1. To identify your app’s current configuration, run the following command:

   ```sqlexample
   DESCRIBE STREAMLIT streamlit_db.streamlit_schema.my_app;
   ```

   Replace `streamlit_db.streamlit_schema.my_app` with your Streamlit object.
2. For use in a later step, open a text editor and note the following values. Sample
   values are shown so you can identify and replace them with your values in later steps:

   | Column | Value |
   | --- | --- |
   | `name` | `my_app` |
   | `title` | `My Streamlit App` |
   | `root_location` | `@db1.schema1/my_app_folder` |
   | `main_file` | `streamlit_app.py` |
   | `query_warehouse` | `my_warehouse` |
   | `user_packages` | `streamlit==1.45.0, pandas==2.2.0` |
   | `import_urls` | `@db2.schema2/packages/package1.zip, @db3.schema3/packages/package2.zip` |
   | `external_access_integration` | `eai_name_1, eai_name_2` |
   | `external_access_secrets` | `secret1, secret2` |

   If your Streamlit object did not return a `root_location` column, your app
   was created using the FROM parameter and doesn’t require conversion.
3. Confirm that entrypoint file is in the root of your app’s source directory. (You can
   skip this step if you will use a container runtime.)

   The entrypoint file is specified by the `main_file` value from the previous step.
   To create an app using FROM, `main_file` must declare a file in the root of the
   source directory. If your entrypoint file is in a subdirectory, you must rearrange
   your app’s files and update your app’s code accordingly before proceeding.
4. To convert your app, use CREATE OR REPLACE STREAMLIT with the FROM parameter.

   In the simplest case, your app may have null values for `title`, `user_packages`,
   `import_urls`, `external_access_integration`, and `external_access_secrets`. In
   this case, you can run the following command, replacing the placeholders with
   your app’s values:

   ```sqlexample
   CREATE OR REPLACE STREAMLIT my_app
   FROM '@db1.schema1/my_app_folder'
   MAIN_FILE = 'streamlit_app.py'
   QUERY_WAREHOUSE = my_warehouse;
   ```

   If your app has non-null values for any of the optional parameters, include them
   in the CREATE OR REPLACE STREAMLIT command. For example:

   ```sqlexample
   CREATE OR REPLACE STREAMLIT my_app
   FROM '@db1.schema1/my_app_folder'
   MAIN_FILE = 'streamlit_app.py'
   TITLE = 'My Streamlit App'
   QUERY_WAREHOUSE = my_warehouse
   IMPORTS = ('@db2.schema2/packages/package1.zip', '@db3.schema3/packages/package2.zip')
   EXTERNAL_ACCESS_INTEGRATION = ('eai_name_1', 'eai_name_2')
   SECRETS = ('secret1', 'secret2');
   ```
5. If your app isn’t loading, confirm your dependencies.

   For more information about dependency management, see [Manage dependencies for your Streamlit app](../app-development/dependency-management.md).

---
title: Migrating between runtime environments
source: https://docs.snowflake.com/en/developer-guide/streamlit/migrations-and-upgrades/runtime-migration.md
section: Streamlit in Snowflake
---

# Migrating between runtime environments

You can migrate a Streamlit app between warehouse runtimes and container runtimes by
updating the app’s RUNTIME_NAME and COMPUTE_POOL properties. However, some features
are only supported in one type of runtime environment, so there are some considerations
when migrating an app between runtime environments.

This page provides a checklist for migrating from warehouse to container runtime.
Each item provides a brief summary and a link to detailed information when needed.

> **Note:**
>
> If a user is viewing an app with a warehouse runtime and the app is altered on another tab to use a
> container runtime, the warehouse instance continues to run until one of the following events happens: the user
> navigates away, the tab is closed, the page is refreshed, or the WebSocket times out. In this case, source code changes made from the
> warehouse view will update the new container instance, but the warehouse preview won’t reflect any changes,
> even if the user selects Run.

## Prerequisites

Before you begin, adjust your warehouse runtime app in place to prepare for migration.

Optional: Back up your app’s code
:   If your app’s source code isn’t already stored in a version control system, an external
    repository, or a local directory, back it up to avoid any potential data loss during migration.

Ensure that your app wasn’t created with ROOT_LOCATION
:   Apps created with the ROOT_LOCATION parameter can only use warehouse runtimes.
    If your app was created with ROOT_LOCATION, upgrade it to use the FROM parameter.

    See: [Understanding the different types of Streamlit objects](overview.md)

Upgrade your app to Streamlit 1.50+
:   Ensure your app and all dependencies are
    compatible with Streamlit 1.50+.

    See: [Manage dependencies for your Streamlit app](../app-development/dependency-management.md)

Update your app to Python 3.11 only
:   Container runtimes only support Python 3.11, while warehouse runtimes support
    Python 3.9, 3.10, and 3.11. Ensure your app and all dependencies are compatible
    with Python 3.11.

    See: [Manage dependencies for your Streamlit app](../app-development/dependency-management.md)

Optional: Locally install Snowflake CLI 3.14.0+
:   If you deploy apps using Snowflake CLI, you need version 3.14.0 or later to
    support the container runtime deployment syntax. Check your version with
    `snow --version`. Optionally, you can use versions 3.12.0 - 3.13.1 if you
    use the `--experimental` flag.

    See: [Create your Streamlit app](../app-development/creating-your-app.md)

## Resources and permissions

Your app can continue to use its existing query warehouse, but you need to set up
a compute pool for the container runtime.

Create and grant access to a compute pool
:   The app owner needs USAGE privileges on the compute pool where the container
    runtime will run. App viewers don’t need any compute pool permissions.

    See: [Privileges required to create and use a Streamlit app](../object-management/privileges.md)

Create and grant access to an external access integration
:   Container runtimes ship with a minimal set of pre-installed packages. If your app
    requires additional packages or different versions of the pre-installed packages,
    you must use an external package index like PyPI. To allow your app to access
    an external package index, you must create an external access integration (EAI) and
    grant USAGE privileges on the EAI to the app owner.

    See: [External network access in Streamlit in Snowflake](../features/external-access.md)

## Dependency management

Replace `environment.yml` with `pyproject.toml` or `requirements.txt`
:   If you need to lock any dependency versions or specify additional dependencies, you must
    add a `pyproject.toml` or `requirements.txt` file to the root of your project
    directory. Packages can have different names between Conda and PyPI, so ensure you use
    the correct package names for your artifact repository.

    See: [Manage dependencies for your Streamlit app](../app-development/dependency-management.md)

Alter your app to set its external access integrations
:   If your dependencies include any version specifiers, or if you install any additional
    packages, you must assign an external access integration to your app. This is so that it
    can access the package index specified in your dependency file. PyPI is the default package
    index.

    See: [Manage dependencies for your Streamlit app](../app-development/dependency-management.md)

## Code changes

Replace `get_active_session()` with `st.connection("snowflake").session()`
:   When you use a container runtime, the Streamlit server handles multiple viewers
    concurrently. `get_active_session()` isn’t thread-safe, so you must use
    `st.connection("snowflake")` to manage your connection instead.

    See: [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md)

Review your code and implement caching
:   Because container runtimes share disk, compute, and memory resources between viewer sessions,
    you should use `st.cache_resource` or `st.cache_data` to cache expensive computations
    or data that doesn’t change frequently.

    See: [Understanding Streamlit’s client-server architecture](https://docs.streamlit.io/develop/concepts/architecture/architecture) and
    [Caching overview](https://docs.streamlit.io/develop/concepts/architecture/caching) in the Streamlit documentation.

Ensure thread-safety
:   When using a container runtime, your app code must be thread-safe to handle multiple
    viewers concurrently. While each viewer gets a unique instance of
    the app script, you should review any imported code for shared state or global variables that
    could lead to race conditions or inconsistent behavior. If you introduce new threads
    into a Streamlit app, review Streamlit’s architecture and don’t use Streamlit commands
    in your custom threads.

    See: [Multithreading in Streamlit](https://docs.streamlit.io/develop/concepts/design/multithreading) in the Streamlit documentation.

Replace `_snowflake` usage with native Python equivalents
:   `_snowflake` is a private module that is only available in user-defined functions (UDFs)
    and stored procedures. Warehouse runtimes inherit access to `_snowflake`, but container
    runtimes don’t. If your app uses `_snowflake`, replace it with native Python
    equivalents, such as the Snowflake Python Connector. If needed, use stored procedures to
    access secrets.

    See: [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md)

Update file paths and organization
:   The root of your source location is the working directory for your app.
    For most Python libraries, your app will need to use relative paths from the
    root of your source location. However, some Streamlit commands require paths
    relative to the entrypoint file. If your entrypoint file is in a subdirectory,
    check the paths in your code accordingly.

    Verify `secrets.toml` and `config.toml` locations.

    See: [Organize your Streamlit app files](../app-development/file-organization.md)

## App configuration changes

Alter your app to set its compute pool, query warehouse, and runtime
:   When you are ready to switch the runtime type of your app, you can use Snowsight or SQL.

    SnowsightSQL

    1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
    2. In the navigation menu, select Projects » Streamlit, and then select your app.
    3. In the upper-right corner, select the vertical ellipsis  menu, and then select App settings.
    4. For the Python environment, select Run on container.
    5. In the Compute pool dropdown, select your compute pool.
    6. In the Query warehouse dropdown, select your query warehouse.
    7. To save your changes and close the dialog, select Save.

    ```sqlexample
    ALTER STREAMLIT my_app
    COMPUTE_POOL = my_compute_pool
    QUERY_WAREHOUSE = my_warehouse
    RUNTIME_NAME = SYSTEM$ST_CONTAINER_RUNTIME_PY3_11;
    ```

    Your app will take a couple minutes to reboot and build its new container.

    See: [Runtime environments for Streamlit apps](../app-development/runtime-environments.md)

---
title: Organize your Streamlit app files
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/file-organization.md
section: Streamlit in Snowflake
---

# Organize your Streamlit app files

When you deploy an app to Streamlit in Snowflake, the app’s entrypoint file can have any name that follows
standard file naming conventions and can be located anywhere in the app’s source directory.
The app’s source directory can contain additional files, such as page scripts, Python modules,
media files, and configuration files.

The root of your app’s source directory is Streamlit’s working directory. If you develop and run
your app locally, this means you must execute the `streamlit run` command from the root of
your source directory to ensure that all paths are correct.

When you initialize a Streamlit app from Snowsight or use CREATE STREAMLIT
without specifying a source location, the embedded stage of the Streamlit object
contains an entrypoint file in its root. You can use the file explorer to add
additional files. If you need to rename or move your entrypoint file, you must use
SQL commands to update your Streamlit object’s MAIN_FILE value.

> **Note:**
>
> If you use the CREATE STREAMLIT command with the ROOT_LOCATION parameter, your app can only
> use a warehouse runtime and is subject to additional limitations. This page covers apps
> created with the FROM parameter. For more information, see [Understanding the different types of Streamlit objects](../migrations-and-upgrades/overview.md).

## Container runtime file structure

When you use a container runtime, your entrypoint file can have any name that follows
standard file naming conventions and can be located anywhere in your source
directory. However, with the introduction of [`st.navigation`](https://docs.streamlit.io/develop/api-reference/navigation/st.navigation) in Streamlit v1.36, the most common
practice is to use `streamlit_app.py` as the entrypoint file because page names don’t have to be
inferred from the file names.

Snowflake executes the `streamlit run` command from the root of the source directory, so
you must handle paths accordingly.

* Your entrypoint file can have any name and be located anywhere in your source directory.
* Your dependency files can be in any directory between the root of your source directory
  and directory containing your entrypoint file. For more information, see
  [Manage dependencies for your Streamlit app](dependency-management.md).
* You can have one or more `.streamlit/` directories between the root of
  your source directory and the directory containing your entrypoint file.
* The root of your source directory is Streamlit’s working directory.

The following directory structure is valid for a container-runtime Streamlit app:

```none
source_directory/
├── .streamlit/           # Optional configuration
│   ├── config.toml
│   └── secrets.toml
├── page_1.py             # Page 1
├── page_2.py             # Page 2
├── pyproject.toml        # Python dependencies
├── streamlit_app.py      # Entrypoint file
└── uv.lock               # Auto-generated lockfile
```

The following directory structure shows two apps in one source directory, each with its own entrypoint file
and dependencies. In this example, two different Streamlit objects exist. Both Streamlit objects set FROM to the location
represented by `source_directory`, but each object sets MAIN_FILE to a different `streamlit_app.py` file.
The first app uses a `pyproject.toml` file for dependencies, while the second app uses a `requirements.txt` file.

```none
source_directory/
├── .streamlit/           # Shared configuration
│   ├── config.toml
│   └── secrets.toml
├── app_one/              # First app source directory
│   ├── .streamlit/       # Overriding first-app configuration
│   │   ├── config.toml
│   │   └── secrets.toml
│   ├── page_1.py
│   ├── page_2.py
│   ├── pyproject.toml     # Python dependencies for first app
│   ├── streamlit_app.py   # Entrypoint file for first app
│   └── uv.lock
├──  app_two/              # Second app source directory
│   ├── requirements.txt   # Python dependencies for second app
│   ├── page_1.py
│   ├── page_2.py
│   ├── streamlit_app.py   # Entrypoint file for second app
│   └── uv.lock
└── utils/                 # Shared modules
    └── helper.py
```

> **Important:**
>
> Some Streamlit features require paths relative to the working directory while others
> require paths relative to the entrypoint file.

Typically, paths to images and other media within your app should be relative to the
working directory (the root of your source directory). However, paths to other pages in a
multipage app are relative to the location of the entrypoint file.

To avoid confusion, consider organizing your app files so that the entrypoint file
is in the root of your source directory. You can save multiple apps in one Git
repository and pass a subdirectory to the FROM parameter when you create the
Streamlit object. That subdirectory is then your app’s source directory. In the
previous example, this means using `source_directory/app_one` and
`source_directory/app_two` in the FROM parameter. Although in that case, the apps
would lose access to the shared modules in `source_directory/utils`.

## Warehouse runtime file structure

When you use a warehouse runtime, your entrypoint file can have any name but must be located in the root of your
source directory. Your Python version and dependencies are specified in an `environment.yml` file in the root
of your source directory. If you don’t include an `environment.yml` file, your app will run on the latest
version of Python and latest version of Streamlit that are currently supported in Streamlit in Snowflake. If you use the
[package picker](dependency-management.md) in Snowsight to add packages, the
`environment.yml` file is automatically updated or created for you.

The following directory structure is valid for a warehouse-runtime Streamlit app:

```none
source_directory/
├── .streamlit/           # Optional configuration
│   └── config.toml
├── environment.yml       # Conda dependencies
├── page_1.py
├── page_2.py
└── streamlit_app.py      # Entrypoint file
```

### Importing modules and files from other stages

The CREATE STREAMLIT and ALTER STREAMLIT commands support the IMPORTS parameter, which allows you to
import additional files from other stages into your app’s source directory. If you have a set of
common modules or files that you want to share across multiple apps, you can store them in a stage
and import them into each app using the IMPORTS parameter. However, this is only supported for apps
using a warehouse runtime.

## Multipage apps

Streamlit supports two methods for creating multipage apps:

* Using `st.navigation`: You can use the `st.navigation` command to create a custom navigation structure within your app. This
  allows you to define pages programmatically and control the navigation flow. The entrypoint file acts like a page router and the pages of your
  app can be defined as functions or Python scripts anywhere in your source directory. This is the recommended method for creating multipage
  apps, because it provides the most flexibility.
* Using a `pages/` directory: You can create a directory named `pages/` adjacent to your app’s entrypoint file. The
  entrypoint file is treated as the home page of your app. Each Python file in the `pages/` directory is treated as an additional page in the app.
  Page names are derived from the filenames.

You can’t mix the two methods for creating multipage apps. For more information on multipage apps, see
[Overview of multipage apps](https://docs.streamlit.io/develop/concepts/multipage-apps/overview)
in the Streamlit documentation.

> **Note:**
>
> When you host multipage apps in Streamlit in Snowflake, URL pathnames are prefixed with `/!`. For example, if the relative path to a page is `/page2`
> in a multipage app, its relative path in Streamlit in Snowflake becomes `/!/page2` as shown in the following URL: `https://app.snowflake.com/org/account_name/#/streamlit-apps/DB.SCHEMA.APP_NAME/!/page_2`

## Update your entrypoint file

If you rename or move your entrypoint file, you must use SQL commands to update your Streamlit
object to use the new entrypoint file. You must use a container runtime if you move your
entrypoint file to a subdirectory.

1. Use the [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) command to change the MAIN_FILE parameter
   of your Streamlit object, as shown in the following example:

   ```sqlexample
   ALTER STREAMLIT my_streamlit_app
   SET MAIN_FILE = 'subdir/new_entrypoint.py';
   ```

   This example changes the entrypoint file of the `my_streamlit_app` Streamlit object
   to `subdir/new_entrypoint.py`.

---
title: Personalize your Streamlit app with user information
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/personalization.md
section: Streamlit in Snowflake
---

# Personalize your Streamlit app with user information

The [`st.user`](https://docs.streamlit.io/develop/api-reference/user/st.user) API lets your Streamlit in Snowflake app access information about the person currently
viewing the app. You can use this to display personalized greetings, filter data by user,
or track who performed an action.

## What st.user provides in Streamlit in Snowflake

In Streamlit in Snowflake, `st.user` provides two attributes for the current viewer:

* `st.user.user_name` – the viewer’s Snowflake username.
* `st.user.email` – the viewer’s email address.

The following example greets the viewer by their Snowflake username:

```python
import streamlit as st

st.write(f"Hello, {st.user.user_name}!")
```

## Look up the viewer’s display name (optional)

`st.user.user_name` returns the Snowflake username (for example, `JSMITH`). To
show a friendlier name, you can look up the user’s display name with
[DESCRIBE USER](../../../sql-reference/sql/desc-user.md). This requires elevated privileges:
the app owner’s role must have the MONITOR privilege on the user object, or be an account
administrator.

The following example greets the viewer by their display name:

```python
import streamlit as st

conn = st.connection("snowflake")
session = conn.session()

user_info = session.sql(
    f"DESCRIBE USER {st.user.user_name}"
).collect()
display_name = st.user.user_name
for row in user_info:
    if row["property"] == "DISPLAY_NAME":
        display_name = row["value"]
        break

st.write(f"Hello, {display_name}!")
```

## Personalize data queries

Filter query results so each viewer sees only their own data. This example uses
`session.sql()` instead of `conn.query()` so that the query runs fresh on every
rerun rather than returning cached results:

```python
import streamlit as st

conn = st.connection("snowflake")
session = conn.session()
data = session.sql(
    "SELECT * FROM my_table WHERE owner = ?",
    params=[st.user.user_name],
).to_pandas()
st.dataframe(data)
```

## Track who performed an action

Include the viewer’s identity when writing data back to Snowflake:

```python
import streamlit as st

conn = st.connection("snowflake")
session = conn.session()

with st.form("entry_form"):
    value = st.text_input("Enter a value")
    submitted = st.form_submit_button("Save")

if submitted:
    session.sql(
        "INSERT INTO my_table (created_by, value) VALUES (?, ?)",
        params=[st.user.user_name, value],
    ).collect()
    st.success("Saved!")
```

## Relationship to CURRENT_USER()

The SQL function CURRENT_USER() returns the Snowflake username of the session owner. In
Streamlit in Snowflake apps using owner’s rights, this is the owner of the Streamlit object, not the viewer.
To identify the viewer in your Python code, use `st.user` instead.

If you need the viewer’s identity in SQL queries, consider using restricted caller’s rights (Preview).
With restricted caller’s rights, queries run as the viewer, so CURRENT_USER() returns the viewer’s
identity. For more information, see [Restricted caller’s rights and Streamlit in Snowflake](../features/restricted-callers-rights.md).

## Runtime differences

`st.user` is available in both container and warehouse runtimes. The behavior is
the same in both environments: it returns the identity of the person viewing the app,
not the owner of the Streamlit object.

---
title: Private connectivity for Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/object-management/privatelink.md
section: Streamlit in Snowflake
---

# Private connectivity for Streamlit in Snowflake

This topic describes using private connectivity when accessing Streamlit in Snowflake.

## Configuring access to Snowflake

1. Set up private connectivity for your Snowflake account for a supported service:

   * [AWS PrivateLink](../../../user-guide/admin-security-privatelink.md)
   * [Azure Private Link](../../../user-guide/privatelink-azure.md)
   * [Google Cloud Private Service Connect](../../../user-guide/private-service-connect-google.md)
2. Set up private connectivity for [Snowsight](../../../user-guide/ui-snowsight-gs.md).

## Configuring access to Streamlit in Snowflake

To determine the hostname, call [SYSTEM$GET_PRIVATELINK_CONFIG](../../../sql-reference/functions/system_get_privatelink_config.md) in your Snowflake account.
The Streamlit hostname is displayed under the `app-service-privatelink-url` key, which is the wildcard URL required for
routing Streamlit application traffic through a private connectivity service, such as AWS PrivateLink.

> **Note:**
>
> You can set up a new VPC endpoint for Streamlit or create a DNS record to the same VPC endpoint of your Snowflake account, as shown in the following example:
>
> * Record name: `*.<identifier>.privatelink.snowflake.app`
> * Type: CNAME
> * Route traffic to: same VPC as your Snowflake traffic.

Hostname routing at an account level is currently not supported.

## Security considerations

Streamlit in Snowflake apps serve both HTTPS-encrypted traffic and WebSocket-encrypted traffic. The Streamlit browser client application is mounted in a third-party, cross-origin
iframe within Snowsight. This enables strict cross-site browser isolation control.

Streamlit in Snowflake uses a separate URL scheme for specific security requirements. Streamlit URLs have their own top-level domain with no shared elements
with Snowsight. Each Streamlit app has a unique origin.

> **Note:**
>
> When using AWS PrivateLink or Azure Private Link, you control the DNS resolution; there are no PrivateLink DNS records controlled by Snowflake.

---
title: Privileges required to create and use a Streamlit app
source: https://docs.snowflake.com/en/developer-guide/streamlit/object-management/privileges.md
section: Streamlit in Snowflake
---

# Privileges required to create and use a Streamlit app

Within Streamlit in Snowflake, a Streamlit app is a securable object that adheres to the
[Snowflake access control framework](../../../user-guide/security-access-control-overview.md).
Streamlit apps use a permission model that is based on owner’s rights. For more information, see [Understanding owner’s rights and Streamlit in Snowflake apps](owners-rights.md).
You can also configure a container-runtime app to use restricted caller’s rights (Preview). For more information, see
[Restricted caller’s rights and Streamlit in Snowflake](../features/restricted-callers-rights.md).

The app owner and the owner of the schema containing the Streamlit app can determine which roles have
permission to use the app. Users can interact with the app and can see anything displayed by
the Streamlit app. Users have the same view of the app as the owner does except that they can’t access
the edit mode.

For more information, see [Share a Streamlit app](../app-development/managing-your-app.md).

## Privileges required to create a Streamlit app

To create a Streamlit app, if your role does not own the objects in the following table,
then your role must have the listed
[privileges](../../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE STREAMLIT | Schema where you create the Streamlit object |  |
| READ | Stage from which you copy the Streamlit app source files |  |
| USAGE | Warehouse used by the Streamlit app |  |
| USAGE | Compute pool used by the Streamlit app | This privilege is only required if your app uses a container runtime. |
| USAGE | External access integrations used by the Streamlit app | This privilege is only required if your app uses external access integrations. For container runtimes, this privilege is required to install packages from external package indexes like PyPI. |
| USAGE | Secrets used by the Streamlit app | This privilege is only required if your app uses secrets and only applies to warehouse runtimes. |
| CREATE STAGE | Schema where you create the Streamlit object | This privilege is only required to create Streamlit objects with the ROOT_LOCATION parameter. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Use the [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md) command to grant these privileges to a role. The following
example shows how to grant the necessary privileges to create container-runtime apps:

```sqlexample
GRANT USAGE ON DATABASE streamlit_db TO ROLE streamlit_developer;
GRANT USAGE ON SCHEMA streamlit_db.apps TO ROLE streamlit_developer;
GRANT CREATE STREAMLIT ON SCHEMA streamlit_db.apps TO ROLE streamlit_developer;
GRANT USAGE ON COMPUTE_POOL streamlit_compute_pool TO ROLE streamlit_developer;
GRANT USAGE ON INTEGRATION python_package_index TO ROLE streamlit_developer;
GRANT USAGE ON WAREHOUSE streamlit_wh TO ROLE streamlit_developer;
```

If a future grant is defined on the database or schema, ensure that the user creates the Streamlit app using the role defined
in the future grant.

## Privileges required to view a Streamlit app

To view a Streamlit app, you must have a Snowflake account and be signed in. Additionally,
you must use a role that is granted the USAGE privilege on the following objects:

* The database that contains the Streamlit app
* The schema that contains the Streamlit app
* The Streamlit app

In most cases, when the app owner shares a Streamlit app with another role, the USAGE privilege is
automatically granted to the new role. However, if a Streamlit app is created in a schema with
MANAGED ACCESS, the USAGE privilege must be manually granted to the new role.

The schema owner or a user with the role with the MANAGE GRANTS privilege must grant the USAGE
privilege using the [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md) command as shown in this example:

```sqlexample
GRANT USAGE ON DATABASE streamlit_db TO ROLE streamlit_viewer;
GRANT USAGE ON SCHEMA streamlit_db.streamlit_schema TO ROLE streamlit_viewer;
GRANT USAGE ON STREAMLIT streamlit_db.streamlit_schema.streamlit_app TO ROLE streamlit_viewer;
```

The schema owner or a user with the role with the MANAGE GRANTS privilege can grant the USAGE
privilege to view all future Streamlit apps created in the schema as shown in this example:

```sqlexample
GRANT USAGE ON FUTURE STREAMLITS IN SCHEMA streamlit_db.streamlit_schema TO ROLE streamlit_viewer;
```

---
title: Restricted caller’s rights and Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/restricted-callers-rights.md
section: Streamlit in Snowflake
---

# Restricted caller’s rights and Streamlit in Snowflake

By default, all Streamlit in Snowflake apps [run with the privileges of the owner](../object-management/owners-rights.md),
not the privileges of the caller. The Streamlit app developer can define whether a
container-runtime app runs with owner’s rights or restricted caller’s rights. Restricted caller’s rights
aren’t supported in warehouse runtimes.

Restricted caller’s rights allow a Streamlit app to run with caller’s rights, but restrict which of the caller’s
privileges the app runs with. With restricted caller’s rights, a Streamlit app can’t run with a specific privilege
unless an administrator expressly allows it. Administrators use caller grants
to define which of the caller’s privileges an app can run with. This way, Streamlit apps only access data (on behalf
of the viewer) that they are authorized to access.

For more information, see [Restricted caller’s rights](../../restricted-callers-rights.md).

## Required caller grants

To access any tables, stored procedures, or warehouses on behalf of the viewer, the Streamlit app developer must have
the caller grants granted by a user with the MANAGE CALLER GRANTS privilege.

### Example workflow

1. The administrator grants the MANAGE CALLER GRANTS privilege to the `data_science_manager` role:

   ```sqlexample
   GRANT MANAGE CALLER GRANTS ON ACCOUNT TO ROLE data_science_manager;
   ```
2. A user with the `data_science_manager` role grants the following privileges to the `streamlit_app_developer` role:

   * Caller select privileges to the `streamlit_app_developer` role so that Streamlit apps owned by that role that access
     the `streamlit_db.streamlit_schema.streamlit_table` table can run with the SELECT privilege on that table:

     ```sqlexample
     GRANT CALLER SELECT ON TABLE streamlit_db.streamlit_schema.streamlit_table TO ROLE streamlit_app_developer;
     ```
   * Caller usage privileges to the `streamlit_app_developer` role to use the `streamlit_wh` warehouse:

     ```sqlexample
     GRANT CALLER USAGE ON WAREHOUSE streamlit_wh TO ROLE streamlit_app_developer;
     ```

For more information about caller grants, see [About caller grants](../../restricted-callers-rights.md)
and [GRANT CALLER](../../../sql-reference/sql/grant-caller.md).

## Use cases for restricted caller’s rights in Streamlit in Snowflake

Restricted caller’s rights in Streamlit in Snowflake let you control the following:

* Which pages of a Streamlit app are available
* Which data in the Streamlit app is available
* Which data with row access policies the CURRENT_ROLE can access
* Which warehouses are accessible
* Which stored procedures can be called in a Streamlit app

## Restricted caller’s rights in container runtimes

In container runtimes, you can combine owner’s rights and restricted caller’s rights in the same app.

* To create a connection that uses owner’s rights, use `st.connection("snowflake")`.
* To create a connection that uses restricted caller’s rights, use `st.connection("snowflake-callers-rights")`.

For more information, see [`st.connection`](https://docs.streamlit.io/develop/api-reference/connections/st.connection) and [`SnowflakeConnection`](https://docs.streamlit.io/develop/api-reference/connections/st.connections.snowflakeconnection) in the Streamlit documentation.

The following example shows how to create a caller’s rights connection:

```python
import streamlit as st

conn = st.connection("snowflake-callers-rights")
df = conn.query("SELECT CURRENT_USER()")
st.write(f"Running as: {df[0][0]}")
```

### Tips and limitations for using restricted caller’s rights in container runtimes

* The token provided in the `Sf-Context-Current-User-Token` header is only valid for two minutes and
  is created at the start of the app session. Create any caller’s rights connections at the top of your app script
  and not behind if-else blocks or pages.
* Restricted caller’s rights connections use the viewer’s default role and not the role they have selected in Snowsight.
* You can use both restricted caller’s rights connections and regular owner’s rights connections in the same app by creating multiple connections.
* Restricted caller’s rights connections only work when your app is using a container runtime. If you try to use a restricted caller’s rights
  connection in a local development environment or in a warehouse-runtime environment, you will get an error.
* Restricted caller’s rights don’t support secondary roles.

> **Important:**
>
> Restricted caller’s rights connections are session-scoped. If you need to cache data returned from a restricted caller’s rights
> connection, you must use session-scoping in the cache decorator. This prevents data from being shared between sessions.
> To use session-scoping with caching, set `scope="session"` in the caching decorator. For more information, see
> [`st.cache_data`](https://docs.streamlit.io/develop/api-reference/caching-and-state/st.cache_data) in the Streamlit documentation.

---
title: Row access policies in Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/row-access.md
section: Streamlit in Snowflake
---

# Row access policies in Streamlit in Snowflake

This topic describes using context functions and row access policies in Streamlit in Snowflake warehouse runtimes.

In container runtimes, context functions on owner’s rights connections will return values from the owner role’s context
and so are not appropriate for user-targeted row access policies. However, restricted caller’s rights connections
return the viewer’s context. For more information, see [Restricted caller’s rights and Streamlit in Snowflake](restricted-callers-rights.md).

## Context functions and row access policies in Streamlit in Snowflake

To use [context functions](../../../sql-reference/functions-context.md) such as [CURRENT_USER](../../../sql-reference/functions/current_user.md)
and data from tables with [row access policies](../../../user-guide/security-row-intro.md) in a Streamlit in Snowflake app, a user with the
ACCOUNTADMIN role must grant the global READ SESSION privilege to the Streamlit app owner role, as shown in the following example:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT READ SESSION ON ACCOUNT TO ROLE streamlit_owner_role;
```

> **Note:**
>
> In a Streamlit in Snowflake app, you can’t use row access policies that use CURRENT_ROLE. Streamlit in Snowflake apps run with owner’s rights,
> so using CURRENT_ROLE inside a Streamlit app always returns the app owner role.
> For more information, see [Understanding owner’s rights and Streamlit in Snowflake apps](../object-management/owners-rights.md).

### Example: Access data in a table with row access policy using CURRENT_USER

You can use a Streamlit in Snowflake app to govern access to rows in a table protected by a row access policy.
Specify the CURRENT_USER function in the body of the row access policy and add the row access policy to the table.

The following example demonstrates how to govern access to a table that is protected by a row access policy in a Streamlit in Snowflake app.

1. Create a table and insert data:

   ```sqlexample
   CREATE TABLE row_access_policy_test_table (
       id INT,
       some_data VARCHAR(100),
       the_owner VARCHAR(50)
   );

   INSERT INTO row_access_policy_test_table (id, some_data, the_owner)
   VALUES
       (4, 'Some information 4', 'ALICE'),
       (5, 'Some information 5', 'FRANK'),
       (6, 'Some information 6', 'ALICE');
   ```
2. Create a row access policy:

   ```sqlexample
   CREATE OR REPLACE ROW ACCESS POLICY st_schema.row_access_policy
   AS (the_owner VARCHAR) RETURNS BOOLEAN ->
       the_owner = CURRENT_USER();
   ```
3. Add the row access policy to the table:

   ```sqlexample
   ALTER TABLE row_access_policy_test_table ADD ROW ACCESS POLICY st_schema.row_access_policy ON (the_owner);
   ```
4. Create a Streamlit app.
5. Grant the global READ SESSION privilege to the Streamlit app owner role:

   ```sqlexample
   GRANT READ SESSION ON ACCOUNT TO ROLE streamlit_owner_role;
   ```
6. Add the following code to your Streamlit app:

   ```python
   # Import Python packages
   import streamlit as st
   from snowflake.snowpark.context import get_active_session

   st.title("CURRENT_USER() + Row Access Policy in SiS Demo :balloon:")
   st.write(
           """You can access `CURRENT_USER()` and data from tables with row access policies
           in Streamlit in Snowflake apps
           """)

   # Get the current credentials
   session = get_active_session()

   st.header('Demo')

   st.subheader('Credentials')
   sql = "SELECT CURRENT_USER();"
   df = session.sql(sql).collect()
   st.write(df)

   st.subheader('Row Access on a Table')
   sql = "SELECT * FROM st_db.st_schema.row_access_policy_test_table;"
   df = session.sql(sql).collect()

   st.write(df)
   ```

---
title: Runtime environments for Streamlit apps
source: https://docs.snowflake.com/en/developer-guide/streamlit/app-development/runtime-environments.md
section: Streamlit in Snowflake
---

# Runtime environments for Streamlit apps

Streamlit in Snowflake offers two types of runtime environments for Streamlit apps:

* **Container runtime**: Serves an app as a long-running service and creates a dedicated
  instance of the app that is shared among all viewers.
* **Warehouse runtime**: Runs on-demand and creates a personal instance of the app for each
  viewer.

> **Note:**
>
> If you use the CREATE STREAMLIT command with the ROOT_LOCATION parameter, your app can only
> use a warehouse runtime and is subject to additional limitations. This page covers apps
> created with the FROM parameter. For more information, see [Understanding the different types of Streamlit objects](../migrations-and-upgrades/overview.md).

The following table compares the features supported by warehouse runtimes and container runtimes for Streamlit in Snowflake apps.

| Supported features | Warehouse runtime | Container runtime |
| --- | --- | --- |
| Compute | Virtual warehouse for app code and internal queries. | [Compute pool](../../snowpark-container-services/working-with-compute-pool.md) node for app code. Virtual warehouse for internal queries. |
| Execution length | Configurable with a [sleep timer](../features/sleep-timer.md). | Suspension after three days of viewer inactivity. |
| Maintenance window | Not applicable. | Subject to the Snowpark Container Services [maintenance window](../../snowpark-container-services/working-with-compute-pool.md). |
| Base image | Linux in a Python stored procedure. | Linux in a Snowpark container. |
| Python versions | 3.9, 3.10, 3.11 | 3.11 |
| Streamlit versions | 1.22+ (limited selection). | 1.50+ (any version, including `streamlit-nightly` versions). |
| Dependencies | Packages from the Snowflake Conda channel via `environment.yml`. | Packages from an external package index like PyPI via `pyproject.toml` or `requirements.txt`. |
|  | Pin versions with the `=` operator. | Pin versions with the `==` operator. |
|  | Use version ranges with the `*` wildcard. | Use version ranges with `<`, `<=`, `>=`, `>`, and comma-separated lists. |
| Entrypoint location | Root of your source directory. | Root or subdirectory within your source directory. |
| Streamlit server | Temporary, individual instance of the Streamlit server for each viewer session. | Persistent, shared server instance for all viewer sessions. |
|  | Doesn’t share disk, compute, and memory resources between viewer sessions. | Shares disk, compute, and memory resources between viewer sessions. |
|  | Doesn’t support caching between sessions. | Fully supports Streamlit’s caching features. |
| Startup times | Slower per viewer session due to on-demand app creation. | Faster per viewer session but slower deployment due to container startup. |
| Access | Requires ownership to edit. | Same as warehouse runtime. |
|  | Uses owner’s rights for queries, limited similarly to owner’s rights stored procedures. | Uses owner’s rights for queries by default. Supports [restricted caller’s rights (Preview)](../features/restricted-callers-rights.md) on some or all queries. |
| Logging | Event table logging and tracing via the [telemetry framework](../../logging-tracing/logging-tracing-overview.md). | Live console logs and historical event table logs. |

## Container runtimes

A container runtime provides a dedicated instance of your Streamlit app that is shared
among all viewers. Each viewer connects to the same instance of the app, which means
viewers connect quickly to an already-live app. Containers cost significantly less
than warehouses per minute and are generally a more cost effective hosting solution,
especially for apps with frequent usage.

Container runtimes share disk, compute, and memory resources between viewer sessions.
This means you can fully take advantage of Streamlit’s caching features to improve
performance. Efficient app design is important with container runtimes to ensure that
all viewers have a good experience.

With an external access integration, you can install Python packages from PyPI or other
package indexes that support the [simple repository API](https://peps.python.org/pep-0503/).
This makes container runtimes more flexible. You’ll always have access to the latest
version of Streamlit, including `streamlit-nightly` versions. Container runtimes also
support [restricted caller’s rights (Preview)](../features/restricted-callers-rights.md).

## Warehouse runtimes

Warehouse runtimes provide an on-demand, personal instance of the Streamlit app for
each viewer. When a viewer opens the app, a new instance of the app is created for
that viewer. Each viewer has their own isolated environment, which increases user
load times. While both runtimes execute SQL queries using the owner’s privileges,
apps using warehouse runtimes are subject to similar restrictions as owner’s rights
stored procedures. For more information, see [Owner’s rights stored procedures](../../stored-procedure/stored-procedures-rights.md).

## Guidelines for selecting resources in Streamlit in Snowflake

When you run a Streamlit app in Streamlit in Snowflake, multiple factors may affect performance, including
the complexity of the Streamlit app, availability of warehouses, and latency. The following
sections provide general guidelines for using virtual warehouses and compute pools in Streamlit in Snowflake.

### Selecting a compute pool

When you use a container runtime, you must select a compute pool to run the Streamlit app.
Each Streamlit app runs on a single compute pool node; a Streamlit app takes an entire node.
The size of the compute pool node affects the performance of the app. Larger node sizes can
be used if your app requires more memory. However, because Streamlit runs as a single process,
your app is unlikely to benefit from multiple CPUs. For more information, see
[Creating a compute pool](../../snowpark-container-services/working-with-compute-pool.md).

> **Tip:**
>
> * To reduce friction when you add more apps in the future, set MAX_NODES to
>   account for future Streamlit apps.
> * To ensure that app creation is fast, create your compute pool with MIN_NODES
>   equal to the number of apps you intend to run simultaneously, including testing
>   and experiments.
> * To reduce costs, use smaller node sizes.
> * Both node quantity and node size impact costs. For more information, see
>   [Compute pool cost](../../snowpark-container-services/accounts-orgs-usage-views.md).

For example, the following command creates a compute pool to run two to five
Streamlit apps simultaneously:

```sqlexample
CREATE COMPUTE POOL streamlit_compute_pool
 MIN_NODES = 2
 MAX_NODES = 5
 INSTANCE_FAMILY = CPU_X64_XS;
```

### Selecting a virtual warehouse

To optimize costs, performance, and monitoring, use separate compute resources for
running your app and executing queries within your app. If you use a container runtime,
your compute resources are automatically separated because your app code runs on a
compute pool node and its queries run on a virtual warehouse. If you use a warehouse
runtime, your app will use the same warehouse to run your app code and execute queries
unless you activate a different query warehouse within your app code.

For example, with a warehouse runtime, you might use an X-Small warehouse to run your
Python code and activate a Large query warehouse in your app to run complex queries.

> **Note:**
>
> In the CREATE STREAMLIT and ALTER STREAMLIT commands, the QUERY_WAREHOUSE parameter
> should be used differently depending on the runtime type:
>
> * For container runtimes, QUERY_WAREHOUSE sets the query warehouse for executing queries
>   within the app.
> * For warehouse runtimes, QUERY_WAREHOUSE sets the code warehouse for running the app code.
>   If you don’t activate a different warehouse within your app code, the same warehouse will
>   be used for executing queries.

#### Best practices for query warehouses

In a Streamlit app, to select a query warehouse, follow the same general guidelines
as you would for any other Snowflake workload. Consider the complexity of the queries,
the size of the data being queried, and the expected concurrency when selecting a warehouse size.

If your app uses a container runtime, use the QUERY_WAREHOUSE parameter to set the query warehouse
when you create or alter the Streamlit app. However, if your app uses a warehouse runtime, use the
QUERY_WAREHOUSE parameter to set your code warehouse. You should generally use a smaller, dedicated
warehouse for running the app code and manually switch to different query warehouse within your app code.

**Example: Container runtime**

When you use a container runtime, set a sufficiently large query warehouse to run your app’s internal queries:

```sqlexample
CREATE STREAMLIT my_app
FROM '@my_stage/app_folder'
MAIN_FILE = 'streamlit_app.py'
RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
COMPUTE_POOL = streamlit_compute_pool
QUERY_WAREHOUSE = my_large_warehouse
;
```

**Example: Warehouse runtime**

When you use a warehouse runtime, set a small, dedicated code warehouse for running Streamlit apps:

```sqlexample
CREATE STREAMLIT my_app
FROM '@my_stage/app_folder'
MAIN_FILE = 'streamlit_app.py'
QUERY_WAREHOUSE = my_small_warehouse;
```

Within your app code, switch to a different warehouse for queries:

```python
import streamlit as st

conn = st.connection("snowflake")
session = conn.session()
session.use_warehouse("my_large_warehouse")
```

#### Best practices for code warehouses

When use a warehouse runtime in Streamlit in Snowflake, select the smallest warehouse possible to run your app code.

Warehouses cache Python packages used by Streamlit apps, improving performance for subsequent app loads.
The cache is removed when the warehouse suspends, which may slow initial app loading after the warehouse resumes.
If the resumed warehouse runs more apps, the package cache rebuilds and improves loading performance.

Per-second billing and auto-suspend provide flexibility to start with smaller warehouses and adjust sizes
as needed. You can increase warehouse size at any time. For more information, see [Change the query warehouse](managing-your-app.md).

Snowflake recommends using a dedicated warehouse for Streamlit apps to isolate costs and potentially
improve load times by avoiding other workloads. Within your app code, activate a different warehouse for
queries as needed.

For more information, see [Warehouse considerations](../../../user-guide/warehouses-considerations.md).

> **Tip:**
>
> * Set auto-suspend to at least 30 seconds to avoid warehouse suspension during initialization.
> * Configure sleep times and WebSocket timeouts for your Streamlit apps to reduce costs. For more information, see [Custom sleep timer for a Streamlit app](../features/sleep-timer.md).

---
title: Security overview for Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/object-management/security.md
section: Streamlit in Snowflake
---

# Security overview for Streamlit in Snowflake

This topic provides a security overview for system administrators managing Streamlit in Snowflake in their Snowflake accounts.
Understanding the security model and implementing proper controls ensures that developers can build secure applications
while administrators maintain governance over sensitive data and resources.

## Security model

Streamlit in Snowflake follows Snowflake’s comprehensive security model, which includes authentication, role-based access control,
network policies, and data governance features. Apps are first-class Snowflake objects that integrate with existing
security infrastructure.

### Owner’s rights execution

By default, Streamlit apps run with owner’s rights, similar to stored procedures. This has the following consequences:

* Apps execute queries using the privileges of the app owner, not the viewer.
* The app owner’s role determines what data and operations the app can access.
* Viewers can interact with the app without needing direct access to underlying tables or views.

This model eliminates the need for service account tokens and integrates seamlessly with Snowflake’s authentication
and access control features. For more information, see [Understanding owner’s rights and Streamlit in Snowflake apps](owners-rights.md).

As an alternative, you can configure a container-runtime app to use restricted caller’s rights (Preview), which
allows the app to run with the viewer’s privileges instead of the owner’s. For more information, see
[Restricted caller’s rights and Streamlit in Snowflake](../features/restricted-callers-rights.md).

### Shared responsibility model

Security responsibility is shared between Snowflake, account administrators, and app developers:

* Snowflake provides the secure platform, authentication, encryption, and security features.
* Administrators configure account-level security policies, manage roles and privileges, and audit app usage.
* App developers write secure code, handle secrets properly, and follow security best practices.

For more information about Snowflake’s security model, see [Snowflake’s Shared Responsibility Model](https://www.snowflake.com/en/resources/report/snowflake-shared-responsibility-model/).

### Content Security Policy

All Streamlit apps run within a [Content Security Policy](https://developer.mozilla.org/en-US/docs/Web/HTTP/Guides/CSP) (CSP)
that restricts which resources can be loaded. This policy provides defense-in-depth protection against cross-site scripting (XSS)
and other code injection attacks. The CSP is not configurable at this time.

The CSP blocks the following external resources:

* Loading code (scripts, styles, fonts) from external domains
* Embedding apps in iframes from external domains

The CSP allows the following external resources:

* Images and media from HTTPS sources: Apps can load images and media files from any HTTPS URL, including
  external image hosting services and APIs that return images. This doesn’t require an external access integration.
* Data URIs and blob URLs: Apps can use embedded data (data URIs) and dynamically generated content (blob URLs)
  for images and media. This supports features like displaying charts, diagrams, or user-uploaded content.
* Mapbox and Carto resources: A limited subset of resources from Mapbox and Carto are permitted to support
  mapping visualizations.

> **Note:**
>
> * For warehouse runtimes which use conda to manage dependencies, you must accept the Anaconda terms to use Mapbox.
>   For more information, see [Using third-party packages from Anaconda](../../udf/python/udf-python-packages.md).
> * Loading images or media from external domains is supported in Streamlit in Snowflake, but not in Snowflake Native App Framework.
> * The CSP also blocks front-end calls that are generally considered unsafe, such as `eval()`.

This restrictive policy means that most third-party JavaScript libraries and custom components that rely on
external scripts won’t work in Streamlit apps. For more information about CSP limitations, see
[Loading external resources](../limitations.md).

## Essential security setup

The following security configurations are essential for a secure and well-functioning Streamlit in Snowflake environment.

### Network access configuration

Configure network access to ensure that apps can communicate with Snowflake.

**For all deployments:**

* Add `*.snowflake.app` to your network allowlist to enable communication between Streamlit apps and Snowflake.
* For Streamlit apps using container runtimes, also add `*.snowflakecomputing.app` to your network allowlist.
* Ensure WebSockets are not blocked in your network configuration.

For more information, see [You can’t load the Streamlit app](../troubleshooting.md).

**For private connectivity:**

If your organization requires private connectivity, configure AWS PrivateLink, Azure Private Link, or
Google Cloud Private Service Connect for both Snowflake access and Streamlit app access. For more information,
see [Private connectivity for Streamlit in Snowflake](privatelink.md).

### Role-based access control

Establish a role hierarchy for managing Streamlit apps.

**Recommended role structure:**

* Creator roles: Roles with CREATE STREAMLIT privileges on schemas where apps will be deployed.
* Viewer roles: Roles with USAGE privileges on apps for end users.

The following example shows how to create a role hierarchy for Streamlit apps:

```sqlexample
-- Create dedicated roles for Streamlit
CREATE ROLE streamlit_developer;
CREATE ROLE streamlit_viewer;

-- Grant hierarchy
GRANT ROLE streamlit_viewer TO ROLE streamlit_developer;

-- Grant privileges for app creation
GRANT USAGE ON DATABASE streamlit_db TO ROLE streamlit_developer;
GRANT USAGE ON SCHEMA streamlit_db.apps TO ROLE streamlit_developer;
GRANT CREATE STREAMLIT ON SCHEMA streamlit_db.apps TO ROLE streamlit_developer;
GRANT USAGE ON COMPUTE_POOL streamlit_compute_pool TO ROLE streamlit_developer;
GRANT USAGE ON INTEGRATION python_package_index TO ROLE streamlit_developer;

-- Grant privileges for app viewing
GRANT USAGE ON WAREHOUSE streamlit_wh TO ROLE streamlit_viewer;
GRANT USAGE ON DATABASE streamlit_db TO ROLE streamlit_viewer;
GRANT USAGE ON SCHEMA streamlit_db.apps TO ROLE streamlit_viewer;
GRANT USAGE ON STREAMLIT streamlit_db.apps.my_app TO ROLE streamlit_viewer;
```

The app developer also needs USAGE on `streamlit_wh`, but this is inherited from the
viewer role. For more information about required privileges, see [Privileges required to create and use a Streamlit app](privileges.md).

### Secrets management

Configure proper secrets management for apps that access external services or sensitive credentials:

1. Enable secrets access for apps by granting appropriate privileges:

   ```sqlexample
   -- Grant privileges on secrets to app owner role
   GRANT READ ON SECRET my_secret TO ROLE streamlit_developer;
   GRANT USAGE ON INTEGRATION my_external_access_integration TO ROLE streamlit_developer;
   ```
2. For container runtime apps, create SQL functions to wrap secret access rather than embedding
   secrets in app code.

For more information, see [Manage secrets and configure your Streamlit app](../app-development/secrets-and-configuration.md).

### Context functions and row-level security

In warehouse runtimes, if your apps use context functions (such as `CURRENT_USER()`) or access tables with row access policies,
grant the global READ SESSION privilege to app owner roles:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT READ SESSION ON ACCOUNT TO ROLE streamlit_developer;
```

> **Note:**
>
> Warehouse-runtime apps using `CURRENT_ROLE()` in row access policies will always return the app owner’s role, not
> the viewer’s role, because apps run with owner’s rights by default.

For more information and examples, see [Row access policies in Streamlit in Snowflake](../features/row-access.md).

In container runtimes, context functions on owner’s rights connections will return values from the owner role’s context
and so are not appropriate for user-targeted row access policies. However, restricted caller’s rights connections
return the viewer’s context. For more information, see [Restricted caller’s rights and Streamlit in Snowflake](../features/restricted-callers-rights.md).

### Container runtimes only: Package repository access and security

Configure one or more package indexes for container runtimes.

Container runtimes can install packages from external repositories like PyPI. You can
control package sources using managed package indexes, like JFrog Artifactory, or you can
use the default package index, PyPI. Regardless of which package index you use, you must
create an external access integration (EAI) to allow your apps to install dependencies.

Using a managed package index provides the following benefits:

* This helps prevent supply chain attacks and ensures packages come from trusted sources.
* It allows you to control which packages and versions are available to your apps.
* It provides audit trails for package installations.

For more information about how developers use EAIs to manage dependencies, see
[Managing dependencies for container runtimes](../app-development/dependency-management.md). For more information
about setting up a managed package repository with authentication, see [Example: Authenticate to a private JFrog Artifactory repository](../app-development/secrets-and-configuration.md).

#### Set up a PyPI EAI for app developers

Container-runtime apps attempt to install dependencies from PyPI by default. Snowflake provides a managed network
rule, `SNOWFLAKE.EXTERNAL_ACCESS.PYPI_RULE`, that allows egress to PyPI. You can use this rule to
create a PyPI EAI without defining your own network rule. For more information about managed network rules,
see [Snowflake-managed egress network rules](../../../user-guide/network-rules.md).

The following SQL commands create a PyPI EAI using the Snowflake-managed network rule and grant
USAGE to an app-development role:

```sqlexample
USE ROLE ACCOUNTADMIN;

CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION pypi_access_integration
  ALLOWED_NETWORK_RULES = (snowflake.external_access.pypi_rule)
  ENABLED = true;

GRANT USAGE ON INTEGRATION pypi_access_integration TO ROLE app_developer_role;
```

### Warehouse runtimes only: External offerings terms

Warehouse runtimes use conda to manage your app’s dependencies. If you want to use
Mapbox in your apps, you must acknowledge the
[External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).

For information about using this package, see [Using third-party packages from Anaconda](../../udf/python/udf-python-packages.md).

## Available security features

The following security features are available to enhance app security and governance.

### External access integrations

Control which external networks and services your apps can access:

* Create network rules to define allowed endpoints, including package indexes.
* Create external access integrations that reference network rules and authentication secrets.
* Assign external access integrations to Streamlit apps.

This prevents apps from making unauthorized outbound connections and provides audit trails for external access.

For more information, see [External network access in Streamlit in Snowflake](../features/external-access.md).

### Git integration

Integrate Streamlit apps with Git repositories for version control and change tracking:

* Grant appropriate privileges on Git repository objects (READ, WRITE, or OWNERSHIP).
* Use Git integration to maintain audit trails of code changes.
* Implement code review processes before deploying changes to production apps.

For more information, see [Sync Streamlit in Snowflake apps with a Git repository](../features/git-integration.md).

### Private connectivity

For organizations with strict network security requirements, configure private connectivity to ensure all
Streamlit traffic remains within your private network. Streamlit in Snowflake supports the following private connectivity options:

* [AWS PrivateLink](../../../user-guide/admin-security-privatelink.md)
* [Azure Private Link](../../../user-guide/privatelink-azure.md)
* [Google Cloud Private Service Connect](../../../user-guide/private-service-connect-google.md)

Private connectivity eliminates exposure to the public internet and provides additional network isolation.

For more information, see [Private connectivity for Streamlit in Snowflake](privatelink.md).

### Logging and tracing

Enable logging to monitor app behavior and troubleshoot issues:

* Configure an event table for your account. For more information, see [Event table overview](../../logging-tracing/event-table-setting-up.md).
* For warehouse runtimes, set appropriate log and trace levels for databases containing Streamlit apps.
  For more information, see [Setting levels for logging, metrics, and tracing](../../logging-tracing/telemetry-levels.md).
* Review logs regularly for security events, errors, and unusual behavior.

For container runtimes, Snowflake automatically captures standard output and standard error
from the container and stores them in the account’s event table. No additional configuration is needed.

For more information, see [Logging and tracing for Streamlit in Snowflake](../features/logging-tracing.md).

### Limit a user’s access to only Streamlit in Snowflake

To restrict a user to only access Streamlit in Snowflake and prevent them from accessing other parts of Snowflake, an account
administrator can add a custom user property via SQL or SCIM attribute.

* To restrict a user, use the [ALTER USER](../../../sql-reference/sql/alter-user.md) SQL command to set the ALLOWED_INTERFACES property
  to include STREAMLIT:

  ```sqlexample
  ALTER USER <user_name> SET ALLOWED_INTERFACES = (STREAMLIT);
  ```

If you’re provisioning users with SCIM APIs, you can set the same setting using the custom attribute `allowedInterfaces`.
For more information about SCIM custom attributes, see [SCIM user API reference](../../../user-guide/scim-user-api-reference.md).

After Streamlit-only access is configured, the user can’t access any part of Snowflake except the Streamlit in Snowflake apps for which they have permission.
Additionally, they can only access the app-viewer URL for those apps. If a Streamlit-only user attempts to navigate anywhere in Snowflake,
including any app-builder URL, it results in an access control error.

### Redirect app viewers to your identity provider

An account administrator can configure all app-viewer URLs to redirect to your identity provider (IdP) when an unauthenticated viewer accesses an app.
This process eliminates a step from the user’s login flow.

* To redirect unauthenticated users from app-viewer URLs to your IdP, use the [ALTER ACCOUNT](../../../sql-reference/sql/alter-account.md) SQL command to set
  the LOGIN_IDP_REDIRECT account property to include STREAMLIT:

  ```sqlexample
  ALTER ACCOUNT SET LOGIN_IDP_REDIRECT = (STREAMLIT = <your_security_integration>);
  ```

For more information about configuring your Snowflake account to use an IdP, see the following topics:

* [Configuring Snowflake to use federated authentication](../../../user-guide/admin-security-fed-auth-security-integration.md).
* [Configuring an identity provider (IdP) for Snowflake](../../../user-guide/admin-security-fed-auth-configure-idp.md).

## Best practices for administrators

The following best practices help maintain a secure Streamlit environment.

**Use dedicated roles and schemas:**

* Create separate schemas for development, testing, and production apps.
* Use different roles for each environment to prevent accidental changes to production apps.
* Grant production app ownership to service roles rather than individual user accounts.

**Implement least privilege access:**

* Grant only the minimum required privileges to each role.
* Regularly review and audit role memberships and privileges.
* Avoid granting ACCOUNTADMIN or other powerful roles to app owner roles unless absolutely necessary.

**Manage app lifecycle:**

* Establish processes for app approval and deployment.
* Require code reviews before promoting apps to production.
* Document which apps access sensitive data and require additional scrutiny.
* Regularly review and remove unused or deprecated apps.

**Monitor resource usage:**

* Set appropriate warehouse sizes for app workloads.
* Monitor compute costs and set up alerts for unusual usage patterns.
* For container runtimes, configure compute pools with appropriate MIN_NODES and MAX_NODES settings.
* Use separate warehouses for different app environments to isolate costs and resources.

For more information about resource management, see [Managing costs for Streamlit in Snowflake](billing.md) and
[Runtime environments for Streamlit apps](../app-development/runtime-environments.md).

**Use secure app development practices:**

* Never embed credentials or API keys directly in app code.
* Use Snowflake secrets for storing sensitive information.
* Validate and sanitize user inputs to prevent SQL injection.
* Limit the data exposed through apps to only what viewers need to see.
* Test apps thoroughly before sharing with wider audiences.

For more information about owner’s rights security considerations, see [Owner’s rights and app security](owners-rights.md).

**Perform regular security audits:**

* Review which roles have CREATE STREAMLIT privileges.
* Audit which apps access which data sources.
* Review external access integrations and network rules.
* Check for apps owned by former employees or inactive accounts.
* Review Git repository access and commit history.

Use the following queries to audit your Streamlit apps:

```sqlexample
-- List all Streamlit apps and their owners
SHOW STREAMLITS;

-- Check privileges on a specific app
SHOW GRANTS ON STREAMLIT streamlit_db.apps.my_app;

-- List all roles with CREATE STREAMLIT privileges
SHOW GRANTS OF CREATE STREAMLIT;
```

---
title: Sharing Streamlit in Snowflake apps
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/sharing-streamlit-apps.md
section: Streamlit in Snowflake
---

# Sharing Streamlit in Snowflake apps

This topic covers URLs for sharing Streamlit in Snowflake apps with or without the Snowsight interface.

## App URLs

Each Streamlit in Snowflake app has two URLs: app-builder URLs that show the Snowsight elements and
app-viewer URLs that hide them. This allows you to share view-only links with business users.

By default, sharing an app using the app-viewer URL lets end users change the URL to access other parts of Snowflake.
To enforce restricted access to only app-viewer URLs, an administrator must configure the ALLOWED_INTERFACES user
property. For more information, see [Limit a user’s access to only Streamlit in Snowflake](../object-management/security.md).

An administrator can also configure app-viewer URLs to redirect to your organization’s identity provider (IdP).
For more information, see [Essential security setup](../object-management/security.md).

### App-builder URLs

When you view an app from its app-builder URL, an object toolbar appears at the top of the app. The left side of the toolbar displays the
app’s name. The right side of the toolbar displays the app’s status. Additionally, if you have the necessary privileges to edit
the app, the toolbar contains an Edit button. If you have the necessary permission to share the app with other roles, the toolbar
contains a Share button.

If you select any app from the Streamlit Apps page in Snowsight, a new tab opens to its app-builder URL. This URL has the following
format:

```none
https://app.snowflake.com/<organization_name>/<account_name>/#/streamlit-apps/<app_database>.<app_schema>.<app_name>
```

### App-viewer URLs

When you view an app from its app-viewer URL, the app is displayed without any part of the Snowsight interface.
To enforce restricted access to only app-viewer URLs, an administrator must configure the ALLOWED_INTERFACES user
property. For more information, see [Limit a user’s access to only Streamlit in Snowflake](../object-management/security.md).

The app-viewer URL has the following format:

```none
https://app.snowflake.com/streamlit/<organization_name>/<account_name>/#/apps/<url_id>
```

Your app’s `url_id` is returned by DESCRIBE STREAMLIT.

## Share a Streamlit app

There are two sharing permission levels for Streamlit in Snowflake apps:

* View and share: If a user visits the app-builder URL, they can view the app and share it with other roles.
* View only: If a user visits the app-builder URL, they can only view the app. They can’t share it with other roles.

All roles with necessary USAGE privileges on the app can access the app-viewer URL, regardless of the sharing option.

To share a Streamlit app, do the following steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Select the Streamlit app you want to share.
4. Select Share.

   The Share Streamlit app window opens.
5. To add a role to the app’s sharing list, begin typing the name of the role.
6. Select the name of the role.

   The new role appears in the list of roles.
7. In the drop-down list on the right of the role, select a sharing permission level.
8. To copy your app’s URL, select Copy link.

   * To copy the app-builder URL, select For app builders from the dropdown list.
   * To copy the app-viewer URL, select For app viewers from the dropdown list.

   You can then send this URL through email or text.
9. Select Done.

---
title: Sync Streamlit in Snowflake apps with a Git repository
source: https://docs.snowflake.com/en/developer-guide/streamlit/features/git-integration.md
section: Streamlit in Snowflake
---

# Sync Streamlit in Snowflake apps with a Git repository

To use version control with your Streamlit apps, you can sync your app with a branch in a Git repository.

You must have already set up your Snowflake account to be connected to a Git repository and have created
a branch in that repository to use with your app. See [Setting up Snowflake to use Git](../../git/git-setting-up.md).

> **Note:**
>
> For Streamlit apps created using the [ROOT_LOCATION parameter](../../../sql-reference/sql/create-streamlit.md), Git integration is not supported.

## Create a Streamlit in Snowflake app from a file in a Git repository

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Next to + Streamlit, open the drop-down menu and select Create from repository.
4. For File location in repository, select the repository and branch in the repository that contain the Streamlit app file, then select
   the specific `.py` file. For details on connecting Snowflake to your Git repository, see [Setting up Snowflake to use Git](../../git/git-setting-up.md).
5. For App location, select a database and schema to contain the Streamlit app. You can’t change these after you create the app.
6. For Query warehouse and App warehouse, select a warehouse.
7. Select Create to create a Streamlit app from the `.py` file in your Git repository.

## Connect an existing Streamlit in Snowflake app with a Git repository

> **Note:**
>
> To connect a Streamlit app to a Git repository, you must use a role with the following privileges at a minimum:
>
> * OWNERSHIP or READ on the Git repository
> * USAGE on the schema that contains the Git repository

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then open or create a Streamlit app.
3. In the Files tab, next to the database object explorer, select Connect Git Repository.
4. For File location in repository, select the repository and the branch in the repository that you want to sync with the Streamlit app.
5. Select Select Folder.
6. When the prompt to commit your app to the Git repository appears, complete the commit steps outlined in Push changes to a branch in a Git repository.

After connecting your Streamlit app with a Git repository, you can select the branch name and open the repository details in Snowflake or Github.

## Push changes to a branch in a Git repository

If a Streamlit app is connected to a branch in a Git repository, after you make changes to the app you can push
your changes to the branch.

> **Note:**
>
> You must use a role with the OWNERSHIP or WRITE privilege on the Git repository to push your changes.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then open a Streamlit app.
3. Make any relevant changes to the Streamlit app.
4. Select Push to Git.
5. In the Push to Git dialog that appears, you can review the username and email address that are used to commit the changes
   to the specified branch and repository. If you need to update the username and email address, expand the Credentials section and
   update the Author name and Author email.
6. For Commit message, enter a message to include with your commit.
7. Expand the Credentials section to configure credentials. Enter your personal access token for the Git repository in the
   Personal access token field. This access token comes from the remote Git provider, such as GitHub.

   * This token is required to authenticate to the Git repository.
   * The token must have read and write access to the content of the repository for the commit to work.
   * Once entered, the token will be saved for future commits. You can update it during any future commits.
8. Select Push.

A confirmation message states that your changes were pushed successfully to your branch.

## Sync a Streamlit in Snowflake app with a remote branch in a Git repository

After you connect your app to a branch in a Git repository, you can sync any changes in the remote branch with your Streamlit app.

To sync a Streamlit app with a remote branch in a Git repository:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then open or create a Streamlit app.
3. On the Files tab in the database object explorer, select Pull.

Snowflake fetches any changes present on the remote repository branch and merges the contents with those changes.

### Merge conflicts

Snowflake attempts to resolve merge conflicts that occur during a sync. If there are merge conflicts that Snowflake
isn’t able to resolve, you’ll receive a message to discard or commit your changes to a new branch. When they
are committed to a new branch, use your Git provider to manually merge your changes from the new branch to the original
branch. Then you should pull the latest updates into your Streamlit app.

---
title: Troubleshooting Streamlit in Snowflake
source: https://docs.snowflake.com/en/developer-guide/streamlit/troubleshooting.md
section: Streamlit in Snowflake
---

# Troubleshooting Streamlit in Snowflake

The following scenarios can help you troubleshoot issues that can occur when working with Streamlit in Snowflake.

## You can’t connect to the Snowflake backend

In some cases, browser extensions can make local network access (LNA) requests as part of normal operation.
For example, a security extension may detect Streamlit in Snowflake’s cross-origin traffic and then make an LNA request.
Chrome 142 introduced mandatory LNA restrictions. Because you can’t configure LNA at the extension
level, you must either disable the extension or allow LNA for
Snowsight. For more information about Chrome’s LNA restrictions, see
[New permission prompt for Local Network Access](https://developer.chrome.com/blog/local-network-access)
in the Chrome for Developers blog.

|  |  |
| --- | --- |
| Error | Unable to connect to the Snowflake backend. |
| Cause | A browser extension attempted to make a local network access (LNA) request that was blocked by Chrome’s LNA restrictions. |
| Solution | Disable the browser extension or allow LNA for Snowsight. |

Contact Snowflake support if the issue persists after performing the following steps:

* Verify you’re on the latest Snowsight release.
* Verify that the “Local network access” permission is enabled for Snowsight. If you use an enterprise managed browser,
  contact your IT administrator.
* Verify that load failures continue after temporarily disabling extensions.
* Verify that load failures continue after disabling the Chrome flag via `chrome://flags#local-network-access-check`.

## You can’t load the Streamlit app

Each Streamlit app running in Streamlit in Snowflake uses a unique subdomain.

Ensure that `*.snowflake.app` and `*.snowflake.com` are on the allowlist in your network (including content filtering systems), and
can connect to Snowflake. For Streamlit apps using container runtimes, also add `*.snowflakecomputing.app` to the allowlist.
When these domains are on the allowlist, your apps can communicate with Snowflake servers without any restrictions.
However, in some cases adding these domains may not be sufficient due to network policies blocking subpaths under them. If this occurs,
contact your network administrator.

In addition, to prevent any issues connecting to the Snowflake backend, ensure that WebSockets are not blocked in your network configuration.

|  |  |
| --- | --- |
| Error | ```output Could not reload streamlit files. Error: 092806 (P0002): The specified Streamlit was not found. ``` |
| Cause | The Snowflake WebSocket connection cannot reach the endpoint associated with the application. |
| Solution | Add \*.snowflake.app to the allowlist on the organization’s firewall configuration. For Streamlit apps using container runtimes, also add \*.snowflakecomputing.app to the allowlist. |

## You can’t see your data or change your database

You might not be able to see your data or change the database, warehouse, or role because Streamlit apps run with owner’s rights by default, which means that they run with the privileges of the owner, not the privileges of the caller. Streamlit apps use the database and schema that the Streamlit in Snowflake app was created in, not the database and schema that the caller is currently using.

For more information, see [Understanding owner’s rights and Streamlit in Snowflake apps](object-management/owners-rights.md). To run a container-runtime app with the viewer’s
privileges instead, see [Restricted caller’s rights and Streamlit in Snowflake](features/restricted-callers-rights.md).

## Streamlit library feature doesn’t work

Ensure that the Streamlit library version and feature that you use are supported by Streamlit in Snowflake. For more information, see [Supported versions of the Streamlit library in warehouse runtimes](app-development/dependency-management.md) and [Unsupported Streamlit features](limitations.md).

To ask questions on features in the open-source Streamlit library, see [Streamlit Community Forum](https://discuss.streamlit.io/).

---
title: Understanding owner’s rights and Streamlit in Snowflake apps
source: https://docs.snowflake.com/en/developer-guide/streamlit/object-management/owners-rights.md
section: Streamlit in Snowflake
---

# Understanding owner’s rights and Streamlit in Snowflake apps

## Introduction

The model for Streamlit in Snowflake closely maps to the owner’s rights model in
[stored procedures](../../stored-procedure/stored-procedures-rights.md). This eliminates
the need for service account tokens and integrates with the authentication, access control,
and network policy features that Snowflake provides.

## About owner’s rights in Streamlit in Snowflake

By default, Streamlit apps adhere to the following rules within a session:

* Run with the privileges of the owner, not the privileges of the caller. To run a container-runtime
  app with restricted caller’s rights instead, see [Restricted caller’s rights and Streamlit in Snowflake](../features/restricted-callers-rights.md).
* Run with the warehouse provisioned by the app owner.
* Use the database and schema that the Streamlit in Snowflake app was created in, not the database and
  schema that the caller is currently using.

## About app creation

Streamlit apps are schema-level objects. To create a Streamlit app, you need appropriate
privileges on the database, schema, and warehouse. When an app is created, it runs with
the role of the user who originally created the app.

For more information, see [Privileges required to create a Streamlit app](privileges.md).

## Viewing an app

The app owner can choose which roles have permission to use the app. Viewers can interact
with the app and see anything displayed on the screen. All of the privileges of the app owner’s role
can be used by the app when shared with other roles, regardless of whether the privilege has WITH GRANT enabled.

For more information, see [Privileges required to view a Streamlit app](privileges.md).

## Restrictions on owner’s rights

Because apps run with owner’s rights, they have several additional restrictions.
If you use any context functions, you must grant the global READ SESSION privilege
to the app owner role. For more information, see [Row access policies in Streamlit in Snowflake](../features/row-access.md).

Warehouse-runtime apps run as a stored procedure and are subject to the same restrictions as
owner’s rights stored procedures. For example, the following items are affected:

* The built-in functions that can be called from inside a stored procedure.
* The ability to execute [ALTER USER](../../../sql-reference/sql/alter-user.md) statements.
* DESCRIBE, SHOW, and LIST commands.
* The types of SQL statements that can be called from inside a stored procedure.

For more information, see [Additional restrictions on owner’s rights stored procedures](../../stored-procedure/stored-procedures-rights.md).
Container-runtime apps don’t run as stored procedures and aren’t subject to these
additional restrictions.

## Owner’s rights and app security

Streamlit apps running in Streamlit in Snowflake run with owner’s rights and follow the same security model as other
Snowflake objects that run with owner’s rights.

Although Snowflake provides security features like authentication, role-based access control, and
admin controls, responsibility for the security of apps is shared with app creators and owners.

Use caution, for example, when granting a role with write privileges to another Snowflake user. Write
privileges allow the user to modify the Streamlit app.

In general, Snowflake recommends using role-based access control and dedicated roles for creating and
viewing Streamlit apps. Additionally, you should follow appropriate security practices while developing
Streamlit apps inside Snowflake and perform regular security audits of the Streamlit apps in your account.

---
title: Understanding the different types of Streamlit objects
source: https://docs.snowflake.com/en/developer-guide/streamlit/migrations-and-upgrades/overview.md
section: Streamlit in Snowflake
---

# Understanding the different types of Streamlit objects

Streamlit objects in Snowflake have two important differences that
influence their capabilities, limitations, and management:

* How the source location is specified: using the ROOT_LOCATION parameter (legacy) or
  the FROM parameter (recommended).
* The runtime environment: warehouse runtime or container runtime.

This page explains the differences between the source location specifications, which
in turn affect the runtime environments that can be used.

For more information about runtime environments, see [Runtime environments for Streamlit apps](../app-development/runtime-environments.md).
For a checklist to migrate from warehouse to container runtimes, see [Migrating between runtime environments](runtime-migration.md).

## Source location

When you create Streamlit apps in Snowflake, you can use two different parameters to
specify the location of your source files: ROOT_LOCATION or FROM. ROOT_LOCATION is a
legacy parameter that has several restrictions. Currently, FROM is the recommended
parameter for new apps and is required to access the latest Streamlit in Snowflake features, including
container runtimes. This page explains how to differentiate between the two types
of apps. To enable the latest Streamlit in Snowflake features, you must upgrade your legacy Streamlits.
For more information, see [Migrating between runtime environments](runtime-migration.md).

### FROM parameter (recommended)

The FROM parameter copies files from a specified location into an embedded stage
within the Streamlit object. This is the current recommended approach for creating
Streamlit apps.

**Benefits:**

* Supports both warehouse and container runtimes.
* Enables multi-file editing in Snowsight.
* Compatible with Git integration features.
* Uses an embedded, versioned stage within the Streamlit object.

**Syntax:**

```sqlexample
CREATE STREAMLIT my_app
FROM '@my_stage/app_folder'
MAIN_FILE = 'streamlit_app.py';
```

### ROOT_LOCATION parameter (legacy)

The ROOT_LOCATION parameter creates an app that references an internal stage as
its source. This is a legacy approach with several limitations. If you created
your app using an older version of Snowsight, your app’s root location
might be a snow URL (`snow://`) instead of a named stage.

**Limitations:**

* Only supports warehouse runtime (can’t use container runtime).
* No multi-file editing support in Snowsight.
* Can’t use Git integration features.
* Requires ongoing access to the internal stage.

**Syntax:**

```sqlexample
CREATE STREAMLIT my_app
ROOT_LOCATION = '@my_stage/app_folder'
MAIN_FILE = '/streamlit_app.py';
```

## Identifying your app’s source location type

You can determine which parameter was used to create your app by examining the
[DESCRIBE STREAMLIT](../../../sql-reference/sql/desc-streamlit.md) output:

```sqlexample
DESCRIBE STREAMLIT my_app;
```

* ROOT_LOCATION-based apps return fewer columns and include a `root_location` column.
* FROM-based apps return more columns and include a `live_version_location_uri` column.

When you edit an app in Snowsight, you are pushing updates to either the `root_location`
or the `live_version_location_uri`, depending on the app type. You can update both types of apps
using SQL commands to PUT or COPY FILES to these locations.

## Snowpark Container Services

Run containerized workloads inside Snowflake using OCI-compatible images.

---
title: Common Setup for Snowpark Container Services Tutorials
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/common-setup.md
section: Snowpark Container Services
---

App Development

# Common Setup for Snowpark Container Services Tutorials

## Introduction

This topic provides instructions for the common setup required for all Snowpark Container Services tutorials provided in this
documentation.

## Configure prerequisites

Review the following prerequisites to ensure you can complete the tutorials:

* **A Snowflake account:** Note that trial accounts are not supported.
* **SnowSQL, the command-line client for executing SQL commands (optional):** You can use any Snowflake client that supports
  executing SQL commands and uploading files to a Snowflake stage. The tutorials are tested using the SnowSQL and the
  [Snowsight](../../../user-guide/ui-snowsight-gs.md) web interface. For instructions to install this command-line client, see
  [Installing SnowSQL](../../../user-guide/snowsql-install-config.md).
* **Docker Desktop:** These tutorials provide instructions for using Docker Desktop. For installation instructions, see
  <https://docs.docker.com/get-docker/>. Note that you can use any OCI-compliant clients to create images, such as Docker, Podman,
  or Nerdctl.

## Create Snowflake objects

Execute the SQL provided using either the SnowSQL or the Snowsight.

1. Login to Snowflake as a user with the ACCOUNTADMIN role.
2. Using the ACCOUNTADMIN role, execute the following script, replacing `user_name` with the name of your Snowflake user who will test the tutorials. For these tutorials, you might choose the same user who executes this script or another user in your Snowflake account. The script does the following:

   * Creates a role (`test_role`) and other Snowflake objects. To create the role and objects, you must use the ACCOUNTADMIN role.
     (This restriction helps to control costs and manage business information risks.) The script also grants the `test_role` role
     the privileges needed to manage the newly created objects.
   * Grants the role to the specified Snowflake user, who then uses the role to explore the tutorials.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   CREATE ROLE test_role;

   CREATE DATABASE IF NOT EXISTS tutorial_db;
   GRANT OWNERSHIP ON DATABASE tutorial_db TO ROLE test_role COPY CURRENT GRANTS;

   CREATE OR REPLACE WAREHOUSE tutorial_warehouse WITH
     WAREHOUSE_SIZE='X-SMALL';
   GRANT USAGE ON WAREHOUSE tutorial_warehouse TO ROLE test_role;

   GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE test_role;

   CREATE COMPUTE POOL tutorial_compute_pool
     MIN_NODES = 1
     MAX_NODES = 1
     INSTANCE_FAMILY = CPU_X64_XS;
   GRANT USAGE, MONITOR ON COMPUTE POOL tutorial_compute_pool TO ROLE test_role;

   GRANT ROLE test_role TO USER <user_name>
   ```

   Note that:

   * You create a warehouse because the services (including job services) can run SQL DML statements (such as SELECT and INSERT). Snowflake executes these statements in the warehouse.
   * In tutorial 1, you create a service that exposes an endpoint as public to allow users to access the service from the public web (ingress). To create this service:

     + The role `test_role` must have the BIND SERVICE ENDPOINT privilege on the account.
     + The current implementation requires a security integration, which the script creates.
   * A [compute pool](../working-with-compute-pool.md) is a collection of one or more virtual machine (VM) nodes on which Snowflake runs your services.
3. Make sure you are logged in to Snowflake as the user specified in the preceding script.
4. Using the `test_role` role, execute the following script to create database-scoped objects common to all the tutorials.

   ```sqlexample
   USE ROLE test_role;
   USE DATABASE tutorial_db;
   USE WAREHOUSE tutorial_warehouse;

   CREATE SCHEMA IF NOT EXISTS data_schema;
   CREATE IMAGE REPOSITORY IF NOT EXISTS tutorial_repository;
   CREATE STAGE IF NOT EXISTS tutorial_stage
     DIRECTORY = ( ENABLE = true );
   ```

   Note that:

   * You create an image repository to store your service code (container images).
   * You create a Snowflake stage to store your service specification files in tutorial 2 and 3.

## Verify that you are ready to continue

1. To verify that you have the objects needed for the tutorials, execute the following commands:

   ```sqlexample
   SHOW COMPUTE POOLS; --or DESCRIBE COMPUTE POOL tutorial_compute_pool;
   ```

   ```sqlexample
   SHOW WAREHOUSES;
   ```

   ```sqlexample
   SHOW IMAGE REPOSITORIES;
   ```

   ```sqlexample
   SHOW STAGES;
   ```
2. To verify that you have your account information (organization and account names), use one of the following methods:

   * Find the information on the Snowsight web interface, in the lower left corner of the Home page.
   * In the SnowSQL CLI, execute SHOW IMAGE REPOSITORIES. The command returns the repository URL, including the organization and
     account names.

     **Example**

     ```sqlexample
     <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
     ```

### What’s next?

You can now explore [Tutorial 1](tutorial-1.md).

---
title: Compute pool surcharges in Snowflake Native Apps with containers
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/provider-pricing-surcharges.md
section: Snowpark Container Services
---

# Compute pool surcharges in Snowflake Native Apps with containers

This preview lets Snowflake Marketplace providers bill based on usage
of compute pools managed by a Snowflake Native App with Snowpark Container Services (SPCS).

> **Note:**
>
> Compute pool surcharges apply only to Snowflake Native Apps with Snowpark Container Services. The app must be attached to a paid listing on Snowflake Marketplace.

## About billing for compute pools

If you have a paid listing on Snowflake Marketplace for Snowflake Native Apps with Snowpark Container Services (also called an app with containers), then you can add a surcharge for SPCS compute pool (CP) resources created by the app during setup. During this preview, we support combining SPCS CP surcharges with a base charge *only*.

The Marketplace invoice for a provider is itemized by listing, displaying a total usage-based amount per month. The consumer receives a detailed report on usage-based charges.

The surcharge pricing model is available only if all of the following conditions apply:

* The app must use at least one SPCS container with compute pools.
* The app must automatically create its compute pools during installation.
* The app must automatically request privileges during installation.
* You must be participating in the open preview for Snowflake Native Apps with Snowpark Container Services (introduced June 2024).
  For more information on this preview, see [Add a compute pool to an app with containers](../native-apps/container-compute-pool.md).
* The app must be available on the Snowflake Marketplace as a paid listing
  before you can configure surcharges.

## Developing a Native App’s compute pools for surcharging

To update your app code so that it correctly creates compute pools for surcharging, refer to the following information:

1. Add the CREATE COMPUTE POOL command to the setup script.
2. Request the CREATE COMPUTE POOL privilege in the manifest file.

To be surcharged, compute pool names must be unique and should describe the purpose, usage, owner role, and/or associated app of the compute pool.

If any compute pools are added after setup (for example, by the consumer), the listing prevents the app from running.

> **Note:**
>
> Consumer-created compute pools cannot run an app with containers from a listing.

## How to add a compute pool surcharge using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Navigate to the listing you want to work with, or create a new listing.
4. Add the data product, if it isn’t already attached.
5. To configure pricing, click Pricing & Trial » Add, located in
   the Data Product » Access & Pricing section of the screen.
6. If Usage-based is not already selected at the top,
   click on it to display the relevant options.
7. To start configuring charges for computing resources, click + Compute Pool Surcharge in
   the Snowpark Container Services Compute Pool Surcharge section.

   For each compute pool that you want to display or charge for:

   1. Enter the pre-configured name of the compute pool.
      This name must be the same name as in the app.
   2. Add an amount to bill per credit (in USD). For compute pools that should be displayed
      but have no surcharge, set this amount to $0.
   3. If you have more compute pools to add, click + Compute Pool Surcharge again.
   4. Continue until you have entered all the compute pools you want to display or charge for.
8. (Optional) To set an optional maximum on the charges billed to per month,
   add the amount in Maximum Monthly Charge in the Charging Limit section.
9. To save your work, click Save. To exit without saving, click Cancel.

## View pricing selections

Pricing selections are displayed on your view of the listing page. To view them,
select Preview on the listing page. To view
pricing selections as they appear to the consumer, on the Preview page, select Buy.

> **Note:**
>
> You should test to ensure that the surcharge is configured properly.

## Reporting

To report on usage, use the following views in the [DATA_SHARING_USAGE](../../sql-reference/data-sharing-usage.md) schema:

* [MARKETPLACE_PAID_USAGE_DAILY View](../../collaboration/views/marketplace-paid-usage-daily-ds.md)
* [MARKETPLACE_PROVIDER_SPCS_USAGE View](../../collaboration/views/marketplace-provider-spcs-usage-ds.md)
* [MONETIZED_USAGE_DAILY View](../../collaboration/views/monetized-usage-daily-ds.md)

This preview adds new values to the CHARGE_TYPE field in the [MARKETPLACE_PAID_USAGE_DAILY View](../../collaboration/views/marketplace-paid-usage-daily-ds.md)
and the [MONETIZED_USAGE_DAILY View](../../collaboration/views/monetized-usage-daily-ds.md):

* SPCS_COMPUTE_POOL_SURCHARGE - The amount of the SPCS compute pool surcharge.
* MAX_SPCS_COMPUTE_POOL_SURCHARGE_REACHED - No further charge. When the consumer ran additional
  queries, they had already reached the maximum total SPCS compute pool surcharge for this listing.

```sqlexample
SELECT listing_global_name,
   listing_display_name,
   charge_type,
   charge
FROM SNOWFLAKE.DATA_SHARING_USAGE.MARKETPLACE_PAID_USAGE_DAILY
WHERE charge_type='SPCS_COMPUTE_POOL_SURCHARGE';
```

```sqlexample
SELECT
  usage_date,
  listing_display_name,
  consumer_account_name,
  consumer_organization_name,
  charge_type,
  gross_charge
FROM SNOWFLAKE.DATA_SHARING_USAGE.MONETIZED_USAGE_DAILY
WHERE charge_type='SPCS_COMPUTE_POOL_SURCHARGE';
```

## Limitations

* You can combine compute pool surcharges with a base charge, but not with any other
  usage-based pricing model. If you have both a base charge and compute pool surcharges, the base charge won’t be reflected in MONETIZED_DAILY_USAGE views or the MARKETPLACE_DISBURSEMENT_REPORT views. However, both the base charge and the surcharge appear on the invoice.
* Compute pool surcharges can’t be combined with subscription-based pricing.
* Compute pool surcharges are calculated per day, not per hour.
* Compute pool surcharges are only calculated in US dollars.
* Time-based trials are supported. Other types of trials (usage-based or limited functionality)
  are not supported.

## Frequently asked questions

**How often is usage metered for compute pool surcharges?**

Usage is metered every 5 minutes.

---
title: Configuring private connectivity
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/private-connectivity.md
section: Snowpark Container Services
---

# Configuring private connectivity

This section explains inbound private connectivity (to endpoints exposed by Snowpark Container Services) and outbound private connectivity (egress traffic from your service).

## Inbound connectivity

Snowpark Container Services exposes three endpoints:

* **Image registry service:** It serves the OCIv2 API for you to upload your application images to a repository in your Snowflake account. For more information, see [Snowpark Container Services: Working with an image registry and repository](working-with-registry-repository.md).
* **Public endpoints exposed by a service:** You can allow users, in your account, access to your service from outside Snowflake (ingress) by declaring one or more endpoints as public. For more information, see [Using a service](working-with-services.md).
* **Authentication endpoint:** When a user attempts to access a service’s public endpoint, Snowpark Container Services redirects the user through this endpoint for authentication.

This section explains how to enable private connectivity to these endpoints.

> **Note:**
>
> * When configuring private connectivity, you control the DNS resolution; there are no DNS records controlled by Snowflake.

### Configure prerequisites

To enable private connectivity to Snowpark Container Services, first configure private connectivity to connect your Snowflake account to your cloud provider account’s network. For more information, see [Inbound private connectivity to Snowflake service](../../user-guide/private-connectivity-inbound.md).

In addition, create CNAME records in your DNS for the `regionless-privatelink-account-url` value returned from calling [SYSTEM$GET_PRIVATELINK_CONFIG](../../sql-reference/functions/system_get_privatelink_config.md).

### Configure public endpoints access

To enable ingress requests from your network to your service’s public endpoint:

1. Call [SYSTEM$GET_PRIVATELINK_CONFIG](../../sql-reference/functions/system_get_privatelink_config.md) in your Snowflake account to get a list of hostnames for your account. In the output:

   1. `app-service-privatelink-url` key provides a wildcard hostname for Snowpark Container Services public endpoints.
   2. `spcs-auth-privatelink-url` key provides the hostname required for routing Snowpark Container Services authentication.
2. To access Snowflake via private connectivity, you must create CNAME records in your DNS to resolve the endpoint values from the SYSTEM$GET_PRIVATELINK_CONFIG function to your private network.

   > **Note:**
   >
   > Hostname routing at an account level is currently not supported.

### Configuring access to Snowpark Container Services Registry in Snowflake

1. Call SYSTEM$GET_PRIVATELINK_CONFIG in your Snowflake account to get a list of hostnames for your account. In the output, the `spcs-registry-privatelink-url` key provides the hostname required for routing Snowpark Container Services image registry requests.
2. To access Snowflake via private connectivity, it is necessary to create records in your DNS to resolve the endpoint values from the SYSTEM$GET_PRIVATELINK_CONFIG function to your private network.

### Security considerations

The following apply for public endpoints that services expose:

* Each endpoint can serve both HTTPS-encrypted traffic and WebSocket-encrypted traffic.
* Each endpoint has their own top-level domain, with no shared elements with Snowsight. This ensures that browsers isolate services from Snowsight and services from each other, mitigating risks of cross-origin attacks.

## Outbound connectivity

Instead of routing network egress via the public internet, you might opt to direct your service’s egress traffic through a private connectivity endpoint. For more information, see [Network egress using private connectivity](service-network-communications.md).

---
title: Running a Snowpark Container Services job as a Snowflake task
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/jobs-as-tasks.md
section: Snowpark Container Services
---

# Running a Snowpark Container Services job as a Snowflake task

You can run a Snowpark Container Services [job service](working-with-services.md) as a Snowflake task. When you run a job service as a Snowflake task, the integration enables scenarios that leverage the robust containerization and scalability of Snowpark Container Services. This process occurs directly within your scheduled or event-triggered data pipelines that are managed by Snowflake Tasks.

For example, the following [CREATE TASK](../../sql-reference/sql/create-task.md) command creates a task to run a job service every hour. The command provides the job details by using the [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) SQL command:

```sqlexample
CREATE TASK job_task
SCHEDULE = '60 MINUTE'
AS
  EXECUTE JOB SERVICE
    IN COMPUTE POOL my_compute_pool
    FROM SPECIFICATION $$
    spec:
      containers:
      - name: main
        image: /my_db/my_schema/my_repository/my_job_image:latest
        args:
          - "--process_data"
    $$;
```

> **Note:**
>
> * Snowflake job tasks supports the [Serverless model](../../user-guide/tasks-intro.md), so you don’t specify a warehouse in the CREATE TASK statement.
> * When you run a job service as a task, you should run the job service synchronously, otherwise the task will report completion before the job service is completed.

## Passing data into and out of jobs running as tasks

[Task graphs](../../user-guide/tasks-graphs.md) enable you to create and manage complex, multi-step data pipelines that seamlessly integrate job services running as tasks. You can use the [supported system functions](../../user-guide/tasks-graphs.md) in your job service code to access the task context and use it to fetch task graph configuration, and runtime information of the executing task.

When you run job services as tasks, you can use the following data-sharing options between tasks in a task graph:

* **Predecessor return value mechanism:** In a task graph, you can pass output of a task as input to a subsequent, dependent task. Snowflake recommends this option when you pass small metadata, such as a file path, status string, or some other ID value. For more information, see [Pass return values between tasks](../../user-guide/tasks-graphs.md).

  Just as with a SQL task, a job running as a task can retrieve the return value of a preceding task. Similarly, a job can also provide a return value for a subsequent task.
* **Common persistent storage mechanism:** When you transfer large datasets, such as files, Snowflake recommends that you persist the data in persistent storage, such as a Snowflake stage or table, and ensure that the tasks in your task graph can access the storage.

> **Note:**
>
> Sessions aren’t shared between job services. Therefore, you can’t use temporary tables or session variables as a way to share data because these are session-scoped objects.

## Example

For an example, see [Tutorial: Run a Snowflake Container Services job as a Snowflake task](tutorials/advanced/run-job-as-task.md).

---
title: Service networking
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/service-network-communications.md
section: Snowpark Container Services
---

# Service networking

With Snowpark Container Services services, there are three types of networking to consider:

* Ingress networking: How to connect from outside Snowflake into your service.
* Egress networking: How your service connects to resources outside Snowflake.
* Service-to-service communication in Snowpark Container Services.

The following sections explain how to configure each kind of networking.

## Configure service ingress

To allow anything to interact with your service from the internet, you declare the network ports on which your service is
listening as endpoints in the service specification file. These endpoints control ingress.

By default, service endpoints are private. Only
[service functions](working-with-services.md) and
[service-to-service communications](working-with-services.md)
can make requests to the private endpoints. You can declare an endpoint as public to allow requests to an endpoint from the
internet as shown in the following example service specification.

> **Note:**
>
> Creating a public endpoint is a privileged operation and the service’s owner role must have the BIND SERVICE ENDPOINT privilege on the account.

```yaml
endpoints:
- name: <endpoint name>
  port: <port number>
  protocol : < TCP / HTTP >
  public: true
  corsSettings:                  # optional CORS configuration
    Access-Control-Allow-Origin: # required list of allowed origins
      - <origin>                 # for example, "http://example.com"
      - <origin>
        ...
    Access-Control-Allow-Methods: # optional list of HTTP methods
      - <method>
      - <method>
        ...
    Access-Control-Allow-Headers: # optional list of HTTP headers
      - <header-name>
      - <header-name>
        ...
    Access-Control-Expose-Headers: # optional list of HTTP headers
      - <header-name>
      - <header-name>
        ...
```

For an example, see [Tutorial 1](tutorials/tutorial-1.md).

When you make an endpoint public, Snowflake allocates a unique hostname to the public endpoint. Snowflake sends incoming requests with that hostname to the service.

If you want to expose multiple service endpoints behind a single host name, you can create a *gateway*. A gateway routes ingress requests to one or more service endpoints, based on the gateway configuration. For more information about gateways and scenarios for creating a gateway, see [Use Gateways to route ingress requests to multiple endpoints](gateway.md)

### Ingress connection timeout

Ingress endpoints have a timeout of 90 seconds. If there is no activity on a connection to an ingress endpoint for 90 seconds, Snowflake terminates the connection. If your application needs longer connectivity, use polling or WebSockets.

### Ingress web browser authentication logout

If you’re building a web app that runs as a service, you have the option to allow users to log out from your app by directing them to `/sfc-endpoint/logout`.

After logging out, the user will be required to re-authenticate to Snowflake to access the public endpoint of the service.

### Ingress and web app security

You can create a Snowpark Container Services service for web hosting using the public endpoint support (network ingress). For added
security, Snowflake employs a proxy service to monitor incoming requests from clients to your service and outgoing responses from your
service to the clients. This section explains what the proxy does and how it impacts a service deployed to Snowpark Container Services.

> **Note:**
>
> When you test a service locally, you are not using the Snowflake proxy and therefore there will be differences between your experience running
> a service locally as opposed to when deployed in Snowpark Container Services. Review this section and update your local setup for better
> testing.

For example:

* The proxy does not forward an incoming HTTP request if the request uses a banned HTTP method.
* The proxy sends a 403 response to the client if the Content-Type header in the response indicates that the response contains an executable.

Additionally, the proxy can also inject new headers and alter existing headers in the request and the response, with your container and data security in
mind.

For example, upon receiving a request, your service might send HTML, JavaScript, CSS, and other content for a web page to the client browser
in the response. The web page in the browser is part of your service, acting as the user interface. For security reasons, if your service has restrictions (such as a restriction on making network connections to other sites), you might also want the web page for your service to have the same restrictions.

By default, services have limited permissions to access the internet. The browser should also restrict the client app from accessing the internet and potentially sharing data in most cases. If you set up an External Access Integration (EAI) to allow your service to access `example.com` (see Configure service egress), the web page for your service should also be able to access `example.com` through your browser.

The Snowflake proxy applies the same network restrictions on the service and the web page by adding a `Content-Security-Policy` (CSP) header in the response. By default, the proxy adds a baseline CSP in the
response to protect against common security threats. Browser security is a best effort to balance functionality with security, it is a shared responsibility to ensure this baseline is appropriate for your use case. In addition, if your service is configured to use an EAI, the proxy
applies the same network rules from the EAI to the CSP for the web page. This CSP enables the web page in the browser to access the same sites that the service can access.

Snowflake provides CORS support which you configure in the service specification.

The Snowflake proxy returns the CORS settings defined in the service specification. Note that the proxy removes any CORS headers returned by the service.

The following CORS headers are set by default:

* `Access-Control-Expose-Headers` header always reports the following header names, in addition to headers configured in the service specification for the endpoint.

  + `X-Frame-Options`
  + `Cross-Origin-Opener-Policy`
  + `Cross-Origin-Resource-Policy`
  + `X-Content-Type-Options`
  + `Cross-Origin-Embedder-Policy`
  + `Content-Security-Policy-Report-Only`
  + `Content-Security-Policy`
* `Access-Control-Max-Age` is set to two hours.
* `Access-Control-Allow-Credentials` is set to true.

In addition, Snowflake sets the `Vary` header with the value `Origin` to indicate to the browser that based on the value of the `Origin`, the value to `Access-Control-Allow-Origin` might be different.

The `Authorization` header is required to make the CORS request. You can specify a programmatic access token (PAT) in this header (`Authorization: "Snowflake Token=\"${patToken}\""`). For information on generating a programmatic access token, see [Using programmatic access tokens for authentication](../../user-guide/programmatic-access-tokens.md).

The following sections explain how the Snowflake proxy handles incoming requests for your service and modifies the outgoing responses from your service to the clients.

#### Requests incoming to the service

When a request arrives, the proxy does the following before forwarding the request to the service:

* **Incoming requests with banned HTTP methods:** If an incoming HTTP request uses any of the following banned HTTP methods, the proxy does not forward the request to your service:

  + `TRACE`
  + `CONNECT`
* **Incoming requests header scrubbing:** Snowflake proxy removes the following request headers if present:

  + `X-SF-SPCS-Authorization`
  + `Authorization`: Only removed if it contains a Snowflake token; otherwise, it is passed through to your service.

#### Responses outgoing to the clients

The Snowflake proxy applies these modifications to the response sent by your service before forwarding the response to the client.

* **Header Scrubbing:** Snowflake proxy removes these response headers, if present:

  + `X-XSS-Protection`
  + `Server`
  + `X-Powered-By`
  + `Public-Key-Pins`
* **CORS headers manipulation:** See Ingress and CORS considerations.
* **Content-Type response header:** If your service response includes the Content-Type header with any of the following MIME type values
  (that indicate an executable), Snowflake proxy does not forward that response to the client. Instead, the proxy sends a `403 Forbidden`
  response.

  + `application/x-msdownload`: Microsoft executable.
  + `application/exe`: Generic executable.
  + `application/x-exe`: Another generic executable.
  + `application/dos-exe`: DOS executable.
  + `application/x-winexe`: Windows executable.
  + `application/msdos-windows`: MS-DOS Windows executable.
  + `application/x-msdos-program`: MS-DOS executable.
  + `application/x-sh`: Unix shell script.
  + `application/x-bsh`: Bourne shell script.
  + `application/x-csh`: C shell script.
  + `application/x-tcsh`: Tcsh shell script.
  + `application/batch`: Windows batch file.
* **X-Frame-Options response header:** To prevent clickjacking attacks, the Snowflake proxy sets this response header to `DENY`, preventing other web pages from using an iframe to the web page for your service.
* **Cross-Origin-Opener-Policy (COOP) response header:** Snowflake sets the COOP response header to `same-origin` to prevent referring cross-origin windows from accessing your service tab.
* **Cross-Origin-Resource-Policy (CORP) response header:** Snowflake sets the CORP header to `same-origin` to prevent external sites from loading resources exposed by the ingress endpoint (for example, in an iframe).
* **X-Content-Type-Options response header:** Snowflake proxy sets this header to `nosniff` to ensure the clients do not change the MIME
  type stated in the response by your service.
* **Cross-Origin-Embedder-Policy (COEP) response header:** Snowflake proxy sets the COEP response header to `credentialless`, which means
  when loading a cross-origin object such as an image or a script, if the remote object does not support Cross-Origin Resource Sharing (CORS) protocol, Snowflake does not send the credentials when loading it.
* **Content-Security-Policy-Report-Only response header:** Snowflake proxy replaces this response header with a new value directing
  the client to send the CSP reports to Snowflake.
* **Content-Security-Policy (CSP) response header:** By default the Snowflake proxy adds the following baseline CSP to protect against common
  web attacks.

  ```none
  default-src 'self' 'unsafe-inline' 'unsafe-eval' blob: data:; object-src 'none'; connect-src 'self'; frame-ancestors 'self';
  ```

  There are two content security policy considerations:

  + In addition to the baseline content security policy that proxy adds, the service itself can explicitly add a CSP in the response. A
    service might choose to enhance security by adding a stricter CSP. For example, a service might add the following CSP to allow scripts only from `self`.

    ```none
    script-src 'self'
    ```

    In the resulting response sent to the client, there will be two CSP headers. Upon receiving the response, the client browsers then apply
    the strictest content security policy that includes the additional restrictions specified by each policy.
  + If you configure an External Access Integration (EAI) to allow your service to access an external site
    (Configure service egress), the Snowflake proxy creates a CSP that allows your web page to access that site. For example, suppose a
    network rule associated with an EAI allows your service egress access to `example.com`. Then, Snowflake proxy adds this CSP response header:

    ```html
    default-src 'self' 'unsafe-inline' 'unsafe-eval' http://example.com https://example.com blob: data:; object-src 'none'; connect-src 'self' http://example.com https://example.com wss://example.com; frame-ancestors 'self';
    ```

    Browsers honor the content access policy received in the response. In this example, browsers allow the app access to `example.com` but not other sites.

### Ingress and CORS considerations

By default, browsers block web apps hosted on one server from sending requests to another server with a different hostname. For instance, if you host a web app outside Snowpark Container Services that needs to interact with a backend service deployed within Snowpark Container Services, this restriction applies.

CORS (Cross-Origin Resource Sharing) enables a Snowpark Container Services service to tell the browsers to allow requests from web apps hosted outside its environment. You can configure each public endpoint to specify how it responds to both CORS preflight requests and standard requests.

Snowflake proxy always overrides the following response headers:

* `Access-Control-Allow-Origin`
* `Access-Control-Allow-Methods`
* `Access-Control-Allow-Headers`
* `Access-Control-Expose-Headers`
* `Access-Control-Max-Age`
* `Access-Control-Allow-Credentials`

The Snowflake proxy does not include any of these CORS headers in the response when either of the following is true:

* CORS is not configured for the service endpoint. That is, there no `corsSettings` in the service specification
* CORS is configured for the service endpoint, but the `Origin` header in the request doesn’t match the specified `Access-Control-Allow-Origin` field in the service specification

In the service specification, you can configure CORS settings for each public endpoint. When the `origin` header in the request matches the `Access-Control-Allow-Origin` field specified for the endpoint in the specification, the proxy includes in the response the CORS headers defined in the specification, with the following adjustments:

* `Access-Control-Allow-Origin`: Returns the `Origin` header from the request.
* `Access-Control-Expose-Headers`: Merges the list of allowed headers you configured with these always-exposed headers: `X-Frame-Options`, `Cross-Origin-Opener-Policy`, `Cross-Origin-Resource-Policy`,
  `X-Content-Type-Options`, `Cross-Origin-Embedder-Policy`, `Content-Security-Policy-Report-Only`, `Content-Security-Policy`.
* `Access-Control-Max-Age`: Is set to two hours.
* `Access-Control-Allow-Credentials`: Is set to true.

### Ingress and your Identity Provider (IdP) considerations

An account administrator can configure all ingress URLs to redirect to your identity provider (IdP) when an
unauthenticated viewer accesses an app. This process eliminates the additional customer step of selecting the IdP
from the login page, resulting in fewer clicks and a smoother login flow.

To redirect unauthenticated users from ingress URLs to your IdP, use the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md)
SQL command to set the `LOGIN_IDP_REDIRECT` account property to include `SPCS`:

```sqlexample
ALTER ACCOUNT SET LOGIN_IDP_REDIRECT = (SPCS = <your_security_integration>);
```

For more information about configuring your Snowflake account to use an IdP, see the following topics:

* [Configuring Snowflake to use federated authentication](../../user-guide/admin-security-fed-auth-configure-snowflake.md)
* [Configuring an identity provider (IdP) for Snowflake](../../user-guide/admin-security-fed-auth-security-integration.md)

### Ingress and SSO considerations

When accessing the public endpoint from the internet, you might find that username/password authentication works, but SSO results in a blank page or the error: “OAuth client integration with the given client ID is not found.”

This happens when you’re using the old style of federated authentication (SSO) with Snowflake instead of the newer security integration version as explained in [Configuring Snowflake to use federated authentication](../../user-guide/admin-security-fed-auth-security-integration.md). Do the following to verify:

1. Run the following query:

   ```sqlexample
   SHOW PARAMETERS LIKE 'SAML_IDENTITY_PROVIDER' IN ACCOUNT;
   ```

   If you have this parameter set, then at some point you were using the old-style federated authentication.
2. If the preceding parameter was set, run the following query
   to see if you have a SAML security integration:

   ```sqlexample
   SHOW INTEGRATIONS
     ->> SELECT * FROM $1 WHERE "type" = 'SAML2';
   ```

   If you don’t have any integrations of the SAML2 type, then you’re using the old style federated authentication.

In this case, the solution is to migrate from the old-style federated authentication to the new integration-style federated authentication. For more information, see [Migrating to a SAML2 security integration](../../user-guide/admin-security-fed-auth-configure-snowflake.md).

## Configure service egress

Your application code might require access to the internet. By default, application containers don’t have
permission to access the internet. You need to enable internet access using
[External Access Integrations (EAIs)](../external-network-access/external-network-access-overview.md).

Typically, you want an account administrator to create EAIs to manage external access allowed from services (including job services). Account
administrators can then grant EAI usage to specific roles that developers use to run services.

The following example outlines the steps in creating an EAI that allows egress traffic to specific destinations specified using
network rules. You then refer to the EAI when creating a service to allow requests to specific internet destinations.

**Example**

Suppose you want your application code to send requests to the following destinations:

* HTTPS requests to translation.googleapis.com
* HTTP and HTTPS requests to google.com

Follow these steps to enable your service to access these domains on the internet:

1. Create an External Access Integration (EAI). This requires appropriate permissions. For example, you can use ACCOUNTADMIN role
   to create an EAI. This is a two-step process:

   1. Use the [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) command to create one or more egress network rules listing external
      destinations you want to allow access to. You can accomplish this example with one network rule, but for illustration, we
      create two network rules:

      1. Create a network rule named `translate_network_rule`:

         ```sqlexample
         CREATE OR REPLACE NETWORK RULE translate_network_rule
           MODE = EGRESS
           TYPE = HOST_PORT
           VALUE_LIST = ('translation.googleapis.com');
         ```

         This rule allows TCP connections to the `translation.googleapis.com` destination. The domain in the VALUE_LIST
         property does not specify the optional port number, so the default port 443 (HTTPS) is assumed. This allows your
         application to connect to any URL that starts with `https://translation.googleapis.com/`.
      2. Create a network rule named `google_network_rule`:

         ```sqlexample
         CREATE OR REPLACE NETWORK RULE google_network_rule
           MODE = EGRESS
           TYPE = HOST_PORT
           VALUE_LIST = ('google.com:80', 'google.com:443');
         ```

         This allows your application to connect to any URL that starts with `http://google.com/` or
         `https://google.com/`.
      > **Note:**
      >
      > For the `VALUE_LIST` parameter, you must provide a full host name. Wildcards (for example, `*.googleapis.com`) are not supported.

      Snowpark Container Services supports only the network rules that allow ports 22, 80, 443, and 1024+. If a network rule
      referenced allows access to other ports, creation of the service will fail. Contact your account representative if you
      require use of additional ports.

      > **Note:**
      >
      > To allow your service to send HTTP or HTTPS requests to any destination on the internet, you specify “0.0.0.0”
      > as the domain in the VALUE_LIST property. The following network rule allows sending both “HTTP” and “HTTPS” requests
      > anywhere on the internet. Only ports 80 or 443 are supported with “0.0.0.0”.
      >
      > ```sqlexample
      > CREATE NETWORK RULE allow_all_rule
      >   TYPE = 'HOST_PORT'
      >   MODE= 'EGRESS'
      >   VALUE_LIST = ('0.0.0.0:443','0.0.0.0:80');
      > ```
   2. [Create an external access integration (EAI)](../external-network-access/creating-using-external-network-access.md)
      that specifies that the preceding two egress network rules are allowed:

      ```sqlexample
      CREATE EXTERNAL ACCESS INTEGRATION google_apis_access_integration
        ALLOWED_NETWORK_RULES = (translate_network_rule, google_network_rule)
        ENABLED = true;
      ```

      Now the account admin can grant usage of the integration to developers to allow them to run a service that can access
      specific destinations on the internet.

      ```sqlexample
      GRANT USAGE ON INTEGRATION google_apis_access_integration TO ROLE test_role;
      ```
2. Create the service by providing the EAI as shown in the following examples. The owner role that is creating the service needs the USAGE privilege on the EAI and READ privilege on the secrets referenced. Note that you cannot use the
   ACCOUNTADMIN role to create a service.

   * Create a service:

     ```sqlexample
     USE ROLE test_role;

     CREATE SERVICE eai_service
       IN COMPUTE POOL MYPOOL
       EXTERNAL_ACCESS_INTEGRATIONS = (GOOGLE_APIS_ACCESS_INTEGRATION)
       FROM SPECIFICATION
       $$
       spec:
         containers:
           - name: main
             image: /db/data_schema/tutorial_repository/my_echo_service_image:tutorial
             env:
               TEST_FILE_STAGE: source_stage/test_file
             args:
               - read_secret.py
         endpoints:
           - name: read
             port: 8080
       $$;
     ```

     This example CREATE SERVICE request uses an inline service specification and specifies the optional
     EXTERNAL_ACCESS_INTEGRATIONS property to include the EAI. The EAI specifies the network rules that allow egress traffic
     from the service to the specific destinations.
   * Execute a job service:

     ```sqlexample
     EXECUTE JOB SERVICE
       IN COMPUTE POOL tt_cp
       NAME = example_job_service
       EXTERNAL_ACCESS_INTEGRATIONS = (GOOGLE_APIS_ACCESS_INTEGRATION)
       FROM SPECIFICATION $$
       spec:
         container:
         - name: curl
           image: /tutorial_db/data_schema/tutorial_repo/alpine-curl:latest
           command:
           - "curl"
           - "http://google.com/"
       $$;
     ```

     This example EXECUTE JOB SERVICE command specifies inline specification and the optional EXTERNAL_ACCESS_INTEGRATIONS property
     to include the EAI. This allows egress traffic from the job to destinations specified in the network rules the EAI allows.

### Network egress using private connectivity

Instead of routing network egress via the public internet, you might opt to direct your service’s egress traffic through a
[private connectivity endpoint](../../user-guide/private-connectivity-outbound.md).

You first need to create the private connectivity endpoint in your Snowflake account. Then configure a network rule to permit outgoing
traffic to use [private connectivity](../../user-guide/private-connectivity-outbound.md). The process for setting up an External Access
Integration (EAI) remains the same as described in the preceding section.

> **Note:**
>
> Private communication requires that both Snowflake and the customer’s cloud account use the same cloud provider and same region.

For example, if you want to enable your service’s outbound internet access to an Amazon S3 bucket via private connectivity, you do the
following:

1. Enable the private link connectivity for the self-maintained endpoint service (Amazon S3). For step-by-step instructions, see
   [AWS Private Link for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html).
2. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a private connectivity
   endpoint in your Snowflake VNet. This enables Snowflake to connect to the external service (in this example, Amazon S3) using private
   connectivity.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'com.amazonaws.us-west-2.s3',
     '*.s3.us-west-2.amazonaws.com'
   );
   ```
3. In the cloud provider account, approve the endpoint. In this example, for Amazon AWS, see
   [Accept or reject connection requests](https://docs.aws.amazon.com/vpc/latest/privatelink/configure-endpoint-service.html#accept-reject-connection-requests)
   in the AWS documentation. Also, to approve the endpoint in Azure, see the
   [Azure documentation](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
4. Use the [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) command to create an egress network rule specifying the external destinations
   that you want to allow access to.

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE private_link_network_rule
     MODE = EGRESS
     TYPE = PRIVATE_HOST_PORT
     VALUE_LIST = ('<bucket-name>.s3.us-west-2.amazonaws.com');
   ```

   The TYPE parameter value is set to PRIVATE_HOST_PORT. It indicates that the network rule allows outgoing network traffic to use
   [private connectivity](../../user-guide/private-connectivity-outbound.md).
5. The rest of the steps to create an EAI and use it to create a service are the same as explained in the preceding section
   (see Configure service egress).

For more information about working with private connectivity endpoints, see the following:

* [Manage private connectivity endpoints: AWS](../../user-guide/private-manage-endpoints-aws.md)
* [Manage private connectivity endpoints: Azure](../../user-guide/private-manage-endpoints-azure.md)
* [Manage private connectivity endpoints: Google Cloud](../../user-guide/private-manage-endpoints-gcp.md)

## Considerations when Configuring communications between services

There are two considerations:

* **Communications between containers of a service instance:** If a service instance runs multiple containers, these containers
  can communicate with each other over localhost (there is no need to define endpoints in the service specification).
* **Communication between containers across multiple services or multiple service instances:** Containers belonging to
  different services (or different instances of the same service) can communicate using endpoints defined in specification files.
  For more information, see [Service-to-service communications](working-with-services.md).

---
title: Service specification reference
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/specification-reference.md
section: Snowpark Container Services
---

# Service specification reference

The Snowpark Container Services specification is in YAML
(<https://yaml.org/spec/>). It gives Snowflake the necessary
information to configure and run your service. You provide the specification at the time of creating a service.

The general syntax is:

```yaml
spec:
  containers:                           # container list
  - name: <name>
    image: <image-name>
    command:                            # optional list of strings
      - <cmd>
      - <arg1>
    args:                               # optional list of strings
      - <arg2>
      - <arg3>
      - ...
    env:                                # optional
        <key>: <value>
        <key>: <value>
        ...
    readinessProbe:                     # optional
        port: <TCP port-num>
        path: <http-path>
    volumeMounts:                       # optional list
      - name: <volume-name>
        mountPath: <mount-path>
      - name: <volume-name>
        ...
    resources:                          # optional
        requests:
          memory: <amount-of-memory>
          nvidia.com/gpu: <count>
          cpu: <cpu-units>
        limits:
          memory: <amount-of-memory>
          nvidia.com/gpu: <count>
          cpu: <cpu-units>
    secrets:                                # optional list
      - snowflakeSecret:
          objectName: <object-name>         # specify this or objectReference
          objectReference: <reference-name> # specify this or objectName
        directoryPath: <path>               # specify this or envVarName
        envVarName: <name>                  # specify this or directoryPath
        secretKeyRef: username | password | secret_string # specify only with envVarName
  endpoints:                             # optional endpoint list
    - name: <name>
      port: <TCP port-num>                     # specify this or portRange
      portRange: <TCP port-num>-<TCP port-num> # specify this or port
      public: <true / false>
      protocol : < TCP / HTTP >
      corsSettings:                  # optional CORS configuration
        Access-Control-Allow-Origin: # required list of allowed origins, for example, "http://example.com"
          - <origin>
          - <origin>
            ...
        Access-Control-Allow-Methods: # optional list of HTTP methods
          - <method>
          - <method>
            ...
        Access-Control-Allow-Headers: # optional list of HTTP headers
          - <header-name>
          - <header-name>
            ...
        Access-Control-Expose-Headers: # optional list of HTTP headers
          - <header-name>
          - <header-name>
            ...
    - name: <name>
      ...
  volumes:                               # optional volume list
    - name: <name>
      source: local | stage | memory | block
      size: <bytes-of-storage>           # specify if memory or block is the volume source
      uid: <UID-value>                   # optional, only for stage volumes
      gid: <GID-value>                   # optional, only for stage volumes
      blockConfig:                       # optional
        initialContents:
          fromSnapshot: <snapshot-name>
        iops: <number-of-operations>
        throughput: <MiB-per-second>
        encryption: SNOWFLAKE_SSE | SNOWFLAKE_FULL
        snapshotOnDelete: true | false             # defaults to true for services and false for jobs
        snapshotDeleteAfter: (<hours>h)|(<days>d)  # defaults to 7 days
      stageConfig:                       # optional
        name: <stage_name>
        metadataCache: <time_period>      # optional
        resources:                       # optional
          requests:
            memory: <amount-of-memory>
            cpu: <cpu-units>
          limits:
            memory: <amount-of-memory>
            cpu: <cpu-units>
    - name: <name>
      source: local | stage| memory | block
      size: <bytes-of-storage>           # specify if memory or block is the volume source
      ...
  logExporters:
    eventTableConfig:
      logLevel: <INFO | ERROR | NONE>
  platformMonitor:                      # optional, platform metrics to log to the event table
    metricConfig:
      groups:
      - <group-1>
      - <group-2>
      ...
capabilities:
  securityContext:
    executeAsCaller: <true / false>     # optional, indicates whether application intends to use caller’s rights
serviceRoles:                   # Optional list of service roles
- name: <service-role-name>
  endpoints:
  - <endpoint_name1>
  - <endpoint_name2>
  - ...
- ...
```

Note that the `spec` and `serviceRoles` are the top-level fields in the specification.

* `spec`: Use this field to provide specification details. It includes these top-level fields:

  + spec.containers (required): A list of one or more application containers.
    Your containerized application must have at least one container.
  + spec.endpoints (optional): A list of
    endpoints that the service exposes. You might choose to make an
    endpoint public, allowing network ingress access to the service.
  + spec.volumes (optional): A list of storage volumes for the
    containers to use.
  + spec.logExporters (optional): This field manages the level of container logs
    exported to the event table in your account.
* `serviceRoles`: Use this field to define one or more service roles. The service role is the mechanism you use to manage privileges to endpoints the service exposes.

## General guidelines

* The following format guidelines apply for the `name` fields (container, endpoint, and volume names):

  + Can be up to 63 characters long.
  + Can contain a sequence of lowercase alphanumeric or `-` characters.
  + Must start with an alphabetic character.
  + Must end with an alphanumeric character.
* Customers should ensure that no personal data, sensitive data, export-controlled data, or other regulated data is entered as
  metadata in the specification file. For more information, see [Metadata Fields in Snowflake](../../sql-reference/metadata.md).

The following sections explain each of the top-level `spec` fields.

## `spec.containers` field (required)

Use the `spec.containers` field to describe each of the [OCI](https://opencontainers.org/) containers in your application.

Note the following:

* When you create a service, Snowflake runs these containers on a single node in the specified compute pool, sharing the same network interface.
* You might choose to run multiple service instances to load-balance incoming requests. Snowflake might choose to run these service instances on the same node or different nodes in the specified compute pool. All containers for a given instance always run on one node.
* Currently, Snowpark Container Services requires linux/amd64 platform images.

The following sections explain the types of containers fields.

### `containers.name` and `containers.image` fields

For each container, only name and image are required fields.

* `name` is the image name. This name can be used to identify a specific container for the purposes of observability (for example, [logs](monitoring-services.md),
  [metrics](working-with-services.md)).
* `image` is the name of the image you uploaded to a Snowflake image repository in your Snowflake account.

For example:

```yaml
spec:
  containers:
    - name: echo
      image: /tutorial_db/data_schema/tutorial_repository/echo_service:dev
```

### `containers.command` and `containers.args` fields

Use these optional fields to control what executable is started in your container and the arguments that are passed to that executable. You can configure defaults for these at the time of creating the image, typically in a Dockerfile.
By using these service specification fields, you can change these defaults (and thus change the container behavior) without having to rebuild your container image:

* `containers.command` overrides the `Dockerfile` `ENTRYPOINT`. This allows you
  to run a different executable in the container.
* `containers.args` overrides the `Dockerfile` `CMD`. This allows you
  to provide different arguments to the command (the executable).

**Example**

Your `Dockerfile` includes the following code:

```bash
ENTRYPOINT ["python3", "main.py"]
CMD ["Bob"]
```

These `Dockerfile` entries execute the `python3` command
and pass two arguments: `main.py` and `Bob`. You can override
these values in the specification file as follows:

* To override the ENTRYPOINT, add the
  `containers.command` field in the specification file:

  ```yaml
  spec:
    containers:
    - name: echo
      image: <image_name>
      command:
      - python3.9
      - main.py
  ```
* To override the argument “Bob”, add the
  `containers.args` field in the specification file:

  ```yaml
  spec:
    containers:
    - name: echo
      image: <image_name>
      args:
        - Alice
  ```

### `containers.env` field

Use the `containers.env` field to define container environment variables. All processes in the container have access to these
environment variables:

```yaml
spec:
  containers:
  - name: <name>
    image: <image_name>
    env:
      ENV_VARIABLE_1: <value1>
      ENV_VARIABLE_2: <value2>
      …
      …
```

**Example**

In [Tutorial 1](tutorials/tutorial-1.md), the application code
(`echo_service.py`) reads the environment variables as shown:

```python
CHARACTER_NAME = os.getenv('CHARACTER_NAME', 'I')
SERVER_PORT = os.getenv('SERVER_PORT', 8080)
```

Note that the example passes default values for the variables to the `getenv` function. If the environment variables are not defined, these defaults are used.

* `CHARACTER_NAME`: When the Echo service receives an
  HTTP POST request with a string (for example, “Hello”), the service
  returns “I said Hello”. You can overwrite this default value
  in the specification file. For example, set
  the value to “Bob”; the Echo service returns
  a “Bob said Hello” response.
* `SERVER_PORT`: In this default configuration,
  the Echo service listens on port 8080. You
  can override the default value and specify another
  port.

The following service specification overrides both of these environment
variable values:

```yaml
spec:
  containers:
  - name: echo
    image: <image_name>
    env:
      CHARACTER_NAME: Bob
      SERVER_PORT: 8085
  endpoints:
  - name: echo-endpoint
    port: 8085
```

Note that, because you changed the port number your service listens on,
the specification must also update the endpoint (`endpoints.port field` value) as shown.

### `containers.readinessProbe` field

Use the `containers.readinessProbe` field to identify a readiness probe in your application. Snowflake
continuously calls this probe to determine when your application is ready to serve requests, and will
stop routing traffic if the probe starts failing.

Snowflake makes an HTTP GET request
to the specified readiness probe, at the specified port and path, and looks for
your service to return an HTTP 200 OK status to ensure that only healthy containers
serve traffic.

Use the following fields to provide the required information:

* `port`: The network port on which the service is listening for the readiness probe requests. You need not declare this port as an endpoint.
* `path`: Snowflake makes HTTP GET requests to the service with this path.

**Example**

In Tutorial 1, the application code (`echo_python.py`) implements
the following readiness probe:

```python
@app.get("/healthcheck")
def readiness_probe():
```

Accordingly, the specification file includes the `containers.readinessProbe`
field:

```yaml
spec:
  containers:
  - name: echo
    image: <image_name>
    env:
      SERVER_PORT: 8088
      CHARACTER_NAME: Bob
    readinessProbe:
      port: 8088
      path: /healthcheck
  endpoints:
  - name: echo-endpoint
    port: 8088
```

The port specified by the readiness probe does not have to
be a configured endpoint. Your service could listen on a
different port solely for the purpose of the readiness probe.

### `containers.volumeMounts` field

Because the `spec.volumes` and `spec.containers.volumeMounts` fields work together,
they are explained together in one section. For more information,
see spec.volumes field (optional).

### `containers.resources` field

A compute pool defines a set of available resources (CPU, memory, and storage) and Snowflake determines where in the compute pool to run your services.

It is recommended that you explicitly indicate resource requirements
for the specific container and set appropriate limits in the specification. Note that the resources you specify are constrained by the instance family of the
nodes in your compute pool. For more information, see [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md).

Use `containers.resources` field to specify explicit resource requirements for the specific application container:

* `containers.resources.requests`: The requests you specify should be the average resource usage you anticipate by your service. Snowflake uses this information to determine placement of the service instance in the compute pool. Snowflake ensures that the sum of the resource requests placed on a given node fits within the available resources on the node.
* `containers.resources.limits`: The limits you specify direct Snowflake to not allocate resources more than the specified limits. Thus, you can prevent cost overruns.

You can specify requests and limits for the following resources:

* `memory`: This is the memory required for your application container. You can use either decimal or binary units to express the values. For example, 2G represents a request for 2,000,000,000 bytes and 2Gi represents a request for 2 x 1024 x 1024 x 1024 bytes.

  When specifying memory, a unit is required. For example, `100M` or `5Gi`. The supported units are: M, Mi, G, Gi.
* `cpu`: This refers to virtual core (vCPU) units. For example, 1 CPU unit is equivalent to 1 vCPU. Fractional requests are allowed, such as 0.5, which can also be expressed as 500m.
* `nvidia.com/gpu`: If GPUs are required, they must be requested, and there must also be a `limit`
  specified for the same quantity. If your container does not specify requests and limits for GPU capacity, it cannot access any GPUs. The number of GPUs you can request is limited by the maximum GPUs supported by the `INSTANCE_TYPE` you choose when creating a [compute pool](../../sql-reference/sql/create-compute-pool.md).

`resource.requests` and `resource.limits` are relative to the node capacity (vCPU and memory) of the instance family of the associated [compute pool](working-with-compute-pool.md).

* If a resource request (cpu, memory, or both) is not provided, Snowflake derives one for you:

  + For `cpu`, the derived value is either 0.5 or the `cpu` limit you provided, whichever is greater.
  + For `memory`, the derived value is either 0.5 GiB or the `memory` limit you provided, whichever is greater.
* If a resource limit (cpu, memory, or both) is not provided, Snowflake defaults the limits to the node capacity for the instance family of the associated [compute pool](working-with-compute-pool.md).
* If you do provide `resource.limits` and they exceed the node capacity, Snowflake will cap the limit to the node capacity.
* Snowflake evaluates these resource requirements independently for `cpu` and `memory`.

Note that if it’s theoretically impossible for Snowflake to schedule the service on the given compute pool, CREATE SERVICE will fail. Theoretically impossible assumes the compute pool has the maximum number of allowed nodes and there are no other services running on the compute pool. That is, there is no way Snowflake could allocate the requested resources within the compute pool limits.
If it’s theoretically possible, but required resources are in use, then CREATE SERVICE will succeed. Some service instances will report status indicating that the service cannot be scheduled due to insufficient resources until resources become available.

**Example 1**

In the following specification, the `containers.resources` field
describes the resource requirements for the container:

```yaml
spec:
  containers:
  - name: resource-test-gpu
    image: ...
    resources:
      requests:
        memory: 2G
        cpu: 0.5
        nvidia.com/gpu: 1
      limits:
        memory: 4G
        nvidia.com/gpu: 1
```

In this example, Snowflake is asked to allocate at least 2 GB of memory, one GPU, and a half CPU core for the container. At the same
time, the container is not allowed to use more than 4 GB of memory and one GPU.

**Example 2**

Suppose:

* You create a compute pool of two nodes; each node has 27 GB of memory and one GPU:

  ```sqlexample
  CREATE COMPUTE POOL tutorial_compute_pool
    MIN_NODES = 2
    MAX_NODES = 2
    INSTANCE_FAMILY = gpu_nv_s
  ```
* You create a service that asks Snowflake to run two instances of the service:

  ```sqlexample
  CREATE SERVICE echo_service
    MIN_INSTANCES=2
    MAX_INSTANCES=2
    IN COMPUTE POOL tutorial_compute_pool
    FROM @<stage_path>
    SPEC=<spec-file-stage-path>;
  ```

  Both `MIN_INSTANCES` and `MAX_INSTANCES` are set to 2. Therefore,
  Snowflake will run two instances of the service.

Now, consider these scenarios:

* If your service does not explicitly include resource requirements in your application specification,
  Snowflake decides whether to run these instances on the same node or different nodes in the compute pool.
* You do include resource requirements in the service specification and request 15 GB of memory for the container:

  ```yaml
  - name: resource-test
    image: ...
    resources:
      requests:
        memory: 15G
  ```

  Your compute pool node has 27 GB of memory, and Snowflake cannot
  run two containers on the same node. Snowflake will run the two service
  instances on separate nodes in the compute pool.
* You include resource requirements in the service specification and request 2 GB of memory and one GPU for the container:

  ```yaml
  spec:
    containers:
    - name: resource-test-gpu
      image: ...
      resources:
        requests:
          memory: 2G
          nvidia.com/gpu: 1
        limits:
          nvidia.com/gpu: 1
  ```

  You are requesting one GPU per container, and each node has only one GPU.
  In this case, although memory is not an issue, Snowflake cannot schedule both
  service instances on one node. This requirement forces Snowflake to run the two
  service instances on two separate compute pool nodes.

### `containers.secrets` field

```yaml
secrets:                                # optional list
  - snowflakeSecret:
      objectName: <object-name>         # specify this or objectReference
      objectReference: <reference-name> # specify this or objectName
    directoryPath: <path>               # specify this or envVarName
    envVarName: <name>                  # specify this or directoryPath
    secretKeyRef: username | password | secret_string # specify only with envVarName
  - snowflakeSecret: <object-name>      # equivalent to snowflakeSecret.objectName
    ...
```

Use the `containers.secrets` field in your service specification to provide Snowflake-managed credentials to your application containers. Start by storing the credentials in [Snowflake secret](../../user-guide/api-authentication.md) objects. Then, in the service specification, reference the secret object and specify where to place the credentials inside the container.

The following is a summary of how to use the `containers.secrets` fields:

* **Specify Snowflake secret:** Use the `snowflakeSecret` field to specify either a Snowflake secret object name or object reference. Object references are applicable when using Snowpark Container Services to create a Native App (an app with containers).

  + Use `secretKeyRef` to provide the name of the key in the Snowflake secret.
* **Specify the secret placement in the application container:** Use the `envVarName` field to pass the secret as environment variables or `directoryPath` to write the secrets to local container files.

For more information, see
[Passing credentials to a container using Snowflake secrets](working-with-services.md).

Note that, the role that is creating the service (owner role) will need the READ privilege on the secrets referenced.

## `spec.endpoints` field (optional)

Use the `spec.endpoints` field to specify a list of TCP network ports that your application exposes.
A service might expose zero to many endpoints. Use the following fields to describe an endpoint:

* `name`: Unique name of the endpoint. The name is used to identify the endpoint in
  [service function](working-with-services.md) and
  [service role](working-with-services.md) specification.
* `port`: The network port on which your service is listening. You must specify this field or the `portRange` field.
* `portRange`: The network port range on which your application is listening. You must specify this field or the `port` field.

  Ports defined in `portRange` can only be accessed by directly calling service instance IP addresses. To get service instance IP addresses, use the `instances.` prefixed DNS name.

  ```output
  instances.<Snowflake_assigned_service_DNS_name>
  ```

  For more information, see [Service-to-service communications](working-with-services.md).

  Note that you can only specify the `portRange` field if the `protocol` field is set to TCP and the `public` field is false.
* `public`: If you want this endpoint to be accessible from outside the Snowpark Container Services network, set this field
  to `true`. Public endpoints only support the “HTTP” value for the `protocol` field.
* `protocol`: The protocol that the endpoint supports. The supported values are TCP and HTTP. By default, the protocol is HTTP. When specifying the `protocol`, the following apply:

  + When this endpoint is public or the target of a service function (see [Using a service](working-with-services.md)), the protocol must be HTTP or HTTPS.
* `corsSettings`: The fields under `endpoints` allow you to configure Snowflake support for CORS on HTTP requests to public endpoints.

  + `corsSettings.Access-Control-Allow-Origin`: Specifies the origins for which Snowflake responds with the provided CORS allow and expose response headers. The value must be a valid URL with no path specified, for example, `https://example.com/, https://example.com:12345`, for security reasons the “\*” wildcard is not allowed for `Access-Control-Allow-Origin`.
  + Snowflake supports the following CORS response headers:

    - `corsSettings.Access-Control-Allow-Methods`: Specifies the value of the HTTP `Access-Control-Allow-Methods` CORS response header. This tells the browsers what HTTP methods (GET, POST, etc.) they should allow when sending requests to this endpoint.
    - `corsSettings.Access-Control-Allow-Headers`: Specifies the value of the HTTP `Access-Control-Allow-Headers` CORS response header. This tells the browsers what HTTP headers they should allow when sending requests to this endpoint.
    - `corsSettings.Access-Control-Expose-Headers`: Specifies the value of the HTTP `Access-Control-Expose-Headers` CORS response header. This tells the browsers what HTTP headers they should allow when exposing responses from this endpoint.

> **Note:**
>
> Snowflake performs authentication and authorization checks for public access that
> allow only Snowflake users that have permission to use the service. Public access to an endpoint requires Snowflake authentication. The authenticated user must also have authorization to this service endpoint (user has usage permission of a role which has access to the endpoint).

**Example**

The following is the application specification used in [Tutorial 1](tutorials/tutorial-1.md):

```yaml
spec:
  container:
  - name: echo
    image: <image-name>
    env:
      SERVER_PORT: 8000
      CHARACTER_NAME: Bob
    readinessProbe:
      port: 8000
      path: /healthcheck
  endpoint:
  - name: echoendpoint
    port: 8000
    public: true
```

This application container exposes one endpoint.
It also includes the optional `public` field to enable access
to the endpoint from outside of Snowflake (internet access). By default, `public` is `false`.

## `spec.volumes` field (optional)

This section explains both `spec.volumes` and `spec.containers.volumeMounts` specification fields because they’re closely related.

* `spec.volumes` defines a shared file system. These volumes can be made available in your containers.
* `spec.containers.volumeMount` defines where a volume appears in specific containers.

Note that, the `volumes` field is specified at the `spec` level, but since multiple containers can share the same volume, `volumeMounts` becomes a `spec.containers`-level field.

Use these fields to describe both the volumes and volume mounts.

* `spec.volumes`: Use the following fields to describe a volume:

  + Required fields for all volume types:

    - `name`: Unique name of the volume. It is referred to by `spec.containers.volumeMounts.name`.
    - `source`: This can be `local`, `memory`, `block`, `stage`, or `"@<stagename>"` (which is deprecated). The next section explains these volume types.
    - `size` (required only for the `memory` and `block` volume types): For memory and block volumes, this is the size of the volume in bytes.
      For block storage, the value must always be an integer, specified using the Gi unit suffix. For example, `5Gi` means `5*1024*1024*1024` bytes.
  + For the `block` type volume, you can specify these optional fields:
    `blockConfig.initialContents.fromSnapshot`, `blockConfig.iops`, `blockConfig.throughput`,
    `blockConfig.encryption`, `snapshotOnDelete`, and `snapshotDeleteAfter`.
    For more information, see [Specifying block storage in service specification](block-storage-volume.md).
  + For the `stage` type volume, `name` is a required field. It identifies the stage. You can also specify the optional fields `stageConfig.resources` and `stageConfig.metadataCache`. For more information, see [Using Snowflake stage volumes with services](snowflake-stage-volume.md).
* `spec.containers.volumeMounts`: Each container can have zero or more volume mounts. `containers.volumeMounts` is also a list. That is, each container can have multiple volume mounts. Use the following fields to describe a volume mount:

  + `name`: The name of the volume to mount. A single container can reference the same volume multiple times.
  + `mountPath`: The file path to where the volume for the container should be mounted.

### About the supported volume types

Snowflake supports these volume types for application containers to use: local, memory, block, and Snowflake stage.

* **Local volume:** Containers in a service instance can use a
  local disk to share files. For example, if your application has
  two containers—an application container and a log analyzer—
  the application can write logs to
  the local volume, and the log analyzer can read the logs.

  Note that, if you are running multiple instances of a service,
  only containers belonging to a service instance can share volumes.
  Containers that belong to different service instances do not share volumes.
* **Memory:** You can use a RAM-backed file system for container use.
* **Block:** Containers can also use block storage volumes. For more information, see [Using block storage volumes with services](block-storage-volume.md).
* **Snowflake stage:** You can also
  give containers convenient access to files on a Snowflake stage in your account. For more information, see [Using Snowflake stage volumes with services](snowflake-stage-volume.md).

**Example**

Your machine learning application includes the following two containers:

* An `app` container for the main application
* A `logger-agent` container that collects logs and uploads them
  to Amazon S3

These containers use the following two volumes:

* `local` volume: This application writes logs that the log agent reads.
* Snowflake stage, `@model_stage`: The main application reads
  files from this stage.

In the following example specification, the `app` container mounts both the
`logs` and `models` volumes, and the `logging-agent` container
mounts only the `logs` volume:

> ```yaml
> spec:
>   containers:
>   - name: app
>     image: <image1-name>
>     volumeMounts:
>     - name: logs
>       mountPath: /opt/app/logs
>     - name: models
>       mountPath: /opt/models
>   - name: logging-agent
>     image: <image2-name>
>     volumeMounts:
>     - name: logs
>       mountPath: /opt/logs
>   volumes:
>   - name: logs
>     source: local
>   - name: models
>     source: "@model_stage"
> ```

If multiple instances of the service are running, the
`logging-agent` and the `app` containers within a service instance
share the `logs` volume. The `logs` volume is not shared across
service instances.

If, in addition to these volumes, your `app` container also uses a 2-GB memory volume, revise the specification
to include the volume in the `volumes` list and also add another volume mount in the `app` containers `volumeMounts` list:

> ```yaml
> spec:
>   containers:
>   - name: app
>     image: <image1-name>
>     volumeMounts:
>     - name: logs
>       mountPath: /opt/app/logs
>     - name: models
>       mountPath: /opt/models
>     - name: my-mem-volume
>       mountPath: /dev/shm
>   - name: logging-agent
>     image: <image2-name>
>     volumeMounts:
>     - name: logs
>       mountPath: /opt/logs
>   volumes:
>   - name: logs
>     source: local
>   - name: models
>     source: "@model_stage"
>   - name: "my-mem-volume"
>     source: memory
>     size: 2G
> ```

Note that when you specify `memory` as the volume `source`, you must also specify the `volumes.size` field to indicate the
memory size. For information about the memory size units you can specify, see About units.

### About file permissions on mounted volumes

A container that mounts a Snowflake stage or a block storage volume typically runs as a root user. However, sometimes your container might run as a non-root user. For example:

* If your application uses a third-party library, the library uses a non-root user to run application code inside the container.
* For other reasons, such as security, you might run your application as a non-root user inside the container.

To avoid potential errors related to file user permissions, it’s important
to set the UID (User ID) and GID (Group ID) of the container as part of the
specification. This is particularly relevant for containers that use a
specific user and group for launching or running the application within the container.
By setting the appropriate UID and GID, you can use a container running
as a non-root user. For example:

```yaml
spec:
  ...

  volumes:
  - name: stagemount
    source: "@test"
    uid: <UID-value>
    gid: <GID-value>
```

Snowflake uses this information to mount the stage with appropriate permissions.

To obtain the UID and GID of the container, do the following:

1. Run the container locally using `docker run`.
2. Look up the container ID using the `docker container list` command. Partial sample output:

   > ```output
   > CONTAINER ID   IMAGE                       COMMAND
   > —----------------------------------------------------------
   > a6a1f1fe204d  tutorial-image         "/usr/local/bin/entr…"
   > ```
3. Run the `docker id` command inside the container to get the UID and GID:

   > ```bash
   > docker exec -it <container-id> id
   > ```
   >
   > Sample output:
   >
   > ```output
   > uid=0(root) gid=0(root) groups=0(root)
   > ```

## `spec.logExporters` field (optional)

Snowflake collects your applications output to standard output or standard error. For more information, see
[Accessing local container logs.](monitoring-services.md) Use `spec.logExporters` to configure which of these outputs Snowflake exports to your [event table](../logging-tracing/event-table-operations.md).

```yaml
logExporters:
  eventTableConfig:
    logLevel: < INFO | ERROR | NONE >
```

The supported `logLevel` values are:

* `INFO` (default): Export all the user logs.
* `ERROR`: Export only the error logs. Snowflake exports only the logs from stderr stream.
* `NONE`: Do not export logs to the event table.

## `spec.platformMonitor` field (optional)

Individual services publish metrics. These Snowflake-provided metrics are also referred to as the platform metrics. You add the `spec.platformMonitor` field in the specification to direct Snowflake to send metrics from the service to the event table configured for your account. The target use case for this is to observe resource utilization of a specific service.

```yaml
platformMonitor:
  metricConfig:
    groups:
    - <group_1>
    - <group_2>
    ...
```

`group_N` refers to a [predefined metrics groups](monitoring-services.md) that you are interested in. While the service is running, Snowflake logs metrics from specified groups to the event table. You can then query the metrics from the event table. For more information, see [Monitoring Services](monitoring-services.md).

## About units

A service specification takes numeric values in several places. A variety of units are supported to express these values. For large and small values, you can use binary and decimal units as shown. In the following list, “#” represents an integer value.

* Binary units:

  + `numberKi` means `number*1024`. For example, 4Ki is equivalent to 4096.
  + `numberMi` means `number*1024*1024`.
  + `numberGi` means `number*1024*1024*1024`.
* Decimal units:

  + `numberk` means `number*1000`. For example, 4k is equivalent to 4000.
  + `numberM` means `number*1000*1000`.
  + `numberG` mean `number*1000*1000*1000`.
* Fractional units:

  + `numberm` means `number*0.001`. For example, `cpu: 500m` is equivalent to `cpu: 0.5`.

## `capabilities` field (optional)

In the `capabilities` top-level field in the specification, use the `securityContext.executeAsCaller` field to indicate the application intends to use [caller’s rights](spcs-execute-sql.md).

```yaml
capabilities:
  securityContext:
    executeAsCaller: <true / false>    # optional, indicates whether application intends to use caller’s rights
```

By default, `executeAsCaller` is false.

## `serviceRoles` field (optional)

Use the `serviceRoles` top-level field in the specification to define one or more service roles. For each service role, provide a name and a list of one or more endpoints (defined in the `spec.endpoints`) you want the service role to grant USAGE privilege on.

```yaml
serviceRoles:                   # Optional list of service roles
- name: <name>
  endpoints:
  - <endpoint-name>
  - <endpoint-name>
  - ...
- ...
```

Note the following:

* Both the `name` and `endpoints` are required.
* The service role name must adhere to the following format:

  + Must contain alphanumeric or `_` characters.
  + Must start with an alphabetic character.
  + Must end with an alphanumeric character.

For more information, see [Managing service-related privileges](working-with-services.md).

---
title: Snowpark Container Services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/overview.md
section: Snowpark Container Services
---

# Snowpark Container Services

## About Snowpark Container Services

Snowflake started by providing a SQL database for querying structured and semi-structured data, but SQL alone isn’t ideal for complex computations or machine learning. To address this, Snowflake introduced [Snowpark](../snowpark/index.md), which lets developers use languages like Python, Java, and Scala to build data applications and pipelines. Snowpark translates this code into optimized SQL, combining the flexibility of modern languages with the performance and scalability of Snowflake’s SQL engine.

For more flexibility, Snowflake offers Snowpark Container Services, a managed container orchestration platform within Snowflake. You can package your application and its dependencies into an Open Container Initiative (OCI) image, which can include any programming language, framework, or library.
This enables use cases that require custom runtimes, specialized libraries, or specific software configurations. In addition, with support for advanced CPUs and GPUs, you can run compute-intensive workloads, such as ML model serving, ML model training, and advanced AI analytics. Snowflake manages the underlying infrastructure, but you have full control over the contents of your containerized environment.

As a fully managed service, Snowpark Container Services streamlines operational tasks related to running your containers. Using best practices, Snowpark Container Services handles the intricacies of container management, including security and configuration. This ensures that you can focus on developing and deploying your applications without the overhead of managing the underlying infrastructure.

Snowpark Container Services is fully integrated with Snowflake. For example, your application can easily perform these tasks:

* Connect to Snowflake and run SQL in a Snowflake virtual warehouse.
* Access data files in a Snowflake stage.
* Process data retrieved through SQL queries.

Your application can leverage your existing Snowflake configuration, including the following items:

* Network policies for network ingress
* External access integration for network egress
* Role-based access control for enabling service-to-service communications
* Event tables for logs, metrics, and events

Snowpark Container Services is also integrated with third-party tools. It lets you use third-party clients, such as Docker, to easily upload your application images to Snowflake. Seamless integration makes it easier for teams to focus on building data applications.

All these capabilities come with Snowflake platform benefits, most notably ease-of-use, security, and governance features. You also get a scalable, flexible compute layer next to the powerful Snowflake data layer without needing to move data off the platform.

## Common scenarios for using Snowpark Container Services

Your application can be deployed to Snowflake regions without concern for the underlying cloud platform (AWS, Azure, or Google Cloud). Snowpark Container Services also makes it easy for your application to access your Snowflake data. In addition, Snowflake manages the underlying compute nodes.

The following list shows the common workloads for Snowpark Container Services:

* **Batch Data Processing Jobs:** Run flexible jobs similar to stored procedures, pulling data from Snowflake or external sources, processing it, and producing results. Workloads can be distributed across multiple job instances, and graphics processing unit (GPU) support is available for computationally intensive tasks like AI and machine learning.
* **Service Functions:** Your service can provide a service function so that your queries can send batches of data to your service for processing. The query processing happens in Snowflake’s advanced query engine and your service provides custom data processing that Snowflake can scale to multiple compute nodes. For an example, see [Tutorial 1](tutorials/tutorial-1.md). In step 4 of this tutorial, you call the service function in a query.
* **APIs or Web UI Over Snowflake Data:** Deploy services that expose APIs or web interfaces with embedded business logic. Users interact with the service rather than raw data. Caller’s rights ensure that queries run with the correct user permissions. For an example, see [Tutorial 1](tutorials/tutorial-1.md). In this tutorial, the service also exposes a web UI to the internet. In step 4, you send requests to the service from a web browser.

## How does it work?

To run containerized applications in Snowpark Container Services, in addition to working with the basic Snowflake objects, such
as databases and warehouses, you work with these objects: [image repository](working-with-registry-repository.md),
[compute pool](working-with-compute-pool.md), and [service](working-with-services.md).

Snowflake offers an [OCIv2](https://github.com/opencontainers/distribution-spec/blob/main/spec.md)
compliant *image registry* service for storing your images. This service enables Open Container Initiative (OCI) clients, such as Docker CLI, to upload your application images to a *repository* (a storage unit) in your
Snowflake account. You create a repository using the [CREATE IMAGE REPOSITORY](../../sql-reference/sql/create-image-repository.md) command. For more information, see
[Working with an image registry and repository](working-with-registry-repository.md).

After you upload your application image to a repository, you can run your application by creating a
[long-running service or executing a job service](working-with-services.md).

* **Service:** A service is long-running and, as with a web service, you explicitly stop it when it is no longer needed. If a service container
  exits for whatever reason, Snowflake restarts that container. To create a service, such as a full stack web application,
  use the [CREATE SERVICE](../../sql-reference/sql/create-service.md) command.
* **Job service:** A job service has a finite lifespan, similar to a stored procedure.
  When all containers exit, the job service is done. Snowflake doesn’t restart any job service containers. To start a job service, such as training a machine learning model with GPUs, use the
  [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) command.

Your services, including job services, run in a *compute pool*, which is a collection of one or more virtual machine (VM) nodes. You first
create a compute pool by using the [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) command, and then specify the compute pool when
you create a service or a job service. The required information to create a compute pool includes the machine type, the minimum number of nodes to
launch the compute pool with, and the maximum number of nodes the compute pool can scale to. Some of the supported machine types
provide GPU. For more information, see [Working with compute pools](working-with-compute-pool.md).

After you create a service, users in the same Snowflake account that created the service can use the service, if they have the appropriate permissions. For more information, see [Using a service](working-with-services.md).

> **Note:**
>
> The Snowpark Container Services documentation primarily uses SQL commands and functions in explanations of concepts and in examples. Snowflake also provides other interfaces, including [Python APIs](../snowflake-python-api/snowflake-python-overview.md), [REST APIs](../snowflake-rest-api/snowflake-rest-api.md), and the [Snowflake CLI](../snowflake-cli/index.md) command-line tool for most operations.

## Available regions and considerations

Snowpark Container Services is in all [regions](../../user-guide/intro-regions.md) except the following:

* Snowpark Container Services supports [public sector (government) workloads](../../user-guide/intro-regions.md) in the AWS US East (Commercial Gov - N. Virginia) region and is not available in other AWS or Azure government regions.
* Currently, Snowpark Container Services in the GCP Dammam region uses the global endpoints for Google Cloud APIs.

## What’s next?

If you’re new to Snowpark Container Services, we suggest that you first explore the tutorials and then continue with other
topics to learn more and create your own containerized applications. The following topics provide more information:

* **Tutorials:** These [introductory tutorials](overview-tutorials.md) provide step-by-step instructions for you to explore
  Snowpark Container Services. After initial exploration, you can continue with
  [advanced tutorials](overview-advanced-tutorials.md).
* **Service specification reference:** This reference explains the [YAML syntax](specification-reference.md) to
  create a service specification.
* **Working with services and job services:** These topics provide details about the Snowpark Container Services components that you use
  in developing services and job services:

  + [Working with an image registry and repository](working-with-registry-repository.md)
  + [Working with compute pools](working-with-compute-pool.md)
  + [Working with services](working-with-services.md)
  + [Troubleshooting](troubleshooting.md)
* **Reference:** Snowpark Container Services provides the following SQL commands and functions:

  + SQL commands: [Snowpark Container Services commands](../../sql-reference/commands-snowpark-container-services.md) and [CREATE FUNCTION (Snowpark Container Services)](../../sql-reference/sql/create-function-spcs.md)
  + SQL functions:

    - System function: [SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md)
    - Scalar functions: [Snowpark Container Services functions](../../sql-reference/functions-spcs.md)
    - Table-valued functions:

      * [GET_JOB_HISTORY](../../sql-reference/functions/get_job_history.md)
      * [SPCS_GET_LOGS](../../sql-reference/functions/spcs_get_logs.md)
      * [SPCS_GET_EVENTS](../../sql-reference/functions/spcs_get_events.md)
      * [SPCS_GET_METRICS](../../sql-reference/functions/spcs_get_metrics.md)
* **Billing:** This topic explains the costs associated with using Snowpark Container Services:

  + [Snowpark Container Services costs](accounts-orgs-usage-views.md)

---
title: Snowpark Container Services Advanced Tutorials
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/overview-advanced-tutorials.md
section: Snowpark Container Services
---

# Snowpark Container Services Advanced Tutorials

These tutorials provide step-by-step instructions for you to explore Snowpark Container Services.

[Tutorial 4: Service-to-Service Communications](tutorials/advanced/tutorial-4.md)
:   In this tutorial you explore how service to service communications works.

[Tutorial 5: Create a service with a block storage volume mounted](tutorials/advanced/tutorial-5-block-storage.md)
:   In this tutorial you create a service that uses a block storage volume.

[Tutorial 6: Configure and test service endpoint privileges](tutorials/advanced/tutorial-6-configure-test-service-role.md)
:   In this tutorial, you explore how to grant a role the [USAGE privilege on the service endpoint](working-with-services.md) so that the role can communicate with the service.

[Tutorial 7: Create a service that uses callers rights](tutorials/advanced/tutorial-7-callers-rights.md)
:   In this tutorial, you explore how to create a service that uses callers rights.

[Tutorial 8: Access the public endpoint programmatically](tutorials/advanced/tutorial-8-access-public-endpoint-programmatically.md)
:   In Tutorial 1, you learned how to access a public endpoint by using a browser. In this tutorial, you access the same endpoint programmatically.

---
title: Snowpark Container Services costs
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/accounts-orgs-usage-views.md
section: Snowpark Container Services
---

# Snowpark Container Services costs

The costs associated with using Snowpark Container Services can be categorized into storage cost, compute pool cost, and data
transfer cost.

## Storage cost

When you use Snowpark Container Services, storage costs associated with Snowflake, including the cost of Snowflake stage usage
or database table storage, apply. For more information, see [Exploring storage cost](../../user-guide/cost-exploring-data-storage.md). In addition, the
following cost considerations apply:

* **Image repository storage cost:** The implementation of the [image repository](working-with-registry-repository.md) uses
  a Snowflake stage. Therefore, the associated cost for using the Snowflake stage applies.
* **Log storage cost:** When you store
  [local container logs in event tables](monitoring-services.md), event table storage
  costs apply.
* **Mounting volumes cost:**

  + When you mount a Snowflake stage as a volume, the cost of using the Snowflake stage applies.
  + When you mount storage from the compute pool node as a volume, it appears as local storage in the container. But there is no
    additional cost because the local storage cost is covered by the cost of the compute pool node.
* **Block storage cost:** When you create a service that uses [block storage](block-storage-volume.md), you are billed for block storage and snapshot storage. For more information about storage pricing, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf). The SPCS Block Storage Pricing table in this document provides the information.

## Compute pool cost

A [compute pool](working-with-compute-pool.md) is a collection of one or more virtual machine (VM) nodes on which Snowflake
runs your Snowpark Container Services jobs and services. The number and type (instance family) of the nodes in the compute pool
(see [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md)) determine the credits it consumes and thus the cost you pay. For more information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You incur charges for a compute pool in the IDLE, ACTIVE, STOPPING, or RESIZING state, but not when it is in a STARTING or
SUSPENDED state. To optimize compute pool expenses, you should leverage the AUTO_SUSPEND feature (see CREATE COMPUTE POOL).

The following views provide usage information:

* **ACCOUNT_USAGE views**

  The following ACCOUNT_USAGE views contain Snowpark Container Services credit usage information:

  + The [SNOWPARK_CONTAINER_SERVICES_HISTORY view](../../sql-reference/account-usage/snowpark_container_services_history.md) offers
    credit usage information (hourly consumption) exclusively for Snowpark Container Services.
  + In the [METERING_DAILY_HISTORY view](../../sql-reference/account-usage/metering_daily_history.md), query for rows in which the
    `service_type` column contains the value `SNOWPARK_CONTAINER_SERVICES`.
  + In the [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md), query for rows in which the
    `service_type` column contains the value `SNOWPARK_CONTAINER_SERVICES`.
* **ORGANIZATION_USAGE views**

  + In the [METERING_DAILY_HISTORY view](../../sql-reference/organization-usage/metering_daily_history.md), use the
    `SERVICE_TYPE = SNOWPARK_CONTAINER_SERVICES` query filter.

## Data transfer cost

Data transfer is the process of moving data into (ingress) and out of (egress) Snowflake. For more information, see
[Understanding data transfer cost](../../user-guide/cost-understanding-data-transfer.md). When you use Snowpark Container Services, the following additional cost
considerations apply:

* **Outbound data transfer:** Snowflake applies the same data transfer rate for outbound data transfers from services and jobs
  to other cloud regions and to the internet, consistent with the rate for all Snowflake outbound data transfers. For more
  information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) (table 4a).

  You can query the [DATA_TRANSFER_HISTORY ACCOUNT_USAGE view](../../sql-reference/account-usage/data_transfer_history.md) for
  usage information. The `transfer_type` column identifies this cost as the `SNOWPARK_CONTAINER_SERVICES` type.
* **Internal data transfer:** This class of data transfer refers to data movements across compute entities within Snowflake, such as
  between two compute pools or a compute pool and a warehouse, that resulted from executing a
  [service function](working-with-services.md).
  For more information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf)
  (tables 4(a) for AWS, 4(b) for Azure, and the column titled “SPCS Data Transfer to Same Cloud Provider, Same Region”).

  To view the costs associated with internal data transfer, you can do the following:

  + Query the [INTERNAL_DATA_TRANSFER_HISTORY view](../../sql-reference/account-usage/internal_data_transfer_history.md) in the ACCOUNT_USAGE schema.
  + Query the [DATA_TRANSFER_HISTORY view](../../sql-reference/account-usage/data_transfer_history.md) in the ACCOUNT_USAGE schema. The
    `transfer_type` column identifies this cost as the `INTERNAL` type.
  + Query the [DATA_TRANSFER_HISTORY view](../../sql-reference/organization-usage/data_transfer_history.md) in the ORGANIZATION_USAGE schema.
    The `transfer_type` column identifies this cost as the `INTERNAL` type.
  + Query the [DATA_TRANSFER_DAILY_HISTORY view](../../sql-reference/organization-usage/data_transfer_daily_history.md) in the ORGANIZATION_USAGE schema. The `service_type` column identifies this cost as the `INTERNAL_DATA_TRANSFER` type.
  + Query the [RATE_SHEET_DAILY view](../../sql-reference/organization-usage/rate_sheet_daily.md) in the ORGANIZATION USAGE
    schema. The `service_type` column identifies this cost as the `INTERNAL_DATA_TRANSFER` type.
  + Query the [USAGE_IN_CURRENCY_DAILY view](../../sql-reference/organization-usage/usage_in_currency_daily.md) in the ORGANIZATION USAGE
    schema. The `service_type` column identifies this cost as the `INTERNAL_DATA_TRANSFER` type.

> **Note:**
>
> Data transfer costs are currently not billed for Snowflake accounts on Google Cloud.

---
title: Snowpark Container Services troubleshooting
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/troubleshooting.md
section: Snowpark Container Services
---

# Snowpark Container Services troubleshooting

This topic discusses common issues and how you might resolve them.

## Image registry

* “invalid consent request” error received when accessing a public service endpoint.

  In the current implementation, Snowflake authenticates the current user using their default role. This error is probably
  because the user is using one of the privileged roles, such as ACCOUNTADMIN or SECURITYADMIN, as their default role
  (see [Blocking specific roles from using the integration](../../user-guide/oauth-custom.md)). Use the
  [ALTER USER](../../sql-reference/sql/alter-user.md) command to change the default role for the user, and try again.
* `docker login` authentication failure.

  Do not use an uppercase hostname in the `docker login` command and then use a lowercase hostname in the
  `docker push`, `docker pull` command. Docker CLI stores credentials with cased keys. Related Docker CLI
  [issue](https://github.com/docker/cli/issues/2753).
* `docker push` error: no Host in request URL.

  When interacting with Docker CLI, always replace the underscores in your account name in a URL with hyphens. Docker CLI will
  return an error if hostnames have underscores in them (even though cURL works). For example, the following Docker push
  specifies “my_acct” as the account name:

  ```bash
  docker push myorg-my_acct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/service_to_service
  ```

  Docker returns this error:

  ```output
  Get "https:/v2/": http: no Host in request URL.
  ```

  You can also use the [SHOW IMAGE REPOSITORIES](../../sql-reference/sql/show-image-repositories.md) command to get a valid repository URL.

## Compute pool

* Suspended compute pool stuck in STOPPING state.

  Running services will prevent the compute pool from stopping. Suspend all remaining active services in the compute pool using
  the [ALTER COMPUTE POOL](../../sql-reference/sql/alter-compute-pool.md) command:

  ```sqlsyntax
  ALTER COMPUTE POOL <pool_name> STOP ALL
  ```

## Service

* A running service is no longer responding to requests (from a service function or a public endpoint). The service status
  changed from running to pending.

  This could be an indication of resource starvation on compute pool nodes. If your containers are resource-intensive
  (CPU/memory), you should explicitly specify resource requests in the service specification to prevent too many
  services (including job services) being placed on a single node.

  In the current implementation, you can only specify memory requests.

  For example, if a node in the compute pool has 64 GB of RAM, requesting more than 32 GB for your service (or job service) would guarantee
  that only one service or job service can be running on a node at a time. For more information about the capability of each instance
  type, see [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md).
* If you are submitting a request to the public endpoint for the service, do not use double quotes in the HTTP authentication
  header.

## Service functions

* Service function timeout and duplicate execution issues.

  If a single service function invocation takes longer than 30 seconds, Snowflake retries the function, and after a few retries,
  the function fails with a timeout error. You might get unexpected results if the function implementation in the container is
  not idempotent. Consider using asynchronous execution by returning a different HTTP code (202), which allows a longer timeout.
  For more information, see [Asynchronous remote service](../../sql-reference/external-functions-implementation.md).

## Ingress

* If authentication fails, it might be because of the network policy associated with the user or account.
* When accessing the public endpoint from the internet, you might find that username/password authentication works, but SSO results in a blank page or the error: “OAuth client integration with the given client ID is not found.”. For information about addressing this issue, see [Ingress and SSO considerations](service-network-communications.md).
* When a client receives a 5XX error from an ingress endpoint (500/503/504), the client should retry, with some backoff. We recommend bounded [exponential backoff](https://en.wikipedia.org/wiki/Exponential_backoff).
* A CORS pre-flight request returns status 404-Endpoint does not exist:

  Check if the endpoint exists by running the [SHOW ENDPOINTS](../../sql-reference/sql/show-endpoints.md) command.
* A CORS pre-flight request returns status 302.

  The CORS request did not have any form of authentication. Due to the inability to distinguish between CORS requests and redirects/anchor tags, we have to assume it’s a redirect/anchor tag case and return a 302.
* Response status 403 “Rejecting a CORS request since cookie was present, and request does not have an auth header”.

  This occurs when a cross-origin non-GET non-HEAD request is attempting to use cookie as the authentication method. The Authentication header is required for these cross-origin requests. This error occurs when the request has cookies present and the Authentication header is not present.
* Error message “Cross-Origin Request Blocked: The Same Origin Policy disallows reading the remote resource at …”.

  Browser returns this error when the service indicated it does not support the requested cross-origin access. This typically happens because the `Origin` provided by the browser does not match one of the origins specified in `Access-Control-Allow-Origin` in the service specification for the endpoint. For these to match, both the scheme (HTTP/HTTPS) and hostname must match.

---
title: Snowpark Container Services Tutorials
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/overview-tutorials.md
section: Snowpark Container Services
---

# Snowpark Container Services Tutorials

These tutorials provide step-by-step instructions for you to explore Snowpark Container Services.

[Common Setup](tutorials/common-setup.md)
:   You need to complete these steps before you can explore any tutorials.

[Tutorial 1: Create a Service](tutorials/tutorial-1.md)
:   Provides step-by-step instructions to create your first Snowpark Container Services service.

[Tutorial 2: Create a Job Service](tutorials/tutorial-2.md)
:   Provides step-by-step instructions to create your first Snowpark Container Services job service.

[Tutorial 3: Create a service and a job using the Snowflake Python APIs](tutorials/tutorial-1-with-sf-python.md)
:   Provides step-by-step instructions to create a service using Snowflake Python API.

---
title: Snowpark Container Services: Guidelines and limitations
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/spcs-guidelines-and-limitations.md
section: Snowpark Container Services
---

# Snowpark Container Services: Guidelines and limitations

* **General limitations:** If you encounter any issues with these limitations, contact your account representative.

  + You can create up to 200 services in your Snowflake account.
  + Each service can have up to 100 endpoints (see [spec.endpoints](specification-reference.md)).
  + Each service can have up to 20 containers (see [spec.containers](specification-reference.md)).
  + Each service can have up to 50 secrets (see [containers.secrets](specification-reference.md)).
  + Each service can have up to 20 volumes (see [spec.volumes](specification-reference.md)).
  + The following limitations apply when you enable internet access (see [Configure service egress](service-network-communications.md)) using external access integrations (EAIs).

    - Each service can support up to 10 EAIs (see [CREATE SERVICE](../../sql-reference/sql/create-service.md) and [ALTER SERVICE](../../sql-reference/sql/alter-service.md)).
    - Each EAI can have up to 100 host names.
  + When accessing the public endpoint from the internet, you might find that username/password authentication works, but SSO results in a blank page or the error: “OAuth client integration with the given client ID is not found.”. For information about addressing this issue, see [Ingress and SSO considerations](service-network-communications.md).
* **Image platform requirements:** Currently, Snowpark Container Services requires linux/amd64 platform images.
* **Service containers are not privileged:** Service containers do not run as privileged, and therefore cannot change
  the configuration of the hardware on the host and can change only limited OS configurations. Service containers can only
  perform operating system configurations that a normal user (that is, a user who doesn’t require root) can do.
* **Renaming the database and schema:**

  + Do not rename databases and schemas where you already created a service. Renaming is effectively moving a service to another
    database and schema, which is not supported. For example:

    - Database and schema information that Snowflake provided to the running service containers will continue to refer to the
      old names.
    - New logs that services ingest in the event table will continue to refer to the old database and schema names.
    - The service function will continue to reference the service in the old database and schema, and when you invoke the service
      function, it will fail.
  + A service specification can reference objects such as Snowflake stages and image repositories. If you rename database or
    schema names where these objects reside, you need to manually update the database and schema names of the referenced objects
    in the service specification.
* **Transferring ownership of parent schema or database:**

  The ownership of the parent database/schema can be transferred to a different role. But the ownership of services inside the database/schema is not transferred to the new role because services run as service owner roles and that does not change. As a result, the services could lose permissions on objects inside the schema, such as image repositories and Snowflake stages in the same schema.

  If ownership transfer of parent schema/database is required, consider re-creating the services.
* **Dropping and un-dropping a database and schema:**

  + When you drop the parent database or schema, services are deleted asynchronously. This means that a service might continue
    to run for some time before internal processes remove it.
  + If you attempt to un-drop a previously deleted database or schema, there is no guarantee that services will be restored.
* **Ownership transfer of services:** [Ownership transfer](../../sql-reference/sql/grant-ownership.md) or future ownership transfer for services, including job services, isn’t supported.
* **Ownership transfer of service functions:**

  The ownership of a service function can be transferred different role. If the new owner role doesn’t have USAGE privilege on the service, function invocations will fail. You need to grant USAGE privilege to the new function owner role.
* **Replication:** When dealing with replication in Snowflake, note the following:

  + Snowpark Container Services objects, such as services, compute pools, and repositories, cannot be replicated.
  + If you create a repository within a database, the entire database cannot be replicated. In cases where the database contains
    other resources, such as services or compute pools, the database replication process will succeed, but these
    individual objects within the database will not be replicated.
* **Job services timeout:** Snowpark Container Services job services runs synchronously by default. If a statement times out, the job service is canceled. The
  default statement timeout is two days. Customers can change the timeout by setting the parameter STATEMENT_TIMEOUT_IN_SECONDS
  using ALTER SESSION.

  ```sqlsyntax
  ALTER SESSION SET statement_timeout_in_seconds=<time>
  ```

  Set it before running the EXECUTE JOB SERVICE command. You can run job services asynchronously, by specifying `ASYNC=true`, to avoid job services from being interrupted by a statement timeout.
* **File staging commands support in Google Cloud:** To use the PUT, GET, LIST, or REMOVE command with Snowflake client libraries on Google Cloud, update your clients to at least the following versions.

  | Client | Version |
  | --- | --- |
  | Go Snowflake Driver | 1.14.1 |
  | Snowflake Connector for Python | 3.16.0 |
  | .NET driver | 4.6.0 |
  | Node.js Driver | 2.1.3 |
  | JDBC Driver | 3.25.1 |
  | ODBC Driver | 3.10.0 |

---
title: Snowpark Container Services: Monitoring Services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/monitoring-services.md
section: Snowpark Container Services
---

# Snowpark Container Services: Monitoring Services

Snowflake provides a variety of mechanisms for monitoring services, jobs, and compute pools. The following sections describe the details.

A user needs the appropriate privileges on services, jobs, and compute pools to access the monitoring data. For more information, see [Privileges needed to perform operations on the service](working-with-services.md) and [Compute pool privileges](working-with-compute-pool.md).

## Publishing and accessing container logs

Snowflake automatically collects and stores container logs — whatever your application container emits to standard
output and standard error — to an event table for later analysis, unless you choose to opt out.
Ensure that your code outputs useful information that can help with debugging your service or conducting retrospective analysis
of your services and jobs.

Use a combination of the following settings to control which container logs are sent to the event table:

* In the service specification, use the [logExporter](specification-reference.md) field
  to indicate which stream (stdout/stderr) should be sent to the event table.
* In [CREATE SERVICE](../../sql-reference/sql/create-service.md) or
  [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command, specify the LOG_LEVEL parameter to indicate the severity at which logs are collected.

When a service container is running, you can also retrieve the container log, without saving the logs to the event table by using the
[SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md) system function. This process is most useful during development and testing
of your service code.

### Publishing container logs

Your application containers can publish structured or unstructured logs:

* **Unstructured logs:** Text that can’t be parsed as JSON
  that your application containers emit
  to standard output and standard error. Snowflake persists these strings to the value
  column in the event table.
* **Structured logs:** These are JSON text that application containers emit to standard
  output and standard error. Snowflake extracts JSON fields and saves them to specific
  columns in the event table. Then, you can query the event table and apply filters
  when exploring events.

  The following JSON structure shows supported fields that Snowflake stores in event
  table columns. If your application emits JSON that includes unsupported fields,
  as shown in the following example, Snowflake ignores those fields.

  ```sqljson
  {
    "severity_text": "DEBUG",    # "<DEBUG, INFO, WARN, ERROR, FATAL>",
    "body": "hello from SPCS",   # <body text>",
    "attributes": {
      "attr_key1": "attr_value1",
      "attr_key2": { "nested_key2": "nested_value2" } },
    },
    "scope": { "name": "val1" },
    "timestamp": "2025-01-01T12:34:56.789Z", # Format: RFC 3339
    # Unsupported fields are dropped.
    "another_field_key1": "another_field_val1",
    "another_field_key2": "another_field_val2",
  }
  ```

  The following table shows the JSON-log field names and the corresponding event-table
  column names where Snowflake stores their values. For description of the log fields,
  see the
  [Event table columns](../logging-tracing/event-table-columns.md) descriptions.

  | JSON field | Event table column | Comment |
  | --- | --- | --- |
  | severity_text | [RECORD](../logging-tracing/event-table-columns.md) | Snowflake saves both `severity_text` and the Snowflake-assigned `severity_number` as Object fields in this column. |
  | attributes | [RECORD_ATTRIBUTES](../logging-tracing/event-table-columns.md). | The fields from structured log are copied as Object fields in this column. |
  | scope | [SCOPE](../logging-tracing/event-table-columns.md) | The fields from the structured log are copied as Object fields in this column. |
  | timestamp | [TIMESTAMP](../logging-tracing/event-table-columns.md) |  |

  The following example shows a container log stored in an event table:

  ```output
  +----------------------+--------------------------+-------------+----------------------------+-------------------+---------------------------------------------------------------------------------------------------+
  |        VALUE         |       TIMESTAMP          | RECORD_TYPE |           RECORD           |       SCOPE       |            RECORD_ATTRIBUTES           |                   RESOURCE_ATTRIBUTES                    |
  +----------------------+--------------------------+-------------+----------------------------+-------------------+---------------------------------------------------------------------------------------------------+
  | "hello from SPCS"    | 2025-01-01T12:34:56.789Z | LOG         | {                          | {                 | {                                      | {                                                        |
  |                      |                          |             |   "severity_number": 5,    |   "name1": "val1" |   "attr_key1": "attr_value1",          |   "snow.account.name": "****",                           |
  |                      |                          |             |   "severity_text": "DEBUG" | }                 |   "attr_key2": {                       |   "snow.compute_pool.id": "****",                        |
  |                      |                          |             | }                          |                   |     "nested_key2": "nested_value2"     |   "snow.compute_pool.name": "MYPO****",                  |
  |                      |                          |             |                            |                   |   }                                    |   "snow.compute_pool.node.id": "****",                   |
  |                      |                          |             |                            |                   | }                                      |   "snow.compute_pool.node.instance_family": "CPU_****",  |
  |                      |                          |             |                            |                   |                                        |   "snow.database.id": "****",                            |
  |                      |                          |             |                            |                   |                                        |   "snow.database.name": "MYDB****",                      |
  |                      |                          |             |                            |                   |                                        |   "snow.query.id": "****",                               |
  |                      |                          |             |                            |                   |                                        |   "snow.schema.id": "****",                              |
  |                      |                          |             |                            |                   |                                        |   "snow.schema.name": "MYSC****",                        |
  |                      |                          |             |                            |                   |                                        |   "snow.service.container.name": "main****",             |
  |                      |                          |             |                            |                   |                                        |   "snow.service.container.run.id": "****",               |
  |                      |                          |             |                            |                   |                                        |   "snow.service.id": "****",                             |
  |                      |                          |             |                            |                   |                                        |   "snow.service.instance": "0",                          |
  |                      |                          |             |                            |                   |                                        |   "snow.service.name": "TEST****",                       |
  |                      |                          |             |                            |                   |                                        |   "snow.service.type": "Service"                         |
  |                      |                          |             |                            |                   |                                        | }                                                        |
  +----------------------+--------------------------+-------------+----------------------------+-------------------+----------------------------------------+----------------------------------------------------------+
  ```

  If you use Python for your application code, you can use
  the [Snowflake-provided log formatter](https://pypi.org/project/snowflake-telemetry-python/)
  (`SnowflakeLogFormatter`) to emit structured logs, as shown in the following example:

  ```python
  from snowflake.telemetry.logs import SnowflakeLogFormatter

  handler = logging.StreamHandler(stream=get_stream(arguments.stream))
  handler.setFormatter(SnowflakeLogFormatter())
  logger.addHandler(handler)
  logger.setLevel(logging.DEBUG) # info by default

  # Emit logs with record attributes (`extra` argument)
  logger.warning("warning log record with attributes", extra={"custom": True})
  logger.debug("debug log with nested attributes", extra={"nested": {"key1": [1, 2, 3]}})
  ```

### Accessing container logs

You can currently access container logs by using the following options:

* **Use the service helper method:** We recommend calling the
  Using the <service-name>!SPCS_GET_LOGS function to retrieve container logs of the specified service or job, collected by Snowflake in the event table.
* **Use the event table directly:** If you have full access to the event table, you can query the event table directly to get historical logs.
* **Use the SYSTEM$GET_SERVICE_LOGS system function:** Call SYSTEM$GET_SERVICE_LOGS to retrieve the logs of the currently running service or job container.

### Using the <service-name>!SPCS_GET_LOGS function

The [<service_name>!SPCS_GET_LOGS](../../sql-reference/functions/spcs_get_logs.md) table function returns logs from the containers of the specified job. These logs are collected by Snowflake and are stored in the event table.

The following list explains the advantages of using this table function:

* You can retrieve logs for a specific service.
* You can retrieve logs within a specified time range.
* The caller doesn’t need access to the entire events table,
  which can be beneficial for customers with strict information-security requirements. If the current session includes the service owner role, then they have access to these logs.

For `service_name`, you specify the name of the service. The function returns logs Snowflake collected from containers of that service (see Publishing and accessing container logs).

You can optionally specify a date range. By default, the function returns one-day logs. For example, the query retrieved logs that Snowflake collected from containers of the `my_test_job` job over the past day, which is the default.

> ```sqlexample
> SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_LOGS());
> ```
>
> Example output:
>
> ```output
> +-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+
> | TIMESTAMP               | INSTANCE_ID | CONTAINER_NAME | LOG                                                                                                                                                                 | RECORD_ATTRIBUTES          |
> |-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------|
> | 2025-06-26 00:23:40.281 |           0 | main           | job-tutorial - INFO - Job finished                                                                                                                                  | {                          |
> |                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
> |                         |             |                |                                                                                                                                                                     | }                          |
> | 2025-06-26 00:23:38.787 |           0 | main           | job-tutorial - INFO - Executing query [select current_time() as time,'hello'] and writing result to table [results]                                                 | {                          |
> |                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
> |                         |             |                |                                                                                                                                                                     | }                          |
> | 2025-06-26 00:23:38.787 |           0 | main           | job-tutorial - INFO - Connection succeeded. Current session context: database="TUTORIAL_DB", schema="DATA_SCHEMA", warehouse="TUTORIAL_WAREHOUSE", role="TEST_ROLE" | {                          |
> |                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
> |                         |             |                |                                                                                                                                                                     | }                          |
> | 2025-06-26 00:23:36.852 |           0 | main           | job-tutorial - INFO - Job started                                                                                                                                   | {                          |
> |                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
> |                         |             |                |                                                                                                                                                                     | }                          |
> +-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+
> ```

For more information about calling this method, see [<service_name>!SPCS_GET_LOGS](../../sql-reference/functions/spcs_get_logs.md).

### Using event table

Snowflake can capture logs sent from containers to the standard output and standard error streams into the event table configured for your account.
For more information about configuring an event table, see
[Logging, tracing, and metrics](../logging-tracing/logging-tracing-overview.md).

You control which streams are collected (all, standard error only, or none) that you want stored in an event table by using the
[spec.logExporters field](specification-reference.md) in the service specification file.

You can then query the event table for events. To find the active event table for the account, use the [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md) command to check the value of the [EVENT_TABLE](../../sql-reference/parameters.md) parameter:

```sqlexample
SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
```

The parameter specifies the active event table for the account.

Next, query that event table. The following SELECT statement retrieves Snowflake service and job events recorded in the past hour:

```sqlexample
SELECT TIMESTAMP, RESOURCE_ATTRIBUTES, RECORD_ATTRIBUTES, VALUE
FROM <current_event_table_for_your_account>
WHERE timestamp > dateadd(hour, -1, current_timestamp())
AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<service_name>'
AND RECORD_TYPE = 'LOG'
ORDER BY timestamp DESC
LIMIT 10;
```

Snowflake recommends that you include a timestamp in the WHERE clause of event
table queries, as shown in this example. This is particularly important
because of the potential volume of data generated by various Snowflake
components. By applying filters, you can retrieve a smaller subset
of data, which improves query performance.

The event table includes the following columns, which provide useful information regarding the logs collected
by Snowflake from your container:

* **TIMESTAMP:** Shows when Snowflake collected the log.
* **RESOURCE_ATTRIBUTES:** Provides a JSON object that identifies the Snowflake service and the container in the service that generated
  the log message. For example, it furnishes details such as the service name, container name, and compute pool name that were specified
  when the service was run.

  ```sqljson
  {
    "snow.account.name": "SPCSDOCS1",
    "snow.compute_pool.id": 20,
    "snow.compute_pool.name": "TUTORIAL_COMPUTE_POOL",
    "snow.compute_pool.node.id": "a17e8157",
    "snow.compute_pool.node.instance_family": "CPU_X64_XS",
    "snow.database.id": 26,
    "snow.database.name": "TUTORIAL_DB",
    "snow.schema.id": 212,
    "snow.schema.name": "DATA_SCHEMA",
    "snow.container.instance": "0",
    "snow.service.container.name": "echo",
    "snow.service.container.run.id": "b30566",
     "snow.service.id": 114,
    "snow.service.name": "ECHO_SERVICE2",
    "snow.service.type": "Service"
  }
  ```
* **RECORD_ATTRIBUTES:** For a Snowflake service, it identifies an
  error source (standard output or standard error).

  ```sqljson
  { "log.iostream": "stdout" }
  ```
* **VALUE:** Standard output and standard error are broken into lines,
  and each line generates a record in the event table.

  ```output
  "echo-service [2023-10-23 17:52:27,429] [DEBUG] Sending response: {'data': [[0, 'Joe said hello!']]}"
  ```

### Using SYSTEM$GET_SERVICE_LOGS

The [SYSTEM$GET_SERVICE_LOGS](../../sql-reference/functions/system_get_service_logs.md) function returns logs of the currently running service container. After a container exits, you can continue to access the logs by using the system function for a short time. System functions are most useful during development and testing, when you are initially authoring a service or a job.

You provide the service name, instance ID, container name, and optionally the number of most recent log lines to retrieve. If only one service instance is running, the service instance ID is 0. For example, the following statement command retrieves the
trailing 10 lines from the log of a container named `echo`
that belongs to instance 0 of a service named `echo_service`:

```sqlexample
SELECT SYSTEM$GET_SERVICE_LOGS('echo_service', '0', 'echo', 10);
```

Example output:

```output
+--------------------------------------------------------------------------+
| SYSTEM$GET_SERVICE_LOGS                                                  |
|--------------------------------------------------------------------------|
| 10.16.6.163 - - [11/Apr/2023 21:44:03] "GET /healthcheck HTTP/1.1" 200 - |
| 10.16.6.163 - - [11/Apr/2023 21:44:08] "GET /healthcheck HTTP/1.1" 200 - |
| 10.16.6.163 - - [11/Apr/2023 21:44:13] "GET /healthcheck HTTP/1.1" 200 - |
| 10.16.6.163 - - [11/Apr/2023 21:44:18] "GET /healthcheck HTTP/1.1" 200 - |
+--------------------------------------------------------------------------+
1 Row(s) produced. Time Elapsed: 0.878s
```

If you don’t have the information about the service that you need to call the function — such as the instance ID or container name — you can first run the [SHOW SERVICE CONTAINERS IN SERVICE](../../sql-reference/sql/show-service-containers-in-service.md) command to get information
about the service instances and containers running in each instance.

The SYSTEM$GET_SERVICE_LOGS function has the following limitations:

* It merges standard output and standard error streams. The function provides no indication of which stream the output came from.
* It reports the captured data for a specific container in a single
  service instance.
* It only reports logs for a running container. The function can’t fetch
  logs from a previous container that was restarted or from a
  container of a service that is stopped or deleted.
* The function returns up to 100 KB of data.

## Access platform metrics

Snowflake provides metrics for [compute pools](working-with-compute-pool.md) in your account and [services](working-with-services.md) running on those compute pools. These metrics, provided by Snowflake, are also referred to as platform metrics.

* **Event-table service metrics:** Individual services publish metrics. These are a subset of the compute pool metrics that provide information specific to the service. The target use case for this is to observe the resource utilization of a specific service. In the service specification, you define which metrics you want Snowflake to record in the event table while the service is running.
* **Compute pool metrics:** Each compute pool also publishes metrics that provide information about what is happening inside that compute pool. The target use case for this is to observe the compute pool utilization. To access your compute pool metrics, you will need to write a service that uses Prometheus-compatible API to poll the metrics that the compute pool publishes.

### Accessing event-table service metrics

To log metrics from a service into the event table configured for your account, include the following section in your service specification:

```yaml
platformMonitor:
  metricConfig:
    groups:
    - <group 1>
    - <group 2>
    - ...
```

Where each `group N` refers to a predefined metrics group that you are interested in; for example, `system`, `network`, or `storage`. For more information, see the [spec.platformMonitor field](specification-reference.md) section in the documentation on the service specification.

While the service is running, Snowflake records these metrics to the event table in your account. You can read these metrics in the following ways:

* **Using the service helper method:** The [<service_name>!SPCS_GET_METRICS](../../sql-reference/functions/spcs_get_metrics.md) table function returns metrics Snowflake collected for the specified service. The following list explains advantages of using this table function:

  + You can retrieve metrics for a specific service.
  + You can retrieve metrics within a specified time range.
  + The caller doesn’t need access to the entire events table, which can be beneficial for customers with strict information security requirements.

  The following SELECT statement uses the table function to retrieve platform events for the specified service that was recorded in the past hour:

  ```sqlexample
  SELECT *
    FROM TABLE(mydb.myschema.echo_service!SPCS_GET_METRICS(start_time => dateadd('hour', -1, current_timestamp())));
  ```
* **Query the events table directly:** You can query your event table to read the metrics. The following query retrieves the service metrics that were recorded in the past hour for the service `my_service`:

  ```sqlexample
  SELECT timestamp, value
    FROM my_event_table_db.my_event_table_schema.my_event_table
    WHERE timestamp > DATEADD(hour, -1, CURRENT_TIMESTAMP())
      AND RESOURCE_ATTRIBUTES:"snow.service.name" = 'MY_SERVICE'
      AND RECORD_TYPE = 'METRIC'
      ORDER BY timestamp DESC
      LIMIT 10;
  ```

  If you don’t know the name of the active event table for the account, run the [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md) command to display the value of the account-level [EVENT_TABLE](../../sql-reference/parameters.md) parameter:

  ```sqlexample
  SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
  ```

  For more information about event tables, see Using event table.

**Example**

To create an example service that records metrics to the event table that is configured for your account, complete the following steps.

1. Create a service named `echo_service` by following the steps in [Tutorial 1](tutorials/tutorial-1.md), with one change. In step 3, where you create a service, use the following CREATE SERVICE command, which adds the `platformMonitor` field in the modified service specification:

   ```sqlexample-yaml
   CREATE SERVICE echo_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
       spec:
         containers:
         - name: echo
           image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
           env:
             SERVER_PORT: 8000
             CHARACTER_NAME: Bob
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: 8000
           public: true
         platformMonitor:
           metricConfig:
             groups:
             - system
             - system_limits
         $$
       MIN_INSTANCES=1
       MAX_INSTANCES=1;
   ```

> After the service is running, Snowflake starts recording the metrics in the specified metric groups to the event table.

1. Access the metrics by calling the <service_name>!SPCS_GET_METRICS function or by querying the event table. For example, retrieve metrics reported in the last hour by the echo_service service:

   > * Use the [<service_name>!SPCS_GET_METRICS](../../sql-reference/functions/spcs_get_metrics.md) helper function:
   >
   >   ```sqlexample
   >   SELECT *
   >   FROM TABLE(mydb.myschema.echo_service!SPCS_GET_METRICS(START_TIME => DATEADD('hour', -1, CURRENT_TIMESTAMP())));
   >   ```
   > * Query the event table directly:
   >
   >   ```sqlexample
   >   SELECT timestamp, value
   >    FROM my_events
   >    WHERE timestamp > DATEADD(hour, -1, CURRENT_TIMESTAMP())
   >      AND RESOURCE_ATTRIBUTES:"snow.service.name" = 'ECHO_SERVICE'
   >      AND RECORD_TYPE = 'METRIC'
   >      AND RECORD:metric.name = 'container.cpu.usage'
   >      ORDER BY timestamp DESC
   >      LIMIT 100;
   >   ```

### Access compute pool metrics

[Compute pool](working-with-compute-pool.md) metrics offer insights into the nodes in the compute pool and the services running on them. Each node reports node-specific metrics, such as the amount of available memory for containers, as well as service metrics, like the memory usage by individual containers. The compute pool metrics provide information from a node’s perspective.

Each node has a metrics publisher that listens on TCP port 9001. Other services can make an HTTP GET request with the path `/metrics` to port 9001 on the node. To discover the node’s IP address, retrieve SRV records (or A records) from DNS for the `discover.monitor.compute_pool_name.cp.spcs.internal` hostname. Then, create another service in your account that actively polls each node to retrieve the metrics.

The body in the response provides the metrics using the
[Prometheus format](https://prometheus.io/docs/instrumenting/exposition_formats/#text-based-format)
as shown in the following example metrics:

```output
# HELP node_memory_capacity Defines SPCS compute pool resource capacity on the node
# TYPE node_memory_capacity gauge
node_memory_capacity{snow_compute_pool_name="MY_POOL",snow_compute_pool_node_instance_family="CPU_X64_S",snow_compute_pool_node_id="10.244.3.8"} 1
node_cpu_capacity{snow_compute_pool_name="MY_POOL",snow_compute_pool_node_instance_family="CPU_X64_S",snow_compute_pool_node_id="10.244.3.8"} 7.21397383168e+09
```

Note the following:

* The response body starts with `# HELP` and `# TYPE`, which provide a short description and the type of the metric. In this example, the `node_memory_capacity` metric is of type `gauge`.
* It is then followed by the metric’s name, a list of labels describing a specific resource (data point), and its value. In this example, the metric (named `node_memory_capacity`) provides memory information, indicating that the node has 7.2 GB available memory. The metric also includes metadata in the form of labels as shown:

  ```output
  snow_compute_pool_name="MY_POOL",
  snow_compute_pool_node_instance_family="CPU_X64_S",snow_compute_pool_node_id="10.244.3.8"
  ```

You can process these metrics any way you choose; for example, you might store metrics in a database and use a UI (such as a Grafana dashboard) to display the information.

> **Note:**
>
> * Snowflake does not provide any aggregation of metrics. For example, to get metrics for a given service, you must query all nodes that are running instances of that service.
> * The compute pool must have a DNS-compatible name for you to access the metrics.
> * The endpoint exposed by a compute pool can be accessed by a service using a role that has the OWNERSHIP or MONITOR privilege on the compute pool.

For a list of available compute pool metrics, see Available platform metrics.

**Example**

For an example of configuring Prometheus to poll your compute pool for metrics, see the [compute pool metrics tutorials](https://github.com/Snowflake-Labs/spcs-templates/tree/main/user-metrics).

### Available platform metrics

The following is a list of available platform metrics groups and metrics within each group. Note that `storage` metrics are currently only collected from block storage volumes.

| Metric group . Metric name | Unit | Type | resource_attributes | record_attributes | Description |
| --- | --- | --- | --- | --- | --- |
| system . container.cpu.usage | cpu cores | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | Average number of CPU cores used since last measurement. 1.0 indicates full utilization of 1 CPU core. Max value is number of cpu cores available to the container. |
| system . container.memory.usage | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | Memory used, in bytes. |
| system . container.gpu.memory.usage | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | Per-GPU memory used, in bytes. The source GPU is denoted in the ‘gpu’ attribute. |
| system . container.gpu.utilization | ratio | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | gpu | Ratio of per-GPU usage to capacity. The source GPU is denoted in the ‘gpu’ attribute. |
| system_limits . container.cpu.limit | cpu cores | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | CPU resource limit from the service specification. If no limit is defined, defaults to node capacity. |
| system_limits . container.gpu.limit | gpus | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | gpu | GPU count limit from the service specification. If no limit is defined, the metric is not emitted. |
| system_limits . container.memory.limit | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | Memory limit from the service specification. If no limit is defined, defaults to node capacity. |
| system_limits . container.cpu.requested | cpu cores | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | CPU resource request from the service specification. If no limit is defined, this defaults to a value chosen by Snowflake. |
| system_limits . container.gpu.requested | gpus | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | gpu | GPU count from the service specification. If no limit is defined, the metric is not emitted. |
| system_limits . container.memory.requested | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | gpu | Memory request from the service specification. If no limit is defined, this defaults to a value chosen by Snowflake. |
| system_limits . container.gpu.memory.capacity | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | gpu | Per-GPU memory capacity. The source GPU is denoted in the ‘gpu’ attribute. |
| status . container.restarts | restarts | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | Number of times Snowflake restarted the container. |
| status . container.state.finished | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | When the container is in the ‘finished’ state, this metric will be emitted with the value 1. |
| status . container.state.last.finished.reason | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | reason | If the container has restarted previously, this metric will be emitted with the value 1. The ‘reason’ label describes why the container last finished. |
| status . container.state.last.finished.exitcode | integer | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | If a container has restarted previously, this metric will contain the exit code of the previous run. |
| status . container.state.pending | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | When a container is in the ‘pending’ state, this metric will be emitted with the value 1. |
| status . container.state.pending.reason | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id | reason | When a container is in the ‘pending’ state, this metric will be emitted with the value 1. The ‘reason’ label describes why the container was most recently in the pending state. |
| status . container.state.running | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | When a container is in the ‘running’ state, this metric will have the value 1. |
| status . container.state.started | boolean | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id |  | When a container is in the ‘started’ state, this metric will have the value 1. |
| network . network.egress.denied.packets | packets | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id |  | Network egress total denied packets from service instance due to policy validation failures. |
| network . network.egress.received.bytes | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id |  | Network egress total bytes received by service instance from remote destinations. |
| network . network.egress.received.packets | packets | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id |  | Network egress total packets received by service instance from remote destinations. |
| network . network.egress.transmitted.bytes | byte | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id |  | Network egress total bytes transmitted by service instance out to remote destinations. |
| network . network.egress.transmitted.packets | packets | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id |  | Network egress total packets transmitted by service instance out to remote destinations. |
| network . network.ingress.connections.active | connections | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id snow.endpoint.name |  | Number of active ingress connections for this endpoint. This metric includes the resource attribute `snow.endpoint.name` to determine the value per endpoint. |
| network . network.ingress.cps | connections/sec | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.service.container.name snow.query.id snow.endpoint.name |  | Number of ingress connections to this endpoint per second. This metric includes the resource attribute `snow.endpoint.name` to help you to determine the value per endpoint. |
| storage . volume.capacity | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Size of the filesystem. The target volume is denoted in the `volume_name` attribute. |
| storage . volume.io.inflight | operations | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Number of active filesystem I/O operations at current instant. The target volume is denoted in the `volume_name` attribute. |
| storage . volume.read.throughput | bytes/sec | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Filesystem reads throughput in bytes per second since last measurement. The target volume is denoted in the `volume_name` attribute. |
| storage . volume.read.iops | operations/sec | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Filesystem read operations per second since last measurement. The target volume is denoted in the `volume_name` attribute |
| storage . volume.usage | bytes | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Total number of bytes used in the filesystem since last measurement. The target volume is denoted in the `volume_name` attribute. |
| storage . volume.write.throughput | bytes/sec | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Filesystem write throughput in bytes per second since last measurement. The target volume is denoted in the `volume_name` attribute. |
| storage . volume.write.iops | operations/sec | gauge | snow.account.name snow.compute_pool.id snow.compute_pool.name snow.compute_pool.node.id snow.compute_pool.node.instance_family snow.database.id snow.database.name snow.schema.id snow.schema.name snow.service.id snow.service.name snow.service.type snow.container.instance snow.query.id | snow_volume_id snow_volume_name snow_volume_replica volume_type | Filesystem write operations per second since last measurement. The target volume is denoted in the `volume_name` attribute. |

As shown in the preceding table, the platform metrics contain the following attributes. These attributes are stored in the event table resource_attributes and record_attributes columns. Snowflake exposes these attributes as Prometheus labels when scraped directly from the node.

**Resource attributes**

* `snow.account.name`: Name of the account that launched the service.
* `snow.compute_pool.id`: Id of the compute pool where the service was scheduled.
* `snow.compute_pool.name`: Name of the compute pool where service was scheduled.
* `snow.compute_pool.node_id`: Id of the compute pool node running the container that produced this metric.
* `snow.compute_pool.node.instance_family`: The type of the instance family of the compute pool that is running the service. For more information, see [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md).
* `snow.database.id`: Id of the database that owns the service.
* `snow.database.name`: Name of the database that owns the service.
* `snow.schema.id`: Id of the schema that owns the service.
* `snow.schema.name`: Name of the schema that owns the service
* `snow.service.id`: Id of the service.
* `snow.service.name`: Name of the service.
* `snow.service.type`: Specifies whether the container is a job service or a long-running service.
* `snow.service_container.instance`: Id of the container instance that produced the metric.
* `snow.service.container.name`: Name of the container that produced the metric.
* `snow.query.id`: The [uuid of the query](../../sql-reference/functions/last_query_id.md) that created the service.

**Record attributes**

* `gpu`: Index of the gpu from which this metric originated, starting with 0.
* `reason`: Explains the container state. This attribute appears only for metrics that end with reason suffix.

  + `spcs.container.state.pending.reason`

    - `FailedToPullImage`: Container cannot pull image.
    - `FailingToStartContainer`: Container cannot be started. It is getting scheduled to the node, but then fails.
    - `ServiceRunError`: Runtime error occurred resulting in the container eviction.
    - `ServiceSpecError`: Container cannot be scheduled because error in service specification.
    - `ServiceCreateError`: Error during container initialization.
    - `Initializing`: Container is currently initializing.
    - `Creating`: Container in process of creating, for example, pulling an image.
  + `container.state.last.finished.reason`

    - `Done`: Container finished without error.
    - `Failed`: Container terminated with an error.
    - `FailedWithOOM`: Container terminated after exceeding memory limit from service specification.
    - `FailedToStart`: Container did not start due to error.
* `resource`: Node resource that the metric describes (cpu, memory, gpu, gpu_memory).
* `snow_volume_id`: Id of the volume.
* `snow_volume_name`: Name of the volume.
* `snow_volume_replica`: Indicates the service instance’s ordinal identity within a service. For example, `snow_volume_replica="3"` represents the third instance of that service.
* `volume_type`: Volume type (local, memory, block, and Snowflake stage).

## Publishing and accessing application metrics

Application metrics and traces are generated by your service in contrast to platform metrics that Snowflake generates. Your service
containers can generate OLTP or Prometheus metrics and Snowflake publishes them to the event table configured for your account.

Note that you should ensure that your service container code outputs metrics with the correct units, aggregation, and instrumentation
types to generate metrics that are meaningful and effective for your analysis.

### Publishing OTLP application metrics and traces

Snowflake runs an OTel collector that your service container can use to publish OTLP application metrics and traces. That is, a service container can push metrics to the OTel collector endpoints, which Snowflake then writes to the event table configured for your Snowflake account along with the originating service details.

It works as follows:

* Snowflake automatically populates the following environment variables in your service container that provide the OTel collector endpoints where containers can publish application metrics and traces:

  + `OTEL_EXPORTER_OTLP_METRICS_ENDPOINT`
  + `OTEL_EXPORTER_OTLP_TRACES_ENDPOINT`
* The [standard OTLP client](https://opentelemetry.io/docs/languages/) looks for these environment variables to discover the OTel collector automatically. This enables your service container to publish metrics and traces using this client.

#### Configuring OTLP application Trace IDs

Traces must use the Snowflake Trace ID format to be viewable in [Snowflake Trail](https://www.snowflake.com/en/product/features/snowflake-trail/) and allow for performant lookup.

Snowflake provides Python and Java libraries to simplify Trace ID generator setup. The following examples show how to override the default OpenTelemetry trace ID generator with these libraries.

```python
from opentelemetry.sdk.trace import TracerProvider
from snowflake.telemetry.trace import SnowflakeTraceIdGenerator

trace_id_generator = SnowflakeTraceIdGenerator()
tracer_provider = TracerProvider(
    resource=Resource.create({"service.name": SERVICE_NAME}),
    id_generator=trace_id_generator
)
```

For more information, see [snowflake-telemetry-python](https://pypi.org/project/snowflake-telemetry-python/) on PyPI.

```Java
import io.opentelemetry.sdk.autoconfigure.AutoConfiguredOpenTelemetrySdk;
import com.snowflake.telemetry.trace.SnowflakeTraceIdGenerator;

static OpenTelemetry initOpenTelemetry() {
  return AutoConfiguredOpenTelemetrySdk.builder()
      .addPropertiesSupplier(
          () ->
              Map.of(...config options...)
      .addTracerProviderCustomizer(
          (tracerProviderBuilder, configProperties) -> {
            tracerProviderBuilder.setIdGenerator(SnowflakeTraceIdGenerator.INSTANCE);
            return tracerProviderBuilder;
          })
      .build()
      .getOpenTelemetrySdk();
```

For more information about installing `com.snowflake.telemetry`, see [Setting up your Java and Scala environment to use the Telemetry class](../logging-tracing/telemetry-build-maven.md).

A trace ID generator can be implemented for any other programming language as well. The 16-byte ID (big endian) must contain a timestamp in the four highest-order bytes. The other bytes should contain random bits. For more information, see [Python reference implementation](https://github.com/snowflakedb/snowflake-telemetry-python/blob/0c5b4faf024997d993f7cd1d00e6ae0cb0bb7d08/src/snowflake/telemetry/trace/__init__.py#L14).

### Publishing Prometheus application metrics

Snowflake supports Prometheus metrics where instead of pushing OTLP metrics, your application might expose Prometheus metrics to be polled
by a Snowflake-provided collector. For Snowflake to collect these application metrics from your service and publish them to the event
table, follow these steps:

* Have your service listen on a port, which exposes your Prometheus metrics.
* Include in your service a Snowflake-provided container (also referred to as “sidecar” container), with necessary configuration to pull
  the metrics from your service container.

The Prometheus sidecar pulls the application metrics from the container at a scheduled frequency, converts the Prometheus format to OTLP format, and pushes the metrics to the OTel collector. The OTel collector then publishes those metrics into the event table configured for your Snowflake account.

> **Note:**
>
> Snowflake doesn’t support Prometheus Summary [metric type](https://prometheus.io/docs/concepts/metric_types/), as it is [deprecated by OpenTelemetry](https://opentelemetry.io/docs/specs/otel/metrics/data-model/#summary-legacy). Use the Histogram type instead.

You add the Prometheus sidecar container to the service specification as another container and include an argument to specify the HTTP endpoint exposed by your container, using the following format:

```output
localhost:{PORT}/{METRICS_PATH}, {SCRAPE_FREQUENCY}
```

It specifies a port number, path, and frequency at which the sidecar should pull the metrics.

An example service specification fragment shows the sidecar container scraping metrics every minute from your service container from port 8000 and pulling metrics from the path “/metrics”:

```yaml
spec:
  containers:
  - name: <name>
    image: <image-name>
    .....
  - name: prometheus
    image: /snowflake/images/snowflake_images/monitoring-prometheus-sidecar:0.0.1
    args:
      - "-e"
      - "localhost:8000/metrics,1m"
```

In the specification:

* `image` is the Snowflake-provided sidecar container image.
* `args` provides necessary configuration for the prometheus container to scrape metrics:

  + From port 8000 provided by your container. The port is required in this prometheus container configuration.
  + Using path “/metrics”. It is optional. If not specified, “/metrics” is the default path.
  + Every minute. It is optional. If not specified, “1m” is the default.

  If you leverage the defaults, this is the equivalent configuration for scraping metrics:

  ```yaml
  spec:
      ...
      args:
        - "-e"
        - "localhost:8000"
  ```

> **Note:**
>
> The Prometheus sidecar container is only supported for services (not jobs). If you want to collect application metrics for a job, it must push the metrics to the OTel collector.

### Accessing application metrics and traces in the event table

You can query the event table to retrieve application metrics. The following query retrieves the application metrics collected in the past hour.

```sqlexample
SELECT timestamp, record:metric.name, value
  FROM <current_event_table_for_your_account>
  WHERE timestamp > dateadd(hour, -1, CURRENT_TIMESTAMP())
    AND resource_attributes:"snow.service.name" = <service_name>
    AND scope:"name" != 'snow.spcs.platform'
    AND record_type = 'METRIC'
  ORDER BY timestamp DESC
  LIMIT 10;
```

For more information about event tables, see [Event table overview](../logging-tracing/event-table-setting-up.md). You can visualize these metrics in [Snowflake dashboards](../../user-guide/ui-snowsight-dashboards.md).

You can also query your event table to view the application traces. For example, to retrieve application traces from the past hour, in the preceding query, replace the `record_type` condition as follows:

```sqlexample
AND record_type = 'SPAN' OR record_type = 'SPAN_EVENT'
```

Traces can be visualized in the [Snowflake trail](https://www.snowflake.com/en/data-cloud/snowflake-trail/) viewer.

Metrics and traces contain both user-defined and Snowflake-defined attributes as resource and record attributes. Note that the `snow.` prefix is reserved for Snowflake-generated attributes, Snowflake ignores custom attributes that use this prefix. To see a list of Snowflake defined attributes see Available platform metrics.

[Example code](https://github.com/Snowflake-Labs/spcs-templates/tree/main/application-observability) is provided in both Python and Java that demonstrates instrumenting an application with custom metrics and traces using the OTLP SDK. The examples show how to configure Snowflake Trace ID generation for compatibility with the Snowflake trail viewer for traces.

## Accessing platform events

Snowflake records events that provide visibility into the status and history of services. These Snowflake-provided events are referred to as *platform events*.

For example, if your service container is currently running but was restarted a day earlier due to a fatal error (such as an out-of-memory condition), you can use platform events to view this historical event.

Snowflake logs these platform events in the event table in your account. By default, platform events are not logged. To enable the logging of platform events, set the [LOG_EVENT_LEVEL](../../sql-reference/parameters.md) parameter when creating resources (for example, when running CREATE SERVICE) or use ALTER statements to update the level for existing resources.

Container log lines from standard output and standard error still use [LOG_LEVEL](../../sql-reference/parameters.md); platform events use `LOG_EVENT_LEVEL`. For more information, see [Parameters](../../sql-reference/parameters.md) and [New LOG_EVENT_LEVEL parameter to control events](../../release-notes/bcr-bundles/2026_02/bcr-2229.md).

> **Note:**
>
> If the LOG_EVENT_LEVEL parameter is not set at the resource level, Snowflake can inherit the value of the parameter that is set at a higher level. For a service, Snowflake can inherit the value of the LOG_EVENT_LEVEL parameter that is set on the schema, database, or the account of the service. For more information, see [How Snowflake determines the level in effect](../logging-tracing/telemetry-levels.md).

You can check the current value set for a service by running [SHOW PARAMETERS … IN SERVICE](../../sql-reference/sql/show-parameters.md):

```sqlexample
SHOW PARAMETERS LIKE 'LOG_EVENT_LEVEL' IN SERVICE mydb.myschema.myservice;
```

The value of the LOG_EVENT_LEVEL parameter determines the severity of platform events you want recorded in the event table. In the current implementation, the supported LOG_EVENT_LEVEL values are: `INFO` and `ERROR`.

* If you want to record only ERROR platform events in the event table, set LOG_EVENT_LEVEL to `ERROR`.
* If you want `INFO` and `ERROR` platform events recorded in the event table, set LOG_EVENT_LEVEL to `INFO`.
* If you want to stop recording platform events in the event table, set LOG_EVENT_LEVEL to `OFF`.

For more information, see [Setting telemetry levels](../logging-tracing/telemetry-levels.md).

### Query platform events

After you configure LOG_EVENT_LEVEL for your resource, Snowflake records the platform events to the active event table in your Snowflake account. You can access these events in the following ways:

* **Using the service helper method:** The [<service_name>!SPCS_GET_EVENTS](../../sql-reference/functions/spcs_get_events.md) table function returns events collected by Snowflake from the containers of the specified service.

  The following list explains the advantages of using this table function:

  + You can retrieve events for a specific service.
  + You can retrieve events within a specified time range.
  + The caller doesn’t need access to the entire events table, which can be beneficial for customers with strict information security requirements.

  The following SELECT statement uses the table function to retrieve platform events for the specified service recorded in the past hour:

  ```sqlexample
  SELECT *
  FROM TABLE(mydb.myschema.echo_service!SPCS_GET_EVENTS(START_TIME => DATEADD('hour', -1, CURRENT_TIMESTAMP())));
  ```
* **Using the event table directly:** You can query the event table directly. To find the active event table for the account, use the [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md) command to check the value of the [EVENT_TABLE](../../sql-reference/parameters.md) parameter:

  ```sqlexample
  SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
  ```

  The parameter specifies the active event table for the account.

  Next, query that event table. The following SELECT statement retrieves platform events for the specified service that was recorded in the past hour:

  ```sqlexample
  SELECT TIMESTAMP, RESOURCE_ATTRIBUTES, RECORD, VALUE
    FROM <your_event_table>
    WHERE TIMESTAMP > DATEADD(hour, -1, CURRENT_TIMESTAMP())
      AND RESOURCE_ATTRIBUTES:"snow.service.name" = '<your_service_name>'
      AND RECORD_TYPE = 'EVENT'
      AND SCOPE:"name" = 'snow.spcs.platform'
    ORDER BY TIMESTAMP DESC
    LIMIT 10;
  ```

  For more information about event tables, see Using event table.

  The following columns in the event table provide useful information about the platform events:

  + **TIMESTAMP:** Shows when the event was recorded.
  + **RESOURCE_ATTRIBUTES:** Provides a JSON object with metadata about the event source, such as a service, a container, or a compute pool. The following example of a value in the `resource_attribute` column identifies a specific service for which the event is recorded

    ```json
    {
      "snow.compute_pool.name": "TUTORIAL_COMPUTE_POOL",
      "snow.compute_pool.id": 123,
      "snow.database.name": "TUTORIAL_DB",
      "snow.database.id": 456,
      "snow.schema.name": "DATA_SCHEMA",
      "snow.schema.id": 789,
      "snow.service.container.name": "echo",
      "snow.service.name": "ECHO_SERVICE2",
      "snow.service.id": 212,
      "snow.service.type": "Service"
    }
    ```
  + **SCOPE:** Indicates the origin of the event. For platform events, the name of the scope is `snow.spcs.platform`, as shown in the following example:

    ```json
    { "name": "snow.spcs.platform" }
    ```
  + **RECORD_TYPE:** For platform events, EVENT is the RECORD_TYPE.
  + **RECORD:** Provides metadata about the specific event. The following metadata shows the name and severity level of the platform event:

    ```json
    { "name": "CONTAINER.STATUS_CHANGE", "severity_text": "INFO" }
    ```
  + **VALUE:** Provides the event details. The following example shows the status and a message about the status of the container:

    ```json
    { "message": "Running", "status": "READY" }
    ```

### Supported events

Currently, Snowflake supports only the container status change events.

The following table lists the platform events that Snowflake records. `RECORD` and `VALUE` in the column names refer to the columns in the event table (explained in the preceding section).

| RECORD:name | RECORD:severity_text | VALUE:message | VALUE:status |
| --- | --- | --- | --- |
| CONTAINER.STATUS_CHANGE | INFO | Running | READY |
| CONTAINER.STATUS_CHANGE | INFO | Readiness probe is failing at path: <path>, port: <port> | PENDING |
| CONTAINER.STATUS_CHANGE | INFO | Waiting to start | PENDING |
| CONTAINER.STATUS_CHANGE | INFO | Compute pool node(s) are being provisioned | PENDING |
| CONTAINER.STATUS_CHANGE | ERROR | Failed to pull image | PENDING |
| CONTAINER.STATUS_CHANGE | ERROR | Provided image name uses an invalid format | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | Encountered fatal error, retrying | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | Encountered fatal error | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | Encountered fatal error while running, check container logs | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | Container was OOMKilled due to resource usage | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | User application error, check container logs | FAILED |
| CONTAINER.STATUS_CHANGE | ERROR | Encountered fatal error while starting container | FAILED |
| CONTAINER.STATUS_CHANGE | INFO | Completed successfully | DONE |

## Guidelines and limitations

* Maximum throughput for logs ingested to the event table per node is 1 MB/second for Snowflake accounts on AWS and Azure.
* Maximum combined throughput for metrics and traces ingested to the event table is 1 MB/second per node for both Azure and AWS.
* Maximum record size for logs ingested to the event table is 16 KiB.

---
title: Snowpark Container Services: SQL execution
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/spcs-execute-sql.md
section: Snowpark Container Services
---

# Snowpark Container Services: SQL execution

Your application container can connect to Snowflake and execute SQL. This topic describes how container code obtains the required information to connect to Snowflake, including authentication credentials, the database and schema context of the service, and the warehouse used to run SQL statements.

## Credential configuration options

Snowflake recommends that application containers use Snowflake-provided credentials to authenticate to Snowflake when executing SQL. Although it is possible to use other credentials by connecting through an external access integration (EAI), connecting through an EAI treats the service as if it were running outside Snowflake and connecting to Snowflake over the internet.

You have three options to connect to Snowflake from a service container:

* **Use Snowflake-provided service user credentials:** Snowflake provides every service with credentials, which are referred to as service credentials. A service uses these credentials to connect to Snowflake as the service user.
* **Use Snowflake-provided caller credentials (caller’s rights):** When you configure your service with caller’s rights, Snowflake also provides credentials for the service to connect to Snowflake as the calling user.
* **Use other credentials:** In this case, you use an external access integration (EAI) that allows your service to connect to Snowflake’s internet endpoint by using valid authentication credentials. This option requires an administrator to create the EAI, and then grant the USAGE privilege on the integration to the service owner role.

  > **Note:**
  >
  > If you use external access integrations to access Snowflake, you might send potentially sensitive information over the internet.

For examples of code that uses various Snowflake drivers to connect to Snowflake, see [Snowflake Connection Samples](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/spcs/sf-connection).

### Using Snowflake-provided service user credentials

When you use Snowflake-provided service credentials, be aware of the following effects:

* Every object in Snowflake has an *owner role*, which is the role that is used to create the object. A service’s owner role determines the capabilities that the service is allowed when it interacts with Snowflake. These capabilities include executing SQL, accessing stages, and performing service-to-service networking.
* When you create a service, Snowflake also creates a service user that is specific to that service. That service user has access to only two roles: the service owner role and the ‘PUBLIC’ role. The default role for the service user is the service owner role.

When you start a service, including job services, Snowflake performs several actions. In each of your application containers, Snowflake enables the container code to use drivers for connecting to Snowflake and executing SQL, which is similar to any other code on your computer connecting to Snowflake. The following list shows the actions that Snowflake performs when you start a service:

* Provides credentials (an OAuth token) in the container in a file that is named `/snowflake/session/token`. The container code uses
  these credentials to authenticate as the service user. This OAuth token can’t be used outside Snowpark Container Services
* Sets the following environment variables for you to configure a Snowflake client in your service code:

  + SNOWFLAKE_ACCOUNT: This variable is set to the [account locator](../../user-guide/admin-account-identifier.md) for the Snowflake account that the service is currently running under.
  + SNOWFLAKE_HOST: This variable provides the hostname that is used to connect to Snowflake.

When you create a connection to Snowflake as the service user, container code must use SNOWFLAKE_HOST, SNOWFLAKE_ACCOUNT, and the OAuth token. The OAuth token can’t be used without also using SNOWFLAKE_HOST.

**Example**

In [Tutorial 2](tutorials/tutorial-2.md) (see `main.py`), the code reads the environment variables as shown in the following example:

```python
SNOWFLAKE_ACCOUNT = os.getenv('SNOWFLAKE_ACCOUNT')
SNOWFLAKE_HOST = os.getenv('SNOWFLAKE_HOST')
```

The code passes these variables to a connection creation code for the Snowflake client of choice. The container uses these
credentials to create a new session, with the service’s owner role as the session’s primary role, to run queries. The following example
shows the minimum code that you need to create a Snowflake connection in Python:

```python
def get_login_token():
  with open('/snowflake/session/token', 'r') as f:
    return f.read()

conn = snowflake.connector.connect(
  host = os.getenv('SNOWFLAKE_HOST'),
  account = os.getenv('SNOWFLAKE_ACCOUNT'),
  token = get_login_token(),
  authenticator = 'oauth'
)
```

Be aware of the following details about this OAuth token:

* Snowflake refreshes the content of the `/snowflake/session/token` file every few minutes. Every token is valid
  for up to one hour. After a container connects to Snowflake successfully, the
  expiration time doesn’t apply to the connection, as is the case with any sessions that users create directly.
* This OAuth token is valid only within the specific Snowflake service. You can’t copy the OAuth token and use it outside
  the service.
* If you use the OAuth token to connect, it creates a new session. The OAuth token is not associated with any existing SQL session.

  > **Note:**
  >
  > A significant difference between executing stored procedures and executing a service is that stored procedures run in
  > the same session as the SQL that runs the procedures. But every time a container establishes a new connection, you create a new
  > session.

To view the queries issued by a specific service user, you can use the ACCOUNTADMIN role to view the
[query history](../../user-guide/ui-snowsight-activity.md).
The user name of the service user appears in the following forms:

* For a service created before the 8.35 server release, the service user name is of the format `SF$SERVICE$unique-id`.
* For a service created after the 8.35 server release, the service user name is the same as the service name.

> **Note:**
>
> A service’s owner role is the role that created the service. You can define one or more service roles to manage access to the endpoints that the service exposes. For more information, see [Managing service-related privileges](working-with-services.md).

### About using Snowflake-provided caller credentials (caller’s rights)

In certain application scenarios, you might need to execute
queries by using the context of the end user rather than the service user as explained in the preceding section. The caller’s rights feature is used in this context.

For example, suppose that you create a service that exposes a public endpoint for a web application that displays a dashboard that uses data stored
in Snowflake. You grant other users in your Snowflake account access to the dashboard by granting them the
[service role](working-with-services.md). When a user signs in, the dashboard displays only the data that user
is authorized to access.

However, because containers by default execute queries by using the service user and the service’s owner role,
the dashboard shows the data that the service’s owner role has access to, regardless of which end user
connected to the endpoint. As a result, the dashboard isn’t limited to the data the end user is authorized to access, allowing the signed-in user to see data they shouldn’t have access to.

To limit the dashboard to show only data that is accessible to the signed in user, the application containers
must execute SQL by using privileges granted to the end user. You can enable this by using caller’s
rights in the application.

> **Note:**
>
> * The caller’s rights feature is supported only when [accessing a service](working-with-services.md) using network ingress. The feature isn’t available when using a service function to access the service.
> * The caller’s rights feature is currently not supported in a Snowflake Native App ([apps with containers](../native-apps/native-apps-about.md)).

#### Configure caller’s rights for your service

Configuring caller’s rights for your application is a two-step procedure.

1. In the [service specification](specification-reference.md), set the `executeAsCaller` to `true`, in as shown in the following specification fragment:

   ```yaml
   spec:
     containers:
     ...
   capabilities:
     securityContext:
       executeAsCaller: true
   ```

   This setting tells Snowflake that the application intends to use caller’s rights and causes Snowflake to insert the `Sf-Context-Current-User-Token` header in every incoming request before sending the request to the application container. This user token facilitates query execution as the calling user. If not specified, `executeAsCaller` defaults to `false`.

   Specifying the `executeAsCaller` option doesn’t affect the service’s ability to execute queries as the service user and service’s owner role. With `executeAsCaller` enabled, the service has the option to connect to Snowflake both as a calling user and as a service user.
2. To establish a Snowflake connection on behalf of the calling user, update your application code to create a login token that includes both the OAuth token that Snowflake provided to the service and the user token from the `Sf-Context-Current-User-Token` header.

   The login token must follow this format: `<service-oauth-token>.<Sf-Context-Current-User-Token>`.

   This update is demonstrated in the following Python code fragment:

   ```python
   # Environment variables below will be automatically populated by Snowflake.
   SNOWFLAKE_ACCOUNT = os.getenv("SNOWFLAKE_ACCOUNT")
   SNOWFLAKE_HOST = os.getenv("SNOWFLAKE_HOST")

   def get_login_token():
       with open("/snowflake/session/token", "r") as f:
           return f.read()

   def get_connection_params(ingress_user_token = None):
       # start a Snowflake session as ingress user
       # (if user token header provided)
       if ingress_user_token:
           logger.info("Creating a session on behalf of the current user.")
           token = get_login_token() + "." + ingress_user_token
       else:
           logger.info("Creating a session as the service user.")
           token = get_login_token()

       return {
           "account": SNOWFLAKE_ACCOUNT,
           "host": SNOWFLAKE_HOST,
           "authenticator": "oauth",
           "token": token
       }

   def run_query(request, query):
       ingress_user_token = request.headers.get('Sf-Context-Current-User-Token')
       # ingress_user_token is None if header not present
       connection_params = get_connection_params(ingress_user_token)
       with Session.builder.configs(connection_params).create() as session:
         # use the session to execute a query.
   ```

In the example above:

* The `get_login_token` function reads the file where Snowflake copied the OAuth token for the container to use.
* The `get_connection_params` function constructs a token by concatenating the OAuth token and the user token from the
  `Sf-Context-Current-User-Token` header. The function includes this token in a dictionary of parameters that the application uses to
  connect to Snowflake.

> **Note:**
>
> When a service uses caller’s rights, it can connect to Snowflake as multiple users. You are responsible for managing access to resources that aren’t managed by Snowflake.
>
> For example, in Streamlit apps, the `st.connection` object automatically caches the connection by using `st.cache_resource` in the global state, making it accessible across Streamlit sessions that are started by different users. When you use caller’s rights, consider using `st.session_state` to store connections on a per-session basis to avoid sharing connections between users.

For an example with step-by-step instructions, see [Create a service with caller’s rights enabled](tutorials/advanced/tutorial-7-callers-rights.md).

#### Accessing a service with caller’s rights configured

*Configuring caller’s rights* means that your service is establishing a Snowflake connection on behalf of the caller. How you log in to the
service’s ingress endpoints, either programmatically or by using a browser, remains the same. After log in, the following behaviors and options apply:

* **Accessing a public endpoint using a browser:** After you log into an endpoint, the service establishes a connection to Snowflake on behalf
  of the calling user using the default role of the user. If there is no default role configured for the user, the PUBLIC role is used.
* **Accessing a public endpoint programmatically:** When [logging into an endpoint programmatically](../../user-guide/oauth-custom.md) using JWT token, you can optionally set the `scope` parameter to specify the role to activate

Currently, after a service establishes a caller’s right connection to Snowflake on behalf of the caller, switching roles is not supported. If your application needs to use different roles to access different objects, you must change the user’s default secondary roles property.

* To set up the user to have all secondary roles active by default, use the [ALTER USER](../../sql-reference/sql/alter-user.md) command to set the [DEFAULT_SECONDARY_ROLES](../../sql-reference/sql/create-user.md) property of the user to (‘ALL’), as shown in the following example:

```sqlexample
ALTER USER my_user SET DEFAULT_SECONDARY_ROLES = ( 'ALL' );
```

#### Managing caller grants to a service

When a service creates a caller’s rights session, the session operates as the calling user, *not* as the service user. When an operation is performed by using this session, Snowflake applies a sequence of two permissions checks:

1. The first permissions check is performed as if the user created the session directly. This check is part of the normal permission checks that Snowflake performs
   for the user.
2. The second permissions check verifies that the service is allowed to perform the operation on behalf of a user. Snowflake verifies this by
   ensuring that the service’s owner role was granted the necessary caller grants.

In a caller’s rights session, both the normal permission check and the service owner role’s caller grants check must allow the
operation; this is referred to as [restricted caller’s rights](../restricted-callers-rights.md). By default the service has no permission to do anything on behalf of a user. You must explicitly grant caller grants to
the service so it can run with the caller’s privileges.

For example, suppose a user `U1` uses a role `R1` that has the SELECT privilege on the table `T1`. When `U1` logs into the public endpoint of your
service (`example_service`), which is configured to use the caller’s rights, the service then establishes a connection with Snowflake on
behalf of `U1`.

To allow the service to query table `T1` on behalf of `U1`, you need to grant the service’s owner role the following privileges:

* Privileges to resolve the table’s name, by granting a caller grant that allows the service to run with the USAGE privilege on the database and schema for that table.
* Privileges to use a warehouse to execute queries by granting a caller grant that allows the service to run with the USAGE privilege on a warehouse.
* Privileges to query the table by granting a caller grant that allows the service to run with the SELECT privilege on table `T1`.

The following example shows how to grant the service’s owner role with these privileges:

```sqlexample
-- Permissions to resolve the table's name.
GRANT CALLER USAGE ON DATABASE <db_name> TO ROLE <service_owner_role>;
GRANT CALLER USAGE ON SCHEMA <schema_name> TO ROLE <service_owner_role>;
-- Permissions to use a warehouse
GRANT CALLER USAGE ON WAREHOUSE <warehouse_name> TO ROLE <service_owner_role>;
-- Permissions to query the table.
GRANT CALLER SELECT ON TABLE T1 TO ROLE <service_owner_role>;
```

Any role in your account that has the global MANAGE CALLER GRANT privilege can grant caller grants. For more information about caller grants, see [GRANT CALLER](../../sql-reference/sql/grant-caller.md) and [Restricted caller’s rights](../restricted-callers-rights.md).

#### Example

For an example of a service that uses the caller’s rights feature when executing SQL queries on behalf of users, see [Create a service with caller’s rights enabled](tutorials/advanced/tutorial-7-callers-rights.md).

### Connect to Snowflake by using other credentials

You can use other forms of authentication to connect to Snowflake, not just the Snowflake-provided OAuth token. To do this, you create an external access integration (EAI) that enables your container to connect to Snowflake as if the container is running outside Snowflake and connecting through the internet. When you connect this way, you don’t need to configure the host that is used by the client.

> **Note:**
>
> Because these connections traverse an EAI, Snowflake authentication also enforces network policies. If your business requires network policies, connecting with other credentials isn’t supported.

For example, the following connection specifies the username and password to authenticate:

```python
conn = snowflake.connector.connect(
  account = '<acct-name>',
  user = '<user-name>',
  password = '<password>'
)
```

To use a default hostname, you need external access integration with a network rule that allows access from your service to the
Snowflake internet hostname for your account. For example, if your account name is `MYACCOUNT` in the organization `MYORG`, the hostname is
`myorg-myaccount.snowflakecomputing.com`. For more information, see [Configure service egress](service-network-communications.md). [Privatelink](../../user-guide/private-connectivity-inbound.md) hostnames are not supported

* Create a network rule that matches your account’s Snowflake API hostname:

  ```sqlexample
  CREATE OR REPLACE NETWORK RULE snowflake_egress_access
    MODE = EGRESS
    TYPE = HOST_PORT
    VALUE_LIST = ('myorg-myaccount.snowflakecomputing.com');
  ```
* Create an integration that uses the preceding network rule:

  ```sqlexample
  CREATE EXTERNAL ACCESS INTEGRATION snowflake_egress_access_integration
    ALLOWED_NETWORK_RULES = (snowflake_egress_access)
    ENABLED = TRUE;
  ```

## Configuration of the database and schema context for executing SQL

In addition to providing credentials, Snowflake also provides the database and schema context in which the service is created. The container code can use this information to execute SQL in the same database and schema context as the service.

This section explains two concepts:

* The logic Snowflake uses to determine the database and schema in which to create your service.
* The method through which Snowflake conveys this information to your containers, thus enabling the container code to execute
  SQL in the same database and schema context.

Snowflake uses the service name to determine the database and schema in which to create a service:

* Example 1: In the following CREATE SERVICE and EXECUTE JOB SERVICE commands, the service name does not explicitly specify a database and schema name. Snowflake creates the service and the job service in the current database and schema.

  ```sqlexample
  -- Create a service.
  CREATE SERVICE test_service IN COMPUTE POOL ...

  -- Execute a job service.
  EXECUTE JOB SERVICE
    IN COMPUTE POOL tutorial_compute_pool
    NAME = example_job_service ...
  ```
* Example 2: In the following CREATE SERVICE and EXECUTE JOB SERVICE commands, the service name includes a database and schema name. Snowflake creates the service and job service in the specified database (`test_db`) and schema (`test_schema`), regardless of the current schema.

  ```sqlexample
  -- Create a service.
  CREATE SERVICE test_db.test_schema.test_service IN COMPUTE POOL ...

  -- Execute a job service.
  EXECUTE JOB SERVICE
    IN COMPUTE POOL tutorial_compute_pool
    NAME = test_db.test_schema.example_job_service ...
  ```

When Snowflake starts a service, it provides the database and schema information to the running containers using the
following environment variables:

* SNOWFLAKE_DATABASE
* SNOWFLAKE_SCHEMA

Your container code can use environment variables in the connection code to determine which database and schema to use, as
shown in this example:

```python
conn = snowflake.connector.connect(
  host = os.getenv('SNOWFLAKE_HOST'),
  account = os.getenv('SNOWFLAKE_ACCOUNT'),
  token = get_login_token(),
  authenticator = 'oauth',
  database = os.getenv('SNOWFLAKE_DATABASE'),
  schema = os.getenv('SNOWFLAKE_SCHEMA')
)
```

**Example**

In [Tutorial 2](tutorials/tutorial-2.md), you create a Snowflake job service that connects with Snowflake and executes SQL statements.
The following steps summarize how the tutorial code uses the environment variables:

1. In the common setup (see the [Common Setup](tutorials/common-setup.md) section), you create resources, including a
   database and a schema. You also set the current database and schema for the session:

   ```sqlexample
   USE DATABASE tutorial_db;
   ...
   USE SCHEMA data_schema;
   ```
2. After you create a job service (by running EXECUTE JOB SERVICE), Snowflake starts the container and sets the following environment variables in
   the container to the current database and schema of the session:

   * SNOWFLAKE_DATABASE is set to “TUTORIAL_DB”
   * SNOWFLAKE_SCHEMA is set to “DATA_SCHEMA”
3. The job code (see `main.py` in Tutorial 2) reads these environment variables:

   ```python
   SNOWFLAKE_DATABASE = os.getenv('SNOWFLAKE_DATABASE')
   SNOWFLAKE_SCHEMA = os.getenv('SNOWFLAKE_SCHEMA')
   ```
4. The job code sets the database and schema as the context in which to execute the SQL statements (`run_job()` function
   in `main.py`):

   ```sqljson
   {
      "account": SNOWFLAKE_ACCOUNT,
      "host": SNOWFLAKE_HOST,
      "authenticator": "oauth",
      "token": get_login_token(),
      "warehouse": SNOWFLAKE_WAREHOUSE,
      "database": SNOWFLAKE_DATABASE,
      "schema": SNOWFLAKE_SCHEMA
   }
   ...
   ```

   > **Note:**
   >
   > SNOWFLAKE_ACCOUNT, SNOWFLAKE_HOST, SNOWFLAKE_DATABASE, SNOWFLAKE_SCHEMA are environment variables that Snowflake generates for the application container, but SNOWFLAKE_WAREHOUSE is not (the Tutorial 2 application code created this variable because Snowflake does not pass a warehouse name to a container).

## Specifying the warehouse for your container

If your service connects to Snowflake to execute a query in a Snowflake warehouse, you have the following options to specify a warehouse:

* **Specify a warehouse in your application code.** Specify a warehouse as part of the connection configuration when starting a Snowflake session to run queries in your code. For an example, see [Tutorial 2](tutorials/tutorial-2.md).
* **Specify a default warehouse when creating a service.** Specify the optional QUERY_WAREHOUSE
  parameter in the
  [CREATE SERVICE](../../sql-reference/sql/create-service.md) or [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md)
  command to provide a default warehouse. If your application code doesn’t provide a warehouse
  as part of connection configuration, Snowflake uses the default warehouse. Use the
  [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command to change the default warehouse.

  > **Note:**
  >
  > The warehouse specified using the QUERY_WAREHOUSE parameter is the default only for the
  > service user.
  > When the service connects to Snowflake on behalf of another user — in the context of
  > caller’s rights scenario,
  > Snowflake uses the user’s default warehouse.

If you specify a warehouse by using both methods, the warehouse that is specified in the application code is used.

## Access service user query history

You can find queries executed by your service as the service user by filtering the [QUERY_HISTORY view](../../sql-reference/account-usage/query_history.md) or [QUERY_HISTORY](../../sql-reference/functions/query_history.md) function where `user_type` is SNOWFLAKE_SERVICE.

**Example 1:** Fetch queries run by a service.

```sqlexample
SELECT *
FROM snowflake.account_usage.query_history
WHERE user_type = 'SNOWFLAKE_SERVICE'
AND user_name = '<service_name>'
AND user_database_name = '<service_db_name>'
AND user_schema_name = '<service_schema_name>'
order by start_time;
```

In the WHERE clause:

* `user_name = '<service_name>'`: You specify the service name as the user name because a service executes queries as the service user, and the service user’s name is the same as the service name.
* `user_type = 'SNOWFLAKE_SERVICE'` and `user_name = '<service_name>'`: This limits the query result to retrieve only queries executed by a service.
* `user_database_name` and `user_schema_name`: For a service user, these are the service’s database and schema.

You can get the same results by calling the QUERY_HISTORY function.

```sqlexample
SELECT *
FROM TABLE(<service_db_name>.information_schema.query_history())
WHERE user_database_name = '<service_db_name>'
AND user_schema_name = '<service_schema_name>'
AND user_type = 'SNOWFLAKE_SERVICE'
AND user_name = '<service_name>'
order by start_time;
```

In the WHERE clause:

* `user_type = 'SNOWFLAKE_SERVICE'` and `user_name = '<service_name>'` limit the query result to retrieve only queries executed by a service.
* `user_database_name` and `user_schema_name` names (for a service user) are the service’s database and schema.

**Example 2:** Fetch queries run by services and the corresponding service information.

```sqlexample
SELECT query_history.*, services.*
FROM snowflake.account_usage.query_history
JOIN snowflake.account_usage.services
ON query_history.user_name = services.service_name
AND query_history.user_schema_id = services.service_schema_id
AND query_history.user_type = 'SNOWFLAKE_SERVICE'
```

The query joins the QUERY_HISTORY and SERVICES views to retrieve information about the queries and services that executed the queries. Note the following:

* For queries run by services, the `query_history.user_name` is the service user’s name, which is the same as the service name.
* The query joins the views using the schema IDs (not schema name) to ensure you refer to the same schema, because if you drop and recreate a schema, the schema ID changes but the name remains the same.

You can add optional filters to the query. For example:

* Filter `query_history` to retrieve only services that executed specific queries.
* Filter `services` to retrieve only queries executed by specific services.

**Example 3:** For every service, fetch service user information.

```sqlexample
SELECT services.*, users.*
FROM snowflake.account_usage.users
JOIN snowflake.account_usage.services
ON users.name = services.service_name
AND users.schema_id = services.service_schema_id
AND users.type = 'SNOWFLAKE_SERVICE'
```

The query join SERVICES and USERS views in the ACCOUNT_USAGE schema to retrieve services and service user information. Note the following:

* When a service runs queries, it runs the queries as service user and the service user’s name is the same as the service name. Therefore, you specify the join condition: `users.name = services.service_name`.
* Service names are unique only within a schema. Therefore, the query specifies the join condition (`users.schema_id = services.service_schema_id`) to ensure each service user is matched against the specific service they belong to (and not any other same-named service running in different schemas).

---
title: Snowpark Container Services: Working with an image registry and repository
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/working-with-registry-repository.md
section: Snowpark Container Services
---

# Snowpark Container Services: Working with an image registry and repository

Snowpark Container Services provides an
[OCIv2](https://github.com/opencontainers/distribution-spec/blob/main/spec.md)-compliant image registry service and a storage unit call repository to store images.

## Image registry

The image registry service serves the OCIv2 API for storing OCI-compliant container images.

## Image registry hostname

Each image registry in a Snowflake account has a unique hostname, which allows OCI clients
(such as Docker CLI) to access an image registry using REST API calls.
The general syntax for an image registry hostname is:

```output
<orgname>-<acctname>.registry.snowflakecomputing.com
```

In the hostname:

* `<orgname>-<acctname>` identifies a Snowflake account.
* `registry` allows Snowflake to provide hostnames
  per account for registry customers.

  The hostname is always all lowercase.

> **Note:**
>
> A Snowflake account name (`<acctname>`) can have an underscore
> (for example, `my_account`), but underscores are not valid in a URL.
> Therefore, when using a registry hostname, you need to replace an underscore
> with a dash. For example, change `my_account` to `my-account`.

You can find your organization name and account name information for image
repository host names in one of the following ways:

* The Snowsight web interface: Use the account selector. For more information, see [Getting started with Snowsight](../../user-guide/ui-snowsight-gs.md).
* Execute the [SHOW IMAGE REPOSITORIES](../../sql-reference/sql/show-image-repositories.md) command.

## Image registry authentication

To access an image repository in your Snowflake account, users must authenticate to the image registry using their Snowflake credentials. Additionally, appropriate privileges are required to access repositories within the registry. To obtain these privileges, a user must have a role that grants access privileges to the repository.

You have the following options to authenticate your client with an image registry in your account:

* **Use Snowflake CLI:** The Snowflake CLI supports all forms of Snowflake authentication.

  + For the Docker client, use the [snow spcs image-registry login](../snowflake-cli/command-reference/spcs-commands/image-registry-commands/login.md) command to authenticate Docker with a registry.
  + For any client (including Docker), Snowflake CLI also provides the option to first generate an authentication token and use it to authenticate the client. For more information, see [snow spcs image-registry token](../snowflake-cli/command-reference/spcs-commands/image-registry-commands/token.md).
* **Use client-provided commands:** Tools like Docker offer commands to authenticate with a registry by using a username and password.
  There are several ways to use username and password authentication with external tools:

  + Instead of using your own username and password, you can use a [programmatic access token (PAT)](../../user-guide/programmatic-access-tokens.md). First, [generate a token](../../user-guide/programmatic-access-tokens.md), then use it with a tool — such as Docker — by providing “USER” as the username and the token as the password.
  + You can also provide your Snowflake username and password, but this is only allowed if your account administrator enables username/password authentication. By default, username/password login isn’t supported without multi-factor authentication (MFA), which is incompatible with the `docker login` command.

## Image repository

A *registry* is a service that serves the OCIv2 API, and a *repository* is a
storage unit that you create within the service.

A repository is a named location in your account where you store images.
This is similar to the relationship between a DBMS and a table within the DBMS.
That is, a DBMS is equivalent to a registry, and a table is equivalent to a repository.

You can create one or more repositories in your Snowflake account. For example,
DEV, TEST, and PROD repositories can store images during development, testing,
and production. You can also create repositories that have different
permissions; for example, some repositories may be read-only for some roles.

Access control is supported at the repository level;
individual image-level access control is not supported.

For uploading images to an image repository, the registry service offers various authentication options and single sign-on (SSO).

For an example of creating a repository and uploading an image, see [Tutorial 1](tutorials/tutorial-1.md).

## Image repository URL

The following is a general syntax for a Snowflake repository URL:

```output
<registry-hostname>/<db_name>/<schema_name>/<repository_name>
```

For example,

```output
myorg-myacct.registry.snowflake.com/my_db/my_schema/my_repository
```

To look up the repository URL in your account, use the SHOW IMAGE REPOSITORIES
SQL command.

> **Note:**
>
> * Snowflake URL-encodes the $ character, which is the only non-URL
>   character Snowflake supports in identifiers
>   (See [Identifier Requirements](../../sql-reference/identifiers-syntax.md)).
>   Double-quoted names that contain special characters are not supported.
> * When you manually construct a repository URL, replace an underscore in an account
>   name (`my_acct`) with a dash (`my-acct`).

### Repository operations

To create and manage repositories, Snowflake supports the following
[repository operations](../../sql-reference/commands-snowpark-container-services.md):

* [CREATE IMAGE REPOSITORY](../../sql-reference/sql/create-image-repository.md)
* [DROP IMAGE REPOSITORY](../../sql-reference/sql/drop-image-repository.md)
* [SHOW IMAGE REPOSITORIES](../../sql-reference/sql/show-image-repositories.md)

To list images stored within a Snowflake image repository, use the following command:

* [SHOW IMAGES IN IMAGE REPOSITORY](../../sql-reference/sql/show-images-in-image-repository.md)

For an example of creating a repository and uploading an image,
see [Tutorial Common Setup](tutorials/common-setup.md).

### Repository privileges

When you work with a repository, the following privilege model applies:

* To create a repository in a schema, you must have the
  CREATE IMAGE REPOSITORY privilege on the schema.
* For repository management, the following privileges (capabilities)
  are supported:

  | Privilege | Usage |
  | --- | --- |
  | READ | Enables listing and downloading images from a repository. |
  | WRITE | Enables listing and downloading images from a repository. You can also push images in the repository. |
  | OWNERSHIP | Enables listing and downloading images from a repository. You can also push images in the repository. |
  | SERVICE READ | Enables a container service to list and download images from a repository. This is needed for the image building step of [model serving](../snowflake-ml/inference/real-time-inference-rest-api.md). |
  | SERVICE WRITE | Enables a container service to push images in the repository. This is needed for the image building step of [model serving](../snowflake-ml/inference/real-time-inference-rest-api.md). |

## Guidelines and Limitations

* Dropping images from a repository is currently not supported. You can drop a repository, but that removes all images from that repository.
* Contact your account representative if you require inbound private connectivity.
* The maximum layer size permitted for an image registry in compressed format is 160 GiB for Snowflake accounts on AWS, and 195 GiB for Snowflake accounts on Azure.

---
title: Snowpark Container Services: Working with compute pools
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/working-with-compute-pool.md
section: Snowpark Container Services
---

# Snowpark Container Services: Working with compute pools

A compute pool is a collection of one or more virtual machine (VM) nodes
on which Snowflake runs your Snowpark Container Services services (including job services).
You create a compute pool using the [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) command.
You then specify it when
[creating a service](../../sql-reference/sql/create-service.md) or [executing a job service](../../sql-reference/sql/execute-job-service.md).

## Creating a compute pool

A compute pool is an account-level construct, analogous to a Snowflake virtual
warehouse. The naming scope of the compute pool is your account.
That is, you cannot have multiple compute pools with the same name in your
account.

The minimum information required to create a compute pool includes the following:

* The machine type (referred to as the *instance family*) to provision for the compute pool nodes
* The minimum nodes to launch the compute pool with
* The maximum number of nodes the compute pool can scale to (Snowflake manages the scaling.)

If you expect a substantial load or sudden bursts of activity on the services you intend to run within your compute pool, you can set a minimum node count greater than 1. This approach ensures that additional nodes are readily available when needed, instead of waiting for autoscaling to start.

Setting a maximum node limit prevents an unexpectedly large number of nodes from being added to your compute pool by Snowflake autoscaling. This can be crucial in scenarios such as unexpected load spikes or issues in your code that might cause Snowflake to allocate a larger number of compute pool nodes than originally planned.

To create a compute pool using [Snowsight](../../user-guide/ui-snowsight-gs.md), or SQL:

Snowsight:
:   1. In the navigation menu, select Compute » Compute Pools.
    2. Select your username at the bottom of the navigation bar and switch to the ACCOUNTADMIN role, or any role that is allowed to create a compute pool.
    3. Select + Compute Pool.
    4. In the New compute pool UI, specify the required information (the compute pool name, the instance family, and the node limit).
    5. Select Create Compute Pool.

SQL:
:   Execute the [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) command.

    For example, the following command creates a one-node compute pool:

    ```sqlexample
    CREATE COMPUTE POOL tutorial_compute_pool
      MIN_NODES = 1
      MAX_NODES = 1
      INSTANCE_FAMILY = CPU_X64_XS;
    ```

The instance family identifies the type of machine you want to provision
for compute pool nodes. Specifying instance family in
creating a compute pool is similar to specifying warehouse size
(XSMALL, SMALL, MEDIUM, LARGE and so on) when creating a warehouse. The following table lists the available machine types. You can also use the [SHOW COMPUTE POOL INSTANCE FAMILIES](../../sql-reference/sql/show-compute-pool-instance-families.md) command to get this list of available instance families.

### Compute pool placement

A *placement group* is a fault-isolation domain within a Snowflake region, similar to an availability zone (AZ) in AWS or Azure. You can optionally specify which placement group to provision compute pool nodes in by using the `placement_group` parameter in the [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) statement.

If `placement_group` is not specified, Snowflake places compute pool nodes based on availability, which might span multiple placement groups.

If you choose to specify a `placement_group`, you have two options:

* **Specify a specific placement group:** When you specify `placement_group`, Snowflake provisions all nodes for that pool from the specified placement group. You should set `placement_group` to a specific placement group in the following situations:

  + You need reduced cross-node latency and lower communication costs for highly
    interactive, tightly coupled services.
  + You are building a highly available service and you choose to deploy the same code
    across multiple services, each one running on a separate compute pool that is assigned to a distinct placement group.

  The following guidelines apply when you set a specific placement group for a compute pool:

  + Instance family availability varies by placement group and region. Smaller regions might offer fewer placement group options,
    especially for GPU families. Call the
    [SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS](../../sql-reference/functions/system_get_instance_family_placement_groups.md) system function to list the placement groups available for
    a specific instance family in your region.
  + Placement group names are consistent within an account across different instance families.
    Different Snowflake accounts might observe different names for the same underlying placement groups.
  + When you configure a placement group for a compute pool, it restricts Snowpark Container Services’ flexibility to
    optimize node placement. This restriction can increase the likelihood of insufficient-capacity errors and lengthen startup times
    during peak demand.
  + You can alter a placement group only if the compute pool is fully suspended and your services
    don’t use block storage.
* **Specify DISTRIBUTED:** When you set `placement_group` to DISTRIBUTED, Snowflake attempts to distribute nodes for that compute pool
  across all available placement groups. You should set `placement_group` to `DISTRIBUTED` if you want to maintain healthy fault tolerance across multiple placement groups. When compute pool nodes are distributed across multiple placement groups, if one placement group goes down, you don’t lose all the nodes

  The following behaviors apply when you set `placement_group` to DISTRIBUTED for a compute pool:

  + Node distribution: Snowflake uses an equal-partition strategy to spread nodes across all available placement groups in a region. If a
    specific placement group encounters insufficient capacity errors, nodes are provisioned in other placement groups with available capacity, which can result in an uneven distribution.
  + Service instances distribution: When there is more than one service instance, Snowflake attempts to evenly distribute the instances across placement groups.
    Sometimes even distribution can’t be achieved because of constraints, such as capacity limitations.
  + Outage behavior: In the current implementation, if a placement group fails, Snowflake doesn’t automatically fail over nodes to
    healthy placement groups. You should overprovision your service instances (N+1) so that nodes in the remaining placement groups can handle the traffic load during an outage. In the event of placement group outage, Snowflake takes the following actions:

    - Stops placing new service instances in the impacted placement group.
    - Routes ingress traffic to service instances in the healthy placement groups.
    - Recreates service instances in the impacted placement group on the healthy placement groups.

> **Note:**
>
> * In smaller Snowflake regions, some instance types might not be available across multiple placement groups, which can reduce the compute pool’s resilience to placement group failures.
> * After a placement group recovers, Snowflake doesn’t automatically move service instances back to it; the system gradually rebalances during node upgrades or routine service maintenance.

### Available instance families (machine types) for compute pool nodes

> | INSTANCE_FAMILY, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) | vCPU | Memory (GiB) | Storage (GB) | Bandwidth limit (Gbps) | GPU | GPU Memory per GPU (GB) | Node limit | Description |
> | --- | --- | --- | --- | --- | --- | --- | --- | --- |
> | CPU_X64_XS | 1 | 6 | 100 | Up to 12.5 | n/a | n/a | 150 | Smallest instance available for Snowpark Containers. Ideal for cost-savings and getting started. |
> | CPU_X64_S | 3 | 13 | 100 | Up to 12.5 | n/a | n/a | 150 | Ideal for hosting multiple services/jobs while saving cost. |
> | CPU_X64_M | 6 | 28 | 100 | Up to 12.5 | n/a | n/a | 150 | Ideal for having a full stack application or multiple services |
> | CPU_X64_SL (except China) | 14 | 54 | 100 | Up to 12.5 | n/a | n/a | 150 | For applications which need a large number of CPUs, memory and Storage. |
> | CPU_X64_L | 28 | 116 | 100 | 12.5 | n/a | n/a | 150 | For applications which need an unusually large number of CPUs, memory and Storage. |
> | HIGHMEM_X64_S | 6 | 58 | 100 | AWS and GCP: Up to 12.5, Azure: 8 | n/a | n/a | 150 | For memory intensive applications. |
> | HIGHMEM_X64_M | 28 | AWS: 240, Azure and GCP: 244 | 100 | AWS: 12.5, Azure and GCP: 16 | n/a | n/a | 150 | For hosting multiple memory intensive applications on a single machine. |
> | HIGHMEM_X64_SL (Azure and GCP, except GCP Dammam region) | 92 | 654 | 100 | 32 | n/a | n/a | 20 | Largest Azure or GCP high-memory machine available for processing large in-memory data. |
> | HIGHMEM_X64_L (AWS only) | 124 | 984 | 100 | 50 | n/a | n/a | 150 | Largest AWS high-memory machine available for processing large in-memory data. |
> | GPU_NV_S (AWS only, except Singapore, Switzerland North, Paris, and Osaka regions) | 6 | 27 | 300 (NVMe) | Up to 10 | 1 NVIDIA A10G | 24 | 150 | Our smallest NVIDIA GPU size available for Snowpark Containers to get started. |
> | GPU_NV_M (AWS only, except gov regions, Singapore, Switzerland North, Paris, and Osaka regions) | 44 | 178 | 3.4 TB (NVMe) | 40 | 4 NVIDIA A10G | 24 | 10 | Optimized for intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
> | GPU_NV_L (AWS only, available only in AWS US West and US East non-gov regions by request; limited availability might be possible in other regions upon request) | 92 | 1112 | 6.8 TB (NVMe) | 400 | 8 NVIDIA A100 | 40 | On request | Largest GPU instance for specialized and advanced GPU cases like LLMs and Clustering, etc. |
> | GPU_NV_XS (Azure only, except Switzerland North, UAE North, Central US, and UK South regions) | 3 | 26 | 100 | 8 | 1 NVIDIA T4 | 16 | 10 | Our smallest Azure NVIDIA GPU size available for Snowpark Containers to get started. |
> | GPU_NV_SM (Azure only, except Central US region) | 32 | 424 | 100 | 40 | 1 NVIDIA A10 | 24 | 10 | A smaller Azure NVIDIA GPU size available for Snowpark Containers to get started. |
> | GPU_NV_2M (Azure only, except Central US region) | 68 | 858 | 100 | 80 | 2 NVIDIA A10 | 24 | 5 | Optimized for intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
> | GPU_NV_3M (Azure only, except Central US, North Europe, and UAE North regions) | 44 | 424 | 100 | 40 | 2 NVIDIA A100 | 80 | On request | Optimized for memory-intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
> | GPU_NV_SL (Azure only, except Central US, North Europe, and UAE North regions) | 92 | 858 | 100 | 80 | 4 NVIDIA A100 | 80 | On request | Largest GPU instance for specialized and advanced GPU cases like LLMs and Clustering, etc. |
> | GPU_GCP_NV_L4_1_24G (Google Cloud only) | 6 | 28 | 300 | Up to 16 | 1 NVIDIA L4 | 24 | 10 | Our smallest NVIDIA GPU size available for Snowpark Containers to get started. |
> | GPU_GCP_NV_L4_4_24G (Google Cloud only) | 44 | 178 | 1200 | Up to 50 | 4 NVIDIA L4 | 24 | 10 | GPU usage scenarios like Computer Vision or LLMs. |
> | GPU_GCP_NV_A100_8_40G (Google Cloud only, available only in GCP US Central1 and Europe West4 regions by request) | 92 | 654 | 2500 | Up to 100 | 8 NVIDIA A100 | 40 | On request | Optimized for memory-intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |

For information about available instance families, see
[CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md).

### Autoscaling of compute pool nodes

After you create a compute pool, Snowflake launches the minimum number of nodes
and automatically creates additional nodes up to the maximum allowed. This is
called *autoscaling*. New nodes are allocated when the running nodes
cannot take any additional workload. For example,
suppose that two service instances are running on two nodes
within your compute pool. If you execute another service within
the same compute pool, the additional resource requirements might cause Snowflake to start an additional node.

However, if no services run on a node for a specific duration, Snowflake automatically removes the node, ensuring that the compute pool maintains the minimum required nodes even after the removal.

## Managing a compute pool

You can manage a compute pool using [Snowsight](../../user-guide/ui-snowsight-gs.md), or SQL.

In [Snowsight](../../user-guide/ui-snowsight-gs.md), you choose the more option (…) next to the compute pool name, and choose the desired operation from the menu. The section explains SQL commands you can use to manage a compute pool.

Snowpark Container Services provides the following commands to manage compute pools:

* **Monitoring:** Use the [SHOW COMPUTE POOLS](../../sql-reference/sql/show-compute-pools.md) command to get information about compute pools.
* **Operating:** Use the [ALTER COMPUTE POOL](../../sql-reference/sql/alter-compute-pool.md) command to change the state of a compute pool.

  ```sqlexample
  ALTER COMPUTE POOL <name> { SUSPEND | RESUME | STOP ALL }
  ```

  When you suspend a compute pool, Snowflake suspends all services except the job services. The job services continue to run until they reach a terminal state
  (DONE or FAILED), after which the compute pool nodes are released.

  A suspended compute pool must be resumed before you can start a new service. If the compute pool is configured to auto-resume
  (with the AUTO_RESUME property set to TRUE), Snowflake automatically resumes the pool when a service is submitted to it. Otherwise, you
  need to run the ALTER COMPUTE POOL command to manually resume the compute pool.
* **Modifying:** Use the [ALTER COMPUTE POOL](../../sql-reference/sql/alter-compute-pool.md) command to change compute pool properties.

  ```sqlexample
  ALTER COMPUTE POOL <name> SET propertiesToAlter = <value>
  propertiesToAlter := { MIN_NODES | MAX_NODES | AUTO_RESUME | AUTO_SUSPEND_SECS | PLACEMENT_GROUP | INSTANCE_FAMILY | TAG | COMMENT }
  ```

  When you decrease MAX_NODES, note the following potential effects:

  + Snowflake might need to terminate one or more service instances and restart them on other available nodes in the compute pool. If
    MAX_NODES is set too low, Snowflake might be unable to schedule certain service instances.
  + If the node terminated had a job service execution in progress, the job execution will fail. Snowflake will not restart the job service.

    **Example:**

    > ```sqlexample
    > ALTER COMPUTE POOL my_pool SET MIN_NODES = 2  MAX_NODES = 2;
    > ```
* **Removing:** Use the [DROP COMPUTE POOL](../../sql-reference/sql/drop-compute-pool.md) command to remove a compute pool.

  > **Example:**
  >
  > > ```sqlexample
  > > DROP COMPUTE POOL <name>
  > > ```
  > >
  > > You must stop all running services before you can drop a compute pool.
* **Listing compute pools and viewing properties:** Use SHOW COMPUTE POOLS and DESCRIBE COMPUTE POOL commands. For examples, see [Show Compute Pools](../../sql-reference/sql/show-compute-pools.md).

### About the target_nodes compute pool property

This section explains the `target_nodes` property with examples. The `target_nodes` property indicates the number of nodes that Snowflake is targeting for your compute pool. If `active_nodes` isn’t equal to the `target_nodes`, Snowflake autoscales the cluster
to add or remove the nodes.

There are several properties related to the number of nodes in a compute pool. These includes: `min_nodes`, `max_nodes`, `active_nodes`, `idle_nodes`, and `target_nodes`. For more information about these properties, see [DESC COMPUTE POOL](../../sql-reference/sql/desc-compute-pool.md) and [SHOW COMPUTE POOLS](../../sql-reference/sql/show-compute-pools.md).

The following examples demonstrate how to interpret the values in the `target_nodes` column.

#### Example 1

Suppose in a [CREATE COMPUTE POOL](../../sql-reference/sql/create-compute-pool.md) command, you specify MIN_NODES=1 and MAX_NODES=3.

While Snowflake is provisioning a node, initially the value in the `active_nodes` and `idle_nodes` columns is 0, and the value in the `target_nodes` column is 1. (The value in the `target_nodes` column is the same as the value that you specified for the MIN_NODES parameter.) This indicates that there should be one node in the compute pool that Snowflake is provisioning.

After Snowflake provisions one node, the value in the `idle_nodes` column is 1 (assuming that there are no services running). The value in the `target_nodes` column is still 1, indicating there should be one node in the compute pool.

#### Example 2

Snowflake might try to add a node to an existing compute pool due to autoscaling or changes to the minimum number of nodes (through [ALTER COMPUTE POOL … SET MIN_NODES](../../sql-reference/sql/alter-compute-pool.md)).

While Snowflake is provisioning a node, the value in the `state` column is `resizing`. To determine how many nodes Snowflake is adding, check the value in the `target_nodes` column.

For example, suppose that the value in the, `active_nodes` column is 1, the value in the `idle_nodes` column is 0, and you resize the compute pool by updating the MIN_NODES property from 1 to 2. In this case, the value in the `target_nodes` column is 2 (the number of nodes that should be in the compute pool). From this, you can infer that Snowflake is provisioning one additional node.

## Compute pool lifecycle

A compute pool can be in any of the following states:

* **IDLE:** The compute pool has the desired number of virtual machine (VM) nodes, but no
  services are scheduled. In this state, autoscaling can shrink the
  compute pool to the minimum size due to lack of activity.
* **ACTIVE:** The compute pool has at least one service running or
  scheduled to run on it. The pool can grow (up to the maximum nodes) or
  shrink (down to the minimum nodes) in response to load or user actions.
* **SUSPENDED:** The pool currently contains no running virtual machine nodes, but if the AUTO_RESUME compute pool property is set to TRUE, the pool will automatically resume when a service is scheduled.

The following states are transient:

* **STARTING:** When you create or resume a compute pool, the compute pool enters the STARTING state until at least one node is provisioned.
* **STOPPING:** When you suspend a compute pool (using ALTER COMPUTE POOL), the compute pool enters the STOPPING state until Snowflake has released all nodes in the compute pool. When you suspend a compute pool, Snowflake suspends all services except the job services. The job services continue to run until they reach a terminal state (DONE or FAILED), after which the compute pool nodes are released.
* **RESIZING:** When you create a compute pool, initially it enters the STARTING state. After it has one node provisioned, it enters the RESIZING state until the minimum number of nodes (as specified in CREATE COMPUTE POOL) are provisioned. When you change a compute pool (ALTER COMPUTE POOL) and update the minimum and maximum node values, the pool enters the RESIZING state until the minimum nodes are provisioned. Note that autoscaling of a compute pool also puts the compute pool in the RESIZING state.

For information about how the costs incurred during the different states of the compute pool lifecycle, see [Compute pool cost](accounts-orgs-usage-views.md).

## Compute pool privileges

When you work with compute pools, the following privilege model applies:

* To create a compute pool in an account, the current role needs the
  CREATE COMPUTE POOL privilege on the account. If you create a pool, as an owner you have OWNERSHIP permission, which grants full control over that compute pool. Having OWNERSHIP of one compute pool doesn’t imply any permissions on other compute pools.
* For compute pool management, the following privileges (capabilities)
  are supported:

  | Privilege | Usage |
  | --- | --- |
  | MODIFY | Enables altering any compute pool properties, including changing the size. |
  | MONITOR | Enables viewing compute pool usage, including describing compute pool properties. Enables access to the monitoring endpoint exposed by the compute pool. |
  | OPERATE | Enables changing the state of the compute pool (suspend, resume). In addition, enables stopping any scheduled services (including job services). |
  | USAGE | Enables creating services in the compute pool. Note that when a compute pool is in a suspended state and has its AUTO_RESUME property set to true, a role with USAGE permission on the compute pool can implicitly trigger the compute pool’s resumption when they start or resume a service, even if the role lacks the OPERATE permission. |
  | OWNERSHIP | Grants full control over the compute pool. Only a single role can hold this privilege on a specific object at a time. Enables access to the monitoring endpoint exposed by the compute pool. |
  | ALL [ PRIVILEGES ] | Grants all privileges, except OWNERSHIP, on the compute pool. |

## Compute pool maintenance

As part of routine internal-infrastructure maintenance, Snowflake regularly updates
compute pool nodes to ensure optimal performance and security. This includes
operating system upgrades, driver enhancements, and security fixes. Maintenance
involves replacing outdated nodes with updated ones every few weeks, with each
node active for up to a month.

### Maintenance window

In general, scheduled maintenance occurs every Saturday from 8 PM to Sunday at 8 AM, and every Sunday from 8 PM to Monday at 8 AM. For [early access accounts](../../user-guide/intro-releases.md), maintenance takes place daily starting at 11 PM and can last up to 6 hours.

### Service disruption

During maintenance, Snowflake automatically recreates service instances running on older compute pool nodes on the new nodes. Snowflake uses a rolling method to recreate service instances.

* If a service only has one instance, service disruption occurs while Snowflake is recreating the instance.
* For services with multiple instances, Snowflake recreates the service instances incrementally on the upgraded nodes. No more than 50
  percent of the service instances are replaced at a time. Note that this might lead to fewer available instances than the MIN_INSTANCES
  requested for the service. If the available instances drop to fewer than MIN_READY_INSTANCES, it causes the service to transition from
  the READY state to the PENDING state, causing service disruption. Therefore, to avoid service disruption, consider setting
  MIN_READY_INSTANCES to less than 50 percent of MIN_INSTANCES.

Ongoing job services will be disrupted and must be restarted by customers after maintenance is complete.

> **Attention:**
>
> Service disruptions during a maintenance window or critical updates are not covered by the Service Level set forth in [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

### Best practices to minimize downtime

* **Run multiple service instances:** Having multiple instances minimizes service disruption during maintenance, ensuring high availability.
* **Store application state in persistent storage:** Store data and stateful objects on persistent storage including block storage, Snowflake stages, or Snowflake tables.
* **Catch the SIGTERM signal:** When terminating a service instance, Snowflake first sends a SIGTERM signal to each service container (see [Terminate service](working-with-services.md)). As part of processing the signal, the container code can save the service state before the service instance is shut down or restarted.
* **Design high availability services to run in degraded state during maintenance:** To remain available during maintenance, your service must be tolerant to running with only 50% of the instances.
* **Provide a readiness probe:** If you don’t provide a readiness probe, Snowflake assumes your service instance is ready as soon as the code starts executing. Typically it takes some time for a container to complete initialization and be ready to handle requests. You should provide a readiness probe in the service configuration to explicitly tell Snowflake when your service instance is ready to handle requests.
* **Monitor maintenance schedules:** Avoid scheduling critical tasks during a maintenance window.
* **Avoid scheduling job Service to run during maintenance windows:** Snowflake might cancel a running job during a maintenance window.
* **Perform regular backups or checkpoints:** Periodically back up or checkpoint your application state on persistent storage (including block storage, Snowflake stages, or Snowflake tables).

## How services are scheduled on a compute pool

At the time of [creating a service](../../sql-reference/sql/create-service.md), you might choose to run multiple instances to manage incoming load.
Snowflake uses the following general guidelines when scheduling your service
instances on compute pool nodes:

* All containers in a service instance always run on a single compute pool node.
  That is, a service instance never spans across multiple nodes.
* When you run multiple service instances,
  Snowflake may run these service instances on the same node or different
  nodes within the compute pool. When making this decision, Snowflake considers any specified hard
  resource requirements (such as memory and GPU) as outlined in the
  service specification file (see [containers.resources field](specification-reference.md)).

  For example, suppose each node in your compute pool provides 8 GB of memory.
  If your service specification includes a 6-GB memory requirement, and
  you choose to run two instances when creating a service,
  Snowflake cannot run both instances on the same node. In this case,
  Snowflake schedules each instance on a separate node within the compute pool to fulfill
  the memory requirements.

> **Note:**
>
> Snowflake supports stage mounts for use by application containers. Snowflake internal stage is one of the supported storage volume types.
>
> For optimal performance, Snowflake now limits the total number of [stage volume](snowflake-stage-volume.md) mounts to eight per compute pool node, regardless of whether these volumes belong to the same service instance, the same service, or different services.
>
> When the limit on a node is reached, Snowflake doesn’t use that node to start new service instances that use a stage volume. If the limit is reached on all nodes in the compute pool, Snowflake will be unable to start your service instance. In this scenario, when you execute the SHOW SERVICE CONTAINERS IN SERVICE command, Snowflake returns PENDING status with the “Unschedulable due to insufficient resources” message.
>
> To accommodate this stage mount allotment limit on a node, in some cases, you can increase the maximum number of nodes that you request for a compute pool. This ensures that additional nodes are available for Snowflake to start your service instances.

## System compute pools

Every Snowflake account includes two system compute pools: one CPU-based and one GPU-based exclusively for the following workloads:

* Notebooks
* Streamlit apps (CPU only)
* Model serving
* ML jobs

With system compute pools, you can run these workloads immediately, no compute pool setup required.

The system compute pools have the following default configuration:

* **Compute pool name:** SYSTEM_COMPUTE_POOL_GPU

  + **Instance family:** Depending on whether your Snowflake account is in AWS or Microsoft Azure regions, Snowflake uses the following GPU instance family for this compute pool.

    - In Azure, GPU_NV_SM.
    - In AWS, GPU_NV_S.

    Note that, the following regions do not support SYSTEM_COMPUTE_POOL_GPU:

    - In AWS: Singapore, Switzerland North, Paris, and Osaka.
    - In Azure: Central US.
    - Google Cloud: GPU compute pool isn’t available.
  + **Default configuration:**

    - MIN_NODES=1
    - MAX_NODES=50
    - INITIALLY_SUSPENDED=true
    - AUTO_SUSPEND_SECS=600
* **Compute pool name:** SYSTEM_COMPUTE_POOL_CPU

  + **Instance family:** CPU_X64_S
  + **Default configuration:**

    - MIN_NODES=1
    - MAX_NODES=150
    - INITIALLY_SUSPENDED=true
    - AUTO_SUSPEND_SECS=259200

Note that,

* Compute pools are initially in a suspended state and only begin incurring costs when a supported Snowflake workload starts using them.
* For the CPU system compute pool, Snowflake keeps one idle node in the pool at no cost to you whenever the pool is active, so that new workloads can start quickly. The following details apply:

  + The idle node is visible in the `idle_nodes` column of SHOW COMPUTE POOLS and DESCRIBE COMPUTE POOL output.
  + The idle node counts against the compute pool’s MAX_NODES limit and the per-account node limit.
  + Snowflake covers the cost of one idle node. It doesn’t appear in your billing.
  + When a workload starts on the idle node, that node is billed to you normally and Snowflake provisions a new idle node to replace it.
  + This behavior isn’t configurable. Contact [Snowflake Support](https://community.snowflake.com/s/article/How-To-Submit-a-Support-Case-in-Snowflake-Lodge) if you have questions about this behavior.
* If no workloads are running, the GPU compute pool is automatically suspended after 10 minutes and the CPU compute pool is automatically suspended after 3 days. To modify the auto-suspension policy for system compute pools, use the [ALTER COMPUTE POOL SET AUTO_SUSPEND_SECS](../../sql-reference/sql/alter-compute-pool.md) command.

### Managing the system compute pools

In a Snowflake account, the ACCOUNTADMIN role owns these system compute pools. Administrators have full control over the compute pools, including modifying their properties, suspending operations, and monitoring consumption. The ACCOUNTADMIN role can delete the compute pool. For example:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER COMPUTE POOL SYSTEM_COMPUTE_POOL_CPU STOP ALL;
DROP COMPUTE POOL SYSTEM_COMPUTE_POOL_CPU;
```

By default, the USAGE permission on system compute pools is granted to the PUBLIC role, allowing all roles in the account to use them. However, the ACCOUNTADMIN can modify these privileges to restrict access if necessary.

To restrict access to system compute pools to specific roles in your account, use the ACCOUNTADMIN role to revoke the USAGE permission from the PUBLIC role and grant it to the desired role(s). For example:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> REVOKE USAGE ON COMPUTE POOL SYSTEM_COMPUTE_POOL_CPU FROM ROLE PUBLIC;
> GRANT USAGE ON COMPUTE POOL SYSTEM_COMPUTE_POOL_CPU TO ROLE <role-name>;
> ```

System compute pools can be associated with [budgets](../../user-guide/budgets.md). for cost management.

### Configuring your own preferred compute pools for Streamlit apps

When you create a container-runtime Streamlit app and don’t specify a COMPUTE_POOL, Snowflake uses
the compute pool specified by the [DEFAULT_STREAMLIT_COMPUTE_POOL](../../sql-reference/parameters.md) parameter. Snowflake
sets this parameter to SYSTEM_COMPUTE_POOL_CPU for new accounts, so Streamlit apps run on the system
compute pool by default. To use a different compute pool, set this account-level parameter.

When DEFAULT_STREAMLIT_COMPUTE_POOL is set, the compute pool selector is not shown in the Snowsight
creation dialog. The app is created on the default compute pool automatically. To use a different
pool, change it after creation using App Settings or ALTER STREAMLIT. See
[Change the compute pool](../streamlit/app-development/managing-your-app.md).

The following example configures `my_pool` as the default compute pool for Streamlit apps:

```sqlexample
ALTER ACCOUNT SET DEFAULT_STREAMLIT_COMPUTE_POOL='my_pool';
```

To restore the compute pool selector in the Snowsight creation dialog, unset the parameter:

```sqlexample
ALTER ACCOUNT UNSET DEFAULT_STREAMLIT_COMPUTE_POOL;
```

Use the following command to check the current compute pool preference configured in your account for Streamlit apps:

```sqlexample
SHOW PARAMETERS LIKE 'DEFAULT_STREAMLIT_COMPUTE_POOL' IN ACCOUNT;
```

For more information, see [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md).

### Configuring your own preferred compute pools for Notebooks

By default, Notebook services run in system compute pools. If you don’t want to use the Snowflake-provisioned compute pools, you have the option to choose other compute pools in your account for Notebooks. To override the Snowflake-provisioned compute pools you can set these parameters ([DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU](../../sql-reference/parameters.md) and [DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU](../../sql-reference/parameters.md)). Note that, this will change your Snowsight experience. When creating a Notebook in Snowsight, the compute pool you configure using these parameters appears as the first preference in the UI. The following example commands set these parameters:

* Configure `my_pool` as the account-level compute pool preferred for Notebooks using GPU runtime.

  ```sqlexample
  ALTER ACCOUNT SET DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU='my_pool';
  ```
* Configure `my_pool` as the compute pool preferred for Notebooks created in the database `my_db`.

  ```sqlexample
  ALTER DATABASE my_db SET DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU='my_pool';
  ```
* Configure `my_pool` as the compute pool preferred for Notebooks created in the schema `my_db.my_schema`.

  > ```sqlexample
  > ALTER SCHEMA my_db.my_schema SET DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU='my_pool';
  > ```

Use the following commands to check the current GPU compute pool preference configured in your account to run Notebooks:

```sqlexample
SHOW PARAMETERS LIKE 'DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU' IN ACCOUNT;

SHOW PARAMETERS LIKE 'DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU' IN DATABASE my_db;

SHOW PARAMETERS LIKE 'DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU' IN SCHEMA my_db.my_schema;
```

For more information, see [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md).

## Guidelines and limitations

* **CREATE COMPUTE POOL permission:** If you cannot create a compute pool under the current role,
  consult your account administrator to grant permission. For example:

  ```sqlexample
  GRANT CREATE COMPUTE POOL ON ACCOUNT TO ROLE <role_name> [WITH GRANT OPTION];
  ```

  For more information, see [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md).
* **Per account limit on compute pool nodes**.

  + The maximum number of nodes you can create in your account (regardless of the number of compute pools) is 500.
  + The maximum number of nodes per compute pool is 50.

  In addition, there is a limit on the number of nodes allowed for each instance family (see the **Node limit** column in the instance family table). If you see an error message like `Requested number of nodes <#> exceeds the node limit for the account`, you have encountered these limits. For more information, contact your account representative.

---
title: Snowpark Container Services: Working with services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/working-with-services.md
section: Snowpark Container Services
---

# Snowpark Container Services: Working with services

[Snowpark Container Services](overview.md) lets you more easily
deploy, manage, and scale containerized applications.
After you create an application and upload the
application image to a repository in your Snowflake account, you can run your
application containers as a service.

A service represents Snowflake running your containerized application on a
[compute pool](working-with-compute-pool.md), which is a collection of virtual machine (VM) nodes. There are two types of services:

* **Long-running services.** A long-running service is like a web service that does not end
  automatically. After you create a service, Snowflake manages the running service. For example, if a service container stops, for whatever reason, Snowflake restarts that container so the service runs uninterrupted.
* **Job services.** A job service terminates when your code exits, similar to a stored procedure. When all containers exit, the job service is done.

The following diagram shows the architecture of a service:

The highlights of the diagram are the following:

* Users upload their application code to a repository in their Snowflake account. The image registry service serves the OCIv2 API for storing
  OCI-compliant images in a repository. For example, you can use Docker API to upload images to a repository. When you create a service, you specify the image to use.
* A compute pool is where Snowflake runs your services. The diagram shows a compute pool having two compute nodes (Node 0 and Node 1). Snowflake runs your service instance on a node. When running multiple service instances, depending on resource requirements, Snowflake might run them on the same node or distribute them across multiple nodes. For example:

  + Node 0 is running service A (two instances of the three total instances for that service), and a job (with a single instance).
  + Node 1 is running the third instance of service A. This node is also running an instance of service B.
* Depending on your application code, a service instance can consist of multiple containers. While Snowflake might distribute instances of a service across multiple compute pool nodes, all containers within a single service instance always run on the same compute pool node.
* Services can optionally communicate with the public internet.
* A service can use storage including transient storage (for example, memory and local disk) and persistent volumes (for example, block volumes).
* Snowflake can record logs, traces, and metrics from your services to the event table in your Snowflake account.

Snowflake provides APIs for you to create and manage repositories, compute pools, and services. This topic explains working with services. APIs for managing services include the following:

* **SQL commands:**

  + **Creating a service.** [CREATE SERVICE](../../sql-reference/sql/create-service.md), [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md).
  + **Altering a service.** [ALTER SERVICE](../../sql-reference/sql/alter-service.md), [DROP SERVICE](../../sql-reference/sql/drop-service.md).
  + **Getting information about a service.** [SHOW SERVICES](../../sql-reference/sql/show-services.md), [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md), and [other commands](../../sql-reference/commands-snowpark-container-services.md).
* **Non-SQL interfaces:** [Snowflake Python APIs](../snowflake-python-api/snowflake-python-overview.md), [Snowflake REST APIs](../snowflake-rest-api/snowflake-rest-api.md), and [Snowflake CLI](../snowflake-cli/index.md).

## Starting services

After you upload your application code to a [repository](working-with-registry-repository.md) in your Snowflake account, you can start a service. The minimum information required to start a service includes:

* **A name:** Name of the service.
* **A service specification:** This [specification](specification-reference.md) provides Snowflake
  with the information needed to run your service. The specification is a YAML file.
* **A compute pool:** Snowflake runs your service in the specified
  [compute pool](working-with-compute-pool.md).

### Create a long running service

Use [CREATE SERVICE](../../sql-reference/sql/create-service.md) to create a long running service.

* In most cases, you create a service by specifying an inline specification, as shown below:

  ```sqlexample
  CREATE SERVICE echo_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
     spec:
       containers:
       - name: echo
         image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:tutorial
         readinessProbe:
           port: 8000
           path: /healthcheck
       endpoints:
       - name: echoendpoint
         port: 8000
         public: true
     $$;
  ```
* Create a service by referencing a service specification stored on a Snowflake stage. When you deploy the service in a production
  environment, you can apply the separation of concerns design principle and upload the specification to a stage, providing stage
  information in the CREATE SERVICE command, as shown:

  ```sqlexample
  CREATE SERVICE echo_service
    IN COMPUTE POOL tutorial_compute_pool
    FROM @tutorial_stage
    SPECIFICATION_FILE='echo_spec.yaml';
  ```

### Run a job service

Use [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) to create a job service. By default this command runs synchronously, and returns a response after all containers of the job service exit. You can optionally specify the `ASYNC` parameter to run the job service asynchronously.

* Execute a job service using an inline specification. The command waits until the job finishes executing:

  ```sqlexample
  EXECUTE JOB SERVICE
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
     spec:
       containers:
       - name: main
         image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
         env:
           SNOWFLAKE_WAREHOUSE: tutorial_warehouse
         args:
         - "--query=select current_time() as time,'hello'"
         - "--result_table=results"
     $$;
  ```

  You can optionally execute this job asynchronously using the `ASYNC` property.

  ```sqlexample
  EXECUTE JOB SERVICE
     IN COMPUTE POOL tutorial_compute_pool
     NAME = example_job_service
     ASYNC = TRUE
     FROM SPECIFICATION $$
     ...
     $$;
  ```

  When you execute an asynchronous job, you can use the helper function [<service_name>!SPCS_WAIT_FOR](../../sql-reference/functions/spcs_wait_for.md) to wait for the job to complete.

  ```sqlexample
  CALL example_job_service!spcs_wait_for('DONE', 120)
  ```
* Execute a job service using stage information:

  ```sqlexample
  EXECUTE JOB SERVICE
    IN COMPUTE POOL tutorial_compute_pool
    NAME = example_job_service
    FROM @tutorial_stage
    SPECIFICATION_FILE='my_job_spec.yaml';
  ```

### Run multiple replicas of a job service (batch jobs)

By default, [EXECUTE JOB SERVICE](../../sql-reference/sql/execute-job-service.md) runs a single job service instance on a compute pool to execute the job.
However, you might choose to run multiple job service replicas to distribute the workload across compute pool nodes. For example, you might use 10 replicas to process a 10-million-row dataset, with each handling 1 million rows.

Batch jobs support scenarios where the work can be partitioned into independent tasks — one per job service instance (also referred to as replica) — that can potentially be executed concurrently. Snowflake’s ability to execute the instances concurrently depends on the size of the compute pool.

To execute a batch job with multiple instances, use the optional REPLICAS parameter of the EXECUTE JOB SERVICE as shown. The following example executes a job service with 10 instances:

```sqlexample-yaml
EXECUTE JOB SERVICE
  IN COMPUTE POOL my_pool
  NAME = example_job
  REPLICAS = 10
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: main
      image: my_repo/my_job_image:latest
$$;
```

When the REPLICAS parameter is specified in EXECUTE JOB SERVICE, Snowflake populates the following two environment variables in the job container:

* `SNOWFLAKE_JOBS_COUNT`: The value of the REPLICAS property specified on the EXECUTE JOB SERVICE.
* `SNOWFLAKE_JOB_INDEX`: The ID of the job service instance, starting from 0. If you have three replicas, the instance IDs will be 0, 1, and 2.

These environment variables are provided so that a job container can use them to partition the input and assign each instance a specific partition to process. For example, when processing 10 million rows with 10 job replicas, the instance with job index 0 would process rows 1 through 1 million, the instance with job index 1 would process rows from 1 million to 2 million, and so on.

Use the [SHOW SERVICE INSTANCES IN SERVICE](../../sql-reference/sql/show-service-instances-in-service.md) command to find the status of each job service instance.

Use the [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md) command to get overall job service status. Snowflake calculates the overall job service status as follows:

* If any instance fails, the job status is FAILED.
* If all instances complete successfully, the job status is DONE.
* If any instance is currently running, the job status is RUNNING.
* Otherwise, the job service status is PENDING.

### Using specification templates

There are times you might want to create multiple services using the same specification but with different configurations. For example, you suppose that you define an [environment variable](specification-reference.md) in a service specification and you want to create multiple services using the same specification but different values for the environment variable.

Specification templates enable you to define variables for field values in the specification. When you create a service you provide values for these variables.

In a specification template, you specify variables as values for various specification fields. Use the `{{ variable_name }}` syntax to specify these variables. Then, in the CREATE SERVICE command, specify the USING parameter to set values for these variables.

For example, the inline specification template in the following CREATE SERVICE command uses a variable named `tag_name` for the image tag name. You can use this variable to specify a different image tag for each service. In this example, the USING parameter sets the `tag_name` variable to the value `latest`.

```sqlexample-yaml
CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:{{ tag_name }}
        ...
    endpoints:
    - name: ...
      ...
  $$
  USING (tag_name=>'latest');
```

If you choose to save the specification template to a Snowflake stage in your account, you can point to the location of the template in the CREATE SERVICE command:

```sqlexample
CREATE SERVICE echo_service
    IN COMPUTE POOL tutorial_compute_pool
    FROM @STAGE SPECIFICATION_TEMPLATE_FILE='echo.yaml'
    USING (tag_name=>'latest');
```

#### Guidelines for defining variables in a specification

* Use the `{{ variable_name }}` syntax to define variables as field values in the specification.
* These variables can have default values. To specify the default value, use the `default` function in the variable declaration. For example, the following specification defines two variables (`character_name` and `endpoint_name`) with default values.

  ```yaml
  spec:
    containers:
    - name: echo
      image: <image_name>
      env:
        CHARACTER_NAME: {{ character_name | default('Bob') }}
        SERVER_PORT: 8085
    endpoints:
    - name: {{ endpoint_name | default('echo-endpoint') }}
      port: 8085
  ```

  In addition, you can specify an optional boolean parameter to the `default` function to indicate whether you want the default value used when a blank value is passed in for the variable. Consider this specification:

  ```yaml
  spec:
    containers:
    - name: echo
      image: <image_name>
      env:
        CHARACTER_NAME: {{ character_name | default('Bob', false) }}
        SERVER_PORT: 8085
    endpoints:
    - name: {{ endpoint_name | default('echo-endpoint', true) }}
      port: 8085
  ```

  In the specification:

  + For the `character_name` variable, the boolean parameter is set to `false`. Therefore, if the variable is set to an empty string value (‘’) to this parameter, the value remains blank; the default value (“Bob”) is not used.
  + For the `echo_endpoint` variable, the boolean parameter is set to `true`. Therefore, if you pass a blank value to this parameter, the default value (“echo-endpoint”) is used.

  By default, the boolean parameter for the `default` function is `false`.

#### Guidelines for passing values for specification variables

Specify the USING parameter in the CREATE SERVICE command to provide values for variables. The general syntax for USING is:

```sqlsyntax
USING( var_name=>var_value, [var_name=>var_value, ... ] );
```

where

* `var_name` is case sensitive and it should be a valid Snowflake identifier (see
  [Identifier requirements](../../sql-reference/identifiers-syntax.md)).
* `var_value` can be either an alphanumeric value or a valid JSON value.

  Examples:

  ```sqlexample
  -- Alphanumeric string and literal values
  USING(some_alphanumeric_var=>'blah123',
        some_int_var=>111,
        some_bool_var=>true,
        some_float_var=>-1.2)

  -- JSON string
  USING(some_json_var=>' "/path/file.txt" ')

  -- JSON map
  USING(env_values=>'{"SERVER_PORT": 8000, "CHARACTER_NAME": "Bob"}' );

  -- JSON list
  USING (ARGS=>'["-n", 2]' );
  ```
* The USING parameter in CREATE SERVICE must provide values for the specification variables (except the variables for which the specification provides default values). Otherwise, an error is returned.

### Examples

These examples show creating services using specification templates. The CREATE SERVICE commands in these examples use inline specification.

#### Example 1: Provide simple values

In [Tutorial 1](tutorials/tutorial-1.md) you create a service by providing an inline specification. The following example is a modified version of the same where the specification defines two variables: `image_url` and `SERVER_PORT`. Note that the `SERVER_PORT` variable is repeated in three places. This has the added benefit of using variables that ensure all these fields that are expected to have the same value do have the same value.

```sqlexample-yaml
CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   MIN_INSTANCES=1
   MAX_INSTANCES=1
   FROM SPECIFICATION_TEMPLATE $$
      spec:
         containers:
         - name: echo
           image: {{ image_url }}
           env:
             SERVER_PORT: {{SERVER_PORT}}
             CHARACTER_NAME: Bob
           readinessProbe:
             port: {{SERVER_PORT}}
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: {{SERVER_PORT}}
           public: true
         $$
      USING (image_url=>' "/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest" ', SERVER_PORT=>8000 );
```

In this CREATE SERVICE command, the USING parameter provides values for the two specification variables. The `image_url` value includes slashes and a colon. These are not alphanumeric characters. Therefore, the example wraps the value in double quotes to make it a valid JSON string value. The template specification expands the following specification:

```yaml
spec:
  containers:
  - name: echo
    image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
    env:
      SERVER_PORT: 8000
      CHARACTER_NAME: Bob
    readinessProbe:
      port: 8000
      path: /healthcheck
    endpoints:
    - name: echoendpoint
      port: 8000
      public: true
```

#### Example 2: Provide a JSON value

In Tutorial 1, the specification defines two environment variables (`SERVER_PORT` and `CHARACTER_NAME`) as shown:

```yaml
spec:
 containers:
 - name: echo
   image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
   env:
     SERVER_PORT: 8000
     CHARACTER_NAME: Bob
   …
```

You can templatize this specification by using a variable for the `env` field. This lets you create multiple services with different values for the environment variables. The following CREATE SERVICE command uses a variable (`env_values`) for the env field.

```sqlexample
CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  MIN_INSTANCES=1
  MAX_INSTANCES=1
  FROM SPECIFICATION_TEMPLATE $$
     spec:
       containers:
       - name: echo
         image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
         env: {{env_values}}
         readinessProbe:
           port: {{SERVER_PORT}}    #this and next tell SF to connect to port 8000
           path: /healthcheck
       endpoints:
       - name: echoendpoint
         port: {{SERVER_PORT}}
         public: true
        $$
     USING (env_values=>'{"SERVER_PORT": 8000, "CHARACTER_NAME": "Bob"}' );
```

The USING parameter in CREATE SERVICE provides value for the `env_values` variable. The value is a JSON map that provides values for both the environment variables.

#### Example 3: Provide list as variable value

In [Tutorial 2](tutorials/tutorial-2.md), the specification includes the `args` field that includes two arguments.

```yaml
spec:
  container:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
    env:
      SNOWFLAKE_WAREHOUSE: tutorial_warehouse
    args:
    - "--query=select current_time() as time,'hello'"
    - "--result_table=results"
```

In a template version of the specification, you can provide these arguments as a JSON list as shown:

```yaml
spec:
  container:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
    env:
      SNOWFLAKE_WAREHOUSE: tutorial_warehouse
    args: {{ARGS}}
  $$
  USING (ARGS=>$$["--query=select current_time() as time,'hello'", "--result_table=results"]$$ );
```

## Scaling services

By default, Snowflake runs one instance of the service in the specified compute pool.
To manage heavy workloads, you can run multiple service instances by setting the MIN_INSTANCES and MAX_INSTANCES properties, which specify the minimum number of instances of the service to start with and the maximum instances Snowflake can scale to when needed.

**Example**

```sqlexample
CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   FROM @tutorial_stage
   SPECIFICATION_FILE='echo_spec.yaml'
   MIN_INSTANCES=2
   MAX_INSTANCES=4;
```

When multiple service instances are running, Snowflake automatically
provides a load balancer to distribute the incoming requests.

Snowflake does not consider the service to be READY until at least two instances are available. While the service is not ready, Snowflake blocks access to it, meaning that associated service functions or ingress requests are denied until readiness is confirmed.

In some cases, you might want Snowflake to consider the service ready (and forward incoming requests) even if fewer than the specified minimum instances are available. You can achieve this by setting the MIN_READY_INSTANCES property.

Consider this scenario: During maintenance or a rolling service upgrade, Snowflake might terminate one or more service instances. This could lead to fewer available instances than the specified MIN_INSTANCES, which prevents the service from entering the READY state. In these cases, you can set MIN_READY_INSTANCES to a value smaller than MIN_INSTANCES to ensure that the service can continue to accept requests.

**Example**

```sqlexample
CREATE SERVICE echo_service
   IN COMPUTE POOL tutorial_compute_pool
   FROM @tutorial_stage
   SPECIFICATION_FILE='echo_spec.yaml'
   MIN_INSTANCES=2
   MAX_INSTANCES=4
   MIN_READY_INSTANCES=1;
```

For more information, see [CREATE SERVICE](../../sql-reference/sql/create-service.md).

### Enabling autoscaling

To configure Snowflake to autoscale the number of service instances running, set the MIN_INSTANCES and MAX_INSTANCES parameters in the CREATE SERVICE command. You can also use ALTER SERVICE to change these values. Autoscaling occurs when the specified MAX_INSTANCES is greater than MIN_INSTANCES.

Snowflake starts by creating the minimum number of service instances on the specified compute pool. Snowflake then scales up or scales down the number of service instances based on an 80% CPU resource requests. Snowflake continuously monitors CPU utilization within the compute pool, aggregating the usage data from all currently running service instances.

When the aggregated CPU usage (across all service instances) surpasses 80%, Snowflake deploys an additional service instance within the compute pool. If the aggregated CPU usage falls below 80%, Snowflake scales down by removing a running service instance. Snowflake uses a five-minute stabilization window to prevent frequent scaling. The `target_instances` service property reports the target number of service instances that Snowflake is scaling towards.

Note the following scaling behaviors:

* The scaling of service instances is constrained by the MIN_INSTANCES and MAX_INSTANCES parameters configured for the service.
* If scaling up is necessary and the compute pool nodes lack the necessary resource capacity to start up another service instance, compute pool autoscaling can be triggered. For more information, see
  [Autoscaling of compute pool nodes](working-with-compute-pool.md).
* If you specify the MAX_INSTANCES and MIN_INSTANCES parameters when creating a service but don’t specify the CPU and memory requirements for your service instance in the service specification file, no autoscaling occurs; Snowflake starts with the number of instances specified by the MIN_INSTANCES parameter and does not autoscale.

### Suspending a service

A long-running service consumes compute pool resources, incurring costs, but you can suspend the service when it’s not performing meaningful work. When no services or jobs are active on any compute pool node, Snowflake’s compute pool auto-suspend mechanism suspends the pool to reduce costs.

To suspend a service, you can either explicitly call [ALTER SERVICE … SUSPEND](../../sql-reference/sql/alter-service.md) to suspend a service or set the AUTO_SUSPEND_SECS property using [CREATE SERVICE](../../sql-reference/sql/create-service.md) or [ALTER SERVICE](../../sql-reference/sql/alter-service.md) to define the idle duration after which Snowflake automatically suspends the service.

[Preview Feature](../../release-notes/preview-features.md) — Open

Configuring the automatic suspension of a Snowpark Container Services service using the AUTO_SUSPEND_SECS property is a [preview feature](../../release-notes/preview-features.md).

When the AUTO_SUSPEND_SECS property is set, Snowflake automatically suspends a service if it’s not already suspended and it’s idle for more than AUTO_SUSPEND_SECS seconds. A service is idle when
both of the following are true:

* There is no query currently running that includes a service function invocation to that service.
* The service status is RUNNING.

> **Caution:**
>
> Auto-suspension doesn’t track data processing initiated by a service function invocation, where the processing continues after the service
> function returns. In the current implementation, auto-suspension also doesn’t track ingress and service-to-service communications. Therefore, you should not enable auto-suspension
> for services that provide such features, because it might disrupt these potentially ongoing processes.

When Snowflake suspends a service, it shuts down all service instances on the compute pool. If there are no other services running on the compute pool and if auto-suspend is configured for the compute pool, then Snowflake also suspends the compute pool nodes. You thus avoid having to pay for an inactive compute pool.

Also, note the following:

* Auto-suspension is not supported for job services.
* Auto-suspension is not supported on services with public endpoints because Snowflake currently only tracks service function traffic and
  not ingress traffic in deciding when a service is idle.

## Modify and drop services

After your create a service or a job service, you can perform the following actions:

* Use the [DROP SERVICE](../../sql-reference/sql/drop-service.md) command to remove a service from a schema, Snowflake terminates all the service containers.
* Call the [<service_name>!SPCS_CANCEL_JOB](../../sql-reference/functions/spcs_cancel_job.md) function to cancel a job service. When you cancel a job, Snowflake stops the job from running and removes the resources allocated for the job run.
* Use the [ALTER SERVICE](../../sql-reference/sql/alter-service.md) command to modify the service; for example, suspend or resume the service, change the
  number of instances running, and direct Snowflake to redeploy your service by using a new service specification.

  > **Note:**
  >
  > You can’t alter a job service.

### Terminate service

When you suspend a service (ALTER SERVICE … SUSPEND) or drop a service (DROP SERVICE), Snowflake terminates all the service instances. Similarly, when you upgrade service code (ALTER SERVICE … <fromSpecification>), Snowflake applies rolling upgrades by terminating and redeploying one service instance at a time.

When terminating a service instance, Snowflake first sends a SIGTERM signal to each service container. The container has the option to process the signal and shut down gracefully with a 30-second window. Otherwise, after the grace period, Snowflake terminates all the processes in the container.

### Updating service code and redeploying the service

After a service is created, use the ALTER SERVICE … <fromSpecification> command to update service code and redeploy the service.

You first upload the modified application code to your image repository. You then execute the ALTER SERVICE command, either providing the service specification inline or specifying the path to a specification file in the Snowflake stage. For example:

```sqlexample
ALTER SERVICE echo_service
FROM SPECIFICATION $$
spec:
  …
  …
$$;
```

Upon receiving the request, Snowflake redeploys the service using the new code.

> **Note:**
>
> When you run the CREATE SERVICE … <fromSpecification> command, Snowflake records the specific version of the provided image. Snowflake deploys that same image version in the following scenarios, even if the image in the repository has been updated:
>
> * When a suspended service is resumed (using ALTER SERVICE … RESUME).
> * When autoscaling adds more service instances.
> * When service instances are restarted during cluster maintenance.
>
> But when you call ALTER SERVICE … <fromSpecification>, Snowflake uses the latest version in the repository for that image.

If you are the service owner, the output of the DESCRIBE SERVICE command includes the service specification, which includes the image digest (the value of the `sha256` field in the specification), as shown below:

```yaml
spec:
containers:
- name: "echo"
    image: "/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest"
    sha256: "@sha256:8d912284f935ecf6c4753f42016777e09e3893eed61218b2960f782ef2b367af"
    env:
      SERVER_PORT: "8000"
      CHARACTER_NAME: "Bob"
    readinessProbe:
      port: 8000
      path: "/healthcheck"
endpoints:
- name: "echoendpoint"
    port: 8000
    public: true
```

ALTER SERVICE can impact communications (see Using a service) with the service.

* If ALTER SERVICE … <fromSpecification> removes an endpoint or removes relevant permissions required to use an endpoint (see [serviceRoles in Specification Reference](specification-reference.md)), access to the service will fail. For more information, see Using a Service.
* While the upgrade is in progress, new connections might get routed to the new version. If the new service version is not backward compatible, it will disrupt any active service usage. For example, ongoing queries using a service function might fail.

> **Note:**
>
> When updating service code that is part of a native app with containers, you can use the [SYSTEM$WAIT_FOR_SERVICES](../../sql-reference/functions/system_wait_for_services.md) system function to pause the native app setup script to allow for the services to upgrade completely. For more information, see [Upgrade an app (Legacy)](../native-apps/update-app-upgrade.md).

#### Monitoring rolling updates

When multiple service instances are running, Snowflake performs a rolling update, in descending order, based on the ID of the service instances. Use the following commands to monitor service updates:

* [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md) and [SHOW SERVICES](../../sql-reference/sql/show-services.md):

  + The `is_upgrading` column in the output shows TRUE if the service is being upgraded.
  + The `spec_digest` column in the output represents the spec digest of the current service specification. You can execute this command periodically; a change in the `spec_digest` value indicates a service upgrade was triggered. The `spec_digest` is in use only after `is_upgrading` is FALSE; otherwise, the service upgrade is still in progress.

    Use the [SHOW SERVICE INSTANCES IN SERVICE](../../sql-reference/sql/show-service-instances-in-service.md) command to check whether all the instances have been updated to the latest version as explained below.
* [SHOW SERVICE INSTANCES IN SERVICE](../../sql-reference/sql/show-service-instances-in-service.md):

  + The `status` column in the output provides the status of each individual service instance while the rolling upgrade is in progress. During the upgrade, you will observe each service instance transition status, such as TERMINATING to PENDING, and PENDING to READY.
  + During the service upgrade, the `spec_digest` column in the output of this command might show a different value from SHOW SERVICES, which always returns the latest spec digest. This difference simply indicates that the service upgrade is in progress and service instances are still running the old version of the service.

## Get information about services

You can use the these commands:

* Use the [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md) command to retrieve the properties and status of a service. The output returns all service properties.
* Use the [SHOW SERVICES](../../sql-reference/sql/show-services.md) command to list current services (including job services) for which you have permissions. The output provides some of the properties and status for these services.

  By default, the output lists services in the current database and schema. You can alternatively specify any of the following scopes. For example:

  + **List the services in the account, in a specific database, or in a specific schema:** For example, use the IN ACCOUNT filter to list
    services in your Snowflake account, regardless of which
    database or schema the services belong to. This is useful if you have Snowflake services
    created in multiple databases and schemas in your account. Like all other commands, SHOW SERVICES IN ACCOUNTS is gated by privileges, returning only the services for which the role you are using has viewing permissions.

    You can also specify IN DATABASE or IN SCHEMA to list the services in the current (or specified) database or schema.
  + **List the services running in a compute pool:** For example, use IN COMPUTE POOL filter to list the services running in a compute pool.
  + **List the services that start with a prefix or that match a pattern:** You can apply the LIKE and STARTS WITH filters to filter the services by name.
  + **List job services. or exclude job services from the list:** You can use SHOW JOB SERVICES or SHOW SERVICES EXCLUDE JOBS to list only
    job services or exclude job services.

  You can also combine these options to customize the SHOW SERVICES output.
* Use the [SHOW SERVICE INSTANCES IN SERVICE](../../sql-reference/sql/show-service-instances-in-service.md) command to retrieve properties of the service instances.
* Use the [SHOW SERVICE CONTAINERS IN SERVICE](../../sql-reference/sql/show-service-containers-in-service.md) command to retrieve the properties and status of the service instances.
* Call [GET_JOB_HISTORY](../../sql-reference/functions/get_job_history.md) function to get the job histories for jobs that were run within a specified time range.
* Call the [<service_name>!SPCS_WAIT_FOR](../../sql-reference/functions/spcs_wait_for.md) function to wait and retrieve the service state — including the state of a job service — after a specific time.

## Monitoring services

Snowpark Container Services offers tools to monitor compute pools in your account and the services running on them. For more information, see [Snowpark Container Services: Monitoring Services](monitoring-services.md).

## Managing service-related privileges

There are three aspects to managing service-related privileges:

* What privileges are needed for the service to run?
* What privileges are needed to perform operations on the service?
* What privileges are needed to access the service endpoints?

The following section provides the details.

### Privileges needed for the service to run (service owner role)

The role that creates a service is the *service’s owner role*. The service executes all SQL in the context of this role. For more information, see [Using Snowflake-provided service user credentials](spcs-execute-sql.md).

If a service requires a privilege — for example, to access or perform an operation on a database object— the owner role must be granted that privilege.

In service-to-service communication, the owner role determines which endpoints on the destination service are accessible (see service roles).

### Privileges needed to perform operations on the service

If a role needs to perform an operation on a service (for example, suspend the service), that role must be granted the privilege to perform that operation.

The following list shows the privileges that you can grant a role to perform operations on a service:

* **USAGE:** This privilege allows a role to list services, [SHOW SERVICES](../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md). The privilege doesn’t allow access to service endpoints. For information, see Privileges needed to access the service endpoints (service roles).
* **MONITOR:** This privilege allows a role to inspect [service telemetry](monitoring-services.md) such as logs and container runtime status information. For more information, see [SHOW SERVICE CONTAINERS IN SERVICE](../../sql-reference/sql/show-service-containers-in-service.md).
* **OPERATE:** This privilege allows a role to operate on a service; for example, suspend, resume, upgrade a long running service, or cancel a job service. For more information about these operations, see [ALTER SERVICE](../../sql-reference/sql/alter-service.md).
* **OWNERSHIP:** This privilege grants a role to all the preceding privileges. It also grants a role privilege to modify service properties and to inspect the service specification — [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md) output includes the service specification only if the role has the OWNERSHIP privilege.

Each SQL command reference provides access control requirements for the commands. For more information, see [SHOW SERVICES](../../sql-reference/sql/show-services.md), [ALTER SERVICE](../../sql-reference/sql/alter-service.md), and [DROP SERVICE](../../sql-reference/sql/drop-service.md).

The following rule apply to future grants on services:

* You can define future grants to a role on services that aren’t created. For example, the following command grants MONITOR privilege on a future service (myservice) to a role (service_admin_role).

  ```sqlexample
  GRANT MONITOR ON FUTURE SERVICES IN SCHEMA myschema TO ROLE service_admin_role
  ```

  The only exception is the OWNERSHIP privilege. You can’t grant the OWNERSHIP privilege on a future service. For more information about future grants and related required privileges, see [Future grants on database or schema objects](../../sql-reference/sql/grant-privilege.md).
* Transfer of service ownership — including future ownership transfer by using GRANT OWNERSHIP ON FUTURE SERVICES — isn’t supported.

### Privileges needed to access the service endpoints (service roles)

A service can expose one or more endpoints that clients access by sending requests. The service’s owner role has full access to the service and its endpoints. To allow other roles to access the endpoints, you must grant them the appropriate privileges. Snowflake supports defining “service roles” in the service specification to manage access to the exposed endpoints.

Examples where a role needs a service role include:

* An owner role of a service function needs a service role that grants access to the endpoint the service function references. Otherwise, you cannot create the service function. If the owner role of the service function loses permission to the service role after creation of the service function, queries using that service function will fail with a permission error.
* In service-to-service communications, the owner role of the service needs the service role that grants access to another service’s endpoint to call that endpoint.
* A user making ingress requests from outside Snowflake to a public endpoint exposed by a service must be granted a role that is granted a service role to allow access to that endpoint.

To enable a role (say `some_role`) to access a service endpoint, you do the following:

1. Grant the USAGE privilege on the database and schema where the service is created. These privileges enable resolving the names of objects in the schema. In this case, the object is the service.

   For example, the following commands grant these USAGE privileges to a role (some_role).

   ```sqlexample
   GRANT USAGE ON DATABASE my_db TO ROLE some_role;
   GRANT USAGE ON SCHEMA my_schema TO ROLE some_role;
   ```
2. Grant the service role that has permission to access the endpoints (see [GRANT SERVICE ROLE](../../sql-reference/sql/grant-service-role.md)).

   > You have these options:
   >
   > * Grant access to all endpoints that the service exposes using the `all_endpoints_usage` service role, a pre-defined service role that Snowflake creates for every service. A service role name uses this syntax: `service-name!service-role`.
   >
   >   ```sqlexample
   >   GRANT SERVICE ROLE my_service!all_endpoints_usage TO ROLE some_role;
   >   ```
   > * Grant access to specific endpoints that a service exposes. This requires you to define one or more service roles in the specification with permission to specific endpoints. Then grant these service roles to manage fine-grained endpoint access.
   >
   >   In the following CREATE SERVICE command, the inline specification defines two endpoints (ep1 and ep2) and a service role (ep1_role) that is granted access to only the ep1 endpoint.
   >
   >   ```sqlexample-yaml
   >   USE DATABASE my_db;
   >   USE SCHEMA my_schema;
   >
   >
   >   CREATE SERVICE my_service
   >   IN COMPUTE POOL tutorial_pool
   >   FROM SPECIFICATION $$
   >   spec:
   >     containers:
   >     - name: echo
   >       image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
   >     endpoints:
   >     - name: ep1
   >       port: 8000
   >       public: true
   >     - name: ep2
   >       port: 8082
   >       public: true
   >   serviceRoles:
   >   - name: ep1_role
   >     endpoints:
   >     - ep1
   >   $$
   >   ```
   >
   >   Now, to grant `some_role` access to the `ep1` endpoint, grant the `ep1_role` service role as shown:
   >
   >   ```sqlexample
   >   GRANT SERVICE ROLE my_service!ep1_role TO ROLE some_role;
   >   ```

## Using a service

After creating a service, users in the same account (that created the service) can use it. There are three methods to use a service as illustrated in the diagram. The user needs access to roles having the necessary privileges.

The diagram highlights the methods for using the service, while other service-related components are grayed out for clarity. For a detailed explanation of the service components, refer to the diagram at the beginning of this page.

* **Use the service from a SQL query** (Service function):
  You create a service function, a user-defined function (UDF) associated with a service, and use it in a SQL query and leverage custom data processing that your service provides.
  For an example, see [Tutorial 1](tutorials/tutorial-1.md).
* **Use the service from outside Snowflake** (Ingress): You can declare one or more service endpoints as public to allow network ingress access to the service. This can be used to build web apps or exposed APIs over your Snowflake data. For an example, see [Tutorial 1](tutorials/tutorial-1.md).
* **Use service from another service** (Service-to-service communications): Services can communicate with each other by using Snowflake-assigned service DNS name for service-to-service communication
  For an example, see [Tutorial 4](tutorials/advanced/tutorial-4.md).

As the diagram illustrates, when communicating with a service using any of these methods, you send requests to endpoints that the service exposes and get results.

> **Note:**
>
> Service functions cannot be used to communicate with a job service.

The following sections provide details.

### Service functions: Using a service from an SQL query

A service function is a user-defined function (UDF) you
create using [CREATE FUNCTION (Snowpark Container Services)](../../sql-reference/sql/create-function-spcs.md). However, instead of writing
the UDF code directly, you associate the UDF with your
service endpoint. Note that you can associate a service function only with a service endpoint that supports the HTTP protocol (see [spec.endpoints field (optional)](specification-reference.md)).

For example, in [Tutorial 1](tutorials/tutorial-1.md), you create a
service named `echo_service` that exposes one endpoint (echoendoint) as defined in the service specification:

```yaml
spec:
…
  endpoints:
  - name: echoendpoint
    port: 8080
```

`echoendpoint` is a user-friendly endpoint name that represents the
corresponding port (8080). To communicate with this service endpoint, you create
a service function by providing the SERVICE and ENDPOINT parameters as shown:

```sqlexample
CREATE FUNCTION my_echo_udf (text varchar)
   RETURNS varchar
   SERVICE=echo_service
   ENDPOINT=echoendpoint
   AS '/echo';
```

The `AS` parameter provides the HTTP path to the service code.
You get this path value from the service code. For example, the following code lines are from `service.py` in [Tutorial 1](tutorials/tutorial-1.md).

```python
@app.post("/echo")
def echo():
...
```

You invoke the service function in a SELECT statement such as the following:

```sqlexample
SELECT service_function_name(<parameter-list>);
```

Snowflake directs the request to the associated service endpoint and path.

> **Note:**
>
> A service function is used to communicate with a service, and not with a job. In other words, you can only associate a service (not a job) with a service function.

#### Data exchange format

For data exchange between a service function and an application container,
Snowflake follows the same format that external functions
use (see [Data Formats](../../sql-reference/external-functions-data-format.md)).
For example, suppose you have data rows stored in a table (`input_table`):

```output
"Alex", "2014-01-01 16:00:00"
"Steve", "2015-01-01 16:00:00"
…
```

To send this data to your service, you invoke the service function by passing
these rows as parameters:

```sqlexample
SELECT service_func(col1, col2) FROM input_table;
```

Snowflake sends a series of requests to the container,
with batches of data rows in the request body in this
format:

```sqljson
{
   "data":[
      [
         0,
         "Alex",
         "2014-01-01 16:00:00"
      ],
      [
         1,
         "Steve",
         "2015-01-01 16:00:00"
      ],
      …
      [
         <row_index>,
         "<column1>",
         "<column2>"
      ],
   ]
}
```

The container then returns the output in the following format:

```sqljson
{
   "data":[
      [0, "a"],
      [1, "b"],
      …
      [ row_index,  output_column1]
   ]
}
```

The example output shown assumes that the result is a one-column
table with rows (“a”, “b” …).

#### Configuring batch processing

The [CREATE FUNCTION](../../sql-reference/sql/create-function-spcs.md) and [ALTER FUNCTION](../../sql-reference/sql/alter-function-spcs.md) commands support parameters that configure how Snowflake handles batches of data processed by your service..

* Configuring batch size

  You can use the MAX_BATCH_ROWS parameter to limit the batch size, that is, the maximum number of rows Snowflake sends to your service in a single request. This helps control the volume of data transferred. This can also result in more, smaller batches that might be processed in parallel if your service supports multiple instances or concurrent requests.
* Handling errors

  You can use the these parameters for batch error handling: `ON_BATCH_FAILURE`, `MAX_BATCH_RETRIES`, and `BATCH_TIMEOUT_SECS`.

For example, the following ALTER FUNCTION command configures the MAX_BATCH_ROWS and MAX_BATCH_RETRIES parameters of the `my_echo_udf` service function:

```sqlexample
ALTER FUNCTION my_echo_udf(VARCHAR) SET
   MAX_BATCH_ROWS = 15
   MAX_BATCH_RETRIES = 5;
```

#### Privileges required to create and manage service functions

To create and manage service functions, a role needs the following
privileges:

* The current role must have the service role granted for the endpoint referenced in [CREATE FUNCTION](../../sql-reference/sql/create-function-spcs.md) or [ALTER FUNCTION](../../sql-reference/sql/alter-function-spcs.md) command.
* To use a service function in a SQL query, the current session must have a role with usage privilege on the service function and the owner role of the service function must be granted the service role for the associated service endpoint.

The following example script shows how you might grant permissions to create and use
a service function:

```sqlexample
USE ROLE service_owner;
GRANT USAGE ON DATABASE service_db TO ROLE func_owner;
GRANT USAGE ON SCHEMA my_schema TO ROLE func_owner;
GRANT SERVICE ROLE ON service service_db.my_schema.my_service!all_endpoints_usage TO ROLE func_owner;
USE ROLE func_owner;
CREATE OR REPLACE test_udf(v VARCHAR)
  RETURNS VARCHAR
  SERVICE=service_db.my_schema.my_service
  ENDPOINT=endpointname1
  AS '/run';

SELECT test_udf(col1) FROM some_table;

ALTER FUNCTION test_udf(VARCHAR) SET
  SERVICE = service_db.other_schema.other_service
  ENDPOINT=anotherendpoint;

GRANT USAGE ON DATABASE service_db TO ROLE func_user;
GRANT USAGE ON SCHEMA my_schema TO ROLE func_user;
GRANT USAGE ON FUNCTION test_udf(varchar) TO ROLE func_user;
USE ROLE func_user;
SELECT my_test_udf('abcd');
```

### Ingress: Using a service from outside Snowflake

You can declare one or more endpoints as public in the service specification to allow users to use the service from the public.
Note that users must be Snowflake users in the same Snowflake account that created the service.

```yaml
spec:
  ...
  endpoints:
  - name: <endpoint name>
    port: <port number>
    public: true
```

Note that ingress is allowed only with an HTTP endpoint (see [spec.endpoints field (optional)](specification-reference.md)).

#### Ingress authentication

A user can access a public endpoint when that user is granted a service role that allows access to that endpoint. (see Privileges needed to access the service endpoints (service roles)).

Then users can access the public endpoint using a browser or programmatically.

* **Accessing a public endpoint by using a browser:** When the user uses a browser to access a public endpoint, Snowflake
  automatically redirects the user to a sign-in page. The user must provide their Snowflake credentials to sign in. After
  successfully signing in, the user has access to the endpoint. Behind the scenes, the user sign-in generates an OAuth token from
  Snowflake. The OAuth token is then used to send a request to the service endpoint.

  For an example, see [Tutorial 1](tutorials/tutorial-1.md).
* **Accessing a public endpoint programmatically:** There are three ways for programmatic clients to
  access endpoints:

  + Using a [programmatic access token (PAT)](../../user-guide/programmatic-access-tokens.md): Your application passes
    the token in the `Authorization` header of requests to the endpoint to represent its identity.
  + Using [key-pair authentication](../../user-guide/key-pair-auth.md): Your application generates a JWT by using
    a key pair, exchanges the JWT with Snowflake for an OAuth token, and then passes the OAuth token in the
    `Authorization` header of requests to the endpoint to represent its identity.
  + Using the [Python connector](../python-connector/python-connector.md): Your application uses
    the Python connector to generate a session token, and then passes the session token in the
    `Authorization` header of requests to the endpoint to represent its identity.

  For related examples, see [Tutorial 8](tutorials/advanced/tutorial-8-access-public-endpoint-programmatically.md).

#### User-specific headers in ingress requests

When a request for a public endpoint arrives, Snowflake automatically passes the following header along with the HTTP request to the container.

> ```none
> Sf-Context-Current-User: <user_name>
> ```

Your container code can optionally read the header, know who the caller is, and apply context-specific customization for different users. In addition, Snowflake can optionally include the `Sf-Context-Current-User-Email` header. To include this header, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### Service-to-service communications

Service instances can communicate directly with each other over TCP (including HTTP). This is true both for instances that belong to the same service and for instances that belong to different services.

Instances can only receive communications (requests) on the endpoints declared in the service specification. The client (the service sending the request) must have the required roles and grants to connect to that endpoint (see Privileges needed to access the service endpoints (service roles)).

* By default, a service instance can connect to other instances of the same service on the declared endpoints. In broader terms, a service’s owner role has permissions to connect to endpoints of services with the same owner role.
* In order for a client service to connect to an endpoint of a service that has a different owner role, the owner role of the client service needs the service role that grants access to another service’s endpoint to call that endpoint. For more information, see Privileges needed to access the service endpoints (service roles).
* If you want to prevent your services from communicating with each other (for reasons such as security), use different Snowflake roles
  to create those services.

A service instance can be reached using either the service IP address or the service instance IP addresses.

* Requests using the service IP address are routed to a load-balancer that in turn routes requests to a randomly selected service instance.
* Requests using the service instance IP address are routed directly to the specific service instance. You must use the service instance IP when connecting to an endpoint defined using the `portRange` field (see [spec.endpoints field (optional)](specification-reference.md)).

Both IP addresses are discoverable using the DNS name that Snowflake automatically assigns to each service. Note that it’s not possible to use DNS to connect to a specific instance. For example, it doesn’t make sense to construct a URL using the service instance DNS name, because there is no way to use the service instance DNS name to reference a specific service instance.

The service instance IP addresses are shown in the output of the [SHOW SERVICE INSTANCES IN SERVICE](../../sql-reference/sql/show-service-instances-in-service.md) command when the [2025_01 behavior change bundle](../../release-notes/bcr-bundles/2025_01/bcr-1883.md) is enabled.

For a service-to-service communication example, see [Tutorial 4](tutorials/advanced/tutorial-4.md).

Note that if a service endpoint is created only to allow service-to-service communications, the TCP protocol should be used (see [spec.endpoints field (optional)](specification-reference.md)).

#### Service DNS name

The DNS name format is:

```output
<service-name>.<hash>.svc.spcs.internal
```

Use [SHOW SERVICES](../../sql-reference/sql/show-services.md) (or [DESCRIBE SERVICE](../../sql-reference/sql/desc-service.md)) to get the DNS name of a service.
The preceding DNS name is a fully qualified name. Services created in the same schema can
communicate using just the `<service-name>`. Services that are in a different schema or database must provide the hash, such as `<service-name>.<hash>` or provide the fully qualified name (`<service-name>.<hash>.svc.spcs.internal`).

Use the [SYSTEM$GET_SERVICE_DNS_DOMAIN](../../sql-reference/functions/system_get_service_dns_domain.md) function to find the DNS domain for a given schema. The DNS hash domain is specific to the current version of the schema. Note the following:

* If that schema or its database is renamed, the hash does not change.
* If the schema is dropped and then recreated (for example using CREATE OR REPLACE SCHEMA) the new schema will have a new hash. If you UNDROP a schema, the hash remains the same.

DNS names have the following limitations:

* Your service names must be a valid DNS label. (See also <https://www.ietf.org/rfc/rfc1035.html#section-2.3.1>). Otherwise, creating a service will fail.
* Snowflake replaces an underscore (_) in the service name by a dash (-) in the DNS name.
* A DNS name is only for internal communications within Snowflake between services running in the same account. It is not accessible from the internet.

#### Service instances DNS name

The Service instances DNS name format is the following:

```output
instances.<service-name>.<hash>.svc.spcs.internal
```

It resolves to a list of service instance IP addresses, one for each instance of the service. Note that there is no guaranteed order to the list of IP addresses that DNS returns. This DNS name should only be used with DNS APIs, not as the hostname in a URL. The expectation is that your application uses this hostname with DNS APIs to collect the set of service instance IPs and then programmatically connect directly to those instance IPs.

This list of IP addresses enables the creation of a mesh network for direct communication between specific service instances.

#### Which DNS name to choose

The following considerations apply when choosing which DNS name to use when connecting to a service in service-to-service communication.

Use the service DNS name when any of the following is true:

* You need to access a specific destination port in the simplest possible way.
* You want each request to be sent to a randomly selected service instance.
* You don’t know how your application framework performs and caches DNS responses.

Use the service instance DNS name or service instance IP when any of the following is true:

* You want to discover the IP addresses of all the service instances.
* You want to skip an intermediate load balancer.
* You use distributed frameworks or databases, such as Ray or Cassandra, that use service instance IP addresses as identities.

#### General guidelines related to service-to-service communications

* Traffic for service-to-service communications is sent over the virtual interface `eth0`.
* If your server is listening on the port `0`, when a process binds to the port `0` in Linux, a port is chosen randomly from the
  ephemeral port range defined with the `sysctl` parameter `net.ipv4.ip_local_port_range`. Currently, this parameter is not configurable and is equal to `32768 60999`.
* The IP address of a service instance is the IP address of the virtual interface `eth0`. Use the following methods to get this IP address:

  + From the output of `ipconfig`:

    ```output
    eth0Ip=$(ifconfig eth0 | sed -En -e 's/.*inet ([0-9.]+).*/\1/p')
    ```
  + Use the following Python code to get the service IP, service instance IP, and list of all service instance IPs:

    ```python
    import os
    import socket

    service_name = os.environ['SNOWFLAKE_SERVICE_NAME']
    service_dns_name = service_name.lower().replace("_","-")

    service_ip = socket.gethostbyname(service_dns_name)
    instance_ip = socket.gethostbyname(socket.gethostname())
    fqdn, _, instance_ips = socket.gethostbyname_ex(
        "instances." + service_dns_name)
    print(f"""
      service name: {service_name}
      service dns name: {service_dns_name}
      service fqdn: {fqdn}
      service ip: {service_ip}
      instance ip: {instance_ip}
      instances ips: {instance_ips}
    """)
    ```

## Manage types of services allowed in your account

Snowflake supports different types of services (workload types) that you can create in your account. These types include user-deployed workloads, such as services and jobs, and first-party workloads that are managed by Snowflake, such as notebooks, model serving, and ML jobs. For a list of workload types, see [ALLOWED_SPCS_WORKLOAD_TYPES](../../sql-reference/parameters.md).

When you list services in your account using [SHOW SERVICES](../../sql-reference/sql/show-services.md), you can include a filter to list only specific workload types. For example, show user-deployed services only:

```sqlexample
SHOW SERVICES OF TYPE USER;
```

You can restrict the types of workloads that are allowed in your Snowflake account by using the account-level parameters ALLOWED_SPCS_WORKLOAD_TYPES and DISALLOWED_SPCS_WORKLOAD_TYPES. For example, to allow only NOTEBOOK workloads, run the following statement:

```sqlexample
ALTER ACCOUNT SET ALLOWED_SPCS_WORKLOAD_TYPES = NOTEBOOK;
```

> **Note:**
>
> * Workload types that are specified in DISALLOWED_SPCS_WORKLOAD_TYPES can’t be deployed. If you configure both ALLOWED_SPCS_WORKLOAD_TYPES and DISALLOWED_SPCS_WORKLOAD_TYPES, the disallowed list takes precedence. For example, if both parameters specify the NOTEBOOK workload type, NOTEBOOK workloads aren’t allowed to run on Snowpark Container Services.
> * Services that are created before you configure these account-level parameters continue to run.
>   However, if you suspend a service whose workload type is disallowed, you can’t restart it.
> * To delete all the previously created services of disallowed types, run the [ALTER COMPUTE POOL … STOP ALL OF TYPE](../../sql-reference/sql/alter-compute-pool.md) command.

## Passing credentials to a container using Snowflake secrets

There are many reasons why you might want to pass Snowflake managed credentials into your container. For example, your service might
communicate with external endpoints (outside Snowflake), in which case you will need to provide credential information in your container
for your application code to use.

To provide credentials, first store them in [Snowflake secret](../../user-guide/api-authentication.md) objects. Then, in the service specification, use `containers.secrets` to specify which secret objects to use and where to place them inside the container. You can either pass these credentials to environment variables in the containers, or make them available in local files in the containers.

### Specifying Snowflake secrets

Specify a Snowflake secret by name or reference (reference is applicable only in the Native Application scenario):

* **Pass Snowflake secret by name:** You can pass a secret name as the `snowflakeSecret` field value.

  ```yaml
  ...
  secrets:
  - snowflakeSecret:
      objectName: '<secret-name>'
    <other info about where in the container to copy the secret>
    ...
  ```

  Note that you can optionally specify `<secret-name>` directly as the `snowflakeSecret` value.
* **Pass Snowflake secret by reference:** When using Snowpark Container Services to create a Native App (an app with containers), the app
  producer and consumers use different Snowflake accounts. In some contexts an installed Snowflake Native App needs to access existing
  secret objects in the consumer account that exist outside the APPLICATION object. In this case, developers can use the “secrets by
  reference” specification syntax to handle credentials as shown:

  ```yaml
  containers:
  - name: main
    image: <url>
    secrets:
    - snowflakeSecret:
        objectReference: '<reference-name>'
      <other info about where in the container to copy the secret>
  ```

  Note that the specification uses `objectReference` instead of `objectName` to provide a secret reference name.

### Specifying secrets placement inside the container

You can tell Snowflake to either place the secrets in the containers as environment variables or write them into local container files.

#### Pass secrets as environment variables

To pass Snowflake secrets to containers as environment variables, include `envVarName` in the `containers.secrets` field.

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: <secret-name>
    secretKeyRef: username | password | secret_string |  'access_token'
    envVarName: '<env-variable-name>'
```

The `secretKeyRef` value depends on the type of Snowflake secret. Possible values are the following:

* `username` or `password` if the Snowflake secret is of the `password` type.
* `secret_string` if the Snowflake secret is of the `generic_string` type.

Note that Snowflake does not update secrets passed as environment variables after a service is created.

##### Example 1: Passing secrets of the *password* type as environment variables

In this example, you create the following Snowflake secret object of the `password` type:

```sqlexample
CREATE SECRET testdb.testschema.my_secret_object
  TYPE = password
  USERNAME = 'snowman'
  PASSWORD = '1234abc';
```

To provide this Snowflake secret object to the environment variables (for example, `LOGIN_USER` and `LOGIN_PASSWORD`)
in your container, add the following `containers.secrets` field in the specification file:

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: testdb.testschema.my_secret_object
    secretKeyRef: username
    envVarName: LOGIN_USER
  - snowflakeSecret: testdb.testschema.my_secret_object
    secretKeyRef: password
    envVarName: LOGIN_PASSWORD
```

In this example, the `snowflakeSecret` value is a fully qualified object name because secrets can be stored in a different schema than the service that is being created.

The `containers.secrets` field in this example is a list of two `snowflakeSecret` objects:

* The first object maps `username` in the Snowflake secret object to the `LOGIN_USER` environment variable in your
  container.
* The second object maps the `password` in the Snowflake secret object to the `LOGIN_PASSWORD` environment variable
  in your container.

##### Example 2: Passing secrets of the *generic_string* type as environment variables

In this example, you create the following Snowflake secret object of the `generic_string` type:

```sqlexample
CREATE SECRET testdb.testschema.my_secret
  TYPE=generic_string
  SECRET_STRING='
       some_magic: config
  ';
```

To provide this Snowflake secret object to environment variables (for example, GENERIC_SECRET) in your container, you add the
following `containers.secrets` field in the specification file:

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: testdb.testschema.my_secret
    secretKeyRef: secret_string
    envVarName: GENERIC_SECRET
```

#### Write secrets in local container files

To make Snowflake secrets available to your application container in local container files, include a `containers.secrets`
field:
To make Snowflake secrets available to your application container in local container files, include `directoryPath` in the `containers.secrets`:

```yaml
containers:
- name: <name>
  image: <url>
  ...
  secrets:
  - snowflakeSecret: <snowflake-secret-name>
    directoryPath: '<local directory path in the container>'
```

Snowflake populates necessary files for the secret in this specified `directoryPath`; specifying the `secretKeyRef` is not necessary. Depending on the secret type, Snowflake creates the following files in the container under the directory path you provided:

* `username` and `password` if the Snowflake secret is of the `password` type.
* `secret_string` if the Snowflake secret is of the `generic_string` type.
* `access_token` if the Snowflake secret is of the `oauth2` type.

> **Note:**
>
> After a service is created, if the Snowflake secret object is updated, Snowflake will update the corresponding secret
> files in the running containers.

##### Example 1: Passing secrets of the *password* type in local container files

In this example, you create the following Snowflake secret object of the `password` type:

```sqlexample
CREATE SECRET testdb.testschema.my_secret_object
  TYPE = password
  USERNAME = 'snowman'
  PASSWORD = '1234abc';
```

To make these credentials available in local container files, add the following `containers.secrets` field in the
specification file:

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: testdb.testschema.my_secret_object
    directoryPath: '/usr/local/creds'
```

When you start your service, Snowflake creates two files inside the container: `/usr/local/creds/username` and
`/usr/local/creds/password`. Your application code can then read these files.

##### Example 2: Passing secrets of the *generic_string* type in local container files

In this example, you create the following Snowflake secret object of the `generic_string` type:

```sqlexample
CREATE SECRET testdb.testschema.my_secret
  TYPE=generic_string
  SECRET_STRING='
       some_magic: config
  ';
```

To provide this Snowflake secret object in local container files, you add the
following `containers.secrets` field in the specification file:

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: testdb.testschema.my_secret
    directoryPath: '/usr/local/creds'
```

When you start your service, Snowflake creates this file inside the containers: `/usr/local/creds/secret_string`.

##### Example 3: Passing secrets of the *oauth2* type in local container files

In this example, you create the following Snowflake secret object of the `oauth2` type:

```sqlexample
CREATE SECRET testdb.testschema.oauth_secret
  TYPE = OAUTH2
  OAUTH_REFRESH_TOKEN = '34n;vods4nQsdg09wee4qnfvadH'
  OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '2023-12-31 20:00:00'
  API_AUTHENTICATION = my_integration;
```

To make these credentials available in local container files, add the following `containers.secrets` field in the
specification file:

```yaml
containers:
- name: main
  image: <url>
  secrets:
  - snowflakeSecret: testdb.testschema.oauth_secret
    directoryPath: '/usr/local/creds'
```

Snowflake fetches the access token from the OAuth secret object and creates `/usr/local/creds/access_token` in the
containers.

When a service uses secrets of the oauth2 type, the service is expected to use that secret to access an internet
destination. An oauth secret must be allowed by
[External Access Integration (EAI)](../external-network-access/creating-using-external-network-access.md);
otherwise CREATE SERVICE or EXECUTE JOB SERVICE will fail. This extra EAI requirement only applies to secrets of the oauth2 type and
not to other types of secrets.

In summary, the typical steps in creating such a service are:

1. Create a secret of the oauth2 type (shown earlier).
2. Create an EAI to allow use of the secret by a service. For example:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION example_eai
     ALLOWED_NETWORK_RULES = (<name>)
     ALLOWED_AUTHENTICATION_SECRETS = (testdb.testschema.oauth_secret)
     ENABLED = true;
   ```
3. Create a service that includes a `containers.secrets` field in the specification. That also specifies the optional
   EXTERNAL_ACCESS_INTEGRATIONS property to include an EAI to allow use of the oauth2 secret.

   An example CREATE SERVICE (with inline specification) command:

   ```sqlexample
   CREATE SERVICE eai_service
     IN COMPUTE POOL MYPOOL
     EXTERNAL_ACCESS_INTEGRATIONS = (example_eai)
     FROM SPECIFICATION
     $$
     spec:
       containers:
         - name: main
           image: <url>
           secrets:
           - snowflakeSecret: testdb.testschema.oauth_secret
             directoryPath: '/usr/local/creds'
       endpoints:
         - name: api
           port: 8080
     $$;
   ```

For more information about egress, see [Configure service egress](service-network-communications.md).

## Guidelines and limitations

For more information, see [Snowpark Container Services: Guidelines and limitations](spcs-guidelines-and-limitations.md).

---
title: Tutorial 1: Create a Snowpark Container Services Service
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/tutorial-1.md
section: Snowpark Container Services
---

App Development

# Tutorial 1: Create a Snowpark Container Services Service

## Introduction

After completing the [common setup](common-setup.md), you are ready to create a service. In this tutorial, you create a
service (named `echo_service`) that simply echoes back text that you provide as input. For example, if the input string is
“Hello World,” the service returns “I said, Hello World.”

There are two parts to this tutorial:

**Part 1: Create and test a service.** You download code provided for this tutorial and follow step-by-step instructions:

1. Download the service code for this tutorial.
2. Build a Docker image for Snowpark Container Services, and upload the image to a repository in your account.
3. Create a service by providing the service specification file and the compute pool in which to run the service.
4. Create a service function to communicate with the service.
5. Use the service. You send echo requests to the service and verify the response.

**Part 2: Understand the service**. This section provides an overview of the service code and highlights how different
components collaborate.

## 1: Download the service code

Code (a Python application) is provided to create the Echo service.

1. Download [`SnowparkContainerServices-Tutorials.zip`](../../../_downloads/c3a8f6109048f2ecca7734c7fd3b0b3b/SnowparkContainerServices-Tutorials.zip).
2. Unzip the content, which includes one directory for each tutorial. The `Tutorial-1` directory has the following files:

   * `Dockerfile`
   * `echo_service.py`
   * `templates/basic_ui.html`

## 2: Build an image and upload

Build an image for the linux/amd64 platform that Snowpark Container Services supports, and then upload the image to the image
repository in your account (see [Common Setup](common-setup.md)).

You will need information about the repository (the repository URL and the registry hostname) before you can build and upload the image. For more information, see
[Registry and Repositories](../working-with-registry-repository.md).

**Get information about the repository**

1. To get the repository URL, execute the [SHOW IMAGE REPOSITORIES](../../../sql-reference/sql/show-image-repositories.md) SQL command.

   ```bash
   SHOW IMAGE REPOSITORIES;
   ```

   * The `repository_url` column in the output provides the URL. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
     ```
   * The host name in the repository URL is the registry host name. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com
     ```

**Build image and upload it to the repository**

1. Open a terminal window, and change to the directory containing the files you unzipped.
2. To build a Docker image, execute the following `docker build` command using the Docker CLI.
   Note the command specifies current working directory (`.`)
   as the `PATH` for files to use for building the image.

   ```bash
   docker build --rm --platform linux/amd64 -t <repository_url>/<image_name> .
   ```

   * For `image_name`, use `my_echo_service_image:latest`.

   **Example**

   ```bash
   docker build --rm --platform linux/amd64 -t myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest .
   ```
3. Upload the image to the repository in your Snowflake account.

   1. For Docker to upload an image on your behalf to your repository,
      first [authenticate Docker with the registry](../working-with-registry-repository.md).

      1. We recommend using [Snowflake CLI](../../snowflake-cli/index.md)
         to authenticate your local Docker instance with the image
         registry for your Snowflake account. Make sure that you configured Snowflake CLI to connect to Snowflake. For more information,
         see [Configuring Snowflake CLI and connecting to Snowflake](../../snowflake-cli/connecting/connect.md).
      2. To authenticate, execute the following Snowflake CLI command:

         ```snowcli
         snow spcs image-registry login
         ```
   2. To upload the image, execute the following command:

      ```bash
      docker push <repository_url>/<image_name>
      ```

      **Example**

      ```bash
      docker push myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
      ```

## 3: Create a service

In this section you create a service.

To create a service, you need the following items:

* A [compute pool](../working-with-compute-pool.md). Snowflake runs your service in the specified compute pool. You created a compute pool named `tutorial_compute_pool` as part of the common setup.
* A [service specification](../specification-reference.md). This specification provides Snowflake with the information
  needed to configure and run your service. For more information, see [Snowpark Container Services: Working with services](../working-with-services.md). In this tutorial, you provide the specification inline, in CREATE SERVICE command. You can also save the specification to a file in your Snowflake stage and provide file information in the CREATE SERVICE command as shown in Tutorial 2.

The following sections explain how to create a service by using SQL or Snowsight. Use one of the methods to create a service.

> **Note:**
>
> You can create a service by using SQL or Snowsight. However, you can’t perform the preceding step, by uploading an image to image repository, cannot be done using Snowsight.

### Create a service by using SQL

1. Verify that the compute pool is ready and that you are in the right context to create the service.

   1. Previously you set the context in the [Common Setup](common-setup.md) step. To ensure you are in the right context for the SQL statements in this step, execute the following:
   > ```sqlexample
   > USE ROLE test_role;
   > USE DATABASE tutorial_db;
   > USE SCHEMA data_schema;
   > USE WAREHOUSE tutorial_warehouse;
   > ```

   1. To ensure the compute pool you created in the [common setup](common-setup.md) is ready, execute `DESCRIBE COMPUTE POOL`, and verify that the `state` is `ACTIVE` or `IDLE`. If the `state` is `STARTING`, you need to wait until the `state` changes to either `ACTIVE` or `IDLE`.
   > ```sqlexample
   > DESCRIBE COMPUTE POOL tutorial_compute_pool;
   > ```
2. To create the service, execute the following command using `test_role`:

   ```sqlexample
   CREATE SERVICE echo_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
       spec:
         containers:
         - name: echo
           image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
           env:
             SERVER_PORT: 8000
             CHARACTER_NAME: Bob
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: 8000
           public: true
         $$
      MIN_INSTANCES=1
      MAX_INSTANCES=1;
   ```

   > **Note:**
   >
   > If a service with that name already exists, use the DROP SERVICE command to delete the previously created service, and then
   > create this service.
3. Execute the following SQL commands to get detailed information about the service you just created. For more information, see
   [Snowpark Container Services: Working with services](../working-with-services.md).

   * To list services in your account, execute the SHOW SERVICES command:

     ```sqlexample
     SHOW SERVICES;
     ```
   * To get information about your service including the service status, execute the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) command.

     ```sqlexample
     DESC SERVICE echo_service;
     ```

     Verify the `status` column shows the service status as RUNNING; if the status is PENDING, it indicates the service is still starting. To investigate why the service is not RUNNING, execute the [SHOW SERVICE CONTAINERS IN SERVICE](../../../sql-reference/sql/show-service-containers-in-service.md) command and review the `status` of individual containers:

     ```sqlexample
     SHOW SERVICE CONTAINERS IN SERVICE echo_service;
     ```

### Create a service by using Snowsight

[Sign in to Snowflake](../../../user-guide/connecting.md) and complete the following steps to create a service:

1. In the navigation menu, select Monitoring » Services & jobs.
2. Under Create new, select Create new service.
3. On the Create service page, provide the following information:

   * Service details:

     + In the top-right corner, from the role selector menu, select `test_role`.
     + Specify the Service details:

       - Service name: `echo_service`
       - Compute pool: `tutorial_compute_pool`
       - Location to store: `tutorial_db.data_schema`
       - Select Next.
   * Service specification: You can get this information from the service specification in the preceding section.

     + On the Containers tab, specify the following information:

       - Name: `echo`
       - Image: `TUTORIAL_REPOSITORY.my_echo_service_image:latest`
     + On the Endpoints tab, specify the following information:

       - Name: `echoendpoint`
       - Network port: `8080`
       - Protocol: `HTTP`
       - Select the Allow public access checkbox.
     + Select Next.
4. To create the service, select Create.

   Snowflake creates the service and displays the service details.

> **Note:**
>
> The SQL version of creating the service in the preceding section includes an environment variable (`SERVER_PORT`) that changes the network port the code listens on to 8000. The Snowsight version of the tutorial doesn’t provide this override. Therefore, we specify the network port as 8080, which is the default in the code.
>
> > To use Snowsight and use some of the service specification’s advanced features, you can provide a YAML version of the service specification in the UI.

## 4: Use the service

First, setup the context for the SQL statements in this section, execute the following:

```sqlexample
USE ROLE test_role;
USE DATABASE tutorial_db;
USE SCHEMA data_schema;
USE WAREHOUSE tutorial_warehouse;
```

Now you can communicate with the Echo service. As explained in [Using a service](../working-with-services.md), you can use the service in the following ways:

* From a SQL query by using a service function
* From outside Snowflake, which is called *network ingress*

In the following sections you use these methods to communicate with the Echo service.

### Communicate with your service by using a service function

A service function is one of the methods available to communicate with your service. A service function is a user-defined
function (UDF) that you associate with the service endpoint. When the service function is executed, it sends a request to the
service endpoint, and hen receives a response.

To create a service function, execute the following command:

```sqlexample
CREATE FUNCTION my_echo_udf (InputText varchar)
  RETURNS varchar
  SERVICE=echo_service
  ENDPOINT=echoendpoint
  AS '/echo';
```

In the query:

* The SERVICE property associates the UDF with the `echo_service` service.
* The ENDPOINT property associates the UDF with the `echoendpoint` endpoint within the service.
* AS ‘/echo’ specifies the HTTP path to the Echo server. You can find this path in the service code: `echo_service.py`.

You can invoke the service function in a query.
The example service function, `my_echo_udf`, can take either a single string or a list of strings as input.

#### Example 1.1: Pass a single string

* To call the `my_echo_udf` service function, execute the following
  SELECT statement, passing one input string (`'hello'`):

  ```sqlexample
  SELECT my_echo_udf('hello!');
  ```

  Snowflake sends a POST request to the service endpoint (`echoendpoint`). Upon receiving the request, the service echos the input string in the response:

  ```output
  +--------------------------+
  | **MY_ECHO_UDF('HELLO!')**|
  |------------------------- |
  | Bob said hello!          |
  +--------------------------+
  ```

If service is created by using Snowsight, the following response is returned:

> ```output
> +--------------------------+
> | **MY_ECHO_UDF('HELLO!')**|
> |------------------------- |
> | I said hello!            |
> +--------------------------+
> ```

#### Example 1.2: Pass a list of strings

When you pass a list of strings to the service function, Snowflake batches these input strings, and then sends a series of POST
requests to the service. After the service processes all the strings, Snowflake combines the results, and then returns them.

The following example passes a table column as input to the service function.

1. Create a table with multiple strings:

   ```sqlexample
   CREATE TABLE messages (message_text VARCHAR)
     AS (SELECT * FROM (VALUES ('Thank you'), ('Hello'), ('Hello World')));
   ```
2. Verify that the table was created:

   ```sqlexample
   SELECT * FROM messages;
   ```
3. To call the service function, execute the following SELECT statement, passing table rows as input:

   ```sqlexample
   SELECT my_echo_udf(message_text) FROM messages;
   ```

   Output:

   ```output
   +---------------------------+
   | MY_ECHO_UDF(MESSAGE_TEXT) |
   |---------------------------|
   | Bob said Thank you        |
   | Bob said Hello            |
   | Bob said Hello World      |
   +---------------------------+
   ```

### Communicate with your service by using a web browser

The service exposes the endpoint publicly. For details, see the inline specification provided in the CREATE SERVICE command. Therefore, you can sign in to a web UI that the service exposes to the internet, and then send requests to the service from a web browser.

1. Find the URL of the public endpoint that the service exposes:

   ```sqlexample
   SHOW ENDPOINTS IN SERVICE echo_service;
   ```

   The `ingress_url` column in the response provides the URL.

   **Example**

   ```output
   p6bye-myorg-myacct.snowflakecomputing.app
   ```
2. Append `/ui` to the endpoint URL, and then paste it in the web browser. This causes the service to execute the `ui()` function (see `echo_service.py`).

   The first time you access the endpoint URL, you are asked to sign in to Snowflake. For this test, use the same user that you used to create the service to ensure that the user has the necessary privileges.
3. In the Input box, enter string “Hello”, and then press **Return** (**Enter**).

> **Note:**
>
> You can access the public endpoint programmatically. For more information, see [Tutorial 8: Access the public endpoint programmatically](advanced/tutorial-8-access-public-endpoint-programmatically.md).

## 5: Clean up

If you don’t plan to continue with [Tutorial 2](tutorial-2.md) or [Tutorial 4](advanced/tutorial-4.md), you should remove billable
resources you created. For more information, see Step 5 in [Tutorial 4](advanced/tutorial-4.md).

## 6: Reviewing the service code

This section covers the following topics:

* Examining the tutorial 1 code: Review the code files that implement the Echo service.
* Understanding the service function: This section explains how the service function in this tutorial is linked
  with the service.
* Building and testing an image locally. The section provides an explanation of how you can locally test the
  Docker image before uploading it to a repository in your Snowflake account.

### Examining the tutorial 1 code

The zip file you downloaded in Step 1 includes the following files:

* `Dockerfile`
* `echo_service.py`
* `templates/basic_ui.html`

You also use service specification when creating the service. The following section explains how these code components work together to create the service.

#### echo_service.py file

This Python file contains the code that implements a minimal HTTP server that returns (echoes back) input text. The code
primarily performs two tasks: handling echo requests from Snowflake service functions, and providing a web user interface (UI)
for submitting echo requests.

```python
from flask import Flask
from flask import request
from flask import make_response
from flask import render_template
import logging
import os
import sys

SERVICE_HOST = os.getenv('SERVER_HOST', '0.0.0.0')
SERVER_PORT = os.getenv('SERVER_PORT', 8080)
CHARACTER_NAME = os.getenv('CHARACTER_NAME', 'I')

def get_logger(logger_name):
  logger = logging.getLogger(logger_name)
  logger.setLevel(logging.DEBUG)
  handler = logging.StreamHandler(sys.stdout)
  handler.setLevel(logging.DEBUG)
  handler.setFormatter(
    logging.Formatter(
      '%(name)s [%(asctime)s] [%(levelname)s] %(message)s'))
  logger.addHandler(handler)
  return logger

logger = get_logger('echo-service')

app = Flask(__name__)

@app.get("/healthcheck")
def readiness_probe():
  return "I'm ready!"

@app.post("/echo")
def echo():
  '''
  Main handler for input data sent by Snowflake.
  '''
  message = request.json
  logger.debug(f'Received request: {message}')

  if message is None or not message['data']:
    logger.info('Received empty message')
    return {}

  # input format:
  #   {"data": [
  #     [row_index, column_1_value, column_2_value, ...],
  #     ...
  #   ]}
  input_rows = message['data']
  logger.info(f'Received {len(input_rows)} rows')

  # output format:
  #   {"data": [
  #     [row_index, column_1_value, column_2_value, ...}],
  #     ...
  #   ]}
  output_rows = [[row[0], get_echo_response(row[1])] for row in input_rows]
  logger.info(f'Produced {len(output_rows)} rows')

  response = make_response({"data": output_rows})
  response.headers['Content-type'] = 'application/json'
  logger.debug(f'Sending response: {response.json}')
  return response

@app.route("/ui", methods=["GET", "POST"])
def ui():
  '''
  Main handler for providing a web UI.
  '''
  if request.method == "POST":
    # getting input in HTML form
    input_text = request.form.get("input")
    # display input and output
    return render_template("basic_ui.html",
      echo_input=input_text,
      echo_reponse=get_echo_response(input_text))
  return render_template("basic_ui.html")

def get_echo_response(input):
  return f'{CHARACTER_NAME} said {input}'

if __name__ == '__main__':
  app.run(host=SERVICE_HOST, port=SERVER_PORT)
```

In the code:

* The `echo` function enables a Snowflake service function to communicate with the service. This function specifies the
  `@app.post()` decoration as shown:

  ```python
  @app.post("/echo")
  def echo():
  ```

  When the echo server receives your HTTP POST request with the `/echo` path, the server routes the request to this
  function. The function executes and echoes back the strings from the request body in the response.

  To support communication from a Snowflake service function, this server implements the external functions. That is, the
  server implementation follows a certain input/output data format in order to serve a SQL function, and this is the same
  [input/output data format](../../../sql-reference/external-functions-data-format.md) used by
  [External Functions](../../../sql-reference/external-functions.md).
* The `ui` function section of the code displays a web form and handles echo requests submitted from the web form. This
  function uses the `@app.route()` decorator to specify that requests for `/ui` are handled by this function:

  ```python
  @app.route("/ui", methods=["GET", "POST"])
  def ui():
  ```

  The Echo service exposes the `echoendpoint` endpoint publicly (see service specification), enabling communication with
  the service over the web. When you load the URL of the public endpoint with /ui appended in your browser, the browser sends
  an HTTP GET request for this path, and the server routes the request to this function. The function executes and returns a
  simple HTML form for the user to enter a string in.

  After the user enters a string and submits the form, the browser sends an HTTP post request for this path, and the server
  routes the request to this same function. The function executes and returns an HTTP response containing the original string.
* The `readiness_probe` function uses the `@app.get()` decorator to specify that requests for `/healthcheck`
  are handled by this function:

  ```python
  @app.get("/healthcheck")
  def readiness_probe():
  ```

  This function enables Snowflake to check the readiness of the service. When the container starts, Snowflake wants to confirm
  that the application is working and that the service is ready to serve the requests. Snowflake sends an HTTP GET request with
  this path (as a health probe, readiness probe) to ensure that only healthy containers serve traffic. The function can do
  whatever you want.
* The `get_logger` function helps set up logging.

#### Dockerfile

This file contains all the commands to build an image using Docker.

```bash
ARG BASE_IMAGE=python:3.10-slim-buster
FROM $BASE_IMAGE
COPY echo_service.py ./
COPY templates/ ./templates/
RUN pip install --upgrade pip && \\
pip install flask
CMD ["python", "echo_service.py"]
```

The Dockerfile contains instructions to install the Flask library in the Docker container. The code in `echo_service.py`
relies on the Flask library to handle HTTP requests.

#### /template/basic_ui.html

The Echo service exposes the `echoendpoint` endpoint publicly (see service specification), enabling communication with the
service over the web. When you load the public endpoint URL with `/ui` appended in your browser, the Echo service displays
this form. You can enter a string in the form and submit the form, and the service returns the string in an HTTP response.

```html
<!DOCTYPE html>
<html lang="en">
  <head>
    <title>Welcome to echo service!</title>
  </head>
  <body>
    <h1>Welcome to echo service!</h1>
    <form action="{{ url_for("ui") }}" method="post">
      <label for="input">Input:<label><br>
      <input type="text" id="input" name="input"><br>
    </form>
    <h2>Input:</h2>
    {{ echo_input }}
    <h2>Output:</h2>
    {{ echo_reponse }}
  </body>
</html>
```

#### Service specification

Snowflake uses information you provide in this specification to configure and run your service.

```yaml
spec:
  containers:
  - name: echo
    image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
    env:
      SERVER_PORT: 8000
      CHARACTER_NAME: Bob
    readinessProbe:
      port: 8000
      path: /healthcheck
  endpoints:
  - name: echoendpoint
    port: 8000
    public: true
```

In the service specification:

* The `containers.image` specifies the image for Snowflake to start a container.
* The optional `endpoints` field specifies the endpoint the service exposes.

  + The `name` specifies a user-friendly name for the TCP network port the container is listening on. You use this
    user-friendly endpoint name to send requests to the corresponding port. Note that the `env.SERVER_PORT` controls this
    port number.
  + The endpoint is also configured as `public`. This allows traffic to this endpoint from the public web.
* The optional `containers.env` field is added to illustrate how you might override environment variables that Snowflake
  passes to all processes in your container. For example, the service code (`echo_service.py`) reads the environment
  variables with default values as shown:

  ```python
  CHARACTER_NAME = os.getenv('CHARACTER_NAME', 'I')
  SERVER_PORT = os.getenv('SERVER_PORT', 8080)
  ```

  It works as follows:

  + When the Echo service receives an HTTP POST request with a string (e.g., “Hello”) in the request body, the service returns
    “I said Hello” by default. The code uses the `CHARACTER_NAME` environment variable to determine the word before
    “said.” By default, `CHARACTER_NAME` is set to “I.”

    You can overwrite the CHARACTER_NAME default value in the service specification. For example, if you set the value to “Bob,”
    the Echo service returns a “Bob said Hello” response.
  + Similarly, the service specification overrides the port (SERVER_PORT) that the service listens on to 8000, overriding the
    default port 8080.
* The `readinessProbe` field identifies the `port` and `path` that Snowflake can use to send an HTTP GET
  request to the readiness probe to verify that the service is ready to handle traffic.

  The service code (`echo_python.py`) implements the readiness probe as follows:

  ```python
  @app.get("/healthcheck")
  def readiness_probe():
  ```

  Therefore, the specification file includes the `container.readinessProbe` field accordingly.

For more information about service specifications, see [Service specification reference](../specification-reference.md).

### Understanding the service function

A service function is one of the methods of communicating with your service (see
[Using a service](../working-with-services.md)). A service function is a user-defined function (UDF) that you associate
with a service endpoint. When the service function is executed, it sends a request to the associated service endpoint and
receives a response.

You create the following service function by executing the CREATE FUNCTION command with the following parameters:

```sqlexample
CREATE FUNCTION my_echo_udf (InputText VARCHAR)
  RETURNS VARCHAR
  SERVICE=echo_service
  ENDPOINT=echoendpoint
  AS '/echo';
```

Note the following:

* The `my_echo_udf` function takes a string as input and returns a string.
* The SERVICE property identifies the service (`echo_service`), and the ENDPOINT property identifies the user-friendly
  endpoint name (`echoendpoint`).
* The AS ‘/echo’ specifies the path for the service. In `echo_service.py`, the `@app.post` decorator associates this
  path with the `echo` function.

This function connects with the specific ENDPOINT of the specified SERVICE. When you invoke this function, Snowflake sends a
request to the `/echo` path inside the service container.

### Building and testing an image locally

You can test the Docker image locally before uploading it to a repository in your Snowflake account. In local testing, your
container runs standalone (it is not a service that Snowflake runs).

To test the Tutorial 1 Docker image:

1. To create a Docker image, in the Docker CLI, execute the following command:

   ```bash
   docker build --rm -t my_service:local .
   ```
2. To launch your code, execute the following command:

   ```bash
   docker run --rm -p 8080:8080 my_service:local
   ```
3. Send an echo request to the service using one of the following methods:

   * **Using the cURL command:**

     In another terminal window, using cURL, send the following POST request to port 8080:

     ```bash
     curl -X POST http://localhost:8080/echo \
       -H "Content-Type: application/json" \
       -d '{"data":[[0, "Hello friend"], [1, "Hello World"]]}'
     ```

     Note that the request body includes two strings. This cURL command sends a POST request to port 8080 on which the service
     is listening. The 0 in the data is the index of the input string in the list. The Echo service echoes the input strings
     in response as shown:

     ```output
     {"data":[[0,"I said Hello Friend"],[1,"I said Hello World"]]}
     ```
   * **Using a web browser:**

     1. In your browser, on the same computer, open `http://localhost:8080/ui`.

        This sends a GET request to port 8080, which the service is listening on. The service executes the `ui()`
        function, which renders a HTML form as shown:
     2. Enter the string “Hello” in the **Input** box, and press **Return**.

## What’s next?

You can now test the [Tutorial 2](tutorial-2.md) that executes a job.

---
title: Tutorial 2: Create a Snowpark Container Services Job Service
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/tutorial-2.md
section: Snowpark Container Services
---

App Development

# Tutorial 2: Create a Snowpark Container Services Job Service

## Introduction

After completing the [Tutorial Common Setup](common-setup.md), you are ready to create a job service. In this tutorial, you create a
simple job service that connects to Snowflake, executes a SQL SELECT query, and saves the result to a table.

There are two parts to this tutorial:

**Part 1: Create and test a job service.** You download code provided for this tutorial and follow step-by-step instructions:

1. Download the job service code for this tutorial.
2. Build a Docker image for Snowpark Container Services, and upload the image to a repository in your account.
3. Stage the service specification file, which gives Snowflake the container configuration information. In addition to the name of
   the image to use to start a container, the specification file specifies three arguments: a SELECT query, a virtual warehouse to execute the query, and the name of the table to save the result to.
4. Execute the job service. Using the EXECUTE JOB SERVICE command, you can execute the job service by providing the specification file and the
   compute pool where Snowflake can run the container. And finally, verify the service results.

**Part 2: Understand the job service code**. This section provides an overview of the job service code and highlights how different
components collaborate.

## 1: Download the job service code

Code (a Python application) is provided to implement a job service.

1. Download [`SnowparkContainerServices-Tutorials.zip`](../../../_downloads/c3a8f6109048f2ecca7734c7fd3b0b3b/SnowparkContainerServices-Tutorials.zip).
2. Unzip the content, which includes one directory for each tutorial. The `Tutorial-2` directory has the following files:

   * `main.py`
   * `Dockerfile`
   * `my_job_spec.yaml`

## 2: Build and upload an image

Build an image for the linux/amd64 platform that Snowpark Container Services supports, and then upload the image to the image
repository in your account (see [Common Setup](common-setup.md)).

You will need information about the repository (the repository URL and the registry hostname) before you can build and upload the image. For more information, see
[Registry and Repositories](../working-with-registry-repository.md).

**Get information about the repository**

1. To get the repository URL, execute the [SHOW IMAGE REPOSITORIES](../../../sql-reference/sql/show-image-repositories.md) SQL command.

   ```bash
   SHOW IMAGE REPOSITORIES;
   ```

   * The `repository_url` column in the output provides the URL. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
     ```
   * The host name in the repository URL is registry host name. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com
     ```

**Build image and upload it to the repository**

1. Open a terminal window, and change to the directory containing the files you unzipped.
2. To build a Docker image, execute the following `docker build` command using the Docker CLI.
   Note the command specifies current working directory (.)
   as the `PATH` for files to use for building the image.

   ```bash
   docker build --rm --platform linux/amd64 -t <repository_url>/<image_name> .
   ```

   * For `image_name`, use `my_job_image:latest`.

   **Example**

   ```bash
   docker build --rm --platform linux/amd64 -t myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_job_image:latest .
   ```
3. Upload the image to the repository in your Snowflake account. In order for Docker to upload an image on your behalf to your repository,
   you must first [authenticate Docker with the registry](../working-with-registry-repository.md).

   1. For Docker to upload an image on your behalf to your repository,
      first [authenticate Docker with the registry](../working-with-registry-repository.md).

      1. We recommend using [Snowflake CLI](../../snowflake-cli/index.md)
         to authenticate your local Docker instance with the image
         registry for your Snowflake account. Make sure that you configured Snowflake CLI to connect to Snowflake. For more information,
         see [Configuring Snowflake CLI and connecting to Snowflake](../../snowflake-cli/connecting/connect.md).
      2. To authenticate, execute the following Snowflake CLI command:

         ```snowcli
         snow spcs image-registry login
         ```
   2. To upload the image, execute the following command:

      ```bash
      docker push <repository_url>/<image_name>
      ```

      **Example**

      ```bash
      docker push myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_job_image:latest
      ```

## 3: Stage the specification file

* To upload your service specification file (`my_job_spec.yaml`) to the stage, use one of the following options:

  + **The Snowsight web interface:** For instructions, see [Choosing an internal stage for local files](../../../user-guide/data-load-local-file-system-create-stage.md).
  + **The SnowSQL CLI:** Execute the following [PUT](../../../sql-reference/sql/put.md) command:

    ```sqlexample
    PUT file://<file-path>[/\]my_job_spec.yaml @tutorial_stage
      AUTO_COMPRESS=FALSE
      OVERWRITE=TRUE;
    ```

    For example:

    - Linux or macOS

      ```sqlexample
      PUT file:///tmp/my_job_spec.yaml @tutorial_stage
        AUTO_COMPRESS=FALSE
        OVERWRITE=TRUE;
      ```
    - Windows

      ```sqlexample
      PUT file://C:\temp\my_job_spec.yaml @tutorial_stage
        AUTO_COMPRESS=FALSE
        OVERWRITE=TRUE;
      ```

    You can also specify a relative path.

    ```sqlexample
    PUT file://./my_job_spec.yaml @tutorial_stage
      AUTO_COMPRESS=FALSE
      OVERWRITE=TRUE;
    ```

    The command sets OVERWRITE=TRUE so that you can upload the file again, if needed (for example, if you fixed an error in
    your specification file). If the PUT command is executed successfully, information about the uploaded file is printed out.

## 4: Execute the job service

Now you are ready to create a job.

1. To start a job service, run the EXECUTE JOB SERVICE command:

   ```sqlexample
   EXECUTE JOB SERVICE
     IN COMPUTE POOL tutorial_compute_pool
     NAME=tutorial_2_job_service
     FROM @tutorial_stage
     SPEC='my_job_spec.yaml';
   ```

   Note the following:

   * FROM and SPEC provide the stage name and the name of the job service specification file. When the job service is executed, it runs the
     SQL statement and saves the result to a table as specified in `my_job_spec.yaml`.

     The SQL statement is not executed within the Docker container. Instead, the running container connects to
     Snowflake and runs the SQL statement in a Snowflake warehouse.
   * COMPUTE_POOL provides the compute resources where Snowflake executes the job service.
   * You can optionally include the QUERY_WAREHOUSE parameter to specify a default warehouse for the container to execute SQL statements. However, the job service code in this tutorial specifies an environment variable to define the warehouse, so the preceding command omits the default.
   * EXECUTE JOB SERVICE returns output that includes the job name, as shown in the following sample output:

     ```output
     +------------------------------------------------------------------------------------+
     |                      status                                                        |
     -------------------------------------------------------------------------------------+
     | Job TUTORIAL_2_JOB_SERVICE completed successfully with status: DONE.               |
     +------------------------------------------------------------------------------------+
     ```
2. The job service runs a simple query and saves result to the results table.
   You can verify the job service successfully completed by querying the results table:

   ```sqlexample
   SELECT * FROM results;
   ```

   Sample output:

   ```output
   +----------+-----------+
   | TIME     | TEXT      |
   |----------+-----------|
   | 10:56:52 | hello     |
   +----------+-----------+
   ```
3. If you want to debug execution of your job service, execute SHOW SERVICE CONTAINERS IN SERVICE to determine if the job service is still running, if it failed to start, or why it failed if it did. Also,
   assuming your code outputs useful logs to standard output or standard error, you can access the logs using SYSTEM$GET_SERVICE_LOGS.

   > 1. To get the job service status, execute [SHOW SERVICE CONTAINERS IN SERVICE](../../../sql-reference/sql/show-service-containers-in-service.md):
   >
   >    ```sqlexample
   >    SHOW SERVICE CONTAINERS IN SERVICE tutorial_2_job_service;
   >    ```
   >
   >    Sample output:
   >
   >    ```output
   >    +---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------+
   >    | database_name | schema_name | service_name           | instance_id | container_name | status | message                | image_name                                                                                                                             | image_digest                                                            | restart_count | start_time |
   >    |---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------|
   >    | TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_2_JOB_SERVICE | 0           | main           | DONE   | Completed successfully | myorg-myacct.registry.snowflakecomputing.com/tutorial_db/tutorial_db/data_schema/tutorial_repository/my_job_image:latest | sha256:aa3fa2e5c1552d16904a5bbc97d400316ebb4a608bb110467410485491d9d8d0 |             0 |            |
   >    +---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------+
   >    ```
   > 2. To get the job service log information, use the system function [SYSTEM$GET_SERVICE_LOGS](../../../sql-reference/functions/system_get_service_logs.md):
   >
   >    ```sqlexample
   >    SELECT SYSTEM$GET_SERVICE_LOGS('tutorial_2_job_service', 0, 'main')
   >    ```
   >
   >    ```output
   >    job-tutorial - INFO - Job started
   >    job-tutorial - INFO - Connection succeeded. Current session context: database="TUTORIAL_DB", schema="DATA_SCHEMA", warehouse="TUTORIAL_WAREHOUSE", role="TEST_ROLE"
   >    job-tutorial - INFO - Executing query [select current_time() as time,'hello'] and writing result to table [results]
   >    job-tutorial - INFO - Job finished
   >    ```

## 5: Clean up

If you do not plan to continue with [Tutorial 4](advanced/tutorial-4.md), you should remove billable resources you created. For more
information, see Step 5 in [Tutorial 4](advanced/tutorial-4.md).

## 6: Reviewing the job service code

This section covers the following topics:

* Examining the files provided: Review various code files that implement the job service.
* Building and testing an image locally. The section provides an explanation of how you can locally test the
  Docker image before uploading it to a repository in your Snowflake account.

### Examining the files provided

The zip file you downloaded at the beginning of the tutorial includes the following files:

* `main.py`
* `Dockerfile`
* `my_job_spec.yaml`

This section provides an overview of the code.

#### main.py file

```python
#!/opt/conda/bin/python3

import argparse
import logging
import os
import sys

from snowflake.snowpark import Session
from snowflake.snowpark.exceptions import *

# Environment variables below will be automatically populated by Snowflake.
SNOWFLAKE_ACCOUNT = os.getenv("SNOWFLAKE_ACCOUNT")
SNOWFLAKE_HOST = os.getenv("SNOWFLAKE_HOST")
SNOWFLAKE_DATABASE = os.getenv("SNOWFLAKE_DATABASE")
SNOWFLAKE_SCHEMA = os.getenv("SNOWFLAKE_SCHEMA")

# Custom environment variables
SNOWFLAKE_USER = os.getenv("SNOWFLAKE_USER")
SNOWFLAKE_PASSWORD = os.getenv("SNOWFLAKE_PASSWORD")
SNOWFLAKE_ROLE = os.getenv("SNOWFLAKE_ROLE")
SNOWFLAKE_WAREHOUSE = os.getenv("SNOWFLAKE_WAREHOUSE")

def get_arg_parser():
  """
  Input argument list.
  """
  parser = argparse.ArgumentParser()
  parser.add_argument("--query", required=True, help="query text to execute")
  parser.add_argument(
    "--result_table",
    required=True,
    help="name of the table to store result of query specified by flag --query")

  return parser

def get_logger():
  """
  Get a logger for local logging.
  """
  logger = logging.getLogger("job-tutorial")
  logger.setLevel(logging.DEBUG)
  handler = logging.StreamHandler(sys.stdout)
  handler.setLevel(logging.DEBUG)
  formatter = logging.Formatter("%(name)s - %(levelname)s - %(message)s")
  handler.setFormatter(formatter)
  logger.addHandler(handler)
  return logger

def get_login_token():
  """
  Read the login token supplied automatically by Snowflake. These tokens
  are short lived and should always be read right before creating any new connection.
  """
  with open("/snowflake/session/token", "r") as f:
    return f.read()

def get_connection_params():
  """
  Construct Snowflake connection params from environment variables.
  """
  if os.path.exists("/snowflake/session/token"):
    return {
      "account": SNOWFLAKE_ACCOUNT,
      "host": SNOWFLAKE_HOST,
      "authenticator": "oauth",
      "token": get_login_token(),
      "warehouse": SNOWFLAKE_WAREHOUSE,
      "database": SNOWFLAKE_DATABASE,
      "schema": SNOWFLAKE_SCHEMA
    }
  else:
    return {
      "account": SNOWFLAKE_ACCOUNT,
      "host": SNOWFLAKE_HOST,
      "user": SNOWFLAKE_USER,
      "password": SNOWFLAKE_PASSWORD,
      "role": SNOWFLAKE_ROLE,
      "warehouse": SNOWFLAKE_WAREHOUSE,
      "database": SNOWFLAKE_DATABASE,
      "schema": SNOWFLAKE_SCHEMA
    }

def run_job():
  """
  Main body of this job.
  """
  logger = get_logger()
  logger.info("Job started")

  # Parse input arguments
  args = get_arg_parser().parse_args()
  query = args.query
  result_table = args.result_table

  # Start a Snowflake session, run the query and write results to specified table
  with Session.builder.configs(get_connection_params()).create() as session:
    # Print out current session context information.
    database = session.get_current_database()
    schema = session.get_current_schema()
    warehouse = session.get_current_warehouse()
    role = session.get_current_role()
    logger.info(
      f"Connection succeeded. Current session context: database={database}, schema={schema}, warehouse={warehouse}, role={role}"
    )

    # Execute query and persist results in a table.
    logger.info(
      f"Executing query [{query}] and writing result to table [{result_table}]"
    )
    res = session.sql(query)
    # If the table already exists, the query result must match the table scheme.
    # If the table does not exist, this will create a new table.
    res.write.mode("append").save_as_table(result_table)

  logger.info("Job finished")

if __name__ == "__main__":
  run_job()
```

In the code:

* Python code executes at `main`, which then executes the `run_job()` function:

  ```python
  if __name__ == "__main__":
    run_job()
  ```
* The `run_job()` function reads the environment variables and uses them to set default values for various parameters.
  The container uses these parameters to connect to Snowflake. Note that:

  + You can override the parameter values, used in the service, using the `containers.env` and `containers.args` fields in the service specification. For more information, see [Service specification reference](../specification-reference.md).
  + When the image runs in Snowflake, Snowflake populates some of these parameters (see source code) automatically. However,
    when testing the image locally, you need to explicitly provide these parameters (as shown in the next section,
    Building and testing an image locally).

#### Dockerfile

This file contains all the commands to build an image using Docker.

```bash
ARG BASE_IMAGE=continuumio/miniconda3:4.12.0
FROM $BASE_IMAGE
RUN conda install python=3.12 && \
  conda install snowflake-snowpark-python
COPY main.py ./
ENTRYPOINT ["python3", "main.py"]
```

#### my_job_spec.yaml File (Service Specification)

Snowflake uses information you provide in this specification to configure and run your job service.

```yaml
spec:
  containers:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
    env:
      SNOWFLAKE_WAREHOUSE: tutorial_warehouse
    args:
    - "--query=select current_time() as time,'hello'"
    - "--result_table=results"
```

In addition to the `container.name` and `container.image` required fields (see [Service specification reference](../specification-reference.md)),
the specification includes the optional `container.args` field to list the arguments:

* `--query` provides the query to execute when the service runs.
* `--result_table` identifies the table to save the query results.

### Building and testing an image locally

You can test the Docker image locally before uploading it to a repository in your Snowflake account. In local testing, your
container runs standalone (it is not a job service that Snowflake executes).

Use the following steps to test the Tutorial 2 Docker image:

1. To create a Docker image, in the Docker CLI, execute the `docker build` command:

   ```bash
   docker build --rm -t my_service:local .
   ```
2. To launch your code, execute the `docker run` command, providing `<orgname>-<acctname>`, `<username>`, and
   `<password>`:

   ```bash
   docker run --rm \
     -e SNOWFLAKE_ACCOUNT=<orgname>-<acctname> \
     -e SNOWFLAKE_HOST=<orgname>-<acctname>.snowflakecomputing.com \
     -e SNOWFLAKE_DATABASE=tutorial_db \
     -e SNOWFLAKE_SCHEMA=data_schema \
     -e SNOWFLAKE_ROLE=test_role \
     -e SNOWFLAKE_USER=<username> \
     -e SNOWFLAKE_PASSWORD=<password> \
     -e SNOWFLAKE_WAREHOUSE=tutorial_warehouse \
     my_job:local \
     --query="select current_time() as time,'hello'" \
     --result_table=tutorial_db.data_schema.results
   ```

   When testing the image locally, note that, in addition to the three arguments (a query, the warehouse to run the query, and
   a table to save the result to), you also provide the connection parameters for the container running locally to connect to
   Snowflake.

   When you run the container as a service, Snowflake provides these parameters to the container as the environment variables. For
   more information, see [Configure Snowflake Client](../spcs-execute-sql.md).

   The job service executes the query (`select current_time() as time,'hello'`) and writes result to the table
   (`tutorial_db.data_schema.results`). If the table does not exist, it is created. If the table exists, the job service adds a row.

   Sample result of querying the results table:

   ```output
   +----------+----------+
   | TIME     | TEXT     |
   |----------+----------|
   | 10:56:52 | hello    |
   +----------+----------+
   ```

## What’s next?

You can now test [Tutorial 4](advanced/tutorial-4.md), which shows how service-to-service communication works.

---
title: Tutorial 3: Create a service and a job using the Snowflake Python APIs
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/tutorial-1-with-sf-python.md
section: Snowpark Container Services
---

App Development

# Tutorial 3: Create a service and a job using the Snowflake Python APIs

## Introduction

In [Tutorial 1](tutorial-1.md) and [Tutorial 2](tutorial-2.md), you use the SQL interface to create a Snowpark Container Services service and job. In this tutorial you use the [Snowflake Python APIs](../../snowflake-python-api/snowflake-python-overview.md) to create the same service and job and thus explore using the Snowflake Python APIs to manage Snowpark Container Services resources.

The tutorial uses a [Snowflake notebook](../../../user-guide/ui-snowsight/notebooks.md) to execute the Python code, but the code is independent of the notebook and you can execute the code in other environments.

## 1: Initial configuration

In this initial configure, you create a Snowflake notebook, import libraries you need, and define constants that are used by cells in the subsequent steps.

1. Create a Snowflake notebook.

   1. Create a notebook. For instructions, see [Create a new notebook](../../../user-guide/ui-snowsight/notebooks-create.md). Note that the **Python environment** you choose in the UI (Run on warehouse or Run on container) doesn’t matter.
   2. From the **Packages** drop-down menu, choose the “snowflake” package and install the latest version of the Snowflake Python APIs library.
   3. (Optional) Delete the cells provided in the notebook by default. As you follow the steps in this tutorial, you add Python cells to the notebook.
2. Create and run the cell to import Python libraries used by many cells in this tutorial.

   > ```python
   > from snowflake.snowpark.context import get_active_session
   > from snowflake.core import Root
   > from snowflake.core import CreateMode
   > ```
3. Create and run the cell to define constants that you use in subsequent cells. The values provided below match Tutorials 1 and 2. You can optionally change these values.

   > ```python
   > current_user = get_active_session().get_current_user()
   > user_role_name = "test_role"
   > compute_pool_name = "tutorial_compute_pool"
   > warehouse_name = "tutorial_warehouse"
   > database_name = "tutorial_db"
   > schema_name = "data_schema"
   > repo_name = "tutorial_repository"
   > stage_name = "tutorial_stage"
   > service_name = "echo_service"
   > print("configured!")
   > ```

## 2: Create Snowflake objects

Before you can create a service, you need Snowflake objects, such as a database, a user, a role, a compute pool, and an image repository. Some of these objects are account-scoped object that require administrative privileges to create them. The names of the objects created are defined in the preceding step.

### 2.1: Create account-scoped Snowflake objects

The following Python code creates these objects:

* Role (`test_role`). You grant this role all the privileges required to create and use the service. In the code, you grant this role to the current user to enable the user to create and use the service.
* Database (`tutorial_db`). In the next step, you create a schema in this database.
* Compute pool (`tutorial_compute_pool`). Your service container executes in this compute pool.
* Warehouse (`tutorial_warehouse`). When the service connects to Snowflake and executes queries, this warehouse is used to execute the queries.

Create and run the cell to create these account-scoped objects using the ACCOUNTADMIN role. Note that the script creates resources only if they don’t exist. The comments in the code show the equivalent SQL statements.

```python
from snowflake.core.compute_pool import ComputePool
from snowflake.core.database import Database
from snowflake.core.grant import Grant, Grantees, Privileges, Securable, Securables
from snowflake.core.role import Role
from snowflake.core.warehouse import Warehouse

session = get_active_session()
session.use_role("ACCOUNTADMIN")
root = Root(session)

# CREATE ROLE test_role;
root.roles.create(
    Role(name=user_role_name),
    mode=CreateMode.if_not_exists)
print(f"Created role:", user_role_name)

# GRANT ROLE test_role TO USER <user_name>
root.grants.grant(Grant(
    securable=Securables.role(user_role_name),
    grantee=Grantees.user(name=current_user),
    ))

# CREATE COMPUTE POOL IF NOT EXISTS tutorial_compute_pool
#   MIN_NODES = 1 MAX_NODES = 1
#   INSTANCE_FAMILY = CPU_X64_XS
root.compute_pools.create(
    mode=CreateMode.if_not_exists,
    compute_pool=ComputePool(
        name=compute_pool_name,
        instance_family="CPU_X64_XS",
        min_nodes=1,
        max_nodes=2,
    )
)

# GRANT USAGE, OPERATE, MONITOR ON COMPUTE POOL tutorial_compute_pool TO ROLE test_role
root.grants.grant(Grant(
    privileges=[Privileges.usage, Privileges.operate, Privileges.monitor],
    securable=Securables.compute_pool(compute_pool_name),
    grantee=Grantees.role(name=user_role_name)
    ))

print(f"Created compute pool:", compute_pool_name)

# CREATE DATABASE IF NOT EXISTS tutorial_db;
root.databases.create(
    Database(name=database_name),
    mode=CreateMode.if_not_exists)

# GRANT ALL ON DATABASE tutorial_db TO ROLE test_role;
root.grants.grant(Grant(
    privileges=[Privileges.all_privileges],
    securable=Securables.database(database_name),
    grantee=Grantees.role(name=user_role_name),
    ))

print("Created database:", database_name)

# CREATE OR REPLACE WAREHOUSE tutorial_warehouse WITH WAREHOUSE_SIZE='X-SMALL';
root.warehouses.create(
    Warehouse(name=warehouse_name, warehouse_size="X-SMALL"),
    mode=CreateMode.if_not_exists)

# GRANT USAGE ON WAREHOUSE tutorial_warehouse TO ROLE test_role;
root.grants.grant(Grant(
    privileges=[Privileges.usage],
    grantee=Grantees.role(name=user_role_name),
    securable=Securables.warehouse(warehouse_name)
    ))

print("Created warehouse:", warehouse_name)

# GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE test_role
root.grants.grant(Grant(
    privileges=[Privileges.bind_service_endpoint],
    securable=Securables.current_account,
    grantee=Grantees.role(name=user_role_name)
    ))

print("Done: GRANT BIND SERVICE ENDPOINT")
```

As you create resources, the code also grants required privileges to the role (`test_role`) so the role can use these resources. Additionally, note that the echo service you create in this tutorial exposes one public endpoint. This public endpoint allows other users in your account to access the service from the public web (ingress). To create a service with a public endpoint, the role (`test_role`) must have the `BIND SERVICE ENDPOINT` privilege on the account.

### 2.2 Create schema-scoped objects

The Python code in this section uses the `test_role` role to create a schema and objects in that schema. You don’t need administrative privileges to create these resources.

* Schema (`data_schema`). You create an image repository, service, and job in this schema.
* Image repository (`tutorial_repository`). You store your application image in this repository.
* Stage (`tutorial_stage`). The stage is created only for illustration. While not demonstrated in this tutorial, stages can be used to pass data into
  or collect data from your services.

Note that the script creates resources only if they don’t exist.

```python
from snowflake.core.image_repository import ImageRepository
from snowflake.core.schema import Schema
from snowflake.core.stage import Stage, StageDirectoryTable

session = get_active_session()
session.use_role(user_role_name)
root = Root(session)

# CREATE SCHEMA IF NOT EXISTS {schema_name}
schema = root.databases[database_name].schemas.create(
    Schema(name=schema_name),
    mode=CreateMode.if_not_exists)
print("Created schema:", schema.name)

# CREATE IMAGE REPOSITORY IF NOT EXISTS {repo}
repo = schema.image_repositories.create(
    ImageRepository(name=repo_name),
    mode=CreateMode.if_not_exists)
print("Create image repository:", repo.fully_qualified_name)

repo_url = repo.fetch().repository_url
print("image registry hostname:", repo_url.split("/")[0])
print("image repository url:", repo_url + "/")

#CREATE STAGE IF NOT EXISTS tutorial_stage
#  DIRECTORY = ( ENABLE = true );
stage = schema.stages.create(
    Stage(
        name=stage_name,
        directory_table=StageDirectoryTable(enable=True)),
    mode=CreateMode.if_not_exists)
print("Created stage:", stage.fully_qualified_name)
```

The Python code also prints out useful information about the repository (the repository URL) that you use when pushing your images to the repository.

## 3: Build an image and upload

You download locally the code as described in [Tutorial 1](tutorial-1.md), use Docker commands to build the image, and upload it to the image repository in your account.

1. Create and run the cell to obtain the hostname of your image registry and the URL to your image repository.

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]
   repo =  schema.image_repositories[repo_name]

   repo_url = repo.fetch().repository_url
   print("image registry hostname:", repo_url.split("/")[0])
   print("image repository url:", repo_url + "/")
   ```

   The Python code retrieves the image repository [resource](../../snowflake-python-api/snowflake-python-general-concepts.md) object (`repo`), accesses the [model](../../snowflake-python-api/snowflake-python-general-concepts.md) object, and extracts the repository URL from it.
2. Follow [Tutorial 1](tutorial-1.md) steps 1 and 2 to download the service code, build an image, and upload it to the repository.
3. Create and run the cell to verify the image is in the repository.

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   repo = schema.image_repositories[repo_name]
   for image in repo.list_images_in_repository():
       print(image.image_path)
   ```

   The code enumerates the images from the image repository resource (`repo`) and prints the `image_path` for each image.

## 4: Create a service

Create a service and a service function to communicate with the service.

1. Verify that the compute pool is ready. After you create a compute pool, it takes some time for Snowflake to provision all the nodes. Ensure that
   the compute pool is ready before creating a service, because service containers execute within the specified compute pool.

   Create and run the cell to get the compute pool status:

   > ```python
   > import time
   >
   > session = get_active_session()
   > session.use_role(user_role_name)
   > root = Root(session)
   >
   > cp = root.compute_pools[compute_pool_name]
   >
   > cpm = cp.fetch()
   > print(cpm.state, cpm.status_message)
   > if cpm.state == 'SUSPENDED':
   >     cp.resume()
   > while cpm.state in ['STARTING', 'SUSPENDED']:
   >     time.sleep(5)
   >     cpm = cp.fetch()
   >     print(cpm.state, cpm.status_message)
   > ```

   The code fetches the compute pool model (`cpm`) from the compute pool resource (`cp`) to retrieve the current compute pool state. If the compute pool is suspended, the code resumes the compute pool. The code loops, pausing for five seconds each time, until the compute pool is no longer in the STARTING or SUSPENDED state.

   The last line of output should be “IDLE” or “ACTIVE”, which indicates that the compute pool is ready to run your service. For more information, see [Compute pool lifecycle](../working-with-compute-pool.md). If the compute pool is not ready, your services can’t start.
2. Create and run the cell to create the echo service.

   ```python
   from snowflake.core.service import Service, ServiceSpec

   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   repo = schema.image_repositories[repo_name]
   repo_url = repo.fetch().repository_url

   specification = f"""
       spec:
         containers:
         - name: echo
           image: {repo_url}/my_echo_service_image:latest
           env:
             SERVER_PORT: 8000
             CHARACTER_NAME: Bob
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: 8000
           public: true

       """
   echo_service = schema.services.create(Service(
       name=service_name,
       compute_pool=compute_pool_name,
       spec=ServiceSpec(specification),
       min_instances=1,
       max_instances=1),
       mode=CreateMode.if_not_exists)
   print("created service:", echo_service.name)
   ```

   The code retrieves the repository URL, as done in the preceding step.
   The code then creates the `echo_service` using an inline specification and the image from the specified image repository.

   As you see from the Python code, it’s easy to parameterize the names of resources. The following is the equivalent SQL command that creates a service but doesn’t use parameters.

   ```sqlexample-yaml
   CREATE SERVICE echo_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
       spec:
         containers:
         - name: echo
           image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
           env:
             SERVER_PORT: 8000
             CHARACTER_NAME: Bob
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: 8000
           public: true
     $$
     MIN_INSTANCES=1
     MAX_INSTANCES=1;
   ```
3. Run the cell to create a service function (`my_echo_function`). A service function is one of the ways of using the service.

   ```python
   from snowflake.core.function import ServiceFunction, FunctionArgument

   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   # CREATE FUNCTION my_echo_udf (inputtext VARCHAR)
   #  RETURNS VARCHAR
   #  SERVICE=echo_service
   #  ENDPOINT=echoendpoint
   #  AS '/echo';
   svcfn = schema.functions.create(mode=CreateMode.or_replace,
       function=ServiceFunction(
           name="my_echo_function",
           arguments=[FunctionArgument(name="inputtext", datatype="TEXT")],
           returns="TEXT",
           service=service_name,
           endpoint="echoendpoint",
           path="/echo"))
   print("created service function:", svcfn.name_with_args)
   ```

   The code calls the `create` method on the `functions` collection of the `schema` to create the service function (`my_echo_function`).

## 5: Use the service

In this section, you use the service as follows:

* Invoke the service function.
* Use a browser to interact with the service’s public endpoint.

1. Invoke the service function.

   ```python
   svcfn = schema.functions["my_echo_function(TEXT)"]
   print(svcfn.execute(["hello"]))
   ```

   Snowflake sends a POST request to the service endpoint (`echoendpoint`). Upon receiving the request, the service echoes the input string in the response.

   Output:

   ```Output
   +--------------------------+
   | **MY_ECHO_UDF('HELLO!')**|
   |------------------------- |
   | Bob said hello!          |
   +--------------------------+
   ```
2. Access from a browser the public endpoint that the service exposes.

   1. Get the URL of the public endpoint.

      ```python
      # helper to check if service is ready and return endpoint url
      def get_ingress_for_endpoint(svc, endpoint):
          for _ in range(10): # only try 10 times
              # Find the target endpoint.
              target_endpoint = None
              for ep in svc.get_endpoints():
                  if ep.is_public and ep.name == endpoint:
                      target_endpoint = ep
                      break;
              else:
                  print(f"Endpoint {endpoint} not found")
                  return None

              # Return endpoint URL or wait for it to be provisioned.
              if target_endpoint.ingress_url.startswith("Endpoints provisioning "):
                  print(f"{target_endpoint.ingress_url} is still in provisioning. Wait for 10 seconds.")
                  time.sleep(10)
              else:
                  return target_endpoint.ingress_url
          print("Timed out waiting for endpoint to become available")

      endpoint_url = get_ingress_for_endpoint(echo_service, "echoendpoint")
      print(f"https://{endpoint_url}/ui")
      ```
   2. Paste the printed URL in a browser window. This causes the service to execute the `ui()` function (see `echo_service.py`).

      Note that the first time you access the endpoint URL, you will be asked to log in to Snowflake. For this test, use the same user that you used to create the service to ensure the user has the necessary privileges.
   3. Enter the string “Hello” in the **Input** box, and press **Return**.

## 6: Create a job

In Tutorial 2, you use the SQL interface to create a Snowpark Container Services job. In this section, you create the same job using the Snowflake Python APIs.

1. Create and run the cell to obtain the hostname of your image registry and the URL to your image repository.

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]
   repo =  schema.image_repositories[repo_name]

   repo_url = repo.fetch().repository_url
   print("image registry hostname:", repo_url.split("/")[0])
   print("image repository url:", repo_url + "/")
   ```

   The Python code retrieves the image repository resource object (`repo`), accesses the model object, and extracts the repository URL from it.
2. Follow [Tutorial 2](tutorial-2.md) steps 1 and 2 to download the service code, build an image, and upload it to the repository.
3. Create and run the cell to verify the image is in the repository.

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   repo = schema.image_repositories[repo_name]
   for image in repo.list_images_in_repository():
       print(image.image_path)
   ```

   The code enumerates the images from the image repository resource (`repo`) and prints the `image_path` for each image.
4. Create and run the cell to create the job.

   ```python
   from snowflake.core.service import JobService, ServiceSpec

   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   repo = schema.image_repositories[repo_name]
   repo_url = repo.fetch().repository_url

   job_name = "test_job"

   # cleanup previous job if present.
   schema.services[job_name].drop()(if_exists=True)

   specification = f"""
       spec:
         containers:
         - name: main
           image: {repo_url}/my_job_image:latest
           env:
             SNOWFLAKE_WAREHOUSE: {warehouse_name}
           args:
           - "--query=select current_time() as time,'hello'"
           - "--result_table=results"
       """
   job = schema.services.execute_job(JobService(
       name=job_name,
       compute_pool=compute_pool_name,
       spec=ServiceSpec(specification)))
   print("executed job:", job.name, "status:", job.fetch().status)

   print("job logs:")
   print(job.get_service_logs(0, "main"))
   ```

   The job runs the given query and stores the results in a table.
5. Run the following cell to review the result written to the table. This code uses Snowpark Python to query that table.

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   # show that above job wrote to results table
   session.sql(f"select * from {database_name}.{schema_name}.results").collect()
   ```

## 7: Clean up

1. Stop the service and drop it. After dropping the service, Snowflake by default automatically suspends the compute pool (assuming there are no other services and job services running). For more information, see [compute pool lifecycle](../working-with-compute-pool.md).

   ```python
   session = get_active_session()
   session.use_role(user_role_name)
   root = Root(session)

   schema = root.databases[database_name].schemas[schema_name]

   # now let's clean up

   schema.functions["my_echo_function(TEXT)"].drop()
   schema.services[service_name].drop()
   ```
2. Drop the image repository to avoid paying for storage. Note that, if you have any other images stored in the repository they will be deleted.

   ```python
   schema.image_repositories[repo_name].drop()
   ```
3. Drop the schema. Dropping a schema also drops all objects in that schema. For this tutorial that includes the service, the function, the image repository, and the stage you created.

   ```python
   root.databases[database_name].schemas[schema_name].drop()
   ```
4. Instead of waiting for Snowflake to suspend your compute pool, you also can explicitly suspend the compute pool. In this case, Snowflake suspends any running services, and waits for any jobs running to finish, and then suspends the compute pool.

   ```python
   root.compute_pool[compute_pool_name].suspend()
   ```

## What’s next?

This tutorial demonstrates using Snowflake Python APIs to create and manage Snowpark Container Services services and jobs. For more information about the Snowflake Python APIs, see [Snowflake Python APIs: Managing Snowflake objects with Python](../../snowflake-python-api/snowflake-python-overview.md).

---
title: Tutorial 4: Service-to-Service Communications with Snowpark Container Services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/tutorial-4.md
section: Snowpark Container Services
---

App Development

# Tutorial 4: Service-to-Service Communications with Snowpark Container Services

## Introduction

In this tutorial, you create a Snowpark Container Services job service that communicates with the Echo service you created in
[Tutorial 1](../tutorial-1.md). When the job service runs, it sends a POST request to the Echo service URL (that you
provide in the service specification) with a “Hello” string in the request body. The Echo service returns a response with the
“Bob said Hello” string in the response body. You access the job service container logs to verify that the communications succeeded.

There are two parts to this tutorial:

* **Part 1: Create and test a job service.** You download code provided for this tutorial and follow step-by-step instructions:

  1. Download the job service code for this tutorial.
  2. Build a Docker image for Snowpark Container Services, and upload the image to a repository in your account.
  3. Stage the specification file, which gives Snowflake the container configuration information. In addition to the name of
     the image to use to start a container, the specification sets the environment variable (`SERVICE_URL`) to the Echo service
     URL. The application code reads this environment variable to send requests to the Echo service.
  4. Execute the job service. Using the EXECUTE JOB SERVICE command, you can execute the job service by providing the specification file and the
     compute pool where Snowflake can run the container. And finally, access logs from the job service container to verify that the
     communication between the job service and service succeeded.
* **Part 2: Understand the job service code**. This section provides an overview of the service code and highlights how different
  components collaborate.

## Prerequisites

Complete [Tutorial 1](../tutorial-1.md). To verify the service is running, execute the [DESCRIBE SERVICE](../../../../sql-reference/sql/desc-service.md) command.

```sqlexample
DESC SERVICE echo_service;
```

Verify the `status` column shows the service status as RUNNING; if the status is PENDING, it indicates the service is still starting. To investigate why the service is not RUNNING, execute the [SHOW SERVICE CONTAINERS IN SERVICE](../../../../sql-reference/sql/show-service-containers-in-service.md) command and review the `status` of individual containers:

```sqlexample
SHOW SERVICE CONTAINERS IN SERVICE echo_service;
```

You need the service running before you can proceed.

## 1: Download the service code

Code (a Python application) is provided to create a job service.

1. Download [`SnowparkContainerServices-Tutorials.zip`](../../../../_downloads/c3a8f6109048f2ecca7734c7fd3b0b3b/SnowparkContainerServices-Tutorials.zip).
2. Unzip the content, which includes one directory for each tutorial. The `Tutorial-3` directory has the following files:

   * `service_to_service.py`
   * `Dockerfile`
   * `service_to_service_spec.yaml`

## 2: Build and upload an image

Build an image for the linux/amd64 platform that Snowpark Container Services supports, and then upload the image to the image
repository in your account (see [Common Setup](../common-setup.md)).

You will need information about the repository (the repository URL and the registry hostname) before you can build and upload the image. For more information, see
[Registry and Repositories](../../working-with-registry-repository.md).

**Get information about the repository**

1. To get the repository URL, execute the [SHOW IMAGE REPOSITORIES](../../../../sql-reference/sql/show-image-repositories.md) SQL command.

   ```bash
   SHOW IMAGE REPOSITORIES;
   ```

   * The `repository_url` column in the output provides the URL. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
     ```
   * The host name in the repository URL is registry host name. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com
     ```

**Build image and upload it to the repository**

1. Open a terminal window, and change to the directory containing the files you unzipped.
2. To build a Docker image, execute the following `docker build` command using the Docker CLI.
   Note the command specifies the current working directory (.)
   as the `PATH` for files to use for building the image.

   ```bash
   docker build --rm --platform linux/amd64 -t <repository_url>/<image_name> .
   ```

   * For `image_name`, use `service_to_service:latest`.

   **Example**

   ```bash
   docker build --rm --platform linux/amd64 -t myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/service_to_service:latest .
   ```
3. Upload the image to the repository in your Snowflake account. In order for Docker to upload an image on your behalf to your repository,
   you must first [authenticate Docker with the registry](../../working-with-registry-repository.md).

   1. For Docker to upload an image on your behalf to your repository,
      first [authenticate Docker with the registry](../../working-with-registry-repository.md).

      1. We recommend using [Snowflake CLI](../../../snowflake-cli/index.md)
         to authenticate your local Docker instance with the image
         registry for your Snowflake account. Make sure that you configured Snowflake CLI to connect to Snowflake. For more information,
         see [Configuring Snowflake CLI and connecting to Snowflake](../../../snowflake-cli/connecting/connect.md).
      2. To authenticate, execute the following Snowflake CLI command:

         ```snowcli
         snow spcs image-registry login
         ```
   2. To upload the image, execute the following command:

      ```bash
      docker push <repository_url>/<image_name>
      ```

      **Example**

      ```bash
      docker push myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/service_to_service:latest
      ```

## 3: Stage the specification file

* To upload your service specification file (`service_to_service_spec.yaml`) to the stage, use one of the following options:

  + **The Snowsight web interface**. For instructions, see [Choosing an internal stage for local files](../../../../user-guide/data-load-local-file-system-create-stage.md).
  + **The SnowSQL CLI.** Execute the following [PUT](../../../../sql-reference/sql/put.md) command:

    ```sqlexample
    PUT file://<absolute-path-to-spec.yaml> @tutorial_stage
      AUTO_COMPRESS=FALSE
      OVERWRITE=TRUE;
    ```
  > The command sets OVERWRITE=TRUE so that you can upload the file again, if needed (for example, if you fixed an error in your
  > specification file). If the PUT command is executed successfully, information about the uploaded file is printed out.

## 4: Execute the job service

Now you are ready to test the Snowflake job service you created. When the job service is executed, Snowflake collects anything that your code in
the container outputs to standard output or standard error as logs. You can use the `SYSTEM$GET_SERVICE_LOGS` system function
to access the logs.

1. To start a job service, run the EXECUTE JOB SERVICE command:

   ```sqlexample
   EXECUTE JOB SERVICE
     IN COMPUTE POOL tutorial_compute_pool
     NAME=tutorial_db.data_schema.tutorial_4_job_service
     FROM @tutorial_stage
     SPEC='service_to_service_spec.yaml';
   ```

   Note the following:

   * FROM and SPEC provide the stage name and the name of the service specification file.
   * COMPUTE_POOL provides the compute resources where Snowflake executes the job service.

   Snowflake runs the container identified in the specification file. The container reads the `SERVICE_URL` environment
   variable value (`http://echo-service:8000/echo`) and sends a request to the Echo service at port 8000 at `/echo`
   HTTP path.

   Snowflake starts the job service and returns the following output:

   ```output
   +------------------------------------------------------------------------+
   | status                                                                 |
   |------------------------------------------------------------------------|
   | Job TUTORIAL4_JOB_SERVICE completed successfully with status: DONE     |
   +------------------------------------------------------------------------+
   ```

   Note that the response includes the job service name.
2. (optional) After the job service completes, you can get more information about the job service that executed. This is useful for debugging job service failure.
   To get the job service status, execute [SHOW SERVICE CONTAINERS IN SERVICE](../../../../sql-reference/sql/show-service-containers-in-service.md).

   ```sqlexample
   SHOW SERVICE CONTAINERS IN SERVICE tutorial_4_job_service;
   ```

   Sample output:

   ```output
   +---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------+
   | database_name | schema_name | service_name           | instance_id | container_name | status | message                | image_name                                                                                                                             | image_digest                                                            | restart_count | start_time |
   |---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------|
   | TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_4_JOB_SERVICE | 0           | main           | DONE   | Completed successfully | myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/service_to_service:latest | sha256:aa3fa2e5c1552d16904a5bbc97d400316ebb4a608bb110467410485491d9d8d0 |             0 |            |
   +---------------+-------------+------------------------+-------------+----------------+--------+------------------------+----------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+------------+
   ```
3. To read the job service logs call [SYSTEM$GET_SERVICE_LOGS](../../../../sql-reference/functions/system_get_service_logs.md):

   ```sqlexample
   CALL SYSTEM$GET_SERVICE_LOGS('tutorial_4_job_service', 0, 'main');
   ```

   `main` is the name of the container you retrieve the log from. You set this container name for the container in the
   service specification file.

   Sample log:

   ```output
   +--------------------------------------------------------------------------------------------------------------------------+
   | SYSTEM$GET_JOB_LOGS                                                                                                      |
   |--------------------------------------------------------------------------------------------------------------------------|
   | service-to-service [2023-04-29 21:52:09,208] [INFO] Calling http://echo-service:8000/echo with input Hello               |
   | service-to-service [2023-04-29 21:52:09,212] [INFO] Received response from http://echo-service:8000/echo: Bob said Hello |
   +--------------------------------------------------------------------------------------------------------------------------+
   ```

## 5: Clean up

Snowflake charges for the Compute Pool nodes that are active for your account. (See
[Working With Compute Pools](../../working-with-compute-pool.md)). To prevent unwanted charges, first stop all services that are
currently running on a compute pool. Then, either suspend the compute pool (if you intend to use it again later) or drop it.

1. Stop all services and job services on the compute pool.

   ```sqlexample
   ALTER COMPUTE POOL tutorial_compute_pool STOP ALL;
   ```
2. Delete the compute pool.

   ```sqlexample
   DROP COMPUTE POOL tutorial_compute_pool;
   ```

You can also clean up the image registry (remove all images) and the internal stage (remove specifications).

```sqlexample
DROP IMAGE REPOSITORY tutorial_repository;
DROP STAGE tutorial_stage;
```

## 6: Reviewing the job service code

This section covers the following topics:

* Examining the files provided: Review various code files that implement the job service.
* Building and testing an image locally. Learn how to locally test the Docker image before uploading it to a
  repository in your Snowflake account.

### Examining the files provided

The zip file you downloaded includes the following files:

* `service_to_service.py`
* `Dockerfile`
* `service_to_service_spec.yaml`

This section provides an overview of how the code implements job service.

#### service_to_service.py file

```python
import json
import logging
import os
import requests
import sys

SERVICE_URL = os.getenv('SERVICE_URL', 'http://localhost:8080/echo')
ECHO_TEXT = 'Hello'

def get_logger(logger_name):
  logger = logging.getLogger(logger_name)
  logger.setLevel(logging.DEBUG)
  handler = logging.StreamHandler(sys.stdout)
  handler.setLevel(logging.DEBUG)
  handler.setFormatter(
    logging.Formatter(
      '%(name)s [%(asctime)s] [%(levelname)s] %(message)s'))
  logger.addHandler(handler)
  return logger

logger = get_logger('service-to-service')

def call_service(service_url, echo_input):
  logger.info(f'Calling {service_url} with input {echo_input}')

  row_to_send = {"data": [[0, echo_input]]}
  response = requests.post(url=service_url,
                           data=json.dumps(row_to_send),
                           headers={"Content-Type": "application/json"})

  message = response.json()
  if message is None or not message["data"]:
    logger.error('Received empty response from service ' + service_url)

  response_row = message["data"][0]
  if len(response_row) != 2:
    logger.error('Unexpected response format: ' + response_row)

  echo_reponse = response_row[1]
  logger.info(f'Received response from {service_url}: ' + echo_reponse)

if __name__ == '__main__':
  call_service(SERVICE_URL, ECHO_TEXT)
```

When the job service runs:

1. Snowflake uses the value provided in the specification file to set the SERVICE_URL environment variable in the container.
2. The code reads the environment variable.

   ```python
   SERVICE_URL = os.getenv('SERVICE_URL', 'http://localhost:8080/echo').
   ```
3. The `call_service()` function uses the `SERVICE_URL` to communicate with the Echo service.

#### Dockerfile

This file contains all the commands to build an image using Docker.

```bash
ARG BASE_IMAGE=python:3.10-slim-buster
FROM $BASE_IMAGE
COPY service_to_service.py ./
RUN pip install --upgrade pip && \
  pip install requests
CMD ["python3", "service_to_service.py"]
```

#### service_to_service_spec.yaml file (service specification)

Snowflake uses information you provide in this specification to configure and run your service.

```yaml
spec:
container:
   - name: main
      image: /tutorial_db/data_schema/tutorial_repository/service_to_service:latest
      env:
      SERVICE_URL: "http://echo-service:8000/echo"
```

This specification provides information to Snowflake for configuring and running your job. To communicate with the Echo service,
the job needs the following:

* DNS name of the Echo service to send requests to.
* HTTP port on which the Echo service is listening.
* HTTP path where the Echo service expects the request to be sent.

To get this information:

1. To get the DNS name of the Echo service ([Tutorial 1](../tutorial-1.md)), execute the [DESCRIBE SERVICE](../../../../sql-reference/sql/desc-service.md) SQL
   command:

   ```sqlexample
   DESCRIBE SERVICE echo_service;
   ```

   Resulting DNS name for the Echo service:

   ```none
   echo-service.fsvv.svc.spcs.internal
   ```

   Note that, in this tutorial, you create the job service in the same database schema (`data-schema`) where the Echo service
   ([Tutorial 1](../tutorial-1.md)) is created. Therefore, you only need the “echo-service” portion of the
   preceding DNS name for constructing the `SERVICE_URL`.
2. Get the port number (8000) where Echo service is listening from the Echo service specification file
   ([Tutorial 1](../tutorial-1.md)). You can also use the [SHOW ENDPOINTS](../../../../sql-reference/sql/show-endpoints.md) SQL command.

You then create the preceding specification file (`service_to_service_spec.yaml`). In addition to the required
`containers.name` and `containers.image` fields, you also include the optional `containers.env` field to specify environment variables used by the service.

### Building and testing an image locally

You can test the Docker image locally before uploading it to a repository in your Snowflake account. In local testing, your
container runs standalone (it is not a job service that Snowflake executes).

> **Note:**
>
> The Python code provided for this tutorial uses the `requests` library to send requests to another Snowpark Containers
> service. If you don’t have this library installed, run pip (for example, `pip3 install requests`).

Use the following steps to test the Tutorial 4 Docker image:

1. You need the Echo service running ([Tutorial 1](../tutorial-1.md)). To start the Tutorial 1 Echo service, in a
   terminal window, execute the following Python command:

   ```bash
   SERVER_PORT=8000 python3 echo_service.py
   ```
2. Open another terminal window and, run the Python code provided for this tutorial:

   ```bash
   SERVICE_URL=http://localhost:8000/echo python3 service_to_service.py
   ```

   Note that the `SERVICE_URL` is an environment variable. For local testing, you need to explicitly set this variable.
   This URL matches the port and HTTP path explicitly specified when you started the Echo service.

   When the job is executed, it sends a POST request to the Echo service listening on port 8000 with the “Hello” string in the
   request body. The Echo service echoes the input back and returns a response - “I said Hello”.

   Sample response:

   ```output
   service-to-service
     [2023-04-23 22:30:41,278]
     [INFO] Calling http://localhost:8000/echo with input Hello

   service-to-service
     [2023-04-23 22:30:41,287]
     [INFO] Received response from http://localhost:8000/echo: I said Hello
   ```

   Review the log to verify that the service-to-service communication succeeded.

## What’s next?

[Tutorial 5: Create a service with a block storage volume mounted](tutorial-5-block-storage.md)

---
title: Tutorial 5: Create a service with a block storage volume mounted
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/tutorial-5-block-storage.md
section: Snowpark Container Services
---

App Development

# Tutorial 5: Create a service with a block storage volume mounted

## Introduction

This tutorial provides step-by-step instructions for you to create a simple service that uses a block storage volume. You also take a snapshot of the storage volume and explore ways to use the snapshot.

## Create a service

1. Follow [Tutorial 1](../tutorial-1.md) to download code for the sample service, create a Docker image, and upload it to a repository in your Snowflake account.
2. Verify you have the `my_echo_service_image` image in the repository.

   ```sqlexample
   SHOW IMAGES IN IMAGE REPOSITORY tutorial_db.data_schema.tutorial_repository;
   ```
3. Create a service. When the service runs, the container will have a 10 Gi block volume storage mounted.

   ```sqlexample
   CREATE SERVICE my_service
    IN COMPUTE POOL tutorial_compute_pool
    FROM SPECIFICATION $$
   spec:
     containers:
     - name: echo
       image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
       volumeMounts:
       - name: block-vol1
         mountPath: /opt/block/path
       readinessProbe:
         port: 8080
         path: /healthcheck
     endpoints:
     - name: echoendpoint
       port: 8080
       public: true
     volumes:
     - name: block-vol1
       source: block
       size: 10Gi
   $$;
   ```

   > **Note:**
   >
   > This tutorial only shows how to create a service with a block storage volume. The service code
   > doesn’t use the volume.
4. To verify that the service is running, execute the [DESCRIBE SERVICE](../../../../sql-reference/sql/desc-service.md) command.

   ```sqlexample
   DESC SERVICE echo_service;
   ```

   Verify the `status` column shows the service status as RUNNING; if the status is PENDING, it indicates the service is still starting. To investigate why the service is not RUNNING, execute the [SHOW SERVICE CONTAINERS IN SERVICE](../../../../sql-reference/sql/show-service-containers-in-service.md) command and review the `status` of individual containers:

   ```sqlexample
   SHOW SERVICE CONTAINERS IN SERVICE echo_service;
   ```

## Take a snapshot

1. Use the [CREATE SNAPSHOT](../../../../sql-reference/sql/create-snapshot.md) command to take a snapshot of the block storage volume attached to the service
   instance 0. You specify instance 0 because you are running only one service instance.

   Use double-quotes around the name in the VOLUME parameter to match the case of the name in the service specification.

   ```sqlexample
   CREATE SNAPSHOT my_snapshot
     FROM SERVICE my_service
     VOLUME "block-vol1"
     INSTANCE 0
     COMMENT='new snapshot';
   ```
2. Review the snapshot

   * List snapshots using [SHOW SNAPSHOTS](../../../../sql-reference/sql/show-snapshots.md).

     ```sqlexample
     SHOW SNAPSHOTS;
     ```
   * Retrieve information for a specific snapshot using [DESCRIBE SNAPSHOT](../../../../sql-reference/sql/desc-snapshot.md).

     ```sqlexample
     DESC SNAPSHOT my_snapshot;
     ```
3. Run the [ALTER SNAPSHOT](../../../../sql-reference/sql/alter-snapshot.md) command to modify the snapshot.

   ```sqlexample
   ALTER SNAPSHOT my_snapshot SET comment='updated comment';
   ```

## Use the snapshot

1. You can use the snapshot two ways:

   * **Use snapshot to create a new service:** When creating a new service, you can use the snapshot as the initial content for a block storage volume as shown. The following CREATE SERVICE command creates another service (`new_service`) with a 50 Gi block storage volume. The inline specification includes the snapshot name to use for initializing the block storage volume.

     > ```sqlexample
     > CREATE SERVICE new_service
     >   IN COMPUTE POOL tutorial_compute_pool
     >   FROM SPECIFICATION $$
     > spec:
     >   containers:
     >   - name: echo
     >     image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:tutorial
     >     volumeMounts:
     >     - name: fromsnapshotvol
     >       mountPath: /opt/block/path
     >     readinessProbe:
     >       port: 8080
     >       path: /healthcheck
     >   endpoints:
     >   - name: echoendpoint
     >     port: 8080
     >     public: true
     >   volumes:
     >   - name: fromsnapshotvol
     >     source: block
     >     size: 50Gi
     >     blockConfig:
     >       initialContents:
     >         fromSnapshot: MY_SNAPSHOT
     > $$
     > min_instances=3
     > max_instances=3;
     > ```
   * **Restore a snapshot on a storage volume of an existing service:** This example restarts the first service (`my_service`) you created by replacing the original block volume content with the content from the snapshot.

     > 1. Suspend the service so you can restore the snapshot on the block storage volume.
     >
     >    ```sqlexample
     >    ALTER SERVICE my_service SUSPEND;
     >    ```
     > 2. Restore the snapshot on the block storage volume mounted on the container of the
     >    new_service instance. You are running only one instance of the Echo Service, so you specify instance ID 0.
     >
     >    ```sqlexample
     >    ALTER SERVICE my_service RESTORE     -- this will auto RESUME the service.
     >      VOLUME "block-vol1"
     >      INSTANCES 0
     >      FROM SNAPSHOT my_snapshot;
     >    ```
     > 3. Verify the service status.
     >
     >    ```sqlexample
     >    DESC SERVICE echo_service;
     >    ```
     >
     >    The `status` column should show the service status as RUNNING;
2. Use the [DROP SNAPSHOT](../../../../sql-reference/sql/drop-snapshot.md) command to drop the snapshot.

   ```sqlexample
   DROP SNAPSHOT my_snapshot;
   ```

## Clean up

Remove the resources you created.

1. Drop the two services you created:

> ```sqlexample
> DROP SERVICE my_service;
> DROP SERVICE new_service;
> ```

1. Follow [Tutorial 1](../tutorial-1.md) steps to clean up other resources created in tutorial 1.

## What’s next?

Now that you’ve completed this tutorial, you can return to [Advanced tutorials](../../overview-advanced-tutorials.md) to explore other topics.

---
title: Tutorial 6: Configure and test service endpoint privileges
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/tutorial-6-configure-test-service-role.md
section: Snowpark Container Services
---

App Development

# Tutorial 6: Configure and test service endpoint privileges

## Introduction

In Tutorial 1, you use the same role to create and test a service. The role that creates the service is the service’s owner role, so you’re able to communicate with the service by using that role.

In this tutorial, you explore by using a different role to communicate with the service.

You grant this role the USAGE privilege by using a [service role](../../working-with-services.md) that you define in the service specification.

In this tutorial, you modify the [Tutorial 1](../tutorial-1.md) as follows:

1. Create a new role that you will use to communicate with the service.
2. Modify the service specification as follows:

   * Define two endpoints, instead of just one endpoint. Note that the second endpoint is added only to demonstrate how endpoint permissions work.
   * Define a service role that is allowed to access only one of the two endpoints.
3. Grant the service role to the new role you created to allow access to one of the service endpoints.
4. Use the new role to communicate with the service endpoint.

## Prepare

Follow [Common Setup](../common-setup.md) with the following modifications:

1. Complete the common setup steps.
2. By using the ACCOUNTADMIN role, execute the following script to create another role (`service_function_user_role`), replacing
   `user_name` with the name of your Snowflake user. After creating the echo service, you use this role to communicate with the service.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE ROLE service_function_user_role;
   GRANT ROLE service_function_user_role TO USER <user-name>;
   GRANT USAGE ON WAREHOUSE tutorial_warehouse TO ROLE service_function_user_role;
   ```
3. Follow [Tutorial 1](../tutorial-1.md), steps 1 and 2, to build and upload an image to a repository in your account. Don’t proceed with step 3 because you will create the service as part of this tutorial.

## Create a service

1. To ensure you’re in the right context for the SQL statements in this step, execute the following:

   ```sqlexample
   USE ROLE test_role;
   USE DATABASE tutorial_db;
   USE SCHEMA data_schema;
   USE WAREHOUSE tutorial_warehouse;
   ```
2. To create the service, execute the following command by using `test_role` (the service’s owner role).

   ```sqlexample
   CREATE SERVICE echo_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
       spec:
         containers:
         - name: echo
           image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
           env:
             SERVER_PORT: 8000
             CHARACTER_NAME: Bob
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: echoendpoint
           port: 8000
           public: true
         - name: echoendpoint2
           port: 8002
           public: true
       serviceRoles:
       - name: echoendpoint_role
         endpoints:
         - echoendpoint
         $$;
   ```

   Per the inline specification, the `echo_service` exposes two public endpoints but the service role (`echoendpoint_role`) grants USAGE privilege only on one of the endpoints.
3. Verify the service is running.

   ```sqlexample
   SHOW SERVICES;
   SHOW SERVICE CONTAINERS IN SERVICE echo_service;
   DESCRIBE SERVICE echo_service;
   ```
4. By using `test_role` (the service’s owner role), grant the service role defined in the specification to the new role (`service_function_user_role`) you created as part of the common setup. Also grant USAGE privileges on the database and the schema.

   ```sqlexample
   USE ROLE test_role;
   USE DATABASE tutorial_db;
   USE SCHEMA data_schema;

   GRANT USAGE ON DATABASE tutorial_db TO ROLE service_function_user_role;
   GRANT USAGE ON SCHEMA data_schema TO ROLE service_function_user_role;
   GRANT SERVICE ROLE echo_service!echoendpoint_Role TO ROLE service_function_user_role;
   ```

   This service role grants the `service_function_user_role` USAGE privilege on the `echoendpoint` endpoint.

   To demonstrate that the service role name is case in-sensitive, the example uses the `echoendpoint_Role` role name.

## Use the service

Create a service function to communicate with the service. You create a service function by using the `service_function_user_role` (not the service’s owner role) and use the service.

1. Create a service function.

   ```sqlexample
   USE ROLE service_function_user_role;
   CREATE OR REPLACE FUNCTION my_echo_udf_try1 (InputText VARCHAR)
     RETURNS varchar
     SERVICE=echo_service
     ENDPOINT=echoendpoint
     AS '/echo';
   ```
2. Try creating another service function that refers to the `echoservice2` endpoint for which the role has no access privilege. Therefore, the command should *fail*.

   ```sqlexample
   CREATE OR REPLACE FUNCTION my_echo_udf_try2 (InputText varchar)
     RETURNS varchar
     SERVICE=echo_service
     ENDPOINT=echoendpoint2
     AS '/echo';
   ```
3. Use the service function.

   ```sqlexample
   SELECT my_echo_udf_try1('Hello');
   ```

## Clean up

To remove the resources you created, follow the steps in [Tutorial 1](../tutorial-1.md) steps to clean up other resources created in Tutorial 1.

## What’s next?

Now that you’ve completed this tutorial, you can return to [Working with Services](../../working-with-services.md) to explore other topics.

---
title: Tutorial 7:Create a Snowpark Container Services service that uses caller’s rights
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/tutorial-7-callers-rights.md
section: Snowpark Container Services
---

App Development

# Tutorial 7:Create a Snowpark Container Services service that uses caller’s rights

## Introduction

In this tutorial you explore building a service, presenting a web UI, that uses the caller’s rights feature when executing SQL queries on behalf of the users.

You create a service (named `query_service`) that executes a query provided in the request. By default, application containers connect to Snowflake as the service user using the service’s owner role. But this application uses the caller’s rights feature to connect to the service endpoint as the end user and using privileges granted to that user.

When testing, you use the service from a web browser because the caller’s rights feature is only supported when accessing a service using network ingress. The caller’s rights feature is not available when accessing a service using a service function.

The service does the following:

* Exposes one public endpoint.
* When a user logs in to the endpoint, the service provides a Web UI to provide a query. The service executes the query in Snowflake and returns the results. In this tutorial you execute the following SQL command:

  ```sqlexample
  SELECT CURRENT_USER(), CURRENT_ROLE();
  ```

  The command returns the name of the currently logged-in user and the currently active role, both of which depend on whether caller’s rights is used.

  + When caller’s rights is used, the service connects to Snowflake as the calling user and the user’s default role. The command returns your user name and default role.
  + When caller’s rights is not used, the default behavior kicks in where the service connects to Snowflake as the service user and the service’s owner role. Therefore, the command returns the service user name in the form: `SF$SERVICE$unique-id`, `TEST_ROLE`.

There are two parts to this tutorial:

**Part 1: Create and test a service.** You download code provided for this tutorial and follow step-by-step instructions:

1. Download the service code for this tutorial.
2. Build a Docker image for Snowpark Container Services, and upload the image to a repository in your account.
3. Create a service.
4. Communicate with the service using network ingress to connect with the public endpoint that the service exposes. Using a web browser, you login to the public endpoint and execute the SELECT CURRENT_USER(); command. Verify the command output to ensure that the container executed the command as the logged-in user.

**Part 2: Understand the service**. This section provides an overview of the service code and highlights how the application code uses the caller’s rights.

## Prepare

Follow [Common Setup](../common-setup.md) to configure prerequisites and create snowflake resources that are required for all Snowpark Container Services tutorials provided in this documentation.

## Download the service code

Code (a Python application) is provided to create the query service.

1. Download [`SnowparkContainerServices-Tutorials.zip`](../../../../_downloads/c3a8f6109048f2ecca7734c7fd3b0b3b/SnowparkContainerServices-Tutorials.zip).
2. Unzip the content, which includes one directory for each tutorial. The `Tutorial-6-callers-rights` directory has the following files:

   * `Dockerfile`
   * `main.py`
   * `templates/basic_ui.html`

## Build an image and upload

Build an image for the linux/amd64 platform that Snowpark Container Services supports, and then upload the image to the image
repository in your account (see [Common Setup](../common-setup.md)).

You will need information about the repository (the repository URL and the registry hostname) before you can build and upload the image. For more information, see
[Registry and Repositories](../../working-with-registry-repository.md).

**Get information about the repository**

1. To get the repository URL, execute the [SHOW IMAGE REPOSITORIES](../../../../sql-reference/sql/show-image-repositories.md) SQL command.

   ```bash
   SHOW IMAGE REPOSITORIES;
   ```

   * The `repository_url` column in the output provides the URL. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
     ```
   * The host name in the repository URL is the registry host name. An example is shown:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com
     ```

**Build image and upload it to the repository**

1. Open a terminal window, and change to the directory containing the files you unzipped.
2. To build a Docker image, execute the following `docker build` command using the Docker CLI.
   Note that the command specifies the current working directory (`.`)
   as the `PATH` for files to use to build the image.

   ```bash
   docker build --rm --platform linux/amd64 -t <repository_url>/<image_name> .
   ```

   * For `image_name`, use `query_service:latest`.

   **Example**

   ```bash
   docker build --rm --platform linux/amd64 -t myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/query_service:latest .
   ```
3. Upload the image to the repository in your Snowflake account. For Docker to upload an image on your behalf to your repository,
   you must first authenticate Docker with Snowflake.

   1. For Docker to upload an image on your behalf to your repository,
      first [authenticate Docker with the registry](../../working-with-registry-repository.md).

      1. We recommend using [Snowflake CLI](../../../snowflake-cli/index.md)
         to authenticate your local Docker instance with the image
         registry for your Snowflake account. Make sure that you configured Snowflake CLI to connect to Snowflake. For more information,
         see [Configuring Snowflake CLI and connecting to Snowflake](../../../snowflake-cli/connecting/connect.md).
      2. To authenticate, execute the following Snowflake CLI command:

         ```snowcli
         snow spcs image-registry login
         ```
   2. To upload the image, execute the following command:

      ```bash
      docker push <repository_url>/<image_name>
      ```

      **Example**

      ```bash
      docker push myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/query_service:latest
      ```

## Create a service

In this section you create a service (query_service).

1. Verify that the compute pool is ready and that you are in the right context to create the service.

   1. Previously, you set the context in the [Common Setup](../common-setup.md) step. To ensure that you’re in the right context for the SQL statements in this step, execute the following:
   > ```sqlexample
   > USE ROLE test_role;
   > USE DATABASE tutorial_db;
   > USE SCHEMA data_schema;
   > USE WAREHOUSE tutorial_warehouse;
   > ```

   1. To ensure that the compute pool you created in the [common setup](../common-setup.md) is ready, execute `DESCRIBE COMPUTE POOL`, and verify that the `state` is `ACTIVE` or `IDLE`. If the `state` is `STARTING`, you need to wait until the `state` changes to either `ACTIVE` or `IDLE`.
   > ```sqlexample
   > DESCRIBE COMPUTE POOL tutorial_compute_pool;
   > ```
2. To create the service, execute the following command using `test_role`:

   ```sqlexample
   CREATE SERVICE query_service
     IN COMPUTE POOL tutorial_compute_pool
     FROM SPECIFICATION $$
       spec:
         containers:
         - name: main
           image: /tutorial_db/data_schema/tutorial_repository/query_service:latest
           env:
             SERVER_PORT: 8000
           readinessProbe:
             port: 8000
             path: /healthcheck
         endpoints:
         - name: execute
           port: 8000
           public: true
       capabilities:
         securityContext:
           executeAsCaller: true
       serviceRoles:
       - name: ui_usage
         endpoints:
         - execute
   $$;
   ```

   > **Note:**
   >
   > If a service with that name already exists, use the DROP SERVICE command to delete the previously created service, and then
   > create this service.
3. Execute the following SQL commands to get detailed information about the service you just created. For more information, see
   [Snowpark Container Services: Working with services](../../working-with-services.md).

   * To list services in your account, execute the SHOW SERVICES command:

     ```sqlexample
     SHOW SERVICES;
     ```
   * To get the status of your service, execute the SHOW SERVICE CONTAINERS IN SERVICE command:

     ```sqlexample
     SHOW SERVICE CONTAINERS IN SERVICE query_service;
     ```
   * To get information about your service, execute the DESCRIBE SERVICE command:

     ```sqlexample
     DESCRIBE SERVICE query_service;
     ```

## Use the service

In this section you verify that the [caller’s rights](../../spcs-execute-sql.md) configured for the service work. You log in to the public endpoint from a browser, execute a query, and verify that the Snowflake session that the service created operates as the calling user, instead of as the service user.

First, to set up the context for the SQL statements in this section, execute the following:

```sqlexample
USE ROLE test_role;
USE DATABASE tutorial_db;
USE SCHEMA data_schema;
USE WAREHOUSE tutorial_warehouse;
```

The service exposes a public endpoint (see the inline specification provided in the CREATE SERVICE command); therefore, first log in to the endpoint using a web browser, then use the web UI that the service exposes to the internet to send query requests to the service endpoint.

1. Find the URL of the public endpoint that the service exposes:

   ```sqlexample
   SHOW ENDPOINTS IN SERVICE query_service;
   ```

   The `ingress_url` column in the response provides the URL.

   **Example**

   ```output
   p6bye-myorg-myacct.snowflakecomputing.app
   ```
2. Append `/ui` to the endpoint URL, and paste it in the web browser. This causes the service to execute the `ui()` function (see `main.py`).

   Note that the first time you access the endpoint URL, you will be asked to log in to Snowflake.
3. Use the same user that you used to create the service. Upon successful login, the service shows the following Web UI.

   Enter the following command in the text box and press enter to see the results.

   ```sqlexample
   SELECT CURRENT_USER(), CURRENT_ROLE()DONE;
   ```

   Because you included the `executeAsCaller` capability in the service specification, when a request arrives, Snowflake inserts the `Sf-Context-Current-User-Token` header in the request and then forwards the request to your service endpoint.

   For illustration purposes, the service code in this tutorial executes the query both as the caller and the service user.

   * **Executes the query on behalf of the caller (ingress user):** In this case, the code uses the user token that Snowflake provides to construct a login token for connecting with Snowflake. Thus, the service uses the caller’s rights. Snowflake executes the query on behalf of the caller, displaying the caller’s name and active role name in the query result. For example:

     ```output
     ['TESTUSER, PUBLIC']
     ```
   * **Executes the query on behalf of the service user:** In this case, the code doesn’t use the user token that Snowflake provides in the request when constructing the login token to connect with Snowflake. Thus, the service doesn’t utilize the caller’s rights, causing Snowflake to execute the query on behalf of the service user. The query result shows the service user’s name (which is the same as the service name) and the active role.

     ```output
     ['QUERY_SERVICE, TEST_ROLE']
     ```

When the service executes the query (`SELECT CURRENT_USER(), CURRENT_ROLE();`) on behalf of the caller, Snowflake doesn’t need the user’s warehouse to execute this simple query. Therefore, the service didn’t need any [caller grants](../../spcs-execute-sql.md). In the next section, the service executes a non-trivial query on behalf of the calling user that requires you to grant [caller grants](../../spcs-execute-sql.md) to the service.

> **Note:**
>
> You can access the ingress endpoint programmatically. For sample code, see [Ingress authentication](../../working-with-services.md). Note that you need to append `/ui` to the endpoint URL in the code so that Snowflake can route the request to the `ui()` function in the service code.

## Use the service with caller grants

In this section, the service executes the following query on behalf of the caller (the user who logs in the service’s ingress endpoint).

```sqlexample
SELECT * FROM ingress_user_db.ingress_user_schema.ingress_user_table;
```

The service doesn’t have permissions to access the table and doesn’t have permission to run the query in the default warehouse. To enable the service to execute this query on behalf of the caller, you grant the required [caller grants](../../spcs-execute-sql.md) to the service.

To demonstrate the scenario, you create a new role (`ingress_user_role`) and a table (`ingress_user_table`) that’s accessible to the new role but not to the service’s owner role (`test_role`). Therefore, when the service attempts to execute the query using the service credentials, Snowflake returns an error. But when the service executes the query on behalf of the user, Snowflake executes the query and returns the result.

### Create roles and resources

1. Create a role (`ingress_user_role`) and a database (`ingress_user_db`) that only this role can access. You then grant this role to the your user, so that the user can log in to the service’s public endpoint and query this table.

   ```sqlexample
   USE ROLE accountadmin;

   CREATE ROLE ingress_user_role;
   GRANT ROLE ingress_user_role TO USER <your_user_name>;

   GRANT USAGE ON WAREHOUSE tutorial_warehouse TO ROLE ingress_user_role;

   CREATE DATABASE IF NOT EXISTS ingress_user_db;
   GRANT OWNERSHIP ON DATABASE ingress_user_db TO ROLE ingress_user_role COPY CURRENT GRANTS;
   ```
2. Create a table (`ingress_user_table`) that only the `ingress_user_role` role can access.

   ```sqlexample
   USE ROLE ingress_user_role;

   CREATE SCHEMA IF NOT EXISTS ingress_user_db.ingress_user_schema;
   USE WAREHOUSE tutorial_warehouse;
   CREATE TABLE ingress_user_db.ingress_user_schema.ingress_user_table (col string) AS (
       SELECT 'this table is only accessible to the ingress_user_role'
   );
   ```

   Note that when the service tries to query the table on behalf of the caller, the service operates only as a `test_role`, the role that was used to create the service (the service’s owner role). This role does not have permissions to access the user table.
3. Grant caller grants to the service’s owner role (`test_role`) to query tables in the `ingress_user_db` database. This privilege allows the service to query tables in this database only if the following are true:

   * The service is using a [caller’s rights session](../../spcs-execute-sql.md).
   * In the session, the caller also has permission to execute these queries.

   ```sqlexample
   USE ROLE accountadmin;

   GRANT CALLER USAGE ON DATABASE ingress_user_db TO ROLE test_role;
   GRANT INHERITED CALLER USAGE ON ALL SCHEMAS IN DATABASE ingress_user_db TO ROLE test_role;
   GRANT INHERITED CALLER SELECT ON ALL TABLES IN DATABASE ingress_user_db TO ROLE test_role;
   GRANT CALLER USAGE ON WAREHOUSE tutorial_warehouse TO ROLE test_role;
   SHOW CALLER GRANTS TO ROLE test_role;
   ```
4. Configure the default warehouse and default secondary roles.

   When a session is created for a user, Snowflake activates the default primary role, default secondary roles, and the default warehouse of the logged-in user. In this tutorial,

   * You set the `DEFAULT_SECONDARY_ROLES` to ALL so that when a session is created for the current user, Snowflake sets the current secondary roles to be all roles granted to the user.
   * You also set the default warehouse to `tutorial_warehouse` where the `ingress_user_table` queries are executed.

   ```sqlexample
   ALTER USER SET DEFAULT_SECONDARY_ROLES = ('ALL');
   ALTER USER SET DEFAULT_WAREHOUSE = TUTORIAL_WAREHOUSE;
   ```

   Note the following:

   * In this tutorial, you log in to the public endpoint of the service. The user has `test_role` as the primary role and the `ingress_user_role` as the secondary role. This allows the session to do anything that the `ingress_user_role` allows.
   * The default role and default warehouse only affect the role and warehouse activated when the service establishes a session on behalf of your user. After a caller’s rights session is established you cannot change the role, but you can change the warehouse.

### Use the service and test the caller grants

1. Find the URL of the public endpoint that the service exposes:

   ```sqlexample
   SHOW ENDPOINTS IN SERVICE tutorial_db.data_schema.query_service;
   ```

   The `ingress_url` column in the response provides the URL.

   **Example**

   ```output
   p6bye-myorg-myacct.snowflakecomputing.app
   ```
2. Append `/ui` to the endpoint URL, and paste it in the web browser. This causes the service to execute the `ui()` function
   (see `echo_service.py`).:
   Note that the first time you access the endpoint URL, you will be asked to log in to Snowflake. For this test, use the same user that
   you used to create the service to ensure that the user has the necessary privileges.:
3. Use the same user that you used to create the service. Upon successful login, the service shows the following Web UI.

   Enter the following command in the text box and press enter to see the results.

   ```sqlexample
   SELECT * FROM ingress_user_db.ingress_user_schema.ingress_user_table;
   ```

   For illustration purposes the service code in this tutorial executes the query both as the caller and the service user.

   * **Executes the query on behalf of the caller (ingress user):** In this case, the code uses the user token provided by Snowflake to construct a login token for connecting with Snowflake. Thus, the service uses the caller’s rights. Snowflake executes the query on behalf of the caller. Because the caller is using the `ingress_user_role role` that has the privilege to query the `ingress_user_table` table, the query returns one row in the result:

     ```output
     ['this table is only accessible to ingress_user_role']
     ```
   * **Executes the query on behalf of the service user:** In this case, the code does not use the user token that Snowflake provides in the request when constructing the login token to connect with Snowflake. Thus, Snowflake executes the query on behalf of the service user. Because the service owner uses the default `test_role`, which does not have permission to query the table, you see an error:

     ```output
     Encountered an error when executing query:... SQL compilation error: Database 'INGRESS_USER_DB' does not exist or not authorized.
     ```

## Cleanup

You should remove billable resources that you created. For more information, see Step 5 in
[Tutorial 4](tutorial-4.md).

## Reviewing the service code

This section covers the following topics:

* Examining the tutorial code: Review the code files that implement the query service.

### Examining the tutorial code

The zip file you downloaded in Step 1 includes the following files:

* `Dockerfile`
* `main.py`
* `templates/basic_ui.html`

You also use service specification when creating the service. The following section explains how these code components work together to create the service.

#### main.py file

This Python file contains the code that implements a minimal HTTP server that executes a query in the request and returns query results. The code
provides a web user interface (UI) for submitting echo requests.

```python
from flask import Flask
from flask import request
from flask import render_template
import logging
import os
import sys

from snowflake.snowpark import Session
from snowflake.snowpark.exceptions import *

# Environment variables below will be automatically populated by Snowflake.
SNOWFLAKE_ACCOUNT = os.getenv("SNOWFLAKE_ACCOUNT")
SNOWFLAKE_HOST = os.getenv("SNOWFLAKE_HOST")
SNOWFLAKE_DATABASE = os.getenv("SNOWFLAKE_DATABASE")
SNOWFLAKE_SCHEMA = os.getenv("SNOWFLAKE_SCHEMA")

# Custom environment variables
SNOWFLAKE_USER = os.getenv("SNOWFLAKE_USER")
SNOWFLAKE_PASSWORD = os.getenv("SNOWFLAKE_PASSWORD")
SNOWFLAKE_ROLE = os.getenv("SNOWFLAKE_ROLE")
SNOWFLAKE_WAREHOUSE = os.getenv("SNOWFLAKE_WAREHOUSE")

SERVICE_HOST = os.getenv("SERVER_HOST", "0.0.0.0")
SERVER_PORT = os.getenv("SERVER_PORT", 8080)

def get_logger(logger_name):
    logger = logging.getLogger(logger_name)
    logger.setLevel(logging.DEBUG)
    handler = logging.StreamHandler(sys.stdout)
    handler.setLevel(logging.DEBUG)
    handler.setFormatter(
        logging.Formatter("%(name)s [%(asctime)s] [%(levelname)s] %(message)s")
    )
    logger.addHandler(handler)
    return logger

def get_login_token():
    """
    Read the login token supplied automatically by Snowflake. These tokens
    are short lived and should always be read right before creating any new connection.
    """
    with open("/snowflake/session/token", "r") as f:
        return f.read()

def get_connection_params(ingress_user_token=None):
    """
    Construct Snowflake connection params from environment variables.
    """
    if os.path.exists("/snowflake/session/token"):
        if ingress_user_token:
            logger.info("Creating a session on behalf of user.")
            token = get_login_token() + "." + ingress_user_token
        else:
            logger.info("Creating a session as service user.")
            token = get_login_token()

        return {
            "account": SNOWFLAKE_ACCOUNT,
            "host": SNOWFLAKE_HOST,
            "authenticator": "oauth",
            "token": token,
            "warehouse": SNOWFLAKE_WAREHOUSE,
            "database": SNOWFLAKE_DATABASE,
            "schema": SNOWFLAKE_SCHEMA,
        }
    else:
        return {
            "account": SNOWFLAKE_ACCOUNT,
            "host": SNOWFLAKE_HOST,
            "user": SNOWFLAKE_USER,
            "password": SNOWFLAKE_PASSWORD,
            "role": SNOWFLAKE_ROLE,
            "warehouse": SNOWFLAKE_WAREHOUSE,
            "database": SNOWFLAKE_DATABASE,
            "schema": SNOWFLAKE_SCHEMA,
        }

logger = get_logger("query-service")
app = Flask(__name__)

@app.get("/healthcheck")
def readiness_probe():
    return "I'm ready!"

@app.route("/ui", methods=["GET", "POST"])
def ui():
    """
    Main handler for providing a web UI.
    """
    if request.method == "POST":
        # get ingress user token
        ingress_user = request.headers.get("Sf-Context-Current-User")
        ingress_user_token = request.headers.get("Sf-Context-Current-User-Token")

        if ingress_user:
            logger.info(f"Received a request from user {ingress_user}")

        # getting input in HTML form
        query = request.form.get("query")
        if query:
            logger.info(f"Received a request for query: {query}.")
            query_result_ingress_user = (
                run_query(query, ingress_user_token)
                if ingress_user_token
                else "Token is missing. Can't execute as ingress user."
            )
            query_result_service_user = run_query(query)
            return render_template(
                "basic_ui.html",
                query_input=query,
                query_result_ingress_user=query_result_ingress_user,
                query_result_service_user=query_result_service_user,
            )
    return render_template("basic_ui.html")

@app.route("/query", methods=["GET"])
def query():
    """
    Main handler for providing programmatic access.
    """
    # get ingress user token
    query = request.args.get("query")
    logger.info(f"Received query request: {query}.")
    if query:
        ingress_user = request.headers.get("Sf-Context-Current-User")
        ingress_user_token = request.headers.get("Sf-Context-Current-User-Token")

        if ingress_user:
            logger.info(f"Received a request from user {ingress_user}")

        res = run_query(query, ingress_user_token)
        return str(res)
    return "DONE"

def run_query(query, ingress_user_token=None):
    # start a Snowflake session as the ingress user
    try:
        with Session.builder.configs(
            get_connection_params(ingress_user_token)
        ).create() as session:
            logger.info(
                f"Snowflake connection established (id={session.session_id}). Now executing query: {query}."
            )
            try:
                res = session.sql(query).collect()
                logger.info(f"Query execution done: {query}.")
                return (
                    "[Empty Result]"
                    if len(res) == 0
                    else [", ".join(row) for row in res]
                )
            except Exception as e:
                return "Encountered an error when executing query: " + str(e)
    except Exception as e:
        return "Encountered an error when connecting to Snowflake: " + str(e)

if __name__ == '__main__':
  app.run(host=SERVICE_HOST, port=SERVER_PORT)
```

In the code:

* The `ui` function displays the following web form and handles query requests submitted from the web form.

  This function uses the `@app.route()` decorator to specify that requests for `/ui` are handled by this function:

  ```python
  @app.route("/ui", methods=["GET", "POST"])
  def ui():
  ```

  The query service exposes the `execute` endpoint publicly (see the inline service specification you provided when creating the service), enabling communication with
  the service over the web. When you load the URL of the public endpoint with /ui appended in your browser, the browser sends
  an HTTP GET request for this path, and the server routes the request to this function. The function executes and returns a
  simple HTML form for the user to enter a query.

  After the user enters a query and submits the form, the browser sends an HTTP POST request for this path. Because the service specification includes the `executeAsCaller` capability, Snowflake adds the `Sf-Context-Current-User-Token` header to the incoming request and forwards the request to this same function (see [Connecting to Snowflake using caller’s rights](../../spcs-execute-sql.md)).

  The code executes the `run_query` function twice:

  + As the ingress user. In this case, the login token is concatenation of both OAuth token and ingress user token.

    ```python
    token = get_login_token() + "." + ingress_user_token
    ```
  + As the service user. In this case the login token is only the OAuth token.

    > ```python
    > token = get_login_token()
    > ```
* The `readiness_probe` function uses the `@app.get()` decorator to specify that requests for `/healthcheck`
  are handled by this function:

  ```python
  @app.get("/healthcheck")
  def readiness_probe():
  ```

  This function enables Snowflake to check the readiness of the service. When the container starts, Snowflake wants to confirm
  that the application is working and that the service is ready to serve the requests. Snowflake sends an HTTP GET request with
  this path (as a health probe, readiness probe) to ensure that only healthy containers serve traffic. The function can do
  whatever you want.
* The `get_logger` function helps set up logging.

#### Dockerfile

This file contains all the commands to build an image using Docker.

```bash
ARG BASE_IMAGE=python:3.10-slim-buster
FROM $BASE_IMAGE
COPY main.py ./
COPY templates/ ./templates/
RUN pip install --upgrade pip && pip install flask snowflake-snowpark-python
CMD ["python", "main.py"]
```

The Dockerfile contains instructions to install the Flask library ind the Docker container. The code in `main.py`
relies on the Flask library to handle HTTP requests.

#### /template/basic_ui.html

The query service exposes the `echoendpoint` endpoint publicly (see service specification), enabling communication with the
service over the web. When you load the public endpoint URL with `/ui` appended in your browser, the query service displays
this form.

You can enter a query in the form and submit the form, and the service returns the results in an HTTP response.

```html
<!DOCTYPE html>
<html lang="en">
  <head>
    <title>Welcome to the query service!</title>
  </head>
  <body>
    <h1>Welcome to the query service!</h1>
    <form action="{{ url_for("ui") }}" method="post">
      <label for="query">query:<label><br>
      <input type="text" id="query" name="query" size="50"><br>
    </form>
    <h2>Query:</h2>
    {{ query_input }}
    <h2>Result (executed on behalf of ingress user):</h2>
    {{ query_result_ingress_user }}
    <h2>Result (executed as service user):</h2>
    {{ query_result_service_user }}
  </body>
</html>
```

#### Service specification

Snowflake uses information you provide in this specification to configure and run your service.

```yaml
spec:
  containers:
  - name: main
    image: /tutorial_db/data_schema/tutorial_repository/query_service:latest
    env:
      SERVER_PORT: 8000
    readinessProbe:
      port: 8000
      path: /healthcheck
  endpoints:
  - name: execute
    port: 8000
    public: true
capabilities:
  securityContext:
    executeAsCaller: true
serviceRoles:
- name: ui_usage
  endpoints:
  - execute
```

In the service specification, the `spec`, `capabilities`, and `serviceRoles` are the top-level fields.

* `spec` provides specification details (see [Service specification reference](../../specification-reference.md)). Note that the service exposes one public endpoint (`execute`) that enables ingress access to the service from the public web.
* `capabilities` Specifies the `executeAsCaller` capability. This tells Snowflake that the application intends to use [caller’s rights](../../spcs-execute-sql.md).
* `serviceRoles` specifies one service role (`ui_usage`) and endpoint name (`execute`) to grant the USAGE privilege on.
* The `readinessProbe` field identifies the `port` and `path` that Snowflake can use to send an HTTP GET
  request to the readiness probe to verify that the service is ready to handle traffic.

  The service code (`echo_python.py`) implements the readiness probe as follows:

  ```python
  @app.get("/healthcheck")
  def readiness_probe():
  ```

  Therefore, the specification file includes the `container.readinessProbe` field accordingly.

For more information about service specifications, see [Service specification reference](../../specification-reference.md).

## What’s next?

Now that you’ve completed this tutorial, you can return to [Working with Services](../../working-with-services.md) to explore other topics.

---
title: Tutorial 8: Access the public endpoint programmatically
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/tutorial-8-access-public-endpoint-programmatically.md
section: Snowpark Container Services
---

SPCS

Snowpark Container Services

App Development

# Tutorial 8: Access the public endpoint programmatically

## Introduction

In [Tutorial 1](../tutorial-1.md), you learned how to
[access a public endpoint by using a web browser](../tutorial-1.md).
Using the browser, you sent a request to the public endpoint, which is the ingress endpoint.
This required you to first authenticated with Snowflake, and then
you interacted with the service by using the web UI that the service provides.

In this tutorial, you access the same public endpoint programmatically.
The tutorial shows you three different options
to authenticate when you log into Snowflake: by using a programmatic access token (PAT),
by using a JSON Web Token (JWT), and using a Session Token from the Python Connector

## Prerequisites

1. Start `echo_service` service as described in [Tutorial 1](../tutorial-1.md).
2. To verify that the service is running, execute the [DESCRIBE SERVICE](../../../../sql-reference/sql/desc-service.md) command.

   > ```sqlexample
   > DESC SERVICE echo_service;
   > ```
3. In the `status` column, verify that it shows that the service status as RUNNING.

   If the status is PENDING, it indicates that the service is still starting.
4. To investigate why the service isn’t RUNNING,
   execute the [SHOW SERVICE CONTAINERS IN SERVICE](../../../../sql-reference/sql/show-service-containers-in-service.md) command, and then
   review the `status` of individual containers:

   ```sqlexample
   SHOW SERVICE CONTAINERS IN SERVICE echo_service;
   ```

> **Important:**
>
> Don’t proceed with this tutorial until you have the `echo_service` running.

## Option 1: Send requests to the service endpoint programmatically by using a PAT

This option shows you how to access a service endpoint programmatically by using
curl and Python.
In both cases you use a
[programmatic access token (PAT)](../../../../user-guide/programmatic-access-tokens.md)
for authentication. Snowflake recommends that you use PAT for programmatic access.

### Set up a PAT

This procedure is a continuation of [Tutorial 1](../tutorial-1.md). Use the same user (`testuser`), database (`tutorial_db`),
and schema (`data_schema`) as in Tutorial 1.

1. To create a PAT for the user, run the following command.

   You should review the PAT-related
   [prerequisites](../../../../user-guide/programmatic-access-tokens.md) because if you don’t meet those prerequisites, you can generate a PAT but
   you can’t authenticate by using the PAT.

   ```sqlexample
   ALTER USER ADD PROGRAMMATIC ACCESS TOKEN example_token role_restriction='PUBLIC';
   ```

   This command creates a PAT to sign in to Snowflake as a `testuser` with the role `public`.

   Example output:

   ```output
   +---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   | token_name    | token_secret                                                                                                                                                                                                                    |
   |---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
   | EXAMPLE_TOKEN | exampleiOiIyMDY0Mzc2MDQ1MzIyNDIiLCJhbGciOiJFUzI1NiJ9.eyJwIjoiMzE0OTk4ODUxMzozMTQ5OTg4MTAxIiwiaXNzIjoiU0Y6MTAwMyIsImV4cCI6MTc2NTUwMTY4NH0.tYDChZeiA9rIUR5Oow9ztoNoaAhyEWMaXZdZKAP0ELnuY8gN3_hMsMy4PE9dGIs2JE9CafYjxgCFOOrku4LP4g |
   +---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
   ```
2. Save the `token_secret` value for later use.

   You need this value later to authenticate when you send requests to the public endpoint.
3. To find the ingress URL for the public endpoint that the `echo_service` exposes, run the following command:

   ```sqlexample
   SHOW ENDPOINTS IN SERVICE echo_service;
   ```

   Example output:

   ```output
   +--------------+------+------------+----------+-----------+---------------------------------------------------------------------+
   | name         | port | port_range | protocol | is_public | ingress_url                                                         |
   |--------------+------+------------+----------+-----------+---------------------------------------------------------------------|
   | echoendpoint | 8000 | NULL       | HTTP     | true      | <endpoint-id>-<orgname>-<acctname>.snowflakecomputing.app           |
   +--------------+------+------------+----------+-----------+---------------------------------------------------------------------+
   ```
4. Save the `ingress_url` value for later use.

   You need this value later to send requests to the public endpoint.

Now you are ready to sign in to Snowflake by using the PAT for authentication and send programmatic
requests to the `ingress_url` of the public endpoint of the `echo_service`.

### Send requests to the service endpoint programmatically by using a PAT

In this section, you send programmatic requests to the public endpoint of the `echo_service` using curl and Python.

#### Send request using curl

Save the PAT to an environment variable.

For example, on a Mac or Linux operating system,
you can use the following command:

```bash
$ pat=<pat-token-from-previous-step>
```

Send a request to the public endpoint of the `echo_service`, as shown
in the following example:

```bash
$ curl -v "https://<ingress-URL>/ui" \
      --header "Authorization: Snowflake Token=\"${pat}\""
```

The command sends a GET request to the public endpoint of the `echo_service`, providing the
following information:

* The URL (`https://<ingress-URL>/ui`) of the endpoint. The string `/ui` appended to the
  ingress URL causes the service to execute the `ui()` function.
  For more information, see the `echo_service.py` file.
* The `Authorization` header with PAT token for authentication.

In response, the `echo_service` in this example serves an HTML page, which curl prints to the console.
Without the PAT, the endpoint returns a redirect to the Snowflake sign-in page.

#### Send request using Python

To send a request to the public endpoint of the `echo_service`,
use a PAT for authentication by using Python, as shown in the following
example steps:

1. In the `invokeUsingPat.py` file, save the following code:

> ```python
> import argparse
> import logging
> import sys
> import requests
> logger = logging.getLogger(__name__)
> def main():
>     args = _parse_args()
>     if args.pat is None:
>         logger.error("PAT is required to proceed.")
>         sys.exit(1)
>     logger.info("Using PAT for authentication.")
>     url = args.spcs_url
>     connect_to_spcs(args.pat, url)
> def connect_to_spcs(token, url):
>     headers = {'Authorization': f'Snowflake Token="{token}"'}
>     data = {"input": "test"}
>     logger.info(f"Headers: {headers}")
>     logger.info(f"URL: {url}")
>     response = requests.post(f'{url}', headers=headers, data=data)
>     assert response.status_code == 200, f"Response code is not 200: {response.text}"
>     logger.info("========================================")
>     logger.info("Response succeeded. Details below:")
>     logger.info(response.text)
> def _parse_args():
>     logging.basicConfig(stream=sys.stdout, level=logging.INFO)
>     cli_parser = argparse.ArgumentParser()
>     cli_parser.add_argument('--pat', required=True, help='Personal Access Token (PAT) for the user.')
>     cli_parser.add_argument('--spcs_url', required=True,
>                             help='The SPCS URL to connect programmatically.')
>     args = cli_parser.parse_args()
>     return args
> if __name__ == "__main__":
>     main()
> ```

1. Run the code that you saved by sending the following request:

> ```bash
> $ python ./invokeUsingPat.py \
>   --spcs_url "https://<endpoint-id>-<orgname>-<acctname>.snowflakecomputing.app/ui" \
>   –pat ${pat}
> ```
>
> When the request arrives, the service executes the `ui()` function, which
> renders an HTML form as shown in the following example. For more information,
> see the
> “Reviewing the service code” step of [Tutorial 1](../tutorial-1.md).
>
> ```html
> <!DOCTYPE html>
> <html lang="en">
> <head>
>   <title>Welcome to echo service!</title>
> </head>
>
> <body>
>   <h1>Welcome to echo service!</h1>
>   <form action="/ui" method="post">
>     <label for="input">Input:<label><br>
>     <input type="text" id="input" name="input"><br>
>   </form>
>   <h2>Input:</h2>
>
>   <h2>Output:</h2>
>
> </body>
> ```

## Option 2: Send requests to the service endpoint programmatically by using a JWT

In this option, the Python sample code that you are provided uses [key pair authentication](../../../../user-guide/key-pair-auth.md). By using the key pair
that you provide, the sample code performs the following actions:

1. Generates a JSON Web Token (JWT).
2. Exchanges the JWT with Snowflake for an OAuth token.
3. Uses the OAuth token for authentication when the sample code communicates
   with the `echo_service` public endpoint.

### Set up a JWT

To communicate with the `echo_service` programmatically,
complete the following steps. By using the Python code provided, you send
requests to the public endpoint that the `echo_service` exposes.

1. At the command prompt or in the terminal, create a directory, and then navigate to it.
2. Configure key pair authentication for the user:

   1. Generate a [key pair](../../../../user-guide/key-pair-auth.md):

      1. Generate a private key by running the following command.

         To simplify the steps, you generate an unencrypted private key. You can also use an encrypted private key but it requires that you enter the password.

         ```bash
         openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out rsa_key.p8 -nocrypt
         ```
      2. To generate a public key (`rsa_key.pub`) by referencing the private key that you created, run the following command:

         ```bash
         openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
         ```
   2. In the directory, verify that you generated the private key and public key.
   3. Assign the public key to the user that you are using to test the programmatic access.

      This action lets the user specify the key for authentication.

      ```sqlexample
      ALTER USER <user-name> SET RSA_PUBLIC_KEY='MIIBIjANBgkqh...';
      ```
3. In Python files, save the provided sample code:

   1. In the `generateJWT.py` file, save the following code:

      ```python
      # To run this on the command line, enter:
      #   python3 generateJWT.py --account=<account_identifier> --user=<username> --private_key_file_path=<path_to_private_key_file>

      from cryptography.hazmat.primitives.serialization import load_pem_private_key
      from cryptography.hazmat.primitives.serialization import Encoding
      from cryptography.hazmat.primitives.serialization import PublicFormat
      from cryptography.hazmat.backends import default_backend
      from datetime import timedelta, timezone, datetime
      import argparse
      import base64
      from getpass import getpass
      import hashlib
      import logging
      import sys

      # This class relies on the PyJWT module (https://pypi.org/project/PyJWT/).
      import jwt

      logger = logging.getLogger(__name__)

      try:
          from typing import Text
      except ImportError:
          logger.debug('# Python 3.5.0 and 3.5.1 have incompatible typing modules.', exc_info=True)
          from typing_extensions import Text

      ISSUER = "iss"
      EXPIRE_TIME = "exp"
      ISSUE_TIME = "iat"
      SUBJECT = "sub"

      # If you generated an encrypted private key, implement this method to return
      # the passphrase for decrypting your private key. As an example, this function
      # prompts the user for the passphrase.
      def get_private_key_passphrase():
          return getpass('Passphrase for private key: ')

      class JWTGenerator(object):
          """
          Creates and signs a JWT with the specified private key file, username, and account identifier. The JWTGenerator keeps the
          generated token and only regenerates the token if a specified period of time has passed.
          """
          LIFETIME = timedelta(minutes=59)  # The tokens will have a 59-minute lifetime
          RENEWAL_DELTA = timedelta(minutes=54)  # Tokens will be renewed after 54 minutes
          ALGORITHM = "RS256"  # Tokens will be generated by using RSA with SHA256

          def __init__(self, account: Text, user: Text, private_key_file_path: Text,
                      lifetime: timedelta = LIFETIME, renewal_delay: timedelta = RENEWAL_DELTA):
              """
              __init__ creates an object that generates JWTs for the specified user, account identifier, and private key.
              :param account: Your Snowflake account identifier. See https://docs.snowflake.com/en/user-guide/admin-account-identifier.html. Note that if you are by using the account locator, exclude any region information from the account locator.
              :param user: The Snowflake username.
              :param private_key_file_path: Path to the private key file used for signing the JWTs.
              :param lifetime: The number of minutes (as a timedelta) during which the key will be valid.
              :param renewal_delay: The number of minutes (as a timedelta) from now after which the JWT generator should renew the JWT.
              """

              logger.info(
                  """Creating JWTGenerator with arguments
                  account : %s, user : %s, lifetime : %s, renewal_delay : %s""",
                  account, user, lifetime, renewal_delay)

              # Construct the fully qualified name of the user in uppercase.
              self.account = self.prepare_account_name_for_jwt(account)
              self.user = user.upper()
              self.qualified_username = self.account + "." + self.user

              self.lifetime = lifetime
              self.renewal_delay = renewal_delay
              self.private_key_file_path = private_key_file_path
              self.renew_time = datetime.now(timezone.utc)
              self.token = None

              # Load the private key from the specified file.
              with open(self.private_key_file_path, 'rb') as pem_in:
                  pemlines = pem_in.read()
                  try:
                      # Try to access the private key without a passphrase.
                      self.private_key = load_pem_private_key(pemlines, None, default_backend())
                  except TypeError:
                      # If that fails, provide the passphrase returned from get_private_key_passphrase().
                      self.private_key = load_pem_private_key(pemlines, get_private_key_passphrase().encode(), default_backend())

          def prepare_account_name_for_jwt(self, raw_account: Text) -> Text:
              """
              Prepare the account identifier for use in the JWT.
              For the JWT, the account identifier must not include the subdomain or any region or cloud provider information.
              :param raw_account: The specified account identifier.
              :return: The account identifier in a form that can be used to generate the JWT.
              """
              account = raw_account
              if not '.global' in account:
                  # Handle the general case.
                  idx = account.find('.')
                  if idx > 0:
                      account = account[0:idx]
              else:
                  # Handle the replication case.
                  idx = account.find('-')
                  if idx > 0:
                      account = account[0:idx]
              # Use uppercase for the account identifier.
              return account.upper()

          def get_token(self) -> Text:
              """
              Generates a new JWT. If a JWT has already been generated earlier, return the previously generated token unless the
              specified renewal time has passed.
              :return: the new token
              """
              now = datetime.now(timezone.utc)  # Fetch the current time

              # If the token has expired or doesn't exist, regenerate the token.
              if self.token is None or self.renew_time <= now:
                  logger.info("Generating a new token because the present time (%s) is later than the renewal time (%s)",
                              now, self.renew_time)
                  # Calculate the next time we need to renew the token.
                  self.renew_time = now + self.renewal_delay

                  # Prepare the fields for the payload.
                  # Generate the public key fingerprint for the issuer in the payload.
                  public_key_fp = self.calculate_public_key_fingerprint(self.private_key)

                  # Create our payload
                  payload = {
                      # Set the issuer to the fully qualified username concatenated with the public key fingerprint.
                      ISSUER: self.qualified_username + '.' + public_key_fp,

                      # Set the subject to the fully qualified username.
                      SUBJECT: self.qualified_username,

                      # Set the issue time to now.
                      ISSUE_TIME: now,

                      # Set the expiration time, based on the lifetime specified for this object.
                      EXPIRE_TIME: now + self.lifetime
                  }

                  # Regenerate the actual token
                  token = jwt.encode(payload, key=self.private_key, algorithm=JWTGenerator.ALGORITHM)
                  # If you are by using a version of PyJWT prior to 2.0, jwt.encode returns a byte string instead of a string.
                  # If the token is a byte string, convert it to a string.
                  if isinstance(token, bytes):
                    token = token.decode('utf-8')
                  self.token = token
                  logger.info("Generated a JWT with the following payload: %s", jwt.decode(self.token, key=self.private_key.public_key(), algorithms=[JWTGenerator.ALGORITHM]))

              return self.token

          def calculate_public_key_fingerprint(self, private_key: Text) -> Text:
              """
              Given a private key in PEM format, return the public key fingerprint.
              :param private_key: private key string
              :return: public key fingerprint
              """
              # Get the raw bytes of public key.
              public_key_raw = private_key.public_key().public_bytes(Encoding.DER, PublicFormat.SubjectPublicKeyInfo)

              # Get the sha256 hash of the raw bytes.
              sha256hash = hashlib.sha256()
              sha256hash.update(public_key_raw)

              # Base64-encode the value and prepend the prefix 'SHA256:'.
              public_key_fp = 'SHA256:' + base64.b64encode(sha256hash.digest()).decode('utf-8')
              logger.info("Public key fingerprint is %s", public_key_fp)

              return public_key_fp

      def main():
          logging.basicConfig(stream=sys.stdout, level=logging.INFO)
          cli_parser = argparse.ArgumentParser()
          cli_parser.add_argument('--account', required=True, help='The account identifier (e.g. "myorganization-myaccount" for "myorganization-myaccount.snowflakecomputing.com").')
          cli_parser.add_argument('--user', required=True, help='The user name.')
          cli_parser.add_argument('--private_key_file_path', required=True, help='Path to the private key file used for signing the JWT.')
          cli_parser.add_argument('--lifetime', type=int, default=59, help='The number of minutes that the JWT should be valid for.')
          cli_parser.add_argument('--renewal_delay', type=int, default=54, help='The number of minutes before the JWT generator should produce a new JWT.')
          args = cli_parser.parse_args()

          token = JWTGenerator(args.account, args.user, args.private_key_file_path, timedelta(minutes=args.lifetime), timedelta(minutes=args.renewal_delay)).get_token()
          print('JWT:')
          print(token)

      if __name__ == "__main__":
          main()
      ```
   2. In the `access-via-keypair.py` file, save the following code:

      ```python
      from generateJWT import JWTGenerator
      from datetime import timedelta
      import argparse
      import logging
      import sys
      import requests
      logger = logging.getLogger(__name__)

      def main():
        args = _parse_args()
        token = _get_token(args)
        snowflake_jwt = token_exchange(token,endpoint=args.endpoint, role=args.role,
                        snowflake_account_url=args.snowflake_account_url,
                        snowflake_account=args.account)
        spcs_url=f'https://{args.endpoint}{args.endpoint_path}'
        connect_to_spcs(snowflake_jwt, spcs_url)

      def _get_token(args):
        token = JWTGenerator(args.account, args.user, args.private_key_file_path, timedelta(minutes=args.lifetime),
                  timedelta(minutes=args.renewal_delay)).get_token()
        logger.info("Key Pair JWT: %s" % token)
        return token

      def token_exchange(token, role, endpoint, snowflake_account_url, snowflake_account):
        scope_role = f'session:role:{role}' if role is not None else None
        scope = f'{scope_role} {endpoint}' if scope_role is not None else endpoint
        data = {
          'grant_type': 'urn:ietf:params:oauth:grant-type:jwt-bearer',
          'scope': scope,
          'assertion': token,
        }
        logger.info(data)
        url = f'https://{snowflake_account}.snowflakecomputing.com/oauth/token'
        if snowflake_account_url:
          url =       f'{snowflake_account_url}/oauth/token'
        logger.info("oauth url: %s" %url)
        response = requests.post(url, data=data)
        logger.info("snowflake jwt : %s" % response.text)
        assert 200 == response.status_code, "unable to get snowflake token"
        return response.text

      def connect_to_spcs(token, url):
        # Create a request to the ingress endpoint with authz.
        headers = {'Authorization': f'Snowflake Token="{token}"'}
        response = requests.post(f'{url}', headers=headers)
        logger.info("return code %s" % response.status_code)
        logger.info(response.text)

      def _parse_args():
        logging.basicConfig(stream=sys.stdout, level=logging.INFO)
        cli_parser = argparse.ArgumentParser()
        cli_parser.add_argument('--account', required=True,
                    help='The account identifier (for example, "myorganization-myaccount" for '
                      '"myorganization-myaccount.snowflakecomputing.com").')
        cli_parser.add_argument('--user', required=True, help='The user name.')
        cli_parser.add_argument('--private_key_file_path', required=True,
                    help='Path to the private key file used for signing the JWT.')
        cli_parser.add_argument('--lifetime', type=int, default=59,
                    help='The number of minutes that the JWT should be valid for.')
        cli_parser.add_argument('--renewal_delay', type=int, default=54,
                    help='The number of minutes before the JWT generator should produce a new JWT.')
        cli_parser.add_argument('--role',
                    help='The role we want to use to create and maintain a session for. If a role isn\'t provided, '
                      'use the default role.')
        cli_parser.add_argument('--endpoint', required=True,
                    help='The ingress endpoint of the service')
        cli_parser.add_argument('--endpoint-path', default='/',
                    help='The url path for the ingress endpoint of the service')
        cli_parser.add_argument('--snowflake_account_url', default=None,
                    help='The account url of the account for which we want to log in. Type of '
                      'https://myorganization-myaccount.snowflakecomputing.com')
        args = cli_parser.parse_args()
        return args

      if __name__ == "__main__":
        main()
      ```

### Send requests to the service endpoint programmatically by using a JWT

* To make the ingress call to the `echo_service` public endpoint, execute the
  `access-via-keypair.py` Python code:

  ```none
  python3 access-via-keypair.py \
    --account <account-identifier> \
    --user <user-name> \
    --role TEST_ROLE \
    --private_key_file_path rsa_key.p8 \
    --endpoint <ingress-hostname> \
    --endpoint-path /ui
  ```

> **Important:**
>
> The name specified by the `--role` flag must exactly match the case of
> the role name shown by [SHOW ROLES](../../../../sql-reference/sql/show-roles.md).

For more information about `account-identifier`, see [Account identifiers](../../../../user-guide/admin-account-identifier.md).

### How authentication works when you use a JWT

The code first converts the provided key pair into a JWT token. It then sends
the JWT token to Snowflake to obtain an OAuth token. Finally, the code uses the
OAuth token to connect to Snowflake and access the public endpoint.

Specifically, the code performs the following actions:

1. The code calls the `_get_token(args)` function to generate a JWT from the key pair that you provide.

   The function implementation is shown in the following example:

   ```python
   def _get_token(args):
       token = JWTGenerator(args.account,
                           args.user,
                           args.private_key_file_path,
                           timedelta(minutes=args.lifetime),
                           timedelta(minutes=args.renewal_delay)).get_token()
       logger.info("Key Pair JWT: %s" % token)
       return token
   ```

   `JWTGenerator` is a helper class that is provided to you. The following list
   includes information about the parameters that you provide when you create
   this object:

   * `args.account` and the `args.user` parameters: A JWT has several fields.
     For more information, see [token format](../../../sql-api/authenticating.md). `iss` is one of the JWT’s fields. This field value includes
     the Snowflake account name and a user name. Therefore, you provide these values as parameters.
   * Two `timedelta` parameters provide the following information:

     + `lifetime` specifies the number of minutes during which the key will be valid (60 minutes).
     + `renewal_delay` specifies the number of minutes from now after which the JWT generator should renew the JWT.
2. The code calls the `token_exchange()` function to connect to Snowflake, and
   then exchange the JWT for an OAuth token:

   ```python
   scope_role = f'session:role:{role}' if role is not None else None
   scope = f'{scope_role} {endpoint}' if scope_role is not None else endpoint

   data = {
       'grant_type': 'urn:ietf:params:oauth:grant-type:jwt-bearer',
       'scope': scope,
       'assertion': token,
   }
   ```

   The preceding code constructs JSON text that sets the scope for the OAuth token,
   which is the public endpoint that can be accessed by using the specified role.
   This code then makes a POST request to Snowflake. Snowflake passes the JSON
   text to exchange the JWT
   for an OAuth token (see [Token exchange](../../../../user-guide/oauth-custom.md)), as shown in the
   following example:

   ```python
   url = f'{snowflake_account_url}/oauth/token'
   response = requests.post(url, data=data)
   assert 200 == response.status_code, "unable to get Snowflake token"
   return response.text
   ```
3. To connect to the public endpoint of the `echo_service`, the code then calls `connect_to_spcs()` function.

   It provides the URL (`https://<ingress-URL>/ui`) of the endpoint and the OAuth token for authentication.

   ```python
   headers = {'Authorization': f'Snowflake Token="{token}"'}
   response = requests.post(f'{url}', headers=headers)
   ```

   The `url` is the `spcs_url` that you provided to the program and the `token` is the OAuth token.

   The `echo_service` in this example serves an HTML page, as explained in the preceding section.
   This sample code simply prints the HTML in the response.

## Option 3: Send requests to the service endpoint programmatically by using a session token

This option shows how to access a service endpoint programmatically by using a session token for authentication. You can obtain the session token by using the Python Connector, as shown in the following example.

This code provides an alternative to key-pair authentication; however, there is no guarantee that it will work with future versions of the [Snowflake Connector](../../../python-connector/python-connector.md) for Python. The example first uses the connector to generate a session token that represents your identity, then uses that token to authenticate to the public endpoint.

1. Configure a connection named “test”.

   For more instructions, see [Connecting using the connections.toml file](../../../python-connector/python-connector-connect.md).
2. Save the following Python code to a `spcs-connect.py` file.

   ```python
   import argparse
   import requests
   import snowflake.connector

   parser = argparse.ArgumentParser(prog='myprogram')
   parser.add_argument('target', help="https endpoint or fully qualified service name")
   parser.add_argument('-c', '--config', default="default", help="snowflake connection name")
   parser.add_argument('-p', '--path', default="/", help="url path when service name is provided")
   args = parser.parse_args()

   with snowflake.connector.connect(
           connection_name=args.config,
           session_parameters={ 'PYTHON_CONNECTOR_QUERY_RESULT_FORMAT': 'json' },
   ) as conn:
       target = args.target
       # derive url from target arg
       if target.startswith("https:"):
           url = target
       else: # assume target is service name
           print(f"lookup up endpoint url for service: {target}")
           for (name, port, range, protocol, is_public, hostname) in conn.cursor().execute(
                   f"SHOW ENDPOINTS IN SERVICE {target}"):
               if is_public:
                   url = f"https://{hostname}{args.path}"
                   break

       # Obtain a session token.
       token_data = conn._rest._token_request('ISSUE')
       token = token_data['data']['sessionToken']

       # Request headers
       headers = {'Authorization': f'Snowflake Token="{token}"'}

       # connect
       print(f"connecting to {url} ...")
       response = requests.get(url, headers=headers)
       print(response.text)
   ```
3. Update the Python code by supplying the connection name to
   `snowflake.connector.connect`, similar to the following example:

   ```python
   with snowflake.connector.connect(
           connection_name="test",
   ) as conn:
       target = args.target
   ```
4. Use the following command to run the Python code that first generates
   a session token, and then sends a request to the public endpoint of
   the service programmatically by using the session token:

   ```bash
   python spcs-connect.py "https://<ingress-URL>/ui"
   ```

   Alternatively, if you know the hostname of the public endpoint —
   [SHOW ENDPOINTS](../../../../sql-reference/sql/show-endpoints.md) — you can
   use the following script. For example, if the hostname of the public endpoint is
   `ewapx-testorg-testaccount.snowflakecomputing.app`, you can use the following script:

   ```bash
   python spcs-connect.py -c test -p "https://ewapx-testorg-testaccount.snowflakecomputing.app/"
   ```

   Example output:

   ```html
   <!DOCTYPE html>
   <html lang=“en”>

   <head>
     <title>Welcome to echo service!</title>
   </head>

   <body>
     <h1>Welcome to echo service!</h1>
     <form action="/ui" method="post">
       <label for="input">Input:<label><br>
       <input type="text" id="input" name="input"><br>
     </form>
     <h2>Input:</h2>

     <h2>Output:</h2>

   </body>

   </html>
   ```

## Clean up

For instructions,
see the [Tutorial 1, Clean up step](../tutorial-1.md).

---
title: Tutorial: Run a Snowflake Container Services job as a Snowflake task
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/tutorials/advanced/run-job-as-task.md
section: Snowpark Container Services
---

App Development

# Tutorial: Run a Snowflake Container Services job as a Snowflake task

## Introduction

When you run a Snowpark Container Services [job service](../../working-with-services.md) as a Snowflake task, the integration enables scenarios that leverage the robust containerization and scalability of Snowpark Container Services.

In this tutorial you create a task graph with the following two tasks:

1. A SQL task that creates a table (`sales_number`) and returns the table name as the return value for use by the dependent job service task.

   > **Note:**
   >
   > For simplicity, this SQL job only creates a table. In actual work, you use the table to perform more complex computations, such as model training.
2. A dependent job service task that queries the sales_number table and returns results as JSON.

### Prerequisites

Complete the [Tutorial Common Setup](../common-setup.md) required for Snowpark Container Services tutorials provided in this guide.

## Step 1: Create example job service image

Save the sample code provided for the job service, build an image, and upload it to an image repository you created as part of common setup. This is the job service that you run as a Snowflake task in this tutorial.

### Save the code that is provided for the job service

Save the followng example job service code files to your local machine:

* `main.py`

  ```python
  #!/usr/bin/env python3

  import json
  import logging
  import os
  import sys
  from snowflake.core.task.context import TaskContext
  from snowflake.snowpark import Session

  def get_logger(logger_name):
      """Set up logging for the application."""
      logger = logging.getLogger(logger_name)
      logger.setLevel(logging.DEBUG)
      handler = logging.StreamHandler(sys.stdout)
      handler.setLevel(logging.DEBUG)
      handler.setFormatter(
          logging.Formatter(
              '%(name)s [%(asctime)s] [%(levelname)s] %(message)s'))
      logger.addHandler(handler)
      return logger

  logger = get_logger('job-in-task-graph')

  def run(session: Session):
      """
      Example job that reads the return value of the predecessor task
      and performs a simple query.
      """
      context = TaskContext(session)

      task_name = context.get_current_root_task_name()
      task_uuid = context.get_current_root_task_uuid()
      logger.info(f"Executing in task {task_name} with UUID {task_uuid}")

      # Fetch task graph configuration. It is equivalent to the following SQL
      # SELECT SYSTEM$GET_TASK_GRAPH_CONFIG('top_k')
      task_config = context.get_task_graph_config()
      top_k = task_config.get("top_k")

      # Fetch result of the predecessor task
      table_name = context.get_predecessor_return_value()

      # Select top k rows
      # In a real scenario, your code would use 'table_name' and
      # task graph configs to perform more complex computations
      # such as model training
      result = session.sql(f"""
          SELECT name FROM {table_name}
          ORDER BY total_sales DESC
          LIMIT {top_k}
      """).collect()
      names = [row[0] for row in result]

      # Set the return value for the task
      context.set_return_value(json.dumps(names))

  def get_login_token():
      """
      Read the login token supplied automatically by Snowflake. These tokens
      are short lived and should always be read right before creating any new connection.
      """
      with open("/snowflake/session/token", "r") as f:
          return f.read()

  def get_connection_params():
      """
      Construct Snowflake connection params from environment variables.
      """
      SNOWFLAKE_ACCOUNT = os.getenv("SNOWFLAKE_ACCOUNT")
      SNOWFLAKE_HOST = os.getenv("SNOWFLAKE_HOST")
      SNOWFLAKE_DATABASE = os.getenv("SNOWFLAKE_DATABASE")
      SNOWFLAKE_SCHEMA = os.getenv("SNOWFLAKE_SCHEMA")
      return {
              "account": SNOWFLAKE_ACCOUNT,
              "host": SNOWFLAKE_HOST,
              "database": SNOWFLAKE_DATABASE,
              "schema": SNOWFLAKE_SCHEMA,
              "authenticator": "oauth",
              "token": get_login_token(),
          }

  if __name__ == '__main__':
      with Session.builder.configs(get_connection_params()).create() as session:
          logger.info(f"Snowflake connection established (id={session.session_id})")
          run(session)
          logger.info("Job execution completed")
  ```
* `requirements.txt`

  ```none
  snowflake-snowpark-python>=1.33.0
  snowflake-core
  ```
* `DOCKERFILE`

  ```bash
  FROM python:3.10-slim

  # Set working directory
  WORKDIR /app

  # Copy requirements first for better layer caching
  COPY requirements.txt .

  # Install dependencies
  RUN pip install -U pip && \
      pip install -r requirements.txt

  # Copy application code
  COPY main.py .

  # Make the script executable
  RUN chmod +x main.py

  # Set the entrypoint
  ENTRYPOINT ["python3", "main.py"]
  ```

You should now have a directory with three files.

### Build image and upload to image repository

Build an image for the linux/amd64 platform that Snowpark Container Services supports, and then upload the image to the image
repository in your account. For more information, see [Common Setup](../common-setup.md).

You need the repository URL and the registry hostname before you can build and upload the image. For more information, see
[Registry and Repositories](../../working-with-registry-repository.md).

#### Get information about the repository

To get the repository URL, execute the [SHOW IMAGE REPOSITORIES](../../../../sql-reference/sql/show-image-repositories.md) SQL command:

```bash
SHOW IMAGE REPOSITORIES;
```

* The `repository_url` column in the output provides the URL, as shown in the following example:

  ```output
  <orgname>-<acctname>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository
  ```
* The host name in the repository URL is registry host name. An example is shown:

  ```output
  <orgname>-<acctname>.registry.snowflakecomputing.com
  ```

#### Build image and upload it to the repository

1. Open a terminal window, and then change your directory to the directory that contains the files that you saved.
2. To build a Docker image, execute the following `docker build` command by using the Docker CLI.

The command ends with a period (.) which specifies current working directory as the `PATH` for files to use for building the image.

> ```bash
> docker build --rm --platform linux/amd64 -t <repository_url>/<image_name> .
> ```
>
> * For `image_name`, use `my_task_job_image:latest`.
>
> **Example**
>
> ```bash
> docker build --rm --platform linux/amd64 -t myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_task_job_image:latest .
> ```

1. Upload the image to the repository in your Snowflake account. In order for Docker to upload an image on your behalf to your repository,
   you must first [authenticate Docker with the registry](../../working-with-registry-repository.md).

   1. For Docker to upload an image on your behalf to your repository,
      first [authenticate Docker with the registry](../../working-with-registry-repository.md).

      1. We recommend by using [Snowflake CLI](../../../snowflake-cli/index.md)
         to authenticate your local Docker instance with the image
         registry for your Snowflake account. Make sure that you configured Snowflake CLI to connect to Snowflake. For more information,
         see [Configuring Snowflake CLI and connecting to Snowflake](../../../snowflake-cli/connecting/connect.md).
      2. To authenticate, execute the following Snowflake CLI command:

         ```snowcli
         snow spcs image-registry login
         ```
   2. To upload the image, execute the following command:

      ```bash
      docker push <repository_url>/<image_name>
      ```

      **Example**

      ```bash
      docker push myorg-myacct.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_task_job_image:latest
      ```

## Step 2: Create and execute a task graph

To create a task graph, run the following SQL code:

```sqlexample
use role accountadmin;
grant execute managed task on account to role test_role;

use role test_role;
use database tutorial_db;
use schema data_schema;

CREATE OR REPLACE TASK step1_prepare_data
CONFIG = '{"top_k": 3}'
AS BEGIN
    create or replace table sales_number (date timestamp, id number, name string, total_sales number)
    as select '2026-01-01' as date, 1 as id, 'Alice' as name, 100 as total_sales
    union all
    select '2026-01-01' as date, 2 as id, 'Bob' as name, 200 as total_sales;
    CALL SYSTEM$SET_RETURN_VALUE('sales_number');
END;

CREATE OR REPLACE TASK step2_execute_spcs_job
AFTER step1_prepare_data
AS
    EXECUTE JOB SERVICE
    IN COMPUTE POOL tutorial_compute_pool
    QUERY_WAREHOUSE=TUTORIAL_WAREHOUSE
    FROM SPECIFICATION $$
    spec:
        containers:
        - image: /tutorial_db/data_schema/tutorial_repository/my_task_job_image:latest
          name: main
    $$

;

-- Tasks you created are initially suspended state. So you first resume the tasks and run.
select SYSTEM$TASK_DEPENDENTS_ENABLE ('step1_prepare_data');
ALTER TASK step1_prepare_data RESUME;

-- now run the task graph
EXECUTE TASK step1_prepare_data;
```

In the next step, you can view the task and job details in Snowsight.

### Python code equivalent to the preceding SQL code

The following Snowflake Python code is the equivalent of the preceding SQL code to create the task graph:

```python
# DEFINE VARIABLES
from snowflake.snowpark.context import get_active_session
session = get_active_session()

STAGE_NAME = "TUTORIAL_DB.DATA_SCHEMA.TUTORIAL_STAGE"
WAREHOUSE_NAME = "TUTORIAL_WAREHOUSE"
DATABASE_NAME = "TUTORIAL_DB"
SCHEMA_NAME = "DATA_SCHEMA"
COMPUTE_POOL_NAME = "TUTORIAL_COMPUTE_POOL"
TEST_ROLE = "TEST_ROLE"

session.use_schema(f"{DATABASE_NAME}.{SCHEMA_NAME}")
session.use_role(f"{TEST_ROLE}")

# Create a task (only creates a task definition).
# Then you run the tasks

from datetime import timedelta

from snowflake.core import CreateMode, Root
from snowflake.core.task.context import TaskContext
from snowflake.core.task.dagv1 import DAG, DAGOperation, DAGTask
from snowflake.snowpark import Session

def prepare_dataset_task(session: Session)-> str:
    table_name = f"{DATABASE_NAME}.{SCHEMA_NAME}.daily_agg"
    session.sql(f"""
        create or replace table {table_name} (date timestamp, id number, name string, total_sales number)
        as select '2026-01-01' as date, 1 as id, 'Alice' as name, 100 as total_sales
        union all
        select '2026-01-01' as date, 2 as id, 'Bob' as name, 200 as total_sales
    """).collect()
    return table_name

def execute_spcs_job() -> str:
    return f"""
    EXECUTE JOB SERVICE
    IN COMPUTE POOL {COMPUTE_POOL_NAME}
    QUERY_WAREHOUSE={WAREHOUSE_NAME}
    FROM SPECIFICATION $$
    spec:
        containers:
        - image: /tutorial_db/data_schema/tutorial_repository/my_task_job_image:latest
          name: main
    $$
    """

def create_dag(name: str) -> DAG:
    with DAG(
        name,
        warehouse=WAREHOUSE_NAME,
        schedule=timedelta(minutes=100),
        use_func_return_value=True,
        stage_location=STAGE_NAME,
        packages=["snowflake-snowpark-python"],
        config={
            "top_k": 3,
        },
    ) as dag:
        # Step1 passes a function object to DAGTask and
        # the task, when run, executes the function object.
        step1 = DAGTask("step1", prepare_dataset_task)
        # In contrast, for step2 you execute the execute_spcs_job function
        # that returns a SQL string that is passed  to DAGTask.
        step2 = DAGTask("step2", execute_spcs_job())

        # Build the DAG
        step1 >> step2

    return dag

root = Root(session)
schema = root.databases[DATABASE_NAME].schemas[SCHEMA_NAME]
# *************** start code execution below **********
# Python API call (data_schema)
op = DAGOperation(schema)

# Directed acyclic graph. Create DAG definition. See function above.
# you can use "test_task_graph" to reference the task later.
dag = create_dag("test_task_graph")

# Create graph in Snowflake
op.deploy(dag, mode=CreateMode.or_replace)

# runs the code in both tasks.
op.run(dag)
```

## Step 3: View task and job details in Snowsight

To view the job service details in task history, perform the following steps:

1. Sign in to [Snowsight](../../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, locate the database and schema that contain the tasks that you want to view.
4. For the selected schema, select Tasks.
5. Select a specific task.

   The task details appear, with additional Graph, and Run History tabs.
6. To view the ID of the job service that you executed as part of the task run, select the Run History tab.

To view the task details in job history, perform the following steps:

1. In the navigation menu, select Monitoring » Services & jobs.
2. Select the Jobs tab.
3. Select the job that you want to view.

   The job details page appears. The Overview tab displays the task name if the task ran a job service.
4. To view the task details, select the task name.
5. To view the ID of the job service that you executed as part of the task run, select the Run History tab.

---
title: Use Gateways to route ingress requests to multiple endpoints
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/gateway.md
section: Snowpark Container Services
---

# Use Gateways to route ingress requests to multiple endpoints

If you want to expose multiple service endpoints behind a single host name, you can create a *gateway*
. A gateway has a hostname similar to a public service endpoint. For more information about public service endpoints, see [Configure service ingress](service-network-communications.md).

Gateways route [ingress requests](service-network-communications.md), including [inference requests](../snowflake-ml/inference/real-time-inference-rest-api.md), from outside Snowflake to one or more service endpoints. With Gateways, you can do the following:

* **Traffic split among services:** You can allow multiple services to share the same hostname. Routing is done based on the percentage given for each service. This is useful in the following scenarios:

  + **A/B testing scenario:** You might choose to update a service and deploy it while keeping the original service running. For testing, you might choose to route a certain percentage of incoming ingress requests to the updated service for testing.
  + **High-availability scenario:** You have a highly available service that is deployed across, say, two compute pools, where each compute pool is created in a different [placement group](working-with-compute-pool.md). You might choose to use the gateway to split incoming ingress requests.
* **Stable URL:** Each gateway has a hostname allocated at creation. The hostname doesn’t change for the lifetime of the gateway object. You can alter the gateway object to route to different endpoints or have different percentage configurations. Changes take effect within a minute.

The following list shows the differences between a service endpoint and a gateway:

* **Browser security:** Service endpoint supports CORS configuration (corsSettings) and cloud service provider (CSP) headers for browser-based access through external access integrations. A gateway currently doesn’t support CORS or CSP headers.
* **Caller’s rights:** Service endpoint supports caller’s rights. A gateway currently doesn’t support caller’s rights.
* **Role-based access control (RBAC):** When you use a service endpoint, access is managed by using [service roles](working-with-services.md).
  When you use a gateway, access is managed by granting the USAGE privilege on the gateway object. Users accessing a gateway don’t need service roles for the underlying service endpoints.

Gateway routing respects the relative percentage of the specified healthy endpoints. For more information about a
gateway’s failover behavior, see Gateway failover behavior.

After you’ve reviewed the following sections, you can create and alter a gateway. For information about creating a gateway, see [CREATE GATEWAY](../../sql-reference/sql/create-gateway.md). For information about altering a gateway, see [ALTER GATEWAY](../../sql-reference/sql/alter-gateway.md).

## Access control requirements

The owner role of the gateway must have the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE GATEWAY | Schema | Required to create a gateway. |
| BIND SERVICE ENDPOINT | Account | Required to bind service endpoints to the gateway. |
| USAGE | Database | Required to access the database containing the gateway. |
| USAGE | Schema | Required to access the schema containing the gateway. |
| USAGE | Target endpoints | Required to route traffic to the target endpoints. |
| MODIFY or OWNERSHIP | Gateway | Required to alter the gateway configuration. |
| USAGE, MODIFY, or OWNERSHIP | Gateway | Required to view the gateway specification. |

> **Note:**
>
> When listing gateways, Snowflake only shows gateways that the role has USAGE, MODIFY, or OWNERSHIP privileges on. The role used must also have USAGE privileges on the database and schema containing the gateway.

For gateway CREATE, ALTER, and DROP operations, see [CREATE GATEWAY](../../sql-reference/sql/create-gateway.md),
[ALTER GATEWAY](../../sql-reference/sql/alter-gateway.md), and [DROP GATEWAY](../../sql-reference/sql/drop-gateway.md).

## Configurations

By default, you get a maximum of 5 endpoints per gateway. For additional endpoints, contact support to split traffic into more endpoints.

## Gateway failover behavior

Gateway failover is the process where a gateway automatically redirects traffic from one endpoint (Endpoint A) to
other endpoints when Endpoint A becomes unavailable or non-operational.

> **Note:**
>
> Snowflake does not fail over onto an endpoint with 0% traffic split. The endpoint must have at least 1% traffic
> split.

The relative percentage of the available endpoints is respected.

Failover from one endpoint (Endpoint A) to other endpoints with at least 1% traffic split happens if any of the
following conditions is true:

* The service of Endpoint A is suspended and `auto_resume` is set to false.
* The compute pool of Endpoint A is suspended.
* The service of Endpoint A fails the readiness probe. This is updated once every 40 seconds (cache refresh rate) at
  the longest. At the time of the update, traffic is immediately adjusted with no ramp up period.
* The service of Endpoint A is dropped.
* The gateway owner role loses privilege (USAGE or OWNERSHIP) on Endpoint A.

---
title: Using block storage volumes with services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/block-storage-volume.md
section: Snowpark Container Services
---

# Using block storage volumes with services

Snowflake supports these storage volume types for your containerized applications: Snowflake internal stage, local storage, memory storage volumes, and block storage volumes.

[Preview Feature](../../release-notes/preview-features.md) — Open

Using block storage volumes with job services is a [preview feature](../../release-notes/preview-features.md).

## Specifying block storage in service specification

To create a service (including job service) that uses block storage, you provide the necessary configuration in the service specification as follows:

### Step 1: Define a block storage volume

Specify the `spec.volumes` field to define the block storage volumes to create.

```yaml
spec:
  containers:
  ...
  volumes:
    - name: <name>
      source: block
      size: <size-in-Gi>
      blockConfig:                             # optional
        initialContents:
          fromSnapshot: <snapshot-name>
        iops: <number-of-operations>
        throughput: <MiB-per-second>
        encryption: SNOWFLAKE_SSE | SNOWFLAKE_FULL
```

The following fields are required:

* `name`: Name of the volume.
* `source`: Type of the volume. For block storage volume, the value
  is `block`.
* `size`: Storage capacity of the block storage volume measured in bytes.
  The value must always be an integer, specified using the Gi unit suffix. For example, `5Gi` means `5*1024*1024*1024`
  bytes. The size value ranges for cloud providers:

  + `1Gi` to `65536Gi` for AWS.
  + `1Gi` to `16384Gi` for Azure.
  + `4Gi` to `16384Gi` for Google Cloud.

The following are optional fields:

* `blockConfig.initialContents.fromSnapshot`: Specifies a previously taken snapshot of another volume to initialize the block volume.
  The snapshot name can be a [fully qualified object identifier](../../sql-reference/name-resolution.md), such as
  `TUTORIAL_DB.DATA_SCHEMA.MY_SNAPSHOT`. Also, the snapshot name is resolved relative to the database and the schema of the service.
  For example, if you created your service in `TUTORIAL_DB.DATA_SCHEMA`, then `fromSnapshot: MY_SNAPSHOT` is equivalent to
  `fromSnapshot: TUTORIAL_DB.DATA_SCHEMA.MY_SNAPSHOT`.

  Note the following:

  + The snapshot must be in the CREATED state before it can be used to create a volume or the service creation will fail.
  + The encryption type of the snapshot must match that of the volume being created.

  Use the [DESCRIBE SNAPSHOT](../../sql-reference/sql/desc-snapshot.md) command to get the snapshot’s status and encryption type.
* `blockConfig.iops`: Specifies the supported peak number of input/output operations per second. Note that the data size per operation is capped at 256 KiB.

  + For AWS: The supported range is 3000-80000, with a default of 3000.
  + For Azure: The supported range is 3000-80000, with a default of 3000.
  + For Google Cloud:

    - Google Cloud CPU instances: The supported range is 2000-160000, with the following defaults:

      * 2000 IOPS for a 4 Gi disk size
      * 2500 IOPS for a 5 Gi disk size
      * 3000 IOPS for all other disk sizes
    - Google Cloud GPU instances: Snowflake recommends specifying only throughput. `blockConfig.iops` must be 16 \* `blockConfig.throughput` for GPU instances in Google Cloud.
* `blockConfig.throughput`: Specifies the peak throughput, in MiB/second, to provision for the volume.

  + For AWS: The supported range is 125 - 2000, with a default of 125.
  + For Azure: The supported range is 125 - 1200, with a default of 125.
  + For Google Cloud:

    - Google Cloud CPU instances: The supported range is 140 - 2400, with the default of 140.
    - Google Cloud GPU instances: The supported range is 400 - 1,200,000, with the default of 400, but not less than 0.12 per GB of volume size.
* `blockConfig.encryption`: Specify encryption type of the volume: `SNOWFLAKE_SSE` or `SNOWFLAKE_FULL`. For more information, see Encryption support.

### Step 2: Specify where to mount the volume in the container

After you define a block storage volume by adding the `spec.volumes` field, use the `spec.containers.volumeMounts` field to describe where to mount the volume in your application containers, as shown in the following example:

```yaml
spec:
  containers:
  - name: ...
    image: ...
    volumeMounts:
    - name: <volume-name>
      mountPath: <absolute_directory_path>
```

### Example

* Create a service with a block storage volume with size 10Gi. The volume is mounted at path `/opt/block/path` in the main container.

  ```sqlexample
  CREATE SERVICE my_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
      volumeMounts:
      - name: block-vol
        mountPath: /opt/block/path
      readinessProbe:
        port: 8080
        path: /healthcheck
    endpoints:
    - name: echoendpoint
      port: 8080
      public: true
    volumes:
    - name: block-vol
      source: block
      size: 10Gi
  $$;
  ```
* Create a service with a block storage volume initialized from a snapshot.

  ```sqlexample
  CREATE SERVICE new_service
    IN COMPUTE POOL tutorial_compute_pool
    FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:tutorial
      volumeMounts:
      - name: vol-from-snapshot
        mountPath: /opt/block/path
      readinessProbe:
        port: 8080
        path: /healthcheck
    endpoints:
    - name: echoendpoint
      port: 8080
      public: true
    volumes:
    - name: vol-from-snapshot
      source: block
      size: 50Gi
      blockConfig:
        initialContents:
          fromSnapshot: BACKUP_DB.SNAPSHOTS.MY_SNAPSHOT
  $$
  ```

For an example with step-by-step instructions, see [Tutorial 5: Create a service with a block storage volume mounted](tutorials/advanced/tutorial-5-block-storage.md). This tutorial shows you how to create a service with a block storage volume mounted.

### About IOPS and throughput

If your service IO performance is not meeting your expectations and the service is affected by block volume IO or throughput, you might consider
increasing IOPS or throughput. In the current implementation, any such changes require you to recreate the service.

You can review these [available platform metrics](monitoring-services.md) to identify if your service is bottlenecked on block storage:

> * `container.cpu.usage`
> * `volume.read.iops`
> * `volume.write.iops`
> * `volume.read.throughput`
> * `volume.write.throughput`

Depending on the cloud provider the following considerations apply:

* Configuring iops and throughput for AWS:

  + The maximum IOPS that can be configured is 500 IOPS per GiB of volume size, to a maximum of 80,000 IOPS. For example, the
    maximum IOPS of a 10 GiB volume can be 500 \* 10 = 5000. Accordingly, note that the maximum IOPS of 80,000 can only be configured if your volume is 160 GiB or larger.
  + The maximum throughput that can be configured is 1 MiB/second for every 4 IOPS, to a maximum of 2000 MiBs/second.
    For example, with the default 3000 IOPS you can configure throughput up to 750 MiB/second (3000/4=750).
* Configuring iops and throughput for Azure:

  + After a volume size of 6 GB, the supported number of IOPS increase by 500 for each GB beyond 6 GB (disks-types). The maximum IOPS of a 10GB volume can be 500 \* 4 + 3000 = 5000. Accordingly, note that the maximum IOPS of 80,000 can only be configured if your volume is 160 GiB or larger.
  + After 6 GB, the maximum throughput that can be configured is 0.25 MiB/second for every IOPS, to a maximum of 1200 MiBs/second. For example, with the default 3000 IOPS you can configure throughput up to 750 MiB/second (3000\*0.25=750).
* Configuring iops and throughput for Google Cloud:

  + For CPU instances:

    - IOPS are configurable up to 500 IOPS per Gi of volume size, with a maximum of 160,000 IOPS. For example, a 10 Gi volume can achieve a maximum of 5,000 IOPS (500 IOPS \* 10 Gi). To reach the maximum of 160,000 IOPS, a volume size of 320 Gi or larger is required.
    - A maximum throughput of 2400 MiB/second can be configured, with a rate of 1 MiB/second for every 4 IOPS. For example, 3000 IOPS enables up to 750 MiB/second throughput (3000 / 4 = 750).
  + For GPU instances:

    - IOPS cannot be set independent of throughput; IOPS is calculated as 16 multiplied by the throughput value. Therefore, specifying throughput automatically determines the IOPS. Configuring IOPS is not advised for disks used with GPU instances.
    - You must configure a minimum throughput. It must be at least 400 MiB/s, or 0.12 MiB/s for every GiB of volume size, whichever is higher.
    - The configurable throughput rate is 1600 MiB/s per GiB of volume size, subject to a maximum of 1,200,000 MiB/s. As an example, a 10 GiB volume can achieve a maximum throughput of 16,000 MiB/s (1600 \* 10). Note that the upper limit of 1,200,000 MiB/s is only attainable with volumes of 750 GiB or greater.

### Snapshot on Delete

Any of the following commands result in deletion of block volume associated with the service:

* DROP SERVICE <service-name> FORCE
* ALTER COMPUTE POOL <compute-pool-name> STOP ALL
* ALTER SERVICE <service-name> RESTORE VOLUME <volume-name> FROM SNAPSHOT

The `snapshotOnDelete` option defaults to true for services and false for jobs. When the value is true, Snowflake takes a snapshot of the volume before deletion, to protect you from accidental data loss. You add this option in the service specification as part of the `blockConfig` configuration.

Unlike other snapshots, these snapshots are automatically deleted after a period of time. The snapshot retention period defaults to 7 days and can be configured using the `snapshotDeleteAfter` field.

Snowflake assigns a snapshot name in this format: `SYS_BACKUP_ON_DELETE<string>_<timestamp>`.

## Access control requirements

If you want to use an existing snapshot (`fromSnapshot` is in the specification) to initialize the volume, the service’s owner role must have the USAGE privilege on the snapshot.

The service’s owner role must also have the USAGE privilege on the database and schema that contain the snapshot.

## Managing snapshots

You can take snapshots of your block storage volume and later use the backup as follows:

* Use the snapshot backup to restore an existing block storage volume.
* Use the snapshot backup as seed data to initialize a new block storage volume when creating a new service.

You should ensure all your updates are flushed to the disk before you take the snapshot.

Snowflake provides the following commands to create and manage snapshots:

* [CREATE SNAPSHOT](../../sql-reference/sql/create-snapshot.md)
* [ALTER SNAPSHOT](../../sql-reference/sql/alter-snapshot.md)
* [DESCRIBE SNAPSHOT](../../sql-reference/sql/desc-snapshot.md)
* [SHOW SNAPSHOTS](../../sql-reference/sql/show-snapshots.md)
* [DROP SNAPSHOT](../../sql-reference/sql/drop-snapshot.md)

In addition, to restore a snapshot on an existing
block storage volume, you can execute the
[ALTER SERVICE … RESTORE VOLUME](../../sql-reference/sql/alter-service.md) command. Note that you need to suspend the service before you can restore a snapshot. After restoring a volume, the service is automatically resumed.

## Block storage costs

For more information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

When a block storage volume is used with a job service, Snowflake stops charging block storage costs after the job service is either dropped by the user or cleaned up by Snowflake after completion.

After a snapshot is dropped, you will continue to be billed through the configured [data retention period](../../user-guide/data-time-travel.md). The default data retention period is 1 day.

## Encryption support

Block storage volumes and snapshots support the same two encryption modes that are also used for other Snowflake-managed storage:

* **SNOWFLAKE_SSE:** Server-side encryption only. This is the default configuration for customers who don’t have Tri-Secret-Secure enabled on their Snowflake accounts.

  Snowflake uses the cloud service provider’s (CSP) encryption for block storage volumes and snapshots.
* **SNOWFLAKE_FULL:** On-host and server-side encryption. This is the default configuration for customers who have Tri-Secret-Secure enabled on their Snowflake accounts.

  Data is first encrypted at the client (Snowpark Container Services host) before being sent to a CSP for storage. Each volume is encrypted with a unique volume key. The same key is used for encrypting snapshots that you create from that volume.

  Because Snowflake performs additional encryption of data, there is a performance and resource usage impact associated with using `SNOWFLAKE_FULL` volumes. Snowflake uses the encryption mechanisms provided by the Linux kernel, so the effect should not be significant. Any performance impact is likely workload-specific, so we recommend that you identify service or job bottlenecks, increase volume throughput, or provide a more powerful server.

  Key rotation or rekeying isn’t supported for block storage volumes and snapshots in Snowflake. To change a volume’s encryption key, create a new volume, and copy the data through the snapshot.

  For customers who have Tri-Secret Secure enabled on their accounts, note that when the access to a customer managed key is revoked, the volume data remains available for currently running services using the volume. We recommend that you shut down these services when you revoke access to the customer managed key so that data is not available. Also, after you revoke the key, the services with encrypted volumes cannot start.

Volume snapshots retain the encryption type of their source volume. For example, a snapshot of a `SNOWFLAKE_SSE` volume also uses `SNOWFLAKE_SSE` encryption. When a snapshot is used as the initial content of a volume or with the [ALTER SERVICE … RESTORE VOLUME](../../sql-reference/sql/alter-service.md) command, its encryption type must match the volume’s encryption type. Otherwise, the command fails.

You can require the SNOWFLAKE_FULL encryption type for all Snowpark Container Services
block-storage volumes and snapshots in the account by setting
the [ENABLE_SPCS_BLOCK_STORAGE_SNOWFLAKE_FULL_ENCRYPTION_ENFORCEMENT](../../sql-reference/parameters.md) parameter to TRUE for the account.

After this parameter is enabled, creation of block-storage volumes and snapshot with the SNOWFLAKE_SSE encryption type isn’t permitted.

## Example

For an example, see
[Tutorial](tutorials/advanced/tutorial-5-block-storage.md). The tutorial provides step-by-step instructions to create a service with a block storage volume mounted.

## Guidelines and limitations

The following restrictions apply on services that use block storage volumes:

* General limitations. If you encounter any issues with these limitations, contact your account representative.

  + The maximum number of block storage volumes per service is 3.
  + The maximum number of block storage volumes per Snowflake account is 100.
  + The following table lists the maximum number of block storage volumes that can be mounted per compute pool node depending on the instance type of the node. Snowflake ensures that service instances using block storage volumes are placed in accordance with these limits. This might result in services in the PENDING state waiting for additional resources.

    | Instance family | AWS limit | Azure limit | GCP limit |
    | --- | --- | --- | --- |
    | CPU_X64_XS | 22 | 3 | 14 |
    | CPU_X64_S | 22 | 8 | 14 |
    | CPU_X64_M | 22 | 16 | 14 |
    | CPU_X64_SL | 27 | 31 | 14 |
    | CPU_X64_L | 22 | 32 | 14 |
    | HIGHMEM_X64_S | 22 | 16 | 14 |
    | HIGHMEM_X64_M | 22 | 32 | 14 |
    | HIGHMEM_X64_SL | n/a | 32 | 14 |
    | HIGHMEM_X64_L | 22 | n/a | n/a |
    | GPU_NV_S (AWS only) | 22 | n/a | n/a |
    | GPU_NV_M (AWS only) | 21 | n/a | n/a |
    | GPU_NV_L (AWS only) | 14 | n/a | n/a |
    | GPU_NV_XS (Azure only) | n/a | 8 | n/a |
    | GPU_NV_SM (Azure only) | n/a | 32 | n/a |
    | GPU_NV_2M (Azure only) | n/a | 32 | n/a |
    | GPU_NV_3M (Azure only) | n/a | 16 | n/a |
    | GPU_NV_SL (Azure only) | n/a | 32 | n/a |
    | GPU_GCP_NV_L4_1_24G (Google Cloud only) | n/a | n/a | 14 |
    | GPU_GCP_NV_L4_4_24G (Google Cloud only) | n/a | n/a | 14 |
    | GPU_GCP_NV_A100_8_40G (Google Cloud only) | n/a | n/a | 14 |
  + The maximum number of snapshots allowed per Snowflake account is 100.
* The service using block storage volumes must have the same minimum and maximum number of instances.
* After the service is created, the following apply:

  + You can’t change the number of service instances using the ALTER SERVICE … SET … command when a service is using block storage volumes.
  + You can’t change the `size`, `iops`, `throughput`, or `encryption` fields of block storage volumes.
  + No new block storage volumes can be added, and no existing block storage volumes can be removed.
  + Block storage volumes are preserved if a service is upgraded, or suspended and resumed. When a service is suspended, you continue to pay for the volume because it is preserved. After you upgrade or resume a service, Snowflake attaches each block storage volume to the same service instance ID as before.
  + Block storage volumes are deleted if the service is dropped. To preserve data in the volumes, take snapshots of the volumes. You can use the snapshots later to initialize new volumes.

---
title: Using Snowflake stage volumes with services
source: https://docs.snowflake.com/en/developer-guide/snowpark-container-services/snowflake-stage-volume.md
section: Snowpark Container Services
---

# Using Snowflake stage volumes with services

Snowflake supports [various storage volume types](specification-reference.md) for your application containers, including internal stage, local storage, memory storage, and block storage volumes. This section explains how to configure volumes and volume mounts for internal stages. An *internal stage volume* is a volume configured to use a Snowflake stage as persistent storage.

With stage volumes your service can access an internal stage’s objects as if they are files on your file system, simplifying your service code compared to using a Snowflake driver and [GET](../../sql-reference/sql/get.md) and [PUT](../../sql-reference/sql/put.md) SQL commands to access these objects. Stage volumes can also perform better for scenarios with streaming reads or writes of large data files.

If your file system operations can easily be translated to streaming GET and PUT operations, then Stage volumes will work well for your scenario. If your application needs to rename or move files, modify existing files, or perform file system based locking, then stage volume is not a good fit for your workload.

> **Note:**
>
> There are currently two implementations of stage volumes; a generally available version and a deprecated version. Snowflake recommends that you use the generally available version for new services and that you migrate your existing applications from the deprecated version.

The stage volume implementation streams file contents directly to and from cloud storage, ensuring that you always get the latest contents. Consider the following points when you use a stage volume:

* A stage volume is optimized for large, sequential reads and writes, providing strong performance for these access patterns. For best results, read and write data in large, contiguous chunks.
* Reads always return the latest data, which lets data sharing occur between services.
* Random writes or file appends aren’t supported. Attempting these operations results in an error. Snowflake recommends that you use [block storage volumes](block-storage-volume.md) for these workloads.

## Configure a Snowflake stage as a storage volume in a service specification

To create a service where service containers use a stage volume, you perform two steps to specify the required settings in the service specification:

* Define a stage volume that identifies the Snowflake stage to use as storage volume.
* Specify where to mount the stage volume in your application container.

### Step 1: Define a stage volume

To define a stage volume, add the `spec.volumes` field in the service specification as shown in the following example:

```yaml
spec:
  containers:
    ..
  volumes:
  - name: <name>
    source: stage
    stageConfig:
       name: <stage_name>
       metadataCache: <time_period>
       resources:
         requests:
           memory: <amount-of-memory>
           cpu: <cpu-units>
         limits:
           memory: <amount-of-memory>
           cpu: <cpu-units>
```

The following list defines the fields from the example:

* `name`: Provides the name of the volume.
* `source`: Identifies the type of the volume (stage).
* `stageConfig.name`: Identifies the Snowflake internal stage or folder on a stage to mount; for example `@my_stage`, `@my_stage/folder`, or `@my_db.my_schema.my_stage/folder/nestedfolder`. Double quotes must surround this value.

You can include the following optional fields in `stageConfig`:

* `stageConfig.resources` field: The Snowflake component that provides the mounted stage volume requires CPU and memory resources. Use
  this field to specify these CPU and memory requirements, similar to the resource specifications for your application containers. For
  more information, see [containers.resources field](specification-reference.md) fields. If this field isn’t specified, the following default resource settings apply:

  + `resources.requests.cpu: 0`
  + `resources.requests.memory: 0.5Gi`
  + `resources.limits.cpu: 0.5`
  + `resources.limits.memory: 1Gi`

  For most applications with typical data traffic to stage volumes, you don’t need to set this field, because the default resource settings
  should be sufficient. However, if your application performs a high volume of reads and writes, the default settings might lead to performance
  constraints or read/write failures. For more information,
  see Common guidelines for both implementations of stage volumes.

  To avoid these problems, check the [CPU and memory metrics](monitoring-services.md) for the container (`stage-mount-v2-sidecar-<stage-volume-name>`). Snowflake adds this container to your service that provides the implementation of the stage volume you configured. This lets you to see exactly what resources your stage volume is using and determine if it is constrained by CPU or memory. Use these metrics to update the CPU and memory limits as needed.
* `stageConfig.metadataCache` field: If your application frequently retrieves file metadata or lists files on a Snowflake stage in a
  loop, and you don’t expect the data to change often, you can enable metadata caching for the Snowflake stage storage volume to significantly
  improve performance. The cache stores this metadata for a specified time period, after which it is cleared. If the application then
  tries to access the metadata, Snowflake refreshes the cache. Use the hours, minutes, and seconds units to specify the `metadataCache`. For example `90s`, `5m`, `1h`, `1h30m`, `1h30m45s`. If not specified, there is no caching.

  > **Note:**
  >
  > Configure metadata caching only when the data in your Snowflake stage doesn’t change for service lifetime; for example, a service that has read-only workloads that need to work on a static set of data in the stage. Don’t configure metadata caching for workloads where data in your Snowflake stage is updated while the service is running.

The following `spec.volumes` field defines a Snowflake stage volume. The field includes the optional `stageConfig` fields:

```yaml
spec:
  containers:
    ..
  volumes:
  - name: <name>
    source: stage
    stageConfig:
      name: <stage_name>
      metadataCache: 1h
      resources:
        requests:
          cpu: 0.35
          memory: 0.4Gi
        limits:
          cpu: 2.0
          memory: 1Gi
```

### Step 2: Specify where to mount the stage volume in the container

After you define a Snowflake stage storage volume by adding the `spec.volumes` field, use the `spec.containers.volumeMounts` field to describe where to mount the stage volume in your application containers, as shown in the following example:

```yaml
spec:
  containers:
  - name: ...
    image: ...
    volumeMounts:
    - name: <name>
      mountPath: <absolute_directory_path>
```

The information you provide in this field is consistent across all supported storage volume types and applies to both implementations of stage volumes.

## Example

* Create a service with a stage `mydb.myschema.ai_models_stage` mounted at `/path/to/stage` in the main container.

  ```sqlexample
  CREATE SERVICE my_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
      volumeMounts:
      - name: stage-vol
        mountPath: /path/to/stage
    volumes:
    - name: stage-vol
      source: stage
      stageConfig:
        name: "@mydb.myschema.ai_models_stage"
  $$;
  ```
* Create a service with a stage subpath `mydb.myschema.ai_models_stage/subpath` mounted at `/path/to/stage` in the main container.

  ```sqlexample
  CREATE SERVICE my_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: echo
      image: /tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest
      volumeMounts:
      - name: stage-vol
        mountPath: /path/to/stage
    volumes:
    - name: stage-vol
      source: stage
      stageConfig:
        name: "@mydb.myschema.ai_models_stage/subpath"
        metadataCache: 1h
        resources:
          requests:
            cpu: 0.35
            memory: 0.4Gi
          limits:
            cpu: 2.0
            memory: 1Gi
  $$;
  ```

## Access control requirements

The service’s owner role is the role that is used to create the service. It is also the role the services use when interacting with Snowflake. This owner role determines the permissions granted to application containers for accessing a mounted stage. The owner role must have the READ privilege on the stage.

If the owner role doesn’t have the WRITE privilege on a stage, the mount for that stage is read-only. That is, the containers can only read the files from the stage. The owner role needs the WRITE privilege on a stage for the stage mount to support both read and write.

## About the deprecated implementation

The deprecated stage-volume implementation uses a shared cache for reads and writes. Although this works well for some scenarios, you can’t control whether data is read from the cache or directly from the stage, which might not be suitable for all applications. Additionally, when you use the cache for reads and writes, this can introduce performance overhead.

### Migrating code from the deprecated implementation

The newer implementation replaces the deprecated implementation, with the following behavioral changes:

* The newer stage-volume implementation streams file contents directly to and from cloud storage, ensuring that you always get the latest contents. This provides predictable behavior and significantly faster throughput. The deprecated stage-volume implementation caches chunks of file data, making it difficult to know if you are reading the latest data.
* Random read performance might be lower with the new implementation because of the removal of caching. However, without a local disk cache, consistency across volumes is improved. File changes are written directly to the backing stage when the file is closed, with no local disk buffering.
* Reads always return the latest data, making this configuration better for sharing data between services.
* Random writes or file appends aren’t supported. Attempting these operations results in an error. Snowflake recommends that you use [block storage volumes](block-storage-volume.md) for these workloads.

### Specify a Snowflake stage volume in a service specification (deprecated)

To create a service where service containers use Snowflake stage volume, specify the required settings in the service specification as shown in the following steps:

1. To specify the stage volume, use the `spec.volumes` field as shown in the following example:

   ```yaml
   volumes:
   - name: <name>
     source: <stage_name>
   ```

   The following fields are required:

   * `name`: The name of the volume.
   * `source`: The Snowflake internal stage or folder on the stage to mount; for example `@my_stage`, `@my_stage/folder`. Quotes must surround this value.
2. To describe where to mount the stage volume in your application containers, use the `spec.containers.volumeMounts` field, as shown in the following example:

   ```yaml
   volumeMounts:
   - name: <name>
     mountPath: <absolute_directory_path>
   ```

   The information you provide in this field is consistent across all supported storage volume types and applies to both implementations of stage volumes.

### Example (deprecated)

In the example service specification, the app container mounts an internal stage `@model_stage` by using the deprecated stage volume implementations:

```yaml
spec:
  containers:
  - name: app
    image: <image1-name>
    volumeMounts:
    - name: models-legacy
      mountPath: /opt/model-legacy
  volumes:
  - name: models-legacy
    source: "@model_stage"
```

The `volumeMounts` field specifies where inside the container to mount the stage volume. This specification remains the same for both the stage volume implementations.

## Guidelines when using stage volumes

This section provides you with guidelines to follow when you implement application code in which a container uses a Snowflake stage as storage volume.

### Common guidelines for both implementations of stage volumes

* Stage mount is optimized for sequential reads and writes.
* Stage mount I/O operations might have higher latencies than I/O operations on the container’s file system and block storage volumes. You should always check the status code of I/O operations to ensure they succeeded.
* Stage mounts upload file updates asynchronously. Changes to files on a stage mount are only guaranteed to be persisted to the stage after the file descriptor is successfully closed or flushed. There might be a delay before the changes to files on a stage mount become visible to other containers and Snowflake.
* Each directory in a mounted stage should contain fewer than 100,000 files. Expect `readdir` latency to increase with the number of files in the directory.

### Guidelines when using the deprecated version of the stage volume implementation

* Avoid concurrently writing to multiple files within a stage mount.
* Stage mount isn’t a network file system. Don’t use stage mounts for multi-client coordination.
* Don’t open multiple handles to the same file concurrently. Use opened file handles for either read or write operations, but not a mixture of both. To read from a file after writing to it, close the file and then re-open the file before reading.

### Guidelines when using the generally available stage volume implementation

* Concurrent writes to the same file from multiple stage mounts — same stage volume mounted on different containers — aren’t recommended.
* The absence of a local disk cache improves consistency across mounts. File changes are flushed directly to the backing stage upon closing the file, with no local disk buffering. Reads always return the latest data, making the new stage mount better for sharing data between services.
* Read and write data in large, contiguous chunks for optimal performance. The performance penalty for small reads and writes when compared to the generally available stage volume implementation, can mitigate the performance gains from the new implementation.

## Limitations when using stage volumes

This section describes limitations you should be aware of when you implement application code in which containers use stage volumes. If you encounter any issues with these limits, contact your account representative.

### Common limitations for both implementations of stage volumes

* You can only mount a stage or a subdirectory in a stage; for example, @my_stage, `@my_stage/folder`. You can’t mount a single file in a stage; for example, `@my_stage/folder/file`.
* External stages aren’t supported. Only Snowflake internal stages are supported.
* Stage mounts are not fully POSIX compatible file systems. For example:

  + File and directory renames are not atomic.
  + Hard links are not supported.
* The Linux kernel subsystem inode notify (`inotify`) that monitors changes to file systems doesn’t work on stage mounts.

### Limitations when using the deprecated version of the stage volume implementation

* A maximum of 5 stage volumes is allowed per service. For more information, see [spec.volumes](specification-reference.md).
* A maximum of 8 stage volumes per compute pool node are supported. Snowflake manages the stage
  mount per node limit similar to how it manages memory, CPU, and GPU. Launching a new
  service instance can cause Snowflake to launch new nodes when no existing nodes have the
  capacity to support the requested stage mounts.
* The stage volume capabilities vary depending on the cloud platform for your Snowflake account:

  + Accounts on AWS support internal stages with both SNOWFLAKE_FULL and SNOWFLAKE_SSE stage encryption. For more information, see [Internal stage parameters](../../sql-reference/sql/create-stage.md).
  + Accounts on Azure currently support internal stages with SNOWFLAKE_SSE encryption. When you run [CREATE STAGE](../../sql-reference/sql/create-stage.md), use the ENCRYPTION parameter to specify the encryption type: `CREATE STAGE my_stage ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');`
  + Accounts on Google Cloud aren’t supported.
* Concurrent writes to the same file from multiple stage mounts — that is, the same stage volume mounted on different containers — aren’t supported.

### Limitations when using the generally available version of the stage volume implementation

* Random writes, and file appends aren’t supported.
* Each stage that is mounted requires 512 MB memory per stage. This means that there is
  a limitation on the number of stage volumes that can be used based on instance size. Mounting the
  volume on multiple containers doesn’t increase memory consumption.
* A maximum of 20 stage volumes are allowed per service. For more information, see
  [spec.volumes](specification-reference.md).

## Snowflake REST API

REST API reference for managing Snowflake resources programmatically.

---
title: Authenticating Snowflake REST APIs with Snowflake
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/authentication.md
section: Snowflake REST API
---

# Authenticating Snowflake REST APIs with Snowflake

This topic describes how to authenticate to the server when using the Snowflake REST APIs.

When you send a request, the request must include authentication information using either of the following:

* Using key pair authentication
* Using OAuth
* Using a programmatic access token (PAT)

## Using key pair authentication

When using key pair authentication, you need to complete the following tasks:

1. Set up key pair authentication
2. Generate a JWT token

### Set up key pair authentication

To use key pair authentication, follow these steps:

1. Set up key pair authentication.

   As part of this process, you must:

   1. Generate a public-private key pair. The generated private key should be in a file (e.g. named `rsa_key.p8`).
   2. Assign the public key to your Snowflake user. After you assign the key to the user, run the
      [DESCRIBE USER](../../sql-reference/sql/desc-user.md) command. In the output, the `RSA_PUBLIC_KEY_FP` property should be set to the fingerprint of the public key assigned to the user.

   For instructions on how to generate the key pair and assign a key to a user,
   see [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).
2. Use Snowflake CLI to verify that you can use the generated private key to
   [connect to Snowflake](../snowflake-cli/connecting/configure-connections.md):

   ```snowcli
   $ snow connection test --account <account_identifier> --user <user> --private-key-path <path>/rsa_key.p8
   ```

   If you generated an encrypted private key, Snowflake CLI prompts you for the passphrase that you created when you generated the key.

### Generate a JWT token

To generate a JWT token in your application code, use the following steps:

1. Generate the fingerprint (a SHA-256 hash) of the public key for the user. Prefix the fingerprint with `SHA256:`.

   > For example:
   >
   > > `SHA256:hash`
   >
   > You can also execute the SQL [DESCRIBE USER](../../sql-reference/sql/desc-user.md) command to get the value from
   > the RSA_PUBLIC_KEY_FP property.
2. Generate [a JSON Web Token (JWT)](https://en.wikipedia.org/wiki/JSON_Web_Token) with the following fields in the payload:

   > | Field | Description | Example |
   > | --- | --- | --- |
   > | `iss` | Issuer of the JWT. Set it to the following value:  `account_identifier.user.SHA256:public_key_fingerprint`  where:  * `account_identifier` is your Snowflake [account identifier](../../user-guide/admin-account-identifier.md).  If you are using the [account locator](../../user-guide/admin-account-identifier.md), exclude any region information from   the account locator. * `user` is your Snowflake user name. * `SHA256:public_key_fingerprint` is the fingerprint that you generated in the previous step. **Note:** The `account_identifier` and `user` values must use all uppercase characters. | `MYORGANIZATION-MYACCOUNT.MYUSER.SHA256:public_key_fingerprint` |
   > | `sub` | Subject for the JWT. Set it to the following value:  `account_identifier.user` | `MYORGANIZATION-MYACCOUNT.MYUSER` |
   > | `iat` | Issue time for the JWT in UTC. Set the value to the current time value as either seconds or milliseconds. | `1615370644` (seconds) . `1615370644000` (milliseconds) |
   > | `exp` | Expiration time for the JWT in UTC. You can specify the value as either seconds or milliseconds.    **Note:** The JWT is valid for at most one hour after the token is issued, even if you specify a longer expiration time. | `1615374184` (seconds) . `1615374184000` (milliseconds) |
3. In each API request that you send, set the following headers:

   > * `Authorization: Bearer JWT`
   >
   >   where `JWT` is the token that you generated.
   > * (Optional) `X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT`
   >
   >   If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.
   >
   >   Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:
   >
   >   + `KEYPAIR_JWT` (for key-pair authentication)
   >   + `OAUTH` (for OAuth)
   >   + `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md))

## Using OAuth

To use OAuth, follow these steps:

1. Set up OAuth for authentication.

   See [Introduction to OAuth](../../user-guide/oauth-intro.md) for details on how to set up OAuth and get an OAuth token.
2. Use Snowflake CLI to verify that you can use a generated OAuth token to connect to Snowflake:

   * For Linux and MacOS systems
   > ```bash
   > $ snow connection test --account <account_identifier> --user <user> --authenticator=oauth --token=<oauth_token>
   > ```

   * For Windows systems
   > ```bash
   > $ snow connection test --account <account_identifier> --user <user> --authenticator=oauth --token="<oauth_token>"
   > ```
3. In each API request you send, set the following headers:

   * `Authorization: Bearer oauth_token`

     where `oauth_token` is the generated OAuth token.
   * (Optional) `X-Snowflake-Authorization-Token-Type: OAUTH`

     If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.

     Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:

     + `KEYPAIR_JWT` (for key-pair authentication)
     + `OAUTH` (for OAuth)
     + `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md))

## Using a programmatic access token (PAT)

To authenticate with a programmatic access token, set the following HTTP headers in the request:

* `Authorization: Bearer token_secret`
* `X-Snowflake-Authorization-Token-Type: PROGRAMMATIC_ACCESS_TOKEN` (optional)

For example, if you are using cURL to send a request to a
[Snowflake REST API](snowflake-rest-api.md) endpoint:

```bash
curl --location 'https://myorganization-myaccount.snowflakecomputing.com/api/v2/databases' \
  --header "Authorization: Bearer <token_secret>"
```

If the request fails with a `PAT_INVALID` error, the error might have occurred for one of the following reasons:

* The user associated with the programmatic access token was not found.
* Validation failed.
* The role associated with the programmatic access token was not found.
* The user is not associated with the specified programmatic access token.

For more information, see [Using a programmatic access token to authenticate to an endpoint](../../user-guide/programmatic-access-tokens.md).

---
title: Common setup for Snowflake REST APIs tutorials
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tutorials/common-setup.md
section: Snowflake REST API
---

Snowflake

Getting Started

App Development

Data Engineering

REST API

# Common setup for Snowflake REST APIs tutorials

## Introduction

This topic provides instructions for the common setup required for all Snowflake REST APIs tutorials available in this documentation.

### Overview of the Snowflake REST APIs

Before starting your setup, take a look at the Snowflake REST APIs.

The Snowflake REST APIs supports the following resources through the corresponding APIs. The APIs support CREATE OR ALTER operations for applicable resources.

* Working with accounts

  + [Accounts](../account/account-introduction.md)
  + [Managed accounts](../managed-account/managed-account-introduction.md)
* Working with users, roles, and privileges

  + [Users](../users/users-introduction.md)
  + [Roles](../roles/roles-introduction.md)
  + [Database roles](../database-role/database-role-introduction.md)
  + [Grants](../grants/grants-introduction.md)
* Managing virtual warehouses

  + [Warehouses](../warehouses/warehouses-introduction.md)
* Working with databases and schemas

  + [Databases](../databases/db-introduction.md)
  + [Schemas](../schemas/schemas-introduction.md)
* Managing tables and views

  + [Tables](../tables/tables-introduction.md)
  + [Dynamic tables](../dynamic-tables/dynamic-tables-introduction.md)
  + [Event tables](../event-table/event-table-introduction.md)
  + [Views](../view/view-introduction.md)
* Loading and unloading data

  + [Stages](../stages/stages-introduction.md)
  + [External volumes](../external-volume/external-volume-introduction.md)
  + [Pipes](../pipe/pipe-introduction.md)
* Managing notebooks

  + [Notebooks](../notebook/notebook-introduction.md)
* Working with Snowpark Container Services

  + [Compute Pools](../compute-pools/cp-introduction.md)
  + [Image Repositories](../image-repositories/images-introduction.md)
  + [Services](../services/services-introduction.md)
* Using functions and procedures

  + [Functions](../functions/functions-introduction.md)
  + [User-defined functions](../user-defined-function/user-defined-function-introduction.md)
  + [Procedures](../procedure/procedure-introduction.md)
* Managing security

  + [Network policies](../network-policy/network-policy-introduction.md)
* Managing alerts

  + [Alerts](../alert/alert-introduction.md)
* Leveraging AI/ML

  + [Cortex Inference](../cortex-inference/cortex-inference-introduction.md)
  + [Cortex Search Service](../cortex-search/cortex-search-introduction.md)
* Managing streams and tasks

  + [Streams](../stream/stream-introduction.md)
  + [Tasks](../tasks/tasks-introduction.md)
* Managing integrations

  + [Catalog Integration](../catalog-integration/catalog-integration-introduction.md)
  + [Notification](../notification-integration/notification-integration-introduction.md)

For reference information about the APIs and their endpoints, see [Snowflake REST APIs reference](../reference.md).

> **Tip:**
>
> If you prefer writing Python applications, you can use the Snowflake Python APIs to manage Snowflake objects. For more information, see [Snowflake Python APIs: Managing Snowflake objects with Python](../../snowflake-python-api/snowflake-python-overview.md).

## Import the Snowflake REST APIs collections

This tutorial walks you through the process of importing the Snowflake REST APIs collections from Postman.

1. Download the API collections from the [Git repository](https://github.com/snowflakedb/snowflake-rest-api-specs/tree/main/collections) into a folder.
2. Open the Postman application, and create an account, if necessary.
3. In Postman, open the desired workspace.
4. Select Import.
5. Select folders.
6. In the dialog, select the folder where you extracted the collection, and select Open.
7. Verify that all of the items are selected, and select Import.

   You should see the collections listed in the left panel, as shown:

## Specify the bearer token in Postman

REST requests require a JWT token in the request header to authenticate the request. If you don’t have a JWT token, see [Generate a JWT token](../authentication.md).

In Postman, you can copy the JWT token into the `bearerToken` header property, as shown.

> **Note:**
>
> As mentioned in the tutorial [prerequisites](../tutorials-overview.md), you must define an AUTHENTICATION POLICY. If you receive an error message similar to `{ "code": "390202", "message": "Authentication attempt rejected by the current authentication policy." }`, you can run the following SQL command to define a policy:
>
> ```sqlsyntax
> SHOW AUTHENTICATION POLICIES; alter AUTHENTICATION POLICY <your authentication policy> set AUTHENTICATION_METHODS = ('KEYPAIR', 'PASSWORD', 'OAUTH');
> ```

## Set environment variables in the Postman environment

You can set environment variables in your Postman environment. You can then use these variables in Postman, in the form `{{variable_name}}`.

All endpoint URLs begin with a `baseURL`, which identifies your Snowflake account. The baseURL has the form: `<account_locator>.snowflakecomputing.com`, where `<account_locator>` is your Snowflake account name.

To set the `baseURL` variable, as well as any other variables, in Postman, enable each parameter and set its value, as shown:

For each value you set, you must select Save to save the new value.

## What’s next?

Congratulations! In this tutorial, you learned the fundamentals for managing Snowflake database, schema, and table resources using the Snowflake REST APIs.

### Summary

Along the way, you completed the following steps:

* Import Snowflake REST APIs collections.
* Specify a bearer token in Postman.
* Set environment variables in the Postman environment.

### Next tutorial

You can now proceed to [Tutorial 1: Create and manage databases, schemas, and tables](tutorial-1.md), which shows you how to create and manage Snowflake databases, schemas, and tables.

---
title: Getting started with the Snowflake REST APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/getting-started.md
section: Snowflake REST API
---

# Getting started with the Snowflake REST APIs

This section describes how to access the Snowflake REST APIs using Postman.

## Create a Postman account and import Snowflake REST APIs collections

> **Note:**
>
> These steps are only shown as an example, and following along with the example may require additional rights in third-party data, products, or services that are
> not owned or provided by Snowflake. Please ensure that you have the appropriate rights to third-party data, products, or services before continuing.

To create an account and import the collections:

1. Download the API collections from the [Git repository](https://github.com/snowflakedb/snowflake-rest-api-specs/tree/main/collections) into a folder.
2. Open the Postman application, and create an account, if necessary.
3. In Postman, open the desired workspace.
4. Select Import.
5. Select folders.
6. In the dialog, select the folder where you extracted the collection, and select Open.
7. Verify that all of the items are selected, and select Import.

   You should see the collections listed in the left panel, as shown:

### Specify the `bearerToken` in Postman

REST requests require a JWT token in the request header to authenticate the request. In Postman, you can copy the JWT token into the `bearerToken` header property, as shown.

> **Note:**
>
> If you prefer writing Python applications, you can use the Snowflake Python API to manage Snowflake objects. For more information, see [Snowflake Python APIs: Managing Snowflake objects with Python](../snowflake-python-api/snowflake-python-overview.md).

## Submit a request

To submit a request, you can send a `GET`, `POST`, or `PUT` request to the desired endpoint:

```rest
POST /api/v2/databases/{database}/schemas/{schema}/tasks
(request body)
```

For example, to submit a request to create a task, you would create a POST request similar to the following:

```python
def create_task(task_name, create_mode):
    """
    Create a task given the task name and create mode
    """
    headers = {
    "Content-Type": "application/json",
    "Authorization": "Bearer " + generate_JWT_token(),
    "Accept": "application/json",
    "User-Agent": "myApplicationName/1.0"
    }
    request_body = {
        "name": task_name,
        "warehouse": "myWarehouse",
        "definition": "select 1"
    }
    request_url = "{}/api/v2/databases/{}/schemas/{}/tasks?createMode={}".format(SNOWFLAKE_URL, DATABASE_NAME, SCHEMA_NAME, create_mode)
    response = requests.post(request_url, json=request_body, headers=headers, timeout=60)
    print_response("POST {}".format(request_url), response)
```

The following shows how you can get a list of tasks using `GET /api/v2/databases/database/schemas/schema/tasks` in Postman:

## Handle a response

Each of the Snowflake REST APIs endpoints returns a response as JSON, similar to the following:

```json
{
  [
      {
          "name": "name_example",
          "warehouse": "test_wh",
          "schedule": {
          "schedule_type": "MINUTES_TYPE",
          "minutes": 10
          },
          "comment": "test_comment",
          "config": {
          "output_dir": "/temp/test_directory/",
          "learning_rate": "0.1"
          },
          "definition": "this task does...",
          "predecessors": [
          "task1",
          "task2",
          "task3"
          ],
          "user_task_managed_initial_warehouse_size": "XSMALL",
          "user_task_timeout_ms": 10,
          "suspend_task_after_num_failures": 3,
          "condition": "select 1",
          "allow_overlapping_execution": false,
          "error_integration": "my_notification_int",
          "created_on": "2024-06-18T01:01:01.111111",
          "id": "task_id",
          "owner": "TASK_ADMIN",
          "owner_role_type": "ADMIN",
          "state": "started",
          "last_committed_on": "2024-06-18T01:01:01.111111",
          "last_suspended_on": "2024-06-18T01:01:01.111111",
          "database_name": "TESTDB",
          "schema_name": "TESTSCHEMA"
      }
  ]
}
```

### Handle a long-running request (202 response)

When Snowflake accepts a request that takes longer than 45 seconds to complete, the request returns a 202 response code. The 202 response header includes a `Location` parameter that provides a relative URL similar to the following that you can use to check the status of the ongoing request.

```none
Location: /api/v2/results/5b3ce6ae-d123-4c27-afb3-8a26422d5f321
```

You can create a loop in your code to check the status until the request returns a 200 message. The following pseudo-code sample illustrates a flow you could use:

```output
location = <content of the Location header>

while TRUE {
    sleep for x milliseconds
    response = call GET ( host + location )

    if response is 202
      continue

    if response = 200 {
        <code to extract data from the response header>
        exit
    }
}
```

For full Snowflake REST APIs reference documentation, see [Snowflake Result API reference](/developer-guide/snowflake-rest-api/reference/result.md).

### Handle a large result

In the case of large response, the complete result is divided into multiple pages. The first page of data (page 0) is returned as a response body to the original request. For the remaining pages, clients need to use the URLs in the `Link` header to fetch them.

Sample `Link` header:

```output
Link: </api/v2/results/01b66701-0000-001c-0000-0030000b91521?page=0>; rel="first",</api/v2/results/01b66701-0000-001c-0000-0030000b91521?page=1>; rel="next",</api/v2/results/01b66701-0000-001c-0000-0030000b91521?page=9>; rel="last"
```

The `Link` header in the example contains the first page, next page, and last page’s path. The header could also contain a `rel="prev"` path for previous page in some situations.

---
title: Manage accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/account/account-introduction.md
section: Snowflake REST API
---

# Manage accounts

The Snowflake REST [Account API](/developer-guide/snowflake-rest-api/reference/account.md) provides the following endpoints to access, update, and perform certain actions on Account resources.

Snowflake REST Account API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/accounts` | Lists available accounts. |
| `POST /api/v2/accounts` | Creates an account. |
| `DELETE /api/v2/accounts/name` | Deletes an account. |
| `POST /api/v2/accounts/name:undrop` | Restores a dropped account. |

For reference documentation, see [Snowflake Account API reference](/developer-guide/snowflake-rest-api/reference/account.md).

---
title: Manage alerts
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/alert/alert-introduction.md
section: Snowflake REST API
---

# Manage alerts

The Snowflake REST [Alert API](/developer-guide/snowflake-rest-api/reference/alert.md) provides the following endpoints to access, update, and perform certain actions on Alert resources.

Snowflake REST Alert API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/alerts` | Lists alerts. |
| `POST /api/v2/databases/database/schemas/`.`schema/alerts` | Creates an alert. |
| `GET /api/v2/databases/database/schemas/`.`schema/alerts/name` | Fetches an alert. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/alerts/name` | Deletes an alert. |
| `POST /api/v2/databases/database/schemas/`.`schema/alerts/name:clone` | Creates a new alert by cloning from the specified resource. |
| `POST /api/v2/databases/database/schemas/`.`schema/alerts/name:execute` | Executes an alert. |

For reference documentation, see [Snowflake Alert API reference](/developer-guide/snowflake-rest-api/reference/alert.md).

---
title: Manage API integrations
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/api-integration/api-integration-introduction.md
section: Snowflake REST API
---

# Manage API integrations

The Snowflake REST [API integration API](/developer-guide/snowflake-rest-api/reference/api-integration.md) provides the following endpoints to access, update, and perform certain actions on API integration resources.

Snowflake REST API Integration API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/api-integrations` | Lists API integrations. |
| `POST /api/v2/api-integrations` | Creates an API integration. |
| `PUT /api/v2/api-integrations` | Creates or alters an API integration. |
| `GET /api/v2/api-integrations/name` | Fetches an API integration. |
| `DELETE /api/v2/api-integrations/name` | Deletes an API integration. |

For reference documentation, see [Snowflake API integration API reference](/developer-guide/snowflake-rest-api/reference/api-integration.md).

---
title: Manage artifact repositories
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/artifact-repository/artifact-repository-introduction.md
section: Snowflake REST API
---

# Manage artifact repositories

The Snowflake REST [Artifact Repository API](/developer-guide/snowflake-rest-api/reference/artifact-repository.md) provides the following endpoints to access, update, and perform certain actions on Artifact Repository resources.

Snowflake REST Artifact Repository API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/artifact-repositories` | Lists available artifact repositories. |
| `POST /api/v2/databases/database/schemas/`.`schema/artifact-repositories` | Creates an artifact repository. |
| `GET /api/v2/databases/database/schemas/`.`schema/artifact-repositories/name` | Fetches an artifact repository. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/artifact-repositories/name` | Deletes an artifact repository. |
| `PUT /api/v2/databases/database/schemas/`.`schema/artifact-repositories/name` | Creates or updates an artifact repository. |
| `POST /api/v2/databases/database/schemas/`.`schema/artifact-repositories/name:rename` | Renames an artifact repository. |

For reference documentation, see [Snowflake Artifact Repository API reference](/developer-guide/snowflake-rest-api/reference/artifact-repository.md).

---
title: Manage compute pools
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/compute-pools/cp-introduction.md
section: Snowflake REST API
---

# Manage compute pools

The Snowflake REST [Compute Pool API](/developer-guide/snowflake-rest-api/reference/compute-pool.md) provides the following endpoints to access, update, and perform certain actions on Compute Pool resources.

Snowflake REST Compute Pool API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/compute-pools` | Lists available compute pools. |
| `POST /api/v2/compute-pools` | Creates a compute pool. |
| `GET /api/v2/compute-pools/name` | Fetches a compute pool. |
| `PUT /api/v2/compute-pools/name` | Creates a new, or alters an existing, compute pool. |
| `DELETE /api/v2/compute-pools/name` | Deletes a compute pool. |
| `POST /api/v2/compute-pools/name:resume` | Resumes a suspended compute pool. |
| `POST /api/v2/compute-pools/name:suspend` | Suspends an active compute pool. |
| `POST /api/v2/compute-pools/`.`name:stopallservices` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/compute-pools/`.`name:stop-all-services` | Stops all active services on the compute pool. |
| `GET /api/v2/compute-pools/instance-families` | Lists available compute pool instance families. |

For reference documentation, see [Snowflake Compute Pool API reference](/developer-guide/snowflake-rest-api/reference/compute-pool.md).

---
title: Manage data pipes
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/pipe/pipe-introduction.md
section: Snowflake REST API
---

# Manage data pipes

The Snowflake REST [Pipe API](/developer-guide/snowflake-rest-api/reference/pipe.md) provides the following endpoints to access, update, and perform certain actions on Pipe resources.

Snowflake REST Pipe API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/schema/pipes` | Lists available pipes. |
| `POST /api/v2/databases/database/schemas/schema/pipes` | Creates a pipe. |
| `GET /api/v2/databases/database/schemas/schema/pipes/name` | Fetches a pipe. |
| `DELETE /api/v2/databases/database/schemas/schema/pipes/name` | Deletes a pipe. |
| `POST /api/v2/databases/database/schemas/schema/pipes/name:refresh` | Refreshes a pipe. |

For reference documentation, see [Snowflake Pipe API reference](/developer-guide/snowflake-rest-api/reference/pipe.md).

---
title: Manage database roles
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/database-role/database-role-introduction.md
section: Snowflake REST API
---

# Manage database roles

The Snowflake REST [Database Role API](/developer-guide/snowflake-rest-api/reference/database-role.md) provides the following endpoints to access, update, and perform certain actions on Database Role resources.

Snowflake REST Database Role API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/`.`database-roles` | Lists available database roles. |
| `POST /api/v2/databases/database/`.`database-roles` | Creates a database role. |
| `DELETE /api/v2/databases/database/`.`database-roles/name` | Deletes a database role. |
| `POST /api/v2/databases/database/`.`database-roles/name:clone` | Creates a new database role by cloning from the specified resource. |
| `GET /api/v2/databases/database/database-roles/name/grants` | Lists all grants to the role. |
| `POST /api/v2/databases/database/database-roles/name/grants` | Grants privileges to the specified role. |
| `POST /api/v2/databases/database/database-roles/name/grants:revoke` | Revokes grants from the specified role. |
| `GET /api/v2/databases/database/database-roles/name/future-grants` | Lists all future grants to the specified role. |
| `POST /api/v2/databases/database/database-roles/name/future-grants` | Grants future privileges to the specified role. |
| `POST /api/v2/databases/database/database-roles/name/future-grants:revoke` | Revokes future grants from the specified role |

For reference documentation, see [Snowflake Database Role API reference](/developer-guide/snowflake-rest-api/reference/database-role.md).

---
title: Manage database schemas
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/schemas/schemas-introduction.md
section: Snowflake REST API
---

# Manage database schemas

The Snowflake REST [Schema API](/developer-guide/snowflake-rest-api/reference/schema.md) provides the following endpoints to manage Snowflake schemas:

Snowflake REST Schemas API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas` | Lists the available schemas. |
| `POST /api/v2/databases/database/schemas` | Creates a schema. |
| `POST /api/v2/databases/database/schemas/name:clone` | Clones a schema. |
| `POST /api/v2/databases/database/schemas/name:undrop` | Undrops a schema. |
| `GET /api/v2/databases/database/schemas/name` | Fetches a schema. |
| `PUT /api/v2/databases/database/schemas/name` | Creates a new or alters an existing schema. |
| `DELETE /api/v2/databases/database/schemas/name` | Deletes a schema. |

For reference documentation, see [Snowflake Schema API reference](/developer-guide/snowflake-rest-api/reference/schema.md).

---
title: Manage databases
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/databases/db-introduction.md
section: Snowflake REST API
---

# Manage databases

The Snowflake REST [Database API](/developer-guide/snowflake-rest-api/reference/database.md) provides the following endpoints to manage Snowflake databases:

Snowflake REST Database API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases` | Lists accessible databases. |
| `POST /api/v2/databases` | Creates a database. |
| `POST /api/v2/databases:from-share` | Creates a database from a specified share. |
| `POST /api/v2/databases/name:clone` | Clones an existing database. |
| `GET /api/v2/databases/name` | Fetches a named database. |
| `PUT /api/v2/databases/name` | Creates a new, or alters an existing, database. |
| `DELETE /api/v2/databases/name` | Deletes a named database. |
| `POST /api/v2/databases/name:undrop` | Undrops a named database. |
| `POST /api/v2/databases/name/replication:enable` | Enables database replication. |
| `POST /api/v2/databases/name/replication:disable` | Disables replication for a named database. |
| `POST /api/v2/databases/name/replication:refresh` | Refreshes database replications. |
| `POST /api/v2/databases/name/failover:enable` | Enables failover for a named database. |
| `POST /api/v2/databases/name/failover:disable` | Disables failover for a named database. |
| `POST /api/v2/databases/name/failover:primary` | Sets a named database as the primary database. |

For reference documentation, see [Snowflake Database API reference](/developer-guide/snowflake-rest-api/reference/database.md).

---
title: Manage dynamic tables
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/dynamic-tables/dynamic-tables-introduction.md
section: Snowflake REST API
---

# Manage dynamic tables

The [Dynamic Table API](/developer-guide/snowflake-rest-api/reference/dynamic-table.md) provides the following endpoints to manage Snowflake dynamic tables:

Snowflake Dynamic Table API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/dynamic-tables` | Lists the dynamic tables under the database and schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables` | Creates a dynamic table with standard create modifiers as query parameters. |
| `GET /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name` | Fetches a dynamic table. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name` | Deletes a dynamic table with the given name. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:clone` | Creates a new dynamic table by cloning from the specified resource. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:undrop` | Undrops a dynamic table. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:suspend` | Suspends refreshes on the specified dynamic table. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:resume` | Resumes refreshes on the specified dynamic table. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:refresh` | Specifies that the specified dynamic table should be manually refreshed. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/`.`name:suspend-recluster` | Suspends reclustering of the specified dynamic table. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/`.`name:resume-recluster` | Resumes reclustering of the specified dynamic table. |
| `POST /api/v2/databases/database/schemas/`.`schema/dynamic-tables/name:swap-with` | Swaps with another dynamic table. |

For reference documentation, see [Snowflake Dynamic Table API reference](/developer-guide/snowflake-rest-api/reference/dynamic-table.md).

---
title: Manage event tables
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/event-table/event-table-introduction.md
section: Snowflake REST API
---

# Manage event tables

The Snowflake REST [Event Table API](/developer-guide/snowflake-rest-api/reference/event-table.md) provides the following endpoints to access, update, and perform certain actions on Event Table resources.

Snowflake REST Event Table API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/event-tables` | Lists available event tables. |
| `POST /api/v2/databases/database/schemas/`.`schema/event-tables` | Creates an event table. |
| `GET /api/v2/databases/database/schemas/`.`schema/event-tables/name` | Fetches an event table. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/event-tables/name` | Deletes an event table. |
| `POST /api/v2/databases/database/schemas/`.`schema/event-tables/name:rename` | Renames an event table. |

For reference documentation, see [Snowflake Event Table API reference](/developer-guide/snowflake-rest-api/reference/event-table.md).

---
title: Manage external volumes
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/external-volume/external-volume-introduction.md
section: Snowflake REST API
---

# Manage external volumes

The Snowflake REST [External Volume API](/developer-guide/snowflake-rest-api/reference/external-volume.md) provides the following endpoints to access, update, and perform certain actions on External Volume resources.

Snowflake REST External Volume API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/external-volumes` | Lists available external volumes. |
| `POST /api/v2/external-volumes` | Creates an external volume. |
| `GET /api/v2/external-volumes/name` | Fetches an external volume. |
| `DELETE /api/v2/external-volumes/name` | Deletes an external volume. |
| `POST /api/v2/external-volumes/name:undrop` | Undrops an external volume. |

For reference documentation, see [Snowflake External Volume API reference](/developer-guide/snowflake-rest-api/reference/external-volume.md).

---
title: Manage functions
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/functions/functions-introduction.md
section: Snowflake REST API
---

# Manage functions

The Snowflake REST [Function API](/developer-guide/snowflake-rest-api/reference/function.md) provides the following Snowflake endpoints to manage Snowflake functions:

Snowflake REST Function API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/functions` | Lists the user functions under the database and schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/functions` | Creates a function. |
| `GET /api/v2/databases/database/schemas/`.`schema/functions/nameWithArgs` | Fetches a function using the DESCRIBE COMMAND output. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/functions/nameWithArgs` | Deletes a function with the given name and args. |
| `POST /api/v2/databases/database/schemas/`.`schema/functions/name:execute` | Executes a function. |

For reference documentation, see [Snowflake Function API reference](/developer-guide/snowflake-rest-api/reference/function.md).

---
title: Manage grants
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/grants/grants-introduction.md
section: Snowflake REST API
---

# Manage grants

The SNOWFLAKE REST [Grant API](/developer-guide/snowflake-rest-api/reference/grant.md) provides the following endpoints to manage Snowflake grants:

Snowflake Grant API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/grants/granteeType/granteeName/securableType/securableName/privileges` | Grants privileges listed in the request body. |
| `POST /api/v2/grants/granteeType/granteeName/bulkGrantType/securableTypePlural/scopeType/scopeName/privileges` | Grants privileges listed in the request body to all securables of the specified type in the given scope. |
| `DELETE /api/v2/grants/granteeType/granteeName/securableType/securableName/privileges/privilege` | Revokes privileges listed in the path parameters. |
| `DELETE /api/v2/grants/granteeType/granteeName/securableType/securableName/privileges/privilege/grant-option` | Revokes the grant option for the privileges listed in the path parameters. |
| `DELETE /api/v2/grants/granteeType/granteeName/bulkGrantType/securableTypePlural/scopeType/scopeName/privileges/privilege` | Revokes the privilege listed on the group securable in the specified scope. |
| `DELETE /api/v2/grants/granteeType/granteeName/bulkGrantType/securableTypePlural/scopeType/scopeName/privileges/privilege/grant-option` | Revokes the grant option for the privilege listed on the group securable in the given scope. |
| `GET /api/v2/grants/granteeType/granteeName` | Lists the roles and privileges granted to the specified grantee using the output of SHOW GRANTS TO. |

For reference documentation, see [Snowflake Grant API reference](/developer-guide/snowflake-rest-api/reference/grant.md).

---
title: Manage Iceberg tables
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/iceberg-table/iceberg-table-introduction.md
section: Snowflake REST API
---

# Manage Iceberg tables

The Snowflake REST [Iceberg Table API](/developer-guide/snowflake-rest-api/reference/iceberg-table.md) provides the following Snowflake endpoints to access, update, and perform certain actions on Iceberg Table resource in Snowflake:

Snowflake REST Iceberg Table API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/iceberg-tables` | Lists available iceberg tables. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables` | Creates an iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables:as-select` | Creates an iceberg table using the result of the specified select query. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables:from-aws-glue-catalog` | Creates an iceberg table from an AWS Glue catalog. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables:from-delta` | Creates an iceberg table from a Delta catalog. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables:from-iceberg-files` | Creates an iceberg table from Iceberg files in object storage (external cloud storage). |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables:from-iceberg-rest` | Creates an iceberg table from an Iceberg REST catalog. |
| `GET /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name` | Describes an iceberg table. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name` | Drops an iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:resume-recluster` | Resumes recluster for an iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:suspend-recluster` | Suspends recluster for an iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:refresh` | Refreshes the metadata of an iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:convert-to-managed` | Converts an externally managed iceberg table to a managed table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:undrop` | Restores a previously dropped iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:clone` | Clones an Snowflake managed iceberg table. |
| `POST /api/v2/databases/database/schemas/`.`schema/iceberg-tables/name:create-like` | Creates a new iceberg table like a specified one. |

For reference documentation, see [Snowflake Iceberg Table API reference](/developer-guide/snowflake-rest-api/reference/iceberg-table.md).

---
title: Manage image repositories
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/image-repositories/images-introduction.md
section: Snowflake REST API
---

# Manage image repositories

The Snowflake REST [Image Repository API reference](/developer-guide/snowflake-rest-api/reference/image-repository.md) provides the following Snowflake endpoints to access, update, and perform certain actions on Image Repository resource in Snowflake:

Snowflake REST Image Repositories API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/image-repositories` | Lists available image repositories. |
| `POST /api/v2/databases/database/schemas/`.`schema/image-repositories` | Creates an image repository. |
| `GET /api/v2/databases/database/schemas/`.`schema/image-repositories/name` | Fetches an image repository. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/image-repositories/name` | Deletes an image repository. |
| `GET /api/v2/databases/database/schemas/schema/image-repositories/name/images` | Lists images in the specified repository. |

For reference documentation, see [Snowflake Image Repository API reference](/developer-guide/snowflake-rest-api/reference/image-repository.md).

---
title: Manage network policies
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/network-policy/network-policy-introduction.md
section: Snowflake REST API
---

# Manage network policies

The Snowflake REST [Network Policy API](/developer-guide/snowflake-rest-api/reference/network-policy.md) provides the following endpoints to access, update, and perform certain actions on Network Policy resources.

Snowflake REST Network Policy API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/network-policies` | Lists available network policies. |
| `POST /api/v2/network-policies` | Creates a network policy. |
| `GET /api/v2/network-policies/name` | Fetches a network policy. |
| `DELETE /api/v2/network-policies/name` | Deletes a network policy. |

For reference documentation, see [Snowflake Network Policy API reference](/developer-guide/snowflake-rest-api/reference/network-policy.md).

---
title: Manage network rules
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/network-rule/network-rule-introduction.md
section: Snowflake REST API
---

# Manage network rules

The Snowflake REST [Network Rule API](/developer-guide/snowflake-rest-api/reference/network-rule.md) provides the following endpoints to access, update, and perform certain actions on Network Rule resources.

Snowflake REST Network Rule API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/`.`schemas/schema/network-rules` | Lists network rules. |
| `POST /api/v2/databases/database/`.`schemas/schema/network-rules` | Creates a network rule. |
| `GET /api/v2/databases/database/`.`schemas/schema/network-rules/name` | Fetches a network rule. |
| `DELETE /api/v2/databases/database/`.`schemas/schema/network-rules/name` | Deletes a network rule. |

For reference documentation, see [Snowflake Network Rule API reference](/developer-guide/snowflake-rest-api/reference/network-rule.md).

---
title: Manage notebooks
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/notebook/notebook-introduction.md
section: Snowflake REST API
---

# Manage notebooks

The Snowflake REST [Notebook API](/developer-guide/snowflake-rest-api/reference/notebook.md) provides the following endpoints to access, update, and perform certain actions on Notebook resources.

Snowflake REST Notebook API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/notebooks` | Lists available notebooks. |
| `POST /api/v2/databases/database/schemas/`.`schema/notebooks` | Creates a notebook. |
| `GET /api/v2/databases/database/schemas/`.`schema/notebooks/name` | Fetches a notebook. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/notebooks/name` | Deletes a notebook. |
| `POST /api/v2/databases/database/schemas/`.`schema/notebooks/name:execute` | Execute a notebook.  **Note:** This endpoint only works with a session token. |
| `POST /api/v2/databases/database/schemas/`.`schema/notebooks/name:rename` | Changes the name of a notebook. |
| `POST /api/v2/databases/database/schemas/`.`schema/notebooks/name:add-live-version` | Adds a live version to the notebook |
| `POST /api/v2/databases/database/schemas/`.`schema/notebooks/name:commit` | Commits the live version of the specified notebook to a Git repository. |

For reference documentation, see [Snowflake Notebook API reference](/developer-guide/snowflake-rest-api/reference/notebook.md).

---
title: Manage password policies
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/password-policy/password-policy-introduction.md
section: Snowflake REST API
---

# Manage password policies

The Snowflake REST [Password Policy API](/developer-guide/snowflake-rest-api/reference/password-policy.md) provides the following endpoints to access, update, and perform certain actions on Password Policy resources.

Snowflake REST Password Policy API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/password-policies` | Lists available password policies. |
| `POST /api/v2/databases/database/schemas/`.`schema/password-policies` | Creates a password policy. |
| `GET /api/v2/databases/database/schemas/`.`schema/password-policies/name` | Fetches a password policy. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/password-policies/name` | Deletes a password policy. |
| `POST /api/v2/databases/database/schemas/`.`schema/password-policies/name:rename` | Renames a password policy. |

For reference documentation, see [Snowflake Password Policy API reference](/developer-guide/snowflake-rest-api/reference/password-policy.md).

---
title: Manage procedures
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/procedure/procedure-introduction.md
section: Snowflake REST API
---

# Manage procedures

The Snowflake REST [Procedure API](/developer-guide/snowflake-rest-api/reference/procedure.md) provides the following endpoints to access, update, and perform certain actions on Procedure resources.

Snowflake REST Procedure API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/procedures` | Lists available procedures. |
| `POST /api/v2/databases/database/schemas/`.`schema/procedures` | Creates a procedure. |
| `GET /api/v2/databases/database/schemas/`.`schema/procedures/nameWithArgs` | Fetches a procedure. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/procedures/nameWithArgs` | Deletes a procedure. |
| `POST /api/v2/databases/database/schemas/`.`schema/procedures/nameWithArgs:call` | Calls a procedure. |

For reference documentation, see [Snowflake Procedure API reference](/developer-guide/snowflake-rest-api/reference/procedure.md).

---
title: Manage roles
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/roles/roles-introduction.md
section: Snowflake REST API
---

# Manage roles

The Snowflake REST [Role API](/developer-guide/snowflake-rest-api/reference/role.md) provides the following endpoints to manage Snowflake roles:

Snowflake REST Role API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/roles` | Creates a role according to the specified parameters. |
| `GET /api/v2/roles` | Lists the roles available to the user’s account. |
| `DELETE /api/v2/roles/name` | Deletes the specified role. |
| `GET /api/v2/roles/name/grants` | Lists all grants to the role. |
| `POST /api/v2/roles/name/grants` | Grants privileges to the specified role. |
| `POST /api/v2/roles/name/grants:revoke` | Revokes grants from the specified role. |
| `GET /api/v2/roles/name/grants-of` | Lists all grants of the specified role. |
| `GET /api/v2/roles/name/grants-on` | Lists all grants on the specified role. |
| `GET /api/v2/roles/name/future-grants` | Lists all future grants to the specified role. |
| `POST /api/v2/roles/name/future-grants` | Grants future privileges to the specified role. |
| `POST /api/v2/roles/name/future-grants:revoke` | Revokes future grants from the specified role |

For reference documentation, see [Snowflake Role API reference](/developer-guide/snowflake-rest-api/reference/role.md).

---
title: Manage secrets
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/secret/secret-introduction.md
section: Snowflake REST API
---

# Manage secrets

The Snowflake REST [Secret API](/developer-guide/snowflake-rest-api/reference/secret.md) provides the following endpoints to manage Snowflake secrets:

Snowflake REST Secret API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/secrets` | Lists secrets. |
| `POST /api/v2/databases/database/schemas/`.`schema/secrets` | Creates a secret. |
| `GET /api/v2/databases/database/schemas/`.`schema/secrets/name` | Fetches a secret. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/secrets/name` | Deletes a secret. |

For reference documentation, see [Snowflake Secret API reference](/developer-guide/snowflake-rest-api/reference/secret.md).

---
title: Manage sequences
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/sequence/sequence-introduction.md
section: Snowflake REST API
---

# Manage sequences

The Snowflake REST [Sequence API](/developer-guide/snowflake-rest-api/reference/sequence.md) provides the following endpoints to manage Snowflake secrets:

Snowflake REST Sequence API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/sequences` | Lists sequences. |
| `POST /api/v2/databases/database/schemas/`.`schema/sequences` | Creates a sequence. |
| `GET /api/v2/databases/database/schemas/`.`schema/sequences/name` | Fetches a sequence. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/sequences/name` | Deletes a sequence. |
| `POST /api/v2/databases/database/schemas/`.`schema/sequences/name:clone` | Creates a new sequence by cloning from the specified resource. |
| `POST /api/v2/databases/database/schemas/`.`schema/sequences/name:rename` | Renames a sequence with a new identifier. |

For reference documentation, see [Snowflake Sequence API reference](/developer-guide/snowflake-rest-api/reference/sequence.md).

---
title: Manage Snowflake Container Services
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/services/services-introduction.md
section: Snowflake REST API
---

# Manage Snowflake Container Services

The Snowflake REST [Service API](/developer-guide/snowflake-rest-api/reference/service.md) provides the following endpoints to manage Snowflake services:

Snowflake REST Services API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/services` | Lists available services for the named database and schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/services` | Creates a service. |
| `POST /api/v2/databases/database/schemas/`.`schema/services:execute-job` | Creates and executes a job service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name` | Fetches the named service. |
| `PUT /api/v2/databases/database/schemas/`.`schema/services/name` | Creates a new, or alter an existing, service. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/services/name` | Deletes a named service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/logs` | Fetches the logs for a named service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/status` | Returns the status of a named service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/containers` | Lists all the containers of the specified service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/instances` | Lists all the instances of the specified service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/roles` | Lists all the service roles of the specified service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/service/roles/`.`name/grants-of` | Lists all the grants of the specified service role. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/service/roles/`.`name/grants` | Lists all the grants given to the specified service role. |
| `POST /api/v2/databases/database/schemas/`.`schema/services/name:resume` | Resumes a previously suspended service. |
| `POST /api/v2/databases/database/schemas/`.`schema/services/name:suspend` | Suspends a named service. |
| `GET /api/v2/databases/database/schemas/`.`schema/services/name/endpoints` | Lists the endpoints defined in the specified service. |

For reference documentation, see [Snowflake Service API reference](/developer-guide/snowflake-rest-api/reference/service.md).

---
title: Manage Spark Connect
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/spark-connect/spark-connect-introduction.md
section: Snowflake REST API
---

# Manage Spark Connect

The Snowflake REST [Spark Connect API](/developer-guide/snowflake-rest-api/reference/spark-connect.md) provides the following endpoints to manage Spark Connect:

Snowflake REST Spark Connect API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/spark-connect/execute-plan` | Executes a request that contains the query and returns a stream of [[ExecutePlanResponse]]. |
| `POST /api/v2/spark-connect/analyze-plan` | Analyzes a query and return a [[AnalyzeResponse]] containing metadata about the query. |
| `POST /api/v2/spark-connect/config` | Updates or fetches the configurations and returns a [[ConfigResponse]] containing the result. |
| `POST /api/v2/spark-connect/add-artifacts` | Add artifacts to the session and returns a [[AddArtifactsResponse]] containing metadata about the added artifacts. |
| `POST /api/v2/spark-connect/push-response` | Pushes Spark response to the GS. |
| `POST /api/v2/spark-connect/pull-request` | Pulls Spark request from the GS. |
| `POST /api/v2/spark-connect/release-execute` | Releases a re-attachable execution, or parts thereof. |
| `POST /api/v2/spark-connect/reattach-execute` | Reattaches to an existing re-attachable execution, or parts thereof. |
| `POST /api/v2/spark-connect/interrupt` | Interrupts running executions. |
| `POST /api/v2/spark-connect/artifact-status` | Check statuses of artifacts in the session and returns them in a [[ArtifactStatusesResponse]]. |

For reference documentation, see [Snowflake Spark Connect API reference](/developer-guide/snowflake-rest-api/reference/spark-connect.md).

---
title: Manage stages
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/stages/stages-introduction.md
section: Snowflake REST API
---

# Manage stages

The Snowflake REST [Stage API](/developer-guide/snowflake-rest-api/reference/stage.md) provides the following endpoints to manage Snowflake stages:

Snowflake REST Stage API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/stages` | Lists stages under the database and schema, with show options as query parameters. |
| `POST /api/v2/databases/database/schemas/`.`schema/stages` | Creates a stage with standard create modifiers as query parameters. |
| `GET /api/v2/databases/database/schemas/`.`schema/stages/name` | Fetches a stage using the describe command output. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/stages/name` | Deletes the stage with the specified name. |
| `GET /api/v2/databases/database/schemas/`.`schema/stages/name/files` | Lists the files in the specified stage. |
| `POST /api/v2/databases/database/schemas/`.`schema/stages/name/files/filePath:presigned-url` | Generates a pre-signed URL. |

For reference documentation, see [Snowflake Stage API reference](/developer-guide/snowflake-rest-api/reference/stage.md).

---
title: Manage Streamlit
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/streamlit/streamlit-introduction.md
section: Snowflake REST API
---

# Manage Streamlit

The Snowflake REST [Streamlit API](/developer-guide/snowflake-rest-api/reference/streamlit.md) provides the following endpoints to access, update, and perform certain actions on Streamlit resources.

Snowflake REST Streamlit API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/streamlits` | List Streamlits in a schema. Supports filtering with pattern matching. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits` | Create a new Streamlit application, or replace an existing one. |
| `GET /api/v2/databases/database/schemas/`.`schema/streamlits/name` | Fetch detailed information about a specific Streamlit by name. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/streamlits/name` | Delete a Streamlit. The Streamlit can be restored using undrop within the retention period. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:undrop` | Restore a previously deleted Streamlit within the retention period. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:rename` | Rename a Streamlit to a new name, optionally in a different database or schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:add-live-version` | Add a live version to the Streamlit, making a specific version active for users. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:commit` | Commit the LIVE version of the Streamlit to the Git repository. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:add-version` | Add a new version to the Streamlit by copying files from a specified stage location. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:add-version-from-git` | Add a new version to the Streamlit using a Git reference URI. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:abort` | Abort the live version of the Streamlit, discarding uncommitted changes. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:pull` | Pull the latest changes from the Git repository for a Streamlit with Git integration. |
| `POST /api/v2/databases/database/schemas/`.`schema/streamlits/name:push` | Push committed changes from the Streamlit back to its connected Git repository. |

For reference documentation, see [Snowflake Streamlit API reference](/developer-guide/snowflake-rest-api/reference/streamlit.md).

---
title: Manage streams
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/stream/stream-introduction.md
section: Snowflake REST API
---

# Manage streams

The Snowflake REST [Stream API](/developer-guide/snowflake-rest-api/reference/stream.md) provides the following endpoints to access, update, and perform certain actions on Stream resources.

Snowflake REST Stream API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/streams` | Lists available streams. |
| `POST /api/v2/databases/database/schemas/`.`schema/streams` | Creates a stream. |
| `GET /api/v2/databases/database/schemas/`.`schema/streams/name` | Fetches a stream. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/streams/name` | Deletes a stream. |
| `POST /api/v2/databases/database/schemas/`.`schema/streams/name:clone` | Clones a stream. |

For reference documentation, see [Snowflake Stream API reference](/developer-guide/snowflake-rest-api/reference/stream.md).

---
title: Manage Tables
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tables/tables-introduction.md
section: Snowflake REST API
---

# Manage Tables

The Snowflake REST [Table API](/developer-guide/snowflake-rest-api/reference/table.md) provides the following endpoints to manage Snowflake tables:

Snowflake REST Table API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/tables` | Lists the tables under the database and schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables` | Creates a table. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:as_select` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables:as-select` | Creates a table using the result of the specified select query. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:using_template` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables:using-template` | Creates a table using the templates specified in staged files. |
| `GET /api/v2/databases/database/schemas/`.`schema/tables/name` | Fetches a table. |
| `PUT /api/v2/databases/database/schemas/`.`schema/tables/name` | Creates a new or alters an existing table. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/tables/name` | Deletes a table. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:clone` | Creates a new table by cloning from the specified resource. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:create_like` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:create-like` | Creates a table like a specified one. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:undrop` | Undrops a table. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:suspend_recluster` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:suspend-recluster` | Suspends a table reclustering action. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:resume_recluster` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:resume-recluster` | Resumes a suspended table reclustering action. |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:swapwith` | *Deprecated. Use the replacement endpoint below.* |
| `POST /api/v2/databases/database/schemas/`.`schema/tables/name:swap-with` | Swaps one table with another. |

For reference documentation, see [Snowflake Table API reference](/developer-guide/snowflake-rest-api/reference/table.md).

---
title: Manage tags
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tag/tag-introduction.md
section: Snowflake REST API
---

# Manage tags

The Snowflake REST [Tag API](/developer-guide/snowflake-rest-api/reference/tag.md) provides the following endpoints to access, update, and perform certain actions on Tag resources in a Snowflake database:

Snowflake REST Tag API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/tags` | List tags. |
| `POST /api/v2/databases/database/schemas/`.`schema/tags` | Create a tag. |
| `GET /api/v2/databases/database/schemas/`.`schema/tags/name` | Fetch a tag. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/tags/name` | Delete a tag. |
| `PUT /api/v2/databases/database/schemas/`.`schema/tags/name` | Create or update a tag. |
| `POST /api/v2/databases/database/schemas/`.`schema/tags/name:undrop` | Undrop a tag. |
| `POST /api/v2/databases/database/schemas/`.`schema/tags/name:rename` | Rename a tag with a new identifier. |

For reference documentation, see [Snowflake Tag API reference](/developer-guide/snowflake-rest-api/reference/tag.md).

---
title: Manage tasks
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tasks/tasks-introduction.md
section: Snowflake REST API
---

# Manage tasks

The Snowflake REST [Task API](/developer-guide/snowflake-rest-api/reference/task.md) provides the following endpoints to access, update, and perform certain actions on task resources in a Snowflake database:

Snowflake REST Task API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks` | Lists tasks under the database and schema. |
| `POST /api/v2/databases/database/schemas/`.`schema/tasks` | Creates a task, with standard create modifiers as query parameters. |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name` | Fetches a task. |
| `PUT /api/v2/databases/database/schemas/`.`schema/tasks/name` | Creates (or alters an existing) task. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/tasks/name` | Deletes a task. |
| `POST /api/v2/databases/database/schemas/`.`schema/tasks/name:execute` | Executes a task. |
| `POST /api/v2/databases/database/schemas/`.`schema/tasks/name:resume` | Resumes a suspended task. |
| `POST /api/v2/databases/database/schemas/`.`schema/tasks/name:suspend` | Suspends an active task. |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name/dependents` | Fetches the dependent tasks of a task. |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name/current_graphs` | *Deprecated. Use the replacement endpoint below.* |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name/current-graphs` | Gets the graph runs that are executing or scheduled for the task for the next 8 days. |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name/complete_graphs` | *Deprecated. Use the replacement endpoint below.* |
| `GET /api/v2/databases/database/schemas/`.`schema/tasks/name/complete-graphs` | Gets the graph runs that are completed for the task. |

For reference documentation, see [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

---
title: Manage user-defined functions
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/user-defined-function/user-defined-function-introduction.md
section: Snowflake REST API
---

# Manage user-defined functions

The Snowflake REST [User-Defined Function API](/developer-guide/snowflake-rest-api/reference/user-defined-function.md) provides the following endpoints to access, update, and perform certain actions on User-Defined Function resources.

Snowflake REST User-Defined Function API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/user-defined-functions` | Lists available user-defined functions. |
| `POST /api/v2/databases/database/schemas/`.`schema/user-defined-functions` | Creates a user-defined function. |
| `GET /api/v2/databases/database/schemas/`.`schema/user-defined-functions/nameWithArgs` | Fetches a user-defined function. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/user-defined-functions/nameWithArgs` | Deletes a user-defined function. |
| `POST /api/v2/databases/database/schemas/`.`schema/user-defined-functions/name:execute` | Executes a user-defined function. |
| `POST /api/v2/databases/database/schemas/`.`schema/user-defined-functions/`.`nameWithArgs:rename` | Renames a user-defined function. |

For reference documentation, see [Snowflake User-Defined Function API reference](/developer-guide/snowflake-rest-api/reference/user-defined-function.md).

---
title: Manage users
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/users/users-introduction.md
section: Snowflake REST API
---

# Manage users

The Snowflake REST [User API](/developer-guide/snowflake-rest-api/reference/user.md) provides the following endpoints to manage Snowflake users:

Snowflake REST User API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/users` | Creates a Snowflake user. |
| `GET /api/v2/users` | Lists the users in the system. |
| `GET /api/v2/users/{name}` | Fetches user information using the result of the DESCRIBE command. |
| `DELETE /api/v2/users/{name}` | Deletes a user with the given name. |
| `PUT /api/v2/users/{name}` | Creates a new, or alters an existing, user. |
| `GET /api/v2/users/{name}/grants` | List all grants to the user. |
| `POST /api/v2/users/{name}/grants` | Grants a role to the specified user. |
| `POST /api/v2/users/{name}/grants:revoke` | Revokes grants from the specified user. |

For reference documentation, see [Snowflake User API reference](/developer-guide/snowflake-rest-api/reference/user.md)

---
title: Manage views
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/view/view-introduction.md
section: Snowflake REST API
---

# Manage views

The Snowflake REST [View API](/developer-guide/snowflake-rest-api/reference/view.md) provides the following endpoints to access, update, and perform certain actions on View resources.

Snowflake REST View API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/databases/database/schemas/`.`schema/views` | Lists available views. |
| `POST /api/v2/databases/database/schemas/`.`schema/views` | Creates a view. |
| `GET /api/v2/databases/database/schemas/`.`schema/views/name` | Fetches a view. |
| `DELETE /api/v2/databases/database/schemas/`.`schema/views/name` | Deletes a view. |

For reference documentation, see [Snowflake View API reference](/developer-guide/snowflake-rest-api/reference/view.md).

---
title: Manage warehouses
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/warehouses/warehouses-introduction.md
section: Snowflake REST API
---

# Manage warehouses

The Snowflake REST [Warehouse API](/developer-guide/snowflake-rest-api/reference/warehouse.md) provides the following endpoints for managing Snowflake warehouses:

Snowflake REST Warehouse API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/warehouses` | Creates a new, or replaces an existing, warehouse. |
| `GET /api/v2/warehouses` | Returns a list of available warehouses. |
| `GET /api/v2/warehouses/name` | Describes a named warehouse. |
| `DELETE /api/v2/warehouses/name` | Deletes a named warehouse. |
| `PUT /api/v2/warehouses/name` | Updates the properties of a named warehouse. |
| `POST /api/v2/warehouses/name:resume` | Resumes a currently suspended warehouse. |
| `POST /api/v2/warehouses/name:suspend` | Suspends a named warehouse. |
| `POST /api/v2/warehouses/name:rename` | Renames a named warehouse. |
| `POST /api/v2/warehouses/name:abort` | Aborts all running or queued queries in a named warehouse. |
| `POST /api/v2/warehouses/name:enable` | Enables an adaptive warehouse. |
| `POST /api/v2/warehouses/name:disable` | Disables an adaptive warehouse. |
| `POST /api/v2/warehouses/name:use` | *Deprecated.* |

For reference documentation, see [Snowflake Warehouse API reference](/developer-guide/snowflake-rest-api/reference/warehouse.md).

---
title: Snowflake REST APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/snowflake-rest-api.md
section: Snowflake REST API
---

# Snowflake REST APIs

Snowflake REST APIs for resource management provide a set of endpoints that lets users programmatically interact with and control various resources within the Snowflake Data Cloud.

The Snowflake REST APIs suite of APIs enables developers to build end-to-end automation and integration with Snowflake resources. These REST APIs are compliant with the [OpenAPI specification](https://spec.openapis.org/oas/v3.1.0). Snowflake REST APIs enable developers and partners to use the language of their choice to build integrations with Snowflake using the openAPI specifications.

The Snowflake REST APIs supports the following resources through the corresponding APIs. The APIs support CREATE OR ALTER operations for applicable resources.

* Working with accounts

  + [Accounts](account/account-introduction.md)
  + [Managed accounts](managed-account/managed-account-introduction.md)
* Working with users, roles, and privileges

  + [Users](users/users-introduction.md)
  + [Roles](roles/roles-introduction.md)
  + [Database roles](database-role/database-role-introduction.md)
  + [Grants](grants/grants-introduction.md)
* Managing virtual warehouses

  + [Warehouses](warehouses/warehouses-introduction.md)
* Working with databases and schemas

  + [Databases](databases/db-introduction.md)
  + [Schemas](schemas/schemas-introduction.md)
* Managing tables and views

  + [Tables](tables/tables-introduction.md)
  + [Dynamic tables](dynamic-tables/dynamic-tables-introduction.md)
  + [Event tables](event-table/event-table-introduction.md)
  + [Iceberg tables](iceberg-table/iceberg-table-introduction.md)
  + [Sequences](sequence/sequence-introduction.md)
  + [Views](view/view-introduction.md)
* Loading and unloading data

  + [Stages](stages/stages-introduction.md)
  + [External volumes](external-volume/external-volume-introduction.md)
  + [Pipes](pipe/pipe-introduction.md)
* Managing notebooks and Streamlit apps

  + [Notebooks](notebook/notebook-introduction.md)
  + [Streamlit](streamlit/streamlit-introduction.md)
* Working with Snowpark Container Services

  + [Compute Pools](compute-pools/cp-introduction.md)
  + [Image Repositories](image-repositories/images-introduction.md)
  + [Services](services/services-introduction.md)
* Using functions and procedures

  + [Artifact repositories](artifact-repository/artifact-repository-introduction.md)
  + [Functions](functions/functions-introduction.md)
  + [User-defined functions](user-defined-function/user-defined-function-introduction.md)
  + [Procedures](procedure/procedure-introduction.md)
* Managing security

  + [Network policies](network-policy/network-policy-introduction.md)
  + [Network rules](network-rule/network-rule-introduction.md)
  + [Password policies](password-policy/password-policy-introduction.md)
  + [Secrets](secret/secret-introduction.md)
* Managing alerts

  + [Alerts](alert/alert-introduction.md)
* Leveraging AI/ML

  + [Cortex Embed](cortex-embed/cortex-embed-introduction.md)
  + [Cortex Inference](cortex-inference/cortex-inference-introduction.md)
  + [Cortex Search Service](cortex-search/cortex-search-introduction.md)
* Managing streams and tasks

  + [Streams](stream/stream-introduction.md)
  + [Tasks](tasks/tasks-introduction.md)
* Managing integrations

  + [API integration](api-integration/api-integration-introduction.md)
  + [Use catalog integrations](catalog-integration/catalog-integration-introduction.md)
  + [Use notification integrations](notification-integration/notification-integration-introduction.md)
* Using Spark Connect

  + [Spark Connect](spark-connect/spark-connect-introduction.md)
* Managing tags

  + [Tags](tag/tag-introduction.md)

For reference information about the APIs and their endpoints, see [Snowflake REST APIs reference](reference.md).

You can access the Snowflake REST APIs OpenAPI specifications in the [snowflake-rest-api-specs](https://github.com/snowflakedb/snowflake-rest-api-specs) Git repository.

> **Note:**
>
> The Snowflake REST APIs reference documentation reflects the
> latest version of the Snowflake REST APIs. Note that not all resources in the API currently provide 100% coverage of their
> equivalent [SQL commands](../../sql-reference-commands.md), but the Snowflake REST APIs are under active development and are continuously expanding.

## Requirements

The Snowflake REST APIs has the following requirements:

* You must have a way to submit REST requests, such as the [Postman app](https://www.postman.com/downloads/), [curl](https://curl.se/), or an HTTP client in the programming language of your choice, installed on your machine.

## Suggested tools

* [Postman app](https://www.postman.com/downloads/)
* [curl](https://curl.se/)
* [Snowflake CLI](../snowflake-cli/index.md)
* [SnowSQL](../../user-guide/snowsql.md)

---
title: Specify Snowflake context with Snowflake REST APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/setting-context.md
section: Snowflake REST API
---

# Specify Snowflake context with Snowflake REST APIs

You can specify aspects of Snowflake context when making a request to the Snowflake REST APIs.

Using request headers, you can specify the following in the context of a REST API call:

* The Snowflake role used to authorize the request with the `X-Snowflake-Role` header.
* The Snowflake warehouse used to execute the request with the `X-Snowflake-Warehouse` header.

Instead of relying on a user’s default settings, these headers make each call explicit, isolated, and auditable. You guarantee that
each request uses the correct role and warehouse without needing extra API calls to set context.

By specifying context when making REST API requests, you can accomplish the following tasks:

* Run stateless calls.

  Guarantee that a call uses a specific role without needing a separate API call first to set the session context.
* Avoid mutating users.

  Safely switch roles per-request instead of running ALTER USER … SET DEFAULT_ROLE=…, which is slow and affects all other sessions for
  that user.
* Enable on-demand compute.

  Allow users or service accounts without a default warehouse to run queries or create procedures by simply providing the
  `X-Snowflake-Warehouse` header.
* Simplify user management.

  Use one service user granted multiple roles—for example, READER and WRITER. Your application then sends the `X-Snowflake-Role`
  header to pick the right permission for the right task. In this way, you can avoid managing multiple single-role users.

## Precedence

When a header is provided, it takes precedence over a user’s default settings, in the following order:

1. Headers (if provided) are used.
2. Otherwise, the session’s default role or default warehouse is used.
3. If neither is available where required, the call fails.

## Specify the role to use when authorizing the request

You can specify the role to use when authorizing the request by using the `X-Snowflake-Role` header.

### Requirements

* The role you specify must exist, be granted to the user, and be allowed by the authentication method in use.
* If you use a [programmatic access token (PAT)](authentication.md), the requested role must be within the PAT’s
  `ROLE_RESTRICTION`. If you specify a more privileged role than the PAT allows, the request will fail even if the user was granted
  the specified role.

### Example

The following example creates a database by using the ACCOUNTADMIN role for authorization regardless of the user’s default role.

You can specify the `X-Snowflake-Role` header’s value either in double quotes or without quotes.

```bash
curl -X POST "$API_BASE/database/databases" \
  -H "Authorization: Bearer $TOKEN" \
  -H "Content-Type: application/json" \
  -H "X-Snowflake-Role: ACCOUNTADMIN" \
  -d '{"name": "HDR_DEMO_DB", "comment": "Created via REST with role header"}'
```

## Specify the warehouse on which a statement should execute

You can specify the warehouse to use when executing statements by using the `X-Snowflake-Warehouse` header. Such statements include
those executing procedures, creating Python functions, and executing queries that need compute resources.

### Requirements

* The role in effect must have the USAGE privilege on the warehouse.
* If no default warehouse is set and this header is omitted, warehouse-dependent calls will fail.

### Example

The following example creates a Python procedure that uses the `BUILD_WH` warehouse. The specified role must have the USAGE privilege
on the warehouse. The `PYTHON_WH_TEST` procedure created returns the active warehouse name.

You can specify the `X-Snowflake-Warehouse` header’s value either in double quotes or without quotes.

```bash
curl -X POST "$API_BASE/procedure/databases/TEST_DB/schemas/TEST_SCHEMA/procedures" \
  -H "Authorization: Bearer $TOKEN" \
  -H "Content-Type: application/json" \
  -H "X-Snowflake-Role: ACCOUNTADMIN" \
  -H "X-Snowflake-Warehouse: BUILD_WH" \
  -d '{
        "name": "PYTHON_WH_TEST",
        "arguments": [],
        "return_type": {"datatype": "VARIANT", "nullable": true},
        "language_config": {
          "python_function": {
            "handler":"main",
            "runtime_version":"3.11",
            "packages":["snowflake-snowpark-python"]
          }
        },
        "body": "def main(session):\n    return {\"warehouse\": session.get_current_warehouse()}"
      }'
```

---
title: Tutorial 1: Create and manage databases, schemas, and tables
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tutorials/tutorial-1.md
section: Snowflake REST API
---

Snowflake

Getting Started

App Development

Data Engineering

REST API

# Tutorial 1: Create and manage databases, schemas, and tables

## Introduction

In this tutorial, you learn how to submit REST queries to create and manage databases, tables, and schemas.

### Prerequisites

> **Note:**
>
> If you have already completed the steps in [Common setup for Snowflake REST APIs tutorials](common-setup.md), you can skip these prerequisites and proceed to the first step of this
> tutorial.

Before you start this tutorial, you must complete the [common setup](common-setup.md) instructions, which includes the following steps:

> * Import the Snowflake REST APIs Postman collections.
> * Authenticate your connection by setting the bearer token in Postman.

After completing these prerequisites, you are ready to start using the API.

## Create a database and list available databases

You can use Postman to create a database and list available databases.

* To create a database, send a `POST` request with the following request body to the `/api/v2/databases` endpoint, as shown.

  ```json
  {
    "name": "demo_db",
    "kind": "PERMANENT",
    "comment": "snowflake rest api demo-db",
    "data_retention_time_in_days": "1",
    "max_data_extension_time_in_days": "1"
  }
  ```
* To list available databases, send a `GET` request to the `/api/v2/databases` endpoint, as shown in the following examples:

  + To find databases whose name contains the string, `demo`, specify `%25demo%25` in the like query parameter.
  + To return the first database whose name starts with the string, `DEMO_DB`, specify `DEMO_DB` and `1` in the startsWith and showLimit query parameters, respectively.

For more information, see the [Snowflake Database API reference](/developer-guide/snowflake-rest-api/reference/database.md).

## Create a schema and list available schemas

You can use Postman to create a schema and list available schemas.

* To create a schema, send a `POST` request to the `/api/v2/databases/{database}/schemas` endpoint, as follows:

  > 1. Add the database name (`demo_db`) to the database path variable in the request header.
  > 2. Add the schema name (`demo_sc`) to the request body.
  >
  >    ```json
  >    {
  >      "name": "demo_sc",
  >    }
  >    ```
* To list available schemas, send a `GET` request to the `/api/v2/databases/{database}/schemas` endpoint. In this example, you return the first schema whose name starts with the string, `DEMO_SC`, by specifying `DEMO_SC` and `1` in the startsWith and showLimit query parameters, respectively.

For more information, see the [Snowflake Schema API reference](/developer-guide/snowflake-rest-api/reference/schema.md).

## Create a table and fetch the table details

You can use Postman to create a table and list available tables.

* To create a table, send a `POST` request to the `/api/v2/databases/{database}/schemas/{schema}/tables` endpoint, as follows:

  > 1. Add the database name (`demo_db`) and the schema name (`demo_sc`) in the database and database path variables, respectively, in the request header.
  > 2. Add the table name (`demo_tbl`) and the table columns to the request body. In this case, you added one column named `C1`.
  >
  >    ```json
  >    {
  >      "name": "demo_tbl",
  >      "columns": [
  >        {
  >        "name": "c1",
  >        "datatype": "integer",
  >        "nullable": true,
  >        "comment": "An integral value column"
  >        }
  >      ],
  >      "comment": "Demo table for Snowflake REST API"
  >    }
  >    ```
* To fetch the table you just created, send a `GET` request to the `/api/v2/databases/{database}/schemas/{schema}/tables/{name}` endpoint. In this case, you specify `demo_db`, `demo_sc`, and `demo_tbl` in the database, schema and name path variables, respectively.

For more information, see the [Snowflake Table API reference](/developer-guide/snowflake-rest-api/reference/table.md).

## Alter a table and fetch the table details

You can use Postman to alter a table.

* To alter the table you created in the last tutorial, send a `PUT` request to the `/api/v2/databases/{database}/schemas/{schema}/tables/{name}` endpoint, as follows:

  1. Specify the names of the database, schema, and table you created in the corresponding path variables.
  2. In the request body, enter the new table definition. In this case, you add a new column to the table.

     > ```json
     > {
     >   "name": "demo_tbl",
     >   "columns": [
     >     {
     >     "name": "c1",
     >     "datatype": "integer",
     >     "nullable": true,
     >     "comment": "An integral value column"
     >     },
     >     {
     >     "name": "c2",
     >     "datatype": "string",
     >     "comment": "An string value column"
     >     }
     >   ],
     >   "comment": "Demo table for Snowflake REST API"
     > }
     > ```
* Verify the change by fetching the table details by sending a `GET` request to the `/api/v2/databases/{database}/schemas/{schema}/tables/{name}` endpoint. In this case, you specify `demo_db`, `demo_sc`, and `demo_tbl` in the database, schema and name path variables, respectively.

  Notice the table now contains a new `C2` column.

For more information, see the [Snowflake Table API reference](/developer-guide/snowflake-rest-api/reference/table.md).

## List available tables

You can use the `/api/v2/databases/{database}/schemas/{schema}/tables` endpoint to return lists of all tables available to you.

* To list all available tables, send a `GET` request to the `/api/v2/databases/{database}/schemas/{schema}/tables` endpoint with no query parameters, as follows. In this case, you specify `demo_db` and `demo_sc`, and `demo_tbl` in the database, schema and name path variables, respectively.
* To list full details of the columns and constraints in every table, add the recursive query parameter and set the value to `true`, as shown. Be aware that enabling this query parameter can overwhelm your connection if you have multiple complex tables.

For more information, see the [Snowflake Table API reference](/developer-guide/snowflake-rest-api/reference/table.md).

## What’s next?

Congratulations! In this tutorial, you learned the fundamentals for managing Snowflake database, schema, and table resources using the Snowflake REST APIs.

### Summary

Along the way, you completed the following steps:

* Create and list databases.
* Create and list schemas.
* Create a table and fetch the table details.
* Alter a table and fetch the table details.
* List available tables.

### Next tutorial

You can now proceed to [Tutorial 2: Create and manage tasks](tutorial-2.md), which shows you how to create and manage Snowflake tasks.

---
title: Tutorial 2: Create and manage tasks
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tutorials/tutorial-2.md
section: Snowflake REST API
---

Snowflake

Getting Started

App Development

Data Engineering

REST API

# Tutorial 2: Create and manage tasks

## Introduction

In this tutorial, you learn how to submit REST queries to create and manage tasks.

### Prerequisites

> **Note:**
>
> If you have already completed the steps in [Common setup for Snowflake REST APIs tutorials](common-setup.md), you can skip these prerequisites and proceed to the first step of this
> tutorial.

Before you start this tutorial, you must complete the [common setup](common-setup.md) instructions, which includes the following steps:

> * Import the Snowflake REST APIs Postman collections.
> * Authenticate your connection by setting the bearer token in Postman.

After completing these prerequisites, you are ready to start using the API.

## Create a warehouse

You can use the Warehouse API to create a Snowflake warehouse.

To create an extra small (`xsmall`) warehouse named `demo_wh`, send the following POST request to the `/api/v2/warehouses` endpoint, as shown:

* In the Params tab, set the `createMode` parameter to `errorIfExists`, which ensures that you don’t unintentionally overwrite an existing warehouse.
* In the Body tab, add the following code to the request body as shown.

  ```json
  {
    "name": "demo_wh",
    "warehouse_size": "xsmall"
  }
  ```

For more information, see the [Snowflake Warehouse API reference](/developer-guide/snowflake-rest-api/reference/warehouse.md).

## Create a task

You can use the Task API to create a Snowflake task.

To create a task, send a POST request to the `/api/v2/databases/{database}/schemas/{schema}/tasks` endpoint, as shown:

* In the Params tab, set the `createMode` parameter to `orReplace`, and set the `database` and `schema` path variables to use the environment variables (`{{default_db}}` and `{{default_schema}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.
* In the Body tab, add the request body as shown.

  ```json
  {
    "name": "{{test_task_name}}",
    "definition": "SELECT 1",
    "warehouse": "{{default_wh}}",
    "schedule": {"minutes": 2, "schedule_type": "MINUTES_TYPE"},
    "config": {"consecteture": false, "sed_9": 61393640, "doloref3": -85761000},
    "comment": "comment",
    "session_parameters": {
      "TIMEZONE": "America/Los Angeles",
      "AUTOCOMMIT": true
    },
    "error_integration": null,
    "user_task_managed_initial_warehouse_size": null,
    "predecessors": null,
    "task_auto_retry_attempts": 3,
    "user_task_timeout_ms": 10000,
    "suspend_task_after_num_failures": 3,
    "condition": true,
    "allow_overlapping_execution": false
  }
  ```

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Fetch a task

You can use the Task API to fetch a Snowflake task.

To fetch details about a task, send a GET request to the `/api/v2/databases/{database}/schemas/{schema}/tasks` endpoint, as shown:

* In the Params tab, set the `database`, `schema`, and `name` path variables to use the environment variables (`{{default_db}}`, `{{default_schema}}`, and `{{test_task_name}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## List tasks

You can use the Task API to list Snowflake tasks.

To list all available tasks, send a GET request to the `/api/v2/databases/{database}/schemas/{schema}/tasks` endpoint, as shown:

* In the Params tab, set the `rootOnly` parameter to `false`, and set the `database` and `schema` path variables to use the environment variables (`{{default_db}}` and `{{default_schema}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Delete a task

You can use the Task API to delete a Snowflake task.

To delete a task, send a DELETE request to the `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}` endpoint, as shown:

* In the Params tab, set the `database`, `schema`, and `name` path variables to use the environment variables (`{{default_db}}`, `{{default_schema}}`, and `{{test_task_name}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Execute a task

You can use the Task API to execute a Snowflake task.

To execute a task that will not retry if it fails, send a POST request to the `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}:execute` endpoint, as shown:

* In the Params tab, set the `retryLast` parameter to `false`, and set the `database` and `schema` path variables to use the environment variables (`{{default_db}}` and `{{default_schema}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Complete graphs

> **Note:**
>
> This tutorial assumes your have a default warehouse defined.

You can use the Task API to return details for graph runs that have completed.

To return details for completed graph runs for a task, send a GET request to the `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}:execute` endpoint, as shown:

* In the Params tab, do the following:

  + Set the `resultLimit` and `errorOnly` query parameters to `5` and `false`, respectively.
  + Set the `database`, `schema`, and `name` path variables to use the environment variables (`{{default_db}}`, `{{default_schema}}`, and `{{test_task_name}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Create a child task

You can use the Task API to create a child task for an existing Snowflake task.

To create a child task, send a POST request to the `/api/v2/databases/{database}/schemas/{schema}/tasks` endpoint, as shown:

* In the Params tab, set the `createMode` parameter to `orReplace`, and set the `database` and `schema` path variables to use the environment variables (`{{default_db}}` and `{{default_schema}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.
* In the Body tab, add the request body as shown. The `name` parameter specifies the name of the child task and `predecessors` identifies the name of the parent task.

  ```JSON
  {
    "name": "test_child_task",
    "definition": "SELECT 1",
    "warehouse": "{{default_wh}}",
    "predecessors": "{{test_task_name}}"
  }
  ```

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## Fetch a parent task’s dependent tasks

> **Note:**
>
> This tutorial assumes your have a default warehouse defined.

You can use the Task API to fetch a Snowflake task’s child (dependent) task.

To fetch details about a child (dependent) task, send a GET request to the `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}/dependents` endpoint, as shown:

* In the Params tab, set the `recursive` query parameter to `true`, and set the `database`, `schema`, and `name` path variables to use the environment variables (`{{default_db}}`, `{{default_schema}}`, and `{{test_task_name}}`) you set in the [Common setup for Snowflake REST APIs tutorials](common-setup.md) tutorials.

  Note that the result includes both the parent task and its child task.

For more information, see the [Snowflake Task API reference](/developer-guide/snowflake-rest-api/reference/task.md).

## What’s next?

Congratulations! In this tutorial, you learned the fundamentals for managing Snowflake warehouse and task resources using the Snowflake REST APIs.

### Summary

Along the way, you completed the following steps:

* Create a warehouse.
* Create a task.
* Fetch a task.
* Delete a task.
* Execute task.
* Complete graphs.
* Create a child task.
* Fetch a parent task’s dependent tasks.

---
title: Tutorials: Getting started with the Snowflake REST APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/tutorials-overview.md
section: Snowflake REST API
---

Snowflake

Getting Started

App Development

Data Engineering

REST API

# Tutorials: Getting started with the Snowflake REST APIs

With the Snowflake REST APIs, you can use REST to manage Snowflake resource objects. You can create, drop, and alter
tables, schemas, warehouses, tasks, and more, without writing SQL or using the Snowflake Connector for Python.

In the following tutorials, you learn how to get started with the API for object and task management in Snowflake.

## Prerequisites

* A Snowflake account
* A Snowflake AUTHENTICATION POLICY
* A JWT token for authentication
* Postman installed on your systems
* Familiarity with using Postman

## What you’ll learn

* How to import Postman collections
* How to authenticate your connection using Postman
* How to create databases, schemes, tables, and warehouses using the API
* How to create and manage tasks using the API

## Tutorials

The following tutorials provide step-by-step instructions for you to explore the Snowflake REST APIs:

[Common setup for Snowflake REST APIs tutorials](tutorials/common-setup.md)
:   Setup steps for exploring the tutorials

[Tutorial 1: Create and manage databases, schemas, and tables](tutorials/tutorial-1.md)
:   Step-by-step instructions to create a Snowflake database, schema, table, and virtual warehouse

[Tutorial 2: Create and manage tasks](tutorials/tutorial-2.md)
:   Step-by-step instructions to create and manage tasks and task graphs

---
title: Use catalog integrations
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/catalog-integration/catalog-integration-introduction.md
section: Snowflake REST API
---

# Use catalog integrations

The Snowflake REST [Catalog Integration API](/developer-guide/snowflake-rest-api/reference/catalog-integration.md) provides the following endpoints to access, update, and perform certain actions on Catalog Integration resources.

Snowflake REST Catalog Integration API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/catalog-integrations` | Lists available catalog integrations. |
| `POST /api/v2/catalog-integrations` | Creates a catalog integration. |
| `GET /api/v2/catalog-integrations/name` | Fetches a catalog integration. |
| `DELETE /api/v2/catalog-integrations/name` | Deletes a catalog integration. |

For reference documentation, see [Snowflake Catalog Integration API reference](/developer-guide/snowflake-rest-api/reference/catalog-integration.md).

---
title: Use Cortex Embed
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/cortex-embed/cortex-embed-introduction.md
section: Snowflake REST API
---

# Use Cortex Embed

The Snowflake REST [Cortex Embed API](/developer-guide/snowflake-rest-api/reference/cortex-embed.md) provides the following Snowflake endpoints:

Snowflake Cortex Embed API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/cortex/inference:embed` | Perform LLM embedding for input text, similar to the Snowflake Cortex `EMBED_TEXT` functions. |

For reference documentation, see [Snowflake Cortex Embed API reference](/developer-guide/snowflake-rest-api/reference/cortex-embed.md).

---
title: Use Cortex Inference
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/cortex-inference/cortex-inference-introduction.md
section: Snowflake REST API
---

# Use Cortex Inference

The Snowflake REST [Cortex Inference API](/developer-guide/snowflake-rest-api/reference/cortex-inference.md) provides the following Snowflake endpoints:

Snowflake Cortex Inference API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/cortex/inference/complete` | Performs LLM text completion inference, similar to snowflake.cortex. |
| `GET /api/v2/cortex/models` | Returns the LLMs available for the current session. |

For reference documentation, see [Snowflake Cortex Inference API reference](/developer-guide/snowflake-rest-api/reference/cortex-inference.md).

---
title: Use Cortex Lite Agent
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/cortex-lite-agent/cortex-lite-agent-introduction.md
section: Snowflake REST API
---

# Use Cortex Lite Agent

The Snowflake REST [Cortex Lite Agent API](/developer-guide/snowflake-rest-api/reference/cortex-lite-agent.md) provides the following Snowflake endpoints:

Snowflake Cortex Lite Agent API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/cortex/agent:run` | Send a Cortex Agent Run Request to get results. |

For reference documentation, see [Snowflake Cortex Lite Agent API reference](/developer-guide/snowflake-rest-api/reference/cortex-lite-agent.md).

---
title: Use notification integrations
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/notification-integration/notification-integration-introduction.md
section: Snowflake REST API
---

# Use notification integrations

The Snowflake REST [Notification Integration API](/developer-guide/snowflake-rest-api/reference/notification-integration.md) provides the following endpoints to access, update, and perform certain actions on Notification Integration resources.

Snowflake REST Notification Integration API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/notification-integrations` | Lists available notification integrations. |
| `POST /api/v2/notification-integrations` | Creates a notification integration. |
| `GET /api/v2/notification-integrations/name` | Fetches a notification integration. |
| `DELETE /api/v2/notification-integrations/name` | Deletes a notification integration. |

For reference documentation, see [Snowflake Notification Integration API reference](/developer-guide/snowflake-rest-api/reference/notification-integration.md).

---
title: Use the Cortex Search Service
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/cortex-search/cortex-search-introduction.md
section: Snowflake REST API
---

# Use the Cortex Search Service

The SNOWFLAKE REST [Cortex Search Service API](/developer-guide/snowflake-rest-api/reference/cortex-search-service.md) provides the following endpoints:

Snowflake Cortex Search Service API endpoints

| Endpoint | Description |
| --- | --- |
| `POST /api/v2/databases/database/schemas/schema/`.`cortex-search-services/service_name:query` | Queries a Cortex Search Service to get search results. |
| `GET /api/v2/databases/database/schemas/`.`schema/cortex-search-services` | Lists the Cortex Search Services under the database and schema. |
| `POST /api/v2/databases/database/schemas/schema/`.`cortex-search-services` | Creates a Cortex Search Service, with standard create modifiers as query parameters. |
| `GET /api/v2/databases/database/schemas/`.`schema/cortex-search-services/name` | Fetches a Cortex Search Service. |
| `DELETE /api/v2/databases/database/schemas/schema/`.`cortex-search-services/name` | Deletes a Cortex Search Service with the given name. |
| `POST /api/v2/databases/database/schemas/schema/`.`cortex-search-services/service_name:suggest` | Suggests from a Cortex Search Service to get auto-complete or contextual suggestions. |
| `POST /api/v2/databases/database/schemas/schema/`.`cortex-search-services/name:suspend` | Suspends one or both of the indexing or serving targets of a Cortex Search Service. |
| `POST /api/v2/databases/database/schemas/schema/`.`cortex-search-services/name:resume` | Resumes the Cortex Search Service. |

For reference documentation, see [Snowflake Cortex Search Service API reference](/developer-guide/snowflake-rest-api/reference/cortex-search-service.md).

---
title: Work with managed accounts
source: https://docs.snowflake.com/en/developer-guide/snowflake-rest-api/managed-account/managed-account-introduction.md
section: Snowflake REST API
---

# Work with managed accounts

The Snowflake REST [Managed Account API](/developer-guide/snowflake-rest-api/reference/managed-account.md) provides the following endpoints to access, update, and perform certain actions on Managed Account resources.

Snowflake REST Managed Account API endpoints

| Endpoint | Description |
| --- | --- |
| `GET /api/v2/managed-accounts` | Lists available managed accounts. |
| `POST /api/v2/managed-accounts` | Creates a managed account. |
| `DELETE /api/v2/managed-accounts/name` | Deletes a managed account. |

For reference documentation, see [Snowflake Managed Account API reference](/developer-guide/snowflake-rest-api/reference/managed-account.md).

## Developer Guide

UDFs, stored procedures, drivers (JDBC, ODBC, Python, Go, Node.js), SQL API, logging, tracing, and extensibility.

---
title: .NET Driver
source: https://docs.snowflake.com/en/developer-guide/dotnet/dotnet-driver.md
section: Developer Guide
---

# .NET Driver

> **Note:**
>
> This driver currently does not support GCP regional endpoints. Please ensure that any workloads using through this driver do not require support for regional endpoints on GCP. If you have questions about this, please contact Snowflake Support.

The Snowflake .NET driver provides an interface to the Microsoft .NET open source software framework for developing applications. The driver was developed using Visual Studio.

For more information, see the [.NET](https://www.microsoft.com/net/) website.

For complete installation and usage instructions, as well as developer notes and the source code, see the GitHub [Snowflake .NET driver repo](https://github.com/snowflakedb/snowflake-connector-net).

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

---
title: About Declarative Sharing in the Native Application Framework
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/about.md
section: Developer Guide
---

# About Declarative Sharing in the Native Application Framework

## About Declarative Sharing

Declarative Sharing in the Snowflake Native App Framework enables providers to share and sell data products, and to enhance those apps by including [notebooks](../../user-guide/ui-snowsight/notebooks.md) that help Snowflake consumers visualize and explore the data.

Declarative Sharing introduces a simplified creation experience, similar to setting up Secured Data Shares, making it easier to get started quickly.

The Declarative Native Apps development experience provides the following features:

* **A declarative sharing model** that allows you to define shared objects using a simple text-based YAML file.
* **Streamlined testing**, so developers can work directly with the content in a live environment.
* **Automatic versioning and updates** of the app.
* **Capabilities to prepare multiple views of data**, including filtered data views, that are optimized for different consumer types.
* **Capabilities to protect sensitive data** by categorizing data into application roles. Consumers can delegate these app roles to teams, so that team members only see data that’s relevant to their work.
* **Runs in the consumer account**, allowing the customer to manage their resource usage and costs.

## What is a Declarative Sharing app?

A Declarative Native App is a data product that uses the declarative sharing model to share data and logic with Snowflake consumers.

Declarative Native Apps are built using a combination of Snowflake objects, including the following:

[Databases](manifest-reference.md)
:   Databases that the provider shares with consumers.

[Schemas](manifest-reference.md)
:   Schemas in the shared databases.

[Tables](manifest-reference.md)
:   Database tables in the shared schemas.

[Views](manifest-reference.md)
:   Shared views in the shared databases. Views can reference other databases that are not shared, if the dependent databases are included in the manifest file.

[Required databases](manifest-reference.md)
:   Databases that are not shared, but are referenced by views in the shared databases. These databases must be included in the manifest file. Consumers can only access data in these databases using the views shared in the app.

[Notebooks](../../user-guide/ui-snowsight/notebooks.md), [stored procedures](../stored-procedure/stored-procedures-overview.md), and [user-defined functions](../udf/udf-overview.md)
:   Code objects that help consumers visualize and explore the data.

[Manifest file](manifest-reference.md)
:   A manifest file defines the app structure and shared objects.

[App roles](app-roles.md)
:   App roles categorize data and control access to shared objects.

For information about how to declare these objects in the manifest file, see [Declarative Native App manifest reference](manifest-reference.md).

## Security

Declarative Native Apps have a similar security model to secure data sharing:

* Apps only have access to the data included in the app.
* Apps can’t access the consumer’s private data.
* Apps aren’t allowed to make external calls or to access data outside of the Snowflake account.

## Data product types

Choosing the right data product for your organization is determined by your business needs.
Do you want to get started quickly? Do you need an app with advanced features?
The following table lists the available Snowflake data products and provides their typical use cases.
An overview of Snowflake data products.

Data product best uses

| Data product | Description | Best for |
| --- | --- | --- |
| Secure Data Sharing | Traditional read-only sharing of tables and views. | Organizations beginning data monetization or with simple sharing needs. |
| Declarative Native Apps | Enhanced sharing with notebooks and other logic, role-based access control (RBAC), and declarative configuration. | Data providers ready to add value through guided experiences and documentation |
| Full Native Apps | Apps running fully inside of a consumer account with complex business logic and interfaces. | Organizations building complex data products with advanced capabilities. |

## Choosing a data product

Before choosing a data product, consider the following:

Data product types

| Data product | Description | Provider Builds | Security and Functionality Balance | Best provider use cases |
| --- | --- | --- | --- | --- |
| Secure Data Sharing | Traditional read-only sharing of tables and views   * **Technical Expertise**: Basic Snowflake * **Development Skills**: SQL knowledge * **Maintenance Effort**: Low - SQL updates only | SQL grants for tables, views | * Data stays within Snowflake | * Providers focusing on datasets only * Initial marketplace entry |
| Declarative Native App | Enhanced sharing   * **Technical Expertise**: Intermediate Snowflake * **Development Skills**: SQL, YAML, notebooks * **Maintenance Effort**: Low - declarative updates, notebook/ SQL changes | Application package | * Data stays within Snowflake * Limited functionality–that is, notebooks, Streamlit, stored procedures, and user-defined functions | * Complex data requiring explanation * Demonstrating data value through examples * Reducing support burden through better documentation |
| Full Native Apps | Apps running fully inside the Snowflake customer’s account with complex business logic and interfaces   * **Technical Expertise**: Advanced Snowflake * **Development Skills**: SQL, containers, programming languages * **Maintenance Effort**: High - containers, security reviews | Application package, services (in Containers) | * Data by default within Snowflake, can leave Snowflake with consumer consent * Snowflake primitives and container runtime | * Data requiring complex logic and workflows * Complex visualization needs * Re-use of SaaS application components |

## Declarative Native Apps resources

In the following topics, you’ll find information you need to get started with Declarative Native Apps.

* [Introduction to Declarative Sharing in the Native Application Framework](introduction.md)
* [Application Packages in Declarative Sharing in the Native Application Framework](package.md)
* [Editing Notebooks in Declarative Shared Native Applications](live-editing.md)
* [User-Defined Functions and Stored Procedures in Declarative Shared Native Applications](udfs-sprocs.md)
* [Creating a Listing using Declarative Sharing](listing.md)
* [Monitoring Usage with Declarative Sharing in the Native Application Framework](monitoring.md)
* [Package Versions in Declarative Sharing in the Native Application Framework](versioning.md)
* [Versioning Application Packages in Declarative Sharing](versioning.md)
* [Tutorial: Getting started with Declarative Native Apps](tutorials/getting-started.md)
* [Declarative Native App command reference](command-reference.md)
* [Application roles: Allow consumers to share different views of the same data](app-roles.md)
* [Declarative Native App manifest reference](manifest-reference.md)
* [Install a Declarative Native App](consumer/install.md)
* [Access content in a Declarative Native App](consumer/access-app-content.md)
* [Declarative App Consumer-Side Execution Model](consumer/consumer-execution.md)
* [Declarative Sharing in Native Apps: Limitations](limitations.md)

---
title: About the SQL API endpoints
source: https://docs.snowflake.com/en/developer-guide/sql-api/about-endpoints.md
section: Developer Guide
---

# About the SQL API endpoints

The SQL API is available at `https://account_identifier.snowflakecomputing.com/api`, where `account_identifier` is
your [account identifier](../../user-guide/admin-account-identifier.md).

Beginning with Snowflake version 6.3, the API consists of the `/api/v2/statements/` resource and provides the following endpoints:

| Endpoint | Description |
| --- | --- |
| `/api/v2/statements/` | Use this endpoint to [submit SQL statements for execution](submitting-requests.md). |
| `/api/v2/statements/statementHandle` | Use this endpoint to [check the status of the execution of a statement](handling-responses.md). (`statementHandle` is a unique identifier for the statement submitted for execution.) |
| `/api/v2/statements/statementHandle/cancel` | Use this endpoint to [cancel the execution of a statement](cancelling-requests.md). |

These endpoints include the new method of retrieving results, which was introduced in Snowflake version 5.40. However, when sending a request to these new endpoints, you do not need to set the format field to `jsonv2` in the `resultSetMetaData` field. If the format field is set in the request, the SQL API ignores the field.

The new version of the SQL API also removes concurrency limits, enabling you to retrieve query results from multiple threads.

You can use development tools and libraries for REST APIs (e.g. Postman) to send requests and handle responses.

---
title: Access content in a Declarative Native App
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/consumer/access-app-content.md
section: Developer Guide
---

# Access content in a Declarative Native App

If you have installed a Snowflake Declarative Native App, or have had a Declarative Native App shared with you by a member of your organization, you can access the data and functionality through Snowsight or [Snowflake CLI](../../snowflake-cli/index.md).

## Access app content from Snowsight

1. [Sign in to Snowsight](https://app.snowflake.com) with your Snowflake account.
2. In the navigation menu, select Catalog » Apps.
3. Select the app you want to access.
4. Browse the app’s content, which includes:

   * **Notebooks**: If the app includes notebooks, you can run them to see visualizations and other content.
   * **Tables and views**: You can query the tables and views that are part of the app.
   > **Note:**
   >
   > Notebooks in Declarative Native Apps are read-only. You can run the cells in a notebook, or run entire notebook, but you can’t modify it.

## Access app notebooks

You can access the app’s notebooks, either through Snowsight or through [Snowflake CLI](../../snowflake-cli/index.md).

### Find and open notebooks available to your role using Snowsight

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the app you want to access. A side panel appears with information about the app and its notebooks.
4. Select **Open**. If notebooks are available to your role, they appear in the drop-down list. If no notebooks are available, the **Open** button opens the worksheet directly.
5. If a list of notebooks appears, select a notebook from the list. The notebook opens, and is listed as a part of the app.
6. You can run individual cells in the notebook, or run the entire notebook by selecting Run » Run all cells.
7. Selecting the notebook name opens a menu with the following items:

   * Other notebooks in the same app that you can navigate to.
   * A link to the listing for this application.
8. The “<” (left chevron) button takes you to the notebook list page. The notebook list page has two tabs:

   * All Notebooks: Lists all notebooks available to your role.
   * Shared with me: Lists notebooks for which you aren’t the owner.

### Find and open notebooks available to your role using SQL commands

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md), and select **Write SQL queries**.
2. Use the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command to see what apps are installed in your account.

   ```sqlexample
   SHOW APPLICATIONS;
   ```

   Use the application name (for example, `market_data_app`) to access the app’s content.
3. See what notebooks are in the app with the command: [SHOW NOTEBOOKS IN APPLICATION](../../../sql-reference/sql/show-notebooks.md).

   ```sqlexample
   SHOW NOTEBOOKS IN APPLICATION market_data_app;
   ```

   For example, the command might return a notebook called `MARKETING_NB`.

   Optional: Use the [DESC NOTEBOOK](../../../sql-reference/sql/desc-notebook.md) command to see more information about the notebook.

   ```sqlexample
   DESC NOTEBOOK market_data_app.APP$UI.MARKETING_NB;
   ```
4. Run the notebook with the command: [EXECUTE NOTEBOOK](../../../sql-reference/sql/execute-notebook.md).

   ```sqlexample
   EXECUTE NOTEBOOK market_data_app.APP$UI.MARKETING_NB();
   ```
5. In the navigation menu, select Projects » Notebooks.

   The notebook should appear in your list of available notebooks.
6. Open the notebook by selecting it from the list.

   The notebook opens, and is listed as a part of the app.

### Access tables and views in the app

Tables and views are available in the app’s schema. You can access them using SQL commands.

* See what schemas are in the app using [SHOW SCHEMAS IN APPLICATION](../../../sql-reference/sql/show-schemas.md).

  > ```sqlexample
  > SHOW SCHEMAS IN APPLICATION <app_name>;
  > ```
* See tables, dynamic tables, views, and semantic views in a schema, application, or account using the [SHOW TABLES](../../../sql-reference/sql/show-tables.md), [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md), [SHOW VIEWS](../../../sql-reference/sql/show-views.md), and [SHOW SEMANTIC VIEWS](../../../sql-reference/sql/show-semantic-views.md) commands:

  > ```sqlexample
  > -- Using SHOW TABLES
  > SHOW TABLES IN SCHEMA <app_name>.<schema_name>;
  > SHOW TABLES IN APPLICATION <app_name>;
  > SHOW TABLES IN ACCOUNT;
  >
  > -- Using SHOW DYNAMIC TABLES
  > SHOW DYNAMIC TABLES IN SCHEMA <app_name>.<schema_name>;
  > SHOW DYNAMIC TABLES IN APPLICATION <app_name>;
  > SHOW DYNAMIC TABLES IN ACCOUNT;
  >
  > -- Using SHOW VIEWS
  > SHOW VIEWS IN SCHEMA <app_name>.<schema_name>;
  > SHOW VIEWS IN APPLICATION <app_name>;
  > SHOW VIEWS IN ACCOUNT;
  >
  > -- Using SHOW SEMANTIC VIEWS
  > SHOW SEMANTIC VIEWS IN SCHEMA <app_name>.<schema_name>;
  > SHOW SEMANTIC VIEWS IN APPLICATION <app_name>;
  > SHOW SEMANTIC VIEWS IN ACCOUNT;
  > ```
* Select items in a view or table, for example:

  > ```sqlexample
  > SELECT * from <app_name>.<schema>.<view>;
  > SELECT * from <app_name>.<schema>.<table>;
  > ```
* Create streams on shared tables and views to track changes. See [Introduction to streams](../../../user-guide/streams-intro.md) for syntax examples. Note that creating a stream on a shared table or view requires that the provider has enabled change tracking on the source object. If you encounter an error creating a stream, contact the provider to request that they enable `CHANGE_TRACKING`.

### Considerations

Notebooks in Declarative Native Apps are interactive, but are read-only. They can’t be modified, copied, or cloned.

To view past notebook executions, select Schedule notebook run » View run history.

---
title: Access to cloud service file data with Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-file-data.md
section: Developer Guide
---

# Access to cloud service file data with Snowpark Connect for Spark

With Snowpark Connect for Spark, you can interact directly with external cloud storage systems such as Amazon S3, Google Cloud Storage, and Azure Blob.
You can read data from cloud storage into Snowflake, process the data, then write it back.

For example, you might want to use Snowpark Connect for Spark to perform the following tasks:

* Ingest raw data.

  Land files (for example, CSV, JSON, and Parquet) in S3, Google Cloud, or Azure before moving them into Snowflake.
* Export data for downstream use.

  Write processed Snowpark DataFrames back to cloud storage for ML training, sharing with external partners, or further Spark-based
  analytics.
* Create hybrid pipelines.

  Keep part of the pipeline in Snowflake while maintaining compatibility with existing data lakes.
* Comply with regulations or reduce costs.

  Store specific datasets externally due to regulations, governance, or budget constraints.

Use the steps listed in this topic to read from and write to files stored on these cloud service providers. You can access files using
either Snowflake external stages or direct access.

## Caveats

When using Snowpark Connect for Spark to work with cloud services, keep in mind the following caveats:

* Authentication—Snowpark Connect for Spark does not automatically manage cloud credentials. You must configure access keys (AWS), storage account
  keys or SAS tokens (Azure), or maintain external stages by yourself. Expired or missing credentials will result in read/write failures.
* Performance—Cloud I/O depends on network bandwidth and object store latency. Reading many small files can significantly impact performance.
* Format support—Ensure that the file formats you’re reading and writing are supported. Currently Snowpark Connect for Spark has parity with common
  formats, including TEXT, CSV, JSON, and Parquet. However, advanced features (such as Parquet partition discovery and JSON schema evolution)
  may differ from Spark.
* Permissions and policies—Writing to cloud buckets requires proper IAM/ACL policies. You might encounter an AccessDenied error if
  policies aren’t aligned between Snowflake roles and cloud credentials.

## Best Practices

To get the most reliable integration that performs well, follow these best practices:

* Use secure, temporary credentials and rotate credentials frequently.
* Partition and bucket data.

  When writing Parquet, partition on frequently filtered columns to reduce scan costs. Use fewer, larger files (for example, at 100MB to
  500MB each) instead of many small files.
* Validate schema on write.

  Always define the schema explicitly, especially for semi-structured formats such as JSON and CSV. This prevents drift between
  Snowflake and external data.
* Monitor costs.

  Consider consolidating files and filtering data before writing to reduce costs. Cloud provider costs are accrued per request and per byte scanned.
* Standardize API calls.

  Follow the documented guidance precisely when using functionality and parameters, avoiding ad-hoc variations. In this way, you can
  maintain compatibility, prevent regressions, and ensure expected behavior across different cloud providers.

## Access using Snowflake external stages

AWSAzureGoogle Cloud

1. [Configure secure access to Amazon S3](../../user-guide/data-load-s3-config.md) to create an external stage that points to your
   S3 location.
2. Read from your external stage.

   ```python
   # Read CSV
   spark.read.csv('@<your external stage name>/<file path>')
   spark.read.option("header", True).csv('@<your external stage name>/<file path>') # read with header in file

   # Write to CSV
   df.write.csv('@<your external stage name>/<file path>')
   df.write.option("header", True).csv('@<your external stage name>/<file path>') # write with header in file

   # Read Text
   spark.read.text('@<your external stage name>/<file path>')

   # Write to Text
   df.write.text('@<your external stage name>/<file path>')
   df.write.format("text").mode("overwrite").save('@<your external stage name>/<file path>')

   # Read Parquet
   spark.read.parquet('@<your external stage name>/<file path>')

   # Write to Parquet
   df.write.parquet('@<your external stage name>/<file path>')

   # Read JSON
   spark.read.json('@<your external stage name>/<file path>')

   # Write to JSON
   df.write.json('@<your external stage name>/<file path>')
   ```

1. [Configure secure access to Azure](../../user-guide/data-load-azure-create-stage.md) to create an external stage that points to your
   Azure container.
2. Read from your external stage.

   ```python
   # Read CSV
   spark.read.csv('@<your external stage name>/<file path>')
   spark.read.option("header", True).csv('@<your external stage name>/<file path>')
   # read with header in file

   # Write to CSV
   df.write.csv('@<your external stage name>/<file path>')
   df.write.option("header", True).csv('@<your external stage name>/<file path>') # write with header in file

   # Read Text
   spark.read.text('@<your external stage name>/<file path>')

   # Write to Text
   df.write.text('@<your external stage name>/<file path>')
   df.write.format("text").mode("overwrite").save('@<your external stage name>/<file path>')

   # Read Parquet
   spark.read.parquet('@<your external stage name>/<file path>')

   # Write to Parquet
   df.write.parquet('@<your external stage name>/<file path>')

   # Read JSON
   spark.read.json('@<your external stage name>/<file path>')

   # Write to JSON
   df.write.json('@<your external stage name>/<file path>')
   ```

1. [Configure secure access to Google Cloud](../../user-guide/data-load-gcs-config.md) to create an external stage that points to your
   Google Cloud Storage bucket.
2. Read from your external stage.

   ```python
   # Read CSV
   spark.read.csv('@<your external stage name>/<file path>')
   spark.read.option("header", True).csv('@<your external stage name>/<file path>') # read with header in file

   # Write to CSV
   df.write.csv('@<your external stage name>/<file path>')
   df.write.option("header", True).csv('@<your external stage name>/<file path>') # write with header in file

   # Read Text
   spark.read.text('@<your external stage name>/<file path>')

   # Write to Text
   df.write.text('@<your external stage name>/<file path>')
   df.write.format("text").mode("overwrite").save('@<your external stage name>/<file path>')

   # Read Parquet
   spark.read.parquet('@<your external stage name>/<file path>')

   # Write to Parquet
   df.write.parquet('@<your external stage name>/<file path>')

   # Read JSON
   spark.read.json('@<your external stage name>/<file path>')

   # Write to JSON
   df.write.json('@<your external stage name>/<file path>')
   ```

## Access using direct access

You can access files directly on cloud service providers using the steps and code described here.

AWSAzure

1. Set the Spark configuration with AWS credentials.

   ```python
   # For S3 related access with public/private buckets, please add these config change
   spark.conf.set("spark.hadoop.fs.s3a.connection.ssl.enabled","false")
   spark.conf.set("spark.hadoop.fs.s3a.impl","org.apache.hadoop.fs.s3a.S3AFileSystem")
   spark.conf.set("spark.jars.packages","org.apache.hadoop:hadoop-aws:3.3.2")

   # For private S3 access, please also provide credentials
   spark.conf.set("spark.hadoop.fs.s3a.access.key","<AWS_ACCESS_KEY_ID>")
   spark.conf.set("spark.hadoop.fs.s3a.secret.key","<AWS_SECRET_ACCESS_KEY>")
   spark.conf.set("spark.hadoop.fs.s3a.session.token","<AWS_SESSION_TOKEN>")
   ```
2. Read and write directly with S3.

   ```python
   # Read CSV
   spark.read.csv('s3a://<bucket name>/<file path>')
   spark.read.option("header", True).csv('s3a://<bucket name>/<file path>') # read with header in file

   # Write to CSV
   df.write.csv('s3a://<bucket name>/<file path>')
   df.write.option("header", True).csv('s3a://<bucket name>/<file path>') # write with header in file

   # Read Text
   spark.read.text('s3a://<bucket name>/<file path>')

   # Write to Text
   df.write.text('s3a://<bucket name>/<file path>')
   df.write.format("text").mode("overwrite").save('s3a://<bucket name>/<file path>')

   # Read Parquet
   spark.read.parquet('s3a://<bucket name>/<file path>')

   # Write to Parquet
   df.write.parquet('s3a://<bucket name>/<file path>')

   # Read JSON
   spark.read.json('s3a://<bucket name>/<file path>')

   # Write to JSON
   df.write.json('s3a://<bucket name>/<file path>')
   ```

1. Set the Spark configuration with Azure credentials.

   ```python
   # For private Azure access, please also provide blob SAS token
   #   * Make sure all required permissions are in place before proceeding
   spark.conf.set("fs.azure.sas.fixed.token.<storage-account>.dfs.core.windows.net","<Shared Access Token>")
   ```
2. Read and write directly with Azure.

   ```python
   # Read CSV
   spark.read.csv('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')
   spark.read.option("header", True).csv('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>') # read with header in file

   # Write to CSV
   df.write.csv('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')
   df.write.option("header", True).csv('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>') # write with header in file

   # Read Text
   spark.read.text('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')

   # Write to Text
   df.write.text('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')
   df.write.format("text").mode("overwrite").save('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')

   # Read Parquet
   spark.read.parquet('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')

   # Write to Parquet
   df.write.parquet('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')

   # Read JSON
   spark.read.json('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')

   # Write to JSON
   df.write.json('wasbs://<container name>@<storage account name>.blob.core.windows.net/<bucket name>/<file path>')
   ```

---
title: Accessing data from a Java stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-access-data.md
section: Developer Guide
---

# Accessing data from a Java stored procedure

To access data with a stored procedure handler written in Java, use the Snowpark library APIs.

When handling a call to your Java stored procedure, Snowflake creates a Snowpark `Session` object and passes the object to
the method for your stored procedure.

As is the case with stored procedures in other languages, the context for the session (including the privileges, current database and
schema, and so on) is determined by whether the stored procedure runs with caller’s rights or owner’s rights. For details, see
[Accessing and setting the session state](../stored-procedures-rights.md).

You can use this `Session` object to call APIs in the
[Snowpark library](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html).
For example, you can [create a DataFrame for a table](../../snowpark/java/working-with-dataframes.md) or execute an
SQL statement.

See the [Snowpark Developer Guide for Java](../../snowpark/java/index.md) for more information.

> **Note:**
>
> For information about limitations, including limitations on accessing data, see [Java stored procedure limitations](procedure-java-limitations.md).

## Data access example

In the following example, a Java method copies a specified number of rows from one table to another table. The method takes the following
arguments:

* A Snowpark `Session` object
* The name of the table to copy the rows from
* The name of the table to save the rows to
* The number of rows to copy

The method in this example returns a string.

```java
import com.snowflake.snowpark_java.*;

public class MyClass
{
  public String myMethod(Session session, String fromTable, String toTable, int count)
  {
    session.table(fromTable).limit(count).write().saveAsTable(toTable);
    return "Success";
  }
}
```

---
title: Accessing data from a Python stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-access-data.md
section: Developer Guide
---

# Accessing data from a Python stored procedure

You can access data from a stored procedure by using the Snowpark library APIs.

You can use the `Session` object that Snowflake creates for your stored procedure to access data by calling APIs in the
[Snowpark library](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/index).
For example, you can [create a DataFrame for a table](../../snowpark/python/working-with-dataframes.md) or execute a SQL statement.

The context for the session (including the privileges, current database and schema, and so on) is determined by whether the stored
procedure runs with [caller’s rights or owner’s rights](../stored-procedures-rights.md). For details,
see [Accessing and setting the session state](../stored-procedures-rights.md).

See the [Snowpark Developer Guide](../../snowpark/python/index.md) for more information.

## Data access example

In the following example, a Python method copies a specified number of rows from one table to another table. The method takes the
following arguments:

* A Snowpark `Session` object
* The name of the table to copy the rows from
* The name of the table to save the rows to
* The number of rows to copy

The method in this example returns a string. If you run this example in a
[Python worksheet](../../snowpark/python/python-worksheets.md),
[change the return type for the worksheet](../../snowpark/python/python-worksheets.md) to a String

```python
def run(session, from_table, to_table, count):

  session.table(from_table).limit(count).write.save_as_table(to_table)

  return "SUCCESS"
```

---
title: Accessing data with Scala from stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-access-data.md
section: Developer Guide
---

# Accessing data with Scala from stored procedures created with SQL

To access data with a stored procedure handler written in Scala, use the Snowpark library APIs.

When handling a call to your Scala stored procedure, Snowflake creates a Snowpark `Session` object and passes the object to
the method or function for your stored procedure.

As is the case with stored procedures whose handlers are written in other languages, the context for the session (including the privileges,
current database and schema, and so on) is determined by whether the stored procedure runs with caller’s rights or owner’s rights. For
details, see [Accessing and setting the session state](../stored-procedures-rights.md).

You can use this `Session` object to call APIs in the
[Snowpark library](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/index.html).
For example, you can [create a DataFrame for a table](../../snowpark/scala/working-with-dataframes.md) or execute a
SQL statement.

See the [Snowpark Developer Guide for Scala](../../snowpark/scala/index.md) for more information.

> **Note:**
>
> For information about limitations, including limitations on accessing data, see [Limitations for Scala in stored procedures created with SQL](procedure-scala-limitations.md).

## Data access example

The following is an example of a Scala method that copies a specified number of rows from one table to another table. The method
takes the following arguments:

* A Snowpark `Session` object
* The name of the table to copy the rows from
* The name of the table to save the rows to
* The number of rows to copy

The method in this example returns a string.

```scala
object MyObject
{
  def myProcedure(session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =
  {
    session.table(fromTable).limit(count).write.saveAsTable(toTable)
    return "Success"
  }
}
```

The following example defines a function, rather than a method:

```scala
object MyObject
{
  val myProcedure = (session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =>
  {
    session.table(fromTable).limit(count).write.saveAsTable(toTable)
    "Success"
  }
}
```

---
title: Add Anaconda packages to a notebook
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/add-packages.md
section: Developer Guide
---

# Add Anaconda packages to a notebook

The notebook environment includes a set of pre-installed Anaconda packages, such as Python and Streamlit.

If your notebook uses additional Anaconda packages, you must add those packages to your application package so that your notebook can access them.

You can add them while editing the notebook in development mode. You can also add the packages by providing an `environment.yml` file.

> **Note:**
>
> If an `environment.yml` file is present in the same directory as a notebook, it overwrites the list of dependent packages, and any packages added through the Snowsight UI are ignored.
>
> Using an `environment.yml` file is recommended for production applications as it allows you to manage dependencies in source control.
>
> Using the UI is convenient for interactive development and testing.

## Adding Anaconda packages while editing the notebook in development mode

You can add Anaconda packages to your notebook while editing it in development mode. We recommend using this method rather than adding packages to the `environment.yml` file, because the process is considerably simpler.

To add packages while editing the notebook:

1. Install your application package locally from the live version.
2. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
3. In the navigation menu, select Projects » Notebooks.
4. Open your notebook file.
5. Make sure the notebook is in development mode. For information about development mode, see [Editing Notebooks in Declarative Shared Native Applications](live-editing.md).
6. Select the Packages button in the top center of the notebook editor.
7. Search for the package you want to add, and select it.

The notebook environment now automatically loads the selected dependencies when the notebook is run.

## Adding Anaconda packages to the `environment.yml` file

You can define your Python dependencies by creating an `environment.yml` file, and uploading it to the same stage directory as your notebook (.ipynb) file.

For information about creating an `environment.yml` file that includes your new packages, see
[Manage packages by using the environment.yml file](../streamlit/app-development/dependency-management.md)

> **Note:**
>
> You can only install packages listed in the
> [Snowflake Anaconda Channel](https://repo.anaconda.com/pkgs/snowflake/).
> Streamlit in Snowflake does not support external Anaconda channels.

Use the PUT command to upload your `environment.yml` file from your local machine to the application package stage. The `environment.yml` file must be in the same directory on the stage as the notebook file it configures.

Replace the placeholders in the following command with your own values. If your notebook is at the root of the live version, do not include a directory path after `live/`.

```bash
PUT <file:///path/to/your/environment.yml> snow://package/<PACKAGE_NAME>/versions/live/<path/to/your/notebook> OVERWRITE=TRUE AUTO_COMPRESS=FALSE;
```

---
title: Adding custom spans to a trace
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-custom-spans.md
section: Developer Guide
---

# Adding custom spans to a trace

You can add your own custom spans to traces for finer-grained tracing within the handler for a procedure or function.

By default, when you [have tracing enabled](logging-tracing-enabling.md), Snowflake starts a span for
you (as described in [How Snowflake represents trace events](tracing-how-events-work.md)) and adds all trace events to that span. (This is
known internally as the “auto_instrumented” span.) Using OpenTelemetry APIs, you can create your own spans. To the new span, you can add
events and attributes using either the OpenTelemetry API or Snowflake API for your language.

You might want to create your own span when, for example, you want to isolate the trace data for computation-heavy actions that happen
within a procedure, such as when you’re using the code to train an ML model.

Custom spans you create match the default behavior of spans created by OpenTelemetry.

## Supported languages

You can add custom spans from code written in the following languages, including when handler code is written with
[Snowpark APIs](../snowpark/index.md).

| Language/Type | Java | Python | JavaScript | Scala | Snowflake Scripting |
| --- | --- | --- | --- | --- | --- |
| Stored procedure handler | ✔ | ✔ | ✔ | ✔ [1] |  |
| Streamlit app |  | ✔ |  |  |  |
| UDF handler (scalar function) | ✔ | ✔ | ✔ | ✔ [1] |  |
| UDTF handler (table function) | ✔ | ✔ | ✔ | ✔ [1] [2] |  |

[1]
(1,2,3)

Supported with the Java API.

[2]

Scala UDTF handler written in Snowpark.

## Creating a custom span

To add a custom span with handler code, use the OpenTelemetry API for your handler language within the existing Snowflake telemetry
environment to create a new span, add events and attributes as needed, and then close the span.

1. Use the OpenTelemetry API to create a tracer to manage context for the span.

   From this tracer created from the existing Snowflake telemetry environment, you can create custom spans that use the existing
   infrastructure in which trace data is captured by the event table.
2. From the new tracer, create the custom span with an API that ensures that the new span is the current span.

   By creating the new span in the existing context managed by Snowflake, you ensure that information from the context — including
   the [trace_id](event-table-columns.md) and [parent_span_id](event-table-columns.md) values — is
   passed from the Snowflake default span to other spans.
3. When your code finishes with the custom span, it must close the span before the handler completes execution to have trace data
   captured by the event table.

   This behavior of custom spans matches the default behavior of OpenTelemetry.

For information on adding a custom span with a supported language, see the following topics:

* [Java custom span](tracing-java.md)
* [JavaScript custom span](tracing-javascript.md)
* [Python custom span](tracing-python.md)
* [Scala custom span](tracing-scala.md)

### Python example

Code in the following example uses the [OpenTelemetry Python API](https://opentelemetry-python.readthedocs.io/en/latest/api/index.html) to create the `my.span` span as the current span with
`start_as_current_span`. It then adds an event with attributes to the new span using the [OpenTelemetry Python API](https://opentelemetry-python.readthedocs.io/en/latest/api/index.html).

Event data won’t be captured by the event table unless the span ends before your handler completes execution. In this example, closing the
span happens automatically when the `with` statement concludes.

```sqlexample-python
CREATE OR REPLACE FUNCTION customSpansPythonExample() RETURNS STRING
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('opentelemetry-api')
HANDLER = 'custom_spans_function'
AS $$
from snowflake import telemetry
from opentelemetry import trace

def custom_spans_function():
  tracer = trace.get_tracer("my.tracer")
  with tracer.start_as_current_span("my.span") as span:
    span.add_event("Event2 in custom span", {"key1": "value1", "key2": "value2"})

  return "success"
$$;
```

---
title: API Reference for Access to Secrets
source: https://docs.snowflake.com/en/developer-guide/external-network-access/secret-api-reference.md
section: Developer Guide
---

# API Reference for Access to Secrets

You can use Java, Python, or Scala to retrieve credentials contained in a secret you created with the [CREATE SECRET](../../sql-reference/sql/create-secret.md)
statement. This topic lists the methods for getting information from a secret. These are available with APIs included in Snowflake.

## Java API for Secret Access

For code in Java, use the `com.snowflake.snowpark_java.types.SnowflakeSecrets` class.

> **Note:**
>
> You can also use the Java API in Scala code.

The following table lists methods for accessing data in a secret.

| Method | Description |
| --- | --- |
| `public String getGenericSecretString(String genericStringSecretName)` | Gets the generic token string held by the secret specified by `genericStringSecretName`. Returns a valid token string. |
| `public String getOAuthAccessToken(String oauthSecretName)` | Gets the OAuth2 access token held by the secret specified by `oauthSecretName`. Returns an OAuth2 token string. |
| `public String getSecretType(String secretName)` | Gets the type of the secret specified by `secretName`. Returns the TYPE parameter value set for this secret when it was created with the [CREATE SECRET](../../sql-reference/sql/create-secret.md) statement. |
| `public UsernamePassword getUsernamePassword(String usernamePasswordSecretName)` | Gets the username and password from the secret specified by `usernamePasswordSecretName`. Returns a `com.snowflake.snowpark_java.types.UsernamePassword` with username and password. |
| `public CloudProviderToken getCloudProviderToken(String cloudProviderSecretName)` | Gets a cloud provider token containing values you can use to create a session with the cloud provider, such as AWS. Returns a `com.snowflake.snowpark_java.types.CloudProviderToken` with the following methods:   * `String getAccessKeyId` * `String getSecretAccessKey` * `String getToken` |

To use the `SnowflakeSecrets` class:

1. Make the Snowpark library available to your handler code using the PACKAGES clause as described in
   [CREATE FUNCTION](../../sql-reference/sql/create-function.md).
2. In your handler code, import `com.snowflake.snowpark_java.types.SnowflakeSecrets`.
3. Construct a `SnowflakeSecrets` object, and call one of the methods listed above to access the secret.

Code in the following example retrieves the value set for the TYPE clause when the secret was created with CREATE SECRET. Here,
the `oauth_token` secret is of type OAUTH2.

```sqlexample-java
CREATE OR REPLACE FUNCTION get_secret_type()
  RETURNS STRING
  LANGUAGE JAVA
  HANDLER = 'SecretTest.getSecretType'
  EXTERNAL_ACCESS_INTEGRATIONS = (external_access_integration)
  PACKAGES = ('com.snowflake:snowpark:latest')
  SECRETS = ('cred' = oauth_token )
  AS
  $$
  import com.snowflake.snowpark_java.types.SnowflakeSecrets;

  public class SecretTest {
    public static String getSecretType() {
      SnowflakeSecrets sfSecrets = SnowflakeSecrets.newInstance();

      String secretType = sfSecrets.getSecretType("cred");

      return secretType;
    }
  }
  $$;
```

## Python API for Secret Access

For code in Python, use the `_snowflake` module exposed to Python UDFs that execute within Snowflake. The following table lists
`_snowflake` functions for accessing data in a secret.

| Function | Description |
| --- | --- |
| `get_generic_secret_string(generic_string_secret_name)` | Gets the generic token string held by the secret specified by `generic_string_secret_name`. Returns a valid token string. |
| `get_oauth_access_token(oauth_secret_name)` | Gets the OAuth2 access token held by the secret specified by `oauth_secret_name`. Returns an OAuth2 token string. |
| `get_secret_type(secret_name)` | Gets the type of the secret specified by `secret_name`. Returns the TYPE parameter value set for this secret when it was created with the [CREATE SECRET](../../sql-reference/sql/create-secret.md) statement. |
| `get_username_password(username_password_secret_name)` | Gets the username and password from the secret specified by `username_password_secret_name`. Returns an object with `username` and `password` attributes. |
| `get_cloud_provider_token(cloud_provider_secret_name)` | Gets a cloud provider object containing values you can use to create a session with the cloud provider, such as AWS. Returns a type with the following attributes:   * `access_key_id` * `secret_access_key` * `token` |

To use the `_snowflake` module in your handler code, import it as you would another module.

Code in the following example retrieves the value set for the TYPE clause when the secret was created with CREATE SECRET. Here,
the `oauth_token` secret is of type OAUTH2.

```sqlexample-python
CREATE OR REPLACE FUNCTION get_secret_type()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'get_secret'
  EXTERNAL_ACCESS_INTEGRATIONS = (external_access_integration)
  SECRETS = ('cred' = oauth_token )
  AS
$$
import _snowflake

def get_secret():
  secret_type = _snowflake.get_secret_type('cred')
  return secret_type
$$;
```

Code in the following example retrieves the username and password held by the secret.

```sqlexample-python
CREATE OR REPLACE FUNCTION get_secret_username_password()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'get_secret_username_password'
  EXTERNAL_ACCESS_INTEGRATIONS = (external_access_integration)
  SECRETS = ('cred' = credentials_secret )
  AS
$$
import _snowflake

def get_secret_username_password():
  username_password_object = _snowflake.get_username_password('cred');

  username_password_dictionary = {}
  username_password_dictionary["Username"] = username_password_object.username
  username_password_dictionary["Password"] = username_password_object.password

  return username_password_dictionary
$$;
```

---
title: Application Packages in Declarative Sharing in the Native Application Framework
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/package.md
section: Developer Guide
---

# Application Packages in Declarative Sharing in the Native Application Framework

As a provider, you create an application package to bundle your data content and notebooks into a Declarative Native App.
This topic explains what an app package is and describes the high-level steps to create one, from creating the initial package to adding your manifest and notebook files.

## The application package and the live version

The app package is a container for all the files that make up the app, including the [manifest](manifest-reference.md) file and any [Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md). When you create an app package, a live (staged) version of the app package is also created. The live version is a development workspace where you can add or update files, such as the manifest file and notebook files, and preview and test the experience before publishing.

Once you’re satisfied with the live version of the app, you can commit the live version to create a new immutable version of the app package, that you can then publish.

Using the live version for development simplifies version management by maintaining a single immutable version of the app package that is ready to be published, and a single live version for development. The Snowflake Native App Framework automatically manages the versioning of the app package, so you don’t need to manually track version numbers.

The Snowflake Native App Framework maintains a live version for any app package automatically. Even if you remove the live version, a new live version is created automatically from the last committed version of the app package.

## Create an application package

Providers develop and test an **application (app) package**. An app package includes files necessary to share the data in the app, and defines how data can be accessed by consumers.

The process involves the following steps:

1. **Create an app package project** (first time only): creates an app package project that will later be published. This also creates the live version of the app package.
2. **Add content to the app package**:

   1. **Create or update a manifest file**: This file describes the app package and its contents.
   2. **Download notebook files**. If notebooks are to be included, download a copy to be included in the app package.
   3. **Add the files to the live version of the app package**.
3. **Build the app package**: allows you to verify that the manifest file is valid and that all links in the manifest file are correct.
4. **Test the app**. Install the app and try it out. Make changes, and rebuild.
5. **Commit the app package**: creates a new immutable version of the app that can be published.
6. **Release the app package**. With a released package, you can create a new listing, either privately or publicly on the Snowflake Marketplace.

This process is described in the [Tutorial: Getting started with Declarative Native Apps](tutorials/getting-started.md). This section includes additional details of options available at the different stages of development.

### Create a new application package

First, create a new Declarative Native App package to hold the app’s files, either via Snowsight or SQL commands from [Snowflake CLI](../snowflake-cli/index.md), using the `snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/` URL scheme.

SnowsightSnowflake CLI

To use Snowsight to create a new app package:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App Packages.
3. In the Share Data + Code card, select Create.
4. Enter a name for your app package, and then select Create.

To use SQL in [Snowflake CLI](../snowflake-cli/index.md) to create a new app package:

* Create a Declarative Native App package using the [CREATE APPLICATION PACKAGE … TYPE=DATA](command-reference.md) command, replacing `<DECL_SHARE_APP_PKG>` with the name you want to give the app package:

```snowcli
snow sql -q "CREATE APPLICATION PACKAGE <DECL_SHARE_APP_PKG> TYPE = DATA;"
```

The new empty app package is created. This also creates a live version of the app package that you can edit.

### Assemble content for the application package

An app package includes the following components:

* A [manifest](manifest-reference.md) file (required): A text-based file that defines the app’s structure.
* [Snowflake Notebook](../../user-guide/ui-snowsight/notebooks.md) files (optional): One or more text-based files that can act as a front end to the consumer experience, referencing the shared views and tables. They can also include code, reference visualizations, and include logic to help present the data.

### Create or update a manifest file

You can create or update a [manifest](manifest-reference.md) file, which describes the app package and its shared content–for example, notebooks, tables, and views. It defines other metadata, such as [app roles](app-roles.md) included with the app.

The manifest file must be named `manifest.yml`, and must be added to the root level of the app package.

For more information, see [Declarative Native App manifest reference](manifest-reference.md). The associated [Tutorial: Getting started with Declarative Native Apps](tutorials/getting-started.md) includes an example manifest file.

#### Create or update a manifest file from a Snowflake data share

> **Note:**
>
> The following content is not supported by Snowflake. All code is provided “AS IS” and without warranty.

If you have an existing data share in Snowflake, you can automatically create a manifest file using the open-source Manifest from Share tool. This Snowflake-provided tool generates a manifest file based on the objects in a specified share. The tool also includes options to customize the generated manifest file. You can use this tool in the following ways:

* Generate a manifest file using the command-line interface (CLI).
* Integrate the tool into an existing Python automation workflow as a library.

For more information about how to download and use the tool, see the [Snowflake Manifest from Share](https://github.com/snowflakedb/native-apps-examples/tree/main/snowflake-manifest-from-share-library) repository on GitHub.

> **Note:**
>
> The Manifest from Share tool only creates the manifest file using the data share’s databases, schemas, tables and views. The tool doesn’t include any other objects in the generated manifest file.

### Get notebook files

If [Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md) are to be included in the app, download a copy of each notebook file so you can include it in the app package.

From Snowsight:

1. In the navigation menu, select Projects » Notebooks, and then select the notebook you want to download.
2. In the left pane, next to your notebook, select … » Download.

The file is downloaded to your local machine, as a file named `<notebook_name>.ipynb`.

> **Note:**
>
> The notebook environment has a set of pre-installed Anaconda packages, including Python and Streamlit. If your notebook uses additional Anaconda packages, you must add those as packages to your notebook so they can be used in the consumer’s environment. For information about how to add Anaconda packages to your notebook, see the [Add Anaconda packages to a notebook](add-packages.md).

### Add files to the live version

Add the manifest and notebook files to the live version of the app package:

SnowsightSnowflake CLI

To use Snowsight to populate the app package:

1. If you’re not already viewing the app package’s listing, from the navigation menu, select Projects » App Packages, and then select the app package you want to add files to.
2. Select Upload files. (If you are replacing or adding additional files, select Manage files, and then select Upload files.)
3. Drag the notebook files and the manifest from your hard disk to the Upload files dialog where indicated, or select Browse to locate and select the files.
4. Select Upload to upload the files to the live stage and trigger a build.

1. To use SQL in Snowflake CLI to populate the app package, use the following commands, replacing the file paths with your own and `<DECL_SHARE_APP_PKG>` with the name of the app package:

   1. Manifest file:

      ```snowcli
      snow sql -q "PUT file:////Users/test_user/Documents/manifest.yml  snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/ OVERWRITE=TRUE AUTO_COMPRESS=false;"
      ```
   2. Notebook file:

      ```snowcli
      snow sql -q "PUT file:////Users/test_user/Documents/NOTEBOOK.ipynb  snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE OVERWRITE=TRUE AUTO_COMPRESS=false;"
      ```
2. Verify the files are in the application package with the following command:

   ```snowcli
   snow sql -q "LIST snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE"
   ```

   The output shows the files in the live version of the app package, similar to the following:

   ```output
   +--------------------------------------------------------------------------------+
   | name                          | size | md5     | last_modified                 |
   |-----------------------------------------+------+---------+---------------------|
   | /versions/live/manifest.yml   | 304  | 843a... | Wed, 23 Jul 2025 08:27:26 GMT |
   | /versions/live/NOTEBOOK.ipynb | 832  | b014... | Wed, 23 Jul 2025 04:32:22 GMT |
   +--------------------------------------------------------------------------------+
   ```

### Download a file from the application package

You can download a file from the app package using the [GET](../../sql-reference/sql/get.md) SQL command in Snowflake CLI:

> ```snowcli
> snow sql -q "GET snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/manifest.yml file://manifest.yml"
> ```

### Remove content from the application package

You can remove files from the application package.

SnowsightSnowflake CLI

Using Snowsight:

1. If you’re not already viewing the app package’s listing, from the navigation menu, select Projects » App Packages, and then select the app package you want to remove files from.
2. Select Manage files » Remove files.
3. Select the file(s) you want to remove, and then select Delete.
4. In the Remove files dialog, choose the files to remove, and then select Remove & build.

Using the REMOVE (or RM) SQL command in Snowflake CLI:

```snowcli
snow sql -q "RM snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/manifest.yml"
```

### Build the application package

Next, build a testable version of the app.

SnowsightSQL

In Snowsight:

* When you upload a complete set of files to the app package, a build kicks off automatically.
* To perform a build at any other time, select the Build button on the app package’s page.

If there are any errors in the manifest file, the build fails and gives information on how to fix the error. Correct the errors and rebuild the app package.

The [ALTER APPLICATION PACKAGE … BUILD](command-reference.md) command builds a testable version of the app package and verifies that the manifest file is valid and that all links work.

```sqlexample
ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> BUILD;
```

If there are any errors in the manifest file, the build fails and gives information on how to fix the error. Correct those errors and rebuild the app package.

The built app remains in the live state, and you can continue to make changes to the application package.

#### Skip ahead!

For updates that don’t require further testing, you can skip ahead by building, committing, and releasing an app package all at once using the [ALTER APPLICATION PACKAGE … RELEASE LIVE VERSION](command-reference.md) command.

```sqlexample
ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> RELEASE LIVE VERSION;
```

### Test the application

After building the app package, you can perform basic tests on it from the live environment.

Install the app from an app package using the command: [CREATE APPLICATION … FROM APPLICATION PACKAGE](../../sql-reference/sql/create-application.md), replacing `<DECL_SHARE_APP>` with the name of the app. For example:

```sqlexample
CREATE APPLICATION <DECL_SHARE_APP> FROM APPLICATION PACKAGE <DECL_SHARE_APP_PKG>
```

Update the files in the app package as needed, and then see if it worked by using the command: [ALTER APPLICATION PACKAGE … UPGRADE USING VERSION LIVE](../../sql-reference/sql/alter-application.md).

```sqlexample
ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> UPGRADE USING VERSION LIVE;
```

To test some features, such as app roles, you must first release a new version of the app package, and then test using a separate consumer account. For more information, see [Install a Declarative Native App](consumer/install.md) and [Access content in a Declarative Native App](consumer/access-app-content.md).

#### Optional: Reset edits on a live version

If the edits made to the live version of the app package are no longer needed, you can reset the app package to the state before the edits were made with the [ALTER APPLICATION PACKAGE … ABORT LIVE VERSION](command-reference.md) command.

```sqlexample
ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> ABORT LIVE VERSION;
```

When you use the preceding command to remove the current live version, a new live version is created with the same contents as the last committed version of the app package. The live version is reset to the last committed version, and all changes made to the live version are discarded.

### Commit and release the application package

Committing the app package builds a new immutable version of the app that can’t be edited and is ready to be published. Releasing the app package does the following:

* Makes a committed app ready to be shared with consumers.
* If the provider has already shared the app with consumers, the new version is automatically available to those consumers.
* If there’s already a live version of the app on the Snowflake Marketplace, the new version is automatically available to consumers who have installed the app.

SnowsightSQL

To use Snowsight to commit and release the app package:

1. If you’re not already viewing the app package’s listing, from the navigation menu, select Projects » App Packages, and then select the app package you want to release.
2. Select Commit & release.
3. In the confirmation dialog, select Acknowledge & continue.

The committed app package is released, and the live version of the app package is removed. A new live version of the app package is created from the new committed version for further development.

Once you’ve committed and released the app package, the Latest release tab shows the contents of the release, which is the same as the contents of the last build.

1. In SQL, use the [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application.md) commands as shown:

   ```sqlexample
   ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> COMMIT;
   ```

   The live version of the app package is removed. A new live version of the app package is created from the new committed version for further development.
2. Next, release the committed app package:

   ```sqlexample
   ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> RELEASE;
   ```

   You can also build, commit, and release a live version of the app package, all at once:

   ```sqlexample
   ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> RELEASE LIVE VERSION;
   ```

After you release the app package, you can create a new listing for the app, either privately or publicly on the Snowflake Marketplace. For more information, see [Creating a Listing using Declarative Sharing](listing.md).

---
title: Application roles: Allow consumers to share different views of the same data
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/app-roles.md
section: Developer Guide
---

# Application roles: Allow consumers to share different views of the same data

As a provider, you can enhance the consumer experience by including application roles in your Declarative Native App. An application role is a credential created within the context of a Declarative Native App. For information about application roles, see [About application roles](../native-apps/creating-setup-script.md).

Application roles isolate security for the application, so that the application’s specific security credentials don’t need to be managed within the consumer’s broader organizational security model. Using application roles, providers can control access to application resources simply. Consumer accounts can then grant access to application logic and data using simple SQL `GRANT` statements.

For example, if an application uses an **Operations** application role to access a log table, the consumer doesn’t need to maintain that application role outside the context of the application; they only need to know that they can share the application with their support team using the **Operations** application role.

Using the manifest file, you define application roles, and assign them to content in the app. When the consumer installs the app, they can share the content with their organization members by assigning the application roles to their account roles and users. Consumers can also create hierarchies of application roles by assigning application roles to other application roles.

Application roles allow consumers to share data in different ways with members of their organization. For example, an app can include two notebooks to present the data, with one that includes a full view of the data, and the other a filtered view.

The consumer app owner can then choose to share a filtered view with a team, while still having access to the full view for themselves.

When you use application roles to permit access to database resources, the child resources inherit the roles of their parent resources. For example, if you assign an application role to a schema, all tables and views in that schema inherit the role. If you assign an application role to a database, all schemas, tables, and views in that database inherit the role.

## Assign application roles to content in the manifest file

1. In the [manifest file](manifest-reference.md), in the [top-level roles field](manifest-reference.md), define the available application roles, for example, `sales`, `marketing`, and `operations`.

   ```yaml
   roles:
     - sales:
         comment: "The sales role provides access to the filtered view of the sales data."
     - marketing:
         comment: "The marketing role provides access to the filtered view of the marketing data."
     - operations:
         comment: "The operations role provides access to the full view of the data, including logs."
   ```
2. Assign app roles to the contents of the manifest file, using a list. for example, `roles: [sales, support]`:

   ```yaml
   - customer_table:
     roles: [sales,marketing] # Accessible to sales and marketing, app owners
   ```
3. To add a table, add the role to both `<named table>.roles` and to `<named schema>.roles` where the table resides.

   ```yaml
   schemas:
     - sales_table:
       roles: [sales]
   tables:
     - sales_table:
       roles: [sales]
   ```
4. To add a view, add the role to both `<named view>.roles` and to `<named schema>.roles` where the view resides.

   ```yaml
   schemas:
      - sales_view:
        roles: [sales]
   views:
      - sales_view:
        roles: [sales]
   ```
5. When adding a filtered view of a table, don’t add the underlying table; this prevents users from accessing the unfiltered data.
6. To include a notebook, add the role to `<named notebook>.roles`, and add the tables and views (and their underlying schemas) referenced in the notebook.

   ```yaml
   notebooks:
     - SALES_NB:
       main_file: ALL-DATA.ipynb
        roles: [sales]
        comment: Accessible to sales and app owners, references full view of the sales data
   ```
7. When adding a notebook that references a filtered view of a table, don’t add the underlying table; this prevents users accessing the unfiltered data.
8. To give an object no app roles, either leave the field empty (`[]`) or omit it. These objects are only accessible by the app owner and roles with [granted IMPORTED PRIVILEGES](consumer/install.md).

> ```yaml
> - my_schema:
>   roles: [] # Accessible to app owners only
> ```

Example manifest file:

```yaml
roles:
  - sales:
      comment: "The sales role provides access to the filtered view of the sales data."
  - marketing:
      comment: "The marketing role provides access to the filtered view of the marketing data."
  - operations:
      comment: "The operations role provides access to the full view of the log data."

application_content:
  notebooks:
    - SALES_NB:
        main_file: ALL-DATA.ipynb
        roles: [sales]
        comment: Accessible to sales and app owners, references full view of the sales data

    - MARKETING_NB:
        main_file: FILTERED.ipynb
        roles: [marketing] #
        comment: Accessible to marketing and app owners, references filtered view of the marketing data

shared_content:
  databases:
    - my_database:
        schemas:
          - my_schema:
              roles: [] # Accessible to app owners
              tables:
                - sales_table:
                    roles: [sales] # Accessible to sales, app owners
                - marketing_table:
                    roles: [marketing] # Accessible to marketing, app owners
                - customer_table:
                    roles: [sales,marketing] # Accessible to sales and marketing, app owners
                - logs_table:
                    roles: [operations] # Accessible to operations and app owners
              views:
                - sales_view:
                    roles: [sales]   # Accessible to sales and app owners
                - marketing_view:
                    roles: [marketing] # Accessible to marketing and app owners
                - customer_view:
                    roles: [sales,marketing] # Accessible to sales, marketing, and app owners
                - operations_view:
                    roles: [operations] # Accessible to operations and app owners
```

Later, when the consumer installs the app, they’ll have access to both notebooks, the tables, and the views.

To share the operations view with their support team, they grant the **operations** application role to their support team organization role.

```sqlexample
GRANT APPLICATION ROLE customer_app.operations TO ROLE support_team_west;
```

Consumer team members with the `support_team_west` role can see the **logs** table, but they can’t see the notebooks in the **Available Notebooks** tab in Snowsight, or access the **sales** and **customers** tables and views.

To share the sales view with their sales team, they grant the **sales** application role to their sales organization role.

```sqlexample
GRANT APPLICATION ROLE customer_app.sales TO ROLE sales_team_east;
```

Consumer team members with the `sales_team_east` role can see the notebook in the **Available Notebooks** tab in Snowsight. They can’t see the **logs** table, but can access the **sales** and **customers** tables and views.

For more information about how consumers share roles, see [Share access to the app](consumer/install.md).

---
title: Authenticating connections
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-authenticate.md
section: Developer Guide
---

# Authenticating connections

To authenticate to Snowflake, you can use one of the following options:

* Password-based authentication

  To use this method, set the `password` option when establishing the connection.
* Single sign-on (SSO) through a web browser
* Native SSO through Okta
* Key pair authentication
* OAuth
* MFA

Additionally, the Snowflake Node.js driver supports the ability to cache SSO and MFA tokens. For more information, see Authentication token caching.

## Use single sign-on (SSO) through a web browser

If you have [configured Snowflake to use single sign-on (SSO)](../../user-guide/admin-security-fed-auth-overview.md), you can configure
your client application to use browser-based SSO for authentication.

In your application code:

1. Set the `authenticator` option to `EXTERNALBROWSER`.
2. To establish a connection, call the `connect` or `connectAsync` method.

For example:

```javascript
// Use a browser to authenticate via SSO.
const connection = snowflake.createConnection({
  ...,
  authenticator: 'EXTERNALBROWSER'
});

// Establish a connection.
connection.connect((err, conn) => {
  if (err) {
    ... // Handle any errors.
  } else {
    // Execute SQL statements.
    const statement = connection.execute({...});
  }
});
```

For more information about using browser-based SSO for authentication, see [Browser-based SSO](../../user-guide/admin-security-fed-auth-use.md).

## Use native SSO through Okta

If you have [configured Snowflake to use single sign-on (SSO)](../../user-guide/admin-security-fed-auth-overview.md) through Okta, you can
configure your client application to use native SSO authentication through Okta.

In your application code:

1. Set the following options:

   * Set the `authenticator` option to the Okta URL endpoint for your Okta account (e.g.
     `https://<okta_account_name>.okta.com`).
   * Set the `username` and `password` options to the user name and password for your Identity Provider (IdP).
2. To establish a connection, call the `connect` or `connectAsync` method.

For example:

```javascript
// Use native SSO authentication through Okta.
const connection = snowflake.createConnection({
  ...,
  username: '<user_name_for_okta>',
  password: '<password_for_okta>',
  authenticator: 'https://myaccount.okta.com'
});

// Establish a connection.
connection.connect((err, conn) => {
  if (err) {
    ... // Handle any errors.
  } else {
    // Execute SQL statements.
    const statement = connection.execute({...});
  }
});
```

For more information about using native SSO authentication through Okta, see [Native SSO — Okta only](../../user-guide/admin-security-fed-auth-use.md).

## Use key-pair authentication and key-pair rotation

The driver supports key pair authentication and key rotation. To use key-pair authentication and key rotation, follow the steps below:

1. Configure key pair authentication, as explained in [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).
2. In your application code:

   1. Set the `authenticator` option to `SNOWFLAKE_JWT`.
   2. Use the private key to authenticate in one of the following ways:

      * Set the `privateKey` option to the private key.
      * Set the `privateKeyPath` option to the path to the private key file.

        If the file is encrypted, you must also set the `privateKeyPass` option to the passphrase to decrypt the private
        key.

> The following example loads the private key from a file and sets the `privateKey` option to the private key:
>
> ```javascript
> import crypto from 'crypto';
> import fs from 'fs';
>
> // Read the private key file from the filesystem.
> const privateKeyFile = fs.readFileSync('<path_to_private_key_file>/rsa_key.p8');
>
> // Get the private key from the file as an object.
> const privateKeyObject = crypto.createPrivateKey({
>   key: privateKeyFile,
>   format: 'pem',
>   passphrase: 'passphrase'
> });
>
> // Extract the private key from the object as a PEM-encoded string.
> const privateKey = privateKeyObject.export({
>   format: 'pem',
>   type: 'pkcs8'
> });
>
> // Use the private key for authentication.
> const connection = snowflake.createConnection({
>   ...
>   authenticator: 'SNOWFLAKE_JWT',
>   privateKey: privateKey
> });
>
> // Establish a connection.
> connection.connect((err, conn) => {
>   ... // Handle any errors.
> });
>
> // Execute SQL statements.
> const statement = connection.execute({...});
> ```
>
> The following example sets the `privateKeyPath` option to an encrypted private key file and sets the
> `privateKeyPass` option to the passphrase used to decrypt the private key:
>
> ```javascript
> // Use an encrypted private key file for authentication.
> // Specify the passphrase for decrypting the key.
> const connection = snowflake.createConnection({
>   ...
>   authenticator: 'SNOWFLAKE_JWT',
>   privateKeyPath: '<path-to-privatekey>/privatekey.p8',
>   privateKeyPass: '<passphrase_to_decrypt_the_private_key>'
> });
>
> // Establish a connection.
> connection.connect((err, conn) => {
>   ... // Handle any errors.
> });
>
> // Execute SQL statements.
> const statement = connection.execute({...});
> ```

## Use OAuth

To connect using OAuth, set the `authenticator` option to `OAUTH` and the `token` option to the OAuth access
token. For example:

```javascript
// Use OAuth for authentication.
const connection = snowflake.createConnection({
  ...
  authenticator: 'OAUTH',
  token: '<your_oauth_token>'
});

// Establish a connection.
connection.connect((err, conn) => {
  ... // Handle any errors.
});

// Execute SQL statements.
const statement = connection.execute({...});
```

For more information, see [Clients, drivers, and connectors](../../user-guide/oauth-intro.md).

## Use the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials.

To enable the OAuth 2.0 Authorization Code flow:

1. Set the `authenticator` connection parameter to `oauth_authorization_code`.
2. Set the following OAuth connection parameters:

   > * `oauthClientId`: Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `oauthClientSecret`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `oauthAuthorizationUrl`: Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `oauthTokenRequestUrl`: Identity provider endpoint supplying the access tokens to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `oauthScope`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.
   > * `oauthRedirectUri`: URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

## Use the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data.

To enable the OAuth 2.0 Client Credentials flow:

1. Set the `authenticator` connection parameter to `oauth_client_credentials`.
2. Set the following OAuth connection parameters:

   > * `oauthClientId`: Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `oauthClientSecret`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata)
   > * `oauthTokenRequestUrl`: Identity provider endpoint supplying the access tokens to the driver.
   > * `oauthScope`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

## Authenticate with workload identity federation (WIF)

[Workload identity federation](../../user-guide/workload-identity-federation.md) provides a service-to-service authentication method for Snowflake. This method enables applications, services, or containers to authenticate with Snowflake by leveraging their cloud provider’s native identity system, such as AWS IAM, Microsoft Entra ID, or Google Cloud service accounts. This approach eliminates the need for managing long-lived credentials and simplifies credential acquisition compared to other methods like External OAuth. Snowflake connectors are designed to automatically obtain short-lived credentials from the platform’s identity provider.

To enable the Workload Identity Federation authenticator, do the following:

1. Set the `authenticator` connection parameter to `WORKLOAD_IDENTITY`.
2. Set the `workloadIdentityProvider` connection parameter to `AWS`, `AZURE`, `GCP`, or `OIDC`, based on your platform.
3. For OpenID Connect (OIDC), specify the `token` connection parameter.

## Use an MFA passcode

> **Note:**
>
> This feature requires Snowflake Node.js driver version 1.13.1 or higher.

You can connect to Snowflake by passing a multi-factor authentication (MFA) passcode instead of waiting for an external confirmation, such as a push notification from Duo. The driver provides the following ways to specify an MFA passcode:

* Set the `passcodeInPassword` option to `true` and include the passcode as part of the password string, similar to the following:

  ```javascript
  const connection = snowflake.createConnection({
    account: '<account_identifier>',
    username: '<username>',
    ...
    authenticator: 'USERNAME_PASSWORD_MFA',
    password: 'abc123987654', // passcode 987654 is part of the password
    passcodeInPassword: true // because passcodeInPassword is true
  });
  ```
* Set the `passcode` option to the value of the passcode to specify the password and the passcode separately, similar to the following:

  ```javascript
  const connection = snowflake.createConnection({
    account: '<account_identifier>',
    username: '<username>',
    ...
    authenticator: 'USERNAME_PASSWORD_MFA',
    password: 'abc123', // password and MFA passcode are input separately
    passcode: '987654'
  });
  ```

  To use this approach, ensure that the `passcodeInPassword` option is `false` (the default value).

> **Note:**
>
> If you enable the `passcodeInPassword` option and set the `passcode` option, the `passcodeInPassword` option takes precedence.

For more information about these options, see [passcode](nodejs-driver-options.md).

## Authentication token caching

The Snowflake Node.js driver provides the ability to cache SSO and MFA tokens.

> **Important:**
>
> Token caching is disabled by default. Caching tokens locally increases security risks. Because tokens do not expire for four hours, someone who accesses a token on a local system can impersonate the token owner until the token naturally expires. Consequently, before choosing to cache tokens, consider the following:
>
> * Be aware and mindful of the potential risks.
> * Consult with your internal security and compliance personnel to check whether your organization’s policies permit token caching.
> * With the default settings, the file that stores the cached tokens is written in your `$HOME` directory, or in a path you configure. You are responsible for the security of the data in the designated directory.
> * You are responsible to ensure that the file has proper permissions to be accessed only by the file owner.

### Cache SSO (ID) tokens

An SSO (ID) token is generated from the request when you connect to Snowflake with external browser authentication. Caching SSO (ID) tokens on the client driver’s side only works if the server allows them to be cached. Caching SSO tokens can be enabled on the server-side with executing the following SQL statement, as described in [Using SSO with client applications that connect to Snowflake](../../user-guide/admin-security-fed-auth-use.md):

```sqlexample
ALTER ACCOUNT SET ALLOW_ID_TOKEN = TRUE;
```

To use an SSO token cache in the Node.js driver, set the following options in the `snowflake.createConnection()` call:

* Set `authenticator` to `EXTERNALBROWSER`. For details, see [Authentication options](nodejs-driver-options.md).
* Set `clientStoreTemporaryCredential` to `true`.

```javascript
const connection = snowflake.createConnection({
  account: '<account_identifier>',
  username: '<username>',
  authenticator: 'EXTERNALBROWSER',
  clientStoreTemporaryCredential: true
});
```

When enabled, driver uses the cached token for subsequent connections until the token expires. If the driver opens a browser to authenticate the connection again, either the driver cannot find the token information in the local credential storage or the token has expired.

### Cache MFA tokens

An MFA token is generated from the request when you connect to Snowflake with USERNAME_PASSWORD_MFA authentication. Caching MFA tokens on the client driver’s side only works if the server allows them to be cached. Caching MFA tokens can be enabled on the server-side with executing the following SQL statement, as described in [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md):

```sqlexample
ALTER ACCOUNT SET ALLOW_CLIENT_MFA_CACHING = TRUE;
```

To use an MFA token cache in the Node.js driver, set the following options in the `snowflake.createConnection()` call:

* Set `authenticator` to `USERNAME_PASSWORD_MFA`. For details, see [Authentication options](nodejs-driver-options.md).
* Set `clientRequestMFAToken` to `true`.

```javascript
const connection = snowflake.createConnection({
  account: '<account_identifier>',
  username: '<username>',
  password: '<password>',
  authenticator: 'USERNAME_PASSWORD_MFA',
  clientRequestMFAToken: true
});
```

When enabled, driver uses the cached token for subsequent connections until the token expires. If the driver reaches out to the MFA provider again, either the driver cannot find the token information in the local credential storage or the token has expired.

### Use the default credential manager

The Snowflake Node.js driver provides a credential manager and credential storage. By default, the driver stores cached tokens in your `$HOME` directory.

If you want to store the cached tokens in an alternate location, you can specify the desired location in the `credentialCacheDir` parameter of the `snowflake.createConnection()` function. You can specify either a relative or absolute path, as shown below:

* Relative path

  ```javascript
  const connection = snowflake.createConnection({
    credentialCacheDir: '../../<folder name>'
  });
  ```
* Absolute path

  ```javascript
  const connection = snowflake.createConnection({
    credentialCacheDir: 'C:\\<folder name>\\<subfolder name>'
  });
  ```

If you do not configure `credentialCacheDir`, the Snowflake Node.js driver uses `$HOME/temporary_credential.json` to store the credentials.

### Use a custom credential manager

The Snowflake node.js driver provides a default credential manager, which uses a local JSON file to store the credentials. When no credential manager is explicitly configured, the driver will use this default credential manager.

If you prefer not to use the default credential manager, you can create a custom credential manager. A custom credential manager must meet the following requirements:

* It must minimally contain `read`, `write`, and `remove` functions. You can include other functions as well.
* It must be an `object` data type.

The following example shows a template for minimal custom credential manager.

```javascript
const sampleCustomManager = {
  read: function (key) {
    // (do something with the key)
    return token;
  },
  write: function (key, token) {
    // (do something with the key and token)
  },
  remove: function (key) {
    // (do something with the key)
  }
};
```

After completing your custom credential manager, you can configure it for the driver in the `snowflake.configure()` method, as shown. This example reflects MFA tokens, though you can also create custom credential managers for SSO tokens.

```javascript
import snowflake from 'snowflake-sdk';
import myCredentialManager from '<your custom credential manager module>';

snowflake.configure({
  customCredentialManager: myCredentialManager
});

const connection = snowflake.createConnection({
  account: '<account_identifier>',
  username: '<username>',
  password: '<password>',
  authenticator: 'USERNAME_PASSWORD_MFA',
  clientRequestMFAToken: true
});
```

Although the Snowflake Node.js driver provides a plugin-like interface to implement and use custom credential managers, Snowflake is not responsible for creating, implementing, or supporting custom credential managers for the customers.

---
title: Authenticating to the server
source: https://docs.snowflake.com/en/developer-guide/sql-api/authenticating.md
section: Developer Guide
---

# Authenticating to the server

This topic describes how to authenticate to the server when using the Snowflake SQL API.

When you send a request, the request must include authentication information. The next sections explain how to add
this information to the request:

* Using OAuth
* Using key-pair authentication

## Using OAuth

To use OAuth, follow these steps:

1. Set up OAuth for authentication.

   See [Introduction to OAuth](../../user-guide/oauth-intro.md) for details on how to set up OAuth and get an OAuth token.
2. Use Snowflake CLI to verify that you can use a generated OAuth token to connect to Snowflake:

   * For Linux and MacOS systems
   > ```snowcli
   > $ snow connection test --account <account_identifier> --user <user> --authenticator=oauth --token=<oauth_token>
   > ```

   * For Windows systems
   > ```snowcli
   > $ snow connection test --account <account_identifier> --user <user> --authenticator=oauth --token=<oauth_token>
   > ```
3. In each API request you send, set the following headers:

   * `Authorization: Bearer oauth_token`

     where `oauth_token` is the generated OAuth token.
   * (Optional) `X-Snowflake-Authorization-Token-Type: OAUTH`

     If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.

     Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:

     + `KEYPAIR_JWT` (for key-pair authentication)
     + `OAUTH` (for OAuth)
     + `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md))

## Using key-pair authentication

To use key pair authentication, follow these steps:

1. Set up key-pair authentication.

   As part of this process, you must:

   1. Generate a public-private key pair. The generated private key should be in a file (e.g. named `rsa_key.p8`).
   2. Assign the public key to your Snowflake user. After you assign the key to the user, run the
      [DESCRIBE USER](../../sql-reference/sql/desc-user.md) command. In the output, the `RSA_PUBLIC_KEY_FP` property should be set to the fingerprint of the public key assigned to the user.

   For instructions on how to generate the key pair and assign a key to a user,
   see [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md). For language-specific examples of creating a fingerprint and generating a
   JWT token, see the following:

   > * Python
   > * Java
   > * Node.js
2. Use Snowflake CLI to verify that you can use the generated private key to
   [connect to Snowflake](../snowflake-cli/connecting/configure-connections.md):

   ```snowcli
   $ snow connection generate-jwt --account <account_identifier> --user <user> --private-key-path <path>/rsa_key.p8
   ```

   The command prompts you for a private key passphrase to complete the connection. You can avoid the prompt by providing the passphrase in the `PRIVATE_KEY_PASSPHRASE` environment variable.
3. In your application code:

   1. Generate the fingerprint (a SHA-256 hash) of the public key for the user. Prefix the fingerprint with `SHA256:`.
      For example:

      `SHA256:hash`

      You can also execute the SQL [DESCRIBE USER](../../sql-reference/sql/desc-user.md) command to get the value from
      the RSA_PUBLIC_KEY_FP property.
   2. Generate [a JSON Web Token (JWT)](https://en.wikipedia.org/wiki/JSON_Web_Token) with the following fields in the payload:

      | Field | Description | Example |
      | --- | --- | --- |
      | `iss` | Issuer of the JWT. Set it to the following value:  `account_identifier.user.SHA256:public_key_fingerprint`  where:  * `account_identifier` is your Snowflake [account identifier](../../user-guide/admin-account-identifier.md).  If you are using the [account locator](../../user-guide/admin-account-identifier.md), exclude any region information from   the account locator. * `user` is your Snowflake user name. * `SHA256:public_key_fingerprint` is the fingerprint that you generated in the previous step. **Note:** The `account_identifier` and `user` values must use all uppercase characters. If your account ID contains periods (`.`), you must replace them with hyphens (`-`), as periods in an account identifier cause the JWT to be invalid. | `MYORGANIZATION-MYACCOUNT.MYUSER.SHA256:public_key_fingerprint` |
      | `sub` | Subject for the JWT. Set it to the following value:  `account_identifier.user` | `MYORGANIZATION-MYACCOUNT.MYUSER` |
      | `iat` | Issue time for the JWT in UTC. Set the value to the current time value as either seconds or milliseconds. | `1615370644` (seconds) . `1615370644000` (milliseconds) |
      | `exp` | Expiration time for the JWT in UTC. You can specify the value as either seconds or milliseconds.    **Note:** The JWT is valid for at most one hour after the token is issued, even if you specify a longer expiration time. | `1615374184` (seconds) . `1615374184000` (milliseconds) |
   3. In each API request that you send, set the following headers:

      * `Authorization: Bearer JWT`

        where `JWT` is the token that you generated.
      * (Optional) `X-Snowflake-Authorization-Token-Type: KEYPAIR_JWT`

        If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.

        Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:

        + `KEYPAIR_JWT` (for key-pair authentication)
        + `OAUTH` (for OAuth)
        + `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md))

### Python example

The following sections describe how to generate a JWT and fingerprint using Python.

For an example of generating a JWT in Python, see [`sql-api-generate-jwt.py`](../../_downloads/aeb84cdfe91dcfbd889465403b875515/sql-api-generate-jwt.py). The
`sql-api-generate-jwt.py` example uses the [PyJWT module](https://pypi.org/project/PyJWT/), which you can install by running:

> ```bash
> pip install pyjwt
> ```

#### Generating a JWT in Python

The following sections of code demonstrate how to generate a JWT. For a full example,
see [`sql-api-generate-jwt.py`](../../_downloads/aeb84cdfe91dcfbd889465403b875515/sql-api-generate-jwt.py).

> **Note:**
>
> This example is intended for use as a reference only. Do not use this code in production applications or environments.
>
> > ```python
> > from datetime import timedelta, timezone, datetime
> >
> > # This example relies on the PyJWT module (https://pypi.org/project/PyJWT/).
> > import jwt
> >
> > # Construct the fully qualified name of the user in uppercase.
> > # - Replace <account_identifier> with your account identifier.
> > #   (See https://docs.snowflake.com/en/user-guide/admin-account-identifier.html .)
> > # - Replace <user_name> with your Snowflake user name.
> > account = "<account_identifier>"
> >
> > # Get the account identifier without the region, cloud provider, or subdomain.
> > if not '.global' in account:
> >     idx = account.find('.')
> >     if idx > 0:
> >         account = account[0:idx]
> >     else:
> >         # Handle the replication case.
> >         idx = account.find('-')
> >         if idx > 0:
> >             account = account[0:idx]
> >
> > # Use uppercase for the account identifier and user name.
> > account = account.upper()
> > user = "<user_name>".upper()
> > qualified_username = account + "." + user
> >
> > # Get the current time in order to specify the time when the JWT was issued and the expiration time of the JWT.
> > now = datetime.now(timezone.utc)
> >
> > # Specify the length of time during which the JWT will be valid. You can specify at most 1 hour.
> > lifetime = timedelta(minutes=59)
> >
> > # Create the payload for the token.
> > payload = {
> >
> >     # Set the issuer to the fully qualified username concatenated with the public key fingerprint (calculated in the  previous step).
> >     "iss": qualified_username + '.' + public_key_fp,
> >
> >     # Set the subject to the fully qualified username.
> >     "sub": qualified_username,
> >
> >     # Set the issue time to now.
> >     "iat": now,
> >
> >     # Set the expiration time, based on the lifetime specified for this object.
> >     "exp": now + lifetime
> > }
> >
> > # Generate the JWT. private_key is the private key that you read from the private key file in the previous step when you generated the public key fingerprint.
> > encoding_algorithm="RS256"
> > token = jwt.encode(payload, key=private_key, algorithms=encoding_algorithm)
> >
> > # If you are using a version of PyJWT prior to 2.0, jwt.encode returns a byte string, rather than a string.
> > # If the token is a byte string, convert it to a string.
> > if isinstance(token, bytes):
> >   token = token.decode('utf-8')
> > print(token)
> > decoded_token = jwt.decode(token, key=private_key.public_key(), algorithms=[encoding_algorithm])
> > print("Generated a JWT with the following payload:\n{}".format(decoded_token))
> > ```

#### Generating a fingerprint in Python

The following sections of code demonstrate how to generate the fingerprint. For a full example, see
[`sql-api-generate-jwt.py`](../../_downloads/aeb84cdfe91dcfbd889465403b875515/sql-api-generate-jwt.py).

> ```python
> from cryptography.hazmat.primitives.serialization import load_pem_private_key
> from cryptography.hazmat.primitives.serialization import Encoding
> from cryptography.hazmat.primitives.serialization import PublicFormat
> from cryptography.hazmat.backends import default_backend
> ..
> import base64
> from getpass import getpass
> import hashlib
> ..
> # If you generated an encrypted private key, implement this method to return
> # the passphrase for decrypting your private key. As an example, this function
> # prompts the user for the passphrase.
> def get_private_key_passphrase():
>     return getpass('Passphrase for private key: ')
>
> # Private key that you will load from the private key file.
> private_key = None
>
> # Open the private key file.
> # Replace <private_key_file_path> with the path to your private key file (e.g. /x/y/z/rsa_key.p8).
> with open('<private_key_file_path>', 'rb') as pem_in:
>     pemlines = pem_in.read()
>     try:
>         # Try to access the private key without a passphrase.
>         private_key = load_pem_private_key(pemlines, None, default_backend())
>     except TypeError:
>         # If that fails, provide the passphrase returned from get_private_key_passphrase().
>         private_key = load_pem_private_key(pemlines, get_private_key_passphrase().encode(), default_backend())
>
> # Get the raw bytes of the public key.
> public_key_raw = private_key.public_key().public_bytes(Encoding.DER, PublicFormat.SubjectPublicKeyInfo)
>
> # Get the sha256 hash of the raw bytes.
> sha256hash = hashlib.sha256()
> sha256hash.update(public_key_raw)
>
> # Base64-encode the value and prepend the prefix 'SHA256:'.
> public_key_fp = 'SHA256:' + base64.b64encode(sha256hash.digest()).decode('utf-8')
> ```

### Snowflake CLI example

You can use the Snowflake CLI `snow connection generate-jwt` command to generate a JWT for key-pair authentication. For more information, see [snow connection generate-jwt](../snowflake-cli/command-reference/connection-commands/generate-jwt.md).

This example generates a token for account `TEST` and user `JDOE`, using the private key from `rsa_key.p8`:

```snowcli
snow connection generate-jwt --user JDOE --account TEST --private-key-file=rsa_key.p8
```

The command prompts you for a private key passphrase to complete the connection. You can avoid the prompt by providing the passphrase in the `PRIVATE_KEY_PASSPHRASE` environment variable.

### Java example

For an example of generating a JWT in Java, see
[`SimpleStatementsApi.java`](../../_downloads/7e213524766700040e775708363bd176/SimpleStatementsApi.java).

> **Note:**
>
> This example is intended for use as a reference only. Do not use this code in production applications or environments.

This example uses the following third-party libraries:

* [Swagger Codegen](https://swagger.io/tools/swagger-codegen/download/): an open source library useful in developing REST
  APIs and applications.
* [Auth0](https://auth0.com/docs/libraries): provides Java APIs for authentication and generating JWT tokens.

### Node.js example

For an example of generating a JWT in Node.js, see
[`sql-api-generate-jwt.js`](../../_downloads/f9ab0412f4093929578d63b5096a83c3/sql-api-generate-jwt.js).

> **Note:**
>
> This example is intended for use as a reference only. Do not use this code in production applications or environments.

---
title: Calling a stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-calling.md
section: Developer Guide
---

# Calling a stored procedure

You can call a stored procedure in one of several ways.

## Tools for calling procedures

Choose the tool for calling the procedure.

| Language | Approach |
| --- | --- |
| **SQL**  Execute a SQL command, such as by using Snowsight. | Execute the SQL CALL command to call a procedure. |
| **Java, Python, or Scala with Snowpark**  Write code locally in one of the supported languages, having the call execute in Snowflake. | Execute client code that uses Snowpark APIs in one of the following languages.   * [Java](../snowpark/java/creating-sprocs.md) * [Python](../snowpark/python/creating-sprocs.md) * [Scala](../snowpark/scala/creating-sprocs.md) |
| **Command line**  Create and manage Snowflake entities by executing commands from the command line. | Execute commands of the [Snowflake CLI](../snowflake-cli/index.md):   * [To execute SQL commands](../snowflake-cli/command-reference/sql-commands/sql.md). * [To execute Snowpark commands](../snowflake-cli/command-reference/snowpark-commands/execute.md). |
| **Python**  On the client, write code that executes management operations on Snowflake. | Execute code that uses the [Snowflake Python API](../snowflake-python-api/snowflake-python-managing-functions-procedures.md). |
| **RESTful APIs** (language-agnostic)  Make requests of RESTful endpoints to create and manage Snowflake entities. | Make a request to create a procedure using the [Snowflake REST API](../snowflake-rest-api/procedure/procedure-introduction.md) |

Once you have the privileges to call the stored procedure, you can use a CALL statement to call the stored procedure.

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

## Usage notes

* Procedure names are not necessarily unique within the schema; stored procedures are identified and resolved by their arguments types as well
  as their names (that is, stored procedures can be overloaded).
* Outside of a [Snowflake Scripting block](../snowflake-scripting/blocks.md), the value returned by the stored
  procedure cannot be used, because the call cannot be part of an expression.

  In a Snowflake Scripting block, you can specify `INTO :snowflake_scripting_variable` to capture the return value from
  the stored procedure in a Snowflake Scripting variable.
* Stored procedures are not atomic; if one statement in a stored procedure fails, the other statements in the stored
  procedure are not necessarily rolled back. For information about stored procedures and transactions, see
  [Transaction management](stored-procedures-usage.md).
* You can also create and call an anonymous procedure using [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md).

## Calling a stored procedure with SQL

If the stored procedure has arguments, you can specify those arguments by name or by position.

For example, the following stored procedure accepts three arguments:

```sqlexample
CREATE OR REPLACE PROCEDURE sp_concatenate_strings(
    first_arg VARCHAR,
    second_arg VARCHAR,
    third_arg VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SQL
  AS
  $$
  BEGIN
    RETURN first_arg || second_arg || third_arg;
  END;
  $$;
```

When calling the procedure, you can specify the arguments by name:

```sqlexample
CALL sp_concatenate_strings(
  first_arg => 'one',
  second_arg => 'two',
  third_arg => 'three');
```

```output
+------------------------+
| SP_CONCATENATE_STRINGS |
|------------------------|
| onetwothree            |
+------------------------+
```

If you specify the arguments by name, you do not need to specify the arguments in any particular order:

```sqlexample
CALL sp_concatenate_strings(
  third_arg => 'three',
  first_arg => 'one',
  second_arg => 'two');
```

```output
+------------------------+
| SP_CONCATENATE_STRINGS |
|------------------------|
| onetwothree            |
+------------------------+
```

You can also specify the arguments by position:

```sqlexample
CALL sp_concatenate_strings(
  'one',
  'two',
  'three');
```

```output
+------------------------+
| SP_CONCATENATE_STRINGS |
|------------------------|
| onetwothree            |
+------------------------+
```

> **Note:**
>
> * You must either specify all arguments by name or by position. You can’t specify some of the arguments by name and other
>   arguments by position.
> * When you specify an argument by name, you can’t use double quotes around the argument name.
> * If two procedures have the same name but different argument types, you can use the argument names to specify
>   which procedure to execute, if the argument names are different. For more information, see
>   [Overloading procedures and functions](../udf-stored-procedure-naming-conventions.md).

### Specifying optional arguments

If the stored procedure has [optional arguments](../udf-stored-procedure-arguments.md), you can omit the optional
arguments in the call. Each optional argument has a default value that is used when the argument is omitted.

For example, the following stored procedure has one required argument and two optional arguments. Each optional argument has a
default value.

```sqlexample
CREATE OR REPLACE PROCEDURE build_string_proc(
    word VARCHAR,
    prefix VARCHAR DEFAULT 'pre-',
    suffix VARCHAR DEFAULT '-post'
  )
  RETURNS VARCHAR
  LANGUAGE SQL
  AS
  $$
    BEGIN
      RETURN prefix || word || suffix;
    END;
  $$
  ;
```

You can omit any of the optional arguments in the call. When you omit an argument, the default value of the argument is used.

```sqlexample
CALL build_string_proc('hello');
```

```output
+-------------------+
| BUILD_STRING_PROC |
|-------------------|
| pre-hello-post    |
+-------------------+
```

```sqlexample
CALL build_string_proc('hello', 'before-');
```

```output
+-------------------+
| BUILD_STRING_PROC |
|-------------------|
| before-hello-post |
+-------------------+
```

If you need to omit an optional argument and specify another optional argument that appears after the omitted argument in the
signature, use named arguments, rather than positional arguments.

For example, suppose that you want to omit the `prefix` argument and specify the `suffix` argument. The `suffix` argument
appears after the `prefix` in the signature, so you must specify the arguments by name:

```sqlexample
CALL build_string_proc(word => 'hello', suffix => '-after');
```

```output
+-------------------+
| BUILD_STRING_PROC |
|-------------------|
| pre-hello-after   |
+-------------------+
```

---
title: Canceling the execution of a SQL statement
source: https://docs.snowflake.com/en/developer-guide/sql-api/cancelling-requests.md
section: Developer Guide
---

# Canceling the execution of a SQL statement

To cancel the execution of a statement, send a `POST` request to the cancel endpoint. See
[POST /api/v2/statements/{statementHandle}/cancel](reference.md) for details.

```none
POST /api/v2/statements/{statementHandle}/cancel
```

The following flow chart illustrates the steps that you take to cancel a request.

---
title: Capturing messages from unhandled exceptions
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/unhandled-exception-messages.md
section: Developer Guide
---

# Capturing messages from unhandled exceptions

By default, when you’ve [set up an event table](event-table-setting-up.md), Snowflake automatically logs
unhandled exceptions in procedure and UDF handlers in the event table. Capturing these messages does not require that you add handler
code specific to logging or tracing. You can disable this feature so that unhandled exceptions aren’t automatically logged.

> **Important:**
>
> Error messages can contain sensitive information. Consider disabling this feature if you don’t want potentially sensitive
> information captured in an event table. To learn more, see Protecting sensitive data.

## Configuring logging and tracing to capture unhandled exceptions

Set log or trace level so that Snowflake captures entries for unhandled exceptions. You can have entries captured as log entries, trace
event entries, or both.

* To capture messages as log entries, [set the log level](telemetry-levels.md) to `ERROR` or
  more verbose.
* To capture messages as trace event entries, [set the trace level](telemetry-levels.md) to
  `ALWAYS` or `ON_EVENT`.

## Data captured for unhandled exceptions

You can capture message data as a log entry, a trace event, or both. The captured data will differ between log and trace event entries.

### Data captured in a log entry

By default, Snowflake records the following in the event table for unhandled exceptions in procedure and UDF handlers:

| Column | Data |
| --- | --- |
| [RECORD column](event-table-columns.md) | A `severity_text` attribute whose value is the highest-severity error level for the current language runtime. For example, for a handler written in Python, the value is `FATAL`. |
| [RECORD_ATTRIBUTES column](event-table-columns.md) | The following attributes are recorded for an unhandled exception.   * `exception.message` – The error message. * `exception.type` – The name of the exception’s class. * `exception.stacktrace` – The exception’s stack trace formatted by a language runtime. * `exception.escaped` – `true` if this entry is from an unhandled exception. |
| [VALUE column](event-table-columns.md) | The string `exception`. |

#### Example

Code in the following example queries an event table for log data recorded for an unhandled exception from a UDF handler.

For more about querying an event table for log data, see [Viewing log messages](logging-accessing-messages.md).

```sqlexample
SET event_table_name = 'my_db.public.my_event_table';

SELECT
  RECORD['severity_text'] AS severity,
  RECORD_ATTRIBUTES['exception.message'] AS error_message,
  RECORD_ATTRIBUTES['exception.type'] AS exception_type,
  RECORD_ATTRIBUTES['exception.stacktrace'] AS stacktrace
FROM
  my_event_table
WHERE
  RECORD_TYPE = 'LOG';
```

The following is possible output from the query.

```output
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| SEVERITY | ERROR_MESSAGE                                        | EXCEPTION_TYPE | STACKTRACE                                                                                                                                          |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| "FATAL"  | "could not convert string to float: '$1,000,000.00'" | "ValueError"   | "Traceback (most recent call last):\n  File \"_udf_code.py\", line 6, in compute\nValueError: could not convert string to float: '$1,000,000.00'\n" |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

### Data captured in a trace event entry

By default, Snowflake records the following in the event table for unhandled exceptions in procedure and UDF handlers:

| Column | Data |
| --- | --- |
| [RECORD column](event-table-columns.md) | A `name` attribute whose value is `exception` and a `status` attribute whose value is `STATUS_CODE_ERROR`. |
| [RECORD_ATTRIBUTES column](event-table-columns.md) | The following attributes are recorded for an unhandled exception.   * `exception.message` – The error message. * `exception.type` – The name of the exception’s class. * `exception.stacktrace` – The exception’s stack trace formatted by a language runtime. * `exception.escaped` – `true` if this entry is from an unhandled exception. |

#### Examples

Code in the following examples query an event table for trace event data recorded for an unhandled exception from a UDF handler.

For more about querying an event table for trace event data, see [Viewing trace data](tracing-accessing-events.md).

##### Span example

```sqlexample
SET event_table_name = 'my_db.public.my_event_table';

SELECT
  RECORD['status']['code'] AS span_status
FROM
  my_event_table
WHERE
  record_type = 'SPAN';
```

The following is possible output from the query.

```output
-----------------------
| SPAN_STATUS         |
-----------------------
| "STATUS_CODE_ERROR" |
-----------------------
```

##### Span event example

```sqlexample
SET event_table_name = 'my_db.public.my_event_table';

SELECT
  RECORD['name'] AS event_name,
  RECORD_ATTRIBUTES['exception.message'] AS error_message,
  RECORD_ATTRIBUTES['exception.type'] AS exception_type,
  RECORD_ATTRIBUTES['exception.stacktrace'] AS stacktrace
FROM
  my_event_table
WHERE
  RECORD_TYPE = 'SPAN_EVENT';
```

The following is possible output from the query.

```output
-----------------------------------------------------------------------------------------------------------------------------------------
| EVENT_NAME  | ERROR_MESSAGE                                        | EXCEPTION_TYPE | STACKTRACE                                      |
-----------------------------------------------------------------------------------------------------------------------------------------
| "exception" | "could not convert string to float: '$1,000,000.00'" | "ValueError"   | "  File \"_udf_code.py\", line 6, in compute\n" |
-----------------------------------------------------------------------------------------------------------------------------------------
```

## Protecting sensitive data

Given that log and trace messages from unhandled exceptions can include sensitive data, consider doing the following to protect that data:

* Take steps to protect sensitive data, such as by doing the following:

  + Improve your exception handling code to minimize the risk of unhandled exceptions.
  + Apply [row access policies](../../user-guide/security-row-intro.md) to your event table to restrict access to rows that contain
    personally identifiable information (PII).
  + [Create a view](../../sql-reference/sql/create-view.md) on top of the event table and
    [apply masking policies](../../sql-reference/sql/create-masking-policy.md) to it to mask or delete personally identifiable
    information (PII).
* Turn off unhandled exception logging by setting the [ENABLE_UNHANDLED_EXCEPTIONS_REPORTING](../../sql-reference/parameters.md) parameter to `false`.

---
title: Choosing whether to write a stored procedure or a user-defined function
source: https://docs.snowflake.com/en/developer-guide/stored-procedures-vs-udfs.md
section: Developer Guide
---

# Choosing whether to write a stored procedure or a user-defined function

This topic describes key differences between stored procedures and UDFs, including differences in how each may be invoked and in what
they may do.

At a high level, stored procedures and UDFs differ in how they are typically used, as described below.

| Stored Procedure Purpose | User-Defined Function Purpose |
| --- | --- |
| Generally to perform administrative operations by executing SQL statements. The body of a stored procedure is allowed, but not required, to explicitly return a value (such as an error indicator). | Calculate and return a value. A function always returns a value explicitly by specifying an expression. For example, the body of a JavaScript UDF must have a `return` statement that returns a value. |

## When to create a stored procedure or a UDF

In general, when deciding whether to create a stored procedure or UDF, consider the following recommendations:

| Create a Stored Procedure When… | Create a UDF When… |
| --- | --- |
| * You’re migrating an existing stored procedure from another application/system. * You need to perform DDL or DML database operations:    + Administrative tasks, including DDL such as deleting temporary tables, deleting data older than `N` days, or adding users.   + DML statements (UPDATE statements, for example) | * You’re migrating an existing UDF from another application/system. * You need a function that can be called as part of a SQL statement and that must return a value that will be used in the statement. * Your output needs to include a value for every input row or every group. For example:  ```sqlexample   SELECT MyFunction(col1) FROM table1;   ``` * You need to perform simple queries with SQL, such as SELECT statements. |

## Supported handler languages

When you write a procedure or UDF, you write its logic as a handler in one of the supported languages. The following table lists the
supported languages.

| Stored Procedures | User-Defined Functions |
| --- | --- |
| [Java](stored-procedure/java/procedure-java-overview.md) | [Java](udf/java/udf-java-introduction.md) |
| [JavaScript](stored-procedure/stored-procedures-javascript.md) | [JavaScript](udf/javascript/udf-javascript-introduction.md) |
| [Python](stored-procedure/python/procedure-python-overview.md) | [Python](udf/python/udf-python-introduction.md) |
| [Scala](stored-procedure/scala/procedure-scala-overview.md) | [Scala](udf/scala/udf-scala-introduction.md) |
| [Snowflake Scripting](snowflake-scripting/index.md) | [SQL](udf/sql/udf-sql-introduction.md) or [Snowflake Scripting](snowflake-scripting/index.md) |

## Usage and behavior differences

The following sections describe specific differences in the behaviors supported by procedures and UDFs.

### UDFs return a value; stored procedures need not

* A UDF always returns a value explicitly by specifying an expression. A UDF’s purpose is to calculate and return a value.
  For example, the body of a JavaScript UDF must have a `return` statement that returns a value.
* A stored procedure is allowed, but not required, to explicitly return a value (such as an error indicator). The purpose of a stored
  procedure generally is to perform administrative operations by executing SQL statements. If a procedure does not explicitly return a
  value, then it implicitly returns NULL.

  Note that every CREATE PROCEDURE statement must include a RETURNS clause that specifies a return type, even if the procedure
  does not explicitly return anything. If a procedure does not explicitly return a value, then it implicitly returns NULL.

  Code in the following example declares a return type for the procedure with a RETURNS clause, but a value is only returned in the
  case of an error. In other words, not every code path returns a value.

  ```sqlexample
  CREATE OR REPLACE PROCEDURE do_stuff(input NUMBER)
  RETURNS VARCHAR
  LANGUAGE SQL
  AS
  $$
  DECLARE
    ERROR VARCHAR DEFAULT 'Bad input. Number must be less than 10.';

  BEGIN
    IF (input > 10) THEN
      RETURN ERROR;
    END IF;

    -- Perform an operation that doesn't return a value.

  END;
  $$
  ;
  ```

### UDF return values are directly usable in SQL; stored procedure return values may not be

If you are not calling the stored procedure from a Snowflake Scripting block, you cannot use the value returned by a stored
procedure directly in SQL (unlike the value returned by a function). The syntax of the CALL command does not provide a place to
store the returned value or a way to operate on it or pass the value to another operation. In other words, the following statement
is not a valid SQL statement:

```sqlexample
y = stored_procedure1(x);                         -- Not allowed.
```

If you call a stored procedure within a [Snowflake Scripting block](snowflake-scripting/blocks.md),
you can [capture the value returned by the stored procedure](stored-procedure/stored-procedures-snowflake-scripting.md)
in a [Snowflake Scripting variable](snowflake-scripting/variables.md).

You can also indirectly use the return value of a stored procedure (outside of a Snowflake Scripting block), as described in the following
list:

* You can call the stored procedure inside another stored procedure. For example, when the stored procedure handler is written in JavaScript,
  the JavaScript in the outer stored procedure can retrieve and store the output of the inner stored procedure. Remember, however, that
  the outer stored procedure (and each inner stored procedure) is still unable to return more than one value to its caller.
* You can call a stored procedure that returns tabular data in the
  [FROM clause of a SELECT statement](stored-procedure/stored-procedures-selecting-from.md).
* You can call the stored procedure and then call the [RESULT_SCAN](../sql-reference/functions/result_scan.md) function and pass it
  the statement ID generated for the stored procedure.
* You can store a result set in a temporary table or permanent table, and use that table after returning from the
  stored procedure call.
* If the volume of data is not too large, you can store multiple rows and multiple columns in a VARIANT (for
  example, as a JSON value) and return that VARIANT.

### UDFs can be called in the context of another statement; stored procedures are called independently

* A UDF evaluates to a value and can be used in contexts in which a general expression can be used, such as the following:

  ```sqlexample
  SELECT MyFunction_1(column_1) FROM table1;
  ```
* A stored procedure does not evaluate to a value, and cannot be used in all contexts in which a general expression can be used.
  For example, you cannot execute `SELECT my_stored_procedure()...`.

  You call a stored procedure as an independent statement, as in the following example:

  ```sqlexample
  CALL MyStoredProcedure_1(argument_1);
  ```

For more details about calling functions and procedures, see the following:

* [Calling a stored procedure](stored-procedure/stored-procedures-calling.md)
* [Executing a UDF](udf/udf-calling-sql.md)

### Multiple UDFs may be called with one statement; a single stored procedure is called with one statement

* A single SQL statement can call multiple UDFs.
* A single SQL statement can call only one stored procedure.

  Similarly, a stored procedure, unlike a UDF, cannot be called as part of an expression. However, inside a stored procedure, the stored
  procedure can call another stored procedure, or call itself recursively. For example, see the code examples section
  [Examples](stored-procedure/stored-procedures-javascript.md).

For more details about calling functions and procedures, see the following:

* [Calling a stored procedure](stored-procedure/stored-procedures-calling.md)
* [Executing a UDF](udf/udf-calling-sql.md)

### UDFs may access the database with simple queries only; stored procedures can execute DDL and DML statements

* In a UDF, you can use SQL to execute queries only (not DML or DDL statements).
* Within a stored procedure, you can execute database operations, such as SELECT, UPDATE, and CREATE:

  + For example, in a JavaScript stored procedure, you can use the JavaScript API to perform these operations.

    The example below shows how a stored procedure can create and execute a SQL statement that calls another stored
    procedure. The `$$` indicates the beginning and end of the JavaScript handler code in the stored procedure.

    ```sqlexample-javascript
    CREATE PROCEDURE ...
      $$
      // Create a Statement object that can call a stored procedure named
      // MY_PROCEDURE().
      var stmt1 = snowflake.createStatement( { sqlText: "call MY_PROCEDURE(22)" } );
      // Execute the SQL command; in other words, call MY_PROCEDURE(22).
      stmt1.execute();
      // Create a Statement object that executes a SQL command that includes
      // a call to a UDF.
      var stmt2 = snowflake.createStatement( { sqlText: "select MY_UDF(column1) from table1" } );
      // Execute the SQL statement and store the output (the "result set") in
      // a variable named "rs", which we can access later.
      var rs = stmt2.execute();
      // etc.
      $$;
    ```
  + In a [Snowflake Scripting](snowflake-scripting/index.md) stored procedure, you can execute SQL statements.

    The example below shows how a stored procedure can create and execute a SQL statement that calls another stored
    procedure. The `$$` indicates the beginning and end of the Snowflake Scripting code in the stored procedure.

    ```sqlexample
    CREATE PROCEDURE ...
      -- Call a stored procedure named my_procedure().
      CALL my_procedure(22);
      -- Execute a SQL statement that includes a call to a UDF.
      SELECT my_udf(column1) FROM table1;
    ```

---
title: Collecting metrics data
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/metrics.md
section: Developer Guide
---

# Collecting metrics data

You can better understand stored procedure and UDF resource consumption by using CPU and memory metrics that Snowflake generates.
With this information, you can troubleshoot errors and performance issues. The metrics data is stored in your account event table.

After you’ve collected data in the event table, you can access the data for analysis via SQL or in Snowsight. For more information,
see [Viewing metrics data](metrics-viewing-data.md).

> **Note:**
>
> Before you can collect metrics data, you must [enable telemetry data collection](logging-tracing-enabling.md).
> You don’t need to add code to emit metrics data. Snowflake generates the data and collects it in an event table.

## Level for metrics data

You can specify whether to collect metrics data in the event table by setting the metric level. Be sure to set the level so that data
will be collected.

For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

When you’ve collected data, you can [view metric data](tracing-accessing-events.md)
by using a graphical tool or by querying the event table with SQL.

## Supported languages

You can collect metrics from code written in the following languages, including when handler code is written with
[Snowpark APIs](../snowpark/index.md).

| Language / Type | Java | Python | JavaScript | Scala | Snowflake Scripting |
| --- | --- | --- | --- | --- | --- |
| Stored procedure handler | ✔ | ✔ |  |  |  |
| Streamlit app | ✔ | ✔ |  |  |  |
| UDF handler (scalar function) | ✔ | ✔ |  |  |  |
| UDTF handler (table function) | ✔ | ✔ |  |  |  |

### Metrics data from handler code

Snowflake automatically captures metrics data when your code is executed. You don’t need to make any changes to your handler code.

For more information, see [Emitting metrics data from handler code](metrics-handler.md)

## Viewing metrics data

You can view collected metrics data either through Snowsight or by querying the event table where data is stored. For more
information, see [Viewing metrics data](metrics-viewing-data.md).

---
title: Common setup for Snowflake Python APIs tutorials
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/tutorials/common-setup.md
section: Developer Guide
---

Snowflake

Getting Started

App Development

Data Engineering

# Common setup for Snowflake Python APIs tutorials

## Introduction

This topic provides instructions for the common setup required for all Snowflake Python APIs tutorials available in this documentation.

### Overview of the Snowflake Python APIs

Before starting your setup, take a look at the Snowflake Python APIs structure. The following table lists some common modules in the API:

| Module | Description |
| --- | --- |
| `snowflake.core` | Defines an iterator to represent certain resource instances fetched from the Snowflake database. |
| `snowflake.core.database` | Manages Snowflake databases. |
| `snowflake.core.schema` | Manages Snowflake schemas. |
| `snowflake.core.table` | Manages Snowflake tables. |
| `snowflake.core.task` | Manages Snowflake tasks. |
| `snowflake.core.task.dagv1` | A set of APIs at a higher level than the task APIs in `snowflake.core.task` to more conveniently manage task graphs (DAGs). |
| `snowflake.core.compute_pool` | Manages compute pools in Snowpark Container Services. |
| `snowflake.core.image_repository` | Manages image repositories in Snowpark Container Services. |
| `snowflake.core.service` | Manages services in Snowpark Container Services. |

For a complete list of the APIs currently available, see the
[API reference documentation](https://docs.snowflake.com/developer-guide/snowflake-python-api/reference/latest/index).

The `snowflake.core` module represents the entry point to the core Snowflake Python APIs that manage Snowflake objects. To use the
API, you follow a common pattern:

1. Establish a session using Snowpark or a Python Connector connection, representing your connection to Snowflake.
2. Import and instantiate the `Root` class from `snowflake.core`, and pass the Snowpark session object as an argument.

   You use the resulting `Root` object to access the rest of the methods and types in the API.

For more information about the programming model of the API, see [Snowflake Python APIs: General concepts](../snowflake-python-general-concepts.md).

The following code is an example of what this pattern typically looks like:

```python
from snowflake.snowpark import Session
from snowflake.core import Root

session = Session.builder.config("connection_name", "default").create()
root = Root(session)
```

For more information about various connection options and attributes, see
[Connect to Snowflake with the Snowflake Python APIs](../snowflake-python-connecting-snowflake.md).

> **Note:**
>
> The Snowflake Python APIs can establish a connection to Snowflake using either a Snowpark session or a Python Connector connection. The
> preceding example uses a Snowpark session.

Continue to the next step to start setting up the API and your development environment!

## Install the Snowflake Python APIs

> **Important:**
>
> The Snowflake Python APIs currently supports the following versions of Python:
>
> Generally available versions:
>
> * 3.9 (deprecated)
> * 3.10
> * 3.11
> * 3.12
> * 3.13

Before installing the API, you need to activate a Python environment.

In this tutorial, you can use conda or a virtual environment (venv).

1. To create and activate a conda or virtual environment, open a command-line terminal and run the following commands:

   condavenv

   ```bash
   conda create -n <env_name> python==3.10
   conda activate <env_name>
   ```

   ```bash
   python3 -m venv '.venv'
   source '.venv/bin/activate'
   ```
2. The Snowflake Python APIs package is available in PyPI.

   To install the API package in the new conda or virtual environment, run the following command:

   ```bash
   pip install snowflake -U
   ```
3. To install the `snowflake-snowpark-python` package, run the following command:

   ```bash
   pip install 'snowflake-snowpark-python>=1.5.0,<2.0.0'
   ```

   In these tutorials, you use the `snowflake.snowpark.Session` object from the [Snowpark API for Python](../../snowpark/python/index.md)
   to create a connection to Snowflake.

## Set up your development environment

This tutorial walks through code examples that you can run in a Jupyter notebook. Each step in the tutorial incrementally showcases the
capabilities of the Snowflake Python APIs.

You start by setting up your development environment so that you can run the code examples in a notebook.

1. Create a file named `$HOME/.snowflake/connections.toml` with the following connection parameters, and update it with your real
   credentials:

   ```toml
   [default]
   account = "<YOUR ACCOUNT NAME>"
   user = "<YOUR ACCOUNT USER>"
   password = "<YOUR ACCOUNT USER PASSWORD>"
   # optional
   # warehouse = "<YOUR COMPUTE WH>"
   # optional
   # database = "<YOUR DATABASE>"
   # optional
   # schema = "<YOUR SCHEMA>"
   ```

   > **Note:**
   >
   > The `account` parameter does not support [account identifiers](../../../user-guide/admin-account-identifier.md) with
   > underscores. You must specify an account identifier with dashes in place of any underscores. For more information, see
   > [Account name in your organization](../../../user-guide/admin-account-identifier.md).

   This example specifies these parameters as the default connection to Snowflake in your environment by creating a connection
   definition named `default`.
2. Use one of the following methods to open a notebook:

   * Open a new notebook in a code editor that supports Jupyter notebooks (such as Visual Studio Code).
   * To open a notebook in your browser, start a notebook server with the command `jupyter notebook`.

     To ensure that your environment can run a notebook, run `conda install notebook` in your terminal before starting the
     notebook server.
3. In the first cell of the notebook, run the following import statements:

   ```python
   from datetime import timedelta

   from snowflake.snowpark import Session
   from snowflake.snowpark.functions import col
   from snowflake.core import Root, CreateMode
   from snowflake.core.database import Database
   from snowflake.core.schema import Schema
   from snowflake.core.stage import Stage
   from snowflake.core.table import Table, TableColumn, PrimaryKey
   from snowflake.core.task import StoredProcedureCall, Task
   from snowflake.core.task.dagv1 import DAGOperation, DAG, DAGTask
   from snowflake.core.warehouse import Warehouse
   ```

   > **Note:**
   >
   > After running this cell, you might be prompted to set your Python kernel. If you activated a conda environment, select conda as the
   > Python kernel (for example, something similar to: `~/miniconda3/envs/<your conda env>/bin/python`).

   In this cell, you import Snowpark and the core APIs that manage Snowflake objects.
4. To establish a connection to Snowflake, in the next cell, run the following code:

   ```python
   session = Session.builder.config("connection_name", "default").create()
   ```

   In this cell, you create a Snowpark session and set the connection parameters for your session by specifying the `connection_name` as
   the `default` connection definition that you previously configured.
5. To create a `Root` object, pass your `session` object to the `Root` constructor:

   ```python
   root = Root(session)
   ```

And that’s it! By running the code in these three cells, you’re now ready to use the Snowflake Python APIs.

### What’s next?

You can now explore [Tutorial 1: Create a database, schema, table, and warehouse](tutorial-1.md).

---
title: Configuring log levels and files
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-logs.md
section: Developer Guide
---

# Configuring log levels and files

The Node.js driver supports two types of loggers to track activity:

* Browser logger, which stores logs in an in-memory buffer within the browser.
* Node logger, which by default, stores logs in a `snowflake.log` file and displays them in the console.

You can use the following code to switch from the browser logger to the node logger. This example switches to the node logger and sends messages to the console.

```javascript
Logger.setInstance(new NodeLogger({ logFilePath: 'STDOUT'}));
```

## Supported log levels

The Node.js driver supports the following log levels:

* OFF
* ERROR
* WARNING
* INFO
* DEBUG
* TRACE

## Configure the default logging behavior

You can configure standard logging by calling `snowflake.configure`, similar to the following:

```javascript
import snowflake from 'snowflake-sdk';

snowflake.configure({
  logLevel: 'INFO',
  logFilePath: '/some/path/log_file.log',
  additionalLogToConsole: false
});
```

where:

* `logLevel` is the desired logging level.
* `logFilePath` is the location of the log file or `STDOUT` for console output.
* `additionalLogToConsole` is a Boolean value that indicates whether to send log messages also to the console when a `logFilePath` is specified. Default: `true`.

## Use easy logging while debugging your code

When debugging an application, increasing the log level can provide more granular information about what the application is doing.
The Easy Logging feature simplifies debugging by letting you change the log level and the log file destination using a configuration file (default: `sf_client_config.json`).

You typically change log levels only when debugging your application.

This configuration file uses JSON to define the `log_level` and `log_path` logging parameters, as follows:

```json
{
  "common": {
    "log_level": "INFO",
    "log_path": "/some-path/some-directory"
  }
}
```

where:

* `log_level` is the desired logging level.
* `log_path` is the location to store the log files. The driver automatically creates a `nodejs` sub-directory in the specified `log_path`. For example, if you set `log_path` to `/Users/me/logs`, the drivers creates the `/Users/me/logs/nodejs` directory and stores the logs there.

The driver looks for the location of the configuration file in the following order:

* `clientConfigFile` connection parameter, containing the full path to the configuration file, such as the following:

  ```javascript
  import snowflake from 'snowflake-sdk';

  const connection = snowflake.createConnection({
    account: account,
    username: user,
    password: password,
    application: application,
    clientConfigFile: '/some/path/client_config.json'
  });
  ```
* `SF_CLIENT_CONFIG_FILE` environment variable, containing the full path to the configuration file (for example, `export SF_CLIENT_CONFIG_FILE=/some_path/some-directory/client_config.json`).
* Node.js driver installation directory, where the file must be named `sf_client_config.json`.
* User’s home directory, where the file must be named `sf_client_config.json`.

> **Note:**
>
> To enhance security, the driver requires the logging configuration file on Unix-style systems to limit file permissions to allow only the file owner to modify the files (such as `chmod 0600` or `chmod 0644`).

To minimize the number of searches for a configuration file, the driver reads the configuration file only:

* for the first connection.
* for the first connection using the `clientConfigFile` parameter.

---
title: Configuring the JDBC Driver
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-configure.md
section: Developer Guide
---

# Configuring the JDBC Driver

This topic describes how to configure the JDBC driver, including how to
connect to Snowflake using the driver.

> **Note:**
>
> The connection parameters are now documented in the [JDBC Driver connection parameter reference](jdbc-parameters.md).

## JDBC Driver class

Version 4.xVersion 3.x

Use `net.snowflake.client.api.driver.SnowflakeDriver` as the driver class in your JDBC application.

> **Note:**
>
> * Don’t reference any other Snowflake classes or methods in your application code because they are subject to change in the future to implement improvements and fixes.
> * The previous driver class, `net.snowflake.client.api.driver.SnowflakeDriver`, is still supported but is deprecated (meaning it will be removed in a future release). Any code that references the previous class name will continue to work, but you should update the code to reference the new class name because the change has been implemented.

Use `net.snowflake.client.jdbc.SnowflakeDriver` as the driver class in your JDBC application.

> **Note:**
>
> * Don’t reference any other Snowflake classes or methods in your application code because they are subject to change in the future to implement improvements and fixes.
> * The previous driver class, `com.snowflake.client.jdbc.SnowflakeDriver`, is still supported but is deprecated (meaning it will be removed in a future release, TBD).
>   Any code that references the previous class name will continue to work, but you should update the code to reference the new class name because the
>   change has been implemented.

## JDBC Driver connection string

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

Using the JDBC driver to connect to Snowflake requires a connection string with the syntax described below.

You can generate the basic connection string in Snowsight. For information, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

> **Note:**
>
> You cannot set the [SEARCH_PATH](../../sql-reference/parameters.md) parameter within a JDBC client connection string. You must
> establish a session before setting a search path.

### Syntax

```none
jdbc:snowflake://<account_identifier>.snowflakecomputing.com/?<connection_params>
```

### Connection parameters

> **Note:**
>
> For documentation on individual connection parameters, see the [JDBC Driver connection parameter reference](jdbc-parameters.md).

`<account_identifier>`
:   Specifies the account identifier for your Snowflake account. For details, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).
    For examples of the account identifier used in a JDBC connection string, see Examples.

`<connection_params>`
:   Specifies a series of one or more [JDBC connection parameters](jdbc-parameters.md)
    and [session parameters](../../sql-reference/parameters.md), in the form of `<param>=<value>`, with
    each parameter separated by the ampersand character (`&`), and no spaces anywhere in the connection string.

    If you need to set parameter values that use spaces, ampersands (`&`), equals signs (`=`), or other special characters, you
    should [URL-encode](https://en.wikipedia.org/wiki/Percent-encoding) the special characters. For example, if you need to
    specify a value that contains a space, ampersand, and equals sign in the [query_tag](../../sql-reference/parameters.md) session parameter:

    ```none
    String connectionURL = "jdbc:snowflake://myorganization-myaccount.snowflakecomputing.com/?query_tag='folder=folder1 folder2&'
    ```

    encode the space as `%20`, the ampersand as `%26`, and the equals sign as `%3D`:

    ```none
    String connectionURL = "jdbc:snowflake://myorganization-myaccount.snowflakecomputing.com/?query_tag='folder%3Dfolder1%20folder2%26'
    ```

    As an alternative, rather than specifying these parameters in the connection string, you can set these parameters in a
    `Properties` object that you pass to the `DriverManager.getConnectionIO` method.

    ```java
    Properties props = new Properties();
    props.put("parameter1", parameter1Value);
    props.put("parameter2", parameter2Value);
    Connection con = DriverManager.getConnection("jdbc:snowflake://<account_identifier>.snowflakecomputing.com/", props);
    ```

> **Note:**
>
> For documentation on individual connection parameters, see the [JDBC Driver connection parameter reference](jdbc-parameters.md).

### Other parameters

Any session parameter can be included in the connection string. For example:

> `BROWSER_RESPONSE_TIMEOUT=<Integer>`
> :   Specifies the timeout, in seconds, to wait for a successful authentication from an external browser.
>
>     Default is `120`.
>
> `CLIENT_OUT_OF_BAND_TELEMETRY_ENABLED=<Boolean>`
> :   Specifies whether to enable out-of-band telemetry.
>
>     Default is `true`.
>
> `CLIENT_SESSION_KEEP_ALIVE=<Boolean>`
> :   Specifies whether to keep the current session active after a period of inactivity, or to force the user to login again. If the value is `true`, Snowflake keeps the session active indefinitely,
>     even if there is no activity from the user. If the value is `false`, the user must log in again after four hours of inactivity.
>
>     Default is `false`.
>
> `CLIENT_SESSION_KEEP_ALIVE_HEARTBEAT_FREQUENCY=<Integer>`
> :   Specifies the number of seconds (900-3600) in-between client attempts to update the token for the session.
>
>     Default is `3600`.

> `net.snowflake.jdbc.commons_logging_wrapper`
> :   Specifies how to handle logs from commons logging. Possible values are:
>
>     * `ALL`: All logs from common logging are passed to `SFLogger` (`java.util.logging` or SLF4J is used internally).
>     * `Default`: All logs from commons logging are forwarded to `java.util.logging`, and no logs are forwarded to the SLF4J logger.
>     * `OFF`: No logs from commons logging are forwarded. You can use this value if you need to replace commons logging with the SLF4J bridge when using a thin JAR file.
>
> `JDBC_QUERY_RESULT_FORMAT=JSON`
> :   Specifies `JSON` as the result format to use while fetching or processing the results of a query sent to Snowflake.
>
>     Default is `Arrow`.

For descriptions of all the session parameters, see [Parameters](../../sql-reference/parameters.md).

### Examples

The following is an example of the connection string that uses the
[account name as an identifier](../../user-guide/admin-account-identifier.md) for the account `myaccount` in the organization
`myorganization`.

> ```none
> jdbc:snowflake://myorganization-myaccount.snowflakecomputing.com/?user=peter&warehouse=mywh&db=mydb&schema=public
> ```

The following is an example of a connection string that uses the [account locator](../../user-guide/admin-account-identifier.md)
`xy12345` as the account identifier:

> ```none
> jdbc:snowflake://xy12345.snowflakecomputing.com/?user=peter&warehouse=mywh&db=mydb&schema=public
> ```

Note that this example uses an account in the AWS US West (Oregon) region. If the account is in a different region or if the
account uses a different cloud provider, you need to
[specify additional segments after the account locator](../../user-guide/admin-account-identifier.md).

## Connecting using the `connections.toml` file

The JDBC driver lets you add connection definitions to a `connections.toml` configuration file.
A connection definition refers to a collection of connection-related parameters. The driver currently supports TOML version 1.0.0.

For more information about `toml` file formats, see [TOML (Tom’s Obvious Minimal Language)](https://toml.io/en/).

The connection string prefix: `jdbc:snowflake:auto` tells the driver to look for the connection configuration within the predefined (default) files.
The JDBC driver looks for the `connections.toml` file in the following locations, in order:

* If a `~/.snowflake` directory exists on your machine, Snowflake CLI uses the
  `~/.snowflake/connections.toml` file.
* Location specified in the `SNOWFLAKE_HOME` environment variable.
* Otherwise, Snowflake CLI uses the `connections.toml` file in the one of the following locations, based on your operating system:

  > + Linux: `~/.config/snowflake/connections.toml`, but you can update it with XDG vars
  > + Windows: `%USERPROFILE%\AppData\Local\snowflake\connections.toml`
  > + Mac: `~/Library/Application Support/snowflake/connections.toml`

You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
[Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

If you want to switch between multiple existing connections, you can configure them in the `connections.toml` file. The default key is `default`, but you change the name of the default connection by setting the `SNOWFLAKE_DEFAULT_CONNECTION_NAME` shell environment variable.

The following sample `connections.toml` files defines three connections:

```output
[default]
account = 'my_organization-my_account'
user = 'test_user'
warehouse = 'testw'
database = 'test_db'
schema = 'test_nodejs'
protocol = 'https'
port = '443'
authenticator = 'oauth'
token_file_path = '/Users/test/.snowflake/token'

[production]
account = 'my_organization-my_account'
user = 'prod_user'
warehouse = 'prodw'
database = 'prod_db'
schema = 'prod_nodejs'
protocol = 'https'
port = '443'
authenticator = 'oauth'
token_file_path = '/Users/test/.snowflake/token'

[aws-oauth-file]
account = 'my_organization-my_account'
user = 'test_user'
warehouse = 'testw'
database = 'test_db'
schema = 'test_nodejs'
protocol = 'https'
port = '443'
authenticator = 'oauth'
token_file_path = '/Users/test/.snowflake/token'
```

### Specifying a connection to use in the `auto` connection prefix

You can specify which connection configuration to use by appending the connection name to the `auto` prefix in the connection string. Continuing the previous example, to connect with `aws-oauth-file`, use the following connection string:

```none
jdbc:snowflake:auto?connectionName=aws-oauth-file
```

This connection string tells the JDBC driver to look for the `aws-oauth-file` connection definition in the `connections.toml` file.

The driver determines which connection to use in the following order:

1. Connection name specified in the connection string using the `jdbc:snowflake:auto?connectionName=<connection_name_in_toml_file>` syntax
2. Connection name specified in the `SNOWFLAKE_DEFAULT_CONNECTION_NAME` shell environment variable
3. The default connection name, `default`

If the connection name specified in the connection string does not exist in the `connections.toml` file, the driver does the following:

* Logs a message indicating the missing name and the file path checked.
* Throws an exception containing the name of the missing connection, in the following format:

  > `The Connection <connection name> not found in connections.toml file.`
* Terminates the connection attempt.

## Using single sign-on (SSO) for authentication

If you have [configured Snowflake to use single sign-on (SSO)](../../user-guide/admin-security-fed-auth-overview.md), you can configure
your client application to use SSO for authentication. See [Using SSO with client applications that connect to Snowflake](../../user-guide/admin-security-fed-auth-use.md) for details.

## Using multi-factor authentication

Snowflake supports caching MFA tokens, including combining MFA token caching with SSO.

For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md).

## Using key pair authentication and key rotation

The Snowflake JDBC driver supports key pair authentication and key rotation. This authentication method requires a 2048-bit (minimum) RSA key pair.

To start, complete the initial configuration for key pair authentication as shown in [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).

Next, choose one of the following three options to configure either the JDBC connection properties or the JDBC connection string.

1. Specify the private key via the privateKey property in the connection properties.
2. Specify the private key file name and password for that file as separate properties in the connection properties.
3. Specify the private key file name and password for that file as part of the connection string.

These options are described in more detail in the next three sections.

### `privateKey` property in connection properties

This section provides an example of setting the `privateKey` property to a private key in a file.

This example uses the [Bouncy Castle Crypto APIs](https://www.bouncycastle.org/java.html). In order to compile and run this
example, you must include the following JAR files in your classpath:

* the provider JAR file (`bcprov-jdkversions.jar`)
* the PKIX / CMS / EAC / PKCS / OCSP / TSP / OPENSSL JAR file (`bcpkix-jdkversions.jar`)

where `versions` specifies the versions of the JDK that the JAR file supports.

To use this example:

1. Copy the sample code below, and replace the following placeholder values:

   | Placeholder | Description |
   | --- | --- |
   | `path/rsa_key.p8` | Set this to the path and name of the private key file that you generated earlier. |
   | `private_key_passphrase` | If you generated an encrypted key, implement the `getPrivateKeyPassphrase()` method to return the passphrase for decrypting that key. |
   | `account_identifier` | Set this to your [account identifier](../../user-guide/gen-conn-config.md). |
   | `user` | Set this to your Snowflake login name. |
   | `database_name` | Set this to the name of the database that you want to use. |
   | `schema_name` | Set this to the name of the schema that you want to use. |
   | `warehouse_name` | Set this to the name of the warehouse that you want to use. |
   | `role` | Set this to the name of the role that you want to use. |
2. Compile and run the sample code. Include the Bouncy Castle JAR files in the classpath.

   For example, on Linux and macOS:

   ```bash
   javac -cp bcprov-jdk<versions>.jar:bcpkix-jdk<versions>.jar TestJdbc.java

   java -cp .:snowflake-jdbc-<ver>.jar:bcprov-jdk<versions>.jar:bcpkix-jdk<versions>.jar TestJdbc.java
   ```

   On Windows:

   ```bash
   javac -cp bcprov-jdk<versions>.jar;bcpkix-jdk<versions>.jar TestJdbc.java

   java -cp .;snowflake-jdbc-<ver>.jar;bcprov-jdk<versions>.jar;bcpkix-jdk<versions>.jar TestJdbc.java
   ```

**Sample code**

```java
import org.bouncycastle.asn1.pkcs.PrivateKeyInfo;
import org.bouncycastle.jce.provider.BouncyCastleProvider;
import org.bouncycastle.openssl.PEMParser;
import org.bouncycastle.openssl.jcajce.JcaPEMKeyConverter;
import org.bouncycastle.openssl.jcajce.JceOpenSSLPKCS8DecryptorProviderBuilder;
import org.bouncycastle.operator.InputDecryptorProvider;
import org.bouncycastle.operator.OperatorCreationException;
import org.bouncycastle.pkcs.PKCS8EncryptedPrivateKeyInfo;
import org.bouncycastle.pkcs.PKCSException;

import java.io.FileReader;
import java.io.IOException;
import java.nio.file.Paths;
import java.security.PrivateKey;
import java.security.Security;
import java.sql.Connection;
import java.sql.Statement;
import java.sql.ResultSet;
import java.sql.DriverManager;
import java.util.Properties;

public class TestJdbc
{
  // Path to the private key file that you generated earlier.
  private static final String PRIVATE_KEY_FILE = "/<path>/rsa_key.p8";

  public static class PrivateKeyReader
  {

    // If you generated an encrypted private key, implement this method to return
    // the passphrase for decrypting your private key.
    private static String getPrivateKeyPassphrase() {
      return "<private_key_passphrase>";
    }

    public static PrivateKey get(String filename)
            throws Exception
    {
      PrivateKeyInfo privateKeyInfo = null;
      Security.addProvider(new BouncyCastleProvider());
      // Read an object from the private key file.
      PEMParser pemParser = new PEMParser(new FileReader(Paths.get(filename).toFile()));
      Object pemObject = pemParser.readObject();
      if (pemObject instanceof PKCS8EncryptedPrivateKeyInfo) {
        // Handle the case where the private key is encrypted.
        PKCS8EncryptedPrivateKeyInfo encryptedPrivateKeyInfo = (PKCS8EncryptedPrivateKeyInfo) pemObject;
        String passphrase = getPrivateKeyPassphrase();
        InputDecryptorProvider pkcs8Prov = new JceOpenSSLPKCS8DecryptorProviderBuilder().build(passphrase.toCharArray());
        privateKeyInfo = encryptedPrivateKeyInfo.decryptPrivateKeyInfo(pkcs8Prov);
      } else if (pemObject instanceof PrivateKeyInfo) {
        // Handle the case where the private key is unencrypted.
        privateKeyInfo = (PrivateKeyInfo) pemObject;
      }
      pemParser.close();
      JcaPEMKeyConverter converter = new JcaPEMKeyConverter().setProvider(BouncyCastleProvider.PROVIDER_NAME);
      return converter.getPrivateKey(privateKeyInfo);
    }
  }

  public static void main(String[] args)
      throws Exception
  {
    String url = "jdbc:snowflake://<account_identifier>.snowflakecomputing.com";
    Properties prop = new Properties();
    prop.put("user", "<user>");
    prop.put("privateKey", PrivateKeyReader.get(PRIVATE_KEY_FILE));
    prop.put("db", "<database_name>");
    prop.put("schema", "<schema_name>");
    prop.put("warehouse", "<warehouse_name>");
    prop.put("role", "<role_name>");

    Connection conn = DriverManager.getConnection(url, prop);
    Statement stat = conn.createStatement();
    ResultSet res = stat.executeQuery("select 1");
    res.next();
    System.out.println(res.getString(1));
    conn.close();
  }
}
```

> **Note:**
>
> Use forward slashes as file path separators on all operating systems, including Windows. The JDBC driver replaces forward slashes
> with the appropriate path separator for the platform.

### Private key file name and password as connection properties

You can specify the private key file name and password as separate connection properties, for example:

```java
Properties props = new Properties();
props.put("private_key_file", "/tmp/rsa_key.p8");
props.put("private_key_file_pwd", "dummyPassword");
Connection connection = DriverManager.getConnection("jdbc:snowflake://myorganization-myaccount.snowflake.com", props);
```

If you specify the `private_key_file` and `private_key_file_pwd` parameters, do not specify the
`privateKey` parameter in the connection properties.

> **Note:**
>
> Use forward slashes as file path separators on all operating systems, including Windows. The JDBC driver replaces forward slashes
> with the appropriate path separator for the platform.

### Private key file name and password in connection string

You can specify the private key file name and password in the connection string, as shown below:

```java
Connection connection = DriverManager.getConnection(
    "jdbc:snowflake://myorganization-myaccount.snowflake.com/?private_key_file=/tmp/rsa_key.p8&private_key_file_pwd=dummyPassword",
    props);
```

> **Note:**
>
> Use forward slashes as file path separators on all operating systems, including Windows. The JDBC driver replaces forward slashes
> with the appropriate path separator for the platform.

If you specify the private key and password in the connection string, then do not specify the parameters
`private_key_file`, `private_key_file_pwd`, or `privateKey` in the connection properties.

### Key decryption errors

If you use encrypted keys that were generated using OpenSSL V3, you might receive errors similar to the following:

```output
java.security.NoSuchAlgorithmException: 1.2.840.113549.1.5.13 SecretKeyFactory not available

java.security.InvalidKeyException: IOException : DER input, Integer tag error
```

In this situation, you can use Bouncy Castle to decrypt the key by specifying the following JVM argument:

Version 4.xVersion 3.x

```bash
-Dnet.snowflake.jdbc.useBundledBouncyCastleForPrivateKeyDecryption=true
```

The default value is `true`, which means that the bundled Bouncy Castle library is used to decrypt the key.

```bash
-Dnet.snowflake.jdbc.enableBouncyCastle=true
```

## Using the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials.

To enable the OAuth 2.0 Authorization Code flow:

1. Set the `authenticator` connection parameter to `oauth_authorization_code`.
2. Set the following OAuth connection parameters:

   > * `oauthClientId`: Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata). Default: `LOCAL_APPLICATION` if unset and the IDP is Snowflake.
   > * `oauthClientSecret`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata). Default: `LOCAL_APPLICATION` if unset and the IDP is Snowflake.
   > * `oauthAuthorizationUrl`: Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `oauthTokenRequestUrl`: Identity provider endpoint supplying the access tokens to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `oauthScope`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.
   > * `oauthRedirectUri`: URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

## Using the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data.

To enable the OAuth 2.0 Client Credentials flow:

1. Set the `authenticator` connection parameter to `oauth_client_credentials`.
2. Set the following OAuth connection parameters:

   > * `oauthClientId`: Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `oauthClientSecret`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata)
   > * `oauthTokenRequestUrl`: Identity provider endpoint supplying the access tokens to the driver.
   > * `oauthScope`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

## Authenticating with a programmatic access token (PAT)

Programmatic access token (PAT) is a Snowflake-specific authentication method. The feature must be enabled for the account before usage (see the [Prerequisites](../../user-guide/programmatic-access-tokens.md) for more information). Authentication with PAT doesn’t involve any human interaction.

## Authenticating with workload identity federation (WIF)

[Workload identity federation](../../user-guide/workload-identity-federation.md) provides a service-to-service authentication method for Snowflake. This method enables applications, services, or containers to authenticate with Snowflake by leveraging their cloud provider’s native identity system, such as AWS IAM, Microsoft Entra ID, or Google Cloud service accounts. This approach eliminates the need for managing long-lived credentials and simplifies credential acquisition compared to other methods like External OAuth. Snowflake connectors are designed to automatically obtain short-lived credentials from the platform’s identity provider.

To enable the workload identity federation authenticator, do the following:

1. Set the `authenticator` connection parameter to `WORKLOAD_IDENTITY`.
2. Set the `workloadIdentityProvider` connection parameter to `AWS`, `AZURE`, `GCP`, or `OIDC`, based on your platform.
3. For OpenID Connect (OIDC), specify the `token` connection parameter.

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

## Connecting using a proxy server

You can use a proxy server with the Snowflake JDBC Driver in the following ways:

* Set system properties for your proxy settings in the JVM (Java Virtual Machine) for your client application.
* Include the proxy host and port information in the JDBC connection string or the `Properties` object passed to the
  `DriverManager.getConnection()` method.

> **Note:**
>
> Proxy settings specified in the connection string take precedence over JVM system properties.

> **Tip:**
>
> Snowflake’s security model does not allow Transport Layer Security (TLS) proxies (using an HTTPS certificate). Your
> proxy server must use a publicly-available Certificate Authority (CA), reducing potential security risks such as
> a MITM (Man In The Middle) attack through a compromised proxy.
>
> If you must use your TLS proxy, we strongly recommend that you update the server policy to pass through
> the Snowflake certificate such that no certificate is altered in the middle of communications.
>
> As an alternative, you can set the `nonProxyHosts` parameter in the connection string or `Properties` object to
> bypass the proxy for specific communications. For example, Amazon S3 access can be bypassed by specifying
> `nonProxyHosts=".amazonaws.com"`.

### Specifying a proxy server by setting Java system properties

To connect through a proxy server, you can set the proxy system properties. You can either set these in
your code or pass them on the command line to the JVM (Java virtual machine) for your client application.

For more information, see [Java Networking and Proxies](https://docs.oracle.com/javase/8/docs/technotes/guides/net/proxies.html).

To set the system properties in your code, call `System.setProperty`:

> ```java
> System.setProperty("http.useProxy", "true");
> System.setProperty("http.proxyHost", "proxyHost Value");
> System.setProperty("http.proxyPort", "proxyPort Value");
> System.setProperty("http.proxyUser", "proxyUser Value");
> System.setProperty("http.proxyPassword", "proxyPassword Value");
> System.setProperty("https.proxyHost", "proxyHost HTTPS Value");
> System.setProperty("https.proxyPort", "proxyPort HTTPS Value");
> System.setProperty("https.proxyUser", "proxyUser HTTPS Value");
> System.setProperty("https.proxyPassword", "proxyPassword HTTPS Value");
> System.setProperty("http.proxyProtocol", "https");
> ```

To pass the system properties on the command line to your JVM, use the `-D` command-line option:

> ```bash
> -Dhttp.useProxy=true
> -Dhttps.proxyHost=<proxy_host>
> -Dhttps.proxyPort=<proxy_port>
> -Dhttps.proxyUser=<proxy_user>
> -Dhttps.proxyPassword=<proxy_password>
> -Dhttp.proxyHost=<proxy_host>
> -Dhttp.proxyPort=<proxy_port>
> -Dhttp.proxyUser=<proxy_user>
> -Dhttp.proxyPassword=<proxy_password>
> -Dhttp.proxyProtocol="https"
> ```

To bypass the proxy for one or more IP addresses or hosts, set the `http.nonProxyHosts` system property to the list of these
hosts:

* Use a pipe symbol (`|`) to separate the host names.
* To specify hostnames that match a pattern, use an asterisk (`*`) as a wildcard character.

The following example demonstrates how to set this system property on the command line:

```bash
-Dhttp.nonProxyHosts="*.example.com|localhost|myorganization-myaccount.snowflakecomputing.com|192.168.91.*"
```

### Specifying a proxy server in the JDBC connection string

> **Note:**
>
> Specifying the proxy information as part of the URL is less secure than other methods of specifying the
> proxy information.

To use a proxy server by setting the following parameters in the JDBC connection string:

* [useProxy](jdbc-parameters.md)
* [proxyHost](jdbc-parameters.md)
* [proxyPort](jdbc-parameters.md)
* [proxyUser](jdbc-parameters.md)
* [proxyPassword](jdbc-parameters.md)
* [proxyProtocol](jdbc-parameters.md)

If your proxy server does not require authentication, you can omit the `proxyUser` and `proxyPassword` parameters.

If your proxy server connection requires authentication using a proxy username and proxy password, those
credentials may be exposed as plain text by other applications when using the HTTP protocol. To avoid
exposing these credentials, use the `proxyProtocol` parameter to specify the HTTPS protocol.

```none
jdbc:snowflake://<account_identifier>.snowflakecomputing.com/?warehouse=<warehouse_name>&useProxy=true&proxyHost=<ip_address>&proxyPort=<port>&proxyUser=test&proxyPassword=test
```

For example:

```none
jdbc:snowflake://myorganization-myaccount.snowflakecomputing.com/?warehouse=DemoWarehouse1&useProxy=true&proxyHost=172.31.89.76&proxyPort=8888&proxyUser=test&proxyPassword=test
```

The proxy settings specified in the connection string take precedence over JVM system properties.

If the proxy JVM arguments are set and you do not want to proxy any of your connections, do not set `useProxy=false`, as it has no effect. Instead, use the following, which effectively bypasses the JVM proxy settings:

```none
useProxy=true
proxyHost=127.0.0.1
proxyPort=8080
nonProxyHosts=*
```

#### Bypassing the Proxy Server

If you need to bypass the proxy server when connecting to one or more hosts, specify the list of hosts in the
`nonProxyHosts` parameter:

```none
&nonProxyHosts=<bypass_proxy_for_these_hosts>
```

Separate the hostnames with a URL-escaped pipe symbol (`%7C`). You can also use an asterisk (`*`) as a wildcard. For example:

```none
&nonProxyHosts=*.example.com%7Clocalhost%7Cmyorganization-myaccount.snowflakecomputing.com%7C192.168.91.*
```

#### Specifying the Protocol Used to Connect to the Proxy Server

* To specify the protocol used to connect to the proxy server, use the `proxyProtocol` parameter. The default value is `http`, but `https` is also valid.

For example:

```none
&proxyProtocol=https
```

## OCSP

When the driver connects, Snowflake sends a certificate to confirm that the connection is to Snowflake rather than to
a host that is impersonating Snowflake. The driver sends that certificate to an OCSP (Online Certificate Status
Protocol) server to verify that the certificate has not been revoked.

If the driver cannot reach the OCSP server to verify the certificate, the driver can
[“fail open” or “fail closed”](../../user-guide/ocsp.md).

### Choosing fail-open or fail-close mode

JDBC Driver versions prior to 3.8.0 default to fail-close. Versions 3.8.0 and later default to fail-open.
You can override the default behavior in any of the following ways:

* Set the connection property `ocspFailOpen` to `true` or `false`. For example:

  ```java
  Properties connection_properties = new Properties();
  connection_properties.put("ocspFailOpen", "false");
  ...
  connection = DriverManager.getConnection(connectionString, connection_properties);
  ```
* Set the system property `net.snowflake.jdbc.ocspFailOpen` to `true` or `false`. For
  example:

  ```java
  Properties p = new Properties(System.getProperties());
  p.put("net.snowflake.jdbc.ocspFailOpen", "false");
  System.setProperties(p);
  ```

### Verifying the OCSP connector or driver version

For more information about the driver or connector version, configuration, and OCSP behavior, see
[OCSP Configuration](../../user-guide/ocsp.md).

### OCSP response cache server

> **Note:**
>
> The OCSP response cache server is currently supported by the Snowflake JDBC Driver 3.6.0 and higher.

Snowflake clients initiate every connection to a Snowflake service endpoint with a “handshake” that establishes a secure connection before actually transferring data. As part of the handshake, a
client authenticates the TLS certificate for the service endpoint. The revocation status of the certificate is checked by sending a client certificate request to one of the OCSP
(Online Certificate Status Protocol) servers for the CA (certificate authority).

A connection failure occurs when the response from the OCSP server is delayed beyond a reasonable time. The following caches persist the revocation status, helping alleviate these issues:

* Memory cache, which persists for the life of the process.
* File cache, which persists until the cache directory (e.g. `~/.cache/snowflake` or `~/.snowsql/ocsp_response_cache`) is purged.
* Snowflake OCSP response cache server, which fetches OCSP responses from the CA’s OCSP servers hourly and stores them for 24 hours. Clients can then request the validation status of a given Snowflake
  certificate from this server cache.

  > **Important:**
  >
  > If your server policy denies access to most or all external IP addresses and web sites, you must allowlist the cache server
  > address to allow normal service operation. The cache server hostname is `ocsp*.snowflakecomputing.com:80`.

  If you need to disable the cache server for any reason, set the `SF_OCSP_RESPONSE_CACHE_SERVER_ENABLED` environment variable to `false`. Note that the value is case-sensitive and must
  be in lowercase.

If none of the cache layers contain the OCSP response, the client then attempts to fetch the validation status directly from the OCSP server for the CA.

## File caches

To improve usability, the driver uses file caches for authentication and OCSP responses. By default, these files are stored in the following directories:

Linux:
:   `~/.cache/snowflake`

macOS:
:   `~/Library/Caches/Snowflake`

Windows:
:   `%USERPROFILE%AppDataLocalSnowflakeCaches`

If the JDBC application user does not have a user profile in the local operating system, the driver attempts to store the cache files in the temporary directory. You can configure the driver to write
cache files to another directory using the following environment variables:

> `SF_TEMPORARY_CREDENTIAL_CACHE_DIR=string`
> :   Specifies the location of the temporary credential cache file in a local directory. This can also be configured with the JVM option `-Dnet.snowflake.jdbc.temporaryCredentialCacheDir=string`
>     on launch.
>
> `SF_OCSP_RESPONSE_CACHE_DIR=string`
> :   Specifies the location of the OCSP response cache file in a local directory. This can also be configured with the JVM option `-Dnet.snowflake.jdbc.ocspResponseCacheDir=string` on launch.
>
>     For more information, see OCSP Response Cache Server (in this topic).

Note that the JVM options should be set on launch, and not programmatically (via `System.setProperty()`). If both environment variable and JVM options are provided, the JVM option will be used.

## Configuring JDBC logging

Starting with version 3.0.4, the JDBC driver supports the following logging frameworks:

* Java Core Logging Facilities (default logger for the driver)
* Simple Logging Facade for Java
* Logging Configuration File

### Java core logging facilities (`Java.util.logging`)

By default, the `java.util.logging` uses `ConsoleHandler` to write to the standard error stream. You can set the Boolean `JAVA_LOGGING_CONSOLE_STD_OUT` java or connection property to `true`, which writes all logs to the standard output stream. The default value is `false`.

If you enable `JAVA_LOGGING_CONSOLE_STD_OUT`, you can also set the `JAVA_LOGGING_CONSOLE_STD_OUT_THRESHOLD` java or connection property to set the maximum log level the driver should write to standard output. Any log messages with a higher level than specified are sent to standard error. Possible values for this property include:

* `OFF`
* `SEVERE`
* `WARNING`
* `INFO`
* `CONFIG`
* `FINE`
* `FINER`
* `FINEST`
* `ALL`

To choose this logger explicitly, specify the following option for the JVM:

> `-Dnet.snowflake.jdbc.loggerImpl=net.snowflake.client.log.JDK14Logger`

Then, you can customize the logging configuration using the application programming interface (API) for the logger.

For more details, see the [java.util.logging Package documentation](https://docs.oracle.com/javase/8/docs/api/java/util/logging/package-summary.html).

For example, create a file named `logging.properties` that includes the following contents:

> ```none
> ###########################################################
> #   Default Logging Configuration File
> #
> # You can use a different file by specifying a filename
> # with the java.util.logging.config.file system property.
> # For example java -Djava.util.logging.config.file=myfile
> ############################################################
>
> ############################################################
> #   Global properties
> ############################################################
>
> # "handlers" specifies a comma-separated list of log Handler
> # classes.  These handlers will be installed during VM startup.
> # Note that these classes must be on the system classpath.
> # ConsoleHandler and FileHandler are configured here such that
> # the logs are dumped into both a standard error and a file.
> handlers = java.util.logging.ConsoleHandler, java.util.logging.FileHandler
>
> # Default global logging level.
> # This specifies which kinds of events are logged across
> # all loggers.  For any given facility this global level
> # can be overriden by a facility specific level.
> # Note that the ConsoleHandler also has a separate level
> # setting to limit messages printed to the console.
> .level = INFO
>
> ############################################################
> # Handler specific properties.
> # Describes specific configuration information for Handlers.
> ############################################################
>
> # default file output is in the tmp dir
> java.util.logging.FileHandler.pattern = /tmp/snowflake_jdbc%u.log
> java.util.logging.FileHandler.limit = 5000000000000000
> java.util.logging.FileHandler.count = 10
> java.util.logging.FileHandler.level = INFO
> java.util.logging.FileHandler.formatter = net.snowflake.client.log.SFFormatter
>
> # Limit the messages that are printed on the console to INFO and above.
> java.util.logging.ConsoleHandler.level = INFO
> java.util.logging.ConsoleHandler.formatter = net.snowflake.client.log.SFFormatter
>
> # Example to customize the SimpleFormatter output format
> # to print one-line log message like this:
> #     <level>: <log message> [<date/time>]
> #
> # java.util.logging.SimpleFormatter.format=%4$s: %5$s [%1$tc]%n
>
> ############################################################
> # Facility specific properties.
> # Provides extra control for each logger.
> ############################################################
>
> # Snowflake JDBC logging level.
> net.snowflake.level = INFO
> net.snowflake.handler = java.util.logging.FileHandler
> ```

Specify the JVM parameters in the command line:

> ```bash
> java -jar application.jar -Dnet.snowflake.jdbc.loggerImpl=net.snowflake.client.log.JDK14Logger -Djava.util.logging.config.file=logging.properties
> ```

Where `application.jar` references the application code for the JDBC driver. The log files are located in `/tmp/snowflake_jdbc*`.

### Simple logging facade for Java (`org.slf4j`)

To choose this logger, set the JVM option:

> `-Dnet.snowflake.jdbc.loggerImpl=net.snowflake.client.log.SLF4JLogger`.
>
> You must add `slf4j-api` and its implementation (for example, `logback`) to the `classpath`.

For more information, see the [Simple Logging Facade for Java (SLF4J) documentation](http://www.slf4j.org).

### Bridging logs from commons-logging

Some of the libraries use Apache commons-logging for logging. Handling these logs is configured by the `net.snowflake.jdbc.commons_logging_wrapper` JVM option that was added in version 3.22.0. For details, see Other parameters.

### Logging configuration file

Alternatively, you can easily specify the [log level](https://github.com/snowflakedb/snowflake-jdbc/blob/master/src/main/java/net/snowflake/client/log/SFLogLevel.java) and
the directory in which to save log files in the `sf_client_config.json` configuration file.

> **Note:**
>
> This logging configuration file feature supports only the following log levels:
>
> > * `DEBUG`
> > * `ERROR`
> > * `INFO`
> > * `OFF`
> > * `TRACE`
> > * `WARNING`

This configuration file uses JSON to define the `log_level` and `log_path` logging parameters, as follows:

```bash
{
  "common": {
    "log_level": "DEBUG",
    "log_path": "/home/user/logs"
  }
}
```

The driver looks configuration details in the following order:

* `client_config_file` [connection parameter](jdbc-parameters.md),
  containing the full path to the user-defined logging configuration file. For example:

  ```properties
  client_config_file=/opt/snowflake/snowflake_jdbc/my_jdbc_config.json
  ```
* `SF_CLIENT_CONFIG_FILE` environment variable, containing the full path to the user-defined logging configuration file.

  ```bash
  export SF_CLIENT_CONFIG_FILE=/home/myuser/my_jdbc_config.json
  ```
* JDBC driver installation directory, where the file must be named `sf_client_config.json`.
* User’s home directory, where the file must be named `sf_client_config.json`.

> **Note:**
>
> * If the configuration file is not found in any of the preceding locations, the driver uses the
>   Java core logging facilities.
> * If a configuration file specified in either the `client_config_file` connection parameter or
>   `SF_CLIENT_CONFIG_FILE` environment variable cannot be found or read, the driver throws an error message.

## Disabling PUT and GET commands

By default, the JDBC driver allows you to execute PUT and GET commands. If you don’t want to allow PUT and GET
commands access to the local file system, you can disable these commands in the following ways:

Version 4 .xVersion 3.x

* Set the [JDBC_ENABLE_PUT_GET](../../sql-reference/parameters.md) server parameter to `FALSE`.
* Set the JDBC [enablePutGet](jdbc-parameters.md) connection parameter to `false`.

* Set the [JDBC_ENABLE_PUT_GET](../../sql-reference/parameters.md) server parameter to `FALSE`.
* Set the JDBC [enablePutGet](jdbc-parameters.md) connection parameter to `false`.
* Call the `SFBaseSession.setEnablePutGet(false)` method.

## HTTP headers customization feature in Snowflake JDBC driver

To programmatically add custom HTTP headers to requests made by the Snowflake JDBC driver, implement the `HttpHeadersCustomizer` interface and register your implementation(s). This allows flexible, programmatic injection of dynamic or static headers.

Key considerations:

* The driver iterates registered customizers for applicable requests (Snowflake API, S3, private link OCSP). Then it calls `applies()`, then `newHeaders()` (respecting `invokeOnce()` for retries).
* Customizers cannot override essential driver-set headers. This is enforced by the driver.
* Keep `applies()` and `newHeaders()` efficient.

The following example shows how to implement `net.snowflake.client.jdbc.HttpHeadersCustomizer`.

```java
public class MyDynamicCustomizer implements HttpHeadersCustomizer {
    public boolean applies(String method, String uri, Map<String, List> headers) {
        return true;
    }

    public Map<String, List<String>> newHeaders() {
        Map<String, List<String>> headers = new HashMap<>();
        headers.put("X-Dynamic-Token", Collections.singletonList("token-" + System.nanoTime()));
        return headers;
    }

    public boolean invokeOnce() {
        return false;
    }
}
```

The following examples show different ways to register customizers:

* Via `net.snowflake.client.jdbc.SnowflakeBasicDataSource`:

  ```java
  SnowflakeBasicDataSource ds = new SnowflakeBasicDataSource();
  // ... set URL, user, password ...
  List<HttpHeadersCustomizer> myCustomizers = new ArrayList<>();
  myCustomizers.add(new MyDynamicHeaderCustomizer());
  Properties props = new Properties();
  props.put(HttpHeadersCustomizer.HTTP_HEADER_CUSTOMIZERS_PROPERTY_KEY, myCustomizers);
  ds.setConnectionProperties(props);
  ```
* Via `java.sql.DriverManager`:

  ```java
  Properties props = new Properties();
  // ... set user, password ...
  List<HttpHeadersCustomizer> myCustomizers = new ArrayList<>();
  myCustomizers.add(new MyDynamicHeaderCustomizer());
  props.put(HttpHeadersCustomizer.HTTP_HEADER_CUSTOMIZERS_PROPERTY_KEY, myCustomizers);
  Connection conn = DriverManager.getConnection(jdbcUrl, props);
  ```

## Troubleshooting tips

### Ensure properties are set correctly

The `DriverManager.getConnection()` method reads only the values of the
Properties parameter that match specific, predefined names (“password”, “warehouse”, etc.). If you
misspell a property name, or include extra properties, the driver ignores those properties without issuing an
error or warning message. This can make it difficult to detect minor misspellings.

### Use the right values for connection string and account

If you can’t establish a connection, verify that you are specifying the account identifier correctly in the JDBC connection
string. For more information about finding your account identifier, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

---
title: Connect to Snowflake with the Snowflake Python APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-connecting-snowflake.md
section: Developer Guide
---

# Connect to Snowflake with the Snowflake Python APIs

Before you can perform actions with the Snowflake Python APIs, you must define a connection to Snowflake. With the connection, you can
create a `Root` object for access to resources modeled by the API.

## Specify connection properties

You can define a connection to Snowflake using one of the following mechanisms:

* Python dictionary
* Configuration file

### Connect using a Python dictionary

You can specify the values needed to connect to Snowflake by using a Python dictionary. When you connect, you pass this dictionary as an
argument to the function or method you’re using to connect:

```python
import os

CONNECTION_PARAMETERS = {
    "account": os.environ["snowflake_account_demo"],
    "user": os.environ["snowflake_user_demo"],
    "password": os.environ["snowflake_password_demo"],
    "role": "test_role",
    "database": "test_database",
    "warehouse": "test_warehouse",
    "schema": "test_schema",
}
```

### Connect using a configuration file

You can specify connection definitions in a [TOML configuration file](../python-connector/python-connector-connect.md). This eliminates the need to
explicitly define a connection to Snowflake in your code.

You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
[Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

You can also configure the connection settings manually. For example, create a configuration file located at
`~/.snowflake/connections.toml`, and add connection settings similar to the following:

```toml
[myconnection]
account = "test-account"
user = "test_user"
password = "******"
role = "test_role"
warehouse = "test_warehouse"
database = "test_database"
schema = "test_schema"
```

In this example, you define a Snowflake connection named `myconnection` with the account `test-account`, user `test_user`,
password credentials, and database information.

> **Note:**
>
> Underscores are not supported in the `account` setting. If the [account identifier](../../user-guide/admin-account-identifier.md)
> includes underscores, replace them with dashes. For more information, see [Account name in your organization](../../user-guide/admin-account-identifier.md).

Connection definitions support the same configuration options available in the
[Snowflake Python Connector](../python-connector/python-connector-connect.md).

## Connect and create a `Root` object

Using the connection properties you’ve specified, you can create a connection to Snowflake. With the connection, you can create a
Snowflake Python APIs `Root` object with which to begin using the API.

You can connect using one of the following objects:

* A Snowpark Session object
* A Snowflake Python Connector Connection object

### Connect with a Snowpark `Session`

If you’re using the [Snowpark API for Python](../snowpark/python/index.md), you can create a connection to Snowflake
by using its `snowflake.snowpark.Session` object.

The Snowpark Python library is not automatically installed as a dependency of `snowflake.core`. To connect to Snowflake using the
Snowpark `Session` object, follow these steps:

1. To install the `snowflake-snowpark-python` package, run the following command:

   ```shell
   pip install 'snowflake-snowpark-python>=1.5.0,<2.0.0'
   ```
2. To create a connection to Snowflake, run code similar to the following example:

   ```python
   from snowflake.core import Root
   from snowflake.snowpark import Session

   session = Session.builder.config("connection_name", "myconnection").create()
   root = Root(session)
   ```

   In this example, the code creates a `Session` object using a connection definition named `myconnection`, which is specified in a
   configuration file. Using the resulting `Session` object, the code creates a `Root` object from which to use the API.

For more information about creating a `Session`, see [Creating a Session for Snowpark Python](../snowpark/python/creating-session.md).

### Connect with a Python Connector `Connection`

If you’re using the [Snowflake Connector for Python](../python-connector/python-connector.md), you can create a connection
to Snowflake by using its `snowflake.connector.connect` function. The function returns a `Connection` object.

You don’t need to install the Python Connector library separately. The `snowflake-connector-python` package is installed automatically as
a dependency when you install the `snowflake` parent package.

Code in the following example creates a `Connection` object using a connection definition named `myconnection`, which is specified
in a configuration file. Using the resulting `Connection` object, the code creates a `Root` object from which to use the API:

```python
from snowflake.connector import connect
from snowflake.core import Root

connection = connect(connection_name="myconnection")
root = Root(connection)
```

For more information about the Snowflake Connector for Python API, see [Python Connector API](../python-connector/python-connector-api.md).

## Use the `Root` object

With a `Root` object created from your connection to Snowflake, you can access
objects and methods of the Snowflake Python APIs. The `Root` object is the root of the resource tree modeled by the API.
You use the `Root` object to interact with Snowflake objects represented by the API.

Code in the following example uses the `Root` object to access Snowflake objects in order to resume the task named `mytask`.
The task is in the schema named `myschema`, which is in the database named `mydb`. The code uses the `databases`,
`schemas`, and `tasks` methods to get an object that represents this task:

```python
tasks = root.databases["mydb"].schemas["myschema"].tasks
mytask = tasks["mytask"]
mytask.resume()
```

---
title: Connecting to Snowflake with the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-connect.md
section: Developer Guide
---

# Connecting to Snowflake with the Python Connector

This topic explains the various ways you can connect to Snowflake with the Python connector.

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand to evaluate and troubleshoot your network connection to Snowflake.

## Importing the `snowflake.connector` module

To import the `snowflake.connector` module, execute the following command:

```python
import snowflake.connector
```

You can get login information from environment variables, the command line, a configuration file, or another appropriate
source. For example:

```none
PASSWORD = os.getenv('SNOWSQL_PWD')
WAREHOUSE = os.getenv('WAREHOUSE')
...
```

For the ACCOUNT parameter, use your [account identifier](../../user-guide/gen-conn-config.md). Note that the account identifier
does not include the `snowflakecomputing.com` suffix.

For details and examples, see [Usage notes for the account parameter (for the connect method)](python-connector-api.md).

> **Note:**
>
> For descriptions of available connector parameters, see the `snowflake.connector` [methods](python-connector-api.md).

If you copy data from your own Amazon S3 bucket, then you need the AWS_ACCESS_KEY_ID and AWS_SECRET_ACCESS_KEY.

```python
import os

AWS_ACCESS_KEY_ID = os.getenv('AWS_ACCESS_KEY_ID')
AWS_SECRET_ACCESS_KEY = os.getenv('AWS_SECRET_ACCESS_KEY')
```

> **Note:**
>
> If your data is stored in a Microsoft Azure container, provide the credentials directly in the COPY statement.

After reading the connection information, connect using either the default authenticator or federated authentication
(if enabled).

## Setting session parameters

You can set session parameters, such as QUERY_TAG, in multiple ways when using the Python Connector:

* You can set session-level parameters at the time you connect to Snowflake by passing
  the optional connection parameter named `session_parameters`, as follows:

  ```python
  con = snowflake.connector.connect(
      user='XXXX',
      password='XXXX',
      account='XXXX',
      session_parameters={
          'QUERY_TAG': 'EndOfMonthFinancials',
      }
  )
  ```

  The `session_parameters` dictionary passed to the [snowflake.connector.connect](python-connector-api.md) method can contain one or more session-level parameters.

  > **Note:**
  >
  > You cannot set the [SEARCH_PATH](../../sql-reference/parameters.md) parameter within a Python connection. You must establish a session before setting a search path.
* You can also set session parameters by executing the ALTER SESSION SET SQL statement after connecting:

  ```python
  con.cursor().execute("ALTER SESSION SET QUERY_TAG = 'EndOfMonthFinancials'")
  ```

For more information about session parameters, see the descriptions of individual parameters on the general
[Parameters](../../sql-reference/parameters.md) page.

## Connecting using the default authenticator

Connect to Snowflake using the login parameters:

```python
conn = snowflake.connector.connect(
    user=USER,
    password=PASSWORD,
    account=ACCOUNT,
    warehouse=WAREHOUSE,
    database=DATABASE,
    schema=SCHEMA
    )
```

You might need to extend this with other information available in the [snowflake.connector.connect](python-connector-api.md) method.

## Connecting using the `connections.toml` file

The Python connector lets you add connection definitions to a `connections.toml` configuration file.
A connection definition refers to a collection of connection-related parameters. Snowflake Python libraries currently support TOML version 1.0.0.

For more information about `toml` file formats, see [TOML (Tom’s Obvious Minimal Language)](https://toml.io/en/).

The Python connector looks for the `connections.toml` file in the following locations, in order:

* If a `~/.snowflake` directory exists on your machine, the Python Connector uses the
  `~/.snowflake/connections.toml` file. You can override the default `~/.snowflake` directory by setting the
  location in the `SNOWFLAKE_HOME` environment variable.
* Otherwise, the Python Connector uses the `connections.toml` file in the one of the following locations, based on your operating system:

  > + Linux: `~/.config/snowflake/connections.toml`, but you can update it with XDG vars
  > + Windows: `%USERPROFILE%\AppData\Local\snowflake\connections.toml`
  > + Mac: `~/Library/Application Support/snowflake/connections.toml`

To add credentials in a connections configuration file:

1. In a text editor, open the `connections.toml` file for editing. For example, to open the file in the Linux **vi** editor:

   ```bash
   $ vi connections.toml
   ```
2. Add a new Snowflake connection definition.

   You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
   [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

   For example, to add a Snowflake connection called `myconnection` with the account `myaccount`,
   user `johndoe`, and password credentials, as well as database information,
   add the following lines to the configuration file:

   ```toml
   [myconnection]
   account = "myorganization-myaccount"
   user = "jdoe"
   password = "******"
   warehouse = "my-wh"
   database = "my_db"
   schema = "my_schema"
   ```

   Connection definitions support the same configuration options available in the
   [snowflake.connector.connect](python-connector-api.md) method.
3. Optional: Add more connections, as shown:

   ```toml
   [myconnection_test]
   account = "myorganization-myaccount"
   user = "jdoe-test"
   password = "******"
   warehouse = "my-test_wh"
   database = "my_test_db"
   schema = "my_schema"
   ```
4. Save changes to the file.
5. In your Python code, supply the connection name to `snowflake.connector.connect`, similar to the following:

   ```python
   with snowflake.connector.connect(
         connection_name="myconnection",
   ) as conn:
   ```

   You can also override values defined for the connection in the `connections.toml` file, as follows:

   ```python
   with snowflake.connector.connect(
         connection_name="myconnection",
         warehouse="test_xl_wh",
         database="testdb_2"
   ) as conn:
   ```

## Setting a default connection

You can set a connection as the default, so you don’t have to specify one every time you call
`snowflake.connector.connect()` to connect to Snowflake. You can define a default connection in any of
the following ways, which are listed in increasing order of precedence:

* Create a connection definition named `default`.

  1. In the `connections.toml` file, create the connection definition and give it the name `default`, as shown:

     ```toml
     [default]
     account = "myorganization-myaccount"
     user = "jdoe-test"
     password = "******"
     warehouse = "my-test_wh"
     database = "my_test_db"
     schema = "my_schema"
     ```
  2. Save the file.
* Specify a named connection as the default connection in the Snowflake `config.toml` file, in the same directory
  as the `connections.toml` file.

  1. Open the `config.toml` file for editing; then:
  2. Set the `default_connection_name` parameter similar to the following:

     ```toml
     default_connection_name = "myaccount"
     ```
  3. Save the file.
* Set the `SNOWFLAKE_DEFAULT_CONNECTION_NAME` environment variable.

  Sometimes you might want to override the default connection temporarily, such as trying a test connection, without needing
  to change the normal default connection. You can override the default connection specified in the `connections.toml`
  and `config.toml` files by setting the `SNOWFLAKE_DEFAULT_CONNECTION_NAME` environment variable as follows:

  ```bash
  SNOWFLAKE_DEFAULT_CONNECTION_NAME = myconnection_test
  ```

To use the default connection, execute Python code similar to the following:

> ```python
> with snowflake.connector.connect() as conn:
>     with conn.cursor() as cur:
>         print(cur.execute("SELECT 1;").fetchall())
> ```

> **Note:**
>
> If you choose to rely on a default connection, you cannot override connection parameters, such as `username`,
> `database`, or `schema`.

## Using single sign-on (SSO) for authentication

If you have [configured Snowflake to use single sign-on (SSO)](../../user-guide/admin-security-fed-auth-overview.md), you can configure
your client application to use SSO for authentication. See [Using SSO with client applications that connect to Snowflake](../../user-guide/admin-security-fed-auth-use.md) for details.

## Using multi-factor authentication (MFA)

Snowflake supports caching MFA tokens, including combining MFA token caching with SSO.

The following sample code shows how to use MFA with the Python connector using a variety of methods:

* Push notification (Duo Push) - Default behavior when no passcode is provided.
* TOTP (Time-based One-Time Password) passcode - Provide the passcode from your authenticator app
* Passcode in password - Append the passcode to your password.
* MFA token caching - Cache the MFA token to skip prompts on reconnect.

```python
#!/usr/bin/env python
"""
This sample shows how to use Multi-Factor Authentication (MFA) with the
Snowflake Python Connector.

There are several ways to authenticate with MFA:
1. Push notification (Duo Push) - default behavior when no passcode is provided
2. TOTP passcode - provide the passcode from your authenticator app
3. Passcode in password - append the passcode to your password
4. MFA token caching - cache the MFA token to skip prompts on reconnect

Prerequisites:
- MFA must be enabled for your Snowflake user account
- You need a compatible authenticator app (such as Duo Mobile, Google Authenticator)
"""

import snowflake.connector

# Replace with your own Snowflake credentials
CONNECTION_PARAMETERS = {
    "account": "<account_name>",
    "user": "<user_name>",
    "password": "<password>",
    "database": "<database_name>",
    "schema": "<schema_name>",
    "warehouse": "<warehouse_name>",
}

def connect_with_mfa_push():
    """
    Example 1: MFA with Push Notification (Duo Push)

    When no passcode is provided, Snowflake sends a push notification
    to your registered device. You need to approve it to complete login.
    """
    print("Connecting with MFA push notification...")
    print("Please approve the push notification on your device.")

    with snowflake.connector.connect(
        **CONNECTION_PARAMETERS,
        authenticator="username_password_mfa",
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER(), CURRENT_ROLE()").fetchone()
        print(f"Connected as: {result[0]}, Role: {result[1]}")

def connect_with_mfa_passcode(passcode: str):
    """
    Example 2: MFA with TOTP Passcode

    Provide the time-based one-time password (TOTP) from your
    authenticator app directly in the connection parameters.

    Args:
        passcode: The 6-digit TOTP code from your authenticator app
    """
    print(f"Connecting with MFA passcode: {passcode[:2]}****")

    with snowflake.connector.connect(
        **CONNECTION_PARAMETERS,
        authenticator="username_password_mfa",
        passcode=passcode,
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER(), CURRENT_ROLE()").fetchone()
        print(f"Connected as: {result[0]}, Role: {result[1]}")

def connect_with_passcode_in_password(passcode: str):
    """
    Example 3: MFA with Passcode Appended to Password

    Instead of providing the passcode separately, you can append it
    to your password. This is useful for tools that don't support
    separate passcode parameters.

    Args:
        passcode: The 6-digit TOTP code from your authenticator app
    """
    print("Connecting with passcode appended to password...")

    # Create a copy of parameters with modified password
    params = CONNECTION_PARAMETERS.copy()
    params["password"] = params["password"] + passcode

    with snowflake.connector.connect(
        **params,
        authenticator="username_password_mfa",
        passcode_in_password=True,
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER(), CURRENT_ROLE()").fetchone()
        print(f"Connected as: {result[0]}, Role: {result[1]}")

def connect_with_mfa_callback():
    """
    Example 4: MFA with Callback Function

    You can provide a callback function that gets called while waiting
    for MFA approval. This is useful for providing user feedback.
    """
    print("Connecting with MFA callback...")

    def mfa_callback():
        """Called while waiting for MFA approval."""
        print("  ... waiting for MFA approval ...")
        return None

    with snowflake.connector.connect(
        **CONNECTION_PARAMETERS,
        authenticator="username_password_mfa",
        mfa_callback=mfa_callback,
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER(), CURRENT_ROLE()").fetchone()
        print(f"Connected as: {result[0]}, Role: {result[1]}")

def connect_with_mfa_token_caching():
    """
    Example 5: MFA with Token Caching

    Enable MFA token caching to skip MFA prompts on subsequent connections.
    The token is securely stored and reused for future connections.

    Note: This feature must also be enabled on the server side.
    On Linux, token caching requires a secure credential storage
    (such as keyring with a backend like Secret Service).
    """
    print("Connecting with MFA token caching enabled...")
    print("First connection - MFA required. Please approve.")

    # First connection - requires MFA approval
    with snowflake.connector.connect(
        **CONNECTION_PARAMETERS,
        authenticator="username_password_mfa",
        client_request_mfa_token=True,
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER()").fetchone()
        print(f"First connection successful as: {result[0]}")

    print("\nSecond connection - using cached MFA token (no prompt expected)...")

    # Second connection - should use cached token
    with snowflake.connector.connect(
        **CONNECTION_PARAMETERS,
        authenticator="username_password_mfa",
        client_request_mfa_token=True,
    ) as conn:
        result = conn.cursor().execute("SELECT CURRENT_USER()").fetchone()
        print(f"Second connection successful as: {result[0]}")

if __name__ == "__main__":
    import sys

    print("Snowflake MFA Authentication Examples")
    print("=" * 40)

    if len(sys.argv) < 2:
        print(
            """
Usage: python auth_by_mfa.py <example> [passcode]

Examples:
  python auth_by_mfa.py push              # MFA with push notification
  python auth_by_mfa.py passcode 123456   # MFA with TOTP passcode
  python auth_by_mfa.py password 123456   # MFA with passcode in password
  python auth_by_mfa.py callback          # MFA with callback function
  python auth_by_mfa.py cache             # MFA with token caching
"""
        )
        sys.exit(1)

    example = sys.argv[1].lower()

    if example == "push":
        connect_with_mfa_push()
    elif example == "passcode":
        if len(sys.argv) < 3:
            print("Error: Please provide a passcode")
            sys.exit(1)
        connect_with_mfa_passcode(sys.argv[2])
    elif example == "password":
        if len(sys.argv) < 3:
            print("Error: Please provide a passcode")
            sys.exit(1)
        connect_with_passcode_in_password(sys.argv[2])
    elif example == "callback":
        connect_with_mfa_callback()
    elif example == "cache":
        connect_with_mfa_token_caching()
    else:
        print(f"Unknown example: {example}")
        sys.exit(1)
```

For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md).

## Using key-pair authentication and key-pair rotation

The Python connector supports key pair authentication and key rotation.

For more information on how to configure key pair authentication and key rotation, see [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).

1. After completing the key pair authentication configuration, set the `private_key_file` parameter in the `connect` function to the path to the private key file. Also, set the `private_key_file_pwd` parameter to the passphrase of the private key file.
2. Modify and execute the sample code, below:

> * Update the security parameters:
>
>   > + `path`: Specifies the local path to the private key file you created.
> * Update the connection parameters:
>
>   > + `user`: Specifies your Snowflake login name.
>   > + `account_identifier`: Specifies your [account identifier](../../user-guide/gen-conn-config.md).
>   >
>   >   For more details, see [Usage notes for the account parameter (for the connect method)](python-connector-api.md).
>
> > **Sample code**
> >
> > ```python
> > import os
> > import snowflake.connector as sc
> >
> > private_key_file = '<path>'
> > private_key_file_pwd = '<password>'
> >
> > conn_params = {
> >     'account': '<account_identifier>',
> >     'user': '<user>',
> >     'authenticator': 'SNOWFLAKE_JWT',
> >     'private_key_file': private_key_file,
> >     'private_key_file_pwd':private_key_file_pwd,
> >     'warehouse': '<warehouse>',
> >     'database': '<database>',
> >     'schema': '<schema>'
> > }
> >
> > ctx = sc.connect(**conn_params)
> > cs = ctx.cursor()
> > ```

## Using a proxy server

To use a proxy server, configure the following environment variables:

* HTTP_PROXY
* HTTPS_PROXY
* NO_PROXY

For example:

Linux or macOS:
:   ```bash
    export HTTP_PROXY='http://username:password@proxyserver.example.com:80'
    export HTTPS_PROXY='http://username:password@proxyserver.example.com:80'
    ```

Windows:
:   ```bash
    set HTTP_PROXY=http://username:password@proxyserver.example.com:80
    set HTTPS_PROXY=http://username:password@proxyserver.example.com:80
    ```

> **Tip:**
>
> Snowflake’s security model does not allow Secure Sockets Layer (SSL) proxies (using an HTTPS certificate). Your proxy server must use a publicly-available Certificate Authority (CA), reducing potential security risks such as a MITM (Man In The Middle) attack through a compromised proxy.
>
> If you must use your SSL proxy, we strongly recommend that you update the server policy to pass through the Snowflake certificate such that no certificate is altered in the middle of
> communications.
>
> Optionally `NO_PROXY` can be used to bypass the proxy for specific communications. For example, access to Amazon S3 can bypass the proxy server by specifying `NO_PROXY=".amazonaws.com"`.
>
> `NO_PROXY` does not support wildcards. Each value specified should be one of the following:
>
> * The end of a hostname (or a complete hostname), for example:
>
>   + .amazonaws.com
>   + myorganization-myaccount.snowflakecomputing.com
> * An IP address, for example:
>
>   + 192.196.1.15
>
> If more than one value is specified, values should be separated by commas, for example:
>
> > ```none
> > localhost,.example.com,.snowflakecomputing.com,192.168.1.15,192.168.1.16
> > ```

## Connecting with OAuth

To connect using OAuth, the connection string must include the `authenticator` parameter set to `oauth` and the `token` parameter set to the `oauth_access_token`. For more information, see [Clients, drivers, and connectors](../../user-guide/oauth-intro.md).

```python
ctx = snowflake.connector.connect(
    user="<username>",
    host="<hostname>",
    account="<account_identifier>",
    authenticator="oauth",
    token="<oauth_access_token>",
    warehouse="test_warehouse",
    database="test_db",
    schema="test_schema"
)
```

### Enable the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials.

To enable the OAuth 2.0 Authorization Code flow:

* Set the `authenticator` connection parameter to `OAUTH_AUTHORIZATION_CODE`.
* Set the following OAuth connection parameters:

  + `oauth_client_id`: Value of `client id` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata).
  + `oauth_client_secret`: Value of the `client secret` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata).
  + `oauth_authorization_url`: Identity Provider endpoint supplying the authorization code to the driver. When using Snowflake as an Identity Provider ,this value is derived from the `server` or `account` parameters.
  + `oauth_token_request_url`: Identity Provider endpoint supplying the access tokens to the driver. When using Snowflake as an Identity Provider ,this value is derived from the `server` or `account` parameters.
  + `oauth_scope`: Scope requested in the Identity Provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.
  + `oauth_redirect_uri`: URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.
  + `oauth_disable_pkce:` Disables Proof Key for Code Exchange (PKCE), a security enhancement that ensures that even if malicious attackers intercept an Authorization Code, they won’t be able to change it to a valid access token.
  + `oauth_enable_refresh_token:` Enables a silent re-authentication when the actual access token becomes outdated, providing it’s supported by the Authorization Server and `client_store_temporary_credential` is set to `True`.
  + `oauth_enable_single_use_refresh_tokens:` Whether to opt-in to single-use refresh token semantics.

### Enable the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data.

To enable the OAuth 2.0 Client Credentials flow:

* Set the `authenticator` connection parameter to `OAUTH_CLIENT_CREDENTIALS`.
* Set the following OAuth connection parameters:

  + `oauth_client_id`: Value of `client id` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata).
  + `oauth_client_secret`: Value of the `client secret` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata)
  + `oauth_token_request_url`: Identity Provider endpoint supplying the access tokens to the driver. When using Snowflake as an Identity Provider, this value is derived from the `server` or `account` parameters.
  + `oauth_scope`: Scope requested in the Identity Provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

## Authenticating with workload identity federation (WIF)

[Workload identity federation](../../user-guide/workload-identity-federation.md) provides a service-to-service authentication method for Snowflake. This method enables applications, services, or containers to authenticate with Snowflake by leveraging their cloud provider’s native identity system, such as AWS IAM, Microsoft Entra ID, or Google Cloud service accounts. This approach eliminates the need for managing long-lived credentials and simplifies credential acquisition compared to other methods like External OAuth. Snowflake connectors are designed to automatically obtain short-lived credentials from the platform’s identity provider.

To enable the workload identity federation authenticator, do the following:

1. Set the `authenticator` connection parameter to `WORKLOAD_IDENTITY`.
2. Set the `workload_identity_provider` connection parameter to `AWS`, `AZURE`, `GCP`, or `OIDC`, based on your platform.
3. For OpenID Connect (OIDC), specify the `token` connection parameter.

## Authenticating with a programmatic access token (PAT)

Programmatic Access Token (PAT) is a Snowflake-specific authentication method. The feature must be enabled for the account before usage (see the [Prerequisites](../../user-guide/programmatic-access-tokens.md) for more information). Authentication with PAT doesn’t involve any human interaction.

For more information about PATs, see [Using programmatic access tokens for authentication](../../user-guide/programmatic-access-tokens.md).

## Managing connection timeouts

Calling `snowflake.connector.connect` submits a login request. If a login request fails, the connector can resend the connection
request. The following parameters set time limits after which the connector stops retrying requests:

* `login_timeout`: Specifies how long, in seconds, to keep resending the connection request. If the connection is
  unsuccessful within that time, the connector fails with a timeout error after completing the current attempt instead of
  continuing to retry the login request. After the timeout passes, further retries are prevented. However, the current ongoing attempt terminates naturally.
* `network_timeout`: Specifies how long to wait for network issues to resolve for other requests, such as query
  requests from `cursor.execute`. When `network_timeout` seconds have passed, if the current attempt fails, a timeout occurs, and the request in question is not retried.
  After `network_timeout` seconds pass,
  the current attempt is still allowed to finish (fail on its own), after which the timeout occurs.
* `socket_timeout`: Specifies the connection and request timeouts at the socket level.

The following example overrides the `socket_timeout` for the SNOWFLAKE_JWT authenticator:

```python
# this request itself stops retrying after 60 seconds as it is a login request
conn = snowflake.connector.connect(
login_timeout=60,
network_timeout=30,
socket_timeout=10
)

# this request stops retrying after 30 seconds
conn.cursor.execute("SELECT * FROM table")
```

The following example demonstrates the effect of setting `socket_timeout` to a large value:

```python
# even though login_timeout is 1, connect will take up to n*300 seconds before failing
# (n depends on possible socket addresses)
# this issue arises because socket operations cannot be cancelled once started
conn = snowflake.connector.connect(
login_timeout=1,
socket_timeout=300
)
```

The following example shows how to override the socket timeout for the SNOWFLAKE_JWT authenticator:

```python
# socket timeout for login request overriden by env variable JWT_CNXN_WAIT_TIME
conn = snowflake.connector.connect(
authenticator="SNOWFLAKE_JWT",
socket_timeout=300
)

# socket timeout for this request is still 300 seconds
conn.cursor.execute("SELECT * FROM table")
```

Note that the `MAX_CON_RETRY_ATTEMPTS` environment variable limits the maximum number of retry attempts for login requests.
If a request has not timed out but the maximum number of retry attempts is reached, the request immediately fails.
The default value is 1, meaning the connector makes only one retry attempt.

## Managing connection backoff policies for retries

In some situations, you might want to vary the rate or frequency the connector uses to retry failed requests due to timeouts.
For example, if you notice that very large numbers of attempts occur concurrently, you can spread those requests out by
defining a retry backoff policy. A backoff policy specifies the time to wait between retry attempts.

The Snowflake Connector for Python implements backoff policies with the `backoff_policy` connection parameter
that specifies a Python generator function. The generator function lets you specify how long to wait (back off) before sending
the next retry request.

Snowflake provides the following helpers to create predefined generator functions with your desired parameters.
You can use these if you do not want to create your own:

* `linear_backoff`, which increases the backoff duration by a constant each iteration.
* `exponential_backoff`, which multiplies the backoff duration by a constant each iteration.
* `mixed_backoff`, which randomly chooses between incrementing the backoff duration with `exponential_backoff` and leaving it unchanged each iteration.

These predefined generator functions use the following parameters to specify their behaviors:

* `base`: Initial backoff time, in seconds (default = `1`).
* `factor`: Coefficient for incrementing the backoff time. The effect depends on implementation (default = `2`); `linear_backup` adds the value, while `exponential_backup` multiplies the value.
* `cap`: Maximum backoff time, in seconds (default = `16`).
* `enable_jitter`: Whether to enable jitter on computed durations (default = `True`).

  For more information about jitter in exponential backoff, see the [AWS Exponential Backoff And Jitter](https://aws.amazon.com/blogs/architecture/exponential-backoff-and-jitter/) article.

For example, you can use the `exponential_backoff` policy with default values or with custom values, as shown:

```python
from snowflake.connector.backoff_policies import exponential_backoff

# correct, no required arguments
snowflake.connector.connect(
backoff_policy=exponential_backoff()
)

# correct, parameters are customizable
snowflake.connector.connect(
backoff_policy=exponential_backoff(
    factor=5,
    base=10,
    cap=60,
    enable_jitter=False
  )
)
```

You can also create your own backoff policy generator functions, similar to the following that defines
the `my_backoff_policy` generator function:

```python
def my_backoff_policy() -> int:
  while True:
    # yield the desired backoff duration
```

You then set the `backoff_policy` connection parameter to the name of your generator function as follows:

```python
snowflake.connector.connect(
  backoff_policy=constant_backoff
)
```

## OCSP

When the driver connects, Snowflake sends a certificate to confirm that the connection is to Snowflake rather than to
a host that is impersonating Snowflake. The driver sends that certificate to an OCSP (Online Certificate Status
Protocol) server to verify that the certificate has not been revoked.

If the driver cannot reach the OCSP server to verify the certificate, the driver can
[“fail open” or “fail closed”](../../user-guide/ocsp.md).

### Choosing fail-open or fail-close mode

Versions of the Snowflake Connector for Python prior to 1.8.0 default to fail-close mode. Versions 1.8.0 and later
default to fail-open. You can override the default behavior by setting the optional connection parameter
`ocsp_fail_open` when calling the connect() method. For example:

```javascript
con = snowflake.connector.connect(
    account=<account_identifier>,
    user=<user>,
    ...,
    ocsp_fail_open=False,
    ...);
```

### Verifying the OCSP connector or driver version

The driver or connector version and its configuration both determine the OCSP behavior. For more information about the driver or connector version, their configuration, and OCSP behavior, see [OCSP Configuration](../../user-guide/ocsp.md).

### Caching OCSP responses

To ensure all communications are secure, the Snowflake Connector for Python uses the HTTPS protocol to connect to Snowflake, as well as to connect to all other services (e.g. Amazon S3 for staging data files and Okta for federated authentication). In addition to the regular HTTPS protocol, the connector also checks the TLS/SSL certificate revocation status on each connection via OCSP (Online Certificate Status Protocol) and aborts the connection if it finds the certificate is revoked or the OCSP status is not reliable.

Because each Snowflake connection triggers up to three round trips with the OCSP server, multiple levels of cache for OCSP responses have been introduced to reduce the network overhead added to the connection:

* Memory cache, which persists for the life of the process.
* File cache, which persists until the cache directory (e.g. `~/.cache/snowflake`) is purged.
* OCSP response server cache.

Caching also addresses availability issues for OCSP servers (i.e. in the event the actual OCSP server is down). As long as the cache is valid, the connector can still validate the certificate revocation status.

If none of the cache layers contain the OCSP response, the client attempts to fetch the validation status directly from the CA’s OCSP server.

#### Modifying the OCSP response file cache location

By default, the file cache is enabled in the following locations, so no additional configuration tasks are required:

Linux:
:   `~/.cache/snowflake/ocsp_response_cache.json`

macOS:
:   `~/Library/Caches/Snowflake/ocsp_response_cache.json`

Windows:
:   `%USERPROFILE%\AppData\Local\Snowflake\Caches\ocsp_response_cache.json`

However, if you want to specify a different location and/or file name for the OCSP response cache file, the `connect` method accepts the `ocsp_response_cache_filename` parameter, which specifies the path and name for the OCSP cache file in the form of a URI.

#### OCSP response cache server

> **Note:**
>
> The OCSP response cache server is currently supported by the Snowflake Connector for Python 1.6.0 and higher.

The memory and file types of OCSP cache work well for applications connected to Snowflake using one of the clients Snowflake provides, with a persistent host. However, they don’t work in dynamically-provisioned environments such as AWS Lambda or Docker.

To address this situation, Snowflake provides a third level of caching: the OCSP response cache server. The OCSP response cache server fetches OCSP responses hourly from the CA’s OCSP servers and stores them for 24 hours. Clients can then request the validation status of a given Snowflake certificate from this server cache.

> **Important:**
>
> If your server policy denies access to most or all external IP addresses and web sites, you must allow the cache server address to allow normal service operation. The cache server URL is
> `ocsp*.snowflakecomputing.com:80`.

If you need to disable the cache server for any reason, set the `SF_OCSP_RESPONSE_CACHE_SERVER_ENABLED` environment variable to `false`. Note that the value is case-sensitive and must be in
lowercase.

## Running connectivity tests and diagnostics

> **Note:**
>
> Running diagnostics for a connection requires the following:
>
> * Snowflake Connector for Python version 3.9.1 or newer.
> * Python version 3.9 (deprecated) or newer.

If you encounter connectivity issues, you can run diagnostics directly within the connector. Snowflake Support might also request this information to help you with connectivity issues.

The diagnostics collection uses the following [connection](python-connector-api.md) parameters:

* `enable_connection_diag`: Whether to generate a diagnostic report.
* `connection_diag_log_path`: Absolute path for the generated report.
* `connection_diag_allowlist_path`: Absolute path to a JSON file containing the output of `SYSTEM$ALLOWLIST()` or `SYSTEM$ALLOWLIST_PRIVATELINK()`. Required only if the user defined in the connection does not have permission to run the system allowlist functions or if connecting to the account URL fails.

To configure a connection to generate diagnostics:

1. Configure the connection parameters:

   * Set your Snowflake account credential parameters.
   * Set `enable_connection_diag=True` in your `connect` parameters.
   * If desired, change the location for the generated report by setting the `connection_diag_log_path` parameter.

   For example:

   ```python
   from snowflake import connector

   ctx = connector.connect(
           user=<user>,
           password=<password>,
           account=<account>,
           enable_connection_diag=True,
           connection_diag_log_path="<HOME>/diag-tests",
           )
   print('connected')
   ```
2. Execute the code snippet.
3. Review the diagnostic test output of the generated `SnowflakeConnectionTestReport.txt` file located in the specified log path.

You can review the report for any connectivity issues and discuss them with your network team. You can also provide the report to Snowflake Support for additional assistance.

---
title: Considerations when reusing a session or connection among multiple threads
source: https://docs.snowflake.com/en/developer-guide/driver-connections.md
section: Developer Guide
---

# Considerations when reusing a session or connection among multiple threads

Snowflake drivers use stateful connections. Reusing the same session or connection among threads has multiple drawbacks. For example, when a session is initialized, it starts with a default database, schema, role, and a set of parameters. A connection starts and ends a session, which establishes a one-to-one relationship between a session and a connection. The following section highlights common effects of reusing connections across concurrent threads.

## Effects of reusing a session or connection across multiple threads

Driver users often create multi-threaded applications. Rather than creating separate sessions and connections for each thread, you might try to save some overhead by reusing a session or connection in different threads. Be aware that doing so can cause the following undesirable behaviors:

* **Session state**

  Sessions keep track of the current database, schema, and role. If one thread changes these values (like USE DATABASE), the other thread might be affected. This impact is particularly important because changing to another schema with the same tables might cause a thread to accidentally modify the wrong table. Additionally, changing connection or configuration parameters in one session can affect all threads that use that session.
* **Transaction state**

  A transaction might start in one session. If multiple threads have access to that session, each one can potentially modify data in the same transaction, which might cause the data to be accidentally persisted or lost if a transaction is committed or rolled back.
* Sequence counter

  Drivers use a sequence counter to retry requests. Because sequence counters are global for a session, reusing a session in different threads might also inadvertently alter the global sequence counter that can result in unpredictable behavior for retrying requests.
* **Query context cache**

  For improved performance, sessions keep track of some internal information in a driver-specific or internal cache. The cache updates after every query, so running multiple queries concurrently in a session could result in data corruption.
* **Last query ID**

  Connections keep track of the last query ID, which can then be retrieved and used. If two queries run in parallel in different threads, a race condition can affect which one sets the last query ID.

## Snowflake recommendations

* Use connection pools when possible.

  If you reuse connections across threads to avoid authenticating frequently, you should consider using connection pools. Using connection pools decreases the number of authentication requests, because the session is not closed at the end—it’s just returned to a pool where it’s ready to be used for the next occasion. Even when using connection pools, your application must be careful not to alter or reset parameters that affect only a specific query or a current database. Also, the application is responsible for committing or aborting active transactions before returning a connection to the connection pool.
* Use asynchronous queries cautiously.

  Starting multiple asynchronous queries at the same time on a single connection, or starting a synchronous query while an asynchronous query is still in progress, can result in a race condition that might cause unpredictable results.
* Use additional authentication optimizations.

  Specific drivers support some or all of the following optimizations that you can use to improve authentication performance:

  + SSO token caches
  + MFA token caches
  + Tokens in TOML configuration files
  + Custom token accessors

  You should check the driver documentation to see which of these options the driver supports.
* Disable query context caching.

  If you’re aware of all of the issues associated with reusing sessions and connections in multiple threads, but still want to use them, Snowflake recommends that you disable query caching by setting the `DisableQueryContextCache` parameter to `true` in the connection definition.

---
title: Consuming results
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-consume.md
section: Developer Guide
---

# Consuming results

## Returning results inline

The most common way of consuming results is by passing a `complete` callback to `connection.execute()`. When the statement has finished executing and the result is ready to be consumed, the `complete`
callback is invoked with the result rows returned inline:

> ```javascript
> connection.execute({
>   sqlText: 'SELECT * FROM sometable',
>   complete: (err, stmt, rows) => {
>     if (err) {
>       console.error(`Failed to execute statement due to the following error: ${err.message}`);
>     } else {
>       console.log(`Number of rows produced: ${rows.length}`);
>     }
>   }
> });
> ```

## Streaming results

You can also consume a result as a stream of rows by setting the `streamResult` connection parameter to `true`
in `connection.execute` when calling the `statement.streamRows()` method. Enabling this parameter causes
the method to return a Node.js `Readable` stream, which you can use to consume rows as they are received.

For more information about the `Readable` stream, refer to the [Node.js documentation](https://nodejs.org/dist/latest/docs/api/stream.html#stream_class_stream_readable).

> **Important:**
>
> For any result set that might exceed Node’s default memory, Snowflake highly recommends that you set
> `streamResult` to `true` when streaming results. With the default value (`false`), the connector
> stores all of the rows in an array before streaming the results. With smaller result sets, this factor is
> not normally an issue. However, with larger result sets, storing all the results in memory can contribute to an OOM error.

Recent versions of the Snowflake Node.js driver (1.6.23 and later) implement backpressure functionality to ensure that,
when consuming results,
data is not pushed to the stream faster than data is read from the stream.

For example, the following code fragment consumes the results using the `Readable` event:

> ```javascript
> const connection = snowflake.createConnection({
>   account: process.env.SFACCOUNT,
>   username: process.env.SFUSER,
>   // ...
>   streamResult: true
> });
>
> // [..rest of the code..]
>
> connection.execute({
>   sqlText: 'select L_COMMENT from SNOWFLAKE_SAMPLE_DATA.TPCH_SF100.LINEITEM limit 100000;',
>   streamResult: true,
>   complete: function (err, stmt) {
>     const stream = stmt.streamRows();
>     // Read data from the stream when it is available
>     stream.on('readable', function (row) {
>       while ((row = this.read()) !== null) {
>         console.log(row);
>       }
>     }).on('end', () => {
>       console.log('done');
>     }).on('error', err => {
>       console.log(err);
>     });
>   }
> });
> ```

## Batch processing results

By default, the `statement.streamRows()` method produces a stream that includes every row in the result. However, if you only want to consume a subset of the result, or if you want to consume result rows
in batches, you can call `streamRows()` with `start` and `end` arguments. When these additional options are specified, only rows in the requested range are streamed:

> ```javascript
> connection.execute({
>   sqlText: 'SELECT * FROM sometable',
>   streamResult: true, // prevent rows from being returned inline in the complete callback
>   complete: function (err, stmt, rows) {
>     // no rows returned inline because streamResult was set to true
>     console.log(`rows: ${rows}`); // 'rows: undefined'
>
>     // only consume at most the last 5 rows in the result
>     rows = [];
>     stmt.streamRows({
>       start: Math.max(0, stmt.getNumRows() - 5),
>       end: stmt.getNumRows() - 1
>     })
>     .on('error', err => {
>       console.error('Unable to consume requested rows');
>     })
>     .on('data', row => {
>       rows.push(row);
>     })
>     .on('end', () => {
>       console.log(`Number of rows consumed: ${rows.length}`);
>     });
>   }
> });
> ```

## Data type casting

When result rows are produced, the driver automatically maps SQL data types to their corresponding JavaScript equivalents. For example, values of type TIMESTAMP and DATE are returned as JavaScript Date
objects.

For the full mapping of JavaScript to SQL data types, refer to the table below:

> | SQL Data Type | JavaScript Data Type | Notes |
> | --- | --- | --- |
> | VARCHAR, CHAR, CHARACTER, STRING, TEXT | String |  |
> | INT, INTEGER, BIGINT, SMALLINT | Number | This is the default mapping. Use the session parameter JS_TREAT_INTEGER_AS_BIGINT to map to JavaScript Bigint. |
> | NUMBER(precision, scale), DECIMAL(p, s), NUMERIC(p, s) where `scale` = 0 | Number | This is the default mapping. Use the session parameter JS_TREAT_INTEGER_AS_BIGINT to map to JavaScript Bigint. |
> | NUMBER(precision, scale), DECIMAL(p, s), NUMERIC(p, s) where `scale` > 0 | Number |  |
> | FLOAT, FLOAT4, FLOAT8, DOUBLE, DOUBLE PRECISION, REAL | Number |  |
> | DECFLOAT | String | A string in scientific notation format, such as 9.8765432099999998623226732747455716901e-250  JavaScript does not natively support high-precision decimal numbers. Use libraries like [BigNumber.js](https://github.com/MikeMcl/bignumber.js) or [Decimal.js](https://github.com/MikeMcl/decimal.js) to perform accurate arithmetic operations on these values. |
> | TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ | Date | TIMESTAMP_NTZ values are returned in UTC. |
> | DATE | Date |  |
> | TIME | String | The TIME data type in SQL has no equivalent in JavaScript, so it is mapped to a JavaScript string. |
> | BOOLEAN | Boolean |  |
> | VARIANT, ARRAY, OBJECT | JSON |  |

## Fetching integer data types as Bigint

By default, Snowflake INTEGER columns (including `BIGINT`, `NUMBER(p, 0)`, etc.) are converted to JavaScript’s `Number` data type.
However, the largest legal Snowflake integer values are larger than the largest legal JavaScript Number values. To convert Snowflake `INTEGER`
columns to JavaScript `Bigint`, which can store larger values than JavaScript `Number`, set the session parameter `JS_TREAT_INTEGER_AS_BIGINT`.

You can use the following methods to set these parameter values:

* Use the ALTER SESSION statement, as shown below:

  > ```javascript
  > connection.execute({
  >   sqlText: 'ALTER SESSION SET JS_TREAT_INTEGER_AS_BIGINT = TRUE',
  >   complete: function ...
  > });
  > ```
* Specify the parameter in the connection configuration information:

  > ```javascript
  > const connection = snowflake.createConnection({
  >   username: 'fakeusername',
  >   password: 'fakepassword',
  >   account: 'fakeaccountidentifier',
  >   jsTreatIntegerAsBigInt: true
  > });
  > ```

## Fetching data types as strings

When calling `connection.execute()`, you can use the `fetchAsString` option to return the following
data types as strings: `Boolean`, `Number`, `Date`, `Buffer`, and `JSON`.

You can use this option, for example, to return:

* Formatted versions of values of type DATE and TIMESTAMP (or its variants).
* String versions of numerical SQL types that can’t be converted to JavaScript numbers without loss in precision.

The following example uses `fetchAsString` to convert a high-precision `Number` value to a string.:

```javascript
connection.execute({
  sqlText: 'SELECT 1.123456789123456789123456789 as "c1"',
  fetchAsString: ['Number'],
  complete: (err, stmt, rows) => {
    if (err) {
      console.error(`Failed to execute statement due to the following error: ${err.message}`);
    } else {
      console.log(`c1: ${rows[0].c1}`); // c1: 1.123456789123456789123456789
    }
  }
});
```

## Parsing XML data

Beginning with version 1.7.0 of the driver, you can use the following `fast-xml-parser`
library configuration options to customize how the driver processes XML document attributes when querying columns
with XML content. For more information about these supported options and how they affect XML data parsing,
see [xmlParserConfig options](nodejs-driver-options.md).

You can download the [fast-xml-parser](https://www.npmjs.com/package/fast-xml-parser).

By default, the Node.js driver ignores XML element attributes when returning XML data from a query. For example,
in the following XML content, the `<piece>` element includes an `id` attribute:

```xml
<exhibit name="Art Show">
  <piece id="000001">
    <name>Mona Lisa</name>
    <artist>Leonardo da Vinci</artist>
    <year>1503</year>
  </piece>
  <piece id="000002">
    <name>The Starry Night</name>
    <artist>Vincent van Gogh</artist>
    <year>1889</year>
  </piece>
</exhibit>
```

By default, when the Node.js driver returns the result set, it ignores the `id` attribute and returns the following
output. Notice the attribute names and values are not included.

```output
{
  exhibit: {
    piece: [
      {
        "name": "Mona Lisa",
        "artist": "Leonardo da Vinci",
        "year": "1503",
      },
      {
        "name": "The Starry Night",
        "artist": "Vincent van Gogh",
        "year": "1889",
      }
    ]
  }
}
```

To set the `fast-xml-parser` options, create an `xmlParserConfig`
element similar to the following example:

> ```javascript
> import snowflake from 'snowflake-sdk';
>
> snowflake.configure({
>   xmlParserConfig: {
>     /*  Parameters that you can override
>     *   ignoreAttributes - default true,
>     *   attributeNamePrefix - default '@_',
>     *   attributesGroupName - default unset,
>     *   alwaysCreateTextNode - default false
>     */
>     ignoreAttributes: false, attributesGroupName: '@', attributeNamePrefix: ''
>   }
> });
> ```

With these settings, the driver parses the XML data and produces the following:

```output
{
  exhibit: {
    piece: [
      {
        "name": "Mona Lisa",
        "artist": "Leonardo da Vinci",
        "year": "1503",
        '@': { id: '000001' }
      },
      {
        "name": "The Starry Night",
        "artist": "Vincent van Gogh",
        "year": "1889",
        '@': { id: '000002' }
      }
    ],
    '@': { name: 'Art Show' }
  }
```

## Returning result sets that contain duplicate column names

In version 1.8.0, the Snowflake Node.js Driver introduced a new [rowMode](nodejs-driver-options.md) configuration option
that lets you specify how you want the driver to return result sets that contain duplicate column names.

Prior to version 1.8.0, the Snowflake Node.js Driver always returned the result set from a SELECT command as a JavaScript
object. In cases where the result set contained duplicate column names and values, some elements could be omitted due to the way
JavaScript objects handle duplicate names.

The `rowMode` option lets you specify how result set data is returned to avoid potential loss of information, including as:

* `array`
* `object` (default)
* `object_with_renamed_duplicated_columns`

To illustrate, assume you submit the following query:

```sqlexample
select *
from (select 'a' as key, 1 as foo, 3 as name) as table1
join (select 'a' as key, 2 as foo, 3 as name2) as table2 on table1.key = table2.key
join (select 'a' as key, 3 as foo) as table3 on table1.key = table3.key
```

Based on the value of `rowMode`, the driver returns the result sets as follows:

* `object` (or unset)

  ```output
  {KEY: 'a', FOO: 3, NAME: 3, NAME2: 3};
  ```
* `array`

  ```output
  ['a', 1, 3, 'a', 2, 3, 'a', 3];
  ```
* `object_with_renamed_duplicated_columns`

  ```output
  {KEY: 'a', FOO: 1, NAME: 3, KEY_2: 'a', FOO_2: 2, NAME2: 3, KEY_3: 'a', FOO_3: 3};
  ```

You can set the `rowMode` parameter in the connection or statement configuration level, as shown below. If set in
both places, the statement level value takes precedence.

* Configuration level

  ```javascript
  snowflake.createConnection({
    account: account,
    username: username,
    // ...
    rowMode: 'array'
  });
  ```
* Statement level

  ```javascript
  connection.execute({
    sqlText: sql,
    rowMode: 'array',
    // ...
  });
  ```

## Customizing how result sets process JSON and XML data

The Snowflake Node.js driver provides the following default parsers for processing JSON and XML data in result sets:

* JSON: Returns the result from a new `Function` object.
* XML: `fast-xml-parser`.

  The default `fast-xml-parser` module supports a subset of features, as described in Parsing XML data.

  You can download [fast-xml-parser](https://www.npmjs.com/package/fast-xml-parser).

If you prefer to use a custom parser, you can use the following examples to configure them:

* Use the eval JSON parser, which the driver used prior to version 1.6.21:

  ```javascript
  import snowflake from 'snowflake-sdk';

  snowflake.configure({
    jsonColumnVariantParser: rawColumnValue => JSON.parse(rawColumnValue)
  });
  ```
* Use the `fast-xml-parser` parser, with the ability to [customize all of its options](https://github.com/NaturalIntelligence/fast-xml-parser/blob/master/docs/v4/2.XMLparseOptions.md):

  ```javascript
  import snowflake from 'snowflake-sdk';
  import { XMLParser } from 'fast-xml-parser';

  snowflake.configure({
    xmlColumnVariantParser: rawColumnValue => new XMLParser().parse(rawColumnValue)
  });
  ```
* Configure custom parsers for both in the same declaration:

  ```javascript
  import snowflake from 'snowflake-sdk';
  import { XMLParser } from 'fast-xml-parser';

  snowflake.configure({
    jsonColumnVariantParser: rawColumnValue => JSON.parse(rawColumnValue),
    xmlColumnVariantParser: rawColumnValue => new XMLParser().parse(rawColumnValue)
  });
  ```

---
title: Controlling global state in scalar Scala UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-optimizing.md
section: Developer Guide
---

# Controlling global state in scalar Scala UDFs

When designing a UDF and handler that requires access to shared state, you will need to account for the way Snowflake executes UDFs to
process rows.

Most handlers should follow these guidelines:

* If you need to initialize shared state that does not change across rows, initialize it outside the handler function, such as in a
  constructor.
* Write your handler method to be thread safe.
* Avoid storing and sharing dynamic state across rows.

If your UDF cannot follow these guidelines, or if you would like a deeper understanding of the reasons for these guidelines,
please read the next few subsections.

## Sharing state across calls

Snowflake expects scalar UDFs to be processed independently. Relying on state shared between invocations can result in unexpected
behavior. This is because the system can process rows in any order and spread those invocations across several JVMs (for handlers written
in Java or Scala).

UDFs should avoid relying on shared state across calls to the handler method. However, there are two situations in which you might want a
UDF to store shared state:

* Code that contains expensive initialization logic that you do not want to repeat for each row.
* Code that leverages shared state across rows, such as a cache.

If you need to share state across multiple rows, and if that state does not change over time, then use a constructor to create
shared state by setting instance-level variables. The constructor is executed only once per instance, while the handler is called
once per row, so initializing in the constructor is cheaper when a handler processes multiple rows. And because the constructor is
called only once, the constructor does not need to be written to be thread-safe.

If your UDF stores shared state that changes, then your code must be prepared to handle concurrent access to that state.

For more information about parallelism and shared state, refer to Understanding parallelization and
Storing JVM state information in this topic.

## Understanding parallelization

To improve performance, Snowflake parallelizes both across and within JVMs.

### Parallelizing across JVMs

Snowflake parallelizes across workers in a [warehouse](../../../user-guide/warehouses-overview.md). Each worker runs one (or more)
JVMs. This means that there is no global shared state. At most, state can be shared only within a single JVM.

### Parallelizing within JVMs

* Each JVM can execute multiple threads that can call the same instance’s handler method in parallel. This means that each
  handler method needs to be thread-safe.
* If a UDF is IMMUTABLE and a SQL statement calls the UDF more than once with the same arguments for the same row, then the UDF
  returns the same value for each call for that row.

  For example, the following returns the same value twice for each row if the UDF is IMMUTABLE:

  ```sqlexample
  SELECT my_scala_udf(42), my_scala_udf(42) FROM table1;
  ```

  If you would like multiple calls to return independent values even when passed the same arguments, and if you do not want
  to declare the function VOLATILE, then bind multiple separate UDFs to the same handler method.

  You might do this using the following steps.

  1. Create a JAR file named `@udf_libs/rand.jar` with the following code:

     ```scala
     class MyClass {

       var x: Double = 0.0

       // Constructor
       def this() = {
         x = Math.random()
       }

       // Handler
       def myHandler(): Double = x
     }
     ```
  2. Create the Scala UDFs as shown below.

     These UDFs have different names, but use the same JAR file and the same handler within that JAR file.

     Scala 2.12Scala 2.13 (Preview)

     ```sqlexample
     CREATE FUNCTION my_scala_udf_1()
       RETURNS DOUBLE
       LANGUAGE SCALA
       IMPORTS = ('@udf_libs/rand.jar')
       HANDLER = 'MyClass.myHandler';

     CREATE FUNCTION my_scala_udf_2()
       RETURNS DOUBLE
       LANGUAGE SCALA
       IMPORTS = ('@udf_libs/rand.jar')
       HANDLER = 'MyClass.myHandler';
     ```

     ```sqlexample
     CREATE FUNCTION my_scala_udf_1()
       RETURNS DOUBLE
       LANGUAGE SCALA
       IMPORTS = ('@udf_libs/rand.jar')
       HANDLER = 'MyClass.myHandler';

     CREATE FUNCTION my_scala_udf_2()
       RETURNS DOUBLE
       LANGUAGE SCALA
       IMPORTS = ('@udf_libs/rand.jar')
       HANDLER = 'MyClass.myHandler';
     ```
  3. Use the following code to call both UDFs.

     The UDFs point to the same JAR file and handler. These calls create two instances of the same class. Each instance returns an
     independent value, so the example below returns two independent values, rather than returning the same value twice:

     ```sqlexample
     SELECT my_scala_udf_1(), my_scala_udf_2() FROM table1;
     ```

## Storing JVM state information

One reason to avoid relying on dynamic shared state is that rows are not necessarily processed in a predictable order. Each time a SQL
statement is executed, Snowflake can vary the number of batches, the order in which batches are processed, and the order of rows within
a batch. If a scalar UDF is designed so that one row affects the return value for a subsequent row, then the UDF can return different
results each time that the UDF is executed.

---
title: Costs of external network access
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-billing.md
section: Developer Guide
---

# Costs of external network access

Using external network access incurs normal costs associated with:

* [Snowflake warehouse usage.](../../user-guide/cost-understanding-compute.md)
* [Data transfer.](../../user-guide/cost-understanding-data-transfer.md)

Data transfer charges will appear as a TRANSFER_TYPE of EXTERNAL_ACCESS in the [DATA_TRANSFER_HISTORY view](../../sql-reference/account-usage/data_transfer_history.md).
Any data egress traffic associated with a “bring-your-own-IP” (BYOIP) destination will be charged at cross-cloud or Internet traffic rates.

Calling to an external network location from a handler will result in payload egress. As data egress, this call results in data
transfer cost.

In addition, you might need to pay indirect or third-party charges, including charges by the provider of the remote service. Charges can
vary from vendor to vendor.

---
title: Costs of telemetry data collection
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-billing.md
section: Developer Guide
---

# Costs of telemetry data collection

When you log messages from a function or procedure, Snowflake collects the messages in batches and ingests the batches into the event table.

To perform this work, Snowflake uses Snowflake-managed resources, also referred to as the serverless compute model. As is the case with
[other serverless features](../../user-guide/cost-understanding-compute.md), Snowflake bills your account for the compute resource and cloud
services usage needed to ingest the logged messages. These costs appear on your bill as separate line items.

To determine the credit usage for logging over time, use the following views:

* [METERING_HISTORY view](../../sql-reference/account-usage/metering_history.md).
* [METERING_DAILY_HISTORY view](../../sql-reference/account-usage/metering_daily_history.md) (Account Usage).
* [METERING_DAILY_HISTORY view](../../sql-reference/organization-usage/metering_daily_history.md) (Organization Usage).

To reduce the cost of logging:

* Avoid logging frequently over a long period of time.
* [Set the level of messages ingested](telemetry-levels.md) on specific objects. For example, set the
  log level for specific functions or procedures in a session, instead of setting the log level for all functions or procedures.

If you do not want to collect telemetry data, you can do any one of the following:

* Disable or change telemetry levels appropriately. For more information, see [Set telemetry levels](logging-tracing-overview.md).

  This option is not applicable for [Native Apps](../native-apps/native-apps-about.md).
* Uninstall the applications or connectors emitting telemetry data, or drop the unnecessary objects.
* If you do not want any logging and tracing events to be collected at all in the account, execute the following command to deactivate
  the event table:

  ```sqlexample
  ALTER ACCOUNT SET EVENT_TABLE = NONE
  ```

---
title: Creating a Java UDF handler
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-creating.md
section: Developer Guide
---

# Creating a Java UDF handler

This topic describes how to write the Java handler for a user-defined function (UDF). When you write a Java UDF, you write Java code for
Snowflake to execute as UDF logic. This Java code is the UDF’s handler.

You deploy the UDF with [CREATE FUNCTION](../../../sql-reference/sql/create-function.md), giving the UDF a name and specifying the Java method as the handler to use when the
UDF is called. For more information about creating a UDF with SQL, see [Creating a user-defined function](../udf-creating-sql.md).

For more example code, see [Java UDF handler examples](udf-java-cookbook.md).

## Writing the UDF handler in Java

Use the following requirements and guidelines when writing your Java UDF handler.

* Define the class as public.
* Inside the class, declare at least one public method to use as a UDF handler.

  For an inline UDF, declare one handler method only. If instead you intend to package
  the class into a JAR as a staged handler, you can declare multiple handler methods, later
  specifying each as a handler with the HANDLER clause of a [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement.

  For more about the difference between an in-line and staged handler, see [Keeping handler code in-line or on a stage](../../inline-or-staged.md).

  You can declare other methods, if needed, to be called by the handler method.

  Use the following requirements and guidelines for each handler method:

  + Declare the handler method as public, either static or non-static.

    If the method is non-static, your class must also declare a zero-argument constructor or no constructor at all.

    Snowflake does not pass any arguments to the constructor when it instantiates the class. If the constructor throws an error, the error
    is thrown as a user error, along with the exception message.
  + Specify an appropriate return type.

    The return type must be one of the data types specified in the
    `Java Data Type` column of the [SQL-Java Type Mappings table](../../udf-stored-procedure-data-type-mapping.md). The return type must be
    compatible with the SQL data type specified in the RETURNS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement.
  + Ensure that each handler method argument (if any) is a data type specified in the `Java Data Type` column of the
    [SQL-Java Type Mappings table](../../udf-stored-procedure-data-type-mapping.md).

    When choosing data types of Java variables, take into account the maximum and minimum possible values of the data that could be sent
    from (and returned to) Snowflake.
  + If you overload a method in a given Java class, keep in mind that Snowflake uses only the *number* of method arguments, not
    their *types*, to differentiate handler methods within a class. Resolving based on data types is impractical because some SQL
    data types can be mapped to more than one Java data type and thus potentially to more than one handler method signature.

    For example, if two Java methods in a class have the same name, the same number of arguments, and different data types,
    calling a UDF that uses one of these methods as a handler generates an error similar to the following:

    ```output
    Cannot determine which implementation of handler "handler name" to invoke since there are multiple
    definitions with <number of args> arguments in function <user defined function name> with
    handler <class name>.<handler name>
    ```

    If a warehouse is available, the error is detected at the time that the UDF is created. Otherwise, the error occurs when the
    UDF is called.
  + Comply with the Snowflake-imposed constraints for Java UDFs in each handler method and methods it calls. For more on these
    constraints, see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).

## Adding dependencies to the classpath

When your handler code requires classes packaged in external JAR files, you can add these dependencies to the Snowflake-managed classpath
available to your handler. The following describes how to add JAR files to the classpath visible to a Java UDF handler. For more information,
see [Making dependencies available to your code](../../upload-dependencies.md).

## Organizing your files

If you plan to compile the Java code to create the JAR file yourself, you can organize the files as shown below. This example assumes that
you plan to use Java’s package mechanism.

* developmentDirectory

  + packageDirectory

    - class_file1.java
    - class_file2.java
  + classDirectory

    - class_file1.class
    - class_file2.class
  + manifest_file.manifest (optional)
  + jar_file.jar
  + put_command.sql

`developmentDirectory`
:   This directory contains the project-specific files required to create your Java UDF.

`packageDirectory`
:   This directory contains the .java files to compile and include in the package.

`class_file#.java`
:   These files contain the Java source code of the UDF.

`class_file#.class`
:   These are the .class file(s) created by compiling the .java files.

`manifest_file.manifest`
:   The optional manifest file used when combining the .class files (and optionally, dependency JAR files) into the JAR file.

`jar_file.jar`
:   The JAR file that contains the UDF code.

`put_command.sql`
:   This file contains the SQL [PUT](../../../sql-reference/sql/put.md) command to copy the JAR file to a Snowflake
    [stage](../../../sql-reference/sql/create-stage.md).

### Compiling the Java code and creating the JAR file

To create a JAR file that contains the compiled Java code:

* Use javac to compile your .java file to a .class file.

  If you use a compiler newer than version 11.x, you can use the “–release” option to specify that the target version is
  version 11.
* Put your .class file into a JAR file. You can package multiple class files (and other JAR files) into your JAR file.

  For example:

  ```none
  jar cf ./my_udf.jar MyClass.class
  ```

  A manifest file is required if your handler class is in a package, and optional otherwise. The following example
  uses a manifest file:

  ```none
  jar cmf my_udf.manifest ./my_udf.jar example/MyClass.class
  ```

  To build the jar file with all dependencies included, you can use Maven’s `mvn package` command with
  the maven-assembly-plugin. For more information about the maven-assembly-plugin, see the
  [Maven usage page](https://maven.apache.org/plugins/maven-assembly-plugin/usage.html).

  Snowflake automatically supplies the [standard Java libraries](https://docs.oracle.com/en/java/javase/11/docs/api/index.html) (e.g. `java.util`). If your code calls those libraries,
  you do not need to include them in your JAR file.

  The methods that you call in libraries must follow the same Snowflake-imposed constraints as your Java method. For more on these
  constraints, see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).

### Copying the JAR file to your stage

In order for Snowflake to read from the JAR containing your handler method, you need to copy the JAR to one of the following kinds of stage:

* A user or named internal stage.

  Snowflake does not currently support using a table stage to store a JAR file with UDF handlers. For more on internal stages, see
  [Choosing an internal stage for local files](../../../user-guide/data-load-local-file-system-create-stage.md).
* An external stage.

The stage hosting the JAR file must be readable by the owner of the UDF.

Typically, you upload the JAR to a named internal stage using the [PUT](../../../sql-reference/sql/put.md) command. Note that you can’t execute
the `PUT` command through the Snowflake GUI; you can use SnowSQL to execute `PUT`. See the
[Java UDF handler examples](udf-java-cookbook.md) section for an example `PUT` command to copy a .jar file to a stage.

For more about creating stages, see [CREATE STAGE](../../../sql-reference/sql/create-stage.md).

**Caveats and Best Practices**

If you delete or rename the JAR file, you can no longer call the UDF.

If you need to update your JAR file, then:

* Update it while no calls to the UDF can be made.
* If the old .jar file is still in the stage, the `PUT` command should include the clause `OVERWRITE=TRUE`.

> **Note:**
>
> A user performing related to UDFs must have a role that has been assigned permissions required for the action. For more information,
> see [Granting privileges for user-defined functions](../udf-access-control.md).

---
title: Creating a Listing using Declarative Sharing
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/listing.md
section: Developer Guide
---

# Creating a Listing using Declarative Sharing

After you have created and tested your Declarative Native App, you can make it available to consumers by creating a listing. A listing makes your app discoverable and accessible to other Snowflake accounts, either privately to specific accounts or publicly on the Snowflake Marketplace. This topic provides an overview of the requirements and steps to create a listing for your Declarative Native App.

## Create a listing

To publish to consumers, a provider can share a Declarative Native App with consumers by publishing a [listing](https://other-docs.snowflake.com/collaboration/collaboration-listings-about).

The process is the same as with Snowflake Native Apps. For more information, see [Native Apps workflow:publish](../native-apps/native-apps-workflow.md).

> **Note:**
>
> Organizational listings are currently only supported when the provider and consumer are in different Snowflake accounts.
>
> For same-account testing, perform the installation directly from the package rather than through the organizational listing mechanism.

### Access control requirements

To create a listing, the provider requires additional privileges, including:

* **OWNERSHIP** on the application package
* Global **CREATE LISTING** privilege

For additional requirements to create a listing, see [Use listings as a provider](https://other-docs.snowflake.com/en/collaboration/provider-becoming).

## Consumers install and use the app

After you create a listing for your app, consumers can install the app from the Snowflake Marketplace. For more information, see [Install a Declarative Native App](consumer/install.md).

---
title: Creating a stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-creating.md
section: Developer Guide
---

# Creating a stored procedure

You can create a [stored procedure](stored-procedures-overview.md) using any of several methods
available with Snowflake. These methods are described in this topic.

## Create a procedure

1. Write procedure logic as a handler using one of several supported languages, including Python, Java, and Scala.
2. Choose a tool or API to create the procedure with the handler you wrote.

   For more information about each of these, see Tools for creating procedures.

   |  |  |
   | --- | --- |
   | SQL | Use SQL and write logic in one of several languages. |
   | Snowpark | Use the Snowpark API for Java, Python, or Scala. |
   | Command line | Execute Snowflake CLI commands to create the procedure. |
   | Python API | Execute client-side Python commands to create the procedure. |
   | REST | Make requests to a RESTful API to create the procedure. |
3. [Execute the procedure](stored-procedures-calling.md) using one of several tools, depending on
   your needs.

## Tools for creating procedures

You can create a [procedure](stored-procedures-overview.md) using any of several methods available
with Snowflake, depending on the language and skill set you have available. Choose the tool that’s right for your needs from the
following table.

| Language | Approach |
| --- | --- |
| **SQL**  Execute a SQL command, such as by using Snowsight. | Execute the SQL CREATE PROCEDURE command to create a procedure with handler code written in one of the following languages:   * [Java](java/procedure-java-overview.md) * [JavaScript](stored-procedures-javascript.md) * [Python](python/procedure-python-overview.md) * [Scala](scala/procedure-scala-overview.md) * [Snowflake Scripting](stored-procedures-snowflake-scripting.md) |
| **Java, Python, or Scala**  Write code in one of the supported languages, then execute the code locally to perform operations in Snowflake. | Execute client code that uses Snowpark APIs in one of the following languages.   * [Java](../snowpark/java/creating-sprocs.md) * [Python](../snowpark/python/creating-sprocs.md) * [Scala](../snowpark/scala/creating-sprocs.md) |
| **Command line**  Create and manage Snowflake entities by executing commands from the command line. | Execute commands of the [Snowflake CLI](../snowflake-cli/objects/manage-objects.md). |
| **Python**  On the client, write code that executes management operations on Snowflake. | Execute code that uses the [Snowflake Python API](../snowflake-python-api/snowflake-python-managing-functions-procedures.md). |
| **RESTful APIs** (language-agnostic)  Make requests of RESTful endpoints to create and manage Snowflake entities. | Make a request to create a procedure using the [Snowflake REST API](../snowflake-rest-api/procedure/procedure-introduction.md). |

## Key properties

The following describes some of the properties required or typically used when creating a procedure.

Procedure name:
:   The procedure name does not need to match the name of the handler. For more about name constraints and conventions, see
    [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

Arguments:
:   For more information on requirements, see [Defining arguments for UDFs and stored procedures](../udf-stored-procedure-arguments.md).

Return type:
:   For information about how Snowflake maps SQL data types to handler data types, see
    [Data Type Mappings Between SQL and Handler Languages](../udf-stored-procedure-data-type-mapping.md).

Handler name:
:   When required, this is the name of the class or method containing code that executes when the procedure is called. You need specify a
    handler name only for handlers written in Java, Python, and Scala. For JavaScript and Snowflake Scripting handlers, all code specified
    in-line will be executed as the handler.

Dependencies:
:   For a handler written in Java, Python, or Scala, you might also need to specify the Snowpark library, such as when creating the procedure.

    For more about making dependencies available to your handler, see [Making dependencies available to your code](../upload-dependencies.md).

Handler language runtime:
:   When the handler language is Java, Python, or Scala, specify the runtime version to indicate which supported runtime version to use.
    Keep in mind that if you use the default version, that default will change over time.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PROCEDURE | Schema | Required to create a permanent stored procedure. Not required when creating a temporary procedure that persists for only the duration of the session in which the procedure was created. |
| USAGE | Procedure | Granting the USAGE privilege on the newly created procedure to a role allows users with that role to call the procedure elsewhere in Snowflake. |
| USAGE | External access integration | Required on integrations, if any, specified when creating the procedure. For more information, see [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md). |
| READ | Secret | Required on secrets, if any, specified when creating the procedure. For more information, see [Creating a secret to represent credentials](../external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../external-network-access/creating-using-external-network-access.md). |
| USAGE | Schema | Required on schemas containing secrets, if any, specified when creating the procedure. For more information, see [Creating a secret to represent credentials](../external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../external-network-access/creating-using-external-network-access.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

### All handler languages

* Stored procedures support [overloading](../udf-stored-procedure-naming-conventions.md). Two procedures can have the same
  name if they have a different number of parameters or different data types for their parameters.
* Stored procedures are not atomic; if one statement in a stored procedure fails, the other statements in the stored
  procedure are not necessarily rolled back. For information about stored procedures and transactions, see
  [Transaction management](stored-procedures-usage.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../../sql-reference/metadata.md).

> **Tip:**
>
> If your organization uses a mix of caller’s rights and owner’s rights stored procedures, you might want to use a
> naming convention for your stored procedures to indicate whether an individual stored procedure is a caller’s
> rights stored procedure or an owner’s rights stored procedure.

* Setting LOG_LEVEL or TRACE_LEVEL as properties in a CREATE PROCEDURE statement is not supported. To set these properties
  on a procedure, use [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) after creating the procedure, or use
  [CREATE OR ALTER PROCEDURE](../../sql-reference/sql/create-procedure.md).

### Java

See the [known limitations](java/procedure-java-limitations.md).

### Javascript

A JavaScript stored procedure can return only a single value, such as a string (for example, a success/failure indicator)
or a number (for example, an error code). If you need to return more extensive information, you can return a
VARCHAR that contains values separated by a delimiter (such as a comma), or a semi-structured data type, such
as [VARIANT](../../sql-reference/data-types-semistructured.md).

### Python

See the [known limitations](python/procedure-python-limitations.md).

### Scala

See the [known limitations](scala/procedure-scala-limitations.md).

## Create a stored procedure with SQL

You can create a stored procedure with SQL using the following steps.

> **Note:**
>
> You can also create and call a procedure that isn’t stored for later use. Many of the properties for that kind of procedure are the same
> as for a stored procedure. For more information, see [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md).

1. Write handler code that executes when the procedure is called.

   You can use one of the supported handler languages. For more information, see [Supported languages and tools](stored-procedures-overview.md).
2. Choose whether you’ll keep the handler code in-line with the CREATE PROCEDURE SQL statement or refer to it on a stage.

   Each has its advantages. For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).
3. Execute a [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) statement in SQL, specifying properties of the procedure.

   Code in the following example creates a procedure called `myProc` with a in-line handler `MyClass.myMethod`. The
   handler language is Java, which (like procedure handlers written in Scala and Python) requires a Session object from the Snowpark
   library. Here, the PACKAGES clause refers to the Snowpark library included with Snowflake.

   ```sqlexample-java
   CREATE OR REPLACE PROCEDURE myProc(fromTable STRING, toTable STRING, count INT)
     RETURNS STRING
     LANGUAGE JAVA
     RUNTIME_VERSION = '11'
     PACKAGES = ('com.snowflake:snowpark:latest')
     HANDLER = 'MyClass.myMethod'
     AS
     $$
       import com.snowflake.snowpark_java.*;

       public class MyClass
       {
         public String myMethod(Session session, String fromTable, String toTable, int count)
         {
           session.table(fromTable).limit(count).write().saveAsTable(toTable);
           return "Success";
         }
       }
     $$;
   ```

---
title: Creating a stored procedure from a Python worksheet
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-create-worksheet.md
section: Developer Guide
---

# Creating a stored procedure from a Python worksheet

You can create a stored procedure from a [Python worksheet](../../snowpark/python/python-worksheets.md) by using Snowsight.

For example, you might write code in a Python worksheet that extracts data from stages or database objects in Snowflake, transforms the
data, then stores the transformed data in Snowflake. You could then deploy that code as a stored procedure and build a data pipeline,
all without leaving Snowflake.

Create a Python stored procedure from your Python worksheet to automate your code. For details on writing Python worksheets, see
[Writing Snowpark Code in Python Worksheets](../../snowpark/python/python-worksheets.md).

## Prerequisites

Your role must have OWNERSHIP or CREATE PROCEDURE privileges on the database schema in which you run your Python worksheet to deploy it
as a stored procedure.

## Deploy a Python worksheet as a stored procedure

To create a Python stored procedure that automates the code in your Python worksheet, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Worksheets.
3. Open the Python worksheet you want to deploy as a stored procedure.
4. Select Deploy.
5. Enter a name for the stored procedure.
6. (Optional) Enter a comment with details about the stored procedure.
7. (Optional) Select Replace if exists to replace an existing stored procedure with the same name.
8. For Handler, select the handler function for your stored procedure. For example, `main`.
9. Review the arguments used by your handler function and, if needed, override the SQL data type mapping for a typed argument.
   For details about how Python types are mapped to SQL types, see [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
10. (Optional) Select Open in Worksheets to open the stored procedure definition in a SQL worksheet.
11. Select Deploy to create the stored procedure.
12. After the stored procedure is created, you can go to the procedure details or select Done.

You can create multiple stored procedures from one Python worksheet.

After you create a stored procedure, you can automate it as part of a task. Refer to [Introduction to tasks](../../../user-guide/tasks-intro.md).

---
title: Creating a user-defined function
source: https://docs.snowflake.com/en/developer-guide/udf/udf-creating-sql.md
section: Developer Guide
---

# Creating a user-defined function

You can create a [user-defined function (UDF)](udf-overview.md) using any of several methods available with Snowflake.
These methods are described in this topic.

## Create a UDF

1. Write function logic as a handler using one of several supported languages, including Python, Java, and Scala.
2. Choose a tool or API to create the function with the handler you wrote.

   For more information about each of these, see Tools for creating UDFs.

   |  |  |
   | --- | --- |
   | SQL | Use SQL and write logic in one of several languages. |
   | Snowpark | Use the Snowpark API for Java, Python, or Scala. |
   | Command line | Execute CLI commands to create the function. |
   | Python API | Execute client-side Python commands to create the function. |
   | REST | Make requests to a RESTful API to create the function. |
3. [Execute the function](udf-calling-sql.md) using one of several tools, depending on your needs.

## Tools for creating UDFs

You can create a [UDF](udf-overview.md) using any of several methods available with Snowflake, depending on the
language and skill set you have available. Choose the tool that’s right for your needs from the following table.

| Language | Approach |
| --- | --- |
| **SQL**  Execute a SQL command, such as by using Snowsight. | Execute the SQL CREATE FUNCTION command to create a function with handler code written in one of the following languages:   * [Java](java/udf-java-introduction.md) * [JavaScript](javascript/udf-javascript-introduction.md) * [Python](python/udf-python-introduction.md) * [Scala](scala/udf-scala-introduction.md) * [SQL](sql/udf-sql-introduction.md) |
| **Java, Python, or Scala with Snowpark**  Write code in one of the supported languages, then execute the code locally to perform operations in Snowflake. | Execute client code that uses Snowpark APIs in one of the following languages.   * Java [UDFs](../snowpark/java/creating-udfs.md) * Python [UDFs](../snowpark/python/creating-udfs.md) | [UDTFs](../snowpark/python/creating-udtfs.md)   | [UDAFs](../snowpark/python/creating-udafs.md) * Scala [UDFs](../snowpark/scala/creating-udfs.md) |
| **Command line**  Create and manage Snowflake entities by executing commands from the command line. | Execute commands of the [Snowflake CLI](../snowflake-cli/objects/manage-objects.md). |
| **Python**  On the client, write code that executes management operations on Snowflake. | Execute code that uses the [Snowflake Python API](../snowflake-python-api/snowflake-python-managing-functions-procedures.md). |
| **RESTful APIs** (language-agnostic)  Make requests of RESTful endpoints to create and manage Snowflake entities. | Make a request to create a procedure using the [Snowflake REST API](../snowflake-rest-api/user-defined-function/user-defined-function-introduction.md) |

## Key properties

The following describes some of the properties required or typically used when creating a function.

Function name:
:   The function name does not need to match the name of the handler. For more about name constraints and conventions, see
    [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

Arguments:
:   For more information on requirements, see [Defining arguments for UDFs and stored procedures](../udf-stored-procedure-arguments.md).

Return type:
:   For information about how Snowflake maps SQL data types to handler data types, see
    [Data Type Mappings Between SQL and Handler Languages](../udf-stored-procedure-data-type-mapping.md).

Handler name:
:   When required, this is the name of the class or method containing code that executes when the function is executed. You need specify a
    handler name only for handlers written in Java, Python, and Scala. For JavaScript and SQL handlers, all code specified
    in-line will be executed as the handler.

Dependencies:
:   For a handler written in Java, Python, or Scala, you might also need to specify the Snowpark library, such as when creating the function.

    For more about making dependencies available to your handler, see [Making dependencies available to your code](../upload-dependencies.md).

Handler language runtime:
:   When the handler language is Java, Python, or Scala, specify the runtime version to indicate which supported runtime version to use.
    Keep in mind that if you use the default version, that default will change over time.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FUNCTION | Schema | The privilege only enables the creation of user-defined functions in the schema.  If you want to enable the creation of data metric functions, the role must have the CREATE DATA METRIC FUNCTION privilege. |
| USAGE | Function | Granting the USAGE privilege on the newly created function to a role allows users with that role to call the function elsewhere in Snowflake (such as masking policy owner role for External Tokenization). |
| USAGE | External access integration | Required on integrations, if any, specified by the EXTERNAL_ACCESS_INTEGRATIONS parameter. For more information, see [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md). |
| READ | Secret | Required on secrets, if any, specified by the SECRETS parameter. For more information, see [Creating a secret to represent credentials](../external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../external-network-access/creating-using-external-network-access.md). |
| USAGE | Schema | Required on schemas containing secrets, if any, specified by the SECRETS parameter. For more information, see [Creating a secret to represent credentials](../external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../external-network-access/creating-using-external-network-access.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

The following notes describe usage notes relevant to the languages supported for writing handlers. Although notes in the following
sections refer to clauses of the SQL CREATE FUNCTION command, these clauses are typically represented in other ways in other tools
you can use to create functions.

### All languages

* `function_definition` has size restrictions. The maximum allowable size is subject to change.
* The delimiters around the `function_definition` can be either single quotes or a pair of dollar signs.

  Using `$$` as the delimiter makes it easier to write functions that contain single quotes.

  If the delimiter for the body of the function is the single quote character,
  then any single quotes within `function_definition` (such as string
  literals) must be escaped by single quotes.
* If using a UDF in a [masking policy](../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF, and masking policy match. For
  more information, see [User-defined functions in a masking policy](../../user-guide/security-column-intro.md).
* If you specify the [CURRENT_DATABASE](../../sql-reference/functions/current_database.md) or [CURRENT_SCHEMA](../../sql-reference/functions/current_schema.md) function in the
  handler code of the UDF, the function returns the database or schema that contains the UDF, not the database or schema in use for
  the session.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../../sql-reference/metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Setting LOG_LEVEL or TRACE_LEVEL as properties in a CREATE FUNCTION statement is not supported. To set these properties
  on a function, use [ALTER FUNCTION](../../sql-reference/sql/alter-function.md) after creating the function, or use
  [CREATE OR ALTER FUNCTION](../../sql-reference/sql/create-function.md).

### Java

* In Java, primitive data types don’t allow NULL values, so passing a NULL for an argument of such a type results in
  an error.
* In the HANDLER clause, the method name is case-sensitive.
* In the IMPORTS and TARGET_PATH clauses:

  + Package, class, and file name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for Snowflake system-defined dependencies, such as those
  from Snowpark. For other dependencies, specify dependency JAR files with the IMPORTS clause.
* Snowflake validates that:

  + The JAR file specified in the CREATE FUNCTION statement’s HANDLER exists and contains the specified
    class and method.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the Java method.

  Validation can be done at creation time or execution time, depending on whether you are connected to an active Snowflake warehouse.

  + Creation time — If you are connected to an active Snowflake warehouse at the time the CREATE FUNCTION statement is
    executed, the UDF is validated at creation time.
  + Execution time — If you are not connected to an active Snowflake warehouse, the UDF is created, but is not validated
    immediately, and Snowflake returns the following message:

    `Function <name> created successfully, but could not be validated since there is no active warehouse`.

### JavaScript

* Snowflake does not validate JavaScript code at UDF creation time. In other words, creation of the UDF succeeds regardless of whether
  the code is valid. If the code is not valid, Snowflake returns errors when the UDF is called at query time.

### Python

* In the HANDLER clause, the handler function name is case-sensitive.
* In the IMPORTS clause:

  + File name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for dependencies, such as those
  from Snowpark. For other dependencies, specify dependency files with the IMPORTS clause.
* Snowflake validates that:

  + The function or class specified in the CREATE FUNCTION statement’s HANDLER exists.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the handler.

### Scala

* In the HANDLER clause, the method name is case-sensitive.
* In the IMPORTS and TARGET_PATH clauses:

  + Package, class, and file name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for Snowflake system-defined dependencies, such as those
  from Snowpark. For other dependencies, specify dependency JAR files with the IMPORTS clause.
* Snowflake validates that:

  + The JAR file specified in the CREATE FUNCTION statement’s HANDLER exists and contains the specified
    class and method.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the Scala method.

  Validation can be done at creation time or execution time, depending on whether you are connected to an active Snowflake warehouse.

  + Creation time — If you are connected to an active Snowflake warehouse at the time the CREATE FUNCTION statement is
    executed, the UDF is validated at creation time.
  + Execution time — If you are not connected to an active Snowflake warehouse, the UDF is created, but is not validated
    immediately, and Snowflake returns the following message:

    `Function <name> created successfully, but could not be validated since there is no active warehouse`.

### SQL

* Currently, the NOT NULL clause is not enforced for SQL UDFs.

## Create a UDF with SQL

You can create a UDF with SQL using the following steps.

You create a UDF with the following steps:

1. Write handler code that executes when the UDF is called.

   You can use one of the supported handler languages. For more information, see [Supported languages and tools](udf-overview.md).
2. Choose whether you’ll keep the handler code in-line with the CREATE FUNCTION SQL statement or refer to it on a stage.

   Each has its advantages. For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).
3. Execute a [CREATE FUNCTION](../../sql-reference/sql/create-function.md) statement in SQL, specifying properties of the function.

   Code in the following example creates a UDF called `function_name` with the in-line handler
   `HandlerClass.handlerMethod`.

   ```sqlsyntax
   create function function_name(x integer, y integer)
     returns integer
     language java
     handler='HandlerClass.handlerMethod'
     target_path='@~/HandlerCode.jar'
     as
     $$
         class HandlerClass {
             public static int handlerMethod(int x, int y) {
               return x + y;
             }
         }
     $$;
   ```

   The following describes some of the properties required or typically used when creating a function.

   * Function name.

     The UDF name does not need to match the name of the handler. The CREATE FUNCTION statement associates the UDF name with the handler.

     For more about name constraints and conventions, see [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).
   * Function arguments, if any.

     See [Defining arguments for UDFs and stored procedures](../udf-stored-procedure-arguments.md).
   * Return type with the RETURNS clause.

     For a scalar return value, the RETURNS clause will specify a single return type; for a tabular return value, RETURNS will specify
     the TABLE keyword specifying column type in the tabular return value.

     For information about how Snowflake maps SQL data types to handler data types, see
     [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).
   * Handler name with the HANDLER clause.

     When required, this is the name of the class or method containing code that executes when the UDF is called. You need specify a
     handler name only for handlers written in Java and Python. For JavaScript and SQL handlers, all code specified in-line will be
     executed as the handler.

     The following table describes the form of the HANDLER clause’s value based on the handler language and function type.

     | Handler Language | UDF | UDTF |
     | --- | --- | --- |
     | Java | Class and method name.  For example: `MyClass.myMethod` | Class name only. Handler method name is predetermined by the required interface. |
     | JavaScript | None. | None. |
     | Python | Class and method name if a class is used; otherwise, function name.  For example: `module.my_function` or `my_function` | Class name only. Handler method name is predetermined by the required interface. |
     | SQL | None. | None. |
   * Dependencies required by the handler, if any, using the IMPORTS or PACKAGES clauses.

     For more about making dependencies available to your handler, see [Making dependencies available to your code](../upload-dependencies.md).
   * Handler language runtime with RUNTIME_VERSION clause.

     When the handler language is Java or Python, use the RUNTIME_VERSION clause to specify which supported runtime version to use.
     Omitting the clause will prompt Snowflake to use the default, which may change in the future.

---
title: Creating and calling stored procedures
source: https://docs.snowflake.com/en/developer-guide/sql-api/using-stored-procedures.md
section: Developer Guide
---

# Creating and calling stored procedures

You can use the SQL API to create and call stored procedures. The following is an example of the body of a POST request that
creates a new stored procedure that passes in the name of a table and returns the number of rows in that table:

```sqljson
{
  "statement": "create or replace procedure sql_api_stored_proc(table_name varchar) returns varchar language javascript as $$var sql_command = \"select count(*) from \" + TABLE_NAME; var rs = snowflake.execute({sqlText: sql_command}); rs.next(); var rowCount = rs.getColumnValue(1); return rowCount; $$;",
  "role": "MY_ROLE",
  "warehouse": "MY_WAREHOUSE",
  "database": "MY_DB",
  "schema": "MY_SCHEMA"
}
```

The following is an example of the body of the response for this request:

```sqljson
{
  "resultSetMetaData": {
    "numRows": 1,
    "format": "jsonv2",
    "rowType": [ {
      "name": "status",
      "database": "",
      "schema": "",
      "table": "",
      "type": "text",
      "byteLength": 16777216,
      "scale": null,
      "precision": null,
      "nullable": true,
      "collation": null,
      "length": 16777216
    } ]
  },
  "data": [ [ "Function SQL_API_STORED_PROC successfully created." ] ],
  "code": "090001",
  "statementStatusUrl": "/api/v2/statements/019c9f28-0502-f257-0000-438300e0a02a?requestId=...",
  "sqlState": "00000",
  "statementHandle": "019c9f28-0502-f257-0000-438300e0a02a",
  "message": "Statement executed successfully.",
  "createdOn": 1622494569592
}
```

The following is an example of the body of a POST request that calls the stored procedure, passing in the table name “prices”:

```sqljson
{
  "statement": "call sql_api_stored_proc('prices');",
  "role": "MY_ROLE",
  "warehouse": "MY_WAREHOUSE",
  "database": "MY_DB",
  "schema": "MY_SCHEMA"
}
```

The following is an example of the body of the response for this request:

```sqljson
{
  "resultSetMetaData": {
    "numRows": 1,
    "format": "jsonv2",
    "rowType": [ {
      "name": "SQL_API_STORED_PROC",
      "database": "",
      "schema": "",
      "table": "",
      "type": "text",
      "byteLength": 16777216,
      "length": 16777216,
      "scale": null,
      "precision": null,
      "nullable": true,
      "collation": null
    } ]
  },
  "data": [ [ "4" ] ],
  "code": "090001",
  "statementStatusUrl": "/api/v2/statements/019c9f2a-0502-f244-0000-438300e04496?requestId=...",
  "sqlState": "00000",
  "statementHandle": "019c9f2a-0502-f244-0000-438300e04496",
  "message": "Statement executed successfully.",
  "createdOn": 1622494718694
}
```

---
title: Creating and using an external access integration
source: https://docs.snowflake.com/en/developer-guide/external-network-access/creating-using-external-network-access.md
section: Developer Guide
---

# Creating and using an external access integration

To enable access to specific external network locations, you create an external access integration that specifies a list of network rules
that specify external locations and a list of secrets you are allowed to use. By using the EXTERNAL_ACCESS_INTEGRATIONS clause to refer to
this integration when creating the UDF or procedure with CREATE FUNCTION or CREATE PROCEDURE, you allow the handler code to use the secret
to authenticate with the external location.

An administrator can monitor requests made to external network locations by using the
[EXTERNAL_ACCESS_HISTORY](../../sql-reference/account-usage/external_access_history.md) view.

For an end-to-end sequence of code examples you might use to set up and use external access, refer to
[External network access examples](external-network-access-examples.md).

Use the following steps to set up access to an external network location from a UDF or procedure.

1. Choose whether to connect to the external network location using the
   public internet or private connectivity.
2. Create a network rule to represent the external network location.
3. Create a secret to hold credentials.
4. Create an external access integration,
   aggregating the secret and network rule so that they may be used by the handler when accessing the external location.
5. Create the UDF or procedure with the EXTERNAL_ACCESS_INTEGRATIONS
   parameter set to the integration’s name as a value. This gives the function or procedure permission to access the external network
   locations and use the credentials specified by network rules and secrets in the integration.

   You separately set the SECRET parameter to the name of a secret included in the integration so that you have access to the
   secret’s contents from handler code.

   In function or procedure handler code, access the external network location specified in a network rule included in the integration.
   An attempt to access a network location that is not specified by an allowed network rule will be denied.

## Choosing the public internet or private connectivity

When you connect to an external network location, the connectivity from Snowflake to the external network location can go through the
public internet or use private connectivity through [Azure Private Link](https://learn.microsoft.com/en-us/azure/private-link/private-link-overview)
(Microsoft documentation), [AWS PrivateLink](https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html) (AWS documentation), or
[Google Cloud Private Service Connect](https://cloud.google.com/vpc/docs/private-service-connect) (Google Cloud documentation).
You might use private connectivity based on the security requirements of your connection to the external network location.
Using private connectivity can help you meet your security requirements.

If you use the public internet, follow the instructions in the following sections in this topic.

If you use private connectivity, the person configuring the interaction must have been assigned the ACCOUNTADMIN role. In addition,
your Snowflake account must be [Business Critical Edition](../../user-guide/intro-editions.md) (or higher). Using
[Azure Private Link](creating-using-private-azure.md), [AWS PrivateLink](creating-using-private-aws.md), or
[Google Cloud Private Service Connect](creating-using-private-gcp.md) incurs an additional billing charge. Review these
topics for more information:

* [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md)
* [Manage private connectivity endpoints: Azure](../../user-guide/private-manage-endpoints-azure.md)
* [Manage private connectivity endpoints: AWS](../../user-guide/private-manage-endpoints-aws.md)
* [Manage private connectivity endpoints: Google Cloud](../../user-guide/private-manage-endpoints-gcp.md)

Next, configure the access to the external network location to use private connectivity as shown in one of the following topics:

* [External network access and private connectivity on Microsoft Azure](creating-using-private-azure.md).
* [External network access and private connectivity on AWS](creating-using-private-aws.md).
* [External network access and private connectivity on Google Cloud](creating-using-private-gcp.md).

## Creating a network rule to represent the external network location

You can use the [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) command to create a network rule that represents the external network’s
location and restrictions for access. For example, a network rule specifies network identifiers such as a hostname and the direction of
communication with the network (ingress or egress).

To support access to an external network, an administrator will include the rule when creating an
external access integration. Each rule included in the
integration specifies an external network location that the function or procedure is allowed to access.

When creating a network rule for use in an external access integration, you specify the following:

* EGRESS as the MODE parameter value.
* A TYPE parameter value that indicates the type of network, such as HOST_PORT or PRIVATE_HOST_PORT.
* The external location’s endpoint in the VALUE_LIST parameter.
* (Optional) A port number with the external location’s endpoint name. If you omit a port number, Snowflake will use the default port number for
  external access, 443.

  For example, if the endpoint requires port 80, the VALUE_LIST parameter might be as follows:

  ```sqlexample
  VALUE_LIST = ('example.com:80')
  ```

### Access control

For security, Snowflake requires that when creating a network rule, you must use a role that has the following:

* The CREATE NETWORK RULE privilege on the schema that will hold the rule.

### Example

Code in the following example creates a network rule called `google_apis_network_rule` for outbound requests to the Google
Translation API.

For more examples, see [External network access examples](external-network-access-examples.md).

```sqlexample
CREATE OR REPLACE NETWORK RULE google_apis_network_rule
  MODE = EGRESS
  TYPE = HOST_PORT
  VALUE_LIST = ('translation.googleapis.com');
```

## Creating a secret to represent credentials

You can use [CREATE SECRET](../../sql-reference/sql/create-secret.md) to create a secret that represents credentials required to authenticate with the
external network location. For example, the secret can contain credentials such as a username and password or a
[security integration](../../sql-reference/sql/create-security-integration.md).

For access to an external network location that supports OAuth, a best practice is have your secret contain a reference to a
[security integration](../../sql-reference/sql/create-security-integration.md) that contains values needed for OAuth flow such as a client ID,
client secret, token endpoint, and so on.

The secret will be used in the following ways:

* By an administrator when creating the external access integration.

  When creating the integration, the administrator will specify the secrets that developers may use in handler code when creating a
  function or procedure that uses the integration.
* By a developer when creating a UDF or procedure handler.

  The developer will specify the allowed secret that contains the credentials that handler code can use to authenticate when making a request
  to the external location. When writing a handler, a developer can use a Snowflake API to retrieve credentials contained by the secret
  rather than including the credentials as literal values in handler code.

> **Note:**
>
> For an OAuth secret that requires a refresh token, you can obtain the token in multiple ways, including through system functions available
> in Snowflake. For an example, see [Accessing the Google Translate API with OAuth](external-network-access-examples.md).

### Access control

For security, Snowflake requires that when creating a secret, you must use a role that has the following:

* The CREATE SECRET privilege on the schema that will hold the secret.

### Example

Code in the following example creates a secret called `oauth_token` that specifies a security integration (represented by
`google_translate_oauth`) containing values needed to authenticate using OAuth.

For a more complete example, including the code for creating the security integration, refer to
[External network access examples](external-network-access-examples.md).

```sqlexample
CREATE OR REPLACE SECRET oauth_token
  TYPE = OAUTH2
  API_AUTHENTICATION = google_translate_oauth
  OAUTH_REFRESH_TOKEN = 'my-refresh-token';
```

> **Tip:**
>
> In this preview, you can specify the `TYPE` as `GENERIC_STRING` when you want to use an API key only as credentials.
>
> ```sqlexample
> CREATE OR REPLACE SECRET bp_maps_api
>   TYPE = GENERIC_STRING
>   SECRET_STRING = 'replace-with-your-api-key';
> ```

## Creating an external access integration

You can use the [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md) command to create an external access integration that aggregates allowed network
rules (representing external network locations) and allowed secrets (representing credentials for authenticating) for use with UDFs
and procedures.

In particular, the external access integration specifies those network rules and secrets that UDFs and procedures referencing the
integration may use.

The external access integration will be used by an administrator to manage access to external network locations from UDFs and procedures.
The integration specifies only those locations and credentials allowed for use by UDFs and procedures that reference the integration.
An administrator can also enable or disable the integration to manage access to external locations.

### Access control

For security, Snowflake requires that when creating an external access integration, you must use a role that has the following:

* The CREATE INTEGRATION privilege on the account.
* The USAGE privilege on any secret the integration uses, as well as the USAGE privilege on the secret’s schema.

### Example

Code in the following example creates an external access integration called `google_apis_access_integration`. The integration specifies
the `google_apis_network_rule` network rule (representing the network location) and the `oauth_token` secret
(representing credentials).

For more information about this rule and secret, refer to Creating a network rule to represent the external network location and
Creating a secret to represent credentials.

```sqlexample
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION google_apis_access_integration
  ALLOWED_NETWORK_RULES = (google_apis_network_rule)
  ALLOWED_AUTHENTICATION_SECRETS = (oauth_token)
  ENABLED = true;
```

## Using the external access integration in a function or procedure

When using the [CREATE FUNCTION](../../sql-reference/sql/create-function.md) or [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) command to create a UDF or
procedure, you can enable access to external network locations as follows:

* Include the EXTERNAL_ACCESS_INTEGRATIONS parameter, setting its value to one or more integrations.

  Each integration you specify allows access to the external network locations and secrets the integration specifies.
* Include the SECRETS parameter, setting its value to one or more secrets and the names you’ll use to access them from handler code.

  The secrets you specify as values must also be specified in the external access integration.
* In handler code, access the secret to retrieve credentials for authenticating with the external network location.

> **Note:**
>
> Always use a Snowflake secret to represent credentials rather than including the credentials as literal values in code. In addition to
> protecting credentials, using a secret makes it possible to audit and manage use of the credentials because only those granted the
> READ privilege on the secret may use an integration containing it in a UDF or procedure.

Snowflake limits the total number of connections that can be made from a particular UDF. To avoid running into resource exhaustion issues,
reuse connections as much as possible. You can achieve this by creating the TCP client or session once during the UDF initialization,
then using it in the UDF handler for the rest of the query.

### Access control

For security, Snowflake requires that when creating a UDF or procedure, you must use a role that has the following:

* The READ privilege on any secret it references, as well the USAGE privilege on the secret’s schema.
* The USAGE privilege on any integration it references.

Requiring these privileges enables an administrator to manage the set of users who can enable external access. For more information, refer
to [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) and [Access control privileges](../../user-guide/security-access-control-privileges.md).

### Example

Code in the following example creates a UDF called `google_translate_python`, specifying an external access integration called
`google_apis_access_integration` (refer to Creating an external access integration for details).
The integration specifies a network rule (representing an external network location) and secret (representing credentials) that a UDF
referencing the integration is allowed to use. For more information about this rule and secret, refer to
Creating a network rule to represent the external network location and Creating a secret to represent credentials.

The Python handler code uses the `_snowflake.get_oauth_access_token` function to retrieve the OAuth token from the secret, then uses
the token to authenticate with the external location. The handler code may make a request to the specified URL because that URL’s host is
listed in the network rule specified by the integration.

```sqlexample
CREATE OR REPLACE FUNCTION google_translate_python(sentence STRING, language STRING)
RETURNS STRING
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
HANDLER = 'get_translation'
EXTERNAL_ACCESS_INTEGRATIONS = (google_apis_access_integration)
PACKAGES = ('snowflake-snowpark-python','requests')
SECRETS = ('cred' = oauth_token )
AS
$$
import _snowflake
import requests
import json
session = requests.Session()
def get_translation(sentence, language):
  token = _snowflake.get_oauth_access_token('cred')
  url = "https://translation.googleapis.com/language/translate/v2"
  data = {'q': sentence,'target': language}
  response = session.post(url, json = data, headers = {"Authorization": "Bearer " + token})
  return response.json()['data']['translations'][0]['translatedText']
$$;
```

---
title: Creating Python UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-creating.md
section: Developer Guide
---

# Creating Python UDFs

This topic shows how to create and install a Python UDF (user-defined function).

## Writing the Python code

### Writing the Python module and function

Write a module that follows the specifications below:

* Define the module. A module is a file containing Python definitions and statements.
* Define a function inside the module.
* If the function accepts arguments, each argument must be one of the data types specified in the `Python Data Type` column of the
  [SQL-Python Type Mappings table](../../udf-stored-procedure-data-type-mapping.md).

  Function arguments are bound by position, not name. The first argument passed to the
  UDF is the first argument received by the Python function.
* Specify an appropriate return value. Because a Python UDF must be a scalar function, it must return one value each
  time that it is invoked. The type of the return value must be one of the data types specified in the
  `Python Data Type` column of the [SQL-Python Type Mappings table](../../udf-stored-procedure-data-type-mapping.md).
  The type of the return value must be
  compatible with the SQL data type specified in the `RETURNS` clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement.
* Your module can contain more than one function. The function that is called by Snowflake can call other functions in the same module, or in
  other modules.
* Your function (and any functions called by your function) must comply with the
  [Snowflake-imposed constraints for Python UDFs](udf-python-designing.md).

> **Note:**
>
> Vectorized Python UDFs let you define Python functions that receive batches of input rows
> as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and
> return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
> or [Series](https://pandas.pydata.org/docs/reference/series.html). For more information, see [Vectorized Python UDFs](udf-python-batch.md).

## Creating the function in Snowflake

You must execute a [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement to specify:

* The name to use for the UDF.
* The name of the Python function to call when the Python UDF is called.

The name of the UDF does not need to match the name of the handler function written in Python. The HANDLER clause in the CREATE
FUNCTION statement associates the UDF name with the Python function.

When choosing a name for the UDF, refer to [Naming and overloading procedures and UDFs](../../udf-stored-procedure-naming-conventions.md).

Within the body of the CREATE FUNCTION statement, function arguments are bound by position, not name. The first argument declared
in the CREATE FUNCTION statement is the first argument passed to the Python function.

For information about the data types of arguments, see [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

Set `runtime_version` to the version of the Python runtime that your code requires. The supported versions of Python are:

> Generally available versions:
>
> * 3.9 (deprecated)
> * 3.10
> * 3.11
> * 3.12
> * 3.13

## UDFs with in-line code vs. UDFs with code uploaded from a stage

The code for a Python UDF can be specified either of the following ways:

* Uploaded from a stage: The CREATE FUNCTION statement specifies the location of an existing Python source
  code in a [stage](../../../sql-reference/sql/create-stage.md).
* In-line: The CREATE FUNCTION statement specifies the Python source code.

### Creating an in-line Python UDF

For an in-line UDF, you supply the Python source code as part of the CREATE FUNCTION statement.

For example, the following statement creates an in-line Python UDF that adds one to a given integer:

```sqlexample-python
CREATE OR REPLACE FUNCTION addone(i INT)
  RETURNS INT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  HANDLER = 'addone_py'
AS $$
def addone_py(i):
 return i+1
$$;
```

The Python source code is specified in the `AS` clause. The source code can be surrounded by either single quotes or by a pair of
dollar signs (`$$`). Using the double dollar signs is usually easier if the source code contains embedded single quotes.

Call the UDF:

```sqlexample
SELECT addone(10);
```

Here is the output:

```output
+------------+
| ADDONE(10) |
|------------|
|         11 |
+------------+
```

The Python source code can contain more than one module, and more than one function in a module, so the `HANDLER` clause specifies
the module and function to call.

An in-line Python UDF can call code in modules that are included in the `IMPORTS` clause.

For more details about the syntax of the CREATE FUNCTION statement, see [CREATE FUNCTION](../../../sql-reference/sql/create-function.md).

For more examples, see [in-line Python UDF examples](udf-python-examples.md).

### Creating a Python UDF with code uploaded from a stage

The following statements create a simple Python UDF using code uploaded from a [stage](../../../sql-reference/sql/create-stage.md).
The stage hosting the file must be readable by the owner
of the UDF. Also, ZIP files must be self-contained and not rely on any additional setup scripts to be executed.

Create a Python file named `sleepy.py` that contains your source code:

```python
def snore(n):   # return a series of n snores
  result = []
  for a in range(n):
    result.append("Zzz")
  return result
```

Launch the [SnowSQL (CLI client)](../../../user-guide/snowsql.md) and use the [PUT](../../../sql-reference/sql/put.md) command to copy the file from
the local file system to the default user stage, named `@~`. (The `PUT` command cannot be executed through the Snowflake GUI.)

```sqlexample
put
file:///Users/Me/sleepy.py
@~/
auto_compress = false
overwrite = true
;
```

If you delete or rename the file, you can no longer call the UDF.
If you need to update your file, then update it while no calls to the UDF can be made.
If the old file is still in the stage, the `PUT` command should include the clause `OVERWRITE=TRUE`.

Create the UDF. The handler specifies the module and the function.

```sqlexample
CREATE OR REPLACE FUNCTION dream(i INT)
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  HANDLER = 'sleepy.snore'
  IMPORTS = ('@~/sleepy.py')
```

Call the UDF:

```sqlexample
SELECT dream(3);
```

```output
+----------+
| DREAM(3) |
|----------|
| [        |
|   "Zzz", |
|   "Zzz", |
|   "Zzz"  |
| ]        |
+----------+
```

#### Specifying multiple import files

Here is an example of how to specify multiple import files.

```sqlexample-python
CREATE OR REPLACE FUNCTION multiple_import_files(s STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  IMPORTS = ('@python_udf_dep/bar/python_imports_a.zip', '@python_udf_dep/foo/python_imports_b.zip')
  HANDLER = 'compute'
AS $$
def compute(s):
  return s
$$;
```

> **Note:**
>
> The import file names specified must be different.
> For example, this will not work:
> `imports=('@python_udf_dep/bar/python_imports.zip', '@python_udf_dep/foo/python_imports.zip')`.

## Granting privileges on the function

For any role other than the owner of the function to call the function, the owner must grant the appropriate
privileges to the role.

The [GRANT](../../../sql-reference/sql/grant-privilege.md) statements for a Python UDF are essentially identical to
the GRANT statements for other UDFs, such as JavaScript UDFs.

For example:

```sqlexample
GRANT USAGE ON FUNCTION my_python_udf(number, number) TO my_role;
```

---
title: Data Type Mappings Between SQL and Handler Languages
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-data-type-mapping.md
section: Developer Guide
---

# Data Type Mappings Between SQL and Handler Languages

A stored procedure or user-defined function you write is called from SQL, and so receives and returns values in SQL data types. However,
its underlying handler will use data types from the handler’s language, such as Java, Python, or Scala. At runtime, Snowflake converts
between the SQL types and handler types for arguments and return values.

Note that Snowflake makes these conversions the following cases as well:

* When dynamically constructing a SQL statement that uses a value in a handler variable.
* When binding a handler variable’s value to a prepared statement.

This topic describes valid mappings between SQL data and types and those from the supported handler languages. Use this content to choose
data types when writing a handler.

For information about Snowflake SQL data types, see [Summary of data types](../sql-reference/intro-summary-data-types.md).

## SQL-Java Data Type Mappings

The table below shows the type mappings between SQL and Java. These mappings generally apply to both the arguments
passed to the procedure or function and the values returned from it. However, there are some exceptions, which are listed
in footnotes.

Note that some SQL data types (e.g. NUMBER) are compatible with multiple Java data types (e.g. `int`, `long`, etc.). In these cases,
you can use any Java data type that has enough capacity to hold the actual values that will be passed. If you
pass a SQL value to an incompatible Java data type (or vice versa), Snowflake throws an error.

| SQL Type | Java Type | Notes |
| --- | --- | --- |
| ARRAY | `String[]` | Formats the elements of the array as strings. |
| ARRAY | `String` | Formats the array as a JSON string (e.g. `[1, "foo", null]`). |
| BINARY | `byte[]` |  |
| BINARY | `String` | Encodes the binary string in hexadecimal. [4] |
| BINARY | `InputStream` | Exposes the BINARY value as a sequence of bytes. |
| BOOLEAN | `boolean` | Cannot be null. |
| BOOLEAN | `Boolean` |  |
| BOOLEAN | `String` | [4] |
| DATE | `java.sql.Date` |  |
| DATE | `String` | Formats the date as `YYYY-MM-DD`. [4] |
| FLOAT | `double` | Cannot be null. |
| FLOAT | `Double` |  |
| FLOAT | `float` | Cannot be null. Might result in precision loss. |
| FLOAT | `Float` | Might result in precision loss. |
| FLOAT | `String` | Might result in precision loss (float -> string conversion is lossy). |
| GEOGRAPHY | `String` | Formats the geography as [GeoJSON](https://tools.ietf.org/html/rfc7946). |
| GEOGRAPHY | [Geography](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Geography.html) | [5] |
| MAP | `Map<String, String>` | The output format is MAP(VARCHAR, VARCHAR). |
| NUMBER | `short` | Cannot be null. Must fit in the range of short (no fractional part, and integer part cannot exceed the max/min short values). |
| NUMBER | `Short` | Must fit in the range of short (no fractional part, and integer part cannot exceed the max/min short values). |
| NUMBER | `int` | Cannot be null. Must fit in the range of int (no fractional part, and integer part cannot exceed the max/min int values). |
| NUMBER | `Integer` | Must fit in the range of int (no fractional part, and integer part cannot exceed the max/min int values). |
| NUMBER | `long` | Cannot be null. Must fit in the range of long (no fractional part, and integer part cannot exceed the max/min long values). |
| NUMBER | `Long` | Must fit in the range of long (no fractional part, and integer part cannot exceed the max/min long values). |
| NUMBER | `java.math.BigDecimal` |  |
| NUMBER | `java.math.BigInteger` | Must fit into the range of BigInteger (no fractional part). |
| NUMBER | `String` |  |
| OBJECT | `Map<String, String>` | The map’s keys are the object’s keys, and the values are formatted as strings. |
| OBJECT | `String` | Formats the object as a JSON string (e.g. `{"x": 3, "y": true}`). |
| TIME | `java.sql.Time` | [3] |
| TIME | `String` | Formats the time as `HH:MI:SS.SSSSSSSSS` where the fractional seconds part depends on the precision of the time. [3] |
| TIMESTAMP_LTZ | `java.sql.Timestamp` | Must fit in the range of java.sql.Timestamp. [3] |
| TIMESTAMP_LTZ | `String` | The output format is `DY, DD MON YYYY HH24:MI:SS TZHTZM`. [1] , [3] , [4] |
| TIMESTAMP_NTZ | `java.sql.Timestamp` | Must fit in the range of java.sql.Timestamp. Treats the wallclock time as an offset from the Unix epoch (imposing a UTC time zone, effectively). [3] |
| TIMESTAMP_NTZ | `String` | Treats the wallclock time as an offset from the Unix epoch (imposing a UTC time zone, effectively). The output format is `DY, DD MON YYYY HH:MI:SS`. [2] , [3] , [4] |
| TIMESTAMP_TZ | `java.sql.Timestamp` | Must fit in the range of java.sql.Timestamp. [3] |
| TIMESTAMP_TZ | `String` | The output format is `DY, DD MON YYYY HH24:MI:SS TZHTZM`. [1] , [3] , [4] |
| VARCHAR | `String` |  |
| VARIANT | [Variant](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Variant.html) | The [Variant](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Variant.html) data type is a class in the Snowpark package. For more information, see Snowpark Package Types Supported for User-Defined Functions. For an example, see [Passing a VARIANT value to an in-line Java UDF](udf/java/udf-java-cookbook.md). |

[1]
(1,2)

The format matches the Internet (RFC) Timestamp Format `DY, DD MON YYYY HH24:MI:SS TZHTZM` as described in [Timestamp formats](../sql-reference/date-time-input-output.md). If a timezone offset (the `TZHTZM` component) is present, it is typically digits (e.g. `-0700` indicates 7 hours behind UTC). If the timezone offset is `Z` (for “Zulu”) rather than digits, that is synonymous with “+0000” (UTC).

[2]

The format matches the Internet (RFC) Timestamp Format `DY, DD MON YYYY HH24:MI:SS` as described in [Timestamp formats](../sql-reference/date-time-input-output.md). If the string is followed by a space and `Z` (for “Zulu”), that explicitly indicates that the offset is “+0000” (UTC).

[3]
(1,2,3,4,5,6,7,8)

Although Snowflake can store time values with nanosecond precision, the java.sql.time library maintains only millisecond precision. Conversion between Snowflake and Java data types can reduce effective precision to milliseconds.

[4]
(1,2,3,4,5,6)

This type mapping is supported when converting SQL arguments to Java, but not when converting Java return types to SQL types.

[5]

Java does not have a native Geography data type. The [Geography](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Geography.html) data type referred to here is a class in the Snowpark package. For more information, see Snowpark Package Types Supported for User-Defined Functions.

### Arrays

Java UDFs can receive arrays of any of the following Java data types:

| Data Type | Notes |
| --- | --- |
| `String` |  |
| `boolean` | The Snowflake ARRAY must contain only BOOLEAN elements and must not contain any NULL values. |
| `double`  `float` | The Snowflake ARRAY must contain either of the following, and must not contain any NULL values.   * [FLOAT](../sql-reference/data-types-numeric.md) elements. * [Fixed-point](../sql-reference/data-types-numeric.md) elements (with any scale). |
| `int`  `long`  `short` | The Snowflake ARRAY must contain only [fixed-point](../sql-reference/data-types-numeric.md) elements with a scale of 0, and must not contain any NULL values. |

### NULL Values

Snowflake supports two distinct NULL values: SQL `NULL` and VARIANT’s JSON `null`. (For information about Snowflake
VARIANT NULL, see [NULL values](../user-guide/semistructured-considerations.md).)

Java supports one `null` value, which is only for non-primitive data types.

A SQL `NULL` argument to a Java handler translates to the Java `null` value, but only for Java data types that
support `null`.

A returned Java `null` value translates back to SQL `NULL`.

### TIMESTAMP_LTZ Values and Time Zones

A Java UDF is largely isolated from the environment in which it is called. However, the timezone is inherited from
the calling environment. If the caller’s session set a default time zone before calling the Java UDF, then the Java
UDF has the same default time zone. Java UDF uses the same [IANA Time Zone Database](https://www.iana.org/time-zones) data as the native [TIMEZONE](../sql-reference/parameters.md)
Snowflake SQL uses (i.e. data from release 2025b of the Time Zone Database).

### Snowpark Package Types Supported for User-Defined Functions

In a user-defined function, you can use a specific subset of types that are included in the Snowflake
[Snowpark Java package](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html). Although these types are
designed for use in Snowpark code, a few are also supported for use in UDFs for the convenience they can provide. (For more about
Snowpark, see the [Snowpark documentation](snowpark/index.md).)

> **Note:**
>
> The Snowpark library is a requirement for stored procedures written in Java, Python, and Scala. As a result, you can use Snowpark types
> there without restriction.

Snowpark types in the following table are supported in UDF code. You should not use other Snowpark types in UDF code; they are not
supported there.

| Snowpark Type | Snowpark Version Required | Description |
| --- | --- | --- |
| [Geography](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Geography.html) | 1.2.0 and later | Represents the Snowflake [GEOGRAPHY](../sql-reference/data-types-geospatial.md) type. For an example that uses the `Geography` data type, see [Passing a GEOGRAPHY value to an in-line Java UDF](udf/java/udf-java-cookbook.md). |
| [Variant](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Variant.html) | 1.4.0 and later | Represents Snowflake [VARIANT](../sql-reference/data-types-semistructured.md) data. For an example that uses the `Variant` data type, see [Passing a VARIANT value to an in-line Java UDF](udf/java/udf-java-cookbook.md). |

#### Specifying the Snowpark Package as a Dependency

When developing UDF code that uses the Snowpark package, you’ll need to set up your development environment so that you can compile and
run code with Snowpark dependencies. For more, see [Setting Up Other Development Environments for Snowpark Java](snowpark/java/setup-other-environments.md).

When deploying a UDF by executing the [CREATE FUNCTION](../sql-reference/sql/create-function.md) statement, you can specify the Snowpark
package as a dependency without uploading the JAR file to a stage (the library is already in Snowflake). To do this, specify the package
name and version in the `PACKAGES` clause. For a syntax example, see [Passing a GEOGRAPHY value to an in-line Java UDF](udf/java/udf-java-cookbook.md).

## SQL-JavaScript Data Type Mappings

The following table shows the Snowflake SQL data types and the corresponding JavaScript data types:

| SQL Data Type | JavaScript Data Type | Notes |
| --- | --- | --- |
| ARRAY | `JSON` |  |
| BOOLEAN | `number` | The values `true` and `false` are represented by `1` and `0` respectively. Note that this behavior may change in future releases, so you should rely on JavaScript truthiness rather than direct value comparisons. |
| DATE | `date` |  |
| GEOGRAPHY, GEOMETRY | `JSON` |  |
| REAL, FLOAT, FLOAT8, FLOAT4, DOUBLE, DOUBLE PRECISION | `number` |  |
| TIME | `string` |  |
| TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ | `date` or `SfDate` | When a timestamp is passed as an argument to a stored procedure, the timestamp is converted to a JavaScript `date` object. In other situations (e.g. when retrieving from `ResultSet`), a timestamp is converted to an `SfDate` object. For more details about the `SfDate` data type, which is not a standard JavaScript data type, see the [JavaScript stored procedures API](stored-procedure/stored-procedures-api.md). |
| VARCHAR, CHAR, CHARACTER, STRING, TEXT | `string` |  |
| VARIANT | `JSON` |  |

### Notes

Not all Snowflake SQL data types have a corresponding JavaScript data type. For example, JavaScript does not
directly support the INTEGER or NUMBER data types. In these cases, you should convert the SQL data type to an
appropriate alternative data type. For example, you can convert a SQL INTEGER into a SQL FLOAT, which can then be
converted to a JavaScript value of data type `number`.

The table below shows appropriate conversions for the incompatible SQL data types:

| Incompatible SQL Data Type | Compatible SQL Data Type |
| --- | --- |
| BINARY | Uint8Array |
| INTEGER | FLOAT |
| NUMBER, NUMERIC, DECIMAL | FLOAT |
| OBJECT | Uint8Array |

#### When Returning Values

If the `return`
statement in the JavaScript returns a data type different from the stored procedure’s declared return type,
the JavaScript value is cast to the SQL data type if possible. For example, if a number is returned, but the
stored procedure is declared as returning a string, the number is converted to a string within JavaScript, and
then copied to the string returned in the SQL statement. (Keep in mind that some JavaScript programming errors, such as
returning the wrong data type, can be hidden by this behavior.)

If no valid cast for the conversion exists, then an error occurs.

#### When Binding Values

When you bind JavaScript variables to SQL statements, Snowflake converts from the JavaScript data types to
the SQL data types. You can bind variables of the following JavaScript data types:

* number
* string
* SfDate

  For more details about the `SfDate` data type, which is not a standard JavaScript data type, see
  the [JavaScript stored procedures API](stored-procedure/stored-procedures-api.md).

For more information about binding, including some examples, see [Binding variables](stored-procedure/stored-procedures-javascript.md).

You might also find the following topics helpful:

* [JavaScript data types](udf/javascript/udf-javascript-introduction.md)
* [JavaScript arguments and returned values](udf/javascript/udf-javascript-introduction.md)

## SQL-Python Data Type Mappings

The table below shows the type mappings between SQL and Python. These mappings generally apply to both the arguments
passed to the Python handler and the values returned from it.

| SQL Type | Python Type | Notes |
| --- | --- | --- |
| ARRAY | `list` | When a Python data type is converted to ARRAY, if there is any embedded Python decimal data, the embedded Python decimal will be converted to a String in the ARRAY. |
| BINARY | `bytes` |  |
| BOOLEAN | `bool` |  |
| DATE | `datetime.date` |  |
| FLOAT | `float` | Floating point operations can have small rounding errors, which can accumulate, especially when aggregate functions process large numbers of rows. Rounding errors can vary each time a query is executed if the rows are processed in a different order. For more information, see [Numeric Data Types: Float](../sql-reference/data-types-numeric.md). |
| GEOGRAPHY, GEOMETRY | `dict` | Formats the geography as [GeoJSON](https://tools.ietf.org/html/rfc7946) and then converts it to a Python dict. |
| MAP | `dict` | MAP is not supported as a return type. |
| NUMBER | `int` or `decimal.Decimal` | If the scale of the NUMBER type is 0 then the int Python type is used. Otherwise decimal.Decimal type is used. |
| OBJECT | `dict` | When a Python data type is converted to OBJECT, if there is any embedded Python decimal data, the embedded Python decimal will be converted to a String in the OBJECT. |
| TIME | `datetime.time` | Although Snowflake can store time values with nanosecond precision, the Python datetime.time type maintains only millisecond precision. Conversion between Snowflake and Python data types can reduce effective precision to milliseconds. |
| TIMESTAMP_LTZ | `datetime.datetime` | Use local timezone to convert internal UTC time to local “naive” datetime. Requires “naive” datetime as return type. |
| TIMESTAMP_NTZ | `datetime.datetime` | Directly convert to “naive” datetime. Requires “naive” datetime as return type. |
| TIMESTAMP_TZ | `datetime.datetime` | Convert to “aware” datetime with timezone information. Requires “aware” datetime as return type. |
| VARCHAR | `str` |  |
| VARIANT | `dict`, `list`, `int`, `float`, `str`, or `bool` | Each variant row is converted to a Python type dynamically for arguments and vice versa for return values. The following types are converted to strings instead of native Python types: decimal, binary, date, time, timestamp_ltz, timestamp_ntz, timestamp_tz. When a Python data type is converted to VARIANT and Python decimal data is embedded, the embedded Python decimal is converted to a string in the VARIANT. |
| VECTOR | `memoryview` |  |

## SQL-Scala Data Type Mappings

Snowflake supports the following Scala data types in addition to the Java types listed in SQL-Java Data Type Mappings:

| SQL Data Type | Scala Type | Notes |
| --- | --- | --- |
| ARRAY | `Array[String]` |  |
| BINARY | `Array[Byte]` |  |
| BOOLEAN | `Boolean` or `Option[Boolean]` |  |
| DOUBLE | `Double` or `Option[Double]` |  |
| FLOAT | `Float` or `Option[Float]` |  |
| MAP | `Map[String, String]` | The output format is MAP(VARCHAR, VARCHAR). |
| NUMBER | The following types are supported:   * `Int` or `Option[Int]` * `Long` or `Option[Long]` |  |
| OBJECT | `Map[String, String]` |  |
| VARCHAR | `String` |  |
| VARIANT | `String` | Formats the value depending on the type that is represented. [Variant null](../user-guide/semistructured-considerations.md) is formatted as the string “null”. |

For [DATE](../sql-reference/data-types-datetime.md) and [TIMESTAMP](../sql-reference/data-types-datetime.md), use the Java types listed in
SQL-Java Data Type Mappings.

---
title: Declarative App Consumer-Side Execution Model
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/consumer/consumer-execution.md
section: Developer Guide
---

# Declarative App Consumer-Side Execution Model

When you install a Declarative Native App as a consumer, the Native App Framework isolates the app’s data access and code execution in a controlled environment, preventing the app from accessing data or otherwise affecting resources in the consumer account.

The Native App Framework enforces sandbox-style security boundaries for Declarative Native Apps, so that the app can only
access data included in the app package. This ensures that the app cannot access any other data, code resources, or system resources in the consumer account, providing a secure environment for running the app, and protecting the consumer’s assets.

The security boundaries for Declarative Native Apps are more restrictive than those for Native App Framework apps, which can access additional resources in the consumer account when given the appropriate permissions.

## Embedded code objects in Declarative Native Apps

The only code objects currently supported for Declarative Native Apps are Snowflake Notebooks.
Currently, Declarative Native Apps can’t use other types of code resources for logic, such as Streamlits, stored procedures, or UDFs. The embedded code objects in a Declarative Native App can only do the following:

* Access data or code objects from inside the app package.
* Run queries, visualizations, and functions on the tables and views exposed by the app package. The app has SELECT access to these tables, views, and functions.

The embedded logic cannot do the following:

* Access any other data products in the consumer account.
* Access any other logic in the consumer account.
* Access metadata about other data products installed in the consumer account.
* Access any system resources in the consumer account, such as system tables or views. For example, running `SHOW DATABASES` or `SHOW TABLES` returns only the databases and tables that are part of the app package. Other databases and tables in the consumer account are not visible to the app.
* Change system parameters or settings in the consumer account. For example, changing the warehouse size or modifying user roles.
* Create resources or external integrations in the consumer account, such as creating new warehouses, databases, tables, or views that are not part of the app package.

> **Note:**
>
> The app uses the current user account’s default warehouse. For information about creating a warehouse, see [CREATE WAREHOUSE](../../../sql-reference/sql/create-warehouse.md). For information about setting a default warehouse for a user account, see [ALTER USER](../../../sql-reference/sql/alter-user.md).

---
title: Declarative Native App command reference
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/command-reference.md
section: Developer Guide
---

# Declarative Native App command reference

The following commands include new parameters to support creating and publishing application packages:

* CREATE APPLICATION PACKAGE
* ALTER APPLICATION PACKAGE
* [GRANT IMPORTED PRIVILEGES ON APPLICATION](consumer/install.md)

## CREATE APPLICATION PACKAGE

The [CREATE APPLICATION PACKAGE](../../sql-reference/sql/create-application-package.md)
command supports a new optional parameter, `TYPE = DATA`, which specifies that the app will be a Declarative Native App.

### Syntax

```sqlsyntax
CREATE APPLICATION PACKAGE [ IF NOT EXISTS ] <name> TYPE = DATA
```

New optional parameter:

`TYPE = [ DATA | NATIVE ]`
:   Specifies which type of application package to create:

    * `DATA`: indicates that the application package will contain a Declarative Native App.
    * `NATIVE`: indicates that the application package will contain a Snowflake Native App. This is the default value.

    After you specify an application package type, you cannot use ALTER APPLICATION PACKAGE to change the type later.

    When `TYPE = DATA` is specified, the other parameters in this command, such as DATA_RETENTION_TIME_IN_DAYS and COMMENT, are not supported.

    This parameter requires a [role](../../user-guide/security-access-control-overview.md) with the CREATE APPLICATION PACKAGE and CREATE DATABASE [privileges](../../user-guide/security-access-control-overview.md).

    The creator of the application package is automatically granted the OWNERSHIP privilege on that application package.

## ALTER APPLICATION PACKAGE

The [ALTER APPLICATION PACKAGE](../../sql-reference/sql/alter-application-package.md) command
supports the following new optional parameters to support creating and publishing Declarative Native Apps. These new parameters are not supported for Snowflake Native Apps.

### Syntax

```sqlsyntax
ALTER APPLICATION PACKAGE <name>
[ ADD LIVE VERSION
| ADD VERSION FROM @STAGE/path
| BUILD
| COMMIT
| RELEASE [LIVE VERSION]
| ABORT LIVE VERSION ]
[COMMENT = 'string_literal']
```

### New optional parameters

`ADD LIVE VERSION`
:   Create a live version of the application package that can be edited. This live version is used to add or update files, such as the manifest file and notebook files.

`ADD VERSION FROM @<STAGE>/<path>`
:   Creates a live version of the application package based on files from a [stage](../../user-guide/data-load-local-file-system-stage.md). This method is useful if you have a set of files that you want to include in the application package, and you want to add them all at once.

> **Note:**
>
> If you iterate on the files after creating the live version, you’ll need to make the same changes to the files on the stage to keep future versions consistent.

`BUILD`
:   Builds the app, but doesn’t commit it. Use this command to validate the manifest
    file and to continue working on the application package.

`COMMIT`
:   Builds the app, commits it for publishing, but doesn’t release it.

    The commit process prepares the application package for publishing by adding an
    internal version number, and makes the application package immutable.

`RELEASE`
:   Releases a committed version of the app to the Snowflake Marketplace.

`RELEASE LIVE VERSION`
:   Builds the app, commits it for publishing, and releases it to the Snowflake Marketplace.

    Equivalent to running the BUILD, COMMIT, and RELEASE commands in sequence.

`ABORT LIVE VERSION`
:   Removes the LIVE version of the application package. Restores the application package to the last committed version.

### Existing parameters

These parameters are supported for both Declarative Native Apps and Snowflake Native Apps.

`<name>`
:   Specifies the identifier for the application package.

    If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`COMMENT = 'string_literal'`

> Optional: Adds a comment or overwrites an existing comment for the app version. This comment is displayed in SHOW APPLICATION PACKAGES.

### Access control requirements

This command requires a role with the OWNERSHIP privilege for the application package.

### Examples

* Create a new application package:

  ```sqlexample
  CREATE APPLICATION PACKAGE market_data_app TYPE = DATA;
  ```
* Create a live version of the application package that can be edited:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app
    ADD LIVE VERSION
    COMMENT = 'Market views for Northern region';
  ```
* Create a new version of the application package from an existing staged application package:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app
    ADD VERSION FROM @my_stage/market_data_app_v1;
  ```
* Build the application package, but don’t commit it:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app BUILD;
  ```
* Build and commit the application package for publishing, but don’t release it:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app
    COMMIT
    COMMENT = 'Market views for North and East regions';
  ```
* Release the application package to the Snowflake Marketplace:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app RELEASE;
  ```
* Build, commit, and release the live version of the application package to the Snowflake Marketplace:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app RELEASE LIVE VERSION
    COMMENT = 'Market views for North, East, and West regions';
  ```
* After adding a live version of the app end and editing it, stop editing and restore to the last committed version:

  ```sqlexample
  ALTER APPLICATION PACKAGE market_data_app ABORT LIVE VERSION
  ```

## GRANT IMPORTED PRIVILEGES ON APPLICATION

The [GRANT IMPORTED PRIVILEGES](../../sql-reference/sql/grant-privilege.md) command supports a new optional parameter, `ON APPLICATION <name>`.

This command allows consumers to grant access to all of the data and views in a Declarative Native App to other members of their organization.

This command can be used on any Declarative Native App, and does not require app roles to be defined for the application package.

### Access control requirements

This command requires a role with the OWNER privilege for the installed app.

### Syntax

```sqlsyntax
GRANT IMPORTED PRIVILEGES ON APPLICATION <name> TO ROLE <role_name>;
```

### Example

```sqlexample
GRANT IMPORTED PRIVILEGES ON APPLICATION market_data_app TO ROLE marketing_team_east;
```

---
title: Declarative Native App manifest reference
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/manifest-reference.md
section: Developer Guide
---

# Declarative Native App manifest reference

Providers create a manifest file as part of a [package](package.md).

The manifest file is a text-based [YAML](https://yaml.org/spec/) file, with the filename: `manifest.yml`. It’s used to declaratively share data and logic with consumers, such as notebooks, user-defined functions, stored procedures, tables, and views.

The manifest file also defines [app roles](app-roles.md), which app owners can use to share a subset of the app’s data and features to teams in their organization teams by role.

For information about developing an application package, see [Application Packages in Declarative Sharing in the Native Application Framework](package.md).

## Declarative Native App manifest

The general format of a Declarative Native App manifest contains:

```yaml
manifest_version: # Added automatically. Don't include.
application_content: # Optional, describes associated app logic
roles: # Optional, describes roles associated with shared_content
shared_content: # Required, describes associated data to be shared
```

## Fields

Declarative Native App manifests include the following fields:

### `manifest_version` field

This field is added automatically to the manifest file when you release a new version of an application package.

Don’t include this field when creating a manifest file to include in an application package. Editing this field manually is not supported.

The `manifest_version` top level field (Integer, required) specifies the version
number of the manifest file.

For more information about versioning, see [Package Versions in Declarative Sharing in the Native Application Framework](versioning.md).

### `application_content` field

The `application_content` field (list, optional) defines bundled content declaratively shared by the app.

This field includes a single `notebooks` field:

* `application_content.notebooks` (List, required): A list of named [notebooks](../../user-guide/ui-snowsight/notebooks.md).

#### `application_content.notebooks.{named notebook}` field

Each named notebook supports the following name value pairs:

* `main_file` (string, required) the path to the interactive Python notebook (.ipynb) file, relative to the root of the package version.
* `comment` (string, optional): A comment describing the notebook.
* `runtime_environment_version` (string, optional): Specifies a particular [runtime environment version](../../user-guide/ui-snowsight/notebooks.md)
  for the notebook execution context, if applicable within the platform.
* `roles` (list, optional): A list of app roles that can grant access to the notebook, for example, `[sales,marketing]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field.

> **Note:**
>
> The `main_file` path is always relative to the root of the package
> version (the `snow://package/<DECL_SHARE_APP_PKG>/versions/<version>` prefix).
> For example, if the full path to the notebook file is
> `snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/NOTEBOOK.ipynb`,
> then specify `main_file` as just `NOTEBOOK.ipynb`.

##### `application_context` example

In this example, a single notebook, **salesbook**, is defined using the
notebook file **NOTEBOOK1.ipynb** which uses the known runtime **stable**
and provides access to those granted either the **sales** or **marketing** roles.

```yaml
application_content:
    notebooks:
        - salesbook:
              roles: [sales, marketing]
              main_file: NOTEBOOK1.ipynb
              comment: Notebook1: Sales and marketing notebook
              runtime_environment_version: stable

roles:
  - sales:
  - marketing:
```

### `roles` field

The `roles` top level field (list, optional) defines a list of [app roles](app-roles.md). These roles allow app owners to provide access to shared objects in an app — such as schemas, tables, views, and notebooks — to their organization.

Each named role can optionally contain a `comment`, which appears as a description when the app owner lists the roles in the application.

These roles are referenced in the manifest by shared objects, at the named `notebook`, `schema`, `table`, `view`, or `semantic_view` level. For objects at the `table`, `view`, or `semantic_view` level, roles must also be specified at the `schema` level.

> **Note:**
>
> * All content in the manifest is accessible to the app owner, the ACCOUNTADMIN, and to roles that are granted [IMPORTED PRIVILEGES](consumer/install.md) to the app.
> * The object name defined in this manifest file is used for the runtime object resolution. If the provider changes the object name without updating the manifest file with a new version, consumers will lose access to the object.

#### `roles` example

```yaml
roles:
  - sales:
  - marketing:

application_content:
  notebooks:
    - salesbook:
        roles: [sales, marketing]
        main_file: NOTEBOOK1.ipynb
        comment: Sales and marketing notebook

shared_content:
  databases:
    - sales:
        schemas:
          - orders:
              roles: [sales, marketing]
              tables:
                - january_2025:        # App owners/assignees only
                - february_2025:
                    roles: [sales]     # Accessible to sales only
                - march_2025:
                    roles: [marketing] # Accessible to marketing only
    - customer_info:
        schemas:
          - customer_contact:
              roles: [customer_support]
              views:
                - customer_address:
                    roles: [customer_support] # Accessible to customer_support
                - customer_details:
                    roles: []                 # App owners/assignees only
```

For more information about roles, see [app roles](app-roles.md).

### `shared_content` field

The `shared_content` field (list, required) defines a list of databases declaratively shared by the app. Each database includes a list of named `schemas`. Each schema can include a list of named entities grouped by type.

This field includes a single `databases` field and an optional `required_databases` field:

* `shared_content.databases` (List, required): A list of named database instances and the underlying objects to share. In the example below, the manifest adds a database named `sales`.

#### `shared_content.databases.{named database}` field

Each named database supports the following name value pairs:

* `schemas` (list, required): A list of schemas within the database.

#### `shared_content.required_databases.{named database}` field

The `required_databases` field (list, optional) defines a list of databases that
are dependencies of the shared databases. These databases are referenced by
views in the shared databases, but are not shared directly. For more information
about managing cross-database dependencies, see [Dependency databases: Managing cross-database references](dependency-databases.md).

When your application shares data from multiple databases, you must explicitly list all
additional databases that are referenced by objects in your shared content under
the `required_databases` field. This ensures that the application can be
deployed successfully in other regions where these databases may not exist by
default.

Including a database in the `required_databases` field is similar to
referencing a database using the REFERENCE_USAGE privilege in traditional
Secure Data Sharing. For information about the REFERENCE_USAGE privilege and how
dependent databases are shared in traditional data sharing, see
[Share data from multiple databases](../../user-guide/data-sharing-multiple-db.md).

#### `schemas.{named schema}` field

Each named schema supports the following name value pairs:

* `tables` (list, [OneOfRequired]): A list of named tables, which can include [dynamic tables](../../user-guide/dynamic-tables-about.md) and [Apache Iceberg tables](../../user-guide/tables-iceberg.md).
* `views` (list, [OneOfRequired]): A list of named views.
* `semantic_views` (list, [OneOfRequired]): A list of named semantic views.
* `functions` (list, [OneOfRequired]): A list of named user-defined functions (UDFs).
* `procedures` (list, [OneOfRequired]): A list of named stored procedures.
* `cortex_agents` (list, [OneOfRequired]): A list of named [Cortex Agents](../../user-guide/snowflake-cortex/cortex-agents.md).
* `roles` (list, optional): A list of app roles that the objects in the schema can use, for example, `[sales,marketing]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field.

[OneOfRequired]
(1,2,3,4,5,6,7,8,9,10,11,12)

at least one of `tables`, `views`, `semantic_views`, `functions`, `procedures`, or `cortex_agents` is required.

> **Important:**
>
> You must enforce schema separation between data objects (objects shared by reference: `tables`, `views`, and `semantic_views`) and logic objects (objects shared by copy: `functions`, `procedures`, and :`cortex_agents`). You can’t mix data and logic objects in the same schema.

#### `tables.{named table}` field

Each named standard table, [dynamic table](../../user-guide/dynamic-tables-about.md), and [Apache Iceberg table](../../user-guide/tables-iceberg.md) (List, required [OneOfRequired] ) supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the table; for example, `[sales]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

> **Note:**
>
> Shared [dynamic tables](../../user-guide/dynamic-tables-about.md) and [Apache Iceberg tables](../../user-guide/tables-iceberg.md) replicated to remote regions are read-only and do not refresh automatically. Data freshness depends on the replication frequency from the source, and underlying source objects do not need to be replicated. For details, see [Replication considerations](../../user-guide/account-replication-considerations.md).

> **Note:**
>
> To allow consumers to create streams on this shared object (for change data capture or incremental loading), you must enable `CHANGE_TRACKING = TRUE` on the source table in your provider account using standard SQL commands (for example, `ALTER TABLE ... SET CHANGE_TRACKING = TRUE`). This setting can’t be changed by the consumer on the shared object; it must be set on the source.

#### `views.{named view}` field

Each named view (List, required [OneOfRequired] ): supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the view; for example, `[marketing]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

> **Note:**
>
> To allow consumers to create streams on this shared object (for change data capture or incremental loading), you must enable `CHANGE_TRACKING = TRUE` on the source table in your provider account using standard SQL commands (for example, `ALTER TABLE ... SET CHANGE_TRACKING = TRUE`). This setting can’t be changed by the consumer on the shared object; it must be set on the source.

#### `semantic_views.{named semantic view}` field

Each named semantic view (List, required [OneOfRequired] ): supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the semantic view; for example, `[sales]`. Note that, when sharing a semantic view, its referenced tables or views must be shared as well. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

#### `functions.{named function}` field

Each named function (List, required [OneOfRequired] ): supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the function; for example, `[analyst]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

#### `procedures.{named procedure}` field

Each named stored procedure (List, required [OneOfRequired] ): supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the procedure; for example, `[analyst]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

#### `cortex_agents.{named cortex agent}` field

Each named [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md) (List, required [OneOfRequired] ): supports the following name value pair:

* `roles` (list, optional): A list of app roles that can access the Cortex Agent; for example, `[app_user]`. When this field is empty (`[]`) or omitted, then only app owners and roles with [granted IMPORTED PRIVILEGES](consumer/install.md) receive access. The included roles must be defined in the top-level roles field and included in the {named schema}.roles field.

#### `shared_content` example

In this example, two databases are exposed: **sales** and **customer_info**.
Within these databases the **orders.[january_2025|february_2025]** tables are exposed
as well as the **customer_contact.customer_address** view.

Two required databases are also exposed: **sales_projections** and **customer_analytics**.
These databases can be referenced by views in the shared databases, but are not shared directly.

```yaml
roles:
  - sales:
  - marketing:

shared_content:
  required_databases:
    sales_projections
    customer_analytics
  databases:
    - sales:
        schemas:
          - orders:
              roles: [sales, marketing]
              tables:
                - january_2025:        # App owners/assignees only
                - february_2025:
                    roles: [sales]     # Accessible to sales only
                - march_2025:
                    roles: [marketing] # Accessible to marketing only
    - customer_info:
        schemas:
          - customer_contact:
              roles: [customer_support]
              views:
                - customer_address:
                    roles: [customer_support] # Accessible to customer_support
                - customer_details:
                    roles: []                 # App owners/assignees only
```

## Manifest file example

The following code block is an example of a Declarative Native App manifest file.

Note that data and code objects must be in different schemas.

```yaml
manifest_version: 2

roles:
  - VIEWER:
      comment: "The VIEWER role provides access to only one view."
  - ANALYST:
      comment: "The ANALYST role provides access to views, the table, and logic."
  - APP_USER:
      comment: "The APP_USER role provides access to the Cortex Agent and the underlying data."

shared_content:
  databases:
    - SNAF_POPULATION_DB:
        schemas:
          - DATA_SCHEMA:
              roles: [VIEWER, ANALYST]
              tables:
                - COUNTRY_POP_BY_YEAR:
                    roles: [ANALYST]
                - POPULATION_DYNAMIC_TABLE:
                    roles: [ANALYST]
                - MANAGED_POPULATION_ICEBERG_TABLE:
                    roles: [ANALYST]
              views:
                - COUNTRY_POP_BY_YEAR_2000:
                    roles: [VIEWER, ANALYST]
          - LOGIC_SCHEMA:
              roles: [ANALYST]
              functions:
                - POPULATION_ANALYSIS_FUNCTION(NUMBER):
                    roles: [ANALYST]
              procedures:
                - POPULATION_ANALYSIS_PROCEDURE():
                    roles: [ANALYST]
          - AGENT_SCHEMA:
              roles: [APP_USER]
              cortex_agents:
                - PRODUCT_AGENT:
                    roles: [APP_USER]
application_content:
  notebooks:
      - intro_notebook:
          roles: [VIEWER, ANALYST]
          main_file: INTRO_NB.ipynb
      - analyst_notebook:
          roles: [ANALYST]
          main_file: ANALYST_NB.ipynb
```

---
title: Declarative Sharing in Native Apps: Limitations
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/limitations.md
section: Developer Guide
---

# Declarative Sharing in Native Apps: Limitations

Declarative sharing is a feature in Snowflake Native Apps that allows providers to quickly define and share objects across multiple
databases using a simple YAML configuration file. While this feature significantly simplifies data sharing workflows, it has limitations
that providers should understand before implementation.

## Supported Object Types

Declarative sharing supports these object types:

* **Notebooks**
* **Tables**, including:

  + Dynamic tables
  + Apache Iceberg tables
* **Views**, including:

  + Semantic views
* **Stored procedures**
* **User-defined functions (UDFs)**
* **Cortex Agents**
* **Streams**

All other object types are not supported for sharing in Declarative Sharing in Native Apps.

## Shared Objects

Object limit
:   A maximum of 1,000 objects can be defined in the shared content section of
    the `manifest.yml` file. To raise this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Streams
:   Requires `CHANGE_TRACKING = TRUE` to be enabled on the provider’s source object.

## Notebook Limitations

Read-only for consumers
:   Consumers can’t edit provider notebooks in place, nor can they clone them.

Network access
:   Notebooks cannot access external endpoints or consumer data when running in customer accounts.

Specialized libraries
:   Geospatial and other 3rd party libraries aren’t guaranteed to work out-of-the-box in notebooks.

External dependencies
:   Declarative Sharing apps have limited support for external libraries (Snowflake Anaconda channel and Python files in code stage.)

Non-interactive execution
:   Notebooks that are part of native applications cannot be executed non-interactively by
    worksheets or SQL commands.

## Security and Access Control

Role definition
:   All application roles referenced in the shared content must be predefined in the `roles` field in the manifest.

Object-level roles
:   Object roles must be subsets of their parent schema roles.

Missing role validation
:   Validating the manifest returns an error if roles referenced in the sharing configuration don’t exist.

Minimum privileges
:   The provider role committing the `shared_content.yaml` file must have at least the same privileges on shared objects as those being granted to consumers.

No REFERENCE_USAGE required
:   Unlike traditional data sharing, providers don’t need to grant REFERENCE_USAGE privileges to the application package.

## Migration & Compatibility

Declarative Sharing migration
:   Migration support for switching from data shares to Declarative Sharing in the Native App framework is unavailable.

## Naming and Configuration Constraints

No wildcards
:   Object names must be explicitly specified; wildcard or regular expression matching is not supported.

Name collision prevention
:   No two shared objects can have the same DOMAIN and name.

Schema mapping
:   Schema mapping is not supported. Overlapping schema names from multiple databases are not allowed.

Schemas for data objects and logic objects
:   You must use separate schemas for data objects (shared by reference: tables and views) and logic objects (shared by copy: UDFs, stored procedures, Cortex Agents). For example, you can use a schema named `DATA_SCHEMA` for tables and views, and a schema named `LOGIC_SCHEMA` for UDFs.

## Monitoring

Auditability
:   Declarative Native Apps don’t provide monitoring resources (such as audit trails) to let the provider receive information from the consumer about how the shared data is being used. If a consumer has compliance or regulatory requirements that require auditing, the consumer must work with the provider to implement their own monitoring solutions.

## Cortex Agents

Execution Environment
:   When creating a Cortex Agent for sharing that uses Cortex Analyst and semantic views, you must explicitly define the `execution_environment` with an empty string for the warehouse (`warehouse: ""`). You can’t omit this field, nor can you specify a specific warehouse name.

Tools
:   All tools must be in the same database as the Agent. While procedures and UDFs are shared by copy and may be in the same schema as the Agent, semantic views and Cortex Search-based tools must be in a different schema.

---
title: Defining arguments for UDFs and stored procedures
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-arguments.md
section: Developer Guide
---

# Defining arguments for UDFs and stored procedures

In the [CREATE FUNCTION](../sql-reference/sql/create-function.md) or [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) that you execute to define a
function or procedure, you specify arguments that can be passed in. For example:

```sqlexample
CREATE FUNCTION my_function(integer_argument INT, varchar_argument VARCHAR)
  ...
```

```sqlexample
CREATE PROCEDURE my_procedure(boolean_argument BOOLEAN, date_argument DATE)
  ...
```

When you call a function or procedure, the argument values are bound to the handler’s arguments. They may be bound based on
matching names or by argument position, depending on the language you’re using for the handler.

This topic provides guidelines on specifying the arguments for a function or procedure.

## Limits on the number of input arguments

Scalar functions (UDFs) have a limit of 500 input arguments.

## Specify the data types for the arguments

Choose the SQL data type that corresponds to the data type of the argument that you are using in the handler code.

For information about how Snowflake maps SQL data types to handler data types, see
[Data Type Mappings Between SQL and Handler Languages](udf-stored-procedure-data-type-mapping.md).

## Omit the `Session` argument for Java, Python, and Scala procedures

In the [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) statement for a procedure written in Java, Python, or Scala, do not define the
argument for the Snowpark `Session` object.

For example, suppose that your handler code passes in a `Session` object and a `String` object:

```java
public String queryTable(Session session, String tableName) { ... }
```

In the CREATE PROCEDURE statement, do not define an argument for the `Session` object. Instead, just define an argument
for the input string:

```sqlexample
CREATE OR REPLACE PROCEDURE query_table(table_name VARCHAR)
  ...
```

`Session` is an implicit argument that you do not specify when calling the procedure. At runtime, when you call your stored
procedure, Snowflake creates a `Session` object and passes it to your stored procedure.

## Specify optional arguments

You can specify that an argument is optional. For details, see the next sections:

* Designating an argument as optional
* Overloading functions and procedures with optional arguments
* Calling functions and procedures that have optional arguments

### Designating an argument as optional

If you want an argument to be optional, use the DEFAULT keyword to specify the default value for the argument.
For example:

```sqlexample
CREATE OR REPLACE FUNCTION build_string_udf(
    word VARCHAR,
    prefix VARCHAR DEFAULT 'pre-',
    suffix VARCHAR DEFAULT '-post'
  )
  ...
```

```sqlexample
CREATE OR REPLACE PROCEDURE build_string_proc(
    word VARCHAR,
    prefix VARCHAR DEFAULT 'pre-',
    suffix VARCHAR DEFAULT '-post'
  )
  ...
```

For the default value of the argument, you can use an expression. For example:

```sqlexample
CREATE OR REPLACE FUNCTION my_date_udf(optional_date_arg DATE DEFAULT CURRENT_DATE())
  ...
```

You must specify optional arguments after the required arguments (if any). You cannot
specify an optional argument before a required argument.

```sqlexample
-- This is not allowed.
CREATE FUNCTION wrong_order(optional_argument INTEGER DEFAULT 0, required_argument INTEGER)
  ...
```

### Overloading functions and procedures with optional arguments

If you are [overloading](udf-stored-procedure-naming-conventions.md) a function or procedure, you cannot use an optional
argument to distinguish between different signatures. For example, suppose that you create the following UDF that passes in
no arguments:

```sqlexample
CREATE FUNCTION my_udf_a()
  ...
```

If you attempt to create a UDF with the same name that passes in an optional argument, the CREATE FUNCTION statement fails:

```sqlexample
CREATE FUNCTION my_udf_a(optional_arg INTEGER DEFAULT 0)
  ...
```

```output
000949 (42723): SQL compilation error:
  Cannot overload FUNCTION 'MY_UDF_A' as it would cause ambiguous FUNCTION overloading.
```

As another example, suppose that you create a UDF that passes in a required INTEGER argument:

```sqlexample
CREATE FUNCTION my_udf_b(required_arg INTEGER)
  ...
```

If you attempt to create a UDF with the same name that passes in a required INTEGER argument and an optional argument, the CREATE
FUNCTION statement fails:

```sqlexample
CREATE FUNCTION my_udf_b(required_arg INTEGER, optional_arg INTEGER DEFAULT 0)
  ...
```

```output
000949 (42723): SQL compilation error:
  Cannot overload FUNCTION 'MY_UDF_B' as it would cause ambiguous FUNCTION overloading.
```

This also affects cases in which you use [ALTER FUNCTION … RENAME](../sql-reference/sql/alter-function.md) or
[ALTER PROCEDURE … RENAME](../sql-reference/sql/alter-procedure.md) to rename a function or procedure. If you want to rename a
function or procedure, there cannot be an existing function with the same name and signature. Optional arguments do not
distinguish one signature from another.

For example, suppose that you create a UDF named `abc_udf` that passes in a required INTEGER argument:

```sqlexample
CREATE FUNCTION abc_udf(required_arg INTEGER)
  ...
```

Suppose that you create a UDF with a different name (`def_udf`) that passes in a required INTEGER argument and an optional
argument:

```sqlexample
CREATE FUNCTION def_udf(required_arg INTEGER, optional_arg INTEGER DEFAULT 0)
  ...
```

If you attempt to change the name of `def_udf` to `abc_udf`, an error occurs because there is already a UDF that has the
same name and the same types of required arguments:

```output
000949 (42723): SQL compilation error:
  Cannot overload FUNCTION 'ABC_UDF' as it would cause ambiguous FUNCTION overloading.
```

### Calling functions and procedures that have optional arguments

To call functions and procedures that have optional arguments, see:

* [Calling a UDF that has optional arguments](udf/udf-calling-sql.md)
* [Specifying optional arguments](stored-procedure/stored-procedures-calling.md)

---
title: Dependency databases: Managing cross-database references
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/dependency-databases.md
section: Developer Guide
---

# Dependency databases: Managing cross-database references

With Snowflake Declarative Native Apps, providers can share data products using a manifest-driven model. In many apps, secure views reference objects in other provider databases. In classic secure sharing, you (the provider) grant REFERENCE_USAGE on each referenced database to the share. In Declarative Native Apps, you declare those dependency databases in the manifest using `required_databases`, thus ensuring that installs — especially in other regions — can resolve cross-database references reliably. This also applies to semantic views, user-defined functions (UDFs), or procedures used by secure views.

## When to use required_databases

You must include a database in `required_databases` whenever a shared object in `shared_content` references objects in a database that is not listed under `shared_content/databases`. This is essential for cross-region deployments where the presence of those dependencies can’t be assumed; for example, in the following situations:

* Secure views in the shared database that JOIN/SELECT from tables or views in other provider databases
* Views referencing UDFs or procedures that live in other databases
* Notebooks included via `application_content` if notebook logic or views queried by the notebook depend on objects in other databases
* Semantic views whose underlying physical tables or views are in another database

Cross-database dependencies are common. If you don’t explicitly declare external databases, an app might validate or install successfully in the provider’s region but fail in other regions because the required references can’t be resolved. `required_databases` removes this ambiguity by providing a declarative list of dependency databases that must be present and resolvable wherever the app is built, released, and installed.

The package version release will be blocked if any dependency databases are not explicitly declared in `required_databases`. An error message will be generated at the time of BUILD, COMMIT, or RELEASE, specifically stating that the referenced database is missing from the manifest’s `required_databases` section.

## When not to use required_databases

Note that including a database in `required_databases` doesn’t apply to:

* Objects fully contained in the databases listed under `shared_content/databases`
* Classic sharing, which uses privilege grants to shares (including REFERENCE_USAGE) instead of manifest declarations

## Replication limitations

Declaring databases in `required_databases` does not replicate those databases or their contents. It registers the dependency so the framework and listing workflows can prepare and resolve references appropriately.

To support cross-region installs and failover when your manifest uses `required_databases`:

* Identify dependency databases: For each entry under `shared_content.required_databases`, confirm which provider-owned database it maps to in your source account.
* Configure replication for each dependency: Set up database (or account) replication for every dependency database to the regions and accounts where you plan to build, release, and install the app. Use standard Snowflake replication features for this step.
* Keep names consistent: Ensure the database names in target regions exactly match the names you declare in `required_databases`. Name mismatches will cause BUILD/COMMIT/RELEASE to fail with an error indicating that the referenced database is missing from `required_databases`.
* Validate after replication completes: After the initial replication and any subsequent refreshes complete, run your BUILD, COMMIT, or RELEASE commands in the target region. If you see errors about unresolved or missing dependency databases, verify:

  + The database is replicated and available in the target account and region.
  + The database name matches the value in `required_databases`.
  + Any chained dependencies those databases rely on are also replicated and correctly named.

For end-to-end steps and options when configuring database and account replication, see [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md).

---
title: Dependency management policy for the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-dependencies.md
section: Developer Guide
---

# Dependency management policy for the Python Connector

The Snowflake Connector for Python depends on third party libraries, all of which are essential for communicating with the
Snowflake database. Although we intend to make dependency management easy and reliable, each library can introduce changes that
might result in unexpected behavior in the Snowflake Connector for Python or cause conflicts with other libraries.

This topic covers the following information:

* The policy for determining how the dependent library versions are selected as requirements for the Snowflake Connector for
  Python.
* The process for handling incidents that might occur as a result of changes in the dependent libraries.

## Policy for determining dependency requirements

The Snowflake Connector for Python sets dependency requirements according to the following rules:

* If the Snowflake Connector for Python directly refers to a package, the name of that package is included in the list of
  dependencies.
* For each dependent library, the requirements specify both the lower bound and upper bound versions of the library.

  + The lower bound version (the minimum version) is the earliest version used to verify that the library worked.
  + The upper bound version (the first version that is not supported) is the next major version.

    If a minor version introduces a change that breaks compatibility, the upper bound version will be set to that minor version.

  This provides a stable environment in which the connector can be tested with specific versions of the dependent libraries.

> **Note:**
>
> These rules are based on the assumption that all packages, including the Snowflake Connector for Python, follow the
> semantic versioning guidelines. According to these guidelines, a minor or patch version of a library
> should not introduce any API changes.
>
> For more information, see the [semantic versioning guidelines](https://semver.org/)..

## Handling incidents resulting from changes to dependent libraries

Although the dependency management policy is designed to minimize the effects of changes made in dependent libraries, incidents
can occur under unexpected conditions. This section discusses how each case is handled.

### Case 1. A dependent library introduces a change in API or behavior

If an incident occurs because a new version of a dependent library introduced a change to their API (and/or behavior), we will
release a new version of the Snowflake Connector for Python that excludes the new version of the dependent library from the range
of supported versions. We will make this change at the earliest opportunity. (Snowflake will make a best effort to address the
issue in the next release.)

For example, suppose that the requirements file specifies this range of versions for the dependent library `package1`:

```none
package1>=1.0,<2.0
```

In theory, the API should not change in any versions released within this range. However, if a change in version 1.3 breaks
compatibility, the upper bound version will be changed to exclude version 1.3 and later versions:

```none
package1>=1.0,<1.3
```

This change is intended to be a temporary solution to the problem. Once the issue has been resolved, we will change the upper
bound version back to the next major version of the library.

### Case 2. A dependent library introduces a new version greater than the upper bound

In this case, after we verify that the Snowflake Connector for Python works with the new version of the library, we’ll include the
new version in the range of supported versions for the next release of the Snowflake Connector for Python. For example, suppose
that the requirements file specifies this range of versions for the dependent library `package1`:

```none
package1>=1.0,<2.0
```

If `package1` version 2.0 is released, the new version cannot be used with the Snowflake Connector for Python because the
version is out of the range of required versions. We have automated tests that detect this case.

Note that if there are critical reasons for supporting this new version of the library (for example, if the new version includes a
security patch), we’ll make a best effort to release the updated Snowflake Connector for Python in the next release after the
incident is reported.

---
title: Deprecated functionality
source: https://docs.snowflake.com/en/developer-guide/sql-api/sql-api-old.md
section: Developer Guide
---

# Deprecated functionality

This topic describes functionality of the Snowflake SQL API that was deprecated in Snowflake version 5.40.

See the [Snowflake SQL API](index.md) for information on the current behavior of the SQL API.

## Using the deprecated SQL API functionality

The [current version](about-endpoints.md) of the SQL API is enabled by default. To access the deprecated version, use the following endpoints:

| Endpoint | Description |
| --- | --- |
| `/api/statements/` | Use this endpoint to submit SQL statements for execution. |
| `/api/statements/statementHandle` | Use this endpoint to check the status of the execution of a statement. (`statementHandle` is a unique identifier for the statement submitted for execution.) |
| `/api/statements/statementHandle/cancel` | Use this endpoint to cancel the execution of a statement. |

> **Note:**
>
> These endpoints are no longer supported and are provided only for backwards compatibility. They will be disabled in a
> future release.

## Changed and deprecated functionality

When using the deprecated SQL API functionality, if you set the `pageSize` request parameter to paginate the results, Snowflake returns the first page of results in the response. You can use the `numPages` field in the
`ResultSet_resultSetMetaData` object in the `ResultSet` object to determine the total number of pages of results.

To get the next page of results or other pages of results, use the URLs provided in the `Link` header in the HTTP response. The `Link` header specifies the URLs for retrieving the first, next, previous, and last page of the results

The following functionality is changed or deprecated:

* You can specify the `nullable` parameter in both GET and POST requests.
* Use the `pageSize` parameter to specify the number of rows returned by a query. The page size can range from the minimum supported number (10) to the maximum supported number (10000) of rows per page. By default, the number of rows returned varies, depending on the execution of the statement.
* You use the `page` to identify which page of results to return. The number can range from 0 to the total number of pages minus 1.
* Row numbers are returned by default as part of the data set.

## Determining if the result set page size exceeds the limit

The deprecated functionality in the SQL API can return a result set page that has a maximum size of approximately 10 MB.

If the result set page exceeds this size, the endpoint returns an HTTP 200 response with a truncated result set in the body and
the `code` field set to `391908`:

```none
HTTP/1.1 200 OK
...
{
  "code": "391908",
  ...
}
```

If this occurs, send the request again with the `pageSize` parameter set to a smaller value that fits within the maximum
size of a page.

---
title: Design Guidelines and Constraints for Functions and Procedures
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-guidelines.md
section: Developer Guide
---

# Design Guidelines and Constraints for Functions and Procedures

This topic describes constraints and guidelines to keep in mind when writing UDFs and stored procedures.

[Keeping handler code in-line or on a stage](inline-or-staged.md)
:   Choose whether to have your handler code in-line or packaged in a separate file.

[Designing Handlers that Stay Within Snowflake-Imposed Constraints](udf-stored-procedure-constraints.md)
:   Ensure stability within the Snowflake environment by developing within constraints described in this topic.

[Naming and overloading procedures and UDFs](udf-stored-procedure-naming-conventions.md)
:   Learn the rules for naming and overloading procedures and UDFs.

[Defining arguments for UDFs and stored procedures](udf-stored-procedure-arguments.md)
:   Specify the arguments for your procedures and UDFs.

[Data Type Mappings Between SQL and Handler Languages](udf-stored-procedure-data-type-mapping.md)
:   Choose the best data types for argument and return values in handler code.

[Making dependencies available to your code](upload-dependencies.md)
:   Make your handler or its dependencies available for use at runtime on Snowflake.

## Security

[Security Practices for UDFs and Procedures](udf-stored-procedure-security-practices.md)
:   Help your handler code execute securely using these best practices.

[Protecting Sensitive Information with Secure UDFs and Stored Procedures](secure-udf-procedure.md)
:   Ensure that sensitive information is concealed from users who should not have access to it.

[Pushdown Optimization and Data Visibility](pushdown-optimization.md)
:   Learn about the pushdown optimization that makes queries more efficient, but which can also expose data that you might not want to be
    visible.

---
title: Designing Handlers that Stay Within Snowflake-Imposed Constraints
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-constraints.md
section: Developer Guide
---

# Designing Handlers that Stay Within Snowflake-Imposed Constraints

To ensure stability within the Snowflake environment, Snowflake places the following constraints on handler code. Unless stated otherwise,
these limitations are enforced when the handler is executed, not when it is created.

## Avoid Consuming Too Much Memory

Avoid the following, which can consume large amounts of memory:

* Large data values. These can include binary values, as well as large arrays, objects, or variant.

  Snowflake converts between SQL data types and corresponding types in the handler language. For more information, see
  [Data Type Mappings Between SQL and Handler Languages](udf-stored-procedure-data-type-mapping.md).
* Excessive stack depth. Snowflake has tested simple function calls nested 50 levels deep without error. The practical maximum limit
  depends upon how much information is put on the stack.

Handler code will return an error if it consumes too much memory. The specific limit is subject to change.

## Avoid Algorithms That Take a Large Amount of Time Per Call

If a handler takes too long to complete, Snowflake kills the SQL statement and returns an error to the user. This limits
the impact and cost of errors such as infinite loops.

## Don’t Use Libraries That Could Introduce Security Vulnerabilities

Although your handler can use functionality in external libraries, Snowflake security restrictions disable some
capabilities, such as writing to files. For details about library restrictions, see
[Security Practices for UDFs and Procedures](udf-stored-procedure-security-practices.md).

---
title: Designing Java UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-designing.md
section: Developer Guide
---

# Designing Java UDFs

This topic helps you design Java UDFs.

## Choosing your data types

Before you write your code:

* Choose the data types your function should accept as arguments and the data
  type your function should return.
* Take into account time-zone related issues.
* Decide how to handle NULL values.

### SQL-Java data type mappings for parameters and return types

For information on how Snowflake converts between Java and SQL data types, see
[Data Type Mappings Between SQL and Handler Languages](../../udf-stored-procedure-data-type-mapping.md).

### TIMESTAMP_LTZ values and time zones

A Java UDF is largely isolated from the environment in which it is called. However, the timezone is inherited from
the calling environment. If the caller’s session set a default time zone before calling the Java UDF, then the Java
UDF has the same default time zone. Java UDF uses the same [IANA Time Zone Database](https://www.iana.org/time-zones) data as the native [TIMEZONE](../../../sql-reference/parameters.md)
Snowflake SQL uses (i.e. data from release 2025b of the Time Zone Database).

### NULL values

Snowflake supports two distinct NULL values: SQL `NULL` and VARIANT’s JSON `null`. (For information about Snowflake
VARIANT NULL, see [NULL values](../../../user-guide/semistructured-considerations.md).)

Java supports one `null` value, which is only for non-primitive data types.

A SQL `NULL` argument to a Java UDF translates to the Java `null` value, but only for Java data types that
support `null`.

A returned Java `null` value translates back to SQL `NULL`.

### Arrays and variable number of arguments

Java UDFs can receive arrays of any of the following Java data types:

* String
* boolean
* double
* float
* int
* long
* short

The data type of the SQL values passed must be compatible with the corresponding Java data type. For details about data type compatibility,
see [SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

The following additional rules apply for each of the specified Java data types:

* boolean: The Snowflake ARRAY must contain only BOOLEAN elements, and must not contain any NULL values.
* int/short/long: The Snowflake ARRAY must contain only [fixed-point](../../../sql-reference/data-types-numeric.md) elements with a
  scale of 0, and must not contain any NULL values.
* float/double: The Snowflake ARRAY must contain either:

  + [FLOAT](../../../sql-reference/data-types-numeric.md) elements.
  + [Fixed-point](../../../sql-reference/data-types-numeric.md) elements (with any scale).

  The ARRAY must not contain any NULL values.

Java methods can receive these arrays in either of two ways:

* Using Java’s array feature.
* Using Java’s *varargs* (variable number of arguments) feature.

In both cases, your SQL code must pass an [ARRAY](../../../sql-reference/data-types-semistructured.md).

#### Passing via an ARRAY

Declare the Java parameter as an array. For example, the third parameter in the following method is a String array:

```java
static int myMethod(int fixedArgument1, int fixedArgument2, String[] stringArray)
```

Below is a complete example:

Create and load the table:

```sqlexample
CREATE TABLE string_array_table(id INTEGER, a ARRAY);
INSERT INTO string_array_table (id, a) SELECT
        1, ARRAY_CONSTRUCT('Hello');
INSERT INTO string_array_table (id, a) SELECT
        2, ARRAY_CONSTRUCT('Hello', 'Jay');
INSERT INTO string_array_table (id, a) SELECT
        3, ARRAY_CONSTRUCT('Hello', 'Jay', 'Smith');
```

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION concat_varchar_2(a ARRAY)
  RETURNS VARCHAR
  LANGUAGE JAVA
  HANDLER = 'TestFunc_2.concatVarchar2'
  TARGET_PATH = '@~/TestFunc_2.jar'
  AS
  $$
  class TestFunc_2 {
      public static String concatVarchar2(String[] strings) {
          return String.join(" ", strings);
      }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT concat_varchar_2(a)
  FROM string_array_table
  ORDER BY id;
+---------------------+
| CONCAT_VARCHAR_2(A) |
|---------------------|
| Hello               |
| Hello Jay           |
| Hello Jay Smith     |
+---------------------+
```

#### Passing via varargs

Using varargs is very similar to using an array.

In your Java code, use Java’s varargs declaration style:

```java
static int myMethod(int fixedArgument1, int fixedArgument2, String ... stringArray)
```

Below is a complete example. The only significant difference between this example and the preceding example (for arrays) is the
declaration of the parameters to the method.

Create and load the table:

```sqlexample
CREATE TABLE string_array_table(id INTEGER, a ARRAY);
INSERT INTO string_array_table (id, a) SELECT
        1, ARRAY_CONSTRUCT('Hello');
INSERT INTO string_array_table (id, a) SELECT
        2, ARRAY_CONSTRUCT('Hello', 'Jay');
INSERT INTO string_array_table (id, a) SELECT
        3, ARRAY_CONSTRUCT('Hello', 'Jay', 'Smith');
```

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION concat_varchar(a ARRAY)
  RETURNS VARCHAR
  LANGUAGE JAVA
  HANDLER = 'TestFunc.concatVarchar'
  TARGET_PATH = '@~/TestFunc.jar'
  AS
  $$
  class TestFunc {
      public static String concatVarchar(String ... stringArray) {
          return String.join(" ", stringArray);
      }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT concat_varchar(a)
    FROM string_array_table
    ORDER BY id;
+-------------------+
| CONCAT_VARCHAR(A) |
|-------------------|
| Hello             |
| Hello Jay         |
| Hello Jay Smith   |
+-------------------+
```

## Designing Java UDFs that stay within Snowflake-imposed constraints

For information on designing handler code that runs well on Snowflake, see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).

## Designing the class

When a SQL statement calls your Java UDF, Snowflake calls a Java method you have written. Your Java method is called a
“handler method”, or “handler” for short.

As with any Java method, your method must be declared as part of a class. Your handler method can be a static method or an instance
method of the class. If your handler is an instance method, and your class defines a zero-argument constructor, then Snowflake
invokes your constructor at initialization time to create an instance of your class. If
your handler is a static method, your class is not required to have a constructor.

The handler is called once for each row passed to the Java UDF. (Note: a new instance of the class is not created for each row;
Snowflake can call the same instance’s handler method more than once, or call the same static method more than once.)

To optimize execution of your code, Snowflake assumes that initialization might be slow, while execution of the handler method
is fast. Snowflake sets a longer timeout for executing initialization (including the time to load your UDF and the time
to call the constructor of the handler method’s containing class, if a constructor is defined) than for executing the handler
(the time to call your handler with one row of input).

Additional information about designing the class is in [Creating a Java UDF handler](udf-java-creating.md).

## Optimizing initialization and controlling global state in scalar UDFs

Most function and procedure handlers should follow the guidelines below:

* If you need to initialize shared state that does not change across rows, initialize it outside the handler function, such as in the
  module or constructor.
* Write your handler function or method to be thread safe.
* Avoid storing and sharing dynamic state across rows.

If your UDF cannot follow these guidelines, or if you would like a deeper understanding of the reasons for these guidelines,
please read the next few subsections.

### Sharing state across calls

Snowflake expects scalar UDFs to be processed independently. Relying on state shared between invocations can result in unexpected
behavior. This is because the system can process rows in any order and spread those invocations across several JVMs (for handlers written
in Java or Scala) or instances (for handlers written in Python).

UDFs should avoid relying on shared state across calls to the handler method. However, there are two situations in which you might want a
UDF to store shared state:

* Code that contains expensive initialization logic that you do not want to repeat for each row.
* Code that leverages shared state across rows, such as a cache.

If you need to share state across multiple rows, and if that state does not change over time, then use a constructor to create
shared state by setting instance-level variables. The constructor is executed only once per instance, while the handler is called
once per row, so initializing in the constructor is cheaper when a handler processes multiple rows. And because the constructor is
called only once, the constructor does not need to be written to be thread-safe.

If your UDF stores shared state that changes, then your code must be prepared to handle concurrent access to that state.
The next two sections provide more information about parallelism and shared state.

### Understanding Java UDF parallelization

To improve performance, Snowflake parallelizes both across and within JVMs.

* Across JVMs:

  Snowflake parallelizes across workers in a [warehouse](../../../user-guide/warehouses-overview.md). Each worker runs one (or more)
  JVMs. This means that there is no global shared state. At most, state can be shared only within a single JVM.
* Within JVMs:

  + Each JVM can execute multiple threads that can call the same instance’s handler method in parallel. This means that each
    handler method needs to be thread-safe.
  + If a UDF is IMMUTABLE and a SQL statement calls the UDF more than once with the same arguments for the same row, then the UDF
    returns the same value for each call for that row. For example, the following returns the same value twice for each row
    if the UDF is IMMUTABLE:

    ```sqlexample
    SELECT
        my_java_udf(42),
        my_java_udf(42)
      FROM table1;
    ```

    If you would like multiple calls to return independent values even when passed the same arguments, and if you do not want
    to declare the function VOLATILE, then bind multiple separate UDFs to the same handler method. For example:

    1. Create a JAR file named `@java_udf_stage/rand.jar` with code:

       ```java
       class MyClass {

         private double x;

         // Constructor
         public MyClass()  {
           x = Math.random();
         }

         // Handler
         public double myHandler() {
           return x;
         }
       }
       ```
    2. Create the Java UDFs as shown below. These UDFs have different names, but use the same JAR file and the same handler
       within that JAR file.

       ```sqlexample
       CREATE FUNCTION my_java_udf_1()
         RETURNS DOUBLE
         LANGUAGE JAVA
         IMPORTS = ('@java_udf_stage/rand.jar')
         HANDLER = 'MyClass.myHandler';

       CREATE FUNCTION my_java_udf_2()
         RETURNS DOUBLE
         LANGUAGE JAVA
         IMPORTS = ('@java_udf_stage/rand.jar')
         HANDLER = 'MyClass.myHandler';
       ```
    3. The following code calls both UDFs. The UDFs point to the same JAR file and handler. These calls create two
       instances of the same class. Each instance returns an independent value, so the example below returns two independent
       values, rather than returning the same value twice:

       ```sqlexample
       SELECT
           my_java_udf_1(),
           my_java_udf_2()
         FROM table1;
       ```

### Storing JVM state information

One reason to avoid relying on dynamic shared state is that rows are not necessarily processed in a predictable order.
Each time a SQL statement is executed, Snowflake can vary the number of batches, the order in which batches are
processed, and the order of rows within a batch. If a scalar UDF is designed so that one row affects the return value for a
subsequent row, then the UDF can return different results each time that the UDF is executed.

## Handling errors

A Java method used as a UDF can use the normal Java exception-handling techniques to catch errors within the
method.

If an exception occurs inside the method and is not caught by the method, Snowflake raises an error that includes the stack trace for the
exception. When [logging of unhandled exceptions](../../logging-tracing/unhandled-exception-messages.md) is enabled,
Snowflake logs data about unhandled exceptions in an event table.

You can explicitly throw an exception without catching it in order to end the query and produce a SQL error. For
example:

```java
if (x < 0) {
  throw new IllegalArgumentException("x must be non-negative.");
}
```

When debugging, you can include values in the SQL error message text. To do so, place an entire Java method body in a
try-catch block; append argument values to the caught error’s message; and throw an exception with the extended
message. To avoid revealing sensitive data, remove argument values prior to deploying JAR files to a production
environment.

## Following best practices

* Write platform-independent code.

  + Avoid code that assumes a specific CPU architecture (e.g. x86).
  + Avoid code that assumes a specific operating system.
* If you need to execute initialization code and do not want to include it in the method that you call, you can put
  the initialization code into a static initialization block.
* Whenever possible when using an in-line handler, specify a value for the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) or
  [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) TARGET_PATH parameter. This will prompt Snowflake to reuse previously-generated
  handler code output rather than recompiling for each call. For more information, see [Using an in-line handler](../../inline-or-staged.md).

See also:

* [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md)
* [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md)

## Following good security practices

To help ensure that your handler functions in a secure way, see the best practices described in
[Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).

---
title: Designing Python UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-designing.md
section: Developer Guide
---

# Designing Python UDFs

This topic helps you design Python UDFs.

> **Note:**
>
> Vectorized Python UDFs let you define Python functions that receive batches of input rows
> as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and
> return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
> or [Series](https://pandas.pydata.org/docs/reference/series.html).
> The batch interface results in much better performance with machine learning inference scenarios.
> For more information, see [Vectorized Python UDFs](udf-python-batch.md).

## Choosing your data types

Before you write your code:

* Choose the data types your function should accept as arguments and the data
  type your function should return.
* Take into account time-zone related issues.
* Decide how to handle NULL values.

For more information about how Snowflake maps Python and SQL data types, see [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

### TIMESTAMP_LTZ values and time zones

A Python UDF is largely isolated from the environment in which it is called. However, the timezone is inherited from
the calling environment. If the caller’s session set a default time zone before calling the Python UDF, then the Python
UDF has the same default time zone. For more information about timezones, see [TIMEZONE](../../../sql-reference/parameters.md).

### NULL values

For all Snowflake types except Variant, a SQL `NULL` argument to a Python UDF translates to the
Python `None` value and a returned Python `None` value translates back to SQL `NULL`.

A Variant type value can be: SQL `NULL` or a VARIANT JSON `null`. For information about Snowflake
VARIANT NULL, see [NULL values](../../../user-guide/semistructured-considerations.md).

* A VARIANT JSON `null` is translated to Python `None`.
* A SQL `NULL` is translated to a Python object, which has the `is_sql_null` attribute.

For an example, see [NULL Handling in Python UDFs](udf-python-examples.md).

## Designing Python UDFs that stay within Snowflake-imposed constraints

To ensure stability within the Snowflake environment, Snowflake places the following constraints on Python UDFs.
Unless stated otherwise, these limitations are enforced when the UDF is executed, not when it is created.

Training machine learning (ML) models can sometimes be very resource intensive.
Snowpark-optimized warehouses are a type of Snowflake virtual warehouse that can be used for workloads
that require a large amount of memory and compute resources.
For information on machine learning models and Snowpark Python, see [Training Machine Learning Models with Snowpark Python](../../snowpark/python/python-snowpark-training-ml.md).

### Memory

Avoid consuming too much memory.

* Large data values can consume a large amount of memory.
* Excessive stack depth can consume a large amount of memory.

UDFs return an error if they consume too much memory. The specific limit is subject to change.

If UDFs fail due to consuming too much memory, consider using [Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md).

### Time

Avoid algorithms that take a large amount of time per call.

If a UDF takes too long to complete, Snowflake kills the SQL statement and returns an error to the user. This limits
the impact and cost of errors such as infinite loops.

## Designing the module

When a SQL statement calls your Python UDF, Snowflake calls a Python function you have written. Your Python function is called a
“handler function”, or “handler” for short. The handler is a function implemented inside a user-supplied module.

As with any Python function, your function must be declared as part of a module.

The handler is called once for each row passed to the Python UDF.
The module that contains the function is not re-imported for each row. Snowflake can call the same module’s handler function more than once.

To optimize execution of your code, Snowflake assumes that initialization might be slow, while execution of the handler function
is fast. Snowflake sets a longer timeout for executing initialization (including the time to load your UDF and the time
to initialize the module) than for executing the handler
(the time to call your handler with one row of input).

Additional information about designing the module is in [Creating Python UDFs](udf-python-creating.md).

## Optimizing initialization and controlling global state in scalar UDFs

Most scalar UDFs should follow the guidelines below:

* If you need to initialize shared state that does not change across rows, initialize it in the module instead of the handler function.
* Write your handler function to be thread safe.
* Avoid storing and sharing dynamic state across rows.

If your UDF cannot follow these guidelines, be aware that Snowflake expects scalar UDFs to be processed independently. Relying on state
shared between invocations can result in unexpected behavior, as the system can process rows in any order and spread those invocations
across several instances. In addition, there can be multiple executions of the same handler function within the same Python
interpreter on multiple threads.

UDFs should avoid relying on shared state across calls to the handler function. However, there are two situations in which you might want a
UDF to store shared state:

* Code that contains expensive initialization logic that you do not want to repeat for each row.
* Code that leverages shared state across rows, such as a cache.

When it’s necessary to maintain global state that will be shared across handler invocations, you must protect global state against
data races by using the synchronization primitives described in
[threading - Thread-based parallelism](https://docs.python.org/3.12/library/threading.html).

## Optimizing for scale and performance

### Use vectorized Python UDFs with data science libraries

When your code will use machine learning or data science libraries, use vectorized Python UDFs to
define Python functions that receive input rows in batches on which these libraries are optimized to operate.

For more information, see [Vectorized Python UDFs](udf-python-batch.md).

### Write single-threaded UDF handlers

Write UDF handlers that are single-threaded. Snowflake will handle partitioning the data and scaling the UDF across the virtual warehouse
compute resources.

### Put expensive initialization in the module

Put expensive initialization code into the module scope. There, it will be performed once when the UDF is initialized.
Avoid rerunning the expensive initialization code on every UDF handler invocation.

## Handling errors

A Python function used as a UDF can use the normal Python exception-handling techniques to catch errors within the
function.

If an exception occurs inside the function and is not caught by the function, Snowflake raises an error that includes the stack trace for the
exception. When [logging of unhandled exceptions](../../logging-tracing/unhandled-exception-messages.md) is enabled,
Snowflake logs data about unhandled exceptions in an event table.

You can explicitly throw an exception without catching it in order to end the query and produce a SQL error. For
example:

```python
if (x < 0):
  raise ValueError("x must be non-negative.");
```

When debugging, you can include values in the SQL error message text. To do so, place an entire Python function body in a
try-catch block; append argument values to the caught error’s message; and throw an exception with the extended
message. To avoid revealing sensitive data, remove argument values prior to deploying to a production
environment.

## Following good security practices

To help ensure that your handler functions in a secure way, see the best practices described in
[Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).

---
title: Determining the number of rows affected by SQL statements
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/dml-status.md
section: Developer Guide
---

# Determining the number of rows affected by SQL statements

After a [DML command](../../sql-reference/sql-dml.md) is executed (excluding the [TRUNCATE TABLE](../../sql-reference/sql/truncate-table.md)
command), Snowflake Scripting sets the following global variables. You can use these variables to determine if the last
DML statement affected any rows, or how many rows were returned by a query.

| Variable | Description |
| --- | --- |
| `ACTIVITY_COUNT` | Number of rows affected by the last DML statement, or the number of rows returned by the last SELECT query. Set after each statement execution. |
| `SQLROWCOUNT` | Number of rows affected by the last DML statement.  This is equivalent to [`getNumRowsAffected()`](../stored-procedure/stored-procedures-api.md "getNumRowsAffected") in JavaScript stored procedures. |
| `SQLFOUND` | `true` if the last DML statement affected one or more rows. |
| `SQLNOTFOUND` | `true` if the last DML statement affected zero rows. |

> **Note:**
>
> The [2025_01 behavior change bundle](../../release-notes/bcr-bundles/2025_01_bundle.md) changes the behavior
> of these variables. When the bundle is enabled, the variables return NULL when a non-DML statement is executed
> after the last DML statement in a Snowflake Scripting block or stored procedure. The bundle is enabled by
> default. For more information about the behavior change, see [Snowflake Scripting: Changes to global variables](../../release-notes/bcr-bundles/2025_01/bcr-1850.md).
>
> If the bundle is disabled, you can [enable it in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md) by
> executing the following statement:
>
> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_01');
> ```
>
> To disable the bundle, execute the following statement:
>
> ```sqlexample
> SELECT SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE('2025_01');
> ```

The examples in this section use the following table:

```sqlexample
CREATE OR REPLACE TABLE my_values (value NUMBER);
```

The following example uses the `SQLROWCOUNT` variable to return the number of rows affected by the last
DML statement (the INSERT statement).

```sqlexample
BEGIN
  LET sql_row_count_var INT := 0;
  INSERT INTO my_values VALUES (1), (2), (3);
  sql_row_count_var := SQLROWCOUNT;
  SELECT * from my_values;
  RETURN sql_row_count_var;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET sql_row_count_var INT := 0;
  INSERT INTO my_values VALUES (1), (2), (3);
  sql_row_count_var := SQLROWCOUNT;
  SELECT * from my_values;
  RETURN sql_row_count_var;
END;
$$;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               3 |
+-----------------+
```

The following example uses the `SQLFOUND` and `SQLNOTFOUND` variables to return the number of rows affected by the
last DML statement (the UPDATE statement).

```sqlexample
BEGIN
  LET sql_row_count_var INT := 0;
  LET sql_found_var BOOLEAN := NULL;
  LET sql_notfound_var BOOLEAN := NULL;
  IF ((SELECT MAX(value) FROM my_values) > 2) THEN
    UPDATE my_values SET value = 4 WHERE value < 3;
    sql_row_count_var := SQLROWCOUNT;
    sql_found_var := SQLFOUND;
    sql_notfound_var := SQLNOTFOUND;
  END IF;
  SELECT * from my_values;
  IF (sql_found_var = true) THEN
    RETURN 'Updated ' || sql_row_count_var || ' rows.';
  ELSEIF (sql_notfound_var = true) THEN
    RETURN 'No rows updated.';
  ELSE
    RETURN 'No DML statements executed.';
  END IF;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET sql_row_count_var INT := 0;
  LET sql_found_var BOOLEAN := NULL;
  LET sql_notfound_var BOOLEAN := NULL;
  IF ((SELECT MAX(value) FROM my_values) > 2) THEN
    UPDATE my_values SET value = 4 WHERE value < 3;
    sql_row_count_var := SQLROWCOUNT;
    sql_found_var := SQLFOUND;
    sql_notfound_var := SQLNOTFOUND;
  END IF;
  SELECT * from my_values;
  IF (sql_found_var = true) THEN
    RETURN 'Updated ' || sql_row_count_var || ' rows.';
  ELSEIF (sql_notfound_var = true) THEN
    RETURN 'No rows updated.';
  ELSE
    RETURN 'No DML statements executed.';
  END IF;
END;
$$;
```

When the anonymous block runs, the `SQLFOUND` variable is `true` because the UPDATE statement updates two rows.

```output
+-----------------+
| anonymous block |
|-----------------|
| Updated 2 rows. |
+-----------------+
```

Query the table to see the current values:

```sqlexample
SELECT * FROM my_values;
```

```output
+-------+
| VALUE |
|-------|
|     4 |
|     4 |
|     3 |
+-------+
```

Run the same anonymous block again, and the results are the following:

* The UPDATE statement is executed because there is a value in the table that is greater than `2`. That is,
  the IF condition is satisfied.
* The `SQLNOTFOUND` variable is `true` because no rows are updated. The UPDATE statement doesn’t update
  any rows because none of the values in the table are less than `3` (specified in the WHERE clause).

The query returns the following output:

```output
+------------------+
| anonymous block  |
|------------------|
| No rows updated. |
+------------------+
```

Now, update the table to set all of the values to `1`:

```sqlexample
UPDATE my_values SET value = 1;

SELECT * FROM my_values;
```

```output
+-------+
| VALUE |
|-------|
|     1 |
|     1 |
|     1 |
+-------+
```

Run the same anonymous block again, and the UPDATE statement isn’t executed because none of the values
in the table are greater than `2`. That is, the IF condition isn’t satisfied, so the UPDATE statement
doesn’t execute.

```output
+-----------------------------+
| anonymous block             |
|-----------------------------|
| No DML statements executed. |
+-----------------------------+
```

## ACTIVITY_COUNT examples

Unlike `SQLROWCOUNT`, the `ACTIVITY_COUNT` variable is set after each statement execution, including
SELECT queries. This makes it useful for tracking both the number of rows affected by DML operations and the number
of rows returned by queries.

The following example demonstrates `ACTIVITY_COUNT` after an INSERT statement and a SELECT query:

```sqlexample
BEGIN
  INSERT INTO my_values VALUES (1), (2), (3);
  LET insert_count INT := ACTIVITY_COUNT;
  SELECT * FROM my_values WHERE value > 1;
  LET select_count INT := ACTIVITY_COUNT;
  RETURN 'Inserted ' || insert_count || ' rows, query returned ' || select_count || ' rows.';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  INSERT INTO my_values VALUES (1), (2), (3);
  LET insert_count INT := ACTIVITY_COUNT;
  SELECT * FROM my_values WHERE value > 1;
  LET select_count INT := ACTIVITY_COUNT;
  RETURN 'Inserted ' || insert_count || ' rows, query returned ' || select_count || ' rows.';
END;
$$;
```

After the INSERT, `ACTIVITY_COUNT` is `3` (three rows inserted). After the SELECT,
`ACTIVITY_COUNT` is `2` (two rows match the `WHERE value > 1` condition).

```output
+-------------------------------------------+
| anonymous block                           |
|-------------------------------------------|
| Inserted 3 rows, query returned 2 rows.   |
+-------------------------------------------+
```

---
title: Development clients for Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-clients.md
section: Developer Guide
---

# Development clients for Snowpark Connect for Spark

You can run Spark workloads interactively from clients such as Snowflake Notebooks, Jupyter Notebooks, VS Code, or any Python-based
interface without needing to manage a Spark cluster. The workloads run on the Snowflake infrastructure.

When you develop Spark workloads interactively with Snowpark Connect for Spark, you can perform the following tasks:

* Run Spark workloads from local tools without setting up any infrastructure.
* Run code that is compatible with PySpark APIs and workflows.
* Access Snowflake compute resources for running queries and transformations.
* Integrate Spark into existing data science, exploration, or development workflows.
* Authenticate with programmatic access tokens (PATs) for secure authentication that is aligned with modern enterprise access controls.

The following table lists some of the tools you can use when you work with Spark workloads on Snowflake:

| Purpose | Tools |
| --- | --- |
| Interactively develop Spark workloads that run on Snowflake. | * [Run Spark workloads from Snowflake Notebooks](snowpark-connect-workloads-snowflake-notebook.md) * [Run Spark workloads from VS Code, Jupyter Notebooks, or a terminal](snowpark-connect-workloads-jupyter.md) |
| Run Spark workloads as a batch. | * [Submitting Spark applications](snowpark-submit.md) |

---
title: DevOps with Snowflake
source: https://docs.snowflake.com/en/developer-guide/builders/devops-with-snowflake.md
section: Developer Guide
---

# DevOps with Snowflake

Snowflake provides tools and practices for managing your Snowflake environments as code, validating changes before they reach
production, and automating deployments through CI/CD pipelines.

## What is DevOps with Snowflake?

DevOps with Snowflake brings software engineering best practices to data infrastructure management. The core principles are:

* **Define as code.** Declare the desired state of your Snowflake objects in version-controlled files. Snowflake determines and applies
  the necessary changes (create, alter, or drop) to reach that state.
* **Validate before you deploy.** Preview proposed changes in a plan step before applying them to your account. Review creates, alters,
  and drops, then deploy when you’re confident the changes are correct.
* **Automate with CI/CD.** Integrate Snowflake into your existing CI/CD pipelines so that deployments are triggered by pull requests,
  merges, or scheduled runs rather than manual steps.

The recommended approach is to use [DCM Projects](../../user-guide/dcm-projects/dcm-projects-overview.md) (Database Change Management
Projects), which unify declarative object management, plan-then-deploy validation, multi-environment targeting, and CI/CD automation
into a single workflow.

## Define your Snowflake objects as code

### DCM Projects (recommended)

[DCM Projects](../../user-guide/dcm-projects/dcm-projects-overview.md) (Database Change Management Projects) provide a declarative,
infrastructure-as-code approach to managing your Snowflake environment. Instead of writing imperative scripts that specify each step, you
define the desired target state of your objects. Snowflake compares those definitions against the current state and determines the
necessary changes.

A DCM project consists of:

* A **manifest file** (`manifest.yml`) that specifies deployment targets, owner roles, and templating configurations for each
  environment.
* **Definition files** (SQL files under `sources/definitions/`) that contain DEFINE statements for your Snowflake objects, GRANT
  statements for access control, and ATTACH statements for data quality expectations.

The following example shows a definition file that creates infrastructure for multiple teams using Jinja2 templating:

```sqlexample
{% for team in teams %}

  DEFINE DATABASE {{team.name}}_DB;

  DEFINE WAREHOUSE {{team.name}}_WH
    WITH
      warehouse_size = '{{team.wh_size}}'
      auto_suspend = 300;

  DEFINE ROLE {{team.name}}_ADMIN;

  GRANT OWNERSHIP ON DATABASE {{team.name}}_DB TO ROLE {{team.name}}_ADMIN;
  GRANT OWNERSHIP ON WAREHOUSE {{team.name}}_WH TO ROLE {{team.name}}_ADMIN;

{% endfor %}
```

For complete documentation on DCM Projects, including how to set up your project files, manage multiple environments, and automate
deployments, see [Snowflake DCM Projects](../../user-guide/dcm-projects/dcm-projects-overview.md).

### dbt Projects on Snowflake

[dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake.md) let you deploy and run
[dbt Core](https://www.getdbt.com/) projects as native Snowflake objects. You define SQL transformations in dbt models, deploy them
as a versioned DBT PROJECT object, and execute them with Snowflake SQL or the Snowflake CLI. You can schedule runs with Snowflake tasks
and integrate deployment into CI/CD pipelines.

For more information, see [dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

### Alternative: CREATE OR ALTER with versioned scripts

For individual object changes outside of a DCM project, you can use the [CREATE OR ALTER <object>](../../sql-reference/sql/create-or-alter.md) command, which creates
the object or alters it to match the definition specified by the command. By using this command from a versioned file in a remote
repository, you can roll back changes to a previous version by executing a previous version of the file.

SQLPython

```sqlexample
CREATE OR ALTER TABLE vacation_spots (
  city VARCHAR,
  airport VARCHAR,
  avg_temperature_air_f FLOAT,
  avg_relative_humidity_pct FLOAT,
  avg_cloud_cover_pct FLOAT,
  precipitation_probability_pct FLOAT
) data_retention_time_in_days = 1;
```

```Python
from snowflake.core import Root
from snowflake.core.table import PrimaryKey, Table, TableColumn

my_table = root.databases["my_db"].schemas["my_schema"].tables["vacation_spots"].fetch()
my_table.columns.append(TableColumn(name="city", datatype="varchar", nullable=False))
my_table.columns.append(TableColumn(name="airport", datatype="varchar", nullable=False))
my_table.columns.append(TableColumn(name="avg_temperature_air_f", datatype="float", nullable=False))
my_table.columns.append(TableColumn(name="avg_relative_humidity_pct", datatype="float", nullable=False))
my_table.columns.append(TableColumn(name="avg_cloud_cover_pct", datatype="float", nullable=False))
my_table.columns.append(TableColumn(name="precipitation_probability_pct", datatype="float", nullable=False))

my_table_res = root.databases["my_db"].schemas["my_schema"].tables["vacation_spots"]
my_table_res.create_or_alter(my_table)
```

> **Note:**
>
> You can also use the [Snowflake Python APIs](../snowflake-python-api/snowflake-python-overview.md) and
> [Snowflake CLI](../snowflake-cli/index.md) to manage Snowflake resources. If you prefer to do your data engineering work
> in Python, Snowflake’s first-class Python API enables you to do the same resource management in the language you are most productive in.

## Validate and preview changes

Before deploying changes to your Snowflake account, you can preview the proposed modifications to verify they match your intent.

### Plan with DCM Projects

DCM Projects use a plan-then-deploy model. The PLAN command compares your definition files against the current state of your account
and produces a list of proposed changes without modifying anything.

You can run a plan using the Snowflake CLI:

```bash
snow dcm plan --target PROD
```

Or using SQL:

```sqlexample
EXECUTE DCM PROJECT my_db.my_schema.my_project
  PLAN
  USING CONFIGURATION PROD
FROM
  '@my_stage/my_project/';
```

Review the output to confirm the expected creates, alters, and drops before proceeding to deploy.

## Automate deployment with CI/CD

You can integrate Snowflake into your CI/CD pipelines so that deployments are triggered automatically by events such as pull request
merges, branch pushes, or scheduled runs.

The following table maps common CI/CD pipeline jobs to the corresponding Snowflake CLI commands:

| Pipeline job | CLI command | Description |
| --- | --- | --- |
| Plan on pull request | `snow dcm plan` | Generates a plan that previews the changes that would be applied to the target environment. You can post the plan output as a PR comment for review. |
| Deploy on merge | `snow dcm deploy` | Applies the planned changes to the target environment. Typically runs after a PR is merged to the main branch. |
| Refresh dynamic tables | `snow dcm refresh` | Triggers a refresh of dynamic tables after deployment to ensure downstream data is up to date. |
| Test expectations | `snow dcm test` | Runs expectation checks defined in your DCM project to verify that the deployment produced the expected results. |

### GitHub Actions

You can use [GitHub Actions](https://docs.github.com/en/actions) to automate the jobs that constitute a CI/CD pipeline.

To authenticate securely, Snowflake recommends using workload identity federation (WIF) with OpenID Connect (OIDC) instead of static
credentials like passwords or private keys. With WIF OIDC, GitHub Actions requests a short-lived token from GitHub’s OIDC provider,
and Snowflake verifies the token directly. No long-lived secrets are stored in your repository.

To set up WIF OIDC, create a Snowflake service user that trusts GitHub’s OIDC provider:

```sqlexample
CREATE USER github_deployer
  TYPE = SERVICE
  DEFAULT_ROLE = deployer_role
  WORKLOAD_IDENTITY = (
    TYPE = OIDC
    ISSUER = 'https://token.actions.githubusercontent.com'
    SUBJECT = 'repo:<owner>/<repo>:environment:<environment_name>'
  );
```

For more information about configuring the subject claim and WIF in general, see
[Workload identity federation](../../user-guide/workload-identity-federation.md).

The following example shows a workflow that uses WIF OIDC and DCM Projects to plan and deploy changes on push to the `main` branch:

```yaml
name: Deploy DCM project to production

on:
  push:
    branches:
      - main

permissions:
  id-token: write
  contents: read

jobs:
  deploy:
    runs-on: ubuntu-latest
    environment: production

    steps:
      - name: Checkout repository
        uses: actions/checkout@v4
        with:
          persist-credentials: false

      - name: Set up Snowflake CLI
        uses: snowflakedb/snowflake-cli-action@v2
        with:
          use-oidc: true
          cli-version: "3.11"

      - name: Plan DCM project
        env:
          SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
        run: snow dcm plan --target PROD --save-output -x

      - name: Deploy DCM project
        env:
          SNOWFLAKE_ACCOUNT: ${{ secrets.SNOWFLAKE_ACCOUNT }}
        run: snow dcm deploy --target PROD -x
```

For more information about setting up CI/CD with the Snowflake CLI, including alternative authentication methods, see
[Integrating CI/CD with Snowflake CLI](../snowflake-cli/cicd/integrate-ci-cd.md).

## Manage environments

By maintaining separate environments for development, test, and production, your teams can isolate development activities from the
production environment, which reduces the chance of unintended consequences and data corruption.

### Connection profiles for environment targeting

With DCM Projects, you can define multiple deployment targets in your `manifest.yml` file. Each target maps to a specific Snowflake
account (or database), project object, owner role, and templating configuration. The same definition files can deploy to all environments
with environment-specific settings applied through configuration profiles.

```yaml
targets:
  DEV:
    account_identifier: MYORG-MYACCOUNT_DEV
    project_name: MY_DB.MY_SCHEMA.MY_PROJECT_DEV
    project_owner: DEV_DEPLOYER
    templating_config: DEV

  PROD:
    account_identifier: MYORG-MYACCOUNT_PROD
    project_name: MY_DB.MY_SCHEMA.MY_PROJECT_PROD
    project_owner: PROD_DEPLOYER
    templating_config: PROD

templating:
  configurations:
    DEV:
      wh_size: "X-SMALL"
    PROD:
      wh_size: "LARGE"
```

For enterprise patterns such as multi-project setups and team collaboration, see
[Enterprise use cases for DCM Projects](../../user-guide/dcm-projects/dcm-projects-enterprise.md).

### Advanced: Jinja parameterization for custom scripts

DCM Projects natively support Jinja2 templating in definition files. You can use template variables, loops, conditions, macros, and
dictionaries to make your definitions reusable across environments. Variable values come from configuration profiles in the
`manifest.yml` or from runtime overrides.

For details on DCM templating, see [DCM Projects files and templates](../../user-guide/dcm-projects/dcm-projects-files.md).

You can also parameterize standalone SQL scripts (outside of DCM Projects) using Jinja2 with
[EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md). The Snowflake CLI allows you to pass environment variables to Python
scripts as well.

To change a deployment target, for example, you replace the name of the target database with a Jinja variable such as
`{{ environment }}` in SQL scripts, or an environment variable in Python scripts. This technique is shown in the following SQL
and Python code examples:

SQLPython

```sqlexample
CREATE OR ALTER TASK {{ environment }}.my_schema.my_task
  WAREHOUSE = my_warehouse
  SCHEDULE = '60 minute'
  AS select pi();
```

```python
import os
from snowflake.core import Root, CreateMode
from datetime import timedelta
from snowflake.core.task import Task

my_task = Task(
    name="my_task",
    warehouse="my_warehouse",
    definition="select pi()",
    schedule=timedelta(minutes=60)
)
root = Root(Session.builder.getOrCreate())
tasks = root.databases[os.environ["environment"]].schemas["my_schema"].tasks
tasks.create(my_task, mode=CreateMode.or_replace)
```

## Getting started

To get started with DCM Projects, see [Snowflake DCM Projects](../../user-guide/dcm-projects/dcm-projects-overview.md) for a complete overview of the feature,
including how to set up your project files, configure environments, and deploy changes.

For sample projects, CI/CD templates, and quickstarts, see the
[snowflake-labs DCM repository](https://github.com/Snowflake-Labs/snowflake_dcm_projects).

To follow a step-by-step tutorial, try the
[Getting Started with Snowflake DCM Projects](https://www.snowflake.com/en/developers/guides/get-started-snowflake-dcm-projects/)
quickstart.

---
title: Distributing workloads that fetch results with the Snowflake Connector for Python
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-distributed-fetch.md
section: Developer Guide
---

# Distributing workloads that fetch results with the Snowflake Connector for Python

If you are using a distributed environment to parallelize workloads, you can use the Snowflake Connector for Python to distribute
the work of fetching and processing results.

## Introduction

After you use the `Cursor` object to execute a query, you can distribute the work of fetching the results by using
result batches. A *result batch* encapsulates a function that retrieves a subset of the results. You can assign different workers
to use different result batches to fetch and process results in parallel.

## Retrieving the list of result batches

After executing a query, you can retrieve the results in one of the following formats:

* [ResultBatch](python-connector-api.md) objects.

  To do this, call the [`get_result_batches()`](python-connector-api.md "get_result_batches") method in the [Cursor](python-connector-api.md) object.
  This returns a list of `ResultBatch` objects that you can assign to different workers for processing. For example:

  ```python
  with connect(...) as conn:
      with conn.cursor() as cur:
          # Execute a query.
          cur.execute('select seq4() as n from table(generator(rowcount => 100000));')

          # Get the list of result batches
          result_batch_list = cur.get_result_batches()

          # Get the number of result batches in the list.
          num_result_batches = len(result_batch_list)

          # Split the list of result batches into two
          # to distribute the work of fetching results
          # between two workers.
          result_batch_list_1 = result_batch_list[:: 2]
          result_batch_list_2 = result_batch_list[1 :: 2]
  ```
* PyArrow tables.

  For more information, see [PyArrow tables](https://arrow.apache.org/docs/python/data.html#tables).

  You can use the following methods to retrieve the result batches as PyArrow tables:

  + [`fetch_arrow_all()`](python-connector-api.md "fetch_arrow_all"): Call this method to return a PyArrow table containing all of the results.
  + [`fetch_arrow_batches()`](python-connector-api.md "fetch_arrow_batches"): Call this method to return an iterator that you can use to return a PyArrow table for each
    result batch.

  For example:

  ```python
  with connect(...) as conn:
      with conn.cursor() as cur:
          # Execute a query.
          cur.execute('select seq4() as n from table(generator(rowcount => 100000));')

          # Return a PyArrow table containing all of the results.
          table = cur.fetch_arrow_all()

          # Iterate over a list of PyArrow tables for result batches.
          for table_for_batch in cur.fetch_arrow_batches():
            my_pyarrow_table_processing_function(table_for_batch)
  ```
* [pandas DataFrame](python-connector-pandas.md) objects.

  If you have
  [installed the pandas-compatible version of the Snowflake Connector for Python](python-connector-pandas.md),
  you can use the following methods to retrieve the result batches as pandas DataFrame objects:

  + [`fetch_pandas_all()`](python-connector-api.md "fetch_pandas_all"): Call this method to return a pandas DataFrame containing all of the results.
  + [`fetch_pandas_batches()`](python-connector-api.md "fetch_pandas_batches"): Call this method to return an iterator that you can use to return a pandas DataFrame for each
    result batch.

  For example:

  ```python
  with connect(...) as conn:
      with conn.cursor() as cur:
          # Execute a query.
          cur.execute('select seq4() as n from table(generator(rowcount => 100000));')

          # Return a pandas DataFrame containing all of the results.
          table = cur.fetch_pandas_all()

          # Iterate over a list of pandas DataFrames for result batches.
          for dataframe_for_batch in cur.fetch_pandas_batches():
            my_dataframe_processing_function(dataframe_for_batch)
  ```

## Serializing result batches

To move the result batches to other workers or nodes, you can serialize and deserialize the result batches. For example:

```python
import pickle

# Serialize a result batch from the first list.
pickled_batch = pickle.dumps(result_batch_list_1[1])

# At this point, you can move the serialized data to
# another worker/node.
...

# Deserialize the result batch for processing.
unpickled_batch = pickle.loads(pickled_batch)
```

## Working with result batches

The next sections explain how to work with [ResultBatch](python-connector-api.md) objects:

* Iterating over rows in a result batch
* Materializing the rows in a result batch
* Getting the number of rows and size of a result batch
* Converting an arrow result batch to a PyArrow table or pandas DataFrame

### Iterating over rows in a result batch

With a `ResultBatch` object, you can iterate over the rows that are part of that batch. For example:

```python
# Iterate over the list of result batches.
for batch in result_batch_list_1:
    # Iterate over the subset of rows in a result batch.
    for row in batch:
        print(row)
```

When you create an iterator of a `ResultBatch` object, the object fetches and converts the subset of rows for that batch.

### Materializing the rows in a result batch

To materialize the subset of rows in a result batch by passing that `ResultBatch` object to the `list()` function. For
example:

```python
# Materialize the subset of results for the first result batch
# in the list.
first_result_batch = result_batch_list_1[1]
first_result_batch_data = list(first_result_batch)
```

### Getting the number of rows and size of a result batch

If you need to determine the number of rows in a result batch and the size of the data, you can use
[rowcount](python-connector-api.md),
[compressed_size](python-connector-api.md),
and [uncompressed_size](python-connector-api.md) attributes of the `ResultBatch` object.
For example:

```python
# Get the number of rows in a result batch.
num_rows = first_result_batch.rowcount

# Get the size of the data in a result batch.
compressed_size = first_result_batch.compressed_size
uncompressed_size = first_result_batch.uncompressed_size
```

Note that these attributes are available before you iterate over the result batch. You don’t need to fetch the subset of rows for
the batch in order to get the values of these attributes.

### Converting an arrow result batch to a PyArrow table or pandas DataFrame

To convert an `ArrowResultBatch` to a PyArrow table or a pandas DataFrame, use the following methods:

* [`to_pandas()`](python-connector-api.md "to_pandas"): Call this method to return a pandas DataFrame containing the rows in a `ArrowResultBatch`, if you have
  [installed the pandas-compatible version of the Snowflake Connector for Python](python-connector-pandas.md).
* [`to_arrow()`](python-connector-api.md "to_arrow"): Call this method to return a PyArrow table containing the rows in a `ResultBatch`.

For example:

```python
with conn_cnx as con:
  with con.cursor() as cur:
    cur.execute("select col1 from table")
    batches = cur.get_result_batches()

    # Get the row from the ResultBatch as a pandas DataFrame.
    dataframe = batches[0].to_pandas()

    # Get the row from the ResultBatch as a PyArrow table.
    table = batches[0].to_arrow()
```

---
title: Downloading / integrating the JDBC Driver
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-download.md
section: Developer Guide
---

# Downloading / integrating the JDBC Driver

The JDBC driver (`snowflake-jdbc`) is provided as a JAR file, available as an artifact in Maven for download or for integration directly into your Java-based projects. Additionally, you can also download the following types of drivers from Maven:

* FIPS-compliant fat jar named `snowflake-jdbc-fips`.
* Thin jar named `snowflake-jdbc-thin`.

Source code for the additional types is the same, but their dependencies’ configurations are different, as follows:

* The FIPS-compliant fat jar does not embed the FIPS Bouncy Castle dependencies, so you need to provide them during runtime.
* The thin jar embeds only some dependencies, so you need to provide the other dependencies according to the pom file declaration.

All JAR types are released together with the same version number.

Before downloading or integrating the driver, you should verify the version of the driver you are currently using. To verify your driver version, connect to Snowflake through a client application
that uses the driver and check the driver version. If the application supports executing SQL queries, you can do this by calling the [CURRENT_CLIENT](../../sql-reference/functions/current_client.md) function.

## Requirements

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

## Downloading the desired type of driver

To download the driver, follow the steps for the desired driver type.

### Download a standard driver

To download the standard driver:

1. Go to the Maven Central Repository for the standard driver.

   > <https://repo1.maven.org/maven2/net/snowflake/snowflake-jdbc>
2. Select the directory for the version that you need.

   > The most recent version (4.1.0) is not always at the end of the list. Versions are listed alphabetically,
   > not numerically. For example, 3.10.x comes after 3.1.x, not after 3.9.x.
3. Download the appropriate `snowflake-jdbc-#.#.#.jar` file:

   > > **Note:**
   > >
   > > You can verify the JDBC driver version by entering the following command:
   > >
   > > `java -jar snowflake-jdbc-#.#.#.jar --version`, where `#.#.#` matches the version numbers in the downloaded file name.
4. Optional: To verify the driver package signature, download the corresponding `snowflake-jdbc-#.#.#.jar.asc` file.

### Download a FIPS-compliant driver

To download the FIPS-compliant driver:

1. Go to the Maven Central Repository for the FIPS-compliant driver.

   > <https://repo1.maven.org/maven2/net/snowflake/snowflake-jdbc-fips>
2. Select the directory for the version that you need.

   > The most recent version (4.1.0) is not always at the end of the list. Versions are listed alphabetically,
   > not numerically. For example, 3.10.x comes after 3.1.x, not after 3.9.x.
3. Download the appropriate `snowflake-jdbc-fips-#.#.#.jar` file:

   > > **Note:**
   > >
   > > You can verify the JDBC driver version by entering the following command:
   > >
   > > `java -jar snowflake-jdbc-fips-#.#.#.jar --version`, where `#.#.#` matches the version numbers in the downloaded file name.
4. Optionally, you can verify the driver package signature, download the corresponding `snowflake-jdbc-fips-#.#.#.jar.asc` file.

### Download a thin-jar driver

To download the thin-jar driver:

1. Go to the Maven Central Repository for the thin-jar driver.

   > <https://repo1.maven.org/maven2/net/snowflake/snowflake-jdbc-thin>
2. Select the directory for the version that you need.

   > The most recent version (4.1.0) is not always at the end of the list. Versions are listed alphabetically,
   > not numerically. For example, 3.10.x comes after 3.1.x, not after 3.9.x.
3. Download the appropriate `snowflake-jdbc-thin-#.#.#.jar` file:

   > > **Note:**
   > >
   > > You can verify the JDBC driver version by entering the following command:
   > >
   > > `java -jar snowflake-jdbc-thin-#.#.#.jar --version`, where `#.#.#` matches the version numbers in the downloaded file name.
4. Optionally, you can verify the driver package signature, download the corresponding `snowflake-jdbc-thin-#.#.#.jar.asc` file.

## Optional: Verify the driver package signature

To verify the JDBC driver package signature:

1. From the public keyserver, download and import the Snowflake GPG public key for the version of the JDBC driver that you are
   using:

   * For version 3.22.0 and higher:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
     ```
   * For version 3.19.1 through 3.21.0:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 5A125630709DD64B
     ```
   * For version 3.13.23 through 3.19.0:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
     ```
   * For version 3.12.13 through 3.13.22:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 37C7086698CB005C
     ```
   * For version 3.6.26 through 3.12.12:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys EC218558EABB25A1
     ```
   * For version 3.6.25 and lower:

     ```
     $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 93DB296A69BE019A
     ```
   > **Note:**
   >
   > If this command fails with the following error:
   >
   > > ```none
   > > gpg: keyserver receive failed: Server indicated a failure
   > > ```
   >
   > then specify that you want to use port 80 for the keyserver:
   >
   > > ```bash
   > > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
   > > ```
2. Run the `gpg --verify` command to verify the signature of the package.

   For the `--verify` command-line flag, specify the `.asc` file that you
   downloaded earlier as the signature file and the JAR file as the file containing the signed
   data.

   For example:

   > ```
   > $ gpg --verify snowflake-jdbc-4.1.0.jar.asc snowflake-jdbc-4.1.0.jar
   > gpg: Signature made Wed 22 Feb 2017 04:31:58 PM UTC using RSA key ID <gpg_key_id>
   > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>"
   > ```

   Specify the correct version numbers for the JDBC driver package you are verifying. Version 4.1.0 is used in this
   example for illustration purposes only. The latest available version of the driver may be higher.

   > **Note:**
   >
   > Verifying the signature produces a warning similar to the following:
   >
   > > ```none
   > > gpg: Signature made Mon 24 Sep 2018 03:03:45 AM UTC using RSA key ID <gpg_key_id>
   > > gpg: Good signature from "Snowflake Computing <snowflake_gpg@snowflake.net>" unknown
   > > gpg: WARNING: This key is not certified with a trusted signature!
   > > gpg: There is no indication that the signature belongs to the owner.
   > > ```
   >
   > To avoid the warning, you can grant the Snowflake GPG public key implicit trust.
3. Your local environment can contain multiple GPG keys; however, for security reasons, Snowflake periodically rotates the public GPG key. As a best practice, we recommend deleting the existing public key
   after confirming that the latest key works with the latest signed package:

   > ```bash
   > $ gpg --delete-key "Snowflake Computing"
   > ```

## Integrate the driver into a maven project

To integrate the driver into a Maven project:

* Add the driver as a dependency to your `pom.xml` file.

For example:

* Standard driver

  ```
  <dependencies>
    ...
    <dependency>
      <groupId>net.snowflake</groupId>
      <artifactId>snowflake-jdbc</artifactId>
      <version>4.1.0</version>
    </dependency>
    ...
  </dependencies>
  ```
* FIPS-compliant driver

  ```
  <dependencies>
    ...
    <dependency>
      <groupId>net.snowflake</groupId>
      <artifactId>snowflake-jdbc-fips</artifactId>
      <version>4.1.0</version>
    </dependency>
    ...
  </dependencies>
  ```
* Thin-jar driver

  ```
  <dependencies>
    ...
    <dependency>
      <groupId>net.snowflake</groupId>
      <artifactId>snowflake-jdbc-thin</artifactId>
      <version>4.1.0</version>
    </dependency>
    ...
  </dependencies>
  ```

where the `<version>` tag specifies the version of the driver you want to integrate. Note that version 4.1.0 is used in this example for illustration purposes only. A later version of the driver might be available.

The developer notes are hosted along with the source code on [GitHub](https://github.com/snowflakedb/snowflake-jdbc).

## Add the JNA classes to your classpath

[Connection caching for browser-based SSO](../../user-guide/admin-security-fed-auth-use.md) and
[token caching for multi-factor authentication (MFA)](../../user-guide/security-mfa.md) require the use of the
[Java Native Access (JNA) classes](https://github.com/java-native-access/jna/blob/master/README.md) to save data securely to the
filesystem.

As of version 3.12.18 of the JDBC Driver, the JNA classes are no longer packaged in the JDBC Driver JAR file. In the JDBC Driver
`pom.xml` file, the dependencies on these classes are marked as optional.

If you need to use connection caching or token caching, you must add the following libraries to your classpath:

* For Mac and Linux:

  + [JNA](https://mvnrepository.com/artifact/net.java.dev.jna/jna)
* For Windows:

  + [JNA](https://mvnrepository.com/artifact/net.java.dev.jna/jna)
  + [JNA Platform](https://mvnrepository.com/artifact/net.java.dev.jna/jna-platform)

The `pom.xml` file for the JDBC Driver specifies the version of the JNA classes that have been tested with the JDBC Driver. Snowflake recommends using this version (or the same major version) of
the JNA classes.

For more information, see [JDBC Driver page in the Maven Central Repository](https://central.sonatype.com/search?q=g%3Anet.snowflake%20snowflake-jdbc).

> **Note:**
>
> For systems that use the aarch64 architecture (e.g. the Apple M1 chip), use version 5.7.0 or later of the JNA libraries.
> (JNA versions prior to v5.7.0 are not compatible with Windows and macOS systems that run on aarch64.)

---
title: Downloading the ODBC Driver
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-download.md
section: Developer Guide
---

# Downloading the ODBC Driver

Snowflake provides an installer for the ODBC driver.

> **Note:**
>
> If you plan to use `yum` to download and install the ODBC driver for Linux, skip ahead to
> [Using yum to download and install the driver](odbc-linux.md).

To download the installer:

1. Review the [license agreement](https://sfc-repo.snowflakecomputing.com/odbc/Snowflake_ODBC_Driver_License_Agreement.pdf).
2. If you are already using the ODBC driver and need to download an updated version, check the version that you are using, and
   review the changes between your version and the updated version in the [ODBC Driver release notes](../../release-notes/clients-drivers/odbc.md)

   To find the version of the driver that you are using, call the [CURRENT_CLIENT](../../sql-reference/functions/current_client.md) SQL function from
   an application using the driver. You can also [verify the driver version](../../user-guide/snowflake-client-version-check.md) by
   examining queries executed by the driver in the QUERY_HISTORY view.
3. Go to the [ODBC Download](https://developers.snowflake.com/odbc/) page, and download the installer.

   > **Note:**
   >
   > The Linux installation package is provided in three variations:
   >
   > * TGZ (TAR file compressed using .GZIP)
   > * RPM
   > * DEB
   >
   > The TGZ package requires some manual configuration tasks. The RPM and DEB packages include an automated installer and support
   > validation using the public GPG key provided by Snowflake.
4. See the following topics to install and configure the driver:

   * [Installing and configuring the ODBC Driver for Windows](odbc-windows.md)
   * [Installing and configuring the ODBC Driver for macOS](odbc-mac.md)
   * [Installing and configuring the ODBC Driver for Linux](odbc-linux.md)
   * [ODBC configuration and connection parameters](odbc-parameters.md)

---
title: Drivers
source: https://docs.snowflake.com/en/developer-guide/drivers.md
section: Developer Guide
---

# Drivers

Using languages such as Go, C#, JavaScript, and Python, you can write applications that perform operations on Snowflake. Use the drivers described in
this section to access Snowflake from applications written in the driver’s supported language.

[Go Snowflake Driver](golang/go-driver.md)
:   Connect to Snowflake and perform all standard operations with an interface for developing applications using the Go programming language.

[JDBC Driver](jdbc/jdbc.md)
:   Connect to Snowflake from most client tools/applications that support JDBC.

[.NET Driver](dotnet/dotnet-driver.md)
:   Connect to Snowflake with an interface to the Microsoft .NET open source software framework for developing applications.

[Node.js Driver](node-js/nodejs-driver.md)
:   Connect to Snowflake with a native asynchronous Node.js interface.

[ODBC Driver](odbc/odbc.md)
:   Connect to Snowflake using ODBC-based client applications.

[PHP PDO Driver for Snowflake](php-pdo/php-pdo-driver.md)
:   Connect to Snowflake and perform all standard operations with an interface for developing PHP applications.

[Snowflake Connector for Python](python-connector/python-connector.md)
:   Develop Python applications that can connect to Snowflake and perform all standard operations.

## Transport Layer Security (TLS) support

All Snowflake drivers support TLS to secure communications between the client and the Snowflake service. TLS 1.3 or later is supported for all drivers, except as noted in the following table.

Snowflake Driver TLS Support

| Driver | TLS 1.2 | TLS 1.3 | Notes |
| --- | --- | --- | --- |
| Go Snowflake Driver | ✔ | ✔ |  |
| JDBC Driver | ✔ | ✔ |  |
| .NET Driver | ✔ |  | * MacOS currently does not support TLS 1.3, but will once .NET 10 is released. * Windows supports TLS 1.3 for .NET 3.0 and .NET Framework 4.8 and later versions. |
| Node.js Driver | ✔ | ✔ |  |
| ODBC Driver | ✔ | ✔ |  |
| PHP PDO Driver for Snowflake | ✔ | ✔ |  |
| Snowflake Connector for Python | ✔ | ✔ |  |

---
title: Editing Notebooks in Declarative Shared Native Applications
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/live-editing.md
section: Developer Guide
---

# Editing Notebooks in Declarative Shared Native Applications

Declarative Native Apps can include [notebooks](../../user-guide/ui-snowsight/notebooks.md) to query, visualize, and explore the data. This topic describes how to use **Notebook Live Editing** to streamline the development and testing of notebooks within an application.

Using the **Notebook Live Editing** feature, you can speed up the development process by editing and testing notebooks directly within an application. This saves you from having to develop notebooks externally, or from having to rebuild the application package for every change.

**Notebook Live Editing** uses a **Development Mode** that lets you make changes to notebooks “live” inside an application instance. Your edits are saved to a dedicated live version of the application package, allowing for rapid, on-the-fly
testing and iteration.

## How It Works

The workflow uses a live version of your application package, which functions as a development sandbox. This tutorial describes how to set up
and use the Notebook Live Editing feature.

### Step 1: Set Up the Development Environment

To begin, you need a package that contains the following:

* A manifest file that defines the application and its components.
* A notebook that you can edit and test.

You then create an application instance from the live version of your package. The live version is created automatically when you create your application.

1. Build the package.

   ```sqlexample
   ALTER APPLICATION PACKAGE pkg_name BUILD;
   ```
2. Create an application instance from the live version. Notebooks in this new application will automatically be in **Development
   Mode**, allowing live editing. Prior to this step, notebooks in the application are in **Read-only mode**.

   ```sqlexample
   CREATE APPLICATION live_app_name
     FROM APPLICATION PACKAGE pkg_name
     USING VERSION LIVE;
   ```

### Step 2: Live Edit and Test Notebooks

With your `live_app_name` application running, in SnowSight, open your app from the list of apps in your account, and open one of its notebooks from its listing page. After creating the application from the application package in the previous step, the applications’ notebooks will be in **Developer mode**.You can now do the following:

* Edit notebook cells directly in the browser.
* Run and test your code immediately within the context of the application.

Any changes you make are instantly saved to the live version of the `pkg_name` application package. This allows you to iterate changes
to your application quickly, without needing to perform a full package build for each minor adjustment.

### Step 3: Finalize and Release Changes

Once you’re happy with the state of your notebooks, you can promote the live version to a stable release. This freezes the current state of
your notebooks and makes them part of a permanent application version. The app framework automatically creates a version number for your
release.

* Release the live version to finalize your work.

```sqlexample
ALTER APPLICATION PACKAGE pkg_name RELEASE LIVE VERSION;
```

This command creates a new, immutable version of your application package containing all the notebook changes you made. For more information
about the application package and the live version, see [Application Packages in Declarative Sharing in the Native Application Framework](package.md).

---
title: Emitting metrics data from handler code
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/metrics-handler.md
section: Developer Guide
---

# Emitting metrics data from handler code

To have your procedure or UDF emit metrics data, you don’t need to add any code to your handler. Snowflake generates the data collected in an
event table.

## How metrics are measured

Due to the way the Java and Python execution environments differ, the metrics data collected also differs. For reference information
about the data collected, see [RECORD column reference](event-table-columns.md).

The following describes how the captured data corresponds to the execution environment.

Java:
:   JVM (Java Virtual Machine) CPU and memory metrics are reported for each query ID.

    Each stored procedure is allocated its own JVM. The following describes the metric data collected:

    * `process.memory.usage`: Amount of memory, in bytes, consumed by the JVM executing the stored procedure handler.
    * `process.cpu.utilization`: Total CPU time divided by the wall-clock time per logical CPU, measured as a percentage where
      1.0 indicates 100 percent utilization. Total CPU time is the total time spent on non-idle tasks.

    Each Java and Scala UDF called in a query shares a single JVM. Metric values are aggregated across each Java or Scala function in the query. The
    following describes the metric data collected:

    * `process.memory.usage`: Memory use, shown as the sum of all the associated Java functions called in the query.
    * `process.cpu.utilization`: CPU use, shown as the average of all the Java and Scala functions called in the query.

Python:
:   CPU and memory metrics are reported for each Python function or procedure.

    Each stored procedure executes on only one Python process. The following describes the metric data collected:

    * `process.memory.usage`: Amount of memory, in bytes, consumed by the Python process executing the stored procedure handler.
    * `process.cpu.utilization`: Total CPU time divided by the wall-clock time per logical CPU, measured as a percentage where 1.0
      indicates 100 percent use. Total CPU time is the total time spent on non-idle tasks.

    Each UDF can be executed on multiple Python execution processes. Values are aggregated across multiple processes. The following describes
    the metric data collected:

    * `process.memory.usage`: Memory use, shown as the sum of all the associated Python processes of that UDF.
    * `process.cpu.utilization`: Reported CPU, shown as the average of all the associated Python processes of that UDF.

## Python example

Use the following steps to generate metrics example data.

1. Set the metrics level of your session. The `METRIC_LEVEL` parameter controls whether to emit auto-instrumented resource
   metrics data points to the event table. You can set the parameter to `NONE` or `ALL`, and set it on the object and
   session level. For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

   ```sqlexample
   ALTER SESSION SET METRIC_LEVEL = ALL;
   ```
2. Create a stored procedure.

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE DEMO_SP(n_queries number)
   RETURNS VARCHAR
   LANGUAGE PYTHON
   RUNTIME_VERSION = '3.10'
   PACKAGES = ('snowflake-snowpark-python')
   HANDLER = 'my_handler'
   AS $$
   import time
   def my_handler(session, n_queries):
     import snowflake.snowpark
     from snowflake.snowpark.functions import col, udf
     from snowflake import telemetry

     session.sql('create or replace stage udf_stage;').collect()

     @udf(name='example_udf', is_permanent=True, stage_location='@udf_stage', replace=True)
     def example_udf(x: int) -> int:
       # This UDF will consume 1GB of memory to illustrate the memory consumption metric
       one_gb_list = [0] * (1024**3 // 8)
       return x

     pandas_grouped_df = session.table('snowflake.account_usage.query_history').select(
       col('total_elapsed_time'),
       col('rows_written_to_result'),
       col('database_name'),
       example_udf(col('bytes_scanned'))
     ).limit(n_queries)\
     .to_pandas()\
     .groupby('DATABASE_NAME')

     mean_time = pandas_grouped_df['TOTAL_ELAPSED_TIME'].mean()
     mean_rows_written = pandas_grouped_df['ROWS_WRITTEN_TO_RESULT'].mean()

     return f"""
     {mean_time}
     {mean_rows_written}
     """
   $$;
   ```
3. Run the stored procedure

   ```sqlexample
   CALL DEMO_SP(100);
   ```
4. When the query is completed, view metrics data as described in [Viewing metrics data](metrics-viewing-data.md).

---
title: Emitting trace events in Java
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-java.md
section: Developer Guide
---

# Emitting trace events in Java

You can use the `com.snowflake.telemetry.Telemetry` class in the [Snowflake Telemetry API](https://javadoc.io/doc/com.snowflake/telemetry/latest/index.html) library to emit trace events from a function or
procedure handler written in Java. The `Telemetry` class is included with Snowflake.

> **Note:**
>
> Using the Snowflake Telemetry Library adds other libraries to your function or procedure’s execution environment. For more information,
> see [Snowflake telemetry package dependencies](telemetry-package-dependencies.md).

For information on including the Telemetry library when packaging your code with Maven, see
[Setting up your Java and Scala environment to use the Telemetry class](telemetry-build-maven.md).

You can access stored trace event data by executing a SELECT command on the event table. For more information, see
[Viewing trace data](tracing-accessing-events.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the trace level set so that the data you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Adding support for the telemetry API

To use `Telemetry` methods, you must make the open source [Snowflake telemetry package](https://central.sonatype.com/artifact/com.snowflake/telemetry) available to your handler code.

* In the PACKAGES clause in your CREATE PROCEDURE or CREATE FUNCTION statement, include the `com.snowflake:telemetry` package. The
  PACKAGES clause makes the included Snowflake Telemetry API available to your code.

  Code in the following example uses the PACKAGES clause to reference the Telemetry library as well as the Snowpark library (which is
  required for stored procedures written in Java – for more information, see [Writing Java handlers for stored procedures created with SQL](../stored-procedure/java/procedure-java-overview.md)).

  ```sqlexample
  CREATE OR REPLACE PROCEDURE myproc(...)
    RETURNS ...
    LANGUAGE JAVA
    ...
    PACKAGES = ('com.snowflake:snowpark:latest', 'com.snowflake:telemetry:latest')
    ...
  ```
* Import the `com.snowflake.telemetry` package in your Java handler code.

  ```java
  import com.snowflake.telemetry.Telemetry;
  ```

## Adding trace events

You can add trace events by calling the `Telemetry.addEvent` method, passing a name for the event. You can also optionally associate
attributes – key-value pairs – with an event.

The `addEvent` method has the following signatures:

```java
public static void addEvent(String name)
public static void addEvent(String name, Attributes attributes)
```

Code in the following example adds an event called `testEvent`, associating with the event two attributes: `key` and `result`.

```java
// Adding an event without attributes.
Telemetry.addEvent("testEvent");

// Adding an event with attributes.
Attributes eventAttributes = Attributes.of(
  AttributeKey.stringKey("key"), "run",
  AttributeKey.longKey("result"), Long.valueOf(123));
Telemetry.addEvent("testEventWithAttributes", eventAttributes);
```

Adding these events results in two rows in the event table, each with a different value in the RECORD column:

```sqljson
{
  "name": "testEvent"
}
```

```sqljson
{
  "name": "testEventWithAttributes"
}
```

The `testEventWithAttributes` event row includes the following attributes in the row’s RECORD_ATTRIBUTES column:

```sqljson
{
  "key": "run",
  "result": 123
}
```

## Adding span attributes

You can set attributes – key-value pairs – associated with spans by calling the `Telemetry.setSpanAttribute` method.

The `setSpanAttribute` method has the following signatures:

```java
public static void setSpanAttribute(String key, boolean value)
public static void setSpanAttribute(String key, long value)
public static void setSpanAttribute(String key, double value)
public static void setSpanAttribute(String key, String value)
```

For details on spans, see [How Snowflake represents trace events](tracing-how-events-work.md).

Code in the following example creates four attributes and sets their values:

```java
// Setting span attributes.
Telemetry.setSpanAttribute("example.boolean", true);
Telemetry.setSpanAttribute("example.long", 2L);
Telemetry.setSpanAttribute("example.double", 2.5);
Telemetry.setSpanAttribute("example.string", "testAttribute");
```

Setting these attributes results in the following in the event table’s RECORD_ATTRIBUTES column:

```sqljson
{
  "example.boolean": true,
  "example.long": 2,
  "example.double": 2.5,
  "example.string": "testAttribute"
}
```

## Adding custom spans

You can add custom spans that are separate from the default span created by Snowflake. For details on custom spans, see
[Adding custom spans to a trace](tracing-custom-spans.md).

Code in the following example uses the [OpenTelemetry API](https://javadoc.io/doc/io.opentelemetry/opentelemetry-api/latest/index.html) and [OpenTelemetry context propagation API](https://www.javadoc.io/doc/io.opentelemetry/opentelemetry-context-prop/latest/index.html) to create a new `my.span`
span. It then adds an event with attributes to the new span. Finally, the code ends the span to have the span’s event data
captured in the event table. If the code doesn’t call the `Span.end` method, data is not captured in the event table.

```sqlexample-java
CREATE OR REPLACE FUNCTION customSpansJavaExample() RETURNS STRING
  LANGUAGE JAVA
  PACKAGES = ('com.snowflake:telemetry:latest')
  HANDLER = 'MyJavaClass.run'
  as
  $$
  import com.snowflake.telemetry.Telemetry;
  import io.opentelemetry.api.common.AttributeKey;
  import io.opentelemetry.api.common.Attributes;
  import io.opentelemetry.api.GlobalOpenTelemetry;
  import io.opentelemetry.api.trace.Tracer;
  import io.opentelemetry.api.trace.Span;
  import io.opentelemetry.context.Scope;

  class MyJavaClass {
    public static String run() {
      Tracer tracer = GlobalOpenTelemetry.getTracerProvider().get("my.tracer");
      Span span = tracer.spanBuilder("my.span").startSpan();
      try (Scope scope = span.makeCurrent()) {
        // Do processing, adding events that will be captured in a my.span.

        // Add an event with attributes.
        Attributes eventAttributes = Attributes.of(
          AttributeKey.stringKey("key"), "run",
          AttributeKey.longKey("result"), Long.valueOf(123));

        span.addEvent("testEventWithAttributes", eventAttributes);
        span.setAttribute("testAttribute", "value");

      } finally {
        span.end();
      }
      return "success";
    }
  }
  $$;
```

## Java examples

### Stored procedure example

```sqlexample-java
CREATE OR REPLACE PROCEDURE do_tracing()
  RETURNS STRING
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES=('com.snowflake:snowpark:latest', 'com.snowflake:telemetry:latest')
  HANDLER = 'ProcedureHandler.run'
  AS
  $$
  import com.snowflake.snowpark_java.Session;
  import com.snowflake.telemetry.Telemetry;
  import io.opentelemetry.api.common.AttributeKey;
  import io.opentelemetry.api.common.Attributes;

  public class ProcedureHandler {

    public String run(Session session) {

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "begin");

      // Add an event without attributes.
      Telemetry.addEvent("run_method_start");

      // Add an event with attributes.
      Attributes eventAttributes = Attributes.of(
        AttributeKey.stringKey("example.method.name"), "run",
        AttributeKey.longKey("example.long"), Long.valueOf(123));
      Telemetry.addEvent("event_with_attributes", eventAttributes);

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "complete");

      return "SUCCESS";
    }
  }
  $$;
```

### UDF example

```sqlexample-java
CREATE OR REPLACE FUNCTION add_two_numbers(A FLOAT, B FLOAT) RETURNS FLOAT
  LANGUAGE JAVA
  PACKAGES=('com.snowflake:telemetry:latest')
  HANDLER = 'ScalarFunctionHandler.run'
  AS
  $$
  import com.snowflake.telemetry.Telemetry;
  import io.opentelemetry.api.common.AttributeKey;
  import io.opentelemetry.api.common.Attributes;
  import io.opentelemetry.api.common.AttributesBuilder;

  class ScalarFunctionHandler {

    public static Double run(Double d0, Double d1) {

      // Set span attribute.
      Telemetry.setSpanAttribute("example.func.add_two_numbers", "begin");

      // Add an event without attributes.
      Telemetry.addEvent("run_method_start");

      // Add an event with attributes.
      Attributes eventAttributes = Attributes.of(
        AttributeKey.stringKey("example.method.name"), "run",
        AttributeKey.longKey("example.long"), Long.valueOf(123));
      Telemetry.addEvent("event_with_attributes", eventAttributes);

      Double response = d0 == null || d1 == null ? null : (d0 + d1);

      // Set span attribute.
      Telemetry.setSpanAttribute("example.func.add_two_numbers.response", response);
      Telemetry.setSpanAttribute("example.func.add_two_numbers", "complete");

      return response;
    }
  }
  $$;
```

### UDTF example

```sqlexample-java
CREATE OR REPLACE FUNCTION digits_of_number(x int)
  RETURNS TABLE(result INT)
  LANGUAGE JAVA
  PACKAGES = ('com.snowflake:telemetry:latest')
  HANDLER = 'TableFunctionHandler'
  AS
  $$
  import com.snowflake.telemetry.Telemetry;
  import io.opentelemetry.api.common.AttributeKey;
  import io.opentelemetry.api.common.Attributes;
  import io.opentelemetry.api.common.AttributesBuilder;
  import java.util.stream.Stream;

  public class TableFunctionHandler {

    public TableFunctionHandler() {
      // Set span attribute.
      Telemetry.setSpanAttribute("example.func.digits_of_number", "begin");
    }

    static class OutputRow {
      public int result;

      public OutputRow(int result) {
        this.result = result;
      }
    }

    public static Class getOutputClass() {
      return OutputRow.class;
    }

    public Stream<OutputRow> process(int input) {

      // Add an event with attributes.
      Attributes eventAttributes = Attributes.of(
        AttributeKey.longKey("example.received.value"), Long.valueOf(input));
      Telemetry.addEvent("digits_of_number", eventAttributes);

      Stream.Builder<OutputRow> stream = Stream.builder();
      while (input > 0) {

        stream.add(new OutputRow(input %10));
        input /= 10;
      }

      // Set span attribute.
      Telemetry.setSpanAttribute("example.func.digits_of_number", "complete");

      return stream.build();
    }
  }
  $$;
```

---
title: Emitting trace events in JavaScript
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-javascript.md
section: Developer Guide
---

# Emitting trace events in JavaScript

You can use the `snowflake` class in the [Snowflake JavaScript API](../stored-procedure/stored-procedures-api.md) to emit trace events
from a function or procedure handler written in JavaScript. The JavaScript API is already available to your JavaScript handler code.

Before emitting trace events, be sure you have the trace level set so that the data you want are stored in the event table. For more
information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

> **Note:**
>
> Before you can begin emitting trace events, you must set up an event table. For more information, see
> [Event table overview](event-table-setting-up.md).

You can access stored trace event data by executing a SELECT command on the event table. For more information, see
[Viewing trace data](tracing-accessing-events.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Trace events for functions and procedures](tracing.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

## Adding trace events

You can add trace events by calling the `snowflake.addEvent` function, passing a name for the event. You can also optionally associate
attributes – key-value pairs – with an event.

The `addEvent` method is available in the following form:

```javascript
snowflake.addEvent(name [, { key:value [, key:value] } ] );
```

Handler code in the following example adds two events, `name_a` and `name_b`. With `name_b`, the code also adds two
attributes, `score` and `pass`.

```sqlexample-javascript
create procedure PI_JS()
  returns double
  language javascript
  as
  $$
    snowflake.addEvent('name_a');  // add an event without attributes
    snowflake.addEvent('name_b', {'score': 89, 'pass': true});
    return 3.14;
  $$
  ;
```

Setting these attributes results in two rows in the event table, each with a different value in the RECORD column:

```json
{
  "name": "name_a"
}
```

```json
{
  "name": "name_b"
}
```

The `name_b` event row includes the following attributes in the row’s RECORD_ATTRIBUTES column:

```json
{
  "score": 89,
  "pass": true
}
```

## Adding span attributes

You can set attributes – key-value pairs – associated with spans by calling the `snowflake.setSpanAttribute` function.

The `setSpanAttribute` function is available in the following form:

```javascript
snowflake.setSpanAttribute(key, value);
```

For details on spans, see [How Snowflake represents trace events](tracing-how-events-work.md).

Code in the following example creates four attributes and sets their values:

```javascript
// Setting span attributes.
snowflake.setSpanAttribute("example.boolean", true);
snowflake.setSpanAttribute("example.long", 2L);
snowflake.setSpanAttribute("example.double", 2.5);
snowflake.setSpanAttribute("example.string", "testAttribute");
```

Setting these attributes results in the following in the event table’s RECORD_ATTRIBUTES column:

```json
{
  "example.boolean": true,
  "example.long": 2,
  "example.double": 2.5,
  "example.string": "testAttribute"
}
```

## Adding custom spans

You can add custom spans that are separate from the default span created by Snowflake. For details on custom spans, see
[Adding custom spans to a trace](tracing-custom-spans.md).

Code in the following example uses the [OpenTelemetry API](https://open-telemetry.github.io/opentelemetry-js/modules/_opentelemetry_api.html) to create a new
`example_custom_span` span. It then adds an event and attribute to the new span. Finally, the code ends the span to have the span’s
event data captured in the event table. If the code doesn’t call the `Span.end` method, data is not captured in the event table.

```sqlexample-javascript
CREATE OR REPLACE FUNCTION javascript_custom_span()
RETURNS STRING
LANGUAGE JAVASCRIPT
AS
$$
const { trace } = opentelemetry;
const tracer = trace.getTracer("example_tracer");
// Alternatively, const tracer = opentelemetry.trace.getTracer("example_tracer");

tracer.startActiveSpan("example_custom_span", (span) => {
  span.addEvent("testEventWithAttributes");
  span.setAttribute("testAttribute", "value");

  span.end();
});
$$;
```

---
title: Emitting trace events in Python
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-python.md
section: Developer Guide
---

# Emitting trace events in Python

You can use the Snowflake `telemetry` package to emit trace events from a function or procedure handler written in Python.
The package is available from [the Anaconda Snowflake channel](https://repo.anaconda.com/pkgs/snowflake).

You can access stored trace event data by executing a SELECT command on the event table. For more information, see
[Viewing trace data](tracing-accessing-events.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the trace level set so that the data you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

## Adding support for the telemetry package

To use telemetry package, you must make the open source
[Snowflake telemetry package](https://github.com/snowflakedb/snowflake-telemetry-python), which is included with Snowflake, available
to your handler code. The package is available from [the Anaconda Snowflake channel](https://repo.anaconda.com/pkgs/snowflake).

By default, the telemetry package is included when you create a Python handler for a stored procedure or function. However, if you
specify a package policy to allow or disallow specific packages explicitly, Snowflake doesn’t automatically include the
`snowflake-telemetry-python` package. In this case, you must specify the package in the PACKAGES clause.

* **For a Streamlit app.** You can add the `snowflake-telemetry-python` package to your app by using Snowsight or an
  `environment.yml.` file.

  Code in the following example uses the PACKAGES clause to reference the telemetry package as well as the Snowpark library (which is
  required for stored procedures written in Python – for more information, see [Writing stored procedures with SQL and Python](../stored-procedure/python/procedure-python-overview.md)).

  ```sqlexample
  CREATE OR REPLACE FUNCTION my_function(...)
    RETURNS ...
    LANGUAGE PYTHON
    ...
    PACKAGES = ('snowflake-telemetry-python')
    ...
  ```
* Import the `telemetry` package in your code.

  ```python
  from snowflake import telemetry
  ```

## Adding trace events

You can add trace events by calling the `telemetry.add_event` method, passing a name for the event. You can also optionally associate
attributes – key-value pairs – with an event.

The `add_event` method is available in the following form:

```python
telemetry.add_event(<name>, <attributes>)
```

where

* `name` is a Python string that specifies the name of the trace event.
* `attributes` is an [OpenTelemetry Attributes object](https://github.com/open-telemetry/opentelemetry-python/blob/main/opentelemetry-api/src/opentelemetry/util/types.py) that specifies the attributes for this trace event. This argument is
  optional. Omit the argument if you do not have any attributes to specify for this trace event.

Handler code in the following example adds two events, `FunctionEmptyEvent` and `FunctionEventWithAttributes`. With
`FunctionEventWithAttributes`, the code also adds two attributes: `key1` and `key2`.

```python
telemetry.add_event("FunctionEmptyEvent")
telemetry.add_event("FunctionEventWithAttributes", {"key1": "value1", "key2": "value2"})
```

Adding these events results in two rows in the event table, each with a different value in the RECORD column:

```json
{
  "name": "FunctionEmptyEvent"
}
```

```json
{
  "name": "FunctionEventWithAttributes"
}
```

The `FunctionEventWithAttributes` event row includes the following attributes in the row’s RECORD_ATTRIBUTES column:

```json
{
  "key1": "value1",
  "key2": "value2"
}
```

## Adding span attributes

You can set attributes – key-value pairs – associated with spans by calling the `telemetry.set_span_attribute` method.

For details on spans, see [How Snowflake represents trace events](tracing-how-events-work.md).

The `set_span_attribute` method is available in the following form:

```python
telemetry.set_span_attribute(<key>, <value>)
```

where:

> * `key` is a Python string that specifies the key for an attribute.
> * `value` is an [OpenTelemetry AttributeValue object](https://github.com/open-telemetry/opentelemetry-python/blob/main/opentelemetry-api/src/opentelemetry/util/types.py) that specifies the value of the attribute.

Code in the following example creates four attributes and sets their values:

```python
// Setting span attributes.
telemetry.set_span_attribute("example.boolean", true);
telemetry.set_span_attribute("example.long", 2);
telemetry.set_span_attribute("example.double", 2.5);
telemetry.set_span_attribute("example.string", "testAttribute");
```

Setting these attributes results in the following in the event table’s RECORD_ATTRIBUTES column:

```json
{
  "example.boolean": true,
  "example.long": 2,
  "example.double": 2.5,
  "example.string": "testAttribute"
}
```

## Adding custom spans

You can add custom spans that are separate from the default span created by Snowflake. For details on custom spans, see
[Adding custom spans to a trace](tracing-custom-spans.md).

Code in the following example uses the [OpenTelemetry Python API](https://opentelemetry-python.readthedocs.io/en/latest/api/index.html) to create the `my.span` span as the current span with
`start_as_current_span`. It then adds an event with attributes to the new span using the [OpenTelemetry Python API](https://opentelemetry-python.readthedocs.io/en/latest/api/index.html).

Event data won’t be captured by the event table unless the span ends before your handler completes execution. In this example, closing the
span happens automatically when the `with` statement concludes.

```sqlexample-python
CREATE OR REPLACE FUNCTION customSpansPythonExample() RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('opentelemetry-api')
  HANDLER = 'custom_spans_function'
  AS $$
  from snowflake import telemetry
  from opentelemetry import trace

  def custom_spans_function():
    tracer = trace.get_tracer("my.tracer")
    with tracer.start_as_current_span("my.span") as span:
      span.add_event("Event2 in custom span", {"key1": "value1", "key2": "value2"})

    return "success"
  $$;
```

## Python examples

The following sections provide examples of adding support for trace events from Python code.

### Stored procedure example

```sqlexample-python
CREATE OR REPLACE PROCEDURE do_tracing()
  RETURNS VARIANT
  LANGUAGE PYTHON
  PACKAGES = ('snowflake-snowpark-python', 'snowflake-telemetry-python')
  RUNTIME_VERSION = 3.12
  HANDLER = 'run'
  AS $$
  from snowflake import telemetry
  def run(session):
    telemetry.set_span_attribute("example.proc.do_tracing", "begin")
    telemetry.add_event("event_with_attributes", {"example.key1": "value1", "example.key2": "value2"})
    return "SUCCESS"
  $$;
```

### Streamlit example

```python
import streamlit as st
from snowflake import telemetry

st.title("Streamlit trace event example")

hifives_val = st.slider("Number of high-fives", min_value=0, max_value=90, value=60)

if st.button("Submit"):
    telemetry.add_event("new_submission", {"high_fives": hifives_val})
```

### UDF example

```sqlexample-python
CREATE OR REPLACE FUNCTION times_two(x NUMBER)
  RETURNS NUMBER
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'times_two'
AS $$
from snowflake import telemetry
def times_two(x):
  telemetry.set_span_attribute("example.func.times_two", "begin")
  telemetry.add_event("event_without_attributes")
  telemetry.add_event("event_with_attributes", {"example.key1": "value1", "example.key2": "value2"})

  response = 2 * x

  telemetry.set_span_attribute("example.func.times_two.response", response)

  return response
$$;
```

When you call the trace event API from a Python function that processes an input row, the API will be called *for every row* processed
by the UDF.

For example, the following statement calls the Python function defined in the previous example for 50 rows, resulting in 100 trace events
(two for each row):

```sqlexample
select count(times_two(seq8())) from table(generator(rowcount => 50));
```

### UDTF example

```sqlexample-python
CREATE OR REPLACE FUNCTION digits_of_number(input NUMBER)
  RETURNS TABLE(result NUMBER)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'TableFunctionHandler'
  AS
$$
from snowflake import telemetry

class TableFunctionHandler:

  def __init__(self):
    telemetry.add_event("test_udtf_init")

  def process(self, input):
    telemetry.add_event("test_udtf_process", {"input": str(input)})
    response = input

    while input > 0:
      response = input % 10
      input /= 10
      yield (response,)

  def end_partition(self):
    telemetry.add_event("test_udtf_end_partition")
$$;
```

When you call the trace event API in the `process()` method of a UDTF handler class, the API will be called *for every row* processed.

For example, the following statement calls the `process()` method defined in the previous example for 50 rows, resulting in 100 trace
events (two for each row) added by the `process()` method:

```sqlexample
select * from table(generator(rowcount => 50)), table(digits_of_number(seq8())) order by 1;
```

---
title: Emitting trace events in Scala
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-scala.md
section: Developer Guide
---

# Emitting trace events in Scala

You can use the `com.snowflake.telemetry.Telemetry` class in the [Snowflake Telemetry API](https://javadoc.io/doc/com.snowflake/telemetry/latest/index.html) library to emit trace events from a function or
procedure handler written in Scala. The `Telemetry` class is included with Snowflake.

Snowflake currently supports the following versions of Scala:

[Preview Feature](../../release-notes/preview-features.md) — Open

Support for version 2.13 is in preview. Available to all accounts.

* 2.13
* 2.12

For more information, see [Writing code to support different Scala versions](../scala-version-differences.md).

> **Note:**
>
> Using the Snowflake Telemetry Library adds other libraries to your function or procedure’s execution environment. For more information,
> see [Snowflake telemetry package dependencies](telemetry-package-dependencies.md).

For information on including the Telemetry library when packaging your code with Maven, see
[Setting up your Java and Scala environment to use the Telemetry class](telemetry-build-maven.md).

You can access stored trace event data by executing a SELECT command on the event table. For more information, see
[Viewing trace data](tracing-accessing-events.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Trace events for functions and procedures](tracing.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the trace level set so that the data you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Adding support for the Telemetry API

To use `Telemetry` methods, you must make the open source [Snowflake telemetry package](https://central.sonatype.com/artifact/com.snowflake/telemetry) available to your handler code.

* In the PACKAGES clause in your CREATE PROCEDURE or CREATE FUNCTION statement, include the `com.snowflake:telemetry` package. The
  PACKAGES clause makes the included Snowflake Telemetry API available to your code.

  Code in the following example uses the PACKAGES clause to reference the Telemetry library as well as the Snowpark library (which is
  required for stored procedures written in Scala – for more information, see [Writing Scala handlers for stored procedures created with SQL](../stored-procedure/scala/procedure-scala-overview.md)).

  Scala 2.12Scala 2.13 (Preview)

  ```sqlexample
  CREATE OR REPLACE PROCEDURE myproc(...)
    RETURNS ...
    LANGUAGE SCALA
    ...
    PACKAGES = ('com.snowflake:snowpark_2.12:latest', 'com.snowflake:telemetry:latest')
    ...
  ```

  ```sqlexample
  CREATE OR REPLACE PROCEDURE myproc(...)
    RETURNS ...
    LANGUAGE SCALA
    ...
    PACKAGES = ('com.snowflake:snowpark_2.13:latest', 'com.snowflake:telemetry:latest')
    ...
  ```
* Import the `com.snowflake.telemetry` package in your handler code.

  ```scala
  import com.snowflake.telemetry.Telemetry
  ```

## Adding trace events

You can add trace events by calling the `Telemetry.addEvent` method, passing a name for the event. You can also optionally associate
attributes – key-value pairs – with an event.

The `addEvent` method has the following signatures:

```scala
public static void addEvent(String name)
public static void addEvent(String name, Attributes attributes)
```

Code in the following example adds an event called `testEvent`, associating with the event two attributes: `key` and
`result`.

```scala
// Adding an event without attributes.
Telemetry.addEvent("testEvent")

// Adding an event with attributes.
Attributes eventAttributes = Attributes.of(
  AttributeKey.stringKey("key"), "run",
  AttributeKey.longKey("result"), Long.valueOf(123))
Telemetry.addEvent("testEventWithAttributes", eventAttributes)
```

Adding these events results in two rows in the event table, each with a different value in the RECORD column:

```sqljson
{
  "name": "testEvent"
}
```

```sqljson
{
  "name": "testEventWithAttributes"
}
```

The `testEventWithAttributes` event row includes the following attributes in the row’s RECORD_ATTRIBUTES column:

```sqljson
{
  "key": "run",
  "result": 123
}
```

## Adding span attributes

You can set attributes – key-value pairs – associated with spans by calling the `Telemetry.setSpanAttribute` method.

The `setSpanAttribute` method has the following signatures:

```scala
public static void setSpanAttribute(String key, boolean value)
public static void setSpanAttribute(String key, long value)
public static void setSpanAttribute(String key, double value)
public static void setSpanAttribute(String key, String value)
```

For details on spans, see [How Snowflake represents trace events](tracing-how-events-work.md).

Code in the following example creates four attributes and sets their values:

```scala
// Setting span attributes.
Telemetry.setSpanAttribute("example.boolean", true)
Telemetry.setSpanAttribute("example.long", 2L)
Telemetry.setSpanAttribute("example.double", 2.5)
Telemetry.setSpanAttribute("example.string", "testAttribute")
```

Setting these attributes results in the following in the event table’s RECORD_ATTRIBUTES column:

```sqljson
{
  "example.boolean": true,
  "example.long": 2,
  "example.double": 2.5,
  "example.string": "testAttribute"
}
```

## Adding custom spans

You can add custom spans that are separate from the default span created by Snowflake. For details on custom spans, see
[Adding custom spans to a trace](tracing-custom-spans.md).

Code in the following example uses the [OpenTelemetry API](https://javadoc.io/doc/io.opentelemetry/opentelemetry-api/latest/index.html) and [OpenTelemetry context propagation API](https://www.javadoc.io/doc/io.opentelemetry/opentelemetry-context-prop/latest/index.html) to create a new `my.span`
span. It then adds an event to the new span. Finally, the code ends the span to have the span’s event data captured in the event table.
If the code doesn’t call the `Span.end` method, data is not captured in the event table.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION testScalaUserSpans(x VARCHAR) RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:telemetry:latest')
  HANDLER = 'TestScalaClass.run'
  AS
  $$
  class TestScalaClass {
    import com.snowflake.telemetry.Telemetry
    import io.opentelemetry.api.GlobalOpenTelemetry
    import io.opentelemetry.api.trace.Tracer
    import io.opentelemetry.api.trace.Span
    import io.opentelemetry.context.Scope

    def run(x: String): String = {
      val tracer: Tracer = GlobalOpenTelemetry.getTracerProvider().get("my.tracer")
      val span: Span = tracer.spanBuilder("my.span").startSpan()
      span.addEvent("test event from scala")
      span.end()
      return x
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION testScalaUserSpans(x VARCHAR) RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:telemetry:latest')
  HANDLER = 'TestScalaClass.run'
  AS
  $$
  class TestScalaClass {
    import com.snowflake.telemetry.Telemetry
    import io.opentelemetry.api.GlobalOpenTelemetry
    import io.opentelemetry.api.trace.Tracer
    import io.opentelemetry.api.trace.Span
    import io.opentelemetry.context.Scope

    def run(x: String): String = {
      val tracer: Tracer = GlobalOpenTelemetry.getTracerProvider().get("my.tracer")
      val span: Span = tracer.spanBuilder("my.span").startSpan()
      span.addEvent("test event from scala")
      span.end()
      return x
    }
  }
  $$;
```

## Examples

### Stored procedure example

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_tracing()
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.12'
  PACKAGES=('com.snowflake:snowpark_2.12:latest', 'com.snowflake:telemetry:latest')
  HANDLER = 'ProcedureHandler.run'
  AS
  $$
  import com.snowflake.snowpark_java.Session
  import com.snowflake.telemetry.Telemetry
  import io.opentelemetry.api.common.AttributeKey
  import io.opentelemetry.api.common.Attributes

  class ProcedureHandler {

    def run(session: Session): String = {

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "begin")

      // Add an event without attributes.
      Telemetry.addEvent("run_method_start")

      // Add an event with attributes.
      val eventAttributes: Attributes = Attributes.of(
        AttributeKey.stringKey("example.method.name"), "run")
      Telemetry.addEvent("event_with_attributes", eventAttributes)

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "complete")

      return "SUCCESS"
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_tracing()
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.13'
  PACKAGES=('com.snowflake:snowpark_2.13:latest', 'com.snowflake:telemetry:latest')
  HANDLER = 'ProcedureHandler.run'
  AS
  $$
  import com.snowflake.snowpark_java.Session
  import com.snowflake.telemetry.Telemetry
  import io.opentelemetry.api.common.AttributeKey
  import io.opentelemetry.api.common.Attributes

  class ProcedureHandler {

    def run(session: Session): String = {

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "begin")

      // Add an event without attributes.
      Telemetry.addEvent("run_method_start")

      // Add an event with attributes.
      val eventAttributes: Attributes = Attributes.of(
        AttributeKey.stringKey("example.method.name"), "run")
      Telemetry.addEvent("event_with_attributes", eventAttributes)

      // Set span attribute.
      Telemetry.setSpanAttribute("example.proc.do_tracing", "complete")

      return "SUCCESS"
    }
  }
  $$;
```

---
title: Emitting trace events in Snowflake Scripting
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-snowflake-scripting.md
section: Developer Guide
---

# Emitting trace events in Snowflake Scripting

You can use the Snowflake `SYSTEM` functions to emit trace events from a stored procedure handler written in Snowflake Scripting.

Before emitting trace events, be sure you have the trace level set so that the data you want are stored in the event table. For more
information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

> **Note:**
>
> Before you can begin emitting trace events, you must set up an event table. For more information, see
> [Event table overview](event-table-setting-up.md).

You can access stored trace event data by executing a SELECT command on the event table. For more information, see
[Viewing trace data](tracing-accessing-events.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Trace events for functions and procedures](tracing.md).

> **Note:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

## Adding trace events

You can add trace events by calling the [SYSTEM$ADD_EVENT](../../sql-reference/functions/system_add_event.md) function,
passing a name for the event. You can also optionally associate attributes (key-value pairs) with an event.

Code in the following example adds two events, `SProcEmptyEvent` and `SProcEventWithAttributes`. With
`SProcEventWithAttributes`, the code also adds two attributes: `key1` and `key2`.

```sqlexample
SYSTEM$ADD_EVENT('SProcEmptyEvent');
SYSTEM$ADD_EVENT('SProcEventWithAttributes', {'key1': 'value1', 'key2': 'value2'});
```

Adding these events results in two rows in the event table, each with a different value in the RECORD column:

```json
{
  "name": "SProcEmptyEvent"
}
```

```json
{
  "name": "SProcEventWithAttributes"
}
```

The `SProcEventWithAttributes` event row includes the following attributes in the row’s RECORD_ATTRIBUTES column:

```json
{
  "key1": "value1",
  "key2": "value2"
}
```

## Adding span attributes

You can set attributes (key-value pairs) associated with spans by calling the
[SYSTEM$SET_SPAN_ATTRIBUTES](../../sql-reference/functions/system_set_span_attributes.md) function.

For details on spans, see [How Snowflake represents trace events](tracing-how-events-work.md).

The SYSTEM$SET_SPAN_ATTRIBUTES function is available in the following form:

```sqlsyntax
SYSTEM$SET_SPAN_ATTRIBUTES(<object>);
```

where

> * `object` is a Snowflake Scripting object with key-value pairs that specify the attributes for this trace event.

Code in the following example creates two attributes and sets their values:

```sqlexample
SYSTEM$SET_SPAN_ATTRIBUTES('{'attr1':'value1', 'attr2':true}');
```

Setting these attributes results in the following in the event table’s RECORD_ATTRIBUTES column:

```json
{
  "attr1": "value1",
  "attr2": "value2"
}
```

## Examples

Code in the following example uses the SYSTEM$ADD_EVENT function to add an event named `name_a` and an event named `name_b`.
With `name_b`, it associates two attributes, `score` and `pass`. The code also uses SYSTEM$SET_SPAN_ATTRIBUTES to
set two attributes for the span, `key1` and `key2`.

```sqlexample
CREATE OR REPLACE PROCEDURE pi_proc()
  RETURNS DOUBLE
  LANGUAGE SQL
  AS $$
  BEGIN
    -- Add an event without attributes
    SYSTEM$ADD_EVENT('name_a');

    -- Add an event with attributes
    LET attr := {'score': 89, 'pass': TRUE};
    SYSTEM$ADD_EVENT('name_b', attr);

    -- Set attributes for the span
    SYSTEM$SET_SPAN_ATTRIBUTES({'key1': 'value1', 'key2': TRUE});

    RETURN 3.14;
  END;
  $$;
```

```sqlexample
CALL pi_proc();
```

## Automatically emit trace events for child jobs and exceptions

You can automatically emit the following additional types of trace events for a Snowflake Scripting stored procedure
in the event table:

* Exception catching.
* Information about child job execution.
* Child job statistics.
* Stored procedure statistics, including execution time and input values.

Automatic trace emission is intended for the following use cases:

* You want to emit predefined trace events without modifying the body of the stored procedure.
* You want to collect information about stored procedure execution that you can analyze later,
  including:

  + Information about child job execution (such as `childJobUUID`, `rowCount`, `exceptionCode`, and so on).
  + Child job execution time.
  + Input argument values.
* You want more visibility into stored procedure execution to make it easier to develop and debug it without
  manually adding tracing code in the procedure.

To automatically emit these additional trace events for a stored procedure, set the [AUTO_EVENT_LOGGING](../../sql-reference/parameters.md) parameter
to `TRACING` or `ALL` using the [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) command. When you set
this parameter to `ALL`, additional [log messages](logging-snowflake-scripting.md) are also generated automatically
for the stored procedure.

> **Important:**
>
> The additional information is added to the event table only if the effective [TRACE_LEVEL](../../sql-reference/parameters.md) is set
> to `ALWAYS` or `ON_EVENT`. For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

For example, create a simple table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_auto_event_logging (id INTEGER, num NUMBER(12, 2));

INSERT INTO test_auto_event_logging (id, num) VALUES
  (1, 11.11),
  (2, 22.22);
```

Next, create a stored procedure named `auto_event_logging_sp`. This sample stored procedure updates a table row and
then queries the table:

```sqlexample
CREATE OR REPLACE PROCEDURE auto_event_logging_sp(
  table_name VARCHAR,
  id_val INTEGER,
  num_val NUMBER(12, 2))
RETURNS TABLE()
LANGUAGE SQL
AS
$$
BEGIN
  UPDATE IDENTIFIER(:table_name)
    SET num = :num_val
    WHERE id = :id_val;
  LET res RESULTSET := (SELECT * FROM IDENTIFIER(:table_name) ORDER BY id);
  RETURN TABLE(res);
EXCEPTION
  WHEN statement_error THEN
    res := (SELECT :sqlcode sql_code, :sqlerrm error_message, :sqlstate sql_state);
    RETURN TABLE(res);
END;
$$
;
```

The following examples set the AUTO_EVENT_LOGGING parameter for the stored procedure:

```sqlexample
ALTER PROCEDURE auto_event_logging_sp(VARCHAR, INTEGER, NUMBER)
  SET AUTO_EVENT_LOGGING = 'TRACING';
```

```sqlexample
ALTER PROCEDURE auto_event_logging_sp(VARCHAR, INTEGER, NUMBER)
  SET AUTO_EVENT_LOGGING = 'ALL';
```

Call the stored procedure:

```sqlexample
CALL auto_event_logging_sp('test_auto_event_logging', 2, 44.44);
```

```output
+----+-------+
| ID |   NUM |
|----+-------|
|  1 | 11.11 |
|  2 | 44.44 |
+----+-------+
```

Query the event table for trace data recorded by the stored procedure named `auto_event_logging_sp`.
For each trace event, print out the timestamp, name, and attributes of the event.

```sqlexample
SELECT
    TIMESTAMP as time,
    RECORD['name'] as event_name,
    RECORD_ATTRIBUTES as attributes,
  FROM
    my_db.public.my_events
  WHERE
    RESOURCE_ATTRIBUTES['snow.executable.name'] LIKE '%AUTO_EVENT_LOGGING_SP%'
    AND RECORD_TYPE LIKE 'SPAN%';
```

```output
+-------------------------+--------------------------+-----------------------------------------------------------------------------------------------+
| TIME                    | EVENT_NAME               | ATTRIBUTES                                                                                    |
|-------------------------+--------------------------+-----------------------------------------------------------------------------------------------|
| 2024-10-25 20:48:49.844 | "snow.auto_instrumented" | {                                                                                             |
|                         |                          |   "childJobTime": 474,                                                                        |
|                         |                          |   "executionTime": 2,                                                                         |
|                         |                          |   "inputArgumentValues": "{ ID_VAL: 2, TABLE_NAME: test_auto_event_logging, NUM_VAL: 44.44 }" |
|                         |                          | }                                                                                             |
| 2024-10-25 20:48:49.740 | "child_job"              | {                                                                                             |
|                         |                          |   "childJobUUID": "01b7ef00-0003-01d1-0000-a99501233092",                                     |
|                         |                          |   "rowCount": 1,                                                                              |
|                         |                          |   "rowsAffected": 1                                                                           |
|                         |                          | }                                                                                             |
| 2024-10-25 20:48:49.843 | "child_job"              | {                                                                                             |
|                         |                          |   "childJobUUID": "01b7ef00-0003-01d1-0000-a99501233096",                                     |
|                         |                          |   "rowCount": 2,                                                                              |
|                         |                          |   "rowsAffected": 0                                                                           |
|                         |                          | }                                                                                             |
+-------------------------+--------------------------+-----------------------------------------------------------------------------------------------+
```

Now, call the stored procedure, but specify a table that doesn’t exist to cause an exception:

```sqlexample
CALL auto_event_logging_sp('no_table', 2, 82.44);
```

```output
+----------+-----------------------------------------------------+-----------+
| SQL_CODE | ERROR_MESSAGE                                       | SQL_STATE |
|----------+-----------------------------------------------------+-----------|
|     2003 | SQL compilation error:                              | 42S02     |
|          | Object 'NO_TABLE' does not exist or not authorized. |           |
+----------+-----------------------------------------------------+-----------+
```

Run the query on the event table again to see the information about the exception:

```sqlexample
SELECT
    TIMESTAMP as time,
    RECORD['name'] as event_name,
    RECORD_ATTRIBUTES as attributes,
  FROM
    my_db.public.my_events
  WHERE
    RESOURCE_ATTRIBUTES['snow.executable.name'] LIKE '%AUTO_EVENT_LOGGING_SP%'
    AND RECORD_TYPE LIKE 'SPAN%';
```

```output
+-------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+
| TIME                    | EVENT_NAME               | ATTRIBUTES                                                                                          |
|-------------------------+--------------------------+-----------------------------------------------------------------------------------------------------|
| 2024-10-25 20:52:43.633 | "snow.auto_instrumented" | {                                                                                                   |
|                         |                          |   "childJobTime": 66,                                                                               |
|                         |                          |   "executionTime": 4,                                                                               |
|                         |                          |   "inputArgumentValues": "{ ID_VAL: 2, TABLE_NAME: no_table, NUM_VAL: 82.44 }"                      |
|                         |                          | }                                                                                                   |
| 2024-10-25 20:52:43.601 | "caught_exception"       | {                                                                                                   |
|                         |                          |   "exceptionCode": 2003,                                                                            |
|                         |                          |   "exceptionMessage": "SQL compilation error:\nObject 'NO_TABLE' does not exist or not authorized." |
|                         |                          | }                                                                                                   |
+-------------------------+--------------------------+-----------------------------------------------------------------------------------------------------+
```

---
title: Enabling telemetry collection
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-enabling.md
section: Developer Guide
---

# Enabling telemetry collection

You must enable telemetry collection before you can use telemetry data — including log messages, trace event data, and metrics data —
to debug, optimize, and troubleshoot your Snowflake applications.

Use this topic to confirm that you’re set up to capture data.

**To enable telemetry collection:**

1. Ensure that you have an active [event table](event-table-setting-up.md).

   By default, Snowflake includes a [predefined event table](event-table-setting-up.md) that is active for your account
   until you deactivate it or [specify an event table you create](event-table-setting-up.md).
2. Set logging, tracing, and metrics levels.

   You must [set levels](telemetry-levels.md) for the data you want to capture. For example, you
   must set the tracing level to a level other than `OFF` for tracing data to be collected.
3. Add code to your applications, if needed.

   For some telemetry data, data is emitted without your needing to add your own code. For example, Snowflake can record metrics data
   without your needing to add code. For other cases, such as logging and tracing, you can add code to emit your own data to be
   captured in an event table.

   For information on languages supported to emit your own data, see the following:

   * [Logging from handler code](logging.md)
   * [Event tracing from handler code](tracing.md)

---
title: Event table columns
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/event-table-columns.md
section: Developer Guide
---

# Event table columns

An [event table](event-table-setting-up.md) is a special kind of database table with a predefined set of
columns. The table’s structure is designed to support the data model for [OpenTelemetry](https://opentelemetry.io/), a framework for handling telemetry data.

For more information about working with event tables, see [Working with event tables](event-table-operations.md).

## Event table columns

Event tables have the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_NTZ | The UTC timestamp when an event was created. For events representing a span of time, this is the end of the time span. |
| START_TIMESTAMP | TIMESTAMP_NTZ | For events representing a span of time, such as trace events, the start of the time span as a UTC timestamp. |
| OBSERVED_TIMESTAMP | TIMESTAMP_NTZ | A UTC time used for logs. Currently the same value as for TIMESTAMP. |
| TRACE | OBJECT | Tracing context for all signal types. Contains string values `trace_id` and `span_id`. |
| RESOURCE | OBJECT | Reserved for future use. |
| RESOURCE_ATTRIBUTES | OBJECT | Attributes that identify the source of an event such as database, schema, user, warehouse, [Openflow](../../user-guide/data-integration/openflow/monitor.md), etc. |
| SCOPE | OBJECT | Scopes for events. For example, class names for logs. |
| SCOPE_ATTRIBUTES | OBJECT | Reserved for future use. |
| RECORD_TYPE | STRING | The event type. One of the following:   * `LOG` for a log message. * `SPAN` for user-defined function invocations performed sequentially on the same thread. For more information, see   RECORD_TYPE column. * `SPAN_EVENT` for a single trace event. A single query can emit more than one `SPAN_EVENT`. * `EVENT` for an event associated with an operation such as Iceberg automated refresh. |
| RECORD | OBJECT | Fixed values for each record type, as described in RECORD column. |
| RECORD_ATTRIBUTES | OBJECT | Variable attributes for each record type, as described in RECORD_ATTRIBUTES column. |
| VALUE | VARIANT | Primary event value. |
| EXEMPLARS | ARRAY | Reserved for future use. |

## Data captured by event type

### Data for logs

| Attribute | Description |
| --- | --- |
| OBSERVED_TIMESTAMP | Currently the same value as for TIMESTAMP. |
| RECORD | The severity level recorded by the log event. |
| RECORD_ATTRIBUTES | The location in code from which the log event was emitted. The values vary by language, but can include the code file path, function name, line number, and so on. |
| RECORD_TYPE | The event type: `LOG` for a log message |
| RESOURCE_ATTRIBUTES | Attributes that identify the source of the event, such as database, schema, user, warehouse, and so on. |
| SCOPE | Scope within which the event occurred, such as the name of the class where the log event was created. |
| TIMESTAMP | The timestamp when the event was created. |
| VALUE | The log message. |

### Data for metrics

| Attribute | Description |
| --- | --- |
| RECORD | For a metric event, an object that includes the metric’s name and unit. |
| RECORD_TYPE | The event type: `METRIC` for a metric data point. |
| RESOURCE_ATTRIBUTES | Attributes that identify the source of the event, such as database, schema, user, or warehouse. |
| START_TIMESTAMP | When the RECORD column `metric_type` value is `sum`, this is the time when the metric was collected. Not used when the `metric_type` value is `gauge`. |
| TIMESTAMP | The timestamp when the event was created. |
| VALUE | The numeric value of the metric. |

### Data for trace events

| Attribute | Description |
| --- | --- |
| RECORD | For a span, an object that includes the span’s name and kind; for a span event, the object includes the span’s name. |
| RECORD_ATTRIBUTES | Attribute data associated with a span or span event. |
| RECORD_TYPE | The event type: `SPAN` for a span, `SPAN_EVENT` for a span event. |
| RESOURCE_ATTRIBUTES | Attributes that identify the source of the event, such as database, schema, user, warehouse, and so on. |
| START_TIMESTAMP | For a span, the time when the span began. Not used for a span event. |
| TIMESTAMP | The timestamp when the event was created. |
| TRACE | Identifiers `trace_id` and `span_id` for a span and the span events within it. |

### Data for Iceberg automated refresh events

Snowflake logs an event to your event table when it processes a snapshot for [Iceberg automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).
The following table describes the columns for Iceberg automated refresh events:

| Attribute | Description |
| --- | --- |
| TIMESTAMP | The timestamp when the event was created. |
| RESOURCE_ATTRIBUTES | Attributes that identify the Iceberg Auto Refresh event, such as database, schema, table, and catalog names. |
| RECORD_TYPE | The event type `EVENT`. |
| RECORD | Detailed information about the status of the operation associated with the event, including name and severity. |
| RECORD_ATTRIBUTES | Attributes associated with the event. |
| VALUE | Additional information specific to the event. |

## EXEMPLARS column

Reserved for future use.

## OBSERVED_TIMESTAMP column

A log’s UTC timestamp. Not used for trace events.

## RECORD column

Provides core information about the event, include the log level for a log event, or the name for trace event (span or span event record).

Attributes, if any, for the record are recorded in the RECORD_ATTRIBUTES column.

Values contained by this column will vary depending on the value of the RECORD_TYPE column
(`LOG`, `SPAN` or `SPAN_EVENT`), as described in the following sections.

### For `LOG` RECORD_TYPE

When the RECORD_TYPE column value is `LOG`, the RECORD column value contains the severity of the log message. The column value may
contain the following keys:

| Key | Type | Description |
| --- | --- | --- |
| `severity_text` | STRING | The text for the log severity. One of the following:   * `TRACE` * `DEBUG` * `INFO` * `WARN` * `ERROR` * `FATAL`   When the log entry is for an [unhandled exception](unhandled-exception-messages.md), this value is the highest-severity error level for the current language runtime. For example, for code written in Python, the value is `FATAL`. |

#### Example

```json
{
  "severity_text": "INFO"
}
```

### For `METRIC` RECORD_TYPE

Metrics are CPU and memory data generated by Snowflake. You can use this data to analyze resource consumption.

The execution handler language and its environment significantly affect the meaning of the metrics data. See
[Emitting metrics data from handler code](metrics-handler.md) for more information.

| Key | Type | Description |
| --- | --- | --- |
| `metric.name` | string | The name of the metric recorded by the row. One of the following:   * `process.memory.usage`: Amount of memory, in bytes, consumed during execution. * `process.cpu.utilization`: CPU use. Measured differently based on the handler language.   For more information, see [Emitting metrics data from handler code](metrics-handler.md). |
| `metric.unit` | string | The units of the metric; for example, `bytes`. |
| `metric_type` | string | The OpenTelemetry Metric Point type of the metric data; for example, `sum` or `gauge`. |
| `value_type` | string | The data type of the value in the VALUE column; for example, `DOUBLE` or `INT`. |

### Example

```json
{
  "metric": {
    "name": "process.memory.usage",
    "unit": "bytes"
  },
  "metric_type": "sum",
  "value_type": "INT"
}
```

### For `SPAN` RECORD_TYPE

Spans represent individual executions of functions and procedures. For stored procedures there will be a single span. For user-defined
functions there may be multiple spans for a single function call, depending on how Snowflake decides to schedule execution.

All spans for a given query have the same value for the `trace_id` key of the `TRACE` column.

The duration of a span is the difference between the values in the `start_timestamp` and `timestamp` columns, indicating the
time of the beginning and end of the span execution, respectively.

The ID of the span and the query trace are represented in the value in the TRACE column.

Snowflake will create one span for each execution with the keys shown below:

| Key | Type | Description |
| --- | --- | --- |
| `dropped_attributes_count` | int | The number of attributes ignored after the recorded maximum has been reached. |
| `name` | string | When the executable’s handler is written in Python, this identifies the handler for the function or procedure that emitted the data. This varies by executable type, as follows:   * Procedure: handler function name * User-defined function (UDF): handler function name * User-defined table function (UDTF): handler class name * Client code: name of the client-side API that began the span.   When the traced code is written in SQL, such as a SQL statement executed within a procedure, this is the name of the executed statement, such as `SELECT` or `INSERT`.  When the executable’s handler is written in a language other than Python or SQL, this is a fixed value such as `snow.auto_instrumented`. |
| `kind` | string | `SPAN_KIND_SERVER` when the traced code is written in SQL. Otherwise, this is the fixed value `SPAN_KIND_INTERNAL`. |
| `parent_span_id` | Hex string | Identifies the span of the procedure or UDF from which the current trace passed. When this value is present, it means that the current procedure or UDF call was made by another procedure in a call chain relationship. That “parent” procedure’s `span_id` value is the same as this `parent_span_id`. |
| `snow.process.memory.usage.max` | string | Optional. When present, specifies the maximum amount of memory, in bytes, used during this span’s execution. |
| `status` | string | `STATUS_CODE_ERROR` when the span corresponds to an [unhandled exception](unhandled-exception-messages.md). Otherwise, `STATUS_CODE_UNSET`. |

In the case of user-defined functions, Snowflake may add attributes for spans to indicate the number of rows processed and emitted by the function.

### For `SPAN_EVENT` RECORD_TYPE

Span events are event records attached to a particular span execution, described above. You can create events to fit the needs of your
application. The number of span events is limited to 128.

The value of the TRACE column will identify the span in which the event was created.

Span events have a single key, `name`, and can have arbitrary attributes added in the RECORD_ATTRIBUTES column.

| Key | Type | Description |
| --- | --- | --- |
| `name` | string | The name of the span event. |

### For `EVENT` RECORD_TYPE for Iceberg automated refresh events

| Key | Type | Description |
| --- | --- | --- |
| `name` | VARCHAR | The name of the event; for example, `iceberg_auto_refresh_snapshot_lifecycle` |
| `severity_text` | VARCHAR | The text for the event severity. One of the following values:   * `DEBUG` * `WARN` * `ERROR`   The severity text when the `snapshot_state` is `"errored"` is one of the following values:   * `WARN` if Snowflake encountered an error that can be resolved without human intervention. * `ERROR` if Snowflake encountered an error that requires human intervention to resolve.   The `snapshot_state` is a key-value pair in an object in the VALUE column. For more information, see For EVENT VALUE for Iceberg automated refresh events. |

### For `EVENT` RECORD_TYPE for Snowflake Native Apps lifecycle events

| Key | Type | Description |
| --- | --- | --- |
| `name` | VARCHAR | The name of the event; for example, `application.state_change` |
| `severity_number` | NUMBER | The severity number of the event. |
| `severity_text` | VARCHAR | The text for the event severity. One of the following values:   * `INFO` * `WARN` * `ERROR` |

## RECORD_ATTRIBUTES column

Describes the event with metadata set by Snowflake or by code. The value will vary depending on the type of record the row
contains, as described in the following sections.

### For `LOG` RECORD_TYPE

The location in code from which the log event was emitted, including the code file path, function name, line number, and so on.

In addition to the attributes listed below, you can add your own attributes to include in the RECORD_ATTRIBUTES value.

| Attribute | Type | Description |
| --- | --- | --- |
| `code.filepath` | int | The file containing code that generated the message. |
| `code.function` | string | The name of the function that generated the message. |
| `code.lineno` | int | The line number in code that generated the message. |
| `code.namespace` | int | The namespace of code that generated the messages. |
| `exception.message` | string | The error message from an [unhandled exception](unhandled-exception-messages.md). |
| `exception.type` | string | The name of the class for an [unhandled exception](unhandled-exception-messages.md). |
| `exception.stacktrace` | string | An [unhandled exception](unhandled-exception-messages.md)’s stack trace formatted by a language runtime. |
| `exception.escaped` | boolean | `true` if this entry is from an [unhandled exception](unhandled-exception-messages.md). |
| `thread.id` | int | The thread on which the log event was created. |
| `thread.name` | string | The thread on which the log event was created. |

### Example

In the following example, all attributes have been added by Snowflake except `employee.id`, which was added by a custom attribute.

```json
{
  "code.filepath": "main.scala",
  "code.function": "$anonfun$new$10",
  "code.lineno": 149,
  "code.namespace": "main.main$",
  "thread.id": 1,
  "thread.name": "main"
  "employee.id": "52307953446424"
}
```

### For `SPAN` RECORD_TYPE

Attributes, if any, assigned to the span when it is recorded. Attribute names and values are set by code or by Snowflake.

The following table lists attributes that might be set by Snowflake.

| Attribute | Type | Description |
| --- | --- | --- |
| `db.query.executable.names` | string | The names of the executables executed under the query traced in this span. |
| `db.query.table.names` | string | The names of tables read or modified in the query traced in this span. |
| `db.query.view.names` | string | The names of views accessed in the query traced in this span. |
| `db.query.text` | string | The text of the SQL query traced in this span. Included only if SQL trace query text is enabled for tracing. For more information, see [SQL statement tracing](tracing.md). |
| `snow.input.rows` | int | The number of input rows processed by the span of the function. |
| `snow.output.rows` | int | The number of output rows successfully processed by the span of the function. |

#### Example

Code in the following example includes attributes set by Snowflake.

```json
{
  "snow.input.rows": 12
  "snow.output.rows": 12
}
```

#### Example

Code in the following example includes attributes set by handler code.

```json
{
  "MyFunctionVersion": "1.1.0"
}
```

### For `SPAN_EVENT` RECORD_TYPE

Attributes, if any, assigned to the span event when it is recorded. Attribute names and values may be set by Snowflake or by user code.

#### Example

Code in the following example includes attributes set by handler code.

```json
{
  "mykey1": "value1",
  "mykey2": "value2"
}
```

### For `EVENT` RECORD_TYPE for Iceberg automated refresh events

Attributes assigned to the EVENT event when it is recorded for [Iceberg automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).
Attribute names and values are set by Snowflake.

| Attribute | Type | Description | Example |
| --- | --- | --- | --- |
| `snow.snapshot.id` | INTEGER | The ID of the Iceberg snapshot being processed during Iceberg automated refresh. NULL if the automated refresh process fails. | `12345` |

## RECORD_TYPE column

Specifies the kind of record described by the event table row. This column’s value identifies which of the various types of records for which
the event table may contain data.

The RECORD column contains this
record’s data. The RECORD_ATTRIBUTES column contains this record’s metadata, if any.

The following table lists possible values for this column.

| Column Value | Description |
| --- | --- |
| `LOG` | The row represents a log entry generated by handler code. |
| `SPAN` | The row represents a span.  For a stored procedure there will be a single span. For a user-defined function, which may be parallelized, there will be a span for each thread on which the function executes. The number of threads will vary depending on multiple factors, including the size of the Snowflake warehouse in which the function executes.  A span may contain multiple span events. For more information, see [Span data recorded](tracing-how-events-work.md). |
| `SPAN_EVENT` | The row represents a span event. The may be multiple span event records attached to a particular span. Your handler code may create events to fit your needs. The number of span events is limited to 128. |
| `METRIC` | The row represents an observation of a metric. Multiple observations of multiple metrics can be associated with a particular span. |
| `EVENT` | The row represents an event associated with a particular operation, such as Iceberg automated refresh. |

## RESOURCE column

Reserved for future use.

## RESOURCE_ATTRIBUTES column

Describes the source of an event in terms of Snowflake objects.

Attributes making up this column’s value are set by Snowflake and cannot be changed.

### Resource attributes for event source

> **Note:**
>
> When the event source is Iceberg automated refresh, only the following attribute types are set:
>
> * snow.catalog.integration.name
> * snow.catalog.table.name
> * snow.database.name
> * snow.schema.name
> * snow.table.name

| Attribute Name | Attribute Type | Description | Example |
| --- | --- | --- | --- |
| `snow.catalog.integration.name` | string | The name of the catalog integration associated with the executable for Iceberg automated refresh. | `MY_CATALOG_INTEGRATION_NAME` |
| `snow.catalog.table.name` | string | The name of the Iceberg table in the catalog. | `MY_CATALOG_TABLE_NAME` |
| `snow.database.id` | int | The internal/system-generated identifier of the database containing the executable. | `12345` |
| `snow.database.name` | string | The name of the database containing the executable. | `MY_DATABASE` |
| `snow.executable.id` | int | The internal/system-generated identifier of the executable (procedure, function, SnowService, etc.) generating the event. | `12345` |
| `snow.executable.name` | string | The name of the executable generating the event. For example, this might be the name of the procedure, function, or Streamlit app. | `MY_UDF` |
| `snow.executable.runtime.version` | string | The executable language’s runtime version. This will be a value specific to the language, as described below:   * Java: `11` or `17` * JavaScript: No value * Python: From `3.10` to `3.12` * Scala: `2.12` * SQL: No value | `procedure` |
| `snow.executable.type` | string | One of the following:   * `procedure` for stored procedure * `function` for a user-defined function * `query` for an event in which SQL was executed, such as when a SQL statement is executed within a stored procedure. * `sql` for an event from a single query, such as a Snowflake Scripting block. * `spcs` for a Snowpark Container Services service. * `streamlit` for a Streamlit app | `procedure` |
| `snow.owner.id` | int | The internal/system-generated identifier of the role with OWNERSHIP privilege for the executable. | `1234` |
| `snow.owner.name` | string | The name of the role with OWNERSHIP privilege for the executable. | `UDF_OWNER_RL` |
| `snow.schema.id` | int | The internal/system-generated identifier of the schema containing the executable. | `12345` |
| `snow.schema.name` | string | The name of the schema containing the executable. | `MY_SCHEMA` |
| `snow.table.name` | string | The name of the table associated with the executable. | `MY_TABLE_NAME` |
| `telemetry.sdk.language` | string | The language of the resource/SDK. Snowflake uses java, scala, python, javascript and sql. | `java` |

### Resource attributes for execution environment

| Attribute | Type | Description | Examples |
| --- | --- | --- | --- |
| `db.user` | string | For a function or procedure, the name of the user executing the function or procedure. For a Streamlit app, the name of the user who was viewing the app for a given event. | `MY_USER_NAME` |
| `snow.query.id` | string | The ID of the query. | `01a6aeb7-0604-c466-0000-097127d13812` |
| `snow.release.version` | string | The Snowflake release running when event was generated | `7.9.0` |
| `snow.session.id` | int | The ID of the session running the executable. | `10` |
| `snow.session.role.primary.id` | int | The internal/system-generated identifier of the primary role in the session. | `10` |
| `snow.session.role.primary.name` | string | The name of the primary role in the session. | `MY_ROLE` |
| `snow.user.id` | int | The internal/system-generated identifier of the user running the query. | `1234` |
| `snow.warehouse.id` | int | The internal/system-generated identifier of the warehouse running the query generating the event. | `12345` |
| `snow.warehouse.name` | string | The name of the warehouse running the query generating the event. | `MY_WAREHOUSE` |

### Resource attributes for apps

| Attribute | Type | Description | Examples |
| --- | --- | --- | --- |
| `snow.application.consumer.name` | string | For a Snowflake Native App, the name of the consumer’s account. | `CONSUMER_NAME` |
| `snow.application.consumer.organization` | string | For a Snowflake Native App, the name of the consumer’s organization. | `CONSUMER_ORG_NAME` |
| `snow.application.id` | string | For a Snowflake Native App, the internal/system-generated identifier of the app. | `ABCZN3J3` |
| `snow.application.name` | string | For a Snowflake Native App, the name of the app. | `MY_INSTALLED_APP_NAME` |
| `snow.application.package.name` | string | For a Snowflake Native App, the name of the application package. | `MY_INSTALLED_PACKAGE_NAME` |
| `snow.listing.global_name` | string | For a Snowflake Native App, the internal/system-generated identifier of the listing. | `GZYZN3J3` |
| `snow.listing.name` | string | For a Snowflake Native App, the name of the listing. | `MY_LISTING_NAME` |

### Resource attributes for Snowflake version

| Attribute | Type | Description | Examples |
| --- | --- | --- | --- |
| `service.version` | string | The version of the executable, where relevant. The combination of `snow.version` and `snow.patch` joined by a dot where they exist. Standard OpenTelemetry attribute. | `2.3.1` |
| `snow.patch` | string | The patch level of the executable running. | `1` |
| `snow.version` | string | The version of the executable running. | `2.3` |

### Example

```json
{
  "db.user": "MYUSERNAME",
  "snow.database.id": 13,
  "snow.database.name": "MY_DB",
  "snow.executable.id": 197,
  "snow.executable.name": "FUNCTION_NAME(I NUMBER):ARG_NAME(38,0)",
  "snow.executable.type": "FUNCTION",
  "snow.owner.id": 2,
  "snow.owner.name": "MY_ROLE",
  "snow.query.id": "01ab0f07-0000-15c8-0000-0129000592c2",
  "snow.schema.id": 16,
  "snow.schema.name": "PUBLIC",
  "snow.session.id": 1275605667850,
  "snow.session.role.primary.id": 2,
  "snow.session.role.primary.name": "MY_ROLE",
  "snow.user.id": 25,
  "snow.warehouse.id": 5,
  "snow.warehouse.name": "MYWH",
  "telemetry.sdk.language": "python"
}
```

## SCOPE column

For log events, the namespace of the code that emitted the event, such as the name of the class creating a log entry. This is not used
for trace events.

The following table lists attributes that may be included in this column.

### Scope value

| Attribute | Type | Description | Examples |
| --- | --- | --- | --- |
| `name` | String | Namespace of code emitting the event. | `com.sample.MyClass` |

### Example

```json
{
  "name": "com.sample.MyClass"
}
```

## SCOPE_ATTRIBUTES column

Reserved for future use.

## START_TIMESTAMP column

The time a span started as a UTC timestamp.

| RECORD_TYPE Column Value | START_TIMESTAMP Value Description |
| --- | --- |
| `LOG` | Not used. |
| `SPAN` | The time the span started. |
| `SPAN_EVENT` | Not used. |
| `METRIC` | When the RECORD column `metric_type` value is `sum`, this is the time when the metric was collected. Not used when the `metric_type` value is `gauge`. |

## TIMESTAMP column

The time an event was emitted. The value’s meaning will vary depending on the type of record the row represents, as listed in the following
table:

| RECORD_TYPE Column Value | TIMESTAMP Value Description |
| --- | --- |
| `LOG` | The wall-clock time that the event was emitted. |
| `SPAN` | The time at which execution concluded. |
| `SPAN_EVENT` | The wall-clock time that the event was emitted. |

## TRACE column

Unique identifiers representing execution for functions and procedures.

| RECORD_TYPE Column Value | TRACE Value Description |
| --- | --- |
| `LOG` | Not used. |
| `SPAN` | `trace_id` and `span_id` |
| `SPAN_EVENT` | `trace_id` and `span_id` |

### Trace value

The following table lists attributes that may be included in this column.

| Attribute | Type | Description | Examples |
| --- | --- | --- | --- |
| `span_id` | Hex string | A unique identifier to the threading model. Procedures, which are single-threaded, will have a single `span_id` value. Functions, which may be executed by Snowflake on multiple threads (such as for multiple rows), may have multiple `span_id` values.  When the current span is from a procedure that called another procedure or UDF in the trace, this `span_id` value is the same as the RECORD column `parent_span_id` value of the span for the procedure or UDF it called. | `b4c28078330873a2` |
| `trace_id` | Hex string | A unique identifier for calls made from a query. When a stored procedure is not being called in a chain of calls, each call has its own `trace_id` value. Within a query, calls to all functions made from the query share the same `trace_id` value.  When a procedure is called by another procedure or UDF in a call chain, it has the same `trace_id` value as other procedures and UDFs in the chain.  This value is unique for each query and will be the same for all spans within a query. You can use it for grouping events within a single query execution. | `6992e9febf0b97f45b34a62e54936adb` |

### Example

Code in the following example shows the attributes that would be present for a span or span event.

```json
{
  "span_id": "b4c28078330873a2",
  "trace_id": "6992e9febf0b97f45b34a62e54936adb"
}
```

## VALUE column

* For log events, this is usually the log message. When the event logged is for an
  [unhandled exception](unhandled-exception-messages.md), the value in this column will be simply
  `exception`.
* For metrics, this is the numeric value of the metric.

Note that the VALUE column’s type is VARIANT (not STRING) so that it can have non-string values for some languages, such as JavaScript.

### For `EVENT` VALUE for Iceberg automated refresh events

| Key | Type | Description | Example |
| --- | --- | --- | --- |
| `metadata_file_location` | VARCHAR | The location of the Iceberg metadata file, which can be NULL if the automated refresh process failed. | `"s3://my_bucket/iceberg_snapshots/metadata/...metadata.json"` |
| `snapshot_state` | VARCHAR | The state of the automated refresh process, which can be one of the following values:   * `started`: Snowflake has started to process a snapshot. * `completed`: Snowflake completed processing a snapshot * `errored`: Snowflake failed to process a snapshot. | `"errored"` |
| `error_message` | VARCHAR | If the value in `snapshot_state` is `errored`, this column includes an error message. | “Iceberg Auto Refresh encountered a fatal error. Please disable Auto Refresh and manually refresh the table before re-enabling Auto Refresh. FailedMetadataFile: s3://my_bucket/…, FailedSnapshotId: null.n” |

### For `EVENT` VALUE for Snowflake Native Apps application lifecycle events

| Key | Type | Description |
| --- | --- | --- |
| `upgrade_state` | VARCHAR | The current state of the background installation or upgrade. |
| `upgrade_attempt` | VARCHAR | Indicates whether an upgrade was attempted for the app. |
| `target_upgrade_version` | VARCHAR | The version of the app that is running or pending upgrade. |
| `target_upgrade_patch` | VARCHAR | The version patch level of the app that is running or pending upgrade. |
| `upgrade_failure_reason` | VARCHAR | The reason the upgrade failed, if applicable. |
| `health_status` | VARCHAR | The health status of the app. Possible values:   * `OK` * `PAUSE` * `FAILED` |
| `action` | VARCHAR | The action applied to privileges during installation or upgrade. Possible values:   * `GRANTED` * `REVOKED` |
| `privileges` | VARCHAR | A list of privileges that were granted or revoked during installation or upgrade. |

### Example

```json
{
  "metadata_file_location": "<path>",
  "snapshot_state": "errored",
  "error_message": "<error_message>"
}
```

---
title: Event table overview
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/event-table-setting-up.md
section: Developer Guide
---

# Event table overview

As your Snowflake objects—including procedures and UDFs—emit telemetry data, Snowflake collects the data in an event table whose
data is available for queries. Snowflake includes an event table by default, but you can also create a new one.

To collect telemetry data, you must have an active event table and have
[set telemetry levels](logging-tracing-overview.md) to allow data collection. If you don’t already have an active event table,
Snowflake makes the default event table the active event table.

When collecting telemetry data, you incur costs. To understand these costs—or to reduce or avoid these costs—see
[Costs of telemetry data collection](logging-tracing-billing.md).

## What is an event table?

An event table is a special kind of database table with a predefined set of columns. The table’s [structure](event-table-columns.md)
supports the data model for [OpenTelemetry](https://opentelemetry.io/), a framework for handling telemetry data. When an
event table is active, Snowflake collects telemetry data in the table—including data that
Snowflake itself generates and data that you emit by instrumenting your handler code using certain APIs. You can view the collected data
by executing SQL queries.

After installation, Snowflake includes a default event table called
SNOWFLAKE.TELEMETRY.EVENTS. This event table is active and collects data until you deactivate it. You can also
create your own.

## Use the default event table

If you do not set an active event table, Snowflake uses as the active event table a default event table named SNOWFLAKE.TELEMETRY.EVENTS.
You can also create your own event tables for specific uses.

By default, Snowflake also includes a predefined view called [SNOWFLAKE.TELEMETRY.EVENTS_VIEW view](../../sql-reference/telemetry/events_view.md),
with which you more securely make event table data available to a range of users. You can manage access to the view with
a row access policy.

> **Note:**
>
> The default event table supports only a subset of DDL commands supported for event tables you create or for regular tables. For more
> information, see [Working with event tables](event-table-operations.md).

### Roles for access to the default event table and EVENTS_VIEW

Snowflake includes the following predefined application roles you can use to manage access to the default event table and EVENTS_VIEW view:
A person with the ACCOUNTADMIN role can access the default event table and EVENTS_VIEW view and can grant the roles described here
to other roles for access to them.

You must grant these roles to other roles, rather than to a user. For example, you might grant the EVENTS_ADMIN role to another admin
role you’ve created for broader administrative use.

```sqlexample
GRANT APPLICATION ROLE SNOWFLAKE.EVENTS_ADMIN TO ROLE my_admin_role;

GRANT APPLICATION ROLE SNOWFLAKE.EVENTS_VIEWER TO ROLE my_analysis_role;
```

EVENTS_VIEWER:
:   Role with privileges to execute a SELECT statement on the [EVENTS_VIEW view](../../sql-reference/telemetry/events_view.md).

EVENTS_ADMIN:
:   Role with the following privileges:

    * SELECT, TRUNCATE, DELETE on the default event table.
    * SELECT on the [EVENTS_VIEW view](../../sql-reference/telemetry/events_view.md) of the default event table.
    * USAGE on the following stored procedures:

      + [ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](../../sql-reference/stored-procedures/snowflake_telemetry_add_row_access_policy_on_events_view.md)
      + [DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](../../sql-reference/stored-procedures/snowflake_telemetry_drop_row_access_policy_on_events_view.md)
    * This role also has privileges to execute a stored procedure to apply a row access policy (RAP) on the EVENTS_VIEW view
      whose data is based on the default event table.

### Manage access to the EVENTS_VIEW view

You can manage access to data in the [EVENTS_VIEW](../../sql-reference/telemetry/events_view.md) view with
[row access policies](../../user-guide/security-row-intro.md). Snowflake provides stored procedures you can use to add and remove a row
access policy to the EVENT_VIEW view.

* [ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(VARCHAR, ARRAY)](../../sql-reference/stored-procedures/snowflake_telemetry_add_row_access_policy_on_events_view.md)—Binds
  a row access policy to the specified columns in the EVENTS_VIEW.
* [DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(VARCHAR)](../../sql-reference/stored-procedures/snowflake_telemetry_drop_row_access_policy_on_events_view.md)—Deletes
  the specified row access policy bound to the EVENTS_VIEW.

> **Note:**
>
> You must have the [EVENTS_ADMIN role](../../user-guide/security-access-control-overview.md) to execute these procedures.
>
> Using row access policies on the EVENT_VIEW view is an [Enterprise Edition](../../user-guide/intro-editions.md) feature.

## Use a custom event table

To create a new event table, execute the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command and specify a name for the event table.

> **Note:**
>
> If you don’t create an event table, Snowflake uses the default event table to collect telemetry data.

1. Create an event table by executing the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command,
   specifying a name for the event table.
2. Associate the event table with an object by executing the
   [ALTER <object>](../../sql-reference/sql/alter.md) command on the object, setting the [EVENT_TABLE](../../sql-reference/parameters.md) parameter to the name of your event table.

   This sets the scope of data captured by the event table to the object with which you’re associating the table.

### Create an event table

To create an event table, execute the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command.

When you create an event table, you do not specify the columns in the table. An event table already has a set of predefined
columns, as described in [Event table columns](event-table-columns.md).

1. Ensure that you’re using a role that has the CREATE EVENT TABLE [privilege](../../user-guide/security-access-control-privileges.md).
2. Execute the [CREATE EVENT TABLE](../../sql-reference/sql/create-event-table.md) command to create the event table, specifying a name for the event table.

   You use the event table name to associate the table with an object, such as a database.

   For example, to create an event table with the name `my_events`, execute the following statement:

   ```sqlexample
   CREATE EVENT TABLE my_database.my_schema.my_events;
   ```

> **Note:**
>
> Replication of event tables is not currently supported. Any event tables that are contained in primary databases
> are skipped during replication.

### Associate an event table with an object

To specify the object for which an event table is active, execute the [ALTER <object>](../../sql-reference/sql/alter.md) command on the object.

Associating an event table with a database is an [Enterprise Edition](../../user-guide/intro-editions.md) feature.

1. Ensure that you’re using a role that has the required privileges.
2. Execute the [ALTER <object>](../../sql-reference/sql/alter.md) command on the object, setting the [EVENT_TABLE](../../sql-reference/parameters.md) parameter to the name of
   your event table.

   Setting this parameter sets the object as the scope within which events will be collected in the specified event table.

   For example, to associate the event table with a database, use ALTER DATABASE, as in the following example:

   ```sqlexample
   ALTER DATABASE my_database SET EVENT_TABLE = my_database.my_schema.my_events;
   ```

   In this example, Snowflake—depending on how you’ve [specified telemetry levels](telemetry-levels.md)—captures
   telemetry data for procedures and UDFs in `my_database` in the `telemetry_database.telemetry_schema.my_events` event table.

#### Supported objects

The following table lists the objects with which you can associate an event, along with the privileges required to make the association.

| Object | Privileges required | Scope of objects whose data is collected |
| --- | --- | --- |
| Account | * ACCOUNTADMIN role. * OWNERSHIP privilege for the account. * [OWNERSHIP or INSERT privileges for the event table](../../user-guide/security-access-control-privileges.md). | Procedures and UDFs in the account. Use this for the broadest scope. |
| Database | * ACCOUNTADMIN role. * [OWNERSHIP or INSERT privileges for the event table](../../user-guide/security-access-control-privileges.md). | Procedures and UDFs in the specified database. |

An order of precedence determines which event table is used to collect telemetry data for an object. In that precedence order, an event
table associated with a database takes precedence over an event table associated with an account.

* Account » Database

In other words, if you have event tables associated with both your account and a database `my_database`, telemetry data generated by
objects in `my_database` will be collected in the database’s event table. For other databases in the account that don’t have an
associated event table, telemetry data will be collected in the event table associated with the account.

### Set the event table for the account

> **Note:**
>
> To execute this command, you must use the ACCOUNTADMIN role.
>
> In addition, you must have both of the following privileges:
>
> * OWNERSHIP privilege for the account.
> * [OWNERSHIP or INSERT privileges for the event table](../../user-guide/security-access-control-privileges.md).
>
> See the [documentation on the ALTER ACCOUNT command](../../sql-reference/sql/alter-account.md) for more information on the
> privileges needed to execute ALTER ACCOUNT.

For example, to set up the event table named `my_events` in the schema `my_schema` in the database `my_database` as
the active event table for your account, execute the following statement:

```sqlexample
ALTER ACCOUNT SET EVENT_TABLE = my_database.my_schema.my_events;
```

As shown above, you must specify the [fully-qualified name](../../sql-reference/name-resolution.md) of the event table.

To disassociate an event table from an account, execute the ALTER ACCOUNT command and unset the EVENT_TABLE parameter. For example:

```sqlexample
ALTER ACCOUNT UNSET EVENT_TABLE;
```

You can confirm the EVENT_TABLE value with the [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md) command:

```sqlexample
SHOW PARAMETERS LIKE 'event_table' IN ACCOUNT;
```

### Set the event table for a database

To set up the event table named `my_events` in the schema `my_schema` in the database `my_database` as
the active event table for the database `my_database`, execute the following statement:

```sqlexample
ALTER DATABASE my_database SET EVENT_TABLE = my_database.my_schema.my_events;
```

To disassociate an event table from a database, execute the ALTER DATABASE command and unset the EVENT_TABLE parameter. For example:

```sqlexample
ALTER DATABASE my_database UNSET EVENT_TABLE;
```

You can confirm the EVENT_TABLE value with the [SHOW PARAMETERS](../../sql-reference/sql/show-parameters.md) command:

```sqlexample
SHOW PARAMETERS LIKE 'event_table' IN DATABASE my_database;
```

---
title: Examples for common use cases of Snowflake Scripting
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/use-cases.md
section: Developer Guide
---

# Examples for common use cases of Snowflake Scripting

You can write anonymous blocks and stored procedures that use Snowflake Scripting language elements,
data types, and variables for solutions that address common use cases. This topic includes examples of
Snowflake Scripting code for some common use cases.

## Update table data with user input

The following example creates a stored procedure that updates table data with user input. It uses
a [FOR loop](loops.md) to iterate over the rows in a [RESULTSET](resultsets.md)
for the table. The FOR loop contains [conditional logic](branch.md). [Bind variables](variables.md) based on user input determine the exact updates performed by the
stored procedure.

The example uses the following data:

```sqlexample
CREATE OR REPLACE TABLE bonuses (
  emp_id INT,
  performance_rating INT,
  salary NUMBER(12, 2),
  bonus NUMBER(12, 2)
);

INSERT INTO bonuses (emp_id, performance_rating, salary, bonus) VALUES
  (1001, 3, 100000, NULL),
  (1002, 1, 50000, NULL),
  (1003, 4, 75000, NULL),
  (1004, 4, 80000, NULL),
  (1005, 5, 120000, NULL),
  (1006, 2, 60000, NULL),
  (1007, 5, 40000, NULL),
  (1008, 3, 140000, NULL),
  (1009, 1, 95000, NULL);

SELECT * FROM bonuses;
```

```output
+--------+--------------------+-----------+-------+
| EMP_ID | PERFORMANCE_RATING |    SALARY | BONUS |
|--------+--------------------+-----------+-------|
|   1001 |                  3 | 100000.00 |  NULL |
|   1002 |                  1 |  50000.00 |  NULL |
|   1003 |                  4 |  75000.00 |  NULL |
|   1004 |                  4 |  80000.00 |  NULL |
|   1005 |                  5 | 120000.00 |  NULL |
|   1006 |                  2 |  60000.00 |  NULL |
|   1007 |                  5 |  40000.00 |  NULL |
|   1008 |                  3 | 140000.00 |  NULL |
|   1009 |                  1 |  95000.00 |  NULL |
+--------+--------------------+-----------+-------+
```

The following stored procedure uses a FOR loop to iterate over the rows in a RESULTSET for the `bonuses` table.
It applies the bonus as the specified percentage of the salary of each employee with the specified performance
rating. The stored procedure uses conditional logic to apply the bonus only to the employees with the specified
performance rating. It also uses the inputs (`bonus_percentage` and `performance_value`) as bind variables.

```sqlexample
CREATE OR REPLACE PROCEDURE apply_bonus(bonus_percentage INT, performance_value INT)
  RETURNS TEXT
  LANGUAGE SQL
AS
DECLARE
  -- Use input to calculate the bonus percentage
  updated_bonus_percentage NUMBER(2,2) DEFAULT (:bonus_percentage/100);
  --  Declare a result set
  rs RESULTSET;
BEGIN
  -- Assign a query to the result set and execute the query
  rs := (SELECT * FROM bonuses);
  -- Use a FOR loop to iterate over the records in the result set
  FOR record IN rs DO
    -- Assign variable values using values in the current record
    LET emp_id_value INT := record.emp_id;
    LET performance_rating_value INT := record.performance_rating;
    LET salary_value NUMBER(12, 2) := record.salary;
    -- Determine whether the performance rating in the record matches the user input
    IF (performance_rating_value = :performance_value) THEN
      -- If the condition is met, update the bonuses table using the calculated bonus percentage
      UPDATE bonuses SET bonus = ( :salary_value * :updated_bonus_percentage )
        WHERE emp_id = :emp_id_value;
    END IF;
  END FOR;
  -- Return text when the stored procedure completes
  RETURN 'Update applied';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE apply_bonus(bonus_percentage INT, performance_value INT)
  RETURNS TEXT
  LANGUAGE SQL
AS
$$
DECLARE
  -- Use input to calculate the bonus percentage
  updated_bonus_percentage NUMBER(2,2) DEFAULT (:bonus_percentage/100);
  --  Declare a result set
  rs RESULTSET;
BEGIN
  -- Assign a query to the result set and execute the query
  rs := (SELECT * FROM bonuses);
  -- Use a FOR loop to iterate over the records in the result set
  FOR record IN rs DO
    -- Assign variable values using values in the current record
    LET emp_id_value INT := record.emp_id;
    LET performance_rating_value INT := record.performance_rating;
    LET salary_value NUMBER(12, 2) := record.salary;
    -- Determine whether the performance rating in the record matches the user input
    IF (performance_rating_value = :performance_value) THEN
      -- If the condition is met, update the bonuses table using the calculated bonus percentage
      UPDATE bonuses SET bonus = ( :salary_value * :updated_bonus_percentage )
        WHERE emp_id = :emp_id_value;
    END IF;
  END FOR;
  -- Return text when the stored procedure completes
  RETURN 'Update applied';
END;
$$
;
```

To run the stored procedure, specify the bonus percentage and the performance rating. For example, call
the stored procedure and apply a 3% bonus for employees with a performance rating of 5:

```sqlexample
CALL apply_bonus(3, 5);
```

Run a query to show the results:

```sqlexample
SELECT * FROM bonuses;
```

```output
+--------+--------------------+-----------+---------+
| EMP_ID | PERFORMANCE_RATING |    SALARY |   BONUS |
|--------+--------------------+-----------+---------|
|   1001 |                  3 | 100000.00 |    NULL |
|   1002 |                  1 |  50000.00 |    NULL |
|   1003 |                  4 |  75000.00 |    NULL |
|   1004 |                  4 |  80000.00 |    NULL |
|   1005 |                  5 | 120000.00 | 3600.00 |
|   1006 |                  2 |  60000.00 |    NULL |
|   1007 |                  5 |  40000.00 | 1200.00 |
|   1008 |                  3 | 140000.00 |    NULL |
|   1009 |                  1 |  95000.00 |    NULL |
+--------+--------------------+-----------+---------+
```

## Filter and collect data

The following example creates a stored procedure that filters and collects the data in a table.
The procedure inserts rows using the collected data into another table to track historical trends.

The example uses the following data, which tracks the ownership and settings of virtual machines (VMs):

```sqlexample
CREATE OR REPLACE TABLE vm_ownership (
  emp_id INT,
  vm_id VARCHAR
);

INSERT INTO vm_ownership (emp_id, vm_id) VALUES
  (1001, 1),
  (1001, 5),
  (1002, 3),
  (1003, 4),
  (1003, 6),
  (1003, 2);

CREATE OR REPLACE TABLE vm_settings (
  vm_id INT,
  vm_setting VARCHAR,
  value NUMBER
);

INSERT INTO vm_settings (vm_id, vm_setting, value) VALUES
  (1, 's1', 5),
  (1, 's2', 500),
  (2, 's1', 10),
  (2, 's2', 600),
  (3, 's1', 3),
  (3, 's2', 400),
  (4, 's1', 8),
  (4, 's2', 700),
  (5, 's1', 1),
  (5, 's2', 300),
  (6, 's1', 7),
  (6, 's2', 800);

CREATE OR REPLACE TABLE vm_settings_history (
  vm_id INT,
  vm_setting VARCHAR,
  value NUMBER,
  owner INT,
  date DATE
);
```

Assume that a company wants to track the data in this table over time when the values of the settings exceed specific
thresholds. The following stored procedure collects and filters the data in the `vm_settings` table, then inserts rows into
the `vm_settings_history` table when the following conditions are met:

* A `vm_setting` with a value of `s1` is set lower than `5`.
* A `vm_setting` with a value of `s2` is set higher than `500`.

The rows inserted into the `vm_settings_history` table include all of the column values from the `vm_settings`
table, along with the `emp_id` of the employee who owns the VM and the current date.

```sqlexample
CREATE OR REPLACE PROCEDURE vm_user_settings()
  RETURNS VARCHAR
  LANGUAGE SQL
AS
DECLARE
  -- Declare a cursor and a variable
  c1 CURSOR FOR SELECT * FROM vm_settings;
  current_owner NUMBER;
BEGIN
  -- Open the cursor to execute the query and retrieve the rows into the cursor
  OPEN c1;
  -- Use a FOR loop to iterate over the records in the result set
  FOR record IN c1 DO
    -- Assign variable values using values in the current record
    LET current_vm_id NUMBER := record.vm_id;
    LET current_vm_setting VARCHAR := record.vm_setting;
    LET current_value NUMBER := record.value;
    -- Assign a value to the current_owner variable by querying the vm_ownership table
    SELECT emp_id INTO :current_owner
      FROM vm_ownership
      WHERE vm_id = :current_vm_id;
    -- If the record has a vm_setting equal to 's1', determine whether its value is less than 5
    IF (current_vm_setting = 's1' AND current_value < 5) THEN
      -- If the condition is met, insert a row into the vm_settings_history table
      INSERT INTO vm_settings_history VALUES (
        :current_vm_id,
        :current_vm_setting,
        :current_value,
        :current_owner,
        SYSDATE());
    -- If the record has a vm_setting equal to 's2', determine whether its value is greater than 500
    ELSEIF (current_vm_setting = 's2' AND current_value > 500) THEN
      -- If the condition is met, insert a row into the vm_settings_history table
      INSERT INTO vm_settings_history VALUES (
        :current_vm_id,
        :current_vm_setting,
        :current_value,
        :current_owner,
        SYSDATE());
    END IF;
  END FOR;
  -- Close the cursor
  CLOSE c1;
  -- Return text when the stored procedure completes
  RETURN 'Success';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE vm_user_settings()
  RETURNS VARCHAR
  LANGUAGE SQL
AS
$$
DECLARE
  -- Declare a cursor and a variable
  c1 CURSOR FOR SELECT * FROM vm_settings;
  current_owner NUMBER;
BEGIN
  -- Open the cursor to execute the query and retrieve the rows into the cursor
  OPEN c1;
  -- Use a FOR loop to iterate over the records in the result set
  FOR record IN c1 DO
    -- Assign variable values using values in the current record
    LET current_vm_id NUMBER := record.vm_id;
    LET current_vm_setting VARCHAR := record.vm_setting;
    LET current_value NUMBER := record.value;
    -- Assign a value to the current_owner variable by querying the vm_ownership table
    SELECT emp_id INTO :current_owner
      FROM vm_ownership
      WHERE vm_id = :current_vm_id;
    -- If the record has a vm_setting equal to 's1', determine whether its value is less than 5
    IF (current_vm_setting = 's1' AND current_value < 5) THEN
      -- If the condition is met, insert a row into the vm_settings_history table
      INSERT INTO vm_settings_history VALUES (
        :current_vm_id,
        :current_vm_setting,
        :current_value,
        :current_owner,
        SYSDATE());
    -- If the record has a vm_setting equal to 's2', determine whether its value is greater than 500
    ELSEIF (current_vm_setting = 's2' AND current_value > 500) THEN
      -- If the condition is met, insert a row into the vm_settings_history table
      INSERT INTO vm_settings_history VALUES (
        :current_vm_id,
        :current_vm_setting,
        :current_value,
        :current_owner,
        SYSDATE());
    END IF;
  END FOR;
  -- Close the cursor
  CLOSE c1;
  -- Return text when the stored procedure completes
  RETURN 'Success';
END;
$$;
```

Run the stored procedure:

```sqlexample
CALL vm_user_settings();
```

You can see the data that the procedure inserted into the `vm_settings_history` table by running
the following query:

```sqlexample
SELECT * FROM vm_settings_history ORDER BY vm_id;
```

```output
+-------+------------+-------+-------+------------+
| VM_ID | VM_SETTING | VALUE | OWNER | DATE       |
|-------+------------+-------+-------+------------|
|     2 | s2         |   600 |  1003 | 2024-04-01 |
|     3 | s1         |     3 |  1002 | 2024-04-01 |
|     4 | s2         |   700 |  1003 | 2024-04-01 |
|     5 | s1         |     1 |  1001 | 2024-04-01 |
|     6 | s2         |   800 |  1003 | 2024-04-01 |
+-------+------------+-------+-------+------------+
```

---
title: Examples of using Git with Snowflake
source: https://docs.snowflake.com/en/developer-guide/git/git-examples.md
section: Developer Guide
---

# Examples of using Git with Snowflake

Examples in this topic describe how to use files from a remote Git repository when developing Snowflake applications and how to execute SQL
scripts in a Git repository clone.

Be sure to see the following, which describe other ways to interact with a Git repository clone.

* [Sync Streamlit in Snowflake apps with a Git repository](../streamlit/features/git-integration.md)
* [Sync notebooks with a Git repository](../../user-guide/ui-snowsight/notebooks-snowgit.md)

## Use a Git repository file as a stored procedure handler

After you’ve [set up integration between Snowflake and your remote Git repository](git-setting-up.md),
you can use files from the repository as handler code in stored procedures and UDFs. Note that,
[as with staged handlers](../inline-or-staged.md), you must qualify the handler function name with the name of its containing
class or module.

This example describes how to use Python handler code from the repository in a stored procedure.

### Code required by this example

The handler in this example depends on a database created with SQL code similar to the following:

```sqlexample
CREATE DATABASE example_db;
USE DATABASE example_db;
CREATE SCHEMA example_schema;
USE SCHEMA example_schema;

CREATE OR REPLACE TABLE employees(id NUMBER, name VARCHAR, role VARCHAR);
INSERT INTO employees (id, name, role) VALUES (1, 'Alice', 'op'), (2, 'Bob', 'dev'), (3, 'Cindy', 'dev');
```

The example uses the following Python handler code contained in `filter.py`:

```python
from snowflake.snowpark.functions import col

def filter_by_role(session, table_name, role):
  df = session.table(table_name)
  return df.filter(col("role") == role)
```

### Commit the file and refresh the Git repository clone

1. From your Git client, add the code to the remote repository.

   Code in the following example uses the git command-line tool to add and commit the handler file to the local repository, then push it
   to the remote repository referenced by the Git repository clone in Snowflake:

   ```bash
   $ git add python-handlers/filter.py
   $ git commit -m "Adding code to filter by role"
   $ git push
   ```
2. In Snowflake, refresh the Git repository clone.

   Assuming you’ve [set up integration between Snowflake and your remote Git repository](git-setting-up.md),
   resulting in a Git repository clone in Snowflake, you can refresh the Git repository clone by fetching from the remote repository.

   Using Snowflake to refresh from your remote repository is similar to working with other Git client tools, where you fetch from the remote
   repository before beginning work to ensure that you have the latest changes.

   Code in the following example executes the [ALTER GIT REPOSITORY](../../sql-reference/sql/alter-git-repository.md) command to
   retrieve the latest changes from the remote repository. The code generates a full clone that includes branches, tags, and commits.

   ```sqlexample
   ALTER GIT REPOSITORY snowflake_extensions FETCH;
   ```

### Create and execute a procedure that uses the file in the Git repository clone

1. In Snowflake, write the procedure.

   When you write a procedure, you can reference its handler code at the code file’s location in the Git repository clone in Snowflake.
   For example, to refer to a file `python-handlers/filter.py` in the main branch of a remote repository synchronized to a Git repository
   clone called `snowflake_extensions`, you would use syntax similar to the following:

   ```none
   @snowflake_extensions/branches/main/python-handlers/filter.py
   ```

   Code in the following example creates a procedure called `filter_by_role`, specifying handler code stored in the Git repository clone:

   ```sqlexample
   CREATE OR REPLACE PROCEDURE filter_by_role(tableName VARCHAR, role VARCHAR)
     RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
     LANGUAGE PYTHON
     RUNTIME_VERSION = '3.12'
     PACKAGES = ('snowflake-snowpark-python')
     IMPORTS = ('@example_db.example_schema.snowflake_extensions/branches/main/python-handlers/filter.py')
     HANDLER = 'filter.filter_by_role';
   ```
2. Execute the procedure.

   The following code executes the procedure.

   ```sqlexample
   CALL filter_by_role('employees', 'dev');
   ```

   The following is an example of output from the procedure.

   ```output
   ---------------------
   | ID | NAME  | ROLE |
   ---------------------
   | 2  | Bob   | dev  |
   ---------------------
   | 3  | Cindy | dev  |
   ---------------------
   ```

## Use a Git repository clone file to configure new accounts

This example describes how to execute a SQL script contained in a Git repository clone in Snowflake. The script in the example creates a
user and role.

This example uses the [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) command to execute the SQL statements contained in a file in
the Git repository clone.

With EXECUTE IMMEDIATE FROM, you can execute (from any Snowflake session) scripts you manage in your remote Git repository. For
example, you might have a script that sets up every new Snowflake account in your organization. The script might contain statements
to create users, roles, objects, and grant privileges on the account and objects.

1. Create the file `setup.sql` with the following contents:

   ```sqlexample
   CREATE ROLE analyst;

   CREATE USER gladys;

   GRANT ROLE analyst TO USER gladys;

   SHOW GRANTS TO USER gladys;
   ```
2. Commit your SQL file to your remote Git repository.

   Use the git command-line tool to commit the file to your remote Git repository:

   ```bash
   git add scripts/setup.sql
   git commit -m "Adding code to set up new accounts"
   git push
   ```

   For detailed instructions, see Commit the file and refresh the Git repository clone.
3. Refresh the Git repository clone.

   Refresh the Git repository clone `configuration_repo`:

   ```sqlexample
   ALTER GIT REPOSITORY configuration_repo FETCH;
   ```

   For detailed instructions, see Commit the file and refresh the Git repository clone.
4. In Snowflake, execute the file in your Git repository clone:

   > **Note:**
   >
   > The user executing the following statement must use a role that has the required privileges to execute all statements in the file.
   > For more information, see [Access control requirements](../../sql-reference/sql/execute-immediate-from.md).

   ```sqlexample
   EXECUTE IMMEDIATE FROM @configuration_repo/branches/main/scripts/setup.sql;
   ```

   The EXECUTE IMMEDIATE FROM commands [returns](../../sql-reference/sql/execute-immediate-from.md) the results of the last SQL statement
   in the file:

   ```output
   +-------------------------------+---------+------------+--------------+--------------+
   | created_on                    | role    | granted_to | grantee_name | granted_by   |
   |-------------------------------+---------+------------+--------------+--------------|
   | 2023-07-24 22:07:04.354 -0700 | ANALYST | USER       | GLADYS       | ACCOUNTADMIN |
   +-------------------------------+---------+------------+--------------+--------------+
   ```

---
title: Executing a UDF
source: https://docs.snowflake.com/en/developer-guide/udf/udf-calling-sql.md
section: Developer Guide
---

# Executing a UDF

You can execute a user-defined function (UDF) or user-defined table function (UDTF) in the same way that you execute other functions.

## Tools for executing UDFs

Choose the tool for executing the function.

| Language | Approach |
| --- | --- |
| **SQL**  Execute a SQL command, such as by using Snowsight. | Execute the SQL SELECT command to execute a UDF. |
| **Java, Python, or Scala with Snowpark**  Write code locally in one of the supported languages, having the code execute in Snowflake. | Execute client code that uses Snowpark APIs in one of the following languages.   * [Java](../snowpark/java/calling-functions.md) * [Python](../snowpark/python/calling-functions.md) * [Scala](../snowpark/scala/calling-functions.md) |
| **Command line**  Create and manage Snowflake entities by executing commands from the command line. | Execute commands of the [Snowflake CLI](../snowflake-cli/index.md):   * [To execute SQL commands](../snowflake-cli/command-reference/sql-commands/sql.md). * [To execute Snowpark commands](../snowflake-cli/command-reference/snowpark-commands/execute.md). |
| **Python**  On the client, write code that executes management operations on Snowflake. | Execute code that uses the [Snowflake Python API](../snowflake-python-api/snowflake-python-managing-functions-procedures.md). |
| **RESTful APIs** (language-agnostic)  Make requests of RESTful endpoints to create and manage Snowflake entities. | Make a request to create a function using the [Snowflake REST API](../snowflake-rest-api/user-defined-function/user-defined-function-introduction.md) |

## Calling a UDF with SQL

In general, you call a UDF same way that you call other functions.

If a UDF has arguments, you can specify those arguments by name and by position.

For example, the following UDF accepts three arguments:

```sqlexample
CREATE OR REPLACE FUNCTION udf_concatenate_strings(
    first_arg VARCHAR,
    second_arg VARCHAR,
    third_arg VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SQL
  AS
  $$
    SELECT first_arg || second_arg || third_arg
  $$;
```

When calling the UDF, you can specify the arguments by name:

```sqlexample
SELECT udf_concatenate_strings(
  first_arg => 'one',
  second_arg => 'two',
  third_arg => 'three');
```

```output
+--------------------------+
| UDF_CONCATENATE_STRINGS( |
|   FIRST_ARG => 'ONE',    |
|   SECOND_ARG => 'TWO',   |
|   THIRD_ARG => 'THREE')  |
|--------------------------|
| onetwothree              |
+--------------------------+
```

If you specify the arguments by name, you don’t need to specify the arguments in any particular order:

```sqlexample
SELECT udf_concatenate_strings(
  third_arg => 'three',
  first_arg => 'one',
  second_arg => 'two');
```

```output
+--------------------------+
| UDF_CONCATENATE_STRINGS( |
|   THIRD_ARG => 'THREE',  |
|   FIRST_ARG => 'ONE',    |
|   SECOND_ARG => 'TWO')   |
|--------------------------|
| onetwothree              |
+--------------------------+
```

You can also specify the arguments by position:

```sqlexample
SELECT udf_concatenate_strings(
  'one',
  'two',
  'three');
```

```output
+--------------------------+
| UDF_CONCATENATE_STRINGS( |
|   'ONE',                 |
|   'TWO',                 |
|   'THREE')               |
|--------------------------|
| onetwothree              |
+--------------------------+
```

You can also specify the arguments by both position and name:

```sqlexample
SELECT udf_concatenate_strings(
  'one',
  'two',
  third_arg => 'three');
```

```output
+--------------------------+
| UDF_CONCATENATE_STRINGS( |
|   'ONE',                 |
|   'TWO',                 |
|   THIRD_ARG => 'THREE')  |
|--------------------------|
| onetwothree              |
+--------------------------+
```

> **Note:**
>
> * When you mix arguments by position and by name, all of the positional arguments must come before
>   all of the named arguments.
> * When you specify an argument by name, you can’t use double quotes around the argument name.
>
> * If two functions have the same name but different argument types, you can use the argument names to specify
>   which function to execute, if the argument names are different. For more information, see
>   [Overloading procedures and functions](../udf-stored-procedure-naming-conventions.md).

### Calling a UDF that has optional arguments

If the UDF has [optional arguments](../udf-stored-procedure-arguments.md), you can omit the optional arguments in
the call. Each optional argument has a default value that is used when the argument is omitted.

For example, the following UDF has one required argument and two optional arguments. Each optional argument has a default value.

```sqlexample
CREATE OR REPLACE FUNCTION build_string_udf(
    word VARCHAR,
    prefix VARCHAR DEFAULT 'pre-',
    suffix VARCHAR DEFAULT '-post'
  )
  RETURNS VARCHAR
  AS
  $$
    SELECT prefix || word || suffix
  $$
  ;
```

You can omit any of the optional arguments in the call. When you omit an argument, the default value of the argument is used.

```sqlexample
SELECT build_string_udf('hello');
```

```output
+---------------------------+
| BUILD_STRING_UDF('HELLO') |
|---------------------------|
| pre-hello-post            |
+---------------------------+
```

```sqlexample
SELECT build_string_udf('hello', 'before-');
```

```output
+--------------------------------------+
| BUILD_STRING_UDF('HELLO', 'BEFORE-') |
|--------------------------------------|
| before-hello-post                    |
+--------------------------------------+
```

If you need to omit an optional argument and specify another optional argument that appears after the omitted argument in the
signature, use named arguments, rather than positional arguments.

For example, suppose that you want to omit the `prefix` argument and specify the `suffix` argument. The `suffix` argument
appears after the `prefix` in the signature, so you must specify the arguments by name:

```sqlexample
SELECT build_string_udf(word => 'hello', suffix => '-after');
```

```output
+-------------------------------------------------------+
| BUILD_STRING_UDF(WORD => 'HELLO', SUFFIX => '-AFTER') |
|-------------------------------------------------------|
| pre-hello-after                                       |
+-------------------------------------------------------+
```

### Calling a UDTF

You can call a UDTF the way you would call any table function. When calling a UDTF in the FROM clause of a query, specify the
UDTF’s name and arguments inside the parentheses that follow the TABLE keyword, as you would when
[calling a built-in table function](../../sql-reference/functions-table.md).

In other words, use a form such as the following for the TABLE keyword when calling a UDTF:

```sqlexample
SELECT ...
  FROM TABLE ( udtf_name (udtf_arguments) )
```

Code in the following example calls the `my_java_udtf` table function, specifying a DATE literal in the argument
`'2021-01-16'::DATE`.

```sqlexample
SELECT ...
  FROM TABLE(my_java_udtf('2021-01-16'::DATE));
```

The argument to a table function can be an expression, not just a literal. For example, a table function can be called using
a column from a table. Some examples are below, including in the Examples section.

As is the case with calling UDFs, you can specify the arguments by name or by position.

For more information about table functions in general, see [table function](../../sql-reference/functions-table.md).

> **Note:**
>
> You cannot call a UDF within the DEFAULT clause of a CREATE TABLE statement.

#### Using a table or UDTF as input to a UDTF

The input to a table function can come from a table or from another UDTF, as documented in
[Using a table as input to a table function](../../sql-reference/functions-table.md).

The example below shows how to use a table to provide input to the UDTF `split_file_into_words`:

```sqlexample
create table file_names (file_name varchar);
insert into file_names (file_name) values ('sample.txt'),
                                          ('sample_2.txt');

select f.file_name, w.word
   from file_names as f, table(split_file_into_words(f.file_name)) as w;
```

The output looks similar to the following:

```sqlexample
+-------------------+------------+
| FILE_NAME         | WORD       |
+-------------------+------------+
| sample_data.txt   | some       |
| sample_data.txt   | words      |
| sample_data_2.txt | additional |
| sample_data_2.txt | words      |
+-------------------+------------+
```

The IMPORTS clause of the UDTF must specify the name and path of each file passed to the UDTF. For example:

```sqlexample
create function split_file_into_words(inputFileName string)
    ...
    imports = ('@inline_jars/sample.txt', '@inline_jars/sample_2.txt')
    ...
```

Each file must already have been copied to a stage (in this case, the stage named `@inline_jars`) before the UDTF reads the file.

For an example of using a UDTF as an input to another UDTF, see [Extended examples using table values and other UDTFs as input](javascript/udf-javascript-tabular-functions.md) in
the JavaScript UDTF documentation.

#### Table functions and partitions

Before rows are passed to table functions, the rows can be grouped into *partitions*. Partitioning has two main benefits:

* Partitioning allows Snowflake to divide up the workload to improve parallelization and thus performance.
* Partitioning allows Snowflake to process all rows with a common characteristic as a group.
  You can return results that are based on all rows in the group, not just on individual rows.

For example, you might partition stock price data into one group per stock. All stock prices for an individual company can be
analyzed together, while stock prices for each company can be analyzed independently of any other company.

Data can be partitioned explicitly or implicitly.

##### Explicit partitioning

**Explicit Partitioning into Multiple Groups**

The following statement calls the UDTF named `my_udtf` on individual partitions. Each partition contains all rows for which
the `PARTITION BY` expression evaluates to the same value (e.g. the same company or stock symbol).

```sqlexample
SELECT *
    FROM stocks_table AS st,
         TABLE(my_udtf(st.symbol, st.transaction_date, st.price) OVER (PARTITION BY st.symbol))
```

**Explicit Partitioning into a Single Group**

The following statement calls the UDTF named `my_udtf` on one partition. The `PARTITION BY <constant>` clause
(in this case `PARTITION BY 1`) puts all rows in the same partition.

```sqlexample
SELECT *
    FROM stocks_table AS st,
         TABLE(my_udtf(st.symbol, st.transaction_date, st.price) OVER (PARTITION BY 1))
```

For a more complete and realistic example, see [Examples of calling Java UDTFs in queries](java/udf-java-tabular-functions.md), in particular the subsection
titled [Single Partition](java/udf-java-tabular-functions.md).

**Sorting Rows for Partitions**

To process each partition’s rows in a specified order, include an ORDER BY clause. This tells Snowflake to pass the rows
to the per-row handler method in the specified order.

For example, if you want to calculate the moving average of a stock price over time, then order the stock prices by timestamp (and
partition by stock symbol). The following example shows how to do this:

```sqlexample
SELECT *
     FROM stocks_table AS st,
          TABLE(my_udtf(st.symbol, st.transaction_date, st.price) OVER (PARTITION BY st.symbol ORDER BY st.transaction_date))
```

An OVER clause can contain an ORDER BY clause even without a PARTITION BY clause.

Remember that including an ORDER BY clause inside an OVER clause is not the same as putting an ORDER BY clause at the
outermost level of the query. If you want the entire query results to be ordered, you need a separate ORDER BY clause. For
example:

```sqlexample
SELECT *
    FROM stocks_table AS st,
         TABLE(my_udtf(st.symbol, st.transaction_date, st.price) OVER (PARTITION BY st.symbol ORDER BY st.transaction_date))
    ORDER BY st.symbol, st.transaction_date, st.transaction_time;
```

**Usage Notes for Explicit Partitioning**

When using a UDTF with a PARTITION BY clause, the PARTITION BY clause must use a column reference or a literal,
not a general expression. For example, the following is not allowed:

```sqlexample
SELECT * FROM udtf_table, TABLE(my_func(col1) OVER (PARTITION BY udtf_table.col2 * 2));   -- NO!
```

##### Implicit partitioning

If a table function does not explicitly partition the rows by using a PARTITION BY clause, then Snowflake typically partitions
the rows implicitly to use parallel processing to improve performance.

The number of partitions is typically based on factors such as the size of the warehouse processing the function and the
cardinality of the input relation. The rows are typically assigned to specific partitions based on factors such
as physical location of the rows (e.g. by micro-partition), so the partition grouping has no meaning.

---
title: Executing Snowflake SQL with Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-snowflake-sql.md
section: Developer Guide
---

# Executing Snowflake SQL with Snowpark Connect for Spark

To execute SQL commands specific to Snowflake, you can use the `SnowflakeSession` interface. As with the `spark.sql` method,
query results are returned as Spark DataFrames with which you can continue applying or chaining Spark DataFrame transformations and
actions on the resulting data.

With most SQL operations, you can use the `spark.sql` method to execute SQL statements directly and retrieve the results as Spark
DataFrames. However, some parts of Snowflake SQL syntax—including QUALIFY, CONNECT BY, LATERAL FLATTEN, and time travel queries—are
not compatible with Spark SQL.

The following example shows how to use `SnowflakeSession` to execute a Snowflake SQL command that includes the CONNECT BY clause.

```python
import snowflake.snowpark_connect
from snowflake.snowpark_connect.snowflake_session import SnowflakeSession

spark = snowflake.snowpark_connect.init_spark_session()
snowflake_session = SnowflakeSession(spark)
result_df = snowflake_session.sql("""
  SELECT
  employee_name,
  manager_name,
  LEVEL
FROM employees
START WITH employee_name = 'Alice'
CONNECT BY PRIOR manager_name = employee_name
""").show()
result_df.limit(1).show()
```

You can also use the `SnowflakeSession` interface to execute configuration directives specific to Snowflake. These directives
include setting session-level parameters such as the active database, schema, or warehouse.

The following example shows how to use `SnowflakeSession` to set session-level parameters.

```python
import snowflake.snowpark_connect
from snowflake.snowpark_connect.client import SnowflakeSession

spark = snowflake.snowpark_connect.init_spark_session()
snowflake_session = SnowflakeSession(spark)

snowflake_session.use_database("MY_DATABASE")
snowflake_session.use_schema("MY_SCHEMA")
snowflake_session.use_warehouse("MY_WH")
snowflake_session.use_role("PUBLIC")
```

---
title: Executing statements
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-execute.md
section: Developer Guide
---

# Executing statements

Statements can be executed by calling the `connection.execute()` method. The `execute()` method accepts an `options`
object that can be used to specify the SQL text and a `complete` callback.
The `complete` callback is invoked when a statement has finished executing and the result is ready to be consumed:

> ```javascript
> const statement = connection.execute({
>   sqlText: 'CREATE DATABASE testdb',
>   complete: function (err, stmt, rows) {
>     if (err) {
>       console.error(`Failed to execute statement due to the following error: ${err.message}`);
>     } else {
>       console.log(`Successfully executed statement: ${stmt.getSqlText()}`);
>     }
>   }
> });
> ```

> **Note:**
>
> The maximum payload size of a single request is 128 MB.

## Execute queries asynchronously

The Snowflake Node.js Driver supports asynchronous queries (that is, queries that return control to the user before the query completes). You can start a query, then use polling to determine when the query has completed. After the query completes, you can read the result set.

You enable asynchronous queries by including `asyncExec: true` in the `connection.execute` method.

The following example shows how to execute queries asynchronously using a `Promise`.

```javascript
let queryId;

// 1. Execute query with asyncExec set to true
await new Promise((resolve) => {
  connection.execute({
    sqlText: "CALL SYSTEM$WAIT(3, 'SECONDS')",
    asyncExec: true,
    complete: async function (err, stmt, rows) {
      queryId = stmt.getQueryId(); // Get the query ID
      resolve();
    }
  });
});

// 2. Get results using the query ID
const statement = await connection.getResultsFromQueryId({ queryId: queryId });
await new Promise((resolve, reject) => {
  const stream = statement.streamRows();
  stream.on('error', err => {
    reject(err);
  });
  stream.on('data', row => {
    console.log(row);
  });
  stream.on('end', () => {
    resolve();
  });
});
```

You can also use callbacks to monitor asynchronous queries, as shown in the following example.

1. Enable asynchronous queries by including `asyncExec: true` in the `connection.execute` method.

   > ```javascript
   > // 1. Execute query with asyncExec set to true
   > connection.execute({
   >   sqlText: "CALL SYSTEM$WAIT(3, 'SECONDS')",
   >   asyncExec: true,
   >   complete: async function (err, stmt, rows) {
   >     const queryId = stmt.getQueryId();
   >
   >     // 2. Get results using the query ID
   >     connection.getResultsFromQueryId({
   >       queryId: queryId,
   >       complete: async function (err, _stmt, rows) {
   >         console.log(rows);
   >       }
   >     });
   >   }
   > });
   > ```
2. Check on the status of the query, which was submitted to be executed asynchronously.

   > ```javascript
   > let queryId;
   >
   > // 1. Execute query with asyncExec set to true
   > await new Promise((resolve, reject) => {
   >   const statement = connection.execute({
   >     sqlText: "CALL SYSTEM$WAIT(3, 'SECONDS')",
   >     asyncExec: true,
   >     complete: async function (err, stmt, rows) {
   >       queryId = statement.getQueryId();
   >       resolve();
   >     }
   >   });
   > });
   >
   > // 2. Check query status until it's finished executing
   > const seconds = 2;
   > let status = await connection.getQueryStatus(queryId);
   > while (connection.isStillRunning(status)) {
   >   console.log(`Query status is ${status}, timeout for ${seconds} seconds`);
   >
   >   await new Promise((resolve) => {
   >     setTimeout(() => resolve(), 1000 * seconds);
   >   });
   >
   >   status = await connection.getQueryStatus(queryId);
   > }
   >
   > console.log(`Query has finished executing, status is ${status}`);
   > ```

## Execute a batch of SQL statements (multi-statement support)

With version 1.6.18 and later of the Node.js connector, you can send
a batch of SQL statements separated by semicolons to be executed in a single request.

> **Note:**
>
> * Executing multiple statements in a single query requires that a valid warehouse is available in a session.
> * By default, Snowflake returns an error for queries issued with multiple statements to protect against SQL injection attacks.
>   Executing multiple statements in a single query increases the risk of SQL injection. Snowflake recommends using it sparingly.
>   You can reduce the risk by using the `MULTI_STATEMENT_COUNT` parameter to specify the number of statements to be executed, which makes it more difficult to inject a statement by appending to it.
>
> For more information about these types of attacks, see [SQL injection](https://en.wikipedia.org/wiki/SQL_injection).

You can execute multiple statements as a batch in the same way you execute queries with single statements, except that the query string
contains multiple statements separated by semicolons. Note that multiple statements execute sequentially, not in parallel.
The `MULTI_STATEMENT_COUNT` parameter specifies the exact number of statements the batch contains.

For example, if you set `MULTI_STATEMENT_COUNT=3`, a batch statement must include precisely three
statements. If you submit a batch statement with any other number of statements, the Node.js driver rejects the request. You can set
`MULTI_STATEMENT_COUNT=0` to allow batch queries to contain any number of statements. However, be aware that using this value
reduces the protection against SQL injection attacks.

You can set this parameter at the session level using the following command, or you can set the value
separately each time you submit a query.

```sqlsyntax
ALTER SESSION SET multi_statement_count = <n>
```

By setting the value the session level, you do not need to set it when you execute each time you execute a batch statement.
The following example sets the number of statements at the session level to three and then executes three SQL statements:

> ```javascript
> const statement = connection.execute({
>   sqlText: 'ALTER SESSION SET multi_statement_count=0',
>   complete: function (err, stmt, rows) {
>     if (err) {
>       console.error(`Failed to execute statement due to the following error: ${err.message}`);
>     } else {
>       testMulti();
>     }
>   }
> });
>
> function testMulti() {
>   console.log('select bind execute.');
>   const selectStatement = connection.execute({
>     sqlText: 'create or replace table test(n int); insert into test values(1), (2); select * from test order by n',
>     complete: function (err, stmt, rows) {
>       if (err) {
>         console.error(`Failed to execute statement due to the following error: ${err.message}`);
>       } else {
>         console.log('==== complete');
>         console.log(`==== sqlText=${stmt.getSqlText()}`);
>         if (stmt.hasNext()) {
>           stmt.NextResult();
>         } else {
>           // do something else, for example close the connection
>         }
>       }
>     }
>   });
> }
> ```

You can also set the number of statements in a batch each time you execute a multi-statement query by setting
`MULTI_STATEMENT_COUNT` as a parameter for the `connection.execute` function. The following example sets the number of
statements to three for the batch and includes three SQL statements in the batch query:

> ```javascript
> // connection needs to be already set up
> connection.connect((err, conn) => {
>   if (err) {
>     console.error(`Unable to connect: ${err.message}`);
>   } else {
>     console.log(`Successfully connected to Snowflake, connection id ${conn.getId()}`);
>     testMulti();
>   }
> });
>
> function testMulti() {
>   console.log('execute multi-statement query');
>   connection.execute({
>     sqlText: 'create or replace table test(n int); insert into test values(1), (2); select * from test order by n',
>     parameters: { MULTI_STATEMENT_COUNT: 3 },
>     complete: function (err, stmt, rows) {
>       if (err) {
>         console.error(`Failed to execute statement: ${err.message}`);
>       } else {
>         console.log('==== complete');
>         console.log(`==== sqlText=${stmt.getSqlText()}`);
>         if (rows) {
>           const stream = stmt.streamRows();
>           console.log(`====QueryId=${stmt.getQueryId()}`);
>
>           stream.on('data', row => {
>             console.log(row);
>           });
>           stream.on('end', () => {
>             console.log('done');
>           });
>         }
>
>         if ('hasNext' in stmt && stmt.hasNext()) {
>           stmt.NextResult();
>         } else {
>           connection.destroy(err1 => {
>             if (err1) {
>               console.error(`Unable to disconnect: ${err1.message}`);
>             } else {
>               console.log(`Disconnected connection with id: ${connection.getId()}`);
>             }
>           });
>         }
>       }
>     }
>   });
> }
> ```

## Binding statement parameters

Occasionally, you might want to [bind](../../sql-reference/bind-variables.md) data in a statement with a placeholder.
Executing statements in this manner is useful because it helps prevent SQL injection attacks. Consider the
following statement:

> ```javascript
> connection.execute({
>   sqlText: 'SELECT c1 FROM (SELECT 1 AS c1 UNION ALL SELECT 2 AS c1) WHERE c1 = 1;'
> });
> ```

You can achieve the same result using the following bindings:

> ```javascript
> connection.execute({
>   sqlText: 'SELECT c1 FROM (SELECT :1 AS c1 UNION ALL SELECT :2 AS c1) WHERE c1 = :1;',
>   binds: [1, 2]
> });
> ```

The `?` syntax for bindings is also supported:

> ```javascript
> connection.execute({
>   sqlText: 'SELECT c1 FROM (SELECT ? AS c1 UNION ALL SELECT ? AS c1) WHERE c1 = ?;',
>   binds: [1, 2, 1]
> });
> ```

> **Note:**
>
> There is an upper limit to the size of data that you can bind, or that you can combine in a batch.
> For details, see [Limits on Query Text Size](../../user-guide/query-size-limits.md).

## Binding an array for bulk insertions

Binding an array of data is supported for bulk INSERT operation. Pass an array of arrays as follows:

> ```javascript
> connection.execute({
>   sqlText: 'INSERT INTO t(c1, c2, c3) values(?, ?, ?)',
>   binds: [[1, 'string1', 2.0], [2, 'string2', 4.0], [3, 'string3', 6.0]]
> });
> ```

> **Note:**
>
> Binding a large array will impact performance and might be rejected if the size of data is too large to be handled by the server.

You can also bind arrays of `VARIANT` data. To illustrate, assume you create a table with a column of `VARIANT` data, as follows:

```sqlexample
create or replace table test(id int, foo variant);
```

You could then execute the following script:

```javascript
// standard stuff like defining connection, etc
const statement = connection.execute({
  // table columns are id: int, foo: variant
  sqlText: 'insert into test_db.public.test select value:id, value:foo from table(flatten(parse_json(?)))',
  binds: [JSON.stringify([
    { id: 1, foo: [{ a: '1', b: '2' }] },
    { id: 2, foo: [{ c: '3', d: '4' }] }
  ])],
  complete: function (err, stmt, rows) {
    if (err) {
      console.error(`Failed to execute statement due to the following error: ${err.message}`);
    } else {
      console.log(`[queryID ${statement.getStatementId()}, requestId ${statement.getRequestId()}] Number of rows produced: ${rows.length}`);
      // rest of the code
    }
  }
});
```

## Canceling statements

A statement can be canceled by calling the `statement.cancel()` method:

> ```javascript
> statement.cancel((err, stmt) => {
>   if (err) {
>     console.error(`Unable to abort statement due to the following error: ${err.message}`);
>   } else {
>     console.log('Successfully aborted statement');
>   }
> });
> ```

## Resubmitting requests

If you are unsure whether Snowflake successfully executed an SQL statement, perhaps due to a network error or timeout,
you can resubmit the same statement using its request ID. For example, suppose you submit an INSERT command to add data but
did not receive an acknowledgement in a timely
manner, so you don’t know what happened with the command. In this scenario, you don’t just want to execute the same
command as a new command
because it could result in executing the command twice, producing data duplication.

By including the request ID in the SQL statement,
you can avoid the potential for data duplication. Resubmitting the request with the request ID from the
initial request ensures that the resubmitted command executes only if the initial request failed. For more information, refer to
[Resubmitting a request to execute SQL statements](../sql-api/submitting-requests.md).

> **Note:**
>
> To resubmit a query using a request ID, you must use the same connection that generated the request ID. If you want to
> retrieve the result of query from a different connection, refer to [RESULT_SCAN](../../sql-reference/functions/result_scan.md).

The following code samples demonstrate how you can save and use a request ID to resubmit a statement. When you execute a statement,
you can use the `getRequestId()` function to retrieve the ID of the submitted request. You can then use that ID to execute the same
statement at a later time. The following example executes an INSERT statement and saves its request ID in the `requestId` variable.

> ```javascript
> let requestId;
> connection.execute({
>   sqlText: 'INSERT INTO testTable VALUES (1);',
>   complete: function (err, stmt, rows) {
>     const stream = stmt.streamRows();
>     requestId = stmt.getRequestId(); // Retrieves the request ID
>     stream.on('data', row => {
>       console.log(row);
>     });
>     stream.on('end', () => {
>       console.log('done');
>     });
>   }
> });
> ```

If you do not receive an acknowledgement that the command executed successfully, you can resubmit the request using the saved
request ID as shown below.

> ```javascript
> connection.execute({
>   sqlText: 'INSERT INTO testTable VALUES (1);',  // optional
>   requestId: requestId,  // Uses the request ID from before
>   complete: function (err, stmt, rows) {
>     const stream = stmt.streamRows();
>     stream.on('data', row => {
>       console.log(row);
>     });
>     stream.on('end', () => {
>       console.log('done');
>     });
>   }
> });
> ```

If you choose to resubmit a request with a `requestId` and `sqlText`, be aware of the following interactions:

* If the `requestId` already exists, meaning it matches a previous request, the command ignores the `sqlText` query and resubmits
  the query from the original command.
* If the `requestId` does not exist, meaning it does not match a previous request, the command executes the `sqlText` query.

---
title: Extending Snowflake with Functions and Procedures
source: https://docs.snowflake.com/en/developer-guide/extensibility.md
section: Developer Guide
---

# Extending Snowflake with Functions and Procedures

You can extend the SQL you use in Snowflake by writing user-defined functions (UDFs) and stored procedures that you can call from SQL. When
you write a UDF or procedure, you write its logic in one of the supported handler languages, then create it using SQL.

With a UDF, you calculate and return a value. With a stored procedure, you generally perform one or more operations by executing
statements in SQL or another supported language.

You can also write an external function whose logic executes on a system external to Snowflake, such as a cloud provider.

[Choosing whether to write a stored procedure or a user-defined function](stored-procedures-vs-udfs.md)
:   Choose between writing a stored procedure and writing a user-defined function.

[Design Guidelines and Constraints for Functions and Procedures](udf-stored-procedure-guidelines.md)
:   Read more about the guidelines that functions and procedures share, including guidelines related to deployment options, security practices,
    platform constraints, and conventions.

[Packaging Handler Code](udf-stored-procedure-building.md)
:   Use tools to package handler code and ensure that dependencies are available on Snowflake.

[Stored procedures overview](stored-procedure/stored-procedures-overview.md)
:   Learn the benefits and supported languages.

[User-defined functions overview](udf/udf-overview.md)
:   Learn the types of UDFs and supported languages.

[Logging, tracing, and metrics](logging-tracing/logging-tracing-overview.md)
:   Record handler code activity by capturing log messages and trace events, storing the data in a database you can query later.

[External network access overview](external-network-access/external-network-access-overview.md)
:   Create secure access to specific network locations external to Snowflake, then use that access from within the handler code.

[Introduction to external functions](../sql-reference/external-functions-introduction.md)
:   Access custom code that runs outside of Snowflake, such as API services that provide geocoding and machine learning models.

---
title: External network access and private connectivity on AWS
source: https://docs.snowflake.com/en/developer-guide/external-network-access/creating-using-private-aws.md
section: Developer Guide
---

# External network access and private connectivity on AWS

You can configure Snowflake for outbound private connectivity to an AWS external service by way of
[external network access](external-network-access-overview.md).

Unlike public connectivity, with private connectivity you must do the following operations:

* Create a private connectivity endpoint. This step requires the ACCOUNTADMIN role.
* Create the network rule so the `TYPE` property is set to `PRIVATE_HOST_PORT`.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Set up private connectivity to an external Amazon S3 service

1. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to specify Snowflake is connecting to an
   AWS S3 service, and the hostname to use when connecting to the service:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'com.amazonaws.us-west-2.s3',
     '*.s3.us-west-2.amazonaws.com'
   );
   ```

   > **Note:**
   >
   > The asterisk in `*.s3.us-west-2.amazonaws.com` specifies that you can use the endpoint to access multiple S3 buckets.
2. Execute the following SQL statement to create a network rule that allows Snowflake to send requests to an external destination, being
   sure to set the `TYPE` property to `PRIVATE_HOST_PORT`:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE aws_s3_network_rule
     MODE = EGRESS
     TYPE = PRIVATE_HOST_PORT
     VALUE_LIST = ('external-access-iam-bucket.s3.us-west-2.amazonaws.com');
   ```
3. Execute the following SQL statement to create a security integration for external API authentication:

   ```sqlexample
   CREATE OR REPLACE SECURITY INTEGRATION aws_s3_security_integration
     TYPE = API_AUTHENTICATION
     AUTH_TYPE = AWS_IAM
     ENABLED = TRUE
     AWS_ROLE_ARN = 'arn:aws:iam::736112632310:role/external-access-iam-bucket';
   ```
4. Execute the following SQL statement to get the `STORAGE_AWS_IAM_USER_ARN` and `STORAGE_AWS_EXTERNAL_ID` values for the IAM user:

   ```sqlexample
   DESC SECURITY INTEGRATION aws_s3_security_integration;
   ```
5. Using the `STORAGE_AWS_IAM_USER_ARN` and `STORAGE_AWS_EXTERNAL_ID` values, follow **Step 5** in
   [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md) to grant the IAM user access to the Amazon S3 service.
6. Execute the following SQL statement to create a token to use for authentication with the AWS S3 service:

   ```sqlexample
   CREATE OR REPLACE SECRET aws_s3_access_token
     TYPE = CLOUD_PROVIDER_TOKEN
     API_AUTHENTICATION = aws_s3_security_integration;
   ```
7. Execute the following SQL statement to create an external access integration that uses the network rule and token created in previous
   steps:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION aws_s3_external_access_integration
     ALLOWED_NETWORK_RULES = (aws_s3_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (aws_s3_access_token)
     ENABLED = TRUE
     COMMENT = 'Testing S3 connectivity';
   ```
8. Execute one of the following SQL statements to create a function that can use the external access integration and the token that were
   previously created:

   PythonJava

   ```sqlexample-python
   CREATE OR REPLACE FUNCTION aws_s3_python_function()
     RETURNS VARCHAR
     LANGUAGE PYTHON
     EXTERNAL_ACCESS_INTEGRATIONS = (aws_s3_external_access_integration)
     RUNTIME_VERSION = '3.12'
     SECRETS = ('cred' = aws_s3_access_token)
     PACKAGES = ('boto3')
     HANDLER = 'main_handler'
   AS
   $$
     import boto3
     import _snowflake
     from botocore.config import Config

     def main_handler():
         # Get the previously created token as an object
         cloud_provider_object = _snowflake.get_cloud_provider_token('cred')

         # Configure boto3 connection settings
         config = Config(
             retries=dict(total_max_attempts=9),
             connect_timeout=30,
             read_timeout=30,
             max_pool_connections=50
         )

         # Connect to S3 using boto3
         s3 = boto3.client(
             's3',
             region_name='us-west-2',
             aws_access_key_id=cloud_provider_object.access_key_id,
             aws_secret_access_key=cloud_provider_object.secret_access_key,
             aws_session_token=cloud_provider_object.token,
             config=config
         )

         # Use the s3 object upload/download resources
         # ...

         return 'Successfully connected to AWS S3'
   $$;
   ```

   ```sqlexample-java
   CREATE OR REPLACE FUNCTION aws_s3_java_function()
     RETURNS STRING
     LANGUAGE JAVA
     EXTERNAL_ACCESS_INTEGRATIONS = (aws_s3_external_access_integration)
     SECRETS = ('cred' = aws_s3_access_token)
     HANDLER = 'AWSTokenProvider.handle'
   AS
   $$
     import com.snowflake.snowpark_java.types.CloudProviderToken;
     import com.snowflake.snowpark_java.types.SnowflakeSecrets;

     public class AWSTokenProvider {
         public static String handle() {
             // Get the previously created token as an object
             SnowflakeSecrets sfSecret = SnowflakeSecrets.newInstance();
             CloudProviderToken cloudProviderToken = sfSecret.getCloudProviderToken("cred");

             // Create variables for the AWS session credentials
             String accessKeyId = cloudProviderToken.getAccessKeyId();
             String secretAccessKey = cloudProviderToken.getSecretAccessKey();
             String token = cloudProviderToken.getToken();

             // Use the token to create an S3 client
             // ...

             return "Successfully connected to AWS S3 with the following access token: " + token;
         }
     }
   $$;
   ```
9. Execute one of the following SQL statements to run the function you created:

   PythonJava

   ```sqlexample
   SELECT aws_s3_python_function();
   ```

   ```sqlexample
   SELECT aws_s3_java_function();
   ```

## Set up private connectivity to an external Amazon Bedrock service

1. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to specify that Snowflake is connecting to
   the AWS S3 and Amazon Bedrock services, and the hostnames to use when connecting to the services:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'com.amazonaws.us-west-2.s3',
     '*.s3.us-west-2.amazonaws.com'
   );

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'com.amazonaws.us-west-2.bedrock-runtime',
     'bedrock-runtime.us-west-2.amazonaws.com'
   );
   ```
2. Execute the following SQL statement to create a network rule that allows Snowflake to send requests to an external destination, being
   sure to set the `TYPE` property to `PRIVATE_HOST_PORT`:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE bedrock_network_rule
     MODE = EGRESS
     TYPE = PRIVATE_HOST_PORT
     VALUE_LIST = ('bedrock-runtime.us-west-2.amazonaws.com');
   ```
3. Execute the following SQL statement to create a security integration for external API authentication:

   ```sqlexample
   CREATE OR REPLACE SECURITY INTEGRATION bedrock_security_integration
     TYPE = API_AUTHENTICATION
     AUTH_TYPE = AWS_IAM
     ENABLED = TRUE
     AWS_ROLE_ARN = 'arn:aws:iam::736112632310:role/external-access-iam-bucket';
   ```
4. Execute the following SQL statement to get the `STORAGE_AWS_IAM_USER_ARN` and `STORAGE_AWS_EXTERNAL_ID` values for the IAM user:

   ```sqlexample
   DESC  SECURITY INTEGRATION bedrock_security_integration;
   ```
5. Using the `STORAGE_AWS_IAM_USER_ARN` and `STORAGE_AWS_EXTERNAL_ID` values, follow **Step 5** in
   [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md) to grant the IAM user access to the Amazon Bedrock service.
6. Execute the following SQL statement to create a token to use for authentication with the AWS Bedrock service:

   ```sqlexample
   CREATE OR REPLACE SECRET aws_bedrock_access_token
     TYPE = CLOUD_PROVIDER_TOKEN
     API_AUTHENTICATION = bedrock_security_integration;
   ```
7. Execute the following SQL statement to create an external access integration that uses the network rule and token created in previous
   steps:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION bedrock_external_access_integration
     ALLOWED_NETWORK_RULES = (bedrock_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS=(aws_bedrock_access_token)
     ENABLED=true ;
   ```
8. Execute the following SQL statement to create a function that can use the external access integration and the token that were
   previously created:

   ```sqlexample-python
   CREATE OR REPLACE FUNCTION bedrock_private_connectivity_tests(
     id INT,
     instructions VARCHAR,
     user_context VARCHAR,
     model_id VARCHAR
   )
     RETURNS VARCHAR
     LANGUAGE PYTHON
     EXTERNAL_ACCESS_INTEGRATIONS = (bedrock_external_access_integration)
     RUNTIME_VERSION = '3.8'
     SECRETS = ('cred' = aws_bedrock_access_token)
     PACKAGES = ('boto3')
     HANDLER = 'bedrock_py'
   AS
   $$
     import boto3
     import json
     import _snowflake
     def bedrock_py(id, instructions, user_context, model_id):
         # Get the previously created token as an object
         cloud_provider_object = _snowflake.get_cloud_provider_token('cred')
         cloud_provider_dictionary = {
             "ACCESS_KEY_ID": cloud_provider_object.access_key_id,
             "SECRET_ACCESS_KEY": cloud_provider_object.secret_access_key,
             "TOKEN": cloud_provider_object.token
         }
         # Assign AWS credentials and choose a region
         boto3_session_args = {
             'aws_access_key_id': cloud_provider_dictionary["ACCESS_KEY_ID"],
             'aws_secret_access_key': cloud_provider_dictionary["SECRET_ACCESS_KEY"],
             'aws_session_token': cloud_provider_dictionary["TOKEN"],
             'region_name': 'us-west-2'
         }
         session = boto3.Session(**boto3_session_args)
         client = session.client('bedrock-runtime')
         # Prepare the request body for the specified model
         def prepare_request_body(model_id, instructions, user_context):
             default_max_tokens = 512
             default_temperature = 0.7
             default_top_p = 1.0
             if model_id == 'amazon.titan-text-express-v1':
                 body = {
                     "inputText": f"<SYSTEM>Follow these:{instructions}<END_SYSTEM>\n<USER_CONTEXT>Use this user context in your response:{user_context}<END_USER_CONTEXT>",
                     "textGenerationConfig": {
                         "maxTokenCount": default_max_tokens,
                         "stopSequences": [],
                         "temperature": default_temperature,
                         "topP": default_top_p
                     }
                 }
             elif model_id == 'ai21.j2-ultra-v1':
                 body = {
                     "prompt": f"<SYSTEM>Follow these:{instructions}<END_SYSTEM>\n<USER_CONTEXT>Use this user context in your response:{user_context}<END_USER_CONTEXT>",
                     "temperature": default_temperature,
                     "topP": default_top_p,
                     "maxTokens": default_max_tokens
                 }
             elif model_id == 'anthropic.claude-3-sonnet-20240229-v1:0':
                 body = {
                     "max_tokens": default_max_tokens,
                     "messages": [{"role": "user", "content": f"<SYSTEM>Follow these:{instructions}<END_SYSTEM>\n<USER_CONTEXT>Use this user context in your response:{user_context}<END_USER_CONTEXT>"}],
                     "anthropic_version": "bedrock-2023-05-31"
                 }
             else:
                 raise ValueError("Unsupported model ID")
             return json.dumps(body)
         # Call Bedrock to get a completion
         body = prepare_request_body(model_id, instructions, user_context)
         response = client.invoke_model(modelId=model_id, body=body)
         response_body = json.loads(response.get('body').read())
         # Parse the API response based on the model
         def get_completion_from_response(response_body, model_id):
             if model_id == 'amazon.titan-text-express-v1':
                 output_text = response_body.get('results')[0].get('outputText')
             elif model_id == 'ai21.j2-ultra-v1':
                 output_text = response_body.get('completions')[0].get('data').get('text')
             elif model_id == 'anthropic.claude-3-sonnet-20240229-v1:0':
                 output_text = response_body.get('content')[0].get('text')
             else:
                 raise ValueError("Unsupported model ID")
             return output_text
         # Get the generated text from Bedrock
         output_text = get_completion_from_response(response_body, model_id)
         return output_text
     $$;
   ```
9. Execute the following SQL statement to run the function you created:

   ```sqlexample
   SELECT bedrock_private_connectivity_tests(1, 'Summarize the main benefits of attending this university.', 'University of Waterloo', 'amazon.titan-text-express-v1');
   ```

---
title: External network access and private connectivity on Google Cloud
source: https://docs.snowflake.com/en/developer-guide/external-network-access/creating-using-private-gcp.md
section: Developer Guide
---

# External network access and private connectivity on Google Cloud

This topic provides configuration details to set up outbound private connectivity to a Google Cloud external service by way of
[external network access](external-network-access-overview.md). The primary differences between
the outbound public connectivity and outbound private connectivity configurations are that, with private connectivity, you must do the
following operations:

* Create a private connectivity endpoint. This step requires the ACCOUNTADMIN role.
* Create a network rule so the `TYPE` property is set to `PRIVATE_HOST_PORT`.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Configure external network access

To configure outbound private connectivity with external network access on Google Cloud, do the following steps:

1. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a private
   connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to a Google Cloud external service using private connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'projects/<project_id>/regions/us-west2/serviceAttachments/cloud-func',
     'my-hello-echo-function.com',
   );
   ```
2. In the Google Cloud console, go to the service attachment and accept the newly connected Snowflake project.
3. In Snowflake, create a [network rule](../../sql-reference/sql/create-network-rule.md), specifying the `PRIVATE_HOST_PORT` property to
   enable private connectivity:

   ```sqlexample
   CREATE DATABASE IF NOT EXISTS external_access_db;

   CREATE OR REPLACE NETWORK RULE external_access_db.public.cloud_func_rule
     MODE = EGRESS
     TYPE = PRIVATE_HOST_PORT
     VALUE_LIST = ('my-hello-echo-function:443');
   ```
4. In Snowflake, create an [external access integration](../../sql-reference/sql/create-external-access-integration.md), specifying the network rule from
   the previous step:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION web_server_eai
     ALLOWED_NETWORK_RULES = (external_access_db.public.cloud_func_rule)
     ENABLED = TRUE;
   ```
5. In Snowflake, execute the following SQL statements to create a function that can use the external access integration:

   ```sqlexample-java
   CREATE OR REPLACE FUNCTION call_func(name VARCHAR)
     returns VARCHAR
     LANGUAGE JAVA
     EXTERNAL_ACCESS_INTEGRATIONS = (web_server_eai)
     HANDLER = 'UDFClient.call'
     AS
     $$
     import java.net.http.HttpClient;
     import java.net.http.HttpRequest;
     import java.net.http.HttpResponse;
     import java.net.URI;
     import java.io.IOException;

     public class UDFClient {
       private HttpClient client;

       public UDFClient() {
         this.client = HttpClient.newBuilder().version(HttpClient.Version.HTTP_1_1).build();
    }

     public String call(String name) throws IOException, InterruptedException {
       HttpRequest request = HttpRequest.newBuilder()
            .header("Content-Type", "application/json")
            .uri(URI.create("http://my-hello-echo-function?name=" + name))
            .GET()
            .build();

       HttpResponse<String> response =
            client.send(request, HttpResponse.BodyHandlers.ofString());

       return String.valueOf(response.body());
      }
     }
     $$;
   ```
6. In Snowflake, call the function you created in the previous step:

   ```sqlexample
   SELECT call_func("snowflake");

   -- Returns "Hello snowflake!"
   ```

If you no longer need the private connectivity endpoint for the external network access integration, call the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

---
title: External network access and private connectivity on Microsoft Azure
source: https://docs.snowflake.com/en/developer-guide/external-network-access/creating-using-private-azure.md
section: Developer Guide
---

# External network access and private connectivity on Microsoft Azure

This topic provides configuration details to set up outbound private connectivity to an external service by way of
[external network access](external-network-access-overview.md). The primary differences between
the outbound public connectivity and outbound private connectivity configurations are that, with private connectivity, you must do the
following operations:

* Create a private connectivity endpoint. This step requires the ACCOUNTADMIN role.
* Create the network rule to use the `PRIVATE_HOST_PORT` property. This property includes the Azure URL and port number, which
  enables the connection from Snowflake to Microsoft Azure to go through the Microsoft Azure internal network, avoiding the public Internet.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](../../sql-reference/organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Configure external network access

These steps are unique to using outbound private connectivity with external network access on Microsoft Azure:

1. Call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) system function to provision a private
   connectivity endpoint in your Snowflake VNet to enable Snowflake to connect to an external service using private connectivity:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;

   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/1111-22-333-4444-55555/resourceGroups/external-access/providers/Microsoft.Sql/servers/externalaccessdemo',
     'externalaccessdemo.database.windows.net',
     'sqlServer'
   );
   ```
2. In the Azure Portal and as the owner of the Azure API Management resource, approve the private endpoint. For more information, see the
   [Microsoft Azure documentation](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
3. Create a [database](../../sql-reference/sql/create-database.md) and [schemas](../../sql-reference/sql/create-schema.md) to store the network
   rule, secret, and procedure:

   ```sqlexample
   CREATE DATABASE ext_network_access_db;
   CREATE SCHEMA secrets;
   CREATE SCHEMA network_rules;
   CREATE SCHEMA procedures;
   ```
4. Create a [network rule](../../sql-reference/sql/create-network-rule.md), specifying the `PRIVATE_HOST_PORT` property to enable
   private connectivity:

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE ext_network_access_db.network_rules.azure_sql_private_rule
      MODE = EGRESS
      TYPE = PRIVATE_HOST_PORT
      VALUE_LIST = ('externalaccessdemo.database.windows.net');
   ```
5. Create a [secret](../../sql-reference/sql/create-secret.md) to securely store the access credentials:

   ```sqlexample
   CREATE OR REPLACE SECRET ext_network_access_db.secrets.secret_password
      TYPE = PASSWORD
      USERNAME = 'my-username'
      PASSWORD = 'my-password';
   ```
6. Create an [external access integration](../../sql-reference/sql/create-external-access-integration.md), specifying the network rule from
   the previous step:

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION azure_private_access_sql_store_integration
      ALLOWED_NETWORK_RULES = (ext_network_access_db.network_rules.azure_sql_private_rule)
      ALLOWED_AUTHENTICATION_SECRETS = (ext_network_access_db.secrets.secret_password)
      ENABLED = TRUE;
   ```
7. Create a [procedure](../../sql-reference/sql/create-procedure.md) to connect to the external service:

   ```sqlexample-python
   CREATE OR REPLACE PROCEDURE ext_network_access_db.procedures.connect_azure_sqlserver()
     RETURNS TABLE()
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.10
     HANDLER = 'connect_sqlserver'
     EXTERNAL_ACCESS_INTEGRATIONS = (azure_private_access_sql_store_integration)
     SECRETS = ('cred' = ext_network_access_db.secrets.secret_password)
     IMPORTS = ('@demo/pytds.zip')
     PACKAGES = ('snowflake-snowpark-python','pyopenssl','bitarray','certifi')
   AS $$
   import pytds
   import certifi
   import _snowflake
   from snowflake.snowpark import types as T

   def connect_sqlserver(session):
      server = 'externalaccessdemo.database.windows.net'
      database = 'externalaccess'
      username_password_object = _snowflake.get_username_password('cred');

      # Create a connection to the database
      with pytds.connect(server, database, username_password_object.username, username_password_object.password, cafile=certifi.where(), validate_host=False) as conn:
            with conn.cursor() as cur:
               cur.execute("""
               SELECT O.OrderId,
                     O.OrderDate,
                     O.SodName,
                     O.UnitPrice,
                     O.Quantity,
                     C.Region
               FROM Orders AS O
               INNER JOIN Customers AS C
                  ON O.CustomerID = C.CustomerID;""")
               rows = cur.fetchall()

               schema = T.StructType([
                     T.StructField("ORDER_ID", T.LongType(), True),
                     T.StructField("ORDER_DATE", T.DateType(), True),
                     T.StructField("SOD_NAME", T.StringType(), True),
                     T.StructField("UNIT_PRICE", T.FloatType(), True),
                     T.StructField("QUANTITY", T.FloatType(), True),
                     T.StructField("REGION", T.StringType(), True)
                  ])

               final_df = session.createDataFrame(rows, schema)

               return final_df
      $$;
   ```
8. Call the procedure to connect to the external service:

   ```sqlexample
   CALL ext_network_access_db.procedures.connect_azure_sqlserver();
   ```

Repeat these steps for each external network access configuration that requires private connectivity.

If you no longer need the private connectivity endpoint for the external network access integration, call the
[SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_deprovision_privatelink_endpoint.md) system function.

---
title: External network access best practices
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-best-practices.md
section: Developer Guide
---

# External network access best practices

This topic provides best practices for accessing external network locations from user-defined functions and procedures.

## Follow applicable best practices from external functions

Follow best practices described for [external functions](../../sql-reference/external-functions.md), including the following:

* [Use a remote service’s batch API if available](../../sql-reference/external-functions-best-practices.md)
* [Do not assume that the remote service is passed each row exactly once](../../sql-reference/external-functions-best-practices.md)

Note that, unlike external functions, you are responsible in your handler code for performing retries, sending batch requests from
vectorized UDFs, and managing exceptions.

## Process one row at a time when using external access in a vectorized UDF or UDTF

When your vectorized UDF or UDTF handler code makes requests of an external network, you should process each row independently to
avoid non-deterministic results.

To minimize networking overhead, Snowflake typically batches rows to send to remote services. The number of batches
and the size of each batch can vary.

In addition, the order of batches can vary, and the order of rows within a batch can vary. Even if the query contains
an ORDER BY clause, the ORDER BY is usually applied after the request to the external network location.

Because batch size and row order are not guaranteed, writing a handler code that returns a value for a row
that depends on any other row in this batch or previous batches can produce non-deterministic results.

Snowflake strongly recommends that the your code process each row independently.

The return value for each input row should depend on only that input row, not on other input rows. (Currently,
handlers performing external network access do not support [window functions](../../sql-reference/functions-window.md), for example.)

Note also that because batch size is not guaranteed, counting batches is not meaningful.

## Reuse the TCP connection if possible

Snowflake limits the total number of connections that can be made from a UDF. When this limit is reached, you might see the following
error message:

```none
Cannot assign requested address
```

To avoid running into resource exhaustion issues, you should try to reuse connections as much as possible. You can achieve this by creating
the TCP client or session once during the UDF initialization, then using it in the UDF handler for the rest of the query. For example, for
code written in Python, you can reuse the `Session` object (available from the Python `requests` library) for multiple HTTP
calls.

For more information and an example, refer to [Using the external access integration in a function or procedure](creating-using-external-network-access.md).

## Expect and handle transient errors in code

When you have a long-running query that calls the remote service multiple times, it’s possible for one of the calls to fail with a
transient error. To avoid query failures, your code should execute retries and handle failures on the assumption that failures may
occur.

---
title: External network access examples
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-examples.md
section: Developer Guide
---

# External network access examples

This topic provides examples of accessing external network locations from user-defined functions and procedures.

## Accessing PyPi to install packages in Snowpark Container

You can access the PyPi package repository by creating an external access integration.
You might do this when you want to allow Notebook users on Container Runtime to install `pip`
packages using the `pip install` command. With this kind of integration, you can also
allow Snowpark Container Services to install pip packages.

This example uses the Snowflake-managed network rule `snowflake.external_access.pypi_rule`
described in [Privileges and commands](../../user-guide/network-rules.md).

1. Create an external access integration using the `snowflake.external_access.pypi_rule` network rule.

   ```sqlexample
   CREATE [OR REPLACE] EXTERNAL ACCESS INTEGRATION pypi_access
     ALLOWED_NETWORK_RULES = (snowflake.external_access.pypi_rule)
     ENABLED = true;
   ```
2. Create a `developer` role for users who need to use `pip install`
   in a Snowpark Container or Notebook on Container Runtime.

   ```sqlexample
   CREATE OR REPLACE ROLE developer;
   ```
3. Grant to the `developer` role the privileges needed to use the external
   access integration you created.

   ```sqlexample
   GRANT USAGE ON INTEGRATION pypi_access TO ROLE developer;
   ```

## Accessing the Google Translate API with OAuth

The following steps include code to create an external access integration for access to the
Google Translation API. The steps add the security integration and the permissions needed to execute the statements.

1. Create a network rule representing the external location.

   For more information about the role of a network rule in external access, including privileges required, see
   [Creating a network rule to represent the external network location](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE google_apis_network_rule
     MODE = EGRESS
     TYPE = HOST_PORT
     VALUE_LIST = ('translation.googleapis.com');
   ```
2. Create a security integration to hold the OAuth credentials required to authenticate with the external network location specified
   by the `google_apis_network_rule` network rule.

   For reference information on the command, including privileges required, see
   [CREATE SECURITY INTEGRATION (External API Authentication)](../../sql-reference/sql/create-security-integration-api-auth.md).

   ```sqlexample
   CREATE OR REPLACE SECURITY INTEGRATION google_translate_oauth
     TYPE = API_AUTHENTICATION
     AUTH_TYPE = OAUTH2
     OAUTH_CLIENT_ID = 'my-client-id'
     OAUTH_CLIENT_SECRET = 'my-client-secret'
     OAUTH_TOKEN_ENDPOINT = 'https://oauth2.googleapis.com/token'
     OAUTH_AUTHORIZATION_ENDPOINT = 'https://accounts.google.com/o/oauth2/auth'
     OAUTH_ALLOWED_SCOPES = ('https://www.googleapis.com/auth/cloud-platform')
     ENABLED = TRUE;
   ```
3. Create a secret to represent the credentials contained by the `google_translate_oauth` security integration.

   For more information about the role of the secret in external access, including privileges required, see
   [Creating a secret to represent credentials](creating-using-external-network-access.md).

   The secret must specify a refresh token with its OAUTH_REFRESH_TOKEN parameter. To obtain a refresh token from the service provider
   (in this case, from the Google Cloud Translation API service), you can use a way the provider offers or use Snowflake system functions.

   To create a secret with a refresh token, use *either* Google OAuth Playground or Snowflake system functions, as described by the
   following:

   * Snowflake system functions

     1. Execute CREATE SECRET to create a secret. You’ll update it with the refresh token in a later step.

        ```sqlexample
        USE DATABASE my_db;
        USE SCHEMA secret_schema;

        CREATE OR REPLACE SECRET oauth_token
          TYPE = oauth2
          API_AUTHENTICATION = google_translate_oauth;
        ```
     2. Execute the [SYSTEM$START_OAUTH_FLOW](../../sql-reference/functions/system_start_oauth_flow.md) function to retrieve a URL with which you can obtain a
        refresh token, specifying as its argument the name of the secret you created previously.

        ```sqlexample
        CALL SYSTEM$START_OAUTH_FLOW( 'my_db.secret_schema.oauth_token' );
        ```

        The function generates a URL you can use to complete the OAuth consent process.
     3. In a browser, visit the generated URL and complete the OAuth2 consent process. When you’ve finished, leave the browser open to the
        last page of the process.
     4. From the browser address bar, copy all of the text after the question mark in the URL of the last page of the consent process.
     5. Execute the [SYSTEM$FINISH_OAUTH_FLOW](../../sql-reference/functions/system_finish_oauth_flow.md) function, specifying as an argument the parameters you just
        copied from the browser address bar to update the secret with a refresh token.

        Be sure to execute SYSTEM$FINISH_OAUTH_FLOW in the same session as SYSTEM$START_OAUTH_FLOW. SYSTEM$FINISH_OAUTH_FLOW updates
        the secret you specified in SYSTEM$START_OAUTH_FLOW with access token and refresh token it obtained from the OAuth server.

        ```sqlexample
        CALL SYSTEM$FINISH_OAUTH_FLOW( 'state=<remaining_url_text>' );
        ```
   * Google OAuth Playground

     1. In [Google OAuth Playground](https://developers.google.com/oauthplayground/), select and authorize the Cloud Translation API as
        specified in step 1.
     2. In Step 2, click exchange authorization code for tokens, then copy the refresh token
        token value.
     3. Execute CREATE SECRET to create a secret that specifies the refresh token value you copied.

        For more information about the role of a secret in external access, including privileges required, see
        [Creating a secret to represent credentials](creating-using-external-network-access.md).

        ```sqlexample
        CREATE OR REPLACE SECRET oauth_token
          TYPE = oauth2
          API_AUTHENTICATION = google_translate_oauth
          OAUTH_REFRESH_TOKEN = 'my-refresh-token';
        ```
4. Create an external access integration using the network rule and secret.

   For more information about the role of an external access integration, including privileges required, see
   [Creating an external access integration](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION google_apis_access_integration
     ALLOWED_NETWORK_RULES = (google_apis_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (oauth_token)
     ENABLED = TRUE;
   ```
5. Create a `developer` role that will be assigned to users who need to create a UDF or procedure that uses the integration.

   ```sqlexample
   CREATE OR REPLACE ROLE developer;
   CREATE OR REPLACE ROLE user;
   ```
6. Grant to the `developer` role privileges needed to create a UDF that uses the objects for external access. This includes the
   following:

   * The READ privilege on the secret.
   * The USAGE privilege on the schema containing the secret.
   * The USAGE privilege on the integration.

     ```sqlexample
     GRANT READ ON SECRET oauth_token TO ROLE developer;
     GRANT USAGE ON SCHEMA secret_schema TO ROLE developer;
     GRANT USAGE ON INTEGRATION google_apis_access_integration TO ROLE developer;
     ```
7. Create a UDF `google_translate_python` that translates the specified text into a phrase in the specified language. For more
   information, see [Using the external access integration in a function or procedure](creating-using-external-network-access.md).

   ```sqlexample-python
   USE ROLE developer;

   CREATE OR REPLACE FUNCTION google_translate_python(sentence STRING, language STRING)
     RETURNS STRING
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.12
     HANDLER = 'get_translation'
     EXTERNAL_ACCESS_INTEGRATIONS = (google_apis_access_integration)
     PACKAGES = ('snowflake-snowpark-python','requests')
     SECRETS = ('cred' = oauth_token )
   AS $$
   import _snowflake
   import requests
   import json
   session = requests.Session()
   def get_translation(sentence, language):
     token = _snowflake.get_oauth_access_token('cred')
     url = "https://translation.googleapis.com/language/translate/v2"
     data = {'q': sentence,'target': language}
     response = session.post(url, json = data, headers = {"Authorization": "Bearer " + token})
     return response.json()['data']['translations'][0]['translatedText']
   $$;
   ```
8. Grant the USAGE privilege on the `google_translate_python` function so that those with the `user` role can call it.

   ```sqlexample
   GRANT USAGE ON FUNCTION google_translate_python(string, string) TO ROLE user;
   ```
9. Execute the `google_translate_python` function to translate a phrase.

   ```sqlexample
   USE ROLE user;

   SELECT google_translate_python('Happy Thursday!', 'zh-CN');
   ```

   This generates the following output.

   ```output
   -------------------------------------------------------
   | GOOGLE_TRANSLATE_PYTHON('HAPPY THURSDAY!', 'ZH-CN') |
   -------------------------------------------------------
   | 快乐星期四！                                          |
   -------------------------------------------------------
   ```

## Accessing an external lambda function with basic authentication

The following steps include example code to create an external access integration for access to a lambda function external to Snowflake.
The example uses a placeholder for the external endpoint itself, but it could be a function available at a REST service endpoint, for example.

The external access is used in a [vectorized Python UDF](../udf/python/udf-python-batch.md) that receives a Pandas
DataFrame containing the data.

1. Create a network rule `lambda_network_rule` representing the external location `my_external_service` (here, a placeholder
   value for the location of an external endpoint).

   For more information about the role of a network rule in external access, see
   [Creating a network rule to represent the external network location](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE lambda_network_rule
     MODE = EGRESS
     TYPE = HOST_PORT
     VALUE_LIST = ('my_external_service');
   ```
2. Create a secret to represent credentials required by the external service.

   Handler code later in this example retrieves the credentials from the secret using a Snowflake API for Python.

   For more information about the role of the secret in external access, see [Creating a secret to represent credentials](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE SECRET secret_password
     TYPE = PASSWORD
     USERNAME = 'my_user_name'
     PASSWORD = 'my_password';
   ```
3. Create a `developer` role and grant to it READ privileges on the secret. This role will be assigned to users who need to create
   a UDF or procedure that uses the secret.

   Also, create the role that users will use to call the function.

   ```sqlexample
   CREATE OR REPLACE ROLE developer;
   CREATE OR REPLACE ROLE user;
   ```
4. Grant to the `developer` role privileges needed to create a UDF that uses the objects for external access. This includes the
   following:

   * The READ privilege on the secret.
   * The USAGE privilege on the schema containing the secret.

   ```sqlexample
   GRANT READ ON SECRET secret_password TO ROLE developer;
   GRANT USAGE ON SCHEMA secret_schema TO ROLE developer;
   ```
5. Create an external access integration to specify the external endpoint and credentials through the network rule and secret you created.

   For more information about the role of an external access integration, including privileges required, see
   [Creating an external access integration](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION lambda_external_access_integration
     ALLOWED_NETWORK_RULES = (lambda_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (secret_password)
     ENABLED = TRUE;
   ```
6. Create a [vectorized Python UDF](../udf/python/udf-python-batch.md) `return_double_column`
   that accesses an external network location to process data received as a
   [Pandas DataFrame](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html).

   For more information on using external access in a UDF, see [Using the external access integration in a function or procedure](creating-using-external-network-access.md).

   ```sqlexample-python
   CREATE OR REPLACE FUNCTION return_double_column(x int)
     RETURNS INT
     LANGUAGE PYTHON
     EXTERNAL_ACCESS_INTEGRATIONS = (lambda_external_access_integration)
     SECRETS = ('cred' = secret_password)
     RUNTIME_VERSION = 3.12
     HANDLER = 'return_first_column'
     PACKAGES = ('pandas', 'requests')
   AS $$
   import pandas
   import numpy as np
   import json
   import requests
   import base64
   import _snowflake
   from _snowflake import vectorized
   from requests.auth import HTTPBasicAuth
   from requests.adapters import HTTPAdapter
   from requests.packages.urllib3.util.retry import Retry

   session = requests.Session()
   retries = Retry(total=10, backoff_factor=1, status_forcelist=[429, 500, 502, 503, 504], allowed_methods = None)

   session.mount('https://', HTTPAdapter(max_retries=retries))

   @vectorized(input=pandas.DataFrame)
   def return_first_column(df):
     request_rows = []

     df.iloc[:,0] = df.iloc[:,0].astype(int)
     request_rows = np.column_stack([df.index, df.iloc[:,0]]).tolist()

     request_payload = {"data" : request_rows}

     username_password_object = _snowflake.get_username_password('cred');
     basic = HTTPBasicAuth(username_password_object.username, username_password_object.password)

     url = 'my_external_service'

     response = session.post(url, json=request_payload, auth=basic)

     response.raise_for_status()
     response_payload = json.loads(response.text)

     response_rows = response_payload["data"]

     return pandas.DataFrame(response_rows)[1]
   $$;
   ```
7. Grant the USAGE privilege on the `return_double_column` function so that those with the `user` role can call it.

   ```sqlexample
   GRANT USAGE ON FUNCTION return_double_column(int) TO ROLE user;
   ```
8. Execute the `return_double_column` function, making a request to the external endpoint.

   Code in the following example creates a two-column table and inserts 100,000,000 rows containing 4-byte integers. The code
   then executes the `return_double_column` function, passing values from column `a` for processing by the external endpoint.

   ```sqlexample
   CREATE OR REPLACE TABLE t1 (a INT, b INT);
   INSERT INTO t1 SELECT SEQ4(), SEQ4() FROM TABLE(GENERATOR(ROWCOUNT => 100000000));

   SELECT return_double_column(a) AS retval FROM t1 ORDER BY retval;
   ```

## Accessing Amazon S3 with AWS IAM

The following steps include example code to connect to an AWS S3 bucket using IAM.

For more information about AWS IAM, see [AWS IAM documentation](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_identifiers.html).

1. Create a network rule, `aws_s3_network_rule`, that represents the AWS S3 bucket at the location specified by the VALUE_LIST
   parameter.

   For more information about the role of a network rule in external access, see
   [Creating a network rule to represent the external network location](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE NETWORK RULE aws_s3_network_rule
     MODE = EGRESS
     TYPE = PRIVATE_HOST_PORT
     VALUE_LIST = ('external-access-iam-bucket.s3.us-west-2.amazonaws.com');
   ```
2. Create a security integration to hold the AWS IAM Amazon Resource Name (ARN) credentials required to authenticate with the external
   network location specified by the `aws_s3_network_rule` network rule.

   For reference information on the command, including privileges required, see
   [CREATE SECURITY INTEGRATION (AWS IAM Authentication)](../../sql-reference/sql/create-security-integration-aws-iam.md).

   ```sqlexample
   CREATE OR REPLACE SECURITY INTEGRATION aws_s3_security_integration
     TYPE = API_AUTHENTICATION
     AUTH_TYPE = AWS_IAM
     ENABLED = TRUE
     AWS_ROLE_ARN = 'arn:aws:iam::736112632310:role/external-access-iam-bucket';
   ```
3. Get the ARN and ID for the IAM USER.

   1. Execute the DESC command on the security integration you created.

      ```sqlexample
      DESC SECURITY INTEGRATION aws_s3_security_integration;
      ```
   2. From the output displayed, copy the values of the following properties to use in the next step:

      * API_AWS_IAM_USER_ARN
      * API_AWS_EXTERNAL_ID
4. Grant the IAM user permissions needed to access the bucket.

   Use the ARN and ID values when configuring a trust policy as described in Step 5 of [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md).
5. Create a [secret](../../sql-reference/sql/create-secret.md) of type CLOUD_PROVIDER_TOKEN to
   represent credentials required by the external service.

   Handler code later in this example retrieves the credentials from the secret using a
   [Snowflake API](secret-api-reference.md).

   For more information about the role of the secret in external access, see [Creating a secret to represent credentials](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE SECRET aws_s3_access_token
     TYPE = CLOUD_PROVIDER_TOKEN
     API_AUTHENTICATION = aws_s3_security_integration;
   ```
6. Create a `developer` role and grant to it READ privileges on the secret. This role will be assigned to users who need to create
   a UDF or procedure that uses the secret.

   Also, create the role that users will use to call the function.

   ```sqlexample
   CREATE OR REPLACE ROLE developer;
   CREATE OR REPLACE ROLE user;
   ```
7. Grant to the `developer` role the privileges needed to create a UDF that uses the objects for external access. This includes the
   following:

   * The READ privilege on the secret.
   * The USAGE privilege on the schema containing the secret.

   ```sqlexample
   GRANT READ ON SECRET aws_s3_access_token TO ROLE developer;
   GRANT USAGE ON SCHEMA secret_schema TO ROLE developer;
   ```
8. Create an [external access integration](../../sql-reference/sql/create-external-access-integration.md)
   to specify the external endpoint and credentials through the network rule and secret you created.

   For more information about the role of an external access integration, including privileges required, see
   [Creating an external access integration](creating-using-external-network-access.md).

   ```sqlexample
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION aws_s3_external_access_integration
     ALLOWED_NETWORK_RULES = (aws_s3_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (aws_s3_access_token)
     ENABLED = TRUE
     COMMENT = 'Testing S3 connectivity';
   ```
9. Create a UDF that uses the external access integration to connect with the Amazon S3 bucket specified in the network rule you created.

   The handler code uses [Snowflake APIs](secret-api-reference.md) to retrieve a token from
   the secret you created. From this token, you can use Snowflake APIs to retrieve values needed to create a session for connecting with
   Amazon S3, including an access key ID, secret access key, and session token.

   For more information on using external access in a UDF, see [Using the external access integration in a function or procedure](creating-using-external-network-access.md).

   PythonJava

   ```sqlexample-python
   CREATE OR REPLACE FUNCTION aws_s3_python_function()
     RETURNS VARCHAR
     LANGUAGE PYTHON
     EXTERNAL_ACCESS_INTEGRATIONS = (aws_s3_external_access_integration)
     RUNTIME_VERSION = '3.12'
     SECRETS = ('cred' = aws_s3_access_token)
     PACKAGES = ('boto3')
     HANDLER = 'main_handler'
   AS $$
   import boto3
   import _snowflake
   from botocore.config import Config

   def main_handler():
     # Get token object
     cloud_provider_object = _snowflake.get_cloud_provider_token('cred')

     # Boto3 configuration
     config = Config(
       retries=dict(total_max_attempts=9),
       connect_timeout=30,
       read_timeout=30,
       max_pool_connections=50
     )

     # Connect to S3 using boto3
     s3 = boto3.client(
       's3',
       region_name='us-west-2',
       aws_access_key_id=cloud_provider_object.access_key_id,
       aws_secret_access_key=cloud_provider_object.secret_access_key,
       aws_session_token=cloud_provider_object.token,
       config=config
     )

     # Use S3 object to upload/download
     return 'Successfully connected with S3'
   $$;
   ```

   ```sqlexample-java
   CREATE OR REPLACE FUNCTION aws_s3_java_function()
     RETURNS STRING
     LANGUAGE JAVA
     EXTERNAL_ACCESS_INTEGRATIONS = (aws_s3_external_access_integration)
     SECRETS = ('cred' = aws_s3_access_token)
     HANDLER = 'AWSTokenProvider.handle'
   AS $$
   import com.snowflake.snowpark_java.types.CloudProviderToken;
   import com.snowflake.snowpark_java.types.SnowflakeSecrets;

   public class AWSTokenProvider {
     public static String handle() {
       // Get token object
       SnowflakeSecrets sfSecret = SnowflakeSecrets.newInstance();
       CloudProviderToken cloudProviderToken = sfSecret.getCloudProviderToken("cred");

       // Create variables for AWS session credentials
       String accessKeyId = cloudProviderToken.getAccessKeyId();
       String secretAccessKey = cloudProviderToken.getSecretAccessKey();
       String token = cloudProviderToken.getToken();

       // Create S3 client using AWS APIs.

       return "Successfully connected with S3 with temp access token: " + token;
     }
   }
   $$;
   ```
10. Grant the USAGE privilege on the UDF so that those with the `user` role can call it.

    ```sqlexample
    GRANT USAGE ON FUNCTION return_double_column(int) TO ROLE user;
    ```
11. Execute the function to connect to the external endpoint.

    PythonJava

    ```sqlexample
    SELECT aws_s3_python_function();
    ```

    ```sqlexample
    SELECT aws_s3_java_function();
    ```

---
title: External network access limitations
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-limitations.md
section: Developer Guide
---

# External network access limitations

This topic describes limitations for accessing external network locations from user-defined functions and procedures.

## Limitations

* Currently, handlers written only in Java, Python, or Scala may access network locations external to Snowflake.
* When using a wildcard in a VALUE_LIST value in a [network rule](../../sql-reference/sql/create-network-rule.md), the following are not
  valid wildcard uses:

  + `snowflake.*.google.com`

    Cannot be used to match `snowflake.sub1.sub2.google.com` because the asterisk can only be used to
    match alphanumeric characters and hyphens.
  + `*.*.google.com`

    Invalid because there are multiple asterisks in the wildcard.
  + `*.com`

    Invalid because the asterisk cannot be used to match the secondary level domain.
* When using a [secret](../../sql-reference/sql/create-secret.md) of the PASSWORD type, the colon character (`:`) is not supported in the
  USERNAME or PASSWORD parameters.
* By default, Snowflake does not enable external access for [trial accounts](../../user-guide/admin-trial-account.md). Contact
  your account representative to get external access enabled for a trial account.

---
title: External network access overview
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-overview.md
section: Developer Guide
---

# External network access overview

You can create secure access to specific network locations external to Snowflake, then use that access from within the handler code for
user-defined functions (UDFs) and stored procedures. You can enable this access through an external access integration.

With an external access integration, you can:

* Write UDF and procedure handlers that access external locations.
* Allow or block access to locations on a network external to Snowflake.
* Use secrets that represent stored credentials, rather than using literal values, within handler code to authenticate with external
  network locations.
* Specify which secrets are allowed for use with external network locations.
* Choose whether your connectivity to the external network location uses the public internet or a private network, such as by using
  Azure Private Link, AWS PrivateLink, or Google Cloud Private Service Connect.

  If you choose to use private connectivity, your Snowflake account must be Business Critical Edition (or later).

  For more information, see the following topics:

  + [External network access and private connectivity on Microsoft Azure](creating-using-private-azure.md)
  + [External network access and private connectivity on AWS](creating-using-private-aws.md)
  + [External network access and private connectivity on Google Cloud](creating-using-private-gcp.md)

## Get started

For an introduction to external network access, including code examples, refer to
[External network access examples](external-network-access-examples.md).

## References

* [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md)
* [ALTER EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/alter-external-access-integration.md)
* [DESCRIBE INTEGRATION](../../sql-reference/sql/desc-integration.md)
* [DROP INTEGRATION](../../sql-reference/sql/drop-integration.md)
* [SHOW INTEGRATIONS](../../sql-reference/sql/show-integrations.md)

---
title: General Scala UDF handler coding guidelines
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-general.md
section: Developer Guide
---

# General Scala UDF handler coding guidelines

This topic describes general guidelines for writing handler code in Scala. For information specific to scalar function handlers, refer to
[Writing a scalar UDF in Scala](udf-scala-scalar.md).

For suggestions on structuring your project, packaging your code, and managing dependencies, refer to
[Scala UDF handler project and packaging](udf-scala-packaging.md).

## Best practices

* Write platform-independent code.

  + Avoid code that assumes a specific CPU architecture (e.g. x86).
  + Avoid code that assumes a specific operating system.
* If you need to execute initialization code and do not want to include it in the method that you call, you can put
  the initialization code into a companion object of your handler class.
* Whenever possible when using an in-line handler, specify a value for the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) or
  [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) TARGET_PATH parameter. This will prompt Snowflake to reuse previously-generated
  handler code output rather than recompiling for each call. For more information, see [Using an in-line handler](../../inline-or-staged.md).

## Writing a handler

You can write a scalar UDF with a handler written in Scala.

The handler is called once for each row passed to the Scala UDF. A new instance of the class is not created for each row;
Snowflake can call the same instance’s handler method more than once.

To optimize execution of your code, Snowflake timeout thresholds differ between the time it takes to initialize your handler class or
object, and the time it takes to execute its handler method. Snowflake allows more time to initialize the handler class or object on the
assumption that initialization might take longer. This includes the time to load your UDF and the time
to call the constructor of the handler method’s containing class, if a constructor is defined.

## Handling errors

You can handle exceptions with common exception-handling techniques to catch errors within the handler method.

If an exception occurs inside the method and is not caught by the method, Snowflake raises an error that includes the stack trace for the
exception.

You can explicitly throw an exception without catching it in order to end the query and produce a SQL error. For example:

```scala
if (x < 0) throw new IllegalArgumentException("x must be non-negative.")
```

When debugging, you can include values in the SQL error message text. To do so:

* Place an entire Scala method body in a try-catch block;
* Append argument values to the caught error’s message; and
* Throw an exception with the extended message.

To avoid revealing sensitive data, remove argument values prior to deploying JAR files to a production environment.

## Choosing data types

When writing your handler, you’ll need to declare parameter and return data types (from the handler’s language) that map well with the
UDF’s parameter and return data types (from SQL).

When the UDF is called, Snowflake converts the UDF’s arguments from the SQL parameter types to the handler’s parameter types. When
returning a value, Snowflake converts the return value from the handler’s return type to the UDF’s return type.

Snowflake converts values between types according to supported mappings between SQL types and Scala types. For more about those mappings,
refer to [SQL-Scala Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

When choosing data types of Scala variables, take into account the maximum and minimum possible values of the data that could be sent
from (and returned to) Snowflake.

## Creating the UDF with `CREATE FUNCTION`

You create a UDF in SQL using the CREATE FUNCTION command, specifying the code you wrote as the handler. For the command reference, see
[CREATE FUNCTION](../../../sql-reference/sql/create-function.md).

Scala 2.12Scala 2.13 (Preview)

```sqlsyntax
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS <type>
  LANGUAGE SCALA
  [ IMPORTS = ( '<imports>' ) ]
  RUNTIME_VERSION = 2.12
  [ PACKAGES = ( '<package_name>' [, '<package_name>' . . .] ) ]
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  HANDLER = '<handler_class>.<handler_method>'
  [ AS '<scala_code>' ]
```

```sqlsyntax
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS <type>
  LANGUAGE SCALA
  [ IMPORTS = ( '<imports>' ) ]
  RUNTIME_VERSION = 2.13
  [ PACKAGES = ( '<package_name>' [, '<package_name>' . . .] ) ]
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  HANDLER = '<handler_class>.<handler_method>'
  [ AS '<scala_code>' ]
```

To associate the handler code you’ve written with the UDF, you do the following when executing CREATE FUNCTION:

* Set LANGUAGE to SCALA.
* Set the IMPORTS clause value to the path and name of the handler class if the class is in an external location, such as on a stage.
* Set RUNTIME_VERSION to the version of the Scala runtime that your code requires.
* Set the PACKAGES clause value to the name of one or more packages, if any, required by the handler class.
* Set the HANDLER clause value to the name of the handler object and method.
* The `AS '<scala_code>'` clause is required if the handler code is specified in-line with CREATE FUNCTION.

---
title: Getting details about an error
source: https://docs.snowflake.com/en/developer-guide/sql-api/handling-errors.md
section: Developer Guide
---

# Getting details about an error

If the statement does not execute successfully, Snowflake returns one of the following response codes, as shown in the flow chart
below:

As shown in this flow chart:

* If the statement execution takes longer than the timeout period specified by the `timeout` field in the request (or the
  timeout specified by the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) parameter, if the `timeout` field is not set),
  Snowflake returns the HTTP response code 408 with a [QueryStatus](reference.md) object.

  Use this object to get [details about the cancellation of the statement execution](handling-responses.md).
* If an error occurred when executing the statement, Snowflake returns the HTTP response code 422 with a
  [QueryFailureStatus](reference.md) object.

  You can get details about the error from this object.

---
title: Getting the query ID of the last query
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/query-id.md
section: Developer Guide
---

# Getting the query ID of the last query

If you need to access the query ID of the last query that was executed, use the global variable SQLID.

> **Note:**
>
> If no query was executed, the default value of SQLID is NULL.

The following example executes two queries and returns an ARRAY containing the query IDs:

```sqlexample
DECLARE
  query_id_1 VARCHAR;
  query_id_2 VARCHAR;
BEGIN
  SELECT 1;
  query_id_1 := SQLID;
  SELECT 2;
  query_id_2 := SQLID;
  RETURN [query_id_1, query_id_2];
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  query_id_1 VARCHAR;
  query_id_2 VARCHAR;
BEGIN
  SELECT 1;
  query_id_1 := SQLID;
  SELECT 2;
  query_id_2 := SQLID;
  RETURN [query_id_1, query_id_2];
END;
$$
;
```

---
title: Git in Snowflake limitations
source: https://docs.snowflake.com/en/developer-guide/git/git-limitations.md
section: Developer Guide
---

# Git in Snowflake limitations

This topic describes limitations for using Git repositories from within Snowflake.

* Currently, only the following Snowflake features can write to the repository:

  + [Workspaces](../../user-guide/ui-snowsight/workspaces-git.md)
  + [Streamlit applications](../streamlit/features/git-integration.md)
  + [Notebooks](../../user-guide/ui-snowsight/notebooks-snowgit.md)

  For other Snowflake code, access to the repository is read-only.
* When you connect to a Git repository using a workspace, the following limitation applies:

  + The Git repository can’t be empty. It must have at least one commit.
* [Preview Feature](../../release-notes/preview-features.md) — Open

  OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).

  OAuth support is in preview for repository providers other than github.com.

  Creating a local Git repository in Snowflake is supported only when using the Workspaces user interface to create it. It isn’t
  supported when you create the repository by using [CREATE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md) in a workspace. This is because
  when using the SQL command, the flow does not include presenting a user interface with which to sign in.
* Sharing Snowflake Git repository clones is not supported through data sharing or apps built on the Snowflake Native App Framework.
* Creating Snowflake Git repository clones inside application packages is not supported and might be blocked in the future.
* Creating Snowflake Git repository clones inside native applications on the consumer side is not supported.
* Snowflake doesn’t currently support submodules, so you won’t be able to see submodule files. Snowflake won’t download those files
  from the remote repository nor upload them to the remote repository.
* Git repositories larger than 2GB aren’t supported.
* Setting up an API integration on Snowflake is required to set up a Git repository object in Snowflake. For more information, see
  [Setting up Snowflake to use Git](git-setting-up.md).

---
title: Git operations in Snowflake
source: https://docs.snowflake.com/en/developer-guide/git/git-operations.md
section: Developer Guide
---

# Git operations in Snowflake

This topic describes how to perform common repository operations using SQL commands and Snowsight.

You can also use the following features with Git, each of which includes its own way to perform Git operations:

* [Workspaces](../../user-guide/ui-snowsight/workspaces-git.md)
* [Streamlit apps](../streamlit/features/git-integration.md)
* [Snowflake notebooks](../../user-guide/ui-snowsight/notebooks-snowgit.md)

## Integrate a Git repository with your Snowflake account

You can have Snowflake connect to your Git repository by using SQL or Snowsight.

SQLSnowsight

For information about using SQL to set up an integration with a Git repository, see [Setting up Snowflake to use Git](git-setting-up.md).

In Snowsight, you can use Workspaces to [integrate with a Git repository](../../user-guide/ui-snowsight/workspaces-git.md).

## Fetch from the remote Git repository

You can fetch to a Git repository clone in Snowflake all branches, tags, and commits from the remote repository. When you do so, you also
prune branches and commits that were fetched earlier but no longer exist in the remote repository.

To perform the operations described in this section, you’ll need the Snowflake access described in
[Access control for ALTER GIT REPOSITORY](../../sql-reference/sql/alter-git-repository.md).

You can fetch from the remote Git repository using either Snowsight or SQL.

SQLSnowsight

You can fetch from the remote Git repository to the Git repository clone in Snowflake by using the
[ALTER GIT REPOSITORY](../../sql-reference/sql/alter-git-repository.md) command.

Code in the following example updates the Git repository clone with the contents of the repository:

```sqlexample
ALTER GIT REPOSITORY snowflake_extensions FETCH;
```

You can use Snowsight to fetch from the remote repository.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that contain the Git repository clone to which you want to fetch.
4. Inside the schema, open Git Repositories.
5. In Git Repositories, select the repository to view the details page.
6. In the repository details, on the Files Explorer tab, select the Fetch button to fetch from the remote repository.

## View a list of repository branches or tags

You can view a list of branches and tags available in the Snowflake Git repository clone fetched from the remote repository.

To perform the operations described in this section, you’ll need the Snowflake access described in the following topics:

* For listing branches: [Access control for SHOW GIT BRANCHES](../../sql-reference/sql/show-git-branches.md)
* For listing tags: [Access control for SHOW GIT TAGS](../../sql-reference/sql/show-git-tags.md)

You can view a list of branches or tags using either Snowsight or SQL.

SQLSnowsight

You can view branches and tags by using the [SHOW GIT BRANCHES](../../sql-reference/sql/show-git-branches.md) and
[SHOW GIT TAGS](../../sql-reference/sql/show-git-tags.md) commands.

The following example generates output that lists branches in the Git repository `snowflake_extensions`:

```sqlexample
SHOW GIT BRANCHES IN snowflake_extensions;
```

The preceding command generates output similar to the following:

```output
--------------------------------------------------------------------------------
| name | path           | checkouts | commit_hash                              |
--------------------------------------------------------------------------------
| main | /branches/main |           | 0f81b1487dfc822df9f73ac6b3096b9ea9e42d69 |
--------------------------------------------------------------------------------
```

You can use Snowsight to view a list of the files in your Git repository.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that contain the Git repository clone you want to view.
4. Inside the schema, open Git Repositories.
5. Locate and select the repository to view its details page.
6. In the repository’s details page, on the Files Explorer tab, select the Branch button.
7. From the Branch drop-down menu, do one of the following:

   * To view a list of branches cloned from the repository, select Branches .
   * To view a list of the tags cloned from the repository, select Tags.

## View a list of repository files

You can view a list of files in a branch, tag, or commit using either Snowsight or SQL.

SQLSnowsight

You can view a list of files in the repository by using the [LIST](../../sql-reference/sql/list.md) command in the following forms,
specifying the Git repository clone as you would a stage (you can abbreviate LIST to LS):

* List by branch name:

  ```sqlexample
  LS @repository_name/branches/branch_name;
  ```
* List by tag name:

  ```sqlexample
  LS @repository_name/tags/tag_name;
  ```
* List by commit hash:

  ```sqlexample
  LS @repository_name/commits/commit_hash;
  ```

The following example generates output that lists files in the main branch of the Git repository `snowflake_extensions`:

```sqlexample
LS @snowflake_extensions/branches/main;
```

The preceding command generates output similar to the following. For descriptions of the output columns, see the
[LIST command reference](../../sql-reference/sql/list.md).

```output
-------------------------------------------------------------------------------------------------------------------------------------------------------
| name                                                         | size | md5 | sha1                                     | last_modified                |
-------------------------------------------------------------------------------------------------------------------------------------------------------
| snowflake_extensions/branches/main/.gitignore                | 10   |     | e43b0f988953ae3a84b00331d0ccf5f7d51cb3cf | Wed, 5 Jul 2023 22:42:34 GMT |
-------------------------------------------------------------------------------------------------------------------------------------------------------
| snowflake_extensions/branches/main/python-handlers/filter.py | 169  |     | c717137b18d7b75005849d76d89037fafc7b5223 | Wed, 5 Jul 2023 22:42:34 GMT |
-------------------------------------------------------------------------------------------------------------------------------------------------------
```

You can use Snowsight to view a list of the branches and tags in your Git repository.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that contain the Git repository clone you want to view.
4. Inside the schema, open Git Repositories.
5. Inside Git Repositories, select a repository to view its details page.
6. In the repository’s details page, on the Files Explorer tab, select the Branch button.
7. From the Branch drop-down menu, select one of the following:

   * To view a list of branches cloned from the repository, select Branches.
   * To view a list of the tags cloned from the repository, select Tags.
8. Select the branch or tag whose files you want to list.
9. Below the repository name, view the list of folders and files corresponding to the selection you made.

## View Git repository clone properties

You can view the properties associated with a Git repository clone in Snowflake.

To perform the operations described in this section, you’ll need the Snowflake access described in
[Access control for DESC GIT REPOSITORY](../../sql-reference/sql/desc-git-repository.md).

You can view Git repository clone properties by using either Snowsight or SQL.

SQLSnowsight

You can view Git repository clone properties by using the SQL commands [SHOW GIT REPOSITORIES](../../sql-reference/sql/show-git-repositories.md)
and [DESCRIBE GIT REPOSITORY](../../sql-reference/sql/desc-git-repository.md).

The properties information includes the Git origin URL, name of the API integration and credentials (specified as a
[secret](../../sql-reference/sql/create-secret.md)) used to connect with the remote repository, and so on.

```sqlexample
DESCRIBE GIT REPOSITORY snowflake_extensions;
```

The preceding command generates output similar to the following:

```output
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| CREATED_ON                    | NAME                 | DATABASE_NAME | SCHEMA_NAME | ORIGIN                                                 | API_INTEGRATION     | GIT_CREDENTIALS           | OWNER        | OWNER_ROLE_TYPE | COMMENT |
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-06-28 08:46:10.886 -0700 | SNOWFLAKE_EXTENSIONS | MY_DB         | MAIN        | https://github.com/my-account/snowflake-extensions.git | GIT_API_INTEGRATION | MY_DB.MAIN.GIT_SECRET     | ACCOUNTADMIN | ROLE            |         |
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

You can use Snowsight to view repository properties.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that contain the Git repository clone you want to view.
4. Inside the schema, open Git Repositories.
5. Inside Git Repositories, select a repository to view its details page.
6. In the repository’s details page, select the Git Repository Details tab to view information that includes the following details:

   * The repository’s origin
   * The [API integration](../../sql-reference/sql/create-api-integration.md) and credentials (specified as a
     [secret](../../sql-reference/sql/create-secret.md)) used by Snowflake to interact with the remote repository
   * Privileges granted on the Git repository clone

## Execute code from a repository

You can execute the code contained by a file from the repository.

To perform the operations described in this section, you’ll need the Snowflake access described in
[Access control for EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).

You can execute code by using either Snowsight or SQL.

SQLSnowsight

You can use [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md) to execute code in a Git repository clone.

Code in the following example executes code in `create-database.sql` from the Git repository clone `snowflake_extensions`:

```sqlexample
EXECUTE IMMEDIATE FROM @snowflake_extensions/branches/main/sql/create-database.sql;
```

You can use Snowsight to execute SQL code from a Git repository.

Note that when you execute code this way, you won’t see output generated by the code’s execution.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that contain the Git repository clone you want to view.
4. Inside the schema, open Git Repositories.
5. Inside Git Repositories, select a repository to view its details page.
6. In the repository’s details page, on the Files Explorer tab, select the Branch button.
7. From the Branch drop-down menu, select one of the following:

   * Branches to view a list of branches cloned from the repository.
   * Tags to view a list of the tags cloned from the repository.
8. Select the branch or tag containing the file whose code you want to execute.
9. Beneath the repository name, select the folder containing the file you want to execute.
10. Locate the file whose code you want to execute, and select  » Execute immediate.
11. In the box that appears, review the code contained by the file.

    This is the code that Snowflake will execute.
12. To execute the displayed code, select Execute Immediate.

    The details page displays a notification that the code’s execution was started. It later indicates whether the execution
    succeeded or failed.

## Copy repository-based code into a worksheet

You can quickly copy code from a repository file into a worksheet. You can edit and run the copied code or use it as a read-only template
for other users.

You can copy the content of the following types of files: `.sql` and `.py`.

To save your changes in your repository, you need to copy the edited code from the worksheet into a file (such as the file corresponding to
the one you copied from) in your local Git repository and commit the changes from there.

Snowsight:
:   You can use Snowsight to copy content from a file in your repository into a worksheet.

    1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
    2. In the navigation menu, select Catalog » Database Explorer.
    3. In the object explorer, select the database and schema that contain the Git repository clone you want to view.
    4. Inside the schema, open Git Repositories.
    5. Inside Git Repositories, select a repository to view its details page.
    6. In the repository’s details page, on the Files Explorer tab, select the Branch button.
    7. From the Branch drop-down menu, do one of the following:

       * To view a list of branches cloned from the repository, select Branches.
       * To view a list of the tags cloned from the repository, select Tags.
    8. Select the branch or tag containing the file whose code you want to copy.
    9. Beneath the repository name, select the folder containing the file you want to execute.
    10. Locate the file whose code you want to execute, and select  » Copy into worksheet.

        Snowflake copies code from the file you selected into a new worksheet.

---
title: Go Snowflake Driver
source: https://docs.snowflake.com/en/developer-guide/golang/go-driver.md
section: Developer Guide
---

# Go Snowflake Driver

> **Note:**
>
> This driver currently does not support GCP regional endpoints. Please ensure that any workloads using through this driver do not require support for regional endpoints on GCP. If you have questions about this, please contact Snowflake Support.

The Go Snowflake Driver provides an interface for developing applications using the Go programming language to connect to Snowflake and perform all standard operations. The driver implements the Go
[database/sql](https://golang.org/pkg/database/sql) package.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

For complete installation instructions, as well as developer notes and the source code, see the GitHub [Go Snowflake Driver repo](https://github.com/snowflakedb/gosnowflake).

For usage information, see the GoDoc [gosnowflake documentation](https://godoc.org/github.com/snowflakedb/gosnowflake).

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

---
title: Granting privileges for user-defined functions
source: https://docs.snowflake.com/en/developer-guide/udf/udf-access-control.md
section: Developer Guide
---

# Granting privileges for user-defined functions

This topic lists the minimum privileges required on objects to perform specific SQL actions with a UDF or UDTF.

## Granting privileges for UDFs and UDTFs

To perform SQL actions on a UDF or UDTF, the person performing the action must have been assigned a role that has been granted the required
privileges. These SQL actions include:

* Creating the function, such as with [CREATE FUNCTION](../../sql-reference/sql/create-function.md) or with the
  [Snowpark API](../snowpark/index.md).
* Owning the function in order to delete, alter, and manage access to the function, whether through SQL or the Snowpark API.
* Calling the function, whether with SQL or the Snowpark API.

The role must be assigned privileges on objects related to the function, including the database and schema, and (if required) a
stage that holds function dependencies.

To grant privileges on an object to a role, use a [GRANT](../../sql-reference/sql/grant-privilege.md) statement.

Code in the following example grants to `my_role` the USAGE privilege on the function `my_java_udf`.

```sqlexample
GRANT USAGE ON FUNCTION my_java_udf(number, number) TO my_role;
```

### Creating UDFs or UDTFs

Creating, managing, and executing a UDF or UDTF requires a role with a minimum of the following privileges:

| Object | Privileges | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE, CREATE FUNCTION |  |
| Stage | USAGE (external stage) or READ (internal stage) | Required if the function depends on or reads from files on a stage. This would include the following staged files:   * File containing handler code for a UDF. For more information about staged and in-line handlers, see   [Keeping handler code in-line or on a stage](../inline-or-staged.md). * Libraries that the handler code requires as dependencies, including JAR files, Python modules, .zip files, and so on. For more   information, see [Making dependencies available to your code](../upload-dependencies.md). * Files containing content read by code in the handler. This includes unstructured data processed by the handler. |

### Owning UDFs or UDTFs

After a UDF or UDTF is created, the function owner (that is, a person with the role that has the OWNERSHIP privilege on the function)
must have a minimum of the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE |  |
| Stage | USAGE (external stage) or READ (internal stage) | Required if the function depends on or reads from files on a stage. This would include the following staged files:   * File containing handler code for a UDF. For more information about staged and in-line handlers, see   [Keeping handler code in-line or on a stage](../inline-or-staged.md). * Libraries that the handler code requires as dependencies, including JAR files, Python modules, .zip files, and so on. For more   information, see [Making dependencies available to your code](../upload-dependencies.md). * Files containing content read by code in the handler. This includes unstructured data processed by the handler. |
| Function | OWNERSHIP |  |

### Calling UDFs or UDTFs

A role that calls a UDF or UDTF must have a minimum of the following privileges:

| Object | Privilege | Notes |
| --- | --- | --- |
| Database | USAGE |  |
| Schema | USAGE | Schema that contains the schema-level objects in this table. If the objects are contained in multiple schemas, the USAGE privilege is required on each. |
| Stage | USAGE (external stage) or READ (internal stage) | Required if the function depends on or reads from files on a stage. This would include the following staged files:   * File containing handler code for a UDF. For more information about staged and in-line handlers, see   [Keeping handler code in-line or on a stage](../inline-or-staged.md). * Libraries that the handler code requires as dependencies, including JAR files, Python modules, .zip files, and so on. For more   information, see [Making dependencies available to your code](../upload-dependencies.md). * Files containing content read by code in the handler. This includes unstructured data processed by the handler. |
| Function | USAGE | Required when anyone other than the function’s owner will call the function. USAGE on the function must be granted to a role that is assigned to a person who will call the function. |

---
title: Handling exceptions
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions.md
section: Developer Guide
---

# Handling exceptions

In a Snowflake Scripting block, you can raise an exception if an error occurs. You can also handle exceptions that occur in your
Snowflake Scripting code.

## Introduction to handling exceptions in Snowflake Scripting

Snowflake Scripting raises an exception if an error occurs while executing a statement. For example, if a statement
attempts to drop a table that doesn’t exist, Snowflake Scripting raises an exception.

In a Snowflake Scripting block, you can write exception handlers that catch specific types of exceptions declared in that block
and in blocks nested inside that block. In addition, for errors that can occur in your code, you can define your own exceptions
that you can raise when errors occur.

After the statements in the handler are run, you can choose to exit the block or continue running the statements in the block.
For more information, see Handling an exception in Snowflake Scripting.

When an exception is raised in a Snowflake Scripting block, either by your code or by a statement that fails to execute,
Snowflake Scripting attempts to find a handler for that exception:

* If the block in which the exception occurred has a handler for that exception, then execution resumes at the
  beginning of that exception handler.
* If the block doesn’t have its own exception handler, then the exception can be caught by the enclosing block.

  If the exception occurs more than one layer deep, then the exception is sent upward one layer at a time until either:

  + A layer with an appropriate exception handler handles the exception.
  + The outermost layer is reached, in which case an error occurs.
* If there is no handler for the exception in the current block or in any enclosing blocks, execution of the block stops, and the
  client that submits the block for execution (for example, Snowsight, SnowSQL, and so on) reports this as a Snowflake error.

An exception handler can contain its own exception handler in case an exception occurs while handling another exception.

## Declaring an exception in Snowflake Scripting

You can declare your own exception in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section of the block. Use
the syntax described in [Exception declaration syntax](../../sql-reference/snowflake-scripting/declare.md). For example:

```sqlexample
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
```

## Raising a declared exception in Snowflake Scripting

To raise an exception, execute the [RAISE](../../sql-reference/snowflake-scripting/raise.md) command. For example:

```sqlexample
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN counter;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN counter;
END;
$$
;
```

If there is no handler, execution stops at the point when the exception is raised. In the example, `counter` is never
incremented and isn’t returned.

The client that submits this block for execution — for example, Snowsight — reports an error and indicates that the exception
was not caught:

```none
-20002 (P0001): Uncaught exception of type 'MY_EXCEPTION' on line 8 at position 4 : Raised MY_EXCEPTION.
```

If you want to add code to handle any exceptions that you raise (as well as exceptions raised when statements fail to execute),
you can write exception handlers. See Handling an exception in Snowflake Scripting.

> **Note:**
>
> In an exception handler, if you need to raise the same exception again, see
> Raising the same exception again in an exception handler in Snowflake Scripting.

## Handling an exception in Snowflake Scripting

You can explicitly handle an exception by catching it with an [EXCEPTION](../../sql-reference/snowflake-scripting/exception.md)
clause, or you can allow the block to pass the exception on to the enclosing block.

Within the [EXCEPTION](../../sql-reference/snowflake-scripting/exception.md) clause, use a WHEN clause to handle an
exception by name. You can handle exceptions that you declare as well as built-in exceptions. Currently, Snowflake provides the
following built-in exceptions:

* STATEMENT_ERROR: This exception indicates an error while executing a statement. For example, if you attempt to drop a table
  that does not exist, this exception is raised.
* EXPRESSION_ERROR: This exception indicates an error related to an expression. For example, if you create an expression that
  evaluates to a VARCHAR, and you attempt to assign the value of the expression to a FLOAT, this error is raised.

Each WHEN clause in an exception block can be one of the following types:

* EXIT - The block runs the statements in the handler and then exits the current block. If the block runs an
  exception of this type, and the block contains statements after the statement that caused the error, those statements
  aren’t run.

  If the block is an inner block, and the exception handler doesn’t contain a RETURN statement, then
  execution exits the inner block and continues with the code in the outer block.

  EXIT is the default.
* CONTINUE - The block runs the statements in the exception block and continues with the statement
  immediately following the one that caused the error.

  A CONTINUE handler can catch and handle exceptions without ending the statement block that raised the exception.
  With the default EXIT handler, when an error occurs in a block, the flow is interrupted and the error is
  returned to the caller. However, you can use a CONTINUE handler when the error condition isn’t severe enough
  to warrant interrupting the flow.

An EXCEPTION clause can have WHEN clauses of both types — EXIT and CONTINUE.

When an exception occurs, you can get information about the exception by reading the following three built-in variables:

* SQLCODE: This is a 5-digit signed integer. For user-defined exceptions, this is the `exception_number` shown in
  the syntax for declaring an exception.
* SQLERRM: This is an error message. For user-defined exceptions, this is the `exception_message` shown in
  the syntax for declaring an exception.
* SQLSTATE: This is a 5-character code modeled on the ANSI SQL standard [SQLSTATE](https://en.wikipedia.org/wiki/SQLSTATE).
  Snowflake uses additional values beyond those in the ANSI SQL standard.

When you use a WHEN clause of the CONTINUE type, these built-in variables reflect the error that caused the exception
in the WHEN clause. After the statements in the WHEN clause complete, and statement execution continues in the block,
the values of these variables return the values they had before the exception was raised.

To handle all other exceptions that aren’t built-in or declared, use a WHEN OTHER THEN clause. The WHEN OTHER THEN
clause can be of type EXIT or CONTINUE.

For example, assume that you have the following error log table to track your exceptions:

```sqlexample
CREATE OR REPLACE TABLE test_error_log(
  error_type VARCHAR,
  error_code VARCHAR,
  error_message VARCHAR,
  error_state VARCHAR,
  error_timestamp TIMESTAMP);
```

The following anonymous block inserts information about the exceptions into the table
and returns information about them to the user:

> **Tip:**
>
> The example defines an exception in the DECLARE section and then handles that exception. For an example
> that handles a STATEMENT_ERROR exception, remove the comments (`--`) from this line:
>
> ```sqlexample
> -- SELECT 1/0;
> ```
>
> For an example that handles other errors, remove the comments from this line:
>
> ```sqlexample
> -- LET var := 1/0;
> ```

```sqlexample
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  -- SELECT 1/0;
  -- LET var := 1/0;
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN 'My counter value: ' || counter;
EXCEPTION
  WHEN STATEMENT_ERROR THEN
    INSERT INTO test_error_log VALUES(
      'STATEMENT_ERROR', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'STATEMENT_ERROR',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
  WHEN my_exception THEN
    INSERT INTO test_error_log VALUES(
      'MY_EXCEPTION', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'MY_EXCEPTION',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
  WHEN OTHER THEN
    INSERT INTO test_error_log VALUES(
      'OTHER', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'Other error',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  -- SELECT 1/0;
  -- LET var := 1/0;
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN 'My counter value: ' || counter;
EXCEPTION
  WHEN STATEMENT_ERROR THEN
    INSERT INTO test_error_log VALUES(
      'STATEMENT_ERROR', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'STATEMENT_ERROR',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
  WHEN my_exception THEN
    INSERT INTO test_error_log VALUES(
      'MY_EXCEPTION', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'MY_EXCEPTION',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
  WHEN OTHER THEN
    INSERT INTO test_error_log VALUES(
      'OTHER', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
    RETURN OBJECT_CONSTRUCT('Error type', 'Other error',
                            'SQLCODE', sqlcode,
                            'SQLERRM', sqlerrm,
                            'SQLSTATE', sqlstate);
END;
$$
;
```

For the returned value, this example handles each type of exception by calling
[OBJECT_CONSTRUCT](../../sql-reference/functions/object_construct.md) to construct and return an object that contains
the details about the exception. The example produces the following output:

```output
+--------------------------------------+
| anonymous block                      |
|--------------------------------------|
| {                                    |
|   "Error type": "MY_EXCEPTION",      |
|   "SQLCODE": -20002,                 |
|   "SQLERRM": "Raised MY_EXCEPTION.", |
|   "SQLSTATE": "P0001"                |
| }                                    |
+--------------------------------------+
```

You can query the `test_error_log` table to confirm that the error was logged:

```sqlexample
SELECT * FROM test_error_log;
```

```output
+--------------+------------+----------------------+-------------+-------------------------+
| ERROR_TYPE   | ERROR_CODE | ERROR_MESSAGE        | ERROR_STATE | ERROR_TIMESTAMP         |
|--------------+------------+----------------------+-------------+-------------------------|
| MY_EXCEPTION | -20002     | Raised MY_EXCEPTION. | P0001       | 2025-09-05 12:15:00.068 |
+--------------+------------+----------------------+-------------+-------------------------+
```

The previous example used WHEN clauses of the default type (EXIT). If one of the WHEN clauses catches
an exception, it runs the statements in the WHEN clause and then exits. Therefore, the following
code isn’t run:

```sqlexample
counter := counter + 1;
RETURN 'My counter value: ' || counter;
```

If you want to handle an exception and then continue running the code in the block, specify WHEN clauses
of the CONTINUE type. The following example is the same as the previous example, but it specifies WHEN
clauses of the CONTINUE type and removes the RETURN statement from each WHEN clause:

```sqlexample
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  -- SELECT 1/0;
  -- LET var := 1/0;
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN 'My counter value: ' || counter;
EXCEPTION
  WHEN STATEMENT_ERROR CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'STATEMENT_ERROR', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
  WHEN my_exception CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'MY_EXCEPTION', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
  WHEN OTHER CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'OTHER', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  my_exception EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
BEGIN
  -- SELECT 1/0;
  -- LET var := 1/0;
  LET counter := 0;
  LET should_raise_exception := true;
  IF (should_raise_exception) THEN
    RAISE my_exception;
  END IF;
  counter := counter + 1;
  RETURN 'My counter value: ' || counter;
EXCEPTION
  WHEN STATEMENT_ERROR CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'STATEMENT_ERROR', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
  WHEN my_exception CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'MY_EXCEPTION', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
  WHEN OTHER CONTINUE THEN
    INSERT INTO test_error_log VALUES(
      'OTHER', :sqlcode, :sqlerrm, :sqlstate, CURRENT_TIMESTAMP());
END;
$$
;
```

```output
+---------------------+
| anonymous block     |
|---------------------|
| My counter value: 1 |
+---------------------+
```

The output shows that the example continued running the following code after the exception was raised:

```sqlexample
counter := counter + 1;
RETURN counter;
```

For more information about CONTINUE handlers, see [EXCEPTION (Snowflake Scripting)](../../sql-reference/snowflake-scripting/exception.md).

In rare cases, you might want to explicitly handle an exception by doing nothing. This enables you to continue, rather than terminate,
when the exception occurs. For more information, see the [NULL](../../sql-reference/snowflake-scripting/null.md) command.

> **Note:**
>
> If you need to raise the same exception again, see Raising the same exception again in an exception handler in Snowflake Scripting.

If you don’t set up a handler for an exception, the client that submits the block for execution; for example,
Snowsight reports an error as explained in Raising a declared exception in Snowflake Scripting.

```none
-20002 (P0001): Uncaught exception of type 'MY_EXCEPTION' on line 8 at position 4 : Raised MY_EXCEPTION.
```

## Raising the same exception again in an exception handler in Snowflake Scripting

In some cases, you might need to raise the same exception that you caught in your exception handler. In these cases, execute the
RAISE command without specifying any arguments.

For example, suppose that during exception handling, you need to capture some details about the exception before raising the same
exception again. After capturing the details, execute the RAISE command:

```sqlexample
BEGIN
  SELECT * FROM non_existent_table;
EXCEPTION
  WHEN OTHER THEN
    LET LINE := SQLCODE || ': ' || SQLERRM;
    INSERT INTO myexceptions VALUES (:line);
    RAISE; -- Raise the same exception that you are handling.
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  SELECT * FROM non_existent_table;
EXCEPTION
  WHEN OTHER THEN
    LET LINE := SQLCODE || ': ' || SQLERRM;
    INSERT INTO myexceptions VALUES (:line);
    RAISE; -- Raise the same exception that you are handling.
END;
$$;
```

## Passing variables to an exception handler in Snowflake Scripting

You can pass variables to an exception handler. The exception handler can execute code based
on the value of the variable, and the variable value can be returned in error messages.

For a variable to be passed to a handler in the EXCEPTION section, the variable must be declared
in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section. If a variable is declared
in the [BEGIN … END](../../sql-reference/snowflake-scripting/begin.md) section of the block, it can’t
be accessed in the EXCEPTION section.

In addition, if you are writing a Snowflake Scripting stored procedure that accepts arguments, you can use those
arguments in an exception handler.

For example, the following anonymous block passes the value of the `counter_val` variable to
the exception handler:

```sqlexample
DECLARE
  counter_val INTEGER DEFAULT 0;
  my_exception EXCEPTION (-20002, 'My exception text');
BEGIN
  WHILE (counter_val < 12) DO
    counter_val := counter_val + 1;
    IF (counter_val > 10) THEN
      RAISE my_exception;
    END IF;
  END WHILE;
  RETURN counter_val;
EXCEPTION
  WHEN my_exception THEN
    RETURN 'Error ' || sqlcode || ': Counter value ' || counter_val || ' exceeds the limit of 10.';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  counter_val INTEGER DEFAULT 0;
  my_exception EXCEPTION (-20002, 'My exception text');
BEGIN
  WHILE (counter_val < 12) DO
    counter_val := counter_val + 1;
    IF (counter_val > 10) THEN
      RAISE my_exception;
    END IF;
  END WHILE;
  RETURN counter_val;
EXCEPTION
  WHEN my_exception THEN
    RETURN 'Error ' || sqlcode || ': Counter value ' || counter_val || ' exceeds the limit of 10.';
END;
$$
;
```

The block returns the following error message:

```output
+---------------------------------------------------------+
| anonymous block                                         |
|---------------------------------------------------------|
| Error -20002: Counter value 11 exceeds the limit of 10. |
+---------------------------------------------------------+
```

The following is an example of a Snowflake Scripting stored procedure that passes in an argument. The example demonstrates
how you can use the argument in an exception handler:

```sqlexample
CREATE OR REPLACE PROCEDURE exception_test_vars(amount INT)
  RETURNS TEXT
  LANGUAGE SQL
AS
DECLARE
  my_exception_1 EXCEPTION (-20002, 'Value too low');
  my_exception_2 EXCEPTION (-20003, 'Value too high');
BEGIN
  CREATE OR REPLACE TABLE test_order_insert(units INT);
  IF (amount < 1) THEN
    RAISE my_exception_1;
  ELSEIF (amount > 10) THEN
    RAISE my_exception_2;
  ELSE
    INSERT INTO test_order_insert VALUES (:amount);
  END IF;
  RETURN 'Order inserted successfully.';
EXCEPTION
  WHEN my_exception_1 THEN
    RETURN 'Error ' || sqlcode || ': Submitted amount ' || amount || ' is too low (1 or greater required).';
  WHEN my_exception_2 THEN
    RETURN 'Error ' || sqlcode || ': Submitted amount ' || amount || ' is too high (exceeds limit of 10).';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE exception_test_vars(amount INT)
  RETURNS TEXT
  LANGUAGE SQL
AS
$$
DECLARE
  my_exception_1 EXCEPTION (-20002, 'Value too low');
  my_exception_2 EXCEPTION (-20003, 'Value too high');
BEGIN
  CREATE OR REPLACE TABLE test_order_insert(units INT);
  IF (amount < 1) THEN
    RAISE my_exception_1;
  ELSEIF (amount > 10) THEN
    RAISE my_exception_2;
  ELSE
    INSERT INTO test_order_insert VALUES (:amount);
  END IF;
  RETURN 'Order inserted successfully.';
EXCEPTION
  WHEN my_exception_1 THEN
    RETURN 'Error ' || sqlcode || ': Submitted amount ' || amount || ' is too low (1 or greater required).';
  WHEN my_exception_2 THEN
    RETURN 'Error ' || sqlcode || ': Submitted amount ' || amount || ' is too high (exceeds limit of 10).';
END;
$$
;
```

The following calls to the stored procedure show the expected output:

```sqlexample
CALL exception_test_vars(7);
```

```output
+------------------------------+
| EXCEPTION_TEST_VARS          |
|------------------------------|
| Order inserted successfully. |
+------------------------------+
```

```sqlexample
CALL exception_test_vars(-3);
```

```output
+-----------------------------------------------------------------------+
| EXCEPTION_TEST_VARS                                                   |
|-----------------------------------------------------------------------|
| Error -20002: Submitted amount -3 is too low (1 or greater required). |
+-----------------------------------------------------------------------+
```

```sqlexample
CALL exception_test_vars(20);
```

```output
+----------------------------------------------------------------------+
| EXCEPTION_TEST_VARS                                                  |
|----------------------------------------------------------------------|
| Error -20003: Submitted amount 20 is too high (exceeds limit of 10). |
+----------------------------------------------------------------------+
```

---
title: Handling responses
source: https://docs.snowflake.com/en/developer-guide/sql-api/handling-responses.md
section: Developer Guide
---

# Handling responses

This topic explains how to handle a response from the SQL API.

## Understanding the flow of execution

By default, Snowflake executes the statement synchronously and returns one of the response codes shown in the flow chart below:

As shown in the flow chart above:

* If you submitted a single statement that was executed successfully, Snowflake returns the HTTP response code 200 and the
  rows from the results in a [ResultSet](reference.md) object.

  Use the `ResultSet` object to retrieve the results.
* If you submitted [multiple statements in a single request](submitting-multiple-statements.md) and the request
  was processed successfully, Snowflake returns the HTTP response code 200 and a
  [ResultSet](reference.md) object.

  The `ResultSet` object does not contain any rows from the results. Instead, the `data` field just contains the
  message “Multiple statements executed successfully.”

  To retrieve the data, you must get the handles of the individual statements in the request from the `statementHandles`
  field. For each statement handle, send a request to check the status of the execution of the statement. See
  Checking the status of the statement execution and retrieving the data.

  For more information about the process of handling a response for a request that specifies multiple SQL statements, see
  [Getting the results for each SQL statement in the request](submitting-multiple-statements.md).
* If the statement takes longer than 45 seconds to execute or if you specified that the statement should be executed
  asynchronously, Snowflake returns the HTTP response code 202 with a
  [QueryStatus](reference.md) object.

  You can send a request to the endpoint specified by the `statementStatusUrl` field in the `QueryStatus` object to
  check the status of the execution of the statement. See
  Checking the status of the statement execution and retrieving the data.

  If you want to cancel the execution of the statement, you can send a request to the
  `/api/v2/statements/statementHandle/cancel`, using the statement handle from the `statementHandle` field in the
  `QueryStatus` object. See [Canceling the execution of a SQL statement](cancelling-requests.md).

## Checking the status of the statement execution and retrieving the data

In some cases, you need to send a request to check the status of the execution of a statement:

* When you [submit a SQL statement for execution](submitting-requests.md), Snowflake returns a 202 response code
  if the execution of the statement has not yet completed or if you submitted an asynchronous query.

  To check if the statement has finished executing, you must send a request to check the status of the statement.
* If you [submitted multiple SQL statements in a single request](submitting-multiple-statements.md), you get the
  results of each individual statement by sending a request to check the status of the statement.

In both of these cases, you send a `GET` request to the `/api/v2/statements/` endpoint and append the statement handle to
the end of the URL path as a path parameter. See [GET /api/v2/statements/{statementHandle}](reference.md) for details.

```none
GET /api/v2/statements/{statementHandle}
```

`{statementHandle}` is the handle of the statement that you want to check. To get the statement handle:

* If you received response with a 202 response code, the body of the response includes a
  [QueryStatus](reference.md) object. You can get the statement handle from the
  `statementHandle` field of this object.

  Note that you can also get the full URL for the request from the `statementStatusUrl` field of this object.

  ```sqljson
  {
    "code": "090001",
    "sqlState": "00000",
    "message": "successfully executed",
    "statementHandle": "e4ce975e-f7ff-4b5e-b15e-bf25f59371ae",
    "statementStatusUrl": "/api/v2/statements/e4ce975e-f7ff-4b5e-b15e-bf25f59371ae"
  }
  ```
* If you submitted a request containing multiple SQL statements, the body of the response includes a `ResultSet` object that
  contains a `statementHandles` field. You can get the handles for the individual statements from this field.

  ```sqljson
  {
    ...
    "statementHandles" : [ "019c9fce-0502-f1fc-0000-438300e02412", "019c9fce-0502-f1fc-0000-438300e02416" ],
    ...
  }
  ```

For example, the following `curl` command checks that status of the statement with the handle
`e4ce975e-f7ff-4b5e-b15e-bf25f59371ae`:

```bash
curl -i -X GET \
    -H "Authorization: Bearer <jwt>" \
    -H "Content-Type: application/json" \
    -H "Accept: application/json" \
    -H "User-Agent: myApplicationName/1.0" \
    "https://<account_identifier>.snowflakecomputing.com/api/v2/statements/e4ce975e-f7ff-4b5e-b15e-bf25f59371ae"
```

where:

* `jwt` is the [JWT that you generated for authentication](authenticating.md).
* `myApplicationName` is an example of an identifier for your application.
* `account_identifier` is your [account identifier](../../user-guide/admin-account-identifier.md).

When you send a request to check the status, Snowflake returns one of the response codes shown in the flow chart below:

As shown in the flow chart above:

* If the statement has finished executing successfully, Snowflake returns the HTTP response code 200 and the rows from the results
  in a [ResultSet](reference.md) object.

  Use the `ResultSet` object to retrieve the results.
* If the statement has not finished executing, Snowflake returns the HTTP response code 202 or 429 with a
  [QueryStatus](reference.md) object.

  Use this object to check the status of the execution of the statement.
* If an error occurred when executing the statement, Snowflake returns the HTTP response code 422 with a
  [QueryFailureStatus](reference.md) object.

  You can get [details about the error](handling-errors.md) from this object.

## Getting the results from the response

> **Note:**
>
> Snowflake version 5.40 introduces changes to the way data returned by the Snowflake SQL API is handled, among other changes.
>
> This section describes how to get the results from a response using newer functionality of the Snowflake SQL API.
> For information on using the older, deprecated behavior see [Deprecated functionality](sql-api-old.md).

If you [submit a SQL statement for execution](submitting-requests.md) or
check the status of statement execution, Snowflake returns a
[ResultSet](reference.md) object in the body of the response if the statement was executed
successfully.

The Snowflake API returns data in partitions. Snowflake determines the number of partitions and the size of each
partition that is returned. The size of a partition is variable and is based on the amount of data returned by Snowflake for
a particular SQL query.

When you submit a request, the body of this response includes a `partitionInfo` field. This field contains an array of objects, each of which describes a partition of data. This first object describes the partition of data returned in this response.
The rest of the objects describe the additional partitions that you can retrieve by submitting subsequent requests with `partition=partition_number`.

Each object in the array specifies the number of rows and size of a partition. Your application can use this partition metadata to determine how to handle the partitions returned for subsequent requests.

The following shows an example of part of the response:

```sqljson
{
  "code": "090001",
  "statementHandle": "536fad38-b564-4dc5-9892-a4543504df6c",
  "sqlState": "00000",
  "message": "successfully executed",
  "createdOn": 1597090533987,
  "statementStatusUrl": "/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c",
  "resultSetMetaData" : {
    "numRows" : 50000,
    "format" : "jsonv2",
    "partitionInfo" : [ {
      "rowCount" : 12288,
      "uncompressedSize" : 124067,
      "compressedSize" : 29591
    }, {
      "rowCount" : 37712,
      "uncompressedSize" : 414841,
      "compressedSize" : 84469
    }],
  },
  "data": [
    ["customer1", "1234 A Avenue", "98765", "2021-01-20
    12:34:56.03459878"],
    ["customer2", "987 B Street", "98765", "2020-05-31
    01:15:43.765432134"],
    ["customer3", "8777 C Blvd", "98765", "2019-07-01
    23:12:55.123467865"],
    ["customer4", "64646 D Circle", "98765", "2021-08-03
    13:43:23.0"]
  ]
}
```

## Getting metadata about the results

In the `ResultSet` object returned in the response, the `resultSetMetaData` field contains a
[ResultSet_resultSetMetaData](reference.md) object that describes the
result set (for example, the format of the results, the number of partitions returned, etc.).

### Getting metadata about the columns returned in the `ResultSet` object

The `resultSetMetaData` field contains information about the columns returned in the `ResultSet` object.

In the example below, the `rowType` field contains an array of
[ResultSet_resultSetMetaData_rowType](reference.md)
objects. Each object describes a column in the results. The `type` field specifies the Snowflake data type of the
column.

```sqljson
{
 "resultSetMetaData": {
  "numRows": 1300,
  "rowType": [
   {
    "name":"ROWNUM",
    "type":"FIXED",
    "length":0,
    "precision":38,
    "scale":0,
    "nullable":false
   }, {
    "name":"ACCOUNT_NAME",
    "type":"TEXT",
    "length":1024,
    "precision":0,
    "scale":0,
    "nullable":false
   }, {
    "name":"ADDRESS",
    "type":"TEXT",
    "length":16777216,
    "precision":0,
    "scale":0,
    "nullable":true
   }, {
    "name":"ZIP",
    "type":"TEXT",
    "length":100,
    "precision":0,
    "scale":0,
    "nullable":true
   }, {
    "name":"CREATED_ON",
    "type":"TIMESTAMP_NTZ",
    "length":0,
    "precision":0,
    "scale":3,
    "nullable":false
   }],
  "partitionInfo": [{
    ... // Partition metadata
  }]
 }
}
```

### Getting metadata about the partitions returned by the `ResultSet` object

When you submit a request to execute a query, the response includes metadata that describes how the data is partitioned across responses as well as the first partition of data.

The `resultSetMetaData` field contains a `partitionInfo` field. This field contain an array of objects, each of which
describes a partition of data. This first object describes the partition of data returned in this response. The rest of the objects
describe the additional partitions that you can retrieve by submitting subsequent requests with `partition=partition_number`.

The following shows an example of part of the response:

```sqljson
  {
    "resultSetMetaData": {
    "numRows: 103477,
    "format": "jsonv2"
    "rowType": {
      ... // Column metadata.
    },
    "partitionInfo": [{
        "rowCount": 12344,
        "uncompressedSize": 14384873,
      },{
        "rowCount": 47387,
        "uncompressedSize": 76483423,
        "compressedSize": 4342748
      },{
        "rowCount": 43746,
        "uncompressedSize": 43748274,
        "compressedSize": 746323
    }]
  },
  ...
}
```

In this example, the first object in the `partitionInfo` field describes the partition of data in the data field of this response.

The second object describes the second partition of data, which contains 47387 rows and which you can retrieve by sending the request

`GET /api/v2/statements/handle?partition=1`.

The third object describes the third partition of data, which contains 43746 rows and which you can retrieve by sending the request

`GET /api/v2/statements/handle?partition=2`.

## Getting the data from the results

In the `ResultSet` object in the response, the results are in the `data` field. The `data` field
contains an array of an array of strings in JSON. For example:

```sqljson
{
  ...
  "data": [
    ["customer1", "1234 A Avenue", "98765", "2021-01-20 12:34:56.03459878"],
    ["customer2", "987 B Street", "98765", "2020-05-31 01:15:43.765432134"],
    ["customer3", "8777 C Blvd", "98765", "2019-07-01 23:12:55.123467865"],
    ["customer4", "64646 D Circle", "98765", "2021-08-03 13:43:23.0"]
  ],
  ...
}
```

Each array within the array contains the data for a row. The elements in each array represent the data in a row.

The data in the result set is encoded in JSON expressed as strings, regardless of the Snowflake data type of the column.

For example, the value `1.0` in a `NUMBER` column is returned as the string `"1.0"`. As another example, timestamps
are returned as the number of nanoseconds since the epoch. For example, the timestamp for Thursday, January 28, 2021
10:09:37.123456789 PM is returned as `"1611871777123456789"`.

You are responsible for converting the strings to the appropriate data types.

Snowflake returns the values as strings in the following formats (if no
output format parameter is specified in the POST submit statement request), depending on the
[Snowflake data type](../../sql-reference-data-types.md):

> INT / NUMBER:
> :   Decimal number in a string.
>
> FLOAT:
> :   Integer or float in a string.
>
> DECFLOAT:
> :   Integer or float in a string. If the number of significant digits is less than or equal to 38, the value is returned in plain format. Otherwise, the value is returned in scientific notation.
>
> VARCHAR:
> :   String.
>
> BINARY:
> :   Hexadecimal number in a string.
>
> BOOLEAN:
> :   “false” or “true” in a string.
>
> DATE:
> :   Integer value (in a string) of the number of days since the epoch (e.g. `18262`).
>
> TIME, TIMESTAMP_LTZ, TIMESTAMP_NTZ:
> :   Float value (with 9 decimal places) of the number of seconds since the epoch (e.g.
>     `82919.000000000`).
>
> TIMESTAMP_TZ:
> :   Float value (with 9 decimal places) of the number of seconds since the epoch, followed by a space and the timezone offset in minutes (for example, `1616173619.000000000 1500`). The offset value is a positive integer ranging from 720 (-12) to 2160 (+12), so `timezone_in_minutes = offset - 1440`. To calculate the timezone, you must subtract 1440 from the offset value (1500, in this example), which equals +60 minutes for a +0100 timezone.

## Retrieving additional partitions

The Snowflake SQL API returns data in partitions. The first partition is returned in JSON format and contains metadata about all of the partitions returned for a specific query. Your application can use this partition metadata to determine how to handle the partitions returned for subsequent requests.

After receiving the response containing the first partition of data, you can get the rest of the partitions by submitting requests with `partition=partition_number`, where `partition_number` identifies the partition of data to return. The partition number `0` identifies the first partition of data, which is returned in the initial request.

For example, after receiving the first partition of data, you can get the second partition of data by submitting a request with the partition parameter set to `1`:

```none
GET /api/v2/statements/<handle>?partition=1
```

In the response for a `GET /api/v2/statements/<handle>?partition=partition_number` request, the body contains JSON data in compressed form (using gzip).

The response includes the HTTP header `Content-Encoding: gzip`, which indicates that the body of the response is compressed.

These responses do not contain any metadata. Metadata for all partitions is provided in the first partition.

## Returning SQL NULL values as strings

By default, SQL NULL values are returned as the value `null`:

```sqljson
"data" : [ [ null ], ... ]
```

If you want these values returned as the string `"null"` instead, set the `nullable` query parameter to `false`
in the [POST request to submit the SQL statement for execution](submitting-requests.md). For example:

```none
POST /api/v2/statements?nullable=false
```

This returns SQL NULL values as `"null"`:

```sqljson
"data" : [ [ "null" ], ... ]
```

> **Note:**
>
> You cannot specify the `nullable` parameter in
> GET requests to check the statement status.

## Formatting the output of query results

The Snowflake SQL API supports parameters for formatting output (e.g. [Session parameters for dates, times, and timestamps](../../sql-reference/date-time-input-output.md)).

For example, by default, a DATE value like 2019-03-27 is returned as “17982” (2019-03-27 is 17982 days after the epoch). If you specify that the DATE_OUTPUT_FORMAT should be “MM/DD/YY” in the request:

```sqljson
{
  "statement": "select date_column from mytable",
  "resultSetMetaData": {
    "format": "jsonv2",
  },
  "parameters": {
    "DATE_OUTPUT_FORMAT": "MM/DD/YYYY"
  }
  ...
}
```

The DATE value is returned as “03/27/2019”.

In the `parameters` field in the body of the request, you can set the following parameters that determine the output format of
the data:

* BINARY_OUTPUT_FORMAT
* DATE_OUTPUT_FORMAT
* TIME_OUTPUT_FORMAT
* TIMESTAMP_LTZ_OUTPUT_FORMAT
* TIMESTAMP_NTZ_OUTPUT_FORMAT
* TIMESTAMP_TZ_OUTPUT_FORMAT
* TIMESTAMP_OUTPUT_FORMAT
* TIMEZONE

> **Note:**
>
> Snowflake ignores the account-level and user-level settings for these parameters. In order to change the format of the values in SQL API results, you must set these output parameters in the body of the request.

## Including row numbers in the `resultSet` object

Row numbers are not returned in the result set. To include row numbers in the response, call the SEQUENCE or ROW_NUMBER window function in your query to generate the row numbers.

---
title: How Snowflake represents trace events
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-how-events-work.md
section: Developer Guide
---

# How Snowflake represents trace events

Internally, Snowflake uses the [OpenTelemetry](https://opentelemetry.io/) data model to represent trace events inside an object called a span. A [span](https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/api.md#span)
describes an operation, such as the invocation of a stored procedure or the execution of a UDF over a set of rows. A span includes the
start time and end time of the operation.

> **Tip:**
>
> For guidelines to keep in mind when adding trace events, see [General guidelines for adding trace events](tracing.md).

## How Snowflake emits trace events

For a stored procedure or UDF, Snowflake may execute it in parallel when it is called, where each parallel execution unit executes on a different
set of rows. Any trace events that are emitted are scoped to their execution unit and are wrapped inside the same span.

For a Streamlit app, each user session is captured in a single span.

Trace events are emitted only after their execution unit completes. If the execution unit fails before completion, trace events from that
execution unit are not guaranteed to be emitted.

Trace events from different execution units are stored in separate rows of the event table (in other words, in different spans).

> **Note:**
>
> Because UDFs are applied per input table row, calls to trace event APIs in your UDF are executed for each input table row. Adding a trace
> event for every row is inadvisable in most cases. There is a limit of 128 events per execution unit.

## Example: Emitting events from a Java procedure

The following example illustrates how you can emit events from handler code. It also shows how the generated event data is stored in an
event table.

### Stored procedure with Java handler

The Java code in the following example illustrates how you can add events to a span, along with attribute data. For more information about
APIs for handler languages, see [Event tracing from handler code](tracing.md).

```sqlexample-java
CREATE OR REPLACE PROCEDURE test_stored_proc()
RETURNS STRING
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES=('com.snowflake:snowpark:latest', 'com.snowflake:telemetry:latest')
HANDLER = 'MyClass.run'
AS
$$
  import com.snowflake.snowpark_java.Session;
  import com.snowflake.telemetry.Telemetry;
  import io.opentelemetry.api.common.AttributeKey;
  import io.opentelemetry.api.common.Attributes;
  import io.opentelemetry.api.common.AttributesBuilder;

  public class MyClass {

    public String run(Session session) {
      // Adding an event without attributes.
      Telemetry.addEvent("testEvent");

      // Adding an event with attributes.
      Attributes eventAttributes = Attributes.of(
          AttributeKey.stringKey("key"), "run",
          AttributeKey.longKey("result"), Long.valueOf(123));
      Telemetry.addEvent("testEventWithAttributes", eventAttributes);

      // Setting span attributes of different types.
      Telemetry.setSpanAttribute("example.boolean", true);
      Telemetry.setSpanAttribute("example.long", 2L);
      Telemetry.setSpanAttribute("example.double", 2.5);
      Telemetry.setSpanAttribute("example.string", "testAttribute");

      return "SUCCESS";
    }
  }
$$;
```

### Span data recorded

After the function or procedure executes successfully, Snowflake renders the OpenTelemetry span object as objects in columns of the
event table, as shown in the following tables.

A span can have its own attributes. Since a span represents a stored procedure and UDF execution unit, you might find it useful to set
span-level attributes for later data analysis. For more information about how to set span attributes, see the content
specific to the language in which you’re writing handler code. For a list of these languages, see [Event tracing from handler code](tracing.md).

A span can hold a maximum number of 128 trace events and a maximum number of 128 span attributes.

* If the number of trace events exceeds the limit, events are dropped as follows, depending on the handler language:

  + For Python handlers, events are dropped in the order in which they were added (in other words, in first-in-first-out order).
  + For handlers written in Java, JavaScript, Scala, and Snowflake Scripting, new events are dropped once the limit has been reached.
* If the number of span attributes exceeds the limit, no more span attributes can be added.

> **Note:**
>
> As of November 2022, all `dropped_*_count` keys are not set for JavaScript because the OpenTelemetry JavaScript Tracing SDK does
> not report on dropped counts.

| Description | Data |
| --- | --- |
| Span recorded by Snowflake for the execution of the procedure containing the handler code. | * Start timestamp from the START_TIMESTAMP column:  `2023-03-21 23:12:06.231` * Finish timestamp from the TIMESTAMP column:  `2023-03-21 23:12:06.944` * Data from the RECORD column:  ```sqljson   {     "kind": "SPAN_KIND_INTERNAL",     "name": "snow.auto_instrumented",     "status": {       "code": "STATUS_CODE_UNSET"     }   }   ``` |
| Attributes added by handler code for the span. | * Data from the RECORD_ATTRIBUTES column:  ```sqljson   {     "example.boolean": true,     "example.double": 2.5,     "example.long": 2,     "example.string": "testAttribute"   }   ``` |

### Event data recorded

The span contains a list of trace events with timestamps that capture when the trace
events were added. Not shown here: The span has a `trace_id` which is the query ID without dashes. The span also has system-generated
values for the `span_id` and `name` keys. Events that are part of the span share the same `span_id`.

The following data was recorded for the event `testEvent`.

| Description | Data |
| --- | --- |
| Event name | * Timestamp from the TIMESTAMP column:  `2023-03-21 23:12:06.939` * Data from the RECORD column:  ```sqljson   {     "dropped_attributes_count": 0,     "name": "testEvent"   }   ``` |

The following data recorded for the event `testEventWithAttributes`.

| Description | Data |
| --- | --- |
| Event name | * Timestamp from the TIMESTAMP column:  `2023-03-21 23:12:06.940` * Data from the RECORD column:  ```sqljson   {     "dropped_attributes_count": 0,     "name": "testEventWithAttributes"   }   ``` |
| Event attributes | * Data from the RECORD_ATTRIBUTES column:  ```sqljson   {     "key": "run",     "result": 123   }   ``` |

---
title: Install a Declarative Native App
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/consumer/install.md
section: Developer Guide
---

# Install a Declarative Native App

Snowflake Declarative Native Apps are databases that you can use to gain access to data and functionality shared by Snowflake data providers.

You can use Snowsight to install and access Declarative Native Apps, or you can use SQL commands to access the data directly.

After you install an app, you can share it with other members of your organization.

## Security

Declarative Native Apps have a similar security model to secure data sharing:

* Apps only have access to the data included in the app.
* Apps can’t access the consumer’s private data.
* Apps aren’t allowed to make external calls or to access data outside of the Snowflake account.

## Prerequisites

To install a Declarative Native App, you must have a Snowflake account, and a role with either of the following privileges:

* The **ACCOUNTADMIN** role
* A role with both **CREATE APPLICATION** and **IMPORT LISTING** privileges

To purchase a paid listing, the role must also have the **PURCHASE DATA EXCHANGE LISTING** privilege.

### Grant installation privileges

An ACCOUNTADMIN can allow members of the organization to install
Declarative Native Apps by granting privileges to the member’s role,
using the [GRANT privileges TO ROLE](../../../sql-reference/sql/grant-privilege.md) commands:

```sqlsyntax
GRANT CREATE APPLICATION ON ACCOUNT TO ROLE <role_name>;
GRANT IMPORT LISTING ON ACCOUNT TO ROLE <role_name>;
```

## Install an app

Roles with installation privileges can install a Declarative Native App from the Snowflake Marketplace, or from a privately shared listing.

Snowflake MarketplaceFrom a privately shared listingFrom SQL

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search or browse to the listing you want to access.
4. Select the listing, and select Get or Buy.
5. (Optional) Enter a name for Application name.
6. Select Get.
7. Select Open to view the app, or select Done to finish.

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the tile for the listing under Recently shared with you.
4. Select Get.
5. Select Options and enter a name for the app.
6. Select the warehouse where you want to install the app.
7. Select Get.
8. Select Open to view your listing or Done to finish.
9. Explore the listing as you would any other listing.

1. Show the available listings in the Snowflake Data Marketplace with the command: [SHOW AVAILABLE LISTINGS IN DATA EXCHANGE SNOWFLAKE_DATA_MARKETPLACE](../../../sql-reference/sql/show-available-listings.md).

   ```sqlexample
   SHOW AVAILABLE LISTINGS IN DATA EXCHANGE SNOWFLAKE_DATA_MARKETPLACE;
   ```
2. Install the app with the command: [CREATE APPLICATION FROM LISTING](../../../sql-reference/sql/create-application.md).

   ```sqlexample
   CREATE APPLICATION <app_name> FROM LISTING <listing_name>;
   ```

The user who installs the app is the app owner. The app owner and the ACCOUNTADMIN have access to all objects shared in the app, including notebooks, tables, views, and other objects.

## Share access to the app

The app owner (or the ACCOUNTADMIN) can share access to the data and features in a Snowflake Declarative Native App
to members of their organization by their organization role.

They can share access to the entire app, or for some apps, they can share access to a subset of the data and features in the app, defined by app roles.

### Share access to all data and features in an app

App owners can share access to all of the data and features in an app with the command: [GRANT IMPORTED PRIVILEGES ON APPLICATION](../../../sql-reference/sql/grant-privilege.md).

In this example, an app owner imports privileges for the application: `marketing_data_app` to the `team_admin_role` organizational role:

```sqlexample
GRANT IMPORTED PRIVILEGES ON APPLICATION marketing_data_app TO ROLE team_admin_role;
```

> **Note:**
>
> Sharing app access doesn’t share the ability to share app privileges with others.

### App roles: Share access to a portion of the data and features in an app

Some Declarative Native Apps include app roles, which provide access to a subset of the data and features in an app. App owners can assign app roles to their organization roles. This grants members of the organization roles access to the data and features defined in the app roles.

1. List the available roles with the command: [SHOW APPLICATION ROLES](../../../sql-reference/sql/show-application-roles.md). For example:

   ```sqlexample
   SHOW APPLICATION ROLES IN APPLICATION marketing_data_app;
   ```

   The command lists the available app roles. If the app has no app roles, the command returns an empty result set.
2. Grant app roles to teams by their organization roles with the command: [GRANT APPLICATION ROLE …TO ROLE](../../../sql-reference/sql/grant-application-role.md) command.

   ```sqlexample
   GRANT APPLICATION ROLE marketing_data_app.sales TO ROLE sales_team_west;
   ```

Considerations:

* Consumers can’t share access to individual objects in the app, such as individual tables, views, or notebooks, except as defined by app roles.
* Consumers can’t define new app roles, or modify the existing app roles.
* Consumers can’t share access to the objects in the app with members outside their organization.

## Access the app

For information about using the app, see [Access content in a Declarative Native App](access-app-content.md).

---
title: Install Snowpark Submit
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-submit-install.md
section: Developer Guide
---

# Install Snowpark Submit

You can install Snowpark Submit to run batch-oriented Spark workloads directly on Snowflake’s infrastructure.

To install Snowpark Submit, complete the following steps:

1. Install Snowpark Submit by using `pip`.

   ```bash
   pip install snowpark-submit
   ```
2. In a [connections.toml](../python-connector/python-connector-connect.md) file for Snowflake authentication, add a Snowflake connection. If you already have a Snowflake connection, you can use that connection.

   If you don’t have a [connections.toml](../python-connector/python-connector-connect.md) file already, create one as described in [Connecting using the connections.toml file](../python-connector/python-connector-connect.md).

   Once you have a [connections.toml](../python-connector/python-connector-connect.md) file, you can add a Snowflake connection to it. For example, to add a Snowflake connection called `snowpark-submit`, add the following lines to the configuration file:

   ```toml
   [snowpark-submit]
   host = "<account>.snowflakecomputing.com"
   port = 443
   account = "<account>"
   user = "test_user"
   role = "test_role"
   password = "<password for user>"
   protocol = "https"
   warehouse = "test_warehouse"
   database = "test_db"
   schema = "test_schema"
   compute_pool = "test_compute_pool"
   ```
3. Verify that you can connect to Snowflake from your client computer.

   To verify that the connection works from your client computer, create a `.py` file with code that connects to Snowflake.

   1. Create a `connection_test.py` file, and then add the following code:

      ```python
      # connection_test.py code

      import sys
      import snowflake.connector

      conn_name = sys.argv[1]

      print(f"Trying connection named {conn_name}..")
      conn = snowflake.connector.connect(connection_name=conn_name)
      print("Connected.")

      cursor = conn.cursor()
      cursor.execute("SELECT 'Connection successful'")
      for col in cursor:
          print(col)

      print("\nListing first 5 tables:\n")
      cursor = conn.cursor()
      cursor.execute('show tables limit 5')
      for col in cursor:
          print(col)
      print("\nDone")
      ```
   2. From your active Python virtual environment, run the following command, specifying the name of the connection that you added to your
      `connections.toml` file.

      ```bash
      python connection_test.py snowpark-submit
      ```

Once you have verified that you can connect to Snowflake from your client computer, you can use Snowpark Submit to run batch-oriented Spark workloads directly on Snowflake’s infrastructure. See [Snowpark Submit reference](snowpark-submit-reference.md) for the Snowpark Submit command-line reference or [Snowpark Submit examples](snowpark-submit-examples.md) for examples of how to use Snowpark Submit.

---
title: Install the Snowflake Python APIs library
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-installing.md
section: Developer Guide
---

# Install the Snowflake Python APIs library

You can install the Snowflake Python APIs library for use with conda or a virtual environment. Before you start, be sure to review the
[supported Python versions](snowflake-python-overview.md).

To set up the Snowflake Python APIs library, complete the following steps:

1. Activate a Python environment.
2. Optional: To use the library in government regions, build the Python cryptography library
   in the environment.

   > **Note:**
   >
   > The Snowflake Python APIs library relies on the [Python cryptography library](https://pypi.org/project/cryptography/) for authentication.
   > If you’re using a FIPS-compliant Python environment, you must compile the cryptography library against the system’s FIPS-compliant OpenSSL.
3. Install the library.
4. Set options for the Python API client.

## Activate a Python environment

To set up an environment in which to run Python code, you need to activate a Python environment. For example, you can use conda or a
virtual environment (venv).

condavenv

> **Note:**
>
> These steps are only shown as an example, and following along with the example might require additional rights to third-party data,
> products, or services that are not owned or provided by Snowflake. Ensure that you have the appropriate rights to third-party data,
> products, or services before continuing.

You can use `conda` to create an environment for running Python code. If you don’t have conda, you can install it from the conda website.

For information about conda, see [Conda Documentation](https://docs.conda.io/en/latest/). To download and install conda, see
[Installing conda](https://conda.io/projects/conda/en/latest/user-guide/install/index.html).

1. Create a conda environment:

   ```bash
   conda create -n <env_name> python==3.10
   ```
2. Activate the environment:

   ```bash
   conda activate <env_name>
   ```

You can use `venv` to create a virtual environment for running Python code. If you don’t have Python yet, you can download and install Python,
and then create a virtual environment.

For information about venv, see [venv — Creation of virtual environments](https://docs.python.org/3/library/venv.html#module-venv).
To download Python, see [Python Downloads](https://www.python.org/downloads/).

1. Use `venv` to create a virtual environment:

   ```bash
   cd <your Python project root folder>
   python3 -m venv '.venv'
   ```
2. Activate the environment:

   ```bash
   source '.venv/bin/activate'
   ```

## Build the Python cryptography library for government regions

For authentication, the Snowflake Python APIs use the [Snowflake Connector for Python](../python-connector/python-connector.md),
which relies on the [Python cryptography library](https://pypi.org/project/cryptography/). The cryptography library depends on the
[OpenSSL](https://www.openssl.org/) C library for all cryptographic operations and ships wheel packages with a statically linked OpenSSL
dependency included.

As such, when you install `cryptography` by using the default command `pip install cryptography`, the library uses its own version
of OpenSSL rather than the system’s version. For more information, see [Use of OpenSSL](https://cryptography.io/en/latest/openssl/).

If you’re using the Python API to connect to Snowflake accounts in government regions, you need to ensure that you use a FIPS-compliant
Python environment. To ensure FIPS compliance, instead of installing the cryptography library from a PyPI wheel, you must compile it
yourself against your system’s FIPS-compliant OpenSSL.

* For instructions on building the cryptography library on your specific operating system, see
  [Installation](https://cryptography.io/en/latest/installation/#installation) in the `cryptography` documentation.

> **Important:**
>
> You must build the cryptography library in this manner before you run `pip install snowflake -U`. This build sets the `cryptography`
> dependency and ensures that the `cryptography` package is not pulled from PyPI.
>
> The cryptography library must be compiled using a version that meets the dependency requirements defined in the
> [Snowflake Connector for Python library](https://github.com/snowflakedb/snowflake-connector-python/blob/main/setup.cfg#L50).

## Install the Snowflake Python APIs library

You can install the Snowflake Python APIs library from the Python Package Index (PyPI).

* In the conda or virtual environment that you created, run the following `pip` command to install the library:

  ```bash
  pip install snowflake -U
  ```

  The [snowflake](https://pypi.org/project/snowflake/) package is the [PEP 420 namespace](https://peps.python.org/pep-0420/) parent
  package for the Snowflake Python APIs. It includes `snowflake.core`, which is the subpackage that provides Python APIs for managing Snowflake
  resource objects.

  Installing the `snowflake` package automatically installs `snowflake.core` along with its required dependencies, including
  `snowflake-connector-python`.
* To also install the [Snowpark ML](../snowflake-ml/snowpark-ml.md) library as an extra package dependency, you can run
  the following `pip` command:

  ```bash
  pip install "snowflake[ml]" -U
  ```

After you install the library, you must create a connection to Snowflake before you can use the API. For more information about connecting,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Set Python API client options

You can set the following environment variables to control client options for the Snowflake Python APIs:

`_SNOWFLAKE_PRINT_VERBOSE_STACK_TRACE`
:   Specifies whether full stack tracing is enabled in printed error messages.

    Possible values:

    * Enabled: `true`, `t`, `yes`, `y`, `on`, or undefined
    * Disabled: Any other value

    Default: Enabled

    When this option is disabled, the API client sets `sys.tracebacklimit` to `0` when processing requests. This setting causes the
    client to suppress traceback information for all types of exceptions (not only the ones related to the API client) and to print only the error
    messages.

    To disable this option for Python notebook environments, run the following line in your notebook:

    ```bash
    %env _SNOWFLAKE_PRINT_VERBOSE_STACK_TRACE=false
    ```

`_SNOWFLAKE_ENABLE_RETRY_REQUEST_QUERY`
:   Specifies whether automatic retries are enabled on query requests with specific status codes.

    Possible values:

    * Enabled: `true`, `t`, `yes`, `y`, `on`
    * Disabled: Any other value or undefined

    Default: Enabled

    When this option is enabled, the API client automatically retries query requests when they have the following status codes:

    * `202`
    * `429`
    * `503`
    * `504`

---
title: Installing and configuring the ODBC Driver for Linux
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-linux.md
section: Developer Guide
---

# Installing and configuring the ODBC Driver for Linux

Linux uses named data sources (DSNs) for connecting ODBC-based client applications to Snowflake. You can choose to install the ODBC driver using the TGZ file, RPM package, or DEB package provided in the Snowflake Client Repository.

## Prerequisites

### Operating system

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

With ODBC version 3.0.1, the driver no longer supports CentOS 6 versions.

### Driver manager: iODBC or unixODBC

A driver manager is required to manage communication between Snowflake and the ODBC driver. The driver supports using either iODBC or unixODBC as the driver manager.

#### iODBC

If iODBC is not installed on CentOS, as `sudo`, execute the following command:

```bash
yum install libiodbc
```

#### unixODBC

unixODBC provides the `odbcinst` and `isql` command-line utilities used to install, configure, and test the driver. To verify whether unixODBC is installed, execute the following commands:

```bash
which odbcinst

which isql
```

If unixODBC is not installed:

1. As `sudo`, execute the following commands:

> ```bash
> yum search unixODBC
>
> yum install unixODBC.x86_64
> ```

1. Verify the directory where `odbcinst` expects the `odbcinst.ini` and `odbc.ini` files to be located:

   > ```bash
   > odbcinst -j
   > ```

   The location should be `/etc`.

## Step 1: Verify the package signature (RPM or DEB only) — *Optional*

> **Note:**
>
> If you are installing the ODBC driver by using `yum` or the
> TGZ file, skip this step.

If you are installing the ODBC driver using the RPM or DEB package and wish to verify the package signature before installation, perform the following tasks:

### 1.1: Download and import the latest Snowflake public key

From the public keyserver, download and import the Snowflake GPG public key for the version of the ODBC driver that you are using:

* For version 3.6.0 and higher:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 2A3149C82551A34A
  ```
* For version 3.5.0:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 5A125630709DD64B
  ```
* For version 2.25.6 through 3.4.1:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 630D9F3CAB551AF3
  ```
* For version 2.22.1 through 2.25.5:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 37C7086698CB005C
  ```
* For version 2.18.2 through 2.22.0:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys EC218558EABB25A1
  ```
* For version 2.18.1 and lower:

  ```
  $ gpg --keyserver hkp://keyserver.ubuntu.com --recv-keys 93DB296A69BE019A
  ```

> **Note:**
>
> If this command fails with the following error:
>
> > ```none
> > gpg: keyserver receive failed: Server indicated a failure
> > ```
>
> then specify that you want to use port 80 for the keyserver:
>
> > ```bash
> > gpg --keyserver hkp://keyserver.ubuntu.com:80  ...
> > ```

### 1.2: Download the RPM or DEB driver package

Download the package from the Snowflake Client Repository. For details, see [Downloading the ODBC Driver](odbc-download.md).

### 1.3: Verify the signature for the RPM or DEB driver package

#### RPM package signature

1. Verify the key was imported successfully:

   > ```bash
   > gpg --list-keys
   > ```

   The command should display the Snowflake key.
2. Verify the signature:

   > ```bash
   > rpm -K snowflake-odbc-<version>.x86_64.rpm
   > ```

   > **Note:**
   >
   > If `rpm` does not have the GPG key that you imported, the command will report that the signatures are not OK and will
   > produce a `NOKEY` warning:
   >
   > > ```bash
   > > rpm -K snowflake-odbc-<version>.x86_64.rpm
   > > ```
   > >
   > > ```output
   > > snowflake-odbc-<version>.x86_64.rpm: digests SIGNATURES NOT OK
   > >
   > > rpm -Kv snowflake-odbc-<version>.x86_64.rpm
   > > ```
   > >
   > > ```output
   > > snowflake-odbc-<version>.rpm:
   > >     Header V4 RSA/SHA1 Signature, key ID 98cb005c: NOKEY
   > >     Header SHA1 digest: OK
   > >     V4 RSA/SHA1 Signature, key ID 98cb005c: NOKEY
   > >     MD5 digest: OK
   > > ```
   >
   > If this occurs, run the following commands to export the GPG key, import the key into `rpm`, and verify the
   > signature again:
   >
   > > ```bash
   > > gpg --export -a <GPG_KEY_ID> > odbc-signing-key.asc
   > > sudo rpm --import odbc-signing-key.asc
   > > rpm -K snowflake-odbc-<version>.x86_64.rpm
   > > ```
   >
   > where `<GPG_KEY_ID>` is the ID for the key that you installed in 1.1: Download and import the latest Snowflake public key.

#### DEB package signature

1. Install the package signature verification tool:

   > ```bash
   > sudo apt-get install debsig-verify
   > ```
2. Import the public key to the keyring:

   > ```bash
   > mkdir /usr/share/debsig/keyrings/<GPG_KEY_ID>
   > gpg --export <GPG_KEY_ID> > snowflakeKey.asc
   > touch /usr/share/debsig/keyrings/<GPG_KEY_ID>/debsig.gpg
   > gpg --no-default-keyring --keyring /usr/share/debsig/keyrings/<GPG_KEY_ID>/debsig.gpg --import snowflakeKey.asc
   > ```

   where `<GPG_KEY_ID>` is the ID for the key that you installed in 1.1: Download and import the latest Snowflake public key.
3. Configure a policy for the key. For details, see `/usr/share/doc/debsig-verify`. The policy must be stored in the following directory:

   > ```bash
   > /etc/debsig/policies/<GPG_KEY_ID>
   > ```

   where `<GPG_KEY_ID>` is the ID for the key that you installed in 1.1: Download and import the latest Snowflake public key.

   Store the policy in a file named `policy_name.pol`, where `policy_name` is your name for the policy. For the policy name, you can use any text string, however the string cannot contain blank spaces.

   Here is a sample policy file for a key with the ID 2A3149C82551A34A:

   > ```
   > <?xml version="1.0"?>
   > <!DOCTYPE Policy SYSTEM "http://www.debian.org/debsig/1.0/policy.dtd">
   > <Policy xmlns="https://www.debian.org/debsig/1.0/">
   > <Origin Name="Snowflake Computing" id="2A3149C82551A34A"
   > Description="Snowflake ODBC Driver DEB package"/>
   >
   > <Selection>
   > <Required Type="origin" File="debsig.gpg" id="2A3149C82551A34A"/>
   > </Selection>
   >
   > <Verification MinOptional="0">
   > <Required Type="origin" File="debsig.gpg" id="2A3149C82551A34A"/>
   > </Verification>
   >
   > </Policy>
   > ```
4. Verify the signature:

   > ```bash
   > sudo debsig-verify snowflake-odbc-<version>.x86_64.deb
   > ```

> **Note:**
>
> By default, the dpkg package signature verification tool does not check the signature when you install the package. If you want to verify the signature every time you run dpkg, remove the
> `--no-debsig` line in the `/etc/dpkg/dpkg.cfg` file.

### 1.4: Delete the old Snowflake public key — *Optional*

Your local environment can contain multiple GPG keys; however, for security reasons, Snowflake periodically rotates the public GPG key. As a best practice, we recommend deleting the existing public key
after confirming that the latest key works with the latest signed package.

To delete the key:

> ```bash
> gpg --delete-key "Snowflake Computing"
> ```

## Step 2: Install the ODBC Driver

Install the driver using one of the following approaches:

* Use yum to download and install the driver.
* Install the driver by using the downloaded TGZ file (TAR file compressed using .GZIP).
* Install the downloaded RPM package.
* Install the downloaded DEB package.

### Using yum to download and install the driver

With version 2.21.1 of the ODBC Driver (and later versions), you can use `yum` to download and install the driver.

To download and install the Snowflake ODBC driver for Linux using `yum`:

1. Create a file named `/etc/yum.repos.d/snowflake-odbc.repo`, and add the following text to the file:

   ```ini
   [snowflake-odbc]
   name=snowflake-odbc
   baseurl=https://sfc-repo.snowflakecomputing.com/odbc/linux/<VERSION_NUMBER>/
   gpgkey=https://sfc-repo.snowflakecomputing.com/odbc/Snowkey-<GPG_KEY_ID>-gpg
   ```

   where `VERSION_NUMBER` is the specific version number of the driver (for example, 3.16.0) and `GPG_KEY_ID` is one of the
   following key IDs:

   | ODBC Driver Version | GPG Key ID |
   | --- | --- |
   | 3.6.0 and higher | 2A3149C82551A34A |
   | 3.5.0 | 5A125630709DD64B |
   | 2.25.6 through 3.4.1 | 630D9F3CAB551AF3 |
   | 2.22.1 through 2.25.5 | 37C7086698CB005C |

   In the settings above, `baseurl` and `gpgkey` point to the [Snowflake Client Repository](../../user-guide/snowflake-client-repository.md) on Amazon S3. If
   you want to use the mirror on Azure Blob instead, change the hostname to `https://sfc-repo.azure.snowflakecomputing.com/`:

   ```ini
   [snowflake-odbc]
   name=snowflake-odbc
   baseurl=https://sfc-repo.azure.snowflakecomputing.com/odbc/linux/<VERSION_NUMBER>/
   gpgkey=https://sfc-repo.azure.snowflakecomputing.com/odbc/Snowkey-<GPG_KEY_ID>-gpg
   ```
2. Run the following command to install the driver:

   ```bash
   yum install snowflake-odbc
   ```

### Installing the TGZ file

To install the Snowflake ODBC driver for Linux using
[the TGZ file that you downloaded earlier](odbc-download.md).

1. Copy the downloaded file (`snowflake_linux_x8664_odbc-version.tgz`) to a working directory.
2. Unzip the file:

> ```bash
> gunzip snowflake_linux_x8664_odbc-<version>.tgz
> ```

1. Extract the files from the .tar file:

> ```bash
> tar -xvf snowflake_linux_x8664_odbc-<version>.tar
> ```

1. Copy the resulting `snowflake_odbc` folder to the directory where you want to install the driver. Make note of this directory. You’ll need the location later in the instructions.

### Installing the RPM package

> **Note:**
>
> The RPM package requires unixODBC as the driver manager.

To install the Snowflake ODBC driver for Linux using
[the RPM package that you downloaded earlier](odbc-download.md), after
optionally verifying the package signature, run the following command:

> ```bash
> yum install snowflake-odbc-<version>.x86_64.rpm
> ```

> **Note:**
>
> The installation directory is `/usr/lib64/snowflake/odbc/`. You’ll need the location later in the instructions.
>
> If the driver cannot find the library, it displays an `Unable to locate SQLGetPrivateProfileString function` error. In this case, you must set `ODBCInstLib=<driver_manager_path>` manually in the `simba.snowflake.ini` configuration file with the name of the driver manager on your system. For more information, see Configure the ODBC Driver.
>
> For example, `ODBCInstLib=/usr/lib/x86_64-linux-gnu/libodbcinst.so.2`.

### Installing the DEB package

> **Note:**
>
> The DEB package requires unixODBC as the driver manager. Please make sure that unixodbc and odbcinst packages are installed, before attempting to install the DEB package.

To install the Snowflake ODBC driver for Linux using
[the DEB package that you downloaded earlier](odbc-download.md), after
optionally verifying the package signature, run the following command:

```bash
sudo SF_ACCOUNT="<account>" dpkg -i snowflake-odbc-<version>.x86_64.deb
```

If the `SF_ACCOUNT` variable is unset, the `dpkg` command shows a warning. When you set the variable as shown, a Snowflake connection is added to the `odbc.ini` file.

The command might fail if any required dependencies for the package manager are not installed. If that happens, install them now:

```bash
sudo apt-get install -f
```

> **Note:**
>
> The installation directory is `/usr/lib/snowflake/odbc/`. You’ll need the location later in the instructions.

## Step 3: Configure the environment (TGZ only)

> **Note:**
>
> If you installed the ODBC driver using the RPM or DEB package file, skip this step.

If you installed using the TGZ file, configure the environment using the installed driver manager (either iODBC or unixODBC).

### Configuring with iODBC

In a terminal window, change to the `snowflake_odbc` directory, and run the following command to install Snowflake ODBC:

```bash
./iodbc_setup.sh
```

This script completes the following steps:

> * Adds one Snowflake connection to your system-level `/etc/odbc.ini` file.
> * Adds the Snowflake driver information to your system-level `/etc/odbcinst.ini` file.
> * Adds all certificate authority (CA) certificates required by the Snowflake ODBC driver to your system-level `simba.snowflake.ini` file.

By running `iodbc_setup.sh`, you don’t need to set any environment variables.

Alternatively, if you don’t want Snowflake to change your system configurations, add the following environment variables to your shell configuration file (e.g. `.profile`, `.bash_profile`):

> * `ODBCINI = <path>/conf/odbc.ini`
> * `ODBCINSTINI = <path>/conf/odbcinst.ini`

Where `path` is the location of the `snowflake_odbc` directory. If you have configured other ODBC drivers in your system and plan to add the Snowflake ODBC entries to your existing `odbc.ini` and
`odbcinst.ini` files in the next step, then point ODBCINI and ODBCINSTINI to the location of those files.

### Configuring with unixODBC

In a terminal window, change to the `snowflake_odbc` directory, and run the following command to install Snowflake ODBC:

```bash
./unixodbc_setup.sh
```

This script completes the following steps:

> * Adds a Snowflake connection to your system-level `/etc/odbc.ini` file.
> * Adds the Snowflake driver information to your system-level `/etc/odbcinst.ini` file.
> * Adds all certificate authority (CA) certificates required by the Snowflake ODBC driver to your system-level `simba.snowflake.ini` file.

By running `unixodbc_setup.sh`, you don’t need to set any environment variables.

Alternatively, if you don’t want Snowflake change your system configurations, add the following environment variables to your shell configuration file, e.g. `.profile`, `.bash_profile`:

> * `ODBCSYSINI = <path>/conf/`

Where `path` is the location of the `snowflake_odbc` directory. If you have configured other ODBC drivers in your system and plan to add the Snowflake ODBC entries to your existing `odbc.ini` and
`odbcinst.ini` files in the next step, then point ODBCSYSINI to the location of those files.

## Step 4: Configure the ODBC Driver

Configuring the ODBC driver requires adding entries to the following files:

* `<path>/lib/simba.snowflake.ini`
* `/etc/odbcinst.ini` (or `<path>/conf/odbc.ini`, if you are using environment variables)
* `/etc/odbc.ini` (or `<path>/conf/odbcinst.ini`, if you are using environment variables)

Where `path` is the location of the `snowflake_odbc` directory.

### 4.1: `simba.snowflake.ini` file (driver manager and logging)

Add the following entries to the `simba.snowflake.ini` file:

> ```ini
> ErrorMessagesPath=<path>/ErrorMessages/
> LogPath=/tmp/
> ODBCInstLib=<driver_manager_path>
> CABundleFile=<path>/lib/cacert.pem
> ANSIENCODING=UTF-8
> ```

Where:

> * `path` is the location of the `snowflake_odbc` directory.
> * `driver_manager_path` is the location of your driver manager directory:
>
>   > + iODBC: `ODBCInstLib=libiodbcinst.so.2`
>   > + unixODBC: `ODBCInstLib=libodbcinst.so`
>   > > **Note:**
>   > >
>   > > If your driver manager directory is not included in the `LD_LIBRARY_PATH` environment variable, specify the full path to the driver manager library here.

Verify that you have write permissions on the log path.

The `ANSIENCODING` parameter specifies the application’s character encoding. The default is `UTF-8`. The
parameter is intended for use only by Snowflake; customers should not change the value.

### 4.2: `odbcinst.ini` file (driver registration)

Add the following entries to the `odbcinst.ini` file:

> ```ini
> [ODBC Drivers]
> SnowflakeDSIIDriver=Installed
>
> [SnowflakeDSIIDriver]
> APILevel=1
> ConnectFunctions=YYY
> Description=Snowflake DSII
> Driver=/<path>/lib/libSnowflake.so
> DriverODBCVer=03.52
> SQLLevel=1
> ```

Where `path` is the location of the `snowflake_odbc` directory.

### 4.3: `odbc.ini` file (DSN entries)

For each DSN, add the following entries to the `odbc.ini` file:

* DSN Name and driver name (SnowflakeDSIIDriver), in the form of `<dsn_name> = <driver_name>`.
* Parameters:

  > + Required connection parameters, such as `server`.
  > + Any additional, optional parameters, such as default `role`, `database`, and `warehouse`.

  Parameters are specified in the form of `<parameter_name> = <value>`. For details about the parameters that can be set for each DSN, see [ODBC configuration and connection parameters](odbc-parameters.md).

The following example illustrates an `odbc.ini` file that configures two data sources that use different forms of an
[account identifier](../../user-guide/gen-conn-config.md) in the `server` URL:

* `testodbc1` uses the [account name as an identifier](../../user-guide/admin-account-identifier.md) for the account `myaccount` in the
  organization `myorganization`.
* `testodbc2` uses the [account locator](../../user-guide/admin-account-identifier.md) `xy12345` as the account identifier.

  Note that `testodbc2` uses an account in the AWS US West (Oregon) region. If the account is in a different region or if
  the account uses a different cloud provider, you need to
  [specify additional segments after the account locator](../../user-guide/admin-account-identifier.md).

  ```ini
  [ODBC Data Sources]
  testodbc1 = SnowflakeDSIIDriver
  testodbc2 = SnowflakeDSIIDriver

  [testodbc1]
  Driver      = /usr/jsmith/snowflake_odbc/lib/libSnowflake.so
  Description =
  server      = myorganization-myaccount.snowflakecomputing.com
  role        = sysadmin

  [testodbc2]
  Driver      = /usr/jsmith/snowflake_odbc/lib/libSnowflake.so
  Description =
  server      = xy12345.snowflakecomputing.com
  role        = analyst
  database    = sales
  warehouse   = analysis
  ```

Note the following:

* Both `testodbc1` and `testodbc2` have default roles.
* `testodbc2` also has a default database and warehouse.

## Step 5: Test the ODBC Driver

Test the driver using the installed driver manager (either iODBC or unixODBC).

### Testing with iODBC

Test the DSNs you created. On the command line, specify the DSN name, user login name, and password, using the following format:

> `iodbctest "DSN=<dsn_name>;UID=<user_name>;PWD=<password>"`

For example:

```bash
iodbctest "DSN=testodbc2;UID=mary;PWD=password"
```

```output
iODBC Demonstration program
This program shows an interactive SQL processor
Driver Manager: 03.52.0709.0909
Driver: 2.12.70 (Snowflake)

SQL>
```

### Testing with unixODBC

Test the DSNs you created using the `isql` command-line utility provided with `unixODBC`.

On the command line, specify the DSN name, user login name, and password.

For example:

```bash
isql -v testodbc2 mary <password>
```

```output
Dec 14 22:57:50 INFO  2022078208 Driver::LogVersions: SDK Version: 09.04.09.1013
Dec 14 22:57:50 INFO  2022078208 Driver::LogVersions: DSII Version: 2.12.36
Dec 14 22:57:50 INFO  2022078208 SFConnection::connect: Tracing level: 4

+---------------------------------------+
| Connected!                            |
|                                       |
| sql-statement                         |
| help [tablename]                      |
| quit                                  |
|                                       |
+---------------------------------------+
SQL>
```

---
title: Installing and configuring the ODBC Driver for macOS
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-mac.md
section: Developer Guide
---

# Installing and configuring the ODBC Driver for macOS

Similar to Windows, macOS utilizes named data sources (DSNs) for connecting ODBC-based client applications to Snowflake.

## Prerequisites

### Operating system

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

With ODBC version 3.0.1, the driver no longer supports MacOS 10.14 and 10.15 versions.

### iODBC

The Snowflake ODBC driver for Mac requires iODBC.

You can download the iODBC from:

> <https://www.iodbc.org/dataspace/doc/iodbc/wiki/iodbcWiki/Downloads>

To install iODBC:

1. After downloading iODBC, double-click on the downloaded .dmg file.
2. Double-click on the installer file, `iODBC-SDK.pkg`, and follow the prompts.

   By default, the package installs the software in the `/Library/Application Support/iODBC/bin` directory. You can add this directory to the
   `$PATH` environment variable to avoid needing to specify the full pathname to execute any of the iODBC commands.

> **Note:**
>
> iODBC provides a GUI administrator tool for configuring drivers and DSNs; however, this tool has not been tested for use with Snowflake and,
> therefore, should not be used to create or manage DSNs.

## Step 1: Install the ODBC Driver

To install the Snowflake ODBC driver for macOS:

1. If you haven’t already downloaded the driver, download it now. For details, see [Downloading the ODBC Driver](odbc-download.md).
2. Open the downloaded .dmg file, `snowflake_odbc_mac-<version>.dmg`.
3. Open the installer file, `snowflakeODBC_<version>.pkg`, and follow the prompts.

   You will likely be prompted for the administrator/sudo password for the machine on which you are installing the driver.

If you choose the default directory when prompted, the installer installs the ODBC driver files in the following directories:

> `/opt/snowflake/snowflakeodbc`
>
> `/Library/ODBC`

## Step 2: Configure the ODBC Driver

To configure the ODBC driver for macOS, create one or more data source (DSNs), which are stored in the following files, depending on the type of DSN you create:

> * User DSNs: `~/Library/ODBC/odbc.ini`
> * System DSNs: `/Library/ODBC/odbc.ini`

To create a DSN, edit the appropriate `odbc.ini` file.

### Creating a DSN by adding an entry in the `odbc.ini` file

If a user or system DSN has already been created for the driver, add the new entry to the `odbc.ini` file that already exists in the corresponding directory for the type of DSN you are creating. If you are creating the first DSN
for the driver, you must manually create the `odbc.ini` file and add the entry to the file.

For each DSN, specify:

* DSN name and driver name (Snowflake), in the form of `<dsn_name> = <driver_name>`.
* Directory path and name of the driver file, in the form of `Driver = /opt/snowflake/snowflakeodbc/lib/universal/libSnowflake.dylib`.
* Connection parameters, such as `server` and `uid` (user login name). Any connection parameters you add to the DSN do not need to be specified in the ODBC connect string.
* Any additional parameters, such as default `role`, `database`, and `warehouse`.

Parameters are specified in the form of `<parameter_name> = <value>`. For details about the parameters that can be set for each DSN, see [ODBC configuration and connection parameters](odbc-parameters.md).

The following example illustrates an `odbc.ini` file that configures two data sources that use different forms of an
[account identifier](../../user-guide/gen-conn-config.md) in the `server` URL:

* `testodbc1` uses the [account name as an identifier](../../user-guide/admin-account-identifier.md) for the account `myaccount` in the
  organization `myorganization`.
* `testodbc2` uses the [account locator](../../user-guide/admin-account-identifier.md) `xy12345` as the account identifier.

  Note that `testodbc2` uses an account in the AWS US West (Oregon) region. If the account is in a different region or if
  the account uses a different cloud provider, you need to
  [specify additional segments after the account locator](../../user-guide/admin-account-identifier.md).

  ```ini
  [ODBC Data Sources]
  testodbc1 = Snowflake
  testodbc2 = Snowflake

  [testodbc1]
  Driver      = /opt/snowflake/snowflakeodbc/lib/universal/libSnowflake.dylib
  Description =
  uid         = peter
  server      = myorganization-myaccount.snowflakecomputing.com
  role        = sysadmin

  [testodbc2]
  Driver      = /opt/snowflake/snowflakeodbc/lib/universal/libSnowflake.dylib
  Description =
  uid         = mary
  server      = xy12345.snowflakecomputing.com
  role        = analyst
  database    = sales
  warehouse   = analysis
  ```

Note the following:

* Both `testodbc1` and `testodbc2` have default roles.
* `testodbc2` also has a default database and warehouse.

## Step 3: Test the ODBC Driver

You can use the `iodbctest` command-line utility provided with iODBC to test the DSNs you create.

When prompted for the ODBC connect string, enter the required connection parameters (DSN name, server, user login name, and password), as well as any other parameters that you would like to enter as part of the connect string. The
connect string takes parameters in the form of `<parameter_name>=<value>`, e.g. `dsn=testodbc2`, with each parameter separated by a semi-colon (`;`) and no blank spaces. For the list of supported parameters, see
[ODBC configuration and connection parameters](odbc-parameters.md).

> **Note:**
>
> If you set the server and user login name in the DSN, the only required parameters in the connect string are the DSN name and user password.

For example:

```none
$ "/Library/Application Support/iODBC/bin/iodbctest"

iODBC Demonstration program
This program shows an interactive SQL processor
Driver Manager: 03.52.0607.1008

Enter ODBC connect string (? shows list): dsn=testodbc2;pwd=<password>

Dec 14 20:16:08 INFO  1299 SFConnection::connect: Tracing level: 4

Driver: 2.12.36 (Snowflake - Latest version supported by Snowflake: 2.12.38)

SQL>
```

---
title: Installing and configuring the ODBC Driver for Windows
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-windows.md
section: Developer Guide
---

# Installing and configuring the ODBC Driver for Windows

Windows utilizes named data sources (DSNs) for connecting ODBC-based client applications to Snowflake.

## Prerequisites

### Operating system

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

### Administrator privileges

To install the ODBC driver, you need administrator-level privileges so
that the driver can be installed in the `C:Program Files` directory.

### Visual C++ Redistributable for Visual Studio 2015

To use Snowflake ODBC Driver in a Windows environment, you have to first install Visual C++ Redistributable for Visual Studio 2015.

You can download the installation file from:

> <https://learn.microsoft.com/en-us/cpp/windows/latest-supported-vc-redist?view=msvc-170#visual-studio-2015-2017-2019-and-2022>

## Step 1: Install the ODBC Driver

1. If you haven’t already downloaded the latest driver version, download it now. For details, see [Downloading the ODBC Driver](odbc-download.md).
2. Double-click on the downloaded .msi file:

   > **Note:**
   >
   > The driver is installed in `C:Program Files`.

## Step 2: Configure the ODBC Driver

To configure the ODBC driver in a Windows environment, create a DSN for the driver:

1. Launch the Windows Data Source Administration Tool:

   Search on your Windows machine for the launcher for the ODBC Data Source Administration Tool:

   Once you find the ODBC administration tool, click on the tool to launch it and display the set up window.
2. Verify that the Snowflake ODBC driver is installed:

   Navigate to the Drivers tab in the set up window and verify that the driver (SnowflakeDSIIDriver) appears:

   If you do not see SnowflakeDSIIDriver, then the Snowflake ODBC driver installation did not complete successfully and you need to re-install it.
3. Create a new DSN:

   1. Navigate to the User DSN or System DSN tab and click the Add button:
   2. Select SnowflakeDSIIDriver from the list of installed drivers.
   3. Enter the connection parameters for the driver.

      In the fields provided in Snowflake Configuration dialog, enter the parameters for the DSN:

      When entering parameters, note the following:

      > * Data Source, User and Server are the only parameters required to create a DSN.
      >
      >   For more information on these parameters, see [Required connection parameters](odbc-parameters.md).
      > * All other parameters in the dialog are optional. In particular, the
      >   proxy-related parameters should be specified only if you are using a proxy, and the
      >   Authenticator should be changed from the default (“snowflake”) only if needed.
      >   For more details about ODBC Data Source parameters, see
      >   [ODBC configuration and connection parameters](odbc-parameters.md)
      >   and, in particular, [Optional connection parameters](odbc-parameters.md).
      > * The Password field accepts a value, but does not store the value. This is a security precaution to ensure passwords are never stored directly in the driver.

      > **Note:**
      > * The ODBC driver supports additional parameters that are not displayed in the dialog. These parameters can only be set in the Windows registry using regedit.
      >
      >   For descriptions of all the parameters, see [ODBC configuration and connection parameters](odbc-parameters.md).
      > * Specifying a value in the **Authenticator** field is only required if you are using federated authentication. For more information, see the `authenticator` parameter description in [ODBC configuration and connection parameters](odbc-parameters.md).
   4. Click OK to create the DSN.

You can now reference this DSN in ODBC-based client applications for connecting to Snowflake.

---
title: Installing the Node.js Driver
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-install.md
section: Developer Guide
---

# Installing the Node.js Driver

This topic describes how to install the Node.js driver using `npm`, the default package manager for the Node.js JavaScript runtime
environment.

## Prerequisites

* Node.js must already be installed in the environment where you wish to install the driver.
* You need to be able to run the `node` and `npm` commands.
* Depending on your environment, you may need `sudo` privileges.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

## Installing the driver

The Node.js driver (`snowflake-sdk`) is available directly from `npm`.

To install the driver, open a terminal window and type the following command:

> ```bash
> npm install snowflake-sdk
> ```

The command downloads and installs the Snowflake Node.js driver. The driver should now appear in your `node_modules` directory and
you should be able to use the driver using `require('snowflake-sdk')`.

---
title: Installing the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-install.md
section: Developer Guide
---

# Installing the Python Connector

To install the latest Python Connector for Snowflake, use:

> ```bash
> pip install snowflake-connector-python
> ```

If you won’t use Snowflake on AWS, you can exclude the `boto3` and `botocore` dependencies for AWS. These libraries take up both disk space and memory in the Python Connector, even when you don’t need them. To disable these libraries, set the `SNOWFLAKE_NO_BOTO` environment variable to `true` during installation:

> ```bash
> SNOWFLAKE_NO_BOTO=true pip install snowflake-connector-python
> ```

The source code for the Python driver is available on [GitHub](https://github.com/snowflakedb/snowflake-connector-python).

## Prerequisites

Requires Python version 3.9 (deprecated) or later.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

---
title: Introduction to Declarative Sharing in the Native Application Framework
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/introduction.md
section: Developer Guide
---

# Introduction to Declarative Sharing in the Native Application Framework

Declarative Sharing in the Snowflake Native App Framework enables providers to share not just data,
but code objects — notebooks, stored procedures, and user-defined functions — alongside
tables and views as a single data product. Providers define all shared objects in a YAML
manifest file, and the framework handles privilege management, object resolution, and
versioning automatically, with no setup script required. This topic provides a high-level
overview of the steps required to get started as a provider.

## Become a Snowflake listing provider

Becoming a provider allows you to create and manage listings to share your app with consumers.

For more information, see [Become a provider](https://other-docs.snowflake.com/collaboration/provider-becoming).

## Create your data content

Providers create data and objects to share with consumers. This can include the following:

> * [Tables](../../user-guide/tables-micro-partitions.md), including:
>
>   > + [Dynamic tables](../../user-guide/dynamic-tables-about.md)
>   > + [Apache Iceberg tables](../../user-guide/tables-iceberg.md)
> * [Views](../../user-guide/views-introduction.md), including:
>
>   > + [Semantic views](../../user-guide/views-semantic/overview.md)
> * [Notebooks](../../user-guide/ui-snowsight/notebooks-working.md)
> * [Stored procedures](../stored-procedure/stored-procedures-overview.md)
> * [User-defined functions](../udf/udf-overview.md)

### Access control requirements

The provider account should have the necessary privileges to create and manage Snowflake objects such as databases, schemas, tables, and virtual warehouses.

This includes:

* Databases and schemas: Requires USAGE privileges
* Tables and views: Requires SELECT privileges

---
title: Introduction to Java UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-introduction.md
section: Developer Guide
---

# Introduction to Java UDFs

You can write the handler for a user-defined function (UDF) in Java. Topics in this section describe how to design and write a Java handler.
You’ll also find examples.

For an introduction to UDFs, including a list of languages in which you can write a UDF handler, refer to [User-defined functions overview](../udf-overview.md).

Once you have a handler, you create the UDF with SQL. For information on using SQL to create or call a UDF, refer to
[Creating a user-defined function](../udf-creating-sql.md) or [Executing a UDF](../udf-calling-sql.md).

Snowflake currently supports writing UDFs in the following versions of Java:

* 11.x
* 17.x

> **Note:**
>
> For limitations related to Java UDF handlers, refer to [Java UDF limitations](udf-java-limitations.md).

## How a Java handler works

When a user calls a UDF, the user passes UDF’s name and arguments to Snowflake. Snowflake calls the associated handler code
(with arguments, if any) to execute the UDF’s logic. The handler method then returns the output to Snowflake, which passes it back to the
client.

For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.

Java UDFs can contain both new code and calls to existing libraries, allowing you both flexibility and code reuse.
For example, if you already have data analysis code in Java, then you can probably incorporate that into a Java UDF.

Below is a simplified illustration of the data flow:

### Example

Code in the following example creates a UDF called `echo_varchar` with a handler method `TestFunc.echoVarchar`. The Java
argument and return types are converted to and from SQL by Snowflake according to mappings described in
[SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVA
  CALLED ON NULL INPUT
  HANDLER = 'TestFunc.echoVarchar'
  TARGET_PATH = '@~/testfunc.jar'
  AS
  'class TestFunc {
    public static String echoVarchar(String x) {
      return x;
    }
  }';
```

## Design considerations

Keep in mind the following for designing a useful handler.

* **General considerations.** For considerations common to UDFs and procedures, refer to
  [Design Guidelines and Constraints for Functions and Procedures](../../udf-stored-procedure-guidelines.md).
* **SQL-Java type mapping.** When exchanging argument and return values with a UDF, Snowflake converts between the handler language and SQL.
  For more information on choosing data types for your handler code, refer to [Choosing your data types](udf-java-designing.md).
* **Code packaging.** You can make your handler code available either in-line with the CREATE FUNCTION statement or on a stage as compiled
  code in a JAR. For more information on the difference, refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* **Code optimization.** For information about optimizing your handler code, such as when the code handles state shared across rows, refer to
  [Optimizing initialization and controlling global state in scalar UDFs](udf-java-designing.md).
* **Best practices.** For information about best practices, refer to [Following best practices](udf-java-designing.md) and
  [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).

## Handler coding

From basics to detailed examples, the following topics describe how to write a UDF handler in Java.

* **Java class definition.** You write the logic for a UDF in a Java class. For more about how Snowflake interacts with your code, refer to
  [Designing the class](udf-java-designing.md).
* **Error handling.** For information about how Snowflake surfaces errors generated by handlers, refer to
  [Handling errors](udf-java-designing.md).
* **Tabular return values.** You can return tabular values as well as scalar (single) values from a UDF. For information on how to write
  a handler that returns tabular values, refer to [Tabular Java UDFs (UDTFs)](udf-java-tabular-functions.md).
* **Logging and event tracing.** For information on capturing log and trace data as your handler code executes, refer to
  [Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).
* **Dependencies.** You can make dependences available to your code at run time by uploading them to a stage. For more informaiton, refer
  to [Making dependencies available to your code](../../upload-dependencies.md).
* **Handler files organization.** If you intend to package compiled handler code into a JAR file, organize and build your code using the
  suggestions in [Organizing your files](udf-java-creating.md).
* **Code examples** For a range of handler examples in Java, refer to [Java UDF handler examples](udf-java-cookbook.md).

---
title: Introduction to JavaScript UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-introduction.md
section: Developer Guide
---

# Introduction to JavaScript UDFs

You can write the handler for a user-defined function (UDF) in JavaScript. Topics in this section describe how to design and write a
JavaScript handler.

For an introduction to UDFs, including a list of languages in which you can write a UDF handler, refer to [User-defined functions overview](../udf-overview.md).

Once you have a handler, you create the UDF with SQL. For information on using SQL to create or call a UDF, refer to
[Creating a user-defined function](../udf-creating-sql.md) or [Executing a UDF](../udf-calling-sql.md).

You can capture log and trace data as your handler code executes. For more information, refer to
[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

> **Note:**
>
> For limitations related to JavaScript UDF handlers, refer to [JavaScript UDF limitations](udf-javascript-limitations.md).

## How a JavaScript handler works

When a user calls a UDF, the user passes UDF’s name and arguments to Snowflake. Snowflake calls the associated handler code
(with arguments, if any) to execute the UDF’s logic. The handler function then returns the output to Snowflake, which passes it back to the
client.

For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.

### Example

Code in the following example creates a UDF called `my_array_reverse` with a handler code that accepts an input ARRAY and
returns an ARRAY containing the elements in reverse order. The JavaScript argument and return types are converted to and from SQL
by Snowflake, according to mappings described in [SQL-JavaScript Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

Note that the JavaScript code must refer to the input parameter names as all uppercase, even if the names are not uppercase in the
SQL code.

```javascript
-- Create the UDF.
CREATE OR REPLACE FUNCTION my_array_reverse(a ARRAY)
  RETURNS ARRAY
  LANGUAGE JAVASCRIPT
AS
$$
  return A.reverse();
$$
;
```

## JavaScript data types

SQL and JavaScript UDFs provide similar, but different, data types, based on their native data type support. Objects within Snowflake
and JavaScript are transferred using the following mappings.

### Integers and doubles

JavaScript has no integer type; all numbers are represented as doubles. JavaScript UDFs do not accept or return integer values except
through type conversion (i.e. you can pass an integer to a JavaScript UDF that accepts a double).

Both Snowflake SQL and JavaScript support double values. These values are transferred as-is.

### Strings

Both Snowflake SQL and JavaScript support string values. These values are transferred as-is.

### Binary values

All binary values are converted into JavaScript `Uint8Array` objects. These typed arrays can be accessed in the same way as
regular JavaScript arrays, but they are more efficient and support additional methods.

If a JavaScript UDF returns a `Uint8Array` object, it is converted into a Snowflake SQL binary value.

### Dates

All timestamp and date types are converted into JavaScript `Date()` objects. The JavaScript date type is equivalent to
TIMESTAMP_LTZ(3) in Snowflake SQL.

Consider the following notes for JavaScript UDFs that accept a date or time:

* All precision beyond milliseconds is lost.
* A JavaScript `Date` generated from SQL TIMESTAMP_NTZ no longer acts as “wallclock” time; it is influenced by daylight saving time.
  This is similar to behavior when converting TIMESTAMP_NTZ to TIMESTAMP_LTZ.
* A JavaScript `Date` generated from SQL TIMESTAMP_TZ loses time zone information, but represents the same moment in time as the
  input (similar to when converting TIMESTAMP_TZ to TIMESTAMP_LTZ).
* SQL DATE is converted to JavaScript `Date` representing midnight of the current day in the local time zone.

Additionally, consider the following notes for JavaScript UDFs that return DATE and TIMESTAMP types:

* JavaScript `Date` objects are converted to the UDF’s result data type, adhering to the same conversion semantics as casts from
  TIMESTAMP_LTZ(3) to the return data type.
* JavaScript `Date` objects nested inside VARIANT objects are always of type TIMESTAMP_LTZ(3).

### Variant, objects, and arrays

JavaScript UDFs allow easy, intuitive manipulation of variant and JSON data. Variant objects passed to a UDF are transformed to native
JavaScript types and values. Any of the previously-listed values are translated into their corresponding JavaScript types. Variant objects
and arrays are converted to JavaScript objects and arrays. Similarly, all values returned by the UDF are transformed into the appropriate
variant values. Note that objects and arrays returned by the UDF are subject to size and depth limitations.

```javascript
-- flatten all arrays and values of objects into a single array
-- order of objects may be lost
CREATE OR REPLACE FUNCTION flatten_complete(v variant)
  RETURNS variant
  LANGUAGE JAVASCRIPT
  AS '
  // Define a function flatten(), which always returns an array.
  function flatten(input) {
    var returnArray = [];
    if (Array.isArray(input)) {
      var arrayLength = input.length;
      for (var i = 0; i < arrayLength; i++) {
        returnArray.push.apply(returnArray, flatten(input[i]));
      }
    } else if (typeof input === "object") {
      for (var key in input) {
        if (input.hasOwnProperty(key)) {
          returnArray.push.apply(returnArray, flatten(input[key]));
        }
      }
    } else {
      returnArray.push(input);
    }
    return returnArray;
  }

  // Now call the function flatten() that we defined earlier.
  return flatten(V);
  ';

select value from table(flatten(flatten_complete(parse_json(
'[
  {"key1" : [1, 2], "key2" : ["string1", "string2"]},
  {"key3" : [{"inner key 1" : 10, "inner key 2" : 11}, 12]}
  ]'))));

-----------+
   VALUE   |
-----------+
 1         |
 2         |
 "string1" |
 "string2" |
 10        |
 11        |
 12        |
-----------+
```

## JavaScript arguments and returned values

Arguments may be referenced directly by name within JavaScript. Note that an unquoted identifier must be referenced with the
capitalized variable name. As arguments and the UDF are referenced from within JavaScript, they must be legal JavaScript identifiers.
Specifically, UDF and argument names must begin with a letter or `$`, while subsequent characters can be alphanumeric, `$`,
or `_`. Additionally, names can not be JavaScript-reserved words.

The following three examples illustrate UDFs that use arguments referenced by name:

```javascript
-- Valid UDF.  'N' must be capitalized.
CREATE OR REPLACE FUNCTION add5(n double)
  RETURNS double
  LANGUAGE JAVASCRIPT
  AS 'return N + 5;';

select add5(0.0);

-- Valid UDF. Lowercase argument is double-quoted.
CREATE OR REPLACE FUNCTION add5_quoted("n" double)
  RETURNS double
  LANGUAGE JAVASCRIPT
  AS 'return n + 5;';

select add5_quoted(0.0);

-- Invalid UDF. Error returned at runtime because JavaScript identifier 'n' cannot be resolved.
CREATE OR REPLACE FUNCTION add5_lowercase(n double)
  RETURNS double
  LANGUAGE JAVASCRIPT
  AS 'return n + 5;';

select add5_lowercase(0.0);
```

### NULL and undefined values

When using JavaScript UDFs, pay close attention to rows and variables that might contain NULL values. Specifically, Snowflake
contains two distinct NULL values (SQL `NULL` and variant’s JSON `null`), while JavaScript contains the `undefined`
value in addition to `null`.

SQL `NULL` arguments to a JavaScript UDF will translate to the JavaScript `undefined` value. Likewise, returned
JavaScript `undefined` values translate back to SQL `NULL`. This is true for all data types, including variant.
For non-variant types, a returned JavaScript `null` will also result in a SQL `NULL` value.

Arguments and returned values of the variant type distinguish between JavaScript’s `undefined` and `null` values.
SQL `NULL` continues to translate to JavaScript `undefined` (and JavaScript `undefined` back to SQL `NULL`);
variant JSON `null` translates to JavaScript `null` (and JavaScript `null` back to variant JSON `null`).
An `undefined` value embedded in a JavaScript object (as the value) or array will cause the element to be omitted.

> Create a table with one string and one `NULL` value:
>
> ```sqlexample
> create or replace table strings (s string);
> insert into strings values (null), ('non-null string');
> ```
>
> Create a function that converts a string to a `NULL` and a `NULL` to a string:
>
> ```sqlexample
> CREATE OR REPLACE FUNCTION string_reverse_nulls(s string)
>     RETURNS string
>     LANGUAGE JAVASCRIPT
>     AS '
>     if (S === undefined) {
>         return "string was null";
>     } else
>     {
>         return undefined;
>     }
>     ';
> ```
>
> Call the function:
>
> ```sqlexample
> select string_reverse_nulls(s)
>     from strings
>     order by 1;
> +-------------------------+
> | STRING_REVERSE_NULLS(S) |
> |-------------------------|
> | string was null         |
> | NULL                    |
> +-------------------------+
> ```
>
> Create a function that shows the difference between passing a SQL `NULL` and passing a variant JSON `null`:
>
> ```sqlexample
> CREATE OR REPLACE FUNCTION variant_nulls(V VARIANT)
>       RETURNS VARCHAR
>       LANGUAGE JAVASCRIPT
>       AS '
>       if (V === undefined) {
>         return "input was SQL null";
>       } else if (V === null) {
>         return "input was variant null";
>       } else {
>         return V;
>       }
>       ';
> ```
>
> ```sqlexample
> select null,
>        variant_nulls(cast(null as variant)),
>        variant_nulls(PARSE_JSON('null'))
>        ;
> +------+--------------------------------------+-----------------------------------+
> | NULL | VARIANT_NULLS(CAST(NULL AS VARIANT)) | VARIANT_NULLS(PARSE_JSON('NULL')) |
> |------+--------------------------------------+-----------------------------------|
> | NULL | input was SQL null                   | input was variant null            |
> +------+--------------------------------------+-----------------------------------+
> ```
>
> Create a function that shows the difference between returning `undefined`, `null`, and a variant that contains
> `undefined` and `null` (note that the `undefined` value is removed from the returned variant):
>
> ```sqlexample
> CREATE OR REPLACE FUNCTION variant_nulls(V VARIANT)
>       RETURNS variant
>       LANGUAGE JAVASCRIPT
>       AS $$
>       if (V == 'return undefined') {
>         return undefined;
>       } else if (V == 'return null') {
>         return null;
>       } else if (V == 3) {
>         return {
>             key1 : undefined,
>             key2 : null
>             };
>       } else {
>         return V;
>       }
>       $$;
> ```
>
> ```sqlexample
> select variant_nulls('return undefined'::VARIANT) AS "RETURNED UNDEFINED",
>        variant_nulls('return null'::VARIANT) AS "RETURNED NULL",
>        variant_nulls(3) AS "RETURNED VARIANT WITH UNDEFINED AND NULL; NOTE THAT UNDEFINED WAS REMOVED";
> +--------------------+---------------+---------------------------------------------------------------------------+
> | RETURNED UNDEFINED | RETURNED NULL | RETURNED VARIANT WITH UNDEFINED AND NULL; NOTE THAT UNDEFINED WAS REMOVED |
> |--------------------+---------------+---------------------------------------------------------------------------|
> | NULL               | null          | {                                                                         |
> |                    |               |   "key2": null                                                            |
> |                    |               | }                                                                         |
> +--------------------+---------------+---------------------------------------------------------------------------+
> ```

### Type conversion within JavaScript

JavaScript will implicitly convert values between many different types. When any value is returned, the value is first converted to
the requested return type before being translated to a SQL value. For example, if a number is returned, but the UDF is declared as
returning a string, this number will converted to a string within JavaScript. Keep in mind that JavaScript programming errors, such as
returning the wrong type, may be hidden by this behavior. In addition, if an error is thrown while converting the value’s type, an
error will result.

### JavaScript Number Range

The range for numbers with precision intact is from

> -(2^53 -1)

to

> (2^53 -1)

The range of valid values in Snowflake NUMBER(p, s) and DOUBLE data types is larger. Retrieving a value from Snowflake
and storing it in a JavaScript numeric variable can result in loss of precision. For example:

> ```javascript
> CREATE OR REPLACE FUNCTION num_test(a double)
>   RETURNS string
>   LANGUAGE JAVASCRIPT
> AS
> $$
>   return A;
> $$
> ;
> ```
>
> ```javascript
> select hash(1) AS a,
>        num_test(hash(1)) AS b,
>        a - b;
> +----------------------+----------------------+------------+
> |                    A | B                    |      A - B |
> |----------------------+----------------------+------------|
> | -4730168494964875235 | -4730168494964875000 | -235.00000 |
> +----------------------+----------------------+------------+
> ```

The first two columns should match, and the third should contain 0.0.

The problem applies to JavaScript user-defined functions (UDFs) and stored procedures.

If you experience the problem in stored procedures when using `getColumnValue()`, you might be able to avoid the
problem by retrieving a value as a string, e.g. with:

```javascript
getColumnValueAsString()
```

You can then return the string from the stored procedure, and cast the string to a numeric data type in SQL.

## JavaScript errors

Any errors encountered while executing JavaScript appear to the user as SQL errors. This includes parsing errors, runtime errors,
and uncaught error thrown within the UDF. If the error contains a stacktrace, it will be printed along with the error message. It is
acceptable to throw an error without catching it in order to end the query and produce a SQL error.

When debugging, you may find it useful to print argument values along with the error message so that they appear in the SQL error
message text. For deterministic UDFs, this provides the necessary data to reproduce errors in a local JavaScript engine. One common
pattern is to place an entire JavaScript UDF body in a try-catch block, append argument values to the caught error’s message, and
throw an error with the extended message. You should consider removing such mechanisms prior to deploying UDFs to a production
environment; recording values in error messages may unintentionally reveal sensitive data.

The function can throw and catch pre-defined exceptions or custom exceptions. A simple example of throwing a
custom exception is [here](udf-javascript-scalar-functions.md).

See also [Troubleshooting JavaScript UDFs](udf-javascript-troubleshooting.md).

## JavaScript UDF security

JavaScript UDFs are designed to be safe and secure by providing several layers of query and data isolation:

* Compute resources within the virtual warehouse that executes a JavaScript UDF are accessible only from within your account
  (i.e. warehouses do not share resources with other Snowflake accounts).
* Table data is encrypted within the virtual warehouse to prevent unauthorized access.
* JavaScript code is executed within a restricted engine, preventing system calls from the JavaScript context (e.g. no network and
  disk access) and constraining the system resources available to the engine, specifically memory.

As a result, JavaScript UDFs can access only the data needed to perform the defined function and can not affect the state of the
underlying system, other than consuming a reasonable amount of memory and processor time.

---
title: Introduction to Python UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-introduction.md
section: Developer Guide
---

# Introduction to Python UDFs

You can write the handler for a user-defined function (UDF) in Python. Topics in this section describe how to design and write a Python
handler. You’ll also find examples.

For an introduction to UDFs, including a list of languages in which you can write a UDF handler, refer to [User-defined functions overview](../udf-overview.md).

Once you have a handler, you create the UDF with SQL. For information on using SQL to create or call a UDF, refer to
[Creating a user-defined function](../udf-creating-sql.md) or [Executing a UDF](../udf-calling-sql.md).

Snowflake currently supports writing UDFs in the following versions of Python:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

> **Note:**
>
> For limitations related to Python UDF handlers, refer to [Python UDF limitations](udf-python-limitations.md).

## How a Python handler works

When a user calls a UDF, the user passes UDF’s name and arguments to Snowflake. Snowflake calls the associated handler code
(with arguments, if any) to execute the UDF’s logic. The handler method then returns the output to Snowflake, which passes it back to the
client.

For each row passed to a UDF, the UDF returns either a scalar (i.e. single) value or, if defined as a table function, a set of rows.

Python UDFs can contain both new code and calls to existing packages, allowing you both flexibility and code reuse.
For example, if you already have data analysis code in Python, then you can probably incorporate that into a Python UDF handler.

### Example

Code in the following example creates a UDF called `addone` with a handler method `addone_py`. The Python argument and return
types are converted to and from SQL by Snowflake according to mappings described in
[SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

```sqlexample-python
CREATE OR REPLACE FUNCTION addone(i INT)
  RETURNS INT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  HANDLER = 'addone_py'
AS $$
def addone_py(i):
  return i+1
$$;
```

## Design considerations

Keep in mind the following for designing a useful handler.

* **General considerations.** For considerations common to UDFs and procedures, refer to
  [Design Guidelines and Constraints for Functions and Procedures](../../udf-stored-procedure-guidelines.md).
* **SQL-Python type mapping.** When exchanging argument and return values with a UDF, Snowflake converts between the handler language and SQL.
  For more information on choosing data types for your handler code, refer to [Choosing your data types](udf-python-designing.md).
* **Code packaging.** You can make your handler code available either in-line with the CREATE FUNCTION statement or on a stage. For more
  information on the difference, refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* **Code optimization.** For information about optimizing your handler code, such as when the code handles state shared across rows, refer to
  [Optimizing initialization and controlling global state in scalar UDFs](udf-python-designing.md) and [Optimizing for scale and performance](udf-python-designing.md).
* **Best practices.** For information about best practices, refer to [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).

## Handler coding

From basics to detailed examples, the following topics describe how to write a UDF handler in Python.

* **Python module definition.** You write the logic for a UDF in a Python module. For more about how Snowflake interacts with your code,
  refer to [Designing the module](udf-python-designing.md).
* **Error handling.** For information about how Snowflake surfaces errors generated by handlers, refer to
  [Handling errors](udf-python-designing.md).
* **Tabular return values.** You can return tabular values as well as scalar (single) values from a UDF. For information on how to write
  a handler that returns tabular values, refer to [Writing a UDTF in Python](udf-python-tabular-functions.md).
* **Logging and event tracing.** For information on capturing log and trace data as your handler code executes, refer to
  [Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).
* **Dependencies.** You can make dependences available to your code at run time by uploading them to a stage. For more informaiton, refer
  to [Making dependencies available to your code](../../upload-dependencies.md).
* **Code examples** For a range of handler examples in Python, refer to [Python UDF handler examples](udf-python-examples.md).

---
title: Introduction to Scala UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-introduction.md
section: Developer Guide
---

# Introduction to Scala UDFs

You can write the handler for a user-defined function (UDF) in Scala. A handler executes as the function’s logic when it’s called in SQL.

Once you have a handler, you create the UDF with SQL. For information on using SQL to create or call a UDF, refer to
[Creating a user-defined function](../udf-creating-sql.md) or [Executing a UDF](../udf-calling-sql.md).

For an introduction to UDFs, including a list of languages in which you can write a UDF handler, refer to [User-defined functions overview](../udf-overview.md).

> **Note:**
>
> For limitations related to Scala handlers, refer to [Scala UDF limitations](udf-scala-limitations.md).

You can also use Scala to write a UDF when using the Snowpark API. For more information, refer to
[Creating User-Defined Functions (UDFs) for DataFrames in Scala](../../snowpark/scala/creating-udfs.md).

## Prerequisites

Snowflake currently supports writing UDFs in the following versions of Scala:

[Preview Feature](../../../release-notes/preview-features.md) — Open

Support for version 2.13 is in preview. Available to all accounts.

* 2.13
* 2.12

For more information, see [Writing code to support different Scala versions](../../scala-version-differences.md).

## How a handler works

When a user calls a UDF, the user passes UDF’s name and arguments to Snowflake. Snowflake calls the handler method associated with the UDF
to execute the UDF’s logic. The handler method then returns the output to Snowflake, which passes it back to the client.

For a scalar function (one that returns a single value), the UDF returns a single value for each row passed to the UDF.

To support your handler’s logic, your code can make calls to libraries that are external to the handler. For example, if you already have
data analysis code in Scala, then you can probably use it from your handler code.

For general information on writing a handler in Scala, refer to [General Scala UDF handler coding guidelines](udf-scala-general.md). For information on
writing a scalar function, refer to [Writing a scalar UDF in Scala](udf-scala-scalar.md).

### Example

Code in the following example creates a UDF called `echo_varchar` with a handler method `TestFunc.echoVarchar`. The Scala
argument and return types are converted to and from SQL by Snowflake according to mappings described in
[SQL-Scala Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='TestFunc.echoVarchar'
  AS
  $$
  class TestFunc {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='TestFunc.echoVarchar'
  AS
  $$
  class TestFunc {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT echo_varchar('Hello');
```

## Design considerations

Keep in mind the following for designing a useful handler.

* **General considerations.** For considerations common to UDFs and procedures, refer to
  [Design Guidelines and Constraints for Functions and Procedures](../../udf-stored-procedure-guidelines.md).
* **Staying within Snowflake-imposed constraints.** For information on designing handler code that runs well on Snowflake,
  refer to [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).
* **SQL-Scala type mapping.** When exchanging argument and return values with a UDF, Snowflake converts between the handler language and SQL.
  For more information on choosing data types for your handler code, refer to [SQL-Scala Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* **Code packaging.** You can make your handler code available either in-line with the CREATE FUNCTION statement or on a stage as compiled
  code in a JAR. For more information on the difference, refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).

  For information on using sbt to package the compiled code for your Scala handler, refer to
  [Packaging Scala Handler Code with sbt](../../udf-stored-procedure-build-sbt.md).
* **Code optimization.** For information about optimizing your handler code, such as when the code handles state shared across rows,
  refer to [Controlling global state in scalar Scala UDFs](udf-scala-optimizing.md).
* **Best practices.** For information about best practices, refer to [Best practices](udf-scala-general.md) and
  [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).

## Handler coding

From basics to detailed examples, the following topics describe how to write a UDF handler in Scala.

* **General guidelines.** For general information about handler coding, including handling errors, choosing data types, and more, refer to
  [General Scala UDF handler coding guidelines](udf-scala-general.md).
* **Writing a scalar function** For more information, refer to [Writing a scalar UDF in Scala](udf-scala-scalar.md).
* **Logging and event tracing.** For information on capturing log and trace data as your handler code executes, refer to
  [Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).
* **Code examples** For a range of handler examples, refer to [Scala UDF handler examples](udf-scala-examples.md).
* **Dependencies.** You can make dependencies available to your code at run time by uploading them to a stage. For more information, refer
  to [Making dependencies available to your code](../../upload-dependencies.md).
* **Handler files organization.** If you intend to package compiled handler code into a JAR file to stage, organize and build your code
  using the suggestions in [Scala UDF handler project and packaging](udf-scala-packaging.md).

---
title: Introduction to SQL UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-introduction.md
section: Developer Guide
---

# Introduction to SQL UDFs

You can write the handler for a user-defined function (UDF) in SQL. Topics in this section describe how to design and write a SQL
handler. You’ll also find examples.

For an introduction to UDFs, including a list of languages in which you can write a UDF handler, see [User-defined functions overview](../udf-overview.md).

After you have a handler, you create the UDF with SQL. For information about using SQL to create or call a UDF, see
[Creating a user-defined function](../udf-creating-sql.md) or [Executing a UDF](../udf-calling-sql.md).

> **Note:**
>
> For limitations related to SQL UDF handlers, see [SQL UDF limitations](udf-sql-limitations.md).

## How a SQL handler works

When a user calls a UDF, the user passes UDF’s name and arguments to Snowflake. Snowflake calls the associated handler code
(with arguments, if any) to execute the UDF’s logic. The handler method then returns the output to Snowflake, which passes it back to the
client.

The function definition can be a SQL expression that returns either a scalar — that is, single — value or, if defined as a
table function, a set of rows.

### Example

Code in the following example creates a UDF called `area_of_circle` containing handler code that calculates a circle’s area from
the radius value received by the UDF as an argument.

```sqlexample
CREATE FUNCTION area_of_circle(radius FLOAT)
  RETURNS FLOAT
  AS
  $$
    pi() * radius * radius
  $$
  ;
```

## General usage

A SQL UDF evaluates an arbitrary SQL expression and returns the results of the expression.

The function definition can be a SQL expression that returns either a scalar — that is, single — value or,
if defined as a table function, a set of rows.

## Security/privilege requirements for SQL UDFs

If a function definition refers to an unqualified table, then that table is resolved in the schema containing the function. A reference
to another schema object — such as a table, view, or other function — requires that the owner of the function has privileges to access that
schema object. The invoker of the function need not have access to the objects referenced in the function definition, but only needs the
privilege to use the function.

For example, an administrator owns a table named `users`, which contains sensitive data that is not generally accessible, but the
administrator can expose the total user count through a function which other users have access privileges on:

```sqlexample
USE ROLE dataadmin;

DESC TABLE users;
```

```output
+-----------+--------------+--------+-------+---------+-------------+------------+--------+------------+---------+
| name      | type         | kind   | null? | default | primary key | unique key | check  | expression | comment |
|-----------+--------------+--------+-------+---------+-------------+------------+--------+------------+---------|
| USER_ID   | NUMBER(38,0) | COLUMN | Y     | [NULL]  | N           | N          | [NULL] | [NULL]     | [NULL]  |
| USER_NAME | VARCHAR(100) | COLUMN | Y     | [NULL]  | N           | N          | [NULL] | [NULL]     | [NULL]  |
  ...
  ...
  ...
+-----------+--------------+--------+-------+---------+-------------+------------+--------+------------+---------+
```

```sqlexample
CREATE FUNCTION total_user_count() RETURNS NUMBER AS 'select count(*) from users';

GRANT USAGE ON FUNCTION total_user_count() TO ROLE analyst;

USE ROLE analyst;

-- This will fail because the role named "analyst" does not have the
-- privileges required in order to access the table named "users".
SELECT * FROM users;
```

```output
FAILURE: SQL compilation error:
Object 'USERS' does not exist.
```

```sqlexample
-- However, this will succeed.
SELECT total_user_count();
```

```output
+--------------------+
| TOTAL_USER_COUNT() |
|--------------------+
| 123                |
+--------------------+
```

For more information about using roles and privileges to manage access control, see [Overview of Access Control](../../../user-guide/security-access-control-overview.md).

---
title: Introduction to the SQL API
source: https://docs.snowflake.com/en/developer-guide/sql-api/intro.md
section: Developer Guide
---

# Introduction to the SQL API

The Snowflake SQL API is a REST API that you can use to access and update data in a Snowflake database. You can use
this API to develop custom applications and integrations that:

* Perform queries.
* Manage your deployment (e.g. provision users and roles, create tables, etc.).

## Capabilities of the SQL API

The Snowflake SQL API provides operations that you can use to:

* Submit SQL statements for execution.
* Check the status of the execution of a statement.
* Cancel the execution of a statement.
* Fetch query results concurrently.

You can use this API to execute [standard queries](../../sql-reference/constructs.md) and most
[DDL](../../sql-reference/sql-ddl-summary.md) and [DML](../../sql-reference/sql-dml.md) statements.
See Limitations of the SQL API for the types of statements that are not supported.

For queries, the SQL API returns data in partitions. Snowflake determines the number of partitions returned and the size of each partition.

The endpoint for the SQL API (`/api/v2/statements`) is protected by the [network policies](../../user-guide/network-policies.md)
that restrict access to the account where the API is enabled.

> **Note:**
>
> The [AUTOCOMMIT](../../sql-reference/parameters.md) parameter must be set to `TRUE` per query or statement level, regardless of the
> value set at the user or account level.

## Limitations of the SQL API

The SQL API has the following limitations:

* The following commands are not supported:

  + The [PUT](../../sql-reference/sql/put.md) command (in Snowflake SQL)
  + The [GET](../../sql-reference/sql/get.md) command (in Snowflake SQL)

The following commands and statements are supported only within a
[request that specifies multiple statements](submitting-multiple-statements.md):

> * Commands that perform explicit transactions, including:
>
>   + [BEGIN](../../sql-reference/sql/begin.md)
>   + [COMMIT](../../sql-reference/sql/commit.md)
>   + [ROLLBACK](../../sql-reference/sql/rollback.md)
> * Commands that change the context of the session, including:
>
>   + [USE <object>](../../sql-reference/sql/use.md)
>   + [ALTER SESSION](../../sql-reference/sql/alter-session.md)
> * Statements that set session variables.
> * Statements that create temporary tables and stages (tables and stages that are available only in the current session).

The SQL API does not support certain types of stored procedures. You might encounter errors, for example, when trying to call Python and Java/Scala stored procedures
that return a `resultset` in Arrow format. Even if you don’t directly call these stored procedures from the SQL API, but call another stored procedure,
such as SQL, errors might result when the outer procedure internally calls an inner Python or Java/Scala stored procedure.

## Billing considerations when using the SQL API

The SQL API leverages the cloud services layer when fetching some query results. For more information about cloud services,
see [Cloud Services Credit Usage](../../user-guide/cost-understanding-compute.md).

---
title: Java handler examples for stored procedures
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-examples.md
section: Developer Guide
---

# Java handler examples for stored procedures

## Using Snowpark APIs for asynchronous processing

In the following example, the `getResultJDBC` procedure executes an asynchronous child job that waits 10 seconds.

```sqlexample-java
CREATE OR REPLACE PROCEDURE getResultJDBC()
RETURNS VARCHAR
LANGUAGE JAVA
RUNTIME_VERSION = 11
PACKAGES = ('com.snowflake:snowpark:latest')
HANDLER = 'TestJavaSP.asyncBasic'
AS
$$
import java.sql.*;
import net.snowflake.client.jdbc.*;

class TestJavaSP {
  public String asyncBasic(com.snowflake.snowpark.Session session) throws Exception {
    Connection connection = session.jdbcConnection();
    SnowflakeStatement stmt = (SnowflakeStatement)connection.createStatement();
    ResultSet resultSet = stmt.executeAsyncQuery("CALL SYSTEM$WAIT(10)");
    resultSet.next();
    return resultSet.getString(1);
  }
}
$$;
```

---
title: Java requirements for the JDBC Driver
source: https://docs.snowflake.com/en/developer-guide/jdbc/java-install.md
section: Developer Guide
---

# Java requirements for the JDBC Driver

The Snowflake JDBC driver requires Java LTS (Long-Term Support) versions 1.8 or higher. If the minimum required version of Java is not
installed on the client machines where the JDBC driver is installed, you must install either Oracle Java or OpenJDK.

> **Note:**
>
> If you use JDK 1.8 u91 or earlier, or if you use a custom trust store, please read the
> [DigiCert Global Root G2 certificate authority (CA) TLS certificate updates](https://community.snowflake.com/s/article/check-impact-from-digicert-g2-certificate-update)
> Knowledge Base article for information about updating the trust store with the required certificate.

## Oracle Java

Oracle Java currently supports Java 8.

For download and installation instructions, go to:

> <http://www.java.com/en/download/manual.jsp>

## OpenJDK

OpenJDK is an open-source implementation of Java that provides JDK 8 packages for various Linux environments. Packages for non-Linux environments or higher Java versions are only available through 3rd parties.

For more information, go to:

> <http://openjdk.java.net>

## Client-side data encryption requirements

The JDBC driver uses the AES specification to encrypt files uploaded to Snowflake stages (using [PUT](../../sql-reference/sql/put.md)) and decrypt downloaded files (via [GET](../../sql-reference/sql/get.md)).
The driver automatically encrypts staged files using 128-bit keys, but also supports encrypting files using 256-bit keys for a higher level of AES encryption.

To use 256-bit keys instead of the default 128-bit keys for encryption of staged files, your account administrator must set the [CLIENT_ENCRYPTION_KEY_SIZE](../../sql-reference/parameters.md) account parameter. For more information
about setting parameters for your account, see [Parameter management](../../user-guide/admin-account-management.md).

However, to encrypt stage files using 256-bit keys, the Java Runtime Environment (JRE) used by the JDBC driver requires the **Java Cryptography Extension (JCE) Unlimited Strength Jurisdiction Policy Files**
on each machine where the JDBC driver is installed:

* Oracle Java does not include the policy files; they must be downloaded and installed separately (see below).
* OpenJDK includes the policy files automatically; no additional tasks are necessary.

The next section provides instructions for installing the policy files for Oracle Java.

### Installing the JCE Unlimited Strength Jurisdiction Policy Files for Oracle Java

> **Attention:**
>
> Each time you install a new version of Oracle Java on your client machine, you may need to reinstall the policy files as described below.

To install the policy files for Oracle Java:

1. Download the policy files for your version of Oracle Java.

   * [JCE Unlimited Strength Jurisdiction Policy Files 8 Download](http://www.oracle.com/technetwork/java/javase/downloads/jce8-download-2133166.html)
   * [JCE Unlimited Strength Jurisdiction Policy Files 7 Download](http://www.oracle.com/technetwork/java/javase/downloads/jce-7-download-432124.html)

   The zip file contains a `README.txt` file and two `.jar` files.
2. Install the files. Depending on your environment, you can install the files in the following ways:

   > * If version 2.4.26 (or higher) of the Snowflake JDBC driver is installed, you can connect to Snowflake and attempt to execute a
   >   [PUT](../../sql-reference/sql/put.md) or [GET](../../sql-reference/sql/get.md) command.
   >
   >   If the policy files are not installed or installed incorrectly (i.e. the JRE cannot locate the files), the system returns an error, which includes the directory where the JRE expected to find the
   >   policy files. You can then copy the files to the directory specified in the error.
   >
   >   To get the latest version of the driver, download it from the Maven Central Repository. For more information, see [Downloading / integrating the JDBC Driver](jdbc-download.md).
   > * If a single version of Java is installed on your client machine, put the two `.jar` files in the `jre/lib/security` sub-directory of your Java installation as described in the `README.txt` file
   >   included with the policy files.
   >
   >   For example, on macOS with Java 8 installed, the directory would be:
   >
   >   > `/Library/Java/JavaVirtualMachines/jdk1.8.0_45.jdk/Contents/Home/jre/lib/security`
   > * If multiple versions of Java are installed, the JDBC driver will automatically locate a Java installation to use; however, we recommend using JAVA_HOME to explicitly specify the version to use in your
   >   environment:
   >
   >   > + If JAVA_HOME is set, put the `.jar` files in the `jre/lib/security` directory for the Java installation referenced in JAVA_HOME.
   >   > + If JAVA_HOME is not set, we recommend putting the `.jar` files in the `lib/security` directory for each installed JRE.
3. After installing the files, you may need to log out of your client and log back in.

---
title: Java stored procedure limitations
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-limitations.md
section: Developer Guide
---

# Java stored procedure limitations

## Limitations

Stored procedures have the following limitations:

* Concurrency isn’t supported. For example, from within your code, you can’t submit queries from multiple threads. Code that
  concurrently issues multiple queries will produce an error.
* Consider the following limitations when you use some Snowpark APIs in your stored procedure:

  + When you use [APIs that execute PUT and GET commands](../../snowpark/java/working-with-dataframes.md) (including
    `Session.sql("PUT ...")` and `Session.sql("GET ...")`), you may write only to the /tmp directory in the memory-backed
    file system provided for the query calling the procedure.
  + Do not use [APIs that create new sessions](../../snowpark/java/creating-session.md) (for example,
    `Session.builder().configs(...).create()`).
  + Using `session.jdbcConnection` (and the connection returned from it) is not supported because it may result in unsafe behavior.
* Creating named temp objects is not supported in an owner’s rights stored procedure. An owner’s rights stored procedure is a stored
  procedure that runs with the privileges of the stored procedure owner.
  For more information, refer to [caller’s rights or owner’s rights](../stored-procedures-rights.md).

---
title: Java UDF handler examples
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-cookbook.md
section: Developer Guide
---

# Java UDF handler examples

This topic includes simple examples of UDF handler code written in Java.

For more on using Java to create a UDF handler, see [Creating a Java UDF handler](udf-java-creating.md).

## Creating and calling a simple in-line Java UDF

The following statements create and call an in-line Java UDF. This code returns the VARCHAR passed to it.

This function is declared with the optional `CALLED ON NULL INPUT` clause to indicate that the function is
called even if the value of the input is NULL. (This function would return NULL with or without this clause, but
you could modify the code to handle NULL another way, for example, to return an empty string.)

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVA
  CALLED ON NULL INPUT
  HANDLER = 'TestFunc.echoVarchar'
  TARGET_PATH = '@~/testfunc.jar'
  AS
  'class TestFunc {
    public static String echoVarchar(String x) {
      return x;
    }
  }';
```

Call the UDF:

```sqlexample
SELECT echo_varchar('Hello');
+-----------------------+
| ECHO_VARCHAR('HELLO') |
|-----------------------|
| Hello                 |
+-----------------------+
```

### Passing a NULL to an in-line Java UDF

This uses the `echo_varchar()` UDF defined above. The SQL `NULL` value is implicitly converted to
Java `null`, and that Java `null` is returned and implicitly converted back to SQL `NULL`:

Call the UDF:

```sqlexample
SELECT echo_varchar(NULL);
+--------------------+
| ECHO_VARCHAR(NULL) |
|--------------------|
| NULL               |
+--------------------+
```

## Passing array values

Java methods can receive SQL arrays in either of two ways:

* Using Java’s array feature.
* Using Java’s *varargs* (variable number of arguments) feature.

In both cases, your SQL code must pass an [ARRAY](../../../sql-reference/data-types-semistructured.md).

> **Note:**
>
> Be sure to use Java types with valid mappings to SQL types. For more information, refer to [SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

### Passing via an ARRAY

Declare the Java parameter as an array. For example, the third parameter in the following method is a String array:

```java
static int myMethod(int fixedArgument1, int fixedArgument2, String[] stringArray)
```

Below is a complete example:

Create and load the table:

```sqlexample
CREATE TABLE string_array_table(id INTEGER, a ARRAY);
INSERT INTO string_array_table (id, a) SELECT
        1, ARRAY_CONSTRUCT('Hello');
INSERT INTO string_array_table (id, a) SELECT
        2, ARRAY_CONSTRUCT('Hello', 'Jay');
INSERT INTO string_array_table (id, a) SELECT
        3, ARRAY_CONSTRUCT('Hello', 'Jay', 'Smith');
```

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION concat_varchar_2(a ARRAY)
  RETURNS VARCHAR
  LANGUAGE JAVA
  HANDLER = 'TestFunc_2.concatVarchar2'
  TARGET_PATH = '@~/TestFunc_2.jar'
  AS
  $$
  class TestFunc_2 {
      public static String concatVarchar2(String[] strings) {
          return String.join(" ", strings);
      }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT concat_varchar_2(a)
  FROM string_array_table
  ORDER BY id;
+---------------------+
| CONCAT_VARCHAR_2(A) |
|---------------------|
| Hello               |
| Hello Jay           |
| Hello Jay Smith     |
+---------------------+
```

### Passing via varargs

Using varargs is very similar to using an array.

In your Java code, use Java’s varargs declaration style:

```java
static int myMethod(int fixedArgument1, int fixedArgument2, String ... stringArray)
```

Below is a complete example. The only significant difference between this example and the preceding example (for arrays) is the
declaration of the parameters to the method.

Create and load the table:

```sqlexample
CREATE TABLE string_array_table(id INTEGER, a ARRAY);
INSERT INTO string_array_table (id, a) SELECT
        1, ARRAY_CONSTRUCT('Hello');
INSERT INTO string_array_table (id, a) SELECT
        2, ARRAY_CONSTRUCT('Hello', 'Jay');
INSERT INTO string_array_table (id, a) SELECT
        3, ARRAY_CONSTRUCT('Hello', 'Jay', 'Smith');
```

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION concat_varchar(a ARRAY)
  RETURNS VARCHAR
  LANGUAGE JAVA
  HANDLER = 'TestFunc.concatVarchar'
  TARGET_PATH = '@~/TestFunc.jar'
  AS
  $$
  class TestFunc {
      public static String concatVarchar(String ... stringArray) {
          return String.join(" ", stringArray);
      }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT concat_varchar(a)
    FROM string_array_table
    ORDER BY id;
+-------------------+
| CONCAT_VARCHAR(A) |
|-------------------|
| Hello             |
| Hello Jay         |
| Hello Jay Smith   |
+-------------------+
```

## Returning NULL explicitly from an in-line UDF

The following code shows how to return a NULL value explicitly. The Java value `null` is converted to
SQL `NULL`.

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION return_a_null()
  RETURNS VARCHAR
  NULL
  LANGUAGE JAVA
  HANDLER = 'TemporaryTestLibrary.returnNull'
  TARGET_PATH = '@~/TemporaryTestLibrary.jar'
  AS
  $$
  class TemporaryTestLibrary {
    public static String returnNull() {
      return null;
    }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT return_a_null();
+-----------------+
| RETURN_A_NULL() |
|-----------------|
| NULL            |
+-----------------+
```

## Passing an OBJECT to an in-line Java UDF

The following example uses the SQL [OBJECT](../../../sql-reference/data-types-semistructured.md) data type and the corresponding Java
data type (`Map<String, String>`), and extracts a value from the OBJECT. This example also shows that you
can pass multiple parameters to a Java UDF.

Create and load a table that contains a column of type OBJECT:

```sqlexample
CREATE TABLE objectives (o OBJECT);
INSERT INTO objectives SELECT PARSE_JSON('{"outer_key" : {"inner_key" : "inner_value"} }');
```

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION extract_from_object(x OBJECT, key VARCHAR)
  RETURNS VARIANT
  LANGUAGE JAVA
  HANDLER = 'VariantLibrary.extract'
  TARGET_PATH = '@~/VariantLibrary.jar'
  AS
  $$
  import java.util.Map;
  class VariantLibrary {
    public static String extract(Map<String, String> m, String key) {
      return m.get(key);
    }
  }
  $$;
```

Call the UDF:

```sqlexample
SELECT extract_from_object(o, 'outer_key'),
       extract_from_object(o, 'outer_key')['inner_key'] FROM objectives;
+-------------------------------------+--------------------------------------------------+
| EXTRACT_FROM_OBJECT(O, 'OUTER_KEY') | EXTRACT_FROM_OBJECT(O, 'OUTER_KEY')['INNER_KEY'] |
|-------------------------------------+--------------------------------------------------|
| {                                   | "inner_value"                                    |
|   "inner_key": "inner_value"        |                                                  |
| }                                   |                                                  |
+-------------------------------------+--------------------------------------------------+
```

## Passing a GEOGRAPHY value to an in-line Java UDF

The following example uses the SQL [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) data type.

Create the UDF:

```sqlexample-java
CREATE OR REPLACE FUNCTION geography_equals(x GEOGRAPHY, y GEOGRAPHY)
  RETURNS BOOLEAN
  LANGUAGE JAVA
  PACKAGES = ('com.snowflake:snowpark:1.2.0')
  HANDLER = 'TestGeography.compute'
  AS
  $$
  import com.snowflake.snowpark_java.types.Geography;

  class TestGeography {
    public static boolean compute(Geography geo1, Geography geo2) {
      return geo1.equals(geo2);
    }
  }
  $$;
```

You can use the PACKAGES clause to specify a Snowflake system package such as
the [Snowpark package](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html).
When you do, you don’t need to also include the Snowpark JAR file as a value of an IMPORTS clause. For more on PACKAGES, see
[CREATE FUNCTION optional parameters](../../../sql-reference/sql/create-function.md).

Create data and call the UDF with that data:

```sqlexample
CREATE TABLE geocache_table (id INTEGER, g1 GEOGRAPHY, g2 GEOGRAPHY);

INSERT INTO geocache_table (id, g1, g2)
  SELECT 1, TO_GEOGRAPHY('POINT(-122.35 37.55)'), TO_GEOGRAPHY('POINT(-122.35 37.55)');
INSERT INTO geocache_table (id, g1, g2)
  SELECT 2, TO_GEOGRAPHY('POINT(-122.35 37.55)'), TO_GEOGRAPHY('POINT(90.0 45.0)');

SELECT id, g1, g2, geography_equals(g1, g2) AS "EQUAL?"
  FROM geocache_table
  ORDER BY id;
```

The output looks similar to:

```output
+----+--------------------------------------------------------+---------------------------------------------------------+--------+
| ID | G1                                                     | G2                                                      | EQUAL? |
+----+--------------------------------------------------------|---------------------------------------------------------+--------+
| 1  | { "coordinates": [ -122.35, 37.55 ], "type": "Point" } | { "coordinates": [ -122.35,  37.55 ], "type": "Point" } | TRUE   |
| 2  | { "coordinates": [ -122.35, 37.55 ], "type": "Point" } | { "coordinates": [   90.0,   45.0  ], "type": "Point" } | FALSE  |
+----+--------------------------------------------------------+---------------------------------------------------------+--------+
```

## Passing a VARIANT value to an in-line Java UDF

When you pass a value of the SQL [VARIANT](../../../sql-reference/data-types-semistructured.md) type to a Java UDF, Snowflake can convert the value to the
[Variant](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/types/Variant.html) type
provided with the [Snowpark package](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html). Note that
`Variant` is supported from the Snowpark package version 1.4.0 and later.

The Snowpark `Variant` type provides methods for converting values between `Variant` and other types.

To use the Snowpark `Variant` type, use the PACKAGES clause to specify the Snowpark package when creating the UDF. When you do, you
don’t need to also include the Snowpark JAR file as a value of an IMPORTS clause. For more information on PACKAGES, see
[CREATE FUNCTION optional parameters](../../../sql-reference/sql/create-function.md).

Code in the following example receives JSON data stored as the VARIANT type, then uses the `Variant` type in the Snowpark library
to retrieve the `price` value from the JSON. The received JSON has a structure similar to the JSON displayed in
[Sample Data Used in Examples](../../../user-guide/querying-semistructured.md).

```sqlexample-java
CREATE OR REPLACE FUNCTION retrieve_price(v VARIANT)
  RETURNS INTEGER
  LANGUAGE JAVA
  PACKAGES = ('com.snowflake:snowpark:1.4.0')
  HANDLER = 'VariantTest.retrievePrice'
  AS
  $$
  import java.util.Map;
  import com.snowflake.snowpark_java.types.Variant;

  public class VariantTest {
    public static Integer retrievePrice(Variant v) throws Exception {
      Map<String, Variant> saleMap = v.asMap();
      int price = saleMap.get("vehicle").asMap().get("price").asInt();
      return price;
    }
  }
  $$;
```

## Reading a file with a Java UDF

You can read the contents of a file with handler code. For example, you might want to read a file to process unstructured data with the
handler. For more information on processing unstructured data, along with example code, refer to [Process unstructured data with UDF and procedure handlers](../../../user-guide/unstructured-data-java.md).

The file must be on a Snowflake stage that’s available to your handler.

To read the contents of staged files, your handler can:

* Read a file whose file path is statically-specified in the IMPORTS clause. At run time,
  your code reads the file from the UDF’s home directory.

  This can be useful when you want to access the file during initialization.
* Read a file from a directory imported using IMPORTS.
* Read a dynamically-specified file by calling methods of either the `SnowflakeFile` class or the `InputStream` class.

  You might do this if you need to access a file specified by the caller. For more information, see the following in this topic:

  + Reading a dynamically-specified file with SnowflakeFile
  + Reading a dynamically-specified file with InputStream

  `SnowflakeFile` provides features not available with `InputStream`, as described in the following table.

  | Class | Input | Notes |
  | --- | --- | --- |
  | `SnowflakeFile` | URL formats:  + Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner. + File URL or string path for files that the UDF owner has access to. The file must be located in a named internal stage or an external stage. | Easily access additional file attributes, such as file size. |
  | `InputStream` | URL formats:  + Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner. The file must be located in an internal or external stage. |  |

### Prerequisites

Before your Java handler code can read a file on a stage, you must do the following to make the file available to the code:

1. Create a stage that’s available to your handler.

   You can use an external stage or internal stage. If you use an internal stage, it must be a user or named stage;
   Snowflake does not currently support using a table stage for UDF dependencies. For more on creating a stage, see
   [CREATE STAGE](../../../sql-reference/sql/create-stage.md). For more on choosing an internal stage type, see
   [Choosing an internal stage for local files](../../../user-guide/data-load-local-file-system-create-stage.md).

   Keep in mind that adequate privileges on the stage must be assigned to roles performing SQL actions that read from the stage. For more
   information, see [Granting privileges for user-defined functions](../udf-access-control.md).
2. To the stage, copy the file that will be read by code.

   You can copy the file from a local drive to a stage by using the PUT command. For command reference, see [PUT](../../../sql-reference/sql/put.md).
   For information on staging files with PUT, see [Staging data files from a local file system](../../../user-guide/data-load-local-file-system-stage.md).

### Reading a file specified statically in IMPORTS

Your handler can read a file whose stage path has been specified in the IMPORTS clause of the
[CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

When you specify a file in the IMPORTS clause, Snowflake copies that file from the stage to the UDF’s
*home directory* (also called the *import directory*), which is the directory from which the UDF actually reads the file.

Because imported files are copied to a single directory and must have unique names within that directory, each file in the
IMPORTS clause must have a distinct name, even if the files start out in different stages or different subdirectories within a
stage.

The following example creates and calls a Java UDF that reads a file.

The Java source code below creates a Java method named `readFile`. This UDF uses this method.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.stream.Stream;

class TestReadRelativeFile {
  public static String readFile(String fileName) throws IOException {
    StringBuilder contentBuilder = new StringBuilder();
    String importDirectory = System.getProperty("com.snowflake.import_directory");
    String fPath = importDirectory + fileName;
    Stream<String> stream = Files.lines(Paths.get(fPath), StandardCharsets.UTF_8);
    stream.forEach(s -> contentBuilder.append(s).append("\n"));
    return contentBuilder.toString();
  }
}
```

The following SQL code creates the UDF. This code assumes that the Java source code has been compiled and put into a JAR
file named `TestReadRelativeFile.jar`, which the UDF imports. The second and third imported files,
`my_config_file_1.txt` and `my_config_file_2.txt`, are configuration files that the UDF can read.

```sqlexample
CREATE FUNCTION file_reader(file_name VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVA
  IMPORTS = ('@my_stage/my_package/TestReadRelativeFile.jar',
             '@my_stage/my_path/my_config_file_1.txt',
             '@my_stage/my_path/my_config_file_2.txt')
  HANDLER = 'my_package.TestReadRelativeFile.readFile';
```

This code calls the UDF:

```sqlexample
SELECT file_reader('my_config_file_1.txt') ...;
...
SELECT file_reader('my_config_file_2.txt') ...;
```

#### Choosing whether to access a file in compressed or uncompressed format

Files in a stage can be stored in compressed or uncompressed format. Users can compress the file before copying it
to the stage, or can tell the [PUT](../../../sql-reference/sql/put.md) command to compress the file.

When Snowflake copies a file compressed in GZIP format from a stage to the UDF home directory, Snowflake can write the copy as-is, or
Snowflake can decompress the content before writing the file.

If the file in the stage is compressed, and if you would like the copy in the UDF home directory to also be compressed, then when
you specify the file name in the IMPORTS clause, simply use the original file name (e.g. “MyData.txt.gz”) in the IMPORTS
clause. For example:

```sqlexample
... IMPORTS = ('@MyStage/MyData.txt.gz', ...)
```

If the file in the stage is GZIP-compressed, but you would like the copy in the UDF home directory to be uncompressed, then when you
specify the file name in the IMPORTS clause, omit the “.gz” extension. For example, if your stage contains “MyData.txt.gz”, but you
want your UDF to read the file in uncompressed format, then specify “MyData.txt” in the IMPORTS clause. If there is not already an
uncompressed file named “MyData.txt”, then Snowflake searches for “MyData.txt.gz” and automatically writes a decompressed copy to
“MyData.txt” in the UDF home directory. Your UDF can then open and read the uncompressed file “MyData.txt”.

Note that smart decompression applies only to the copy in the UDF home directory; the original file in the stage is not changed.

Follow these best practices for handling compressed files:

* Follow proper file naming conventions. If a file is in GZIP-compressed format, then include the extension “.gz” at the end of the file
  name. If a file is not in GZIP-compressed format, then do not end the file name with the “.gz” extension.
* Avoid creating files whose names differ only by the extension “.gz”. For example, do not create both “MyData.txt” and “MyData.txt.gz”
  in the same stage and directory, and do not try to import both “MyData.txt” and “MyData.txt.gz” in the same CREATE FUNCTION command.
* Do not compress files twice. For example, if you compress a file manually, and then you PUT that file without using
  AUTO_COMPRESS=FALSE, the file will be compressed a second time. Smart decompression will decompress it only once, so the data
  (or JAR) file will still be compressed when it is stored in the UDF home directory.
* In the future, Snowflake might extend smart decompression to compression algorithms other than GZIP. To prevent compatibility issues
  in the future, apply these best practices to files that use any type of compression.

> **Note:**
>
> JAR files can also be stored in compressed or uncompressed format in a stage. Snowflake automatically decompresses all compressed
> JAR files before making them available to the Java UDF.

### Importing a directory using IMPORTS

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

You can import a directory using the IMPORTS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

> **Note:**
>
> * The import path for a directory must end with a trailing slash (`/`). For example, `IMPORTS = ('@my_stage/my_dir/')`.
> * To rename a directory on import, append `/=custom_name/` to the stage path. The custom name must be a single directory name, not a path. For example, `IMPORTS = ('@my_stage/my_dir/=custom_name/')`.
> * Directory imports are not supported in Native Apps.

The following example imports a directory called `my_dir` from a stage named `my_stage` and reads the files contained within it.

```sqlexample-java
CREATE OR REPLACE FUNCTION test_java_udf(fileName STRING)
  RETURNS STRING
  LANGUAGE JAVA
  IMPORTS = ('@my_stage/my_dir/')
  HANDLER = 'TestClass.readFile'
  AS
  $$
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.stream.Stream;

class TestClass {
  public static String readFile(String fileName) throws IOException {
    StringBuilder contentBuilder = new StringBuilder();
    String importDirectory = System.getProperty("com.snowflake.import_directory") + "my_dir/";
    String fPath = importDirectory + fileName;
    Stream<String> stream = Files.lines(Paths.get(fPath), StandardCharsets.UTF_8);
    stream.forEach(s -> contentBuilder.append(s).append("\n"));
    return contentBuilder.toString();
  }
}
$$;
SELECT test_java_udf('file.txt');
```

### Reading a dynamically-specified file with `SnowflakeFile`

Using methods of the `SnowflakeFile` class, you can read files from a stage with your Java handler code. The `SnowflakeFile`
class is included on the classpath available to Java UDF handlers on Snowflake.

> **Note:**
>
> To make your code resilient to file injection attacks, always use a scoped URL when passing a file’s location to a UDF, particularly
> when the function’s caller is not also its owner. You can create a scoped URL in SQL using the built-in function
> [BUILD_SCOPED_FILE_URL](../../../sql-reference/functions/build_scoped_file_url.md). For more information about what the BUILD_SCOPED_FILE_URL does, see
> [Introduction to unstructured data](../../../user-guide/unstructured-intro.md).

To develop your UDF code locally, add the Snowpark JAR containing `SnowflakeFile` to your code’s class path. For information about
`snowpark.jar`, see [Setting Up Your Development Environment for Snowpark Java](../../snowpark/java/setup.md). Note that Snowpark client applications cannot use this class.

When you use `SnowflakeFile`, it isn’t necessary to also specify either the staged file or the JAR containing
`SnowflakeFile` with an IMPORTS clause when you create the UDF, as in SQL with a CREATE FUNCTION statement.

Code in the following example uses `SnowflakeFile` to read a file from a specified stage location. Using an
`InputStream` from the `getInputStream` method, it reads the file’s contents into a `String` variable.

```sqlexample-java
CREATE OR REPLACE FUNCTION sum_total_sales(file STRING)
  RETURNS INTEGER
  LANGUAGE JAVA
  HANDLER = 'SalesSum.sumTotalSales'
  TARGET_PATH = '@jar_stage/sales_functions2.jar'
  AS
  $$
  import java.io.InputStream;
  import java.io.IOException;
  import java.nio.charset.StandardCharsets;
  import com.snowflake.snowpark_java.types.SnowflakeFile;

  public class SalesSum {

    public static int sumTotalSales(String filePath) throws IOException {
      int total = -1;

      // Use a SnowflakeFile instance to read sales data from a stage.
      SnowflakeFile file = SnowflakeFile.newInstance(filePath);
      InputStream stream = file.getInputStream();
      String contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8);

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total;
    }
  }
  $$;
```

Call the UDF, passing the location of the file in a scoped URL to reduce the likelihood of file injection attacks.

```sqlexample
SELECT sum_total_sales(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

> **Note:**
>
> The UDF owner must have access to any files whose locations are not scoped URLs. You can read these staged files by having the handler
> code call the `SnowflakeFile.newInstance` method with a `boolean` value for a new `requireScopedUrl` parameter.
>
> The following example uses `SnowflakeFile.newInstance` while specifying that a scoped URL is not required.
>
> ```java
> String filename = "@my_stage/filename.txt";
> String sfFile = SnowflakeFile.newInstance(filename, false);
> ```

### Reading a dynamically-specified file with `InputStream`

You can read file contents directly into a `java.io.InputStream` by making your handler function’s argument an `InputStream`
variable. This can be useful when the function’s caller will want to pass a file path as an argument.

> **Note:**
>
> To make your code resilient to file injection attacks, always use a scoped URL when passing a file’s location to a UDF, particularly
> when the function’s caller is not also its owner. You can create a scoped URL in SQL using the built-in function
> BUILD_SCOPED_FILE_URL. For more information about what the BUILD_SCOPED_FILE_URL does, see
> [Introduction to unstructured data](../../../user-guide/unstructured-intro.md).

Code in the following example has a handler function `sumTotalSales` that takes an `InputStream` and returns an `int`.
At run time, Snowflake automatically assigns the contents of the file at the `file` variable’s path to the `stream`
argument variable.

```sqlexample-java
CREATE OR REPLACE FUNCTION sum_total_sales(file STRING)
  RETURNS INTEGER
  LANGUAGE JAVA
  HANDLER = 'SalesSum.sumTotalSales'
  TARGET_PATH = '@jar_stage/sales_functions2.jar'
  AS
  $$
  import java.io.InputStream;
  import java.io.IOException;
  import java.nio.charset.StandardCharsets;

  public class SalesSum {

    public static int sumTotalSales(InputStream stream) throws IOException {
      int total = -1;
      String contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8);

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total;
    }
  }
  $$;
```

Call the UDF, passing the location of the file in a scoped URL to reduce the likelihood of file injection attacks.

```sqlexample
SELECT sum_total_sales(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

## Creating and calling a simple staged Java UDF

The following statements create a simple Java UDF. This sample generally follows the file and directory structure
described in [Organizing your files](udf-java-creating.md).

### Create and compile the Java handler code

1. In the root directory of your project (here, `my_udf`), create a `src` subdirectory to hold the source .java files and a
   `classes` subdirectory to hold the generated .class files.

   You should have a directory hierarchy similar to the following:

   ```none
   my_udf/
   |-- classes/
   |-- src/
   ```
2. In the `src` directory, create a directory called `mypackage` to hold .java files whose classes are in the
   `mypackage` package.
3. In the `mypackage` directory, create a `MyUDFHandler.java` file that contains your source code.

   ```java
   package mypackage;

   public class MyUDFHandler {

     public static int decrementValue(int i)
     {
       return i - 1;
     }

     public static void main(String[] argv)
     {
       System.out.println("This main() function won't be called.");
     }
   }
   ```
4. From your project root directory (here, `my_udf`), use the `javac` command to compile the source code.

   The `javac` command in the following example compiles `MyUDFHandler.java` to generate a `MyUDFHandler.class` file in the
   `classes` directory.

   ```shell
   javac -d classes src/mypackage/MyUDFHandler.java
   ```

   This example includes the following arguments:

   * `-d classes` – Directory into which generated class files should be written.
   * `src/mypackage/MyUDFHandler.java` – Path to the .java file in the form: `source_directory/package_directory/Java_file_name`.

### Package the compiled code into a JAR file

1. Optionally, in the project root directory create a manifest file named `my_udf.manifest` that contains the following attributes:

   ```none
   Manifest-Version: 1.0
   Main-Class: mypackage.MyUDFHandler
   ```
2. From your project root directory, run the `jar` command to create a JAR file containing the .class file and manifest.

   The `jar` command in the following example puts the generated `MyUDFHandler.class` file in a `mypackage` package folder
   into a .jar file called `my_udf.jar`. The `-C ./classes` flag specifies the location of the .class files.

   ```shell
   jar cmf my_udf.manifest my_udf.jar -C ./classes mypackage/MyUDFHandler.class
   ```

   This example includes the following arguments:

   * `cmf` – Command arguments: `c` to create a JAR file, `m` to use the specified .manifest file, and `f` to give the JAR
     file the specified name.
   * `my_udf.manifest` – Manifest file.
   * `my_udf.jar` – Name of the JAR file to create.
   * `-C ./classes` – Directory containing the generated .class files.
   * `mypackage/MyUDFHandler.class` – Package and name of .class file to include in the JAR.

### Upload the JAR file with the compiled handler to a stage

1. In Snowflake, create a stage called `jar_stage` to store the JAR file containing your UDF handler.

   For more information on creating a stage, see [CREATE STAGE](../../../sql-reference/sql/create-stage.md).
2. Use the `PUT` command to copy the JAR file from the local file system to a stage.

   ```sqlexample
   put
       file:///Users/Me/my_udf/my_udf.jar
       @jar_stage
       auto_compress = false
       overwrite = true
       ;
   ```

   You can store the `PUT` command in a script file and then execute that file through [SnowSQL](../../../user-guide/snowsql.md).

   The `snowsql` command looks similar to the following:

   ```shell
   snowsql -a <account_identifier> -w <warehouse> -d <database> -s <schema> -u <user> -f put_command.sql
   ```

   This example assumes that the user’s password is specified in the SNOWSQL_PWD environment variable.

### Create the UDF with the compiled code as handler

Create the UDF:

```sqlexample
CREATE FUNCTION decrement_value(i NUMERIC(9, 0))
  RETURNS NUMERIC
  LANGUAGE JAVA
  IMPORTS = ('@jar_stage/my_udf.jar')
  HANDLER = 'mypackage.MyUDFHandler.decrementValue'
  ;
```

Call the UDF:

```sqlexample
SELECT decrement_value(-15);
```

```output
+----------------------+
| DECREMENT_VALUE(-15) |
|----------------------|
|                  -16 |
+----------------------+
```

---
title: Java UDF limitations
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-limitations.md
section: Developer Guide
---

# Java UDF limitations

This topic describes the limitations in place for handlers written in Java.

## General limitations

* Although your Java method can use classes and methods in the standard Java libraries, Snowflake security
  constraints disable some capabilities, such as writing to files. For details, see the section
  titled [Following good security practices](udf-java-designing.md).
* Java UDFs are not sharable. Database objects that use Java UDFs are also not sharable. For example, you cannot:

  + Directly share a Java UDF.
  + Share a view that calls a Java UDF.
  + Share a function that calls a Java UDF.
  + Share a table with a masking or row access policy that calls a Java UDF.
* Granting USAGE privilege on a Java UDF might allow the recipient to see the contents of files imported by that UDF. If you grant the
  USAGE privilege on a Java UDF to a role, and if that role executes a statement that calls that Java UDF, then any Java UDF in the same
  statement could read the contents of any files imported by the Java UDF on which you granted USAGE privilege.
* [Replication](../../../user-guide/account-replication-intro.md) does not include external or internal stages yet.
  When you promote a secondary database to serve as the primary database, you must recreate stage objects and re-import any files missing
  in internal stages. The files should have the same path and filenames as in the original primary database.
* The maximum size for a Java UDF output row is 128 MB.

## Limitations on cloning

A Java UDF can be cloned when the database or schema containing the Java UDF is cloned.
To be cloned, the Java UDF must meet the following condition(s):

* If the Java UDF references a stage (for example, the stage that contains the UDF’s JAR file), that stage must be
  outside the schema (or database) being cloned.

  You can keep a Java UDF and its referenced stage(s) in separate schemas (and/or separate databases) the following ways:

  + Wherever the Java UDF references a stage, use a qualified stage name (e.g. “my_db.my_schema.my_stage()”)
    different from the schema or database of the Java UDF. If the cloning operation clones a database, the stage
    reference should include the database and schema. If the cloning operation clones a schema, the stage reference
    should include the schema (and optionally the database).
  + Create the referenced stage by using a non-qualified stage name (which implicitly uses the current session’s active
    database and schema), and create the Java UDF by using a qualified name that does not match the session’s
    current database and schema.
  + Use the user’s stage as the referenced stage (the user’s stage is separate from any database’s stage or schema’s stage).

If one or more Java UDFs in the schema or database do not meet the required conditions, the schema or database can
still be cloned, but the non-compliant Java UDFs are omitted from the clone without any error or warning message.

Each cloned Java UDF has the same definition as the original. That definition includes any references to stages.
The stage references in the Java UDF must be fully-qualified, and therefore are absolute, not relative to the
schema or database being cloned. Because both the original and the clone point to the same stage(s) and file(s):

* Dropping the stage or removing required files from the stage disables both the original and cloned UDF.
* Altering the stage or the files on the stage (e.g. replacing the JAR file with a newer JAR file) affects both the
  original and cloned UDF.

For more information about cloning, see [Cloning considerations](../../../user-guide/object-clone.md).

---
title: JavaScript stored procedures API
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-api.md
section: Developer Guide
---

# JavaScript stored procedures API

This topic covers the JavaScript API for Snowflake stored procedures.
The API consists of JavaScript objects and the methods in those objects.

## Object: `snowflake`

The `snowflake` object is accessible by default to the JavaScript code in a stored procedure; you do not need to create the object.
This object contains the methods in the stored procedure API. For example:

> ```sqlexample-javascript
> CREATE PROCEDURE stproc1()
>   RETURNS STRING NOT NULL
>   LANGUAGE JAVASCRIPT
>   AS
>   -- "$$" is the delimiter for the beginning and end of the stored procedure.
>   $$
>   // The "snowflake" object is provided automatically in each stored procedure.
>   var statement = snowflake.createStatement(...);
>   ...
>   $$
>   ;
> ```

More extensive code examples are provided in [Working with stored procedures](stored-procedures-usage.md).

### Constants

None.

### Methods

addEvent(*name*[, *attributes*])
:   Adds an event for tracing.

    For more information about trace events with JavaScript, refer to [Emitting trace events in JavaScript](../logging-tracing/tracing-javascript.md)

    Parameters:
    :   `name`

        > The name of the event to add.

        `attributes`

        > An object specifying attributes to associate with the event.

    Errors:
    :   Throws a JavaScript Error if:

        * `name` is not a string.
        * There are zero or more than two arguments.

    Examples:
    :   Add a `my_event` event with `score` and `pass` attributes.

        ```javascript
        snowflake.addEvent('my_event', {'score': 89, 'pass': true});
        ```

createStatement(*sql_command_object*)
:   Creates a `Statement` object representing the statement specified by `sql_command_object`. You can use the
    `Statement.execute()` method to execute the statement.

    Parameter(s):
    :   `sql_command_object`

        > A JSON object (dictionary) that contains the text of the SQL statement to execute and values to bind to that statement.
        > Properties of the `sql_command_object` JSON object include:
        >
        > * `sqlText`: A string containing the SQL statement to execute.
        > * `binds`: An array of values to bind to placeholders in the SQL statement specified by `sqlText`.

    Returns:
    :   A `Statement` object.

    Errors:
    :   Throws a JavaScript Error if:

        * `sqlText` is missing or contains an empty query text.
        * The statement tries to bind an argument whose data type is not supported. For information about data type
          mapping, see [SQL and JavaScript data type mapping](stored-procedures-javascript.md).
          For more information about binding, see [Binding variables](stored-procedures-javascript.md).

    Examples:
    :   The following example does not bind any values:

        ```javascript
        var stmt = snowflake.createStatement(
          {sqlText: "INSERT INTO table1 (col1) VALUES (1);"}
        );
        ```

        The following example binds values. Values in the `binds` property array are bound to `?` placeholders in the SQL text
        in the order they appear in the array.

        ```javascript
        var stmt = snowflake.createStatement(
          {
          sqlText: "INSERT INTO table2 (col1, col2) VALUES (?, ?);",
          binds:["LiteralValue1", variable2]
          }
        );
        ```

        For more information about binding, including additional examples,
        see [Binding variables](stored-procedures-javascript.md).

execute(*sql_command_object*)
:   Executes the SQL statement specified as the argument.

    `snowflake.execute` differs from `Statement.execute`, which you use to execute the statement represented by the
    `Statement` object, rather than executing the method’s argument.

    Parameters:
    :   `sql_command_object`

        > A JSON object (dictionary) that contains the text of the SQL statement to execute and values to bind to that statement.
        > Properties of the `sql_command_object` JSON object include:
        >
        > * `sqlText`: A string containing the SQL statement to execute.
        > * `binds`: An array of values to bind to placeholders in the SQL statement specified by `sqlText`.

    Returns:
    :   A result set in the form of a `ResultSet` object.

    Errors:
    :   Throws a JavaScript Error if:

        * An error, such as a compile error, occurred while executing the query.
        * `sqlText` is missing or contains an empty query text.
        * The statement tries to bind an argument whose data type is not supported. For information about data type
          mapping, see [SQL and JavaScript data type mapping](stored-procedures-javascript.md).
          For more information about binding, including additional examples,
          see [Binding variables](stored-procedures-javascript.md).

log(*level*, *message*[, *attributes*])
:   Logs a message at the specified severity level, optionally with attributes.

    For more information, see [Logging messages in JavaScript](../logging-tracing/logging-javascript.md).

    Parameters:
    :   `level`

        > The severity level at which to log the message. You can specify one of the following strings:
        >
        > * `'off'`
        > * `'trace'`
        > * `'debug'`
        > * `'info'`
        > * `'warn'`
        > * `'error'`
        > * `'fatal'`

        `message`

        > The message to log.

        `attributes`

        > Optional. A JSON object with key-value pairs.

    Errors:
    :   Throws a JavaScript error if:

        * `level` is not a string.
        * `level` is not one of the supported `level` values listed above.

    Examples:
    :   ```javascript
        snowflake.log("error", "Error message", {"custom1": "value1", "custom2": "value2"});
        ```

setSpanAttribute(*key*, *value*)
:   Sets an attribute for the current span when tracing events.

    For more information about trace events with JavaScript, refer to [Emitting trace events in JavaScript](../logging-tracing/tracing-javascript.md)

    Parameters:
    :   `key`

        > The attribute’s key.

        `value`

        > The attribute’s value.

    Errors:
    :   Throws a JavaScript error if:

        * Two arguments aren’t specified.
        * `key` is not a string.

    Examples:
    :   Set an attribute whose key is `example.boolean` and whose value is `true`.

        ```javascript
        snowflake.setSpanAttribute("example.boolean", true);
        ```

## Object: `Statement`

A stored procedure `Statement` object provides the methods for executing a query statement and accessing
metadata (such as column data types) about the statement.

At the time the Statement object is created, the SQL is parsed, and a prepared statement is created.

### Constants

None.

### Methods

execute()
:   This method executes the prepared statement stored in this `Statement` object.

    `Statement.execute` differs from `snowflake.execute`, which you use to execute the method’s argument, rather than
    a statement represented by the `Statement` object.

    Parameters:
    :   None because the method uses information that is already stored in the `Statement` object.

    Returns:
    :   A result set in the form of a `ResultSet` object.

    Errors:
    :   Throws a JavaScript Error if the query fails.

    Examples:
    :   See [Working with stored procedures](stored-procedures-usage.md).

getColumnCount()
:   This method returns the number of columns in the result set for an executed query. If the query has not yet been executed, this method throws an Error.

    Parameters:
    :   None.

    Returns:
    :   The number of columns.

    Errors:
    :   Throw a JavaScript Error if the statement has not yet been executed (and thus the number of returned columns cannot necessarily
        be determined).

    Examples:
    :   ```javascript
        var column_count = statement.getColumnCount();
        ```

getColumnName(*colIdx*)
:   This method returns the name of the specified column.

    Parameters:
    :   The index number of the column (starting from `1`, not `0`).

    Returns:
    :   The name of the column.

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified index exists.

getColumnScale(*colIdx*)
:   This method returns the scale of the specified column. The scale is the number of digits after the decimal point. The scale of the column was specified
    in the CREATE TABLE or ALTER TABLE statement. For example:

    > ```sqlexample
    > CREATE TABLE scale_example  (
    >     n10_4 NUMERIC(10, 4)    // Precision is 10, Scale is 4.
    >     );
    > ```

    Although this method can be called for any data type, it is intended for use with numeric data types.

    Parameters:
    :   The index of the column for which you want the scale (starting from `1`, not `0`).

    Returns:
    :   The scale of the column (for numeric columns); `0` for non-numeric (columns).

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified index exists.

    Examples:
    :   See [Working with stored procedures](stored-procedures-usage.md) (search for `getColumnScale()`).

getColumnSqlType(*colIdx|colName*)
:   This method returns the SQL data type of the specified column.

    Parameters:
    :   Either the index number of the column (starting from `1`, not `0`) or the name of the column. (The method is overloaded to accept different
        data types as parameters.)

        The column name should be all uppercase unless double quotes were used in the column name when the table was created (i.e. the case of the column
        name was preserved).

    Returns:
    :   The SQL data type of the column.

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified name or index exists.

getColumnType(*colIdx|colName*)
:   This method returns the JavaScript data type of the specified column.

    Parameters:
    :   Either the index number of the column (starting from `1`, not `0`) or the name of the column. (The method is overloaded to accept different
        data types as parameters.)

        The column name should be all uppercase unless double quotes were used in the column name when the table was created (i.e. the case of the column
        name was preserved).

    Returns:
    :   The JavaScript data type of the column.

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified index or name exists.

getNumDuplicateRowsUpdated()
:   This method returns the number of “duplicate” rows (often called *multi-joined rows*) updated by this Statement.
    (For information about how multi-joined rows are formed, see the
    [Usage Notes and Examples for the UPDATE statement](../../sql-reference/sql/update.md).)

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of multi-joined rows updated.

    Errors:
    :   Throws a JavaScript error if the statement has not yet been executed.

getNumRowsAffected()
:   This method returns the number of rows affected (e.g. inserted/updated/deleted) by this Statement.

    If more than one type of change applies (e.g. a [MERGE](../../sql-reference/sql/merge.md) operation inserted some rows and
    updated others), then the number is the total number of rows affected by all of the changes.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows affected.

    Errors:
    :   Throws a JavaScript error if the statement has not yet been executed.

getNumRowsDeleted()
:   This method returns the number of rows deleted by this Statement.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows deleted.

    Errors:
    :   Throws a JavaScript error if the statement has not yet been executed.

getNumRowsInserted()
:   This method returns the number of rows inserted by this Statement.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows inserted.

    Errors:
    :   Throws a JavaScript error if the statement has not yet been executed.

getNumRowsUpdated()
:   This method returns the number of rows updated by this Statement.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows updated.

    Errors:
    :   Throws a JavaScript error if the statement has not yet been executed.

getRowCount()
:   This method returns the number of rows in the result set for an executed query. If the query has not yet been executed, this method throws an Error.

    Parameters:
    :   None.

    Returns:
    :   The number of rows.

    Errors:
    :   Throw a JavaScript Error if the statement has not yet been executed (and thus the number of returned rows cannot be determined).

    Examples:
    :   ```javascript
        var row_count = statement.getRowCount();
        ```

getQueryId()
:   This method returns the UUID of the most recent query executed.

    Parameters:
    :   None.

    Returns:
    :   A string containing a UUID, which is the query ID.

    Errors:
    :   If no query has been executed yet by this statement, the method throws the error
        “Statement is not executed yet.”

    Examples:
    :   ```javascript
        var queryId = statement.getQueryId();
        ```

getSqlText()
:   This method returns the text of the prepared query in the `Statement` object.

    Parameters:
    :   None.

    Returns:
    :   A string of the prepared query text.

    Errors:
    :   None.

    Examples:
    :   ```javascript
        var queryText = statement.getSqlText();
        ```

isColumnNullable(*colIdx*)
:   This method returns whether the specified column allows SQL NULL values.

    Parameters:
    :   The index of the column (starting from `1`, not `0`).

    Returns:
    :   `true` if the column allows SQL NULL values; otherwise, `false`.

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified index exists.

isColumnText(*colIdx*)
:   This method returns true if the column data type is one of the following SQL text data types:

    * CHAR or CHAR(N), as well as their synonyms CHARACTER and CHARACTER(N)
    * VARCHAR or VARCHAR(N)
    * STRING
    * TEXT

    Otherwise, it returns false.

    Parameters:
    :   The index of the column (starting from `1`, not `0`).

    Returns:
    :   `true` if the column data type is one of the SQL text data types; `false` for all other data types.

    Errors:
    :   Throws a JavaScript Error if:

        * The `Statement` has not yet been executed.
        * No column with the specified index exists.

    > **Note:**
    >
    > The API provides several methods for determining the data type of a column. The first method is described in detail above. The remaining methods have
    > the same parameters and errors; the only difference is the return value.

isColumnArray(*colIdx*)
:   Returns:
    :   `true` if the column data type is ARRAY (for semi-structured data); `false` for all other data types.

isColumnBinary(*colIdx*)
:   Returns:
    :   `true` if the column data type is BINARY or VARBINARY; `false` for all other data types.

isColumnBoolean(*colIdx*)
:   Returns:
    :   `true` if the column data type is BOOLEAN; `false` for all other data types.

isColumnDate(*colIdx*)
:   Returns:
    :   `true` if the column data type is DATE; `false` for all other data types.

isColumnNumber(*colIdx*)
:   Returns:
    :   `true` if the column data type is one of the SQL numeric types (NUMBER, NUMERIC, DECIMAL, INT, INTEGER, BIGINT, SMALLINT, TINYINT, BYTEINT,
        FLOAT, FLOAT4, FLOAT8, DOUBLE, DOUBLE PRECISION, or REAL); `false` for all other data types.

isColumnObject(*colIdx*)
:   Returns:
    :   `true` if the column data type is OBJECT (for semi-structured data); `false` for all other data types.

isColumnTime(*colIdx*)
:   Returns:
    :   `true` if the column data type is TIME or DATETIME; `false` for all other data types.

isColumnTimestamp(*colIdx*)
:   Returns:
    :   `true` if the column data type is one of the SQL timestamp types (TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ); `false`
        for all other data types, including other date and time data types (DATE, TIME, or DATETIME).

isColumnVariant(*colIdx*)
:   Returns:
    :   `true` if the column data type is VARIANT (for semi-structured data); `false` for all other data types.

## Object: `ResultSet`

This object contains the results returned by a query. The results are treated as a set of zero or more rows, each of which contains one or more columns. The term
“set” is not used here in the mathematical sense. In mathematics, a set is unordered, whereas a `ResultSet` has an order.

A `ResultSet` is similar in some ways to the concept of a SQL cursor. For example, you can see one row at a time in a `ResultSet`, just as you can see
one row at a time in a cursor.

Typically, after you retrieve a `ResultSet`, you iterate through it by repeating the following operations:

* Call `next()` to get the next row.
* Retrieve data from the current row by calling methods such as `getColumnValue()`.

If you do not know enough about the data in the `ResultSet` (e.g. you do not know the data type of each column), then you can call other methods that provide information about
the data.

Some of the methods of the `ResultSet` object are similar to the methods of the `Statement` object. For example, both objects have a
`getColumnSqlType(colIdx)` method.

### Constants

None.

### Methods

getColumnCount()
:   This method returns the number of columns in this ResultSet.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of columns.

    Errors:
    :   None.

getColumnSqlType(*colIdx|colName*)
:   This method returns the SQL data type of the specified column.

    Parameters:
    :   Either the index number of the column (starting from `1`, not `0`) or the name of the column. (The method is overloaded to accept different
        data types as parameters.)

        The column name should be all uppercase unless double quotes were used in the column name when the table was created (i.e. the case of the column
        name was preserved).

    Returns:
    :   The SQL data type of the column.

    Errors:
    :   Throws a JavaScript Error if:

        * `ResultSet` is empty or `next()` has not yet been called.
        * No column with the specified index or name exists.

getColumnValue(*colIdx|colName*)
:   This method returns the value of a column in the current row (i.e. the row most recently retrieved by `next()`).

    Parameters:
    :   Either the index number of the column (starting from `1`, not `0`) or the name of the column. (The method is overloaded to accept different
        data types as parameters.)

        The column name should be all uppercase unless double quotes were used in the column name when the table was created (i.e. the case of the column
        name was preserved).

    Returns:
    :   The value of the specified column.

    Errors:
    :   Throws a JavaScript Error if:

        * `ResultSet` is empty or `next()` has not yet been called.
        * No column with the specified index or name exists.

    Examples:
    :   Convert a row in the database into a JavaScript array:

        > ```javascript
        > var valueArray = [];
        > // For each row...
        > while (myResultSet.next())  {
        >     // Append each column of the current row...
        >     valueArray.push(myResultSet.getColumnValue('MY_COLUMN_NAME1'));
        >     valueArray.push(myResultSet.getColumnValue('MY_COLUMN_NAME2'));
        >     ...
        >     // Do something with the row of data that we retrieved.
        >     f(valueArray);
        >     // Reset the array before getting the next row.
        >     valueArray = [];
        >     }
        > ```

        Also, a column’s value can be accessed as a property of the `ResultSet` object (e.g. `myResultSet.MY_COLUMN_NAME`).

        > ```javascript
        > var valueArray = [];
        > // For each row...
        > while (myResultSet.next())  {
        >     // Append each column of the current row...
        >     valueArray.push(myResultSet.MY_COLUMN_NAME1);
        >     valueArray.push(myResultSet.MY_COLUMN_NAME2);
        >     ...
        >     // Do something with the row of data that we retrieved.
        >     f(valueArray);
        >     // Reset the array before getting the next row.
        >     valueArray = [];
        >     }
        > ```

    > **Note:**
    >
    > Remember that unless the column name was delimited with double quotes in the CREATE TABLE statement, the column name should be all uppercase in the
    > JavaScript code.

getColumnValueAsString(*colIdx|colName*)
:   This method returns the value of a column as a string, which is useful when you need a column value regardless of the original data type in the table.

    The method is identical to the method `getColumnValue()` except that it returns a string value.

    For more details, see `getColumnValue()`.

getNumRowsAffected()
:   This method returns the number of rows affected (e.g. inserted/updated/deleted) by the Statement that generated this ResultSet.

    If more than one type of change applies (e.g. a [MERGE](../../sql-reference/sql/merge.md) operation inserted some rows and
    updated others), then the number is the total number of rows affected by all of the changes.

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows affected.

    Errors:
    :   None.

getQueryId()
:   This method returns the UUID of the most recent query executed.

    Parameters:
    :   None.

    Returns:
    :   A string containing a UUID, which is the query ID.

    Examples:
    :   ```javascript
        var queryId = resultSet.getQueryId();
        ```

getRowCount()
:   This method returns the number of rows in this ResultSet. (This is the total number of rows, not the number of rows that
    haven’t been consumed yet.)

    Parameters:
    :   None.

    Returns:
    :   A value of type Number that indicates the number of rows.

    Errors:
    :   None.

next()
:   This method gets the next row in the `ResultSet` and makes it available for access.

    This method does not return the new data row. Instead, it makes the row available so that you can call methods such as `ResultSet.getColumnValue()` to
    retrieve the data.

    Note that you must call `next()` for each row in the result set, including the first row.

    Parameters:
    :   None.

    Returns:
    :   `true` if it retrieved a row and `false` if there are no more rows to retrieve.

        Thus, you can iterate through `ResultSet` until `next()` returns false.

    Errors:
    :   None.

## Object: `SfDate`

JavaScript does not have a native data type that corresponds to the Snowflake SQL data types
TIMESTAMP_LTZ, TIMESTAMP_NTZ, and TIMESTAMP_TZ. When you retrieve a value of type TIMESTAMP from the database
and want to store it as a JavaScript variable (for example, copy the value from a ResultSet to a JavaScript variable),
use the Snowflake-defined JavaScript data type `SfDate`.
The `SfDate` (“SnowFlake Date”) data type is an extension of the JavaScript date data type.
`SfDate` has extra methods, which are documented below.

### Constants

None.

### Methods

Unless otherwise specified, the examples below assume UTC time zone.

getEpochSeconds()
:   This method returns the number of seconds since the beginning of “the epoch” (midnight January 1, 1970).

    Parameters:
    :   None.

    Returns:
    :   The number of seconds between midnight January 1, 1970 and the timestamp stored in the variable.

    Examples:
    :   Create the stored procedure:

        > ```sqlexample
        > CREATE OR REPLACE PROCEDURE test_get_epoch_seconds(TSV VARCHAR)
        >     RETURNS FLOAT
        >     LANGUAGE JAVASCRIPT
        >     AS
        >     $$
        >     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_NTZ;";
        >     var stmt = snowflake.createStatement( {sqlText: sql_command} );
        >     var resultSet = stmt.execute();
        >     resultSet.next();
        >     var my_sfDate = resultSet.getColumnValue(1);
        >     return my_sfDate.getEpochSeconds();
        >     $$
        >     ;
        > ```

        Pass the procedure different timestamps and retrieve the number of seconds since the epoch for each timestamp.

        > ```sqlexample
        > CALL test_get_epoch_seconds('1970-01-01 00:00:00.000000000');
        > +------------------------+
        > | TEST_GET_EPOCH_SECONDS |
        > |------------------------|
        > |                      0 |
        > +------------------------+
        > ```
        >
        > ```sqlexample
        > CALL test_get_epoch_seconds('1970-01-01 00:00:01.987654321');
        > +------------------------+
        > | TEST_GET_EPOCH_SECONDS |
        > |------------------------|
        > |                      1 |
        > +------------------------+
        > ```
        >
        > ```sqlexample
        > CALL test_get_epoch_seconds('1971-01-01 00:00:00');
        > +------------------------+
        > | TEST_GET_EPOCH_SECONDS |
        > |------------------------|
        > |               31536000 |
        > +------------------------+
        > ```

getNanoSeconds()
:   This method returns the value of the nanoseconds field of the object. Note that this is just the fractional
    seconds, not the nanoseconds since the beginning of the epoch. Thus the value is always between 0 and 999999999.

    Parameters:
    :   None.

    Returns:
    :   The number of nanoseconds.

    Examples:
    :   Create the stored procedure:

        > ```sqlexample
        > CREATE OR REPLACE PROCEDURE test_get_nano_seconds2(TSV VARCHAR)
        >     RETURNS FLOAT
        >     LANGUAGE JAVASCRIPT
        >     AS
        >     $$
        >     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_NTZ;";
        >     var stmt = snowflake.createStatement( {sqlText: sql_command} );
        >     var resultSet = stmt.execute();
        >     resultSet.next();
        >     var my_sfDate = resultSet.getColumnValue(1);
        >     return my_sfDate.getNanoSeconds();
        >     $$
        >     ;
        > -- Should be 0 nanoseconds.
        > -- (> SNIPPET_TAG=query_03_01
        > CALL test_get_nano_seconds2('1970-01-01 00:00:00.000000000');
        > ```

        Pass the procedure different timestamps and retrieve the number of nanoseconds from each.

        > ```sqlexample
        > CALL test_get_nano_seconds2('1970-01-01 00:00:00.000000000');
        > +------------------------+
        > | TEST_GET_NANO_SECONDS2 |
        > |------------------------|
        > |                      0 |
        > +------------------------+
        > ```
        >
        > ```sqlexample
        > CALL test_get_nano_seconds2('1970-01-01 00:00:01.987654321');
        > +------------------------+
        > | TEST_GET_NANO_SECONDS2 |
        > |------------------------|
        > |              987654321 |
        > +------------------------+
        > ```
        >
        > ```sqlexample
        > CALL test_get_nano_seconds2('1971-01-01 00:00:00.000123456');
        > +------------------------+
        > | TEST_GET_NANO_SECONDS2 |
        > |------------------------|
        > |                 123456 |
        > +------------------------+
        > ```

getScale()
:   This method returns the precision of the data type, i.e. the number of digits after the decimal point.
    For example, the precision of TIMESTAMP_NTZ(3) is 3 (milliseconds). The precision of TIMESTAMP_NTZ(0) is 0 (no
    fractional seconds). The precision of TIMESTAMP_NTZ is 9 (nanoseconds).

    The minimum is 0. The maximum is 9 (precision is to 1 nanosecond). The default precision is 9.

    Parameters:
    :   None.

    Returns:
    :   The number of digits after the decimal place (number of digits in the fractional seconds field).

    Examples:
    :   Create the stored procedure:

        > ```sqlexample
        > CREATE OR REPLACE PROCEDURE test_get_scale(TSV VARCHAR, SCALE VARCHAR)
        >     RETURNS FLOAT
        >     LANGUAGE JAVASCRIPT
        >     AS
        >     $$
        >     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_NTZ(" + SCALE + ");";
        >     var stmt = snowflake.createStatement( {sqlText: sql_command} );
        >     var resultSet = stmt.execute();
        >     resultSet.next();
        >     var my_sfDate = resultSet.getColumnValue(1);
        >     return my_sfDate.getScale();
        >     $$
        >     ;
        >
        > -- Should be 0.
        > -- (> SNIPPET_TAG=query_04_01
        > CALL test_get_scale('1970-01-01 00:00:00', '0');
        > ```

        In this example, the timestamp is defined as TIMESTAMP_NTZ(0), so the precision is 0.

        > ```sqlexample
        > CALL test_get_scale('1970-01-01 00:00:00', '0');
        > +----------------+
        > | TEST_GET_SCALE |
        > |----------------|
        > |              0 |
        > +----------------+
        > ```

        In this example, the timestamp is defined as TIMESTAMP_NTZ(2), so the precision is 2.

        > ```sqlexample
        > CALL test_get_scale('1970-01-01 00:00:01.123', '2');
        > +----------------+
        > | TEST_GET_SCALE |
        > |----------------|
        > |              2 |
        > +----------------+
        > ```

        In this example, the timestamp is defined as TIMESTAMP_NTZ, so the precision is 9, which is the default.

        > ```sqlexample
        > CALL test_get_scale('1971-01-01 00:00:00.000123456', '9');
        > +----------------+
        > | TEST_GET_SCALE |
        > |----------------|
        > |              9 |
        > +----------------+
        > ```

getTimezone()
:   This method returns the timezone as the number of minutes before or after UTC.

    Parameters:
    :   None.

    Returns:
    :   The timezone as a number of minutes before or after UTC.

    Examples:
    :   Create the stored procedure:

        > ```sqlexample
        > CREATE OR REPLACE PROCEDURE test_get_Timezone(TSV VARCHAR)
        >     RETURNS FLOAT
        >     LANGUAGE JAVASCRIPT
        >     AS
        >     $$
        >     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_TZ;";
        >     var stmt = snowflake.createStatement( {sqlText: sql_command} );
        >     var resultSet = stmt.execute();
        >     resultSet.next();
        >     var my_sfDate = resultSet.getColumnValue(1);
        >     return my_sfDate.getTimezone();
        >     $$
        >     ;
        > ```

        In this example, the time zone is 8 hours (480 minutes) behind UTC.

        > ```sqlexample
        > CALL test_get_timezone('1970-01-01 00:00:01-08:00');
        > +-------------------+
        > | TEST_GET_TIMEZONE |
        > |-------------------|
        > |              -480 |
        > +-------------------+
        > ```

        In this example, the time zone is 11 hours (660 minutes) ahead of UTC.

        > ```sqlexample
        > CALL test_get_timezone('1971-01-01 00:00:00.000123456+11:00');
        > +-------------------+
        > | TEST_GET_TIMEZONE |
        > |-------------------|
        > |               660 |
        > +-------------------+
        > ```

toString()
:   Parameters:
    :   None.

    Returns:
    :   This method returns a string representation of the timestamp.

    Examples:
    :   This shows a simple example of creating an `SfDate` and calling its `toString` method:

        > ```sqlexample
        > CREATE OR REPLACE PROCEDURE test_toString(TSV VARCHAR)
        >     RETURNS VARIANT
        >     LANGUAGE JAVASCRIPT
        >     AS
        >     $$
        >     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_TZ;";
        >     var stmt = snowflake.createStatement( {sqlText: sql_command} );
        >     var resultSet = stmt.execute();
        >     resultSet.next();
        >     var my_sfDate = resultSet.getColumnValue(1);
        >     return my_sfDate.toString();
        >     $$
        >     ;
        > ```
        >
        > ```sqlexample
        > CALL test_toString('1970-01-02 03:04:05');
        > +------------------------------------------------------------------+
        > | TEST_TOSTRING                                                    |
        > |------------------------------------------------------------------|
        > | "Fri Jan 02 1970 03:04:05 GMT+0000 (Coordinated Universal Time)" |
        > +------------------------------------------------------------------+
        > ```

---
title: JavaScript UDF limitations
source: https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-limitations.md
section: Developer Guide
---

# JavaScript UDF limitations

To ensure stability within the Snowflake environment, Snowflake places the following limitations on JavaScript UDFs. These
limitations are not invoked at the time of UDF creation, but rather at runtime when the UDF is called.
This topic covers general JavaScript UDF (user-defined function) requirements and usage details, as well as limitations that are
specific to JavaScript UDFs.

## Maximum size of JavaScript source code

Snowflake limits the maximum size of the JavaScript source code in the body of a JavaScript UDF. Snowflake recommends limiting
the size to 100 KB. (The code is stored in a compressed form, and the exact limit depends on the compressibility of the code.)

## Consuming too much memory will cause UDF to fail

JavaScript UDFs will fail if they consume too much memory. The specific limit is subject to change. Using too much memory will
result in an error being returned.

## Taking too long to complete will cause a UDF to be killed and an error returned

JavaScript UDFs that take too long to complete will be killed and an error returned to the user. In addition, JavaScript UDFs that
enter endless loops will result in errors.

## Excess stack depth will result in an error

Excessive stack depth due to recursion will result in an error.

## Global state

Snowflake usually preserves the JavaScript global state between iterations of a UDF. However, you should not rely on previous
modifications to the global state being available between function calls. Additionally, you should not assume that all rows will
execute within the same JavaScript environment.

In practice, the global state is relevant with:

* Complex/expensive initialization logic. By default, the provided UDF code is evaluated for every row processed. If that code
  contains complex logic, this might be inefficient.
* Functions that contain code that is not idempotent. A typical pattern would be:

  > ```javascript
  > Date.prototype._originalToString = Date.prototype.toString;
  > Date.prototype.toString = function() {
  >   /* ... SOME CUSTOM CODE ... */
  >   this._originalToString()
  >   }
  > ```

  The first time that this code is executed, it changes the state of `toString` and `_originalToString`. Those changes
  are preserved in the global state, and the second time that this code is executed, the values are changed again in a way that
  creates recursion. The second time that `toString` is called, the code recurses infinitely (until it runs out of stack space).

For these situations, a recommended pattern is to guarantee that relevant code is evaluated only once, using JavaScript’s global
variable semantics. For example:

> ```javascript
> var setup = function() {
> /* SETUP LOGIC */
> };
>
> if (typeof(setup_done) === "undefined") {
>   setup();
>   setup_done = true;  // setting global variable to true
> }
> ```

Note that this mechanism is only safe for caching the effects of code evaluation. It is not guaranteed that after an initialization
the global context will be preserved for all rows, and no business logic should depend on it.

## JavaScript libraries

JavaScript UDFs support access to the standard JavaScript library. Note that this excludes many objects and methods typically
provided by browsers. There is no mechanism to import, include, or call additional libraries. All required code should be embedded
within the UDF.

Additionally, the built-in JavaScript `eval()` function is disabled.

## Returned variant size and depth

Returned variant objects are subject to size and nesting-depth limitations:

Size:
:   Currently limited to several megabytes, but subject to change.

Depth:
:   Currently limited to a nesting depth of 1000, but subject to change.

If any object is too large or too deep, an error is returned when the UDF is called.

## Argument and return type constraints are sometimes ignored

Certain type characteristics declared for an argument or return value will be ignored when the UDF is called. In these cases, the
received value may be used as received whether or not it conforms to constraints specified in the declaration.

The following are ignored for UDFs whose logic is written in JavaScript:

* Length for arguments of type VARCHAR

### Example

Code in the following example declares that the `arg1` argument and the return value must be a VARCHAR no more than one character
long. However, calling this function with an `arg1` whose value is longer than one character will succeed as if the constraint were
not specified.

```sqlexample
CREATE OR REPLACE FUNCTION tf (arg1 VARCHAR(1))
RETURNS VARCHAR(1)
LANGUAGE JAVASCRIPT AS 'return A.substr(3, 3);';
```

---
title: JDBC Datasources Setup for Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-jdbc-datasources.md
section: Developer Guide
---

# JDBC Datasources Setup for Snowpark Connect for Spark

This section provides a guide and sample code for reading data from
external databases and writing data to external databases (such as MySQL
and PostgreSQL) using the Snowpark Connect JDBC data source feature. It
covers both client-side and Snowflake Notebook setup.

## Part 1: Client-side setup (MySQL)

This setup is required when running Snowpark Connect from a local client
application, such as a Python script or IDE.

### Prerequisites

1. **Java Runtime Environment (JRE) / Java Development Kit (JDK):**

   * Install a JRE or JDK. The architecture (for example, 64-bit) of your Java
     installation **must** match the architecture of your Python
     installation.
   * *Example source for installation:* [Adoptium Temurin
     Releases](https://adoptium.net/temurin/releases/?version=11) (if
     using Java 11).
2. **Set ``JAVA_HOME`` Environment Variable:**

   * Configure the `JAVA_HOME` environment variable to point to the
     root directory of your Java installation.
   * *Example (macOS/Linux):*

```bash
export JAVA_HOME=/path/to/your/jdk/home
```

3. **Set ``CLASSPATH`` Environment Variable:**

   * Add the path to your specific database’s JDBC driver `.jar` file
     to the `CLASSPATH` environment variable. This allows the Java
     environment to find the necessary driver.
   * *Example (for MySQL driver):*

```bash
export CLASSPATH=$CLASSPATH:/path/to/your/driver/mysql-connector-j-9.2.0.jar
```

### Sample client code (read from MySQL)

This example demonstrates how to read a table from a MySQL database
using `spark_session.read.jdbc()`.

```python
from pyspark.sql import Row

# Adjust the URL for your server host, port, and database name
MYSQL_JDBC_URL = "jdbc:mysql://localhost/test_db"

# Ensure this driver name matches your version of the JDBC driver
MYSQL_JDBC_DRIVER = "com.mysql.cj.jdbc.Driver"

def test_jdbc_read_from_mysql(self, spark_session):
    # This code snippet uses the Snowpark Connect Spark session
    jdbc_df = spark_session.read.jdbc(
        MYSQL_JDBC_URL,
        "my_schema.my_table",  # Specify your table name in MySQL
        properties={
            "user": "root",           # Your MySQL user name
            "password": "****",       # Your password for MySQL
            "driver": MYSQL_JDBC_DRIVER,
        },
    ).collect()

    # After reading via JDBC, the data is loaded into a temporary table in Snowflake.
    # You can now perform any standard DataFrame operations supported by Snowpark Connect.
```

### Sample client code (write to MySQL)

This example demonstrates how to write data into a MySQL database using
`spark_session.write.jdbc()`.

```python
from pyspark.sql import Row

# Adjust the URL for your server host, port, and database name
MYSQL_JDBC_URL = "jdbc:mysql://localhost/test_db"

# Ensure this driver name matches your version of the JDBC driver
MYSQL_JDBC_DRIVER = "com.mysql.cj.jdbc.Driver"

def test_jdbc_write_overwrite_to_mysql(self, spark_session):
    # This code snippet uses the Snowpark Connect Spark session
    jdbc_df = spark_session.createDataFrame(
        [
            Row(a=1, b=2.0, c="test1"),
            Row(a=2, b=3.0, c="test2"),
            Row(a=4, b=5.0, c="test3"),
        ]
    )

    jdbc_df.write.jdbc(
        MYSQL_JDBC_URL,
        "my_schema.my_table2",  # Specify your table name in MySQL
        mode="overwrite",
        properties={
            "user": "root",        # Your MySQL user name
            "password": "****",    # Your password for MySQL
            "driver": MYSQL_JDBC_DRIVER,
        },
    )
```

## Part 2: Snowflake Warehouse Notebook setup (PostgreSQL)

This setup is used when running Snowpark Connect directly within a
Snowflake Notebook environment.

### Setup steps

* **Add the ``snowpark-connect`` Package:**

  + Ensure the `snowflake-snowpark-connect` package is added to your
    notebook environment.

* **Download and Upload JDBC Driver:**

  + Download the appropriate JDBC driver `.jar` file for your external
    database (for example, [PostgreSQL JDBC
    Driver](https://jdbc.postgresql.org/download/postgresql-42.7.8.jar)).
  + Upload the downloaded `.jar` file directly into your notebook
    environment.
* **Activate External Integrations (Network Rule & Integration):**

  + Snowflake requires an **External Access Integration** to allow the
    notebook to communicate with external network locations. You must
    define a **Network Rule** for the host and port of your external
    database.

```sql
-- 1. Create a Network Rule for the external database host and port
CREATE OR REPLACE NETWORK RULE JDBC_READ_NETWORK_RULE
  MODE = EGRESS
  TYPE = HOST_PORT
  VALUE_LIST = ('hh-pgsql-public.ebi.ac.uk:5432'); -- REPLACE with your host:port

-- 2. Create the External Access Integration using the new Network Rule
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION JDBC_READ_ACCESS_INTEGRATION
  ALLOWED_NETWORK_RULES = (JDBC_READ_NETWORK_RULE)
  ENABLED = true;

-- NOTE: This integration must be referenced/activated within your notebook's settings.
```

### Sample Warehouse Notebook code (read from PostgreSQL)

This example shows the necessary Python code to initialize the session,
load the driver, and read data from PostgreSQL.

```python
from snowflake import snowpark_connect
import jpype

# Initialize the Spark session for Snowpark Connect
spark = snowpark_connect.server.init_spark_session()
df = spark.sql("show schemas").limit(2)
df.show()

# Add the uploaded JDBC driver JAR to the Java Classpath using jpype
# Adjust the path to match the name of the JAR file you uploaded
jpype.addClassPath('/tmp/appRoot/postgresql-42.7.8.jar')

# Using public PostgreSQL DB as an example: https://rnacentral.org/help/public-database
jdbc_df = spark.read.jdbc(
    # Adjust this URL as per your server host, port, and database
    "jdbc:postgresql://hh-pgsql-public.ebi.ac.uk:5432/pfmegrnargs",
    "",  # Empty string for table name when providing a custom query
    properties={
        "user": "reader",                # Your PostgreSQL user name
        "password": "***",               # Your password for PostgreSQL
        "driver": "org.postgresql.Driver",
        # Use the "query" property for a custom SQL statement
        "query": """SELECT
  upi,     -- RNAcentral URS identifier
  taxid,   -- NCBI taxid
  ac       -- external accession
FROM xref
WHERE ac IN ('OTTHUMT00000106564.1', 'OTTHUMT00000416802.1')"""
    },
)

jdbc_df.show()
```

### Sample Warehouse Notebook code (write to PostgreSQL)

This example shows the necessary Python code to initialize the session,
load the driver, and write data into PostgreSQL.

```python
from snowflake import snowpark_connect
from pyspark.sql import Row
import jpype

# Initialize the Spark session for Snowpark Connect
spark = snowpark_connect.server.init_spark_session()
df = spark.sql("show schemas").limit(2)
df.show()

# Add the uploaded JDBC driver JAR to the Java Classpath using jpype
# Adjust the path to match the name of the JAR file you uploaded
jpype.addClassPath('/tmp/appRoot/postgresql-42.7.8.jar')

# Create dataframe
jdbc_df = spark.createDataFrame(
    [
        Row(a=1, b=2.0, c="test1"),
        Row(a=2, b=3.0, c="test2"),
        Row(a=4, b=5.0, c="test3"),
    ]
)

# Using public PostgreSQL DB as an example: https://rnacentral.org/help/public-database
jdbc_df.write.jdbc(
    # Adjust this URL as per your server host, port, and database
    "jdbc:postgresql://hh-pgsql-public.ebi.ac.uk:5432/pfmegrnargs",
    "public.my_table2",  # Specify your table name in PostgreSQL
    mode="overwrite",
    properties={
        "user": "writer",                # Your PostgreSQL user name
        "password": "***",               # Your password for PostgreSQL
        "driver": "org.postgresql.Driver",
    },
)
```

## Part 3: Snowflake Workspace Notebook setup (PostgreSQL)

This setup is used when running Snowpark Connect directly within a
Snowflake Workspace Notebook environment.

### Setup steps

* The `snowpark-connect` package is included in Workspace Notebook by default.
* **Download and Upload JDBC Driver:**

  + Download the appropriate JDBC driver `.jar` file for your external
    database (for example, [PostgreSQL JDBC
    Driver](https://jdbc.postgresql.org/download/postgresql-42.7.8.jar)).
  + Upload the downloaded `.jar` file directly into your notebook
    environment.

* **Create External Integration:**

```sql
-- 1. Create a Network Rule for the external database host and port
CREATE OR REPLACE NETWORK RULE JDBC_READ_NETWORK_RULE
  MODE = EGRESS
  TYPE = HOST_PORT
  VALUE_LIST = ('hh-pgsql-public.ebi.ac.uk:5432'); -- REPLACE with your host:port

-- 2. Create the External Access Integration using the new Network Rule
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION JDBC_READ_ACCESS_INTEGRATION
  ALLOWED_NETWORK_RULES = (JDBC_READ_NETWORK_RULE)
  ENABLED = true;

-- NOTE: This integration must be referenced/activated within your notebook's settings.
```

* **Activate External Integrations (Network Rule & Integration):**

  + Snowflake requires an **External Access Integration** to allow the
    notebook to communicate with external network locations. You must
    define a **Network Rule** for the host and port of your external
    database.

### Sample Workspace Notebook code (read from PostgreSQL)

This example shows the necessary Python code to initialize the session,
load the driver, and read data from PostgreSQL.

```python
from snowflake import snowpark_connect
import jpype
import os

# Initialize the Spark session for Snowpark Connect
spark = snowpark_connect.server.init_spark_session()
df = spark.sql("show schemas").limit(2)
df.show()

# Add the uploaded JDBC driver JAR to the Java Classpath using jpype
# Adjust the path to match the name of the JAR file you uploaded
# Copy the driver to /tmp directory
os.system("cp ./postgresql-42.7.8.jar /tmp/postgresql-42.7.8.jar")
jpype.addClassPath('/tmp/postgresql-42.7.8.jar')

# Using public PostgreSQL DB as an example: https://rnacentral.org/help/public-database
jdbc_df = spark.read.jdbc(
    # Adjust this URL as per your server host, port, and database
    "jdbc:postgresql://hh-pgsql-public.ebi.ac.uk:5432/pfmegrnargs",
    "",  # Empty string for table name when providing a custom query
    properties={
        "user": "reader",                # Your PostgreSQL user name
        "password": "***",               # Your password for PostgreSQL
        "driver": "org.postgresql.Driver",
        # Use the "query" property for a custom SQL statement
        "query": """SELECT
  upi,     -- RNAcentral URS identifier
  taxid,   -- NCBI taxid
  ac       -- external accession
FROM xref
WHERE ac IN ('OTTHUMT00000106564.1', 'OTTHUMT00000416802.1')"""
    },
)

jdbc_df.show()
```

### Sample Workspace Notebook code (write to PostgreSQL)

This example shows the necessary Python code to initialize the session,
load the driver, and write data into PostgreSQL.

```python
from snowflake import snowpark_connect
from pyspark.sql import Row
import jpype
import os

# Initialize the Spark session for Snowpark Connect
spark = snowpark_connect.server.init_spark_session()
df = spark.sql("show schemas").limit(2)
df.show()

# Add the uploaded JDBC driver JAR to the Java Classpath using jpype
# Adjust the path to match the name of the JAR file you uploaded
# Copy the driver to /tmp directory
os.system("cp ./postgresql-42.7.8.jar /tmp/postgresql-42.7.8.jar")
jpype.addClassPath('/tmp/postgresql-42.7.8.jar')

# Create dataframe
jdbc_df = spark.createDataFrame(
    [
        Row(a=1, b=2.0, c="test1"),
        Row(a=2, b=3.0, c="test2"),
        Row(a=4, b=5.0, c="test3"),
    ]
)

# Using public PostgreSQL DB as an example: https://rnacentral.org/help/public-database
jdbc_df.write.jdbc(
    # Adjust this URL as per your server host, port, and database
    "jdbc:postgresql://hh-pgsql-public.ebi.ac.uk:5432/pfmegrnargs",
    "public.my_table2",  # Specify your table name in PostgreSQL
    mode="overwrite",
    properties={
        "user": "writer",                # Your PostgreSQL user name
        "password": "***",               # Your password for PostgreSQL
        "driver": "org.postgresql.Driver",
    },
)
```

## Supported datasources

* SQL Server
* MySQL
* PostgreSQL

---
title: JDBC Driver
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc.md
section: Developer Guide
---

# JDBC Driver

> **Note:**
>
> Version 3.21.0 introduced support for Google Cloud Storage regional endpoints.
>
> Earlier versions of the driver do not support Google Cloud Storage regional endpoints. Please ensure that any workloads that use this driver do not require support for regional endpoints on Google Cloud. If you have questions about this, please contact Snowflake Support.

Snowflake provides a JDBC type 4 driver that supports core JDBC functionality. The JDBC driver must be installed in a
64-bit environment and requires Java LTS (Long-Term Support) versions 1.8 or higher.

The driver can be used with most client tools/applications that support JDBC for connecting to a database server. [sfsql](../../user-guide/sfsql.md), the now-deprecated command-line client provided by
Snowflake, is an example of a JDBC-based application.

**Next Topics:**

* [Java requirements for the JDBC Driver](java-install.md)
* [Downloading / integrating the JDBC Driver](jdbc-download.md)
* [Configuring the JDBC Driver](jdbc-configure.md)
* [Using the JDBC Driver](jdbc-using.md)
* [JDBC Driver diagnostic service](jdbc-diagnostic-service.md)
* [JDBC Driver connection parameter reference](jdbc-parameters.md)
* [JDBC Driver API support](jdbc-api.md)

---
title: JDBC Driver API support
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-api.md
section: Developer Guide
---

# JDBC Driver API support

The Snowflake JDBC driver is a JDBC type 4 driver that supports the core JDBC functionality in version 1.0 of the JDBC API. You are welcome to try methods from later
versions of the API, but Snowflake does not guarantee that these methods are supported.

For the complete API reference, see the [Java SE Technologies documentation](http://www.oracle.com/technetwork/java/javase/jdbc/index.html).

The Snowflake JDBC driver requires Java LTS (Long-Term Support) versions 1.8 or higher. The driver requires the `java.sql` package, which is included in the Standard Edition (SE) and the Enterprise Edition (EE)
of Java.

As of August, 2019, the `java.sql` package documentation is available at <https://docs.oracle.com/javase/8/docs/api/java/sql/package-summary.html>

The driver can be used with most client tools and applications that support JDBC for connecting to a database server.

This topic does not document the entire JDBC API. Instead, the topic:

* Lists the supported interfaces from the JDBC API and the supported methods within each interface.
* Documents areas where Snowflake extends the JDBC API standard.
* Documents areas where the JDBC API standard is ambiguous and the Snowflake implementation might behave differently
  from other systems.

In general, if a method is called and fails, the method will raise an exception (e.g. `SQLException`).

The supported JDBC interfaces are listed alphabetically and paired with their corresponding Snowflake extension classes (where applicable).

## Object: `CallableStatement`

A CallableStatement is used to execute a stored procedure.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `getBigDecimal(int, int)` |  |
| `getBoolean(int)` |  |
| `getByte(int)` |  |
| `getBytes(int)` |  |
| `getDate(int)` |  |
| `getDouble(int)` |  |
| `getFloat(int)` |  |
| `getInt(int)` |  |
| `getLong(int)` |  |
| `getObject(int)` |  |
| `getShort(int)` |  |
| `getString(int)` |  |
| `getTime(int)` |  |
| `getTimestamp(int)` |  |
| `registerOutParameter(int, int, int)` |  |
| `registerOutParameter(int, int)` |  |
| `wasNull()` |  |
|  |  |
| **Unsupported Methods** |  |
| None. |  |

### Snowflake-specific behavior

None.

## Interface: `SnowflakeCallableStatement`

The SnowflakeCallableStatement interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type CallableStatement, for example by calling the
Connection.prepareCall() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC CallableStatement interface and the
SnowflakeCallableStatement interface. To access the SnowflakeCallableStatement methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

| Method Name | Description |
| --- | --- |
| `getQueryID()` | Returns the Snowflake query ID of the most recently executed query of this `CallableStatement` |

getQueryID()
:   Purpose:
    :   This method returns the Snowflake query ID of the most recently executed query of this `CallableStatement`. If no
        query has been executed yet with the callable statement, the method returns null.

    Arguments:
    :   None.

    Returns:
    :   This method returns the ID as a String that contains a UUID.
        Information about UUIDs is included in the description of the SQL function
        [UUID_STRING](../../sql-reference/functions/uuid_string.md).

    Throws:
    :   The method can throw `SQLException`.

## Object: `Connection`

A `Connection` object represents a connection to a database server. The connection object allows users not only
to connect to a particular database server, but also create `Statement` objects, which can be used to
execute SQL statements.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `abort()` |  |
| `clearWarnings()` |  |
| `close()` | Snowflake-specific behavior (see below). |
| `commit()` |  |
| `createStatement()` |  |
| `createStatement(int, int)` |  |
| `createStatement(int, int, int)` |  |
| `getAutoCommit()` |  |
| `getCatalog()` |  |
| `getClientInfo()` |  |
| `getHoldability()` |  |
| `getMetaData()` | Snowflake-specific behavior (see below). |
| `getSchema()` |  |
| `getTransactionIsolation()` |  |
| `getTypeMap()` |  |
| `getWarnings()` |  |
| `isClosed()` |  |
| `isReadOnly()` |  |
| `isValid()` |  |
| `nativeSQL(String)` |  |
| `prepareCall(String)` |  |
| `prepareCall(String, boolean)` |  |
| `prepareCall(String, int, int)` |  |
| `prepareCall(String, int, int, int)` |  |
| `prepareStatement(String)` |  |
| `prepareStatement(String, int)` |  |
| `prepareStatement(String, int[])` |  |
| `prepareStatement(String, String[])` |  |
| `prepareStatement(String, int, int)` |  |
| `prepareStatement(String, int, int, int)` | Snowflake-specific behavior (see below) |
| `prepareStatement(String, boolean)` |  |
| `setAutoCommit(boolean)` |  |
| `setCatalog(String)` |  |
| `setClientInfo(String, String)` | Calling this method causes a `SQLClientInfoException`. |
| `setClientInfo(Properties)` | Calling this method causes a `SQLClientInfoException`. |
| `setReadOnly(boolean)` |  |
| `setSchema(String)` |  |
| **Unsupported Methods** |  |
| `rollback()` |  |
| `setTransactionIsolation(int)` |  |

### Snowflake-specific behavior

* `close()`

  > Closes the object. After an object has been closed, calling almost any method of the closed object will raise a `SQLException`. Calling
  > `close` on an already closed object is harmless and will not raise an exception.

* `getMetaData()`

  > Lets you get metadata about the JDBC driver and Snowflake. For example, you can find out whether transactions are supported.
  >
  > For more information about the methods that you can call on the returned value,
  > see Object: DatabaseMetaData.
* `prepareStatement(String sql)`

  > This method returns a `preparedStatement` object that can be used to execute the SQL statement.
  > The `preparedStatement` object’s `execute()` method can be called to
  > execute the statement. The statement can be executed as-is, or after binding values to the statement.
  >
  > > **Note:**
  > >
  > > In some systems, after a statement has been prepared, that statement can be executed repeatedly without re-compiling the statement.
  > > Preparing once and executing repeatedly can save a small amount of time and resources.
  > >
  > > In Snowflake, prepareStatement() does not actually compile the code. Instead,
  > > `PreparedStatement.execute()`, `PreparedStatement.executeQuery()`,
  > > and `PreparedStatement.executeUpdate()` compile and execute the statement.
  > > Therefore preparing the statement before execution does not save resources
  > > compared to simply using `Statement.execute()`.
* `prepareCall(String sql)`
* `prepareCall(String sql, boolean)`
* `prepareCall(String sql, int, int)`
* `prepareCall(String sql, int, int, int)`

  > As in most JDBC implementations, the `prepareCall` methods can be used to bind parameters to a stored procedure. For example, the
  > following is supported:
  >
  > > ```java
  > > CallableStatement stmt = testConnection.prepareCall("call read_result_set(?,?) ");
  > > ```
  >
  > However, in the Snowflake JDBC Driver, the `prepareCall` methods do not support the `? =` syntax to support binding the return value of a stored procedure.
  > For example, the following is not supported:
  >
  > > ```java
  > > CallableStatement stmt = testConnection.prepareCall("? = call read_result_set() ");  -- INVALID
  > > ```

## Interface: `SnowflakeConnection`

The SnowflakeConnection interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type Connection, for example by calling the
DriverManager.getConnection() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC Connection interface and the
SnowflakeConnection interface. To access the SnowflakeConnection methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

These methods are in addition to the methods supported by the JDBC `Connection` interface.

| Method Name | Description |
| --- | --- |
| `createResultSet(String)` | Given the query ID of an asynchronously-launched SQL statement, retrieves the query results and returns them in a ResultSet object. |
| `downloadStream(String, String, boolean)` | Downloads a file from the given internal stage and returns an InputStream. |
| `getSessionID()` | Gets the session ID of the current session. |
| `prepareStatement(String, Boolean)` | Overloaded `prepareStatement()` method (see below for details). |
| `uploadStream(String, String, InputStream, String, boolean)` | Compresses data from a stream and uploads it to the specified path and file name in an internal stage. |

public ResultSet createResultSet(String queryID)
:   Purpose:
    :   Given the queryID of an [asynchronously-launched SQL statement](jdbc-using.md), retrieve the
        query results and return them in a ResultSet object.

        This method can typically be called up to 24 hours after the SQL statement finished.

    Arguments:
    :   queryID: The query ID of the query for which you want the results.

    Returns:
    :   The ResultSet. If the query has not yet finished running, the server returns an “empty” ResultSet. The user
        can call `resultSet.unwrap(SnowflakeResultSet.class).getStatus()` to find out when the data is available.

    Throws:
    :   This method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the Connection object.

    Examples:
    :   ```java
        ResultSet resultSet;
        resultSet = connection.unwrap(SnowflakeConnection.class).createResultSet(queryID);
        ```

        See [Examples of asynchronous queries](jdbc-using.md) for a more extensive example that includes a call to this
        method.

public InputStream downloadStream(String stageName, String sourceFileName, boolean decompress)
:   Purpose:
    :   This method downloads a file from the given internal stage and returns an input stream.

    Arguments:
    :   stageName: Stage name.

        sourceFileName: File path in stage.

        decompress: True if file compressed.

    Returns:
    :   This method returns an InputStream.

    Throws:
    :   This method throws SQLException if a SQL error occurs.

    Examples:
    :   For a partial example, see [Download data files directly from an internal stage to a stream](jdbc-using.md).

public String getSessionID()
:   Purpose:
    :   This method returns the session ID of the current session.

    Arguments:
    :   None

    Returns:
    :   Returns the session ID as a String.

    Throws:
    :   This method throws SQLException if any SQL error occurs, for example if the connection is closed.

    Usage Notes:
    :   Since the session ID does not change while the connection is open, the session ID is cached locally (rather
        than retrieved from the server each time) to improve performance.

public prepareStatement(String sql, Boolean skipParsing)
:   This method is deprecated. The skipParsing parameter no longer affects the behavior of the method; this
    method behaves the same as the `prepareStatement(String sql)` method, regardless of the setting of the
    skipParsing parameter.

    New code should use the method `prepareStatement(String sql)`.

    When convenient, existing code that uses the two-argument version of this method should be updated to use
    the one-argument method `prepareStatement(String sql)`.

public void uploadStream(String stageName, String destPrefix, InputStream inputStream, String destFileName, boolean compressData)
:   Purpose:
    :   This method compresses data from a stream and uploads it at an internal stage location.
        The data will be uploaded as one file. No splitting is done in this method.

    Arguments:
    :   stageName: Stage name (e.g. `~` or table name or stage name).

        destPrefix: Path / prefix under which the data should be uploaded on the stage.

        inputStream: Input stream from which the data will be uploaded.

        destFileName: Destination file name to use.

        compressData: Compress data or not before uploading stream.

    Returns:
    :   Nothing.

    Throws:
    :   This method throws a `java.sql.SQLException` if it failed to compress and put data from a stream at the stage.

    Notes:
    :   The caller is responsible for releasing the `inputStream` after the method is called.

    Examples:
    :   For a partial example, see [Upload data files directly from a stream to an internal stage](jdbc-using.md).

## Object: `DatabaseMetaData`

The DatabaseMetaData class provides information about the features that the database server (in this case,
Snowflake) supports.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `allProceduresAreCallable()` |  |
| `allTablesAreSelectable()` |  |
| `dataDefinitionCausesTransactionCommit()` |  |
| `dataDefinitionIgnoredInTransactions()` |  |
| `doesMaxRowSizeIncludeBlobs()` |  |
| `getCatalogs()` |  |
| `getCatalogSeparator()` |  |
| `getCatalogTerm()` |  |
| `getColumnPrivileges(String, String, String, String)` | Supports wildcards for the `columnNamePattern` argument. Supports null for the `catalog`, `schemaPattern`, `tableNamePattern`, and `columnNamePattern` arguments. See Snowflake-specific behavior for additional information about this method. |
| `getColumns(String, String, String, String)` | Supports wildcards for the `schemaPattern`, `tableNamePattern`, and `columnNamePattern` arguments. Supports null for the `catalog`, `schemaPattern`, `tableNamePattern`, and `columnNamePattern` arguments. |
| `getCrossReference(String, String, String, String, String, String)` | Supports null for the `parentCatalog`, `parentSchema`, `parentTable`, `foreignCatalog`, `foreignSchema`, and `foreignTable` arguments. |
| `getDatabaseProductName()` |  |
| `getDatabaseProductVersion()` |  |
| `getDefaultTransactionIsolation()` |  |
| `getDriverMajorVersion()` |  |
| `getDriverMinorVersion()` |  |
| `getDriverName()` |  |
| `getDriverVersion()` |  |
| `getExportedKeys(String, String, String)` | Supports null for the `catalog`, `schema`, and `table` arguments. |
| `getExtraNameCharacters()` |  |
| `getFunctionColumns()` | Supports wildcards for the `schemaPattern`, `functionNamePattern`, and `columnNamePattern` arguments. Supports null for the `columnNamePattern` argument. |
| `getFunctions(String, String, String)` | Supports wildcards for the `schemaPattern` and `functionNamePattern` arguments. Supports null for the `schemaPattern` and `functionNamePattern` arguments. |
| `getIdentifierQuoteString()` |  |
| `getImportedKeys(String, String, String)` | Supports null for the `catalog`, `schema`, and `table` arguments. |
| `getIndexInfo(String, String, String, boolean, boolean)` |  |
| `getMaxBinaryLiteralLength()` |  |
| `getMaxCatalogNameLength()` |  |
| `getMaxCharLiteralLength()` |  |
| `getMaxColumnNameLength()` |  |
| `getMaxColumnsInGroupBy()` |  |
| `getMaxColumnsInIndex()` |  |
| `getMaxColumnsInOrderBy()` |  |
| `getMaxColumnsInSelect()` |  |
| `getMaxColumnsInTable()` |  |
| `getMaxConnections()` |  |
| `getMaxCursorNameLength()` |  |
| `getMaxIndexLength()` |  |
| `getMaxProcedureNameLength()` |  |
| `getMaxRowSize()` |  |
| `getMaxSchemaNameLength()` |  |
| `getMaxStatementLength()` |  |
| `getMaxStatements()` |  |
| `getMaxTableNameLength()` |  |
| `getMaxTablesInSelect()` |  |
| `getMaxUserNameLength()` |  |
| `getNumericFunctions()` |  |
| `getPrimaryKeys(String, String, String)` | Supports null for the `catalog`, `schema`, and `table` arguments. |
| `getProcedureColumns(String, String, String, String)` | Supports wildcards for the `schemaPattern`, `procedureNamePattern`, and `columnNamePattern` arguments. Supports null for the `columnNamePattern` argument. |
| `getProcedures(String, String, String)` | Supports wildcards for the `schemaPattern` and `procedureNamePattern` arguments. Supports null for the `columnNamePattern` argument. |
| `getProcedureTerm()` |  |
| `getSchemas()` |  |
| `getSchemas(String, String)` | Supports wildcards for the `schemaPattern` argument. Supports null for the `catalogName` and `schemaPattern` arguments. |
| `getSchemaTerm()` |  |
| `getSearchStringEscape()` |  |
| `getSQLKeywords()` |  |
| `getSQLStateType()` |  |
| `getStreams(String, String, String)` | Supports wildcards for the `originalSchemaPattern` and `streamName` arguments. Supports null for the `originalCatalog`, `originalSchemaPattern`, and `streamName` arguments. See Snowflake-specific behavior for additional information about this method. |
| `getStringFunctions()` |  |
| `getSystemFunctions()` |  |
| `getTablePrivileges(String, String, String)` | Supports wildcards for the `schemaPattern` and `tableNamePattern` arguments. Supports null for the `catalog` and `schemaPattern` arguments. |
| `getTables(String, String, String, String[])` | Supports wildcards for the `schemaPattern` and `tableNamePattern` arguments. Supports null for the `catalog`, `schemaPattern`, `tableNamePattern`, and `types` arguments. |
| `getTableTypes()` |  |
| `getTimeDateFunctions()` |  |
| `getTypeInfo()` |  |
| `getURL()` |  |
| `getUserName()` |  |
| `isCatalogAtStart()` |  |
| `isReadOnly()` |  |
| `nullPlusNonNullIsNull()` |  |
| `nullsAreSortedAtEnd()` |  |
| `nullsAreSortedAtStart()` |  |
| `nullsAreSortedHigh()` |  |
| `nullsAreSortedLow()` |  |
| `storesLowerCaseIdentifiers()` |  |
| `storesLowerCaseQuotedIdentifiers()` |  |
| `storesMixedCaseIdentifiers()` |  |
| `storesMixedCaseQuotedIdentifiers()` |  |
| `storesUpperCaseIdentifiers()` |  |
| `storesUpperCaseQuotedIdentifiers()` |  |
| `supportsAlterTableWithAddColumn()` |  |
| `supportsAlterTableWithDropColumn()` |  |
| `supportsANSI92EntryLevelSQL()` |  |
| `supportsANSI92FullSQL()` |  |
| `supportsANSI92IntermediateSQL()` |  |
| `supportsCatalogsInDataManipulation()` |  |
| `supportsCatalogsInIndexDefinitions()` |  |
| `supportsCatalogsInPrivilegeDefinitions()` |  |
| `supportsCatalogsInProcedureCalls()` |  |
| `supportsCatalogsInTableDefinitions()` |  |
| `supportsColumnAliasing()` |  |
| `supportsConvert()` |  |
| `supportsConvert(int, int)` |  |
| `supportsCoreSQLGrammar()` |  |
| `supportsCorrelatedSubqueries()` |  |
| `supportsDataDefinitionAndDataManipulationTransactions()` |  |
| `supportsDataManipulationTransactionsOnly()` |  |
| `supportsDifferentTableCorrelationNames()` |  |
| `supportsExpressionsInOrderBy()` |  |
| `supportsExtendedSQLGrammar()` |  |
| `supportsFullOuterJoins()` |  |
| `supportsGroupBy()` |  |
| `supportsGroupByBeyondSelect()` |  |
| `supportsGroupByUnrelated()` |  |
| `supportsIntegrityEnhancementFacility()` |  |
| `supportsLikeEscapeClause()` |  |
| `supportsLimitedOuterJoins()` |  |
| `supportsMinimumSQLGrammar()` |  |
| `supportsMixedCaseIdentifiers()` |  |
| `supportsMixedCaseQuotedIdentifiers()` |  |
| `supportsMultipleResultSets()` |  |
| `supportsMultipleTransactions()` |  |
| `supportsNonNullableColumns()` |  |
| `supportsOpenCursorsAcrossCommit()` |  |
| `supportsOpenCursorsAcrossRollback()` |  |
| `supportsOpenStatementsAcrossCommit()` |  |
| `supportsOpenStatementsAcrossRollback()` |  |
| `supportsOrderByUnrelated()` |  |
| `supportsOuterJoins()` |  |
| `supportsPositionedDelete()` |  |
| `supportsPositionedUpdate()` |  |
| `supportsSchemasInDataManipulation()` |  |
| `supportsSchemasInIndexDefinitions()` |  |
| `supportsSchemasInPrivilegeDefinitions()` |  |
| `supportsSchemasInProcedureCalls()` |  |
| `supportsSchemasInTableDefinitions()` |  |
| `supportsSelectForUpdate()` |  |
| `supportsStoredProcedures()` |  |
| `supportsSubqueriesInComparisons()` |  |
| `supportsSubqueriesInExists()` |  |
| `supportsSubqueriesInIns()` |  |
| `supportsSubqueriesInQuantifieds()` |  |
| `supportsTableCorrelationNames()` |  |
| `supportsTransactionIsolationLevel(int)` |  |
| `supportsTransactions()` |  |
| `supportsUnion()` |  |
| `supportsUnionAll()` |  |
| `usesLocalFilePerTable()` |  |
| `usesLocalFiles()` |  |
| **Unsupported Methods** |  |
| `getBestRowIdentifier(String, String, String, int, boolean)` |  |
| `getVersionColumns(String, String, String)` |  |

### Snowflake-specific behavior

public ResultSet getColumnPrivileges(String, String, String, String)
:   This method always returns an empty set because Snowflake does not support column-level privileges.

public ResultSet getStreams(String, String, String)
:   Purpose:
    :   This method returns information about [streams](../../user-guide/streams-intro.md) contained within specified databases and schemas.

    Arguments:
    :   * `originalCatalog`: Name of the database.
        * `originalSchemaPattern`: Pattern to identify the schema (supports wildcards).
        * `streamName`: Name of the stream (supports wildcards).

    Returns:
    :   This method returns a `ResultSet` containing rows for each stream, with each row including the following columns:

        * `name`: Name of the stream.
        * `database_name`: Name of the database for the schema containing the stream.

          A database object (e.g. a stream) is contained in a schema, which in turn is contained in a database.
        * `schema_name`: Name of the schema containing the stream.
        * `owner`: Role that owns the stream.
        * `comment`: Comments associated with the stream.
        * `table_name`: Name of the table whose DML updates are tracked by the stream.
        * `source_type`: Source object for the stream. Possible values include:

          + `table`
          + `view`
          + `directory table`
          + `external table`
        * `base_tables`: Underlying tables for the view. This column applies only to streams on views.
        * `type`: Type of the stream. Currently, the function always returns `DELTA`.
        * `stale`: Whether the stream was last read before the `stale_after` time passed. If `TRUE`, the stream might be stale.

          When a stream is stale, it cannot be read. You can recreate the stream to resume reading from it. To prevent
          a stream from become stale, you should consume the stream before the `stale_after` time has passed.
        * `mode`: Type of stream. Possible values include:

          + `APPEND_ONLY`: Indicates the stream is an append-only stream.
          + `INSERT_ONLY`: Indicates the stream only returns information for inserted rows. This value applies only to streams on external tables.
          + `DEFAULT`: Indicates the stream is on tables.

    Throws:
    :   This method throws an `SQLException` if a SQL error occurs.

### Support for null parameters

Some DatabaseMetaData methods accept `null` values for database object names (e.g. table/catalog names).
By default, a `null` value means that the method does not filter on that argument. For example, if you pass
`getColumns()` a `null` value for the `schemaPattern` argument, then `getColumns()` returns values
for all schemas.

For some of those methods, the default behavior for `null` arguments can be overridden with the following
[parameters](../../sql-reference/parameters.md):

* [CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX](../../sql-reference/parameters.md).
* [CLIENT_METADATA_USE_SESSION_DATABASE](../../sql-reference/parameters.md).

### Support for wildcards in database object names

Some DatabaseMetaData methods support pattern-matching wildcards in database object names, such as table/catalog
names. The supported wildcard characters are:

* `%`: Matches any string of zero or more characters.
* `_`: Matches any one character.

The following example shows what to pass to the `getColumns()` method to get the names of all tables and all
columns in the specified database (`TEMPORARYDB1`) and schema (`TEMPORARYSCHEMA1`):

```java
getColumns( connection,
    "TEMPORARYDB1",      // Database name.
    "TEMPORARYSCHEMA1",  // Schema name.
    "%",                 // All table names.
    "%"                  // All column names.
    );
```

It is common for database object names, such as table names, to contain underscores, for example
`SHIPPING_ADDRESSES`. Searching for `SHIPPING_ADDRESSES` without escaping the underscore will of course
find not only the table named `SHIPPING_ADDRESSES`, but also tables such as `SHIPPING2ADDRESSES`. If
you want to search for `SHIPPING_ADDRESSES`, but not `SHIPPING2ADDRESSES`, then you need to escape the
wildcard character to indicate that you want it treated as a literal. To escape the character, precede it with a
backslash.

The backslash character itself must also be escaped if you want to use it as a literal character. For example, to
search for a table named `T_&`, in which the underscore, the ampersand, and the backslash are literal parts
of the name, not wildcard characters or escape characters, the method call should look similar to the following:

```none
getColumns(
    connection, "TEMPORARYDB1", "TEMPORARYSCHEMA1", "T\_\\\\", "%" // All column names.
    );
```

Each backslash above must be escaped an extra time because the Java compiler expects backslashes to be escaped:

```none
Java sees...............: T\_\\%\\\\
SQL sees................: T_\%\\
The actual table name is: T_%\
```

## Object: `Driver`

The Driver provides methods that allow you to get a connection to the database, as well as get information
about the driver itself.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `acceptsURL(String)` |  |
| `connect(String, Properties)` |  |
| `getMajorVersion()` |  |
| `getMinorVersion()` |  |
| `getPropertyInfo(String, Properties)` |  |
| `isDisableIncidents()` |  |
| `jdbcCompliant()` |  |
| `setDisableIncidents()` |  |

### Snowflake-specific behavior

None.

### Examples

The following code snippet shows part of a program to get property information:

```java
  // Demonstrate the Driver.getPropertyInfo() method.
  public static void do_the_real_work(String connectionString) throws Exception {
    Properties properties = new Properties();
    Driver driver = DriverManager.getDriver(connectionString);
    DriverPropertyInfo[] dpinfo = driver.getPropertyInfo("", properties);
    System.out.println(dpinfo[0].description);
  }
```

Note that in the general case, the call to this method should be inside a loop. If you retrieve information about
a property and then set that property, the new setting might make additional properties relevant, so you might
need to retrieve those and set them.

## Object: `ParameterMetaData`

This provides information about parameters in a PreparedStatement.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `getParameterCount()` |  |
| `getParameterType(int)` |  |
| `getParameterTypeName(int)` |  |
| `getPrecision(int)` |  |
| `getScale(int)` |  |
| `isNullable` |  |
|  |  |
| **Unsupported Methods** |  |
| `getParameterClassName(int)` |  |
| `getParameterMode()` |  |
| `isSigned` |  |

### Snowflake-specific behavior

None.

## Object: `PreparedStatement`

The PreparedStatement interface describes methods that, for example, allow you to execute queries.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `addBatch()` | Snowflake-specific behavior (see below for details). |
| `clearParameters()` |  |
| `getParameterMetaData()` |  |
| `execute()` | Snowflake-specific behavior (see below for details). |
| `executeBatch(String)` |  |
| `executeLargeBatch(String)` |  |
| `executeLargeUpdate(String)` |  |
| `executeQuery()` | Snowflake-specific behavior (see below for details). |
| `executeUpdate()` | Snowflake-specific behavior (see below for details). |
| `setBigDecimal(int, BigDecimal)` |  |
| `setBoolean(int, boolean)` |  |
| `setByte(int, byte)` |  |
| `setBytes(int, byte[])` |  |
| `setDate(int, Date)` |  |
| `setDouble(int, double)` |  |
| `setFloat(int, float)` |  |
| `setInt(int, int)` |  |
| `setLong(int, long)` |  |
| `setNull(int, int)` |  |
| `setObject(int, Object, int, int)` | Snowflake-specific behavior (see below for details). |
| `setObject(int, Object, int)` | Snowflake-specific behavior (see below for details). |
| `setObject(int, Object)` |  |
| `setShort(int, short)` |  |
| `setString(int, String)` |  |
| `setTime(int, Time)` |  |
| `setTimestamp(int, Timestamp)` |  |
| **Unsupported Methods** |  |
| `setAsciiStream(int, InputStream, int)` |  |
| `setBinaryStream(int, InputStream, int)` |  |
| `setUnicodeStream(int, InputStream, int)` |  |

### Snowflake-specific behavior

* `addBatch()`

  > Supported for INSERT statements only.
  >
  > The `addBatch` method (combined with `executeBatch`) allows multiple rows of data to be inserted as part of a single INSERT
  > statement.
  >
  > The difference between using a batch and not using a batch is similar to the difference between using a multi-row insert and a single-row
  > insert:
  >
  > > ```sqlexample
  > > INSERT INTO t1 (c1, c2) VALUES (1, 'One');   -- single row inserted.
  > >
  > > INSERT INTO t1 (c1, c2) VALUES (1, 'One'),   -- multiple rows inserted.
  > >                                (2, 'Two'),
  > >                                (3, 'Three');
  > > ```
  >
  > Inserting batches of rows is usually more efficient than inserting the same number of rows in individual `INSERT` statements. The
  > advantage is even greater when using AUTOCOMMIT (i.e. when each INSERT is an individual transaction).
  >
  > For an example of using `addBatch`, see [Batch inserts](jdbc-using.md).
  >
  > > **Note:**
  > >
  > > There is an upper limit to the size of data that you can bind, or that you can combine in a batch. For details, see [Limits on Query Text Size](../../user-guide/query-size-limits.md).
* `execute()`

  > This method compiles and executes the SQL statement that was provided when the `PreparedStatement` object was created. The statement can be any
  > type of SQL statement. The `execute()` method does not return a `ResultSet`.
  >
  > This method does not return anything. If you are executing a query and need to get a `ResultSet` back when the statement executes,
  > then use the `executeQuery()` method.
* `executeQuery()`

  > This method compiles and executes the SQL statement that was provided when the `PreparedStatement` object was created, and returns a `ResultSet`.
* `executeUpdate()`

  > This method compiles and executes the SQL statement that was provided when the `PreparedStatement` object was created. The statement must be a DML
  > statement (INSERT, UPDATE, DELETE, etc.) or a SQL statement that does not return anything (e.g. a DDL statement).
  >
  > The `executeUpdate()` method returns an integer, which is the number of rows updated if the statement was a DML statement. If the
  > statement did not update any rows, the function returns `0`.
  >
  > If you need to execute a SQL statement that returns a ResultSet, then use a different method, such as executeQuery().
* `setObject()`

  When you bind a timestamp variable to a timestamp column, you can use this method to specify the timestamp variation
  ([TIMESTAMP_LTZ , TIMESTAMP_NTZ , TIMESTAMP_TZ](../../sql-reference/data-types-datetime.md)) that should be used to interpret the timestamp value. For details, see
  [Binding variables to timestamp columns](jdbc-using.md).

## Interface: `SnowflakePreparedStatement`

The SnowflakePreparedStatement interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type PreparedStatement, for example by calling the
Connection.prepareStatement() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC PreparedStatement interface and the
SnowflakePreparedStatement interface. To access the SnowflakePreparedStatement methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

The methods below are in addition to the methods supported by the `PreparedStatement` interface.

| Method Name | Description |
| --- | --- |
| `executeAsyncQuery()` | Performs an asynchronous query. |
| `getQueryID()` | Returns the Snowflake query ID of the most recently executed query of this `SnowflakePreparedStatement`. |

executeAsyncQuery()
:   Purpose:
    :   This method performs an [asynchronous query](jdbc-using.md), which involves submitting an SQL
        statement for execution, then returning control to the caller without waiting for the query to finish.

        Any SQL statement that is valid for `executeQuery()` is also valid for `executeAsyncQuery()`.

        > **Note:**
        >
        > File transfer statements, such as PUT and GET, are valid for `executeAsyncQuery()`, but behave synchronously.

    Arguments:
    :   None.

    Returns:
    :   An “empty” ResultSet. The user should poll the result set by calling
        `resultSet.unwrap(SnowflakeResultSet.class).getStatus()` until the query results become available.

    Throws:
    :   This method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the PreparedStatement object.

    Examples:
    :   ```java
        ...
        PreparedStatement prepStatement = connection.prepareStatement("insert into testTable values (?)");
        prepStatement.setInt(1, 33);
        ResultSet rs = prepStatement.executeAsyncQuery();
        ...
        ```

        See [Examples of asynchronous queries](jdbc-using.md) for a more extensive example using the very similar
        `SnowflakeStatement.executeAsyncQuery()` method.

getQueryID()
:   Purpose:
    :   This method returns the Snowflake query ID of the most recently executed query of this `SnowflakePreparedStatement`. If no query has been
        executed yet with this prepared statement, the method returns null.

    Arguments:
    :   None.

    Returns:
    :   The method returns the ID as a String that contains a UUID.

    Throws:
    :   The method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the
        `SnowflakePreparedStatement`.

        For [asynchronous queries](jdbc-using.md), the query ID does not become available until the
        execution of the statement completes. If you call `SnowflakePreparedStatement.getQueryID()` after calling
        `executeAsyncQuery()` but before the statement finishes executing, the return value could be NULL.
        Instead, call `resultSet.unwrap(SnowflakeResultSet.class).getQueryID()` on the `ResultSet` object
        returned by `executeAsyncQuery()`.

    Examples:
    :   This partial example shows how to call the method:

        ```javascript
            // Retrieve the query ID from the PreparedStatement.
            String queryID;
            queryID = preparedStatement.unwrap(SnowflakePreparedStatement.class).getQueryID();
        ```

## Enum: `QueryStatus`

The enum type is a Snowflake-specific type that:

* Defines the constants that represent the status of [an asynchronous query](jdbc-using.md).
* Defines methods that return details about any errors that occurred when executing SQL statements.

This enum type is in the `net.snowflake.client.core` package.

### Enum constants

Each enum constant represents a different possible status for the asynchronous query.

| Enum Constant | Description |
| --- | --- |
| RUNNING | The query is still running. |
| ABORTING | The query is in the process of being aborted on the server side. |
| SUCCESS | The query finished successfully. |
| FAILED_WITH_ERROR | The query finished unsuccessfully. |
| QUEUED | The query is queued for execution (i.e. has not yet started running), typically because it is waiting for resources. |
| DISCONNECTED | The session’s connection is broken. The query’s state will change to “FAILED_WITH_ERROR” soon. |
| RESUMING_WAREHOUSE | The warehouse is starting up and the query is not yet running. |
| BLOCKED | The statement is waiting on a lock held by another statement. |
| NO_DATA | Data about the statement is not yet available, typically because the statement has not yet started executing. |

### Methods

The enum type defines the following methods, which you can use to get details about an error when the query status is
`FAILED_WITH_ERROR`.

| Method Name | Description |
| --- | --- |
| `getErrorCode()` | Returns the error code from the server if an error occurred during query execution. |
| `getErrorMessage()` | Returns the error message from the server if an error occurred during query execution. |

getErrorCode()
:   Purpose:
    :   If an error occurred during the execution of the query, this method returns the error code from the server.

    Arguments:
    :   None.

    Returns:
    :   The method returns the error code as an `int`. If no error occurred, the method returns the value `0`.

    Examples:
    :   ```java
        QueryStatus queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
        if (queryStatus == queryStatus.FAILED_WITH_ERROR) {
          // Print the error code to stdout
          System.out.format("Error code: %d%n", queryStatus.getErrorCode());
        }
        ```

        See [Examples of asynchronous queries](jdbc-using.md) for a more extensive example that includes a call to this
        method.

getErrorMessage()
:   Purpose:
    :   If an error occurred during the execution of the query, this method returns the error message from the server.

    Arguments:
    :   None.

    Returns:
    :   The method returns the error message as a `String`. If no error occurred, the method returns the value
        `No error reported`.

    Examples:
    :   ```java
        QueryStatus queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
        if (queryStatus == queryStatus.FAILED_WITH_ERROR) {
          // Print the error message to stdout
          System.out.format("Error message: %s%n", queryStatus.getErrorMessage());
        }
        ```

        See [Examples of asynchronous queries](jdbc-using.md) for a more extensive example that includes a call to this
        method.

## Object: `ResultSet`

The ResultSet interface documents methods that retrieve the results of queries, for example to read the rows and
columns returned by a SELECT statement.

A Snowflake ResultSet is a read-only object; it is not updatable.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `close()` | Snowflake-specific behavior (see below for details). |
| `findColumn(String)` |  |
| `getBigDecimal(int, int)` |  |
| `getBigDecimal(String, int)` |  |
| `getBoolean(int)` |  |
| `getBoolean(String)` |  |
| `getByte(int)` |  |
| `getByte(String)` |  |
| `getBytes(int)` |  |
| `getBytes(String)` |  |
| `getDate(int)` | Snowflake-specific behavior (see below for details). |
| `getDate(int, Calendar)` | Snowflake-specific behavior (see below for details). |
| `getDate(String)` | Snowflake-specific behavior (see below for details). |
| `getDate(String, Calendar)` | Snowflake-specific behavior (see below for details). |
| `getDouble(int)` |  |
| `getDouble(String)` |  |
| `getFloat(int)` |  |
| `getFloat(String)` |  |
| `getInt(int)` |  |
| `getInt(String)` |  |
| `getLong(int)` |  |
| `getLong(String)` |  |
| `getMetaData()` | Snowflake-specific behavior (see below for details). |
| `getObject(int)` |  |
| `getObject(String)` |  |
| `getShort(int)` |  |
| `getShort(String)` |  |
| `getString(int)` |  |
| `getString(String)` |  |
| `getTime(int)` | Snowflake-specific behavior (see below for details). |
| `getTime(String)` | Snowflake-specific behavior (see below for details). |
| `getTimestamp(int)` | Snowflake-specific behavior (see below for details). |
| `getTimestamp(String)` | Snowflake-specific behavior (see below for details). |
| `next()` | Snowflake-specific behavior (see below for details). |
| `wasNull()` |  |
| **Unsupported Methods** |  |
| `clearWarnings()` |  |
| `getArray(int)` |  |
| `getArray(String)` |  |
| `getAsciiStream(int)` |  |
| `getAsciiStream(String)` |  |
| `getBinaryStream(int)` |  |
| `getBinaryStream(String)` |  |
| `getCursorName()` |  |
| `getUnicodeStream(int)` |  |
| `getUnicodeStream(String)` |  |
| `getWarnings()` |  |

### Snowflake-specific behavior

* `close()`

  > Closes the object. After an object has been closed, calling almost any method of the closed object will raise a `SQLException`. Calling
  > `close` on an already closed object is harmless and will not raise an exception.
* `getDate()`, `getTime()`, `getTimestamp()`

  > In version 3.12.17 and later versions of the JDBC Driver, these methods use the session time zone (specified by the
  > [TIMEZONE](../../sql-reference/parameters.md) parameter). Older versions use the time zone of the JVM.
  >
  > To change these methods to use the time zone of the JVM, set the [JDBC_USE_SESSION_TIMEZONE](../../sql-reference/parameters.md) parameter to
  > `FALSE`.
* `getMetaData()`

  > If the ResultSet object is for [an asynchronous query](jdbc-using.md), then this method will block
  > until the query has finished executing. You can use `resultSet.unwrap(SnowflakeResultSet.class).getStatus()` to get
  > the query status before calling this method.
* `next()`

  > This makes the next row in the result set the “current” row. Calls to the `get*()` methods, such as `getInt()`,
  > get values from the current row.
  >
  > If the `ResultSet` has been closed by a call to the `close` method, then subsequent calls to `next` return false, rather than raise an exception.
  >
  > If the ResultSet object is for [an asynchronous query](jdbc-using.md), then this method will block
  > until the results are available. You can use `resultSet.unwrap(SnowflakeResultSet.class).getStatus()` to get the
  > query status before calling this
  > method.

## Interface: `SnowflakeResultSet`

The SnowflakeResultSet interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type ResultSet, for example by calling the
Statement.getResultSet() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC ResultSet interface and the
SnowflakeResultSet interface. To access the SnowflakeResultSet methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

| Method Name | Description |
| --- | --- |
| `getQueryID()` | Returns the Snowflake query ID of the statement that generated this result set. |
| `getStatus()` | For a ResultSet returned by an asynchronous query, returns the status of the query. |

getQueryID()
:   Purpose:
    :   This method returns the Snowflake query ID of the statement that generated this result set.

    Arguments:
    :   None.

    Returns:
    :   The method returns the ID as a String that contains a UUID.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the
        `ResultSet`.

    Examples:
    :   ```javascript
            String queryID2;
            queryID2 = resultSet.unwrap(SnowflakeResultSet.class).getQueryID();
        ```

getStatus()
:   Purpose:
    :   For a ResultSet returned by an asynchronous query, such as `SnowflakeStatement.executeAsyncQuery()`,
        this method returns the status of the query. The status indicates whether the query finished successfully,
        finished unsuccessfully, or has not yet finished.

    Arguments:
    :   None.

    Returns:
    :   A QueryStatus enum constant.

    Throws:
    :   This method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the ResultSet object.

    Examples:
    :   ```java
        QueryStatus queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
        ```

        See [Examples of asynchronous queries](jdbc-using.md) for a more extensive example that includes a call to this
        method.

## Object: `ResultSetMetaData`

This provides information about a ResultSet, for example, the number of columns in the ResultSet.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `getCatalogName(int)` | Snowflake-specific behavior (see below for details). |
| `getColumnCount()` |  |
| `getColumnDisplaySize(int)` |  |
| `getColumnLabel(int)` |  |
| `getColumnName(int)` |  |
| `getColumnType(int)` |  |
| `getColumnTypeName(int)` | Snowflake-specific behavior (see below for details). |
| `getPrecision(int)` |  |
| `getScale(int)` |  |
| `getSchemaName(int)` | Snowflake-specific behavior (see below for details). |
| `getTableName(int)` | Snowflake-specific behavior (see below for details). |
| `isAutoIncrement(int)` |  |
| `isCaseSensitive(int)` |  |
| `isCurrency(int)` |  |
| `isDefinitelyWritable(int)` |  |
| `isNullable(int)` |  |
| `isReadOnly(int)` |  |
| `isSearchable(int)` |  |
| `isSigned(int)` |  |
| `isWritable(int)` |  |
| **Unsupported Methods** |  |
| None. |  |

### Snowflake-specific behavior

* The `ResultSetMetaData` class does not have a `close()` method. An open `ResultSetMetaData` object is implicitly closed when the user closes
  the `ResultSet` from which the `ResultSetMetaData` object was created.
* `getCatalogName()`, `getSchemaName()`, `getTableName()`

  If the ResultSet object is for [an asynchronous query](jdbc-using.md), these methods return empty
  strings.
* For [GEOGRAPHY](../../sql-reference/data-types-geospatial.md) columns, `getColumnTypeName` returns `GEOGRAPHY`.

  Note that the `getColumnType` and `getColumnClassName` methods do not indicate that the column type is
  `GEOGRAPHY`.

## Interface: `SnowflakeResultSetMetaData`

The SnowflakeResultSetMetaData interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type ResultSetMetaData, for example by calling the
ResultSet.getMetaData() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC ResultSetMetaData interface and the
SnowflakeResultSetMetaData interface. To access the SnowflakeResultSetMetaData methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `getColumnIndex(String columnName)` |  |
| `getColumnNames()` |  |
| `getInternalColumnType(int column)` |  |
| `getQueryID()` |  |

getColumnIndex(String columnName):
:   Purpose:
    :   Returns the index of the column that corresponds to the columnName. For example, if a column named “BirthDate”
        is the third column in the table, then getColumnIndex(“BirthDate”) returns 2. (Indexes are 0-based, not 1-based.)

    Arguments:
    :   The name of the column for which you want to find the index.

    Returns:
    :   Returns an integer that contains the index of the column that corresponds to the columnName.
        If the columnName does not match any column in the result set, this returns -1.

    Throws:
    :   The method can throw `SQLException`.

getColumnNames():
:   Purpose:
    :   This function returns the list of all column names in the resultset.

        This is different from the function getColumnName(int column) in ResultSetMetaData, which returns a single
        column name based on an index.

    Arguments:
    :   None.

    Returns:
    :   The data type of the returned value is “List<String>”. The list contains the names of the columns. The names
        are in the same order as the column indexes.

    Throws:
    :   The method can throw `SQLException`.

getInternalColumnType(int column):
:   Purpose:
    :   Returns the data type of the specified column.

    Arguments:
    :   column: This indicates the index (1-based) of the column for which you want the data type.

    Returns:
    :   Returns the data type of the specified column. The data type is an integer.

    Throws:
    :   The method can throw `SQLException`.

getQueryID()
:   Purpose:
    :   Returns the Snowflake query ID of the query to which this metadata applies.

    Arguments:
    :   None.

    Returns:
    :   This method returns the query ID of the query for which this metadata was generated.
        The query ID is a String that contains a UUID. Information about UUIDs is included in the description of the
        SQL function [UUID_STRING](../../sql-reference/functions/uuid_string.md).

    Throws:
    :   The method can throw `SQLException`.

## Object: `SnowflakeTimestampWithTimezone`

A `SnowflakeTimestampWithTimezone` object provides information
about the time zone associated with the Java `Timestamp` object’s time stamp.
You can use this object to extract the time zone directly instead of parsing
the information from the Java `Timestamp` string. To access this functionality, you must import the
following Java libraries:

* `java.sql.Timestamp;`
* `java.time.ZonedDateTime;`
* `java.util.TimeZone;`

### Methods

| Method Name | Notes |
| --- | --- |
| **Constructors** |  |
| ```java SnowflakeTimestampWithTimezone(     long seconds,     int nanoseconds,     TimeZone tz) ``` | * Number of seconds since January 1, 1970 (Internet time). * Number of fractional nanoseconds. * ID of the time zone. |
| ```java SnowflakeTimestampWithTimezone(     Timestamp ts,     TimeZone tz) ``` | * `Timestamp` object representing the desired time. * ID of the time zone. |
| ```java SnowflakeTimestampWithTimezone(     Timestamp ts) ``` | * `Timestamp` object representing the desired time. |
| **Supported Methods** |  |
| `getTimezone()` | Snowflake-specific behavior (see below for details). |
| `toZonedDateTime()` | Snowflake-specific behavior (see below for details). |

### Snowflake-specific behavior

* `getTimezone()`

  Returns the time zone from the time stamp.

  ```java
  import java.sql.Timestamp;
  import java.time.ZonedDateTime;
  import java.util.TimeZone;

  public void testGetTimezone() {
      String timezone = "Australia/Sydney";

      // Create a timestamp from a point in time
      Long datetime = System.currentTimeMillis();
      Timestamp currentTimestamp = new Timestamp(datetime);
      SnowflakeTimestampWithTimezone ts =
          new SnowflakeTimestampWithTimezone(currentTimestamp, TimeZone.getTimeZone(timezone));

      // Verify timezone was set
      assertEquals(ts.getTimezone().getID(), timezone);
  }
  ```
* `toZonedDateTime()`

  Converts a `SnowflakeTimestampWithTimezone` time stamp to a zoned date time (Java `ZonedDateTime` object).

  ```java
  import java.sql.Timestamp;
  import java.time.ZonedDateTime;
  import java.util.TimeZone;

  public void testToZonedDateTime() {
      String timezone = "Australia/Sydney";
      String zonedDateTime = "2022-03-17T10:10:08+11:00[Australia/Sydney]";

      // Create a timestamp from a point in time
      Long datetime = 1647472208000L;
      Timestamp timestamp = new Timestamp(datetime);
      SnowflakeTimestampWithTimezone ts =
          new SnowflakeTimestampWithTimezone(timestamp, TimeZone.getTimeZone(timezone));
      ZonedDateTime zd = ts.toZonedDateTime();

      // Verify timestamp was converted to zoned datetime
      assertEquals(zd.toString(), zonedDateTime);
  }
  ```

## Object: `Statement`

A `Statement` object represents a SQL statement. The statement object allows users to perform tasks such as:

* Execute a SQL statement.
* Set a timeout for the execution of the statement.
* Retrieve a result set for a query.

### Methods

| Method Name | Notes |
| --- | --- |
| **Supported Methods** |  |
| `cancel()` |  |
| `close()` | Snowflake-specific behavior (see below for details). |
| `execute(String)` |  |
| `executeBatch(String)` |  |
| `executeLargeBatch(String)` |  |
| `executeLargeUpdate(String)` |  |
| `executeQuery(String)` |  |
| `executeUpdate(String)` |  |
| `getBatchQueryID()` | Snowflake-specific behavior (see below for details). |
| `getMaxFieldSize()` |  |
| `getMaxRows()` |  |
| `getMoreResults()` |  |
| `getQueryTimeout()` |  |
| `getResultSet()` |  |
| `getUpdateCount()` | Snowflake-specific behavior (see below for details). |
| `setCursorName(String)` |  |
| `setMaxRows(int)` |  |
| `setQueryTimeout(int)` |  |
| **Unsupported Methods** |  |
| `clearWarnings()` |  |
| `getWarnings()` |  |
| `setEscapeProcessing(boolean)` |  |
| `setMaxFieldSize(int)` |  |

### Snowflake-specific behavior

* `close()`

  > This method closes the object. After an object has been closed, calling almost any method of the closed object will raise a
  > `SQLException`. Calling `close` on an already closed object is harmless and will not raise an exception.
* `getBatchQueryID()`

  > This method returns a list of the Snowflake query IDs of the most recently executed query batch of this `Statement`. If no
  > query has been executed yet with the statement, the method returns null.
  >
  > This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the
  > statement. For example:
  >
  > > ```javascript
  > >     connection.setAutoCommit(false);
  > >     statement.addBatch("SELECT 1;");
  > >     statement.addBatch("SELECT 2;");
  > >     statement.executeBatch();
  > >     connection.commit();
  > >     connection.setAutoCommit(true);
  > >     List<String> batchQueryIDs1;
  > >     // Since getQueryID is not standard JDBC API, we must call unwrap() to
  > >     // use these Snowflake methods.
  > >     batchQueryIDs1 = statement.unwrap(SnowflakeStatement.class).getBatchQueryIDs();
  > >     int num_query_ids = batchQueryIDs1.size();
  > >     if (num_query_ids != 2) {
  > >       System.out.println("ERROR: wrong number of query IDs in batch 1.");
  > >     }
  > >     // Check that each query ID is plausible.
  > >     for (int i = 0; i < num_query_ids; i++) {
  > >       String qid = batchQueryIDs1.get(i);
  > >       if (!is_plausible_query_id(qid)) {
  > >         msg = "SEVERE WARNING: suspicious query ID in batch";
  > >         System.out.println("msg");
  > >         System.out.println(qid);
  > >       }
  > >     }
  > > ```
* `getUpdateCount()`

  > This method returns the number of rows updated by the most recently executed SQL statement.
  >
  > + If the statement was a DML statement (INSERT, UPDATE, DELETE, etc.), then `getUpdateCount()` returns the number of rows that
  >   were added, deleted, or changed. Note that this value can be `0` if no rows were changed.
  > + If the statement was a SELECT statement, then `getUpdateCount()` returns `-1`.
  > + If the statement was a DDL statement, then `getUpdateCount()` returns `-1`.

## Interface: `SnowflakeStatement`

The SnowflakeStatement interface contains Snowflake-specific methods. When you use the
Snowflake JDBC driver to create an object of type Statement, for example by calling the
Connection.createStatement() method, you actually get an object of a different (hidden)
Snowflake-specific type, which implements both the JDBC Statement interface and the
SnowflakeStatement interface. To access the SnowflakeStatement methods in that object,
you [unwrap](jdbc-using.md) the object.

### Additional methods

| Method Name | Description |
| --- | --- |
| `executeAsyncQuery()` | Performs an asynchronous query. |
| `getQueryID()` | Returns the Snowflake query ID of the most recently executed query of this `Statement`. |
| `setParameter(String, Value)` | Sets Snowflake-specific parameters. |

executeAsyncQuery(*String*)
:   Purpose:
    :   This method performs an [asynchronous query](jdbc-using.md), which involves submitting an SQL
        statement for execution, then returning control to the caller without waiting for the query to finish.

    Arguments:
    :   A string containing the SQL command to execute. Any SQL statement that is valid for `executeQuery()` is
        also valid for `executeAsyncQuery()`.

        > **Note:**
        >
        > File transfer statements, such as PUT and GET, are valid for `executeAsyncQuery()`, but behave synchronously.

    Returns:
    :   An “empty” ResultSet. The user should poll the result set by calling
        `resultSet.unwrap(SnowflakeResultSet.class).getStatus()` until the query results become available.

    Throws:
    :   The method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the Statement object.

    Examples:
    :   See [Examples of asynchronous queries](jdbc-using.md) for an example that includes a call to this method.

getQueryID()
:   Purpose:
    :   This method returns the Snowflake query ID of the most recently executed query of this `Statement`.

    Arguments:
    :   None.

    Returns:
    :   The query ID of the most recently executed query of this statement.
        The query ID is a String that contains a UUID.
        If no query has been executed yet with the statement, the method returns null.

    Throws:
    :   The method can throw `SQLException`.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the Statement.

        For [asynchronous queries](jdbc-using.md), the query ID does not become available until the
        execution of the statement completes. If you call `SnowflakeStatement.getQueryID()` after calling
        `executeAsyncQuery()` but before the statement finishes executing, the return value could be NULL.
        Instead, call `resultSet.unwrap(SnowflakeResultSet.class).getQueryID()` on the `ResultSet` object
        returned by `executeAsyncQuery()`.

    Examples:
    :   ```javascript
            String queryID1;
            queryID1 = statement.unwrap(SnowflakeStatement.class).getQueryID();
        ```

setParameter(*String parameter_name*, *<type> <value>*)
:   Purpose:
    :   The `SnowflakeStatement` class provides the `setParameter` method as a Snowflake extension. This allows
        the caller to set Snowflake-specific JDBC parameters.

        The method is overloaded. Different JDBC parameters require different data types. A method exists for each
        valid data type that can be passed as the second argument to the function.

    Arguments:
    :   parameter_name:
        :   This string must contain the name of a pre-defined Snowflake JDBC parameter. The pre-defined
            JDBC parameters (and their valid values or ranges) are listed below:

            | JDBC Parameter | Notes |
            | --- | --- |
            | MULTI_STATEMENT_COUNT | Integer specifying the number of statements (`0` = unlimited number of statements; `1` or higher indicates the exact number of statements that should be executed). |

        value:
        :   This is the value to assign to the specified JDBC parameter. Make sure that the data type is compatible
            with the JDBC parameter you specified.

    Returns:
    :   Nothing.

    Throws:
    :   This function can throw SQLException.

    Notes:
    :   This method is a Snowflake extension to the JDBC standard. To use this method, you must [unwrap](jdbc-using.md) the Statement.

    Examples:
    :   ```java
        Statement statement1;
        ...
        // Tell Statement to expect to execute 2 statements:
        statement1.unwrap(SnowflakeStatement.class).setParameter(
                "MULTI_STATEMENT_COUNT", 2);
        ```

## Interface: `SQLException`

SQLException objects are thrown by JDBC driver methods when an error occurs, and contain information about that error.

| Method Name | Description |
| --- | --- |
| `getErrorCode()` | Returns a Snowflake-specific error code. |
| `getMessage()` | This returns a string that describes the error. |
| `getSQLState()` | Returns the SQLState. |

getErrorCode()
:   Purpose:
    :   This method returns a custom Snowflake error code.

    Arguments:
    :   None.

    Returns:
    :   A Snowflake-specific error code.

    Notes:
    :   See also the `getSQLState()` method.

getMessage()
:   Purpose:
    :   This method returns a string that describes the error.

    Arguments:
    :   None.

    Returns:
    :   A Snowflake-specific error message.

getSQLState()
:   Purpose:
    :   This method returns a string that contains a 5-character alphanumeric value based on the error.

    Arguments:
    :   None.

    Returns:
    :   A Snowflake-specific SQLState. An SQLState is a 5-character alphanumeric string that indicates the
        specific error that occurred.

---
title: JDBC Driver connection parameter reference
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-parameters.md
section: Developer Guide
---

# JDBC Driver connection parameter reference

This topic lists the connection parameters that you can use to configure the JDBC driver.
You can set these parameters in the [JDBC connection string](jdbc-configure.md) or in a Java `Properties`
object.

## Required parameters

This section lists the parameters that you must set in the connection string or in the `Map` of properties.

> **Note:**
>
> You must also set the parameters for authentication.

### `user`

> Description:
> :   Specifies the login name of the user for the connection.

## Authentication parameters

### `allowUnderscoresInHost`

> Description:
> :   Specifies whether to allow underscores in account names. The JDBC Driver does not support underscores in URLs,
>     which include the account name, so
>     the JDBC Driver automatically converts underscores to hyphens. The default value is `false`.
>
>     > **Note:**
>     >
>     > Beginning with version 3.13.25, the Snowflake JDBC driver changes the default value of the `allowUnderscoresInHost` parameter to `false`.
>     > This change impacts PrivateLink customers whose account names contain underscores. In this situation, you must override
>     > the default value by setting `allowUnderscoresInHost` to `true`.

### `authenticator`

> Description:
> :   Specifies the authenticator to use for verifying user login credentials. You can set this to one of the following
>     values:
>
>     Version 4.xVersion 3.x
>
>     | Value | Description |
>     | --- | --- |
>     | `snowflake` | Use the internal Snowflake authenticator. |
>     | `externalbrowser` | [Use your web browser](../../user-guide/admin-security-fed-auth-use.md) to authenticate with Okta, AD FS, or any other SAML 2.0-compliant identity provider (IdP) that has been defined for your account. |
>     | `https://<okta_account_name>.okta.com` | The URL endpoint for your Okta account to [authenticate through native Okta](../../user-guide/admin-security-fed-auth-use.md) (only supported if your IdP is Okta). |
>     | `oauth` | Authenticate using OAuth. When OAuth is specified as the authenticator, you must also set the `token` parameter to specify the OAuth token (see below). |
>     | `snowflake_jwt` | Authenticate using key pair authentication. For more details about key pair authentication, see [Using key pair authentication and key rotation](jdbc-configure.md). |
>     | `username_password_mfa` | Authenticate with MFA token caching. For more details, see [Using multi-factor authentication](jdbc-configure.md). |
>     | `oauth_authorization_code` | Manually authenticate using an OAuth authorization code with your web browser and a chosen identity provider (including Snowflake as an IdP). For more information, see [Using the OAuth 2.0 Authorization Code flow](jdbc-configure.md). |
>     | `oauth_client_credentials` | Automatically authenticate using OAuth client credentials with your chosen identity provider (Snowflake as an IdP doesn’t support the client credentials flow). For more information, see [Using the OAuth 2.0 Client Credentials flow](jdbc-configure.md). |
>     | `programmatic_access_token` | Authenticate with a programmatic access token (PAT). For more information, see [Authenticating with a programmatic access token (PAT)](jdbc-configure.md). |
>     | `WORKLOAD_IDENTITY` | Authenticate with the [workload identity federation (WIF)](../../user-guide/workload-identity-federation.md) authenticator. |
>
>     | Value | Description |
>     | --- | --- |
>     | `snowflake` | Use the internal Snowflake authenticator. |
>     | `externalbrowser` | [Use your web browser](../../user-guide/admin-security-fed-auth-use.md) to authenticate with Okta, AD FS, or any other SAML 2.0-compliant identity provider (IdP) that has been defined for your account. |
>     | `https://<okta_account_name>.okta.com` | The URL endpoint for your Okta account to [authenticate through native Okta](../../user-guide/admin-security-fed-auth-use.md) (only supported if your IdP is Okta). |
>     | `oauth` | Authenticate using OAuth. When OAuth is specified as the authenticator, you must also set the `token` parameter to specify the OAuth token (see below). |
>     | `snowflake_jwt` | Authenticate using key pair authentication. For more details about key pair authentication, see [Using key pair authentication and key rotation](jdbc-configure.md). |
>     | `username_password_mfa` | Authenticate with MFA token caching. For more details, see [Using multi-factor authentication](jdbc-configure.md). |
>     | `oauth_authorization_code` | Manually authenticate using an OAuth authorization code with your web browser and a chosen identity provider (including Snowflake as an IdP). For more information, see [Using the OAuth 2.0 Authorization Code flow](jdbc-configure.md). |
>     | `oauth_client_credentials` | Automatically authenticate using OAuth client credentials with your chosen identity provider (Snowflake as an IdP doesn’t support the client credentials flow). For more information, see [Using the OAuth 2.0 Client Credentials flow](jdbc-configure.md). |
>     | `programmatic_access_token` | Authenticate with a programmatic access token (PAT). For more information, see [Authenticating with a programmatic access token (PAT)](jdbc-configure.md). |
>     | `WORKLOAD_IDENTITY` | Authenticate with the [workload identity federation (WIF)](../../user-guide/workload-identity-federation.md) authenticator. |
>
>     If the connection string specifies a key pair and the `authenticator` parameter is unset or is set to ‘snowflake’, then key pair authentication will be used.
>
>     For more information on authentication, see [Managing/Using federated authentication](../../user-guide/admin-security-fed-auth-use.md) and
>     [Clients, drivers, and connectors](../../user-guide/oauth-intro.md).
>
> Default:
> :   `snowflake`

### `disableGcsDefaultCredentials`

> Description:
> :   Specifies whether use the default credential lookup instead of external application default credentials when using GCP (Google Cloud Platform).
>
>     By default, GCP users can use a variety of options to set up Google Application Default Credentials outside of Snowflake. Occasionally, these authentication methods can interfere with cloud storage operations that originate from the Snowflake JDBC driver. In such cases, you can set the value to `true` to force the driver to ignore GCP credentials from other sources.
>
>     For more information, see [Application Default Credentials](https://cloud.google.com/docs/authentication/provide-credentials-adc)
>
>     You can also use the `net.snowflake.jdbc.disableGcsDefaultCredentials` Java property to achieve the same effect.
>
> Default:
> :   `true`

### `disableSamlURLCheck`

> Description:
> :   Specifies whether to disable the validation check of a SAML response.
>
> Default:
> :   `false`

### `passcode`

> Description:
> :   Specifies the passcode to use for multi-factor authentication.
>
>     For more information about multi-factor authentication, see [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md).

### `passcodeInPassword`

> Description:
> :   Specifies whether the passcode for multi-factor authentication is appended to the password:
>
>     * `on` (or `true`) specifies the passcode is appended.
>     * `off` (or `false`) or any other value specifies the passcode is not appended.
>
> Default:
> :   `off`

### `password`

> Description:
> :   Specifies the password for the specified user.
>
>     There are two ways to specify the password:
>
>     > * The first way is to pass the user ID and password directly to the `getConnection` method:
>     >
>     >   > ```java
>     >   > String user = "<user>";          // replace "<user>" with your user name
>     >   > String password = "<password>";  // replace "<password>" with your password
>     >   > Connection con = DriverManager.getConnection("jdbc:snowflake://<account>.snowflakecomputing.com/", user, password);
>     >   > ```
>     > * The second way is to create a `Properties` object, update the object with the password, and pass the object to the
>     >   `getConnection` method:
>     >
>     >   > ```java
>     >   > String user = "<user>";          // replace "<user>" with your user name
>     >   > String password = "<password>";  // replace "<password>" with your password
>     >   > Properties props = new Properties();
>     >   > props.put("user", user);
>     >   > props.put("password", password);
>     >   > Connection con = DriverManager.getConnection("jdbc:snowflake://<account>.snowflakecomputing.com/", props);
>     >   > ```
>
>     > **Attention:**
>     >
>     > We strongly recommend that you do not include the user password directly in the JDBC connection string because the
>     > password could be inadvertently exposed by the client application that uses the string to connect to Snowflake. Instead, use
>     > the interface(s) provided by the application to specify the user password.

### `privatekey`

> Description:
> :   Specifies the private key for the specified user. See [Using key pair authentication and key rotation](jdbc-configure.md).

### `private_key_base64`

> Description:
> :   Specifies the base64 encoded private key for the specified user. See
>     [Using key pair authentication and key rotation](jdbc-configure.md).

### `private_key_file`

> Description:
> :   Specifies the path to the private key file for the specified user. See
>     [Using key pair authentication and key rotation](jdbc-configure.md).

### `private_key_file_pwd`

> Description:
> :   (Deprecated) Use private_key_pwd instead.

### `private_key_pwd`

> Description:
> :   Specifies the passphrase to decrypt the private key file or base64 encoded private key for the specified user. See
>     [Using key pair authentication and key rotation](jdbc-configure.md).

### `token`

> Description:
> :   Specifies the OAuth token to use for authentication, where `<string>` is the token. This parameter is
>     required only when setting the authenticator parameter to `oauth`, except as noted below.
>
>     > **Note:**
>     >
>     > Beginning with version 3.13.24, the Snowflake JDBC Driver lets you send the OAuth token in the connection password
>     > in addition to including it in the `token` configuration parameter. If the `token` configuration parameter is not specified,
>     > the `Driver.connect()` method expects the token to be stored in the connection password.
>     >
>     > This feature primarily supports using OAuth authentication for connection
>     > pools, allowing you to pass refreshed tokens as needed instead of being restricted by an expired token specified in the
>     > `token` configuration parameter.
>
>     For example, instead of setting he `token` configuration parameter, you can pass the token as the password
>     in the `getConnection()` method properties, similar to the following:
>
>     ```java
>     Properties props = new Properties();
>     props.put("user", "myusername");
>     props.put("authenticator", "oauth");
>     props.put("role", "myrole");
>     props.put("password", "xxxxxxxxxxxxx"); // where xxxxxxxxxxxxx is the token string
>     Connection myconnection = DriverManager.getConnection(url, props);
>     ```
>
> Default:
> :   None

### `oauthClientId`

> Description:
> :   Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

### `oauthClientSecret`

> Description:
> :   Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

### `oauthAuthorizationUrl`

> Description:
> :   Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.

### `oauthTokenRequestUrl`

> Description:
> :   Identity Provider endpoint supplying the access tokens to the driver. When using Snowflake as an Identity Provider, this value is derived from the `server` or `account` parameters.

### `oauthScope`

> Description:
> :   Scope requested in the Identity Provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

### `oauthRedirectUri`

> Description:
> :   URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

### `workloadIdentityProvider`

> Description:
> :   Platform of the workload identity provider. Possible values include: `AWS`, `AZURE`, `GCP`, and `OIDC`.

### `workloadImpersonationPath`

> Description:
> :   String containing a list of identities separated with commas that provide an identity chain to use when connecting to Snowflake. Elements are either a full service account address or a service account’s unique ID.
>
>     Impersonation works by following each entry in order to obtain a token that allows authorization of the next service account. Each account in the identity chain needs permissions to impersonate the next account only. The final account in the list obtains your Snowflake connection token and uses it to connect to Snowflake.

## Parameters for the default database, role, schema, and warehouse

### `db`

> Description:
> :   Specifies the default database to use once connected, or specifies an empty string. The specified database should
>     be an existing database for which the specified default role has privileges.
>
>     If you need to use a different database after connecting, execute the [USE DATABASE](../../sql-reference/sql/use-database.md) command.

### `role`

> Description:
> :   Specifies the default access control role to use in the Snowflake session initiated by the driver. The specified
>     role should be an existing role that has already been assigned to the specified user for the driver. If the specified role has
>     not already been assigned to the user, the role is not used when the session is initiated by the driver.
>
>     If you need to use a different role after connecting, execute the [USE ROLE](../../sql-reference/sql/use-role.md) command.
>
>     For more information about roles and access control, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

### `schema`

> Description:
> :   Specifies the default schema to use for the specified database once connected, or specifies an empty string. The
>     specified schema should be an existing schema for which the specified default role has privileges.
>
>     If you need to use a different schema after connecting, execute the [USE SCHEMA](../../sql-reference/sql/use-schema.md) command.

### `warehouse`

> Description:
> :   Specifies the virtual warehouse to use once connected, or specifies an empty string. The specified warehouse
>     should be an existing warehouse for which the specified default role has privileges.
>
>     If you need to use a different warehouse after connecting, execute the [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) command can
>     be executed to set a different warehouse for the session.

## Proxy parameters

### `disableSocksProxy`

> Description:
> :   Specifies whether the driver should ignore the SOCKS proxy configuration specified in the Java system options:
>
>     * `on` (or `true`) specifies to ignore the proxy.
>     * `off` (or `false`) or any other value specifies to use the proxy.
>
>     > **Note:**
>     >
>     > Setting this connection parameter alters the behavior for all connections on the same JVM (Java virtual machine).
>
> Default:
> :   `off`

### `nonProxyHosts`

> Description:
> :   Specifies the lists of hosts that the driver should connect to directly, bypassing the proxy server. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.

### `proxyHost`

> Description:
> :   Specifies the hostname of the proxy server to use. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.

### `proxyPassword`

> Description:
> :   Specifies the password for authenticating to the proxy server. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.

### `proxyPort`

> Description:
> :   Specifies the port number of the proxy server to use. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.

### `proxyProtocol`

> Description:
> :   Specifies the protocol used to connect to the proxy server. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.
>
> Default:
> :   `http`

### `proxyUser`

> Description:
> :   Specifies the user name for authenticating to the proxy server. See
>     [Specifying a proxy server in the JDBC connection string](jdbc-configure.md) for details.

### `useProxy`

> Description:
> :   Specifies whether the driver should use a proxy:
>
>     * `on` (or `true`) specifies that the driver should use a proxy.
>     * `off` (or `false`) or any other value specifies that the driver should not use a proxy. This setting has no effect if JVM proxy arguments are present.
>
>     See [Specifying a proxy server in the JDBC connection string](jdbc-configure.md).
>
> Default:
> :   `off`

## Timeout parameters

### `loginTimeout`

> Description:
> :   Specifies the number of seconds to wait for a response when connecting to the Snowflake service before returning a login failure error.
>
> Default:
> :   `60`

### `networkTimeout`

> Description:
> :   Specifies the number of milliseconds to wait for a response when interacting with the Snowflake service before returning an error. `0` (zero) specifies that no network timeout is set.
>
> Default:
> :   `0`

### `net.snowflake.jdbc.http_client_connection_timeout_in_ms`

> Description:
> :   Specifies the maximum time, in milliseconds, to wait on fully establishing a new connection (including TLS negotiation) with the remote host.
>
> You can also set this in the connection string with `${HTTP_CLIENT_CONNECTION_TIMEOUT}`.
>
> Default:
> :   `60000` (1 minute)

### `net.snowflake.jdbc.http_client_socket_timeout_in_ms`

> Description:
> :   Specifies the maximum time, in milliseconds, to wait for data (time of inactivity between two data packets) after a connection is successfully established.
>
> You can also set this in the connection string with `${HTTP_CLIENT_SOCKET_TIMEOUT}`.
>
> Default:
> :   `300000` (5 minutes)

### `queryTimeout`

> Description:
> :   Specifies the number of seconds to wait for a query to complete before returning an error. `0` (zero) specifies
>     that the driver should wait indefinitely.
>
> Default:
> :   `0`

## Certificate revocation list (CRL) options

These options are available in driver versions 3.27.0 and later.

### `CERT_REVOCATION_CHECK_MODE`

> Description:
> :   How to treat certificate revocation. Note that certificate revocation checks with CRLs are resource-heavy tasks, both for memory and CPU. The following values are supported:
>
>     * `ENABLED`: Enables CRLs. Connections are terminated if there are errors related to obtaining and parsing the CRL.
>     * `ADVISORY`: Enables CRLs. Errors are logged but do not block the connection; revocation status is not enforced.
>     * `DISABLED`: Disables CRLs. Certificates can only be revoked manually.
>
> Default:
> :   `DISABLED`

### `ALLOW_CERTIFICATES_WITHOUT_CRL_URL`

> Description:
> :   Whether certificates without an associated CRL are accepted. If false, certificates lacking a CRL distribution point cause the connection to fail. Applies only when `CERT_REVOCATION_CHECK_MODE` is not `DISABLED`.
>
> Default:
> :   `false`

### `ENABLE_CRL_IN_MEMORY_CACHING`

> Description:
> :   Whether to enable in-memory caching of CRLs. If enabled, the driver caches CRLs in memory to improve performance. Applies only when `CERT_REVOCATION_CHECK_MODE` is not `DISABLED`.
>
> Default:
> :   `true`

### `ENABLE_CRL_DISK_CACHING`

> Description:
> :   Whether to enable disk caching of CRLs. If enabled, the driver caches CRLs on disk to improve performance. Applies only when `CERT_REVOCATION_CHECK_MODE` is not `DISABLED`.
>
> Default:
> :   `true`

### `CRL_CACHE_VALIDITY_TIME`

> Description:
> :   Specifies the time, in seconds, that a CRL is considered valid. After this time, the CRL is refreshed from the source.
>
> Default:
> :   `86400` (1 day)

### `CRL_RESPONSE_CACHE_DIR`

> Description:
> :   Specifies the directory where the CRL response cache is stored.
>
> Default:
>
> * Windows: `%USERPROFILE%AppDataLocalSnowflakeCachescrls`
> * Linux: `$HOME/.cache/snowflake/crls`
> * macOS: `$HOME/Library/Caches/Snowflake/crls`

### `CRL_ON_DISK_CACHE_REMOVAL_DELAY`

> Description:
> :   Specifies the time, in seconds, to delay removing the on-disk cache.
>
> Default:
> :   `604800` (1 week)

## Other parameters

### `application`

> Description:
> :   Snowflake partner use only: Specifies the name of a partner application to connect through JDBC.

### `CLEAR_BATCH_ONLY_AFTER_SUCCESSFUL_EXECUTION`

> Description:
> :   Specifies whether to clear batch entries only when a batch updates successfully.
>
>     * `true`: Batch entries are cleared only when the batch updated successfully.
>     * `false`: The `Statement.executeBatch` and `Statement.executeLargeBatch` never clears batch entries after execution, while `PreparedStatement.executeBatch` and `PreparedStatement.executeLargeBatch` always clears batch entries after execution.
>
>     This parameter is available for backward compatibility.
>
> Default:
> :   `false`

### `client_config_file`

> Description:
> :   Specifies the path of a [logging configuration file](jdbc-configure.md) that you
>     can use to define the logging level and directory for saving log files. .. :Default: `sf_client_config.json`

### `CLIENT_TELEMETRY_ENABLED`

> Description:
> :   Specifies whether to send in-band telemetry data to Snowflake.
>
> Default:
> :   `true`.

### `CLIENT_TREAT_TIME_AS_WALL_CLOCK_TIME`

> Description:
> :   Specifies whether time values should be processed as literal wall-clock times, thereby avoiding potential discrepancies caused by timezone-sensitive epoch conversions when the parameter is `false`.
>
> : Default: `false`.

### `DIAGNOSTICS_ALLOWLIST_FILE`

> Description:
> :   Full path and filename of the JSON file containing the output of the [SYSTEM$ALLOWLIST](../../sql-reference/functions/system_allowlist.md) or [SYSTEM$ALLOWLIST_PRIVATELINK](../../sql-reference/functions/system_allowlist_privatelink.md) functions.
>
>     If `ENABLE_DIAGNOSTICS` is `true`, you must provide this parameter.

### `disableOCSPChecks`

> Description:
> :   When `true`, driver does not perform any OCSP checks
>
> Default:
> :   `false`

### `ENABLE_DIAGNOSTICS`

> Description:
> :   When `true` and the calling application invokes the `DriverManager` or `DataSource` `getConnection()` method, the driver runs several connectivity tests and writes the results in a pre-configured log file. The driver also returns the following exception:
>
>     ```output
>     net.snowflake.client.jdbc.SnowflakeSQLException: A connection was not created because the driver is running in diagnostics mode. If this is unintended then disable diagnostics check by removing the ENABLE_DIAGNOSTICS connection parameter
>     ```
>
>     If you enable this parameter, you must provide a value for the `DIAGNOSTICS_ALLOWLIST_FILE` parameter.
>
> Default:
> :   `false`

### `ENABLE_EXACT_SCHEMA_SEARCH_ENABLED`

> Description:
> :   Enables or disables exact schema searches in some `DatabaseMetaData` methods.
>
> Default:
> :   `false` (for backward compatibility)

### `enablePatternSearch`

> Description:
> :   Enables or disables pattern search for `getCrossReference`, `getExportedKeys`, `getImportedKeys`, and `getPrimaryKeys` metadata operations that should not use their parameters as patterns.
>
> Default:
> :   `true`

### `ENABLE_WILDCARDS_IN_SHOW_METADATA_COMMANDS`

> Description:
> :   Enables or disables treating wildcards as literals in some `DatabaseMetaData` methods when creating SQL queries. This setting can be useful when a client is not able to escape wildcards in identifiers.
>
> Default:
> :   `true`

### `enablePutGet`

> Description:
> :   Specifies whether to allow PUT and GET commands access to local file systems. Setting the value to
>     `false` disables PUT and GET command execution.
>
> Default:
> :   `true`

### `IMPLICIT_SERVER_SIDE_QUERY_TIMEOUT`

> Description:
> :   Specifies whether to send a timeout in the query sent to Snowflake.
>
>     * `true`: Calling `Statement.setQueryTimeout` sets the timeout on the query sent to Snowflake in addition to the client-side timeout.
>     * `false`: Calling `Statement.setQueryTimeout` sets only client side timeout is set.
>
> Default:
> :   `false`

### `insecureMode`

> Description:
> :   Deprecated. See disableOCSPChecks.

### `JAVA_LOGGING_CONSOLE_STD_OUT`

> Description:
> :   Specifies whether to write log message to standard output instead of standard error.
>
> Default:
> :   `false`

### `JAVA_LOGGING_CONSOLE_STD_OUT_THRESHOLD`

> Description:
> :   Specifies the maximum log message level to write to standard output. Higher log levels are written to standard error. Valid only when `JAVA_LOGGING_CONSOLE_STD_OUT` is `true`. Possible values include:
>
>     * `OFF`
>     * `SEVERE`
>     * `WARNING`
>     * `INFO`
>     * `CONFIG`
>     * `FINE`
>     * `FINER`
>     * `FINEST`
>     * `ALL`
>
> Default:
> :   none, which is equivalent to setting the value to `OFF` or `SEVERE`.

### `JDBC_ARROW_TREAT_DECIMAL_AS_INT`

> Description:
> :   Specifies whether to return all numbers in an arrow result set from a `getObject` call as integers. If this value and the JDBC_TREAT_DECIMAL_AS_INT parameter values are both `false`, all integer numbers in arrow return sets from a `getObject` call are returned as a `BigDecimal` type.
>
> Default:
> :   `true`

### `JDBC_DEFAULT_FORMAT_DATE_WITH_TIMEZONE`

> Description:
> :   Specifies whether to use the previously hardcoded value for the formatter (for backwards compatibility).
>
> Default:
> :   `true`

### `JDBC_GET_DATE_USE_NULL_TIMEZONE`

> Description:
> :   Specifies whether to use the previously null timezone value for the `getDate` method (for backwards compatibility).
>
> Default:
> :   `true`

### `JDBC_QUERY_RESULT_FORMAT`

> Description:
> :   Specifies which result format to use while fetching or processing the results of a query sent to Snowflake. Possible values include:
>
>     * `Arrow`
>     * `JSON`
>
> Default:
> :   `Arrow`

### `maxHttpRetries`

> Description:
> :   Specifies the maximum number of times to retry failed HTTP requests before returning an error.
>
> Default:
> :   7

### `net.snowflake.jdbc.max_connections`

> Description:
> :   Specifies the total maximum connections available in the connection pool.
>
> Default:
> :   `300`

### `net.snowflake.jdbc.max_connections_per_route`

> Description:
> :   Specifies the maximum number of connections allowed for a single port or URL. The value cannot
>     exceed the net.snowflake.jdbc.max_connections value.
>
> Default:
> :   `300`

### `net.snowflake.jdbc.objectMapper.maxJsonStringLength`

> Description:
> :   Specifies the maximum number of bytes for a string. You can increase the value for this Java property to set a larger buffer for Snowflake response deserialization if you receive error messages similar to the following:
>
>     ```output
>     com.fasterxml.jackson.core.exc.StreamConstraintsException: String length (XXXXXXX) exceeds the maximum length (180000000)
>     ```
>
> Default:
> :   `180000000`

### `ocspFailOpen`

> Description:
> :   Specifies that the driver should “fail open” if unable reach the OCSP server to verify the certificate. See
>     [OCSP](jdbc-configure.md).

### `OWNER_ONLY_STAGE_FILE_PERMISSIONS_ENABLED`

> Description:
> :   Sets owner-only permissions (0600) on the directory created for stage files.
>
> Default:
> :   `false`

### `putGetMaxRetries`

> Description:
> :   Specifies the maximum number of times to retry PUT/GET exceptions for storage clients.
>
> Default:
> :   25

### `stringsQuotedForColumnDef`

> Description:
> :   If this parameter is set to `true`, then when `DatabaseMetaData.getColumns()` and
>     `DatabaseMetaData.getProcedureColumns()` return a value of type `String` in the COLUMN_DEF column, that
>     value is embedded in single quotes. (If the data type of the value is not `String`, then the value is not
>     quoted, regardless of the setting of this parameter.)
>
>     * `true` specifies that string values should be embedded in single quotes (the quotes are part of the string, not
>       delimiters). This complies with the JDBC standard.
>     * `false` specifies that string values are not embedded in single quotes.
>
> Default:
> :   `false`

### `MAX_TLS_VERSION`

> Description:
> :   Specifies the maximum SSL/TLS version to use when initiating a TLS handshake. Valid values include:
>
>     * `TLSv1.2`
>     * `TLSv1.3`
>
>     Snowflake recommends leaving this setting at its default when you don’t have a specific need to change it.
>
> Default:
> :   `TLSv1.3`

### `MIN_TLS_VERSION`

> Description:
> :   Specifies the minimum SSL/TLS version to use when initiating a TLS handshake. Valid values include:
>
>     * `TLSv1.2`
>     * `TLSv1.3`
>
>     Snowflake recommends leaving this setting at its default when you don’t have a specific need to change it.
>
> Default:
> :   `TLSv1.2`

### `tracing`

> Description:
> :   Specifies the log level for the driver. The driver uses the standard Java log utility. You can set this parameter
>     to one of the following log levels:
>
>     * `OFF`
>     * `SEVERE`
>     * `WARNING`
>     * `INFO`
>     * `CONFIG`
>     * `FINE`
>     * `FINER`
>     * `FINEST`
>     * `ALL`
>
> Default:
> :   `INFO`

---
title: JDBC Driver diagnostic service
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-diagnostic-service.md
section: Developer Guide
---

# JDBC Driver diagnostic service

To aid Snowflake Support in diagnosing customer incidents, the Snowflake JDBC driver utilizes a diagnostic service that runs in the background. When the driver encounters an issue that
prevents it from performing normally, the diagnostic service records information about the issue in a pair of compressed dump files located in the `/tmp/snowflake_dumps` folder:

* `sf_incident_<incident_number>.dmp.gz`
* `sf_log_<incident_number>.dmp.gz`

> **Important:**
>
> The dump files may contain sensitive information (such as IP addresses) to further assist in solving the issue. Note that these files are only stored locally; they are not sent
> to Snowflake. You must choose to share the files, such as when diagnosing issues with Snowflake Support.
>
> If you wish to prevent the creation of these dump files by the drivers, set the `snowflake.disable_debug_dumps=true` system property.

When the driver encounters an issue, the service may also send diagnostic information to Snowflake to help fix the problem. This information includes:

* Driver version information.
* A generic description of the issue.
* Stack traces for the driver that pertain to the issue. Other than the account identifier, these stack traces include no customer information.

---
title: Keeping handler code in-line or on a stage
source: https://docs.snowflake.com/en/developer-guide/inline-or-staged.md
section: Developer Guide
---

# Keeping handler code in-line or on a stage

When creating a user-defined function (UDF) or stored procedure with SQL, you can specify whether the handler code is in-line with the
SQL that creates it or external to the SQL, such as in a file on a stage. This topic describes the difference.

Not all languages support using either an in-line or staged handler. For the list of supported languages, see language choice for
[stored procedures](stored-procedure/stored-procedures-overview.md) or [UDFs](udf/udf-overview.md).

## Practical differences

### In-line handler advantages

Functions and procedures with in-line handlers may be easier to manage. After using your development tools to verify that your code
works as it should, you can deploy it by copying it into the SQL statement you execute to create the function or procedure. You can
maintain the code there, updating it with a SQL statement (such as with ALTER FUNCTION or ALTER PROCEDURE) without having to maintain the
code elsewhere.

### Staged handler advantages

When using a staged handler, you can do the following:

* Use code you manage separately in a Git repository you’re using from Snowflake.

  For more information, see [How Snowflake works with a remote Git repository](git/git-overview.md).
* Use previously compiled code, such as when you already have compiled output but don’t have the source.
* Use handler code that might be too large to paste into the SQL statement with which you create the function or procedure.
  In-line code has an upper limit on the source code size.
* Reuse handler code from multiple functions or procedures. Staged code can contain multiple handler functions in which each function can
  be used by a different UDF or procedure. As you create multiple UDFs or procedures, they can each specify the same handler file, but
  specify a different handler function implemented in that file.

  In contrast, the handler for in-line functions or procedures typically contain only one callable function. That callable function can
  call other functions, and those other functions can be defined in the same code file or in another staged code file.
* Use existing testing and debugging tools to do most of the development work. This is particularly true
  if the code is large or complex.

## Using an in-line handler

When you’re using an in-line handler, you include the handler source code in the AS clause of the SQL statement creating the function or
procedure. For example, you would include the handler code in the AS clause of the CREATE FUNCTION or CREATE PROCEDURE statement itself.

Inside the AS clause, you surround the code with single quotes or a pair of dollar signs (`$$`). Using the double dollar signs might be
easier, such as when the source code contains embedded single quotes.

If the in-line handler source code needs to be compiled (such as with a handler written in Java or Scala), Snowflake compiles the source
and stores the output (such as a JAR file) for use later. You can optionally specify a location for a resulting output file with the
TARGET_PATH clause.

Snowflake manages compiled output in the following ways:

* If the SQL statement (such as CREATE FUNCTION) uses TARGET_PATH to specify a location for the output file, Snowflake compiles the
  code once and keeps the compiled output for future use.
* If the SQL statement does not specify a location for the file, Snowflake re-compiles the code for each SQL statement that
  calls the function or procedure. Snowflake automatically cleans up the file after the SQL statement finishes.

> **Note:**
>
> As a best practice when using an in-line Java or Scala handler, consider specifying a value for the TARGET_PATH parameter. This can
> increase performance because Snowflake will reuse the compiled result of the handler code instead of recompiling the code for each
> call to the procedure or UDF.

> **Attention:**
>
> When handler code is defined in-line, it will be captured as metadata. If you do not wish to have the code captured as metadata, you can
> instead deploy it in other ways, such as by using a staged handler.
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated
> data is entered as metadata when using the Snowflake Service. For more information, see [Metadata fields in Snowflake](../sql-reference/metadata.md).

### In-line example with Java handler

Code in the following example creates a MYPROC stored procedure with an in-line handler in Java. The handler is the `run` method of
the `MyJavaClass` class.

```sqlexample-java
CREATE OR REPLACE PROCEDURE MYPROC(fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:latest')
  HANDLER = 'MyJavaClass.run'
  AS
  $$
    import com.snowflake.snowpark_java.*;

    public class MyJavaClass {
      public String run(Session session, String fromTable, String toTable, int count) {
        session.table(fromTable).limit(count).write().saveAsTable(toTable);
        return "SUCCESS";
      }
    }
  $$;
```

For CREATE PROCEDURE reference information, refer to [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md).

## Using a staged handler

When you’re using a staged handler, you use the IMPORTS clause to reference the handler at another location, such as a stage.
For example, you would specify the path to the handler with the IMPORTS clause of a SQL statement such as CREATE PROCEDURE or
CREATE FUNCTION.

When referencing the handler function name with the HANDLER clause, you must qualify the function name with the name of its containing
class or module. This is in contrast with an in-line handler, where you can sometimes simply reference the handler function by its name
alone.

### Staging a handler for use from a function or procedure

The following describes how to add a handler file to the environment in which your function or procedure executes.

1. If necessary, such as with a handler written in Java or Scala, compile and package the handler code for uploading to a stage. For more
   information on build tools, see [Packaging Handler Code](udf-stored-procedure-building.md).

   For a handler written in Python, you can use the handler module source.

   > When your handler code is written in Java or Scala, build a JAR file that contains all of the dependencies needed for your stored
   > procedure. Later, you’ll need to upload the JAR file to a stage and reference the JAR file from your CREATE PROCEDURE statement.
   > This process is simpler if you have fewer JAR files to upload and reference.
   >
   > * Use Maven to build a JAR file with dependencies.
   >
   >   If you are using Maven to build and package your code, you can use the
   >   [Maven Assembly Plugin](https://maven.apache.org/plugins/maven-assembly-plugin/index.html) to create a JAR file that contains
   >   all of the dependencies. For more information, see [Packaging Java or Scala Handler Code with Maven](udf-stored-procedure-build-maven.md).
   > * Use other tools to build a JAR file with dependencies.
   >
   >   If you are not using Maven, see the documentation for your build tool for instructions on building a JAR file with all of
   >   the dependencies.
   >
   >   For example, if you are using an IntelliJ IDEA project, see the
   >   [instructions on setting up an artifact configuration](https://www.jetbrains.com/help/idea/compiling-applications.html#configure_artifact).
2. Upload the handler file to a stage as described in [Making dependencies available to your code](upload-dependencies.md).

   If your handler is from [a Git repository you’re using with Snowflake](git/git-overview.md), you might
   instead need to [fetch the latest](git/git-operations.md) from your remote repository to the Snowflake
   Git repository.
3. Reference the handler file when you create the function or procedure.

   You reference the handler file in the IMPORTS clause, as described in [Referencing the dependency](upload-dependencies.md).

   Code in the following example creates a UDF called `my_udf` whose handler, `MyClass.myFunction` is written in Java. The code’s
   IMPORTS clause specifies that the handler file, called `my_handler.jar`, is at the stage `mystage` in the stage’s
   subdirectory `handlers`. At runtime, Snowflake adds the handler JAR to the classpath.

   ```sqlexample
   CREATE FUNCTION my_udf(i NUMBER)
     RETURNS NUMBER
     LANGUAGE JAVA
     IMPORTS = ('@mystage/handlers/my_handler.jar')
     HANDLER = 'MyClass.myFunction'
   ```

   For CREATE FUNCTION reference information, see [CREATE FUNCTION](../sql-reference/sql/create-function.md).

### Caveats and best practices

If you delete or rename the handler file, you can no longer call the function or procedure. If you need to update your handler file, then:

* First ensure that no calls are being made to the function or procedure that uses the handler.
* Use the [PUT](../sql-reference/sql/put.md) command to upload a new handler file. If the old handler file is still in the stage when you upload the new one,
  use the `PUT` command’s `OVERWRITE=TRUE` clause to overwrite the old handler file.

---
title: Limitations for Scala in stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-limitations.md
section: Developer Guide
---

# Limitations for Scala in stored procedures created with SQL

Stored procedures have the following limitations:

* Concurrency is not supported. For example, from within your code, you cannot submit queries
  from multiple threads. Code that concurrently issues multiple queries will produce an error.
* If you are executing your stored procedure from a task, you must specify a warehouse when creating the task. (You cannot use
  serverless compute resources to run the task.)
* Keep in mind the following limitations for using some Snowpark APIs in your stored procedure.

  + When you use [APIs that execute PUT and GET commands](../../snowpark/scala/working-with-dataframes.md) (including
    `Session.sql("PUT ...")` and `Session.sql("GET ...")`), you may write only to the `/tmp` directory in the memory-backed
    file system provided for the query calling the procedure.
  + Do not use [APIs that create new sessions](../../snowpark/scala/creating-session.md) (for example,
    `Session.builder().configs(...).create()`).
  + Using `session.jdbcConnection` (and the connection returned from it) is not supported because it can result in unsafe behavior.
* Creating named temp objects is not supported in an owner’s rights stored procedure. An owner’s rights stored procedure is a stored
  procedure that runs with the privileges of the stored procedure owner.
  For more information, refer to [caller’s rights or owner’s rights](../stored-procedures-rights.md).

---
title: Logging and tracing limitations
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-limitations.md
section: Developer Guide
---

# Logging and tracing limitations

## General limitations

There is a 1MB limit for log and trace event payloads. If the payload is over the 1MB threshold, the record in the event table will be
incomplete and only contain values for the following columns: TIMESTAMP, RECORD_TYPE, and RESOURCE_ATTRIBUTES. Currently there is no
additional indication that the threshold was exceeded.

## Event tables associated with databases

* When you use [event tables associated with databases](event-table-setting-up.md), the Snowsight trace explorer
  currently won’t show the entire span for traces with spans across multiple event tables. Instead, you can see the partial trace with
  the spans in the currently selected event table from the drop-down.
* Snowflake does not support collecting events for Snowpark Container Services when the event table is associated with a database.

---
title: Logging messages from functions and procedures
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging.md
section: Developer Guide
---

# Logging messages from functions and procedures

You can log messages (such as warning or error messages) from a stored procedure, UDF, or UDTF, including those you write
[using Snowpark APIs](../snowpark/index.md). You can access the logged messages from an event table (a type of
predefined table that captures events, including logged messages). For a list of supported handler languages, see
Supported languages.

For example, in a Java UDF, you can use the [SLF4J API](http://www.slf4j.org/) to log messages. Later, you can access those logged messages in an event
table.

> **Note:**
>
> Before you can collect log messages, you must [enable telemetry data collection](logging-tracing-enabling.md).
> When you instrument your code, Snowflake generates the data and collects it in an event table.

## Logging example

The Python code in the following example imports the `logging` module, gets a logger, and logs a message at the `INFO` level.

> **Note:**
>
> A message logged from a method that processes an input row will be logged *for every row* processed by the UDF. If the UDF is executed in a
> large table, this can result in a large number of messages in the event table.

```python
import logging

logger = logging.getLogger("mylog")

def test_logging(self):
    logger.info("This is an INFO test.")
```

## Getting started

To get started logging from handler code, follow these high-level steps:

1. [Set up an event table.](event-table-setting-up.md)

   Snowflake will use your event table to store messages logged from your handler code. An event table has
   columns [predefined by Snowflake](event-table-columns.md).
2. Get acquainted with the logging API for the handler language you’ll be using.

   see Supported languages for a list of handler languages, then view
   content about how to log from your language.
3. Add logging code to your handler.
4. Learn how to [retrieve logging data](logging-accessing-messages.md) from the event table.

## Level for log messages

You can manage the level of log event data stored in the event table by setting the log level. Before logging, use this setting to make
sure you’re capturing the log message severity.

For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Supported languages

You can log messages from code written in the following languages, including when handler code is written with
[Snowpark APIs](../snowpark/index.md).

| Language / Type | Java | JavaScript | Python | Scala | SQL |
| --- | --- | --- | --- | --- | --- |
| Stored procedure handler | ✔ | ✔ | ✔ | ✔ | ✔ \*\* |
| Streamlit app |  |  | ✔ |  |  |
| UDF handler (scalar function) | ✔ | ✔ | ✔ | ✔ |  |
| UDTF handler (table function) | ✔ | ✔ | ✔ | ✔ \* |  |

**Legend**

\*:
:   Scala UDTF handler written in Snowpark.

\*\*:
:   Snowflake Scripting used to write stored procedures.

> **Note:**
>
> Logging is not supported for [Request and response translators in external functions](../../sql-reference/external-functions-translators.md).

### Logging from handler code

To log messages, you can use functions common to your handler code language. Snowflake intercepts messages and stores them in the
event table you create.

For example, in a Java UDF, you can use the [SLF4J API](http://www.slf4j.org/) to log messages. Later, you can access those logged messages in an event table.

If you plan to log messages when errors occur, you should log them from within the construct for handling errors in the language
that you are using. For example, in a Java UDF, call the method for logging a message in the `catch` block where you handle
the exception.

The following table lists handler languages supported for logging, along with links to content on logging from code.

| Language | Logging Library | Documentation |
| --- | --- | --- |
| Java | SLF4J API | [Logging messages in Java](logging-java.md) |
| JavaScript | Snowflake JavaScript API `snowflake` object | [Logging messages in JavaScript](logging-javascript.md) |
| Python | Standard Library `logging` module | [Logging messages in Python](logging-python.md) |
| Scala | SLF4J API | [Logging messages in Scala](logging-scala.md) |
| Snowflake Scripting | Snowflake SYSTEM$LOG function. | [Logging messages in Snowflake Scripting](logging-snowflake-scripting.md) |

## Viewing log messages

You can view the log messages either through Snowsight or by querying the event table in which log entries are stored. For more
information, see [Viewing log messages](logging-accessing-messages.md).

---
title: Logging messages in Java
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-java.md
section: Developer Guide
---

# Logging messages in Java

You can log messages from a function or procedure handler written in Java by using the [SLF4J API](http://www.slf4j.org/). When you’ve
[set up an event table](event-table-setting-up.md) to store log entries, Snowflake stores log
entries generated by your handler code in the table.

You can use the [SLF4J API](http://www.slf4j.org/) included with the Snowflake Telemetry library included on Snowflake. To do so, include the following
value in the PACKAGES clause when you create the function or procedure: `com.snowflake:telemetry:latest`.

For information on including the Telemetry library when packaging your code with Maven, see
[Setting up your Java and Scala environment to use the Telemetry class](telemetry-build-maven.md).

> **Note:**
>
> Using the Snowflake Telemetry Library adds other libraries to your function or procedure’s execution environment. For more information,
> see [Snowflake telemetry package dependencies](telemetry-package-dependencies.md).

> **Note:**
>
> SLF4J does not support logging messages at the `FATAL` level. For handlers written in Java or Scala, the `FATAL` level is
> treated as the `ERROR` level.
>
> For example, if you set the `LOG_LEVEL` parameter to `FATAL`, `ERROR`-level messages from a Java or Scala
> handler are ingested.

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the logging level set so that the messages you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Adding custom attributes

When you create a log entry, you can add your own attributes in key-value pairs. Snowflake saves these custom attributes to the event
table’s [RECORD_ATTRIBUTES column](event-table-columns.md).

To add custom attributes, call methods of the slf4j fluent API, such as `Logger.atInfo` and `Logger.atError`. Use these
methods to set key-value pairs in the log entry. Each returns an [org.slf4j.spi.LoggingEventBuilder](https://www.slf4j.org/apidocs/org/slf4j/spi/LoggingEventBuilder.html), which you can use to set the
log message.

Code in the following example logs a message “Logging with attributes” to the event table’s VALUE column. It also adds a custom
attribute to the RECORD_ATTRIBUTES column.

```sqlexample-java
CREATE OR REPLACE PROCEDURE do_logging_java()
RETURNS VARCHAR
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:telemetry:latest','com.snowflake:snowpark:latest')
HANDLER = 'JavaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger;
  import org.slf4j.LoggerFactory;
  import com.snowflake.snowpark_java.Session;

  public class JavaLoggingHandler {
    private static Logger logger = LoggerFactory.getLogger(JavaLoggingHandler.class);

    public String doThings(Session session) {
      logger.atInfo().addKeyValue("custom1", "value1").setMessage("Logging with attributes").log();
      return "SUCCESS";
    }
  }
$$;
```

Output of this `Logger.atInfo` call appears in the event table as follows. Note that the RECORD_ATTRIBUTES column will include
attributes that Snowflake adds automatically.

```output
------------------------------------------------------------------
| VALUE                     | RECORD_ATTRIBUTES                  |
------------------------------------------------------------------
| "Logging with attributes" | {                                  |
|                           |   "custom1": "value1",             |
|                           |   "thread.name": "Thread-5"        |
|                           | }                                  |
------------------------------------------------------------------
```

## Java example

Code in the following example imports references the Snowflake Telemetry library and from it gets a logger. It logs a message at the
`INFO` level. It also logs an error for an exception.

For more information about the methods you can use to log at specific levels, see [SLF4J methods](https://www.slf4j.org/apidocs/org/slf4j/Logger.html).

```sqlexample-java
CREATE OR REPLACE PROCEDURE do_logging()
RETURNS VARCHAR
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES=('com.snowflake:snowpark:latest', 'com.snowflake:telemetry:latest')
HANDLER = 'JavaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger;
  import org.slf4j.LoggerFactory;
  import com.snowflake.snowpark_java.Session;

  public class JavaLoggingHandler {
    private static Logger logger = LoggerFactory.getLogger(JavaLoggingHandler.class);

    public JavaLoggingHandler() {
      logger.info("Logging from within the constructor.");
    }

    public String doThings(Session session) {
      logger.info("Logging from method start.");

      try {
        throwException();
      } catch (Exception e) {
        logger.error("Logging an error: " + e.getMessage());
        return "ERROR";
      }
      return "SUCCESS";
    }

    // Simulate a thrown exception to catch.
    private void throwException() throws Exception {
      throw new Exception("Something went wrong.");
    }
  }
$$
;
```

You can access log messages by executing a SELECT command on the event table. For more information, see
[Viewing log messages](logging-accessing-messages.md).

Code in the following example queries the event table where the log messages are stored. The query reports on the severity and message of
each log entry from the handler class.

```sqlexample
SET event_table_name='my_db.public.my_event_table';

SELECT
  RECORD['severity_text'] AS SEVERITY,
  VALUE AS MESSAGE
FROM
  IDENTIFIER($event_table_name)
WHERE
  SCOPE['name'] = 'JavaLoggingHandler'
  AND RECORD_TYPE = 'LOG';
```

The preceding example generates the following output.

```output
--------------------------------------------------------
| SEVERITY | MESSAGE                                   |
--------------------------------------------------------
| "INFO"   | "Logging from within the constructor."    |
--------------------------------------------------------
| "INFO"   | "Logging from method start."              |
--------------------------------------------------------
| "ERROR"  | "Logging an error: Something went wrong." |
--------------------------------------------------------
```

---
title: Logging messages in JavaScript
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-javascript.md
section: Developer Guide
---

# Logging messages in JavaScript

You can log messages from a function or procedure handler written in JavaScript by using the `snowflake` object included with the
Snowflake JavaScript API. When you’ve set up an event table to store log entries, Snowflake stores log entries generated by your handler
code in the table. For reference about the JavaScript API, see [JavaScript stored procedures API](../stored-procedure/stored-procedures-api.md).

Before logging from code, be sure you have the logging level set so that the messages you want are stored in the event table. For more
information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

> **Note:**
>
> Before you can begin logging messages, you must set up an event table. For more information, see
> [Event table overview](event-table-setting-up.md).

You can access log messages by executing a SELECT command on the event table. For more information, see
[Viewing log messages](logging-accessing-messages.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Code in the following example uses the included `snowflake` object to log messages at each of the supported levels.
Note that a message logged from a method that processes an input row will be logged *for every row* processed by the UDF. If the UDF is
executed in a large table, this can result in a large number of messages in the event table.

```javascript
snowflake.log("info", "Information-level message");
snowflake.log("error", "Error message");
snowflake.log("warn", "Warning message");
snowflake.log("debug", "Debug message");
snowflake.log("trace", "Trace message");
snowflake.log("fatal", "Fatal message");
```

## Adding custom attributes

When you create a log entry, you can add your own attributes in key-value pairs. Snowflake saves these custom attributes to the event
table’s [RECORD_ATTRIBUTES column](event-table-columns.md).

To add custom attributes when calling the `snowflake.log` method, assemble the key-value pairs in JSON that you pass as a third
argument to the `log` function.

Code in the following example logs a message “Logging with attributes” to the event table’s VALUE column. It also adds two custom
attributes to the RECORD_ATTRIBUTES column.

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE do_logging_javascript()
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
AS $$
  let log_attributes = {
    "custom1": "value1",
    "custom2": "value2"
  }
  snowflake.log("info", "Logging with attributes", log_attributes)
  return "success";
$$;
```

Output of this `log` call appears in the event table as follows. Note that the RECORD_ATTRIBUTES column will include
attributes that Snowflake adds automatically.

```output
------------------------------------------------------------------
| VALUE                     | RECORD_ATTRIBUTES                  |
------------------------------------------------------------------
| "Logging with attributes" | {                                  |
|                           |   "custom1": "value1",             |
|                           |   "custom2": "value2"              |
|                           | }                                  |
------------------------------------------------------------------
```

---
title: Logging messages in Python
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-python.md
section: Developer Guide
---

# Logging messages in Python

You can log messages from a function or procedure handler written in Python by using
[logging](https://docs.python.org/library/logging.html), the logging module in Python’s standard library. When you’ve set up an event
table to store log entries, Snowflake stores log entries generated by your handler code in the table.

For more information about logging levels supported by Python, see the
[logging levels documentation](https://docs.python.org/3/library/logging.html#levels). Note that Snowflake treats two of the Python
logging levels in a particular way:

* The Python `CRITICAL` level will be treated as FATAL.
* The Python `NOTSET` level will be treated as TRACE.

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the logging level set so that the messages you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Overriding log threshold levels with Python

You can use Python handler code to override the log threshold levels that have been
[log level set with SQL](telemetry-levels.md). When you set the log level with Python, log entries will
use the [logging levels defined by Python](https://docs.python.org/3/library/logging.html#levels).

By setting log levels in Python, you can do the following:

* Override the threshold set for the Snowflake session or for objects such as the procedure or UDF.
* Set thresholds scoped to specified Python packages.

  For example, you can use the logger name you set (and which is stored in the
  [event table](event-table-columns.md)) to set a threshold for that logger with Python.

Python code in the following example sets the log level for the Snowpark `session` package to DEBUG.

```python
session_logger = logging.getLogger('snowflake.snowpark.session')
session_logger.setLevel(logging.DEBUG)
```

### Using the logger name to set logging level

You can use the logger name recorded in the event table to set a threshold for log entries from that logger. This can be useful when you want
to set a logger’s threshold so that it filters out unwanted log entries above a particular level.

To do that, you’d first query the event table to discover the logger name associated with the entries for which you want to capture a different
logging level. Then, using that logger name, you’d set the log level to the threshold you want.

Code in the following example queries for log entries, including the logger name in the returned data. You can get the name as the value
of the [Scope column](event-table-columns.md).

```sqlexample
SET event_table_name='my_db.public.my_event_table';

SELECT
  TIMESTAMP as time,
  RECORD['severity_text'] as log_level,
  SCOPE['name'] as logger_name,
  VALUE as message
FROM
  IDENTIFIER($event_table_name)
WHERE
  RECORD_TYPE = 'LOG';
```

This query might return many entries from several loggers. If, after looking through the results, you decide that you’re getting many
`INFO` messages that you don’t want from the numpy logger, you can use Python to set that logger’s threshold to capture log entries
at the `ERROR` level and above.

```python
numpy_logger = logging.getLogger('numpy_logs')
numpy_logger.setLevel(logging.ERROR)
```

For more about querying the event table, see [Viewing log messages](logging-accessing-messages.md).

## Adding custom attributes

When you create a log entry, you can add your own attributes in key-value pairs. Snowflake saves these custom attributes to the event
table’s [RECORD_ATTRIBUTES column](event-table-columns.md).

To add custom attributes when calling one of the logging level functions — including `logger.info`, `logger.error`, and so
on — add an `extra` keyword argument, setting the argument’s value to the key-value pairs to record as custom attributes.

Code in the following example logs a message “Logging with attributes” to the event table’s VALUE column. It also adds two custom
attributes to the RECORD_ATTRIBUTES column.

```sqlexample-python
CREATE OR REPLACE PROCEDURE do_logging_python()
RETURNS VARCHAR
LANGUAGE PYTHON
PACKAGES = ('snowflake-snowpark-python')
RUNTIME_VERSION = 3.12
HANDLER = 'do_things'
AS $$
import logging

logger = logging.getLogger("python_logger")

def do_things(session):

  logger.info("Logging with attributes in SP", extra = {'custom1': 'value1', 'custom2': 'value2'})

  return "SUCCESS"
$$;
```

Output of the `logger.info` call appears in the event table as follows. Note that the RECORD_ATTRIBUTES column will include
attributes that Snowflake adds automatically.

```output
---------------------------------------------------------------------
| VALUE                        | RECORD_ATTRIBUTES                  |
---------------------------------------------------------------------
| "Logging with attributes in" | {                                  |
|                              |   "code.filepath": "_udf_code.py", |
|                              |   "code.function": "do_things",    |
|                              |   "code.lineno": 10,               |
|                              |   "custom1": "value1",             |
|                              |   "custom2": "value2"              |
|                              | }                                  |
---------------------------------------------------------------------
```

## Python examples

The following sections provide examples of adding support for logging from Python code.

### Stored procedure example

Code in the following example imports the `logging` module, gets a logger, and logs a message at the `INFO` level.

For more information about logging levels supported by Python, see the
[logging levels documentation](https://docs.python.org/3/library/logging.html#levels).

```sqlexample-python
CREATE OR REPLACE PROCEDURE do_logging()
RETURNS VARCHAR
LANGUAGE PYTHON
PACKAGES=('snowflake-snowpark-python')
RUNTIME_VERSION = 3.12
HANDLER='do_things'
AS $$
import logging

logger = logging.getLogger("python_logger")
logger.info("Logging from Python module.")

def do_things(session):
  logger.info("Logging from Python function start.")

  try:
    throw_exception()
  except Exception:
    logger.error("Logging an error from Python handler: ")
    return "ERROR"

  return "SUCCESS"

def throw_exception():
  raise Exception("Something went wrong.")

$$;
```

You can access log messages by executing a SELECT command on the event table. For more information, see
[Viewing log messages](logging-accessing-messages.md).

Code in the following example queries the event table where the log messages are stored. The query reports on the severity and message of
each log entry from the handler class.

```sqlexample
SET event_table_name='my_db.public.my_event_table';

SELECT
  RECORD['severity_text'] AS SEVERITY,
  VALUE AS MESSAGE
FROM
  IDENTIFIER($event_table_name)
WHERE
  SCOPE['name'] = 'python_logger'
  AND RECORD_TYPE = 'LOG';
```

The preceding example generates the following output.

```output
---------------------------------------------------------------------------
| SEVERITY | MESSAGE                                                      |
---------------------------------------------------------------------------
| "INFO"   | "Logging from Python module."                                |
---------------------------------------------------------------------------
| "INFO"   | "Logging from Python function start."                        |
---------------------------------------------------------------------------
| "ERROR"  | "Logging an error from Python handler."                      |
---------------------------------------------------------------------------
```

### Streamlit example

Code in the following example imports the `logging` module, gets a logger, and logs a message at the `INFO` level.

For more information about logging levels supported by Python, see the
[logging levels documentation](https://docs.python.org/3/library/logging.html#levels).

```python
import streamlit as st
import logging

logger = logging.getLogger('app_logger')

st.title("Streamlit logging example")

hifives_val = st.slider("Number of high-fives", min_value=0, max_value=90, value=60)

if st.button("Submit"):
    logger.info(f"Submitted with high-fives: {hifives_val}")
```

---
title: Logging messages in Scala
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-scala.md
section: Developer Guide
---

# Logging messages in Scala

You can log messages from a function or procedure handler written in Scala by using the [SLF4J API](http://www.slf4j.org/). When you’ve
[set up an event table](event-table-setting-up.md) to store log entries, Snowflake stores log
entries generated by your handler code in the table.

You can use the [SLF4J API](http://www.slf4j.org/) included with the Snowflake Telemetry library included on Snowflake. To do so, include the following
value in the PACKAGES clause when you create the function or procedure: `com.snowflake:telemetry:latest`.

For information on including the Telemetry library when packaging your code with Maven, see
[Setting up your Java and Scala environment to use the Telemetry class](telemetry-build-maven.md).

Snowflake supports the following versions of Scala:

[Preview Feature](../../release-notes/preview-features.md) — Open

Support for version 2.13 is in preview. Available to all accounts.

* 2.13
* 2.12

For more information, see [Writing code to support different Scala versions](../scala-version-differences.md).

> **Note:**
>
> Using the Snowflake Telemetry Library adds other libraries to your function or procedure’s execution environment. For more information,
> see [Snowflake telemetry package dependencies](telemetry-package-dependencies.md).

> **Note:**
>
> SLF4J does not support logging messages at the `FATAL` level. For handlers written in Java or Scala, the `FATAL` level is
> treated as the `ERROR` level.
>
> For example, if you set the `LOG_LEVEL` parameter to `FATAL`, `ERROR`-level messages from a Java or Scala
> handler are ingested.

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

Before logging from code, you must:

* Set up an event table to collect messages logged from handler code.

  For more information, see [Event table overview](event-table-setting-up.md).
* Be sure you have the logging level set so that the messages you want are stored in the event table.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Adding custom attributes

When you create a log entry, you can add your own attributes in key-value pairs. Snowflake saves these custom attributes to the event
table’s [RECORD_ATTRIBUTES column](event-table-columns.md).

To add custom attributes, call methods of the slf4j fluent API, such as `Logger.atInfo` and `Logger.atError`. Use these
methods to set key-value pairs in the log entry. Each returns an [org.slf4j.spi.LoggingEventBuilder](https://www.slf4j.org/apidocs/org/slf4j/spi/LoggingEventBuilder.html), which you can use to set the
log message.

Code in the following example logs a message “Logging with attributes” to the event table’s VALUE column. It also adds a custom
attribute to the RECORD_ATTRIBUTES column.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_logging_scala()
RETURNS VARCHAR
LANGUAGE SCALA
RUNTIME_VERSION = '2.12'
PACKAGES=('com.snowflake:telemetry:latest', 'com.snowflake:snowpark_2.12:latest')
HANDLER = 'ScalaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger
  import org.slf4j.LoggerFactory
  import com.snowflake.snowpark.Session

  class ScalaLoggingHandler {
    private val logger: Logger = LoggerFactory.getLogger(getClass)

    def doThings(session: Session): String = {
      logger.atInfo().addKeyValue("custom1", "value1").setMessage("Logging with attributes").log();
      return "SUCCESS"
    }
  }
$$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_logging_scala()
RETURNS VARCHAR
LANGUAGE SCALA
RUNTIME_VERSION = '2.13'
PACKAGES=('com.snowflake:telemetry:latest', 'com.snowflake:snowpark_2.13:latest')
HANDLER = 'ScalaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger
  import org.slf4j.LoggerFactory
  import com.snowflake.snowpark.Session

  class ScalaLoggingHandler {
    private val logger: Logger = LoggerFactory.getLogger(getClass)

    def doThings(session: Session): String = {
      logger.atInfo().addKeyValue("custom1", "value1").setMessage("Logging with attributes").log();
      return "SUCCESS"
    }
  }
$$;
```

Output of this `Logger.atInfo` call appears in the event table as follows. Note that the RECORD_ATTRIBUTES column will include
attributes that Snowflake adds automatically.

```output
------------------------------------------------------------------
| VALUE                     | RECORD_ATTRIBUTES                  |
------------------------------------------------------------------
| "Logging with attributes" | {                                  |
|                           |   "custom1": "value1",             |
|                           |   "thread.name": "Thread-5"        |
|                           | }                                  |
------------------------------------------------------------------
```

## Scala example

Code in the following example imports references the Snowflake Telemetry library and from it gets a logger. It logs a message at the
`INFO` level. It also logs an error for an exception.

For more information about the methods you can use to log at specific levels, see [SLF4J methods](https://www.slf4j.org/apidocs/org/slf4j/Logger.html).

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_logging()
RETURNS VARCHAR
LANGUAGE SCALA
RUNTIME_VERSION = '2.12'
PACKAGES=('com.snowflake:snowpark_2.12:latest', 'com.snowflake:telemetry:latest')
HANDLER = 'ScalaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger
  import org.slf4j.LoggerFactory
  import com.snowflake.snowpark.Session

  class ScalaLoggingHandler {
    private val logger: Logger = LoggerFactory.getLogger(getClass)

    logger.info("Logging from within the Scala constructor.")

    def doThings(session: Session): String = {
      logger.info("Logging from Scala method start.")

      try {
        throwException
      } catch {
        case e: Exception => logger.error("Logging an error from Scala handler: " + e.getMessage())
        return "ERROR"
      }
      return "SUCCESS"
    }

    // Simulate a thrown exception to catch.
    @throws(classOf[Exception])
    private def throwException = {
      throw new Exception("Something went wrong.")
    }
  }
$$
;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE do_logging()
RETURNS VARCHAR
LANGUAGE SCALA
RUNTIME_VERSION = '2.13'
PACKAGES=('com.snowflake:snowpark_2.13:latest', 'com.snowflake:telemetry:latest')
HANDLER = 'ScalaLoggingHandler.doThings'
AS
$$
  import org.slf4j.Logger
  import org.slf4j.LoggerFactory
  import com.snowflake.snowpark.Session

  class ScalaLoggingHandler {
    private val logger: Logger = LoggerFactory.getLogger(getClass)

    logger.info("Logging from within the Scala constructor.")

    def doThings(session: Session): String = {
      logger.info("Logging from Scala method start.")

      try {
        throwException
      } catch {
        case e: Exception => logger.error("Logging an error from Scala handler: " + e.getMessage())
        return "ERROR"
      }
      return "SUCCESS"
    }

    // Simulate a thrown exception to catch.
    @throws(classOf[Exception])
    private def throwException = {
      throw new Exception("Something went wrong.")
    }
  }
$$
;
```

You can access log messages by executing a SELECT command on the event table. For more information, see
[Viewing log messages](logging-accessing-messages.md).

Code in the following example queries the event table where the log messages are stored. The query reports on the severity and message of
each log entry from the handler class.

```sqlexample
SET event_table_name='my_db.public.my_event_table';

SELECT
  RECORD['severity_text'] AS SEVERITY,
  VALUE AS MESSAGE
FROM
  IDENTIFIER($event_table_name)
WHERE
  SCOPE['name'] = 'ScalaLoggingHandler'
  AND RECORD_TYPE = 'LOG';
```

The preceding example generates the following output.

```output
---------------------------------------------------------------------------
| SEVERITY | MESSAGE                                                      |
---------------------------------------------------------------------------
| "INFO"   | "Logging from within the Scala constructor."                 |
---------------------------------------------------------------------------
| "INFO"   | "Logging from Scala method start."                           |
---------------------------------------------------------------------------
| "ERROR"  | "Logging an error from Scala handler: Something went wrong." |
---------------------------------------------------------------------------
```

---
title: Logging messages in Snowflake Scripting
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-snowflake-scripting.md
section: Developer Guide
---

# Logging messages in Snowflake Scripting

You can log messages from a stored procedure handler written in Snowflake Scripting by using the Snowflake
[SYSTEM$LOG, SYSTEM$LOG_<level> (for Snowflake Scripting)](../../sql-reference/functions/system_log.md) function. When you’ve set up an event table to store log entries, Snowflake stores log entries
generated by your handler code in the table.

Before logging from code, be sure you have the logging level set so that the messages you want are stored in the event table. For more
information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

> **Note:**
>
> Before you can begin logging messages, you must set up an event table. For more information, see
> [Event table overview](event-table-setting-up.md).

You can access log messages by executing a SELECT command on the event table. For more information, see
[Viewing log messages](logging-accessing-messages.md).

For general information about setting up logging and retrieving messages in Snowflake, see
[Logging messages from functions and procedures](logging.md).

## Snowflake Scripting example

Code in the following example uses the SYSTEM$LOG function to log messages at each of the supported levels. Note that a message logged
from code that processes an input row will be logged *for every row* processed by the handler. If the handler is executed in a large table,
this can result in a large number of messages in the event table.

```sqlexample
-- The following calls are equivalent.
-- Both log information-level messages.
SYSTEM$LOG('info', 'Information-level message');
SYSTEM$LOG_INFO('Information-level message');

-- The following calls are equivalent.
-- Both log error messages.
SYSTEM$LOG('error', 'Error message');
SYSTEM$LOG_ERROR('Error message');

-- The following calls are equivalent.
-- Both log warning messages.
SYSTEM$LOG('warning', 'Warning message');
SYSTEM$LOG_WARN('Warning message');

-- The following calls are equivalent.
-- Both log debug messages.
SYSTEM$LOG('debug', 'Debug message');
SYSTEM$LOG_DEBUG('Debug message');

-- The following calls are equivalent.
-- Both log trace messages.
SYSTEM$LOG('trace', 'Trace message');
SYSTEM$LOG_TRACE('Trace message');

-- The following calls are equivalent.
-- Both log fatal messages.
SYSTEM$LOG('fatal', 'Fatal message');
SYSTEM$LOG_FATAL('Fatal message');
```

## Automatically add log messages about blocks and child jobs

You can automatically log the following additional information about the execution of a Snowflake Scripting
stored procedure:

* BEGIN/END of a Snowflake Scripting block.
* BEGIN/END of a child job request.

Automatic logging is intended for the following use cases:

* You want to generate the additional log messages without modifying the body of the stored procedure.
* You want comprehensive information about the execution of the stored procedure.
* You want more visibility into stored procedure execution to make it easier to develop and debug it without
  manually adding logging code in the procedure.

To automatically log these Snowflake Scripting messages for a stored procedure, set the [AUTO_EVENT_LOGGING](../../sql-reference/parameters.md) parameter
for the stored procedure to `LOGGING` or `ALL` using the [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) command. When
you set this parameter to `ALL`, additional [trace events](tracing-snowflake-scripting.md) are also emitted automatically
for the stored procedure.

> **Important:**
>
> The additional information is added to the event table only if the effective [LOG_LEVEL](../../sql-reference/parameters.md) is set
> to `TRACE`. For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

For example, create a simple table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_auto_event_logging (id INTEGER, num NUMBER(12, 2));

INSERT INTO test_auto_event_logging (id, num) VALUES
  (1, 11.11),
  (2, 22.22);
```

Next, create a stored procedure named `auto_event_logging_sp`. This sample stored procedure updates a table row and
then queries the table:

```sqlexample
CREATE OR REPLACE PROCEDURE auto_event_logging_sp(
  table_name VARCHAR,
  id_val INTEGER,
  num_val NUMBER(12, 2))
RETURNS TABLE()
LANGUAGE SQL
AS
$$
BEGIN
  UPDATE IDENTIFIER(:table_name)
    SET num = :num_val
    WHERE id = :id_val;
  LET res RESULTSET := (SELECT * FROM IDENTIFIER(:table_name) ORDER BY id);
  RETURN TABLE(res);
EXCEPTION
  WHEN statement_error THEN
    res := (SELECT :sqlcode sql_code, :sqlerrm error_message, :sqlstate sql_state);
    RETURN TABLE(res);
END;
$$
;
```

The following examples set the AUTO_EVENT_LOGGING parameter for the stored procedure:

```sqlexample
ALTER PROCEDURE auto_event_logging_sp(VARCHAR, INTEGER, NUMBER)
  SET AUTO_EVENT_LOGGING = 'LOGGING';
```

```sqlexample
ALTER PROCEDURE auto_event_logging_sp(VARCHAR, INTEGER, NUMBER)
  SET AUTO_EVENT_LOGGING = 'ALL';
```

Call the stored procedure:

```sqlexample
CALL auto_event_logging_sp('test_auto_event_logging', 2, 33.33);
```

```output
+----+-------+
| ID |   NUM |
|----+-------|
|  1 | 11.11 |
|  2 | 33.33 |
+----+-------+
```

Query the event table for messages logged by the stored procedure named `auto_event_logging_sp`. For each message,
print out the timestamp, log level, and text of the message.

```sqlexample
SELECT
    TIMESTAMP as time,
    RECORD['severity_text'] as severity,
    VALUE as message
  FROM
    my_db.public.my_events
  WHERE
    RESOURCE_ATTRIBUTES['snow.executable.name'] LIKE '%AUTO_EVENT_LOGGING_SP%'
    AND RECORD_TYPE = 'LOG';
```

```output
+-------------------------+----------+----------------------------------+
| TIME                    | SEVERITY | MESSAGE                          |
|-------------------------+----------+----------------------------------|
| 2024-10-25 20:42:24.134 | "TRACE"  | "Entering outer block at line 2" |
| 2024-10-25 20:42:24.135 | "TRACE"  | "Entering block at line 2"       |
| 2024-10-25 20:42:24.135 | "TRACE"  | "Starting child job"             |
| 2024-10-25 20:42:24.633 | "TRACE"  | "Ending child job"               |
| 2024-10-25 20:42:24.633 | "TRACE"  | "Starting child job"             |
| 2024-10-25 20:42:24.721 | "TRACE"  | "Ending child job"               |
| 2024-10-25 20:42:24.721 | "TRACE"  | "Exiting with return at line 7"  |
+-------------------------+----------+----------------------------------+
```

---
title: Logging, tracing, and metrics
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-overview.md
section: Developer Guide
---

# Logging, tracing, and metrics

You can record the activity of your Snowflake function and procedure handler code (including code you write
[using Snowpark APIs](../snowpark/index.md)) by capturing log messages and trace events from the code as it executes.
Once you’ve collected the data, you can query it with SQL to analyze the results.

Logging, tracing, and metrics are among the observability features Snowflake provides to make it easier for you to debug and optimize
applications. Snowflake captures observability data in a structure based on the [OpenTelemetry](https://opentelemetry.io/) standard.

In particular, you can record and analyze the following:

* [Log messages](logging.md) — Independent, detailed messages with information about the state of a
  specific piece of your code.
* [Metrics data](metrics.md) — CPU and memory metrics that Snowflake generates.
* [Trace events](tracing.md) — Structured data you can use to get information spanning and grouping
  multiple parts of your code.

## Get started

Use the following high-level steps to begin capturing and using log and trace data.

1. Ensure that you have an active event table. You can do one of the following:

   * [Use the default event table](event-table-setting-up.md) that is active by default.
   * [Create and set as active an event table](event-table-setting-up.md).

   Snowflake collects telemetry data from your code in the event table.
2. Set telemetry levels so that data is collected.

   With levels, you can specify which data – and how much data – is collected. Make sure the levels are set correctly.
3. Begin emitting log or trace data from handler code.

   Once you’ve created an event table and associated it with your account, you can use an API in your handler’s language to emit log
   messages. After you’ve captured log and trace data, you can query the data to analyze the results.

   For more information on instrumenting your code, see the following:

   * [Logging messages from functions and procedures](logging.md)
   * [Trace events for functions and procedures](tracing.md)
4. Query the event table to analyze collected log and trace data.

   For more information, see the following:

   * [Viewing log messages](logging-accessing-messages.md)
   * [Viewing metrics data](metrics-viewing-data.md)
   * [Viewing trace data](tracing-accessing-events.md)

## Set telemetry levels

You can manage the level of telemetry data stored in the event table — such as log, trace, and metrics data — by setting the level
for each type of data. Use level settings to ensure that you’re capturing the amount and kind of data you want.

For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Compare log messages and trace events

The following table compares the characteristics and benefits of log messages and trace events.

| Characteristic | Log entries | Trace events |
| --- | --- | --- |
| Intended use | Record detailed but unstructured information about the state of your code. Use this information to understand what happened during a particular invocation of your function or procedure. | Record a brief but structured summary of each invocation of your code. Aggregate this information to understand behavior of your code at a high level. |
| Structure as a payload | None. A log entry is just a string. | Structured with attributes you can attach to trace events. Attributes are key-value pairs that can be easily queried with a SQL query. |
| Supports grouping | No. Each log entry is an independent event. | Yes. Trace events are organized into spans. A span can have its own attributes. |
| Quantity limits | Unlimited. All log entries emitted by your code are ingested into the event table. | The number of trace events per span is capped at 128. There is also a limit on the number of span attributes. |
| Complexity of queries against recorded data | Relatively high. Your queries must parse each log entry to extract meaningful information from it. | Relatively low. Your queries can take advantage of the structured nature of trace events. |

---
title: Making dependencies available to your code
source: https://docs.snowflake.com/en/developer-guide/upload-dependencies.md
section: Developer Guide
---

# Making dependencies available to your code

When your user-defined function (UDF) or stored procedure depends on code or files that are external to the UDF or procedure, you can make
the dependency available to the UDF or procedure from a stage or from a Git repository clone in Snowflake from a
[remote Git repository that Snowflake is using](git/git-overview.md).

For example, you might want your UDF or procedure to have access to the following:

* Python handler code in a module.
* Java or Scala handler code compiled and packaged in a JAR.
* Dependency code written in Java, Python, or Scala.
* Files to be read by your handler code and whose name and location is known when you create the UDF. This can be useful with configuration
  files, for example.

> **Note:**
>
> You can also use the PACKAGES clause of [CREATE FUNCTION](../sql-reference/sql/create-function.md) or [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) to
> import libraries that are included in Snowflake.

## High-level steps

Follow these steps to make dependencies available to your function or procedure.

1. Choose or create a stage that’s available to your handler.
2. Upload the dependency to the stage.
3. Reference the dependency with IMPORTS when you create the function or procedure.

## Choosing or creating a stage for dependency files

To make your dependency file available to a function or procedure, the file will need to be on a stage where it can be
reached at runtime. The owner of the function or procedure must have the [READ privilege](../user-guide/security-access-control-privileges.md)
to the stage.

For more about creating stages, see [CREATE STAGE](../sql-reference/sql/create-stage.md).

You can also set up Snowflake to
[use a remote Git repository](git/git-overview.md), creating a Git repository clone with a full clone
of the remote repository’s files.

> **Note:**
>
> You can’t execute the `PUT` command through the Snowflake GUI; you can use SnowSQL to execute `PUT`. For an example `PUT` command
> to copy a .jar file to a stage, see Uploading the dependency to the stage in this topic.

Choose or create one of the following for your dependency:

* A Git repository clone in Snowflake with files from the remote repository.

  For more information, see [Using a Git repository in Snowflake](git/git-overview.md).
* A user or named internal stage.

  If you plan to use the `PUT` command to upload the files, use a named internal stage. For more on choosing an internal stage type,
  see [Choosing an internal stage for local files](../user-guide/data-load-local-file-system-create-stage.md).
* An external stage.

  External stages are locations associated with external storage services, as described in [CREATE STAGE](../sql-reference/sql/create-stage.md). The
  `PUT` command does not support uploading files to external stages.

If you don’t already have a user stage, named internal stage, or external stage, you can create one by executing
[CREATE STAGE](../sql-reference/sql/create-stage.md). For example, the following command creates a new internal stage named `mystage`:

```sqlexample
CREATE STAGE mystage;
```

> **Note:**
>
> Snowflake does not currently support using a table stage to store handler code.

## Uploading the dependency to the stage

Upload the files required for your stored procedure to a stage.

If your handler is from [a Git repository you’re using with Snowflake](git/git-overview.md), you might
instead need to [fetch the latest](git/git-operations.md) from your remote repository to the Snowflake
Git repository clone.

If you’re using an external stage, use that storage service’s means for uploading files. If you’re using an internal stage, you can copy
the file from a local drive to the stage by using the `PUT` command. For command reference, see [PUT](../sql-reference/sql/put.md). For
information on staging files with PUT, see [Staging data files from a local file system](../user-guide/data-load-local-file-system-stage.md).

Use the `PUT` command to upload files to the stage.

Code in the following example uploads `myjar.jar` to a stage called `mystage`, overwriting an existing file of the same
name if it exists.

```sqlexample
PUT file:///Users/MyUserName/MyCompiledJavaCode.jar
  @mystage
  AUTO_COMPRESS = FALSE
  OVERWRITE = TRUE
  ;
```

> **Note:**
>
> If you omit `AUTO_COMPRESS = FALSE`, the PUT command automatically compresses the file. The name of the compressed
> file on the stage will be `myjar.jar.gz`. Later, when you execute a command such as
> [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md), you will need to specify the filename
> with this `.gz` extension in the command’s IMPORTS clause.

> **Note:**
>
> The PUT command does not support uploading files to external stages.
> To upload files to external stages, use the utilities provided by the cloud service.

## Referencing the dependency

To make a function or procedure you’re creating aware of the dependency’s location, specify the dependency’s location in the IMPORTS
clause of the SQL you use to create the function or procedure.

If you have multiple dependency files, such as when you have third-party libraries on which a handler depends, you can specify the stage
location and file path-and-name of all dependency files as values of the IMPORTS clause.

Code in the following example creates a procedure called `MYPROC`, specifying that the file `MyCompiledJavaCode.jar` (on
the `mystage` stage) should be included in the procedure’s execution environment. In this case, `MyCompiledJavaCode.jar`
contains the procedure’s handler – the compiled code for `MyJavaClass.run`.

```sqlexample
CREATE OR REPLACE PROCEDURE MYPROC(value INT, fromTable STRING, toTable STRING, count INT)
  RETURNS INT
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:latest')
  IMPORTS = ('@mystage/MyCompiledJavaCode.jar')
  HANDLER = 'MyJavaClass.run';
```

---
title: Managing connections
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-connect.md
section: Developer Guide
---

# Managing connections

To execute statements against Snowflake, you first need to establish a connection. The Snowflake Node.js Driver lets you
establish connections as follows:

* Create a single connection
* Create a pool of connections
* Connect through a proxy
* Connect through an authenticated proxy

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

## Creating a single connection

To create a single connection to Snowflake:

1. Call `snowflake.createConnection` to create a new `Connection` object, and pass in a JavaScript object that
   specifies the [connection options](nodejs-driver-options.md).
2. Using the `Connection` object, call the `connect` method to establish a connection.

   To handle connection errors, pass in a callback function that has the following signature:

   ```javascript
   function(err, conn)
   ```

   where:

   * `err` is a JavaScript `Error` object.
   * `conn` is the current `Connection` object.

   If an error occurs during connection, the `connect` method passes an `Error` object to your callback function.
   You can use this object in your callback function to get details about the error. If you need information about the current
   `Connection` object, you can use the `conn` argument passed to your callback function.

The following example establishes a connection and uses a password for authentication. To use other authentication methods, see
[Authentication options](nodejs-driver-options.md).

> ```javascript
> // Load the Snowflake Node.js driver.
> import snowflake from 'snowflake-sdk';
> ```
>
> ```javascript
> // Create a Connection object that we can use later to connect.
> const connection = snowflake.createConnection({
>   account: account,
>   username: user,
>   password: password,
>   application: application,
> });
> ```
>
> ```javascript
> // Try to connect to Snowflake, and check whether the connection was successful.
> connection.connect((err, conn) => {
>   if (err) {
>     console.error(`Unable to connect: ${err.message}`);
>   } else {
>     console.log('Successfully connected to Snowflake.');
>     // Optional: store the connection ID.
>     connectionId = conn.getId();
>   }
> });
> ```

When creating a connection, you can set the connection options as described
in [Options Reference](nodejs-driver-options.md).

## Verifying that a connection is ready to receive queries

Before submitting Snowflake queries, you can use the `connection.isValidAsync()` method (in version 1.6.23 and later)
to ensure the connection is up
and ready to execute requests on Snowflake. The method returns `true` if the connection is ready or `false` otherwise.

```javascript
// Create a Connection object and connect to Snowflake
// ..

// Verify if connection is still valid for sending queries over it
const isConnectionValid = await connection.isValidAsync();

// Do further actions based on the value (true or false) of isConnectionValid
```

## Creating a connection pool

Instead of creating a connection each time your client application needs to access Snowflake, you can define a
cache of Snowflake connections to reuse as needed. Connection pooling usually reduces the lag time to
make a connection. However, it can slow down client failover to an alternative DNS when a DNS problem occurs.

To create a connection pool:

1. Call `snowflake.createPool(connectionOptions, poolOptions)` to create a new `ConnectionPool` object, and
   pass in two JavaScript objects that specify the [connection options](nodejs-driver-options.md)
   and pool options.

   > **Note:**
   >
   > The Snowflake Node.js Driver uses the open-source [node-pool](https://github.com/coopernurse/node-pool) library for implementing connection pools. For information about
   > the supported `poolOptions`, see the description of the `opts` argument in the
   > [node-pool library documentation](https://github.com/coopernurse/node-pool/blob/master/README.md).
2. With the `ConnectionPool` object, call the `use` function to execute statements for a single connection
   in the connection pool.

   To handle connection errors, pass in a callback function that has the following signature:

   ```javascript
   function(err, stmt, rows)
   ```

   where:

   * `err` is a JavaScript `Error` object.
   * `stmt` is an object with information about the SQL statement that was executed, including the literal
     text of the statement.
   * `rows` is an array containing the “result set” of the statement.

   If an error occurs while executing the statement, the `connect` method passes an `Error` object to your callback function.
   You can use this object in your callback function to get details about the error.

The following example creates a connection pool that supports a maximum of ten active connections. It uses a password
for authentication. To use other authentication methods, see [Authentication options](nodejs-driver-options.md).

> ```javascript
> // Create the connection pool instance
> const connectionPool = snowflake.createPool(
>   // connection options
>   {
>     account: account,
>     username: user,
>     password: password,
>   },
>   // pool options
>   {
>     max: 10, // specifies the maximum number of connections in the pool
>     min: 0, // specifies the minimum number of connections in the pool
>   },
> );
> ```

The following example uses the `connectionPool.use` method to execute a SQL statement using the connections in the pool.
The `clientConnection.execute` method specifies the SQL statement to execute and defines a callback function.

> ```javascript
> // Use the connection pool and execute a statement
> connectionPool.use(async (clientConnection) => {
>   const statement = await clientConnection.execute({
>     sqlText: 'select 1;',
>     complete: function (err, stmt, rows) {
>       const stream = stmt.streamRows();
>       stream.on('data', (row) => {
>         console.log(row);
>       });
>       stream.on('end', () => {
>         console.log('All rows consumed');
>       });
>     },
>   });
> });
> ```

When creating a connection pool, you can set the connection options as described
in [Options Reference](nodejs-driver-options.md).

### Handling idle connections

With the default setting of the node-pool’s `evictionRunIntervalMillis` option set to 0, idle connection eviction checks are not run. If you have a longer running application, this behavior can lead to terminated connections lingering around in the connection pool, which when the driver acquires them and tries to send a query over them to Snowflake, causes an error.

To address this behavior in a long-running application, you could consider the following ways to handle it:

* Create the Snowflake `ConnectionPool` with an enabled evictor.

  You can add the `evictionRunIntervalMillis` option to the pool options, as shown in the following example:

  ```javascript
  const pool = snowflake.createPool(
    {
      account: account,
      username: username,

      // rest of the connection options

    },
    {
      evictionRunIntervalMillis: 60000, // default = 0, off
      idleTimeoutMillis: 60000, // default = 30000
      max: 2,
      min: 0
    }
  );
  ```

  This example runs the evictor every minute and evicts any connections that are idle for more than one minute. You can also tweak `numTestsPerEvictionRun` (default: 3) to change the number of resources checked during each eviction run.

  See the node-pool library [documentation](https://github.com/coopernurse/node-pool/blob/master/README.md) for details and more options.
* Keep existing connections alive in the pool

  If you need to keep a connection alive more frequently than every hour, you can add the following to the pool options:

  + `clientSessionKeepAlive: true`
  + `clientSessionKeepAliveHeartbeatFrequency: n`, where `n` is between 900 (15m) and 3600 (1h) seconds (default: 3600).

  The following example sends a keep-alive heartbeat every 15 minutes to keep the connection alive even if no other activity, such as a query from a client, occurs.

  ```javascript
  const pool = snowflake.createPool(
    {
      account: account,
      username: username,

      // rest of the connection options

      clientSessionKeepAlive: true, // default = false
      clientSessionKeepAliveHeartbeatFrequency: 900 // default = 3600
    },
    {
      max: 2,
      min: 0
    }
  );
  ```

  You can also use the `clientSessionKeepAlive` option without using pooled connections.

  For more information about the session keep-alive, see [Node.js options reference](nodejs-driver-options.md).

## Connecting through a proxy

You can connect to Snowflake through a proxy, by supplying the details as connection options when creating a `Connection` object.

The following example shows how to connect to a proxy using HTTP:

```javascript
const connection = snowflake.createConnection({
  account: 'account',
  username: 'user',
  password: 'password',
  proxyHost: 'localhost',
  proxyPort: 3128
});
```

Beginning with version 1.15.0, the Snowflake Node.js driver fully supports the `HTTP_PROXY`, `HTTPS_PROXY`, and `NO_PROXY` environment variables in addition to their corresponding connection parameters.

By default, the new `useEnvProxy` global configuration setting is set to `true`, which enables support for the environment variables.

With the ability to set these proxies both in the `Connection` object and in the environment variables, the driver uses the following hierarchy to determine which values to use:

* If a proxy is defined in the `Connection`, it takes precedence. The driver ignores the `HTTP_PROXY` and `HTTPS_PROXY` environment variables.
* If the Connection does not set the proxy values, the driver uses the values in the `HTTP_PROXY` and `HTTPS_PROXY` environment variables if they are defined.
* If the `useEnvProxy` connection setting is set to `false`, the driver ignores `HTTP_PROXY` and `HTTPS_PROXY` environment variables if they are defined.

If you want to disable support for proxy environment variables, you must disable it in the global configuration, as follows:

```javascript
import snowflake from 'snowflake-sdk';

snowflake.configure({ useEnvProxy: false });
```

> **Note:**
>
> The environmental variables are case-sensitive on Linux and MacOS. On Windows, they are not.
>
> * If both the lower-case (`https_proxy`) and upper-case (`HTTPS_PROXY`) variants are defined for the same environment variable, the driver uses the value from the lower-case (`https_proxy`) variable.
> * If only the upper-case (`HTTPS_PROXY`) variant is present, the driver use the upper-case variable’s value.

## Connecting through an authenticated proxy

You can connect to Snowflake through an authenticated proxy by supplying authentication credentials as connection
options when creating a `Connection` object.

> **Note:**
>
> Connecting through an authenticated proxy server is supported starting with version 1.6.4 of the Snowflake Node.js Driver.

The following example shows how to connect to an authenticated proxy using HTTP:

```javascript
const connection = snowflake.createConnection({
  account: 'account',
  username: 'user',
  password: 'password',
  proxyHost: 'localhost',
  proxyPort: 3128,
  proxyUser: 'myname',
  proxyPassword: 'mypass'
});
```

To connect to an authenticated proxy using HTTPS you must also provide the `proxyProtocol` connection property as shown below:

```javascript
const connection = snowflake.createConnection({
  account: 'account',
  username: 'user',
  password: 'password',
  proxyHost: 'localhost',
  proxyPort: 3128,
  proxyUser: 'myname',
  proxyPassword: 'mypass',
  proxyProtocol: 'https'
});
```

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

## OCSP (online certificate status protocol)

When the driver connects, Snowflake sends a certificate to confirm that the connection is to Snowflake rather than to
a host that is impersonating Snowflake. The driver sends that certificate to an OCSP (Online Certificate Status
Protocol) server to verify that the certificate has not been revoked.

If the driver cannot reach the OCSP server to verify the certificate, the driver can
[“fail open” or “fail closed”](../../user-guide/ocsp.md).

## Terminating a connection

A connection can be terminated by calling the `connection.destroy()` method. This immediately ends the session associated with the connection without waiting for running statements to complete:

> ```javascript
> connection.destroy((err, conn) => {
>   if (err) {
>     console.error(`Unable to disconnect: ${err.message}`);
>   } else {
>     console.log(`Disconnected connection with id: ${connection.getId()}`);
>   }
> });
> ```

---
title: Managing data loading and unloading resources with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-data-loading.md
section: Developer Guide
---

# Managing data loading and unloading resources with Python

You can use Python to manage data loading and unloading resources in Snowflake, including external volumes, pipes, and stages.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing stages

You can manage Snowflake stages, which are locations of data files in cloud storage. For an overview of stages, see
[Overview of data loading](../../user-guide/data-load-overview.md).

The Snowflake Python APIs represents stages with two separate types:

* `Stage`: Exposes a stage’s properties such as its name, encryption type, credentials, and directory table settings.
* `StageResource`: Exposes methods you can use to fetch a corresponding `Stage` object, upload and list files on the stage, and
  drop the stage.

### Creating a stage

To create a stage, first create a `Stage` object, and then create a `StageCollection` object from the API `Root`
object. Using `StageCollection.create`, add the new stage to Snowflake.

Code in the following example creates a `Stage` object that represents a stage named `my_stage` with an encryption type of
`SNOWFLAKE_SSE` (server-side encryption only):

```python
from snowflake.core.stage import Stage, StageEncryption

my_stage = Stage(
  name="my_stage",
  encryption=StageEncryption(type="SNOWFLAKE_SSE")
)
stages = root.databases["my_db"].schemas["my_schema"].stages
stages.create(my_stage)
```

The code creates a `StageCollection` variable `stages` and uses `StageCollection.create` to create a new stage in Snowflake.

### Getting stage details

You can get information about a stage by calling the `StageResource.fetch` method, which returns a `Stage` object.

Code in the following example gets information about a stage named `my_stage`:

```python
my_stage = root.databases["my_db"].schemas["my_schema"].stages["my_stage"].fetch()
print(my_stage.to_dict())
```

### Listing stages

You can list stages using the `StageCollection.iter` method, which returns a `PagedIter` iterator of `Stage` objects.

Code in the following example lists stages whose name includes the text `my` and prints the name of each:

```python
from snowflake.core.stage import StageCollection

stages: StageCollection = root.databases["my_db"].schemas["my_schema"].stages
stage_iter = stages.iter(like="my%")  # returns a PagedIter[Stage]
for stage_obj in stage_iter:
  print(stage_obj.name)
```

### Performing stage operations

You can perform common stage operations—such as uploading a file to a stage and listing files on a stage—with a `StageResource`
object.

To demonstrate some operations you can do with a stage resource, code in the following example does the following:

1. Uploads a file named `my-file.yaml` to the `my_stage` stage with the specified auto-compress and overwrite options.
2. Lists all files on the stage to verify that the file was uploaded successfully.
3. Drops the stage.

```python
my_stage_res = root.databases["my_db"].schemas["my_schema"].stages["my_stage"]

my_stage_res.put("./my-file.yaml", "/", auto_compress=False, overwrite=True)

stageFiles = root.databases["my_db"].schemas["my_schema"].stages["my_stage"].list_files()
for stageFile in stageFiles:
  print(stageFile)

my_stage_res.drop()
```

## Managing pipes

You can manage Snowflake pipes, which are named, first-class Snowflake objects that contain a COPY INTO statement used by Snowpipe to load
data from an ingestion queue into tables. For an overview of pipes, see [Snowpipe](../../user-guide/data-load-snowpipe-intro.md).

The Snowflake Python APIs represents pipes with two separate types:

* `Pipe`: Exposes a pipe’s properties such as its name and the COPY INTO statement to be used by Snowpipe.
* `PipeResource`: Exposes methods you can use to fetch a corresponding `Pipe` object, refresh the pipe with staged data files,
  and drop the pipe.

### Creating a pipe

To create a pipe, first create a `Pipe` object, and then create a `PipeCollection` object from the API `Root`
object. Using `PipeCollection.create`, add the new pipe to Snowflake.

Code in the following example creates a `Pipe` object that represents a pipe named `my_pipe` with the specified COPY INTO
statement:

```python
from snowflake.core.pipe import Pipe

my_pipe = Pipe(
  name="my_pipe",
  comment="creating my pipe",
  copy_statement="COPY INTO my_table FROM @mystage FILE_FORMAT = (TYPE = 'JSON')",
)

pipes = root.databases["my_db"].schemas["my_schema"].pipes
pipes.create(my_pipe)
```

The code creates a `PipeCollection` variable `pipes` and uses `PipeCollection.create` to create a new pipe in Snowflake.

### Getting pipe details

You can get information about a pipe by calling the `PipeResource.fetch` method, which returns a `Pipe` object.

Code in the following example gets information about a pipe named `my_pipe`:

```python
my_pipe = root.databases["my_db"].schemas["my_schema"].pipes["my_pipe"].fetch()
print(my_pipe.to_dict())
```

### Listing pipes

You can list pipes using the `PipeCollection.iter` method, which returns a `PagedIter` iterator of `Pipe` objects.

Code in the following example lists pipes whose name starts with `my` and prints the name of each:

```python
from snowflake.core.pipe import PipeCollection

pipes: PipeCollection = root.databases["my_db"].schemas["my_schema"].pipes
pipe_iter = pipes.iter(like="my%")  # returns a PagedIter[Pipe]
for pipe_obj in pipe_iter:
  print(pipe_obj.name)
```

### Performing pipe operations

You can perform common pipe operations—such as refreshing a pipe and dropping a pipe—with a `PipeResource` object.

> **Note:**
>
> Only the REFRESH functionality of [ALTER PIPE](../../sql-reference/sql/alter-pipe.md) is currently supported.

To demonstrate operations you can do with a pipe resource, code in the following example does the following:

1. Gets the `my_pipe` pipe resource object.
2. Refreshes the pipe with staged data files with the specified, optional prefix (or path).
3. Drops the pipe.

```python
my_pipe_res = root.databases["my_db"].schemas["my_schema"].pipes["my_pipe"]

# equivalent to: ALTER PIPE my_pipe REFRESH PREFIX = 'dir3/'
my_pipe_res.refresh(prefix="dir3/")

my_pipe_res.drop()
```

## Managing external volumes

You can manage external volumes, which are named, account-level Snowflake objects that you use to connect Snowflake to your external cloud
storage for Apache Iceberg™ tables. For more information, see the [External volume](../../user-guide/tables-iceberg.md) section of
[Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

The Snowflake Python APIs represents external volumes with two separate types:

* `ExternalVolume`: Exposes an external volume’s properties, such as its name and storage locations.
* `ExternalVolumeResource`: Exposes methods you can use to fetch a corresponding `ExternalVolume` object and drop or restore
  the external volume.

### Creating an external volume

To create an external volume, first create an `ExternalVolume` object, and then create an `ExternalVolumeCollection` object
from the API `Root` object. Using `ExternalVolumeCollection.create`, add the new external volume to Snowflake.

Code in the following example creates an `ExternalVolume` object that represents an external volume named `my_external_volume`
with the specified AWS S3 storage locations:

```python
from snowflake.core.external_volume import (
    ExternalVolume,
    StorageLocationS3,
)

my_external_volume = ExternalVolume(
    name="my_external_volume",
    storage_locations=[
        StorageLocationS3(
            name="my-s3-us-west-1",
            storage_base_url="s3://MY_EXAMPLE_BUCKET/",
            storage_aws_role_arn="arn:aws:iam::123456789012:role/myrole",
            encryption=Encryption(type="AWS_SSE_KMS", kms_key_id="1234abcd-12ab-34cd-56ef-1234567890ab"),
        ),
        StorageLocationS3(
            name="my-s3-us-west-2",
            storage_base_url="s3://MY_EXAMPLE_BUCKET/",
            storage_aws_role_arn="arn:aws:iam::123456789012:role/myrole",
            encryption=Encryption(type="AWS_SSE_KMS", kms_key_id="1234abcd-12ab-34cd-56ef-1234567890ab"),
        ),
    ]
)

root.external_volumes.create(my_external_volume)
```

### Getting external volume details

You can get information about an external volume by calling the `ExternalVolumeResource.fetch` method, which returns an
`ExternalVolume` object.

Code in the following example gets information about an external volume named `my_external_volume`:

```python
my_external_volume = root.external_volumes["my_external_volume"].fetch()
print(my_external_volume.to_dict())
```

### Listing external volumes

You can list external volumes using the `ExternalVolumeCollection.iter` method, which returns a `PagedIter` iterator of
`ExternalVolume` objects.

Code in the following example lists external volumes whose name starts with `my` and prints the name of each:

```python
external_volume_iter = root.external_volumes.iter(like="my%")
for external_volume_obj in external_volume_iter:
  print(external_volume_obj.name)
```

### Performing external volume operations

You can perform common external volume operations—such as dropping and restoring an external volume—with an
`ExternalVolumeResource` object.

To demonstrate operations you can do with an external volume resource, code in the following example does the following:

1. Gets the `my_external_volume` external volume resource object.
2. Drops the external volume.
3. Restores the most recent version of the dropped external volume.

```python
my_external_volume_res = root.external_volumes["my_external_volume"]
my_external_volume_res.drop()
my_external_volume_res.undrop()
```

---
title: Managing Snowflake accounts and managed accounts with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-accounts.md
section: Developer Guide
---

# Managing Snowflake accounts and managed accounts with Python

You can use Python to manage accounts and managed accounts in Snowflake.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing accounts

You can manage accounts in your Snowflake organization. For an overview of accounts in Snowflake, see
[Managing accounts in your organization](../../user-guide/organizations-manage-accounts.md).

The Snowflake Python APIs represents accounts with two separate types:

* `Account`: Exposes an account’s properties such as its name identifier, the login name and password of its initial administrative
  user, and its Snowflake edition.
* `AccountResource`: Exposes methods you can use to drop and restore a corresponding `Account` object.

### Creating an account

To create an account, first create an `Account` object, and then create an `AccountCollection` object from the API `Root`
object. Using `AccountCollection.create`, add the new account to Snowflake.

Code in the following example creates an `Account` object that represents an account named `my_account1` with the specified
account properties:

```python
from snowflake.core.account import Account

my_account = Account(
  name="my_account1",
  admin_name="admin",
  admin_password="TestPassword1",
  first_name="Jane",
  last_name="Smith",
  email="myemail@myorg.org",
  edition="ENTERPRISE",
  region="aws_us_west_2",
  comment="creating my account",
)

root.accounts.create(my_account)
```

### Listing accounts

You can list accounts using the `AccountCollection.iter` method, which returns a `PagedIter` iterator of `Account`
objects.

Code in the following example lists accounts whose name starts with `my` and prints the name of each:

```python
account_iter = root.accounts.iter(like="my%")  # returns a PagedIter[Account]
for account_obj in account_iter:
  print(account_obj.name)
```

Code in the following example sets the optional parameter `history=True` to list a history of accounts including dropped accounts that
have not yet been deleted.

```python
account_iter = root.accounts.iter(history=True)  # returns a PagedIter[Account]
for account_obj in account_iter:
  print(account_obj.name)
```

### Performing account operations

You can perform common account operations—such as dropping and restoring an account—with an `AccountResource` object.

To demonstrate operations you can do with an account resource, code in the following example does the following:

1. Gets the `my_account1` account resource object.
2. Drops the account with the specified grace period, which is the number of days during which the account can be restored (“undropped”).
3. Restores the dropped account within the specified grace period (that is, before it’s permanently deleted).

```python
my_account_res = root.accounts["my_account1"]
my_account_res.drop(grace_period_in_days=4)
my_account_res.undrop()
```

## Managing managed accounts

You can manage Snowflake managed accounts, which are currently used by data providers to create reader accounts for their consumers. For
more information, see [Manage reader accounts](../../user-guide/data-sharing-reader-create.md).

The Snowflake Python APIs represents managed accounts with two separate types:

* `ManagedAccount`: Exposes a managed account’s properties such as its name identifier, the login name and password of its initial
  administrative user, and its account type.
* `ManagedAccountResource`: Exposes methods you can use to drop a corresponding `ManagedAccount` object.

### Creating a managed account

To create a managed account, first create a `ManagedAccount` object, and then create a `ManagedAccountCollection` object from
the API `Root` object. Using `ManagedAccountCollection.create`, add the new managed account to Snowflake.

Code in the following example creates a `ManagedAccount` object that represents a managed account named `reader_acct1` with the
specified account properties:

```python
from snowflake.core.managed_account import ManagedAccount

my_managed_account = ManagedAccount(
  name="reader_acct1",
  admin_name="admin",
  admin_password="TestPassword1",
  type="READER",
  comment="creating my managed account",
)

root.managed_accounts.create(my_managed_account)
```

### Listing managed accounts

You can list managed accounts using the `ManagedAccountCollection.iter` method, which returns a `PagedIter` iterator of
`ManagedAccount` objects.

Code in the following example lists managed accounts whose name starts with `reader` and prints the name of each:

```python
account_iter = root.managed_accounts.iter(like="reader%")  # returns a PagedIter[ManagedAccount]
for account_obj in account_iter:
  print(account_obj.name)
```

### Dropping a managed account

You can drop a managed account with a `ManagedAccountResource` object.

Code in the following example gets the `reader_acct1` managed account resource object and then drops the account.

```python
my_managed_account_res = root.managed_accounts["reader_acct1"]
my_managed_account_res.drop()
```

---
title: Managing Snowflake alerts with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-alerts.md
section: Developer Guide
---

# Managing Snowflake alerts with Python

You can use Python to manage Snowflake alerts, which you can set up to periodically perform an action under specific conditions, based on
data within Snowflake. For more information about alerts, see [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md).

> **Note:**
>
> [ALTER ALERT](../../sql-reference/sql/alter-alert.md) is currently not supported.

The Snowflake Python APIs represents alerts with two separate types:

* `Alert`: Exposes an alert’s properties such as its name, condition, action, and schedule.
* `AlertResource`: Exposes methods you can use to fetch a corresponding `Alert` object, execute the alert, and drop the alert.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating an alert

To create an alert, first create an `Alert` object, and then create an `AlertCollection` object from the API `Root`
object. Using `AlertCollection.create`, add the new alert to Snowflake.

Code in the following example creates an `Alert` object that represents an alert named `my_alert` in your account:

```python
from snowflake.core.alert import Alert, MinutesSchedule

root.alerts.create(Alert(
    name="my_alert",
    condition="SELECT 1",
    action="SELECT 2",
    schedule=MinutesSchedule(minutes=1),
    comment="test comment"
))
```

The code creates an `AlertCollection` variable `alerts` and uses `AlertCollection.create` to create a new alert in
Snowflake.

## Getting alert details

You can get information about an alert by calling the `AlertResource.fetch` method, which returns an `Alert` object.

Code in the following example gets information about an alert named `my_alert`:

```python
my_alert = root.alerts["my_alert"].fetch()
print(my_alert.to_dict())
```

## Listing alerts

You can list alerts using the `AlertCollection.iter` method, which returns a `PagedIter` iterator of `Alert` objects.

Code in the following example lists alerts whose name starts with `my`, and then prints the name of each. This example also sets the
optional parameter `show_limit=5` to limit the number of results to `5`:

```python
alerts_iter = root.alerts.iter(like="my%", show_limit=5)
for alert_obj in alerts_iter:
  print(alert_obj.name)
```

## Performing alert operations

You can perform common alert operations, such as executing and dropping alerts, with an `AlertResource` object.

To demonstrate some operations you can do with an alert resource, code in the following example does the following:

1. Gets the `my_alert` alert resource object.
2. Executes the alert.
3. Drops the alert.

```python
my_alert_res = root.alerts["my_alert"]

my_alert_res.execute()
my_alert_res.drop()
```

---
title: Managing Snowflake databases, schemas, tables, and views with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-databases.md
section: Developer Guide
---

# Managing Snowflake databases, schemas, tables, and views with Python

You can use Python to manage Snowflake databases, schemas, tables, and views. For more information about managing and working with data in
Snowflake, see [Databases, Tables and Views - Overview](../../guides-overview-db.md).

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing databases

You can manage databases in Snowflake. The Snowflake Python APIs represents databases with two separate types:

* `Database`: Exposes a database’s properties, such as its name.
* `DatabaseResource`: Exposes methods you can use to fetch a corresponding `Database` object and to drop the database.

**Topics**

* Creating a database
* Getting database details
* Listing databases
* Dropping or restoring a database

### Creating a database

You can create a database by calling the `DatabaseCollection.create` method and passing a `Database` object that represents the
database you want to create. To create a database, first create a `Database` object that specifies the database name.

Code in the following example creates a `Database` object representing a database named `my_db` and then creates the
database by passing the `Database` object to the `DatabaseCollection.create` method:

```python
from snowflake.core.database import Database

my_db = Database(name="my_db")
root.databases.create(my_db)
```

### Getting database details

You can get information about a database by calling the `DatabaseResource.fetch` method, which returns a `Database`
object.

Code in the following example gets information about a database named `my_db`:

```python
my_db = root.databases["my_db"].fetch()
print(my_db.to_dict())
```

### Listing databases

You can list databases using the `iter` method, which returns a `PagedIter` iterator.

Code in the following example lists databases whose name begins with `my`:

```python
databases = root.databases.iter(like="my%")
for database in databases:
  print(database.name)
```

### Dropping or restoring a database

You can drop a database using the `DatabaseResource.drop` method or restore a database using the `DatabaseResource.undrop`
method.

To demonstrate these operations, code in the following example drops and then restores the most recent version of the `my_db` database:

```python
my_db_res = root.databases["my_db"]
my_db_res.drop()
my_db_res.undrop()
```

## Managing schemas

You can manage schemas in Snowflake. A schema is a database-level object. When you create or reference a schema, you do so in the context
of its database.

The Snowflake Python APIs represents schemas with two separate types:

* `Schema`: Exposes a schema’s properties, such as its name.
* `SchemaResource`: Exposes methods you can use to fetch a corresponding `Schema` object and to drop the schema.

**Topics**

* Creating a schema
* Getting schema details
* Listing schemas
* Dropping or restoring a schema

### Creating a schema

To create a schema, first create a `Schema` object that specifies the schema name.

Code in the following example creates a `Schema` object representing a schema named `my_schema`:

```python
from snowflake.core.schema import Schema

my_schema = Schema(name="my_schema")
root.databases["my_db"].schemas.create(my_schema)
```

The code then creates the schema in the `my_db` database by passing the `Schema` object to the `SchemaCollection.create`
method.

### Getting schema details

You can get information about a schema by calling the `SchemaResource.fetch` method, which returns a `Schema` object.

Code in the following example gets a `Schema` object that represents the `my_schema` schema:

```python
my_schema = root.databases["my_db"].schemas["my_schema"].fetch()
print(my_schema.to_dict())
```

### Listing schemas

You can list the schemas in a specified database using the `iter` method. The method returns a `PagedIter` iterator of
`Schema` objects.

Code in the following example lists schema names in the `my_db` database:

```python
schema_list = root.databases["my_db"].schemas.iter()
for schema_obj in schema_list:
  print(schema_obj.name)
```

### Dropping or restoring a schema

You can drop a schema using the `SchemaResource.drop` method or restore a schema using the `SchemaResource.undrop` method.

To demonstrate these operations, code in the following example drops and then restores the most recent version of the `my_schema` schema:

```python
my_schema_res = root.databases["my_db"].schemas["my_schema"]
my_schema_res.drop()
my_schema_res.undrop()
```

## Managing standard tables

You can manage standard tables in Snowflake. A table is a schema-level object. When you create or reference a table, you do so in the
context of its schema.

The Snowflake Python APIs represents tables with two separate types:

* `Table`: Exposes a table’s properties, such as its name and columns.
* `TableResource`: Exposes methods you can use to fetch a corresponding `Table` object, update the properties of the table, and
  drop the table.

**Topics**

* Creating a table
* Getting table details
* Creating or altering a table
* Listing tables
* Swapping table names
* Performing table operations

### Creating a table

To create a table, first create a `Table` object that specifies the table name, column names, and column data types.

Code in the following example creates a `Table` object representing a table named `my_table` with the specified columns:

```python
from snowflake.core.table import Table, TableColumn

my_table = Table(
  name="my_table",
  columns=[TableColumn(name="c1", datatype="int", nullable=False),
           TableColumn(name="c2", datatype="string")]
)
root.databases["my_db"].schemas["my_schema"].tables.create(my_table)
```

The code then creates the table in the `my_db` database and `my_schema` schema by passing the `Table` object to the
`TableCollection.create` method.

### Getting table details

You can get information about a table by calling the `TableResource.fetch` method, which returns a `Table` object.

Code in the following example gets information about a table named `my_table`:

```python
my_table = root.databases["my_db"].schemas["my_schema"].tables["my_table"].fetch()
print(my_table.to_dict())
```

### Creating or altering a table

You can set properties of a `Table` object and pass it to the `TableResource.create_or_alter` method to create a table if it
doesn’t exist, or alter it according to the table definition if it does exist. The behavior of `create_or_alter` is intended to be
idempotent, which means that the resulting table object will be the same regardless of whether the table exists before you call the method.

> **Note:**
>
> The `create_or_alter` method uses default values for any [Table](https://docs.snowflake.com/en/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.table.Table)
> properties that you don’t explicitly define. For example, if you don’t set `data_retention_time_in_days`, its value defaults to
> `None` even if the table previously existed with a different value.

Code in the following example appends a new column named `c3` of datatype `int` to the `my_table` table, and then alters the
table in Snowflake:

```python
from snowflake.core.table import PrimaryKey, TableColumn

my_table = root.databases["my_db"].schemas["my_schema"].tables["my_table"].fetch()
my_table.columns.append(TableColumn(name="c3", datatype="int", nullable=False, constraints=[PrimaryKey()]))

my_table_res = root.databases["my_db"].schemas["my_schema"].tables["my_table"]
my_table_res.create_or_alter(my_table)
```

### Listing tables

You can list the tables in a specified schema using the `iter` method, which returns a `PagedIter` iterator of
`Table` objects.

Code in the following example lists tables whose name begins with `my`:

```python
tables = root.databases["my_db"].schemas["my_schema"].tables.iter(like="my%")
for table_obj in tables:
  print(table_obj.name)
```

### Swapping table names

You can swap (exchange) the name of a table with another table in a single transaction using the `TableResource.swap_with` method.
For more information, see the SWAP WITH parameter description in [ALTER TABLE](../../sql-reference/sql/alter-table.md).

Code in the following example swaps `my_table` with `other_table` in the same database and schema:

```python
my_table_res = root.databases["my_db"].schemas["my_schema"].tables["my_table"]
my_table_res.swap_with("other_table")
```

Code in the following example swaps `my_table` (in the `my_db` database and `my_schema` schema) with `other_table` (in the
`other_db` database and `other_schema` schema):

```python
my_table_res = root.databases["my_db"].schemas["my_schema"].tables["my_table"]
my_table_res.swap_with(to_swap_table_name="other_table", target_database="other_db", target_schema="other_schema")
```

### Performing table operations

You can perform common table operations—such as managing [reclustering](../../user-guide/tables-auto-reclustering.md) for a table and
dropping or restoring a table—with a `TableResource` object.

For more information about these table operations, see [Table, view, sequence, and user-defined type commands](../../sql-reference/commands-table.md) in the SQL command reference.

To demonstrate some operations you can do with a table resource, code in the following example does the following:

1. Gets the `my_table` table resource object in the `my_db` database and the `my_schema` schema.
2. Suspends reclustering for the table.
3. Resumes reclustering for the table.
4. Drops the table.
5. Restores the most recent version of the dropped table.

```python
my_table_res = root.databases["my_db"].schemas["my_schema"].tables["my_table"]

my_table_res.suspend_recluster()
my_table_res.resume_recluster()
my_table_res.drop()
my_table_res.undrop()
```

## Managing event tables

You can manage Snowflake event tables, which are a special kind of database table with a predefined set of columns where Snowflake can
collect telemetry data. For more information, see [Event table overview](../logging-tracing/event-table-setting-up.md).

The Snowflake Python APIs represents event tables with two separate types:

* `EventTable`: Exposes an event table’s properties such as its name, data retention time, max data extension time, and change
  tracking option.
* `EventTableResource`: Exposes methods you can use to fetch a corresponding `EventTable` object, rename the event table, and
  drop the event table.

**Topics**

* Creating an event table
* Getting event table details
* Listing event tables
* Performing event table operations

### Creating an event table

To create an event table, first create a `EventTable` object, and then create a `EventTableCollection` object from the API
`Root` object. Using `EventTableCollection.create`, add the new event table to Snowflake.

Code in the following example creates a `EventTable` object that represents an event table named `my_event_table` with the
specified parameters:

```python
from snowflake.core.event_table import EventTable

event_table = EventTable(
  name="my_event_table",
  data_retention_time_in_days = 3,
  max_data_extension_time_in_days = 5,
  change_tracking = True,
  default_ddl_collation = 'EN-CI',
  comment = 'CREATE EVENT TABLE'
)

event_tables = root.databases["my_db"].schemas["my_schema"].event_tables
event_tables.create(my_event_table)
```

The code creates a `EventTableCollection` variable `event_tables` and uses `EventTableCollection.create` to create a new
event table in Snowflake.

### Getting event table details

You can get information about an event table by calling the `EventTableResource.fetch` method, which returns a `EventTable`
object.

Code in the following example gets information about an event table named `my_event_table`:

```python
my_event_table = root.databases["my_db"].schemas["my_schema"].event_tables["my_event_table"].fetch()
print(my_event_table.to_dict())
```

### Listing event tables

You can list event tables using the `EventTableCollection.iter` method, which returns a `PagedIter` iterator of
`EventTable` objects.

Code in the following example lists event tables whose name starts with `my` in the `my_db` database and `my_schema` schema, and
prints the name of each:

```python
from snowflake.core.event_table import EventTableCollection

event_tables: EventTableCollection = root.databases["my_db"].schemas["my_schema"].event_tables
event_table_iter = event_tables.iter(like="my%")  # returns a PagedIter[EventTable]
for event_table_obj in event_table_iter:
  print(event_table_obj.name)
```

Code in the following example also lists event tables whose name begins with `my`, but it uses the `starts_with` parameter instead
of `like`. This example also sets the optional parameter `show_limit=10` to limit the number of results to `10`:

```python
event_tables: EventTableCollection = root.databases["my_db"].schemas["my_schema"].event_tables
event_table_iter = event_tables.iter(starts_with="my", show_limit=10)
for event_table_obj in event_table_iter:
  print(event_table_obj.name)
```

### Performing event table operations

You can perform common event table operations—such as renaming an event table and dropping an event table—with a
`EventTableResource` object.

> **Note:**
>
> Only the RENAME functionality of [ALTER TABLE (event tables)](../../sql-reference/sql/alter-table-event-table.md) is currently supported.
>
> RENAME is not supported on the default event table, SNOWFLAKE.TELEMETRY.EVENTS.

To demonstrate operations you can do with an event table resource, code in the following example does the following:

1. Gets the `my_event_table` event table resource object in the `my_db` database and the `my_schema` schema.
2. Renames the event table.
3. Drops the event table.

```python
my_event_table_res = root.databases["my_db"].schemas["my_schema"].event_tables["my_event_table"]

my_event_table_res.rename("my_other_event_table")
my_event_table_res.drop()
```

## Managing views

You can manage views in Snowflake. A view is a schema-level object and allows the result of a query to be accessed as if it were a table.
When you create or reference a view, you do so in the context of its schema.

> **Note:**
>
> [ALTER VIEW](../../sql-reference/sql/alter-view.md) is currently not supported.

The Snowflake Python APIs represents views with two separate types:

* `View`: Exposes a view’s properties, such as its name, columns, and SQL query statement.
* `ViewResource`: Exposes methods you can use to fetch a corresponding `View` object and to drop the view.

**Topics**

* Creating a view
* Getting view details
* Listing views
* Dropping a view

### Creating a view

To create a view, first create a `View` object that specifies the view name, columns, and SQL query statement.

Code in the following example creates a `View` object representing a view named `my_view` with the specified columns and SQL
query:

```python
from snowflake.core.view import View, ViewColumn

my_view = View(
  name="my_view",
  columns=[
      ViewColumn(name="c1"), ViewColumn(name="c2"), ViewColumn(name="c3"),
  ],
  query="SELECT * FROM my_table",
)

root.databases["my_db"].schemas["my_schema"].views.create(my_view)
```

The code then creates the view in the `my_db` database and `my_schema` schema by passing the `View` object to the
`ViewCollection.create` method.

### Getting view details

You can get information about a view by calling the `ViewResource.fetch` method, which returns a `View` object.

Code in the following example gets a `View` object that represents the `my_view` view:

```python
my_view = root.databases["my_db"].schemas["my_schema"].views["my_view"].fetch()
print(my_view.to_dict())
```

### Listing views

You can list the views in a specified database using the `iter` method. The method returns a `PagedIter` iterator of
`View` objects.

Code in the following example lists views whose name begins with `my` in the `my_db` database and `my_schema` schema:

```python
view_list = root.databases["my_db"].schemas["my_schema"].views.iter(like="my%")
for view_obj in view_list:
  print(view_obj.name)
```

Code in the following example also lists views whose name begins with `my`, but it uses the `starts_with` parameter instead of
`like`. This example also sets the optional parameter `show_limit=10` to limit the number of results to `10`:

```python
view_list = root.databases["my_db"].schemas["my_schema"].views.iter(starts_with="my", show_limit=10)
for view_obj in view_list:
  print(view_obj.name)
```

### Dropping a view

You can drop a view using the `ViewResource.drop` method.

Code in the following example drops the `my_view` view:

```python
my_view_res = root.databases["my_db"].schemas["my_schema"].views["my_view"]
my_view_res.drop()
```

---
title: Managing Snowflake dynamic tables with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-dynamic-tables.md
section: Developer Guide
---

# Managing Snowflake dynamic tables with Python

You can use Python to manage Snowflake dynamic tables, which are a new table type for continuous processing pipelines. Dynamic tables
materialize the results of a specified query. For an overview of this feature, see [Dynamic tables](../../user-guide/dynamic-tables-about.md).

The Snowflake Python APIs represents dynamic tables with two separate types:

* `DynamicTable`: Exposes a dynamic table’s properties such as its name, target lag, warehouse, and query statement.
* `DynamicTableResource`: Exposes methods you can use to fetch a corresponding `DynamicTable` object, suspend and resume the
  dynamic table, and drop the dynamic table.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a dynamic table

To create a dynamic table, first create a `DynamicTable` object, and then create a `DynamicTableCollection` object from the API
`Root` object. Using `DynamicTableCollection.create`, add the new dynamic table to Snowflake.

Code in the following example creates a `DynamicTable` object that represents a dynamic table named `my_dynamic_table` in the
`my_db` database and the `my_schema` schema, with the minimum required options specified:

```python
from snowflake.core.dynamic_table import DynamicTable, DownstreamLag

my_dt = DynamicTable(
  name='my_dynamic_table',
  target_lag=DownstreamLag(),
  warehouse='my_wh',
  query='SELECT * FROM t',
)
dynamic_tables = root.databases['my_db'].schemas['my_schema'].dynamic_tables
dynamic_tables.create(my_dt)
```

The code creates a `DynamicTableCollection` variable `dynamic_tables` and uses `DynamicTableCollection.create` to create
a new dynamic table in Snowflake.

Code in the following example creates a `DynamicTable` object that represents a dynamic table named `my_dynamic_table2` in the
`my_db` database and the `my_schema` schema with all currently possible options specified:

```python
from snowflake.core.dynamic_table import DynamicTable, UserDefinedLag

root.databases['my_db'].schemas['my_schema'].dynamic_tables.create(
  DynamicTable(
      name='my_dynamic_table2',
      kind='PERMANENT',
      target_lag=UserDefinedLag(seconds=60),
      warehouse='my_wh',
      query='SELECT * FROM t',
      refresh_mode='FULL',
      initialize='ON_SCHEDULE',
      cluster_by=['id > 1'],
      comment='test table',
      data_retention_time_in_days=7,
      max_data_extension_time_in_days=7,
  )
)
```

### Cloning a dynamic table

Code in the following example creates a new dynamic table named `my_dynamic_table2` with the same column definitions and all existing
data from the source dynamic table `my_dynamic_table` in the `my_db` database and the `my_schema` schema:

> > **Note:**
> >
> > This clone operation uses the `DynamicTableClone` object, which includes the optional `target_lag` and `warehouse`
> > parameters, and currently does not support other parameters.

```python
from snowflake.core.dynamic_table import DynamicTableClone

root.databases['my_db'].schemas['my_schema'].dynamic_tables.create(
  DynamicTableClone(
      name='my_dynamic_table2',
      warehouse='my_wh2',
  ),
  clone_table='my_dynamic_table',
)
```

For more information about this functionality, see [CREATE DYNAMIC TABLE … CLONE](../../sql-reference/sql/create-dynamic-table.md).

## Getting dynamic table details

You can get information about a dynamic table by calling the `DynamicTableResource.fetch` method, which returns a
`DynamicTable` object.

Code in the following example gets information about a dynamic table named `my_dynamic_table` in the `my_db` database and the
`my_schema` schema:

```python
dynamic_table = root.databases['my_db'].schemas['my_schema'].dynamic_tables['my_dynamic_table']
dt_details = dynamic_table.fetch()
print(dt_details.to_dict())
```

## Listing dynamic tables

You can list dynamic tables using the `DynamicTableCollection.iter` method, which returns a `PagedIter` iterator of
`DynamicTable` objects.

Code in the following example lists dynamic tables whose name starts with the text `my` in the `my_db` database and the `my_schema`
schema, and then prints the name of each:

```python
from snowflake.core.dynamic_table import DynamicTableCollection

dt_list = root.databases['my_db'].schemas['my_schema'].dynamic_tables.iter(like='my%')
for dt_obj in dt_list:
  print(dt_obj.name)
```

## Swapping dynamic table names

You can swap the name of a dynamic table with another dynamic table in a single transaction using the `DynamicTableResource.swap_with`
method. For more information, see the SWAP WITH parameter description in [ALTER DYNAMIC TABLE](../../sql-reference/sql/alter-dynamic-table.md).

Code in the following example swaps `my_dynamic_table` with `other_dynamic_table` in the same database and schema:

```python
my_table_res = root.databases['my_db'].schemas['my_schema'].tables['my_dynamic_table']
my_table_res.swap_with('other_dynamic_table')
```

## Performing dynamic table operations

You can perform common dynamic table operations—such as refreshing, suspending, and resuming a dynamic table—with a
`DynamicTableResource` object.

For more information about these dynamic table operations, see [Table, view, sequence, and user-defined type commands](../../sql-reference/commands-table.md) in the SQL command reference.

To demonstrate some operations you can do with a dynamic table resource, code in the following example does the following:

1. Gets the `my_dynamic_table` dynamic table resource object in the `my_db` database and the `my_schema` schema.
2. Refreshes the dynamic table.
3. Suspends the dynamic table.
4. Resumes the dynamic table.
5. Drops the dynamic table.
6. Restores the most recent version of the dropped dynamic table.

```python
my_dynamic_table_res = root.databases["my_db"].schemas["my_schema"].dynamic_tables["my_dynamic_table"]

my_dynamic_table_res.refresh()
my_dynamic_table_res.suspend()
my_dynamic_table_res.resume()
my_dynamic_table_res.drop()
my_dynamic_table_res.undrop()
```

---
title: Managing Snowflake functions and stored procedures with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-functions-procedures.md
section: Developer Guide
---

# Managing Snowflake functions and stored procedures with Python

You can use Python to manage user-defined functions (UDFs) and stored procedures in Snowflake. When you create a UDF or procedure, you
write its logic in one of the supported handler languages, then create it using the Snowflake Python APIs. For more information about UDFs and
procedures, see [Extending Snowflake with Functions and Procedures](../extensibility.md).

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing user-defined functions (UDFs)

You can manage user-defined functions (UDFs), which you can write to extend the system to perform operations that are not available through
the built-in system-defined functions provided by Snowflake. After you create a UDF, you can reuse it multiple times. For more information,
see [User-defined functions overview](../udf/udf-overview.md).

> **Note:**
>
> Calling UDFs by using the API is currently not supported.

The Snowflake Python APIs represents UDFs with two separate types:

* `UserDefinedFunction`: Exposes a UDF’s properties such as its name, list of arguments, return type, and function definition.
* `UserDefinedFunctionResource`: Exposes methods you can use to fetch a corresponding `UserDefinedFunction` object, rename the
  UDF, and drop the UDF.

### Creating a UDF

To create a UDF, first create a `UserDefinedFunction` object, and then create a `UserDefinedFunctionCollection` object from the
API `Root` object. Using `UserDefinedFunctionCollection.create`, add the new UDF to Snowflake.

When you create a UDF, you specify a handler whose code is written in one of the following supported languages.

#### Python

Code in the following example creates a `UserDefinedFunction` object that represents a UDF named `my_python_function` in the
`my_db` database and the `my_schema` schema, with the specified arguments, return type, language, and UDF Python definition:

```python
from snowflake.core.user_defined_function import (
    PythonFunction,
    ReturnDataType,
    UserDefinedFunction
)

function_of_python = UserDefinedFunction(
    "my_python_function",
    arguments=[],
    return_type=ReturnDataType(datatype="VARIANT"),
    language_config=PythonFunction(runtime_version="3.12", packages=[], handler="udf"),
    body="""
def udf():
    return {"key": "value"}
    """,
)

root.databases["my_db"].schemas["my_schema"].user_defined_functions.create(function_of_python)
```

#### Java

Code in the following example creates a `UserDefinedFunction` object that represents a UDF named `my_java_function` in the
`my_db` database and the `my_schema` schema, with the specified arguments, return type, language, and UDF Java definition:

```python
from snowflake.core.user_defined_function import (
    Argument,
    JavaFunction,
    ReturnDataType,
    UserDefinedFunction
)

function_body = """
    class TestFunc {
        public static String echoVarchar(String x) {
            return x;
        }
    }
"""

function_of_java = UserDefinedFunction(
    name="my_java_function",
    arguments=[Argument(name="x", datatype="STRING")],
    return_type=ReturnDataType(datatype="VARCHAR", nullable=True),
    language_config=JavaFunction(
        handler="TestFunc.echoVarchar",
        runtime_version="11",
        target_path=target_path,
        packages=[],
        called_on_null_input=True,
        is_volatile=True,
    ),
    body=function_body,
    comment="test_comment",
)

root.databases["my_db"].schemas["my_schema"].user_defined_functions.create(function_of_java)
```

#### JavaScript

Code in the following example creates a `UserDefinedFunction` object that represents a UDF named `my_javascript_function` in the
`my_db` database and the `my_schema` schema, with the specified arguments, return type, language, and UDF JavaScript definition:

```python
from snowflake.core.user_defined_function import (
    Argument,
    ReturnDataType,
    JavaScriptFunction,
    UserDefinedFunction
)

function_body = """
    if (D <= 0) {
        return 1;
    } else {
        var result = 1;
        for (var i = 2; i <= D; i++) {
            result = result * i;
        }
        return result;
    }
"""

function_of_javascript = UserDefinedFunction(
    name="my_javascript_function",
    arguments=[Argument(name="d", datatype="DOUBLE")],
    return_type=ReturnDataType(datatype="DOUBLE"),
    language_config=JavaScriptFunction(),
    body=function_body,
)

root.databases["my_db"].schemas["my_schema"].user_defined_functions.create(function_of_javascript)
```

#### Scala

Code in the following example creates a `UserDefinedFunction` object that represents a UDF named `my_scala_function` in the
`my_db` database and the `my_schema` schema, with the specified arguments, return type, language, and UDF Scala definition:

```python
from snowflake.core.user_defined_function import (
    Argument,
    ReturnDataType,
    ScalaFunction,
    UserDefinedFunction
)

function_body = """
    class Echo {
        def echoVarchar(x : String): String = {
            return x
        }
    }
"""

function_of_scala = UserDefinedFunction(
    name="my_scala_function",
    arguments=[Argument(name="x", datatype="VARCHAR")],
    return_type=ReturnDataType(datatype="VARCHAR"),
    language_config=ScalaFunction(
        runtime_version="2.12", handler="Echo.echoVarchar", target_path=target_path, packages=[]
    ),
    body=function_body,
    comment="test_comment",
)

root.databases["my_db"].schemas["my_schema"].user_defined_functions.create(function_of_scala)
```

#### SQL

Code in the following example creates a `UserDefinedFunction` object that represents a UDF named `my_sql_function` in the
`my_db` database and the `my_schema` schema, with the specified arguments, return type, language, and UDF SQL definition:

```python
from snowflake.core.user_defined_function import (
    ReturnDataType,
    SQLFunction,
    UserDefinedFunction
)

function_body = """3.141592654::FLOAT"""

function_of_sql = UserDefinedFunction(
    name="my_sql_function",
    arguments=[],
    return_type=ReturnDataType(datatype="FLOAT"),
    language_config=SQLFunction(),
    body=function_body,
)

root.databases["my_db"].schemas["my_schema"].user_defined_functions.create(function_of_sql)
```

### Getting UDF details

You can get information about a UDF by calling the `UserDefinedFunctionResource.fetch` method, which returns a
`UserDefinedFunction` object.

Code in the following example fetches information about the `my_javascript_function(DOUBLE)` UDF in the `my_db` database and the
`my_schema` schema:

> **Note:**
>
> When getting a UDF resource object, you must specify the full signature (the UDF name and its parameter data types) because UDFs can be
> overloaded.

```python
my_udf = root.databases["my_db"].schemas["my_schema"].user_defined_functions["my_javascript_function(DOUBLE)"].fetch()
print(my_udf.to_dict())
```

### Listing UDFs

You can list UDFs using the `UserDefinedFunctionCollection.iter` method, which returns a `PagedIter` iterator of
`UserDefinedFunction` objects.

Code in the following example lists UDFs whose name starts with `my_java` in the `my_db` database and the `my_schema` schema, and
then prints the name of each:

```python
udf_iter = root.databases["my_db"].schemas["my_schema"].user_defined_functions.iter(like="my_java%")
for udf_obj in udf_iter:
    print(udf_obj.name)
```

### Renaming a UDF

You can rename a UDF with a `UserDefinedFunctionResource` object.

Code in the following example gets the `my_javascript_function(DOUBLE)` UDF resource object in the `my_db` database and the
`my_schema` schema, and then renames the UDF to `my_other_js_function` while also moving it to the `my_other_db` database and the
`my_other_schema` schema:

```python
root.databases["my_db"].schemas["my_schema"].user_defined_functions["my_javascript_function(DOUBLE)"].rename(
    "my_other_js_function",
    target_database = "my_other_database",
    target_schema = "my_other_schema"
)
```

### Dropping a UDF

You can drop a UDF with a `UserDefinedFunctionResource` object.

Code in the following example gets the `my_javascript_function(DOUBLE)` UDF resource object and then drops the UDF:

```python
my_udf_res = root.databases["my_db"].schemas["my_schema"].user_defined_functions["my_javascript_function(DOUBLE)"]
my_udf_res.drop()
```

## Managing stored procedures

You can manage stored procedures, which you can write to extend the system with procedural code that executes SQL. In a stored procedure,
you can use programmatic constructs to perform branching and looping. After you create a stored procedure, you can reuse it multiple times.
For more information, see [Stored procedures overview](../stored-procedure/stored-procedures-overview.md).

The Snowflake Python APIs represents procedures with two separate types:

* `Procedure`: Exposes a procedure’s properties such as its name, list of arguments, return type, and procedure definition.
* `ProcedureResource`: Exposes methods you can use to fetch a corresponding `Procedure` object, call the procedure, and drop
  the procedure.

### Creating a procedure

To create a procedure, first create a `Procedure` object, and then create a `ProcedureCollection` object from the API
`Root` object. Using `ProcedureCollection.create`, add the new procedure to Snowflake.

Code in the following example creates a `Procedure` object that represents a procedure named `my_procedure` in the `my_db`
database and the `my_schema` schema, with the specified arguments, return type, and SQL procedure definition:

```python
from snowflake.core.procedure import Argument, ColumnType, Procedure, ReturnTable, SQLFunction

procedure = Procedure(
    name="my_procedure",
    arguments=[Argument(name="id", datatype="VARCHAR")],
    return_type=ReturnTable(
        column_list=[
            ColumnType(name="id", datatype="NUMBER"),
            ColumnType(name="price", datatype="NUMBER"),
        ]
    ),
    language_config=SQLFunction(),
    body="""
        DECLARE
            res RESULTSET DEFAULT (SELECT * FROM invoices WHERE id = :id);
        BEGIN
            RETURN TABLE(res);
        END;
    """,
)

procedures = root.databases["my_db"].schemas["my_schema"].procedures
procedures.create(procedure)
```

### Calling a procedure

You can call a procedure with a `ProcedureResource` object.

Code in the following example gets the `my_procedure(VARCHAR)` procedure resource object, creates a `CallArgumentList` object, and
then calls the procedure using that list of arguments.

> **Note:**
>
> When getting a procedure resource object, you must specify the full signature (the procedure name and its parameter data types) because
> procedures can be overloaded.

```python
from snowflake.core.procedure import CallArgument, CallArgumentList

procedure_reference = root.databases["my_db"].schemas["my_schema"].procedures["my_procedure(VARCHAR)"]
call_argument_list = CallArgumentList(call_arguments=[
    CallArgument(name="id", datatype="VARCHAR", value="1"),
])
procedure_reference.call(call_argument_list=call_argument_list, extract=False)
```

### Getting procedure details

You can get information about a procedure by calling the `ProcedureResource.fetch` method, which returns a `Procedure` object.

Code in the following example fetches information about the `my_procedure(VARCHAR)` procedure in the `my_db` database and the
`my_schema` schema:

```python
my_procedure = root.databases["my_db"].schemas["my_schema"].procedures["my_procedure(VARCHAR)"].fetch()
print(my_procedure.to_dict())
```

### Listing procedures

You can list procedures using the `ProcedureCollection.iter` method, which returns a `PagedIter` iterator of `Procedure`
objects.

Code in the following example lists procedures whose name starts with `my` in the `my_db` database and the `my_schema` schema, and
then prints the name of each:

```python
procedure_iter = root.databases["my_db"].schemas["my_schema"].procedures.iter(like="my%")
for procedure_obj in procedure_iter:
    print(procedure_obj.name)
```

### Dropping a procedure

You can drop a procedure with a `ProcedureResource` object.

Code in the following example gets the `my_procedure(VARCHAR)` procedure resource object in the `my_db` database and the `my_schema`
schema, and then drops the procedure.

```python
my_procedure_res = root.databases["my_db"].schemas["my_schema"].procedures["my_procedure(VARCHAR)"]
my_procedure_res.drop()
```

---
title: Managing Snowflake integrations with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-integrations.md
section: Developer Guide
---

# Managing Snowflake integrations with Python

You can use Python to manage different types of integrations in Snowflake.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing catalog integrations

You can manage catalog integrations for Apache Iceberg™ tables in your account. A catalog integration is a named, account-level Snowflake
object that stores information about how your Iceberg table metadata is organized for scenarios when you don’t use Snowflake as the Iceberg
catalog, or when you want to integrate with Snowflake Open Catalog. For more information, see the [Catalog integration](../../user-guide/tables-iceberg.md)
section in [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

> **Note:**
>
> [ALTER CATALOG INTEGRATION](../../sql-reference/sql/alter-catalog-integration.md) is currently not supported.

The Snowflake Python APIs represents catalog integrations with two separate types:

* `CatalogIntegration`: Exposes a catalog integration’s properties such as its name, table format, and catalog settings.
* `CatalogIntegrationResource`: Exposes methods you can use to fetch a corresponding `CatalogIntegration` object and drop the
  catalog integration.

### Creating a catalog integration

To create a catalog integration, first create a `CatalogIntegration` object, and then create a `CatalogIntegrationCollection`
object from the API `Root` object. Using `CatalogIntegrationCollection.create`, add the new catalog integration to Snowflake.

You can create catalog integrations in your account for the following types of external Iceberg catalogs.

#### AWS Glue

Code in the following example creates a `CatalogIntegration` object that represents a catalog integration named
`my_catalog_integration` for Iceberg tables that use AWS Glue with the specified properties:

```python
from snowflake.core.catalog_integration import CatalogIntegration, Glue

root.catalog_integrations.create(CatalogIntegration(
    name="my_catalog_integration",
    catalog = Glue(
        catalog_namespace="abcd-ns",
        glue_aws_role_arn="arn:aws:iam::123456789012:role/sqsAccess",
        glue_catalog_id="1234567",
    ),
    table_format="ICEBERG",
    enabled=True,
))
```

#### Object store

Code in the following example creates a `CatalogIntegration` object that represents a catalog integration named
`my_catalog_integration` for Iceberg tables that use an object store:

```python
from snowflake.core.catalog_integration import CatalogIntegration, ObjectStore

root.catalog_integrations.create(CatalogIntegration(
    name="my_catalog_integration",
    catalog = ObjectStore(),
    table_format="ICEBERG",
    enabled=True,
))
```

#### Snowflake Open Catalog

Code in the following example creates a `CatalogIntegration` object that represents a catalog integration named
`my_catalog_integration` for Iceberg tables that use Open Catalog with the specified properties:

```python
from snowflake.core.catalog_integration import CatalogIntegration, OAuth, Polaris, RestConfig

root.catalog_integrations.create(CatalogIntegration(
    name="my_catalog_integration",
    catalog = Polaris(
        catalog_namespace="abcd-ns",
        rest_config=RestConfig(
            catalog_uri="https://my_account.snowflakecomputing.com/polaris/api/catalog",
            warehouse="my-warehouse",
        ),
        rest_authentication=OAuth(
            type="OAUTH",
            oauth_client_id="my_client_id",
            oauth_client_secret="my_client_secret",
            oauth_allowed_scopes=["PRINCIPAL_ROLE:ALL"],
        ),
    ),
    table_format="ICEBERG",
    enabled=True,
))
```

### Getting catalog integration details

You can get information about a catalog integration by calling the `CatalogIntegrationResource.fetch` method, which returns a
`CatalogIntegration` object.

Code in the following example gets information about a catalog integration named `my_catalog_integration`:

```python
my_catalog_integration = root.catalog_integrations["my_catalog_integration"].fetch()
print(my_catalog_integration.to_dict())
```

### Listing catalog integrations

You can list catalog integrations using the `CatalogIntegrationCollection.iter` method, which returns a `PagedIter` iterator of
`CatalogIntegration` objects.

Code in the following example lists catalog integrations whose name starts with `my`, and prints the name of each:

```python
catalog_integration_iter = root.catalog_integrations.iter(like="my%")
for catalog_integration_obj in catalog_integration_iter:
  print(catalog_integration_obj.name)
```

### Dropping a catalog integration

You can drop a catalog integration with a `CatalogIntegrationResource` object.

Code in the following example gets the `my_catalog_integration` catalog integration resource object and then drops the catalog
integration.

```python
my_catalog_integration_res = root.catalog_integrations["my_catalog_integration"]
my_catalog_integration_res.drop()
```

## Managing notification integrations

You can manage notification integrations, which are Snowflake objects that provide an interface between Snowflake and third-party messaging
services such as third-party cloud message queuing services, email services, and webhooks. For more information, see
[Notifications in Snowflake](../../user-guide/notifications/about-notifications.md).

> **Note:**
>
> [ALTER NOTIFICATION INTEGRATION](../../sql-reference/sql/alter-notification-integration.md) is currently not supported.

The Snowflake Python APIs represents notification integrations with two separate types:

* `NotificationIntegration`: Exposes a notification integration’s properties such as its name and notification hook settings.
* `NotificationIntegrationResource`: Exposes methods you can use to fetch a corresponding `NotificationIntegration` object and
  drop the notification integration.

### Creating a notification integration

To create a notification integration, first create a `NotificationIntegration` object, and then create a
`NotificationIntegrationCollection` object from the API `Root` object. Using `NotificationIntegrationCollection.create`,
add the new notification integration to Snowflake.

You can create a notification integration for the following types of messaging services.

#### Email

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_email_notification_integration` with the specified `NotificationEmail` properties:

```python
from snowflake.core.notification_integration import NotificationEmail, NotificationIntegration

my_notification_integration = NotificationIntegration(
  name="my_email_notification_integration",
  notification_hook=NotificationEmail(
      allowed_recipients=["test1@snowflake.com", "test2@snowflake.com"],
      default_recipients=["test1@snowflake.com"],
      default_subject="test default subject",
  ),
  enabled=True,
)

root.notification_integrations.create(my_notification_integration)
```

#### Webhooks

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_webhook_notification_integration` with the specified `NotificationWebhook` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationWebhook

my_notification_integration = NotificationIntegration(
  name="my_webhook_notification_integration",
  enabled=False,
  notification_hook=NotificationWebhook(
      webhook_url=webhook_url,
      webhook_secret=WebhookSecret(
          # This example assumes that this secret already exists
          name="mySecret".upper(), database_name=database, schema_name=schema
      ),
      webhook_body_template=webhook_template,
      webhook_headers=webhook_headers,
  ),
)

root.notification_integrations.create(my_notification_integration)
```

#### Amazon SNS topics (outbound)

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_aws_sns_outbound_notification_integration` with the specified `NotificationQueueAwsSnsOutbound` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationQueueAwsSnsOutbound

my_notification_integration = NotificationIntegration(
  name="my_aws_sns_outbound_notification_integration",
  enabled=False,
  notification_hook=NotificationQueueAwsSnsOutbound(
      aws_sns_topic_arn="arn:aws:sns:us-west-1:123456789012:sns-test-topic",
      aws_sns_role_arn="arn:aws:iam::123456789012:role/sns-test-topic",
  )
)

root.notification_integrations.create(my_notification_integration)
```

#### Azure Event Grid topics (outbound)

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_azure_outbound_notification_integration` with the specified `NotificationQueueAzureEventGridOutbound` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationQueueAzureEventGridOutbound

my_notification_integration = NotificationIntegration(
  name="my_azure_outbound_notification_integration",
  enabled=False,
  notification_hook=NotificationQueueAzureEventGridOutbound(
      azure_event_grid_topic_endpoint="https://fake.queue.core.windows.net/api/events",
      azure_tenant_id="fake.onmicrosoft.com",
  )
)

root.notification_integrations.create(my_notification_integration)
```

#### Azure Event Grid topics (inbound)

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_azure_inbound_notification_integration` with the specified `NotificationQueueAzureEventGridInbound` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationQueueAzureEventGridInbound

my_notification_integration = NotificationIntegration(
  name="my_azure_inbound_notification_integration",
  enabled=False,
  notification_hook=NotificationQueueAzureEventGridInbound(
      azure_storage_queue_primary_uri="https://fake.queue.core.windows.net/snowapi_queue",
      azure_tenant_id="fake.onmicrosoft.com",
  ),
)

root.notification_integrations.create(my_notification_integration)
```

#### Google Pub/Sub topics (outbound)

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_gcp_outbound_notification_integration` with the specified `NotificationQueueGcpPubsubOutbound` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationQueueGcpPubsubOutbound

my_notification_integration = NotificationIntegration(
  name="my_gcp_outbound_notification_integration",
  enabled=False,
  notification_hook=NotificationQueueGcpPubsubOutbound(
      gcp_pubsub_topic_name="projects/fake-project-name/topics/pythonapi-test",
  )
)

root.notification_integrations.create(my_notification_integration)
```

#### Google Pub/Sub topics (inbound)

Code in the following example creates a `NotificationIntegration` object that represents a notification integration named
`my_gcp_inbound_notification_integration` with the specified `NotificationQueueGcpPubsubInbound` properties:

```python
from snowflake.core.notification_integration import NotificationIntegration, NotificationQueueGcpPubsubInbound

my_notification_integration = NotificationIntegration(
  name="my_gcp_inbound_notification_integration",
  enabled=True,
  notification_hook=NotificationQueueGcpPubsubInbound(
      gcp_pubsub_subscription_name="projects/fake-project-name/subscriptions/sub-test",
  )
)

root.notification_integrations.create(my_notification_integration)
```

### Getting notification integration details

You can get information about a notification integration by calling the `NotificationIntegrationResource.fetch` method, which returns
a `NotificationIntegration` object.

Code in the following example gets information about a notification integration named `my_notification_integration`:

```python
my_notification_integration = root.notification_integrations["my_notification_integration"].fetch()
print(my_notification_integration.to_dict())
```

### Listing notification integrations

You can list notification integrations using the `NotificationIntegrationCollection.iter` method, which returns a `PagedIter`
iterator of `NotificationIntegration` objects.

Code in the following example lists notification integrations whose name starts with `my`, and prints the name of each:

```python
notification_integration_iter = root.notification_integrations.iter(like="my%")
for notification_integration_obj in notification_integration_iter:
  print(notification_integration_obj.name)
```

### Dropping a notification integration

You can drop a notification integration with a `NotificationIntegrationResource` object.

Code in the following example gets the `my_notification_integration` notification integration resource object and then drops the
notification integration.

```python
my_notification_integration_res = root.notification_integrations["my_notification_integration"]
my_notification_integration_res.drop()
```

---
title: Managing Snowflake network policies with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-network-policies.md
section: Developer Guide
---

# Managing Snowflake network policies with Python

You can use Python to manage Snowflake network policies, which you can use to control inbound access to the Snowflake service and internal
stage. For more information, see [Controlling network traffic with network policies](../../user-guide/network-policies.md).

> **Note:**
>
> [ALTER NETWORK POLICY](../../sql-reference/sql/alter-network-policy.md) is currently not supported.

The Snowflake Python APIs represents network policies with two separate types:

* `NetworkPolicy`: Exposes a network policy’s properties such as its name, network rules, and allowed and blocked IP lists.
* `NetworkPolicyResource`: Exposes methods you can use to fetch a corresponding `NetworkPolicy` object and drop the network
  policy.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a network policy

To create a network policy, first create a `NetworkPolicy` object, and then create a `NetworkPolicyCollection` object from the
API `Root` object. Using `NetworkPolicyCollection.create`, add the new network policy to Snowflake.

Code in the following example creates a `NetworkPolicy` object that represents a network policy named `my_network_policy` with
the specified allowed and blocked network rules and IP addresses:

```python
from snowflake.core.network_policy import NetworkPolicy

my_network_policy = NetworkPolicy(
  name = 'my_network_policy',
  allowed_network_rule_list = ['allowed_network_rule1','allowed_network_rule2'],
  blocked_network_rule_list = ['blocked_network_rule1','blocked_network_rule2'],
  allowed_ip_list=['8.8.8.8'],
  blocked_ip_list=['192.100.123.0'],
)

root.network_policies.create(my_network_policy)
```

## Getting network policy details

You can get information about a network policy by calling the `NetworkPolicyResource.fetch` method, which returns a
`NetworkPolicy` object.

Code in the following example gets information about a network policy named `my_network_policy`:

```python
my_network_policy = root.network_policies["my_network_policy"].fetch()
print(my_network_policy.to_dict())
```

## Listing network policies

You can list network policies using the `NetworkPolicyCollection.iter` method, which returns a `PagedIter` iterator of
`NetworkPolicy` objects.

Code in the following example lists network policies whose name starts with `my` and prints the name of each:

```python
network_policy_iter = root.network_policies.iter(like="my%")  # returns a PagedIter[NetworkPolicy]
for network_policy_obj in network_policy_iter:
  print(network_policy_obj.name)
```

## Dropping a network policy

You can drop a network policy with a `NetworkPolicyResource` object.

Code in the following example gets the `my_network_policy` network policy resource object and then drops the network policy.

```python
my_network_policy_res = root.network_policies["my_network_policy"]
my_network_policy_res.drop()
```

---
title: Managing Snowflake Notebooks with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-notebooks.md
section: Developer Guide
---

# Managing Snowflake Notebooks with Python

You can use Python to manage Snowflake Notebooks, which is a development interface in Snowsight that offers an interactive, cell-based
programming environment for Python and SQL. For more information, see [About Legacy Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md).

The Snowflake Python APIs represents notebooks with two separate types:

* `Notebook`: Exposes a notebook’s properties such as its name, version, query warehouse, and `.ipynb` file.
* `NotebookResource`: Exposes methods you can use to fetch a corresponding `Notebook` object, manage versions of the notebook,
  and execute the notebook.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a notebook

To create a notebook, first create a `Notebook` object, and then create a `NotebookCollection` object from the API `Root`
object. Using `NotebookCollection.create`, add the new notebook to Snowflake.

Code in the following example creates a `Notebook` object that represents a notebook named `my_nb` in the `my_db` database
and the `my_schema` schema:

```python
from snowflake.core.notebook import Notebook

my_nb = Notebook(name="my_nb")

notebooks = root.databases["my_db"].schemas["my_schema"].notebooks
notebooks.create(my_nb)
```

The code creates a `NotebookCollection` variable `notebooks` and uses `NotebookCollection.create` to create a new
notebook in Snowflake.

You can also create a notebook from a stage with an existing `.ipynb` file. Code in the following example creates a notebook from the
`@my_stage` stage with the `notebook_file.ipynb` file:

```python
from snowflake.core.notebook import Notebook

my_nb = Notebook(name="my_nb",
  query_warehouse="my_wh",
  from_location="@my_stage",
  main_file="notebook_file.ipynb")

notebooks = root.databases["my_db"].schemas["my_schema"].notebooks
notebooks.create(my_nb)
```

## Getting notebook details

You can get information about a notebook by calling the `NotebookResource.fetch` method, which returns a `Notebook` object.

Code in the following example gets information about a notebook named `my_nb` in the `my_db` database and the `my_schema` schema:

```python
my_nb = root.databases["my_db"].schemas["my_schema"].notebooks["my_nb"].fetch()
print(my_nb.to_dict())
```

## Listing notebooks

You can list notebooks using the `NotebookCollection.iter` method, which returns a `PagedIter` iterator of
`Notebook` objects.

Code in the following example lists notebooks whose name starts with `my` in the `my_db` database and the `my_schema` schema, and
then prints the name of each:

```python
from snowflake.core.notebook import NotebookCollection

notebooks: NotebookCollection = root.databases["my_db"].schemas["my_schema"].notebooks
nb_iter = notebooks.iter(like="my%")  # returns a PagedIter[Notebook]
for nb_obj in nb_iter:
  print(nb_obj.name)
```

## Performing notebook operations

You can perform common notebook operations—such as managing versions and executing notebooks—with a `NotebookResource` object.

To demonstrate some operations you can do with a notebook resource, code in the following example does the following:

1. Gets the `my_nb` notebook resource object.
2. Adds a lives version to the notebook object. This is equivalent to [ALTER NOTEBOOK … ADD LIVE VERSION](../../sql-reference/sql/alter-notebook.md).
3. Commits the live version of the notebook to a Git repository, if a Git connection is set up. Otherwise, sets the live version to `null`.

   For more information, see [ALTER NOTEBOOK](../../sql-reference/sql/alter-notebook.md).
4. Executes the notebook.

   > **Note:**
   >
   > To execute a notebook, you must add a live version to it first.
5. Drops the notebook.

```python
my_nb_res = root.databases["my_db"].schemas["my_schema"].notebooks["my_nb"]

my_nb_res.add_live_version(from_last=True)
my_nb_res.commit()
my_nb_res.execute()
my_nb_res.drop()
```

---
title: Managing Snowflake streams with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-streams.md
section: Developer Guide
---

# Managing Snowflake streams with Python

You can use Python to manage Snowflake streams, which are objects that record data manipulation language (DML) changes made to tables,
including inserts, updates, and deletes, as well as metadata about each change. For more information, see [Introduction to streams](../../user-guide/streams-intro.md).

> **Note:**
>
> [ALTER STREAM](../../sql-reference/sql/alter-stream.md) is currently not supported.

The Snowflake Python APIs represents streams with two separate types:

* `Stream`: Exposes a stream’s properties such as its name, target lag, warehouse, and query statement.
* `StreamResource`: Exposes methods you can use to fetch a corresponding `Stream` object, suspend and resume the stream, and
  drop the stream.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a stream

To create a stream, first create a `Stream` object, and then create a `StreamCollection` object from the API `Root`
object. Using `StreamCollection.create`, add the new stream to Snowflake.

You can create a stream on the following object types:

* Standard tables
* Views
* Directory tables

### On a source table

Code in the following example creates a `Stream` object that represents a stream named `my_stream_on_table` on the source table
`my_table` in the `my_db` database and the `my_schema` schema, with the specified stream properties:

> **Note:**
>
> The `StreamSourceTable` type only supports standard tables. Other types of tables—such as dynamic tables, event tables, external
> tables, and Iceberg tables—are currently not supported.

```python
from snowflake.core.stream import PointOfTimeOffset, Stream, StreamSourceTable

stream_on_table = Stream(
  "my_stream_on_table",
  StreamSourceTable(
      point_of_time = PointOfTimeOffset(reference="before", offset="1"),
      name = 'my_table',
      append_only = True,
      show_initial_rows = False,
  ),
  comment = 'create stream on table'
)

streams = root.databases['my_db'].schemas['my_schema'].streams
streams.create(stream_on_table)
```

The code creates a `StreamCollection` variable `streams` and uses `StreamCollection.create` to create a new stream in
Snowflake.

### On a source view

Code in the following example creates a `Stream` object that represents a stream named `my_stream_on_view` on the source view
`my_view` in the `my_db` database and the `my_schema` schema, with the specified stream properties:

```python
from snowflake.core.stream import PointOfTimeOffset, Stream, StreamSourceView

stream_on_view = Stream(
  "my_stream_on_view",
  StreamSourceView(
      point_of_time = PointOfTimeOffset(reference="before", offset="1"),
      name = 'my_view',
  ),
  comment = 'create stream on view'
)

streams = root.databases['my_db'].schemas['my_schema'].streams
streams.create(stream_on_view)
```

### On a source directory table

Code in the following example creates a `Stream` object that represents a stream named `my_stream_on_directory_table` on the source
directory table `my_directory_table` in the `my_db` database and the `my_schema` schema, with the specified stream properties:

```python
from snowflake.core.stream import PointOfTimeOffset, Stream, StreamSourceStage

stream_on_directory_table = Stream(
  "my_stream_on_directory_table",
  StreamSourceStage(
      point_of_time = PointOfTimeOffset(reference="before", offset="1"),
      name = 'my_directory_table',
  ),
  comment = 'create stream on directory table'
)

streams = root.databases['my_db'].schemas['my_schema'].streams
streams.create(stream_on_directory_table)
```

### Cloning a stream

Code in the following example creates a new stream named `my_stream` with the same definition as the source stream `my_other_stream` in
the `my_db` database and the `my_schema` schema:

```python
from snowflake.core.stream import Stream

streams = root.databases['my_db'].schemas['my_schema'].streams
streams.create("my_stream", clone_stream="my_other_stream")
```

## Getting stream details

You can get information about a stream by calling the `StreamResource.fetch` method, which returns a `Stream` object.

Code in the following example gets information about a stream named `my_stream` in the `my_db` database and the `my_schema` schema:

```python
stream = root.databases['my_db'].schemas['my_schema'].streams['my_stream']
stream_details = stream.fetch()
print(stream_details.to_dict())
```

## Listing streams

You can list streams using the `StreamCollection.iter` method, which returns a `PagedIter` iterator of `Stream` objects.

Code in the following example lists streams whose name starts with `my` in the `my_db` database and the `my_schema` schema, and then
prints the name of each:

```python
stream_list = root.databases['my_db'].schemas['my_schema'].streams.iter(like='my%')
for stream_obj in stream_list:
  print(stream_obj.name)
```

Code in the following example also lists streams whose name begins with `my`, but it uses the `starts_with` parameter instead of
`like`. This example also sets the optional parameter `show_limit=10` to limit the number of results to `10`:

```python
stream_list = root.databases['my_db'].schemas['my_schema'].streams.iter(starts_with="my", show_limit=10)
for stream_obj in stream_list:
  print(stream_obj.name)
```

## Dropping a stream

You can drop a stream with a `StreamResource` object.

Code in the following example gets the `my_stream` stream resource object and then drops the stream.

```python
my_stream_res = root.streams["my_stream"]
my_stream_res.drop()
```

---
title: Managing Snowflake tasks and task graphs with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-tasks.md
section: Developer Guide
---

# Managing Snowflake tasks and task graphs with Python

You can use Python to manage Snowflake tasks, with which you can execute SQL statements, procedure calls, and logic in
[Snowflake Scripting](../snowflake-scripting/index.md). For an overview of tasks, see [Introduction to tasks](../../user-guide/tasks-intro.md).

The Snowflake Python APIs represents tasks with two separate types:

* `Task`: Exposes a task’s properties such as its schedule, parameters, and predecessors.
* `TaskResource`: Exposes methods you can use to fetch a corresponding `Task` object, execute the task, and alter the task.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a task

To create a task, first create a `Task` object. Then, specifying the database and schema in which to create the task,
create a `TaskCollection` object. Using `TaskCollection.create`, add the new task to Snowflake.

Code in the following example creates a `Task` object representing a task named `my_task` that runs a SQL query specified in
the `definition` parameter:

```python
from datetime import timedelta
from snowflake.core.task import Task

my_task = Task(name="my_task", definition="<sql query>", schedule=timedelta(hours=1))
tasks = root.databases['my_db'].schemas['my_schema'].tasks
tasks.create(my_task)
```

This code creates a `TaskCollection` variable `tasks` from the `my_db` database and the
`my_schema` schema. Using `TaskCollection.create`, it creates a new task in Snowflake.

This code example also specifies a `timedelta` value of one hour for the task’s schedule. You can define the schedule of a task using
either a `timedelta` value or a `Cron` expression.

You can also create a task that runs a Python function or a stored procedure. Code in the following example creates a task named
`my_task2` that runs a function represented by a `StoredProcedureCall` object:

```python
from snowflake.core.task import StoredProcedureCall, Task

my_task2 = Task(
  "my_task2",
  StoredProcedureCall(
      dosomething, stage_location="@mystage"
  ),
  warehouse="test_warehouse"
)
tasks = root.databases['my_db'].schemas['my_schema'].tasks
tasks.create(my_task2)
```

This object specifies a function named
`dosomething` located in the `@mystage` stage location. You must also specify a `warehouse` when creating a task with a `StoredProcedureCall` object.

## Creating or altering a task

You can set properties of a `Task` object and pass it to the `TaskResource.create_or_alter` method to create a task if it
doesn’t exist, or alter it according to the task definition if it does exist. The behavior of `create_or_alter` is intended to be
idempotent, which means that the resulting task object will be the same regardless of whether the task exists before you call the method.

> **Note:**
>
> The `create_or_alter` method uses default values for any [Task](https://docs.snowflake.com/en/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.Task)
> properties that you don’t explicitly define. For example, if you don’t set `schedule`, its value defaults to `None` even if the
> task previously existed with a different value.

Code in the following example updates the definition and schedule of the `my_task` task, and then alters the task on Snowflake:

```python
from datetime import timedelta
from snowflake.core.task import Task

my_task = root.databases['my_db'].schemas['my_schema'].tasks['my_task'].fetch()
my_task.definition = "<sql query 2>"
my_task.schedule = timedelta(hours=2)

my_task_res = root.databases['my_db'].schemas['my_schema'].tasks['my_task']
my_task_res.create_or_alter(my_task)
```

## Listing tasks

You can list tasks using the `TaskCollection.iter` method. The method returns a `PagedIter` iterator of `Task` objects.

Code in the following example lists tasks whose name begins with *my*:

```python
from snowflake.core.task import TaskCollection

tasks: TaskCollection = root.databases['my_db'].schemas['my_schema'].tasks
task_iter = tasks.iter(like="my%")  # returns a PagedIter[Task]
for task_obj in task_iter:
  print(task_obj.name)
```

## Performing task operations

You can perform common task operations—such as executing, suspending, and resuming tasks—with a `TaskResource` object.

Code in the following example executes, suspends, resumes, and drops the `my_task` task:

```python
tasks = root.databases['my_db'].schemas['my_schema'].tasks
task_res = tasks['my_task']

task_res.execute()
task_res.suspend()
task_res.resume()
task_res.drop()
```

## Managing tasks in a task graph

You can manage tasks collected in a task graph. A task graph is a series of tasks with a single root task and additional tasks
organized by their dependencies.

For more about tasks in a task graph, see [Create a sequence of tasks with a task graph](../../user-guide/tasks-graphs.md).

### Creating a task graph

To create a task graph, first create a `DAG` object that specifies its name and other optional properties, such as its schedule.
You can define the schedule of a task graph using either a `timedelta` value or a `Cron` expression.

Code in the following example defines a Python function `dosomething`, then specifies the function as a `DAGTask` object named
`dag_task2` in the task graph:

```python
from snowflake.core.task import StoredProcedureCall
from snowflake.core.task.dagv1 import DAG, DAGTask, DAGOperation
from snowflake.snowpark import Session
from snowflake.snowpark.functions import sum as sum_

def dosomething(session: Session) -> None:
  df = session.table("target")
  df.group_by("a").agg(sum_("b")).save_as_table("agg_table")

with DAG("my_dag", schedule=timedelta(days=1)) as dag:
  # Create a task that runs some SQL.
  dag_task1 = DAGTask(
    "dagtask1",
    "MERGE INTO target USING source_stream WHEN MATCHED THEN UPDATE SET target.v = source_stream.v"
  )
  # Create a task that runs a Python function.
  dag_task2 = DAGTask(
    StoredProcedureCall(
      dosomething, stage_location="@mystage",
      packages=["snowflake-snowpark-python"]
    ),
    warehouse="test_warehouse"
  )
# Shift right and left operators can specify task relationships.
dag_task1 >> dag_task2  # dag_task1 is a predecessor of dag_task2
schema = root.databases["my_db"].schemas["my_schema"]
dag_op = DAGOperation(schema)
dag_op.deploy(dag)
```

This code also defines a SQL statement as another `DAGTask` object named `dag_task1`, and then specifies
`dag_task1` as a predecessor of `dag_task2`. Finally, it deploys the task graph to Snowflake in the `my_db` database and the
`my_schema` schema.

### Creating a task graph with a cron schedule, task branches, and function return values

You can also create a task graph with a specified cron schedule, task branches, and function return values that are used as task return
values.

Code in the following example creates a `DAG` object with a `Cron` object specifying its schedule. It defines a
`DAGTaskBranch` object named `task1_branch` along with other `DAGTask` objects, and specifies their dependencies to one
another:

```python
from snowflake.core._common import CreateMode
from snowflake.core.task import Cron
from snowflake.core.task.dagv1 import DAG, DAGTask, DAGOperation, DAGTaskBranch
from snowflake.snowpark import Session

def task_handler(session: Session) -> None:
  pass  # do something

def task_branch_handler(session: Session) -> str:
  # do something
  return "task3"

try:
  with DAG(
    "my_dag",
    schedule=Cron("10 * * * *", "America/Los_Angeles"),
    stage_location="@mystage",
    packages=["snowflake-snowpark-python"],
    use_func_return_value=True,
  ) as dag:
    task1 = DAGTask(
      "task1",
      task_handler,
      warehouse=test_warehouse,
    )
    task1_branch = DAGTaskBranch("task1_branch", task_branch_handler, warehouse=test_warehouse)
    task2 = DAGTask("task2", task_handler, warehouse=test_warehouse)
    task3 = DAGTask("task3", task_handler, warehouse=test_warehouse, condition="1=1")
    task1 >> task1_branch
    task1_branch >> [task2, task3]
  schema = root.databases["my_db"].schemas["my_schema"]
  op = DAGOperation(schema)
  op.deploy(dag, mode=CreateMode.or_replace)
finally:
  session.close()
```

This code example also defines task handler functions and creates each `DAGTask` and `DAGTaskBranch` object with a specified
task handler assigned to the task. The code sets the DAG’s `use_func_return_value` parameter to `True`, which specifies to use the
Python function’s return value as the corresponding task’s return value. Otherwise the default value of `use_func_return_value` is
`False`.

### Setting and getting the return value of a task in a task graph

When a task’s definition is a `StoredProcedureCall` object, the handler of the stored procedure (or function) can explicitly set the
return value of the task by using a `TaskContext` object.

For more information, see [SYSTEM$SET_RETURN_VALUE](../../sql-reference/functions/system_set_return_value.md).

Code in the following example defines a task handler function that creates a `TaskContext` object named `context` from the
current session. Then it uses the `TaskContext.set_return_value` method to explicitly set the return value to a specified string:

```python
from snowflake.core.task.context import TaskContext
from snowflake.snowpark import Session

def task_handler(session: Session) -> None:
  context = TaskContext(session)
  # this return value can be retrieved by successor Tasks.
  context.set_return_value("predecessor_return_value")
```

In a task graph, an immediate successor task that identifies the previous task as its predecessor can then retrieve the return value
explicitly set by the predecessor task.

For more information, see [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](../../sql-reference/functions/system_get_predecessor_return_value.md).

Code in the following example defines a task handler function that uses the `TaskContext.get_predecessor_return_value` method to get
the return value of the predecessor task named `pred_task_name`:

```python
from snowflake.core.task.context import TaskContext
from snowflake.snowpark import Session

def task_handler(session: Session) -> None:
  context = TaskContext(session)
  pred_return_value = context.get_predecessor_return_value("pred_task_name")
```

---
title: Managing Snowflake users, roles, and grants with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-user-roles.md
section: Developer Guide
---

# Managing Snowflake users, roles, and grants with Python

You can use Python to manage Snowflake users, roles, and grants. For more information about managing users and their privileges in
Snowflake, see [User management](../../user-guide/admin-user-management.md).

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing users

You can manage users in Snowflake. A user is an account-level object in Snowflake. The Snowflake Python APIs represents users with two
separate types:

* `User`: Exposes a user’s properties, such as its name.
* `UserResource`: Exposes methods you can use to fetch a corresponding `User` object and to drop the user.

### Creating a user

You can create a user by calling the `UserCollection.create` method and passing a `User` object that represents the user you
want to create. To create a user, first create a `User` object that specifies the user name.

Code in the following example creates a `User` object representing a user named `my_user` and then creates the user by passing
the `User` object to the `UserCollection.create` method:

```python
from snowflake.core.user import User

my_user = User(name="my_user")
root.users.create(my_user)
```

### Getting user details

You can get information about a user by calling the `UserResource.fetch` method, which returns a `User` object.

Code in the following example gets information about a user named `my_user`:

```python
my_user = root.users["my_user"].fetch()
print(my_user.to_dict())
```

### Creating or altering a user

You can set properties of a `User` object and pass it to the `UserResource.create_or_alter` method to create a user if it
doesn’t exist, or alter it according to the user definition if it does exist. The behavior of `create_or_alter` is intended to be
idempotent, which means that the resulting user object will be the same regardless of whether the user exists before you call the method.

`create_or_alter` uses default values for any [User](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.user.User)
properties that you don’t explicitly define. For example, if you don’t set `snowflake_support`, its value defaults to `False` even
if the user previously existed with a different value.

> **Note:**
>
> The `create_or_alter` method currently does not support changing the `password` for an existing user. You can only set the
> password when creating a new user.

Code in the following example updates the first name, last name, and `must_change_password` properties of the `my_user` user, and
then alters the user on Snowflake:

```python
user_parameters = root.users["my_user"].fetch()
user_parameters.first_name="Snowy"
user_parameters.last_name="User"
user_parameters.must_change_password=False
root.users["my_user"].create_or_alter(user_parameters)
```

### Listing users

You can list users using the `iter` method, which returns a `PagedIter` iterator.

Code in the following example lists users whose name begins with `my`:

```python
users = root.users.iter(like="my%")
for user in users:
  print(user.name)
```

### Dropping a user

You can drop a user using the `UserResource.drop` method.

Code in the following example drops the `my_user` user:

```python
my_user_res = root.users["my_user"]
my_user_res.drop()
```

## Managing roles

You can manage roles in Snowflake. A role is an account-level object. The Snowflake Python APIs represents roles with two separate types:

* `Role`: Exposes a role’s properties, such as its name.
* `RoleResource`: Exposes methods you can use to grant and manage privileges on a corresponding `Role` object, and to drop the role.

### Creating a role

To create a role, first create a `Role` object that specifies the role name.

Code in the following example creates a `Role` object representing a role named `my_role`:

```python
from snowflake.core.role import Role

my_role = Role(name="my_role")
root.roles.create(my_role)
```

The code then creates the role by passing the `Role` object to the `RoleCollection.create` method.

### Using a role in a session

Code in the following example applies the role `my_role` in the current session.

```python
root.session.use_role("my_role")
```

### Listing roles

You can list the roles in an account using the `iter` method. The method returns a `PagedIter` iterator of `Role` objects.

Code in the following example lists all role names in an account:

```python
role_list = root.roles.iter()
for role_obj in role_list:
  print(role_obj.name)
```

### Dropping a role

You can drop a role using the `RoleResource.drop` method.

Code in the following example drops the `my_role` role:

```python
my_role_res = root.roles["my_role"]
my_role_res.drop()
```

## Managing database roles

You can manage [database roles](../../user-guide/security-access-control-considerations.md) in Snowflake. A database role is a database-level
object. The Snowflake Python APIs represents database roles with two separate types:

* `DatabaseRole`: Exposes a database role’s properties, such as its name and a comment.
* `DatabaseRoleResource`: Exposes methods you can use to grant and manage privileges on a corresponding `DatabaseRole` object,
  and to drop the database role.

### Creating a database role

To create a database role, first create a `DatabaseRole` object that specifies the role name.

Code in the following example creates a `DatabaseRole` object representing a database role named `my_db_role`:

```python
from snowflake.core.database_role import DatabaseRole

my_db_role = DatabaseRole(
  name="my_db_role",
  comment="sample comment"
)

my_db_role_ref = root.databases['my_db'].database_roles.create(my_db_role)
```

The code then creates the database role by passing the `DatabaseRole` object to the `DatabaseRoleCollection.create` method.

#### Cloning a database role

Code in the following example creates a database role named `dr2` in the `my_db_2` target database as a copy of the existing `dr1`
database role in the `my_db` database.

```python
database_role_ref = root.databases['my_db'].database_roles['dr1'].clone(target_database_role='dr2', target_database='my_db_2')
```

### Listing database roles

You can list the database roles in an account using the `iter` method. The method returns a `PagedIter` iterator of
`DatabaseRole` objects.

Code in the following example lists the database role named `my_db_role` in the `my_db` database, limiting the number of results to `1`:

```python
db_role_list = root.databases['my_db'].database_roles.iter(limit=1, from_name='my_db_role')
for db_role_obj in db_role_list:
  print(db_role_obj.name)
```

### Dropping a database role

You can drop a database role using the `DatabaseRoleResource.drop` method.

Code in the following example drops the `my_db_role` database role:

```python
root.databases['my_db'].database_roles['my_db_role'].drop()
```

## Managing access privileges

You can use the API to manage access privileges on a securable Snowflake object to an account role, database role, or user. For more
information about roles, securable objects, and the access control framework in Snowflake, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

### For account roles

The following code examples demonstrate the API operations to grant privileges, revoke privileges, and list grants for
[account roles](../../user-guide/security-access-control-overview.md).

#### Grant privileges

```python
from snowflake.core.role import Securable

root.roles['my_role'].grant_privileges(
    privileges=["OPERATE"], securable_type="WAREHOUSE", securable=Securable(name='my_wh')
)
```

#### Grant role

```python
from snowflake.core.role import Securable

root.roles['my_role'].grant_role(role_type="ROLE", role=Securable(name='my_role_1'))
```

#### Grant privileges on all

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].grant_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Grant future privileges

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].grant_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke privileges

```python
from snowflake.core.role import Securable

root.roles['my_role'].revoke_privileges(
    privileges=["OPERATE"], securable_type="WAREHOUSE", securable=Securable(name='my_wh')
)
```

#### Revoke role

```python
from snowflake.core.role import Securable

root.roles['my_role'].revoke_role(role_type="ROLE", role=Securable(name='my_role_1'))
```

#### Revoke privileges on all

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].revoke_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke future privileges

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].revoke_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke grant option for privileges

```python
from snowflake.core.role import Securable

 root.roles['my_role'].revoke_grant_option_for_privileges(
    privileges=["OPERATE"], securable_type="WAREHOUSE", securable=Securable(name='my_wh')
)
```

#### Revoke grant option for privileges on all

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].revoke_grant_option_for_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke grant option for future privileges

```python
from snowflake.core.role import ContainingScope

root.roles['my_role'].revoke_grant_option_for_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### List grants to the role

```python
root.roles['my_role'].iter_grants_to()
```

#### List grants on the role

```python
root.roles['my_role'].iter_grants_on()
```

#### List grants of the role

```python
root.roles['my_role'].iter_grants_of()
```

#### List future grants to the role

```python
root.roles['my_role'].iter_future_grants_to()
```

### For users

The following code examples demonstrate the API operations to grant a role, revoke a role, and list roles for users.

#### Grant role to a user

```python
from snowflake.core.user import Securable

root.users['my_user'].grant_role(role_type="ROLE", role=Securable(name='my_role'))
```

#### Revoke role from a user

```python
from snowflake.core.user import Securable

root.users['my_user'].revoke_role(role_type="ROLE", role=Securable(name='my_role'))
```

#### List roles granted to a user

```python
root.users['my_user'].iter_grants_to()
```

### For database roles

The following code examples demonstrate the API operations to grant privileges, revoke privileges, and list grants for
[database roles](../../user-guide/security-access-control-overview.md).

#### Grant privileges

```python
from snowflake.core.database_role import Securable

root.databases['my_db'].database_roles['my_db_role'].grant_privileges(
    privileges=["MODIFY"], securable_type="DATABASE", securable=Securable(name='my_db')
)
```

#### Grant role

```python
from snowflake.core.database_role import Securable

root.databases['my_db'].database_roles['my_db_role'].grant_role(role_type="DATABASE ROLE", role=Securable(name='my_db_role_1'))
```

#### Grant privileges on all

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].grant_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Grant future privileges

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].grant_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke privileges

```python
from snowflake.core.database_role import Securable

root.databases['my_db'].database_roles['my_db_role'].revoke_privileges(
    privileges=["MODIFY"], securable_type="DATABASE", securable=Securable(name='my_db')
)
```

#### Revoke role

```python
from snowflake.core.database_role import Securable

root.databases['my_db'].database_roles['my_db_role'].revoke_role(role_type="DATABASE ROLE", role=Securable(name='my_db_role_1'))
```

#### Revoke all privileges

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].revoke_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke future privileges

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].revoke_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke grant option for privileges

```python
from snowflake.core.database_role import Securable

root.databases['my_db'].database_roles['my_db_role'].revoke_grant_option_for_privileges(
    privileges=["MODIFY"], securable_type="DATABASE", securable=Securable(name='my_db')
)
```

#### Revoke grant option for privileges on all

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].revoke_grant_option_for_privileges_on_all(
    privileges=["SELECT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### Revoke grant option for future privileges

```python
from snowflake.core.database_role import ContainingScope

root.databases['my_db'].database_roles['my_db_role'].revoke_grant_option_for_future_privileges(
    privileges=["SELECT", "INSERT"],
    securable_type="TABLE",
    containing_scope=ContainingScope(database='my_db', schema='my_schema'),
)
```

#### List grants to the role

```python
root.databases['my_db'].database_roles['my_db_role'].iter_grants_to()
```

#### List future grants to the role

```python
root.databases['my_db'].database_roles['my_db_role'].iter_future_grants_to()
```

## Managing grants using the `Grant` resource — *Deprecated*

You can execute [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) operations to grant access privileges on a securable Snowflake object to a role.

### Granting privileges

To grant privileges on a Snowflake object, you first create a `Grant` object that specifies the following attributes:

* `grantee`: The role or user that is being granted the privileges.
* `securable`: The Snowflake object that is being secured by the privileges.
* `privileges`: The privileges that are being granted to a role.

#### Granting CREATE privileges in an account to a role

Code in the following example creates a `Grant` object representing a grant operation that grants the privileges `create_database`
and `create_warehouse` to the role `my_role` in the current Snowflake account. The code executes the operation using the
`root.grants.grant` method.

```python
from snowflake.core.grant import Grant
from snowflake.core.grant._grantee import Grantees
from snowflake.core.grant._privileges import Privileges
from snowflake.core.grant._securables import Securables

root.grants.grant(
  Grant(
    grantee=Grantees.role(name='my_role'),
    securable=Securables.current_account,
    privileges=[Privileges.create_database,
                Privileges.create_warehouse],
  )
)
```

#### Granting privileges on a database to a role

Code in the following example grants [imported privileges](../../user-guide/data-share-consumers.md) on the database `my_db`
to the role `my_role`:

```python
from snowflake.core.grant import Grant
from snowflake.core.grant._grantee import Grantees
from snowflake.core.grant._privileges import Privileges
from snowflake.core.grant._securables import Securables

root.grants.grant(
  Grant(
    grantee=Grantees.role('my_role'),
    securable=Securables.database('my_db'),
    privileges=[Privileges.imported_privileges],
  )
)
```

### Granting a role to another role

You can assign a role to another role to create a “parent-child” relationship between the roles (also referred to as a *role hierarchy*).

Code in the following example grants the `my_role` user role to the `ACCOUNTADMIN` system role:

```python
from snowflake.core.grant import Grant
from snowflake.core.grant._grantee import Grantees
from snowflake.core.grant._securables import Securables

root.grants.grant(
  Grant(
    grantee=Grantees.role('ACCOUNTADMIN'),
    securable=Securables.role('my_role'),
  )
)
```

---
title: Managing Snowflake virtual warehouses with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-warehouses.md
section: Developer Guide
---

# Managing Snowflake virtual warehouses with Python

You can use Python to manage Snowflake virtual warehouses, which are clusters of compute resources in Snowflake. For an overview of
warehouses, see [Virtual warehouses](../../user-guide/warehouses.md).

The Snowflake Python APIs represents warehouses with two separate types:

* `Warehouse`: Exposes a warehouse’s properties such as its name, size, type, and auto-resume and auto-suspend settings.
* `WarehouseResource`: Exposes methods you can use to fetch a corresponding `Warehouse` object, suspend and resume the
  warehouse, and drop the warehouse.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Creating a warehouse

To create a warehouse, first create a `Warehouse` object, and then create a `WarehouseCollection` object from the API `Root`
object. Using `WarehouseCollection.create`, add the new warehouse to Snowflake.

Code in the following example creates a `Warehouse` object that represents a warehouse named `my_wh`:

```python
from snowflake.core.warehouse import Warehouse

my_wh = Warehouse(
  name="my_wh",
  warehouse_size="SMALL",
  auto_suspend=600,
)
warehouses = root.warehouses
warehouses.create(my_wh)
```

The code creates a `WarehouseCollection` variable `warehouses` and uses `WarehouseCollection.create` to create a new warehouse in Snowflake.

## Getting warehouse details

You can get information about a warehouse by calling the `WarehouseResource.fetch` method, which returns a `Warehouse` object.

Code in the following example gets information about a warehouse named `my_wh`:

```python
my_wh = root.warehouses["my_wh"].fetch()
print(my_wh.to_dict())
```

## Creating or altering a warehouse

You can set properties of a `Warehouse` object and pass it to the `WarehouseResource.create_or_alter` method to create a
warehouse if it doesn’t exist, or alter it according to the warehouse definition if it does exist. The behavior of `create_or_alter`
is intended to be idempotent, which means that the resulting warehouse object will be the same regardless of whether the warehouse exists
before you call the method.

> **Note:**
>
> The `create_or_alter` method uses default values for any [Warehouse](https://docs.snowflake.com/en/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.warehouse.Warehouse)
> properties that you don’t explicitly define. For example, if you don’t set `auto_suspend`, its value defaults to `None` even if
> the warehouse previously existed with a different value.

Code in the following example updates the size and auto-suspend setting of the `my_wh` warehouse, and then alters the warehouse on
Snowflake.

```python
from snowflake.core.warehouse import Warehouse

my_wh = root.warehouses["my_wh"].fetch()
my_wh.warehouse_size = "LARGE"
my_wh.auto_suspend = 1800

my_wh_res = root.warehouses["my_wh"]
my_wh_res.create_or_alter(my_wh)
```

In this case, it changes the `my_wh` warehouse’s size to `LARGE` and its
auto-suspend setting to `1800` if you previously created it with different properties.

## Listing warehouses

You can list warehouses using the `WarehouseCollection.iter` method, which returns a `PagedIter` iterator of
`Warehouse` objects.

Code in the following example lists warehouses whose name includes the text *my* and prints the name of each:

```python
from snowflake.core.warehouse import WarehouseCollection

warehouses: WarehouseCollection = root.warehouses
wh_iter = warehouses.iter(like="my%")  # returns a PagedIter[Warehouse]
for wh_obj in wh_iter:
  print(wh_obj.name)
```

## Performing warehouse operations

You can perform common warehouse operations—such as suspending and resuming warehouses and aborting all queries on warehouses—with a
`WarehouseResource` object.

Code in the following example suspends and resumes the `my_wh` warehouse, aborts all running or queued queries on the warehouse, and
then drops the warehouse:

```python
my_wh_res = root.warehouses["my_wh"]

my_wh_res.suspend()
my_wh_res.resume()
my_wh_res.abort_all_queries()
my_wh_res.drop()
```

---
title: Managing Snowpark Container Services (including service functions) with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-containers.md
section: Developer Guide
---

# Managing Snowpark Container Services (including service functions) with Python

You can use Python to manage Snowpark Container Services, a fully managed container service through which you can deploy, manage,
and scale containerized applications. For an overview of Snowpark Container Services,
see [About Snowpark Container Services](../snowpark-container-services/overview.md).

With the Snowflake Python APIs, you can manage compute pools, image repositories, and services.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Managing compute pools

You can manage compute pools, which are collections of virtual machine (VM) nodes on which Snowflake runs your Snowpark Container Services
jobs and services.

The Snowflake Python APIs represents compute pools with two separate types:

* `ComputePool`: Exposes a compute pool’s properties, such as its warehouse, maximum and minimum nodes, and auto resume and auto
  suspend settings.
* `ComputePoolResource`: Exposes methods for performing actions on compute pools, such as fetching a corresponding
  `ComputePool` object and suspending, resuming, and stopping pools.

For more information about compute pools, see [Snowpark Container Services: Working with compute pools](../snowpark-container-services/working-with-compute-pool.md).

### Creating a compute pool

You can create a compute pool by calling the `ComputePoolCollection.create` method, passing a `ComputePool` object
that represents the compute pool you want to create.

To create a compute pool, first create a `ComputePool` object that specifies pool properties such as the following:

* Compute pool name
* Maximum and minimum number of nodes that the pool will contain
* Name of the instance family that identifies the type of machine to provision for nodes in the pool
* Whether the pool should automatically resume when a service or job is submitted to it

Code in the following example creates a `ComputePool` object that represents a pool named `my_compute_pool`:

```python
from snowflake.core.compute_pool import ComputePool

compute_pool = ComputePool(name="my_compute_pool", min_nodes=1, max_nodes=2, instance_family="CPU_X64_XS", auto_resume=False)
root.compute_pools.create(compute_pool)
```

The code then creates the compute pool by passing the `ComputePool` object to the `ComputePoolCollection.create` method.

### Getting compute pool details

You can get information about a compute pool by calling the `ComputePoolResource.fetch` method, which returns a `ComputePool`
object.

Code in the following example gets information about a pool named `my_compute_pool`:

```python
compute_pool = root.compute_pools["my_compute_pool"].fetch()
print(compute_pool.to_dict())
```

### Creating or altering a compute pool

You can set properties of a `ComputePool` object and pass it to the `ComputePoolResource.create_or_alter` method to create a
compute pool if it doesn’t exist, or alter it according to the compute pool definition if it does exist. The behavior of
`create_or_alter` is intended to be idempotent, which means that the resulting compute pool object will be the same regardless of
whether the compute pool exists before you call the method.

> **Note:**
>
> The `create_or_alter` method uses default values for any [ComputePool](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.compute_pool.ComputePool)
> properties that you don’t explicitly define. For example, if you don’t set `auto_resume`, its value defaults to `None` even if
> the compute pool previously existed with a different value.

Code in the following example updates the maximum allowed nodes of the `my_compute_pool` compute pool, and then alters the compute pool
on Snowflake:

```python
compute_pool = root.compute_pools["my_compute_pool"].fetch()
compute_pool.max_nodes = 3
compute_pool_res = root.compute_pools["my_compute_pool"].create_or_alter(compute_pool)
```

### Listing compute pools

You can list compute pools using the `iter` method, which returns a `PagedIter` iterator.

Code in the following example lists compute pools whose name begins with `my`:

```python
compute_pools = root.compute_pools.iter(like="my%")
for compute_pool in compute_pools:
  print(compute_pool.name)
```

### Performing compute pool operations

You can perform common compute pool operations—such as suspending, resuming, and stopping pools—with a `ComputePoolResource`
object, which you can get by using the `ComputePool.fetch` method.

Code in the following example suspends, resumes, and stops the `my_compute_pool` compute pool:

```python
compute_pool_res = root.compute_pools["my_compute_pool"]
compute_pool_res.suspend()
compute_pool_res.resume()
compute_pool_res.stop_all_services()
```

## Managing image repositories

You can manage image repositories, which store images for applications you run on container services.

An image repository is a schema-level object. When you create or reference a repository, you do so in the context of its schema.

The Snowflake Python APIs represents image repositories with two separate types:

* `ImageRepository`: Exposes an image repository’s properties, such as its database and schema names, repository URL, and owner.
* `ImageRepositoryResource`: Exposes methods you can use to fetch a corresponding `ImageRepository` object and to drop
  the image repository resource.

For more information about image repositories, see [Snowpark Container Services: Working with an image registry and repository](../snowpark-container-services/working-with-registry-repository.md).

### Creating an image repository

To create an image repository, first create an `ImageRepository` object that specifies the repository name.

Code in the following example creates an `ImageRepository` object that represents a repository named `my_repo`:

```python
from snowflake.core.image_repository import ImageRepository

my_repo = ImageRepository("my_repo")
root.databases["my_db"].schemas["my_schema"].image_repositories.create(my_repo)
```

The code then creates the image repository by passing the `ImageRepository` object to the `ImageRepositoryCollection.create`
method, creating the image repository in the `my_db` database and `my_schema` schema.

### Getting image repository details

You can get information about an image repository by calling the `ImageRepositoryResource.fetch` method, which returns an
`ImageRepository` object.

Code in the following example gets an `ImageRepository` object representing the `my_repo` image repository and then prints the
name of the repository’s owner:

```python
my_repo_res = root.databases["my_db"].schemas["my_schema"].image_repositories["my_repo"]
my_repo = my_repo_res.fetch()
print(my_repo.owner)
```

### Listing image repositories

You can list the image repositories in a specified schema using the `iter` method, which returns a `PagedIter` iterator
of `ImageRepository` objects.

Code in the following example lists repository names in the `my_db` database and `my_schema` schema:

```python
repo_list = root.databases["my_db"].schemas["my_schema"].image_repositories.iter()
for repo_obj in repo_list:
  print(repo_obj.name)
```

### Dropping an image repository

You can drop an image repository using the `ImageRepositoryResource.drop` method.

Code in the following example drops the `my_repo` repository:

```python
my_repo_res = root.databases["my_db"].schemas["my_schema"].image_repositories["my_repo"]
my_repo_res.drop()
```

## Managing services and service functions

You can manage services, which run application containers until you stop them. Snowflake restarts a service automatically if the service
container stops. In this way, the service effectively runs uninterrupted.

A service is a schema-level object. When you create or reference a service, you do so in the context of its schema.

The Snowflake Python APIs represents services with two separate types:

* `Service`: Exposes a service’s properties such as its specification, minimum and maximum instances, and database and schema name.
* `ServiceResource`: Exposes methods you can use to fetch a corresponding `Service` object, suspend and resume
  the service, and get its status.

For more information about services, see [Snowpark Container Services: Working with services](../snowpark-container-services/working-with-services.md).

### Creating a service

To create a service, you run the `services.create` method, passing a `Service` object representing the service you want to
create.

You create a service from a service specification `.yaml` file that has been uploaded to a stage. For more information about creating a
service specification, see [Service specification reference](../snowpark-container-services/specification-reference.md).

#### Uploading the specification

If you’re creating a service from a specification that hasn’t yet been uploaded to a stage, you can upload the specification using a
Snowpark [FileOperation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.FileOperation)
object.

Code in the following example uses the `FileOperation.put` method to upload a specification as a file:

```python
session.file.put("/local_location/my_service_spec.yaml", "@my_stage")
```

Code in the following example uses the `FileOperation.put_stream` method to upload a specification as a string:

```python
service_spec_string = """
// Specification as a string.
"""
session.file.put_stream(StringIO(sepc_in_string), "@my_stage/my_service_spec.yaml")
```

#### Creating the service

To create a service from a staged specification, first create a `Service` object that specifies service properties such as the
following:

* Service name
* Maximum and minimum number of service instances that Snowflake can create
* Compute pool to which the service should be added
* Stage location and name of the specification

Code in the following example creates a `Service` object representing a service named `my_service` from a specification in
`@my_stage/my_service_spec.yaml`:

```python
from snowflake.core.service import Service, ServiceSpec

my_service = Service(name="my_service", min_instances=1, max_instances=2, compute_pool="my_compute_pool", spec=ServiceSpec("@my_stage/my_service_spec.yaml"))
root.databases["my_db"].schemas["my_schema"].services.create(my_service)
```

The code then creates the service by passing the `Service` object to the `ServiceCollection.create` method, creating the service
in the `my_db` database and `my_schema` schema.

You can also create a service from a specification that you provide as inline text, as shown in the following example.
The `ServiceSpec` function takes a single string argument `spec`. If the string starts with `@`, the function interprets and
validates it as a stage file path. Otherwise the string is passed through as inline text.

```python
from textwrap import dedent
from snowflake.core.service import Service, ServiceSpec

spec_text = dedent(f"""\
    spec:
      containers:
      - name: hello-world
        image: repo/hello-world:latest
      endpoints:
      - name: hello-world-endpoint
        port: 8080
        public: true
    """)

my_service = Service(name="my_service", min_instances=1, max_instances=2, compute_pool="my_compute_pool", spec=ServiceSpec(spec_text))
root.databases["my_db"].schemas["my_schema"].services.create(my_service)
```

#### Creating a service function

After the service is up and running, you can create a service function that communicates with the service endpoint. A service function is a
user-defined function (UDF) that you create and associate with a service in Snowpark Container Services. For more information, see
[Service functions: Using a service from an SQL query](../snowpark-container-services/working-with-services.md).

Code in the following example creates a UDF named `my-udf` that specifies the `hello-world` service and `hello-world-endpoint`
endpoint that you previously defined:

```python
from snowflake.core import CreateMode
from snowflake.core.function import FunctionArgument, ServiceFunction

root.databases["my_db"].schemas["my_schema"].functions.create(
  ServiceFunction(
    name="my-udf",
    arguments=[
        FunctionArgument(name="input", datatype="TEXT")
    ],
    returns="TEXT",
    service="hello-world",
    endpoint="'hello-world-endpoint'",
    path="/hello-world-path",
    max_batch_rows=5,
  ),
  mode = CreateMode.or_replace
)
```

#### Invoking a service function

After the service function is created, you can then invoke the function to test it.

Code in the following example invokes the `my-udf` service function that you previously created:

```python
result = root.databases["my_db"].schemas["my_schema"].functions["my-udf(TEXT)"].execute_function(["test"])
print(result)
```

### Getting service details

You can get information about a Snowflake service by calling the `ServiceResource.fetch` method, which returns a `Service`
object.

Code in the following example gets information about a service named `my_service`:

```python
my_service = root.databases["my_db"].schemas["my_schema"].services["my_service"].fetch()
```

### Listing services

You can list the services in a specified schema using the `iter` method, which returns a `PagedIter` iterator of
`Service` objects.

Code in the following example lists services whose name begins with `my`:

```python
services = root.databases["my_db"].schemas["my_schema"].services.iter(like="my%")
for service_obj in services:
  print(service_obj.name)
```

### Performing service operations

You can perform common service operations—such as suspending, resuming, and getting the service’s containers—with a `ServiceResource`
object.

Code in the following example suspends and resumes the `my_service` service and also gets the status of the containers corresponding
to the service:

```python
my_service_res = root.databases["my_db"].schemas["my_schema"].services["my_service"]

my_service_res.suspend()
my_service_res.resume()
container_statuses = [container.status for container in my_service_res.get_containers()]
```

---
title: Managing tags with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-managing-tags.md
section: Developer Guide
---

# Managing tags with Python

You can use Python to manage tags in Snowflake. A tag is a schema-level object that can be assigned to another Snowflake object. You
associate a tag with an arbitrary string value when assigning the tag, and Snowflake stores the tag and its string value as a key-value
pair. After defining and assigning tags, you can query them to monitor usage on objects and facilitate data governance operations, such as
auditing and reporting.

For more information about tags, see [Introduction to object tagging](../../user-guide/object-tagging/introduction.md).

The Snowflake Python APIs represents tags with the following types:

* `Tag`: Represents a tag object model with properties such as its name, the database and schema it’s stored in, and when it was
  created.
* `TagValue`: Represents the value of a tag.
* `TagResource`: Represents a reference to a tag object that you can use to fetch information about the tag object and perform
  operations on the tag object, such as renaming the tag and dropping the tag.

## Prerequisites

The examples in this topic assume that you’ve added code to connect with Snowflake and to create a `Root` object from which to use the
Snowflake Python APIs.

For example, the following code uses connection parameters defined in a configuration file to create a connection to Snowflake:

```python
from snowflake.core import Root
from snowflake.snowpark import Session

session = Session.builder.config("connection_name", "myconnection").create()
root = Root(session)
```

Using the resulting `Session` object, the code creates a `Root` object to use the API’s types and methods. For more information,
see [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md).

## Create tags and manage them for a table

The following code example creates tags named `environment_tag` and `custom_tag`, assigns them to a table named `my_tagged_table`, and
fetches the tag assignments for the table. Then it unsets the `environment_tag` tag from the table and fetches the tag assignments again.

```python
from snowflake.core.tag import Tag, TagValue

table = root.databases["my_database"].schemas["my_schema"].tables["my_tagged_table"]
tags = root.databases["tag_database"].schemas["tag_schema"].tags

# Create tags
environment_tag = Tag(
    name="environment_tag",
    allowed_values=["prod", "dev", "staging"],
    comment="Tag to classify environment",
)
custom_tag = Tag(
    name="custom_tag",
)
environment_tag_resource = tags.create(environment_tag, mode="ifNotExists")
custom_tag_resource = tags.create(custom_tag, mode="ifNotExists")

# Set tags on a table
table.set_tags({environment_tag_resource: TagValue(value="prod"), custom_tag_resource: TagValue(value="custom value")})

# Fetch tag assignments for the table
fetched_tags = table.get_tags()
print(f"Tags on table: {fetched_tags}")
# Tags on table: {<TagResource: 'TAG_DATABASE.TAG_SCHEMA.CUSTOM_TAG'>: TagValue(value='custom value', level='TABLE'), <TagResource: 'TAG_DATABASE.TAG_SCHEMA.ENVIRONMENT_TAG'>: TagValue(value='prod', level='TABLE')}

# Unset one of the tags from the table
table.unset_tags({environment_tag_resource})

# Fetch tag assignments again
fetched_tags_after_unset = table.get_tags()
print(f"Tags after unset: {fetched_tags_after_unset}")
# Tags after unset: {<TagResource: 'TAG_DATABASE.TAG_SCHEMA.CUSTOM_TAG'>: TagValue(value='custom value', level='TABLE')}
```

## Manage tags for a schema and table with inheritance

The following code example assigns an existing tag named `environment_tag` to a schema named `my_schema` and fetches the tag assignments
for a table named `another_tagged_table` in the schema. Then it fetches the tag assignments for the table with inheritance
(`with_lineage=True`).

```python
from snowflake.core.tag import TagValue

schema = root.databases["my_database"].schemas["my_schema"]
table = schema.tables["another_tagged_table"]
environment_tag = root.databases["tag_database"].schemas["tag_schema"].tags["environment_tag"]

# Set tag on a schema
schema.set_tags({environment_tag: TagValue(value="prod")})

# Fetch tag assignments for the table
fetched_tags = table.get_tags()
print(f"Tags on table: {fetched_tags}")
# Tags on table: {}

# Fetch tag assignments for the table with inheritance
fetched_tags_with_lineage = table.get_tags(with_lineage=True)
print(f"Tags including inheritance: {fetched_tags_with_lineage}")
# Tags including inheritance: {<TagResource: 'TAG_DATABASE.TAG_SCHEMA.ENVIRONMENT_TAG'>: TagValue(value='prod', level='SCHEMA')}
```

---
title: Metrics limitations
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/metrics-limitations.md
section: Developer Guide
---

# Metrics limitations

* Metrics are collected from the Python and Java environments at regular 10-second intervals.
  If a UDF or stored procedure is completed before the first interval, no metrics are collected for the execution.
* Snowpark CPU and memory metrics are not supported for JavaScript stored procedures or UDFs.

---
title: Migrating from JDBC Driver 3.x to JDBC Driver 4.x
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-migration.md
section: Developer Guide
---

# Migrating from JDBC Driver 3.x to JDBC Driver 4.x

The JDBC Driver 4.x introduces several new features and improvements over the JDBC Driver 3.x. This topic provides an overview of the public API changes and new features and also provides information about how to migrate from JDBC Driver 3.x to JDBC Driver 4.x.

## Public API overview

The Snowflake JDBC driver public API is located under the `net.snowflake.client.api` package (see [API Reference](https://docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/index.html)). The changes to the public API between JDBC Driver 3.x and JDBC Driver 4.x are listed in the following table:

| Package | Description |
| --- | --- |
| [api.driver](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/driver/package-summary.html) | JDBC driver registration and entry point |
| [api.connection](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/connection/package-summary.html) | Snowflake-specific connection and database metadata interfaces, stream transfer configuration |
| [api.datasource](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/datasource/package-summary.html) | DataSource implementation for creating and managing connections |
| [api.pooling](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/pooling/package-summary.html) | Connection pool data source for applications requiring pooled connections |
| [api.resultset](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/resultset/package-summary.html) | Result set interfaces, field metadata, Snowflake data types, and async query status |
| [api.auth](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/auth/package-summary.html) | Authentication method definitions |
| [api.loader](https://staging.docs.snowflake.com/developer-guide/jdbc/reference/java/v4.0/net/snowflake/client/api/loader/package-summary.html) | Bulk data loading API for high-volume ingestion with progress callbacks |

Additionally, the driver includes classes in the `net.snowflake.client.internal` package that are not part of the public API. These classes are used internally by the driver and are not intended for use by application developers. Use the internal APIs at your own risk: They are subject to change without notice and without backward compatibility guarantees.

## Code changes from JDBC Driver 3.x to JDBC Driver 4.x

### Driver class name changes

The driver class name has changed.

| Before (3.x) | After (4.x) |
| --- | --- |
| `com.snowflake.client.jdbc.SnowflakeDriver` | `net.snowflake.client.api.driver.SnowflakeDriver` |

### Data source creation changes

`SnowflakeDataSource` and `SnowflakeConnectionPoolDataSource` are now interfaces. Use factory classes instead of direct instantiation.

| Component | Factory method |
| --- | --- |
| `SnowflakeDataSource` | `SnowflakeDataSourceFactory.createDataSource()` |
| `SnowflakeConnectionPoolDataSource` | `SnowflakeConnectionPoolDataSourceFactory.createConnectionPoolDataSource()` |

### Stream upload and download changes

The `SnowflakeConnection` interface simplified overloads for stream operations:

* Upload:

  + `uploadStream(stageName, destFileName, inputStream)`
  + `uploadStream(stageName, destFileName, inputStream, UploadStreamConfig)`
  + `UploadStreamConfig` options: `destPrefix`, `compressData` (default: `true`)
* Download:

  + `downloadStream(stageName, sourceFileName)`
  + `downloadStream(stageName, sourceFileName, DownloadStreamConfig)`
  + `DownloadStreamConfig` options: `decompress` (default: `false`)

### `SnowflakeType` changes

The `SnowflakeType` enum has been removed. Type values remain the same, but the enum is no longer supported.

### `QueryStatus` and `SnowflakeAsyncResultSet` changes

Version 4.0.0 made the following changes regarding queries and result sets:

* The `QueryStatus` enum was replaced with DTO (previously known as `QueryStatusV2`). It carries the same data, but in a thread-safe manner. To retrieve query status, unwrap your result set to `SnowflakeAsyncResultSet` and call `getStatus`.
* The `getQueryErrorMessage` on a result set is removed, but it can be retrieved directly from `getErrorMessage` on `QueryStatus`.

If you need an enum value representing the status, call `getStatus` on `QueryStatus`.

---
title: Monitoring Usage with Declarative Sharing in the Native Application Framework
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/monitoring.md
section: Developer Guide
---

# Monitoring Usage with Declarative Sharing in the Native Application Framework

As a provider, you can monitor the usage of your Declarative Native App to gain insights into how consumers are interacting with your data product.
This topic describes the views available in Snowflake to track application usage, access history, and the current state of installed applications.

## Monitor app usage

To monitor the usage of your Declarative Native Apps, you can use the following views:

* [APPLICATION_DAILY_USAGE_HISTORY view](../../sql-reference/account-usage/application_daily_usage_history.md).
* [LISTING_ACCESS_HISTORY view](../../sql-reference/data-sharing-usage/listing-access-history.md).
* [APPLICATION_STATE view](../../sql-reference/data-sharing-usage/application-state-view.md).

For more information, see [Data Sharing Usage](../../sql-reference/data-sharing-usage.md).

---
title: Naming and overloading procedures and UDFs
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-naming-conventions.md
section: Developer Guide
---

# Naming and overloading procedures and UDFs

When you create or call stored procedures or user-defined functions (UDF), you’ll need to be aware of the naming conventions that
Snowflake allows and enforces for them.

You can also overload stored procedures and UDFs, providing different signatures for a given procedure or function.

> **Note:**
>
> The length of a user-defined function’s name – the combined length of its name, return type, and the names of all of its
> parameters – must not exceed 10,000 bytes. Attempting to create a function whose name exceeds this limit will result in the following
> error message:
>
> ```output
> Function name (including parameter and return type) too long.
> ```

## Choosing a name for a procedure or UDF

Names for procedures and UDFs must conform to the rules for [Object identifiers](../sql-reference/identifiers.md).

> **Note:**
>
> Snowflake does not allow creating functions with the same name as any of the system-defined functions.

## Calling a procedure or UDF

When you create a stored procedures or UDF, you create it in a specified database and schema. Procedures and UDFs have a
fully-qualified name defined by their namespace in the form of `db.schema.procedure_or_function_name`.

The following statement uses the fully-qualified name to call a stored procedure:

```sqlexample
CALL mydatabase.myschema.myprocedure();
```

When called without their fully-qualified name, procedures and UDFs are
[resolved according to the database and schema in use for the session](../sql-reference/name-resolution.md). If
[you specified a search path](../sql-reference/name-resolution.md), that search path is used to
determine the function or procedure to call.

In contrast, many of the built-in, system-defined functions provided by Snowflake have no namespace. As a result, you can call
them from anywhere.

## Overloading procedures and functions

Snowflake supports [overloading procedures and functions](https://en.wikipedia.org/wiki/Function_overloading). In a given
schema, you can define multiple procedures or functions that have the same name but different signatures. The signatures must
differ by the number of arguments, the types of the arguments, or both.

For example, for UDFs:

```sqlexample
CREATE OR REPLACE FUNCTION myudf (number_argument NUMBER) ...
```

```sqlexample
CREATE OR REPLACE FUNCTION myudf (varchar_argument VARCHAR) ...
```

```sqlexample
CREATE OR REPLACE FUNCTION myudf (number_argument NUMBER, varchar_argument VARCHAR) ...
```

For stored procedures:

```sqlexample
CREATE OR REPLACE PROCEDURE myproc (number_argument NUMBER) ...
```

```sqlexample
CREATE OR REPLACE PROCEDURE myproc (varchar_argument VARCHAR) ...
```

```sqlexample
CREATE OR REPLACE PROCEDURE myproc (number_argument NUMBER, varchar_argument VARCHAR) ...
```

If multiple signatures use the same number of arguments but have different types of arguments, you can use different names for
the arguments to indicate which signature to use when you call the function or procedure.

```sqlexample
CREATE OR REPLACE FUNCTION echo_input (numeric_input NUMBER)
  RETURNS NUMBER
  AS 'numeric_input';
```

```sqlexample
CREATE OR REPLACE FUNCTION echo_input (varchar_input VARCHAR)
  RETURNS VARCHAR
  AS 'varchar_input';
```

```sqlexample
SELECT echo_input(numeric_input => 10);
```

```sqlexample
SELECT echo_input(varchar_input => 'hello world');
```

> **Note:**
>
> For commands other than those that call the function or procedure (e.g. executing [DESCRIBE FUNCTION](../sql-reference/sql/desc-function.md),
> [DROP PROCEDURE](../sql-reference/sql/drop-procedure.md), [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md), etc.), you must use the data types of the
> arguments to identify the signature to use.

### Calling overloaded procedures and functions

As is the case with calling any other [procedure](stored-procedure/stored-procedures-calling.md) or
[function](udf/udf-calling-sql.md), you can specify the arguments by name or by position.

```sqlexample
SELECT myudf(text_input => 'hello world');
```

```sqlexample
SELECT myudf('hello world');
```

If you omit the argument names or if you use the same argument name for arguments of different types, Snowflake uses the number of
arguments and the types of the arguments to determine the signature to use. In these cases,
[automatic type conversion (coercion)](../sql-reference/data-type-conversion.md) can affect the signature that is selected. For details,
refer to Caveat about relying on the argument data type to identify the signature to call.

### Caveat about relying on the argument data type to identify the signature to call

If you are relying on the data type of the argument (rather than the argument name) to identify the signature of the function or
procedure to call, note that the combination of automatic type conversion and overloading makes it easy for minor user errors to
cause unexpected results.

Consider the following examples, which create two SQL UDFs named `add5`:

```sqlexample
CREATE OR REPLACE FUNCTION add5 (n NUMBER)
  RETURNS NUMBER
  AS 'n + 5';

CREATE OR REPLACE FUNCTION add5 (s VARCHAR)
  RETURNS VARCHAR
  AS
  $$
    s || '5'
  $$;
```

If you call `add5` and specify a numeric argument without the argument name, then the first implementation is called. If you
specify a string-typed argument instead, the second implementation called.

If the argument is neither a number nor a string, then the implementation depends on
[Snowflake’s implicit type conversion rules](../sql-reference/data-type-conversion.md).
For example, a date-typed argument is converted to a string because conversion from DATE to NUMBER is not supported. As a result,
the string implementation is called.

For example:

```sqlexample
SELECT add5(1);
```

```output
+---------+
| ADD5(1) |
|---------|
|       6 |
+---------+
```

```sqlexample
SELECT add5('1');
```

```output
+-----------+
| ADD5('1') |
|-----------|
| 15        |
+-----------+
```

```sqlexample
SELECT add5('hello');
```

```output
+---------------+
| ADD5('HELLO') |
|---------------|
| hello5        |
+---------------+
```

```sqlexample
SELECT add5(TO_DATE('2014-01-01'));
```

```output
+-----------------------------+
| ADD5(TO_DATE('2014-01-01')) |
|-----------------------------|
| 2014-01-015                 |
+-----------------------------+
```

To avoid potential confusion, assign different argument names for different signatures, and use the argument names when calling
the function.

In the example above, the two signatures use different argument names (`n` for the NUMBER argument and `s` for the VARCHAR
argument). You can specify which signature to use by specifying the argument name:

```sqlexample
SELECT add5(n => 1);
```

```sqlexample
SELECT add5(s => '1');
```

## How the search path determines which function or procedure to call

If you [specified a search path](../sql-reference/name-resolution.md), then each schema appearing in the search path
is searched for a matching function, in the order that the schema appears in the search path. For each searched schema, Snowflake
attempts to find a matching function, using implicit type conversions if necessary. If no match is found in a schema, then the
next schema is considered. Consider again the `add5` functions, if they were defined in different schemas:

```sqlexample
USE SCHEMA s1;
CREATE OR REPLACE FUNCTION add5 ( n number)
  RETURNS number
  AS 'n + 5';
```

```sqlexample
USE SCHEMA s2;
CREATE OR REPLACE FUNCTION add5 ( s string)
  RETURNS string
  AS 's || ''5''';
```

The choice of which function to use for a numeric or string argument would depend on the search path:

```sqlexample
USE SCHEMA s3;
ALTER SESSION SET SEARCH_PATH='s1,s2';

SELECT add5(5);
```

```output
+---------+
| ADD5(5) |
+---------+
| 10      |
+---------+
```

```sqlexample
ALTER SESSION SET SEARCH_PATH='s2,s1';

SELECT add5(5);

+---------+
| ADD5(5) |
*---------+
| 55      |
+---------+
```

With the search path set to search schema `s2` first, the function in `s2` is used, even though it requires that an
implicit type conversion is applied to the argument.

---
title: Node.js Driver
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver.md
section: Developer Guide
---

# Node.js Driver

> **Note:**
>
> This driver currently does not support GCP regional endpoints. Please ensure that any workloads using through this driver do not require support for regional endpoints on GCP. If you have questions about this, please contact Snowflake Support.

Written in pure JavaScript, the Node.js driver provides a native asynchronous Node.js interface to Snowflake.

For more information about Node.js, see [nodejs.org](https://nodejs.org).

The driver supports the versions of Node.js supported by the Node.js Foundation. The driver supports the following Node.js versions:

* v18
* v20
* v22
* v24

See the [driver release timeline](https://nodejs.org/en/about/previous-releases) for more information.

The typical workflow for using the driver is:

1. Establish a connection with Snowflake.
2. Execute statements, e.g. queries and DDL/DML commands.
3. Consume the results.
4. Terminate the connection.

> **Important:**
>
> To upload and download files from a Snowflake stage, you must use the following minimum versions of the driver:
>
> * Version 1.6.2 to upload files (using the [PUT](../../sql-reference/sql/put.md) command)
> * Version 1.6.6 to download files (using the [GET](../../sql-reference/sql/get.md) command)

**Next topics:**

* [Installing the Node.js Driver](nodejs-driver-install.md)
* [Managing connections](nodejs-driver-connect.md)
* [Authenticating connections](nodejs-driver-authenticate.md)
* [Executing statements](nodejs-driver-execute.md)
* [Consuming results](nodejs-driver-consume.md)
* [Configuring log levels and files](nodejs-driver-logs.md)
* [Node.js options reference](nodejs-driver-options.md)

---
title: Node.js options reference
source: https://docs.snowflake.com/en/developer-guide/node-js/nodejs-driver-options.md
section: Developer Guide
---

# Node.js options reference

When constructing a new `Connection` object, you pass in a JavaScript object that specifies the options for the connection
(e.g. your account identifier, your user name, etc.). The following sections describe the options that you can set. To set an
option, specify the option name as the property name in the JavaScript object.

* Connection options

  + Required connection options
  + Authentication options
  + Additional connection options
* Other options

  + xmlParserConfig options
  + Certificate revocation list (CRL) options

## Required connection options

`account`
:   Your [account identifier](../../user-guide/gen-conn-config.md).

`region` (**Deprecated**)
:   The ID for the [region](../../user-guide/intro-regions.md) where your account is located.

    > **Note:**
    >
    > This option is deprecated and is included here only for backward compatibility.
    > Snowflake recommends transitioning to embedding the region in the account identifier,
    > as described in [Using an account locator as an identifier](../../user-guide/admin-account-identifier.md), such as follows.
    >
    > ```javascript
    > const connection = snowflake.createConnection({
    >   account: "myaccount.us-east-2",
    >   username: "myusername",
    >   password: "mypassword"
    > });
    > ```

In addition, you must specify the options for authenticating to the server.

## Authentication options

`application`
:   Specifies the name of the client application connecting to Snowflake.

`authenticator`
:   Specifies the authenticator to use for verifying user login credentials. You can set this to one of the following values:

    | Value | Description |
    | --- | --- |
    | `SNOWFLAKE` | Use the internal Snowflake authenticator. You must also set the `password` option. |
    | `EXTERNALBROWSER` | [Use your web browser](nodejs-driver-authenticate.md) to authenticate with Okta, AD FS, or any other SAML 2.0-compliant identity provider (IdP) that has been defined for your account. |
    | `https://<okta_account_name>.okta.com` | [Use Native SSO through Okta](nodejs-driver-authenticate.md). |
    | `OAUTH` | Use OAuth for authentication. You must also set the `token` option to the OAuth token (see below). |
    | `SNOWFLAKE_JWT` | Use key pair authentication. See [Use key-pair authentication and key-pair rotation](nodejs-driver-authenticate.md). |
    | `USERNAME_PASSWORD_MFA` | Use multi-factor authentication (MFA). See [Use an MFA passcode](nodejs-driver-authenticate.md). |
    | `OAUTH_AUTHORIZATION_CODE` | Manually authenticate using an OAuth authorization code with your web browser and a chosen identity provider (including Snowflake as an IdP). For more information, see [Use the OAuth 2.0 Authorization Code flow](nodejs-driver-authenticate.md). |
    | `OAUTH_CLIENT_CREDENTIALS` | Automatically authenticate using OAuth client credentials with your chosen identity provider (Snowflake as an IdP doesn’t support the client credentials flow). For more information, see [Use the OAuth 2.0 Client Credentials flow](nodejs-driver-authenticate.md). |
    | `PROGRAMMATIC_ACCESS_TOKEN` | Authenticate with a programmatic access token (PAT). It reads the token from the `token` or `password` options. For more information, see [Using programmatic access tokens for authentication](../../user-guide/programmatic-access-tokens.md). |
    | `WORKLOAD_IDENTITY` | Authenticate with the [workload identity federation (WIF)](../../user-guide/workload-identity-federation.md) authenticator. |

    The default value is `SNOWFLAKE`.

    For more information on authentication, see [Managing/Using federated authentication](../../user-guide/admin-security-fed-auth-use.md) and
    [Clients, drivers, and connectors](../../user-guide/oauth-intro.md).

`username`
:   The login name for your Snowflake user or your Identity Provider (e.g. your login name for Okta). Set this option if you set the `authenticator` option to `SNOWFLAKE`, `SNOWFLAKE_JWT`, or the
    [Okta URL endpoint for your Okta account](nodejs-driver-authenticate.md) (e.g. `https://<okta_account_name>.okta.com`).
    If you don’t set the `authenticator` option, you must set this value.

`password`
:   Password for the user. Set this option if you set the `authenticator` option to `SNOWFLAKE` or the
    [Okta URL endpoint for your Okta account](nodejs-driver-authenticate.md) (e.g. `https://<okta_account_name>.okta.com`)
    or if you left the `authenticator` option unset.

    If you set the `authenticator` option to `PROGRAMMATIC_ACCESS_TOKEN`, you can pass the programmatic access token in this option.

`token`
:   Specifies the OAuth token to use for authentication or programmatic access token. Set this option if you set the `authenticator` option to
    `OAUTH` or `PROGRAMMATIC_ACCESS_TOKEN`.

`privateKey`
:   Specifies the private key (in PEM format) for key pair authentication. For details, see
    [Use key-pair authentication and key-pair rotation](nodejs-driver-authenticate.md).

`privateKeyPath`
:   Specifies the local path to the private key file (e.g. `rsa_key.p8`). For details, see
    [Use key-pair authentication and key-pair rotation](nodejs-driver-authenticate.md).

`privateKeyPass`
:   Specifies the passcode to decrypt the private key file, if the file is encrypted. For details, see
    [Use key-pair authentication and key-pair rotation](nodejs-driver-authenticate.md).

`oauthClientId`
:   Value of `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

`oauthClientSecret`
:   Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

`oauthAuthorizationUrl`
:   Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.

`oauthTokenRequestUrl`
:   Identity Provider endpoint supplying the access tokens to the driver. When using Snowflake as an Identity Provider, this value is derived from the `server` or `account` parameters.

`oauthScope`
:   Scope requested in the Identity Provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

`oauthRedirectUri`
:   URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

`workloadIdentityProvider`
:   Description:
    :   Platform of the workload identity provider. Possible values include: `AWS`, `AZURE`, `GCP`, and `OIDC`.

`workloadIdentityAzureClientId`
:   The Azure Managed Identity Client ID to use when connecting to Snowflake. Applies only when `workloadIdentityProvider=AZURE`.

`workloadIdentityImpersonationPath`
:   An array of strings that provides an identity chain to use when connecting to Snowflake. Array elements are either a full service account address or a service account’s unique ID.

    Impersonation works by following each array entry in order to obtain a token that allows authorization of the next service account. Each account in the identity chain needs permissions to impersonate the next account only. The final account in the list obtains your Snowflake connection token and is used to connect to Snowflake.

    This argument is supported for AWS and Google Cloud workloads and only applies when `authenticator=WORKLOAD_IDENTITY`.

`passcode`
:   Specifies the `passcode` provided by Duo when using multi-factor authentication (MFA) for logins. For details, see [Use an MFA passcode](nodejs-driver-authenticate.md).

`passcodeInPassword`
:   Specifies whether the MFA `passcode` is embedded in the login password. If `true`, the MFA passcode is appended to the end of the `password`. Default: `false`. For details, see [Use an MFA passcode](nodejs-driver-authenticate.md).

## Additional connection options

`accessUrl`
:   Specifies a fully-qualified endpoint for connecting to Snowflake. The `accessUrl` includes the full schema and host,
    as well as an optional port number, similar to `https://myaccount.us-east-1.snowflakecomputing.com`.

    > **Note:**
    >
    > When using the `accessUrl` option, the value specified in the `account` option is not used.

`browserActionTimeout`
:   Specifies the timeout, in milliseconds, for browser activities related to SSO authentication. The default value is
    120000 (milliseconds).

`openExternalBrowserCallback`
:   Opens a browser window for SSO authentication. By default, the driver uses the npm `open` package. For example:

    ```javascript
    var connection = snowflake.createConnection({
      ...,
      openExternalBrowserCallback: () => {
        // your custom code to open browser window instead of our default implementation
      }
    });
    ```

`clientConfigFile`
:   Path to the client configuration file associated with the [easy logging](nodejs-driver-logs.md) feature.

`clientRequestMFAToken`
:   Sets whether the driver uses the MFA token in the local credential storage for authentication instead of requesting a new token from the server. Default: `false`.

`clientStoreTemporaryCredential`
:   Sets whether the driver uses the SSO token in the local credential storage for authentication instead of requesting a new token from the server. Default: `false`.

`clientSessionKeepAlive`
:   By default, client connections typically time out approximately 3-4 hours after the most recent query was executed.

    If the `clientSessionKeepAlive` option is set to `true`, the client’s connection to the server will be kept alive
    indefinitely, even if no queries are executed.

    The default setting of this option is `false`.

    If you set this option to `true`, make sure that your program explicitly disconnects from the server when your
    program has finished. Do not exit without disconnecting.

`clientSessionKeepAliveHeartbeatFrequency`
:   (Applies only when `clientSessionKeepAlive` is true)

    Sets the frequency (interval in seconds) between heartbeat messages.

    You can loosely think of a connection heartbeat message as substituting for a query and restarting the timeout
    countdown for the connection. In other words, if the connection would time out after at least 4 hours of inactivity,
    the heartbeat resets the timer so that the timeout will not occur until at least 4 hours after the most recent
    heartbeat (or query).

    The default value is 3600 seconds (one hour). The valid range of values is 900 - 3600. Because timeouts usually
    occur after at least 4 hours, a heartbeat every 1 hour is normally sufficient to keep the connection alive.
    Heartbeat intervals of less than 3600 seconds are rarely necessary or useful.

`credentialCacheDir`
:   Sets the directory in which to store the credential cache when token caching is enabled. Default: user’s `$HOME` directory.

`database`
:   The default database to use for the session after connecting.

`disableSamlUrlCheck`
:   Specifies whether to disable the validation check of a SAML response. Default: `false`.

`host`
:   Host address to which the driver should connect.

`keepAlive`
:   Specifies whether to enable keep-alive functionality on the socket immediately after receiving a new connection request.

    By default, the HTTP protocol creates a new TCP connection for every request. Enabling this parameter allows the driver to re-use connections for multiple requests
    instead of creating new connections for each request.

    The default value is `true`.

`noProxy`
:   Specifies the lists of hosts that the driver should connect to directly, bypassing the proxy
    server (e.g. `*.amazonaws.com` to bypass Amazon S3 access). For multiple hosts, separate the hostnames with a pipe
    symbol (`|`). You can also use an asterisk as a wild card. For example:

    `noProxy: "*.amazonaws.com|*.example.com"`

`proxyHost`
:   Specifies the hostname of an authenticated proxy server.

`proxyPassword`
:   Specifies the password for the user specified by `proxyUser`.

`proxyPort`
:   Specifies the port of an authenticated proxy server.

`proxyProtocol`
:   Specifies the protocol used to connect to the authenticated proxy server.
    Use this property to specify the HTTP protocol: `http` or `https`.

`proxyUser`
:   Specifies the username used to connect to an authenticated proxy server.

`queryTag`
:   The optional [QUERY_TAG](../../sql-reference/parameters.md) to use for the connection, for tagging statements.

`role`
:   The default security role to use for the session after connecting.

`schema`
:   The default schema to use for the session after connecting.

`timeout`
:   Number of milliseconds to keep the connection alive with no response. Default: 60000 (1 minute).

`warehouse`
:   The default virtual warehouse to use for the session after connecting. Used for performing queries, loading data, etc.

Some connection options assume that the specified database object (database, schema, warehouse, or role) already
exists in the system. If the specified object does not exist, a default is not set during connection.

After connecting, all of the optional connection options can also be set or overridden through the [USE <object>](../../sql-reference/sql/use.md) command.

## Configuration options

`arrayBindingThreshold`
:   Sets the maximum number of binds the driver uses in a bulk insert operation. The default value is 65280.

`cwd`
:   Current working directory to use for GET and PUt operations when it differs from the connector directory.

`representNullAsStringNull`
:   Specifies how the `fetchAsString` method returns null values.

    * `true` (enabled): Returns null values as the string, “NULL”.
    * `false` (disabled): Returns null values as `null`.

    Default: `true` (enabled)

`resultPrefetch`
:   Number of threads for clients to use to prefetch large result sets. Valid values: 1-10.

`rowMode`
:   Specifies how to return results that contain duplicate column names. Values include:

    * `array`: returns the result set as an array, including duplicate column names.
    * `object`: returns the result set as an object, omitting duplicate column names.
    * `object_with_renamed_duplicated_columns`: returns the result set as an object, while adding suffixes to duplicate names to make them unique.

    The default value is `object`.

## xmlParserConfig options

Beginning with version 1.7.0 of the driver, you can use the following `fast-xml-parser`
library configuration options to customize how the driver processes XML document attributes when querying columns
with XML content.

You can download the [fast-xml-parser](https://www.npmjs.com/package/fast-xml-parser).

By default, the Node.js driver ignores XML element attributes when returning XML data from a query. For example,
in the following XML content, the `<animal>` element includes an `id` attribute:

```xml
<exhibit name="Polar Bear Plunge">
  <animal id="000001">
    <scientificName>Ursus maritimus</scientificName>
    <englishName>Polar Bear</englishName>
    <name>Kalluk</name>
  </animal>
  <animal id="000002">
    <scientificName>Ursus maritimus</scientificName>
    <englishName>Polar Bear</englishName>
    <name>Chinook</name>
  </animal>
</exhibit>
```

By default, when the Node.js driver returns the result set, it ignores the `id` attribute and returns the following
output. Notice the attribute names and values are not included.

```output
{
  exhibit: {
    animal: [
      {
        "scientificName": "Ursus maritimus",
        "englishName": "Polar Bear",
        "name": "Kalluk",
      },
      {
        "scientificName": "Ursus maritimus",
        "englishName": "Polar Bear",
        "name": "Chinook"
      }
    ]
  }
}
```

For information about how to set these options, refer to [Parsing XML data](nodejs-driver-consume.md).

To help illustrate how the following options affect how the driver parses XML data, each option description shows how it
affects this example.

`ignoreAttributes`
:   Whether to ignore XML attributes during parsing. If you want to use the other parser options, you must set
    `ignoreAttributes: false`.

    Default: `true`

    When set to `false`, the driver returns the output as follows. Notice the `id` attribute is now
    included in the output (by default, the driver prefixes attribute names with `@_`):

    ```output
    {
        exhibit: {
          animal: [
            {
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Kalluk",
              "@_id": "000001"
            },
            {
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Chinook",
              "@_id": "000002"
            }
          ],
          "@_name": "Polar Bear Plunge"
        }
    }
    ```

`alwaysCreateTextNode`
:   Whether to create a property with the tag name and assign the value directly.

    Default: `false`

    When set to `true`, the driver returns the output as follows:

    ```output
    {
      exhibit: {
        animal: [
          {
            "scientificName": {
              "#text": "Ursus maritimus"
            },
            "englishName": {
              "#text": "Polar Bear"
            },
            "name": {
              "#text": "Kalluk"
            },
            "@_id": "000001"
          },
          {
            "scientificName": {
              "#text": "Ursus maritimus"
            },
            "englishName": {
              "#text": "Polar Bear"
            },
            "name": {
              "#text": "Chinook"
            },
            "@_id": "000002"
          }
          "@_name": "Polar Bear Plunge"
        ]
      }
    }
    ```

`attributeNamePrefix`
:   String to prepend to attribute names.

    Default: “@_”

    When set to `""` to specify no prefix for attribute names, the driver returns the output as follows:

    ```output
    {
        exhibit: {
          animal: [
            {
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Kalluk",
              "id": "000001"
            },
            {
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Chinook",
              "id": "000002"
            }
          ],
          "name": "Polar Bear Plunge"
        }
    }
    ```

`attributesGroupName`
:   Groups all attributes of a tag under a specified property name.

    Default: unset

    When set to `@@` to group all tag attributes in an element named `@@,` the driver returns the output as follows:

    ```output
    {
        exhibit: {
          "@@": {
            "@_name": "Polar Bear Plunge"
          }
          animal: [
            {
              "@@": {
                "@_id": "000001"
              },
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Kalluk"
            },
            {
              "@@": {
                "@_id": "000002"
              },
              "scientificName": "Ursus maritimus",
              "englishName": "Polar Bear",
              "name": "Chinook"
            } an
          ]
        }
    }
    ```

## Certificate revocation list (CRL) options

These options are available in driver versions 2.3.0 and later.

`certRevocationCheckMode`
:   How to treat certificate revocation. The following values are supported:

    * `ENABLED`: Enables CRLs. Connections are terminated if there are errors related to obtaining and parsing the CRL.
    * `ADVISORY`: Enables CRLs. Errors related to obtaining and parsing the CRL are reported, but no certificates are revoked and the connection is allowed.
    * `DISABLED`: Disables CRLs. Certificates can only be revoked manually.

    Default: `DISABLED`

`crlAllowCertificatesWithoutCrlURL`
:   Whether certificates without an associated CRL are accepted. Applies only when `certRevocationCheckMode` is not `DISABLED`.

    Default: `false`

`crlInMemoryCache`
:   Whether CRLs should be cached in memory. Applies only when `certRevocationCheckMode` is not `DISABLED`.

    Default: `true`

`crlOnDiskCache`
:   Whether CRLs should be cached to disk. Applies only when `certRevocationCheckMode` is not `DISABLED`.

    Default: `true`

---
title: Observability in Snowflake apps
source: https://docs.snowflake.com/en/developer-guide/builders/observability.md
section: Developer Guide
---

# Observability in Snowflake apps

Through observability built into Snowflake, you can ensure that your applications are running as efficiently as possible.
Using the practices and features described in this topic, you can make the most of observability features that show you where you
can improve your code.

## What is observability?

In an observable system, you can understand what’s happening internally through external evidence generated by the system—evidence
that includes telemetry data, alerts, and notifications.

Through the evidence of internal functioning it provides, observability makes it easier for you to troubleshoot hard-to-understand behaviors
on a production system. This is especially true in a distributed system, where evidence collected from observation provides a view of
behavior across multiple components. Rather than disrupting a production environment to diagnose issues, you can analyze the collected
data from it.

With an observable system, you can start to answer questions such as the following:

* How well is the system performing?
* Where is there latency and what’s causing it?
* Why is a particular component or process not working as it should?
* What improvements can be made?

## Observability in Snowflake

Snowflake supports a model that provides built-in observable data while also giving you ways to add more instrumentation where you need it.
While Snowflake provides support for telemetry data such as logs, metrics, and traces (which are typical of observability), it also
includes other features you can use to keep track of system usage and performance.

The following lists features you can use to receive and analyze system performance and usage.

|  |  |
| --- | --- |
| Collected telemetry data | As your application generates logs, metrics, and traces, Snowflake collects that telemetry data in an event table. Using Snowsight, you can explore the data, looking for patterns.  You can emit custom telemetry into the event table to provide contextual, domain-specific information to expedite debugging. |
| History Tables | Use the following views and their associated tables to monitor all usage in your account.   * [Query History](../../user-guide/ui-snowsight-activity.md) * [Copy History](../../user-guide/data-load-monitor.md) * [Tasks](../../user-guide/ui-snowsight-tasks.md) |
| Alerts and notifications | Alerts allow for customizable triggering conditions, actions, and a schedule, in combination with notification integrations for proactive monitoring. |
| Extensibility with third-party tools | The Snowflake [event table](../logging-tracing/event-table-setting-up.md) adopts [OpenTelemetry](https://opentelemetry.io/docs/) standards, so your Snowflake telemetry can easily be consumed by other ecosystem tools. |

## Telemetry data collected for analysis

As code in your application executes, you can have Snowflake collect data from the code that tells you about the application’s internal
state. Using this telemetry data—collected in a Snowflake event table (your account
[has one by default](../logging-tracing/event-table-setting-up.md))—you can look for bottlenecks and other opportunities to optimize.

Telemetry data must be emitted as your code executes. Snowflake emits some of this data on your code’s behalf without
you needing to instrument your code. You can use also APIs included with Snowflake to emit telemetry data from specific parts of your code.

As described below, you can analyze the collected data by querying the event table or using the visualizations that capture the data
in Snowsight.

### Types of telemetry data

To ensure that the telemetry data you collect is broadly useful, Snowflake telemetry is built on the standard [OpenTelemetry](https://opentelemetry.io/docs/)
(sometimes called OTel) framework, an incubating project of the Cloud Native Compute Foundation. Through this framework (and APIs and
tools designed for it), you can reuse collected data with tools besides Snowflake.
Through OpenTelemetry, you can instrument application code to add observability where you want it.

Snowflake event tables collect log, span, and metrics data in the OpenTelemetry data model. The following describes each type of telemetry
data collected in an event table.

|  |  |
| --- | --- |
| [Logs](../logging-tracing/logging.md) | Logs record individual operations performed by code. Each log message is generated at a discrete point during the execution of the code.  **Instrumenting code** You can log from your code using libraries standard for the language you’re using, as listed in [Logging from handler code](../logging-tracing/logging.md).  **Viewing data** You can [view log messages](../logging-tracing/logging-accessing-messages.md) for analysis either by querying the event table or looking at the visualizations provided in Snowsight.  The following image from Snowsight shows a list of collected log messages for a two-hour period in a single database. |
| [Metrics](../logging-tracing/metrics.md) | Metrics are measurements calculated over a time period. These values include CPU and memory measurements.  **Instrumenting code** Snowflake emits metrics data automatically as your code executes, so you don’t need to instrument your code.  **Viewing data** You can [view metrics data](../logging-tracing/metrics-viewing-data.md) for analysis either by querying the event table or looking at the visualizations provided in Snowsight.  The following image from Snowsight shows changes in collected metrics data for the execution of a user-defined function. |
| [Traces](../logging-tracing/tracing.md) | Traces show distributed events as data flows through a system. In a trace, you can see where time is spent as processing flows from component to component.  You can emit trace events—both within the default span Snowflake creates or from a custom span you create—using libraries standard for the language you’re using, as listed in [Logging from handler code](../logging-tracing/logging.md).  **Instrumenting code** You can emit trace events from your code using libraries standard for the language you’re using, as listed in [Event tracing from handler code](../logging-tracing/tracing.md).  **Viewing data** You can [view trace events](../logging-tracing/tracing-accessing-events.md) for analysis either by querying the event table or looking at the visualizations provided in Snowsight.  The following image from Snowsight shows the spans resulting as a UDF executes. |

## Telemetry best practices

Use the following best practices to get the most out of observablity in Snowflake.

* Set up your environment to capture telemetry data before you need it
* Optimize procedures with query telemetry
* Cache redundant DataFrame operations
* Manage the amount of telemetry data received for UDFs
* Optimize user-defined functions with query telemetry

### Set up your environment to capture telemetry data before you need it

You can’t analyze data that you haven’t collected, so it’s best to start collecting telemetry data so you’ll have it when you need it.
As your deployment grows, your need to understand how your code is performing grows.

Use the following best practices:

* [Enable telemetry data collection](../logging-tracing/logging-tracing-enabling.md) for your Snowflake environment.

  To collect the data you’ll need, ensure that you have an active event table.
* To ensure you’re collecting telemetry data you want, [set telemetry levels](../logging-tracing/telemetry-levels.md) to
  useful thresholds.

  At first, you’ll want to set these levels to ensure that you’re collecting data. For example, set log levels to at least WARN for any
  production or business critical jobs. Over time, you might adjust these levels to meet changing needs.

  Organize your production stored procedures, UDFs, and other objects under a database or schema so you can simply enable warning logs
  for that database or schema. This saves the trouble of managing settings for separate objects.
* To generate data for troubleshooting, add [log statements](../logging-tracing/logging.md) or
  [trace events](../logging-tracing/tracing.md) to your production jobs.

  When you use [standard logging libraries](../logging-tracing/logging.md) such as Java’s SLF4J or Python’s logging libraries, Snowflake routes logs from those packages to
  your event table automatically.

  For tracing, you can use [telemetry libraries](../logging-tracing/tracing.md) included with Snowflake.
* To include in trace data parts of the handler’s processing that you want to measure, add
  [custom spans](../logging-tracing/tracing-custom-spans.md) to your stored procedure handler code.

  Along with the built-in spans from Snowflake objects, Snowflake represents custom spans you create in the trace diagram. With custom
  spans, you can capture data about arbitrary parts of your code’s processing to see how long those parts take to execute. You can also
  attach arbitrary metadata to custom spans to add descriptions to the data for troubleshooting and optimizing.

### Optimize procedures with query telemetry

In the Query Telemetry trace diagram, you’ll find data about all the spans emitted from a query.

* The horizontal axis displays duration. A span that appears longer horizontally took longer to complete than a shorter
  span.
* The vertical axis displays the call hierarchy. In that hierarchy, any span that is directly under another span is a “child” of
  the “parent” span above it.

You can use this diagram to find opportunities for optimization in stored procedures. Using what you see in the diagram as a starting
place, you can take steps to optimize your code.

For example, you might organize sequential operations so they execute in parallel using libraries like joblib. [Joblib](https://joblib.readthedocs.io/en/stable/) is a set of
tools for adding pipelining to Python code. With it, you can more easily write parallel code.

### Cache redundant DataFrame operations

When you have a chain of DataFrame operations that is used multiple times, you’ll see them in the trace diagram as a span for each
DataFrame action. Depending on the query’s complexity, this span can be quite long.

For example, in the code below the same DataFrame chain is called in multiple contexts:

```python
count = session.table(...).select().filter().join().count()

if count > 0:
  session.table(...).select().filter().join().write.save_as_table(...) # same query as the count, this will execute again
else:
  session.table(...).select().filter('other criteria').join() # nearly same query as the count
```

Using caching improves performance by caching the intermediate DataFrame as a temporary table, reducing redundant queries:

```python
cached_df = session.table(...).select().filter().join().cache_result()
count = cached_df.count()

if count > 0:
  cached_df.write.save_as_table() # reuses the cached DF
else:
  cached_df
```

### Manage the amount of telemetry data received for UDFs

When adding code to collect telemetry data with UDFs, remember that the UDF execution model can mean many more rows in the event table
than for a procedure.

When a UDF is called on every input row, your handler code emits logging statements or span events for every row of the input dataset.
For example, a dataset of 10 million rows passed to a UDF would emit 10 million log entries.

Consider using the following patterns when adding logs and span events to UDFs:

* Initially, use [logging levels](../logging-tracing/telemetry-levels.md) designed to reduce the number of entries recorded.

  Use DEBUG- or INFO-level logging statements and set the logging level to WARN in production. If an issue is found, you can lower the
  logging level to DEBUG or INFO for the duration of the debugging session.
* Use try/catch blocks to isolate the code from which you want to emit logging data.

  Using try/catch can be useful to catch any unexpected UDF input, log it as a WARN-level log for awareness, and return a default value.
* Use condition statements to log only for scenarios that are meaningful to you.

  With if/else statements or other constraints, you can control the volume of logging output.

### Optimize user-defined functions with query telemetry

When a UDF is called, Snowflake executes it in parallel by creating an instance of the handler code for each input row. You’ll see each of
these instances represented as its own span in a trace diagram.

You can use these spans to troubleshoot slow queries and find opportunities to improve performance. For example, you might see scenarios
such as the following:

* One or more instances of your UDF code might receive a row with data that is significantly larger or otherwise unlike the rest of your
  data. When this happens, that instance might take much longer to complete, and therefore its span is much longer.
* Depending on your query’s input partitioning and preceding clauses, a minority of the instances might receive an outsized amount of
  input data.

The following image shows a span for each row passed to a UDF, where one span’s longer duration suggests that the row might have larger
data than the others.

## Alerts and notifications for time-sensitive response

You can use Snowflake alerts and notifications to have your system reveal what’s going on inside, then take action or send information
about system state. Unlike telemetry data, which you collect and analyze later, alerts and notifications are useful when you want an
immediate response to what’s happening in the system.

* With an [alert](../../user-guide/alerts.md), you can specify a condition, action, and schedule, then specify that the action should take
  place when the condition and schedule details are met.

  For example, you might use an alert to monitor complex conditions that you specify in SQL. The most common action after an alert
  condition is met is to send a notification. Snowflake supports sending notifications to email, cloud service provider queues, Slack,
  PagerDuty, and Microsoft Teams.
* With a [notification](../../user-guide/notifications/about-notifications.md), you can use included stored procedures to send messages to
  destinations such as [email addresses](../../user-guide/notifications/email-notifications.md),
  [webhooks](../../user-guide/notifications/webhook-notifications.md) (for client tool integrations such as a chat tool), or to
  [a queue hosted by a cloud service](../../user-guide/notifications/queue-notifications.md).

### Alerts and notifications best practices

Use the following practices to improve observability by refining and increasing the amount of information you receive from the system.

* Avoid duplicating event evaluation.

  You can avoid duplicating evaluation on events by accounting for the latency between the alert schedule and execution. To do this,
  specify alert timestamps using [SCHEDULED_TIME](../../sql-reference/functions/scheduled_time.md) and [LAST_SUCCESSFUL_SCHEDULED_TIME](../../sql-reference/functions/last_successful_scheduled_time.md)
  instead of using [CURRENT_TIMESTAMP](../../sql-reference/functions/current_timestamp.md).

  For more information, see [Specifying timestamps based on alert schedules](../../user-guide/alerts.md).
* Enrich an alert action or notification with query results.

  You can check the results from the SQL statement specified by an alert condition. To obtain the query results, do the following:

  1. Retrieve the query ID for the alert condition’s SQL statement by calling [GET_CONDITION_QUERY_UUID](../../sql-reference/functions/get_condition_query_uuid.md).
  2. Pass the query ID to [RESULT_SCAN](../../sql-reference/functions/result_scan.md) to obtain the query results.
* Log a result or take automated action in addition to sending a notification.

  You can specify that an alert action [runs a task](../../user-guide/tasks-intro.md) or logs a new row to a table whenever an alert
  condition is met. For example, you might do this if you’ll take an action in Snowflake each time the alert condition is met.

  If you intend to perform a complex action after a condition is met, ensure that your warehouse is an appropriate size.

## Tools for analysis and visualization

You can use the telemetry data collected in your event table with other tools that support the OpenTelemetry data model.

Through Snowflake support of OpenTelemetry, you can use APIs, SDKs, and other tools to instrument, generate, collect, and export telemetry
data. Using these tools, you can more thoroughly analyze software performance and behavior. Because a Snowflake event table uses this
widely-adopted standard, you might also be able to integrate your organization’s observability tools with event tables with little overhead.

Consider integrating your external tools in one of the following ways:

* If your observability tools can read from external sources, point them to the event table.
* If your tools use a push model—in which telemetry data must be sent to the tool—consider using a stored procedure with
  [external access](../external-network-access/external-network-access-overview.md) to regularly read telemetry data from
  the event table and emit it to your tool.

The following lists tools you might integrate with Snowflake event tables:

* [Snowflake integration for Datadog](https://docs.datadoghq.com/integrations/snowflake_web/)
* Snowflake integration for Grafana dashboard

  + [Snowflake data source for Grafana](https://grafana.com/docs/plugins/grafana-snowflake-datasource/latest/)
  + [Snowflake integration for Grafana Cloud](https://grafana.com/docs/grafana-cloud/monitor-infrastructure/integrations/integration-reference/integration-snowflake/)

  For an introduction to using Grafana with Snowflake, see [How to monitor Snowflake with Grafana Cloud](https://grafana.com/blog/2023/05/24/how-to-monitor-snowflake-with-grafana-cloud/).
* [Observe for Snowflake](https://app.snowflake.com/marketplace/listing/GZTYZY3AR0U/observe-inc-observe-for-snowflake), Observe’s native app for observability

---
title: ODBC configuration and connection parameters
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-parameters.md
section: Developer Guide
---

# ODBC configuration and connection parameters

The Snowflake ODBC driver utilizes both configuration and connection parameters. The methods for setting the parameters are different depending on the environment in which the driver is installed.

> **Note:**
>
> You cannot set the [SEARCH_PATH](../../sql-reference/parameters.md) parameter within an ODBC client connection string. You must
> establish a session before setting a search path.

## Setting parameters in Windows

In Windows:

* Configuration parameters are set in the Windows registry using regedit
  and the following registry path:

  ```none
  [HKEY_LOCAL_MACHINE\SOFTWARE\Snowflake\Driver]
  ```
* Connection parameters are set in Data Source Names (DSNs):

  + DSNs are typically created and edited using the Windows Data Source Administration tool.
  + If you wish, the registry keys for DSNs can be edited directly in the Windows registry using regedit. The registry path to the keys is different depending on whether you’re using 64-bit and
    32-bit Windows and whether you’re editing a user or system DSN:

    - 64-bit Windows:

      ```none
      [HKEY_CURRENT_USER\SOFTWARE\ODBC\ODBC.INI\<DSN_NAME>]

      [HKEY_LOCAL_MACHINE\SOFTWARE\ODBC\ODBC.INI\<DSN_NAME>]
      ```
    - 32-bit Windows:

      ```none
      [HKEY_CURRENT_USER\SOFTWARE\WOW6432NODE\ODBC\ODBC.INI\<DSN_NAME>]

      [HKEY_LOCAL_MACHINE\SOFTWARE\WOW6432NODE\ODBC\ODBC.INI\<DSN_NAME>]
      ```

    To add a connection parameter using regedit, add a new String Value, double-click on the value you created, then enter the ODBC parameter as the Value name and the parameter value
    as the Value data.

## Setting parameters in macOS or Linux

In macOS or Linux:

* Configuration parameters are set in the configuration file
  (`simba.snowflake.ini`).
* Connection parameters are set in the data source name (DSN) file (`odbc.ini`).

## Configuration parameters

`CABundleFile`
:   Set the location of the Certificate Authority (CA) bundle file. Must reference a file that includes a valid list of CA certificates.

    For Linux, the RPM and DEB installers automatically copy the file and set this parameter.

    For Mac, the PKG installer copies the file and sets this parameter.

    For Windows, the MSI installer copies the file and sets this parameter.

    A manual installation requires you to download the file from <https://curl.haxx.se/docs/caextract.html> and set the location of the file.

`client_config_file`
:   Specifies the path of a logging configuration file that you can use to define the logging level and directory for saving log files.

`CURLVerboseMode`
:   Set to `true` to enable cURL verbose logging. The log file `snowflake_odbc_curl.dmp` is created and updated. The Snowflake ODBC driver uses cURL as the HTTP and TLS library. This parameter
    is useful for diagnosing network issues.

`DisableOCSPCheck`
:   Set to `true` to disable the TLS certificate revocation status check by the Online Certificate Status Protocol (OCSP). In normal circumstances, this flag should not set. But if the OCSP
    availability problem persists, the application might temporarily set this parameter in order to unblock connectivity issues and remove it when the OCSP availability problem is addressed.

`DisableTelemetry`
:   Specifies whether toggling the in-band telemetry handler is enabled or not. If this driver configuration setting is set to `true`, the telemetry handler is not created in the driver.

`DriverManagerOverride`
:   By default, the driver auto-detects which driver manager to use. However, if your specific situation calls for it, starting from ODBC driver version 3.9.0, you can override this auto-detection and manually specify which driver manager to use.

    Possible values are: UnixODBC and iODBC.

    If `DriverManagerOverride` is not specified, the driver uses auto-detection for the driver manage (call backtrace()) to get driver manager information. This is the default behavior.

    The parameter works only on Linux and MacOS.

`EnableAutoIpdByDefault`
:   Set to `false` to configure the ODBC Driver to set SQL_ATTR_ENABLE_AUTO_IPD to `false` (which is the default value in the
    ODBC standard).

    Otherwise, by default, the ODBC Driver sets SQL_ATTR_ENABLE_AUTO_IPD to true for compatibility with third-party tools.

    This parameter was introduced in version 2.22.0 of the ODBC Driver.

`EnablePidLogFileNames`
:   Set to `true` to include the process ID in the name of the log file. For example, if the process ID is 7394, the log files
    will be named:

    * `snowflake_odbc_connection_7394_0.log`
    * `snowflake_odbc_generic7394_0.log`
    * `snowflake_odbc_curl_7394.dmp`

    You can set this parameter to prevent different processes from overwriting the same log files. Each process will generate its
    own set of log files.

    By default, the value of this parameter is `false`.

    This parameter was introduced in version 2.22.2 of the ODBC Driver.

`get_size_threshold`
:   Specifies the minimum file size, in megabytes (MB), to break files into smaller parts when downloading files with the [GET](../../sql-reference/sql/get.md) command.
    Files with sizes smaller than this threshold will not use multi-part downloading.

    Default is **5** (MB).

    > **Note:**
    >
    > You can override this value for specific cases by setting the corresponding get_size_threshold connection parameter.

`KeepLeadingTrailingZeros`
:   Determines how leading or trailing zeros in numbers formatted as string values are handled. By default, the parameter is set to `true`,
    which means the driver retains any leading or trailing zeros. Set the parameter to `false` to remove leading or trailing zeros, for example:

    * `0.23` is changed to `.23`
    * `7.00` is changed to `7`

`LogFileCount`
:   Sets the maximum number of log files to keep before rotating older files to make room for new log files.

`LogFileSize`
:   Specifies the maximum size, in bytes, of a log file. When a log file reaches the specified size,
    the ODBC driver automatically creates a new log file.

    Default is **20971520**.

`LogLevel`
:   Specifies the level of detail logged for clients that use the ODBC driver:

    * 0 = Off
    * 1 = Fatal
    * 2 = Error
    * 3 = Warning
    * 4 = Info
    * 5 = Debug
    * 6 = Trace

`LogPath`
:   Specifies the location of the Snowflake log files for clients that use the ODBC
    driver.

`MapToLongVarchar`
:   Specifies the length of a string at which to begin mapping string values to an ODBC `SQL_LONGVARCHAR` data type
    instead of the default ODBC `SQL_CHAR` or `SQL_VARCHAR` data types.

    * < 0 (or unset): Maps string values in their default ODBC data types. Default = **-1**.
    * >= 0: Specifies the maximum number of string characters to map to default ODBC string data types.
      All strings larger than this value are mapped to `SQL_LONGVARCHAR`.

    You can also specify this parameter as a connection parameter. (See the instructions for setting the
    parameters in Windows,
    macOS and Linux.)
    If set both as a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

    This parameter was introduced in version 2.24.3 of the ODBC Driver.

`NoExecuteInSQLPrepare`
:   Set to `true` to configure the ODBC Driver to use the standard ODBC behavior when passing DDL statements (such as
    CREATE and DROP) to `SQLPrepare()` and `SQLExecute()`.

    In Snowflake, by default, when you pass a DDL statement to `SQLPrepare()`, the ODBC Driver sends the statement to the
    data source for execution (not preparation). When you pass a DDL statement to `SQLExecute()`, the ODBC Driver does not
    send the statement to the data source.

    If you set `NoExecuteInSQLPrepare` to `true`, the ODBC Driver follows the standard ODBC behavior. Calling
    `SQLPrepare()` sends the statement to the data source for preparation (not execution). Calling `SQLExecute()`
    sends the statement to the data source for execution.

    This parameter was introduced in version 2.21.6 of the ODBC Driver.

`NoProxy`
:   Specifies the hostname patterns to bypass the proxy server (e.g. `.amazonaws.com` to bypass Amazon S3 access).

    > **Note:**
    >
    > The Snowflake ODBC driver passes the `NoProxy` value to the curl option `CURLOPT_NOPROXY`.
    >
    > The format of the `NoProxy` value can be found [CURLOPT_NOPROXY explained”](https://curl.haxx.se/libcurl/c/CURLOPT_NOPROXY.html).

`Proxy`
:   Specifies a proxy server in the form of `<host>:<port>` for clients that use the ODBC driver.

    > **Note:**
    >
    > In Windows, entries for `LogLevel` and `LogPath` are created and populated with default values when the ODBC
    > driver is installed; however, an entry for `Proxy` is not created during install. To specify a proxy to use with the driver,
    > you must manually add the entry to the driver registry key.

    To bypass the proxy for one or more IP addresses or URLs, add the NoProxy parameter.

`SSLVersion`
:   Specifies the minimum SSL/TLS version to use when initiating TLS handshake. The values correspond to libcurl’s capabilities. For more information, see `CURL_SSLVERSION_*` entries in [CURLOPT_SSLVERSION explained](https://curl.se/libcurl/c/CURLOPT_SSLVERSION.html).

    Possible values: one of TLSv1, SSLv2, SSLv3, TLSv1_0, TLSv1_1, TLSv1_2, TLSv1_3 (default: TLSv1_2).

    Snowflake recommends leaving this setting at its default when you don’t have a very specific need to change it.

`SSLVersionMax`
:   Specifies the maximum SSL/TLS version to use when initiating TLS handshake. The values correspond to libcurl’s capabilities. For more information, see `CURL_SSLVERSION_MAX_*` entries in [CURLOPT_SSLVERSION explained](https://curl.se/libcurl/c/CURLOPT_SSLVERSION.html).

    Possible values: one of TLSv1_0, TLSv1_1, TLSv1_2, TLSv1_3 (default: TLSv1_3).

    Snowflake recommends leaving this setting at its default when you don’t have a very specific need to change it.

## Connection parameters

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

### Required connection parameters

`<name>` (Data Source)
:   Specifies the name of your DSN.

`port` (Port)
:   Specifies the port on which the driver listens for Snowflake communication.

    > **Note:**
    >
    > You do not need to change the default `Port` value of `443`.

`pwd` (Password)
:   A password is required to connect to Snowflake; however, for security and authentication reasons, Snowflake strongly discourages storing password credentials directly within any DSN definition.

    Typically, the credentials are passed to the driver programmatically by the client application that is attempting to connect to Snowflake.

    > **Note:**
    >
    > In Windows, the ODBC driver displays a Password field in the Data Source Administration tool; however, the driver does not store any values entered in the field. Instead, the driver
    > requires login credentials to be provided at connection time.

`server` (Server)
:   Specifies the *hostname* for your account in the following format:

    `account_identifier.snowflakecomputing.com`

    To determine the account identifier to use, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

`uid` (User)
:   Specifies the login name of the Snowflake user to authenticate.

### Optional connection parameters

`BROWSER_RESPONSE_TIMEOUT`
:   Specifies the number of seconds to wait for an authentication response in an external browser.

    Default is 120.

`CLIENT_OUT_OF_BAND_TELEMETRY_ENABLED=<Boolean>`
:   Specifies whether to enable out-of-band telemetry.

    Default is `true`

`CLIENT_SESSION_KEEP_ALIVE=<Boolean>`
:   Specifies whether to keep the current session active after a period of inactivity or to force the user to log in again.
    If the value is `true`, Snowflake keeps the session active indefinitely,
    even if there is no activity from the user. If the value is `false`, the user must log in again after four hours of inactivity.

    Default is `false`.

`database` (Database)
:   Specifies the default database to use for sessions initiated by the driver.

`disableSamlUrlCheck`
:   Specifies whether to disable verification for SAML URLs. By default, the driver verifies SAML URLs.

    Default is `false`.

`maxHttpRetries` (Database)
:   Specifies the maximum number of HTTP retries for queries with failed HTTP requests before returning an error. Setting `maxHttpRetries=0` removes the retry limit, but doing so runs the risk of the driver infinitely retrying failed HTTP calls.

    Default value is 7.

`role` (Role)
:   Specifies the default role to use for sessions initiated by the driver. The specified role should be a role that has been assigned to the specified user for the driver. If the specified role does not
    match any of the roles assigned to the user, sessions initiated by the driver have no role initially; however, a role can always be specified from within the session.

`schema` (Schema)
:   Specifies the default schema to use for sessions initiated by the driver.

    Default is `public`.

`SecondaryRoles` (Role)
:   Specifies the secondary roles to use for sessions initiated by the driver.
    The roles must already be granted to the specified user for the driver.
    Secondary roles can also be activated from within a user session using the `USE SECONDARY ROLES` command.

    Possible values include:

    * **All**: All roles granted to the user.
    * **None**: No roles allowed (disables secondary roles).

`token_file_path` (Token File Path)
:   Specifies the path to the file that contains the SPCS token to use when logging in to an SPCS container.

`tracing` (Tracing)
:   The level of detail to be logged in the driver trace files:

    * 0 = Disable tracing
    * 1 = Fatal only error tracing
    * 2 = Error tracing
    * 3 = Warning tracing
    * 4 = Info tracing
    * 5 = Debug tracing
    * 6 = Detailed tracing

`warehouse` (Warehouse)
:   Specifies the default warehouse to use for sessions initiated by the driver.

### Certificate revocation list (CRL) options

These options are available in driver versions 3.13.0 and later.

`CRL_CHECK`
:   Specifies whether to enable or disable CRL checking. When set to `true`, the driver checks the CRL to verify the server certificate has not been revoked. The connection fails if the server’s certificate is revoked or another revocation check issue (such as downloading or parsing) occurs.

    Default is `false`.

`CRL_ADVISORY`
:   Modifies the CRL connection checking to fail only when the certificate is revoked explicitly. When any other problem (such as parsing errors or download errors) is present, the connection is allowed.

    Default is `false`.

`CRL_ALLOW_NO_CRL`
:   Specifies whether to allow connections when no CRL is found. When set to `true`, the driver allows connections when no CRL is found.

    Default is `false`.

`CRL_DISK_CACHING`
:   Specifies whether to enable or disable disk caching of CRLs. When set to `true`, the driver caches CRLs on disk, which reduces the time spent re-downloading the certificate distribution lists.

    The driver stores the cached CRLs in the following directories:

    * MacOS: `$HOME/Library/Caches/Snowflake/crls`
    * Linux: `$HOME/.cache/snowflake/crls`
    * Windows: `%LOCALAPPDATA%SnowflakeCachescrls`

    Default is `true`.

    You can override the default disk cache location by setting the `SF_CRL_RESPONSE_CACHE_DIR` environment variable.

`CRL_MEMORY_CACHING`
:   Specifies whether to enable or disable memory caching of CRLs. When set to `true`, the driver caches CRLs in memory.

    Default is `true`.

`CRL_DOWNLOAD_TIMEOUT`
:   Specifies the timeout, in seconds, for downloading CRLs. If the download does not complete within the specified time, the connection fails.

    Default is **120** (seconds).

### Additional connection parameters

> **Note:**
>
> In Windows, these additional connection parameters can be set in the Windows Registry (by using regedit).
>
> In macOS or Linux, they are set in the `odbc.ini` file, similar to the rest of the connection parameters.

`allowEmptyProxy`
:   Specifies whether to allow empty values for the proxy
    and no_proxy connection parameters, as described in the following sections:

    * Using connection parameters
    * Using configuration parameters
    * Using environment variables

    Setting this value produces the following effects:

    > * `true`: The driver treats empty proxy values as valid proxy settings and overrides any existing settings or environment variable.
    > * `false`: The driver ignores empty proxy values and uses the specified configuration parameters or environment variable.

    Default is `true`.

`application`
:   Snowflake partner use only: Specifies the name of a partner application to connect through ODBC.

    This parameter can also be set by calling the `SQLSetConnectAttr()` function. For more details, see
    [Snowflake-specific behavior of the SQLSetConnectAttr function](odbc-api.md).

`authenticator`
:   Specifies the authenticator to use for verifying user login credentials:

    > * `snowflake` (Default) to use the internal Snowflake authenticator.
    > * `externalbrowser` to [use your web browser](../../user-guide/admin-security-fed-auth-use.md) to authenticate with Okta, AD FS, or any other
    >   SAML 2.0-compliant identity provider (IdP) that has been defined for your account.
    >
    >   > **Note:**
    >   >
    >   > The Snowflake ODBC driver does not support `externalbrowser` authentication using Microsoft Excel with MacOS.
    > * `https://<okta_account_name>.okta.com` (i.e. the URL endpoint for your Okta account) to [authenticate through native Okta](../../user-guide/admin-security-fed-auth-use.md) (only supported if your IdP is Okta).
    > * `oauth` to authenticate using OAuth. When OAuth is specified as the authenticator, you must also set the `token` parameter to specify the OAuth token (see below).
    > * `username_password_mfa` to authenticate with MFA token caching. For more details, see Using Multi-Factor Authentication (in this topic).
    > * `oauth_authorization_code` Manually authenticate using an OAuth authorization code with your web browser and a chosen identity provider (including Snowflake as an IdP). For more information, see Using the OAuth 2.0 Authorization Code flow.
    > * `oauth_client_credentials` Automatically authenticate using OAuth client credentials with your chosen identity provider (Snowflake as an IdP doesn’t support the client credentials flow). For more information, see Using the OAuth 2.0 Client Credentials flow.
    > * `programmatic_access_token` to authenticate with a programmatic access token (PAT).
    > * `workload_identity` to authenticate with the [workload identity federation (WIF)](../../user-guide/workload-identity-federation.md) authenticator.

    Default is `snowflake`.

    On Windows, you can use the [ODBC Data Source Administration Tool](odbc-windows.md) to set this parameter.

    For more information on authentication, see [Managing/Using federated authentication](../../user-guide/admin-security-fed-auth-use.md) and [Clients, drivers, and connectors](../../user-guide/oauth-intro.md).

`singleAuthenticationPrompt`
:   Specifies whether to prompt for authentication when a single authentication is required. When enabled, concurrent connections wait for the initial authentication process to complete and reuse the retrieved token instead of prompting for authentication again.

    Default: `true`.

`default_binary_size`, . `default_varchar_size`
:   Specifies the default size, in bytes, that the driver uses when retrieving and converting values from BINARY or VARCHAR columns of
    undetermined sizes. Set this when retrieving values from these types of columns.

    By default, the driver uses `67108864` (for BINARY columns) and `134217728` (for VARCHAR columns) as the default sizes when
    allocating memory for retrieving the value of a column of undetermined size.

    To reduce the amount of memory allocated for these values, you can set `default_binary_size` and
    `default_varchar_size` to the maximum size of the values in these types of columns.

    > **Note:**
    >
    > Setting these values only changes the `SQL_DESC_LENGTH` field in Implementation Row Descriptor (IRD) and the
    > corresponding values returned from `SQLDescribeCol/SQLColAttribute/SQLColAttributes`. The driver still returns the
    > entire data even when it’s length exceeds the setting.
    >
    > However, an application could allocate a data buffer based on the length
    > specified in these parameters that could truncate the data because of insufficient space in the buffer. As the best practice,
    > Snowflake recommends setting the default size larger than the maximum size of the typical data (for example, 4000 or 8000 bytes)
    > to reduce the memory usage significantly from the original default values of 134217728/67108864 bytes and to minimize
    > the chance of data truncation.

    You can also use these settings to avoid the following error, which can occur when using the Microsoft OLE DB
    provider (MSDASQL) with a Snowflake database:

    ```none
    Requested conversion is not supported
    Cannot get the current row value of column
    ```

    You can specify these parameters as connection
    configuration parameters (for example, in the `simba.snowflake.ini` on
    macOS and Linux). If this is set as both a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

    These parameters were introduced in version 2.23.2 of the ODBC Driver.

`get_size_threshold`
:   Specifies the minimum file size, in megabytes (MB), to break files into smaller parts when downloading files with the [GET](../../sql-reference/sql/get.md) command.
    Files with sizes smaller than this threshold will not use multi-part downloading.

    Default is **5** (MB).

    > **Note:**
    >
    > Setting this value as a connection parameter overrides the value of the corresponding get_size_threshold configuration parameter.

`login_timeout`
:   Specifies how long, in seconds, to wait for a response when connecting to the Snowflake service before returning a login failure error.

    Default is **300** (seconds).

`network_timeout`
:   Specifies how long, in seconds, to wait for a response when interacting with the Snowflake service before returning an error. Zero (0) indicates no network timeout is set.

    Default is **0** (seconds).

`retryTimeout`
:   Specifies how long, in seconds, to wait before returning an error for HTTP retries from queries with failed HTTP requests. Zero (0) indicates no retry timeout is set.

    Default is **300** (seconds).

`no_proxy`
:   Specifies which hostname endings should be allowed to bypass the proxy server (e.g. `no_proxy=.amazonaws.com` means that Amazon S3 access does not need to go through the proxy).

    This parameter does not support wildcards. Each value specified should be one of the following:

    * The end of a hostname (or a complete hostname), for example:

      > + .amazonaws.com
      > + myorganization-myaccount.snowflakecomputing.com
    * An IP address, for example:

      > + 192.196.1.15

    If more than one value is specified, values should be separated by commas, for example:

    ```none
    no_proxy=localhost,.example.com,myorganization-myaccount.snowflakecomputing.com,192.168.1.15,192.168.1.16
    ```

    > **Note:**
    >
    > This parameter is applied to the process. If another connection shares the same process, the proxy setting must be identical or the behavior is not predictable.

`odbc_use_standard_timestamp_columnsize`
:   This boolean parameter affects the column size (in characters) returned for SQL_TYPE_TIMESTAMP.
    When this parameter is set to true, the driver returns 29, following the ODBC standard. When this parameter is set
    to `false`, the driver returns 35, which allows room for the timezone offset (e.g. “-08:00”).

    This value can be set via not only the odbc.ini file (Linux or macOS) or the Microsoft Windows registry, but also
    the connection string.

    Default is `false`.

`passcode`
:   Specifies the passcode to use for multi-factor authentication.

    For more information about multi-factor authentication, see [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md).

`passcodeInPassword`
:   Specifies whether the passcode for multi-factor authentication is appended to the password:

    * `on` (or `true`) specifies the passcode is appended.
    * `off` (or `false`) or any other value specifies the passcode is not appended.

    The default value is `off`.

`proxy`
:   Specifies the proxy server URL in the format `http://<hostname>:<port>/` or `<hostname>:<port_number>` so that all communications from ODBC use the proxy server.

    > **Note:**
    >
    > This parameter is applied to the process. If another connection shares the same process, the proxy setting must be identical or the behavior is not predictable.

`put_compresslv`
:   Specifies the compression rate the ODBC driver uses when transferring data with the [PUT](../../sql-reference/sql/put.md) command. This parameter overrides the default gzip
    compression level. If you do not specify `put_compresslv` the ODBC driver uses the default
    compression level.

    Valid values are `-1` to `9`. The default value is `-1` and specifies the default
    `Z_DEFAULT_COMPRESSION`.

    Values `0` through `9` specify a custom compression rate. `0` causes the ODBC driver to use a lower
    compression rate and `9` uses a higher compression rate. Using a higher compression rate results in slower data
    transfer speeds.

    You can also specify this parameter as a
    configuration parameter (for example, in the `simba.snowflake.ini` on
    macOS and Linux). If this is set as both a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

    This parameter was introduced in version 2.23.3 of the ODBC Driver.

`put_fastfail`
:   If you are using wildcard characters with the [PUT](../../sql-reference/sql/put.md) command to upload multiple files at once and you
    want the driver to stop uploading the files when an error occurs, set this parameter to `true`.

    The default value is `false`, which means that if an error occurs with one file, the driver continues uploading the rest
    of the files.

    This parameter was introduced in version 2.22.3 of the ODBC Driver.

    As of version 2.22.5 of the ODBC Driver, you can also specify this parameter as a
    configuration parameter (for example, in the `simba.snowflake.ini` on
    macOS and Linux). If this is set as both a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

`put_maxretries`
:   Specifies the number of times that the driver should retry the [PUT](../../sql-reference/sql/put.md) command if the command fails.

    The default value is **5**.

    The valid range of values for this parameter is `0` to `100`. If you specify a value outside this range, the driver
    uses the default value of `5`.

    This parameter was introduced in version 2.22.3 of the ODBC Driver.

    As of version 2.22.5 of the ODBC Driver, you can also specify this parameter as a
    configuration parameter (for example, in the `simba.snowflake.ini` on
    macOS and Linux). If this is set as both a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

`put_tempdir`
:   Specifies the temporary directory to use for [PUT](../../sql-reference/sql/put.md) command requests. The driver uses this temporary
    directory to create temporary compressed files before uploading those files to Snowflake.

    If this parameter is not set, the driver creates and uses the temporary directory `/tmp/snowflakeTmp_username`, where
    `username` is the username of the current user in the operating system.

    You can also specify this parameter as a
    configuration parameter (for example, in the `simba.snowflake.ini` on
    macOS and Linux). If this is set as both a connection parameter and
    a configuration parameter, the connection parameter in the DSN (or connection string) takes precedence.

    This parameter was introduced in version 2.23.1 of the ODBC Driver.

`token=<string>`
:   Specifies the token for OAuth or PAT authentication, where `<string>` is the token. This parameter is required only when the `authenticator=oauth` or `authenticator=programmatic_access_token` parameter is set.

    Default is none.

`query_timeout`
:   Specifies how long, in seconds, to wait for a query to complete before returning an error. Zero (0) indicates to wait indefinitely.

    Default is **0** (seconds).

`validateSessionParam`
:   Specifies how to respond when any of the following
    session connection parameters are invalid:

`enable_connection_diag`
:   Specifies whether the connector generates a connectivity diagnostic report.

    Default is `false`.

`connection_diag_log_path`
:   Specifies the absolute path where the connectivity report is stored.

    Valid only when `enable_connection_diag` is `true`.

    Example: `connection_diag_log_path=C:\\reports`

`connection_diag_allowlist_path`
:   Specifies the absolute path to a JSON file containing the output of `SYSTEM$ALLOWLIST()`
    or `SYSTEM$ALLOWLIST_PRIVATELINK()`.

    Valid only when `enable_connection_diag` is `true`.

    Example: `connection_diag_log_path=C:\\allowlist.json`

`OAUTH_CLIENT_ID`
:   Value of the `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

    Default: `LOCAL_APPLICATION`.

`OAUTH_CLIENT_SECRET`
:   Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).

    Default: `LOCAL_APPLICATION`.

`OAUTH_AUTHORIZATION_URL`
:   Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.

`OAUTH_TOKEN_REQUEST_URL`
:   Identity provider endpoint supplying the access tokens to the driver. When using Snowflake as an identity provider, this value is derived from the `server` or `account` parameters.

`OAUTH_SCOPE`
:   Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

`OAUTH_REDIRECT_URI`
:   URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

`WORKLOAD_IDENTITY_PROVIDER`
:   Platform of the workload identity provider. Possible values include: `AWS`, `AZURE`, `GCP`, and `OIDC`.

`WORKLOAD_IDENTITY_IMPERSONATION_PATH`
:   An array of strings that provides an identity chain to use when connecting to Snowflake. Array elements are either a full service account address or a service account’s unique ID.

    Impersonation works by following each array entry to obtain a token that allows authorization of the next service account. Each account in the identity chain needs permissions to impersonate the next account only. The final account in the list obtains your Snowflake connection token and is used to connect to Snowflake.

    This argument is supported for AWS and Google Cloud workloads and only applies when `authenticator=WORKLOAD_IDENTITY`.

`PRIV_KEY_BASE64`
:   Base64-encoded private key.

`PRIV_KEY_PWD`
:   Base64-encoded private key password.

## Connecting using the `connections.toml` file

The ODBC driver lets you add connection definitions to a `connections.toml` configuration file.
A connection definition refers to a collection of connection-related parameters. The driver currently supports TOML version 1.0.0.

For more information about `toml` file formats, see [TOML (Tom’s Obvious Minimal Language)](https://toml.io/en/).

The connection string containing only the `Driver` parameter tells the driver to look for the connection configuration within the predefined (default) files.
The ODBC driver looks for the `connections.toml` file in the following locations, in order:

* If a `~/.snowflake` directory exists on your machine, ODBC uses the
  `~/.snowflake/connections.toml` file.
* Location specified in the `SNOWFLAKE_HOME` environment variable.
* Otherwise, ODBC uses the `connections.toml` file in the one of the following locations, based on your operating system:

  > + Linux: `~/.config/snowflake/connections.toml`, but you can update it with XDG vars
  > + Windows: `%USERPROFILE%\AppData\Local\snowflake\connections.toml`
  > + Mac: `~/Library/Application Support/snowflake/connections.toml`

You can generate the basic settings for the TOML configuration file in Snowsight. For information, see
[Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

If you want to switch between multiple existing connections, you can configure them in the `connections.toml` file. The default key is `default`, but you change the name of the default connection by setting the `SNOWFLAKE_DEFAULT_CONNECTION_NAME` shell environment variable.

The following sample `connections.toml` files defines two connections:

```toml
[default]
account = 'my_organization-my_account'
user = 'test_user'
password = '******'
warehouse = 'testw'
database = 'test_db'
schema = 'test_odbc'
protocol = 'https'
port = '443'

[aws-oauth-file]
account = 'my_organization-my_account'
user = 'test_user'
password = '******'
warehouse = 'testw'
database = 'test_db'
schema = 'test_odbc'
protocol = 'https'
port = '443'
authenticator = 'oauth'
token_file_path = '/Users/test/.snowflake/token'
```

## Specifying parameters in a connection string

You can specify connection parameters as name-value pairs in a connection string, using
an equals sign (`=`) between each parameter and value, and using a semicolon (`;`) between parameters. For example:

```none
driver={SnowflakeDSIIDriver};server=myorganization-myaccount.snowflakecomputing.com;uid=myloginname;pwd=mypassword;database=mydatabase;schema=myschema;warehouse=mywarehouse;role=myrole;...
```

You can generate the basic connection string in Snowsight. For information, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

## Connecting through a proxy server

The instructions for configuring a proxy server connection depend on your operating system and driver version:

| Operating System | Driver Version | Supported Instructions |
| --- | --- | --- |
| Linux | 2.16.0 (released May 3, 2018) or higher | * Using Connection Parameters * Using Environment Variables |
| 2.13.18 (released February 7, 2018) - 2.15.0 (released April 30, 2018) | Using Environment Variables |
| 2.13.17 or lower | Using Configuration Parameters |
| macOS | 2.16.0 (released May 3, 2018) or higher | * Using Connection Parameters * Using Environment Variables |
| 2.14.0 (released March 28, 2018) - 2.15.0 (released April 30, 2018) | Using Environment Variables |
| 2.13.21 or lower | Using Configuration Parameters |
| Windows | 2.16.0 (released May 3, 2018) or higher | * Using Connection Parameters * Using Environment Variables |
| 2.15.0 (released April 30, 2018) | Using Environment Variables |
| 2.14.0 or lower | Using Configuration Parameters |

> **Note:**
>
> The latest versions of ODBC driver, indicated above, support any of the listed configuration options. The options are listed
> in the order of precedence. If more than one option is defined, the setting with the highest precedence is applied.

### Using connection parameters

To connect through a proxy server, add the following connection parameters to the DSN:

* `proxy`
* `no_proxy`

For example:

> ```none
> [connection]
> Description = SnowflakeDB
> Driver      = SnowflakeDSIIDriver
> Locale      = en-US
> server      = myorganization-myaccount.snowflakecomputing.com
> proxy       = http://proxyserver.company:80
> no_proxy    = .amazonaws.com
> ```

See Connection Parameters for parameter descriptions.

### Using configuration parameters

To connect through a proxy server, add the following configuration parameters:

* `Proxy`
* `NoProxy`

See Configuration Parameters for parameter descriptions.

### Using environment variables

To connect through a proxy server, configure the following environment variables:

* `http_proxy`
* `https_proxy`
* `no_proxy`

> **Note:**
>
> These environment variables are case-sensitive for Linux and macOS, and must be set in lowercase. For Windows, the environment variables are case-insensitive.

For example:

* Linux or macOS:

  > ```bash
  > export http_proxy=http://proxyserver.example.com:80
  > export https_proxy=http://proxyserver.example.com:80
  > ```
  >
  > If the proxy server requires a user name and password, include the credentials, for example:
  >
  > ```bash
  > export https_proxy=http://username:password@proxyserver.example.com:80
  > ```
* Windows:

  > ```bash
  > set http_proxy=http://proxyserver.example.com:80
  > set https_proxy=http://proxyserver.example.com:80
  > ```
  >
  > If the proxy server requires a user name and password, include the credentials, for example:
  >
  > ```bash
  > set https_proxy=http://username:password@proxyserver.example.com:80
  > ```

Optional: To bypass the proxy for specific communications, set `no_proxy` (for example, to bypass Amazon S3 access , use `no_proxy=.amazonaws.com`).

When using a the `SPCS_TOKEN` service identifier token for SPCS containers, you can set the `SKIP_TOKEN_FILE_PERMISSIONS_VERIFICATION` parameter to `true` to bypass the permission verification for the token file.

## Using single sign-on (SSO) for authentication

If you have [configured Snowflake to use single sign-on (SSO)](../../user-guide/admin-security-fed-auth-overview.md), you can configure
your client application to use SSO for authentication. See [Using SSO with client applications that connect to Snowflake](../../user-guide/admin-security-fed-auth-use.md) for details.

## Using multi-factor authentication

Snowflake supports caching MFA tokens, including combining MFA token caching with SSO.

For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md).

## Using key-pair authentication

The ODBC driver supports key pair authentication and key rotation.

1. To start, complete the initial configuration for key pair authentication as shown in [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).
2. Modify the data source name (DSN) entries for the driver. For information about the DSN entries, see the appropriate topic for your operating system:

   * [Installing and configuring the ODBC Driver for Linux](odbc-linux.md)
   * [Installing and configuring the ODBC Driver for Windows](odbc-windows.md)

   Add the following (case-sensitive) parameters:

   `AUTHENTICATOR = SNOWFLAKE_JWT`
   :   Specifies to authenticate the Snowflake connection using key pair authentication with JSON Web Token (JWT).

   `JWT_TIME_OUT = integer`
   :   Optional. Specifies the length of time Snowflake waits to receive the JWT (in seconds) before timing out. If that happens, authentication fails and the driver returns an `Invalid JWT token` error. To resolve repeated occurrences of the error, increase the parameter value. Default: `30`

   `PRIV_KEY_FILE = path/rsa_key.p8`
   :   Specifies the local path to the private key file you created (i.e. `rsa_key.p8`).

       The value set in DSN can be overridden by calling the `SQLSetConnectAttr()` function. For more details, see
       [Snowflake-specific behavior of the SQLSetConnectAttr function](odbc-api.md).

   `PRIV_KEY_FILE_PWD = <password>`
   :   Specifies the passcode to decode the private key file.

       This parameter should be set only if the parameter PRIV_KEY_FILE is also set.

       The value set in DSN can be overridden by calling the `SQLSetConnectAttr()` function. For more details, see
       [Snowflake-specific behavior of the SQLSetConnectAttr function](odbc-api.md).
3. Save the settings.

## Using the OAuth 2.0 Authorization Code flow

The OAuth 2.0 Authorization Code flow is a secure method for a client application to obtain an access token from an authorization server on behalf of a user, without revealing the user’s credentials.

To enable the OAuth 2.0 Authorization Code flow:

1. Set the `authenticator` connection parameter to `oauth_authorization_code`.
2. Set the following OAuth connection parameters:

   > * `OAUTH_CLIENT_ID`: Value of the `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `OAUTH_CLIENT_SECRET`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `OAUTH_AUTHORIZATION_URL`: Identity provider endpoint supplying the authorization code to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `OAUTH_TOKEN_REQUEST_URL`: Identity provider endpoint supplying the access tokens to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `OAUTH_SCOPE`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.
   > * `OAUTH_REDIRECT_URI`: URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`.

## Using the OAuth 2.0 Client Credentials flow

The OAuth 2.0 Client Credentials flow provides a secure way for machine-to-machine (M2M) authentication, such as the Snowflake Connector for Python connecting to a backend service. Unlike the OAuth 2.0 Authorization Code flow, this method does not rely on any user-specific data.

To enable the OAuth 2.0 Client Credentials flow:

1. Set the `authenticator` connection parameter to `oauth_client_credentials`.
2. Set the following OAuth connection parameters:

   > * `OAUTH_CLIENT_ID`: Value of the `client id` provided by the identity provider for Snowflake integration (Snowflake security integration metadata).
   > * `OAUTH_CLIENT_SECRET`: Value of the `client secret` provided by the identity provider for Snowflake integration (Snowflake security integration metadata)
   > * `OAUTH_TOKEN_REQUEST_URL`: Identity provider endpoint supplying the access tokens to the driver. When Snowflake is used as an identity provider, this value is derived from the `server` or `account` parameters.
   > * `OAUTH_SCOPE`: Scope requested in the identity provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes.

## Authenticating with a programmatic access token (PAT)

Programmatic access token (PAT) is a Snowflake-specific authentication method. The feature must be enabled for the account before usage (see the [Prerequisites](../../user-guide/programmatic-access-tokens.md) for more information). Authentication with PAT doesn’t involve any human interaction.

## Authenticating with workload identity federation (WIF)

[Workload identity federation](../../user-guide/workload-identity-federation.md) provides a service-to-service authentication method for Snowflake. This method enables applications, services, or containers to authenticate with Snowflake by leveraging their cloud provider’s native identity system, such as AWS IAM, Microsoft Entra ID, or Google Cloud service accounts. This approach eliminates the need for managing long-lived credentials and simplifies credential acquisition compared to other methods like External OAuth. Snowflake connectors are designed to automatically obtain short-lived credentials from the platform’s identity provider.

To enable the workload identity federation authenticator, do the following:

1. Set the `authenticator` connection parameter to `WORKLOAD_IDENTITY`.
2. Set the `workload_identity_provider` connection parameter to `AWS`, `AZURE`, `GCP`, or `OIDC`, based on your platform.
3. For OpenID Connect (OIDC), specify the `token` connection parameter.

## Managing log files

To help you to track issues that might arise, you can enable logging in the ODBC driver.
The ODBC driver provides the following configuration options that you can use to set up logging and manage log files:

* EnablePidLogFileNames: Adds the process ID to the name of a log file.
* LogFileCount: Sets the maximum number of saved log files.
* LogFileSize: Specifies the maximum size of a log file.
* LogLevel: Specifies the types of information to log.
* LogPath: Sets the location for log files.

You can use these parameters to manage how you name, store, and rotate log files. You can specify how large and how many log files you want to keep
before replacing them with newly-created log files. The following example appends the process ID to file names to ensure uniqueness,
sets the maximum file size to 30MB, and keeps the 100 most recent log files.

```ini
EnablePidLogFileNames = true  # Appends the process ID to each log file
LogFileSize = 30,145,728      # Sets log files size to 30MB
LogFileCount = 100            # Saves the 100 most recent log files
```

### Logging configuration file

Alternatively, you can easily specify the log level and
the directory in which to save log files in the `sf_client_config.json` configuration file.

> **Note:**
>
> This logging configuration file feature supports only the following log levels:
>
> > * `DEBUG`
> > * `ERROR`
> > * `INFO`
> > * `OFF`
> > * `TRACE`
> > * `WARNING`
> > * `FATAL`

This configuration file uses JSON to define the `log_level` and `log_path` logging parameters, as follows:

```bash
{
  "common": {
    "log_level": "DEBUG",
    "log_path": "/home/user/logs"
  }
}
```

The driver looks for the location of the configuration file in the following order:

* `client_config_file` containing the full path to the configuration file.
* `SF_CLIENT_CONFIG_FILE` environment variable, containing the full path to the configuration file.
* ODBC driver installation directory, where the file must be named `sf_client_config.json`.
* User’s home directory, where the file must be named `sf_client_config.json`.

> **Note:**
>
> * The values of the `LogLevel` and `LogPath` take precedence over values defined in the `sf_client_config.json` file.
> * If a configuration file specified in either the `client_config_file` connection parameter or
>   `SF_CLIENT_CONFIG_FILE` environment variable cannot be found or read, the driver throws an error message.

## Verifying the OCSP connector or driver version

Snowflake uses OCSP to evaluate the certificate chain when making a connection to Snowflake. The driver or connector version and its configuration both determine the OCSP behavior. For more information about the driver or connector version, their configuration, and OCSP behavior, see [OCSP Configuration](../../user-guide/ocsp.md).

## OCSP response cache server

> **Note:**
>
> The OCSP response cache server is currently supported by the Snowflake ODBC Driver 2.15.0 and higher.

Snowflake clients initiate every connection to a Snowflake service endpoint with a “handshake” that establishes a secure connection before actually transferring data. As part of the handshake, a
client authenticates the TLS certificate for the service endpoint. The revocation status of the certificate is checked by sending a client certificate request to one of the OCSP
(Online Certificate Status Protocol) servers for the CA (certificate authority).

A connection failure occurs when the response from the OCSP server is delayed beyond a reasonable time. The following caches persist the revocation status, helping alleviate these issues:

* Memory cache, which persists for the life of the process.
* File cache, which persists until the cache directory (e.g. `~/.cache/snowflake` or `~/.snowsql/ocsp_response_cache`) is purged.
* Snowflake OCSP response cache server, which fetches OCSP responses from the CA’s OCSP servers hourly and stores them for 24 hours. Clients can then request the validation status of a given Snowflake
  certificate from this server cache.

  > **Important:**
  >
  > If your server policy denies access to most or all external IP addresses and web sites, you must allowlist the cache server
  > address to allow normal service operation. The cache server hostname is `ocsp*.snowflakecomputing.com:80`.

  If you need to disable the cache server for any reason, set the `SF_OCSP_RESPONSE_CACHE_SERVER_ENABLED` environment variable to `false`. Note that the value is case-sensitive and must
  be in lowercase.

If none of the cache layers contain the OCSP response, the client then attempts to fetch the validation status directly from the OCSP server for the CA.

---
title: ODBC Driver
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc.md
section: Developer Guide
---

# ODBC Driver

Snowflake provides a driver for connecting to Snowflake using ODBC-based client applications.

> **Important:**
>
> The ODBC driver has different prerequisites depending on the platform where it is installed. For details, see the individual installation and configuration instructions for each platform.
>
> In addition, different versions of the ODBC driver support the [GET](../../sql-reference/sql/get.md) and [PUT](../../sql-reference/sql/put.md) commands, depending on the cloud service that hosts your Snowflake account:
>
> * Amazon Web Services: Version 2.17.5 (and higher)
> * Google Cloud Platform: Version 2.21.5 (and higher)
> * Microsoft Azure: Version 2.20.2 (and higher)

**Next Topics:**

* [Downloading the ODBC Driver](odbc-download.md)
* [Installing and configuring the ODBC Driver for Windows](odbc-windows.md)
* [Installing and configuring the ODBC Driver for macOS](odbc-mac.md)
* [Installing and configuring the ODBC Driver for Linux](odbc-linux.md)
* [ODBC configuration and connection parameters](odbc-parameters.md)
* [ODBC Driver API support](odbc-api.md)
* [Using the ODBC Driver](odbc-using.md)
* [ODBC Driver diagnostic service](odbc-diagnostic-service.md)

---
title: ODBC Driver API support
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-api.md
section: Developer Guide
---

# ODBC Driver API support

The Snowflake ODBC driver supports version 3.52 of the ODBC API. This topic lists the ODBC routines relevant to Snowflake and indicates whether they are supported. The routines are organized into
categories based on the function they perform.

For the complete API reference, see the [Microsoft ODBC Programmer’s Reference](https://msdn.microsoft.com/en-us/library/ms714177.aspx).

## Connecting to a data source

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLAllocHandle` | ✔ |  |
| `SQLConnect` | ✔ |  |
| `SQLDriverConnect` | ✔ |  |
| `SQLAllocEnv` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLAllocConnect` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLBrowseConnect` | ✔ |  |

## Obtaining information about a driver and data source

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLDataSources` | ✔ |  |
| `SQLDrivers` | ✔ |  |
| `SQLGetInfo` | ✔ |  |
| `SQLGetFunctions` | ✔ |  |
| `SQLGetTypeInfo` | ✔ |  |

## Setting and retrieving driver attributes

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLSetConnectAttr` | ✔ | Setting SQL_ATTR_METADATA_ID only affects the SQLTables and SQLColumns functions (and not the other supported catalog functions). |
| `SQLGetConnectAttr` | ✔ | Read-only mode is not supported. SQL_MODE_READ_ONLY is passed to the driver, but Snowflake still writes to the database. . . Also, some attributes were introduced post API version 3.52: SQL_ATTR_ASYNC_DBC_EVENT, SQL_ATTR_ASYNC_DBC_FUNCTIONS_ENABLE, SQL_ATTR_ASYNC_DBC_PCALLBACK, SQL_ATTR_ASYNC_DBC_PCONTEXT, SQL_ATTR_DBC_INFO_TOKEN. |
| `SQLSetConnectOption` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLGetConnectOption` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLSetEnvAttr` | ✔ |  |
| `SQLGetEnvAttr` | ✔ | The SQL_ATTR_CONNECTION_POOLING attribute was introduced after ODBC API version 3.52 and is not supported. |
| `SQLSetStmtAttr` | ✔ | SQL_ATTR_CURSOR_SCROLLABLE only supports a SQL_NONSCROLLABLE value. . SQL_ATTR_USE_BOOKMARKS only supports a SQL_UB_OFF value. . . For compatibility with third-party tools, SQL_ATTR_ENABLE_AUTO_IPD defaults to true, even though the ODBC standard says it should default to false. To change the default value to false, set the [EnableAutoIpdByDefault](odbc-parameters.md) parameter to `false`. . . Setting SQL_ATTR_METADATA_ID only affects the SQLTables and SQLColumns functions (and not the other supported catalog functions). . . Unsupported attributes: SQL_ATTR_SIMULATE_CURSOR, SQL_ATTR_FETCH_BOOKMARK_PTR, SQL_ATTR_KEYSET_SIZE. |
| `SQLGetStmtAttr` | ✔ | In addition to the standard attributes, the Snowflake implementation supports the attribute SQL_SF_STMT_ATTR_LAST_QUERY_ID, which allows the user to retrieve the most recent query ID associated with the specified statement handle. A partial example is in the Examples section below. |
| `SQLSetStmtOption` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. Replaced by `SQLSetStmtAttr`. |
| `SQLGetStmtOption` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. Replaced by `SQLGetStmtAttr`. |
| `SQLParamOptions` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. Replaced by `SQLSetStmtAttr`. |

Each of the preceding functions has a corresponding function that accepts wide characters (unicode). Each such
unicode function has the name shown above, followed by “W”. For example, the function `SQLGetStmtAttr`, which
accepts a char array as the third parameter, has a corresponding function named `SQLGetStmtAttrW`, which accepts a
wchar array as the third parameter.

### Snowflake-specific behavior

* `SQLSetConnectAttr`

  > This method supports two Snowflake-specific attributes:
  >
  > | Attribute Name | Description |
  > | --- | --- |
  > | SQL_SF_CONN_ATTR_APPLICATION | This overrides the value specified by the APPLICATION setting in the registry or .ini file. |
  > | SQL_SF_CONN_ATTR_PRIV_KEY | This is an EVP_PKEY\* pointer that points to an in-memory copy of the private key. This overrides the PRIV_KEY_FILE and PRIV_KEY_PWD settings in the registry or .ini file. Snowflake recommends using this attribute to set the private key. |
  >
  > In Snowflake ODBC driver version 3.4.0 and up, you can use the following two additional attributes in `SQLSetConnectAttr`:
  >
  > | Attribute name | Description |
  > | --- | --- |
  > | `SQL_SF_CONN_ATTR_PRIV_KEY_CONTENT` | Lets you pass the contents of a private key directly into the connection. Make sure to pass the full key contents, including the header and footer. |
  > | `SQL_SF_CONN_ATTR_PRIV_KEY_PASSWORD` | If you’re passing an encrypted private key in the `SQL_SF_CONN_ATTR_PRIV_KEY_CONTENT`, this attribute lets you specify the password.  Using `SQL_SF_CONN_ATTR_PRIV_KEY_CONTENT` might be necessary, if your application and the ODBC driver are linked to incompatible versions of OpenSSL, and you’re seeing crashes coming from the ODBC driver when key-pair authentication is used.  The following C++ code illustrates the implementation:  ```C++ std::string fileContent = loadKeyFileContent(keyFilePath); SQLSetConnectAttr(dbc, SQL_SF_CONN_ATTR_PRIV_KEY_CONTENT, (SQLPOINTER)fileContent.c_str(), SQL_NTS); ``` |

## Setting and retrieving descriptor fields

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLGetDescField` | ✔ |  |
| `SQLGetDescRec` | ✔ |  |
| `SQLSetDescField` | ✔ |  |
| `SQLSetDescRec` | ✔ |  |

## Preparing SQL requests

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLAllocStmt` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLBindParameter` | ✔ |  |
| `SQLPrepare` | ✔ |  |
| `SQLGetCursorName` | ✔ |  |
| `SQLSetCursorName` | ✔ |  |
| `SQLSetScrollOptions` | ✔ | Supported by the Snowflake driver, but deprecated ODBC API. |
| `SQLSetParam` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 2.x. Replaced by `SQLBindParameter`. |

> **Note:**
>
> * There is an upper limit to the size of data that you can bind. For details, see [Limits on Query Text Size](../../user-guide/query-size-limits.md).
> * [SQL Statements Supported for Preparation](../../user-guide/sql-prepare.md) lists the types of SQL statements that are supported for preparation.

## Submitting requests

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLExecute` | ✔ |  |
| `SQLExecDirect` | ✔ |  |
| `SQLNativeSql` | ✔ |  |
| `SQLDescribeParam` | ✔ | Regardless of the data type bound to the parameter, Snowflake performs a server-side conversion and returns a VARCHAR with a maximum length of 134217728. |
| `SQLNumParams` | ✔ |  |
| `SQLParamData` | ✔ | Support for this function was added in version 2.23.3 of the ODBC Driver. |
| `SQLPutData` | ✔ | Support for this function was added in version 2.23.3 of the ODBC Driver. |

## Retrieving results and information about results

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLBindCol` | ✔ | The ODBC driver does not currently support semi-structured data, including `VARIANT`, `OBJECT` and `ARRAY` data types. |
| `SQLError` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. Replaced by `SQLGetDiagRec`. |
| `SQLGetData` | ✔ |  |
| `SQLGetDiagField` | ✔ |  |
| `SQLGetDiagRec` | ✔ |  |
| `SQLRowCount` | ✔ |  |
| `SQLNumResultCols` | ✔ |  |
| `SQLDescribeCol` | ✔ |  |
| `SQLColAttribute` | ✔ | For [GEOGRAPHY](../../sql-reference/data-types-geospatial.md) columns, `SQL_DESC_TYPE_NAME` returns `GEOGRAPHY`. Note that other descriptors (e.g. `SQL_DESC_CONCISE_TYPE`) do not indicate that the column type is `GEOGRAPHY`. |
| `SQLColAttributes` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 2.x. Replaced by `SQLColAttribute`. |
| `SQLFetch` | ✔ |  |
| `SQLFetchScroll` | ✔ | The `FetchOrientation` argument supports the SQL_FETCH_NEXT value only. All other types of fetch fail. |
| `SQLExtendedFetch` |  | Replaced by `SQLFetchScroll` in API version 3.x driver. |
| `SQLSetPos` |  | Snowflake does not support the functionality. |
| `SQLBulkOperations` |  | Snowflake does not support the functionality. |

## Obtaining information about the data source’s system tables (catalog functions)

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLColumnPrivileges` |  | Returns an empty results set. |
| `SQLColumns` | ✔ |  |
| `SQLForeignKeys` | ✔ |  |
| `SQLPrimaryKeys` | ✔ |  |
| `SQLProcedureColumns` | ✔ |  |
| `SQLProcedures` | ✔ | In the result set, the `NUM_INPUT_PARAMS` column contains the number of arguments for the procedure (the value of the max_num_arguments column in the output of the `SHOW PROCEDURES` command). . . The `NUM_OUTPUT_PARAMS` column contains NULL values because stored procedures in Snowflake don’t support output parameters. . . The `NUM_RESULT_SETS` column also contains NULL values because stored procedures in Snowflake don’t return result sets. . . The `PROCEDURE_TYPE` column always contains `SQL_PT_FUNCTION` because stored procedures in Snowflake always return a value. |
| `SQLSpecialColumns` |  | Returns an empty results set. |
| `SQLStatistics` |  | Returns an empty results set. |
| `SQLTablePrivileges` |  | Returns an empty results set. |
| `SQLTables` | ✔ | If the parameter passed to the function is “TABLE”, the function returns all types of tables, including transient tables and temporary tables. . . If the parameter passed to the function is “VIEW”, the function returns all types of views, including materialized views. . . If the parameter passed to the function is “TABLE, VIEW” or “%”, the function returns information about all types of tables and all types of views. |

If the name passed to the catalog function has an invalid character, or if the name does not match any database object, the function returns an empty result set.

Setting `SQL_ATTR_METADATA_ID` only affects the `SQLTables`, `SQLColumns`, and `SQLProcedures` functions.

## Terminating a statement

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLFreeStmt` | ✔ |  |
| `SQLCloseCursor` | ✔ |  |
| `SQLCancel` | ✔ |  |
| `SQLEndTran` | ✔ |  |
| `SQLTransact` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. Replaced by `SQLEndTran`. |

## Terminating a connection

| Function Name | Supported | Notes |
| --- | --- | --- |
| `SQLCancelHandle` |  | Introduced into the API after version 3.52. |
| `SQLDisconnect` | ✔ |  |
| `SQLFreeHandle` | ✔ |  |
| `SQLFreeConnect` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |
| `SQLFreeEnv` | ✔ | Supported by the Snowflake driver, but deprecated in ODBC API version 3.x. |

## Custom SQL data types

Some SQL data types supported by Snowflake have no direct mapping in ODBC (e.g. TIMESTAMP_\*tz, VARIANT). To enable the ODBC driver to work with
the unsupported data types, the header file shipped with the driver includes definitions for the following custom data types:

```C
////////////////////////////////////////////////////////////////////////////////////////////////////
/// Custom SQL Data Type Definition
///
///
////////////////////////////////////////////////////////////////////////////////////////////////////

#define SQL_SF_TIMESTAMP_LTZ 2000
#define SQL_SF_TIMESTAMP_TZ  2001
#define SQL_SF_TIMESTAMP_NTZ 2002
#define SQL_SF_ARRAY         2003
#define SQL_SF_OBJECT        2004
#define SQL_SF_VARIANT       2005
```

The following code demonstrates sample usage of the custom data types:

```C
// bind insert as timestamp_ntz
SQLRETURN rc;
rc = SQLPrepare(odbc.StmtHandle,
               (SQLCHAR *) "insert into testtimestampntz values (?)",
               SQL_NTS);

 SQL_TIMESTAMP_STRUCT bindData;
 SQLLEN datalen = sizeof(SQL_TIMESTAMP_STRUCT);
 bindData.year = 2017;
 bindData.month = 11;
 bindData.day = 30;
 bindData.hour = 18;
 bindData.minute = 17;
 bindData.second = 5;
 bindData.fraction = 123456789;

 rc = SQLBindParameter(
   odbc.StmtHandle, 1, SQL_PARAM_INPUT,
   SQL_C_TIMESTAMP, SQL_SF_TIMESTAMP_NTZ,
   100, 0, &bindData, sizeof(bindData), &datalen);

 rc = SQLExecute(odbc.StmtHandle);

 // query table
 rc = SQLExecDirect(odbc.StmtHandle, (SQLCHAR *)"select * from testtimestampntz", SQL_NTS);

 rc = SQLFetch(odbc.StmtHandle);

 // fetch data as timestamp
 SQL_TIMESTAMP_STRUCT ret;
 SQLLEN retLen = (SQLLEN) 0;
 rc = SQLGetData(odbc.StmtHandle, 1, SQL_C_TIMESTAMP, &ret, (SQLLEN)sizeof(ret), &retLen);
```

## Examples

This section provides examples of using the API.

### Retrieving the last query ID

Retrieving the last query ID is a Snowflake extension to the ODBC standard.

To retrieve the last query ID, call the function `SQLGetStmtAttr` (or `SQLGetStmtAttrW`), passing the attribute
SQL_SF_STMT_ATTR_LAST_QUERY_ID and a character array large enough to hold the query ID.

The example below shows how to retrieve the query ID for a query:

```C
// Space to store the query ID.
// The SQLGetStmtAttr() function fills this in with the actual ID.
char queryId[37];   // Maximum 36 chars plus string terminator.

// The length (in characters) of the query ID. The SQLGetStmtAttr() function fills this in
// with the actual length of the query ID (usually 36).
SQLINTEGER idLen;

// Execute a query.
rc = SQLExecDirect(odbc.StmtHandle, (SQLCHAR *) "select 1", SQL_NTS);

// Retrieve the query ID (queryId) and the length of that query ID (idLen).
SQLGetStmtAttr(odbc.StmtHandle, SQL_SF_STMT_ATTR_LAST_QUERY_ID, queryId, sizeof(queryId), &idLen);
```

If you are executing on Linux or macOS, call `SQLGetStmtAttrW` and pass parameters
of the appropriate data type (for example, “wchar” rather than “char”).

### Best practices to improve performance when retrieving data

When retrieving data with `SQLFetch`, you can use the `SQLGetData` or `SQLBindCol` functions to access
the contents of the cells. In most cases, using `SQLBindCol` provides better performance because it reduces the number
of ODBC calls you need to make to retrieve data and because it lets you take advantage of copying data in-memory.

#### Using `SQLGetData` to retrieve cell data

The following example uses the `SQLGetData` function to retrieve cell values from the data buffer returned
by `SQLFetch`. Notice that you need to call `SQLGetData` once for each cell in the row.

```C
SQLRETURN rc;
SQLSMALLINT numCols;
const size_t s_MaxDataLen = 300;

// fetch with SQLGetData()
// query table
rc = SQLExecDirect(stmt, (SQLCHAR *)"select * from table", SQL_NTS);

// Find out the number of result set columns.
rc = SQLNumResultCols(stmt, &numCols);

// buffer for one cell
vector<char> dataBuffer(s_MaxDataLen);
SQLLEN dataLen = (SQLLEN)0;

// call SQLFetch() per row and SQLGetData() per column per row
while (true)
{
    rc = SQLFetch(stmt);
    if ((rc != SQL_SUCCESS) && (rc != SQL_SUCCESS_WITH_INFO))
    {
        break;
    }
    for (SQLUSMALLINT i = 0; i < numCols; i++)
    {
        rc = SQLGetData(stmt, i + 1, SQL_C_CHAR, dataBuffer.data(), (SQLLEN)s_MaxDataLen, &dataLen);
        std::string data;
        if (SQL_NULL_DATA == dataLen)
            continue;
        if (SQL_NO_TOTAL == dataLen)
            dataLen = s_MaxDataLen;
        data = std::string(dataBuffer.data(), dataLen);
    }
}
rc = SQLCloseCursor(stmt);
```

#### Using `SQLBindCol` to bind the columns for one row of data

The following example uses the `SQLBindCol` function to retrieve cell values from the data buffer returned by
`SQLFetch`. It creates an in-memory buffer for the number of columns in a row and then makes a single
`SQLBindCol` call to bind the application buffers to the result set. Finally, it calls `SQLFetch` once per row and
loads the cell values into the buffer. This approach can significantly increase the speed and efficiency of retrieving data.

```C
SQLRETURN rc;
SQLSMALLINT numCols;
const size_t s_MaxDataLen = 300;

// fetch with SQLBindCol()
// query table
rc = SQLExecDirect(stmt, (SQLCHAR *)"select * from table", SQL_NTS);

// Find out the number of result set columns.
rc = SQLNumResultCols(stmt, &numCols);

// buffer for one row
vector<char> rowBuffer(s_MaxDataLen * numCols);
vector<SQLLEN> columnLenBuffer(numCols);

// call SQLBindCol() per column
for (SQLSMALLINT i = 0; i < numCols; ++i)
{
    SQLBindCol(stmt, i + 1, SQL_C_CHAR, &rowBuffer[s_MaxDataLen * i],
               s_MaxDataLen, &columnLenBuffer[i]);
}

// call SQLFetch() per row
while (true)
{
    rc = SQLFetch(stmt);
    if ((rc != SQL_SUCCESS) && (rc != SQL_SUCCESS_WITH_INFO))
    {
         break;
    }
    // go through data for each cell in buffer without ODBC calls
    for (SQLUSMALLINT i = 0; i < numCols; i++)
    {
        std::string data;
        SQLLEN len = columnLenBuffer[i];
        if (SQL_NULL_DATA == len)
            continue;
        if (SQL_NO_TOTAL == len)
            len = s_MaxDataLen;
        data = std::string(&rowBuffer[s_MaxDataLen * i], len);
    }
}
rc = SQLCloseCursor(stmt);
```

#### Using `SQLBindCol` to bind the columns for multiple rows of data

You can improve performance even more by fetching multiple rows in a single `SQLFetch` call, which reduces
the number of ODBC `SQLFetch` calls needed to process all the rows of a query table.

The following example:

* Determines the number of columns in the result set.
* Creates an in-memory array to store the data from multiple columns.
* Calls `SQLBindCol` for each column to bind the application buffers to the result set.
* Calls `SQLFetch` to get the specified number of rows (100) and processes the data in the in-memory buffer without making ODBC calls, until the end of the query table is reached.

This approach can significantly increase the speed and efficiency of retrieving data. For a query table with 20 columns and 1000 rows, this example would make only 20 `SQLBindCol` and 10 `SQLFetch` calls instead of 20000 `SQLGetData` calls to load all of the table data.

```C
SQLRETURN rc;
SQLSMALLINT numCols;
const size_t s_MaxDataLen = 300;

// fetch with SQLBindCol() and SQL_ATTR_ROW_ARRAY_SIZE > 1
const size_t s_numRowsPerSQLFetch = 100;
SQLULEN numRowsFetched = 0;
rc = SQLSetStmtAttr(stmt, SQL_ATTR_ROW_ARRAY_SIZE, (SQLPOINTER)s_numRowsPerSQLFetch, 0);
rc = SQLSetStmtAttr(stmt, SQL_ATTR_ROWS_FETCHED_PTR, (SQLPOINTER)&numRowsFetched, sizeof(SQLULEN));

// query table
rc = SQLExecDirect(stmt, (SQLCHAR *)"select * from table", SQL_NTS);

// Find out the number of result set columns.
rc = SQLNumResultCols(stmt, &numCols);

// buffer for all columns; each column has buffer size of s_numRowsPerSQLFetch
// To retrieve multiple rows per SQLFetch() call, use the default behavior of SQL_BIND_BY_COLUMN
vector<vector<char> > colArray(numCols);
vector<vector<SQLLEN> > colLenArray(numCols);

// call SQLBindCol() per column
for (SQLSMALLINT i = 0; i < numCols; ++i)
{
    // initialize buffer for each column
    colArray[i].resize(s_MaxDataLen * s_numRowsPerSQLFetch);
    colLenArray[i].resize(s_numRowsPerSQLFetch);

    SQLBindCol(stmt, i + 1, SQL_C_CHAR, colArray[i].data(),
                s_MaxDataLen, colLenArray[i].data());
}

// call SQLFetch() per s_numRowsPerSQLFetch rows
while (true)
{
    rc = SQLFetch(stmt);
    if ((rc != SQL_SUCCESS) && (rc != SQL_SUCCESS_WITH_INFO))
    {
        break;
    }
    // go through data for each cell in buffer without ODBC calls
    for (SQLULEN rowIndex = 0; rowIndex < numRowsFetched; rowIndex++)
    {
        for (SQLUSMALLINT colIndex = 0; colIndex < colIndex; colIndex++)
        {
            std::string data;
            SQLLEN len = colLenArray[colIndex][rowIndex];
            if (SQL_NULL_DATA == len)
                continue;
            if (SQL_NO_TOTAL == len)
                len = s_MaxDataLen;
            data = std::string(&(colArray[colIndex][s_MaxDataLen * rowIndex]), len);
        }
    }
}
rc = SQLCloseCursor(stmt);
```

---
title: ODBC Driver diagnostic service
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-diagnostic-service.md
section: Developer Guide
---

# ODBC Driver diagnostic service

To aid Snowflake Support in diagnosing customer incidents, the Snowflake ODBC driver utilizes a diagnostic service that runs in the background. When the driver encounters an issue that prevents
it from performing normally, the diagnostic service records information about the issue:

* The service writes a single compressed `sf_incident_log.dmp.gz` file to the `/tmp` folder by default.
* A different ODBC dump file location can be specified using the `LogPath` property in the `simba.snowflake.ini` configuration file.

> **Important:**
>
> The dump file may contain sensitive information (such as IP addresses) to further assist in solving the issue. Note that this file is only stored locally; it is not sent to Snowflake.
> You must choose to share the files, such as when diagnosing issues with Snowflake Support.
>
> If you wish to prevent the creation of dump files by the drivers, set the `DisableSfDumps=true` parameter in the `simba.snowflake.ini` configuration file.

When a driver encounters an issue, the service may also send diagnostic information to Snowflake to help fix the problem. This information includes:

* Driver version information.
* A generic description of the issue.
* Stack traces for the driver that pertain to the issue. Other than the account identifier, these stack traces include no customer information.

---
title: Package Versions in Declarative Sharing in the Native Application Framework
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/versioning.md
section: Developer Guide
---

# Package Versions in Declarative Sharing in the Native Application Framework

# Versioning Application Packages in Declarative Sharing

With Declarative Sharing, versioning of your Declarative Native App is handled automatically, so providers and consumers don’t need to manually track version numbers. This simplifies the development and release process.
As a provider, you can iterate on your application in a live development environment and release new versions without manually tracking version numbers.
This topic describes how versioning works in the Snowflake Native App Framework for Declarative Native Apps and how new versions are made available to consumers.

## Make new versions of the application package / version management

With Declarative Native Apps, versioning is handled automatically,

Providers can make changes to the new live version of the app, update the contents, and re-release the application package.

Shortly after releasing a new version, it appears both for existing consumers and for new consumers in the Snowflake Marketplace.

Neither providers nor consumers can revert to a previous version of the app.

Notebooks are embedded in the application package and versioned along with the manifest.

Notebook versioning commands, such as ALTER NOTEBOOK CREATE LIVE VERSION FROM LAST, aren’t supported for these notebooks.

---
title: Packages policies
source: https://docs.snowflake.com/en/developer-guide/udf/python/packages-policy.md
section: Developer Guide
---

# Packages policies

## Introduction

Using a packages policy enables you to set allowlists and blocklists for third-party Python packages from Anaconda and Artifact Repository at the account level.
This lets you meet stricter auditing and security requirements and gives you more fine-grained control over which packages are
available or blocked in your environment.

For more information about how Snowpark Python allows you to bring in third-party packages from
Anaconda, see [Using third-party packages](udf-python-packages.md).

When you create a Python UDF, UDTF or stored procedure, the allowlist and blocklist will be taken into account when creating the Python environment. The allowlist and blocklist will apply to all packages that are required to create the Python environment, including packages from both Anaconda and Artifact Repository.
If it’s not possible to create an environment with the specified packages, the query will fail.

When you execute a Python UDF, UDTF or stored procedure, Snowflake will check the allowlist and blocklist and
make sure that all of the packages are allowed by the packages policy. Otherwise, the query will fail.

## Limitations

* Packages policies will only apply if the Anaconda legal terms have been accepted.
* Packages policies will not be applied for built-in functions and will also not be applied for native apps.
* Packages policies apply to Artifact Repository packages and Anaconda packages.

## Implementing and using a packages policy

In order to create a packages policy object, you must have the following privileges:

* USAGE on the database and schema in which you plan to create the packages policy.
* CREATE PACKAGES POLICY on the schema in which you plan to create the packages policy.

After the packages policy object is created, you must have the following privileges to apply it to the account:

* OWNERSHIP on the packages policy object.
* APPLY PACKAGES POLICY on the account.

Follow these steps to implement a packages policy.

### Step 1: Create a packages policy admin custom role

Create a custom role that allows users to create and manage packages policies. Throughout this topic, the example custom role is named
`policy_admin`, although the role could have any appropriate name.

If the custom role already exists, continue to the next step.

Otherwise, create the `policy_admin` custom role.

```sqlexample
USE ROLE useradmin;

CREATE ROLE policy_admin;
```

### Step 2: Grant privileges to the `policy_admin` custom role

If the `policy_admin` custom role does not already have the following privileges, grant these privileges as shown below:

* USAGE on the database and schema that will contain the packages policy.
* CREATE PACKAGES POLICY on the schema that will contain the packages policy.
* APPLY PACKAGES POLICY on the account.

```sqlexample
USE ROLE securityadmin;

GRANT USAGE ON DATABASE yourdb TO ROLE policy_admin;

GRANT USAGE, CREATE PACKAGES POLICY ON SCHEMA yourdb.yourschema TO ROLE policy_admin;

GRANT APPLY PACKAGES POLICY ON ACCOUNT TO ROLE policy_admin;
```

### Step 3: Create a new packages policy

Using the `policy_admin` custom role, create a new packages policy, with a language, allowlist, and blocklist
specified. ALLOWLIST, BLOCKLIST, ADDITIONAL_CREATION_BLOCKLIST, and COMMENT are optional parameters. By default, the allowlist value is `('*')`,
and the blocklist value is `()`.

If a package is specified in both the allowlist and the blocklist, then the blocklist takes precedence.
You must explicitly add the Python runtime version in the allowlist and
you must also explicitly add all packages and underlying dependencies of a parent package to the allowlist.

You can specify a particular package version or a range of versions by using these version
specifiers in the allowlist or blocklist: : `==`, `<=`, `>=`, `<`,or `>`.
For example, `numpy>=1.2.3`.
You can use wildcards, such as, `numpy==1.2.*`, which means any micro version of numpy 1.2.

> **Note:**
>
> Currently, in an allowlist or blocklist, only one range operator can be specified per package.
> Specifying multiple range operators is not supported, for example `pkg>1.0, <1.5`.
> Because of this, to configure a policy to allow an interval of a package version, set one side of
> the range in the allowlist and the other side of the range in the blocklist.
> For example, to allow package versions greater than 1.0 and less than 1.5,
> set the allowlist to `pkg>1.0` and the blocklist to `pkg>1.5`.

```sqlexample
USE ROLE policy_admin;

CREATE PACKAGES POLICY yourdb.yourschema.packages_policy_prod_1
  LANGUAGE PYTHON
  ALLOWLIST = ('numpy', 'pandas==1.2.3', ...)
  BLOCKLIST = ('numpy==1.2.3', 'bad_package', ...)
  ADDITIONAL_CREATION_BLOCKLIST = ('bad_package2', 'bad_package3', ...)
  COMMENT = 'Packages policy for the prod_1 environment'
;
```

Where:

> `yourdb.yourschema.packages_policy_prod_1`
> :   The fully qualified name of the packages policy.
>
> `LANGUAGE PYTHON`
> :   The language that this packages policy will apply to.
>
> `ALLOWLIST = ('numpy', 'pandas==1.2.3', ...)`
> :   The allowlist for this packages policy. This is a comma-separated string with package specs.
>
> `BLOCKLIST = ('numpy==1.2.3', 'bad_package', ...)`
> :   The blocklist for this packages policy. This is a comma-separated string with package specs.
>
> `ADDITIONAL_CREATION_BLOCKLIST = ('bad_package2', 'bad_package3', ...)`
> :   Specifies a list of package specs that are blocked at creation time. To unset this parameter, specify an empty list.
>     If the `ADDITIONAL_CREATION_BLOCKLIST` is set, it is appended to the basic BLOCKLIST at the creation time.
>     For temporary UDFs and anonymous stored procedures, the `ADDITIONAL_CREATION_BLOCKLIST` is appended to the `BLOCKLIST` at both creation and execution time.
>
> `COMMENT = 'Packages policy for the prod_1 environment'`
> :   A comment specifying the purpose of the packages policy.

In the example above, the blocklist applied for the creation time will be the `ADDITIONAL_CREATION_BLOCKLIST` plus the `BLOCKLIST` so the blocked packages will be
`numpy==1.2.3`, `bad_package`, `bad_package2` and `bad_package3`.
The blocklist applied for the execution will be: `numpy==1.2.3` and `bad_package`.
For temporary UDFs and anonymous stored procedures, the blocklist containing `numpy==1.2.3`, `bad_package`, `bad_package2` and `bad_package3`
will be applied at both creation and execution time.

#### Find package dependencies

To get a list of the dependencies of a Python package, use one of the following functions, depending on your requirements:

* [SYSTEM$RESOLVE_PYTHON_PACKAGES](../../../sql-reference/functions/system_resolve_python_packages.md)
* [SHOW_PYTHON_PACKAGES_DEPENDENCIES](../../../sql-reference/functions/show_python_packages_dependencies.md) - Note that this function only works for Anaconda (Conda) packages.

**SYSTEM$RESOLVE_PYTHON_PACKAGES**

For packages from both Artifact Repository and Anaconda, use the `SYSTEM$RESOLVE_PYTHON_PACKAGES` system function.
This function works with packages from PyPI (via Artifact Repository) and packages from Anaconda.

> **Syntax:**
>
> ```sqlsyntax
> SYSTEM$RESOLVE_PYTHON_PACKAGES(<python_version>, <package_spec_string>, [<artifact_repository_name>])
> ```
>
> Where:
>
> * `python_version`: String specifying the Python version (e.g., ‘3.12’)
> * `package_spec_string`: Package specifications in PACKAGES clause format (e.g., `$$('numpy>=1.20.0', 'pandas==1.3.0')$$`). Use `$$()$$` to return only base packages.
> * `artifact_repository_name`: Optional artifact repository name. If not provided, uses the default Anaconda repository.
>
> **Returns:** A JSON array of resolved package specifications in the format `["package1==version1", "package2==version2", ...]`.
> Always includes base packages (e.g., Python runtime) in addition to the requested packages.
>
> **Examples:**
>
> Using the default Anaconda repository:
>
> ```sqlexample
> SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$('numpy>=1.20.0', 'pandas==1.3.0')$$);
> ```
>
> The result is a list of the resolved packages and their dependencies:
>
> ```output
> ["_libgcc_mutex==0.1", "_openmp_mutex==5.1", "numpy==1.24.3", "pandas==1.5.3", "python==3.12.20", ...]
> ```
>
> Using a custom PyPI artifact repository:
>
> ```sqlexample
> SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$('scikit-learn')$$, 'snowflake.snowpark.pypi_shared_repository');
> ```
>
> To show only the base packages (Python runtime and dependencies):
>
> ```sqlexample
> SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$()$$);
> ```
>
> Unlike `SHOW_PYTHON_PACKAGES_DEPENDENCIES`, which only supports Anaconda packages,
> `SYSTEM$RESOLVE_PYTHON_PACKAGES` can resolve dependencies for packages from both Artifact Repository and Anaconda.
> This function can be called by any user without special privileges.
>
> If you want to know which packages a function is using, you can use [DESCRIBE FUNCTION](../../../sql-reference/sql/desc-function.md) to print them out.
> This is an alternative way to identify all of the dependencies of a package.
> To do this, create a function and in the package specification, provide the top level packages.
> Next, use DESCRIBE FUNCTION to get a list of all of the packages and their dependencies.
> You can copy and paste this list into the package allowlist.
> Note that the packages policy must be temporarily unset or some packages might be blocked.
> The following example shows how to find the dependencies for the ‘snowflake-snowpark-python’ package.
>
> ```sqlexample-python
> CREATE OR REPLACE FUNCTION my_udf()
>   RETURNS STRING
>   LANGUAGE PYTHON
>   PACKAGES = ('snowflake-snowpark-python')
>   RUNTIME_VERSION = 3.10
>   HANDLER = 'echo'
> AS $$
> def echo():
> return 'hi'
> $$;
>
> DESCRIBE FUNCTION my_udf();
> ```
>
> If you want to show all of the packages and versions that are available,
> query the INFORMATION_SCHEMA.PACKAGES view.
>
> ```sqlexample
> SELECT * FROM information_schema.packages;
> ```
>
> If you want to see the current set of packages you are using, you can use this SQL statement.
>
> ```sqlexample
> -- at the database level
>
> CREATE OR REPLACE VIEW USED_ANACONDA_PACKAGES
>   AS SELECT FUNCTION_NAME, VALUE PACKAGE_NAME
>   FROM (SELECT FUNCTION_NAME,PARSE_JSON(PACKAGES)
>   PACKAGES FROM INFORMATION_SCHEMA.FUNCTIONS
>   WHERE FUNCTION_LANGUAGE='PYTHON') USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES);
>
> -- at the account level
>
> CREATE OR REPLACE VIEW ACCOUNT_USED_ANACONDA_PACKAGES
>   AS SELECT FUNCTION_CATALOG, FUNCTION_SCHEMA, FUNCTION_NAME, VALUE PACKAGE_NAME
>   FROM (SELECT FUNCTION_CATALOG, FUNCTION_SCHEMA, FUNCTION_NAME,PARSE_JSON(PACKAGES)
>   PACKAGES FROM SNOWFLAKE.ACCOUNT_USAGE.FUNCTIONS
>   WHERE FUNCTION_LANGUAGE='PYTHON') USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES);
> ```
>
> To get a list of third-party packages that are available from Anaconda, use the [GET_ANACONDA_PACKAGES_REPODATA](../../../sql-reference/functions/get_anaconda_packages_repodata.md) function.
> The parameter is the architecture, which can be:
> `linux-64`, `linux-aarch64`, `osx-64`, `osx-arm64`, `win-64`, or `noarch`.
>
> For example, to show the list of third-party packages from Anaconda for the `linux-64` archtecture, use this command.
>
> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SNOWFLAKE.SNOWPARK.GET_ANACONDA_PACKAGES_REPODATA('linux-64');
> ```

**SHOW_PYTHON_PACKAGES_DEPENDENCIES**

The first parameter is the Python runtime version you are using and the second is a list of the packages to show dependencies for.
For example, to show the dependencies of the `numpy` package, use this command.

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SNOWFLAKE.SNOWPARK.SHOW_PYTHON_PACKAGES_DEPENDENCIES('3.12', ['numpy']);
```

The result is a list of the dependencies and their versions.

```output
['_libgcc_mutex==0.1', '_openmp_mutex==5.1', 'blas==1.0', 'ca-certificates==2024.9.24', 'intel-openmp==2023.1.0',
'ld_impl_linux-64==2.40', 'ld_impl_linux-aarch64==2.40', 'libffi==3.4.4', 'libgcc-ng==11.2.0', 'libgfortran-ng==11.2.0',
'libgfortran5==11.2.0', 'libgomp==11.2.0', 'libopenblas==0.3.21', 'libstdcxx-ng==11.2.0', 'mkl-service==2.4.0', 'mkl==2023.1.0',
'mkl_fft==1.3.10', 'mkl_random==1.2.7', 'ncurses==6.4', 'numpy-base==2.0.1', 'numpy==2.0.1', 'openssl==3.0.15', 'python==3.12.20',
'readline==8.2', 'sqlite==3.45.3', 'tbb==2021.8.0', 'tk==8.6.14', 'tzdata==2024b', 'xz==5.4.6', 'zlib==1.2.13']
```

To show the dependencies of Python 3.12 within Snowpark environment, call the function without specifying any packages.

```sqlexample
SELECT SNOWFLAKE.SNOWPARK.SHOW_PYTHON_PACKAGES_DEPENDENCIES('3.12', []);
```

### Step 4: Set the packages policy on an account

Using the `policy_admin` custom role, set the policy on an account with the [ALTER ACCOUNT](../../../sql-reference/sql/alter-account.md) command.

```sqlexample
USE ROLE policy_admin;

ALTER ACCOUNT SET PACKAGES POLICY yourdb.yourschema.packages_policy_prod_1;
```

> **Note:**
>
> To replace a packages policy that is already set for an account, unset the packages policy first and then set the new packages
> policy for the account. Alternatively, you can use FORCE to set the packages policy without having to unset the packages policy. For example:
>
> ```sqlexample
> ALTER ACCOUNT SET PACKAGES POLICY yourdb.yourschema.packages_policy_prod_2 force;
> ```

If you want to see which policy is active on the account, you can use this SQL statement.

```sqlexample
SELECT * FROM TABLE(information_schema.policy_references(ref_entity_domain=>'ACCOUNT', ref_entity_name=>'<your_account_name>'))
```

The result of this query will display a column with the name `POLICY_STATUS`.

Later, if you want to unset the package policy on your account, use this SQL statement.

```sqlexample
ALTER ACCOUNT UNSET PACKAGES POLICY;
```

### Privileges required to execute DDL commands

The following table summarizes the relationship between the packages policy DDL operations and their necessary privileges.

| Operation | Privilege required |
| --- | --- |
| Create packages policy | A role with the CREATE PACKAGES POLICY privilege on the schema. |
| Alter packages policy | A role with the OWNERSHIP privilege on the packages policy. |
| Drop packages policy | A role with the OWNERSHIP privilege on the packages policy. |
| Describe packages policy | A role with the OWNERSHIP or USAGE privilege on the packages policy. |
| Show packages policies | A role with the OWNERSHIP or USAGE privilege on the packages policy. |
| Set & unset packages policy | A role with the APPLY PACKAGES POLICY privilege on the account and the OWNERSHIP privilege on the packages policy. |

## Packages policy DDL

Snowflake provides the following DDL commands to manage packages policy objects:

* [CREATE PACKAGES POLICY](../../../sql-reference/sql/create-packages-policy.md)
* [ALTER PACKAGES POLICY](../../../sql-reference/sql/alter-packages-policy.md)
* [DROP PACKAGES POLICY](../../../sql-reference/sql/drop-packages-policy.md)
* [SHOW PACKAGES POLICIES](../../../sql-reference/sql/show-packages-policies.md)
* [DESCRIBE PACKAGES POLICY](../../../sql-reference/sql/desc-packages-policy.md)

## Packages policy observability

Users who do not have access to the packages policy that is set on the account are able to see the contents of it.

Users can control who sees the contents of the packages policy by adding the USAGE privilege to the packages policies.
The account administrator or packages policy owner can grant this privilege to roles that need to use packages policies.

```sqlexample
GRANT USAGE ON PACKAGES POLICY <packages policy name> TO ROLE <user role>;
```

The [CURRENT_PACKAGES_POLICY](../../../sql-reference/info-schema/current_packages_policy.md) Information Schema view displays a row for each
Snowpark packages policy on the current account.

```sqlexample
SELECT * FROM information_schema.current_packages_policy;
```

```output
+------+----------+-----------+-----------+-------------------------------+---------+
| NAME | LANGUAGE | ALLOWLIST | BLOCKLIST | ADDITIONAL_CREATION_BLOCKLIST | COMMENT |
+------+----------+-----------+-----------+-------------------------------+---------+
| P1   | PYTHON   | ['*']     | []        | [NULL]                        | [NULL]  |
+------+----------+-----------+-----------+-------------------------------+---------+
```

To see the Anaconda packages that are used at the database level for function, use this SQL statement.

```sqlexample
USE DATABASE mydb;

CREATE OR REPLACE VIEW USED_ANACONDA_PACKAGES
  AS
  SELECT FUNCTION_NAME, VALUE PACKAGE_NAME
  FROM (SELECT FUNCTION_NAME,PARSE_JSON(PACKAGES)
  PACKAGES FROM INFORMATION_SCHEMA.FUNCTIONS
  WHERE FUNCTION_LANGUAGE='PYTHON') USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES);
```

To see the Anaconda packages that are used at the account level for function, use this SQL statement.

```sqlexample
USE DATABASE mydb;

CREATE OR REPLACE VIEW ACCOUNT_USED_ANACONDA_PACKAGES
  AS
  SELECT  FUNCTION_CATALOG, FUNCTION_SCHEMA, FUNCTION_NAME, VALUE PACKAGE_NAME
  FROM (SELECT FUNCTION_CATALOG, FUNCTION_SCHEMA, FUNCTION_NAME,PARSE_JSON(PACKAGES)
  PACKAGES FROM SNOWFLAKE.ACCOUNT_USAGE.FUNCTIONS
  WHERE FUNCTION_LANGUAGE='PYTHON') USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES);
```

To see all of the installed Anaconda packages on your account, use this SQL statement.

```sqlexample
USE DATABASE mydb;

CREATE OR REPLACE VIEW ACCOUNT_USED_ANACONDA_PACKAGES
  AS
  SELECT 'FUNCTION' TYPE, FUNCTION_CATALOG DATABASE, FUNCTION_SCHEMA SCHEMA, FUNCTION_NAME NAME, VALUE::STRING PACKAGE_NAME
  FROM (SELECT FUNCTION_CATALOG, FUNCTION_SCHEMA, FUNCTION_NAME,PARSE_JSON(PACKAGES)
  PACKAGES FROM SNOWFLAKE.ACCOUNT_USAGE.FUNCTIONS
  WHERE FUNCTION_LANGUAGE='PYTHON' AND PACKAGES IS NOT NULL) USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES)
  UNION
  (SELECT 'PROCEDURE' TYPE, PROCEDURE_CATALOG DATABASE, PROCEDURE_SCHEMA SCHEMA, PROCEDURE_NAME, VALUE::STRING PACKAGE_NAME
  FROM (SELECT PROCEDURE_CATALOG, PROCEDURE_SCHEMA,PROCEDURE_NAME,PARSE_JSON(PACKAGES)
  PACKAGES FROM SNOWFLAKE.ACCOUNT_USAGE.PROCEDURES
  WHERE PROCEDURE_LANGUAGE='PYTHON' AND PACKAGES IS NOT NULL) USED_PACKAGES,LATERAL FLATTEN(USED_PACKAGES.PACKAGES));
```

## Replication and packages policies

Packages policies are replicated from a source account to target accounts if the database containing the packages policy is
[replicated](../../../user-guide/account-replication-intro.md). For more information, see [Dangling references and packages policies](../../../user-guide/account-replication-considerations.md).

---
title: Packaging Handler Code
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-building.md
section: Developer Guide
---

# Packaging Handler Code

To make function or procedure handler code written in Java or Scala easier to reuse, you can build a JAR file that contains the handler and
its dependencies. When you create the function or procedure, you reference the handler JAR on a stage.

Topics in this section describe how to build handlers with commonly-used build tools.

For more information about using packaged handler code (as well as other dependencies) by referencing them on a stage, see
[Keeping handler code in-line or on a stage](inline-or-staged.md).

> **Note:**
>
> You can also use an IntelliJ IDEA project (not an SBT project in IntelliJ) to create the handler JAR. For more information, see the
> [instructions on setting up an artifact configuration](https://www.jetbrains.com/help/idea/compiling-applications.html#configure_artifact).

[Packaging Scala Handler Code with sbt](udf-stored-procedure-build-sbt.md)
:   Build Scala handler code with sbt.

[Packaging Java or Scala Handler Code with Maven](udf-stored-procedure-build-maven.md)
:   Build handler code with Maven.

---
title: Packaging Java or Scala Handler Code with Maven
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-build-maven.md
section: Developer Guide
---

# Packaging Java or Scala Handler Code with Maven

If you are using Maven to build and package your code, you can use the
[Maven Assembly Plugin](https://maven.apache.org/plugins/maven-assembly-plugin/index.html) to create a JAR file that contains
all of the dependencies.

Once you have a JAR file, you can upload the file to a Snowflake stage, then reference it in an IMPORTS statement when you create a
function or procedure. For more information on uploading JAR files, refer to [Making dependencies available to your code](upload-dependencies.md). For more
information on choosing whether to have code inline or on a stage, refer to [Keeping handler code in-line or on a stage](inline-or-staged.md).

To create an JAR file with your handler code, use the following steps.

1. In the directory for your project (for example, `hello-snowpark/`), create a subdirectory named `assembly/`.
2. In that directory, create an
   [assembly descriptor file](https://maven.apache.org/plugins/maven-assembly-plugin/assembly.html)
   that specifies that you want to include dependencies in your JAR file.

   For an example, see
   [jar-with-dependencies](https://maven.apache.org/plugins/maven-assembly-plugin/descriptor-refs.html#jar-with-dependencies).
3. If your project requires the Snowpark library, exclude its JAR file from the output archive because the library is already included on
   Snowflake.

   In the assembly descriptor, add a `<dependencySet>` element that excludes the Snowpark library from your JAR file.

   For example:

   ```xml
   <assembly xmlns="http://maven.apache.org/ASSEMBLY/2.1.0"
             xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
             xsi:schemaLocation="http://maven.apache.org/ASSEMBLY/2.1.0 http://maven.apache.org/xsd/assembly-2.1.0.xsd">
     <id>jar-with-dependencies</id>
     <formats>
        <format>jar</format>
     </formats>
     <includeBaseDirectory>false</includeBaseDirectory>
     <dependencySets>
       <dependencySet>
         <outputDirectory>/</outputDirectory>
         <useProjectArtifact>false</useProjectArtifact>
         <unpack>true</unpack>
         <scope>provided</scope>
         <excludes>
           <exclude>com.snowflake:snowpark</exclude>
         </excludes>
       </dependencySet>
     </dependencySets>
   </assembly>
   ```

   For information about the elements in an assembly descriptor, see
   [Assembly Descriptor Format](https://maven.apache.org/plugins/maven-assembly-plugin/assembly.html).
4. In your `pom.xml` file, under the `<project>` » `<build>` » `<plugins>`, add a `<plugin>`
   element for the Maven Assembly Plugin.

   In addition, under `<configuration>` » `<descriptors>`, add a `<descriptor>` that points to the assembly
   descriptor file that you created in the previous steps.

   For example:

   ```xml
   <project>
     [...]
     <build>
       [...]
       <plugins>
         <plugin>
           <artifactId>maven-assembly-plugin</artifactId>
           <version>3.3.0</version>
           <configuration>
             <descriptors>
               <descriptor>src/assembly/jar-with-dependencies.xml</descriptor>
             </descriptors>
           </configuration>
           [...]
         </plugin>
         [...]
       </plugins>
       [...]
     </build>
     [...]
   </project>
   ```

---
title: Packaging Scala Handler Code with sbt
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-build-sbt.md
section: Developer Guide
---

# Packaging Scala Handler Code with sbt

You can use the Scala build tool (sbt) to build and package your code as an assembly JAR. You can use the
[sbt-assembly plugin](https://github.com/sbt/sbt-assembly/blob/develop/README.md) to create a JAR file containing all of the
dependencies.

Once you have a JAR file, you can upload the file to a Snowflake stage, then reference it in the IMPORTS parameter in the
[CREATE FUNCTION](../sql-reference/sql/create-function.md) or [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) statement that you use to create the function or
procedure . For more information on uploading JAR files, refer to [Making dependencies available to your code](upload-dependencies.md). For more
information on choosing whether to have code inline or on a stage, refer to [Keeping handler code in-line or on a stage](inline-or-staged.md).

To create an assembly JAR file with your handler code, use the following steps.

1. In the directory containing your `build.sbt` file, in the `project/` subdirectory, create a file named `plugins.sbt`.

   For example, if the directory containing your `build.sbt` file is `hello-snowpark/`, create the file
   `hello-snowpark/project/plugins.sbt`:

   ```none
   hello-snowpark/
   |-- build.sbt
       |-- project/
           |-- plugins.sbt
   ```
2. In the `plugins.sbt` file, add the following line:

   ```scala
   addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "1.1.0")
   ```

   This adds the [sbt-assembly plugin](https://github.com/sbt/sbt-assembly/blob/develop/README.md) to your project.
3. If your project requires multiple versions of the same library (e.g. if your project depends on two libraries that require
   different versions of a third library), define a merge strategy in your `build.sbt` file to resolve the dependencies. See
   [Merge Strategy](https://github.com/sbt/sbt-assembly/blob/develop/README.md#merge-strategy) for details.
4. If your project requires the Snowpark library, refer to it in your `build.sbt` file with `libraryDependencies`, as shown below.
   Be sure to use at least the [minimum version required](stored-procedure/scala/procedure-scala-overview.md).

   Because the Snowpark library is included on Snowflake, exclude it from the JAR file by specifying that the dependency is
   `"provided"`.

   ```scala
   libraryDependencies += "com.snowflake" % "snowpark" % "1.1.0" % "provided"
   ```
5. Change to the directory for your project (e.g. `hello-snowpark`), and run the following command:

   ```bash
   sbt assembly
   ```

   > **Note:**
   >
   > If you encounter the error `Not a valid command: assembly`, `Not a valid project ID: assembly`, or
   > `Not a valid key: assembly`, make sure that the `plugins.sbt` file is in the subdirectory named `project/` (as
   > mentioned in step 1).

   This command creates a JAR file in the following location:

   ```none
   target/scala-<version>/<project-name>-assembly-1.0.jar
   ```

---
title: Passing references for objects and queries to stored procedures
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-calling-references.md
section: Developer Guide
---

# Passing references for objects and queries to stored procedures

In cases in which you call a stored procedure and pass an identifier for a table, view, function, or procedure to a stored
procedure, you might need to:

* Allow the stored procedure to perform SQL actions on the object, even if the stored procedure uses
  [owner’s rights](stored-procedures-rights.md).
* Allow the stored procedure to resolve the fully qualified name of the object, if the identifier is not qualified or is
  partially qualified.

In these cases, you can create and pass in a reference to the object (for example, the table, view, function, or procedure). A
reference is a unique identifier for an object. Within the stored procedure, when you execute SQL actions on a reference
to an object, the actions are performed using the active role or secondary roles of the user who created the reference. In
addition, if the object identifier is not fully qualified, the name of the object is resolved by using the current database and
schema when the reference was created (in other words, the database and schema of the user who created the reference).

Similarly, if you need to pass in a query to a stored procedure and
use that query in the FROM clause of a SELECT statement, you can create and pass
in a query reference. Within the stored procedure, the query is performed using the active role or secondary roles of the
user who created the query reference. As is the case with references to objects, if the object name in the query is not fully
qualified, the name of the object is resolved by using the database and schema that were in use when the query reference was
created.

This topic explains how to create and use references.

## Background: The problem with passing objects and queries to stored procedures

Suppose that an owner’s rights stored procedure is designed to insert rows into a table specified by an input argument. The
following are examples written in Snowflake Scripting and JavaScript:

Snowflake ScriptingJavaScript

```sqlexample
USE ROLE stored_proc_owner;

CREATE OR REPLACE PROCEDURE insert_row(table_identifier VARCHAR)
RETURNS TABLE()
LANGUAGE SQL
AS
$$
BEGIN
  LET stmt VARCHAR := 'INSERT INTO ' || table_identifier || ' VALUES (10)';
  LET res RESULTSET := (EXECUTE IMMEDIATE stmt);
  RETURN TABLE(res);
END;
$$;
```

```sqlexample-javascript
USE ROLE stored_proc_owner;

CREATE OR REPLACE PROCEDURE insert_row(table_identifier VARCHAR)
RETURNS FLOAT
LANGUAGE JAVASCRIPT
AS
$$
  let res = snowflake.execute({
    sqlText: "INSERT INTO IDENTIFIER(?) VALUES (10);",
    binds : [TABLE_IDENTIFIER]
  });
  res.next()
  return res.getColumnValue(1);
$$;
```

Suppose that you need to call this procedure for a table that is owned by a different role:

```sqlexample
USE ROLE table_owner;

CREATE OR REPLACE TABLE table_with_different_owner (x NUMBER) AS SELECT 42;
```

If you call the stored procedure and pass in the name of the table, the stored procedure will fail because the owner of the
stored procedure does not have sufficient privileges to access the table:

```sqlexample
USE ROLE table_owner;

CALL insert_row('table_with_different_owner');
```

```output
002003 (42S02): Uncaught exception of type 'STATEMENT_ERROR' on line 4 at position 25 : SQL compilation error:
Table 'TABLE_WITH_DIFFERENT_OWNER' does not exist or not authorized.
```

To enable the stored procedure to perform SQL actions on the table as the caller,
create a reference to the table and pass in that reference, rather than the table name.

## Creating a reference

To create the reference, call the [SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) function. For example:

```sqlexample
USE ROLE table_owner;

CALL insert_row(SYSTEM$REFERENCE('TABLE', 'table_with_different_owner', 'SESSION', 'INSERT'));
```

The example above passes in the following arguments to the SYSTEM$REFERENCE function:

* `'TABLE'` for the type of the object.
* `'table_with_different_owner'` for the name of the table.
* `'SESSION'` to indicate that the reference should be scoped to the session.
* `'INSERT'` as the
  privilege needed to perform the action on the object.

> **Note:**
>
> If you need to create a reference to an object that you don’t plan to modify (for example, if you are passing in a table that
> the stored procedure will query) and you want that reference to be valid for the scope of the call (rather than for the entire
> session), you can use the TABLE keyword instead of calling SYSTEM$REFERENCE. For details, see
> Using the TABLE keyword to create a reference to a table, view, or query.

## Specifying the scope of the reference

The reference is valid for either the duration of the call in which the reference is passed or the duration of the session. The
context in which the reference is created determines the scope:

* If you create and pass a reference to a stored procedure in a single statement, the reference has the same visibility as a
  variable declared in the outermost block of the stored procedure:

  ```sqlexample
  CALL select_from_table(SYSTEM$REFERENCE('TABLE', 'my_table');
  ```
* If you create a reference and assign the reference to a [session variable](../../sql-reference/session-variables.md), the
  reference is valid for the duration of the session, even if you unset the session variable:

  ```sqlexample
  SET tableRef = (SELECT SYSTEM$REFERENCE('TABLE', 'my_table'));

  SELECT * FROM IDENTIFIER($tableRef);
  ```

To specify that the scope of the reference should be the duration of the session, regardless of the context in which the
reference is created, pass `'SESSION'` for the third argument (`session_scope`) of the SYSTEM$REFERENCE function:

```sqlexample
CALL insert_row(SYSTEM$REFERENCE('TABLE', 'table_with_different_owner', 'SESSION', 'INSERT'));
```

## Conferring additional privileges in a reference

By default, a reference confers a subset of privileges, based on the type of the object being referenced. For example, a
reference to a table confers the SELECT privilege on that table for the active role or secondary role of the user who created the
reference. The default privileges depend on the object type. For the list of supported objects, privileges, and default privileges,
see [Supported object types and privileges for references](../../sql-reference/references.md).

To confer additional privileges, specify those privileges as additional arguments to the
[SYSTEM$REFERENCE](../../sql-reference/functions/system_reference.md) function. For example, to confer the INSERT, UPDATE, and TRUNCATE privileges on
a table:

```sqlexample
SELECT SYSTEM$REFERENCE('TABLE', 'table_with_different_owner', 'SESSION', 'INSERT', 'UPDATE', 'TRUNCATE');
```

Note that you cannot specify OWNERSHIP or ALL as privileges.

After a reference is created, changes to the privileges of the creator of the reference are reflected in the privileges
associated with the reference. For example, if the INSERT privilege is revoked for the creator of a reference, the INSERT
privilege is no longer associated with the reference.

## Using references to tables and views with masking policies

When you use a reference to a table or view that has a masking policy, the reference role is the invoker role (the role returned
by [INVOKER_ROLE](../../sql-reference/functions/invoker_role.md)), regardless of whether the reference is used in a query, stored procedure, or
user-defined function.

Using a reference does not change the current role (the role returned by [CURRENT_ROLE](../../sql-reference/functions/current_role.md)).

## Creating references in stored procedures

If you are writing an [owner’s rights stored procedure](stored-procedures-rights.md), do not create a
reference within the body of the stored procedure.

A reference created in an owner’s rights stored procedure uses the role of the owner of the stored procedure. References should
use the role of the user calling the stored procedure. For an owner’s rights stored procedure, the user calling the stored
procedure should create the reference and pass it in to the stored procedure.

If you are writing a caller’s rights stored procedure, you can create a reference within the body of the stored procedure.

## Using query references

If you need to pass in a query that is used in the FROM clause of a SELECT statement in a stored procedure, create and pass in a
query reference.

For example, suppose that a stored procedure passes in a SELECT statement that is intended to be used in the FROM clause of
another SELECT statement. In the example below, the query argument is intended to be a SELECT statement. This examples are in
Snowflake Scripting and JavaScript:

Snowflake ScriptingJavaScript

```sqlexample
USE ROLE stored_proc_owner;

CREATE OR REPLACE PROCEDURE get_num_results(query VARCHAR)
  RETURNS INTEGER
  LANGUAGE SQL
  AS
  DECLARE
    row_count INTEGER DEFAULT 0;
    stmt VARCHAR DEFAULT 'SELECT COUNT(*) FROM (' || query || ')';
    res RESULTSET DEFAULT (EXECUTE IMMEDIATE :stmt);
    cur CURSOR FOR res;
  BEGIN
    OPEN cur;
    FETCH cur INTO row_count;
    RETURN row_count;
  END;
```

```sqlexample-javascript
USE ROLE stored_proc_owner;

CREATE OR REPLACE PROCEDURE get_num_results(query VARCHAR)
RETURNS FLOAT
LANGUAGE JAVASCRIPT
AS
$$
  let res = snowflake.execute({
    sqlText: "SELECT COUNT(*) FROM (" + QUERY + ");",
  });
  res.next()
  return res.getColumnValue(1);
$$;
```

The stored procedure uses owner’s rights. If the stored procedure owner does not have the privileges to query the table in the
SELECT statement, the call to the stored procedure fails.

```sqlexample
USE ROLE table_owner;
CREATE OR REPLACE TABLE table_with_different_owner (x NUMBER) AS SELECT 42;

CALL get_num_results('SELECT x FROM table_with_different_owner');
```

```output
002003 (42S02): Uncaught exception of type 'STATEMENT_ERROR' on line 4 at position 29 : SQL compilation error:
Object 'TABLE_WITH_DIFFERENT_OWNER' does not exist or not authorized.
```

To enable the stored procedure to execute the query as the caller, create a query reference for the SELECT statement, and pass in
that reference, rather than the SELECT statement.

To create the query reference, you can call the [SYSTEM$QUERY_REFERENCE](../../sql-reference/functions/system_query_reference.md) function.

> **Note:**
>
> If you need to create a query reference that is valid for the scope of the call (rather than for the entire session), you can
> use the TABLE keyword instead of calling SYSTEM$QUERY_REFERENCE. For details, see Using the TABLE keyword to create a reference to a table, view, or query.

If you call the SYSTEM$QUERY_REFERENCE function, pass in:

* `'SELECT x FROM table_with_different_owner'` as the query.

  Note that if the SELECT statement contains any single quotes or other special characters (e.g. newlines), you must
  [escape those characters with backslashes](../../sql-reference/data-types-text.md).
* `true` to indicate that the query reference should be scoped to the session.

For example:

```sqlexample
USE ROLE table_owner;

CALL get_num_results(
  SYSTEM$QUERY_REFERENCE('SELECT x FROM table_with_different_owner', true)
);
```

```output
+-----------------+
| GET_NUM_RESULTS |
|-----------------|
|               1 |
+-----------------+
```

Within the stored procedure, you can add a query reference to the FROM clause of a query. For example:

```javascript
snowflake.execute({
  sqlText: "SELECT COUNT(*) FROM (" + QUERY + ");"
});
```

For details on this function, refer to [SYSTEM$QUERY_REFERENCE](../../sql-reference/functions/system_query_reference.md).

For the limitations with creating and using query references, refer to Current limitations.

## Using the TABLE keyword to create a reference to a table, view, or query

If you need to create reference to a table, view, or secure view that you are not modifying that the stored procedure should
query, and you want the reference to be valid for the scope of the call (rather than for the entire session), use the TABLE
keyword with the following syntax:

```sqlsyntax
TABLE( [[<database_name>.]<schema_name>.]<object_name> )
```

```sqlsyntax
TABLE("<object_name_that_requires_double_quotes>")
```

```sqlsyntax
TABLE(IDENTIFIER('string_literal_for_object_name'))
```

The TABLE keyword provides a simpler syntax for calling the SYSTEM$REFERENCE function for a table or view without having to
specify the argument for the object type. When you use the TABLE keyword, the reference just confers the SELECT privilege, and
the scope of the reference is the call (not the session).

The following examples call the stored procedure `my_procedure` and pass in references to tables and views:

```sqlexample
CALL my_procedure(TABLE(my_table));
```

```sqlexample
CALL my_procedure(TABLE(my_database.my_schema.my_view));
```

```sqlexample
CALL my_procedure(TABLE("My Table Name"));
```

```sqlexample
CALL my_procedure(TABLE(IDENTIFIER('my_view')));
```

> **Note:**
>
> You cannot use the TABLE keyword with the name of a function or procedure.

If you want to create a reference to a query, you can use the TABLE keyword as an alternative to calling the
SYSTEM$QUERY_REFERENCE function, if the reference just needs to be valid for the scope of the call (rather than for the entire
session). To use the TABLE keyword, use the following syntax:

```sqlsyntax
TABLE(<select_statement>)
```

For example:

```sqlexample
CALL my_procedure(TABLE(SELECT * FROM my_view));
```

```sqlexample
CALL my_procedure(TABLE(WITH c(s) as (SELECT $1 FROM VALUES (1), (2)) SELECT a, count(*) FROM T, C WHERE s = a GROUP BY a));
```

Note the following:

* You cannot use bind variables in the object name or query.
* The reference created by the TABLE keyword is valid for the duration of the call. You cannot specify a different scope for the
  reference.
* The reference has the
  [default privileges conferred for the type of object](../../sql-reference/references.md).

## Current limitations

Currently, references have the following limitations:

* [GET_DDL](../../sql-reference/functions/get_ddl.md) and [SYSTEM$GET_TAG](../../sql-reference/functions/system_get_tag.md) do not support references as input
  arguments.
* You can only create references to tables, views, functions, and procedures.
* In queries that contain references, plan cache and result caching are not used.
* For query references:

  + You can only create query references for SELECT statements that serve as inline views.
  + When you create a query reference, you cannot specify a bind variable or session variable.
  + In your stored procedure, you can only use a query reference in the FROM clause of a SELECT statement.

---
title: PHP PDO Driver for Snowflake
source: https://docs.snowflake.com/en/developer-guide/php-pdo/php-pdo-driver.md
section: Developer Guide
---

# PHP PDO Driver for Snowflake

> **Note:**
>
> This driver currently does not support GCP regional endpoints. Please ensure that any workloads using through this driver do not require support for regional endpoints on GCP. If you have questions about this, please contact Snowflake Support.

The PHP PDO driver for Snowflake provides an interface for developing PHP applications that can connect to Snowflake and perform
all standard operations.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

For instructions on installing and using the driver, see the GitHub
[PHP PDO Driver for Snowflake repository](https://github.com/snowflakedb/pdo_snowflake) in GitHub.

## Verifying the network connection to Snowflake with SnowCD

After configuring your driver, you can evaluate and troubleshoot your network connectivity to Snowflake using [SnowCD](../../user-guide/snowcd.md).

You can use SnowCD during the initial configuration process and on-demand at any time to evaluate and troubleshoot your network connection to Snowflake.

> **Important:**
>
> Beginning with Snowflake version 8.24, network administrators have the option to require multi-factor authentication (MFA) for all connections to Snowflake. If your administrator decides to enable this feature, you must configure your client or driver to use MFA when connecting to Snowflake. For more information, see the following resources:
>
> * [8.24 release notes](../../release-notes/2024/8_24.md)
> * [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md)
> * [Troubleshooting service users authentication issues with Snowflake MFA](https://community.snowflake.com/s/article/Troubleshooting-service-users-authentication-issues-with-Snowflake-MFA) Knowledge Base article

---
title: Profiling Python procedure handler code
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-profiler.md
section: Developer Guide
---

# Profiling Python procedure handler code

You can discover how much time or memory was spent executing your handler code by using the built-in code profiler. The profiler generates
information describing how much time or memory was spent executing each line of the procedure handler.

Using the profiler, you can generate reports that focus on one of the following at a time:

* **Amount of time per line**, in which the report shows the number of times a line was executed, how long the execution took, and so on.
* **Amount of memory usage per line**, in which the report shows the amount of memory consumed per line.

The profiler saves the generated report to a Snowflake [internal user stage](../../../user-guide/data-load-overview.md) you specify.
You can read the profiler output using the [GET_PYTHON_PROFILER_OUTPUT (SNOWFLAKE.CORE)](../../../sql-reference/functions/get_python_profiler_output.md)
system function.

> **Note:**
>
> Profiling introduces performance overhead on Python execution and can affect the performance of the query.
> It’s intended for development, testing, and troubleshooting and should not be enabled on continuous production workloads.

## Required privileges

Setting the session-level parameter does not trigger privilege check, but when a stored procedure is executed with [ACTIVE_PYTHON_PROFILER](../../../sql-reference/parameters.md)
session parameter to either LINE or MEMORY, Snowflake will check the following privileges.

* You must have read/write privileges on the profiling output stage.
* You must have OWNERSHIP privilege on the stored procedure.

## Limitations

* Only stored procedures are supported. UDFs support is not available yet.
* Recursive profiling is not supported. Only top-level functions of the specified modules are profiled. Functions defined inside
  functions are not.
* Support for profiling stored procedures created on the client-side via the `snowflake.snowpark` API is not supported (for example,
  stored procedures created from `Session.sproc.register`).
* Python functions running in parallel through `joblib` will not be profiled.
* System defined stored procedures cannot be profiled. They will produce no output.

## Usage

Once you’ve set up the profiler for use, you can use it simply by calling the stored procedure to generate profiler output. After the
procedure finishes executing, the profiler’s output is written to a file on the stage you specify. You can fetch the profiler output
using a system function.

Follow these steps to set up and use the profiler:

1. Specify the Snowflake stage where profile output should be written.

   Set the parameter PYTHON_PROFILER_TARGET_STAGE to the stage’s fully-qualified name.
2. Enable the profiler and specify what the profile should focus on.

   Set the ACTIVE_PYTHON_PROFILER session parameter.
3. Call the stored procedure.

   After the profiler is enabled, call your stored procedure.
4. View profiling output.

   At the end of execution, the profiling output will be uploaded as a file to the output stage with the naming pattern of `<query_id>_<sproc_name>.lprof`
   or `<query_id>_<sproc_name>.mprof`.

### Specify the Snowflake stage where profile output should be written

Before running the profiler, you must specify a stage to which its report will be saved. To specify the stage, set the
[PYTHON_PROFILER_TARGET_STAGE](../../../sql-reference/parameters.md) parameter to the stage’s fully-qualified name.

* Use a temporary stage to store output only for the duration of the session.
* Use a permanent stage to preserve the profiler output outside of the scope of a session.

Code in the following example creates a temporary `profiler_output` stage to receive the profiler output.

```sqlexample
USE DATABASE my_database;
USE SCHEMA my_schema;

CREATE TEMPORARY STAGE profiler_output;
ALTER SESSION SET PYTHON_PROFILER_TARGET_STAGE = "my_database.my_schema.profiler_output";
```

### Enable the profiler and specify what the profile should focus on

Set the [ACTIVE_PYTHON_PROFILER](../../../sql-reference/parameters.md) session parameter to a value specifying which kind of profile report you want to generate.

* To have the profile focus on line use activity, set the parameter to the `LINE` value (case insensitive), as shown below:

  ```sqlexample
  ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'LINE';
  ```
* To have the profile focus on memory use activity, set the parameter to the `MEMORY` value (case insensitive), as shown below:

  ```sqlexample
  ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'MEMORY';
  ```

### Call the stored procedure

After the profiler is enabled, call your stored procedure.

```sqlexample
CALL YOUR_STORED_PROCEDURE();
```

By default, the profiler will profile methods that are defined in the user’s module. You can register other modules to profile as well. For more information,
see Profile Additional Modules.

### View profiling output

At the end of execution, the profiling output will be uploaded as a file to the output stage with the naming pattern of `<query_id>_<sproc_name>.lprof`
or `<query_id>_<sproc_name>.mprof`.

The output can be accessed via a system function [GET_PYTHON_PROFILER_OUTPUT](../../../sql-reference/functions/get_python_profiler_output.md)
in the [SNOWFLAKE database](../../../sql-reference/snowflake-db.md).

The format of the system function’s signature is as follows:

```sqlexample
SELECT SNOWFLAKE.CORE.GET_PYTHON_PROFILER_OUTPUT(<query_id>);
```

Replace `<query_id>` with the query ID of the stored procedure query for which profiling was enabled.

You can also directly access the output file on the output stage. For more information, see [Viewing staged files](../../../user-guide/data-load-local-file-system-stage-ui.md).

> **Note:**
>
> The system function looks for profiling output files from the stage specified with the PYTHON_PROFILER_TARGET_STAGE parameter.
>
> The profiling output for child stored procedures is not appended into the parent procedure output.
> To view the output for a child stored procedure, call the system function on the child procedure query ID explicitly.

## Including additional modules for profiling

You can include for profiling modules that aren’t included by default. To include additional modules for profiling, set the
PYTHON_PROFILER_MODULES parameter to the names of modules you want to include.

By default, methods defined in the your module will be profiled. These methods include the following:

* The handler method
* Methods defined in the module
* Methods imported from packages or other modules.

In the example below, `handler`, `helper` and `some_method` will all be profiled by default.

```sqlexample-python
CREATE OR REPLACE PROCEDURE my_sproc()
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.10
  PACKAGES = ('snowflake-snowpark-python', 'other_package')
  HANDLER='handler'
AS $$
from other_package import some_method

def helper():
...

def handler(session):
...
$$;
```

### Including modules with the PYTHON_PROFILER_MODULES parameter

You can use the [PYTHON_PROFILER_MODULES](../../../sql-reference/parameters.md) parameter to include for profiling modules that wouldn’t be included by default. When
you include a module in this way, all functions used from that module will be included in the profiler output. By default, the
PYTHON_PROFILER_MODULES parameter value is an empty string (`''`), in which the profile would profile only inline handler code, if
any.

To include modules for profiling, specify their names as the parameter’s value in a comma-separated list, as illustrated below.

```sqlexample
ALTER SESSION SET PYTHON_PROFILER_MODULES = 'module_a, my_module';
```

## Profiling staged handler code

To profile handler code that is staged rather than inline — including helper functions — you must explicitly specify the staged handler
for profiling using the [PYTHON_PROFILER_MODULES](../../../sql-reference/parameters.md) parameter.

By default, the profiler doesn’t profile handler code that is [staged, rather than inline](../../inline-or-staged.md) —
that is, when the handler module is specified with the IMPORTS clause.

For example, by default this procedure will generate no detailed profiling output.

```sqlexample
CREATE OR REPLACE PROCEDURE test_udf_1()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES=('snowflake-snowpark-python')
  HANDLER = 'test_python_import_main.my_udf'
  IMPORTS = ('@stage1/test_python_import_main.py', '@stage2/test_python_import_module.py');
```

To include staged code for profiling, specify staged module names as the PYTHON_PROFILER_MODULES parameter’s value in a comma-separated
list, as illustrated below.

```sqlexample
ALTER SESSION SET PYTHON_PROFILER_MODULES = 'test_python_import_main, test_python_import_module';
```

## Example

Code in this example illustrates how to use the profiler to generate and retrieve a report of line usage.

```sqlexample-python
CREATE OR REPLACE PROCEDURE last_n_query_duration(last_n NUMBER, total NUMBER)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION=3.12
  PACKAGES=('snowflake-snowpark-python')
  HANDLER='main'
AS $$
import snowflake.snowpark.functions as funcs

def main(session, last_n, total):
  # create sample dataset to emulate id + elapsed time
  session.sql('''
  CREATE OR REPLACE TABLE sample_query_history (query_id INT, elapsed_time FLOAT)
  ''').collect()
  session.sql('''
  INSERT INTO sample_query_history
  SELECT
  seq8() AS query_id,
  uniform(0::float, 100::float, random()) as elapsed_time
  FROM table(generator(rowCount => {0}));'''.format(total)).collect()

  # get the mean of the last n query elapsed time
  df = session.table('sample_query_history').select(
    funcs.col('query_id'),
    funcs.col('elapsed_time')).limit(last_n)

  pandas_df = df.to_pandas()
  mean_time = pandas_df.loc[:, 'ELAPSED_TIME'].mean()
  del pandas_df
  return mean_time
$$;

CREATE TEMPORARY STAGE profiler_output;
ALTER SESSION SET PYTHON_PROFILER_TARGET_STAGE = "my_database.my_schema.profiler_output";
ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'LINE';

-- Sample 1 million from 10 million records
CALL last_n_query_duration(1000000, 10000000);

SELECT SNOWFLAKE.CORE.GET_PYTHON_PROFILER_OUTPUT(last_query_id());
```

The line profiler output will look like this:

```output
Handler Name: main
Python Runtime Version: 3.12
Modules Profiled: ['main_module']
Timer Unit: 0.001 s

Total Time: 8.96127 s
File: _udf_code.py
Function: main at line 4

Line #      Hits        Time  Per Hit   % Time  Line Contents
==============================================================
    4                                           def main(session, last_n, total):
    5                                               # create sample dataset to emulate id + elapsed time
    6         1        122.3    122.3      1.4      session.sql('''
    7                                                   CREATE OR REPLACE TABLE sample_query_history (query_id INT, elapsed_time FLOAT)''').collect()
    8         2       7248.4   3624.2     80.9      session.sql('''
    9                                               INSERT INTO sample_query_history
    10                                               SELECT
    11                                               seq8() AS query_id,
    12                                               uniform(0::float, 100::float, random()) as elapsed_time
    13         1          0.0      0.0      0.0      FROM table(generator(rowCount => {0}));'''.format(total)).collect()
    14
    15                                               # get the mean of the last n query elapsed time
    16         3         58.6     19.5      0.7      df = session.table('sample_query_history').select(
    17         1          0.0      0.0      0.0          funcs.col('query_id'),
    18         2          0.0      0.0      0.0          funcs.col('elapsed_time')).limit(last_n)
    19
    20         1       1528.4   1528.4     17.1      pandas_df = df.to_pandas()
    21         1          3.2      3.2      0.0      mean_time = pandas_df.loc[:, 'ELAPSED_TIME'].mean()
    22         1          0.3      0.3      0.0      del pandas_df
    23         1          0.0      0.0      0.0      return mean_time
```

The memory profiler output will look like this:

```output
ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'MEMORY';

Handler Name: main
Python Runtime Version: 3.12
Modules Profiled: ['main_module']
File: _udf_code.py
Function: main at line 4

Line #   Mem usage    Increment  Occurrences  Line Contents
=============================================================
    4    245.3 MiB    245.3 MiB           1   def main(session, last_n, total):
    5                                             # create sample dataset to emulate id + elapsed time
    6    245.8 MiB      0.5 MiB           1       session.sql('''
    7                                                 CREATE OR REPLACE TABLE sample_query_history (query_id INT, elapsed_time FLOAT)''').collect()
    8    245.8 MiB      0.0 MiB           2       session.sql('''
    9                                             INSERT INTO sample_query_history
    10                                             SELECT
    11                                             seq8() AS query_id,
    12                                             uniform(0::float, 100::float, random()) as elapsed_time
    13    245.8 MiB      0.0 MiB           1       FROM table(generator(rowCount => {0}));'''.format(total)).collect()
    14
    15                                             # get the mean of the last n query elapsed time
    16    245.8 MiB      0.0 MiB           3       df = session.table('sample_query_history').select(
    17    245.8 MiB      0.0 MiB           1           funcs.col('query_id'),
    18    245.8 MiB      0.0 MiB           2           funcs.col('elapsed_time')).limit(last_n)
    19
    20    327.9 MiB     82.1 MiB           1       pandas_df = df.to_pandas()
    21    328.9 MiB      1.0 MiB           1       mean_time = pandas_df.loc[:, 'ELAPSED_TIME'].mean()
    22    320.9 MiB     -8.0 MiB           1       del pandas_df
    23    320.9 MiB      0.0 MiB           1       return mean_time
```

---
title: Profiling Snowpark Python user-defined function handlers
source: https://docs.snowflake.com/en/developer-guide/udf/python/profiling-udf-handlers.md
section: Developer Guide
---

# Profiling Snowpark Python user-defined function handlers

You can discover how much time or memory was spent executing your handler code by using the built-in code profiler. The profiler generates
information describing how much time or memory was spent executing each line of the handler.

Using the profiler, you can generate reports that focus on one of the following at a time:

* **Amount of time per line**, which shows the number of times a line was executed, how long the execution took, and so on.
* **Amount of memory usage per line**, which shows the amount of memory consumed per line.

The profiler saves the generated report to an internal [event table](../../logging-tracing/event-table-columns.md). You can
retrieve the results by using a function designed to access the table.

> **Note:**
>
> Profiling introduces performance overhead to Python execution and can affect the performance of the query.
> It’s intended for development and testing and should not be enabled on continuous production workloads.

## Required privileges

To manage and use the profiler results data, which is stored in the `SNOWFLAKE.LOCAL.PROFILER_EVENTS_RAW` event table, you must
use the following roles:

| Application Role | Notes |
| --- | --- |
| PROFILER_EVENTS_ADMIN | Required to manage data in the event table where profiler data is stored, including to select, truncate, or drop records. |
| PROFILER_USER | Required to read profiler results from the event table. |

For more information on granting an application role, see [GRANT APPLICATION ROLE](../../../sql-reference/sql/grant-application-role.md). The following example uses the `ACCOUNTADMIN` role to grant the application role `PROFILER_USER` to a user.

```sqlexample
USE ROLE ACCOUNTADMIN;
CREATE ROLE PROFILER_ROLE;
GRANT APPLICATION ROLE SNOWFLAKE.PROFILER_USER TO ROLE PROFILER_ROLE;
GRANT ROLE PROFILER_ROLE TO USER some_user;
```

## Limitations

* It can take 15-20 seconds after query execution for results from the profiler to be ready.
* Profiler output is not saved if the UDF execution fails.
* Recursive profiling is not supported. Only top-level functions of the specified modules are profiled. Functions defined inside
  functions are not profiled.
* Profiling third party modules is not supported.
* Support for profiling UDFs created on the client side via the `snowflake.snowpark` API is not available.
* Python functions running in parallel through `joblib` are not profiled.
* UDTFs are not supported.
* Time is measured in wall-clock time, not CPU time.

## Usage

Once you’ve set up the profiler, you can use it simply by executing the UDF to generate profiler output. After the
UDF finishes executing, the profiler’s output is written to an internal event table. You can
fetch the profiler output using a system function.

Follow these steps in your code to set up and use the profiler:

1. Enable the profiler and set what the profile report should focus on.
2. Execute the UDF.
3. View profiling output.

## Enable the profiler by specifying its focus

To enable the profiler set one of the following session parameters:

```sqlexample
-- To enable profiling that focuses on activity per line
ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'LINE';

-- To enable profiling that focuses on memory usage
ALTER SESSION SET ACTIVE_PYTHON_PROFILER = 'MEMORY';
```

> **Note:**
>
> Profiling introduces performance overhead on Python execution. You should profile your code during development and testing.
> Do not enable profiling on continuous production workloads.

## Specifying the code to be profiled

By default, the profiler profiles methods defined inline with the UDF declaration. In other words, the profiler will profile all the
methods defined in the handler.

For the following UDF example, the profiler will profile the `handler` method and `helper` method.

```sqlexample-python
CREATE OR REPLACE function my_udf()
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.11
  PACKAGES = ('other_package')
  HANDLER = 'handler'
  AS $$
from other_package import some_method

def helper():
...

def handler():
...
$$;
```

### Specify external code to profile

You can specify that the profiler should profile handler code defined outside the UDF declaration, such as code imported from a stage.

To specify external code for profiling, set the PYTHON_UDF_PROFILER_MODULES session parameter’s value to a comma-separated list of the
modules containing the code.

```sqlexample
ALTER SESSION SET PYTHON_UDF_PROFILER_MODULES = 'test_python_import_main, test_python_import_module';
```

The profiler will include the specified modules in its profiling output when you execute a UDF that imports them.

Code in the following example shows a UDF that imports code from the specified modules:

```sqlexample
CREATE OR REPLACE function test_udf_1()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.11
  HANDLER = 'test_python_import_main.my_udf'
  IMPORTS = ('@stage1/test_python_import_main.py', '@stage2/test_python_import_module.py');
```

## Execute the user-defined function

After you’ve enabled the profiler, execute your user-defined function (UDF) to begin profiling.

By default, the profiler profiles methods that are defined in your module. For information on registering other modules from imported
files to profile, see Specifying the code to be profiled for more information.

```sqlexample
SELECT return_mean(my_col) FROM MY_TABLE;
```

## View profiling output

* To view profiling output, query an internal [event table](../../logging-tracing/event-table-columns.md).

Profiling results are typically available in the event table 15-20 seconds after the UDF execution finishes. You can access the output by using
the table system function, GET_PYTHON_UDF_PROFILER_OUTPUT.

Code in the following example shows a query of the event table for profiler results. The `query_id` specified as an argument is the
query ID of the UDF query for which profiling was enabled.

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_PYTHON_UDF_PROFILER_OUTPUT(<query_id>));
```

### Profile results

When you view profiler results, you’ll see a report that differs depending on whether you specified profiling for a line report or
a memory report.

The memory profiler output will look like this:

```none
Handler Name: return_mean
Python Runtime Version: 3.12
Modules Profiled: ['return_mean_module']
Extension Function ID: 1

File: _udf_code.py
Function: return_mean at line 2

Line #    Mem usage    Increment  Occurrences    Line Contents
==============================================================
     2    107.0 MiB    107.0 MiB           1    def return_mean():
     3    144.6 MiB     37.6 MiB           1        import numpy as np
     4
     5                                              # Generate a numpy array with 10 random integers between 1 and 100
     6                                              # np.random.randint(low, high, size)
     7    147.3 MiB      2.7 MiB           1        random_array = np.random.randint(1, 101, 10)
     8
     9                                              # Use a numpy function to calculate the mean
    10    147.3 MiB      0.0 MiB           1        mean_value = np.mean(random_array)
    11
    12    147.3 MiB      0.0 MiB           1        count = 0
    13    147.3 MiB      0.0 MiB         101        for i in range(100):
    14    147.3 MiB      0.0 MiB         100            count = count + 1
    15
    16    147.3 MiB      0.0 MiB           1        return mean_value
```

The line profiler output will look like this:

```none
Handler Name: return_mean
Python Runtime Version: 3.12
Extension Function ID: 1
Modules Profiled: ['return_mean_module']
Timer Unit: 0.001 s

Total Time: 0.229063 s
File: _udf_code.py
Function: return_mean at line 2

Line #      Hits         Time  Per Hit   % Time  Line Contents
==============================================================
     2                                           def return_mean():
     3         1        206.1    206.1     90.0      import numpy as np
     4
     5                                               # Generate a numpy array with 10 random integers between 1 and 100
     6                                               # np.random.randint(low, high, size)
     7         1         22.8     22.8     10.0      random_array = np.random.randint(1, 101, 10)
     8
     9                                               # Use a numpy function to calculate the mean
    10         1          0.1      0.1      0.0      mean_value = np.mean(random_array)
    11
    12         1          0.0      0.0      0.0      count = 0
    13       101          0.0      0.0      0.0      for i in range(100):
    14       100          0.0      0.0      0.0          count = count + 1
    15
    16         1          0.0      0.0      0.0      return mean_value
```

---
title: Protecting Sensitive Information with Secure UDFs and Stored Procedures
source: https://docs.snowflake.com/en/developer-guide/secure-udf-procedure.md
section: Developer Guide
---

# Protecting Sensitive Information with Secure UDFs and Stored Procedures

To help ensure that sensitive information is concealed from users who should not have access to it, you can use the SECURE keyword when
creating a user-defined function (UDF) and stored procedure.

This topic describes how you can:

* Limit the visibility of UDF or stored procedure definitions.
* Limit the visibility of sensitive data that can be exposed by UDFs.

> **Note:**
>
> In some cases, error messages related to secure functions might be redacted. For more information, see
> [Secure objects: Redaction of information in error messages](../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

## Limiting the Visibility of a UDF or Procedure Definition

For a UDF or stored procedure, you can prevent users from seeing definition specifics. When you specify that the UDF or procedure is
secure, these details are visible only to authorized users – in other words, to users who are granted a role that owns the function.

For example, for a secure function or procedure, information omitted for unauthorized users includes its:

* Body (the handler code that comprises its logic)
* List of imports
* Handler name
* Packages list

Unauthorized users will still be able to see information that includes its:

* Parameter types
* Return type
* Handler language
* Null handling
* Volatility

For more on granting roles, see [GRANT ROLE](../sql-reference/sql/grant-role.md) and [Overview of Access Control](../user-guide/security-access-control-overview.md).

With a function or procedure that is secure, an unauthorized user – one who has *not* been granted a role that owns the function or
procedure – may not view the function or procedure definition when using any of the following:

* For UDFs

  + [SHOW FUNCTIONS](../sql-reference/sql/show-functions.md) and [SHOW USER FUNCTIONS](../sql-reference/sql/show-user-functions.md) commands
  + [DESCRIBE FUNCTION](../sql-reference/sql/desc-function.md) command
  + [FUNCTIONS](../sql-reference/info-schema/functions.md) Information Schema view
* For procedures

  + [SHOW PROCEDURES](../sql-reference/sql/show-procedures.md) command
  + [DESCRIBE PROCEDURE](../sql-reference/sql/desc-procedure.md) command
  + [PROCEDURES](../sql-reference/info-schema/procedures.md) Information Schema view
* For both

  + [Query Profile](../user-guide/ui-snowsight-activity.md) (in the web interface)
  + [GET_DDL](../sql-reference/functions/get_ddl.md) utility function

Note that functions and procedures whose handlers are written in Java, Python, or Scala allow the IMPORTS clause, which imports code or
data files from Snowflake stages. Using the SECURE keyword does *not* have any effect on the visibility of or access to those stages.

In addition, for functions and procedures whose handlers are written in Java, Python, or Scala, making the functions and procedures secure
ensures that they are executed in separate sandboxes, such that no resources are shared between them.

For more information on using the SECURE keyword, see Creating a Secure UDF or Stored Procedure.

## Limiting the Visibility of a UDF’s Sensitive Data

In UDFs, you can prevent users from seeing data that should be hidden by making the UDF secure. You do this by using the SECURE keyword
when creating or altering the UDF.

Define a UDF as secure when it is specifically designated for data privacy (in other words, to limit access to sensitive data that should
not be exposed to all users of the underlying tables).

You should not make a UDF secure when it is defined for query convenience, such as when it is created for simplifying querying data
for which users do not need to understand the underlying data representation. This is because the Snowflake query optimizer, when evaluating
secure UDFs, bypasses the optimizations used for regular UDFs. This might reduce query performance for secure UDFs.

To limit visibility into a UDF’s underlying data, use the SECURE keyword when creating or altering it. For more information, see
Creating a Secure UDF or Stored Procedure.

### How Data Can Become Visible

Some of the internal optimizations for UDFs, including an optimization called [pushdown](pushdown-optimization.md), require
access to the underlying data in the base tables. This access might allow data that is hidden from users of the UDF to be exposed
indirectly through programmatic methods. In certain situations, a user might be able to deduce information about rows that the user cannot
see directly.

Secure UDFs do not use these optimizations, ensuring that users do not have even indirect access to the underlying data. For more
information on pushdown, see [Pushdown Optimization and Data Visibility](pushdown-optimization.md).

> **Tip:**
>
> When deciding whether to use a secure UDF, you should consider the purpose of the UDF and weigh the trade-off between data privacy/security
> and query performance.
>
> Also, if your data is sensitive enough that you decide that accesses via one type of object (such as UDFs) should be secure, then you
> should strongly consider ensuring that accesses via other types of objects (such as views) are also secure.
>
> For example, if you only allow secure UDFs to access a given table, then any views that you allow to access the same table probably also should
> be secure.

### How Secure UDFs Protect Data

As described in [Pushdown Optimization and Data Visibility](pushdown-optimization.md), the pushdown optimization can re-order the filters that determine how a
query is processed. If the optimization re-orders the filters in a way that allows a general filter to run before the appropriate filter(s)
used to secure data are applied, underlying details could be exposed. Therefore, the solution is to prevent the optimizer from pushing down
certain types of filters (more generally, to prevent the optimizer from using certain types of optimizations, including but not limited to
filter pushdown) if those optimizations are not safe.

Declaring a UDF as “secure” tells the optimizer to not push down certain filters (more generally, not to use certain optimizations). However,
preventing certain types of optimizations can impact performance.

### Best Practices for Protecting Access to Sensitive Data

Secure UDFs prevent users from possibly being exposed to data from rows of tables that are filtered by the function. However,
there are still ways that a data owner might inadvertently expose information about the underlying data if UDFs are not
constructed carefully. This section describes some potential pitfalls to avoid.

#### Avoid Exposing Sequence-Generated Column Values

A common practice for generating surrogate keys is to use a sequence or auto-increment column. If these keys are exposed to users
who do not have access to all of the underlying data, then a user might be able to guess details of the underlying data distribution.

For example, suppose that we have a function `get_widgets_function()` that exposes the ID column. If ID is generated from a sequence,
then a user of `get_widgets_function()` could deduce the total number of widgets created between the creation timestamps of two
widgets that the user has access to. Consider the following query and result:

```sqlexample
SELECT * FROM TABLE(get_widgets_function()) ORDER BY created_on;

------+-----------------------+-------+-------+-------------------------------+
  ID  |         NAME          | COLOR | PRICE |          CREATED_ON           |
------+-----------------------+-------+-------+-------------------------------+
...
 315  | Small round widget    | Red   | 1     | 2017-01-07 15:22:14.810 -0700 |
 1455 | Small cylinder widget | Blue  | 2     | 2017-01-15 03:00:12.106 -0700 |
...
```

Based on the result, the user might suspect that 1139 widgets (`1455 - 315`) were created between January 7 and January 15. If this
information is too sensitive to expose to users of a function, you can use any of the following alternatives:

* Do not expose the sequence-generated column as part of the function.
* Use randomized identifiers (such as those generated by [UUID_STRING](../sql-reference/functions/uuid_string.md)) instead of sequence-generated values.
* Programmatically obfuscate the identifiers.

#### Limit Visibility into Scanned Data Size

For queries containing secure functions, Snowflake does not expose the amount of data scanned (either in terms of bytes or micro-partitions)
or the total amount of data. This is to protect the information from users who have access to only a subset of the data.

However, users might still be able to make observations about the quantity of underlying data based on performance characteristics of
queries. For example, a query that runs twice as long might process twice as much data. While any such observations are approximate at best,
in some cases it might be undesirable for even this level of information to be exposed.

In such cases, you should materialize data per user/role instead of exposing functions on the base data to users. In the case of the
`widgets` table described in this topic, a table would be created for each role that has access to widgets. Each of those tables would
contains only the widgets accessible by that role, and a role would be granted access to its table. This is much more cumbersome than using
a single function, but for extremely high-security situations, this might be warranted.

#### Authorize Base Table Access for Users from a Specific Account

When using secure UDFs with [data sharing](../user-guide/data-sharing-gs.md), the [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md) function can
be used to authorize users from a specific account to access rows in a base table.

> > **Note:**
> >
> > When using the [CURRENT_ROLE](../sql-reference/functions/current_role.md) and [CURRENT_USER](../sql-reference/functions/current_user.md) functions with secure
> > UDFs that will be shared with Snowflake accounts, Snowflake returns a NULL value for these functions. The reason is that the owner
> > of the data being shared does not typically control the users or roles in the account with which the UDF is being shared.

#### Secure UDFs and Masking Policies

If using a UDF, whether or not the UDF is a secure UDF, in a [masking policy](../sql-reference/sql/create-masking-policy.md), ensure the
data type of the column, UDF, and masking policy match.

For more information, see [User-defined functions in a masking policy](../user-guide/security-column-intro.md).

## Creating a Secure UDF or Stored Procedure

You can make a UDF or procedure secure by using the SECURE keyword when creating or altering it.

To create or convert a UDF so that it’s secure, specify SECURE when using the following:

* [CREATE FUNCTION](../sql-reference/sql/create-function.md)
* [ALTER FUNCTION](../sql-reference/sql/alter-function.md)

To create a procedure so that it’s secure, specify SECURE when using the following:

* [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md)

## Determining if a UDF or Procedure is Secure

You can determine if a function or procedure is secure by using the SHOW FUNCTIONS or SHOW PROCEDURES command. The commands return a
table with an IS_SECURE column whose value is `Y` for secure and `N` for not secure.

Code in the following example returns a table of properties for a `MYFUNCTION` function.

```sqlexample
SHOW FUNCTIONS LIKE 'MYFUNCTION';
```

## Viewing Secure Function Details in Query Profile

The internals of a secure function are not exposed in [Query Profile](../user-guide/ui-snowsight-activity.md) (in the web interface). This is
the case even for the owner of the secure function, since non-owners might have access to an owner’s Query Profile.

---
title: Pushdown Optimization and Data Visibility
source: https://docs.snowflake.com/en/developer-guide/pushdown-optimization.md
section: Developer Guide
---

# Pushdown Optimization and Data Visibility

Through the pushdown optimization, Snowflake helps make query processing faster and more efficient by filtering rows. However, due to the
way filters can be reordered, pushdown can expose data that you might not want to be visible.

This topic describes pushdown and how it can expose sensitive data. To prevent sensitive data from becoming visible, you can make a
UDF secure as described in [Protecting Sensitive Information with Secure UDFs and Stored Procedures](secure-udf-procedure.md).

## What is Pushdown?

Pushdown improves performance by filtering out unneeded rows as early as possible during query processing. Pushdown can also reduce memory
consumption. However, pushdown can allow confidential data to be exposed indirectly.

Consider the following query:

```sqlexample
SELECT col1
  FROM tab1
  WHERE location = 'New York';
```

One approach to processing the query is:

1. Read all rows from the table into memory (i.e. execute the FROM clause).
2. Scan the rows in memory, filtering out any rows that do not match `New York` (i.e. execute the WHERE clause).
3. Select `col1` from the rows still remaining in memory (i.e. execute the SELECT list).

You can think of this as a “load first, filter later” strategy, which is straight-forward, but inefficient.

It’s usually more efficient to filter as early as possible. Early filtering is called “pushing the filter down deeper into the query plan”,
or simply “pushdown”.

In example query above, it would be more efficient to tell the table-scanning code not to load records that don’t match the WHERE clause. This
doesn’t save filtering time (every row’s location must still be read once), but it can save considerable memory and reduce subsequent processing
time because there are fewer rows to process.

In some cases, you can process the data even more efficiently. For example, suppose that the data is partitioned by state (i.e. all the data
for New York is in one micro-partition, all the data for Florida is in another micro-partition, and so on). In this scenario:

* Snowflake does not need to store all the rows in memory.
* Snowflake does not need to read all the rows.

We loosely define this as another form of “pushdown”.

The principle of “pushing down the filters” applies to a wide range of queries. Often, the filter that is the most selective (screens out
the most data) is pushed deepest (executed earliest) to reduce the work that the remaining query must do.

Pushdown can be combined with other techniques, such as clustering (sorting/ordering the data), to reduce the amount of irrelevant data that
needs to be read, loaded, and processed.

## Example of Indirect Data Exposure Through Pushdown

The following example shows one way that pushdown could indirectly result in the exposure of underlying details about a query. This example
focuses on views, but the same principles apply to UDFs.

Suppose there is a table that stores information about patients:

> ```sqlexample
> CREATE TABLE patients
>   (patient_ID INTEGER,
>    category VARCHAR,      -- 'PhysicalHealth' or 'MentalHealth'
>    diagnosis VARCHAR
>    );
>
> INSERT INTO patients (patient_ID, category, diagnosis) VALUES
>   (1, 'MentalHealth', 'paranoia'),
>   (2, 'PhysicalHealth', 'lung cancer');
> ```

There are two views, one of which shows mental health information and one of which shows physical health information:

> ```sqlexample
> CREATE VIEW mental_health_view AS
>   SELECT * FROM patients WHERE category = 'MentalHealth';
>
> CREATE VIEW physical_health_view AS
>   SELECT * FROM patients WHERE category = 'PhysicalHealth';
> ```

Most users don’t have direct access to the table. Instead, users are assigned one of two roles:

* `MentalHealth`, which has privileges to read from `mental_health_view`, or
* `PhysicalHealth`, which has privileges to read from `physical_health_view`.

Now suppose that a doctor with privileges only on physical health data wants to know whether there are currently any mental health patients
in the table. The doctor can construct a query similar to the following:

> ```sqlexample
> SELECT * FROM physical_health_view
>   WHERE 1/IFF(category = 'MentalHealth', 0, 1) = 1;
> ```

This query is equivalent to:

> ```sqlexample
> SELECT * FROM patients
>   WHERE
>     category = 'PhysicalHealth' AND
>     1/IFF(category = 'MentalHealth', 0, 1) = 1;
> ```

There are (at least) two methods that Snowflake can use to process this query.

* Method 1:

  1. Read all the rows in the patients table.
  2. Apply the view’s security filter (i.e. filter out the rows for which the category is not `PhysicalHealth`).
  3. Apply the WHERE clause in the query (i.e. filter based on `WHERE 1/IFF(category = 'MentalHealth', 0, 1) = 1`).
* Method 2 changes the order of the filters, so that the query executes as follows:

  1. Read all the rows in the patients table.
  2. Apply the WHERE clause in the query (i.e. filter based on `WHERE 1/IFF(category = 'MentalHealth', 0, 1) = 1`).
  3. Apply the view’s security filter (i.e. filter out the rows for which the category is not `PhysicalHealth`).

Logically, these two sequences seem equivalent; they return the same set of rows. However, depending on how selective these two filters are,
one order of processing might be faster, and Snowflake’s query planner might choose the plan that executes faster.

Suppose that the optimizer chooses the second plan, in which the clause `WHERE 1/IFF(category = 'MentalHealth', 0, 1) = 1` is executed
before the security filter. If the patients table has any rows in which `category = 'MentalHealth'`, then the `IFF` function returns
0 for that row, and the clause effectively becomes `WHERE 1/0 = 1`, so the statement causes a divide-by-zero error. The user with
`physical_health_view` privileges does not see any rows for people with mental health issues, but can deduce that at least one person in the
mental health category exists.

Note that this technique does not always result in exposing underlying details; it relies heavily on the choices that the query planner makes,
as well as on how the views (or UDFs) are written. But this example shows that a user can deduce information about rows that the user cannot
view directly.

---
title: PySpark APIs supported for Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-supported-apis.md
section: Developer Guide
---

# PySpark APIs supported for Snowpark Connect for Spark

Snowpark Connect for Spark supports PySpark APIs as described in this topic.

Snowpark Connect for Spark provides compatibility with PySpark’s 3.5.3 Spark Connect API, allowing you to run Spark workloads on Snowflake.
Snowpark Connect for Spark compatibility is defined by its execution behavior when running a Spark application that uses the Pyspark 3.5.3
Spark Connect API. This guide details which APIs are supported and their compatibility levels.

## Compatibility level definitions

Full compatibility APIs
:   APIs with full compatibility behave identically to native PySpark. You can use these APIs with confidence that results will match exactly.

High compatibility APIs
:   APIs with high compatibility work correctly but might have minor differences:

    * Error message formatting might differ.
    * Output display format might vary (such as decimal precision, column name casing).
    * Edge cases might produce slightly different results.

Partial compatibility APIs
:   APIs with partial compatibility are functional but have notable limitations:

    * Only a subset of functionality might be available.
    * Behavior might differ from PySpark in specific scenarios.
    * Additional configuration might be required.
    * Performance characteristics might differ.

Unsupported APIs
:   APIs that are not currently implemented or cannot be supported on Snowflake.

## Python APIs

### DataFrame APIs

The core DataFrame API coverage.

#### Full compatibility APIs

* `cache`
* `coalesce`
* `collect`
* `count`
* `crossJoin`
* `dropDuplicates`
* `drop_duplicates`
* `dropna`
* `fillna`
* `first`
* `head`
* `isEmpty`
* `join`
* `limit`
* `melt`
* `offset`
* `persist`
* `repartitionByRange`
* `replace`
* `select`
* `show`
* `tail`
* `take`
* `toDF`
* `toLocalIterator`
* `toPandas`
* `unionAll`
* `unpersist`
* `unpivot`
* `where`
* `withColumnsRenamed`
* `toLocalIterator`
* `toPandas`
* `unionAll`
* `unpersist`
* `unpivot`
* `where`
* `withColumnsRenamed`

#### High compatibility APIs

* `agg`
* `colRegex`
* `corr`
* `cov`
* `crosstab`
* `cube`
* `describe`
* `distinct`
* `drop`
* `exceptAll`
* `groupBy`
* `groupby`
* `intersect`
* `intersectAll`
* `isLocal`
* `mapInPandas`
* `orderBy`
* `rollup`
* `sort`
* `union`
* `unionByName`
* `withColumn`

#### Notes

* `orderBy` / `sort`: Column ordering inferred from the last DataFrame in the chain.
* `union` / `unionByName`: Type widening behavior might differ slightly.
* `describe`: Statistical output format might vary.

#### Partial compatibility APIs

* `alias`
* `approxQuantile`
* `createGlobalTempView`
* `createOrReplaceGlobalTempView`
* `createOrReplaceTempView`
* `createTempView`
* `explain`
* `filter`
* `freqItems`
* `hint`
* `inputFiles`
* `printSchema`
* `randomSplit`
* `repartition`
* `sameSemantics`
* `sample`
* `sampleBy`
* `selectExpr`
* `semanticHash`
* `sortWithinPartitions`
* `subtract`
* `summary`
* `transform`
* `withColumns`
* `withMetadata`

#### Notes

* `explain`: Query plan format differs from Spark.
* `repartition`: Partition count might not be exact.
* `sample`: Random sampling implementation differs.
* `createTempView`: View lifecycle might differ.

#### Unsupported APIs

* `checkSameSparkSession`
* `dropDuplicatesWithinWatermark`
* `observe`
* `pandas_api`
* `registerTempTable`
* `to_pandas_on_spark`
* `withWatermark`

### Column APIs

Coverage for column operations.

#### Full compatibility APIs

* `asc`
* `between`
* `contains`
* `desc`
* `eqNullSafe`
* `getItem`
* `isNull`
* `isin`
* `like`
* `otherwise`
* `startswith`
* `substr`
* `when`

#### High compatibility APIs

* `alias`
* `asc_nulls_first`
* `asc_nulls_last`
* `astype`
* `bitwiseAND`
* `bitwiseOR`
* `bitwiseXOR`
* `cast`
* `desc_nulls_first`
* `desc_nulls_last`
* `endswith`
* `isNotNull`

##### Notes

* `cast`: Some invalid casts return NULL in Spark but error in Snowpark.
* `alias`: Struct field display format might differ.

#### Partial compatibility APIs

* `dropFields`
* `ilike`
* `over`
* `rlike`
* `withField`

##### Notes

* `over`: Window frame specifications might have subtle differences.
* `rlike`: Regex syntax follows Snowflake conventions.

### SparkSession APIs

#### Full compatibility APIs

* `range`
* `sql`
* `table`

#### High compatibility APIs

* `createDataFrame`

##### Notes

Schema inference might produce different types (such as `NUMBER(38,0)` vs `LONG`).

#### Partial compatibility APIs

* `addArtifact`
* `addArtifacts`
* `addTag`
* `clearTags`
* `getTags`
* `interruptAll`
* `interruptOperation`
* `interruptTag`
* `removeTag`

##### Notes

* Tags are mapped to Snowflake query tags.
* Interrupt operations use Snowflake query IDs instead of operation IDs.

#### Unsupported APIs

* `copyFromLocalToFs`
* `stop`

### GroupedData APIs

#### Full compatibility APIs

* `agg`
* `mean`
* `pivot`

#### High compatibility APIs

* `agg`
* `mean`
* `pivot`

#### Partial compatibility APIs

* `apply`
* `avg`
* `sum`

#### Unsupported APIs

* `applyInPandasWithState`
* `cogroup`

### DataFrameReader APIs

#### Full compatibility APIs

* `table`

#### High compatibility APIs

* `csv`

#### Partial compatibility APIs

* `json`
* `load`
* `parquet`
* `jdbc`

##### Notes

* File paths use Snowflake stages or cloud storage (S3, GCS, Azure).
* Schema inference might differ from native Spark.
* Some format-specific options might not be supported.

#### Unsupported APIs

* `orc`

### DataFrameWriter APIs

#### Full compatibility APIs

* `mode`
* `saveAsTable`
* `text`

#### Partial compatibility APIs

* `csv`
* `json`
* `options`
* `parquet`

##### Notes

* Writes go to Snowflake stages or cloud storage.
* Partitioning behavior might differ.

#### Unsupported APIs

* `bucketBy`
* `insertInto`
* `jdbc`
* `orc`
* `sortBy`

### DataFrameWriterV2 APIs

Coverage for the newer DataFrameWriterV2 API.

#### Full compatibility APIs

* `replace`

#### Partial compatibility APIs

* `append`
* `create`
* `createOrReplace`
* `option`
* `options`
* `partitionedBy`
* `tableProperty`
* `using`

### Catalog APIs

#### Full compatibility APIs

* `cacheTable`
* `clearCache`
* `dropGlobalTempView`
* `dropTempView`
* `isCached`
* `refreshByPath`
* `refreshTable`
* `uncacheTable`

#### High compatibility APIs

* `currentCatalog`
* `listCatalogs`
* `listColumns`
* `recoverPartitions`
* `setCurrentCatalog`

##### Notes

* `listColumns`: Column names are uppercase, types are Snowflake-specific.
* Error messages might differ in format.

#### Unsupported APIs

* `createExternalTable`
* `createTable`
* `functionExists`
* `getFunction`
* `listFunctions`
* `registerFunction`

### Window & WindowSpec APIs

Coverage for window functions.

#### Window (all D0) APIs

* `partitionBy`
* `orderBy`
* `rangeBetween`
* `rowsBetween`
* `unboundedPreceding`
* `unboundedFollowing`
* `currentRow`

#### WindowSpec(all D0) APIs

* `partitionBy`
* `orderBy`
* `rangeBetween`
* `rowsBetween`

## Java/Scala APIs

Python functions listed above are also supported in the Java/Scala client. The difference lies in the [Dataset API](https://spark.apache.org/docs/3.5.6/api/scala/org/apache/spark/sql/Dataset.html) which is Java/Scala specific. This section outlines supported and unsupported Dataset APIs.

There is only one JVM client so there is no significant support difference between using Java or Scala, hence they will be described together.

### Supported APIs

All of the Dataset APIs included in the [Spark documentation](https://spark.apache.org/docs/3.5.6/api/scala/org/apache/spark/sql/Dataset.html) excluding the APIs in the following list, are supported.

#### Unsupported APIs

* `Rdd`
* `javaRdd`
* `toJavaRdd`
* `Checkpoint`
* `localCheckpoint`
* `randomSplit`
* `randomSplitAsList`
* `toJSON`
* `toLocalIterator`
* `isEmpty`
* `sortWithinPartitions`
* `writeStream`
* `withWatermark`
* `dropDuplicatesWithinWatermark`
* `as_of_join`
* `with_relations`
* `reduce`

### Known limitations

* Only Java 11 and 17 are supported
* Only Scala 2.12 and 2.13 are supported
* Java/Scala UDTF and UDAFs are not supported
* Use of `Interval` types inside UDxFs is not supported

---
title: Python Connector API
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-api.md
section: Developer Guide
---

# Python Connector API

The Snowflake Connector for Python implements the Python Database API v2.0 specification
(PEP-249). This topic covers the standard
API and the Snowflake-specific extensions.

For more information, see the [PEP-249](https://www.python.org/dev/peps/pep-0249/) documentation.

## Module: `snowflake.connector`

The main module is `snowflake.connector`, which creates a `Connection` object and provides
`Error` classes.

### Constants

apilevel
:   String constant stating the supported API level. The connector supports API
    `"2.0"`.

threadsafety
:   Integer constant stating the level of thread safety the interface supports. The
    Snowflake Connector for Python supports level `2`, which states that threads can share
    the module and connections.

paramstyle
:   String constant stating the type of parameter marker formatting expected
    by the interface. The connector supports the `"pyformat"` type by default, which applies to
    Python extended format codes (e.g. `...WHERE name=%s` or `...WHERE name=%(name)s`).
    `Connection.connect` can override `paramstyle` to change the bind variable formats to
    `"qmark"` or `"numeric"`, where the variables are `?` or `:N`, respectively.

    For example:

    ```bash
    format: .execute("... WHERE my_column = %s", (value,))
    pyformat: .execute("... WHERE my_column = %(name)s", {"name": value})
    qmark: .execute("... WHERE my_column = ?", (value,))
    numeric: .execute("... WHERE my_column = :1", (value,))
    ```

    > **Note:**
    >
    > The binding variable occurs on the client side if `paramstyle` is `"pyformat"` or
    > `"format"`, and on the server side if `"qmark"` or `"numeric"`. Currently,
    > there is no significant difference between those options in terms of performance or features
    > because the connector doesn’t support compiling SQL text followed by
    > multiple executions. Instead, the `"qmark"` and `"numeric"` options align with the query text
    > compatibility of other drivers (i.e. JDBC, ODBC, Go Snowflake Driver), which support server
    > side bindings with the variable format `?` or `:N`.

### Functions

connect(*parameters...*)
:   Purpose:
    :   Constructor for creating a connection to the database. Returns a `Connection` object.

        By default, autocommit mode is enabled (i.e. if the connection is closed, all changes are committed). If you need a
        transaction, use the [BEGIN](../../sql-reference/sql/begin.md) command to start the transaction, and [COMMIT](../../sql-reference/sql/commit.md)
        or [ROLLBACK](../../sql-reference/sql/rollback.md) to commit or roll back any changes.

    Parameters:
    :   The valid input parameters are:

        | Parameter | Required | Description |
        | --- | --- | --- |
        | `account` | Yes | Your account identifier. The account identifier does not include the `snowflakecomputing.com` suffix. . . For details and examples, see Usage Notes (in this topic). |
        | `user` | Yes | Login name for the user. |
        | `password` | Yes | Password for the user. |
        | `application` |  | Name that identifies the application making the connection. |
        | `region` |  | *Deprecated* This description of the parameter is for backwards compatibility only.. |
        | `host` |  | Host name. |
        | `port` |  | Port number (`443` by default). |
        | `database` |  | Name of the default database to use. After login, you can use [USE DATABASE](../../sql-reference/sql/use-database.md) to change the database. |
        | `schema` |  | Name of the default schema to use for the database. After login, you can use [USE SCHEMA](../../sql-reference/sql/use-schema.md) to change the schema. |
        | `role` |  | Name of the default role to use. After login, you can use [USE ROLE](../../sql-reference/sql/use-role.md) to change the role. |
        | `warehouse` |  | Name of the default warehouse to use. After login, you can use [USE WAREHOUSE](../../sql-reference/sql/use-warehouse.md) to change the warehouse.. |
        | `passcode_in_password` |  | `False` by default. Set this to `True` if the MFA (Multi-Factor Authentication) passcode is embedded in the login password. |
        | `passcode` |  | The passcode provided by Duo when using MFA (Multi-Factor Authentication) for login. |
        | `private_key` |  | The private key used for authentication. For more information, see [Using key-pair authentication and key-pair rotation](python-connector-connect.md). |
        | `private_key_file` |  | Specifies the path to the private key file for the specified user. See [Using key-pair authentication and key-pair rotation](python-connector-connect.md). |
        | `private_key_file_pwd` |  | Specifies the passphrase to decrypt the private key file for the specified user. See [Using key-pair authentication and key-pair rotation](python-connector-connect.md). |
        | `autocommit` |  | `None` by default, which honors the Snowflake parameter [AUTOCOMMIT](../../sql-reference/parameters.md). Set to `True` or `False` to enable or disable autocommit mode in the session, respectively. |
        | `client_fetch_use_mp` |  | When set to `True`, it enables multi-processed fetching, which for many cases should reduce the fetching time. Default: `False`. |
        | `client_prefetch_threads` |  | Number of threads used to download the results sets (`4` by default). Increasing the value improves fetch performance but requires more memory. |
        | `client_session_keep_alive` |  | To keep the session active indefinitely (even if there is no activity from the user), set this to `True`. When setting this to `True`, call the `close` method to terminate the thread properly; otherwise, the process might hang. The default value depends on the version of the connector that you are using:   * **Version 2.4.6 and later:** `None` by default. . When the value is `None`, the [CLIENT_SESSION_KEEP_ALIVE](../../sql-reference/parameters.md) session parameter takes precedence. . . To override the session parameter, pass in `True` or `False` for this argument. * **Version 2.4.5 and earlier:** `False` by default. . When the value is `False` (either by specifying the value explicitly or by omitting the argument), the [CLIENT_SESSION_KEEP_ALIVE](../../sql-reference/parameters.md) session parameter takes precedence. . .   Passing `client_session_keep_alive=False` to the `connect` method does not override the value `TRUE` in the `CLIENT_SESSION_KEEP_ALIVE` session parameter. |
        | `login_timeout` |  | Timeout in seconds for login. By default, 60 seconds. The login request gives up after the timeout length if the HTTP response is “success”. |
        | `network_timeout` |  | Timeout in seconds for all other operations. By default, none/infinite. A general request gives up after the timeout length if the HTTP response is not “success”. |
        | `ocsp_response_cache_filename` |  | URI for the OCSP response cache file. By default, the OCSP response cache file is created in the cache directory:   * Linux: `~/.cache/snowflake/ocsp_response_cache` * macOS: `~/Library/Caches/Snowflake/ocsp_response_cache` * Windows: `%USERPROFILE%AppDataLocalSnowflakeCachesocsp_response_cache`   To locate the file in a different directory, specify the path and file name in the URI (e.g. `file:///tmp/my_ocsp_response_cache.txt`).. |
        | `authenticator` |  | Authenticator for Snowflake:   * `snowflake` (default) to use the internal Snowflake authenticator. * `externalbrowser` to authenticate using your web browser and Okta, AD FS, or any other SAML 2.0-compliant identity provider (IdP) that has been defined for your account.  You can enable the `SNOWFLAKE_AUTH_FORCE_SERVER` environment variable to force re-authentication through the browser even if a valid SSO session exists. For more information, see [Using connection caching to minimize the number of prompts for authentication — Optional](../../user-guide/admin-security-fed-auth-use.md). * `https://<okta_account_name>.okta.com` (i.e. the URL endpoint for your Okta account) to authenticate through native Okta. * `oauth` to authenticate using OAuth. You must also specify the `token` parameter and set its value to the OAuth access token. * `username_password_mfa` to authenticate with MFA token caching. For more details, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md). * `OAUTH_AUTHORIZATION_CODE` to use the OAuth 2.0 Authorization Code flow. * `OAUTH_CLIENT_CREDENTIALS` to use the OAuth 2.0 Client Credentials flow. * `WORKLOAD_IDENTITY` to authenticate with the [workload identity federation (WIF)](../../user-guide/workload-identity-federation.md) authenticator.   If the value is not `snowflake`, the user and password parameters must be your login credentials for the IdP. |
        | `validate_default_parameters` |  | `False` by default. If `True`, then:   * Raise an exception if the specified database, schema, or warehouse doesn’t exist. * Print a warning to stderr if an invalid argument name or an argument value of the wrong data type is passed. |
        | `paramstyle` |  | `pyformat` by default for client side binding. Specify `qmark` or `numeric` to change bind variable formats for server side binding. |
        | `timezone` |  | `None` by default, which honors the Snowflake parameter [TIMEZONE](../../sql-reference/parameters.md). Set to a valid time zone (e.g. `America/Los_Angeles`) to set the session time zone. |
        | `arrow_number_to_decimal` |  | `False` by default, which means that [NUMBER](../../sql-reference/data-types-numeric.md) column values are returned as double-precision floating point numbers (`float64`). . . Set this to `True` to return DECIMAL column values as decimal numbers (`decimal.Decimal`) when calling the `fetch_pandas_all()` and `fetch_pandas_batches()` methods. . . This parameter was introduced in version 2.4.3 of the Snowflake Connector for Python. |
        | `socket_timeout` |  | Timeout in seconds for socket-level read and connect requests. For more information, see [Managing connection timeouts](python-connector-connect.md). |
        | `backoff_policy` |  | Name of the generator function that defines how long to wait between retries. For more information, see [Managing connection backoff policies for retries](python-connector-connect.md). |
        | `enable_connection_diag` |  | Whether to generate a connectivity diagnostic report. Default is `False`. |
        | `connection_diag_log_path` |  | Absolute path for the location of the diagnostic report. Used only if `enable_connection_diag` is `True`. Default is the default temp directory for your operating system, such as `/tmp` for Linux or Mac. |
        | `connection_diag_allowlist_path` |  | Absolute path to a JSON file containing the output of `SYSTEM$ALLOWLIST()` or `SYSTEM$ALLOWLIST_PRIVATELINK()`. Required only if the user defined in the connection does not have permission to run the system allowlist functions or if connecting to the account URL fails. |
        | `iobound_tpe_limit` |  | Size of the preprocess_tpe and postprocess threadpool executors (TPEs). By default, the value is the lesser of the number of files and the number of CPU cores. |
        | `unsafe_file_write` |  | Specifies which file permissions to assign to files downloaded from a stage using a GET command. `False` (default) sets the file permissions to `600`, which means only the owner can access the files. `True` sets the permissions to `644`, which gives the owner read and write permissions and read-only permissions to everyone else. For more information, see [Downloading data](python-connector-example.md). |
        | `oauth_client_id` |  | Value of `client id` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata). |
        | `oauth_client_secret` |  | Value of the `client secret` provided by the Identity Provider for Snowflake integration (Snowflake security integration metadata). |
        | `oauth_authorization_url` |  | Identity Provider endpoint supplying the authorization code to the driver. When using Snowflake as an Identity Provider ,this value is derived from the `server` or `account` parameters. |
        | `oauth_token_request_url` |  | Identity Provider endpoint supplying the access tokens to the driver. When using Snowflake as an Identity Provider ,this value is derived from the `server` or `account` parameters. |
        | `oauth_scope` |  | Scope requested in the Identity Provider authorization request. By default, it is derived from the role. When multiple scopes are required, the value should be a space-separated list of multiple scopes. |
        | `oauth_redirect_uri` |  | URI to use for authorization code redirection (Snowflake security integration metadata). Default: `http://127.0.0.1:{randomAvailablePort}`. |
        | `oauth_disable_pkce` |  | Disables Proof Key for Code Exchange (PKCE), a security enhancement that ensures that even if malicious attackers intercept an Authorization Code, they won’t be able to change it to a valid access token. |
        | `oauth_enable_refresh_token` |  | Enables a silent re-authentication when the actual access token becomes outdated, providing it’s supported by the Authorization Server and `client_store_temporary_credential` is set to `True`. |
        | `oauth_enable_single_use_refresh_tokens` |  | Whether to opt-in to single-use refresh token semantics. |
        | `oauth_credentials_in_body` |  | Whether or not credentials should be sent in the body for OAuth authentication. Default is `False`. |
        | `oauth_socket_uri` |  | URI to use for the OAuth socket connection. Default: `http://127.0.0.1:{randomAvailablePort}`. |
        | `client_store_temporary_credential` |  | Whether or not to allow clients to cache SSO credentials on the client side. For this setting to take effect, caching must be enabled on the server. For more information, see [Using connection caching to minimize the number of prompts for authentication — Optional](../../user-guide/admin-security-fed-auth-use.md). Default is `False`. |
        | `client_request_mfa_token` |  | Whether or not to allow clients to cache MFA credentials on the client side For this setting to take effect, caching must be enabled on the server. For more information, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../../user-guide/security-mfa.md). Default is `False`. |
        | `workload_identity_provider` |  | Platform of the workload identity provider. Possible values include: `AWS`, `AZURE`, `GCP`, and `OIDC`. |
        | `workload_identity_impersonation_path` |  | An array of strings that provide an identity chain to use when connecting to Snowflake. Array elements are either a full service account address or a service account’s unique ID.  Impersonation works by following each array entry in order to obtain a token that allows authorization of the next service account. Each account in the identity chain needs permissions to impersonate the next account only. The final account in the list obtains your Snowflake connection token and is used to connect to Snowflake.  Account impersonation is supported only for Google Cloud and AWS workloads. |
        | `unsafe_skip_file_permissions_check` |  | Whether to skip permissions checks on file access. Default is `False`. |
        | `cert_revocation_check_mode` |  | Certificate revocation check mode. For accepted values, see CertRevocationCheckMode. |
        | `allow_certificates_without_crl_url` |  | Whether to allow certificates without certificate revocation list distribution points. Default is `False`. |
        | `crl_connection_timeout_ms` |  | The connection timeout for certificate revocation list downloads, in milliseconds. Default is `3000`. |
        | `crl_read_timeout_ms` |  | The read timeout for certificate revocation list downloads, in milliseconds. Default is `3000`. |
        | `crl_cache_validity_hours` |  | How long the certificate revocation list cache is valid, in hours. Default is `24`. |
        | `enable_crl_cache` |  | Whether or not to enable certificate revocation list caching. Default is `True`. |
        | `enable_crl_file_cache` |  | Whether to cache certificate revocation list to disk. Applies only when `enable_crl_cache` is `True`. Default is `True`. |
        | `crl_cache_dir` |  | The directory to store the certificate revocation list cache in. Applies only when `enable_crl_file_cache` is `True`. |
        | `crl_cache_removal_delay_days` |  | The amount of time to keep expired certificate revocation list files on disk, in days. Default is `7`. |
        | `crl_cache_cleanup_interval_hours` |  | How often to clean the certificate revocation list cache, in hours. Default is `1`. |
        | `crl_cache_start_cleanup` |  | Whether to run certificate revocation list cleanup activities in the background. Default is `False`. |
        | `ocsp_root_certs_dict_lock_timeout` |  | Timeout for acquiring the lock on the OCSP root certs dictionary, in seconds. A value of `-1` disables timeouts. Default is `-1`. |
        | `no_proxy` |  | Comma-separated list of hostnames that should bypass the proxy. You can use an asterisk (`*`) in the hostnames. For more information, see [Using a proxy server](python-connector-connect.md). |

### Attributes

Error, Warning, ...
:   All exception classes defined by the Python database API standard. The Snowflake
    Connector for Python provides the attributes `msg`, `errno`, `sqlstate`,
    `sfqid` and `raw_msg`.

### Usage notes for the `account` parameter (for the `connect` method)

For the required `account` parameter, specify your [account identifier](../../user-guide/gen-conn-config.md).

Note that the account identifier does not include the `snowflakecomputing.com` domain name. Snowflake automatically
appends this when creating the connection.

The following example uses the [account name as an identifier](../../user-guide/admin-account-identifier.md) for the account `myaccount` in
the organization `myorganization`.

```python
ctx = snowflake.connector.connect(
    user='<user_name>',
    password='<password>',
    account='myorganization-myaccount',
    ... )
```

The following example uses the [account locator](../../user-guide/admin-account-identifier.md) `xy12345` as the account identifier:

```python
ctx = snowflake.connector.connect(
    user='<user_name>',
    password='<password>',
    account='xy12345',
    ... )
```

Note that this example uses an account in the AWS US West (Oregon) region. If the account is in a different region or if the
account uses a different cloud provider, you need to
[specify additional segments after the account locator](../../user-guide/admin-account-identifier.md).

## Object: `Connection`

A `Connection` object holds the connection and session information to keep the database connection active. If it is closed or the session expires, any subsequent operations will fail.

### Methods

autocommit(*True|False*)
:   Purpose:
    :   Enables or disables autocommit mode. By default, autocommit is enabled (`True`).

close()
:   Purpose:
    :   Closes the connection. If a transaction is still open when the connection is closed, the
        changes are rolled back.

        Closing the connection explicitly removes the active session from the server; otherwise, the active session continues until it is eventually purged from the server, limiting the number of concurrent queries.

        For example:

        ```python
        # context manager ensures the connection is closed
        with snowflake.connector.connect(...) as con:
            con.cursor().execute(...)

        # try & finally to ensure the connection is closed.
        con = snowflake.connector.connect(...)
        try:
            con.cursor().execute(...)
        finally:
            con.close()
        ```

commit()
:   Purpose:
    :   If autocommit is disabled, commits the current transaction. If autocommit is enabled, this
        method is ignored.

rollback()
:   Purpose:
    :   If autocommit is disabled, rolls back the current transaction. If autocommit is enabled,
        this method is ignored.

cursor()
:   Purpose:
    :   Constructor for creating a `Cursor` object. The return values from
        `fetch*()` calls will be a single sequence or list of sequences.

cursor(*snowflake.connector.DictCursor*)
:   Purpose:
    :   Constructor for creating a `DictCursor` object. The return values from
        `fetch*()` calls will be a single dict or list of dict objects. This
        is useful for fetching values by column name from the results.

execute_string(*sql_text*, *remove_comments=False*, *return_cursors=True*)
:   Purpose:
    :   Execute one or more SQL statements passed as strings. If `remove_comments` is set to `True`,
        comments are removed from the query. If `return_cursors` is set to `True`, this
        method returns a sequence of `Cursor` objects in the order of execution.

    Example:
    :   This example shows executing multiple commands in a single string and then using the sequence of
        cursors that is returned:

        ```python
        cursor_list = connection1.execute_string(
            "SELECT * FROM testtable WHERE col1 LIKE 'T%';"
            "SELECT * FROM testtable WHERE col2 LIKE 'A%';"
            )

        for cursor in cursor_list:
           for row in cursor:
              print(row[0], row[1])
        ```

    > **Note:**
    >
    > Methods such as `execute_string()` that allow multiple SQL statements in a single
    > string are vulnerable to SQL injection attacks. Avoid using string concatenation,
    > or functions such as Python’s `format()` function, to dynamically compose a SQL statement
    > by combining SQL with data from users unless you have validated the user data. The example
    > below demonstrates the problem:
    >
    > ```python
    > # "Binding" data via the format() function (UNSAFE EXAMPLE)
    > value1_from_user = "'ok3'); DELETE FROM testtable WHERE col1 = 'ok1'; select pi("
    > sql_cmd = "insert into testtable(col1) values('ok1'); "                  \
    >           "insert into testtable(col1) values('ok2'); "                  \
    >           "insert into testtable(col1) values({col1});".format(col1=value1_from_user)
    > # Show what SQL Injection can do to a composed statement.
    > print(sql_cmd)
    >
    > connection1.execute_string(sql_cmd)
    > ```
    >
    > The dynamically-composed statement looks like the following (newlines have
    > been added for readability):
    >
    > ```sqlexample
    > insert into testtable(col1) values('ok1');
    > insert into testtable(col1) values('ok2');
    > insert into testtable(col1) values('ok3');
    > DELETE FROM testtable WHERE col1 = 'ok1';
    > select pi();
    > ```
    >
    > If you are combining SQL statements with strings entered by untrusted users,
    > then it is safer to bind data to a statement than to compose a string.
    > The `execute_string()` method doesn’t take binding parameters, so to bind parameters
    > use `Cursor.execute()` or `Cursor.executemany()`.

execute_stream(*sql_stream*, *remove_comments=False*)
:   Purpose:
    :   Execute one or more SQL statements passed as a stream object. If `remove_comments` is set to `True`,
        comments are removed from the query. This generator yields each `Cursor` object as SQL statements run.

        If `sql_stream` ends with comment lines, you must set `remove_comments` to `True`, similar to the following:

        ```sqlexample
        sql_script = """
        -- This is first comment line;
        select 1;
        select 2;
        -- This is comment in middle;
        -- With some extra comment lines;
        select 3;
        -- This is the end with last line comment;
        """
        sql_stream = StringIO(sql_script)
        with con.cursor() as cur:
                for result_cursor in con.execute_stream(sql_stream,remove_comments=True):
                    for result in result_cursor:
                        print(f"Result: {result}")
        ```

get_query_status(*query_id*)
:   Purpose:
    :   Returns the status of a query.

    Parameters:
    :   `query_id`

        > The ID of the query. See [Retrieving the Snowflake query ID](python-connector-example.md).

    Returns:
    :   Returns the `QueryStatus` object that represents the status of the query.

    Example:
    :   See [Checking the status of a query](python-connector-example.md).

get_query_status_throw_if_error(*query_id*)
:   Purpose:
    :   Returns the status of a query. If the query results in an error, this method raises a `ProgrammingError` (as the
        `execute()` method would).

    Parameters:
    :   `query_id`

        > The ID of the query. See [Retrieving the Snowflake query ID](python-connector-example.md).

    Returns:
    :   Returns the `QueryStatus` object that represents the status of the query.

    Example:
    :   See [Checking the status of a query](python-connector-example.md).

is_valid()
:   Purpose:
    :   Returns `True` if the connection is stable enough to receive queries.

is_still_running(*query_status*)
:   Purpose:
    :   Returns `True` if the query status indicates that the query has not yet completed or is still in process.

    Parameters:
    :   `query_status`

        > The `QueryStatus` object that represents the status of the query. To get this object for a query, see
        > [Checking the status of a query](python-connector-example.md).

    Example:
    :   See [Checking the status of a query](python-connector-example.md).

is_an_error(*query_status*)
:   Purpose:
    :   Returns `True` if the query status indicates that the query resulted in an error.

    Parameters:
    :   `query_status`

        > The `QueryStatus` object that represents the status of the query. To get this object for a query, see
        > [Checking the status of a query](python-connector-example.md).

    Example:
    :   See [Checking the status of a query](python-connector-example.md).

### Attributes

expired
:   Tracks whether the connection’s master token has expired.

messages
:   The list object including sequences (exception class, exception value) for all
    messages received from the underlying database for this connection.

    The list is cleared automatically by any method call.

errorhandler
:   Read/Write attribute that references an error handler to call in case an
    error condition is met.

    The handler must be a Python callable that accepts the following arguments:

    > `errorhandler(connection, cursor, errorclass, errorvalue)`

Error, Warning, ...
:   All exception classes defined by the Python database API standard.

## Object: `Cursor`

A `Cursor` object represents a database cursor for execute and fetch operations.
Each cursor has its own attributes, `description` and `rowcount`, such that
cursors are isolated.

### Methods

close()
:   Purpose:
    :   Closes the cursor object.

describe(*command [, parameters][, timeout][, file_stream]*)
:   Purpose:
    :   Returns metadata about the result set without executing a database command. This returns the same metadata that is
        available in the `description` attribute after executing a query.

        This method was introduced in version 2.4.6 of the Snowflake Connector for Python.

    Parameters:
    :   See the parameters for the `execute()` method.

    Returns:
    :   Returns a list of ResultMetadata objects that describe the columns
        in the result set.

    Example:
    :   See [Retrieving column metadata](python-connector-example.md).

execute(*command [, parameters][, timeout][, file_stream]*)
:   Purpose:
    :   Prepares and executes a database command.

    Parameters:
    :   `command`

        > A string containing the SQL statement to execute.

        `parameters`

        > (Optional) If you used parameters for [binding data](python-connector-example.md) in the SQL
        > statement, set this to the list or dictionary of variables that should be bound to those parameters.
        >
        > For more information about mapping the Python data types for the variables to the SQL data types of the corresponding
        > columns, see Data type mappings for qmark and numeric bindings.

        `timeout`

        > (Optional) Number of seconds to wait for the query to complete. If the query has not completed after this time has
        > passed, the query should be aborted.

        `file_stream`

        > (Optional) When executing a PUT command, you can use this parameter to upload an in-memory file-like object (e.g. the
        > I/O object returned from the Python `open()` function), rather than a file on the filesystem. Set this
        > parameter to that I/O object.
        >
        > When specifying the URI for the data file in the PUT command:
        >
        > * You can use any directory path. The directory path that you specify in the URI is ignored.
        > * For the filename, specify the name of the file that should be created on the stage.
        >
        > For example, to upload a file from a file stream to a file named:
        >
        > ```none
        > @mystage/myfile.csv
        > ```
        >
        > use the following call:
        >
        > ```python
        > cursor.execute(
        >     "PUT file://this_directory_path/is_ignored/myfile.csv @mystage",
        >     file_stream=<io_object>)
        > ```

    Returns:
    :   Returns the reference of a `Cursor` object.

executemany(*command*, *seq_of_parameters*)
:   Purpose:
    :   Prepares a database command and executes it against all parameter sequences
        found in `seq_of_parameters`. You can use this method to
        [perform a batch insert operation](python-connector-example.md).

    Parameters:
    :   `command`

        > The command is a string containing the code to execute.
        > The string should contain one or more placeholders (such as
        > question marks) for [Binding data](python-connector-example.md).
        >
        > For example:
        >
        > ```python
        > "insert into testy (v1, v2) values (?, ?)"
        > ```

        `seq_of_parameters`

        > This should be a sequence (list or tuple) of lists or tuples. See the example code below for example
        > sequences.

    Returns:
    :   Returns the reference of a `Cursor` object.

    Example:
    :   ```python
        # This example uses qmark (question mark) binding, so
        # you must configure the connector to use this binding style.
        from snowflake import connector
        connector.paramstyle='qmark'

        stmt1 = "create table testy (V1 varchar, V2 varchar)"
        cs.execute(stmt1)

        # A list of lists
        sequence_of_parameters1 = [ ['Smith', 'Ann'], ['Jones', 'Ed'] ]
        # A tuple of tuples
        sequence_of_parameters2 = ( ('Cho', 'Kim'), ('Cooper', 'Pat') )

        stmt2 = "insert into testy (v1, v2) values (?, ?)"
        cs.executemany(stmt2, sequence_of_parameters1)
        cs.executemany(stmt2, sequence_of_parameters2)
        ```

    Internally, multiple `execute` methods are called and the result set from the
    last `execute` call will remain.

    > **Note:**
    >
    > The `executemany` method can only be used to execute a single parameterized SQL statement
    > and pass multiple bind values to it.
    >
    > Executing multiple SQL statements separated by a semicolon in one `execute` call is not supported.
    > Instead, issue a separate `execute` call for each statement.

execute_async(*...*)
:   Purpose:
    :   Prepares and submits a database command for asynchronous execution.
        See [Performing an asynchronous query](python-connector-example.md).

    Parameters:
    :   This method uses the same parameters as the `execute()` method.

    Returns:
    :   Returns the reference of a `Cursor` object.

    Example:
    :   See [Examples of asynchronous queries](python-connector-example.md).

fetch_arrow_all()
:   Purpose:
    :   This method fetches all the rows in a cursor and loads them into a PyArrow table.

    Parameters:
    :   `force_microsecond_precision`

        > When `True`, all timestamp columns are converted to microsecond precision, ensuring consistent schema across all batches. This feature is useful when your data contains timestamps outside the nanosecond range (1677-2262), such as ‘9999-12-31’ or ‘0001-01-01’. When `False` (default), precision is determined per-batch based on the data, which might cause pyarrow schema mismatch errors when combining batches. Note that enabling this truncates sub-microsecond precision (scale 7-9).

    Returns:
    :   Returns a PyArrow table containing all the rows from the result set.

        If there are no rows, this returns None.

    Example:
    :   See [Distributing workloads that fetch results with the Snowflake Connector for Python](python-connector-distributed-fetch.md).

fetch_arrow_batches()
:   Purpose:
    :   This method fetches a subset of the rows in a cursor and delivers them to a PyArrow table.

    Parameters:
    :   `force_microsecond_precision`

        > When `True`, all timestamp columns are converted to microsecond precision, ensuring consistent schema across all batches. This feature is useful when your data contains timestamps outside the nanosecond range (1677-2262), such as ‘9999-12-31’ or ‘0001-01-01’. When `False` (default), precision is determined per-batch based on the data, which might cause pyarrow schema mismatch errors when combining batches. Note that enabling this truncates sub-microsecond precision (scale 7-9).

    Returns:
    :   Returns a PyArrow table containing a subset of the rows from the result set.

        Returns None if there are no more rows to fetch.

    Example:
    :   See [Distributing workloads that fetch results with the Snowflake Connector for Python](python-connector-distributed-fetch.md).

get_result_batches()
:   Purpose:
    :   Returns a list of ResultBatch objects that you can use to fetch a
        subset of rows from the result set.

    Parameters:
    :   None.

    Returns:
    :   Returns a list of ResultBatch objects or `None` if the query has
        not finished executing.

    Example:
    :   See [Distributing workloads that fetch results with the Snowflake Connector for Python](python-connector-distributed-fetch.md).

get_results_from_sfqid(*query_id*)
:   Purpose:
    :   Retrieves the results of an asynchronous query or a previously submitted synchronous query.

    Parameters:
    :   `query_id`

        > The ID of the query. See [Retrieving the Snowflake query ID](python-connector-example.md).

    Example:
    :   See [Using the query ID to retrieve the results of a query](python-connector-example.md).

fetchone()
:   Purpose:
    :   Fetches the next row of a query result set and returns a single sequence/dict or
        `None` when no more data is available.

fetchmany([*size=cursor.arraysize*])
:   Purpose:
    :   Fetches the next rows of a query result set and returns a list of
        sequences/dict. An empty sequence is returned when no more rows are available.

fetchall()
:   Purpose:
    :   Fetches all or remaining rows of a query result set and returns a list of
        sequences/dict.

fetch_pandas_all()
:   Purpose:
    :   This method fetches all the rows in a cursor and loads them into a pandas DataFrame.

    Parameters:
    :   `force_microsecond_precision`

        > When `True`, all timestamp columns are converted to microsecond precision, ensuring consistent schema across all batches. This feature is useful when your data contains timestamps outside the nanosecond range (1677-2262), such as ‘9999-12-31’ or ‘0001-01-01’. When `False` (default), precision is determined per-batch based on the data, which might cause pyarrow schema mismatch errors when combining batches. Note that enabling this truncates sub-microsecond precision (scale 7-9).

    Returns:
    :   Returns a DataFrame containing all the rows from the result set.

        For more information about pandas data frames, see the [pandas DataFrame](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html) documentation .

        If there are no rows, this returns `None`.

    Usage Notes:
    :   * This method is not a complete replacement for the `read_sql()` method of pandas; this method is to provide
          a fast way to retrieve data from a SELECT query and store the data in a pandas DataFrame.
        * Currently, this method works only for SELECT statements.

    Examples:
    :   ```python
        ctx = snowflake.connector.connect(
                  host=host,
                  user=user,
                  password=password,
                  account=account,
                  warehouse=warehouse,
                  database=database,
                  schema=schema,
                  protocol='https',
                  port=port)

        # Create a cursor object.
        cur = ctx.cursor()

        # Execute a statement that will generate a result set.
        sql = "select * from t"
        cur.execute(sql)

        # Fetch the result set from the cursor and deliver it as the pandas DataFrame.
        df = cur.fetch_pandas_all()

        # ...
        ```

fetch_pandas_batches()
:   Purpose:
    :   This method fetches a subset of the rows in a cursor and delivers them to a pandas DataFrame.

    Parameters:
    :   `force_microsecond_precision`

        > When `True`, all timestamp columns are converted to microsecond precision, ensuring consistent schema across all batches. This feature is useful when your data contains timestamps outside the nanosecond range (1677-2262), such as ‘9999-12-31’ or ‘0001-01-01’. When `False` (default), precision is determined per-batch based on the data, which might cause pyarrow schema mismatch errors when combining batches. Note that enabling this truncates sub-microsecond precision (scale 7-9).

    Returns:
    :   Returns a DataFrame containing a subset of the rows from the result set.

        For more information about pandas data frames, see the [pandas DataFrame](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html) documentation.

        Returns `None` if there are no more rows to fetch.

    Usage Notes:
    :   * Depending upon the number of rows in the result set, as well as the number of rows specified in the method
          call, the method might need to be called more than once, or it might return all rows in a single batch if
          they all fit.
        * This method is not a complete replacement for the `read_sql()` method of pandas; this method is to provide
          a fast way to retrieve data from a SELECT query and store the data in a pandas DataFrame.
        * Currently, this method works only for SELECT statements.

    Examples:
    :   ```python
        ctx = snowflake.connector.connect(
                  host=host,
                  user=user,
                  password=password,
                  account=account,
                  warehouse=warehouse,
                  database=database,
                  schema=schema,
                  protocol='https',
                  port=port)

        # Create a cursor object.
        cur = ctx.cursor()

        # Execute a statement that will generate a result set.
        sql = "select * from t"
        cur.execute(sql)

        # Fetch the result set from the cursor and deliver it as the pandas DataFrame.
        for df in cur.fetch_pandas_batches():
            my_dataframe_processing_function(df)

        # ...
        ```

__iter__()
:   Returns self to make cursors compatible with the iteration protocol.

### Attributes

description
:   Read-only attribute that returns metadata about the columns in the result set.

    This attribute is set after you call the `execute()` method to execute the query. (In version 2.4.6 or later, you can
    retrieve this metadata without executing the query by calling the `describe()` method.)

    This attribute is set to one of the following:

    * **Versions 2.4.5 and earlier:** This attribute is set to a list of tuples.
    * **Versions 2.4.6 and later:** This attribute is set to a list of
      ResultMetadata objects.

    Each tuple or `ResultMetadata` object contains the metadata that describes a column in the result set. You can access
    the metadata by index or, in versions 2.4.6 and later, by `ResultMetadata` object attribute:

    | Index of Value | ResultMetadata Attribute | Description |
    | --- | --- | --- |
    | `0` | `name` | Column name. |
    | `1` | `type_code` | Internal type code. |
    | `2` | `display_size` | (Not used. Same as internal_size.) |
    | `3` | `internal_size` | Internal data size. |
    | `4` | `precision` | Precision of numeric data. |
    | `5` | `scale` | Scale for numeric data. |
    | `6` | `is_nullable` | `True` if NULL values allowed for the column or `False`. |

    For examples of getting this attribute, see [Retrieving column metadata](python-connector-example.md).

rowcount
:   Read-only attribute that returns the number of rows in the last `execute` produced.
    The value is `-1` or `None` if no `execute` is executed.

sfqid
:   Read-only attribute that returns the Snowflake query ID in the last `execute` or `execute_async` executed.

arraysize
:   Read/write attribute that specifies the number of rows to fetch at a time with `fetchmany()`.
    It defaults to `1` meaning to fetch a single row at a time.

connection
:   Read-only attribute that returns a reference to the `Connection` object on which the cursor
    was created.

messages
:   List object that includes the sequences (exception class, exception value) for all messages
    which it received from the underlying database for the cursor.

    The list is cleared automatically by any method call except for `fetch*()` calls.

errorhandler
:   Read/write attribute that references an error handler to call in case an error condition is
    met.

    The handler must be a Python callable that accepts the following arguments:

    > `errorhandler(connection, cursor, errorclass, errorvalue)`

stats
:   Provides detailed row-level statistics for DML operations, particularly useful for CTAS (CREATE TABLE AS SELECT) statements where DML statistics were previously unavailable.

    Returns a `QueryResultStats` `NamedTuple` with four fields:

    * `num_rows_inserted` : Number of rows inserted (`int` | `None`)
    * `num_rows_deleted` : Number of rows deleted (`int` | `None`)
    * `num_rows_updated` : Number of rows updated (`int` | `None`)
    * `num_dml_duplicates` : Number of duplicate rows in DML statement (`int` | `None`)

    If no DML stats are available, returns a `QueryResultStats` instance with all fields set to `None`, including the following situations:

    * DML operations where no rows were affected (such as a DELETE … clause with a WHERE condition returning `FALSE` for all entries)
    * Non-DML type of SQL statements (such as DDL and DQL)
    * Multi-statements
    * Async queries (`execute_async)`
    * Result retrieval with QueryID (`get_results_from_sfqid`)

    Note that the `stats` property does not return `None` in these cases; it always returns a `QueryResultStats` instance with all fields set to `None`.

### Type codes

In the `Cursor` object, the `description` attribute and the `describe()` method provide a list of tuples
(or, in versions 2.4.6 and later, ResultMetadata objects) that describe the
columns in the result set.

In a tuple, the value at the index `1` (the `type_code` attribute In the `ResultMetadata` object) represents the
column data type. The Snowflake Connector for Python uses the following map to get the string representation, based on the type
code:

| type_code | String Representation | Data Type |
| --- | --- | --- |
| 0 | FIXED | NUMBER/INT |
| 1 | REAL | REAL |
| 2 | TEXT | VARCHAR/STRING |
| 3 | DATE | DATE |
| 4 | TIMESTAMP | TIMESTAMP |
| 5 | VARIANT | VARIANT |
| 6 | TIMESTAMP_LTZ | TIMESTAMP_LTZ |
| 7 | TIMESTAMP_TZ | TIMESTAMP_TZ |
| 8 | TIMESTAMP_NTZ | TIMESTAMP_TZ |
| 9 | OBJECT | OBJECT |
| 10 | ARRAY | ARRAY |
| 11 | BINARY | BINARY |
| 12 | TIME | TIME |
| 13 | BOOLEAN | BOOLEAN |
| 14 | GEOGRAPHY | GEOGRAPHY |
| 15 | GEOMETRY | GEOMETRY |
| 16 | VECTOR | VECTOR |

### Data type mappings for `qmark` and `numeric` bindings

If `paramstyle` is either `"qmark"` or `"numeric"`, the following default mappings from
Python to Snowflake data type are used:

| Python Data Type | Data Type in Snowflake |
| --- | --- |
| `int` | NUMBER(38, 0) |
| `long` | NUMBER(38, 0) |
| `decimal` | NUMBER(38, <scale>) |
| `float` | REAL |
| `str` | TEXT |
| `unicode` | TEXT |
| `bytes` | BINARY |
| `bytearray` | BINARY |
| `bool` | BOOLEAN |
| `date` | DATE |
| `time` | TIME |
| `timedelta` | TIME |
| `datetime` | TIMESTAMP_NTZ |
| `struct_time` | TIMESTAMP_NTZ |

If you need to map to another Snowflake type (e.g. `datetime` to `TIMESTAMP_LTZ`), specify the
Snowflake data type in a tuple consisting of the Snowflake data type followed by the value. See
[Binding datetime with TIMESTAMP](python-connector-example.md) for examples.

## Object: `Exception`

PEP-249 defines the exceptions that the
Snowflake Connector for Python can raise in case of errors or warnings. The application must
handle them properly and decide to continue or stop running the code.

For more information, see the [PEP-249](https://www.python.org/dev/peps/pep-0249/) documentation.

### Methods

No methods are available for `Exception` objects.

### Attributes

errno
:   Snowflake DB error code.

msg
:   Error message including error code, SQL State code and query ID.

raw_msg
:   Error message. No error code, SQL State code or query ID is included.

sqlstate
:   ANSI-compliant SQL State code

sfqid
:   Snowflake query ID.

## Object `ResultBatch`

A `ResultBatch` object encapsulates a function that retrieves a subset of rows in a result set. To
[distribute the work of fetching results across multiple workers or nodes](python-connector-distributed-fetch.md), you can call
`get_result_batches()` method in the Cursor object to retrieve a list of
`ResultBatch` objects and distribute these objects to different workers or nodes for processing.

### Attributes

#### rowcount

Read-only attribute that returns the number of rows in the result batch.

#### compressed_size

Read-only attribute that returns the size of the data (when compressed) in the result batch.

#### uncompressed_size

Read-only attribute that returns the size of the data (uncompressed) in the result batch.

### Methods

to_arrow()
:   Purpose:
    :   This method returns a PyArrow table containing the rows in the `ResultBatch` object.

    Parameters:
    :   None.

    Returns:
    :   Returns a PyArrow table containing the rows from the `ResultBatch` object.

        If there are no rows, this returns None.

to_pandas()
:   Purpose:
    :   This method returns a pandas DataFrame containing the rows in the `ResultBatch` object.

    Parameters:
    :   None.

    Returns:
    :   Returns a pandas DataFrame containing the rows from the `ResultBatch` object.

        If there are no rows, this returns an empty pandas DataFrame.

## Object: `ResultMetadata`

A `ResultMetadata` object represents metadata about a column in the result set.
A list of these objects is returned by the `description` attribute and `describe` method of the `Cursor`
object.

This object was introduced in version 2.4.6 of the Snowflake Connector for Python.

### Methods

None.

### Attributes

name
:   Name of the column

type_code
:   Internal type code.

display_size
:   Not used. Same as internal_size.

internal_size
:   Internal data size.

precision
:   Precision of numeric data.

scale
:   Scale for numeric data.

is_nullable
:   `True` if NULL values allowed for the column or `False`.

## Module: `snowflake.connector.constants`

The `snowflake.connector.constants` module defines constants used in the API.

### Enums

*class* QueryStatus
:   Represents the status of an asynchronous query. This enum has the following constants:

    | Enum Constant | Description |
    | --- | --- |
    | RUNNING | The query is still running. |
    | ABORTING | The query is in the process of being aborted on the server side. |
    | SUCCESS | The query finished successfully. |
    | FAILED_WITH_ERROR | The query finished unsuccessfully. |
    | QUEUED | The query is queued for execution (i.e. has not yet started running), typically because it is waiting for resources. |
    | DISCONNECTED | The session’s connection is broken. The query’s state will change to “FAILED_WITH_ERROR” soon. |
    | RESUMING_WAREHOUSE | The warehouse is starting up and the query is not yet running. |
    | BLOCKED | The statement is waiting on a lock held by another statement. |
    | NO_DATA | Data about the statement is not yet available, typically because the statement has not yet started executing. |

*class* CertRevocationCheckMode
:   How to treat certificate revocation lists (CRLs) attached to a certificate. This enum has the following constants:

    | Enum Constant | Description |
    | --- | --- |
    | DISABLED | No revocation check is done. |
    | ADVISORY | Only a revoked certificate can invalidate the chain. Errors related to the CRL don’t revoke a certificate. |
    | ENABLED | Each certificate must have at least one valid CRL. Errors in connection, parsing, or validation of all associated CRLs revokes a certificate. |

## Module: `snowflake.connector.pandas_tools`

The `snowflake.connector.pandas_tools` module provides functions for
working with the pandas data analysis library.

For more information, see the [pandas data analysis library](https://pandas.pydata.org/) documentation.

### Functions

write_pandas(*parameters...*)
:   Purpose:
    :   Writes a pandas DataFrame to a table in a Snowflake database.

        To write the data to the table, the function saves the data to Parquet files, uses the [PUT](../../sql-reference/sql/put.md) command to upload these files to a temporary stage, and uses the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command to copy the data from the files to the table. You can use some of the function parameters to control how the `PUT` and `COPY INTO <table>` statements are executed.

    Parameters:
    :   The valid input parameters are:

        | Parameter | Required | Description |
        | --- | --- | --- |
        | `conn` | Yes | `Connection` object that holds the connection to the Snowflake database. |
        | `df` | Yes | `pandas.DataFrame` object containing the data to be copied into the table. |
        | `table_name` | Yes | Name of the table where the data should be copied. |
        | `database` |  | Name of the database containing the table. By default, the function writes to the database that is currently in use in the session. Note: If you specify this parameter, you must also specify the `schema` parameter. |
        | `schema` |  | Name of the schema containing the table. By default, the function writes to the table in the schema that is currently in use in the session. |
        | `bulk_upload_chunks` |  | Setting this parameter to `True` changes the behavior of the `write_pandas` function to first write all the data chunks to the local disk and then perform the wildcard upload of the chunks folder to the stage. When set to `False` (default), the chunks are saved, uploaded, and deleted one by one. |
        | `chunk_size` |  | Number of elements to insert at a time. By default, the function inserts all elements at once in one chunk. |
        | `compression` |  | The compression algorithm to use for the Parquet files. You can specify either `"gzip"` for better compression or `"snappy"` for faster compression. By default, the function uses `"gzip"`. |
        | `on_error` |  | Specifies how errors should be handled. Set this to one of the string values documented in the `ON_ERROR` [copy option](../../sql-reference/sql/copy-into-table.md). By default, the function uses `"ABORT_STATEMENT"`. |
        | `parallel` |  | Number of threads to use when uploading the Parquet files to the temporary stage. For the default number of threads used and guidelines on choosing the number of threads, see [the parallel parameter of the PUT command](../../sql-reference/sql/put.md). |
        | `quote_identifiers` |  | If `False`, prevents the connector from [putting double quotes around identifiers](../../sql-reference/identifiers-syntax.md) before sending the identifiers to the server. By default, the connector puts double quotes around identifiers. |

    Returns:
    :   Returns a tuple of `(success, num_chunks, num_rows, output)` where:

        * `success` is `True` if the function successfully wrote the data to the table.
        * `num_chunks` is the number of chunks of data that the function copied.
        * `num_rows` is the number of rows that the function inserted.
        * `output` is the output of the `COPY INTO <table>` command.

    Example:
    :   The following example writes the data from a pandas DataFrame to the table named ‘customers’.

        ```python
        import pandas
        from snowflake.connector.pandas_tools import write_pandas

        # Create the connection to the Snowflake database.
        cnx = snowflake.connector.connect(...)

        # Create a DataFrame containing data about customers
        df = pandas.DataFrame([('Mark', 10), ('Luke', 20)], columns=['name', 'balance'])

        # Write the data from the DataFrame to the table named "customers".
        success, nchunks, nrows, _ = write_pandas(cnx, df, 'customers')
        ```

pd_writer(*parameters...*)
:   Purpose:
    :   `pd_writer` is an
        insertion method for inserting data into
        a Snowflake database.

        When calling `pandas.DataFrame.to_sql`,
        pass in `method=pd_writer` to specify that you want to use `pd_writer` as the method for inserting data.
        (You do not need to call `pd_writer` from your own code. The `to_sql` method calls `pd_writer` and
        supplies the input parameters needed.)

        For more information see:

        * [insertion method](https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html#io-sql-method) documentation.
        * [pandas](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_sql.html) documentation.

        > **Note:**
        >
        > Please note that when column names in the pandas `DataFrame` contain only lowercase letters, you must enclose
        > the column names in double quotes; otherwise the connector raises a `ProgrammingError`.
        >
        > The `snowflake-sqlalchemy` library does not quote lowercase column names when creating a table,
        > while `pd_writer` quotes column names by default. The issue arises because the COPY INTO
        > command expects column names to be quoted.
        >
        > Future improvements will be made in the `snowflake-sqlalchemy` library.
        >
        > For example:
        >
        > ```python
        > import pandas as pd
        > from snowflake.connector.pandas_tools import pd_writer
        >
        > sf_connector_version_df = pd.DataFrame([('snowflake-connector-python', '1.0')], columns=['NAME', 'NEWEST_VERSION'])
        >
        > # Specify that the to_sql method should use the pd_writer function
        > # to write the data from the DataFrame to the table named "driver_versions"
        > # in the Snowflake database.
        > sf_connector_version_df.to_sql('driver_versions', engine, index=False, method=pd_writer)
        >
        > # When the column names consist of only lower case letters, quote the column names
        > sf_connector_version_df = pd.DataFrame([('snowflake-connector-python', '1.0')], columns=['"name"', '"newest_version"'])
        > sf_connector_version_df.to_sql('driver_versions', engine, index=False, method=pd_writer)
        > ```

        The `pd_writer` function uses the `write_pandas()` function to write the data in the DataFrame to the
        Snowflake database.

    Parameters:
    :   The valid input parameters are:

        | Parameter | Required | Description |
        | --- | --- | --- |
        | `table` | Yes | `pandas.io.sql.SQLTable` object for the table. |
        | `conn` | Yes | `sqlalchemy.engine.Engine` or `sqlalchemy.engine.Connection` object used to connect to the Snowflake database. |
        | `keys` | Yes | Names of the table columns for the data to be inserted. |
        | `data_iter` | Yes | Iterator for the rows containing the data to be inserted. |

    Example:
    :   The following example passes `method=pd_writer` to the `pandas.DataFrame.to_sql` method, which in turn calls
        the `pd_writer` function to write the data in the pandas DataFrame to a Snowflake database.

        ```python
        import pandas
        from snowflake.connector.pandas_tools import pd_writer

        # Create a DataFrame containing data about customers
        df = pandas.DataFrame([('Mark', 10), ('Luke', 20)], columns=['name', 'balance'])

        # Specify that the to_sql method should use the pd_writer function
        # to write the data from the DataFrame to the table named "customers"
        # in the Snowflake database.
        df.to_sql('customers', engine, index=False, method=pd_writer)
        ```

## Date and timestamp support

Snowflake supports multiple DATE and TIMESTAMP data types, and the Snowflake Connector
allows binding native `datetime` and `date` objects for update and fetch operations.

### Fetching data

When fetching date and time data, the Snowflake data types are converted into Python data types:

| Snowflake Data Types | Python Data Type | Behavior |
| --- | --- | --- |
| TIMESTAMP_TZ | [datetime](https://docs.python.org/2/library/datetime.html#datetime.datetime) with [tzinfo](https://docs.python.org/2/library/datetime.html#tzinfo-objects) | Fetches data, including the time zone offset, and translates it into a `datetime` with `tzinfo` object. |
| TIMESTAMP_LTZ, TIMESTAMP | [datetime](https://docs.python.org/2/library/datetime.html#datetime.datetime) with [tzinfo](https://docs.python.org/2/library/datetime.html#tzinfo-objects) | Fetches data, translates it into a `datetime` object, and attaches `tzinfo` based on the [TIMESTAMP_TYPE_MAPPING](../../sql-reference/parameters.md) session parameter. |
| TIMESTAMP_NTZ | [datetime](https://docs.python.org/2/library/datetime.html#datetime.datetime) | Fetches data and translates it into a `datetime` object. No time zone information is attached to the object. |
| DATE | [date](https://docs.python.org/2/library/datetime.html#datetime.date) | Fetches data and translates it into a `date` object. No time zone information is attached to the object. |

> **Note:**
>
> `tzinfo` is a UTC offset-based time zone object and not IANA time zone
> names. The time zone names might not match, but equivalent offset-based
> time zone objects are considered identical.

### Updating data

When updating date and time data, the Python data types are converted to Snowflake data types:

| Python Data Type | Snowflake Data Types | Behavior |
| --- | --- | --- |
| datetime | TIMESTAMP_TZ, TIMESTAMP_LTZ, TIMESTAMP_NTZ, DATE | Converts a datetime object into a string in the format of `YYYY-MM-DD HH24:MI:SS.FF TZH:TZM` and updates it. If no time zone offset is provided, the string will be in the format of `YYYY-MM-DD HH24:MI:SS.FF`. The user is responsible for setting the `tzinfo` for the `datetime` object. |
| struct_time | TIMESTAMP_TZ, TIMESTAMP_LTZ, TIMESTAMP_NTZ, DATE | Converts a struct_time object into a string in the format of `YYYY-MM-DD HH24:MI:SS.FF TZH:TZM` and updates it. The time zone information is retrieved from `time.timezone`, which includes the time zone offset from UTC. The user is responsible for setting the TZ environment variable for `time.timezone`. |
| date | TIMESTAMP_TZ, TIMESTAMP_LTZ, TIMESTAMP_NTZ, DATE | Converts a date object into a string in the format of `YYYY-MM-DD`. No time zone is considered. |
| time | TIMESTAMP_TZ, TIMESTAMP_LTZ, TIMESTAMP_NTZ, DATE | Converts a time object into a string in the format of `HH24:MI:SS.FF`. No time zone is considered. |
| timedelta | TIMESTAMP_TZ, TIMESTAMP_LTZ, TIMESTAMP_NTZ, DATE | Converts a timedelta object into a string in the format of `HH24:MI:SS.FF`. No time zone is considered. |

---
title: Python handler examples for stored procedures
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-examples.md
section: Developer Guide
---

# Python handler examples for stored procedures

## Running concurrent tasks with worker processes

You can run concurrent tasks using Python worker processes. You might find this useful when you need to run parallel tasks that take
advantage of multiple CPU cores on warehouse nodes.

> **Note:**
>
> Snowflake recommends that you not use the built-in Python multiprocessing module.

To work around cases where the [Python Global Interpreter Lock](https://wiki.python.org/moin/GlobalInterpreterLock) prevents a
multi-tasking approach from scaling across all CPU cores, you can execute concurrent tasks using separate worker processes, rather than threads.

You can do this on Snowflake warehouses by using the `joblib` library’s `Parallel` class, as in the following example.

```sqlexample-python
CREATE OR REPLACE PROCEDURE joblib_multiprocessing_proc(i INT)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'joblib_multiprocessing'
  PACKAGES = ('snowflake-snowpark-python', 'joblib')
AS $$
import joblib
from math import sqrt

def joblib_multiprocessing(session, i):
  result = joblib.Parallel(n_jobs=-1)(joblib.delayed(sqrt)(i ** 2) for i in range(10))
  return str(result)
$$;
```

> **Note:**
>
> The default backend used for `joblib.Parallel` differs between Snowflake standard and Snowpark-optimized warehouses.
>
> * Standard warehouse default: `threading`
> * Snowpark-optimized warehouse default: `loky` (multiprocessing)
>
> You can override the default backend setting by calling the `joblib.parallel_backend` function, as in the following example.
>
> ```python
> import joblib
> joblib.parallel_backend('loky')
> ```

## Using Snowpark APIs for asynchrononous processing

The following examples illustrate how you can use Snowpark APIs to begin asynchronous child jobs, as well as how those jobs behave under
different conditions.

### Checking the status of an asynchronous child job

In the following example, the `checkStatus` procedure executes an asynchronous child job that waits 60 seconds. The procedure then
checks on the status of the job before it can have finished, so the check returns `False`.

```sqlexample-python
CREATE OR REPLACE PROCEDURE checkStatus()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER='async_handler'
EXECUTE AS CALLER
AS $$
def async_handler(session):
    async_job = session.sql("select system$wait(60)").collect_nowait()
    return async_job.is_done()
$$;
```

The following code calls the procedure.

```sqlexample
CALL checkStatus();
```

```output
+-------------+
| checkStatus |
|-------------|
| False       |
+-------------+
```

### Cancelling an asynchronous child job

In the following example, the `cancelJob` procedure uses SQL to insert data into the `test_tb` table with an asynchronous
child job that would take 10 seconds to finish. It then cancels the child job before it finishes and the data has been inserted.

```sqlexample
CREATE OR REPLACE TABLE test_tb(c1 STRING);
```

```sqlexample-python
CREATE OR REPLACE PROCEDURE cancelJob()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'async_handler'
EXECUTE AS OWNER
AS $$
def async_handler(session):
    async_job = session.sql("insert into test_tb (select system$wait(10))").collect_nowait()
    return async_job.cancel()
$$;

CALL cancelJob();
```

The following code queries the `test_tb` table, but returns no results because no data has been inserted.

```sqlexample
SELECT * FROM test_tb;
```

```output
+----+
| C1 |
|----|
+----+
```

### Waiting and blocking while an asynchronous child job runs

In the following example, the `blockUntilDone` procedure executes an asynchronous child job that takes 5 seconds to finish. Using
the `snowflake.snowpark.AsyncJob.result` method, the procedure waits and returns when the job has finished.

```sqlexample-python
CREATE OR REPLACE PROCEDURE blockUntilDone()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER='async_handler'
EXECUTE AS CALLER
AS $$
def async_handler(session):
    async_job = session.sql("select system$wait(5)").collect_nowait()
    return async_job.result()
$$;
```

The following code calls the `blockUntilDone` procedure, which returns after waiting 5 seconds.

```sqlexample
CALL blockUntilDone();
```

```output
+------------------------------------------+
| blockUntilDone                               |
|------------------------------------------|
| [Row(SYSTEM$WAIT(5)='waited 5 seconds')] |
+------------------------------------------+
```

### Returning an error after requesting results from an unfinished asynchronous child job

In the following example, the `earlyReturn` procedure executes an asynchronous child job that takes 60 seconds to finish. The
procedure then attempts to return a `DataFrame` from the job’s result before it can have finished. The result is an error.

```sqlexample-python
CREATE OR REPLACE PROCEDURE earlyReturn()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER='async_handler'
EXECUTE AS CALLER
AS $$
def async_handler(session):
    async_job = session.sql("select system$wait(60)").collect_nowait()
    df = async_job.to_df()
    try:
        df.collect()
    except Exception as ex:
        return 'Error: (02000): Result for query <UUID> has expired'
$$;
```

The following code calls the `earlyReturn` procedure, returning the error.

```sqlexample
CALL earlyReturn();
```

```output
+------------------------------------------------------------+
| earlyReturn                                                 |
|------------------------------------------------------------|
| Error: (02000): Result for query <UUID> has expired        |
+------------------------------------------------------------+
```

### Finishing a parent job before a child job finishes, canceling the child job

In the following example, the `earlyCancelJob` procedure executes an asynchronous child job to insert data into a table and takes 10
seconds to finish. However, the parent job — `async_handler` — returns before the child job finishes, which cancels the child job.

```sqlexample-python
CREATE OR REPLACE PROCEDURE earlyCancelJob()
RETURNS VARCHAR
LANGUAGE PYTHON
RUNTIME_VERSION = 3.12
PACKAGES = ('snowflake-snowpark-python')
HANDLER='async_handler'
EXECUTE AS OWNER
AS $$
def async_handler(session):
    async_job = session.sql("insert into test_tb (select system$wait(10))").collect_nowait()
$$;
```

The following code calls the `earlyCancelJob` procedure. It then queries the `test_tb` table, which returns no result because
no data was inserted by the canceled child job.

```sqlexample
CALL earlyCancelJob();
SELECT * FROM test_tb;
```

```output
+----+
| C1 |
|----|
+----+
```

## Reading files and assets

### Reading a statically-specified file using IMPORTS

You can read a file by specifying the file name and stage name in the IMPORTS clause of the
[CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) command.

When you specify a file in the IMPORTS clause, Snowflake copies that file from the stage to the stored procedure’s *home directory* (also called the *import directory*), which is the directory from which the stored procedure reads the file.

Snowflake copies imported files to a single directory. All files in that directory must have unique names, so each file in your
IMPORTS clause must have a distinct name. This applies even if the files start out in different stages or different subdirectories within a
stage.

The following example uses an in-line Python handler that reads a file called `file.txt` from a stage named `my_stage`.
The handler retrieves the location of the stored procedure’s home directory using
the Python `sys._xoptions` method with the `snowflake_import_directory` system option.

Snowflake reads the file only once during stored procedure creation,
and will not read it again during stored procedure execution if reading the file happens outside of the target handler.

Create the stored procedure with an in-line handler:

```sqlexample-python
CREATE OR REPLACE PROCEDURE test_file_import_sp()
RETURNS STRING
LANGUAGE PYTHON
PACKAGES = ('snowflake-snowpark-python')
IMPORTS = ('@my_stage/dir/file.txt')
HANDLER = 'run'
RUNTIME_VERSION = 3.12
EXECUTE AS CALLER
AS $$
import os
import sys

def run(session):
  with open(os.path.join(sys._xoptions["snowflake_import_directory"], 'file.txt'), "r") as f:
    return f.read()
$$;
CALL test_file_import_sp();
// return file content
```

### Importing a directory using IMPORTS

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

You can import a directory using the IMPORTS clause of the [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) command.

> **Note:**
>
> * The import path for a directory must end with a trailing slash (`/`). For example, `IMPORTS = ('@my_stage/my_dir/')`.
> * To rename a directory on import, append `/=custom_name/` to the stage path. The custom name must be a single directory name, not a path. For example, `IMPORTS = ('@my_stage/my_dir/=custom_name/')`.
> * Directory imports are not supported in Native Apps.

The following example imports a directory called `my_dir` from a stage named `my_stage` and lists the files it contains.

```sqlexample-python
CREATE OR REPLACE PROCEDURE my_directory_import_list_sp()
RETURNS STRING
LANGUAGE PYTHON
PACKAGES = ('snowflake-snowpark-python')
IMPORTS = ('@my_stage/my_dir/')
HANDLER = 'run'
RUNTIME_VERSION = 3.12
EXECUTE AS CALLER
AS $$
import os
import sys
def list_files(directory):
  files = []
  # Walk through the directory and its subdirectories
  for dirpath, _, filenames in os.walk(directory):
    for filename in filenames:
      # Append the relative path to each file to the list
      full_path = os.path.join(dirpath, filename)
      files.append(os.path.relpath(full_path, directory))
  return files
def run(session):
  directory_path = sys._xoptions["snowflake_import_directory"]
  file_list = list_files(directory_path)
  file_list_str = ' '.join(file_list)
  return file_list_str
$$;
CALL my_directory_import_list_sp();
```

---
title: Python stored procedure limitations
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-limitations.md
section: Developer Guide
---

# Python stored procedure limitations

Stored procedures have the following limitations:

* Creating processes is not supported in stored procedures.
* Running concurrent queries is not supported in stored procedures.
* You cannot use APIs that execute PUT and GET commands, including `Session.sql("PUT ...")` and `Session.sql("GET ...")`.
* When you download files from a stage using `session.file.get`, pattern matching is not supported.
* Creating named temp objects is not supported in an owner’s rights stored procedure. An owner’s rights stored procedure is a stored
  procedure that runs with the privileges of the stored procedure owner.
  For more information, refer to [caller’s rights or owner’s rights](../stored-procedures-rights.md).

---
title: Python UDF handler examples
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-examples.md
section: Developer Guide
---

# Python UDF handler examples

This topic includes simple examples of UDF handler code written in Python.

For information on using Python to create a UDF handler, refer to [Creating Python UDFs](udf-python-creating.md).

Set `runtime_version` to the version of the Python runtime that your code requires. The supported versions of Python are:

> Generally available versions:
>
> * 3.9 (deprecated)
> * 3.10
> * 3.11
> * 3.12
> * 3.13

## Importing a package in an in-line handler

A curated list of third-party packages from Anaconda is available.
For more information, see [Using third-party packages](udf-python-packages.md).

> **Note:**
>
> Before you can use the packages provided by Anaconda, your Snowflake organization administrator must
> acknowledge the Snowflake [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/). For more information,
> see [Using third-party packages from Anaconda](udf-python-packages.md).

The following code shows how to import packages and return their versions.

Create the UDF:

```sqlexample-python
CREATE OR REPLACE FUNCTION py_udf()
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('numpy','pandas','xgboost==1.5.0')
  HANDLER = 'udf'
AS $$
import numpy as np
import pandas as pd
import xgboost as xgb
def udf():
  return [np.__version__, pd.__version__, xgb.__version__]
$$;
```

Call the UDF:

```sqlexample
SELECT py_udf();
```

```output
+-------------+
| PY_UDF()    |
|-------------|
| [           |
|   "1.19.2", |
|   "1.4.0",  |
|   "1.5.0"   |
| ]           |
+-------------+
```

You can use the PACKAGES keyword to specify package versions as follows:

* Without a version (e.g `numpy`)
* Pinned to an exact version (e.g. `numpy==1.25.2`)
* Constrained to a version prefix by using wildcards (e.g. `numpy==1.*`)
* Constrained to a version range (e.g. `numpy>=1.25`)
* Constrained by multiple version specifiers (e.g. `numpy>=1.25,<2`) so that a package that satisfies all version specifiers will be selected.

> **Note:**
>
> Using multiple range operators (e.g. `numpy>=1.25,<2`) is not supported in package policies but you can use them when creating Python UDF, UDTF, and stored procedures.

Here is an example of how to use the wildcard `*` to constrain a package to a version prefix.

```sqlexample-python
CREATE OR REPLACE FUNCTION my_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  PACKAGES = ('numpy==1.*')
  RUNTIME_VERSION = 3.10
  HANDLER = 'echo'
AS $$
def echo():
  return 'hi'
$$;
```

This example shows how to constrain a package to be greater than or equal to a specified version.

```sqlexample-python
CREATE OR REPLACE FUNCTION my_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  PACKAGES = ('numpy>=1.2')
  RUNTIME_VERSION = 3.10
  HANDLER = 'echo'
AS $$
def echo():
  return 'hi'
$$;
```

This example shows how to use multiple package version specifiers.

```sqlexample-python
CREATE OR REPLACE FUNCTION my_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  PACKAGES = ('numpy>=1.2,<2')
  RUNTIME_VERSION = 3.10
  HANDLER = 'echo'
AS $$
def echo():
  return 'hi'
$$;
```

## Reading files and assets

### Reading a file

You can read the contents of a file with Python UDF handler code. For example, you might want to read a file to process unstructured data.

To read the contents of a file, you can:

* Statically specify the file path and name with the IMPORTS clause, then read it from the UDF’s home
  directory. This can be useful when a file name is static, consistent within the function, and you know the file name in advance.
* Dynamically specify the file and read its contents with SnowflakeFile. You might do this if you need to access a file during computation.

#### Reading a statically-specified file using IMPORTS

You can read a file by specifying the file name and stage name in the IMPORTS clause of the
[CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

When you specify a file in the IMPORTS clause, Snowflake copies that file from the stage to the UDF’s
*home directory* (also called the *import directory*), which is the directory from which the UDF reads the file.

Snowflake copies imported files to a single directory. All files in that directory must have unique names, so each file in your
IMPORTS clause must have a distinct name. This applies even if the files start out in different stages or different subdirectories within a
stage.

The following example uses an in-line Python handler that reads a file called `file.txt` from a stage named `my_stage`.
The handler retrieves the location of the UDF’s home directory using
the Python `sys._xoptions` method with the `snowflake_import_directory` system option.

Snowflake reads the file only once during UDF creation,
and will not read it again during UDF execution if reading the file happens outside of the target handler.

Create the UDF with an in-line handler:

```sqlexample-python
CREATE OR REPLACE FUNCTION my_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  IMPORTS = ('@my_stage/file.txt')
  HANDLER = 'compute'
AS $$
import sys
import os

with open(os.path.join(sys._xoptions["snowflake_import_directory"], 'file.txt'), "r") as f:
  s = f.read()

def compute():
  return s
$$;
```

#### Reading a dynamically-specified file with `SnowflakeFile`

You can read a file from a stage using the `SnowflakeFile` class in the Snowpark `snowflake.snowpark.files` module.
The `SnowflakeFile` class provides dynamic file access, which lets you stream files of any size. Dynamic file access is also useful when you want to iterate over multiple files. For example, see Processing multiple files.

The `SnowflakeFile` class has one method for opening a file: `open`. The `open` method returns a `SnowflakeFile` object that extends Python’s `IOBase` file objects.

The `SnowflakeFile` object supports the following `IOBase`, `BufferedIOBase`, and `RawIOBase` methods:

> * `IOBase.fileno`
> * `IOBase.isatty`
> * `IOBase.readable`
> * `IOBase.readinto`
> * `IOBase.readline`
> * `IOBase.readlines`
> * `IOBase.seek`
> * `IOBase.seekable`
> * `IOBase.tell`
> * `BufferedIOBase.readinto1`
> * `RawIOBase.read`
> * `RawIOBase.readall`

For more information, see the [Python 3.12 documentation on IOBase](https://docs.python.org/3.12/library/io.html).
Calling unsupported methods in a Snowflake server, such as the method `fileno`, will return an error.

> **Note:**
>
> By default, file access with `SnowflakeFile` requires scoped URLs in order to make your code resilient to file injection attacks. You can create a scoped URL in SQL using the built-in function [BUILD_SCOPED_FILE_URL](../../../sql-reference/functions/build_scoped_file_url.md). For more information about scoped URLs, see [Types of URLs available to access files](../../../user-guide/unstructured-intro.md). Only users with access to the file can create a scoped URL.

The examples in this section use `SnowflakeFile` to read one or more files from a specified stage location.

##### Prerequisites

Before your Python handler code can read a file on a stage, you must do the following to make the file available to the code:

1. Create a stage that is available to your handler.

   You can use an external stage or internal stage. If you use an internal stage, it can be a user stage when you plan to create a caller’s rights stored procedure.
   Otherwise, you must use a named stage. Snowflake does not currently support using a table stage for UDF dependencies.

   For more on creating a stage, see
   [CREATE STAGE](../../../sql-reference/sql/create-stage.md). For more on choosing an internal stage type, see
   [Choosing an internal stage for local files](../../../user-guide/data-load-local-file-system-create-stage.md).

   Adequate privileges on the stage must be assigned to the following role, depending on your use case:

   > | Use case | Role |
   > | --- | --- |
   > | UDF or owner’s rights stored procedure | The role that owns the executing UDF or stored procedure. |
   > | Caller’s rights stored procedure | The user role. |

   For more information, see [Granting privileges for user-defined functions](../udf-access-control.md).
2. Copy the file that your code will read to the stage.

   You can copy the file from a local drive to an internal stage using the [PUT](../../../sql-reference/sql/put.md) command.
   For information on staging files with PUT, see [Staging data files from a local file system](../../../user-guide/data-load-local-file-system-stage.md).

   You can copy the file from a local drive to an external stage location using any of the tools provided by your cloud storage service.
   For help, see the documentation for your cloud storage service.

##### Calculating the perceptual hash of an image with an in-line Python handler

This example uses `SnowflakeFile` to read a pair of staged image files and use the [perceptual hash](https://www.phash.org/)
(pHash) of each file to determine how similar the images are to each other.

Create a UDF that returns the phash value of an image, specifying the input mode as binary by passing `rb` for the `mode` argument:

```sqlexample-python
CREATE OR REPLACE FUNCTION calc_phash(file_path STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('snowflake-snowpark-python','imagehash','pillow')
  HANDLER = 'run'
AS $$
from PIL import Image
import imagehash
from snowflake.snowpark.files import SnowflakeFile

def run(file_path):
  with SnowflakeFile.open(file_path, 'rb') as f:
  return imagehash.average_hash(Image.open(f))
$$;
```

Create a second UDF that calculates the distance between the phash values of two images:

```sqlexample-python
CREATE OR REPLACE FUNCTION calc_phash_distance(h1 STRING, h2 STRING)
  RETURNS INT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('imagehash')
  HANDLER = 'run'
AS $$
import imagehash

def run(h1, h2):
  return imagehash.hex_to_hash(h1) - imagehash.hex_to_hash(h2)
$$;
```

Stage the image files and refresh the directory table:

```sqlexample
PUT file:///tmp/image1.jpg @images AUTO_COMPRESS=FALSE;
PUT file:///tmp/image2.jpg @images AUTO_COMPRESS=FALSE;

ALTER STAGE images REFRESH;
```

Call the UDFs:

```sqlexample
SELECT
  calc_phash_distance(
    calc_phash(build_scoped_file_url(@images, 'image1.jpg')),
    calc_phash(build_scoped_file_url(@images, 'image2.jpg'))
  ) ;
```

##### Processing a CSV file with a UDTF

This example uses `SnowflakeFile` to create a UDTF that extracts the contents
of a CSV file and returns the rows in a table.

Create the UDTF with an in-line handler:

```sqlexample-python
CREATE FUNCTION parse_csv(file_path STRING)
  RETURNS TABLE (col1 STRING, col2 STRING, col3 STRING)
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('snowflake-snowpark-python')
  HANDLER = 'csvparser'
AS $$
from snowflake.snowpark.files import SnowflakeFile

class csvparser:
  def process(self, stagefile):
    with SnowflakeFile.open(stagefile) as f:
      for line in f.readlines():
        lineStr = line.strip()
        row = lineStr.split(",")
        try:
          # Read the columns from the line.
          yield (row[1], row[0], row[2], )
        except:
          pass
$$;
```

Stage the CSV file and refresh the directory table:

```sqlexample
PUT file:///tmp/sample.csv @data_stage AUTO_COMPRESS=FALSE;

ALTER STAGE data_stage REFRESH;
```

Call the UDTF, passing a file URL:

```sqlexample
SELECT * FROM TABLE(PARSE_CSV(build_scoped_file_url(@data_stage, 'sample.csv')));
```

##### Processing multiple files

You can read and process multiple files by passing the RELATIVE_PATH column of a directory table to your handler. For more information on the RELATIVE_PATH column, see [the output from a directory table query](../../../user-guide/data-load-dirtables-query.md).

> **Note:**
>
> Depending on your file size and compute needs, you might want to use [ALTER WAREHOUSE](../../../sql-reference/sql/alter-warehouse.md) to scale your warehouse up before you execute a statement that reads and processes multiple files.

Call a UDF to process multiple files:
:   The following example calls a UDF within a CREATE TABLE statement to process each file on a stage and then store the results in a new table.

    For demonstration purposes, the example assumes the following:

    * There are multiple text files on a stage named `my_stage`.
    * There is an existing UDF named `get_sentiment` that performs sentiment analysis on unstructured text. The UDF takes a path to a text file as input and returns a value that represents sentiment.

    ```sqlexample
    CREATE OR REPLACE TABLE sentiment_results AS
    SELECT
      relative_path
      , get_sentiment(build_scoped_file_url(@my_stage, relative_path)) AS sentiment
    FROM directory(@my_stage);
    ```

Call a UDTF to process multiple files:
:   This next example calls a UDTF named `parse_excel_udtf`. The example passes the `relative_path` from the directory table on the stage named `my_excel_stage`.

    ```sqlexample
    SELECT t.*
    FROM directory(@my_stage) d,
    TABLE(parse_excel_udtf(build_scoped_file_url(@my_excel_stage, relative_path)) t;
    ```

##### Reading files with stage URIs and URLs

File access with `SnowflakeFile` requires scoped URLs by default. This makes your code resilient to file injection attacks. However, you can refer to a file location using a stage URI or a stage URL instead. To do so, you must call the `SnowflakeFile.open` method with the keyword argument `require_scoped_url = False`.

This option is useful when you want to let a caller provide a URI that is accessible only to the UDF owner. For example, you might use a stage URI for file access if you own a UDF and you want to read in your configuration files or machine learning models. We do not recommend this option when you work with files that have unpredictable names, such as files that are created based on user input.

This example reads a machine learning model from a file and uses the model in a function to perform natural language processing for sentiment analysis. The example calls the `open` with `require_scoped_url = False`. In both file location formats (stage URI and stage URL), the UDF owner must have access to the model file.

Create the UDF with an in-line handler:

```sqlexample-python
CREATE OR REPLACE FUNCTION extract_sentiment(input_data STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('snowflake-snowpark-python','scikit-learn')
  HANDLER = 'run'
AS $$
from snowflake.snowpark.files import SnowflakeFile
from sklearn.linear_model import SGDClassifier
import pickle

def run(input_data):
  model_file = '@models/NLP_model.pickle'
  # Specify 'mode = rb' to open the file in binary mode.
  with SnowflakeFile.open(model_file, 'rb', require_scoped_url = False) as f:
    model = pickle.load(f)
    return model.predict([input_data])[0]
$$;
```

Stage the model file and refresh the directory table:

```sqlexample
PUT file:///tmp/NLP_model.pickle @models AUTO_COMPRESS=FALSE;

ALTER STAGE models REFRESH;
```

Alternatively, you can specify the UDF with the model’s stage URL to extract the sentiment.

For example, create a UDF with an in-line handler that specifies a file using a stage URL:

```sqlexample-python
CREATE OR REPLACE FUNCTION extract_sentiment(input_data STRING)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('snowflake-snowpark-python','scikit-learn')
  HANDLER = 'run'
AS $$
from snowflake.snowpark.files import SnowflakeFile
from sklearn.linear_model import SGDClassifier
import pickle

def run(input_data):
  model_file = 'https://my_account/api/files/my_db/my_schema/models/NLP_model.pickle'
  # Specify 'rb' to open the file in binary mode.
  with SnowflakeFile.open(model_file, 'rb', require_scoped_url = False) as f:
    model = pickle.load(f)
    return model.predict([input_data])[0]
$$;
```

Call the UDF with the input data:

```sqlexample
SELECT extract_sentiment('I am writing to express my interest in a recent posting made.');
```

### Importing directories and repositories

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

You can use the IMPORTS clause to import entire directories or Git repositories, in addition to individual files.

> **Note:**
>
> * The import path for a directory must end with a trailing slash (`/`). For example, `IMPORTS = ('@my_stage/my_dir/')`.
> * To rename a directory on import, append `/=custom_name/` to the stage path. The custom name must be a single directory name, not a path. For example, `IMPORTS = ('@my_stage/my_dir/=custom_name/')`.
> * Directory imports are not supported in Native Apps.

#### Importing a directory using IMPORTS

You can import a directory using the IMPORTS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

The following example imports a directory called `my_dir` from a stage named `my_stage` and lists the files it contains.

```sqlexample-python
CREATE OR REPLACE FUNCTION my_directory_import_list_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  IMPORTS = ('@my_stage/my_dir/')
  PACKAGES = ('snowflake-snowpark-python')
  RUNTIME_VERSION = 3.12
  HANDLER = 'run'
AS $$
import os
import sys
def list_files(directory):
  files = []
  # Walk through the directory and its subdirectories
  for dirpath, _, filenames in os.walk(directory):
    for filename in filenames:
      # Append the relative path to each file to the list
      full_path = os.path.join(dirpath, filename)
      files.append(os.path.relpath(full_path, directory))
  return files
def run():
    directory_path = sys._xoptions["snowflake_import_directory"]
    file_list = list_files(directory_path)
    file_list_str = ' '.join(file_list)
    return file_list_str
$$;
select my_directory_import_list_udf();
```

#### Importing from the root of a stage using IMPORTS

You can import from the root of a stage using the IMPORTS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

The following example imports from the root of a stage named `my_stage`. The directory is renamed to `customer_dir` to access and read the files within it.

```sqlexample-python
CREATE OR REPLACE FUNCTION my_directory_import_read_udf()
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  IMPORTS = ('@my_stage/=customer_dir/')
  HANDLER = 'run'
AS $$
import sys
import os

with open(os.path.join(sys._xoptions["snowflake_import_directory"], 'customer_dir', 'my_dir', 'file.txt'), "r") as f:
  s = f.read()

def run():
  return s
$$;
select my_directory_import_read_udf();
```

#### Importing from a Git repo using IMPORTS

You can point imports at a Git repo stored in Snowflake (SnowGit), enabling a cleaner deployment workflow.
For more information, see [CREATE GIT REPOSITORY](../../../sql-reference/sql/create-git-repository.md).

```sqlexample-python
CREATE OR REPLACE FUNCTION run_adapter(x STRING)
  RETURNS STRING
LANGUAGE PYTHON
RUNTIME_VERSION = '3.13'
IMPORTS = ('@my_snow_git/branches/main/src/adapter/')
HANDLER = 'run'
AS $$
from adapter.core import transform

def run(x: str) -> str:
  return transform(x)
$$;
```

## Writing files

A UDF handler can write files to a `/tmp` directory created for the query calling the UDF.

Keep in mind that a `/tmp` directory is set aside for a single calling query, yet multiple Python worker processes might be running at the
same time. To prevent collisions, you must ensure either that access to the /tmp directory is synchronized with other Python worker
processes or that the names of files written to /tmp are unique.

For example code, see Unzipping a staged file in this topic.

Code in the following example writes the input `text` to the `/tmp` directory. It also appends the function’s process ID to ensure the
file location’s uniqueness.

```python
def func(text):
  # Append the function's process ID to ensure the file name's uniqueness.
  file_path = '/tmp/content' + str(os.getpid())
  with open(file_path, "w") as file:
    file.write(text)
```

For information on writing files, see [Writing files from Snowpark Python UDFs and UDTFs](../../snowpark/python/creating-udfs.md).

## Unzipping a staged file

You can store a .zip file on a stage, then unzip it in a UDF by using the Python zipfile module.

For example, you can upload a .zip file to a stage, then reference the .zip file at its staged location in the IMPORTS clause when you
create the UDF. At run time, Snowflake will copy the staged file into an import directory from which your code can access it.

For more about reading and writing files, see Reading files and assets and Writing files.

In the following example, the UDF code uses an NLP model to discover entities in text. The code returns an array of these entities.
To set up the NLP model for processing the text, the code first uses the zipfile module to extract the file for the model
(en_core_web_sm-2.3.1) from a .zip file. The code then uses the spaCy module to load the model from the file.

Note that the code writes extracted file contents to the /tmp directory created for the query calling this function. The code uses file
locks to ensure that the extraction is synchronized across Python worker processes; this way, contents are unzipped only once. For more about
writing files, see Writing files.

For more about the zipfile module, see the [zipfile reference](https://docs.python.org/3/library/zipfile.html). For more about the spaCy
module, see the [spaCy API documentation](https://spacy.io/api).

Create the UDF with an in-line handler:

```sqlexample-python
CREATE OR REPLACE FUNCTION py_spacy(str STRING)
  RETURNS ARRAY
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'func'
  PACKAGES = ('spacy')
  IMPORTS = ('@spacy_stage/spacy_en_core_web_sm.zip')
AS $$
import fcntl
import os
import spacy
import sys
import threading
import zipfile

 # File lock class for synchronizing write access to /tmp.
 class FileLock:
   def __enter__(self):
       self._lock = threading.Lock()
       self._lock.acquire()
       self._fd = open('/tmp/lockfile.LOCK', 'w+')
       fcntl.lockf(self._fd, fcntl.LOCK_EX)

    def __exit__(self, type, value, traceback):
       self._fd.close()
       self._lock.release()

 # Get the location of the import directory. Snowflake sets the import
 # directory location so code can retrieve the location via sys._xoptions.
 IMPORT_DIRECTORY_NAME = "snowflake_import_directory"
 import_dir = sys._xoptions[IMPORT_DIRECTORY_NAME]

 # Get the path to the ZIP file and set the location to extract to.
 zip_file_path = import_dir + "spacy_en_core_web_sm.zip"
 extracted = '/tmp/en_core_web_sm'

 # Extract the contents of the ZIP. This is done under the file lock
 # to ensure that only one worker process unzips the contents.
 with FileLock():
    if not os.path.isdir(extracted + '/en_core_web_sm/en_core_web_sm-2.3.1'):
       with zipfile.ZipFile(zip_file_path, 'r') as myzip:
          myzip.extractall(extracted)

 # Load the model from the extracted file.
 nlp = spacy.load(extracted + "/en_core_web_sm/en_core_web_sm-2.3.1")

 def func(text):
    doc = nlp(text)
    result = []

    for ent in doc.ents:
       result.append((ent.text, ent.start_char, ent.end_char, ent.label_))
    return result
 $$;
```

## Handling NULL values

The following code shows how NULL values are handled.
For more information, see [NULL values](udf-python-designing.md).

Create the UDF:

```sqlexample-python
CREATE OR REPLACE FUNCTION py_udf_null(a VARIANT)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'udf'
AS $$

def udf(a):
   if not a:
       return 'JSON null'
   elif getattr(a, "is_sql_null", False):
       return 'SQL null'
   else:
       return 'not null'
$$;
```

Call the UDF:

```sqlexample
SELECT py_udf_null(null);
```

```output
+-------------------+
| PY_UDF_NULL(NULL) |
|-------------------|
| SQL null          |
+-------------------+
```

```sqlexample
SELECT py_udf_null(parse_json('null'));
```

```output
+---------------------------------+
| PY_UDF_NULL(PARSE_JSON('NULL')) |
|---------------------------------|
| JSON null                       |
+---------------------------------+
```

```sqlexample
SELECT py_udf_null(10);
```

```output
+-----------------+
| PY_UDF_NULL(10) |
|-----------------|
| not null        |
+-----------------+
```

---
title: Python UDF limitations
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-limitations.md
section: Developer Guide
---

# Python UDF limitations

This topic describes the limitations in place for handlers written in Python.

## General limitations

* Although your Python function can use modules and functions in the standard Python packages, Snowflake security
  constraints disable some capabilities. For details, see the section
  titled [Following good security practices](udf-python-designing.md).
* Avoid code that assumes a specific operating system.
* Python UDFs are not sharable. Database objects that use Python UDFs are also not sharable. For example, you cannot:

  + Directly share a Python UDF.
  + Share a view that calls a Python UDF.
  + Share a function that calls a Python UDF.
  + Share a table with a masking or row access policy that calls a Python UDF.
* Granting USAGE privilege on a Python UDF might allow the recipient to see the contents of files imported by that UDF. If you grant the
  USAGE privilege on a Python UDF to a role, and if that role executes a statement that calls that Python UDF, then any Python UDF in the same
  statement could read the contents of any files imported by the Python UDF on which you granted USAGE privilege.
* Database [replication](../../../user-guide/account-replication-intro.md) is supported for in-line Python UDFs. However, replication is blocked if a Python UDF has a dependency on a file in a stage (i.e.
  a function created using the IMPORTS clause). This limitation might be removed in future versions.
* Snowflake uses the Python `zipimport` module to import Python code from stages. As a result, any `zipimport` limitations
  will also be present with UDFs. For more about `zipimport`, see the
  [zipimport reference](https://docs.python.org/3/library/zipimport.html).

## Limitations on cloning

A Python UDF can be cloned when the database or schema containing the Python UDF is cloned.
To be cloned, the Python UDF must meet the following condition(s):

* If the Python UDF references a stage, that stage must be
  outside the schema (or database) being cloned.

  You can keep a Python UDF and its referenced stage(s) in separate schemas (and/or separate databases) the following ways:

  + Wherever the Python UDF references a stage, use a qualified stage name (e.g. “my_db.my_schema.my_stage()”)
    different from the schema or database of the Python UDF. If the cloning operation clones a database, the stage
    reference should include the database and schema. If the cloning operation clones a schema, the stage reference
    should include the schema (and optionally the database).
  + Create the referenced stage by using a non-qualified stage name (which implicitly uses the current session’s active
    database and schema), and create the Python UDF by using a qualified name that does not match the session’s
    current database and schema.
  + Use the user’s stage as the referenced stage (the user’s stage is separate from any database’s stage or schema’s stage).

If one or more Python UDFs in the schema or database do not meet the required conditions, the schema or database can
still be cloned, but the non-compliant Python UDFs are omitted from the clone without any error or warning message.

Each cloned Python UDF has the same definition as the original. That definition includes any references to stages.
The stage references in the Python UDF must be fully-qualified, and therefore are absolute, not relative to the
schema or database being cloned. Because both the original and the clone point to the same stage(s) and file(s):

* Dropping the stage or removing required files from the stage disables both the original and cloned UDF.
* Altering the stage or the files on the stage affects both the
  original and cloned UDF.

---
title: Python user-defined aggregate functions
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-aggregate-functions.md
section: Developer Guide
---

# Python user-defined aggregate functions

User-defined aggregate functions (UDAFs) take one or more rows as input and produce a single row of output.
They operate on values across multiple rows to perform mathematical calculations such as sum, average, counting,
finding minimum or maximum values, standard deviation, and estimation, as well as some non-mathematical operations.

Python UDAFs provide a way for you to write your own aggregate functions that are similar to the Snowflake system-defined
SQL [aggregate functions](../../../sql-reference/functions-aggregation.md).

You can also create your own UDAFs using Snowpark APIs as described in [Creating User-Defined Aggregate Functions (UDAFs) for DataFrames in Python](../../snowpark/python/creating-udafs.md).

## Limitations

* The `aggregate_state` has a maximum size of 64 MB in a serialized version, so try to control the size of the aggregate state.
* You can’t call a UDAF as a [window function](../../../sql-reference/functions-window.md) (in other words, with an OVER clause).
* IMMUTABLE is not supported on an aggregate function (when you use the AGGREGATE parameter). Therefore, all aggregate functions are
  VOLATILE by default.
* User-defined aggregate functions cannot be used in conjunction with the WITHIN GROUP clause. Queries will fail to execute.

## Interface for aggregate function handler

An aggregate function aggregates state in child nodes and then, eventually, those aggregate states are serialized and sent to the parent
node where they get merged and the final result is calculated.

To define an aggregate function, you must define a Python class (which is the function’s handler) that includes methods that Snowflake
invokes at run time. Those methods are described in the table below. See examples elsewhere in this topic.

| Method | Requirement | Description |
| --- | --- | --- |
| `__init__` | Required | Initializes the internal state of an aggregate. |
| `aggregate_state` | Required | Returns the current state of an aggregate.   * The method must have a [@property decorator](https://docs.python.org/3.12/library/functions.html#property). * An aggregate state object can be any Python data type serializable by the   [Python pickle library](https://docs.python.org/3/library/pickle.html#what-can-be-pickled-and-unpickled). * For simple aggregate states, use a primitive Python data type. For more complex aggregate states, use   [Python data classes](https://docs.python.org/3/library/dataclasses.html). |
| `accumulate` | Required | Accumulates the state of the aggregate based on the new input row. |
| `merge` | Required | Combines two intermediate aggregated states. |
| `finish` | Required | Produces the final result based on the aggregated state. |

## Example: Calculate a sum

Code in the following example defines a `python_sum` user-defined aggregate function (UDAF) to return the sum of the numeric values.

1. Create the UDAF.

   ```sqlexample-python
   CREATE OR REPLACE AGGREGATE FUNCTION PYTHON_SUM(a INT)
     RETURNS INT
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.12
     HANDLER = 'PythonSum'
   AS $$
   class PythonSum:
     def __init__(self):
       # This aggregate state is a primitive Python data type.
       self._partial_sum = 0

     @property
     def aggregate_state(self):
       return self._partial_sum

     def accumulate(self, input_value):
       self._partial_sum += input_value

     def merge(self, other_partial_sum):
       self._partial_sum += other_partial_sum

     def finish(self):
       return self._partial_sum
   $$;
   ```
2. Create a table of test data.

   ```sqlexample
   CREATE OR REPLACE TABLE sales(item STRING, price INT);

   INSERT INTO sales VALUES ('car', 10000), ('motorcycle', 5000), ('car', 7500), ('motorcycle', 3500), ('motorcycle', 1500), ('car', 20000);

   SELECT * FROM sales;
   ```
3. Call the `python_sum` UDAF.

   ```sqlexample
   SELECT python_sum(price) FROM sales;
   ```
4. Compare results with the output of the Snowflake system-defined SQL function, [SUM](../../../sql-reference/functions/sum.md), and see that the result
   is the same.

   ```sqlexample
   SELECT sum(col) FROM sales;
   ```
5. Group by sum values by the item type in the sales table.

   ```sqlexample
   SELECT item, python_sum(price) FROM sales GROUP BY item;
   ```

## Example: Calculate an average

Code in the following example defines a `python_avg` user-defined aggregate function to return the average of the numeric values.

1. Create the function.

   ```sqlexample-python
   CREATE OR REPLACE AGGREGATE FUNCTION python_avg(a INT)
     RETURNS FLOAT
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.12
     HANDLER = 'PythonAvg'
   AS $$
   from dataclasses import dataclass

   @dataclass
   class AvgAggState:
       count: int
       sum: int

   class PythonAvg:
       def __init__(self):
           # This aggregate state is an object data type.
           self._agg_state = AvgAggState(0, 0)

       @property
       def aggregate_state(self):
           return self._agg_state

       def accumulate(self, input_value):
           sum = self._agg_state.sum
           count = self._agg_state.count

           self._agg_state.sum = sum + input_value
           self._agg_state.count = count + 1

       def merge(self, other_agg_state):
           sum = self._agg_state.sum
           count = self._agg_state.count

           other_sum = other_agg_state.sum
           other_count = other_agg_state.count

           self._agg_state.sum = sum + other_sum
           self._agg_state.count = count + other_count

       def finish(self):
           sum = self._agg_state.sum
           count = self._agg_state.count
           return sum / count
   $$;
   ```
2. Create a table of test data.

   ```sqlexample
   CREATE OR REPLACE TABLE sales(item STRING, price INT);
   INSERT INTO sales VALUES ('car', 10000), ('motorcycle', 5000), ('car', 7500), ('motorcycle', 3500), ('motorcycle', 1500), ('car', 20000);
   ```
3. Call the `python_avg` user-defined function.

   ```sqlexample
   SELECT python_avg(price) FROM sales;
   ```
4. Compare results with the output of the Snowflake system-defined SQL function, [AVG](../../../sql-reference/functions/avg.md), and see that the
   result is the same.

   ```sqlexample
   SELECT avg(price) FROM sales;
   ```
5. Group average values by the item type in the sales table.

   ```sqlexample
   SELECT item, python_avg(price) FROM sales GROUP BY item;
   ```

## Example: Return only unique values

Code in the following example takes an array and returns an array containing only the unique values.

```sqlexample-python
CREATE OR REPLACE AGGREGATE FUNCTION pythonGetUniqueValues(input ARRAY)
  RETURNS ARRAY
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'PythonGetUniqueValues'
AS $$
class PythonGetUniqueValues:
    def __init__(self):
        self._agg_state = set()

    @property
    def aggregate_state(self):
        return self._agg_state

    def accumulate(self, input):
        self._agg_state.update(input)

    def merge(self, other_agg_state):
        self._agg_state.update(other_agg_state)

    def finish(self):
        return list(self._agg_state)
$$;
```

```sqlexample
CREATE OR REPLACE TABLE array_table(x array) AS
SELECT ARRAY_CONSTRUCT(0, 1, 2, 3, 4, 'foo', 'bar', 'snowflake') UNION ALL
SELECT ARRAY_CONSTRUCT(1, 3, 5, 7, 9, 'foo', 'barbar', 'snowpark') UNION ALL
SELECT ARRAY_CONSTRUCT(0, 2, 4, 6, 8, 'snow');

SELECT * FROM array_table;

SELECT pythonGetUniqueValues(x) FROM array_table;
```

## Example: Return a count of strings

Code in the following example returns counts of all instances of strings in an object.

```sqlexample-python
CREATE OR REPLACE AGGREGATE FUNCTION pythonMapCount(input STRING)
  RETURNS OBJECT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'PythonMapCount'
AS $$
from collections import defaultdict

class PythonMapCount:
    def __init__(self):
        self._agg_state = defaultdict(int)

    @property
    def aggregate_state(self):
        return self._agg_state

    def accumulate(self, input):
        # Increment count of lowercase input
        self._agg_state[input.lower()] += 1

    def merge(self, other_agg_state):
        for item, count in other_agg_state.items():
            self._agg_state[item] += count

    def finish(self):
        return dict(self._agg_state)
$$;
```

```sqlexample
CREATE OR REPLACE TABLE string_table(x STRING);
INSERT INTO string_table SELECT 'foo' FROM TABLE(GENERATOR(ROWCOUNT => 1000));
INSERT INTO string_table SELECT 'bar' FROM TABLE(GENERATOR(ROWCOUNT => 2000));
INSERT INTO string_table SELECT 'snowflake' FROM TABLE(GENERATOR(ROWCOUNT => 50));
INSERT INTO string_table SELECT 'snowpark' FROM TABLE(GENERATOR(ROWCOUNT => 123));
INSERT INTO string_table SELECT 'SnOw' FROM TABLE(GENERATOR(ROWCOUNT => 1));
INSERT INTO string_table SELECT 'snow' FROM TABLE(GENERATOR(ROWCOUNT => 4));

SELECT pythonMapCount(x) FROM string_table;
```

## Example: Return top k largest values

Code in the following example returns a list of the top largest values for `k`. The code accumulates negated input values on a min
heap, then returns the top `k` largest values.

```sqlexample-python
CREATE OR REPLACE AGGREGATE FUNCTION pythonTopK(input INT, k INT)
  RETURNS ARRAY
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'PythonTopK'
AS $$
import heapq
from dataclasses import dataclass
import itertools
from typing import List

@dataclass
class AggState:
    minheap: List[int]
    k: int

class PythonTopK:
    def __init__(self):
        self._agg_state = AggState([], 0)

    @property
    def aggregate_state(self):
        return self._agg_state

    @staticmethod
    def get_top_k_items(minheap, k):
      # Return k smallest elements if there are more than k elements on the min heap.
      if (len(minheap) > k):
        return [heapq.heappop(minheap) for i in range(k)]
      return minheap

    def accumulate(self, input, k):
        self._agg_state.k = k

        # Store the input as negative value, as heapq is a min heap.
        heapq.heappush(self._agg_state.minheap, -input)

        # Store only top k items on the min heap.
        self._agg_state.minheap = self.get_top_k_items(self._agg_state.minheap, k)

    def merge(self, other_agg_state):
        k = self._agg_state.k if self._agg_state.k > 0 else other_agg_state.k

        # Merge two min heaps by popping off elements from one and pushing them onto another.
        while(len(other_agg_state.minheap) > 0):
            heapq.heappush(self._agg_state.minheap, heapq.heappop(other_agg_state.minheap))

        # Store only k elements on the min heap.
        self._agg_state.minheap = self.get_top_k_items(self._agg_state.minheap, k)

    def finish(self):
        return [-x for x in self._agg_state.minheap]
$$;
```

```sqlexample
CREATE OR REPLACE TABLE numbers_table(num_column INT);
INSERT INTO numbers_table SELECT 5 FROM TABLE(GENERATOR(ROWCOUNT => 10));
INSERT INTO numbers_table SELECT 1 FROM TABLE(GENERATOR(ROWCOUNT => 10));
INSERT INTO numbers_table SELECT 9 FROM TABLE(GENERATOR(ROWCOUNT => 10));
INSERT INTO numbers_table SELECT 7 FROM TABLE(GENERATOR(ROWCOUNT => 10));
INSERT INTO numbers_table SELECT 10 FROM TABLE(GENERATOR(ROWCOUNT => 10));
INSERT INTO numbers_table SELECT 3 FROM TABLE(GENERATOR(ROWCOUNT => 10));

-- Return top 15 largest values from numbers_table.
SELECT pythonTopK(num_column, 15) FROM numbers_table;
```

---
title: Reading files with a Java stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-read-files.md
section: Developer Guide
---

# Reading files with a Java stored procedure

You can read the contents of a file with handler code. The file must be on a Snowflake stage that’s available to your handler.
For example, you might want to read a file to process unstructured data in the handler.

To read the contents of staged files, your handler can call methods in either the `SnowflakeFile` class or the `InputStream`
class. You might do this if you need to access the file dynamically during compute. For more information, see
Reading a dynamically-specified file with SnowflakeFile or Reading a dynamically-specified file with InputStream in this topic.

`SnowflakeFile` provides features not available with `InputStream`, as described in the following table.

| Class | Input | Notes |
| --- | --- | --- |
| `SnowflakeFile` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner. * File URL or string path for files that the procedure owner has access to.   The file must be located in a named internal stage or an external stage. | Easily access additional file attributes, such as file size. |
| `InputStream` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner.   The file must be located in a named internal stage or an external stage. |  |

## Reading a dynamically-specified file with `SnowflakeFile`

Code in the following example has a handler function `execute` that takes a `String` and returns a `String`
with the file’s contents. At run time, Snowflake initializes the handler’s `fileName` variable from the incoming file path in the
procedure’s `input` variable. The handler code uses a `SnowflakeFile` instance to read the file.

```sqlexample-java
CREATE OR REPLACE PROCEDURE file_reader_java_proc_snowflakefile(input VARCHAR)
RETURNS VARCHAR
LANGUAGE JAVA
RUNTIME_VERSION = 11
HANDLER = 'FileReader.execute'
PACKAGES=('com.snowflake:snowpark:latest')
AS $$
import java.io.InputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import com.snowflake.snowpark_java.types.SnowflakeFile;
import com.snowflake.snowpark_java.Session;

class FileReader {
  public String execute(Session session, String fileName) throws IOException {
    InputStream input = SnowflakeFile.newInstance(fileName).getInputStream();
    return new String(input.readAllBytes(), StandardCharsets.UTF_8);
  }
}
$$;
```

Code in the following CALL example creates a scoped file URL that points to the file. This is an encoded URL that permits temporary
access to a staged file without granting privileges to the stage itself.

```sqlexample
CALL file_reader_java_proc_snowflakefile(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

> **Note:**
>
> For an owner’s rights stored procedure, the procedure’s owner must have access to any files that are not scoped URLs. For caller’s rights
> procedures, the caller must have access to any files that are not scoped URLs. In either case, you can read the staged file by having the
> handler code call the `SnowflakeFile.newInstance` method with a `boolean` value for a new `requireScopedUrl` parameter.
>
> The following example uses `SnowflakeFile.newInstance` while specifying that a scoped URL is not required.
>
> ```java
> String filename = "@my_stage/filename.txt";
> SnowflakeFile sfFile = SnowflakeFile.newInstance(filename, false);
> ```

## Reading a dynamically-specified file with `InputStream`

Code in the following example has a handler function `execute` that takes an `InputStream` and returns a `String`
with the file’s contents. At run time, Snowflake initializes the handler’s `stream` variable from the incoming file path in the
procedure’s `input` argument. The handler code uses the `InputStream` to read the file.

```sqlexample-java
CREATE OR REPLACE PROCEDURE file_reader_java_proc_input(input VARCHAR)
RETURNS VARCHAR
LANGUAGE JAVA
RUNTIME_VERSION = 11
HANDLER = 'FileReader.execute'
PACKAGES=('com.snowflake:snowpark:latest')
AS $$
import java.io.InputStream;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import com.snowflake.snowpark.Session;

class FileReader {
  public String execute(Session session, InputStream stream) throws IOException {
    String contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8);
    return contents;
  }
}
$$;
```

Code in the following CALL example creates a scoped file URL that points to the file. This is an encoded URL that permits temporary
access to a staged file without granting privileges to the stage itself.

```sqlexample
CALL file_reader_java_proc_input(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

---
title: Reading files with a Python stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-read-files.md
section: Developer Guide
---

# Reading files with a Python stored procedure

## Reading from stages

By using the `SnowflakeFile` class in the Snowpark `snowflake.snowpark.files` module, your Python handler can dynamically read a
file [from either internal or external stages](../../../sql-reference/sql/create-stage.md).

Snowflake supports reading files with `SnowflakeFile` for both stored procedures and user-defined functions. For more information
about reading files in your handler code, as well as more examples, refer to [Reading a File with a Python UDF Handler](../../udf/python/udf-python-examples.md).

## Example

This example demonstrates how to create and call an [owner’s rights stored procedure](../stored-procedures-rights.md)
that reads a file using the `SnowflakeFile` class.

Create the stored procedure with an in-line handler, specifying the input mode as binary by passing `rb` for the `mode` argument:

```sqlexample-python
CREATE OR REPLACE PROCEDURE calc_phash(file_path string)
RETURNS STRING
LANGUAGE PYTHON
RUNTIME_VERSION = '3.9'
PACKAGES = ('snowflake-snowpark-python','imagehash','pillow')
HANDLER = 'run'
AS
$$
from PIL import Image
import imagehash
from snowflake.snowpark.files import SnowflakeFile

def run(ignored_session, file_path):
    with SnowflakeFile.open(file_path, 'rb') as f:
        return imagehash.average_hash(Image.open(f))
$$;
```

Call the stored procedure:

```sqlexample
CALL calc_phash(build_scoped_file_url(@my_files, 'my_image.jpg'));
```

---
title: Reading files with a Scala stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-read-files.md
section: Developer Guide
---

# Reading files with a Scala stored procedure

You can read the contents of a file with handler code. The file must be on a Snowflake stage that’s available to your handler.
For example, you might want to read a file to process unstructured data in the handler.

To read the contents of staged files, your handler can call methods in either the `SnowflakeFile` class or the `InputStream`
class. You might do this if you need to access the file dynamically during compute. For more information, see
Reading a dynamically-specified file with SnowflakeFile or Reading a dynamically-specified file with InputStream in this topic.

`SnowflakeFile` provides features not available with `InputStream`, as described in the following table.

| Class | Input | Notes |
| --- | --- | --- |
| `SnowflakeFile` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner. * File URL or string path for files that the UDF owner has access to.   The file must be located in a named internal stage or an external stage. | Easily access additional file attributes, such as file size. |
| `InputStream` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner.   The file must be located in a named internal stage or an external stage. |  |

> **Note:**
>
> For an owner’s rights stored procedure, the procedure’s owner must have access to any files that are not scoped URLs. For caller’s rights
> procedures, the caller must have access to any files that are not scoped URLs. In either case, you can read the staged file by having the
> handler code call the `SnowflakeFile.newInstance` method with a `boolean` value for a new `requireScopedUrl` parameter.
>
> The following example uses `SnowflakeFile.newInstance` while specifying that a scoped URL is not required.
>
> ```scala
> var filename = "@my_stage/filename.txt"
> var sfFile = SnowflakeFile.newInstance(filename, false)
> ```

## Reading a dynamically-specified file with `SnowflakeFile`

Code in the following example has a handler function `execute` that takes a `String` and returns a `String`
with the file’s contents. At run time, Snowflake initializes the handler’s `fileName` variable from the incoming file path in the
procedure’s `input` variable. The handler code uses a `SnowflakeFile` instance to read the file.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE file_reader_scala_proc_snowflakefile(input VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER = 'FileReader.execute'
  PACKAGES=('com.snowflake:snowpark_2.12:latest')
  AS $$
  import java.io.InputStream
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.types.SnowflakeFile
  import com.snowflake.snowpark_java.Session

  object FileReader {
    def execute(session: Session, fileName: String): String = {
      var input: InputStream = SnowflakeFile.newInstance(fileName).getInputStream()
      return new String(input.readAllBytes(), StandardCharsets.UTF_8)
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE file_reader_scala_proc_snowflakefile(input VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER = 'FileReader.execute'
  PACKAGES=('com.snowflake:snowpark_2.13:latest')
  AS $$
  import java.io.InputStream
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.types.SnowflakeFile
  import com.snowflake.snowpark_java.Session

  object FileReader {
    def execute(session: Session, fileName: String): String = {
      var input: InputStream = SnowflakeFile.newInstance(fileName).getInputStream()
      return new String(input.readAllBytes(), StandardCharsets.UTF_8)
    }
  }
  $$;
```

Code in the following CALL example creates a scoped file URL that points to the file. This is an encoded URL that permits temporary
access to a staged file without granting privileges to the stage itself.

```sqlexample
CALL file_reader_scala_proc_snowflakefile(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

## Reading a dynamically-specified file with `InputStream`

Code in the following example defines a handler function `execute` that takes an `InputStream` and returns a `String`
with the file’s contents. At run time, Snowflake initializes the handler’s `stream` variable from the incoming file path in the
procedure’s `input` variable. The handler code uses the `InputStream` to read the file.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE file_reader_scala_proc_input(input VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER = 'FileReader.execute'
  PACKAGES=('com.snowflake:snowpark_2.12:latest')
  AS $$
  import java.io.InputStream
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.Session

  object FileReader {
    def execute(session: Session, stream: InputStream): String = {
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)
      return contents
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE file_reader_scala_proc_input(input VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER = 'FileReader.execute'
  PACKAGES=('com.snowflake:snowpark_2.13:latest')
  AS $$
  import java.io.InputStream
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.Session

  object FileReader {
    def execute(session: Session, stream: InputStream): String = {
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)
      return contents
    }
  }
  $$;
```

Code in the following CALL example creates an encoded scoped file URL that points to the file. An encoded URL permits temporary
access to a staged file without granting privileges to the stage itself.

```sqlexample
CALL file_reader_scala_proc_input(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

---
title: Restricted caller’s rights
source: https://docs.snowflake.com/en/developer-guide/restricted-callers-rights.md
section: Developer Guide
---

# Restricted caller’s rights

An executable such as a stored procedure, Snowpark Container Services service, or Streamlit in Snowflake app can run with privileges from the owner of
the executable (owner’s rights) or from the caller of the executable (caller’s rights). If an executable runs with caller’s
rights, the executable can perform an action only if the caller has privileges to perform that action outside the context of the executable.

Restricted caller’s rights allows an executable to run with caller’s rights, but restricts which of the caller’s privileges the executable
runs with. With restricted caller’s rights, an executable cannot run with a specific privilege unless an administrator expressly allows it.

## About caller grants

Administrators use *caller grants* to define which of the caller’s privileges an executable can run with. For example, if
a caller has SELECT and INSERT privileges on a table, but there isn’t a caller grant that allows the executable to run with the INSERT
privilege, then the executable with restricted caller’s rights cannot run with the INSERT privilege when acting upon the table.

A caller grant doesn’t give any privileges but rather restricts which of the caller’s existing privileges are used when they run
the executable. For example, if a caller runs a stored procedure to select from a table, the caller must already have the SELECT
privilege on the table and the caller grant must allow the stored procedure to run with the SELECT privilege.

Caller grants are granted by the administrator to the role that owns an executable. The caller grants are granted on objects
such as tables and warehouses that the executable accesses. When the executable attempts to access the objects, the caller grants associated
with the owner of the executable are used to determine which of the caller’s privileges can be used for the operation.

## Executables that run with restricted caller’s rights

The user who creates an executable defines whether the executable runs with owner’s rights, caller’s rights, or restricted caller’s rights.
If they choose restricted caller’s rights, every privilege required by the executable must be specified in one or more caller grants that
are granted to the owner of the executable.

For a stored procedure, the `EXECUTE AS` parameter defines whether the procedure runs with owner’s rights, caller’s rights, or
restricted caller’s rights. The following is an example of defining the procedure to run with restricted caller’s rights:

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE sp_pi()
  RETURNS FLOAT NOT NULL
  LANGUAGE JAVASCRIPT
  EXECUTE AS RESTRICTED CALLER
  AS
  $$
  RETURN 3.1415926;
  $$
  ;
```

For a Streamlit in Snowflake app using a container runtime, restricted caller’s rights are configured through code.
For more information, see [Restricted caller’s rights and Streamlit in Snowflake](streamlit/features/restricted-callers-rights.md).

For a list of restrictions on executables that run with restricted caller’s rights, see Limitations of an executable with restricted caller’s rights.

## Grant caller grants

Caller grants are granted on objects such as tables and databases that an executable accesses. The caller grants are granted
to the role or database role that owns the executable.

The GRANT statement that an administrator uses to grant a caller grant has different
variations, depending on how you want to grant caller grants. The variations are as follows:

* GRANT CALLER — Grant caller grants on a specific object. Each caller grant created by the statement allows the executable to
  run with a specified privilege.
* GRANT ALL CALLER PRIVILEGES — Grant caller grants on a specific object. The caller grants created by the statement allow the
  executable to run with all of the caller’s privileges.
* GRANT INHERITED CALLER — Grant caller grants on all current and future objects of the same type when they share a common schema, database,
  or account. Each caller grant created by the statement allows the executable to run with a specified privilege.
* GRANT ALL INHERITED CALLER PRIVILEGES — Grant caller grants on all current and future objects of the same type when they share a common
  schema, database, or account. The caller grants created by the statement allow the executable to run with all of the caller’s privileges.

A single GRANT statement can result in multiple caller grants being granted to the executable owner. For example, GRANT CALLER INSERT,
SELECT … results in two caller grants, one for the INSERT privilege and another for the SELECT privilege. Similarly, a GRANT ALL INHERITED
CALLER PRIVILEGES statement results in a caller grant for every privilege that can be granted on the specified object type.

> **Note:**
>
> Use caution when granting caller grants to the PUBLIC role, as these caller grants become available to all roles in the account.

For the complete syntax, including parameters, for granting a caller grant, see [GRANT CALLER](../sql-reference/sql/grant-caller.md).

### Examples

The following are examples of how an administrator can use caller grants to control which of the caller’s privileges an executable can run with.

Executables owned by `owner_role` that access a `v1` view can run with the SELECT privilege on the view:

> ```sqlexample
> GRANT CALLER SELECT ON VIEW v1 TO owner_role;
> ```

Executables owned by `owner_role` that access any table in the `db.sch` schema can run with the caller’s SELECT and INSERT privileges.

> ```sqlexample
> GRANT INHERITED CALLER SELECT, INSERT ON ALL TABLES IN SCHEMA db.sch TO ROLE owner_role;
> ```

Executables owned by `owner_role` that access schemas in the current account can run with all of the caller’s privileges on the schemas.

> ```sqlexample
> GRANT ALL INHERITED CALLER PRIVILEGES ON ALL SCHEMAS IN ACCOUNT TO ROLE owner_role;
> ```

Executables owned by the `db.r` database role that access the `db.sch1.t1` table can run with the SELECT privilege on the table.

> ```sqlexample
> GRANT CALLER SELECT ON TABLE db.sch1.t1 TO DATABASE ROLE db.r;
> ```

Executables owned by `owner_role` that access the `my_db` database can run with all of the caller’s privileges on the database.

> ```sqlexample
> GRANT ALL CALLER PRIVILEGES ON DATABASE my_db TO ROLE owner_role;
> ```

## Revoke a caller grant

Administrators use a REVOKE statement to revoke privileges that were previously granted to an executable owner through a caller grant. This
statement has different variations, depending on how you want to revoke caller grants.

* REVOKE CALLER — Revoke specific privileges on a specific object.
* REVOKE ALL CALLER PRIVILEGES — Revoke all privileges on a specific object. The executable will not be
  able to run with any privileges from the caller when it tries to access the object.
* REVOKE INHERITED CALLER — Revoke caller grants on all current and future objects of the same type when they share a common schema, database,
  or account. Only privileges in a specified list are revoked.
* REVOKE ALL INHERITED CALLER PRIVILEGES — Revoke caller grants on all current and future objects of the same type when they share a common
  schema, database, or account. All privileges are revoked; the executable will not be able to run with any privileges from the caller.

Executing a REVOKE INHERITED CALLER or REVOKE ALL INHERITED CALLER PRIVILEGES command does not revoke a caller grant
that was granted on a specific object within the account, database, or schema using a GRANT CALLER statement. For example, if you granted a
caller grant on table `my_db.sch1.t1` directly, executing `REVOKE INHERITED CALLER SELECT ON ALL TABLES IN DATABASE my_db ...` does not
revoke the caller grant on `t1`.

For the complete syntax, including parameters, of revoking a caller grant, see [REVOKE CALLER](../sql-reference/sql/revoke-caller.md).

### Examples

Executables owned by `owner_role` can no longer run with the caller’s privileges when they access views in the current account.

> ```sqlexample
> REVOKE ALL INHERITED CALLER PRIVILEGES ON ALL VIEWS IN ACCOUNT FROM ROLE owner_role;
> ```

Executables owned by `owner_role` can no longer run with the USAGE privilege when they access the `db.sch1` schema.

> ```sqlexample
> REVOKE CALLER USAGE ON SCHEMA db.sch1 FROM ROLE owner_role;
> ```

## List caller grants

Users can use the [SHOW CALLER GRANTS](../sql-reference/sql/show-caller-grants.md) command to list caller grants. You can use this command to list all caller grants that have been granted to a specific owner (SHOW CALLER GRANTS TO …) or to list all caller grants on a specific object (SHOW CALLER GRANTS ON …).

If you execute a SHOW CALLER GRANTS ON … command for a specific object, each row could indicate any of the following:

* A caller grant was granted directly on the object.

  For example, the output of `SHOW CALLER GRANTS ON TABLE db.sch.t1` contains a row if the administrator executed `GRANT CALLER SELECT ON TABLE db.sch.t1`.
* The object inherited a caller grant.

  For example, the output of `SHOW CALLER GRANTS ON TABLE db1.sch.t1` contains a row if the administrator executed `GRANT INHERITED CALLER SELECT ON ALL TABLES IN SCHEMA db1.sch`.
* The object was specified with an IN clause so other objects that it contains inherited caller grants.

  For example, the output of `SHOW CALLER GRANTS ON ACCOUNT` contains a row if the administrator executed `GRANT INHERITED CALLER SELECT ON ALL TABLES IN ACCOUNT`.
* The object is an ancestor of an object with an inherited caller grant as well as the descendant of the object that was specified with an IN clause that resulted in the inheritance.

  For example, `SHOW CALLER GRANTS ON SCHEMA my_db.sch1` contains a row if the administrator executed `GRANT INHERITED CALLER SELECT ON ALL TABLES IN DATABASE my_db`.

### Conditional output

The output of the SHOW CALLER GRANTS command varies depending on the privileges of the executing role. When a user executes SHOW CALLER
GRANTS, the results only contain objects on which they have at least one privilege; they cannot discover the existence of an object unless
they can access it, even if there is a caller grant on it.

For example, suppose there is a caller grant on databases `DB1` and `DB2`. Now suppose role `R2` has the USAGE privilege on
`DB1`, but no privileges on `DB2`. When `R2` executes SHOW CALLER GRANTS, the output shows that there is a caller grant on `DB1`,
but does not list `DB2`. If `R2` had privileges on both databases, then the output would show that the caller grant is on both
databases.

### Examples

List caller grants that have been granted on the table `t1`.

> ```sqlexample
> SHOW CALLER GRANTS ON TABLE t1;
> ```

List all of the caller grants that have been granted for the current account. This includes grants directly on the account
(GRANT CALLER … ON ACCOUNT) and grants to all objects in an account (GRANT INHERITED CALLER … IN ACCOUNT).

> ```sqlexample
> SHOW CALLER GRANTS ON ACCOUNT;
> ```

List all of the caller grants that have been granted to the database role `db.owner_role`.

> ```sqlexample
> SHOW CALLER GRANTS TO DATABASE ROLE db.owner_role;
> ```

## Limitations of an executable with restricted caller’s rights

If an executable runs with restricted caller’s rights, then it is subject to the following restrictions.

**External stages**

* Executable cannot create an external stage without specifying a storage integration.
* Executable cannot copy into an external stage.
* Executable cannot copy into an external URL without specifying a storage integration.

**Stored procedures**

* Executable cannot create Snowflake objects that run with owner’s rights, caller’s rights, or restricted caller’s rights. For example,
  it cannot create a stored procedure.
* Executable cannot change the rights with which a stored procedure runs. For example, the executable cannot change a stored
  procedure from owner’s rights to caller’s rights.

**Roles and privileges**

* Executable cannot execute the USE ROLE and USE SECONDARY ROLES commands.
* Executable cannot use GRANT statements to grant privileges and caller grants.
* Executable cannot use REVOKE statements to revoke privileges and caller grants.

**References**

* Executable cannot create transient and persisted [references](../sql-reference/references.md).

**Session-related operations**

* Executable cannot execute [SET](../sql-reference/sql/set.md) or [UNSET](../sql-reference/sql/unset.md) commands.
* Executable cannot execute SHOW VARIABLES or SHOW PARAMETERS.
* Executable cannot use or read session variables.
* Executable cannot execute ALTER SESSION.
* Executable cannot create session-scoped temporary objects.
* Executable cannot execute USE DATABASE, USE SCHEMA, or USE WAREHOUSE.

---
title: Returning a value
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/return.md
section: Developer Guide
---

# Returning a value

To return a value, use the [RETURN](../../sql-reference/snowflake-scripting/return.md) command. You can return a
value from the following items:

* A block in a [stored procedure](../stored-procedure/stored-procedures-overview.md) or
  [Snowflake Scripting user-defined function (UDF)](../udf/sql/udf-sql-procedural-functions.md).
* An [anonymous block](blocks.md).

## Types of return values

You can return a value of one of the following types:

* A [SQL data type](../../sql-reference-data-types.md)

* A table

  Use `TABLE(...)` in the RETURN statement.

  If your block is in a stored procedure, you must also specify the `RETURNS TABLE(...)` clause in the
  [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) statement.

  > **Note:**
  >
  > Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
  > applies whether you are creating a stored or anonymous procedure.
  >
  > ```sqlexample
  > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
  >   RETURNS TABLE(g GEOGRAPHY)
  >   ...
  > ```
  >
  > ```sqlexample
  > WITH test_return_geography_table_1() AS PROCEDURE
  >   RETURNS TABLE(g GEOGRAPHY)
  >   ...
  > CALL test_return_geography_table_1();
  > ```
  >
  > If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
  >
  > ```none
  > Stored procedure execution error: data type of returned table does not match expected returned table type
  > ```
  >
  > To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
  >
  > ```sqlexample
  > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
  >   RETURNS TABLE()
  >   ...
  > ```
  >
  > ```sqlexample
  > WITH test_return_geography_table_1() AS PROCEDURE
  >   RETURNS TABLE()
  >   ...
  > CALL test_return_geography_table_1();
  > ```

  If you want to return the data that a [RESULTSET](resultsets.md) points to, pass the RESULTSET to
  `TABLE(...)`, as shown in the example below:

  ```sqlexample
  CREATE PROCEDURE ...
  RETURNS TABLE(...)
  ...
  RETURN TABLE(my_result_set);
  ...
  ```

  See [Returning a RESULTSET as a table](resultsets.md).

## Returning the value of a variable

This example declares a variable named `my_var` for use in a Snowflake Scripting anonymous block and
then returns the value of the variable:

```sqlexample
DECLARE
  my_var VARCHAR;
BEGIN
  my_var := 'Snowflake';
  RETURN my_var;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  my_var VARCHAR;
BEGIN
  my_var := 'Snowflake';
  RETURN my_var;
END;
$$;
```

## Using the value returned from a stored procedure call

See [Using the value returned from a stored procedure call](../stored-procedure/stored-procedures-snowflake-scripting.md).

## Using the value returned from a Snowflake Scripting UDF

See [Snowflake Scripting UDFs](../udf/sql/udf-sql-procedural-functions.md).

---
title: Returning tabular data from a Java stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-tabular-data.md
section: Developer Guide
---

# Returning tabular data from a Java stored procedure

You can write a procedure that returns data in tabular form. To write a procedure that returns tabular data, do the following:

* Specify `TABLE(...)` as the procedure’s return type in your [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) statement.

  As TABLE parameters, you can specify the returned data’s column names and [types](../../../sql-reference-data-types.md) if you know them.
  If you don’t know the returned columns when defining the procedure – such as when they’re specified at run time – you can leave out the
  TABLE parameters. When you do, the procedure’s return value columns will be converted from the columns in the dataframe returned by its
  handler. Column data types will be converted to SQL according to the mapping specified in [SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* Write the handler so that it returns the tabular result in a Snowpark DataFrame.

  For more information about dataframes, see [Working with DataFrames in Snowpark Java](../../snowpark/java/working-with-dataframes.md).

> **Note:**
>
> A procedure will generate an error at runtime if either of the following is true:
>
> * It declares TABLE as its return type but its handler does not return a DataFrame.
> * Its handler returns a DataFrame but the procedure doesn’t declare TABLE as its return type.

## Example

The examples in this section illustrate returning tabular values from a procedure that filters for rows where a column matches a string.

### Defining the data

Code in the following example creates a table of employees.

```sqlexample
CREATE OR REPLACE TABLE employees(id NUMBER, name VARCHAR, role VARCHAR);
INSERT INTO employees (id, name, role) VALUES (1, 'Alice', 'op'), (2, 'Bob', 'dev'), (3, 'Cindy', 'dev');
```

### Declaring a procedure to filter rows

Code in the following two examples create a stored procedure that takes the table name and role as arguments, returning the rows in the table
whose role column value matches the role specified as an argument.

### Specifying return column names and types

This example specifies column names and types in the `RETURNS TABLE()` statement.

```sqlexample-java
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:latest')
HANDLER = 'Filter.filterByRole'
AS
$$
import com.snowflake.snowpark_java.*;

public class Filter {
  public DataFrame filterByRole(Session session, String tableName, String role) {
    DataFrame table = session.table(tableName);
    DataFrame filteredRows = table.filter(Functions.col("role").equal_to(Functions.lit(role)));
    return filteredRows;
  }
}
$$;
```

> **Note:**
>
> Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
> applies whether you are creating a stored or anonymous procedure.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> CALL test_return_geography_table_1();
> ```
>
> If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
>
> ```none
> Stored procedure execution error: data type of returned table does not match expected returned table type
> ```
>
> To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE()
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE()
>   ...
> CALL test_return_geography_table_1();
> ```

### Omitting return column names and types

Code in the following example declares a procedure that allows return value column names and types to be extrapolated from columns in the
handler’s return value. It omits the column names and types from the `RETURNS TABLE()` statement.

```sqlexample-java
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
RETURNS TABLE()
LANGUAGE JAVA
RUNTIME_VERSION = '11'
PACKAGES = ('com.snowflake:snowpark:latest')
HANDLER = 'FilterClass.filterByRole'
AS
$$
import com.snowflake.snowpark_java.*;

public class FilterClass {
  public DataFrame filterByRole(Session session, String tableName, String role) {
    DataFrame table = session.table(tableName);
    DataFrame filteredRows = table.filter(Functions.col("role").equal_to(Functions.lit(role)));
    return filteredRows;
  }
}
$$;
```

### Calling the procedure

The following example calls the stored procedure:

```sqlexample
CALL filter_by_role('employees', 'dev');
```

The procedure call produces the following output:

```output
+----+-------+------+
| ID | NAME  | ROLE |
+----+-------+------+
| 2  | Bob   | dev  |
| 3  | Cindy | dev  |
+----+-------+------+
```

---
title: Returning tabular data from a Python stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-tabular-data.md
section: Developer Guide
---

# Returning tabular data from a Python stored procedure

You can write a procedure that returns data in tabular form. To write a procedure that returns tabular data, do the following:

* Specify `TABLE(...)` as the procedure’s return type in your [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) statement.

  As TABLE parameters, you can specify the returned data’s column names and [types](../../../sql-reference-data-types.md) if you know them.
  If you don’t know the returned columns when defining the procedure — such as when they’re specified at run time — you can leave out the
  TABLE parameters. When you do, the procedure’s return value columns will be converted from the columns in the `DataFrame` returned by its
  handler. Column data types will be converted to SQL according to the mapping specified in [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* Write the handler so that it returns the tabular result in a Snowpark DataFrame.

  For more information about dataframes, see [Working with DataFrames in Snowpark Python](../../snowpark/python/working-with-dataframes.md).

## Examples

The examples in this section illustrate returning tabular values from a procedure that filters for rows where a column matches a string.

### Defining the data

Code in the following example creates a table of employees.

```sqlexample
CREATE OR REPLACE TABLE employees(id NUMBER, name VARCHAR, role VARCHAR);
INSERT INTO employees (id, name, role) VALUES (1, 'Alice', 'op'), (2, 'Bob', 'dev'), (3, 'Cindy', 'dev');
```

### Specifying return column names and types

This example specifies column names and types in the `RETURNS TABLE()` statement.

```sqlexample-python
CREATE OR REPLACE PROCEDURE filterByRole(tableName VARCHAR, role VARCHAR)
RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
LANGUAGE PYTHON
RUNTIME_VERSION = '3.9'
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'filter_by_role'
AS
$$
from snowflake.snowpark.functions import col

def filter_by_role(session, table_name, role):
   df = session.table(table_name)
   return df.filter(col("role") == role)
$$;
```

### Omitting return column names and types

Code in the following example declares a procedure that allows return value column names and types to be extrapolated from columns in the
handler’s return value. It omits the column names and types from the `RETURNS TABLE()` statement.

```sqlexample-python
CREATE OR REPLACE PROCEDURE filterByRole(tableName VARCHAR, role VARCHAR)
RETURNS TABLE()
LANGUAGE PYTHON
RUNTIME_VERSION = '3.9'
PACKAGES = ('snowflake-snowpark-python')
HANDLER = 'filter_by_role'
AS
$$
from snowflake.snowpark.functions import col

def filter_by_role(session, table_name, role):
  df = session.table(table_name)
  return df.filter(col("role") == role)
$$;
```

### Calling the procedure

The following example calls the stored procedure:

```sqlexample
CALL filterByRole('employees', 'dev');
```

The procedure call produces the following output:

```output
+----+-------+------+
| ID | NAME  | ROLE |
+----+-------+------+
| 2  | Bob   | dev  |
| 3  | Cindy | dev  |
+----+-------+------+
```

---
title: Returning tabular with Scala in stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-tabular-data.md
section: Developer Guide
---

# Returning tabular with Scala in stored procedures created with SQL

You can write a procedure that returns data in tabular form. To write a procedure that returns tabular data, do the following:

* Specify `TABLE(...)` as the procedure’s return type in your [CREATE PROCEDURE](../../../sql-reference/sql/create-procedure.md) statement.

  As TABLE parameters, you can specify the returned data’s column names and [types](../../../sql-reference-data-types.md) if you know them.
  If you don’t know the returned columns when defining the procedure—such as when they’re specified at run time—you can leave out the
  TABLE parameters. When you do, the procedure’s return value columns are converted from the columns in the dataframe returned by its
  handler. Column data types are converted to SQL according to the mapping specified in [SQL-Scala Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* Write the handler so that it returns the tabular result in a Snowpark dataframe.

  For more information about dataframes, see [Working with DataFrames in Snowpark Scala](../../snowpark/scala/working-with-dataframes.md).

> **Note:**
>
> A procedure generates an error at runtime if either of the following is true:
>
> * It declares TABLE as its return type, but its handler does not return a dataframe.
> * Its handler returns a dataframe, but the procedure doesn’t declare TABLE as its return type.

## Example

The examples in this section illustrate returning tabular values from a procedure that filters for rows where a column matches a string.

### Defining the data

Code in the following example creates a table of employees.

```sqlexample
CREATE OR REPLACE TABLE employees(id NUMBER, name VARCHAR, role VARCHAR);
INSERT INTO employees (id, name, role) VALUES (1, 'Alice', 'op'), (2, 'Bob', 'dev'), (3, 'Cindy', 'dev');
```

### Declaring a procedure to filter rows

Code in the following two examples create a stored procedure that takes the table name and role as arguments, returning the rows in the table
whose role column value matches the role specified as an argument.

#### Specifying return column names and types

This example specifies column names and types in the `RETURNS TABLE()` statement.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
  RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'Filter.filterByRole'
  AS
  $$
  import com.snowflake.snowpark.functions._
  import com.snowflake.snowpark._

  object Filter {
    def filterByRole(session: Session, tableName: String, role: String): DataFrame = {
      val table = session.table(tableName)
      val filteredRows = table.filter(col("role") === role)
      return filteredRows
    }
  }
$$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
  RETURNS TABLE(id NUMBER, name VARCHAR, role VARCHAR)
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'Filter.filterByRole'
  AS
  $$
  import com.snowflake.snowpark.functions._
  import com.snowflake.snowpark._

  object Filter {
    def filterByRole(session: Session, tableName: String, role: String): DataFrame = {
      val table = session.table(tableName)
      val filteredRows = table.filter(col("role") === role)
      return filteredRows
    }
  }
$$;
```

> **Note:**
>
> Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
> applies whether you are creating a stored or anonymous procedure.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> CALL test_return_geography_table_1();
> ```
>
> If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
>
> ```none
> Stored procedure execution error: data type of returned table does not match expected returned table type
> ```
>
> To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE()
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE()
>   ...
> CALL test_return_geography_table_1();
> ```

#### Omitting return column names and types

Code in the following example declares a procedure that allows return value column names and types to be extrapolated from columns in the
handler’s return value. It omits the column names and types from the `RETURNS TABLE()` statement.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
  RETURNS TABLE()
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'Filter.filterByRole'
  AS
  $$
  import com.snowflake.snowpark.functions._
  import com.snowflake.snowpark._

  object Filter {
    def filterByRole(session: Session, tableName: String, role: String): DataFrame = {
      val table = session.table(tableName)
      val filteredRows = table.filter(col("role") === role)
      return filteredRows
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE filter_by_role(table_name VARCHAR, role VARCHAR)
  RETURNS TABLE()
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'Filter.filterByRole'
  AS
  $$
  import com.snowflake.snowpark.functions._
  import com.snowflake.snowpark._

  object Filter {
    def filterByRole(session: Session, tableName: String, role: String): DataFrame = {
      val table = session.table(tableName)
      val filteredRows = table.filter(col("role") === role)
      return filteredRows
    }
  }
  $$;
```

### Calling the procedure

The following example calls the stored procedure:

```sqlexample
CALL filter_by_role('employees', 'dev');
```

The procedure call produces the following output:

```output
+----+-------+------+
| ID | NAME  | ROLE |
+----+-------+------+
| 2  | Bob   | dev  |
| 3  | Cindy | dev  |
+----+-------+------+
```

---
title: Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-overview.md
section: Developer Guide
---

# Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark

With Snowpark Connect for Apache Spark™, you can connect your existing Spark workloads directly to Snowflake and run them on the Snowflake compute engine.
Snowpark Connect for Spark supports using the [Spark DataFrame API](https://spark.apache.org/docs/latest/sql-programming-guide.html) on Snowflake.
All workloads run on Snowflake warehouse. As a result, you can run your PySpark dataframe code with all the benefits of the
Snowflake engine.

In Apache Spark™ version 3.4, the Apache Spark community introduced Spark Connect. Its decoupled client-server architecture separates
the user’s code from the Spark cluster where the work is done. This new architecture makes it possible for Snowflake to power Spark jobs.

You can develop using familiar client tools.

Snowpark Connect for Spark offers the following benefits:

* Decouples client and server, so that Spark code can run remotely against the Snowflake compute engine without your needing to manage a
  Spark cluster.
* Lets team use their existing ecosystem to author and orchestrate their Spark workloads—for example, Jupyter notebooks, VS code,
  and Airflow.
* Allows you to reuse open source Spark dataframes and Spark SQL code with minimal migrations or changes.
* Offers a streamlined way to integrate Snowflake governance, security, and scalability into Spark-based workflows, supporting a familiar
  PySpark experience with pushdown optimizations into Snowflake.
* Allows you to use any of several languages, including PySpark and Spark SQL.

## Get started with Snowpark Connect for Spark

To get started with Snowpark Connect for Spark, follow these steps:

1. [Set up the client tool](snowpark-connect-clients.md) that you’ll use to develop Spark
   workloads to run on Snowflake.

   For example, you can use [Snowflake Notebooks](snowpark-connect-workloads-snowflake-notebook.md)
   or [another tool](snowpark-connect-workloads-jupyter.md).
2. Run Spark workloads asynchronously using Snowpark Submit.

   For more information, see [Submitting Spark applications](snowpark-submit.md).
3. Get to know Snowpark Connect for Spark support for Spark particulars.

   For more information, see [Snowpark Connect for Spark compatibility guide](snowpark-connect-compatibility.md).

## Develop and run Spark workloads on Snowflake

You can use familiar development tools to develop Spark workloads that run on Snowflake, and then run those workloads in batches by
using the Snowpark Submit command-line tool. For more information on which development clients are supported and how to use them, see [Development clients for Snowpark Connect for Spark](snowpark-connect-clients.md).

* For interactive development, use tools such as Snowflake Notebooks or VS Code to develop Spark workloads. You can authenticate with Snowflake,
  start a Spark session, and run PySpark code to load, transform, and analyze data. For more information, see [Development clients for Snowpark Connect for Spark](snowpark-connect-clients.md).
* For non-interactive batch workloads, you can run asynchronous Spark workloads directly on Snowflake’s infrastructure while using familiar Spark semantics. Use Snowpark Submit to submit production-ready Spark applications using a
  simple CLI interface and using your tools, including Airflow. For more information, see [Submitting Spark applications](snowpark-submit.md).

---
title: Run Spark workloads from Snowflake Notebooks
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-workloads-snowflake-notebook.md
section: Developer Guide
---

# Run Spark workloads from Snowflake Notebooks

You can run Spark workloads interactively from Snowflake Notebooks without needing to manage a Spark cluster. The workloads run on the
Snowflake infrastructure.

To use Snowflake Notebooks as a client for developing Spark workloads to run on Snowflake:

1. Launch Snowflake Notebooks.
2. Within the notebook, start a Spark session.
3. Write PySpark code to load, transform, and analyze data—such as to filter high-value customer orders or
   aggregate revenue.

## Use a Snowflake Notebook that runs on a warehouse

For more information about Snowflake Notebooks, see [Create a notebook](../../user-guide/ui-snowsight/notebooks-create.md).

1. Create a Snowflake Notebook by completing the following steps:

   1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
   2. At the top of the navigation menu, select  (Create) » Notebook » New Notebook.
   3. In the Create notebook dialog, enter a name, database, and schema for the new notebook.

      For more information, see [Create a notebook](../../user-guide/ui-snowsight/notebooks-create.md).
   4. For Runtime, select Run on warehouse.
   5. For Runtime version, select Snowflake Warehouse Runtime 2.0.

      When you select version 2.0, you ensure that you have the dependency support you need, including Python 3.10. For more information,
      see [Legacy Notebook runtimes](../../user-guide/ui-snowsight/notebooks.md).
   6. For Query warehouse and Notebook warehouse, select warehouses for running query code and kernel and Python code,
      as described in [Create a notebook](../../user-guide/ui-snowsight/notebooks-create.md).
   7. Select Create.
   8. In the notebook you created, under Packages, ensure that you have the following packages listed to support code in your
      notebook:

      * Python, version 3.10 or later
      * snowpark-connect, latest version

        If you need to add these packages, use the following steps:

        1. Under Anaconda Packages, type the packages name in the search box.
        2. Select the package name.
        3. Select Save.
2. To connect to the Snowpark Connect for Spark server and test the connection, copy the following code and paste it in the Python cell of the
   notebook you created:

   ```python
   from snowflake import snowpark_connect

   spark = snowpark_connect.init_spark_session()
   df = spark.sql("show schemas").limit(10)
   df.show()
   ```

## Use a Snowflake Notebook that runs in a workspace

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all AWS and Azure accounts. PrivateLink is not supported.

For more information about Snowflake Notebooks in Workspaces, see [Snowflake Notebooks in Workspaces](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md).

1. Create a PyPI external access integration.

   You must use the ACCOUNTADMIN role and have a database you can access.

   Run the following commands from a SQL file in a workspace.

   ```sqlexample
   USE DATABASE mydb;
   USE ROLE accountadmin;

   CREATE OR REPLACE NETWORK RULE pypi_network_rule
   MODE = EGRESS
   TYPE = HOST_PORT
   VALUE_LIST = ('pypi.org', 'pypi.python.org', 'pythonhosted.org', 'files.pythonhosted.org');

   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION pypi_access_integration
   ALLOWED_NETWORK_RULES = (pypi_network_rule)
   ENABLED = true;
   ```
2. Enable PyPI integration in a notebook.

   1. In the notebook, for Service name, select a service.
   2. For External access integrations, select the PyPI integration you created.
   3. For Python version, select Python 3.11.
   4. Select Create.
3. Install the `snowpark_connect` package from PyPI in the notebook, using code such as the following:

   ```bash
   pip install snowpark-connect[jdk]
   ```
4. Restart the kernel.

   * From the Connect button, select Restart kernel.
5. Start the `snowpark_connect` server using code such as the following:

   ```python
   import snowflake.snowpark_connect

   spark = snowflake.snowpark_connect.init_spark_session()
   ```
6. Run your Spark code, as shown in the following example:

   ```python
   from pyspark.sql.connect.functions import *
   from pyspark.sql.connect.types import *
   from pyspark.sql import Row

   # Sample nested data
   data = [(1, ("Alice", 30))]
   schema = "id INT, info STRUCT<name:STRING, age:INT>"

   df = spark.createDataFrame(data, schema=schema)
   df.show()

   spark.sql("show databases").show()
   ```

---
title: Run Spark workloads from VS Code, Jupyter Notebooks, or a terminal
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-workloads-jupyter.md
section: Developer Guide
---

# Run Spark workloads from VS Code, Jupyter Notebooks, or a terminal

You can run Spark workloads interactively from Jupyter Notebooks, VS Code, or any Python-based interface without needing to manage a
Spark cluster. The workloads run on the Snowflake infrastructure.

For example, you can do the following tasks:

1. Confirm that you have prerequisites.
2. Set up your environment to connect with Snowpark Connect for Spark on Snowflake.
3. Install Snowpark Connect for Spark.
4. Run PySpark code from your client to run on Snowflake.

## Prerequisites

Confirm that your Python and Java installations are based on the same computer architecture. For example, if Python is based is arm64, Java
must also be arm64 (not x86_64, for example).

## Set up your environment

You can set up your development environment by ensuring the your code can connect to Snowpark Connect for Spark on Snowflake. To connect to Snowflake
client code will use a `.toml` file containing connection details.

If you have Snowflake CLI installed, you can use it to define a connection. Otherwise, you can manually write connection parameters in a
`config.toml` file.

### Add a connection by using Snowflake CLI

You can use Snowflake CLI to add connection properties that Snowpark Connect for Spark can use to connect to Snowflake. Your changes are saved to a
`config.toml` file.

1. Run the following command to add a connection using the snow connection **add** command.

   ```snowcli
   snow connection add
   ```
2. Follow the prompts to define a connection.

   Be sure to specify `spark-connect` as the connection name.

   This command adds a connection to your `config.toml` file, as in the following example:

   ```toml
   [connections.spark-connect]
   host = "example.snowflakecomputing.com"
   port = 443
   account = "example"
   user = "test_example"
   password = "password"
   protocol = "https"
   warehouse = "example_wh"
   database = "example_db"
   schema = "public"
   ```
3. Run the following command to confirm that the connection works.

   You can test the connection in this way when you’ve added it by using Snowflake CLI.

   ```snowcli
   snow connection list
   snow connection test --connection spark-connect
   ```

### Add a connection by manually writing a connection file

You can manually write or update a `connections.toml` file so that your code can connect to Snowpark Connect for Spark on Snowflake.

1. Run the following command to ensure that your `connections.toml` file allows only the owner (user) to have read and write access.

   ```bash
   chmod 0600 "~/.snowflake/connections.toml"
   ```
2. Edit the `connections.toml` file so that it contains a `[spark-connect]` connection with the connection properties in the
   following example.

   Be sure to replace values with your own connection specifics.

   ```toml
   [spark-connect]
   host="my_snowflake_account.snowflakecomputing.com"
   account="my_snowflake_account"
   user="my_user"
   password="&&&&&&&&"
   warehouse="my_wh"
   database="my_db"
   schema="public"
   ```

### Install Snowpark Connect for Spark

You can install Snowpark Connect for Spark as a Python package.

1. Create a Python virtual environment.

   Confirm that your Python version is 3.10 or later and earlier than 3.13 by running `python3 --version`.

   ```bash
   python3 -m venv .venv
   source .venv/bin/activate
   ```
2. Install the Snowpark Connect for Spark package.

   ```bash
   pip install --upgrade --force-reinstall 'snowpark-connect[jdk]'
   ```
3. Add Python code to start a Snowpark Connect for Spark server and create a Snowpark Connect for Spark session.

   ```python
   from snowflake import snowpark_connect
   spark=snowpark_connect.init_spark_session()
   ```

## Run Python code from your client

Once you have an authenticated connection in place, you can write code as you normally would.

You can run PySpark code that connects to Snowpark Connect for Spark by using the PySpark client library.

```python
# Row is imported in the previous code snippet

df = spark.createDataFrame([
    Row(a=1, b=2.),
    Row(a=2, b=3.),
    Row(a=4, b=5.),])

print(df.count())
```

## Run Scala code from your client

You can run Scala applications that connect to Snowpark Connect for Spark by using the Spark Connect client library.

This guide walks you through setting up Snowpark Connect and connecting your Scala applications to the Snowpark Connect for Spark server.

### Step 1: Set up your Snowpark Connect for Spark environment

Set up your environment by using steps described in the following topics:

1. Create a Python virtual environment and install Snowpark Connect.
2. Set up a connection.

### Step 2: Create a Snowpark Connect for Spark server script and launch the server

1. Create a Python script to launch the Snowpark Connect for Spark server.

   ```python
   # launch-snowpark-connect.py

   from snowflake import snowpark_connect

   def main():
       snowpark_connect.start_session(is_daemon=False, remote_url="sc://localhost:15002")
       print("SAS started on port 15002")

   if __name__ == "__main__":
       main()
   ```
2. Launch the Snowpark Connect for Spark server.

   ```python
   # Make sure you're in the correct Python environment
   pyenv activate your-snowpark-connect-env

   # Run the server script
   python launch-snowpark-connect.py
   ```

### Step 3: Set up your Scala application

1. Add the Spark Connect client dependency to your build.sbt file.

   ```scala
   libraryDependencies += "org.apache.spark" %% "spark-connect-client-jvm" % "3.5.6"

   // Add JVM options for Java 9+ module system compatibility
   javaOptions ++= Seq(
     "--add-opens=java.base/java.nio=ALL-UNNAMED"
   )
   ```
2. Execute Scala code to connect to the Snowpark Connect for Spark server.

   ```scala
   import org.apache.spark.sql.SparkSession
   import org.apache.spark.sql.connect.client.REPLClassDirMonitor

   object SnowparkConnectExample {
     def main(args: Array[String]): Unit = {
       // Create Spark session with Snowpark Connect
       val spark = SparkSession.builder().remote("sc://localhost:15002").getOrCreate()

       // Register ClassFinder for UDF support (if needed)
       // val classFinder = new REPLClassDirMonitor("target/scala-2.12/classes")
       // spark.registerClassFinder(classFinder)

       try {
         // Simple DataFrame operations
         import spark.implicits._

         val data = Seq(
           (1, "Alice", 25),
           (2, "Bob", 30),
           (3, "Charlie", 35)
         )

         val df = spark.createDataFrame(data).toDF("id", "name", "age")

         println("Original DataFrame:")
         df.show()

         println("Filtered DataFrame (age > 28):")
         df.filter($"age" > 28).show()

         println("Aggregated result:")
         df.groupBy().avg("age").show()

       } finally {
         spark.stop()
       }
     }
   }
   ```
3. Compile and run your application.

   ```bash
   # Compile your Scala application
   sbt compile

   # Run the application
   sbt "runMain SnowparkConnectExample"
   ```

### Scala UDF support on Snowpark Connect for Spark

When using user-defined functions or custom code, do one of the following:

* Register a class finder to monitor and upload class files.

  ```scala
  import org.apache.spark.sql.connect.client.REPLClassDirMonitor

  val classFinder = new REPLClassDirMonitor("/absolute/path/to/target/scala-2.12/classes")
  spark.registerClassFinder(classFinder)
  ```
* Upload JAR dependencies if needed. You can include the workload JAR itself if a class finder is not used.

  ```scala
  spark.addArtifact("/absolute/path/to/dependency.jar")
  ```
* Use a staged JAR.

  ```scala
  spark.conf.set("snowpark.connect.udf.java.imports", "[@mystage/dependency.jar, @db.schema.stage/other_dependency.jar]")
  ```

### Using Scala 2.13

By default, Snowpark Connect for Spark uses Scala 2.12. Workloads built with Scala 2.13 must specify the Scala version using the “snowpark.connect.scala.version” configuration option.

```scala
// Directly in the session builder
val spark = SparkSession.builder()
  .remote("sc://localhost:15002")
  .config("snowpark.connect.scala.version", "2.13")
  .getOrCreate()

// Or via session configuration
spark.conf.set("snowpark.connect.scala.version", "2.13")
```

### Troubleshoot Snowpark Connect for Spark installation

With the following list of checks, you can troubleshoot Snowpark Connect for Spark installation and use.

* Ensure that Java and Python are based on the same architecture.
* Use the most recent Snowpark Connect for Spark package file, as described in Install Snowpark Connect for Spark.
* Confirm that the **python** command with PySpark code is working correctly for local execution—that is, without Snowflake connectivity.

  For example, execute a command such as the following:

  ```python
  python your_pyspark_file.py
  ```

## Open source clients

You can use standard, off-the-shelf open source software (OSS) Spark client packages—such as PySpark and Spark clients for Java or
Scala—from your preferred local environments, including Jupyter Notebooks and VS Code. In this way, you can avoid installing packages
specific to Snowflake.

You might find this useful if you want to write Spark code locally and have the code use Snowflake compute resources and enterprise governance.
In this scenario, you perform authentication and authorization through programmatic access tokens (PATs).

The following sections cover installation, configuration, and authentication. You’ll also find a simple PySpark example to validate your
connection.

### Step 1: Install Required Packages

* Install `pyspark`. You don’t need to install any Snowflake packages.

  ```bash
  pip install "pyspark[connect]>=3.5.0,<4"
  ```

### Step 2: Setup and Authentication

1. Generate a programmatic access token (PAT).

   For more information, see the following topics:

   * [Using programmatic access tokens for authentication](../../user-guide/programmatic-access-tokens.md)
   * [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](../../sql-reference/sql/alter-user-add-programmatic-access-token.md)

   The following example adds a PAT named `TEST_PAT` for the user `sysadmin` and sets the expiration to 30 days.

   ```sqlexample
   ALTER USER add PAT TEST_PAT ROLE_RESTRICTION = sysadmin DAYS_TO_EXPIRY = 30;
   ```
2. Find your Snowflake Spark Connect host URL.

   Run the following SQL in Snowflake to find the hostname for your account:

   ```sqlexample
   SELECT t.VALUE:type::VARCHAR as type,
          t.VALUE:host::VARCHAR as host,
          t.VALUE:port as port
     FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$ALLOWLIST()))) AS t where type = 'SNOWPARK_CONNECT';
   ```

### Step 3: Connect to Spark Connect server

* To connect to the Spark Connect server, use code such as the following:

  ```python
  from pyspark.sql import SparkSession
  import urllib.parse

  # Replace with your actual PAT.
  pat = urllib.parse.quote("<pat>", safe="")

  # Replace with your Snowpark Connect host from the above SQL query.
  snowpark_connect_host = ""

  # Define database/schema/warehouse for executing your Spark session in Snowflake (recommended); otherwise, it will be resolved from your default_namespace and default_warehouse

  db_name = urllib.parse.quote("TESTDB", safe="")
  schema_name = urllib.parse.quote("TESTSCHEMA", safe="")
  warehouse_name = urllib.parse.quote("TESTWH", safe="")

  spark = SparkSession.builder.remote(f"sc://{snowpark_connect_host}/;token={pat};token_type=PAT;database={db_name};schema={schema_name};warehouse={warehouse_name}").getOrCreate()

  # Spark session is ready to use. You can write regular Spark DataFrame code, as in the following example:

  from pyspark.sql import Row

  df = spark.createDataFrame([
      Row(a=1, b=2.),
      Row(a=2, b=3.),
      Row(a=4, b=5.),])
  print(df.count())
  ```

---
title: Scala examples for stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-examples.md
section: Developer Guide
---

# Scala examples for stored procedures created with SQL

## Using Snowpark APIs for asynchronous processing

The following examples illustrate how you can use Snowpark APIs to begin asynchronous child jobs, as well as how those jobs behave under
different conditions.

In the following example, the `asyncWait` procedure executes an asynchronous child job that waits 10 seconds.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE asyncWait()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import com.snowflake.snowpark._
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val df = session.sql("select system$wait(10)")
      val asyncJob = df.async.collect()
      while(!asyncJob.isDone()) {
        Thread.sleep(1000)
      }
      "Done"
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE asyncWait()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import com.snowflake.snowpark._
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val df = session.sql("select system$wait(10)")
      val asyncJob = df.async.collect()
      while(!asyncJob.isDone()) {
        Thread.sleep(1000)
      }
      "Done"
    }
  }
  $$;
```

```sqlexample
CALL asyncWait();
```

In the following example, the `cancelJob` procedure uses SQL to start a job that would take 10 seconds to finish. It then cancels
the child job before it finishes.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE cancelJob()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import com.snowflake.snowpark._
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val df = session.sql("select system$wait(10)")
      val asyncJob = df.async.collect()
      asyncJob.cancel()
      "Done"
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE cancelJob()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import com.snowflake.snowpark._
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val df = session.sql("select system$wait(10)")
      val asyncJob = df.async.collect()
      asyncJob.cancel()
      "Done"
    }
  }
  $$;
```

In the following example, the `checkStatus` procedure executes an asynchronous child job that waits 10 seconds. The procedure then
checks on the status of the job before it finishes, so the check returns `False`.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE checkStatus()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import java.sql.ResultSet
  import net.snowflake.client.jdbc.{SnowflakeConnectionV1, SnowflakeResultSet, SnowflakeStatement}
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val connection = session.jdbcConnection
      val stmt = connection.createStatement()
      val rs = stmt.asInstanceOf[SnowflakeStatement].executeAsyncQuery("CALL SYSTEM$WAIT(10)")
      val status = rs.asInstanceOf[SnowflakeResultSet].getStatus.toString
      s"""status:    ${status}"""
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE checkStatus()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'TestScalaSP.asyncBasic'
  AS
  $$
  import java.sql.ResultSet
  import net.snowflake.client.jdbc.{SnowflakeConnectionV1, SnowflakeResultSet, SnowflakeStatement}
  object TestScalaSP {
    def asyncBasic(session: com.snowflake.snowpark.Session): String = {
      val connection = session.jdbcConnection
      val stmt = connection.createStatement()
      val rs = stmt.asInstanceOf[SnowflakeStatement].executeAsyncQuery("CALL SYSTEM$WAIT(10)")
      val status = rs.asInstanceOf[SnowflakeResultSet].getStatus.toString
      s"""status:    ${status}"""
    }
  }
  $$;
```

---
title: Scala UDF handler examples
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-examples.md
section: Developer Guide
---

# Scala UDF handler examples

This topic includes simple examples of UDF handler code written in Scala.

For information on using Scala to create a scalar UDF handler, refer to [Writing a scalar UDF in Scala](udf-scala-scalar.md). For general
coding guidelines, refer to [General Scala UDF handler coding guidelines](udf-scala-general.md).

## Creating and calling a simple in-line Scala UDF

The following statements create and call an in-line Scala UDF. This code returns the VARCHAR passed to it.

This function is declared with the optional `CALLED ON NULL INPUT` clause to indicate that the function is
called even if the value of the input is NULL. (This function would return NULL with or without this clause, but
you could modify the code to handle NULL another way, for example, to return an empty string.)

### Create the UDF

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  CALLED ON NULL INPUT
  RUNTIME_VERSION = 2.12
  HANDLER='Echo.echoVarchar'
  AS
  $$
  class Echo {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  CALLED ON NULL INPUT
  RUNTIME_VERSION = 2.13
  HANDLER='Echo.echoVarchar'
  AS
  $$
  class Echo {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT echo_varchar('Hello');
```

### Passing a NULL to an in-line Scala UDF

This uses the `echo_varchar()` UDF defined above. The SQL `NULL` value is implicitly converted to
Scala [Null](https://www.scala-lang.org/api/2.12.17/scala/Null.html), and that Scala `Null` is returned and implicitly converted
back to SQL `NULL`:

Call the UDF:

```sqlexample
SELECT echo_varchar(NULL);
```

## Returning NULL explicitly from an in-line UDF

The following code shows how to return a NULL value explicitly. The Scala value `Null` is converted to
SQL `NULL`.

### Create the UDF

Scala 2.12Scala 2.13

```sqlexample-scala
CREATE OR REPLACE FUNCTION return_a_null()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='TemporaryTestLibrary.returnNull'
  AS
  $$
  class TemporaryTestLibrary {
    def returnNull(): String = {
      return null
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION return_a_null()
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='TemporaryTestLibrary.returnNull'
  AS
  $$
  class TemporaryTestLibrary {
    def returnNull(): String = {
      return null
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT return_a_null();
```

## Passing an OBJECT to an in-line Scala UDF

The following example uses the SQL [OBJECT](../../../sql-reference/data-types-semistructured.md) data type and the corresponding Scala
data type (`Map[String, String]`), and extracts a value from the OBJECT. This example also shows that you
can pass multiple parameters to a Scala UDF.

Create and load a table that contains a column of type OBJECT:

```sqlexample
CREATE TABLE objectives (o OBJECT);
INSERT INTO objectives SELECT PARSE_JSON('{"outer_key" : {"inner_key" : "inner_value"} }');
```

### Create the UDF

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION extract_from_object(x OBJECT, key VARCHAR)
  RETURNS VARIANT
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='VariantLibrary.extract'
  AS
  $$
  import scala.collection.immutable.Map

  class VariantLibrary {
    def extract(m: Map[String, String], key: String): String = {
      return m(key)
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION extract_from_object(x OBJECT, key VARCHAR)
  RETURNS VARIANT
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='VariantLibrary.extract'
  AS
  $$
  import scala.collection.immutable.Map

  class VariantLibrary {
    def extract(m: Map[String, String], key: String): String = {
      return m(key)
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT extract_from_object(o, 'outer_key'),
  extract_from_object(o, 'outer_key')['inner_key'] FROM OBJECTIVES;
```

## Passing an ARRAY to an in-line Scala UDF

The following example uses the SQL [ARRAY](../../../sql-reference/data-types-semistructured.md) data type.

### Create the UDF

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION generate_greeting(greeting_words ARRAY)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='StringHandler.handleStrings'
  AS
  $$
  class StringHandler {
    def handleStrings(strings: Array[String]): String = {
      return concatenate(strings)
    }
    private def concatenate(strings: Array[String]): String = {
      var concatenated : String = ""
      for (newString <- strings)  {
          concatenated = concatenated + " " + newString
      }
      return concatenated
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION generate_greeting(greeting_words ARRAY)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='StringHandler.handleStrings'
  AS
  $$
  class StringHandler {
    def handleStrings(strings: Array[String]): String = {
      return concatenate(strings)
    }
    private def concatenate(strings: Array[String]): String = {
      var concatenated : String = ""
      for (newString <- strings)  {
          concatenated = concatenated + " " + newString
      }
      return concatenated
    }
  }
  $$;
```

## Reading a file with a Scala UDF

You can read the contents of a file or directory with handler code. For example, you might want to read a file to process unstructured data with the
handler.

The file must be on a Snowflake stage that’s available to your handler.

To read the contents of staged files, your handler can:

* Read a file from a directory imported using IMPORTS.
* Read a dynamically-specified file by calling methods of either the
  `SnowflakeFile` class or the `InputStream` class.

  You might do this if you need to access a file specified by the caller. For more information, see the following in this topic:

  + Reading a dynamically-specified file with SnowflakeFile
  + Reading a dynamically-specified file with InputStream

`SnowflakeFile` provides features not available with `InputStream`, as described in the following table.

| Class | Input | Notes |
| --- | --- | --- |
| `SnowflakeFile` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner. * File URL or string path for files that the UDF owner has access to.   The file must be located in a named internal stage or an external stage. | Easily access additional file attributes, such as file size. |
| `InputStream` | URL formats:   * Scoped URL to reduce the risk of file injection attacks when the function’s caller is not also its owner.   The file must be located in a named internal stage or an external stage. |  |

> **Note:**
>
> The UDF owner must have access to any files whose locations are not scoped URLs. You can read these staged files by having the handler
> code call the `SnowflakeFile.newInstance` method with a `boolean` value for a new `requireScopedUrl` parameter.
>
> The following example uses `SnowflakeFile.newInstance` while specifying that a scoped URL is not required.
>
> ```scala
> var filename = "@my_stage/filename.txt"
> var sfFile = SnowflakeFile.newInstance(filename, false)
> ```

### Importing a directory using IMPORTS

[Preview Feature](../../../release-notes/preview-features.md) — Open

Available to all accounts.

You can import a directory using the IMPORTS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.

> **Note:**
>
> * The import path for a directory must end with a trailing slash (`/`). For example, `IMPORTS = ('@my_stage/my_dir/')`.
> * To rename a directory on import, append `/=custom_name/` to the stage path. The custom name must be a single directory name, not a path. For example, `IMPORTS = ('@my_stage/my_dir/=custom_name/')`.
> * Directory imports are not supported in Native Apps.

The following example imports a directory called `my_dir` from a stage named `my_stage` and reads the files contained within it.

```sqlexample-scala
CREATE OR REPLACE FUNCTION scala_udf(fileName STRING)
  RETURNS STRING
LANGUAGE SCALA
RUNTIME_VERSION = '2.12'
IMPORTS = ('@my_stage/my_dir/')
HANDLER = 'FileReader.compute'
AS $$
import java.io._
import java.nio.file.{Paths, Files}
import scala.io.Source

class FileReader {
  def compute(fileName: String): String = {
    // Get the base import directory
    val importDir = System.getProperty("com.snowflake.import_directory")

    // Construct the path
    val filePath = Paths.get(importDir, "/", "my_dir", "/", fileName)

    // Read the file using Scala Source
    if (Files.exists(filePath)) {
      val source = Source.fromFile(filePath.toFile)
      try {
        source.getLines().mkString("\n")
      } finally {
        source.close()
      }
    } else {
      s"File not found: $fileName"
    }
  }
}
$$;
SELECT scala_udf('file.txt');
```

### Reading a dynamically-specified file with `SnowflakeFile`

Using methods of the `SnowflakeFile` class, you can read files from a stage with your handler code. The `SnowflakeFile`
class is included on the classpath available to Scala UDF handlers on Snowflake.

> **Note:**
>
> To make your code resilient to file injection attacks, always use a scoped URL when passing a file’s location to a UDF, particularly
> when the function’s caller is not also its owner. You can create a scoped URL in SQL using the built-in function
> [BUILD_SCOPED_FILE_URL](../../../sql-reference/functions/build_scoped_file_url.md). For more information about what the BUILD_SCOPED_FILE_URL does, see
> [Introduction to unstructured data](../../../user-guide/unstructured-intro.md).

To develop your UDF code locally, add the Snowpark JAR containing `SnowflakeFile` to your code’s class path. For information about
`snowpark.jar`, see [Setting Up Your Development Environment for Snowpark Scala](../../snowpark/scala/setup.md). Note that Snowpark client applications cannot use this class.

When you use `SnowflakeFile`, it isn’t necessary to also specify either the staged file or the JAR containing
`SnowflakeFile` with an IMPORTS clause when you create the UDF, as in SQL with a CREATE FUNCTION statement.

### Create the UDF

Code in the following example uses `SnowflakeFile` to read a file from a specified stage location. Using an
`InputStream` from the `getInputStream` method, it reads the file’s contents into a `String` variable.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION sum_total_sales_snowflake_file(file STRING)
  RETURNS INTEGER
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES=('com.snowflake:snowpark_2.12:latest')
  HANDLER='SalesSum.sumTotalSales'
  AS
  $$
  import java.io.InputStream
  import java.io.IOException
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.types.SnowflakeFile

  object SalesSum {
    @throws(classOf[IOException])
    def sumTotalSales(filePath: String): Int = {
      var total = -1

      // Use a SnowflakeFile instance to read sales data from a stage.
      val file = SnowflakeFile.newInstance(filePath)
      val stream = file.getInputStream()
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION sum_total_sales_snowflake_file(file STRING)
  RETURNS INTEGER
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES=('com.snowflake:snowpark_2.13:latest')
  HANDLER='SalesSum.sumTotalSales'
  AS
  $$
  import java.io.InputStream
  import java.io.IOException
  import java.nio.charset.StandardCharsets
  import com.snowflake.snowpark_java.types.SnowflakeFile

  object SalesSum {
    @throws(classOf[IOException])
    def sumTotalSales(filePath: String): Int = {
      var total = -1

      // Use a SnowflakeFile instance to read sales data from a stage.
      val file = SnowflakeFile.newInstance(filePath)
      val stream = file.getInputStream()
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT sum_total_sales_input_stream(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

### Reading a dynamically-specified file with `InputStream`

You can read file contents directly into a `java.io.InputStream` by making your handler function’s argument an `InputStream`
variable. This can be useful when the function’s caller will want to pass a file path as an argument.

> **Note:**
>
> To make your code resilient to file injection attacks scoped URLs are required when passing a file’s location to a UDF. You can create a
> scoped URL in SQL using the built-in function BUILD_SCOPED_FILE_URL. For more information about what the BUILD_SCOPED_FILE_URL does,
> see [Introduction to unstructured data](../../../user-guide/unstructured-intro.md).

### Create the UDF

Code in the following example has a handler function `sumTotalSales` that takes an `InputStream` and returns an `Int`.
At run time, Snowflake automatically assigns the contents of the file at the `file` variable’s path to the `stream`
argument variable.

Scala 2.12Scala 2.13

```sqlexample-scala
CREATE OR REPLACE FUNCTION sum_total_sales_input_stream(file STRING)
  RETURNS NUMBER
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER = 'SalesSum.sumTotalSales'
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  AS $$
  import com.snowflake.snowpark.types.Variant
  import java.io.InputStream
  import java.io.IOException
  import java.nio.charset.StandardCharsets
  object SalesSum {
    @throws(classOf[IOException])
    def sumTotalSales(stream: InputStream): Int = {
      val total = -1
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION sum_total_sales_input_stream(file STRING)
  RETURNS NUMBER
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER = 'SalesSum.sumTotalSales'
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  AS $$
  import com.snowflake.snowpark.types.Variant
  import java.io.InputStream
  import java.io.IOException
  import java.nio.charset.StandardCharsets
  object SalesSum {
    @throws(classOf[IOException])
    def sumTotalSales(stream: InputStream): Int = {
      val total = -1
      val contents = new String(stream.readAllBytes(), StandardCharsets.UTF_8)

      // Omitted for brevity: code to retrieve sales data from JSON and assign it to the total variable.

      return total
    }
  }
  $$;
```

### Call the UDF

```sqlexample
SELECT sum_total_sales_input_stream(BUILD_SCOPED_FILE_URL('@sales_data_stage', '/car_sales.json'));
```

---
title: Scala UDF handler project and packaging
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-packaging.md
section: Developer Guide
---

# Scala UDF handler project and packaging

You can make handler code projects easier to maintain by using a well-organized project hierarchy and popular build tools. These are
useful when you intend to copy handler code to a Snowflake stage, then refer to it from functions and procedures.

To build and package handler code, you can use popular tools such as sbt, Maven, and Gradle. For more information, refer to the
following topics:

* [Packaging Scala Handler Code with sbt](../../udf-stored-procedure-build-sbt.md)
* [Packaging Java or Scala Handler Code with Maven](../../udf-stored-procedure-build-maven.md)

Once you’ve packaged handler code, you can add it to a stage as described in [Making dependencies available to your code](../../upload-dependencies.md).

For more information on choosing whether to keep your handler in-line or on a stage, refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).

## Organize your files

If you intend to package your handler in a JAR file and put it on a Snowflake stage, you might find it useful to use a project
hierarchy that organizes Snowflake handler code. This section suggests a hierarchy for organizing files.

For a GitHub template you can use to create a project hierarchy like this one, refer to the
[Snowflake-Labs GitHub repository](https://github.com/Snowflake-Labs/snowpark-scala-template/tree/v1.0.0).

```none
SnowflakeProject
|-- project
|   |-- plugins.sbt
|-- src
|   |-- main / scala / org / example
|   |   |-- function
|   |   |   |-- FunctionHandler.scala
|   |   |-- procedure
|   |   |-- utils
|   |-- test / scala / org / example
|   |   |-- function
|   |   |-- procedure
|-- build.sbt
|-- pom.xml
```

The following table describes the sections of the hierarchy.

| Directory/File | Description |
| --- | --- |
| `project` directory | Contains files used by sbt to guide build and packaging of code.   * `plugins.sbt` file specifies plugins used by sbt. To build code for use in Snowflake, add a plugin to help create a JAR with   your handler’s dependencies. For more information, refer to [Packaging Scala Handler Code with sbt](../../udf-stored-procedure-build-sbt.md). |
| `src / main / scala / org / example` directory | Contains handler code source files.   * Use the `function` directory to hold handler source for user-defined functions (UDFs). * Use the `procedure` directory to hold handler source for stored procedures. * Use the `utils` directory to hold handler source required for both. |
| `src / test / scala / org / example` directory | Contains handler test source files.   * Use the `function` directory to hold tests for user-defined functions (UDFs). * Use the `procedure` directory to hold tests for stored procedures. |
| `build.sbt` file | Specifies the build definition used by sbt, including name and version of the built output, dependencies, and so on. For more information, refer to [Packaging Scala Handler Code with sbt](../../udf-stored-procedure-build-sbt.md). |
| `pom.xml` file | Specifies the build definition used by Maven. For more information, refer to [Packaging Java or Scala Handler Code with Maven](../../udf-stored-procedure-build-maven.md). |

---
title: Scala UDF limitations
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-limitations.md
section: Developer Guide
---

# Scala UDF limitations

This topic describes the limitations in place for handlers written in Scala.

## General limitations

* Although your Scala method can use classes and methods in the standard libraries, Snowflake security
  constraints disable some capabilities, such as writing to files. For details, see the section
  titled [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).
* Scala UDFs are not sharable. Database objects that use Scala UDFs are also not sharable. For example, you cannot:

  + Directly share a Scala UDF.
  + Share a view that calls a Scala UDF.
  + Share a function that calls a Scala UDF.
  + Share a table with a masking or row access policy that calls a Scala UDF.
* Granting USAGE privilege on a Scala UDF might allow the recipient to see the contents of files imported by that UDF. If you grant the
  USAGE privilege on a Scala UDF to a role, and if that role executes a statement that calls that Scala UDF, then any Scala UDF in the same
  statement could read the contents of any files imported by the Scala UDF on which you granted USAGE privilege.
* [Database replication](../../../user-guide/replication-intro.md) does not include external or internal stages yet.
  When you promote a secondary database to serve as the primary database, you must recreate stage objects and re-import any files missing
  in internal stages. The files should have the same path and filenames as in the original primary database.
* The maximum size for a Scala UDF output row is 128 MB.
* Concurrency is not supported. For example, from within your code, you cannot submit queries
  from multiple threads. Code that concurrently issues multiple queries will produce an error.
* If a query calls a UDF to access staged files, the operation fails with a user error if the SQL statement also queries a view that
  calls any UDF, regardless if the function in the view accesses staged files or not.
* UDFs currently process files serially. As a workaround, group rows in a subquery using the [GROUP BY](../../../sql-reference/constructs/group-by.md)
  clause.
* Currently, if the staged files referenced in a query are modified or deleted while the query is running, the function call fails with an
  error.

## Limitations on cloning

A Scala UDF can be cloned when the database or schema containing the Scala UDF is cloned.
To be cloned, the Scala UDF must meet the following condition(s):

* If the Scala UDF references a stage (for example, the stage that contains the UDF’s JAR file), that stage must be
  outside the schema (or database) being cloned.

  You can keep a Scala UDF and its referenced stage(s) in separate schemas (and/or separate databases) the following ways:

  + Wherever the Scala UDF references a stage, use a qualified stage name (such as `my_db.my_schema.my_stage`) different from the
    schema or database of the Scala UDF. If the cloning operation clones a database, the stage reference should include the database and
    schema. If the cloning operation clones a schema, the stage reference should include the schema (and optionally the database).
  + Create the referenced stage by using a non-qualified stage name (which implicitly uses the current session’s active database and
    schema), and create the Scala UDF by using a qualified name that does not match the session’s current database and schema.
  + Use the user’s stage as the referenced stage (the user’s stage is separate from any database’s stage or schema’s stage).

If one or more Scala UDFs in the schema or database do not meet the required conditions, the schema or database can still be cloned, but
the non-compliant Scala UDFs are omitted from the clone without any error or warning message.

Each cloned Scala UDF has the same definition as the original. That definition includes any references to stages. The stage references in
the Scala UDF must be fully-qualified, and therefore are absolute, not relative to the schema or database being cloned. Because both the
original and the clone point to the same stage(s) and file(s):

* Dropping the stage or removing required files from the stage disables both the original and cloned UDF.
* Altering the stage or the files on the stage (e.g. replacing the JAR file with a newer JAR file) affects both the original and cloned UDF.

---
title: Scalar JavaScript UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-scalar-functions.md
section: Developer Guide
---

# Scalar JavaScript UDFs

This topic covers Scalar JavaScript UDFs (user-defined function).

## Introduction

A scalar JavaScript UDF returns one output row for each input row. The output row must contain only one column/value.

A basic example is in [Introduction to JavaScript UDFs](udf-javascript-introduction.md). Additional examples are below.

> **Note:**
>
> Scalar functions (UDFs) have a limit of 500 input arguments.

## Examples

This section contains examples of scalar JavaScript UDFs.

### Recursion

The following example shows that a JavaScript UDF can call itself (i.e. it can use recursion).

Create a recursive UDF:

```javascript
CREATE OR REPLACE FUNCTION RECURSION_TEST (STR VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVASCRIPT
  AS $$
  return (STR.length <= 1 ? STR : STR.substring(0,1) + '_' + RECURSION_TEST(STR.substring(1)));
  $$
  ;
```

Call the recursive UDF:

```javascript
SELECT RECURSION_TEST('ABC');
+-----------------------+
| RECURSION_TEST('ABC') |
|-----------------------|
| A_B_C                 |
+-----------------------+
```

### Custom exception

The following example shows a JavaScript UDF that throws a custom exception.

Create the function:

```javascript
CREATE FUNCTION validate_ID(ID FLOAT)
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
AS $$
    try {
        if (ID < 0) {
            throw "ID cannot be negative!";
        } else {
            return "ID validated.";
        }
    } catch (err) {
        return "Error: " + err;
    }
$$;
```

Create a table with valid and invalid values:

```javascript
CREATE TABLE employees (ID INTEGER);
INSERT INTO employees (ID) VALUES
    (1),
    (-1);
```

Call the function:

```javascript
SELECT ID, validate_ID(ID) FROM employees ORDER BY ID;
+----+-------------------------------+
| ID | VALIDATE_ID(ID)               |
|----+-------------------------------|
| -1 | Error: ID cannot be negative! |
|  1 | ID validated.                 |
+----+-------------------------------+
```

## Troubleshooting

See [Troubleshooting JavaScript UDFs](udf-javascript-troubleshooting.md).

---
title: Scalar SQL UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-scalar-functions.md
section: Developer Guide
---

# Scalar SQL UDFs

This topic covers concepts and usage details that are specific to SQL UDFs (user-defined functions).

## General usage

A SQL UDF evaluates an arbitrary SQL expression and returns the result(s) of the expression.

The function definition can be a SQL expression that returns either a scalar (i.e. single) value or, if defined as a table function, a
set of rows. For example, here is a basic example of a scalar UDF that calculates the area of a circle:

```sqlexample
CREATE FUNCTION area_of_circle(radius FLOAT)
  RETURNS FLOAT
  AS
  $$
    pi() * radius * radius
  $$
  ;
```

```sqlexample
SELECT area_of_circle(1.0);
```

Output:

```sqlexample
SELECT area_of_circle(1.0);
+---------------------+
| AREA_OF_CIRCLE(1.0) |
|---------------------|
|         3.141592654 |
+---------------------+
```

The expression can be a query expression (a [SELECT](../../../sql-reference/sql/select.md) expression). For example:

```sqlexample
CREATE FUNCTION profit()
  RETURNS NUMERIC(11, 2)
  AS
  $$
    SELECT SUM((retail_price - wholesale_price) * number_sold)
        FROM purchases
  $$
  ;
```

When using a query expression in a SQL UDF, do not include a semicolon within the UDF body to terminate the query expression.

You can include only one query expression. The expression can include
UNION [ALL].

> **Note:**
>
> Although the body of a UDF can contain a complete SELECT statement, it cannot contain DDL statements or any DML statement other
> than SELECT.

> **Note:**
>
> Scalar functions (UDFs) have a limit of 500 input arguments.

## Memoizable UDFs

A scalar SQL UDF can be memoizable. A memoizable function caches the result of calling a scalar SQL UDF and then returns the
cached result when the output is needed at a later time. The benefit of using a memoizable function is to improve performance for complex
queries, such as multiple column lookups in [mapping tables](https://en.wikipedia.org/wiki/Associative_entity) referenced within a row
access policy or masking policy.

Policy owners (e.g. the role with the OWNERSHIP privilege on the row access policy) can update their policy conditions to replace
subqueries that have mapping tables with a memoizable function. When users reference the policy-protected column in a query later, the
cached results from the memoizable function are available to use as needed.

> **Note:**
>
> The [USE_CACHED_RESULT](../../../sql-reference/parameters.md) session parameter must be set to TRUE to use memoizable functions.

### Create a memoizable function

You can define a scalar SQL UDF to be memoizable in the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement by specifying the
`MEMOIZABLE` keyword. You can create a memoizable to function with or without arguments. By using arguments, you have more freedom to
define the SQL UDF. When you write a policy to call the memoizable function, you have more freedom in terms of how to define the policy.

If you specify arguments, the arguments must be constant values with one of the following data types:

* VARCHAR and other string data types.
* NUMBER and other numeric data types.
* TIMESTAMP and other date data types.
* BOOLEAN.

Nonconstant values and their data types, such as [semi-structured data types](../../../user-guide/semistructured-data-formats.md) and table
columns are not supported.

When you write a memoizable function:

* Specify BOOLEAN or other scalar data types as the `result_data_type`.

  Exercise caution when specifying ARRAY as the `result_data_type` because there are limits to cache size.
* Do not specify other data types such as OBJECT and VARIANT.
* Do not reference another memoizable function in any way.

### Call a memoizable function

A memoizable function can be called in a SELECT statement or be included in a policy definition, which then calls the memoizable function
based on the policy conditions.

When calling a memoizable function, note:

* For SQL UDFs that return the ARRAY data type or specify a non-scalar value, use the memoizable function as an argument in the
  [ARRAY_CONTAINS](../../../sql-reference/functions/array_contains.md) function.
* Cache size limit:

  Each memoizable function has a 10 KB limit for the current Snowflake session.

  If the memoizable function exceeds this limit for result set cache, Snowflake does not cache the result of calling the
  memoizable function. Instead, the UDF acts as a normal scalar UDF based on how the function is written.
* Cache usage:

  Memoizable functions have a reusable result cache for different SQL statements when the query environment and context do not
  change. Generally, this means the result cache applies to different SQL statements provided that:

  + The access control authorization on objects and columns referenced in a query remain the same.
  + The objects referenced in the query are not modified (e.g. through DML statements).

  The CHILD_QUERIES_WAIT_TIME column in the Account Usage [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) view records
  the time (in milliseconds) to complete the cached lookup when calling a memoizable function.
* Memoizable functions do not reuse cached results when:

  + The function references a table or other object and there is an update to the referenced table.
  + There is a change in access control to the table.
  + The function calls nondeterministic function.
  + The function calls an external function or a UDF that is not a SQL UDF.

## Examples

### Basic SQL scalar UDF example(s)

This example returns a hard-coded approximation of the mathematical constant pi.

```sqlexample
CREATE FUNCTION pi_udf()
  RETURNS FLOAT
  AS '3.141592654::FLOAT'
  ;
```

```sqlexample
SELECT pi_udf();
```

Output:

```sqlexample
SELECT pi_udf();
+-------------+
|    PI_UDF() |
|-------------|
| 3.141592654 |
+-------------+
```

### Common SQL examples

#### Query expression with [SELECT](../../../sql-reference/sql/select.md) statement

Create the table and data to use:

```sqlexample
CREATE TABLE purchases (number_sold INTEGER, wholesale_price NUMBER(7,2), retail_price NUMBER(7,2));
INSERT INTO purchases (number_sold, wholesale_price, retail_price) VALUES
   (3,  10.00,  20.00),
   (5, 100.00, 200.00)
   ;
```

Create the UDF:

```sqlexample
CREATE FUNCTION profit()
  RETURNS NUMERIC(11, 2)
  AS
  $$
    SELECT SUM((retail_price - wholesale_price) * number_sold)
        FROM purchases
  $$
  ;
```

Call the UDF in a query:

```sqlexample
SELECT profit();
```

Output:

```sqlexample
SELECT profit();
+----------+
| PROFIT() |
|----------|
|   530.00 |
+----------+
```

#### UDF in a WITH clause

```sqlexample
CREATE TABLE circles (diameter FLOAT);

INSERT INTO circles (diameter) VALUES
    (2.0),
    (4.0);

CREATE FUNCTION diameter_to_radius(f FLOAT)
  RETURNS FLOAT
  AS
  $$ f / 2 $$
  ;
```

```sqlexample
WITH
    radii AS (SELECT diameter_to_radius(diameter) AS radius FROM circles)
  SELECT radius FROM radii
    ORDER BY radius
  ;
```

Output:

```sqlexample
+--------+
| RADIUS |
|--------|
|      1 |
|      2 |
+--------+
```

#### JOIN operation

This example uses a more complex query, which includes a JOIN operation:

Create the table and data to use:

```sqlexample
CREATE TABLE orders (product_ID varchar, quantity integer, price numeric(11, 2), buyer_info varchar);
CREATE TABLE inventory (product_ID varchar, quantity integer, price numeric(11, 2), vendor_info varchar);
INSERT INTO inventory (product_ID, quantity, price, vendor_info) VALUES
  ('X24 Bicycle', 4, 1000.00, 'HelloVelo'),
  ('GreenStar Helmet', 8, 50.00, 'MellowVelo'),
  ('SoundFX', 5, 20.00, 'Annoying FX Corporation');
INSERT INTO orders (product_id, quantity, price, buyer_info) VALUES
  ('X24 Bicycle', 1, 1500.00, 'Jennifer Juniper'),
  ('GreenStar Helmet', 1, 75.00, 'Donovan Liege'),
  ('GreenStar Helmet', 1, 75.00, 'Montgomery Python');
```

Create the UDF:

```sqlexample
CREATE FUNCTION store_profit()
  RETURNS NUMERIC(11, 2)
  AS
  $$
  SELECT SUM( (o.price - i.price) * o.quantity)
    FROM orders AS o, inventory AS i
    WHERE o.product_id = i.product_id
  $$
  ;
```

Call the UDF in a query:

```sqlexample
SELECT store_profit();
```

Output:

```sqlexample
SELECT store_profit();
+----------------+
| STORE_PROFIT() |
|----------------|
|         550.00 |
+----------------+
```

The topic [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) contains additional examples.

### Using UDFs in different clauses

A scalar UDF can be used any place a scalar expression can be used. For example:

```sqlexample
-- ----- These examples show a UDF called from different clauses ----- --

select MyFunc(column1) from table1;

select * from table1 where column2 > MyFunc(column1);
```

### Using SQL variables in a UDF

This example shows how to set a SQL variable and use that variable inside a UDF:

```sqlexample
SET id_threshold = (SELECT COUNT(*)/2 FROM table1);
```

```sqlexample
CREATE OR REPLACE FUNCTION my_filter_function()
RETURNS TABLE (id int)
AS
$$
SELECT id FROM table1 WHERE id > $id_threshold
$$
;
```

### Memoizable functions

For examples, see:

* Memoizable function without arguments in a [row access policy](../../../user-guide/security-row-using.md).
* Memoizable function with arguments in a [masking policy](../../../user-guide/security-column-ddm-use.md).

---
title: Security Practices for UDFs and Procedures
source: https://docs.snowflake.com/en/developer-guide/udf-stored-procedure-security-practices.md
section: Developer Guide
---

# Security Practices for UDFs and Procedures

This topic describes best practices for writing secure user-defined functions (UDFs) and procedures.

## Practices for UDF Handlers

Your function or method (and any library functions or methods that you call) must act as a pure function, acting only on the data it receives
and returning a value based on that data, without causing side-effects. Your code should not attempt to affect the
state of the underlying system, other than consuming a reasonable amount of memory and processor time.

## Practices for Procedure and UDF Handlers

Handler code executes within a restricted engine. Neither your code nor the code in library methods that you use
should employ any prohibited system calls, including:

* Access to the file system on which the handler is running.

  With the following exceptions, a handler should not read or write files:

  + A handler can read staged files specified in the IMPORTS clause. For more information, see [CREATE FUNCTION](../sql-reference/sql/create-function.md)
    or [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md).
  + A handler can write files, such as log files, to the `/tmp` directory.

    Each query gets its own memory-backed file system in which its own `/tmp` is stored, so different queries cannot
    have file name conflicts.

    However, conflicts within a query are possible if a single query calls more than one UDF, and those UDFs
    try to write to the same file name.

    > **Note:**
    >
    > Also, because Python UDFs may execute in separate worker processes in parallel, you should
    > be careful about writing into the /tmp directory.
    >
    > For more on writing files, see [Writing files](udf/python/udf-python-examples.md). For an example, see
    > [Unzipping a staged file](udf/python/udf-python-examples.md).
  + A handler can write files to stages using user-defined functions (UDFs), vectorized UDFs, user-defined table functions (UDTFs), and vectorized UDTFs. For more information, see [Writing files from Snowpark Python UDFs and UDTFs](snowpark/python/creating-udfs.md).
* Network access.

  You can’t use a handler to create sockets, but you can use a handler to
  [access resources on an external network](external-network-access/external-network-access-overview.md).

  > **Note:**
  >
  > You cannot use the code in the Snowflake JDBC Driver to access the database. Your UDF cannot itself act as a client of Snowflake.

### For Handlers in Java or Scala

* When used within a [government region](../user-guide/intro-regions.md), Java UDFs support encryption algorithms that are validated to meet
  the Federal Information Processing Standard (140-2) (FIPS 140-2) requirements. Only cryptographic algorithms that are allowed in the
  FIPS approved mode of the BouncyCastle cryptography API for Java can be used.
  For information about FIPS 140-2, see [FIPS 140-2](https://csrc.nist.gov/publications/detail/fips/140/2/final).

---
title: Selecting from a stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-selecting-from.md
section: Developer Guide
---

# Selecting from a stored procedure

Some stored procedures return tabular data. To select and manipulate this tabular data, you can call these
stored procedures in the [FROM](../../sql-reference/constructs/from.md) clause of a SELECT statement.

## Run a SELECT statement with the TABLE keyword

When calling the stored procedure, omit the [CALL](../../sql-reference/sql/call.md) command. Instead, use the TABLE keyword,
and name the procedure inside parentheses:

```sqlsyntax
SELECT ... FROM TABLE( <stored_procedure_name>( <arg> [ , <arg> ... ] ) );
```

## Example that selects from a stored procedure

This example uses the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE orders (
  order_id INT,
  u_id VARCHAR,
  order_date DATE,
  order_amount NUMBER(12,2));

INSERT INTO orders VALUES (1, 'user_id_001', current_date, 500.00);
INSERT INTO orders VALUES (2, 'user_id_003', current_date, 225.00);
INSERT INTO orders VALUES (3, 'user_id_001', current_date, 725.00);
INSERT INTO orders VALUES (4, 'user_id_002', current_date, 150.00);
INSERT INTO orders VALUES (5, 'user_id_002', current_date, 900.00);
```

The following stored procedure returns order information based on a user ID:

```sqlexample
CREATE OR REPLACE PROCEDURE find_orders_by_user_id(user_id VARCHAR)
RETURNS TABLE (
  order_id INT, order_date DATE, order_amount NUMBER(12,2)
)
LANGUAGE SQL AS
DECLARE
  res RESULTSET;
BEGIN
  res := (SELECT order_id, order_date, order_amount FROM orders WHERE u_id = :user_id);
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE find_orders_by_user_id(user_id VARCHAR)
RETURNS TABLE (
  order_id INT, order_date DATE, order_amount NUMBER(12,2)
)
LANGUAGE SQL AS
$$
DECLARE
  res RESULTSET;
BEGIN
  res := (SELECT order_id, order_date, order_amount FROM orders WHERE u_id = :user_id);
  RETURN TABLE(res);
END;
$$
;
```

The following SELECT statement retrieves the stored procedure’s results:

```sqlexample
SELECT * FROM TABLE(find_orders_by_user_id('user_id_001'));
```

```output
+----------+------------+--------------+
| ORDER_ID | ORDER_DATE | ORDER_AMOUNT |
|----------+------------+--------------|
|        1 | 2024-08-30 |       500.00 |
|        3 | 2024-08-30 |       725.00 |
+----------+------------+--------------+
```

## Limitations for selecting from a stored procedure

The following limitations apply to selecting from a stored procedure:

* Only stored procedures that perform SELECT, SHOW, DESCRIBE, or CALL statements can be placed in the FROM clause
  of a SELECT statement. Stored procedures that make modifications using DDL or DML operations aren’t allowed.
  For stored procedures that issue CALL statements, these limitations apply to the stored procedures that are called.
* Only stored procedures that return tabular data with a static output schema can be placed in the FROM clause
  of a SELECT statement. The output columns must be named and typed. For example, a stored procedure with the
  following RETURNS clause is supported:

  ```sqlexample
  RETURNS TABLE (col1 INT, col2 STRING)
  ```

  A stored procedure with the following RETURNS clause is not supported because it doesn’t return tabular data:

  ```sqlexample
  RETURNS STRING
  ```

  A stored procedure with the following RETURNS clause is not supported because it doesn’t provide
  a fixed output schema:

  ```sqlexample
  RETURNS TABLE()
  ```
* The stored procedure must be called in the FROM clause of a SELECT block in one of the following statements:

  + [SELECT](../../sql-reference/sql/select.md)
  + [INSERT](../../sql-reference/sql/insert.md), [UPDATE](../../sql-reference/sql/update.md), [DELETE](../../sql-reference/sql/delete.md), or [MERGE](../../sql-reference/sql/merge.md)
  + [CREATE TABLE AS SELECT](../../sql-reference/sql/create-table.md)
* The stored procedure can’t accept correlated input arguments from their outer scope, such as a reference to any
  [CTE](../../user-guide/queries-cte.md) defined outside of the SELECT statement.
* If an argument contains a subquery, then that subquery can’t use a CTE defined by the WITH clause.
* A SELECT statement containing a stored procedure call can’t be used in the body of a view, a user-defined function (UDF),
  a user-defined table function (UDTF), or in objects such as [row access policies](../../user-guide/security-row-intro.md) and
  [data masking policies](../../user-guide/security-column-intro.md).
* You can’t use [bind variables](../../sql-reference/bind-variables.md) in a SELECT statement that calls a stored
  procedure. For example, the following SELECT statements aren’t allowed:

  ```sqlexample
  SELECT * FROM TABLE(my_stored_procedure(?));

  SELECT * FROM TABLE(my_stored_procedure('a')) WHERE my_var = :var2;
  ```

---
title: Setting levels for logging, metrics, and tracing
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/telemetry-levels.md
section: Developer Guide
---

# Setting levels for logging, metrics, and tracing

You can set the threshold levels for log messages, log events, trace data, or metrics data captured in an event table.

Each kind of telemetry data supports its own set of levels that are specific to its purpose. You can set these levels by using the
[parameter](../../sql-reference/parameters.md) Snowflake provides for each. You can also set some levels using Snowsight, which
represents the level parameters in a simplified way.

For each kind of telemetry data, you can do the following:

* Set levels specific to that kind of data.
* Set system-wide levels that are in effect unless overridden.
* Override system-wide levels by setting the level for a session or on specific objects (such as procedures and UDFs).

  Levels are represented as both [session parameters](../../sql-reference/parameters.md) and [object parameters](../../sql-reference/parameters.md).

> **Note:**
>
> You can use handler code to override the log level you set with SQL (as described in this topic) when your handler is written in Python.
> For more information, see [Overriding log threshold levels with Python](logging-python.md).

## Scope

For each type of telemetry data, you can set levels so that they’re in effect at the scope that best suits your requirements. In many
cases, you can override levels set at a larger scope by setting them at a smaller scope, as described in How Snowflake determines the level in effect.
For example, you might want to have a set of default levels at the account scope and then set different levels for objects in a particular
database.

You can set each of these in the following scopes:

Account:
:   A level set for the account is in effect everywhere in the account except where overridden by being set at the object or session level.

Object:
:   You can set the telemetry levels on the following kinds of objects:

    * Database or schema containing procedures and functions
    * Stored procedure
    * User-defined function (UDF) or a user-defined table function (UDTF)
    * Externally managed Apache Iceberg™ table with automated refresh configured

    For example, to set the log level for log messages from logging APIs for a specific UDF, use [ALTER FUNCTION](../../sql-reference/sql/alter-function.md) to set
    the LOG_LEVEL parameter for that UDF. As another example, to set the default log level for all functions and procedures in a database, use
    [ALTER DATABASE](../../sql-reference/sql/alter-database.md) to set the LOG_LEVEL parameter on that database. As another example, to set the log event level for a
    specific externally managed Iceberg table with automated refresh configured, use [ALTER ICEBERG TABLE](../../sql-reference/sql/alter-iceberg-table.md) to set the
    LOG_EVENT_LEVEL parameter on that table. Use [ALTER <object>](../../sql-reference/sql/alter.md) commands to set the LOG_EVENT_LEVEL parameter for other
    objects that emit log events (record type EVENT).

    > **Note:**
    >
    > You can’t set the level on Streamlit objects. Instead, set the level on the database or schema that contains the object.

Session:
:   You can set the telemetry level for calls to functions and procedures made in the current session.

## Levels

You can set the following levels for each kind of telemetry data:

Log messages:
:   When you set a level, only log messages from logging APIs at that level and more severe levels are captured in an event table and visible in
    Snowsight. For example, setting the LOG_LEVEL parameter to WARN means that log messages at the WARN, ERROR, and FATAL levels are
    captured in the event table.

    Set the [LOG_LEVEL](../../sql-reference/parameters.md) parameter.

Log events:
:   When you set a level, only log events (record type EVENT) at that level and more severe levels are captured in an event table. Examples
    include events from Snowpipe, tasks, dynamic tables, Snowpark Container Services compute pools, Iceberg tables, and data governance tag
    activity.

    Set the [LOG_EVENT_LEVEL](../../sql-reference/parameters.md) parameter.

Metrics:
:   You can currently have all metrics data captured or none.

    Set the [METRIC_LEVEL](../../sql-reference/parameters.md) parameter.

Tracing:
:   You can specify the following characteristics:

    * Scope of trace event data stored in the event table

      Set the [TRACE_LEVEL](../../sql-reference/parameters.md) parameter.
    * Whether to capture SQL text in a traced SQL statement

      This is determined by the [SQL_TRACE_QUERY_TEXT](../../sql-reference/parameters.md) parameter. For more information, see [SQL statement tracing](tracing.md).

## Privileges needed

To set levels on an object, you must use a role that is granted or inherits the privileges described in this section.

For example, the code in the following example grants privileges needed for someone using the `central_log_admin` role to set the
log level for the account:

```sqlexample
GRANT MODIFY LOG LEVEL ON ACCOUNT TO ROLE central_log_admin;
```

For more information about privileges, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

| Level to modify | Parameter to set | Privileges needed |
| --- | --- | --- |
| Log level (log messages) | [LOG_LEVEL](../../sql-reference/parameters.md) | **Account**   * MODIFY LOG LEVEL on the account   **Object**   * MODIFY LOG LEVEL on the account * MODIFY on the object for which you want to set the level * USAGE on the database or schema containing the procedure or UDF for which you want to set the level   **Session**   * MODIFY SESSION LOG LEVEL |
| Log event level | [LOG_EVENT_LEVEL](../../sql-reference/parameters.md) | **Account**   * MODIFY LOG EVENT LEVEL on the account   **Object**   * MODIFY LOG EVENT LEVEL on the account * MODIFY on the object for which you want to set the level * USAGE on the database or schema containing the object for which you want to set the level   **Session**   * MODIFY SESSION LOG EVENT LEVEL |
| Metric level | [METRIC_LEVEL](../../sql-reference/parameters.md) | **Account**   * MODIFY METRIC LEVEL on the account   **Object**   * MODIFY METRIC LEVEL on the account * MODIFY on the object for which you want to set the level * USAGE on the database or schema containing the procedure or UDF for which you want to set the level   **Session**   * MODIFY SESSION METRIC LEVEL |
| Trace level | [TRACE_LEVEL](../../sql-reference/parameters.md) | **Account**   * MODIFY TRACE LEVEL on the account   **Object**   * MODIFY TRACE LEVEL on the account * MODIFY on the object for which you want to set the level * USAGE on the database or schema containing the procedure or UDF for which you want to set the level   **Session**   * MODIFY SESSION TRACE LEVEL |
| SQL text in SQL tracing | [SQL_TRACE_QUERY_TEXT](../../sql-reference/parameters.md) | **Account**   * SQL_TRACE_QUERY_TEXT on the account |

## Setting telemetry levels

You can set telemetry levels using either SQL or, in some cases, Snowsight. In many cases, you can override levels set at a
larger scope by setting them at a smaller scope, as described in How Snowflake determines the level in effect.

Before you set levels, verify that you have access to a role with the privileges needed.

SnowsightSQL

You can use Snowsight to set telemetry levels at the account level.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Traces & logs.
3. On the Traces & Logs page, select Set Event Level.
4. For Set logging & tracing for, select the scope you want from one of the following:

   * Account
   * The database and, optionally, the schema
5. Select levels for the telemetry data you want to adjust.

   |  |  |
   | --- | --- |
   | All Events | On to turn on collection for all kinds of telemetry data; Off to turn off collection for all kinds of data. |
   | Traces | On to set trace data collection to `ALWAYS`; Off to set trace data collection to `OFF`. For information about levels, see [TRACE_LEVEL](../../sql-reference/parameters.md). |
   | Logs | On to set log data collection to `INFO`. For information about levels, see [LOG_LEVEL](../../sql-reference/parameters.md). |
   | Metrics | On to set trace data collection to `ALL`; Off to set trace data collection to `NONE`. For information about levels, see [METRIC_LEVEL](../../sql-reference/parameters.md). |

You can use SQL to set telemetry levels for the account and for objects such as databases, functions, procedures, and externally managed
Iceberg tables with automated refresh configured.

AccountObjectSession

Use the [ALTER ACCOUNT](../../sql-reference/sql/alter-account.md) command to set the appropriate parameter, based on the telemetry data you want to collect.

The following example sets the log level to ERROR for the account:

```sqlexample
-- Set the log level on the account
ALTER ACCOUNT SET LOG_LEVEL = ERROR;
```

To set the LOG_LEVEL parameter on the object, use the [ALTER <object>](../../sql-reference/sql/alter.md) command.

The following example sets the log level to ERROR for all functions and procedures in the database `db`. The example
overrides this level to WARN for the UDF `f1(int)`.

```sqlexample
USE ROLE central_log_admin;

-- Set the log levels on a database and UDF.
ALTER DATABASE db1 SET LOG_LEVEL = ERROR;
ALTER FUNCTION f1(int) SET LOG_LEVEL = WARN;

-- Set the log levels on a Snowpark Container Services service.
ALTER SERVICE test_service SET LOG_LEVEL = ERROR;
```

For details on how Snowflake determines the effective log level when the LOG LEVEL is set on different objects, see
How Snowflake determines the level in effect.

To set the LOG_LEVEL parameter for the current session, use the [ALTER SESSION](../../sql-reference/sql/alter-session.md) command.

```sqlexample
USE ROLE developer_debugging;

-- Set the logging level to DEBUG for the current session.
ALTER SESSION SET LOG_LEVEL = DEBUG;
```

If the level parameter is set to different levels for the current session and on the functions and procedures called in that
session, Snowflake determines the effective level to use. See How Snowflake determines the level in effect.

## How Snowflake determines the level in effect

You can override the telemetry level-related parameters (for both [objects](../../sql-reference/parameters.md)
and [sessions](../../sql-reference/parameters.md)) by using a [hierarchy of levels](../../sql-reference/parameters.md).

For example, you can set a level to one value for the account, then override it by setting the level for an object, which is lower in the
hierarchy.

The following describes the hierarchy for session and object level parameters.

* For session parameters, the hierarchy is Account » User » Session.

  This means that you can set the parameter for an account, override the account-level parameter for a user, and override the
  user-level parameter for the current session.
* For object parameters, the hierarchy is Account » Database » Schema » Object.

  This means that you can set the parameter for an account, override the account-level parameter for a database or schema, and
  override the database- or schema-level parameter for specific stored procedures and UDFs in that database or schema.

For example, the LOG_LEVEL for log messages for a function overrides the LOG_LEVEL for the account that contains the function. If the
LOG_LEVEL for the account is FATAL and the LOG_LEVEL for the Java UDF in the account is INFO, the effective LOG_LEVEL is INFO (the level for
the function, not the account):

```sqlexample
ALTER ACCOUNT SET LOG_LEVEL = FATAL;

ALTER FUNCTION MyJavaUDF SET LOG_LEVEL = INFO;

-- The INFO log level is used because the FUNCTION MYJAVAUDF
-- is lower than the ACCOUNT in the hierarchy.
```

In cases where the level is set in both the session and object parameter hierarchies, the most verbose level is used.

* For LOG_LEVEL (log messages from logging APIs), the following table lists examples of how parameters set on the session and object affect
  the log level used.

  | Value for the Session | Value for the Object, Schema, Database, or Account | LOG_LEVEL Used |
  | --- | --- | --- |
  | (unset) | `WARN` | `WARN` |
  | `DEBUG` | (unset) | `DEBUG` |
  | `WARN` | `ERROR` | `WARN` |
  | `INFO` | `DEBUG` | `DEBUG` |
  | (unset) | (unset) | `OFF` |

  The same precedence rules apply to LOG_EVENT_LEVEL for log events (record type EVENT).
* For metric level — `ALL` overrides `NONE`.
* For trace level — `ALWAYS` overrides `ON_EVENT` and `OFF`; `ON_EVENT` overrides `OFF`.

---
title: Setting up Snowflake to use Git
source: https://docs.snowflake.com/en/developer-guide/git/git-setting-up.md
section: Developer Guide
---

# Setting up Snowflake to use Git

When you connect your Snowflake account to a remote Git repository, Snowflake creates a Git repository clone, copying the latest version
of all files in the repository (a shallow clone) and storing metadata about the location of the remote repository, credentials (if needed),
and configuration details about how Snowflake should interact with the Git repository API.

By configuring components for authentication, interaction with the Git API, and communication over a private
link between Snowflake and your cloud service provider, you can set up Snowflake so that a remote Git repository becomes an
integral part of your workflow within Snowflake.

## Choose a configuration model

Depending on your network and workflow requirements, you can configure Snowflake for access to a remote Git repository in any of several
ways. The following lists example use cases, along with the repository access strategies you might use to support them.

* Work with files on a Git repository through a workflow that includes pulling, pushing, and creating files.

  When using [Snowflake Workspaces](../../user-guide/ui-snowsight/workspaces-git.md), you can configure an API Integration for OAuth2 to simplify user
  authentication to Git repositories.
* Reference files on a Git repository as part of a data pipeline or ML project.

  If a scripted process will access the repository, consider authenticating using a token.
* Get started by cloning a public repository (including Snowflake Labs) to run SQL scripts or notebook files in
  [Snowflake Workspaces](../../user-guide/ui-snowsight/workspaces-git.md).

  You can use Workspaces for `.sql` files, [Snowflake notebooks](../../user-guide/ui-snowsight/notebooks-snowgit.md) for `.ipynb`
  files, or [Snowflake Workspaces](../../user-guide/ui-snowsight/workspaces-git.md) for `.py` files.

The following describes options in terms of whether you want access over a public network or a private network:

| Access over a public network | Access over a private network |
| --- | --- |
| Access over a public network allows you to authenticate to your remote Git repository server over the public internet. If your Git server uses IP-based allowlisting, Snowflake can route Git traffic through stable egress IP addresses on supported cloud providers. For details, see [Securing ingress of Snowflake requests with egress IP addresses](../../user-guide/egress-ip/network-egress.md).   1. Configure Snowflake for access to the repository.  Choose one of the following authentication methods:     * No authentication.  Configure an API integration with details about the Git repository server. You don’t provide credentials.    * Authenticate with a token, such as a personal access token.  Configure a secret containing the username and token to use, then configure an API integration that allows Snowflake to use the      secret when authenticating.    * Authenticate through an OAuth flow.  Configure an API integration to support OAuth2 authentication. In this case, you don’t need to create a secret. 2. Create a Git repository clone to which you can synchronize files from the    remote repository. | Access over a private network routes Git traffic through a dedicated outbound private link connection instead of the public internet. Use this when your organization requires full network isolation between Snowflake and your Git server.   1. Configure the private link connection.  Before you can configure Snowflake for access to the remote Git repository, you’ll need to set up a private link between Snowflake and    your cloud service provider. 2. Configure Snowflake access to the remote Git repository.  After you’ve set up private link between Snowflake and your cloud service provider, you can configure Snowflake access to the remote    Git repository. 3. Create a Git repository clone to which you can synchronize files from the    remote repository. |

## Configure Snowflake for access over a public network

You can set up Snowflake to access your Git repository over a public network. If your Git server
uses IP-based allowlisting, see [Securing ingress of Snowflake requests with egress IP addresses](../../user-guide/egress-ip/network-egress.md) to configure stable
egress IPs for Snowflake Git traffic.

You can have Snowflake authenticate using any of the following strategies:

* No authentication.

  Configure an API integration with details about the Git repository server.
* Authenticate with a token, such as a personal access token.

  Configure a secret containing the username and token to use, then configure an API integration that allows Snowflake to use the
  secret when authenticating.
* Authenticate through an OAuth flow.

  Configure an API integration to allow for an OAuth2 flow.

  [Preview Feature](../../release-notes/preview-features.md) — Open

  OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).

  OAuth support is in preview for repository providers other than github.com.

### Configure for no authentication

To set up Snowflake to use a Git repository without authenticating, follow these steps:

1. Create an API integration that supports access without authenticating, and specify the following details:

   * `git_https_api` as the value of the API_PROVIDER parameter
   * HTTPS endpoints to which requests must be limited as values of the API_ALLOWED_PREFIXES parameter

   For more information, see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md).

   ```sqlexample
   CREATE OR REPLACE API INTEGRATION my_git_api_integration
     API_PROVIDER = git_https_api
     API_ALLOWED_PREFIXES = ('https://example.com/my-account')
     ENABLED = TRUE;
   ```
2. Create a Git repository clone as described in Create a Snowflake Git repository clone.

### Configure for authenticating with a token

To have Snowflake authenticate with the Git repository by using a username and token such as a personal access token (PAT), follow
these steps:

1. Provide credentials in a [basic authentication secret](../../sql-reference/sql/create-secret.md).

   To provide the credentials that Snowflake uses to authenticate with the repository, create a secret that contains the following:

   * A TYPE value of `password`
   * A username and token, such as a personal access token (PAT)

     If your Git repository is hosted on Bitbucket, specify `x-token-auth` as the username value.

     > **Note:**
     >
     > For information about creating a personal access token in GitHub, see
     > [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens)
     > in the GitHub documentation.

   For more information on the SQL command for creating a secret, see the [CREATE SECRET](../../sql-reference/sql/create-secret.md).

   Code in the following example creates a secret called `my_git_secret` with a username and the user’s personal access token to use as
   credentials:

   ```sqlexample
   CREATE OR REPLACE SECRET db.schema.my_git_secret
     TYPE = password
     USERNAME = 'gladyskravitz'
     PASSWORD = 'ghp_token';
   ```
2. Create an API integration that supports authenticating with a token.

   To create an API integration for access to a Git repository without authenticating, specify the following details:

   * `git_https_api` as the value of the API_PROVIDER parameter
   * HTTPS endpoints to which requests must be limited as values of the API_ALLOWED_PREFIXES parameter

   For more information, see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md).

   ```sqlexample
   CREATE OR REPLACE API INTEGRATION my_git_api_integration
     API_PROVIDER = git_https_api
     API_ALLOWED_PREFIXES = ('https://github.com/my-account')
     ALLOWED_AUTHENTICATION_SECRETS = (my_git_secret)
     ENABLED = TRUE;
   ```
3. Create a Git repository clone as described in Create a Snowflake Git repository clone.

### Configure for authenticating with OAuth

[Preview Feature](../../release-notes/preview-features.md) — Open

OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).

OAuth support is in preview for repository providers other than github.com.

You can configure Snowflake to authenticate with the remote Git repository using an OAuth2 flow. How you set up for OAuth2 authentication
differs depending on the repository provider.

* If you’re using GitHub, you can create an API integration that uses the Snowflake GitHub App to authenticate.

  The Snowflake GitHub App is a pre-configured OAuth2 application used by Snowflake and designed to simplify authentication. You don’t
  need to configure this app; you only need to create an API integration that specifies the [Snowflake GitHub App](https://github.com/apps/snowflakedb).
* For all repository providers, including GitHub, you can instead create an API integration that specifies values for OAuth2 parameters,
  including the client ID and secret, to use when authenticating.

  Before you create the API integration, collect OAuth2 parameters for your repository provider, including the client ID and secret.
  You’ll specify these values in the API integration.

  For more information, see the repository provider’s documentation.

To set up Snowflake so that it authenticates with the remote Git repository using an OAuth2 flow, follow these steps:

1. Create an API integration that supports authenticating through OAuth2.

   Create an API integration that specifies the following values:

   * An API_PROVIDER parameter value of `git_https_api`
   * An API_ALLOWED_PREFIXES parameter value that specifies the HTTPS endpoints to which requests must be limited
   * An API_USER_AUTHENTICATION value that corresponds to the Git repository provider you’re using

     + When authenticating with GitHub using the Snowflake GitHub App, specify `(TYPE = SNOWFLAKE_GITHUB_APP)`.
     + When authenticating with a repository provider without using the Snowflake GitHub App — such as with any repository provider other
       than GitHub — specify values for the following parameters, as described in
       [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md):

       - OAUTH_CLIENT_ID
       - OAUTH_CLIENT_SECRET
       - API_USER_AUTHENTICATION
       - OAUTH_AUTHORIZATION_ENDPOINT
       - OAUTH_TOKEN_ENDPOINT
       - OAUTH_ACCESS_TOKEN_VALIDITY
       - OAUTH_REFRESH_TOKEN_VALIDITY
       - OAUTH_ALLOWED_SCOPES

   Code in the following examples creates an API integration called `my_git_api_integration`:

   GitHub appOAuth2 parameters

   ```sqlexample
   CREATE OR REPLACE API INTEGRATION my_git_api_integration
     API_PROVIDER = git_https_api
     API_ALLOWED_PREFIXES = ('https://github.com')
     API_USER_AUTHENTICATION = (TYPE = SNOWFLAKE_GITHUB_APP)
     ENABLED = TRUE;
   ```

   ```sqlexample
   CREATE OR REPLACE API INTEGRATION my_git_api_integration
     API_PROVIDER = git_https_api
     API_ALLOWED_PREFIXES = ('https://example.com/my_account')
     API_USER_AUTHENTICATION = (
       TYPE = OAUTH2
       OAUTH_AUTHORIZATION_ENDPOINT = '<your_oauth_authorization_endpoint>'
       OAUTH_TOKEN_ENDPOINT = '<your_oauth_token_endpoint>'
       OAUTH_CLIENT_ID = '<your_oauth_client_id>'
       OAUTH_CLIENT_SECRET = '<your_oauth_client_secret>'
       OAUTH_ACCESS_TOKEN_VALIDITY = 3600
       OAUTH_REFRESH_TOKEN_VALIDITY = 2592000
       OAUTH_ALLOWED_SCOPES = ( 'read_api', 'read_repository', 'write_repository' )
     )
     ENABLED = TRUE;
   ```
2. Create a workspace connected to a Git repository as described in [Create a Git workspace](../../user-guide/ui-snowsight/workspaces-git.md).

## Configure Snowflake for access over a private network

You can configure Snowflake to establish connectivity through an outbound private link connection between Snowflake and your cloud
infrastructure. Snowflake routes Git traffic through this connection to the Git repository server.

With a private link connection, Snowflake routes Git traffic through a dedicated private
network connection, avoiding the public internet entirely. This section describes the steps at a high level.

1. Configure the private link connection.

   You’ll apply configuration changes to both Snowflake and your cloud service infrastructure. This topic describes the steps on the
   Snowflake side. For details about all the steps, including about configuring your cloud service provider, see the knowledge base article
   [Configuring Git Integration with Snowflake over Private Link](https://community.snowflake.com/s/article/Configuring-Git-Integration-with-Snowflake-over-Private-Link).
2. Configure Snowflake access to the remote Git repository.

> **Note:**
>
> Snowflake supports only connections within the same cloud and region. For example, if your Snowflake deployment is on AWS in the
> us-west-2 region, then your other components must also be in that region.

### Configure the private link connection

Before you can configure Snowflake for access to the remote Git repository, you must set up a private link between Snowflake and
your cloud service provider.

To apply configuration changes to both Snowflake and your infrastructure, follow these steps:

1. In your cloud service provider, create a private link service to receive requests from the Snowflake private endpoint service.

   For details, see the knowledge base article
   [Configuring Git Integration with Snowflake over Private Link](https://community.snowflake.com/s/article/Configuring-Git-Integration-with-Snowflake-over-Private-Link).
2. In Snowflake, provision a private endpoint that will reach your infrastructure through a private IP.

   To provision the endpoint, use the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](../../sql-reference/functions/system_provision_privatelink_endpoint.md) function with the following
   two arguments:

   * Your cloud provider’s private link service ID
   * Your Git server’s domain name

   AWSAzureGoogle Cloud

   ```sqlexample
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'com.amazonaws.vpce.us-west-2.vpce-svc-xxx', // VPC Endpoint Service Name
     'git_address.com' // Git server domain
   );
   ```

   ```sqlexample
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
   '/subscriptions/9217bbdd-434e-4dbb-97c2-0825c627a277/resourceGroups/git-server_group/providers/Microsoft.Network/privateLinkServices/git-server-pl-service', // Private Service ID
     'git_address.com' // Git server domain
   );
   ```

   ```sqlexample
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     'projects/my-google-project/regions/us-east4/serviceAttachments/gitservice', // Service attachement field
     'git_address.com' // Git server domain
   );
   ```
3. In your cloud service provider, accept the Snowflake private endpoint setup to finish setting up the private link connection.
4. To check status of the provisioning, call the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](../../sql-reference/functions/system_get_privatelink_endpoints_info.md)
   system function.

### Configure Snowflake access to the remote Git repository

After you set up a private link between Snowflake and your cloud service provider, you can configure Snowflake access to the remote
Git repository.

1. Create an API integration that supports authenticating with a certificate.

   Because Snowflake will reach your Git server using the HTTPS protocol, the domain name needs to have a valid certificate. The
   configuration you use differs depending on whether you use a self-signed certificate or a certificate signed by a certificate authority.

   * Using a self-signed certificate:

     1. Provide credentials in a [generic string secret](../../sql-reference/sql/create-secret.md).

        This should be a public key of a self-signed domain to establish an HTTPS connection. To provide to Snowflake the credentials
        it will use to authenticate with the server, create a secret that contains the following details:

        + A TYPE parameter value of `GENERIC_STRING`
        + A public certificate string as the value of the SECRET_STRING parameter

          For the parameter’s value, specify a secret string, such as a public certificate body.

        ```sqlexample
        CREATE OR REPLACE SECRET my_public_certificate
          TYPE = GENERIC_STRING
          SECRET_STRING = '-----BEGIN CERTIFICATE-----
                    <certificate_body>
                    -----END CERTIFICATE-----';
        ```
     2. Create an API integration to integrate with the Git API, and specify the following details:

        + An API_PROVIDER parameter set to `git_https_api`
        + An API_ALLOWED_PREFIXES set to the base URL beneath which access is allowed
        + A USE_PRIVATELINK_ENDPOINT parameter set to `TRUE`
        + A TLS_TRUSTED_CERTIFICATES parameter set to the name of the secret you created, which contains the certificate

        For more information, see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md).

        ```sqlexample
        CREATE OR REPLACE API INTEGRATION my_git_api_integration
          API_PROVIDER = git_https_api
          API_ALLOWED_PREFIXES = ('https://example.com/my-account')
          ALLOWED_AUTHENTICATION_SECRETS = ALL
          USE_PRIVATELINK_ENDPOINT = TRUE
          TLS_TRUSTED_CERTIFICATES = (my_public_certificate)
          ENABLED = TRUE;
        ```
   * Using a certificate signed by a certificate authority:

     1. Create an API integration to integrate with the Git API, and specify the following details:

        + An API_PROVIDER parameter set to `git_https_api`
        + An API_ALLOWED_PREFIXES set to the base URL beneath which access is allowed
        + A USE_PRIVATELINK_ENDPOINT parameter set to `TRUE`
        + A TLS_TRUSTED_CERTIFICATES parameter set to the name of the secret you created, which contains the certificate

        For more information, see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md).

        ```sqlexample
        CREATE OR REPLACE API INTEGRATION my_git_api_integration
          API_PROVIDER = git_https_api
          API_ALLOWED_PREFIXES = ('https://example.com/my-account')
          ALLOWED_AUTHENTICATION_SECRETS = ALL
          USE_PRIVATELINK_ENDPOINT = TRUE
          ENABLED = TRUE;
        ```
2. Provide credentials in a [basic authentication secret](../../sql-reference/sql/create-secret.md).

   After successfully connecting to the Git server over private link, you must still authenticate with the repository by creating
   another secret that provides credentials for the repository.

   To provide the credentials that Snowflake uses to authenticate with the repository, create a secret that contains the following:

   * A TYPE value of `password`
   * A username and token, such as a personal access token (PAT)

     > **Note:**
     >
     > For information about creating a personal access token in GitHub, see
     > [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens)
     > in the GitHub documentation.

   For more information on the SQL command for creating a secret, see the [CREATE SECRET](../../sql-reference/sql/create-secret.md).
3. Create a Git repository clone as described in Create a Snowflake Git repository clone.

## Create a Snowflake Git repository clone

To set up Snowflake to work with a remote Git repository, create a Git repository clone in Snowflake to contain files fetched from the remote
repository.

> **Note:**
>
> Before beginning the steps in this section, consider first configuring components you might need, including a secret
> (if the remote repository requires authentication), an API integration, and private link connection between Snowflake and your cloud
> service provider.

> **Note:**
>
> For information on creating a Git workspace in Snowsight, see [Create a Git workspace](../../user-guide/ui-snowsight/workspaces-git.md).

A Git repository clone in Snowflake specifies the following details:

* The remote repository’s origin

  In Git, `origin` is the remote repository’s URL. Use that URL when setting up Snowflake to use a remote Git repository.
  The URL must use HTTPS. For example, you can retrieve the origin URL in the following ways:

  + In the GitHub user interface, you can get the origin URL from the repository home page. Select the Code button,
    and then copy the HTTPS URL from the box displayed beneath the button.
  + From the command line, use the `git config` command from within your local repository, as in the following example:

    ```shell
    $ git config --get remote.origin.url
    ```

    The command produces output such as the following:

    ```output
    https://github.com/my-account/snowflake-extensions.git
    ```

    For reference information about `git config`, see the [git documentation](https://git-scm.com/docs/git-config).
* Credentials, if needed, for Snowflake to use when authenticating with the repository

  For the GIT_CREDENTIALS parameter, specify a Snowflake [secret](../../sql-reference/sql/create-secret.md) you created.
* [An API integration](../../sql-reference/sql/create-api-integration.md) specifying details for Snowflake interaction with the
  repository API

You can create a Git repository clone by using either Snowsight or SQL.

SQLSnowsight

> **Note:**
>
> Before creating a Git repository clone, you’ll need to create [a secret](../../sql-reference/sql/create-secret.md) (if the remote
> repository requires authentication) and [an API integration](../../sql-reference/sql/create-api-integration.md).

Code in the following example creates a Git repository clone called `snowflake_extensions`. The clone specifies
the `my_git_api_integration` API integration and the `my_git_secret` secret with credentials for authenticating.

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT CREATE GIT REPOSITORY ON SCHEMA myco_db.integrations TO ROLE myco_git_admin;
GRANT USAGE ON INTEGRATION my_git_api_integration TO ROLE myco_git_admin;
GRANT USAGE ON SECRET db.schema.my_git_secret TO ROLE myco_git_admin;

USE ROLE myco_git_admin;

CREATE OR REPLACE GIT REPOSITORY snowflake_extensions
  API_INTEGRATION = my_git_api_integration
  GIT_CREDENTIALS = my_git_secret
  ORIGIN = 'https://github.com/my-account/snowflake-extensions.git';
```

> **Note:**
>
> For information on creating a Git workspace in Snowsight, see [Create a Git workspace](../../user-guide/ui-snowsight/workspaces-git.md).

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Database Explorer.
3. In the object explorer, select the database and schema that you want to contain the Git repository clone you’re creating.
4. Select Create » Git Repository.
5. In the Create Git Repository dialog, for Repository Name, enter a name that will uniquely identify this repository
   clone in the schema.

   For naming guidelines, see [Identifier requirements](../../sql-reference/identifiers-syntax.md).
6. For Origin, enter the remote repository’s origin URL.
7. From the API Integration drop-down menu, select the API integration to reference when creating the Git repository clone.

   If you don’t have an API integration to use, select Create new API integration in Worksheets to use SQL to create one.
   For more information, see [CREATE API INTEGRATION](../../sql-reference/sql/create-api-integration.md).
8. Optional: For the Comment, enter text describing this integration for others.
9. Optional: If the remote repository requires authentication, set the Authentication toggle to the _on_ position.

   * If you turned on the toggle, from the Secret menu, select the secret that should be referenced by the Git integration to
     authenticate with the remote repository.

     If you don’t have a secret to use, select Create new secret in Worksheets to use SQL to create one. For
     more information, see [CREATE SECRET](../../sql-reference/sql/create-secret.md).
10. Select Create.

    When you successfully create the integration, the Git repository clone appears beneath the schema, in a Git Repositories directory.
    You’ll also see a page that lists repository directories, branches, and tags.

---
title: Setting up your Java and Scala environment to use the Telemetry class
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/telemetry-build-maven.md
section: Developer Guide
---

# Setting up your Java and Scala environment to use the Telemetry class

You can build and package handler code that uses the `com.snowflake.telemetry.Telemetry` class, then reference the handler on a stage.
The Telemetry library is available through Maven and through an archive file that you can download from the
[Drivers and Libraries page](https://developers.snowflake.com/drivers-and-libraries/) in the Snowflake Developer site.

If you are using Maven to develop function or procedure handlers in Java or Scala, you can build a JAR file containing your code:

1. In the pom.xml file for your project, add a dependency on the `com.snowflake:telemetry` package:

   > ```xml
   > <dependency>
   >   <groupId>com.snowflake</groupId>
   >   <artifactId>telemetry</artifactId>
   >   <version>0.01</version>
   > </dependency>
   > ```
2. Exclude the `telemetry` package from the JAR file that you build because it is already included in Snowflake.

   1. In the directory for your project, create a subdirectory named `assembly/`.
   2. In that directory, create an assembly descriptor file that specifies that you want to include dependencies in your JAR file.

      For an example, see [jar-with-dependencies](https://maven.apache.org/plugins/maven-assembly-plugin/descriptor-refs.html#jar-with-dependencies).
   3. In the assembly descriptor, add a `<dependencySet>` element that excludes the Snowpark library from your JAR file. For example:

      ```xml
      <assembly xmlns="http://maven.apache.org/ASSEMBLY/2.1.0"
                xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
                xsi:schemaLocation="http://maven.apache.org/ASSEMBLY/2.1.0 http://maven.apache.org/xsd/assembly-2.1.0.xsd">
        <id>jar-with-dependencies</id>
        <formats>
            <format>jar</format>
        </formats>
        <includeBaseDirectory>false</includeBaseDirectory>
        <dependencySets>
          <dependencySet>
            <outputDirectory>/</outputDirectory>
            <useProjectArtifact>true</useProjectArtifact>
            <unpack>true</unpack>
            <scope>runtime</scope>
            <excludes>
              <exclude>com.snowflake:telemetry</exclude>
            </excludes>
          </dependencySet>
        </dependencySets>
      </assembly>
      ```

      For information about the elements in an assembly descriptor, see
      [Assembly Descriptor Format](https://maven.apache.org/plugins/maven-assembly-plugin/assembly.html).
3. In your pom.xml file, under the `<project>` » `<build>` » `<plugins>`, add a `<plugin>` element for the
   Maven Assembly Plugin.

   In addition, under `<configuration>` » `<descriptors>`, add a `<descriptor>` that points to the assembly
   descriptor file that you created in the previous steps.

   For example:

   ```xml
   <project>
     [...]
     <build>
       [...]
       <plugins>
         <plugin>
           <artifactId>maven-assembly-plugin</artifactId>
           <version>3.3.0</version>
           <configuration>
             <descriptors>
               <descriptor>src/assembly/jar-with-dependencies.xml</descriptor>
             </descriptors>
           </configuration>
           [...]
         </plugin>
         [...]
       </plugins>
       [...]
     </build>
     [...]
   </project>
   ```

---
title: Snowflake Connector for Python
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector.md
section: Developer Guide
---

# Snowflake Connector for Python

> **Note:**
>
> This driver currently does not support GCP regional endpoints. Please ensure that any workloads using through this driver do not require support for regional endpoints on GCP. If you have questions about this, please contact Snowflake Support.

The Snowflake Connector for Python provides an interface for developing Python applications that can connect to Snowflake and perform all standard operations. It provides a programming alternative to
developing applications in Java or C/C++ using the Snowflake JDBC or ODBC drivers.

The connector is a native, pure Python package that has no dependencies on JDBC or ODBC. It can be installed using `pip` on
Linux, MacOS, and Windows platforms where Python is installed.

The connector supports developing applications using the Python Database API v2 specification (PEP-249), including using the following standard API objects:

* `Connection` objects for connecting to Snowflake.
* `Cursor` objects for executing DDL/DML statements and queries.

For more information, see [PEP-249](https://www.python.org/dev/peps/pep-0249/).

[SnowSQL](../../user-guide/snowsql.md), the command-line client provided by Snowflake, is an example of an application developed using the connector.

> **Note:**
>
> Snowflake now provides first-class Python APIs for managing core Snowflake resources including databases, schemas, tables, tasks, and
> warehouses, without using SQL. For more information, see [Snowflake Python APIs: Managing Snowflake objects with Python](../snowflake-python-api/snowflake-python-overview.md).

**Next Topics:**

* [Installing the Python Connector](python-connector-install.md)
* [Using the Python Connector](python-connector-example.md)
* [Using pandas DataFrames with the Python Connector](python-connector-pandas.md)
* [Distributing workloads that fetch results with the Snowflake Connector for Python](python-connector-distributed-fetch.md)
* [Using the Snowflake SQLAlchemy toolkit with the Python Connector](sqlalchemy.md)
* [Python Connector API](python-connector-api.md)
* [Dependency management policy for the Python Connector](python-connector-dependencies.md)

---
title: Snowflake Java Runtime Support
source: https://docs.snowflake.com/en/developer-guide/java-runtime-support-policy.md
section: Developer Guide
---

# Snowflake Java Runtime Support

Going forward, Snowflake intends to support new LTS Java runtimes within 1 year of their
[first official release](https://adoptium.net/support/).

## Deprecating and decommissioning runtimes (end of support)

To keep your functions up-to-date and secure, we occasionally need you to update your UDFs and stored procedures and re-deploy them to
use a supported runtime.

Snowflake applies updates to Java runtimes as the updates are made available by the upstream maintainers. When a Java runtime is no longer
actively maintained, Snowflake will deprecate and, eventually, remove the runtime. The Snowflake deprecation schedule will follow the
End-of-Availability schedule of [Adoptium Temurin](https://adoptium.net/support/) ™.

This process involves three aspects: a publication of the deprecation date, a deprecation period, and a target decommission date.
The deprecation date posted below indicates the start of the deprecation period and the expected decommission date.

| Java Runtime | Snowflake Deprecation Date | Decommission Date |
| --- | --- | --- |
| 11 | Oct 2027 | Jan 2028 |
| 17 | Oct 2027 | TBD |

During the deprecation period, Snowflake will no longer apply security patches or other updates to the runtime. You can continue to use
the runtime but you should mainly aim to use this time to migrate any functions that still use the deprecated runtime to a more up-to-date
runtime. Note that functions that use a deprecated runtime are not eligible for technical support.

After the decommission date, you can no longer create, update or invoke functions using the runtime. You must choose a more up-to-date
runtime to deploy your functions.

---
title: Snowflake Python APIs: General concepts
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-general-concepts.md
section: Developer Guide
---

# Snowflake Python APIs: General concepts

The programming model of the Snowflake Python APIs is *resource-based*, which means that the APIs consist of a set of objects that represent their
respective object counterparts in Snowflake. Some basic examples of Snowflake resource object types include the following:

* Databases
* Schemas
* Tables
* Views
* Alerts
* Pipes
* Stages
* Users
* Warehouses

For each supported resource, the Python API provides three distinct classes that you can use to create and manage objects:

* Collection class
* Model class
* Resource class

## Entry point: The `Root` object

The `Root` object is the entry point for the Python API. To create an instance of `Root` that is configured with the Snowflake
context in which it will run, you use a Python Connector `Connection` object or a Snowpark `Session` object.

For example, the following code instantiates a `Root` object with a `Connection` object named `my_connection`:

```python
from snowflake.core import Root

root = Root(my_connection)
```

You can also instantiate a `Root` object with a `Session` object. In a notebook environment or stored procedure, you retrieve
the session by using `get_active_session()`, as follows:

```python
from snowflake.core import Root
from snowflake.snowpark.context import get_active_session

session = get_active_session()
root = Root(session)
```

### Account, database, and schema scopes

With a `Root` object, you can access the collections of *account-scoped* objects, such as warehouses (`root.warehouses`),
databases (`root.databases`), and external volumes (`root.external_volumes`).

You can access *database-scoped* objects under a `DatabaseResource`, which in turn you can retrieve through the
`DatabaseCollection` object under `Root`. Currently, `SchemaCollection` is the only object type available under the
database scope.

You can access *schema-scoped* objects, such as tables, views, streams, and stages, through the `SchemaResource` object.

For example, the following code accesses a `StageCollection` first, and then a `StageResource`:

```python
root = Root(my_connection)
stages = root.databases["my_db"].schemas["my_schema"].stages
my_stage = stages["my_stage"] # Access the "my_stage" StageResource
```

### `snowflake.core` class diagram

The following diagram shows some basic classes in the `snowflake.core` package and how they relate to each other, starting with the
`Root` object:

## Collection class

The `Collection` classes correspond to classes that are named `<SnowflakeObjectType>Collection`.

A `Collection` class represents the set of a particular object type visible within the given context. For schema-scoped objects
(like tables, views, functions, and streams), the collection consists of all objects of that type within the given schema that are visible
to the current role or user.

`SchemaCollection` objects are scoped to a database. Account-scoped objects like `DatabaseCollection` and `WarehouseCollection` are accessible
directly from the `Root` instance.

In general, collections enable you to do the following operations:

* Create an object in the schema, database, or account (depending the scope and context, as described previously).
* Iterate through the set of objects visible in that scope.

For example, the following code creates a new warehouse using a `WarehouseCollection` object:

```python
# my_wh is created from scratch
my_wh = Warehouse(name="my_wh", warehouse_size="X-Small")
root.warehouses.create(my_wh)
```

### Retrieving a `Resource` object from a collection

Additionally, collections provide an entry point to retrieve specific `Resource` objects in the underlying Snowflake database to
which the API is connected. You use the square bracket index operator (`[ ]`) on a collection to “point” to, or get a reference to, a
Snowflake object within that collection.

For example, the following code retrieves a reference to an existing warehouse named `my_wh` in your Snowflake account:

```python
# my_wh_ref is retrieved from an existing warehouse
# This returns a WarehouseResource object, which is a reference to a warehouse named "my_wh" in your Snowflake account
my_wh_ref = root.warehouses["my_wh"]
```

## Model class

The model classes simply have the same names as their equivalent resources in Snowflake, such as `Warehouse` for warehouses and
`Table` for tables.

A model class represents a Snowflake object along with its associated properties, such as its name, the database and schema to which it
belongs (if applicable), and attributes specific to that object type. For example, a Warehouse model indicates the `warehouse_size`,
`type`, and `auto_resume` properties for that particular warehouse object.

Model objects contain a *property bag* (a collection of properties and their values) that describes the object. You can use
these properties to either describe an existing object in Snowflake, or to provide the specification of that resource to alter an existing
object with.

### Fetching a model object from a `Resource`

To return the property bag of an object as it currently exists in your Snowflake database, you run a `fetch()` operation on the
`Resource` object.

For example, the following code demonstrates some operations you can perform using a model object:

```python
# my_wh is fetched from an existing warehouse
my_wh = root.warehouses["my_wh"].fetch()
print(my_wh.name, my_wh.auto_resume)
```

```python
# my_wh is fetched from an existing warehouse
my_wh = root.warehouses["my_wh"].fetch()
my_wh.warehouse_size = "X-Small"
root.warehouses["my_wh"].create_or_alter(my_wh)
```

> **Note:**
>
> This fetch operation fails if the `my_wh` object does not exist in Snowflake.

## Resource class

The `Resource` classes correspond to classes that are named `<SnowflakeObjectType>Resource`.

You can consider a `Resource` object as a pointer or reference to an underlying Snowflake object. Whereas the model class is a simple
property bag representing the properties or specification of an object, the `Resource` class is a reference to the actual object in
your Snowflake database.

To get a `Resource` object, you typically refer to it by name from its corresponding `Collection` and use the square bracket
index operator (`[ ]`). The following code example retrieves an existing warehouse named `my_wh` from the warehouse collection:

```python
# my_wh_ref is retrieved from an existing warehouse
# This returns a WarehouseResource object, which is a reference to a warehouse named "my_wh" in your Snowflake account
my_wh_ref = root.warehouses["my_wh"]

# Fetch returns the properties of the object (returns a "Model" Warehouse object that represents that warehouse's properties)
wh_properties = my_wh_ref.fetch()
```

To convert a `Resource` object to its corresponding model, perform a `fetch()` on the resource, which retrieves the properties
of the corresponding object in Snowflake. Note that this fetch operation fails if the object does not actually exist in Snowflake.

### Performing type-specific operations on a `Resource` object

The `Resource` class also implements the object type’s specialized API operations. For example, you use a `WarehouseResource`
object to resume a warehouse, or a `StageResource` object to list the files on a stage.

The following code examples show how to perform these type-specific operations using their respective `Resource` objects:

```python
# my_wh_ref is retrieved from an existing warehouse
my_wh_ref = root.warehouses["my_wh"]

# Resume a warehouse using a WarehouseResource object
my_wh_ref.resume()
```

```python
# my_stage is retrieved from an existing stage
stage_ref = root.databases["my_db"].schemas["my_schema"].stages["my_stage"]

# Print file names and their sizes on a stage using a StageResource object
for file in stage_ref.list_files():
  print(file.name, file.size)
```

### Using the `create_or_alter` API

`Resource` objects also expose the `create_or_alter` API method if it’s supported by the resource. This method enables you to,
as the name suggests, create or alter Snowflake objects.

> **Note:**
>
> The Python API uses this create-or-alter (COA) mechanism for modifying objects in Snowflake. The purpose of this mechanism is to ensure
> that the result of a COA operation is the same regardless of whether that particular object already exists in your Snowflake database.
>
> In other words, if the object does not exist, the COA operation creates one with the provided specification; if it does
> already exist, the operation alters the existing object to match the requested specification. This logic enables you to create or
> alter resources by using a single piece of code in an idempotent and atomic manner.

## Consistent design pattern to manage resources

The Snowflake Python APIs have a consistent design pattern that you use to manage resources in Snowflake. Consider an example scenario where you need
to alter an existing warehouse object in your account. The following steps outline how you typically work with the API’s design pattern by
using all three class types, as described previously.

### 1. Get a `WarehouseCollection` from `Root`

Warehouses are account-scoped objects that you can directly access from `Root`:

```python
my_warehouses = root.warehouses # my_warehouses is a WarehouseCollection
```

### 2. Get a `WarehouseResource` object from `WarehouseCollection`

To retrieve a `Resource` object, you typically start with its collection. `Collection` objects provide an entry point for you to
retrieve specific resources in the underlying Snowflake database by using the square bracket index operator (`[ ]`):

```python
my_wh_ref = my_warehouses.warehouses["my_wh"] # my_wh_ref is a WarehouseResource
```

### 3. Fetch the `Warehouse` model from `WarehouseResource`

Using the `WarehouseResource` object, you fetch the corresponding `Warehouse` model and its properties from Snowflake:

```python
my_wh = my_wh_ref.fetch() # my_wh is a Warehouse model object
```

### 4. Modify a property in the `Warehouse` model

Modify a property, such as the `warehouse_size`, in your warehouse model:

```python
my_wh.warehouse_size = "X-Small"
```

### 5. Alter the existing warehouse object in Snowflake

Finally, using your modified warehouse model specification, you alter the existing warehouse object in Snowflake (or create the warehouse
object if it doesn’t exist):

```python
my_wh_ref.create_or_alter(my_wh) # Use the WarehouseResource to perform create_or_alter
```

Using this `my_wh_ref` reference, you can also perform other operations on the object in Snowflake, such as dropping it, if necessary.

### Full code example

The following code example shows the create-or-alter warehouse operation in full from start to end:

```python
# my_wh is fetched from an existing warehouse
my_warehouses = root.warehouses # my_warehouses is a WarehouseCollection
my_wh_ref = my_warehouses.warehouses["my_wh"] # my_wh_ref is a WarehouseResource
my_wh = my_wh_ref.fetch() # my_wh is a Warehouse model object
my_wh.warehouse_size = "X-Small"

my_wh_ref.create_or_alter(my_wh) # Use the WarehouseResource perform create_or_alter
```

---
title: Snowflake Python APIs: Managing Snowflake objects with Python
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-overview.md
section: Developer Guide
---

# Snowflake Python APIs: Managing Snowflake objects with Python

The Snowflake Python APIs package is a unified library that seamlessly connects Python with Snowflake workloads. It is intended to provide
comprehensive APIs for interacting with Snowflake resources across data engineering, Snowpark, Snowpark ML, and application workloads using
a first-class Python API.

You can use the Snowflake Python APIs to manage Snowflake resources by creating, dropping, or altering them, and more. You can use
Python to perform tasks you might otherwise perform with Snowflake [SQL commands](../../sql-reference-commands.md).

To learn more about the API, including its general concepts and design patterns, see [Snowflake Python APIs: General concepts](snowflake-python-general-concepts.md).

## Supported Snowflake resource objects

> **Note:**
>
> The [API reference documentation](https://docs.snowflake.com/developer-guide/snowflake-python-api/reference/latest/index) reflects the
> latest version of the Snowflake Python APIs. Note that not all resources in the API currently provide 100% coverage of their
> equivalent [SQL commands](../../sql-reference-commands.md), but the Python APIs are under active development and are continuously expanding.

With the Snowflake Python APIs, you can currently manage the following Snowflake resource objects:

* [Accounts](snowflake-python-managing-accounts.md)

  + [Account](snowflake-python-managing-accounts.md)
  + [Managed account](snowflake-python-managing-accounts.md)
* [Users, roles, and privileges](snowflake-python-managing-user-roles.md)

  + [User](snowflake-python-managing-user-roles.md)
  + [Role](snowflake-python-managing-user-roles.md)
  + [Database role](snowflake-python-managing-user-roles.md)
  + [Access privileges](snowflake-python-managing-user-roles.md)
* [Integrations](snowflake-python-managing-integrations.md)

  + [API integration](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.api_integration)
  + [Catalog integration](snowflake-python-managing-integrations.md)
  + [Notification integration](snowflake-python-managing-integrations.md)
* [Virtual warehouse](snowflake-python-managing-warehouses.md)
* [Databases, schemas, tables, and views](snowflake-python-managing-databases.md)

  + [Database](snowflake-python-managing-databases.md)
  + [Schema](snowflake-python-managing-databases.md)
  + [Standard table](snowflake-python-managing-databases.md)
  + [Dynamic table](snowflake-python-managing-dynamic-tables.md)
  + [Event table](snowflake-python-managing-databases.md)
  + [Iceberg table](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.iceberg_table)
  + [View](snowflake-python-managing-databases.md)
  + [Sequence](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.sequence)
* [Functions and procedures](snowflake-python-managing-functions-procedures.md)

  + [Stored procedure](snowflake-python-managing-functions-procedures.md)
  + [User-defined function (UDF)](snowflake-python-managing-functions-procedures.md)
  + [Artifact repository](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.artifact_repository)
* Data pipeline

  + [Stream](snowflake-python-managing-streams.md)
  + [Task](snowflake-python-managing-tasks.md)
* AI and ML (not available in government regions)

  + [Cortex Chat service](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.cortex.chat_service)
  + [Cortex Embed service](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.cortex.embed_service)
  + [Cortex Inference service](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.cortex.inference_service)
  + [Cortex Lite Agent service](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.cortex.lite_agent_service)
  + [Cortex Search service](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.cortex.search_service)
* Security

  + [Network policy](snowflake-python-managing-network-policies.md)
  + [Network rule](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.network_rule)
  + [Password policy](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.password_policy)
  + [Secret](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.secret)
* Data governance

  + [Tag](snowflake-python-managing-tags.md)
* [Data loading and unloading](snowflake-python-managing-data-loading.md)

  + [External volume](snowflake-python-managing-data-loading.md)
  + [Pipe](snowflake-python-managing-data-loading.md)
  + [Stage](snowflake-python-managing-data-loading.md)
* [Alert](snowflake-python-managing-alerts.md)
* [Notebook](snowflake-python-managing-notebooks.md)
* [Snowpark Container Services components](snowflake-python-managing-containers.md)
  (not available in government regions)

  + [Compute pool](snowflake-python-managing-containers.md)
  + [Image repository](snowflake-python-managing-containers.md)
  + [Service and service function](snowflake-python-managing-containers.md)
* Streamlit

  + [Streamlit object](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.streamlit)

## Python ecosystem in Snowflake

The Snowflake Python APIs, the [Snowpark API for Python](../snowpark/python/index.md), and the
[Snowflake Connector for Python](../python-connector/python-connector.md) are interfaces that each have distinct purposes
in Snowflake. This section explains their differences and describes the typical use cases for each.

Snowflake Python APIs
:   You can use this set of first-class Python APIs to define and manage core resources (such as tables, warehouses, and tasks) across
    Snowflake workloads. Unlike the Python Connector, these APIs interact with Snowflake using native Python without the need to use SQL.

    The Snowflake Python APIs package unifies all Snowflake Python libraries (including `connector`, `core`, `snowpark`, and
    `ml`) so that you can simply start with the command `pip install snowflake`.

    Following the declarative programming approach, this API can be used as a DevOps tool to manage changes to your resources and automate
    code and infrastructure deployment in Snowflake.

Snowpark
:   This set of libraries and code execution environments can run Python and other programming languages next to your data in Snowflake.

    * Libraries: With the [Snowpark API](../snowpark/index.md), you can use Snowpark DataFrames in your code to query and transform data
      entirely within Snowflake. Snowpark applications process your data at scale directly on the Snowflake engine without moving the data to
      the system where your application code runs.

      The Snowpark API is available in Python, Java, and Scala.
    * Code execution environments: Snowpark runtime environments support container images and Python, Java, and Scala code.

      + You can execute custom Python code through Python user-defined functions (UDFs) or stored procedures for building data pipelines,
        apps, and more. Python runtime environments have access to a package repository and package manager from Anaconda.

        Runtime environments are also available in Scala and Java.
      + You can run containerized applications directly within Snowflake using
        [Snowpark Container Services](../snowpark-container-services/overview.md).

Snowflake Connector for Python
:   Use this SQL driver to connect to Snowflake, execute SQL statements, and then get the results using a Python client.

    With the Python Connector, you write all of your interactions with Snowflake using SQL statement strings.

## Get started with the Snowflake Python APIs

To get started with the Snowflake Python APIs, see the instructions in the following topics:

1. [Install the library](snowflake-python-installing.md).
2. [Connect to Snowflake](snowflake-python-connecting-snowflake.md).

For tutorials on getting started with the Snowflake Python APIs, see [Tutorials: Getting started with the Snowflake Python APIs](overview-tutorials.md).

## Supported Python versions

The supported versions of Python are:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

## Developer guides

| Guide | Description |
| --- | --- |
| [Install the Snowflake Python APIs library](snowflake-python-installing.md) | Install the Snowflake Python APIs package. |
| [Connect to Snowflake with the Snowflake Python APIs](snowflake-python-connecting-snowflake.md) | Connect to Snowflake from Python code. |
| [Managing Snowflake accounts and managed accounts with Python](snowflake-python-managing-accounts.md) | Use the API to create and manage accounts and managed accounts. |
| [Managing Snowflake alerts with Python](snowflake-python-managing-alerts.md) | Use the API to create and manage alerts. |
| [Managing data loading and unloading resources with Python](snowflake-python-managing-data-loading.md) | Use the API to create and manage data loading and unloading resources, including external volumes, pipes, and stages. |
| [Managing Snowflake databases, schemas, tables, and views with Python](snowflake-python-managing-databases.md) | Use the API to create and manage databases, schemas, and tables. |
| [Managing Snowflake dynamic tables with Python](snowflake-python-managing-dynamic-tables.md) | Use the API to create and manage dynamic tables. |
| [Managing Snowflake functions and stored procedures with Python](snowflake-python-managing-functions-procedures.md) | Use the API to create and manage user-defined functions (UDFs) and stored procedures. |
| [Managing Snowflake integrations with Python](snowflake-python-managing-integrations.md) | Use the API to create and manage catalog integrations and notification integrations. |
| [Managing Snowflake network policies with Python](snowflake-python-managing-network-policies.md) | Use the API to create and manage network policies. |
| [Managing Snowflake Notebooks with Python](snowflake-python-managing-notebooks.md) | Use the API to create and manage Snowflake Notebooks. |
| [Managing Snowpark Container Services (including service functions) with Python](snowflake-python-managing-containers.md) | Use the API to manage components of Snowpark Container Services, including compute pools, image repositories, services, and service functions. |
| [Managing Snowflake streams with Python](snowflake-python-managing-streams.md) | Use the API to create and manage streams. |
| [Managing Snowflake tasks and task graphs with Python](snowflake-python-managing-tasks.md) | Use the API to create, execute, and manage tasks and task graphs. |
| [Managing Snowflake users, roles, and grants with Python](snowflake-python-managing-user-roles.md) | Use the API to create and manage users, roles, and grants. |
| [Managing Snowflake virtual warehouses with Python](snowflake-python-managing-warehouses.md) | Use the API to create and manage virtual warehouses. |

## References

[Snowflake Python APIs Reference](https://docs.snowflake.com/developer-guide/snowflake-python-api/reference/latest/index)

## Costs of Snowflake access

To reduce costs—–for both usage credit and network activity—–the Snowflake Python APIs are designed to communicate with Snowflake
only when you call methods designed to synchronize with Snowflake.

Objects in the API are either local references (or *handles*) or snapshots of state stored on Snowflake. In general, when you process
information that was retrieved from Snowflake, you do so through a local, in-memory reference object.

These references do not synchronize with Snowflake until you call a method. When you call a method, you are usually incurring costs in
both usage credit and network activity. In contrast, when you work with in-memory references, such as when accessing attributes, your work
is performed locally and incurs no such costs.

---
title: Snowflake Python Demos API
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/snowflake-python-demos.md
section: Developer Guide
---

# Snowflake Python Demos API

The Snowflake Python Demos library (`snowflake.demos`) helps you rapidly scaffold demos for
[Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md) by automating environment setup tasks — such as configuring the database,
schema, role, permissions, and dataset access — to streamline getting started with Snowflake Notebooks.

With this library, you can perform these tasks:

* Load and set up Snowflake Notebooks demos in your Snowflake environment.
* Explore interactive notebooks step by step to get hands-on experience.
* Tear down resources easily when you’re done.

## Prerequisites

Before you get started with the Snowflake Demos API, complete the following steps:

1. Verify that you have installed one of the supported Python versions:

   * 3.9
   * 3.10
   * 3.11
   * 3.12
2. Install the Snowflake Demos library.
3. Set up a default Snowflake connection.
4. Import snowflake.demos.

### Install the Snowflake Demos library

You can install the Snowflake Demos library for use with conda or a virtual environment. To set up the library, follow these steps:

1. [Activate a Python environment](snowflake-python-installing.md).
2. To install the library, run the following `pip install` command:

   ```bash
   pip install snowflake.demos
   ```

### Set up a default Snowflake connection

The Snowflake Demos API uses the default connection for the [Snowflake Python Connector](../python-connector/python-connector.md).
To configure this connection, follow the instructions in [Setting a default connection](../python-connector/python-connector-connect.md).

For example, to specify a named connection as the default connection in your Snowflake `config.toml` file, you add your default
connection name to the `config.toml` file as follows:

```toml
default_connection_name = '<connection_name>'
```

For information about specifying connection definitions in a TOML configuration file, see [Connecting using the connections.toml file](../python-connector/python-connector-connect.md).

### Import `snowflake.demos`

To use the library in your terminal, you can open an interactive shell such as the standard Python REPL.

1. Run the following command (which might vary depending on your Python environment):

   ```bash
   python3
   ```
2. In the REPL session, to import the library and the relevant functions, run the following code:

   ```python
   from snowflake.demos import help, load_demo, teardown
   ```

## Listing available demos

After importing the library, you can use the `help()` function to see the list of available demos that you can load and start
exploring. This function returns a table with the following columns:

* `demo_name`: A dash-delimited string that represents the demo name.
* `title`: The human-readable demo name title.
* `num_steps`: The number of steps in the demo.

### Current list of available demos

> **Note:**
>
> The following content is not supported by Snowflake. All code is provided “AS IS” and without warranty.

The Snowflake Demos API currently works with the following list of demos:

| demo_name | title | num_steps |
| --- | --- | --- |
| `analysis-churn-notebooks` | Data analysis and churn prediction using Snowflake Notebooks | 2 |
| `analytics-cortex` | Customer reviews analytics using Snowflake Cortex | 1 |
| `anthropic-cortex` | Getting started with Anthropic on Snowflake Cortex | 1 |
| `external-access-nb` | Access external endpoints | 1 |
| `get-started-partitioned-models` | Getting started with partitioned models and Snowflake Model Registry | 1 |
| `get-started-snowapi-nb` | Creating Snowflake objects using Python API | 1 |
| `get-started-snowpark-ws-nb` | Getting started with Snowpark in Snowflake Notebooks and Python Worksheets | 1 |
| `get-started-snowflake-ml` | Getting started with Snowflake ML | 4 |
| `ingest-json-data` | Ingest public JSON | 1 |
| `intro-snowpark-pandas` | Introduction to Snowpark pandas | 1 |
| `intro-to-feature-store-nb` | Introduction to Feature Store using Snowflake Notebooks | 1 |
| `intro-to-snowflake-nb` | My first Notebook project | 1 |
| `load-csv-to-stage` | Load CSV from S3 | 1 |
| `ref-cells-and-vars` | Reference cells and variables | 1 |
| `visual-data-stories` | Visual data stories with Snowflake Notebooks | 1 |
| `working-with-files` | Working with files | 1 |

## Working with demos

After completing the prerequisites, you can start using the Snowflake Demos API to work
with demos as described in the following sections.

### Load and explore a demo

* To load a specific demo and set up its associated resources in Snowflake, call `load_demo()` with an argument that specifies the
  `demo_name` of any available demo, as found in the `help()` output.

  For example:

  ```python
  load_demo('get-started-snowflake-ml')
  ```

> **Tip:**
>
> * To store a reference to the demo as an object, assign the result of `load_demo()` to a variable:
>
>   ```python
>   demo = load_demo('get-started-snowflake-ml')
>   ```
>
> Assigning the result to a variable is required if you’re working with a multi-step demo (`num_steps` > 1). You will need this
> reference to call `show_next()` or `show(step=<number>)` to move to the next notebook in the demo.
>
> You can also use this reference to quickly tear down the demo later.

This function does the following:

* Creates a connection to Snowflake if it’s the first time you’re loading a demo.
* Creates the necessary notebooks.
* Displays the notebook URL for the first step of the demo (step 1), if you are not assigning `load_demo()` to a variable.

  + If you assign `load_demo()` to a variable, you need to call `demo.show()` to get the first notebook URL.

The output should look similar to the following:

```output
Connecting to Snowflake...✅
Using ACCOUNTADMIN role...✅
Creating Database SNOWFLAKE_DEMO_DB...✅
Creating Schema SNOWFLAKE_DEMO_SCHEMA...✅
Creating Warehouse SNOWFLAKE_DEMO_WH...✅
Creating Stage SNOWFLAKE_DEMO_STAGE...✅
Uploading files to stage SNOWFLAKE_DEMO_STAGE/get-started-snowflake-ml and creating notebooks...
Creating notebook get_started_snowflake_ml_start_here...✅
Creating notebook get_started_snowflake_ml_sf_nb_snowflake_ml_feature_transformations...✅
Creating notebook get_started_snowflake_ml_sf_nb_snowflake_ml_model_training_inference...✅
Creating notebook get_started_snowflake_ml_sf_nb_snowpark_ml_adv_mlops...✅
Running setup for this demo...✅
```

> **Note:**
>
> A known issue exists with the printed notebook URLs. If the URL doesn’t open directly, you can copy and paste it into a new browser tab
> or access the notebook manually in Snowsight under the Notebooks tab.

### View the demo URL

You can use the `show()` function to view the URL to a specific step in the demo.

* To view the URL for the current step, first assign the result of `load_demo()` to a variable, such as `demo`, and then call
  `show()` with no arguments:

  ```python
  demo.show()
  ```

  The output should look similar to this:

  ```output
  Showing step 1.
  Please copy and paste this url in your web browser to open the notebook:
  https://app.snowflake.com/myorg/myaccount/#/notebooks/SNOWFLAKE_DEMO_DB.SNOWFLAKE_DEMO_SCHEMA.GET_STARTED_SNOWFLAKE_ML_START_HERE
  ```
* To get the notebook URL for a specific step in the demo, pass the `step` argument with a specified step number to `show()`:

  ```python
  demo.show(step=1)
  ```
* To get the notebook URL for the next step in a multi-step demo, use the `show_next()` function:

  ```python
  demo.show_next()
  ```

### Delete a demo and its resources

When you’re done exploring the demos that you set up, you might want to clean up all created resources, datasets, and notebooks that were
created.

* To delete a single demo and its associated resources, first assign the result of `load_demo()` to a variable such as `demo`, and
  then call `teardown()` on it:

  ```python
  demo.teardown()
  ```
* To delete all demos and any associated resources that have been set up, call `teardown()` as a top-level function:

  ```python
  teardown()
  ```

---
title: Snowflake Python Runtime Support
source: https://docs.snowflake.com/en/developer-guide/python-runtime-support-policy.md
section: Developer Guide
---

# Snowflake Python Runtime Support

Going forward, Snowflake intends to support new Python runtimes within 1 year of their
[first official release](https://devguide.python.org/versions/).

## Deprecating and decommissioning runtimes (end of support)

To keep your functions up-to-date and secure, we occasionally need you to update your UDFs and stored
procedures and re-deploy them to use a supported runtime.

Snowflake applies updates to Python runtimes as the updates are made available by the upstream maintainers.
When a Python runtime is no longer actively maintained, Snowflake will deprecate and, eventually, remove
the runtime. The Snowflake deprecation schedule will follow
[Python’s official end-of-life schedule](https://devguide.python.org/versions/).

This process involves three aspects: a publication of the deprecation date, a deprecation period,
and a target decommission date. The deprecation date posted below indicates the start of the deprecation
period and the expected decommission date.

| Python Runtime | Snowflake Deprecation Date | Decommission Date |
| --- | --- | --- |
| 3.8 | 14 Oct 2024 | 30 Apr 2025 |
| 3.9 | 05 Oct 2025 | 30 Apr 2026 |
| 3.10 | 04 Oct 2026 | TBD |
| 3.11 | 24 Oct 2027 | TBD |
| 3.12 | 02 Oct 2028 | TBD |
| 3.13 | 07 Oct 2029 | TBD |

During the deprecation period, Snowflake will no longer apply security patches or other updates
to the runtime. You can continue to use the runtime but you should mainly aim to use this time
to migrate any functions that still use the deprecated runtime to a more up-to-date runtime.
Note that functions that use a deprecated runtime are not eligible for technical support.

After the decommission date, you can no longer create or update functions using the runtime.
You must choose a more up-to-date runtime to deploy your functions.
Note that existing functions using the runtime can still be invoked.

---
title: Snowflake Scripting Developer Guide
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/index.md
section: Developer Guide
---

# Snowflake Scripting Developer Guide

Snowflake Scripting is an extension to Snowflake SQL that adds support for procedural logic. You can use Snowflake
Scripting syntax in [stored procedures](../stored-procedure/stored-procedures-overview.md) and
[user-defined functions (UDFs)](../udf/sql/udf-sql-procedural-functions.md). You can also use Snowflake
Scripting syntax outside of stored procedures and UDFs and stored procedures. The next topics explain how to use
Snowflake Scripting.

[Understanding blocks in Snowflake Scripting](blocks.md)
:   Learn the basic structure of Snowflake Scripting code.

[Working with variables](variables.md)
:   Declare and use variables.

[Returning a value](return.md)
:   Return values from stored procedures and an anonymous block.

[Working with conditional logic](branch.md)
:   Control flow with IF and CASE statements.

[Working with loops](loops.md)
:   Control flow with FOR, WHILE, REPEAT, and LOOP.

[Working with cursors](cursors.md)
:   Iterate through query results with a cursor.

[Working with RESULTSETs](resultsets.md)
:   Iterate over the result set returned by a query.

[Handling exceptions](exceptions.md)
:   Handle errors by handling and raising exceptions.

[Determining the number of rows affected by SQL statements](dml-status.md)
:   Use global variables to determine the effect of data manipulation language (DML) commands.

[Getting the query ID of the last query](query-id.md)
:   Use the global variable SQLID to get the query ID of the last query.

[Examples for common use cases of Snowflake Scripting](use-cases.md)
:   Explore examples of Snowflake Scripting code for some common use cases.

[Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)
:   Run the Snowflake Scripting examples in SnowSQL, Snowsight and Python Connector code.

---
title: Snowflake Scripting UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-procedural-functions.md
section: Developer Guide
---

# Snowflake Scripting UDFs

Snowflake supports SQL user-defined functions (UDFs) that contain Snowflake Scripting procedural language.
These UDFs are called *Snowflake Scripting UDFs*.

Snowflake Scripting UDFs can be called in a SQL statement, such as a SELECT statement or INSERT statement.
Therefore, they are more flexible than a Snowflake Scripting stored procedure, which can only be called in
a SQL [CALL](../../../sql-reference/sql/call.md) command.

## General usage

A Snowflake Scripting UDF evaluates procedural code and returns a scalar (that is, single) value.

You can use the following subset of [Snowflake Scripting](../../snowflake-scripting/index.md)
syntax in Snowflake Scripting UDFs:

* [Blocks](../../snowflake-scripting/blocks.md)
* [Variables](../../snowflake-scripting/variables.md)
* [RETURN command](../../snowflake-scripting/return.md)
* [Conditional logic](../../snowflake-scripting/branch.md)
* [Loops](../../snowflake-scripting/loops.md)
* [Exceptions](../../snowflake-scripting/exceptions.md)

## Supported data types

Snowflake Scripting UDFs support the following data types for both input arguments and
return values:

* [Numeric data types](../../../sql-reference/data-types-numeric.md) (for example, INTEGER, NUMBER, and FLOAT)
* [String & binary data types](../../../sql-reference/data-types-text.md) (for example, VARCHAR and BINARY)
* [Date & time data types](../../../sql-reference/data-types-datetime.md) (for example, DATE, TIME, and TIMESTAMP)
* [Logical data types](../../../sql-reference/data-types-logical.md) (for example, BOOLEAN)

Snowflake Scripting UDFs support the following data types for input arguments only:

* [Semi-structured data types](../../../sql-reference/data-types-semistructured.md) (for example, VARIANT, OBJECT, and ARRAY)
* [Structured data types](../../../sql-reference/data-types-structured.md) (for example, ARRAY, OBJECT, and MAP)

## Limitations

The following limitations apply to Snowflake Scripting UDFs:

* The following types of Snowflake Scripting syntax aren’t supported in Snowflake Scripting UDFs:

  + [Cursors](../../snowflake-scripting/cursors.md)
  + [RESULTSETs](../../snowflake-scripting/resultsets.md)
  + [Asynchronous child jobs](../../snowflake-scripting/asynchronous-child-jobs.md)
* SQL statements aren’t supported in Snowflake Scripting UDFs (including SELECT, INSERT, UPDATE, and so on).
* Snowflake Scripting UDFs can’t be defined as table functions.
* The following expression types aren’t supported in Snowflake Scripting UDFs:

  + User-defined functions
  + Aggregation functions
  + Window functions
* Snowflake Scripting UDFs can’t be used when creating a materialized view.
* Snowflake Scripting UDFs can’t be used when creating row access policies and masking policies.
* Snowflake Scripting UDFs can’t be used to specify a default column value.
* Snowflake Scripting UDFs can’t be used in a COPY INTO command for data loading and unloading.
* Snowflake Scripting UDFs can’t be memoizable.
* Snowflake Scripting UDFs have a limit of 500 input arguments.
* You can’t [log messages](../../logging-tracing/logging.md) for Snowflake Scripting UDFs.

## Examples

The following examples create and call Snowflake Scripting UDFs:

* Create a Snowflake Scripting UDF with variables
* Create a Snowflake Scripting UDF with conditional logic
* Create a Snowflake Scripting UDF with a loop
* Create a Snowflake Scripting UDF with exception handling
* Create a Snowflake Scripting UDF that returns a value for an INSERT statement
* Create a Snowflake Scripting UDF called in WHERE and ORDER BY clauses

### Create a Snowflake Scripting UDF with variables

Create a Snowflake Scripting UDF that calculates profit based on the values of two arguments:

```sqlexample
CREATE OR REPLACE FUNCTION calculate_profit(
  cost NUMBER(38, 2),
  revenue NUMBER(38, 2))
RETURNS number(38, 2)
LANGUAGE SQL
AS
DECLARE
  profit NUMBER(38, 2) DEFAULT 0.0;
BEGIN
  profit := revenue - cost;
  RETURN profit;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `calculate_profit` in a query:

```sqlexample
SELECT calculate_profit(100, 110);
```

```output
+----------------------------+
| CALCULATE_PROFIT(100, 110) |
|----------------------------|
|                      10.00 |
+----------------------------+
```

You can use the same Snowflake Scripting UDF and specify columns for the arguments. First,
create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE snowflake_scripting_udf_profit(
  cost NUMBER(38, 2),
  revenue NUMBER(38, 2));

INSERT INTO snowflake_scripting_udf_profit VALUES
  (100, 200),
  (200, 190),
  (300, 500),
  (400, 401);
```

Call `calculate_profit` in a query and specify the columns for the arguments:

```sqlexample
SELECT calculate_profit(cost, revenue)
  FROM snowflake_scripting_udf_profit;
```

```output
+---------------------------------+
| CALCULATE_PROFIT(COST, REVENUE) |
|---------------------------------|
|                          100.00 |
|                          -10.00 |
|                          200.00 |
|                            1.00 |
+---------------------------------+
```

### Create a Snowflake Scripting UDF with conditional logic

Create a Snowflake Scripting UDF that uses conditional logic to determine the department name
based on an input INTEGER value:

```sqlexample
CREATE OR REPLACE function check_dept(department_id INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  IF (department_id < 3) THEN
    RETURN 'Engineering';
  ELSEIF (department_id = 3) THEN
    RETURN 'Tool Design';
  ELSE
    RETURN 'Marketing';
  END IF;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `check_dept` in a query:

```sqlexample
SELECT check_dept(2);
```

```output
+---------------+
| CHECK_DEPT(2) |
|---------------|
| Engineering   |
+---------------+
```

You can use a [SQL variable](../../../sql-reference/session-variables.md) in an argument when you
call a Snowflake Scripting UDF. The following example sets a SQL variable and then uses the
variable in a call to the `check_dept` UDF:

```sqlexample
SET my_variable = 3;

SELECT check_dept($my_variable);
```

```output
+--------------------------+
| CHECK_DEPT($MY_VARIABLE) |
|--------------------------|
| Tool Design              |
+--------------------------+
```

### Create a Snowflake Scripting UDF with a loop

Create a Snowflake Scripting UDF that uses a loop to count all numbers up to a target number provided
in an argument and calculate the sum of all of the numbers counted:

```sqlexample
CREATE OR REPLACE function count_to(
  target_number INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
AS
DECLARE
  counter INTEGER DEFAULT 0;
  sum_total INTEGER DEFAULT 0;
BEGIN
  WHILE (counter < target_number) DO
    counter := counter + 1;
    sum_total := sum_total + counter;
  END WHILE;
  RETURN 'Counted to ' || counter || '. Sum of all numbers: ' || sum_total;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `count_to` in a query:

```sqlexample
SELECT count_to(10);
```

```output
+---------------------------------------+
| COUNT_TO(10)                          |
|---------------------------------------|
| Counted to 10. Sum of all numbers: 55 |
+---------------------------------------+
```

### Create a Snowflake Scripting UDF with exception handling

Create a Snowflake Scripting UDF that declares an exception and then raises the exception:

```sqlexample
CREATE OR REPLACE FUNCTION raise_exception(input_value INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
AS
DECLARE
  counter_val INTEGER DEFAULT 0;
  my_exception EXCEPTION (-20002, 'My exception text');
BEGIN
  WHILE (counter_val < 12) DO
    counter_val := counter_val + 1;
    IF (counter_val > 10) THEN
      RAISE my_exception;
    END IF;
  END WHILE;
  RETURN counter_val;
EXCEPTION
  WHEN my_exception THEN
    IF (input_value = 1) THEN
      RETURN 'My exception caught: ' || sqlcode;
    ELSEIF (input_value = 2) THEN
      RETURN 'My exception caught with different path: ' || sqlcode;
    END IF;
    RETURN 'Default exception handling path: ' || sqlcode;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `raise_exception` in a query and specify `1` for the input value:

```sqlexample
SELECT raise_exception(1);
```

```output
+-----------------------------+
| RAISE_EXCEPTION(1)          |
|-----------------------------|
| My exception caught: -20002 |
+-----------------------------+
```

Call `raise_exception` in a query and specify `2` for the input value:

```sqlexample
SELECT raise_exception(2);
```

```output
+-------------------------------------------------+
| RAISE_EXCEPTION(2)                              |
|-------------------------------------------------|
| My exception caught with different path: -20002 |
+-------------------------------------------------+t
```

Call `raise_exception` in a query and specify `NULL` for the input value:

```sqlexample
SELECT raise_exception(NULL);
```

```output
+-----------------------------------------+
| RAISE_EXCEPTION(NULL)                   |
|-----------------------------------------|
| Default exception handling path: -20002 |
+-----------------------------------------+
```

### Create a Snowflake Scripting UDF that returns a value for an INSERT statement

Create a Snowflake Scripting UDF that returns a value that is used in an INSERT statement. Create the table
that the values will be inserted into:

```sqlexample
CREATE OR REPLACE TABLE test_sql_udf_insert (num NUMBER);
```

Create a SQL UDF that returns a numeric value:

```sqlexample
CREATE OR REPLACE FUNCTION value_to_insert(l NUMBER, r NUMBER)
RETURNS number
LANGUAGE SQL
AS
BEGIN
  IF (r < 0) THEN
    RETURN l/r * -1;
  ELSEIF (r > 0) THEN
    RETURN l/r;
  ELSE
    RETURN 0;
END IF;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `value_to_insert` in multiple INSERT statements:

```sqlexample
INSERT INTO test_sql_udf_insert SELECT value_to_insert(10, 2);
INSERT INTO test_sql_udf_insert SELECT value_to_insert(10, -2);
INSERT INTO test_sql_udf_insert SELECT value_to_insert(10, 0);
```

Query the table to view the inserted values:

```sqlexample
SELECT * FROM test_sql_udf_insert;
```

```output
+-----+
| NUM |
|-----|
|   5 |
|   5 |
|   0 |
+-----+
```

### Create a Snowflake Scripting UDF called in WHERE and ORDER BY clauses

Create a Snowflake Scripting UDF that returns a value that is used in a WHERE or ORDER BY clause.
Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE test_sql_udf_clauses (p1 INT, p2 INT);

INSERT INTO test_sql_udf_clauses VALUES
  (100, 7),
  (100, 3),
  (100, 4),
  (NULL, NULL);
```

Create a SQL UDF that returns a numeric value that is the product of the multiplication
of two input values:

```sqlexample
CREATE OR REPLACE FUNCTION get_product(a INTEGER, b INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  RETURN a * b;
END;
```

> **Note:**
>
> If you use [Snowflake CLI](../../snowflake-cli/index.md), [SnowSQL](../../../user-guide/snowsql.md),
> the Classic Console, or the `execute_stream` or `execute_string` method in
> [Python Connector](../../python-connector/python-connector.md) code, this example requires minor
> changes. For more information, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../snowflake-scripting/running-examples.md).

Call `get_product` in the WHERE clause of a query to return the rows
where the product is greater than `350`:

```sqlexample
SELECT *
  FROM test_sql_udf_clauses
  WHERE get_product(p1, p2) > 350;
```

```output
+-----+----+
|  P1 | P2 |
|-----+----|
| 100 |  7 |
| 100 |  4 |
+-----+----+
```

Call `get_product` in the ORDER BY clause of a query to order
the results from the lowest to the highest product returned by
the UDF:

```sqlexample
SELECT *
  FROM test_sql_udf_clauses
  ORDER BY get_product(p1, p2);
```

```output
+------+------+
|  P1  | P2   |
|------+------|
| 100  | 3    |
| 100  | 4    |
| 100  | 7    |
| NULL | NULL |
+------+------+
```

---
title: Snowflake SQL API
source: https://docs.snowflake.com/en/developer-guide/sql-api/index.md
section: Developer Guide
---

# Snowflake SQL API

The Snowflake SQL API is a REST API that you can use to access and update data in a Snowflake database. You can use
this API to develop custom applications and integrations that:

* Perform queries
* Manage your deployment (e.g. provision users and roles, create tables, etc.)

The Snowflake SQL API provides operations that you can use to:

* Submit SQL statements for execution.
* Check the status of the execution of a statement.
* Cancel the execution of a statement.

You can use this API to execute [standard queries](../../sql-reference/constructs.md) and most
[DDL](../../sql-reference/sql-ddl-summary.md) and [DML](../../sql-reference/sql-dml.md) statements.
See [Limitations of the SQL API](intro.md) for the types of statements that are not supported.

[Introduction to the SQL API](intro.md)
:   Get an overview of the API.

[About the SQL API endpoints](about-endpoints.md)
:   Learn about the endpoints that make up the API.

[Authenticating to the server](authenticating.md)
:   Use OAuth or Key Pair to authenticate with the Snowflake server.

[Submitting a request to execute SQL statements](submitting-requests.md)
:   Set up and submit requests using an API endpoint.

[Handling responses](handling-responses.md)
:   Check request status and get results and other data after a request.

[Submitting multiple SQL statements in a single request](submitting-multiple-statements.md)
:   Send multiple SQL statements in a single API request.

[Creating and calling stored procedures](using-stored-procedures.md)
:   Create a stored procedure by specifying it in the body of a request.

[Using explicit transactions](using-transactions.md)
:   Execute SQL in a transaction by specifying the start, end, and statements in the transaction.

[Getting details about an error](handling-errors.md)
:   Retrieve error information.

[Canceling the execution of a SQL statement](cancelling-requests.md)
:   Cancel SQL statement execution.

[Snowflake SQL API reference](reference.md)
:   Read details about the operations, objects, HTTP headers, and response codes for this API.

[Deprecated functionality](sql-api-old.md)
:   Learn about deprecated functionality.

---
title: Snowflake SQL API reference
source: https://docs.snowflake.com/en/developer-guide/sql-api/reference.md
section: Developer Guide
---

# Snowflake SQL API reference

This topic documents the operations, requests, and responses for the SQL API.

## Operations

### `POST /api/v2/statements`

To submit one or more SQL statements for execution, send a POST request to `/api/v2/statements`. You can specify that the
statement should be executed asynchronously.

#### Request syntax

```none
POST /api/v2/statements
(request body)
```

#### Query parameters

| Parameter | Description |
| --- | --- |
| `requestId` | (Optional) Unique ID (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier)) of the API request. See [Resubmitting a request to execute SQL statements](submitting-requests.md). |
| `async` | (Optional) Set to `true` to execute the statement asynchronously and return the statement handle.  If the parameter is not specified or is set to false, a statement is executed and the results are returned if the execution is completed in 45 seconds. If the statement execution takes longer to complete, the statement handle is returned. |
| `nullable` | (Optional) Set to `false` to return a SQL NULL value as the string `"null"`, rather than as the value `null`.  **Note:** You cannot specify this parameter in a GET request.  By default, SQL NULL values are returned as the value `null`:  ```sqljson "data" : [ [ null ], ... ] ```  Setting this query parameter to false (e.g. `/api/v2/statements?nullable=false` returns a SQL NULL value as the string `"null"`:  ```sqljson "data" : [ [ "null" ], ... ] ``` |

#### Request headers

The request must include the headers listed in Request headers for all operations.

#### Request body

(Required) The request body must contain the object specified in Body of the POST request to /api/v2/statements/.

#### Response

This operation can return the response codes listed below.

| Code | Description |
| --- | --- |
| 200 | The statement was executed successfully.  For this response code, the response can have the following headers:   * Link   **If a single SQL statement was submitted in the request,** the body of the response contains a ResultSet object containing the requested data.  **Note:** If the `code` field in the response is set to `391908`, the result set is too large, . and the response does not include the entire result set.  The following is an example of a response for a single SQL statement in which the results are returned in a single partition. `{handle}` is the statement handle and `{id1}`, `{id2}`, and `{id3}` are uniquely generated request IDs:  ```none HTTP/1.1 200 OK Date: Tue, 04 May 2021 18:06:24 GMT Content-Type: application/json Link:   </api/v2/statements/{handle}?requestId={id1}&partition=0>; rel="first",   </api/v2/statements/{handle}?requestId={id2}&partition=0>; rel="last" {   "resultSetMetaData" : {     "numRows" : 4,     "format" : "jsonv2",     "rowType" : [ {       "name" : "COLUMN1",       "database" : "",       "schema" : "",       "table" : "",       "scale" : null,       "precision" : null,       "length" : 4,       "type" : "text",       "nullable" : false,       "byteLength" : 16,       "collation" : null     }, {       "name" : "COLUMN2",       "database" : "",       "schema" : "",       "table" : "\"VALUES\"",       "scale" : 0,       "precision" : 1,       "length" : null,       "type" : "fixed",       "nullable" : false,       "byteLength" : null,       "collation" : null     } ],     "partitionInfo": [{       "rowCount": 4,       "uncompressedSize": 1438,     }]   },   "data" : [ [ "test", "2" ], [ "test", "3" ], [ "test", "4" ], [ "test", "5" ] ],   "code" : "090001",   "statementStatusUrl" : "/api/v2/statements/{handle}?requestId={id3}&partition=0",   "sqlState" : "00000",   "statementHandle" : "{handle}",   "message" : "Statement executed successfully.",   "createdOn" : 1620151584132 } ```  The following is an example of a response for a single SQL statement in which the results need to be returned in multiple partitions, where `{handle}` is the statement handle and `{id1}`, `{id2}`, `{id3}`, and `{id4}` are uniquely generated request IDs:  ```none HTTP/1.1 200 OK Date: Tue, 04 May 2021 18:08:15 GMT Content-Type: application/json Link:   </api/v2/statements/{handle}?requestId={id1}&partition=0>; rel="first",   </api/v2/statements/{handle}?requestId={id2}&partition=1>; rel="next",   </api/v2/statements/{handle}?requestId={id3}&partition=1>; rel="last" {   "resultSetMetaData" : {     "numRows" : 56090,     "format" : "jsonv2",     "rowType" : [ {       "name" : "SEQ8()",       "database" : "",       "schema" : "",       "table" : "",       "scale" : 0,       "precision" : 19,       "length" : null,       "type" : "fixed",       "nullable" : false,       "byteLength" : null,       "collation" : null     }, {       "name" : "RANDSTR(1000, RANDOM())",       "database" : "",       "schema" : "",       "table" : "",       "scale" : null,       "precision" : null,       "length" : 16777216,       "type" : "text",       "nullable" : false,       "byteLength" : 16777216,       "collation" : null     } ],     "partitionInfo": [{       "rowCount": 12344,       "uncompressedSize": 14384873,     },{       "rowCount": 43746,       "uncompressedSize": 43748274,       "compressedSize": 746323     }]   },   "data" : [ [ "0", "QqKow2xzdJ....." ],.... [ "98", "ZugTcURrcy...." ] ],   "code" : "090001",   "statementStatusUrl" : "/api/v2/statements/{handle}?requestId={id4}",   "sqlState" : "00000",   "statementHandle" : "{handle}",   "message" : "Statement executed successfully.",   "createdOn" : 1620151693299 } ```  **If multiple SQL statements were submitted in the request,** the body of the response contains a ResultSet object with details about the status of the execution of the multiple statements.  In this case, the response does not contain the requested data. Instead, the `data` field just contains the message “Multiple statements executed successfully”.  The response contains the `statementHandles` field, which is an array of statement handles that you can use to retrieve the results of the individual statements.  The following is an example of a response for a request that specifies multiple SQL statements, where:   * `{handle}` is the statement handle for the set of statements. * `{handle1}`, `{handle2}`, and `{handle3}` are the handles for the individual SQL statements in the request. * `{id1}`, `{id2}`, and `{id3}` are uniquely generated request IDs:   ```none HTTP/1.1 200 OK Date: Mon, 31 May 2021 22:50:31 GMT Content-Type: application/json Link:   </api/v2/statements/{handle}?requestId={id1}&partition=0>; rel="first",   </api/v2/statements/{handle}?requestId={id2}&partition=1>; rel="last"  {   "resultSetMetaData" : {   "numRows" : 56090,   "format" : "jsonv2",   "rowType" : [ {       "name" : "multiple statement execution",       "database" : "",       "schema" : "",       "table" : "",       "type" : "text",       "scale" : null,       "precision" : null,       "byteLength" : 16777216,       "nullable" : false,       "collation" : null,       "length" : 16777216     } ],     "partitionInfo": [{       "rowCount": 12344,       "uncompressedSize": 14384873,     },{      "rowCount": 43746,      "uncompressedSize": 43748274,      "compressedSize": 746323     }]   },   "data" : [ [ "Multiple statements executed successfully." ] ],   "code" : "090001",   "statementHandles" : [ "{handle1}", "{handle2}", "{handle3}" ],   "statementStatusUrl" : "/api/v2/statements/{handle}?requestId={id3}",   "sqlState" : "00000",   "statementHandle" : "{handle}",   "message" : "Statement executed successfully.",   "createdOn" : 1622501430333 } ``` |
| 202 | The execution of the statement is still in progress. Use `GET /api/v2/statements/{statementHandle}` to check the status of the statement execution. See GET /api/v2/statements/{statementHandle} for details.  The body of the response contains a QueryStatus object with details about the status of the statement execution.  The following is an example of a response:  ```none HTTP/1.1 202 Accepted Date: Tue, 04 May 2021 18:12:37 GMT Content-Type: application/json Content-Length: 285 {   "code" : "333334",   "message" :       "Asynchronous execution in progress. Use provided query id to perform query monitoring and management.",   "statementHandle" : "019c06a4-0000-df4f-0000-00100006589e",   "statementStatusUrl" : "/api/v2/statements/019c06a4-0000-df4f-0000-00100006589e" } ``` |
| 408 | The execution of the statement exceeded the timeout period. The execution of the statement was cancelled.  The body of the response contains a QueryStatus object with details about the cancellation of the statement execution. |
| 422 | An error occurred when executing the statement. Check the error code and error message for details.  The body of the response contains a QueryFailureStatus object with details about the error.  The following is an example of a response:  ```none HTTP/1.1 422 Unprocessable Entity Date: Tue, 04 May 2021 20:24:11 GMT Content-Type: application/json {   "code" : "000904",   "message" : "SQL compilation error: error line 1 at position 7\ninvalid identifier 'AFAF'",   "sqlState" : "42000",   "statementHandle" : "019c0728-0000-df4f-0000-00100006606e" } ``` |

For the other response codes returned by this operation, see Response codes for all operations.

### `GET /api/v2/statements/{statementHandle}`

To check the status of the execution of a statement, send a GET request to `/api/v2/statements/{statementHandle}`. If the
statement has been executed successfully, the body of the response includes a ResultSet
object containing the requested data.

#### Request syntax

```none
GET /api/v2/statements/{statementHandle}
```

#### Path parameters

| Parameter | Description |
| --- | --- |
| `statementHandle` | (Required) The handle of the statement that you want to check. You can get this handle from the QueryStatus object returned in the response to the request to execute the statement. |

#### Query parameters

| `requestId` | (Optional) Unique ID (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier)) of the API request. See [Resubmitting a request to execute SQL statements](submitting-requests.md). |
| --- | --- |
| `partition` | (Optional) The partition number to return. The size of each partition is determined by Snowflake.  See [Getting the results from the response](handling-responses.md) for more information. |

#### Request headers

The request must include the headers listed in Request headers for all operations.

#### Response

This operation can return the response codes listed below.

| Code | Description |
| --- | --- |
| 200 | The statement was executed successfully.  For this response code, the response can have the following headers:   * Link   The body of the response has a ResultSet object containing the requested data.  The following is an example of a response, where `{handle}` is the statement handle and `{id1}`, `{id2}`, `{id3}`, `{id4}`, and `{id5}}` are uniquely generated request IDs:  ```none HTTP/1.1 200 OK Date: Tue, 04 May 2021 20:25:46 GMT Content-Type: application/json Link:   </api/v2/statements/{handle}?requestId={id1}&partition=0>; rel="first",   </api/v2/statements/{handle}?requestId={id2}&partition=0>; rel="prev",   </api/v2/statements/{handle}?requestId={id3}&partition=1>; rel="next",   </api/v2/statements/{handle}?requestId={id4}&partition=10>; rel="last" {   "resultSetMetaData" : {     "numRows" : 10000,     "format" : "jsonv2",     "rowType" : [ {       "name" : "SEQ8()",       "database" : "",       "schema" : "",       "table" : "",       "scale" : 0,       "precision" : 19,       "length" : null,       "type" : "fixed",       "nullable" : false,       "byteLength" : null,       "collation" : null     }, {       "name" : "RANDSTR(1000, RANDOM())",       "database" : "",       "schema" : "",       "table" : "",       "scale" : null,       "precision" : null,       "length" : 16777216,       "type" : "text",       "nullable" : false,       "byteLength" : 16777216,       "collation" : null     } ],     "partitionInfo": [{       "rowCount": 12344,       "uncompressedSize": 14384873,     },{       "rowCount": 43746,       "uncompressedSize": 43748274,       "compressedSize": 746323     }]   },   "data" : [ [ "10", "lJPPMTSwps......" ], ... [ "19", "VJKoHmUFJz......" ] ],   "code" : "090001",   "statementStatusUrl" : "/api/v2/statements/{handle}?requestId={id5}&partition=10",   "sqlState" : "00000",   "statementHandle" : "{handle}",   "message" : "Statement executed successfully.",   "createdOn" : 1620151693299 } ``` |
| 202 | The execution of the statement is still in progress. Repeat the request to check the status of the statement execution.  The body of the response contains a QueryStatus object with details about the status of the statement execution.  The following is an example of a response:  ```none HTTP/1.1 202 Accepted Date: Tue, 04 May 2021 22:31:33 GMT Content-Type: application/json Content-Length: 285 {   "code" : "333334",   "message" :       "Asynchronous execution in progress. Use provided query id to perform query monitoring and management.",   "statementHandle" : "019c07a7-0000-df4f-0000-001000067872",   "statementStatusUrl" : "/api/v2/statements/019c07a7-0000-df4f-0000-001000067872" } ``` |
| 422 | An error occurred when executing the statement. Check the error code and error message for details.  The body of the response contains a QueryFailureStatus object with details about the error. |

For the other response codes returned by this operation, see Response codes for all operations.

### `POST /api/v2/statements/{statementHandle}/cancel`

To cancel the execution of a statement, send a POST request to `/api/v2/statements/{statementHandle}/cancel`.

#### Request syntax

```none
POST /api/v2/statements/{statementHandle}/cancel
```

#### Path parameters

| Parameter | Description |
| --- | --- |
| `statementHandle` | (Required) The handle of the statement that you want to check. You can get this handle from the QueryStatus object returned in the response to the request to execute the statement. |

#### Query parameters

| Parameter | Description |
| --- | --- |
| `requestId` | (Optional) Unique ID (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier)) of the API request. See [Resubmitting a request to execute SQL statements](submitting-requests.md). |

#### Request headers

The request must include the headers listed in Request headers for all operations.

#### Response

This operation can return the response codes listed below.

| Code | Description |
| --- | --- |
| 200 | Execution of the statement was cancelled successfully.  The body of the response contains a CancelStatus object that contains information about the cancellation of the statement.  The following is an example of a response:  ```none HTTP/1.1 200 OK Date: Tue, 04 May 2021 22:52:15 GMT Content-Type: application/json Content-Length: 230 {   "code" : "000604",   "sqlState" : "57014",   "message" : "SQL execution canceled",   "statementHandle" : "019c07bc-0000-df4f-0000-001000067c3e",   "statementStatusUrl" : "/api/v2/statements/019c07bc-0000-df4f-0000-001000067c3e" } ``` |
| 422 | An error occurred when executing the statement. Check the error code and error message for details.  The body of the response contains a QueryFailureStatus object with details about the error.  The following is an example of a response:  ```none HTTP/1.1 422 Unprocessable Entity Date: Tue, 04 May 2021 22:52:49 GMT Content-Type: application/json Content-Length: 183 {   "code" : "000709",   "message" : "Statement 019c07bc-0000-df4f-0000-001000067c3e not found",   "sqlState" : "02000",   "statementHandle" : "019c07bc-0000-df4f-0000-001000067c3e" } ``` |

For the other response codes returned by this operation, see Response codes for all operations.

## Request headers for all operations

The following request headers are apply to all operations:

| Header | Required or Optional? | Description |
| --- | --- | --- |
| `Authorization` | Required | Set this to `Bearer`, followed by the token used to authenticate to Snowflake.   * For [key pair authentication](authenticating.md), use the generated JWT as the token. * For [OAuth](authenticating.md), use the generated OAuth token as the token.   For example:  `Authorization: Bearer token`  See [Authenticating to the server](authenticating.md). |
| `Accept` | Required | Set this to the list of media types (MIME types) that are acceptable in the body of the response. Include the type `application/json` (or, if all types are acceptable, set this to `*/*`). |
| `Content-Type` | Required | Set this to the media type (MIME type) of the body of the request. Set this to `application/json`. |
| `User-Agent` | Required | Set this to the name and version of your application (e.g. `applicationName/applicationVersion`). You must use a value that complies with [RFC 7231](https://tools.ietf.org/html/rfc7231#section-5.5.3). |
| `X-Snowflake-Authorization-Token-Type` | Optional | Set this to one of the following values:   * `KEYPAIR_JWT`, if you are using [key pair authentication](authenticating.md). * `OAUTH`, if you are using [OAuth](authenticating.md).   If you omit the `X-Snowflake-Authorization-Token-Type` header, Snowflake determines the token type by examining the token.  Even though this header is optional, you can choose to specify this header. You can set the header to one of the following values:   * `KEYPAIR_JWT` (for key-pair authentication) * `OAUTH` (for OAuth) * `PROGRAMMATIC_ACCESS_TOKEN` (for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md)) |

## Types of objects in the request body

### Body of the `POST` request to `/api/v2/statements/`

The body of a `POST` request to the `/api/v2/statements/` endpoint (see
POST /api/v2/statements) is a JSON object that you use to specify the SQL statement to execute, the
statement context, and the format of data in the result set. You use this object in the body of a request to execute a statement.

#### Fields

| Field | Description |
| --- | --- |
| `statement` | (Optional) SQL statement to execute. See [Limitations of the SQL API](intro.md) for the lists of statements that are supported and not supported.  Type: string |
| `timeout` | (Optional) Timeout in seconds for statement execution. If the execution of a statement takes longer than the specified timeout, the execution is automatically canceled. To set the timeout to the maximum value (604800 seconds), set timeout to 0. If this field is not set, the timeout specified by the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) parameter is used.  Type: 64-bit signed integer  Example: `10` |
| `database` | (Optional) Database in which the statement should be executed. The value in this field is case-sensitive.  If you omit this field, the SQL API uses the database from the value of the `DEFAULT_NAMESPACE` [property of the user](../../sql-reference/sql/alter-user.md).  Type: string  Example: `TESTDB` |
| `schema` | (Optional) Schema in which the statement should be executed. The value in this field is case-sensitive.  If you omit this field, the SQL API uses the schema from the value of the `DEFAULT_NAMESPACE` [property of the user](../../sql-reference/sql/alter-user.md).  Type: string  Example: `TESTSCHEMA` |
| `warehouse` | (Optional) Warehouse to use when executing the statement. The value in this field is case-sensitive.  If you omit this field, the SQL API uses the value of the `DEFAULT_WAREHOUSE` [property of the user](../../sql-reference/sql/alter-user.md).  Type: string  Example: `TESTWH` |
| `role` | (Optional) Role to use when executing the statement. The value in this field is case-sensitive.  If you omit this field, the SQL API uses the value of the `DEFAULT_ROLE` [property of the user](../../sql-reference/sql/alter-user.md).  Type: string  Example: `TESTROLE` |
| `bindings` | (Optional) Values of [bind variables](submitting-requests.md) in the SQL statement. When executing the statement, Snowflake replaces placeholders (`?` and `:name`) in the statement with these specified values.  Note that the format of this field may change for the GA release of the SQL API.  Type: object  Example:  ```sqljson {"1":{"type":"FIXED","value":"123"},"2":{"type":"TEXT","value":"teststring"}} ``` |
| `parameters` | (Optional) [Session parameters](../../sql-reference/parameters.md) that you want to set for this request.  Type: object (statements_parameters) |

#### Example

The following is an example of the body object:

```sqljson
{
  "statement" : "select * from T where c1=?",
  "timeout" : 10,
  "database" : "TESTDB",
  "schema" : "TESTSCHEMA",
  "warehouse" : "TESTWH",
  "role" : "TESTROLE",
  "bindings" : {
    "1" : {
      "type" : "FIXED",
      "value" : "123"
    }
  }
}
```

### `statements_parameters`

`statements_parameters` is a JSON object that you use to specify the [session parameters](../../sql-reference/parameters.md)
that you want to set for this request. This object should be in the `parameters` field of the body of the `POST`
request to the `/api/v2/statements` endpoint (see Body of the POST request to /api/v2/statements/).

> **Note:**
>
> The SQL API only supports the session parameters listed in the following table.

#### Fields

| Field | Description |
| --- | --- |
| `binary_output_format` | (Optional) Specifies format for VARCHAR values returned as output by BINARY-to-VARCHAR conversion functions. For details, see [BINARY_OUTPUT_FORMAT](../../sql-reference/parameters.md).  Type: string  Example: `HEX` |
| `client_result_chunk_size` | (Optional) Specifies the maximum size of each set (or chunk) of query results to download (in MB). For details, see [CLIENT_RESULT_CHUNK_SIZE](../../sql-reference/parameters.md).  Type: integer  Example: `100` |
| `date_output_format` | (Optional) Specifies the display format for the DATE data type. For details, see [DATE_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `YYYY-MM-DD` |
| `multi_statement_count` | (Required when specifying more than one SQL statement in a request) Specifies the number of SQL statements to be submitted in a request when using the multi-statement capability. Valid values are:   * `0`: Indicates that a variable number of statements can be included in the request. * `1`: Indicates that a single SQL statement can be included in the request. This is the default   value used if you do not specify the `MULTI_STATEMENT_COUNT` field. * `> 1`: Indicates the number of SQL statements submitted in the request. This number must match   the number of statements specified in the `statement` field.   Type: string  Example: `2` |
| `query_tag` | (Optional) Query tag that you want to associate with the SQL statement. For details, see [QUERY_TAG parameter](../../sql-reference/parameters.md).  Type: string  Example: `tag-1234` |
| `rows_per_resultset` | (Optional) Specifies the maximum number of rows returned in a result set, with 0 (default) meaning no maximum. For details, see [ROWS_PER_RESULTSET parameter](../../sql-reference/parameters.md).  Type: integer  Example: 200 |
| `time_output_format` | (Optional) Specifies the display format for the TIME data type. For details, see [TIME_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `HH24:MI:SS` |
| `timestamp_ltz_output_format` | (Optional) Specifies the display format for the TIMESTAMP_LTZ data type. For details, see [TIMESTAMP_LTZ_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `YYYY-MM-DD HH24:MI:SS.FF3` |
| `timestamp_ntz_output_format` | (Optional) Specifies the display format for the TIMESTAMP_NTZ data type. For details, see [TIMESTAMP_NTZ_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `YYYY-MM-DD HH24:MI:SS.FF3` |
| `timestamp_output_format` | (Optional) Specifies the display format for the TIMESTAMP data type alias. For details, see [TIMESTAMP_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM` |
| `timestamp_tz_output_format` | (Optional) Specifies the display format for the TIMESTAMP_TZ data type. For details, see [TIMESTAMP_TZ_OUTPUT_FORMAT](../../sql-reference/parameters.md). See [Formatting the Output of Query Results](handling-responses.md) for details on using parameters to determine the output format of query results.  Type: string  Example: `YYYY-MM-DD HH24:MI:SS.FF3` |
| `timezone` | (Optional) Time zone to use when executing the statement. For details, see [TIMEZONE parameter](../../sql-reference/parameters.md).  Type: string  Example: `america/los_angeles` |
| `use_cached_result` | (Optional) Whether query results can be reused between successive invocations of the same query as long as the original result has not expired. For details, see [USE_CACHED_RESULT parameter](../../sql-reference/parameters.md)  Type: string  Example: `true` |

## Response codes for all operations

This section lists the response codes that apply to all operations.

| Code | Description |
| --- | --- |
| 400 | Bad Request.  The request payload is invalid or malformed. This happens if the application didn’t send the correct request payload. The response body may include the error code and message indicating the actual cause. The application must reconstruct the request body for retry.  The following is an example of a response:  ```none HTTP/1.1 400 Bad Request Date: Tue, 04 May 2021 22:54:21 GMT Content-Type: application/json {   "code" : "390142",   "message" : "Incoming request does not contain a valid payload." } ``` |
| 401 | Unauthorized.  The request is not authorized. This happens if the attached access token is invalid or missing. The response body may include the error code and message indicating the actual cause, e.g., expired, invalid token. The application must obtain a new access token for retry.  See [Authenticating to the server](authenticating.md).  The following is an example of a response:  ```none HTTP/1.1 401 Unauthorized Date: Tue, 04 May 2021 20:17:57 GMT Content-Type: application/json {   "code" : "390303",   "message" : "Invalid OAuth access token. ...TTTTTTTT" } ``` |
| 403 | Forbidden.  The request is forbidden. This happens if the request is made even if the API is not enabled. |
| 404 | Not Found.  The request endpoint is not valid. This happens if the API endpoint is wrong. For example, if the application requests `/api/v2/hello`, which does not exist, the server returns this code. |
| 405 | Method Not Allowed.  The request method does not match the supported API. This happens, for example, if the application calls the API with the GET method but the endpoint accepts only POST. The application must use a supported method when sending the request.  The following is an example of a response:  ```none HTTP/1.1 405 Method Not Allowed Date: Tue, 04 May 2021 22:55:38 GMT Content-Length: 0 ``` |
| 415 | The request header `Content-Type` includes unsupported media type. |
| 422 | The request was well-formed (i.e., syntactically correct) but could not be processed.  The API supports `application/json` only. If no `Content-Type` is specified, the request payload is interpreted as JSON, but if any other media type is specified, this error is returned. |
| 429 | Too many requests.  The number of requests hit the rate limit. The application must reduce the frequency of requests sent to the API endpoints. The application may retry with backoff. Exponentially jittered backoff is recommended.  This response can also occur when the server receives too many concurrent requests. Concurrency limits on the API are determined by the concurrency limits enforced by Snowflake.  The following is an example of a response:  ```none HTTP/1.1 429 Too many requests Content-Type: application/json Content-Length: 69 {   "code" : "390505",   "message" : "Too many requests."  } ``` |
| 500 | Internal Server Error.  The server encountered an unrecoverable system error. The response body can include the error code and message for further guidance.  You can retry exponential backoff by setting the `requestId` and `retry` parameters to `true`. For more information, see [Resubmitting a request to execute SQL statements](submitting-requests.md). |
| 502 | Bad Gateway.  The server was acting as a gateway or proxy and received an invalid response from the upstream server.  You can retry exponential backoff by setting the `requestId` and `retry` parameters to `true`. For more information, see [Resubmitting a request to execute SQL statements](submitting-requests.md). |
| 503 | Service Unavailable.  The request was not processed due to a timeout on the server. The application may retry with backoff. Exponentially jittered backoff is recommended. |
| 504 | Gateway Timeout.  The request was not processed due to a timeout on the server. The application may retry with backoff. Exponentially jittered backoff is recommended. |
| 522 | Invalid SSL Certificate.  The server could not validate the provided SSL certificate. |

## Response headers for all operations

Responses can contain the following headers:

| Header | Description |
| --- | --- |
| `Link` | This header is in the 200 response for a request to execute the statement and a request to check the status of the execution of a statement.  This header provides links to other partitions of results (e.g. the first partition, the last partition, etc.). The header can include multiple URL entries with different `rel` attribute values that specify the partition to return (`first`, `next`, `prev`, and `last`).  For example:  ```none Link: </api/v2/statements/e127cc7c-7812-4e72-9a55-3b4d4f969840?partition=1; rel="last">,       </api/v2/statements/e127cc7c-7812-4e72-9a55-3b4d4f969840?partition=1; rel="next">,       </api/v2/statements/e127cc7c-7812-4e72-9a55-3b4d4f969840?partition=0; rel="first"> ``` |

## Types of objects in the response body

### `CancelStatus`

`CancelStatus` is a JSON object that contains information about the cancellation of the execution of a statement. This
object is returned in the body of the response for a cancellation request.

#### Fields

| Field | Description |
| --- | --- |
| `code` | Type: string |
| `sqlState` | Type: string |
| `message`  Example: `successfully cancelled` | Type: string |
| `statementHandle` | Unique identifier for the statement being executed.  Type: string (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier))  Example: `536fad38-b564-4dc5-9892-a4543504df6c` |
| `statementStatusUrl` | URL to get the statement status and result set.  Type: string (a URL)  Example: `/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c` |

#### Example

```sqljson
{
  "code" : "0",
  "sqlState" : "",
  "message" : "successfully canceled",
  "statementHandle" : "536fad38-b564-4dc5-9892-a4543504df6c",
  "statementStatusUrl" : "/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c"
}
```

### `QueryFailureStatus`

QueryFailureStatus is a JSON object that contains information about a failure to execute a statement. This object is returned in
the body of the 422 response for a request to execute the statement.

#### Fields

| Field | Description |
| --- | --- |
| `code` | Type: string  Example: `0` |
| `sqlState` | Type: string |
| `message` | Type: string  Example: `successfully executed` |
| `statementHandle` | Unique identifier for the statement being executed.  Type: string (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier))  Example: `536fad38-b564-4dc5-9892-a4543504df6c` |
| `createdOn` | Timestamp that specifies when the statement execution started. The timestamp is expressed in milliseconds since the epoch.  Type: 64-bit signed integer  Example: `1597090533987` |
| `statementStatusUrl` | URL to get the statement status and result set.  Type: string (a URL)  Example: `/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c` |

#### Example

```sqljson
{
  "code" : "002139",
  "sqlState" : "42601",
  "message" : "SQL compilation error: Unknown function",
  "statementHandle" : "e4ce975e-f7ff-4b5e-b15e-bf25f59371ae",
  "statementStatusUrl" : "/api/v2/statements/e4ce975e-f7ff-4b5e-b15e-bf25f59371ae"
}
```

### `QueryStatus`

`QueryStatus` is a JSON object that contains information about the status of the execution of a statement. This object is
returned in the following:

* the body of the 202 and 408 response for a request to execute the statement.
* the body of a 202 and 422 response for a
  request to check the status of the execution of a statement.

#### Fields

| Field | Description |
| --- | --- |
| `code` | Type: string  Example: `0` |
| `sqlState` | Type: string |
| `message` | Type: string  Example: `successfully executed` |
| `statementHandle` | Unique identifier for the statement being executed.  Type: string (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier))  Example: `536fad38-b564-4dc5-9892-a4543504df6c` |
| `createdOn` | Timestamp that specifies when the statement execution started. The timestamp is expressed in milliseconds since the epoch.  Type: 64-bit signed integer  Example: `1597090533987` |
| `statementStatusUrl` | URL to get the statement status and result set.  Type: string (a URL)  Example: `/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c` |

#### Example

```sqljson
{
  "code" : "0",
  "sqlState" : "",
  "message" : "successfully executed",
  "statementHandle" : "e4ce975e-f7ff-4b5e-b15e-bf25f59371ae",
  "statementStatusUrl" : "/api/v2/statements/e4ce975e-f7ff-4b5e-b15e-bf25f59371ae"
}
```

### `ResultSet`

`ResultSet` is a JSON object that contains the results of the execution of a statement. This object is returned in the body
of the 200 response for a request to execute the statement and a
request to check the status of the execution of a statement.

#### Fields

| Field | Description |
| --- | --- |
| `code` | Type: string  Example: `0` |
| `sqlState` | Type: string |
| `message` | Type: string  Example: `successfully executed` |
| `statementHandle` | Unique identifier for the statement being executed.  If multiple statements were specified in the request, this handle corresponds to the set of those statements. For the handles of the individual statements in the request, see the `statementHandles` field.  Type: string (a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier))  Example: `536fad38-b564-4dc5-9892-a4543504df6c` |
| `statementHandles` | Array of unique identifiers for the statements being executed for this request.  Type: array of strings ([UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier))  Example: `[ "019c9f9a-0502-f25e-0000-438300e0d046", "019c9f9a-0502-f25e-0000-438300e0d04a", "019c9f9a-0502-f25e-0000-438300e0d04e" ]` |
| `createdOn` | Timestamp that specifies when the statement execution started. The timestamp is expressed in milliseconds since the epoch.  Example: `1597090533987` |
| `statementStatusUrl` | URL to get the statement status and result set.  Type: string (a URL)  Example: `/api/v2/statements/536fad38-b564-4dc5-9892-a4543504df6c` |
| `resultSetMetaData` | Metadata about the result set returned.  Type: object (ResultSet_resultSetMetaData) |
| `data` | **If the request contains a single SQL statement,** this field contains the result set data.  A result set format is an array of arrays in JSON:   * Each array corresponds to a single row. * The elements in a row correspond to the values in the columns for that row. * The data is encoded as JSON strings, regardless of the Snowflake datatype.   Type: array of arrays  Example:  ```sqljson [   ["customer1","1234 A Avenue","98765","1565481394123000000"],   ["customer2","987 B Street","98765","1565516712912012345"],   ["customer3","8777 C Blvd","98765","1565605431999999999"],   ["customer4","64646 D Circle","98765","1565661272000000000"] ] ```  **If the request contains multiple SQL statements,** this field just contains the message “Multiple statements executed successfully”. To retrieve the results for each statement in the request, get the handles for these statements from the `statementHandles` field, and send requests to get the results of each statement. |
| `stats` | For DML statements, this field contains statistics about the number of rows affected by the operation.  Type: object (ResultSet_stats) |

### `ResultSet_resultSetMetaData`

`ResultSet_resultSetMetaData` is a JSON object that contains metadata about the results of the execution of a statement.
This object is in the `resultSetMetaData` field of the ResultSet object.

#### Fields

| Field | Description |
| --- | --- |
| `partition` | The index number of the partition that you want to return (where `0` specifies the first partition of data). Snowflake returns data in partitions. Snowflake determines the number of partitions and the size of each partition at runtime. You can get the list of partitions from the `resultSetMetaData` object in the response to the POST request.  See [Getting the results from the response](handling-responses.md) for more information. |
| `numRows` | The total number of rows of results.  Type: 64-bit signed integer  Example: `100` |
| `format` | Format of the data in the result set.  Type: string |
| `rowType` | Array of ResultSet_resultSetMetaData_rowType objects that describe the columns in the set of results.  Type: array of ResultSet_resultSetMetaData_rowType.  Example:  ```sqljson [  {"name":"ROWNUM","type":"FIXED","length":0,"precision":38,"scale":0,"nullable":false},  {"name":"ACCOUNT_ID","type":"FIXED","length":0,"precision":38,"scale":0,"nullable":false},  {"name":"ACCOUNT_NAME","type":"TEXT","length":1024,"precision":0,"scale":0,"nullable":false},  {"name":"ADDRESS","type":"TEXT","length":16777216,"precision":0,"scale":0,"nullable":true},  {"name":"ZIP","type":"TEXT","length":100,"precision":0,"scale":0,"nullable":true},  {"name":"CREATED_ON","type":"TIMESTAMP_NTZ","length":0,"precision":0,"scale":3,"nullable":false} ] ``` |

### `ResultSet_resultSetMetaData_rowType`

`ResultSet_resultSetMetaData_rowType` is a JSON object that describes a column in a set of results. An array of these
objects is in the `rowType` field of the ResultSet_resultSetMetaData object.

#### Fields

| Field | Description |
| --- | --- |
| `name` | Name of the column.  Type: string |
| `type` | [Snowflake data type](../../sql-reference/intro-summary-data-types.md) of the column.  Type: string |
| `length` | Length of the column.  Type: 64-bit signed integer |
| `precision` | Precision of the column.  Type: 64-bit signed integer |
| `scale` | Scale of the column.  Type: 64-bit signed integer |
| `nullable` | Specifies whether or not the column is nullable.  Type: boolean |

#### Example

```sqljson
{
 "name":"ACCOUNT_NAME",
 "type":"TEXT",
 "length":1024,
 "precision":0,
 "scale":0,
 "nullable":false
}
```

### `ResultSet_stats`

`ResultSet_stats` is a JSON object that contains statistics about the execution of a DML statement. This object is in the
`stats` field of the ResultSet_resultSetMetaData object.

#### Fields

| Field | Description |
| --- | --- |
| `numRowsInserted` | Number of rows that were inserted.  Type: 64-bit signed integer  Example: `12` |
| `numRowsUpdated` | Number of rows that were updated.  Type: 64-bit signed integer  Example: `9` |
| `numRowsDeleted` | Number of rows that were deleted.  Type: 64-bit signed integer  Example: `8` |
| `numDuplicateRowsUpdated` | Number of duplicate rows that were updated.  Type: 64-bit signed integer  Example: `20` |

---
title: Snowflake telemetry package dependencies
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/telemetry-package-dependencies.md
section: Developer Guide
---

# Snowflake telemetry package dependencies

When you add the `com.snowflake:telemetry` package to the definition of your function or procedure, libraries that are dependencies of
Snowflake telemetry will be added to environment in which the function or procedure executes. Avoid importing other versions of these
libraries to support your own code because doing so may result in collisions and unexpected behavior.

You can use the telemetry package when adding code to perform logging or tracing to handlers written in Java or Scala.

The following tables list the libraries included for each telemetry package version.

## Version 0.1.0

| Group ID | Artifact ID | Version |
| --- | --- | --- |
| ch.qos.logback | logback-core | 1.3.6 |
| ch.qos.logback | logback-classic | 1.3.6 |
| io.opentelemetry | opentelemetry-api | 1.35.0 |
| io.opentelemetry | opentelemetry-context | 1.35.0 |
| io.opentelemetry | opentelemetry-sdk | 1.35.0 |
| io.opentelemetry | opentelemetry-sdk-common | 1.35.0 |
| io.opentelemetry | opentelemetry-sdk-trace | 1.35.0 |
| io.opentelemetry | opentelemetry-sdk-metrics | 1.35.0 |
| io.opentelemetry | opentelemetry-sdk-logs | 1.35.0 |
| io.opentelemetry | opentelemetry-api-events | 1.35.0-alpha |
| io.opentelemetry | opentelemetry-exporter-otlp-common | 1.35.0 |
| io.opentelemetry | opentelemetry-exporter-common | 1.35.0 |
| io.opentelemetry | opentelemetry-extension-incubator | 1.35.0-alpha |
| org.slf4j | slf4j-api | 2.0.4 |

## Version 0.0.1

| Group ID | Artifact ID | Version |
| --- | --- | --- |
| io.opentelemetry | opentelemetry-api | 1.21.0 |
| io.opentelemetry | opentelemetry-context | 1.21.0 |
| io.opentelemetry | opentelemetry-sdk | 1.21.0 |
| io.opentelemetry | opentelemetry-sdk-trace | 1.21.0 |
| io.opentelemetry | opentelemetry-sdk-metrics | 1.21.0 |
| io.opentelemetry | opentelemetry-sdk-logs | 1.21.0-alpha |
| io.opentelemetry | opentelemetry-api-logs | 1.21.0-alpha |
| io.opentelemetry | opentelemetry-semconv | 1.21.0-alpha |
| io.opentelemetry | opentelemetry-exporter-otlp-common | 1.21.0 |
| io.opentelemetry | opentelemetry-exporter-common | 1.21.0 |
| org.slf4j | slf4j-api | 1.7.25 |
| ch.qos.logback | logback-core | 1.2.3 |
| ch.qos.logback | logback-classic | 1.2.3 |
| io.opentelemetry.proto | opentelemetry-proto | 0.19.0-alpha |

---
title: Snowpark Connect for Spark compatibility guide
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-compatibility.md
section: Developer Guide
---

# Snowpark Connect for Spark compatibility guide

This guide documents the compatibility between the Snowpark Connect for Spark implementation of the Spark DataFrame APIs and native
Apache Spark. It is intended to help users understand the key differences, unsupported features, and migration considerations when moving
Spark workloads to Snowpark Connect for Spark.

Snowpark Connect for Spark aims to provide a familiar Spark DataFrame API experience on top of the Snowflake execution engine.
However, there are the compatibility gaps described in this topic. This guide highlights those differences to help you plan and adapt
your migration. These might be addressed in a future release.

## DataTypes

### Unsupported data types

* [DayTimeIntervalType](https://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/types/DayTimeIntervalType.html)
* [YearMonthIntervalType](https://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/types/YearMonthIntervalType.html)
* [UserDefinedTypes](https://spark.apache.org/docs/latest/api/java/org/apache/spark/sql/types/UserDefinedType.html)

### Implicit data type conversion

When using Snowpark Connect for Spark, keep in mind how data types are handled. Snowpark Connect for Spark implicitly represents `ByteType`,
`ShortType`, and `IntegerType` as `LongType`. This means that while you might define columns or data with
`ByteType`, `ShortType`, or `IntegerType`, the data will be represented and returned by Snowpark Connect for Spark as
`LongType`. Similarly, implicit conversion might also occur for `FloatType` and
`DoubleType` depending on the specific operations and context. The Snowflake execution engine will internally handle data
type compression and may in fact store the data as `Byte` or `Short`, but these are considered implementation details and not exposed to the
end user.

Semantically, this representation will not impact the correctness of your Spark queries.

| Data type from native PySpark | Data type from Snowpark Connect for Spark |
| --- | --- |
| `ByteType` | `LongType` |
| `ShortType` | `LongType` |
| `IntegerType` | `LongType` |
| `LongType` | `LongType` |

The following example shows a difference in how Spark and Snowpark Connect for Spark handle data types in query results.

#### Query

```python
query = """
    SELECT * FROM VALUES
    (float(1.0), double(1.0), 1.0, "1", true, :code:`NULL`),
    (float(2.0), double(2.0), 2.0, "2", false, :code:`NULL`),
    (float(3.0), double(3.0), :code:`NULL`, "3", false, :code:`NULL`)
    AS tab(a, b, c, d, e, f)
    """
```

#### Spark

```python
spark.sql(query).printSchema()
```

```output
root
 |-- a: float (nullable = false)
 |-- b: double (nullable = false)
 |-- c: decimal(2,1) (nullable = true)
 |-- d: string (nullable = false)
 |-- e: boolean (nullable = false)
 |-- f: void (nullable = true)
```

#### Snowpark Connect for Spark

```python
snowpark_connect_spark.sql(query).printSchema()
```

```output
root
 |-- a: double (nullable = false)
 |-- b: double (nullable = false)
 |-- c: decimal (nullable = true)
 |-- d: string (nullable = false)
 |-- e: boolean (nullable = true)
 |-- f: string (nullable = true)
```

### `NullType` nuance

Snowpark Connect for Spark doesn’t support the [NullType](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.types.NullType.html)
datatype, which is a supported data type in Spark. This causes behavior changes when using `Null` or `None` in dataframes.

In Spark, a literal `NULL` (for example, with `lit(None)`) is automatically inferred as a `NullType`. In Snowpark Connect for Spark, it is inferred as a
`StringType` during schema inference.

```python
df = self.spark.range(1).select(lit(None).alias("null_col"))
field = df.schema["null_col"]

# Spark: StructField('null_col', :code:`NullType`(), True)
# Snowpark Connect for Spark: StructField('null_col', :code:`StringType`(), True)
```

### Structured data types in `ArrayType`, `MapType`, and `ObjectType`

While structured type support is not available by default in Snowpark Connect for Spark, `ARRAY`, `MAP` and `Object` datatypes are
treated as generic, untyped collections. This means there is no enforcement of element types, field names, schema, or nullability, unlike
what would be provided by structured type support.

If you have a dependency on this support, please work with your account team to enable this feature for your account.

## Unsupported Spark APIs

The following are the APIs supported by classic Spark and Spark Connect but not supported in Snowpark Connect for Spark.

* [Dataframe.hint](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.hint.html):
  Snowpark Connect for Spark ignores any hint that is set on a dataframe. The Snowflake query optimizer automatically determines the
  most efficient execution strategy.
* [DataFrame.repartition](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.repartition.html):
  This is a no-op in Snowpark Connect for Spark. Snowflake automatically manages data distribution and partitioning across its distributed
  computing infrastructure.
* [pyspark.RDD](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.html): RDD APIs are not supported in
  Spark Connect (including Snowpark Connect for Spark).
* [pyspark.ml](https://spark.apache.org/docs/latest/api/python/reference/pyspark.ml.html)
* [pyspark streaming](https://spark.apache.org/docs/latest/streaming-programming-guide.html)

## UDF differences

### `StructType` differences

When Spark converts a `StructType` to be used in a user-defined function (UDF), it converts it to a `tuple` type in Python. Snowpark Connect for Spark will convert a
`StructType` into a `dict` type in Python. This has fundamental differences in element access and output.

* Spark will access indexes with 0, 1, 2, 3, and so on.
* Snowpark Connect for Spark will access indexes using ‘_1’, ‘_2’, and so on.

```python
def f(e):
    return e[0]

    df = self.spark.createDataFrame([((1.0, 1.0), (1, 1))], ["c1", "c2"])
    result = df.select("*", udf(f, DoubleType())("c1"))

# This results in an index access issue. Workaround is to use _1, _2 as indicies.
# Workaround:

def f(e):
    return e['_1']

row = (
    self.spark.range(1)
    .selectExpr("struct(1, 2) as struct")
    .select(
        udf(lambda x: x, "struct<col1:int,col2:int>")("struct"),
    )
    .first()
)

self.assertEquals(row[0], Row(col1=1, col2=2))

# Spark: Row(col1=1, col2=2)

# Snowpark Connect for Spark: {'col1': 1, 'col2': 2}
```

### Iterator Type in UDFs

Iterator isn’t supported as a return type or as an input type.

```python
# This will not work
def func(iterator):
  for _ in iterator:
              ...

df = self.spark.range(10)
actual = df.repartition(1).mapInArrow(func, "a long").collect()
```

### Importing files to a Python UDF

With Snowpark Connect for Spark, you can specify external libraries and files in Python UDFs. Snowflake includes Python files and archives in your code’s
execution context. You can import functions from these included files in a UDF without additional steps. This dependency-handling behavior
works as described in [Creating a Python UDF with code uploaded from a stage](../udf/python/udf-python-creating.md).

To include external libraries and files, you provide stage paths to the files as the value of the configuration setting
`snowpark.connect.udf.imports`. The configuration value should be an array of stage paths to the files, where the paths are
separated by commas.

Code in the following example includes two files in the UDF’s execution context. The UDF imports functions from these files and uses them
in its logic.

```python
# Files need to be previously staged
spark.conf.set("snowpark.connect.udf.imports", "[@stage/library.py, @other_lib.zip]")

@udf(returnType = StringType())
def import_example(input: str) -> str:
  from library import first_function
  from other_lib.custom import second_function

  return first_function(input) + second_function(input)

spark.range(1).select(import_read_example("example_string")).show()
```

You can use the `snowpark.connect.udf.imports` setting to include other kinds of files as well, such as those with data your code
needs to read. Note that when you do this, your code should only read from the included files; any writes to such files will be lost after
the function’s execution ends.

```python
# Files need to be previously staged
spark.conf.set("snowpark.connect.udf.imports", "[@stage/data.csv]")

@udf(returnType = StringType())
def import_read_example(file_name: str) -> str:
  with open(file_name) as f:
    return f.read()

spark.range(1).select(import_read_example("data.csv")).show()
```

## Lambda function limitations

User-defined functions (UDFs) are not supported within lambda expressions. This includes both custom UDFs and
certain built-in functions whose underlying implementation relies on Snowflake UDFs. Attempting to use a UDF inside a lambda expression
will result in an error.

```python
df = spark.createDataFrame([({"a": 123},)], ("data",))
df.select(map_filter("data", lambda _, v: bit_count(v) > 3)).show() # does not work, since `bit_count` is implemented with UDF
```

## Using path-sensitive modules

If the Python UDF body imports a module that requires a precise path, you need to take additional steps. When loading dependencies for UDFs, Snowflake puts all of the files in the working directory without preserving the original path. To preserve the original structure, you must zip dependencies and then add as an import for SCOS by using either `addArtifacts` or configuration `snowpark.connect.udf.python.imports`.

```python
# Make sure to zip module before importing to stage
spark.conf.set("snowpark.connect.udf.python.imports", "[@nested_library.zip]")

@udf(returnType = StringType())
def import_example(input: str) -> str:
  from nested_library.sub_module.functions import example_func

  return example_func(input)

spark.range(1).select(import_read_example("example_string")).show()

#add dependencies for import
spark.addArtifacts("nested_library.zip", pyfile=True)

@udf(returnType = StringType())
def import_example(input: str) -> str:
  from nested_library.sub_module.functions import example_func

  return example_func(input)

spark.range(1).select(import_read_example("example_string")).show()
```

## Data sources

| Data source | Compatibility issues compared with PySpark |
| --- | --- |
| Avro | File type is not supported. |
| CSV | Save mode is not supported for the following: `Append`, `Ignore`.  The followings are known limitations:   * `compression`: This parameter supports only the following values: GZIP, BZ2, BROTLI, ZSTD, DEFLATE, RAW_DEFLATE, NONE, UNCOMPRESSED. * `dateFormat`: Custom date formats must follow the formats at [Datetime Patterns](../../sql-reference/data-types-datetime.md). * `encoding`: Encoding in multiLine mode is not supported. * `lineSep`: This parameter cannot be set to an empty string. * `quote`: This parameter cannot be set to an empty string. * `timestampFormat`: Custom date formats must follow the formats at [Datetime Patterns](../../sql-reference/data-types-datetime.md). * Reading an empty file is not supported.   The following options are not supported: `charToEscapeQuoteEscaping`, `columnNameOfCorruptRecord`, `comment`, `emptyValue`, `enableDateTimeParsingFallback`, `enforceSchema`, `escape`, `escapeQuotes`, `ignoreLeadingWhiteSpace`, `ignoreTrailingWhiteSpace`, `locale`, `maxCharsPerColumn`, `maxColumns`, `mode`, `nanValue`, `negativeInf`, `positiveInf`, `preferDate`, `quoteAll`, `samplingRatio`, `timestampNTZFormat`, `unescapedQuoteHandling`. |
| JSON | Save mode not supported for the following: `Append`, `Ignore`.  The followings are known limitations:   * `compression`: This parameter supports only the following values: GZIP, BZ2, BROTLI, ZSTD, DEFLATE, RAW_DEFLATE, NONE, UNCOMPRESSED. * `dateFormat`: Custom date formats must follow the formats at [Datetime Patterns](../../sql-reference/data-types-datetime.md). * `encoding`: Encoding in multiline mode is not supported. * `timestampFormat`: Custom date formats must follow the formats at [Datetime Patterns](../../sql-reference/data-types-datetime.md). * Difference in `show`: If the value of field is a string, it would be quoted. An extra `n` character would be shown in the result. * Array-of-struct field projection via dot notation is not supported * Reading a JSON file with Spark SQL is not supported. * MapType is not supported.   The following options are not supported: `allowBackslashEscapingAnyCharacter`, `allowComments`, `allowNonNumericNumbers`, `allowNumericLeadingZeros`, `allowSingleQuotes`, `allowUnquotedControlChars`, `allowUnquotedFieldNames`, `columnNameOfCorruptRecord`, `dropFieldIfAllNull`, `enableDateTimeParsingFallback`, `ignoreNullFields`, `lineSep`, `locale`, `mode`, `prefersDecimal`, `primitivesAsString`, `samplingRatio`, `timeZone`, `timestampNTZFormat`. |
| Orc | File type is not supported. |
| Parquet | Save mode is not supported for the following: `Append`, `Ignore`.  The followings are known limitations:   * `compression`: This parameter supports only the following values: GZIP, BZ2, BROTLI, ZSTD, DEFLATE, RAW_DEFLATE, NONE, UNCOMPRESSED. * Date formats must follow the formats at [Datetime Patterns](../../sql-reference/data-types-datetime.md). * MapType and IntervalType are not supported. * Configuration is not supported: (ALL).   The following options are not supported: `datetimeRebaseMode`, `int96RebaseMode`, `mergeSchema`. |
| Text | Save mode is not supported for the following: `Append`, `Ignore`.  The following are known limitations:   * `compression`: This parameter supports only the following values: GZIP, BZ2, BROTLI, ZSTD, DEFLATE, RAW_DEFLATE, NONE, UNCOMPRESSED. * The `lineSep` parameter is not supported in write. * Partitioned directory is not supported. |
| XML | Save mode is not supported for the following: `Append`, `Ignore`.  The followings are known limitations:   * Schema inference is not supported. A schema must be provided using `.schema()`. * Permissive mode is not supported. If input data does not match the user schema type and cannot be coerced, an error will be thrown. * `compression`: This parameter is not supported when `rowTag` is specified. Supports only the following values: GZIP, BZ2, BROTLI, ZSTD, DEFLATE, RAW_DEFLATE, NONE, UNCOMPRESSED. * MapType is not supported. * Reading a XML file with Spark SQL is not supported.   The following options are not supported: `arrayElementName`, `dateFormat`, `declaration`, `inferSchema`, `locale`, `modifiedBefore`, `recursiveFileLookup`, `rootTag`, `samplingRatio`, `timeZone`, `timestampFormat`, `timestampNTZFormat`, `validateName`, `wildcardColName`. |
| Snowflake table | Write to table doesn’t need a provider format.  Bucketing and partitioning are not supported.  Storage format and versioning are not supported. |

## Catalog

### Snowflake Horizon Catalog provider support

* Only Snowflake is supported as a catalog provider.

### Unsupported catalog APIs

* `registerFunction`
* `listFunctions`
* `getFunction`
* `functionExists`
* `createExternalTable`

### Partially supported catalog APIs

* `createTable` (no external table support)

## Iceberg

### Snowflake managed iceberg table

Snowpark Connect for Spark works with Apache Iceberg™ tables, including externally managed Iceberg tables and catalog-linked databases.

#### Read

Time travel is not supported, including historical snapshot, branch, and incremental read.

#### Write

* Using Spark SQL to create tables is not supported.
* Schema merge is not supported.
* To create the table, you must:

  + Create an external volume.
  + Link the external volume needs to the table creation in either of the following ways:

    - Set the EXTERNAL_VOLUME to the database.
    - Set `snowpark.connect.iceberg.external_volume` to Spark configuration.

### External managed Iceberg table

#### Read

* You must create a Snowflake unmanaged table entity.
* Time travel is not supported, including historical snapshot, branch, and incremental read.

#### Write

* Table creation is not supported.
* Writing to the existing Iceberg table is supported.

---
title: Snowpark Connect for Spark properties
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-parameters.md
section: Developer Guide
---

# Snowpark Connect for Spark properties

Snowpark Connect for Spark supports custom configuration in a way that’s similar to standard Spark. You can modify configuration properties only
through the session’s `set` method by using a key-value pair. Note that Snowpark Connect for Spark recognizes only a limited set of properties that
influence execution. Any unsupported properties are silently ignored without raising an exception.

## Supported Spark properties

Snowpark Connect for Spark supports a subset of Spark properties.

| Property Name | Default | Meaning | Since |
| --- | --- | --- | --- |
| `spark.app.name` | (none) | Application name set as Snowflake `query_tag` (`Spark-Connect-App-Name={name}`) for query tracking. | 1.0.0 |
| `spark.Catalog.databaseFilterInformationSchema` | `false` | When `true`, filters out `INFORMATION_SCHEMA` from database listings in catalog operations. | 1.0.0 |
| `spark.hadoop.fs.s3a.access.key` | (none) | AWS access key ID for S3 authentication when reading or writing to S3 locations. | 1.0.0 |
| `spark.hadoop.fs.s3a.assumed.role.arn` | (none) | AWS IAM role ARN with S3 access when using role-based authentication. | 1.0.0 |
| `spark.hadoop.fs.s3a.secret.key` | (none) | AWS secret access key for S3 authentication when reading or writing to S3 locations. | 1.0.0 |
| `spark.hadoop.fs.s3a.server-side-encryption.key` | (none) | AWS KMS key ID for server-side encryption when using the `AWS_SSE_KMS` encryption type. | 1.0.0 |
| `spark.hadoop.fs.s3a.session.token` | (none) | AWS session token for temporary S3 credentials when using STS. | 1.0.0 |
| `spark.sql.ansi.enabled` | `false` | Enables ANSI SQL mode for stricter type checking and error handling. When `true`, arithmetic overflows and invalid casts raise errors instead of returning `NULL`. | 1.0.0 |
| `spark.sql.caseSensitive` | `false` | Controls case sensitivity for identifiers. When `false`, column and table names are case-insensitive (auto-uppercased in Snowflake). | 1.0.0 |
| `spark.sql.crossJoin.enabled` | `true` | Enables or disables implicit cross joins. A `false` and missing or trivial join condition will result in an error. | 1.0.0 |
| `spark.sql.execution.pythonUDTF.arrow.enabled` | `false` | When `true`, enables Apache Arrow optimization for Python UDTF serialization/deserialization. | 1.0.0 |
| `spark.sql.globalTempDatabase` | `global_temp` | Schema name for global temporary views; created automatically if it does not exist. | 1.0.0 |
| `spark.sql.legacy.allowHashOnMapType` | `false` | When `true`, allows hashing MAP type columns. By default, MAP types cannot be hashed for consistency with Spark behavior. | 1.0.0 |
| `spark.sql.legacy.dataset.nameNonStructGroupingKeyAsValue` | `false` | Legacy behavior for dataset grouping key naming. | 1.6.0 |
| `spark.sql.mapKeyDedupPolicy` | `EXCEPTION` | Controls behavior when duplicate keys are found in map creation. Values: `EXCEPTION` (raise error) or `LAST_WIN` (keep last value). | 1.0.0 |
| `spark.sql.parser.quotedRegexColumnNames` | `false` | When `true`, enables regex pattern matching in quoted column names in SQL queries (e.g. `SELECT '(col1|col2)' FROM table`). | 1.0.0 |
| `spark.sql.parquet.outputTimestampType` | `TIMESTAMP_MILLIS` | Controls Parquet output timestamp type. Supports `TIMESTAMP_MILLIS` or `TIMESTAMP_MICROS`. | 1.7.0 |
| `spark.sql.pyspark.inferNestedDictAsStruct.enabled` | `false` | When `true`, infers nested Python dictionaries as `StructType` instead of `MapType` during DataFrame creation. | 1.0.0 |
| `spark.sql.pyspark.legacy.inferArrayTypeFromFirstElement.enabled` | `false` | When `true`, infers array element type from first element only instead of sampling all elements. | 1.0.0 |
| `spark.sql.repl.eagerEval.enabled` | `false` | When `true`, enables eager evaluation in REPL showing DataFrame results automatically without calling `show()`. | 1.0.0 |
| `spark.sql.repl.eagerEval.maxNumRows` | `20` | Maximum number of rows to display in REPL eager evaluation mode. | 1.0.0 |
| `spark.sql.repl.eagerEval.truncate` | `20` | Maximum width for column values in REPL eager evaluation display before truncation. | 1.0.0 |
| `spark.sql.session.localRelationCacheThreshold` | `2147483647` | Byte threshold for caching local relations. Relations larger than this are cached to improve performance. | 1.0.0 |
| `spark.sql.session.timeZone` | `<system_local_timezone>` | Session timezone used for timestamp operations. Synced with Snowflake session via `ALTER SESSION SET TIMEZONE`. | 1.0.0 |
| `spark.sql.sources.default` | `parquet` | Default data source format for read/write operations when format is not explicitly specified. | 1.0.0 |
| `spark.sql.timestampType` | `TIMESTAMP_LTZ` | Default timestamp type for timestamp operations. Values: `TIMESTAMP_LTZ` (with local timezone) or `TIMESTAMP_NTZ` (no timezone). | 1.0.0 |
| `spark.sql.tvf.allowMultipleTableArguments.enabled` | `true` | When `true`, allows table-valued functions to accept multiple table arguments. | 1.0.0 |

## Supported Snowpark Connect for Spark properties

Custom configuration properties specific to Snowpark Connect for Spark.

| Property Name | Default | Meaning | Since |
| --- | --- | --- | --- |
| `fs.azure.sas.<container>.<account>.blob.core.windows.net` | (none) | Azure SAS token for Blob Storage authentication. Used when reading or writing to Azure Blob Storage locations. | 1.0.0 |
| `fs.azure.sas.fixed.token.<account>.dfs.core.windows.net` | (none) | Azure SAS token for ADLS Gen2 (Data Lake Storage) authentication. Used when reading or writing to Azure Data Lake Storage Gen2 locations. | 1.0.0 |
| `mapreduce.fileoutputcommitter.marksuccessfuljobs` | `false` | When `true`, generates `_SUCCESS` file after successful write operations for compatibility with Hadoop/Spark workflows. | 1.0.0 |
| `parquet.enable.summary-metadata` | `false` | Alternative config for generating Parquet summary metadata files. Either this or `spark.sql.parquet.enable.summary-metadata` enables the feature. | 1.4.0 |
| `snowflake.repartition.for.writes` | `false` | When `true`, forces `DataFrame.repartition(n)` to split output into `n` files during writes. Matches Spark behavior but adds overhead. | 1.0.0 |
| `snowpark.connect.cte.optimization_enabled` | `false` | When `true`, enables Common Table Expression (CTE) optimization in Snowpark sessions for improved query performance. | 1.0.0 |
| `snowpark.connect.describe_cache_ttl_seconds` | `300` | Time-to-live in seconds for query cache entries. Reduces repeated schema lookups. | 1.0.0 |
| `snowpark.connect.enable_snowflake_extension_behavior` | `false` | When `true`, enables Snowflake-specific extensions that can differ from Spark behavior (such as hash on MAP types or MD5 return type). | 1.0.0 |
| `snowpark.connect.handleIntegralOverflow` | `false` | When `true`, integral overflow behavior is aligned with the Spark approach. | 1.7.0 |
| `snowpark.connect.iceberg.external_volume` | (none) | Snowflake external volume name for Iceberg table operations. | 1.0.0 |
| `snowpark.connect.integralTypesEmulation` | `client_default` | Controls conversion of decimal to integral types. Values: `client_default`, `enabled`, `disabled` | 1.7.0 |
| `snowpark.connect.scala.version` | `2.12` | Controls the Scala version used (supports `2.12` or `2.13`) | 1.7.0 |
| `snowpark.connect.sql.partition.external_table_location` | (none) | External table location path for partitioned writes. | 1.4.0 |
| `snowpark.connect.temporary.views.create_in_snowflake` | `false` | When `true`, creates temporary views directly in Snowflake instead of managing them locally. | 1.0.1 |
| `snowpark.connect.udf.imports [DEPRECATED 1.7.0]` | (none) | Comma-separated list of files or modules to import for UDF execution. Triggers UDF recreation when changed. | 1.0.0 |
| `snowpark.connect.udf.python.imports` | (none) | Comma-separated list of files/modules to import for python UDF execution. Triggers UDF recreation when changed. | 1.7.0 |
| `snowpark.connect.udf.java.imports` | (none) | Comma-separated list of files or modules to import for Java UDF execution. Triggers UDF recreation when changed. | 1.7.0 |
| `snowpark.connect.udf.packages` | (none) | Comma-separated list of Python packages to include when registering UDFs. | 1.0.0 |
| `snowpark.connect.udtf.compatibility_mode` | `false` | When `true`, enables Spark-compatible UDTF behavior for improved compatibility with Spark UDTF semantics. | 1.0.0 |
| `snowpark.connect.version` | `<current_version>` | Read-only. Returns the current Snowpark Connect for Spark version. | 1.0.0 |
| `snowpark.connect.views.duplicate_column_names_handling_mode` | `rename` | How to handle duplicate column names in views. Values: `rename` (add suffix) `fail` (raise error) or `drop` (remove duplicates). | 1.0.0 |
| `spark.sql.parquet.enable.summary-metadata` | `false` | When `true`, generates Parquet summary metadata files (`_metadata` `_common_metadata`) during Parquet writes. | 1.4.0 |
| `snowpark.connect.sql.emulatePartitionOverwritesForSnowflakeTables` | `false` | When `true`, allows partition overwrites on Snowflake tables in Spark SQL (`INSERT OVERWRITE <table> PARTITION(<partition spec>)`). | 1.12.3 |
| `snowpark.connect.artifact_repository` | (none) | Specifies the name of a Snowflake artifact repository for UDF/UDTF package resolution. When set, packages are resolved from the specified repository instead of Anaconda. | 1.14.0 |
| `snowpark.connect.udf.resource_constraint.architecture` | (none) | When set to `x86`, UDFs, UDTFs, and `applyInPandas` operations are created with an x86 architecture constraint. Requires a warehouse with an x86 resource constraint. | 1.13.0 |

### `fs.azure.sas.<container>.<account>.blob.core.windows.net`

Specifies the Azure SAS token for Blob Storage authentication. Used when reading or writing to Azure Blob Storage locations.

Default: (none)

Since: 1.0.0

### `fs.azure.sas.fixed.token.<account>.dfs.core.windows.net`

Specifies the Azure SAS token for ADLS Gen2 (Data Lake Storage) authentication. Used when reading or writing to Azure Data Lake Storage Gen2 locations.

Default: (none)

Since: 1.0.0

### `mapreduce.fileoutputcommitter.marksuccessfuljobs`

Specify `true` to generate `_SUCCESS` file after successful write operations for compatibility with Hadoop or Spark workflows.

Default: `false`

Since: 1.0.0

### `parquet.enable.summary-metadata`

Specifies the alternative configuration for generating Parquet summary metadata files. Enables that feature with this property or `spark.sql.parquet.enable.summary-metadata`.

Default: `false`

Since: 1.4.0

### `snowflake.repartition.for.writes`

Specify `true` to force `DataFrame.repartition(n)` to split output into `n` files during writes. Matches Spark behavior but adds overhead.

Default: `false`

Since: 1.0.0

### `snowpark.connect.cte.optimization_enabled`

Specify `true` to enable Common Table Expression (CTE) optimization in the Snowpark session for query performance.

Default: `false`

Since: 1.0.0

#### Comments

Configuration that enables [Snowflake Common Table Expressions (CTEs)](../../user-guide/queries-cte.md). This configuration optimizes the
Snowflake queries in which there are a lot of repetitive code blocks. This modification will lead to improvements in both query compilation and
execution performance.

### `snowpark.connect.describe_cache_ttl_seconds`

Specifies the time to live, in seconds, for query cache entries. Reduces repeated schema lookups.

Default: `300`

Since: 1.0.0

### `snowpark.connect.enable_snowflake_extension_behavior`

Specify `true` to enable Snowflake-specific extensions that can differ from Spark behavior (such as a hash on MAP types MD5 return type).

Default: `false`

Since: 1.0.0

#### Comments

When set to `true`, changes the behavior of certain operations:

> * `bit_get/getbit` — Explicit use of [Snowflake getbit function](../../sql-reference/functions/getbit.md)
> * `hash` — Explicit use of [Snowflake hash function](../../sql-reference/functions/hash.md)
> * `md5` — Explicit use of [Snowflake md5 function](../../sql-reference/functions/md5.md)
> * Renaming table columns — Allows for altering table columns

### `snowpark.connect.handleIntegralOverflow`

Specify `true` to align integral overflow behavior with the Spark approach.

Default: `false`

Since: 1.7.0

### `snowpark.connect.iceberg.external_volume`

Specifies the Snowflake external volume name for Iceberg table operations.

Default: (none)

Since: 1.0.0

### `snowpark.connect.integralTypesEmulation`

Specifies how to convert decimal to integral types. Values: `client_default`, `enabled`, `disabled`

Default: `client_default`

Since: 1.7.0

#### Comments

By default, Snowpark Connect for Spark treats all integral types as `Long` types. This is caused by the way [numbers are represented in Snowflake](../../sql-reference/data-types-numeric.md). Integral types emulation allows for an exact mapping between Snowpark and Spark types when reading from datasources.

The default option `client_default` activates the emulation only when the script is executed from the Scala client. Integral types are mapped based on the following precisions:

| Precision | Spark Type |
| --- | --- |
| 19 | `LongType` |
| 10 | `IntegerType` |
| 5 | `ShortType` |
| 3 | `ByteType` |
| Other | `DecimalType(precision, 0)` |

When other precisions are found, the final type is mapped to the `DecimalType`.

### `snowpark.connect.scala.version`

Specifies the Scala version to use (supports `2.12` or `2.13`).

Default: `2.12`

Since: 1.7.0

### `snowpark.connect.sql.partition.external_table_location`

Specifies the external table location path for partitioned writes.

Default: (none)

Since: 1.4.0

#### Comments

To read only an exact subset of partitioned files from the provided directory, additional configuration is required. This feature is only available for files stored on [external stages](../../sql-reference/sql/create-stage.md). To prune the read files, Snowpark Connect for Spark uses [external tables](../../sql-reference/sql/create-external-table.md).

This feature is enabled when the configuration `snowpark.connect.sql.partition.external_table_location` is set. It should contain existing database and schema names where external tables will be created.

Reading parquet files that are stored on external stages will create an external table; for files on internal stages, it will not be created. Providing the schema will reduce the execution time, eliminating the cost of inferring it from sources.

For best performance, filter according to the [Snowflake External Tables filtering limitations](../../user-guide/tables-external-intro.md).

##### Example

```python
spark.conf.set("snowpark.connect.sql.partition.external_table_location", "<database-name>.<schema-name>")

spark.read.parquet("@external-stage/example").filter(col("x") > lit(1)).show()

schema = StructType([StructField("x",IntegerType()),StructField("y",DoubleType())])

spark.read.schema(schema).parquet("@external-stage/example").filter(col("x") > lit(1)).show()
```

### `snowpark.connect.temporary.views.create_in_snowflake`

Specify `true` to create temporary views directly in Snowflake instead of managing them locally.

Default: `false`

Since: 1.0.1

### `snowpark.connect.udf.imports [DEPRECATED 1.7.0]`

Specifies a comma-separated list of files and modules to import for UDF execution. When this value is changed, it triggers UDF recreation.

Default: (none)

Since: 1.0.0

### `snowpark.connect.udf.python.imports`

Specifies a comma-separated list of files and modules to import for python UDF execution. When this value is changed, it triggers UDF recreation.

Default: (none)

Since: 1.7.0

### `snowpark.connect.udf.java.imports`

Specifies a comma-separated list of files and modules to import for Java UDF execution. Triggers UDF recreation when changed.

Default: (none)

Since: 1.7.0

#### Comments

This configuration works very similarly to the `snowpark.connect.udf.python.imports`. With it, you can specify external libraries and files for Java UDFs created using [registerJavaFunction](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.UDFRegistration.registerJavaFunction.html). Configurations are mutually exclusive to prevent unnecessary dependency mixing.

To include external libraries and files, you provide stage paths to the files as the value of the configuration setting `snowpark.connect.udf.java.imports`. The configuration value should be an array of stage paths to the files, where the paths are separated by commas.

##### Example

Code in the following example includes two files in the UDF’s execution context. The UDF imports functions from these files and uses them in its logic.

```python
# Files need to be previously staged

spark.conf.set("snowpark.connect.udf.java.imports", "[@stage/library.jar]")

spark.registerJavaFunction("javaFunction", "com.example.ExampleFunction")

spark.sql("SELECT javaFunction('arg')").show()
```

You can use the `snowpark.connect.udf.java.imports` setting to include other kinds of files as well, such as those with data your code needs to read. Note that when you do this, your code should only read from the included files; any writes to such files will be lost after the function’s execution ends.

### `snowpark.connect.udf.packages`

Specifies a comma-separated list of Python packages to include when registering UDFs.

Default: (none)

Since: 1.0.0

#### Comments

You can use this to define additional packages to be available in Python UDFs. The value is a comma-separated list of dependencies.

You can discover the list of supported packages by executing the following SQL in Snowflake:

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'python';
```

##### Example

```python
spark.conf.set("snowpark.connect.udf.packages", "[numpy]")

@udtf(returnType="val: int")

class Powers:

  def eval(self, x: int):
      import numpy as np

      for v in np.power(np.array([x, x, x]), [0, 1, 2]):
          yield (int(v),)

spark.udtf.register(name="powers", f=Powers)

spark.sql("SELECT * FROM powers(10)").show()
```

For more information, see [Python](../../sql-reference/sql/create-function.md).

### `snowpark.connect.udtf.compatibility_mode`

Specify `true` to enables Spark-compatible UDTF behavior for improved compatibility with Spark UDTF semantics.

Default: `false`

Since: 1.0.0

#### Comments

This property determines whether UDTFs should use Spark-compatible behavior or the default Snowpark behavior. When set to `true`, it applies a compatibility wrapper that mimics Spark’s output type coercion and error handling patterns.

When enabled, UDTFs use a compatibility wrapper that applies Spark-style automatic type coercion (e.g., string “true” to boolean, boolean to integer) and error handling. The wrapper also converts table arguments to Row-like objects for both positional and named access, and properly handles SQL null values to match Spark’s behavior patterns.

### `snowpark.connect.version`

Returns the current Snowpark Connect for Spark version. Read only.

Default: `<current_version>`

Since: 1.0.0

### `snowpark.connect.views.duplicate_column_names_handling_mode`

Specifies how to handle duplicate column names in views. Allowed values include `rename` (add suffix) `fail` (raise error) or `drop` (remove duplicates).

Default: `rename`

Since: 1.0.0

#### Comments

Snowflake does not support duplicate column names.

##### Example

The following code fails at the view creation step with the following SQL compilation error: “duplicate column name ‘foo’”.

```python
df = spark.createDataFrame([
(1, 1),
(2, 2)
], ["foo", "foo"])

df.show() # works

df.createTempView("df_view") # Fails with SQL compilation error: duplicate column name 'foo'
```

To work around this, set the `snowpark.connect.views.duplicate_column_names_handling_mode` configuration option to one of the following values:

* `rename`: A suffix such as `_dedup_1`, `_dedup_2`, and so on will be appended to all of the duplicate column names after the first one.
* `drop`: All of the duplicate columns except one will be dropped. If the columns have different values, this might lead to incorrect results.

### `snowpark.connect.udf.java.imports`

Specifies a comma-separated list of files and modules to import for Java UDF execution. Triggers UDF recreation when changed.

Default: (none)

Since: 1.7.0

#### Comments

This configuration works very similarly to the `snowpark.connect.udf.python.imports`. You can use it to specify external libraries and files for Java UDFs created using [registerJavaFunction](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.UDFRegistration.registerJavaFunction.html). Configurations are mutually exclusive to prevent unnecessary dependency mixing.

To include external libraries and files, you provide stage paths to the files as the value of the configuration setting `snowpark.connect.udf.java.imports`. The value is an array of stage paths to the files, where the paths are separated by commas.

##### Example

Code in the following example includes two files in the UDF’s execution context. The UDF imports functions from these files and uses them in its logic.

```python
# Files need to be previously staged

spark.conf.set("snowpark.connect.udf.java.imports", "[@stage/library.jar]")

spark.registerJavaFunction("javaFunction", "com.example.ExampleFunction")

spark.sql("SELECT javaFunction('arg')").show()
```

You can use the `snowpark.connect.udf.java.imports` setting to include other kinds of files as well, such as those with data your code needs to read. When you do this, your code should only read from the included files; any writes to such files will be lost after the function’s execution ends.

### `snowpark.connect.udf.packages`

Specifies a comma-separated list of Python packages to include when registering UDFs.

Default: (none)

Since: 1.0.0

#### Comments

Configuration allows for defining additional packages available in Python UDFs. The value is a comma separated list of dependencies.

You can discover the list of supported packages by executing the following SQL in Snowflake:

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'python';
```

##### Example

```python
spark.conf.set("snowpark.connect.udf.packages", "[numpy]")

@udtf(returnType="val: int")

class Powers:

  def eval(self, x: int):
      import numpy as np

      for v in np.power(np.array([x, x, x]), [0, 1, 2]):
          yield (int(v),)

spark.udtf.register(name="powers", f=Powers)

spark.sql("SELECT * FROM powers(10)").show()
```

Reference: [Packages Reference](../../sql-reference/sql/create-function.md)

### `snowpark.connect.udtf.compatibility_mode`

Specify `true` to enable Spark-compatible UDTF behavior for improved compatibility with Spark UDTF semantics.

Default: `false`

Since: 1.0.0

#### Comments

This configuration determines whether UDTFs should use Spark-compatible behavior or the default Snowpark behavior. When enabled (`true`), it applies a compatibility wrapper that mimics Spark’s output type coercion (for example, string “true” to boolean, boolean to integer) and error handling patterns.

The wrapper also converts table arguments to row-like objects for both positional and named access, and properly handles SQL null values to match Spark’s behavior patterns.

### `snowpark.connect.sql.emulatePartitionOverwritesForSnowflakeTables`

When `true`, allows partition overwrites on Snowflake tables in Spark SQL (`INSERT OVERWRITE <table> PARTITION(<partition spec>)`).

Default: `false`

Since: 1.12.3

#### Comments

Snowflake tables do not support user-defined partitioning, and by default, partition overwrites will result in an error. Enabling this option allows using `INSERT OVERWRITE <table> PARTITION(<partition spec>)` syntax to perform overwrites.

The `<partition spec>` will accept any columns that exist in the target table.

##### Example

Code in the following example overwrites all rows in the students table that have a student_id of 222222.

```python
spark.conf.set("snowpark.connect.sql.emulatePartitionOverwritesForSnowflakeTables", True)

# create the students and persons tables as standard Snowflake tables
students_data = [
  ("Ashua Hill", "456 Erica Ct, Cupertino", 111111),
  ("Brian Reed", "723 Kern Ave, Palo Alto", 222222)
]

students_df = spark.createDataFrame(students_data, ["name", "address", "student_id"])
students_df.write.mode("overwrite").saveAsTable("students")

persons_data = [
    ("Dora Williams", "134 Forest Ave, Menlo Park", 123456789),
    ("Eddie Davis", "245 Market St, Milpitas", 345678901)
]

persons_df = spark.createDataFrame(persons_data, ["name", "address", "ssn"])
persons_df.write.mode("overwrite").saveAsTable("persons")

# overwrites all rows in the students table that have a student_id of 222222
spark.sql("""
    INSERT OVERWRITE students PARTITION (student_id = 222222)
    SELECT name, address FROM persons WHERE name = 'Dora Williams'
""").collect()
```

### `snowpark.connect.artifact_repository`

Specifies the name of a Snowflake [artifact repository](../udf/python/udf-python-packages.md) to use for package resolution when registering UDFs, UDTFs, `applyInPandas`, `mapInArrow`, and `cogroup` operations. When set, packages specified via `snowpark.connect.udf.packages` are resolved from the specified artifact repository instead of Anaconda.

Default: (none)

Since: 1.14.0

#### Comments

By default, Snowpark Connect for Spark resolves Python packages from Snowflake’s curated Anaconda channel. Setting this configuration to an artifact repository name allows resolving packages from PyPI or other configured sources, enabling the use of packages that are not available in the Anaconda channel.

For information on how to create and configure an artifact repository in Snowflake, see [Using third-party packages](../udf/python/udf-python-packages.md).

Changing this configuration invalidates cached UDFs and UDTFs, causing them to be recreated with the new repository on next invocation.

This configuration applies to the following operations:

* UDFs registered via `@udf` decorator or `spark.udf.register()`
* UDTFs registered via `@udtf` decorator or `spark.udtf.register()`
* `applyInPandas` via `groupBy().applyInPandas()`
* `mapInArrow` via `DataFrame.mapInArrow()`
* `cogroup` via `groupBy().cogroup().applyInPandas()`

##### Example

The following example configures the artifact repository, then defines a UDF that uses `pykalman`, a package available from the artifact repository, to apply Kalman filter smoothing.

```python
spark.conf.set("snowpark.connect.artifact_repository", "my_pypi_repo")
spark.conf.set("snowpark.connect.udf.packages", "[pykalman]")

@udf(returnType=DoubleType())
def kalman_smooth_value(value: float) -> float:
    import numpy as np
    from pykalman import KalmanFilter

    kf = KalmanFilter(
        transition_matrices=[1],
        observation_matrices=[1],
        initial_state_mean=0,
        initial_state_covariance=1,
        observation_covariance=1,
        transition_covariance=0.1,
    )
    observations = np.array([value, value, value])
    smoothed_state_means, _ = kf.smooth(observations)
    return float(smoothed_state_means[-1][0])

df = spark.createDataFrame([(1, 10.0), (2, 20.0), (3, 30.0)], ["id", "value"])
df.select("id", kalman_smooth_value("value").alias("smoothed")).show()
```

For more information on artifact repositories and available packages, see [Using third-party packages](../udf/python/udf-python-packages.md).

### `snowpark.connect.udf.resource_constraint.architecture`

When set to `x86`, UDFs, UDTFs, and `applyInPandas` operations are created with an x86 architecture constraint. This requires a warehouse configured with an x86 resource constraint for execution.

Default: (none)

Since: 1.13.0

#### Comments

Some third-party Python packages (such as TensorFlow, XGBoost, and certain scientific libraries) are built only for the x86 CPU architecture. Setting this configuration to `x86` adds `RESOURCE_CONSTRAINT=(architecture='x86')` to the `CREATE FUNCTION` statement generated by Snowpark Connect for Spark, ensuring the UDF runs on x86-compatible infrastructure.

To use this configuration, you must execute your workload on a warehouse that has been created with an x86 resource constraint. The following resource constraint values support x86:

* `MEMORY_1X_x86` (minimum warehouse size: XSMALL)
* `MEMORY_16X_x86` (minimum warehouse size: MEDIUM)
* `MEMORY_64X_x86` (minimum warehouse size: LARGE)

If the warehouse does not have an x86 resource constraint, UDF execution will fail.

This configuration applies to the following operations:

* UDFs registered via `@udf` decorator or `spark.udf.register()`
* UDTFs registered via `@udtf` decorator or `spark.udtf.register()`
* `applyInPandas` via `groupBy().applyInPandas()`

##### Example

The following example creates a warehouse with an x86 resource constraint, then configures Snowpark Connect for Spark to use x86 architecture for UDFs.

```sqlexample
CREATE WAREHOUSE my_x86_warehouse WITH
  WAREHOUSE_SIZE = 'MEDIUM'
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  RESOURCE_CONSTRAINT = 'MEMORY_16X_x86';

USE WAREHOUSE my_x86_warehouse;
```

```python
spark.conf.set("snowpark.connect.udf.resource_constraint.architecture", "x86")

@udf(returnType=IntegerType())
def add_one(x: int) -> int:
    return x + 1

df = spark.createDataFrame([(1,), (2,), (3,)], ["value"])
df.select(add_one(df["value"]).alias("result")).show()
```

For more information on warehouses and resource constraints, see [Snowpark-optimized warehouses](../../user-guide/warehouses-snowpark-optimized.md).

---
title: Snowpark Submit examples
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-submit-examples.md
section: Developer Guide
---

# Snowpark Submit examples

This topic includes examples that use Snowpark Submit to submit production-ready Spark applications.

## Write and submit a simple Spark application

The following example shows how to write and submit a simple Spark application with no dependencies.

1. In your local IDE, create a new Python file called `app.py` with the following content:

   ```python
   from pyspark.sql import SparkSession
   from pyspark.sql.functions import col, lit, upper, concat

   # Create Spark session
   spark = SparkSession.builder.appName("SimpleSession").getOrCreate()

   # Create a DataFrame from inline data
   data = [
       (1, "alice", "engineering", 95000),
       (2, "bob", "marketing", 72000),
       (3, "carol", "engineering", 105000),
       (4, "david", "sales", 68000),
       (5, "eva", "engineering", 88000),
   ]
   df = spark.createDataFrame(data, ["id", "name", "department", "salary"])

   # Add a new column
   df_with_bonus = df.withColumn("bonus", col("salary") * 0.1)
   df_with_bonus.show()

   # Filter and transform
   engineers = df.filter(col("department") == "engineering") \
       .withColumn("name_upper", upper(col("name"))) \
       .withColumn("greeting", concat(lit("Hello, "), col("name")))
   engineers.show()

   # Aggregate
   df.groupBy("department").avg("salary").show()

   # Stop the Spark session
   spark.stop()
   ```
2. To submit the application, use the following command:

   ```bash
   snowpark-submit \
     --snowflake-workload-name MY_JOB \
     --snowflake-connection-name MY_CONNECTION \
     /path/to/app.py
   ```

   You can use the `--wait-for-completion` option to wait for the job to complete, the `--workload-status` option to check the status of the job, and the `--display-logs` option to display the logs of the job. For a complete list of options, see [Snowpark Submit reference](snowpark-submit-reference.md).

## Deploy an application from a Snowflake stage

If the application has dependencies, like files it needs to read, you can deploy them from a Snowflake stage. The following example shows how to deploy an application and its dependencies from a Snowflake stage.

1. To upload files to a stage from the terminal, you can use the Snowflake CLI. Note that SnowSQL is the legacy CLI and if you are already using it, you can use that as well to upload files to a stage. If you have not already installed the Snowflake CLI, you can install it by following the instructions in [Installing Snowflake CLI](../snowflake-cli/installation/installation.md).
2. Create a new CSV file in your local IDE called `sample_employees.csv` with the following content:

   ```text
   employee_id,name,department,salary,years_employed
   1,Alice Johnson,Engineering,95000,5
   2,Bob Smith,Marketing,72000,3
   3,Carol Williams,Engineering,105000,8
   4,David Brown,Sales,68000,2
   5,Eva Martinez,Engineering,88000,4
   6,Frank Wilson,Marketing,75000,6
   7,Grace Lee,Sales,92000,7
   8,Henry Taylor,Engineering,110000,10
   9,Ivy Chen,Marketing,65000,1
   10,Jack Davis,Sales,78000,4
   11,Karen White,Engineering,98000,6
   12,Leo Harris,Marketing,71000,3
   13,Maria Garcia,Sales,85000,5
   14,Nathan Clark,Engineering,102000,9
   15,Olivia Moore,Marketing,69000,2
   ```

   Upload your dependency files to a stage by using the following command, where `my_stage` is the name of a stage in your account. (If you do not have a stage created, you can use [`snow stage create`](/developer-guide/snowflake-cli/command-reference/stage-commands/create).)

   ```bash
   snow stage copy sample_employees.csv @<database>.<schema>.<stage>/sample_employees.csv -c MY_CONNECTION
   ```

   To verify that the file uploaded successfully, you can use the following command to list the files in the stage:

   ```bash
   snow sql -c MY_CONNECTION -q "ls @<database>.<schema>.<stage>"
   ```

   You should see the file `sample_employees.csv` in the list.
3. In your local IDE, create a new Python file called `app.py` with the following content:

   ```python
   from pyspark.sql import SparkSession

   # Create Spark session
   spark = SparkSession.builder.appName("SimpleStageExample").getOrCreate()

   # Load data from stage (adjust stage name to match yours)
   df = spark.read.csv("/app/<YOUR_STAGE>/sample_employees.csv", header=True, inferSchema=True)
   df.show()

   # Filter: Engineering department only
   engineers = df.filter(df["department"] == "Engineering")
   engineers.show()

   # Filter: Salary > 80000 and years_employed > 3
   senior_high_earners = df.filter((df["salary"] > 80000) & (df["years_employed"] > 3))
   senior_high_earners.show()

   # Aggregate: Average salary by department
   df.groupBy("department").avg("salary").show()

   # Select specific columns
   result = senior_high_earners.select("name", "department", "salary")
   result.show()

   # Stop the Spark session
   spark.stop()
   ```

   To submit the application which uses the files you uploaded to the stage, use the following command:

   ```bash
   snowpark-submit \
     --snowflake-connection-name MY_CONNECTION \
     --snowflake-workload-name MY_JOB \
     --snowflake-stage @<database>.<schema>.<stage> \
     /path/to/app.py
   ```

   Note that a compute pool is required to run the application and must be either specified in the `connections.toml` file or on the command line using the `--compute-pool` option. For more information, see [Snowpark Submit reference](snowpark-submit-reference.md).

## Monitor with wait and logs

The following example shows how to submit a job, wait for its completion, and then retrieve logs.

1. Submit the job and wait for completion by using the following command:

   ```bash
   snowpark-submit \
     --snowflake-workload-name MY_JOB \
     --wait-for-completion \
     --snowflake-connection-name MY_CONNECTION \
     /path/to/app.py
   ```
2. If the job fails, check the detailed logs by using the following command:

   ```bash
   snowpark-submit
     --snowflake-workload-name MY_JOB \
     --workload-status \
     --display-logs \
     --snowflake-connection-name MY_CONNECTION
   ```

## Use Snowpark Submit in an Apache Airflow DAG

You can submit a Spark job to Snowflake via Snowpark Connect for Spark. You can use **snowpark-submit** in cluster mode to leverage a
compute pool to run the job.

When you use Apache Airflow in this way, ensure that the Docker service or Snowpark Container Services container that runs Apache Airflow
has proper access to Snowflake and the required files in the Snowflake stage.

The code in the following example performs the following tasks:

* Creates a Python virtual environment at `/tmp/myenv`.

  In the `create_venv` task, the code uses `pip` to install the `snowpark-submit` package by using a `.whl` file.
* Generates a secure `connections.toml` file with Snowflake connection credentials and an OAuth token.

  In the `create_connections_toml` task, the code creates the `/app/.snowflake` directory, creates the `.toml` file,
  and then changes file permissions to allow only the owner (user) to have read and write access.
* Runs a Spark job by using the **snowpark-submit** command.

  In the `run_snowpark_script` task, the code does the following things:

  + Activates the virtual environment.
  + Runs the Spark job by using the **snowpark-submit** command.
  + Deploys to Snowflake by using cluster mode.
  + Uses the Snowpark Connect for Spark remote URI sc://localhost:15002.
  + Specifies the Spark application class `org.example.SnowparkConnectApp`.
  + Pulls the script from the @snowflake_stage stage.
  + Blocks deployment until the job finishes by using `--wait-for-completion`.

```python
import airflow
from airflow import DAG
from airflow.operators.bash import BashOperator
from datetime import datetime
from airflow.operators.trigger_dagrun import TriggerDagRunOperator

default_args = {
  'start_date': airflow.utils.dates.days_ago(1),
  'retries': 0,
}

with DAG(
  'run_sparkconnect_python_script',
  default_args=default_args,
  schedule_interval=None,
  catchup=False,
) as dag:

  create_venv = BashOperator(
      task_id='create_venv',
      bash_command="""
      python3 -m venv /tmp/myenv &&
      source /tmp/myenv/bin/activate &&
      export PIP_USER=false &&
      pip install --upgrade pip &&
      pip install --no-cache-dir grpcio-tools>=1.48.1 &&
      pip install /app/snowpark_submit-<version>.whl
      """
  )

  create_connections_toml = BashOperator(
      task_id='create_connections_toml',
      bash_command="""
      mkdir -p /app/.snowflake
      echo "${SNOWFLAKE_USER}"
      cat <<EOF > /app/.snowflake/connections.toml

[snowpark-submit]
host = "${SNOWFLAKE_HOST}"
port = "${SNOWFLAKE_PORT}"
protocol = "https"
account = "${SNOWFLAKE_ACCOUNT}"
authenticator = "oauth"
token = "$(cat /snowflake/session/token)"
warehouse = "airflow_wh"
database = "${SNOWFLAKE_DATABASE}"
schema = "${SNOWFLAKE_SCHEMA}"
client_session_keep_alive = true
EOF
  chmod 600 /app/.snowflake/connections.toml
  """
  )

  run_script = BashOperator(
      task_id='run_snowpark_script',
      bash_command="""
      set -e
      echo "Using SNOWFLAKE_HOME: $SNOWFLAKE_HOME"

      echo "Running Python script with Snowpark..."
      source /tmp/myenv/bin/activate &&
      snowpark-submit --deploy-mode cluster --class org.example.SnowparkConnectApp --compute-pool="snowparksubmit" --snowflake-workload-name="spcstest" --snowflake-stage="@AIRFLOW_APP_FILES" --wait-for-completion "@AIRFLOW_APP_FILES/transformation.py" --snowflake-connection-name snowpark-submit
      """,
      env={
          'SNOWFLAKE_HOME': '/app/.snowflake'
      }
  )

create_venv >> create_connections_toml >> run_script
```

You can monitor the DAG by using the Apache Airflow user interface’s Graph View or Tree View. Inspect the task logs for the following items:

* Environment setup
* Status of Snowpark Connect for Spark
* **snowpark-submit** job output

You can also monitor for jobs that ran in Snowflake from the logs stored in Snowflake stage or from event tables.

---
title: Snowpark Submit reference
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-submit-reference.md
section: Developer Guide
---

# Snowpark Submit reference

With Snowpark Submit, you can use familiar Spark semantics to run non-interactive, batch-oriented Spark workloads on Snowflake.

> **Note:**
>
> **snowpark-submit** supports much of the same functionality as **spark-submit**. However, some functionality has been
> omitted because it is not needed when running Spark workloads on Snowflake.

## Syntax

```none
snowpark-submit
  --name <application_name>
  --exclude-packages <package_to_exclude> [, <package_to_exclude>, ...]
  --py-files <files_to_place_on_path>
  --conf <spark_config_property=value> [<spark_config_property=value> ...]
  --properties-file <path_to_properties_file>
  --help, -h
  --verbose, -v
  --version
  --account <snowflake_account>
  --user <snowflake_user>
  --authenticator <snowflake_authenticator>
  --token-file-path <snowflake_token_file_path>
  --password <snowflake_password>
  --role <snowflake_role>
  --host <snowflake_host>
  --database <snowflake_database_name>
  --schema <snowflake_schema_name>
  --warehouse <snowflake_warehouse_name>
  --compute-pool <snowflake_compute_pool>
  --comment <comment>
  --snowflake-stage <snowflake_stage>
  --external-access-integrations <snowflake_external_access_integrations> [, ...]
  --snowflake-log-level <snowflake_log_level>
  --snowflake-workload-name <snowflake_workload_name>
  --snowflake-connection-name <snowflake_connection_name>
  --snowflake-grpc-max-message-size <message_size>
  --snowflake-grpc-max-metadata-size <metadata_size>
  --workload-status
  --display-logs
  --wait-for-completion
  --jars <jar_files> [, <jar_files>, ...]
  --scala-version <scala_version>
  <application.jar | application.py> [<application_arguments>]
```

## Arguments

`application.jar | application.py`
:   Path to a file containing the application and dependencies.

`[application arguments]`
:   Application-specific arguments passed to the application’s main method.

## Options

`--class CLASS_NAME`
:   Your application’s main class (for Java and Scala applications). This option is required if the main class is not specified in the application JAR.

`--conf [PROP=VALUEPROP=VALUE ...]`
:   Arbitrary Spark configuration property.

`--exclude-packages [EXCLUDE_PACKAGES ...]`
:   Comma-separated list of groupId:artifactId pairs, to exclude while resolving the dependencies provided in `--packages` to avoid
    dependency conflicts.

`--help, -h`
:   Show help message and exit.

`--jars JAR`
:   Comma-separated list of `.jar` files to include. This can include the workload JAR itself, if a class finder is not used.

> **Note:**
>
> The files are not automatically included in the classpath and need to be explicitly registered using `addArtifact`.

`--name NAME`
Name of your application.

`--properties-file FILE`
:   Path to a file from which to load extra properties. If not specified, this will look for conf/spark-defaults.conf.

`--py-files PY_FILES`
:   Comma-separated list of `.zip`, `.egg`, or `.py` files to place on the PYTHONPATH for Python apps.

`--verbose, -v`
:   Print additional debug output.

`--version`
:   Print the version of current Spark.

### Snowflake specific options

`--account SNOWFLAKE_ACCOUNT`
:   Snowflake account to use. Overrides the account in the `connections.toml` file if specified.

`--authenticator SNOWFLAKE_AUTHENTICATOR`
:   Authenticator for Snowflake login. Overrides the authenticator in the `connections.toml` file if specified. If not specified,
    defaults to user password authenticator.

`--comment COMMENT`
:   A message associated with the workload. Can be used to identify the workload in Snowflake.

`--compute-pool SNOWFLAKE_COMPUTE_POOL`
:   Snowflake compute pool for running the provided workload. Overrides the compute pool in the `connections.toml` file if specified.

`--database SNOWFLAKE_DATABASE_NAME`
:   Snowflake database to be used in the session. Overrides the database in the `connections.toml` file if specified.

`--display-logs`
:   Whether to print application logs to console when `--workload-status` is specified.

`--external-access-integrations [SNOWFLAKE_EXTERNAL_ACCESS_INTEGRATIONS ...]`
:   Snowflake external acccess integrations required by the workload.

`--host SNOWFLAKE_HOST`
:   Host for snowflake deployment. Overrides the host in the `connections.toml` file if specified.

`--password SNOWFLAKE_PASSWORD`
:   Password for the Snowflake user. Overrides the password in the `connections.toml` file if specified.

`--requirements-file REQUIREMENTS_FILE`
:   Path to a `requirements.txt` file containing Python package dependencies to install before running the workload. Requires
    external access integration for PyPI. This parameter will not function if you also specify the `--snowflake-stage` parameter.

`--role SNOWFLAKE_ROLE`
:   Snowflake role to use. Overrides the role in the `connections.toml` file if specified.

`--schema SNOWFLAKE_SCHEMA_NAME`
:   Snowflake schema to use in the session. Overrides the schema in the `connections.toml` file if specified.

`--snowflake-connection-name SNOWFLAKE_CONNECTION_NAME`
:   Name of the connection in `connections.toml` file to use as the base configuration. Command-line arguments override any
    values from the `connections.toml` file.

`--snowflake-grpc-max-message-size MESSAGE_SIZE`
:   Maximum message size, in bytes, for gRPC communication in Snowpark Submit.

`--snowflake-grpc-max-metadata-size METADATA_SIZE`
:   Maximum metadata size, in bytes, for gRPC communication in Snowpark Submit.

`--snowflake-log-level SNOWFLAKE_LOG_LEVEL`
:   Log level for Snowflake event table—`'INFO'`, `'ERROR'`, `'NONE'`. (Default: INFO).

`--snowflake-stage SNOWFLAKE_STAGE`
:   Snowflake stage where workload files are uploaded.

`--snowflake-workload-name SNOWFLAKE_WORKLOAD_NAME`
:   Name of the workload to be run in Snowflake.

`--token-file-path SNOWFLAKE_TOKEN_FILE_PATH`
:   Path to a file containing the OAuth token for Snowflake. Overrides the token file path in the `connections.toml` file if specified.

`--user SNOWFLAKE_USER`
:   Snowflake user to use. Overrides the user in the `connections.toml` file if specified.

`--wait-for-completion`
:   In cluster mode, when specified, run the workload in blocking mode and wait for completion.

`--warehouse SNOWFLAKE_WAREHOUSE_NAME`
:   Snowflake warehouse to use in the session. Overrides the warehouse in the `connections.toml` file if specified.

`--wheel-files WHEEL_FILES`
:   Comma-separated list of .whl files to install before running the Python workload. Used for private dependencies not available on PyPI.

`--workload-status`
:   Print the detailed status of the workload.

`--scala-version SCALA_VERSION`
:   Scala version to use. Can be `2.12` or `2.13`. The default value is `2.12`.

## Common option examples

### Application deployment

Snowflake’s Snowpark Container Services (SPCS) is the primary infrastructure for running your Spark applications. You need to have created
an SPCS compute pool in advance.

#### Basic Python application

To deploy a basic Python application in cluster mode:

```bash
snowpark-submit \
  --snowflake-workload-name MY_PYTHON_JOB \
  --snowflake-connection-name MY_CONNECTION_CONFIG_NAME
  app.py arg1 arg2
```

#### Basic Scala application

To deploy a basic Scala application in cluster mode, use a command such as the following:

```bash
snowpark-submit \
  --class com.example.MainClass \
  --snowflake-workload-name MY_SCALA_JOB \
  --snowflake-connection-name MY_CONNECTION_CONFIG_NAME
  app.jar arg1 arg2
```

You can omit the `--class` option if you specified the main class in the application JAR.

##### Specifying the Scala version

Scala 2.12 and 2.13 are supported, with the default version being 2.12. If the application is built with Scala 2.13, the Scala version must be specified in the CLI and the script.

```bash
snowpark-submit \
  --scala-version 2.13 \
  --snowflake-workload-name MY_SCALA_JOB \
  --snowflake-connection-name MY_CONNECTION_CONFIG_NAME
  app.jar arg1 arg2
```

You must specify the Scala version in the application code:

```scala
// Directly in the session builder
val spark = SparkSession.builder()
  .remote("sc://localhost:15002")
  .config("snowpark.connect.scala.version", "2.13")
  .getOrCreate()

// Via session configuration
spark.conf.set("snowpark.connect.scala.version", "2.13")
```

### Authentication

Snowpark Submit offers various methods for authenticating with Snowflake. You must use at least one method. Connection profile and
direct authentication can be used together or separately. The command-line option overrides corresponding fields in connection profile
when it is also present.

### Connection profile

To use a pre-configured Snowflake connection profile:

```bash
snowpark-submit \
  --snowflake-connection-name my_connection \
  --snowflake-workload-name MY_JOB \
  app.py
```

### Direct authentication

#### Username and password

To provide authentication details directly in the command:

```bash
snowpark-submit \
  --host myhost \
  --account myaccount \
  --user myuser \
  --password mypassword \
  --role myrole \
  --snowflake-workload-name MY_JOB \
  app.py
```

#### OAuth

To authenticate by using an OAuth token:

```bash
snowpark-submit \
  --host myhost \
  --account myaccount \
  --authenticator oauth \
  --token-file-path /path/to/token.txt \
  --snowflake-workload-name MY_JOB \
  --compute-pool MY_COMPUTE_POOL \
  app.py
```

### Snowflake resources

To specify the Snowflake database, schema, warehouse, and compute pool for your job:

```bash
snowpark-submit \
  --database MY_DB \
  --schema MY_SCHEMA \
  --warehouse MY_WH \
  --snowflake-workload-name MY_JOB \
  --snowflake-connection-name MY_CONNECTION \
  app.py
```

### Snowflake stages

You can use Snowpark Submit to store and access files directly on a Snowflake stage.

To submit a job using a file on a Snowflake stage:

```bash
snowpark-submit \
  --snowflake-stage @my_stage \
  --snowflake-workload-name MY_JOB \
  --snowflake-connection-name MY_CONNECTION \
  @my_stage/app.py
```

### Dependencies management

You can manage your application’s dependencies.

#### Python dependencies

To specify additional Python files or archives that are needed by your application:

```bash
snowpark-submit \
  --py-files dependencies.zip,module.py \
  --snowflake-workload-name MY_PYTHON_JOB \
  --snowflake-connection-name MY_CONNECTION \
  app.py
```

#### Java or Scala dependencies

To include external JAR files for Java or Scala applications:

```bash
snowpark-submit \
  --jars dep1.jar,dep2.jar \
  --snowflake-workload-name MY_SCALA_JOB \
  --snowflake-connection-name MY_CONNECTION \
  --compute-pool MY_COMPUTE_POOL \
  app.jar
```

When you add dependencies via the `--jars` option, you must explicitly register them using `addArtifact`.

```scala
spark.addArtifact("dep1.jar")
spark.addArtifact("dep2.jar")
```

It’s also possible to use staged JAR files. These files do not have to be specified in the `--jars` option.

```scala
spark.conf.set("snowpark.connect.udf.java.imports", "[@mystage/dep1.jar, @mystage/dep2.jar]")
```

### Monitoring and control

You can monitor and control your Snowpark Submit jobs effectively.

#### Waiting for job completion

By default, Snowpark Submit starts the job and returns immediately. To run in blocking mode and wait for the job to finish:

```bash
snowpark-submit \
  --snowflake-connection-name my_connection \
  --snowflake-workload-name MY_JOB \
  --wait-for-completion \
  app.py
```

The `wait-for-completion` flag causes the command to block until the job completes (either successfully or with failure), showing
periodic status updates. This is useful for workflows where you need to ensure a job completes before proceeding with other tasks,
such as when you use Apache Airflow.

#### Checking workload status

Check the status of a workload (running or completed).

```bash
snowpark-submit --snowflake-connection-name my_connection --snowflake-workload-name MY_JOB --workload-status
```

This command returns the following information about the workload:

* Current state (`DEPLOYING`, `RUNNING`, `SUCCEEDED`, `FAILED`)
* Start time and duration
* Service details

#### Viewing application logs

To view detailed logs along with the workload status:

```bash
snowpark-submit --snowflake-connection-name my_connection --snowflake-workload-name MY_JOB --workload-status --display-logs
```

The `display-logs` flag will fetch and print the application’s output logs to the console. Using these logs, you can perform the
following tasks:

* Debug application errors
* Monitor execution progress
* View application output

> **Note:**
>
> There is a small latency—from a few seconds to a minute—for logs to be ready for fetching. When an event table is not used to
> store log data, logs are retained for a short period of time, such as five minutes or less.

### Advanced configuration

Fine-tune your Snowpark Submit jobs with advanced configurations.

#### External access integration

Connect to external services from your Spark application:

```bash
snowpark-submit \
  --external-access-integrations "MY_NETWORK_RULE,MY_STORAGE_INTEGRATION" \
  --snowflake-workload-name MY_JOB \
  --snowflake-connection-name my_connection \
  app.py
```

#### Logging level configuration

Control the logging level for your application to the Snowflake event table:

```bash
snowpark-submit \
  --snowflake-log-level INFO \
  --snowflake-workload-name MY_JOB \
  --snowflake-connection-name MY_CONNECTION \
  app.py
```

Options for –snowflake-log-level: INFO, ERROR, NONE.

#### Adding job context

Add a descriptive comment for easier workload identification in Snowflake:

```bash
snowpark-submit \
  --comment "Daily data processing job" \
  --snowflake-workload-name MY_JOB \
  --snowflake-connection-name my_connection \
  app.py
```

---
title: SQL UDF limitations
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-limitations.md
section: Developer Guide
---

# SQL UDF limitations

This topic describes the limitations for handlers written in SQL.

## Argument and return type constraints are sometimes ignored

Certain type characteristics declared for an argument or return value will be ignored when the UDF is called. In these cases, the
received value may be used as received whether or not it conforms to constraints specified in the declaration.

The following are ignored for UDFs whose logic is written in SQL:

* Precision and scale for arguments and return values of type NUMBER
* Length for arguments and return values of type VARCHAR

### Example

Code in the following example declares that the `arg1` argument and the return value must be a VARCHAR no more than one character
long. However, calling this function with an `arg1` whose value is longer than one character will succeed as if the constraint were
not specified.

```sqlexample
CREATE OR REPLACE FUNCTION tf (arg1 VARCHAR(1))
RETURNS VARCHAR(1)
LANGUAGE SQL AS 'SHA2(a)';
```

## Dynamic SQL is not supported when referring to database objects

Referring to database objects using dynamic SQL will produce an error that includes text such as the following:

```output
Compilation of SQL UDF failed: SQL compilation error: syntax error... unexpected '<variable_name>'
```

If you need to construct dynamic SQL statements that use different database objects, consider writing a stored procedure instead.
You can write stored procedures in one of the following languages:

* [Java](../../stored-procedure/java/procedure-java-overview.md)
* [JavaScript](../../stored-procedure/stored-procedures-javascript.md)
* [Python](../../stored-procedure/python/procedure-python-overview.md)
* [Scala](../../stored-procedure/scala/procedure-scala-overview.md)
* [Snowflake Scripting](../../stored-procedure/stored-procedures-snowflake-scripting.md)

### Example

Code in the following example will fail because it uses the IDENTIFIER function to refer to a table whose name is dynamically specified
with the `table_name_parameter` variable.

```sqlexample
CREATE OR REPLACE FUNCTION profit2(table_name_parameter VARCHAR)
  RETURNS NUMERIC(11, 2)
  AS
  $$
    SELECT SUM((retail_price - wholesale_price) * number_sold)
        FROM IDENTIFIER(table_name_parameter)
  $$
  ;
```

---
title: Stored procedures overview
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-overview.md
section: Developer Guide
---

# Stored procedures overview

You can write stored procedures to extend the system with procedural code. With a procedure, you can use branching, looping, and other
programmatic constructs. You can reuse a procedure multiple times by calling it from other code.

With a stored procedure, you can:

* Automate tasks that require multiple database operations performed frequently.
* Dynamically create and execute database operations.
* Execute code with the privileges of the role that owns the procedure, rather than with the privileges of the role that runs the procedure.

  This allows the stored procedure owner to delegate the power to perform specified operations to users who otherwise could not do so.
  However, there are limitations on these owner’s rights stored procedures.

For example, imagine that you want to clean up a database by deleting data older than a specified date. You can execute the delete operation
multiple times in your code, each time deleting data from a specific table. You can put all of those statements in a single stored
procedure, then pass a parameter that specifies the cut-off date.

With the procedure deployed, you can call it to clean up the database. As your database changes, you can update the procedure to clean
up additional tables; if there are multiple users who use the new cleanup command, they can call one procedure, rather than remember
every table name and clean up each table individually.

A stored procedure is like a UDF, but the two differ in important ways. For more information, see
[Choosing whether to write a stored procedure or a user-defined function](../stored-procedures-vs-udfs.md).

A procedure is just one way to extend Snowflake. For others, see the following:

* [User-defined functions overview](../udf/udf-overview.md)
* [Writing external functions](../../sql-reference/external-functions.md)
* [Snowpark API](../snowpark/index.md)

## Supported languages and tools

You can create and manage stored procedures (and other Snowflake entities) by using any of multiple tools, depending on how you prefer to work.

| Language | Approach | Support |
| --- | --- | --- |
| **SQL**  With handler in Java, JavaScript, Python, Scala, or SQL Scripting | Write SQL code in Snowflake to create and manage Snowflake entities. Write the procedure’s logic in one of the supported handler languages. | [Java](java/procedure-java-overview.md)  [JavaScript](stored-procedures-javascript.md)  [Python](python/procedure-python-overview.md)  [Scala](scala/procedure-scala-overview.md)  [SQL Scripting](stored-procedures-snowflake-scripting.md) |
| **Java, Python, or Scala**  [Snowpark API](../snowpark/index.md) | On the client, write code for operations that are pushed to Snowflake for processing. | [Java](../snowpark/java/creating-sprocs.md)  [Python](../snowpark/python/creating-sprocs.md)  [Scala](../snowpark/scala/creating-sprocs.md) |
| **Command-line interface**  [Snowflake CLI](../snowflake-cli/index.md) | Use the command line to create and manage Snowflake entities, specifying properties as properties of JSON objects. | [Managing Snowflake objects](../snowflake-cli/objects/manage-objects.md) |
| **Python**  [Snowflake Python API](../snowflake-python-api/snowflake-python-overview.md) | On the client, write code that executes management operations on Snowflake. | [Managing stored procedures](../snowflake-python-api/snowflake-python-managing-functions-procedures.md) |
| **REST**  [Snowflake REST API](../snowflake-rest-api/snowflake-rest-api.md) | Make requests of RESTful endpoints to create and manage Snowflake entities. | [Manage procedures](../snowflake-rest-api/procedure/procedure-introduction.md) |

You write a procedure’s logic — its handler — in one of the supported languages. Once
you have a handler, you can [create a procedure](stored-procedures-creating.md) with a CREATE PROCEDURE command, then
[call the procedure](stored-procedures-calling.md) with a CALL statement.

From a stored procedure, you can return a single value or (where supported with the handler language) tabular data. For more information
about supported return types, see [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md).

When choosing a language, consider also the handler locations supported. Not all languages support referring to the handler on a stage
(the handler code must instead be in-line). For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).

| Language | Handler Location |
| --- | --- |
| Java | In-line or staged |
| JavaScript | In-line |
| Python | In-line or staged |
| Scala | In-line or staged |
| Snowflake Scripting | In-line |

## Temporary procedures

You can create a procedure that is discarded after you use it. You might find this useful when you don’t
need the procedure to be available in a durable way, such as for multiple sessions or to multiple users.

In addition, creating a procedure in one of the following ways doesn’t require the CREATE PROCEDURE privilege, so these approaches are more broadly available to users:

* Create a temporary stored procedure that persists for only the current session, then is dropped.

  The following Snowflake tools support creating a temporary procedure:

  + [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) with the TEMP or TEMPORARY parameter
  + The Snowpark API for [Java](../snowpark/java/creating-sprocs.md), [Python](../snowpark/python/creating-sprocs.md),
    or [Scala](../snowpark/scala/creating-sprocs.md)
* Create an anonymous procedure that you call immediately, and which is dropped immediately.

  + To create a procedure and immediately call it in a single SQL statement, use the [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md) syntax.

## Stored procedure example

Code in the following example creates a stored procedure called `myproc` with a Python handler called `run`.

```sqlexample-python
CREATE OR REPLACE PROCEDURE myproc(from_table STRING, to_table STRING, count INT)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  PACKAGES = ('snowflake-snowpark-python')
  HANDLER = 'run'
as
$$
def run(session, from_table, to_table, count):
  session.table(from_table).limit(count).write.save_as_table(to_table)
  return "SUCCESS"
$$;
```

Code in the following example calls the stored procedure `myproc`.

```sqlexample
CALL myproc('table_a', 'table_b', 5);
```

## Guidelines and constraints

Tips:
:   For tips on writing stored procedures, see [Working with stored procedures](stored-procedures-usage.md).

Snowflake constraints:
:   You can ensure stability within the Snowflake environment by developing within Snowflake constraints. For more information, see
    [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../udf-stored-procedure-constraints.md).

Naming:
:   Be sure to name procedures in a way that avoids collisions with other procedures. For more information, see
    [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

Arguments:
:   Specify the arguments for your stored procedure and indicate which arguments are optional. For more information, see
    [Defining arguments for UDFs and stored procedures](../udf-stored-procedure-arguments.md).

Data type mappings:
:   For each handler language, there’s a separate set of mappings between the language’s data types and the SQL types used for arguments and
    return values. For more about the mappings for each language, see [Data Type Mappings Between SQL and Handler Languages](../udf-stored-procedure-data-type-mapping.md).

## Handler writing

Handler languages:
:   For language-specific content on writing a handler, see Supported languages and tools.

External network access:
:   You can access external network locations with
    [external network access](../external-network-access/external-network-access-overview.md). You can create secure
    access to specific network locations external to Snowflake, then use that access from within the handler code.

Logging and tracing:
:   You can record code activity by [capturing log messages and trace events](../logging-tracing/logging-tracing-overview.md),
    storing the data in a database you can query later.

## Security

Whether you choose to have a stored procedure run with caller’s rights or owner’s rights can impact the information it has access to and
the tasks it may be allowed to perform. For more information, see [Understanding caller’s rights and owner’s rights stored procedures](stored-procedures-rights.md).

Stored procedures share certain security concerns with user-defined functions (UDFs). For more information, see the following:

* You can help a procedure’s handler code execute securely by following the best practices described in
  [Security Practices for UDFs and Procedures](../udf-stored-procedure-security-practices.md)
* Ensure that sensitive information is concealed from users who should not have access to it. For more information, see
  [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../secure-udf-procedure.md)

## Handler code deployment

When creating a procedure, you can specify its handler – which implements the procedure’s logic – as code in-line with the CREATE
PROCEDURE statement or as code external to the statement, such as compiled code packaged and copied to a stage.

For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).

---
title: Submitting a request to execute SQL statements
source: https://docs.snowflake.com/en/developer-guide/sql-api/submitting-requests.md
section: Developer Guide
---

# Submitting a request to execute SQL statements

This topic explains how to submit a request to the SQL API.

To submit SQL statements for execution, send a `POST` request to the `/api/v2/statements/` endpoint. See
[POST /api/v2/statements](reference.md) for details.

```none
POST /api/v2/statements HTTP/1.1
(request body)
```

## Setting up the request

In the request URL, you can set query parameters to:

* Specify a request ID that distinguishes this request from other requests.
* Execute the statement asynchronously.

> **Note:**
>
> Your code must be able to handle async query executions. It is not guaranteed that your query will always be executed synchronously if you don’t specify the explicit `async=true` property. For more information, see the [response workflow](handling-responses.md).

For the [body of the request](reference.md), set the following fields:

* Set the `statement` field to the SQL statement that you want to execute. For example:

  ```sqljson
  {
    "statement": "select * from my_table",
    ...
  }
  ```

  If you want to submit multiple statements in a single request, use a semicolon (`;`) between statements.
  See [Submitting multiple SQL statements in a single request](submitting-multiple-statements.md) for details.
* If you include bind variables (`?` placeholders) in the statement, set the `bindings` field to an object that specifies
  the corresponding Snowflake data types and values for each variable.

  For details, see Using bind variables in a statement.
* To specify the warehouse, database, schema, and role to use, set the `warehouse`, `database`, `schema`, and
  `role` fields.

  The values in these fields are case-sensitive and must match the case of the field returned by a SQL SHOW command.
  For example, suppose you create a database using the following SQL command:

  ```sqlexample
  CREATE OR REPLACE DATABASE Xpto;
  ```

  In this example, [the object identifier is not quoted, so it will be created with uppercase by default](../../sql-reference/identifiers-syntax.md).

  If the [SHOW DATABASES](../../sql-reference/sql/show-databases.md) command returns `XPTO` in uppercase for the name of the database, you must specify `XPTO` in uppercase for the field value.

  If you omit these fields, the SQL API uses the values of the corresponding properties for the user (i.e. the
  `DEFAULT_WAREHOUSE`, `DEFAULT_NAMESPACE`, and `DEFAULT_ROLE`
  [properties of the user](../../sql-reference/sql/alter-user.md)).
* To set a timeout for the statement execution, set the `timeout` field to the maximum number of seconds to wait.
  If the `timeout` field is not set, the timeout specified by the [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md) parameter is
  used.

## Example of a request

For example, the following `curl` command sends a SQL statement for execution. The example uses the file
`request-body.json` to specify the body of the request.

```bash
curl -i -X POST \
    -H "Content-Type: application/json" \
    -H "Authorization: Bearer <jwt>" \
    -H "Accept: application/json" \
    -H "User-Agent: myApplicationName/1.0" \
    -d "@request-body.json" \
    "https://<account_identifier>.snowflakecomputing.com/api/v2/statements"
```

where:

* `jwt` is the [JWT that you generated for authentication](authenticating.md).
* `myApplicationName` is an example of an identifier for your application.
* `account_identifier` is your [account identifier](../../user-guide/admin-account-identifier.md).

In this example, `request-body.json` contains the
[body of the request](reference.md):

```sqljson
{
  "statement": "select * from T where c1=?",
  "timeout": 60,
  "database": "TESTDB",
  "schema": "TESTSCHEMA",
  "warehouse": "TESTWH",
  "role": "TESTROLE",
  "bindings": {
    "1": {
      "type": "FIXED",
      "value": "123"
    }
  }
}
```

In the body of the request in the example above:

* The `statement` field specifies the SQL statement to execute.

  The statement includes a bind variable (the question mark in `"cl=?"`), which
  evaluates to the first binding (`"1"`) specified in the `bindings` field.
* The `timeout` field specifies that the server allows 60 seconds for the statement to be executed.
* The `database`, `schema`, `warehouse`, and `role` fields specify that the `TESTDB` database,
  `TESTSCHEMA` schema, `TESTWH` warehouse, and `TESTROLE` role should be used when executing the statement.

## Using bind variables in a statement

If you want to use bind variables (`?` placeholders) in the statement, use the `bindings` field to specify the values that
should be inserted.

Set this field to a JSON object that specifies the [Snowflake data type](../../sql-reference/intro-summary-data-types.md) and value
for each bind variable.

```sqljson
...
"statement": "select * from T where c1=?",
...
"bindings": {
  "1": {
    "type": "FIXED",
    "value": "123"
  }
},
...
```

Choose the binding type that corresponds to the type of the value that you are binding. For example, if the value is a
string representing a date (e.g. `2021-04-15`) and you want to insert the value into a DATE column, use the
`TEXT` binding type.

The following table specifies the values of the `type` field that you can use to bind to different
[Snowflake data types](../../sql-reference-data-types.md).

* The first column on the left specifies the binding types that you can use.
* The rest of the columns specify the Snowflake data type of the column where you plan to insert the data.
* Each cell specifies the type of value that you can use with a binding type to insert data into a column of a particular
  Snowflake data type.

  If the cell for a binding type and Snowflake data type is empty, you cannot use the specified binding type to insert data into
  a column of that Snowflake data type.

Binding types supported for different Snowflake data types

| Snowflake Data Types | INT / NUMBER | FLOAT | DECFLOAT | VARCHAR | BINARY | BOOLEAN | DATE | TIME | TIMESTAMP_TZ | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Binding Types |  |  |  |  |  |  |  |  |  |  |  |
| FIXED | integer | integer | integer | integer |  | 0 (false) / nonzero (true) |  |  |  |  |  |
| REAL | integer | int or float | int or float | int or float |  | 0/non-0 |  |  |  |  |  |
| DECFLOAT | integer | int or float | int or float (see notes below) | int or float |  | 0/non-0 |  |  |  |  |  |
| TEXT | integer | int or float | int or float (see notes below) | any text | hexdec | `"true"`/ `"false"` | see notes below | see notes below | see notes below | see notes below | see notes below |
| BINARY |  |  |  | hexdec |  |  |  |  |  |  |  |
| BOOLEAN |  |  |  | true/false, 0/1 |  | true/false |  |  |  |  |  |
| DATE |  |  |  | epoch (ms) |  |  | epoch (ms) |  | epoch (ms) | epoch (ms) | epoch (ms) |
| TIME |  |  |  | epoch (nano) |  |  |  | epoch (nano) |  |  |  |
| TIMESTAMP_TZ |  |  |  | epoch (nano) |  |  | epoch (nano) | epoch (nano) | epoch (nano) |  |  |
| TIMESTAMP_LTZ |  |  |  | epoch (nano) |  |  | epoch (nano) | epoch (nano) | epoch (nano) | epoch (nano) | epoch (nano) |
| TIMESTAMP_NTZ |  |  |  | epoch (nano) |  |  | epoch (nano) | epoch (nano) | epoch (nano) | epoch (nano) | epoch (nano) |

Note the following:

* The values of the bind variables must be strings (e.g. `"1.0"` for the value 1.0).
* When using the DECFLOAT or TEXT binding type, to insert data into a DECFLOAT column, you can specify the value in scientific notation (e.g. `"1.23e-40"`).
* When using the DATE binding type, specify the number of milliseconds since the epoch.
* When using the TIME or TIMESTAMP\* binding type, specify the number of nanoseconds since the epoch.
* When using the TIMESTAMP_TZ binding type, specify the number of nanoseconds since the epoch followed by a space and the
  timezone offset in minutes (e.g. `1616173619000000000 960`).
* When using the `TEXT` binding type:

  + To insert data into a DATE column, you can use any [date format](../../sql-reference/date-time-input-output.md) that is
    supported by AUTO detection.
  + To insert data into a TIME column, you can use any [time format](../../sql-reference/date-time-input-output.md) that is supported by AUTO
    detection.
  + To insert data into a TIMEZONE\* column, you can use any
    [date-time format](../../sql-reference/date-time-input-output.md) that is supported by AUTO detection.

If the value is in a format not supported by Snowflake, the API returns an error:

```sqljson
{
  code: "100037",
  message: "<bind type> value '<value>' is not recognized",
  sqlState: "22018",
  statementHandle: "<ID>"
}
```

> **Note:**
>
> Snowflake does not currently support variable binding in multi-statement SQL requests.

## Submitting concurrent requests

The Snowflake SQL API supports sending concurrent requests to the server. Concurrency limits on the API are determined by the concurrency limits enforced by Snowflake.

Depending on the current server load, you might receive an HTTP 429 error which indicates that the server is currently
receiving too many requests.

To ensure that your application correctly handles 429 errors, wrap concurrent requests within retry logic.

## Resubmitting a request to execute SQL statements

In some cases, it might not be clear if Snowflake executed the SQL statement in an API request (e.g. due to a network error or a
timeout). You might choose to resubmit the same request to Snowflake again, in case Snowflake did not execute the statement.

However, if Snowflake already executed the statement in the initial request and you resubmit the request again, the statement is
executed twice. For some types of requests, repeatedly executing the same statement can have unintended consequences (e.g.
inserting duplicate data into a table).

To prevent Snowflake from executing the same statement twice when you resubmit a request, you can use a request ID to distinguish
your request from other requests. If you specify the same request ID in the initial request along with the `retry=true` parameter in the resubmitted request, Snowflake does not execute the statement again if the statement has already been executed successfully.

To specify a request ID, generate a
[universally unique identifier (UUID)](https://en.wikipedia.org/wiki/Universally_unique_identifier). You can then include this identifier in the `requestId` query parameter. You must also specify the `retry=true` parameter as part of the request as shown in the following example.

```none
POST /api/v2/statements?requestId=ea7b46ed-bdc1-8c32-d593-764fcad64e83&retry=true HTTP/1.1
```

If Snowflake fails to process a request, you can submit the same request again with the same request ID. Using the same request ID
indicates to the server that you are submitting the same request again.

> **Note:**
>
> The `retry=true` parameter adds overhead to processing the SQL statement because Snowflake must scan and match a
> statement in the statement history. Use this parameter only when retrying the statement is required.

---
title: Submitting multiple SQL statements in a single request
source: https://docs.snowflake.com/en/developer-guide/sql-api/submitting-multiple-statements.md
section: Developer Guide
---

# Submitting multiple SQL statements in a single request

This topic explains how to submit a request containing multiple statements to the Snowflake SQL API.

> **Note:**
>
> Executing multiple statements in a single query requires that a valid warehouse is available in a session.

## Introduction

In some cases, you might need to specify multiple SQL statements in a request. For example, you might need to:

* Define an explicit transaction
* Set and use session variables in statements in a request
* Create and use temporary tables in statements in a request
* Change the database, schema, warehouse, or role for statements in a request

The following sections explain how to submit a request that contains multiple SQL statements.

* Specifying multiple SQL statements in the request
* Getting the results for each SQL statement in the request
* Handling errors when specifying multiple statements in a request

## Specifying multiple SQL statements in the request

To submit multiple SQL statements in a single request:

* In the `statement` field, use a semicolon (`;`) between each statement.
* In the `parameters` field, set the `MULTI_STATEMENT_COUNT` field to the number of SQL statements in the request.

For example:

```none
POST /api/v2/statements HTTP/1.1
Authorization: Bearer <jwt>
Content-Type: application/json
Accept: application/json
User-Agent: myApplication/1.0

{
  "statement": "alter session set QUERY_TAG='mytesttag'; select count(*) from mytable",
  ...
  "parameters": {
      "MULTI_STATEMENT_COUNT": "2"
  }
}
```

In this example `MULTI_STATEMENT_COUNT` is set to `2` which corresponds to the number of SQL statements being submitted.

To submit a variable number of SQL statements in the `statement` field, set `MULTI_STATEMENT_COUNT` to
`0`. This is useful in an application where the number of SQL statements submitted is not known at runtime.

If the value of `MULTI_STATEMENT_COUNT` does not match the number of SQL statements specified in the
`statement` field, the SQL API returns the following error:

```none
Actual statement count <actual_count> did not match the desired statement count <desired_count>.
```

Where

* `actual_count` is the number of statements specified in the `statement` field.
* `desired_count` is the value of `MULTI_STATEMENT_COUNT`.

If you specify multiple SQL statements in the `statement` field, but do not specify the
`MULTI_STATEMENT_COUNT` field, the SQL API returns the following error:

> ```none
> Actual statement count 3 did not match the desired statement count 1.
> ```

> **Note:**
>
> Snowflake does not currently support variable binding in multi-statement SQL requests.

## Getting the results for each SQL statement in the request

If a request that contains multiple SQL statements is processed successfully, the response does not include the data returned from
executing the individual statements. Instead, the response contains a `statementHandles` field that contains an array of the
handles for the individual statements.

> **Note:**
>
> The `statementHandles` field is different from the `statementHandle` field:
>
> * The `statementHandle` field specifies the handle for the set of SQL statements in the request.
> * The `statementHandles` field is an array of the handles of the individual SQL statements in the request.

For example, suppose that you send a request that specifies two SQL statements for execution:

```none
POST /api/v2/statements HTTP/1.1
Authorization: Bearer <jwt>
Content-Type: application/json
Accept: application/json
User-Agent: myApplication/1.0

{
  "statement": "select * from A; select * from B",
  ...
}
```

The response contains a `statementHandles` field that contains an array of the handles for the individual statements.

```none
HTTP/1.1 200 OK
...
{
  ...
  "statementHandles" : [ "019c9fce-0502-f1fc-0000-438300e02412", "019c9fce-0502-f1fc-0000-438300e02416" ],
  ...
}
```

To check the status and retrieve the data for the individual statements, send a `GET` request to the
`/api/v2/statements/` endpoint and append the handle for each statement to the URL path. See
[Checking the status of the statement execution and retrieving the data](handling-responses.md) for details.

```none
GET /api/v2/statements/019c9fce-0502-f1fc-0000-438300e02412
...
```

```none
GET /api/v2/statements/019c9fce-0502-f1fc-0000-438300e02416
...
```

## Handling errors when specifying multiple statements in a request

If you specified multiple SQL statements in the request and an error occurred when executing any of the statements, Snowflake
returns the HTTP response code 422 with a [QueryFailureStatus](reference.md) object.

You can get [details about the error](handling-errors.md) from this object.

For example, suppose that your request specifies the following statements in which the second INSERT statement contains an error:

```sqljson
{
  "statement": "create or replace table table1 (i int); insert into table1 (i) values (1); insert into table1 (i) values ('This is not a valid integer.'); insert into table1 (i) values (2); select i from table1 order by i",
  ...
}
```

Snowflake returns a response with the HTTP response code 422 and with a `QueryFailureStatus` object that contains the
details about the error:

```none
HTTP/1.1 422 Unprocessable Entity
Content-Type: application/json
...
{
  "code" : "100132",
  "message" : "JavaScript execution error: Uncaught Execution of multiple statements failed on statement \"insert into table1 (i) values ...\" (at line 1, position 75).\nDML operation to table TABLE1 failed on column I with error: Numeric value 'This is not a valid integer.' is not recognized in SYSTEM$MULTISTMT at '    throw `Execution of multiple statements failed on statement {0} (at line {1}, position {2}).`.replace('{1}', LINES[i])' position 4\nstackstrace: \nSYSTEM$MULTISTMT line: 10",
  "sqlState" : "P0000",
  "statementHandle" : "019d6e97-0502-317e-0000-096d0041f036"
}
```

In the example above, the INSERT statement with the error starts at the character position 75 in the value of the
`statement` field.

The statements before the statement with the error are executed successfully (the CREATE TABLE and first INSERT statement in this
example). The statements after the statement with the error are not executed.

---
title: Submitting Spark applications
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-submit.md
section: Developer Guide
---

# Submitting Spark applications

You can run Spark workloads in a non-interactive, asynchronous way directly on Snowflake’s infrastructure while you use familiar
Spark semantics. With Snowpark Submit, you can submit production-ready Spark applications—such as ETL pipelines and scheduled data
transformations—by using a simple CLI interface. In this way, you can maintain your existing Spark development workflows without a
dedicated Spark cluster.

For example, you can package your PySpark ETL script, then use the Snowpark Submit CLI to run the script as a batch job on a Snowpark Container Services container.
This method lets you automate nightly data pipelines with Apache Airflow or CI/CD tools. Your Spark code runs in cluster mode on Snowpark Container Services,
scaling seamlessly with built-in dependency and resource management.

For examples of Snowpark Submit in use, see [Snowpark Submit examples](snowpark-submit-examples.md).

Snowpark Submit runs Spark workloads on Snowflake by using Snowpark Connect for Spark. For more information about Snowpark Connect for Spark, see
[Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark](snowpark-connect-overview.md).

Snowpark Submit offers the following benefits:

* Ability to run in cluster mode on Snowflake-managed infrastructure with no external Spark setup
* Workflow integration, supporting automation through CI/CD pipelines, Apache Airflow, or cron-based scheduling
* Support for Python, enabling reuse of existing Spark applications across languages
* Dependency management, with support for packaging external Python modules or JARs

> **Note:**
>
> **snowpark-submit** supports much of the same functionality as **spark-submit**. However, some functionality has been
> omitted because it is not needed when running Spark workloads on Snowflake.

## Get started with Snowpark Submit

To get started using Snowpark Submit, follow these steps:

1. Install Snowpark Submit by following the steps in [Install Snowpark Submit](snowpark-submit-install.md).
2. Study the [Snowpark Submit examples](snowpark-submit-examples.md).
3. Get to know how to use Snowpark Submit with [Snowpark Submit reference](snowpark-submit-reference.md).

---
title: Tabular Java UDFs (UDTFs)
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-tabular-functions.md
section: Developer Guide
---

# Tabular Java UDFs (UDTFs)

This document explains how to write a UDTF (user-defined [table function](../../../sql-reference/functions-table.md)) in Java.

## Introduction

Your Java UDTF handler class processes rows received in the UDTF call and returns a tabular result. The received rows are partitioned,
either implicitly by Snowflake or explicitly in the syntax of the function call. You can use the methods you implement in the class to
process individual rows as well as the partitions into which they’re grouped.

Your handler class can process partitions and rows with the following:

* A zero-argument constructor as an initializer. You can use this to set up partition-scoped state.
* A `process` method for processing each row.
* A zero-argument `endPartition` method as a finalizer to complete partition processing, including returning a value scoped to the
  partition.

For more detail, see Java classes for UDTFs (in this topic).

Each Java UDTF also requires an *output row class*, which specifies the Java data types of the columns of the output row(s) that
are generated by the handler class. Details are included in The output row class (in this topic).

### Usage notes for partitioning

* When it receives rows that are implicitly partitioned by Snowflake, your handler code can make no assumptions about partitions. Running
  with implicit partitioning is most useful when the UDTF only needs to look at rows in isolation to produce its output and no state is
  aggregated across rows. In this case, the code probably does not need a constructor or an `endPartition` method.
* To improve performance, Snowflake usually executes multiple instances of the UDTF handler code in parallel. Each partition of rows
  is passed to a single instance of the UDTF.
* Although each partition is processed by only one UDTF instance, the converse is not necessarily true — a single UDTF instance can
  process multiple partitions sequentially. It is therefore important to use the initializer and finalizer to initialize and clean
  up for each partition to avoid carrying over accumulated values from the processing of one partition to the processing of another
  partition.

> **Note:**
>
> Tabular functions (UDTFs) have a limit of 500 input arguments and 500 output columns.

## Java classes for UDTFs

The primary components of the UDTF are the handler class and the output row class.

### The handler class

Snowflake interacts with the UDTF primarily by invoking the following methods of the handler class:

* The initializer (the constructor).
* The per-row method (`process`).
* The finalizer method (`endPartition`).

The handler class can contain additional methods needed to support these three methods.

The handler class also contains a method `getOutputClass`, which is described later.

Throwing an exception from any method in the handler class (or the output row class)
causes processing to stop. The query that called the UDTF fails with an error message.

#### The constructor

A handler class can have a constructor, which must take zero arguments.

The constructor is invoked once for each [partition](../udf-calling-sql.md) prior to any invocations of `process`.

The constructor cannot produce output rows.

Use the constructor to initialize state for the partition; this state can be used by the `process` and
`endPartition` methods. The constructor is also the appropriate place to put any long-running initialization that
needs to be done only once per partition rather than once per row.

The constructor is optional.

#### The `process` method

The `process` method is invoked once for each row in the input partition.

The arguments passed to the UDTF are passed to `process`. The values of the arguments are converted from SQL data types to
Java data types. (For information about mapping SQL and Java data types, see [SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).)

The parameter names of the `process` method can be any valid Java identifiers; the names do not need to match the names
specified in the `CREATE FUNCTION` statement.

Each time that `process` is called, it can return zero, one, or multiple rows.

The data type returned by the `process` method must be `Stream<OutputRow>`, where Stream is defined in
java.util.stream.Stream, and `OutputRow` is the name of the output row class. The example below shows a simple
`process` method that merely returns its input via a Stream:

```java
import java.util.stream.Stream;

...

public Stream<OutputRow> process(String v) {
  return Stream.of(new OutputRow(v));
}

...
```

If the `process` method does not keep or use any state in the object
(e.g. if the method is designed to just exclude selected input rows from
the output), you can declare the method `static`. If the
`process` method is `static` and the handler class does not
have a constructor or non-static `endPartition` method, Snowflake
passes each row directly to the static `process` method without
constructing an instance of the handler class.

If you need to skip an input row and process the next row (e.g. if you are
validating the input rows), return an empty `Stream` object. For
example, the `process` method below only returns the rows for
which `number` is a positive integer. If `number` is not positive,
the method returns an empty `Stream` object to skip the current
row and continue processing the next row.

```java
public Stream<OutputRow> process(int number) {
  if (inputNumber < 1) {
    return Stream.empty();
  }
  return Stream.of(new OutputRow(number));
}
```

If `process` returns a null Stream, then processing stops. (The `endPartition` method is still called even if
a null Stream is returned.)

This method is required.

#### The `endPartition` method

This optional method can be used to generate output rows that are based on any state information aggregated in `process`. This method is
invoked once for each [partition](../udf-calling-sql.md), after all rows in that partition have been
passed to `process`.

If you include this method, it is called on each partition, regardless of whether the data was partitioned explicitly or implicitly.
If the data is not partitioned meaningfully, the output of the finalizer might not be meaningful.

> **Note:**
>
> If the user does not partition the data explicitly, Snowflake partitions the data implicitly. For details, see:
> [partitions](../udf-calling-sql.md).

This method can output zero, one, or multiple rows.

> **Note:**
>
> While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
> processing to time out (such as when `endPartition` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
> timeout threshold adjusted for specific usage scenarios.

#### The `getOutputClass` method

This method returns information about the output row class. The output row class contains
information about the data types of the returned row.

### The output row class

Snowflake uses the output row class to help specify conversions between Java data types and SQL data types.

When a Java UDTF returns a row, the value in each column of the row must be converted from the
Java data type to the corresponding SQL data type. The SQL data types are specified in the `RETURNS` clause of the
`CREATE FUNCTION` statement. However, the mapping between Java and SQL data types is not 1-to-1, so Snowflake needs to know
the Java data type for each returned column. (For more information about mapping SQL and Java data types, see
[SQL-Java Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).)

A Java UDTF specifies the Java data types of the output columns by defining an output row class. Each row returned from the UDTF is
returned as an instance of the output row class. Each instance of the output row class contains one public field for each output
column. Snowflake reads the values of the public fields from each instance of the output row class, converts the Java values to SQL
values, and constructs a SQL output row containing those values.

The values in each instance of the output row class are set by calling the output row class’s constructor. The constructor
accepts parameters that correspond to the output columns and then sets the public fields to those parameters.

The code below defines a sample output row class:

```java
class OutputRow {

  public String name;
  public int id;

  public OutputRow(String pName, int pId) {
    this.name = pName;
    this.id = pId;
  }

}
```

The public variables specified by this class must match the columns specified in the `RETURNS TABLE (...)` clause of the
`CREATE FUNCTION` statement. For example, the `OutputRow` class above corresponds to the `RETURNS` clause below:

```sqlexample
CREATE FUNCTION F(...)
  RETURNS TABLE(NAME VARCHAR, ID INTEGER)
  ...
```

> **Important:**
>
> The matching between the SQL column names and the Java public field names in the output row class is case-insensitive.
> For example, in the Java and SQL code shown above, the Java field named `id` corresponds to the SQL column named `ID`.

The output row class is used as follows:

* The handler class uses the output row class to specify the return type of the `process` method and the
  `endPartition` method. The handler class also uses the output row class to construct returned values. For example:

  ```java
  public Stream<OutputRow> process(String v) {
    ...
    return Stream.of(new OutputRow(...));
  }

  public Stream<OutputRow> endPartition() {
    ...
    return Stream.of(new OutputRow(...));
  }
  ```
* The output row class is also used in the handler class’s `getOutputClass` method, which is a static method that
  Snowflake calls in order to learn the Java data types of the outputs:

  ```java
  public static Class getOutputClass() {
    return OutputRow.class;
  }
  ```

Throwing an exception from any method in the output row class (or the handler class) causes
processing to stop. The query that called the UDTF fails with an error message.

### Summary of requirements

The UDTF’s Java code must meet the following requirements:

* The code must define an output row class.
* The UDTF handler class must include a public method named `process` that returns a Stream of `<output_row_class>`, where
  Stream is defined in java.util.stream.Stream.
* The UDTF handler class must define a public static method named `getOutputClass`, which must return
  `<output_row_class>.class`.

If the Java code does not meet these requirements, then either creation or execution of the UDTF fails:

* If the session has an active warehouse at the time the CREATE FUNCTION statement executes, then Snowflake detects
  violations when the function is created.
* If the session does not have an active warehouse at the time the CREATE FUNCTION statement executes, then Snowflake detects
  violations when the function is called.

## Examples of calling Java UDTFs in queries

For general information about calling UDFs and UDTFs, see [Executing a UDF](../udf-calling-sql.md).

### Calling without explicit partitioning

This example shows how to create a UDTF. This example returns two copies of each input and returns one additional row for
each partition.

```sqlexample
create function return_two_copies(v varchar)
returns table(output_value varchar)
language java
handler='TestFunction'
target_path='@~/TestFunction.jar'
as
$$

  import java.util.stream.Stream;

  class OutputRow {

    public String output_value;

    public OutputRow(String outputValue) {
      this.output_value = outputValue;
    }

  }

  class TestFunction {

    String myString;

    public TestFunction()  {
      myString = "Created in constructor and output from endPartition()";
    }

    public static Class getOutputClass() {
      return OutputRow.class;
    }

    public Stream<OutputRow> process(String inputValue) {
      // Return two rows with the same value.
      return Stream.of(new OutputRow(inputValue), new OutputRow(inputValue));
    }

    public Stream<OutputRow> endPartition() {
      // Returns the value we initialized in the constructor.
      return Stream.of(new OutputRow(myString));
    }

  }

$$;
```

This example shows how to call a UDTF. To keep this example simple, the statement passes a literal value rather than a column,
and omits the `OVER()` clause.

```sqlexample
SELECT output_value
   FROM TABLE(return_two_copies('Input string'));
+-------------------------------------------------------+
| OUTPUT_VALUE                                          |
|-------------------------------------------------------|
| Input string                                          |
| Input string                                          |
| Created in constructor and output from endPartition() |
+-------------------------------------------------------+
```

This example calls the UDTF with values read from another table. Each time that the `process` method is called,
it is passed a value from the `city_name` column of the current row of the `cities_of_interest` table. As above, the UDTF is
called without an explicit `OVER()` clause.

Create a simple table to use as a source of inputs:

```sqlexample
CREATE TABLE cities_of_interest (city_name VARCHAR);
INSERT INTO cities_of_interest (city_name) VALUES
    ('Toronto'),
    ('Warsaw'),
    ('Kyoto');
```

Call the Java UDTF:

```sqlexample
SELECT city_name, output_value
   FROM cities_of_interest,
       TABLE(return_two_copies(city_name))
   ORDER BY city_name, output_value;
+-----------+-------------------------------------------------------+
| CITY_NAME | OUTPUT_VALUE                                          |
|-----------+-------------------------------------------------------|
| Kyoto     | Kyoto                                                 |
| Kyoto     | Kyoto                                                 |
| Toronto   | Toronto                                               |
| Toronto   | Toronto                                               |
| Warsaw    | Warsaw                                                |
| Warsaw    | Warsaw                                                |
| NULL      | Created in constructor and output from endPartition() |
+-----------+-------------------------------------------------------+
```

> **Attention:**
>
> In this example, the syntax used in the FROM clause is identical to the syntax of an inner join (i.e. `FROM t1, t2`);
> however, the operation performed is not a true inner join. The actual behavior is that the function is called with
> the values from each row in the table. In other words, given the following FROM clause:
>
> ```sqlexample
> FROM cities_of_interest, TABLE(f(city_name))
> ```
>
> the behavior would be equivalent to the following pseudocode:
>
> ```python
> for city_name in cities_of_interest:
>     output_row = f(city_name)
> ```

The [examples section in the documentation for JavaScript UDTFs](../javascript/udf-javascript-tabular-functions.md) contains more
complex examples of queries that call UDTFs with values from tables.

If the statement does not explicitly specify partitioning, then the Snowflake execution engine
uses [implicit partitioning](../udf-calling-sql.md).

If there is only one partition, then the `endPartition` method is called only once and the output of the query includes only
one row that contains the value `Created in constructor and output from endPartition()`. If the data is grouped into
different numbers of partitions during different executions of the statement, the `endPartition` method is called different
numbers of times, and the output contains different numbers of copies of this row.

For more information, see [implicit partitioning](../udf-calling-sql.md).

### Calling with explicit partitioning

Java UDTFs can also be called using explicit partitioning.

#### Multiple partitions

The following example uses the same UDTF and table created earlier. The example partitions the data by city_name.

```sqlexample
SELECT city_name, output_value
   FROM cities_of_interest,
       TABLE(return_two_copies(city_name) OVER (PARTITION BY city_name))
   ORDER BY city_name, output_value;
+-----------+-------------------------------------------------------+
| CITY_NAME | OUTPUT_VALUE                                          |
|-----------+-------------------------------------------------------|
| Kyoto     | Created in constructor and output from endPartition() |
| Kyoto     | Kyoto                                                 |
| Kyoto     | Kyoto                                                 |
| Toronto   | Created in constructor and output from endPartition() |
| Toronto   | Toronto                                               |
| Toronto   | Toronto                                               |
| Warsaw    | Created in constructor and output from endPartition() |
| Warsaw    | Warsaw                                                |
| Warsaw    | Warsaw                                                |
+-----------+-------------------------------------------------------+
```

#### Single partition

The following example uses the same UDTF and table created earlier and partitions the data by a constant, which forces
Snowflake to use only a single partition:

```sqlexample
SELECT city_name, output_value
   FROM cities_of_interest,
       TABLE(return_two_copies(city_name) OVER (PARTITION BY 1))
   ORDER BY city_name, output_value;
+-----------+-------------------------------------------------------+
| CITY_NAME | OUTPUT_VALUE                                          |
|-----------+-------------------------------------------------------|
| Kyoto     | Kyoto                                                 |
| Kyoto     | Kyoto                                                 |
| Toronto   | Toronto                                               |
| Toronto   | Toronto                                               |
| Warsaw    | Warsaw                                                |
| Warsaw    | Warsaw                                                |
| NULL      | Created in constructor and output from endPartition() |
+-----------+-------------------------------------------------------+
```

Note that only one copy of the message `Created in constructor and output from endPartition()` was included in the output,
which indicates that `endPartition` was called only once.

### Processing very large inputs (e.g. large files)

In some cases, a UDTF requires a very large amount of memory to process each input row. For example, a UDTF might read and
process a file that is too large to fit into memory.

To process large files in a UDF or UDTF, use the `SnowflakeFile` or `InputStream` class. For more information, see
[Process unstructured data with UDF and procedure handlers](../../../user-guide/unstructured-data-java.md).

---
title: Tabular JavaScript UDFs (UDTFs)
source: https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-tabular-functions.md
section: Developer Guide
---

# Tabular JavaScript UDFs (UDTFs)

You can write the handler for a user-defined [table function](../../../sql-reference/functions-table.md) (UDTF) in JavaScript.

Your handler code processes rows received in the UDTF call and returns a tabular result. The received rows are partitioned,
either implicitly by Snowflake or explicitly in the syntax of the function call. You use callback functions you write to
process individual rows as well as the partitions into which they’re grouped.

The JavaScript code must meet the following requirements for the UDTF to be valid:

* The code must define a single literal JavaScript object.
* The defined object must include a callback function named `processRow()`. For more information, see
  Object callback functions.

> **Important:**
>
> If the JavaScript code does not meet these requirements, the UDTF will still be created; however, it will fail when called in a query.

> **Note:**
>
> Tabular functions (UDTFs) have a limit of 500 input arguments and 500 output columns.

## Object callback functions

Through the JavaScript code, Snowflake interacts with the UDTF by invoking callback functions during the execution of the query. The
following skeleton outlines all available callback functions and their expected signatures:

```javascript
{
   processRow: function (row, rowWriter, context) {/*...*/},
   finalize: function (rowWriter, context) {/*...*/},
   initialize: function (argumentInfo, context) {/*...*/},
}
```

Note that only `processRow()` is required; the other functions are optional.

### `processRow()`

This callback function is invoked once for each row in the input relation. The arguments to `processRow()` are passed in
the `row` object. For each of the arguments defined in the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) statement used to
create the UDTF, there is a property on the `row` object with the same name in all uppercase. The value of this property
is the value of the argument for the current row. (The value is converted to a JavaScript value.)

The `rowWriter` argument is used by the user-supplied code to produce output rows. The `rowWriter` object defines a
single function, `writeRow()`. The `writeRow()` function takes one argument,
the *row object*, which is a single row in the output table represented as a JavaScript object. For each column defined in the RETURNS
clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command, a corresponding property can be defined on the row object. The value
of that property on the row object becomes the value for the corresponding column in the output relation. Any output columns without
a corresponding property on the row object will have the value NULL in the result table.

### `finalize()`

The `finalize()` callback function is invoked once, after all rows have been passed to `processRow()`. (If the data is
grouped into partitions, then `finalize()` is invoked once for each partition,
after all rows in that partition have been passed to `processRow()`.)

This callback function can be used to output any state that might have been aggregated in `processRow()` using the same row
`rowWriter` as is passed to `processRow()`.

> **Note:**
>
> While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
> processing to time out (such as when `finalize` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
> timeout threshold adjusted for specific usage scenarios.

### `initialize()`

This callback function is invoked once for each partition prior to any invocations of `processRow()`.

Use `initialize()` to set up any state needed during the result computation.

The `initialize()` function’s `argumentInfo` parameter contains metadata about the arguments to the user-defined
function. For example, if the UDF is defined as:

```sqlexample
CREATE FUNCTION f(argument_1 INTEGER, argument_2 VARCHAR) ...
```

then `argumentInfo` contains information about `argument_1` and `argument_2`.

`argumentInfo` has a property for each of those arguments. Each property is an object with the following values:

* `type`: String. The type of this argument.
* `isConst`: Boolean. If true, the value of this argument is constant (i.e. is the same for every row).
* `constValue`: If `isConst` (as defined above) is true, this entry contains the constant value of the argument; otherwise,
  this field is `undefined`.

The `initialize()` function cannot produce output rows.

### General usage notes for callback functions

* All three callback functions take a `context` object; this is reserved for future use and currently is empty.

  > **Caution:**
  >
  > Modifying the `context` object can yield undefined behavior.
* Additional functions and properties can be defined, as needed, on the object for use in the UDTF.
* The arguments to the callback functions are positional and can be named anything; however, for the purposes of this topic, the above
  names are used for the remaining discussion and examples.

## Partitions

In many situations, you might want to group rows into *partitions*. Partitioning has two main benefits:

* It allows you to group rows based on a common characteristic. This allows you to process all rows within the group together,
  and process each group independently.
* It allows Snowflake to divide up the workload to improve parallelization and thus performance.

For example, you might partition stock price data into one group per stock. All stock prices for an individual company can be
processed together, and the groups for different companies are processed independently.

The following statement calls the UDTF named `js_udtf()` on individual partitions. Each partition contains all rows for which
the `PARTITION BY` expression evaluates to the same value (e.g. the same stock symbol).

```sqlexample
SELECT * FROM tab1, TABLE(js_udtf(tab1.c1, tab1.c2) OVER (PARTITION BY <expression>)) ...;
```

When you specify a partition expression to use with a UDTF, Snowflake calls:

* `initialize()` once for each partition.
* `processRow()` once for each individual row in that partition.
* `finalize()` once for each partition (after processing the last row in that partition).

You might also want to process each partition’s rows in a specified order. For example, if you want to calculate the moving average
of a stock price over time, then order the stock prices by timestamp (as well as partitioning by stock or company). The following
example shows how to do this:

```sqlexample
SELECT * FROM tab1, TABLE(js_udtf(tab1.c1, tab1.c2) OVER (PARTITION BY <expression> ORDER BY <expression>)) ...;
```

When you specify an `ORDER BY` clause, the rows are processed in the order defined by the `ORDER BY` expression. Specifically,
the rows are passed to `processRow()` in the order defined by the `ORDER BY` expression.

In most cases, partitioning data almost automatically improves opportunities for parallelization and thus higher performance.
Snowflake usually executes several UDTF *instances* in parallel. (For this discussion, an instance of a JavaScript UDTF is defined
as one instance of the JavaScript object used to represent the function in Snowflake.) Each partition of rows is passed to a single
instance of the UDTF.

Note, however, that there is not necessarily a one-to-one relationship between partitions and UDTF instances. Although each
partition is processed by only one UDTF instance, the converse is not necessarily true — a single UDTF instance can process
multiple partitions. It is therefore important to use `initialize()` and `finalize()` to specifically set up and tear
down each partition, for example, to avoid “carrying over” accumulated values from the processing of one partition to the
processing of another partition.

### Result columns

When a table is joined to a table function, as in the partitioning examples above, the result set can contain the following, depending
on what is selected:

* The columns defined in the RETURNS clause of the [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command.
* The columns from the table, including both columns used to partition the data and other columns, whether or not they are used as input
  parameters to the UDTF.

Note that rows produced in the `processRow` callback and rows produced by `finalize` differ in the following ways:

* When a row is produced in `processRow`, Snowflake can correlate it to an input row, namely the one passed into the function
  as the `row` argument. Note that if a given `processRow` invocation produces more than one row, the input attributes are
  correlated with each output row.

  For rows produced in `processRow`, all input columns can be joined to the output relation.

In the `finalize` callback, Snowflake is unable to correlate it to any single row because there is no current row to correlate to.

* For rows produced in the `finalize` callback, only the columns used in the PARTITION BY clause are available (as these are the
  same for any row in the current partition); all other attributes are NULL. If no PARTITION BY clause is specified, all those attributes
  are NULL.

## Calling JavaScript UDTFs in queries

When calling a UDTF in the FROM clause of a query, specify the UDTF’s name and arguments inside the parentheses that follow the TABLE
keyword.

In other words, use a form such as the following for the TABLE keyword when calling a UDTF:

```sqlexample
SELECT ...
  FROM TABLE ( udtf_name (udtf_arguments) )
```

> **Note:**
>
> For more about calling UDFs and UDTFs, see [Executing a UDF](../udf-calling-sql.md).

### No partitioning

This simple example shows how to call a UDTF. This example passes literal values.
The UDTF merely returns the parameters in the reverse of the order in which they were passed.
This example does not use partitioning.

```javascript
SELECT * FROM TABLE(js_udtf(10.0::FLOAT, 20.0::FLOAT));
+----+----+
|  Y |  X |
|----+----|
| 20 | 10 |
+----+----+
```

This example calls a UDTF and passes it values from another table. In this example, the
UDTF named `js_udtf` is called once for each row in the table named `tab1`. Each time that the function is called,
it is passed values from columns `c1` and `c2` of the current row.
As above, the UDTF is called without a `PARTITION BY` clause.

```javascript
SELECT * FROM tab1, TABLE(js_udtf(tab1.c1, tab1.c2)) ;
```

When no partitioning is used, the Snowflake execution engine partitions the input itself according to multiple factors, such as the
size of the warehouse processing the function and the cardinality of the input relation. When running in this mode, the user code
can make no assumptions about partitions. This is most useful when the function only needs to look at rows in isolation to produce
its output and no state is aggregated across rows.

### Explicit partitioning

JavaScript UDTFs can also be called using a partition. For example:

```sqlexample
SELECT * FROM tab1, TABLE(js_udtf(tab1.c1, tab1.c2) OVER (PARTITION BY tab1.c3 ORDER BY tab1.c1));
```

### Explicit partitioning with an empty `OVER` clause

```sqlexample
SELECT * FROM tab1, TABLE(js_udtf(tab1.c1, tab1.c2) OVER ());
```

An empty `OVER` clause means that every row belongs to the same partition (i.e. the entire input relation is one partition).

> **Note:**
>
> You should exercise caution when calling a JavaScript UDTF with an empty `OVER` clause because this limits Snowflake to
> creating one instance of the function and, therefore, Snowflake is unable to parallelize the computation.

## Sample JavaScript UDTFs

This section contains several sample JavaScript UDTFs.

### Basic `Hello World` examples

The following JavaScript UDTF takes no parameters and always returns the same values. It is provided primarily for illustration purposes:

```javascript
CREATE OR REPLACE FUNCTION HelloWorld0()
    RETURNS TABLE (OUTPUT_COL VARCHAR)
    LANGUAGE JAVASCRIPT
    AS '{
        processRow: function f(row, rowWriter, context){
           rowWriter.writeRow({OUTPUT_COL: "Hello"});
           rowWriter.writeRow({OUTPUT_COL: "World"});
           }
        }';

SELECT output_col FROM TABLE(HelloWorld0());
```

Output:

```sqlexample
+------------+
| OUTPUT_COL |
+============+
| Hello      |
+------------+
| World      |
+------------+
```

The following JavaScript UDTF is also for illustration purposes, but uses an input parameter. Note that JavaScript is case-sensitive,
but SQL forces identifiers to uppercase, so when the JavaScript code references a SQL parameter name, the JavaScript code must use
uppercase.

Note also that function parameters are accessed through the parameter named `row` in the `get_params()` function:

```javascript
CREATE OR REPLACE FUNCTION HelloHuman(First_Name VARCHAR, Last_Name VARCHAR)
    RETURNS TABLE (V VARCHAR)
    LANGUAGE JAVASCRIPT
    AS '{
        processRow: function get_params(row, rowWriter, context){
           rowWriter.writeRow({V: "Hello"});
           rowWriter.writeRow({V: row.FIRST_NAME});  // Note the capitalization and the use of "row."!
           rowWriter.writeRow({V: row.LAST_NAME});   // Note the capitalization and the use of "row."!
           }
        }';

SELECT V AS Greeting FROM TABLE(HelloHuman('James', 'Kirk'));
```

Output:

```sqlexample
+------------+
|  GREETING  |
+============+
| Hello      |
+------------+
| James      |
+------------+
| Kirk       |
+------------+
```

### Basic examples illustrating the callback functions

The following JavaScript UDTF illustrates all the API callback functions and various output columns. It simply returns all rows as-is
and provides a count of the number of characters seen in each partition. It also illustrates how to share state across a partition
using a `THIS` reference. Note that the example uses an `initialize()` callback to initialize the counter to zero; this
is needed because a given function instance can be used to process multiple partitions:

```javascript
-- set up for the sample
CREATE TABLE parts (p FLOAT, s STRING);

INSERT INTO parts VALUES (1, 'michael'), (1, 'kelly'), (1, 'brian');
INSERT INTO parts VALUES (2, 'clara'), (2, 'maggie'), (2, 'reagan');

-- creation of the UDTF
CREATE OR REPLACE FUNCTION "CHAR_SUM"(INS STRING)
    RETURNS TABLE (NUM FLOAT)
    LANGUAGE JAVASCRIPT
    AS '{
    processRow: function (row, rowWriter, context) {
      this.ccount = this.ccount + 1;
      this.csum = this.csum + row.INS.length;
      rowWriter.writeRow({NUM: row.INS.length});
    },
    finalize: function (rowWriter, context) {
     rowWriter.writeRow({NUM: this.csum});
    },
    initialize: function(argumentInfo, context) {
     this.ccount = 0;
     this.csum = 0;
    }}';
```

The following query illustrates calling the `CHAR_SUM` UDTF on the `parts` table with no partitioning:

```sqlexample
SELECT * FROM parts, TABLE(char_sum(s));
```

Output:

```sqlexample
+--------+---------+-----+
| P      | S       | NUM |
+--------+---------+-----+
| 1      | michael | 7   |
| 1      | kelly   | 5   |
| 1      | brian   | 5   |
| 2      | clara   | 5   |
| 2      | maggie  | 6   |
| 2      | reagan  | 6   |
| [NULL] | [NULL]  | 34  |
+--------+---------+-----+
```

When no partitioning is specified, Snowflake automatically defines partitions. In this example, due to the small number of rows,
only one partition is created (i.e. only one invocation of `finalize()` is executed). Note that the final row has NULL values
in the input columns.

Same query, but with explicit partitioning:

```sqlexample
SELECT * FROM parts, TABLE(char_sum(s) OVER (PARTITION BY p));
```

Output:

```sqlexample
+--------+---------+-----+
| P      | S       | NUM |
+--------+---------+-----+
| 1      | michael | 7   |
| 1      | kelly   | 5   |
| 1      | brian   | 5   |
| 1      | [NULL]  | 17  |
| 2      | clara   | 5   |
| 2      | maggie  | 6   |
| 2      | reagan  | 6   |
| 2      | [NULL]  | 17  |
+--------+---------+-----+
```

This example partitions over the `p` column, yielding two partitions. For each partition, a single row is returned in the
`finalize()` callback, yielding a total of two rows, distinguished by the NULL value in the `s` column. Because
`p` is the PARTITION BY column, the rows created in `finalize()` have the value of `p` that defines the current partition.

### Extended examples using table values and other UDTFs as input

This basic UDTF converts a “range” of IP addresses to a complete list of IP addresses. The input consists of the first 3 segments
of the IP address (e.g. `'192.168.1'`) and then the start and end of the range used to generate the last segment (e.g. `42` and
`45`):

```javascript
CREATE OR REPLACE FUNCTION range_to_values(PREFIX VARCHAR, RANGE_START FLOAT, RANGE_END FLOAT)
    RETURNS TABLE (IP_ADDRESS VARCHAR)
    LANGUAGE JAVASCRIPT
    AS $$
      {
        processRow: function f(row, rowWriter, context)  {
          var suffix = row.RANGE_START;
          while (suffix <= row.RANGE_END)  {
            rowWriter.writeRow( {IP_ADDRESS: row.PREFIX + "." + suffix} );
            suffix = suffix + 1;
            }
          }
      }
      $$;

SELECT * FROM TABLE(range_to_values('192.168.1', 42::FLOAT, 45::FLOAT));
```

Output:

```sqlexample
+--------------+
| IP_ADDRESS   |
+==============+
| 192.168.1.42 |
+--------------+
| 192.168.1.43 |
+--------------+
| 192.168.1.44 |
+--------------+
| 192.168.1.45 |
+--------------+
```

Building on the previous example, you might want to calculate individual IP addresses for more than one range. This next statement
creates a table of ranges that can be used to expand to individual IP addresses. The query then inputs the rows from the table into
the `range_to_values()` UDTF to return the individual IP addresses:

```sqlexample
CREATE TABLE ip_address_ranges(prefix VARCHAR, range_start INTEGER, range_end INTEGER);
INSERT INTO ip_address_ranges (prefix, range_start, range_end) VALUES
    ('192.168.1', 42, 44),
    ('192.168.2', 10, 12),
    ('192.168.2', 40, 40)
    ;

SELECT rtv.ip_address
  FROM ip_address_ranges AS r, TABLE(range_to_values(r.prefix, r.range_start::FLOAT, r.range_end::FLOAT)) AS rtv;
```

Output:

```sqlexample
+--------------+
| IP_ADDRESS   |
+==============+
| 192.168.1.42 |
+--------------+
| 192.168.1.43 |
+--------------+
| 192.168.1.44 |
+--------------+
| 192.168.2.10 |
+--------------+
| 192.168.2.11 |
+--------------+
| 192.168.2.12 |
+--------------+
| 192.168.2.40 |
+--------------+
```

> **Attention:**
>
> In this example, the syntax used in the FROM clause is identical to the syntax of an inner join (i.e. `FROM t1, t2`); however,
> the operation performed is not a true inner join. The actual behavior is the `range_to_values()` function is called with the values
> from each row in the `ip_address changes` table. In other words, it would be equivalent to writing:
>
> > ```python
> > for input_row in ip_address_ranges:
> >   output_row = range_to_values(input_row.prefix, input_row.range_start, input_row.range_end)
> > ```

The concept of passing values to a UDTF can be extended to multiple UDTFs. The next example creates a UDTF named `fake_ipv4_to_ipv6()`
that “converts” IPV4 address to IPV6 addresses. The query then calls the function as part of a more complex statement involving another UDTF:

```javascript
-- Example UDTF that "converts" an IPV4 address to a range of IPV6 addresses.
-- (for illustration purposes only and is not intended for actual use)
CREATE OR REPLACE FUNCTION fake_ipv4_to_ipv6(ipv4 VARCHAR)
    RETURNS TABLE (IPV6 VARCHAR)
    LANGUAGE JAVASCRIPT
    AS $$
      {
        processRow: function f(row, rowWriter, context)  {
          rowWriter.writeRow( {IPV6: row.IPV4 + "." + "000.000.000.000"} );
          rowWriter.writeRow( {IPV6: row.IPV4 + "." + "..."} );
          rowWriter.writeRow( {IPV6: row.IPV4 + "." + "FFF.FFF.FFF.FFF"} );
          }
      }
      $$;

SELECT ipv6 FROM TABLE(fake_ipv4_to_ipv6('192.168.3.100'));
```

Output:

```sqlexample
+-------------------------------+
| IPV6                          |
+===============================+
| 192.168.3.100.000.000.000.000 |
+-------------------------------+
| 192.168.3.100....             |
+-------------------------------+
| 192.168.3.100.FFF.FFF.FFF.FFF |
+-------------------------------+
```

The following query uses the `fake_ipv4_to_ipv6` and `range_to_values()` UDTFs created earlier, with input from the
`ip_address changes` table. In other words, it starts with a set of IP address ranges, converts them to individual IPV4 addresses, and
then takes each IPV4 address and “converts” it to a range of IPV6 addresses:

```sqlexample
SELECT rtv6.ipv6
  FROM ip_address_ranges AS r,
       TABLE(range_to_values(r.prefix, r.range_start::FLOAT, r.range_end::FLOAT)) AS rtv,
       TABLE(fake_ipv4_to_ipv6(rtv.ip_address)) AS rtv6
  WHERE r.prefix = '192.168.2'  -- limits the output for this example
  ;
```

Output:

```sqlexample
+------------------------------+
| IPV6                         |
+==============================+
| 192.168.2.10.000.000.000.000 |
+------------------------------+
| 192.168.2.10....             |
+------------------------------+
| 192.168.2.10.FFF.FFF.FFF.FFF |
+------------------------------+
| 192.168.2.11.000.000.000.000 |
+------------------------------+
| 192.168.2.11....             |
+------------------------------+
| 192.168.2.11.FFF.FFF.FFF.FFF |
+------------------------------+
| 192.168.2.12.000.000.000.000 |
+------------------------------+
| 192.168.2.12....             |
+------------------------------+
| 192.168.2.12.FFF.FFF.FFF.FFF |
+------------------------------+
| 192.168.2.40.000.000.000.000 |
+------------------------------+
| 192.168.2.40....             |
+------------------------------+
| 192.168.2.40.FFF.FFF.FFF.FFF |
+------------------------------+
```

Note that this example used join syntax twice, but neither of the operations was a true join; both were calls to a UDTF using the
output of a table or another UDTF as input.

A true inner join is order-insensitive. For example, the following statements are identical:

`table1 INNER JOIN table2 ON ...`

`table2 INNER JOIN table1 ON ...`

Inputting values to a UDTF is not a true join, and the operations are not order-insensitive. For example, the
following query is identical to the previous example, except it reverses the order of the UDTFs in the FROM clause:

```sqlexample
SELECT rtv6.ipv6
  FROM ip_address_ranges AS r,
       TABLE(fake_ipv4_to_ipv6(rtv.ip_address)) AS rtv6,
       TABLE(range_to_values(r.prefix, r.range_start::FLOAT, r.range_end::FLOAT)) AS rtv
 WHERE r.prefix = '192.168.2'  -- limits the output for this example
  ;
```

The query fails with the following error message:

`SQL compilation error: error line 3 at position 35 invalid identifier 'RTV.IP_ADDRESS'`

The `rtv.ip_address` identifier is invalid because it was not defined before it was used. In a true join, this wouldn’t happen, but
when processing UDTFs using join syntax, this error might occur.

Next, try a statement that mixes inputting to a UDTF with a true join; however, remember that inputting to a UDTF and doing an inner join
both use the same syntax, which might be confusing:

```sqlexample
-- First, create a small table of IP address owners.
-- This table uses only IPv4 addresses for simplicity.
DROP TABLE ip_address_owners;
CREATE TABLE ip_address_owners (ip_address VARCHAR, owner_name VARCHAR);
INSERT INTO ip_address_owners (ip_address, owner_name) VALUES
  ('192.168.2.10', 'Barbara Hart'),
  ('192.168.2.11', 'David Saugus'),
  ('192.168.2.12', 'Diego King'),
  ('192.168.2.40', 'Victoria Valencia')
  ;

-- Now join the IP address owner table to the IPv4 addresses.
SELECT rtv.ip_address, ipo.owner_name
  FROM ip_address_ranges AS r,
       TABLE(range_to_values(r.prefix, r.range_start::FLOAT, r.range_end::FLOAT)) AS rtv,
       ip_address_owners AS ipo
 WHERE ipo.ip_address = rtv.ip_address AND
      r.prefix = '192.168.2'   -- limits the output for this example
  ;
```

Output:

```sqlexample
+--------------+-------------------+
| IP_ADDRESS   | OWNER_NAME        |
+==============+===================+
| 192.168.2.10 | Barbara Hart      |
+--------------+-------------------+
| 192.168.2.11 | David Saugus      |
+--------------+-------------------+
| 192.168.2.12 | Diego King        |
+--------------+-------------------+
| 192.168.2.40 | Victoria Valencia |
+--------------+-------------------+
```

> **Attention:**
>
> The preceding example works as described; however, you should take care when combining UDTFs with true joins because this might result in non-deterministic and/or unexpected behavior.
>
> Also, note that this behavior might change in the future.

---
title: Tabular SQL UDFs (UDTFs)
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-tabular-functions.md
section: Developer Guide
---

# Tabular SQL UDFs (UDTFs)

Snowflake supports SQL UDFs that return a set of rows, consisting of 0, 1, or multiple rows, each of which has 1 or more columns.
Such UDFs are called *tabular UDFs*, *table UDFs*, or, most frequently, *UDTFs* (user-defined table functions).

A UDTF can be accessed in the FROM clause of a query.

## Syntax

```sqlsyntax
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS TABLE ( <output_col_name> <output_col_type> [, <output_col_name> <output_col_type> ... ] )
  AS '<sql_expression>'
```

For a more detailed description of the general syntax for all UDFs, including SQL UDTFs, see [CREATE FUNCTION](../../../sql-reference/sql/create-function.md).

## Arguments

`name`:
:   This should be valid database object name that follows the rules described at:
    [Identifier requirements](../../../sql-reference/identifiers-syntax.md).

`arguments`:
:   This must be an expression, for example a column name, a literal, or an
    expression that can be evaluated to a single value.
    Typically, a function takes one argument, which is a column name.
    You can pass more than one value, for example, more than one column name,
    or a column name and one or more literal values.

    It is possible to pass a constant or no value at all. However, in most cases,
    if the input is the same every time, then the output is the same every time.

`RETURNS TABLE(...)`
:   Specifies that the UDF should return a table. Inside the parentheses, specify name-and-type pairs for columns (as described below) to
    include in the returned table.

    `output_col_name`:
    :   The name of an output column to include in the returned table. There must be at least one output column.

    `output_col_type`:
    :   The data type of the output column.

`sql_expression`:
:   A valid SQL expression or statement that returns a table with zero or more rows, each of which has one or more columns.
    The outputs must match the number and data types specified in the RETURNS clause.

## Usage notes

* The main body (aka “definition”) of a SQL UDTF must be a [SELECT](../../../sql-reference/sql/select.md) expression.
* Although the delimiters around the `sql_expression` are typically single quotes, you can use
  a pair of dollar signs `$$` as the delimiter. The closing delimiter must match the opening
  delimiter. A pair of dollar signs is convenient when the `sql_expression`
  contains single quotes. An example using a pair of dollar signs is included in
  the Examples section below.

  If the delimiter is a single quote, and the body contains a single quote, you can escape the single quote in the
  body by using the backslash character `\` as the escape character. An example is included in the Examples
  section below.
* The columns defined in the UDTF can appear anywhere that a normal table column can be used.
* The return types specified in the RETURNS clause determine the names and types of the columns in the tabular results and must match the
  types of the expressions in the corresponding positions of the SELECT statement in the function body.
* When calling a UDTF, you must include the UDTF name and arguments inside parentheses following the TABLE keyword. For more, see
  Calling a SQL UDTF.

> **Note:**
>
> Tabular functions (UDTFs) have a limit of 500 input arguments and 500 output columns.

## Calling a SQL UDTF

When calling a UDTF in the FROM clause of a query, specify the UDTF’s name and arguments inside the parentheses that follow the TABLE
keyword.

In other words, use a form such as the following for the TABLE keyword when calling a UDTF:

```sqlexample
SELECT ...
  FROM TABLE ( udtf_name (udtf_arguments) )
```

## Sample SQL UDTFs

### Basic examples

This is an artificially simple example of a UDTF, which hard-codes the output. This also illustrates the
use of `$$` as a delimiter:

```sqlexample
CREATE FUNCTION t()
    RETURNS TABLE(msg VARCHAR)
    AS
    $$
        SELECT 'Hello'
        UNION
        SELECT 'World'
    $$;
```

```sqlexample
SELECT msg
    FROM TABLE(t())
    ORDER BY msg;
+-------+
| MSG   |
|-------|
| Hello |
| World |
+-------+
```

This example is similar to the preceding example, but it uses single quotes as the delimiter, and uses the `\`
escape character to escape the single quotes in the body of the UDTF:

```sqlexample
CREATE FUNCTION t()
    RETURNS TABLE(msg VARCHAR)
    AS
    '
        SELECT \'Hello\'
        UNION
        SELECT \'World\'
    ';
```

```sqlexample
SELECT msg
    FROM TABLE(t())
    ORDER BY msg;
+-------+
| MSG   |
|-------|
| Hello |
| World |
+-------+
```

This is another basic example of a UDTF. It queries a table and returns two of the columns from that table:

```sqlexample
create or replace table orders (
    product_id varchar,
    quantity_sold numeric(11, 2)
    );

insert into orders (product_id, quantity_sold) values
    ('compostable bags', 2000),
    ('re-usable cups',  1000);
```

```sqlexample
create or replace function orders_for_product(PROD_ID varchar)
    returns table (Product_ID varchar, Quantity_Sold numeric(11, 2))
    as
    $$
        select product_ID, quantity_sold
            from orders
            where product_ID = PROD_ID
    $$
    ;
```

```sqlexample
select product_id, quantity_sold
    from table(orders_for_product('compostable bags'))
    order by product_id;
+------------------+---------------+
| PRODUCT_ID       | QUANTITY_SOLD |
|------------------+---------------|
| compostable bags |       2000.00 |
+------------------+---------------+
```

This same functionality can also be implemented using a view.

### Examples with joins

Create and use a SQL UDTF that returns country information (`COUNTRY_CODE` and `COUNTRY_NAME`) for a specified user ID:

```sqlexample
create or replace table countries (country_code char(2), country_name varchar);
insert into countries (country_code, country_name) values
    ('FR', 'FRANCE'),
    ('US', 'UNITED STATES'),
    ('ES', 'SPAIN');

create or replace table user_addresses (user_ID integer, country_code char(2));
insert into user_addresses (user_id, country_code) values
    (100, 'ES'),
    (123, 'FR'),
    (123, 'US');
```

```sqlexample
CREATE OR REPLACE FUNCTION get_countries_for_user ( id number )
  RETURNS TABLE (country_code char, country_name varchar)
  AS 'select distinct c.country_code, c.country_name
      from user_addresses a, countries c
      where a.user_id = id
      and c.country_code = a.country_code';
```

```sqlexample
select *
    from table(get_countries_for_user(123)) cc
    where cc.country_code in ('US','FR','CA')
    order by country_code;
+--------------+---------------+
| COUNTRY_CODE | COUNTRY_NAME  |
|--------------+---------------|
| FR           | FRANCE        |
| US           | UNITED STATES |
+--------------+---------------+
```

Create a SQL UDTF that returns the favorite color for a specified year:

```sqlexample
create or replace table favorite_years as
    select 2016 year
    UNION ALL
    select 2017
    UNION ALL
    select 2018
    UNION ALL
    select 2019;

 create or replace table colors as
    select 2017 year, 'red' color, true favorite
    UNION ALL
    select 2017 year, 'orange' color, true favorite
    UNION ALL
    select 2017 year, 'green' color, false favorite
    UNION ALL
    select 2018 year, 'blue' color, true favorite
    UNION ALL
    select 2018 year, 'violet' color, true favorite
    UNION ALL
    select 2018 year, 'brown' color, false favorite;

create or replace table fashion as
    select 2017 year, 'red' fashion_color
    UNION ALL
    select 2018 year, 'black' fashion_color
    UNION ALL
    select 2019 year, 'orange' fashion_color;
```

```sqlexample
create or replace function favorite_colors(the_year int)
    returns table(color string) as
    'select color from colors where year=the_year and favorite=true';
```

Use the UDTF in a query:

```sqlexample
select color
    from table(favorite_colors(2017))
    order by color;
+--------+
| COLOR  |
|--------|
| orange |
| red    |
+--------+
```

Use the UDTF in a join with another table; note that the join column from the table is passed as an argument to the function.

```sqlexample
select *
    from favorite_years y join table(favorite_colors(y.year)) c
    order by year, color;
+------+--------+
| YEAR | COLOR  |
|------+--------|
| 2017 | orange |
| 2017 | red    |
| 2018 | blue   |
| 2018 | violet |
+------+--------+
```

Use a WHERE clause, rather than ON, for additional predicates:

```sqlexample
select *
    from fashion f join table(favorite_colors(f.year)) fav
    where fav.color = f.fashion_color ;
+------+---------------+-------+
| YEAR | FASHION_COLOR | COLOR |
|------+---------------+-------|
| 2017 | red           | red   |
+------+---------------+-------+
```

Use the UDTF with a constant in a join expression; note that a WHERE clause, rather than ON, must be used for additional join conditions:

```sqlexample
select fav.color as favorite_2017, f.*
    from fashion f JOIN table(favorite_colors(2017)) fav
    where fav.color = f.fashion_color
    order by year;
+---------------+------+---------------+
| FAVORITE_2017 | YEAR | FASHION_COLOR |
|---------------+------+---------------|
| red           | 2017 | red           |
| orange        | 2019 | orange        |
+---------------+------+---------------+
```

---
title: Trace events for functions and procedures
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing.md
section: Developer Guide
---

# Trace events for functions and procedures

You can emit trace events from the handler code for a procedure, UDF, or UDTF, including those you write
[using Snowpark APIs](../snowpark/index.md). For a list of supported handler languages, see
Supported languages.

> **Note:**
>
> Before you can collect trace event data, you must [enable telemetry data collection](logging-tracing-enabling.md).
> When you instrument your code, Snowflake generates the data and collects it in an event table.

Trace events are a type of telemetry data (like log messages) that can capture when something has happened in the system or the
application. Unlike log messages, trace events have a structured payload, which makes them a good choice for data analysis. For example,
you can use trace events to capture some numbers that are calculated during the execution of your function, and analyze these numbers
afterwards.

In a procedure or UDF, you can associate attributes (key-value pairs) that should be captured as part of the trace events. For example,
if you want to capture the names and values of parameters in a trace event, you can add a trace event named `parameters` and set the
names and values of the parameters as attributes of the event.

When a procedure or function executes successfully, Snowflake emits the trace events that were added. Snowflake makes these trace events
available in the active event table associated with the account. For an explanation of event tables, see
[Event table overview](event-table-setting-up.md).

You can [access trace event data](tracing-accessing-events.md) for analysis in the following ways:

* Execute a SELECT command on the event table.
* View trace entries in Snowsight.

## Trace example

Python code in the following example sets a `example.proc.do_tracing` attribute on the span with a value of `begin`. It also
emits within the span an `event_with_attributes` event with `example.key1` and `example.key2` attributes.

```sqlexample-python
CREATE OR REPLACE PROCEDURE do_tracing()
RETURNS VARIANT
LANGUAGE PYTHON
PACKAGES=('snowflake-snowpark-python')
RUNTIME_VERSION = 3.12
HANDLER='run'
AS $$
from snowflake import telemetry
def run(session):
  telemetry.set_span_attribute("example.proc.do_tracing", "begin")
  telemetry.add_event("event_with_attributes", {"example.key1": "value1", "example.key2": "value2"})
  return "SUCCESS"
$$;
```

## Getting started

To get started with event traces from handler code, follow these high-level steps:

1. [Set up an event table.](event-table-setting-up.md)

   Snowflake uses your event table to store event data emitted by your handler code. An event table has
   columns [predefined by Snowflake](event-table-columns.md).
2. Get acquainted with the event trace API for the handler language you’ll be using.

   see Supported languages for a list of handler languages, then view
   content about how to emit trace events from your language.
3. Add event trace code to your handler.
4. Learn how to [retrieve event trace data](tracing-accessing-events.md) from the event table.

## Level for trace events

You can manage the verbosity of trace event data stored in the event table by setting the trace level. Before tracing, use this setting
to ensure that you’re capturing the log message severity. If you find that event data isn’t being written to the table, check the trace
level to ensure that Snowflake is capturing the data you want.

For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Supported languages

You can trace events from code written in the following languages, including when handler code is written with
[Snowpark APIs](../snowpark/index.md).

| Language / Type | Java | Python | JavaScript | Scala | Snowflake Scripting |
| --- | --- | --- | --- | --- | --- |
| Stored procedure handler | ✔ | ✔ | ✔ | ✔ | ✔ |
| Streamlit app |  | ✔ |  |  |  |
| UDF handler (scalar function) | ✔ | ✔ | ✔ | ✔ |  |
| UDTF handler (table function) | ✔ | ✔ | ✔ | ✔ \* |  |

\*:
:   Scala UDTF handler written in Snowpark.

### Event tracing from handler code

To trace events, you can use a Snowflake-provided library designed for the handler code you’re using. Snowflake intercepts trace events and
stores them in the event table you create.

The following table lists handler languages supported for logging, along with links to content on logging from code.

| Language | Telemetry Library | Documentation |
| --- | --- | --- |
| Java | Snowflake `Telemetry` class. | [Emitting trace events in Java](tracing-java.md) |
| JavaScript | Snowflake JavaScript API. | [Emitting trace events in JavaScript](tracing-javascript.md) |
| Python | Snowflake `telemetry` package. | [Emitting trace events in Python](tracing-python.md) |
| Scala | Snowflake `Telemetry` class. | [Emitting trace events in Scala](tracing-scala.md) |
| Snowflake Scripting | Snowflake SQL functions. | [Emitting trace events in Snowflake Scripting](tracing-snowflake-scripting.md) |

### SQL statement tracing

By default when [tracing is enabled](logging-tracing-enabling.md), Snowflake traces SQL statements
executed in conjunction with other traced code, such as within the handler for a stored procedure or user-defined function.

By default, Snowflake traces SQL in the following contexts:

* SQL executed within a stored procedure
* SQL that executes a stored procedure
* SQL that executes one or more user defined functions
* SQL executed by DBT
* SQL executed by Streamlit
* SQL executed by a Notebook
* SQL executed in Snowpark Container Services when the code context is a Python or Go connector

Note that the following are not supported:

* SQL statements in a Snowflake Native App
* Direct execution of SQL in worksheets or workspaces

For a traced SQL statement, you can find emitted data in the event table, including in the following columns:

* In the [RESOURCE_ATTRIBUTES column](event-table-columns.md), the `snow.executable.type` property
  value is `QUERY`.
* In the [RECORD column](event-table-columns.md), the `name` property value is the type of SQL statement whose
  execution was traced, such as SELECT, CALL, or INSERT.
* In the [RECORD_ATTRIBUTES column](event-table-columns.md), the following properties contain values related to
  SQL tracing:

  + `db.query.table.names`
  + `db.query.view.names`
  + `db.query.executable.names`
  + `db.query.text` (if enabled)

You can specify whether the SQL text itself (up to 1024 characters) should be included among trace data captured in an event table. You
might want to omit the SQL text if it can contain sensitive information or if it would not be useful.

* To capture SQL text when tracing, set the [SQL_TRACE_QUERY_TEXT](../../sql-reference/parameters.md) parameter to `"ON"` (you must use the ACCOUNTADMIN
  role to set this parameter).

### General guidelines for adding trace events

When calling the trace event APIs to add trace events and set span attributes, note the following:

* A span can hold a maximum number of 128 trace events and a maximum number of 128 span attributes.
* If you add a trace event that has the same name as an event that you added earlier, a new event record is created.
* If you set a span attribute that has the same key as a span attribute that you set earlier, the value for that key is overwritten.

## Viewing collected event data

You can view trace data either through Snowsight or by querying the event table in which trace data is stored. For more information,
see [Viewing trace data](tracing-accessing-events.md).

---
title: Troubleshooting external network access
source: https://docs.snowflake.com/en/developer-guide/external-network-access/external-network-access-troubleshooting.md
section: Developer Guide
---

# Troubleshooting external network access

The following lists errors you might encounter, along with their likely cause and a suggested resolution.

## Error: UnknownHostException or Temporary Failure in Name Resolution

This error can have one of multiple causes, as described below.

Cause:
:   The [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) or [CREATE FUNCTION](../../sql-reference/sql/create-function.md) statement does not reference the
    external access integration when creating a procedure or UDF that accesses a URL specified by the network rule.

Resolution:
:   When creating the procedure or UDF, be sure to specify the external access integration as a value of the
    CREATE statement’s EXTERNAL_ACCESS_INTEGRATIONS parameter.

Cause:
:   The UDF or procedure is attempting to access a URL that is not part of the network rule included in the external access integration.

Resolution:
:   A user with the ACCOUNTADMIN role can add the host URL to the network rule included in the external access integration.

## Error: Connect Timed Out or Receive Timed out

Cause:
:   You are attempting to access an IP address or port that is not part of the network rule included in the external access integration.

Resolution:
:   A user with the ACCOUNTADMIN role can add the IP address to the network rule included in the external access integration.

---
title: Troubleshooting Git in Snowflake
source: https://docs.snowflake.com/en/developer-guide/git/git-troubleshooting.md
section: Developer Guide
---

# Troubleshooting Git in Snowflake

Use the tips described in this topic to resolve issues when using a Git repository in Snowflake.

## Error message: “Failed to access the Git repository. Operation ‘clone’ is not authorized”

You might see this message for one of multiple reasons, but it’s typically due to a misconfiguration in Snowflake integration with the
remote Git repository. To eliminate common misconfiguration issues, confirm the following:

* You’re using correct credentials for authenticating with the remote Git repository, such as a correct username-and-password combination or
  correct personal access token.

  For more on authenticating from Snowflake, see [Setting up Snowflake to use Git](git-setting-up.md).
* You’ve correctly configured the Git repository URL, including the allowed prefixes in the API configuration.

  Read more about specifying an allowed prefix and origin URL in [Setting up Snowflake to use Git](git-setting-up.md).
* You aren’t experiencing a connectivity issue, such as when the repository is in a private network.

  Access to a remote Git repository from Snowflake is allowed only over a public network. See [Git in Snowflake limitations](git-limitations.md)
  for more.

If you continue to have this issue after verifying that your configuration is correct, try the following:

* If you’re using a fine-grained token for authorization (not the Classic token), confirm that you’ve set the proper permissions on the
  token. For read-only access, setting the “Content” to “read-only” should be enough.

  For information about managing a personal access token in GitHub, see
  [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens)
  in the GitHub documentation.
* Outside of Snowflake, clone the repository with the command-line Git client using the same URL and TOKEN values that are resulting in the
  error in Snowflake.

  This should generate more verbose output, including messages indicating what the issue might be. For example, cloning might fail from the
  command line because SSO authorization is required for the operation, and this authorization was not available for the fine-grained token.
  Switching to a Classic token might resolve this issue.

## Error message: “Processing aborted due to error” when using the `SHOW GIT BRANCHES` or `SHOW GIT TAGS` commands

You might see this message if you used Git from Snowflake during an early preview of the feature. An optimization in reading from a remote
Git repository, added in a later release, might be complicating access to remote repositories for which you configured access in that early preview.

To ensure that you’re benefitting from the optimization — and to stop receiving this error — re-create your Git repository clone using
[REPLACE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md).

## Error message: “Private endpoint corresponding to service name xxx does not exist.”

You might see this message if you didn’t create a Private Endpoint for the domain (service) that you’re trying to reach.

Ensure that you’ve provisioned a Private Endpoint in Snowflake and approved it on the cloud provider side. For more information, see
[Configure the private link connection](git-setting-up.md).

## Error message: “Failed to perform operation ‘clone’. SSL problems when connect to Git server”

You might see this message when there’s a problem with an HTTPS certificate. For example, the domain’s certificate is not signed by a
certificate authority or it does not contain the Git server domain in the chain.

## Error message: “Failed to connect to the Git Repository via Private Link. Please check your network configuration and ensure Private Link traffic is correctly routed.”

You might see this message when HTTPS traffic was not routed properly to the Git server.

Ensure that you’re routing traffic correctly in your cloud service provider. For more information, see
[Configuring Git Integration with Snowflake over Private Link](https://community.snowflake.com/s/article/Configuring-Git-Integration-with-Snowflake-over-Private-Link).

---
title: Troubleshooting Java UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/java/udf-java-troubleshooting.md
section: Developer Guide
---

# Troubleshooting Java UDFs

This topic provides troubleshooting information about Java UDFs (user-defined functions).

## Troubleshooting

### Tips

If using a Java UDF in a [masking policy](../../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF, and
masking policy match.

---
title: Troubleshooting JavaScript UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-troubleshooting.md
section: Developer Guide
---

# Troubleshooting JavaScript UDFs

This topic provides information about troubleshooting JavaScript UDFs.

## Tips

* JavaScript is case sensitive, but SQL forces names to upper case. This can affect UDF input parameter names, for example.
  JavaScript code should reference input parameter names by using all upper case.
* If using a JavaScript UDF in a [masking policy](../../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF,
  and masking policy match.

## Troubleshooting

### Error Message: `Variable is not defined`

Cause:
:   If you see this error message when running commands in SnowSQL, the cause might be an ampersand (`&`) inside a
    [CREATE FUNCTION](../../../sql-reference/sql/create-function.md) command. (The ampersand is the SnowSQL variable substitution character.) For
    example, executing the following in SnowSQL causes this error:

    ```javascript
    create function mask_bits(...)
        ...
        as
        $$
        var masked = (x & y);
        ...
        $$;
    ```

    The error occurs when the function is created, not when the function is called.

Solution:
:   If you do not intend to use variable substitution in SnowSQL, you can explicitly disable variable substitution by executing the
    following command:

    ```sqlexample
    !set variable_substitution=false;
    ```

    For more information about variable substitution, see [Using variables](../../../user-guide/snowsql-use.md).

---
title: Troubleshooting Python UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-troubleshooting.md
section: Developer Guide
---

# Troubleshooting Python UDFs

This topic provides troubleshooting information about Python UDFs (user-defined functions).

## Troubleshooting

### Problem: A required Python library is not available through Anaconda

Third-party Python libraries, which do not have C/C++ extensions, can be imported by UDFs directly via Snowflake stages.
For more information, see [Creating a Python UDF with code uploaded from a stage](udf-python-creating.md).

To learn how to submit a request to support additional Anaconda packages, see [Using third-party packages](udf-python-packages.md).

### Problem: A UDF fails with the error “function available memory exhausted”

Reduce the amount of memory used by the UDF.

Check the UDF code for bugs or memory leaks.

For more information, see [Memory](udf-python-designing.md).

### Problem: I want to extract ZIP or other archives inside a UDF

To see an example of how to upload a ZIP file to a Snowflake stage and then unzip it into the `/tmp` directory
inside the UDF, see [Unzipping a staged file](udf-python-examples.md).

### Problem: UDF performance is slow

For information about how to improve the performance of UDFs, see [Optimizing for scale and performance](udf-python-designing.md).

### Problem: The ORGADMIN role is not enabled so Anaconda packages cannot be used

When going through the steps to [get started using third-party packages from Anaconda](udf-python-packages.md),
the organization administrator (ORGADMIN) role is required.

To resolve this problem, follow the instructions in [Enabling the ORGADMIN role in an account](../../../user-guide/organization-administrators.md).

### Problem: A UDF fails with the error “UnicodeDecodeError” when reading a file

When you use the `SnowflakeFile` class to read files that contain non-text data, you must specify the input mode as binary.
Otherwise you might encounter the following error:

```python
UnicodeDecodeError: 'utf-8' codec can't decode byte 0xf7 in position 12: invalid start byte
```

To resolve this problem, specify the input mode as binary by passing `'rb'` for the `mode` argument (the second argument). For example:

```python
with SnowflakeFile.open(file_name, 'rb') as f:
```

### Tips

Training machine learning (ML) models can sometimes be very resource intensive.
Snowpark-optimized warehouses are a type of Snowflake virtual warehouse that can be used for workloads
that require a large amount of memory and compute resources.
For information on machine learning models and Snowpark Python, see [Training Machine Learning Models with Snowpark Python](../../snowpark/python/python-snowpark-training-ml.md).

If using a Python UDF in a [masking policy](../../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF, and
masking policy match.

For troubleshooting information about third-party packages, see [Known issues with third-party packages](udf-python-packages.md).

---
title: Troubleshooting Scala UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-troubleshooting.md
section: Developer Guide
---

# Troubleshooting Scala UDFs

This topic provides troubleshooting information about Scala UDFs (user-defined functions).

## Troubleshooting

### Tips

If using a Scala UDF in a [masking policy](../../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF, and
masking policy match.

---
title: Troubleshooting Snowpark Submit operation
source: https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-submit-troubleshooting.md
section: Developer Guide
---

# Troubleshooting Snowpark Submit operation

To diagnose and resolve issues that you encounter when you use Snowpark Submit, try the following the suggestion:

For more detailed options and command-line help, run `snowpark-submit --help` or see the
[Snowpark Submit reference](snowpark-submit-reference.md).

* Check the workload status and logs.

  ```bash
  snowpark-submit --snowflake-workload-name MY_JOB --workload-status --display-logs --snowflake-connection-name MY_CONNECTION
  ```

  When an event table is not used to store log data, logs are retained for a short period of time, such as five minutes or less.
* Verify your compute pool’s configuration.

  Ensure that your compute pool exists and has the appropriate privileges to run the workload.
* Ensure access to stages that are needed.

  Confirm that your application has proper access to any referenced stages and files within them.
* Ensure that dependencies are included.

  Verify that all application dependencies are correctly packaged and accessible to your Spark application.

---
title: Troubleshooting SQL UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-troubleshooting.md
section: Developer Guide
---

# Troubleshooting SQL UDFs

This topic provides troubleshooting information about SQL UDFs (user-defined functions).

## Troubleshooting

### Tips

If using a SQL UDF in a [masking policy](../../../sql-reference/sql/create-masking-policy.md), ensure the data type of the column, UDF, and
masking policy match.

### Error message: `Unsupported subquery type`

Cause:
:   If a UDF contains a query expression, then the UDF can act as a [subquery](../../../user-guide/querying-subqueries.md).
    If a subquery is passed a column name, then the subquery can act as a
    [correlated subquery](../../../user-guide/querying-subqueries.md). If a correlated subquery violates the
    Snowflake rules for correlated subqueries, then the user gets the error message `Unsupported subquery type`.
    The example below shows an invalid correlated subquery, and how a UDF can act like a similar invalid correlated
    subquery.

    Create a pair of tables and load data into them:

    ```sqlexample
    CREATE TABLE stores (store_ID INTEGER, city VARCHAR);
    CREATE TABLE employee_sales (employee_ID INTEGER, store_ID INTEGER, sales NUMERIC(10,2),
        sales_date DATE);
    INSERT INTO stores (store_ID, city) VALUES
        (1, 'Winnipeg'),
        (2, 'Toronto');
    INSERT INTO employee_sales (employee_ID, store_ID, sales, sales_date) VALUES
        (1001, 1, 9000.00, '2020-01-27'),
        (1002, 1, 2000.00, '2020-01-27'),
        (2001, 2, 6000.00, '2020-01-27'),
        (2002, 2, 4000.00, '2020-01-27'),
        (2002, 2, 5000.00, '2020-01-28')
        ;
    ```

    The following SQL statement contains a correlated subquery that does not follow Snowflake rules. This code
    causes an `Unsupported subquery type` error:

    ```sqlexample
    SELECT employee_ID,
           store_ID,
           (SELECT city FROM stores WHERE stores.store_ID = employee_sales.store_ID)
        FROM employee_sales;
    ```

    The code below creates and then calls a subquery-like UDF in a way that creates a correlated subquery similar to
    the one shown above:

    ```sqlexample
    CREATE FUNCTION subquery_like_udf(X INT)
        RETURNS VARCHAR
        LANGUAGE SQL
        AS
        $$
            SELECT city FROM stores WHERE stores.store_ID = X
        $$;
    ```

    ```sqlexample
    SELECT employee_ID, subquery_like_udf(employee_sales.store_ID)
        FROM employee_sales;
    ```

Solution #1:
:   If the UDF contains a query expression, then call the UDF only in ways consistent with the rules for
    [subqueries](../../../user-guide/querying-subqueries.md).

    For example, the following statement calls the UDF with a constant rather than with a column name, so the UDF
    does not act like a correlated subquery:

    ```sqlexample
    SELECT subquery_like_udf(1);
    +----------------------+
    | SUBQUERY_LIKE_UDF(1) |
    |----------------------|
    | Winnipeg             |
    +----------------------+
    ```

Solution #2:
:   In some cases, you can re-write the UDF to achieve the same goal a different way. A correlated subquery is allowed
    if the subquery can be statically determined to return one row. The following UDF uses an aggregate function
    and therefore returns only one row:

    ```sqlexample
    CREATE FUNCTION subquery_like_udf_2(X INT)
        RETURNS VARCHAR
        LANGUAGE SQL
        AS
        $$
            SELECT ANY_VALUE(city) FROM stores WHERE stores.store_ID = X
        $$;
    ```

    ```sqlexample
    SELECT employee_ID, sales_date, subquery_like_udf_2(employee_sales.store_ID)
        FROM employee_sales
        ORDER BY employee_ID, sales_date;
    +-------------+------------+----------------------------------------------+
    | EMPLOYEE_ID | SALES_DATE | SUBQUERY_LIKE_UDF_2(EMPLOYEE_SALES.STORE_ID) |
    |-------------+------------+----------------------------------------------|
    |        1001 | 2020-01-27 | Winnipeg                                     |
    |        1002 | 2020-01-27 | Winnipeg                                     |
    |        2001 | 2020-01-27 | Toronto                                      |
    |        2002 | 2020-01-27 | Toronto                                      |
    |        2002 | 2020-01-28 | Toronto                                      |
    +-------------+------------+----------------------------------------------+
    ```

---
title: Troubleshooting telemetry data collection
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-troubleshooting.md
section: Developer Guide
---

# Troubleshooting telemetry data collection

## Logging, metrics, or tracing data is not visible

For example, you might see No Metrics Data on the Related Metrics panel under Query History > Query Telemetry. Or your
event table queries for data return no results. There’s a good chance this is due to telemetry not being fully enabled. To learn more, see
[Enabling telemetry collection](logging-tracing-enabling.md).

To troubleshoot, confirm the following:

* Your account has an active event table and that the table is the one you’re checking for data.

  For more information, see [Event table overview](event-table-setting-up.md).
* The default level for the data you’re looking for (logging, metrics, or tracing) is set to a value that allows data to be recorded.

  For more information, see [Setting levels for logging, metrics, and tracing](telemetry-levels.md).
* You are setting the levels for logs, traces, and metrics high enough at runtime.

  For example, although you might have set the level for each when you [enabled telemetry collection](logging-tracing-enabling.md),
  you might be overriding those levels for individual objects. For more information on setting and overriding levels, see
  [Setting levels for logging, metrics, and tracing](telemetry-levels.md).
* You have installed the telemetry package you need for your handler language. These packages should be added to the PACKAGES statement of
  your UDF or stored procedure, or added to your Streamlit with the Packages dropdown.

  + For Java and Scala: `com.snowflake.telemetry`
* The type of object from which you want to collect data supports emitting telemetry data. For information about language support for
  types of telemetry data, see the following topics on supported languages:

  + [For logging](logging.md)
  + [For metrics](metrics.md)
  + [For tracing](tracing.md)
* The event table has not been truncated.

  For more information, see [TRUNCATE TABLE](../../sql-reference/sql/truncate-table.md).
* You have raw data in the event table.

  + If your queries of the event table return data but you don’t see the data in Snowsight, ensure that you’ve chosen a warehouse
    in Snowsight.
  + **Metrics:** If your queries of the event table return no data, ensure that the duration of the procedure or UDF execution for which you
    want to collect data is longer than the metrics collection interval. Short-running jobs may not emit any metrics data.

    For information about the role time plays in metrics data collection, see [Metrics limitations](metrics-limitations.md).
  + Remember that the data might not yet be in the event table.

    For example, it might take longer due to latency. It can take up to 5 minutes for the metrics data to be available in the event table
    and in Snowsight.

  You can query the event table for raw data as described in the following topics:

  + [Query the event table for log entries](logging-accessing-messages.md)
  + [Query the event table for trace entries](tracing-accessing-events.md)

---
title: Tutorial 1: Create a database, schema, table, and warehouse
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/tutorials/tutorial-1.md
section: Developer Guide
---

Snowflake

Getting Started

App Development

Data Engineering

# Tutorial 1: Create a database, schema, table, and warehouse

## Introduction

In this tutorial, you learn the fundamentals for managing Snowflake resource objects using the Snowflake Python APIs. To get started with the
API, you create and manage a Snowflake database, schema, table, and virtual warehouse.

### Prerequisites

> **Note:**
>
> If you have already completed the steps in [Common setup for Snowflake Python APIs tutorials](common-setup.md), you can skip these prerequisites and proceed to the first step of this
> tutorial.

Before you start this tutorial, you must complete the [common setup](common-setup.md) instructions, which includes the following steps:

> * Set up your development environment.
> * Install the Snowflake Python APIs package.
> * Configure your Snowflake connection.
> * Import all the modules required for the Python API tutorials.
> * Create an API `Root` object.

After completing these prerequisites, you are ready to start using the API.

## Create a database, schema, and table

You can use your `root` object to create a database, schema, and table in your Snowflake account.

1. To create a database, in the next cell of your notebook, run the following code:

   ```python
   database = root.databases.create(
     Database(
       name="PYTHON_API_DB"),
       mode=CreateMode.or_replace
     )
   ```

   This code, which is functionally equivalent to the SQL command `CREATE OR REPLACE DATABASE PYTHON_API_DB`, creates a database in your
   account named `PYTHON_API_DB`. This code follows a common pattern for managing objects in Snowflake:

   * `root.databases.create()` creates a database in Snowflake. It accepts two arguments: a `Database` object and a mode.
   * You pass a `Database` object by using `Database(name="PYTHON_API_DB")`, and set the name of the database by using the
     `name` argument. Recall that you imported `Database` on line 3 of the notebook.
   * You specify the creation mode by passing the `mode` argument. In this case, you set it to `CreateMode.or_replace`, but the
     following values are also valid:

     + `CreateMode.if_not_exists`: Functionally equivalent to CREATE IF NOT EXISTS in SQL.
     + `CreateMode.error_if_exists`: Raises an exception if the object already exists in Snowflake. This is the default value if a
       mode is not specified.
   * You manage the database programmatically by storing a reference to the database in an object you created named `database`.

   For more information, see [Managing Snowflake databases, schemas, tables, and views with Python](../snowflake-python-managing-databases.md).
2. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
3. In the navigation menu, select Catalog » Database Explorer. If your code was successful, the
   `PYTHON_API_DB` database is listed.

   > **Tip:**
   >
   > If you use VS Code, install the [Snowflake extension](https://marketplace.visualstudio.com/items?itemName=snowflake.snowflake-vsc) to
   > explore all Snowflake objects within your editor.
4. To create a schema in the `PYTHON_API_DB` database, in your next cell, run the following code:

   ```python
   schema = database.schemas.create(
     Schema(
       name="PYTHON_API_SCHEMA"),
       mode=CreateMode.or_replace,
     )
   ```

   Note that you call `.schemas.create()` on the `database` object you created previously.
5. To create a table in the schema you just created, in your next cell, run the following code:

   ```python
   table = schema.tables.create(
     Table(
       name="PYTHON_API_TABLE",
       columns=[
         TableColumn(
           name="TEMPERATURE",
           datatype="int",
           nullable=False,
         ),
         TableColumn(
           name="LOCATION",
           datatype="string",
         ),
       ],
     ),
   mode=CreateMode.or_replace
   )
   ```

   This code creates a table in the `PYTHON_API_SCHEMA` schema with two columns and their data types specified: `TEMPERATURE` as
   `int`, and `LOCATION` as `string`.

   These last two code examples should look familiar because they follow the pattern in the first step where you created the
   `PYTHON_API_DB` database.
6. To confirm that the objects were created, return to your Snowflake account in Snowsight.

## Retrieve object data

You can retrieve metadata about an object in Snowflake.

1. To retrieve details about the `PYTHON_API_TABLE` table you created previously, in your next notebook cell, run the following code:

   ```python
   table_details = table.fetch()
   ```

   `fetch()` returns a `TableModel` object.
2. You can then call `.to_dict()` on the resulting object to view its detailed information.

   To print the table details, in your next cell, run the following code:

   ```python
   table_details.to_dict()
   ```

   The notebook should display a dictionary that contains metadata about the `PYTHON_API_TABLE` table, similar to this:

   ```python
   {
       "name": "PYTHON_API_TABLE",
       "kind": "TABLE",
       "enable_schema_evolution": False,
       "change_tracking": False,
       "data_retention_time_in_days": 1,
       "max_data_extension_time_in_days": 14,
       "default_ddl_collation": "",
       "columns": [
           {"name": "TEMPERATURE", "datatype": "NUMBER(38,0)", "nullable": False},
           {"name": "LOCATION", "datatype": "VARCHAR(16777216)", "nullable": True},
       ],
       "created_on": datetime.datetime(
           2024, 5, 9, 8, 59, 15, 832000, tzinfo=datetime.timezone.utc
       ),
       "database_name": "PYTHON_API_DB",
       "schema_name": "PYTHON_API_SCHEMA",
       "rows": 0,
       "bytes": 0,
       "owner": "ACCOUNTADMIN",
       "automatic_clustering": False,
       "search_optimization": False,
       "owner_role_type": "ROLE",
   }
   ```

   As shown, this dictionary contains information about the `PYTHON_API_TABLE` table that you created previously, with detailed
   information about `columns`, `owner`, `database`, `schema`, and more.

Object metadata is useful when you are building business logic in your application. For example, you might build logic that runs depending
on certain information about an object. You can use `fetch()` to retrieve object metadata in such scenarios.

## Programmatically alter a table

You can programmatically add a column to a table. The `PYTHON_API_TABLE` table currently has two columns: `TEMPERATURE` and `LOCATION`.
In this scenario, you want to add a new column named `ELEVATION` of type `int` and set it as the table’s primary key.

1. In your next cell, run the following code:

   ```python
   table_details.columns.append(
       TableColumn(
         name="elevation",
         datatype="int",
         nullable=False,
         constraints=[PrimaryKey()],
       )
   )
   ```

   > **Note:**
   >
   > This code does not create the column. Instead, this column definition is appended to the array that represents the table’s columns in
   > the `TableModel`. To view this array, review the value of `columns` as described in the instructions for viewing the table
   > metadata.
2. To modify the table and add the column, in your next cell, run the following code:

   ```python
   table.create_or_alter(table_details)
   ```

   In this line, you call `create_or_alter()` on the object representing `PYTHON_API_TABLE`, and pass the updated value of
   `table_details`. This line adds the `ELEVATION` column to `PYTHON_API_TABLE`.
3. To confirm that the column was added, in your next cell, run the following code:

   ```python
   table.fetch().to_dict()
   ```

   The output should look similar to this:

   ```python
   {
       "name": "PYTHON_API_TABLE",
       "kind": "TABLE",
       "enable_schema_evolution": False,
       "change_tracking": False,
       "data_retention_time_in_days": 1,
       "max_data_extension_time_in_days": 14,
       "default_ddl_collation": "",
       "columns": [
           {"name": "TEMPERATURE", "datatype": "NUMBER(38,0)", "nullable": False},
           {"name": "LOCATION", "datatype": "VARCHAR(16777216)", "nullable": True},
           {"name": "ELEVATION", "datatype": "NUMBER(38,0)", "nullable": False},
       ],
       "created_on": datetime.datetime(
           2024, 5, 9, 8, 59, 15, 832000, tzinfo=datetime.timezone.utc
       ),
       "database_name": "PYTHON_API_DB",
       "schema_name": "PYTHON_API_SCHEMA",
       "rows": 0,
       "bytes": 0,
       "owner": "ACCOUNTADMIN",
       "automatic_clustering": False,
       "search_optimization": False,
       "owner_role_type": "ROLE",
       "constraints": [
           {"name": "ELEVATION", "column_names": ["ELEVATION"], "constraint_type": "PRIMARY KEY"}
       ]
   }
   ```

   Review the value of `columns` and the value of `constraints`, both of which now include the `ELEVATION` column.
4. To confirm the existence of the new column, return to your Snowflake account in Snowsight and inspect the table.

## Create and manage a warehouse

You can also manage virtual warehouses with the Snowflake Python APIs. For example, you might need to create another warehouse temporarily to
run certain queries. In this scenario, you can use the API to create, suspend, or drop a warehouse.

1. To retrieve the collection of warehouses associated with your session, in your next cell, run the following code:

   ```python
   warehouses = root.warehouses
   ```

   You manage warehouses in your session using the resulting `warehouses` object.
2. To define and create a new warehouse, in your next cell, run the following code:

   ```python
   python_api_wh = Warehouse(
       name="PYTHON_API_WH",
       warehouse_size="SMALL",
       auto_suspend=500,
   )

   warehouse = warehouses.create(python_api_wh,mode=CreateMode.or_replace)
   ```

   In this code, you define a new warehouse by instantiating `Warehouse` and specifying the warehouse’s name, size, and auto-suspend
   policy. The auto-suspend timeout is in units of seconds. In this case, the warehouse will be suspended after 8.33 minutes of inactivity.

   You then create the warehouse by calling `create()` on your warehouse collection. You store the reference in the resulting
   `warehouse` object.
3. Navigate to your Snowflake account in Snowsight and confirm that the warehouse was created.
4. To retrieve information about the warehouse, in your next cell, run the following code:

   ```python
   warehouse_details = warehouse.fetch()
   warehouse_details.to_dict()
   ```

   This code should look familiar because it follows the same pattern you used to fetch table metadata in a previous step. The output should
   be similar to this:

   ```python
   {
     'name': 'PYTHON_API_WH',
     'auto_suspend': 500,
     'auto_resume': 'true',
     'resource_monitor': 'null',
     'comment': '',
     'max_concurrency_level': 8,
     'statement_queued_timeout_in_seconds': 0,
     'statement_timeout_in_seconds': 172800,
     'tags': {},
     'warehouse_type': 'STANDARD',
     'warehouse_size': 'Small'
   }
   ```
5. Optional: If you have multiple warehouses in your session, use the API to iterate through them or to search for a specific warehouse.

   In your next cell, run the following code:

   ```python
   warehouse_list = warehouses.iter(like="PYTHON_API_WH")
   result = next(warehouse_list)
   result.to_dict()
   ```

   In this code, you call `iter()` on the warehouse collection and pass the `like` argument, which returns any
   warehouses whose names match the specified string. In this case, you pass the name of the warehouse you defined previously, but
   this argument is generally a case-insensitive string that functions as a filter, with support for SQL wildcard characters like `%`
   and `_`.

   After you run the cell, output similar to the following code shows that you successfully returned a matching warehouse:

   ```python
   {
     'name': 'PYTHON_API_WH',
     'auto_suspend': 500,
     'auto_resume': 'true',
     'resource_monitor': 'null',
     'comment': '',
     'max_concurrency_level': 8,
     'statement_queued_timeout_in_seconds': 0,
     'statement_timeout_in_seconds': 172800,
     'tags': {},
     'warehouse_type': 'STANDARD',
     'warehouse_size': 'Small'
   }
   ```
6. To programmatically modify the warehouse by changing its size to `LARGE`, in your next cell, run the following code:

   ```python
   warehouse = root.warehouses.create(Warehouse(
       name="PYTHON_API_WH",
       warehouse_size="LARGE",
       auto_suspend=500,
   ), mode=CreateMode.or_replace)
   ```
7. To confirm that the warehouse size was updated to `LARGE`, do one of the following:

   * In your next cell, run the following code:

     ```python
     warehouse.fetch().size
     ```
   * Navigate to your Snowflake account in Snowsight and confirm the change in warehouse size.
8. Optional: If you don’t want to continue using the warehouse, drop it. In your next cell, run the following code:

   ```python
   warehouse.drop()
   ```
9. To confirm the warehouse deletion, return to your Snowflake account in Snowsight.

## What’s next?

Congratulations! In this tutorial, you learned the fundamentals for managing Snowflake resource objects using the Snowflake Python APIs.

### Summary

Along the way, you completed the following steps:

* Install the Snowflake Python APIs.
* Set up a connection to Snowflake.
* Create a database, schema, and table.
* Retrieve object information.
* Programmatically alter an object.
* Create, suspend, and drop a warehouse.

### Next tutorial

You can now proceed to [Tutorial 2: Create and manage tasks and task graphs (DAGs)](tutorial-2.md), which shows how to create and manage tasks and task graphs.

### Additional resources

For more examples of using the API to manage other types of objects in Snowflake, see the following developer guides:

| Guide | Description |
| --- | --- |
| [Managing Snowflake users, roles, and grants with Python](../snowflake-python-managing-user-roles.md) | Use the API to create and manage users, roles, and grants. |
| [Managing data loading and unloading resources with Python](../snowflake-python-managing-data-loading.md) | Use the API to create and manage data loading and unloading resources, including external volumes, pipes, and stages. |
| [Managing Snowflake tasks and task graphs with Python](../snowflake-python-managing-tasks.md) | Use the API to create, execute, and manage tasks and task graphs. |
| [Managing Snowpark Container Services (including service functions) with Python](../snowflake-python-managing-containers.md) | Use the API to manage components of Snowpark Container Services, including compute pools, image repositories, services, and service functions. |

---
title: Tutorial 2: Create and manage tasks and task graphs (DAGs)
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/tutorials/tutorial-2.md
section: Developer Guide
---

Snowflake

Getting Started

App Development

Data Engineering

# Tutorial 2: Create and manage tasks and task graphs (DAGs)

## Introduction

In this tutorial, you create and use Snowflake tasks to manage some basic stored procedures. You also create a task graph — also
called a directed acyclic graph (DAG) — to orchestrate tasks with a higher-level task graph API.

### Prerequisites

> **Note:**
>
> If you have already completed both [Common setup for Snowflake Python APIs tutorials](common-setup.md) and [Tutorial 1: Create a database, schema, table, and warehouse](tutorial-1.md), you can skip these prerequisites and proceed to the first
> step of this tutorial.

Before you start this tutorial, you must complete the following steps:

1. Follow the [common setup](common-setup.md) instructions, which includes the following steps:

   * Set up your development environment.
   * Install the Snowflake Python APIs package.
   * Configure your Snowflake connection.
   * Import all the modules required for the Python API tutorials.
   * Create an API `Root` object.
2. Run the following code to create a database named `PYTHON_API_DB` and a schema named `PYTHON_API_SCHEMA` in that database.

   ```python
   database = root.databases.create(
     Database(
       name="PYTHON_API_DB"),
       mode=CreateMode.or_replace
     )

   schema = database.schemas.create(
     Schema(
       name="PYTHON_API_SCHEMA"),
       mode=CreateMode.or_replace,
     )
   ```

   These are the same database and schema objects you create in [Tutorial 1](tutorial-1.md).

After completing these prerequisites, you are ready to start using the API for task management.

## Set up Snowflake objects

Set up the stored procedures that your tasks will invoke and the stage that will hold the stored procedures. You can use your
Snowflake Python APIs `root` object to create a stage in the `PYTHON_API_DB` database and `PYTHON_API_SCHEMA` schema you previously
created.

1. To create a stage named `TASKS_STAGE`, in the next cell of your notebook, run the following code:

   ```python
   stages = root.databases[database.name].schemas[schema.name].stages
   stages.create(Stage(name="TASKS_STAGE"))
   ```

   This stage will hold the stored procedures and any dependencies those procedures need.
2. To create two basic Python functions that the tasks will run as stored procedures, in your next cell, run the following code:

   ```python
   def trunc(session: Session, from_table: str, to_table: str, count: int) -> str:
     (
       session
       .table(from_table)
       .limit(count)
       .write.save_as_table(to_table)
     )
     return "Truncated table successfully created!"

   def filter_by_shipmode(session: Session, mode: str) -> str:
     (
       session
       .table("snowflake_sample_data.tpch_sf100.lineitem")
       .filter(col("L_SHIPMODE") == mode)
       .limit(10)
       .write.save_as_table("filter_table")
     )
     return "Filter table successfully created!"
   ```

   These functions do the following:

   * `trunc()`: Creates a truncated version of an input table.
   * `filter_by_shipmode()`: Filters the `SNOWFLAKE_SAMPLE_DATA.TPCH_SF100.LINEITEM` table by ship mode, limits the results to 10
     rows, and writes the results in a new table.

     > **Note:**
     >
     > This function queries the [TPC-H sample data](../../../user-guide/sample-data-tpch.md) in the SNOWFLAKE_SAMPLE_DATA database. Snowflake
     > creates the sample database in new accounts by default. If the database has not been created in your account, see
     > [Use the sample database](../../../user-guide/sample-data-using.md).

   The functions are intentionally basic and are intended for demonstration purposes.

## Create and manage tasks

Define, create, and manage two tasks that will run your previously created Python functions as stored procedures.

1. To define the two tasks, `task1` and `task2`, in the next cell of your notebook, run the following code:

   ```python
   tasks_stage = f"{database.name}.{schema.name}.TASKS_STAGE"

   task1 = Task(
       name="task_python_api_trunc",
       definition=StoredProcedureCall(
         func=trunc,
         stage_location=f"@{tasks_stage}",
         packages=["snowflake-snowpark-python"],
       ),
       warehouse="COMPUTE_WH",
       schedule=timedelta(minutes=1)
   )

   task2 = Task(
       name="task_python_api_filter",
       definition=StoredProcedureCall(
         func=filter_by_shipmode,
         stage_location=f"@{tasks_stage}",
         packages=["snowflake-snowpark-python"],
       ),
       warehouse="COMPUTE_WH"
   )
   ```

   In this code, you specify the following task parameters:

   * For each task, a definition represented by a [StoredProcedureCall](https://docs.snowflake.com/en/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.task.StoredProcedureCall#snowflake.core.task.StoredProcedureCall)
     object that includes the following attributes:

     + The callable function to run
     + The stage location where the contents of your Python function and its dependencies are uploaded
     + The stored procedure’s package dependencies
   * A warehouse to run the stored procedure (required when creating a task with a `StoredProcedureCall` object). This tutorial uses
     the `COMPUTE_WH` warehouse that is included with your trial account.
   * A run schedule for the root task, `task1`. The schedule specifies the interval at which to run the task periodically.

   For more information about stored procedures, see [Writing stored procedures with SQL and Python](../../stored-procedure/python/procedure-python-overview.md).
2. To create the two tasks, retrieve a `TaskCollection` object (`tasks`) from your database schema and call `.create()` on
   your task collection:

   ```python
   # create the task in the Snowflake database
   tasks = schema.tasks
   trunc_task = tasks.create(task1, mode=CreateMode.or_replace)

   task2.predecessors = [trunc_task.name]
   filter_task = tasks.create(task2, mode=CreateMode.or_replace)
   ```

   In this code example, you also link the tasks by setting `task1` as a predecessor to `task2`, which creates a minimal task graph.
3. To confirm that the two tasks now exist, in your next cell, run the following code:

   ```python
   taskiter = tasks.iter()
   for t in taskiter:
       print(t.name)
   ```
4. When you create tasks, they are suspended by default.

   To start a task, call `.resume()` on the task resource object:

   ```python
   trunc_task.resume()
   ```
5. To confirm that the `trunc_task` task was started, in your next cell, run the following code:

   ```python
   taskiter = tasks.iter()
   for t in taskiter:
       print("Name: ", t.name, "| State: ", t.state)
   ```

   The output should be similar to this:

   ```output
   Name:  TASK_PYTHON_API_FILTER | State:  suspended
   Name:  TASK_PYTHON_API_TRUNC | State:  started
   ```

   You can repeat this step whenever you want to confirm the status of the tasks.
6. To clean up your task resources, you first suspend the task before dropping it.

   In your next cell, run the following code:

   ```python
   trunc_task.suspend()
   ```
7. To confirm that the task is suspended, repeat step 5.
8. Optional: To drop both tasks, in your next cell, run the following code:

   ```python
   trunc_task.drop()
   filter_task.drop()
   ```

## Create and manage a task graph

When you’re coordinating the execution of a large number of tasks, individually managing each task can be a challenge.
The Snowflake Python APIs provides functionality to orchestrate tasks with a higher-level task graph API.

A task graph, which is also called a directed acyclic graph (DAG), is a series of tasks composed of a root task and child tasks,
organized by their dependencies. For more information, see [Create a sequence of tasks with a task graph](../../../user-guide/tasks-graphs.md).

1. To create and deploy a task graph, run the following code:

   ```python
   dag_name = "python_api_dag"
   dag = DAG(name=dag_name, schedule=timedelta(days=1))
   with dag:
       dag_task1 = DAGTask(
           name="task_python_api_trunc",
           definition=StoredProcedureCall(
               func=trunc,
               stage_location=f"@{tasks_stage}",
               packages=["snowflake-snowpark-python"]),
           warehouse="COMPUTE_WH",
       )
       dag_task2 = DAGTask(
           name="task_python_api_filter",
           definition=StoredProcedureCall(
               func=filter_by_shipmode,
               stage_location=f"@{tasks_stage}",
               packages=["snowflake-snowpark-python"]),
           warehouse="COMPUTE_WH",
       )
       dag_task1 >> dag_task2
   dag_op = DAGOperation(schema)
   dag_op.deploy(dag, mode=CreateMode.or_replace)
   ```

   In this code, you do the following:

   * Create a task graph object by calling the `DAG` constructor and specifying a name and schedule.
   * Define task graph–specific tasks using the `DAGTask` constructor. Note that the constructor accepts the same arguments that you
     specified for the `StoredProcedureCall` class in a previous step.
   * Specify `dag_task1` as the root task and predecessor to `dag_task2` with more convenient syntax.
   * Deploy the task graph to the `PYTHON_API_SCHEMA` schema of the `PYTHON_API_DB` database.
2. To confirm the creation of the task graph, in your next cell, run the following code:

   ```python
   taskiter = tasks.iter()
   for t in taskiter:
       print("Name: ", t.name, "| State: ", t.state)
   ```

   You can repeat this step whenever you want to confirm the status of the tasks.
3. To start the task graph by starting the root task, in your next cell, run the following code:

   ```python
   dag_op.run(dag)
   ```
4. To confirm that the `PYTHON_API_DAG$TASK_PYTHON_API_TRUNC` task started, repeat step 2.

   > **Note:**
   >
   > The function call invoked by the task graph will not succeed because you are not calling it with any of its required arguments.
   > The purpose of this step is only to demonstrate how to programmatically start the task graph.
5. To drop the task graph, in your next cell, run the following code:

   ```python
   dag_op.drop(dag)
   ```
6. Clean up the database object that you created in these tutorials:

   ```python
   database.drop()
   ```

## What’s next?

Congratulations! In this tutorial, you learned how to create and manage tasks and task graphs using the Snowflake Python APIs.

### Summary

Along the way, you completed the following steps:

* Create a stage that can hold stored procedures and their dependencies.
* Create and manage tasks.
* Create and manage a task graph.
* Clean up your Snowflake resource objects by dropping them.

### Next tutorial

You can now proceed to [Tutorial 3: Create and manage Snowpark Container Services](tutorial-3.md), which shows how to create and manage components in Snowpark Container Services.

### Additional resources

For more examples of using the API to manage other types of objects in Snowflake, see the following developer guides:

| Guide | Description |
| --- | --- |
| [Managing Snowflake databases, schemas, tables, and views with Python](../snowflake-python-managing-databases.md) | Use the API to create and manage databases, schemas, and tables. |
| [Managing Snowflake users, roles, and grants with Python](../snowflake-python-managing-user-roles.md) | Use the API to create and manage users, roles, and grants. |
| [Managing data loading and unloading resources with Python](../snowflake-python-managing-data-loading.md) | Use the API to create and manage data loading and unloading resources, including external volumes, pipes, and stages. |
| [Managing Snowpark Container Services (including service functions) with Python](../snowflake-python-managing-containers.md) | Use the API to manage components of Snowpark Container Services, including compute pools, image repositories, services, and service functions. |

---
title: Tutorial 3: Create and manage Snowpark Container Services
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/tutorials/tutorial-3.md
section: Developer Guide
---

Snowflake

Getting Started

App Development

Data Engineering

# Tutorial 3: Create and manage Snowpark Container Services

## Introduction

Snowpark Container Services is a fully managed container offering designed to facilitate the deployment, management, and scaling of containerized applications
within the Snowflake ecosystem. With this feature, you can run containerized workloads directly within Snowflake.

In this tutorial, you learn how to use Snowflake Python APIs to manage components in Snowpark Container Services.

> **Important:**
>
> Snowpark Container Services is generally available to Snowflake accounts in AWS. [Preview support](../../../release-notes/preview-features.md) is available to
> accounts in Azure. For more information, see [Snowpark Container Services – Available regions](../../snowpark-container-services/overview.md).

### Prerequisites

Before you start this tutorial, you must complete these steps:

1. Install Docker Desktop.

   This tutorial provides instructions that require Docker Desktop. For installation instructions, see <https://docs.docker.com/get-docker/>.
2. Follow the [common setup](common-setup.md) instructions, which include the following steps:

   * Set up your development environment.
   * Install the Snowflake Python APIs package.
   * Configure your Snowflake connection.
   * Import all the modules required for the Python API tutorials.
   * Create an API `Root` object.
   > **Note:**
   >
   > If you have already completed the [common setup](common-setup.md), you can skip this step and begin the tutorial.

After completing these prerequisites, you are ready to start using the API for managing Snowpark Container Services.

## Set up your development environment

If you were using a notebook for the previous Snowflake Python APIs tutorials, you switch to a new notebook in this tutorial. The notebook
will contain sample code that runs an NGINX web server using Snowpark Container Services, all of which runs in Snowflake.

1. Open a new notebook using your preferred code editor or by running the command `jupyter notebook`.
2. In the first cell of your notebook, run the following code:

   ```python
   from snowflake.core.database import Database
   from snowflake.core.schema import Schema

   database = root.databases.create(Database(name="spcs_python_api_db"), mode="orreplace")
   schema = database.schemas.create(Schema(name="public"), mode="orreplace")
   ```

   Using the Snowflake connection and `root` object that you created previously in the [common setup](common-setup.md), you create
   a database named `spcs_python_api_db` and a schema named `public` in that database. You also save references that represent these
   newly created objects. Your Snowpark Container Services components will live in this database and schema.

## Overview of Snowpark Container Services

Before you continue with the tutorial, briefly review the main components of Snowpark Container Services. To run containerized applications in Snowpark Container Services, you
typically work with the following objects:

* **Image repository**: Provides a storage unit where you can upload your application images in your Snowflake account.

  Snowpark Container Services provides an OCIv2-compliant image registry service that enables OCI clients (such as Docker CLI and SnowSQL) to access an image
  registry in your Snowflake account. Using these clients, you can upload your application images to a repository.

  For more information, see [Working with an image registry and repository](../../snowpark-container-services/working-with-registry-repository.md).
* **Compute pool**: Represents a set of compute resources (virtual machine nodes).

  These compute resources are analogous, but not equivalent, to Snowflake virtual warehouses. The service (in this case, your NGINX service)
  will run in the compute pool. Compute-intensive services require high-powered compute pools with many cores and many GPUs, while less
  intensive services can run in smaller compute pools with fewer cores.

  For more information, see [Working with compute pools](../../snowpark-container-services/working-with-compute-pool.md).
* **Service**: Provides a way to run an application container.

  At a minimum, services require a specification and a compute pool. A specification contains the information needed to run the application
  container, such as the path to a container image and the endpoints that the services will expose. The specification is written in YAML.
  The compute pool is the set of compute resources in which the service will run.

  For more information, see [Working with services](../../snowpark-container-services/working-with-services.md).

Continue to the next steps to create and set up these objects.

## Create an image repository

In this section, first you create an image repository using the Snowflake Python APIs. Then you fetch an NGINX application image from
Docker Hub and upload the image to the image repository using the Docker CLI.

**Create a repository and get information about the repository**

1. In the next cell of your notebook, run the following code:

   ```python
   from snowflake.core.image_repository import ImageRepository

   my_repo = ImageRepository("MyImageRepository")
   schema.image_repositories.create(my_repo)
   ```

   In this code example, you create an image repository in the database and schema you created previously in this tutorial.
2. To confirm the repository was created successfully by fetching its details and printing its name, run the following code:

   ```python
   my_repo_res = schema.image_repositories["MyImageRepository"]
   my_repo = my_repo_res.fetch()
   print(my_repo.name)
   ```
3. You will need information about the repository (the repository URL and the registry hostname) before you can upload the image.

   To get the repository URL, in your next cell, run the following code:

   ```python
   repositories = schema.image_repositories
     for repo_obj in repositories.iter():
       print(repo_obj.repository_url)
   ```

   * The `repository_url` attribute in the output provides the URL. For example:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com/spcs_python_api_db/public/myimagerepository
     ```
   * The hostname in the repository URL is the registry hostname. For example:

     ```output
     <orgname>-<acctname>.registry.snowflakecomputing.com
     ```

**Fetch the NGINX image and upload it to the repository**

1. For Docker to upload an image to your repository on your behalf, you must first authenticate Docker with Snowflake.

   To authenticate Docker with the Snowflake registry, open a command-line terminal and run the following `docker login` command
   using the Docker CLI:

   ```bash
   docker login <registry_hostname> -u <username>
   ```

   * `registry_hostname`: Specify the hostname in `repository_url` from the result of the previous step.
   * `username`: Specify your Snowflake username. Docker will prompt you for your password.

   **Example**

   ```bash
   docker login myorg-myacct.registry.snowflakecomputing.com -u admin
   ```
2. Fetch the AMD64 build of the [NGINX image from Docker Hub](https://hub.docker.com/r/amd64/nginx/):

   ```bash
   docker pull --platform linux/amd64 amd64/nginx
   ```
3. Tag the `amd64/nginx` image with the Snowflake image repository URL:

   ```bash
   docker tag docker.io/amd64/nginx:latest <repository_url>/<image_name>
   ```

   **Example**

   ```bash
   docker tag docker.io/amd64/nginx:latest myorg-myacct.registry.snowflakecomputing.com/spcs_python_api_db/public/myimagerepository/amd64/nginx:latest
   ```

   A tag is a custom, human-readable identifier that you can optionally use to identify a specific version or variant of an image.
4. Upload the image to the repository in your Snowflake account:

   ```bash
   docker push <repository_url>/<image_name>
   ```

   **Example**

   ```bash
   docker push myorg-myacct.registry.snowflakecomputing.com/spcs_python_api_db/public/myimagerepository/amd64/nginx:latest
   ```

## Create a compute pool

To define and create a compute pool, in the next cell of your notebook, run the following code:

```python
new_compute_pool_def = ComputePool(
    name="MyComputePool",
    instance_family="CPU_X64_XS",
    min_nodes=1,
    max_nodes=2,
)

new_compute_pool = root.compute_pools.create(new_compute_pool_def)
```

In this cell, you define a compute pool using the `ComputePool` constructor by providing values for the following attributes:

* `instance_family`: The instance family identifies the type of machine you want to provision for the nodes in the compute pool.

  Each machine type provides a different amount of compute resources to their compute pools. In this cell, you use the smallest available
  machine type, `CPU_X64_XS`. For more information, see [CREATE COMPUTE POOL](../../../sql-reference/sql/create-compute-pool.md).
* `min_nodes`: The minimum number of nodes to launch the compute pool with.
* `max_nodes`: The maximum number of nodes the compute pool can scale to.

  When you create a compute pool, Snowflake launches it with the minimum number of nodes specified. Snowflake then manages the scaling
  automatically and creates new nodes—up to the maximum number specified—when the running nodes can’t take any additional workload.

Then you create the compute pool by passing the compute pool definition to `compute_pools.create()`.

## Create a service

Using the image repository and compute pool you set up, you can now define and create your service. A service refers to a collection of
containers running in a compute pool, which are all orchestrated in Snowflake.

1. To retrieve the repository containing your container image, in the next cell of your notebook, run the following code:

   ```python
   image_repository = schema.image_repositories["MyImageRepository"]
   ```

   This repository is in your Snowflake account, listed as a stage in the PUBLIC schema. You need this reference to fetch the container
   image information in the next step.
2. To define and create your service, in your next cell, run the following code:

   ```python
   from textwrap import dedent
   from io import BytesIO
   from snowflake.core.service import Service, ServiceSpecInlineText

   specification = dedent(f"""\
       spec:
         containers:
         - name: web-server
           image: {image_repository.fetch().repository_url}/amd64/nginx:latest
         endpoints:
         - name: ui
           port: 80
           public: true
       """)

   service_def = Service(
       name="MyService",
       compute_pool="MyComputePool",
       spec=ServiceSpecInlineText(spec_text=specification),
       min_instances=1,
       max_instances=1,
   )

   nginx_service = schema.services.create(service_def)
   ```

   This cell defines the service specification and the service, and then creates the service for your NGINX web server. The definitions for
   the specification and the service have the following properties:

   * `specification` – You define the specification using a Python *formatted string literal* (f-string). The string is formatted as
     YAML.

     The specification contains the name of the container, a path to the container image, and the endpoints that the service will expose
     for public access. In this example, you define the specification inline, but you can also define a specification as a reference to a
     `.yml` file in a stage.
   * `service_def` – You define a service with the `Service` constructor, passing in a name for the service, the compute pool
     it will run in, a path to the specification, and the total number of instances for the service.

     In this cell, you use `ServiceSpecInlineText` to set the value of `spec` because you define the specification inline as an
     f-string. You can specify the service to run multiple instances, but in this example you specify only one instance of the service to
     run by setting `min_instances` and `max_instances` to `1`.
3. To check the status of the service, in your next cell, run the following code:

   ```python
   from pprint import pprint

   pprint(nginx_service.get_service_status(timeout=5))
   ```

   The output should be similar to this:

   ```output
   {'auto_resume': True,
   'auto_suspend_secs': 3600,
   'instance_family': 'CPU_X64_XS',
   'max_nodes': 1,
   'min_nodes': 1,
   'name': 'MyService'}
   ```

## Use your service

After you create the service, Snowpark Container Services will take a few minutes to provision the endpoints that are needed to access the service.

1. To check the status of the endpoints, in the next cell of your notebook, run the following code:

   ```python
   import json, time

   while True:
       public_endpoints = nginx_service.fetch().public_endpoints
       try:
           endpoints = json.loads(public_endpoints)
       except json.JSONDecodeError:
           print(public_endpoints)
           time.sleep(15)
       else:
           break
   ```

   The code example isn’t specific to Snowpark Container Services or the Snowflake Python APIs – it simply provides a handy way to check whether the endpoints
   are ready. Note that you fetch the endpoints by calling `.fetch().public_endpoints` on your service object.

   The output should be similar to this:

   ```output
   Endpoints provisioning in progress... check back in a few minutes
   Endpoints provisioning in progress... check back in a few minutes
   Endpoints provisioning in progress... check back in a few minutes
   ```
2. After the endpoints are provisioned, you can open the public endpoints in your browser.

   In your next cell, run the following code:

   ```python
   import webbrowser

   print(f"Visiting {endpoints['ui']} in your browser. You might need to log in there.")
   webbrowser.open(f"https://{endpoints['ui']}")
   ```

   The output should be similar to this:

   ```output
   Visiting myorg-myacct.snowflakecomputing.app in your browser. You might need to log in there.
   ```

   If successful, you’ll see the following NGINX success page in your browser when visiting the endpoint:
3. You can use the Python API to manage your new service.

   For example, to suspend the service and then check its status, run the following code:

   ```python
   from time import sleep

   nginx_service.suspend()
   sleep(3)
   print(nginx_service.get_service_status(timeout=5))
   ```
4. To resume the service, run the following code:

   ```python
   nginx_service.resume()
   sleep(3)
   print(nginx_service.get_service_status(timeout=5))
   ```

With just a few lines of Python, you were able to run an NGINX web server in Snowflake using Snowpark Container Services.

## Clean up

Snowflake charges for active compute pool nodes in your account. To prevent unwanted charges, first suspend the service and the compute
pool, and then drop both objects.

1. To suspend the compute pool and the service, in the next cell of your notebook, run the following code:

   ```python
   new_compute_pool_def.suspend()
   nginx_service.suspend()
   ```
2. To drop the compute pool and the service, run the following code:

   ```python
   new_compute_pool_def.drop()
   nginx_service.drop()
   ```

## What’s next?

Congratulations! In this tutorial, you learned the fundamentals for managing components in Snowpark Container Services using the Snowflake Python APIs.

### Summary

Along the way, you completed these steps:

* Create an image repository where you upload your application images.
* Create a compute pool where your service runs.
* Create a service to run your application container.
* Use and manage your service.
* Clean up your Snowpark Container Services resource objects by suspending and dropping them.

### Additional resources

For more examples of using the API to manage other types of objects in Snowflake, see the following developer guides:

| Guide | Description |
| --- | --- |
| [Managing Snowflake databases, schemas, tables, and views with Python](../snowflake-python-managing-databases.md) | Use the API to create and manage databases, schemas, and tables. |
| [Managing Snowflake users, roles, and grants with Python](../snowflake-python-managing-user-roles.md) | Use the API to create and manage users, roles, and grants. |
| [Managing data loading and unloading resources with Python](../snowflake-python-managing-data-loading.md) | Use the API to create and manage data loading and unloading resources, including external volumes, pipes, and stages. |
| [Managing Snowflake tasks and task graphs with Python](../snowflake-python-managing-tasks.md) | Use the API to create, execute, and manage tasks and task graphs. |

---
title: Tutorial: Get started with logging and tracing
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tutorials/logging-tracing-getting-started.md
section: Developer Guide
---

App Development

# Tutorial: Get started with logging and tracing

## Introduction

This tutorial introduces the basics of emitting, collecting, and querying log and trace data from function and procedure handler code.

The tutorial uses the Snowsight web interface, but you can use any Snowflake client that supports executing SQL. For more
information about Snowsight, see [Getting started with worksheets](../../../user-guide/ui-snowsight-worksheets-gs.md) and [Work with worksheets in Snowsight](../../../user-guide/ui-snowsight-worksheets.md).

### What you will learn

In this tutorial, you will learn how to:

* Create an event table to store log and trace data.

  Snowflake collects log and trace data in the table’s predefined structure.
* Emit log messages and trace data from a user-defined function (UDF).

  You can use an API designed for your handler language to emit log messages and trace data from handler code.
* View the collected log and trace data by querying the event table.

  You can query the table with a SELECT statement to analyze the collected data.

### Prerequisites

* You must execute all of the SQL commands in the same SQL command session because the session context is required.

  To do this in Snowsight, for example, paste all of your code into the same worksheet as you go along. As you progress from
  section to section, each section builds on the previous.
* You must be able to use the ACCOUNTADMIN role.

  In this tutorial, you will perform all the steps using the ACCOUNTADMIN role. In general practice, however, you would use roles
  with privileges specifically defined for the action you’re performing. For example, you might have separate roles for developers who
  create UDFs, for analysts who query collected log and trace data, and so on.

  For more about roles, see [Switch your primary role](../../../user-guide/ui-snowsight-gs.md) and [Access control best practices](../../../user-guide/security-access-control-considerations.md).

## Set up the database, warehouse, and access

In this section, you’ll create a warehouse and database you’ll need for the tutorial. You’ll also begin using the ACCOUNTADMIN role, which
is required to execute some of the statements in this tutorial.

You’re creating a database in which you’ll later create the event table and the user-defined function. You can delete all of the objects
you create in the tutorial, including the database and warehouse, when you no longer need them.

To create a database and warehouse for use in the tutorial:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
3. At the top of the navigation menu, select  (Create) » SQL Worksheet.
4. [Rename the new worksheet](../../../user-guide/ui-snowsight-worksheets.md) to `Logging-tracing tutorial`.
5. In the new worksheet, paste and run the following statement to create a database. The new database is just for this tutorial.

   > ```sqlexample
   > CREATE OR REPLACE DATABASE tutorial_log_trace_db;
   > ```
6. Paste and run the following statement to create a warehouse. The new warehouse is just for this tutorial.

   > ```sqlexample
   > CREATE OR REPLACE WAREHOUSE tutorial_log_trace_wh
   >   WAREHOUSE_TYPE = STANDARD
   >   WAREHOUSE_SIZE = XSMALL;
   > ```

In this section, you put in place the pieces you need for the tutorial. In the next section, you’ll create an event table for storing
log and trace data.

## Create an event table

In this section, you’ll create an event table. As your handler code emits log messages and trace data, Snowflake saves the emitted data in
event table rows. You can query the event table to analyze the data.

You must create an event table to collect log and trace data. An event table always uses the
[predefined structure](../event-table-columns.md) defined by Snowflake.

> **Important:**
>
> To complete this section, you’ll need to be able to use the ACCOUNTADMIN role, which is required when altering an account so that the new event
> table is the account’s active event table.

To create the event table, you must use a role with the CREATE EVENT TABLE privilege assigned.

To create the event table and make it the active event table for the account:

1. Paste and run the following statement to create an event table.

   > ```sqlexample
   > CREATE OR REPLACE EVENT TABLE tutorial_event_table;
   > ```
   >
   > This table is where Snowflake stores log and trace data.
2. Paste and run the following statement to alter the account so that the event table you created is the active one for the account.

   > ```sqlexample
   > USE ROLE ACCOUNTADMIN;
   >
   > ALTER ACCOUNT SET EVENT_TABLE = tutorial_log_trace_db.public.tutorial_event_table;
   > ```

   This statement sets the new event table as the table that Snowflake should use for storing log messages and trace data from handlers
   in the current account. You can have only one active event table for an account.

In this section, you created an event table. In the next section, you’ll start emitting log messages that Snowflake stores in the table.

## Emit log messages

In this section, you’ll create a user-defined function (UDF) with Python handler code that emits log messages. As your code emits log
messages, Snowflake collects the message data and stores it in the event table you created.

Snowflake supports APIs to log messages from each supported handler language. For handlers you write in Python, you can use the
`logging` module in Python’s standard library.

To create a UDF that emits log messages:

1. Paste and run the following statement to set the log level to `INFO`.

   > ```sqlexample
   > ALTER SESSION SET LOG_LEVEL = INFO;
   > ```

   This specifies the severity of log messages that Snowflake should capture as the UDF runs. In this case, the level permits all
   messages ranging from informational (`INFO`) to the most severe (`FATAL`).
2. Paste and run the following statement to create a user-defined function.

   > ```sqlexample-python
   > CREATE OR REPLACE FUNCTION log_trace_data()
   > RETURNS VARCHAR
   > LANGUAGE PYTHON
   > RUNTIME_VERSION = 3.12
   > HANDLER = 'run'
   > AS $$
   > import logging
   > logger = logging.getLogger("tutorial_logger")
   >
   > def run():
   >   logger.info("Logging from Python function.")
   >   return "SUCCESS"
   > $$;
   > ```

   Highlighted lines in the code do the following:

   > * Import the Python `logging` module so that the handler code can use it.
   > * Create a logger, which exposes the interface your code will use to log messages.
   > * Log a message at the `INFO` level.
3. Paste and run the following statement to execute the function you just created.

   > ```sqlexample
   > SELECT log_trace_data();
   > ```
   >
   > This produces the following output. In addition, as the function executed, it emitted a log message that Snowflake collected in the
   > event table.
   >
   > ```output
   > --------------------
   > | LOG_TRACE_DATA() |
   > --------------------
   > | SUCCESS          |
   > --------------------
   > ```

In this section, you emitted a log message from a UDF. In the next section, you’ll query the event table to retrieve data related to the message.

## Query for log messages

In this section, you’ll query the event table for log message data emitted by the UDF you ran in the previous section.

> **Note:**
>
> It can take several seconds for log or trace data emitted by handler code to be recorded in the event table. If you don’t see
> results immediately, try again in a few seconds.

Snowflake uses [predefined event table columns](../event-table-columns.md) to collect and store log and
trace data of the following kinds:

* **Data you emit from handler code**, such as log messages and trace event data.

  You’ll find these in columns such as RECORD_TYPE, RECORD, RECORD_ATTRIBUTES, and others.
* **Data about the context** in which the log or trace data was emitted, such as the timestamp, name of the handler method from which the data
  was emitted, and so on.

  You’ll find this data in columns such as RESOURCE_ATTRIBUTES, TIMESTAMP, and SCOPE.

To query the event table for log message data:

1. Paste and run the following statement to query the event table.

   > ```sqlexample
   > SELECT
   >   TIMESTAMP AS time,
   >   RESOURCE_ATTRIBUTES['snow.executable.name'] as executable,
   >   RECORD['severity_text'] AS severity,
   >   VALUE AS message
   > FROM
   >   tutorial_log_trace_db.public.tutorial_event_table
   > WHERE
   >   RECORD_TYPE = 'LOG'
   >   AND SCOPE['name'] = 'tutorial_logger';
   > ```

   Some columns contain structured data expressed as key-value pairs. In this query, you specify attribute keys within a column by using
   [bracket notation](../../../user-guide/querying-semistructured.md) such as `RECORD['severity_text']`.

   You also use bracket notation (`SCOPE['name']`) to specify that you want to select column values only where the log entries are
   emitted with the Python logger, `tutorial_logger`, you created in handler code.
2. View the output.

   > ```output
   > -----------------------------------------------------------------------------------------------------------
   > | TIME                | EXECUTABLE                           | SEVERITY | MESSAGE                         |
   > -----------------------------------------------------------------------------------------------------------
   > | 2023-04-19 22:00:49 | "LOG_TRACE_DATA():VARCHAR(16777216)" | "INFO"   | "Logging from Python function." |
   > -----------------------------------------------------------------------------------------------------------
   > ```

   The output illustrates how the [event table’s predefined columns](../event-table-columns.md) each
   contain parts of the collected data. For the `EXECUTABLE` and `SEVERITY` values, you’ve used bracket notation
   to specify the attributes whose values you want.

   > | Output Column | Description |
   > | --- | --- |
   > | TIME | The time the entry was created (from the TIMESTAMP column). |
   > | EXECUTABLE | UDF name and parameters (from the RESOURCE_ATTRIBUTES column’s `snow.executable.name` attribute). |
   > | SEVERITY | Log entry severity (from the RECORD column’s `severity_text` attribute). |
   > | MESSAGE | Log message (from the VALUE column). |

In this section, you used a SELECT statement to query for log data. In the next section, you’ll update the UDF so that it emits trace data.

## Emit trace data

In this section, you’ll update the UDF handler code so that it also emits trace data. As your code emits trace data, Snowflake collects
the data and stores it in the event table you created.

Trace data has structural qualities, including event data grouped into spans and data captured as key-value pairs, that let
you assemble a more detailed picture of your code’s activity than log data typically allows.

Snowflake supports APIs to emit trace data from each supported handler language. For handlers you write in Python, you can use the
Snowflake `telemetry` package.

To update the UDF to emit trace data:

1. Paste and run the following statement to specify what trace data should be captured.

   > ```sqlexample
   > ALTER SESSION SET TRACE_LEVEL = ON_EVENT;
   > ```

   This sets the trace level to `ON_EVENT`. This specifies that only trace data emitted explicitly by your own code should be
   captured.
2. Paste and run the following statement to create a UDF that emits trace data.

   > ```sqlexample-python
   > CREATE OR REPLACE FUNCTION log_trace_data()
   > RETURNS VARCHAR
   > LANGUAGE PYTHON
   > RUNTIME_VERSION = 3.12
   > HANDLER = 'run'
   > AS $$
   > import logging
   > logger = logging.getLogger("tutorial_logger")
   > from snowflake import telemetry
   >
   > def run():
   >   telemetry.set_span_attribute("example.proc.run", "begin")
   >   telemetry.add_event("event_with_attributes", {"example.key1": "value1", "example.key2": "value2"})
   >   logger.info("Logging from Python function.")
   >   return "SUCCESS"
   > $$;
   > ```

   By running this code, you’re replacing the function you created earlier with one that adds code for emitting trace data. The highlighted
   lines do the following:

   * Import the `telemetry` package so you can call its functions.
   * Set an attribute and attribute value to the span that Snowflake creates when the code runs.

     A span represents a procedure’s or UDF’s execution unit, within which you can add multiple events.
   * Add an event (with its own attributes) to record as part of the span.
3. Paste and run the following statement to execute the function you just created.

   > ```sqlexample
   > SELECT log_trace_data();
   > ```
   >
   > This produces the following output. In addition, as the function executed, it emitted trace data that Snowflake collected in the
   > event table.
   >
   > ```output
   > --------------------
   > | LOG_TRACE_DATA() |
   > --------------------
   > | SUCCESS          |
   > --------------------
   > ```

In this section, you emitted trace data from a UDF. In the next section, you’ll query the event table to retrieve data related to the trace.

## Query for trace messages

In this section, you’ll query the event table for trace data emitted by the UDF you ran in the previous section.

> **Note:**
>
> It can take several seconds for log or trace data emitted by handler code to be recorded in the event table. If you don’t see
> results immediately, try again in a few seconds.

The query you write will retrieve contextual information about events emitted by the function. That context includes the name of the
function that emitted it.

To query the event table for trace data:

1. Paste and run the following statement to query the event table for trace data.

   > ```sqlexample
   > SELECT
   >   TIMESTAMP AS time,
   >   RESOURCE_ATTRIBUTES['snow.executable.name'] AS handler_name,
   >   RECORD['name'] AS event_name,
   >   RECORD_ATTRIBUTES AS attributes
   > FROM
   >   tutorial_log_trace_db.public.tutorial_event_table
   > WHERE
   >   RECORD_TYPE = 'SPAN_EVENT'
   >   AND HANDLER_NAME LIKE 'LOG_TRACE_DATA%';
   > ```

   Some columns contain structured data expressed as key-value pairs. For these, you can select attribute values within a column by using
   [bracket notation](../../../user-guide/querying-semistructured.md), as shown in the code.
2. View the output.

   > ```output
   > -----------------------------------------------------------------------------------------------------------------------------------------------------
   > | TIME                    | HANDLER_NAME                         | EVENT_NAME              | ATTRIBUTES                                             |
   > -----------------------------------------------------------------------------------------------------------------------------------------------------
   > | 2023-05-10 20:49:35.080 | "LOG_TRACE_DATA():VARCHAR(16777216)" | "event_with_attributes" | { "example.key1": "value1", "example.key2": "value2" } |
   > -----------------------------------------------------------------------------------------------------------------------------------------------------
   > ```

   The output illustrates how the [event table’s predefined columns](../event-table-columns.md) each
   contain parts of the collected data. For the `EXECUTABLE` and `SEVERITY` values, you’ve used bracket notation
   to specify the attribute whose value you want.

   > | Output Column | Description |
   > | --- | --- |
   > | TIME | Time the entry was created (from the TIMESTAMP column). |
   > | HANDLER_NAME | UDF name and parameters (from the RESOURCE_ATTRIBUTES column’s `snow.executable.name` attribute). |
   > | EVENT_NAME | Name of the event added with the `add_event` function (from the RECORD column’s `name` attribute). |
   > | ATTRIBUTES | Attributes added to accompany the event (from the RECORD_ATTRIBUTES column). |

In this section, you queried the event table for trace data emitted by the UDF you wrote. In the last section, you’ll get links to information
related to the things you did during the tutorial.

## Learn more

You finished! Nicely done.

In this tutorial, you got an end-to-end view of how you can emit and store log and trace data from handler code, then query the stored data.
Along the way, you:

* **Created an event table.** For information related to event tables, see the following:

  + For more detail on setting up an event table, see [Event table overview](../event-table-setting-up.md).
  + For reference information about the columns that make up an event table, see
    [Event table columns](../event-table-columns.md).
  + For more on things you can do with event tables, see [Working with event tables](../event-table-operations.md).
* **Created a user-defined function (UDF)** that emitted log and trace data. For related information, see the following:

  + For an overview of logging support in Snowflake, see [Logging messages from functions and procedures](../logging.md). For specific about
    logging with Python, see [Logging messages from functions and procedures](../logging.md) and the
    [logging](https://docs.python.org/library/logging.html) module in Python’s standard library.
  + For details on setting levels, see [Setting levels for logging, metrics, and tracing](../telemetry-levels.md).
  + For an overview of tracing support, see [Trace events for functions and procedures](../tracing.md). For specific about tracing with Python,
    see [Emitting trace events in Python](../tracing-python.md).
  + For general information on creating UDFs, see [User-defined functions overview](../../udf/udf-overview.md).
* **Queried the event table** for log and trace data. For information related to event tables, see the following:

  + For a more complete view of how to query for log data, see [Viewing log messages](../logging-accessing-messages.md).
  + For a view of how to query for trace data, see [Viewing trace data](../tracing-accessing-events.md).
  + For more information on spans and events, along with information how Snowflake stores data for them, see
    [How Snowflake represents trace events](../tracing-how-events-work.md).

---
title: Tutorial: Getting started with Declarative Native Apps
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/tutorials/getting-started.md
section: Developer Guide
---

App Development

Declarative Sharing

SnowCLI

# Tutorial: Getting started with Declarative Native Apps

## Introduction

This tutorial takes Snowflake data providers through the process of creating, publishing, and accessing a Snowflake Declarative Native App.
The tutorial uses [SnowCLI](../../snowflake-cli/index.md) as well
as a provided notebook file and a partially complete manifest file to create a Declarative Native App.

This tutorial includes two personas:

* **Provider**: The provider creates a Declarative Native App, creates a listing for it, and shares it with a consumer
* **Consumer**: The consumer installs the Declarative Native App and uses its features and functionality.

### What you’ll learn

As a provider, you will learn how to:

* Create a manifest that declares the data and logic of a Declarative Native App.
* Package and test the app locally.
* Create and share a listing for the app that a consumer can see.

and as a consumer:

* Install a Declarative Native App listing into a test consumer account and explore its features.

### Prerequisites

Before getting started, make sure that you meet the following requirements:

* You are familiar with YAML. YAML is the language used to define the manifest of a Declarative Native App.

  If you are not familiar with YAML, see <https://yaml.org/spec/>.
* You have installed the SnowCLI command-line interface.

  SnowCLI allows you to manage Snowflake objects and perform various tasks.

  For more information on SnowCLI installation, see the [Installing Snowflake CLI](../../snowflake-cli/installation/installation.md).
* You’ll require access to two Snowflake accounts:

  + **Provider account**, used to create and publish the Declarative Native App.
    This account should have the necessary privileges to create and manage Snowflake objects
    such as databases, schemas, tables, and virtual warehouses.
  + **Consumer account**, A separate test account representing a consumer,
    used to test the Declarative Native App consumer experience.
    This account should have the necessary privileges to install apps and access shared data.

Each section in the tutorial specifies whether the steps should be completed using the provider or consumer account.

In addition, you need to do the following before you start the tutorial:

* Download sample files provided for this exercise.
* Create a database, and tables for this tutorial.

  These are the basic Snowflake objects needed for most Snowflake activities.

## Preparing the tutorial environment

The tutorial provides sample data files and instructions for setting up your local environment.

### Downloading the sample data files

For this tutorial, download the sample data files provided by Snowflake.

To download and unzip the sample data files:

1. Click the name of the
   archive file, [`tutorial-getting-started.zip`](../../../_downloads/db8742917abab0460cf11bc3994f9513/tutorial-getting-started.zip)
   and save the link/file to your local file system.
2. Unzip the sample files. The tutorial assumes you unpacked files in to the following directories:

> * Linux/macOS: `/tmp/tutorial`
> * Windows: `C:temp\tutorial`

For example to unzip the file on Linux/macOS, assuming they were downloaded to your
`Downloads` folder execute the following command:

```bash
unzip /tmp/tutorial-getting-started.zip -d /tmp/tutorial
```

These files include:

* SQL files for creating all required artifacts. These files can be used to speed the process of setting up and tearing down your tutorial environment.
* A notebook file that contains the logic for the Declarative Native App.
* A manifest file that contains the metadata for the Declarative Native App, which you will need to make minor modifications to during the tutorial.
* A sample configuration file for SnowCLI, which you can use to configure your Snowflake connection.

### Snowflake CLI configuration

The [Snowflake CLI](../../snowflake-cli/index.md) tool is
required to build, deploy, and install the application in this tutorial.
If you do not have Snowflake CLI on your machine, install it as per instructions
available in [Installing Snowflake CLI](../../snowflake-cli/installation/installation.md).

After SnowCLI is installed, you need to configure a connection to Snowflake in your
[configuration file](../../snowflake-cli/connecting/configure-cli.md). This tutorial uses `connections.toml`.
`connections.toml` can be found in:

* macOS: `~/.snowflake/connections.toml`
* Windows: `%USERPROFILE%\.snowflake\connections.toml`.

For more information on configuring SnowCLI connections, see [Define connections](../../snowflake-cli/connecting/configure-connections.md).

To add and test a connection:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the lower-left corner, select your name » Account, and then select View account details.
3. In the Account Details dialog, in the Account tab, copy the Account Identifier value.
4. Using a text editor, open the `connections.toml` file.
5. Add a new connection named `tutorial_connection` to the `connections.toml` file.
   The connection should look similar to:

   ```toml
   [tutorial_connection]
   account = "your_account_identifier"
   user = "your_username"
   password = "your_password"
   authenticator = "snowflake"
   ```
6. Save the `connections.toml` file.
7. Open a command prompt or terminal window and execute the following command to test the connection:

   ```snowcli
   snow connection test
   ```

   If the connection is successful, you should see output similar to:

   ```output
   +-------------------------------------------------------------------+
   | key             | value                                           |
   |-----------------+-------------------------------------------------|
   | Connection name | tutorial_connection                             |
   | Status          | OK                                              |
   | Host            | . . .                                           |
   | Account         | . . .                                           |
   | . . .                                                             |
   +-------------------------------------------------------------------+
   ```
8. Optionally, you can use the following to set the default connection for SnowCLI.
   Setting the default connection allows you to use the connection name without specifying it in every command.

   ```snowcli
   snow connection set-default tutorial_connection
   ```

If you already have a connection configured and would like to use it with the connector, use its
name instead of `tutorial_connection` whenever this connection is used in this tutorial.

> **Note:**
>
> The `tutorial_connection` connection is used throughout this tutorial to refer to the provider account.
> The remainder of this tutorial assumes you have set this as the default connection for SnowCLI.
>
> When connecting to the consumer account, you will use Snowsight.
> For instructions on how to access Snowsight,
> see [Snowflake in 20 minutes: Prerequisites](../../../user-guide/tutorials/snowflake-in-20minutes.md),
> and then return to this tutorial.

## Create Snowflake objects

During this step, you, as a provider, will use SnowCLI and create the required Snowflake objects.

* A database (`DB_TO_SHARE`)
* A schema (`SCHEMA_TO_SHARE`)
* A table (`TABLE_TO_SHARE`).
* Sample data for the table.

At the completion of this tutorial, you’ll remove these objects.

> **Note:**
>
> Commands are shown at the command line using SnowCLI as well as in combination using files.
>
> For example, the following command creates a database named `DB_TO_SHARE` using SnowCLI and the required connection.
>
> > > ```snowcli
> > > snow sql -q "CREATE OR REPLACE DATABASE DB_TO_SHARE"
> > > ```
> >
> > Assuming the same SQL command exists in a text file named `create_db.sql`, you can also run the command using SnowCLI as follows:
> >
> > > ```snowcli
> > > snow sql -f create_db.sql
> > > ```
>
> Assuming you set a default connection, you can also run the command without specifying the connection:
>
> > ```snowcli
> > snow sql -f create_db.sql
> > ```

Review each of the following steps to create the required objects.
Again, as a reminder, a script is provided which can be used to create all the objects in one command.

1. Create the `DB_TO_SHARE` database using the [CREATE DATABASE](../../../sql-reference/sql/create-database.md) command:

   ```snowcli
   snow sql -q "CREATE OR REPLACE DATABASE DB_TO_SHARE;"
   ```
2. Create the `SCHEMA_TO_SHARE` schema using the [CREATE SCHEMA](../../../sql-reference/sql/create-schema.md) command:

   ```snowcli
   snow sql -q "CREATE OR REPLACE SCHEMA DB_TO_SHARE.SCHEMA_TO_SHARE;"
   ```

   > **Note:**
   >
   > The database and schema you just created are now in use for your current session. You can also use the context functions to get this information.
3. Create the table named `TABLE_TO_SHARE` in `SCHEMA_TO_SHARE` using the [CREATE TABLE](../../../sql-reference/sql/create-table.md) command:

   ```snowcli
   snow sql -q "CREATE OR REPLACE TABLE DB_TO_SHARE.SCHEMA_TO_SHARE.TABLE_TO_SHARE (random_number INTEGER); \
       "
   ```
4. Add data to the table.

   To add a row containing a random value use the [INSERT](../../../sql-reference/sql/insert.md) command:

   ```snowcli
   snow sql -q "INSERT INTO DB_TO_SHARE.SCHEMA_TO_SHARE.TABLE_TO_SHARE (random_number) \
       SELECT UNIFORM(1, 100, RANDOM()) AS RANDOMNUMBER;"
   ```

   Validate that data was inserted.

   ```snowcli
   snow sql -q "SELECT * FROM DB_TO_SHARE.SCHEMA_TO_SHARE.TABLE_TO_SHARE;"
   ```

At this point the backing database, schema, and populated table exist and are ready for use.

To create all the objects in one step, you can use the provided `1.create-database-artifacts.sql`.

For example:

```snowcli
snow sql  -f /tmp/tutorial/sql/1.create-database-artifacts.sql
```

## Create and package a Declarative Native App

As a provider, create and package the Declarative Native App. Note that this step uses:

* A provided NOTEBOOK file that contains the logic for the Declarative Native App.
  This notebook contains a single SQL statement that queries the table created in the previous step.

Creating a Declarative Native App involves:

1. Defining a YAML manifest representing the data and logic in the Declarative Native App.
   A starting point for the manifest is provided in the `manifest.yml` file.
2. Creating the Declarative Native App package.
3. Packaging the app with its manifest and the associated notebook.
4. Validating the Declarative Native App package.

See the [Declarative Native App manifest reference](../manifest-reference.md) for a complete list of all required and optional fields.

### Create the application package

To create an app package, you first create an app package project.

SnowsightSnowflake CLI

To use Snowsight to create a new app package:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » App Packages.
3. In the Share Data + Code card, select Create.
4. Enter a name for your app package, and then select Create.

To use SQL to create a new app package, complete the following procedure in Snowflake CLI, where `<DECL_SHARE_APP_PKG>` is the name of the app package to create:

1. Create a new app package using [CREATE APPLICATION PACKAGE](../command-reference.md). Creating the app package also creates a live version of the app package that you can modify.

   ```snowcli
   snow sql -q "CREATE APPLICATION PACKAGE <DECL_SHARE_APP_PKG> TYPE=DATA;"
   ```
2. To verify that your app was created, use [SHOW APPLICATION PACKAGES](../../../sql-reference/sql/show-application-packages.md):

   ```snowcli
   snow sql -q "SHOW APPLICATION PACKAGES LIKE '%<DECL_SHARE_%';"
   ```

   Which will return a result similar to:

   ```output
   +-------------------------------+----------------------+-. . .-+
   | Created_on                    | name                 |       |
   +-------------------------------+----------------------+-. . .-+
   + 2025-06-12 09:40:08.845 -0700 | <DECL_SHARE_APP_PKG> | . . . |
   +-------------------------------+----------------------+-. . .-+
   ```

Once the app package is created, you populate it with the required files.

### Populate the application package

Next, populate the app package with the manifest and notebook files.

SnowsightSnowflake CLI

To use Snowsight to populate the app package:

1. In Snowsight, navigate to the listing for the app package you created in the previous step.
2. Select Manage files, and then select Upload files.
3. Drag the notebook files and the manifest from the `tmp/tutorial/app` folder to the Upload files dialog where indicated, or select Browse to locate and select the files.
4. Select Upload to upload the files to the live stage and trigger a build.

If the build is successful, the Live version tab on the app package’s listing displays the following:

* The last build time
* The contents of the last build, including:

  + A list of the app package’s notebooks
  + A list of shared objects
* A file list of the package’s contents

To use SQL in Snowflake CLI to populate the app package:

1. First, add a notebook using the command below. For more information see [PUT](../../../sql-reference/sql/put.md).

   The notebook file provided with this tutorial contains a single SQL statement that queries the previously created table.

   > **Note:**
   >
   > This command assumes the notebook is named `NOTEBOOK.ipynb` and is located in the `/tmp/tutorial/app` folder. You may need to modify the path to the notebook file based on where you unzipped the tutorial files.

   ```snowcli
   snow sql -q "PUT file:////tmp/tutorial/app/NOTEBOOK.ipynb \
               snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/ \
               OVERWRITE=TRUE AUTO_COMPRESS=false;"
   ```

   This command will return a result similar to:

   ```output
   +----------------------------------------------------------------------------------------------------------------------------+
   | source         | target         | source_size | target_size | source_compression | target_compression | status   | message |
   |----------------+----------------+-------------+-------------+--------------------+--------------------+----------+---------|
   | NOTEBOOK.ipynb | NOTEBOOK.ipynb | 817         | 832         | NONE               | NONE               | UPLOADED |         |
   +----------------------------------------------------------------------------------------------------------------------------+
   ```
2. Add a manifest. Note that this command assumes the manifest is located in the `/tmp/tutorial/app` folder.

   ```snowcli
   snow sql -q "PUT file:////tmp/tutorial/app/manifest.yml \
               snow://package/<DECL_SHARE_APP_PKG>/versions/LIVE/ \
               OVERWRITE=TRUE AUTO_COMPRESS=false;"
   ```

   This command will return a result similar to:

   ```output
   +------------------------------------------------------------------------------------------------------------------------+
   | source       | target       | source_size | target_size | source_compression | target_compression | status   | message |
   |--------------+--------------+-------------+-------------+--------------------+--------------------+----------+---------|
   | manifest.yml | manifest.yml | 359         | 368         | NONE               | NONE               | UPLOADED |         |
   +------------------------------------------------------------------------------------------------------------------------+
   ```
3. Verify the application package contents.

   ```snowcli
   snow sql -q "ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> BUILD;"
   ```

   Which will return a result similar to:

   ```output
   +----------------------------------------------------------------------------------+
   | status                                                                           |
   |----------------------------------------------------------------------------------|
   | Built the live version of APPLICATION PACKAGE <DECL_SHARE_APP_PKG> successfully. |
   +----------------------------------------------------------------------------------+
   ```

To create, package, and test the app in one step, use the provided `2.create-package-build-app.sql` file
containing all SQL commands to create, package, test a Declarative Native App.

```snowcli
snow sql  -f /tmp/tutorial/sql/2.create-package-build-app.sql
```

Once the app package has been populated, you can commit and release it.

### Version and release the app

Once an app package has been created and populated with a notebook and manifest, all that remains is to commit (version) and release it.

SnowsightSnowflake CLI

To release a new version of the app package using Snowsight:

1. Within the app package’s listing, select the Commit & release button.
2. In the confirmation dialog, select Acknowledge & continue.

Once you’ve committed and released the app package, the Latest release tab shows the contents of the release, which is the same as the contents of the last build.

To release a new version of the app package using SQL in Snowflake CLI:

* Use the following command:

  > ```snowcli
  > snow sql -q "ALTER APPLICATION PACKAGE <DECL_SHARE_APP_PKG> RELEASE LIVE VERSION;"
  > ```
  >
  > Which will return a result similar to:
  >
  > ```output
  > +---------------------------------------------------------------------------------------------+
  > | status                                                                                      |
  > |---------------------------------------------------------------------------------------------|
  > | Released LIVE version (VERSION$2) of APPLICATION PACKAGE <DECL_SHARE_APP_PKG> successfully. |
  > +---------------------------------------------------------------------------------------------+
  > ```
  >
  > You can also use the provided `3.release-app.sql` file containing SQL commands to release a Declarative Native App.
  >
  > ```snowcli
  > snow sql  -f /tmp/tutorial/sql/3.release-app.sql
  > ```

## Test a Declarative Native App

Once a Declarative Native App is packaged and released the database and logic it contains can be tested locally.

This section describes how to complete the following tasks:

* Create an app from an application package.
* Create a database from an application package.
* Examine the contents of a database created from an application package.

> **Note:**
>
> These steps are used by providers to test their app before it is published.

### Create an app from an application package

1. Using the [CREATE APPLICATION](../../../sql-reference/sql/create-application.md) command, create an app from an application package using a command similar to:

> ```snowcli
> snow sql -q "CREATE APPLICATION DECL_SHARE FROM APPLICATION PACKAGE <DECL_SHARE_APP_PKG>;"
> ```

### Examine app contents

You can test your Declarative Native App by examining the contents using either SQL commands or the Snowsight.

Snowflake CLISnowsight

```snowcli
snow sql -q "USE DATABASE DECL_SHARE;
             DESCRIBE DATABASE DECL_SHARE; \
             SHOW SCHEMAS IN DATABASE DECL_SHARE; \
             SHOW TABLES IN DATABASE DECL_SHARE; "
```

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. At the top of the navigation menu, select  (Create) » SQL Worksheet.
3. In the worksheet, enter the following command:

   ```sqlexample
   USE DATABASE DECL_SHARE;
   DESCRIBE DATABASE DECL_SHARE;
   SHOW SCHEMAS IN DATABASE DECL_SHARE;
   SHOW TABLES IN DATABASE DECL_SHARE;
   ```

Declarative Native Apps includes a special schema, `APP$UI`, which when in use shows the notebooks contained with the app.

For example:

```sqlexample
USE APP$UI;
SHOW NOTEBOOKS;
```

Use the provided `4.test-locally-app.sql` file which contains commands to examine the content of a Declarative Native App.
In addition the file, `4.test-only-app.sql`, contains commands to test the app but does not attempt to create an database
from the application package.

```snowcli
snow sql  -f /tmp/tutorial/sql/4.test-locally-app.sql
```

## Share your Declarative Native App using a listing

After successfully creating and testing a Declarative Native App, we can create a listing and add the app as a data product for that listing.
Making the app available as a listing allows other Snowflake users to discover and install the app.

This allows you to share your app with other Snowflake users and allows them to install and use the app in their account.

### Create a listing for your app

> **Note:**
>
> The following steps are performed by a provider to create a listing for the Declarative Native App.
> The listing is then used by consumers to install the Declarative Native App.

To create a listing for your app:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the + Create Listing menu, select Specified consumers for selected accounts.
4. Under What’s the title of the listing?, enter `Declarative Sharing Tutorial`.
5. Select Only Specified Consumers. Select Next.
6. Under What’s in the listing, click + Select.
7. Select `DECL_SHARE_APP_PKG`.
8. Optional: Enter a description for your listing.
9. Under Consumer Data Sharing Account ID, provide a valid account identifier to share the listing to.

   > **Note:**
   >
   > The account identifier is a unique identifier for your Snowflake account.
   > It is used to identify your account when sharing data and apps with other Snowflake accounts.
   > To find your account identifier see [Account identifiers](../../../user-guide/admin-account-identifier.md).
10. Select Publish.

### Install the app in a consumer account

> **Important:**
>
> The consumer account installing an app must specify a default warehouse.
>
> To create a warehouse:
>
> 1. In the navigation menu, select Compute » Warehouses.
> 2. Click + Warehouse.
> 3. Configure the warehouse as needed.
>
> To set a default warehouse:
>
> 1. In the lower-left corner, select your name » Settings, and then select Preferences.
> 2. Select warehouse from the Default Warehouse drop-down list.

To install your app from the listing:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select the tile for the listing under Recently shared with you.
4. Select Get.
5. Select Options and enter a name for the app. For this tutorial, use `DeclarativeAppConsumer`.
6. Select the warehouse where you want to install the app.
7. Select Get.
8. Select Open to view your listing or Done to finish.
9. Explore the listing as you would any other listing.

   For more information see [Access content in a Declarative Native App](../consumer/access-app-content.md).

## Summary, clean up, and additional resources

Congratulations! You’ve successfully completed this tutorial.

Take a few minutes to review a short summary and the key points covered in the tutorial.

You might also want to consider cleaning up by dropping any objects you created in the tutorial. Learn more by reviewing other topics in the Snowflake Documentation.

### Summary and key points

In summary, Declarative Native Apps:

* Can be easily used to expose databases, tables, views, and schemas.
* Have a well-defined lifecycle.
* Can be accessed by consumers using [listings](../../../collaboration/collaboration-listings-about.md) or app.

### Cleanup (Optional)

On the consumer account used to install the Declarative Native App, to uninstall the listing, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. In the row for `DeclarativeAppConsumer` on the … menu, select Uninstall.

If the objects you created in this tutorial are no longer needed, you can remove them from the system using the following commands:

As the provider who originally created the objects, run the following commands using SnowCLI

> ```snowcli
> snow sql -q "DROP LISTING IF EXISTS DECLARATIVE_APP_TUTORIAL; \
>              DROP APPLICATION IF EXISTS DECL_SHARE; \
>              DROP APPLICATION PACKAGE IF EXISTS DECL_SHARE_APP_PKG; \
>              DROP TABLE IF EXISTS DB_TO_SHARE.SCHEMA_TO_SHARE.TABLE_TO_SHARE; \
>              DROP SCHEMA IF EXISTS DB_TO_SHARE.SCHEMA_TO_SHARE; \
>              DROP DATABASE IF EXISTS DB_TO_SHARE; "
> ```

For simplicity, you can use the provided `teardown-tutorial.sql` file containing all SQL commands to remove the objects created in this tutorial.

> ```snowcli
> snow sql  -f /tmp/tutorial/sql/teardown-tutorial.sql
> ```

## Learn more

To learn more about Declarative Native Apps, see the following topics:

* [About Declarative Sharing in the Native Application Framework](../about.md)

---
title: Tutorials: Getting started with the Snowflake Python APIs
source: https://docs.snowflake.com/en/developer-guide/snowflake-python-api/overview-tutorials.md
section: Developer Guide
---

# Tutorials: Getting started with the Snowflake Python APIs

With the Snowflake Python APIs, you can use Python to manage Snowflake resource objects. You can create, drop, and alter
tables, schemas, warehouses, tasks, and more, without writing SQL or using the Snowflake Connector for Python.

In the following tutorials, you learn how to get started with the API for object and task management in Snowflake.

## Prerequisites

* A Snowflake account (Note: trial accounts are not supported in [Tutorial 3: Create and manage Snowpark Container Services](tutorials/tutorial-3.md))
* Familiarity with Python, and one of the following supported versions of Python:

  Generally available versions:

  + 3.9 (deprecated)
  + 3.10
  + 3.11
  + 3.12
  + 3.13
* Familiarity with Jupyter notebooks
* A code editor that supports Jupyter notebooks, or the ability to run notebooks in your browser using `jupyter notebook`

## What you’ll learn

* How to install the Snowflake Python APIs library
* How to create a `Root` object to use the API
* How to create tables, schemas, and warehouses using the API
* How to create and manage tasks using the API
* How to create and manage components in Snowpark Container Services using the API

## What you’ll build

* Multiple objects within Snowflake

## Tutorials

The following tutorials provide step-by-step instructions for you to explore the Snowflake Python APIs:

[Common setup for Snowflake Python APIs tutorials](tutorials/common-setup.md)
:   Installation and setup steps for exploring the tutorials

[Tutorial 1: Create a database, schema, table, and warehouse](tutorials/tutorial-1.md)
:   Step-by-step instructions to create a Snowflake database, schema, table, and virtual warehouse

[Tutorial 2: Create and manage tasks and task graphs (DAGs)](tutorials/tutorial-2.md)
:   Step-by-step instructions to create and manage tasks and task graphs

[Tutorial 3: Create and manage Snowpark Container Services](tutorials/tutorial-3.md)
:   Step-by-step instructions to create and manage components in Snowpark Container Services

---
title: Understanding blocks in Snowflake Scripting
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/blocks.md
section: Developer Guide
---

# Understanding blocks in Snowflake Scripting

In Snowflake Scripting, you write procedural code in a Snowflake Scripting block. This topic explains how to write procedural
code in a block.

## Understanding the structure of a block

A block has the following basic structure:

```sqlsyntax
DECLARE
  -- (variable declarations, cursor declarations, etc.) ...
BEGIN
  -- (Snowflake Scripting and SQL statements) ...
EXCEPTION
  -- (statements for handling exceptions) ...
END;
```

A block consists of required and optional sections that are delimited by keywords. Each section serves a different purpose:

* [DECLARE](../../sql-reference/snowflake-scripting/declare.md): If you need to use any variables, cursors, RESULTSETs, or exceptions
  in the block, you can either declare these in the DECLARE section of the block or in the BEGIN … END section
  of the block.

  You can declare:

  + [Variables](variables.md)
  + [Cursors](cursors.md)
  + [RESULTSETs](resultsets.md)
  + [Exceptions](exceptions.md)

  This section of the block is optional.
* [BEGIN … END](../../sql-reference/snowflake-scripting/begin.md): Write SQL statements and Snowflake Scripting constructs in the
  section of the block between BEGIN and END.
* [EXCEPTION](../../sql-reference/snowflake-scripting/exception.md): If you need to add exception handling code, add this to the
  EXCEPTION section of the block.

  This section of the block is optional.

A simple block only requires the keywords BEGIN and END. For example:

```sqlexample
BEGIN
  CREATE TABLE employee (id INTEGER, ...);
  CREATE TABLE dependents (id INTEGER, ...);
END;
```

> **Important:**
>
> The keyword BEGIN that starts a block is different from the keyword BEGIN that starts a transaction.
> To minimize confusion, Snowflake strongly recommends starting transactions with BEGIN TRANSACTION (or the older form
> BEGIN WORK), rather than BEGIN.

Any database objects that you create in a block (e.g. the tables in the example above) can be used outside of the block.

If the code uses variables, you can [declare those variables](variables.md) in the block. One way to do
this is in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section of the block. For example:

```sqlexample
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := pi() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
```

This example declares a variable, uses the variable, and returns the value of the variable. For details on how values are
returned from a block, see [Returning a value](return.md).

These variables cannot be used outside of the block. See [Understanding the scope of declarations](variables.md).

You can also declare a variable in the BEGIN … END section of the block by using
[LET](../../sql-reference/snowflake-scripting/let.md). For details, see [Declaring a variable](variables.md).

## Using a block in a stored procedure

You can use a block in the definition of a stored procedure. The following is an example that you can run in
[Snowsight](../../user-guide/ui-snowsight-gs.md) to create a stored procedure containing a Snowflake Scripting block:

```sqlexample
CREATE OR REPLACE PROCEDURE area()
RETURNS FLOAT
LANGUAGE SQL
AS
DECLARE
  radius FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius := 3;
  area_of_circle := PI() * radius * radius;
  RETURN area_of_circle;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE area()
RETURNS FLOAT
LANGUAGE SQL
AS
$$
DECLARE
  radius FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius := 3;
  area_of_circle := PI() * radius * radius;
  RETURN area_of_circle;
END;
$$
;
```

You can call a stored procedure using the [CALL](../../sql-reference/sql/call.md) command. The following example calls
the stored procedure `area` in the previous example:

```sqlexample
CALL area();
```

The stored procedure returns the following output:

```output
+--------------+
|         AREA |
|--------------|
| 28.274333882 |
+--------------+
```

For more information, see [Writing stored procedures in Snowflake Scripting](../stored-procedure/stored-procedures-snowflake-scripting.md).

## Using a block in a user-defined function (UDF)

You can use a block in the definition of a Snowflake Scripting UDF. The following example shows code that you can run in
[Snowsight](../../user-guide/ui-snowsight-gs.md) to create a UDF that contains a Snowflake Scripting block:

```sqlexample
CREATE OR REPLACE FUNCTION area()
RETURNS FLOAT
LANGUAGE SQL
AS
DECLARE
  radius FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius := 3;
  area_of_circle := PI() * radius * radius;
  RETURN area_of_circle;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE FUNCTION area()
RETURNS FLOAT
LANGUAGE SQL
AS
$$
DECLARE
  radius FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius := 3;
  area_of_circle := PI() * radius * radius;
  RETURN area_of_circle;
END;
$$;
```

You can call the function in a SQL statement, such as a SELECT or INSERT statement. The following example calls
the Snowflake Scripting UDF `area` in the previous example in a SELECT statement:

```sqlexample
SELECT area();
```

```output
+--------------+
|       AREA() |
|--------------|
| 28.274333882 |
+--------------+
```

For more information, see [Snowflake Scripting UDFs](../udf/sql/udf-sql-procedural-functions.md).

## Using an anonymous block

If you want to run procedural code outside of a stored procedure or UDF, you can define and use an *anonymous block*. An
anonymous block is a block that is not part of a stored procedure or UDF. You define the block as a separate, standalone SQL statement.

The [BEGIN](../../sql-reference/snowflake-scripting/begin.md) statement that defines the block also executes the block. (You don’t
run a separate CALL command to execute the block.)

The following is an example of an anonymous block that you can run in
[Snowsight](../../user-guide/ui-snowsight-gs.md):

```sqlexample
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := PI() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := PI() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
$$
;
```

The example produces the following output:

```output
+-----------------+
| anonymous block |
|-----------------|
|    28.274333882 |
+-----------------+
```

The column header in the output is `anonymous block`. If the code had been executed in a stored procedure, the
column header would have been the name of the stored procedure.

The following example defines an anonymous block that creates two tables that are related. In this example, the block of
procedural code does not need to use variables, so the DECLARE section of the block is omitted.

```sqlexample
BEGIN
  CREATE TABLE parent (ID INTEGER);
  CREATE TABLE child (ID INTEGER, parent_ID INTEGER);
  RETURN 'Completed';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
    CREATE TABLE parent (ID INTEGER);
    CREATE TABLE child (ID INTEGER, parent_ID INTEGER);
    RETURN 'Completed';
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| Completed       |
+-----------------+
```

---
title: Understanding caller’s rights and owner’s rights stored procedures
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-rights.md
section: Developer Guide
---

# Understanding caller’s rights and owner’s rights stored procedures

A stored procedure runs with either the caller’s rights or the owner’s rights. It cannot run with
both at the same time. This topic describes the differences between a caller’s rights stored procedure and an
owner’s rights stored procedure.

## Introduction

A caller’s rights stored procedure runs with the privileges of the caller. The primary advantage of a caller’s
rights stored procedure is that it can access information about that caller or about the caller’s current session.
For example, a caller’s rights stored procedure can read the caller’s session variables and use them in a query.

An owner’s rights stored procedure runs mostly with the privileges of the stored procedure’s owner.
The primary advantage of an owner’s rights stored procedure is that
the owner can delegate specific administrative tasks, such as cleaning up old data,
to another role without granting that role more general privileges, such as privileges to delete all data from a
specific table.

At the time that the stored procedure is created, the creator specifies whether the procedure runs with owner’s rights
or caller’s rights. The default is owner’s rights.

The owner can change the procedure from an owner’s rights stored procedure to a caller’s rights stored procedure
(or vice-versa) by executing an [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md) command.

> **Tip:**
>
> Use [references](stored-procedures-calling-references.md) to allow operations on
> specific SQL objects or queries to be performed using the caller’s rights, even when the stored procedure
> runs with owner’s rights.

## Privileges on database objects

A caller’s rights stored procedure runs with the database privileges of the role that called the stored procedure.
Any statement that the caller could not execute outside the stored procedure cannot be executed inside the stored
procedure, either. For example, if the role named “Nurse” does not have privileges to delete rows
from the `medical_records` table, then if a user with the role “Nurse” calls a caller’s rights stored
procedure that tries to delete rows from that table, the stored procedure will fail.

An owner’s rights procedure runs with the rights of the procedure owner. This means that if the owner has the
privileges to perform a task, then the stored procedure can perform that task even when called by a role that
does not have privileges to perform that task directly. For example, if the role named “Doctor” has the database
privileges to delete rows from the `medical_records` table, and the “Doctor” role creates a stored procedure
that deletes rows older than 7 years from that table, then if the “Doctor” role grants the “Nurse” role appropriate
privileges on the stored procedure, then the “Nurse” role can run the stored procedure (and delete old rows from the
table via that stored procedure), even if the “Nurse” role doesn’t have delete privileges on the table.

> **Tip:**
>
> If you need an owner’s rights stored procedure to perform actions on a table, view, or function that the caller has the
> privileges to access, you can have the caller pass a reference to that table, view, or function.
>
> For details, refer to [Passing references for objects and queries to stored procedures](stored-procedures-calling-references.md).

## Accessing and setting the session state

As with other SQL statements, a `CALL` statement runs within a session, and inherits context from that session,
such as session-level variables, current database, etc. The exact context that the procedure inherits depends upon
whether the stored procedure is a caller’s rights stored procedure or an owner’s rights stored procedure.

If a caller’s rights stored procedure makes changes to the session, those changes can persist after the end of the
`CALL`. Owner’s rights stored procedures are not permitted to change session state.

## Caller’s rights stored procedures

Caller’s rights stored procedures adhere to the following rules within a session:

* Run with the privileges of the caller, not the privileges of the owner.
* Inherit the current warehouse of the caller.
* Use the database and schema that the caller is currently using.
* Can view, set, and unset the caller’s session variables.
* Can view, set, and unset the caller’s session parameters.

### Running with a subset of caller’s rights

You might want a stored procedure to run with caller’s rights, but only a subset of those rights. An administrator can use caller
grants to control which of the caller’s rights the stored procedure can run with. For more information, see
[Restricted caller’s rights](../restricted-callers-rights.md).

### Session variables for caller’s rights procedures

Suppose that the stored procedure named `MyProcedure` executes SQL statements that read and set session-level
variables. In this example, the details of the read and set commands are not important, so the statements are
represented as pseudo-code:

* `READ SESSION_VAR1`
* `SET SESSION_VAR2`

The stored procedure looks similar to the following pseudocode:

```sqlexample
CREATE PROCEDURE MyProcedure()
...
$$
   READ SESSION_VAR1;
   SET SESSION_VAR2;
$$
;
```

Suppose that you execute the following sequence of statements in the same session:

```sqlexample
SET SESSION_VAR1 = 'some interesting value';
CALL MyProcedure();
SELECT *
  FROM table1
  WHERE column1 = $SESSION_VAR2;
```

This is equivalent to executing the following sequence:

```sqlexample
SET SESSION_VAR1 = 'some interesting value';
READ SESSION_VAR1;
SET SESSION_VAR2;
SELECT *
  FROM table1
  WHERE column1 = $SESSION_VAR2;
```

In other words:

* The stored procedure can see the variable that was set by statements before the procedure was called.
* The statements after the stored procedure can see the variable that was set inside the procedure.

For a complete example that does not rely on pseudo-code, see [Using session variables with caller’s rights and owner’s rights stored procedures](stored-procedures-javascript.md) (in this topic).

In many stored procedures, you want to inherit context information such as the current database and the current
session-level variables.

However, in some cases, you might want your stored procedure to be more isolated. For example, if your stored
procedure sets a session-level variable, you might not want the session-level variable to influence future statements
outside your stored procedure.

To better isolate your stored procedure from the rest of your session:

* Avoid using session-level variables directly. Instead, pass them as explicit parameters. This forces the caller to
  think about exactly which session-level variables the stored procedure will use.
* Clean up any session-level variables that you set inside the stored procedure (and use names that are not likely to
  be used anywhere else, so that you don’t accidentally clean up a session variable that existed prior to the stored
  procedure call).

The following stored procedure uses the value of a session variable by receiving it as a parameter, not
by using the session variable directly:

```sqlexample-javascript
SET Variable_1 = 49;

CREATE PROCEDURE sv_proc2(PARAMETER_1 FLOAT)
  RETURNS VARCHAR
  LANGUAGE JAVASCRIPT
  AS
  $$
    var rs = snowflake.execute( {sqlText: "SELECT 2 * " + PARAMETER_1} );
    rs.next();
    var MyString = rs.getColumnValue(1);
    return MyString;
  $$
  ;

CALL sv_proc2($Variable_1);
```

The following stored procedure creates a temporary session variable with an unusual name and cleans up
that variable before the stored procedure finishes. When a statement after the procedure call tries to use the session
variable that was cleaned up, that statement will fail:

```sqlexample-javascript
CREATE PROCEDURE sv_proc1()
  RETURNS VARCHAR
  LANGUAGE JAVASCRIPT
  EXECUTE AS CALLER
  AS
  $$
    var rs = snowflake.execute( {sqlText: "SET SESSION_VAR_ZYXW = 51"} );

    var rs = snowflake.execute( {sqlText: "SELECT 2 * $SESSION_VAR_ZYXW"} );
    rs.next();
    var MyString = rs.getColumnValue(1);

    rs = snowflake.execute( {sqlText: "UNSET SESSION_VAR_ZYXW"} );

    return MyString;
  $$
  ;

CALL sv_proc1();

-- This fails because SESSION_VAR_ZYXW is no longer defined.
SELECT $SESSION_VAR_ZYXW;
```

> **Note:**
>
> If you program in the C language (or similar languages such as Java), note that session variables you set
> inside a stored procedure are not like the local variables in C that disappear when a C function
> finishes running. Isolating your stored procedure from its environment requires more effort in SQL than in C.

## Owner’s rights stored procedures

Owner’s rights stored procedures adhere to the following rules within a session:

* Run with the privileges of the owner, not the privileges of the caller.

  > **Tip:**
  >
  > If you need an owner’s rights stored procedure to perform actions on a table, view, or function that the caller has the
  > privileges to access, you can have the caller pass a reference to that table, view, or function.
  >
  > For details, refer to [Passing references for objects and queries to stored procedures](stored-procedures-calling-references.md).
* Inherit the current warehouse of the caller.
* Use the database and schema that the stored procedure is created in, not the database and schema that the
  caller is currently using.
* Cannot access most caller-specific information. For example:

  + Cannot view, set, or unset the caller’s session variables.
  + Can use only a subset of session parameters set by the caller. For example, SQL commands that output date values can use the
    [DATE_OUTPUT_FORMAT](../../sql-reference/parameters.md) parameter that is set for the caller’s session).

    For the list of these parameters, see Understanding the effects of a caller’s session parameters on an owner’s rights procedure.
  + Cannot set or unset any of the caller’s session parameters.
* Do not allow non-owners to view information about the procedure from the
  [PROCEDURES](../../sql-reference/info-schema/procedures.md) view.

Restrictions on session variables and
session parameters are described in more detail below.

### Session variables for owner’s rights procedures

A stored procedure does not have access to [SQL variables](../../sql-reference/session-variables.md) created outside the stored
procedure. This restriction prevents a stored procedure written or owned by one user from reading SQL
variables created by another user (the stored procedure caller).

If your stored procedure needs values that are stored in the current session’s SQL variables, then the values in those
variables should be passed as explicit arguments to the stored procedure. For example:

```sqlexample
SET PROVINCE = 'Manitoba';
CALL MyProcedure($PROVINCE);
```

### Understanding the effects of a caller’s session parameters on an owner’s rights procedure

The value of a session [parameter](../../sql-reference/parameters.md) can affect the behavior of commands and functions. For
example, commands that output date values use the format specified by the [DATE_OUTPUT_FORMAT](../../sql-reference/parameters.md) session parameter.

In a caller’s session, the caller can set or override a session parameter. In a caller’s rights stored procedure, session
parameters can affect the execution of any queries and expressions executed inside the procedure. For example, the
[TIMESTAMP_OUTPUT_FORMAT](../../sql-reference/parameters.md) parameter affects the output format of a child query like
`select current_timestamp::string`.

However, for an owner’s rights stored procedure, the values from the caller’s session are used only for the following
parameters:

* AUTOCOMMIT
* BINARY_INPUT_FORMAT
* BINARY_OUTPUT_FORMAT
* DATE_INPUT_FORMAT
* DATE_OUTPUT_FORMAT
* ENABLE_UNLOAD_PHYSICAL_TYPE_OPTIMIZATION
* ERROR_ON_NONDETERMINISTIC_MERGE
* ERROR_ON_NONDETERMINISTIC_UPDATE
* JDBC_TREAT_DECIMAL_AS_INT
* JSON_INDENT
* LOCK_TIMEOUT
* MAX_CONCURRENCY_LEVEL
* ODBC_USE_CUSTOM_SQL_DATA_TYPES
* PERIODIC_DATA_REKEYING
* QUERY_TAG
* QUERY_WAREHOUSE_NAME
* ROWS_PER_RESULTSET
* STATEMENT_QUEUED_TIMEOUT_IN_SECONDS
* STATEMENT_TIMEOUT_IN_SECONDS
* STRICT_JSON_OUTPUT
* TIMESTAMP_DAY_IS_ALWAYS_24H
* TIMESTAMP_INPUT_FORMAT
* TIMESTAMP_LTZ_OUTPUT_FORMAT
* TIMESTAMP_NTZ_OUTPUT_FORMAT
* TIMESTAMP_OUTPUT_FORMAT
* TIMESTAMP_TYPE_MAPPING
* TIMESTAMP_TZ_OUTPUT_FORMAT
* TIMEZONE
* TIME_INPUT_FORMAT
* TIME_OUTPUT_FORMAT
* TRANSACTION_ABORT_ON_ERROR
* TRANSACTION_DEFAULT_ISOLATION_LEVEL
* TWO_DIGIT_CENTURY_START
* UNSUPPORTED_DDL_ACTION
* USE_CACHED_RESULT
* WEEK_OF_YEAR_POLICY
* WEEK_START

> **Note:**
>
> This list might change over time.

For other parameters (not listed above):

* The value of the owner’s account-level parameter is used.
* If the account-level parameter is not set for owner’s account, the default value for the account parameter is used.

This restriction is in place to avoid potential issues that could occur if an owner’s rights stored procedure used the caller’s
session parameters. For example:

* If the author (owner) of a stored procedure has set a specific session parameter, but callers of the stored procedure have not
  set that parameter, the stored procedure might fail or behave differently when called by users other than the author.
* If a stored procedure were allowed to use the value of any session parameter set by the caller, the owner of a stored procedure
  might be able to determine those values without the caller’s knowledge.

## Additional restrictions on owner’s rights stored procedures

Owner’s rights stored procedures have several additional restrictions, besides the restrictions related to
session variables and session parameters. These restrictions affect the following:

* The built-in functions that can be called from inside a stored procedure.
* Ability to execute ALTER USER statements.
* Monitoring stored procedures at execution time.
* LIST command.
* The types of SQL statements that can be called from inside a stored procedure.

The following sections explain these restrictions in more detail.

> **Note:**
>
> Most restrictions on an owner’s rights stored procedure apply to all callers, including the owner.

### Restrictions on built-in functions

If a stored procedure is created as an owner’s rights stored procedure, then callers (other than the owner)
cannot call the following built-in functions:

* GET_DDL()

  This prevents users other than the stored procedure owner from viewing the source code of the stored procedure.
* SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE()
* SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE()

### ALTER USER

In the handler for an owner’s rights procedure, you *cannot* execute [ALTER USER](../../sql-reference/sql/alter-user.md) statements that implicitly
use the current user for the session.

However, you *can* execute ALTER USER statements that explicitly identify the user, as long as the user is not the current user.

### Monitoring stored procedures at execution time

Neither the owner nor the caller of an owner’s rights stored procedure necessarily has privileges to monitor
execution of the stored procedure.

A user with the WAREHOUSE MONITOR privilege can monitor execution of the individual warehouse-related SQL
statements within that stored procedure. Most queries and DML statements are warehouse-related statements.
DDL statements, such as CREATE, ALTER, etc. do not use the warehouse and cannot be monitored as part of
monitoring stored procedures.

### SHOW and DESCRIBE commands

An owner’s rights stored procedure can use any SHOW or DESCRIBE command that
does not return information about the current session or rely on the current user.

The following SHOW commands read from the current session,
and as a result are not permitted:

* SHOW PARAMETERS [ { IN | FOR } SESSION ]
* SHOW VARIABLES

The following commands return output that depend on the current user, and
as a result are not permitted:

* SHOW GRANTS commands that do not have an IN, ON, TO, or OF clause. Using SHOW GRANTS
  without one of these clauses implicitly references the current user.
* SHOW LOCKS
* SHOW TRANSACTIONS
* SHOW DELEGATED AUTHORIZATIONS

### LIST command

You can’t use the LIST command in the [Snowflake Scripting](stored-procedures-snowflake-scripting.md)
or [JavaScript](stored-procedures-javascript.md) handler code of an owner’s rights stored procedure,
regardless of the procedure’s owner and caller. You *can* use LIST when the procedure handler is written in one of the other handler
languages.

### Restrictions on SQL statements

Although caller’s rights stored procedures can execute any SQL statement that the caller has sufficient privileges
to execute outside a stored procedure, owner’s rights stored procedures can call only a subset of SQL statements.

The following SQL statements can be called from inside an owner’s rights stored procedure:

* SELECT
* DML
* DDL (see above for restrictions on the ALTER USER statement)
* GRANT/REVOKE
* Variable assignment
* DESCRIBE and SHOW (see limitations documented above)
* LIST (see limitations documented above)

Other SQL statements cannot be called from inside an owner’s rights stored procedure.

## Nested stored procedures with different rights

If an owner’s rights stored procedure is called by a caller’s rights stored procedure, or vice-versa, the following
rules apply:

* A stored procedure behaves as a caller’s rights stored procedure if and only if the procedure and the entire
  call hierarchy above it are caller’s rights stored procedures.
* An owner’s rights stored procedure always behaves as an owner’s rights stored procedure, no matter where it was
  called from.
* Any stored procedure called directly or indirectly from an owner’s rights stored procedure behaves as an owner’s
  rights stored procedure.

## Choosing between owner’s rights and caller’s rights

Create a stored procedure as an owner’s rights stored procedure if all of the following are true:

* You want to delegate a task(s) to another user(s) who will run with the owner’s privileges, not the caller’s own
  privileges.

  For example, if you want a user without DELETE privilege on a table to be able to call a stored procedure that deletes old data,
  but not current data, then you probably want to use an owner’s rights stored procedure. That procedure will contain a DELETE
  statement that includes a filter (a WHERE clause) to control which data can be deleted through the filter.

  > **Tip:**
  >
  > If you need an owner’s rights stored procedure to perform actions on a table, view, or function that the caller has the
  > privileges to access, you can have the caller pass a reference to that table, view, or function.
  >
  > For details, refer to [Passing references for objects and queries to stored procedures](stored-procedures-calling-references.md).
* The restrictions in owner’s rights stored procedures will not prevent the stored procedure from working properly.

Create a stored procedure as a caller’s rights stored procedure if the following are true:

* The stored procedure operates only on objects that the caller owns or has the required privileges on.
* The restrictions in owner’s rights stored procedures would prevent the stored procedure from working.
  For example, use a caller’s rights procedure if the caller of the stored procedure needs to use that caller’s
  environment (e.g. session variables or account parameters).

If a particular procedure can work correctly with either caller’s rights or owner’s rights, then the following rule
might help you choose which rights to use:

* If a procedure is an owner’s rights procedure, the caller does not have the privilege to view the code in
  the stored procedure (unless the caller is also the owner). If you want to prevent callers from
  viewing the source code of the procedure, then create the procedure as an owner’s rights procedure.
  Conversely, if you want callers to be able to read the source code, then create the procedure as a caller’s
  rights prodecure.

---
title: User-Defined Functions and Stored Procedures in Declarative Shared Native Applications
source: https://docs.snowflake.com/en/developer-guide/declarative-sharing/udfs-sprocs.md
section: Developer Guide
---

# User-Defined Functions and Stored Procedures in Declarative Shared Native Applications

Declarative Native Apps can include [stored procedures](../stored-procedure/stored-procedures-overview.md) and [user-defined functions](../udf/udf-overview.md) (UDFs) to
query, visualize, and explore the data. This topic describes how to include
these logic objects in your app.

## Supported User-Defined Functions and Stored Procedures

You can share the following types of user-defined functions (UDFs) and stored
procedures (sprocs) in a Declarative Native App:

* Stored procedures that have OWNERS RIGHTS or RESTRICTED CALLERS RIGHTS.
  For more information, see [Understanding caller’s rights and owner’s rights stored procedures](../stored-procedure/stored-procedures-rights.md).
* All types of UDFs, except EXTERNAL functions
* Snowpark UDFs and stored procedures written in Python, Java, Javascript, and
  Scala. Snowpark Container Service Functions are not supported.

## Including User-Defined Functions and Stored Procedures in your application

To include UDFs and stored procedures in your Declarative Native App, add the
names of the objects and their permissions to the `manifest.yaml` file.
You don’t need to add the objects using separate files, as you do with
notebooks.

The following example shows how to include a UDF and a stored procedure in the
`manifest.yaml` file:

```yaml
manifest_version: 2

roles:
  - ANALYST:
      comment: "The ANALYST role provides access to logic objects."

shared_content:
  databases:
    - SNAF_POPULATION_DB:
        schemas:
          - LOGIC_SCHEMA:
              roles: [ANALYST]
              functions:
                - POPULATION_ANALYSIS_FUNCTION(NUMBER):
                    roles: [ANALYST]
              procedures:
                - POPULATION_ANALYSIS_PROCEDURE():
                    roles: [ANALYST]
```

In this example, the `POPULATION_ANALYSIS_FUNCTION` UDF and the
`POPULATION_ANALYSIS_PROCEDURE` stored procedure are included in the
`manifest.yaml` file. The `ANALYST` app role is granted access to
both objects.

## Accessing private (non-shared) objects using UDFs and stored procedures

You can use UDFs and stored procedures to access private (non-shared) tables and
views. For example, your database can have a view that isn’t visible to
consumers, but consumers can use a stored procedure to retrieve
data from that view.

To allow customers to access private objects using UDFs and stored procedures,
mark the object with the `private: true` keyword in the
`manifest.yaml` file.

The following example shows how to allow a stored procedure to access a private
table in the `manifest.yaml` file:

```yaml
manifest_version: 2

roles:
  - VIEWER:
      comment: "The VIEWER role can access a stored procedure that retrieves data from a view, but not the underlying view."

shared_content:
  databases:
    - SNAF_POPULATION_DB:
        schemas:
          - DATA_SCHEMA:
              views: # This view is private as no roles are granted
                - COUNTRY_POP_BY_YEAR_2000:
                    private: true
          - LOGIC_SCHEMA:
              roles: [VIEWER]
              procedures:
                - POPULATION_DISPLAY_PROCEDURE():
                    roles: [VIEWER]
```

In the previous example, the `COUNTRY_POP_BY_YEAR_2000` view is private
because no roles are granted access to it, but the `private` parameter
allows logic objects to access it. The `VIEWER` app role can execute the
stored procedure, but it can’t query the private view directly. Note that the
tables that the `COUNTRY_POP_BY_YEAR_2000` view references don’t need to
be included in the `manifest.yaml` file for the view to access them.

## Limitations

Supported languages and types
:   Snowpark UDFs and stored procedures written in Python, Java, Javascript, and
    Scala. Snowpark Container Service functions are not supported.

Schemas for data objects and logic objects
:   You must use separate schemas for data objects (tables and views) and logic
    objects (UDFs and stored procedures). For example, you can use a schema named
    `DATA_SCHEMA` for tables and views, and a schema named
    `LOGIC_SCHEMA` for UDFs and stored procedures.

Referencing private objects
:   Your UDFs and stored procedures must reference private objects by their
    schema-qualified names. Your logic objects can’t reference private objects by
    their fully-qualified names.

Object count
:   A Declarative Native App can include up to 100 UDFs and stored procedures. To
    raise this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Dynamic tables
:   Referencing dynamic tables in UDFs and stored procedures is not supported.

---
title: User-defined functions overview
source: https://docs.snowflake.com/en/developer-guide/udf/udf-overview.md
section: Developer Guide
---

# User-defined functions overview

You can write user-defined functions (UDFs) to extend the system to perform operations that are not available through the
[built-in system-defined functions](../../sql-reference/intro-summary-operators-functions.md) provided by Snowflake. Once you create a UDF,
you can reuse it multiple times. A function always returns a value explicitly by specifying an expression, so it’s a good choice for
calculating and return a value.

You can use UDFs to extend built-in functions or to encapsulate calculations that are standard for your organization. UDFs you create
can be called in a way similar to built-in functions.

You write a UDF’s logic – its handler – in one of the supported languages. Once you have a handler,
you can [create a UDF](udf-creating-sql.md) using any of several tools included in Snowflake, then
[execute the UDF](udf-calling-sql.md).

A UDF is like a stored procedure, but the two differ in important ways. For more information, see
[Choosing whether to write a stored procedure or a user-defined function](../stored-procedures-vs-udfs.md).

A UDF is just one way to extend Snowflake. For others, see the following:

* [Stored procedures overview](../stored-procedure/stored-procedures-overview.md)
* [Writing external functions](../../sql-reference/external-functions.md)
* [Snowpark API](../snowpark/index.md)

## User-defined function variations

You can write a UDF in one of several variations, depending on the input and output requirements your function must meet.

| Variation | Description |
| --- | --- |
| User-defined function (UDF) | Also known as a *scalar function*, returns one output row for each input row. The returned row consists of a single column/value. |
| User-defined aggregate function (UDAF) | Operates on values across multiple rows to perform mathematical calculations such as sum, average, counting, finding minimum or maximum values, standard deviation, and estimation, as well as some non-mathematical operations. |
| User-defined table function (UDTF) | Returns a tabular value for each input row. |
| Vectorized user-defined function (UDF) | Receive batches of input rows as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html) or [Series](https://pandas.pydata.org/docs/reference/series.html). |
| Vectorized user-defined table function (UDTF) | Receive batches of input rows as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and return tabular results. |

## Supported languages and tools

You can [create](udf-creating-sql.md) and manage UDFs (and other Snowflake entities) by using any of multiple
tools, depending on how you prefer to work.

| Language | Approach | Support |
| --- | --- | --- |
| **SQL**  With handler in Java, JavaScript, Python, Scala, or SQL | Write SQL code in Snowflake to create and manage Snowflake entities. Write the function’s logic in one of the supported handler languages. | Java:  [UDF](java/udf-java-introduction.md), [UDTF](java/udf-java-tabular-functions.md)  JavaScript:  [UDF](javascript/udf-javascript-introduction.md), [UDTF](javascript/udf-javascript-tabular-functions.md)  Python:  [UDF](python/udf-python-introduction.md), [UDAF](python/udf-python-aggregate-functions.md), [UDTF](python/udf-python-tabular-functions.md), [Vectorized UDF](python/udf-python-batch.md), [Vectorized UDTF](python/udf-python-tabular-vectorized.md)  Scala:  [UDF](scala/udf-scala-introduction.md)  SQL:  [UDF](sql/udf-sql-introduction.md), [UDTF](sql/udf-sql-tabular-functions.md) |
| **Java, Python, or Scala**  [Snowpark API](../snowpark/index.md) | On the client, write code for operations that are pushed to Snowflake for processing. | Java:  [UDF](../snowpark/java/creating-udfs.md), [UDTF](../snowpark/java/creating-udfs.md)  Python:  [UDF](../snowpark/python/creating-udfs.md), [UDAF](../snowpark/python/creating-udafs.md), [UDTF](../snowpark/python/creating-udtfs.md), [Vectorized UDF or UDTF](../snowpark/python/creating-udfs.md)  Scala:  [UDF](../snowpark/scala/creating-udfs.md), [UDTF](../snowpark/scala/creating-udfs.md) |
| **Command-line Interface**  [Snowflake CLI](../snowflake-cli/index.md) | Use the command line to create and manage Snowflake entities, specifying properties as properties of JSON objects. | [Managing Snowflake objects](../snowflake-cli/objects/manage-objects.md) |
| **Python**  [Snowflake Python API](../snowflake-python-api/snowflake-python-overview.md) | On the client, Execute commands to create the function with Python, writing the function’s handler in one of the supported handler languages. | [Managing user-defined functions (UDFs)](../snowflake-python-api/snowflake-python-managing-functions-procedures.md) |
| **REST**  [Snowflake REST API](../snowflake-rest-api/snowflake-rest-api.md) | Make requests of RESTful endpoints to create and manage Snowflake entities. | [Manage user-defined functions](../snowflake-rest-api/user-defined-function/user-defined-function-introduction.md) |

When choosing a language, consider also the following:

* **Handler locations supported.** Not all languages support referring to the handler on a stage (the handler code must instead be in-line).
  For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).
* **Whether the handler results in a UDF that’s sharable.** A sharable UDF can be used with the Snowflake
  [Secure Data Sharing](../../user-guide/data-sharing-intro.md) feature.

| Language | Handler Location | Sharable |
| --- | --- | --- |
| Java | In-line or staged | No [1] |
| JavaScript | In-line | Yes |
| Python | In-line or staged | No [2] |
| Scala | In-line or staged | No [3] |
| SQL | In-line | Yes |

[1]

For more information about limits on sharing Java UDFs, see [General limitations](java/udf-java-limitations.md).

[2]

For more information about limits on sharing Python UDFs, see [General limitations](python/udf-python-limitations.md).

[3]

For more information about limits on sharing Scala UDFs, see [Scala UDF limitations](scala/udf-scala-limitations.md).

## Considerations

* If a query calls a UDF to access staged files, the operation fails with a user error if the SQL statement also queries a view that
  calls any UDF or UDTF, regardless of whether the function in the view accesses staged files or not.
* UDTFs can process multiple files in parallel; however, UDFs currently process files serially. As a workaround,
  group rows in a subquery using the [GROUP BY](../../sql-reference/constructs/group-by.md) clause. See [Process a CSV with a UDTF](../../user-guide/unstructured-data-java.md)
  for an example.
* Currently, if staged files referenced in a query are modified or deleted while the query is running, the function call fails with an
  error.
* If you specify the [CURRENT_DATABASE](../../sql-reference/functions/current_database.md) or [CURRENT_SCHEMA](../../sql-reference/functions/current_schema.md) function in the
  handler code of the UDF, the function returns the database or schema that contains the UDF, not the database or schema in use for
  the session.

## UDF example

Code in the following example creates a UDF called `addone` with a handler written in Python. The handler function is
`addone_py`. This UDF returns an `int`.

```sqlexample-python
CREATE OR REPLACE FUNCTION addone(i INT)
  RETURNS INT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.12'
  HANDLER = 'addone_py'
AS $$
def addone_py(i):
 return i+1
$$;
```

Code in the following example executes the `addone` UDF.

```sqlexample
SELECT addone(3);
```

## Guidelines and constraints

Snowflake constraints:
:   You can ensure stability within the Snowflake environment by developing within Snowflake constraints. For
    more information, see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../udf-stored-procedure-constraints.md).

Naming:
:   Be sure to name functions in a way that avoids collisions with other functions. For more information, see
    [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

Arguments:
:   Specify the arguments and indicate which arguments are optional. For more information, see
    [Defining arguments for UDFs and stored procedures](../udf-stored-procedure-arguments.md).

Data type mappings:
:   For each handler language, there’s a separate set of mappings between the language’s data types and the SQL types
    used for arguments and return values. For more about the mappings for each language, see [Data Type Mappings Between SQL and Handler Languages](../udf-stored-procedure-data-type-mapping.md).

## Handler writing

Handler languages:
:   For language-specific content on writing a handler, see Supported languages and tools.

External network access:
:   You can access external network locations with
    [external network access](../external-network-access/external-network-access-overview.md). You can create secure
    access to specific network locations external to Snowflake, then use that access from within the handler code.

Logging, tracing, and metrics:
:   You can record code activity by
    [capturing log messages, trace events, and metrics data](../logging-tracing/logging-tracing-overview.md),
    storing the data in a database you can query later.

## Security

You can grant privileges on objects needed for them to perform specific SQL actions with a UDF or UDTF. For more information, see
[Granting privileges for user-defined functions](udf-access-control.md)

Functions share certain security concerns with stored procedures. For more information, see the following:

* You can help a procedure’s handler code execute securely by following the best practices described in
  [Security Practices for UDFs and Procedures](../udf-stored-procedure-security-practices.md)
* Ensure that sensitive information is concealed from users who should not have access to it. For more information, see
  [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../secure-udf-procedure.md)

## Handler code deployment

When creating a function, you can specify its handler – which implements the function’s logic – as code in-line with the function
definition or as code external to the definition, such as code packaged and copied to a stage.

For more information, see [Keeping handler code in-line or on a stage](../inline-or-staged.md).

---
title: Using a Git repository in Snowflake
source: https://docs.snowflake.com/en/developer-guide/git/git-overview.md
section: Developer Guide
---

# Using a Git repository in Snowflake

You can integrate your remote Git repository with Snowflake so that files from the remote repository are synchronized to a local clone of the
repository in Snowflake. The Git repository clone in Snowflake acts as a local Git repository with a full clone of the remote
repository, including branches, tags, and commits.

With a Git repository clone in Snowflake, you can do the following:

* Perform common Git tasks, including the following:

  + Fetch the latest version.

    For more information, see [Fetch from the remote Git repository](git-operations.md).
  + Select branches or tags.
  + Browse folders and search for files by name.

    For more information, see [View a list of repository branches or tags](git-operations.md) and [View a list of repository files](git-operations.md).
  + Copy the full path to any selected file for referencing it in Snowflake code (such as handler code for functions, tasks, or procedures).
  + [Execute immediate from](../../sql-reference/sql/execute-immediate-from.md) `.sql` files (with a code preview).

    For an example, see [Use a Git repository clone file to configure new accounts](git-examples.md).
* Commit and push changes to the remote repository.

  Writing to the remote repository is supported only from the following Snowflake features:

  + [Workspaces](../../user-guide/ui-snowsight/workspaces-git.md)
  + [Streamlit apps](../streamlit/features/git-integration.md)
  + [Snowflake notebooks](../../user-guide/ui-snowsight/notebooks-snowgit.md)
* In Snowflake, use files from any branch or tag.
* From a Git repository clone synchronized from your remote repository, import files into code you execute in Snowflake.

  For example, you can write procedures and user-defined functions (UDFs) whose handler code is held by the Git repository clone synchronized
  from the repository.

## How Snowflake works with a remote Git repository

With a remote Git repository integrated with your Snowflake account, you synchronize files from the remote repository to a
Git repository clone in Snowflake. To access a file in Snowflake, you refer to it in the Git repository clone. For more information about
using repository files, see [Use a Git repository file as a stored procedure handler](git-examples.md).

### Snowflake Git repository clone

A Git repository clone in Snowflake is a full clone with all branches, tags, and commits from the remote repository.

After remote repository contents are in the Git repository clone, you can reference files there as you would a file on a stage.

You can perform operations similar to those you perform with Git commands in a local repository, including:

* [Fetching the remote repository](git-operations.md) to refresh the Git repository clone as the remote
  repository changes.
* [Viewing repository branches or tags](git-operations.md) contained by the Git repository clone.
* Pushing to the repository from Workspaces ([supported only from Workspaces](../../user-guide/ui-snowsight/workspaces-git.md)).

### Git repository and development tools

After you integrate your remote repository with Snowflake, you can continue using your development tools and local repository as before.
Through the Git repository clone, Snowflake becomes another client of your repository separate from your local repository.

## Supported platforms

You can currently integrate Git repositories that use the following Git platforms. This includes repositories based on these platforms, but
available at custom URLs. For example, a repository based on GitHub does not need to be at github.com.

* GitHub
* GitLab
* BitBucket
* Azure DevOps
* AWS CodeCommit

## References

* [CREATE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md)
* [ALTER GIT REPOSITORY](../../sql-reference/sql/alter-git-repository.md)
* [DESCRIBE GIT REPOSITORY](../../sql-reference/sql/desc-git-repository.md)
* [DROP GIT REPOSITORY](../../sql-reference/sql/drop-git-repository.md)
* [SHOW GIT REPOSITORIES](../../sql-reference/sql/show-git-repositories.md)
* [SHOW GIT BRANCHES](../../sql-reference/sql/show-git-branches.md)
* [SHOW GIT TAGS](../../sql-reference/sql/show-git-tags.md)

---
title: Using explicit transactions
source: https://docs.snowflake.com/en/developer-guide/sql-api/using-transactions.md
section: Developer Guide
---

# Using explicit transactions

To execute SQL statements in an explicit [transaction](../../sql-reference/transactions.md), you must use a
[single HTTP request](submitting-multiple-statements.md) to specify the start, end, and statements in the
transaction. For example:

```sqljson
{
  "statement": "begin transaction; insert into table2 (i) values (1); commit; select i from table1 order by i",
  ...
  "parameters": {
      "MULTI_STATEMENT_COUNT": "4"
  }
  ...
}
```

As is the case when you specify [multiple statements in a request](submitting-multiple-statements.md),
if the request was processed successfully, Snowflake returns a response containing the `statementHandles` field, which is
set to an array of handles for the statements in the request (including the BEGIN TRANSACTION and COMMIT statements).

```none
HTTP/1.1 200 OK
Content-Type: application/json

{
  "resultSetMetaData" : {
    "numRows" : 1,
    "format" : "jsonv2",
    "rowType" : [ {
      "name" : "multiple statement execution",
      "database" : "",
      "schema" : "",
      "table" : "",
      "type" : "text",
      "byteLength" : 16777216,
      "scale" : null,
      "precision" : null,
      "nullable" : false,
      "collation" : null,
      "length" : 16777216
    } ]
  },
  "data" : [ [ "Multiple statements executed successfully." ] ],
  "code" : "090001",
  "statementHandles" : [ "019d6ed0-0502-3101-0000-096d00421082", "019d6ed0-0502-3101-0000-096d00421086", "019d6ed0-0502-3101-0000-096d0042108a", "019d6ed0-0502-3101-0000-096d0042108e" ],
  "statementStatusUrl" : "/api/v2/statements/019d6ed0-0502-3101-0000-096d0042107e?requestId=066920fa-e589-43c6-8cca-9dcb2d4be978",
  "sqlState" : "00000",
  "statementHandle" : "019d6ed0-0502-3101-0000-096d0042107e",
  "message" : "Statement executed successfully.",
  "createdOn" : 1625684162876
}
```

The handles in the `statementHandles` field correspond to the statements in the request. In this example, the statements and
their corresponding handles are:

* BEGIN TRANSACTION (`019d6ed0-0502-3101-0000-096d00421082`)
* INSERT (`019d6ed0-0502-3101-0000-096d00421086`)
* COMMIT (`019d6ed0-0502-3101-0000-096d0042108a`)
* SELECT (`019d6ed0-0502-3101-0000-096d0042108e`)

You can use these handles to [check the status of each statement](submitting-multiple-statements.md).

---
title: Using pandas DataFrames with the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-pandas.md
section: Developer Guide
---

# Using pandas DataFrames with the Python Connector

pandas is a library for data analysis. With pandas, you use a data structure called a DataFrame
to analyze and manipulate two-dimensional data (such as data from a database table).

For more information see the [pandas](https://pandas.pydata.org/) documentation.

If you need to get data from a Snowflake database to a pandas DataFrame, you can use the API methods provided with the Snowflake
Connector for Python. The connector also provides API methods for writing data from a pandas DataFrame to a Snowflake database.

> **Note:**
>
> Some of these API methods require a specific version of the PyArrow library.
> See Requirements for details.
>
> For more information, see the [PyArrow library](https://arrow.apache.org/docs/python/) documentation.

## Requirements

Currently, the pandas-oriented API methods in the Python connector API work with:

* Snowflake Connector 2.1.2 (or later) for Python.
* PyArrow library version 3.0.x (or later), depending on your connector version.

  If you do not have PyArrow installed, you do not need to install PyArrow yourself;
  installing the Python Connector as documented below automatically installs the appropriate version of PyArrow.

  > **Caution:**
  >
  > If you already have any version of the PyArrow library other than the recommended version listed above,
  > please uninstall PyArrow before installing the Snowflake Connector for Python. Do not re-install a different
  > version of PyArrow after installing the Snowflake Connector for Python.

  For more information, see the [PyArrow library](https://arrow.apache.org/docs/python/) documentation.
* pandas 0.25.2 (or later), depending on your connector version. Earlier versions might work, but have not been tested.

## Installation

To install the pandas-compatible version of the Snowflake Connector for Python, execute the command:

```bash
pip install "snowflake-connector-python[pandas]"
```

You must enter the square brackets (`[` and `]`) as shown in the command. The square brackets specify the
extra part of the package to install.

For more information, see the [extras](https://www.python.org/dev/peps/pep-0508/#extras) Python dependency.

Use quotes around the name of the package (as shown) to prevent the square brackets from being interpreted as a wildcard.

If you need to install other extras (for example, `secure-local-storage` for
[caching connections with browser-based SSO](../../user-guide/admin-security-fed-auth-use.md) or
[caching MFA tokens](../../user-guide/security-mfa.md)), use a comma between the extras:

```bash
pip install "snowflake-connector-python[secure-local-storage,pandas]"
```

## Reading data from a Snowflake database to a pandas DataFrame

To read data into a pandas DataFrame, you use a [Cursor](python-connector-api.md) to
[retrieve the data](python-connector-example.md) and then call one of these `Cursor` methods to put the data
into a pandas DataFrame:

* [`fetch_pandas_all()`](python-connector-api.md "fetch_pandas_all").
* [`fetch_pandas_batches()`](python-connector-api.md "fetch_pandas_batches").

## Writing data from a pandas DataFrame to a Snowflake database

To write data from a pandas DataFrame to a Snowflake database, do one of the following:

* Call the [`write_pandas()`](python-connector-api.md "write_pandas") function.
* Call the `pandas.DataFrame.to_sql()` method.

  For more information, see the
  [pandas.DataFrame.to_sql](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.to_sql.html) documentation,
  and specify [`pd_writer()`](python-connector-api.md "pd_writer") as the method to use to insert the data into the database.

## Snowflake to pandas data mapping

The table below shows the mapping from Snowflake data types to pandas data types:

| Snowflake Data Type | pandas Data Type |
| --- | --- |
| FIXED NUMERIC type (scale = 0) except DECIMAL | `(u)int{8,16,32,64}` or `float64` (for NULL) |
| FIXED NUMERIC type (scale > 0) except DECIMAL | `float64` |
| FIXED NUMERIC type DECIMAL | `decimal` |
| FLOAT/DOUBLE | `float64` |
| VARCHAR | `str` |
| BINARY | `str` |
| VARIANT | `str` |
| DATE | `object` (with `datetime.date` objects) |
| TIME | `pandas.Timestamp(np.datetime64[ns])` |
| TIMESTAMP_NTZ, TIMESTAMP_LTZ, TIMESTAMP_TZ | `pandas.Timestamp(np.datetime64[ns])` |

Notes:

* If the Snowflake data type is FIXED NUMERIC and the scale is zero, and if the value is NULL, then the value is
  converted to `float64`, not an integer type.
* If any conversion causes overflow, the Python connector throws an exception.

## Importing pandas

Customarily, pandas is imported with the following statement:

> ```python
> import pandas as pd
> ```

You might see references to pandas objects as either `pandas.object` or `pd.object`.

## Migrating to pandas DataFrames

This section is primarily for users who have used pandas (and possibly SQLAlchemy) previously.

Previous pandas users might have code similar to either of the following:

* This example shows the original way to generate a pandas DataFrame from the Python connector:

  > ```python
  > import pandas as pd
  >
  > def fetch_pandas_old(cur, sql):
  >     cur.execute(sql)
  >     rows = 0
  >     while True:
  >         dat = cur.fetchmany(50000)
  >         if not dat:
  >             break
  >         df = pd.DataFrame(dat, columns=cur.description)
  >         rows += df.shape[0]
  >     print(rows)
  > ```
* This example shows how to use SQLAlchemy to generate a pandas DataFrame:

  > ```python
  > import pandas as pd
  >
  > def fetch_pandas_sqlalchemy(sql):
  >     rows = 0
  >     for chunk in pd.read_sql_query(sql, engine, chunksize=50000):
  >         rows += chunk.shape[0]
  >     print(rows)
  > ```

Code that is similar to either of the preceding examples can be converted to use the Python connector pandas
API calls listed in Reading Data from a Snowflake Database to a pandas DataFrame (in this topic).

> **Note:**
>
> With support for pandas in the Python connector, SQLAlchemy is no longer needed to convert data in a cursor
> into a DataFrame.
>
> However, you can continue to use SQLAlchemy if you wish; the Python connector maintains compatibility with
> SQLAlchemy.

---
title: Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/running-examples.md
section: Developer Guide
---

# Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector

This topic explains how to run the Snowflake Scripting examples in [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), and the [Python Connector](../python-connector/python-connector.md).

> **Note:**
>
> If you are using other clients and interfaces, such as [Snowflake CLI](../snowflake-cli/index.md) or the
> [JDBC driver](../jdbc/jdbc.md), you can skip this topic and refer to
> [Snowflake Scripting blocks](blocks.md).

## Introduction

Currently, the following interfaces do not correctly parse Snowflake Scripting blocks:

* [Snowflake CLI](../snowflake-cli/index.md)
* [SnowSQL](../../user-guide/snowsql.md)
* The `execute_stream()` and `execute_string()` methods in
  [Python Connector](../python-connector/python-connector.md) code

  > **Note:**
  >
  > The other Python Connector methods parse Snowflake Scripting blocks correctly.

Entering and running a Snowflake Scripting block can result in the following error:

```none
SQL compilation error: syntax error line 2 at position 25 unexpected '<EOF>'
```

To work around this, use delimiters around the start and end of a Snowflake Scripting block if you are using
these interfaces.

The following sections explain how to do this:

* Using string constant delimiters around a block in a stored procedure
* Passing a block as a string literal to EXECUTE IMMEDIATE

## Using string constant delimiters around a block in a stored procedure

If you are creating a stored procedure, enclose the Snowflake Scripting block in
[single quotes or double dollar signs](../../sql-reference/data-types-text.md). For example:

```sqlexample
CREATE OR REPLACE PROCEDURE myprocedure()
  RETURNS VARCHAR
  LANGUAGE SQL
  AS
  $$
    -- Snowflake Scripting code
    DECLARE
      radius_of_circle FLOAT;
      area_of_circle FLOAT;
    BEGIN
      radius_of_circle := 3;
      area_of_circle := pi() * radius_of_circle * radius_of_circle;
      RETURN area_of_circle;
    END;
  $$
  ;
```

> **Note:**
>
> When specifying the scripting block directly on the Snowflake CLI command line, the `$$` delimiters might not work for some shells because they interpret that delimiter as something else. For example, the bash and zsh shells interpret it as the process ID (PID). To address this limitation, you can use the following alternatives:
>
> * If you still want to specify the scripting block on the command line, you can escape the `$$` delimiters, as in `\$\$`.
> * You can also put the scripting block with the default `$$` delimiters into a separate file and call it with the `snow sql -f <filename>` command.

## Passing a block as a string literal to EXECUTE IMMEDIATE

If you are writing an [anonymous block](blocks.md), pass the block as a string literal to the
[EXECUTE IMMEDIATE](../../sql-reference/sql/execute-immediate.md) command. To delimit the string literal, use
[single quotes or double dollar signs](../../sql-reference/data-types-text.md).

For example:

```sqlexample
EXECUTE IMMEDIATE $$
-- Snowflake Scripting code
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := pi() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
$$
;
```

As an alternative, you can define a [session variable](../../sql-reference/session-variables.md) that is a string literal
containing the block, and you can pass that session variable to the EXECUTE IMMEDIATE command. For example:

```sqlexample
SET stmt =
$$
DECLARE
    radius_of_circle FLOAT;
    area_of_circle FLOAT;
BEGIN
    radius_of_circle := 3;
    area_of_circle := pi() * radius_of_circle * radius_of_circle;
    RETURN area_of_circle;
END;
$$
;

EXECUTE IMMEDIATE $stmt;
```

> **Note:**
>
> When specifying the scripting block directly on the Snowflake CLI command line, the `$$` delimiters might not work for some shells because they interpret that delimiter as something else. For example, the bash and zsh shells interpret it as the process ID (PID). To address this limitation, you can use the following alternatives:
>
> * If you still want to specify the scripting block on the command line, you can escape the `$$` delimiters, as in `\$\$`.
> * You can also put the scripting block with the default `$$` delimiters into a separate file and call it with the `snow sql -f <filename>` command.

---
title: Using the JDBC Driver
source: https://docs.snowflake.com/en/developer-guide/jdbc/jdbc-using.md
section: Developer Guide
---

# Using the JDBC Driver

This topic provides information about how to use the JDBC driver.

## Snowflake JDBC API extensions

The Snowflake JDBC driver supports additional methods beyond the standard JDBC specification. This section documents
how to use unwrapping to access the Snowflake-specific methods, then describes three of the situations in which you might
need to unwrap:

* Performing an asynchronous query.
* Uploading data files directly from a stream to an internal stage.
* Downloading data files directly from an internal stage to a stream.

### Unwrapping Snowflake-specific classes

The Snowflake JDBC driver supports Snowflake-specific methods. These methods are defined in Snowflake-specific
Java-language interfaces, such as SnowflakeConnection, SnowflakeStatement, and SnowflakeResultSet. For example,
the SnowflakeStatement interface contains a `getQueryID()` method that is not in the JDBC Statement interface.

When the Snowflake JDBC driver is asked to create a JDBC object (e.g. create a JDBC `Statement` object by
calling a `Connection` object’s `createStatement()` method), the Snowflake JDBC driver actually creates
Snowflake-specific objects that implement not only the methods of the JDBC standard, but also the additional methods
from the Snowflake interfaces.

To access these Snowflake methods, you “unwrap” an object (such as a `Statement` object) to expose the Snowflake
object and its methods. You can then call the additional methods.

The following code shows how to unwrap a JDBC `Statement` object to expose the methods of the
`SnowflakeStatement` interface, and then call one of those methods, in this case `setParameter`:

> ```java
> Statement statement1;
> ...
> // Unwrap the statement1 object to expose the SnowflakeStatement object, and call the
> // SnowflakeStatement object's setParameter() method.
> statement1.unwrap(SnowflakeStatement.class).setParameter(...);
> ```

### Performing an asynchronous query

The Snowflake JDBC Driver supports asynchronous queries, such as queries that return control to the user before they finish. You can start a query and then use polling to determine when the query has finished. When it does, the user can read the result set.

This feature allows a client program to run multiple queries in parallel without the client program itself using
multi-threading.

Asynchronous queries use methods added to the `SnowflakeConnection`, `SnowflakeStatement`, `SnowflakePreparedStatement`, and
`SnowflakeResultSet` classes.

> **Note:**
>
> To perform asynchronous queries, you must ensure the `ABORT_DETACHED_QUERY` configuration parameter is `FALSE` (default value).
>
> If the connection to client is lost:
>
> * For synchronous queries, all in-progress synchronous queries are aborted immediately regardless of the parameter value.
> * For asynchronous queries:
>
>   + If ABORT_DETACHED_QUERY is set to `FALSE`, in-progress asynchronous queries continue to run until they end normally.
>   + If ABORT_DETACHED_QUERY is set to `TRUE`, Snowflake automatically aborts all in-progress asynchronous queries when a client connection is not re-established after five minutes.
>
>     You can prevent the asynchronous query from being aborted at the five minute mark by calling `cursor.query_result(queryId)`. While this call does not retrieve the actual query result as the query is still running, it does prevent the query from being canceled. Invoking `query_result` is a synchronous operation, which might or might not be appropriate for your particular use case.

You can run a mix of synchronous and asynchronous queries in the same session.

> **Note:**
>
> Asynchronous queries don’t support PUT/GET statements.

When `executeAsyncQuery(query)` is used, the Snowflake JDBC driver automatically keeps track of the queries submitted asynchronously. When the connection is explicitly closed with `connection.close()`, the list of async queries is examined and, if any of them are still running, the Snowflake-side session is not deleted.

If no async queries are running within the same connection, the Snowflake session belonging to the connection is logged out when `connection.close()` is called, which implicitly cancels all other queries running in the same session.

This behavior also depends on the SQL ABORT_DETACHED_QUERY parameter. For more information, see the [ABORT_DETACHED_QUERY parameter](../../sql-reference/parameters.md) documentation.

As a best practice, isolate all long-running async tasks (especially those intended to continue after the connection is closed) into a separate connection.

To better understand the hierarchy of the drivers’ business logic and the ABORT_DETACHED_QUERY parameter’s interaction, see the following flowchart:

#### Best practices for asynchronous queries

* Ensure that you know which queries are dependent on other queries before you run any queries in parallel. Queries that are interdependent and order-sensitive are not suitable for parallelizing. For example, an INSERT statement should not start until after the corresponding CREATE TABLE statement has finished.
* Ensure that you do not run too many queries for the memory that you have available.
  Running multiple queries in parallel typically consumes more memory, especially if more than one ResultSet is stored in memory at the same time.
* When polling, handle the rare cases where a query does not succeed. For example, avoid the following potential infinite loop:

  Version 4.xVersion 3.x

  ```java
  QueryStatus queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
  while (!queryStatus.isSuccess())  {     //  NOT RECOMMENDED
      Thread.sleep(2000);   // 2000 milliseconds.
      queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
      }
  ```

  Instead, use code similar to the following:

  ```java
  // Assume that the query is not done yet.
  QueryStatus queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
  while (queryStatus.isStillRunning())  {
      Thread.sleep(2000);   // 2000 milliseconds.
      queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
      }

  if (queryStatus.isSuccess()) {
      ...
      }
  ```

  ```java
  QueryStatus queryStatus = QueryStatus.RUNNING;
  while (queryStatus != QueryStatus.SUCCESS)  {     //  NOT RECOMMENDED
      Thread.sleep(2000);   // 2000 milliseconds.
      queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
      }
  ```

  Instead, use code similar to the following:

  ```java
  // Assume that the query is not done yet.
  QueryStatus queryStatus = QueryStatus.RUNNING;
  while (queryStatus == QueryStatus.RUNNING || queryStatus == QueryStatus.RESUMING_WAREHOUSE)  {
      Thread.sleep(2000);   // 2000 milliseconds.
      queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
      }

  if (queryStatus == QueryStatus.SUCCESS) {
      ...
      }
  ```
* Ensure that transaction control statements (BEGIN, COMMIT, and ROLLBACK) are not executed in parallel with other statements.

#### Examples of asynchronous queries

Most of these examples require that the program import classes as shown below:

> Version 4.xVersion 3.x
>
> ```java
> import java.sql.Connection;
> import java.sql.ResultSet;
> import java.sql.Statement;
> import net.snowflake.client.api.resultset.QueryStatus;
> import net.snowflake.client.api.resultset.SnowflakeAsyncResultSet;
> import net.snowflake.client.api.connection.SnowflakeConnection;
> import net.snowflake.client.api.resultset.SnowflakeResultSet;
> import net.snowflake.client.api.statement.SnowflakeStatement;
> ```
>
> This is a very simple example:
>
> ```java
> String sql_command = "";
> ResultSet resultSet;
>
> System.out.println("Create JDBC statement.");
> Statement statement = connection.createStatement();
> sql_command = "SELECT PI()";
> System.out.println("Simple SELECT query: " + sql_command);
> resultSet = statement.unwrap(SnowflakeStatement.class).executeAsyncQuery(sql_command);
>
> // Assume that the query isn't done yet.
> QueryStatus queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
> while (queryStatus.isStillRunning()) {
>   Thread.sleep(2000); // 2000 milliseconds.
>   queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
> }
>
> if (queryStatus.getStatus() == QueryStatus.Status.FAILED_WITH_ERROR) {
>   // Print the error code to stdout
>   System.out.format("Error code: %d%n", queryStatus.getErrorCode());
>   System.out.format("Error message: %s%n", queryStatus.getErrorMessage());
> } else if (!queryStatus.isSuccess()) {
>   System.out.println("ERROR: unexpected QueryStatus: " + queryStatus.getStatus());
> } else {
>   boolean result_exists = resultSet.next();
>   if (!result_exists) {
>     System.out.println("ERROR: No rows returned.");
>   } else {
>     float pi_result = resultSet.getFloat(1);
>     System.out.println("pi = " + pi_result);
>   }
> }
> ```
>
> This example stores the query ID, closes the connection, re-opens the connection, and uses the query ID to retrieve the data:
>
> ```java
> String sql_command = "";
> ResultSet resultSet;
> String queryID = "";
>
> System.out.println("Create JDBC statement.");
> Statement statement = connection.createStatement();
> sql_command = "SELECT PI() * 2";
> System.out.println("Simple SELECT query: " + sql_command);
> resultSet = statement.unwrap(SnowflakeStatement.class).executeAsyncQuery(sql_command);
> queryID = resultSet.unwrap(SnowflakeResultSet.class).getQueryID();
> System.out.println("INFO: Closing statement.");
> statement.close();
> System.out.println("INFO: Closing connection.");
> connection.close();
>
> System.out.println("INFO: Re-opening connection.");
> connection = create_connection(args);
> use_warehouse_db_and_schema(connection);
> resultSet = connection.unwrap(SnowflakeConnection.class).createResultSet(queryID);
>
> // Assume that the query isn't done yet.
> QueryStatus queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
> while (queryStatus.isStillRunning()) {
>   Thread.sleep(2000); // 2000 milliseconds.
>   queryStatus = resultSet.unwrap(SnowflakeAsyncResultSet.class).getStatus();
> }
>
> if (queryStatus.getStatus() == QueryStatus.Status.FAILED_WITH_ERROR) {
>   System.out.format(
>       "ERROR %d: %s%n", queryStatus.getErrorMessage(), queryStatus.getErrorCode());
> } else if (!queryStatus.isSuccess()) {
>   System.out.println("ERROR: unexpected QueryStatus: " + queryStatus.getStatus());
> } else {
>   boolean result_exists = resultSet.next();
>   if (!result_exists) {
>     System.out.println("ERROR: No rows returned.");
>   } else {
>     float pi_result = resultSet.getFloat(1);
>     System.out.println("pi = " + pi_result);
>   }
> }
> ```
>
> ```java
> import java.sql.Connection;
> import java.sql.ResultSet;
> import java.sql.Statement;
> import net.snowflake.client.core.QueryStatus;
> import net.snowflake.client.jdbc.SnowflakeConnection;
> import net.snowflake.client.jdbc.SnowflakeResultSet;
> import net.snowflake.client.jdbc.SnowflakeStatement;
> ```
>
> This is a very simple example:
>
> ```java
>     String sql_command = "";
>     ResultSet resultSet;
>
>     System.out.println("Create JDBC statement.");
>     Statement statement = connection.createStatement();
>     sql_command = "SELECT PI()";
>     System.out.println("Simple SELECT query: " + sql_command);
>     resultSet = statement.unwrap(SnowflakeStatement.class).executeAsyncQuery(sql_command);
>
>     // Assume that the query isn't done yet.
>     QueryStatus queryStatus = QueryStatus.RUNNING;
>     while (queryStatus == QueryStatus.RUNNING || queryStatus == QueryStatus.RESUMING_WAREHOUSE) {
>       Thread.sleep(2000); // 2000 milliseconds.
>       queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
>     }
>
>     if (queryStatus == QueryStatus.FAILED_WITH_ERROR) {
>       // Print the error code to stdout
>       System.out.format("Error code: %d%n", queryStatus.getErrorCode());
>       System.out.format("Error message: %s%n", queryStatus.getErrorMessage());
>     } else if (queryStatus != QueryStatus.SUCCESS) {
>       System.out.println("ERROR: unexpected QueryStatus: " + queryStatus);
>     } else {
>       boolean result_exists = resultSet.next();
>       if (!result_exists) {
>         System.out.println("ERROR: No rows returned.");
>       } else {
>         float pi_result = resultSet.getFloat(1);
>         System.out.println("pi = " + pi_result);
>       }
>     }
> ```
>
> This example stores the query ID, closes the connection, re-opens the connection, and uses the query ID to retrieve the data:
>
> ```java
>     String sql_command = "";
>     ResultSet resultSet;
>     String queryID = "";
>
>     System.out.println("Create JDBC statement.");
>     Statement statement = connection.createStatement();
>     sql_command = "SELECT PI() * 2";
>     System.out.println("Simple SELECT query: " + sql_command);
>     resultSet = statement.unwrap(SnowflakeStatement.class).executeAsyncQuery(sql_command);
>     queryID = resultSet.unwrap(SnowflakeResultSet.class).getQueryID();
>     System.out.println("INFO: Closing statement.");
>     statement.close();
>     System.out.println("INFO: Closing connection.");
>     connection.close();
>
>     System.out.println("INFO: Re-opening connection.");
>     connection = create_connection(args);
>     use_warehouse_db_and_schema(connection);
>     resultSet = connection.unwrap(SnowflakeConnection.class).createResultSet(queryID);
>
>     // Assume that the query isn't done yet.
>     QueryStatus queryStatus = QueryStatus.RUNNING;
>     while (queryStatus == QueryStatus.RUNNING) {
>       Thread.sleep(2000); // 2000 milliseconds.
>       queryStatus = resultSet.unwrap(SnowflakeResultSet.class).getStatus();
>     }
>
>     if (queryStatus == QueryStatus.FAILED_WITH_ERROR) {
>       System.out.format(
>           "ERROR %d: %s%n", queryStatus.getErrorMessage(), queryStatus.getErrorCode());
>     } else if (queryStatus != QueryStatus.SUCCESS) {
>       System.out.println("ERROR: unexpected QueryStatus: " + queryStatus);
>     } else {
>       boolean result_exists = resultSet.next();
>       if (!result_exists) {
>         System.out.println("ERROR: No rows returned.");
>       } else {
>         float pi_result = resultSet.getFloat(1);
>         System.out.println("pi = " + pi_result);
>       }
>     }
> ```

### Upload data files directly from a stream to an internal stage

You can upload data files using the PUT command. However, sometimes it makes sense to transfer data directly from a
stream to an internal (i.e. Snowflake) stage as a file. (The [stage](../../user-guide/data-load-local-file-system-create-stage.md)
can be any internal stage type: table stage, user stage, or named stage. The JDBC driver does not support uploading to an external
stage.) Here is the method exposed in the `SnowflakeConnection` class:

Version 4.xVersion 3.x

```java
/**
  * Method to compress data from a stream and upload it at a stage location.
  * The data will be uploaded as one file. No splitting is done in this method.
  *
  * Caller is responsible for releasing the inputStream after the method is
  * called.
  *
  * @param stageName    stage name: e.g. ~ or table name or stage name
  * @param destPrefix   path / prefix under which the data should be uploaded on the stage
  * @param inputStream  input stream from which the data will be uploaded
  * @param destFileName destination file name to use
  * @param compressData compress data or not before uploading stream
  * @throws java.sql.SQLException failed to compress and put data from a stream at stage
  */
  void uploadStream(String stageName, String destFileName, InputStream inputStream)
      throws SQLException;

  void uploadStream(String stageName, String destFileName, InputStream inputStream,
                    UploadStreamConfig config)
      throws SQLException;
```

Sample usage:

```java
Connection connection = DriverManager.getConnection(url, prop);
File file = new File("/tmp/test.csv");
FileInputStream fileInputStream = new FileInputStream(file);

// upload file stream to user stage
UploadStreamConfig config = UploadStreamConfig.builder()
    .setDestPrefix("testUploadStream")
    .setCompressData(true)
    .build();

connection.unwrap(SnowflakeConnection.class).uploadStream("MYSTAGE", "destFile.csv",
  fileInputStream, config);
```

```java
/**
  * Method to compress data from a stream and upload it at a stage location.
  * The data will be uploaded as one file. No splitting is done in this method.
  *
  * Caller is responsible for releasing the inputStream after the method is
  * called.
  *
  * @param stageName    stage name: e.g. ~ or table name or stage name
  * @param destPrefix   path / prefix under which the data should be uploaded on the stage
  * @param inputStream  input stream from which the data will be uploaded
  * @param destFileName destination file name to use
  * @param compressData compress data or not before uploading stream
  * @throws java.sql.SQLException failed to compress and put data from a stream at stage
  */
  public void uploadStream(String stageName,
                            String destPrefix,
                            InputStream inputStream,
                            String destFileName,
                            boolean compressData)
      throws SQLException
```

Sample usage:

> ```java
> Connection connection = DriverManager.getConnection(url, prop);
> File file = new File("/tmp/test.csv");
> FileInputStream fileInputStream = new FileInputStream(file);
>
> // upload file stream to user stage
> connection.unwrap(SnowflakeConnection.class).uploadStream("MYSTAGE", "testUploadStream",
>     fileInputStream, "destFile.csv", true);
> ```

Code written for JDBC Driver versions prior to 3.9.2 might cast `SnowflakeConnectionV1` rather than
unwrap `SnowflakeConnection.class`. For example:

> ```java
> ...
>
> // For versions prior to 3.9.2:
> // upload file stream to user stage
> ((SnowflakeConnectionV1) connection.uploadStream("MYSTAGE", "testUploadStream",
>     fileInputStream, "destFile.csv", true));
> ```

> **Note:**
>
> Customers using newer versions of the driver should update their code to use `unwrap`.

### Download data files directly from an internal stage to a stream

You can download data files using the GET command. However, sometimes it makes sense to transfer data directly from a
file in an internal (i.e. Snowflake) stage to a stream. (The [stage](../../user-guide/data-load-local-file-system-create-stage.md)
can be any internal stage type: table stage, user stage, or named stage. The JDBC driver does not support downloading to an
external stage.) Here is the method exposed in the `SnowflakeConnection` class:

Version 4.xVersion 3.x

```java
/**
  * Download a file from a Snowflake stage as a stream with required parameters only.
  *
  * <p>This is a convenience method that uses default options (no decompression). For advanced
  * configuration, use {@link #downloadStream(String, String, DownloadStreamConfig)}.
  *
  * <p>The caller is responsible for closing the returned input stream.
  *
  * @param stageName the name of the stage (e.g., "@my_stage")
  * @param sourceFileName the path to the file within the stage
  * @return an input stream containing the file data
  * @throws SQLException if download fails
  */
  InputStream downloadStream(String stageName, String sourceFileName) throws SQLException;

  /**
  * Download a file from a Snowflake stage as a stream with optional configuration.
  *
  * <p>This method allows customization of download behavior via {@link DownloadStreamConfig}, such
  * as automatic decompression.
  *
  * <p>The caller is responsible for closing the returned input stream.
  *
  * @param stageName the name of the stage (e.g., "@my_stage")
  * @param sourceFileName the path to the file within the stage
  * @param config optional configuration for download behavior
  * @return an input stream containing the file data
  * @throws SQLException if download fails
  */
  InputStream downloadStream(String stageName, String sourceFileName, DownloadStreamConfig config)
      throws SQLException;
```

Sample usage:

```java
Connection connection = DriverManager.getConnection(url, prop);
DownloadStreamConfig config = DownloadStreamConfig.builder()
    .setDecompress(true)
    .build();

InputStream out = connection.unwrap(SnowflakeConnection.class).downloadStream(
    "~",
    DEST_PREFIX + "/" + TEST_DATA_FILE + ".gz",
    config);
```

```java
/**
 * Download file from the given stage and return an input stream
 *
 * @param stageName      stage name
 * @param sourceFileName file path in stage
 * @param decompress     true if file compressed
 * @return an input stream
 * @throws SnowflakeSQLException if any SQL error occurs.
 */
InputStream downloadStream(String stageName,
                           String sourceFileName,
                           boolean decompress) throws SQLException;
```

Sample usage:

```java
Connection connection = DriverManager.getConnection(url, prop);
InputStream out = connection.unwrap(SnowflakeConnection.class).downloadStream(
    "~",
    DEST_PREFIX + "/" + TEST_DATA_FILE + ".gz",
    true);
```

Code written for JDBC Driver versions prior to 3.9.2 might cast `SnowflakeConnectionV1` rather than
unwrap `SnowflakeConnection.class`. For example:

```java
...

// For versions prior to 3.9.2:
// download file stream to user stage
((SnowflakeConnectionV1) connection.downloadStream(...));
```

## Multi-statement support

This section describes how to execute multiple statements in a single request using the [JDBC Driver](jdbc.md).

> **Note:**
>
> * Executing multiple statements in a single query requires that a valid warehouse is available in a session.
> * By default, Snowflake returns an error for queries issued with multiple statements to protect against [SQL injection](https://en.wikipedia.org/wiki/SQL_injection) .
>   Executing multiple statements in a single query increases the risk of SQL injection. Snowflake recommends using it sparingly.
>   To reduce the SQL injection risk, use the `SnowflakeStatement` class’s `setParameter()` method to specify the number of
>   statements to be executed, which makes it more difficult to inject a statement by appending it. For more information about `SnowflakeStatement`, see
>   [Interface: SnowflakeStatement](jdbc-api.md).

### Sending multiple statements and handling results

Queries containing multiple statements can be executed the same way as queries with a single statement, except that the
query string contains multiple statements separated by semicolons.

There are two ways to allow multiple statements:

* Call Statement.setParameter(“MULTI_STATEMENT_COUNT”, n) to specify how many statements at a time this Statement
  should be allowed to execute. See below for more details.
* Set the [MULTI_STATEMENT_COUNT](../../sql-reference/parameters.md) parameter at the session level or
  the account level by executing one of the following commands:

  > ```sqlexample
  > alter session set MULTI_STATEMENT_COUNT = 0;
  > ```
  >
  > Or:
  >
  > ```sqlexample
  > alter account set MULTI_STATEMENT_COUNT = 0;
  > ```

  Setting the parameter to 0 allows an unlimited number of statements. Setting the parameter to 1 allows only one
  statement at a time.

In order to make SQL Injection attacks more difficult, users can call the `setParameter` method to
specify the number of statements to be executed in a single call, as shown below.
In this example, the number of statements to execute in a single call is 3:

> ```java
> // Specify the number of statements that we expect to execute.
> statement.unwrap(SnowflakeStatement.class).setParameter(
>         "MULTI_STATEMENT_COUNT", 3);
> ```

The default number of statements is 1; in other words, multi-statement mode is off.

To execute multiple statements without specifying the exact number, pass a value of 0.

The MULTI_STATEMENT_COUNT parameter is not part of the JDBC standard; it is a Snowflake extension. This parameter
affects more than one Snowflake driver/connector.

When multiple statements are executed in a single `execute()` call, the result of the first statement is
available through the standard `getResultSet()` and `getUpdateCount()` methods.
To access the results of the statements that follow, use the `getMoreResults()` method.
This method returns `true` when more statements are available for iteration, and `false` otherwise.

The example below sets the MULTI_STATEMENT_COUNT parameter, executes 3 statements, and retrieves update counts
and result sets:

> ```java
> // Create a string that contains multiple SQL statements.
> String command_string = "create table test(n int); " +
>                         "insert into test values (1), (2); " +
>                         "select * from test order by n";
> Statement stmt = connection.createStatement();
> // Specify the number of statements (3) that we expect to execute.
> stmt.unwrap(SnowflakeStatement.class).setParameter(
>         "MULTI_STATEMENT_COUNT", 3);
>
> // Execute all of the statements.
> stmt.execute(command_string);                       // false
>
> // --- Get results. ---
> // First statement (create table)
> stmt.getUpdateCount();                              // 0 (DDL)
>
> // Second statement (insert)
> stmt.getMoreResults();                              // true
> stmt.getUpdateCount();                              // 2
>
> // Third statement (select)
> stmt.getMoreResults();                              // true
> ResultSet rs = stmt.getResultSet();
> rs.next();                                          // true
> rs.getInt(1);                                       // 1
> rs.next();                                          // true
> rs.getInt(1);                                       // 2
> rs.next();                                          // false
>
> // Past the last statement executed.
> stmt.getMoreResults();                              // false
> stmt.getUpdateCount();                              // 0 (no more results)
> ```

Snowflake recommends using `execute()` for multi-statement queries.
The methods `executeQuery()` and `executeUpdate()` also support multiple statements, but will throw an exception if the first result is not the expected result type (result set and update count, respectively).

### Failed statements

If any of the SQL statements fails to compile or execute, execution is aborted. Any previous statements that ran before are unaffected.

For example, if the statements below are run as a single multi-statement query, the query will fail on the third statement, and an exception will be thrown.

> ```sqlexample
> CREATE OR REPLACE TABLE test(n int);
> INSERT INTO TEST VALUES (1), (2);
> INSERT INTO TEST VALUES ('not_an_int');  -- execution fails here
> INSERT INTO TEST VALUES (3);
> ```

If you were to then query the contents of table `test`, values `1` and `2` would be present.

### Unsupported features

PUT and GET statements are not supported for multi-statement queries.

Preparing statements and using bind variables are also not supported for multi-statement queries.

## Binding variables to statements

[Binding](../../sql-reference/bind-variables.md) allows a SQL statement to use a value that is stored in a Java variable.

### Simple binding

Without binding, a SQL statement specifies values by specifying literals inside the statement. For example, the following
statement uses the literal value `42` in an UPDATE statement:

> ```java
> stmt.execute("UPDATE table1 SET integer_column = 42 WHERE ID = 1000");
> ```

With binding, you can execute a SQL statement that uses a value that is inside a variable. For example:

> ```java
> int my_integer_variable = 42;
> PreparedStatement pstmt = connection.prepareStatement("UPDATE table1 SET integer_column = ? WHERE ID = 1000");
> pstmt.setInt(1, my_integer_variable);
> pstmt.executeUpdate();
> ```

The `?` inside the `VALUES` clause specifies that the SQL statement uses the value from a variable. The `setInt()` method
specifies that the first question mark in the SQL statement should be replaced with the value in the variable named
`my_integer_variable`. Note that `setInt()` uses 1-based, rather than 0-based values (i.e. the first question mark is
referenced by 1, not 0).

### Binding variables to timestamp columns

Snowflake supports three different variations for timestamps: [TIMESTAMP_LTZ , TIMESTAMP_NTZ , TIMESTAMP_TZ](../../sql-reference/data-types-datetime.md). When you call
`PreparedStatement.setTimestamp` to bind a variable to a timestamp column, the JDBC Driver interprets the timestamp value in
terms of the local time zone (`TIMESTAMP_LTZ`) or the time zone of the `Calendar` object passed in as an argument:

```java
// The following call interprets the timestamp in terms of the local time zone.
insertStmt.setTimestamp(1, myTimestamp);
// The following call interprets the timestamp in terms of the time zone of the Calendar object.
insertStmt.setTimestamp(1, myTimestamp, Calendar.getInstance(TimeZone.getTimeZone("America/New_York")));
```

If you want the driver to interpret the timestamp using a different variation (e.g. `TIMESTAMP_NTZ`), use one of the
following approaches:

* Set the session parameter [CLIENT_TIMESTAMP_TYPE_MAPPING](../../sql-reference/parameters.md) to the variation.

  Note that the parameter affects all binding operations for the current session. If you need to change the variation (e.g. back
  to `TIMESTAMP_LTZ`), you must set this session parameter again.
* (In the JDBC Driver 3.13.3 and later versions) Call the `PreparedStatement.setObject` method, and use the
  `targetSqlType` parameter to specify one of the following Snowflake timestamp variations:

  Version 4.xVersion 3.x

  + `SnowflakeType.EXTRA_TYPES_TIMESTAMP_LTZ`
  + `SnowflakeType.EXTRA_TYPES_TIMESTAMP_TZ`
  + `SnowflakeType.EXTRA_TYPES_TIMESTAMP_NTZ`
  + `SnowflakeType.EXTRA_TYPES_VECTOR`
  + `SnowflakeType.EXTRA_TYPES_DECFLOAT`
  + `SnowflakeType.EXTRA_TYPES_YEAR_MONTH_INTERVAL`
  + `SnowflakeType.EXTRA_TYPES_DAY_TIME_INTERVAL`

  For example:

  > ```java
  > import net.snowflake.client.api.resultset.SnowflakeType;
  > ...
  > insertStmt.setObject(1, myTimestamp, SnowflakeType.EXTRA_TYPES_TIMESTAMP_NTZ);
  > ```

  + `SnowflakeUtil.EXTRA_TYPES_TIMESTAMP_LTZ`
  + `SnowflakeUtil.EXTRA_TYPES_TIMESTAMP_TZ`
  + `SnowflakeUtil.EXTRA_TYPES_TIMESTAMP_NTZ`

  For example:

  > ```java
  > import net.snowflake.client.jdbc.SnowflakeUtil;
  > ...
  > insertStmt.setObject(1, myTimestamp, SnowflakeUtil.EXTRA_TYPES_TIMESTAMP_NTZ);
  > ```

### Batch inserts

In your Java application code, you can insert multiple rows in a single batch by binding parameters in an INSERT statement and
calling `addBatch()` and `executeBatch()`.

As an example, the following code inserts two rows into a table that contains an INTEGER column and a VARCHAR column. The example
binds values to the parameters in the INSERT statement and calls `addBatch()` and `executeBatch()` to perform a batch
insert.

> ```java
> Connection connection = DriverManager.getConnection(url, prop);
> connection.setAutoCommit(false);
>
> PreparedStatement pstmt = connection.prepareStatement("INSERT INTO t(c1, c2) VALUES(?, ?)");
> pstmt.setInt(1, 101);
> pstmt.setString(2, "test1");
> pstmt.addBatch();
>
> pstmt.setInt(1, 102);
> pstmt.setString(2, "test2");
> pstmt.addBatch();
>
> int[] count = pstmt.executeBatch(); // After execution, count[0]=1, count[1]=1
> connection.commit();
> ```

When you use this technique to insert a large number of values, the driver can improve performance by streaming the data (without
creating files on the local machine) to a temporary stage for ingestion. The driver automatically does this when the number of
values exceeds a threshold.

In addition, the current database and schema for the session must be set. If these are not set, the CREATE TEMPORARY STAGE command
executed by the driver can fail with the following error:

```none
CREATE TEMPORARY STAGE SYSTEM$BIND file_format=(type=csv field_optionally_enclosed_by='"')
Cannot perform CREATE STAGE. This session does not have a current schema. Call 'USE SCHEMA', or use a qualified name.
```

> **Note:**
>
> For alternative ways to load data into the Snowflake database (including bulk loading using the COPY command), see
> [Load data into Snowflake](../../guides-overview-loading-data.md).

## Java sample program

For a working sample written in Java, right-click the name of the file, [`SnowflakeJDBCExample.java`](../../_downloads/5394ceb64bb233d57d7d30a1bf741e5e/SnowflakeJDBCExample.java), and save the link/file to your local file system.

## Troubleshooting

### I/O error: Connection reset

In some cases, the JDBC Driver might fail with the following error message after a period of inactivity:

```none
I/O error: Connection reset
```

You can work around the problem by setting a specific “time to live” for the connections. If a connection is idle for longer than
the “time to live”, the JDBC Driver removes the connection from the connection pool and creates a new connection.

To set the time to live, set the Java system property named `net.snowflake.jdbc.ttl` to the number of seconds that the
connection should live:

* To set this property programmatically, call `System.setProperty`:

  ```java
  // Set the "time to live" to 60 seconds.
  System.setProperty("net.snowflake.jdbc.ttl", "60")
  ```
* To set this property when running the `java` command, use the `-D` flag:

  ```bash
  # Set the "time to live" to 60 seconds.
  java -cp .:snowflake-jdbc-<version>.jar -Dnet.snowflake.jdbc.ttl=60 <ClassName>
  ```

The default value of the `net.snowflake.jdbc.ttl` property is `-1`, which means that idle connections are not removed from
the connection pool.

### Handling errors

When handling errors and exceptions for a JDBC application, you can use the
[ErrorCode.java](https://github.com/snowflakedb/snowflake-jdbc/blob/master/src/main/java/net/snowflake/client/jdbc/ErrorCode.java)
file that Snowflake provides to determine the cause of the problems.
Error codes specific to the JDBC driver start with **2**, in the form: 2*NNNNN*.

> **Note:**
>
> The link to the **ErrorCode.java** in the public snowflake-jdbc git repository points to the latest version of the file, which might differ from
> the version of the JDBC driver you currently use.

---
title: Using the ODBC Driver
source: https://docs.snowflake.com/en/developer-guide/odbc/odbc-using.md
section: Developer Guide
---

# Using the ODBC Driver

This topic provides information about how to use the ODBC driver.

## Compiling your code

### Linux

* If a C/C++ application is built with the Snowflake ODBC driver library and loads a non-pthread-compatible
  library, the application could crash due to unsafe concurrent access to shared memory. To prevent this,
  compile the application with the option which ensures that only pthread-compatible libraries are loaded
  with the application.

  For gcc/g++, the option is “-pthread”.

### macOS

* If a C/C++ application is built with the Snowflake ODBC driver library and loads a non-pthread-compatible
  library, the application could crash due to unsafe concurrent access to shared memory. To prevent this,
  compile the application with the option which ensures that only pthread-compatible libraries are loaded
  with the application.

  For clang/clang++, the option is “-pthread”.

## Executing a batch of SQL statements (multi-statement support)

> **Note:**
>
> Executing multiple statements in a single query requires that a valid warehouse is available in a session.

In ODBC, you can send a batch of SQL statements (separated by semicolons) to execute in a single request. The following example sends a batch of three SELECT statements:

```cpp
// Sending a batch of SQL statements to be executed
rc = SQLExecDirect(hstmt,
      (SQLCHAR *) "select c1 from t1; select c2 from t2; select c3 from t3",
      SQL_NTS);
```

For more information about sending SQL statement batches, see
[Batches of SQL statements](https://docs.microsoft.com/en-us/sql/odbc/reference/develop-app/batches-of-sql-statements?view=sql-server-ver15).

To send a batch of statements with the Snowflake ODBC Driver, you must specify the number of statements in the batch. The Snowflake database requires the exact number of statements in order to guard against SQL injection attacks.

For more information about these types of attacks, see [SQL injection](https://en.wikipedia.org/wiki/SQL_injection).

The next section explains how to specify the number of statements in a batch.

### Specifying the number of statements in a batch

By default, the Snowflake database expects the driver to prepare and send a single statement for execution.

You can override this by specifying the number of statements in a batch for a given request or by enabling multiple statements for
the current session or account:

* To specify the number for a given request, call
  `SqlSetStmtAttr` to set the `SQL_SF_STMT_ATTR_MULTI_STATEMENT_COUNT` attribute to the number of statements in the batch.

  ```cpp
  // Specify that you want to execute a batch of 3 SQL statements
  rc = SQLSetStmtAttr(hstmt, SQL_SF_STMT_ATTR_MULTI_STATEMENT_COUNT, (SQLPOINTER)3, 0);
  ```

  If you want to use the setting for the current session or account (rather than specify the number for the request), set
  `SQL_SF_STMT_ATTR_MULTI_STATEMENT_COUNT` to `-1`.

  For more information, see the [SqlSetStmtAttr](https://docs.microsoft.com/en-us/sql/odbc/reference/syntax/sqlsetstmtattr-function?view=sql-server-ver15) documentation.
* To enable multiple statements for the current session, account, or user, execute the appropriate ALTER command and set the Snowflake
  [MULTI_STATEMENT_COUNT](../../sql-reference/parameters.md) parameter to `0` as shown in the following examples:

  > ```sqlexample
  > alter session set MULTI_STATEMENT_COUNT = 0;
  > ```
  >
  > ```sqlexample
  > alter account set MULTI_STATEMENT_COUNT = 0;
  > ```
  >
  > ```sqlexample
  > alter user set MULTI_STATEMENT_COUNT = 0;
  > ```
  >
  > By default, `MULTI_STATEMENT_COUNT` is set to `1`, which indicates that only one SQL statement can be executed.
  >
  > > **Note:**
  > >
  > > Setting the `MULTI_STATEMENT_COUNT` parameter at the account level also affects other Snowflake connectors and drivers that use the account (e.g. [the Snowflake JDBC Driver](../jdbc/jdbc-using.md)).

### Preparing a batch of SQL statements

The ODBC Driver supports the ability to prepare a batch of SQL statements (e.g. by calling the `SQLPrepare` function). Note
the following:

* If the statements have parameters, calling the `SQLNumParams` function returns the total number of parameters in all the statements in the batch.

  For more informaiton about parameters and the `SQLNumParams` function, see [Statement Parameters](https://docs.microsoft.com/en-us/sql/odbc/reference/develop-app/statement-parameters?view=sql-server-ver15) and [SQLNumParams Function](https://docs.microsoft.com/en-us/sql/odbc/reference/syntax/sqlnumparams-function?view=sql-server-ver15).
* Column information about the result set (e.g. data returned by `SQLNumResultCols`, `SQLDescribeCol`, `SQLColAttribute`, and
  `SQLColAttributes`) is available when you call `SQLExecute` or `SQLExecDirect`.

  Although some column information is available when you call SQLPrepare, the information might not be completely accurate, and
  subsequent calls to SQLExecute or SQLExecDirect might provide more accurate information.

### Limitations

GET and PUT commands are not supported in batches of SQL statements. When you send a batch of SQL statements with GET and PUT
comments to be executed, the GET and PUT commands are ignored, and no errors are reported.

## Binding parameters to array variables for batch inserts

In your application code, you can insert multiple rows in a single batch by [binding](../../sql-reference/bind-variables.md)
parameters in an INSERT statement to array variables.

As an example, the following code inserts rows into a table that contains an INTEGER column and a VARCHAR column. The example
binds arrays to the parameters in the INSERT statement.

> ```cpp
> SQLCHAR * Statement = "INSERT INTO t (c1, c2) VALUES (?, ?)";
>
> SQLSetStmtAttr(hstmt, SQL_ATTR_PARAM_BIND_TYPE, SQL_PARAM_BIND_BY_COLUMN, 0);
> SQLSetStmtAttr(hstmt, SQL_ATTR_PARAMSET_SIZE, ARRAY_SIZE, 0);
> SQLSetStmtAttr(hstmt, SQL_ATTR_PARAM_STATUS_PTR, ParamStatusArray, 0);
> SQLSetStmtAttr(hstmt, SQL_ATTR_PARAMS_PROCESSED_PTR, &ParamsProcessed, 0);
> SQLBindParameter(hstmt, 1, SQL_PARAM_INPUT, SQL_C_ULONG, SQL_INTEGER, 5, 0,
>                  IntValuesArray, 0, IntValuesIndArray);
> SQLBindParameter(hstmt, 2, SQL_PARAM_INPUT, SQL_C_CHAR, SQL_CHAR, STR_VALUE_LEN - 1, 0,
>                  StringValuesArray, STR_VALUE_LEN, StringValuesLenOrIndArray);
> ...
> SQLExecDirect(hstmt, Statement, SQL_NTS);
> ```

When you use this technique to insert a large number of values, the driver can improve performance by streaming the data (without
creating files on the local machine) to a temporary stage for ingestion. The driver automatically does this when the number of
values exceeds a threshold.

In addition, the current database and schema for the session must be set. If these are not set, the CREATE TEMPORARY STAGE command
executed by the driver can fail with the following error:

```none
CREATE TEMPORARY STAGE SYSTEM$BIND file_format=(type=csv field_optionally_enclosed_by='"')
Cannot perform CREATE STAGE. This session does not have a current schema. Call 'USE SCHEMA', or use a qualified name.
```

> **Note:**
>
> For alternative ways to load data into the Snowflake database (including bulk loading using the COPY command), see
> [Load data into Snowflake](../../guides-overview-loading-data.md).

---
title: Using the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/python-connector-example.md
section: Developer Guide
---

# Using the Python Connector

This topic provides a series of examples that illustrate how to use the Snowflake Connector to perform standard Snowflake operations such as user login, database and table creation, warehouse creation,
data insertion/loading, and querying.

The sample code at the end of this topic combines the examples into a single, working Python program.

> **Note:**
>
> Snowflake now provides first-class Python APIs for managing core Snowflake resources including databases, schemas, tables, tasks, and
> warehouses, without using SQL. For more information, see [Snowflake Python APIs: Managing Snowflake objects with Python](../snowflake-python-api/snowflake-python-overview.md).

## Creating a database, schema, and warehouse

After you log in, create a database, schema, and warehouse if they don’t yet exist, using the
[CREATE DATABASE](../../sql-reference/sql/create-database.md), [CREATE SCHEMA](../../sql-reference/sql/create-schema.md), and
[CREATE WAREHOUSE](../../sql-reference/sql/create-warehouse.md) commands.

The example below shows how to create a warehouse named `tiny_warehouse`, database named `testdb`, and a
schema named `testschema`. Note that when you create the schema, you must either specify the name of the
database in which to create the schema, or you must already be connected to the database in which to create the
schema. The example below executes a `USE DATABASE` command before the `CREATE SCHEMA` command to ensure
that the schema is created in the correct database.

> ```python
> conn.cursor().execute("CREATE WAREHOUSE IF NOT EXISTS tiny_warehouse_mg")
> conn.cursor().execute("CREATE DATABASE IF NOT EXISTS testdb_mg")
> conn.cursor().execute("USE DATABASE testdb_mg")
> conn.cursor().execute("CREATE SCHEMA IF NOT EXISTS testschema_mg")
> ```

## Using the database, schema, and warehouse

Specify the database and schema in which you want to create tables. Also specify the warehouse that will provide
resources for executing DML statements and queries.

For example, to use the database `testdb`, schema `testschema` and warehouse `tiny_warehouse` (created earlier):

> ```python
> conn.cursor().execute("USE WAREHOUSE tiny_warehouse_mg")
> conn.cursor().execute("USE DATABASE testdb_mg")
> conn.cursor().execute("USE SCHEMA testdb_mg.testschema_mg")
> ```

## Creating tables and inserting data

Use the [CREATE TABLE](../../sql-reference/sql/create-table.md) command to create tables and the [INSERT](../../sql-reference/sql/insert.md) command to populate the tables with data.

For example, create a table named `testtable` and insert two rows into the table:

> ```python
> conn.cursor().execute(
>     "CREATE OR REPLACE TABLE "
>     "test_table(col1 integer, col2 string)")
>
> conn.cursor().execute(
>     "INSERT INTO test_table(col1, col2) VALUES " +
>     "    (123, 'test string1'), " +
>     "    (456, 'test string2')")
> ```

## Loading data

Instead of inserting data into tables using individual [INSERT](../../sql-reference/sql/insert.md) commands, you can bulk load data from files staged in either an internal or external location.

### Copying data from an internal location

To load data from files on your host machine into a table, first use the [PUT](../../sql-reference/sql/put.md) command to stage the file in an internal location, then use the
[COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command to copy the data in the files into the table.

For example:

> ```python
> # Putting Data
> con.cursor().execute("PUT file:///tmp/data/file* @%testtable")
> con.cursor().execute("COPY INTO testtable")
> ```
>
> Where your CSV data is stored in a local directory named `/tmp/data` in a Linux or macOS environment, and the directory contains files named `file0`, `file1`, … `file100`.

### Copying data from an external location

To load data from files already staged in an external location (i.e. your S3 bucket) into a table, use the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command.

For example:

> ```python
> # Copying Data
> con.cursor().execute("""
> COPY INTO testtable FROM s3://<s3_bucket>/data/
>     STORAGE_INTEGRATION = myint
>     FILE_FORMAT=(field_delimiter=',')
> """.format(
>     aws_access_key_id=AWS_ACCESS_KEY_ID,
>     aws_secret_access_key=AWS_SECRET_ACCESS_KEY))
> ```
>
> Where:
>
> * `s3://<s3_bucket>/data/` specifies the name of your S3 bucket
> * The files in the bucket are prefixed with `data`.
> * The bucket is accessed using a storage integration created using [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md) by an account administrator (i.e. a user with the ACCOUNTADMIN role) or a role with the global CREATE INTEGRATION privilege. A storage integration allows users to avoid supplying credentials to access a private storage location.

> **Note:**
>
> This example uses the format() function to compose the statement. If your environment has a risk of SQL injection
> attacks, you might prefer to bind values rather than use format().

## Querying data

With the Snowflake Connector for Python, you can submit:

* a synchronous query, which returns control to your application after
  the query completes.
* an asynchronous query, which returns control to your application
  before the query completes.

After the query has completed, you use the `Cursor` object to
fetch the values in the results. By default, the Snowflake Connector for
Python converts the values from [Snowflake data types](../../sql-reference-data-types.md) to native Python data types. (Note that
you can choose to return the values as strings and perform the type conversions in your application.
See Improving query performance by bypassing data conversion.)

> **Note:**
>
> By default, values from NUMBER columns are returned as double-precision floating-point values (`float64`). To return these
> as decimal values (`decimal.Decimal`) in the [`fetch_pandas_all()`](python-connector-api.md "fetch_pandas_all") and [`fetch_pandas_batches()`](python-connector-api.md "fetch_pandas_batches") methods, set
> the `arrow_number_to_decimal` parameter in the [`connect()`](python-connector-api.md "connect") method to `True`.

### Performing a synchronous query

To perform a synchronous query, call the [`execute()`](python-connector-api.md "execute") method in the `Cursor` object. For example:

```python
conn = snowflake.connector.connect( ... )
cur = conn.cursor()
cur.execute('select * from products')
```

Use the `Cursor` object to fetch the values in the results, as explained in
Using cursor to fetch values.

### Performing an asynchronous query

The Snowflake Connector for Python supports asynchronous queries (i.e. queries that return control to the user before the query
completes). You can submit an asynchronous query and use polling to determine when the query has completed. After the query
completes, you can get the results.

> **Note:**
>
> To perform asynchronous queries, you must ensure the `ABORT_DETACHED_QUERY` configuration parameter is `FALSE` (default value).
>
> If the connection to client is lost:
>
> * For synchronous queries, all in-progress synchronous queries are aborted immediately regardless of the parameter value.
> * For asynchronous queries:
>
>   + If ABORT_DETACHED_QUERY is set to `FALSE`, in-progress asynchronous queries continue to run until they end normally.
>   + If ABORT_DETACHED_QUERY is set to `TRUE`, Snowflake automatically aborts all in-progress asynchronous queries when a client connection is not re-established after five minutes.
>
>     You can prevent the asynchronous query from being aborted at the five minute mark by calling `cursor.query_result(queryId)`. While this call does not retrieve the actual query result as the query is still running, it does prevent the query from being canceled. Invoking `query_result` is a synchronous operation, which might or might not be appropriate for your particular use case.

With this feature, you can submit multiple queries in parallel without waiting for each query to complete. You can also run a
combination of synchronous and asynchronous queries during the same session.

> **Note:**
>
> Executing multiple statements in a single query requires that a valid warehouse is available in a session.

Finally, you can submit an asynchronous query from one connection and check the results from a different connection. For example,
a user can initiate a long-running query from your application, exit the application, and restart the application at a later time
to check the results.

To better understand the hierarchy of the drivers’ business logic and the ABORT_DETACHED_QUERY parameter’s interaction, see the following flowchart:

#### Submitting an asynchronous query

> **Note:**
>
> Asynchronous queries don’t support PUT/GET statements.

When `cursor.execute_async(query)` is used, the Snowflake Python driver automatically keeps track of the queries submitted asynchronously. When the connection is explicitly closed with `connection.close()` or the context manager is used with `connect()...`, the list of async queries is examined and, if any of them are still running, the Snowflake-side session is not deleted.

If no async queries are running within the same connection, the Snowflake session belonging to the connection is logged out when `connection.close()` is called, which implicitly cancels all other queries running in the same session.

This behavior also depends on the SQL [ABORT_DETACHED_QUERY](../../sql-reference/parameters.md) parameter.

As a best practice, isolate all long-running async tasks (especially those intended to continue after the connection is closed) into a separate connection.

You can use the `server_session_keep_alive` (default: `False`) connection parameter to override this automatic behavior. By default, the Snowflake session is logged out when `connection.close()` is called *only* when no async queries are running in it. The default behavior doesn’t consider or track sync queries.

When `server_session_keep_alive=True`, `connection.close()` won’t log out the Snowflake session, regardless of the status of any queries. For connections designed to issue long-running asynchronous queries, enabling this setting can reduce CPU overhead and accelerate the connection-closing process.

> **Important:**
>
> Enabling this parameter might have unexpected, billable effects (for example, it might leave queries running up to the configured value of [STATEMENT_TIMEOUT_IN_SECONDS](../../sql-reference/parameters.md)). Snowflake strongly recommends that you carefully decide whether changing the value of `server_session_keep_alive` from the default is needed and, if possible, thoroughly test the change in non-production environments before implementing it in production.

To submit an asynchronous query, call the [`execute_async()`](python-connector-api.md "execute_async") method in the `Cursor` object. For example:

```python
conn = snowflake.connector.connect( ... )
cur = conn.cursor()
# Submit an asynchronous query for execution.
cur.execute_async('select count(*) from table(generator(timeLimit => 25))')
```

After submitting the query:

* To determine if the query is still running, see Checking the status of a query.
* To retrieve the results of the query, see Using the query ID to retrieve the results of a query.

For examples of performing asynchronous queries, see Examples of asynchronous queries.

#### Best practices for asynchronous queries

When submitting an asynchronous query, follow these best practices:

* Ensure that you know which queries are dependent upon other queries before you run any queries in parallel. Some queries are
  interdependent and order sensitive, and therefore not suitable for parallelizing. For example, obviously an INSERT statement
  should not start until after the corresponding CREATE TABLE statement has finished.
* Ensure that you do not run too many queries for the memory that you have available. Running multiple queries in parallel
  typically consumes more memory, especially if more than one set of results is stored in memory at the same time.
* When polling, handle the rare cases where a query does not succeed.
* Ensure that transaction control statements (BEGIN, COMMIT, and ROLLBACK) do not execute in parallel with other statements.
* Be aware that asynchronous queries are not guaranteed to return ordered results, even if the SQL itself has an ORDER BY clause. Consequently, the `result_scan` function does not guarantee ordered results.

### Retrieving the Snowflake query ID

A query ID identifies each query executed by Snowflake. When you use the Snowflake Connector for Python to execute a query, you
can access the query ID through the [`sfqid`](python-connector-api.md "sfqid") attribute in the `Cursor` object:

> ```python
> # Retrieving a Snowflake Query ID
> cur = con.cursor()
> cur.execute("SELECT * FROM testtable")
> print(cur.sfqid)
> ```

You can use the query ID to:

* Check the status of the query in the web interface.

  In the Snowsight, query IDs are displayed in the Query History page.
  See [Monitor query activity with Query History](../../user-guide/ui-snowsight-activity.md).
* Programmatically check the status of the query (e.g. to determine if an asynchronous query has completed).

  See Checking the status of a query.
* Retrieve the results of an asynchronous query or a previously submitted synchronous query.

  See Using the query ID to retrieve the results of a query.
* Cancel a running query.

  See Canceling a query by query ID.

### Checking the status of a query

To check the status of a query:

1. Get the query ID from the [`sfqid`](python-connector-api.md "sfqid") field in the `Cursor` object.
2. Pass the query ID to the [`get_query_status()`](python-connector-api.md "get_query_status") method of the `Connection` object to return the
   [`QueryStatus`](python-connector-api.md "QueryStatus") enum constant that represents the status of the query.

   By default, `get_query_status()` does not raise an error if the query resulted in an error. If you want an error raised,
   call [`get_query_status_throw_if_error()`](python-connector-api.md "get_query_status_throw_if_error") instead.
3. Use the `QueryStatus` enum constant to check the status of the query.

   * To determine if the query is still running (for example, if this is an asynchronous query), pass the constant to the
     [`is_still_running()`](python-connector-api.md "is_still_running") method of the `Connection` object.
   * To determine if an error occurred, pass the constant to the [`is_an_error()`](python-connector-api.md "is_an_error") method.

   For the full list of enum constants, see [`QueryStatus`](python-connector-api.md "QueryStatus").

The following example executes an asynchronous query and checks the status of the query:

```python
import time
...
# Execute a long-running query asynchronously.
cur.execute_async('select count(*) from table(generator(timeLimit => 25))')
...
# Wait for the query to finish running.
query_id = cur.sfqid
while conn.is_still_running(conn.get_query_status(query_id)):
  time.sleep(1)
```

The following example raises an error if the query has resulted in an error:

```python
from snowflake.connector import ProgrammingError
import time
...
# Wait for the query to finish running and raise an error
# if a problem occurred with the execution of the query.
try:
  query_id = cur.sfqid
  while conn.is_still_running(conn.get_query_status_throw_if_error(query_id)):
    time.sleep(1)
except ProgrammingError as err:
  print('Programming Error: {0}'.format(err))
```

### Using the query ID to retrieve the results of a query

> **Note:**
>
> If you performed a synchronous query by calling the [`execute()`](python-connector-api.md "execute")
> method on a `Cursor` object, you don’t need to use the query ID to retrieve the results. You can just fetch the values
> from the results, as explained in Using cursor to fetch values.

If you want to retrieve the results of an asynchronous query or a previously submitted synchronous query, follow these steps:

1. Get the query ID of the query. See Retrieving the Snowflake query ID.
2. Call the [`get_results_from_sfqid()`](python-connector-api.md "get_results_from_sfqid") method in the `Cursor` object to retrieve the results.
3. Use the `Cursor` object to fetch the values in the results, as explained in
   Using cursor to fetch values.

Note that if the query is still running, the fetch methods (`fetchone()`, `fetchmany()`, `fetchall()`, etc.)
will wait for the query to complete.

For example:

```python
# Get the results from a query.
cur.get_results_from_sfqid(query_id)
results = cur.fetchall()
print(f'{results[0]}')
```

### Using `cursor` to fetch values

Fetch values from a table using the cursor object iterator method.

For example, to fetch columns named “col1” and “col2” from the table
named `testtable`, which was created earlier
(in Creating tables and inserting data),
use code similar to the following:

> ```python
> cur = conn.cursor()
> try:
>     cur.execute("SELECT col1, col2 FROM test_table ORDER BY col1")
>     for (col1, col2) in cur:
>         print('{0}, {1}'.format(col1, col2))
> finally:
>     cur.close()
> ```

Alternatively, the Snowflake Connector for Python provides a convenient shortcut:

> ```python
> for (col1, col2) in con.cursor().execute("SELECT col1, col2 FROM testtable"):
>     print('{0}, {1}'.format(col1, col2))
> ```

If you need to get a single result (i.e. a single row), use the `fetchone` method:

> ```python
> col1, col2 = con.cursor().execute("SELECT col1, col2 FROM testtable").fetchone()
> print('{0}, {1}'.format(col1, col2))
> ```

If you need to get the specified number of rows at a time, use the `fetchmany` method with the number of rows:

> ```python
> cur = con.cursor().execute("SELECT col1, col2 FROM testtable")
> ret = cur.fetchmany(3)
> print(ret)
> while len(ret) > 0:
>     ret = cur.fetchmany(3)
>     print(ret)
> ```
>
> > **Note:**
> >
> > Use `fetchone` or `fetchmany` if the result set is too large
> > to fit into memory.

If you need to get all results at once:

> ```python
> results = con.cursor().execute("SELECT col1, col2 FROM testtable").fetchall()
> for rec in results:
>     print('%s, %s' % (rec[0], rec[1]))
> ```

To set a timeout for a query, execute a “begin” command and include a timeout parameter on the query. If the query exceeds the length of the parameter value, an error is produced and a rollback occurs.

In the following code, error 604 means the query was canceled. The timeout parameter starts `Timer()` and cancels if the query does not finish within the specified time.

> ```python
> conn.cursor().execute("create or replace table testtbl(a int, b string)")
>
> conn.cursor().execute("begin")
> try:
>    conn.cursor().execute("insert into testtbl(a,b) values(3, 'test3'), (4,'test4')", timeout=10) # long query
>
> except ProgrammingError as e:
>    if e.errno == 604:
>       print("timeout")
>       conn.cursor().execute("rollback")
>    else:
>       raise e
> else:
>    conn.cursor().execute("commit")
> ```

### Using `DictCursor` to fetch values by column name

If you want to fetch a value by column name, create a `cursor` object of type `DictCursor`.

For example:

> ```python
> # Querying data by DictCursor
> from snowflake.connector import DictCursor
> cur = con.cursor(DictCursor)
> try:
>     cur.execute("SELECT col1, col2 FROM testtable")
>     for rec in cur:
>         print('{0}, {1}'.format(rec['COL1'], rec['COL2']))
> finally:
>     cur.close()
> ```

### Examples of asynchronous queries

The following is a simple example of an asynchronous query:

```python
from snowflake.connector import ProgrammingError
import time

conn = snowflake.connector.connect( ... )
cur = conn.cursor()

# Submit an asynchronous query for execution.
cur.execute_async('select count(*) from table(generator(timeLimit => 25))')

# Retrieve the results.
cur.get_results_from_sfqid(query_id)
results = cur.fetchall()
print(f'{results[0]}')
```

The next example submits an asynchronous query from one connection and retrieves the results from a different connection:

```python
from snowflake.connector import ProgrammingError
import time

conn = snowflake.connector.connect( ... )
cur = conn.cursor()

# Submit an asynchronous query for execution.
cur.execute_async('select count(*) from table(generator(timeLimit => 25))')

# Get the query ID for the asynchronous query.
query_id = cur.sfqid

# Close the cursor and the connection.
cur.close()
conn.close()

# Open a new connection.
new_conn = snowflake.connector.connect( ... )

# Create a new cursor.
new_cur = new_conn.cursor()

# Retrieve the results.
new_cur.get_results_from_sfqid(query_id)
results = new_cur.fetchall()
print(f'{results[0]}')
```

### Canceling a query by query ID

Cancel a query by query ID:

> ```python
> cur = cn.cursor()
>
> try:
>   cur.execute(r"SELECT SYSTEM$CANCEL_QUERY('queryID')")
>   result = cur.fetchall()
>   print(len(result))
>   print(result[0])
> finally:
>   cur.close()
> ```

Replace the string “queryID” with the actual query ID. To get the ID for a query, see
Retrieving the Snowflake query ID.

### Improving query performance by bypassing data conversion

To improve query performance, use the `SnowflakeNoConverterToPython` class in the `snowflake.connector.converter_null` module to bypass
data conversions from the Snowflake internal data type to the native Python data type, e.g.:

> ```python
> from snowflake.connector.converter_null import SnowflakeNoConverterToPython
>
> con = snowflake.connector.connect(
>     ...
>     converter_class=SnowflakeNoConverterToPython
> )
> for rec in con.cursor().execute("SELECT * FROM large_table"):
>     # rec includes raw Snowflake data
> ```

As a result, all data is represented in string form such that the application is responsible for
converting it to the native Python data types. For example, `TIMESTAMP_NTZ` and `TIMESTAMP_LTZ`
data are the epoch time represented in string form, and `TIMESTAMP_TZ` data is the epoch time followed by a space
followed by the offset to UTC in minutes represented in string form.

No impact is made to binding data; Python native data can still be bound for updates.

## Downloading data

Snowflake Connector for Python version 3.14.0 introduced the `unsafe_file_write` connection parameter that specifies how the connector should set file permissions when downloading files for a Snowflake stage with the GET command. These files are always owned by the same user who runs the Python process.

By default, the `unsafe_file_write` parameter is `False` to provide a more secure and strict `600` file permission, which means that only the owner has read and write permissions of the downloaded files.
Other groups and users have no permissions for the files downloaded with the GET command.

If your organization requires less restrictive file permissions for the files, you can set the `unsafe_file_write` parameter to `True`.
Enabling this parameter sets the file permissions for the files downloaded from a stage to `644`, which allows the owner to read and write the files, but allows others only to read them.
This setting might be necessary, for example, for some ETL tools that run under a different system user who needs to be able to read and process the downloaded files.

If you are unsure of which value to use, consult with the team responsible for your organization’s applicable security policy.

## Binding data

To specify values to be used in a SQL statement, you can include literals in the statement, or you can
[bind variables](../../sql-reference/bind-variables.md). When you bind variables, you put one or more
placeholders in the text of the SQL statement, and then specify the variable (the value to be used)
for each placeholder.

The following example contrasts the use of literals and binding:

> Literals:
>
> ```python
> con.cursor().execute("INSERT INTO testtable(col1, col2) VALUES(789, 'test string3')")
> ```
>
> Binding:
>
> ```python
> con.cursor().execute(
>     "INSERT INTO testtable(col1, col2) "
>     "VALUES(%s, %s)", (
>         789,
>         'test string3'
>     ))
> ```

> **Note:**
>
> There is an upper limit to the size of data that you can bind, or that you can combine in a batch. For details, see [Limits on Query Text Size](../../user-guide/query-size-limits.md).

Snowflake supports the following types of binding:

* `pyformat` and `format`, which bind data on the client.
* `qmark` and `numeric`, which bind data on the server.

Each of these is explained below.

### `pyformat` or `format` binding

Both `pyformat` binding and `format` binding bind data on the client side rather than on the server side.

By default, the Snowflake Connector for Python supports both `pyformat` and `format`, so you can use `%(name)s` or `%s` as the
placeholder. For example:

* Using `%(name)s` as the placeholder:

  > ```python
  > conn.cursor().execute(
  >     "INSERT INTO test_table(col1, col2) "
  >     "VALUES(%(col1)s, %(col2)s)", {
  >         'col1': 789,
  >         'col2': 'test string3',
  >         })
  > ```
* Using `%s` as the placeholder:

  > ```python
  > con.cursor().execute(
  >     "INSERT INTO testtable(col1, col2) "
  >     "VALUES(%s, %s)", (
  >         789,
  >         'test string3'
  >     ))
  > ```

With `pyformat` and `format`, you can also use a list object to bind data for the IN operator:

> ```python
> # Binding data for IN operator
> con.cursor().execute(
>     "SELECT col1, col2 FROM testtable"
>     " WHERE col2 IN (%s)", (
>         ['test string1', 'test string3'],
>     ))
> ```

The percent character (“%”) is both a wildcard character for SQL LIKE and a format binding character for Python. If you
use format binding, and if your SQL command contains the percent character, you might need to escape the percent
character. For example, if your SQL statement is:

> ```sqlexample
> SELECT col1, col2
>     FROM test_table
>     WHERE col2 ILIKE '%York' LIMIT 1;  -- Find York, New York, etc.
> ```

then your Python code should look like the following (note the extra percent sign to escape the original percent sign):

> ```python
> sql_command = "select col1, col2 from test_table "
> sql_command += " where col2 like '%%York' limit %(lim)s"
> parameter_dictionary = {'lim': 1 }
> cur.execute(sql_command, parameter_dictionary)
> ```

### `qmark` or `numeric` binding

Both `qmark` binding and `numeric` binding bind data on the server side rather than on the client side:

* For `qmark` binding, use a question mark character (`?`) to indicate where in the string you want a variable’s value
  inserted.
* For `numeric` binding, use a colon (`:`) followed by a number to indicate the position of the variable that you want
  substituted at that position. For example, `:2` specifies the second variable.

  Use numeric binding to bind the same value more than once in the same query. For example, if you have a long VARCHAR or BINARY
  or [semi-structured](../../sql-reference/data-types-semistructured.md) value that you want to use more than once, then `numeric`
  binding allows you to send the value to the server once and use it multiple times.

The next sections explain how to use `qmark` and `numeric` binding:

* Using qmark or numeric binding
* Using qmark or numeric binding with datetime objects
* Using bind variables with the IN operator

#### Using `qmark` or `numeric` binding

To use `qmark` or `numeric` style binding, you can either execute one of the following or set `paramstyle` as part of the connection parameters when calling `connect()`.

* `snowflake.connector.paramstyle='qmark'`
* `snowflake.connector.paramstyle='numeric'`

If you set `paramstyle` to `qmark` or `numeric`, you must use `?` or `:N` (where `N` is replaced
with a number) as the placeholders, respectively.

For example:

* Using `?` as the placeholder:

  > ```python
  > from snowflake.connector import connect
  >
  > connection_parameters = {
  >     'account': 'xxxxx',
  >     'user': 'xxxx',
  >     'password': 'xxxxxx',
  >     "host": "xxxxxx",
  >     "port": 443,
  >     'protocol': 'https',
  >     'warehouse': 'xxx',
  >     'database': 'xxx',
  >     'schema': 'xxx',
  >     'paramstyle': 'qmark'  # note paramstyle setting here at connection level
  > }
  >
  > con = connect(**connection_parameters)
  >
  > con.cursor().execute(
  >     "INSERT INTO testtable2(col1,col2,col3) "
  >     "VALUES(?,?,?)", (
  >         987,
  >         'test string4',
  >         ("TIMESTAMP_LTZ", datetime.now())
  >     )
  > )
  > ```
* Using `:N` as the placeholder:

  > ```python
  > import snowflake.connector
  >
  > snowflake.connector.paramstyle='numeric'
  >
  > con = snowflake.connector.connect(...)
  >
  > con.cursor().execute(
  >     "INSERT INTO testtable(col1, col2) "
  >     "VALUES(:1, :2)", (
  >         789,
  >         'test string3'
  >     ))
  > ```
  >
  > The following query shows how to use `numeric` binding to reuse a variable:
  >
  > ```python
  > con.cursor().execute(
  >     "INSERT INTO testtable(complete_video, short_sample_of_video) "
  >     "VALUES(:1, SUBSTRING(:1, :2, :3))", (
  >         binary_value_that_stores_video,          # variable :1
  >         starting_offset_in_bytes_of_video_clip,  # variable :2
  >         length_in_bytes_of_video_clip            # variable :3
  >     ))
  > ```

#### Using `qmark` or `numeric` binding with `datetime` objects

When using `qmark` or `numeric` binding to bind data to a Snowflake TIMESTAMP data type, set the bind variable to a tuple that
specifies the Snowflake timestamp data type (`TIMESTAMP_LTZ` or `TIMESTAMP_TZ`) and the value. For example:

> ```python
> import snowflake.connector
>
> snowflake.connector.paramstyle='qmark'
>
> con = snowflake.connector.connect(...)
>
> con.cursor().execute(
>     "CREATE OR REPLACE TABLE testtable2 ("
>     "   col1 int, "
>     "   col2 string, "
>     "   col3 timestamp_ltz"
>     ")"
> )
>
> con.cursor().execute(
>     "INSERT INTO testtable2(col1,col2,col3) "
>     "VALUES(?,?,?)", (
>         987,
>         'test string4',
>         ("TIMESTAMP_LTZ", datetime.now())
>     )
>  )
> ```

Unlike client side binding, the server side binding requires the Snowflake data type for the column. Most common Python data types
already have implicit mappings to Snowflake data types (e.g. `int` is mapped to `FIXED`). However, because the Python
`datetime` data can be bound to one of multiple Snowflake data types (`TIMESTAMP_NTZ`, `TIMESTAMP_LTZ`,
or `TIMESTAMP_TZ`), and the default mapping is `TIMESTAMP_NTZ`, you must specify the Snowflake data type to use.

#### Using bind variables with the IN operator

`qmark` and `numeric` (server side binding) do not support the use of bind variables with the IN operator.

If you need to use bind variables with the IN operator, use
client side binding (`pyformat` or `format`).

### Binding parameters to variables for batch inserts

In your application code, you can insert multiple rows in a single batch. To do this, use parameters for values in an INSERT
statement. For example, the following statement uses placeholders for `qmark` binding in an INSERT statement:

> ```sqlexample
> insert into grocery (item, quantity) values (?, ?)
> ```

Then, to specify the data that should be inserted, define a variable that is a sequence of sequences (for example, a list of
tuples):

> ```python
> rows_to_insert = [('milk', 2), ('apple', 3), ('egg', 2)]
> ```

As shown in the example above, each item in the list is a tuple that contains the column values for a row to be inserted.

To perform the binding, call the [`executemany()`](python-connector-api.md "executemany") method, passing the variable as the second argument. For example:

> ```python
> conn = snowflake.connector.connect( ... )
> rows_to_insert = [('milk', 2), ('apple', 3), ('egg', 2)]
> conn.cursor().executemany(
>     "insert into grocery (item, quantity) values (?, ?)",
>     rows_to_insert)
> ```

If you are binding data on the server (i.e. by using `qmark` or
`numeric` binding), the connector can optimize the performance of batch inserts through binding.

When you use this technique to insert a large number of values, the driver can improve performance by streaming the data (without
creating files on the local machine) to a temporary stage for ingestion. The driver automatically does this when the number of
values exceeds a threshold.

In addition, the current database and schema for the session must be set. If these are not set, the CREATE TEMPORARY STAGE command
executed by the driver can fail with the following error:

```none
CREATE TEMPORARY STAGE SYSTEM$BIND file_format=(type=csv field_optionally_enclosed_by='"')
Cannot perform CREATE STAGE. This session does not have a current schema. Call 'USE SCHEMA', or use a qualified name.
```

> **Note:**
>
> For alternative ways to load data into the Snowflake database (including bulk loading using the COPY command), see
> [Load data into Snowflake](../../guides-overview-loading-data.md).

### Avoid SQL injection attacks

Avoid binding data using Python’s formatting function because you risk SQL injection. For example:

> ```python
> # Binding data (UNSAFE EXAMPLE)
> con.cursor().execute(
>     "INSERT INTO testtable(col1, col2) "
>     "VALUES(%(col1)d, '%(col2)s')" % {
>         'col1': 789,
>         'col2': 'test string3'
>     })
> ```
>
> ```python
> # Binding data (UNSAFE EXAMPLE)
> con.cursor().execute(
>     "INSERT INTO testtable(col1, col2) "
>     "VALUES(%d, '%s')" % (
>         789,
>         'test string3'
>     ))
> ```
>
> ```python
> # Binding data (UNSAFE EXAMPLE)
> con.cursor().execute(
>     "INSERT INTO testtable(col1, col2) "
>     "VALUES({col1}, '{col2}')".format(
>         col1=789,
>         col2='test string3')
>     )
> ```

Instead, store the values in variables and then bind those variables using qmark or numeric binding style.

## Retrieving column metadata

To retrieve metadata about each column in the result set (e.g. the name, type, precision, scale, etc. of each column), use one of
the following approaches:

* To access the metadata after calling the [`execute()`](python-connector-api.md "execute") method to execute the query, use the [`description`](python-connector-api.md "description")
  attribute of the `Cursor` object.
* To access the metadata without having to execute the query, call the [`describe()`](python-connector-api.md "describe") method.

  The `describe` method is available in the Snowflake Connector for Python 2.4.6 and more recent versions.

The `description` attribute is set to one of the following values:

* **Version 2.4.5 and earlier:** A list of tuples.
* **Version 2.4.6 and later:** A list of [ResultMetadata](python-connector-api.md) objects. (The
  `describe` method also returns this list.)

Each tuple and `ResultMetadata` object contains the metadata for a column (the column name, data type, etc.). You can
[access the metadata by index](python-connector-api.md) or, in 2.4.6 and later versions, by
`ResultMetadata` attribute.

The following examples demonstrate how to access the metadata from the returned tuples and `ResultMetadata` objects.

**Example: Getting the column name metadata by index (versions 2.4.5 and earlier):**

The following example uses the `description` attribute to retrieve the list of column names after executing a query. The
attribute is a list of tuples, and the
example accesses the column name from the first value in each tuple.

> ```python
> cur = conn.cursor()
> cur.execute("SELECT * FROM test_table")
> print(','.join([col[0] for col in cur.description]))
> ```

**Example: Getting the column name metadata by attribute (versions 2.4.6 and later):**

The following example uses the `description` attribute to retrieve the list of column names after executing a query. The
attribute is a list of [ResultMetaData](python-connector-api.md) objects, and the
example accesses the column name from the `name` attribute of each `ResultMetadata` object.

> ```python
> cur = conn.cursor()
> cur.execute("SELECT * FROM test_table")
> print(','.join([col.name for col in cur.description]))
> ```

**Example: Getting the column name metadata without executing the query (versions 2.4.6 and later):**

The following example uses the `describe` method to retrieve the list of column names without executing a query.
The `describe()` method returns a list of [ResultMetaData](python-connector-api.md) objects, and the
example accesses the column name from the `name` attribute of each `ResultMetadata` object.

> ```python
> cur = conn.cursor()
> result_metadata_list = cur.describe("SELECT * FROM test_table")
> print(','.join([col.name for col in result_metadata_list]))
> ```

## Handling errors

The application must handle exceptions raised from Snowflake Connector properly and decide to continue or stop running the code.

> ```python
> # Catching the syntax error
> cur = con.cursor()
> try:
>     cur.execute("SELECT * FROM testtable")
> except snowflake.connector.errors.ProgrammingError as e:
>     # default error message
>     print(e)
>     # customer error message
>     print('Error {0} ({1}): {2} ({3})'.format(e.errno, e.sqlstate, e.msg, e.sfqid))
> finally:
>     cur.close()
> ```

## Using `execute_stream` to execute SQL scripts

The `execute_stream` function enables you to run one or more SQL scripts in a stream:

> ```python
> from codecs import open
> with open(sqlfile, 'r', encoding='utf-8') as f:
>     for cur in con.execute_stream(f):
>         for ret in cur:
>             print(ret)
> ```

> **Note:**
>
> Additional configuration might be required if `sql_stream` contains comments. See [Using execute_stream to execute SQL scripts](python-connector-api.md).

## Closing the connection

As a best practice, close the connection by calling the `close` method:

> ```python
> connection.close()
> ```

This ensures the collected client metrics are submitted to the server and the session is deleted. Also, `try-finally` blocks help ensure the connection is closed even if an exception is raised in the middle:

> ```python
> # Connecting to Snowflake
> con = snowflake.connector.connect(...)
> try:
>     # Running queries
>     con.cursor().execute(...)
>     ...
> finally:
>     # Closing the connection
>     con.close()
> ```

> **Caution:**
>
> Multiple non-closed connections can exhaust your system resources and eventually cause an application crash.

## Using context manager to connect and control transactions

The Snowflake Connector for Python supports a context manager that allocates and releases resources as required. The context manager is useful for committing or rolling back transactions based on the statement status when `autocommit` is disabled.

> ```python
> # Connecting to Snowflake using the context manager
> with snowflake.connector.connect(
>   user=USER,
>   password=PASSWORD,
>   account=ACCOUNT,
>   autocommit=False,
> ) as con:
>     con.cursor().execute("INSERT INTO a VALUES(1, 'test1')")
>     con.cursor().execute("INSERT INTO a VALUES(2, 'test2')")
>     con.cursor().execute("INSERT INTO a VALUES(not numeric value, 'test3')") # fail
> ```

In the above example, when the third statement fails, the context manager rolls back the changes in the transaction and closes the connection. If all statements were successful, the context manager would commit the changes and close the connection.

An equivalent code with `try` and `except` blocks is as follows:

> ```python
> # Connecting to Snowflake using try and except blocks
> con = snowflake.connector.connect(
>   user=USER,
>   password=PASSWORD,
>   account=ACCOUNT,
>   autocommit=False)
> try:
>     con.cursor().execute("INSERT INTO a VALUES(1, 'test1')")
>     con.cursor().execute("INSERT INTO a VALUES(2, 'test2')")
>     con.cursor().execute("INSERT INTO a VALUES(not numeric value, 'test3')") # fail
>     con.commit()
> except Exception as e:
>     con.rollback()
>     raise e
> finally:
>     con.close()
> ```

## Using the VECTOR data type

Support for the [VECTOR data type](../../sql-reference/data-types-vector.md) was introduced in version 3.6.0 of the
Snowflake Python Connector. You can use the VECTOR data type with the [vector similarity functions](../../sql-reference/functions-vector.md)
to implement applications based on vector search or retrieval-augmented-generation (RAG).

The following code example shows how to use the Python Connector to create tables with VECTOR columns and call the
[VECTOR_INNER_PRODUCT](../../sql-reference/functions/vector_inner_product.md) function:

```python
import snowflake.connector

conn = ... # Set up connection
cur = conn.cursor()

# Create a table and insert some vectors
cur.execute("CREATE OR REPLACE TABLE vectors (a VECTOR(FLOAT, 3), b VECTOR(FLOAT, 3))")
values = [([1.1, 2.2, 3], [1, 1, 1]), ([1, 2.2, 3], [4, 6, 8])]
for row in values:
    cur.execute(f"""
        INSERT INTO vectors(a, b)
          SELECT {row[0]}::VECTOR(FLOAT,3), {row[1]}::VECTOR(FLOAT,3)
    """)

# Compute the pairwise inner product between columns a and b
cur.execute("SELECT VECTOR_INNER_PRODUCT(a, b) FROM vectors")
print(cur.fetchall())
```

```output
[(6.30...,), (41.2...,)]
```

The following code example shows how to use the Python Connector to call the [VECTOR_COSINE_SIMILARITY](../../sql-reference/functions/vector_cosine_similarity.md) in order to find the closest vectors to `[1,2,3]`:

```python
cur.execute(f"""
    SELECT a, VECTOR_COSINE_SIMILARITY(a, {[1,2,3]}::VECTOR(FLOAT, 3))
      AS similarity
      FROM vectors
      ORDER BY similarity DESC
      LIMIT 1;
""")
print(cur.fetchall())
```

```output
[([1.0, 2.2..., 3.0], 0.9990...)]
```

> **Note:**
>
> Variable binds are not supported for VECTOR data types.

## Logging

The Snowflake Connector for Python leverages the standard Python `logging` module to log status at regular intervals so that the application can trace its activity working behind the scenes. The
simplest way to enable logging is call `logging.basicConfig()` in the beginning of the application.

For example, to set the logging level to `INFO` and store the logs in a file named `/tmp/snowflake_python_connector.log`:

> ```python
> logging.basicConfig(
>     filename=file_name,
>     level=logging.INFO)
> ```

More comprehensive logging can be enabled by setting the logging level to `DEBUG` as follows:

> ```python
> # Logging including the timestamp, thread and the source code location
> import logging
> for logger_name in ['snowflake.connector', 'botocore', 'boto3']:
>     logger = logging.getLogger(logger_name)
>     logger.setLevel(logging.DEBUG)
>     ch = logging.FileHandler('/tmp/python_connector.log')
>     ch.setLevel(logging.DEBUG)
>     ch.setFormatter(logging.Formatter('%(asctime)s - %(threadName)s %(filename)s:%(lineno)d - %(funcName)s() - %(levelname)s - %(message)s'))
>     logger.addHandler(ch)
> ```
>
> The optional but recommended SecretDetector formatter class ensures that a certain set of known sensitive
> information is masked before being written to Snowflake Python Connector log files. To use SecretDetector, use
> code similar to the following:
>
> ```python
> # Logging including the timestamp, thread and the source code location
> import logging
> from snowflake.connector.secret_detector import SecretDetector
> for logger_name in ['snowflake.connector', 'botocore', 'boto3']:
>     logger = logging.getLogger(logger_name)
>     logger.setLevel(logging.DEBUG)
>     ch = logging.FileHandler('/tmp/python_connector.log')
>     ch.setLevel(logging.DEBUG)
>     ch.setFormatter(SecretDetector('%(asctime)s - %(threadName)s %(filename)s:%(lineno)d - %(funcName)s() - %(levelname)s - %(message)s'))
>     logger.addHandler(ch)
> ```
>
> > **Note:**
> >
> > `botocore` and `boto3` are available through the AWS (Amazon Web Services) SDK for Python.

### Logging configuration file

Alternatively, you can easily specify the log level and the directory in which to save log files in the `config.toml` configuration file. For more information about the this file, see [Connecting using the connections.toml file](python-connector-connect.md).

> **Note:**
>
> This logging configuration feature supports log levels as defined in the Python logging document.
>
> For more information about logging levels, see the Python [Basic Logging Tutorial](https://docs.python.org/3/howto/logging.html#basic-logging-tutorial).

This logging configuration file uses toml to define the `save_logs`, `level`, and `path` logging parameters, as follows:

```toml
[log]
save_logs = true
level = "INFO"
path = "<directory to store logs>"
```

where:

* `save_logs` determines whether to save logs.
* `level` specifies the logging level. If not defined, the driver defaults to `INFO`.
* `path` identifies the directory in which to save the log files. If not defined, the driver saves the logs in the default `$SNOWFLAKE_HOME/logs/` directory.

> **Note:**
>
> If your `config.toml` file does not contain a `[log]` section, log messages are not saved.

Log message from a single day are appended to the `python-connector.log` file, which is later renamed to `python-connector.log.YYYY-MM-DD`.

## Sample program

The following sample code combines many of the examples described in the previous sections into a working python
program. This example contains two parts:

* A parent class (“python_veritas_base”) contains the code for many common operations, such as connecting to the server.
* A child class (“python_connector_example”) represents the custom portions of a particular client, for example,
  querying a table.

This sample code is imported directly from one of our tests to help ensure that it is has been executed on a recent
build of the product.

Because this is taken from a test, it includes a small amount of code to set an alternative port and protocol used in
some tests. Users should not set the protocol or port number; instead, omit these and use the defaults.

This also contains some section markers (sometimes called “snippet tags”) to identify code that can be imported
independently into the documentation. Section markers typically look similar to:

```none
# -- (> ---------------------- SECTION=import_connector ---------------------
...
# -- <) ---------------------------- END_SECTION ----------------------------
```

These section markers are not required in user code.

The first part of the code sample contains the common subroutines to:

* Read command-line arguments (for example, “–warehouse MyWarehouse”) that contain connection information.
* Connect to the server.
* Create and use a warehouse, database, and schema.
* Drop the schema, database, and warehouse when you are done with them.

```python
import logging
import os
import sys

# -- (> ---------------------- SECTION=import_connector ---------------------
import snowflake.connector
# -- <) ---------------------------- END_SECTION ----------------------------

class python_veritas_base:

    """
    PURPOSE:
        This is the Base/Parent class for programs that use the Snowflake
        Connector for Python.
        This class is intended primarily for:
            * Sample programs, e.g. in the documentation.
            * Tests.
    """

    def __init__(self, p_log_file_name = None):

        """
        PURPOSE:
            This does any required initialization steps, which in this class is
            basically just turning on logging.
        """

        file_name = p_log_file_name
        if file_name is None:
            file_name = '/tmp/snowflake_python_connector.log'

        # -- (> ---------- SECTION=begin_logging -----------------------------
        logging.basicConfig(
            filename=file_name,
            level=logging.INFO)
        # -- <) ---------- END_SECTION ---------------------------------------

    # -- (> ---------------------------- SECTION=main ------------------------
    def main(self, argv):

        """
        PURPOSE:
            Most tests follow the same basic pattern in this main() method:
               * Create a connection.
               * Set up, e.g. use (or create and use) the warehouse, database,
                 and schema.
               * Run the queries (or do the other tasks, e.g. load data).
               * Clean up. In this test/demo, we drop the warehouse, database,
                 and schema. In a customer scenario, you'd typically clean up
                 temporary tables, etc., but wouldn't drop your database.
               * Close the connection.
        """

        # Read the connection parameters (e.g. user ID) from the command line
        # and environment variables, then connect to Snowflake.
        connection = self.create_connection(argv)

        # Set up anything we need (e.g. a separate schema for the test/demo).
        self.set_up(connection)

        # Do the "real work", for example, create a table, insert rows, SELECT
        # from the table, etc.
        self.do_the_real_work(connection)

        # Clean up. In this case, we drop the temporary warehouse, database, and
        # schema.
        self.clean_up(connection)

        print("\nClosing connection...")
        # -- (> ------------------- SECTION=close_connection -----------------
        connection.close()
        # -- <) ---------------------------- END_SECTION ---------------------

    # -- <) ---------------------------- END_SECTION=main --------------------

    def args_to_properties(self, args):

        """
        PURPOSE:
            Read the command-line arguments and store them in a dictionary.
            Command-line arguments should come in pairs, e.g.:
                "--user MyUser"
        INPUTS:
            The command line arguments (sys.argv).
        RETURNS:
            Returns the dictionary.
        DESIRABLE ENHANCEMENTS:
            Improve error detection and handling.
        """

        connection_parameters = {}

        i = 1
        while i < len(args) - 1:
            property_name = args[i]
            # Strip off the leading "--" from the tag, e.g. from "--user".
            property_name = property_name[2:]
            property_value = args[i + 1]
            connection_parameters[property_name] = property_value
            i += 2

        return connection_parameters

    def create_connection(self, argv):

        """
        PURPOSE:
            This gets account identifier and login information from the
            environment variables and command-line parameters, connects to the
            server, and returns the connection object.
        INPUTS:
            argv: This is usually sys.argv, which contains the command-line
                  parameters. It could be an equivalent substitute if you get
                  the parameter information from another source.
        RETURNS:
            A connection.
        """

        # Get account identifier and login information from environment variables and command-line parameters.
        # For information about account identifiers, see
        # https://docs.snowflake.com/en/user-guide/admin-account-identifier.html .
        # -- (> ----------------------- SECTION=set_login_info ---------------

        # Get the password from an appropriate environment variable, if
        # available.
        PASSWORD = os.getenv('SNOWSQL_PWD')

        # Get the other login info etc. from the command line.
        if len(argv) < 11:
            msg = "ERROR: Please pass the following command-line parameters:\n"
            msg += "--warehouse <warehouse> --database <db> --schema <schema> "
            msg += "--user <user> --account <account_identifier> "
            print(msg)
            sys.exit(-1)
        else:
            connection_parameters = self.args_to_properties(argv)
            USER = connection_parameters["user"]
            ACCOUNT = connection_parameters["account"]
            WAREHOUSE = connection_parameters["warehouse"]
            DATABASE = connection_parameters["database"]
            SCHEMA = connection_parameters["schema"]
            # Optional: for internal testing only.
            try:
                PORT = connection_parameters["port"]
            except:
                PORT = ""
            try:
                PROTOCOL = connection_parameters["protocol"]
            except:
                PROTOCOL = ""

        # If the password is set by both command line and env var, the
        # command-line value takes precedence over (is written over) the
        # env var value.

        # If the password wasn't set either in the environment var or on
        # the command line...
        if PASSWORD is None or PASSWORD == '':
            print("ERROR: Set password, e.g. with SNOWSQL_PWD environment variable")
            sys.exit(-2)
        # -- <) ---------------------------- END_SECTION ---------------------

        # Optional diagnostic:
        #print("USER:", USER)
        #print("ACCOUNT:", ACCOUNT)
        #print("WAREHOUSE:", WAREHOUSE)
        #print("DATABASE:", DATABASE)
        #print("SCHEMA:", SCHEMA)
        #print("PASSWORD:", PASSWORD)
        #print("PROTOCOL:" "'" + PROTOCOL + "'")
        #print("PORT:" + "'" + PORT + "'")

        print("Connecting...")
        # If the PORT is set but the protocol is not, we ignore the PORT (bug!!).
        if PROTOCOL is None or PROTOCOL == "" or PORT is None or PORT == "":
            # -- (> ------------------- SECTION=connect_to_snowflake ---------
            conn = snowflake.connector.connect(
                user=USER,
                password=PASSWORD,
                account=ACCOUNT,
                warehouse=WAREHOUSE,
                database=DATABASE,
                schema=SCHEMA
                )
            # -- <) ---------------------------- END_SECTION -----------------
        else:

            conn = snowflake.connector.connect(
                user=USER,
                password=PASSWORD,
                account=ACCOUNT,
                warehouse=WAREHOUSE,
                database=DATABASE,
                schema=SCHEMA,
                # Optional: for internal testing only.
                protocol=PROTOCOL,
                port=PORT
                )

        return conn

    def set_up(self, connection):

        """
        PURPOSE:
            Set up to run a test. You can override this method with one
            appropriate to your test/demo.
        """

        # Create a temporary warehouse, database, and schema.
        self.create_warehouse_database_and_schema(connection)

    def do_the_real_work(self, conn):

        """
        PURPOSE:
            Your sub-class should override this to include the code required for
            your documentation sample or your test case.
            This default method does a very simple self-test that shows that the
            connection was successful.
        """

        # Create a cursor for this connection.
        cursor1 = conn.cursor()
        # This is an example of an SQL statement we might want to run.
        command = "SELECT PI()"
        # Run the statement.
        cursor1.execute(command)
        # Get the results (should be only one):
        for row in cursor1:
            print(row[0])
        # Close this cursor.
        cursor1.close()

    def clean_up(self, connection):

        """
        PURPOSE:
            Clean up after a test. You can override this method with one
            appropriate to your test/demo.
        """

        # Create a temporary warehouse, database, and schema.
        self.drop_warehouse_database_and_schema(connection)

    def create_warehouse_database_and_schema(self, conn):

        """
        PURPOSE:
            Create the temporary schema, database, and warehouse that we use
            for most tests/demos.
        """

        # Create a database, schema, and warehouse if they don't already exist.
        print("\nCreating warehouse, database, schema...")
        # -- (> ------------- SECTION=create_warehouse_database_schema -------
        conn.cursor().execute("CREATE WAREHOUSE IF NOT EXISTS tiny_warehouse_mg")
        conn.cursor().execute("CREATE DATABASE IF NOT EXISTS testdb_mg")
        conn.cursor().execute("USE DATABASE testdb_mg")
        conn.cursor().execute("CREATE SCHEMA IF NOT EXISTS testschema_mg")
        # -- <) ---------------------------- END_SECTION ---------------------

        # -- (> --------------- SECTION=use_warehouse_database_schema --------
        conn.cursor().execute("USE WAREHOUSE tiny_warehouse_mg")
        conn.cursor().execute("USE DATABASE testdb_mg")
        conn.cursor().execute("USE SCHEMA testdb_mg.testschema_mg")
        # -- <) ---------------------------- END_SECTION ---------------------

    def drop_warehouse_database_and_schema(self, conn):

        """
        PURPOSE:
            Drop the temporary schema, database, and warehouse that we create
            for most tests/demos.
        """

        # -- (> ------------- SECTION=drop_warehouse_database_schema ---------
        conn.cursor().execute("DROP SCHEMA IF EXISTS testschema_mg")
        conn.cursor().execute("DROP DATABASE IF EXISTS testdb_mg")
        conn.cursor().execute("DROP WAREHOUSE IF EXISTS tiny_warehouse_mg")
        # -- <) ---------------------------- END_SECTION ---------------------

# ----------------------------------------------------------------------------

if __name__ == '__main__':
    pvb = python_veritas_base()
    pvb.main(sys.argv)
```

The second part of the code sample creates a table, inserts rows into it, etc.:

```python
import sys

# -- (> ---------------------- SECTION=import_connector ---------------------
import snowflake.connector
# -- <) ---------------------------- END_SECTION ----------------------------

# Import the base class that contains methods used in many tests and code
# examples.
from python_veritas_base import python_veritas_base

class python_connector_example (python_veritas_base):

  """
  PURPOSE:
      This is a simple example program that shows how to use the Snowflake
      Python Connector to create and query a table.
  """

  def __init__(self):
    pass

  def do_the_real_work(self, conn):

    """
    INPUTS:
        conn is a Connection object returned from snowflake.connector.connect().
    """

    print("\nCreating table test_table...")
    # -- (> ----------------------- SECTION=create_table ---------------------
    conn.cursor().execute(
        "CREATE OR REPLACE TABLE "
        "test_table(col1 integer, col2 string)")

    conn.cursor().execute(
        "INSERT INTO test_table(col1, col2) VALUES " +
        "    (123, 'test string1'), " +
        "    (456, 'test string2')")
    # -- <) ---------------------------- END_SECTION -------------------------

    print("\nSelecting from test_table...")
    # -- (> ----------------------- SECTION=querying_data --------------------
    cur = conn.cursor()
    try:
        cur.execute("SELECT col1, col2 FROM test_table ORDER BY col1")
        for (col1, col2) in cur:
            print('{0}, {1}'.format(col1, col2))
    finally:
        cur.close()
    # -- <) ---------------------------- END_SECTION -------------------------

# ============================================================================

if __name__ == '__main__':

    test_case = python_connector_example()
    test_case.main(sys.argv)
```

To run this sample, do the following:

> 1. Copy the first piece of code to a file named “python_veritas_base.py”.
> 2. Copy the second piece of code to a file named “python_connector_example.py”
> 3. Set the SNOWSQL_PWD environment variable to your password, for example:
>
>    ```none
>    export SNOWSQL_PWD='MyPassword'
>    ```
> 4. Execute the program using a command line similar to the following (replace the user and account information
>    with your own user and account information, of course).
>
>    > **Warning:**
>    >
>    > This deletes the warehouse, database, and schema at the end of the program! Do not use
>    > the name of an existing database because you will lose it!
>
>    ```none
>    python3 python_connector_example.py --warehouse <unique_warehouse_name> --database <new_warehouse_zzz_test> --schema <new_schema_zzz_test> --account myorganization-myaccount --user MyUserName
>    ```

Here is the output:

```python
Connecting...

Creating warehouse, database, schema...

Creating table test_table...

Selecting from test_table...
123, test string1
456, test string2

Closing connection...
```

Here is a longer example:

> > **Note:**
> >
> > In the section where you set your account and login information, make sure to replace the variables as needed to match your Snowflake login information (name, password, etc.).

This example uses the format() function to compose the statement. If your environment has a risk of SQL injection
attacks, you might prefer to bind values rather than use format().

```python
#!/usr/bin/env python
#
# Snowflake Connector for Python Sample Program
#

# Logging
import logging
logging.basicConfig(
    filename='/tmp/snowflake_python_connector.log',
    level=logging.INFO)

import snowflake.connector

# Set ACCOUNT to your account identifier.
# See https://docs.snowflake.com/en/user-guide/gen-conn-config.
ACCOUNT = '<my_organization>-<my_account>'
# Set your login information.
USER = '<login_name>'
PASSWORD = '<password>'

import os

# Only required if you copy data from your S3 bucket
AWS_ACCESS_KEY_ID = os.getenv('AWS_ACCESS_KEY_ID')
AWS_SECRET_ACCESS_KEY = os.getenv('AWS_SECRET_ACCESS_KEY')

# Connecting to Snowflake
con = snowflake.connector.connect(
  user=USER,
  password=PASSWORD,
  account=ACCOUNT,
)

# Creating a database, schema, and warehouse if none exists
con.cursor().execute("CREATE WAREHOUSE IF NOT EXISTS tiny_warehouse")
con.cursor().execute("CREATE DATABASE IF NOT EXISTS testdb")
con.cursor().execute("USE DATABASE testdb")
con.cursor().execute("CREATE SCHEMA IF NOT EXISTS testschema")

# Using the database, schema and warehouse
con.cursor().execute("USE WAREHOUSE tiny_warehouse")
con.cursor().execute("USE SCHEMA testdb.testschema")

# Creating a table and inserting data
con.cursor().execute(
    "CREATE OR REPLACE TABLE "
    "testtable(col1 integer, col2 string)")
con.cursor().execute(
    "INSERT INTO testtable(col1, col2) "
    "VALUES(123, 'test string1'),(456, 'test string2')")

# Copying data from internal stage (for testtable table)
con.cursor().execute("PUT file:///tmp/data0/file* @%testtable")
con.cursor().execute("COPY INTO testtable")

# Copying data from external stage (S3 bucket -
# replace <s3_bucket> with the name of your bucket)
con.cursor().execute("""
COPY INTO testtable FROM s3://<s3_bucket>/data/
     STORAGE_INTEGRATION = myint
     FILE_FORMAT=(field_delimiter=',')
""".format(
    aws_access_key_id=AWS_ACCESS_KEY_ID,
    aws_secret_access_key=AWS_SECRET_ACCESS_KEY))

# Querying data
cur = con.cursor()
try:
    cur.execute("SELECT col1, col2 FROM testtable")
    for (col1, col2) in cur:
        print('{0}, {1}'.format(col1, col2))
finally:
    cur.close()

# Binding data
con.cursor().execute(
    "INSERT INTO testtable(col1, col2) "
    "VALUES(%(col1)s, %(col2)s)", {
        'col1': 789,
        'col2': 'test string3',
        })

# Retrieving column names
cur = con.cursor()
cur.execute("SELECT * FROM testtable")
print(','.join([col[0] for col in cur.description]))

# Catching syntax errors
cur = con.cursor()
try:
    cur.execute("SELECT * FROM testtable")
except snowflake.connector.errors.ProgrammingError as e:
    # default error message
    print(e)
    # user error message
    print('Error {0} ({1}): {2} ({3})'.format(e.errno, e.sqlstate, e.msg, e.sfqid))
finally:
    cur.close()

# Retrieving the Snowflake query ID
cur = con.cursor()
cur.execute("SELECT * FROM testtable")
print(cur.sfqid)

# Closing the connection
con.close()
```

---
title: Using the Snowflake SQLAlchemy toolkit with the Python Connector
source: https://docs.snowflake.com/en/developer-guide/python-connector/sqlalchemy.md
section: Developer Guide
---

# Using the Snowflake SQLAlchemy toolkit with the Python Connector

Snowflake SQLAlchemy runs on the top of the Snowflake Connector for Python as a dialect to bridge a Snowflake database and SQLAlchemy applications.

For more information, see the [dialect](http://docs.sqlalchemy.org/en/latest/dialects/) documentation.

## Prerequisites

### Snowflake Connector for Python

The only requirement for Snowflake SQLAlchemy is the Snowflake Connector for Python; however, the connector does not need to be installed because installing Snowflake SQLAlchemy automatically installs
the connector.

### Data analytics and web application frameworks (optional)

Snowflake SQLAlchemy can be used with pandas, Jupyter, and Pyramid, which provide higher levels of application
frameworks for data analytics and web applications. However, building a working environment from scratch is not a trivial task, particularly for novice users. Installing the frameworks requires
C compilers and tools, and choosing the right tools and versions is a hurdle that might deter users from using Python applications.

An easier way to build an environment is through Anaconda, which provides a complete, precompiled technology stack for all users, including non-Python experts
such as data analysts and students. For Anaconda installation instructions, see the Anaconda install documentation. The Snowflake SQLAlchemy package can
then be installed on top of Anaconda using `pip`.

For more information, see the following documentation:

* [pandas](http://pandas.pydata.org/)
* [Jupyter](http://jupyter.org/)
* [Pyramid](http://www.pylonsproject.org/)
* [Anaconda](https://docs.continuum.io/anaconda/)
* [Anaconda install](https://docs.continuum.io/anaconda/install)
* [pip](https://pypi.org/project/pip/)

## Installing Snowflake SQLAlchemy

The Snowflake SQLAlchemy package can be installed from the public PyPI repository using `pip`:

> ```bash
> pip install --upgrade snowflake-sqlalchemy
> ```

`pip` automatically installs all required modules, including the Snowflake Connector for Python.

Note that the developer notes are hosted with the source code on [GitHub](https://github.com/snowflakedb/snowflake-sqlalchemy).

## Verifying your installation

1. Create a file (e.g. `validate.py`) that contains the following Python sample code,
   which connects to Snowflake and displays the Snowflake version:

   > ```python
   > #!/usr/bin/env python
   > from sqlalchemy import create_engine
   >
   > engine = create_engine(
   >     'snowflake://{user}:{password}@{account_identifier}/'.format(
   >         user='<user_login_name>',
   >         password='<password>',
   >         account_identifier='<account_identifier>',
   >     )
   > )
   > try:
   >     connection = engine.connect()
   >     results = connection.execute('select current_version()').fetchone()
   >     print(results[0])
   > finally:
   >     connection.close()
   >     engine.dispose()
   > ```
2. Replace `<user_login_name>`, `<password>`, and `<account_identifier>` with the appropriate values for your Snowflake account and user. For more details, see
   Connection Parameters (in this topic).
3. Execute the sample code. For example, if you created a file named `validate.py`:

   > ```python
   > python validate.py
   > ```

The Snowflake version (e.g. `1.6.0`) should be displayed.

## Snowflake-specific parameters and behavior

As much as possible, Snowflake SQLAlchemy provides compatible functionality for SQLAlchemy applications.

For information on using SQLAlchemy, see the [SQLAlchemy](http://docs.sqlalchemy.org/en/latest/) documentation.

However, Snowflake SQLAlchemy also provides Snowflake-specific parameters and behavior, which are described in the following sections.

### Connection parameters

#### Required parameters

Snowflake SQLAlchemy uses the following connection string syntax to connect to Snowflake and initiate a session:

```python
'snowflake://<user_login_name>:<password>@<account_identifier>'
```

Where:

* `<user_login_name>` is the login name for your Snowflake user.
* `<password>` is the password for your Snowflake user.
* `<account_identifier>` is your account identifier. See [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../user-guide/gen-conn-config.md).

  > **Note:**
  >
  > Do not include the `snowflakecomputing.com` domain name as part of your account identifier. Snowflake
  > automatically appends the domain name to your account identifier to create the required connection.

#### Additional connection parameters

You can optionally include the following additional information at the end of the connection string (after `<account_name>`):

```python
'snowflake://<user_login_name>:<password>@<account_identifier>/<database_name>/<schema_name>?warehouse=<warehouse_name>&role=<role_name>'
```

Where:

* `<database_name>` and `<schema_name>` are the initial database and schema for the Snowflake session, separated by forward slashes (`/`).
* `warehouse=<warehouse_name>` and `role=<role_name>'` are the initial warehouse and role for the session, specified as parameter strings, separated by question marks (`?`).

> **Note:**
>
> After login, the initial database, schema, warehouse, and role specified in the connection string can always be changed for the session.

#### Proxy server configuration

Proxy server parameters are not supported. Instead, use the supported environment variables to configure a proxy server. For information, see [Using a proxy server](python-connector-connect.md).

#### Connection string examples

The following example calls the `create_engine` method with the user name `testuser1`, password `0123456`, account
identifier `myorganization-myaccount`, database `testdb`, schema `public`, warehouse `testwh`, and role `myrole`:

> ```python
> from sqlalchemy import create_engine
> engine = create_engine(
>     'snowflake://testuser1:0123456@myorganization-myaccount/testdb/public?warehouse=testwh&role=myrole'
> )
> ```

For convenience, you can use the `snowflake.sqlalchemy.URL` method to construct the connection string and connect to the database. The following example constructs the same connection string
from the previous example:

> ```python
> from snowflake.sqlalchemy import URL
> from sqlalchemy import create_engine
>
> engine = create_engine(URL(
>     account = 'myorganization-myaccount',
>     user = 'testuser1',
>     password = '0123456',
>     database = 'testdb',
>     schema = 'public',
>     warehouse = 'testwh',
>     role='myrole',
> ))
> ```

### Opening and closing a connection

Open a connection by executing `engine.connect()`; avoid using `engine.execute()`.

> ```python
> # Avoid this.
> engine = create_engine(...)
> engine.execute(<SQL>)
> engine.dispose()
>
> # Do this.
> engine = create_engine(...)
> connection = engine.connect()
> try:
>     connection.execute(<SQL>)
> finally:
>     connection.close()
>     engine.dispose()
> ```

> **Note:**
>
> Make certain to close the connection by executing `connection.close()` before `engine.dispose()`; otherwise, the Python Garbage collector removes the resources required to communicate
> with Snowflake, preventing the Python connector from closing the session properly.

If you plan to use explicit transactions, you must disable the AUTOCOMMIT execution option in SQLAlchemy.

For more information, see [AUTOCOMMIT execution option in SQLAlchemy](https://docs.sqlalchemy.org/en/14/core/connections.html#library-level-e-g-emulated-autocommit)..

By default, SQLAlchemy enables this option. When this option is enabled, INSERT, UPDATE, and DELETE statements are committed
automatically upon execution, even when these statements are run within an explicit transaction.

To disable AUTOCOMMIT, pass `autocommit=False` to the `Connection.execution_options()` method. For example:

```python
# Disable AUTOCOMMIT if you need to use an explicit transaction.
with engine.connect().execution_options(autocommit=False) as connection:

  try:
    connection.execute("BEGIN")
    connection.execute("INSERT INTO test_table VALUES (88888, 'X', 434354)")
    connection.execute("INSERT INTO test_table VALUES (99999, 'Y', 453654654)")
    connection.execute("COMMIT")
  except Exception as e:
    connection.execute("ROLLBACK")
  finally:
    connection.close()

engine.dispose()
```

### Auto-increment behavior

Auto-incrementing a value requires the `Sequence` object. Include the `Sequence` object in the primary key column to automatically increment the value as each new record is inserted.
For example:

> ```python
> t = Table('mytable', metadata,
>     Column('id', Integer, Sequence('id_seq'), primary_key=True),
>     Column(...), ...
> )
> ```

### Object name case handling

Snowflake stores all case-insensitive object names in uppercase text. In contrast, SQLAlchemy considers all lowercase object names to be case-insensitive. Snowflake SQLAlchemy converts the object
name case during schema-level communication (i.e. during table and index reflection). If you use uppercase object names, SQLAlchemy assumes they are case-sensitive and encloses the names with quotes.
This behavior will cause mismatches against data dictionary data received from Snowflake, so unless identifier names have been truly created as case sensitive using quotes (e.g. `"TestDb"`),
all lowercase names should be used on the SQLAlchemy side.

### Index support

Indexes are supported only for Hybrid Tables in Snowflake SqlAlchemy. For more details on limitations and use cases, refer to
[the usage notes for CREATE INDEX](../../sql-reference/sql/create-index.md). You can create an index using the following methods:

* Single column index

  You can create a single column index by setting the `index=True` parameter on the column or by explicitly defining an `Index` object.

  ```python
  hybrid_test_table_1 = HybridTable(
    "table_name",
    metadata,
    Column("column1", Integer, primary_key=True),
    Column("column2", String, index=True),
    Index("index_1","column1", "column2")
  )

  metadata.create_all(engine_testaccount)
  ```
* Multi-column index

  For multi-column indexes, you define the Index object specifying the columns that should be indexed.

  ```python
  hybrid_test_table_1 = HybridTable(
    "table_name",
    metadata,
    Column("column1", Integer, primary_key=True),
    Column("column2", String),
    Index("index_1","column1", "column2")
  )

  metadata.create_all(engine_testaccount)
  ```

### Numpy data type support

Snowflake SQLAlchemy supports binding and fetching `NumPy` data types. Binding is always supported. To enable fetching `NumPy` data types, add `numpy=True` to the connection
parameters.

The following `NumPy` data types are supported:

* `numpy.int64`
* `numpy.float64`
* `numpy.datetime64`

The following example shows the round trip of `numpy.datetime64` data:

> ```python
> import numpy as np
> import pandas as pd
> engine = create_engine(URL(
>     account = 'myorganization-myaccount',
>     user = 'testuser1',
>     password = 'pass',
>     database = 'db',
>     schema = 'public',
>     warehouse = 'testwh',
>     role='myrole',
>     numpy=True,
> ))
>
> specific_date = np.datetime64('2016-03-04T12:03:05.123456789Z')
>
> with engine.connect() as connection:
>     connection.exec_sql_query(
>         "CREATE OR REPLACE TABLE ts_tbl(c1 TIMESTAMP_NTZ)")
>     connection.exec_sql_query(
>         "INSERT INTO ts_tbl(c1) values(%s)", (specific_date,)
>     )
>     df = pd.read_sql_query("SELECT * FROM ts_tbl", connection)
>     assert df.c1.values[0] == specific_date
> ```

### Cache column metadata

SQLAlchemy provides the runtime inspection API to get the runtime information about the various objects. One of the common use case
is get all tables and their column metadata in a schema in order to construct a schema catalog.

For more information, see [runtime inspection API](http://docs.sqlalchemy.org/en/latest/core/inspection.html). For an example managing database schema migrations with SQLAlchemy, [alembic](http://alembic.zzzcomputing.com/) .

A pseudo code flow is as follows:

> ```python
> inspector = inspect(engine)
> schema = inspector.default_schema_name
> for table_name in inspector.get_table_names(schema):
>     column_metadata = inspector.get_columns(table_name, schema)
>     primary_keys = inspector.get_primary_keys(table_name, schema)
>     foreign_keys = inspector.get_foreign_keys(table_name, schema)
>     ...
> ```

In this flow, a potential problem is it may take quite a while as queries run on each table. The results are cached but getting column metadata is expensive.

To mitigate the problem, Snowflake SQLAlchemy takes a flag `cache_column_metadata=True` such that all of column metadata for all tables are cached when `get_table_names` is called and
the rest of `get_columns`, `get_primary_keys` and `get_foreign_keys` can take advantage of the cache.

> ```python
> engine = create_engine(URL(
>     account = 'myorganization-myaccount',
>     user = 'testuser1',
>     password = 'pass',
>     database = 'db',
>     schema = 'public',
>     warehouse = 'testwh',
>     role='myrole',
>     cache_column_metadata=True,
> ))
> ```

> **Note:**
>
> Memory usage will go up higher as all of column metadata are cached associated with `Inspector` object. Use the flag only if you need to get all of column metadata.

### VARIANT, ARRAY, and OBJECT support

Snowflake SQLAlchemy supports fetching `VARIANT`, `ARRAY` and `OBJECT` data types. All types are converted into `str` in Python so that you can convert them to native data
types using `json.loads`.

This example shows how to create a table including `VARIANT`, `ARRAY`, and `OBJECT` data type columns:

> ```python
> from snowflake.sqlalchemy import (VARIANT, ARRAY, OBJECT)
> ...
> t = Table('my_semi_structured_datatype_table', metadata,
>     Column('va', VARIANT),
>     Column('ob', OBJECT),
>     Column('ar', ARRAY))
> metadata.create_all(engine)
> ```

In order to retrieve `VARIANT`, `ARRAY`, and `OBJECT` data type columns and convert them to the native Python data types, fetch data and call the `json.loads` method as follows:

> ```python
> import json
> connection = engine.connect()
> results = connection.execute(select([t]))
> row = results.fetchone()
> data_variant = json.loads(row[0])
> data_object  = json.loads(row[1])
> data_array   = json.loads(row[2])
> ```

### Structured data types support

This module defines custom SQLAlchemy types for Snowflake structured data, specifically for Iceberg tables. The MAP, OBJECT, and ARRAY types allow you to store complex data structures in your SQLAlchemy models. For detailed information, refer to the Snowflake [Structured data types](../../sql-reference/data-types-structured.md) documentation.

#### MAP

The `MAP` type represents a collection of key-value pairs, where each key and value can have different types, as shown:

* **Key Type:** The type of the key, such as `TEXT` or `NUMBER`).
* **Value Type:** The type of the value, such as `TEXT` or `NUMBER`).
* **Not Null:** Whether NULL values are allowed (default is `False`).

Usage example:

```python
IcebergTable(
  table_name,
  metadata,
  Column("id", Integer, primary_key=True),
  Column("map_col", MAP(NUMBER(10, 0), TEXT(134217728))),
  external_volume="external_volume",
  base_location="base_location",
)
```

#### OBJECT

The `OBJECT` type represents a semi-structured object with named fields. Each field can have a specific type, and you can also specify whether each field is nullable.

* **Items Types:** A dictionary of field names and their types. The type can optionally include a nullable flag (`True` for not nullable or `False` for nullable [default]).

Usage example:

```python
IcebergTable(
    table_name,
    metadata,
    Column("id", Integer, primary_key=True),
    Column(
        "object_col",
        OBJECT(key1=(TEXT(134217728), False), key2=(NUMBER(10, 0), False)),
        OBJECT(key1=TEXT(1134217728), key2=NUMBER(10, 0)), # Without nullable flag
    ),
    external_volume="external_volume",
    base_location="base_location",
)
```

#### ARRAY

The `ARRAY` type represents an ordered list of values, where each element has the same type. The type of the elements is defined when the array is created.

* **Value Type:** The type of the elements in the array, such as `TEXT` or `NUMBER`).
* **Not Null:** Whether `NULL` values are allowed (default is `False`).

Usage example:

```python
IcebergTable(
    table_name,
    metadata,
    Column("id", Integer, primary_key=True),
    Column("array_col", ARRAY(TEXT(134217728))),
    external_volume="external_volume",
    base_location="base_location",
)
```

### CLUSTER BY support

Snowflake SQLAlchemy supports the `CLUSTER BY` parameter for tables. For information about the parameter, see [CREATE TABLE](../../sql-reference/sql/create-table.md).

This example shows how to create a table with two columns, `id` and `name`, as the clustering key:

> ```python
> t = Table('myuser', metadata,
>     Column('id', Integer, primary_key=True),
>     Column('name', String),
>     snowflake_clusterby=['id', 'name'], ...
> )
> metadata.create_all(engine)
> ```

### Alembic support

Alembic is a database migration tool on top of `SQLAlchemy`. Snowflake SQLAlchemy works by adding the following code to `alembic/env.py` so that Alembic can recognize Snowflake SQLAlchemy.

> ```python
> from alembic.ddl.impl import DefaultImpl
>
> class SnowflakeImpl(DefaultImpl):
>     __dialect__ = 'snowflake'
> ```

See [Alembic Documentation](http://alembic.zzzcomputing.com/) for general usage.

### Key pair authentication support

Snowflake SQLAlchemy supports key pair authentication by leveraging the functionality of Snowflake Connector for Python.
See [Using key-pair authentication and key-pair rotation](python-connector-connect.md) for steps to create the private and public keys.

The private key parameter is passed through `connect_args` as follows:

> ```python
> ...
> from snowflake.sqlalchemy import URL
> from sqlalchemy import create_engine
>
> from cryptography.hazmat.backends import default_backend
> from cryptography.hazmat.primitives import serialization
>
> with open("rsa_key.p8", "rb") as key:
>     p_key= serialization.load_pem_private_key(
>         key.read(),
>         password=os.environ['PRIVATE_KEY_PASSPHRASE'].encode(),
>         backend=default_backend()
>     )
>
> pkb = p_key.private_bytes(
>     encoding=serialization.Encoding.DER,
>     format=serialization.PrivateFormat.PKCS8,
>     encryption_algorithm=serialization.NoEncryption())
>
> engine = create_engine(URL(
>     account='abc123',
>     user='testuser1',
>     ),
>     connect_args={
>         'private_key': pkb,
>         },
>     )
> ```

Where `PRIVATE_KEY_PASSPHRASE` is a passphrase to decrypt the private key file, `rsa_key.p8`.

The `snowflake.sqlalchemy.URL` method does not support private key parameters.

### Merge command support

Snowflake SQLAlchemy supports performing an upsert with its `MergeInto` custom expression. See [MERGE](../../sql-reference/sql/merge.md) for full documentation.

Use it as follows:

> ```python
> from sqlalchemy.orm import sessionmaker
> from sqlalchemy import MetaData, create_engine
> from snowflake.sqlalchemy import MergeInto
>
> engine = create_engine(db.url, echo=False)
> session = sessionmaker(bind=engine)()
> connection = engine.connect()
>
> meta = MetaData()
> meta.reflect(bind=session.bind)
> t1 = meta.tables['t1']
> t2 = meta.tables['t2']
>
> merge = MergeInto(target=t1, source=t2, on=t1.c.t1key == t2.c.t2key)
> merge.when_matched_then_delete().where(t2.c.marked == 1)
> merge.when_matched_then_update().where(t2.c.isnewstatus == 1).values(val = t2.c.newval, status=t2.c.newstatus)
> merge.when_matched_then_update().values(val=t2.c.newval)
> merge.when_not_matched_then_insert().values(val=t2.c.newval, status=t2.c.newstatus)
> connection.execute(merge)
> ```

### CopyIntoStorage support

Snowflake SQLAlchemy supports saving tables and query results into different Snowflake stages, Azure Containers, and AWS buckets with
its custom `CopyIntoStorage` expression. See [COPY INTO <location>](../../sql-reference/sql/copy-into-location.md) for full documentation.

Use it as follows:

> ```python
> from sqlalchemy.orm import sessionmaker
> from sqlalchemy import MetaData, create_engine
> from snowflake.sqlalchemy import CopyIntoStorage, AWSBucket, CSVFormatter
>
> engine = create_engine(db.url, echo=False)
> session = sessionmaker(bind=engine)()
> connection = engine.connect()
>
> meta = MetaData()
> meta.reflect(bind=session.bind)
> users = meta.tables['users']
>
> copy_into = CopyIntoStorage(from_=users,
>                             into=AWSBucket.from_uri('s3://my_private_backup').encryption_aws_sse_kms('1234abcd-12ab-34cd-56ef-1234567890ab'),
>                             formatter=CSVFormatter().null_if(['null', 'Null']))
> connection.execute(copy_into)
> ```

### Iceberg Table with Snowflake Catalog support

Snowflake SQLAlchemy supports Iceberg Tables with the Snowflake Catalog, along with various related parameters. For detailed information about Iceberg Tables, refer to the Snowflake [CREATE ICEBERG](../../sql-reference/sql/create-iceberg-table-snowflake.md) documentation.

To create an Iceberg Table using Snowflake SQLAlchemy, you can define the table using the SQLAlchemy Core syntax as follows:

```python
table = IcebergTable(
        "myuser",
        metadata,
        Column("id", Integer, primary_key=True),
        Column("name", String),
        external_volume=external_volume_name,
        base_location="my_iceberg_table",
  as_query="SELECT * FROM table"
    )
```

Alternatively, you can define the table using a declarative approach:

```python
class MyUser(Base):
    __tablename__ = "myuser"

    @classmethod
    def __table_cls__(cls, name, metadata, *arg, **kw):
        return IcebergTable(name, metadata, *arg, **kw)

    __table_args__ = {
        "external_volume": "my_external_volume",
        "base_location": "my_iceberg_table",
  "as_query": "SELECT * FROM table",
    }

    id = Column(Integer, primary_key=True)
    name = Column(String)
```

### Hybrid Table support

Snowflake SQLAlchemy supports Hybrid Tables with indexes. For detailed information refer to the Snowflake [CREATE HYBRID TABLE](../../sql-reference/sql/create-hybrid-table.md) documentation.

To create a Hybrid Table and add an index, you can use the SQLAlchemy Core syntax as follows:

```python
table = HybridTable(
    "myuser",
    metadata,
    Column("id", Integer, primary_key=True),
    Column("name", String),
    Index("idx_name", "name")
```

Alternatively, you can define the table using the declarative approach:

```python
class MyUser(Base):
    __tablename__ = "myuser"

    @classmethod
    def __table_cls__(cls, name, metadata, *arg, **kw):
        return HybridTable(name, metadata, *arg, **kw)

    __table_args__ = (
        Index("idx_name", "name"),
    )

    id = Column(Integer, primary_key=True)
    name = Column(String)
```

### Dynamic Tables support

Snowflake SQLAlchemy supports Dynamic Tables. For detailed information refer to the Snowflake [CREATE DYNAMIC TABLE](../../sql-reference/sql/create-dynamic-table.md) documentation.

To create a Dynamic Table, you can use the SQLAlchemy Core syntax as follows:

```python
 dynamic_test_table_1 = DynamicTable(
       "dynamic_MyUser",
       metadata,
       Column("id", Integer),
       Column("name", String),
       target_lag=(1, TimeUnit.HOURS), # Additionally you can use SnowflakeKeyword.DOWNSTREAM
       warehouse='test_wh',
refresh_mode=SnowflakeKeyword.FULL
       as_query="SELECT id, name from MyUser;"
   )
```

Additionally you can define a table without columns using SqlAlchemy select() construct:

```python
 dynamic_test_table_1 = DynamicTable(
       "dynamic_MyUser",
       metadata,
       target_lag=(1, TimeUnit.HOURS),
       warehouse='test_wh',
refresh_mode=SnowflakeKeyword.FULL
       as_query=select(MyUser.id, MyUser.name)
   )
```

> **Note:**
>
> * Defining a primary key in a Dynamic Table is not supported, meaning declarative tables don’t support Dynamic Tables.
> * When using the `as_query` parameter with a string, you must explicitly define the columns. However, if you use the SQLAlchemy `select()` construct, you don’t need to explicitly define the columns.
> * Direct data insertion into Dynamic Tables is not supported.

---
title: Using third-party packages
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-packages.md
section: Developer Guide
---

# Using third-party packages

Stages can be used to import third-party packages. You can also specify Anaconda packages to install when you create Python UDFs.

## Artifact Repository overview

With Artifact Repository, you can directly use Python packages from the Python Package Index ([PyPI](https://pypi.org/)) within Snowpark Python user-defined functions (UDFs) and stored procedures so that building and scaling Python-powered applications in Snowflake is easier.

### Get started

Use Snowflake’s default Artifact Repository (`snowflake.snowpark.pypi_shared_repository`) to connect and install PyPI packages within Snowpark UDFs and procedures.

Before you use this repository, the account administrator (a user who has been granted the ACCOUNTADMIN role) must grant the SNOWFLAKE.PYPI_REPOSITORY_USER database role to your role:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.PYPI_REPOSITORY_USER TO ROLE some_user_role;
```

The account administrator may also grant this database role to all users in the account:

```sqlexample
GRANT DATABASE ROLE SNOWFLAKE.PYPI_REPOSITORY_USER TO ROLE PUBLIC;
```

`SNOWFLAKE.PYPI_REPOSITORY_USER` is the required database role for any role that uses the `snowflake.snowpark.pypi_shared_repository`, including execution of UDFs/SPs that reference `snowflake.snowpark.pypi_shared_repository`.

With this role, you can install the package from the repository. When you create the UDF, you set the `ARTIFACT_REPOSITORY` parameter to the artifact repository name.
You also set the `PACKAGES` parameter to the list of the names of the packages that will come from artifact repository. In the following example, because the artifact repository is configured with PyPI, the package `scikit-learn` is sourced from PyPI:

```sqlexample-python
CREATE OR REPLACE FUNCTION sklearn_udf()
  RETURNS FLOAT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  ARTIFACT_REPOSITORY = snowflake.snowpark.pypi_shared_repository
  PACKAGES = ('scikit-learn')
  HANDLER = 'udf'
  AS
$$
from sklearn.datasets import load_iris
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier

def udf():
  X, y = load_iris(return_X_y=True)
  X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.25, random_state=42)

  model = RandomForestClassifier()
  model.fit(X_train, y_train)
  return model.score(X_test, y_test)
$$;

SELECT sklearn_udf();
```

> **Note:**
>
> To specify a package version, add it as shown:
>
> ```python
> PACKAGES = ('scikit-learn==1.5')
> ```

### Packages built only for x86

If a package is built only for x86, choose one of the warehouses that uses x86 CPU architecture — `MEMORY_1X_x86` or `MEMORY_16X_x86` — and then specify `RESOURCE_CONSTRAINT=(architecture='x86')`, as in the following example:

```sqlexample-python
CREATE OR REPLACE FUNCTION pymeos_example()
RETURNS STRING
LANGUAGE PYTHON
HANDLER='main'
RUNTIME_VERSION='3.11'
ARTIFACT_REPOSITORY=snowflake.snowpark.pypi_shared_repository
PACKAGES=('pymeos') -- dependency pymeos-cffi is x86 only
RESOURCE_CONSTRAINT=(architecture='x86')
AS $$
def main() -> str:
   from pymeos import pymeos_initialize, pymeos_finalize, TGeogPointInst, TGeogPointSeq

   # Always initialize MEOS library
   pymeos_initialize()

   sequence_from_string = TGeogPointSeq(
      string='[Point(10.0 10.0)@2019-09-01 00:00:00+01, Point(20.0 20.0)@2019-09-02 00:00:00+01, Point(10.0 10.0)@2019-09-03 00:00:00+01]')

   sequence_from_points = TGeogPointSeq(instant_list=[TGeogPointInst(string='Point(10.0 10.0)@2019-09-01 00:00:00+01'),
        TGeogPointInst(string='Point(20.0 20.0)@2019-09-02 00:00:00+01'),
        TGeogPointInst(string='Point(10.0 10.0)@2019-09-03 00:00:00+01')],
          lower_inc=True, upper_inc=True)
   speed = sequence_from_points.speed()

   # Call finish at the end of your code
   pymeos_finalize()

   return speed
$$;

SELECT pymeos_example();
```

For more information, see [Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md).

You can use Artifact Repository with UDF and Stored Procedure client APIs such as the following:

* [snowflake.snowpark.udf.UDFRegistration](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.udf.UDFRegistration)
* [snowflake.snowpark.functions.sproc](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.sproc)

When using them, specify the following parameters:

> * `ARTIFACT_REPOSITORY`
> * `PACKAGES`

and provide the package name in the `PACKAGES` field.

See the following example:

> ```python
> ...
> ARTIFACT_REPOSITORY="snowflake.snowpark.pypi_shared_repository",
> PACKAGES=["urllib3", "requests"],
> ...
> ```

### Troubleshooting

If the package install fails for the function or procedure creation part, run the following pip command locally to see whether the package specification is valid:

```bash
pip install <package name> --only-binary=:all: --python-version 3.12 –platform <platform_tag>
```

### Limitations

* Access to private repositories is not supported.
* You cannot use this feature directly in Notebooks. However, you can use a UDF or stored procedure that uses PyPI packages within a notebook.
* You cannot use Artifact Repository within anonymous stored procedures.

> **Note:**
>
> * Snowflake does not check or curate the security of Python packages from external sources. You are responsible for evaluating these packages and ensuring that they are safe and reliable.
> * Snowflake reserves the right to block or remove any package that may be harmful or risky, without prior notice. This is to protect the platform’s integrity.

## Importing packages through a Snowflake stage

Snowflake stages can be used to import packages. You can bring in any Python code that follows guidelines defined in [General limitations](udf-python-limitations.md).
For more information, see [Creating a Python UDF with code uploaded from a stage](udf-python-creating.md).

You can only upload pure Python packages or packages with native code through a Snowflake stage.

As an example, you can use the following SQL, which creates a warehouse named `so_warehouse` that has x86 CPU architecture:

```sqlexample
CREATE WAREHOUSE so_warehouse WITH
   WAREHOUSE_SIZE = 'LARGE'
   WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
   RESOURCE_CONSTRAINT = 'MEMORY_16X_X86';
```

To install a package with native code via importing from Stage, use the following example:

```sqlexample-python
CREATE or REPLACE function native_module_test_zip()
  RETURNS string
  LANGUAGE python
  RUNTIME_VERSION=3.12
  RESOURCE_CONSTRAINT=(architecture='x86')
  IMPORTS=('@mystage/mycustompackage.zip')
  HANDLER='compute'
  as
  $$
  def compute():
      import mycustompackage
      return mycustompackage.mycustompackage()
  $$;
```

## Using third-party packages from Anaconda

Snowflake provides access to a curated set of Python packages built by Anaconda. These packages integrate directly into Snowflake’s Python features at no extra cost.

### Licensing terms

* **In Snowflake:** Governed by your existing Snowflake customer agreement, including the Anaconda usage restrictions described in this documentation. No separate Anaconda terms apply for in-Snowflake use.
* **Local development:** From Snowflake’s [dedicated Anaconda repository](https://repo.anaconda.com/pkgs/snowflake/) : Subject to Anaconda’s Embedded End Customer Terms and Anaconda’s Terms of Service posted on the repository. Local use is limited to developing/testing workloads intended for deployment in Snowflake.

### User guidelines

#### Permitted uses

* **Within Snowflake:** Use packages freely across all supported Python features.

  > > **Note:**
  > >
  > > You cannot call a UDF within the DEFAULT clause of a CREATE TABLE statement, with the exception of packages that remain freely available in Snowflake Notebooks on Snowpark Container Services.
* **Local development:** Use packages from Snowflake’s dedicated Anaconda repository to develop or test workloads intended for Snowflake.

#### Prohibited uses

The following uses of packages are prohibited:

* Using packages for projects not related to Snowflake.
* Hosting or mirroring package content externally.
* Removing or modifying copyright or license notices.

### Finding and managing packages

Can’t find a package you need?

* Submit requests via the [Snowflake Ideas forum](https://community.snowflake.com/s/ideas).
* Pure Python packages (without compiled extensions) can be [uploaded directly to a Snowflake stage](../../snowflake-cli/snowpark/upload.md).

### Support and security

#### Support coverage

Snowflake provides standard package support, including:

* Installation guidance
* Environment troubleshooting
* Integration assistance

#### Warranty and SLA

Anaconda packages are third-party software provided *as-is* and are not covered by Snowflake’s warranty or SLA (Service-level agreement).

#### Security practices

Anaconda packages provided by Snowflake are built on trusted infrastructure and digitally signed.

For more details, see [Anaconda’s Security Practices](https://www.anaconda.com/docs/reference/policies-practices/security) .

#### Compliance and licensing

Each package includes its own open-source license. Customers must comply with individual package license terms in addition to the usage guidelines outlined in this documentation.

#### Frequently asked questions

* **Can I use packages from other Anaconda channels (e.g., conda-forge or Anaconda Defaults)?** No. Other channels are separate offerings and may require a commercial license from Anaconda.
* **Can I use these packages locally for projects unrelated to Snowflake?** No. Local usage is strictly limited to developing or testing workloads intended for Snowflake deployment. Other uses require a separate Anaconda license.
* **Why does Snowpark Container Services require separate licensing?** Using packages in custom Docker images extends beyond Snowflake’s integrated environment, necessitating separate Anaconda licensing.

### Displaying and using packages

#### Displaying available packages

You can display all packages available and their version information by querying the PACKAGES view in the Information Schema.

```sqlexample
select * from information_schema.packages where language = 'python';
```

To display version information about a specific package, for example `numpy`, use this command:

```sqlexample
select * from information_schema.packages where (package_name = 'numpy' and language = 'python');
```

> **Note:**
>
> Some packages in the Anaconda Snowflake channel are not intended for use inside Snowflake UDFs because UDFs are executed within a restricted engine.
> For more information, see [Following good security practices](udf-python-designing.md).

When queries that call Python UDFs are executed inside a Snowflake warehouse, Anaconda packages are installed seamlessly and cached on the virtual warehouse on your behalf.

#### Displaying imported packages

You can display a list of the packages and modules a UDF or UDTF is using by executing the [DESCRIBE FUNCTION](../../../sql-reference/sql/desc-function.md) command.
Executing the DESCRIBE FUNCTION command for a UDF whose handler is implemented in Python returns the values of several properties, including a list of imported modules and packages,
as well as installed packages, the function signature, and its return type.

When specifying the identifier for the UDF, be sure to include function parameter types, if any.

```sqlexample
desc function stock_sale_average(varchar, number, number);
```

#### Using Anaconda packages

For an example of how to use an imported Anaconda package in a Python UDF,
refer to [Importing a package in an in-line handler](udf-python-examples.md).

#### Setting packages policies

You can use a packages policy to set allowlists and blocklists for third-party Python packages from Anaconda at the account level.
This lets you meet stricter auditing and security requirements and gives you more fine-grained control over which packages are available or blocked in your environment.
For more information, see [Packages policies](packages-policy.md).

### Performance on cold warehouses

For more efficient resource management, newly provisioned virtual warehouses do not preinstall Anaconda packages.
Instead, Anaconda packages are installed on-demand the first time a UDF is used.
The packages are cached for future UDF execution on the same warehouse. The cache is dropped when the warehouse is suspended.
This may result in slower performance the first time a UDF is used or after the warehouse is resumed.
The additional latency could be approximately 30 seconds.

## Local development and testing

To help you create a conda environment on your local machine for development and testing, Anaconda has
created a Snowflake channel which mirrors a subset of the packages and versions that are supported in
the Snowflake Python UDF environment.
You may use the Snowflake conda channel for local testing and development at no cost under the Supplemental Embedded Software
Terms to Anaconda’s Terms of Service.

For example, to create a new conda environment locally using the Snowflake channel, type something like
this on the command line:

```bash
conda create --name py312_env -c https://repo.anaconda.com/pkgs/snowflake python=3.12 numpy pandas
```

Note that because of platform differences, your local conda environment may not be exactly the same as
the server environment.

## Best practices

Within the `create function` statement, the package specification (for example, `packages = ('numpy','pandas')`) should
only specify the top-level packages that the UDF is using directly.

Anaconda manages dependencies and installs them automatically. You don’t need to specify dependency packages. If you don’t specify a package version, Anaconda installs the most up-to-date version of the package and its dependencies. Specifying a particular version is generally not necessary.

When using `artifact_repository` to source packages from PyPI, specify version constraints to ensure production stability. Unlike Anaconda packages, which are curated for Snowflake compatibility, PyPI packages might introduce breaking changes in new releases. Consider using version bounds, such as `packages = ('pandas<3.0.0',)`, or pinning to specific versions.

Note that version resolution is performed once, when the UDF is created using the `create function` command.
After that, the resulting version resolution is frozen and the same set of packages will be used when this particular UDF executes.

For an example of how to use the package specification within the `create function` statement, see [Importing a package in an in-line handler](udf-python-examples.md).

## Known issues with third-party packages

### Performance with single row prediction

Some data science frameworks, such as Scikit-learn and TensorFlow, might be slow when doing single-row ML prediction.
To improve performance, do batch prediction instead of single-row prediction.
To do this, you can use vectorized Python UDFs, with which you can define Python functions that receive input rows in batches, on which machine
learning or data science libraries are optimized to operate. For more information, see [Vectorized Python UDFs](udf-python-batch.md).

### Downloading data on demand from data science libraries

Some data science libraries, such as [NLTK](https://www.nltk.org/data.html), [Keras](https://www.tensorflow.org/api_docs/python/tf/keras/datasets),
and [spaCy](https://spacy.io) provide functionality to download additional corpora, data, or models on demand.

However, on-demand downloading does not work with Python UDFs due to Snowflake security constraints, which disable some
capabilities, such as network access and writing to files.

To work around this issue, download the data to your local environment and then provide it
to the UDF via a Snowflake stage.

### XGBoost

When using XGBoost in UDF or UDTF for parallel prediction or training, the concurrency for each XGBoost instance should
be set to 1. This ensures that XGBoost is configured for optimal performance when executing in
the Snowflake environment.

Examples:

```python
import xgboost as xgb
model = xgb.Booster()
model.set_param('nthread', 1)
model.load_model(...)
```

```python
import xgboost as xgb
model = xgb.XGBRegressor(n_jobs=1)
```

### TensorFlow/Keras

When using Tensorflow/Keras for prediction, use Model.predict_on_batch and
not Model.predict.

Example:

```python
import keras
model = keras.models.load_model(...)
model.predict_on_batch(np.array([input]))
```

---
title: Vectorized Python UDFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-batch.md
section: Developer Guide
---

# Vectorized Python UDFs

This topic introduces vectorized Python UDFs.

## Overview

Vectorized Python UDFs let you define Python functions that receive batches of input rows
as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html) and
return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
or [Series](https://pandas.pydata.org/docs/reference/series.html).
You call vectorized Python UDFs the same way you call other Python UDFs.

Advantages of using vectorized Python UDFs compared to the default row-by-row processing pattern include:

* The potential for better performance if your Python code operates efficiently on batches of rows.
* Less transformation logic required if you are calling into libraries that operate on Pandas DataFrames or Pandas arrays.

When you use vectorized Python UDFs:

* You do not need to change how you write queries using Python UDFs. All batching is handled by the UDF framework rather than your own code.
* As with non-vectorized UDFs, there is no guarantee of which instances of your handler code will see which batches of input.

## Getting started with vectorized Python UDFs

To create a vectorized Python UDF, use one of the supported mechanisms for annotating your handler function.

### Using the `vectorized` decorator

The `_snowflake` module is exposed to Python UDFs that execute within Snowflake. In your Python code, import the `_snowflake` module,
and use the `vectorized` decorator to specify that your handler expects to receive a Pandas DataFrame by setting the `input` parameter to `pandas.DataFrame`.

```sqlexample-python
CREATE FUNCTION add_one_to_inputs(x NUMBER(10, 0), y NUMBER(10, 0))
  RETURNS NUMBER(10, 0)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('pandas')
  HANDLER = 'add_one_to_inputs'
AS $$
import pandas
from _snowflake import vectorized

@vectorized(input=pandas.DataFrame)
def add_one_to_inputs(df):
 return df[0] + df[1] + 1
$$;
```

### Using a function attribute

Rather than importing the _snowflake module and using the `vectorized` decorator, you can set the special `_sf_vectorized_input` attribute on your handler function.

```sqlexample-python
CREATE FUNCTION add_one_to_inputs(x NUMBER(10, 0), y NUMBER(10, 0))
  RETURNS NUMBER(10, 0)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('pandas')
  HANDLER = 'add_one_to_inputs'
AS $$
import pandas

def add_one_to_inputs(df):
 return df[0] + df[1] + 1

add_one_to_inputs._sf_vectorized_input = pandas.DataFrame
$$;
```

## Setting a target batch size

Calls to the Python handler function must execute within a time limit,
which is 180 seconds, and each DataFrame passed as input to the handler function may currently contain
up to a few thousand rows. In order to stay within the time limit, you may want to set the target batch
size for your handler function, which imposes a maximum number of rows per input DataFrame.
Note that setting a larger value does not guarantee that Snowflake will encode batches with the specified number of rows.
You can set the target batch size using either the `vectorized` decorator or an attribute on the function.

> **Note:**
>
> Using `max_batch_size` is only meant as a mechanism to limit the number of rows that UDF can handle per single batch.
> For example, if the UDF is written in a way that can only process at most 100 rows at a time, then `max_batch_size` should be set to 100.
> Setting `max_batch_size` is not meant to be used as a mechanism to specify arbitrary large batch sizes.
> If the UDF is able to process batches of any size, it is recommended to leave this parameter unset.

### Using the `vectorized` decorator

To set the target batch size using the `vectorized` decorator, pass a positive integer value for the argument named `max_batch_size`.

As an example, this statement creates a vectorized Python UDF and limits each Dataframe to a maximum of 100 rows:

```sqlexample-python
CREATE FUNCTION add_one_to_inputs(x NUMBER(10, 0), y NUMBER(10, 0))
  RETURNS NUMBER(10, 0)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('pandas')
  HANDLER = 'add_one_to_inputs'
AS $$
import pandas
from _snowflake import vectorized

@vectorized(input=pandas.DataFrame, max_batch_size=100)
def add_one_to_inputs(df):
 return df[0] + df[1] + 1
$$;
```

### Using a function attribute

To set the target batch size using a function attribute, set a positive integer value for the `_sf_max_batch_size` attribute on your handler function.

As an example, this statement creates a vectorized Python UDF and limits each DataFrame to a maximum of 100 rows:

```sqlexample-python
CREATE FUNCTION add_one_to_inputs(x NUMBER(10, 0), y NUMBER(10, 0))
  RETURNS NUMBER(10, 0)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('pandas')
  HANDLER = 'add_one_to_inputs'
AS $$
import pandas

def add_one_to_inputs(df):
 return df[0] + df[1] + 1

add_one_to_inputs._sf_vectorized_input = pandas.DataFrame
add_one_to_inputs._sf_max_batch_size = 100
$$;
```

## DataFrame encoding

Batches of arguments to the UDF are encoded as arrays in the input Pandas DataFrames, and the number of rows in each
DataFrame may vary. For more information, see Setting a target batch size. Arguments can be accessed in the
DataFrame by their index, i.e. the first argument has an index of 0, the second has an index of 1, and so on.
The Pandas array or Series that the UDF handler returns must have the same length as that of the input DataFrame.

To illustrate, suppose that you define a vectorized Python UDF as follows:

```sqlexample-python
CREATE OR REPLACE FUNCTION add_inputs(x INT, y FLOAT)
  RETURNS FLOAT
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('pandas')
  HANDLER = 'add_inputs'
AS $$
import pandas
from _snowflake import vectorized

@vectorized(input=pandas.DataFrame)
def add_inputs(df):
  return df[0] + df[1]
$$;
```

This UDF uses `df[0]` to access the Pandas array for the first argument, and `df[1]` for the second. `df[0] + df[1]` results in a Pandas array with the pairwise sums of corresponding elements from the two arrays. After creating the UDF, you might call it with some input rows:

```sqlexample
SELECT add_inputs(x, y)
FROM (
  SELECT 1 AS x, 3.14::FLOAT as y UNION ALL
  SELECT 2, 1.59 UNION ALL
  SELECT 3, -0.5
);
```

```output
+------------------+
| ADD_INPUTS(X, Y) |
|------------------|
|             4.14 |
|             3.59 |
|             2.5  |
+------------------+
```

Here the `add_inputs` Python function receives a DataFrame analogous to one created with the following Python code:

```python
>>> import pandas
>>> df = pandas.DataFrame({0: pandas.array([1, 2, 3]), 1: pandas.array([3.14, 1.59, -0.5])})
>>> df
   0     1
0  1  3.14
1  2  1.59
2  3 -0.50
```

The line `return df[0] + df[1]` in the handler function results in an array similar to the following Python code:

```python
>>> df[0] + df[1]
0    4.14
1    3.59
2    2.50
dtype: float64
```

### Type support

Vectorized Python UDFs support the following [SQL types](../../../sql-reference-data-types.md) for arguments and return values. The table reflects how each SQL argument is encoded as a Pandas array of a particular [dtype](https://pandas.pydata.org/docs/user_guide/basics.html#basics-dtypes).

| SQL Type | Pandas dtype | Notes |
| --- | --- | --- |
| NUMBER | `Int16`, `Int32`, or `Int64` for `NUMBER` arguments with a scale of 0 that all fit in a 64-bit or smaller integer type. If the argument is not nullable, `int16`, `int32`, or `int64` is used instead. (For UDTFs, `Int16`, `Int32`, or `Int64` will always be used.)  `object` for arguments with a scale other than 0, or for arguments that do not fit within a 64-bit integer, where array elements are encoded as `decimal.Decimal` values.  To ensure a 16-bit dtype, use a maximum `NUMBER` precision of 4. To ensure a 32-bit dtype, use a maximum `NUMBER` precision of 9. To ensure a 64-bit dtype, use a maximum `NUMBER` precision of 18. | To ensure that an input argument to a UDF is interpreted as not nullable, pass a column from a table created using the `NOT NULL` column constraint, or use a function such as `IFNULL` on the argument. |
| FLOAT | `float64` | NULL values are encoded as NaN values. In the output, NaN values are interpreted as NULLs. |
| BOOLEAN | `boolean` for nullable arguments or `bool` for non-nullable arguments. |  |
| VARCHAR | `string` | Both Snowflake SQL and Pandas represent strings using UTF-8 encoding. |
| BINARY | `bytes` |  |
| DATE | `datetime64` | Each value is encoded as a `datetime64` with no time component. NULL values are encoded as `numpy.timedelta('NaT')`. |
| VARIANT | `object`  Each value is encoded as a `dict`, `list`, `int`, `float`, `str`, or `bool`. | Each variant row is converted to a Python type dynamically for arguments and vice versa for return values. The following types are converted to strings rather than native Python types: `decimal`, `binary`, `date`, `time`, `timestamp_ltz`, `timestamp_ntz`, `timestamp_tz`. |
| OBJECT | `object`  Each element is encoded as a dict. |  |
| ARRAY | `object`  Each element is encoded as a list. |  |
| TIME | `timedelta64` | Each value is encoded as an offset from midnight. NULL values are encoded as `numpy.timedelta64('NaT')`. When used as a return type, elements of the output may be `numpy.timedelta64` or `datetime.time` values in the range `[00:00:00, 23:59:59.999999999]`. |
| TIMESTAMP_LTZ | `datetime64` | Uses the local time zone to encode each value as a nanosecond-scale `numpy.datetime64` relative to the UTC Unix epoch. NULL values are encoded as `numpy.datetime64('NaT')`. When used as a return type, elements of the output may be `numpy.datetime64` or time zone naive `datetime.datetime` or `pandas.Timestamp` values. |
| TIMESTAMP_NTZ | `datetime64` | Encodes each value as a nanosecond-scale `numpy.datetime64`. NULL values are encoded as `numpy.datetime64('NaT')`. When used as a return type, elements of the output may be `numpy.datetime64` or time zone naive `datetime.datetime` or `pandas.Timestamp` values. |
| TIMESTAMP_TZ | `object` | Encodes each value as a nanosecond-scale `pandas.Timestamp`. NULL values are encoded as `pandas.NA`. When used as a return type, elements of the output may be time zone-aware `datetime.datetime` or `pandas.Timestamp` values. |
| GEOGRAPHY | `object` | Formats each value as GeoJSON and then converts it to a Python `dict`. |

The following types are accepted as output: Pandas `Series` or `array`, NumPy `array`, regular
Python `list`, and any iterable sequence that contains the expected types described
in Type support. It is efficient to use Pandas `Series` and `array` and NumPy `array`
where the dtype is `bool`, `boolean`,
`int16`, `int32`, `int64`, `Int16`, `Int32`, `Int64`, or `float64`
because they expose their contents as `memoryviews`. This means that the contents can be copied rather than each value
being read sequentially.

---
title: Vectorized Python UDTFs
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-tabular-vectorized.md
section: Developer Guide
---

# Vectorized Python UDTFs

This topic introduces vectorized Python UDTFs.

## Overview

Vectorized Python UDTFs (user-defined table functions) provide a way to operate over rows in batches.

Snowflake supports two kinds of vectorized UDTFs:

* UDTFs with a vectorized `end_partition` method
* UDTFs with a vectorized `process` method

You must choose one kind because a UDTF can’t have both a vectorized `process` method and a vectorized `end_partition` method.

### UDTFs with a vectorized end_partition method

UDTFs with a vectorized `end_partition` method enable seamless partition-by-partition processing by operating on
partitions as [pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html)
and returning results as
[pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html)
or lists of [pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
or [pandas Series](https://pandas.pydata.org/docs/reference/series.html).
This facilitates integration with libraries that operate on pandas DataFrames or pandas arrays.

Use a vectorized `end_partition` method for the following tasks:

* Process your data on a partition-by-partition basis instead of on a row-by-row basis.
* Return multiple rows or columns for each partition.
* Use libraries that operate on pandas DataFrames for data analysis.

### UDTFs with a vectorized process method

UDTFs with a vectorized `process` method provide a way to operate over rows in batches, when the operation performs a 1-to-1 mapping.
In other words, the method returns one output row for each input row. The number of columns is not restricted.

Use a vectorized `process` method for the following tasks:

* Apply a 1-to-1 transformation with a multi-columnar result in batches.
* Use a library that requires `pandas.DataFrame`.
* Process rows in batches, without explicit partitioning.
* Leverage the [to_pandas()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.DataFrame.to_pandas) API to transform the query result directly to a pandas DataFrame.

## Prerequisites

The Snowpark Library for Python version 1.14.0 or later is required.

## Create a UDTF with a vectorized end_partition method

1. Optional: Define a handler class with an `__init__` method, which will be invoked before each partition is processed.

   Note: Do not define a `process` method.
2. Define an `end_partition` method that takes in a DataFrame argument and returns or yields a `pandas.DataFrame` or a tuple of `pandas.Series` or `pandas.arrays` where each array is a column.

   The column types of the result must match the column types in the UDTF definition.
3. To mark the `end_partition` method as vectorized, use the `@vectorized` decorator or the `_sf_vectorized_input` function attribute.

   For more information, see [Vectorized Python UDFs](udf-python-batch.md). The `@vectorized` decorator can only be used when the Python UDTF is executed within Snowflake; for example, when using a SQL worksheet. When you are executing using the client or a Python worksheet, you must use the function attribute.

> **Note:**
>
> The default column names for the input DataFrame to a UDTF with a vectorized `end_partition` method match the signature of the SQL function.
> The column names follow the [SQL identifier requirements](../../../sql-reference/identifiers-syntax.md).
> That is, if an identifier is unquoted it will be capitalized, and if it is double quoted it will remain unchanged.

The following code block is an example of creating a UDTF with a vectorized `end_partition` method, using the `@vectorized` decorator:

```python
from _snowflake import vectorized
import pandas

class handler:
  def __init__(self):
    # initialize a state
  @vectorized(input=pandas.DataFrame)
  def end_partition(self, df):
    # process the DataFrame
    return result_df
```

The following code block is an example of creating a UDTF with a vectorized `end_partition` method, using the function attribute:

```python
import pandas

class handler:
  def __init__(self):
    # initialize a state
  def end_partition(self, df):
    # process the DataFrame
    return result_df

handler.end_partition._sf_vectorized_input = pandas.DataFrame
```

> **Note:**
>
> A UDTF with a vectorized `end_partition` method must be called with a PARTITION BY clause to build the partitions.

To call the UDTF with all the data in the same partition:

```sqlexample
SELECT * FROM table(udtf(x,y,z) OVER (PARTITION BY 1));
```

To call the UDTF with the data partitioned by column x:

```sqlexample
SELECT * FROM table(udtf(x,y,z) OVER (PARTITION BY x));
```

### Example: Row collection using a regular UDTF versus using a UDTF with a vectorized end_partition method

Row collection using a regular UDTF:

```python
import pandas

class handler:
  def __init__(self):
    self.rows = []
  def process(self, *row):
    self.rows.append(row)
  def end_partition(self):
    df = pandas.DataFrame(self.rows)
    # process the DataFrame
    return result_df
```

Row collection using a UDTF with a vectorized `end_partition` method:

```python
from _snowflake import vectorized
import pandas

class handler:
  def __init__(self):
    self.rows = []
  @vectorized(input=pandas.DataFrame)
  def end_partition(self, df):
  # process the DataFrame
    return result_df
```

### Example: Calculate the summary statistic for each column in the partition

Here is an example of how to calculate the summary statistic for each column in the partition using
the pandas `describe()` method.

1. Create a table and generate three partitions of five rows each:

   ```sqlexample
   CREATE OR REPLACE TABLE test_values(id VARCHAR, col1 FLOAT, col2 FLOAT, col3 FLOAT, col4 FLOAT, col5 FLOAT);

   -- generate 3 partitions of 5 rows each
   INSERT INTO test_values
     SELECT 'x',
     UNIFORM(1.5,1000.5,RANDOM(1))::FLOAT col1,
     UNIFORM(1.5,1000.5,RANDOM(2))::FLOAT col2,
     UNIFORM(1.5,1000.5,RANDOM(3))::FLOAT col3,
     UNIFORM(1.5,1000.5,RANDOM(4))::FLOAT col4,
     UNIFORM(1.5,1000.5,RANDOM(5))::FLOAT col5
     FROM TABLE(GENERATOR(ROWCOUNT => 5));

   INSERT INTO test_values
     SELECT 'y',
     UNIFORM(1.5,1000.5,RANDOM(10))::FLOAT col1,
     UNIFORM(1.5,1000.5,RANDOM(20))::FLOAT col2,
     UNIFORM(1.5,1000.5,RANDOM(30))::FLOAT col3,
     UNIFORM(1.5,1000.5,RANDOM(40))::FLOAT col4,
     UNIFORM(1.5,1000.5,RANDOM(50))::FLOAT col5
     FROM TABLE(GENERATOR(ROWCOUNT => 5));

   INSERT INTO test_values
     SELECT 'z',
     UNIFORM(1.5,1000.5,RANDOM(100))::FLOAT col1,
     UNIFORM(1.5,1000.5,RANDOM(200))::FLOAT col2,
     UNIFORM(1.5,1000.5,RANDOM(300))::FLOAT col3,
     UNIFORM(1.5,1000.5,RANDOM(400))::FLOAT col4,
     UNIFORM(1.5,1000.5,RANDOM(500))::FLOAT col5
     FROM TABLE(GENERATOR(ROWCOUNT => 5));
   ```
2. Look at the data:

   ```sqlexample
   SELECT * FROM test_values;
   ```

   ```output
   -----------------------------------------------------
   |"ID"  |"COL1"  |"COL2"  |"COL3"  |"COL4"  |"COL5"  |
   -----------------------------------------------------
   |x     |8.0     |99.4    |714.6   |168.7   |397.2   |
   |x     |106.4   |237.1   |971.7   |828.4   |988.2   |
   |x     |741.3   |207.9   |32.6    |640.6   |63.2    |
   |x     |541.3   |828.6   |844.9   |77.3    |403.1   |
   |x     |4.3     |723.3   |924.3   |282.5   |158.1   |
   |y     |976.1   |562.4   |968.7   |934.3   |977.3   |
   |y     |390.0   |244.3   |952.6   |101.7   |24.9    |
   |y     |599.7   |191.8   |90.2    |788.2   |761.2   |
   |y     |589.5   |201.0   |863.4   |415.1   |696.1   |
   |y     |46.7    |659.7   |571.1   |938.0   |513.7   |
   |z     |313.9   |188.5   |964.6   |435.4   |519.6   |
   |z     |328.3   |643.1   |766.4   |148.1   |596.4   |
   |z     |929.0   |255.4   |915.9   |857.2   |425.5   |
   |z     |612.8   |816.4   |220.2   |879.5   |331.4   |
   |z     |487.1   |704.5   |471.5   |378.9   |481.2   |
   -----------------------------------------------------
   ```
3. Create the function:

   ```sqlexample-python
   CREATE OR REPLACE FUNCTION summary_stats(id VARCHAR, col1 FLOAT, col2 FLOAT, col3 FLOAT, col4 FLOAT, col5 FLOAT)
     RETURNS TABLE (column_name VARCHAR, count INT, mean FLOAT, std FLOAT, min FLOAT, q1 FLOAT, median FLOAT, q3 FLOAT, max FLOAT)
     LANGUAGE PYTHON
     RUNTIME_VERSION = 3.12
     PACKAGES = ('pandas')
     HANDLER = 'handler'
   AS $$
   from _snowflake import vectorized
   import pandas

   class handler:
       @vectorized(input=pandas.DataFrame)
       def end_partition(self, df):
         # using describe function to get the summary statistics
         result = df.describe().transpose()
         # add a column at the beginning for column ids
         result.insert(loc=0, column='column_name', value=['col1', 'col2', 'col3', 'col4', 'col5'])
         return result
   $$;
   ```
4. Do one of the following steps:

   * Call the function and partition by `id`:

     ```sqlexample
     -- partition by id
     SELECT * FROM test_values, TABLE(summary_stats(id, col1, col2, col3, col4, col5)
       OVER (PARTITION BY id))
       ORDER BY id, column_name;
     ```

     ```output
     --------------------------------------------------------------------------------------------------------------------------------------------------------------------
     |"ID"  |"COL1"  |"COL2"  |"COL3"  |"COL4"  |"COL5"  |"COLUMN_NAME"  |"COUNT"  |"MEAN"              |"STD"               |"MIN"  |"Q1"   |"MEDIAN"  |"Q3"   |"MAX"  |
     --------------------------------------------------------------------------------------------------------------------------------------------------------------------
     |x     |NULL    |NULL    |NULL    |NULL    |NULL    |col1           |5        |280.25999999999993  |339.5609267863427   |4.3    |8.0    |106.4     |541.3  |741.3  |
     |x     |NULL    |NULL    |NULL    |NULL    |NULL    |col2           |5        |419.25999999999993  |331.72476995244114  |99.4   |207.9  |237.1     |723.3  |828.6  |
     |x     |NULL    |NULL    |NULL    |NULL    |NULL    |col3           |5        |697.62              |384.2964311569911   |32.6   |714.6  |844.9     |924.3  |971.7  |
     |x     |NULL    |NULL    |NULL    |NULL    |NULL    |col4           |5        |399.5               |321.2689294033894   |77.3   |168.7  |282.5     |640.6  |828.4  |
     |x     |NULL    |NULL    |NULL    |NULL    |NULL    |col5           |5        |401.96000000000004  |359.83584173897964  |63.2   |158.1  |397.2     |403.1  |988.2  |
     |y     |NULL    |NULL    |NULL    |NULL    |NULL    |col1           |5        |520.4               |339.16133329139984  |46.7   |390.0  |589.5     |599.7  |976.1  |
     |y     |NULL    |NULL    |NULL    |NULL    |NULL    |col2           |5        |371.84              |221.94799616126298  |191.8  |201.0  |244.3     |562.4  |659.7  |
     |y     |NULL    |NULL    |NULL    |NULL    |NULL    |col3           |5        |689.2               |371.01012789410476  |90.2   |571.1  |863.4     |952.6  |968.7  |
     |y     |NULL    |NULL    |NULL    |NULL    |NULL    |col4           |5        |635.46              |366.6140927460372   |101.7  |415.1  |788.2     |934.3  |938.0  |
     |y     |NULL    |NULL    |NULL    |NULL    |NULL    |col5           |5        |594.64              |359.0334218425911   |24.9   |513.7  |696.1     |761.2  |977.3  |
     |z     |NULL    |NULL    |NULL    |NULL    |NULL    |col1           |5        |534.22              |252.58182238633088  |313.9  |328.3  |487.1     |612.8  |929.0  |
     |z     |NULL    |NULL    |NULL    |NULL    |NULL    |col2           |5        |521.58              |281.4870103574941   |188.5  |255.4  |643.1     |704.5  |816.4  |
     |z     |NULL    |NULL    |NULL    |NULL    |NULL    |col3           |5        |667.72              |315.53336907528495  |220.2  |471.5  |766.4     |915.9  |964.6  |
     |z     |NULL    |NULL    |NULL    |NULL    |NULL    |col4           |5        |539.8199999999999   |318.73025742781306  |148.1  |378.9  |435.4     |857.2  |879.5  |
     |z     |NULL    |NULL    |NULL    |NULL    |NULL    |col5           |5        |470.82              |99.68626786072393   |331.4  |425.5  |481.2     |519.6  |596.4  |
     --------------------------------------------------------------------------------------------------------------------------------------------------------------------
     ```
   * Call the function and treat the whole table as one partition:

     > ```sqlexample
     > -- treat the whole table as one partition
     > SELECT * FROM test_values, TABLE(summary_stats(id, col1, col2, col3, col4, col5)
     >   OVER (PARTITION BY 1))
     >   ORDER BY id, column_name;
     > ```
     >
     > ```output
     > ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
     > |"ID"  |"COL1"  |"COL2"  |"COL3"  |"COL4"  |"COL5"  |"COLUMN_NAME"  |"COUNT"  |"MEAN"             |"STD"               |"MIN"  |"Q1"                |"MEDIAN"  |"Q3"    |"MAX"  |
     > ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
     > |NULL  |NULL    |NULL    |NULL    |NULL    |NULL    |col1           |15       |444.96             |314.01110034974425  |4.3    |210.14999999999998  |487.1     |606.25  |976.1  |
     > |NULL  |NULL    |NULL    |NULL    |NULL    |NULL    |col2           |15       |437.56             |268.95505944302295  |99.4   |204.45              |255.4     |682.1   |828.6  |
     > |NULL  |NULL    |NULL    |NULL    |NULL    |NULL    |col3           |15       |684.8466666666667  |331.87254839915937  |32.6   |521.3               |844.9     |938.45  |971.7  |
     > |NULL  |NULL    |NULL    |NULL    |NULL    |NULL    |col4           |15       |524.9266666666666  |327.074780585783    |77.3   |225.6               |435.4     |842.8   |938.0  |
     > |NULL  |NULL    |NULL    |NULL    |NULL    |NULL    |col5           |15       |489.14             |288.9176669671038   |24.9   |364.29999999999995  |481.2     |646.25  |988.2  |
     > ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
     > ```

## Create a UDTF with a vectorized process method

1. Define a handler class, similar to regular UDTFs, with optional `__init__` and `end_partition` methods.
2. Define a `process` method that takes in a DataFrame argument and returns either a `pandas.DataFrame` or a tuple of `pandas.Series` or `pandas.arrays` where each array is a column.

   The column types of the result must match the column types in the UDTF definition.
   The returned result must be exactly one DataFrame or tuple. This is different from a vectorized `end_partition` method where you can yield or return a list.
3. To mark the `process` method as vectorized, use the `@vectorized` decorator or the `_sf_vectorized_input` function attribute.

   For more information, see [Vectorized Python UDFs](udf-python-batch.md).
   The `@vectorized` decorator can only be used when the Python UDTF is executed within Snowflake; for example, when using a SQL worksheet.
   When you are executing using the client or a Python worksheet, you must use the function attribute.
4. Optional: If your Python handler function is exceeding the execution time limit, [set a target batch size](udf-python-batch.md).

> **Note:**
>
> The default column names for the input DataFrame to a UDTF with a vectorized `process` method match the signature of the SQL function.
> The column names follow the [SQL identifier requirements](../../../sql-reference/identifiers-syntax.md).
> Namely, if an identifier is unquoted it will be capitalized, and if it is double quoted it will remain unchanged.

The handler for a UDTF with a vectorized `process` method can be implemented to process batches in a partition-aware manner or to process them simply batch by batch.
For more information, see [Stateful and Stateless Processing](udf-python-tabular-functions.md).

### Example: Use a UDTF with a vectorized process method to apply one hot encoding

Use a UDTF with a vectorized `process` method to apply one hot encoding on a table with ten categories:

```python
import pandas as pd
from snowflake.snowpark import Session
from snowflake.snowpark.types import PandasDataFrame

class one_hot_encode:
  def process(self, df: PandasDataFrame[str]) -> PandasDataFrame[int,int,int,int,int,int,int,int,int,int]:
      return pd.get_dummies(df)
  process._sf_vectorized_input = pd.DataFrame

one_hot_encode_udtf = session.udtf.register(
  one_hot_encode,
  output_schema=["categ0", "categ1", "categ2", "categ3", "categ4", "categ5", "categ6", "categ7", "categ8", "categ9"],
  input_names=['"categ"']
)

df_table = session.table("categories")
df_table.show()
```

Sample result:

```output
-----------
|"CATEG"  |
-----------
|categ1   |
|categ6   |
|categ8   |
|categ5   |
|categ7   |
|categ5   |
|categ1   |
|categ2   |
|categ2   |
|categ4   |
-----------
```

Prepare to print the table:

```python
res = df_table.select("categ", one_hot_encode_udtf("categ")).to_pandas()
print(res.head())
```

Sample result:

```output
    CATEG  CATEG0  CATEG1  CATEG2  CATEG3  CATEG4  CATEG5  CATEG6  CATEG7  CATEG8  CATEG9
0  categ0       1       0       0       0       0       0       0       0       0       0
1  categ0       1       0       0       0       0       0       0       0       0       0
2  categ5       0       0       0       0       0       1       0       0       0       0
3  categ3       0       0       0       1       0       0       0       0       0       0
4  categ8       0       0       0       0       0       0       0       0       1       0
```

You can obtain the same result with a vectorized UDF, although is less convenient.
You need to package the results into one column, and then unpack the column to restore the results to a usable pandas DataFrame.

Example of using a vectorized UDF:

```python
def one_hot_encode(df: PandasSeries[str]) -> PandasSeries[Variant]:
  return pd.get_dummies(df).to_dict('records')

one_hot_encode._sf_vectorized_input = pd.DataFrame

one_hot_encode_udf = session.udf.register(
  one_hot_encode,
  output_schema=["encoding"],
)

df_table = session.table("categories")
df_table.show()
res = df_table.select(one_hot_encode_udf("categ")).to_df("encoding").to_pandas()
print(res.head())
0  {\n  "categ0": false,\n  "categ1": false,\n  "...
1  {\n  "categ0": false,\n  "categ1": true,\n  "c...
2  {\n  "categ0": false,\n  "categ1": false,\n  "...
3  {\n  "categ0": false,\n  "categ1": false,\n  "...
4  {\n  "categ0": true,\n  "categ1": false,\n  "c...
```

## Type support

Vectorized UDTFs support the same [SQL types](../../../sql-reference-data-types.md) as
vectorized UDFs. However, for vectorized UDTFs,
SQL `NUMBER` arguments with a scale of 0 that all fit in a 64-bit
or smaller integer type will always be mapped to `Int16`, `Int32`, or `Int64`.
Unlike scalar UDFs, if the argument of a UDTF is not nullable, it will not be converted to `int16`, `int32`, or `int64`.

To view a table showing how SQL types are mapped to pandas dtypes, see [the type support table](udf-python-batch.md) in the
vectorized Python UDFs topic.

## Best practices

* If a scalar must be returned with each row, build a list of repeated values instead of unpackaging the `numpy` array to create tuples.
  For example, for a two-column result, instead of:

  ```python
  return tuple(map(lambda n: (scalar_value, n[0], n[1]), results))
  ```

  Use this:

  ```python
  return tuple([scalar_value] * len(results), results[:, 0], results[:, 1])
  ```
* To improve performance, unpackage semi-structured data into columns.

  For example, if you have a variant column, `obj`, with elements, `x(int)`, `y(float)`, and `z(string)`,
  instead of defining a UDTF with a signature like this, and calling it using `vec_udtf(obj)`:

  ```sqlexample
  CREATE FUNCTION vec_udtf(variant OBJ)
  ```

  Define the UDTF with a signature like this, and call it using `vec_udtf(obj:x, obj:y, obj:z)`:

  ```python
  CREATE FUNCTION vec_udtf(a INTEGER, b FLOAT, c STRING)
  ```
* By default, Snowflake encodes the inputs into pandas dtypes that support NULL values (for example, [Int64](https://pandas.pydata.org/docs/reference/api/pandas.Int64Dtype.html)).
  If you are using a library that requires a primitive type (such as `numpy`) and your input has no NULL values, you should cast the column to a primitive type before using the library. For example:

  ```python
  input_df['y'] =  input_df['y'].astype("int64")
  ```

  For more information, see Type Support.
* When using UDTFs with a vectorized `end_partition` method, to improve performance and prevent timeouts, avoid using `pandas.concat` to accumulate partial results. Instead, yield the partial result whenever one is ready.

  For example, instead of:

  ```python
  results = []
  while(...):
    partial_result = pd.DataFrame(...)
    results.append(partial_result)
  return pd.concat(results)
  ```

  Do this:

  ```python
  while(...):
    partial_result = pd.DataFrame(...)
    yield partial_result
  ```

---
title: Viewing log messages
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-accessing-messages.md
section: Developer Guide
---

# Viewing log messages

You can view the log messages either through Snowsight or by querying the event table in which log entries are stored.

> **Note:**
>
> Before you can begin using log messages, you must [enable telemetry data collection](logging-tracing-enabling.md).

## Required privileges

To view log entries in Snowsight or query the event table directly, your active role must have one
of the following:

* The ACCOUNTADMIN role.
* The `SNOWFLAKE.EVENTS_VIEWER` application role, which grants SELECT access to the
  [EVENTS_VIEW](../../sql-reference/telemetry/events_view.md) of the default event table.
* The `SNOWFLAKE.EVENTS_ADMIN` application role, which grants broader access including SELECT,
  TRUNCATE, and DELETE on the default event table.
* The SELECT privilege on a [custom event table](event-table-setting-up.md), if your account
  uses a custom event table instead of the default.

For information on granting these roles, see
[Roles for access to the default event table and EVENTS_VIEW](event-table-setting-up.md).

## View log entries in Snowsight

You can use Snowsight to view log data captured in the event table.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Traces & logs.
3. In the Traces & Logs page, you can do the following actions:

   * To filter the displayed rows, use the drop-down menus at the top of the page. You can filter by the following characteristics:

     + Date range during which the entry was recorded
     + Name of the Snowflake user executing the code that emitted the entry
     + Log entry severity
     + Programming language of the code that emitted the log entry
   * To filter entries by a specific time period in the displayed data, select the graph bar representing the time period.
   * To sort rows, select the name of the column by which you want to sort.
   * To view more detailed information about an entry in its Details panel, select the entry’s row.

     On this panel, you can view more information stored in the event table. The following table describes values in the panel:

     | Detail | Description |
     | --- | --- |
     | Record Type | Type of event the selected row represents. Retrieved from the [RECORD_TYPE column](event-table-columns.md) value. |
     | Database | Name of the database containing the code that emitted the entry. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.database.name` value. |
     | Schema | Name of the schema containing the code that emitted the entry. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.schema.name` value. |
     | Severity | Severity of the log entry. Retrieved from the [RECORD column](event-table-columns.md) `severity_text` value. |
     | Query ID | ID of the query within which the log entry was emitted. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.query.id` value. |
     | Object | Name of the emitted log entry’s source, such as the function or procedure. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.executable.name` value. |
     | Warehouse | Name of the warehouse running the query that generated the event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.warehouse.name` value. |
     | Owner | Name of the primary role in the session. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.session.role.primary.name` value. |
     | Log Text | Log message text. Retrieved from the [VALUE column](event-table-columns.md) value. |

## Query the event table for log entries

To access the logged messages, execute the SELECT command on the event table.

An event table has a set of predefined columns that capture information about the logged messages, including the following:

* The timestamp when the message was ingested
* The scope of the log event, such as the name of the class where the log event was created
* The log event source, including the database, schema, user, warehouse
* The severity level of the log
* The log message

For reference information about the event table’s structure, see [Event table columns](event-table-columns.md).

The following sections illustrate with example data how you can query the event table for log message data.

### Collected data

Output in the following example shows content from a selected subset of columns from an event table after log messages have been captured
for two separate handlers: one written in Scala and the other in Python.

For reference information about event table columns that collect log message data, see [Data for logs](event-table-columns.md).

```output
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| TIMESTAMP           | SCOPE                             | RESOURCE_ATTRIBUTES   | RECORD_TYPE | RECORD                       | VALUE                                                      |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | { "name": "python_logger" }       | **See excerpt below** | LOG         | { "severity_text": "INFO" }  | Logging from Python module.                                |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | { "name": "python_logger" }       | **See excerpt below** | LOG         | { "severity_text": "INFO" }  | Logging from Python function start.                        |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | { "name": "python_logger" }       | **See excerpt below** | LOG         | { "severity_text": "ERROR" } | Logging an error from Python handler.                      |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:12:55 | { "name": "ScalaLoggingHandler" } | **See excerpt below** | LOG         | { "severity_text": "INFO" }  | Logging from within the Scala constructor.                 |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:12:56 | { "name": "ScalaLoggingHandler" } | **See excerpt below** | LOG         | { "severity_text": "INFO" }  | Logging from Scala method start.                           |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:12:56 | { "name": "ScalaLoggingHandler" } | **See excerpt below** | LOG         | { "severity_text": "ERROR" } | Logging an error from Scala handler: Something went wrong. |
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

#### RESOURCE_ATTRIBUTES excerpts

The following JSON includes excerpts from values you’d find in the preceding output’s RESOURCE_ATTRIBUTES column. Each
`snow.executable.name` name-value pair is from a different row in the preceding output.

The SELECT query code following this excerpt selects from the RESOURCE_ATTRIBUTES column’s value.

The RESOURCE_ATTRIBUTES column contains data about the event’s source. For reference information, see
[RESOURCE_ATTRIBUTES column](event-table-columns.md).

```sqljson
{
  ...
  "snow.executable.name": "ADD_TWO_NUMBERS(A FLOAT, B FLOAT):FLOAT"
  ...
}

{
  ...
  "snow.executable.name": "ADD_TWO_NUMBERS(A FLOAT, B FLOAT):FLOAT"
  ...
}

{
  ...
  "snow.executable.name": "ADD_TWO_NUMBERS(A FLOAT, B FLOAT):FLOAT"
  ...
}

{
  ...
  "snow.executable.name": "DO_LOGGING():VARCHAR(16777216)"
  ...
}

{
  ...
  "snow.executable.name": "DO_LOGGING():VARCHAR(16777216)"
  ...
}

{
  ...
  "snow.executable.name": "DO_LOGGING():VARCHAR(16777216)"
  ...
}
```

### Query with SELECT statement

When querying for message data, to select attribute values within a column, use
[bracket notation](../../user-guide/querying-semistructured.md), as in the following form:

```sqlexample
COLUMN_NAME['attribute_name']
```

Code in the following example queries the preceding table with the intention of isolating data related to the Python handler’s log messages.
The query selects the `severity_text` attribute for the log entry severity. It selects the `VALUE` column’s content for the log
message.

The procedure containing the handler is called `do_logging`. Note that for the query to work, you must specify the procedure name
in all capital letters.

```sqlexample
SET event_table_name='my_db.public.my_event_table';

SELECT
  TIMESTAMP as time,
  RESOURCE_ATTRIBUTES['snow.executable.name'] as executable,
  RECORD['severity_text'] as severity,
  VALUE as message
FROM
  IDENTIFIER($event_table_name)
WHERE
  SCOPE['name'] = 'python_logger'
  AND RESOURCE_ATTRIBUTES['snow.executable.name'] LIKE '%DO_LOGGING%'
  AND RECORD_TYPE = 'LOG';
```

### Query results

Output in the following example illustrates the query’s result:

```output
----------------------------------------------------------------------------------------------------------------
| TIME                | EXECUTABLE                       | SEVERITY   | MESSAGE                                |
----------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | "DO_LOGGING():VARCHAR(16777216)" | "INFO"     | "Logging from Python module."          |
----------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | "DO_LOGGING():VARCHAR(16777216)" | "INFO"     | "Logging from Python function start."  |
----------------------------------------------------------------------------------------------------------------
| 2023-04-19 22:00:49 | "DO_LOGGING():VARCHAR(16777216)" | "ERROR"    | "Logging an error from Python handler" |
----------------------------------------------------------------------------------------------------------------
```

---
title: Viewing metrics data
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/metrics-viewing-data.md
section: Developer Guide
---

# Viewing metrics data

You can view metrics data for analysis in the following ways:

* Use Snowsight or Grafana.
* Execute a SELECT command on the event table.

> **Note:**
>
> Before you can begin using metrics data, you must [enable telemetry data collection](logging-tracing-enabling.md).

## Visualize on Snowsight

You can use Snowsight to view metrics data captured in the event table.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Query History.
3. In the Query History page, select the query for which you want to view metrics data.
4. Select the Query Telemetry tab.
5. Select the span for which you want to view data, then select the Related Metrics tab.
6. View the time series metrics over the course of the query and metrics over time.

## Visualize on Grafana

You can also use free partner tools, such as Grafana Cloud, to visualize the metrics data.

For Snowflake dashboard templates and installation instructions for Grafana Cloud, see [Snowflake Telemetry Dashboard Templates](https://github.com/snowflakedb/snowflake-telemetry-dashboard-templates).

## Query with SELECT statement

When querying for data, you can select attribute values within a column by using
[bracket notation](../../user-guide/querying-semistructured.md), as in the following form:

Code in the following example queries the preceding table with the intention of isolating data related to the `DIGITS_OF_NUMBER`
function.

```sqlexample
SET EVENT_TABLE_NAME='my_db.public.my_events';

SELECT TIMESTAMP, RESOURCE_ATTRIBUTES['snow.executable.name'] AS FUNCTION_NAME, RECORD['METRIC']['NAME']AS METRIC_NAME, VALUE
FROM EVENT_TABLE_NAME
WHERE
  RESOURCE_ATTRIBUTES['snow.query.id']  = <INSERT YOUR QUERY ID>
  AND RECORD_TYPE = 'METRIC'
ORDER BY TIMESTAMP DESC;
```

---
title: Viewing trace data
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/tracing-accessing-events.md
section: Developer Guide
---

# Viewing trace data

You can view trace data either through Snowsight or by querying the event table in which trace data is stored.

> **Note:**
>
> Before you can begin using trace data, you must [enable telemetry data collection](logging-tracing-enabling.md).

## View trace entries in Snowsight

You can use Snowsight to view trace data captured in the event table.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Monitoring » Traces & logs.
3. In the Traces & Logs page, you can view trace entries with the following columns:

   | Column | Description |
   | --- | --- |
   | Date | Date on which the entry was recorded. |
   | Duration | Length of time from the trace’s start to its end. |
   | Trace Name | The name of the executable generating the event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.executable.name` value. |
   | Status | `Error` if the trace reported errors; otherwise, `Success`. |
   | Spans | Number of spans in the trace. |
4. In the Traces & Logs page, you can perform the following actions:

   * To filter the displayed rows, use the drop-down menus at the top of the page. You can filter by the following characteristics:

     + Date range during which the entry was recorded
     + Status of the trace, such as `Success` or `Error`
     + Database on which the trace was run
   * To sort rows, select the name of the column by which you want to sort.
5. To view more detailed information about an entry in its Trace Details page, select the entry’s row.
6. In the Traces Details page, you can view a list of spans.

   A span object contains trace events. For more information, see [How Snowflake represents trace events](tracing-how-events-work.md).

   * To filter the displayed rows use the drop-down menu at the top of the page. You can filter by Span Type: UDF, procedure, or
     Streamlit.
   * To view a legend describing data shown in the rows, select the Legend dropdown, and then select the legend you want to see.
   * To view more detailed information about an entry, select the entry’s row.

     On this panel, you can view more information stored in the event table. The following tables describe values in the panel:

     **Details tab**

     | Detail | Description |
     | --- | --- |
     | Trace ID | A unique identifier for calls made from a query. Retrieved from the [TRACE column](event-table-columns.md) `trace_id` value. For more information, see [Trace value](event-table-columns.md). |
     | Span ID | A unique identifier tied to the threading model. Retrieved from the [TRACE column](event-table-columns.md) `span_id` value. For more information, see [Trace value](event-table-columns.md). |
     | Scope | Namespace of code emitting the event. Retrieved from the [SCOPE column](event-table-columns.md). |
     | Duration | The span’s duration from start to finish. For more information, see [For SPAN RECORD_TYPE](event-table-columns.md). |
     | Name | The span’s name. Retrieved from the [RECORD column](event-table-columns.md) `name` value. |
     | Parent Span ID | Unique ID of the span containing the selected span. |
     | Status Code | The span’s status code. Retrieved from the [RECORD column](event-table-columns.md) `status` value. |
     | Other attributes | Attributes and values added by user code. |
     | Query ID | ID of the query that initiated the trace. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.query.id` value. |
     | Name | The name of the executable generating the event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.executable.name` value. |
     | Type | The type of executable that generated the event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.executable.type` value. |
     | User | The name of the user executing the function or procedure. For a Streamlit app, the name of the user who was viewing the app for a given event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `db.user` value. |
     | Owner | The name of the role with OWNERSHIP privilege for the executable. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.owner.name` value. |
     | Role | The name of the primary role in the session. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.session.role.primary.name` value. |
     | Warehouse | The name of the warehouse running the query generating the event. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.warehouse.name` value. |
     | Database | The name of the database containing the executable. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.database.name` value. |
     | Schema | The name of the schema containing the executable. Retrieved from the [RESOURCE_ATTRIBUTES column](event-table-columns.md) `snow.schema.name` value. |

     **Span Events tab**

     Displays data recorded for trace events. For more information, see [Event data recorded](tracing-how-events-work.md).

     **Related Metrics tab**

     Displays charts illustrating CPU and memory metrics for resource consumption by Snowpark Python stored procedures and UDFs. Metrics
     associated with the UDF are for a specific query. If you select a UDF span from the list, the metrics are related to one or more
     UDF spans for the same query. If you select a procedure from the list, you will see procedure metrics for a single span.

     **Logs tab**

     Displays the value logged by the event. Retrieved from the [VALUE column](event-table-columns.md).

## Query the event table for trace entries

An event table has a set of predefined columns that capture information about the logged messages, including:

* The timestamp when a span began.
* The timestamp when the event was created.
* The type of data recorded, such as whether the data is for a span or span event.
* The name of the span or event.
* Attributes, if any, associated with the span or event.

For reference information about event table columns, see [Event table columns](event-table-columns.md).

### Trace data query example

The following sections illustrate with example data how you can query the event table for trace data.

#### Collected data

Output in the following example shows content from a selected subset of columns from an event table after trace data has been captured
for three separate handlers written in Python.

For reference information on event table columns that collect trace data, see [Data for trace events](event-table-columns.md).

```output
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| TIMESTAMP          | START_TIMESTAMP    | RESOURCE_ATTRIBUTES   | RECORD_TYPE | RECORD                                                                                                  | RECORD_ATTRIBUTES                                                           |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 | 2023-04-20 0:45:49 | **See excerpt below** | SPAN        | { "kind": "SPAN_KIND_INTERNAL", "name": "digits_of_number", "status": { "code": "STATUS_CODE_UNSET" } } |                                                                             |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 |                    |                       | SPAN_EVENT  | { "name": "test_udtf_init" }                                                                            |                                                                             |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 |                    |                       | SPAN_EVENT  | { "name": "test_udtf_process" }                                                                         | { "input": "42" }                                                           |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 |                    |                       | SPAN_EVENT  | { "name": "test_udtf_end_partition" }                                                                   |                                                                             |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:46:00 | 2023-04-20 0:46:00 |                       | SPAN        | { "kind": "SPAN_KIND_INTERNAL", "name": "times_two", "status": { "code": "STATUS_CODE_UNSET" } }        | { "example.func.times_two": "begin", "example.func.times_two.response": 8 } |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:46:00 |                    |                       | SPAN_EVENT  | { "name": "event_without_attributes" }                                                                  |                                                                             |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:46:00 |                    |                       | SPAN_EVENT  | { "name": "event_with_attributes" }                                                                     | { "example.key1": "value1", "example.key2": "value2" }                      |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:46:08 | 2023-04-20 0:46:08 |                       | SPAN        | { "kind": "SPAN_KIND_INTERNAL", "name": "do_tracing", "status": { "code": "STATUS_CODE_UNSET" } }       | { "example.proc.do_tracing": "begin" }                                      |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:46:08 |                    |                       | SPAN_EVENT  | { "name": "event_with_attributes" }                                                                     | { "example.key1": "value1", "example.key2": "value2" }                      |
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

**RESOURCE_ATTRIBUTES Excerpts**

The following JSON excerpts contain two of the attributes included in the RESOURCE_ATTRIBUTES column for each of the three handlers whose
data is included in the preceding output. The SELECT statement following these excerpts selects values from these attributes.

The RESOURCE_ATTRIBUTES column contains data about the event’s source. For reference information, see
[RESOURCE_ATTRIBUTES column](event-table-columns.md).

```sqljson
{
  ...
  "snow.executable.name": "DIGITS_OF_NUMBER(INPUT NUMBER):TABLE: (RESULT NUMBER)",
  "snow.executable.type": "FUNCTION",
  ...
}

{
  ...
  "snow.executable.name": "TIMES_TWO(X NUMBER):NUMBER(38,0)",
  "snow.executable.type": "FUNCTION",
  ...
}

{
  ...
  "snow.executable.name": "DO_TRACING():VARIANT",
  "snow.executable.type": "PROCEDURE",
  ...
}
```

#### Query with SELECT statement

When querying for data, you can select attribute values within a column by using
[bracket notation](../../user-guide/querying-semistructured.md), as in the following form:

```sqlexample
COLUMN_NAME['attribute_name']
```

Code in the example below queries the preceding table with the intention of isolating data related to the `DIGITS_OF_NUMBER`
function.

```sqlexample
SET EVENT_TABLE_NAME='my_db.public.my_events';

SELECT
  TIMESTAMP as time,
  RESOURCE_ATTRIBUTES['snow.executable.name'] as handler_name,
  RESOURCE_ATTRIBUTES['snow.executable.type'] as handler_type,
  RECORD['name'] as event_name,
  RECORD_ATTRIBUTES as attributes
FROM
  IDENTIFIER($event_table_name)
WHERE
  RECORD_TYPE = 'SPAN_EVENT'
  AND HANDLER_NAME LIKE 'DIGITS_OF_NUMBER%';
```

#### Query results

Output in the following example illustrates the query’s result.

```output
-------------------------------------------------------------------------------------------------------------------------------------------
| TIME               | HANDLER_NAME                                          | HANDLER_TYPE | EVENT_NAME              | ATTRIBUTES        |
-------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 | DIGITS_OF_NUMBER(INPUT NUMBER):TABLE: (RESULT NUMBER) | FUNCTION     | test_udtf_init          |                   |
-------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 | DIGITS_OF_NUMBER(INPUT NUMBER):TABLE: (RESULT NUMBER) | FUNCTION     | test_udtf_process       | { "input": "42" } |
-------------------------------------------------------------------------------------------------------------------------------------------
| 2023-04-20 0:45:49 | DIGITS_OF_NUMBER(INPUT NUMBER):TABLE: (RESULT NUMBER) | FUNCTION     | test_udtf_end_partition |                   |
-------------------------------------------------------------------------------------------------------------------------------------------
```

---
title: Working with asynchronous child jobs
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/asynchronous-child-jobs.md
section: Developer Guide
---

# Working with asynchronous child jobs

This topic explains how to use asynchronous child jobs in Snowflake Scripting.

## Introduction to asynchronous child jobs

In Snowflake Scripting, an asynchronous child job is a query that runs in the background while code in a block
continues to run. The query can be any valid SQL statement, including SELECT statements and DML statements, such
as INSERT or UPDATE.

To run a query as an asynchronous child job, place the ASYNC keyword before the query. When this keyword is omitted,
the Snowflake Scripting block runs child jobs sequentially, and each child job waits for the running child job to finish before
it starts. Asynchronous child jobs can run concurrently, which can improve efficiency and reduce overall run time.

You can use the ASYNC keyword in the following ways:

* For a query that is run for a [RESULTSET](resultsets.md).
* For a query that is run independent of a RESULTSET.

To manage asynchronous child jobs, use the [AWAIT](../../sql-reference/snowflake-scripting/await.md) and
[CANCEL](../../sql-reference/snowflake-scripting/cancel.md) statements:

* The AWAIT statement waits for all asynchronous child jobs that are running to finish or for a specific child job that is
  running for a RESULTSET to finish, then returns when the all jobs have finished or the specific job has finished, respectively.
* The CANCEL statement cancels an asynchronous child job that is running for a RESULTSET.

You can check the status of an asynchronous child job that is running for a RESULTSET by calling the
[SYSTEM$GET_RESULTSET_STATUS](../../sql-reference/functions/system_get_resultset_status.md) function.

Currently, up to 4,000 asynchronous child jobs can run concurrently. An error is returned if the number of concurrent
asynchronous child jobs exceeds this limit.

> **Note:**
>
> When multiple asynchronous child jobs run concurrently in the same session, the job completion order isn’t
> known until the jobs have finished running. Since the completion order can vary, using the
> [LAST_QUERY_ID](../../sql-reference/functions/last_query_id.md) function with asynchronous child jobs is
> non-deterministic.

## Examples of using asynchronous child jobs

The following sections provide examples of using asynchronous child jobs:

* Example: Running child jobs that query tables concurrently
* Example: Running child jobs that insert rows into tables concurrently
* Example: Running child jobs in stored procedures with AWAIT ALL statements
* Example: Running child jobs for inserts in a loop

### Example: Running child jobs that query tables concurrently

The following code shows how to use the ASYNC keyword to run multiple child jobs that query
tables concurrently. The example specifies the ASYNC keyword for queries that are run for
RESULTSETs.

This example uses the data in the following tables:

```sqlexample
CREATE OR REPLACE TABLE orders_q1_2024 (
  order_id INT,
  order_amount NUMBER(12,2));

INSERT INTO orders_q1_2024 VALUES (1, 500.00);
INSERT INTO orders_q1_2024 VALUES (2, 225.00);
INSERT INTO orders_q1_2024 VALUES (3, 725.00);
INSERT INTO orders_q1_2024 VALUES (4, 150.00);
INSERT INTO orders_q1_2024 VALUES (5, 900.00);

CREATE OR REPLACE TABLE orders_q2_2024 (
  order_id INT,
  order_amount NUMBER(12,2));

INSERT INTO orders_q2_2024 VALUES (1, 100.00);
INSERT INTO orders_q2_2024 VALUES (2, 645.00);
INSERT INTO orders_q2_2024 VALUES (3, 275.00);
INSERT INTO orders_q2_2024 VALUES (4, 800.00);
INSERT INTO orders_q2_2024 VALUES (5, 250.00);
```

The following stored procedure performs the following actions:

* Queries both tables for the `order_amount` values in all rows and returns the results to
  different RESULTSETs (one for each table).
* Specifies that the queries run as concurrent child jobs by using the ASYNC keyword.
* Executes the AWAIT statement for each RESULTSET so
  that the procedure waits for the queries to finish before proceeding. Query results for a
  RESULTSET can’t be accessed until AWAIT is run for the RESULTSET.
* Uses a cursor to calculate the sum of the `order_amount` rows for each table.
* Adds the totals for the tables and returns the value.

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_async_child_jobs_query()
RETURNS INTEGER
LANGUAGE SQL
AS
DECLARE
  accumulator1 INTEGER DEFAULT 0;
  accumulator2 INTEGER DEFAULT 0;
  res1 RESULTSET DEFAULT ASYNC (SELECT order_amount FROM orders_q1_2024);
  res2 RESULTSET DEFAULT ASYNC (SELECT order_amount FROM orders_q2_2024);
BEGIN
  AWAIT res1;
  LET cur1 CURSOR FOR res1;
  OPEN cur1;
  AWAIT res2;
  LET cur2 CURSOR FOR res2;
  OPEN cur2;
  FOR row_variable IN cur1 DO
      accumulator1 := accumulator1 + row_variable.order_amount;
  END FOR;
  FOR row_variable IN cur2 DO
      accumulator2 := accumulator2 + row_variable.order_amount;
  END FOR;
  RETURN accumulator1 + accumulator2;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_async_child_jobs_query()
RETURNS INTEGER
LANGUAGE SQL
AS
$$
  DECLARE
    accumulator1 INTEGER DEFAULT 0;
    accumulator2 INTEGER DEFAULT 0;
    res1 RESULTSET DEFAULT ASYNC (SELECT order_amount FROM orders_q1_2024);
    res2 RESULTSET DEFAULT ASYNC (SELECT order_amount FROM orders_q2_2024);
  BEGIN
    AWAIT res1;
    LET cur1 CURSOR FOR res1;
    OPEN cur1;
    AWAIT res2;
    LET cur2 CURSOR FOR res2;
    OPEN cur2;
    FOR row_variable IN cur1 DO
        accumulator1 := accumulator1 + row_variable.order_amount;
    END FOR;
    FOR row_variable IN cur2 DO
        accumulator2 := accumulator2 + row_variable.order_amount;
    END FOR;
    RETURN accumulator1 + accumulator2;
  END;
$$;
```

Call the stored procedure:

```sqlexample
CALL test_sp_async_child_jobs_query();
```

```output
+--------------------------------+
| TEST_SP_ASYNC_CHILD_JOBS_QUERY |
|--------------------------------|
|                           4570 |
+--------------------------------+
```

### Example: Running child jobs that insert rows into tables concurrently

The following code shows how to use the ASYNC keyword to run multiple child jobs that insert
rows into a table concurrently. The example specifies the ASYNC keyword for queries that are run for
RESULTSETs.

The following stored procedure performs the following actions:

* Creates the `orders_q3_2024` table if it doesn’t exist.
* Creates two RESULTSETs, `insert_1` and `insert_2`, that hold the results of inserts into the table.
  The stored procedure arguments specify the values that are inserted into the table.
* Specifies that the inserts run as concurrent child jobs by using the ASYNC keyword.
* Executes the AWAIT statement for each RESULTSET so
  that the procedure waits for the inserts to finish before proceeding. The results of a
  RESULTSET can’t be accessed until AWAIT is run for the RESULTSET.
* Creates a new RESULTSET `res` that holds the results of a query on the `orders_q3_2024` table.
* Returns the results of the query.

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_async_child_jobs_insert(
  arg1 INT,
  arg2 NUMBER(12,2),
  arg3 INT,
  arg4 NUMBER(12,2))
RETURNS TABLE()
LANGUAGE SQL
AS
  BEGIN
   CREATE TABLE IF NOT EXISTS orders_q3_2024 (
      order_id INT,
      order_amount NUMBER(12,2));
    LET insert_1 RESULTSET := ASYNC (INSERT INTO orders_q3_2024 SELECT :arg1, :arg2);
    LET insert_2 RESULTSET := ASYNC (INSERT INTO orders_q3_2024 SELECT :arg3, :arg4);
    AWAIT insert_1;
    AWAIT insert_2;
    LET res RESULTSET := (SELECT * FROM orders_q3_2024 ORDER BY order_id);
    RETURN TABLE(res);
  END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_async_child_jobs_insert(
  arg1 INT,
  arg2 NUMBER(12,2),
  arg3 INT,
  arg4 NUMBER(12,2))
RETURNS TABLE()
LANGUAGE SQL
AS
$$
  BEGIN
   CREATE TABLE IF NOT EXISTS orders_q3_2024 (
      order_id INT,
      order_amount NUMBER(12,2));
    LET insert_1 RESULTSET := ASYNC (INSERT INTO orders_q3_2024 SELECT :arg1, :arg2);
    LET insert_2 RESULTSET := ASYNC (INSERT INTO orders_q3_2024 SELECT :arg3, :arg4);
    AWAIT insert_1;
    AWAIT insert_2;
    LET res RESULTSET := (SELECT * FROM orders_q3_2024 ORDER BY order_id);
    RETURN TABLE(res);
  END;
$$;
```

Call the stored procedure:

```sqlexample
CALL test_sp_async_child_jobs_insert(1, 325, 2, 241);
```

```output
+----------+--------------+
| ORDER_ID | ORDER_AMOUNT |
|----------+--------------|
|        1 |       325.00 |
|        2 |       241.00 |
+----------+--------------+
```

### Example: Running child jobs in stored procedures with AWAIT ALL statements

The following examples use the ASYNC keyword to run multiple child jobs concurrently in stored
procedures. The examples specify the ASYNC keyword for statements that aren’t associated with a
RESULTSET, then use the AWAIT ALL statement so that the stored procedure code waits for all of the
asynchronous child jobs to complete.

* Create a stored procedure that inserts values concurrently
* Create a stored procedure that updates values concurrently
* Create a stored procedure that calls other stored procedures concurrently

#### Create a stored procedure that inserts values concurrently

The following stored procedure uses the ASYNC keyword to run multiple child jobs that insert rows
into a table concurrently. The example specifies the ASYNC keyword for the INSERT statements. The
example also uses the AWAIT ALL statement so that the stored procedure waits for all of the
asynchronous child jobs to complete.

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_inserts()
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  CREATE OR REPLACE TABLE test_child_job_queries1 (col1 INT);
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(1));
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(2));
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(3));
  AWAIT ALL;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_inserts()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  CREATE OR REPLACE TABLE test_child_job_queries1 (col1 INT);
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(1));
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(2));
  ASYNC (INSERT INTO test_child_job_queries1(col1) VALUES(3));
  AWAIT ALL;
END;
$$
;
```

#### Create a stored procedure that updates values concurrently

The following stored procedure uses the ASYNC keyword to run multiple child jobs that update rows
in a table concurrently. The example specifies the ASYNC keyword for the UPDATE statements. The
example also uses the AWAIT ALL statement so that the stored procedure waits for all of the
asynchronous child jobs to complete.

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_child_job_queries2 (id INT, cola INT);

INSERT INTO test_child_job_queries2 VALUES
  (1, 100), (2, 101), (3, 102);
```

Create the stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_updates()
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  ASYNC (UPDATE test_child_job_queries2 SET cola=200 WHERE id=1);
  ASYNC (UPDATE test_child_job_queries2 SET cola=201 WHERE id=2);
  ASYNC (UPDATE test_child_job_queries2 SET cola=202 WHERE id=3);
  AWAIT ALL;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_updates()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  ASYNC (UPDATE test_child_job_queries2 SET cola=200 WHERE id=1);
  ASYNC (UPDATE test_child_job_queries2 SET cola=201 WHERE id=2);
  ASYNC (UPDATE test_child_job_queries2 SET cola=202 WHERE id=3);
  AWAIT ALL;
END;
$$
;
```

#### Create a stored procedure that calls other stored procedures concurrently

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_calls()
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  ASYNC (CALL test_async_child_job_inserts());
  ASYNC (CALL test_async_child_job_updates());
  AWAIT ALL;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_async_child_job_calls()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  ASYNC (CALL test_async_child_job_inserts());
  ASYNC (CALL test_async_child_job_updates());
  AWAIT ALL;
END;
$$
;
```

Call the `test_async_child_job_calls` stored procedure:

```sqlexample
CALL test_async_child_job_calls();
```

Query the tables to see the results:

```sqlexample
SELECT col1 FROM test_child_job_queries1 ORDER BY col1;
```

```output
+------+
| COL1 |
|------|
|    1 |
|    2 |
|    3 |
+------+
```

```sqlexample
SELECT * FROM test_child_job_queries2 ORDER BY id;
```

```output
+----+------+
| ID | COLA |
|----+------|
|  1 |  200 |
|  2 |  201 |
|  3 |  202 |
+----+------+
```

### Example: Running child jobs for inserts in a loop

The following code shows how to use the ASYNC keyword in a loop to run multiple child jobs that insert
rows into a table concurrently.

This example uses the data in the following tables:

```sqlexample
CREATE OR REPLACE TABLE async_loop_test1(col1 VARCHAR, col2 INT);

INSERT INTO async_loop_test1 VALUES
  ('child', 0),
  ('job', 1),
  ('loop', 2),
  ('test', 3);

CREATE OR REPLACE TABLE async_loop_test2(col1 INT, col2 VARCHAR);
```

Create a stored procedure that inserts values from `async_loop_test1`, concatenated with the text
`async_` into `async_loop_test2` using asynchronous child jobs in a FOR loop. The loop creates a
separate asynchronous child job on each iteration. The AWAIT ALL statement blocks progress in the
stored procedure until all of the child jobs are done.

```sqlexample
CREATE OR REPLACE PROCEDURE async_insert()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
begin
  LET res RESULTSET := (SELECT * FROM async_loop_test1 ORDER BY 1);

  FOR record IN res DO
    LET v VARCHAR := record.col1;
    LET x INT := record.col2;
      ASYNC (INSERT INTO async_loop_test2(col1, col2) VALUES (:x, (SELECT 'async_' || :v)));
    END FOR;

    AWAIT ALL;
    RETURN 'Success';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE async_insert()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
begin
  LET res RESULTSET := (SELECT * FROM async_loop_test1 ORDER BY 1);

  FOR record IN res DO
    LET v VARCHAR := record.col1;
    LET x INT := record.col2;
      ASYNC (INSERT INTO async_loop_test2(col1, col2) VALUES (:x, (SELECT 'async_' || :v)));
    END FOR;

    AWAIT ALL;
    RETURN 'Success';
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL async_insert();
```

```output
+--------------+
| ASYNC_INSERT |
|--------------|
| Success      |
+--------------+
```

Query the `async_loop_test2` table to see the results:

```sqlexample
SELECT * FROM async_loop_test2 ORDER BY col1;
```

```output
+------+-------------+
| COL1 | COL2        |
|------+-------------|
|    0 | async_child |
|    1 | async_job   |
|    2 | async_loop  |
|    3 | async_test  |
+------+-------------+
```

---
title: Working with conditional logic
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/branch.md
section: Developer Guide
---

# Working with conditional logic

Snowflake Scripting supports the following branching constructs for conditional logic:

* IF-THEN-ELSEIF-ELSE
* CASE

## IF statements

In Snowflake Scripting, you can execute a set of statements if a condition is met by using an
[IF](../../sql-reference/snowflake-scripting/if.md) statement.

The syntax for the IF statement is:

```sqlsyntax
 IF (<condition>) THEN
   -- Statements to execute if the <condition> is true.

[ ELSEIF ( <condition_2> ) THEN
  -- Statements to execute if the <condition_2> is true.
]

[ ELSE
  -- Statements to execute if none of the conditions are true.
]

  END IF ;
```

In an IF statement:

* If you need to specify additional conditions, add an ELSEIF clause for each condition.
* To specify the statements to execute when none of the conditions evaluate to TRUE, add an ELSE clause.
* The ELSEIF and ELSE clauses are optional.

The following is a simple example of an IF statement:

```sqlexample
BEGIN
  LET count := 1;
  IF (count < 0) THEN
    RETURN 'negative value';
  ELSEIF (count = 0) THEN
    RETURN 'zero';
  ELSE
    RETURN 'positive value';
  END IF;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET count := 1;
  IF (count < 0) THEN
    RETURN 'negative value';
  ELSEIF (count = 0) THEN
    RETURN 'zero';
  ELSE
    RETURN 'positive value';
  END IF;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| positive value  |
+-----------------+
```

For the full syntax and details about IF statements, see [IF (Snowflake Scripting)](../../sql-reference/snowflake-scripting/if.md).

For more examples that use the IF statement, see:

* [Examples for common use cases of Snowflake Scripting](use-cases.md) - Execute SQL statements based on IF conditions in loops.
* [BREAK](../../sql-reference/snowflake-scripting/break.md), [LOOP](../../sql-reference/snowflake-scripting/loop.md),
  and [Working with loops](loops.md) - Execute BREAK statements to terminate a loop based on IF conditions.
* [EXCEPTION](../../sql-reference/snowflake-scripting/exception.md) - Raise exceptions based on IF conditions.

## CASE statements

A CASE statement behaves similarly to an IF statement but provides a simpler way to specify multiple conditions.

Snowflake Scripting supports two forms of the CASE statement:

* Simple CASE statements
* Searched CASE statements

The next sections explain how to use these different forms.

> **Note:**
>
> Snowflake supports other uses of the keyword CASE outside of Snowflake Scripting (e.g. the
> conditional expression [CASE](../../sql-reference/functions/case.md)).

### Simple CASE statements

In a simple CASE statement, you define different branches (WHEN clauses) for different possible values of a given expression.

The syntax for the simple CASE statement is:

```sqlsyntax
CASE ( <expression_to_match> )

    WHEN <value_1_of_expression> THEN
        <statement>;
        [ <statement>; ... ]

    [ WHEN <value_2_of_expression> THEN
        <statement>;
        [ <statement>; ... ]
    ]

    ... -- Additional WHEN clauses for other possible values;

    [ ELSE
        <statement>;
        [ <statement>; ... ]
    ]

END [ CASE ] ;
```

Snowflake executes the first branch for which `value_n_of_expression` matches the value of `expression_to_match`.

For example, suppose that you want to execute different statements, based on the value of the `expression_to_evaluate` variable.
For each possible value of this variable (e.g. `value a`, `value b`, etc.), you can define a WHEN clause that
specifies the statement(s) to execute:

```sqlexample
DECLARE
  expression_to_evaluate VARCHAR DEFAULT 'default value';
BEGIN
  expression_to_evaluate := 'value a';
  CASE (expression_to_evaluate)
    WHEN 'value a' THEN
      RETURN 'x';
    WHEN 'value b' THEN
      RETURN 'y';
    WHEN 'value c' THEN
      RETURN 'z';
    WHEN 'default value' THEN
      RETURN 'default';
    ELSE
      RETURN 'other';
  END;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  expression_to_evaluate VARCHAR DEFAULT 'default value';
BEGIN
  expression_to_evaluate := 'value a';
  CASE (expression_to_evaluate)
    WHEN 'value a' THEN
      RETURN 'x';
    WHEN 'value b' THEN
      RETURN 'y';
    WHEN 'value c' THEN
      RETURN 'z';
    WHEN 'default value' THEN
      RETURN 'default';
    ELSE
      RETURN 'other';
  END;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| x               |
+-----------------+
```

For the full syntax and details about CASE statements, see [CASE (Snowflake Scripting)](../../sql-reference/snowflake-scripting/case.md).

### Searched CASE statements

In the searched CASE statement, you specify different conditions for each branch (WHEN clause). Snowflake
executes the first branch for which the expression evaluates to TRUE.

The syntax for the searched CASE statement is:

```sqlsyntax
CASE

  WHEN <condition_1> THEN
    <statement>;
    [ <statement>; ... ]

  [ WHEN <condition_2> THEN
    <statement>;
    [ <statement>; ... ]
  ]

  ... -- Additional WHEN clauses for other possible conditions;

  [ ELSE
    <statement>;
    [ <statement>; ... ]
  ]

END [ CASE ] ;
```

For example, when you execute the following CASE statement, the returned value is `a is x` because that branch is the first
branch in which the expression evaluates to TRUE:

```sqlexample
DECLARE
  a VARCHAR DEFAULT 'x';
  b VARCHAR DEFAULT 'y';
  c VARCHAR DEFAULT 'z';
BEGIN
  CASE
    WHEN a = 'x' THEN
      RETURN 'a is x';
    WHEN b = 'y' THEN
      RETURN 'b is y';
    WHEN c = 'z' THEN
      RETURN 'c is z';
    ELSE
      RETURN 'a is not x, b is not y, and c is not z';
  END;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  a VARCHAR DEFAULT 'x';
  b VARCHAR DEFAULT 'y';
  c VARCHAR DEFAULT 'z';
BEGIN
  CASE
    WHEN a = 'x' THEN
      RETURN 'a is x';
    WHEN b = 'y' THEN
      RETURN 'b is y';
    WHEN c = 'z' THEN
      RETURN 'c is z';
    ELSE
      RETURN 'a is not x, b is not y, and c is not z';
  END;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| a is x          |
+-----------------+
```

For the full syntax and details about CASE statements, see [CASE (Snowflake Scripting)](../../sql-reference/snowflake-scripting/case.md).

---
title: Working with cursors
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/cursors.md
section: Developer Guide
---

# Working with cursors

You can use a cursor to iterate through query results one row at a time.

## Introduction

To retrieve data from the results of a query, you can use a cursor. To iterate over the rows in the results,
you can use a cursor in [loops](loops.md).

To use a cursor, do the following:

1. In the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section,
   declare the cursor. The declaration includes the query for the cursor.
2. Before you use the cursor for the first time, execute the [OPEN](../../sql-reference/snowflake-scripting/open.md) command to
   open the cursor. This executes the query and loads the results into the cursor.
3. Execute the [FETCH](../../sql-reference/snowflake-scripting/fetch.md) command to
   fetch one or more rows and process those rows.
4. When you are done with the results or the cursor is no longer needed, execute the [CLOSE](../../sql-reference/snowflake-scripting/close.md)
   command to close the cursor.

> **Note:**
>
> You can also use a RESULTSET to retrieve the results of a query when you use Snowflake Scripting. For information
> about the differences between a cursor and a RESULTSET, see [Understanding the differences between a cursor and a RESULTSET](resultsets.md).

## Setting up the data for the examples

The examples in this section uses the following data:

```sqlexample
CREATE OR REPLACE TABLE invoices (id INTEGER, price NUMBER(12, 2));

INSERT INTO invoices (id, price) VALUES
  (1, 11.11),
  (2, 22.22);
```

## Declaring a cursor

You can declare a cursor for a SELECT statement or a [RESULTSET](resultsets.md).

You declare a cursor in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section of a block or in the
[BEGIN … END](../../sql-reference/snowflake-scripting/begin.md) section of the block:

* Within the DECLARE section, use the syntax described in [Cursor declaration syntax](../../sql-reference/snowflake-scripting/declare.md).

  For example, to declare a cursor for a query:

  ```sqlexample
  DECLARE
    ...
    c1 CURSOR FOR SELECT price FROM invoices;
  ```

  To declare a cursor for a RESULTSET:

  ```sqlexample
  DECLARE
    ...
    res RESULTSET DEFAULT (SELECT price FROM invoices);
    c1 CURSOR FOR res;
  ```
* Within the BEGIN … END block, use the syntax described in [Cursor assignment syntax](../../sql-reference/snowflake-scripting/let.md). For example:

  ```sqlexample
  BEGIN
    ...
    LET c1 CURSOR FOR SELECT price FROM invoices;
  ```

In the SELECT statement, you can specify bind parameters (`?` characters) that you can bind to variables when opening the
cursor. To bind variables to the parameters, specify the variables in the USING clause of the OPEN command. For example:

```sqlexample
DECLARE
  id INTEGER DEFAULT 0;
  minimum_price NUMBER(13,2) DEFAULT 22.00;
  maximum_price NUMBER(13,2) DEFAULT 33.00;
  c1 CURSOR FOR SELECT id FROM invoices WHERE price > ? AND price < ?;
BEGIN
  OPEN c1 USING (minimum_price, maximum_price);
  FETCH c1 INTO id;
  RETURN id;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  id INTEGER DEFAULT 0;
  minimum_price NUMBER(13,2) DEFAULT 22.00;
  maximum_price NUMBER(13,2) DEFAULT 33.00;
  c1 CURSOR FOR SELECT id FROM invoices WHERE price > ? AND price < ?;
BEGIN
  OPEN c1 USING (minimum_price, maximum_price);
  FETCH c1 INTO id;
  RETURN id;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               2 |
+-----------------+
```

## Opening a cursor

Although the statement that declares a cursor defines the query associated with that cursor, the query is not executed until you
open the cursor by executing the [OPEN](../../sql-reference/snowflake-scripting/open.md) command. For example:

```sqlexample
OPEN c1;
```

> **Note:**
>
> * When using a cursor in a [FOR](loops.md) loop, you do not need to open the cursor explicitly.
> * If you declare a cursor for a RESULTSET object, the query is executed when you associate the object with the query. In this
>   case, opening the cursor does not cause the query to be executed again.

If your query contains any bind parameters (`?` characters), add a USING clause to specify the list of variables to bind
to those parameters. For example:

```sqlexample
LET c1 CURSOR FOR SELECT id FROM invoices WHERE price > ? AND price < ?;
OPEN c1 USING (minimum_price, maximum_price);
```

Opening the cursor executes the query, retrieves the specified rows into the cursor, and sets up an internal pointer that points
to the first row. You can use the FETCH command to
fetch (read) individual rows using the cursor.

As with any SQL query, if the query definition does not contain an ORDER BY at the outermost level, then the result set has no
defined order. When the result set for the cursor is created, the order of the rows is persistent until the cursor is closed.
If you declare or open the cursor again, the rows might be in a different order. Similarly, if you close the cursor and
the underlying table is updated before you open the cursor again, the result set can change.

## Using a cursor to fetch data

Use the [FETCH](../../sql-reference/snowflake-scripting/fetch.md) command to retrieve the current row from the result set and
advance the internal current row pointer to point to the next row in the result set.

In the INTO clause, specify the variables that hold the values from the row.

For example:

```sqlexample
FETCH c1 INTO var_for_column_value;
```

If the number of variables does not match the number of expressions in the SELECT clause of the cursor declaration, Snowflake
attempts to match the variables with the columns by position:

* If there are more variables than columns, Snowflake leaves the remaining variables unset.
* If there are more columns than variables, Snowflake ignores the remaining columns.

Each subsequent FETCH command that you execute gets the next row until the last row has been fetched. If you try to FETCH
a row after the last row, you get NULL values.

A RESULTSET or cursor does not necessarily cache all the rows of the result set at the time that the query is executed. FETCH operations can experience latency.

## Using a cursor to retrieve a GEOGRAPHY value

If the results include a column of the type GEOGRAPHY, the type of the value in the column is OBJECT, not GEOGRAPHY. This means
that you cannot directly pass this value to [geospatial functions](../../sql-reference/functions-geospatial.md) that accept a
GEOGRAPHY object as input:

```sqlexample
DECLARE
  geohash_value VARCHAR;
BEGIN
  LET res RESULTSET := (SELECT TO_GEOGRAPHY('POINT(1 1)') AS GEOGRAPHY_VALUE);
  LET cur CURSOR FOR res;
  FOR row_variable IN cur DO
    geohash_value := ST_GEOHASH(row_variable.geography_value);
  END FOR;
  RETURN geohash_value;
END;
```

```none
001044 (42P13): Uncaught exception of type 'EXPRESSION_ERROR' on line 7 at position 21 : SQL compilation error: ...
Invalid argument types for function 'ST_GEOHASH': (OBJECT)
```

To work around this, cast the column value to the GEOGRAPHY type:

```sqlexample
geohash_value := ST_GEOHASH(TO_GEOGRAPHY(row_variable.geography_value));
```

## Returning a table for a cursor

If you need to return a table of data from a cursor, you can pass the cursor to `RESULTSET_FROM_CURSOR(cursor)`, which in
turn you can pass to `TABLE(...)`.

The following block returns a table of data from a cursor:

```sqlexample
DECLARE
  c1 CURSOR FOR SELECT * FROM invoices;
BEGIN
  OPEN c1;
  RETURN TABLE(RESULTSET_FROM_CURSOR(c1));
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  c1 CURSOR FOR SELECT * FROM invoices;
BEGIN
  OPEN c1;
  RETURN TABLE(RESULTSET_FROM_CURSOR(c1));
END;
$$
;
```

```output
+----+-------+
| ID | PRICE |
|----+-------|
|  1 | 11.11 |
|  2 | 22.22 |
+----+-------+
```

Even if you have already used the cursor to fetch rows,
`RESULTSET_FROM_CURSOR` still returns a RESULTSET containing all of the rows, not just the rows starting from the internal
row pointer.

As shown above, the example fetches the first row and sets the internal row pointer to the second row.
`RESULTSET_FROM_CURSOR` returns a RESULTSET containing both rows (not just the second row).

## Closing a cursor

When you are done with the result set, close the cursor by executing the [CLOSE](../../sql-reference/snowflake-scripting/close.md)
command. For example:

```sqlexample
CLOSE c1;
```

> **Note:**
>
> When using a cursor in a [FOR](loops.md) loop, you do not need to close the cursor explicitly.

You cannot execute the FETCH command on a cursor that has been closed.

In addition, after you close a cursor, the current row pointer becomes invalid. If you open the cursor again, the pointer points
to the first row in the new result set.

## Example of using a cursor

This example uses data that you set up in Setting up the data for the examples.

Here is an anonymous block that uses a cursor to read two rows and sum the prices in those rows:

```sqlexample
DECLARE
  row_price FLOAT;
  total_price FLOAT;
  c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
  row_price := 0.0;
  total_price := 0.0;
  OPEN c1;
  FETCH c1 INTO row_price;
  total_price := total_price + row_price;
  FETCH c1 INTO row_price;
  total_price := total_price + row_price;
  CLOSE c1;
  RETURN total_price;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
    row_price FLOAT;
    total_price FLOAT;
    c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
    row_price := 0.0;
    total_price := 0.0;
    OPEN c1;
    FETCH c1 INTO row_price;
    total_price := total_price + row_price;
    FETCH c1 INTO row_price;
    total_price := total_price + row_price;
    CLOSE c1;
    RETURN total_price;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|           33.33 |
+-----------------+
```

You can achieve the same result by using a cursor with a [FOR loop](loops.md):

```sqlexample
DECLARE
  total_price FLOAT;
  c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
  total_price := 0.0;
  FOR record IN c1 DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  total_price FLOAT;
  c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
  total_price := 0.0;
  FOR record IN c1 DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|           33.33 |
+-----------------+
```

## Troubleshooting problems with cursors

The following section describes common problems with cursors and identifies a possible cause and solution in each case.

### Symptom: Cursor retrieves every second row rather than every row

* **Possible cause:** You might have executed FETCH inside a FOR `<record>` IN `<cursor>` loop. A FOR loop over a cursor
  automatically fetches the next row. If you do another fetch inside the loop, you get every second row.
* **Possible solution:** Remove any unneeded FETCH commands inside a FOR loop.

### Symptom: FETCH command retrieves unexpected NULL values

* **Possible cause:** You might have executed FETCH inside a FOR `<record>` IN `<cursor>` loop. A FOR loop over a cursor
  automatically fetches the next row. If you do another fetch inside the loop, you get every second row. If
  there is an odd number of rows, the last fetch will try to fetch a row beyond the last row, and the
  values will be NULL.
* **Possible solution:** Remove any unneeded FETCH commands inside a FOR loop.

---
title: Working with event tables
source: https://docs.snowflake.com/en/developer-guide/logging-tracing/event-table-operations.md
section: Developer Guide
---

# Working with event tables

You can perform a subset of table operations on an event table you create, which is specifically designed for capturing events. The
sections in this topic describe the operations an event table supports.

> **Note:**
>
> You can perform only a subset of the operations listed here on the default event table, as noted in this topic.

## Operations supported on an event table

An event table is designed specifically for capturing events. You cannot perform some of the operations on an event table that you can
perform on a regular table.

With an event table, you can perform the following operations (note exceptions for the default event table):

| Operation | Default event table support | User-created event table support |
| --- | --- | --- |
| [SHOW EVENT TABLES](../../sql-reference/sql/show-event-tables.md) | ✔ | ✔ |
| [DESCRIBE EVENT TABLE](../../sql-reference/sql/desc-event-table.md) | ✔ | ✔ |
| [SELECT](../../sql-reference/sql/select.md) | ✔ | ✔ |
| [DROP TABLE](../../sql-reference/sql/drop-table.md) |  | ✔ |
| [UNDROP TABLE](../../sql-reference/sql/undrop-table.md) |  | ✔ |
| [CREATE TABLE](../../sql-reference/sql/create-table.md) |  | ✔ |
| [TRUNCATE TABLE](../../sql-reference/sql/truncate-table.md) | ✔ | ✔ |
| [DELETE](../../sql-reference/sql/delete.md) | ✔ | ✔ |
| [ALTER TABLE (event tables)](../../sql-reference/sql/alter-table-event-table.md) | ✔ (rename is not supported) | ✔ (rename is not supported) |

## Deleting rows from an event table

If you need to delete rows from an event table, you can use the following commands:

* Use [TRUNCATE TABLE](../../sql-reference/sql/truncate-table.md) to remove all rows from the event table.
* Use [DELETE](../../sql-reference/sql/delete.md) to remove selected rows from the event table.

  You can use this if you need to implement more complex log retention policies (e.g. if you need to retain logs for some functions for a
  longer period of time than other functions).

## Parameters for event tables

You can use the following parameters to specify how the event table should be used by handler code.

EVENT_TABLE
:   Specifies the name of the event table for logging messages from stored procedures and UDFs in this account. For reference information,
    see [EVENT_TABLE](../../sql-reference/parameters.md).

LOG_LEVEL
:   Specifies the severity level of log messages produced through logging APIs that should be ingested and made available in the active event
    table. Log messages at the specified level (and at more severe levels) are ingested. For more information, see [LOG_LEVEL](../../sql-reference/parameters.md) and
    [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

LOG_EVENT_LEVEL
:   Specifies the severity level of log events (rows with record type EVENT) that should be ingested and made available in the active event
    table. Log events at the specified level (and at more severe levels) are ingested. For more information, see [LOG_EVENT_LEVEL](../../sql-reference/parameters.md) and
    [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

METRIC_LEVEL
:   Specifies whether metrics data should be ingested and made available in the active event table. For more information, see
    [METRIC_LEVEL](../../sql-reference/parameters.md) and [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

TRACE_LEVEL
:   Specifies the verbosity of trace events that should be ingested and made available in the active event table. Events at the specified
    level are ingested. For more information, see [TRACE_LEVEL](../../sql-reference/parameters.md) and
    [Setting levels for logging, metrics, and tracing](telemetry-levels.md).

## Access control privileges for event tables

You can use privileges in the global and event table scope to manage access to operations on an event table.

For more information, see [Event table privileges](../../user-guide/security-access-control-privileges.md) and log level privileges in [Global privileges (account privileges)](../../user-guide/security-access-control-privileges.md).

## Managing access to event table data

When it’s impractical for you to make event table data available to a range of users and roles, you can create views
for access by users with specific roles.

When you want to manage access to the data in this table, you can create views on the event table, then grant access for each view to
separate roles. Through the view, a role might have access to specified subset of the data in the event table.

For more information about creating views, see [CREATE VIEW](../../sql-reference/sql/create-view.md).

## Using streams to track changes to event tables

You can create a stream on an event table, such as to capture changes to the table.

For more information about streams, see [Introduction to streams](../../user-guide/streams-intro.md) and [CREATE STREAM](../../sql-reference/sql/create-stream.md).

Code in the following example creates a stream to capture inserts on the event table `my_event_table`.

```sqlexample
CREATE STREAM append_only_comparison ON EVENT TABLE my_event_table APPEND_ONLY=TRUE;
```

---
title: Working with loops
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/loops.md
section: Developer Guide
---

# Working with loops

Snowflake Scripting supports the following types of loops:

* FOR
* WHILE
* REPEAT
* LOOP

This topic explains how to use each of these types of loops.

## FOR loop

A [FOR](../../sql-reference/snowflake-scripting/for.md) loop repeats a sequence of steps for a specified number of times or for each
row in a result set. Snowflake Scripting supports the following types of FOR loops:

* Counter-based FOR loops
* Cursor-based FOR loops
* RESULTSET-based FOR loops

The next sections explain how to use these types of FOR loops.

### Counter-based FOR loops

A counter-based FOR loop executes a specified number of times.

Use the following syntax for a counter-based FOR loop:

```sqlsyntax
FOR <counter_variable> IN [ REVERSE ] <start> TO <end> { DO | LOOP }
  <statement>;
  [ <statement>; ... ]
END { FOR | LOOP } [ <label> ] ;
```

For example, the following FOR loop executes five times:

```sqlexample
DECLARE
  counter INTEGER DEFAULT 0;
  maximum_count INTEGER default 5;
BEGIN
  FOR i IN 1 TO maximum_count DO
    counter := counter + 1;
  END FOR;
  RETURN counter;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  counter INTEGER DEFAULT 0;
  maximum_count INTEGER default 5;
BEGIN
  FOR i IN 1 TO maximum_count DO
    counter := counter + 1;
  END FOR;
  RETURN counter;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               5 |
+-----------------+
```

You can include SQL statements inside Snowflake Scripting loops. For example, the following FOR loop executes an INSERT
statement five times to insert the value of the counter into a table:

```sqlexample
DECLARE
  counter INTEGER DEFAULT 0;
  maximum_count INTEGER default 5;
BEGIN
  CREATE OR REPLACE TABLE test_for_loop_insert(i INTEGER);
  FOR i IN 1 TO maximum_count DO
    INSERT INTO test_for_loop_insert VALUES (:i);
    counter := counter + 1;
  END FOR;
  RETURN counter || ' rows inserted';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  counter INTEGER DEFAULT 0;
  maximum_count INTEGER default 5;
BEGIN
  CREATE OR REPLACE TABLE test_for_loop_insert(i INTEGER);
  FOR i IN 1 TO maximum_count DO
    INSERT INTO test_for_loop_insert VALUES (:i);
    counter := counter + 1;
  END FOR;
  RETURN counter || ' rows inserted';
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| 5 rows inserted |
+-----------------+
```

Query the table to view the inserted rows:

```sqlexample
SELECT * FROM test_for_loop_insert;
```

```output
+---+
| I |
|---|
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
+---+
```

The following example uses a counter-based FOR loop to populate a date dimension table, which is a common task when setting up
a data warehouse. The loop iterates over a range of days and inserts a row for each date with computed attributes:

```sqlexample
DECLARE
  start_date DATE DEFAULT '2025-01-01';
  current_date_val DATE;
BEGIN
  CREATE OR REPLACE TABLE date_dimension (
    date_key INTEGER,
    full_date DATE,
    day_of_week VARCHAR,
    month_name VARCHAR,
    quarter INTEGER,
    year INTEGER
  );
  FOR i IN 1 TO 7 DO
    current_date_val := DATEADD('day', :i - 1, :start_date);
    INSERT INTO date_dimension
      SELECT :i, :current_date_val, DAYNAME(:current_date_val),
        MONTHNAME(:current_date_val), QUARTER(:current_date_val),
        YEAR(:current_date_val);
  END FOR;
  RETURN 'Populated date dimension with 7 rows';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  start_date DATE DEFAULT '2025-01-01';
  current_date_val DATE;
BEGIN
  CREATE OR REPLACE TABLE date_dimension (
    date_key INTEGER,
    full_date DATE,
    day_of_week VARCHAR,
    month_name VARCHAR,
    quarter INTEGER,
    year INTEGER
  );
  FOR i IN 1 TO 7 DO
    current_date_val := DATEADD('day', :i - 1, :start_date);
    INSERT INTO date_dimension
      SELECT :i, :current_date_val, DAYNAME(:current_date_val),
        MONTHNAME(:current_date_val), QUARTER(:current_date_val),
        YEAR(:current_date_val);
  END FOR;
  RETURN 'Populated date dimension with 7 rows';
END;
$$
;
```

```output
+----------------------------------------+
| anonymous block                        |
|----------------------------------------|
| Populated date dimension with 7 rows   |
+----------------------------------------+
```

To verify the results, query the table:

```sqlexample
SELECT * FROM date_dimension ORDER BY date_key;
```

```output
+----------+------------+-------------+------------+---------+------+
| DATE_KEY | FULL_DATE  | DAY_OF_WEEK | MONTH_NAME | QUARTER | YEAR |
|----------+------------+-------------+------------+---------+------|
|        1 | 2025-01-01 | Wed         | Jan        |       1 | 2025 |
|        2 | 2025-01-02 | Thu         | Jan        |       1 | 2025 |
|        3 | 2025-01-03 | Fri         | Jan        |       1 | 2025 |
|        4 | 2025-01-04 | Sat         | Jan        |       1 | 2025 |
|        5 | 2025-01-05 | Sun         | Jan        |       1 | 2025 |
|        6 | 2025-01-06 | Mon         | Jan        |       1 | 2025 |
|        7 | 2025-01-07 | Tue         | Jan        |       1 | 2025 |
+----------+------------+-------------+------------+---------+------+
```

For the full syntax and details about FOR loops, see [FOR (Snowflake Scripting)](../../sql-reference/snowflake-scripting/for.md).

### Cursor-based FOR loops

A cursor-based FOR loop iterates over a result set. The number of iterations is determined by the number of
rows in the [cursor](cursors.md).

The syntax for a cursor-based FOR loop is:

```sqlsyntax
FOR <row_variable> IN <cursor_name> DO
  <statement>;
  [ <statement>; ... ]
END FOR [ <label> ] ;
```

The first example in this section uses the data in the following `invoices` table:

```sqlexample
CREATE OR REPLACE TABLE invoices (price NUMBER(12, 2));

INSERT INTO invoices (price) VALUES
  (11.11),
  (22.22);
```

The following example uses a FOR loop to iterate over the rows in a cursor for the `invoices` table:

```sqlexample
DECLARE
  total_price FLOAT;
  c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
  total_price := 0.0;
  FOR record IN c1 DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  total_price FLOAT;
  c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
  total_price := 0.0;
  FOR record IN c1 DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|           33.33 |
+-----------------+
```

The following example uses a cursor-based FOR loop to iterate over a table of employees, give each employee a raise based on
their department, and insert an audit record for each update:

```sqlexample
CREATE OR REPLACE TABLE loop_test_employees (
  emp_id INTEGER,
  name VARCHAR,
  department VARCHAR,
  salary NUMBER(12,2));

INSERT INTO loop_test_employees VALUES
  (1, 'Alice', 'Engineering', 90000),
  (2, 'Bob', 'Sales', 70000),
  (3, 'Carol', 'Engineering', 95000),
  (4, 'Dave', 'Sales', 72000);

CREATE OR REPLACE TABLE salary_audit (
  emp_id INTEGER,
  old_salary NUMBER(12,2),
  new_salary NUMBER(12,2),
  updated_on TIMESTAMP);
```

```sqlexample
DECLARE
  rows_updated INTEGER DEFAULT 0;
  raise_pct INTEGER;
  new_salary NUMBER(12,2);
  cur_emp_id INTEGER;
  cur_salary NUMBER(12,2);
  c1 CURSOR FOR SELECT emp_id, department, salary FROM loop_test_employees;
BEGIN
  FOR record IN c1 DO
    cur_emp_id := record.emp_id;
    cur_salary := record.salary;
    IF (record.department = 'Engineering') THEN
      raise_pct := 10;
    ELSE
      raise_pct := 5;
    END IF;
    new_salary := :cur_salary * (1 + :raise_pct / 100);
    UPDATE loop_test_employees SET salary = :new_salary WHERE emp_id = :cur_emp_id;
    INSERT INTO salary_audit
      SELECT :cur_emp_id, :cur_salary, :new_salary, CURRENT_TIMESTAMP();
    rows_updated := rows_updated + 1;
  END FOR;
  RETURN rows_updated || ' employees updated';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  rows_updated INTEGER DEFAULT 0;
  raise_pct INTEGER;
  new_salary NUMBER(12,2);
  cur_emp_id INTEGER;
  cur_salary NUMBER(12,2);
  c1 CURSOR FOR SELECT emp_id, department, salary FROM loop_test_employees;
BEGIN
  FOR record IN c1 DO
    cur_emp_id := record.emp_id;
    cur_salary := record.salary;
    IF (record.department = 'Engineering') THEN
      raise_pct := 10;
    ELSE
      raise_pct := 5;
    END IF;
    new_salary := :cur_salary * (1 + :raise_pct / 100);
    UPDATE loop_test_employees SET salary = :new_salary WHERE emp_id = :cur_emp_id;
    INSERT INTO salary_audit
      SELECT :cur_emp_id, :cur_salary, :new_salary, CURRENT_TIMESTAMP();
    rows_updated := rows_updated + 1;
  END FOR;
  RETURN rows_updated || ' employees updated';
END;
$$
;
```

```output
+----------------------+
| anonymous block      |
|----------------------|
| 4 employees updated  |
+----------------------+
```

To verify the updates, query the tables:

```sqlexample
SELECT emp_id, name, department, salary FROM loop_test_employees ORDER BY emp_id;
```

```output
+--------+-------+-------------+-----------+
| EMP_ID | NAME  | DEPARTMENT  |    SALARY |
|--------+-------+-------------+-----------|
|      1 | Alice | Engineering |  99000.00 |
|      2 | Bob   | Sales       |  73500.00 |
|      3 | Carol | Engineering | 104500.00 |
|      4 | Dave  | Sales       |  75600.00 |
+--------+-------+-------------+-----------+
```

```sqlexample
SELECT emp_id, old_salary, new_salary FROM salary_audit ORDER BY emp_id;
```

```output
+--------+------------+------------+
| EMP_ID | OLD_SALARY | NEW_SALARY |
|--------+------------+------------|
|      1 |   90000.00 |   99000.00 |
|      2 |   70000.00 |   73500.00 |
|      3 |   95000.00 |  104500.00 |
|      4 |   72000.00 |   75600.00 |
+--------+------------+------------+
```

For the full syntax and details about FOR loops, see [FOR (Snowflake Scripting)](../../sql-reference/snowflake-scripting/for.md).

### RESULTSET-based FOR loops

A RESULTSET-based FOR loop iterates over a result set. The number of iterations is determined by the number of
rows returned by the [RESULTSET](resultsets.md) query.

The syntax for a RESULTSET-based FOR loop is:

```sqlsyntax
FOR <row_variable> IN <RESULTSET_name> DO
  <statement>;
  [ <statement>; ... ]
END FOR [ <label> ] ;
```

The first example in this section uses the data in the following `invoices` table:

```sqlexample
CREATE OR REPLACE TABLE invoices (price NUMBER(12, 2));
INSERT INTO invoices (price) VALUES
  (11.11),
  (22.22);
```

The following example uses a FOR loop to iterate over the rows in a RESULTSET for the `invoices` table:

```sqlexample
DECLARE
  total_price FLOAT;
  rs RESULTSET;
BEGIN
  total_price := 0.0;
  rs := (SELECT price FROM invoices);
  FOR record IN rs DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  total_price FLOAT;
  rs RESULTSET;
BEGIN
  total_price := 0.0;
  rs := (SELECT price FROM invoices);
  FOR record IN rs DO
    total_price := total_price + record.price;
  END FOR;
  RETURN total_price;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|           33.33 |
+-----------------+
```

The following example uses a RESULTSET-based FOR loop to validate customer records. It checks each record for required contact
information and updates the `status` column to mark each record as verified or incomplete. This type of data-quality check
is a common step in extract, transform, and load (ETL) pipelines:

```sqlexample
CREATE OR REPLACE TABLE loop_test_customers (
  customer_id INTEGER,
  customer_name VARCHAR,
  customer_email VARCHAR,
  customer_phone VARCHAR,
  status VARCHAR DEFAULT 'pending_review');

INSERT INTO loop_test_customers (customer_id, customer_name, customer_email, customer_phone) VALUES
  (1, 'Alice Smith', 'alice@example.com', '800-555-0101'),
  (2, 'Bob Jones', NULL, '800-555-0102'),
  (3, 'Carol White', 'carol@example.com', NULL),
  (4, 'Dave Brown', NULL, NULL),
  (5, 'Eve Davis', 'eve@example.com', '800-555-0105');
```

```sqlexample
DECLARE
  rs RESULTSET;
  valid_count INTEGER DEFAULT 0;
  invalid_count INTEGER DEFAULT 0;
  cur_customer_id INTEGER;
BEGIN
  rs := (SELECT customer_id, customer_email, customer_phone FROM loop_test_customers WHERE status = 'pending_review');
  FOR record IN rs DO
    cur_customer_id := record.customer_id;
    IF (record.customer_email IS NOT NULL AND record.customer_phone IS NOT NULL) THEN
      UPDATE loop_test_customers SET status = 'verified' WHERE customer_id = :cur_customer_id;
      valid_count := valid_count + 1;
    ELSE
      UPDATE loop_test_customers SET status = 'incomplete' WHERE customer_id = :cur_customer_id;
      invalid_count := invalid_count + 1;
    END IF;
  END FOR;
  RETURN 'Verified: ' || valid_count || ', Incomplete: ' || invalid_count;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  rs RESULTSET;
  valid_count INTEGER DEFAULT 0;
  invalid_count INTEGER DEFAULT 0;
  cur_customer_id INTEGER;
BEGIN
  rs := (SELECT customer_id, customer_email, customer_phone FROM loop_test_customers WHERE status = 'pending_review');
  FOR record IN rs DO
    cur_customer_id := record.customer_id;
    IF (record.customer_email IS NOT NULL AND record.customer_phone IS NOT NULL) THEN
      UPDATE loop_test_customers SET status = 'verified' WHERE customer_id = :cur_customer_id;
      valid_count := valid_count + 1;
    ELSE
      UPDATE loop_test_customers SET status = 'incomplete' WHERE customer_id = :cur_customer_id;
      invalid_count := invalid_count + 1;
    END IF;
  END FOR;
  RETURN 'Verified: ' || valid_count || ', Incomplete: ' || invalid_count;
END;
$$
;
```

```output
+-------------------------------+
| anonymous block               |
|-------------------------------|
| Verified: 2, Incomplete: 3   |
+-------------------------------+
```

To verify the changes, query the table:

```sqlexample
SELECT * FROM loop_test_customers ORDER BY customer_id;
```

```output
+-------------+-----------------+-------------------+----------------+------------+
| CUSTOMER_ID | CUSTOMER_NAME   | CUSTOMER_EMAIL    | CUSTOMER_PHONE | STATUS     |
|-------------+-----------------+-------------------+----------------+------------|
|           1 | Alice Smith     | alice@example.com | 800-555-0101   | verified   |
|           2 | Bob Jones       | NULL              | 800-555-0102   | incomplete |
|           3 | Carol White     | carol@example.com | NULL           | incomplete |
|           4 | Dave Brown      | NULL              | NULL           | incomplete |
|           5 | Eve Davis       | eve@example.com   | 800-555-0105   | verified   |
+-------------+-----------------+-------------------+----------------+------------+
```

For the full syntax and details about FOR loops, see [FOR (Snowflake Scripting)](../../sql-reference/snowflake-scripting/for.md).

## WHILE loop

A [WHILE](../../sql-reference/snowflake-scripting/while.md) loop iterates while a condition is true. In a WHILE
loop, the condition is tested immediately before executing the body of the loop. If the condition is false before the first
iteration, then the body of the loop does not execute even once.

The syntax for a WHILE loop is:

```sqlsyntax
WHILE ( <condition> ) { DO | LOOP }
  <statement>;
  [ <statement>; ... ]
END { WHILE | LOOP } [ <label> ] ;
```

For example:

```sqlexample
BEGIN
  LET counter := 0;
  WHILE (counter < 5) DO
    counter := counter + 1;
  END WHILE;
  RETURN counter;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET counter := 0;
  WHILE (counter < 5) DO
    counter := counter + 1;
  END WHILE;
  RETURN counter;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               5 |
+-----------------+
```

The following example uses a WHILE loop to build a daily sales summary from raw transaction data. It processes one date at
a time, aggregating the transactions for that date into a summary row and marking them as loaded. This date-by-date
summarization pattern is common in extract, transform, and load (ETL) pipelines:

```sqlexample
CREATE OR REPLACE TABLE loop_test_raw_transactions (
  txn_id INTEGER,
  amount NUMBER(12,2),
  txn_date DATE,
  loaded BOOLEAN DEFAULT FALSE);

INSERT INTO loop_test_raw_transactions (txn_id, amount, txn_date) VALUES
  (1, 150.00, '2025-03-01'),
  (2, 230.50, '2025-03-01'),
  (3, 89.99, '2025-03-01'),
  (4, 412.00, '2025-03-02'),
  (5, 55.25, '2025-03-03'),
  (6, 178.75, '2025-03-03');

CREATE OR REPLACE TABLE loop_test_daily_sales_summary (
  summary_date DATE,
  total_sales NUMBER(12,2),
  txn_count INTEGER);
```

```sqlexample
DECLARE
  next_date DATE;
BEGIN
  next_date := (SELECT MIN(txn_date) FROM loop_test_raw_transactions WHERE NOT loaded);
  WHILE (next_date IS NOT NULL) DO
    INSERT INTO loop_test_daily_sales_summary
      SELECT txn_date, SUM(amount), COUNT(*)
      FROM loop_test_raw_transactions
      WHERE txn_date = :next_date AND NOT loaded
      GROUP BY txn_date;
    UPDATE loop_test_raw_transactions SET loaded = TRUE WHERE txn_date = :next_date;
    next_date := (SELECT MIN(txn_date) FROM loop_test_raw_transactions WHERE NOT loaded);
  END WHILE;
  RETURN 'Daily summaries created for all transaction dates';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  next_date DATE;
BEGIN
  next_date := (SELECT MIN(txn_date) FROM loop_test_raw_transactions WHERE NOT loaded);
  WHILE (next_date IS NOT NULL) DO
    INSERT INTO loop_test_daily_sales_summary
      SELECT txn_date, SUM(amount), COUNT(*)
      FROM loop_test_raw_transactions
      WHERE txn_date = :next_date AND NOT loaded
      GROUP BY txn_date;
    UPDATE loop_test_raw_transactions SET loaded = TRUE WHERE txn_date = :next_date;
    next_date := (SELECT MIN(txn_date) FROM loop_test_raw_transactions WHERE NOT loaded);
  END WHILE;
  RETURN 'Daily summaries created for all transaction dates';
END;
$$
;
```

```output
+----------------------------------------------------+
| anonymous block                                    |
|----------------------------------------------------|
| Daily summaries created for all transaction dates  |
+----------------------------------------------------+
```

To verify the results, query the summary table:

```sqlexample
SELECT * FROM loop_test_daily_sales_summary ORDER BY summary_date;
```

```output
+--------------+-------------+-----------+
| SUMMARY_DATE | TOTAL_SALES | TXN_COUNT |
|--------------+-------------+-----------|
| 2025-03-01   |      470.49 |         3 |
| 2025-03-02   |      412.00 |         1 |
| 2025-03-03   |      234.00 |         2 |
+--------------+-------------+-----------+
```

For the full syntax and details about WHILE loops, see [WHILE (Snowflake Scripting)](../../sql-reference/snowflake-scripting/while.md).

## REPEAT loop

A [REPEAT](../../sql-reference/snowflake-scripting/repeat.md) loop iterates until a condition is true. In a REPEAT
loop, the condition is tested immediately after executing the body of the loop. As a result, the body of the loop always executes
at least once.

The syntax for a REPEAT loop is:

```sqlsyntax
REPEAT
  <statement>;
  [ <statement>; ... ]
UNTIL ( <condition> )
END REPEAT [ <label> ] ;
```

For example:

```sqlexample
BEGIN
  LET counter := 5;
  LET number_of_iterations := 0;
  REPEAT
    counter := counter - 1;
    number_of_iterations := number_of_iterations + 1;
  UNTIL (counter = 0)
  END REPEAT;
  RETURN number_of_iterations;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET counter := 5;
  LET number_of_iterations := 0;
  REPEAT
    counter := counter - 1;
    number_of_iterations := number_of_iterations + 1;
  UNTIL (counter = 0)
  END REPEAT;
  RETURN number_of_iterations;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               5 |
+-----------------+
```

The following example creates a staging table and a target table with an additional `batch_id` column. It then uses a
REPEAT loop to move rows in batches from the staging table to the target table, incrementing the batch ID after each
iteration, until no rows remain in the staging table:

```sqlexample
CREATE OR REPLACE TABLE loop_test_orders_staging (
  order_id INTEGER,
  customer VARCHAR,
  amount NUMBER(12,2));

INSERT INTO loop_test_orders_staging VALUES
  (101, 'TestA Corp', 500.00),
  (102, 'TestB Corp', 1200.00),
  (103, 'TestA Corp', 300.00),
  (104, 'TestC Corp', 750.00),
  (105, 'TestB Corp', 425.00),
  (106, 'TestC Corp', 980.00);

CREATE OR REPLACE TABLE loop_test_orders_processed (
    order_id INTEGER,
    customer VARCHAR,
    amount NUMBER(12,2),
    batch_id INTEGER);
```

```sqlexample
DECLARE
  batch_size INTEGER DEFAULT 2;
  batch_id INTEGER DEFAULT 1;
  remaining INTEGER;
BEGIN
  remaining := (SELECT COUNT(*) FROM loop_test_orders_staging);
  REPEAT
    INSERT INTO loop_test_orders_processed
      SELECT order_id, customer, amount, :batch_id
      FROM loop_test_orders_staging
      ORDER BY order_id
      LIMIT :batch_size;
    DELETE FROM loop_test_orders_staging WHERE order_id IN (
      SELECT order_id
        FROM loop_test_orders_processed
        WHERE batch_id = :batch_id
    );
    batch_id := batch_id + 1;
    remaining := (SELECT COUNT(*) FROM loop_test_orders_staging);
  UNTIL (remaining = 0)
  END REPEAT;
  RETURN 'Processed all orders in ' || (batch_id - 1) || ' batches';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  batch_size INTEGER DEFAULT 2;
  batch_id INTEGER DEFAULT 1;
  remaining INTEGER;
BEGIN
  remaining := (SELECT COUNT(*) FROM loop_test_orders_staging);
  REPEAT
    INSERT INTO loop_test_orders_processed
      SELECT order_id, customer, amount, :batch_id
      FROM loop_test_orders_staging
      ORDER BY order_id
      LIMIT :batch_size;
    DELETE FROM loop_test_orders_staging WHERE order_id IN (
      SELECT order_id
        FROM loop_test_orders_processed
        WHERE batch_id = :batch_id
    );
    batch_id := batch_id + 1;
    remaining := (SELECT COUNT(*) FROM loop_test_orders_staging);
  UNTIL (remaining = 0)
  END REPEAT;
  RETURN 'Processed all orders in ' || (batch_id - 1) || ' batches';
END;
$$
;
```

```output
+------------------------------------+
| anonymous block                    |
|------------------------------------|
| Processed all orders in 3 batches  |
+------------------------------------+
```

To verify the results, query the target table:

```sqlexample
SELECT * FROM loop_test_orders_processed ORDER BY batch_id, order_id;
```

```output
+----------+------------+---------+----------+
| ORDER_ID | CUSTOMER   |  AMOUNT | BATCH_ID |
|----------+------------+---------+----------|
|      101 | TestA Corp |  500.00 |        1 |
|      102 | TestB Corp | 1200.00 |        1 |
|      103 | TestA Corp |  300.00 |        2 |
|      104 | TestC Corp |  750.00 |        2 |
|      105 | TestB Corp |  425.00 |        3 |
|      106 | TestC Corp |  980.00 |        3 |
+----------+------------+---------+----------+
```

For the full syntax and details about REPEAT loops, see [REPEAT (Snowflake Scripting)](../../sql-reference/snowflake-scripting/repeat.md).

## LOOP loop

A [LOOP](../../sql-reference/snowflake-scripting/loop.md) loop executes until a BREAK
command is executed. A BREAK command is normally embedded inside branching logic
(e.g. [IF statements](branch.md) or [CASE statements](branch.md)).

The syntax for a LOOP statement is:

```sqlsyntax
LOOP
  <statement>;
  [ <statement>; ... ]
END LOOP [ <label> ] ;
```

For example:

```sqlexample
BEGIN
  LET counter := 5;
  LOOP
    IF (counter = 0) THEN
      BREAK;
    END IF;
    counter := counter - 1;
  END LOOP;
  RETURN counter;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET counter := 5;
  LOOP
    IF (counter = 0) THEN
      BREAK;
    END IF;
    counter := counter - 1;
  END LOOP;
  RETURN counter;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               0 |
+-----------------+
```

The following example creates a log table with entries spanning multiple dates. It uses a LOOP to archive rows older
than a cutoff date into an archive table and delete them from the source, processing one day at a time until no
qualifying rows remain:

```sqlexample
CREATE OR REPLACE TABLE loop_test_event_log (
  event_id INTEGER,
  event_date DATE,
  event_description VARCHAR);

INSERT INTO loop_test_event_log VALUES
  (1, DATEADD('month', -3, CURRENT_DATE()), 'User login'),
  (2, DATEADD('month', -3, CURRENT_DATE()), 'File upload'),
  (3, DATEADD('month', -2, CURRENT_DATE()), 'Password change'),
  (4, DATEADD('month', -1, CURRENT_DATE()), 'User login'),
  (5, DATEADD('month', -1, CURRENT_DATE()), 'Data export');

CREATE OR REPLACE TABLE loop_test_event_log_archive (
  event_id INTEGER,
  event_date DATE,
  event_description VARCHAR,
  archived_on DATE);
```

```sqlexample
DECLARE
  cutoff_date DATE DEFAULT DATEADD('month', -1, CURRENT_DATE());
  oldest_date DATE;
  archived_total INTEGER DEFAULT 0;
  batch_count INTEGER;
BEGIN
  LOOP
    oldest_date := (SELECT MIN(event_date)
                      FROM loop_test_event_log
                      WHERE event_date < :cutoff_date);
    IF (oldest_date IS NULL) THEN
      BREAK;
    END IF;
    batch_count := (SELECT COUNT(*)
                      FROM loop_test_event_log
                      WHERE event_date = :oldest_date);
    INSERT INTO loop_test_event_log_archive
      SELECT event_id, event_date, event_description, CURRENT_DATE()
        FROM loop_test_event_log
        WHERE event_date = :oldest_date;
    DELETE FROM loop_test_event_log WHERE event_date = :oldest_date;
    archived_total := archived_total + batch_count;
  END LOOP;
  RETURN 'Archived ' || archived_total || ' events';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  cutoff_date DATE DEFAULT DATEADD('month', -1, CURRENT_DATE());
  oldest_date DATE;
  archived_total INTEGER DEFAULT 0;
  batch_count INTEGER;
BEGIN
  LOOP
    oldest_date := (SELECT MIN(event_date)
                      FROM loop_test_event_log
                      WHERE event_date < :cutoff_date);
    IF (oldest_date IS NULL) THEN
      BREAK;
    END IF;
    batch_count := (SELECT COUNT(*)
                      FROM loop_test_event_log
                      WHERE event_date = :oldest_date);
    INSERT INTO loop_test_event_log_archive
      SELECT event_id, event_date, event_description, CURRENT_DATE()
        FROM loop_test_event_log
        WHERE event_date = :oldest_date;
    DELETE FROM loop_test_event_log WHERE event_date = :oldest_date;
    archived_total := archived_total + batch_count;
  END LOOP;
  RETURN 'Archived ' || archived_total || ' events';
END;
$$
;
```

```output
+---------------------+
| anonymous block     |
|---------------------|
| Archived 3 events   |
+---------------------+
```

To verify the results, query both tables:

```sqlexample
SELECT * FROM loop_test_event_log ORDER BY event_id;
```

```output
+----------+------------+-------------------+
| EVENT_ID | EVENT_DATE | EVENT_DESCRIPTION |
|----------+------------+-------------------|
|        4 | 2026-02-04 | User login        |
|        5 | 2026-02-04 | Data export       |
+----------+------------+-------------------+
```

```sqlexample
SELECT event_id,
       event_date,
       event_description
  FROM loop_test_event_log_archive
  ORDER BY event_id;
```

```output
+----------+------------+-------------------+
| EVENT_ID | EVENT_DATE | EVENT_DESCRIPTION |
|----------+------------+-------------------|
|        1 | 2025-12-04 | User login        |
|        2 | 2025-12-04 | File upload       |
|        3 | 2026-01-04 | Password change   |
+----------+------------+-------------------+
```

For the full syntax and details about LOOP loops, see [LOOP (Snowflake Scripting)](../../sql-reference/snowflake-scripting/loop.md).

## Terminating a loop or iteration

In a loop construct, you can specify when the loop or an iteration of the loop must terminate early. The next sections explain
this in more detail:

* Terminating a loop
* Terminating an iteration without terminating the loop
* Specifying where execution should continue after termination

### Terminating a loop

You can explicitly terminate a loop early by executing the [BREAK](../../sql-reference/snowflake-scripting/break.md) command.
BREAK (and its synonym EXIT) immediately stops the current iteration, and skips any remaining iterations.
You can think of BREAK as jumping to the first executable statement after the end of the loop.

BREAK is required in a LOOP loop but is not necessary in WHILE, FOR, and REPEAT loops. In most cases,
if you have statements that you want to skip, you can use the standard branching constructs ([IF statements](branch.md) and
[CASE statements](branch.md)) to control which statements inside a loop are executed.

A BREAK command itself is usually inside an IF or CASE statement.

### Terminating an iteration without terminating the loop

You can use the CONTINUE (or ITERATE) command to jump to the end of an iteration of a loop, skipping the
remaining statements in the loop. The loop continues at the start of the next iteration.

Such jumps are rarely necessary. In most cases, if you have statements that you want to skip, you can use the
standard branching constructs ([IF statements](branch.md) and [CASE statements](branch.md)) to control which
statements inside a loop are executed.

A CONTINUE or ITERATE command itself is usually inside an IF or CASE statement.

### Specifying where execution should continue after termination

In a BREAK or CONTINUE command, if you need to continue execution at a specific point in the code (e.g. the outer
loop in a nested loop), specify a label that identifies the point at which execution should continue.

The following example demonstrates this in a nested loop:

```sqlexample
BEGIN
  LET inner_counter := 0;
  LET outer_counter := 0;
  LOOP
    LOOP
      IF (inner_counter < 5) THEN
        inner_counter := inner_counter + 1;
        CONTINUE OUTER;
      ELSE
        BREAK OUTER;
      END IF;
    END LOOP INNER;
    outer_counter := outer_counter + 1;
    BREAK;
  END LOOP OUTER;
  RETURN ARRAY_CONSTRUCT(outer_counter, inner_counter);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET inner_counter := 0;
  LET outer_counter := 0;
  LOOP
    LOOP
      IF (inner_counter < 5) THEN
        inner_counter := inner_counter + 1;
        CONTINUE OUTER;
      ELSE
        BREAK OUTER;
      END IF;
    END LOOP INNER;
    outer_counter := outer_counter + 1;
    BREAK;
  END LOOP OUTER;
  RETURN ARRAY_CONSTRUCT(outer_counter, inner_counter);
END;
$$;
```

In this example:

* There is a loop labeled INNER that is nested in a loop labeled OUTER.
* CONTINUE OUTER starts another iteration of the loop with the label OUTER.
* BREAK OUTER terminates the inner loop and transfers control to the end of the outer loop (labeled OUTER).

The output of this command is:

```output
+-----------------+
| anonymous block |
|-----------------|
| [               |
|   0,            |
|   5             |
| ]               |
+-----------------+
```

As shown in the output:

* `inner_counter` is incremented up to 5. CONTINUE OUTER starts a new iteration of the outer loop, which starts a new
  iteration of the inner loop, which increments this counter up to 5. These iterations continue until the value of
  `inner_counter` equals 5 and BREAK OUTER terminates the inner loop.
* `outer_counter` is never incremented. The statement that increments this counter is never reached because BREAK OUTER
  transfers control to the end of the outer loop.

---
title: Working with RESULTSETs
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/resultsets.md
section: Developer Guide
---

# Working with RESULTSETs

This topic explains how to use a RESULTSET in Snowflake Scripting.

## Introduction

In Snowflake Scripting, a RESULTSET is a SQL data type that points to the result set of a query.

Because a RESULTSET is just a pointer to the results, you must do one of the following to access the results through the
RESULTSET:

* Use the `TABLE(...)` syntax to retrieve the results as a table.
* Iterate over the RESULTSET with a [cursor](cursors.md).

Examples of both of these are included below.

## Understanding the differences between a cursor and a RESULTSET

A RESULTSET and a [cursor](cursors.md) both provide access to the result set of a query. However, these objects differ in the
following ways:

* The point in time when the query is executed.

  + For a cursor, the query is executed when you execute the [OPEN](../../sql-reference/snowflake-scripting/open.md) command on the
    cursor.
  + For a RESULTSET, the query is executed when you assign the query to the RESULTSET (either in the DECLARE section
    or in the BEGIN … END block).
* Support for binding in the OPEN command.

  + When you declare a cursor, you can specify bind parameters (`?` characters). Later, when you execute the
    [OPEN](../../sql-reference/snowflake-scripting/open.md) command, you can bind variables to those parameters in the USING clause.
  + RESULTSET does not support the OPEN command. However, you can bind variables in SQL commands before returning the
    result set.

In general, it is simpler to use a RESULTSET when you want to return a table that contains the result set of a query. However,
you can also return a table from a Snowflake Scripting block with a cursor. To do so, you can pass the cursor to
`RESULTSET_FROM_CURSOR(cursor)` to return a RESULTSET and pass that RESULTSET to `TABLE(...)`. See
[Returning a table for a cursor](cursors.md).

## Declaring a RESULTSET

You can declare a RESULTSET in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section of a block or in the
[BEGIN … END](../../sql-reference/snowflake-scripting/begin.md) section of the block.

* Within the DECLARE section, use the syntax described in [RESULTSET declaration syntax](../../sql-reference/snowflake-scripting/declare.md). For example:

  ```sqlexample
  DECLARE
    ...
    res RESULTSET DEFAULT (SELECT col1 FROM mytable ORDER BY col1);
  ```
* Within the BEGIN … END block, use the syntax described in [RESULTSET assignment syntax](../../sql-reference/snowflake-scripting/let.md). For example:

  ```sqlexample
  BEGIN
    ...
    LET res RESULTSET := (SELECT col1 FROM mytable ORDER BY col1);
  ```

## Assigning a query to a declared RESULTSET

To assign the result of a query to a RESULTSET that has already been declared, use the following syntax:

```sqlsyntax
<resultset_name> := [ ASYNC ] ( <query> ) ;
```

Where:

> `resultset_name`
> :   The name of the RESULTSET.
>
>     The name must be unique within the current scope.
>
>     The name must follow the naming rules for [Object identifiers](../../sql-reference/identifiers.md).
>
> `ASYNC`
> :   Runs the query as an [asynchronous child job](asynchronous-child-jobs.md).
>
> `query`
> :   The query to assign to the RESULTSET.

To assign a query to a RESULTSET:

```sqlexample
DECLARE
  res RESULTSET;
BEGIN
  res := (SELECT col1 FROM mytable ORDER BY col1);
  ...
```

To assign a query to a RESULTSET and run the query as an asynchronous child job:

```sqlexample
DECLARE
  res RESULTSET;
BEGIN
  res := ASYNC (SELECT col1 FROM mytable ORDER BY col1);
  ...
```

To build a SQL string dynamically for the query, set `query` to
`(EXECUTE IMMEDIATE string_of_sql)`. For example:

```sqlexample
DECLARE
  res RESULTSET;
  col_name VARCHAR;
  select_statement VARCHAR;
BEGIN
  col_name := 'col1';
  select_statement := 'SELECT ' || col_name || ' FROM mytable';
  res := (EXECUTE IMMEDIATE :select_statement);
  RETURN TABLE(res);
END;
```

Although you can set `query` to an EXECUTE IMMEDIATE statement for a RESULTSET, you can’t do this for a
cursor.

## Using a RESULTSET

The query for a RESULTSET is executed when the object is associated with that query. For example:

* When you declare a RESULTSET and set the DEFAULT clause to a query, the query is executed at that point in time.
* When you use the `:=` operator to assign a query to a RESULTSET, the query is executed at that point in time.

> **Note:**
>
> Because a RESULTSET points to the result set of a query (and does not contain the result set of a query), a RESULTSET
> is valid only as long as the query results are cached (typically 24 hours). For details about query result caching,
> see [Using Persisted Query Results](../../user-guide/querying-persisted-results.md).

Once the query is executed, you can access the results by using a cursor. You can also return the results as a table from a stored
procedure.

* Using a cursor to access data from a RESULTSET
* Returning a RESULTSET as a table

### Using a cursor to access data from a RESULTSET

To use a cursor to access the data from a RESULTSET, [declare the cursor](cursors.md) on the
object. For example:

```sqlexample
DECLARE
  ...
  res RESULTSET DEFAULT (SELECT col1 FROM mytable ORDER BY col1);
  c1 CURSOR FOR res;
```

When you declare a cursor on a RESULTSET, the cursor gets access to the data already in the RESULTSET. Executing
the [OPEN](../../sql-reference/snowflake-scripting/open.md) command on the cursor does not execute the query for the RESULTSET
again.

You can then [open the cursor](cursors.md) and use the cursor to
[fetch the data](cursors.md).

> **Note:**
>
> If the results include GEOGRAPHY values, you must cast the values to the GEOGRAPHY type before passing the values to any
> functions that expect GEOGRAPHY input values. See [Using a cursor to retrieve a GEOGRAPHY value](cursors.md).

### Returning a RESULTSET as a table

If you want to return the results that the RESULTSET points to, pass the RESULTSET to `TABLE(...)`. For example:

```sqlexample
CREATE PROCEDURE f()
  RETURNS TABLE(column_1 INTEGER, column_2 VARCHAR)
  ...
    RETURN TABLE(my_resultset_1);
  ...
```

This is similar to the way that `TABLE(...)` is used with
[table functions](../../sql-reference/functions-table.md) (such as [RESULT_SCAN](../../sql-reference/functions/result_scan.md)).

As shown in the example, if you write a stored procedure that returns a table, you must declare the stored procedure as returning
a table.

> **Note:**
>
> Currently, the `TABLE(resultset_name)` syntax is supported only in the
> [RETURN](../../sql-reference/snowflake-scripting/return.md) statement.

Even if you have used a cursor to fetch rows from the RESULTSET, the
table returned by `TABLE(resultset_name)` still contains all of the rows (not just the rows starting from the cursor’s
internal row pointer).

## Limitations of the RESULTSET data type

Although RESULTSET is a data type, Snowflake does not yet support:

* Declaring a column of type RESULTSET.
* Declaring a parameter of type RESULTSET.
* Declaring a stored procedure’s return type as a RESULTSET.

Snowflake supports RESULTSET only inside Snowflake Scripting.

In addition, you can’t use a RESULTSET directly as a table. For example, the following is invalid:

```sqlexample
SELECT * FROM my_result_set;
```

## Examples of using a RESULTSET

The following sections provide examples of using a RESULTSET:

* Setting up the data for the examples
* Example: Returning a table from a stored procedure
* Example: Constructing the SQL statement dynamically
* Example: Declaring a RESULTSET variable without a DEFAULT clause
* Example: Using a CURSOR with a RESULTSET
* Additional examples that use a RESULTSET

For examples that use the ASYNC keyword to run queries specified for RESULTSETs as asynchronous child jobs,
see [Examples of using asynchronous child jobs](asynchronous-child-jobs.md).

### Setting up the data for the examples

Many of the examples below use the table and data shown below:

```sqlexample
CREATE OR REPLACE TABLE t001 (a INTEGER, b VARCHAR);
INSERT INTO t001 (a, b) VALUES
  (1, 'row1'),
  (2, 'row2');
```

### Example: Returning a table from a stored procedure

The following code shows how to declare a RESULTSET and return the results that the RESULTSET points to. The RETURNS
clause in the CREATE PROCEDURE command declares that the stored procedure returns a table, which contains one column of
type INTEGER.

The RETURN statement inside the block uses the `TABLE(...)` syntax to return the results as a table.

Create the stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp()
RETURNS TABLE(a INTEGER)
LANGUAGE SQL
AS
  DECLARE
    res RESULTSET DEFAULT (SELECT a FROM t001 ORDER BY a);
  BEGIN
    RETURN TABLE(res);
  END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp()
RETURNS TABLE(a INTEGER)
LANGUAGE SQL
AS
$$
  DECLARE
      res RESULTSET default (SELECT a FROM t001 ORDER BY a);
  BEGIN
      RETURN TABLE(res);
  END;
$$;
```

Call the stored procedure:

```sqlexample
CALL test_sp();
```

```output
+---+
| A |
|---|
| 1 |
| 2 |
+---+
```

You can use the [pipe operator](../../sql-reference/operators-flow.md) (`->>`) to process the results of the stored procedure
call:

```sqlexample
CALL test_sp()
  ->> SELECT *
        FROM $1
        WHERE a > 1;
```

```output
+---+
| A |
|---|
| 2 |
+---+
```

You can also use the [RESULT_SCAN](../../sql-reference/functions/result_scan.md) function to process the results after you call the
stored procedure:

```sqlexample
CALL test_sp();
```

```output
+---+
| A |
|---|
| 1 |
| 2 |
+---+
```

```sqlexample
SELECT *
  FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
  WHERE a < 2;
```

```output
+---+
| A |
|---|
| 1 |
+---+
```

### Example: Constructing the SQL statement dynamically

You can construct the SQL dynamically. The following is an example that executes the same query as the previous stored procedure
but that uses a SQL statement that is constructed dynamically:

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_dynamic(table_name VARCHAR)
  RETURNS TABLE(a INTEGER)
  LANGUAGE SQL
AS
DECLARE
  res RESULTSET;
  query VARCHAR DEFAULT 'SELECT a FROM IDENTIFIER(?) ORDER BY a;';
BEGIN
  res := (EXECUTE IMMEDIATE :query USING(table_name));
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_dynamic(table_name VARCHAR)
RETURNS TABLE(a INTEGER)
LANGUAGE SQL
AS
$$
  DECLARE
    res RESULTSET;
    query VARCHAR DEFAULT 'SELECT a FROM IDENTIFIER(?) ORDER BY a;';
  BEGIN
    res := (EXECUTE IMMEDIATE :query USING(table_name));
    RETURN TABLE(res);
  END
$$
;
```

To run the example, call the stored procedure and pass in the table name:

```sqlexample
CALL test_sp_dynamic('t001');
```

```output
+---+
| A |
|---|
| 1 |
| 2 |
+---+
```

### Example: Declaring a RESULTSET variable without a DEFAULT clause

The following code shows how to declare a RESULTSET without a DEFAULT clause (i.e. without associating a query with the RESULTSET),
and then associate the RESULTSET with a query later.

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_02()
RETURNS TABLE(a INTEGER)
LANGUAGE SQL
AS
  DECLARE
    res RESULTSET;
  BEGIN
    res := (SELECT a FROM t001 ORDER BY a);
    RETURN TABLE(res);
  END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_02()
RETURNS TABLE(a INTEGER)
LANGUAGE SQL
AS
$$
  DECLARE
      res RESULTSET;
  BEGIN
      res := (SELECT a FROM t001 ORDER BY a);
      RETURN TABLE(res);
  END;
$$;
```

To run the example, call the stored procedure:

```sqlexample
CALL test_sp_02();
```

```output
+---+
| A |
|---|
| 1 |
| 2 |
+---+
```

### Example: Using a CURSOR with a RESULTSET

The following code shows how to use a [cursor](cursors.md) to iterate over the rows in a RESULTSET:

Create the stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_03()
RETURNS VARCHAR
LANGUAGE SQL
AS

DECLARE
  accumulator INTEGER DEFAULT 0;
  res1 RESULTSET DEFAULT (SELECT a FROM t001 ORDER BY a);
  cur1 CURSOR FOR res1;
BEGIN
  FOR row_variable IN cur1 DO
    accumulator := accumulator + row_variable.a;
  END FOR;
  RETURN accumulator::VARCHAR;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_sp_03()
RETURNS INTEGER
LANGUAGE SQL
AS
$$
  DECLARE
    accumulator INTEGER DEFAULT 0;
    res1 RESULTSET DEFAULT (SELECT a FROM t001 ORDER BY a);
    cur1 CURSOR FOR res1;
  BEGIN
    FOR row_variable IN cur1 DO
        accumulator := accumulator + row_variable.a;
    END FOR;
    RETURN accumulator;
  END;
$$;
```

Call the stored procedure, and the results add the values for `a` in the table (1 + 2):

```sqlexample
CALL test_sp_03();
```

```output
+------------+
| TEST_SP_03 |
|------------|
| 3          |
+------------+
```

### Additional examples that use a RESULTSET

Here are additional examples that use a RESULTSET:

* [Use a RESULTSET-based FOR loop](loops.md)

  This example shows you how to use a FOR loop that iterates over a RESULTSET.
* [Return a table for a cursor](cursors.md)

  This example shows you how to use a cursor to return a table of data in a RESULTSET.
* [Update table data with user input](use-cases.md)

  This example shows you how to use bind variables based on user input to update
  data in a table. It uses a FOR loop with conditional logic to iterate over the rows
  in a RESULTSET.
* [Filter and collect data](use-cases.md)

  This example shows you how to use a RESULTSET to collect data and insert that
  data into a table to track historical trends.

---
title: Working with stored procedures
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-usage.md
section: Developer Guide
---

# Working with stored procedures

Stored procedures enable users to create modular code that can include complex business logic by combining multiple
SQL statements with procedural logic.

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

## Naming conventions for stored procedures

You must name procedures according to conventions enforced by Snowflake.

For more information, see [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

## Transaction management

Stored procedures are not atomic; if one statement in a stored procedure fails, the other statements in the stored
procedure are not necessarily rolled back.

You can use stored procedures with transactions to make a group of statements atomic. For details, see
[Stored procedures and transactions](../../sql-reference/transactions.md).

## General tips

### Symmetric code

If you are familiar with programming in assembly language, you might
find the following analogy helpful. In assembly language, functions
often create and undo their environments in a symmetric way. For example:

> ```none
> -- Set up.
> push a;
> push b;
> ...
> -- Clean up in the reverse order that you set up.
> pop b;
> pop a;
> ```

You might want to use this approach in your stored procedures:

* If a stored procedure makes temporary changes to your session, then that procedure should undo those changes before returning.
* If a stored procedure utilizes exception handling or branching, or other logic that might impact which statements
  are executed, you need to clean up whatever you created, regardless of which branches you take during a particular
  invocation.

For example your code might look similar to the pseudo-code shown below:

> ```sqlexample
> CREATE PROCEDURE f() ...
>     $$
>     set x;
>     set y;
>     try  {
>        set z;
>        -- Do something interesting...
>        ...
>        unset z;
>        }
>     catch  {
>        -- Give error message...
>        ...
>        unset z;
>        }
>     unset y;
>     unset x;
>     $$
>     ;
> ```

## Calling a stored procedure

You call a stored procedure using a SQL command. For more information on calling stored procedures, see
[Calling a stored procedure](stored-procedures-calling.md).

## Selecting from a stored procedure

You call a stored procedure in the FROM clause of a SELECT statement. For more information on selecting from a stored procedure, see
[Selecting from a stored procedure](stored-procedures-selecting-from.md).

## Privileges

Stored Procedures utilize two types of privileges:

* Privileges directly on the stored procedure itself.
* Privileges on the database objects (e.g. tables) that the stored procedure accesses.

### Privileges on stored procedures

Similar to other database objects (tables, views, UDFs, etc.), stored procedures are
owned by a role and have one or more privileges that can be granted to other roles.

Currently, the following privileges apply to stored procedures:

* USAGE
* OWNERSHIP

For a role to use a stored procedure, the role must either be the owner or
have been granted USAGE privilege on the stored procedure.

### Privileges on the database objects accessed by the stored procedure

This subject is covered in [Understanding caller’s rights and owner’s rights stored procedures](stored-procedures-rights.md).

## Stored procedure considerations

* Although stored procedures allow nesting and recursion, the current maximum stack depth
  of nested calls for user-defined stored procedures is 16 (including the top-level
  stored procedure), and can be less if individual stored procedures in the call chain
  consume large amounts of resources.
* In rare cases, calling too many stored procedures at the same time can cause a deadlock.

## Working with stored procedures in Snowsight

You can work with stored procedures in SQL or in Snowsight.

For any stored procedure in Snowflake, you can open Catalog » Database Explorer and search for or browse to the stored procedure.
Select the stored procedure to review details and manage the procedure.

You must have the relevant privileges to access and manage the
stored procedure in Snowsight.

### Explore stored procedure details in Snowsight

After opening a stored procedure in Snowsight, you can do the following:

* Identify when the procedure was created, and any comment on the procedure.
  You can hover over the time details to see the exact creation date and time.
* Review additional details about the stored procedure, including:

  + Arguments that the stored procedure takes, if applicable.
  + The data type of the result of the procedure.
  + Whether the procedure is an aggregate function.
  + Whether the procedure is a secure function.
  + Whether the procedure is a table function.
  + The language in which the stored procedure is written. For example, JavaScript.
* Review the SQL definition of the stored procedure in the Procedure definition section.
* Review the roles with privileges on the stored procedure in the Privileges section.

### Manage a stored procedure in Snowsight

You can perform the following basic management tasks for a stored procedure in Snowsight:

* To edit the stored procedure name or add a comment, select  » Edit.
* To drop the stored procedure, select  » Drop.
* To transfer ownership of the stored procedure to another role, select  » Transfer Ownership

### SQL injection

Stored procedures can dynamically create a SQL statement and execute it. However, this can allow
SQL injection attacks, particularly if you create the SQL statement using input from
a public or untrusted source.

You can minimize the risk of SQL injection attacks by binding parameters rather than concatenating text. For an
example of binding variables, see [Binding variables](stored-procedures-javascript.md).

If you choose to use concatenation, you should check inputs carefully when
constructing SQL dynamically using input from public sources.
You might also want to take other precautions, such as querying using a
role that has limited privileges (e.g. read-only access, or access to
only certain tables or views).

For more information about SQL injection attacks, see
[SQL injection](https://en.wikipedia.org/wiki/SQL_injection) (in Wikipedia).

## Design tips for stored procedures

Here are some tips for designing a stored procedure:

* What resources, for example, tables, does this stored procedure need?
* What privileges are needed?

  Think about which database objects will be accessed, and which roles will run your stored
  procedure, and which privileges those roles will need.

  If the procedure should be a caller’s rights stored procedure, then
  you might want to create a role to run that specific procedure, or any of a
  group of related procedures. You can then grant any required privileges to that role, and
  then grant that role to appropriate users.
* Should the stored procedure run with caller’s rights or owner’s rights? For more information on this topic, see
  [Understanding caller’s rights and owner’s rights stored procedures](stored-procedures-rights.md).
* How should the procedure handle errors, for example what should the procedure do if a required table is missing,
  or if an argument is invalid?
* Should the stored procedure log its activities or errors, for example by writing to a log table?
* See also the discussion about when to use a stored procedure vs. when to use a UDF:
  [Choosing whether to write a stored procedure or a user-defined function](../stored-procedures-vs-udfs.md).

## Documenting stored procedures

Stored procedures are usually written to be re-used, and often to be shared. Documenting stored procedures
can make stored procedures easier to use and easier to maintain.

Below are some general recommendations for documenting stored procedures.

Typically, there are at least two audiences who want to know about a stored procedure:

* Users/callers.
* Programmers/authors.

For users (and programmers), document each of the following:

> * The name of the stored procedure.
> * The “location” of the stored procedure (database and schema).
> * The purpose of the stored procedure.
> * The name, data type, and meaning of each input parameter.
> * The name, data type, and meaning of the return value. If the return value is a complex type, such as a VARIANT
>   that contains sub-fields, document those sub-fields.
> * If the stored procedure relies on information from its environment, for example session variables or session
>   parameters, document the names, purposes, and valid values of those.
> * Errors returned, exceptions thrown, etc.
> * Roles or privileges required in order to run the procedure. (For more on this topic, see the discussion of
>   roles in Design tips for stored procedures.)
> * Whether the procedure is a caller’s rights procedure or an owner’s rights procedure.
> * Any prerequisites, for example tables that must exist before the procedure is called.
> * Any outputs (besides the return value), for example new tables that are created.
> * Any “side-effects”, for example changes in privileges, deletions of old data, etc. Most stored procedures
>   (unlike functions) are called specifically for their side effects, not their return values, so make sure to
>   document those effects.
> * If cleanup is required after running the stored procedure, document that cleanup.
> * Whether the procedure can be called as part of a multi-statement transaction (with AUTOCOMMIT=FALSE), or whether
>   it should be run outside a transaction (with AUTOCOMMIT=TRUE).
> * An example of a call and an example of what is returned.
> * Limitations (if applicable). For example, suppose that the procedure reads a table and returns a VARIANT that
>   contains information from each row of the table. It is possible for the VARIANT to grow larger than the
>   maximum legal size of a VARIANT, so you might need to give the caller some idea of the maximum number of rows
>   in the table that the procedure accesses.
> * Warnings (if applicable).
> * Troubleshooting tips.

For programmers:

> * The author(s).
> * Explain why the procedure was created as a caller’s rights procedure or an owner’s rights procedure – the reason
>   might not be obvious.
> * Stored procedures can be nested, but there is a limit to the depth of the nesting. If your stored procedure
>   calls other stored procedures, and is itself likely to be called by other stored procedures, then you
>   might want to specify the maximum known depth of your stored procedure’s call stack so that callers have
>   some idea of whether calling your stored procedure might exceed the maximum call stack depth.
> * Debugging tips.

The location and format of this information are up to you. You might store the information in HTML format in an
internal web site, for example. Before deciding where to store it, think about where your organization stores similar
information for other products, or similar information for other Snowflake features, such as views,
user-defined functions, etc.

Other tips:

* Include comments in the source code, as you should for almost any piece of source code.

  + Remember that reverse engineering meaning from code is difficult. Describe not only how your algorithm works,
    but also the purpose of that algorithm.
* Stored procedures allow an optional COMMENT that can be specified with the `CREATE PROCEDURE` or
  `ALTER PROCEDURE` statement. Other people can read this comment by running the `SHOW PROCEDURES` command.
* If practical, consider keeping a master copy of each stored procedure’s CREATE PROCEDURE command in a
  source code control system. Snowflake’s Time Travel feature does not apply to stored procedures, so looking up
  old versions of stored procedures must be done outside Snowflake. If a source code control system is not available,
  you can partly simulate one by storing the CREATE PROCEDURE commands in a VARCHAR field in a table, and adding each
  new version (without replacing the older version(s)).
* Consider using a naming convention to help provide information about stored procedures.
  For example, a prefix or suffix in the name might indicate whether the procedure is a caller’s rights stored
  procedure or an owner’s rights stored procedure. (E.g. you could use `cr_` as a prefix for Caller’s Rights.)
* To see the data types and order of the input arguments, as well as the comment, you can use the SHOW PROCEDURES
  command. Remember, however, that this shows only the names and data types of the arguments; it does not explain
  the arguments.
* If you have appropriate privileges, you can use the DESCRIBE PROCEDURE command to see:

  + The names and data types of the arguments.
  + The body of the procedure, and whether the procedure executes as owner or caller.
  + The data type of the return value.
  + Other useful information.

---
title: Working with variables
source: https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables.md
section: Developer Guide
---

# Working with variables

In Snowflake Scripting, you can use variables in expressions, Snowflake Scripting statements, and SQL statements.

## Declaring a variable

Before you can use a variable, you must declare the variable. When you declare a variable, you must specify the type of the
variable in one of the following ways:

* Explicitly specify the data type.
* Specify an expression for the initial value of the variable. Snowflake Scripting uses the expression to determine the data
  type of the variable. See How Snowflake Scripting infers the data type of a variable.

You can declare a variable in the following ways:

* Within the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section of the block by using any of the following:

  ```sqlsyntax
  <variable_name> <type> ;

  <variable_name> DEFAULT <expression> ;

  <variable_name> <type> DEFAULT <expression> ;
  ```
* Within the [BEGIN … END](../../sql-reference/snowflake-scripting/begin.md) section of the block (before you use the variable)
  by using the [LET](../../sql-reference/snowflake-scripting/let.md) command in any of the following ways:

  ```sqlsyntax
  LET <variable_name> <type> { DEFAULT | := } <expression> ;

  LET <variable_name> { DEFAULT | := } <expression> ;
  ```

Where:

> `variable_name`
> :   The name of the variable. The name must follow the naming rules for [Object identifiers](../../sql-reference/identifiers.md).
>
> `type`
> :   The data type of the variable. The data type can be any of the following:
>
>     * A [SQL data type](../../sql-reference-data-types.md)
>
>     * [CURSOR](cursors.md)
>     * [RESULTSET](resultsets.md)
>     * [EXCEPTION](exceptions.md)
>
> `DEFAULT expression` or . `:= expression`
> :   Assigns the value of `expression` to the variable.
>
>     If both `type` and `expression` are specified, the expression must evaluate to a data type that matches.
>     If the types do not match, you can [cast](../../sql-reference/functions/cast.md) the value to the specified `type`.

The following example declares variables in the DECLARE section and in the BEGIN … END section of the block:

```sqlexample
DECLARE
  profit number(38, 2) DEFAULT 0.0;
BEGIN
  LET cost number(38, 2) := 100.0;
  LET revenue number(38, 2) DEFAULT 110.0;

  profit := revenue - cost;
  RETURN profit;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  profit number(38, 2) DEFAULT 0.0;
BEGIN
  LET cost number(38, 2) := 100.0;
  LET revenue number(38, 2) DEFAULT 110.0;

  profit := revenue - cost;
  RETURN profit;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|           10.00 |
+-----------------+
```

The next sections explain how the data type and scope of a variable are determined:

* How Snowflake Scripting infers the data type of a variable
* Understanding the scope of declarations

For information about assigning a value to a variable, see Assigning a value to a declared variable.

### How Snowflake Scripting infers the data type of a variable

When you declare a variable without explicitly specifying the data type, Snowflake Scripting infers the
data type from the expression that you assign to the variable.

If you choose to omit the data type from the declaration, note the following:

* If the expression can resolve to different data types of different sizes, Snowflake typically chooses the type that is flexible
  (for example, FLOAT rather than NUMBER(3, 1)) and has a high storage capacity (for example, VARCHAR rather than VARCHAR(4)).

  For example, if you set a variable to the value `12.3`, Snowflake can choose one of several data types for the variable,
  including:

  + NUMBER(3, 1)
  + NUMBER(38, 1)
  + FLOAT

  In this example, Snowflake chooses FLOAT.

  If you need a specific data type for a variable (especially a numeric or timestamp type), Snowflake recommends that you specify
  the data type explicitly, even if you provide an initial value.
* If Snowflake is unable to infer the intended data type, Snowflake reports a SQL compilation error.

  For example, the following code declares a variable without explicitly specifying the data type. The code sets the variable to
  the value in a cursor.

  ```sqlexample
  ...
  FOR current_row IN cursor_1 DO:
    LET price := current_row.price_column;
    ...
  ```

  When the Snowflake Scripting block is compiled (for example, when the CREATE PROCEDURE command is executed), the cursor has not been
  opened, and the data type of the column in the cursor is unknown. As a result, Snowflake reports a SQL compilation error:

  ```none
  092228 (P0000): SQL compilation error:
    error line 7 at position 4 variable 'PRICE' cannot have its type inferred from initializer
  ```

### Understanding the scope of declarations

Snowflake Scripting uses [lexical scoping](https://en.wikipedia.org/wiki/Scope_(computer_science)#Lexical_scope). When a
variable for a value, result set, cursor, or exception is declared in the DECLARE section of a block, the scope (or visibility)
of the declared object is that block and any blocks nested in that block.

If a block declares an object with the same name as an object declared in an outer block, then within the inner
block (and any blocks inside that block), only the inner block’s object is in scope. When an object name is
referenced, Snowflake looks for the object with that name by starting first in the current block, and then working
outward one block at a time until an object with a matching name is found.

For example, if an exception is declared inside a stored procedure, the exception’s scope is limited to that stored
procedure. Stored procedures called by that stored procedure cannot raise (or handle) that exception. Stored
procedures that call that procedure cannot handle (or raise) that exception.

## Assigning a value to a declared variable

To assign a value to a variable that has already been declared, use the `:=` operator:

```sqlsyntax
<variable_name> := <expression> ;
```

Where:

> `variable_name`
> :   The name of the variable. The name must follow the naming rules for [Object identifiers](../../sql-reference/identifiers.md).
>
> `expression`
> :   The expression is evaluated and the resulting value is assigned to the variable.
>
>     The expression must evaluate to a data type that matches the type of the variable.
>     If the expression does not match the type, you can [cast](../../sql-reference/functions/cast.md) the value to the type of
>     the variable.
>
>     In the expression, you can use functions, including [built-in SQL functions](../../sql-reference-functions.md)
>     and [UDFs](../udf/udf-overview.md) (user-defined functions).

## Using a variable

You can use variables in expressions and with Snowflake Scripting language elements (such as
[RETURN](../../sql-reference/snowflake-scripting/return.md)). You can add these language elements
to [stored procedures](../stored-procedure/stored-procedures-overview.md),
[Snowflake Scripting user-defined functions (UDF)](../udf/sql/udf-sql-procedural-functions.md),
and [anonymous blocks](blocks.md).

For example, the following code uses the variables `revenue` and `cost` in an expression and the
variable `profit` in a RETURN statement:

```sqlexample
DECLARE
  profit NUMBER(38, 2);
  revenue NUMBER(38, 2);
  cost NUMBER(38, 2);
BEGIN
  ...
  profit := revenue - cost;
  ...
RETURN profit;
```

To use a variable in an exception handler (the [EXCEPTION](../../sql-reference/snowflake-scripting/exception.md) section of
a block), the variable must be declared in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section or passed
as an argument to a stored procedure. It can’t be declared in the [BEGIN … END](../../sql-reference/snowflake-scripting/begin.md)
section. For more information, see [Passing variables to an exception handler in Snowflake Scripting](exceptions.md).

> **Tip:**
>
> You can also use and set SQL (session) variables in Snowflake Scripting anonymous blocks and in stored procedures that run with
> caller’s rights. For more information, see [Using and setting SQL variables in a stored procedure](../stored-procedure/stored-procedures-snowflake-scripting.md).

## Using a variable in a SQL statement (binding)

You can use a variable in a SQL statement, which is sometimes referred to as [binding](../../sql-reference/bind-variables.md)
a variable. To do so, prefix the variable name with a colon. For example:

```sqlexample
INSERT INTO my_table (x) VALUES (:my_variable)
```

You can expand a bind variable that represents an [array](../../sql-reference/data-types-semistructured.md) into a list of individual values
by using the spread operator (`**`). For more information, see [Expansion operators](../../sql-reference/operators-expansion.md).

For information about binding variables in Snowflake Scripting stored procedures, see
[Using an argument in a SQL statement (binding)](../stored-procedure/stored-procedures-snowflake-scripting.md).

If you are using the variable as the name of an object (for example, the name of a table in the FROM clause of a SELECT statement), use
the [IDENTIFIER](../../sql-reference/identifier-literal.md) keyword to indicate that the variable represents an object identifier.
For example:

```sqlexample
SELECT COUNT(*) FROM IDENTIFIER(:table_name)
```

If you are using a variable in an expression or with a
[Snowflake Scripting language element](../../sql-reference-snowflake-scripting.md) (for example,
[RETURN](../../sql-reference/snowflake-scripting/return.md)), you do not need to prefix the variable with a colon.

For example, you do not need the colon prefix in the following cases:

* You are using the variable with RETURN. In this example, the variable `profit` is used with a Snowflake Scripting language
  element and does not need the colon prefix.

  ```sqlexample
  RETURN profit;
  ```
* You are building a string containing a SQL statement to execute. In this example, the variable `id_variable` is used in an
  expression and does not need the colon prefix.

  ```sqlexample
  LET select_statement := 'SELECT * FROM invoices WHERE id = ' || id_variable;
  ```

In addition, the [TO_QUERY](../../sql-reference/functions/to_query.md) function provides a simple syntax for accepting a SQL string
directly in the FROM clause of a SELECT statement. For a comparison of the TO_QUERY function with dynamic SQL,
see [Constructing SQL at runtime](../../user-guide/querying-construct-at-runtime.md).

## Setting variables to the results of a SELECT statement

In a Snowflake Scripting block, you can use the [INTO](../../sql-reference/constructs/into.md) clause to set variables to the values of
expressions specified in a SELECT clause:

```sqlsyntax
SELECT <expression1>, <expression2>, ... INTO :<variable1>, :<variable2>, ... FROM ... WHERE ...;
```

When you use this syntax:

* `variable1` is set to the value of `expression1`.
* `variable2` is set to the value of `expression2`.

The SELECT statement must return a single row.

The following example contains a SELECT statement that returns a single row. The example relies on data from this table:

```sqlexample
CREATE OR REPLACE TABLE some_data (id INTEGER, name VARCHAR);
INSERT INTO some_data (id, name) VALUES
  (1, 'a'),
  (2, 'b');
```

The example sets the Snowflake Scripting variables `id` and `name` to the values returned for the columns with those names.

```sqlexample
DECLARE
  id INTEGER;
  name VARCHAR;
BEGIN
  SELECT id, name INTO :id, :name FROM some_data WHERE id = 1;
  RETURN id || ' ' || name;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  id INTEGER;
  name VARCHAR;
BEGIN
  SELECT id, name INTO :id, :name FROM some_data WHERE id = 1;
  RETURN :id || ' ' || :name;
END;
$$
;
```

The example prints out the `id` and `name` from the row returned by the SELECT statement.

```none
+-----------------+
| anonymous block |
|-----------------|
| 1 a             |
+-----------------+
```

## Setting a variable to the return value of a stored procedure

See [Using the value returned from a stored procedure call](../stored-procedure/stored-procedures-snowflake-scripting.md).

## Using stored procedure arguments

You can create [Snowflake Scripting stored procedures](../stored-procedure/stored-procedures-snowflake-scripting.md)
that are passed arguments when they are called. These arguments behave like declared variables in the body of the stored procedure.

Snowflake Scripting supports input (IN) and output (OUT) arguments. The argument type determines how you can use it in a stored
procedure.

For more information, see [Using arguments passed to a stored procedure](../stored-procedure/stored-procedures-snowflake-scripting.md).

## Examples of using variables

The following example shows how to declare a variable, assign a value or expression to a variable, and cast a value to the data
type of a variable:

```sqlexample
DECLARE
  w INTEGER;
  x INTEGER DEFAULT 0;
  dt DATE;
  result_string VARCHAR;
BEGIN
  w := 1;                     -- Assign a value.
  w := 24 * 7;                -- Assign the result of an expression.
  dt := '2020-09-30'::DATE;   -- Explicit cast.
  dt := '2020-09-30';         -- Implicit cast.
  result_string := w::VARCHAR || ', ' || dt::VARCHAR;
  RETURN result_string;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
    w INTEGER;
    x INTEGER DEFAULT 0;
    dt DATE;
    result_string VARCHAR;
BEGIN
    w := 1;                     -- Assign a value.
    w := 24 * 7;                -- Assign the result of an expression.
    dt := '2020-09-30'::DATE;   -- Explicit cast.
    dt := '2020-09-30';         -- Implicit cast.
    result_string := w::VARCHAR || ', ' || dt::VARCHAR;
    RETURN result_string;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| 168, 2020-09-30 |
+-----------------+
```

The following example uses a built-in SQL function in the expression:

```sqlexample
my_variable := SQRT(variable_x);
```

The following declaration implicitly specifies the data types of the variables `profit`, `cost`, and `revenue` by
specifying an initial value of the intended data type for each variable.

The example also demonstrates how to use the [LET](../../sql-reference/snowflake-scripting/let.md) statement to declare the
`cost` and `revenue` variables outside of the DECLARE portion of the block:

```sqlexample
DECLARE
  profit number(38, 2) DEFAULT 0.0;
BEGIN
  LET cost number(38, 2) := 100.0;
  LET revenue number(38, 2) DEFAULT 110.0;

  profit := revenue - cost;
  RETURN profit;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
    profit DEFAULT 0.0;
BEGIN
    LET cost := 100.0;
    LET revenue DEFAULT 110.0;
    profit := revenue - cost;
    RETURN profit;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|              10 |
+-----------------+
```

The following example demonstrates the scope of a variable. This example includes two variables and a parameter that all have the
same name but different scope.

The example contains three blocks: the outermost, middle, and innermost blocks.

* Within the innermost block, PV_NAME resolves to the variable declared and set in that innermost block
  (which is set to `innermost block variable`).
* Within the middle block (and outside of the innermost block), PV_NAME resolves to the variable declared and set in the
  middle block (which is set to `middle block variable`).
* Within the outermost block (and outside any of the nested blocks), PV_NAME resolves to the parameter passed to the stored
  procedure (which is set to `parameter` by the CALL statement).

The example relies on this table:

```sqlexample
CREATE OR REPLACE TABLE names (v VARCHAR);
```

In this example, the assignment of the string `innermost block variable` to PV_NAME in the innermost block does not affect the
value of the variable in the middle block. The variable in the innermost block is different from the variable in the middle block,
even if both variables have the same name.

```sqlexample
CREATE OR REPLACE PROCEDURE duplicate_name(pv_name VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
AS
BEGIN
  DECLARE
    PV_NAME VARCHAR;
  BEGIN
    PV_NAME := 'middle block variable';
    DECLARE
      PV_NAME VARCHAR;
    BEGIN
      PV_NAME := 'innermost block variable';
      INSERT INTO names (v) VALUES (:PV_NAME);
    END;
    -- Because the innermost and middle blocks have separate variables
    -- named "pv_name", the INSERT below inserts the value
    -- 'middle block variable'.
    INSERT INTO names (v) VALUES (:PV_NAME);
  END;
  -- This inserts the value of the input parameter.
  INSERT INTO names (v) VALUES (:PV_NAME);
  RETURN 'Completed.';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE duplicate_name(pv_name VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  DECLARE
    PV_NAME VARCHAR;
  BEGIN
    PV_NAME := 'middle block variable';
    DECLARE
    PV_NAME VARCHAR;
    BEGIN
      PV_NAME := 'innermost block variable';
      INSERT INTO names (v) VALUES (:PV_NAME);
    END;
    -- Because the innermost and middle blocks have separate variables
    -- named "pv_name", the INSERT below inserts the value
    -- 'middle block variable'.
    INSERT INTO names (v) VALUES (:PV_NAME);
  END;
  -- This inserts the value of the input parameter.
  INSERT INTO names (v) VALUES (:PV_NAME);
  RETURN 'Completed.';
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL duplicate_name('parameter');
```

Check the values in the table:

```sqlexample
SELECT *
    FROM names
    ORDER BY v;
```

```output
+--------------------------+
| V                        |
|--------------------------|
| innermost block variable |
| middle block variable    |
| parameter                |
+--------------------------+
```

The output shows that:

* In the innermost nested block (which was nested two layers), the inner block’s variable `PV_NAME` was used.
* In the middle block (which was nested one layer), that middle block’s variable `PV_NAME` was used.
* In the outermost block, the parameter was used.

For an example of binding a variable when opening a cursor, see the
[examples of opening cursors](../../sql-reference/snowflake-scripting/open.md).

---
title: Writing a scalar UDF in Scala
source: https://docs.snowflake.com/en/developer-guide/udf/scala/udf-scala-scalar.md
section: Developer Guide
---

# Writing a scalar UDF in Scala

You can write a scalar user-defined function (UDF) in Scala. The Scala handler code executes when the UDF is called. This topic describes how
to write a handler in Scala and create the UDF.

A UDF is a user-defined function that returns scalar results – meaning a single value rather than multiple rows. For more general
information about UDFs, see [User-defined functions overview](../udf-overview.md).

When you create a UDF, you do the following:

1. Write a Scala object or class with a method that Snowflake will invoke when the UDF is called.

   For more information, see Implementing a handler in this topic.
2. Create the UDF in SQL with the CREATE FUNCTION command, specifying your object or class and method as the handler. When you create the
   UDF, you specify:

   * Data types of UDF input parameters.
   * Data type of the UDF return value.
   * Code to execute as a handler when the UDF is called.
   * The language in which the handler is written.

   For more about CREATE FUNCTION syntax, see [Creating the UDF with CREATE FUNCTION](udf-scala-general.md).

You can call a UDF as described in [Executing a UDF](../udf-calling-sql.md).

## Implementing a handler

You implement an object or class with a handler method to process UDF argument values into the UDF’s return value.

When writing a handler, you:

* Write a public class with a public method to specify as the handler.

  This will be the method that Snowflake invokes when the UDF is called in SQL.

  You can define multiple other methods in the same object or class, then use each as the handler for a different UDF. For example, you
  might want to do this when you intend to keep the compiled handler code on a stage and reference it from multiple functions.

  For more information on a staged handler, refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* Optionally write a zero-argument constructor for Snowflake to invoke to initialize the handler.

> **Note:**
>
> Be sure to write your handler in keeping with the Snowflake-imposed constraints in each handler method and methods it calls.
> For more on these constraints, see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).

## Handler example

Code in the following example includes a `MyHandler.echoVarchar` handler method that receives and returns string. The value received
by the UDF – a VARCHAR – is mapped by Snowflake to the handler method’s parameter type – a String.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='MyHandler.echoVarchar'
  AS
  $$
  class MyHandler {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

```sqlexample-scala
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='MyHandler.echoVarchar'
  AS
  $$
  class MyHandler {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

Call the UDF

```sqlexample
SELECT echo_varchar('Hello');
```

## Initializing the handler

You can optionally initialize your handler by adding a zero-argument constructor.

If the constructor throws an error, the error is thrown as a user error, along with the exception message.

```scala
def this() = {
  // Initialize here.
}
```

## Processing function arguments

To process data passed to the UDF as arguments, implement a public method that Snowflake will invoke when the UDF is called in SQL code.
When you create the UDF with a CREATE FUNCTION command, you’ll use the HANDLER clause to specify the method as the handler.

When declaring a handler method, you:

* Declare the handler method as public.

  You can optionally include a zero-argument constructor to initialize the handler. For more information, refer to
  Initializing the handler in this topic.

  If you intend to package the class into a JAR as a staged handler, you can declare multiple handler methods, later specifying each as a
  handler with the HANDLER clause of a CREATE FUNCTION statement. For more information on a staged handler, refer to
  [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* Specify handler method parameter and return types that map to the SQL types specified by the UDF declaration.

  For more information, refer to [SQL-Scala Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* Optionally declare additional methods to support the handler method’s processing, such as methods to be called from the handler method.

  Code in the following example features a `handleStrings` handler method that calls a non-handler method `concatenate` to
  help process the array received as an argument.

  Scala 2.12Scala 2.13 (Preview)

  ```sqlexample-scala
  CREATE OR REPLACE FUNCTION generate_greeting(greeting_words ARRAY)
    RETURNS VARCHAR
    LANGUAGE SCALA
    RUNTIME_VERSION = 2.12
    HANDLER='StringHandler.handleStrings'
    AS
    $$
    class StringHandler {
      def handleStrings(strings: Array[String]): String = {
        return concatenate(strings)
      }
      private def concatenate(strings: Array[String]): String = {
        var concatenated : String = ""
        for (newString <- strings)  {
            concatenated = concatenated + " " + newString
        }
        return concatenated
      }
    }
    $$;
  ```

  ```sqlexample-scala
  CREATE OR REPLACE FUNCTION generate_greeting(greeting_words ARRAY)
    RETURNS VARCHAR
    LANGUAGE SCALA
    RUNTIME_VERSION = 2.13
    HANDLER='StringHandler.handleStrings'
    AS
    $$
    class StringHandler {
      def handleStrings(strings: Array[String]): String = {
        return concatenate(strings)
      }
      private def concatenate(strings: Array[String]): String = {
        var concatenated : String = ""
        for (newString <- strings)  {
            concatenated = concatenated + " " + newString
        }
        return concatenated
      }
    }
    $$;
  ```

  The following calls the `generate_greeting` function.

  ```sqlexample
  SELECT generate_greeting(['Hello', 'world']);
  ```

  The following illustrates the output from calling `generate_greeting` with the values above.

  ```output
  Hello world
  ```

## Overloading handler methods

You can overload handler methods in the same class or object as long as they have different numbers of parameters.

For Scala UDFs, Snowflake uses only the *number* of method arguments, not their *types*, to differentiate handler methods.
Resolving based on data types is impractical because some SQL data types can be mapped to more than one Scala or Java data type and
thus potentially to more than one handler method signature.

For example, if two Scala methods have the same name and the same number of arguments, but different data types, then calling a UDF using
one of those methods as a handler generates an error similar to the following:

```none
Cannot determine which implementation of handler "handler name" to invoke since there are multiple
definitions with <number of args> arguments in function <user defined function name> with
handler <class name>.<handler name>
```

If a warehouse is available, the error is detected at the time that the UDF is created. Otherwise, the error occurs when the UDF is
called.

---
title: Writing a UDTF in Python
source: https://docs.snowflake.com/en/developer-guide/udf/python/udf-python-tabular-functions.md
section: Developer Guide
---

# Writing a UDTF in Python

You can implement a user-defined [table function](../../../sql-reference/functions-table.md) (UDTF) handler in Python. This handler code
executes when the UDTF is called. This topic describes how to implement a handler in Python and create the UDTF.

A UDTF is a user-defined function (UDF) that returns tabular results. For more about UDF handlers implemented in Python, see
[Creating Python UDFs](udf-python-creating.md). For more general information about UDFs, see [User-defined functions overview](../udf-overview.md).

In the handler for a UDTF, you can process input rows (see Processing rows in this topic). You can also have logic
that executes for each input partition (see Processing partitions in this topic).

When you create a Python UDTF, you do the following:

1. Implement a class with methods that Snowflake will invoke when the UDTF is called.

   For more details, see Implementing a handler in this topic.
2. Create the UDTF in SQL with the CREATE FUNCTION command, specifying your class as the handler. When you create the UDTF, you specify:

   * Data types of UDTF input parameters.
   * Data types of columns returned by the UDTF.
   * Code to execute as a handler when the UDTF is called.
   * The language in which the handler is implemented.

   For more about syntax, see Creating the UDTF with CREATE FUNCTION in this topic.

You can call a UDF or UDTF as described in [Executing a UDF](../udf-calling-sql.md).

> **Note:**
>
> Table functions (UDTFs) have a limit of 500 input arguments and 500 output columns.

Snowflake currently supports writing UDTFs in the following versions of Python:

Generally available versions:

* 3.9 (deprecated)
* 3.10
* 3.11
* 3.12
* 3.13

In your CREATE FUNCTION statement, set `runtime_version` to the desired version.

## Implementing a handler

You implement a handler class to process UDTF argument values into tabular results and handle partitioned input. For a handler class
example, see Handler class example in this topic.

When you create the UDTF with CREATE FUNCTION, you specify this class as the UDTF’s handler. For more on the SQL to create the function,
see Creating the UDTF with CREATE FUNCTION in this topic.

A handler class implements methods Snowflake will invoke when the UDTF is called. This class contains the UDTF’s logic.

| Method | Requirement | Description |
| --- | --- | --- |
| `__init__` method | Optional | Initializes state for stateful processing of input partitions. For more information, see Initializing the handler in this topic. |
| `process` method | Required | Processes each input row, returning a tabular value as tuples. Snowflake invokes this method, passing input from the UDTF’s arguments. For more information, see Defining a process method in this topic. |
| `end_partition` method | Optional | Finalizes processing of input partitions, returning a tabular value as tuples. For more information, see Finalizing partition processing in this topic. |

Note that throwing an exception from any method in the handler class causes processing to stop. The query that called the UDTF fails with
an error message.

> **Note:**
>
> If your code doesn’t meet the requirements described here, UDTF creation or execution may fail. Snowflake will detect violations when the
> CREATE FUNCTION statement executes.

### Handler class example

Code in the following example creates a UDTF whose handler class processes rows in a partition. The `process` method processes each
input row, returning a row with the total cost for a stock sale. After processing rows in the partition, it returns (from its
`end_partition` method) the total for all sales included in the partition.

```sqlexample-python
CREATE OR REPLACE FUNCTION stock_sale_sum(symbol VARCHAR, quantity NUMBER, price NUMBER(10,2))
  RETURNS TABLE (symbol VARCHAR, total NUMBER(10,2))
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'StockSaleSum'
AS $$
class StockSaleSum:
    def __init__(self):
        self._cost_total = 0
        self._symbol = ""

    def process(self, symbol, quantity, price):
      self._symbol = symbol
      cost = quantity * price
      self._cost_total += cost
      yield (symbol, cost)

    def end_partition(self):
      yield (self._symbol, self._cost_total)
$$;
```

Code in the following example calls the preceding UDF, passing values from columns `symbol`, `quantity`, and `price`
from the `stocks_table` table. For more information about calling a UDTF, refer to [Executing a UDF](../udf-calling-sql.md).

```sqlexample
SELECT stock_sale_sum.symbol, total
  FROM stocks_table, TABLE(stock_sale_sum(symbol, quantity, price) OVER (PARTITION BY symbol));
```

### Initializing the handler

You can optionally implement an `__init__` method in your handler class that Snowflake will invoke before the handler has begun
processing rows. For example, you can use this method to establish some partition-scoped state for the handler. Your `__init__`
method may not produce output rows.

The method’s signature must be of the following form:

```python
def __init__(self):
```

For example, you might want to:

* Initialize state for a partition, then use this state in the `process` and `end_partition` methods.
* Execute long-running initialization that needs to be done only once per partition rather than once per row.

> **Note:**
>
> You can also execute logic once before partition handling begins by including that code outside the handler class, such as before the
> class declaration.

For more about processing partitions, see Processing partitions in this topic.

If you use an `__init__` method, keep in mind that `__init__`:

* Can take only `self` as an argument.
* Cannot produce output rows. Use your `process` method implementation for that.
* Is invoked once for each partition, and before the `process` method is invoked.

### Processing rows

Implement a `process` method that Snowflake will invoke for each input row.

#### Defining a `process` method

Define a `process` method that receives as values the UDTF arguments converted from SQL types, returning data that Snowflake will
use to create the UDTF’s tabular return value.

The method’s signature must be of the following form:

```python
def process(self, *args):
```

Your `process` method must:

* Have a `self` parameter.
* Declare method parameters corresponding to UDTF parameters.

  Method parameter names needn’t match UDTF parameter names, but the method parameters must be declared *in the same order* as UDTF
  parameters are declared.

  When passing UDTF argument values to your method, Snowflake will convert the values from SQL types to the Python types you use in the
  method. For information about how Snowflake maps between SQL and Python data types, see [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).
* Yield one or more tuples (or return an iterable containing tuples), in which the sequence of tuples corresponds to the sequence of UDTF
  return value columns.

  The tuple elements must appear *in the same order* as UDTF return value columns are declared. For more information, see
  Returning a value in this topic.

  Snowflake will convert values from Python types to SQL types required by the UDTF declaration. For information about how Snowflake maps
  between SQL and Python data types, see [SQL-Python Data Type Mappings](../../udf-stored-procedure-data-type-mapping.md).

If a method in the handler class throws an exception, processing will stop. The query that called the UDTF will fail with an
error message. If the `process` method returns `None`, processing stops. (The `end_partition` method is still invoked even if
the `process` method returns `None`.)

**process Method Example**

Code in the following example shows a `StockSale` handler class with a `process` method that processes three UDTF arguments
(`symbol`, `quantity`, and `price`), returning a single row with two columns (`symbol` and `total`). Note that
`process` method parameters are declared in the same order as `stock_sale` parameters. Arguments in the `process`
method’s `yield` statement are in the same order as columns declared in the `stock_sale` RETURNS TABLE clause.

```sqlexample-python
CREATE OR REPLACE FUNCTION stock_sale(symbol VARCHAR, quantity NUMBER, price NUMBER(10,2))
  RETURNS TABLE (symbol VARCHAR, total NUMBER(10,2))
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'StockSale'
AS $$
class StockSale:
    def process(self, symbol, quantity, price):
      cost = quantity * price
      yield (symbol, cost)
$$;
```

Code in the following example calls the preceding UDF, passing values from columns `symbol`, `quantity`, and `price`
from the `stocks_table` table.

```sqlexample
SELECT stock_sale.symbol, total
  FROM stocks_table, TABLE(stock_sale(symbol, quantity, price) OVER (PARTITION BY symbol));
```

#### Returning a value

When returning output rows, you can use either `yield` or `return` (but not both) to return tuples with the tabular value. If
the method returns or yields `None`, processing for the current row stops.

* When using `yield`, execute a separate `yield` statement for each output row. This is the best practice because the lazy
  evaluation that comes with `yield` enables more efficient processing and can help avoid timeouts.

  Each element in the tuple becomes a column value in the result returned by the UDTF. The order of `yield` arguments must
  match the order of columns declared for the return value in the RETURNS TABLE clause of CREATE FUNCTION.

  Code in the following example returns values representing two rows.

  ```python
  def process(self, symbol, quantity, price):
    cost = quantity * price
    yield (symbol, cost)
    yield (symbol, cost)
  ```

  Note that because the yield argument is a tuple, you must include a trailing comma when passing a single value in the tuple, as in the
  following example.

  ```python
  yield (cost,)
  ```
* When using `return`, return an iterable with tuples.

  Each value in a tuple becomes a column value in the result returned
  by the UDTF. The order of column values in a tuple must match the order of columns declared for the return value in the RETURNS TABLE
  clause of CREATE FUNCTION.

  Code in the following example returns two rows, each with two columns: symbol and total.

  ```python
  def process(self, symbol, quantity, price):
    cost = quantity * price
    return [(symbol, cost), (symbol, cost)]
  ```

#### Skipping rows

To skip an input row and process the next row (such as when you’re validating the input rows), have the `process` method return one
of the following:

* When using `return`, return `None`, a list containing `None`, or an empty list to skip the row.
* When using `yield`, return `None` to skip a row.

  Note that if you have multiple calls to `yield`, any calls after a call that returns `None` will be ignored by Snowflake.

Code in the following example returns only the rows for which `number` is a positive integer. If `number` is not positive, the
method returns `None` to skip the current row and continue processing the next row.

```python
def process(self, number):
  if number < 1:
    yield None
  else:
    yield (number)
```

#### Stateful and stateless processing

You can implement the handler to process rows in a partition-aware manner or to process them simply row by row.

* In **partition-aware processing**, the handler includes code to manage partition-scoped state. This includes an `__init__` method
  that executes at the start of partition processing and an `end_partition` method that Snowflake invokes after processing the
  partition’s last row. For more information, see Processing partitions in this topic.
* In **partition-unaware processing**, the handler executes statelessly, ignoring partition boundaries.

  To have the handler execute this way, do not include an `__init__` or `end_partition` method.

### Processing partitions

You can process partitions in input with code that executes per partition (such as to manage state) as well as code that executes for each
row in the partition.

> **Note:**
>
> For more information on specifying partitions when calling a UDTF, refer to [Table functions and partitions](../udf-calling-sql.md).

When a query includes partitions, it aggregates rows using a specified value, such as the value of a column. The aggregated rows your
handler receives are said to be partitioned by that value. Your code can process these partitions and their rows so that the
processing for each partition includes partition-scoped state.

Code in the following SQL example queries for stock sale information. It executes a `stock_sale_sum` UDTF whose input is
partitioned by the value of the `symbol` column.

```sqlexample
SELECT stock_sale_sum.symbol, total
  FROM stocks_table, TABLE(stock_sale_sum(symbol, quantity, price) OVER (PARTITION BY symbol));
```

Keep in mind that even when incoming rows are partitioned, your code can ignore the partition separation and just process the rows. For
example, you can omit code designed to handle partition-scoped state, such as a handler class `__init__` method and
`end_partition` method, and just implement the `process` method. For more information, see
Stateful and stateless processing in this topic.

To process each partition as a unit, you would:

* Implement a handler class `__init__` method in which to initialize processing for the partition.

  For more information, see Initializing the handler in this topic.
* Include partition-aware code when processing each row with the `process` method.

  For more information on processing rows, see Processing rows in this topic.
* Implement an `end_partition` method to finalize partition processing.

  For more information, see Finalizing partition processing in this topic.

The following describes the sequence of invocations to your handler when you’ve included code designed to execute per partition.

1. When processing for a partition starts, and before the first row has been processed, Snowflake uses the `__init__` method of your
   handler class to create an instance of the class.

   Here, you can establish partition-scoped state. For example, you might initialize an instance variable to hold a value
   calculated from rows in the partition.
2. For each row in the partition, Snowflake invokes the `process` method.

   Each time the method executes, it can make changes to state values. For example, you might have the `process` method update the
   value of the instance variable.
3. After your code has processed the last row in the partition, Snowflake invokes your `end_partition` method.

   From this method you can return output rows containing a partition-level value you want to return. For example, you might return the
   value of the instance variable you’ve been updating as you processed rows in the partition.

   Your `end_partition` method won’t receive any arguments from Snowflake, which simply invokes it after you process the last row in the
   partition.

#### Finalizing partition processing

You can optionally implement an `end_partition` method in your handler class that Snowflake will invoke after you have processed all
rows in a partition. In this method, you can execute code for a partition after all of the partition’s rows have been processed.
Your `end_partition` method may produce output rows, such as to return the results of a partition-scoped calculation. For more
information, see Processing partitions in this topic.

The method’s signature must be of the following form:

```python
def end_partition(self):
```

Snowflake expects the following of your `end_partition` method implementation:

> * It must not be static.
> * It may not have any parameters other than `self`.
> * As an alternative to returning a tabular value, it may produce an empty list or `None`.

> **Note:**
>
> While Snowflake supports large partitions with timeouts tuned to process them successfully, especially large partitions can cause
> processing to time out (such as when `end_partition` takes too long to complete). Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you need the
> timeout threshold adjusted for specific usage scenarios.

#### Partition handling example

Code in the following example calculates the total cost paid across purchases for a stock by first calculating the cost per purchase and
adding purchases together (in the `process` method). The code returns the total in the `end_partition` method.

For an example of a UDTF that includes this handler, along with calling the UDTF, refer to Handler class example.

```python
class StockSaleSum:
  def __init__(self):
    self._cost_total = 0
    self._symbol = ""

  def process(self, symbol, quantity, price):
    self._symbol = symbol
    cost = quantity * price
    self._cost_total += cost
    yield (symbol, cost)

  def end_partition(self):
    yield (self._symbol, self._cost_total)
```

When processing partitions, keep in mind the following:

* Your code may handle partitions that aren’t explicitly specified in a call to the UDTF. Even when a call to the UDTF doesn’t include
  a PARTITION BY clause, Snowflake partitions the data implicitly.
* Your `process` method will receive row data in the order specified by the partition’s ORDER BY clause, if any.

## Examples

### Using an imported package

You can use Python packages that are included in a curated list of third party packages from Anaconda available in Snowflake. To specify
these packages as dependencies in the UDTF, use the PACKAGES clause in CREATE FUNCTION.

You can discover the list of included packages by executing the following SQL in Snowflake:

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'python';
```

For more information, see [Using third-party packages](udf-python-packages.md) and [Creating Python UDFs](udf-python-creating.md).

Code in the following example uses a function in the [NumPy (Numerical Python)](https://numpy.org/doc/stable/reference/index.html)
package to calculate the average price per share from an array of stock purchases, each with a different price per share.

```sqlexample-python
CREATE OR REPLACE FUNCTION stock_sale_average(symbol VARCHAR, quantity NUMBER, price NUMBER(10,2))
  RETURNS TABLE (symbol VARCHAR, total NUMBER(10,2))
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  PACKAGES = ('numpy')
  HANDLER = 'StockSaleAverage'
AS $$
import numpy as np

class StockSaleAverage:
    def __init__(self):
      self._price_array = []
      self._quantity_total = 0
      self._symbol = ""

    def process(self, symbol, quantity, price):
      self._symbol = symbol
      self._price_array.append(float(price))
      cost = quantity * price
      yield (symbol, cost)

    def end_partition(self):
      np_array = np.array(self._price_array)
      avg = np.average(np_array)
      yield (self._symbol, avg)
$$;
```

Code in the following example calls the preceding UDF, passing values from columns `symbol`, `quantity`, and `price`
from the `stocks_table` table. For more information about calling a UDTF, refer to [Executing a UDF](../udf-calling-sql.md).

```sqlexample
SELECT stock_sale_average.symbol, total
  FROM stocks_table,
  TABLE(stock_sale_average(symbol, quantity, price)
    OVER (PARTITION BY symbol));
```

### Running concurrent tasks with worker processes

You can run concurrent tasks using Python worker processes. You might find this useful when you need to run parallel tasks that take
advantage of multiple CPU cores on warehouse nodes.

> **Note:**
>
> Snowflake recommends that you not use the built-in Python multiprocessing module.

To work around cases where the [Python Global Interpreter Lock](https://wiki.python.org/moin/GlobalInterpreterLock) prevents a
multi-tasking approach from scaling across all CPU cores, you can execute concurrent tasks using separate worker processes, rather than threads.

You can do this on Snowflake warehouses by using the `joblib` library’s `Parallel` class, as in the following example.

```sqlexample-python
CREATE OR REPLACE FUNCTION joblib_multiprocessing_udtf(i INT)
  RETURNS TABLE (result INT)
  LANGUAGE PYTHON
  RUNTIME_VERSION = 3.12
  HANDLER = 'JoblibMultiprocessing'
  PACKAGES = ('joblib')
AS $$
import joblib
from math import sqrt

class JoblibMultiprocessing:
  def process(self, i):
    pass

  def end_partition(self):
    result = joblib.Parallel(n_jobs=-1)(joblib.delayed(sqrt)(i ** 2) for i in range(10))
    for r in result:
      yield (r, )
$$;
```

> **Note:**
>
> The default backend used for `joblib.Parallel` differs between Snowflake standard and Snowpark-optimized warehouses.
>
> * Standard warehouse default: `threading`
> * Snowpark-optimized warehouse default: `loky` (multiprocessing)
>
> You can override the default backend setting by calling the `joblib.parallel_backend` function, as in the following example.
>
> ```python
> import joblib
> joblib.parallel_backend('loky')
> ```

## Creating the UDTF with `CREATE FUNCTION`

You create a UDTF in SQL using the CREATE FUNCTION command, specifying the code you wrote as the handler. For the command reference, see
[CREATE FUNCTION](../../../sql-reference/sql/create-function.md).

Use the following syntax when creating a UDTF.

```sqlsyntax
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS TABLE ( <output_column_name> <output_column_type> [, <output_column_name> <output_column_type> ... ] )
  LANGUAGE PYTHON
  [ IMPORTS = ( '<imports>' ) ]
  RUNTIME_VERSION = 3.12
  [ PACKAGES = ( '<package_name>' [, '<package_name>' . . .] ) ]
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  HANDLER = '<handler_class>'
  [ AS '<python_code>' ]
```

To associate the handler code you’ve written with the UDTF, you do the following when executing CREATE FUNCTION:

* In RETURNS TABLE, specify output columns in column name and type pairs.
* Set LANGUAGE to PYTHON.
* Set the IMPORTS clause value to the path and name of the handler class if the class is in an external location, such as on a stage.

  For more information, see [Creating Python UDFs](udf-python-creating.md).
* Set RUNTIME_VERSION to the version of the Python runtime that your code requires. The supported versions of Python are:

  Generally available versions:

  + 3.9 (deprecated)
  + 3.10
  + 3.11
  + 3.12
  + 3.13
* Set the PACKAGES clause value to the name of one or more packages, if any, required by the handler class.

  For more information, see [Using third-party packages](udf-python-packages.md) and [Creating Python UDFs](udf-python-creating.md).
* Set the HANDLER clause value to the name of the handler class.

  When associating Python handler code with a UDTF, you can either include the code in-line or refer to it at a location on a Snowflake stage. The HANDLER value is case-sensitive and must match the name of the Python class.

  For more information, see [UDFs with in-line code vs. UDFs with code uploaded from a stage](udf-python-creating.md).

  > **Important:**
  >
  > For a scalar Python UDF, the HANDLER clause value contains the method name.
  >
  > For a Python UDTF, the HANDLER clause value contains the class name but not a method name.
  >
  > The reason for the difference is that for a scalar Python UDF, the name of the handler method is chosen by the user and
  > therefore not known in advance by Snowflake, but for a Python UDTF, the names of the methods (such as the
  > `end_partition` method) are known because they must match the names specified by Snowflake.
* The `AS '<python_code>'` clause is required if the handler code is specified in-line with CREATE FUNCTION.

---
title: Writing code to support different Scala versions
source: https://docs.snowflake.com/en/developer-guide/scala-version-differences.md
section: Developer Guide
---

# Writing code to support different Scala versions

You can write code to support different versions of Scala. For some Snowflake features, you’ll need to account for the Scala version you’re
using. For example, when you’re declaring a stored procedure with SQL, you’ll need to reference the Snowpark package with a name that ends
with the Scala version you’re using.

For Scala code differences between the versions, please see Scala documentation.

## Referencing Snowpark packages

When you reference the Snowpark package in your code, such as when you’re declaring a stored procedure with SQL, the package name will
depend on the following:

* The version of the Snowpark package you’re using.
* The version of Scala you’re using.

The following describes how to reference the Snowpark package for different Scala versions.

### Names for Snowpark package versions 1.16 and earlier

When referencing Snowpark package version 1.16 and earlier, you reference the package with the name `com.snowflake:snowpark` – in
other words, without the Scala version suffix.

* Scala 2.12: `com.snowflake:snowpark:1.16`
* Scala 2.13: `com.snowflake:snowpark:1.16`

### Names for Snowpark package versions 1.17 and later

When referencing the Snowpark package 1.17 and later, you reference the package with the name `com.snowflake:snowpark_<scala_version>`.

* Scala 2.12: `com.snowflake:snowpark_2.12:latest`
* Scala 2.13: `com.snowflake:snowpark_2.13:latest`

### Examples

The following examples show how to reference the Snowpark package versions 1.17 and later.

Scala 2.12Scala 2.13 (Preview)

```sqlexample-scala
CREATE OR REPLACE PROCEDURE MYPROC(value INT, fromTable STRING, toTable STRING, count INT)
  RETURNS INT
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.12'
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  IMPORTS = ('@mystage/MyCompiledJavaCode.jar')
  HANDLER = 'MyJavaClass.run';
```

```sqlexample-scala
CREATE OR REPLACE PROCEDURE MYPROC(value INT, fromTable STRING, toTable STRING, count INT)
  RETURNS INT
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.13'
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  IMPORTS = ('@mystage/MyCompiledJavaCode.jar')
  HANDLER = 'MyJavaClass.run';
```

---
title: Writing Java handlers for stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/java/procedure-java-overview.md
section: Developer Guide
---

# Writing Java handlers for stored procedures created with SQL

You can create a stored procedure whose handler is written in Java. You can use the [Snowpark library](../../snowpark/java/setup.md)
within your stored procedure to perform queries, updates, and other work on tables in Snowflake.

With stored procedures, you can build and run your data pipeline within Snowflake, using a Snowflake warehouse as the
compute framework. For the code for your data pipeline, you use the Snowpark API for Java to write stored procedures. To schedule
the execution of these stored procedures, you use [tasks](../../../user-guide/tasks-intro.md).

You can capture log and trace data as your handler code executes. For more information, refer to
[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

## Writing a Java handler for a stored procedure

1. Be sure your environment meets the prerequisites.
2. If you’re developing locally, set up your environment to use Snowpark.
3. Choose whether to deploy your handler [inline or on a stage](../../inline-or-staged.md).
4. Follow guidelines for the handler class, method,
   and performance.
5. Implement support for features such as [data access](procedure-java-access-data.md),
   [file reading](procedure-java-read-files.md),
   [returning tabular data](procedure-java-tabular-data.md), and
   [logging and tracing](../../logging-tracing/logging-java.md).
6. Make your code’s dependencies available on Snowflake.
7. Include your handler code inline or imported from a stage when you
   [create the stored procedure](../stored-procedures-creating.md).

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

## Prerequisites

You must use version 1.3.0 or a more recent version of the Snowpark library.

If you are writing a stored procedure, you must compile your classes to run in one of the following versions of Java:

* 11.x
* 17.x

## Setting up your development environment for Snowpark

If you’re developing your code locally, set up your development environment to use the Snowpark library.
See [Setting Up Your Development Environment for Snowpark Java](../../snowpark/java/setup.md).

## Structuring and building handler code

You can keep handler source code in-line with the SQL that creates the procedure or keep handler compiled result in a separate location
and reference it from the SQL. For more information, see [Keeping handler code in-line or on a stage](../../inline-or-staged.md).

For more on building handler source code for use with a procedure, see [Packaging Handler Code](../../udf-stored-procedure-building.md).

## Guidelines for the handler class

When writing the class, note the following:

* The class and method must not be protected or private.
* If the method is not static and you want to define a constructor, define a zero-argument constructor for the class.
  Snowflake invokes this zero-argument constructor at initialization time to create an instance of your class.
* You can define different methods for different stored procedures in the same class.

## Guidelines for the handler method

When writing the method for the stored procedure, note the following:

* Specify the Snowpark `Session` object as the first argument of your method.

  When you call your stored procedure, Snowflake automatically creates a `Session` object and passes it to your stored
  procedure. (You cannot create the `Session` object yourself.)
* For the rest of the arguments and for the return value, use the [Java types](../../udf-stored-procedure-data-type-mapping.md) that
  correspond to [Snowflake data types](../../../sql-reference-data-types.md).
* Your method must return a value. For stored procedures in Java, a return value is required.
* Stored procedure execution will time out unless the timer is reset by the code’s activity. In particular, the timeout timer is reset
  by the code’s interactions with data, including file operations, queries, and iterating through a result set.
* When you run an [asynchronous child job](../../snowpark/java/working-with-dataframes.md) from within a procedure’s handler, “fire
  and forget” is not supported.

  In other words, if the handler issues a child query that is still running when the parent procedure job completes, the child job is
  canceled automatically.

## Handling errors

You can use the normal Java exception-handling techniques to catch errors within handler code.

If an uncaught exception occurs inside the method, Snowflake raises an error that includes the stack trace for the exception. When
[logging of unhandled exceptions](../../logging-tracing/unhandled-exception-messages.md) is enabled, Snowflake logs data
about unhandled exceptions in an event table.

## Guidelines for handler performance and security

To ensure that your code runs well on Snowflake, follow these guidelines:

* Limit the amount of memory consumed.

  Snowflake places limits on a method in terms of the amount of memory needed. For more information on how to avoid consuming too much,
  see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).
* Write thread-safe code.

  Make sure that your handler method or function is thread safe.
* Understand the security restrictions.

  Your handler code runs within a restricted engine, so be sure to follow the rules described in
  [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).
* Decide on using owner’s rights or caller’s rights.

  When planning to write your stored procedure, consider whether you want the stored procedure to run with
  [caller’s rights or owner’s rights](../stored-procedures-rights.md).
* Keep in mind the timeout behavior for stored procedures.

  Stored procedure execution will time out unless the timer is reset by the code’s activity. In particular, the timeout timer is reset
  by the code’s interactions with data, including file operations, queries, and iterating through a result set.

## Making dependencies available to your code

If your handler code depends on code defined outside the handler itself (such as classes in a JAR file) or on resource files, you can make
those dependencies available to your code by uploading them to a stage. When
[creating the procedure](../stored-procedures-creating.md), you can reference these dependencies using the IMPORTS
clause.

For more information, see [Making dependencies available to your code](../../upload-dependencies.md).

---
title: Writing Scala handlers for stored procedures created with SQL
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/scala/procedure-scala-overview.md
section: Developer Guide
---

# Writing Scala handlers for stored procedures created with SQL

You can create a stored procedure whose handler is written in Scala. You can use the Snowpark library within your stored procedure to
perform queries, updates, and other work on tables in Snowflake.

With stored procedures, you can build and run your data pipeline within Snowflake, using a Snowflake warehouse as the
compute framework. For the code for your data pipeline, you use the Snowpark API for Scala to write stored procedures. To schedule
the execution of these stored procedures, you use [tasks](../../../user-guide/tasks-intro.md).

You can capture log and trace data as your handler code executes. For more information, refer to
[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

## Write a Scala handler for a stored procedure

1. Make sure your environment meets the prerequisites.
2. If you’re developing locally, set up your environment to use Snowpark.
3. Choose whether to deploy your handler [inline or on a stage](../../inline-or-staged.md).
4. Follow guidelines for the handler class or object,
   method or function,
   and performance.
5. Implement support for features such as [data access](procedure-scala-access-data.md),
   [file reading](procedure-scala-read-files.md),
   [returning tabular data](procedure-scala-tabular-data.md), and
   [logging and tracing](../../logging-tracing/logging-scala.md).
6. Make your code’s dependencies available on Snowflake.
7. Include your handler code inline or imported from a stage when you
   [create the stored procedure](../stored-procedures-creating.md).

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

## Meet prerequisites

Snowflake currently supports the following versions of Scala:

[Preview Feature](../../../release-notes/preview-features.md) — Open

Support for version 2.13 is in preview. Available to all accounts.

* 2.13
* 2.12

For more information, see [Writing code to support different Scala versions](../../scala-version-differences.md).

You must use version 1.1.0 or a more recent version of the Snowpark library.

If you are writing a stored procedure whose handler code will be copied to a stage, you must compile your classes to run in Java version
11.x.

## Set up your development environment for Snowpark

If you’re developing your code locally, set up your development environment to use the Snowpark library. See
[Setting Up Your Development Environment for Snowpark Scala](../../snowpark/scala/setup.md).

### Structure and building handler code

You can keep handler source code in-line with the SQL that creates the procedure or keep handler compiled result in a separate location
and reference it from the SQL. For more information, see [Keeping handler code in-line or on a stage](../../inline-or-staged.md).

For more on building handler source code for use with a procedure, see [Packaging Handler Code](../../udf-stored-procedure-building.md).

## Guidelines for the handler class or object

When writing the handler class or object, note the following:

* The class (or object) and method must not be protected or private.
* If the method is not static and you want to define a constructor, define a zero-argument constructor for the class.
  Snowflake invokes this zero-argument constructor at initialization time to create an instance of your class.
* You can define different methods for different stored procedures in the same class or object.

## Guidelines for the handler method or function

When writing the method or function for a stored procedure, note the following:

* Specify the Snowpark `Session` object as the first argument of your method or function.

  When you call your stored procedure, Snowflake automatically creates a `Session` object and passes it to your stored
  procedure. (You cannot create the `Session` object yourself.)
* For the rest of the arguments and for the return value, use the [Scala types](../../udf-stored-procedure-data-type-mapping.md) that
  correspond to [Snowflake data types](../../../sql-reference-data-types.md).
* Your method or function must return a value.
* Stored procedure execution times out unless the timer is reset by the code’s activity. In particular, the timeout timer is reset
  by the code’s interactions with data, including file operations, queries, and iterating through a result set.
* When you run an [asynchronous child job](../../snowpark/scala/working-with-dataframes.md) from within a procedure’s handler, “fire
  and forget” is not supported.

  In other words, if the handler issues a child query that is still running when the parent procedure job completes, the child job is
  canceled automatically.

## Guidelines for handler performance and security

To ensure that your code runs well on Snowflake, follow these guidelines:

* Limit the amount of memory consumed.

  Snowflake places limits on a method in terms of the amount of memory needed. For more information on how to avoid consuming too much,
  see [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).
* Write thread-safe code.

  Make sure that your handler method or function is thread safe.
* Understand the security restrictions.

  Your handler code runs within a restricted engine, so be sure to follow the rules described in
  [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).
* Decide on using owner’s rights or caller’s rights.

  When planning to write your stored procedure, consider whether you want the stored procedure to run with
  [caller’s rights or owner’s rights](../stored-procedures-rights.md).
* Keep in mind the timeout behavior for stored procedures.

  Stored procedure execution times out unless the timer is reset by the code’s activity. In particular, the timeout timer is reset
  by the code’s interactions with data, including file operations, queries, and iterating through a result set.

## Make dependencies available to your code

If your handler code depends on code defined outside the handler itself (such as classes in a JAR file) or on resource files, you can make
those dependencies available to your code by uploading them to a stage. When
[creating the procedure](../stored-procedures-creating.md), you can reference these dependencies using the IMPORTS
clause.

For more information, see [Making dependencies available to your code](../../upload-dependencies.md).

---
title: Writing stored procedures in JavaScript
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-javascript.md
section: Developer Guide
---

# Writing stored procedures in JavaScript

This topic explains how to write the JavaScript code for a stored procedure.

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

You can capture log and trace data as your handler code executes. For more information, refer to
[Logging, tracing, and metrics](../logging-tracing/logging-tracing-overview.md).

## Understanding the JavaScript API

The JavaScript API for stored procedures is similar to, but not identical to, the APIs in Snowflake connectors and drivers
(Node.js, JDBC, Python, etc.).

The API enables you to perform operations such as:

* Execute a SQL statement.
* Retrieve the results of a query (i.e. a result set).
* Retrieve metadata about the result set (number of columns, data types of the columns, etc.).

These operations are carried out by calling methods on the following objects:

* `snowflake`, which has methods to create a `Statement` object and execute a SQL command.
* `Statement`, which helps you execute prepared statements and access metadata for those prepared statements,
  and allows you to get back a ResultSet object.
* `ResultSet`, which holds the results of a query (e.g. the rows of data retrieved for a SELECT statement).
* `SfDate`, which is an extension of JavaScript Date (with additional methods) and serves as a return type for
  the Snowflake SQL data types TIMESTAMP_LTZ, TIMESTAMP_NTZ, and TIMESTAMP_TZ.

These objects are described in detail in the [JavaScript stored procedures API](stored-procedures-api.md).

A typical stored procedure contains code similar to the following pseudo-code:

> ```javascript
> var my_sql_command1 = "delete from history_table where event_year < 2016";
> var statement1 = snowflake.createStatement(my_sql_command1);
> statement1.execute();
>
> var my_sql_command2 = "delete from log_table where event_year < 2016";
> var statement2 = snowflake.createStatement(my_sql_command2);
> statement2.execute();
> ```

This code uses an object named `snowflake`, which is a special object
that exists without being declared. The object is provided inside the context of each stored
procedure and exposes the API to allow you to interact with the server.

The other variables (e.g. `statement1`) are created with JavaScript `var` statements. For example:

> ```javascript
> var statement1 = ...;
> ```

As shown in the code sample above, the `snowflake` object allows you
to create a `Statement` object by calling one of the methods in the API.

Here’s an example that retrieves a `ResultSet` and iterates through it:

> ```sqlexample
> CREATE OR REPLACE PROCEDURE read_result_set()
>   RETURNS FLOAT NOT NULL
>   LANGUAGE JAVASCRIPT
>   AS
>   $$
>     var my_sql_command = "select * from table1";
>     var statement1 = snowflake.createStatement( {sqlText: my_sql_command} );
>     var result_set1 = statement1.execute();
>     // Loop through the results, processing one row at a time...
>     while (result_set1.next())  {
>        var column1 = result_set1.getColumnValue(1);
>        var column2 = result_set1.getColumnValue(2);
>        // Do something with the retrieved values...
>        }
>   return 0.0; // Replace with something more useful.
>   $$
>   ;
> ```

The Examples section (at the end of this topic) provides additional examples
that exercise each of the objects, and many of the methods, in the stored procedure JavaScript API.

## SQL and JavaScript data type mapping

When calling, using, and getting values back from stored procedures, you often need to convert from a Snowflake SQL
data type to a JavaScript data type or vice versa.

SQL to JavaScript conversion can occur when:

* Calling a stored procedure with an argument. The argument is a SQL data type; when it is stored inside a
  JavaScript variable inside the stored procedure, it must be converted.
* When retrieving a value from a ResultSet object into a JavaScript variable. The ResultSet holds the value as a SQL
  data type, and the JavaScript variable must store the value as one of the JavaScript data types.

JavaScript to SQL conversion can occur when:

* Returning a value from the stored procedure. The `return` statement typically contains a JavaScript
  variable that must be converted to a SQL data type.
* When dynamically constructing a SQL statement that uses a value in a JavaScript variable.
* When binding a JavaScript variable’s value to a prepared statement.

For more information about how Snowflake maps JavaScript and SQL data types, see [SQL-JavaScript Data Type Mappings](../udf-stored-procedure-data-type-mapping.md).

## General tips

### Line continuation

SQL statements can be quite long, and it is not always practical to fit them on a single line. JavaScript treats a
newline as the end of a statement. If you want to split a long SQL statement across multiple lines, you can use
the usual JavaScript techniques for handling long strings, including:

* Put a backslash (line continuation character) immediately prior to the end of the line. For example:

  ```javascript
  var sql_command = "SELECT * \
                         FROM table1;";
  ```
* Use backticks (single backquotes) rather than double quotes around the string. For example:

  ```javascript
  var sql_command = `SELECT *
                         FROM table1;`;
  ```
* Accumulate the string. For example:

  ```javascript
  var sql_command = "SELECT col1, col2"
  sql_command += "     FROM table1"
  sql_command += "     WHERE col1 >= 100"
  sql_command += "     ORDER BY col2;"
  ```

## JavaScript stored procedure considerations

### JavaScript Number Range

The range for numbers with precision intact is from

> -(2^53 -1)

to

> (2^53 -1)

The range of valid values in Snowflake NUMBER(p, s) and DOUBLE data types is larger. Retrieving a value from Snowflake
and storing it in a JavaScript numeric variable can result in loss of precision. For example:

> ```javascript
> CREATE OR REPLACE FUNCTION num_test(a double)
>   RETURNS string
>   LANGUAGE JAVASCRIPT
> AS
> $$
>   return A;
> $$
> ;
> ```
>
> ```javascript
> select hash(1) AS a,
>        num_test(hash(1)) AS b,
>        a - b;
> +----------------------+----------------------+------------+
> |                    A | B                    |      A - B |
> |----------------------+----------------------+------------|
> | -4730168494964875235 | -4730168494964875000 | -235.00000 |
> +----------------------+----------------------+------------+
> ```

The first two columns should match, and the third should contain 0.0.

The problem applies to JavaScript user-defined functions (UDFs) and stored procedures.

If you experience the problem in stored procedures when using `getColumnValue()`, you might be able to avoid the
problem by retrieving a value as a string, e.g. with:

```javascript
getColumnValueAsString()
```

You can then return the string from the stored procedure, and cast the string to a numeric data type in SQL.

### JavaScript error handling

Because a stored procedure is written in JavaScript, it can use JavaScript’s try/catch syntax.

The stored procedure can throw a pre-defined exception or a custom exception. A simple example of throwing a
custom exception is here.

You can execute your SQL statements inside a try block. If an error occurs, then your catch block can roll back all of
the statements (if you put the statements in a transaction). The Examples section contains an example of
rolling back a transaction in a stored procedure.

### Restrictions on stored procedures

Stored procedures have the following restrictions:

* The JavaScript code cannot call the JavaScript `eval()` function.
* JavaScript stored procedures support access to the standard JavaScript library. Note that this excludes many objects and methods
  typically provided by browsers. There is no mechanism to import, include, or call additional libraries.
  Allowing 3rd-party libraries could create security holes.
* JavaScript code is executed within a restricted engine, preventing system calls from the JavaScript
  context (e.g. no network and disk access), and constraining the system resources available to the engine, specifically memory.

### Case-sensitivity in JavaScript arguments

Argument names are case-insensitive in the SQL portion of the stored procedure code, but are
case-sensitive in the JavaScript portion.

For stored procedures (and UDFs) that use JavaScript, identifiers (such as
argument names) in the SQL portion of the statement are converted to uppercase automatically (unless you delimit the
identifier with double quotes), while argument names in the JavaScript portion
will be left in their original case. This can cause your stored procedure to
fail without returning an explicit error message because the arguments aren’t seen.

Here is an example of a stored procedure in which the name of an argument in the
JavaScript code does not match the name of the argument in the SQL code merely
because the case will be different:

In the example below, the first assignment statement is incorrect because the name `argument1` is in lower case.

```sqlexample-javascript
CREATE PROCEDURE f(argument1 VARCHAR)
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
AS
$$
var local_variable1 = argument1;  // Incorrect
var local_variable2 = ARGUMENT1;  // Correct
$$;
```

Using uppercase identifiers (especially argument names) consistently across
your SQL statements and JavaScript code tends to reduce silent errors.

### JavaScript delimiters

The JavaScript portion of the stored procedure code must be enclosed within either single quotes `'` or
double dollar signs `$$`.

Using `$$` makes it easier to handle JavaScript code that contains single quotes without “escaping” those quotes.

### Overloading stored procedure names

For information about overloading and naming conventions, see [Naming and overloading procedures and UDFs](../udf-stored-procedure-naming-conventions.md).

### Binding variables

[Binding](../../sql-reference/bind-variables.md) a variable to a SQL statement allows you to use the value of
the variable in the statement.

You can bind NULL values as well as non-NULL values.

The data type of the variable should be appropriate for the use of the value in the SQL statement. Currently, only
JavaScript variables of type number, string, and [SfDate](stored-procedures-api.md) can be bound. (For details about the
mapping between SQL data types and JavaScript data types, see SQL and JavaScript data type mapping.)

Here is a short example of binding:

```javascript
var stmt = snowflake.createStatement(
  {
  sqlText: "INSERT INTO table2 (col1, col2) VALUES (?, ?);",
  binds:["LiteralValue1", variable2]
  }
);
```

Here is a more complete example. This example binds TIMESTAMP information. Because direct binding of SQL TIMESTAMP
data is not supported, this example passes the timestamp as a VARCHAR, then binds that to the statement. Note that
the SQL statement itself converts the VARCHAR to a TIMESTAMP by calling the TO_TIMESTAMP() function:

> This simple function returns TRUE if the specified timestamp is prior to now, and FALSE otherwise.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE right_bind(TIMESTAMP_VALUE VARCHAR)
> RETURNS BOOLEAN
> LANGUAGE JAVASCRIPT
> AS
> $$
> var cmd = "SELECT CURRENT_DATE() > TO_TIMESTAMP(:1, 'YYYY-MM-DD HH24:MI:SS')";
> var stmt = snowflake.createStatement(
>           {
>           sqlText: cmd,
>           binds: [TIMESTAMP_VALUE]
>           }
>           );
> var result1 = stmt.execute();
> result1.next();
> return result1.getColumnValue(1);
> $$
> ;
> ```
>
> ```sqlexample
> CALL right_bind('2019-09-16 01:02:03');
> +------------+
> | RIGHT_BIND |
> |------------|
> | True       |
> +------------+
> ```

This shows how to bind a VARCHAR, a TIMESTAMP_LTZ, and other data types to an `INSERT` statement. The
TIMESTAMP_LTZ binds an [SfDate](stored-procedures-api.md) variable that is created inside the stored procedure.

> Create a table.
>
> ```sqlexample
> CREATE TABLE table1 (v VARCHAR,
>                      ts1 TIMESTAMP_LTZ(9),
>                      int1 INTEGER,
>                      float1 FLOAT,
>                      numeric1 NUMERIC(10,9),
>                      ts_ntz1 TIMESTAMP_NTZ,
>                      date1 DATE,
>                      time1 TIME
>                      );
> ```
>
> Create a stored procedure. This procedure accepts a `VARCHAR`, and converts the VARCHAR to a `TIMESTAMP_LTZ`
> by using SQL. The procedure then retrieves the converted value from a ResultSet. The value is stored in a JavaScript
> variable of type [SfDate](stored-procedures-api.md). The stored procedure then binds both the original `VARCHAR` and the `TIMESTAMP_LTZ` to an `INSERT` statement. This also demonstrates binding of JavaScript numeric data.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE string_to_timestamp_ltz(TSV VARCHAR)
> RETURNS TIMESTAMP_LTZ
> LANGUAGE JAVASCRIPT
> AS
> $$
>     // Convert the input varchar to a TIMESTAMP_LTZ.
>     var sql_command = "SELECT '" + TSV + "'::TIMESTAMP_LTZ;";
>     var stmt = snowflake.createStatement( {sqlText: sql_command} );
>     var resultSet = stmt.execute();
>     resultSet.next();
>     // Retrieve the TIMESTAMP_LTZ and store it in an SfDate variable.
>     var my_sfDate = resultSet.getColumnValue(1);
>
>     f = 3.1415926;
>
>     // Specify that we'd like position-based binding.
>     sql_command = `INSERT INTO table1 VALUES(:1, :2, :3, :4, :5, :6, :7, :8);`
>     // Bind a VARCHAR, a TIMESTAMP_LTZ, a numeric to our INSERT statement.
>     result = snowflake.execute(
>         {
>         sqlText: sql_command,
>         binds: [TSV, my_sfDate, f, f, f, my_sfDate, my_sfDate, '12:30:00.123' ]
>         }
>         );
>
>     return my_sfDate;
> $$ ;
> ```
>
> Call the procedure.
>
> ```sqlexample
> CALL string_to_timestamp_ltz('2008-11-18 16:00:00');
> +-------------------------------+
> | STRING_TO_TIMESTAMP_LTZ       |
> |-------------------------------|
> | 2008-11-18 16:00:00.000 -0800 |
> +-------------------------------+
> ```
>
> Verify that the row was inserted.
>
> ```sqlexample
> SELECT * FROM table1;
> +---------------------+-------------------------------+------+----------+-------------+-------------------------+------------+----------+
> | V                   | TS1                           | INT1 |   FLOAT1 |    NUMERIC1 | TS_NTZ1                 | DATE1      | TIME1    |
> |---------------------+-------------------------------+------+----------+-------------+-------------------------+------------+----------|
> | 2008-11-18 16:00:00 | 2008-11-18 16:00:00.000 -0800 |    3 | 3.141593 | 3.141593000 | 2008-11-18 16:00:00.000 | 2008-11-18 | 12:30:00 |
> +---------------------+-------------------------------+------+----------+-------------+-------------------------+------------+----------+
> ```

For additional examples of binding data in JavaScript, see [Binding statement parameters](../node-js/nodejs-driver-execute.md).

### Code requirements

The JavaScript code must define a single literal JavaScript object for the stored procedure to be valid.

If the JavaScript code does not meet this requirement, the stored procedure will be created; however, it will fail when called.

### Code size

Snowflake limits the maximum size of the JavaScript source code in the body of a JavaScript stored procedure. Snowflake recommends
limiting the size to 100 KB. (The code is stored in a compressed form, and the exact limit depends on the compressibility of the
code.)

### Runtime errors

Most errors in stored procedures show up at runtime because the JavaScript
code is interpreted at the time that the stored procedure runs rather than when
the stored procedure is created.

### Support for dynamic SQL

Stored procedures can be used to dynamically construct SQL statements. For example,
you could build a SQL command string that contains a mix of pre-configured
SQL and user inputs (e.g. a user’s account number).

For examples, see Dynamically creating a SQL statement and the Examples section.

### Synchronous API

The API for Snowflake stored procedures is synchronous. Within a stored
procedure, you can run only one thread at a time.

Note that this is different from the rule for the JavaScript executing with the Node.js
connector, which allows you to run asynchronous threads.

## Examples

### Basic examples

The following example shows the basic syntax of creating and calling a stored procedure. It doesn’t execute any SQL
or procedural code. However, it provides a starting point for more realistic examples later:

> ```sqlexample
> CREATE OR REPLACE PROCEDURE sp_pi()
>     RETURNS FLOAT NOT NULL
>     LANGUAGE JAVASCRIPT
>     AS
>     $$
>     return 3.1415926;
>     $$
>     ;
> ```
>
> Note that the `$$` delimiter marks the beginning and end of the JavaScript code.
>
> Now call the procedure you just created:
>
> ```sqlexample
> CALL sp_pi();
> +-----------+
> |     SP_PI |
> |-----------|
> | 3.1415926 |
> +-----------+
> ```

The following example illustrates how to execute a SQL statement inside a stored procedure:

1. Create a table:

   > ```sqlexample
   > CREATE TABLE stproc_test_table1 (num_col1 numeric(14,7));
   > ```
2. Create a stored procedure. This inserts a row into
   an existing table named `stproc_test_table1` and returns the value “Succeeded.”.
   The returned value is not particularly useful from a SQL perspective, but it
   allows you to return status information (e.g. “Succeeded.” or “Failed.”) to the user.

   > ```sqlexample
   > CREATE OR REPLACE PROCEDURE stproc1(FLOAT_PARAM1 FLOAT)
   >     RETURNS STRING
   >     LANGUAGE JAVASCRIPT
   >     STRICT
   >     EXECUTE AS OWNER
   >     AS
   >     $$
   >     var sql_command =
   >      "INSERT INTO stproc_test_table1 (num_col1) VALUES (" + FLOAT_PARAM1 + ")";
   >     try {
   >         snowflake.execute (
   >             {sqlText: sql_command}
   >             );
   >         return "Succeeded.";   // Return a success/error indicator.
   >         }
   >     catch (err)  {
   >         return "Failed: " + err;   // Return a success/error indicator.
   >         }
   >     $$
   >     ;
   > ```
3. Call the stored procedure:

   > ```sqlexample
   > call stproc1(5.14::FLOAT);
   > +------------+
   > | STPROC1    |
   > |------------|
   > | Succeeded. |
   > +------------+
   > ```
4. Confirm that the stored procedure inserted the row:

   > ```sqlexample
   > select * from stproc_test_table1;
   > +-----------+
   > |  NUM_COL1 |
   > |-----------|
   > | 5.1400000 |
   > +-----------+
   > ```

The following example retrieves a result:

1. Create a procedure to count the number of rows in a table (equivalent to `select count(*) from table`):

   > ```sqlexample
   > CREATE OR REPLACE PROCEDURE get_row_count(table_name VARCHAR)
   >   RETURNS FLOAT NOT NULL
   >   LANGUAGE JAVASCRIPT
   >   AS
   >   $$
   >   var row_count = 0;
   >   // Dynamically compose the SQL statement to execute.
   >   var sql_command = "select count(*) from " + TABLE_NAME;
   >   // Run the statement.
   >   var stmt = snowflake.createStatement(
   >          {
   >          sqlText: sql_command
   >          }
   >       );
   >   var res = stmt.execute();
   >   // Get back the row count. Specifically, ...
   >   // ... get the first (and in this case only) row from the result set ...
   >   res.next();
   >   // ... and then get the returned value, which in this case is the number of
   >   // rows in the table.
   >   row_count = res.getColumnValue(1);
   >   return row_count;
   >   $$
   >   ;
   > ```
2. Ask the stored procedure how many rows are in the table:

   > ```sqlexample
   > call get_row_count('stproc_test_table1');
   > +---------------+
   > | GET_ROW_COUNT |
   > |---------------|
   > |             3 |
   > +---------------+
   > ```
3. Check independently that you got the right number:

   > ```sqlexample
   > select count(*) from stproc_test_table1;
   > +----------+
   > | COUNT(*) |
   > |----------|
   > |        3 |
   > +----------+
   > ```

### Recursive stored procedure example

The following example shows a basic, but not particularly realistic, recursive stored procedure:

> ```sqlexample
> create or replace table stproc_test_table2 (col1 FLOAT);
> ```
>
> ```none
> create or replace procedure recursive_stproc(counter FLOAT)
>     returns varchar not null
>     language javascript
>     as
>     -- "$$" is the delimiter that shows the beginning and end of the stored proc.
>     $$
>     var counter1 = COUNTER;
>     var returned_value = "";
>     var accumulator = "";
>     var stmt = snowflake.createStatement(
>         {
>         sqlText: "INSERT INTO stproc_test_table2 (col1) VALUES (?);",
>         binds:[counter1]
>         }
>         );
>     var res = stmt.execute();
>     if (COUNTER > 0)
>         {
>         stmt = snowflake.createStatement(
>             {
>             sqlText: "call recursive_stproc (?);",
>             binds:[counter1 - 1]
>             }
>             );
>         res = stmt.execute();
>         res.next();
>         returned_value = res.getColumnValue(1);
>         }
>     accumulator = accumulator + counter1 + ":" + returned_value;
>     return accumulator;
>     $$
>     ;
> ```
>
> ```sqlexample
> call recursive_stproc(4.0::FLOAT);
> +------------------+
> | RECURSIVE_STPROC |
> |------------------|
> | 4:3:2:1:0:       |
> +------------------+
> ```
>
> ```sqlexample
> SELECT *
>     FROM stproc_test_table2
>     ORDER BY col1;
> +------+
> | COL1 |
> |------|
> |    0 |
> |    1 |
> |    2 |
> |    3 |
> |    4 |
> +------+
> ```

### Dynamically creating a SQL statement

The following example shows how to dynamically create a SQL statement:

> **Note:**
>
> As stated in [SQL injection](stored-procedures-usage.md) (in this topic), be careful to guard against attacks when using dynamic SQL.

1. Create the stored procedure. This procedure allows you to pass the name of a table and get the number of rows in
   that table (equivalent to `select count(*) from table_name`):

   > ```none
   > create or replace procedure get_row_count(table_name VARCHAR)
   >     returns float
   >     not null
   >     language javascript
   >     as
   >     $$
   >     var row_count = 0;
   >     // Dynamically compose the SQL statement to execute.
   >     // Note that we uppercased the input parameter name.
   >     var sql_command = "select count(*) from " + TABLE_NAME;
   >     // Run the statement.
   >     var stmt = snowflake.createStatement(
   >            {
   >            sqlText: sql_command
   >            }
   >         );
   >     var res = stmt.execute();
   >     // Get back the row count. Specifically, ...
   >     // ... first, get the first (and in this case only) row from the
   >     //  result set ...
   >     res.next();
   >     // ... then extract the returned value (which in this case is the
   >     // number of rows in the table).
   >     row_count = res.getColumnValue(1);
   >     return row_count;
   >     $$
   >     ;
   > ```
2. Call the stored procedure:

   > ```sqlexample
   > call get_row_count('stproc_test_table1');
   > +---------------+
   > | GET_ROW_COUNT |
   > |---------------|
   > |             3 |
   > +---------------+
   > ```
3. Show the results from `select count(*)` for the same table:

   > ```sqlexample
   > SELECT COUNT(*) FROM stproc_test_table1;
   > +----------+
   > | COUNT(*) |
   > |----------|
   > |        3 |
   > +----------+
   > ```

### Retrieving result set metadata

This example demonstrates retrieving a small amount of metadata from a result set:

> ```none
> create or replace table stproc_test_table3 (
>     n10 numeric(10,0),     /* precision = 10, scale = 0 */
>     n12 numeric(12,4),     /* precision = 12, scale = 4 */
>     v1 varchar(19)         /* scale = 0 */
>     );
> ```
>
> ```none
> create or replace procedure get_column_scale(column_index float)
>     returns float not null
>     language javascript
>     as
>     $$
>     var stmt = snowflake.createStatement(
>         {sqlText: "select n10, n12, v1 from stproc_test_table3;"}
>         );
>     stmt.execute();  // ignore the result set; we just want the scale.
>     return stmt.getColumnScale(COLUMN_INDEX); // Get by column index (1-based)
>     $$
>     ;
> ```
>
> ```none
> call get_column_scale(1);
> +------------------+
> | GET_COLUMN_SCALE |
> |------------------|
> |                0 |
> +------------------+
> ```
>
> ```none
> call get_column_scale(2);
> +------------------+
> | GET_COLUMN_SCALE |
> |------------------|
> |                4 |
> +------------------+
> ```
>
> ```none
> call get_column_scale(3);
> +------------------+
> | GET_COLUMN_SCALE |
> |------------------|
> |                0 |
> +------------------+
> ```

### Catching an error using try/catch

This example demonstrates using a JavaScript try/catch block to catch an error inside a stored procedure:

> 1. Create the stored procedure:
>
>    ```none
>        create procedure broken()
>          returns varchar not null
>          language javascript
>          as
>          $$
>          var result = "";
>          try {
>              snowflake.execute( {sqlText: "Invalid Command!;"} );
>              result = "Succeeded";
>              }
>          catch (err)  {
>              result =  "Failed: Code: " + err.code + "\n  State: " + err.state;
>              result += "\n  Message: " + err.message;
>              result += "\nStack Trace:\n" + err.stackTraceTxt;
>              }
>          return result;
>          $$
>          ;
>    ```
> 2. Call the stored procedure. This should return an error showing the error
>    number and other information:
>
>    ```sqlexample
>        -- This is expected to fail.
>        call broken();
>    +---------------------------------------------------------+
>    | BROKEN                                                  |
>    |---------------------------------------------------------|
>    | Failed: Code: 1003                                      |
>    |   State: 42000                                          |
>    |   Message: SQL compilation error:                       |
>    | syntax error line 1 at position 0 unexpected 'Invalid'. |
>    | Stack Trace:                                            |
>    | Snowflake.execute, line 4 position 20                   |
>    +---------------------------------------------------------+
>    ```

The following example demonstrates throwing a custom exception:

> 1. Create the stored procedure:
>
>    ```sqlexample
>    CREATE OR REPLACE PROCEDURE validate_age (age float)
>    RETURNS VARCHAR
>    LANGUAGE JAVASCRIPT
>    EXECUTE AS CALLER
>    AS $$
>        try {
>            if (AGE < 0) {
>                throw "Age cannot be negative!";
>            } else {
>                return "Age validated.";
>            }
>        } catch (err) {
>            return "Error: " + err;
>        }
>    $$;
>    ```
> 2. Call the stored procedure with valid and invalid values:
>
>    ```sqlexample
>    CALL validate_age(50);
>    +----------------+
>    | VALIDATE_AGE   |
>    |----------------|
>    | Age validated. |
>    +----------------+
>    CALL validate_age(-2);
>    +--------------------------------+
>    | VALIDATE_AGE                   |
>    |--------------------------------|
>    | Error: Age cannot be negative! |
>    +--------------------------------+
>    ```

### Using transactions in stored procedures

The following example wraps multiple related statements in a transaction, and uses try/catch to commit or roll back.
The parameter `force_failure` allows the caller to choose between successful execution and deliberate error.

```sqlexample-javascript
-- Create the procedure
CREATE OR REPLACE PROCEDURE cleanup(force_failure VARCHAR)
  RETURNS VARCHAR NOT NULL
  LANGUAGE JAVASCRIPT
  AS
  $$
  var result = "";
  snowflake.execute( {sqlText: "BEGIN WORK;"} );
  try {
      snowflake.execute( {sqlText: "DELETE FROM child;"} );
      snowflake.execute( {sqlText: "DELETE FROM parent;"} );
      if (FORCE_FAILURE === "fail")  {
          // To see what happens if there is a failure/rollback,
          snowflake.execute( {sqlText: "DELETE FROM no_such_table;"} );
          }
      snowflake.execute( {sqlText: "COMMIT WORK;"} );
      result = "Succeeded";
      }
  catch (err)  {
      snowflake.execute( {sqlText: "ROLLBACK WORK;"} );
      return "Failed: " + err;   // Return a success/error indicator.
      }
  return result;
  $$
  ;

CALL cleanup('fail');

CALL cleanup('do not fail');
```

### Logging an error

You can capture log and trace data from JavaScript handler code by using the `snowflake` object in the JavaScript API. When you do,
log messages and trace data are stored in an event table that you can analyze with queries.

For more information, refer to the following:

* [Logging messages in JavaScript](../logging-tracing/logging-javascript.md)
* [Emitting trace events in JavaScript](../logging-tracing/tracing-javascript.md)

### Using RESULT_SCAN to retrieve the result from a stored procedure

The following example shows you how to use the [RESULT_SCAN](../../sql-reference/functions/result_scan.md) function to retrieve and process the result from a
[CALL](../../sql-reference/sql/call.md) statement:

1. Create and load the table:

   > ```none
   > CREATE TABLE western_provinces(ID INT, province VARCHAR);
   > ```
   >
   > ```none
   > INSERT INTO western_provinces(ID, province) VALUES
   >     (1, 'Alberta'),
   >     (2, 'British Columbia'),
   >     (3, 'Manitoba')
   >     ;
   > ```
2. Create the stored procedure. This procedure returns a well-formatted string that looks like a result set of
   three rows, but is actually a single string:

   > ```none
   > CREATE OR REPLACE PROCEDURE read_western_provinces()
   >   RETURNS VARCHAR NOT NULL
   >   LANGUAGE JAVASCRIPT
   >   AS
   >   $$
   >   var return_value = "";
   >   try {
   >       var command = "SELECT * FROM western_provinces ORDER BY province;"
   >       var stmt = snowflake.createStatement( {sqlText: command } );
   >       var rs = stmt.execute();
   >       if (rs.next())  {
   >           return_value += rs.getColumnValue(1);
   >           return_value += ", " + rs.getColumnValue(2);
   >           }
   >       while (rs.next())  {
   >           return_value += "\n";
   >           return_value += rs.getColumnValue(1);
   >           return_value += ", " + rs.getColumnValue(2);
   >           }
   >       }
   >   catch (err)  {
   >       result =  "Failed: Code: " + err.code + "\n  State: " + err.state;
   >       result += "\n  Message: " + err.message;
   >       result += "\nStack Trace:\n" + err.stackTraceTxt;
   >       }
   >   return return_value;
   >   $$
   >   ;
   > ```
3. Call the stored procedure, then retrieve the results by using RESULT_SCAN:

   > ```none
   > CALL read_western_provinces();
   > +------------------------+
   > | READ_WESTERN_PROVINCES |
   > |------------------------|
   > | 1, Alberta             |
   > | 2, British Columbia    |
   > | 3, Manitoba            |
   > +------------------------+
   > SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
   > +------------------------+
   > | READ_WESTERN_PROVINCES |
   > |------------------------|
   > | 1, Alberta             |
   > | 2, British Columbia    |
   > | 3, Manitoba            |
   > +------------------------+
   > ```

You can perform more complex operations on the value returned by the RESULT_SCAN function. In this case, because the
returned value is a single string, you might want to extract the individual “rows” that appear to be contained
within that string, and store those rows in another table.

> **Tip:**
>
> You can also use the [pipe operator](../../sql-reference/operators-flow.md) (`->>`) instead of the RESULT_SCAN function to
> run a CALL statement and process its result set with a single command.

The following example, which is a continuation of the previous example, illustrates one way to do this:

1. Create a table for long-term storage. This table contains the province name and the province ID after you’ve
   extracted them from the string returned by the CALL command:

   > ```none
   > CREATE TABLE all_provinces(ID INT, province VARCHAR);
   > ```
2. Call the stored procedure, then retrieve the result by using RESULT_SCAN, and then extract the three rows
   from the string and put those rows into the table:

   > ```none
   > INSERT INTO all_provinces
   >   WITH
   >     one_string (string_col) AS
   >       (SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))),
   >     three_strings (one_row) AS
   >       (SELECT VALUE FROM one_string, LATERAL SPLIT_TO_TABLE(one_string.string_col, '\n'))
   >   SELECT
   >          STRTOK(one_row, ',', 1) AS ID,
   >          STRTOK(one_row, ',', 2) AS province
   >     FROM three_strings
   >     WHERE NOT (ID IS NULL AND province IS NULL);
   > +-------------------------+
   > | number of rows inserted |
   > |-------------------------|
   > |                       3 |
   > +-------------------------+
   > ```
3. Verify that this worked by showing the rows in the table:

   > ```none
   > SELECT ID, province
   >     FROM all_provinces;
   > +----+-------------------+
   > | ID | PROVINCE          |
   > |----+-------------------|
   > |  1 |  Alberta          |
   > |  2 |  British Columbia |
   > |  3 |  Manitoba         |
   > +----+-------------------+
   > ```

Here’s approximately the same code, but in smaller steps:

1. Create a table named `one_string`. This table temporarily stores the result of the CALL command.
   The result of the CALL is a single string, so this table stores only a single VARCHAR value.

   > ```none
   > CREATE TRANSIENT TABLE one_string(string_col VARCHAR);
   > ```
2. Call the stored procedure, then retrieve the result (a string) by using RESULT_SCAN, and then store that into
   the intermediate table named `one_string`:

   > ```none
   > CALL read_western_provinces();
   > +------------------------+
   > | READ_WESTERN_PROVINCES |
   > |------------------------|
   > | 1, Alberta             |
   > | 2, British Columbia    |
   > | 3, Manitoba            |
   > +------------------------+
   > INSERT INTO one_string
   >     SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
   > +-------------------------+
   > | number of rows inserted |
   > |-------------------------|
   > |                       1 |
   > +-------------------------+
   > ```

   This shows the new row in the `one_string` table. Remember that although this is formatted to look like three rows,
   it is actually a single string:

   > ```none
   > SELECT string_col FROM one_string;
   > +---------------------+
   > | STRING_COL          |
   > |---------------------|
   > | 1, Alberta          |
   > | 2, British Columbia |
   > | 3, Manitoba         |
   > +---------------------+
   > -- Show that it's one string, not three rows:
   > SELECT '>>>' || string_col || '<<<' AS string_col
   >     FROM one_string;
   > +---------------------+
   > | STRING_COL          |
   > |---------------------|
   > | >>>1, Alberta       |
   > | 2, British Columbia |
   > | 3, Manitoba<<<      |
   > +---------------------+
   > SELECT COUNT(*) FROM one_string;
   > +----------+
   > | COUNT(*) |
   > |----------|
   > |        1 |
   > +----------+
   > ```

   The following commands show how to extract multiple rows from the string:

   > ```none
   > SELECT * FROM one_string, LATERAL SPLIT_TO_TABLE(one_string.string_col, '\n');
   > +---------------------+-----+-------+---------------------+
   > | STRING_COL          | SEQ | INDEX | VALUE               |
   > |---------------------+-----+-------+---------------------|
   > | 1, Alberta          |   1 |     1 | 1, Alberta          |
   > | 2, British Columbia |     |       |                     |
   > | 3, Manitoba         |     |       |                     |
   > | 1, Alberta          |   1 |     2 | 2, British Columbia |
   > | 2, British Columbia |     |       |                     |
   > | 3, Manitoba         |     |       |                     |
   > | 1, Alberta          |   1 |     3 | 3, Manitoba         |
   > | 2, British Columbia |     |       |                     |
   > | 3, Manitoba         |     |       |                     |
   > +---------------------+-----+-------+---------------------+
   > SELECT VALUE FROM one_string, LATERAL SPLIT_TO_TABLE(one_string.string_col, '\n');
   > +---------------------+
   > | VALUE               |
   > |---------------------|
   > | 1, Alberta          |
   > | 2, British Columbia |
   > | 3, Manitoba         |
   > +---------------------+
   > ```
3. Next, create a table named `three_strings`. This table will hold the result after you’ve split it into individual
   lines/strings:

   > ```none
   > CREATE TRANSIENT TABLE three_strings(string_col VARCHAR);
   > ```
4. Now convert that one string in the `one_string` table into three separate strings, and show that it is
   now actually three strings:

   > ```none
   > INSERT INTO three_strings
   >   SELECT VALUE FROM one_string, LATERAL SPLIT_TO_TABLE(one_string.string_col, '\n');
   > +-------------------------+
   > | number of rows inserted |
   > |-------------------------|
   > |                       3 |
   > +-------------------------+
   > SELECT string_col
   >     FROM three_strings;
   > +---------------------+
   > | STRING_COL          |
   > |---------------------|
   > | 1, Alberta          |
   > | 2, British Columbia |
   > | 3, Manitoba         |
   > +---------------------+
   > SELECT COUNT(*)
   >     FROM three_strings;
   > +----------+
   > | COUNT(*) |
   > |----------|
   > |        3 |
   > +----------+
   > ```
5. Now convert the three strings into three rows in our long-term table named `all_provinces`:

   > ```none
   > INSERT INTO all_provinces
   >   SELECT
   >          STRTOK(string_col, ',', 1) AS ID,
   >          STRTOK(string_col, ',', 2) AS province
   >     FROM three_strings
   >     WHERE NOT (ID IS NULL AND province IS NULL);
   > +-------------------------+
   > | number of rows inserted |
   > |-------------------------|
   > |                       3 |
   > +-------------------------+
   > ```
6. Show the three rows in the long-term table:

   > ```none
   > SELECT ID, province
   >     FROM all_provinces;
   > +----+-------------------+
   > | ID | PROVINCE          |
   > |----+-------------------|
   > |  1 |  Alberta          |
   > |  2 |  British Columbia |
   > |  3 |  Manitoba         |
   > +----+-------------------+
   > SELECT COUNT(*)
   >     FROM all_provinces;
   > +----------+
   > | COUNT(*) |
   > |----------|
   > |        3 |
   > +----------+
   > ```

### Returning an array of error messages

Your stored procedure might execute more than one SQL statement and you might want to return a status/error message
for each SQL statement. However, a stored procedure returns a single row; it is not designed to return multiple
rows.

If all of your messages fit into a single value of type ARRAY, you can get all the messages from a stored procedure
with some additional effort.

The following example shows one way to do this (the error messages shown are not real, but you can extend this code to
work with your actual SQL statements):

> ```none
> CREATE OR REPLACE PROCEDURE sp_return_array()
>       RETURNS VARIANT NOT NULL
>       LANGUAGE JAVASCRIPT
>       AS
>       $$
>       // This array will contain one error message (or an empty string)
>       // for each SQL command that we executed.
>       var array_of_rows = [];
>
>       // Artificially fake the error messages.
>       array_of_rows.push("ERROR: The foo was barred.")
>       array_of_rows.push("WARNING: A Carrington Event is predicted.")
>
>       return array_of_rows;
>       $$
>       ;
> ```
>
> ```sqlexample
> CALL sp_return_array();
> +-----------------------------------------------+
> | SP_RETURN_ARRAY                               |
> |-----------------------------------------------|
> | [                                             |
> |   "ERROR: The foo was barred.",               |
> |   "WARNING: A Carrington Event is predicted." |
> | ]                                             |
> +-----------------------------------------------+
> -- Now get the individual error messages, in order.
> SELECT INDEX, VALUE
>     FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())) AS res, LATERAL FLATTEN(INPUT => res.$1)
>     ORDER BY index
>     ;
> +-------+---------------------------------------------+
> | INDEX | VALUE                                       |
> |-------+---------------------------------------------|
> |     0 | "ERROR: The foo was barred."                |
> |     1 | "WARNING: A Carrington Event is predicted." |
> +-------+---------------------------------------------+
> ```

Remember, this is not a general purpose solution. There is a limit on the maximum size of
ARRAY data types, and your entire result set must fit into a single ARRAY.

### Returning a result set

This section extends the previous example described in Returning an Array of Error Messages. This example is more
general, and allows you to return a result set from a query.

A stored procedure returns a single row that contains a single column; it is not designed to return a result set.
However, if your result set is small enough to fit into a single value of type VARIANT or ARRAY, you can return
a result set from a stored procedure with some additional code:

> > ```sqlexample
> > CREATE TABLE return_to_me(col_i INT, col_v VARCHAR);
> > INSERT INTO return_to_me (col_i, col_v) VALUES
> >     (1, 'Ariel'),
> >     (2, 'October'),
> >     (3, NULL),
> >     (NULL, 'Project');
> > ```
> >
> > ```none
> > -- Create the stored procedure that retrieves a result set and returns it.
> > CREATE OR REPLACE PROCEDURE sp_return_table(TABLE_NAME VARCHAR, COL_NAMES ARRAY)
> >       RETURNS VARIANT NOT NULL
> >       LANGUAGE JAVASCRIPT
> >       AS
> >       $$
> >       // This variable will hold a JSON data structure that holds ONE row.
> >       var row_as_json = {};
> >       // This array will contain all the rows.
> >       var array_of_rows = [];
> >       // This variable will hold a JSON data structure that we can return as
> >       // a VARIANT.
> >       // This will contain ALL the rows in a single "value".
> >       var table_as_json = {};
> >
> >       // Run SQL statement(s) and get a resultSet.
> >       var command = "SELECT * FROM " + TABLE_NAME;
> >       var cmd1_dict = {sqlText: command};
> >       var stmt = snowflake.createStatement(cmd1_dict);
> >       var rs = stmt.execute();
> >
> >       // Read each row and add it to the array we will return.
> >       var row_num = 1;
> >       while (rs.next())  {
> >         // Put each row in a variable of type JSON.
> >         row_as_json = {};
> >         // For each column in the row...
> >         for (var col_num = 0; col_num < COL_NAMES.length; col_num = col_num + 1) {
> >           var col_name = COL_NAMES[col_num];
> >           row_as_json[col_name] = rs.getColumnValue(col_num + 1);
> >           }
> >         // Add the row to the array of rows.
> >         array_of_rows.push(row_as_json);
> >         ++row_num;
> >         }
> >       // Put the array in a JSON variable (so it looks like a VARIANT to
> >       // Snowflake).  The key is "key1", and the value is the array that has
> >       // the rows we want.
> >       table_as_json = { "key1" : array_of_rows };
> >
> >       // Return the rows to Snowflake, which expects a JSON-compatible VARIANT.
> >       return table_as_json;
> >       $$
> >       ;
> > ```
> >
> > ```sqlexample
> > CALL sp_return_table(
> >         -- Table name.
> >         'return_to_me',
> >         -- Array of column names.
> >         ARRAY_APPEND(TO_ARRAY('COL_I'), 'COL_V')
> >         );
> > +--------------------------+
> > | SP_RETURN_TABLE          |
> > |--------------------------|
> > | {                        |
> > |   "key1": [              |
> > |     {                    |
> > |       "COL_I": 1,        |
> > |       "COL_V": "Ariel"   |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 2,        |
> > |       "COL_V": "October" |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 3,        |
> > |       "COL_V": null      |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": null,     |
> > |       "COL_V": "Project" |
> > |     }                    |
> > |   ]                      |
> > | }                        |
> > +--------------------------+
> > -- Use "ResultScan" to get the data from the stored procedure that
> > -- "did not return a result set".
> > -- Use "$1:key1" to get the value corresponding to the JSON key named "key1".
> > SELECT $1:key1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
> > +------------------------+
> > | $1:KEY1                |
> > |------------------------|
> > | [                      |
> > |   {                    |
> > |     "COL_I": 1,        |
> > |     "COL_V": "Ariel"   |
> > |   },                   |
> > |   {                    |
> > |     "COL_I": 2,        |
> > |     "COL_V": "October" |
> > |   },                   |
> > |   {                    |
> > |     "COL_I": 3,        |
> > |     "COL_V": null      |
> > |   },                   |
> > |   {                    |
> > |     "COL_I": null,     |
> > |     "COL_V": "Project" |
> > |   }                    |
> > | ]                      |
> > +------------------------+
> > -- Now get what we really want.
> > SELECT VALUE:COL_I AS col_i, value:COL_V AS col_v
> >   FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())) AS res, LATERAL FLATTEN(input => res.$1)
> >   ORDER BY COL_I;
> > +-------+-----------+
> > | COL_I | COL_V     |
> > |-------+-----------|
> > | 1     | "Ariel"   |
> > | 2     | "October" |
> > | 3     | null      |
> > | null  | "Project" |
> > +-------+-----------+
> > ```
>
> This shows how to combine the previous two lines into a single line:
>
> > ```sqlexample
> > CALL sp_return_table(
> >         -- Table name.
> >         'return_to_me',
> >         -- Array of column names.
> >         ARRAY_APPEND(TO_ARRAY('COL_I'), 'COL_V')
> >         );
> > +--------------------------+
> > | SP_RETURN_TABLE          |
> > |--------------------------|
> > | {                        |
> > |   "key1": [              |
> > |     {                    |
> > |       "COL_I": 1,        |
> > |       "COL_V": "Ariel"   |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 2,        |
> > |       "COL_V": "October" |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 3,        |
> > |       "COL_V": null      |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": null,     |
> > |       "COL_V": "Project" |
> > |     }                    |
> > |   ]                      |
> > | }                        |
> > +--------------------------+
> > SELECT VALUE:COL_I AS col_i, value:COL_V AS col_v
> >        FROM (SELECT $1:key1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))) AS res,
> >             LATERAL FLATTEN(input => res.$1)
> >        ORDER BY COL_I;
> > +-------+-----------+
> > | COL_I | COL_V     |
> > |-------+-----------|
> > | 1     | "Ariel"   |
> > | 2     | "October" |
> > | 3     | null      |
> > | null  | "Project" |
> > +-------+-----------+
> > ```
>
> For convenience, you can wrap the preceding line in a view. This view also converts the string ‘null’ to a true NULL.
> You only need to create the view once. However, you must call the stored procedure immediately prior to
> selecting from this view every time you use the view. Remember, the call to RESULT_SCAN in the view is pulling from the
> most recent statement, which must be the CALL:
>
> > ```sqlexample
> > CREATE VIEW stproc_view (col_i, col_v) AS
> >   SELECT NULLIF(VALUE:COL_I::VARCHAR, 'null'::VARCHAR),
> >          NULLIF(value:COL_V::VARCHAR, 'null'::VARCHAR)
> >     FROM (SELECT $1:key1 AS tbl FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))) AS res,
> >          LATERAL FLATTEN(input => res.tbl);
> > ```
> >
> > ```sqlexample
> > CALL sp_return_table(
> >         -- Table name.
> >         'return_to_me',
> >         -- Array of column names.
> >         ARRAY_APPEND(TO_ARRAY('COL_I'), 'COL_V')
> >         );
> > +--------------------------+
> > | SP_RETURN_TABLE          |
> > |--------------------------|
> > | {                        |
> > |   "key1": [              |
> > |     {                    |
> > |       "COL_I": 1,        |
> > |       "COL_V": "Ariel"   |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 2,        |
> > |       "COL_V": "October" |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 3,        |
> > |       "COL_V": null      |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": null,     |
> > |       "COL_V": "Project" |
> > |     }                    |
> > |   ]                      |
> > | }                        |
> > +--------------------------+
> > SELECT *
> >     FROM stproc_view
> >     ORDER BY COL_I;
> > +-------+---------+
> > | COL_I | COL_V   |
> > |-------+---------|
> > | 1     | Ariel   |
> > | 2     | October |
> > | 3     | NULL    |
> > | NULL  | Project |
> > +-------+---------+
> > ```
>
> You can even use it as a true view (i.e. select a subset of it):
>
> > ```sqlexample
> > CALL sp_return_table(
> >         -- Table name.
> >         'return_to_me',
> >         -- Array of column names.
> >         ARRAY_APPEND(TO_ARRAY('COL_I'), 'COL_V')
> >         );
> > +--------------------------+
> > | SP_RETURN_TABLE          |
> > |--------------------------|
> > | {                        |
> > |   "key1": [              |
> > |     {                    |
> > |       "COL_I": 1,        |
> > |       "COL_V": "Ariel"   |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 2,        |
> > |       "COL_V": "October" |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": 3,        |
> > |       "COL_V": null      |
> > |     },                   |
> > |     {                    |
> > |       "COL_I": null,     |
> > |       "COL_V": "Project" |
> > |     }                    |
> > |   ]                      |
> > | }                        |
> > +--------------------------+
> > SELECT COL_V
> >     FROM stproc_view
> >     WHERE COL_V IS NOT NULL
> >     ORDER BY COL_V;
> > +---------+
> > | COL_V   |
> > |---------|
> > | Ariel   |
> > | October |
> > | Project |
> > +---------+
> > ```

Remember, this is not a general purpose solution. There is a limit on the maximum size of VARIANT and
ARRAY data types, and your entire result set must fit into a single VARIANT or ARRAY.

### Protecting privacy

This example shows a stored procedure that is useful for an on-line retailer.
This stored procedure respects customers’ privacy, while protecting
legitimate interests of both the retailer and the customer.
If a customer asks the retailer to delete the customer’s data for privacy reasons,
then this stored procedure deletes most of the customer’s data, but leaves the customer’s
purchase history if either of the following is true:

* Any purchased item has a warranty that has not yet expired.
* The customer still owes money (or the customer is owed a refund).

A more real-world version of this would delete individual rows for which payment has been
made and the warranty has expired.

1. Start by creating the tables and loading them:

   > ```sqlexample
   > create table reviews (customer_ID VARCHAR, review VARCHAR);
   > create table purchase_history (customer_ID VARCHAR, price FLOAT, paid FLOAT,
   >                                product_ID VARCHAR, purchase_date DATE);
   > ```
   >
   > ```sqlexample
   > insert into purchase_history (customer_ID, price, paid, product_ID, purchase_date) values
   >     (1, 19.99, 19.99, 'chocolate', '2018-06-17'::DATE),
   >     (2, 19.99,  0.00, 'chocolate', '2017-02-14'::DATE),
   >     (3, 19.99,  19.99, 'chocolate', '2017-03-19'::DATE);
   >
   > insert into reviews (customer_ID, review) values (1, 'Loved the milk chocolate!');
   > insert into reviews (customer_ID, review) values (2, 'Loved the dark chocolate!');
   > ```
2. Create the stored procedure:

   > ```none
   > create or replace procedure delete_nonessential_customer_data(customer_ID varchar)
   >     returns varchar not null
   >     language javascript
   >     as
   >     $$
   >
   >     // If the customer posted reviews of products, delete those reviews.
   >     var sql_cmd = "DELETE FROM reviews WHERE customer_ID = " + CUSTOMER_ID;
   >     snowflake.execute( {sqlText: sql_cmd} );
   >
   >     // Delete any other records not needed for warranty or payment info.
   >     // ...
   >
   >     var result = "Deleted non-financial, non-warranty data for customer " + CUSTOMER_ID;
   >
   >     // Find out if the customer has any net unpaid balance (or surplus/prepayment).
   >     sql_cmd = "SELECT SUM(price) - SUM(paid) FROM purchase_history WHERE customer_ID = " + CUSTOMER_ID;
   >     var stmt = snowflake.createStatement( {sqlText: sql_cmd} );
   >     var rs = stmt.execute();
   >     // There should be only one row, so should not need to iterate.
   >     rs.next();
   >     var net_amount_owed = rs.getColumnValue(1);
   >
   >     // Look up the number of purchases still under warranty...
   >     var number_purchases_under_warranty = 0;
   >     // Assuming a 1-year warranty...
   >     sql_cmd = "SELECT COUNT(*) FROM purchase_history ";
   >     sql_cmd += "WHERE customer_ID = " + CUSTOMER_ID;
   >     // Can't use CURRENT_DATE() because that changes. So assume that today is
   >     // always June 15, 2019.
   >     sql_cmd += "AND PURCHASE_DATE > dateadd(year, -1, '2019-06-15'::DATE)";
   >     var stmt = snowflake.createStatement( {sqlText: sql_cmd} );
   >     var rs = stmt.execute();
   >     // There should be only one row, so should not need to iterate.
   >     rs.next();
   >     number_purchases_under_warranty = rs.getColumnValue(1);
   >
   >     // Check whether need to keep some purchase history data; if not, then delete the data.
   >     if (net_amount_owed == 0.0 && number_purchases_under_warranty == 0)  {
   >         // Delete the purchase history of this customer ...
   >         sql_cmd = "DELETE FROM purchase_history WHERE customer_ID = " + CUSTOMER_ID;
   >         snowflake.execute( {sqlText: sql_cmd} );
   >         // ... and delete anything else that should be deleted.
   >         // ...
   >         result = "Deleted all data, including financial and warranty data, for customer " + CUSTOMER_ID;
   >         }
   >     return result;
   >     $$
   >     ;
   > ```
3. Show the data in the tables before deleting any of that data:

   > ```sqlexample
   > SELECT * FROM reviews;
   > +-------------+---------------------------+
   > | CUSTOMER_ID | REVIEW                    |
   > |-------------+---------------------------|
   > | 1           | Loved the milk chocolate! |
   > | 2           | Loved the dark chocolate! |
   > +-------------+---------------------------+
   > SELECT * FROM purchase_history;
   > +-------------+-------+-------+------------+---------------+
   > | CUSTOMER_ID | PRICE |  PAID | PRODUCT_ID | PURCHASE_DATE |
   > |-------------+-------+-------+------------+---------------|
   > | 1           | 19.99 | 19.99 | chocolate  | 2018-06-17    |
   > | 2           | 19.99 |  0    | chocolate  | 2017-02-14    |
   > | 3           | 19.99 | 19.99 | chocolate  | 2017-03-19    |
   > +-------------+-------+-------+------------+---------------+
   > ```
4. Customer #1 has a warranty that is still in effect. The stored procedure deletes the review comments that they posted,
   but keeps their purchase record because of the warranty:

   > ```sqlexample
   > call delete_nonessential_customer_data(1);
   > +---------------------------------------------------------+
   > | DELETE_NONESSENTIAL_CUSTOMER_DATA                       |
   > |---------------------------------------------------------|
   > | Deleted non-financial, non-warranty data for customer 1 |
   > +---------------------------------------------------------+
   > SELECT * FROM reviews;
   > +-------------+---------------------------+
   > | CUSTOMER_ID | REVIEW                    |
   > |-------------+---------------------------|
   > | 2           | Loved the dark chocolate! |
   > +-------------+---------------------------+
   > SELECT * FROM purchase_history;
   > +-------------+-------+-------+------------+---------------+
   > | CUSTOMER_ID | PRICE |  PAID | PRODUCT_ID | PURCHASE_DATE |
   > |-------------+-------+-------+------------+---------------|
   > | 1           | 19.99 | 19.99 | chocolate  | 2018-06-17    |
   > | 2           | 19.99 |  0    | chocolate  | 2017-02-14    |
   > | 3           | 19.99 | 19.99 | chocolate  | 2017-03-19    |
   > +-------------+-------+-------+------------+---------------+
   > ```
5. Customer #2 still owes money. The stored procedure deletes their review comments, but keeps their purchase record:

   > ```sqlexample
   > call delete_nonessential_customer_data(2);
   > +---------------------------------------------------------+
   > | DELETE_NONESSENTIAL_CUSTOMER_DATA                       |
   > |---------------------------------------------------------|
   > | Deleted non-financial, non-warranty data for customer 2 |
   > +---------------------------------------------------------+
   > SELECT * FROM reviews;
   > +-------------+--------+
   > | CUSTOMER_ID | REVIEW |
   > |-------------+--------|
   > +-------------+--------+
   > SELECT * FROM purchase_history;
   > +-------------+-------+-------+------------+---------------+
   > | CUSTOMER_ID | PRICE |  PAID | PRODUCT_ID | PURCHASE_DATE |
   > |-------------+-------+-------+------------+---------------|
   > | 1           | 19.99 | 19.99 | chocolate  | 2018-06-17    |
   > | 2           | 19.99 |  0    | chocolate  | 2017-02-14    |
   > | 3           | 19.99 | 19.99 | chocolate  | 2017-03-19    |
   > +-------------+-------+-------+------------+---------------+
   > ```
6. Customer #3 does not owe any money (and is not owed any money). Their warranty expired, so the stored procedure
   deletes both the review comments and the purchase records:

   > ```sqlexample
   > call delete_nonessential_customer_data(3);
   > +-------------------------------------------------------------------------+
   > | DELETE_NONESSENTIAL_CUSTOMER_DATA                                       |
   > |-------------------------------------------------------------------------|
   > | Deleted all data, including financial and warranty data, for customer 3 |
   > +-------------------------------------------------------------------------+
   > SELECT * FROM reviews;
   > +-------------+--------+
   > | CUSTOMER_ID | REVIEW |
   > |-------------+--------|
   > +-------------+--------+
   > SELECT * FROM purchase_history;
   > +-------------+-------+-------+------------+---------------+
   > | CUSTOMER_ID | PRICE |  PAID | PRODUCT_ID | PURCHASE_DATE |
   > |-------------+-------+-------+------------+---------------|
   > | 1           | 19.99 | 19.99 | chocolate  | 2018-06-17    |
   > | 2           | 19.99 |  0    | chocolate  | 2017-02-14    |
   > +-------------+-------+-------+------------+---------------+
   > ```

### Using session variables with caller’s rights and owner’s rights stored procedures

These examples illustrate one of the key differences between caller’s rights and owner’s rights stored
procedures. They attempt to use session variables in two ways:

* Set a session variable before calling the stored procedure, then use the session variable inside the stored
  procedure.
* Set a session variable inside the stored procedure, then use the session variable after returning from the stored
  procedures.

Both using the session variable and setting the session variable work correctly in a caller’s rights stored procedure.
Both fail when using an owner’s rights stored procedure even if the caller is the owner.

#### Caller’s rights stored procedure

The following example demonstrates a caller’s rights stored procedure.

1. Create and load a table:

   > ```sqlexample
   > create table sv_table (f float);
   > insert into sv_table (f) values (49), (51);
   > ```
2. Set a session variable:

   > ```sqlexample
   > set SESSION_VAR1 = 50;
   > ```
3. Create a caller’s rights stored procedure that uses one session variable and sets another:

   > ```sqlexample
   > create procedure session_var_user()
   >   returns float
   >   language javascript
   >   EXECUTE AS CALLER
   >   as
   >   $$
   >   // Set the second session variable
   >   var stmt = snowflake.createStatement(
   >       {sqlText: "set SESSION_VAR2 = 'I was set inside the StProc.'"}
   >       );
   >   var rs = stmt.execute();  // we ignore the result in this case
   >   // Run a query using the first session variable
   >   stmt = snowflake.createStatement(
   >       {sqlText: "select f from sv_table where f > $SESSION_VAR1"}
   >       );
   >   rs = stmt.execute();
   >   rs.next();
   >   var output = rs.getColumnValue(1);
   >   return output;
   >   $$
   >   ;
   > ```
4. Call the procedure:

   > ```sqlexample
   > CALL session_var_user();
   > +------------------+
   > | SESSION_VAR_USER |
   > |------------------|
   > |               51 |
   > +------------------+
   > ```
5. View the value of the session variable set inside the stored procedure:

   > ```sqlexample
   > SELECT $SESSION_VAR2;
   > +------------------------------+
   > | $SESSION_VAR2                |
   > |------------------------------|
   > | I was set inside the StProc. |
   > +------------------------------+
   > ```

> **Note:**
>
> Although you can set a session variable inside a stored procedure and leave it set after the end of the procedure,
> Snowflake does not recommend doing this.

#### Owner’s rights stored procedure

The following example demonstrates an owner’s rights stored procedure.

1. Create an owner’s rights stored procedure that uses a session variable:

   > ```sqlexample
   > create procedure cannot_use_session_vars()
   >   returns float
   >   language javascript
   >   EXECUTE AS OWNER
   >   as
   >   $$
   >   // Run a query using the first session variable
   >   var stmt = snowflake.createStatement(
   >       {sqlText: "select f from sv_table where f > $SESSION_VAR1"}
   >       );
   >   var rs = stmt.execute();
   >   rs.next();
   >   var output = rs.getColumnValue(1);
   >   return output;
   >   $$
   >   ;
   > ```
2. Call the procedure (it should fail):

   > ```sqlexample
   > CALL cannot_use_session_vars();
   > ```
3. Create an owner’s rights stored procedure that tries to set a session variable:

   > ```sqlexample
   > create procedure cannot_set_session_vars()
   >   returns float
   >   language javascript
   >   EXECUTE AS OWNER
   >   as
   >   $$
   >   // Set the second session variable
   >   var stmt = snowflake.createStatement(
   >       {sqlText: "set SESSION_VAR2 = 'I was set inside the StProc.'"}
   >       );
   >   var rs = stmt.execute();  // we ignore the result in this case
   >   return 3.0;   // dummy value.
   >   $$
   >   ;
   > ```
4. Call the procedure (it should fail):

   > ```sqlexample
   > CALL cannot_set_session_vars();
   > ```

## Troubleshooting

A general troubleshooting technique is to use a JavaScript try/catch block to
catch the error and display error information. The error object contains:

* Error code.
* Error message.
* Error state.
* Stack trace at the point of failure.

For more information, including an example, of how to use this information, see Catching an error using try/catch (in this topic).

Th following sections provide additional suggestions to help debug specific problems.

### Stored procedure or UDF unexpectedly returns NULL

Cause:
:   Your stored procedure/UDF has a parameter, and inside the procedure/UDF, the parameter is referred to by its lowercase name, but Snowflake has
    automatically converted the name to uppercase.

Solution:
:   Either:

    * Use uppercase for the variable name inside the JavaScript code, or
    * Enclose the variable name in double quotes in the SQL code.

    For more details, see [JavaScript arguments and returned values](../udf/javascript/udf-javascript-introduction.md).

### Stored procedure never finishes running

Cause:
:   You might have an infinite loop in your JavaScript code.

Solution:
:   Check for and fix any infinite loops.

### Error: `Failed: empty argument passed`

Cause:
:   Your stored procedure might contain “sqltext” when it should have “sqlText”
    (the first is all lowercase; the second is mixed case).

Solution:
:   Use “sqlText”.

### Error: `JavaScript out of memory error: UDF thread memory limit exceeded`

Cause:
:   You might have an infinite loop in your JavaScript code.

Solution:
:   Check for and fix any infinite loops. In particular, ensure that you stop calling for the next row when the result set runs out (i.e. when
    `resultSet.next()` returns `false`).

---
title: Writing stored procedures in Snowflake Scripting
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md
section: Developer Guide
---

# Writing stored procedures in Snowflake Scripting

This topic provides an introduction to writing a stored procedure in SQL by using Snowflake Scripting.
For more information about Snowflake Scripting, see the [Snowflake Scripting Developer Guide](../snowflake-scripting/index.md).

## Introduction

To write a stored procedure that uses Snowflake Scripting:

* Use the [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) or [WITH … CALL …](../../sql-reference/sql/call-with.md) command with
  LANGUAGE SQL.
* In the body of the stored procedure (the AS clause), you use a
  [Snowflake Scripting block](../snowflake-scripting/blocks.md).

  > **Note:**
  >
  > If you are creating a Snowflake Scripting procedure in [SnowSQL](../../user-guide/snowsql.md) or [Snowsight](../../user-guide/ui-snowsight-gs.md), you must use
  > [string literal delimiters](../../sql-reference/data-types-text.md) (`'` or `$$`) around the body of the stored procedure.
  >
  > For details, see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md).

Snowflake limits the maximum size of the source code in the body of a Snowflake Scripting stored procedure. Snowflake
recommends limiting the size to 100 KB. (The code is stored in a compressed form, and the exact limit depends on the
compressibility of the code.)

You can capture log and trace data as your handler code executes. For more information, see
[Logging, tracing, and metrics](../logging-tracing/logging-tracing-overview.md).

> **Note:**
>
> * The same rules around [caller’s rights vs. owner’s rights](stored-procedures-rights.md) apply to these stored procedures.
> * The same considerations and guidelines in [Working with stored procedures](stored-procedures-usage.md) apply to Snowflake Scripting stored procedures.

The following is an example of a simple stored procedure that returns the value of the argument that is passed in:

```sqlexample
CREATE OR REPLACE PROCEDURE output_message(message VARCHAR)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
BEGIN
  RETURN message;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE output_message(message VARCHAR)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
$$
BEGIN
  RETURN message;
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL output_message('Hello World');
```

The following is an example of creating and calling an anonymous stored procedure by using the
[WITH … CALL …](../../sql-reference/sql/call-with.md) command:

```sqlexample
WITH anonymous_output_message AS PROCEDURE (message VARCHAR)
  RETURNS VARCHAR NOT NULL
  LANGUAGE SQL
  AS
  $$
  BEGIN
    RETURN message;
  END;
  $$
CALL anonymous_output_message('Hello World');
```

Note that in an anonymous stored procedure, you must use [string literal delimiters](../../sql-reference/data-types-text.md) (`'`
or `$$`) around the body of the procedure.

## Using arguments passed to a stored procedure

If you pass in any arguments to your stored procedure, you can refer to those arguments by name in any Snowflake Scripting
expression. Snowflake Scripting stored procedures support input (IN) and output (OUT) arguments.

When you specify an output argument in the definition of a Snowflake Scripting stored procedure, the stored procedure
can return the current value of the output argument to a calling program, such as an anonymous block or a different
stored procedure. The stored procedure takes an initial value for the output argument, saves the value to
a variable in the procedure body, and optionally performs operations to change the value of the variable, before
returning the updated value to the calling program.

For example, a salesperson’s user identifier and a sales quarter can be passed to a stored procedure named
`emp_quarter_calling_sp_demo`. This stored procedure calls a different stored procedure named
`sales_total_out_sp_demo`. The `sales_total_out_sp_demo` stored procedure has an output argument that
performs operations to return the salesperson’s total sales for the quarter to the calling stored procedure
`emp_quarter_calling_sp_demo`. For an example of this scenario, see
Using an output argument to return the total sales for an employee in a quarter.

When there is a mismatch between the data type of the value being passed in and the data type of the output argument,
supported coercions are performed automatically. For an example, see Using an output argument with a different data type than the input value from a calling procedure.
For information about which coercions Snowflake can perform automatically, see [Data types that can be cast](../../sql-reference/data-type-conversion.md).

The [GET_DDL](../../sql-reference/functions/get_ddl.md) function and the [SHOW PROCEDURES](../../sql-reference/sql/show-procedures.md) command show the
type (either `IN` or `OUT`) of a stored procedure’s arguments in output. Other commands and views that show
metadata about stored procedures don’t show the type of the arguments, such as the [DESCRIBE PROCEDURE](../../sql-reference/sql/desc-procedure.md)
command, the Information Schema [PROCEDURES view](../../sql-reference/info-schema/procedures.md), and the Account Usage
[PROCEDURES view](../../sql-reference/account-usage/procedures.md).

A stored procedure can’t be overloaded by specifying different argument types in the signature. For example, assume a stored
procedure has this signature:

```sqlexample
CREATE PROCEDURE test_overloading(a IN NUMBER)
```

The following CREATE PROCEDURE command fails with an error stating that the procedure already exists, because it tries to create
a new stored procedure that differs from the previous example only in the argument type:

```sqlexample
CREATE PROCEDURE test_overloading(a OUT NUMBER)
```

### Syntax

Use the following syntax to specify an argument in a Snowflake Scripting stored procedure definition:

```sqlsyntax
<arg_name> [ { IN | INPUT | OUT | OUTPUT } ] <arg_data_type>
```

Where:

`arg_name`
:   The name of the argument. The name must follow the naming rules for [Object identifiers](../../sql-reference/identifiers.md).

`{ IN | INPUT | OUT | OUTPUT }`
:   Optional keyword that specifies whether the argument is an input argument or an output argument.

    * `IN` or `INPUT` - The argument is initialized with the supplied value, and this value is assigned to a stored procedure
      variable. The variable can be modified in the stored procedure body, but its final value can’t be passed to a calling
      program.

      `IN` and `INPUT` are synonymous.
    * `OUT` or `OUTPUT` - The argument is initialized with the supplied value, and this value is assigned to a stored procedure
      variable. The variable can be modified in the stored procedure body, and its final value can be passed to a calling
      program. In a stored procedure body, output arguments can only be assigned values by using variables.

      Output arguments can also be passed uninitialized variables. When the associated variable is unassigned, the output
      argument returns NULL.

      `OUT` and `OUTPUT` are synonymous.

    Default: `IN`

`arg_data_type`
:   A [SQL data type](../../sql-reference-data-types.md).

### Limitations

* Output arguments must be specified in a stored procedure’s definition.
* Output arguments can’t be specified as [optional arguments](../udf-stored-procedure-arguments.md). That is,
  output arguments can’t be specified using the DEFAULT keyword.
* In the body of a stored procedure, variables must be used to assign values to output arguments.
* The same variable can’t be used for multiple output arguments.
* Session variables can’t be passed to output arguments.
* User-defined functions (UDFs) don’t support output arguments.
* Stored procedures written in languages other than SQL don’t support output arguments.
* Output arguments can’t be used in [asynchronous child jobs](../snowflake-scripting/asynchronous-child-jobs.md).
* Stored procedures are limited to 500 arguments, including both input and output arguments.

### Examples

* Simple example of using arguments passed to a stored procedure
* Using an argument in a SQL statement (binding)
* Using an argument as an object identifier
* Using an argument when building a string for a SQL statement
* Using an output argument to return a single value
* Using output arguments to return several values for multiple calls to a stored procedure
* Using an output argument with a different data type than the input value from a calling procedure
* Using an output argument to return the total sales for an employee in a quarter

#### Simple example of using arguments passed to a stored procedure

The following stored procedure uses the values of the arguments in [IF](../../sql-reference/snowflake-scripting/if.md) and
[RETURN](../../sql-reference/snowflake-scripting/return.md) statements.

```sqlexample
CREATE OR REPLACE PROCEDURE return_greater(number_1 INTEGER, number_2 INTEGER)
RETURNS INTEGER NOT NULL
LANGUAGE SQL
AS
BEGIN
  IF (number_1 > number_2) THEN
    RETURN number_1;
  ELSE
    RETURN number_2;
  END IF;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE return_greater(number_1 INTEGER, number_2 INTEGER)
RETURNS INTEGER NOT NULL
LANGUAGE SQL
AS
$$
BEGIN
  IF (number_1 > number_2) THEN
    RETURN number_1;
  ELSE
    RETURN number_2;
  END IF;
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL return_greater(2, 3);
```

#### Using an argument in a SQL statement (binding)

As is the case with Snowflake Scripting variables, if you need to use an argument in a SQL statement, put a colon (`:`) in front
of the argument name. For more information, see [Using a variable in a SQL statement (binding)](../snowflake-scripting/variables.md).

The following sections contain examples that use bind variables in stored procedures:

* Example that uses a bind variable in a WHERE clause
* Example of using a bind variable to set the value of a property
* Example that uses bind variables to set parameters in a command
* Examples that use a bind variable for an array

##### Example that uses a bind variable in a WHERE clause

The following stored procedure uses the `id` argument in the WHERE clause of a SELECT statement. In the WHERE
clause, the argument is specified as `:id`.

```sqlexample
CREATE OR REPLACE PROCEDURE find_invoice_by_id(id VARCHAR)
RETURNS TABLE (id INTEGER, price NUMBER(12,2))
LANGUAGE SQL
AS
DECLARE
  res RESULTSET DEFAULT (SELECT * FROM invoices WHERE id = :id);
BEGIN
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE find_invoice_by_id(id VARCHAR)
RETURNS TABLE (id INTEGER, price NUMBER(12,2))
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET DEFAULT (SELECT * FROM invoices WHERE id = :id);
BEGIN
  RETURN TABLE(res);
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL find_invoice_by_id('2');
```

In addition, the [TO_QUERY](../../sql-reference/functions/to_query.md) function provides a simple syntax for accepting a SQL string
directly in the FROM clause of a SELECT statement. For a comparison of the TO_QUERY function with dynamic SQL,
see [Constructing SQL at runtime](../../user-guide/querying-construct-at-runtime.md).

##### Example of using a bind variable to set the value of a property

The following stored procedure uses the `comment` argument to add a comment for a table in a
CREATE TABLE statement. In the statement, the argument is specified as `:comment`.

```sqlexample
CREATE OR REPLACE PROCEDURE test_bind_comment(comment VARCHAR)
RETURNS STRING
LANGUAGE SQL
AS
BEGIN
  CREATE OR REPLACE TABLE test_table_with_comment(a VARCHAR, n NUMBER) COMMENT = :comment;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_bind_comment(comment VARCHAR)
RETURNS STRING
LANGUAGE SQL
AS
$$
BEGIN
  CREATE OR REPLACE TABLE test_table_with_comment(a VARCHAR, n NUMBER) COMMENT = :comment;
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL test_bind_comment('My Test Table');
```

View the comment for the table by querying the [TABLES view](../../sql-reference/info-schema/tables.md)
in the INFORMATION_SCHEMA:

```sqlexample
SELECT comment FROM information_schema.tables WHERE table_name='TEST_TABLE_WITH_COMMENT';
```

```output
+---------------+
| COMMENT       |
|---------------|
| My Test Table |
+---------------+
```

You can also view the comment by running a [SHOW TABLES](../../sql-reference/sql/show-tables.md) command.

##### Example that uses bind variables to set parameters in a command

Assume you have a stage named `st` with CSV files:

```sqlexample
CREATE OR REPLACE STAGE st;
PUT file://good_data.csv @st;
PUT file://errors_data.csv @st;
```

You want to load the data in the CSV files into a table named `test_bind_stage_and_load`:

```sqlexample
CREATE OR REPLACE TABLE test_bind_stage_and_load (a VARCHAR, b VARCHAR, c VARCHAR);
```

The following stored procedure uses the FROM, ON_ERROR, and VALIDATION_MODE parameters in
a [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) statement. In the statement, the parameter values are specified as
`:my_stage_name`, `:on_error`, and `:valid_mode`, respectively.

```sqlexample
CREATE OR REPLACE PROCEDURE test_copy_files_validate(
  my_stage_name VARCHAR,
  on_error VARCHAR,
  valid_mode VARCHAR)
RETURNS STRING
LANGUAGE SQL
AS
BEGIN
  COPY INTO test_bind_stage_and_load
    FROM :my_stage_name
    ON_ERROR=:on_error
    FILE_FORMAT=(type='csv')
    VALIDATION_MODE=:valid_mode;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE test_copy_files_validate(
  my_stage_name VARCHAR,
  on_error VARCHAR,
  valid_mode VARCHAR)
RETURNS STRING
LANGUAGE SQL
AS
$$
BEGIN
  COPY INTO test_bind_stage_and_load
    FROM :my_stage_name
    ON_ERROR=:on_error
    FILE_FORMAT=(type='csv')
    VALIDATION_MODE=:valid_mode;
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL test_copy_files_validate('@st', 'skip_file', 'return_all_errors');
```

##### Examples that use a bind variable for an array

You can expand a bind variable that represents an [array](../../sql-reference/data-types-semistructured.md) into a list of individual values
by using the spread operator (`**`). For more information and examples, see [Expansion operators](../../sql-reference/operators-expansion.md).

#### Using an argument as an object identifier

If you need to use an argument to refer to an object (for example, a table name in the FROM clause of a SELECT statement), use the
[IDENTIFIER](../../sql-reference/identifier-literal.md) keyword to indicate that the argument represents an object identifier. For
example:

```sqlexample
CREATE OR REPLACE PROCEDURE get_row_count(table_name VARCHAR)
RETURNS INTEGER NOT NULL
LANGUAGE SQL
AS
DECLARE
  row_count INTEGER DEFAULT 0;
  res RESULTSET DEFAULT (SELECT COUNT(*) AS COUNT FROM IDENTIFIER(:table_name));
  c1 CURSOR FOR res;
BEGIN
  FOR row_variable IN c1 DO
    row_count := row_variable.count;
  END FOR;
  RETURN row_count;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_row_count(table_name VARCHAR)
RETURNS INTEGER NOT NULL
LANGUAGE SQL
AS
$$
DECLARE
  row_count INTEGER DEFAULT 0;
  res RESULTSET DEFAULT (SELECT COUNT(*) AS COUNT FROM IDENTIFIER(:table_name));
  c1 CURSOR FOR res;
BEGIN
  FOR row_variable IN c1 DO
    row_count := row_variable.count;
  END FOR;
  RETURN row_count;
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL get_row_count('invoices');
```

The following example executes a CREATE TABLE … AS SELECT (CTAS) statement in a stored procedure based on
the table names provided in arguments.

```sqlexample
CREATE OR REPLACE PROCEDURE ctas_sp(existing_table VARCHAR, new_table VARCHAR)
  RETURNS TEXT
  LANGUAGE SQL
AS
BEGIN
  CREATE OR REPLACE TABLE IDENTIFIER(:new_table) AS
    SELECT * FROM IDENTIFIER(:existing_table);
  RETURN 'Table created';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE ctas_sp(existing_table VARCHAR, new_table VARCHAR)
  RETURNS TEXT
  LANGUAGE SQL
AS
$$
BEGIN
  CREATE OR REPLACE TABLE IDENTIFIER(:new_table) AS
    SELECT * FROM IDENTIFIER(:existing_table);
  RETURN 'Table created';
END;
$$
;
```

Before calling the procedure, create a simple table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_table_for_ctas_sp (
  id NUMBER(2),
  v  VARCHAR(2))
AS SELECT
  column1,
  column2,
FROM
  VALUES
    (1, 'a'),
    (2, 'b'),
    (3, 'c');
```

Call the stored procedure to create a new table that is based on this table:

```sqlexample
CALL ctas_sp('test_table_for_ctas_sp', 'test_table_for_ctas_sp_backup');
```

#### Using an argument when building a string for a SQL statement

Note that if you are building a SQL statement as a string to be passed to
[EXECUTE IMMEDIATE](../../sql-reference/sql/execute-immediate.md) (see [Assigning a query to a declared RESULTSET](../snowflake-scripting/resultsets.md)), do not prefix the argument with a
colon. For example:

```sqlexample
CREATE OR REPLACE PROCEDURE find_invoice_by_id_via_execute_immediate(id VARCHAR)
RETURNS TABLE (id INTEGER, price NUMBER(12,2))
LANGUAGE SQL
AS
DECLARE
  select_statement VARCHAR;
  res RESULTSET;
BEGIN
  select_statement := 'SELECT * FROM invoices WHERE id = ' || id;
  res := (EXECUTE IMMEDIATE :select_statement);
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE find_invoice_by_id_via_execute_immediate(id VARCHAR)
RETURNS TABLE (id INTEGER, price NUMBER(12,2))
LANGUAGE SQL
AS
$$
DECLARE
  select_statement VARCHAR;
  res RESULTSET;
BEGIN
  select_statement := 'SELECT * FROM invoices WHERE id = ' || id;
  res := (EXECUTE IMMEDIATE :select_statement);
  RETURN TABLE(res);
END;
$$
;
```

#### Using an output argument to return a single value

The following example creates the stored procedure `simple_out_sp_demo` with the output argument `xout` in
its definition. The stored procedure sets the value of `xout` to `2`.

```sqlexample
CREATE OR REPLACE PROCEDURE simple_out_sp_demo(xout OUT NUMBER)
  RETURNS STRING
  LANGUAGE SQL
AS
BEGIN
  xout := 2;
  RETURN 'Done';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE simple_out_sp_demo(xout OUT NUMBER)
  RETURNS STRING
  LANGUAGE SQL
AS
$$
BEGIN
  xout := 2;
  RETURN 'Done';
END;
$$
;
```

The following anonymous block sets the value of the `x` variable to `1`. Then, it calls the `simple_out_sp_demo`
stored procedure and specifies the variable as the argument.

```sqlexample
BEGIN
  LET x := 1;
  CALL simple_out_sp_demo(:x);
  RETURN x;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  LET x := 1;
  CALL simple_out_sp_demo(:x);
  RETURN x;
END;
$$
;
```

The output shows that the `simple_out_sp_demo` stored procedure performed an operation to set the value of the
output argument to `2` and then returned this value to the anonymous block.

```output
+-----------------+
| anonymous block |
|-----------------|
|               2 |
+-----------------+
```

The following anonymous block calls `simple_out_sp_demo` stored procedure and returns an error, because it tries to
assign a value to the output argument using an expression instead of a variable.

```sqlexample
BEGIN
  LET x := 1;
  CALL simple_out_sp_demo(:x + 2);
  RETURN x;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  LET x := 1;
  CALL simple_out_sp_demo(:x + 2);
  RETURN x;
END;
$$
;
```

#### Using output arguments to return several values for multiple calls to a stored procedure

The following example demonstrates the following behavior related to stored procedures and
input and output arguments:

* A stored procedure can have several input and output arguments in its definition.
* A program can call a stored procedure with output arguments multiple times, and the values of the
  output arguments are preserved after each call.
* Input arguments don’t return values to the calling program.

Create the stored procedure `multiple_out_sp_demo` with multiple input and output arguments in its
definition. The stored procedure performs the same operations on the equivalent input and output arguments.
For example, the stored procedure adds `1` to the `p1_in` input argument and to the `p1_out` output
argument.

```sqlexample
CREATE OR REPLACE PROCEDURE multiple_out_sp_demo(
    p1_in NUMBER,
    p1_out OUT NUMBER,
    p2_in VARCHAR(100),
    p2_out OUT VARCHAR(100),
    p3_in BOOLEAN,
    p3_out OUT BOOLEAN)
  RETURNS NUMBER
  LANGUAGE SQL
AS
BEGIN
  p1_in := p1_in + 1;
  p1_out := p1_out + 1;
  p2_in := p2_in || ' hi ';
  p2_out := p2_out || ' hi ';
  p3_in := (NOT p3_in);
  p3_out := (NOT p3_out);
  RETURN 1;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE multiple_out_sp_demo(
    p1_in NUMBER,
    p1_out OUT NUMBER,
    p2_in VARCHAR(100),
    p2_out OUT VARCHAR(100),
    p3_in BOOLEAN,
    p3_out OUT BOOLEAN)
  RETURNS NUMBER
  LANGUAGE SQL
AS
$$
BEGIN
  p1_in := p1_in + 1;
  p1_out := p1_out + 1;
  p2_in := p2_in || ' hi ';
  p2_out := p2_out || ' hi ';
  p3_in := (NOT p3_in);
  p3_out := (NOT p3_out);
  RETURN 1;
END;
$$
;
```

The following anonymous block assigns values to the variables that correspond to the arguments of the
`multiple_out_sp_demo` stored procedure and then calls the stored procedure multiple times. The first
call uses the variable values specified in the anonymous block, but each subsequent call uses the values
returned by the output arguments in the `multiple_out_sp_demo` stored procedure.

```sqlexample
BEGIN
  LET x_in INT := 1;
  LET x_out INT := 1;
  LET y_in VARCHAR(100) := 'hello';
  LET y_out VARCHAR(100) := 'hello';
  LET z_in BOOLEAN := true;
  LET z_out BOOLEAN := true;

  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  RETURN [x_in, x_out, y_in, y_out, z_in, z_out];
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  LET x_in INT := 1;
  LET x_out INT := 1;
  LET y_in VARCHAR(100) := 'hello';
  LET y_out VARCHAR(100) := 'hello';
  LET z_in BOOLEAN := true;
  LET z_out BOOLEAN := true;

  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  CALL multiple_out_sp_demo(:x_in, :x_out, :y_in, :y_out, :z_in, :z_out);
  RETURN [x_in, x_out, y_in, y_out, z_in, z_out];
END;
$$
;
```

```output
+------------------------+
| anonymous block        |
|------------------------|
| [                      |
|   1,                   |
|   4,                   |
|   "hello",             |
|   "hello hi  hi  hi ", |
|   true,                |
|   false                |
| ]                      |
+------------------------+
```

#### Using an output argument with a different data type than the input value from a calling procedure

For some use cases, there might be a mismatch between the data type of the value being passed in to a stored
procedure and the data type of the procedure’s output argument. In these cases,
[supported coercions](../../sql-reference/data-type-conversion.md) are performed automatically.

> **Note:**
>
> Although coercion is supported in some cases, it isn’t recommended.

This example demonstrates automatic conversion of a FLOAT value that is passed to an output argument with
a NUMBER data type. The FLOAT value is automatically converted to a NUMBER value and then passed back to the
calling anonymous block.

Create the `sp_out_coercion` stored procedure, which takes an output argument of type NUMBER:

```sqlexample
CREATE OR REPLACE PROCEDURE sp_out_coercion(x OUT NUMBER)
  RETURNS STRING
  LANGUAGE SQL
AS
BEGIN
  x := x * 2;
  RETURN 'Done';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE sp_out_coercion(x OUT NUMBER)
  RETURNS STRING
  LANGUAGE SQL
AS
$$
BEGIN
  x := x * 2;
  RETURN 'Done';
END;
$$
;
```

Execute an anonymous block that passes a FLOAT value to the `sp_out_coercion` stored procedure:

```sqlexample
BEGIN
  LET a FLOAT := 500.662;
  CALL sp_out_coercion(:a);
  RETURN a || ' (Type ' || SYSTEM$TYPEOF(a) || ')';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  LET a FLOAT := 500.662;
  CALL sp_out_coercion(:a);
  RETURN a || ' (Type ' || SYSTEM$TYPEOF(a) || ')';
END;
$$
;
```

The output shows both the returned value and the data type of the returned value, by calling the
[SYSTEM$TYPEOF](../../sql-reference/functions/system_typeof.md) function. Note that the value is coerced from a
NUMBER value back to a FLOAT value after it is returned from the stored procedure:

```output
+---------------------------+
| anonymous block           |
|---------------------------|
| 1002 (Type FLOAT[DOUBLE]) |
+---------------------------+
```

#### Using an output argument to return the total sales for an employee in a quarter

This example uses the following `quarterly_sales` table:

```sqlexample
CREATE OR REPLACE TABLE quarterly_sales(
  empid INT,
  amount INT,
  quarter TEXT)
  AS SELECT * FROM VALUES
    (1, 10000, '2023_Q1'),
    (1, 400, '2023_Q1'),
    (2, 4500, '2023_Q1'),
    (2, 35000, '2023_Q1'),
    (1, 5000, '2023_Q2'),
    (1, 3000, '2023_Q2'),
    (2, 200, '2023_Q2'),
    (2, 90500, '2023_Q2'),
    (1, 6000, '2023_Q3'),
    (1, 5000, '2023_Q3'),
    (2, 2500, '2023_Q3'),
    (2, 9500, '2023_Q3'),
    (3, 2700, '2023_Q3'),
    (1, 8000, '2023_Q4'),
    (1, 10000, '2023_Q4'),
    (2, 800, '2023_Q4'),
    (2, 4500, '2023_Q4'),
    (3, 2700, '2023_Q4'),
    (3, 16000, '2023_Q4'),
    (3, 10200, '2023_Q4');
```

Create the stored procedure `sales_total_out_sp_demo` that takes two input arguments for the
employee identifier and quarter, and one output argument to calculate the sales total for the
given employee and quarter.

```sqlexample
CREATE OR REPLACE PROCEDURE sales_total_out_sp_demo(
    id INT,
    quarter VARCHAR(20),
    total_sales OUT NUMBER(38,0))
  RETURNS STRING
  LANGUAGE SQL
AS
$$
BEGIN
  SELECT SUM(amount) INTO total_sales FROM quarterly_sales
    WHERE empid = :id AND
          quarter = :quarter;
  RETURN 'Done';
END;
$$
;
```

Create the stored procedure `emp_quarter_calling_sp_demo` that calls the `sales_total_out_sp_demo`
stored procedure. This stored procedure also takes two input arguments for the employee identifier and quarter.

```sqlexample
CREATE OR REPLACE PROCEDURE emp_quarter_calling_sp_demo(
    id INT,
    quarter VARCHAR(20))
  RETURNS STRING
  LANGUAGE SQL
AS
BEGIN
  LET x NUMBER(38,0);
  CALL sales_total_out_sp_demo(:id, :quarter, :x);
  RETURN 'Total sales for employee ' || id || ' in quarter ' || quarter || ': ' || x;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE emp_quarter_calling_sp_demo(
    id INT,
    quarter VARCHAR(20))
  RETURNS STRING
  LANGUAGE SQL
AS
$$
BEGIN
  LET x NUMBER(38,0);
  CALL sales_total_out_sp_demo(:id, :quarter, :x);
  RETURN 'Total sales for employee ' || id || ' in quarter ' || quarter || ': ' || x;
END;
$$
;
```

Call the `emp_quarter_calling_sp_demo` with the arguments `2` (for the employee identifier) and
`'2023_Q4'` (for the quarter).

```sqlexample
CALL emp_quarter_calling_sp_demo(2, '2023_Q4');
```

```output
+-----------------------------------------------------+
| emp_quarter_calling_sp_demo                         |
|-----------------------------------------------------|
| Total sales for employee 2 in quarter 2023_Q4: 5300 |
+-----------------------------------------------------+
```

## Returning tabular data

If you need to return tabular data (for example, data from a RESULTSET) from your stored procedure, specify
RETURNS TABLE(…) in your [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) statement.

If you know the [Snowflake data types](../../sql-reference-data-types.md) of the columns in the returned table, specify the column
names and types in the RETURNS TABLE().

```sqlexample
CREATE OR REPLACE PROCEDURE get_top_sales()
RETURNS TABLE (sales_date DATE, quantity NUMBER)
...
```

Otherwise (for example, if you are determining the column types during run time), you can omit the column names and types:

```sqlexample
CREATE OR REPLACE PROCEDURE get_top_sales()
RETURNS TABLE ()
...
```

> **Note:**
>
> Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
> applies whether you are creating a stored or anonymous procedure.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE(g GEOGRAPHY)
>   ...
> CALL test_return_geography_table_1();
> ```
>
> If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
>
> ```none
> Stored procedure execution error: data type of returned table does not match expected returned table type
> ```
>
> To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
>   RETURNS TABLE()
>   ...
> ```
>
> ```sqlexample
> WITH test_return_geography_table_1() AS PROCEDURE
>   RETURNS TABLE()
>   ...
> CALL test_return_geography_table_1();
> ```

If you need to return the data in a RESULTSET, use TABLE() in your
[RETURN](../../sql-reference/snowflake-scripting/return.md) statement.

For example:

```sqlexample
CREATE OR REPLACE PROCEDURE get_top_sales()
RETURNS TABLE (sales_date DATE, quantity NUMBER)
LANGUAGE SQL
AS
DECLARE
  res RESULTSET DEFAULT (SELECT sales_date, quantity FROM sales ORDER BY quantity DESC LIMIT 10);
BEGIN
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_top_sales()
RETURNS TABLE (sales_date DATE, quantity NUMBER)
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET DEFAULT (SELECT sales_date, quantity FROM sales ORDER BY quantity DESC LIMIT 10);
BEGIN
  RETURN TABLE(res);
END;
$$
;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL get_top_sales();
```

## Calling a stored procedure from another stored procedure

In a stored procedure, if you need to call another stored procedure, use one of the following approaches:

* Calling a stored procedure without using the returned value
* Using the value returned from a stored procedure call
* Passing output argument values from a stored procedure to a calling stored procedure

### Calling a stored procedure without using the returned value

Use a [CALL](../../sql-reference/sql/call.md) statement to call the stored procedure (as you normally would).

If you need to pass in any variables or arguments as input arguments in the CALL statement, remember to use a colon (`:`) in
front of the variable name. (See [Using a variable in a SQL statement (binding)](../snowflake-scripting/variables.md).)

The following is an example of a stored procedure that calls another stored procedure but does not depend on the return value.

First, create a table for use in the example:

```sqlexample
-- Create a table for use in the example.
CREATE OR REPLACE TABLE int_table (value INTEGER);
```

Then, create the stored procedure that you will call from another stored procedure:

```sqlexample
-- Create a stored procedure to be called from another stored procedure.
CREATE OR REPLACE PROCEDURE insert_value(value INTEGER)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
BEGIN
  INSERT INTO int_table VALUES (:value);
  RETURN 'Rows inserted: ' || SQLROWCOUNT;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
-- Create a stored procedure to be called from another stored procedure.
CREATE OR REPLACE PROCEDURE insert_value(value INTEGER)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
$$
BEGIN
  INSERT INTO int_table VALUES (:value);
  RETURN 'Rows inserted: ' || SQLROWCOUNT;
END;
$$
;
```

Next, create a second stored procedure that calls the first stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE insert_two_values(value1 INTEGER, value2 INTEGER)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
BEGIN
  CALL insert_value(:value1);
  CALL insert_value(:value2);
  RETURN 'Finished calling stored procedures';
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE insert_two_values(value1 INTEGER, value2 INTEGER)
RETURNS VARCHAR NOT NULL
LANGUAGE SQL
AS
$$
BEGIN
  CALL insert_value(:value1);
  CALL insert_value(:value2);
  RETURN 'Finished calling stored procedures';
END;
$$
;
```

Finally, call the second stored procedure:

```sqlexample
CALL insert_two_values(4, 5);
```

### Using the value returned from a stored procedure call

If you are calling a stored procedure that returns a scalar value, and you need to access that value, use the
`INTO :snowflake_scripting_variable` clause in the [CALL](../../sql-reference/sql/call.md) statement to capture the value in a
[Snowflake Scripting variable](../snowflake-scripting/variables.md).

The following example calls the `get_row_count` stored procedure that was defined in
Using an argument as an object identifier.

```sqlexample
CREATE OR REPLACE PROCEDURE count_greater_than(table_name VARCHAR, maximum_count INTEGER)
  RETURNS BOOLEAN NOT NULL
  LANGUAGE SQL
  AS
  DECLARE
    count1 NUMBER;
  BEGIN
    CALL get_row_count(:table_name) INTO :count1;
    IF (:count1 > maximum_count) THEN
      RETURN TRUE;
    ELSE
      RETURN FALSE;
    END IF;
  END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE count_greater_than(table_name VARCHAR, maximum_count INTEGER)
  RETURNS BOOLEAN NOT NULL
  LANGUAGE SQL
  AS
  $$
  DECLARE
    count1 NUMBER;
  BEGIN
    CALL get_row_count(:table_name) INTO :count1;
    IF (:count1 > maximum_count) THEN
      RETURN TRUE;
    ELSE
      RETURN FALSE;
    END IF;
  END;
  $$
  ;
```

The following is an example of calling the stored procedure:

```sqlexample
CALL count_greater_than('invoices', 3);
```

If the stored procedure returns a table, you can capture the return value by setting a
[RESULTSET](../snowflake-scripting/resultsets.md) to a string containing the CALL statement. (See
[Assigning a query to a declared RESULTSET](../snowflake-scripting/resultsets.md).)

To retrieve the return value from the call, you can use a
[CURSOR for the RESULTSET](../snowflake-scripting/resultsets.md). For example:

```sqlexample
DECLARE
  res1 RESULTSET;
BEGIN
res1 := (CALL my_procedure());
LET c1 CURSOR FOR res1;
FOR row_variable IN c1 DO
  IF (row_variable.col1 > 0) THEN
    ...;
  ELSE
    ...;
  END IF;
END FOR;
...
```

### Passing output argument values from a stored procedure to a calling stored procedure

When an output argument is specified in the definition of a Snowflake Scripting stored procedure, the stored procedure
can return the current value of the output argument to a calling stored procedure. The stored procedure takes an initial value
for the output argument, saves the value to a variable in the procedure body, and optionally performs operations to change the
value of the variable. The stored procedure then returns the updated value to the calling stored procedure.

For an example, see Using an output argument to return the total sales for an employee in a quarter.

## Using nested stored procedures

A *nested stored procedure* is a stored procedure that’s defined within the scope of an anonymous block or
a block in another stored procedure (the *parent stored procedure*).

You declare a nested stored procedure in the [DECLARE](../../sql-reference/snowflake-scripting/declare.md) section
of a block, which can be part of a [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) statement. The following example
shows a nested stored procedure declaration:

```sqlsyntax
DECLARE
  <nested_stored_procedure_name> PROCEDURE (<arguments>)
     RETURNS <data_type>
     AS
     BEGIN
       <nested_procedure_procedure_statements>
     END;
BEGIN
  <statements>
END;
```

For information about the declaration syntax of a nested stored procedure, see
[Nested stored procedure declaration syntax](../../sql-reference/snowflake-scripting/declare.md).

A nested stored procedure only exists within the scope of its [block](../snowflake-scripting/blocks.md).
It can be called from any section of its block (DECLARE, BEGIN … END, and EXCEPTION). A single block can contain
multiple nested stored procedures, and one nested stored procedure can call another nested stored procedure in the
same block. A nested procedure can’t be called or accessed from outside of its block.

A nested stored procedure operates in the same security context as the block that defines it. When a nested stored
procedure is defined in a parent stored procedure, it automatically runs with the same privileges as the parent
stored procedure.

> **Note:**
>
> Both a nested stored procedure declaration and the [CALL WITH](../../sql-reference/sql/call-with.md) command
> create a temporary stored procedure with limited scope. They differ in the following ways:
>
> * A CALL WITH statement can appear anywhere that a SQL statement can, including within a stored procedure, but a
>   nested stored procedure declaration must be in a Snowflake Scripting block.
> * A CALL WITH stored procedure only exists in the scope of its statement, but a nested stored procedure exists in
>   the scope of its Snowflake Scripting block.

### Benefits of nested stored procedures

Nested stored procedures provide the following benefits:

* They can enhance and simplify security by encapsulating logic inside an anonymous block or parent stored procedure,
  which prevents access to it from outside the block or parent.
* They keep code modular by splitting it logically into smaller chunks, which can make it easier to maintain and
  debug.
* They improve maintainability by reducing the need for global variables or additional arguments, because a nested stored
  procedure can directly access the local variables of its block.

### Usage notes for calling nested stored procedures

The following usage notes apply to calling a nested stored procedure:

* To pass arguments to a nested stored procedure, a block can use constant values,
  [Snowflake Scripting variables](../snowflake-scripting/variables.md),
  [bind variables](../../sql-reference/bind-variables.md), [SQL (session) variables](../../sql-reference/session-variables.md),
  and calls to [user-defined functions](../udf/udf-overview.md).
* When there is a mismatch between the data type of the value being passed in and the data type of an argument,
  Snowflake performs supported coercions automatically. For information about which coercions Snowflake can perform
  automatically, see [Data type conversion](../../sql-reference/data-type-conversion.md).

### Usage notes for variables in a nested stored procedure

The following usage notes apply to variables in a nested stored procedure:

* A nested stored procedure can reference variables from its block that were declared before the nested
  stored procedure declaration in the DECLARE section of its block. It can’t reference variables declared
  after it in the DECLARE section.
* A nested stored procedure can’t access variables declared in a LET statement in the BEGIN … END
  section of a block.
* The value of a referenced variable reflects its value at the time when the nested stored procedure is called.
* A nested stored procedure can modify a referenced variable value, and the modified value persists in the block
  and across multiple invocations of the same nested procedure in a single execution
  of its anonymous block or in a single call to its parent stored procedure.
* The value of a variable that was declared before a nested stored procedure call can be passed as an argument to
  the nested stored procedure. The variable value can be passed as an argument in a call even if the variable
  was declared after the nested stored procedure declaration or in a LET statement.

For example, the following stored procedure declares several variables:

```sqlexample
CREATE OR REPLACE PROCEDURE outer_sp ()
RETURNS NUMBER
LANGUAGE SQL
AS
$$
DECLARE
  var_before_nested_proc NUMBER DEFAULT 1;
  test_nested_variables PROCEDURE(arg1 NUMBER)
    -- <nested_sp_logic>
  var_after_nested_proc NUMBER DEFAULT 2;
BEGIN
  LET var_let_before_call NUMBER DEFAULT 3;
  LET result := CALL nested_proc(:<var_name>);
  LET var_let_after_call NUMBER DEFAULT 3;
  RETURN result;
END;
$$;
```

In this example, only `var_before_nested_proc` can be referenced in `nested_sp_logic`.

In the nested stored procedure call, the value of any of the following variables can be passed to the nested stored
procedure as an argument in `var_name`:

* `var_before_nested_proc`
* `var_after_nested_proc`
* `var_let_before_call`

The value of `var_let_after_call` can’t be passed to the nested stored procedure as an argument.

### Limitations for nested stored procedures

The following limitations apply to defining nested stored procedures:

* They can’t be defined inside other nested stored procedures or inside control structures, such as
  FOR or WHILE loops.
* Each nested stored procedure must have a unique name in its block. That is, nested stored procedures can’t
  be overloaded.
* They don’t support output (OUT) arguments.
* They don’t support optional arguments with default values.

The following limitations apply to calling nested stored procedures:

* They can’t be called in an [EXECUTE IMMEDIATE](../../sql-reference/sql/execute-immediate.md) statement.
* They can’t be called in [asynchronous child jobs](../snowflake-scripting/asynchronous-child-jobs.md).
* They don’t support named input arguments (`arg_name => arg`). Arguments must be
  specified by position. For more information, see [CALL](../../sql-reference/sql/call.md).

### Examples of nested stored procedures

The following examples use nested stored procedures:

* Define a nested stored procedure that returns tabular data
* Define a nested stored procedure that returns a scalar value
* Define a nested stored procedure in an anonymous block
* Define a nested stored procedure that is passed arguments
* Define a nested stored procedure that calls another nested stored procedure

#### Define a nested stored procedure that returns tabular data

The following example defines a nested stored procedure that returns a tabular data. The example creates a
parent stored procedure called `nested_procedure_example_table` with a nested stored procedure
called `nested_return_table`. The code includes the following logic:

* Declares a variable called `res` of type RESULTSET.
* Includes the following logic in the nested stored procedure:

  + Declares a variable called `res2`.
  + Inserts values into a table called `nested_table`.
  + Sets the `res2` variable to the results of a SELECT on the table.
  + Returns the tabular data in the result set.
* Creates the table `nested_table` in the parent stored procedure.
* Calls the nested stored procedure `nested_return_table` and sets the `res` variable to the results of the call
  to the nested stored procedure.
* Returns the tabular results in the `res` variable.

```sqlexample
CREATE OR REPLACE PROCEDURE nested_procedure_example_table()
RETURNS TABLE()
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET;
  nested_return_table PROCEDURE()
    RETURNS TABLE()
    AS
    DECLARE
      res2 RESULTSET;
    BEGIN
      INSERT INTO nested_table VALUES(1);
      INSERT INTO nested_table VALUES(2);
      res2 := (SELECT * FROM nested_table);
      RETURN TABLE(res2);
    END;
BEGIN
  CREATE OR REPLACE TABLE nested_table(col1 INT);
  res := (CALL nested_return_table());
  RETURN TABLE(res);
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL nested_procedure_example_table();
```

```output
+------+
| COL1 |
|------|
|    1 |
|    2 |
+------+
```

#### Define a nested stored procedure that returns a scalar value

The following example defines a nested stored procedure that returns a scalar value. The example creates a
parent stored procedure called `nested_procedure_example_scalar` with a nested stored procedure
called `simple_counter`. The code includes the following logic:

* Declares a variable called `counter` of type NUMBER, and sets the value of this variable to `0`.
* Specifies that the nested stored procedure adds `1` to the current value of the `counter` variable.
* Calls the nested stored procedure three times in the parent stored procedure. The value of the `counter`
  variable is carried over between invocations of the nested stored procedure.
* Returns the value of the `counter` variable, which is `3`.

```sqlexample
CREATE OR REPLACE PROCEDURE nested_procedure_example_scalar()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
DECLARE
  counter NUMBER := 0;
  simple_counter PROCEDURE()
    RETURNS VARCHAR
    AS
    BEGIN
      counter := counter + 1;
      RETURN counter;
    END;
BEGIN
  CALL simple_counter();
  CALL simple_counter();
  CALL simple_counter();
  RETURN counter;
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL nested_procedure_example_scalar();
```

```output
+---------------------------------+
| NESTED_PROCEDURE_EXAMPLE_SCALAR |
|---------------------------------|
| 3                               |
+---------------------------------+
```

#### Define a nested stored procedure in an anonymous block

The following example is the same as the example in Define a nested stored procedure that returns a scalar value,
except that it defines a nested stored procedure in an anonymous block instead of a stored procedure:

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  counter NUMBER := 0;
  simple_counter PROCEDURE()
    RETURNS VARCHAR
    AS
    BEGIN
      counter := counter + 1;
      RETURN counter;
    END;
BEGIN
  CALL simple_counter();
  CALL simple_counter();
  CALL simple_counter();
  RETURN counter;
END;
$$;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               3 |
+-----------------+
```

#### Define a nested stored procedure that is passed arguments

The following example defines a nested stored procedure that is passed arguments. In the example, the nested
stored procedure inserts values into the following table:

```sqlexample
CREATE OR REPLACE TABLE log_nested_values(col1 INT, col2 INT);
```

The example creates a parent stored procedure called `nested_procedure_example_arguments` with a nested stored procedure
called `log_and_multiply_numbers`. The nested stored procedure takes two arguments of type NUMBER. The code includes the
following logic:

* Declares variables `a`, `b`, and `x` of type NUMBER.
* Includes a nested stored procedure that performs the following actions:

  + Inserts the two number values passed to it by the parent stored procedure into the `log_nested_values` table
    using bind variables.
  + Sets the value of variable `x` to the result of multiplying the two argument values.
  + Returns the value of `x` to the parent stored procedure.
* Sets the value of variable `a` to `5` and variable `b` to `10`.
* Calls the nested stored procedure.
* Returns the value of the `x` variable, which was set in the nested stored procedure.

```sqlexample
CREATE OR REPLACE PROCEDURE nested_procedure_example_arguments()
RETURNS NUMBER
LANGUAGE SQL
AS
$$
DECLARE
  a NUMBER;
  b NUMBER;
  x NUMBER;
  log_and_multiply_numbers PROCEDURE(num1 NUMBER, num2 NUMBER)
    RETURNS NUMBER
    AS
    BEGIN
      INSERT INTO log_nested_values VALUES(:num1, :num2);
      x := :num1 * :num2;
      RETURN x;
    END;
BEGIN
  a := 5;
  b := 10;
  CALL log_and_multiply_numbers(:a, :b);
  RETURN x;
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL nested_procedure_example_arguments();
```

```output
+------------------------------------+
| NESTED_PROCEDURE_EXAMPLE_ARGUMENTS |
|------------------------------------|
|                                 50 |
+------------------------------------+
```

Query the `log_nested_values` table to confirm that the nested stored procedure inserted the
values passed to it:

```sqlexample
SELECT * FROM log_nested_values;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    5 |   10 |
+------+------+
```

#### Define a nested stored procedure that calls another nested stored procedure

The following example defines a nested stored procedure that calls another nested stored procedure. The example creates a
parent stored procedure called `nested_procedure_example_call_from_nested` with two nested stored procedures
called `counter_nested_proc` and `call_counter_nested_proc`. The code includes the following logic:

* Declares a variable called `counter` of type NUMBER, and sets the value of this variable to `0`.
* Includes the nested stored procedure `counter_nested_proc` that adds `10` to the value of `counter`.
* Includes the nested stored procedure `call_counter_nested_proc` that adds `15` to the value of `counter`
  and also calls `counter_nested_proc` (which adds another `10` to the value of `counter`).
* Calls both nested stored procedures in the parent stored procedure.
* Returns the value of the `counter` variable, which is `35`.

```sqlexample
CREATE OR REPLACE PROCEDURE nested_procedure_example_call_from_nested()
RETURNS NUMBER
LANGUAGE SQL
AS
$$
DECLARE
  counter NUMBER := 0;
  counter_nested_proc PROCEDURE()
    RETURNS NUMBER
    AS
    DECLARE
      var1 NUMBER := 10;
    BEGIN
      counter := counter + var1;
    END;
  call_counter_nested_proc PROCEDURE()
    RETURNS NUMBER
    AS
    DECLARE
      var2 NUMBER := 15;
    BEGIN
      counter := counter + var2;
      CALL counter_nested_proc();
    END;
BEGIN
  counter := 0;
  CALL counter_nested_proc();
  CALL call_counter_nested_proc();
  RETURN counter;
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL nested_procedure_example_call_from_nested();
```

```output
+-------------------------------------------+
| NESTED_PROCEDURE_EXAMPLE_CALL_FROM_NESTED |
|-------------------------------------------|
|                                        35 |
+-------------------------------------------+
```

## Using and setting SQL variables in a stored procedure

By default, Snowflake Scripting stored procedures run with owner’s rights. When a
stored procedure runs with owner’s rights, it can’t access
[SQL (or session) variables](../../sql-reference/session-variables.md).

However, a caller’s rights stored procedure can read the caller’s session variables and use
them in the logic of the stored procedure. For example, a caller’s rights stored procedure
can use the value in a SQL variable in a query. To create a stored procedure that runs with
caller’s rights, specify the `EXECUTE AS CALLER` parameter in the
[CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md) statement.

These examples illustrate this key difference between caller’s rights and owner’s rights stored
procedures. They attempt to use SQL variables in two ways:

* Set a SQL variable before calling the stored procedure, then use the SQL variable inside the stored
  procedure.
* Set a SQL variable inside the stored procedure, then use the SQL variable after returning from the stored
  procedure.

Both using the SQL variable and setting the SQL variable work correctly in a caller’s rights stored procedure.
Both fail when using an owner’s rights stored procedure, even if the caller is the owner.

For more information about owner’s rights and caller’s rights, see [Understanding caller’s rights and owner’s rights stored procedures](stored-procedures-rights.md).

### Using a SQL variable in a stored procedure

The following example uses a SQL variable in a stored procedure.

First, set a SQL variable in a session:

```sqlexample
SET example_use_variable = 2;
```

Create a simple stored procedure that runs with caller’s rights and uses this SQL variable:

```sqlexample
CREATE OR REPLACE PROCEDURE use_sql_variable_proc()
RETURNS NUMBER
LANGUAGE SQL
EXECUTE AS CALLER
AS
DECLARE
  sess_var_x_2 NUMBER;
BEGIN
  sess_var_x_2 := 2 * $example_use_variable;
  RETURN sess_var_x_2;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE use_sql_variable_proc()
RETURNS NUMBER
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
  sess_var_x_2 NUMBER;
BEGIN
  sess_var_x_2 := 2 * $example_use_variable;
  RETURN sess_var_x_2;
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL use_sql_variable_proc();
```

```output
+-----------------------+
| USE_SQL_VARIABLE_PROC |
|-----------------------|
|                     4 |
+-----------------------+
```

Set the SQL variable to a different value:

```sqlexample
SET example_use_variable = 9;
```

Call the procedure again to see that the returned value has changed:

```sqlexample
CALL use_sql_variable_proc();
```

```output
+-----------------------+
| USE_SQL_VARIABLE_PROC |
|-----------------------|
|                    18 |
+-----------------------+
```

### Setting a SQL variable in a stored procedure

You can set a SQL variable in a stored procedure that’s running with caller’s rights. For
more information, including guidelines for using SQL variables in stored procedures, see
[Caller’s rights stored procedures](stored-procedures-rights.md).

> **Note:**
>
> Although you can set a SQL variable inside a stored procedure and leave it set after the end of the procedure,
> Snowflake does not recommend doing this.

The following example sets a SQL variable in a stored procedure.

First, set a SQL variable in a session:

```sqlexample
SET example_set_variable = 55;
```

Confirm the value of the SQL variable:

```sqlexample
SHOW VARIABLES LIKE 'example_set_variable';
```

```output
+----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------+
|     session_id | created_on                    | updated_on                    | name                 | value | type  | comment |
|----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------|
| 10363782631910 | 2024-11-27 08:18:32.007 -0800 | 2024-11-27 08:20:17.255 -0800 | EXAMPLE_SET_VARIABLE | 55    | fixed |         |
+----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------+
```

For example, the following stored procedure sets the SQL variable `example_set_variable`
to a new value and returns the new value:

```sqlexample
CREATE OR REPLACE PROCEDURE set_sql_variable_proc()
RETURNS NUMBER
LANGUAGE SQL
EXECUTE AS CALLER
AS
BEGIN
  SET example_set_variable = $example_set_variable - 3;
  RETURN $example_set_variable;
END;
```

Note: If you use [Snowflake CLI](../snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE set_sql_variable_proc()
RETURNS NUMBER
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
BEGIN
  SET example_set_variable = $example_set_variable - 3;
  RETURN $example_set_variable;
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL set_sql_variable_proc();
```

```output
+-----------------------+
| SET_SQL_VARIABLE_PROC |
|-----------------------|
|                    52 |
+-----------------------+
```

Confirm the new value of the SQL variable:

```sqlexample
SHOW VARIABLES LIKE 'example_set_variable';
```

```output
+----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------+
|     session_id | created_on                    | updated_on                    | name                 | value | type  | comment |
|----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------|
| 10363782631910 | 2024-11-27 08:18:32.007 -0800 | 2024-11-27 08:24:04.027 -0800 | EXAMPLE_SET_VARIABLE | 52    | fixed |         |
+----------------+-------------------------------+-------------------------------+----------------------+-------+-------+---------+
```

---
title: Writing stored procedures with SQL and Python
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-overview.md
section: Developer Guide
---

# Writing stored procedures with SQL and Python

You can write a stored procedure whose handler is coded in Python. By using APIs from the [Snowpark library](../../snowpark/python/index.md)
within your handler, you can perform queries, updates, and other work on Snowflake tables.

With stored procedures, you can build and run your data pipeline within Snowflake, using a Snowflake warehouse
as the compute framework. Build your data pipeline by using the [Snowpark API for Python](../../snowpark/python/creating-sprocs.md)
to write stored procedures. To schedule the execution of these stored procedures, you use [tasks](../../../user-guide/tasks-intro.md).

For information about machine learning models and Snowpark Python, see [Training Machine Learning Models with Snowpark Python](../../snowpark/python/python-snowpark-training-ml.md).

You can write stored procedures for Python [using a Python worksheet](../../snowpark/python/python-worksheets.md),
or using a local development environment.

You can capture log and trace data as your handler code executes. For more information, refer to
[Logging, tracing, and metrics](../../logging-tracing/logging-tracing-overview.md).

> **Note:**
>
> To both create and call an anonymous procedure, use [CALL (with anonymous procedure)](../../../sql-reference/sql/call-with.md). Creating and calling an anonymous procedure does
> not require a role with CREATE PROCEDURE schema privileges.

## Prerequisites for writing stored procedures locally

To write Python stored procedures in your local development environment, meet the following prerequisites:

* You must use version 0.4.0 or a more recent version of the Snowpark library.
* Enable Anaconda Packages so that Snowpark Python can load the required third-party dependencies. Refer to Using third-party packages from Anaconda.
* The supported versions of Python are:

  Generally available versions:

  + 3.9 (deprecated)
  + 3.10
  + 3.11
  + 3.12
  + 3.13

Be sure to set up your development environment to use the Snowpark library.
Refer to [Setting Up Your Development Environment for Snowpark](../../snowpark/python/setup.md).

> **Note:**
>
> While not required, Snowflake recommends [Artifact Repository overview](../../udf/python/udf-python-packages.md) to import Python packages. For more information, see below.

### Using artifact repository

You can specify packages to install from the Python Package Index (PyPI) and use them with Snowpark Python stored procedures. For more information, see [Artifact Repository overview](../../udf/python/udf-python-packages.md).

### Using third-party packages from Anaconda

You can specify Anaconda packages to install when you create Snowpark Python stored procedures. To view the list of third-party packages
from Anaconda, see the [Anaconda Snowflake channel](https://repo.anaconda.com/pkgs/snowflake).
These third-party packages are built and provided by Anaconda.
You may use the Snowflake conda channel for local testing and development at no cost under the Supplemental Embedded Software Terms to Anaconda’s Terms of Service.

For limitations, see [Python stored procedure limitations](procedure-python-limitations.md).

### Getting started

Before you start using the packages provided by Anaconda inside Snowflake, you must acknowledge
the [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/).

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept the
> [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/) once for your Snowflake account. If you do not have
> access to the ORGADMIN role, see [Enabling the ORGADMIN role in an account](../../../user-guide/organization-administrators.md).

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Anaconda section, select Enable.
4. In the Anaconda Packages dialog, click the link to review the [External Offerings Terms page](https://www.snowflake.com/legal/external-offering-terms/).
5. If you agree to the terms, select Acknowledge & Continue.

If you encounter an error when attempting to accept the [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/),
it may be due to missing information in your user profile, such as a first name, last name, or email address. If you have administrator
privileges, see [Add user details to your user profile](../../../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
administrator to [update your account](../../../user-guide/admin-user-management.md).

> **Note:**
>
> If you don’t acknowledge the Snowflake [External Offerings Terms](https://www.snowflake.com/legal/external-offering-terms/) as described
> above, you can still use stored procedures, but with these limitations:
>
> * You can’t use any third-party packages from Anaconda.
> * You can still specify Snowpark Python as a package in a stored procedure, but you can’t specify a specific version.
> * You can’t use the `to_pandas` method when interacting with a `DataFrame` object.

### Displaying and using packages

You can display all available packages and their version information by querying the PACKAGES view in the Information Schema:

```python
SELECT * FROM information_schema.packages WHERE LANGUAGE = 'python';
```

For more information, see [Using third-party packages](../../udf/python/udf-python-packages.md) in the Snowflake Python UDF documentation.

## Calling your stored procedure

After creating a stored procedure, you can call it in the following ways:

* [From SQL](../stored-procedures-calling.md).
* [As part of a scheduled task](../../../user-guide/tasks-intro.md).

---
title: Writing the Python handler for a stored procedure
source: https://docs.snowflake.com/en/developer-guide/stored-procedure/python/procedure-python-writing.md
section: Developer Guide
---

# Writing the Python handler for a stored procedure

You can write Python code as the handler that executes when a stored procedure is called. This section describes the design of a
handler.

You can create a stored procedure from the handler code in several ways:

* Include the code in-line with the SQL statement that creates the procedure. Refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* Copy the code to a stage and reference it there when you create the procedure. Refer to [Keeping handler code in-line or on a stage](../../inline-or-staged.md).
* Write the code in a Python worksheet and deploy the worksheet contents to a stored procedure. Refer to
  [Creating a stored procedure from a Python worksheet](procedure-python-create-worksheet.md).

## Planning to write your stored procedure

Stored procedures run inside Snowflake, and so you must plan the code that you write with that in mind.

* Limit the amount of memory consumed. Snowflake places limits on a method in terms of the amount of memory needed.
  For guidance, refer to [Designing Handlers that Stay Within Snowflake-Imposed Constraints](../../udf-stored-procedure-constraints.md).
* Make sure that your handler method or function is thread safe.
* Follow the rules and security restrictions. Refer to [Security Practices for UDFs and Procedures](../../udf-stored-procedure-security-practices.md).
* Decide whether you want the stored procedure to run with [caller’s rights or owner’s rights](../stored-procedures-rights.md).
* Consider the snowflake-snowpark-python version used to run stored procedures. Due to limitations in the stored procedures release process,
  the snowflake-snowpark-python library available in the Python stored procedure environment is usually one version behind the publicly
  released version. Use the following SQL to find out the latest available version:

  ```sqlexample
  SELECT * FROM information_schema.packages WHERE package_name = 'snowflake-snowpark-python' ORDER BY version DESC;
  ```

## Writing the method or function

When writing the method or function for the stored procedure, note the following:

* Specify the Snowpark `Session` object as the first argument of your method or function.
  When you call your stored procedure, Snowflake automatically creates a `Session` object and passes it to your stored procedure.
  (You cannot create the `Session` object yourself.)
* For the rest of the arguments and for the return value, use the Python types that correspond to
  [Snowflake data types](../../../sql-reference-data-types.md). Snowflake supports the Python data types listed in
  [SQL-Python Data Type Mappings for Parameters and Return Types](../../udf-stored-procedure-data-type-mapping.md).
* When you run an asynchronous child job from within a procedure’s handler — such as by using
  [DataFrame.collect_nowait](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.DataFrame.collect_nowait)
  — “fire and forget” is not supported.

  In other words, if the handler issues a child query that is still running when the parent procedure job completes, the child job is
  canceled automatically.

## Handling errors

You can use the normal Python exception-handling techniques to catch errors within the procedure.

If an uncaught exception occurs inside the method, Snowflake raises an error that includes the stack trace for the exception. When
[logging of unhandled exceptions](../../logging-tracing/unhandled-exception-messages.md) is enabled, Snowflake logs data
about unhandled exceptions in an event table.

## Making dependencies available to your code

If your handler code depends on code defined outside the handler itself (such as code defined in a module) or on resource files, you can
make those dependencies available to your code by uploading them to a stage.
Refer to [Making dependencies available to your code](../../upload-dependencies.md), or for Python worksheets, refer to [Add a Python File from a Stage to a Worksheet](../../snowpark/python/python-worksheets.md).

If you create your stored procedure using SQL, use the IMPORTS clause when writing the
[CREATE PROCEDURE statement](../../../sql-reference/sql/create-procedure.md), to point to the dependency files.

## SQL Functions

Reference for all built-in Snowflake SQL functions and aggregates.

---
title: <service_name>!SPCS_CANCEL_JOB
source: https://docs.snowflake.com/en/sql-reference/functions/spcs_cancel_job.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# <service_name>!SPCS_CANCEL_JOB

Cancels a [Snowpark Container Services job](../../developer-guide/snowpark-container-services/working-with-services.md); also referred to as job service. When you cancel a job, Snowflake stops the job from running and removes the resources allocated for running the job.

See also:
:   [Run a job service](../../developer-guide/snowpark-container-services/working-with-services.md), [Working with services](../../developer-guide/snowpark-container-services/working-with-services.md)

## Syntax

```sqlsyntax
<service_name>!SPCS_CANCEL_JOB();
```

## Returns

Returns a string that indicates whether or not the job was canceled.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OPERATE | Service | To cancel the job service, you must use a role that was granted this privilege. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Cancel the `my_job` job.

```sqlexample
SELECT my_job!SPCS_CANCEL_JOB();
```

---
title: <service_name>!SPCS_GET_EVENTS
source: https://docs.snowflake.com/en/sql-reference/functions/spcs_get_events.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Snowpark Container Services)

# <service_name>!SPCS_GET_EVENTS

Returns the events that Snowflake collected for the specified service.
For more information,
see [Accessing platform events](../../developer-guide/snowpark-container-services/monitoring-services.md).

See also:
:   [Monitoring Services](../../developer-guide/snowpark-container-services/monitoring-services.md)

## Syntax

```sqlsyntax
<service_name>!SPCS_GET_EVENTS(
  [ START_TIME => <constant_expr> ],
  [ END_TIME => <constant_expr> ] )
```

## Arguments

`START_TIME => constant_expr`
:   Start time (in TIMESTAMP_LTZ format) for the time range from which to
    retrieve events. For available functions to construct data, time, and timestamp data, see [Date & time functions](../functions-date-time.md).

    If the `START_TIME` is not specified, it defaults to one day ago.

`END_TIME => constant_expr`
:   End time (in TIMESTAMP_LTZ format) for the time range from which to retrieve events.

    If END_TIME is not specified, it defaults to the current timestamp.

## Output

| Column | Type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_NTZ | Coordinated Universal Time (UTC) timestamp when Snowflake collected the event. This value maps to the TIMESTAMP column in the event table. |
| SEVERITY | VARCHAR | Severity of the event. This value maps to the `severity_text` field in the RECORD column in the event table. |
| EVENT_NAME | VARCHAR | Name of the event. This value maps to the `name` field in the RECORD column in the event table. |
| EVENT_DETAILS | OBJECT | Details about the event. This value maps to the VALUE column in the event table. |
| INSTANCE_ID | NUMBER | Identifier of the service instance if the event is related to a service instance. This value maps to the `snow.service.instance` field in the RESOURCE_ATTRIBUTES column in the event table. |
| CONTAINER_NAME | VARCHAR | Name of the container if the event is related to a container. This value maps to the `snow.service.container.name` field in the RESOURCE_ATTRIBUTES column in the event table. |
| RECORD | OBJECT | Event information in JSON format. This value maps to the RECORD column in the event table. |
| RECORD_ATTRIBUTES | OBJECT | Additional information about the event. This value maps to the RECORD_ATTRIBUTES column in the event table. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Service | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* It can take a few minutes before events show in the output.

## Examples

Retrieve the events that Snowflake recorded for the `my_test_job`
job over the past day.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_EVENTS());
```

Example output:

```output
+-------------------------+----------+-------------------------+----------------------------------------+-------------+----------------+--------------------------------------+-------------------+
| TIMESTAMP               | SEVERITY | EVENT_NAME              | EVENT_DETAILS                          | INSTANCE_ID | CONTAINER_NAME | RECORD                               | RECORD_ATTRIBUTES |
|-------------------------+----------+-------------------------+----------------------------------------+-------------+----------------+--------------------------------------+-------------------|
| 2025-06-26 00:23:40.933 | INFO     | CONTAINER.STATUS_CHANGE | {                                      |        0    | main           | {                                    | NULL              |
|                         |          |                         |   "message": "Completed successfully", |             |                |   "name": "CONTAINER.STATUS_CHANGE", |                   |
|                         |          |                         |   "status": "DONE"                     |             |                |   "severity_text": "INFO"            |                   |
|                         |          |                         | }                                      |             |                | }                                    |                   |
| 2025-06-26 00:23:35.919 | INFO     | CONTAINER.STATUS_CHANGE | {                                      |        0    | main           | {                                    | NULL              |
|                         |          |                         |   "message": "Running",                |             |                |   "name": "CONTAINER.STATUS_CHANGE", |                   |
|                         |          |                         |   "status": "READY"                    |             |                |   "severity_text": "INFO"            |                   |
|                         |          |                         | }                                      |             |                | }                                    |                   |
| 2025-06-26 00:23:34.127 | INFO     | CONTAINER.STATUS_CHANGE | {                                      |        0    | main           | {                                    | NULL              |
|                         |          |                         |   "message": "Waiting to start",       |             |                |   "name": "CONTAINER.STATUS_CHANGE", |                   |
|                         |          |                         |   "status": "PENDING"                  |             |                |   "severity_text": "INFO"            |                   |
|                         |          |                         | }                                      |             |                | }                                    |                   |
+-------------------------+----------+-------------------------+----------------------------------------+-------------+----------------+--------------------------------------+-------------------+
```

Retrieve the events that Snowflake recorded for the `my_test_job` job over the past three days.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_EVENTS(START_TIME => DATEADD('day', -3, CURRENT_TIMESTAMP())));
```

---
title: <service_name>!SPCS_GET_LOGS
source: https://docs.snowflake.com/en/sql-reference/functions/spcs_get_logs.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Snowpark Container Services)

# <service_name>!SPCS_GET_LOGS

Returns the logs that Snowflake collected from containers of the specified service. For more information, see [Publishing and accessing container logs](../../developer-guide/snowpark-container-services/monitoring-services.md).

See also:
:   [Monitoring Services](../../developer-guide/snowpark-container-services/monitoring-services.md)

## Syntax

```sqlsyntax
<service_name>!SPCS_GET_LOGS(
  [ START_TIME => <constant_expr> ],
  [ END_TIME => <constant_expr> ] )
```

## Arguments

`START_TIME => constant_expr`
:   Start time (in TIMESTAMP_LTZ format) for the time range from which to retrieve logs. For available functions to construct data, time, and timestamp data, see [Date & time functions](../functions-date-time.md).

    If the `START_TIME` isn’t specified, it defaults to 1 day ago.

`END_TIME => constant_expr`
:   End time (in TIMESTAMP_LTZ format) for the time range from which to retrieve logs.

    If END_TIME isn’t specified, it defaults to the current timestamp.

## Output

Each row in the output corresponds to one logged event in the event table.
Each line that your service outputs to `stdout` or `stderr` results in one row in the output.

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `TIMESTAMP` | TIMESTAMP_NTZ | Universal Coordinated Time (UTC) timestamp when Snowflake collected the log from the container. This value maps to the TIMESTAMP column in the event table. |
| `INSTANCE_ID` | NUMBER | ID of the job service instance. This value maps to the `snow.service.instance` field in the RESOURCE_ATTRIBUTES column in the event table. |
| `CONTAINER_NAME` | VARCHAR | Name of the container. This value maps to the `snow.service.container.name` field in the RESOURCE_ATTRIBUTES column in the event table. |
| `LOG` | VARCHAR | Log Snowflake collected from your application container. This value maps to the VALUE column in the event table. |
| `RECORD_ATTRIBUTES` | OBJECT | Addition information about the log. For example, the log stream — stderr or stdout — from where the log was collected. This value maps to the RECORD_ATTRIBUTES column in the event table. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* It can take a few minutes before your container logs show in the output.

## Examples

Retrieve the logs that Snowflake collected from containers of the `my_test_job` job over the past day.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_LOGS());
```

Example output:

```output
+-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+
| TIMESTAMP               | INSTANCE_ID | CONTAINER_NAME | LOG                                                                                                                                                                 | RECORD_ATTRIBUTES          |
|-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------|
| 2025-06-26 00:23:40.281 |           0 | main           | job-tutorial - INFO - Job finished                                                                                                                                  | {                          |
|                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
|                         |             |                |                                                                                                                                                                     | }                          |
| 2025-06-26 00:23:38.787 |           0 | main           | job-tutorial - INFO - Executing query [select current_time() as time,'hello'] and writing result to table [results]                                                 | {                          |
|                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
|                         |             |                |                                                                                                                                                                     | }                          |
| 2025-06-26 00:23:38.787 |           0 | main           | job-tutorial - INFO - Connection succeeded. Current session context: database="TUTORIAL_DB", schema="DATA_SCHEMA", warehouse="TUTORIAL_WAREHOUSE", role="TEST_ROLE" | {                          |
|                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
|                         |             |                |                                                                                                                                                                     | }                          |
| 2025-06-26 00:23:36.852 |           0 | main           | job-tutorial - INFO - Job started                                                                                                                                   | {                          |
|                         |             |                |                                                                                                                                                                     |   "log.iostream": "stdout" |
|                         |             |                |                                                                                                                                                                     | }                          |
+-------------------------+-------------+----------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------+----------------------------+
```

Retrieve the logs that Snowflake collected from containers of the `my_test_job` job over the past three days.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_LOGS(START_TIME => DATEADD('day', -3, CURRENT_TIMESTAMP())));
```

Retrieve the logs for the `my_test_job` job instance `0` in the container named `main`. As shown in the following example, if you omit the START_TIME and END_TIME arguments, the function retrieves the logs for the past day:

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_LOGS())
WHERE instance_id = 0 AND container_name = 'main';
```

---
title: <service_name>!SPCS_GET_METRICS
source: https://docs.snowflake.com/en/sql-reference/functions/spcs_get_metrics.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Snowpark Container Services)

# <service_name>!SPCS_GET_METRICS

Returns the metrics that Snowflake collected for the specified service. For more information, see [Access platform metrics](../../developer-guide/snowpark-container-services/monitoring-services.md).

See also:
:   [Monitoring Services](../../developer-guide/snowpark-container-services/monitoring-services.md)

## Syntax

```sqlsyntax
<service_name>!SPCS_GET_METRICS(
    [ START_TIME => <constant_expr> ],
    [ END_TIME => <constant_expr> ] )
```

## Arguments

`START_TIME => constant_expr`
:   Start time (in TIMESTAMP_LTZ format) for the time range from which to retrieve metrics. For available functions to construct data, time, and timestamp data, see [Date & time functions](../functions-date-time.md).

    If the `START_TIME` isn’t specified, it defaults to one day ago.

`END_TIME => constant_expr`
:   End time (in TIMESTAMP_LTZ format) for the time range from which to retrieve metrics.

    If END_TIME isn’t specified, it defaults to the current timestamp.

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `TIMESTAMP` | TIMESTAMP_NTZ | Universal Coordinated Time (UTC) timestamp when Snowflake collected the metric. |
| `METRIC_NAME` | VARCHAR | Name of the metric. |
| `VALUE` | VARCHAR | Value of the metric. |
| `UNIT` | VARCHAR | Unit of the metric returned. |
| `INSTANCE_ID` | NUMBER | Name of the service instance if the metric is related to the service instance. |
| `CONTAINER_NAME` | VARCHAR | Name of the container if the metric is related to the container. For example, a volume metric won’t have container name. |
| `RESOURCE` | VARCHAR | Hardware — for example, GPU — the metrics is about. This column isn’t populated. |
| `RECORD` | OBJECT | Key-value pairs that provide metric information. |
| `RECORD_ATTRIBUTES` | OBJECT | Key-value pairs that provide additional information about the metric. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* It can take a few minutes before metrics appear in the output.

## Examples

Retrieve the metrics that Snowflake collected for the `my_test_job` job over the past day, the default.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_METRICS());
```

Retrieve the metrics that Snowflake collected for the `my_test_job` job over the past three days.

```sqlexample
SELECT * from TABLE(mydb.myschema.my_test_job!SPCS_GET_METRICS(start_time => DATEADD('day', -3, CURRENT_TIMESTAMP())));
```

Retrieve metrics from the past day for the `spcs_get_metrics` job instance `0` in the container named `main`.

```sqlexample
SELECT * FROM TABLE(mydb.myschema.my_test_job!SPCS_GET_METRICS())
 WHERE instance_id = 0 AND container_name = 'main';
```

---
title: <service_name>!SPCS_WAIT_FOR
source: https://docs.snowflake.com/en/sql-reference/functions/spcs_wait_for.md
section: SQL Functions
---

Categories:
:   [Snowpark Container Services functions](../functions-spcs.md)

# <service_name>!SPCS_WAIT_FOR

Waits for the [Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md) to reach the specified state, with a timeout.

* When you execute an asynchronous job, use this helper function to wait for the job to complete.
* When you create a service, use this helper function to wait until the service is running.

See also:
:   [Snowpark Container Services: Working with services](../../developer-guide/snowpark-container-services/working-with-services.md)

## Syntax

```sqlsyntax
<service_name>!SPCS_WAIT_FOR( <status>, <timeout_sec> );
```

## Arguments

**Required arguments**

`'status'`
:   Status to wait for. For a list of service status values, see the output section of the [DESCRIBE SERVICE](../sql/desc-service.md) command.

`timeout_sec`
:   The maximum duration, in seconds, to wait for the specified status. If specified status isn’t reached within the timeout, the function returns an error message that includes the current service status.

## Returns

If the service doesn’t reach the specified status within the timeout or Snowflake determines that the status can never be reached, the function returns an error message that also provides the current service status. Otherwise, it returns a success message.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, MONITOR or OPERATE | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Wait for two minutes for the specified job to complete (job status is DONE).

```sqlexample
CALL my_job!spcs_wait_for('DONE', 120)
```

Wait for three minutes for the specified service to start (service status is RUNNING).

```sqlexample
CALL my_service!SPCS_WAIT_FOR('RUNNING', 180)
```

---
title: [ NOT ] BETWEEN
source: https://docs.snowflake.com/en/sql-reference/functions/between.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# [ NOT ] BETWEEN

Returns `TRUE` when the input expression (numeric or string) is within the specified lower and upper boundary.

## Syntax

```sqlsyntax
<expr> [ NOT ] BETWEEN <lower_bound> AND <upper_bound>
```

## Arguments

`expr`
:   The input expression.

`lower_bound`
:   The lower boundary.

`upper_bound`
:   The upper boundary.

## Returns

The function returns a value of type BOOLEAN.

## Usage notes

* `expr BETWEEN lower_bound AND upper_bound` is equivalent to `expr >= lower_bound AND expr <= upper_bound`.
* The data types of the argument values must be the same or [compatible](../data-type-conversion.md).

  If the function implicitly casts a value to a different data type, it might return unexpected results.

  For example, when `expr` is a TIMESTAMP value, and the `lower_bound` and `upper_bound` values
  are DATE values, the DATE values are implicitly cast to TIMESTAMP values, and the time is set to `00:00:00`. For
  the following WHERE clause, assume `timestamp_column` is a column of type TIMESTAMP in a table:

  ```sqlexample
  WHERE timestamp_column BETWEEN '2025-04-30' AND '2025-04-31'
  ```

  When the DATE values are implicitly cast, the WHERE clause is interpreted as the following:

  ```sqlexample
  WHERE timestamp_column BETWEEN '2025-04-30 00:00:00' AND '2025-04-31 00:00:00'
  ```

  With this WHERE clause, the function returns `FALSE` for virtually all `timestamp_column` values on 2025-04-31,
  which might not be intended. To avoid this specific issue, you can specify the next day for `upper_bound` when
  you call the function:

  ```sqlexample
  WHERE timestamp_column BETWEEN '2025-04-30' AND '2025-05-01'
  ```

## Collation details

The expression `A BETWEEN X AND Y` is equivalent to `A >= X AND A <= Y`. The collations used for comparing
with `X` and `Y` are independent and do not need to be identical, but both need to be compatible with the
collation of `A`.

## Examples

Here are a few simple examples of using BETWEEN with numeric and string values:

```sqlexample
SELECT 'true' WHERE 1 BETWEEN 0 AND 10;
```

```output
+--------+
| 'TRUE' |
|--------|
| true   |
+--------+
```

```sqlexample
SELECT 'true' WHERE 1.35 BETWEEN 1 AND 2;
```

```output
+--------+
| 'TRUE' |
|--------|
| true   |
+--------+
```

```sqlexample
SELECT 'true' WHERE 'the' BETWEEN 'that' AND 'then';
```

```output
+--------+
| 'TRUE' |
|--------|
| true   |
+--------+
```

The following examples use COLLATE with BETWEEN:

```sqlexample
SELECT 'm' BETWEEN COLLATE('A', 'lower') AND COLLATE('Z', 'lower');
```

```output
+-------------------------------------------------------------+
| 'M' BETWEEN COLLATE('A', 'LOWER') AND COLLATE('Z', 'LOWER') |
|-------------------------------------------------------------|
| True                                                        |
+-------------------------------------------------------------+
```

```sqlexample
SELECT COLLATE('m', 'upper') BETWEEN 'A' AND 'Z';
```

```output
+-------------------------------------------+
| COLLATE('M', 'UPPER') BETWEEN 'A' AND 'Z' |
|-------------------------------------------|
| True                                      |
+-------------------------------------------+
```

---
title: [ NOT ] EQUAL_NULL
source: https://docs.snowflake.com/en/sql-reference/functions/equal_null.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# [ NOT ] EQUAL_NULL

Compares whether two expressions are equal. The function is NULL-safe, meaning it treats NULLs as known values for comparing equality. Note that this is different from the EQUAL
[comparison operator](../operators-comparison.md) (`=`), which treats NULLs as unknown values.

See also:
:   [IS [ NOT ] DISTINCT FROM](is-distinct-from.md)

## Syntax

```sqlsyntax
[ NOT ] EQUAL_NULL( <expr1> , <expr2> )
```

## Usage notes

* The value returned depends on whether any of the inputs are NULL values:

  Returns TRUE:
  :   `EQUAL_NULL( <null> , <null> )`

  Returns FALSE:
  :   `EQUAL_NULL( <null> , <not_null> )`

      `EQUAL_NULL( <not_null> , <null> )`

  Otherwise:

  > `EQUAL_NULL(<expr1>, <expr2>)` is equivalent to `<expr1> = <expr2>`

For more details, see the examples below.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.

## Examples

Create a table with simple data:

```sqlexample
CREATE OR REPLACE TABLE x (i NUMBER);
INSERT INTO x VALUES
  (1),
  (2),
  (NULL);
```

Show the Cartesian product generated by joining the table to itself without a filter:

```sqlexample
SELECT x1.i x1_i, x2.i x2_i
  FROM x x1, x x2
  ORDER BY x1.i, x2.i;
```

```output
+------+------+
| X1_I | X2_I |
|------+------|
|    1 |    1 |
|    1 |    2 |
|    1 | NULL |
|    2 |    1 |
|    2 |    2 |
|    2 | NULL |
| NULL |    1 |
| NULL |    2 |
| NULL | NULL |
+------+------+
```

Return rows that contain only equal values for both columns:

```sqlexample
SELECT x1.i x1_i, x2.i x2_i
  FROM x x1, x x2
  WHERE x1.i = x2.i;
```

```output
+------+------+
| X1_I | X2_I |
|------+------|
|    1 |    1 |
|    2 |    2 |
+------+------+
```

Return rows that contain only equal values or NULL values for both columns:

```sqlexample
SELECT x1.i x1_i, x2.i x2_i
  FROM x x1, x x2
  WHERE EQUAL_NULL(x1.i, x2.i);
```

```output
+------+------+
| X1_I | X2_I |
|------+------|
|    1 |    1 |
|    2 |    2 |
| NULL | NULL |
+------+------+
```

Illustrate all possible outcomes for EQUAL (`=`) and NOT EQUAL (`<>`):

```sqlexample
SELECT x1.i x1_i,
       x2.i x2_i,
       x1.i = x2.i,
       IFF(x1.i = x2.i, 'Selected', 'Not') "SELECT IF X1.I=X2.I",
       x1.i <> x2.i,
       IFF(NOT(x1.i = x2.i), 'Selected', 'Not') "SELECT IF X1.I<>X2.I"
  FROM x x1, x x2;
```

```output
+------+------+-----------+---------------------+------------+----------------------+
| X1_I | X2_I | X1.I=X2.I | SELECT IF X1.I=X2.I | X1.I<>X2.I | SELECT IF X1.I<>X2.I |
|------+------+-----------+---------------------+------------+----------------------|
|    1 |    1 | True      | Selected            | False      | Not                  |
|    1 |    2 | False     | Not                 | True       | Selected             |
|    1 | NULL | NULL      | Not                 | NULL       | Not                  |
|    2 |    1 | False     | Not                 | True       | Selected             |
|    2 |    2 | True      | Selected            | False      | Not                  |
|    2 | NULL | NULL      | Not                 | NULL       | Not                  |
| NULL |    1 | NULL      | Not                 | NULL       | Not                  |
| NULL |    2 | NULL      | Not                 | NULL       | Not                  |
| NULL | NULL | NULL      | Not                 | NULL       | Not                  |
+------+------+-----------+---------------------+------------+----------------------+
```

Illustrate all possible outcomes for EQUAL_NULL and NOT EQUAL_NULL:

```sqlexample
SELECT x1.i x1_i,
       x2.i x2_i,
       EQUAL_NULL(x1.i, x2.i),
       IFF(EQUAL_NULL(x1.i, x2.i), 'Selected', 'Not') "SELECT IF EQUAL_NULL(X1.I,X2.I)",
       NOT EQUAL_NULL(x1.i, x2.i),
       IFF(NOT EQUAL_NULL(x1.i, x2.i), 'Selected', 'Not') "SELECT IF NOT(EQUAL_NULL(X1.I,X2.I))"
  FROM x x1, x x2;
```

```output
+------+------+------------------------+---------------------------------+----------------------------+--------------------------------------+
| X1_I | X2_I | EQUAL_NULL(X1.I, X2.I) | SELECT IF EQUAL_NULL(X1.I,X2.I) | NOT EQUAL_NULL(X1.I, X2.I) | SELECT IF NOT(EQUAL_NULL(X1.I,X2.I)) |
|------+------+------------------------+---------------------------------+----------------------------+--------------------------------------|
|    1 |    1 | True                   | Selected                        | False                      | Not                                  |
|    2 |    1 | False                  | Not                             | True                       | Selected                             |
| NULL |    1 | False                  | Not                             | True                       | Selected                             |
|    1 |    2 | False                  | Not                             | True                       | Selected                             |
|    2 |    2 | True                   | Selected                        | False                      | Not                                  |
| NULL |    2 | False                  | Not                             | True                       | Selected                             |
|    1 | NULL | False                  | Not                             | True                       | Selected                             |
|    2 | NULL | False                  | Not                             | True                       | Selected                             |
| NULL | NULL | True                   | Selected                        | False                      | Not                                  |
+------+------+------------------------+---------------------------------+----------------------------+--------------------------------------+
```

---
title: [ NOT ] ILIKE
source: https://docs.snowflake.com/en/sql-reference/functions/ilike.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# [ NOT ] ILIKE

Performs a case-insensitive comparison to determine whether a string matches or does not match a specified pattern.
For case-sensitive matching, use LIKE instead.

LIKE, ILIKE, and RLIKE all perform similar operations. However, RLIKE uses POSIX ERE (Extended Regular Expression) syntax
instead of the SQL pattern syntax used by LIKE and ILIKE.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [[ NOT ] LIKE](like.md) , [[ NOT ] RLIKE](rlike.md)

## Syntax

```sqlsyntax
<subject> [ NOT ] ILIKE <pattern> [ ESCAPE <escape> ]

ILIKE( <subject> , <pattern> [ , <escape> ] )
```

## Arguments

**Required:**

`subject`
:   Subject to match.

`pattern`
:   Pattern to match.

**Optional:**

`escape`
:   Character(s) inserted in front of a wildcard character to indicate that the wildcard should
    be interpreted as a regular character and not as a wildcard.

## Returns

Returns a BOOLEAN or NULL.

* When ILIKE is specified, the value is TRUE if there is a match. Otherwise, returns FALSE.
* When NOT ILIKE is specified, the value is TRUE if there is no match. Otherwise, returns FALSE.
* When either ILIKE or NOT ILIKE is specified, returns NULL if any argument is NULL.

## Usage notes

* To include single quotes or other special characters in pattern matching, you can use a
  [backslash escape sequence](../data-types-text.md).
* NULL does not match NULL. In other words, if the subject is NULL and the pattern is NULL,
  that is not considered a match.
* SQL wildcards are supported in `pattern`:

  + An underscore (`_`) matches any single character.
  + A percent sign (`%`) matches any sequence of zero or more characters.
* Wildcards in `pattern` include newline characters (`n`) in `subject` as matches.
* Pattern matching covers the entire string. To match a sequence anywhere within a string, start and end the pattern with `%`.
* There is no default escape character.

* If you use the backslash as an escape character, then you must escape the backslash in both the
  expression and the ESCAPE clause. For example, the following command specifies that the escape character is
  the backslash, and then uses that escape character to search for `%` as a literal (without the escape character,
  the `%` would be treated as a wildcard):

  > ```sqlexample
  > 'SOMETHING%' ILIKE '%\\%%' ESCAPE '\\';
  > ```

  For examples of using escape characters, see the examples for ILIKE.
  For more examples of using escape characters, and in particular the backslash as an escape character,
  see [the examples for LIKE](like.md).

* If you require more complex pattern matching than this function supports, you can use a
  [regular expression function](../functions-regexp.md) instead.

## Collation details

Only the `upper`, `lower`, and `trim` collation specifications are supported. Combinations with `upper`,
`lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale
combinations (for example, `en-upper`).

## Examples

Create a table that contains some strings:

```sqlexample
CREATE OR REPLACE TABLE ilike_ex(name VARCHAR(20));
INSERT INTO ilike_ex VALUES
  ('John  Dddoe'),
  ('Joe   Doe'),
  ('John_down'),
  ('Joe down'),
  (null);
```

The following examples show the use of `ILIKE`, `NOT ILIKE`, and the wildcard
character `%`:

```sqlexample
SELECT *
  FROM ilike_ex
  WHERE name ILIKE '%j%h%do%'
  ORDER BY 1;
```

```output
+-------------+
| NAME        |
|-------------|
| John  Dddoe |
| John_down   |
+-------------+
```

```sqlexample
SELECT *
  FROM ilike_ex
  WHERE name NOT ILIKE '%j%h%do%'
  ORDER BY 1;
```

```output
+-----------+
| NAME      |
|-----------|
| Joe   Doe |
| Joe down  |
+-----------+
```

```sqlexample
SELECT *
  FROM ilike_ex
  WHERE name ILIKE '%j%h%^_do%' ESCAPE '^'
  ORDER BY 1;
```

```output
+-----------+
| NAME      |
|-----------|
| John_down |
+-----------+
```

---
title: [ NOT ] IN
source: https://docs.snowflake.com/en/sql-reference/functions/in.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# [ NOT ] IN

Tests whether its argument is or is not one of the members of an explicit list or the result of a subquery.

> **Note:**
>
> In subquery form, IN is equivalent to `= ANY` and NOT IN is equivalent to `<> ALL`.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

## Syntax

To compare individual values:

```sqlsyntax
<value> [ NOT ] IN ( <value_1> [ , <value_2> ...  ] )
```

To compare *row constructors* (parenthesized lists of values):

```sqlsyntax
( <value_A> [, <value_B> ... ] ) [ NOT ] IN (  ( <value_1> [ , <value_2> ... ] )  [ , ( <value_3> [ , <value_4> ... ] )  ...  ]  )
```

To compare a value to the values returned by a subquery:

```sqlsyntax
<value> [ NOT ] IN ( <subquery> )
```

## Parameters

`value`
:   The value for which to search.

`value_A`, `value_B`
:   The elements of a row constructor for which to search.

    Ensure that each value on the right of IN (for example, `(value3, value4)`) has the same number of elements as the value on the
    left of IN (for example, `(value_A, value_B)`).

`value_#`
:   A value to which `value` should be compared.

    If the values to compare to are row constructors, then each `value_#` is an individual element of a row constructor.

`subquery`
:   A subquery that returns a list of values to which `value` can be compared.

## Usage notes

* As in most contexts, NULL is not equal to NULL. If `value` is NULL, then the
  return value of the function is NULL, whether or not the list or subquery
  contains NULL. See Using NULL.
* Syntactically, IN is treated as an operator rather than a function. This example shows the difference between
  using IN as an operator and calling `f()` as a function:

  ```sqlexample
  SELECT
      f(a, b),
      x IN (y, z) ...
  ```

  You *can’t* use function syntax with IN. For example, you can’t rewrite the preceding example as:

  ```sqlexample
  SELECT
      f(a, b),
      IN(x, (y, z)) ...
  ```
* IN is also considered a [subquery operator](../operators-subquery.md).
* In a query that uses IN, you can expand an [array](../data-types-semistructured.md) into
  a list of individual values by using the spread operator (`**`). For more information and
  examples, see [Expansion operators](../operators-expansion.md).

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The following examples use the IN function.

### Using IN with simple literals

The following examples show how to use IN and NOT IN with simple literals:

```sqlexample
SELECT 1 IN (1, 2, 3) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| True   |
+--------+
```

```sqlexample
SELECT 4 NOT IN (1, 2, 3) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| True   |
+--------+
```

### Using IN with a subquery

These example shows how to use IN in a subquery.

```sqlexample
SELECT 'a' IN (
  SELECT column1 FROM VALUES ('b'), ('c'), ('d')
  ) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| False  |
+--------+
```

### Using IN with a table

These examples show how to use IN with a table. The statement below creates the table used in the examples.

```sqlexample
CREATE OR REPLACE TABLE in_function_demo (
  col_1 INTEGER,
  col_2 INTEGER,
  col_3 INTEGER);

INSERT INTO in_function_demo (col_1, col_2, col_3) VALUES
  (1, 1, 1),
  (1, 2, 3),
  (4, 5, NULL);
```

This example shows how to use IN with a single column of a table:

```sqlexample
SELECT col_1, col_2, col_3
  FROM in_function_demo
  WHERE (col_1) IN (1, 10, 100, 1000)
  ORDER BY col_1, col_2, col_3;
```

```output
+-------+-------+-------+
| COL_1 | COL_2 | COL_3 |
|-------+-------+-------|
|     1 |     1 |     1 |
|     1 |     2 |     3 |
+-------+-------+-------+
```

This example shows how to use IN with multiple columns of a table:

```sqlexample
SELECT col_1, col_2, col_3
  FROM in_function_demo
  WHERE (col_1, col_2, col_3) IN (
    (1,2,3),
    (4,5,6));
```

```output
+-------+-------+-------+
| COL_1 | COL_2 | COL_3 |
|-------+-------+-------|
|     1 |     2 |     3 |
+-------+-------+-------+
```

This example shows how to use IN with a subquery that reads multiple columns of a table:

```sqlexample
SELECT (1, 2, 3) IN (
  SELECT col_1, col_2, col_3 FROM in_function_demo
  ) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| True   |
+--------+
```

### Using NULL

Remember that NULL != NULL. IN and NOT IN lists that contain comparisons with NULL (including equality conditions) might produce unexpected
results because NULL represents an unknown value. Comparisons with NULL do not return TRUE or FALSE; they return NULL. See also
[Ternary logic](../ternary-logic.md).

For example, the following query returns NULL, not TRUE, because SQL cannot determine whether NULL equals any value, including another NULL.

```sqlexample
SELECT NULL IN (1, 2, NULL) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| NULL   |
+--------+
```

Note that if you change the query to select `1`, not NULL, it returns TRUE:

```sqlexample
SELECT 1 IN (1, 2, NULL) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| True   |
+--------+
```

In this case, the result is TRUE because `1` does have a match in the IN list. The fact that NULL also exists
in the IN list doesn’t affect the result.

Similarly, NOT IN comparisons with NULL also return NULL if any value in the list is NULL.

```sqlexample
SELECT 1 NOT IN (1, 2, NULL) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| NULL  |
+--------+
```

The same behavior is true for the following query, where the set of values `4, 5, NULL` *does not match* either `4, 5, NULL` or `7, 8, 9`:

```sqlexample
SELECT (4, 5, NULL) IN ( (4, 5, NULL), (7, 8, 9) ) AS RESULT;
```

The following example shows the same behavior with NULL comparisions but uses a subquery to define the IN list values that are compared:

```sqlexample
CREATE OR REPLACE TABLE in_list_table (
  val1 INTEGER,
  val2 INTEGER,
  val3 INTEGER
);

INSERT INTO in_list_table VALUES (1, 10, NULL), (2, 20, NULL), (NULL, NULL, NULL);

SELECT 1 IN (SELECT val1 FROM in_list_table) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| True   |
+--------+
```

```sqlexample
SELECT NULL IN (SELECT val1 FROM in_list_table) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| NULL   |
+--------+
```

```sqlexample
SELECT 3 IN (SELECT val1 FROM in_list_table) AS RESULT;
```

```output
+--------+
| RESULT |
|--------|
| NULL   |
+--------+
```

---
title: [ NOT ] LIKE
source: https://docs.snowflake.com/en/sql-reference/functions/like.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# [ NOT ] LIKE

Performs a case-sensitive comparison to determine whether a string matches or does not match a specified pattern.
For case-insensitive matching, use ILIKE instead.

LIKE, ILIKE, and RLIKE all perform similar operations. However, RLIKE uses POSIX ERE (Extended Regular Expression) syntax
instead of the SQL pattern syntax used by LIKE and ILIKE.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [[ NOT ] ILIKE](ilike.md) , [[ NOT ] RLIKE](rlike.md) , [LIKE ALL](like_all.md), [LIKE ANY](like_any.md)

## Syntax

```sqlsyntax
<subject> [ NOT ] LIKE <pattern> [ ESCAPE <escape> ]

LIKE( <subject> , <pattern> [ , <escape> ] )
```

## Arguments

**Required:**

`subject`
:   Subject to match. This is typically a VARCHAR, although some other data
    types can be used.

`pattern`
:   Pattern to match. This is typically a VARCHAR, although some other data
    types can be used.

**Optional:**

`escape`
:   Character(s) inserted in front of a wildcard character to indicate that the wildcard should
    be interpreted as a regular character and not as a wildcard.

## Returns

Returns a BOOLEAN or NULL.

* When LIKE is specified, the value is TRUE if there is a match. Otherwise, returns FALSE.
* When NOT LIKE is specified, the value is TRUE if there is no match. Otherwise, returns FALSE.
* When either LIKE or NOT LIKE is specified, returns NULL if any argument is NULL.

## Usage notes

* To include single quotes or other special characters in pattern matching, you can use a
  [backslash escape sequence](../data-types-text.md).
* NULL does not match NULL. In other words, if the subject is NULL and the pattern is NULL,
  that is not considered a match.
* SQL wildcards are supported in `pattern`:

  + An underscore (`_`) matches any single character.
  + A percent sign (`%`) matches any sequence of zero or more characters.
* Wildcards in `pattern` include newline characters (`n`) in `subject` as matches.
* Pattern matching covers the entire string. To match a sequence anywhere within a string, start and end the pattern with `%`.
* There is no default escape character.

* If you use the backslash as an escape character, then you must escape the backslash in both the
  expression and the ESCAPE clause. For example, the following command specifies that the escape character is
  the backslash, and then uses that escape character to search for `%` as a literal (without the escape character,
  the `%` would be treated as a wildcard):

  > ```sqlexample
  > 'SOMETHING%' LIKE '%\\%%' ESCAPE '\\';
  > ```

  For examples of using escape characters, and in particular the backslash as an escape character, see
  Examples.

* If you require more complex pattern matching than this function supports, you can use a
  [regular expression function](../functions-regexp.md) instead.

## Collation details

Only the `upper`, `lower`, and `trim` collation specifications are supported. Combinations with `upper`,
`lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale
combinations (for example, `en-upper`).

## Examples

Create a table that contains some strings:

```sqlexample
CREATE OR REPLACE TABLE like_ex(name VARCHAR(20));
INSERT INTO like_ex VALUES
  ('John  Dddoe'),
  ('John \'alias\' Doe'),
  ('Joe   Doe'),
  ('John_down'),
  ('Joe down'),
  ('Elaine'),
  (''),    -- empty string
  (null);
```

The following examples show the use of `LIKE`, `NOT LIKE`, and the wildcard
character `%`:

```sqlexample
SELECT name
  FROM like_ex
  WHERE name LIKE '%Jo%oe%'
  ORDER BY name;
```

```output
+------------------+
| NAME             |
|------------------|
| Joe   Doe        |
| John  Dddoe      |
| John 'alias' Doe |
+------------------+
```

```sqlexample
SELECT name
  FROM like_ex
  WHERE name NOT LIKE '%Jo%oe%'
  ORDER BY name;
```

```output
+-----------+
| NAME      |
|-----------|
|           |
| Elaine    |
| Joe down  |
| John_down |
+-----------+
```

```sqlexample
SELECT name
  FROM like_ex
  WHERE name NOT LIKE 'John%'
  ORDER BY name;
```

```output
+-----------+
| NAME      |
|-----------|
|           |
| Elaine    |
| Joe   Doe |
| Joe down  |
+-----------+
```

```sqlexample
SELECT name
  FROM like_ex
  WHERE name NOT LIKE ''
  ORDER BY name;
```

```output
+------------------+
| NAME             |
|------------------|
| Elaine           |
| Joe   Doe        |
| Joe down         |
| John  Dddoe      |
| John 'alias' Doe |
| John_down        |
+------------------+
```

The following example uses a backslash to escape a single quote so that it can be found in pattern matching:

```sqlexample
SELECT name
  FROM like_ex
  WHERE name LIKE '%\'%'
  ORDER BY name;
```

```output
+------------------+
| NAME             |
|------------------|
| John 'alias' Doe |
+------------------+
```

The following examples use an ESCAPE clause:

```sqlexample
SELECT name
  FROM like_ex
  WHERE name LIKE '%J%h%^_do%' ESCAPE '^'
  ORDER BY name;
```

```output
+-----------+
| NAME      |
|-----------|
| John_down |
+-----------+
```

Insert more rows into the `like_ex` table:

```sqlexample
INSERT INTO like_ex (name) VALUES
  ('100 times'),
  ('1000 times'),
  ('100%');
```

Without the escape character, the percent sign (`%`) is treated as a wildcard:

```sqlexample
SELECT * FROM like_ex WHERE name LIKE '100%'
  ORDER BY 1;
```

```output
+------------+
| NAME       |
|------------|
| 100 times  |
| 100%       |
| 1000 times |
+------------+
```

With the escape character, the percent sign (`%`) is treated as a literal:

```sqlexample
SELECT * FROM like_ex WHERE name LIKE '100^%' ESCAPE '^'
  ORDER BY 1;
```

```output
+------+
| NAME |
|------|
| 100% |
+------+
```

The following example uses an ESCAPE clause in which the backslash is the escape character. Note that the backslash
itself must be escaped in both the ESCAPE clause and in the expression:

```sqlexample
SELECT * FROM like_ex WHERE name LIKE '100\\%' ESCAPE '\\'
  ORDER BY 1;
```

```output
+------+
| NAME |
|------|
| 100% |
+------+
```

---
title: [ NOT ] REGEXP
source: https://docs.snowflake.com/en/sql-reference/functions/regexp.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# [ NOT ] REGEXP

Performs a comparison to determine whether a string matches or does not match a specified pattern. Both inputs
must be text expressions.

REGEXP is similar to the [[ NOT ] LIKE](like.md) function, but with POSIX extended regular expressions instead of SQL LIKE pattern syntax.
It supports more complex matching conditions than LIKE.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

Aliases:
:   [[ NOT ] RLIKE](rlike.md) (2nd syntax)

See also: [String functions (regular expressions)](../functions-regexp.md)

> [REGEXP_COUNT](regexp_count.md) , [REGEXP_INSTR](regexp_instr.md) , [REGEXP_REPLACE](regexp_replace.md) , [REGEXP_SUBSTR](regexp_substr.md)
>
> [[ NOT ] ILIKE](ilike.md) , [[ NOT ] LIKE](like.md)

## Syntax

```sqlsyntax
<subject> [ NOT ] REGEXP <pattern>
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

## Returns

Returns a BOOLEAN or NULL.

* When REGEXP is specified, the value is TRUE if there is a match. Otherwise, returns FALSE.
* When NOT REGEXP is specified, the value is TRUE if there is no match. Otherwise, returns FALSE.
* When either REGEXP or NOT REGEXP is specified, returns NULL if any argument is NULL.

## Usage Notes

* The function implicitly anchors a pattern at both ends (for example, `''` automatically becomes `'^$'`, and `'ABC'`
  automatically becomes `'^ABC$'`). For example, to match any string starting with `ABC`, the pattern is `'ABC.*'`.
* The backslash character (`\`) is the escape character. For more information, see [Specifying regular expressions in single-quoted string constants](../functions-regexp.md).
* For more usage notes, see the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation Details

Arguments with collation specifications currently aren’t supported.

## Examples

The example below shows how to use REGEXP with a simple wildcard expression:

Create a table and load data:

```sqlexample
CREATE OR REPLACE TABLE strings (v VARCHAR(50));
INSERT INTO strings (v) VALUES
  ('San Francisco'),
  ('San Jose'),
  ('Santa Clara'),
  ('Sacramento');
```

Use wildcards to search for a pattern:

```sqlexample
SELECT v
  FROM strings
  WHERE v REGEXP 'San* [fF].*'
  ORDER BY v;
```

```output
+---------------+
| V             |
|---------------|
| San Francisco |
+---------------+
```

The backslash character `\` is the escape character in regular expressions, and specifies special
characters or groups of characters. For example, `\s` is the regular expression for whitespace.

The Snowflake string parser, which parses literal strings, also treats backslash as an escape character. For
example, a backslash is used as part of the sequence of characters that specifies a tab character. Thus to create a
string that contains a single backslash, you must specify two backslashes. For example, compare the string in
the input statement below with the corresponding string in the output:

```sqlexample
INSERT INTO strings (v) VALUES
  ('Contains embedded single \\backslash');
```

```sqlexample
SELECT *
  FROM strings
  ORDER BY v;
```

```output
+-------------------------------------+
| V                                   |
|-------------------------------------|
| Contains embedded single \backslash |
| Sacramento                          |
| San Francisco                       |
| San Jose                            |
| Santa Clara                         |
+-------------------------------------+
```

This example shows how to search for strings that start with `San`, where `San` is a complete word (for example, not
part of `Santa`). `\b` is the escape sequence for a word boundary.

```sqlexample
SELECT v, v REGEXP 'San\\b.*' AS matches
  FROM strings
  ORDER BY v;
```

```output
+-------------------------------------+---------+
| V                                   | MATCHES |
|-------------------------------------+---------|
| Contains embedded single \backslash | False   |
| Sacramento                          | False   |
| San Francisco                       | True    |
| San Jose                            | True    |
| Santa Clara                         | False   |
+-------------------------------------+---------+
```

This example shows how to search for a blank followed by a backslash. Note that the single backslash to search for
is represented by four backslashes below; for REGEXP to look for a literal backslash, that backslash must be
escaped, so you need two backslashes. The string parser requires that each of those backslashes be escaped, so the
expression contains four backslashes to represent the one backslash that the expression is searching for:

```sqlexample
SELECT v, v REGEXP '.*\\s\\\\.*' AS matches
  FROM strings
  ORDER BY v;
```

```output
+-------------------------------------+---------+
| V                                   | MATCHES |
|-------------------------------------+---------|
| Contains embedded single \backslash | True    |
| Sacramento                          | False   |
| San Francisco                       | False   |
| San Jose                            | False   |
| Santa Clara                         | False   |
+-------------------------------------+---------+
```

The following example is the same as the preceding example, except that it uses `$$` as a string delimiter to tell the
string parser that the string is a literal and that backslashes should not be interpreted as escape sequences. (The
backslashes are still interpreted as escape sequences by REGEXP.)

```sqlexample
SELECT v, v REGEXP $$.*\s\\.*$$ AS MATCHES
  FROM strings
  ORDER BY v;
```

```output
+-------------------------------------+---------+
| V                                   | MATCHES |
|-------------------------------------+---------|
| Contains embedded single \backslash | True    |
| Sacramento                          | False   |
| San Francisco                       | False   |
| San Jose                            | False   |
| Santa Clara                         | False   |
+-------------------------------------+---------+
```

---
title: [ NOT ] RLIKE
source: https://docs.snowflake.com/en/sql-reference/functions/rlike.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# [ NOT ] RLIKE

Performs a comparison to determine whether a string matches or does not match a specified pattern. Both inputs must be text expressions.

RLIKE is similar to the [[ NOT ] LIKE](like.md) function, but with POSIX extended regular expressions instead of SQL LIKE pattern syntax.
It supports more complex matching conditions than LIKE.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

Aliases:
:   [[ NOT ] REGEXP](regexp.md) (2nd syntax) , [REGEXP_LIKE](regexp_like.md) (1st syntax)

See also: [String functions (regular expressions)](../functions-regexp.md)

> [REGEXP_COUNT](regexp_count.md) , [REGEXP_INSTR](regexp_instr.md) , [REGEXP_REPLACE](regexp_replace.md) , [REGEXP_SUBSTR](regexp_substr.md) , [REGEXP_SUBSTR_ALL](regexp_substr_all.md)
>
> [[ NOT ] ILIKE](ilike.md) , [[ NOT ] LIKE](like.md)

## Syntax

```sqlsyntax
-- 1st syntax
RLIKE( <subject> , <pattern> [ , <parameters> ] )

-- 2nd syntax
<subject> [ NOT ] RLIKE <pattern>
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

## Returns

Returns a BOOLEAN or NULL.

* When RLIKE is specified, the value is TRUE if there is a match. Otherwise, returns FALSE.
* When NOT RLIKE is specified, the value is TRUE if there is no match. Otherwise, returns FALSE.
* When either RLIKE or NOT RLIKE is specified, returns NULL if any argument is NULL.

## Usage Notes

* The function implicitly anchors a pattern at both ends (for example, `''` automatically becomes `'^$'`, and `'ABC'`
  automatically becomes `'^ABC$'`). For example, to match any string starting with `ABC`, the pattern is `'ABC.*'`.
* The backslash character (`\`) is the escape character. For more information, see [Specifying regular expressions in single-quoted string constants](../functions-regexp.md).
* For more usage notes, see the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation Details

Arguments with collation specifications currently aren’t supported.

## Examples

Run the following commands to set up the data for the examples in this topic:

```sqlexample
CREATE OR REPLACE TABLE rlike_ex(city VARCHAR(20));
INSERT INTO rlike_ex VALUES ('Sacramento'), ('San Francisco'), ('San Jose'), (null);
```

### Examples that use the first syntax

The following examples perform case-insensitive pattern matching with wildcards:

```sqlexample
SELECT * FROM rlike_ex WHERE RLIKE(city, 'san.*', 'i');
```

```output
+---------------+
| CITY          |
|---------------|
| San Francisco |
| San Jose      |
+---------------+
```

```sqlexample
SELECT * FROM rlike_ex WHERE NOT RLIKE(city, 'san.*', 'i');
```

```output
+------------+
| CITY       |
|------------|
| Sacramento |
+------------+
```

The following examples determine if a string matches the format of a phone number and an email address.
In these examples, the regular expressions are specified in [dollar-quoted strings](../data-types-text.md)
to avoid escaping the backslashes in the regular expression.

```sqlexample
SELECT RLIKE('800-456-7891',
             $$[2-9]\d{2}-\d{3}-\d{4}$$) AS matches_phone_number;
```

```output
+----------------------+
| MATCHES_PHONE_NUMBER |
|----------------------|
| True                 |
+----------------------+
```

```sqlexample
SELECT RLIKE('jsmith@email.com',
             $$\w+@[a-zA-Z_]+\.[a-zA-Z]{2,3}$$) AS matches_email_address;
```

```output
+-----------------------+
| MATCHES_EMAIL_ADDRESS |
|-----------------------|
| True                  |
+-----------------------+
```

The following examples perform the same matches but use
[single-quoted string constants](../data-types-text.md) to specify the regular expressions.

Because the example uses single-quoted string constants,
[each backslash must be escaped with another backslash](../functions-regexp.md).

```sqlexample
SELECT RLIKE('800-456-7891',
             '[2-9]\\d{2}-\\d{3}-\\d{4}') AS matches_phone_number;
```

```output
+----------------------+
| MATCHES_PHONE_NUMBER |
|----------------------|
| True                 |
+----------------------+
```

```sqlexample
SELECT RLIKE('jsmith@email.com',
             '\\w+@[a-zA-Z_]+\\.[a-zA-Z]{2,3}') AS matches_email_address;
```

```output
+-----------------------+
| MATCHES_EMAIL_ADDRESS |
|-----------------------|
| True                  |
+-----------------------+
```

Alternatively, rewrite the statements and avoid sequences that rely on the backslash character.

```sqlexample
SELECT RLIKE('800-456-7891',
             '[2-9][0-9]{2}-[0-9]{3}-[0-9]{4}') AS matches_phone_number;
```

```output
+----------------------+
| MATCHES_PHONE_NUMBER |
|----------------------|
| True                 |
+----------------------+
```

```sqlexample
SELECT RLIKE('jsmith@email.com',
             '[a-zA-Z_]+@[a-zA-Z_]+\\.[a-zA-Z]{2,3}') AS matches_email_address;
```

```output
+-----------------------+
| MATCHES_EMAIL_ADDRESS |
|-----------------------|
| True                  |
+-----------------------+
```

### Examples that use the second syntax

The following example performs case-insensitive pattern matching with wildcards:

```sqlexample
SELECT * FROM rlike_ex WHERE city RLIKE 'San.* [fF].*';
```

```output
+---------------+
| CITY          |
|---------------|
| San Francisco |
+---------------+
```

### Additional examples

For additional examples of regular expressions, see [[ NOT ] REGEXP](regexp.md).

---
title: ABS
source: https://docs.snowflake.com/en/sql-reference/functions/abs.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# ABS

Returns the absolute value of a numeric expression.

## Syntax

```sqlsyntax
ABS( <num_expr> )
```

## Examples

```sqlexample
SELECT column1, abs(column1)
    FROM (values (0), (1), (-2), (3.5), (-4.5), (null));
+---------+--------------+
| COLUMN1 | ABS(COLUMN1) |
|---------+--------------|
|     0.0 |          0.0 |
|     1.0 |          1.0 |
|    -2.0 |          2.0 |
|     3.5 |          3.5 |
|    -4.5 |          4.5 |
|    NULL |         NULL |
+---------+--------------+
```

---
title: ACCEPTED_VALUES (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_accepted_values.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# ACCEPTED_VALUES (system data metric function)

Returns the number of records where the value of a column does *not* match a Boolean expression.

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.ACCEPTED_VALUES ON ( <column>, <lambda-expression> )
```

## Arguments

`column`
:   Specifies the column that contains values that are compared to the Boolean expression in `lambda-expression`.

`lambda-expression`
:   Specifies a lambda expression consisting of the following syntax: `column -> expression`.

    The function returns the number of records where the value of `column` doesn’t match the Boolean expression. This expression can
    use the following operations and functions:

    * [Comparison operators](../operators-comparison.md)
    * [Logical operators](../operators-logical.md)
    * [[ NOT ] LIKE](like.md)
    * [[ NOT ] IN](in.md)
    * [IS [ NOT ] NULL](is-null.md)

    The `column` in the lambda expression always matches the `column` argument.

## Allowed data types

The column specified in the `column` and `lambda-expression` arguments can contain any of the following data types:

* DATE
* FLOAT
* NUMBER
* TIMESTAMP_LTZ
* TIMESTAMP_NTZ
* TIMESTAMP_TZ
* VARCHAR

## Returns

The function returns a NUMBER value.

## Usage notes

* You can’t call this function directly. To learn how to associate the function with a table or view so it
  runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

  You can use the [SYSTEM$DATA_METRIC_SCAN](system_data_metric_scan.md) function to run the ACCEPTED_VALUES function against a table without
  associating it.
* You cannot associate this function with the same column more than once.
* Renaming a column that is specified in the ACCEPTED_VALUES function breaks the association between the function and the column’s table or
  view. If you rename the column, you must re-associate the function with the table or view.

## Examples

Associate the function with table `t1` so it returns the number of records where the value of the column `age` is *not* equal to five.

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.ACCEPTED_VALUES ON (age, age -> age = 5);
```

Associate the function with view `order_details` so it returns the number of records where the value of column `order_status` is *not*
in the list of strings `Pending`, `Dispatched`, and `Delivered`.

```sqlexample
ALTER VIEW order_details
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.ACCEPTED_VALUES ON (
    order_status,
    order_status -> order_status IN ('Pending', 'Dispatched', 'Delivered'));
```

---
title: ACOS
source: https://docs.snowflake.com/en/sql-reference/functions/acos.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ACOS

Computes the inverse cosine (arc cosine) of its input; the result is a number in the interval `[0, pi]`.

## Syntax

```sqlsyntax
ACOS( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. Must be greater than or equal to -1.0 and
    less than or equal to +1.0. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

Returns the arc cosine in radians (not degrees) as a value in the range `[0, pi]`.

## Examples

```sqlexample
SELECT ACOS(0), ACOS(0.5), ACOS(1);
```

```output
+-------------+-------------+---------+
|     ACOS(0) |   ACOS(0.5) | ACOS(1) |
|-------------+-------------+---------|
| 1.570796327 | 1.047197551 |       0 |
+-------------+-------------+---------+
```

---
title: ACOSH
source: https://docs.snowflake.com/en/sql-reference/functions/acosh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ACOSH

Computes the inverse (arc) hyperbolic cosine of its input.

## Syntax

```sqlsyntax
ACOSH( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. Must be greater than or equal to 1.0.
    The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT ACOSH(2.352409615);
```

```output
+--------------------+
| ACOSH(2.352409615) |
|--------------------|
|                1.5 |
+--------------------+
```

---
title: ADD_MONTHS
source: https://docs.snowflake.com/en/sql-reference/functions/add_months.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# ADD_MONTHS

Adds or subtracts a specified number of months to a date or timestamp, preserving the end-of-month information.

## Syntax

```sqlsyntax
ADD_MONTHS( <date_or_timestamp_expr> , <num_months_expr> )
```

## Arguments

**Required:**

`date_or_timestamp_expr`
:   This is the date or timestamp expression to which you want to add
    a specified number of months.

`num_months_expr`
:   This is the number of months you want to add. This should be an
    integer. It may be positive or negative. If the value is a
    non-integer numeric value (for example, FLOAT) the value will be
    rounded to the nearest integer.

## Returns

The data type of the returned value is the same as the data type of the
first parameter. For example, if the input is a `DATE`, then the
output is a `DATE`. If the input is a `TIMESTAMP_NTZ`, then the
output is a `TIMESTAMP_NTZ`.

## Usage notes

* ADD_MONTHS returns slightly different results than [DATEADD](dateadd.md) used with a `MONTH` component:

  + For both ADD_MONTHS and [DATEADD](dateadd.md), if the result month has fewer days than the original day, the result day of the month is the last day of the result month.
  + For ADD_MONTHS only, if the original day is the last day of the month, the result day of month will be the last day of the result month.
* `num_months_expr` can be a positive or negative integer to either add or subtract months, respectively.

## Examples

Add 2 months to a date and cast the date to a timestamp with no time zone:

> ```sqlexample
> SELECT ADD_MONTHS('2016-05-15'::timestamp_ntz, 2) AS RESULT;
> +-------------------------+
> | RESULT                  |
> |-------------------------|
> | 2016-07-15 00:00:00.000 |
> +-------------------------+
> ```

Demonstrate preservation of end-of-month information:

> * Add one month to the last day of February 2016 (a leap year).
> * Subtract one month from the last day of May 2016.
>
>   > ```sqlexample
>   > SELECT ADD_MONTHS('2016-02-29'::date, 1) AS RESULT;
>   > +------------+
>   > | RESULT     |
>   > |------------|
>   > | 2016-03-31 |
>   > +------------+
>   > ```
>   >
>   > ```sqlexample
>   > SELECT ADD_MONTHS('2016-05-31'::date, -1) AS RESULT;
>   > +------------+
>   > | RESULT     |
>   > |------------|
>   > | 2016-04-30 |
>   > +------------+
>   > ```

---
title: AGENT_RUN (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/agent_run-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AGENT_RUN (SNOWFLAKE.CORTEX)

Runs a [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md) without an agent object and returns the response as JSON.

You can use this function to interact with Cortex Agents directly without first creating an agent object. You provide the configuration, including the orchestration model and tools, in the request body.

> **Note:**
>
> `SNOWFLAKE.CORTEX.AGENT_RUN` is a utility wrapper around the [Cortex Agents Run REST API](../../user-guide/snowflake-cortex/cortex-agents-run.md).
> For most application integrations, Snowflake recommends calling the **streaming REST API** directly.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.AGENT_RUN( <request_body> )
```

## Arguments

`request_body`
:   JSON request body to send to the agent. This value must be a string (for example, a `$$...$$` literal).

    The following fields are supported in the request body:

    | Field | Type | Description |
    | --- | --- | --- |
    | `thread_id` | integer | The thread ID for the conversation. If thread_id is used, then parent_message_id must be passed as well. |
    | `parent_message_id` | integer | The ID of the parent message in the thread. If this is the first message, parent_message_id should be 0. |
    | `messages` | array of [Message](../../user-guide/snowflake-cortex/cortex-agents-run.md) | If thread_id and parent_message_id are passed in the request, messages includes the current user message in the conversation. Else, messages includes the conversation history and the current message. Messages contains both user queries and assistant responses in chronological order. |
    | `stream` | boolean | Whether to return a streaming response (`text/event-stream`) or a non-streaming JSON response (`application/json`). If true, the response will be streamed as Server-Sent Events. If false, the response will be returned as JSON. |
    | `tool_choice` | [ToolChoice](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Configures how the agent should select and use tools during the interaction. Controls whether tool use is automatic, required, or whether specific tools should be used. |
    | `models` | [ModelConfig](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
    | `instructions` | [AgentInstructions](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
    | `orchestration` | [OrchestrationConfig](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Orchestration configuration, including budget constraints (e.g., seconds, tokens). |
    | `tools` | array of [Tool](../../user-guide/snowflake-cortex/cortex-agents-run.md) | List of tools available for the agent to use. Each tool includes a tool_spec with type, name, description, and input schema. Tools may have a corresponding configuration in tool_resources. |
    | `tool_resources` | map of [ToolResource](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

    **Example**

    ```json
    {
      "thread_id": 0,
      "parent_message_id": 0,
      "messages": [
        {
          "role": "user",
          "content": [
            {
              "type": "text",
              "text": "What is the total revenue for 2023?"
            }
          ]
        }
      ],
      "stream": false,
      "tool_choice": {
        "type": "auto",
        "name": [
          "analyst_tool",
          "search_tool"
        ]
      },
      "models": {
        "orchestration": "claude-4-sonnet"
      },
      "instructions": {
        "response": "You will respond in a friendly but concise manner",
        "orchestration": "For any query related to revenue we should use Analyst; For all policy questions we should use Search",
        "system": "You are a friendly agent ..."
      },
      "orchestration": {
        "budget": {
          "seconds": 30,
          "tokens": 16000
        }
      },
      "tools": [
        {
          "tool_spec": {
            "type": "generic",
            "name": "get_revenue",
            "description": "Fetch the delivery revenue for a location.",
            "input_schema": {
              "type": "object",
              "properties": {
                "location": {
                  "type": "string",
                  "description": "The city and state, e.g. San Francisco, CA"
                }
              }
            },
            "required": [
              "location"
            ]
          }
        }
      ],
      "tool_resources": {
        "get_revenue": {
          "type": "function",
          "execution_environment": {
            "type": "warehouse",
            "warehouse": "MY_WH"
          },
          "identifier": "DB.SCHEMA.UDF"
        }
      }
    }
    ```

> **Important:**
>
> The `stream` field is ignored. A non-streaming response is always returned.

## Returns

Returns a JSON string containing the agent’s response.

## Access control requirements

To run an agent, you must use a role that can access Cortex Agents.
For details, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-agents.md).

## Usage notes

* The function returns a JSON string. Pass this string to [TRY_PARSE_JSON](try_parse_json.md) to convert the response to a VARIANT value.
* Unlike [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](data_agent_run-snowflake-cortex.md), this function does not require you to create an agent object first. Instead, you provide the configuration directly in the request body.

## Examples

Run an agent and parse the response JSON:

```sqlexample
SELECT
  TRY_PARSE_JSON(
    SNOWFLAKE.CORTEX.AGENT_RUN(
      $${
        "messages": [
          {
            "role": "user",
            "content": [
              {
                "type": "text",
                "text": "What is the total revenue for 2025?"
              }
            ]
          }
        ],
        "models": {
          "orchestration": "claude-4-sonnet"
        }
      }$$
    )
  ) AS resp;
```

Sample return value:

```json
{
  "content": [
    {
      "text": "The total revenue for 2025 was $100,000.",
      "type": "text"
    }
  ],
  "metadata": {
    "usage": {
      "tokens_consumed": [
        {
          "context_window": 200000,
          "input_tokens": {
            "cache_read": 0,
            "cache_write": 0,
            "total": 67,
            "uncached": 67
          },
          "model_name": "claude-4-sonnet",
          "output_tokens": {
            "total": 38
          }
        }
      ]
    }
  },
  "role": "assistant"
}
```

---
title: AGG
source: https://docs.snowflake.com/en/sql-reference/functions/agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Semantic Views)

# AGG

Evaluates and returns the value of a metric in a [semantic view](../../user-guide/views-semantic/overview.md) when you
[run a query](../../user-guide/views-semantic/querying.md).

## Syntax

```sqlsyntax
AGG( <metric_in_semantic_view> )
```

## Arguments

`metric_in_semantic_view`
:   Metric that you want to return in a query of a semantic view.

## Returns

Returns the value of the specified metric.

---
title: AI_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/ai_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)
    [String & binary functions](../functions-string.md) (AI Functions)

# AI_AGG

Reduces a column of text data using a natural language instruction.

For example, `AI_AGG(reviews, 'Describe the most common complaints mentioned in the book reviews')` will return a summary of user feedback.

Unlike [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md) and [SUMMARIZE (SNOWFLAKE.CORTEX)](summarize-snowflake-cortex.md), this function supports datasets larger than the maximum language model context window.

See also:
:   [AI_SUMMARIZE_AGG](ai_summarize_agg.md)

## Syntax

```sqlsyntax
AI_AGG( <expr>, <instruction> )
```

## Arguments

**Required:**

`expr`
:   This is an expression that contains text on which an aggregation operation is to be performed, such as restaurant reviews or phone transcripts.

`instruction`
:   A string containing a natural language specification of the aggregation to perform, for example “Summarize the reviews” or “Identify all people mentioned and write a short biography for each of them”.

## Returns

Returns a string containing the result of the aggregation.

The function may indicate that the data you’ve provided doesn’t contain the answer if:

* You don’t provide a clear instruction specifying how to aggregate the data
* The data doesn’t have the information necessary to complete your instruction

## Usage notes

For optimal performance, follow these guidelines:

* Use plain English text for the instruction.
* Provide a declarative instruction instead of asking a question. For example, instead of a question like “Can you summarize this?”, use “Summarize the phone call transcripts”.
* Describe the text provided in the instruction. For example, instead of an instruction like “summarize”, use “Summarize the phone call transcripts”.
* Describe the intended use case. For example, instead of “find the best review”, use “Find the most positive and well-written restaurant review to highlight on the restaurant website”.
* Multiple columns can be used in the string expression using `CONCAT` or the `||` operator. See the example below.
* Consider breaking the instruction into multiple steps. For example, instead of “Summarize the new articles”, use “You will be provided with news articles from various publishers presenting events from different points of view. Please create a concise and elaborative summary of source texts without missing any crucial information.”.

## Examples

AI_AGG can be used as a simple scalar function on string constants. In the following example, AI_AGG is used to
summarize product ratings, which are provided as a single string.

```sqlexample
SELECT AI_AGG('[Excellent, Excellent, Great, Mediocre]',
              'Summarize the product ratings for a blog post targeting consumers');
```

```output
Overall, the product has received overwhelmingly positive reviews, with the majority of users rating it as 'Excellent' or 'Great'. Only a small percentage of users had a mediocre experience with the product. This suggests that the product is well-liked by most consumers and is a great option for those looking for a reliable choice.
```

AI_AGG can also be used on a column of data. In the following example, the product ratings from the above example are provided as a column in a table using a [Common Table Expression](../../user-guide/queries-cte.md).

```sqlexample
WITH reviews AS (
            SELECT 'The restaurant was excellent.' AS review
  UNION ALL SELECT 'Excellent! I loved the pizza!'
  UNION ALL SELECT 'It was great, but the service was meh.'
  UNION ALL SELECT 'Mediocre food and mediocre service'
)
SELECT AI_AGG(review, 'Summarize the restaurant reviews for potential consumers')
  FROM reviews;
```

```output
Reviews for this restaurant are mixed. Some customers had a very positive experience, describing the restaurant as "excellent" and loving the pizza. However, others had a more neutral or negative experience, citing mediocre food and service.
```

AI_AGG can be used on multiple columns of data using `CONCAT` or the `||` operator.

```sqlexample
WITH reviews AS (
            SELECT 'The restaurant was excellent.' AS review, 'Pizza' AS menu_item
  UNION ALL SELECT 'Excellent! I loved the pizza!', 'Pizza'
  UNION ALL SELECT 'It was great, but the service was meh.', 'Burger'
  UNION ALL SELECT 'Mediocre food and mediocre service', 'Pancakes'
)
SELECT AI_AGG('Menu Item: ' || menu_item || '\nReview: ' || review,
              'Summarize the restaurant reviews for potential consumers')
  FROM reviews;
```

```output
Based on the reviews, the restaurant seems to receive high praise for their pizza, with two reviews using the word "excellent" to describe their experience. However, the reviews for other menu items, such as burgers and pancakes, are more mixed, with some customers expressing disappointment with the service or finding the food to be just mediocre. Overall, potential consumers may want to consider ordering pizza if they decide to dine at this restaurant.
```

AI_AGG can also be used in combination with GROUP BY. The following example summarizes product ratings for two products (identified by the column `product_id`) in a table of reviews.

```sqlexample
WITH reviews AS (
            SELECT 1 AS restaurant_id, 'The restaurant was excellent.' AS review
  UNION ALL SELECT 1, 'Excellent! I loved the pizza!'
  UNION ALL SELECT 1, 'It was great, but the service was meh.'
  UNION ALL SELECT 1, 'Mediocre food and mediocre service'
  UNION ALL SELECT 2, 'Terrible quality ingredients, I should have eaten at home.'
  UNION ALL SELECT 2, 'Bad restaurant, I would avoid this place.'
)
SELECT restaurant_id,
       AI_AGG(review, 'Summarize the restaurant reviews for potential consumers')
  FROM reviews
 GROUP BY 1;
```

```output
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| RESTAURANT_ID | SUMMARIZED_REVIEW                                                                                                                                                                                                                                 |
|---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| 1             | Reviews for this restaurant are mixed. Some customers had a very positive experience, describing the restaurant as "excellent" and loving the pizza. However, others had a more neutral or negative experience, citing mediocre food and service. |
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| 2             | Two reviewers had extremely negative experiences at this restaurant, citing poor quality ingredients and advising others to avoid it.                                                                                                             |
+---------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

The instruction can be used for various aggregation tasks and to configure the style and tone of the response. The following example uses an instruction to find the most positive rating for each product and provide
French and Polish translations of the rating.

```sqlexample
WITH reviews AS (
            SELECT 1 AS product_id, 'Excellent' AS review
  UNION ALL SELECT 1, 'Excellent'
  UNION ALL SELECT 1, 'Great'
  UNION ALL SELECT 1, 'Mediocre'
  UNION ALL SELECT 2, 'Terrible'
  UNION ALL SELECT 2, 'Bad'
  UNION ALL SELECT 2, 'Average'
)
SELECT product_id,
       AI_AGG(review, 'Identify the most positive rating and translate it into French and Polish, one word only') AS summarized_review
  FROM reviews
 GROUP BY 1;
```

```output
+------------+--------------------+
| PRODUCT_ID | SUMMARIZED_REVIEW  |
|------------+--------------------+
| 1          | French: Excellent  |
|            | Polish: Doskonały  |
+------------+--------------------+
| 2          | French: Moyen      |
|            | Polish: Przeciętny |
+------------+--------------------+
```

See also [AI_SUMMARIZE_AGG](ai_summarize_agg.md).

---
title: AI_CLASSIFY
source: https://docs.snowflake.com/en/sql-reference/functions/ai_classify.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_CLASSIFY

> **Note:**
>
> AI_CLASSIFY is the updated version of [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](classify_text-snowflake-cortex.md).
> For the latest functionality, use AI_CLASSIFY.

Classifies text or images into categories that you specify.

## Region availability

The following table shows the regions where you can use the AI_CLASSIFY function for both text and images:

| Data type | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) | AWS  (Cross-Region) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| TEXT | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| IMAGE | ✔ | ✔ | ✔ |  |  |  |  |  | ✔ |

## Syntax

```sqlsyntax
AI_CLASSIFY( <input> , <list_of_categories> [, <config_object> ] [, <return_error_details> ] )
```

## Arguments

**Required:**

`input`
:   The string, image, or [prompt](prompt.md) object that you’re classifying.

    For text classification, the input string is case sensitive. Results may vary based on capitalization.

`list_of_categories`
:   An array of categories with at least two unique values. The number of categories is restricted only by the
    token window, but in practice, exceeding twenty categories might reduce classification accuracy.
    Categories are case sensitive.

    Categories can be simple strings or SQL objects of the same type.
    If you’re using objects, you can provide a description for one or more categories to improve classification accuracy.

    For each category, specify the following:

    * `label` (Required): The name of the category.
    * `description` (Optional): Describes the category in no more than 25 words.

    > **Note:**
    >
    > Descriptions count as input tokens, which affects the cost of the classification operation.
    > For more information, see [Cost considerations](../../user-guide/snowflake-cortex/aisql.md).

**Optional:**

`config_object`
:   Configuration settings specified as key/value pairs. Supported keys:

    * `task_description`: A explanation of the classification task that is 50 words or fewer. This can help the model understand the context of the classification task and improve accuracy.
    * `output_mode`: Set to `'multi'` for multi-label classification. Defaults to `'single'` for single-label classification.
    * `examples`: A list of example objects for few-shot learning. Each example must include:

      + `input`: Example text to classify.
      + `labels`: List of correct categories for the input.
      + `explanation`: Explanation of why the input maps to those categories.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

A serialized object. The object’s `labels` field is an array that specifies the list of categories to which the input belongs.

For single label classification, the `labels` array has exactly one element. For multi-label classification, the `labels` field can have multiple elements.

## Error behavior

By default, if AI_CLASSIFY can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: An OBJECT containing the classification result, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Access control requirements

Users must use a role that has the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
For more information about this privilege, see [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md).

## Usage notes

For best results, follow these guidelines:

* Use plain text in English for the `input` and `list_of_categories`.
* Avoid trying to classify non-prose such as code snippets, logs, or non-English text.
* Avoid using code or formatting that is not open source (such as proprietary languages or formats) in the text. The
  underlying language model is not trained on proprietary formats.
* Don’t use abbreviations, special characters, or jargon in the category labels.
* Use descriptive categories. Avoid using category names such as “Xa4s3” or “category 1”.
* Use mutually exclusive categories.
* Providing a clear task description can improve accuracy when the relationship between the input and categories is unclear or complex.
* Adding label descriptions can improve accuracy, especially when labels are ambiguous or require specific selection
  criteria. Write descriptions that clearly highlight what distinguishes each label from the others.
* Each label, description, and example increases the number of input tokens for every AI_CLASSIFY call, which affects cost.
* Examples can help to improve accuracy.

> **Note:**
>
> AI_CLASSIFY adds a prompt to your input to generate its response. This increases the token count beyond the text that you’ve provided.

## Examples

The following examples use the AI_CLASSIFY function with only the required arguments.

### AI_CLASSIFY: Text

The following example classifies the prompt into one of two categories, travel or cooking:

```sqlexample
SELECT AI_CLASSIFY('One day I will see the world', ['travel', 'cooking']);
```

The following is the output of the preceding command.

```output
'{
  "labels": ["travel"]
 }';
```

The following example uses multi-label classification:

```sqlexample
SELECT AI_CLASSIFY(
  'One day I will see the world and learn to cook my favorite dishes',
  ['travel', 'cooking', 'reading', 'driving'],
  {'output_mode': 'multi'}
);
```

The following is the output of the preceding command.

```output
'{
  "labels": ["travel", "cooking"]
 }';
```

The following example passes in a task description, label descriptions, and few-shot examples:

```sqlexample
SELECT AI_CLASSIFY(
  'One day I will see the world and learn to cook my favorite dishes',
  [
    {'label': 'travel', 'description': 'content related to traveling'},
    {'label': 'cooking'},
    {'label': 'reading'},
    {'label': 'driving'}
  ],
  {
    'task_description': 'Determine topics related to the given text',
    'output_mode': 'multi',
    'examples': [
      {
        'input': 'i love traveling with a good book',
        'labels': ['travel', 'reading'],
        'explanation': 'the text mentions traveling and a good book which relates to reading'
      }
    ]
  });
```

The following is the output of the preceding command.

```output
'{
  "labels": ["travel", "cooking"]
}';
```

The following example creates a `text_classification_table` that contains a column for text and a column for possible
categories for that text. The AI_CLASSIFY function is called on each row of the table to classify the string in the text
column.

```sqlexample
CREATE OR REPLACE TEMPORARY TABLE text_classification_table AS
SELECT 'France' AS input, ['North America', 'Europe', 'Asia'] AS classes
UNION ALL
SELECT 'Singapore', ['North America', 'Europe', 'Asia']
UNION ALL
SELECT 'one day I will see the world', ['travel', 'cooking', 'dancing']
UNION ALL
SELECT 'my lobster bisque is second to none', ['travel', 'cooking', 'dancing'];

SELECT input,
    classes,
    AI_CLASSIFY(input, classes):labels AS classification
FROM text_classification_table;
```

### AI_CLASSIFY: Images

Using single file input:

```sqlexample
WITH food_pictures AS (
  SELECT
      TO_FILE(file_url) AS img
  FROM DIRECTORY(@file_stage)
)
SELECT
*,
AI_CLASSIFY(img, ['dessert', 'drink', 'main dish', 'side dish']):labels AS classification
FROM food_pictures;
```

Using a prompt object constructed by PROMPT():

```sqlexample
  WITH food_pictures AS (
  SELECT
      TO_FILE(file_url) AS img
  FROM DIRECTORY(@file_stage)
)
SELECT
*,
AI_CLASSIFY(PROMPT('Please help me classify the food within this image {0}', img),
  ['dessert', 'drink', 'main dish', 'side dish']):labels AS classification
FROM food_pictures;
```

## Limitations

* Snowflake AI functions don’t work on FILE objects created from files in the following kinds of stages:

  + Internal stages with encryption mode `TYPE = 'SNOWFLAKE_FULL'`
  + External stages with any customer-side encrypted mode:

    - `TYPE = 'AWS_CSE'`
    - `TYPE = 'AZURE_CSE'`
  + User stage
  + Table stage
  + Stage with double-quoted names

---
title: AI_COMPLETE
source: https://docs.snowflake.com/en/sql-reference/functions/ai_complete.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions) ,
    [File functions](../functions-file.md) (AI Functions)

# AI_COMPLETE

> **Note:**
>
> AI_COMPLETE is the updated version of [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md).
> For the latest functionality, use AI_COMPLETE.

Generates a response (completion) from text or an image using a supported language model. You can provide:

* A text prompt, to generate a response from the model. For more information, see [AI_COMPLETE (Single string)](ai_complete-single-string.md).
* A single image and a text prompt, to generate a response based on the image and prompt. For more information, see [AI_COMPLETE (Single image)](ai_complete-single-file.md).
* A prompt object that can support multiple images and text. For more information, see [AI_COMPLETE (Prompt object)](ai_complete-prompt-object.md).

## Syntax

The syntax for the function depends on the type of input that you provide. For information about the syntax, see the following sections:

* [Single string arguments](ai_complete-single-string.md)
* [Single image arguments](ai_complete-single-file.md)
* [Prompt object arguments](ai_complete-prompt-object.md)

All syntax variations accept an optional `return_error_details` BOOLEAN argument as the final parameter.
When set to TRUE, the function returns an OBJECT that contains the value and the error message, one of which
is NULL depending on whether the function succeeded or failed. See Error behavior for details.

## Error behavior

By default, if AI_COMPLETE can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: The completion response (same type as the normal return value), or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md) for legal notices.

---
title: AI_COMPLETE (Prompt object)
source: https://docs.snowflake.com/en/sql-reference/functions/ai_complete-prompt-object.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_COMPLETE (Prompt object)

> **Note:**
>
> AI_COMPLETE is the updated version of [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md).
> For the latest functionality, use AI_COMPLETE.

Generates a response (completion) for a prompt object. The prompt can contain [FILE objects](to_file.md), which may contain images or documents.

[Preview Feature](../../release-notes/preview-features.md) — Open

The document processing capability of AI_COMPLETE is currently in preview. All other capabilities are generally available.

## Syntax

The function can be used with either positional or named argument syntax.

```sqlsyntax
AI_COMPLETE(
    <model>, <prompt> [ , <model_parameters> ] )
```

## Arguments

`model`
:   A string specifying the model to be used. For text only inputs, you can use one of the following models:

    * `claude-4-opus`
    * `claude-4-sonnet`
    * `claude-3-7-sonnet`
    * `claude-3-5-sonnet`
    * `deepseek-r1`
    * `llama3-8b`
    * `llama3-70b`
    * `llama3.1-8b`
    * `llama3.1-70b`
    * `llama3.1-405b`
    * `llama3.3-70b`
    * `llama4-maverick`
    * `llama4-scout`
    * `mistral-large`
    * `mistral-large2`
    * `mistral-7b`
    * `mixtral-8x7b`
    * `openai-gpt-4.1`
    * `openai-gpt-5`
    * `openai-gpt-5-chat`
    * `openai-gpt-5-mini`
    * `openai-gpt-5-nano`
    * `openai-gpt-5.1`
    * `openai-o4-mini`
    * `snowflake-arctic`
    * `snowflake-llama-3.1-405b`
    * `snowflake-llama-3.3-70b`

    For image inputs, you can use one of the following models:

    * `claude-4-opus`
    * `claude-4-sonnet`
    * `claude-haiku-4-5`
    * `claude-sonnet-4-5`
    * `claude-opus-4-5`
    * `claude-sonnet-4-6`
    * `claude-opus-4-6`
    * `llama4-maverick`
    * `llama4-scout`
    * `pixtral-large`
    * `openai-gpt-5`
    * `openai-gpt-5-chat`
    * `openai-gpt-5-mini`
    * `openai-gpt-5-nano`
    * `openai-gpt-5.1`
    * `openai-gpt-5.2`
    * `gemini-2.5-flash`
    * `gemini-2.5-flash-lite`
    * `gemini-3.1-pro`

    For document inputs, you can use one of the following models:

    * `gemini-3.1-pro`
    * `claude-4-opus`
    * `claude-4-sonnet`
    * `claude-haiku-4-5`
    * `claude-sonnet-4-5`
    * `claude-opus-4-5`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`prompt`
:   A [prompt](prompt.md) object containing text and, optionally, images or documents.

`model_parameters`
An [object](../data-types-semistructured.md) containing zero or more of the following options that affect the model’s
hyperparameters. See [LLM Settings](https://www.promptingguide.ai/introduction/settings).

> * `temperature`: A value from 0 to 1 (inclusive) that controls the randomness of the output of the language model. A
>   higher temperature (for example, 0.7) results in more diverse and random output, while a lower temperature (such as
>   0.2) makes the output more deterministic and focused.
>
>   Default: 0
> * `top_p`: A value from 0 to 1 (inclusive) that controls the randomness and diversity of the language model,
>   generally used as an alternative to `temperature`. The difference is that `top_p` restricts the set of possible tokens
>   that the model outputs, while `temperature` influences which tokens are chosen at each step.
>
>   Default: 0
> * `max_tokens`: Sets the maximum number of output tokens in the response. Small values can result in truncated responses.
>
>   Default: 4096
>   Maximum allowed value: 8192
> * `guardrails`: Filters potentially unsafe and harmful responses from a language model using [Cortex Guard](../../user-guide/snowflake-cortex/aisql.md).
>   Either `TRUE` or `FALSE`. The default value is `FALSE`.

> **Important:**
>
> If you’re using AI_COMPLETE with a prompt object, you can’t provide a JSON schema to get a structured output as a response.
>
> To get a structured output as the response, use the `response_format` parameter with [AI_COMPLETE (Single string)](ai_complete-single-string.md). For more information using structured outputs, see [AI_COMPLETE structured outputs](../../user-guide/snowflake-cortex/complete-structured-outputs.md).

## Example

### Passing multiple images as the input

The following example compares two images by passing both as input to the AI_COMPLETE function and asking whether both are pictures of cats:

```sqlexample
SELECT AI_COMPLETE('claude-sonnet-4-6',
  PROMPT('Are both image {0} and image {1} pictures of cats?',
    TO_FILE('@myimages', 'sleepingcat.png'), TO_FILE('@myimages', 'jumpingcat.png'))) AS image_classification;
```

### Batch processing images from a directory or table

For batch processing of multiple images, performing the same operation on each, store the image files in the same stage.
Apply the AI_COMPLETE function to each row of the table.

> **Note:**
>
> The stage must have a [directory table](../../user-guide/data-load-dirtables.md) to retrieve the paths to its files.

First, create the table by retrieving the image locations from the directory, convert these to FILE objects, and
storing the resulting FILE objects in a column in a table. Use SQL like the following:

```sqlexample
CREATE TABLE image_table AS
    (SELECT TO_FILE('@myimages', RELATIVE_PATH) AS img FROM DIRECTORY(@myimages));
```

Then, apply the AI_COMPLETE function to the column containing the FILE objects. The following example classifies each image in the table:

```sqlexample
SELECT AI_COMPLETE('claude-sonnet-4-6',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON', img_file)) AS image_classification
FROM image_table;
```

Response:

```output
{ "classification": "Inflation Rates" }
{ "classification": "beverage refrigerator" }
{ "classification": "Space Needle" }
{ "classification": "Modern Kitchen" }
{ "classification": "Pie Chart" }
{ "classification": "Economic Graph" }
{ "classification": "Persian Cat" }
{ "classification": "Labrador Retriever" }
{ "classification": "Jedi Cat" }
{ "classification": "Sleeping cat" }
{ "classification": "Persian Cat" }
{ "classification": "Garden Costume" }
{ "classification": "Floral Fashion" }
```

If you already have a table with paths to the images, you can use the [TO_FILE function](to_file.md) to construct the FILE
objects within the query:

```sqlexample
SELECT AI_COMPLETE('claude-sonnet-4-6',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON',
        TO_FILE('@myimages', img_path)) AS image_classification
FROM image_table;
```

You can also retrieve the images to be processed directly from a stage’s directory, as shown here:

```sqlexample
SELECT AI_COMPLETE('claude-sonnet-4-6',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON',
        TO_FILE('@myimages', RELATIVE_PATH))) as image_classification
FROM DIRECTORY(@myimages);
```

### Providing images and prompts in a table

To perform a different operation on each image in a table, provide the images and their corresponding prompts in a
table. In the following example, the table contains the stage path of each image in the `img_path` column and the
prompt in the `prompt` column.

```sqlexample
AI_COMPLETE('claude-sonnet-4-6',
    PROMPT('Given the input image {0}, {1}. Respond in JSON',
        TO_FILE('@myimages', img_path), prompt) as image_result)
FROM image_table;
```

## Usage notes for processing images

* To process multiple images, specify a prompt object in the function call that defines a prompt template and the associated image files. You can use the [PROMPT](prompt.md) function to create this object. The prompt template can contain numbered placeholders (`{0}`, `{1}`, etc.) that correspond to the images in the prompt object. Use the [TO_FILE](to_file.md) function to specify the document files in the prompt object.
* Only text and images are supported. Video and audio files are not supported.
* Supported image formats:

  + `.jpg`
  + `.jpeg`
  + `.png`
  + `.gif`
  + `.webp`

  The `pixtral` and `llama4` models also support `.bmp`.
* The maximum image size is 10 MB for most models, and 3.75 MB for `claude` models. `claude` models do not support images with resolutions above 8000x8000.
* The stage containing the images must have server-side encryption enabled. Client-side encrypted stages are not supported.
* The function does not support custom network policies.
* Stage names are case-insensitive; paths are case-sensitive.

## Usage notes for processing documents

* To process multiple documents, specify a prompt object in the function call that defines a prompt template and the associated document files. You can use the [PROMPT](prompt.md) function to create this object. The prompt template can contain numbered placeholders (`{0}`, `{1}`, etc.) that correspond to the documents in the prompt object. Use the [TO_FILE](to_file.md) function to specify the document files in the prompt object.
* Only text and documents are supported. Video and audio files are not supported.
* All models support these formats: `.txt`, `.md`, and `.pdf`. Claude models also support `.txt`, `.md`, `.pdf`, `.doc`, `.docx`, `.xls`, `.xlsx`, `.csv`, and `.xhtml`.
* Claude models have a maximum document size of 4.5 MB. Gemini 3.1 Pro has a maximum document size of 10 MB.
* The function does not support custom network policies.
* Stage names are case-insensitive; paths are case-sensitive.

---
title: AI_COMPLETE (Single image)
source: https://docs.snowflake.com/en/sql-reference/functions/ai_complete-single-file.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_COMPLETE (Single image)

> **Note:**
>
> AI_COMPLETE is the updated version of [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md).
> For the latest functionality, use AI_COMPLETE.

Generates a response (completion) for a text prompt using a supported language model. This variant of the function enhances AI_COMPLETE with document understanding capabilities. The prompt can reference information or images found in a file containing a document. The function supports a single document input.

## Syntax

The function has two required arguments and four optional arguments.
The function can be used with either positional or named argument syntax.

Using AI_COMPLETE with a single image input:

```sqlsyntax
AI_COMPLETE(
    <model>, <predicate>, <file> [, <model_parameters> ] )
```

## Arguments

`model`
:   A string specifying the model to be used. Specify one of the following models:

    > * `claude-4-opus`
    > * `claude-4-sonnet`
    > * `claude-3-7-sonnet`
    > * `claude-3-5-sonnet`
    > * `llama4-maverick`
    > * `llama4-scout`
    > * `openai-o4-mini`
    > * `openai-gpt-4.1`
    > * `pixtral-large`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`predicate`
:   A string prompt.

`file`
:   A FILE type object representing an image.

`model_parameters`
:   An [object](../data-types-semistructured.md) containing zero or more of the following options that affect the model’s
    hyperparameters. See [LLM Settings](https://www.promptingguide.ai/introduction/settings).

    * `temperature`: A value from 0 to 1 (inclusive) that controls the randomness of the output of the language model. A
      higher temperature (for example, 0.7) results in more diverse and random output, while a lower temperature (such as
      0.2) makes the output more deterministic and focused.

      Default: 0
    * `top_p`: A value from 0 to 1 (inclusive) that controls the randomness and diversity of the language model,
      generally used as an alternative to `temperature`. The difference is that `top_p` restricts the set of possible tokens
      that the model outputs, while `temperature` influences which tokens are chosen at each step.

      Default: 0
    * `max_tokens`: Sets the maximum number of output tokens in the response. Small values can result in truncated responses.

      Default: 4096
      Maximum allowed value: 8192
    * `guardrails`: Filters potentially unsafe and harmful responses from a language model using [Cortex Guard](../../user-guide/snowflake-cortex/aisql.md).
      Either `TRUE` or `FALSE`. The default value is `FALSE`.

## Returns

Returns the string response from the language model.

## Examples

The following examples demonstrate the basic capabilities of the COMPLETE function with images.

### Visual question answering

A chart of inflation rates is used to answer a question about the data.

```sqlexample
SELECT AI_COMPLETE('claude-3-5-sonnet',
    'Which country will observe the largest inflation change in 2024 compared to 2023?',
    TO_FILE('@myimages', 'highest-inflation.png'));
```

Response:

```output
Looking at the data, Venezuela will experience the largest change in inflation rates between 2023 and 2024.
The inflation rate in Venezuela is projected to decrease significantly from 337.46% in 2023 to 99.98% in 2024,
representing a reduction of approximately 237.48 percentage points. This is the most dramatic change among
all countries shown in the chart, even though Zimbabwe has higher absolute inflation rates.
```

### Entity extraction from an image

This example extracts the entities (objects) from an image and returns the results in JSON format.

```sqlexample
SELECT AI_COMPLETE('claude-3-5-sonnet',
    'Extract the kitchen appliances identified in this image. Respond in JSON only with the identified appliances.',
    TO_FILE('@myimages', 'kitchen.png'));
```

Response:

```output
{
    "appliances": [ "microwave","electric stove","oven","refrigerator" ]
}
```

## Usage notes for processing images

* Only text and images are supported. Video and audio files are not supported.
* Supported image formats:

  + `.jpg`
  + `.jpeg`
  + `.png`
  + `.gif`
  + `.webp`
  + `pixtral` and `llama4` models also support `.bmp`.
* The maximum image size is 10 MB for most models, and 3.75 MB for `claude` models. `claude` models do not support images with resolutions above 8000x8000.
* The stage containing the images must have server-side encryption enabled. Client-side encrypted stages are not supported.
* The function does not support custom network policies.
* Stage names are case-insensitive; paths are case-sensitive.

---
title: AI_COMPLETE (Single string)
source: https://docs.snowflake.com/en/sql-reference/functions/ai_complete-single-string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_COMPLETE (Single string)

> **Note:**
>
> AI_COMPLETE is the updated version of [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md).
> For the latest functionality, use AI_COMPLETE.

Generates a response (completion) for a text prompt using a supported language model.

## Syntax

The function contains two required arguments and four optional arguments.
The function can be used with either positional or named argument syntax.

Using AI_COMPLETE with a single string input

```sqlsyntax
AI_COMPLETE(
    <model>, <prompt> [ , <model_parameters>, <response_format>, <show_details> ] )
```

## Arguments

`model`
:   A string specifying the [model](../../user-guide/snowflake-cortex/aisql.md) to be used.

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`prompt`
:   A string prompt

`model_parameters`
:   An [object](../data-types-semistructured.md) containing zero or more of the following options that affect the model’s
    hyperparameters. See [LLM Settings](https://www.promptingguide.ai/introduction/settings).

    * `temperature`: A value from 0 to 1 (inclusive) that controls the randomness of the output of the language model. A
      higher temperature (for example, 0.7) results in more diverse and random output, while a lower temperature (such as
      0.2) makes the output more deterministic and focused.

      Default: 0
    * `top_p`: A value from 0 to 1 (inclusive) that controls the randomness and diversity of the language model,
      generally used as an alternative to `temperature`. The difference is that `top_p` restricts the set of possible tokens
      that the model outputs, while `temperature` influences which tokens are chosen at each step.

      Default: 0
    * `max_tokens`: Sets the maximum number of output tokens in the response. Small values can result in truncated responses.

      Default: 4096
      Maximum allowed value: 8192
    * `guardrails`: Filters potentially unsafe and harmful responses from a language model using [Cortex Guard](../../user-guide/snowflake-cortex/aisql.md).
      Either TRUE or FALSE.

      Default: FALSE

`response_format`
:   The format that the response should follow. You can specify the response format as:

    * A [JSON schema](https://json-schema.org/) that the response should follow. This is a SQL sub-object, not a string.
    * A SQL type literal beginning with the TYPE keyword. The defined type must use an OBJECT as its top-level container, and fields
      of this OBJECT are mapped to corresponding JSON fields and values.

    If `response_format` is not specified, the response is a string containing either the response or a serialized JSON object containing the response and information about it.

    For more information, see [AI_COMPLETE structured outputs](../../user-guide/snowflake-cortex/complete-structured-outputs.md).

`show_details`
:   A boolean flag that indicates whether to return a serialized JSON object containing the response and information about it.

## Returns

When the `show_details` argument is not specified or set to FALSE and the `response_format` is not specified or set to NULL, returns a string containing the response.

When the `show_details` argument is not specified or set to FALSE and the `response_format` is specified, returns an object following the provided response format.

When the `show_details` argument is set to TRUE and the `response_format` is not specified, returns a
a JSON object containing the following keys.

* `"choices"`: An array of the model’s responses. (Currently, only one response is provided.) Each response is
  an object containing a `"messages"` key whose value is the model’s response to the latest prompt.
* `"created"`: UNIX timestamp (seconds since midnight, January 1, 1970) when the response was generated.
* `"model"`: The name of the model that created the response.
* `"usage"`: An object recording the number of tokens consumed and generated by this completion. Includes
  the following sub-keys:

  + `"completion_tokens"`: The number of tokens in the generated response.
  + `"prompt_tokens"`: The number of tokens in the prompt.
  + `"total_tokens"`: The total number of tokens consumed, which is the sum of the other two values.

When the `show_details` argument is set to TRUE and the `response_format` is specified, returns a
a JSON object containing the following keys

* `"structured_output"`: A json object following the specified response format.
* `"created"`: UNIX timestamp (seconds since midnight, January 1, 1970) when the response was generated.
* `"model"`: The name of the model that created the response.
* `"usage"`: An object recording the number of tokens consumed and generated by this completion. Includes
  the following sub-keys:

  + `"completion_tokens"`: The number of tokens in the generated response.
  + `"prompt_tokens"`: The number of tokens in the prompt.
  + `"total_tokens"`: The total number of tokens consumed, which is the sum of the other two values.

## Examples

### Single response

To generate a single response:

```sqlexample
SELECT AI_COMPLETE('snowflake-arctic', 'What are large language models?');
```

### Responses from table column

The following example generates a response for each row in the `reviews` table, using the `content` column as input. Each query result contains a critique of the corresponding review.

```sqlexample
SELECT AI_COMPLETE(
    'mistral-large',
        CONCAT('Critique this review in bullet points: <review>', content, '</review>')
) FROM reviews LIMIT 10;
```

> **Tip:**
>
> As shown in this example, you can use tagging in the prompt to control the kind of response generated. See
> [A guide to prompting LLaMA 2](https://replicate.com/blog/how-to-prompt-llama) for tips.

### Controlling model parameters

The following example specifies the `model_parameters` used to provide a response.

```sqlexample
SELECT AI_COMPLETE(
    model => 'deepseek-r1',
    prompt => 'how does a snowflake get its unique pattern?',
    model_parameters => {
        'temperature': 0.7,
        'max_tokens': 10
    }
);
```

The response is a string containing the message from the language model and other information. Note that the response
is truncated as instructed in the `model_parameters` argument.

```json
"The unique pattern on a snowflake is"
```

### Detailed output

The following example shows how you can use the `show_details` argument to return additional inference details.

```sqlexample
SELECT AI_COMPLETE(
    model => 'deepseek-r1',
    prompt => 'how does a snowflake get its unique pattern?',
    model_parameters => {
        'temperature': 0.7,
        'max_tokens': 10
    },
    show_details => true
);
```

The response is a JSON object with the model’s message and related details. The `options` argument was used to truncate the output.

```json
{
    "choices": [
        {
            "messages": " The unique pattern on a snowflake is"
        }
    ],
    "created": 1708536426,
    "model": "deepseek-r1",
    "usage": {
        "completion_tokens": 10,
        "prompt_tokens": 22,
        "guardrail_tokens": 0,
        "total_tokens": 32
    }
}
```

### Specifying a JSON response format

This example illustrates the use of the function’s `response_format` argument to return a structured response by providing a type literal.

```sqlexample
SELECT AI_COMPLETE(
    model => 'deepseek-r1',
    prompt => 'Extract structured data from this customer interaction note: Customer Sarah Jones complained about the mobile app crashing during checkout. She tried to purchase 3 items: a red XL jacket ($89.99), blue running shoes ($129.50), and a fitness tracker ($199.00). The app crashed after she entered her shipping address at 123 Main St, Portland OR, 97201. She has been a premium member since January 2024.',
    model_parameters => {
        'temperature': 0,
        'max_tokens': 4096
    },
    response_format => TYPE OBJECT(note OBJECT(items_count NUMBER, price ARRAY(STRING), address STRING, member_date STRING))
);
```

The response is a JSON object following the structured response format.

Response:

```output
{
    "note": {
        "address": "123 Main St, Portland OR, 97201",
        "items_count": 3,
        "member_date": "January 2024",
        "price": [
        "$89.99",
        "$129.50",
        "$199.00"
        ]
    }
}
```

### Specifying a JSON response format with details, using a type literal

This example illustrates the use of `response_format` argument to return a structured response combined with `show_details` to get additional inference information, using a type literal.

```sqlexample
SELECT AI_COMPLETE(
    model => 'llama3.3-70b',
    prompt => 'Extract structured data from this customer interaction note: Customer Sarah Jones complained about the mobile app crashing during checkout. She tried to purchase 3 items: a red XL jacket ($89.99), blue running shoes ($129.50), and a fitness tracker ($199.00). The app crashed after she entered her shipping address at 123 Main St, Portland OR, 97201. She has been a premium member since January 2024.',
    response_format => TYPE OBJECT(note OBJECT(items_count NUMBER, price ARRAY(STRING), address STRING, member_date STRING)),
    show_details => TRUE
);
```

The response is a JSON object containing structured response with additional inference metadata.

```json
{
  "created": 1758755328,
  "model": "llama3.3-70b",
  "structured_output": [
    {
      "raw_message": {
        "note": {
          "items_count": 3,
          "price": [
            "$89.99",
            "$129.50",
            "$199.00"
          ]
        }
      },
      "type": "json"
    }
  ],
  "usage": {
    "completion_tokens": 49,
    "prompt_tokens": 100,
    "total_tokens": 149
  }
}
```

### Specifying a JSON response format with details, using a JSON schema

This example illustrates the use of the function’s `response_format` argument to return a structured response combined with `show_details` to get additional inference information, using a JSON schema.

```sqlexample
SELECT AI_COMPLETE(
    model => 'deepseek-r1',
    prompt => 'Extract structured data from this customer interaction note: Customer Sarah Jones complained about the mobile app crashing during checkout. She tried to purchase 3 items: a red XL jacket ($89.99), blue running shoes ($129.50), and a fitness tracker ($199.00). The app crashed after she entered her shipping address at 123 Main St, Portland OR, 97201. She has been a premium member since January 2024.',
    model_parameters => {
        'temperature': 0,
        'max_tokens': 4096
    },
    response_format => {
            'type':'json',
            'schema':{'type' : 'object','properties' : {'note':{'type':'object','properties':
            {'items_count' : {'type' : 'number'},'price': {'type':'array','items':{'type':'string'}}, 'address': {'type':'string'}, 'member_date': {'type':'string'}},'required':['items_count','price' ,'address', 'member_date']}}}
    },
    show_details => true
);
```

The response is a json object containing structured response with additional inference metadata.

```output
{
    "created": 1758057115,
    "model": "mistral-large2",
    "structured_output": [
        {
        "raw_message": {
            "note": {
            "address": "123 Main St, Portland OR, 97201",
            "items_count": 3,
            "member_date": "January 2024",
            "price": [
                "$89.99",
                "$129.50",
                "$199.00"
            ]
            }
        },
        "type": "json"
        }
    ],
    "usage": {
        "completion_tokens": 76,
        "prompt_tokens": 100,
        "total_tokens": 176
    }
}
```

---
title: AI_COUNT_TOKENS
source: https://docs.snowflake.com/en/sql-reference/functions/ai_count_tokens.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_COUNT_TOKENS

> **Note:**
>
> AI_COUNT_TOKENS is the updated version of [COUNT_TOKENS (SNOWFLAKE.CORTEX)](count_tokens-snowflake-cortex.md).
> For the latest functionality, use AI_COUNT_TOKENS.

Returns an estimate of the number of tokens in a prompt for the specified large language model or task-specific
function. For functions that can take additional inputs that affect token count, such as model name or
categories/labels, those inputs can also be specified.

## Syntax

The syntax can vary based on the function used. In general, you pass the function name, model name if applicable,
input text, and any additional options that affect token count.

```sqlsyntax
AI_COUNT_TOKENS( <function_name>, <input_text> [, <return_error_details> ] )
AI_COUNT_TOKENS( <function_name>, <model_name>, <input_text> [, <return_error_details> ] )
AI_COUNT_TOKENS( <function_name>, <input_text>, <options> [, <return_error_details> ] )
AI_COUNT_TOKENS( <function_name>, <model_name>, <input_text>, <options> [, <return_error_details> ] )
```

AI_COUNT_TOKENS uses specific syntax variations for some functions. For example:

```sqlsyntax
AI_COUNT_TOKENS( 'ai_similarity', <input_text_1>, <input_text_2>, <options> [, <return_error_details> ] )
AI_COUNT_TOKENS( 'ai_classify', <input_text>, <categories> [, <return_error_details> ] )
AI_COUNT_TOKENS( 'ai_translate', <input_text>, <source_language>, <target_language> [, <return_error_details> ] )
```

See Examples for function specific usage patterns.

## Arguments

**Required:**

`function_name`
:   String containing the name of the function you want to base the token count on, such as `'ai_complete'` or `'ai_sentiment'`.
    The function’s name must begin with “ai_” and use only lowercase letters.

    A complete list of supported functions is available in the [Regional availability](../../user-guide/snowflake-cortex/aisql.md) table.

`input_text` or `input_text_1`, `input_text_2`
:   Input text to count the tokens in.

**Optional:**

`model_name`
:   String containing the name of the model you want to base the token content on. Required if the function specified by
    `function_name` requires you to choose the model to use, such as AI_COMPLETE or AI_EMBED.

    A list of available LLM models is available in the [Regional availability](../../user-guide/snowflake-cortex/aisql.md) table. However, not all models are
    currently supported. Snowflake intends to add support for additional models over time.

    For AI_COMPLETE, the following models are not supported:

    * claude-4-opus
    * claude-4-sonnet
    * claude-3-7-sonnet
    * claude-3-5-sonnet
    * openai-gpt-4.1
    * openai-o4-mini

`categories`
:   An array of VARIANT values that specify one or more categories or labels to use, for functions that require this data. Categories are included in the input token count.

`options`
:   A VARIANT that specifies additional options that affect how the function processes the input. For functions that take
    two text inputs, such as AI_SIMILARITY, options are used to specify the model.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

An [INTEGER](../data-types-numeric.md) value that is the number of tokens of input text calculated using the given parameter values.

## Error behavior

By default, if AI_COUNT_TOKENS can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: An INTEGER value that is the token count, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Usage notes

* Although function names are usually written in all uppercase, use only lowercase letters in function and model
  names.
* COUNT_TOKENS does not work with LLM functions in the SNOWFLAKE.CORTEX namespace or with fine-tuned models.
  You must specify a function name that begins with “ai_”.
* COUNT_TOKENS accepts only text, not image, audio, or video inputs.
* COUNT_TOKENS only incurs compute costs and does not bill based on token count.
* COUNT_TOKENS is available in all regions, even for models not available in a given region.

## Examples

### AI_COMPLETE example

The following SQL statement counts the number of tokens in a prompt for AI_COMPLETE and the `llama3.3-70b` model:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_complete', 'llama3.3-70b', 'Summarize the insights from this
call transcript in 20 words: "I finally splurged on these after months of hesitation about
the price, and I\'m mostly impressed. The Nulu fabric really is as buttery-soft as everyone says,
and they\'re incredibly comfortable for yoga and lounging. The high-rise waistband stays put
and doesn\'t dig in, which is rare for me. However, I\'m already seeing some pilling after
just a few wears, and they definitely require gentle care. They\'re also quite delicate -
I snagged them slightly on my gym bag zipper. Great for low-impact activities, but I wouldn\'t
recommend for high-intensity workouts. Worth it for the comfort factor"');
```

Response:

```output
158
```

### AI_EMBED example

The following SQL statement counts the number of tokens in text being embedded using the AI_EMBED function and the `nv-embed-qa-4'` model:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_embed', 'nv-embed-qa-4', '"I finally splurged on these after months
of hesitation about the price, and I\'m mostly impressed. The Nulu fabric really is as buttery-soft
as everyone says, and they\'re incredibly comfortable for yoga and lounging. The high-rise waistband
stays put and doesn\'t dig in, which is rare for me. However, I\'m already seeing some pilling after
just a few wears, and they definitely require gentle care. They\'re also quite delicate - I snagged
them slightly on my gym bag zipper. Great for low-impact activities, but I wouldn\'t recommend for
high-intensity workouts. Worth it for the comfort factor"');
```

Response:

```output
142
```

### AI_CLASSIFY examples

This example calculates the total number of input tokens required for text classification with given input and labels:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_classify',
  'One day I will see the world and learn to cook my favorite dishes',
  [
      {'label': 'travel'},
      {'label': 'cooking'},
      {'label': 'reading'},
      {'label': 'driving'}
  ]
);
```

Response:

```output
187
```

The following example adds per-label descriptions and an overall task description to the previous example:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_classify',
  'One day I will see the world and learn to cook my favorite dishes',
  [
    {'label': 'travel', 'description': 'content related to traveling'},
    {'label': 'cooking','description': 'content related to food preparation'},
    {'label': 'reading','description': 'content related to reading'},
    {'label': 'driving','description': 'content related to driving a car'}
  ],
  {
    'task_description': 'Determine topics related to the given text'
  };
```

Response:

```output
254
```

The following example builds upon the previous two examples by adding label examples:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_classify',
  'One day I will see the world and learn to cook my favorite dishes',
  [
    {'label': 'travel', 'description': 'content related to traveling'},
    {'label': 'cooking','description': 'content related to food preparation'},
    {'label': 'reading','description': 'content related to reading'},
    {'label': 'driving','description': 'content related to driving a car'}
  ],
  {
    'task_description': 'Determine topics related to the given text',
    'examples': [
      {
        'input': 'i love traveling with a good book',
        'labels': ['travel', 'reading'],
        'explanation': 'the text mentions traveling and a good book which relates to reading'
      }
    ]
  }
);
```

Response:

```output
298
```

### AI_SENTIMENT examples

The following SQL statement counts the number of tokens in text being analyzed for sentiment using the AI_SENTIMENT function:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_sentiment',
  'This place makes the best truffle pizza in the world! Too bad I cannot afford it');
```

Response:

```output
139
```

The following example adds labels to the previous example:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_sentiment',
  'This place makes the best truffle pizza in the world! Too bad I cannot afford it',
  [
    {'label': 'positive'},
    {'label': 'negative'},
    {'label': 'neutral'}
  ]
);
```

Response:

```output
148
```

### AI_SIMILARITY examples

The following SQL statement counts the number of tokens in an AI_SIMILARITY call that uses the default model.

```sqlexample
SELECT AI_COUNT_TOKENS('ai_similarity',
  'The plot is fast and the characters feel real. This book kept me awake all night
  because the mystery is so deep. I love how the author  handles the ending. It is a
  great read for anyone who likes suspense.',
  'The story is quick and the people feel true. This novel kept me awake all night
  because the puzzle is so big. I love how the writer handles the finale. It is a
  solid choice for anyone who enjoys suspense.');
```

Response:

```output
101
```

The following SQL statement counts the number of tokens in an AI_SIMILARITY that uses the `e5-base-v2` model:

```sqlexample
SELECT AI_COUNT_TOKENS('ai_similarity',
  'The plot is fast and the characters feel real. This book kept me awake all night
  because the mystery is so deep. I love how the author handles the ending. It is a
  great read for anyone who likes suspense.',
  'The story is quick and the people feel true. This novel kept me awake all night
  because the puzzle is so big. I love how the writer handles the finale. It is a
  solid choice for anyone who enjoys suspense.', {'model': 'e5-base-v2'})
```

Response:

```output
92
```

### AI_TRANSLATE example

The following SQL statement counts the number of tokens used by AI_TRANSLATE when translating text from English to
German.

```sqlexample
SELECT AI_COUNT_TOKENS('ai_translate',
  'The plot is fast and the characters feel real. This book kept me awake all night
  because the mystery is so deep. I love how the author handles the ending. It is a
  great read for anyone who likes suspense.', 'en', 'de');
```

Response:

```output
51
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: AI_EMBED
source: https://docs.snowflake.com/en/sql-reference/functions/ai_embed.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_EMBED

> **Note:**
>
> AI_EMBED is the updated version of [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](embed_text_1024-snowflake-cortex.md) and [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](embed_text-snowflake-cortex.md).
> For the latest functionality, use AI_EMBED.

Creates an embedding vector from text or an image. Embeddings are abstract numerical representations of the features of
a piece of text or an image that can be used to determine the degree of similarity between pieces of text or images,
which can be used for semantic search, clustering, classification, and other tasks.

## Region availability

The following table shows the regions where you can use the AI_EMBED function for text and images:

| Data type | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) | AWS  (Cross-Region) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Text | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| Image | ✔ | ✔ | ✔ |  |  |  |  | ✔ | ✔ |

## Syntax

```sqlsyntax
AI_EMBED( <model> , <input> )
```

## Arguments

**Required:**

`model`
:   A string specifying the vector embedding model to be used to generate an embedding.

    For text, you can provide the following values:

    * `snowflake-arctic-embed-l-v2.0`
    * `snowflake-arctic-embed-l-v2.0-8k`
    * `nv-embed-qa-4`
    * `multilingual-e5-large`
    * `voyage-multilingual-2`
    * `snowflake-arctic-embed-m-v1.5`
    * `snowflake-arctic-embed-m`
    * `e5-base-v2`

    For images, you can provide only the following value:

    * `voyage-multimodal-3`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`input`
:   The string or image (as a [FILE object](to_file.md)) to generate an embedding from. Images must be:

    * In JPEG, WEBP, PNG, or BMP format
    * No more than 10 MB in size
    * No more than 8,000 x 8,000 pixels

## Returns

An embedding vector of type VECTOR derived from the input text or image.

## Access control requirements

You must use a role that has been granted the SNOWFLAKE.CORTEX_USER database role *or* the SNOWFLAKE.CORTEX_EMBED_USER
database role to call this function. See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on granting one of
these privileges.

## Examples

### Text example

In this example, a vector embedding is generated for the phrase `hello world` using the `snowflake-arctic-embed-l-v2.0` model:

```sqlexample
SELECT AI_EMBED('snowflake-arctic-embed-l-v2.0', 'hello world');
```

### Image example

In this example, a vector embedding is generated for a staged image using the `voyage-multimodal-3` model:

```sqlexample
SELECT AI_EMBED('voyage-multimodal-3',
        TO_FILE ('@my_images', 'CITY_WALKING1.PNG'));
```

## Limitations

* Snowflake AI functions don’t work on FILE objects created from files in the following kinds of stages:

  + Internal stages with encryption mode `TYPE = 'SNOWFLAKE_FULL'`
  + External stages with any customer-side encrypted mode, such as `AWS_CSE` or `AZURE_CSE`.
  + User stage
  + Table stage
  + Stage with double-quoted names

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: AI_EXTRACT
source: https://docs.snowflake.com/en/sql-reference/functions/ai_extract.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_EXTRACT

Extracts information from an input string or file.

## Syntax

**Extract information from an input string:**

```sqlsyntax
AI_EXTRACT( <text>, <responseFormat> )
```

```sqlsyntax
AI_EXTRACT( text => <text>,
            responseFormat => <responseFormat> )
```

**Extract information from a file:**

```sqlsyntax
AI_EXTRACT( <file>, <responseFormat> )
```

```sqlsyntax
AI_EXTRACT( file => <file>,
            responseFormat => <responseFormat>,
            [ config => <config_object> ] )
```

## Arguments

`text`
:   An input string for extraction.

`file`
:   A [FILE](../data-types-unstructured.md) for extraction.

    Supported file formats:

    * PDF
    * PNG
    * PPTX, PPT
    * EML
    * DOC, DOCX
    * JPEG, JPG
    * HTM, HTML
    * TEXT, TXT
    * TIF, TIFF
    * BMP, GIF, WEBP
    * MD

    The files must be less than 100 MB in size.

`responseFormat`
:   Information to be extracted. The format depends on the type of extraction.

    **Entity extraction formats**

    Extract single values by providing one of the following formats:

    * Simple object schema that maps the label and information to be extracted:

      ```output
      {'name': 'What is the last name of the employee?', 'address': 'What is the address of the employee?'}
      ```
    * An array of strings that contain the information to be extracted:

      ```output
      ['What is the last name of the employee?', 'What is the address of the employee?']
      ```
    * An array of arrays that contain two strings (label and the information to be extracted):

      ```output
      [['name', 'What is the last name of the employee?'], ['address', 'What is the address of the employee?']]
      ```
    * A JSON schema with `'type': 'string'` on the sub-object:

      ```output
      {
        'schema': {
          'type': 'object',
          'properties': {
            'title': {
              'description': 'What is the title of the document?',
              'type': 'string'
            }
          }
        }
      }
      ```

    **List extraction format**

    Extract arrays of values using a JSON schema with `'type': 'array'` on the sub-object:

    ```output
    {
      'schema': {
        'type': 'object',
        'properties': {
          'employees': {
            'description': 'What are the names of employees?',
            'type': 'array'
          }
        }
      }
    }
    ```

    **Table extraction format**

    Extract tabular data using a JSON schema with `'type': 'object'` and `column_ordering`. Each column is defined as a
    nested property with `'type': 'array'` and a `description` that matches the column name in the file:

    ```output
    {
      'schema': {
        'type': 'object',
        'properties': {
          'income_table': {
            'description': 'Income for FY2026Q2',
            'type': 'object',
            'column_ordering': ['month', 'income'],
            'properties': {
              'month': {
                'description': 'Month',
                'type': 'array'
              },
              'income': {
                'description': 'Income',
                'type': 'array'
              }
            }
          }
        }
      }
    }
    ```

    > **Note:**
    >
    > * You can’t combine the JSON schema format with other response formats. If `responseFormat` contains the `schema` key,
    >   you must define all questions within the JSON schema. Additional keys are not supported.
    > * The model only accepts certain shapes of JSON schema. Top level type must always be an object, which contains independently extracted sub-objects.
    >   Sub-objects may be a table (object of lists of strings representing columns), a list of strings, or a string.
    >
    >   String is currently the only supported scalar type.
    > * Use the `description` field to provide context to the model; for example, to help the model localize the right table in a document. You can enter the column header name,
    >   or describe the column in other way.
    > * Use the `column_ordering` field to specify the order of all columns in the extracted table. The `column_ordering` field is case-sensitive and must match
    >   the column names defined in the `properties` field. The order should reflect the order of the columns in the document.

`config => config_object`
:   An [OBJECT](../data-types-semistructured.md) value that specifies the configuration settings. You can use an
    [OBJECT constant](../data-types-semistructured.md) to specify this object.

    You can specify the following key-value pairs in this object:

    | Key | Description |
    | --- | --- |
    | `scale_factor` | A numeric value from 1.0 through 4.0. Scales pages of an input file before they are processed by the underlying model, which can enhance OCR quality and improve extraction results.  Use `scale_factor` if you receive unexpected or unclear responses in the following scenarios:   * Documents with page sizes larger than A4 * Documents containing small text, detailed visual elements, or dense layouts * Extracted text contains typos or character-level OCR errors   If omitted, AI_EXTRACT uses the default value (`'scale_factor': 1.0'`). |

## Returns

A JSON object containing the extracted information. The structure of the response depends on the type of extraction.

### Entity extraction

Returns a JSON object with key-value pairs for each extracted entity:

```output
{
  "error": null,
  "response": {
    "title": "Financial report"
  }
}
```

### List extraction

Returns a JSON object with arrays of extracted values:

```output
{
  "error": null,
  "response": {
    "employees": [
      "Smith",
      "Johnson",
      "Doe"
    ]
  }
}
```

### Table extraction

Returns a JSON object with column arrays representing the extracted table:

```output
{
  "error": null,
  "response": {
    "income_table": {
      "income": ["$120 678","$130 123","$150 998"],
      "month": ["February", "March", "April"]
    }
  }
}
```

### Combined extraction

When extracting entities, lists, and tables in a single call, the response contains all extraction types:

```output
{
  "error": null,
  "response": {
    "employees": [
      "Smith",
      "Johnson",
      "Doe"
    ],
    "income_table": {
      "income": ["$120 678","$130 123","$150 998"],
      "month": ["February", "March", "April"]
    },
    "title": "Financial report"
  }
}
```

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
For information about granting this privilege, see [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md).

## Usage notes

* AI_EXTRACT is optimized for documents both digital-born and scanned.
* You can’t use both `text` and `file` parameters simultaneously in the same function call.
* You can either ask questions in natural language or describe information to be extracted (such as city, street, ZIP code); for example:

  > ```output
  > ['address': 'City, street, ZIP', 'name': 'First and last name']
  > ```
* The following languages are supported:

  + Arabic
  + Bengali
  + Burmese
  + Cebuano
  + Chinese
  + Czech
  + Dutch
  + English
  + French
  + German
  + Hebrew
  + Hindi
  + Indonesian
  + Italian
  + Japanese
  + Khmer
  + Korean
  + Lao
  + Malay
  + Persian
  + Polish
  + Portuguese
  + Russian
  + Spanish
  + Tagalog
  + Thai
  + Turkish
  + Urdu
  + Vietnamese
* The documents must be no more than 125 pages long.
* In a single AI_EXTRACT call, you can ask a maximum of 100 questions for entity extraction, and a maximum of 10 questions for table extraction.

  A table extraction question is equal to 10 entity extraction questions. For example, you can ask 4 table extraction questions and
  60 entity extraction questions in a single AI_EXTRACT call.
* The maximum output length for entity extraction is 512 tokens per question. For table extraction, the model returns answers that are a maximum of 4096 tokens.
* Client-side encrypted stages are not supported.
* Confidence scores are not supported.

## Cost considerations

* The Cortex AI_EXTRACT function incurs compute cost based on the number of pages per document, input prompt tokens, and
  output tokens processed.

  + For paged file formats (PDF, DOCX, TIF, TIFF), each page is counted as 970 tokens.
  + For image file formats (JPEG, JPG, PNG), each individual image file is billed as a page and counted as 970 tokens.
* Using the `scale_factor` parameter changes how many tokens are consumed and how many pages can be processed per call:

  + The number of input tokens consumed increases proportionally with `scale_factor`.
  + The maximum number of pages per document that can be processed by AI_EXTRACT decreases by `scale_factor`.

  **Relationship of scale_factor to number of tokens and pages**

  > | `scale_factor` value | Token count per page | Max. number of pages per document |
  > | --- | --- | --- |
  > | 2 | 970 \* 2 = 1940 tokens | 125/2 = 62.5 (rounded down to 62) |
  > | 2.5 | 970 \* 2.5 = 2425 tokens | 125/2.5 = 50 |
  > | 4 | 970 \* 4 = 3880 tokens | 125/4 = 31.25 (rounded down to 31) |
* Snowflake recommends executing queries that call the Cortex AI_EXTRACT function in a smaller warehouse (no larger than
  MEDIUM). Larger warehouses don’t increase performance.

## Regional availability

AI_EXTRACT is available to accounts in the following regions:

| Cloud platform | Region name |
| --- | --- |
| Amazon Web Services (AWS) | * US East (N. Virginia) * US West (Oregon) * Canada (Central) * South America (Sao Paulo) * EU (Ireland) * EU (Frankfurt) * Asia Pacific (Tokyo) * Asia Pacific (Sydney) |
| Microsoft Azure | * East US 2 (Virginia) * West US 2 (Washington) * South Central US (Texas) * North Europe (Ireland) * West Europe (Netherlands) * Southeast Asia (Singapore) * Australia East (New South Wales) * Central India (Pune) * Japan East (Tokyo) |

AI_EXTRACT has cross-region support. For information on enabling Cortex AI cross-region support,
see [Cross-region inference](../../user-guide/snowflake-cortex/cross-region-inference.md).

## Error conditions

AI_EXTRACT can produce the following error messages:

| Message | Explanation |
| --- | --- |
| `Internal error.` | A system error occurred. Wait and try again. If the error persists, contact Snowflake support. |
| `Not found.` | The file was not found. |
| `Provided file cannot be found.` | The file was not found. |
| `Provided file cannot be accessed.` | The current user does not have sufficient privileges too access the file. |
| `The provided file format {file_extension} isn't supported.` | The document is not in a supported format. |
| `The provided file isn't in the expected format or is client-side encrypted or is corrupted.` | The document is not stored in a stage with server-side encryption. |
| `Empty request.` | No parameters were provided. |
| `Missing or empty response format.` | No response format was provided. |
| `Invalid response format.` | The response format is not valid JSON. |
| `Duplicate feature name found: {feature_name}.` | The response format contains one or more duplicate feature names. |
| `Too many questions: {number} complex and {number} simple = {number} total, complex question weight {number}`. | The number of questions exceeds the allowed limit. |
| `Maximum number of 125 pages exceeded. The document has {actual_pages} pages.` | The document exceeds the 125-page limit. |
| `Page size in pixels exceeds 10000x10000. The page size is {actual_px} pixels.` | Image input or a converted document page is larger than the supported dimensions. |
| `Page size in inches exceeds 50x50 (3600x3600 pt). The page size is {actual_in} inches ({actual_pt} pt).` | Page is larger than the supported dimensions. |
| `Maximum file size of 104857600 bytes exceeded. The file size is {actual_size} bytes.` | The document is larger than 100 MB. |

## Examples

### Entity extraction

* The following example extracts entities from the input text using a simple object schema:

  ```sqlexample
  SELECT AI_EXTRACT(
    text => 'John Smith lives in San Francisco and works for Snowflake',
    responseFormat => {'name': 'What is the first name of the employee?', 'city': 'What is the address of the employee?'}
  );
  ```
* The following example extracts and parses entities from the input text:

  ```sqlexample
  SELECT AI_EXTRACT(
    text => 'John Smith lives in San Francisco and works for Snowflake',
    responseFormat => PARSE_JSON('{"name": "What is the first name of the employee?", "address": "What is the address of the employee?"}')
  );
  ```
* The following example extracts entities from the `document.pdf` file:

  ```sqlexample
  SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files','document.pdf'),
    responseFormat => [['name', 'What is the first name of the employee?'], ['city', 'Where does the employee live?']]
  );
  ```
* The following example extracts entities from all files in a directory on a stage:

  > **Note:**
  >
  > Ensure that the directory table is enabled. For more information, see [Manage directory tables](../../user-guide/data-load-dirtables-manage.md).

  ```sqlexample
  SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files', relative_path),
    responseFormat => [
      'What is the document ID?',
      'What is the address of the company?'
    ]
  ) FROM DIRECTORY (@db.schema.files);
  ```
* The following example extracts the `title` entity from the `report.pdf` file using a JSON schema:

  ```sqlexample
  SELECT AI_EXTRACT(
    file => TO_FILE('@db.schema.files', 'report.pdf'),
    responseFormat => {
      'schema': {
        'type': 'object',
        'properties': {
          'title': {
            'description': 'What is the title of document?',
            'type': 'string'
          }
        }
      }
    }
  );
  ```

### List extraction

The following example extracts the `employees` list from the `report.pdf` file:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.files', 'report.pdf'),
  responseFormat => {
    'schema': {
      'type': 'object',
      'properties': {
        'employees': {
          'description': 'What are the surnames of employees?',
          'type': 'array'
        }
      }
    }
  }
);
```

### Table extraction

The following example extracts the `income_table` table from the `report.pdf` file:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.files', 'report.pdf'),
  responseFormat => {
    'schema': {
      'type': 'object',
      'properties': {
        'income_table': {
          'description': 'Income for FY2026Q2',
          'type': 'object',
          'column_ordering': ['month', 'income'],
          'properties': {
            'month': {
              'description': 'Month',
              'type': 'array'
            },
            'income': {
              'description': 'Income',
              'type': 'array'
            }
          }
        }
      }
    }
  }
);
```

### Combined extraction

The following example extracts a table (`income_table`), entity (`title`), and list (`employees`) from the `report.pdf`
file in a single call:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.files', 'report.pdf'),
  responseFormat => {
    'schema': {
      'type': 'object',
      'properties': {
        'income_table': {
          'description': 'Income for FY2026Q2',
          'type': 'object',
          'column_ordering': ['month', 'income'],
          'properties': {
            'month': {
              'description': 'Month',
              'type': 'array'
            },
            'income': {
              'description': 'Income',
              'type': 'array'
            }
          }
        },
        'title': {
          'description': 'What is the title of document?',
          'type': 'string'
        },
        'employees': {
          'description': 'What are the surnames of employees?',
          'type': 'array'
        }
      }
    }
  }
);
```

### Extraction with a custom scale factor

The following example extracts the `employees` array from the `report.pdf` file using a scale factor of 2.0:

```sqlexample
SELECT AI_EXTRACT(
  file => TO_FILE('@db.schema.files', 'report.pdf'),
  responseFormat => {
    'schema': {
      'type': 'object',
      'properties': {
        'employees': {
          'description': 'What are the surnames of employees?',
          'type': 'array'
        }
      }
    }
  },
  config => {'scale_factor': 2.0}
);
```

### Extraction using a fine-tuned `arctic-extract` model

To use the fine-tuned `arctic-extract` model for inference with the AI_EXTRACT function,
specify the model using the `model` parameter as shown in the following example:

```sqlexample
SELECT AI_EXTRACT(
  model => 'db.schema.my_tuned_model',
  file => TO_FILE('@db.schema.files','document.pdf')
);
```

You can overwrite questions used for fine-tuning by using the `responseFormat` parameter as shown in the following example:

```sqlexample
SELECT AI_EXTRACT(
  model => 'db.schema.my_tuned_model',
  file => TO_FILE('@db.schema.files','document.pdf'),
  responseFormat => [['name', 'What is the first name of the employee?'], ['city', 'Where does the employee live?']]
);
```

The following example extracts data from the `invoice.pdf` file, using a fine-tuned `arctic-extract` model and a scale factor of 2.0:

```sqlexample
SELECT AI_EXTRACT(
  model => 'db.schema.my_tuned_model',
  file => TO_FILE('@db.schema.files', 'invoice.pdf'),
  config => {'scale_factor': 2.0}
);
```

For more information, see [Fine-tuning arctic-extract models](../../user-guide/snowflake-cortex/arctic-extract-finetuning.md).

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md) for legal notices.

---
title: AI_EXTRACT (Document AI legacy models)
source: https://docs.snowflake.com/en/sql-reference/functions/ai_extract-document-ai.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_EXTRACT (Document AI legacy models)

Extracts information from a file using a legacy Document AI model.

## Syntax

```sqlsyntax
AI_EXTRACT ( model => <model> ,
            file => <file> )
```

## Arguments

`model => model`
:   Specifies the Document AI Arctic-TILT model for extraction stored in the Snowflake Model Registry; for example, `my_db.my_schema.my_model`.

`file => file`
:   A [FILE](../data-types-unstructured.md) for extraction.

## Returns

### Entity extraction

```output
{
  "error": null,
  "response": {
    "invoice_items": [
      "NEW CRUSHED VELVET DIVAN BED",
      "Vintage Radiator",
      "Solid Wooden Worktop",
      "Sienna Crushed Velvet Curtains"
    ],
    "invoice_number": "123/20",
    "tax_amount": "77.57",
    "total_amount": "465.43 GBP",
    "vendor_name": "UK Exports & Imports Ltd"
  }
}
```

### Table extraction

```output
{
  "error": null,
  "response": {
    "table1": {
      "gross": ["10", "31", "10"],
      "item": ["apples", "banana", "pear"],
      "net": ["9", "30", "10"],
      "tax": ["1", "1", ""]
    },
    "table2": {
      "name": ["John", "Ana", "Lisa"],
      "surname": ["Smith", "Nixon", "Gonzales"]
    }
  }
}
```

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
For information about granting this privilege, see [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md).

Additionally, you must have the OWNERSHIP privilege on the model.

## Usage notes

* The model must be in the [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md).
* The Document AI model should not have more than 100 entities.
* If not set explicitly, the latest available model version is used by default (the version set when the model was published or
  trained in the Document AI UI). To set the default version of a model, use the [ALTER MODEL](../sql/alter-model.md) command
  as shown in the following example:

  ```sqlexample
  ALTER MODEL my_model SET DEFAULT_VERSION = new_version;
  ```
* Confidence scores are not supported.
* AI_EXTRACT uses token-based billing. For more information on the AI_EXTRACT cost for Document AI legacy models, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

  + Entity extraction cost is labeled as `arctic-tilt-entity`.
  + Table extraction cost is labeled as `arctic-tilt-table`.

## Regional availability

The following regions are available:

* AWS Canada (Central)
* AWS EU (Frankfurt)
* AWS EU (Ireland)
* AWS US East (N. Virginia)
* AWS US East (Ohio)
* AWS US West (Oregon)
* Azure Australia East (New South Wales)
* Azure East US 2 (Virginia)
* Azure Southeast Asia (Singapore)
* Azure West Europe (Netherlands)
* Azure West US 2 (Washington)

If your region is not listed, use [cross-region inference](../../user-guide/snowflake-cortex/cross-region-inference.md).

## Examples

The following example extracts the features defined in the Document AI model:

```sqlexample
SELECT AI_EXTRACT(
  model => 'my_db.my_schema.my_model',
  file => TO_FILE('@files_db.files_schema.files', 'agreement.pdf')
);
```

The following example extracts information from all files in a directory on a stage:

```sqlexample
SELECT AI_EXTRACT(
  model => 'my_db.my_schema.my_model',
  file => TO_FILE('@db.schema.files', relative_path)
) FROM DIRECTORY (@db.schema.files);
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: AI_FILTER
source: https://docs.snowflake.com/en/sql-reference/functions/ai_filter.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_FILTER

Classifies free-form prompt inputs into a boolean. Currently supports both text and image filtering.

## Region availability

The following table shows the regions where you can use the AI_FILTER function for both text and images:

| Data type | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) | AWS  (Cross-Region) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| TEXT | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ | ✔ |
| IMAGE | ✔ | ✔ | ✔ |  |  |  |  |  | ✔ |

## Syntax

Applying AI_FILTER to an input string:

```sqlsyntax
AI_FILTER( <input> [, <return_error_details> ] )
```

Applying AI_FILTER to single image:

```sqlsyntax
AI_FILTER( <predicate> , <input> [, <return_error_details> ] )
```

Applying AI_FILTER to multiple columns with both text and images, leveraging the [PROMPT](prompt.md):

```sqlsyntax
AI_FILTER( PROMPT('<template_string>',  <col_1>, … ) [, <return_error_details> ] )
```

## Arguments

**Required:**

**If you’re specifying an input string:**

`input`
:   A string containing the text to be classified.

**If you’re filtering on one file:**

`predicate`
:   A string containing the instructions to classify the file input as either `TRUE` or `FALSE`.

`file`
:   The column that the file is classified by based on the instructions specified in `predicate`.
    You can use IMAGE FILE as an input to the AI_FILTER function.

**If you’re using the PROMPT() function to format the inputs:**

For more complicated prompts, especially with multiple file columns, you can use the [PROMPT](prompt.md) to help with creating an `input`.

The PROMPT() function supports formatting across both strings and FILE datatypes. For detailed usage, see Examples.

**Optional:**

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

Returns a Boolean value that indicates whether the statement evaluates to TRUE or FALSE for the specified text.

## Error behavior

By default, if AI_FILTER can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: A BOOLEAN value indicating the filter result, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Performance and cost optimization

By default, AI_FILTER includes a built-in performance optimization on qualifying queries. This optimization can provide 2 to 10 times faster performance and up to 60% lower token usage with a minimal impact on quality.

This optimization is triggered automatically when the query engine detects a suitable pattern. Similar to other query optimizations, Snowflake doesn’t guarantee that this optimization will be applied for every query. The engine leverages adaptive routing and context-aware rewriting to execute more efficient AI operations where possible.

> To disable this optimization for your account, contact your account manager.

## Usage notes

For optimal performance, follow these guidelines:

* Make sure the columns sent into AI_FILTER don’t contain NULL values.
* Use plain text in English for the input string or for PROMPT() arguments.
* Provide details for the input text instruction. For example, instead of a statement like “sounds satisfied”, use “In the following support transcript, the customer sounds satisfied”.
* Consider phrasing the input in the form of a question. For example, “In the following support transcript, does the customer sound satisfied?”

## Examples

### AI_FILTER: Text

Can be called as a simple scalar Boolean function on string constants.

```sqlexample
SELECT AI_FILTER('Is Canada in North America?');
```

```output
TRUE
```

You can [CONCAT , ||](concat.md) instructions with text columns to use this function:

```sqlexample
WITH reviews AS (
            SELECT 'Wow... Loved this place.' AS review
  UNION ALL SELECT 'The pizza is not good.'
)
SELECT * FROM reviews
WHERE AI_FILTER(CONCAT('The reviewer enjoyed the restaurant: ', review));
```

For easier templated formatting across multiple columns, Snowflake provides [PROMPT](prompt.md); for example:

```sqlexample
WITH reviews AS (

SELECT 'Wow... Loved this place.' AS review
UNION ALL SELECT 'The pizza is not good.'
)
SELECT * FROM reviews
WHERE AI_FILTER(PROMPT('The reviewer enjoyed the restaurant: {0}', review));
```

```output
+--------------------------+
| REVIEW                   |
|--------------------------+
| Wow... Loved this place. |
+--------------------------+
```

While evaluating the quality of AI_FILTER, it can be helpful to compare candidate predicates across columns.

```sqlexample
WITH country AS (
          SELECT 'Switzerland' AS country,
UNION ALL SELECT 'Korea'
),
region AS (
            SELECT 'Asia' AS region,
  UNION ALL SELECT 'Europe'
)
SELECT country,
      region,
      AI_FILTER(PROMPT('{0} is in {1}', country, region)) AS result
FROM country CROSS JOIN region ;
```

```output
+-------------+-------+--------+
| COUNTRY     |REGION | RESULT |
|-------------+-------+--------+
| Switzerland |Europe | TRUE   |
|-------------+-------+--------+
| Switzerland | Asia  | FALSE  |
|-------------+-------+--------+
| Korea       |Europe | FALSE  |
+-------------+-------+--------+
| Korea       | Asia  | TRUE   |
+-------------+-------+--------+
```

### Using AI_FILTER with a JOIN

You can use AI_FILTER with a JOIN to express linking two tables with a natural language prompt that AI can reason on.

The following example joins the RESUMES table with the JOBS table using a prompt with the AI_FILTER function.

```sqlexample
SELECT *
FROM RESUMES
JOIN JOBS
ON AI_FILTER(PROMPT('Evaluate if this resume {0} fits this job description {1}', RESUME.contents, JOBS.jd));
```

### AI_FILTER: Images

The following examples filter image files based on an instruction.

Filter images by providing an instruction predicate and the image file column:

```sqlexample
WITH pictures AS (
  SELECT
      TO_FILE(file_url) AS img
  FROM DIRECTORY(@file_stage)
)
SELECT
FL_GET_RELATIVE_PATH(img) AS file_path FROM pictures
WHERE AI_FILTER('Is this a picture of a cat?', img);
```

```sqlexample
WITH pictures AS (
  SELECT
      TO_FILE(file_url) AS img
  FROM DIRECTORY(@file_stage)
)
SELECT
    FL_GET_RELATIVE_PATH(img) AS file_path FROM pictures
WHERE AI_FILTER(PROMPT('{0} is a cat picture', img));
```

```output
+--------------------------+
|        FILE_PATH         |
|--------------------------+
|        2cats.jpg         |
+--------------------------+
|        cat1.png          |
+--------------------------+
|      orange_cat.jpg      |
+--------------------------+
```

## Limitations

* Snowflake AI functions don’t work on FILEs created from stage files from the following stage types:

  + Internal stages with encryption mode `TYPE = 'SNOWFLAKE_FULL'`
  + External stages with any customer-side encrypted mode:

    - `TYPE = 'AWS_CSE'`
    - `TYPE = 'AZURE_CSE'`
  + User stage, table stage
  + Stage with double-quoted names

---
title: AI_PARSE_DOCUMENT
source: https://docs.snowflake.com/en/sql-reference/functions/ai_parse_document.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_PARSE_DOCUMENT

> **Note:**
>
> AI_PARSE_DOCUMENT is the updated version of [PARSE_DOCUMENT (SNOWFLAKE.CORTEX)](parse_document-snowflake-cortex.md).
> For the latest functionality, use AI_PARSE_DOCUMENT.

Returns the extracted content from a document on a Snowflake stage as a JSON-formatted string. This
function supports two types of extraction: Optical Character Recognition (OCR) and layout. For more
information, see [Parsing documents with AI_PARSE_DOCUMENT](../../user-guide/snowflake-cortex/parse-document.md).

## Syntax

```sqlsyntax
AI_PARSE_DOCUMENT( <file_object> [, <options> ] [, <return_error_details> ] )
```

## Arguments

**Required:**

`file_object`
:   A [FILE](../data-types-unstructured.md) object that specifies the document to parse, stored in a Snowflake stage. For
    information about creating file objects, see [TO_FILE](to_file.md).

**Optional:**

`options`
:   An OBJECT value that contains options for parsing documents. The supported keys are shown below. All are optional.

    * `'extract_images'`: If set to TRUE, the function extracts images embedded in the document. Requires LAYOUT mode.
    * `'mode'`: Specifies the parsing mode. The supported modes are:

      + `'OCR'`: The function extracts text only. This is the default mode.
      + `'LAYOUT'`: The function extracts layout as well as text, including structural content such as tables.
    * `'page_split'`: If set to TRUE, the function splits the document into pages and processes each page
      separately. This feature supports only PDF, PowerPoint (`.pptx`), and Word (`.docx`) documents.
      Documents in other formats return an error. The default is FALSE.

      > **Tip:**
      >
      > To process long documents that exceed the token limit of AI_PARSE_DOCUMENT, set this option to TRUE.
    * `'page_filter'`: An array that specifies ranges of pages of a multi-page document to process. Each
      range is an object with `start` and `end` fields that specify the first (inclusive) and last (exclusive) page in
      the range. Page indexes start at 0. For example, `{'start': 0, 'end': 1}` specifies the first page of the
      document.

      > **Note:**
      >
      > Specifying `page_filter` implies `page_split`. If you specify page ranges, it is not necessary to also set
      > `page_split`.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains `value`, `error`, and `metadata` fields. The `value` field contains the parsed document
    data, the `error` field contains the error message (or NULL on success), and `metadata` is a top-level field
    rather than a subfield of the parsed output. See Error behavior for details.

## Returns

A JSON object (as a string) that contains the extracted data and associated metadata. The `options` argument
determines the structure of the returned object.

> **Tip:**
>
> To use the output in SQL, convert it to an OBJECT value using the [PARSE_JSON](parse_json.md) function.

If the `'page_split'` option is set, the output has the following structure:

> * `"pages"`: An array of JSON objects, each containing text extracted from the document. If the document has only
>   one page, the output still contains a `"pages"` array (which contains a single object). Each page has the following fields:
>
>   > + `"content"`: Plain text (in OCR mode) or Markdown-formatted text (in LAYOUT mode).
>   > + `"index"`: The page index in the file, starting at 0. Page numbers and formats specified in the document are ignored.
>
> > * `"metadata"`: Contains metadata about the document, such as page count.

If `'page_split'` is FALSE or is not present, the output has the following structure:

> * `"content"`: Plain text (in OCR mode) or Markdown-formatted text (in LAYOUT mode).
> * `"metadata"`: Contains metadata about the document, such as page count.

If the `"extract_images"` option is set to TRUE, the output includes an additional field:

> * `"images"`: An array of JSON objects, each representing an extracted image. Each image object has the following fields:
>
>   + `"id"`: A unique identifier for the image.
>   + `"top_left_x"`, `"top_left_y"`, `"bottom_right_x"`, `"bottom_right_y"`: The coordinates of the bounding box of the image on the page.
>   + `"image_base64"`: The extracted image data encoded as a base64 string.

## Error behavior

By default, if AI_PARSE_DOCUMENT can’t process the input, the function returns NULL. If the query processes multiple
rows, rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value`, `error`, and `metadata` fields | `value`: An OBJECT containing the parsed document data, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. The `error` field inside `value` (renamed from `errorInformation`) contains per-document error details when present.    `metadata`: An OBJECT containing document metadata such as page count. This field is at the top level rather than inside the parsed output. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Examples

For examples, see [AI_PARSE_DOCUMENT examples](../../user-guide/snowflake-cortex/parse-document.md).

---
title: AI_REDACT
source: https://docs.snowflake.com/en/sql-reference/functions/ai_redact.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_REDACT

Detects and redacts personally identifiable information (PII) from unstructured text data.

## Syntax

Use AI_REDACT to detect and redact PII:

```sqlsyntax
AI_REDACT( <input> [, <categories> ] [, <return_error_details> ] [, <mode> ] )
```

## Arguments

**Required:**

`input`
:   A VARCHAR value that contains text data that may contain personally identifiable information (PII).

**Optional:**

`categories`
:   An ARRAY of string values that specify the types of PII to be redacted. If not specified, all supported PII
    categories are redacted. See [Detected PII categories](../../user-guide/snowflake-cortex/redact-pii.md) for a list of supported categories.

    Passing an unsupported category results in an error.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed.

    Requires the session parameter AI_SQL_ERROR_HANDLING_USE_FAIL_ON_ERROR to be set to FALSE.

`mode`
:   A VARCHAR value that specifies the operating mode. Accepted values:

    * `redact` (default): Replaces detected PII with category placeholders, such as [NAME] and [ADDRESS].
    * `detect`: Returns an OBJECT that contains a `spans` array that identifies the location and category of each detected PII instance
      without redacting the text.

> **Note:**
>
> The `mode` argument is case insensitive.

## Returns

The return value of AI_REDACT depends on the `mode` argument.

### Redact mode (default)

Returns a VARCHAR that contains the input text with PII replaced by category placeholders, such as `[NAME]` where the input
text was “John Smith”.

### Detect mode

Returns an OBJECT that contains a `spans` array. Each element in the array is an OBJECT with the following fields:

> * `category`: A VARCHAR value that identifies the PII category (for example, `NAME` or `ADDRESS`).
> * `start`: A NUMBER value that identifies the start index of the PII in the input text.
> * `end`: A NUMBER value that identifies the end index of the PII in the input text.
> * `text`: A VARCHAR value that contains the matched PII text.

## Error behavior

By default, if AI_REDACT cannot process the input, the function returns an error. If the query processes multiple rows, the entire query fails.

When AI_SQL_ERROR_HANDLING_USE_FAIL_ON_ERROR is set to FALSE, the return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: A VARCHAR value that contains the redacted text, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about handling errors, see [Handle row-level errors in multi-row queries](../../user-guide/snowflake-cortex/redact-pii.md).

## Usage notes

* For categories of PII that AI_REDACT can redact, see [Detected PII categories](../../user-guide/snowflake-cortex/redact-pii.md).
* For limitations in the current version of AI_REDACT, see [Limitations](../../user-guide/snowflake-cortex/redact-pii.md).

## Examples

See [Redaction examples](../../user-guide/snowflake-cortex/redact-pii.md).

---
title: AI_SENTIMENT
source: https://docs.snowflake.com/en/sql-reference/functions/ai_sentiment.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_SENTIMENT

> **Note:**
>
> AI_SENTIMENT is the updated version of [ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)](entity_sentiment-snowflake-cortex.md).
> For the latest functionality, use AI_SENTIMENT.

Returns overall and category [sentiment](../../user-guide/snowflake-cortex/ai-sentiment.md) in the given input text.

## Syntax

```sqlsyntax
AI_SENTIMENT( <text> [ , <categories> ] [, <return_error_details> ] )
```

## Arguments

**Required:**

`text`
:   A string containing the text in which sentiment is detected.

**Optional:**

`categories`
:   An array containing up to ten categories (also called entities or aspects) for which sentiment should be extracted. Each category is a
    string. For example, if extracting sentiment from a restaurant review, you might specify
    `['cost', 'quality', 'service', 'wait time']` as the categories. Each category may be a maximum of 30 characters long.

    If you do not provide this argument, AI_SENTIMENT returns only the overall sentiment.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

An OBJECT value containing a `categories` field. `categories` is an array of category records. Each category includes these fields:

* `name`: The name of the category. The category names match the categories specified in the `categories` argument.
* `sentiment`: The sentiment of the category. Each sentiment result is one of the following strings.

  + `unknown`: The category was not mentioned in the text.
  + `positive`: The category was mentioned positively in the text.
  + `negative`: The category was mentioned negatively in the text.
  + `neutral`: The category was mentioned in the text, but neither positively nor negatively.
  + `mixed`: The category was mentioned both positively and negatively in the text.

The `overall` category record is always included and contains the overall sentiment of the text.

Example:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "mixed"
    },
    {
      "name": "Brand",
      "sentiment": "unknown"
    },
    {
      "name": "Cost",
      "sentiment": "negative"
    },
    {
      "name": "Professionalism",
      "sentiment": "unknown"
    }
  ]
}
```

## Error behavior

By default, if AI_SENTIMENT can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: An OBJECT containing the sentiment analysis result, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this role.

## Usage notes

AI_SENTIMENT can analyze sentiment in English, French, German, Hindi, Italian, Spanish, and Portuguese. You can specify
categories in the language of the text or in English.

## Examples

The following example uses AI_SENTIMENT to get the overall sentiment of a food service review.

```sqlexample
SELECT AI_SENTIMENT('A tourist\'s delight, in low urban light,
    Recommended gem, a pizza night sight. Swift arrival, a pleasure so right,
    Yet, pockets felt lighter, a slight pricey bite. 💰🍕🚀');
```

Return value:

```output
{
  "categories": [
    {
      "name": "overall",
      "sentiment": "positive"
    }
  ]
}
```

In this example, a table named `reviews` contains a column named `review_content` containing the text of movie reviews
submitted by users. The query returns the sentiment of several facets of up to ten reviews.

```sqlexample
SELECT
  AI_SENTIMENT(
    review_content,
    ['concept', 'performance', 'script', 'cinematography', 'soundtrack']
  ),
  review_content
  FROM reviews LIMIT 10;
```

## Regional availability

AI_SENTIMENT is available in the following regions:

| Function  (Model) | AWS US West 2  (Oregon) | AWS US East 1  (N. Virginia) | AWS Europe Central 1  (Frankfurt) | AWS Europe West 1  (Ireland) | AWS AP Southeast 2  (Sydney) | AWS AP Northeast 1  (Tokyo) | Azure East US 2  (Virginia) | Azure West Europe  (Netherlands) | AWS  (Cross-Region) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| AI_SENTIMENT | ✔ | ✔ | ✔ |  |  | ✔ | ✔ | ✔ | ✔ |

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: AI_SIMILARITY
source: https://docs.snowflake.com/en/sql-reference/functions/ai_similarity.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_SIMILARITY

Computes a similarity score based on the vector cosine similarity value of the inputs’ embedding vectors. Currently supports both text and image similarity computation.

## Syntax

Applying AI_SIMILARITY to string or image inputs:

```sqlsyntax
AI_SIMILARITY( <input1>, <input2> )
```

Specifying the config object:

```sqlsyntax
AI_SIMILARITY( <input1>, <input2>, <config_object> )
```

## Arguments

**Required:**

If you’re specifying input strings:

`input1`, `input2`
:   The strings with the text that you’re comparing and using to compute the similarity score.

If you’re specifying input images:

`input1`, `input2`
:   [FILE data type](../../user-guide/unstructured-intro.md) referencing the images to be compared.

> **Note:**
>
> AI_SIMILARITY does not support computing the similarity between text and image inputs.

**Optional:**

`config_object`
:   An [OBJECT](../data-types-semistructured.md) containing key-value pairs used to configure the model.

| Key | Type | Default | Description |
| --- | --- | --- | --- |
| `model` | [STRING](../data-types-text.md) | For STRING input, default to `'snowflake-arctic-embed-l-v2.0'`. For IMAGE input, default to `'voyage-multimodal-3'` | The embedding model used for embedding. Supported values are:   * `'snowflake-arctic-embed-l-v2.0'` * `'nv-embed-qa-4'` * `'multilingual-e5-large'` * `'voyage-multilingual-2'` * `'snowflake-arctic-embed-m-v1.5'` * `'snowflake-arctic-embed-m'` * `'e5-base-v2'` * `'voyage-multimodal-3'` (IMAGE) |

## Returns

Returns a float value of range -1 to 1 that represents the similarity score computed using vector similarity between two embedding vectors for the inputs.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Examples

### AI_SIMILARITY: Text

In this example, the function is computing a similarity score between the two statement inputs `'I like this dish'` and `'This dish is very good'`.

```sqlexample
SELECT AI_SIMILARITY('I like this dish', 'This dish is very good');
```

We can also compute similarity on text columns.

```sqlexample
SELECT
    review
FROM restaurant_reviews
ORDER BY AI_SIMILARITY(review, 'I love the food here!');
```

### AI_SIMILARITY: Images

In this example, the function computes a similarity score between the two images, `cat.jpg` and `2cats.jpg`, stored in a Snowflake stage `@file_stage`.

```sqlexample
SELECT AI_SIMILARITY(TO_FILE('@file_stage', 'cat.jpg'), TO_FILE('@file_stage', '2cats.jpg'));
```

We can also compute similarity among the images using Snowflake Directory Table for the stage containing the images.

```sqlexample
SELECT
    to_file('@file_stage', relative_path)
FROM directory(@file_stage)
WHERE AI_SIMILARITY(f, to_file(@file_stage, 'cat.jpg')) >= 0.5;
```

## Limitations

* Snowflake AI functions don’t work on FILEs created from stage files from the following stage types:

  + Internal stages with encryption mode `TYPE = 'SNOWFLAKE_FULL'`
  + External stages with any customer-side encrypted mode:

    - `TYPE = 'AWS_CSE'`
    - `TYPE = 'AZURE_CSE'`
  + User stage, table stage
  + Stage with double-quoted names

## Billing

`AI_SIMILARITY` is currently billed under the `AI_EMBED` line item in SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY view.

---
title: AI_SUMMARIZE_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/ai_summarize_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)
    [String & binary functions](../functions-string.md) (AI Functions)

# AI_SUMMARIZE_AGG

Summarizes a column of text data.

For example, `AI_SUMMARIZE_AGG(churn_reason)` will return a summary of the `churn_reason` column.

Unlike [AI_COMPLETE](ai_complete.md) and [SUMMARIZE (SNOWFLAKE.CORTEX)](summarize-snowflake-cortex.md), this function supports datasets larger than the maximum language model context window.

See also:
:   [AI_AGG](ai_agg.md)

## Syntax

```sqlsyntax
AI_SUMMARIZE_AGG( <expr> )
```

## Arguments

**Required:**

`expr`
:   This is an expression that contains text for summarization, such as restaurant reviews or phone transcripts.

## Returns

Returns a string summary of the expression.

## Usage notes

This function provides a general purpose summary. For a more specific summary, use [AI_AGG](ai_agg.md).

## Examples

AI_SUMMARIZE_AGG can be used as a simple scalar function on string constants.

```sqlexample
SELECT AI_SUMMARIZE_AGG('The restaurant was excellent. I especially enjoyed the pizza and ice cream. My grandma didnt like it though.');
```

```output
The restaurant received mixed reviews from our group. While I thoroughly enjoyed the pizza and ice cream, my grandma did not have a positive experience.
```

AI_SUMMARIZE_AGG can be used on a column of data.

```sqlexample
WITH reviews AS (
            SELECT 'The restaurant was excellent.' AS review
  UNION ALL SELECT 'Excellent! I loved the pizza!'
  UNION ALL SELECT 'It was great, but the service was meh.'
  UNION ALL SELECT 'Mediocre food and mediocre service'
)
SELECT AI_SUMMARIZE_AGG(review)
  FROM reviews;
```

```output
The restaurant received mixed reviews. Some customers had a great experience, enjoying the pizza and finding the restaurant excellent. However, others had a more neutral experience, describing the food and service as mediocre, with one customer specifically mentioning that the service was subpar.
```

AI_SUMMARIZE_AGG can be used on multiple columns of data using `CONCAT` or the `||` operator.

```sqlexample
WITH reviews AS (
            SELECT 'The restaurant was excellent.' AS review, 'Pizza' AS menu_item
  UNION ALL SELECT 'Excellent! I loved the pizza!', 'Pizza'
  UNION ALL SELECT 'It was great, but the service was meh.', 'Burger'
  UNION ALL SELECT 'Mediocre food and mediocre service', 'Pancakes'
)
SELECT AI_SUMMARIZE_AGG('Menu Item: ' || menu_item || '\nReview: ' || review)
  FROM reviews;
```

```output
The restaurant received positive reviews for its pizza, with one reviewer describing it as "excellent" and another stating they "loved" it. In contrast, the burger received a mixed review, with the food being "great" but the service being "meh." The pancakes were rated as "mediocre" in terms of both food and service. Overall, the restaurant's performance varied depending on the menu item, with pizza being a highlight.
```

AI_SUMMARIZE_AGG can also be used in combination with GROUP BY.

```sqlexample
WITH reviews AS (
            SELECT 1 AS product_id, 'The restaurant was excellent.' AS review
  UNION ALL SELECT 1, 'Excellent! I loved the pizza!'
  UNION ALL SELECT 1, 'It was great, but the service was meh.'
  UNION ALL SELECT 1, 'Mediocre food and mediocre service'
  UNION ALL SELECT 2, 'Terrible quality ingredients, I should have eaten at home.'
  UNION ALL SELECT 2, 'Bad restaurant, I would avoid this place.'
)
SELECT product_id,
       AI_SUMMARIZE_AGG(review) AS summarized_review
  FROM reviews
 GROUP BY 1;
```

```output
+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| PRODUCT_ID | SUMMARIZED_REVIEW                                                                                                                                                                                                                                                                                         |
|------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| 1          | The restaurant received mixed reviews. Some customers had a great experience, enjoying the pizza and finding the restaurant excellent. However, others had a more neutral experience, describing the food and service as mediocre, with one customer specifically mentioning that the service was subpar. |
+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| 2          | The reviewer had a poor experience at the restaurant, citing the use of low-quality ingredients and expressing regret over not eating at home instead. They strongly advise against visiting this establishment.                                                                                          |
+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

See also [AI_AGG](ai_agg.md).

---
title: AI_TRANSCRIBE
source: https://docs.snowflake.com/en/sql-reference/functions/ai_transcribe.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# AI_TRANSCRIBE

Transcribes text from an audio or video file with optional timestamps and speaker labels. AI_TRANSCRIBE supports
[numerous languages](../../user-guide/snowflake-cortex/ai-audio.md), and
audio can contain more than one language. Timestamps and speaker labels are extracted based on the specified timestamp
granularity, as shown in the table below.

| Timestamp granularity | Result |
| --- | --- |
| Default | Transcription of entire audio file in one piece |
| Word | Transcription with timestamps for each word |
| Speaker | Indicates who is speaking, and a timestamp, at each change of speaker |

## Syntax

```sqlsyntax
AI_TRANSCRIBE( <audio_file> [ , <options> ] [, <return_error_details> ] )
```

## Arguments

**Required:**

`audio_file`
:   A FILE type object representing an audio file. Use [TO_FILE function](to_file.md) to create a reference to your staged file.

**Optional:**

`options`
:   An [OBJECT value](../data-types-semistructured.md) containing zero or more of the following fields.

    * `timestamp_granularity`: A string specifying the desired timestamp granularity. Possible values are:

      + `"word"`: The file is transcribed as a series of words, each with its own timestamp.
      + `"speaker"`: The file is transcribed as a series of conversational “turns,” each with its own timestamp and speaker label.

      If this field is not specified, the entire file is transcribed as a single segment without timestamps by default.

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

An string containing a JSON representation of the transcription result. The JSON object contains the following fields:

* `"audio_duration"`: The total duration of the audio file in seconds.
* `"text"`: The transcription of the complete audio file, provided when the `timestamp_granularity` field is not specified.
* `"segments"`: An array of segments, provided when the `timestamp_granularity` field is set to `"word"` or
  `"speaker"`. Each segment is a JSON object containing the following fields:

  + `"start"`: The start time of the segment in seconds.
  + `"end"`: The end time of the segment in seconds.
  + `"text"`: The transcription text for the segment.
  + `"speaker_label"`: The label of the speaker for the segment, provided when the `timestamp_granularity` field is set to `speaker`.
    Labels are of the form “SPEAKER_00”, “SPEAKER_01”, etc. and are assigned in the order speakers are detected in the audio file.

## Error behavior

By default, if AI_TRANSCRIBE can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: A VARCHAR value containing the transcription result, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this role.

## Usage notes

* For a list of supported languages, see [Supported languages](../../user-guide/snowflake-cortex/ai-audio.md)

  Supported languages are automatically detected. A file can contain multiple languages, each of which is recognized and
  transcribed. For accurate language detection, speech must begin within the first five seconds of the file.
* AI_TRANSCRIBE supports the following audio and video file formats:

  |  |  |
  | --- | --- |
  | Audio | FLAC, MP3, MP4, OGG, WAV, WEBM |
  | Video | MKV, MP4, OGV, WEBM |

  Video files must contain at least one audio track in FLAC, MP3, OPUS, VORBIS, or WAV format.

  Factors such as sample rate, bit depth, and number of channels do not affect transcription, but they might make the
  file too large to process if they are too high. Internally, AI_TRANSCRIBE uses monophonic audio at 16 KHz, and
  resamples input files when they are not already in this format
* The maximum audio file size is 700 MB.
* The maximum audio file duration is 60 minutes when timestamp granularity is set to “word” or “speaker”.
  If timestamp granularity is not used, the maximum duration is 120 minutes.

## Examples

For examples, see [AI Audio examples](../../user-guide/snowflake-cortex/ai-audio.md).

## Troubleshooting

If the function fails, it raises an error. Common error messages include:

| Error Message | Situation and Solution |
| --- | --- |
| Invalid options object | The option provided for the `timestamp_granularity` field, if provided, must be “word” or “speaker”. |
| No response from server | The audio file cannot be retrieved, perhaps because of an expired scoped URL. |
| File too large. Maximum size is 734,003,200 Bytes, file exceeds this limit. | The provided audio file exceeds the maximum file size. |
| Invalid file format. Only [‘flac’, ‘mp3’, ‘ogg’, ‘wav’, ‘webm’] files are supported, or WebM file does not contain an audio stream. | The audio file is not one of the supported formats, which are listed in the error message. WebM files support multiple media types, so make sure the file contains an audio stream. If the file is in a supported format, check that it is not corrupted. |
| File will be too large after resampling it to 16000 Hertz. Expected size is 3,355,444,448,000.0 Bytes. | The provided audio file is too large after resampling to 16 KHz. If the provided audio has a lower sample rate, its resampled size is larger than the original, and could potentially exceed the maximum allowed file size. |
| Audio duration too long: 6052.10 seconds. Maximum allowed: 3600 seconds. or Audio duration too long: 7335.28 seconds. Maximum allowed: 7200 seconds. | The provided audio file is too long. If you are using timestamp granularity, the maximum duration is 60 minutes (3600 seconds). |
| Unsupported detected language | The audio file contains a language that is not supported by AI_TRANSCRIBE. |

## Regional availability

AI_TRANSCRIBE is available in the following regions:

* AWS US West 2 (Oregon)
* AWS US East 1 (N. Virginia)
* AWS EU Central 1 (Frankfurt)
* Azure East US 2 (Virginia)

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: AI_TRANSLATE
source: https://docs.snowflake.com/en/sql-reference/functions/ai_translate.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# AI_TRANSLATE

> **Note:**
>
> AI_TRANSLATE is the updated version of [TRANSLATE (SNOWFLAKE.CORTEX)](translate-snowflake-cortex.md).
> For the latest functionality, use AI_TRANSLATE.

Translates the given input text from one supported language to another.

## Syntax

```sqlsyntax
AI_TRANSLATE(
    <text>, <source_language>, <target_language> [, <return_error_details> ] )
```

## Arguments

`text`
:   A string containing the text to be translated.

`source_language`
:   A string specifying the language code for the language the text is currently in. See Usage notes for a list of
    supported language codes. If the source language code is an empty string, `''`, the source language is
    automatically detected.

`target_language`
:   A string specifying the language code into which the text should be translated. See Usage notes for a list of
    supported language codes.

**Optional:**

`return_error_details`
:   A BOOLEAN flag that indicates whether to return error details in case of error. When set to TRUE, the function returns
    an OBJECT that contains the value and the error message, one of which is NULL depending on whether the function
    succeeded or failed. See Error behavior for details.

## Returns

A string containing a translation of the original text into the target language.

## Error behavior

By default, if AI_TRANSLATE can’t process the input, the function returns NULL. If the query processes multiple rows,
rows with errors return NULL and don’t prevent the query from completing.

The return value on error depends on the `return_error_details`
argument. The following table shows the return value based on the `return_error_details` argument:

> | `return_error_details` | Return value | Description |
> | --- | --- | --- |
> | FALSE    Not passed | NULL |  |
> | TRUE | OBJECT with `value` and `error` fields | `value`: A VARCHAR value that contains the translated text, or NULL if an error occurred.    `error`: A VARCHAR value that contains the error message if an error occurred, or NULL if the function succeeded. |

For more information about error handling for AI functions, see [Snowflake Cortex AI Function: Multirow error handling improvements](../../release-notes/bcr-bundles/2026_02/bcr-2184.md).

## Usage notes

The following languages are supported by the AI_TRANSLATE function. Use the corresponding language code for the source and
target language.

The AI_TRANSLATE model also supports a mix of different languages in the text being translated (for example,
“Spanglish”). In this case, specify an empty string (`''`) as the source language to auto-detect the languages
used in the source text.

| Language | Code |
| --- | --- |
| Arabic | `'ar'` |
| Chinese | `'zh'` |
| Croatian | `'hr'` |
| Czech | `'cs'` |
| Dutch | `'nl'` |
| English | `'en'` |
| Finnish | `'fi'` |
| French: | `'fr'` |
| German | `'de'` |
| Greek | `'el'` |
| Hebrew | `'he'` |
| Hindi | `'hi'` |
| Italian | `'it'` |
| Japanese | `'ja'` |
| Korean | `'ko'` |
| Norwegian | `'no'` |
| Polish | `'pl'` |
| Portuguese | `'pt'` |
| Romanian | `'ro'` |
| Russian | `'ru'` |
| Spanish | `'es'` |
| Swedish | `'sv'` |
| Turkish | `'tr'` |

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Examples

The following example translates each row of a table from English to German (in this example, `review_content` is
a column from the `reviews` table):

```sqlexample
SELECT AI_TRANSLATE(review_content, 'en', 'de') FROM reviews LIMIT 10;
```

The following example translates a fictitious product review from English to Spanish:

```sqlexample
SELECT AI_TRANSLATE(
  'Hit the slopes with Snowflake\'s latest innovation - "Skii Headphones" designed to keep your ears warm and your soul ablaze. Engineered specifically for snow weather, these rugged headphones combine crystal-clear sound with thermally-insulated ear cups to keep the chill out and the beats in. Whether you\'re carving through powder or cruising down groomers, Skii Headphones will fuel your mountain adventures with vibrant sound and unrelenting passion. Stay warm, stay fired up, and shred the mountain with Snowflake Skii Headphones',
'en','es');
```

The result of this query is:

```output
Sube a las pistas con la última innovación de Snowflake: "Skii Headphones", diseñados para mantener tus oídos calientes y tu alma encendida. Diseñados específicamente para el clima de nieve, estos audífonos resistentes combinan un sonido cristalino con copas de oído aisladas térmicamente para mantener el frío fuera y los ritmos dentro. Ya sea que estés esculpiendo en polvo o deslizándote por pistas preparadas, los Skii Headphones alimentarán tus aventuras en la montaña con un sonido vibrante y una pasión incesante. Mantente caliente, mantente encendido y arrasa la montaña con los Skii Headphones de Snowflake.
```

The following example translates a call transcript from German to English:

```sqlexample
SELECT AI_TRANSLATE(
  ('Kunde: Hallo
    Agent: Hallo, ich hoffe, es geht Ihnen gut. Um Ihnen am besten helfen zu können, teilen Sie bitte Ihren Vor- und Nachnamen und den Namen der Firma, von der aus Sie anrufen.
    Kunde: Ja, hier ist Thomas Müller von SkiPisteExpress.
    Agent: Danke Thomas, womit kann ich Ihnen heute helfen?
    Kunde: Also wir haben die XtremeX Helme in Größe M bestellt, die wir speziell für die kommende Wintersaison benötigen. Jedoch sind alle Schnallen der Helme defekt, und keiner schließt richtig.
    Agent: Ich verstehe, dass das ein Problem für Ihr Geschäft sein kann. Lassen Sie mich überprüfen, was mit Ihrer Bestellung passiert ist. Um zu bestätigen: Ihre Bestellung endet mit der Nummer 56682?
    Kunde: Ja, das ist meine Bestellung.
    Agent: Ich sehe das Problem. Entschuldigen Sie die Unannehmlichkeiten. Ich werde sofort eine neue Lieferung mit reparierten Schnallen für Sie vorbereiten, die in drei Tagen bei Ihnen eintreffen sollte. Ist das in Ordnung für Sie?
    Kunde: Drei Tage sind ziemlich lang, ich hatte gehofft, diese Helme früher zu erhalten. Gibt es irgendeine Möglichkeit, die Lieferung zu beschleunigen?
    Agent: Ich verstehe Ihre Dringlichkeit. Ich werde mein Bestes tun, um die Lieferung auf zwei Tage zu beschleunigen. Wie kommst du damit zurecht?
    Kunde: Das wäre großartig, ich wäre Ihnen sehr dankbar.
    Agent: Kein Problem, Thomas. Ich kümmere mich um die eilige Lieferung. Danke für Ihr Verständnis und Ihre Geduld.
    Kunde: Vielen Dank für Ihre Hilfe. Auf Wiedersehen!
    Agent: Bitte, gerne geschehen. Auf Wiedersehen und einen schönen Tag noch!'
,'de','en');
```

The result is:

```output
Customer: Hello
Agent: Hello, I hope you are well. To best assist you, please share your first and last name and the name of the company you are calling from.
Customer: Yes, this is Thomas Müller from SkiPisteExpress.
Agent: Thank you, Thomas, what can I help you with today?
Customer: So, we ordered the XtremeX helmets in size M, which we specifically need for the upcoming winter season. However, all the buckles on the helmets are defective and none of them close properly.
Agent: I understand that this can be a problem for your business. Let me check what happened with your order. To confirm: your order ends with the number 56682?
Customer: Yes, that's my order.
Agent: I see the issue. I apologize for the inconvenience. I will prepare a new delivery with repaired buckles for you immediately, which should arrive in three days. Is that okay for you?
Customer: Three days is quite a long time; I was hoping to receive these helmets sooner. Is there any way to expedite the delivery?
Agent: I understand your urgency. I will do my best to expedite the delivery to two days. How does that sound?
Customer: That would be great, I would be very grateful.
Agent: No problem, Thomas. I will take care of the urgent delivery. Thank you for your understanding and patience.
Customer: Thank you very much for your help. Goodbye!
Agent: You're welcome. Goodbye and have a nice day!
```

Finally, the following example illustrates translating text from two different languages (in this case English and Spanish, or “Spanglish”) to English.
Note that the specification of the source language is the empty string, which tells AI_TRANSLATE to automatically detect the language.

```sqlexample
SELECT AI_TRANSLATE('Voy a likear tus fotos en Insta.', '', 'en')
```

This query results in:

```output
I'm going to like your photos on Insta.
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: ALERT_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/alert_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# ALERT_HISTORY

This INFORMATION_SCHEMA table function can be used to query the history of [alerts](../../user-guide/alerts.md) within a specified
date range. The function returns the history of alerts for your entire Snowflake account or a specified alert.

You can also access this information through the ALERT_HISTORY view in the ACCOUNT_USAGE schema. For details on the differences
between the view and table function, refer to [Differences between Account Usage and Information Schema](../account-usage.md).

> **Note:**
>
> This function returns alert executions within the last 7 days or the next scheduled execution within the next 8 days.

## Syntax

```sqlsyntax
ALERT_HISTORY(
      [ SCHEDULED_TIME_RANGE_START => <constant_expr> ]
      [, SCHEDULED_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <integer> ]
      [, ALERT_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`SCHEDULED_TIME_RANGE_START => constant_expr` , . `SCHEDULED_TIME_RANGE_END => constant_expr`
:   Time range (in TIMESTAMP_LTZ format), within the last 7 days, in which the evaluation of the condition for the alert was
    scheduled.

    * If `SCHEDULED_TIME_RANGE_END` is not specified, the function returns those alerts that have already completed, are
      currently running, or are scheduled in the future.
    * If `SCHEDULED_TIME_RANGE_END` is [CURRENT_TIMESTAMP](current_timestamp.md), the function returns those alerts
      that have already completed or are currently running. Note that an alert that is executed immediately prior to the current
      time may still be identified as scheduled.

    > **Note:**
    >
    > If no start or end time is specified, the most recent alerts are returned, up to the specified RESULT_LIMIT value.

    If the time range does not fall within the last 7 days, an error is returned.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function.

    If the number of matching rows is greater than this limit, the alert executions with the most recent timestamp are returned, up
    to the specified limit.

    Range: `1` to `10000`

    Default: `100`.

`ALERT_NAME => string`
:   A case-insensitive string specifying an alert. Only non-qualified alert names are supported. Only executions of the specified
    alert are returned. Note that if multiple alerts have the same name, the function returns the history for each of these alerts.

## Usage notes

* Returns results only for the ACCOUNTADMIN role, the alert owner (i.e. the role with the OWNERSHIP privilege on the alert).
* This function returns a maximum of 10,000 rows, set in the `RESULT_LIMIT` argument value. The default value is `100`.

  Note that when the ALERT_HISTORY function is queried, its alert name, time range, and result limit arguments are applied
  first followed by the WHERE and LIMIT clause, respectively, if specified. In addition, the ALERT_HISTORY function
  returns records in descending SCHEDULED_TIME order. Alerts that are completed (i.e. with a SUCCEEDED, FAILED, or CANCELLED
  state) tend to be scheduled earlier, so they are generally returned later in order in the search results.

  In practice, if you have many alerts running in your account, the results returned by the function could include fewer than
  expected completed alerts or only scheduled alerts, especially if the RESULT_LIMIT value is relatively low. To query the history
  of alerts that have already run, Snowflake recommends using a combination of the
  `SCHEDULED_TIME_RANGE_START => constant_expr` and/or `SCHEDULED_TIME_RANGE_END => constant_expr` arguments.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the
  function name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* This function can return all executions run in the past 7 days or the next scheduled execution within the next 8 days.

## Output

The ALERT_HISTORY table function produces one row for each alert execution. Each row contains the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the alert. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the alert. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the alert. |
| CONDITION | VARCHAR | The text of the SQL statement that serves as the condition for the alert. |
| CONDITION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the condition of the alert. |
| ACTION | VARCHAR | The text of the SQL statement that serves as the action for the alert. |
| ACTION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the action of the alert. |
| STATE | VARCHAR | Status of the alert. This can be one of the following:   * SCHEDULED: The alert will execute at the time specified by the SCHEDULED_TIME column. This status does not apply to   [alerts on new data](../../user-guide/alerts.md). * EXECUTING: The condition or action of the alert is currently executing. * FAILED: The alert failed. Either the alert condition or alert action encountered an error that prevented it from being   executed. * CANCELLED: The alert execution was cancelled (e.g. when the alert is suspended). * CONDITION_FALSE: The condition was evaluated successfully but returned no data. As a result, the action was not executed.   This status does not apply to [alerts on new data](../../user-guide/alerts.md). * CONDITION_FAILED: The evaluation of the condition failed. For details on the failure, check the SQL_ERROR_CODE and   SQL_ERROR_MESSAGE columns. * ACTION_FAILED: The condition was evaluated successfully, but the execution of the action failed. For details on the   failure, check the SQL_ERROR_CODE and SQL_ERROR_MESSAGE columns. * TRIGGERED: The condition was evaluated successfully, and the action was executed successfully. |
| SQL_ERROR_CODE | NUMBER | Error code, if the alert returned an error or failed to execute (e.g. if the current user did not have privileges to execute the alert). |
| SQL_ERROR_MESSAGE | VARCHAR | Error message, if the alert returned an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the scheduled alert is/was scheduled to start running.  Note that we make a best effort to ensure absolute precision, but only guarantee that alerts do not execute *before* the scheduled time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the alert completed, or NULL if SCHEDULED_TIME is in the future or if the alert is still running. |
| SCHEDULED_FROM | VARCHAR | Specifies what initiated the alert. The column contains one of the following values:   * `SCHEDULE`: The alert was scheduled to run normally, as described in the SCHEDULE clause of   [CREATE ALERT](../sql/create-alert.md). * `EXECUTE ALERT`: The alert was scheduled to run using [EXECUTE ALERT](../sql/execute-alert.md). * `TRIGGER`: The [alert on new data](../../user-guide/alerts.md) was run because the underlying table or view   contains new data. |

## Examples

See [Monitoring the execution of alerts](../../user-guide/alerts.md).

---
title: ALL_USER_NAMES
source: https://docs.snowflake.com/en/sql-reference/functions/all_user_names.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# ALL_USER_NAMES

Returns all user names in the current account.

## Syntax

```sqlsyntax
ALL_USER_NAMES()
```

## Arguments

None.

## Returns

The data type of the returned value is `ARRAY`.

## Usage notes

* Users with any active role can retrieve the list of all usernames in the current account. However, simply knowing the usernames does not
  allow a role the ability to perform further actions on the users. User management requires a minimum set of privileges.
* Usernames (i.e. the `NAME` property value) are the unique identifier of the user object in Snowflake, while login names (i.e. the `LOGIN_NAME` property value) are used to authenticate to Snowflake. Usernames are not sensitive data and are returned by other commands and functions (e.g. [SHOW GRANTS](../sql/show-grants.md)). Login names are sensitive data.
* As a best practice, username and login name values should be different. To update existing username or login name values, execute the [ALTER USER](../sql/alter-user.md) command. When creating new users with the [CREATE USER](../sql/create-user.md) command, ensure that the `NAME` and `LOGIN_NAME` values are different.

## Examples

Return all user names for the current account.

> ```sqlexample
> select all_user_names();
>
> +---------------------------+
> | ALL_USER_NAMES()          |
> +---------------------------+
> | [ "user1", "user2", ... ] |
> +---------------------------+
> ```

---
title: ANY_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/any_value.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md) (General)

# ANY_VALUE

Returns some value of the expression from the group. The result is non-deterministic.

## Syntax

**Aggregate function**

```sqlsyntax
ANY_VALUE( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
ANY_VALUE( [ DISTINCT ] <expr1> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   The input expression.

`expr2`
:   The column to partition on, if you want the result to be split into multiple
    partitions.

## Returns

This function can return a value of any data type.

If the input expression is NULL, the function returns NULL.

## Usage notes

* The DISTINCT keyword can be specified for this function, but it does not have any effect.
* The function doesn’t exclude NULL values. If the expression contains NULL values, the function can
  return a NULL value.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Using ANY_VALUE with GROUP BY statements

ANY_VALUE can simplify and optimize the performance of [GROUP BY](../constructs/group-by.md) statements. A common problem for many queries is that the result of a query with a GROUP BY
clause can only contain expressions used in the GROUP BY clause itself, or results of aggregate functions. For example:

```sqlexample
SELECT customer.id , customer.name , SUM(orders.value)
  FROM customer
  JOIN orders ON customer.id = orders.customer_id
  GROUP BY customer.id , customer.name;
```

In this query, the `customer.name` attribute needs to be in the GROUP BY to be included in the result. This is unnecessary
(for example, when `customer.id` is known to be unique) and makes the computation
possibly more complex and slower. Another option is to use an aggregate function. For example:

```sqlexample
SELECT customer.id , MIN(customer.name) , SUM(orders.value)
  FROM customer
  JOIN orders ON customer.id = orders.customer_id
  GROUP BY customer.id;
```

This simplifies the GROUP BY clause, but still requires computing the [MIN](min.md) function, which incurs an extra cost.

With ANY_VALUE, you can execute the following query:

```sqlexample
SELECT customer.id , ANY_VALUE(customer.name) , SUM(orders.value)
  FROM customer
  JOIN orders ON customer.id = orders.customer_id
  GROUP BY customer.id;
```

---
title: APPLICATION_CALLBACK_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/application_callback_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# APPLICATION_CALLBACK_HISTORY

Returns information about the history of
[callback](../../developer-guide/native-apps/callbacks.md) invocations for Snowflake Native Apps in your Snowflake account.
Each row represents a callback invocation, including the callback type, execution mode, state, and any error information.

## Syntax

```sqlsyntax
APPLICATION_CALLBACK_HISTORY(
  [ APPLICATION_NAME => '<application_name>' ]
  [ , CALLBACK_TYPE => '<callback_manifest_name>' ]
  [ , LIMIT => <number> ]
)
```

## Optional arguments

`APPLICATION_NAME => 'application_name'`
:   The name of the app for which to retrieve callback history. If not specified, returns
    history for all apps in the account.

`CALLBACK_TYPE => 'callback_manifest_name'`
:   The callback type as defined in the manifest file. If not specified, returns
    history for all callback types in the specified app.

`LIMIT => number`
:   The maximum number of rows to return. Default is 100. Maximum is 10000.

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* The `QUERY_TEXT` and `ERROR_MESSAGE` columns are redacted unless the caller is the app itself.
* Using this function requires one of the following:

  + OWNERSHIP on the app.
  + MONITOR privilege on the app.
  + Running as the app itself.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| TYPE | VARCHAR | The callback type as defined in the manifest file. |
| EXECUTION_MODE | VARCHAR | The execution mode of the callback. Possible values are: `SYNC`, `ASYNC`. |
| APPLICATION_NAME | VARCHAR | The name of the app that defines the callback. |
| STATE | VARCHAR | The state of the callback execution. See Callback states. |
| STARTED_ON | TIMESTAMP_LTZ | The timestamp when the callback was invoked. |
| COMPLETED_ON | TIMESTAMP_LTZ | The completion timestamp. NULL if the callback has not yet completed. |
| TRIGGERING_QUERY_ID | VARCHAR | The query ID of the SQL statement that triggered the callback. NULL if the callback was not triggered by a SQL query (for example, when triggered after an upgrade completes). |
| QUERY_ID | VARCHAR | The query ID of the callback procedure execution. NULL if the callback has not yet completed. |
| QUERY_TEXT | VARCHAR | The procedure call SQL text. NULL if the callback has not yet completed. This column is redacted unless the caller is the app itself. |
| ERROR_CODE | VARCHAR | The error code. NULL unless STATE is `FAILED` or `ABORTED`. |
| ERROR_MESSAGE | VARCHAR | The error message. NULL unless STATE is `FAILED` or `ABORTED`. This column is redacted unless the caller is the app itself. |

## Callback states

The following table describes the possible values for the STATE column:

| State | Applies to | Description |
| --- | --- | --- |
| `QUEUED` | Async only | The callback is waiting to be scheduled. |
| `SCHEDULED` | Async only | The callback has been scheduled and is waiting to be executed. |
| `EXECUTING` | Async / Sync | The callback procedure is currently running. |
| `COMPLETED` | Async / Sync | The callback procedure finished successfully. |
| `FAILED` | Async / Sync | The callback procedure failed validation (for example, wrong signature) or execution. |
| `ABORTED` | Async only | An internal scheduling error occurred. This state requires support intervention. |

## Examples

Retrieve the callback history for a specific application:

```sqlexample
SELECT *
FROM TABLE(
    INFORMATION_SCHEMA.APPLICATION_CALLBACK_HISTORY(
        APPLICATION_NAME => 'my_app'));
```

Retrieve the callback history for a specific callback type with a custom limit:

```sqlexample
SELECT *
FROM TABLE(
    INFORMATION_SCHEMA.APPLICATION_CALLBACK_HISTORY(
        APPLICATION_NAME => 'my_app',
        CALLBACK_TYPE => 'after_configuration_change',
        LIMIT => 100));
```

---
title: APPLICATION_CONFIGURATION_VALUE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/application_configuration_value_history.md
section: SQL Functions
---

Categories:

[Table functions](../functions-table.md) (Tables)

# APPLICATION_CONFIGURATION_VALUE_HISTORY

Provides a history of the value changes for [application configurations](../../developer-guide/native-apps/app-configuration.md) in the specified Snowflake Native App.

You can call this function to check the history of the value changes for an application configuration. For information, see
[Application configuration](../../developer-guide/native-apps/app-configuration.md).

## Syntax

```sqlsyntax
APPLICATION_CONFIGURATION_VALUE_HISTORY(
  [ APPLICATION_NAME => '<application_name>' ]
  [ , CONFIGURATION_NAME => '<config_name>' ]
)
```

## Arguments

**Required:**

`application_name`
:   Name of the application that the configuration is in.

**Optional:**

`config_name`
:   Name of the configuration. If not provided, the function returns the history for all configurations in the application.

## Returns

The function returns the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| NAME | STRING | The name of the configuration, defined by the provider. |
| APPLICATION_NAME | STRING | The name of the application that the configuration is in. |
| CREATED_ON | TIMESTAMP | The timestamp when the configuration object was created. |
| UPDATED_ON | TIMESTAMP | The timestamp when the configuration object was last updated. |
| TYPE | STRING | The type of the configuration. Possible values are APPLICATION_NAME and STRING. |
| STATUS | STRING | The status of the configuration. Possible values are PENDING and DONE. |
| SENSITIVE | BOOLEAN | Whether the value is sensitive or not. |
| VALUE | STRING | The value that is set by the consumer.  For application configurations of the APPLICATION_NAME type, this is the most up-to-date name of the application specified by the consumer. This may not be the same as initially provided if the application has been renamed. If the application has been dropped, no value will be shown here, as if the value is not set.  When `SENSITIVE=TRUE`, the value is hidden, unless the executing role is the application owning the configuration. |
| VALUE_UPDATED_ON | TIMESTAMP | The last updated timestamp when the value was set or unset. |
| LABEL | STRING | A user-friendly name to be displayed in the UI, provided by the provider. |
| DESCRIPTION | STRING | The description of the configuration. |
| APPLICATION_ROLES | STRING | The comma-separated app role names that have access to the configuration.  This displays the most up-to-date names, even if roles have been renamed. If an application role has been dropped, it will not be included in the output list. |

## Usage notes

* The view only displays configurations for which the current role for the session has been granted access privileges.
* The view does not include configurations that have been dropped.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the history of the value changes for the `config_name` application configuration
in the `application_name` application:

```sqlexample
SELECT * FROM TABLE(information_schema.application_configuration_value_history(application_name => 'my_app', configuration_name => 'my_configuration'));
```

---
title: APPLICATION_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/application_json.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Message Construction)

# APPLICATION_JSON

Returns a JSON object that specifies the JSON message to use for a notification. This is a helper function that you use to
construct a message object for the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md) ,
    [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) ,
    [TEXT_HTML](text_html.md) ,
    [TEXT_PLAIN](text_plain.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.APPLICATION_JSON( '<message>' )
```

## Arguments

`'message'`
:   Content of the message to send.

    You do not need to escape the double quotes around strings within the message (for example, double quotes around the keys
    and values). The function escapes these double quotes for you.

## Returns

A JSON-formatted string that specifies a message for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure to send.

For example, suppose that you call the function and pass in a JSON message:

```sqlexample
SELECT SNOWFLAKE.NOTIFICATION.APPLICATION_JSON('{"data": "hello world"}');
```

The function returns the following JSON-formatted string:

```json
'{"application/json":"{\"data\": \"hello world\"}"}'
```

Note how the function escapes the double quotes around the keys and values in your message.

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: APPLICATION_SPECIFICATION_STATUS_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/application_specification_status_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# APPLICATION_SPECIFICATION_STATUS_HISTORY

Returns information about the history of the
[status changes for app specifications](../../developer-guide/native-apps/ui-consumer-app-spec.md) in your Snowflake account.

## Syntax

```sqlsyntax
APPLICATION_SPECIFICATION_STATUS_HISTORY(
  [ APPLICATION_NAME => '<application_name>' ]
  [ , SPECIFICATION_NAME => '<specification_name>'])
  [ LIMIT => <number_of_rows> ]
```

## Arguments

`APPLICATION_NAME => 'application_name'`
:   The name of the application for which to retrieve specification status history. If not specified,
    returns status history for all app specifications.

`SPECIFICATION_NAME => 'specification_name'`
:   The name of the app specification for which to retrieve status history. If not specified, returns
    status history for all app specifications.

`LIMIT <number_of_rows>`
:   The maximum number of rows to return.

## Usage notes

* This function only returns rows for app specifications that current role has privileges to view.
* This function only returns rows for app specifications in the current account.

## Output

The APPLICATION_SPECIFICATION table function produces one row for each app specification.
Each row contains the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | The name of the app specification. |
| APPLICATION_NAME | TEXT | The name of the app that contains the app specification |
| SEQUENCE_NUMBER | NUMBER | The sequence number of the app specification. |
| REQUESTED_ON | TIMESTAMP_TZ | The date and time when the app created the app specification. |
| USER_NAME | TEXT | The user that updated the app specification. This value is empty if it is a new pending request created by the application. |
| STATUS | TEXT | The status of the app specification. One of the following values: : `PENDING`, `APPROVED`, `DECLINED`. |
| STATUS_UPDATED_ON | TIMESTAMP_TZ | The date and time when the app specification was last modified. |
| LABEL | TEXT | The label associated with the app specification status change, if any. |
| DESCRIPTION | TEXT | The description associated with the app specification status change, if any. |
| DEFINITION | TEXT | The fields that comprise the app specification definition. For more information, see [Overview of app specifications](../../developer-guide/native-apps/requesting-app-specs.md). |

## Example

```sqlexample
SELECT *
FROM TABLE(
    INFORMATION_SCHEMA.APPLICATION_SPECIFICATION_STATUS_HISTORY(
        application_name=>'my_app',
        specification_name=>'eai_spec'))
    LIMIT 5;
```

The preceding example returns the last five status changes for the app specification named `my_spec` in the app named `my_app`.

---
title: APPROX_COUNT_DISTINCT
source: https://docs.snowflake.com/en/sql-reference/functions/approx_count_distinct.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window functions](../functions-window.md)

# APPROX_COUNT_DISTINCT

Uses HyperLogLog to return an approximation of the distinct cardinality of the input (i.e. `HLL(col1, col2, ... )` returns an approximation of `COUNT(DISTINCT col1, col2, ... )`).

For more information about HyperLogLog, see [Estimating the Number of Distinct Values](../../user-guide/querying-approximate-cardinality.md).

Aliases:
:   [HLL](hll.md).

See also:
:   [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_COMBINE](hll_combine.md) , [HLL_ESTIMATE](hll_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
APPROX_COUNT_DISTINCT( [ DISTINCT ] <expr1>  [ , ... ] )

APPROX_COUNT_DISTINCT(*)
```

**Window function**

```sqlsyntax
APPROX_COUNT_DISTINCT( [ DISTINCT ] <expr1>  [ , ... ] ) OVER ( [ PARTITION BY <expr2> ] )

APPROX_COUNT_DISTINCT(*) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This is the expression for which you want to know the number of distinct values.

`expr2`
:   This is the optional expression used to group rows into partitions.

`*`
:   Returns an approximation of the total number of records, excluding records with NULL values.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

## Returns

The data type of the returned value is INTEGER.

## Usage notes

* Although the computation is an approximation, it is deterministic. When this function is called with the same input
  data, this function returns the same results.
* For information about NULL values and aggregate functions, see [Aggregate functions and NULL values](../functions-aggregation.md).

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

This example shows how to use APPROX_COUNT_DISTINCT and its alias HLL. This example calls
both `COUNT(DISTINCT i)` and `APPROX_COUNT_DISTINCT(i)` to emphasize
that the results of those two functions do not always match exactly.

The exact output of the following query might vary because APPROX_COUNT_DISTINCT returns an approximation, not an exact value.

```sqlexample
SELECT COUNT(i), COUNT(DISTINCT i), APPROX_COUNT_DISTINCT(i), HLL(i)
  FROM sequence_demo;
```

```output
+----------+-------------------+--------------------------+--------+
| COUNT(I) | COUNT(DISTINCT I) | APPROX_COUNT_DISTINCT(I) | HLL(I) |
|----------+-------------------+--------------------------+--------|
|     1024 |              1024 |                     1007 |   1007 |
+----------+-------------------+--------------------------+--------+
```

---
title: APPROX_PERCENTILE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_percentile.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Percentile Estimation) , [Window functions](../functions-window.md)

# APPROX_PERCENTILE

Returns an approximated value for the desired percentile (that is, if column `c` has `n` numbers,
APPROX_PERCENTILE(c, p) returns a number such that approximately `n * p` of the numbers in `c`
are smaller than the returned number).

This function uses an improved version of the t-Digest algorithm. For more information, see
[Estimating Percentile Values](../../user-guide/querying-approximate-percentile-values.md).

See also:
:   [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) , [APPROX_PERCENTILE_COMBINE](approx_percentile_combine.md) , [APPROX_PERCENTILE_ESTIMATE](approx_percentile_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
APPROX_PERCENTILE( <expr> , <percentile> )
```

**Window function**

```sqlsyntax
APPROX_PERCENTILE( <expr> , <percentile> ) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`expr`
:   A valid expression, such as a column name, that evaluates to a numeric value.

`percentile`
:   A constant real value greater than or equal to `0.0` and less than `1.0`.
    This indicates the percentile (from 0 to 99.999…).
    For example, the value 0.65 indicates the 65th percentile.

`expr3`
:   This is the optional expression used to group rows into partitions.

## Returns

The output is returned as a DOUBLE value.

## Usage notes

* Percentile works only on numeric values, so `expr` should produce
  values that are numbers or can be cast to numbers.
* The values returned are not necessarily in the data set.
* The value returned is an approximation. The size of the data set and the
  skew in the data set affect the accuracy of the approximation.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Examples

Demonstrate the APPROX_PERCENTILE function:

Create and populate a table with values:

```sqlexample
CREATE TABLE testtable (c1 INTEGER);

INSERT INTO testtable (c1) VALUES
  (0), (1), (2), (3), (4), (5), (6), (7), (8), (9), (10);
```

Run queries and show the output:

```sqlexample
SELECT APPROX_PERCENTILE(c1, 0.1)
  FROM testtable;
```

```output
+----------------------------+
| APPROX_PERCENTILE(C1, 0.1) |
|----------------------------|
|                        1.5 |
+----------------------------+
```

```sqlexample
SELECT APPROX_PERCENTILE(c1, 0.5)
  FROM testtable;
```

```output
+----------------------------+
| APPROX_PERCENTILE(C1, 0.5) |
|----------------------------|
|                        5.5 |
+----------------------------+
```

Note that the value returned in this case is higher than any value actually
in the data set:

```sqlexample
SELECT APPROX_PERCENTILE(c1, 0.999)
  FROM testtable;
```

```output
+------------------------------+
| APPROX_PERCENTILE(C1, 0.999) |
|------------------------------|
|                         10.5 |
+------------------------------+
```

---
title: APPROX_PERCENTILE_ACCUMULATE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_percentile_accumulate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Percentile Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_PERCENTILE_ACCUMULATE

Returns the internal representation of the t-Digest state (as a JSON object) at the end of aggregation. (For more information about t-Digest, see:
[Estimating Percentile Values](../../user-guide/querying-approximate-percentile-values.md).)

The function [APPROX_PERCENTILE](approx_percentile.md) discards this internal, intermediate state when the final percentile estimate is returned. However, in certain advanced use cases, such as estimating incremental percentile during bulk
loading, you may wish to keep the intermediate state, in which case you would use APPROX_PERCENTILE_ACCUMULATE instead of [APPROX_PERCENTILE](approx_percentile.md).

APPROX_PERCENTILE_ACCUMULATE does not return a percentile value. Instead, it returns the algorithm state itself. The intermediate state can later be:

* Combined (i.e. merged) with other intermediate states from separate but
  related batches of data.
* Processed by other functions that operate directly on the intermediate state,
  for example, [APPROX_PERCENTILE_ESTIMATE](approx_percentile_estimate.md). (For an example, see the
  Examples section below.)
* Exported to external tools.

See also:
:   [APPROX_PERCENTILE_COMBINE](approx_percentile_combine.md) , [APPROX_PERCENTILE_ESTIMATE](approx_percentile_estimate.md)

## Syntax

```sqlsyntax
APPROX_PERCENTILE_ACCUMULATE( <expr> )
```

## Arguments

`expr`
:   A valid expression, such as a column name, that evaluates to a numeric
    value.

## Usage notes

* Percentile works only on numeric values, so `expr` should produce
  values that are numbers or can be cast to numbers.

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Example

Store the t-Digest state of the `testtable.c1` column in a table and then use the state to compute percentiles:

```sqlexample
CREATE OR REPLACE TABLE resultstate AS
  SELECT APPROX_PERCENTILE_ACCUMULATE(c1) AS s
    FROM testtable;

SELECT APPROX_PERCENTILE_ESTIMATE(s, 0.015)
  FROM resultstate;

SELECT APPROX_PERCENTILE_ESTIMATE(s, 0.2)
  FROM resultstate;
```

Here is a more extensive example that shows the usage of all three
related functions: APPROX_PERCENTILE_ACCUMULATE,
APPROX_PERCENTILE_ESTIMATE, and APPROX_PERCENTILE_COMBINE.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE TABLE test_table1 (c1 INTEGER);
INSERT INTO test_table1 (c1) VALUES (1), (2), (3), (4);
```

Create a table that contains the “state” that represents the current
approximate percentile information for the table named `test_table1`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT APPROX_PERCENTILE_ACCUMULATE(c1) AS rs1
    FROM test_table1);
```

Use that state information to display the current estimate of the median
value (0.5 means that we want the value at the 50th percentile):

```sqlexample
SELECT APPROX_PERCENTILE_ESTIMATE(rs1, 0.5)
  FROM resultstate1;
```

```output
+--------------------------------------+
| APPROX_PERCENTILE_ESTIMATE(RS1, 0.5) |
|--------------------------------------|
|                                  2.5 |
+--------------------------------------+
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) VALUES (5), (6), (7), (8);
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT APPROX_PERCENTILE_ACCUMULATE(c1) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT APPROX_PERCENTILE_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate median value of the combined set of rows:

```sqlexample
SELECT APPROX_PERCENTILE_ESTIMATE(c1, 0.5)
  FROM combined_resultstate;
```

```output
+-------------------------------------+
| APPROX_PERCENTILE_ESTIMATE(C1, 0.5) |
|-------------------------------------|
|                                 4.5 |
+-------------------------------------+
```

---
title: APPROX_PERCENTILE_COMBINE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_percentile_combine.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Percentile Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_PERCENTILE_COMBINE

Combines (merges) percentile input states into a single output state.

This allows scenarios where [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) is run over horizontal partitions of the same table, producing an algorithm state for each table partition. These states can later be
combined using APPROX_PERCENTILE_COMBINE, producing the same output state as a single run of [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) over the entire table.

See also:
:   [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) , [APPROX_PERCENTILE_ESTIMATE](approx_percentile_estimate.md)

## Syntax

```sqlsyntax
APPROX_PERCENTILE_COMBINE( <state> )
```

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md).

## Usage notes

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Example

Return an approximation for the median of numbers in the `testtable.c2`
column (0.5 means the 50th percentile):

> ```sqlexample
> CREATE OR REPLACE TABLE mytesttable AS
>   SELECT APPROX_PERCENTILE_COMBINE(td) s FROM
>     (
>       (SELECT APPROX_PERCENTILE_ACCUMULATE(c2) td FROM testtable WHERE c2 <= 0)
>         UNION ALL
>       (SELECT APPROX_PERCENTILE_ACCUMULATE(c2) td FROM testtable WHERE c2 > 0 AND c2 <= 0.5)
>         UNION ALL
>       (SELECT APPROX_PERCENTILE_ACCUMULATE(C2) td FROM testtable WHERE c2 > 0.5)
>     );
>
> SELECT APPROX_PERCENTILE_ESTIMATE(s , 0.5) FROM mytesttable;
> ```

Return an approximate value for the 2nd percentile of numbers in `mytest.s1 union mytest2.s2`.

> ```sqlexample
> CREATE OR REPLACE TABLE mytest AS (SELECT APPROX_PERCENTILE_ACCUMULATE(c2) s1 FROM testtable WHERE c2 < 0);
>
> CREATE OR REPLACE TABLE mytest2 AS (SELECT APPROX_PERCENTILE_ACCUMULATE(c2) s1 FROM testtable WHERE c2 >= 0);
>
> CREATE OR REPLACE TABLE combinedtable AS
>   SELECT APPROX_PERCENTILE_COMBINE(s) combinedstate FROM
>     (
>       (SELECT s1 s FROM mytest)
>         UNION ALL
>       (SELECT s1 s FROM mytest2)
>     );
>
> SELECT APPROX_PERCENTILE_ESTIMATE(combinedstate , 0.02) FROM combinedtable;
> ```

For a more extensive example, see the Examples section in
[APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md).

---
title: APPROX_PERCENTILE_ESTIMATE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_percentile_estimate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Percentile Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_PERCENTILE_ESTIMATE

Returns the desired approximated percentile value for the specified t-Digest state.

A t-Digest state produced by [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) and [APPROX_PERCENTILE_COMBINE](approx_percentile_combine.md) can be used to compute a percentile estimate using this function.

As such, APPROX_PERCENTILE_ESTIMATE(APPROX_PERCENTILE_ACCUMULATE(…)) is equivalent to APPROX_PERCENTILE(…).

See also:
:   [APPROX_PERCENTILE](approx_percentile.md) , [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) , [APPROX_PERCENTILE_COMBINE](approx_percentile_combine.md)

## Syntax

```sqlsyntax
APPROX_PERCENTILE_ESTIMATE( <state> , <percentile> )
```

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md) or
    [APPROX_PERCENTILE_COMBINE](approx_percentile_combine.md).

`percentile`
:   A constant real value greater than or equal to `0.0` and less than `1.0`.
    This indicates the percentile from 0 to 99.999… (e.g. the value 0.65 indicates the 65th percentile).

## Usage notes

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Example

Consider a scenario where you need to approximate multiple percentile values from a given set of numbers. This can be done by creating the state and then using APPROX_PERCENTILE_ESTIMATE to calculate
all the percentiles:

1. First, store the state:

   ```sqlexample
   CREATE OR REPLACE TABLE resultstate AS (
     SELECT APPROX_PERCENTILE_ACCUMULATE(c1) AS s
       FROM testtable
     );
   ```
2. Then, query the state for multiple percentiles:

   ```sqlexample
   SELECT APPROX_PERCENTILE_ESTIMATE(s, 0.01),
       APPROX_PERCENTILE_ESTIMATE(s, 0.15),
       APPROX_PERCENTILE_ESTIMATE(s, 0.845)
     FROM testtable;
   ```

For a more extensive example, see the Examples section in
[APPROX_PERCENTILE_ACCUMULATE](approx_percentile_accumulate.md).

---
title: APPROX_TOP_K
source: https://docs.snowflake.com/en/sql-reference/functions/approx_top_k.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Frequency Estimation) , [Window functions](../functions-window.md)

# APPROX_TOP_K

Uses Space-Saving to return an approximation of the most frequent values in the input, along with their approximate frequencies.

The output is a JSON array of arrays. In the inner arrays, the first entry is a value in the input, and the second entry corresponds to its estimated frequency in the input. The outer array contains
`k` items, sorted by descending frequency.

For more information about APPROX_TOP_K, see [Estimating Frequent Values](../../user-guide/querying-approximate-frequent-values.md).

See also:
:   [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) , [APPROX_TOP_K_COMBINE](approx_top_k_combine.md), [APPROX_TOP_K_ESTIMATE](approx_top_k_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
APPROX_TOP_K( <expr> [ , <k> [ , <counters> ] ] )
```

**Window function**

```sqlsyntax
APPROX_TOP_K( <expr> [ , <k> [ , <counters> ] ] ) OVER ( [ PARTITION BY <expr4> ] )
```

## Arguments

* `expr`: The expression (e.g. column name) for which you want to find
  the most common values.
* `k`: The number of values whose counts you want approximated.
  For example, if you want to see the top 10 most common values, then
  set `k` to 10.

  If `k` is omitted, the default is `1`.

  The maximum value is `100000` (100,000), and is automatically reduced if
  items cannot fit in the output.
* `counters`: This is the maximum number of distinct values that
  can be tracked at a time during the estimation process. For example, if
  `counters` is set to 100000, then the algorithm tracks 100,000
  distinct values, attempting to keep the 100,000 most frequent values.

  The maximum number of `counters` is `100000` (100,000).

`expr4`
:   This is the optional expression used to group rows into partitions.

## Usage notes

* The approximation is more accurate if the number of `counters` is
  large, so in most cases `counters` should be considerably bigger
  than `k`.
  (Each counter uses only a small amount of memory, so increasing the number
  of counters is not expensive in terms of memory.)

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Examples

```sqlexample
SELECT APPROX_TOP_K(C4) FROM lineitem;
```

```output
+--------------------+
| APPROX_TOP_K(C4,3) |
+--------------------+
| [                  |
|   [                |
|     1,             |
|     124923         |
|   ],               |
|   [                |
|     2,             |
|     107093         |
|   ],               |
|   [                |
|     3,             |
|    89315           |
|   ]                |
| ]                  |
+--------------------+
```

```sqlexample
WITH states AS (
  SELECT approx_top_k(C4, 3, 5) AS state
  FROM lineitem)
SELECT value[0]::INT AS value, value[1]::INT AS frequency
  FROM states, LATERAL FLATTEN(state);
```

```output
+-------+-----------+
| VALUE | FREQUENCY |
+-------+-----------+
|     1 |    124923 |
|     2 |    107093 |
|     3 |     89438 |
+-------+-----------+
```

---
title: APPROX_TOP_K_ACCUMULATE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_top_k_accumulate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Frequency Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_TOP_K_ACCUMULATE

Returns the Space-Saving summary at the end of aggregation. (For more
information about the Space-Saving summary, see
[Estimating Frequent Values](../../user-guide/querying-approximate-frequent-values.md).)

The function [APPROX_TOP_K](approx_top_k.md) discards its internal, intermediate state
when the final cardinality estimate is returned. However, in certain advanced
use cases, such as estimating incremental frequent values during bulk loading,
you might want to keep the intermediate state, in which case you would use
APPROX_TOP_K_ACCUMULATE instead of [APPROX_TOP_K](approx_top_k.md).

In contrast to [APPROX_TOP_K](approx_top_k.md), APPROX_TOP_K_ACCUMULATE does not return a frequency estimate of items. Instead,
it returns the algorithm state itself. The intermediate state can later be:

* Combined (that is, merged) with intermediate states from separate but related
  batches of data.
* Processed by other functions that operate directly on the intermediate state,
  for example, [APPROX_TOP_K_ESTIMATE](approx_top_k_estimate.md). (For an example, see the
  Examples section below.)
* Exported to external tools.

See also:
:   [APPROX_TOP_K_COMBINE](approx_top_k_combine.md), [APPROX_TOP_K_ESTIMATE](approx_top_k_estimate.md)

## Syntax

```sqlsyntax
APPROX_TOP_K_ACCUMULATE( <expr> , <counters> )
```

## Arguments

`expr`
:   The expression (e.g. column name) for which you want to find the most common values.

`counters`
:   This is the maximum number of distinct values that can be tracked at a time during the estimation process.

    For example, if `counters` is set to 100000, then the algorithm tracks 100,000 distinct values, attempting to keep the
    100,000 most frequent values.

    The maximum number of `counters` is `100000` (100,000).

## Usage notes

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Examples

This example shows how to use the three related functions
APPROX_TOP_K_ACCUMULATE, APPROX_TOP_K_ESTIMATE, and APPROX_TOP_K_COMBINE.

> **Note:**
>
> This example uses more counters than distinct data values in order to get
> consistent results. In real-world applications, the number of distinct
> values is usually larger than the number of counters, so approximations can vary.

This example generates one table with 8 rows that have values 1 - 8, and a
second table with 8 rows that have values 5 - 12. Thus the most frequent
values in the union of the two tables are the values 5-8, each of which has a
count of 2.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq91;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq91.NEXTVAL, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate Top K information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation, the user could have
loaded more data into the first table and divided the data into non-overlapping sets based
on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT APPROX_TOP_K_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate Top K value of the combined set of rows:

```sqlexample
SELECT APPROX_TOP_K_ESTIMATE(c1, 4)
  FROM combined_resultstate;
```

```output
+------------------------------+
| APPROX_TOP_K_ESTIMATE(C1, 4) |
|------------------------------|
| [                            |
|   [                          |
|     5,                       |
|     2                        |
|   ],                         |
|   [                          |
|     6,                       |
|     2                        |
|   ],                         |
|   [                          |
|     7,                       |
|     2                        |
|   ],                         |
|   [                          |
|     8,                       |
|     2                        |
|   ]                          |
| ]                            |
+------------------------------+
```

---
title: APPROX_TOP_K_COMBINE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_top_k_combine.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Frequency Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_TOP_K_COMBINE

Combines (merges) input states into a single output state.

This allows scenarios where [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) is run over horizontal partitions of the same table, producing an algorithm state for each table partition. These states can later be combined
using APPROX_TOP_K_COMBINE, producing the same output state as a single run of [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) over the entire table.

See also:
:   [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) , [APPROX_TOP_K_ESTIMATE](approx_top_k_estimate.md)

## Syntax

```sqlsyntax
APPROX_TOP_K_COMBINE( <state> [ , <counters> ] )
```

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md).

`counters`
:   This is the maximum number of distinct values that
    can be tracked at a time during the estimation process. For example, if
    `counters` is set to 100000, then the algorithm tracks 100,000
    distinct values, attempting to keep the 100,000 most frequent values.

    The maximum number of `counters` is `100000` (100,000).

## Returns

This returns information about the “state” of the top K calculation.

This state information is not usually useful by itself, but can be passed to
the function APPROX_TOP_K_ESTIMATE.

## Usage notes

* If `counters` is defined, the output state uses the specified number of counters.
* If `counters` is not defined, all input states must have the same number of counters.

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Examples

This example shows how to use the three related functions
APPROX_TOP_K_ACCUMULATE, APPROX_TOP_K_ESTIMATE, and
APPROX_TOP_K_COMBINE.

> **Note:**
>
> This example uses more counters than distinct data values in order to get
> consistent results. In real-world applications, the number of distinct values
> is usually larger than the number of counters, so the approximations can vary.

This example generates one table with 8 rows that have values 1 - 8, and a
second table with 8 rows that have values 5 - 12. Thus the most frequent
values in the union of the two tables are the values 5-8, each of which has a
count of 2.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq91;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq91.NEXTVAL, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate Top K information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT APPROX_TOP_K_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate Top K value of the combined set of rows:

```sqlexample
SELECT APPROX_TOP_K_ESTIMATE(c1, 4)
  FROM combined_resultstate;
```

```output
+------------------------------+
| APPROX_TOP_K_ESTIMATE(C1, 4) |
|------------------------------|
| [                            |
|   [                          |
|     5,                       |
|     2                        |
|   ],                         |
|   [                          |
|     6,                       |
|     2                        |
|   ],                         |
|   [                          |
|     7,                       |
|     2                        |
|   ],                         |
|   [                          |
|     8,                       |
|     2                        |
|   ]                          |
| ]                            |
+------------------------------+
```

---
title: APPROX_TOP_K_ESTIMATE
source: https://docs.snowflake.com/en/sql-reference/functions/approx_top_k_estimate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Frequency Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROX_TOP_K_ESTIMATE

Returns the approximate most frequent values and their estimated frequency for the given Space-Saving state. (For more information about the Space-Saving
summary, see [Estimating Frequent Values](../../user-guide/querying-approximate-frequent-values.md).)

A Space-Saving state produced by [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) and [APPROX_TOP_K_COMBINE](approx_top_k_combine.md) can be used to compute a cardinality estimate using the APPROX_TOP_K_ESTIMATE function.

Thus, APPROX_TOP_K_ESTIMATE(APPROX_TOP_K_ACCUMULATE(…)) is equivalent to APPROX_TOP_K(…).

See also:
:   [APPROX_TOP_K](approx_top_k.md) , [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) , [APPROX_TOP_K_COMBINE](approx_top_k_combine.md)

## Syntax

```sqlsyntax
APPROX_TOP_K_ESTIMATE( <state> [ , <k> ] )
```

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [APPROX_TOP_K_ACCUMULATE](approx_top_k_accumulate.md) or
    [APPROX_TOP_K_COMBINE](approx_top_k_combine.md).

`k`
:   The number of values whose counts you want approximated.
    For example, if you want to see the top 10 most common values, then
    set `k` to 10.

    If `k` is omitted, the default is `1`.

    The maximum value is `100000` (100,000), and is automatically reduced if
    items cannot fit in the output.

## Returns

Returns a value of type ARRAY.

## Usage notes

* Decimal-float ([DECFLOAT](../data-types-numeric.md)) values aren’t supported.

## Examples

This example shows how to use the three related functions
APPROX_TOP_K_ACCUMULATE, APPROX_TOP_K_ESTIMATE, and APPROX_TOP_K_COMBINE.

> **Note:**
>
> This example uses more counters than distinct data values in order to get
> consistent results. In real-world applications, the number of distinct values
> is usually larger than the number of counters, so the approximations can vary.

This example generates one table with 8 rows that have values 1 - 8, and a
second table with 8 rows that have values 5 - 12. Thus the most frequent
values in the union of the two tables are the values 5-8, each of which has a
count of 2.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq91;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq91.NEXTVAL, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate Top K information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT APPROX_TOP_K_ACCUMULATE(c1, 50) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT APPROX_TOP_K_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate Top K value of the combined set of rows:

```sqlexample
SELECT APPROX_TOP_K_ESTIMATE(c1, 4)
  FROM combined_resultstate;
```

```output
+------------------------------+
| APPROX_TOP_K_ESTIMATE(C1, 4) |
|------------------------------|
| [                            |
|   [                          |
|     5,                       |
|     2                        |
|   ],                         |
|   [                          |
|     6,                       |
|     2                        |
|   ],                         |
|   [                          |
|     7,                       |
|     2                        |
|   ],                         |
|   [                          |
|     8,                       |
|     2                        |
|   ]                          |
| ]                            |
+------------------------------+
```

---
title: APPROXIMATE_JACCARD_INDEX
source: https://docs.snowflake.com/en/sql-reference/functions/approximate_jaccard_index.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Similarity Estimation) , [Window functions](../functions-window.md)

# APPROXIMATE_JACCARD_INDEX

Returns an estimation of the similarity (Jaccard index) of inputs based on their MinHash states. For more information about Jaccard indexes and the related
function [MINHASH](minhash.md), see [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

Alias for [APPROXIMATE_SIMILARITY](approximate_similarity.md)

## Syntax

```sqlsyntax
APPROXIMATE_JACCARD_INDEX( [ DISTINCT ] <expr> [ , ... ] )

APPROXIMATE_JACCARD_INDEX(*)
```

## Arguments

`expr`
:   The expression(s) should be one or more MinHash states returned by calls to
    the [MINHASH](minhash.md) function. In other words, the
    expressions must be `MinHash` state information, not the column or
    expression for which you want the approximate similarity. (The example below
    helps make this clear.)

    For more information about MinHash states, see
    [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

## Returns

A floating point number between 0.0 and 1.0 (inclusive), where 1.0 indicates
that the sets are identical, and 0.0 indicates that the sets have no overlap.

## Usage notes

* `DISTINCT` can be included as an argument, but has no effect.
* The input MinHash states must have MinHash arrays of equal length.
* The array length of the input MinHash states is an indicator of the quality of approximation.

  The larger the value of `k` used in function [MINHASH](minhash.md), the better the approximation. However, this value has a linear impact on the computation time for estimating similarity.

## Examples

```sqlexample
USE SCHEMA snowflake_sample_data.tpch_sf1;

SELECT APPROXIMATE_JACCARD_INDEX(mh) FROM
    (
      (SELECT MINHASH(100, C5) mh FROM orders WHERE c2 <= 50000)
         UNION
      (SELECT MINHASH(100, C5) mh FROM orders WHERE C2 > 50000)
    );

+-------------------------------+
| APPROXIMATE_JACCARD_INDEX(MH) |
|-------------------------------|
|                          0.97 |
+-------------------------------+
```

---
title: APPROXIMATE_SIMILARITY
source: https://docs.snowflake.com/en/sql-reference/functions/approximate_similarity.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Similarity Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# APPROXIMATE_SIMILARITY

Returns an estimation of the similarity (Jaccard index) of inputs based on their MinHash states. For more information about MinHash states, see [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

Aliases:
:   [APPROXIMATE_JACCARD_INDEX](approximate_jaccard_index.md)

See also:
:   [MINHASH](minhash.md) , [MINHASH_COMBINE](minhash_combine.md)

## Syntax

```sqlsyntax
APPROXIMATE_SIMILARITY( [ DISTINCT ] <expr> [ , ... ] )

APPROXIMATE_SIMILARITY(*)
```

## Arguments

`expr`
:   The expression(s) should be one or more MinHash states returned by calls to
    the [MINHASH](minhash.md) function. In other words, the
    expressions must be `MinHash` state information, not the column or
    expression for which you want the approximate similarity. (The example below
    helps make this clear.)

    For more information about MinHash states, see
    [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

## Returns

A floating point number between 0.0 and 1.0 (inclusive), where 1.0 indicates
that the sets are identical, and 0.0 indicates that the sets have no overlap.

## Usage notes

* `DISTINCT` can be included as an argument, but has no effect.
* The input MinHash states must have MinHash arrays of equal length.
* The array length of the input MinHash states is an indicator of the quality of approximation.

  The larger the value of `k` used in function [MINHASH](minhash.md), the better the approximation. However, this value has a linear impact on the computation time for estimating similarity.

## Examples

```sqlexample
USE SCHEMA snowflake_sample_data.tpch_sf1;

SELECT APPROXIMATE_SIMILARITY(mh) FROM
    (
      (SELECT MINHASH(100, C5) mh FROM orders WHERE c2 <= 50000)
         UNION
      (SELECT MINHASH(100, C5) mh FROM orders WHERE C2 > 50000)
    );

+----------------------------+
| APPROXIMATE_SIMILARITY(MH) |
|----------------------------|
|                       0.97 |
+----------------------------+
```

Here is a more extensive example, showing the three related functions
MINHASH, MINHASH_COMBINE and APPROXIMATE_SIMILARITY. This
example creates 3 tables (`ta`, `tb`, and `tc`), two of which (`ta` and `tb`) are
similar, and two of which (`ta` and `tc`) are completely dissimilar.

Create and populate tables with values:

```sqlexample
CREATE TABLE ta (i INTEGER);
CREATE TABLE tb (i INTEGER);
CREATE TABLE tc (i INTEGER);

INSERT INTO ta (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (10);
INSERT INTO tb (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (11);
INSERT INTO tc (i) VALUES (-1), (-20), (-300), (-4000);
```

Calculate minhash info for the initial set of data:

```sqlexample
CREATE TABLE minhash_a_1 (mh) AS SELECT MINHASH(100, i) FROM ta;
CREATE TABLE minhash_b (mh) AS SELECT MINHASH(100, i) FROM tb;
CREATE TABLE minhash_c (mh) AS SELECT MINHASH(100, i) FROM tc;
```

Add more data to one of the tables:

```sqlexample
INSERT INTO ta (i) VALUES (12);
```

Demonstrate the MINHASH_COMBINE function:

```sqlexample
CREATE TABLE minhash_a_2 (mh) AS SELECT MINHASH(100, i) FROM ta WHERE i > 10;

CREATE TABLE minhash_a (mh) AS
  SELECT MINHASH_COMBINE(mh)
    FROM (
      (SELECT mh FROM minhash_a_1)
      UNION ALL
      (SELECT mh FROM minhash_a_2)
    );
```

This query shows the approximate similarity of the two similar tables
(`ta` and `tb`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_b)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                        0.75 |
+-----------------------------+
```

This query shows the approximate similarity of the two very different tables
(`ta` and `tc`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_c)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                           0 |
+-----------------------------+
```

---
title: ARRAY_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/array_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Semi-structured Data) , [Window functions](../functions-window.md) (General) , [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_AGG

Returns the input values, pivoted into an array. If the input is empty, the function returns an empty array.

Aliases:
:   ARRAYAGG

## Syntax

**Aggregate function**

```sqlsyntax
ARRAY_AGG( [ DISTINCT ] <expr1> ) [ WITHIN GROUP ( <orderby_clause> ) ]
```

**Window function**

```sqlsyntax
ARRAY_AGG( [ DISTINCT ] <expr1> )
  [ WITHIN GROUP ( <orderby_clause> ) ]
  OVER ( [ PARTITION BY <expr2> ] [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] ] [ <window_frame> ] )
```

## Arguments

**Required:**

`expr1`
:   An expression (typically a column name) that determines the values to be put into the array.

`OVER()`
:   The OVER clause specifies that the function is being used as a window function.
    For details, see [Window function syntax and usage](../functions-window-syntax.md).

**Optional:**

`DISTINCT`
:   Removes duplicate values from the array.

`WITHIN GROUP orderby_clause`
:   Clause that contains one or more expressions (typically column names) that determine the order of the values in each array.

    The WITHIN GROUP(ORDER BY) syntax supports the same parameters as the main ORDER BY clause in a SELECT statement.
    See [ORDER BY](../constructs/order-by.md).

`PARTITION BY expr2`
:   Window function clause that specifies an expression (typically a column name).
    This expression defines partitions that group the input rows before the function is applied.
    For details, see [Window function syntax and usage](../functions-window-syntax.md).

`ORDER BY expr3` [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ `{window_frame}` ]
:   Optional expression to order by within each partition, followed by an optional window frame. For detailed
    `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

    When this function is used with a range-based frame, the ORDER BY clause supports only a single column.
    Row-based frames do not have this restriction.

LIMIT is not supported.

## Returns

Returns a value of type ARRAY.

The maximum amount of data that ARRAY_AGG can return for a single call is 128 MB.

## Usage notes

* If you do not specify WITHIN GROUP(ORDER BY), the order of
  elements within each array is unpredictable. (An ORDER BY clause outside
  the WITHIN GROUP clause applies to the order of the output rows, not to
  the order of the array elements within a row.)
* If you specify a number for an expression in WITHIN GROUP(ORDER BY), this number is parsed as a numeric
  constant, not as the ordinal position of a column in the SELECT list. Therefore, do not specify
  numbers as WITHIN GROUP(ORDER BY) expressions.
* If you specify DISTINCT and WITHIN GROUP, both must refer to the same column. For example:

  ```sqlexample
  SELECT ARRAY_AGG(DISTINCT O_ORDERKEY) WITHIN GROUP (ORDER BY O_ORDERKEY) ...;
  ```

  If you specify different columns for DISTINCT and WITHIN GROUP, an error occurs:

  ```sqlexample
  SELECT ARRAY_AGG(DISTINCT O_ORDERKEY) WITHIN GROUP (ORDER BY O_ORDERSTATUS) ...;
  ```

  ```output
  SQL compilation error: [ORDERS.O_ORDERSTATUS] is not a valid order by expression
  ```

  You must either specify the same column for DISTINCT and WITHIN GROUP or omit DISTINCT.
* DISTINCT and WITHIN GROUP are supported for window function calls only when there is no ORDER BY clause
  within the OVER clause. When an ORDER BY clause is used in the OVER clause, values in the output array
  follow the same default order (that is, the order equivalent to `WITHIN GROUP (ORDER BY expr3)`).
* NULL values are omitted from the output.

## Examples

The example queries below use the tables and data shown below:

```sqlexample
CREATE TABLE orders (
  o_orderkey INTEGER,
  o_clerk VARCHAR,
  o_totalprice NUMBER(12, 2),
  o_orderstatus CHAR(1)
);

INSERT INTO orders (o_orderkey, o_orderstatus, o_clerk, o_totalprice)
  VALUES
    ( 32123, 'O', 'Clerk#000000321',     321.23),
    ( 41445, 'F', 'Clerk#000000386', 1041445.00),
    ( 55937, 'O', 'Clerk#000000114', 1055937.00),
    ( 67781, 'F', 'Clerk#000000521', 1067781.00),
    ( 80550, 'O', 'Clerk#000000411', 1080550.00),
    ( 95808, 'F', 'Clerk#000000136', 1095808.00),
    (101700, 'O', 'Clerk#000000220', 1101700.00),
    (103136, 'F', 'Clerk#000000508', 1103136.00);
```

This example shows non-pivoted output from a query that does not use ARRAY_AGG().
The contrast in output between this example and the following example
shows that ARRAY_AGG() pivots the data.

```sqlexample
SELECT o_orderkey AS order_keys
  FROM orders
  WHERE o_totalprice > 450000
  ORDER BY o_orderkey;
```

```output
+------------+
| ORDER_KEYS |
|------------|
|      41445 |
|      55937 |
|      67781 |
|      80550 |
|      95808 |
|     101700 |
|     103136 |
+------------+
```

This example shows how to use ARRAY_AGG() to pivot a column of output
into an array in a single row:

```sqlexample
SELECT ARRAY_AGG(o_orderkey) WITHIN GROUP (ORDER BY o_orderkey ASC)
  FROM orders
  WHERE o_totalprice > 450000;
```

```output
+--------------------------------------------------------------+
| ARRAY_AGG(O_ORDERKEY) WITHIN GROUP (ORDER BY O_ORDERKEY ASC) |
|--------------------------------------------------------------|
| [                                                            |
|   41445,                                                     |
|   55937,                                                     |
|   67781,                                                     |
|   80550,                                                     |
|   95808,                                                     |
|   101700,                                                    |
|   103136                                                     |
| ]                                                            |
+--------------------------------------------------------------+
```

This example shows the use of the DISTINCT keyword with ARRAY_AGG().

```sqlexample
SELECT ARRAY_AGG(DISTINCT o_orderstatus) WITHIN GROUP (ORDER BY o_orderstatus ASC)
  FROM orders
  WHERE o_totalprice > 450000
  ORDER BY o_orderstatus ASC;
```

```output
+-----------------------------------------------------------------------------+
| ARRAY_AGG(DISTINCT O_ORDERSTATUS) WITHIN GROUP (ORDER BY O_ORDERSTATUS ASC) |
|-----------------------------------------------------------------------------|
| [                                                                           |
|   "F",                                                                      |
|   "O"                                                                       |
| ]                                                                           |
+-----------------------------------------------------------------------------+
```

This example uses two separate ORDER BY clauses. One controls
the order within the output array inside each row, and the other controls
the order of the output rows:

```sqlexample
SELECT
    o_orderstatus,
    ARRAYAGG(o_clerk) WITHIN GROUP (ORDER BY o_totalprice DESC)
  FROM orders
  WHERE o_totalprice > 450000
  GROUP BY o_orderstatus
  ORDER BY o_orderstatus DESC;
```

```output
+---------------+-------------------------------------------------------------+
| O_ORDERSTATUS | ARRAYAGG(O_CLERK) WITHIN GROUP (ORDER BY O_TOTALPRICE DESC) |
|---------------+-------------------------------------------------------------|
| O             | [                                                           |
|               |   "Clerk#000000220",                                        |
|               |   "Clerk#000000411",                                        |
|               |   "Clerk#000000114"                                         |
|               | ]                                                           |
| F             | [                                                           |
|               |   "Clerk#000000508",                                        |
|               |   "Clerk#000000136",                                        |
|               |   "Clerk#000000521",                                        |
|               |   "Clerk#000000386"                                         |
|               | ]                                                           |
+---------------+-------------------------------------------------------------+
```

The following example uses a different data set. The ARRAY_AGG function is called as a window
function with a ROWS BETWEEN window frame. First, create the table and load it with 14 rows:

```sqlexample
CREATE OR REPLACE TABLE array_data AS (
WITH data AS (
  SELECT 1 a, [1,3,2,4,7,8,10] b
  UNION ALL
  SELECT 2, [1,3,2,4,7,8,10]
  )
SELECT 'Ord'||a o_orderkey, 'c'||value o_clerk, index
  FROM data, TABLE(FLATTEN(b))
);
```

Now run the following query. Note that only a partial result set is shown here.

```sqlexample
SELECT o_orderkey,
    ARRAY_AGG(o_clerk) OVER(PARTITION BY o_orderkey ORDER BY o_orderkey
      ROWS BETWEEN 3 PRECEDING AND CURRENT ROW) AS result
  FROM array_data;
```

```output
+------------+---------+
| O_ORDERKEY | RESULT  |
|------------+---------|
| Ord1       | [       |
|            |   "c1"  |
|            | ]       |
| Ord1       | [       |
|            |   "c1", |
|            |   "c3"  |
|            | ]       |
| Ord1       | [       |
|            |   "c1", |
|            |   "c3", |
|            |   "c2"  |
|            | ]       |
| Ord1       | [       |
|            |   "c1", |
|            |   "c3", |
|            |   "c2", |
|            |   "c4"  |
|            | ]       |
| Ord1       | [       |
|            |   "c3", |
|            |   "c2", |
|            |   "c4", |
|            |   "c7"  |
|            | ]       |
| Ord1       | [       |
|            |   "c2", |
|            |   "c4", |
|            |   "c7", |
|            |   "c8"  |
|            | ]       |
| Ord1       | [       |
|            |   "c4", |
|            |   "c7", |
|            |   "c8", |
|            |   "c10" |
|            | ]       |
| Ord2       | [       |
|            |   "c1"  |
|            | ]       |
| Ord2       | [       |
|            |   "c1", |
|            |   "c3"  |
|            | ]       |
...
```

---
title: ARRAY_APPEND
source: https://docs.snowflake.com/en/sql-reference/functions/array_append.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_APPEND

Returns an array containing all elements from the source array as well as the new element. The new element is located at the end of the array.

See also:
:   [ARRAY_INSERT](array_insert.md) , [ARRAY_PREPEND](array_prepend.md)

## Syntax

```sqlsyntax
ARRAY_APPEND( <array> , <new_element> )
```

## Arguments

`array`
:   The source array.

`new_element`
:   The element to be appended. The type of the element depends on the type of the array:

    * If `array` is a [semi-structured array](../data-types-semistructured.md), the element can be of almost any data type.
      The data type can be different from the data type(s) of the existing elements in the array.
    * If `array` is a [structured array](../data-types-structured.md), the type of the new element must
      be [coercible](../data-types-structured.md) to the type of the array.

## Returns

The data type of the returned value is ARRAY.

When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
array of the same type.

If the source array is NULL, the function returns NULL.

## Examples

The examples use the following table with an ARRAY column:

```sqlexample
CREATE OR REPLACE TABLE array_append_examples (array_column ARRAY);

INSERT INTO array_append_examples (array_column)
  SELECT ARRAY_CONSTRUCT(1, 2, 3);

SELECT * FROM array_append_examples;
```

```output
+--------------+
| ARRAY_COLUMN |
|--------------|
| [            |
|   1,         |
|   2,         |
|   3          |
| ]            |
+--------------+
```

Add an element of the same type to the array:

```sqlexample
UPDATE array_append_examples
  SET array_column = ARRAY_APPEND(array_column, 4);
```

Query the table to see the new element added to the array:

```sqlexample
SELECT * FROM array_append_examples;
```

```output
+--------------+
| ARRAY_COLUMN |
|--------------|
| [            |
|   1,         |
|   2,         |
|   3,         |
|   4          |
| ]            |
+--------------+
```

Add an element of a different type to the array:

```sqlexample
UPDATE array_append_examples
  SET array_column = ARRAY_APPEND(array_column, 'five');
```

Query the table to see the new element added to the array and the data type of each element in the array:

```sqlexample
SELECT array_column,
       ARRAY_CONSTRUCT(
        TYPEOF(array_column[0]),
        TYPEOF(array_column[1]),
        TYPEOF(array_column[2]),
        TYPEOF(array_column[3]),
        TYPEOF(array_column[4])) AS type
  FROM array_append_examples;
```

```output
+--------------+--------------+
| ARRAY_COLUMN | TYPE         |
|--------------+--------------|
| [            | [            |
|   1,         |   "INTEGER", |
|   2,         |   "INTEGER", |
|   3,         |   "INTEGER", |
|   4,         |   "INTEGER", |
|   "five"     |   "VARCHAR"  |
| ]            | ]            |
+--------------+--------------+
```

---
title: ARRAY_CAT
source: https://docs.snowflake.com/en/sql-reference/functions/array_cat.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_CAT

Returns a concatenation of two arrays.

## Syntax

```sqlsyntax
ARRAY_CAT( <array1> , <array2> )
```

## Arguments

`array1`
:   The source array.

`array2`
:   The array to be appended to `array1`.

## Returns

An ARRAY containing the elements from `array2` appended after the elements of `array1`.

## Usage notes

* Both arguments must either be [structured ARRAYs](../data-types-structured.md) or
  [semi-structured ARRAYs](../data-types-semistructured.md).

* If you are passing in semi-structured ARRAYs, both arguments must be of ARRAY type or VARIANT containing an array.
* If you are passing in structured ARRAYs, the function returns an ARRAY of a type that can accommodate both input types.
* If either argument is NULL, the function returns NULL without reporting any error.

## Examples

This example shows how to use `ARRAY_CAT()`:

> Create a simple table and data:
>
> > ```sqlexample
> > CREATE TABLE array_demo (ID INTEGER, array1 ARRAY, array2 ARRAY);
> > ```
> >
> > ```sqlexample
> > INSERT INTO array_demo (ID, array1, array2)
> >     SELECT 1, ARRAY_CONSTRUCT(1, 2), ARRAY_CONSTRUCT(3, 4);
> > ```
>
> Execute the query:
>
> > ```sqlexample
> > SELECT ARRAY_CAT(array1, array2) FROM array_demo;
> > +---------------------------+
> > | ARRAY_CAT(ARRAY1, ARRAY2) |
> > |---------------------------|
> > | [                         |
> > |   1,                      |
> > |   2,                      |
> > |   3,                      |
> > |   4                       |
> > | ]                         |
> > +---------------------------+
> > ```

---
title: ARRAY_COMPACT
source: https://docs.snowflake.com/en/sql-reference/functions/array_compact.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_COMPACT

Returns a compacted array with missing and null values removed, effectively converting sparse arrays into dense arrays.

## Syntax

```sqlsyntax
ARRAY_COMPACT( <array1> )
```

## Arguments

`array1`
:   The source array.

## Usage notes

* Semi-structured data (e.g. JSON data) can contain explicit null values, which are distinct from SQL NULLs. A null value in semi-structured data indicates a missing value.
* `array1` should be either an ARRAY data type or a VARIANT data type containing an array value.
* If the argument is NULL, the result will be NULL.
* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.

## Examples

This example shows how to use `ARRAY_COMPACT()`:

> Create a simple table and data:
>
> > ```sqlexample
> > CREATE TABLE array_demo (ID INTEGER, array1 ARRAY, array2 ARRAY);
> > ```
> >
> > ```sqlexample
> > INSERT INTO array_demo (ID, array1, array2)
> >     SELECT 2, ARRAY_CONSTRUCT(10, NULL, 30), ARRAY_CONSTRUCT(40);
> > ```
>
> Execute the query:
>
> > ```sqlexample
> > SELECT array1, ARRAY_COMPACT(array1) FROM array_demo WHERE ID = 2;
> > +--------------+-----------------------+
> > | ARRAY1       | ARRAY_COMPACT(ARRAY1) |
> > |--------------+-----------------------|
> > | [            | [                     |
> > |   10,        |   10,                 |
> > |   undefined, |   30                  |
> > |   30         | ]                     |
> > | ]            |                       |
> > +--------------+-----------------------+
> > ```

---
title: ARRAY_CONSTRUCT
source: https://docs.snowflake.com/en/sql-reference/functions/array_construct.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_CONSTRUCT

Returns an array constructed from zero, one, or more inputs.

For more information about constructing and using arrays, see [ARRAY](../data-types-semistructured.md).

See also:
:   [ARRAY_CONSTRUCT_COMPACT](array_construct_compact.md)

## Syntax

```sqlsyntax
ARRAY_CONSTRUCT( [ <expr1> ] [ , <expr2> [ , ... ] ] )
```

## Arguments

The arguments are values (or expressions that evaluate to values). The argument values can be
different data types.

## Returns

The data type of the returned value is ARRAY.

## Usage notes

* If the function is called with `N` arguments, the size of the resulting array is `N`.
* In many contexts, you can use an [ARRAY constant](../data-types-semistructured.md) (also called an *ARRAY literal*) instead of
  the ARRAY_CONSTRUCT function.
* An array can contain both SQL NULL values and JSON null values. For more information, see [NULL values](../../user-guide/semistructured-considerations.md).

## Examples

Construct a basic array consisting of numeric data types:

```sqlexample
SELECT ARRAY_CONSTRUCT(10, 20, 30);
```

```output
+-----------------------------+
| ARRAY_CONSTRUCT(10, 20, 30) |
|-----------------------------|
| [                           |
|   10,                       |
|   20,                       |
|   30                        |
| ]                           |
+-----------------------------+
```

Construct a basic array consisting of different data types, including a SQL NULL value (`undefined`) and
a JSON null value (`null`):

```sqlexample
SELECT ARRAY_CONSTRUCT(NULL, PARSE_JSON('null'), 'hello', 3::DOUBLE, 4, 5);
```

```output
+---------------------------------------------------------------------+
| ARRAY_CONSTRUCT(NULL, PARSE_JSON('NULL'), 'HELLO', 3::DOUBLE, 4, 5) |
|---------------------------------------------------------------------|
| [                                                                   |
|   undefined,                                                        |
|   null,                                                             |
|   "hello",                                                          |
|   3.000000000000000e+00,                                            |
|   4,                                                                |
|   5                                                                 |
| ]                                                                   |
+---------------------------------------------------------------------+
```

Construct an empty array:

```sqlexample
SELECT ARRAY_CONSTRUCT();
```

```output
+-------------------+
| ARRAY_CONSTRUCT() |
|-------------------|
| []                |
+-------------------+
```

Create a table and insert arrays into an ARRAY column:

```sqlexample
CREATE OR REPLACE TABLE construct_array_example (id INT, array_column ARRAY);

INSERT INTO construct_array_example (id, array_column)
  SELECT 1,
         ARRAY_CONSTRUCT(1, 2, 3);

INSERT INTO construct_array_example (id, array_column)
  SELECT 2,
         ARRAY_CONSTRUCT(4, 5, 6);

INSERT INTO construct_array_example (id, array_column)
  SELECT 3,
         ARRAY_CONSTRUCT(7, 8, 9);

SELECT * FROM construct_array_example;
```

```output
+----+--------------+
| ID | ARRAY_COLUMN |
|----+--------------|
|  1 | [            |
|    |   1,         |
|    |   2,         |
|    |   3          |
|    | ]            |
|  2 | [            |
|    |   4,         |
|    |   5,         |
|    |   6          |
|    | ]            |
|  3 | [            |
|    |   7,         |
|    |   8,         |
|    |   9          |
|    | ]            |
+----+--------------+
```

---
title: ARRAY_CONSTRUCT_COMPACT
source: https://docs.snowflake.com/en/sql-reference/functions/array_construct_compact.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_CONSTRUCT_COMPACT

Returns an array constructed from zero, one, or more inputs; the constructed
array omits any NULL input values.

See also:
:   [ARRAY_CONSTRUCT](array_construct.md)

## Syntax

```sqlsyntax
ARRAY_CONSTRUCT_COMPACT( [ <expr1> ] [ , <expr2> [ , ... ] ] )
```

## Arguments

`expr#`
:   These are the input expressions to evaluate; the resulting values are put into the array.
    The expressions do not all need to evaluate to the same data type.

## Returns

The data type of the returned value is `ARRAY`.

## Usage notes

* SQL NULL values are skipped when building the result array, resulting in a compacted (i.e. dense) array.

## Examples

Construct a basic dense array consisting of different data types:

```sqlexample
SELECT ARRAY_CONSTRUCT_COMPACT(null,'hello',3::double,4,5);
+-----------------------------------------------------+
| ARRAY_CONSTRUCT_COMPACT(NULL,'HELLO',3::DOUBLE,4,5) |
|-----------------------------------------------------|
| [                                                   |
|   "hello",                                          |
|   3.000000000000000e+00,                            |
|   4,                                                |
|   5                                                 |
| ]                                                   |
+-----------------------------------------------------+
```

---
title: ARRAY_CONTAINS
source: https://docs.snowflake.com/en/sql-reference/functions/array_contains.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_CONTAINS

Returns TRUE if the specified value is found in the specified array.

## Syntax

```sqlsyntax
ARRAY_CONTAINS( <value_expr> , <array> )
```

## Arguments

`value_expr`
:   Value to find in `array`.

    * If `array` is a [semi-structured array](../data-types-semistructured.md), `value_expr` must evaluate to a
      [VARIANT](../data-types-semistructured.md).
    * If `array` is a [structured array](../data-types-structured.md), `value_expr` must evaluate
      to a type that is [comparable](../data-types-structured.md) to the type of the array.

`array`
:   The array to search.

## Returns

This function returns a value of BOOLEAN type or NULL:

* The function returns TRUE if `value_expr` is present in `array`, including the following cases:

  + When the `value_expr` argument is NULL and there is a SQL NULL value in the array (`undefined`).
  + When the `value_expr` argument is JSON null and there is a JSON null value in the array (`null`).
* The function returns FALSE if `value_expr` isn’t present in `array`, including when the
  `value_expr` argument is JSON null and there are no JSON null values in the array.
* The function returns NULL if the `value_expr` argument is NULL and there are no SQL NULL values in the array.

For more information about NULL values in arrays, see [NULL values](../../user-guide/semistructured-considerations.md).

## Usage notes

* The function does not support wildcards in `value_expr`. However, you can
  use the [ARRAY_TO_STRING](array_to_string.md) function to convert an array to a string, then search the
  string with wildcard characters. For example, you can specify wildcards to search the
  returned string using the [[ NOT ] LIKE](like.md) and [REGEXP_LIKE](regexp_like.md) functions.
* If `array` is a semi-structured array, [explicit casting](../data-type-conversion.md)
  of the `value_expr` value to a VARIANT value is required for values of the following data types:

  + [String & binary](../data-types-text.md)
  + [Date & time](../data-types-datetime.md)

  The following example explicitly casts a string value to a VARIANT value:

  ```sqlexample
  SELECT ARRAY_CONTAINS('mystring2'::VARIANT, ARRAY_CONSTRUCT('mystring1', 'mystring2'));
  ```

  Explicit casting isn’t required for values of other data types.

## Examples

The following queries use the ARRAY_CONTAINS function in a SELECT list.

In this example, the function returns TRUE because the `value_expr` argument is `'hello'`
and the array contains a VARIANT value that stores the string `'hello'`:

```sqlexample
SELECT ARRAY_CONTAINS('hello'::VARIANT, ARRAY_CONSTRUCT('hello', 'hi'));
```

```output
+------------------------------------------------------------------+
| ARRAY_CONTAINS('HELLO'::VARIANT, ARRAY_CONSTRUCT('HELLO', 'HI')) |
|------------------------------------------------------------------|
| True                                                             |
+------------------------------------------------------------------+
```

In this example, the function returns FALSE because the `value_expr` argument is `'hello'`
but the array doesn’t contain a VARIANT value that stores the string `'hello'`:

```sqlexample
SELECT ARRAY_CONTAINS('hello'::VARIANT, ARRAY_CONSTRUCT('hola', 'bonjour'));
```

```output
+----------------------------------------------------------------------+
| ARRAY_CONTAINS('HELLO'::VARIANT, ARRAY_CONSTRUCT('HOLA', 'BONJOUR')) |
|----------------------------------------------------------------------|
| False                                                                |
+----------------------------------------------------------------------+
```

In this example, the function returns NULL because the `value_expr` argument is NULL but
the array doesn’t contain a SQL NULL value:

```sqlexample
SELECT ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('hola', 'bonjour'));
```

```output
+----------------------------------------------------------+
| ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('HOLA', 'BONJOUR')) |
|----------------------------------------------------------|
| NULL                                                     |
+----------------------------------------------------------+
```

In this example, the function returns TRUE because the `value_expr` argument is NULL and
the array contains a SQL NULL value:

```sqlexample
SELECT ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('hola', NULL));
```

```output
+-----------------------------------------------------+
| ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('HOLA', NULL)) |
|-----------------------------------------------------|
| True                                                |
+-----------------------------------------------------+
```

In this example, the function returns TRUE because the `value_expr` argument is a
JSON null value and the array contains a JSON null value:

```sqlexample
SELECT ARRAY_CONTAINS(PARSE_JSON('null'), ARRAY_CONSTRUCT('hola', PARSE_JSON('null')));
```

```output
+---------------------------------------------------------------------------------+
| ARRAY_CONTAINS(PARSE_JSON('NULL'), ARRAY_CONSTRUCT('HOLA', PARSE_JSON('NULL'))) |
|---------------------------------------------------------------------------------|
| True                                                                            |
+---------------------------------------------------------------------------------+
```

In this example, the function returns NULL because the `value_expr` argument is
NULL but the array doesn’t contain a SQL NULL value (although it does contain a JSON null value):

```sqlexample
SELECT ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('hola', PARSE_JSON('null')));
```

```output
+-------------------------------------------------------------------+
| ARRAY_CONTAINS(NULL, ARRAY_CONSTRUCT('HOLA', PARSE_JSON('NULL'))) |
|-------------------------------------------------------------------|
| NULL                                                              |
+-------------------------------------------------------------------+
```

The following query uses the ARRAY_CONTAINS function in a WHERE clause. First, create a
table with an ARRAY column and insert data:

```sqlexample
CREATE OR REPLACE TABLE array_example (id INT, array_column ARRAY);

INSERT INTO array_example (id, array_column)
  SELECT 1, ARRAY_CONSTRUCT(1, 2, 3);

INSERT INTO array_example (id, array_column)
  SELECT 2, ARRAY_CONSTRUCT(4, 5, 6);

SELECT * FROM array_example;
```

```output
+----+--------------+
| ID | ARRAY_COLUMN |
|----+--------------|
|  1 | [            |
|    |   1,         |
|    |   2,         |
|    |   3          |
|    | ]            |
|  2 | [            |
|    |   4,         |
|    |   5,         |
|    |   6          |
|    | ]            |
+----+--------------+
```

Run a query that specifies the value to find for `value_expr` and the
ARRAY column for `array`:

```sqlexample
SELECT * FROM array_example WHERE ARRAY_CONTAINS(5, array_column);
```

```output
+----+--------------+
| ID | ARRAY_COLUMN |
|----+--------------|
|  2 | [            |
|    |   4,         |
|    |   5,         |
|    |   6          |
|    | ]            |
+----+--------------+
```

---
title: ARRAY_DISTINCT
source: https://docs.snowflake.com/en/sql-reference/functions/array_distinct.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_DISTINCT

Returns a new [ARRAY](../data-types-semistructured.md) that contains only the distinct elements from the input ARRAY. The function
excludes any duplicate elements that are present in the input ARRAY.

The function is not guaranteed to return the elements in the ARRAY in a specific order.

The function is NULL-safe, which means that it treats NULLs as known values when identifying duplicate elements.

## Syntax

```sqlsyntax
ARRAY_DISTINCT( <array> )
```

## Arguments

`array`
:   An array that might contain duplicate elements to be removed.

## Returns

This function returns an ARRAY that contains the elements of the input array without any duplicate elements. For example, if the
value `'x'` appears multiple times in the input ARRAY, only one element has the value `'x'` in the returned ARRAY.

If the input argument is NULL, the function returns NULL.

The order of the values within the returned array is unspecified.

## Usage notes

* For elements of the type [OBJECT](../data-types-semistructured.md), the objects must be identical to be considered duplicate. For
  details, see Examples (in this topic).
* When identifying duplicate elements, the function considers NULL to be a known value (i.e. NULL is not a duplicate of any other
  value X besides NULL).

## Examples

The following example demonstrates how the function returns an ARRAY without the duplicate elements `A` and `NULL` from an
input [ARRAY constant](../data-types-semistructured.md):

```sqlexample
SELECT ARRAY_DISTINCT(['A', 'A', 'B', NULL, NULL]);

+---------------------------------------------+
| ARRAY_DISTINCT(['A', 'A', 'B', NULL, NULL]) |
|---------------------------------------------|
| [                                           |
|   "A",                                      |
|   "B",                                      |
|   undefined                                 |
| ]                                           |
+---------------------------------------------+
```

The following example demonstrates how passing in NULL (instead of an ARRAY) returns NULL:

```sqlexample
SELECT ARRAY_DISTINCT(NULL);

+----------------------+
| ARRAY_DISTINCT(NULL) |
|----------------------|
| NULL                 |
+----------------------+
```

The following example demonstrates how the function removes duplicate OBJECTs that are elements in the input ARRAY. The example
uses [OBJECT constants](../data-types-semistructured.md) and ARRAY constants to construct the OBJECTs and ARRAY.

```sqlexample
SELECT ARRAY_DISTINCT( [ {'a': 1, 'b': 2}, {'a': 1, 'b': 2}, {'a': 1, 'b': 3} ] );

+----------------------------------------------------------------------------+
| ARRAY_DISTINCT( [ {'A': 1, 'B': 2}, {'A': 1, 'B': 2}, {'A': 1, 'B': 3} ] ) |
|----------------------------------------------------------------------------|
| [                                                                          |
|   {                                                                        |
|     "a": 1,                                                                |
|     "b": 2                                                                 |
|   },                                                                       |
|   {                                                                        |
|     "a": 1,                                                                |
|     "b": 3                                                                 |
|   }                                                                        |
| ]                                                                          |
+----------------------------------------------------------------------------+
```

As shown in the example, the last element is not considered to be a duplicate because `b` has a different value (`3`, not
`2`).

---
title: ARRAY_EXCEPT
source: https://docs.snowflake.com/en/sql-reference/functions/array_except.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_EXCEPT

Returns a new [ARRAY](../data-types-semistructured.md) that contains the elements from one input ARRAY that are not in another input
ARRAY.

The function is NULL-safe, meaning it treats NULLs as known values for comparing equality.

See also:
:   [ARRAY_INTERSECTION](array_intersection.md)

## Syntax

```sqlsyntax
ARRAY_EXCEPT( <source_array> , <array_of_elements_to_exclude> )
```

## Arguments

`source_array`
:   An array that contains elements to be included in the new ARRAY.

`array_of_elements_to_exclude`
:   An array that contains elements to be excluded from the new ARRAY.

## Returns

This function returns an ARRAY that contains the elements from `source_array` that are not in
`array_of_elements_to_exclude`.

If no elements remain after excluding the elements in `array_of_elements_to_exclude` from `source_array`, the
function returns an empty ARRAY.

If one or both arguments are NULL, the function returns NULL.

The order of the values within the returned array is unspecified.

## Usage notes

* When you compare data of the type OBJECT, the objects must be identical to be considered matching. For details,
  see Examples (in this topic).
* In Snowflake, arrays are multi-sets, not sets. In other words, arrays can contain multiple copies of the same value.

  `ARRAY_EXCEPT` compares arrays by using multi-set semantics (sometimes called “bag semantics”). If
  `source_array` includes multiple copies of a value, the function only removes the number of copies of that value that
  are specified in `array_of_elements_to_exclude`.

  In other words, if `source_array` has N copies of a value and `array_of_elements_to_exclude` has M copies of the
  same value, the function excludes M copies of the value from the returned array. The number of copies of the value in the
  returned array is N - M or, if M is larger than N, 0.

  For example, if `source_array` contains 5 elements with the value `'A'` and `array_of_elements_to_exclude`
  contains 2 elements with the value `'A'`, the returned array contains 3 elements with the value `'A'`.

* Both arguments must either be [structured ARRAYs](../data-types-structured.md) or
  [semi-structured ARRAYs](../data-types-semistructured.md).

* If you are passing in a structured ARRAY:

  + The ARRAY in the second argument must be [comparable](../data-types-structured.md) to the ARRAY in
    the first argument.
  + The function returns a structured ARRAY of the same type as the ARRAY in the first argument.

## Examples

The examples in this section use [ARRAY constants](../data-types-semistructured.md) and [OBJECT constants](../data-types-semistructured.md)
to specify ARRAYs and OBJECTs.

The following example demonstrates how to use the function:

```sqlexample
SELECT ARRAY_EXCEPT(['A', 'B'], ['B', 'C']);

+--------------------------------------+
| ARRAY_EXCEPT(['A', 'B'], ['B', 'C']) |
|--------------------------------------|
| [                                    |
|   "A"                                |
| ]                                    |
+--------------------------------------+
```

The following example adds the element `'C'` to `source_array`. The returned ARRAY excludes `'C'` because `'C'` is
also specified in `array_of_elements_to_exclude`.

```sqlexample
SELECT ARRAY_EXCEPT(['A', 'B', 'C'], ['B', 'C']);

+-------------------------------------------+
| ARRAY_EXCEPT(['A', 'B', 'C'], ['B', 'C']) |
|-------------------------------------------|
| [                                         |
|   "A"                                     |
| ]                                         |
+-------------------------------------------+
```

In the following example, `source_array` contains 3 elements with the value `'B'`. Because
`array_of_elements_to_exclude` contains only 1 `'B'` element, the function excludes only 1 `'B'` element and returns
an ARRAY containing the other 2 `'B'` elements.

```sqlexample
SELECT ARRAY_EXCEPT(['A', 'B', 'B', 'B', 'C'], ['B']);

+------------------------------------------------+
| ARRAY_EXCEPT(['A', 'B', 'B', 'B', 'C'], ['B']) |
|------------------------------------------------|
| [                                              |
|   "A",                                         |
|   "B",                                         |
|   "B",                                         |
|   "C"                                          |
| ]                                              |
+------------------------------------------------+
```

In the following example, no elements remain after excluding the elements in `array_of_elements_to_exclude` from
`source_array`. As a result, the function returns an empty ARRAY.

```sqlexample
SELECT ARRAY_EXCEPT(['A', 'B'], ['A', 'B']);

+--------------------------------------+
| ARRAY_EXCEPT(['A', 'B'], ['A', 'B']) |
|--------------------------------------|
| []                                   |
+--------------------------------------+
```

The following example demonstrates how the function treats NULL elements as known values. As explained earlier, because
`source_array` contains one more NULL element than `array_of_elements_to_exclude`, the returned ARRAY excludes
only one NULL element and includes the other (which is printed out as `undefined`).

```sqlexample
SELECT ARRAY_EXCEPT(['A', NULL, NULL], ['B', NULL]);

+----------------------------------------------+
| ARRAY_EXCEPT(['A', NULL, NULL], ['B', NULL]) |
|----------------------------------------------|
| [                                            |
|   "A",                                       |
|   undefined                                  |
| ]                                            |
+----------------------------------------------+
```

In the following example, `source_array` and `array_of_elements_to_exclude` contain the same number of NULL
elements, so the returned ARRAY excludes the NULL elements.

```sqlexample
SELECT ARRAY_EXCEPT(['A', NULL, NULL], [NULL, 'B', NULL]);

+----------------------------------------------------+
| ARRAY_EXCEPT(['A', NULL, NULL], [NULL, 'B', NULL]) |
|----------------------------------------------------|
| [                                                  |
|   "A"                                              |
| ]                                                  |
+----------------------------------------------------+
```

The following example demonstrates how specifying the same object in `source_array` and
`array_of_elements_to_exclude` excludes that object from the returned ARRAY:

```sqlexample
SELECT ARRAY_EXCEPT([{'a': 1, 'b': 2}, 1], [{'a': 1, 'b': 2}, 3]);

+------------------------------------------------------------+
| ARRAY_EXCEPT([{'A': 1, 'B': 2}, 1], [{'A': 1, 'B': 2}, 3]) |
|------------------------------------------------------------|
| [                                                          |
|   1                                                        |
| ]                                                          |
+------------------------------------------------------------+
```

The following example demonstrates that passing in NULL results in the function returning NULL.

```sqlexample
SELECT ARRAY_EXCEPT(['A', 'B'], NULL);

+--------------------------------+
| ARRAY_EXCEPT(['A', 'B'], NULL) |
|--------------------------------|
| NULL                           |
+--------------------------------+
```

---
title: ARRAY_FLATTEN
source: https://docs.snowflake.com/en/sql-reference/functions/array_flatten.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_FLATTEN

Flattens an [ARRAY](../data-types-semistructured.md) of ARRAYs into a single ARRAY. The function effectively concatenates the ARRAYs that
are elements of the input ARRAY and returns them as a single ARRAY.

## Syntax

```sqlsyntax
ARRAY_FLATTEN( <array> )
```

## Arguments

`array`
:   The ARRAY of ARRAYs to flatten.

    If any element of `array` is not an ARRAY, the function reports an error.

## Returns

This function returns an ARRAY that is constructed by concatenating the ARRAYs in `array`.

If `array` is NULL or contains any elements that are NULL, the function returns NULL.

## Usage notes

* If `array` contains multiple levels of nested ARRAYs, the function only removes one level of nesting.

  For example, if the input ARRAY is:

  ```output
  [ [ [1, 2], [3] ], [ [4], [5] ] ]
  ```

  The function returns:

  ```sqlexample
  [ [1, 2], [3], [4], [5] ]
  ```

## Examples

The following example flattens an ARRAY of ARRAYs. Each element in the input ARRAY is an ARRAY of numbers. The example flattens
the input ARRAY into an ARRAY containing the numbers as elements.

```sqlexample
SELECT ARRAY_FLATTEN([[1, 2, 3], [4], [5, 6]]);
```

```output
+-----------------------------------------+
| ARRAY_FLATTEN([[1, 2, 3], [4], [5, 6]]) |
|-----------------------------------------|
| [                                       |
|   1,                                    |
|   2,                                    |
|   3,                                    |
|   4,                                    |
|   5,                                    |
|   6                                     |
| ]                                       |
+-----------------------------------------+
```

The following example flattens an ARRAY that contains ARRAYs containing ARRAYs. The function removes the first level of nesting.

```sqlexample
SELECT ARRAY_FLATTEN([[[1, 2], [3]], [[4], [5]]]);
```

```output
+--------------------------------------------+
| ARRAY_FLATTEN([[[1, 2], [3]], [[4], [5]]]) |
|--------------------------------------------|
| [                                          |
|   [                                        |
|     1,                                     |
|     2                                      |
|   ],                                       |
|   [                                        |
|     3                                      |
|   ],                                       |
|   [                                        |
|     4                                      |
|   ],                                       |
|   [                                        |
|     5                                      |
|   ]                                        |
| ]                                          |
+--------------------------------------------+
```

The following example demonstrates that the function returns an error when an element of the input ARRAY is not an ARRAY.

```sqlexample
SELECT ARRAY_FLATTEN([[1, 2, 3], 4, [5, 6]]);
```

```output
100107 (22000): Not an array: 'Input argument to ARRAY_FLATTEN is not an array of arrays'
```

The following example demonstrates that the function returns NULL when an element of the input ARRAY is NULL.

```sqlexample
SELECT ARRAY_FLATTEN([[1, 2, 3], NULL, [5, 6]]);
```

```output
+------------------------------------------+
| ARRAY_FLATTEN([[1, 2, 3], NULL, [5, 6]]) |
|------------------------------------------|
| NULL                                     |
+------------------------------------------+
```

The following example demonstrates that the function flattens an ARRAY when an element of the input ARRAY is an ARRAY that
contains a NULL element.

```sqlexample
SELECT ARRAY_FLATTEN([[1, 2, 3], [NULL], [5, 6]]);
```

```output
+--------------------------------------------+
| ARRAY_FLATTEN([[1, 2, 3], [NULL], [5, 6]]) |
|--------------------------------------------|
| [                                          |
|   1,                                       |
|   2,                                       |
|   3,                                       |
|   undefined,                               |
|   5,                                       |
|   6                                        |
| ]                                          |
+--------------------------------------------+
```

---
title: ARRAY_GENERATE_RANGE
source: https://docs.snowflake.com/en/sql-reference/functions/array_generate_range.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_GENERATE_RANGE

Returns an [ARRAY](../data-types-semistructured.md) of integer values within a specified range (e.g. `[2, 3, 4]`).

## Syntax

```sqlsyntax
ARRAY_GENERATE_RANGE( <start> , <stop> [ , <step> ] )
```

## Arguments

**Required:**

`start`
:   The first number in the range of numbers to return.

    You must specify an expression that evaluates to an INTEGER value.

`stop`
:   The last number in the range. Note that this number is not included in the range of numbers returned.

    For example, `ARRAY_GENERATE_RANGE(1, 5)` returns `[1, 2, 3, 4]` (which does not include `5`).

    You must specify an expression that evaluates to an INTEGER value.

**Optional:**

`step`
:   The amount to increment or decrement each subsequent number in the array. For example:

    * `ARRAY_GENERATE_RANGE(0, 16, 5)` returns `[0, 5, 10, 15]`
    * `ARRAY_GENERATE_RANGE(0, -16, -5)` returns `[0, -5, -10, -15]`

    You can specify a positive or negative number. You cannot specify 0.

    The default value is `1`.

## Returns

An ARRAY of integers in the specified range.

If any of the arguments is NULL, the function returns NULL.

## Usage notes

* After `start`, each subsequent element increases or decreases by `step` (depending on whether `step`
  is positive or negative) up to (but not including) `stop`.

  For example:

  + `ARRAY_GENERATE_RANGE(10, 50, 10)` returns `[10, 20, 30, 40]`.
  + `ARRAY_GENERATE_RANGE(-10, -50, -10)` returns `[-10, -20, -30, -40]`.
* The function returns an empty ARRAY under any of the following conditions:

  + `start = stop`.
  + `step` is a positive number and `start > stop`.
  + `step` is a negative number and `start < stop`.

  For example:

  + `ARRAY_GENERATE_RANGE(2, 2, 4)` returns `[]`.
  + `ARRAY_GENERATE_RANGE(8, 2, 2)` returns `[]`.
  + `ARRAY_GENERATE_RANGE(2, 8, -2)` returns `[]`.

## Examples

The following example returns an ARRAY containing a range of numbers starting from 2 and ending before 5:

```sqlexample
SELECT ARRAY_GENERATE_RANGE(2, 5);
```

```output
+----------------------------+
| ARRAY_GENERATE_RANGE(2, 5) |
|----------------------------|
| [                          |
|   2,                       |
|   3,                       |
|   4                        |
| ]                          |
+----------------------------+
```

The following example returns an ARRAY containing a range of numbers starting from 5 and ending before 25, increasing in value by
10:

```sqlexample
SELECT ARRAY_GENERATE_RANGE(5, 25, 10);
```

```output
+---------------------------------+
| ARRAY_GENERATE_RANGE(5, 25, 10) |
|---------------------------------|
| [                               |
|   5,                            |
|   15                            |
| ]                               |
+---------------------------------+
```

The following example returns an ARRAY containing a range of numbers starting from -5 and ending before -25, decreasing in value
by -10:

```sqlexample
SELECT ARRAY_GENERATE_RANGE(-5, -25, -10);
```

```output
+------------------------------------+
| ARRAY_GENERATE_RANGE(-5, -25, -10) |
|------------------------------------|
| [                                  |
|   -5,                              |
|   -15                              |
| ]                                  |
+------------------------------------+
```

---
title: ARRAY_INSERT
source: https://docs.snowflake.com/en/sql-reference/functions/array_insert.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_INSERT

Returns an array containing all elements from the source array as well as the new element.

## Syntax

```sqlsyntax
ARRAY_INSERT( <array> , <pos> , <new_element> )
```

See also:
:   [ARRAY_APPEND](array_append.md) , [ARRAY_PREPEND](array_prepend.md)

## Arguments

`array`
:   The source array.

`pos`
:   A (zero-based) position in the source array. The new element is inserted at this position. The original element from this position (if any) and all subsequent elements (if any) are shifted by
    one position to the right in the resulting array (i.e. inserting at position 0 has the same effect as using [ARRAY_PREPEND](array_prepend.md)).

    A negative position is interpreted as an index from the back of the array (e.g. `-1` results in insertion before the last element in the array).

`new_element`
:   The element to be inserted. The new element is located at position `pos`. The relative order of the other elements from the source array is preserved.

## Returns

The data type of the returned value is `ARRAY`.

## Usage notes

* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.
* If `array` is a [structured ARRAY](../data-types-structured.md), the type of the new element must
  be [coercible](../data-types-structured.md) to the type of the ARRAY.
* If the absolute value of `pos` exceeds the number of elements in `array`, additional empty elements are inserted between the new element and the elements from the source array.
* To append or prepend elements to an array, you should use [ARRAY_APPEND](array_append.md) or [ARRAY_PREPEND](array_prepend.md) instead.

## Examples

This shows a simple example of inserting into an array:

```sqlexample
SELECT ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),2,'hello');
+--------------------------------------------------+
| ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),2,'HELLO') |
|--------------------------------------------------|
| [                                                |
|   0,                                             |
|   1,                                             |
|   "hello",                                       |
|   2,                                             |
|   3                                              |
| ]                                                |
+--------------------------------------------------+
```

This shows an insert that uses an index larger than the number of existing elements in the array.

```sqlexample
SELECT ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),5,'hello');
+--------------------------------------------------+
| ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),5,'HELLO') |
|--------------------------------------------------|
| [                                                |
|   0,                                             |
|   1,                                             |
|   2,                                             |
|   3,                                             |
|   undefined,                                     |
|   "hello"                                        |
| ]                                                |
+--------------------------------------------------+
```

This shows an insert that uses a negative index.

```sqlexample
SELECT ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),-1,'hello');
+---------------------------------------------------+
| ARRAY_INSERT(ARRAY_CONSTRUCT(0,1,2,3),-1,'HELLO') |
|---------------------------------------------------|
| [                                                 |
|   0,                                              |
|   1,                                              |
|   2,                                              |
|   "hello",                                        |
|   3                                               |
| ]                                                 |
+---------------------------------------------------+
```

---
title: ARRAY_INTERSECTION
source: https://docs.snowflake.com/en/sql-reference/functions/array_intersection.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_INTERSECTION

Returns an array that contains the matching elements in the two input arrays.

The function is NULL-safe, meaning it treats NULLs as known values for comparing equality.

See also:
:   [ARRAY_EXCEPT](array_except.md) , [ARRAYS_OVERLAP](arrays_overlap.md)

## Syntax

```sqlsyntax
ARRAY_INTERSECTION( <array1> , <array2> )
```

## Arguments

`array1`
:   An array that contains elements to be compared.

`array2`
:   An array that contains elements to be compared.

## Returns

This function returns an `ARRAY` that contains the elements of the input arrays that match.

If no elements overlap, the function returns an empty array.

If one or both arguments are NULL, the function returns NULL.

The order of the values within the returned array is unspecified.

## Usage notes

* When comparing data of type `OBJECT`, the objects must be identical to be considered matching. For details,
  see Examples (in this topic).
* The difference between `ARRAY_INTERSECTION` and the related `ARRAYS_OVERLAP` function is that the
  `ARRAYS_OVERLAP` function simply returns `TRUE` or `FALSE`, while `ARRAY_INTERSECTION` returns the actual
  overlapping values.
* In Snowflake, arrays are multi-sets, not sets. In other words, arrays can contain multiple copies of the same value.
  `ARRAY_INTERSECTION` compares arrays by using multi-set semantics (sometimes called “bag semantics”),
  which means that the function can return multiple copies of the same value. If one array has N copies of a value,
  and the other array has M copies of the same value, then the number of copies in the returned array is
  the smaller of N or M. For example, if N is 4 and M is 2, then the returned value contains 2 copies.

* Both arguments must either be [structured ARRAYs](../data-types-structured.md) or
  [semi-structured ARRAYs](../data-types-semistructured.md).

* If you are passing in structured ARRAYs:

  + The function returns an ARRAY of a type that can accommodate both input types.
  + The ARRAY in the second argument must be [comparable](../data-types-structured.md) to the ARRAY in the
    first argument.

## Examples

This example shows simple use of the function:

> ```sqlexample
> SELECT array_intersection(ARRAY_CONSTRUCT('A', 'B'),
>                           ARRAY_CONSTRUCT('B', 'C'));
> +------------------------------------------------------+
> | ARRAY_INTERSECTION(ARRAY_CONSTRUCT('A', 'B'),        |
> |                           ARRAY_CONSTRUCT('B', 'C')) |
> |------------------------------------------------------|
> | [                                                    |
> |   "B"                                                |
> | ]                                                    |
> +------------------------------------------------------+
> ```

The sets might have more than one matching value:

> ```sqlexample
> SELECT array_intersection(ARRAY_CONSTRUCT('A', 'B', 'C'),
>                           ARRAY_CONSTRUCT('B', 'C'));
> +------------------------------------------------------+
> | ARRAY_INTERSECTION(ARRAY_CONSTRUCT('A', 'B', 'C'),   |
> |                           ARRAY_CONSTRUCT('B', 'C')) |
> |------------------------------------------------------|
> | [                                                    |
> |   "B",                                               |
> |   "C"                                                |
> | ]                                                    |
> +------------------------------------------------------+
> ```

There might be more than instance of the same matching value. For example, in the query below, one array has three
copies of the letter ‘B’, and the other array has two copies of the letter ‘B’. The result contains two matches:

> ```sqlexample
> SELECT array_intersection(ARRAY_CONSTRUCT('A', 'B', 'B', 'B', 'C'),
>                           ARRAY_CONSTRUCT('B', 'B'));
> +---------------------------------------------------------------+
> | ARRAY_INTERSECTION(ARRAY_CONSTRUCT('A', 'B', 'B', 'B', 'C'),  |
> |                           ARRAY_CONSTRUCT('B', 'B'))          |
> |---------------------------------------------------------------|
> | [                                                             |
> |   "B",                                                        |
> |   "B"                                                         |
> | ]                                                             |
> +---------------------------------------------------------------+
> ```

This example uses a larger amount of data:

> ```sqlexample
> CREATE OR REPLACE TABLE array_demo (ID INTEGER, array1 ARRAY, array2 ARRAY, tip VARCHAR);
>
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 1, ARRAY_CONSTRUCT(1, 2), ARRAY_CONSTRUCT(3, 4), 'non-overlapping';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 2, ARRAY_CONSTRUCT(1, 2, 3), ARRAY_CONSTRUCT(3, 4, 5), 'value 3 overlaps';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 3, ARRAY_CONSTRUCT(1, 2, 3, 4), ARRAY_CONSTRUCT(3, 4, 5), 'values 3 and 4 overlap';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 4, ARRAY_CONSTRUCT(NULL, 102, NULL), ARRAY_CONSTRUCT(NULL, NULL, 103), 'NULLs overlap';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 5, array_construct(object_construct('a',1,'b',2), 1, 2),
>               array_construct(object_construct('a',1,'b',2), 3, 4),
>               'the objects in the array match';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 6, array_construct(object_construct('a',1,'b',2), 1, 2),
>               array_construct(object_construct('b',2,'c',3), 3, 4),
>               'neither the objects nor any other values match';
> INSERT INTO array_demo (ID, array1, array2, tip)
>     SELECT 7, array_construct(object_construct('a',1, 'b',2, 'c',3)),
>               array_construct(object_construct('c',3, 'b',2, 'a',1)),
>               'the objects contain the same values, but in different order';
> ```
>
> ```sqlexample
> SELECT ID, array1, array2, tip, ARRAY_INTERSECTION(array1, array2)
>     FROM array_demo
>     WHERE ID <= 3
>     ORDER BY ID;
> +----+--------+--------+------------------------+------------------------------------+
> | ID | ARRAY1 | ARRAY2 | TIP                    | ARRAY_INTERSECTION(ARRAY1, ARRAY2) |
> |----+--------+--------+------------------------+------------------------------------|
> |  1 | [      | [      | non-overlapping        | []                                 |
> |    |   1,   |   3,   |                        |                                    |
> |    |   2    |   4    |                        |                                    |
> |    | ]      | ]      |                        |                                    |
> |  2 | [      | [      | value 3 overlaps       | [                                  |
> |    |   1,   |   3,   |                        |   3                                |
> |    |   2,   |   4,   |                        | ]                                  |
> |    |   3    |   5    |                        |                                    |
> |    | ]      | ]      |                        |                                    |
> |  3 | [      | [      | values 3 and 4 overlap | [                                  |
> |    |   1,   |   3,   |                        |   3,                               |
> |    |   2,   |   4,   |                        |   4                                |
> |    |   3,   |   5    |                        | ]                                  |
> |    |   4    | ]      |                        |                                    |
> |    | ]      |        |                        |                                    |
> +----+--------+--------+------------------------+------------------------------------+
> ```

This shows usage with NULL values:

> ```sqlexample
> SELECT ID, array1, array2, tip, ARRAY_INTERSECTION(array1, array2)
>     FROM array_demo
>     WHERE ID = 4
>     ORDER BY ID;
> +----+--------------+--------------+---------------+------------------------------------+
> | ID | ARRAY1       | ARRAY2       | TIP           | ARRAY_INTERSECTION(ARRAY1, ARRAY2) |
> |----+--------------+--------------+---------------+------------------------------------|
> |  4 | [            | [            | NULLs overlap | [                                  |
> |    |   undefined, |   undefined, |               |   undefined,                       |
> |    |   102,       |   undefined, |               |   undefined                        |
> |    |   undefined  |   103        |               | ]                                  |
> |    | ]            | ]            |               |                                    |
> +----+--------------+--------------+---------------+------------------------------------+
> ```

This example shows usage with the `OBJECT` data type:

> ```sqlexample
> SELECT ID, array1, array2, tip, ARRAY_INTERSECTION(array1, array2)
>     FROM array_demo
>     WHERE ID >= 5 and ID <= 7
>     ORDER BY ID;
> +----+-------------+-------------+-------------------------------------------------------------+------------------------------------+
> | ID | ARRAY1      | ARRAY2      | TIP                                                         | ARRAY_INTERSECTION(ARRAY1, ARRAY2) |
> |----+-------------+-------------+-------------------------------------------------------------+------------------------------------|
> |  5 | [           | [           | the objects in the array match                              | [                                  |
> |    |   {         |   {         |                                                             |   {                                |
> |    |     "a": 1, |     "a": 1, |                                                             |     "a": 1,                        |
> |    |     "b": 2  |     "b": 2  |                                                             |     "b": 2                         |
> |    |   },        |   },        |                                                             |   }                                |
> |    |   1,        |   3,        |                                                             | ]                                  |
> |    |   2         |   4         |                                                             |                                    |
> |    | ]           | ]           |                                                             |                                    |
> |  6 | [           | [           | neither the objects nor any other values match              | []                                 |
> |    |   {         |   {         |                                                             |                                    |
> |    |     "a": 1, |     "b": 2, |                                                             |                                    |
> |    |     "b": 2  |     "c": 3  |                                                             |                                    |
> |    |   },        |   },        |                                                             |                                    |
> |    |   1,        |   3,        |                                                             |                                    |
> |    |   2         |   4         |                                                             |                                    |
> |    | ]           | ]           |                                                             |                                    |
> |  7 | [           | [           | the objects contain the same values, but in different order | [                                  |
> |    |   {         |   {         |                                                             |   {                                |
> |    |     "a": 1, |     "a": 1, |                                                             |     "a": 1,                        |
> |    |     "b": 2, |     "b": 2, |                                                             |     "b": 2,                        |
> |    |     "c": 3  |     "c": 3  |                                                             |     "c": 3                         |
> |    |   }         |   }         |                                                             |   }                                |
> |    | ]           | ]           |                                                             | ]                                  |
> +----+-------------+-------------+-------------------------------------------------------------+------------------------------------+
> ```

Although NULL values in an array are treated as comparable values, if you pass NULL instead of an
array, then the result is NULL:

> ```sqlexample
> SELECT array_intersection(ARRAY_CONSTRUCT('A', 'B'),
>                           NULL);
> +------------------------------------------------+
> | ARRAY_INTERSECTION(ARRAY_CONSTRUCT('A', 'B'),  |
> |                           NULL)                |
> |------------------------------------------------|
> | NULL                                           |
> +------------------------------------------------+
> ```

---
title: ARRAY_MAX
source: https://docs.snowflake.com/en/sql-reference/functions/array_max.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_MAX

Given an input [ARRAY](../data-types-semistructured.md), returns the element with the highest value that is not a SQL NULL. If the input ARRAY
is empty or contains only SQL NULL elements, this function returns NULL.

## Syntax

```sqlsyntax
ARRAY_MAX( <array> )
```

## Arguments

`array`
:   The input ARRAY.

## Returns

This function returns a [VARIANT](../data-types-semistructured.md) that contains the element with the highest value that is not a SQL NULL.

The function returns NULL if `array` is NULL, empty, or contains only SQL NULL elements.

## Usage notes

* A SQL NULL is distinct from an explicit null value in semi-structured data (for example, a [JSON null](../../user-guide/semistructured-considerations.md)
  in JSON data). Explicit null values are considered when identifying the element with the highest value.

* The function determines the element to return by comparing the elements in the array. The function supports comparing elements
  of the same data type or of the following data types:

  + Elements of the NUMBER and FLOAT data types.
  + Elements of the TIMESTAMP_LTZ and TIMESTAMP_TZ data types.

  If the array contains elements of other data types, [cast](cast.md) the elements to a common data type,
  as shown in the example below.

## Examples

The following example returns a VARIANT containing the element with the highest value in an
[ARRAY constant](../data-types-semistructured.md):

```sqlexample
SELECT ARRAY_MAX([20, 0, NULL, 10, NULL]);
```

```output
+------------------------------------+
| ARRAY_MAX([20, 0, NULL, 10, NULL]) |
|------------------------------------|
| 20                                 |
+------------------------------------+
```

The following example demonstrates that a JSON null is handled differently than a SQL NULL. If `array` contains a JSON
null, the function returns the JSON null.

```sqlexample
SELECT ARRAY_MAX([NULL, PARSE_JSON('null'), NULL]);
```

```output
+--------------------------------------------------+
| ARRAY_MAX([20, 0, PARSE_JSON('NULL'), 10, NULL]) |
|--------------------------------------------------|
| null                                             |
+--------------------------------------------------+
```

The following example demonstrates that the function returns NULL if the input ARRAY is empty:

```sqlexample
SELECT ARRAY_MAX([]);
```

```output
+---------------+
| ARRAY_MAX([]) |
|---------------|
| NULL          |
+---------------+
```

The following example demonstrates that the function returns NULL if the input ARRAY contains only SQL NULLs:

```sqlexample
SELECT ARRAY_MAX([NULL, NULL, NULL]);
```

```output
+-------------------------+
| ARRAY_MAX([NULL, NULL]) |
|-------------------------|
| NULL                    |
+-------------------------+
```

To determine the maximum value in an array with elements of different data types, [cast](cast.md) the elements
to the same data type. The following example casts a DATE element to a TIMESTAMP element to determine the maximum value in the array:

```sqlexample
SELECT ARRAY_MAX([date1::TIMESTAMP, timestamp1]) AS array_max
  FROM (
      VALUES ('1999-01-01'::DATE, '2023-12-09 22:09:26.000000000'::TIMESTAMP),
             ('2023-12-09'::DATE, '1999-01-01 22:09:26.000000000'::TIMESTAMP)
          AS t(date1, timestamp1)
      );
```

```output
+---------------------------+
| ARRAY_MAX                 |
|---------------------------|
| "2023-12-09 22:09:26.000" |
| "2023-12-09 00:00:00.000" |
+---------------------------+
```

---
title: ARRAY_MIN
source: https://docs.snowflake.com/en/sql-reference/functions/array_min.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_MIN

Given an input [ARRAY](../data-types-semistructured.md), returns the element with the lowest value that is not a SQL NULL. If the input ARRAY
is empty or contains only SQL NULL elements, this function returns NULL.

## Syntax

```sqlsyntax
ARRAY_MIN( <array> )
```

## Arguments

`array`
:   The input ARRAY.

## Returns

This function returns a [VARIANT](../data-types-semistructured.md) that contains the element with the lowest value that is not a SQL NULL.

The function returns NULL if `array` is NULL, empty, or contains only SQL NULL elements.

## Usage notes

* A SQL NULL is distinct from an explicit null value in semi-structured data (for example, a [JSON null](../../user-guide/semistructured-considerations.md)
  in JSON data). Explicit null values are considered when identifying the element with the lowest value.

* The function determines the element to return by comparing the elements in the array. The function supports comparing elements
  of the same data type or of the following data types:

  + Elements of the NUMBER and FLOAT data types.
  + Elements of the TIMESTAMP_LTZ and TIMESTAMP_TZ data types.

  If the array contains elements of other data types, [cast](cast.md) the elements to a common data type,
  as shown in the example below.

## Examples

The following example returns a VARIANT containing the element with the lowest value in an
[ARRAY constant](../data-types-semistructured.md):

```sqlexample
SELECT ARRAY_MIN([20, 0, NULL, 10, NULL]);
```

```output
+------------------------------------+
| ARRAY_MIN([20, 0, NULL, 10, NULL]) |
|------------------------------------|
| 0                                  |
+------------------------------------+
```

The following example demonstrates that the function returns NULL if the input ARRAY is empty:

```sqlexample
SELECT ARRAY_MIN([]);
```

```output
+---------------+
| ARRAY_MIN([]) |
|---------------|
| NULL          |
+---------------+
```

The following example demonstrates that the function returns NULL if the input ARRAY contains only SQL NULLs:

```sqlexample
SELECT ARRAY_MIN([NULL, NULL, NULL]);
```

```output
+-------------------------+
| ARRAY_MIN([NULL, NULL]) |
|-------------------------|
| NULL                    |
+-------------------------+
```

To determine the minimum value in an array with elements of different data types, [cast](cast.md) the elements
to the same data type. The following example casts a DATE element to a TIMESTAMP element to determine the minimum value in the array:

```sqlexample
SELECT ARRAY_MIN([date1::TIMESTAMP, timestamp1]) AS array_min
  FROM (
      VALUES ('1999-01-01'::DATE, '2023-12-09 22:09:26.000000000'::TIMESTAMP),
             ('2023-12-09'::DATE, '1999-01-01 22:09:26.000000000'::TIMESTAMP)
          AS t(date1, timestamp1)
      );
```

```output
+---------------------------+
| ARRAY_MIN                 |
|---------------------------|
| "1999-01-01 00:00:00.000" |
| "1999-01-01 22:09:26.000" |
+---------------------------+
```

---
title: ARRAY_POSITION
source: https://docs.snowflake.com/en/sql-reference/functions/array_position.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_POSITION

Returns the index of the first occurrence of an element in an array.

## Syntax

```sqlsyntax
ARRAY_POSITION( <variant_expr> , <array> )
```

## Arguments

`value_expr`
:   Value to find in `array`.

    * If `array` is a [semi-structured ARRAY](../data-types-semistructured.md), `value_expr` must evaluate to a
      [VARIANT](../data-types-semistructured.md).
    * If `array` is a [structured ARRAY](../data-types-structured.md), `value_expr` must evaluate
      to a type that is [comparable](../data-types-structured.md) to the type of the ARRAY.

`array`
:   The ARRAY to search.

## Returns

The function returns an INTEGER specifying the position of `value_expr` in `array`.

## Usage notes

* The return value is 0-based, not 1-based. In other words, if the `value_expr` matches the first element in the array,
  this function returns 0, not 1.
* If the value is not contained in the ARRAY, the function returns NULL.
* If you specify NULL for `value_expr`, the function returns the position of the first NULL in the array.

## Examples

The examples below show how to use this function:

> ```sqlexample
> SELECT ARRAY_POSITION('hello'::variant, array_construct('hello', 'hi'));
> +------------------------------------------------------------------+
> | ARRAY_POSITION('HELLO'::VARIANT, ARRAY_CONSTRUCT('HELLO', 'HI')) |
> |------------------------------------------------------------------|
> |                                                                0 |
> +------------------------------------------------------------------+
> ```
>
> ```sqlexample
> SELECT ARRAY_POSITION('hi'::variant, array_construct('hello', 'hi'));
> +---------------------------------------------------------------+
> | ARRAY_POSITION('HI'::VARIANT, ARRAY_CONSTRUCT('HELLO', 'HI')) |
> |---------------------------------------------------------------|
> |                                                             1 |
> +---------------------------------------------------------------+
> ```
>
> ```sqlexample
> SELECT ARRAY_POSITION('hello'::variant, array_construct('hola', 'bonjour'));
> +----------------------------------------------------------------------+
> | ARRAY_POSITION('HELLO'::VARIANT, ARRAY_CONSTRUCT('HOLA', 'BONJOUR')) |
> |----------------------------------------------------------------------|
> |                                                                 NULL |
> +----------------------------------------------------------------------+
> ```

---
title: ARRAY_PREPEND
source: https://docs.snowflake.com/en/sql-reference/functions/array_prepend.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_PREPEND

Returns an array containing the new element as well as all elements from the source array. The new element is positioned at the beginning of the array.

See also:
:   [ARRAY_APPEND](array_append.md) , [ARRAY_INSERT](array_insert.md)

## Syntax

```sqlsyntax
ARRAY_PREPEND( <array> , <new_element> )
```

## Arguments

`array`
:   The source array.

`new_element`
:   The element to be prepended.

## Returns

This returns the updated array.

## Usage notes

* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.
* If `array` is a [structured ARRAY](../data-types-structured.md), the type of the new element must
  be [coercible](../data-types-structured.md) to the type of the ARRAY.

## Examples

The example below shows that the prepended element is placed at the beginning of the array:

> ```sqlexample
> SELECT ARRAY_PREPEND(ARRAY_CONSTRUCT(0,1,2,3),'hello');
> +-------------------------------------------------+
> | ARRAY_PREPEND(ARRAY_CONSTRUCT(0,1,2,3),'HELLO') |
> |-------------------------------------------------|
> | [                                               |
> |   "hello",                                      |
> |   0,                                            |
> |   1,                                            |
> |   2,                                            |
> |   3                                             |
> | ]                                               |
> +-------------------------------------------------+
> ```

---
title: ARRAY_REMOVE
source: https://docs.snowflake.com/en/sql-reference/functions/array_remove.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_REMOVE

Given a source [ARRAY](../data-types-semistructured.md), returns an ARRAY with elements of the specified value removed.

For example, `ARRAY_REMOVE([2, 5, 7, 5, 1], 5)` returns an ARRAY with the elements equal to 5 removed (`[2, 7, 1]`).

## Syntax

```sqlsyntax
ARRAY_REMOVE( <array> , <value_of_elements_to_remove> )
```

## Arguments

`array`
:   The source array.

`value_of_elements_to_remove`
:   The VARIANT value of the elements to be removed. The function removes elements equal to this value.

    If you specify a VARCHAR value, you must first cast the value to VARIANT.

## Returns

An ARRAY with all elements equal to the specified value removed.

If `value_of_elements_to_remove` is NULL, the function returns NULL.

## Usage notes

* If all of the elements in `array` are equal to `value_of_elements_to_remove`, the function returns an empty
  ARRAY.

## Examples

The following example returns an ARRAY with elements with the value 5 removed.

```sqlexample
SELECT ARRAY_REMOVE(
  [1, 5, 5.00, 5.00::DOUBLE, '5', 5, NULL],
  5);
```

```output
+---------------------------------------------+
| ARRAY_REMOVE(                               |
|   [1, 5, 5.00, 5.00::DOUBLE, '5', 5, NULL], |
|   5)                                        |
|---------------------------------------------|
| [                                           |
|   1,                                        |
|   "5",                                      |
|   undefined                                 |
| ]                                           |
+---------------------------------------------+
```

The following example removes the elements with the value 5 from an ARRAY that contains only elements with the value 5. The
function returns an empty ARRAY:

```sqlexample
SELECT ARRAY_REMOVE([5, 5], 5);
```

```output
+-------------------------+
| ARRAY_REMOVE([5, 5], 5) |
|-------------------------|
| []                      |
+-------------------------+
```

The following example removes elements with the value `'a'` from an ARRAY. As shown in the example, you must cast the value
as VARIANT.

```sqlexample
SELECT ARRAY_REMOVE(
  ['a', 'b', 'a', 'c', 'd', 'a'],
  'a'::VARIANT);
```

```output
+-----------------------------------+
| ARRAY_REMOVE(                     |
|   ['A', 'B', 'A', 'C', 'D', 'A'], |
|   'A'::VARIANT)                   |
|-----------------------------------|
| [                                 |
|   "b",                            |
|   "c",                            |
|   "d"                             |
| ]                                 |
+-----------------------------------+
```

---
title: ARRAY_REMOVE_AT
source: https://docs.snowflake.com/en/sql-reference/functions/array_remove_at.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_REMOVE_AT

Given a source [ARRAY](../data-types-semistructured.md), returns an ARRAY with the element at the specified position removed.

For example, `ARRAY_REMOVE_AT([2, 5, 7], 0)` returns an ARRAY with the element at position 0 removed (`[5, 7]`).

## Syntax

```sqlsyntax
ARRAY_REMOVE_AT( <array> , <position> )
```

## Arguments

`array`
:   The source array.

`position`
:   The (zero-based) position of the element to be removed. The function removes the element at this position.

    A negative position is interpreted as an index from the back of the array (e.g. `-1` removes the last element in the array).

## Returns

An ARRAY with the element at the specified position removed.

If `position` is NULL, the function returns NULL.

## Usage notes

* If the absolute value of `position` exceeds the length of `array`, the function returns `array` without
  any elements removed.

## Examples

The following example returns an ARRAY with elements with the first element removed.

```sqlexample
SELECT ARRAY_REMOVE_AT(
  [2, 5, 7],
  0);
```

```output
+-------------------------------+
| ARRAY_REMOVE_AT([2, 5, 7], 0) |
|-------------------------------|
| [                             |
|   5,                          |
|   7                           |
| ]                             |
+-------------------------------+
```

The following example returns an ARRAY with elements with the last element removed.

```sqlexample
SELECT ARRAY_REMOVE_AT(
  [2, 5, 7],
  -1);
```

```output
+--------------------------------+
| ARRAY_REMOVE_AT([2, 5, 7], -1) |
|--------------------------------|
| [                              |
|   2,                           |
|   5                            |
| ]                              |
+--------------------------------+
```

In the following example, `position` is greater than the length of the ARRAY, so the function returns the ARRAY without
making any changes.

```sqlexample
SELECT ARRAY_REMOVE_AT(
  [2, 5, 7],
  10);
```

```output
+------------------+
| ARRAY_REMOVE_AT( |
|   [2, 5, 7],     |
|   10)            |
|------------------|
| [                |
|   2,             |
|   5,             |
|   7              |
| ]                |
+------------------+
```

---
title: ARRAY_REPEAT
source: https://docs.snowflake.com/en/sql-reference/functions/array_repeat.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_REPEAT

Returns an [ARRAY](../data-types-semistructured.md) value containing a specified number of copies of an element.

## Syntax

```sqlsyntax
ARRAY_REPEAT( <element> , <count> )
```

## Arguments

`element`
:   The value to repeat in the output array.

    The value can be of any [semi-structured data type](../data-types-semistructured.md) —- for example, VARIANT, ARRAY,
    OBJECT — or any standard Snowflake data type — for example, NUMBER, VARCHAR, BOOLEAN, DATE.

    [Structured types](../data-types-structured.md), such as MAP, aren’t supported.

`count`
:   An INTEGER expression specifying the number of times to repeat `element`.

## Returns

The function returns a [semi-structured ARRAY](../data-types-semistructured.md) value containing `count` copies of
`element`.

If `count` is NULL, the function returns NULL.

## Usage notes

* If `count` is 0 or a negative number, the function returns an empty ARRAY.
* If `element` is NULL, the function returns an ARRAY of `count` NULL values.
* The `element` value is implicitly converted to VARIANT in the resulting ARRAY.

## Examples

The following example repeats an INTEGER value three times:

```sqlexample
SELECT ARRAY_REPEAT(42, 3);
```

```output
+---------------------+
| ARRAY_REPEAT(42, 3) |
|---------------------|
| [                   |
|   42,               |
|   42,               |
|   42                |
| ]                   |
+---------------------+
```

The following example repeats a STRING value:

```sqlexample
SELECT ARRAY_REPEAT('hello', 2);
```

```output
+--------------------------+
| ARRAY_REPEAT('hello', 2) |
|--------------------------|
| [                        |
|   "hello",               |
|   "hello"                |
| ]                        |
+--------------------------+
```

The following example repeats an ARRAY value to create a nested ARRAY:

```sqlexample
SELECT ARRAY_REPEAT([1, 2], 2);
```

```output
+-------------------------+
| ARRAY_REPEAT([1, 2], 2) |
|-------------------------|
| [                       |
|   [                     |
|     1,                  |
|     2                   |
|   ],                    |
|   [                     |
|     1,                  |
|     2                   |
|   ]                     |
| ]                       |
+-------------------------+
```

The following example shows that a count of 0 returns an empty ARRAY value:

```sqlexample
SELECT ARRAY_REPEAT('x', 0);
```

```output
+----------------------+
| ARRAY_REPEAT('x', 0) |
|----------------------|
| []                   |
+----------------------+
```

The following example shows that a NULL count returns NULL:

```sqlexample
SELECT ARRAY_REPEAT('hi', NULL);
```

```output
+--------------------------+
| ARRAY_REPEAT('hi', NULL) |
|--------------------------|
| NULL                     |
+--------------------------+
```

---
title: ARRAY_REVERSE
source: https://docs.snowflake.com/en/sql-reference/functions/array_reverse.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_REVERSE

Returns an [array](../data-types-semistructured.md) with the elements of the input array in reverse order.

## Syntax

```sqlsyntax
ARRAY_REVERSE( <array> )
```

## Arguments

`array`
:   The source array.

## Returns

An array containing the elements of the input array in reverse order.

## Usage notes

* If the argument is NULL, the result will be NULL.
* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.

## Examples

The following example returns an array containing the elements from the input array in reverse order:

```sqlexample
SELECT ARRAY_REVERSE([1,2,3,4]);
```

```output
+--------------------------+
| ARRAY_REVERSE([1,2,3,4]) |
|--------------------------|
| [                        |
|   4,                     |
|   3,                     |
|   2,                     |
|   1                      |
| ]                        |
+--------------------------+
```

---
title: ARRAY_SIZE
source: https://docs.snowflake.com/en/sql-reference/functions/array_size.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_SIZE

Returns the size of the input array.

A variation of ARRAY_SIZE takes a VARIANT value as input. If the VARIANT value contains an array, the size of the array is returned; otherwise, NULL is returned if the value is not an array.

## Syntax

```sqlsyntax
ARRAY_SIZE( <array> )

ARRAY_SIZE( <variant> )
```

## Returns

The data type of the returned value is `INTEGER`.

## Usage notes

* Takes an ARRAY value as input and returns the size of the array (i.e. the largest index + 1).

  If the array is a [sparse](../data-types-semistructured.md) array, this means that the size includes the undefined elements as
  well as the defined elements.
* A NULL argument returns NULL as a result.

## Examples

Here is a simple example:

> ```sqlexample
> SELECT ARRAY_SIZE(ARRAY_CONSTRUCT(1, 2, 3)) AS SIZE;
> +------+
> | SIZE |
> |------|
> |    3 |
> +------+
> ```

Here is a slightly more complex example, this time using VARIANT data type:

> ```sqlexample
> CREATE OR replace TABLE colors (v variant);
>
> INSERT INTO
>    colors
>    SELECT
>       parse_json(column1) AS v
>    FROM
>    VALUES
>      ('[{r:255,g:12,b:0},{r:0,g:255,b:0},{r:0,g:0,b:255}]'),
>      ('[{r:255,g:128,b:0},{r:128,g:255,b:0},{r:0,g:255,b:128},{r:0,g:128,b:255},{r:128,g:0,b:255},{r:255,g:0,b:128}]')
>     v;
> ```
>
> Retrieve the size for each array in the VARIANT column:
>
> ```sqlexample
> SELECT ARRAY_SIZE(v) from colors;
> +---------------+
> | ARRAY_SIZE(V) |
> |---------------|
> |             3 |
> |             6 |
> +---------------+
> ```
>
> Retrieve the last element of each array in the VARIANT column:
>
> ```sqlexample
> SELECT GET(v, ARRAY_SIZE(v)-1) FROM colors;
> +-------------------------+
> | GET(V, ARRAY_SIZE(V)-1) |
> |-------------------------|
> | {                       |
> |   "b": 255,             |
> |   "g": 0,               |
> |   "r": 0                |
> | }                       |
> | {                       |
> |   "b": 128,             |
> |   "g": 0,               |
> |   "r": 255              |
> | }                       |
> +-------------------------+
> ```

---
title: ARRAY_SLICE
source: https://docs.snowflake.com/en/sql-reference/functions/array_slice.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_SLICE

Returns an array constructed from a specified subset of elements of the input array.

## Syntax

```sqlsyntax
ARRAY_SLICE( <array> , <from> , <to> )
```

## Arguments

`array`
:   The source array of which a subset of the elements are used to construct the resulting array.

`from`
:   A position in the source array. The position of the first element is `0`. Elements from positions less than `from`
    aren’t included in the resulting array.

`to`
:   A position in the source array. Elements from positions equal to or greater than `to` are not included in
    the resulting array.

## Returns

This function returns a value of type ARRAY.

Returns NULL if the any argument is NULL, including the input `array`, `from`, or `to`.

## Usage notes

* The output includes elements up to, but not including the element
  specified by the parameter `to`.
* If either `from` or `to` is negative, it is relative to
  the end of the array, not the beginning of the array. For example, `-2` refers
  to the second-from-the-last position in the array.
* If `from` and `to` are both beyond the upper end of the
  array, or are both beyond the lower end of the array, then the result is
  the empty set.
* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.

Note that many of these rules (for example, interpretation of negative numbers as
indexes from the end of the array, and the rule that the slice is up to, but
not including, the `to` index), are similar to the rules for array
slices in programming languages such as Python.

Each of these rules is illustrated in at least one example below.

## Examples

These examples use [ARRAY constants](../data-types-semistructured.md) to construct arrays. Alternatively, you can
use the [ARRAY_CONSTRUCT](array_construct.md) function to construct arrays.

This example shows a simple array slice:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], 0, 2);
```

```output
+------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], 0, 2) |
|------------------------------------|
| [                                  |
|   0,                               |
|   1                                |
| ]                                  |
+------------------------------------+
```

This example slices an array to the last index by using the [ARRAY_SIZE](array_size.md) function with the
ARRAY_SLICE function:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], 3, ARRAY_SIZE([0,1,2,3,4,5,6])) AS slice_to_last_index;
```

```output
+---------------------+
| SLICE_TO_LAST_INDEX |
|---------------------|
| [                   |
|   3,                |
|   4,                |
|   5,                |
|   6                 |
| ]                   |
+---------------------+
```

Although the indexes must be numeric, the elements of the array don’t need
to be numeric:

```sqlexample
SELECT ARRAY_SLICE(['foo','snow','flake','bar'], 1, 3);
```

```output
+-------------------------------------------------+
| ARRAY_SLICE(['FOO','SNOW','FLAKE','BAR'], 1, 3) |
|-------------------------------------------------|
| [                                               |
|   "snow",                                       |
|   "flake"                                       |
| ]                                               |
+-------------------------------------------------+
```

This example shows the effect of using NULL as the input array:

```sqlexample
SELECT ARRAY_SLICE(NULL, 2, 3);
```

```output
+-------------------------+
| ARRAY_SLICE(NULL, 2, 3) |
|-------------------------|
| NULL                    |
+-------------------------+
```

This example shows the effect of using NULL as one of the slice indexes:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], NULL, 2);
```

```output
+---------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], NULL, 2) |
|---------------------------------------|
| NULL                                  |
+---------------------------------------+
```

This example shows the effect of using a negative number as an index. The number
is interpreted as the offset from the end of the array:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], 0, -2);
```

```output
+-------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], 0, -2) |
|-------------------------------------|
| [                                   |
|   0,                                |
|   1,                                |
|   2,                                |
|   3,                                |
|   4                                 |
| ]                                   |
+-------------------------------------+
```

This example shows that both indexes can be negative (that is, both can be relative to the end of
the array):

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], -5, -3);
```

```output
+--------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], -5, -3) |
|--------------------------------------|
| [                                    |
|   2,                                 |
|   3                                  |
| ]                                    |
+--------------------------------------+
```

In this example, both indexes are beyond the end of the array:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], 10, 12);
```

```output
+--------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], 10, 12) |
|--------------------------------------|
| []                                   |
+--------------------------------------+
```

In this example, both indexes are before the start of the array:

```sqlexample
SELECT ARRAY_SLICE([0,1,2,3,4,5,6], -10, -12);
```

```output
+----------------------------------------+
| ARRAY_SLICE([0,1,2,3,4,5,6], -10, -12) |
|----------------------------------------|
| []                                     |
+----------------------------------------+
```

---
title: ARRAY_SORT
source: https://docs.snowflake.com/en/sql-reference/functions/array_sort.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_SORT

Returns an [ARRAY](../data-types-semistructured.md) that contains the elements of the input ARRAY sorted in ascending or descending order.
You can specify whether or not NULL elements are sorted before or after non-NULL elements.

## Syntax

```sqlsyntax
ARRAY_SORT( <array> [ , <sort_ascending> [ , <nulls_first> ] ] )
```

## Arguments

**Required**

`array`
:   The ARRAY of elements to sort.

**Optional**

`sort_ascending`
:   Specifies whether to sort the elements in ascending or descending order:

    * Specify TRUE to sort the elements in ascending order.
    * Specify FALSE to sort the elements in descending order.

    Default: TRUE

`nulls_first`
:   Specifies whether to place SQL NULL elements at the beginning or end of the sorted ARRAY:

    * Specify TRUE to place the SQL NULL elements first in the ARRAY.
    * Specify FALSE to place the SQL NULL elements last in the ARRAY.

    Default: FALSE if the ARRAY is sorted in ascending order; TRUE if the ARRAY is sorted in descending order.

    This argument only affects the order of SQL NULL elements. This does not affect the order of
    [JSON null](../../user-guide/semistructured-considerations.md) elements.

## Returns

This function returns an ARRAY that contains the elements of `array` in sorted order.

## Usage notes

* The sort order is equivalent to the order resulting from [flattening](../../user-guide/querying-semistructured.md) the ARRAY and specifying an
  [ORDER BY](../constructs/order-by.md) clause with the corresponding ASC | DESC and NULLS FIRST | LAST parameters.
* If any of the input arguments is NULL, the function returns NULL.
* This function is not guaranteed to provide a stable sort when the ARRAY contains either of the following:

  + Elements of two different [numeric](../data-types-numeric.md) or [timestamp](../data-types-datetime.md)
    types.
  + Objects containing two different numeric or timestamp types.

## Examples

The following example returns an ARRAY of numbers with the elements from an input [ARRAY constant](../data-types-semistructured.md)
sorted in ascending order. The elements include a JSON NULL (PARSE_JSON(‘null’)) and a SQL NULL.

Note that in the sorted ARRAY, JSON NULLs (`null`) and SQL NULLs (`undefined`) are the last elements.

```sqlexample
SELECT ARRAY_SORT([20, PARSE_JSON('null'), 0, NULL, 10]);
```

```output
+---------------------------------------------------+
| ARRAY_SORT([20, PARSE_JSON('NULL'), 0, NULL, 10]) |
|---------------------------------------------------|
| [                                                 |
|   0,                                              |
|   10,                                             |
|   20,                                             |
|   null,                                           |
|   undefined                                       |
| ]                                                 |
+---------------------------------------------------+
```

The following example returns an ARRAY of numbers with the elements sorted in descending order. Note that in the sorted ARRAY,
JSON NULLs (`null`) and SQL NULLs (`undefined`) are the first elements.

```sqlexample
SELECT ARRAY_SORT([20, PARSE_JSON('null'), 0, NULL, 10], FALSE);
```

```output
+----------------------------------------------------------+
| ARRAY_SORT([20, PARSE_JSON('NULL'), 0, NULL, 10], FALSE) |
|----------------------------------------------------------|
| [                                                        |
|   undefined,                                             |
|   null,                                                  |
|   20,                                                    |
|   10,                                                    |
|   0                                                      |
| ]                                                        |
+----------------------------------------------------------+
```

The following example sorts the elements in ascending order. The example sets the `nulls_first` argument to TRUE to place
the SQL NULLs (`undefined`) first in the sorted ARRAY. (By default, SQL NULLs are placed at the end of an ARRAY sorted in
ascending order.)

Note that `nulls_first` has no effect on the placement of JSON NULLs (`null`).

```sqlexample
SELECT ARRAY_SORT([20, PARSE_JSON('null'), 0, NULL, 10], TRUE, TRUE);
```

```output
+---------------------------------------------------------------+
| ARRAY_SORT([20, PARSE_JSON('NULL'), 0, NULL, 10], TRUE, TRUE) |
|---------------------------------------------------------------|
| [                                                             |
|   undefined,                                                  |
|   0,                                                          |
|   10,                                                         |
|   20,                                                         |
|   null                                                        |
| ]                                                             |
+---------------------------------------------------------------+
```

The following example sorts the elements in descending order. The example sets the `nulls_first` argument to FALSE to
place the SQL NULLs (`undefined`) last in the sorted ARRAY. (By default, SQL NULLs are placed at the beginning of an ARRAY
sorted in descending order.)

Note that `nulls_first` has no effect on the placement of JSON NULLs (`null`).

```sqlexample
SELECT ARRAY_SORT([20, PARSE_JSON('null'), 0, NULL, 10], FALSE, FALSE);
```

```output
+-----------------------------------------------------------------+
| ARRAY_SORT([20, PARSE_JSON('NULL'), 0, NULL, 10], FALSE, FALSE) |
|-----------------------------------------------------------------|
| [                                                               |
|   null,                                                         |
|   20,                                                           |
|   10,                                                           |
|   0,                                                            |
|   undefined                                                     |
| ]                                                               |
+-----------------------------------------------------------------+
```

The following example uses the [ARRAY_INSERT](array_insert.md) function to construct a sparsely populated ARRAY. (The example inserts the
values `1` and `2` at specific positions in the ARRAY.) The example then uses the ARRAY_SORT function to sort this ARRAY.

```sqlexample
SELECT ARRAY_INSERT(ARRAY_INSERT(ARRAY_CONSTRUCT(), 3, 2), 6, 1) arr, ARRAY_SORT(arr);
```

```output
+--------------+-----------------+
| ARR          | ARRAY_SORT(ARR) |
|--------------+-----------------|
| [            | [               |
|   undefined, |   1,            |
|   undefined, |   2,            |
|   undefined, |   undefined,    |
|   2,         |   undefined,    |
|   undefined, |   undefined,    |
|   undefined, |   undefined,    |
|   1          |   undefined     |
| ]            | ]               |
+--------------+-----------------+
```

The following example demonstrates that sorting an ARRAY with different numeric types results in an unstable sort. The example
uses an ARRAY that contains NUMBER values and a REAL value.

```sqlexample
SELECT ARRAY_SORT([1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1e0::REAL]) AS array_of_different_numeric_types;
```

```output
+----------------------------------+
| ARRAY_OF_DIFFERENT_NUMERIC_TYPES |
|----------------------------------|
| [                                |
|   1,                             |
|   1.000000000000000e+00,         |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1,                             |
|   1                              |
| ]                                |
+----------------------------------+
```

---
title: ARRAY_TO_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/array_to_string.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAY_TO_STRING

Returns an input array converted to a string by casting all values to strings (using [TO_VARCHAR](to_char.md)) and concatenating them (using the string from the second argument to separate the
elements).

## Syntax

```sqlsyntax
ARRAY_TO_STRING( <array> , <separator_string> )
```

## Arguments

`array`
:   The array of elements to convert to a string.

`separator_string`
:   The string to put between each element, typically a space, comma, or other human-readable separator.

## Returns

This function returns a value of type VARCHAR.

## Usage notes

* A NULL argument returns NULL as a result.
* A NULL in an array is converted to an empty string in the result.
* To include a blank space between values, you must precede the space with the separator character
  (e.g. `', '`). See the examples below.

## Examples

Return various arrays as concatenated strings:

```sqlexample
SELECT column1,
       ARRAY_TO_STRING(PARSE_JSON(column1), '') AS no_separation,
       ARRAY_TO_STRING(PARSE_JSON(column1), ', ') AS comma_separated
  FROM VALUES
    (NULL),
    ('[]'),
    ('[1]'),
    ('[1, 2]'),
    ('[true, 1, -1.2e-3, "Abc", ["x","y"], {"a":1}]'),
    ('[, 1]'),
    ('[1, ]'),
    ('[1, , ,2]');
```

```output
+-----------------------------------------------+---------------------------------+-------------------------------------------+
| COLUMN1                                       | NO_SEPARATION                   | COMMA_SEPARATED                           |
|-----------------------------------------------+---------------------------------+-------------------------------------------|
| NULL                                          | NULL                            | NULL                                      |
| []                                            |                                 |                                           |
| [1]                                           | 1                               | 1                                         |
| [1, 2]                                        | 12                              | 1, 2                                      |
| [true, 1, -1.2e-3, "Abc", ["x","y"], {"a":1}] | true1-0.0012Abc["x","y"]{"a":1} | true, 1, -0.0012, Abc, ["x","y"], {"a":1} |
| [, 1]                                         | 1                               | , 1                                       |
| [1, ]                                         | 1                               | 1,                                        |
| [1, , ,2]                                     | 12                              | 1, , , 2                                  |
+-----------------------------------------------+---------------------------------+-------------------------------------------+
```

This example returns an array that contains a NULL value as a concatenated string. First, create
a table and insert an array:

```sqlexample
CREATE TABLE test_array_to_string_with_null(a ARRAY);

INSERT INTO test_array_to_string_with_null
  SELECT (['A', NULL, 'B']);
```

Return the array as a concatenated string:

```sqlexample
SELECT a,
       ARRAY_TO_STRING(a, ''),
       ARRAY_TO_STRING(a, ', ')
  FROM test_array_to_string_with_null;
```

```output
+--------------+------------------------+--------------------------+
| A            | ARRAY_TO_STRING(A, '') | ARRAY_TO_STRING(A, ', ') |
|--------------+------------------------+--------------------------|
| [            | AB                     | A, , B                   |
|   "A",       |                        |                          |
|   undefined, |                        |                          |
|   "B"        |                        |                          |
| ]            |                        |                          |
+--------------+------------------------+--------------------------+
```

---
title: ARRAY_UNION_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/array_union_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values) ,
    [Window functions](../functions-window-syntax.md) (Semi-structured Data Aggregation)

# ARRAY_UNION_AGG

Returns an [ARRAY](../data-types-semistructured.md) that contains the union of the distinct values from the input arrays in a column.
You can use this to aggregate distinct values in arrays produced by [ARRAY_UNIQUE_AGG](array_unique_agg.md).

See also:
:   [ARRAY_UNIQUE_AGG](array_unique_agg.md) , [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-arrays-for-distinct-counts.md)

## Syntax

**Aggregate function**

```sqlsyntax
ARRAY_UNION_AGG( <column> )
```

**Window function**

```sqlsyntax
ARRAY_UNION_AGG( <column> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`column`
:   The column containing the arrays with the distinct values (the arrays produced by [ARRAY_UNIQUE_AGG](array_unique_agg.md)).

## Returns

The function returns an array containing the distinct values from the arrays in `column`. The values in the array are in
no particular order, and the order is not deterministic.

Note that this function uses [multiset semantics](https://en.wikipedia.org/wiki/Multiset), which means that the maximum number
of occurrences of an individual value in a single input array determines the number of occurrences of that value in the output
array. See Examples.

The function ignores NULL values in `column` and in the arrays in `column`. If `column` contains only
NULL values or the table containing `column` is empty, the function returns an empty array.

## Usage notes

* This function can be used as either of the following types of functions:

  + [aggregate function](../functions-aggregation.md)
  + [window function](../functions-window-syntax.md).
* When this function is called as a window function, it does not support explicit window frames.
* When you pass a [structured array](../data-types-structured.md) to the function, the function returns a structured
  array of the same type.

## Examples

### Aggregation: Union of arrays

The following example illustrates how the function returns the union of distinct values from two arrays:

```sqlexample
CREATE TABLE union_test(a array);

INSERT INTO union_test
    SELECT PARSE_JSON('[ 1, 1, 2]')
    UNION ALL
    SELECT PARSE_JSON('[ 1, 2, 3]');

SELECT ARRAY_UNION_AGG(a) FROM union_test;
+-------------------------+
| ARRAY_UNION_AGG(A)      |
+-------------------------+
| [ 1, 1, 2, 3]           |
+-------------------------+
```

The operation uses [multiset](https://en.wikipedia.org/wiki/Multiset) semantics. The value `1` appears twice in the output
because it appears twice in one of the input arrays.

See [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-arrays-for-distinct-counts.md).

---
title: ARRAY_UNIQUE_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/array_unique_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values) ,
    [Window functions](../functions-window-syntax.md) (Semi-structured Data Aggregation)

# ARRAY_UNIQUE_AGG

Returns an [ARRAY](../data-types-semistructured.md) that contains all of the distinct values from the specified column.

See also:
:   [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-arrays-for-distinct-counts.md)

## Syntax

**Aggregate function**

```sqlsyntax
ARRAY_UNIQUE_AGG( <column> )
```

**Window function**

```sqlsyntax
ARRAY_UNIQUE_AGG( <column> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`column`
:   The column containing the values.

## Returns

The function returns an array containing the distinct values in the specified column. The values in the array are in no particular
order, and the order is not deterministic.

The function ignores NULL values in `column`. If `column` contains only NULL values or the table containing
`column` is empty, the function returns an empty array.

## Usage notes

* This function can be used as either of the following types of functions:

  + [aggregate function](../functions-aggregation.md)
  + [window function](../functions-window-syntax.md).
* When this function is called as a window function, it does not support explicit window frames.

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

### Aggregation

See [Using Arrays to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-arrays-for-distinct-counts.md).

---
title: ARRAYS_OVERLAP
source: https://docs.snowflake.com/en/sql-reference/functions/arrays_overlap.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAYS_OVERLAP

Compares whether two arrays have at least one element in common. Returns TRUE if there is at least one element in common; otherwise returns FALSE. The function is NULL-safe, meaning it treats NULLs as known values for comparing equality.

See also:
:   [ARRAY_INTERSECTION](array_intersection.md)

## Syntax

```sqlsyntax
ARRAYS_OVERLAP( <array1> , <array2> )
```

## Usage notes

* When you compare objects, the objects must be identical to return TRUE. For details, see Examples (in this topic).

* Both arguments must either be [structured ARRAYs](../data-types-structured.md) or
  [semi-structured ARRAYs](../data-types-semistructured.md).

* If you are passing in structured ARRAYs, the ARRAY in the second argument must be
  [comparable](../data-types-structured.md) to the ARRAY in the first argument.

## Examples

Here are some examples:

> ```sqlexample
> SELECT ARRAYS_OVERLAP(array_construct('hello', 'aloha'),
>                       array_construct('hello', 'hi', 'hey'))
>   AS Overlap;
> +---------+
> | OVERLAP |
> |---------|
> | True    |
> +---------+
> SELECT ARRAYS_OVERLAP(array_construct('hello', 'aloha'),
>                       array_construct('hola', 'bonjour', 'ciao'))
>   AS Overlap;
> +---------+
> | OVERLAP |
> |---------|
> | False   |
> +---------+
> SELECT ARRAYS_OVERLAP(array_construct(object_construct('a',1,'b',2), 1, 2),
>                       array_construct(object_construct('b',2,'c',3), 3, 4))
>   AS Overlap;
> +---------+
> | OVERLAP |
> |---------|
> | False   |
> +---------+
> SELECT ARRAYS_OVERLAP(array_construct(object_construct('a',1,'b',2), 1, 2),
>                       array_construct(object_construct('a',1,'b',2), 3, 4))
>   AS Overlap;
> +---------+
> | OVERLAP |
> |---------|
> | True    |
> +---------+
> ```

The following example shows that NULL values are considered equal to other
NULL values. If each array contains a NULL value, then the arrays overlap, even
if no other (non-NULL) values overlap:

> ```sqlexample
> SELECT ARRAYS_OVERLAP(ARRAY_CONSTRUCT(1, 2, NULL),
>                       ARRAY_CONSTRUCT(3, NULL, 5))
>  AS Overlap;
> +---------+
> | OVERLAP |
> |---------|
> | True    |
> +---------+
> ```

---
title: ARRAYS_TO_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/arrays_to_object.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# ARRAYS_TO_OBJECT

Returns an [OBJECT](../data-types-semistructured.md) that contains the keys specified by one input [ARRAY](../data-types-semistructured.md) and the values
specified by another input ARRAY.

## Syntax

```sqlsyntax
ARRAYS_TO_OBJECT( <key_array> , <value_array> )
```

## Arguments

`key_array`
:   ARRAY of VARCHAR values that specify the keys for the new OBJECT.

`value_array`
:   ARRAY of values for the new OBJECT. This ARRAY must be the same length as `key_array`. The values in this ARRAY should
    correspond to the keys in `key_array`.

## Returns

The function returns a value of the type OBJECT. The OBJECT contains the keys and values specified by the input ARRAYs.

## Usage notes

* If any element in `key_array` is not a string, the function reports the following error:

  ```output
  215002 (22000): Key supplied for ARRAYS_TO_OBJECT does not have string type
  ```
* `key_array` and `value_array` must be equal in length. Otherwise, the function reports the following error:

  ```output
  215001 (22000): Key array and value array had unequal lengths in ARRAYS_TO_OBJECT
  ```
* If an element in `key_array` is NULL, that key and the corresponding value are omitted from the returned OBJECT.

  If the key is not NULL but the corresponding element in `value_array` is NULL, the key and NULL value are included in
  the returned OBJECT.
* The returned OBJECT does not necessarily preserve the original order of the key-value pairs.

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

The following example returns an OBJECT that contains key-value pairs specified by two input ARRAYs:

```sqlexample
SELECT ARRAYS_TO_OBJECT(['key1', 'key2', 'key3'], [1, 2, 3]);
```

```output
+-------------------------------------------------------+
| ARRAYS_TO_OBJECT(['KEY1', 'KEY2', 'KEY3'], [1, 2, 3]) |
|-------------------------------------------------------|
| {                                                     |
|   "key1": 1,                                          |
|   "key2": 2,                                          |
|   "key3": 3                                           |
| }                                                     |
+-------------------------------------------------------+
```

In the following example, the ARRAY of keys includes a NULL value. That key and the corresponding value are omitted from the
returned OBJECT.

```sqlexample
SELECT ARRAYS_TO_OBJECT(['key1', NULL, 'key3'], [1, 2, 3]);
```

```output
+-----------------------------------------------------+
| ARRAYS_TO_OBJECT(['KEY1', NULL, 'KEY3'], [1, 2, 3]) |
|-----------------------------------------------------|
| {                                                   |
|   "key1": 1,                                        |
|   "key3": 3                                         |
| }                                                   |
+-----------------------------------------------------+
```

In the following example, the ARRAY of values includes a NULL value. That value and the corresponding key are included in the
returned OBJECT.

```sqlexample
SELECT ARRAYS_TO_OBJECT(['key1', 'key2', 'key3'], [1, NULL, 3]);
```

```output
+----------------------------------------------------------+
| ARRAYS_TO_OBJECT(['KEY1', 'KEY2', 'KEY3'], [1, NULL, 3]) |
|----------------------------------------------------------|
| {                                                        |
|   "key1": 1,                                             |
|   "key2": null,                                          |
|   "key3": 3                                              |
| }                                                        |
+----------------------------------------------------------+
```

---
title: ARRAYS_ZIP
source: https://docs.snowflake.com/en/sql-reference/functions/arrays_zip.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object Creation and Manipulation)

# ARRAYS_ZIP

Returns an [array](../data-types-semistructured.md) of [objects](../data-types-semistructured.md), each of which contains key-value pairs
for an nth element in the input arrays. For example, in the returned array, the first object contains key-value pairs for each
first element in the input arrays, the second object contains key-value pairs for each second element in the input arrays, and so
on.

## Syntax

```sqlsyntax
ARRAYS_ZIP( <array> [ , <array> ... ] )
```

## Arguments

`array`
:   An input array.

    The input arrays can be of different lengths.

    If any of the input arrays is a [structured array](../data-types-structured.md), all input arrays must be
    structured arrays.

## Returns

Returns a value of one of the following types:

* If the input arrays are semi-structured arrays, the function returns a semi-structured array of structured objects.
* If the input arrays are structured arrays, the function returns a structured array of structured objects. The definition of
  the structured object depends on the number of input arrays and the types of values in the arrays.
* If any of the input arrays is NULL, the function returns NULL.

Each object contains the key-value pairs for the values of an nth element in the input arrays. The key (`$1`, `$2`, and so on)
represents the position of the input array.

For example, suppose that you pass in these arrays:

```sqlexample
SELECT ARRAYS_ZIP(
  [1, 2, 3],
  ['first', 'second', 'third'],
  ['i', 'ii', 'iii']
) AS zipped_arrays;
```

The function returns the following array of objects:

```output
+---------------------+
| ZIPPED_ARRAYS       |
|---------------------|
| [                   |
|   {                 |
|     "$1": 1,        |
|     "$2": "first",  |
|     "$3": "i"       |
|   },                |
|   {                 |
|     "$1": 2,        |
|     "$2": "second", |
|     "$3": "ii"      |
|   },                |
|   {                 |
|     "$1": 3,        |
|     "$2": "third",  |
|     "$3": "iii"     |
|   }                 |
| ]                   |
+---------------------+
```

In the returned array:

* The first object contains the first elements of all input arrays.
* The second object contains the second elements of all input arrays.
* The third object contains the third elements of all input arrays.

The keys in the objects identify the input array:

* The `$1` key-value pairs contain the values from the first input array.
* The `$2` key-value pairs contain the values from the second input array.
* The `$3` key-value pairs contain the values from the third input array.

## Usage notes

* The returned array is as long as the longest input array. If some input arrays are shorter, the function uses a
  [JSON null](../../user-guide/semistructured-considerations.md) for the remaining elements missing in the shorter arrays.
* If the input array includes a NULL element, the function returns a JSON null for that element.

## Examples

The following examples demonstrate how the function works:

* Single input array
* Multiple input arrays
* Input arrays of different lengths
* NULL and empty array handling

### Single input array

The following example returns an array of objects containing the first, second, and third elements in a single array:

```sqlexample
SELECT ARRAYS_ZIP(
  [1, 2, 3]
) AS zipped_array;
```

```output
+--------------+
| ZIPPED_ARRAY |
|--------------|
| [            |
|   {          |
|     "$1": 1  |
|   },         |
|   {          |
|     "$1": 2  |
|   },         |
|   {          |
|     "$1": 3  |
|   }          |
| ]            |
+--------------+
```

### Multiple input arrays

The following example returns an array of objects containing the first, second, and third elements in the input arrays:

```sqlexample
SELECT ARRAYS_ZIP(
  [1, 2, 3],
  [10, 20, 30],
  [100, 200, 300]
) AS zipped_array;
```

```output
+---------------+
| ZIPPED_ARRAY  |
|---------------|
| [             |
|   {           |
|     "$1": 1,  |
|     "$2": 10, |
|     "$3": 100 |
|   },          |
|   {           |
|     "$1": 2,  |
|     "$2": 20, |
|     "$3": 200 |
|   },          |
|   {           |
|     "$1": 3,  |
|     "$2": 30, |
|     "$3": 300 |
|   }           |
| ]             |
+---------------+
```

### Input arrays of different lengths

The following example passes in input arrays of different lengths. For the values absent from the shorter arrays, the function
uses a JSON null in the object.

```sqlexample
SELECT ARRAYS_ZIP(
  [1, 2, 3],
  ['one'],
  ['I', 'II']
) AS zipped_array;
```

```output
+------------------+
| ZIPPED_ARRAY     |
|------------------|
| [                |
|   {              |
|     "$1": 1,     |
|     "$2": "one", |
|     "$3": "I"    |
|   },             |
|   {              |
|     "$1": 2,     |
|     "$2": null,  |
|     "$3": "II"   |
|   },             |
|   {              |
|     "$1": 3,     |
|     "$2": null,  |
|     "$3": null   |
|   }              |
| ]                |
+------------------+
```

### NULL and empty array handling

As shown in the following example, passing in a NULL for any input array causes the function to return a SQL NULL:

```sqlexample
SELECT ARRAYS_ZIP(
  [1, 2, 3],
  NULL,
  [100, 200, 300]
) AS zipped_array;
```

```output
+--------------+
| ZIPPED_ARRAY |
|--------------|
| NULL         |
+--------------+
```

In the following example, all of the input arrays are empty, which causes the function to return an empty object:

```sqlexample
SELECT ARRAYS_ZIP(
  [], [], []
) AS zipped_array;
```

```output
+--------------+
| ZIPPED_ARRAY |
|--------------|
| [            |
|   {}         |
| ]            |
+--------------+
```

In the following example, some of the elements in the input arrays are NULL. In the returned objects, the values for these
elements are JSON nulls:

```sqlexample
SELECT ARRAYS_ZIP(
  [1, NULL, 3],
  [NULL, 20, NULL],
  [100, NULL, 300]
) AS zipped_array;
```

```output
+-----------------+
| ZIPPED_ARRAY    |
|-----------------|
| [               |
|   {             |
|     "$1": 1,    |
|     "$2": null, |
|     "$3": 100   |
|   },            |
|   {             |
|     "$1": null, |
|     "$2": 20,   |
|     "$3": null  |
|   },            |
|   {             |
|     "$1": 3,    |
|     "$2": null, |
|     "$3": 300   |
|   }             |
| ]               |
+-----------------+
```

---
title: AS_<object_type>
source: https://docs.snowflake.com/en/sql-reference/functions/as.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_*<object_type>*

You can use this family of functions to perform strict casting of VARIANT values to values of other data types:

> * [AS_ARRAY](as_array.md)
> * [AS_BINARY](as_binary.md)
> * [AS_BOOLEAN](as_boolean.md)
> * [AS_CHAR , AS_VARCHAR](as_char-varchar.md)
> * [AS_DATE](as_date.md)
> * [AS_DECIMAL , AS_NUMBER](as_decimal-number.md)
> * [AS_DOUBLE , AS_REAL](as_double-real.md)
> * [AS_INTEGER](as_integer.md)
> * [AS_OBJECT](as_object.md)
> * [AS_TIME](as_time.md)
> * [AS_TIMESTAMP_\*](as_timestamp.md)

See also:
:   [IS_<object_type>](is.md)

## General usage notes

* If the type of the value in the VARIANT argument doesn’t match the output
  value, then NULL is returned. For example, if the AS_DATE function is passed a VARIANT value
  that doesn’t contain a DATE value, then NULL is returned.
* If the input is NULL, the output is NULL.

## Examples

The following examples use AS_`object_type` functions.

### Cast values in VARIANT columns to different data types

Create the table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE multiple_types_example (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  char1 VARIANT,
  varchar1 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types_example
  (array1, array2, boolean1, char1, varchar1,
   decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('Y'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the table and cast values in the VARIANT columns to values of different data types:

```sqlexample
SELECT AS_ARRAY(array1) AS array1,
       AS_ARRAY(array2) AS array2,
       AS_BOOLEAN(boolean1) AS boolean,
       AS_CHAR(char1) AS char,
       AS_VARCHAR(varchar1) AS varchar,
       AS_DECIMAL(decimal1, 6, 3) AS decimal,
       AS_DOUBLE(double1) AS double,
       AS_INTEGER(integer1) AS integer,
       AS_OBJECT(object1) AS object
  FROM multiple_types_example;
```

```output
+-------------+-----------------+---------+------+---------+---------+--------+---------+------------------+
| ARRAY1      | ARRAY2          | BOOLEAN | CHAR | VARCHAR | DECIMAL | DOUBLE | INTEGER | OBJECT           |
|-------------+-----------------+---------+------+---------+---------+--------+---------+------------------|
| [           | [               | True    | X    | Y       |   1.230 |   3.21 |      15 | {                |
|   "Example" |   "Array-like", |         |      |         |         |        |         |   "Tree": "Pine" |
| ]           |   "example"     |         |      |         |         |        |         | }                |
|             | ]               |         |      |         |         |        |         |                  |
+-------------+-----------------+---------+------+---------+---------+--------+---------+------------------+
```

### Compute the average of numeric values in a VARIANT column

Compute the average of all numeric values from a VARIANT column in the `vartab` table:

Create the table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Show the data types of the values (some of which are numeric):

```sqlexample
SELECT n, AS_REAL(v), TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------+------------+
| N | AS_REAL(V) | TYPEOF(V)  |
|---+------------+------------|
| 1 |       NULL | NULL_VALUE |
| 2 |       NULL | NULL       |
| 3 |       NULL | BOOLEAN    |
| 4 |     -17    | INTEGER    |
| 5 |     123.12 | DECIMAL    |
| 6 |     191.2  | DOUBLE     |
| 7 |       NULL | VARCHAR    |
| 8 |       NULL | ARRAY      |
| 9 |       NULL | OBJECT     |
+---+------------+------------+
```

Use the AS_REAL function with the [AVG](avg.md) function to compute the average of all numeric values
from the VARIANT column `v`:

```sqlexample
SELECT AVG(AS_REAL(v)) FROM vartab;
```

```output
+-----------------+
| AVG(AS_REAL(V)) |
|-----------------|
|    99.106666667 |
+-----------------+
```

---
title: AS_ARRAY
source: https://docs.snowflake.com/en/sql-reference/functions/as_array.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_ARRAY

Casts a [VARIANT](../data-types-semistructured.md) value to an [ARRAY](../data-types-semistructured.md) value.

See also:
:   [AS_<object_type>](as.md) , [AS_OBJECT](as_object.md)

## Syntax

```sqlsyntax
AS_ARRAY( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type ARRAY or NULL:

* If the type of the value in the `variant_expr` argument is ARRAY, the function returns a value of type ARRAY.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Usage notes

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_array_example (
  array1 VARIANT,
  array2 VARIANT);

INSERT INTO as_array_example (array1, array2)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example'));
```

Use the AS_ARRAY function in a query to cast a VARIANT value to ARRAY values:

```sqlexample
SELECT AS_ARRAY(array1) AS array1,
       AS_ARRAY(array2) AS array2
  FROM as_array_example;
```

```output
+-------------+-----------------+
| ARRAY1      | ARRAY2          |
|-------------+-----------------|
| [           | [               |
|   "Example" |   "Array-like", |
| ]           |   "example"     |
|             | ]               |
+-------------+-----------------+
```

---
title: AS_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/as_binary.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_BINARY

Casts a [VARIANT](../data-types-semistructured.md) value to a [BINARY](../data-types-text.md) value.

See also:
:   [AS_<object_type>](as.md)

## Syntax

```sqlsyntax
AS_BINARY( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type BINARY or NULL:

* If the type of the value in the `variant_expr` argument is BINARY, the function returns a value of type BINARY.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_binary_example (binary1 VARIANT);

INSERT INTO as_binary_example (binary1)
  SELECT TO_VARIANT(TO_BINARY('F0A5'));
```

Use the AS_BINARY function in a query to cast a VARIANT value to a BINARY value:

```sqlexample
SELECT AS_BINARY(binary1) AS binary_value
  FROM as_binary_example;
```

```output
+--------------+
| BINARY_VALUE |
|--------------|
| F0A5         |
+--------------+
```

---
title: AS_BOOLEAN
source: https://docs.snowflake.com/en/sql-reference/functions/as_boolean.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_BOOLEAN

Casts a [VARIANT](../data-types-semistructured.md) value to a [BOOLEAN](../data-types-logical.md) value.

See also:
:   [AS_<object_type>](as.md)

## Syntax

```sqlsyntax
AS_BOOLEAN( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type BOOLEAN or NULL:

* If the type of the value in the `variant_expr` argument is BOOLEAN, the function returns a value of type BOOLEAN.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_boolean_example (
  boolean1 VARIANT,
  boolean2 VARIANT);

INSERT INTO as_boolean_example (boolean1, boolean2)
  SELECT
    TO_VARIANT(TO_BOOLEAN(TRUE)),
    TO_VARIANT(TO_BOOLEAN(FALSE));
```

Use the AS_BOOLEAN function in a query to cast VARIANT values to BOOLEAN values:

```sqlexample
SELECT AS_BOOLEAN(boolean1) boolean_true,
       AS_BOOLEAN(boolean2) boolean_false
  FROM as_boolean_example;
```

```output
+--------------+---------------+
| BOOLEAN_TRUE | BOOLEAN_FALSE |
|--------------+---------------|
| True         | False         |
+--------------+---------------+
```

---
title: AS_CHAR , AS_VARCHAR
source: https://docs.snowflake.com/en/sql-reference/functions/as_char-varchar.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_CHAR , AS_VARCHAR

Casts a [VARIANT](../data-types-semistructured.md) value to a [VARCHAR](../data-types-text.md) value. This function
only converts [CHAR](../data-types-text.md) and VARCHAR values.

The AS_CHAR and AS_VARCHAR functions are synonymous.

The CHAR data type is synonymous with the VARCHAR data type, except for its default length.

See also:
:   [AS_<object_type>](as.md)

## Syntax

```sqlsyntax
AS_CHAR( <variant_expr> )

AS_VARCHAR( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type VARCHAR or NULL:

* If the type of the value in the `variant_expr` argument is CHAR or VARCHAR, the function returns a value of type VARCHAR.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_varchar_example (varchar1 VARIANT);

INSERT INTO as_varchar_example (varchar1)
  SELECT TO_VARIANT('My VARCHAR value');
```

Use the AS_VARCHAR function in a query to cast a VARIANT value to a VARCHAR value:

```sqlexample
SELECT AS_VARCHAR(varchar1) varchar_value
  FROM as_varchar_example;
```

```output
+------------------+
| VARCHAR_VALUE    |
|------------------|
| My VARCHAR value |
+------------------+
```

---
title: AS_DATE
source: https://docs.snowflake.com/en/sql-reference/functions/as_date.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_DATE

Casts a [VARIANT](../data-types-semistructured.md) value to a [DATE](../data-types-datetime.md) value. This function does not convert values of
other data types, including timestamps, to DATE values.

See also:
:   [AS_<object_type>](as.md)

## Syntax

```sqlsyntax
AS_DATE( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type DATE or NULL:

* If the type of the value in the `variant_expr` argument is DATE, the function returns a value of type DATE.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_date_example (date1 VARIANT);

INSERT INTO as_date_example (date1)
 SELECT TO_VARIANT(TO_DATE('2024-10-10'));
```

Use the AS_DATE function in a query to cast a VARIANT value to a DATE value:

```sqlexample
SELECT AS_DATE(date1) date_value
  FROM as_date_example;
```

```output
+------------+
| DATE_VALUE |
|------------|
| 2024-10-10 |
+------------+
```

---
title: AS_DECIMAL , AS_NUMBER
source: https://docs.snowflake.com/en/sql-reference/functions/as_decimal-number.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_DECIMAL , AS_NUMBER

Casts a [VARIANT](../data-types-semistructured.md) value to a fixed-point [NUMBER](../data-types-numeric.md) value, with optional precision and scale.
This function doesn’t cast floating-point values.

AS_DECIMAL is a synonym for AS_NUMBER.

The [DECIMAL](../data-types-numeric.md) data type is synonymous with the NUMBER data type.

See also:
:   [AS_<object_type>](as.md) , [AS_DOUBLE , AS_REAL](as_double-real.md) , [AS_INTEGER](as_integer.md)

## Syntax

```sqlsyntax
AS_DECIMAL( <variant_expr> [ , <precision> [ , <scale> ] ] )

AS_NUMBER( <variant_expr> [ , <precision> [ , <scale> ] ] )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

`precision`
:   The number of significant digits of the decimal number to store.

    The default is `38`.

`scale`
:   The number of significant digits after the decimal point.

    The default is `0`.

## Returns

The function returns a value of type NUMBER or NULL:

* If the type of the value in the `variant_expr` argument is DECIMAL or NUMBER, the function returns a value of type NUMBER.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Usage notes

When reducing scale, this function rounds the result, which can cause out-of-range errors.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_number_example (number1 VARIANT);

INSERT INTO as_number_example (number1)
  SELECT TO_VARIANT(TO_NUMBER(2.34, 6, 3));
```

Use the AS_NUMBER function in a query to cast a VARIANT value to a NUMBER value:

```sqlexample
SELECT AS_NUMBER(number1, 6, 3) number_value
  FROM as_number_example;
```

```output
+--------------+
| NUMBER_VALUE |
|--------------|
|        2.340 |
+--------------+
```

---
title: AS_DOUBLE , AS_REAL
source: https://docs.snowflake.com/en/sql-reference/functions/as_double-real.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_DOUBLE , AS_REAL

Casts a [VARIANT](../data-types-semistructured.md) value to a [floating-point value](../data-types-numeric.md).

AS_DOUBLE is a synonym for AS_REAL.

The [DOUBLE and REAL](../data-types-numeric.md) data types are synonymous with the FLOAT data type.

See also:
:   [AS_<object_type>](as.md) , [AS_DECIMAL , AS_NUMBER](as_decimal-number.md) , [AS_INTEGER](as_integer.md)

## Syntax

```sqlsyntax
AS_DOUBLE( <variant_expr> )

AS_REAL( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a floating-point value or NULL:

* If the type of the value in the `variant_expr` argument is a floating-point value, the function returns the floating-point value.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_double_example (double1 VARIANT);

INSERT INTO as_double_example (double1)
  SELECT TO_VARIANT(TO_DOUBLE(1.23));
```

Use the AS_DOUBLE function in a query to cast a VARIANT value to a DOUBLE value:

```sqlexample
SELECT AS_DOUBLE(double1) double_value
  FROM as_double_float_example;
```

```output
+--------------+
| DOUBLE_VALUE |
|--------------|
|         1.23 |
+--------------+
```

---
title: AS_INTEGER
source: https://docs.snowflake.com/en/sql-reference/functions/as_integer.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_INTEGER

Casts a [VARIANT](../data-types-semistructured.md) value to an [INTEGER](../data-types-numeric.md). The function does
not cast non-integer values.

The INTEGER data type is synonymous with the [NUMBER](../data-types-numeric.md) data type, except that precision
and scale can’t be specified for INTEGER values.

See also:
:   [AS_<object_type>](as.md)

    [AS_DECIMAL , AS_NUMBER](as_decimal-number.md) , [AS_DOUBLE , AS_REAL](as_double-real.md)

## Syntax

```sqlsyntax
AS_INTEGER( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type INTEGER or NULL:

* If the type of the value in the `variant_expr` argument is INTEGER, the function returns a value of type INTEGER.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_integer_example (integer1 VARIANT);

INSERT INTO as_integer_example (integer1)
  SELECT TO_VARIANT(15);
```

Use the AS_INTEGER function in a query to cast a VARIANT value to an INTEGER value:

```sqlexample
SELECT AS_INTEGER(integer1) AS integer_value
  FROM as_integer_example;
```

```output
+---------------+
| INTEGER_VALUE |
|---------------|
|            15 |
+---------------+
```

---
title: AS_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/as_object.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_OBJECT

Casts a [VARIANT](../data-types-semistructured.md) value to an [OBJECT](../data-types-semistructured.md) value.

See also:
:   [AS_<object_type>](as.md) , [AS_ARRAY](as_array.md)

## Syntax

```sqlsyntax
AS_OBJECT( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type OBJECT or NULL:

* If the type of the value in the `variant_expr` argument is OBJECT, the function returns a value of type OBJECT.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Usage notes

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_object_example (object1 VARIANT);

INSERT INTO as_object_example (object1)
  SELECT TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Use the AS_OBJECT function in a query to cast a VARIANT value to an OBJECT value:

```sqlexample
SELECT AS_OBJECT(object1) AS object_value
  FROM as_object_example;
```

```output
+------------------+
| OBJECT_VALUE     |
|------------------|
| {                |
|   "Tree": "Pine" |
| }                |
+------------------+
```

---
title: AS_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/as_time.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_TIME

Casts a [VARIANT](../data-types-semistructured.md) value to a [TIME](../data-types-datetime.md) value. This function does not convert values of
other data types, including timestamps, to TIME values.

See also:
:   [AS_<object_type>](as.md)

    [AS_DATE](as_date.md) , [AS_TIMESTAMP_\*](as_timestamp.md)

## Syntax

```sqlsyntax
AS_TIME( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of type TIME or NULL:

* If the type of the value in the `variant_expr` argument is TIME, the function returns a value of type TIME.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_time_example (time1 VARIANT);

INSERT INTO as_time_example (time1)
  SELECT TO_VARIANT(TO_TIME('12:34:56'));
```

Use the AS_TIME function in a query to cast a VARIANT value to a TIME value:

```sqlexample
SELECT AS_TIME(time1) AS time_value
  FROM as_time_example;
```

```output
+------------+
| TIME_VALUE |
|------------|
| 12:34:56   |
+------------+
```

---
title: AS_TIMESTAMP_*
source: https://docs.snowflake.com/en/sql-reference/functions/as_timestamp.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# AS_TIMESTAMP_\*

Casts a [VARIANT](../data-types-semistructured.md) value to the respective
[timestamp](../data-types-datetime.md) value:

* AS_TIMESTAMP_LTZ (value with local time zone)
* AS_TIMESTAMP_NTZ (value with no time zone)
* AS_TIMESTAMP_TZ (value with time zone)

See also:
:   [AS_<object_type>](as.md) , [AS_DATE](as_date.md) , [AS_TIME](as_time.md)

## Syntax

```sqlsyntax
AS_TIMESTAMP_LTZ( <variant_expr> )

AS_TIMESTAMP_NTZ( <variant_expr> )

AS_TIMESTAMP_TZ( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

The function returns a value of a timestamp type or NULL:

* If the type of the value in the `variant_expr` argument is a timestamp type, the function returns a value of same timestamp type.

* If the type of the value in the `variant_expr` argument doesn’t match the type of the output
  value, the function returns NULL.
* If the `variant_expr` argument is NULL, the function returns NULL.

## Examples

Create a table and load data into it:

```sqlexample
CREATE OR REPLACE TABLE as_timestamp_example (timestamp1 VARIANT);

INSERT INTO as_timestamp_example (timestamp1)
  SELECT TO_VARIANT(TO_TIMESTAMP_NTZ('2024-10-10 12:34:56'));
```

Use the AS_TIMESTAMP_NTZ function in a query to cast a VARIANT value to a TIMESTAMP_NTZ value:

```sqlexample
SELECT AS_TIMESTAMP_NTZ(timestamp1) AS timestamp_value
  FROM as_timestamp_example;
```

```output
+-------------------------+
| TIMESTAMP_VALUE         |
|-------------------------|
| 2024-10-10 12:34:56.000 |
+-------------------------+
```

---
title: ASCII
source: https://docs.snowflake.com/en/sql-reference/functions/ascii.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# ASCII

Returns the ASCII code for the first character of a string. If the string is empty, a value of `0` is returned.

See also:

> [CHAR](chr.md) , [UNICODE](unicode.md)

## Syntax

```sqlsyntax
ASCII( <input> )
```

## Arguments

`input`
:   The string for which the ASCII code for the first character in the string is returned.

## Returns

The value is an integer that is the numeric representation of the ASCII character. For example, if the
input is the letter ‘a’, then the return value is 97.

## Usage notes

The value 0 is returned for either of the following cases:

* The first character of the string contains the ASCII character corresponding to 0.
* The string is empty.

To distinguish between these two cases, use the LENGTH function to determine whether the string is empty.

## Examples

This example demonstrates the behavior for single ASCII characters, as well as special cases, such as multi-character strings, empty strings, and NULL values:

> ```sqlexample
> SELECT column1, ASCII(column1)
>   FROM (values('!'), ('A'), ('a'), ('bcd'), (''), (null));
> +---------+----------------+
> | COLUMN1 | ASCII(COLUMN1) |
> |---------+----------------|
> | !       |             33 |
> | A       |             65 |
> | a       |             97 |
> | bcd     |             98 |
> |         |              0 |
> | NULL    |           NULL |
> +---------+----------------+
> ```

---
title: ASIN
source: https://docs.snowflake.com/en/sql-reference/functions/asin.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ASIN

Computes the inverse sine (arc sine) of its argument; the result is a number in the interval `[-pi/2, pi/2]`.

## Syntax

```sqlsyntax
ASIN( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. Must be greater than or equal to -1.0 and
    less than or equal to +1.0. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

Returns the arc sine in radians (not degrees) in the range `[-pi/2, pi/2]`.

## Examples

```sqlexample
SELECT ASIN(0), ASIN(0.5), ASIN(1);
```

```output
+---------+--------------+-------------+
| ASIN(0) |    ASIN(0.5) |     ASIN(1) |
|---------+--------------+-------------|
|       0 | 0.5235987756 | 1.570796327 |
+---------+--------------+-------------+
```

---
title: ASINH
source: https://docs.snowflake.com/en/sql-reference/functions/asinh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ASINH

Computes the inverse (arc) hyperbolic sine of its argument.

## Syntax

```sqlsyntax
ASINH( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT ASINH(2.129279455);
```

```output
+--------------------+
| ASINH(2.129279455) |
|--------------------|
|                1.5 |
+--------------------+
```

---
title: ATAN
source: https://docs.snowflake.com/en/sql-reference/functions/atan.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ATAN

Computes the inverse tangent (arc tangent) of its argument; the result is a number in the interval `[-pi, pi]`.

## Syntax

```sqlsyntax
ATAN( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

Returns the arc tangent in radians (not degrees) in the range `[-pi, pi]`.

## Examples

```sqlexample
SELECT ATAN(1);
```

```output
+--------------+
|      ATAN(1) |
|--------------|
| 0.7853981634 |
+--------------+
```

---
title: ATAN2
source: https://docs.snowflake.com/en/sql-reference/functions/atan2.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ATAN2

Computes the inverse tangent (arc tangent) of the ratio of its two arguments.
For example, if x > 0, then the expression `ATAN2(y, x)` is equivalent to `ATAN(y/x)`.

The arc tangent is the angle between:

* The X axis.
* The ray from the point (0,0) to the point (X, Y) (where X and Y are not both 0).

See also:
:   [ATAN](atan.md)

## Syntax

```sqlsyntax
ATAN2( <y> , <x> )
```

Note that the first parameter is the Y coordinate, not the X coordinate.

## Arguments

`y`
:   This parameter is the Y coordinate of the point at the end of the ray. The data type must be FLOAT.

`x`
:   This parameter is the X coordinate of the point at the end of the ray. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

The returned value is in radians, not degrees.

The returned value is a number in the interval `[-pi, pi]`.

## Usage notes

* If the data type of an argument is a numeric data type other than DOUBLE, then the value is converted to DOUBLE.
* If the data type of an argument is string, the value is converted to DOUBLE if possible.
* If the data type of an argument is any other data type, the function returns an error.
* If either argument is NULL, the returned value is NULL.

## Examples

```sqlexample
SELECT ATAN2(5, 5);
```

```output
+--------------+
|  ATAN2(5, 5) |
|--------------|
| 0.7853981634 |
+--------------+
```

---
title: ATANH
source: https://docs.snowflake.com/en/sql-reference/functions/atanh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# ATANH

Computes the inverse (arc) hyperbolic tangent of its argument.

## Syntax

```sqlsyntax
ATANH( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. Must be a value between -1.0 and +1.0
    (inclusive). The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT ATANH(0.9051482536);
```

```output
+---------------------+
| ATANH(0.9051482536) |
|---------------------|
|                 1.5 |
+---------------------+
```

---
title: AUTO_REFRESH_REGISTRATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/auto_refresh_registration_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# AUTO_REFRESH_REGISTRATION_HISTORY

This table function can be used to query the history of data files registered in the metadata for a specified external table
or directory table and the credits billed
for these operations. The table function returns the billing history for a specified range
within the last 14 days for your entire Snowflake account.

> **Note:**
>
> To retrieve refresh history information for an Apache Iceberg™ table,
> see [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](iceberg_table_snapshot_refresh_history.md) instead.

## Syntax

```sqlsyntax
AUTO_REFRESH_REGISTRATION_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ]
      [, OBJECT_TYPE => '<string>' [, OBJECT_NAME => '<string>'] ])
```

## Arguments

All of the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range of the billing window:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to
      show the previous 10 minutes of the billing history). For example, if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default
      `DATE_RANGE_START` is 11:50 PM on the previous day.

    History is displayed in increments of 5 minutes, 1 hour, or 24 hours (depending on the length of the specified range).

`OBJECT_TYPE => string`
:   Type of object for which credits are billed. The following value is supported:

    `DIRECTORY_TABLE`
    :   Directory tables that are configured for automatic metadata refreshes.

    `EXTERNAL_TABLE`
    :   External tables that are configured for automatic metadata refreshes.

`OBJECT_NAME => string`
:   A string specifying the name of the external table or directory table for which credits are billed.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified billing window. |
| END_TIME | TIMESTAMP_LTZ | End of the specified billing window. |
| OBJECT_NAME | TEXT | Name of the object for which credits are billed. |
| OBJECT_TYPE | TEXT | Type of object for which credits are billed. |
| CREDITS_USED | TEXT | Number of credits billed for data files registered in the metadata of the specified object or object type during the START_TIME and END_TIME window. |
| FILES_REGISTERED | NUMBER | Number of files registered during the START_TIME and END_TIME window. |

## Examples

Note that all of the examples in this topic reference external table metadata. To retrieve similar history records for
other object types, edit the `OBJECT_TYPE => string` value in the query.

Retrieve the billing history for all external tables in your account that are configured for automatic metadata refreshes. The query retrieves
the history for a 30 minute range, in 5 minute periods:

> ```sqlexample
> select *
>   from table(information_schema.auto_refresh_registration_history(
>     date_range_start=>to_timestamp_tz('2021-06-17 12:00:00.000 -0700'),
>     date_range_end=>to_timestamp_tz('2021-06-17 12:30:00.000 -0700'),
>     object_type=>'external_table'));
> ```

Same as the previous example, but retrieves the billing history for the last 14 days, in 1 day periods:

> ```sqlexample
> select *
>   from table(information_schema.auto_refresh_registration_history(
>     date_range_start=>dateadd('day',-14,current_date()),
>     date_range_end=>current_date(),
>     object_type=>'external_table'));
> ```

Same as the first example, but retrieves the billing history for the last 14 days, in 1 day periods:

> ```sqlexample
> select *
>   from table(information_schema.auto_refresh_registration_history(
>     date_range_start=>dateadd('day',-14,current_date()),
>     date_range_end=>current_date(),
>     object_type=>'external_table'));
> ```

Retrieve the billing history for an external table named `myexttable` in the active schema in the session for the last 12 hours, in 1
hour periods:

> ```sqlexample
> select *
>   from table(information_schema.auto_refresh_registration_history(
>     date_range_start=>dateadd('hour',-12,current_timestamp()),
>     object_type=>'external_table',
>     object_name=>'myexttable'));
> ```

Retrieve the billing history for an external table named `myexttable` in the `mydb.myschema` schema for the last 12 hours, in 1 hour
periods:

> ```sqlexample
> select *
>   from table(information_schema.auto_refresh_registration_history(
>     date_range_start=>dateadd('hour',-12,current_timestamp()),
>     object_type=>'external_table',
>     object_name=>'mydb.myschema.myexttable'));
> ```

---
title: AUTOMATIC_CLUSTERING_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/automatic_clustering_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# AUTOMATIC_CLUSTERING_HISTORY

This table function is used for querying the [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) history for given tables within a specified date range. The information returned by the function includes the
credits consumed, bytes updated, and rows updated each time a table is reclustered.

## Syntax

```sqlsyntax
AUTOMATIC_CLUSTERING_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , TABLE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range to display the Automatic Clustering history.
    For example, if you specify that the start date is 2019-04-03 and the end date is 2019-04-05, then you get data for
    April 3, April 4, and April 5. (The endpoints are included.)

    * If neither a start date nor an end date is specified, the default is the last 12 hours.
    * If an end date is not specified, but a start date is specified, then [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If a start date is not specified, but an end date is specified, then the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

`TABLE_NAME => string`
:   Table name. If specified, only shows the history for the specified table.
    The table name can include the schema name and the database name.

    If a table name is not specified, then the results include history for each table maintained by the
    Automatic Clustering Service within the specified time range.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.

  > **Note:**
  >
  > A role with the MONITOR USAGE privilege can view per-object credit usage, but not object names. The role must also be granted SELECT on an object in order for its name to be returned by this function. If the role does not have sufficient privileges to see the object name, the object name might be displayed with a substitute name such as “unknown_#”, where “#” represents one or more digits.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* The history is displayed in increments of 1 hour.
* A row might be clustered multiple times, depending on data skew, clustering key distribution, and reordering required for micro-partitions. A large table with poor initial clustering might need multiple passes to reach an optimally clustered state. Therefore, the NUM_ROWS_RECLUSTERED value for a table could be as high as the total number of rows in the table or even higher.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| TABLE_NAME | TEXT | Name of the table. Displays NULL if no table name is specified in the function, in which case either row includes the totals for all tables in use within the time range. |
| CREDITS_USED | NUMBER | Number of credits billed for automatic clustering during the START_TIME and END_TIME window. |
| NUM_BYTES_RECLUSTERED | NUMBER | Number of bytes reclustered during the START_TIME and END_TIME window. |
| NUM_ROWS_RECLUSTERED | NUMBER | Number of rows reclustered during the START_TIME and END_TIME window. |

## Examples

Retrieve the automatic clustering history for a one-hour range for your account:

> ```sqlexample
> select *
>   from table(information_schema.automatic_clustering_history(
>     date_range_start=>'2018-04-10 13:00:00.000 -0700',
>     date_range_end=>'2018-04-10 14:00:00.000 -0700'));
> ```

Retrieve the automatic clustering history for the last 12 hours, in 1 hour periods, for your account:

> ```sqlexample
> select *
>   from table(information_schema.automatic_clustering_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the automatic clustering history for the past week for your account:

> ```sqlexample
> select *
>   from table(information_schema.automatic_clustering_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date));
> ```

Retrieve the automatic clustering history for the past week for a specified table in your account:

> ```sqlexample
> select *
>   from table(information_schema.automatic_clustering_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date,
>     table_name=>'mydb.myschema.mytable'));
> ```

---
title: AVAILABLE_LISTING_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/available_listing_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# AVAILABLE_LISTING_REFRESH_HISTORY

Returns the past 14 days of refresh history for an available listing or a database mounted from a listing using cross-cloud
auto-fulfillment. The information returned contains replication details for data added to the listing database in each refresh event. This
function is available to consumers of listings who have any privilege on the available listing or mounted database.

## Syntax

```sqlsyntax
AVAILABLE_LISTING_REFRESH_HISTORY(
  OBJECT_TYPE => '<object_type>',
  OBJECT_NAME => '<object_name>' )
```

## Arguments

`OBJECT_TYPE => 'object_type'`
:   Type of the object, either `listing` or `database`.

`OBJECT_NAME => 'object_name'`
:   Name of the object, which can be either the listing’s global name or the mounted database name, depending on the object type.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| OBJECT_TYPE | TEXT | Lists the type of Snowflake object. For example, listing. |
| OBJECT_NAME | TEXT | Name of the listing or the mounted database. |
| PHASE | TEXT | Current phase in the replication operation, represented as one phase out of a total of X phases. For example, 2/6. |
| PHASE_NAME | TEXT | Name of the replication phases completed (or in progress) so far.  For the list of phases, see usage notes. |
| PROGRESS | TEXT | PRIMARY_UPLOADING_DATA: Percentage of total bytes replicated.  SECONDARY_DOWNLOADING_METADATA: Percentage of the total number of objects replicated.  SECONDARY_DOWNLOADING_DATA: Percentage of total bytes replicated.  Empty for remaining phases. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication phase began. |
| END_TIME | TIMESTAMP_LTZ | Time when the phase finished, if applicable.  NULL if the phase is in progress or is the terminating phase (`COMPLETED/FAILED/CANCELED`). |
| JOB_UUID | TEXT | Query ID for the refresh job. |
| PRIMARY_SNAPSHOT_TIMESTAMP | TIMESTAMP_LTZ | Timestamp when the primary snapshot was created. |
| ERROR | VARIANT | NULL if the refresh operation is successful. If the refresh operation fails, returns a JSON object that provides detailed information about the error:   * `errorCode`: Error code of the failure. * `errorMessage`: Error message of the failure. |

## Usage notes

* Only returns rows for a role with any privilege on the listing, if the listing is visible to the account.
* When `object_type` is set to `database` (as opposed to `listing`), only rows for roles with any privilege on that database are returned.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more information, see [Information Schema](../info-schema.md).

* Phase list in the order processed:

  1. SECONDARY_SYNCHRONIZING_MEMBERSHIP
  2. SECONDARY_UPLOADING_INVENTORY
  3. PRIMARY_UPLOADING_METADATA
  4. PRIMARY_UPLOADING_DATA
  5. SECONDARY_DOWNLOADING_METADATA
  6. SECONDARY_DOWNLOADING_DATA
  7. COMPLETED / FAILED / CANCELED

## Examples

Retrieve the history for the database `my_mounted_database`.

```sqlexample
SELECT * FROM TABLE(
  INFORMATION_SCHEMA.AVAILABLE_LISTING_REFRESH_HISTORY(
    OBJECT_TYPE=>'database',
    OBJECT_NAME=>'my_mounted_database'
  )
);
```

---
title: AVAILABLE_LISTINGS
source: https://docs.snowflake.com/en/sql-reference/functions/available_listings.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# AVAILABLE_LISTINGS

Returns all listings that are available for the consumer to discover and access.

## Syntax

```sqlsyntax
AVAILABLE_LISTINGS(
      [ IS_IMPORTED => { TRUE | FALSE | NULL } ]
      [ , IS_ORGANIZATION => { TRUE | FALSE | NULL } ]
      [ , IS_SHARED_WITH_ME => { TRUE | FALSE | NULL } ] )
```

## Arguments

You can optionally specify the following arguments to filter listings in this view.

> **Note:**
>
> Only one of the arguments can be `TRUE` at a time.

`IS_IMPORTED => { TRUE | FALSE | NULL }`
:   Set to `TRUE` to return only imported listings; set to `FALSE` or `NULL` to return all listings.

    Default: `NULL`.

`IS_ORGANIZATION => { TRUE | FALSE | NULL }`
:   Set to `TRUE` to return only organization listings; set to `FALSE` or `NULL` to return all listings.

    Default: `NULL`.

`IS_SHARED_WITH_ME => { TRUE | FALSE | NULL }`
:   Set to `TRUE` to return only listings that have been shared privately with the current account; set to `FALSE` or `NULL` to return all listings.

    Default: `NULL`.

## Output

The function returns the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| GLOBAL_NAME | VARCHAR | The global name of the listing. |
| CREATED_ON | TIMESTAMP_LTZ | The timestamp when the listing was created. |
| TITLE | VARCHAR | The title of the listing. |
| SUBTITLE | VARCHAR | The subtitle of the listing. |
| DESCRIPTION | VARCHAR | The description of the listing. |
| IS_MONETIZED | BOOLEAN | Indicates whether the listing is monetized. |
| IS_BY_REQUEST | BOOLEAN | Indicates whether the listing is by request (personalized listing). |
| IS_LIMITED_TRIAL | BOOLEAN | Indicates whether the listing is limited trial. |
| IS_READY_FOR_IMPORT | BOOLEAN | Indicates whether the listing is ready for import. |
| IS_IMPORTED | BOOLEAN | Indicates whether the listing has been imported. |
| IS_APPLICATION | BOOLEAN | Indicates whether the listing is associated with an application. |
| IS_PRIVATE | BOOLEAN | Indicates whether the listing is private. |
| CATEGORIES | VARCHAR | Categories associated with the listing. |
| DATA_ATTRIBUTES | VARCHAR | Data attributes associated with the listing. |
| TERMS | VARCHAR | Terms of service for the listing. |
| RESOURCES | VARCHAR | Resources associated with the listing. |
| DISTRIBUTION | VARCHAR | The distribution of the listing. Possible values are `EXTERNAL` and `ORGANIZATION`. |
| UNIFORM_LISTING_LOCATOR | VARCHAR | The uniform listing locator (ULL) of the listing. |
| ORGANIZATION_PROFILE_NAME | VARCHAR | The organization profile attached to the listing, if any. |
| IS_DISCOVERY_ONLY | BOOLEAN | Indicates whether the listing is discovery only. |
| SUPPORT_CONTACT | VARCHAR | The support contact information associated with the listing. |
| REQUEST_APPROVAL_TYPE | VARCHAR | The request approval type of the listing. Incidates whether the consumer listing requests will be approved within or outside of Snowflake. |
| IS_CORTEX_KNOWLEDGE_EXTENSION | BOOLEAN | Indicates whether this listing has Cortex Search services attached. |
| PROVIDER_COMPANY_NAME | VARCHAR | The company name of the listing provider. |
| RESHARING | VARCHAR | Resharing configuration of the listing. |

## Examples

Retrieve all available listings in the current account:

```sqlexample
SELECT * FROM TABLE(<any_database>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS());
```

Retrieve all available listings that have been imported by the current account:

```sqlexample
SELECT * FROM TABLE(<any_database>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS(IS_IMPORTED => TRUE));
```

Retrieve all available organization listings in the current account:

```sqlexample
SELECT * FROM TABLE(<any_database>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS(IS_ORGANIZATION => TRUE));
```

Retrieve all available listings that have been shared privately with the current account:

```sqlexample
SELECT * FROM TABLE(<any_database>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS(IS_SHARED_WITH_ME => TRUE));
```

---
title: AVG
source: https://docs.snowflake.com/en/sql-reference/functions/avg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md) (General, Window Frame)

# AVG

Returns the average of non-NULL records. If all records inside a group are NULL, the function returns NULL.

## Syntax

**Aggregate function**

```sqlsyntax
AVG( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
AVG( [ DISTINCT ] <expr1> ) OVER (
                                 [ PARTITION BY <expr2> ]
                                 [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                 )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   This is an expression that evaluates to a numeric data type (INTEGER, FLOAT, DECIMAL, etc.).

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition.

## Usage notes

* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

## Examples

Setup:

```sqlexample
CREATE OR REPLACE TABLE avg_example (int_col INT, d DECIMAL(10,5), s1 VARCHAR(10), s2 VARCHAR(10));

INSERT INTO avg_example VALUES
  (1, 1.1, '1.1', 'one'),
  (1, 10, '10', 'ten'),
  (2, 2.4, '2.4', 'two'),
  (2, NULL, NULL, 'NULL'),
  (3, NULL, NULL, 'NULL'),
  (NULL, 9.9, '9.9', 'nine');
```

Show the data:

```sqlexample
SELECT *
  FROM avg_example
  ORDER BY int_col, d;
```

```output
+---------+----------+------+------+
| INT_COL |        D | S1   | S2   |
|---------+----------+------+------|
|       1 |  1.10000 | 1.1  | one  |
|       1 | 10.00000 | 10   | ten  |
|       2 |  2.40000 | 2.4  | two  |
|       2 |     NULL | NULL | NULL |
|       3 |     NULL | NULL | NULL |
|    NULL |  9.90000 | 9.9  | nine |
+---------+----------+------+------+
```

Calculate the average of the columns that are numeric or that can be converted to numbers:

```sqlexample
SELECT AVG(int_col), AVG(d)
  FROM avg_example;
```

```output
+--------------+---------------+
| AVG(INT_COL) |        AVG(D) |
|--------------+---------------|
|     1.800000 | 5.85000000000 |
+--------------+---------------+
```

Combine AVG with GROUP BY to calculate the averages of different groups:

```sqlexample
SELECT int_col, AVG(d), AVG(s1)
  FROM avg_example
  GROUP BY int_col
  ORDER BY int_col;
```

```output
+---------+---------------+---------+
| INT_COL |        AVG(D) | AVG(S1) |
|---------+---------------+---------|
|       1 | 5.55000000000 |    5.55 |
|       2 | 2.40000000000 |    2.4  |
|       3 |          NULL |    NULL |
|    NULL | 9.90000000000 |    9.9  |
+---------+---------------+---------+
```

Use as a simple window function:

```sqlexample
SELECT
    int_col,
    AVG(int_col) OVER (PARTITION BY int_col)
  FROM avg_example
  ORDER BY int_col;
```

```output
+---------+-----------------------------------------+
| INT_COL | AVG(INT_COL) OVER(PARTITION BY INT_COL) |
|---------+-----------------------------------------|
|       1 |                                   1.000 |
|       1 |                                   1.000 |
|       2 |                                   2.000 |
|       2 |                                   2.000 |
|       3 |                                   3.000 |
|    NULL |                                    NULL |
+---------+-----------------------------------------+
```

---
title: AVG (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_avg.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# AVG (system data metric function)

Returns the average value for the specified column in a table.

The AVG system data metric function is optimized to calculate the average value for a single column and provides greater performance when
compared to calling the [AVG](avg.md) function.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.AVG(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* FLOAT
* NUMBER

## Returns

The function returns a NUMBER value.

## Example

Measure the average value for the `salary` column in a table:

```sqlexample
SELECT SNOWFLAKE.CORE.AVG(
  SELECT
    salary
  FROM hr.tables.empl_info
);
```

```output
+------------------------------------------------------------+
| SNOWFLAKE.CORE.AVG(SELECT salary FROM hr.tables.empl_info) |
+------------------------------------------------------------+
| 137000                                                     |
+------------------------------------------------------------+
```

---
title: BASE64_DECODE_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/base64_decode_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# BASE64_DECODE_BINARY

Decodes a Base64-encoded string to a binary.

See also:
:   [TRY_BASE64_DECODE_BINARY](try_base64_decode_binary.md)

    [BASE64_DECODE_STRING](base64_decode_string.md) , [BASE64_ENCODE](base64_encode.md)

## Syntax

```sqlsyntax
BASE64_DECODE_BINARY( <input> [ , <alphabet> ] )
```

## Arguments

**Required:**

`input`
:   A Base64-encoded string expression.

**Optional:**

`alphabet`
:   A string consisting of up to three ASCII characters:

    * The first two characters in the string specify the last two characters (indexes 62 and 63) in the alphabet used to encode the input:

      + `A` to `Z` (indexes 0-25)
      + `a` to `z` (indexes 26-51)
      + `0` to `9` (indexes 52-61)
      + `+` and `/` (indexes 62, 63)

      Defaults: `+` and `/`
    * The third character in the string specifies the character used for padding.

      Default: `=`

## Returns

This returns a `BINARY` value. The value can be inserted into a column of
type `BINARY`, for example.

## Usage notes

* The characters in the `alphabet` string are positionally parsed; to specify different characters in the second or third positions in the string, you must explicitly specify all preceding characters
  even if you wish to use the defaults.

  For example:

  > + `+$` specifies the default (`+`) for index 62 and a different character (`$`) for index 63; no character is explicitly specified for padding so the default character (`=`) is used.
  > + `+/%` specifies the defaults (`+` and `/`) for indexes 62 and 63, and specifies a different character (`%`) for padding.
* The `alphabet` string used to decode `input` must match the string originally used to encode `input`.

For more information about base64 format, see [base64](../binary-input-output.md).

## Examples

This example converts data from string to binary, then encodes from binary
to a BASE64 string. After that, it decodes the base64 string back to
binary, and then converts the binary back to a string.

> Create a table and data. This includes converting a string to
> binary and that binary into a BASE64 string:
>
> > ```sqlexample
> > CREATE OR REPLACE TABLE binary_table (v VARCHAR, b BINARY, b64_string VARCHAR);
> > INSERT INTO binary_table (v) VALUES ('HELP');
> > UPDATE binary_table SET b = TO_BINARY(v, 'UTF-8');
> > UPDATE binary_table SET b64_string = BASE64_ENCODE(b);
> > ```
>
> Now display the original string, the binary form of the string
> (which is actually displayed as hexadecimal), and then the
> BASE64 form of the binary:
>
> > ```sqlexample
> > -- Note that the binary data in column b is displayed in hexadecimal
> > --   format to make it human-readable.
> > SELECT v, b, b64_string FROM binary_table;
> > +------+----------+------------+
> > | V    | B        | B64_STRING |
> > |------+----------+------------|
> > | HELP | 48454C50 | SEVMUA==   |
> > +------+----------+------------+
> > ```
>
> Now retrieve the data and decode it back to its original form.
> Note again that the pure binary values in the 2nd and 4th
> columns are displayed as hexadecimal, not as the internal
> binary form:
>
> > ```sqlexample
> > SELECT v, b, b64_string,
> >         BASE64_DECODE_BINARY(b64_string) AS FROM_BASE64_BACK_TO_BINARY,
> >         TO_VARCHAR(BASE64_DECODE_BINARY(b64_string), 'UTF-8') AS BACK_TO_STRING
> >     FROM binary_table;
> > +------+----------+------------+----------------------------+----------------+
> > | V    | B        | B64_STRING | FROM_BASE64_BACK_TO_BINARY | BACK_TO_STRING |
> > |------+----------+------------+----------------------------+----------------|
> > | HELP | 48454C50 | SEVMUA==   | 48454C50                   | HELP           |
> > +------+----------+------------+----------------------------+----------------+
> > ```

The next example is similar to the preceding example, but specifies
the `alphabet` parameter to indicate that ‘$’ should be the encoding
character for index 62 in the BASE64 encoding. In order to have diverse
enough data to need index 62, the data string uses a larger number of
distinct characters.

> Create a table and data. This includes converting a string to
> binary and that binary into a BASE64 string:
>
> > ```sqlexample
> > SET MY_STRING = 'ABCDEFGHIJKLMNOPQRSTUVWXYZ!@#$%^&*()abcdefghijklmnopqrstuvwzyz1234567890[]{};:,./<>?-=~';
> > CREATE OR REPLACE TABLE binary_table (v VARCHAR, b BINARY, b64_string VARCHAR);
> > INSERT INTO binary_table (v) VALUES ($MY_STRING);
> > UPDATE binary_table SET b = TO_BINARY(v, 'UTF-8');
> > UPDATE binary_table SET b64_string = BASE64_ENCODE(b, 0, '$');
> > ```
>
> Now retrieve the data and decode it back to its original form.
> Because this output columns are so wide, this example does five
> separate SELECT statements rather than one.
> Note again that the pure binary values are displayed as
> hexadecimal, not as the internal binary form.
> Note also the dollar sign (‘$’) in the BASE64 string (the
> third output below):
>
> > ```sqlexample
> > SELECT v
> >     FROM binary_table;
> > +-----------------------------------------------------------------------------------------+
> > | V                                                                                       |
> > |-----------------------------------------------------------------------------------------|
> > | ABCDEFGHIJKLMNOPQRSTUVWXYZ!@#$%^&*()abcdefghijklmnopqrstuvwzyz1234567890[]{};:,./<>?-=~ |
> > +-----------------------------------------------------------------------------------------+
> > SELECT b
> >     FROM binary_table;
> > +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > | B                                                                                                                                                                              |
> > |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> > | 4142434445464748494A4B4C4D4E4F505152535455565758595A21402324255E262A28296162636465666768696A6B6C6D6E6F70717273747576777A797A313233343536373839305B5D7B7D3B3A2C2E2F3C3E3F2D3D7E |
> > +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > SELECT b64_string
> >     FROM binary_table;
> > +----------------------------------------------------------------------------------------------------------------------+
> > | B64_STRING                                                                                                           |
> > |----------------------------------------------------------------------------------------------------------------------|
> > | QUJDREVGR0hJSktMTU5PUFFSU1RVVldYWVohQCMkJV4mKigpYWJjZGVmZ2hpamtsbW5vcHFyc3R1dnd6eXoxMjM0NTY3ODkwW117fTs6LC4vPD4/LT1$ |
> > +----------------------------------------------------------------------------------------------------------------------+
> > SELECT BASE64_DECODE_BINARY(b64_string, '$') AS FROM_BASE64_BACK_TO_BINARY
> >     FROM binary_table;
> > +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > | FROM_BASE64_BACK_TO_BINARY                                                                                                                                                     |
> > |--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> > | 4142434445464748494A4B4C4D4E4F505152535455565758595A21402324255E262A28296162636465666768696A6B6C6D6E6F70717273747576777A797A313233343536373839305B5D7B7D3B3A2C2E2F3C3E3F2D3D7E |
> > +--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > SELECT TO_VARCHAR(BASE64_DECODE_BINARY(b64_string, '$'), 'UTF-8') AS BACK_TO_STRING
> >     FROM binary_table;
> > +-----------------------------------------------------------------------------------------+
> > | BACK_TO_STRING                                                                          |
> > |-----------------------------------------------------------------------------------------|
> > | ABCDEFGHIJKLMNOPQRSTUVWXYZ!@#$%^&*()abcdefghijklmnopqrstuvwzyz1234567890[]{};:,./<>?-=~ |
> > +-----------------------------------------------------------------------------------------+
> > ```

---
title: BASE64_DECODE_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/base64_decode_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# BASE64_DECODE_STRING

Decodes a Base64-encoded string to a string.

See also:
:   [TRY_BASE64_DECODE_STRING](try_base64_decode_string.md)

    [BASE64_DECODE_BINARY](base64_decode_binary.md) , [BASE64_ENCODE](base64_encode.md)

## Syntax

```sqlsyntax
BASE64_DECODE_STRING( <input> [ , <alphabet> ] )
```

## Arguments

**Required:**

`input`
:   A Base64-encoded string expression.

**Optional:**

`alphabet`
:   A string consisting of up to three ASCII characters:

    * The first two characters in the string specify the last two characters (indexes 62 and 63) in the alphabet used to encode the input:

      + `A` to `Z` (indexes 0-25)
      + `a` to `z` (indexes 26-51)
      + `0` to `9` (indexes 52-61)
      + `+` and `/` (indexes 62, 63)

      Defaults: `+` and `/`
    * The third character in the string specifies the character used for padding.

      Default: `=`

## Returns

A string.

## Usage notes

* The characters in the `alphabet` string are positionally parsed; to specify different characters in the second or third positions in the string, you must explicitly specify all preceding characters
  even if you wish to use the defaults.

  For example:

  > + `+$` specifies the default (`+`) for index 62 and a different character (`$`) for index 63; no character is explicitly specified for padding so the default character (`=`) is used.
  > + `+/%` specifies the defaults (`+` and `/`) for indexes 62 and 63, and specifies a different character (`%`) for padding.
* The `alphabet` string used to decode `input` must match the string originally used to encode `input`.

For more information about base64 format, see [base64](../binary-input-output.md).

## Examples

This shows a simple example of using `BASE64_DECODE_STRING`:

> ```sqlexample
> SELECT BASE64_DECODE_STRING('U25vd2ZsYWtl');
> +--------------------------------------+
> | BASE64_DECODE_STRING('U25VD2ZSYWTL') |
> |--------------------------------------|
> | Snowflake                            |
> +--------------------------------------+
> ```

This shows another example of using `BASE64_DECODE_STRING`:

> Create a table and data:
>
> > ```sqlexample
> > CREATE OR REPLACE TABLE base64_table (v VARCHAR, base64_string VARCHAR);
> > INSERT INTO base64_table (v) VALUES ('HELLO');
> > UPDATE base64_table SET base64_string = BASE64_ENCODE(v);
> > ```
>
> Now run a query using `BASE64_DECODE_STRING`:
>
> > ```sqlexample
> > SELECT v, base64_string, BASE64_DECODE_STRING(base64_string)
> >     FROM base64_table;
> > +-------+---------------+-------------------------------------+
> > | V     | BASE64_STRING | BASE64_DECODE_STRING(BASE64_STRING) |
> > |-------+---------------+-------------------------------------|
> > | HELLO | SEVMTE8=      | HELLO                               |
> > +-------+---------------+-------------------------------------+
> > ```

---
title: BASE64_ENCODE
source: https://docs.snowflake.com/en/sql-reference/functions/base64_encode.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# BASE64_ENCODE

Encodes the input (string or binary) using Base64 encoding.

See also:
:   [BASE64_DECODE_BINARY](base64_decode_binary.md) , [BASE64_DECODE_STRING](base64_decode_string.md)

## Syntax

```sqlsyntax
BASE64_ENCODE( <input> [ , <max_line_length> ] [ , <alphabet> ] )
```

## Arguments

**Required:**

`input`
:   A string or binary expression to be encoded.

**Optional:**

`max_line_length`
:   A positive integer that specifies the maximum number of characters in a single line of the output.

    Default: `0` (specifies that no line breaks are inserted (i.e. the maximum line length is infinite))

`alphabet`
:   A string consisting of up to three ASCII characters:

    * The first two characters in the string specify the last two characters (indexes 62 and 63) in the alphabet used to encode the input:

      + `A` to `Z` (indexes 0-25)
      + `a` to `z` (indexes 26-51)
      + `0` to `9` (indexes 52-61)
      + `+` and `/` (indexes 62, 63)

      Defaults: `+` and `/`
    * The third character in the string specifies the character used for padding.

      Default: `=`

## Returns

Returns a string (regardless of whether the input was a string or `BINARY`).

## Usage notes

* The characters in the `alphabet` string are positionally parsed; to specify different characters in the second or third positions in the string, you must explicitly specify all preceding characters
  even if you wish to use the defaults.

  For example:

  > + `+$` specifies the default (`+`) for index 62 and a different character (`$`) for index 63; no character is explicitly specified for padding so the default character (`=`) is used.
  > + `+/%` specifies the defaults (`+` and `/`) for indexes 62 and 63, and specifies a different character (`%`) for padding.
* If you specify an `alphabet` string to encode `input`, the same string must be used to decode `input`.

For more information about base64 format, see [base64](../binary-input-output.md).

## Returns

This returns a string that contains only the characters used for the base64
encoding.

## Examples

Encode a string using Base64:

```sqlexample
SELECT BASE64_ENCODE('Snowflake');

----------------------------+
 BASE64_ENCODE('SNOWFLAKE') |
----------------------------+
 U25vd2ZsYWtl               |
----------------------------+
```

Encode a string containing non-ASCII characters using Base64 with
‘$’ in place of ‘+’ for encoding, and output the string
with a maximum line length of 32:

```sqlexample
SELECT BASE64_ENCODE('Snowflake ❄❄❄ Snowman ☃☃☃',32,'$');

---------------------------------------------------+
 BASE64_ENCODE('SNOWFLAKE ❄❄❄ SNOWMAN ☃☃☃',32,'$') |
---------------------------------------------------+
 U25vd2ZsYWtlIOKdhOKdhOKdhCBTbm93                  |
 bWFuIOKYg$KYg$KYgw==                              |
---------------------------------------------------+
```

This shows another example of using `BASE64_ENCODE` (and also
`BASE64_DECODE_STRING`):

> Create a table and data:
>
> > ```sqlexample
> > CREATE OR REPLACE TABLE base64_table (v VARCHAR, base64_string VARCHAR);
> > INSERT INTO base64_table (v) VALUES ('HELLO');
> > UPDATE base64_table SET base64_string = BASE64_ENCODE(v);
> > ```
>
> Now run a query using `BASE64_DECODE_STRING`:
>
> > ```sqlexample
> > SELECT v, base64_string, BASE64_DECODE_STRING(base64_string)
> >     FROM base64_table;
> > +-------+---------------+-------------------------------------+
> > | V     | BASE64_STRING | BASE64_DECODE_STRING(BASE64_STRING) |
> > |-------+---------------+-------------------------------------|
> > | HELLO | SEVMTE8=      | HELLO                               |
> > +-------+---------------+-------------------------------------+
> > ```

---
title: BIND_VALUES
source: https://docs.snowflake.com/en/sql-reference/functions/bind_values.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# BIND_VALUES

This INFORMATION_SCHEMA table function returns information about the values of
[bind variables](../bind-variables.md) used in queries.

## Syntax

```sqlsyntax
BIND_VALUES( <query_id> )
```

## Arguments

`query_id`
:   The string identifier of a query that includes one or more bind variables.

    Snowflake query IDs are unique strings that resemble `01b71944-0001-b181-0000-0129032279f6`.

    If NULL, an empty table is returned.

## Usage notes

* Returns bind variable values for queries that are run by the current user. Also returns bind variable values for queries
  that are run by any user when the role that is currently active in a user’s session, or a higher role in a hierarchy,
  has the MONITOR or OPERATE privilege on the user-managed warehouses where the queries were run. For more information,
  see [Virtual warehouse privileges](../../user-guide/security-access-control-privileges.md).
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the
  function name must be fully qualified. For more information, see [Snowflake Information Schema](../info-schema.md).
* This function can return all queries run in the past seven days.
* This function might not return the bind values or might return an error for the following scenarios:

  + The [ALLOW_BIND_VALUES_ACCESS](../parameters.md) account-level parameter is set to `FALSE`.
  + The bind variables have large values that exceed Snowflake storage thresholds.
  + The queries have a large number of bind variables that exceed Snowflake storage thresholds.
  + The bind variables contain sensitive data. The extraction and processing are done on a best-effort basis, and
    whether data is considered sensitive depends on the context.
  + The function call specifies a query that includes [array binds](../bind-variables.md).
  + The function call specifies a query that doesn’t exist.
  + The function call specifies a query that has expired and is no longer in the query history.

## Output

The BIND_VALUES table function produces one row for each bind variable that is used in the specified query. Each row contains the
following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | The ID of the query. |
| POSITION | NUMBER | For positional bind variables, the position of the bind variable. The field is NULL for named bind variables. |
| NAME | VARCHAR | For named bind variables, the name of the bind variable. The field is NULL for positional bind variables. |
| TYPE | VARCHAR | The Snowflake data type of the bind variable. |
| VALUE | VARCHAR | The value of the bind variable. Bind values that contain more than 100,000 characters are truncated. |

## Examples

See [Retrieve bind variable values](../bind-variables.md).

---
title: BIT_LENGTH
source: https://docs.snowflake.com/en/sql-reference/functions/bit_length.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# BIT_LENGTH

Returns the length of a string or binary value in bits.

Snowflake doesn’t use fractional bytes so length is always calculated as 8 \* [OCTET_LENGTH](octet_length.md).

## Syntax

```sqlsyntax
BIT_LENGTH(<string_or_binary>)
```

## Arguments

`string_or_binary`
:   The string or binary value for which the length is returned.

## Examples

This shows use of the `BIT_LENGTH` function on both string and BINARY values:

> > ```sqlexample
> > CREATE TABLE bl (v VARCHAR, b BINARY);
> > INSERT INTO bl (v, b) VALUES
> >    ('abc', NULL),
> >    ('\u0394', X'A1B2');
> > ```
>
> Query the data:
>
> > ```sqlexample
> > SELECT v, b, BIT_LENGTH(v), BIT_LENGTH(b) FROM bl ORDER BY v;
> > +-----+------+---------------+---------------+
> > | V   | B    | BIT_LENGTH(V) | BIT_LENGTH(B) |
> > |-----+------+---------------+---------------|
> > | abc | NULL |            24 |          NULL |
> > | Δ   | A1B2 |            16 |            16 |
> > +-----+------+---------------+---------------+
> > ```

---
title: BITAND
source: https://docs.snowflake.com/en/sql-reference/functions/bitand.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITAND

Returns the bitwise AND of two numeric or binary expressions.

Aliases:
:   BIT_AND

See also:
:   [BITAND_AGG](bitand_agg.md)

## Syntax

```sqlsyntax
BITAND( <expr1> , <expr2> [ , '<padside>' ] )
```

## Arguments

`expr1`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`expr2`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`'padside'`
:   When two BINARY argument values are not the same length, specifies which side to pad the value
    with the shorter length. Specify one of the following case-insensitive values:

    * LEFT - Pad the value on the left.
    * RIGHT - Pad the value on the right.

    The shorter value is padded with zeros so that it equals the length of the larger value.

    This argument is valid only when BINARY expressions are specified.

    If the length of two BINARY values are different, this argument is required.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expressions contain INTEGER values, returns an INTEGER value that represents the bitwise AND
  of the input expressions.
* When the input expressions contain BINARY values, returns a BINARY value that represents the bitwise AND
  of the input expressions.
* If either input value is NULL, returns NULL.

## Usage notes

* Both input expressions must evaluate to a value of the same data type, either INTEGER
  or BINARY.
* If the data type of either argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then the argument is cast to an INTEGER value.
* If the data type of either argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITAND, BITOR, and BITXOR with INTEGER argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, BIT2)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+--------------------+-------------------+--------------------+
| BIT1 |  BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+-------+--------------------+-------------------+--------------------|
|    0 | 65504 |                  0 |             65504 |              65504 |
|    1 |     1 |                  1 |                 1 |                  0 |
|    2 |     4 |                  0 |                 6 |                  6 |
|    4 |     2 |                  0 |                 6 |                  6 |
|   16 |    24 |                 16 |                24 |                  8 |
| NULL |  NULL |               NULL |              NULL |               NULL |
+------+-------+--------------------+-------------------+--------------------+
```

### Using BITAND, BITOR, and BITXOR with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run a query on BINARY columns of the same length:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, bit2)
  FROM bits;
```

```output
+------+------+--------------------+-------------------+--------------------+
| BIT1 | BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+------+--------------------+-------------------+--------------------|
| 1010 | 0101 | 0000               | 1111              | 1111               |
| 1100 | 0011 | 0000               | 1111              | 1111               |
| BCBC | EEFF | ACBC               | FEFF              | 5243               |
| NULL | NULL | NULL               | NULL              | NULL               |
+------+------+--------------------+-------------------+--------------------+
```

If you try to run a query on BINARY columns of different lengths without specifying the `'padside'`
argument, an error is returned:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3),
       BITOR(bit1, bit3),
       BITXOR(bit1, bit3)
  FROM bits;
```

```output
100544 (22026): The lengths of two variable-sized fields do not match: first length 2, second length 4
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the left:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'LEFT'),
       BITOR(bit1, bit3, 'LEFT'),
       BITXOR(bit1, bit3, 'LEFT')
  FROM bits;
```

```output
+------+----------+----------------------------+---------------------------+----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'LEFT') | BITOR(BIT1, BIT3, 'LEFT') | BITXOR(BIT1, BIT3, 'LEFT') |
|------+----------+----------------------------+---------------------------+----------------------------|
| 1010 | 11001010 | 00001010                   | 11001010                  | 11000000                   |
| 1100 | 01011010 | 00001000                   | 01011110                  | 01010110                   |
| BCBC | ABCDABCD | 0000A88C                   | ABCDBFFD                  | ABCD1771                   |
| NULL | NULL     | NULL                       | NULL                      | NULL                       |
+------+----------+----------------------------+---------------------------+----------------------------+
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the right:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'RIGHT'),
       BITOR(bit1, bit3, 'RIGHT'),
       BITXOR(bit1, bit3, 'RIGHT')
  FROM bits;
```

```output
+------+----------+-----------------------------+----------------------------+-----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'RIGHT') | BITOR(BIT1, BIT3, 'RIGHT') | BITXOR(BIT1, BIT3, 'RIGHT') |
|------+----------+-----------------------------+----------------------------+-----------------------------|
| 1010 | 11001010 | 10000000                    | 11101010                   | 01101010                    |
| 1100 | 01011010 | 01000000                    | 11011010                   | 10011010                    |
| BCBC | ABCDABCD | A88C0000                    | BFFDABCD                   | 1771ABCD                    |
| NULL | NULL     | NULL                        | NULL                       | NULL                        |
+------+----------+-----------------------------+----------------------------+-----------------------------+
```

---
title: BITAND_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/bitand_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Bitwise) , [Window functions](../functions-window.md) (General) , [Bitwise expression functions](../expressions-byte-bit.md)

# BITAND_AGG

Returns the bitwise AND value of all non-NULL numeric records in a group.

For each bit position, if all rows have the bit set to 1, then the bit is set to 1 in the result.
If any rows have that bit set to zero, the result is zero.

If all records inside the group are NULL, or if the group is empty, the function returns NULL.

Aliases:
:   BITANDAGG , BIT_AND_AGG , BIT_ANDAGG

See also:
:   [BITOR_AGG](bitor_agg.md) , [BITXOR_AGG](bitxor_agg.md) ,

    [BITAND](bitand.md)

## Syntax

**Aggregate function**

```sqlsyntax
BITAND_AGG( <expr1> )
```

**Window function**

```sqlsyntax
BITAND_AGG( <expr1> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This expression must evaluate to a [numeric](../data-types-numeric.md) value or a value
    of a data type that can be cast to a numeric value.

`expr2`
:   This expression is used to group the rows in partitions.

## Returns

The data type of the returned value is `NUMBER(38, 0)`.

## Usage notes

* Numeric values are aggregated to the nearest INTEGER data type. Decimal and floating-point values are rounded to the
  nearest integer before aggregation.
* Aggregating a character/text column (data type VARCHAR, CHAR, STRING, etc.) implicitly casts the input values
  to FLOAT, then rounds the values to the nearest integer. If the cast is not possible, the value is treated as NULL.
* The DISTINCT keyword can be specified for these functions, but it does not have any effect.
* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Create the table and load the data:

```sqlexample
CREATE OR REPLACE TABLE bitwise_example
  (k INT, d DECIMAL(10,5), s1 VARCHAR(10), s2 VARCHAR(10));

INSERT INTO bitwise_example VALUES
  (15, 1.1, '12', 'one'),
  (26, 2.9, '10', 'two'),
  (12, 7.1, '7.9', 'two'),
  (14, NULL, NULL, 'null'),
  (8, NULL, NULL, 'null'),
  (NULL, 9.1, '14', 'nine');
```

Display the data:

```sqlexample
SELECT k AS k_col, d AS d_col, s1, s2
  FROM bitwise_example
  ORDER BY k_col;
```

```output
+-------+---------+------+------+
| K_COL |   D_COL | S1   | S2   |
|-------+---------+------+------|
|     8 |    NULL | NULL | null |
|    12 | 7.10000 | 7.9  | two  |
|    14 |    NULL | NULL | null |
|    15 | 1.10000 | 12   | one  |
|    26 | 2.90000 | 10   | two  |
|  NULL | 9.10000 | 14   | nine |
+-------+---------+------+------+
```

Query the data:

```sqlexample
SELECT BITAND_AGG(k),
    BITAND_AGG(d),
    BITAND_AGG(s1)
  FROM bitwise_example;
```

```output
+---------------+---------------+----------------+
| BITAND_AGG(K) | BITAND_AGG(D) | BITAND_AGG(S1) |
|---------------+---------------+----------------|
|             8 |             1 |              8 |
+---------------+---------------+----------------+
```

Query the data and use a GROUP BY clause:

```sqlexample
SELECT s2,
    BITAND_AGG(k),
    BITAND_AGG(d)
  FROM bitwise_example
  GROUP BY s2
  ORDER BY 3;
```

```output
+------+---------------+---------------+
| S2   | BITAND_AGG(K) | BITAND_AGG(D) |
|------+---------------+---------------|
| one  |            15 |             1 |
| two  |             8 |             3 |
| nine |          NULL |             9 |
| null |             8 |          NULL |
+------+---------------+---------------+
```

If you pass this function strings that can’t be converted to NUMBER values, an error is returned:

```sqlexample
SELECT BITAND_AGG(s2)
  FROM bitwise_example;
```

```output
100038 (22018): Numeric value 'one' is not recognized
```

---
title: BITMAP_BIT_POSITION
source: https://docs.snowflake.com/en/sql-reference/functions/bitmap_bit_position.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values)

# BITMAP_BIT_POSITION

Given a numeric value, returns the relative position for the bit that represents that value in a bitmap.

See also:
:   [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md)

## Syntax

```sqlsyntax
BITMAP_BIT_POSITION( <numeric_expr> )
```

## Arguments

`numeric_expr`
:   This expression must evaluate to a data type that can be cast to NUMBER.

## Returns

The function returns the zero-based position of the bit for that value in a bitmap.

## Examples

See [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md).

---
title: BITMAP_BUCKET_NUMBER
source: https://docs.snowflake.com/en/sql-reference/functions/bitmap_bucket_number.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values)

# BITMAP_BUCKET_NUMBER

Given a numeric value, returns an identifier (“bucket number”) for the bitmap containing the bit that represents the value..

See also:
:   [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md)

## Syntax

```sqlsyntax
BITMAP_BUCKET_NUMBER( <numeric_expr> )
```

## Arguments

`numeric_expr`
:   This expression must evaluate to a data type that can be cast to NUMBER.

## Returns

The function returns a number that identifies the bitmap containing the bit that represents the value.

## Examples

See [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md).

---
title: BITMAP_CONSTRUCT_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/bitmap_construct_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values)

# BITMAP_CONSTRUCT_AGG

Returns a bitmap with bits set for each distinct value in a group.

See also:
:   [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md)

## Syntax

```sqlsyntax
BITMAP_CONSTRUCT_AGG( <relative_position> )
```

## Arguments

`relative_position`
:   The relative position of a bit for a value (returned by the [BITMAP_BIT_POSITION](bitmap_bit_position.md) function).

## Returns

The function returns a BINARY value that is a bitmap with bits set for each distinct value in a group.

## Examples

See [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md).

---
title: BITMAP_COUNT
source: https://docs.snowflake.com/en/sql-reference/functions/bitmap_count.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values)

# BITMAP_COUNT

Given a bitmap that represents the set of distinct values for a column, returns the number of distinct value.

See also:
:   [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md)

## Syntax

```sqlsyntax
BITMAP_COUNT( <bitmap> )
```

## Arguments

`bitmap`
:   This expression must evaluate to a bitmap returned by the [BITMAP_CONSTRUCT_AGG](bitmap_construct_agg.md) or [BITMAP_OR_AGG](bitmap_or_agg.md) functions.

## Returns

The function returns the number of distinct values in a column, as represented by the bits set in the input bitmap.

## Examples

See [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md).

---
title: BITMAP_OR_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/bitmap_or_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Counting Distinct Values)

# BITMAP_OR_AGG

Returns a bitmap containing the results of a binary OR operation on the input bitmaps.

See also:
:   [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md)

## Syntax

```sqlsyntax
BITMAP_OR_AGG( <bitmap> )
```

## Arguments

`bitmap`
:   A bitmap returned by the [BITMAP_CONSTRUCT_AGG](bitmap_construct_agg.md) or BITMAP_OR_AGG function.

## Returns

The function returns a bitmap containing the results of a binary OR operation on the input bitmaps.

## Examples

See [Using Bitmaps to Compute Distinct Values for Hierarchical Aggregations](../../user-guide/querying-bitmaps-for-distinct-counts.md).

---
title: BITNOT
source: https://docs.snowflake.com/en/sql-reference/functions/bitnot.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITNOT

Returns the bitwise negation of a numeric or binary expression.

Aliases:
:   BIT_NOT

## Syntax

```sqlsyntax
BITNOT( <expr> )
```

## Arguments

`expr`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expression contains an INTEGER value, returns an INTEGER value that represents the bitwise
  negation of the input expression.
* When the input expression contains a BINARY value, returns a BINARY value that represents the bitwise
  negation of the input expression.
* If the input value is NULL, returns NULL.

## Usage notes

* If the data type of the argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then the argument is cast to an INTEGER value.
* If the data type of the argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITNOT with INTEGER argument values

Create a simple table and data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITNOT(bit1),
       BITNOT(bit2)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+--------------+--------------+
| BIT1 |  BIT2 | BITNOT(BIT1) | BITNOT(BIT2) |
|------+-------+--------------+--------------|
|    0 | 65504 |           -1 |       -65505 |
|    1 |     1 |           -2 |           -2 |
|    2 |     4 |           -3 |           -5 |
|    4 |     2 |           -5 |           -3 |
|   16 |    24 |          -17 |          -25 |
| NULL |  NULL |         NULL |         NULL |
+------+-------+--------------+--------------+
```

### Using BITNOT with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       bit3,
       BITNOT(bit1),
       BITNOT(bit2),
       BITNOT(bit3)
  FROM bits;
```

```output
+------+------+----------+--------------+--------------+--------------+
| BIT1 | BIT2 | BIT3     | BITNOT(BIT1) | BITNOT(BIT2) | BITNOT(BIT3) |
|------+------+----------+--------------+--------------+--------------|
| 1010 | 0101 | 11001010 | EFEF         | FEFE         | EEFFEFEF     |
| 1100 | 0011 | 01011010 | EEFF         | FFEE         | FEFEEFEF     |
| BCBC | EEFF | ABCDABCD | 4343         | 1100         | 54325432     |
| NULL | NULL | NULL     | NULL         | NULL         | NULL         |
+------+------+----------+--------------+--------------+--------------+
```

---
title: BITOR
source: https://docs.snowflake.com/en/sql-reference/functions/bitor.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITOR

Returns the bitwise OR of two numeric or binary expressions.

Aliases:
:   BIT_OR

See also:
:   [BITOR_AGG](bitor_agg.md)

## Syntax

```sqlsyntax
BITOR( <expr1> , <expr2> [ , '<padside>' ] )
```

## Arguments

`expr1`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`expr2`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`'padside'`
:   When two BINARY argument values are not the same length, specifies which side to pad the value
    with the shorter length. Specify one of the following case-insensitive values:

    * LEFT - Pad the value on the left.
    * RIGHT - Pad the value on the right.

    The shorter value is padded with zeros so that it equals the length of the larger value.

    This argument is valid only when BINARY expressions are specified.

    If the length of two BINARY values are different, this argument is required.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expressions contain INTEGER values, returns an INTEGER value that represents the bitwise OR
  of the input expressions.
* When the input expressions contain BINARY values, returns a BINARY value that represents the bitwise OR
  of the input expressions.
* If either input value is NULL, returns NULL.

## Usage notes

* Both input expressions must evaluate to a value of the same data type, either INTEGER
  or BINARY.
* If the data type of either argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then the argument is cast to an INTEGER value.
* If the data type of either argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITAND, BITOR, and BITXOR with INTEGER argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, BIT2)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+--------------------+-------------------+--------------------+
| BIT1 |  BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+-------+--------------------+-------------------+--------------------|
|    0 | 65504 |                  0 |             65504 |              65504 |
|    1 |     1 |                  1 |                 1 |                  0 |
|    2 |     4 |                  0 |                 6 |                  6 |
|    4 |     2 |                  0 |                 6 |                  6 |
|   16 |    24 |                 16 |                24 |                  8 |
| NULL |  NULL |               NULL |              NULL |               NULL |
+------+-------+--------------------+-------------------+--------------------+
```

### Using BITAND, BITOR, and BITXOR with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run a query on BINARY columns of the same length:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, bit2)
  FROM bits;
```

```output
+------+------+--------------------+-------------------+--------------------+
| BIT1 | BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+------+--------------------+-------------------+--------------------|
| 1010 | 0101 | 0000               | 1111              | 1111               |
| 1100 | 0011 | 0000               | 1111              | 1111               |
| BCBC | EEFF | ACBC               | FEFF              | 5243               |
| NULL | NULL | NULL               | NULL              | NULL               |
+------+------+--------------------+-------------------+--------------------+
```

If you try to run a query on BINARY columns of different lengths without specifying the `'padside'`
argument, an error is returned:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3),
       BITOR(bit1, bit3),
       BITXOR(bit1, bit3)
  FROM bits;
```

```output
100544 (22026): The lengths of two variable-sized fields do not match: first length 2, second length 4
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the left:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'LEFT'),
       BITOR(bit1, bit3, 'LEFT'),
       BITXOR(bit1, bit3, 'LEFT')
  FROM bits;
```

```output
+------+----------+----------------------------+---------------------------+----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'LEFT') | BITOR(BIT1, BIT3, 'LEFT') | BITXOR(BIT1, BIT3, 'LEFT') |
|------+----------+----------------------------+---------------------------+----------------------------|
| 1010 | 11001010 | 00001010                   | 11001010                  | 11000000                   |
| 1100 | 01011010 | 00001000                   | 01011110                  | 01010110                   |
| BCBC | ABCDABCD | 0000A88C                   | ABCDBFFD                  | ABCD1771                   |
| NULL | NULL     | NULL                       | NULL                      | NULL                       |
+------+----------+----------------------------+---------------------------+----------------------------+
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the right:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'RIGHT'),
       BITOR(bit1, bit3, 'RIGHT'),
       BITXOR(bit1, bit3, 'RIGHT')
  FROM bits;
```

```output
+------+----------+-----------------------------+----------------------------+-----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'RIGHT') | BITOR(BIT1, BIT3, 'RIGHT') | BITXOR(BIT1, BIT3, 'RIGHT') |
|------+----------+-----------------------------+----------------------------+-----------------------------|
| 1010 | 11001010 | 10000000                    | 11101010                   | 01101010                    |
| 1100 | 01011010 | 01000000                    | 11011010                   | 10011010                    |
| BCBC | ABCDABCD | A88C0000                    | BFFDABCD                   | 1771ABCD                    |
| NULL | NULL     | NULL                        | NULL                       | NULL                        |
+------+----------+-----------------------------+----------------------------+-----------------------------+
```

---
title: BITOR_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/bitor_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Bitwise) , [Window functions](../functions-window.md) (General) , [Bitwise expression functions](../expressions-byte-bit.md)

# BITOR_AGG

Returns the bitwise OR value of all non-NULL numeric records in a group.

For each bit position, if at least one row has the bit set to 1, then the bit is set to 1 in the result.
If all rows have that bit set to zero, the result is zero.

If all records inside the group are NULL, or if the group is empty, the function returns NULL.

Aliases:
:   BITORAGG, BIT_OR_AGG, BIT_ORAGG

See also:
:   [BITAND_AGG](bitand_agg.md) , [BITXOR_AGG](bitxor_agg.md)

    [BITOR](bitor.md)

## Syntax

**Aggregate function**

```sqlsyntax
BITOR_AGG( <expr1> )
```

**Window function**

```sqlsyntax
BITOR_AGG( <expr1> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This expression must evaluate to a [numeric](../data-types-numeric.md) value or a value
    of a data type that can be cast to a numeric value.

`expr2`
:   This expression is used to group the rows in partitions.

## Returns

The data type of the returned value is `NUMBER(38, 0)`.

## Usage notes

* Numeric values are aggregated to the nearest INTEGER data type. Decimal and floating-point values are rounded to the
  nearest integer before aggregation.
* Aggregating a character/text column (data type VARCHAR, CHAR, STRING, etc.) implicitly casts the input values
  to FLOAT, then rounds the values to the nearest integer. If the cast is not possible, the value is treated as NULL.
* The DISTINCT keyword can be specified for these functions, but it does not have any effect.
* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Create the table and load the data:

```sqlexample
CREATE OR REPLACE TABLE bitwise_example
  (k INT, d DECIMAL(10,5), s1 VARCHAR(10), s2 VARCHAR(10));

INSERT INTO bitwise_example VALUES
  (15, 1.1, '12', 'one'),
  (26, 2.9, '10', 'two'),
  (12, 7.1, '7.9', 'two'),
  (14, NULL, NULL, 'null'),
  (8, NULL, NULL, 'null'),
  (NULL, 9.1, '14', 'nine');
```

Display the data:

```sqlexample
SELECT k AS k_col, d AS d_col, s1, s2
  FROM bitwise_example
  ORDER BY k_col;
```

```output
+-------+---------+------+------+
| K_COL |   D_COL | S1   | S2   |
|-------+---------+------+------|
|     8 |    NULL | NULL | null |
|    12 | 7.10000 | 7.9  | two  |
|    14 |    NULL | NULL | null |
|    15 | 1.10000 | 12   | one  |
|    26 | 2.90000 | 10   | two  |
|  NULL | 9.10000 | 14   | nine |
+-------+---------+------+------+
```

Query the data:

```sqlexample
SELECT BITOR_AGG(k),
    BITOR_AGG(d),
    BITOR_AGG(s1)
  FROM bitwise_example;
```

```output
+--------------+--------------+---------------+
| BITOR_AGG(K) | BITOR_AGG(D) | BITOR_AGG(S1) |
|--------------+--------------+---------------|
|           31 |           15 |            14 |
+--------------+--------------+---------------+
```

Query the data and use a GROUP BY clause:

```sqlexample
SELECT s2,
    BITOR_AGG(k),
    BITOR_AGG(d)
  FROM bitwise_example
  GROUP BY s2
  ORDER BY 3;
```

```output
+------+--------------+--------------+
| S2   | BITOR_AGG(K) | BITOR_AGG(D) |
|------+--------------+--------------|
| one  |           15 |            1 |
| two  |           30 |            7 |
| nine |         NULL |            9 |
| null |           14 |         NULL |
+------+--------------+--------------+
```

If you pass this function strings that can’t be converted to NUMBER values, an error is returned:

```sqlexample
SELECT BITOR_AGG(s2)
  FROM bitwise_example;
```

```output
100038 (22018): Numeric value 'one' is not recognized
```

---
title: BITSHIFTLEFT
source: https://docs.snowflake.com/en/sql-reference/functions/bitshiftleft.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITSHIFTLEFT

Shifts the bits for a numeric or binary expression `n` positions to the left.

Aliases:
:   BIT_SHIFTLEFT

See also:
:   [BITSHIFTRIGHT](bitshiftright.md)

## Syntax

```sqlsyntax
BITSHIFTLEFT( <expr1> , <n> )
```

## Arguments

`expr1`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`n`
:   The number of bits to shift by.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expression contains an INTEGER value, returns a signed 128-bit (16-byte) integer,
  regardless of the size or data type of the input data value.
* When the input expression contains a BINARY value, returns a BINARY value.
* If any argument is NULL, returns NULL.

## Usage notes

* If the data type of any argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then that argument is cast to an INTEGER value.
* If the data type of any argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* If a signed INTEGER value is returned, and the value of the high bit changes (from 0 to 1, or from 1 to 0),
  the sign of the result is reversed. For example, `BITSHIFTLEFT(1, 127)` returns a negative number.
* If a signed INTEGER value is returned, bits that are shifted past the end of the 128-bit output value
  are dropped.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITSHIFTLEFT and BITSHIFTRIGHT with INTEGER argument values

Create a simple table and data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITSHIFTLEFT(bit1, 1),
       BITSHIFTRIGHT(bit2, 1)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+-----------------------+------------------------+
| BIT1 |  BIT2 | BITSHIFTLEFT(BIT1, 1) | BITSHIFTRIGHT(BIT2, 1) |
|------+-------+-----------------------+------------------------|
|    0 | 65504 |                     0 |                  32752 |
|    1 |     1 |                     2 |                      0 |
|    2 |     4 |                     4 |                      2 |
|    4 |     2 |                     8 |                      1 |
|   16 |    24 |                    32 |                     12 |
| NULL |  NULL |                  NULL |                   NULL |
+------+-------+-----------------------+------------------------+
```

### Using BITSHIFTLEFT with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run the query:

```sqlexample
SELECT bit1,
       bit3,
       BITSHIFTLEFT(bit1, 1),
       BITSHIFTLEFT(bit3, 1),
       BITSHIFTLEFT(bit1, 8),
       BITSHIFTLEFT(bit3, 16)
  FROM bits;
```

```output
+------+----------+-----------------------+-----------------------+-----------------------+------------------------+
| BIT1 | BIT3     | BITSHIFTLEFT(BIT1, 1) | BITSHIFTLEFT(BIT3, 1) | BITSHIFTLEFT(BIT1, 8) | BITSHIFTLEFT(BIT3, 16) |
|------+----------+-----------------------+-----------------------+-----------------------+------------------------|
| 1010 | 11001010 | 2020                  | 22002020              | 1000                  | 10100000               |
| 1100 | 01011010 | 2200                  | 02022020              | 0000                  | 10100000               |
| BCBC | ABCDABCD | 7978                  | 579B579A              | BC00                  | ABCD0000               |
| NULL | NULL     | NULL                  | NULL                  | NULL                  | NULL                   |
+------+----------+-----------------------+-----------------------+-----------------------+------------------------+
```

---
title: BITSHIFTRIGHT
source: https://docs.snowflake.com/en/sql-reference/functions/bitshiftright.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITSHIFTRIGHT

Shifts the bits for a numeric or binary expression `n` positions to the right.

Aliases:
:   BIT_SHIFTRIGHT

See also:
:   [BITSHIFTLEFT](bitshiftleft.md)

## Syntax

```sqlsyntax
BITSHIFTRIGHT( <expr1> , <n> )
```

## Arguments

`expr1`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`n`
:   The number of bits to shift by.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expression contains an INTEGER value, returns a signed 128-bit (16-byte) integer,
  regardless of the size or data type of the input data value.
* When the input expression contains a BINARY value, returns a BINARY value.
* If any argument is NULL, returns NULL.

## Usage notes

* If the data type of the argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then the argument is cast to an INTEGER value.
* If the data type of the argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITSHIFTLEFT and BITSHIFTRIGHT with INTEGER argument values

Create a simple table and data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITSHIFTLEFT(bit1, 1),
       BITSHIFTRIGHT(bit2, 1)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+-----------------------+------------------------+
| BIT1 |  BIT2 | BITSHIFTLEFT(BIT1, 1) | BITSHIFTRIGHT(BIT2, 1) |
|------+-------+-----------------------+------------------------|
|    0 | 65504 |                     0 |                  32752 |
|    1 |     1 |                     2 |                      0 |
|    2 |     4 |                     4 |                      2 |
|    4 |     2 |                     8 |                      1 |
|   16 |    24 |                    32 |                     12 |
| NULL |  NULL |                  NULL |                   NULL |
+------+-------+-----------------------+------------------------+
```

### Using BITSHIFTRIGHT with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run the query:

```sqlexample
SELECT bit1,
       bit3,
       BITSHIFTRIGHT(bit1, 1),
       BITSHIFTRIGHT(bit3, 1),
       BITSHIFTRIGHT(bit1, 8),
       BITSHIFTRIGHT(bit3, 16)
  FROM bits;
```

```output
+------+----------+------------------------+------------------------+------------------------+-------------------------+
| BIT1 | BIT3     | BITSHIFTRIGHT(BIT1, 1) | BITSHIFTRIGHT(BIT3, 1) | BITSHIFTRIGHT(BIT1, 8) | BITSHIFTRIGHT(BIT3, 16) |
|------+----------+------------------------+------------------------+------------------------+-------------------------|
| 1010 | 11001010 | 0808                   | 08800808               | 0010                   | 00001100                |
| 1100 | 01011010 | 0880                   | 00808808               | 0011                   | 00000101                |
| BCBC | ABCDABCD | 5E5E                   | 55E6D5E6               | 00BC                   | 0000ABCD                |
| NULL | NULL     | NULL                   | NULL                   | NULL                   | NULL                    |
+------+----------+------------------------+------------------------+------------------------+-------------------------+
```

---
title: BITXOR
source: https://docs.snowflake.com/en/sql-reference/functions/bitxor.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# BITXOR

Returns the bitwise XOR of two numeric or binary expressions.

Aliases:
:   BIT_XOR

See also:
:   [BITXOR_AGG](bitxor_agg.md)

## Syntax

```sqlsyntax
BITXOR( <expr1> , <expr2> [ , '<padside>' ] )
```

## Arguments

`expr1`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`expr2`
:   This expression must evaluate to an INTEGER value, a BINARY value, or a value of a data type
    that can be cast to an INTEGER value.

`'padside'`
:   When two BINARY argument values are not the same length, specifies which side to pad the value
    with the shorter length. Specify one of the following case-insensitive values:

    * LEFT - Pad the value on the left.
    * RIGHT - Pad the value on the right.

    The shorter value is padded with zeros so that it equals the length of the larger value.

    This argument is valid only when BINARY expressions are specified.

    If the length of two BINARY values are different, this argument is required.

## Returns

Returns an INTEGER value, a BINARY value, or NULL:

* When the input expressions contain INTEGER values, returns an INTEGER value that represents
  the bitwise XOR of the input expressions.
* When the input expressions contain BINARY values, returns a BINARY value that
  represents the bitwise XOR of the input expressions.
* If either input value is NULL, returns NULL.

## Usage notes

* Both input expressions must evaluate to a value of the same data type, either INTEGER
  or BINARY.
* If the data type of either argument is [numeric](../data-types-numeric.md)
  but not INTEGER (e.g. FLOAT, DECIMAL, etc.), then the argument is cast to an INTEGER value.
* If the data type of either argument is a string (e.g. VARCHAR), then the
  argument is cast to an INTEGER value if possible. For example, the string `12.3`
  is cast to `12`. If the value cannot be cast to an INTEGER value, then the
  value is treated as NULL.
* The function does not implicitly cast arguments to BINARY values.

## Examples

The following sections contain examples for INTEGER argument values and BINARY argument values.

### Using BITAND, BITOR, and BITXOR with INTEGER argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 INTEGER, bit2 INTEGER);
```

```sqlexample
INSERT INTO bits (ID, bit1, bit2) VALUES
  (   11,    1,     1),    -- Bits are all the same.
  (   24,    2,     4),    -- Bits are all different.
  (   42,    4,     2),    -- Bits are all different.
  ( 1624,   16,    24),    -- Bits overlap.
  (65504,    0, 65504),    -- Lots of bits (all but the low 6 bits).
  (    0, NULL,  NULL)     -- No bits.
  ;
```

Run the query:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, BIT2)
  FROM bits
  ORDER BY bit1;
```

```output
+------+-------+--------------------+-------------------+--------------------+
| BIT1 |  BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+-------+--------------------+-------------------+--------------------|
|    0 | 65504 |                  0 |             65504 |              65504 |
|    1 |     1 |                  1 |                 1 |                  0 |
|    2 |     4 |                  0 |                 6 |                  6 |
|    4 |     2 |                  0 |                 6 |                  6 |
|   16 |    24 |                 16 |                24 |                  8 |
| NULL |  NULL |               NULL |              NULL |               NULL |
+------+-------+--------------------+-------------------+--------------------+
```

### Using BITAND, BITOR, and BITXOR with BINARY argument values

Create a simple table and insert the data:

```sqlexample
CREATE OR REPLACE TABLE bits (ID INTEGER, bit1 BINARY(2), bit2 BINARY(2), bit3 BINARY(4));

INSERT INTO bits VALUES
  (1, x'1010', x'0101', x'11001010'),
  (2, x'1100', x'0011', x'01011010'),
  (3, x'BCBC', x'EEFF', x'ABCDABCD'),
  (4, NULL, NULL, NULL);
```

> **Note:**
>
> The BINARY values are inserted using the `x'value'` notation, where `value` contains
> hexadecimal digits. For more information, see [Binary input and output](../binary-input-output.md).

Run a query on BINARY columns of the same length:

```sqlexample
SELECT bit1,
       bit2,
       BITAND(bit1, bit2),
       BITOR(bit1, bit2),
       BITXOR(bit1, bit2)
  FROM bits;
```

```output
+------+------+--------------------+-------------------+--------------------+
| BIT1 | BIT2 | BITAND(BIT1, BIT2) | BITOR(BIT1, BIT2) | BITXOR(BIT1, BIT2) |
|------+------+--------------------+-------------------+--------------------|
| 1010 | 0101 | 0000               | 1111              | 1111               |
| 1100 | 0011 | 0000               | 1111              | 1111               |
| BCBC | EEFF | ACBC               | FEFF              | 5243               |
| NULL | NULL | NULL               | NULL              | NULL               |
+------+------+--------------------+-------------------+--------------------+
```

If you try to run a query on BINARY columns of different lengths without specifying the `'padside'`
argument, an error is returned:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3),
       BITOR(bit1, bit3),
       BITXOR(bit1, bit3)
  FROM bits;
```

```output
100544 (22026): The lengths of two variable-sized fields do not match: first length 2, second length 4
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the left:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'LEFT'),
       BITOR(bit1, bit3, 'LEFT'),
       BITXOR(bit1, bit3, 'LEFT')
  FROM bits;
```

```output
+------+----------+----------------------------+---------------------------+----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'LEFT') | BITOR(BIT1, BIT3, 'LEFT') | BITXOR(BIT1, BIT3, 'LEFT') |
|------+----------+----------------------------+---------------------------+----------------------------|
| 1010 | 11001010 | 00001010                   | 11001010                  | 11000000                   |
| 1100 | 01011010 | 00001000                   | 01011110                  | 01010110                   |
| BCBC | ABCDABCD | 0000A88C                   | ABCDBFFD                  | ABCD1771                   |
| NULL | NULL     | NULL                       | NULL                      | NULL                       |
+------+----------+----------------------------+---------------------------+----------------------------+
```

Run a query on BINARY columns of different lengths, and pad the smaller argument value on the right:

```sqlexample
SELECT bit1,
       bit3,
       BITAND(bit1, bit3, 'RIGHT'),
       BITOR(bit1, bit3, 'RIGHT'),
       BITXOR(bit1, bit3, 'RIGHT')
  FROM bits;
```

```output
+------+----------+-----------------------------+----------------------------+-----------------------------+
| BIT1 | BIT3     | BITAND(BIT1, BIT3, 'RIGHT') | BITOR(BIT1, BIT3, 'RIGHT') | BITXOR(BIT1, BIT3, 'RIGHT') |
|------+----------+-----------------------------+----------------------------+-----------------------------|
| 1010 | 11001010 | 10000000                    | 11101010                   | 01101010                    |
| 1100 | 01011010 | 01000000                    | 11011010                   | 10011010                    |
| BCBC | ABCDABCD | A88C0000                    | BFFDABCD                   | 1771ABCD                    |
| NULL | NULL     | NULL                        | NULL                       | NULL                        |
+------+----------+-----------------------------+----------------------------+-----------------------------+
```

---
title: BITXOR_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/bitxor_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Bitwise) , [Window functions](../functions-window.md) (General) , [Bitwise expression functions](../expressions-byte-bit.md)

# BITXOR_AGG

Returns the bitwise XOR value of all non-NULL numeric records in a group.

In each bit position, if an even number of rows have that bit set to 1, then the function returns 0 for that bit, and
if an odd number of rows have that bit set to 1, then the function returns 1 for that bit.

If all records inside the group are NULL, or if the group is empty, the function returns NULL.

Aliases:
:   BITXORAGG , BIT_XOR_AGG, BIT_XORAGG

See also:
:   [BITAND_AGG](bitand_agg.md) , [BITOR_AGG](bitor_agg.md)

    [BITXOR](bitxor.md)

## Syntax

**Aggregate function**

```sqlsyntax
BITXOR_AGG( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
BITXOR_AGG( [ DISTINCT ] <expr1> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This expression must evaluate to a [numeric](../data-types-numeric.md) value or a value
    of a data type that can be cast to a numeric value.

`expr2`
:   This expression is used to group the rows in partitions.

## Returns

The data type of the returned value is `NUMBER(38, 0)`.

## Usage notes

* Numeric values are aggregated to the nearest INTEGER data type. Decimal and floating-point values are rounded to the
  nearest integer before aggregation.
* Aggregating a character/text column (data type VARCHAR, CHAR, STRING, etc.) implicitly casts the input values
  to FLOAT, then rounds the values to the nearest integer. If the cast is not possible, the value is treated as NULL.
* The DISTINCT keyword can be specified for these functions, but it does not have any effect.
* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Create the table and load the data:

```sqlexample
CREATE OR REPLACE TABLE bitwise_example
  (k INT, d DECIMAL(10,5), s1 VARCHAR(10), s2 VARCHAR(10));

INSERT INTO bitwise_example VALUES
  (15, 1.1, '12', 'one'),
  (26, 2.9, '10', 'two'),
  (12, 7.1, '7.9', 'two'),
  (14, NULL, NULL, 'null'),
  (8, NULL, NULL, 'null'),
  (NULL, 9.1, '14', 'nine');
```

Display the data:

```sqlexample
SELECT k AS k_col, d AS d_col, s1, s2
  FROM bitwise_example
  ORDER BY k_col;
```

```output
+-------+---------+------+------+
| K_COL |   D_COL | S1   | S2   |
|-------+---------+------+------|
|     8 |    NULL | NULL | null |
|    12 | 7.10000 | 7.9  | two  |
|    14 |    NULL | NULL | null |
|    15 | 1.10000 | 12   | one  |
|    26 | 2.90000 | 10   | two  |
|  NULL | 9.10000 | 14   | nine |
+-------+---------+------+------+
```

Query the data:

```sqlexample
SELECT BITXOR_AGG(k),
    BITXOR_AGG(d),
    BITXOR_AGG(s1)
  FROM bitwise_example;
```

```output
+---------------+---------------+----------------+
| BITXOR_AGG(K) | BITXOR_AGG(D) | BITXOR_AGG(S1) |
|---------------+---------------+----------------|
|            31 |            12 |              0 |
+---------------+---------------+----------------+
```

Query the data and use a GROUP BY clause:

```sqlexample
SELECT s2,
    BITXOR_AGG(k),
    BITXOR_AGG(d)
  FROM bitwise_example
  GROUP BY s2
  ORDER BY 3;
```

```output
+------+---------------+---------------+
| S2   | BITXOR_AGG(K) | BITXOR_AGG(D) |
|------+---------------+---------------|
| one  |            15 |             1 |
| two  |            22 |             4 |
| nine |          NULL |             9 |
| null |             6 |          NULL |
+------+---------------+---------------+
```

If you pass this function strings that can’t be converted to NUMBER values, an error is returned:

```sqlexample
SELECT BITXOR_AGG(s2)
  FROM bitwise_example;
```

```output
100038 (22018): Numeric value 'one' is not recognized
```

---
title: BLANK_COUNT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_blank_count.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# BLANK_COUNT (system data metric function)

Returns the count of column values that are blank for the specified column in a table.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.BLANK_COUNT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have the VARCHAR data type.

## Returns

The function returns a NUMBER value.

## Example

Measure the percentage of blank fields for the SSN column (US social security number):

> ```sqlexample
> SELECT SNOWFLAKE.CORE.BLANK_COUNT(
>   SELECT
>     ssn
>   FROM hr.tables.empl_info
> );
> ```
>
> ```output
> +-----------------------------------------------------------------+
> | SNOWFLAKE.CORE.BLANK_COUNT(SELECT ssn FROM hr.tables.empl_info) |
> +-----------------------------------------------------------------+
> | 1                                                               |
> +-----------------------------------------------------------------+
> ```

---
title: BLANK_PERCENT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_blank_percent.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# BLANK_PERCENT (system data metric function)

Returns the percentage of column values that are blank for the specified column in a table.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.BLANK_PERCENT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have the VARCHAR data type.

## Returns

The function returns a NUMBER value.

## Example

Measure the percentage of blank fields for the SSN column (US social security number):

> ```sqlexample
> SELECT SNOWFLAKE.CORE.BLANK_PERCENT(
>   SELECT
>     ssn
>   FROM hr.tables.empl_info
> );
> ```
>
> ```output
> +-------------------------------------------------------------------+
> | SNOWFLAKE.CORE.BLANK_PERCENT(SELECT ssn FROM hr.tables.empl_info) |
> +-------------------------------------------------------------------+
> | 1                                                                 |
> +-------------------------------------------------------------------+
> ```

---
title: BOOLAND
source: https://docs.snowflake.com/en/sql-reference/functions/booland.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# BOOLAND

Computes the Boolean AND of two numeric expressions. In accordance with Boolean semantics:

* Non-zero values, including negative numbers, are regarded as true.
* Zero values are regarded as false.

As a result, the function returns:

* `True` if both expressions are non-zero.
* `False` if both expressions are zero or one expression is zero and the other expression is non-zero or NULL.
* `NULL` if both expressions are NULL or one expression is NULL and the other expression is non-zero.

See also:
:   [BOOLNOT](boolnot.md) , [BOOLOR](boolor.md) , [BOOLXOR](boolxor.md)

## Syntax

```sqlsyntax
BOOLAND( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A numeric expression.

`expr2`
:   A numeric expression.

## Returns

This function returns a value of type BOOLEAN or NULL.

## Usage notes

This function rounds [floating-point numbers](../data-types-numeric.md).
Therefore, it might return unexpected results when it rounds non-zero floating-point numbers
to zero.

For examples of this behavior and workarounds, see Compute Boolean AND results for floating-point numbers.

## Examples

The following examples use the BOOLAND function.

### Compute Boolean AND results for integers and NULL values

The following query computes Boolean AND results for integers and NULL values:

```sqlexample
SELECT BOOLAND(1, -2),
       BOOLAND(0, 0),
       BOOLAND(0, NULL),
       BOOLAND(NULL, 3),
       BOOLAND(NULL, NULL);
```

```output
+----------------+---------------+------------------+------------------+---------------------+
| BOOLAND(1, -2) | BOOLAND(0, 0) | BOOLAND(0, NULL) | BOOLAND(NULL, 3) | BOOLAND(NULL, NULL) |
|----------------+---------------+------------------+------------------+---------------------|
| True           | False         | False            | NULL             | NULL                |
+----------------+---------------+------------------+------------------+---------------------+
```

### Compute Boolean AND results for floating-point numbers

The following examples show how the function might return unexpected results for floating-point
numbers that round to zero.

For the following queries, a result of `True` might be expected for the following function calls,
but they return `False` because the function rounds non-zero floating-point values to zero:

```sqlexample
SELECT BOOLAND(2, 0.3);
```

```output
+-----------------+
| BOOLAND(2, 0.3) |
|-----------------|
| False           |
+-----------------+
```

```sqlexample
SELECT BOOLAND(-0.4, 5);
```

```output
+------------------+
| BOOLAND(-0.4, 5) |
|------------------|
| False            |
+------------------+
```

If required, you can work around this rounding behavior for floating-point values by using
the [AND logical operator](../operators-logical.md) instead of the function.
For example, the following query returns `True`:

```sqlexample
SELECT 2 AND 0.3;
```

```output
+-----------+
| 2 AND 0.3 |
|-----------|
| True      |
+-----------+
```

---
title: BOOLAND_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/booland_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Boolean) , [Window functions](../functions-window.md) , [Conditional expression functions](../expressions-conditional.md)

# BOOLAND_AGG

Returns TRUE if all non-NULL Boolean records in a group evaluate to TRUE.

If all records in the group are NULL, or if the group is empty, the function returns NULL.

See also:
:   [BOOLAND](booland.md) , [BOOLOR_AGG](boolor_agg.md) , [BOOLXOR_AGG](boolxor_agg.md)

## Syntax

**Aggregate function**

```sqlsyntax
BOOLAND_AGG( <expr> )
```

**Window function**

```sqlsyntax
BOOLAND_AGG( <expr> )  OVER ( [ PARTITION BY <partition_expr> ] )
```

## Arguments

`expr`
:   The input expression must be an expression that can be evaluated to a boolean or converted to a boolean.

`partition_expr`
:   This column or expression specifies how to separate the input into partitions (sub-windows).

## Returns

This function returns a value of type BOOLEAN.

## Usage notes

* [Numeric](../data-types-numeric.md) values are converted to `TRUE` if they are non-zero.
* [String and binary](../data-types-text.md) values aren’t supported because they can’t be converted to Boolean values.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

**Aggregate function**

The following example shows that booland_agg returns true when all of the input values are true.

Create and load the table:

```sqlexample
CREATE OR REPLACE TABLE test_boolean_agg (
  id INTEGER,
  c1 BOOLEAN,
  c2 BOOLEAN,
  c3 BOOLEAN,
  c4 BOOLEAN
);

INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (1, TRUE, TRUE,  TRUE,  FALSE),
  (2, TRUE, FALSE, FALSE, FALSE),
  (3, TRUE, TRUE,  FALSE, FALSE),
  (4, TRUE, FALSE, FALSE, FALSE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg;
```

```output
+----+------+-------+-------+-------+
| ID | C1   | C2    | C3    | C4    |
|----+------+-------+-------+-------|
|  1 | True | True  | True  | False |
|  2 | True | False | False | False |
|  3 | True | True  | False | False |
|  4 | True | False | False | False |
+----+------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT BOOLAND_AGG(c1), BOOLAND_AGG(c2), BOOLAND_AGG(c3), BOOLAND_AGG(c4)
  FROM test_boolean_agg;
```

```output
+-----------------+-----------------+-----------------+-----------------+
| BOOLAND_AGG(C1) | BOOLAND_AGG(C2) | BOOLAND_AGG(C3) | BOOLAND_AGG(C4) |
|-----------------+-----------------+-----------------+-----------------|
| True            | False           | False           | False           |
+-----------------+-----------------+-----------------+-----------------+
```

**Window function**

This example is similar to the previous example, but it shows usage as a window function, with the input rows
split into two partitions (one for IDs greater than 0 and one for IDs less than or equal to 0). Additional data was
added to the table.

Add rows to the table:

```sqlexample
INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (-4, FALSE, FALSE, FALSE, TRUE),
  (-3, FALSE, TRUE,  TRUE,  TRUE),
  (-2, FALSE, FALSE, TRUE,  TRUE),
  (-1, FALSE, TRUE,  TRUE,  TRUE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+-------+-------+-------+-------+
| ID | C1    | C2    | C3    | C4    |
|----+-------+-------+-------+-------|
| -4 | False | False | False | True  |
| -3 | False | True  | True  | True  |
| -2 | False | False | True  | True  |
| -1 | False | True  | True  | True  |
|  1 | True  | True  | True  | False |
|  2 | True  | False | False | False |
|  3 | True  | True  | False | False |
|  4 | True  | False | False | False |
+----+-------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT
    id,
    BOOLAND_AGG(c1) OVER (PARTITION BY (id > 0)),
    BOOLAND_AGG(c2) OVER (PARTITION BY (id > 0)),
    BOOLAND_AGG(c3) OVER (PARTITION BY (id > 0)),
    BOOLAND_AGG(c4) OVER (PARTITION BY (id > 0))
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------+
| ID | BOOLAND_AGG(C1) OVER (PARTITION BY (ID > 0)) | BOOLAND_AGG(C2) OVER (PARTITION BY (ID > 0)) | BOOLAND_AGG(C3) OVER (PARTITION BY (ID > 0)) | BOOLAND_AGG(C4) OVER (PARTITION BY (ID > 0)) |
|----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------|
| -4 | False                                        | False                                        | False                                        | True                                         |
| -3 | False                                        | False                                        | False                                        | True                                         |
| -2 | False                                        | False                                        | False                                        | True                                         |
| -1 | False                                        | False                                        | False                                        | True                                         |
|  1 | True                                         | False                                        | False                                        | False                                        |
|  2 | True                                         | False                                        | False                                        | False                                        |
|  3 | True                                         | False                                        | False                                        | False                                        |
|  4 | True                                         | False                                        | False                                        | False                                        |
+----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------+
```

**Error example**

If this function is passed strings that can’t be converted to Boolean, the function returns an error:

```sqlsyntax
select booland_agg('invalid type');

100037 (22018): Boolean value 'invalid_type' is not recognized
```

---
title: BOOLNOT
source: https://docs.snowflake.com/en/sql-reference/functions/boolnot.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# BOOLNOT

Computes the Boolean NOT of a single numeric expression. In accordance with Boolean semantics:

* Non-zero values, including negative numbers, are regarded as true.
* Zero values are regarded as false.

As a result, the function returns:

* `True` if the expression is zero.
* `False` if the expression is non-zero.
* `NULL` if the expression is NULL.

See also:
:   [BOOLAND](booland.md) , [BOOLOR](boolor.md) , [BOOLXOR](boolxor.md)

## Syntax

```sqlsyntax
BOOLNOT( <expr> )
```

## Arguments

`expr`
:   A numeric expression.

## Returns

This function returns a value of type BOOLEAN or NULL.

## Usage Notes

This function rounds [floating-point numbers](../data-types-numeric.md).
Therefore, it might return unexpected results for floating-point numbers that round to zero.

For examples of this behavior and workarounds, see Compute Boolean NOT results for floating-point numbers.

## Examples

The following examples use the BOOLNOT function.

### Compute Boolean NOT results for integers and NULL values

The following query computes Boolean NOT results for integers and NULL values:

```sqlexample
SELECT BOOLNOT(0),
       BOOLNOT(10),
       BOOLNOT(NULL);
```

```output
+------------+-------------+---------------+
| BOOLNOT(0) | BOOLNOT(10) | BOOLNOT(NULL) |
|------------+-------------+---------------|
| True       | False       | NULL          |
+------------+-------------+---------------+
```

### Compute Boolean NOT results for floating-point numbers

The following examples demonstrate how the function might return unexpected results for floating-point
numbers that round to zero.

For the following queries, a result of `False` might be expected for the following function calls, but they return
`True` because the function rounds non-zero floating-point values to zero:

```sqlexample
SELECT BOOLNOT(0.3);
```

```output
+--------------+
| BOOLNOT(0.3) |
|--------------|
| True         |
+--------------+
```

```sqlexample
SELECT BOOLNOT(-0.4);
```

```output
+---------------+
| BOOLNOT(-0.4) |
|---------------|
| True          |
+---------------+
```

If required, you can work around this rounding behavior for positive floating-point values by using the
[CEIL](ceil.md) function. For example, the following query returns `False`:

```sqlexample
SELECT BOOLNOT(CEIL(0.3));
```

```output
+--------------------+
| BOOLNOT(CEIL(0.3)) |
|--------------------|
| False              |
+--------------------+
```

For negative floating-point values, you can work around this rounding behavior by using the
:[NOT logical operator](../operators-logical.md) instead of the function. For example,
the following query returns `False`:

```sqlexample
SELECT NOT -0.4;
```

```output
+----------+
| NOT -0.4 |
|----------|
| False    |
+----------+
```

---
title: BOOLOR
source: https://docs.snowflake.com/en/sql-reference/functions/boolor.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# BOOLOR

Computes the Boolean OR of two numeric expressions. In accordance with Boolean semantics:

* Non-zero values, including negative numbers, are regarded as true.
* Zero values are regarded as false.

As a result, the function returns:

* `True` if both expressions are non-zero or one expression is non-zero and the other expression is zero or NULL.
* `False` if both expressions are zero.
* `NULL` if both expressions are NULL or one expression is NULL and the other expression is zero.

See also:
:   [BOOLAND](booland.md) , [BOOLNOT](boolnot.md) , [BOOLXOR](boolxor.md)

## Syntax

```sqlsyntax
BOOLOR( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A numeric expression.

`expr2`
:   A numeric expression.

## Returns

This function returns a value of type BOOLEAN or NULL.

## Usage notes

This function rounds [floating-point numbers](../data-types-numeric.md).
Therefore, it might return unexpected results when it rounds non-zero floating-point numbers
to zero.

For examples of this behavior and workarounds, see Compute Boolean OR results for floating-point numbers.

## Examples

The following examples use the BOOLOR function.

### Compute Boolean OR results for integers and NULL values

The following query computes Boolean OR results for integers and NULL values:

```sqlexample
SELECT BOOLOR(1, 2),
       BOOLOR(0, 2),
       BOOLOR(3, NULL),
       BOOLOR(0, 0),
       BOOLOR(NULL, 0),
       BOOLOR(NULL, NULL);
```

```output
+--------------+--------------+-----------------+--------------+-----------------+--------------------+
| BOOLOR(1, 2) | BOOLOR(0, 2) | BOOLOR(3, NULL) | BOOLOR(0, 0) | BOOLOR(NULL, 0) | BOOLOR(NULL, NULL) |
|--------------+--------------+-----------------+--------------+-----------------+--------------------|
| True         | True         | True            | False        | NULL            | NULL               |
+--------------+--------------+-----------------+--------------+-----------------+--------------------+
```

### Compute Boolean OR results for floating-point numbers

The following examples demonstrate how the function might return unexpected results for floating-point
numbers that round to zero.

For the following queries, a result of `True` might be expected for the following function calls, but they return
`False` because the function rounds non-zero floating-point values to zero:

```sqlexample
SELECT BOOLOR(0.4, 0.3);
```

```output
+------------------+
| BOOLOR(0.4, 0.3) |
|------------------|
| False            |
+------------------+
```

```sqlexample
SELECT BOOLOR(-0.4, 0.3);
```

```output
+-------------------+
| BOOLOR(-0.4, 0.3) |
|-------------------|
| False             |
+-------------------+
```

For the following queries, a result of `True` might be expected for the following function calls, but they return
`NULL`:

```sqlexample
SELECT BOOLOR(0.4, NULL);
```

```output
+-------------------+
| BOOLOR(0.4, NULL) |
|-------------------|
| NULL              |
+-------------------+
```

```sqlexample
SELECT BOOLOR(-0.4, NULL);
```

```output
+--------------------+
| BOOLOR(-0.4, NULL) |
|--------------------|
| NULL               |
+--------------------+
```

If required, you can work around this rounding behavior for floating-point values by using the
[OR logical operator](../operators-logical.md) instead of the function. For example,
the following query returns `True`:

```sqlexample
SELECT 0.4 OR 0.3;
```

```output
+------------+
| 0.4 OR 0.3 |
|------------|
| True       |
+------------+
```

---
title: BOOLOR_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/boolor_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Boolean) , [Window functions](../functions-window.md) , [Conditional expression functions](../expressions-conditional.md)

# BOOLOR_AGG

Returns TRUE if at least one Boolean record in a group evaluates to TRUE.

If all records in the group are NULL, or if the group is empty, the function returns NULL.

See also:
:   [BOOLOR](boolor.md) , [BOOLAND_AGG](booland_agg.md) , [BOOLXOR_AGG](boolxor_agg.md)

## Syntax

**Aggregate function**

```sqlsyntax
BOOLOR_AGG( <expr> )
```

**Window function**

```sqlsyntax
BOOLOR_AGG( <expr> ) OVER ( [ PARTITION BY <partition_expr> ] )
```

## Arguments

`expr`
:   The input expression must be an expression that can be evaluated to a boolean or converted to a boolean.

`partition_expr`
:   This column or expression specifies how to separate the input into partitions (sub-windows).

## Returns

The data type of the returned value is BOOLEAN.

## Usage notes

* [Numeric](../data-types-numeric.md) values are converted to `TRUE` if they are non-zero.
* [String and binary](../data-types-text.md) values aren’t supported because they can’t be converted to Boolean values.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

**Aggregate function**

The following example shows that boolor_agg returns true if at least one of the input values is true.

Create and load the table:

```sqlexample
CREATE OR REPLACE TABLE test_boolean_agg (
  id INTEGER,
  c1 BOOLEAN,
  c2 BOOLEAN,
  c3 BOOLEAN,
  c4 BOOLEAN
);

INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (1, TRUE, TRUE,  TRUE,  FALSE),
  (2, TRUE, FALSE, FALSE, FALSE),
  (3, TRUE, TRUE,  FALSE, FALSE),
  (4, TRUE, FALSE, FALSE, FALSE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg;
```

```output
+----+------+-------+-------+-------+
| ID | C1   | C2    | C3    | C4    |
|----+------+-------+-------+-------|
|  1 | True | True  | True  | False |
|  2 | True | False | False | False |
|  3 | True | True  | False | False |
|  4 | True | False | False | False |
+----+------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT BOOLOR_AGG(c1), BOOLOR_AGG(c2), BOOLOR_AGG(c3), BOOLOR_AGG(c4)
  FROM test_boolean_agg;
```

```output
+----------------+----------------+----------------+----------------+
| BOOLOR_AGG(C1) | BOOLOR_AGG(C2) | BOOLOR_AGG(C3) | BOOLOR_AGG(C4) |
|----------------+----------------+----------------+----------------|
| True           | True           | True           | False          |
+----------------+----------------+----------------+----------------+
```

**Window function**

This example is similar to the previous example, but it shows usage as a window function, with the input rows
split into two partitions (one for IDs greater than 0 and one for IDs less than or equal to 0). Additional data was
added to the table.

Add rows to the table:

```sqlexample
INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (-4, FALSE, FALSE, FALSE, TRUE),
  (-3, FALSE, TRUE,  TRUE,  TRUE),
  (-2, FALSE, FALSE, TRUE,  TRUE),
  (-1, FALSE, TRUE,  TRUE,  TRUE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+-------+-------+-------+-------+
| ID | C1    | C2    | C3    | C4    |
|----+-------+-------+-------+-------|
| -4 | False | False | False | True  |
| -3 | False | True  | True  | True  |
| -2 | False | False | True  | True  |
| -1 | False | True  | True  | True  |
|  1 | True  | True  | True  | False |
|  2 | True  | False | False | False |
|  3 | True  | True  | False | False |
|  4 | True  | False | False | False |
+----+-------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT
    id,
    BOOLOR_AGG(c1) OVER (PARTITION BY (id > 0)),
    BOOLOR_AGG(c2) OVER (PARTITION BY (id > 0)),
    BOOLOR_AGG(c3) OVER (PARTITION BY (id > 0)),
    BOOLOR_AGG(c4) OVER (PARTITION BY (id > 0))
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+---------------------------------------------+---------------------------------------------+---------------------------------------------+---------------------------------------------+
| ID | BOOLOR_AGG(C1) OVER (PARTITION BY (ID > 0)) | BOOLOR_AGG(C2) OVER (PARTITION BY (ID > 0)) | BOOLOR_AGG(C3) OVER (PARTITION BY (ID > 0)) | BOOLOR_AGG(C4) OVER (PARTITION BY (ID > 0)) |
|----+---------------------------------------------+---------------------------------------------+---------------------------------------------+---------------------------------------------|
| -4 | False                                       | True                                        | True                                        | True                                        |
| -3 | False                                       | True                                        | True                                        | True                                        |
| -2 | False                                       | True                                        | True                                        | True                                        |
| -1 | False                                       | True                                        | True                                        | True                                        |
|  1 | True                                        | True                                        | True                                        | False                                       |
|  2 | True                                        | True                                        | True                                        | False                                       |
|  3 | True                                        | True                                        | True                                        | False                                       |
|  4 | True                                        | True                                        | True                                        | False                                       |
+----+---------------------------------------------+---------------------------------------------+---------------------------------------------+---------------------------------------------+
```

**Error example**

If this function is passed strings that cannot be converted to Boolean, the function will give an error:

```sqlsyntax
select boolor_agg('invalid type');

100037 (22018): Boolean value 'invalid_type' is not recognized
```

---
title: BOOLXOR
source: https://docs.snowflake.com/en/sql-reference/functions/boolxor.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# BOOLXOR

Computes the Boolean XOR of two numeric expressions; that is, one of the expressions, but not both expressions,
is true. In accordance with Boolean semantics:

* Non-zero values, including negative numbers, are regarded as true.
* Zero values are regarded as false.

As a result, the function returns:

* `True` if one expression is non-zero and the other expression is zero.
* `False` if both expressions are non-zero or both expressions are zero.
* `NULL` if one or both expressions are NULL.

See also:
:   [BOOLAND](booland.md) , [BOOLNOT](boolnot.md) , [BOOLOR](boolor.md)

## Syntax

```sqlsyntax
BOOLXOR( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A numeric expression.

`expr2`
:   A numeric expression.

## Returns

This function returns a value of type BOOLEAN or NULL.

## Usage notes

This function rounds [floating-point numbers](../data-types-numeric.md).
Therefore, it might return unexpected results when it rounds non-zero floating-point numbers
to zero.

For examples of this behavior and workarounds, see Compute Boolean XOR results for floating-point numbers.

## Examples

The following examples use the BOOLXOR function.

### Compute Boolean XOR results for integers and NULL values

The following query computes Boolean XOR results for integers and NULL values:

```sqlexample
SELECT BOOLXOR(2, 0),
       BOOLXOR(1, -1),
       BOOLXOR(0, 0),
       BOOLXOR(NULL, 3),
       BOOLXOR(NULL, 0),
       BOOLXOR(NULL, NULL);
```

```output
+---------------+----------------+---------------+------------------+------------------+---------------------+
| BOOLXOR(2, 0) | BOOLXOR(1, -1) | BOOLXOR(0, 0) | BOOLXOR(NULL, 3) | BOOLXOR(NULL, 0) | BOOLXOR(NULL, NULL) |
|---------------+----------------+---------------+------------------+------------------+---------------------|
| True          | False          | False         | NULL             | NULL             | NULL                |
+---------------+----------------+---------------+------------------+------------------+---------------------+
```

### Compute Boolean XOR results for floating-point numbers

The following examples demonstrate how the function might return unexpected results for floating-point
numbers that round to zero.

For the following queries, a result of `False` might be expected for the following function calls, but they return
`True` because the function rounds non-zero floating-point values to zero:

```sqlexample
SELECT BOOLXOR(2, 0.3);
```

```output
+-----------------+
| BOOLXOR(2, 0.3) |
|-----------------|
| True            |
+-----------------+
```

```sqlexample
SELECT BOOLXOR(-0.4, 5);
```

```output
+------------------+
| BOOLXOR(-0.4, 5) |
|------------------|
| True             |
+------------------+
```

Similarly, a result of `True` might be expected for the following function calls, but they return
`False`:

```sqlexample
SELECT BOOLXOR(0, 0.3);
```

```output
+-----------------+
| BOOLXOR(0, 0.3) |
|-----------------|
| False           |
+-----------------+
```

```sqlexample
SELECT BOOLXOR(-0.4, 0);
```

```output
+------------------+
| BOOLXOR(-0.4, 0) |
|------------------|
| False            |
+------------------+
```

If required, you can work around this rounding behavior for positive floating-point values by using the
[CEIL](ceil.md) function. For example, the following query returns `False`:

```sqlexample
SELECT BOOLXOR(2, CEIL(0.3));
```

```output
+-----------------------+
| BOOLXOR(2, CEIL(0.3)) |
|-----------------------|
| False                 |
+-----------------------+
```

For negative floating-point values, you can work around this rounding behavior by using the
[FLOOR](floor.md) function. For example, the following query returns `False`:

```sqlexample
SELECT BOOLXOR(FLOOR(-0.4), 5);
```

```output
+-------------------------+
| BOOLXOR(FLOOR(-0.4), 5) |
|-------------------------|
| False                   |
+-------------------------+
```

---
title: BOOLXOR_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/boolxor_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Boolean) , [Window functions](../functions-window.md) , [Conditional expression functions](../expressions-conditional.md)

# BOOLXOR_AGG

Returns TRUE if exactly one Boolean record in the group evaluates to TRUE.

If all records in the group are NULL, or if the group is empty, the function returns NULL.

See also:
:   [BOOLXOR](boolxor.md) , [BOOLAND_AGG](booland_agg.md) , [BOOLOR_AGG](boolor_agg.md)

## Syntax

**Aggregate function**

```sqlsyntax
BOOLXOR_AGG( <expr> )
```

**Window function**

```sqlsyntax
BOOLXOR_AGG( <expr> ) OVER ( [ PARTITION BY <partition_expr> ] )
```

## Arguments

`expr`
:   The input expression must be an expression that can be evaluated to a boolean or converted to a boolean.

`partition_expr`
:   This column or expression specifies how to separate the input into partitions (sub-windows).

## Returns

This function returns a value of type BOOLEAN.

## Usage notes

* [Numeric](../data-types-numeric.md) values are converted to `TRUE` if they are non-zero.
* [String and binary](../data-types-text.md) values aren’t supported because they can’t be converted to Boolean values.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

The following example shows that boolxor_agg returns true when exactly one of the input values is true.

Create and load the table:

```sqlexample
CREATE OR REPLACE TABLE test_boolean_agg (
  id INTEGER,
  c1 BOOLEAN,
  c2 BOOLEAN,
  c3 BOOLEAN,
  c4 BOOLEAN
);

INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (1, TRUE, TRUE,  TRUE,  FALSE),
  (2, TRUE, FALSE, FALSE, FALSE),
  (3, TRUE, TRUE,  FALSE, FALSE),
  (4, TRUE, FALSE, FALSE, FALSE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg;
```

```output
+----+------+-------+-------+-------+
| ID | C1   | C2    | C3    | C4    |
|----+------+-------+-------+-------|
|  1 | True | True  | True  | False |
|  2 | True | False | False | False |
|  3 | True | True  | False | False |
|  4 | True | False | False | False |
+----+------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT BOOLXOR_AGG(c1), BOOLXOR_AGG(c2), BOOLXOR_AGG(c3), BOOLXOR_AGG(c4)
  FROM test_boolean_agg;
```

```output
+-----------------+-----------------+-----------------+-----------------+
| BOOLXOR_AGG(C1) | BOOLXOR_AGG(C2) | BOOLXOR_AGG(C3) | BOOLXOR_AGG(C4) |
|-----------------+-----------------+-----------------+-----------------|
| False           | False           | True            | False           |
+-----------------+-----------------+-----------------+-----------------+
```

**Window function**

This example is similar to the previous example, but it shows usage as a window function, with the input rows
split into two partitions (one for IDs greater than 0 and one for IDs less than or equal to 0). Additional data was
added to the table.

Add rows to the table:

```sqlexample
INSERT INTO test_boolean_agg (id, c1, c2, c3, c4) VALUES
  (-4, FALSE, FALSE, FALSE, TRUE),
  (-3, FALSE, TRUE,  TRUE,  TRUE),
  (-2, FALSE, FALSE, TRUE,  TRUE),
  (-1, FALSE, TRUE,  TRUE,  TRUE);
```

Display the data:

```sqlexample
SELECT *
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+-------+-------+-------+-------+
| ID | C1    | C2    | C3    | C4    |
|----+-------+-------+-------+-------|
| -4 | False | False | False | True  |
| -3 | False | True  | True  | True  |
| -2 | False | False | True  | True  |
| -1 | False | True  | True  | True  |
|  1 | True  | True  | True  | False |
|  2 | True  | False | False | False |
|  3 | True  | True  | False | False |
|  4 | True  | False | False | False |
+----+-------+-------+-------+-------+
```

Query the data:

```sqlexample
SELECT
    id,
    BOOLXOR_AGG(c1) OVER (PARTITION BY (id > 0)),
    BOOLXOR_AGG(c2) OVER (PARTITION BY (id > 0)),
    BOOLXOR_AGG(c3) OVER (PARTITION BY (id > 0)),
    BOOLXOR_AGG(c4) OVER (PARTITION BY (id > 0))
  FROM test_boolean_agg
  ORDER BY id;
```

```output
+----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------+
| ID | BOOLXOR_AGG(C1) OVER (PARTITION BY (ID > 0)) | BOOLXOR_AGG(C2) OVER (PARTITION BY (ID > 0)) | BOOLXOR_AGG(C3) OVER (PARTITION BY (ID > 0)) | BOOLXOR_AGG(C4) OVER (PARTITION BY (ID > 0)) |
|----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------|
| -4 | False                                        | False                                        | False                                        | False                                        |
| -3 | False                                        | False                                        | False                                        | False                                        |
| -2 | False                                        | False                                        | False                                        | False                                        |
| -1 | False                                        | False                                        | False                                        | False                                        |
|  1 | False                                        | False                                        | True                                         | False                                        |
|  2 | False                                        | False                                        | True                                         | False                                        |
|  3 | False                                        | False                                        | True                                         | False                                        |
|  4 | False                                        | False                                        | True                                         | False                                        |
+----+----------------------------------------------+----------------------------------------------+----------------------------------------------+----------------------------------------------+
```

**Error example**

If this function is passed strings that cannot be converted to Boolean, the function will give an error:

```sqlsyntax
select boolxor_agg('invalid type');

100037 (22018): Boolean value 'invalid_type' is not recognized
```

---
title: BUILD_SCOPED_FILE_URL
source: https://docs.snowflake.com/en/sql-reference/functions/build_scoped_file_url.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# BUILD_SCOPED_FILE_URL

Generates a scoped Snowflake file URL to a staged file using the stage name and relative file path as inputs.

A scoped URL is encoded and
permits access to a specified file for a limited period of time. The scoped URL in the output is valid for the
caller until the persisted query result period ends (until the results cache expires). That period is currently 24 hours.

Call this SQL function in a query or view. You can also use this SQL function to pass a scoped URL to a user-defined function (UDF)
or stored procedure.

## Syntax

```sqlsyntax
BUILD_SCOPED_FILE_URL(
  @<stage_name> ,
  '<relative_file_path>' ,
  <use_privatelink_host_for_business_critical>)
```

## Arguments

`stage_name`
:   Name of the internal or external stage where the file is stored.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (for example,
    > `'@"my stage"'` for a stage named `"my stage"`).

`relative_file_path`
:   Path and filename of the file, relative to its location on the stage.

`use_privatelink_host_for_business_critical`
:   Specifies whether to prepend `privatelink` to the URL for [Business Critical](../../user-guide/intro-editions.md) accounts.

    * `TRUE` prepends `privatelink` to the URL just
      before the hostname; for example, `privatelink.snowflakecomputing.com`.

      > **Note:**
      >
      > Snowflake prepends `privatelink` to the URL regardless of whether you’ve enabled private connectivity.
    * `FALSE` overrides the default behavior and does not add `privatelink` to the URL.

    Default: TRUE

## Returns

The function returns a scoped URL in the following format:

```sqlsyntax
https://<account_identifier>/api/files/<query_id>/<encoded_file_path>
```

Where:

`account_identifier`
:   Hostname of the Snowflake account for your stage. The hostname starts with a Snowflake-provided account locator
    and ends with the Snowflake domain (`snowflakecomputing.com`):

    `account_locator.snowflakecomputing.com`

    For more details, see [Account identifiers](../../user-guide/admin-account-identifier.md).

`query_id`
:   Query ID of the BUILD_SCOPED_FILE_URL call that generated the scoped URL.

`encoded_file_path`
:   Encoded path to the files to access using the scoped URL.

## Usage notes

* The permissions required to call this SQL function differ depending on how it is called:

  | SQL Operation | Permissions Required |
  | --- | --- |
  | Query | USAGE (external stage) or READ (internal stage) |
  | Column definition in a view | The view owner (i.e. role that has the OWNERSHIP privilege on the view) must have the stage privilege: USAGE (external stage) or READ (internal stage).  A role that queries the view only requires the SELECT privilege on the view. |
  | Stored procedure | The stored procedure owner (i.e. role that has the OWNERSHIP privilege on the stored procedure) must have the stage privilege: USAGE (external stage) or READ (internal stage).  A role that queries the stored procedure only requires the USAGE privilege on the stored procedure. |
  | UDF | The UDF owner (i.e. role that has the OWNERSHIP privilege on the UDF) must have the stage privilege: USAGE (external stage) or READ (internal stage).  A role that queries the UDF only requires the USAGE privilege on the UDF. |
* An HTTP client that sends a scoped URL to the REST API must be configured to allow redirects.
* When a scoped URL is accessed, the query history shows that the internal GET_SCOPED_FILE function was called.

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

Retrieve a scoped URL for a bitmap format image file in an external stage:

```sqlexample
SELECT BUILD_SCOPED_FILE_URL(@images_stage,'/us/yosemite/half_dome.jpg', TRUE);
```

```sqlexample
https://my_account.snowflakecomputing.com/api/files/019260c2-00c0-f2f2-0000-4383001cf046/bXlfZGF0YWJhc2UvbXlfc2NoZW1hL215X3N0YWdlL2ZvbGRlcjEvZm9sZGVyMi9maWxlMQ
```

Create a secure view that filters the results of a BUILD_SCOPED_FILE_URL function call for a specific audience. In this example, querying
the secure view returns only those files in the stage file path that include the string `acct1`:

```sqlexample
-- Create a table that stores the relative file path for each staged file along with any other related data.
CREATE TABLE acct_table (
  acct_name string,
  relative_file_path string
);

-- Create a secure view on the table you created.
-- A role that has the SELECT privilege on the secure view has scoped access to the filtered set of files that include the acct1 text string.
CREATE SECURE VIEW acct1_files
AS
  SELECT BUILD_SCOPED_FILE_URL(@acct_files, relative_file_path, FALSE) scoped_url
  FROM acct_table
  WHERE acct_name = 'acct1';
```

---
title: BUILD_STAGE_FILE_URL
source: https://docs.snowflake.com/en/sql-reference/functions/build_stage_file_url.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# BUILD_STAGE_FILE_URL

Generates a Snowflake *file URL* to a staged file using the stage name and relative file path as inputs. A file URL permits
prolonged access to a specified file. That is, the file URL does not expire.

Call this SQL function in a query, user-defined function (UDF), or stored procedure.

Access files in a stage by sending the file URL in a request to the REST API for file support. When users send a file URL to the REST API
to access files, Snowflake performs the following actions:

1. Authenticate the user.
2. Verify that the role has sufficient privileges on the stage that contains the file.
3. Redirect the user to the staged file in the cloud storage service.

## Syntax

```sqlsyntax
BUILD_STAGE_FILE_URL( @<stage_name> , '<relative_file_path>' )
```

## Arguments

`stage_name`
:   Name of the internal or external stage where the file is stored.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a stage
    > named `"my stage"`).

`relative_file_path`
:   Path and filename of the file relative to its location in the stage.

## Returns

The function returns a file URL in the following format:

```sqlsyntax
https://<account_identifier>/api/files/<db_name>/<schema_name>/<stage_name>/<relative_path>
```

Where:

`account_identifier`
:   Hostname of the Snowflake account for your stage. The hostname starts with an account locator (provided by Snowflake) and ends with the
    Snowflake domain (`snowflakecomputing.com`):

    `account_locator.snowflakecomputing.com`

    For more details, see [Account identifiers](../../user-guide/admin-account-identifier.md).

    > **Note:**
    >
    > For [Business Critical](../../user-guide/intro-editions.md) accounts, a `privatelink` segment is prepended to the URL just before
    > `snowflakecomputing.com` (`privatelink.snowflakecomputing.com`), even if private connectivity to the Snowflake service is not
    > enabled for your account.

    > **Important:**
    >
    > Currently, the function returns the account identifier in the form `organization_name-account_name`. When a file URL is used
    > as input to a GET request, the API endpoint returns an error.
    >
    > To resolve the error, you must manually convert the account identifier to the applicable form for your account:
    >
    > `account_locator.region_id` or
    >
    > `account_locator.region_id.cloud`
    >
    > For more information about these forms, see [Format 2: Account locator in a region](../../user-guide/admin-account-identifier.md).
    >
    > In an upcoming release, the function will return file URLs in the correct form.

`db_name`
:   Name of the database that contains the stage where your files are located.

`schema_name`
:   Name of the schema that contains the stage where your files are located.

`stage_name`
:   Name of the stage where your files are located.

`relative_path`
:   Path to the files to access using the file URL.

## Usage notes

* The permissions required to call this SQL function differ depending on how it is called:

  | SQL Operation | Permissions Required |
  | --- | --- |
  | Query | USAGE (external stage) or READ (internal stage) |
  | Stored procedure | The stored procedure owner (i.e. role that has the OWNERSHIP privilege on the stored procedure) must have the stage privilege: USAGE (external stage) or READ (internal stage).  A role that queries the stored procedure only requires the USAGE privilege on the stored procedure. |
  | UDF | The UDF owner (i.e. role that has the OWNERSHIP privilege on the UDF) must have the stage privilege: USAGE (external stage) or READ (internal stage).  A role that queries the UDF only requires the USAGE privilege on the UDF. |
* An HTTP client that sends a file URL to the REST API must be configured to allow redirects.
* When a file URL is accessed, the query history shows that the internal GET_STAGE_FILE function was called.

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

Retrieve a file URL for a bitmap format image file in an external stage:

```sqlexample
SELECT BUILD_STAGE_FILE_URL(@images_stage,'/us/yosemite/half_dome.jpg');
```

```sqlexample
https://my_account.snowflakecomputing.com/api/files/MY_DB/PUBLIC/IMAGES_STAGE/us/yosemite/half_dome.jpg
```

---
title: CASE
source: https://docs.snowflake.com/en/sql-reference/functions/case.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# CASE

Works like a cascading “if-then-else” statement. In the more general form,
a series of conditions are evaluated in sequence. When a condition evaluates
to TRUE, the evaluation stops and the associated result (after THEN) is
returned. If none of the conditions evaluate to TRUE, then the result after
the optional ELSE is returned, if present; otherwise NULL is returned.

In the second, “shorthand” form, the expression after CASE is compared to
each of the WHEN expressions in sequence, until one matches; then the
associated result (after THEN) is returned. If none of the expressions
match, the result after the optional ELSE is returned, if present;
otherwise NULL is returned.

Note that in the second form, a NULL CASE expression matches none of
the WHEN expressions, even if one of the WHEN expressions is also NULL.

See also:
:   [IFF](iff.md)

## Syntax

```sqlsyntax
CASE
    WHEN <condition1> THEN <result1>
  [ WHEN <condition2> THEN <result2> ]
  [ ... ]
  [ ELSE <result3> ]
END

CASE <expr>
    WHEN <value1> THEN <result1>
  [ WHEN <value2> THEN <result2> ]
  [ ... ]
  [ ELSE <result3> ]
END
```

## Arguments

`condition#`
:   In the first form of `CASE`, each condition is an expression that
    should evaluate to a BOOLEAN value (True, False, or NULL).

`expr`
:   A general expression.

`value`
:   In the second form of `CASE`, each `value` is a potential match
    for `expr`. The `value` can be a literal or an expression.
    The `value` must be the same data type as the `expr`, or
    must be a data type that can be cast to the data type of the `expr`.

`result#`
:   In the first form of the `CASE` clause, if `condition#` is true,
    then the function returns the corresponding `result#`. If more than
    one condition is true, then the result associated with the first true
    condition is returned.

    In the second form of the `CASE` statement, if `value#` matches the
    `expr`, then the corresponding `result` is returned. If more
    than one `value` matches the `expr`, then the first matching
    value’s `result` is returned.

    The result should be an expression that evaluates to a single value.

    In both forms of `CASE`, if the optional `ELSE` clause is present, and
    if no matches are found, then the function returns the result in the
    `ELSE` clause. If no `ELSE` clause is present, and no matches are found,
    then the result is NULL.

## Usage notes

* Note that, contrary to [DECODE](decode.md), a NULL value in the condition
  does not match a NULL value elsewhere in the condition.
  For example `WHEN <null_expr> = NULL THEN 'Return me!'` does not
  return “Return me!”. If you want to compare to NULL values, use
  `IS NULL` rather than `= NULL`.
* The `condition#`, `expr`, `value`, and
  `result` can all be general expressions and thus can include
  subqueries that include set operators, such
  as `UNION`, `INTERSECT`, `EXCEPT`, and `MINUS`.
  When using set operators, make sure that data types are compatible. For
  details, see the [General usage notes](../operators-query.md) in the
  [Set operators](../operators-query.md) topic.

## Collation details

In the first form of `CASE`, each expression is independent, and the collation specifications in different
branches are independent. For example, in the following, the collation specifications in
`condition1` are independent of the collation specification(s) in `condition2`,
and those collation specifications do not need to be identical or even compatible.

```sqlsyntax
CASE
    WHEN <condition1> THEN <result1>
  [ WHEN <condition2> THEN <result2> ]
```

In the second form of `CASE`, although all collation-related operations must use compatible collation specifications,
the collation specifications do not need to be identical. For example, in the following statement, the collation
specifications of both `value1` and `value2` must be compatible with the collation specification of
`expr`, but the collation specifications of `value1` and `value2` do not need to be identical
to each other or to the collation specification of `expr`.

> ```sqlexample
> CASE <expr>
>     WHEN <value1> THEN <result1>
>   [ WHEN <value2> THEN <result2> ]
>   ...
> ```

The value returned from the function has the
highest-[precedence](../collation.md) collation of the `THEN`/`ELSE`
arguments.

## Examples

This example shows a typical use of CASE:

```sqlexample
SELECT
    column1,
    CASE
        WHEN column1=1 THEN 'one'
        WHEN column1=2 THEN 'two'
        ELSE 'other'
    END AS result
FROM (values(1),(2),(3)) v;
```

```output
+---------+--------+
| COLUMN1 | RESULT |
|---------+--------|
|       1 | one    |
|       2 | two    |
|       3 | other  |
+---------+--------+
```

This example shows that if none of the values match, and there is no ELSE clause,
then the value returned is NULL:

```sqlexample
SELECT
    column1,
    CASE
        WHEN column1=1 THEN 'one'
        WHEN column1=2 THEN 'two'
    END AS result
FROM (values(1),(2),(3)) v;
```

```output
+---------+--------+
| COLUMN1 | RESULT |
|---------+--------|
|       1 | one    |
|       2 | two    |
|       3 | NULL   |
+---------+--------+
```

This example handles NULL explicitly.

```sqlexample
SELECT
    column1,
    CASE
        WHEN column1 = 1 THEN 'one'
        WHEN column1 = 2 THEN 'two'
        WHEN column1 IS NULL THEN 'NULL'
        ELSE 'other'
    END AS result
FROM VALUES (1), (2), (NULL);
```

```output
+---------+--------+
| COLUMN1 | RESULT |
|---------+--------|
|       1 | one    |
|       2 | two    |
|    NULL | NULL   |
+---------+--------+
```

The following examples combine CASE with collation:

```sqlexample
SELECT CASE COLLATE('m', 'upper')
    WHEN 'M' THEN TRUE
    ELSE FALSE
END;
```

```output
+----------------------------+
| CASE COLLATE('M', 'UPPER') |
|     WHEN 'M' THEN TRUE     |
|     ELSE FALSE             |
| END                        |
|----------------------------|
| True                       |
+----------------------------+
```

```sqlexample
SELECT CASE 'm'
    WHEN COLLATE('M', 'lower') THEN TRUE
    ELSE FALSE
END;
```

```output
+------------------------------------------+
| CASE 'M'                                 |
|     WHEN COLLATE('M', 'LOWER') THEN TRUE |
|     ELSE FALSE                           |
| END                                      |
|------------------------------------------|
| True                                     |
+------------------------------------------+
```

---
title: CAST , ::
source: https://docs.snowflake.com/en/sql-reference/functions/cast.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# CAST , `::`

Converts a value of one data type into another data type. The semantics of CAST
are the same as the semantics of the corresponding TO_ `datatype` conversion
functions. If the cast is not possible, an error is raised. For more details,
see the individual TO_ `datatype` conversion functions. For more information
about data type conversion and the TO_ `datatype` conversion
functions, see [Data type conversion](../data-type-conversion.md).

The `::` operator provides alternative syntax for CAST.

See also:
:   [TRY_CAST](try_cast.md)

## Syntax

```sqlsyntax
CAST( <source_expr> AS <target_data_type> )
  [ RENAME FIELDS | ADD FIELDS ]

<source_expr> :: <target_data_type>
```

## Arguments

`source_expr`
:   Expression of any supported data type to be converted into a
    different data type.

`target_data_type`
:   The data type to which to convert the expression. If the data
    type supports additional properties, such as
    [precision and scale](../data-types-numeric.md)
    (for numbers/decimals), the properties can be included.

`RENAME FIELDS`
:   For [structured OBJECTs](../data-types-structured.md), specifies that you want to change the OBJECT to use
    different key-value pairs. The values in the original object are copied to the new key-value pairs in the order in which
    they appear.

    For an example, see [Example: Changing the key names in an OBJECT value](../data-types-structured.md).

`ADD FIELDS`
:   For [structured OBJECTs](../data-types-structured.md), specifies that you want to add key-value pairs to the
    OBJECT.

    For an example, see [Example: Adding keys to an OBJECT value](../data-types-structured.md).

    The values for the newly added keys will be set to NULL. If you want to assign a value to these keys, call the
    [OBJECT_INSERT](../data-types-structured.md) function instead.

## Usage notes

* If the scale is not sufficient to hold the input value, the function
  rounds the value.
* If the precision is not sufficient to hold the input value, the function
  raises an error.
* When numeric columns are explicitly cast to forms of the integer data type during a data unload to Parquet files, the data type of these
  columns in the Parquet files is INT. For more information, see [Explicitly convert numeric columns to Parquet data types](../../user-guide/data-unload-considerations.md).
* Collation specifications aren’t retained when values are cast to
  [text string data types](../data-types-text.md) (for example, VARCHAR and STRING). You can include collation
  specifications when you cast values (for example, `CAST(myvalue AS VARCHAR) COLLATE 'en-ai'`).
* When you use the `::` alternative syntax, you cannot specify the `RENAME FIELDS` or `ADD FIELDS` arguments.

## Examples

The CAST examples use the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE test_data_type_conversion (
  varchar_value VARCHAR,
  number_value NUMBER(5, 4),
  timestamp_value TIMESTAMP);

INSERT INTO test_data_type_conversion VALUES (
  '9.8765',
  1.2345,
  '2024-05-09 14:32:29.135 -0700');

SELECT * FROM test_data_type_conversion;
```

```output
+---------------+--------------+-------------------------+
| VARCHAR_VALUE | NUMBER_VALUE | TIMESTAMP_VALUE         |
|---------------+--------------+-------------------------|
| 9.8765        |       1.2345 | 2024-05-09 14:32:29.135 |
+---------------+--------------+-------------------------+
```

The examples use the [SYSTEM$TYPEOF](system_typeof.md) function to show the data type of the converted value.

Convert a string to a number with specified scale (2):

```sqlexample
SELECT CAST(varchar_value AS NUMBER(5,2)) AS varchar_to_number1,
       SYSTEM$TYPEOF(varchar_to_number1) AS data_type
  FROM test_data_type_conversion;
```

```output
+--------------------+------------------+
| VARCHAR_TO_NUMBER1 | DATA_TYPE        |
|--------------------+------------------|
|               9.88 | NUMBER(5,2)[SB4] |
+--------------------+------------------+
```

Convert the same string to a number with scale 5, using
the `::` notation:

```sqlexample
SELECT varchar_value::NUMBER(6,5) AS varchar_to_number2,
       SYSTEM$TYPEOF(varchar_to_number2) AS data_type
  FROM test_data_type_conversion;
```

```output
+--------------------+------------------+
| VARCHAR_TO_NUMBER2 | DATA_TYPE        |
|--------------------+------------------|
|            9.87650 | NUMBER(6,5)[SB4] |
+--------------------+------------------+
```

Convert a number to an integer. For an integer, precision and scale cannot be specified, so
the default is always NUMBER(38, 0).

```sqlexample
SELECT CAST(number_value AS INTEGER) AS number_to_integer,
       SYSTEM$TYPEOF(number_to_integer) AS data_type
  FROM test_data_type_conversion;
```

```output
+-------------------+-------------------+
| NUMBER_TO_INTEGER | DATA_TYPE         |
|-------------------+-------------------|
|                 1 | NUMBER(38,0)[SB1] |
+-------------------+-------------------+
```

Convert a number to a string:

```sqlexample
SELECT CAST(number_value AS VARCHAR) AS number_to_varchar,
       SYSTEM$TYPEOF(number_to_varchar) AS data_type
  FROM test_data_type_conversion;
```

```output
+-------------------+--------------+
| NUMBER_TO_VARCHAR | DATA_TYPE    |
|-------------------+--------------|
| 1.2345            | VARCHAR[LOB] |
+-------------------+--------------+
```

Convert a string to a VARCHAR value with a specified length:

```sqlexample
SELECT varchar_value,
       SYSTEM$TYPEOF(varchar_value) AS data_type_source,
       CAST(varchar_value AS VARCHAR(9)) AS converted_varchar,
       SYSTEM$TYPEOF(converted_varchar) AS data_type_converted
  FROM test_data_type_conversion;
```

```output
+---------------+------------------------+-------------------+---------------------+
| VARCHAR_VALUE | DATA_TYPE_SOURCE       | CONVERTED_VARCHAR | DATA_TYPE_CONVERTED |
|---------------+------------------------+-------------------+---------------------|
| 9.8765        | VARCHAR(16777216)[LOB] | 9.8765            | VARCHAR(9)[LOB]     |
+---------------+------------------------+-------------------+---------------------+
```

If you are casting a value to the VARCHAR type with a specified length and the value exceeds that
length, an error is returned:

```sqlexample
SELECT CAST(varchar_value AS VARCHAR(4))
  FROM test_data_type_conversion;
```

```output
100078 (22000): String '9.8765' is too long and would be truncated
```

Convert a timestamp to a date:

```sqlexample
SELECT CAST(timestamp_value AS DATE) AS timestamp_to_date,
       SYSTEM$TYPEOF(timestamp_to_date) AS data_type
  FROM test_data_type_conversion;
```

```output
+-------------------+-----------+
| TIMESTAMP_TO_DATE | DATA_TYPE |
|-------------------+-----------|
| 2024-05-09        | DATE[SB4] |
+-------------------+-----------+
```

---
title: CBRT
source: https://docs.snowflake.com/en/sql-reference/functions/cbrt.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# CBRT

Returns the cubic root of a numeric expression.

## Syntax

```sqlsyntax
CBRT( <input_expr> )
```

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

```sqlexample
SELECT x, CBRT(x) FROM tab;

--------+-------------+
   x    |   cbrt(x)   |
--------+-------------+
 0      | 0           |
 2      | 1.25992105  |
 -10    | -2.15443469 |
 [NULL] | [NULL]      |
--------+-------------+
```

---
title: CEIL
source: https://docs.snowflake.com/en/sql-reference/functions/ceil.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# CEIL

Returns values from `input_expr` rounded to the nearest equal or larger integer,
or to the nearest equal or larger value with the specified number of places after the decimal point.

See also:
:   [FLOOR](floor.md) , [ROUND](round.md) , [TRUNCATE , TRUNC](trunc.md)

## Syntax

```sqlsyntax
CEIL( <input_expr> [, <scale_expr> ] )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be one of the numeric data types, such as DECFLOAT,
    FLOAT, or NUMBER.

`scale_expr`
:   The number of digits the output includes after the decimal point.

    The default `scale_expr` is zero, meaning that the function removes all digits after the decimal point.

    For information about negative scales, see Usage notes.

## Returns

The return type is based on the input type:

* If the input expression is a FLOAT, the returned type is a FLOAT.
* If the input expression is DECFLOAT, the returned type is DECFLOAT.
* If the input expression is a NUMBER, the returned type is a NUMBER.

  + If the input scale is constant:

    - If the input scale is positive, the returned type has a scale equal to the input scale and has a precision large enough to
      encompass any possible result.
    - If the input scale is negative, the returned type has a scale of 0.
  + If the input scale isn’t constant, the returned type’s scale is the same as the input expression’s.

If the scale is zero, then the value is effectively an INTEGER.

For example:

* The data type returned by CEIL(3.14::FLOAT, 1) is FLOAT.
* The NUMBER returned by CEIL(3.14, 1) has scale 1 and precision at least 3.
* The NUMBER returned by CEIL(-9.99, 0) has scale 0 and precision at least 2.
* The NUMBER returned by CEIL(33.33, -1) has scale 0 and precision at least 3.

## Usage notes

* If `scale_expr` is negative, then it specifies the number of places before the decimal point to
  which to adjust the number. For example, if the scale is -2, then the result is a multiple of 100.
* If `scale_expr` is larger than the input expression scale, the function does not have any effect.
* If either the `input_expr` or the `scale_expr` is NULL, then the result is NULL.
* When negative numbers are rounded up, the value is closer to 0. For example, CEIL(-1.9) is -1, not -2.
* If rounding the number upward brings the number outside of the range of values of the data type, then an error is returned.

## Examples

This example demonstrates the function without the `scale_expr`
parameter:

> ```sqlexample
> SELECT CEIL(135.135), CEIL(-975.975);
> +---------------+----------------+
> | CEIL(135.135) | CEIL(-975.975) |
> |---------------+----------------|
> |           136 |           -975 |
> +---------------+----------------+
> ```

This example demonstrates the function with the `scale_expr` parameter,
including with the scale set to negative numbers:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TRANSIENT TABLE test_ceiling (n FLOAT, scale INTEGER);
> > INSERT INTO test_ceiling (n, scale) VALUES
> >    (-975.975, -1),
> >    (-975.975,  0),
> >    (-975.975,  2),
> >    ( 135.135, -2),
> >    ( 135.135,  0),
> >    ( 135.135,  1),
> >    ( 135.135,  3),
> >    ( 135.135, 50),
> >    ( 135.135, NULL)
> >    ;
> > ```
>
> Output:
>
> > ```sqlexample
> > SELECT n, scale, ceil(n, scale)
> >   FROM test_ceiling
> >   ORDER BY n, scale;
> > +----------+-------+----------------+
> > |        N | SCALE | CEIL(N, SCALE) |
> > |----------+-------+----------------|
> > | -975.975 |    -1 |       -970     |
> > | -975.975 |     0 |       -975     |
> > | -975.975 |     2 |       -975.97  |
> > |  135.135 |    -2 |        200     |
> > |  135.135 |     0 |        136     |
> > |  135.135 |     1 |        135.2   |
> > |  135.135 |     3 |        135.135 |
> > |  135.135 |    50 |        135.135 |
> > |  135.135 |  NULL |           NULL |
> > +----------+-------+----------------+
> > ```

---
title: CHARINDEX
source: https://docs.snowflake.com/en/sql-reference/functions/charindex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# CHARINDEX

Searches for the first occurrence of the first argument in the second argument and, if successful, returns the position (1-based) of the first argument in the second argument.

Aliases:
:   [POSITION](position.md)

    Note that the CHARINDEX function does not support one of the syntax variations that POSITION supports.

## Syntax

```sqlsyntax
CHARINDEX( <expr1>, <expr2> [ , <start_pos> ] )
```

## Arguments

**Required:**

`expr1`
:   A string or binary expression representing the value to look for.

`expr2`
:   A string or binary expression representing the value to search.

**Optional:**

`start_pos`
:   A number indicating the position from where to start the search (with `1` representing the start of `expr2`).

    Default: `1`

## Usage notes

* If any arguments are NULL, the function returns NULL.
* If the string or binary value is not found, the function returns `0`.
* If the specified optional `start_pos` is beyond the end of the second argument (the string to
  search), the function returns `0`.
* If the first argument is empty (e.g. an empty string), the function returns `1`.
* The data types of the first two arguments should be the same; either both
  should be strings or both should be binary values.

## Collation details

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

### VARCHAR expressions

Find the first occurrence of ‘an’ in ‘banana’:

> ```sqlexample
> select charindex('an', 'banana', 1);
> +------------------------------+
> | CHARINDEX('AN', 'BANANA', 1) |
> |------------------------------|
> |                            2 |
> +------------------------------+
> ```

Find the first occurrence of ‘an’ in ‘banana’ at or after position 3. This search finds the second occurrence of ‘an’.

> ```sqlexample
> select charindex('an', 'banana', 3);
> +------------------------------+
> | CHARINDEX('AN', 'BANANA', 3) |
> |------------------------------|
> |                            4 |
> +------------------------------+
> ```

Search for various characters, including unicode characters, in strings:

> ```sqlexample
> SELECT n, h, CHARINDEX(n, h) FROM pos;
>
> +--------+---------------------+-----------------+
> | N      | H                   | CHARINDEX(N, H) |
> |--------+---------------------+-----------------|
> |        |                     |               1 |
> |        | sth                 |               1 |
> | 43     | 41424344            |               5 |
> | a      | NULL                |            NULL |
> | dog    | catalog             |               0 |
> | log    | catalog             |               5 |
> | lésine | le péché, la lésine |              14 |
> | nicht  | Ich weiß nicht      |              10 |
> | sth    |                     |               0 |
> | ☃c     | ☃a☃b☃c☃d            |               5 |
> | ☃☃     | bunch of ☃☃☃☃       |              10 |
> | ❄c     | ❄a☃c❄c☃             |               5 |
> | NULL   | a                   |            NULL |
> | NULL   | NULL                |            NULL |
> +--------+---------------------+-----------------+
> ```

### BINARY expressions

Note that because the values below are hexadecimal representations, a single BINARY byte is represented as two hex
digits.

In this example, the returned value is 3 because ‘EF’ matches the 3rd
byte (the first byte is ‘AB’; the second byte is ‘CD’, and the third byte
is ‘EF’):

> ```sqlexample
> SELECT CHARINDEX(X'EF', X'ABCDEF');
> +-----------------------------+
> | CHARINDEX(X'EF', X'ABCDEF') |
> |-----------------------------|
> |                           3 |
> +-----------------------------+
> ```

In this example, there is no match. Although the sequence ‘BC’ appears to be
in the value being searched, the ‘B’ is the second nybble of the first
byte, and the ‘C’ is the first nybble of the second byte; no byte actually
contains ‘BC’, so the returned value is 0 (not found).

> ```sqlexample
> SELECT CHARINDEX(X'BC', X'ABCD');
> +---------------------------+
> | CHARINDEX(X'BC', X'ABCD') |
> |---------------------------|
> |                         0 |
> +---------------------------+
> ```

---
title: CHECK_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/check_json.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# CHECK_JSON

Checks the validity of a JSON document. If the input string is a valid JSON
document or a NULL, the output is NULL (i.e. no error). If the input cannot be
translated to a valid JSON value, the output string contains the error message.

## Syntax

```sqlsyntax
CHECK_JSON( <string_or_variant_expr> )
```

## Arguments

`string_or_variant_expr`
:   A `VARIANT` or string value (or expression) to check.

    If the expression is of type `VARIANT`, it should contain a
    string.

## Examples

Create a table and insert some `VARCHAR` and `VARIANT` values:

> ```sqlexample
> CREATE TABLE sample_json_table (ID INTEGER, varchar1 VARCHAR, variant1 VARIANT);
> INSERT INTO sample_json_table (ID, varchar1) VALUES
>     (1, '{"ValidKey1": "ValidValue1"}'),
>     (2, '{"Malformed -- Missing value": }'),
>     (3, NULL)
>     ;
> UPDATE sample_json_table SET variant1 = varchar1::VARIANT;
> ```

Use the `CHECK_JSON` function to check the validity of potential
JSON-compatible strings in a `VARCHAR` column:

> ```sqlexample
> SELECT ID, CHECK_JSON(varchar1), varchar1 FROM sample_json_table ORDER BY ID;
> +----+----------------------+----------------------------------+
> | ID | CHECK_JSON(VARCHAR1) | VARCHAR1                         |
> |----+----------------------+----------------------------------|
> |  1 | NULL                 | {"ValidKey1": "ValidValue1"}     |
> |  2 | misplaced }, pos 32  | {"Malformed -- Missing value": } |
> |  3 | NULL                 | NULL                             |
> +----+----------------------+----------------------------------+
> ```

Use the `CHECK_JSON` function to check the validity of potential
JSON-compatible strings in a `VARIANT` column:

> ```sqlexample
> SELECT ID, CHECK_JSON(variant1), variant1 FROM sample_json_table ORDER BY ID;
> +----+----------------------+--------------------------------------+
> | ID | CHECK_JSON(VARIANT1) | VARIANT1                             |
> |----+----------------------+--------------------------------------|
> |  1 | NULL                 | "{\"ValidKey1\": \"ValidValue1\"}"   |
> |  2 | misplaced }, pos 32  | "{\"Malformed -- Missing value\": }" |
> |  3 | NULL                 | NULL                                 |
> +----+----------------------+--------------------------------------+
> ```

---
title: CHECK_XML
source: https://docs.snowflake.com/en/sql-reference/functions/check_xml.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# CHECK_XML

Checks the validity of an [XML](../../user-guide/semistructured-data-formats.md) document. If the input string is NULL or a valid XML document,
the output is NULL. In case of an XML parsing error, the output string contains the error message.

See also:
:   [PARSE_XML](parse_xml.md), [TO_XML](to_xml.md), [XMLGET](xmlget.md)

## Syntax

```sqlsyntax
CHECK_XML( <string_containing_xml> [ , <disable_auto_convert> ] )
```

```sqlsyntax
CHECK_XML( STR => <string_containing_xml>
  [ , DISABLE_AUTO_CONVERT => <disable_auto_convert> ] )
```

## Arguments

**Required:**

`string_containing_xml` . OR . `STR => string_containing_xml`
:   Specify an expression that evaluates to a VARCHAR value that contains valid XML.

**Optional:**

`disable_auto_convert` . OR . `DISABLE_AUTO_CONVERT => disable_auto_convert`
:   Specify the same value that you pass to the [PARSE_XML](parse_xml.md) function.

    Default: `FALSE`

## Returns

The data type of the returned value is VARCHAR.

## Usage notes

* When you mix arguments by position and by name, all of the positional arguments must come before
  all of the named arguments.
* When you specify an argument by name, you can’t use double quotes around the argument name.

## Examples

The following examples use the CHECK_XML function.

### Show the output of the function when the XML is valid

```sqlexample
SELECT CHECK_XML('<name> Valid </name>');
```

```output
+-----------------------------------+
| CHECK_XML('<NAME> VALID </NAME>') |
|-----------------------------------|
| NULL                              |
+-----------------------------------+
```

### Show the output of the function when the XML is invalid

```sqlexample
SELECT CHECK_XML('<name> Invalid </WRONG_CLOSING_TAG>');
```

```output
+--------------------------------------------------+
| CHECK_XML('<NAME> INVALID </WRONG_CLOSING_TAG>') |
|--------------------------------------------------|
| no opening tag for </WRONG_CLOSING_TAG>, pos 35  |
+--------------------------------------------------+
```

### Locate records with invalid XML

```sqlexample
SELECT xml_str, CHECK_XML(xml_str)
  FROM my_table
  WHERE CHECK_XML(xml_str) IS NOT NULL;
```

---
title: CHR , CHAR
source: https://docs.snowflake.com/en/sql-reference/functions/chr.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# CHR , CHAR

Converts a Unicode code point (including 7-bit ASCII) into the character that matches the input Unicode. If an invalid code point is specified, an error is returned.

CHAR is an alias for CHR.

See also:
:   [ASCII](ascii.md) , [UNICODE](unicode.md)

## Syntax

```sqlsyntax
CHR( <input> )
```

## Arguments

`input`
:   The Unicode code point for which the character is returned.

## Returns

The data type of the returned value is VARCHAR.

## Examples

This example demonstrates the function behavior for some valid Unicode code points:

> ```sqlexample
> SELECT column1, CHR(column1)
> FROM (VALUES(83), (33), (169), (8364), (0), (null));
> ```

This shows the output for the preceding query:

> ```sqlexample
> +---------+--------------+
> | COLUMN1 | CHR(COLUMN1) |
> |---------+--------------|
> |      83 | S            |
> |      33 | !            |
> |     169 | ©            |
> |    8364 | €            |
> |       0 |              |
> |    NULL | NULL         |
> +---------+--------------+
> ```

This example demonstrates the function behavior for an invalid Unicode code point:

> ```sqlexample
> SELECT column1, CHR(column1)
> FROM (VALUES(-1));
> ```

This shows the output for the preceding query:

> ```sqlexample
> FAILURE: Invalid character code -1 in the CHR input
> ```

This example demonstrates the function behavior for another invalid Unicode code point:

> ```sqlexample
> SELECT column1, CHR(column1)
> FROM (VALUES(999999999999));
> ```

This shows the output for the preceding query:

> ```sqlexample
> FAILURE: Invalid character code 999999999999 in the CHR input
> ```

---
title: CLASSIFY_TEXT (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/classify_text-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# CLASSIFY_TEXT (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_CLASSIFY](ai_classify.md) is the latest version of this function.
> You can use AI_CLASSIFY for multi-label and image classification.
> You can continue to use CLASSIFY_TEXT (SNOWFLAKE.CORTEX).

Classifies free-form text into categories that you provide.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.CLASSIFY_TEXT( <input> , <list_of_categories>, [ <options> ] )
```

## Arguments

**Required:**

`input`
:   String to classify. The input string is case sensitive. You may get different results for the same string that uses different
    capitalization.

`list_of_categories`
:   Array that represents the categories. Must contain at least two and at most 100 unique categories. Categories are case
    sensitive.

    Categories may be simple strings or SQL objects; all categories must be the same type. Using objects, you can
    provide a description and examples of each category, providing context that can help improve classification accuracy.
    It is not required to provide descriptions or examples for each category; you are free to provide a description,
    examples, both, or neither for each category.

    * `label`: The name of the category. This key is required.
    * `description`: A description of the category. Descriptions should be no longer than about 25 words (1-2 sentences) long.
      This key is optional.
    * `examples`: An array of examples that are representative of the category. Typically no more than five examples are needed,
      but there is a limit of 20 examples per category. The number of examples does not need to be the same for every category.
      This key is optional.

    > **Note:**
    >
    > Descriptions and examples count as input tokens, which increases the cost of the classification operation. Read
    > more in [Cost considerations](../../user-guide/snowflake-cortex/aisql.md).

**Optional:**

`options`
:   An object that contains optional configuration (as key/value pairs) for the classification operation. Currently, the
    only available key is:

    * `task_description`: A string containing a short explanation of the text classification task. Task descriptions should
      be no more than about 50 words (3-4 sentences) long.

## Returns

An OBJECT value (VARIANT). The object’s `label` field is a string specifying the category to which the input prompt belongs.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Usage notes

For optimal performance, follow these guidelines:

* Use plain English text for input and categories.
* Limit the amount of text that is not plain English in the input text. For example, try to limit content such as code snippets or logs
  in the text input.
* Text shouldn’t contain code or formats that are not open source (company specific languages, proprietary formats, etc.). The function
  won’t return an error, but the results may not be what you expect.
* Don’t use abbreviations, special characters, or jargon in the category labels.
* Categories should be descriptive. For example using a category such as `Xa4s3` or `category 1` won’t produce good results.
* Categories should be mutually exclusive.
* Adding a clear task description can improve accuracy when the relationship between the input text and categories is
  ambiguous or nuanced.
* Adding label descriptions can improve accuracy in cases where the descriptions are ambiguous or when specific logic
  should be followed when selecting a particular label. When writing descriptions, focus on key aspects that distinguish
  a particular label from the others.
* Each label, description, and example counts as input tokens for each record processed by a CLASSIFY_TEXT function call.
  Costs are incurred accordingly.
* Examples can help to improve accuracy.

## Examples

### Using required arguments

These examples illustrate how to use the CLASSIFY_TEXT function with only the required arguments.

The following example classifies the prompt into one of two categories, `travel` or `cooking`:

```sqlexample
SELECT SNOWFLAKE.CORTEX.CLASSIFY_TEXT('One day I will see the world', ['travel', 'cooking']);
```

```output
{
  "label": "travel"
}
```

The following example creates a table, `text_classification_table`, that contains a column for text and a column for possible
categories for that text. The CLASSIFY_TEXT function is called on each row of the table to classify the string in the text column.

```sqlexample
CREATE OR REPLACE TEMPORARY TABLE text_classification_table AS
SELECT 'France' AS input, ['North America', 'Europe', 'Asia'] AS classes
UNION ALL
SELECT 'Singapore', ['North America', 'Europe', 'Asia']
UNION ALL
SELECT 'one day I will see the world', ['travel', 'cooking', 'dancing']
UNION ALL
SELECT 'my lobster bisque is second to none', ['travel', 'cooking', 'dancing'];

SELECT input,
       classes,
       SNOWFLAKE.CORTEX.CLASSIFY_TEXT(input, classes)['label'] as classification
FROM text_classification_table;
```

### Using optional arguments

These examples illustrate how to use the CLASSIFY_TEXT function with category descriptions and examples and/or a task description.

The following example classifies the prompt into one of three categories (travel, cooking, or fitness), providing only a task description:

```sqlexample
SELECT SNOWFLAKE.CORTEX.CLASSIFY_TEXT(
  'When I am not at work, I love creating recipes using every day ingredients',
  ['travel', 'cooking', 'fitness'],
  {
    'task_description': 'Return a classification of the Hobby identified in the text'
  }
);
```

```output
{
  "label": "cooking"
}
```

The following example classifies the prompt into one of the categories, travel, cooking, or fitness using all of the options.

```sqlexample
SELECT SNOWFLAKE.CORTEX.CLASSIFY_TEXT(
  'I love running every morning before the world wakes up',
  [{
    'label': 'travel',
    'description': 'Hobbies related to going from one place to another',
    'examples': ['I like flying to Europe', 'Every summer we go to Italy' , 'I love traveling to learn new cultures']
  },{
    'label': 'cooking',
    'description': 'Hobbies related to preparing food',
    'examples': ['I like learning about new ingredients', 'You must bring your soul to the recipe' , 'Baking is my therapy']
    },{
    'label': 'fitness',
    'description': 'Hobbies related to being active and healthy',
    'examples': ['I cannot live without my Strava app', 'Running is life' , 'I go to the Gym every day']
    }],
  {'task_description': 'Return a classification of the Hobby identified in the text'})
```

```output
{
  "label": "fitness"
}
```

The following example classifies the prompt into one of three categories (travel, cooking, or fitness) using all of the
options. However, the description or examples are omitted for some categories, and the number of examples varies.

```sqlexample
SELECT SNOWFLAKE.CORTEX.CLASSIFY_TEXT(
  'I love running every morning before the world wakes up',
  [{
    'label': 'travel',
    'description': 'Hobbies related to going from one place to another',
    'examples': ['I like flying to Europe']
  },{
    'label': 'cooking',
    'examples': ['I like learning about new ingredients', 'You must bring your soul to the recipe' , 'Baking is my therapy']
    },{
    'label': 'fitness',
    'description': 'Hobbies related to being active and healthy'
    }],
  {'task_description': 'Return a classification of the Hobby identified in the text'})
```

```output
{
  "label": "fitness"
}
```

---
title: COALESCE
source: https://docs.snowflake.com/en/sql-reference/functions/coalesce.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# COALESCE

Returns the first non-NULL expression among its arguments, or NULL if
all its arguments are NULL.

## Syntax

```sqlsyntax
COALESCE( <expr1> , <expr2> [ , ... , <exprN> ] )
```

## Usage notes

* Snowflake performs [implicit conversion](../data-type-conversion.md) of arguments to make
  them compatible. For example, if one of the input expressions is a numeric type, the return type
  is also a numeric type. That is, `SELECT COALESCE('17', 1);` first converts the VARCHAR value `'17'`
  to the NUMBER value `17`, and then returns the first non-NULL value.

  When conversion isn’t possible, implicit conversion fails. For example, `SELECT COALESCE('foo', 1);`
  returns an error because the VARCHAR value `'foo'` can’t be converted to a NUMBER value.

  We recommend passing in arguments of the same type or explicitly converting arguments if needed.

* When implicit conversion converts a non-numeric value to a numeric value, the result is a value
  of type NUMBER(18,5).

  For numeric string arguments that aren’t constants, if NUMBER(18,5) isn’t sufficient to represent
  the numeric value, then [cast](../data-type-conversion.md) the argument to a type that
  can represent the value.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

The following example shows the values in three columns and then the result
when the COALESCE function is applied to the three columns:

```sqlexample
SELECT column1,
       column2,
       column3,
       COALESCE(column1, column2, column3) AS coalesce_result
  FROM (values
    (1,    2,    3   ),
    (null, 2,    3   ),
    (null, null, 3   ),
    (null, null, null),
    (1,    null, 3   ),
    (1,    null, null),
    (1,    2,    null)
  ) v;
```

```output
+---------+---------+---------+-----------------+
| COLUMN1 | COLUMN2 | COLUMN3 | COALESCE_RESULT |
|---------+---------+---------+-----------------|
|       1 |       2 |       3 |               1 |
|    NULL |       2 |       3 |               2 |
|    NULL |    NULL |       3 |               3 |
|    NULL |    NULL |    NULL |            NULL |
|       1 |    NULL |       3 |               1 |
|       1 |    NULL |    NULL |               1 |
|       1 |       2 |    NULL |               1 |
+---------+---------+---------+-----------------+
```

---
title: COLLATE
source: https://docs.snowflake.com/en/sql-reference/functions/collate.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md)

# COLLATE

Returns a copy of the original string, but with the specified `collation_specification` property instead of
the original `collation_specification` property.

This copy can be used in subsequent string comparisons, which will use the new `collation_specification`.

## Syntax

The COLLATE function can be called as a normal function:

```sqlsyntax
COLLATE(<string_expression>, '<collation_specification>')
```

The COLLATE function can be used as though it were an infix operator:

```sqlsyntax
<string_expression> COLLATE '<collation_specification>'
```

## Arguments

`string_expression`
:   The string to copy.

`collation_specification`
:   The collation to store with the copy of the string. For more information about collation
    specifiers, see [Collation specifications](../collation.md).

## Returns

Returns a copy of the original string, but with the specified
`collation_specification` property instead of the original
`collation_specification`.

## Usage notes

* Each VARCHAR contains a property that holds the collation specifier to use when comparing that VARCHAR to
  another VARCHAR. The COLLATE function copies the string, but applies the new collation specification
  rather than the original specification to the copy.

  The string itself is unchanged; only the collation specifier associated with the string is changed.
* When COLLATE is used as an infix operator, the `collation_specification` must be a constant string,
  not a general expression.

## Examples

The following examples show that calling the COLLATE function returns a copy of the string with a different
collation specification.

> **Note:**
>
> For more examples that use the COLLATE function, see [Collation examples](../collation.md).

Create a table and insert a row. The collation specification of the value in the inserted row is `es`
(Spanish).

```sqlexample
CREATE OR REPLACE TABLE collation1 (v VARCHAR COLLATE 'es');
INSERT INTO collation1 (v) VALUES ('ñ');
```

This example shows that the COLLATE function does not change the string. The copied string in the third column is
lowercase, which is the same as the original string in the first column. However, the collation specification
of the value returned by COLLATE has changed from `es` to `es-ci`.

```sqlexample
SELECT v,
       COLLATION(v),
       COLLATE(v, 'es-ci'),
       COLLATION(COLLATE(v, 'es-ci'))
  FROM collation1;
```

```output
+---+--------------+---------------------+--------------------------------+
| V | COLLATION(V) | COLLATE(V, 'ES-CI') | COLLATION(COLLATE(V, 'ES-CI')) |
|---+--------------+---------------------+--------------------------------|
| ñ | es           | ñ                   | es-ci                          |
+---+--------------+---------------------+--------------------------------+
```

This example shows that although the value returned by COLLATE is still a lowercase string, the `ci` collation
specifier is used when comparing that string to another string:

```sqlexample
SELECT v,
       v = 'ñ' AS "COMPARISON TO LOWER CASE",
       v = 'Ñ' AS "COMPARISON TO UPPER CASE",
       COLLATE(v, 'es-ci'),
       COLLATE(v, 'es-ci') = 'Ñ'
  FROM collation1;
```

```output
+---+--------------------------+--------------------------+---------------------+---------------------------+
| V | COMPARISON TO LOWER CASE | COMPARISON TO UPPER CASE | COLLATE(V, 'ES-CI') | COLLATE(V, 'ES-CI') = 'Ñ' |
|---+--------------------------+--------------------------+---------------------+---------------------------|
| ñ | True                     | False                    | ñ                   | True                      |
+---+--------------------------+--------------------------+---------------------+---------------------------+
```

This example sorts the results using German collation.

```sqlexample
SELECT *
  FROM t1
  ORDER BY COLLATE(col1 , 'de');
```

The following two queries return the same result. The first uses COLLATE as a function, while the second uses
COLLATE as an infix operator:

```sqlexample
SELECT spanish_phrase FROM collation_demo
  ORDER BY COLLATE(spanish_phrase, 'utf8');
```

```sqlexample
SELECT spanish_phrase FROM collation_demo
  ORDER BY spanish_phrase COLLATE 'utf8';
```

---
title: COLLATION
source: https://docs.snowflake.com/en/sql-reference/functions/collation.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md)

# COLLATION

Returns the collation specification of the expression.

## Syntax

```sqlsyntax
COLLATION(<expression>)
```

## Arguments

`expression`
:   The expression for which you want to know the collation specification.
    Typically, this is a column name.

## Returns

Returns a VARCHAR value that contains the collation specification of the expression.

## Examples

This example shows how to get the collation specification of a specified column.

First, create the table and insert data:

```sqlexample
CREATE OR REPLACE TABLE collation1 (v VARCHAR COLLATE 'es');
INSERT INTO collation1 (v) VALUES ('ñ');
```

Second, show the collation of the column:

```sqlexample
SELECT COLLATION(v)
  FROM collation1;
```

```output
+--------------+
| COLLATION(V) |
|--------------|
| es           |
+--------------+
```

---
title: COMPLETE (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/complete-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# COMPLETE (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_COMPLETE](ai_complete.md) is the latest version of this function.
> Use AI_COMPLETE for the latest functionality.
> You can continue to use COMPLETE (SNOWFLAKE.CORTEX).

Given a prompt, generates a response (completion) using your choice of supported language model.

> **Note:**
>
> A variant of this function allows COMPLETE to produce responses to images, including:
>
> * Comparing images
> * Captioning images
> * Classifying images
> * Extracting entities from images
> * Answering questions using data in graphs and charts

See [COMPLETE (SNOWFLAKE.CORTEX) (multimodal)](complete-snowflake-cortex-multimodal.md) for more information.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.COMPLETE(
    <model>, <prompt_or_history> [ , <options> ] )
```

## Arguments

**Required:**

`model`
:   A string specifying the model to be used. Specify one of the following values.

    * `claude-4-opus`
    * `claude-4-sonnet`
    * `claude-3-7-sonnet`
    * `claude-3-5-sonnet`
    * `deepseek-r1`
    * `llama3-8b`
    * `llama3-70b`
    * `llama3.1-8b`
    * `llama3.1-70b`
    * `llama3.1-405b`
    * `llama3.3-70b`
    * `llama4-maverick`
    * `llama4-scout`
    * `mistral-large`
    * `mistral-large2`
    * `mistral-7b`
    * `mixtral-8x7b`
    * `openai-gpt-4.1`
    * `openai-gpt-5`
    * `openai-gpt-5-chat`
    * `openai-gpt-5-mini`
    * `openai-gpt-5-nano`
    * `openai-gpt-5.1`
    * `openai-o4-mini`
    * `snowflake-arctic`
    * `snowflake-llama-3.1-405b`
    * `snowflake-llama-3.3-70b`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`prompt_or_history`
:   The prompt or conversation history to be used to generate a completion.

    If `options` is not present, the prompt given must be a string.

    If `options` is present, the argument must be an [array](../data-types-semistructured.md) of objects representing a
    conversation in chronological order. Each [object](../data-types-semistructured.md) must contain a `role` key and a
    `content` key. The `content` value is a prompt or a response, depending on the role. The role must be one of the
    following.

> | `role` value | `content` value |
> | --- | --- |
> | `'system'` | An initial plain-English prompt to the language model to provide it with background information and instructions for a response style. For example, “Respond in the style of a pirate.” The model does not generate a response to a system prompt. Only one system prompt may be provided, and if it is present, it must be the first in the array. |
> | `'user'` | A prompt provided by the user. Must follow the system prompt (if there is one) or an assistant response. |
> | `'assistant'` | A response previously provided by the language model. Must follow a user prompt. Past responses can be used to provide a stateful conversational experience; see Usage Notes. |

**Optional:**

`options`
:   An [object](../data-types-semistructured.md) containing zero or more of the following options that affect the model’s
    hyperparameters. See [LLM Settings](https://www.promptingguide.ai/introduction/settings).

    * `temperature`: A value from 0 to 1 (inclusive) that controls the randomness of the output of the language model. A
      higher temperature (for example, 0.7) results in more diverse and random output, while a lower temperature (such as
      0.2) makes the output more deterministic and focused.

      Default: 0
    * `top_p`: A value from 0 to 1 (inclusive) that controls the randomness and diversity of the language model,
      generally used as an alternative to `temperature`. The difference is that `top_p` restricts the set of possible tokens
      that the model outputs, while `temperature` influences which tokens are chosen at each step.

      Default: 0
    * `max_tokens`: Sets the maximum number of output tokens in the response. Small values can result in truncated responses.

      Default: 4096
      Maximum allowed value: 8192
    * `guardrails`: Filters potentially unsafe and harmful responses from a language model using [Cortex Guard](../../user-guide/snowflake-cortex/aisql.md).
      Either TRUE or FALSE.

      Default: FALSE
    * `response_format`: A [JSON schema](https://json-schema.org/) that the response should follow. This is a SQL
      sub-object, not a string. If `response_format` is not specified, the response is a string containing either the
      response or a serialized JSON object containing the response and information about it.

      For more information, see [AI_COMPLETE structured outputs](../../user-guide/snowflake-cortex/complete-structured-outputs.md).

    Specifying the `options` argument, even if it is an empty object (`{}`), affects how the `prompt` argument is
    interpreted and how the response is formatted.

## Returns

When the `options` argument is not specified, returns a string containing the response.

When the `options` argument is given, and this object contains the `response_format` key, returns a string
representation of a JSON object adhering to the specified JSON schema.

When the `options` argument is given, and this object *does not* contain the `response_format` key, returns a
string representation of a JSON object containing the following keys.

* `"choices"`: An array of the model’s responses. (Currently, only one response is provided.) Each response is
  an object containing a `"messages"` key whose value is the model’s response to the latest prompt.
* `"created"`: UNIX timestamp (seconds since midnight, January 1, 1970) when the response was generated.
* `"model"`: The name of the model that created the response.
* `"usage"`: An object recording the number of tokens consumed and generated by this completion. Includes
  the following sub-keys:

  + `"completion_tokens"`: The number of tokens in the generated response.
  + `"prompt_tokens"`: The number of tokens in the prompt.
  + `"total_tokens"`: The total number of tokens consumed, which is the sum of the other two values.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Usage notes

COMPLETE does not retain any state from one call to the next. To use the COMPLETE function to provide a stateful,
conversational experience, pass all previous user prompts and model responses in the conversation as part of the `prompt_or_history`
array (see [Templates for Chat Models](https://huggingface.co/docs/transformers/en/chat_templating#templates-for-chat-models)).
Keep in mind that the number of tokens processed increases for each “round,” and costs increase proportionally.

## Examples

### Single response

To generate a single response:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('snowflake-arctic', 'What are large language models?');
```

### Responses from table column

The following example generates a response from each row of a table (in this example, `content` is a column from
the `reviews` table). The `reviews` table contains a column named `review_content` containing the text of
reviews submitted by users. The query returns a critique of each review.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE(
    'claude-haiku-4-5',
        CONCAT('Critique this review in bullet points: <review>', content, '</review>')
) FROM reviews LIMIT 10;
```

> **Tip:**
>
> As shown in this example, you can use tagging in the prompt to control the kind of response generated. See
> [A guide to prompting LLaMA 2](https://replicate.com/blog/how-to-prompt-llama) for tips.

### Controlling temperature and tokens

This example illustrates the use of the function’s `options` argument to control the inference hyperparameters in a
single response. Note that in this form of the function, the prompt must be provided as an array, since this form
supports multiple prompts and responses.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE(
    'claude-sonnet-4-6 ',
    [
        {
            'role': 'user',
            'content': 'how does a snowflake get its unique pattern?'
        }
    ],
    {
        'temperature': 0.7,
        'max_tokens': 10
    }
);
```

The response is a JSON object containing the message from the language model and other information. Note that the response
is truncated as instructed in the `options` argument.

```json
{
    "choices": [
        {
            "messages": " The unique pattern on a snowflake is"
        }
    ],
    "created": 1708536426,
    "model": "deepseek-r1",
    "usage": {
        "completion_tokens": 10,
        "prompt_tokens": 22,
        "guardrail_tokens": 0,
        "total_tokens": 32
    }
}
```

### Controlling safety

This example illustrates the use of the Cortex Guard `guardrails` argument to filter unsafe and harmful responses from a language model.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE(
    'mistral-large2',
    [
        {
            'role': 'user',
            'content': <'Prompt that generates an unsafe response'>
        }
    ],
    {
        'guardrails': true
    }
);
```

The response is a JSON object, for example:

```json
{
    "choices": [
        {
            "messages": "Response filtered by Cortex Guard"
        }
    ],
    "created": 1718882934,
    "model": "mistral-7b",
    "usage": {
        "completion_tokens": 402,
        "prompt_tokens": 93,
        "guardrails _tokens": 677,
        "total_tokens": 1172
    }
}
```

### Providing a system prompt

This example illustrates the use of a system prompt to provide a sentiment analysis of movie reviews. The `prompt`
argument here is an array of objects, each having an appropriate `role` value.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE(
    'llama3.1-70b',
    [
        {'role': 'system', 'content': 'You are a helpful AI assistant. Analyze the movie review text and determine the overall sentiment. Answer with just \"Positive\", \"Negative\", or \"Neutral\"' },
        {'role': 'user', 'content': 'this was really good'}
    ], {}
    ) as response;
```

The response is a JSON object containing the response from the language model and other information.

```json
{
    "choices": [
        {
        "messages": " Positive"
        }
    ],
    "created": 1708479449,
    "model": "deepseek-r1",
    "usage": {
        "completion_tokens": 3,
        "prompt_tokens": 64,
        "total_tokens": 67
    }
}
```

## Legal notices

The following notice applies to Cortex COMPLETE Structured Output functionality only:

Use of models provided on the [Snowflake Model and Service Flow-Down Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/open-source-model-flow-down-terms/)
page are subject to the terms specified therein. The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Feature |

For the rest of COMPLETE functionality, refer to [Snowflake AI and ML](../../guides-overview-ai-features.md) for legal notices.

---
title: COMPLETE (SNOWFLAKE.CORTEX) (multimodal)
source: https://docs.snowflake.com/en/sql-reference/functions/complete-snowflake-cortex-multimodal.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# COMPLETE (SNOWFLAKE.CORTEX) (multimodal)

> **Note:**
>
> [AI_COMPLETE](ai_complete.md) is the latest version of this function.
> Use AI_COMPLETE for the latest functionality.
> You can continue to use COMPLETE (SNOWFLAKE.CORTEX).

Given an image and a prompt, generates a response (completion) using a language model. This function variant supports
image models along with text models, and processes images stored in an internal Snowflake stage or an external stage.
COMPLETE can be used to process a single image, multiple images in a batch fashion, applying the same or a different
prompt to each image, or multiple images in a single operation (for example, comparison).

## Syntax

Use one of the following:

```sqlsyntax
SNOWFLAKE.CORTEX.COMPLETE(
    '<model>', '<prompt>', <file_object>)
FROM <table>
```

```sqlsyntax
SNOWFLAKE.CORTEX.COMPLETE(
    '<model>', <prompt_object> )
FROM <table>
```

## Arguments

`model`
:   A string specifying the model to be used. Specify one of the following models:

    * `claude-4-6-sonnet`
    * `pixtral-large`

    Supported models might have different costs and context windows. New models might be added from time to time.

`prompt`
:   A string containing a question about the image and optionally specifying an output format, such as JSON. Either
    this or the `prompt_object` argument is required.

`prompt_object`
:   A SQL OBJECT containing a string prompt with numbered placeholders (`{0}`, `{1}`, and so on) and one or more text or
    FILE valuse that are inserted into the prompt. The [PROMPT](prompt.md) function is a convenient way to create an object
    with the required layout. Either this argument or `prompt` is required.

`file_object`
:   A FILE object that contains an image file to be processed. Use the [TO_FILE](to_file.md) function to
    create FILE objects from a stage path. Required when using a string prompt.

`FROM table`
:   An optional table containing image paths and an optional prompt for each image, allowing images to be batch-processed
    in a single call to COMPLETE.

## Returns

A string containing the language model’s response.

## Usage notes

* Inputs exceeding the context window limit result in an error. Output which would exceed the context window limit is truncated.
* To process multiple images, the prompt must be an object (typically created using the PROMPT function) that specifies a prompt
  template and the files to be processed.
* Only text and images are supported. Video and audio files are not supported.
* Images with filename extensions `.jpg`, `.jpeg`, `.png`, `.gif`, and `.webp` are supported. `pixtral-large` also supports `.bmp`.
* Maximum image size is 10 MB for `pixtral-large` and 3.75 MB for `claude-4-6-sonnet`. Additionally, `claude-4-6-sonnet` does not support images with a resolution greater than 8000x8000.
* The stage containing the images must have server-side encryption enabled. Client-side encrypted stages are not supported.
* The function does not support custom network policies.
* Stage names are not case-sensitive, but paths are.

## Examples

The following examples demonstrate the basic capabilities of the COMPLETE function with images.

### Visual question answering

A chart of inflation rates is used to answer a question about the data.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    'Which country will observe the largest inflation change in 2024 compared to 2023?',
    TO_FILE('@myimages', 'highest-inflation.png'));
```

Response:

```output
Looking at the data, Venezuela will experience the largest change in inflation rates between 2023 and 2024.
The inflation rate in Venezuela is projected to decrease significantly from 337.46% in 2023 to 99.98% in 2024,
representing a reduction of approximately 237.48 percentage points. This is the most dramatic change among
all countries shown in the chart, even though Zimbabwe has higher absolute inflation rates.
```

### Image classification

This example classifies the landmark identified in a single image.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    'Classify the landmark identified in this image. Respond in JSON only with the landmark name.',
    TO_FILE('@myimages', 'Seattle.jpg'));
```

Response:

```output
{"landmark": "Space Needle"}
```

### Entity extraction from an image

This example extracts the entities (objects) from an image and returns the results in JSON format.

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    'Extract the kitchen appliances identified in this image. Respond in JSON only with the identified appliances.',
    TO_FILE('@myimages', 'kitchen.png'));
```

Response:

```output
{
    "appliances": [ "microwave","electric stove","oven","refrigerator" ]
}
```

### Batch processing images from a directory or table

For batch processing of multiple images, performing the same operation on each, store the image files in the same stage.
Apply the COMPLETE function to each row of the table.

> **Note:**
>
> The stage must have a [directory table](../../user-guide/data-load-dirtables.md) to retrieve the paths to its files.

First, create the table by retrieving the image locations from the directory, convert these to FILE objects, and
storing the resulting FILE objects in a column in a table. Use SQL like the following:

```sqlexample
CREATE TABLE image_table AS
    (SELECT TO_FILE('@myimages', RELATIVE_PATH) AS img FROM DIRECTORY(@myimages));
```

Then, apply the COMPLETE function to the column containing the FILE objects. The following example classifies each image in the table:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON', img_file)) AS image_classification
FROM image_table;
```

Response:

```output
{ "classification": "Inflation Rates" }
{ "classification": "beverage refrigerator" }
{ "classification": "Space Needle" }
{ "classification": "Modern Kitchen" }
{ "classification": "Pie Chart" }
{ "classification": "Economic Graph" }
{ "classification": "Persian Cat" }
{ "classification": "Labrador Retriever" }
{ "classification": "Jedi Cat" }
{ "classification": "Sleeping cat" }
{ "classification": "Persian Cat" }
{ "classification": "Garden Costume" }
{ "classification": "Floral Fashion" }
```

If you already have a table with paths to the images, you can use the [TO_FILE function](to_file.md) to construct the FILE
objects within the query:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON',
        TO_FILE('@myimages', img_path)) AS image_classification
FROM image_table;
```

You can also retrieve the images to be processed directly from a stage’s directory, as shown here:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON',
        TO_FILE('@myimages', RELATIVE_PATH))) as image_classification
FROM DIRECTORY(@myimages);
```

### Providing images and prompts in a table

To perform a different operation on each image in a table, provide the images and their corresponding prompts in a
table. In the following example, the table contains the stage path of each image in the `img_path` column and the
prompt in the `prompt` column.

```sqlexample
SNOWFLAKE.CORTEX.COMPLETE('claude-4-6-sonnet',
    PROMPT('Given the input image {0}, {1}. Respond in JSON',
        TO_FILE('@myimages', img_path), prompt) as image_result)
FROM image_table;
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: COMPLETE_TASK_GRAPHS
source: https://docs.snowflake.com/en/sql-reference/functions/complete_task_graphs.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# COMPLETE_TASK_GRAPHS

Returns the status of a completed *graph* run. The function returns details for runs that executed
successfully, failed, or were cancelled in the past 60 minutes. A graph is currently defined as a single scheduled task or a
[task graph](../../user-guide/tasks-graphs.md) composed of a scheduled root task and one or more dependent tasks (i.e. tasks that have one or more defined predecessor tasks). For the
purposes of this function, *root task* refers to either the single scheduled task or the root task in a [task graph](../../user-guide/tasks-graphs.md).

To retrieve the details for graph runs that are currently executing, or are next scheduled to run within the next 8 days, query the
[CURRENT_TASK_GRAPHS](current_task_graphs.md) table function.

The function returns the graph run details for your entire Snowflake account or a specified root task.

## Syntax

```sqlsyntax
COMPLETE_TASK_GRAPHS(
      [ RESULT_LIMIT => <integer> ]
      [, ROOT_TASK_NAME => '<string>' ]
      [, ERROR_ONLY => { TRUE | FALSE } ] )
```

## Arguments

All the arguments are optional.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function. Note that the results are returned in descending COMPLETED_TIME
    order. If the number of matching rows is greater than the result limit, the graph executions with the most recent completed timestamp are
    returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `1000`

`ROOT_TASK_NAME => string`
:   A case-insensitive string specifying the name of the root task. Only non-qualified task names are supported. Only graph runs for the
    specified task are returned. Note that if multiple tasks have the same name, the function returns the graph runs for each of these tasks.

`ERROR_ONLY => TRUE | FALSE`
:   When set to TRUE, this function returns only graph runs that failed or were cancelled.

## Usage notes

* To view a task graph within this function, the invoking role requires at least one of the following privileges:

  + OWNERSHIP privilege on the task (that is, the task owner).
  + MONITOR or OPERATE privileges on the task.
  + The global MONITOR EXECUTION privilege.
  + The ACCOUNTADMIN role.

  The role must also have the USAGE privilege on the database and schema that store the task, otherwise the DATABASE_NAME and SCHEMA_NAME values in the output are NULL.
* This function returns a maximum of 10,000 rows, set in the `RESULT_LIMIT` argument value. The default value is `1000`. To avoid
  this limitation, use the [COMPLETE_TASK_GRAPHS view](../account-usage/complete_task_graphs.md) (Account Usage).
* When the COMPLETE_TASK_GRAPHS function is queried, its task name and result limit arguments are applied first
  followed by the WHERE and LIMIT clause, respectively, if specified. In addition, the function returns records in descending
  COMPLETED_TIME order.

  In practice, if many task graphs completed running in your account in the previous hour, the results returned by the function might not
  include an expected record, especially if the RESULT_LIMIT value is relatively low.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function
  name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_NAME | TEXT | Name of the root task. |
| DATABASE_NAME | TEXT | Name of the database that contains the graph. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the graph. |
| STATE | TEXT | State of the graph run:   * `SUCCEEDED`: All tasks in the graph ran successfully to completion, or the root task run succeeded and one or more child task runs were skipped. * `FAILED`: One or more task runs in the graph failed, or the root task run succeeded and one or more child task runs failed. * `CANCELLED`: One or more task runs in the graph were cancelled, or the root task run succeeded and one or more child task runs were cancelled.   Note that if the state of the root task run is SKIPPED, the function does not return a row for the run. |
| SCHEDULED_FROM | TEXT | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| FIRST_ERROR_TASK_NAME | TEXT | Name of the first task in the graph that returned an error; returns NULL if no task produced an error. |
| FIRST_ERROR_CODE | NUMBER | Error code of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| FIRST_ERROR_MESSAGE | TEXT | Error message of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the root task was scheduled to start. Tasks start with a brief queueing period before they begin to run. For more information, see [Task graph duration](../../user-guide/tasks-graphs.md). |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the root task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| NEXT_SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the standalone or root task (in a [task graph](../../user-guide/tasks-graphs.md)) is next scheduled to start running, assuming the current run of the standalone task or [task graph](../../user-guide/tasks-graphs.md) started at the SCHEDULED_TIME time completes in time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the last task in the [task graph](../../user-guide/tasks-graphs.md) completed. |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a [task graph](../../user-guide/tasks-graphs.md). This ID matches the ID column value in the SHOW TASKS output for the same task. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the [task graph](../../user-guide/tasks-graphs.md) that was run, or is scheduled to be run. |
| RUN_ID | NUMBER | Time when the standalone or root task in a [task graph](../../user-guide/tasks-graphs.md) is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system might reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run before retry. You can use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| CONFIG | TEXT | Displays the graph level configuration used during the graph run if explicitly set. Otherwise displays NULL. |
| GRAPH_RUN_GROUP_ID | TEXT | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Examples

Retrieve the 1000 most recent completed graph executions in the account. Note that the maximum number of rows returned by the function is
limited to 1000 by default. To change the number of rows returned, modify the RESULT_LIMIT argument value:

> ```sqlexample
> select *
>   from table(information_schema.complete_task_graphs())
>   order by scheduled_time;
> ```

Retrieve the 10 most recent completed graph runs for a specified task graph within the last hour:

> ```sqlexample
> select *
>   from table(information_schema.complete_task_graphs (
>     result_limit => 10,
>     root_task_name=>'MYTASK'));
> ```

---
title: COMPRESS
source: https://docs.snowflake.com/en/sql-reference/functions/compress.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Compression/Decompression)

# COMPRESS

Compresses the input string or binary value with a compression method.

See also:
:   [DECOMPRESS_BINARY](decompress_binary.md) , [DECOMPRESS_STRING](decompress_string.md)

## Syntax

```sqlsyntax
COMPRESS(<input>, <method>)
```

## Arguments

**Required:**

`input`
:   A `BINARY` or string value (or expression) to be compressed.

`method`
:   A string with compression method and optional compression level. Supported
    methods are:

    * `SNAPPY`.
    * `ZLIB`.
    * `ZSTD`.
    * `BZ2`.

    The compression level is specified in parentheses, for example:
    `zlib(1)`. The compression level is a non-negative integer. `0` means
    default level (same as omitting the compression level). The compression
    level is ignored if the method doesn’t support compression levels.

## Returns

A `BINARY` with compressed data.

## Usage notes

* If the compression method is unknown or invalid, the query fails.
* The compression method name (e.g. `ZLIB`) is case-insensitive.
* Not all inputs are compressible. For very short or difficult-to-compress
  input values, the output value might be the same length as, or even slightly
  longer than, the input value.

## Examples

The example below shows how to use the `COMPRESS` function with the
`SNAPPY` compression method.

The output of the function is `BINARY`, but SNOWSQL displays the output as a
string of hexadecimal characters for readability.

```sqlexample
SELECT COMPRESS('Snowflake', 'SNAPPY');
+---------------------------------+
| COMPRESS('SNOWFLAKE', 'SNAPPY') |
|---------------------------------|
| 0920536E6F77666C616B65          |
+---------------------------------+
```

---
title: CONCAT , ||
source: https://docs.snowflake.com/en/sql-reference/functions/concat.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# CONCAT , `||`

Concatenates one or more strings, or concatenates one or more binary values.

The `||` operator provides alternative syntax for CONCAT and requires at least two arguments.

See also:
:   [CONCAT_WS](concat_ws.md)

## Syntax

```sqlsyntax
CONCAT( <expr> [ , <expr> ... ] )

<expr> || <expr> [ || <expr> ... ]
```

## Arguments

`expr`
:   The input expressions must be all strings, or all binary values.

## Returns

The data type of the returned value is the same as the data type of the input values.

If any input value is NULL, the function returns NULL.

## Usage notes

Metadata functions such as [GET_DDL](get_ddl.md) accept only constants as input. Concatenated
input generates an error.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

Concatenate two strings:

```sqlexample
SELECT CONCAT('George Washington ', 'Carver');
```

```output
+----------------------------------------+
| CONCAT('GEORGE WASHINGTON ', 'CARVER') |
|----------------------------------------|
| George Washington Carver               |
+----------------------------------------+
```

Concatenate five strings, using [session variables](../session-variables.md)
for three of them:

```sqlexample
SET var_first_name = 'George';
SET var_middle_name = 'Washington';
SET var_last_name = 'Carver';

SELECT CONCAT($var_first_name, ' ', $var_middle_name, ' ', $var_last_name) AS concat_name;
```

```output
+--------------------------+
| CONCAT_NAME              |
|--------------------------|
| George Washington Carver |
+--------------------------+
```

Concatenate two VARCHAR columns. First, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE concat_function_example (s1 VARCHAR, s2 VARCHAR, s3 VARCHAR);
INSERT INTO concat_function_example (s1, s2, s3) VALUES
  ('co', 'd', 'e'),
  ('Colorado ', 'River ', NULL);
```

Run a query:

```sqlexample
SELECT CONCAT(s1, s2)
  FROM concat_function_example;
```

```output
+-----------------+
| CONCAT(S1, S2)  |
|-----------------|
| cod             |
| Colorado River  |
+-----------------+
```

Concatenate more than two strings:

```sqlexample
SELECT CONCAT(s1, s2, s3)
  FROM concat_function_example;
```

```output
+--------------------+
| CONCAT(S1, S2, S3) |
|--------------------|
| code               |
| NULL               |
+--------------------+
```

Use the [IFF](iff.md) function with the CONCAT function to concatenate strings that are
not NULL:

```sqlexample
SELECT CONCAT(
    IFF(s1 IS NULL, '', s1),
    IFF(s2 IS NULL, '', s2),
    IFF(s3 IS NULL, '', s3)) AS concat_non_null_strings
  FROM concat_function_example;
```

```output
+-------------------------+
| CONCAT_NON_NULL_STRINGS |
|-------------------------|
| code                    |
| Colorado River          |
+-------------------------+
```

Use the `||` concatenation operator instead of the function:

```sqlexample
SELECT 'This ' || 'is ' || 'another ' || 'concatenation ' || 'technique.';
```

```output
+--------------------------------------------------------------------+
| 'THIS ' || 'IS ' || 'ANOTHER ' || 'CONCATENATION ' || 'TECHNIQUE.' |
|--------------------------------------------------------------------|
| This is another concatenation technique.                           |
+--------------------------------------------------------------------+
```

---
title: CONCAT_WS
source: https://docs.snowflake.com/en/sql-reference/functions/concat_ws.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# CONCAT_WS

Concatenates two or more strings, or concatenates two or more binary values, and uses
the first argument as a delimiter between the following strings.

> **Note:**
>
> Unlike some implementations of the CONCAT_WS function, the Snowflake CONCAT_WS function
> doesn’t skip NULL values.

See also:
:   [CONCAT](concat.md)

## Syntax

```sqlsyntax
CONCAT_WS( <separator> , <expression> [ , <expression> ... ] )
```

## Arguments

`separator`
:   The separator must meet the same requirements as `expression`.

`expression`
:   The input expressions must be all strings, or all binary values.

## Returns

The function returns a VARCHAR or BINARY value that contains the 2nd through Nth arguments,
separated by the first argument.

If any argument is NULL, the function returns NULL.

The data type of the returned value is the same as the data type of the input values.

## Usage notes

* Metadata functions such as [GET_DDL](get_ddl.md) accept only constants as input. Concatenated
  input generates an error.
* CONCAT_WS puts separators between arguments, not after the last argument. If CONCAT_WS is called
  with only one argument after the separator, then no separator is appended.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

Call the CONCAT_WS function to concatenate three strings with a comma separator:

```sqlexample
SELECT CONCAT_WS(',', 'one', 'two', 'three');
```

```output
+---------------------------------------+
| CONCAT_WS(',', 'ONE', 'TWO', 'THREE') |
|---------------------------------------|
| one,two,three                         |
+---------------------------------------+
```

The following example shows that if any argument is NULL, the function returns NULL:

```sqlexample
SELECT CONCAT_WS(',', 'one', NULL, 'two');
```

```output
+------------------------------------+
| CONCAT_WS(',', 'ONE', NULL, 'TWO') |
|------------------------------------|
| NULL                               |
+------------------------------------+
```

The following example shows that when there is only one string to concatenate, the CONCAT_WS function
doesn’t append a separator:

```sqlexample
SELECT CONCAT_WS(',', 'one');
```

```output
+-----------------------+
| CONCAT_WS(',', 'ONE') |
|-----------------------|
| one                   |
+-----------------------+
```

---
title: CONDITIONAL_CHANGE_EVENT
source: https://docs.snowflake.com/en/sql-reference/functions/conditional_change_event.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (General)

# CONDITIONAL_CHANGE_EVENT

Returns a window event number for each row within a window partition when the value of the argument `expr1` in
the current row is different from the value of `expr1` in the previous row. The window event number starts
from 0 and is incremented by 1 to indicate the number of changes so far within that window.

## Syntax

```sqlsyntax
CONDITIONAL_CHANGE_EVENT( <expr1> ) OVER ( [ PARTITION BY <expr2> ] ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`expr1`
:   This is an expression that gets compared with the expression of the previous row.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the expression to order by within each partition.

## Usage notes

* The expression `CONDITIONAL_CHANGE_EVENT (expr1) OVER (window_frame)` is calculated as:

  > `CONDITIONAL_TRUE_EVENT( <expr1> != LAG(<expr1>) OVER(window_frame)) OVER(window_frame)`

  For more information about CONDITIONAL_TRUE_EVENT, see [CONDITIONAL_TRUE_EVENT](conditional_true_event.md).

## Examples

This shows how to detect the number of times that the power failed and was
turned back on (i.e. the number of times that the voltage dropped to 0 or
was restored). (This example assumes that sampling the voltage every 15
minutes is sufficient. Because power failures can last less than 15 minutes,
you’d typically want more frequent samples, or you’d want to treat the
query results as an approximation.)

Create and load the table:

```sqlexample
CREATE TABLE voltage_readings (
  site_id INTEGER,  -- which refrigerator the measurement was taken in
  ts TIMESTAMP,     -- the time at which the temperature was measured
  voltage FLOAT
  );

INSERT INTO voltage_readings (site_id, ts, voltage) VALUES
  (1, '2019-10-30 13:00:00', 120),
  (1, '2019-10-30 13:15:00', 120),
  (1, '2019-10-30 13:30:00',   0),
  (1, '2019-10-30 13:45:00',   0),
  (1, '2019-10-30 14:00:00',   0),
  (1, '2019-10-30 14:15:00',   0),
  (1, '2019-10-30 14:30:00', 120)
  ;
```

This shows the samples for which the voltage was zero, whether or not those
zero-volt events were part of the same power failure or different power failures.

```sqlexample
SELECT site_id, ts, voltage
  FROM voltage_readings
  WHERE voltage = 0
  ORDER BY ts;
```

```output
+---------+-------------------------+---------+
| SITE_ID | TS                      | VOLTAGE |
|---------+-------------------------+---------|
|       1 | 2019-10-30 13:30:00.000 |       0 |
|       1 | 2019-10-30 13:45:00.000 |       0 |
|       1 | 2019-10-30 14:00:00.000 |       0 |
|       1 | 2019-10-30 14:15:00.000 |       0 |
+---------+-------------------------+---------+
```

This shows the samples, along with a column indicating whether the voltage
changed:

```sqlexample
SELECT
    site_id,
    ts,
    voltage,
    CONDITIONAL_CHANGE_EVENT(voltage = 0) OVER (ORDER BY ts) AS power_changes
  FROM voltage_readings;
```

```output
+---------+-------------------------+---------+---------------+
| SITE_ID | TS                      | VOLTAGE | POWER_CHANGES |
|---------+-------------------------+---------+---------------|
|       1 | 2019-10-30 13:00:00.000 |     120 |             0 |
|       1 | 2019-10-30 13:15:00.000 |     120 |             0 |
|       1 | 2019-10-30 13:30:00.000 |       0 |             1 |
|       1 | 2019-10-30 13:45:00.000 |       0 |             1 |
|       1 | 2019-10-30 14:00:00.000 |       0 |             1 |
|       1 | 2019-10-30 14:15:00.000 |       0 |             1 |
|       1 | 2019-10-30 14:30:00.000 |     120 |             2 |
+---------+-------------------------+---------+---------------+
```

This shows the times that the power stopped and restarted:

```sqlexample
WITH power_change_events AS (
  SELECT
      site_id,
      ts,
      voltage,
      CONDITIONAL_CHANGE_EVENT(voltage = 0) OVER (ORDER BY ts) AS power_changes
    FROM voltage_readings
)
SELECT
    site_id,
    MIN(ts),
    voltage,
    power_changes
  FROM power_change_events
  GROUP BY site_id, power_changes, voltage
  ORDER BY 2;
```

```output
+---------+-------------------------+---------+---------------+
| SITE_ID | MIN(TS)                 | VOLTAGE | POWER_CHANGES |
|---------+-------------------------+---------+---------------|
|       1 | 2019-10-30 13:00:00.000 |     120 |             0 |
|       1 | 2019-10-30 13:30:00.000 |       0 |             1 |
|       1 | 2019-10-30 14:30:00.000 |     120 |             2 |
+---------+-------------------------+---------+---------------+
```

This shows how many times the power stopped and restarted:

```sqlexample
WITH power_change_events AS (
  SELECT
      site_id,
      CONDITIONAL_CHANGE_EVENT(voltage = 0) OVER (ORDER BY ts) AS power_changes
    FROM voltage_readings
)
SELECT MAX(power_changes)
  FROM power_change_events
  GROUP BY site_id;
```

```output
+--------------------+
| MAX(POWER_CHANGES) |
|--------------------|
|                  2 |
+--------------------+
```

This example illustrates that:

* The change number within a partition changes each time the specified value changes.
* NULL values are not considered a new or changed value.
* The change count starts over at 0 for each partition.

Create and load the table:

```sqlexample
CREATE TABLE table1 (province VARCHAR, o_col INTEGER, o2_col INTEGER);

INSERT INTO table1 (province, o_col, o2_col) VALUES
  ('Alberta', 0, 10),
  ('Alberta', 0, 10),
  ('Alberta', 13, 10),
  ('Alberta', 13, 11),
  ('Alberta', 14, 11),
  ('Alberta', 15, 12),
  ('Alberta', NULL, NULL),
  ('Manitoba', 30, 30);
```

Query the table:

```sqlexample
SELECT province, o_col,
    CONDITIONAL_CHANGE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS change_event
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+--------------+
| PROVINCE | O_COL | CHANGE_EVENT |
|----------+-------+--------------|
| Alberta  |     0 |            0 |
| Alberta  |     0 |            0 |
| Alberta  |    13 |            1 |
| Alberta  |    13 |            1 |
| Alberta  |    14 |            2 |
| Alberta  |    15 |            3 |
| Alberta  |  NULL |            3 |
| Manitoba |    30 |            0 |
+----------+-------+--------------+
```

The next example shows that:

* `expr1` can be an expression other than a column. This query uses the expression `o_col < 15`,
  and the output of the query shows when the value in o_col changes from a value less than 15 to
  a value greater than or equal to 15.
* `expr3` does not need to match `expr1`. In other words, the expression in the ORDER BY
  sub-clause of the OVER clause does not need to match the expression in the CONDITIONAL_CHANGE_EVENT function.

```sqlexample
SELECT province, o_col,
    'o_col < 15' AS condition,
    CONDITIONAL_CHANGE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS change_event,
    CONDITIONAL_CHANGE_EVENT(o_col < 15)
      OVER (PARTITION BY province ORDER BY o_col)
        AS change_event_2
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+------------+--------------+----------------+
| PROVINCE | O_COL | CONDITION  | CHANGE_EVENT | CHANGE_EVENT_2 |
|----------+-------+------------+--------------+----------------|
| Alberta  |     0 | o_col < 15 |            0 |              0 |
| Alberta  |     0 | o_col < 15 |            0 |              0 |
| Alberta  |    13 | o_col < 15 |            1 |              0 |
| Alberta  |    13 | o_col < 15 |            1 |              0 |
| Alberta  |    14 | o_col < 15 |            2 |              0 |
| Alberta  |    15 | o_col < 15 |            3 |              1 |
| Alberta  |  NULL | o_col < 15 |            3 |              1 |
| Manitoba |    30 | o_col < 15 |            0 |              0 |
+----------+-------+------------+--------------+----------------+
```

The next example compares CONDITIONAL_CHANGE_EVENT and CONDITIONAL_TRUE_EVENT:

```sqlexample
SELECT province, o_col,
    CONDITIONAL_CHANGE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS change_event,
    CONDITIONAL_TRUE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS true_event
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+--------------+------------+
| PROVINCE | O_COL | CHANGE_EVENT | TRUE_EVENT |
|----------+-------+--------------+------------|
| Alberta  |     0 |            0 |          0 |
| Alberta  |     0 |            0 |          0 |
| Alberta  |    13 |            1 |          1 |
| Alberta  |    13 |            1 |          2 |
| Alberta  |    14 |            2 |          3 |
| Alberta  |    15 |            3 |          4 |
| Alberta  |  NULL |            3 |          4 |
| Manitoba |    30 |            0 |          1 |
+----------+-------+--------------+------------+
```

This example also compares CONDITIONAL_CHANGE_EVENT and CONDITIONAL_TRUE_EVENT:

```sqlexample
CREATE TABLE borrowers (
  name VARCHAR,
  status_date DATE,
  late_balance NUMERIC(11, 2),
  thirty_day_late_balance NUMERIC(11, 2)
  );

INSERT INTO borrowers (name, status_date, late_balance, thirty_day_late_balance) VALUES
  -- Pays late frequently, but catches back up rather than falling further behind.
  ('Geoffrey Flake', '2018-01-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-02-01'::DATE, 1000.0,    0.0),
  ('Geoffrey Flake', '2018-03-01'::DATE, 2000.0, 1000.0),
  ('Geoffrey Flake', '2018-04-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-05-01'::DATE, 1000.0,    0.0),
  ('Geoffrey Flake', '2018-06-01'::DATE, 2000.0, 1000.0),
  ('Geoffrey Flake', '2018-07-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-08-01'::DATE,    0.0,    0.0),
  -- Keeps falling further behind.
  ('Cy Dismal', '2018-01-01'::DATE,    0.0,    0.0),
  ('Cy Dismal', '2018-02-01'::DATE,    0.0,    0.0),
  ('Cy Dismal', '2018-03-01'::DATE, 1000.0,    0.0),
  ('Cy Dismal', '2018-04-01'::DATE, 2000.0, 1000.0),
  ('Cy Dismal', '2018-05-01'::DATE, 3000.0, 2000.0),
  ('Cy Dismal', '2018-06-01'::DATE, 4000.0, 3000.0),
  ('Cy Dismal', '2018-07-01'::DATE, 5000.0, 4000.0),
  ('Cy Dismal', '2018-08-01'::DATE, 6000.0, 5000.0),
  -- Fell behind and isn't catching up, but isn't falling further behind.
  ('Leslie Safer', '2018-01-01'::DATE,    0.0,    0.0),
  ('Leslie Safer', '2018-02-01'::DATE,    0.0,    0.0),
  ('Leslie Safer', '2018-03-01'::DATE, 1000.0, 1000.0),
  ('Leslie Safer', '2018-04-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-05-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-06-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-07-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-08-01'::DATE, 2000.0, 1000.0),
  -- Always pays on time and in full.
  ('Ida Idyll', '2018-01-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-02-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-03-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-04-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-05-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-06-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-07-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-08-01'::DATE,    0.0,    0.0)
  ;
```

```sqlexample
SELECT name, status_date, late_balance AS "OVERDUE",
    thirty_day_late_balance AS "30 DAYS OVERDUE",
    CONDITIONAL_CHANGE_EVENT(thirty_day_late_balance)
      OVER (PARTITION BY name ORDER BY status_date) AS change_event_cnt,
    CONDITIONAL_TRUE_EVENT(thirty_day_late_balance)
      OVER (PARTITION BY name ORDER BY status_date) AS true_cnt
  FROM borrowers
  ORDER BY name, status_date;
```

```output
+----------------+-------------+---------+-----------------+------------------+----------+
| NAME           | STATUS_DATE | OVERDUE | 30 DAYS OVERDUE | CHANGE_EVENT_CNT | TRUE_CNT |
|----------------+-------------+---------+-----------------+------------------+----------|
| Cy Dismal      | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-03-01  | 1000.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-04-01  | 2000.00 |         1000.00 |                1 |        1 |
| Cy Dismal      | 2018-05-01  | 3000.00 |         2000.00 |                2 |        2 |
| Cy Dismal      | 2018-06-01  | 4000.00 |         3000.00 |                3 |        3 |
| Cy Dismal      | 2018-07-01  | 5000.00 |         4000.00 |                4 |        4 |
| Cy Dismal      | 2018-08-01  | 6000.00 |         5000.00 |                5 |        5 |
| Geoffrey Flake | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Geoffrey Flake | 2018-02-01  | 1000.00 |            0.00 |                0 |        0 |
| Geoffrey Flake | 2018-03-01  | 2000.00 |         1000.00 |                1 |        1 |
| Geoffrey Flake | 2018-04-01  |    0.00 |            0.00 |                2 |        1 |
| Geoffrey Flake | 2018-05-01  | 1000.00 |            0.00 |                2 |        1 |
| Geoffrey Flake | 2018-06-01  | 2000.00 |         1000.00 |                3 |        2 |
| Geoffrey Flake | 2018-07-01  |    0.00 |            0.00 |                4 |        2 |
| Geoffrey Flake | 2018-08-01  |    0.00 |            0.00 |                4 |        2 |
| Ida Idyll      | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-03-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-04-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-05-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-06-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-07-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-08-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-03-01  | 1000.00 |         1000.00 |                1 |        1 |
| Leslie Safer   | 2018-04-01  | 2000.00 |         1000.00 |                1 |        2 |
| Leslie Safer   | 2018-05-01  | 2000.00 |         1000.00 |                1 |        3 |
| Leslie Safer   | 2018-06-01  | 2000.00 |         1000.00 |                1 |        4 |
| Leslie Safer   | 2018-07-01  | 2000.00 |         1000.00 |                1 |        5 |
| Leslie Safer   | 2018-08-01  | 2000.00 |         1000.00 |                1 |        6 |
+----------------+-------------+---------+-----------------+------------------+----------+
```

Here is a more extensive example:

```sqlexample
CREATE OR REPLACE TABLE tbl
  (p INT, o INT, i INT, r INT, s VARCHAR(100));

INSERT INTO tbl VALUES
  (100, 1, 1, 70, 'seventy'),
  (100, 2, 2, 30, 'thirty'),
  (100, 3, 3, 40, 'fourty'),
  (100, 4, NULL, 90, 'ninety'),
  (100, 5, 5, 50, 'fifty'),
  (100, 6, 6, 30, 'thirty'),
  (200, 7, 7, 40, 'fourty'),
  (200, 8, NULL, NULL, 'n_u_l_l'),
  (200, 9, NULL, NULL, 'n_u_l_l'),
  (200, 10, 10, 20, 'twenty'),
  (200, 11, NULL, 90, 'ninety'),
  (300, 12, 12, 30, 'thirty'),
  (400, 13, NULL, 20, 'twenty');
```

```sqlexample
SELECT *
  FROM tbl
  ORDER BY p, o, i;
```

```output
+-----+----+--------+--------+---------+
|  P  | O  |   I    |   R    |    S    |
+-----+----+--------+--------+---------+
| 100 | 1  | 1      | 70     | seventy |
| 100 | 2  | 2      | 30     | thirty  |
| 100 | 3  | 3      | 40     | fourty  |
| 100 | 4  | [NULL] | 90     | ninety  |
| 100 | 5  | 5      | 50     | fifty   |
| 100 | 6  | 6      | 30     | thirty  |
| 200 | 7  | 7      | 40     | fourty  |
| 200 | 8  | [NULL] | [NULL] | n_u_l_l |
| 200 | 9  | [NULL] | [NULL] | n_u_l_l |
| 200 | 10 | 10     | 20     | twenty  |
| 200 | 11 | [NULL] | 90     | ninety  |
| 300 | 12 | 12     | 30     | thirty  |
| 400 | 13 | [NULL] | 20     | twenty  |
+-----+----+--------+--------+---------+
```

```sqlexample
SELECT p, o,
    CONDITIONAL_CHANGE_EVENT(o) OVER (PARTITION BY p ORDER BY o)
  FROM tbl
  ORDER BY p, o;
```

```output
+-----+----+--------------------------------------------------------------+
|   P |  O | CONDITIONAL_CHANGE_EVENT(O) OVER (PARTITION BY P ORDER BY O) |
|-----+----+--------------------------------------------------------------|
| 100 |  1 |                                                            0 |
| 100 |  2 |                                                            1 |
| 100 |  3 |                                                            2 |
| 100 |  4 |                                                            3 |
| 100 |  5 |                                                            4 |
| 100 |  6 |                                                            5 |
| 200 |  7 |                                                            0 |
| 200 |  8 |                                                            1 |
| 200 |  9 |                                                            2 |
| 200 | 10 |                                                            3 |
| 200 | 11 |                                                            4 |
| 300 | 12 |                                                            0 |
| 400 | 13 |                                                            0 |
+-----+----+--------------------------------------------------------------+
```

---
title: CONDITIONAL_TRUE_EVENT
source: https://docs.snowflake.com/en/sql-reference/functions/conditional_true_event.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (General)

# CONDITIONAL_TRUE_EVENT

Returns a window event number for each row within a window partition based on the result of the boolean argument
`expr1`. The number starts from 0 and is incremented by 1 for each row on which the `expr1` evaluates
to true.

One use of this function is to sessionize window partitions. For example, in click stream data, it can be used to
determine whether a user has started a new session by checking whether the last event was longer ago than a threshold.

## Syntax

```sqlsyntax
CONDITIONAL_TRUE_EVENT( <expr1> ) OVER ( [ PARTITION BY <expr2> ] ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`expr1`
:   This is a boolean expression that changes the window event number value when it evaluates true.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the expression to order by within each partition.

## Usage notes

* The conditional expression `expr1` can contain the rank-related functions LAG and LEAD, which allow us to build
  more expressive windows. If used, these functions have to use the same OVER specification as the
  CONDITIONAL_TRUE_EVENT.

## Examples

The first example illustrates that:

* The number within a partition increments each time the specified column is TRUE (non-zero in this case).
* NULL values are not considered a TRUE value.
* The number starts over at 0 for each partition.

Create and load the table:

```sqlexample
CREATE TABLE table1 (province VARCHAR, o_col INTEGER, o2_col INTEGER);

INSERT INTO table1 (province, o_col, o2_col) VALUES
  ('Alberta', 0, 10),
  ('Alberta', 0, 10),
  ('Alberta', 13, 10),
  ('Alberta', 13, 11),
  ('Alberta', 14, 11),
  ('Alberta', 15, 12),
  ('Alberta', NULL, NULL),
  ('Manitoba', 30, 30);
```

Query the table:

```sqlexample
SELECT province, o_col,
    CONDITIONAL_TRUE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS true_event
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+------------+
| PROVINCE | O_COL | TRUE_EVENT |
|----------+-------+------------|
| Alberta  |     0 |          0 |
| Alberta  |     0 |          0 |
| Alberta  |    13 |          1 |
| Alberta  |    13 |          2 |
| Alberta  |    14 |          3 |
| Alberta  |    15 |          4 |
| Alberta  |  NULL |          4 |
| Manitoba |    30 |          1 |
+----------+-------+------------+
```

The next example shows that:

* `expr1` can be an expression other than a column. This query uses the expression `o_col > 20`,
  and the output of the query shows when the value in o_col changes from a value less than or equal to 20
  to a value greater than 20.
* `expr3` does not need to match `expr1`. In other words, the expression in the ORDER BY
  sub-clause of the OVER clause does not need to match the expression in the CONDITIONAL_TRUE_EVENT function.

```sqlexample
SELECT province, o_col,
    CONDITIONAL_TRUE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS true_event,
    CONDITIONAL_TRUE_EVENT(o_col > 20)
      OVER (PARTITION BY province ORDER BY o_col)
        AS true_event_gt_20
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+------------+------------------+
| PROVINCE | O_COL | TRUE_EVENT | TRUE_EVENT_GT_20 |
|----------+-------+------------+------------------|
| Alberta  |     0 |          0 |                0 |
| Alberta  |     0 |          0 |                0 |
| Alberta  |    13 |          1 |                0 |
| Alberta  |    13 |          2 |                0 |
| Alberta  |    14 |          3 |                0 |
| Alberta  |    15 |          4 |                0 |
| Alberta  |  NULL |          4 |                0 |
| Manitoba |    30 |          1 |                1 |
+----------+-------+------------+------------------+
```

The next example compares CONDITIONAL_CHANGE_EVENT and CONDITIONAL_TRUE_EVENT:

```sqlexample
SELECT province, o_col,
    CONDITIONAL_CHANGE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS change_event,
    CONDITIONAL_TRUE_EVENT(o_col)
      OVER (PARTITION BY province ORDER BY o_col)
        AS true_event
  FROM table1
  ORDER BY province, o_col;
```

```output
+----------+-------+--------------+------------+
| PROVINCE | O_COL | CHANGE_EVENT | TRUE_EVENT |
|----------+-------+--------------+------------|
| Alberta  |     0 |            0 |          0 |
| Alberta  |     0 |            0 |          0 |
| Alberta  |    13 |            1 |          1 |
| Alberta  |    13 |            1 |          2 |
| Alberta  |    14 |            2 |          3 |
| Alberta  |    15 |            3 |          4 |
| Alberta  |  NULL |            3 |          4 |
| Manitoba |    30 |            0 |          1 |
+----------+-------+--------------+------------+
```

This example also compares CONDITIONAL_CHANGE_EVENT and CONDITIONAL_TRUE_EVENT:

```sqlexample
CREATE TABLE borrowers (
  name VARCHAR,
  status_date DATE,
  late_balance NUMERIC(11, 2),
  thirty_day_late_balance NUMERIC(11, 2)
  );

INSERT INTO borrowers (name, status_date, late_balance, thirty_day_late_balance) VALUES
  -- Pays late frequently, but catches back up rather than falling further behind.
  ('Geoffrey Flake', '2018-01-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-02-01'::DATE, 1000.0,    0.0),
  ('Geoffrey Flake', '2018-03-01'::DATE, 2000.0, 1000.0),
  ('Geoffrey Flake', '2018-04-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-05-01'::DATE, 1000.0,    0.0),
  ('Geoffrey Flake', '2018-06-01'::DATE, 2000.0, 1000.0),
  ('Geoffrey Flake', '2018-07-01'::DATE,    0.0,    0.0),
  ('Geoffrey Flake', '2018-08-01'::DATE,    0.0,    0.0),
  -- Keeps falling further behind.
  ('Cy Dismal', '2018-01-01'::DATE,    0.0,    0.0),
  ('Cy Dismal', '2018-02-01'::DATE,    0.0,    0.0),
  ('Cy Dismal', '2018-03-01'::DATE, 1000.0,    0.0),
  ('Cy Dismal', '2018-04-01'::DATE, 2000.0, 1000.0),
  ('Cy Dismal', '2018-05-01'::DATE, 3000.0, 2000.0),
  ('Cy Dismal', '2018-06-01'::DATE, 4000.0, 3000.0),
  ('Cy Dismal', '2018-07-01'::DATE, 5000.0, 4000.0),
  ('Cy Dismal', '2018-08-01'::DATE, 6000.0, 5000.0),
  -- Fell behind and isn't catching up, but isn't falling further behind.
  ('Leslie Safer', '2018-01-01'::DATE,    0.0,    0.0),
  ('Leslie Safer', '2018-02-01'::DATE,    0.0,    0.0),
  ('Leslie Safer', '2018-03-01'::DATE, 1000.0, 1000.0),
  ('Leslie Safer', '2018-04-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-05-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-06-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-07-01'::DATE, 2000.0, 1000.0),
  ('Leslie Safer', '2018-08-01'::DATE, 2000.0, 1000.0),
  -- Always pays on time and in full.
  ('Ida Idyll', '2018-01-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-02-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-03-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-04-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-05-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-06-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-07-01'::DATE,    0.0,    0.0),
  ('Ida Idyll', '2018-08-01'::DATE,    0.0,    0.0)
  ;
```

```sqlexample
SELECT name, status_date, late_balance AS "OVERDUE",
    thirty_day_late_balance AS "30 DAYS OVERDUE",
    CONDITIONAL_CHANGE_EVENT(thirty_day_late_balance)
      OVER (PARTITION BY name ORDER BY status_date) AS change_event_cnt,
    CONDITIONAL_TRUE_EVENT(thirty_day_late_balance)
      OVER (PARTITION BY name ORDER BY status_date) AS true_cnt
  FROM borrowers
  ORDER BY name, status_date;
```

```output
+----------------+-------------+---------+-----------------+------------------+----------+
| NAME           | STATUS_DATE | OVERDUE | 30 DAYS OVERDUE | CHANGE_EVENT_CNT | TRUE_CNT |
|----------------+-------------+---------+-----------------+------------------+----------|
| Cy Dismal      | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-03-01  | 1000.00 |            0.00 |                0 |        0 |
| Cy Dismal      | 2018-04-01  | 2000.00 |         1000.00 |                1 |        1 |
| Cy Dismal      | 2018-05-01  | 3000.00 |         2000.00 |                2 |        2 |
| Cy Dismal      | 2018-06-01  | 4000.00 |         3000.00 |                3 |        3 |
| Cy Dismal      | 2018-07-01  | 5000.00 |         4000.00 |                4 |        4 |
| Cy Dismal      | 2018-08-01  | 6000.00 |         5000.00 |                5 |        5 |
| Geoffrey Flake | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Geoffrey Flake | 2018-02-01  | 1000.00 |            0.00 |                0 |        0 |
| Geoffrey Flake | 2018-03-01  | 2000.00 |         1000.00 |                1 |        1 |
| Geoffrey Flake | 2018-04-01  |    0.00 |            0.00 |                2 |        1 |
| Geoffrey Flake | 2018-05-01  | 1000.00 |            0.00 |                2 |        1 |
| Geoffrey Flake | 2018-06-01  | 2000.00 |         1000.00 |                3 |        2 |
| Geoffrey Flake | 2018-07-01  |    0.00 |            0.00 |                4 |        2 |
| Geoffrey Flake | 2018-08-01  |    0.00 |            0.00 |                4 |        2 |
| Ida Idyll      | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-03-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-04-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-05-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-06-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-07-01  |    0.00 |            0.00 |                0 |        0 |
| Ida Idyll      | 2018-08-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-01-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-02-01  |    0.00 |            0.00 |                0 |        0 |
| Leslie Safer   | 2018-03-01  | 1000.00 |         1000.00 |                1 |        1 |
| Leslie Safer   | 2018-04-01  | 2000.00 |         1000.00 |                1 |        2 |
| Leslie Safer   | 2018-05-01  | 2000.00 |         1000.00 |                1 |        3 |
| Leslie Safer   | 2018-06-01  | 2000.00 |         1000.00 |                1 |        4 |
| Leslie Safer   | 2018-07-01  | 2000.00 |         1000.00 |                1 |        5 |
| Leslie Safer   | 2018-08-01  | 2000.00 |         1000.00 |                1 |        6 |
+----------------+-------------+---------+-----------------+------------------+----------+
```

Here is a more extensive example:

```sqlexample
CREATE OR REPLACE TABLE tbl
  (p INT, o INT, i INT, r INT, s VARCHAR(100));

INSERT INTO tbl VALUES
  (100, 1, 1, 70, 'seventy'),
  (100, 2, 2, 30, 'thirty'),
  (100, 3, 3, 40, 'fourty'),
  (100, 4, NULL, 90, 'ninety'),
  (100, 5, 5, 50, 'fifty'),
  (100, 6, 6, 30, 'thirty'),
  (200, 7, 7, 40, 'fourty'),
  (200, 8, NULL, NULL, 'n_u_l_l'),
  (200, 9, NULL, NULL, 'n_u_l_l'),
  (200, 10, 10, 20, 'twenty'),
  (200, 11, NULL, 90, 'ninety'),
  (300, 12, 12, 30, 'thirty'),
  (400, 13, NULL, 20, 'twenty');
```

```sqlexample
SELECT *
  FROM tbl
  ORDER BY p, o, i;
```

```output
+-----+----+--------+--------+---------+
|  P  | O  |   I    |   R    |    S    |
+-----+----+--------+--------+---------+
| 100 | 1  | 1      | 70     | seventy |
| 100 | 2  | 2      | 30     | thirty  |
| 100 | 3  | 3      | 40     | fourty  |
| 100 | 4  | [NULL] | 90     | ninety  |
| 100 | 5  | 5      | 50     | fifty   |
| 100 | 6  | 6      | 30     | thirty  |
| 200 | 7  | 7      | 40     | fourty  |
| 200 | 8  | [NULL] | [NULL] | n_u_l_l |
| 200 | 9  | [NULL] | [NULL] | n_u_l_l |
| 200 | 10 | 10     | 20     | twenty  |
| 200 | 11 | [NULL] | 90     | ninety  |
| 300 | 12 | 12     | 30     | thirty  |
| 400 | 13 | [NULL] | 20     | twenty  |
+-----+----+--------+--------+---------+
```

```sqlexample
SELECT p, o,
    CONDITIONAL_TRUE_EVENT(o > 2) OVER (PARTITION BY p ORDER BY o)
  FROM tbl
  ORDER BY p, o;
```

```output
+-----+----+--------------------------------------------------------------+
|   P |  O | CONDITIONAL_TRUE_EVENT(O>2) OVER (PARTITION BY P ORDER BY O) |
|-----+----+--------------------------------------------------------------|
| 100 |  1 |                                                            0 |
| 100 |  2 |                                                            0 |
| 100 |  3 |                                                            1 |
| 100 |  4 |                                                            2 |
| 100 |  5 |                                                            3 |
| 100 |  6 |                                                            4 |
| 200 |  7 |                                                            1 |
| 200 |  8 |                                                            2 |
| 200 |  9 |                                                            3 |
| 200 | 10 |                                                            4 |
| 200 | 11 |                                                            5 |
| 300 | 12 |                                                            1 |
| 400 | 13 |                                                            1 |
+-----+----+--------------------------------------------------------------+
```

```sqlexample
SELECT p, o,
    CONDITIONAL_TRUE_EVENT(LAG(o) OVER (PARTITION BY p ORDER BY o) > 1)
      OVER (PARTITION BY p ORDER BY o)
  FROM tbl
  ORDER BY p, o;
```

```output
+-----+----+-----------------------------------------------------------------------------------------------------+
|   P |  O | CONDITIONAL_TRUE_EVENT(LAG(O) OVER (PARTITION BY P ORDER BY O) >1) OVER (PARTITION BY P ORDER BY O) |
|-----+----+-----------------------------------------------------------------------------------------------------|
| 100 |  1 |                                                                                                   0 |
| 100 |  2 |                                                                                                   0 |
| 100 |  3 |                                                                                                   1 |
| 100 |  4 |                                                                                                   2 |
| 100 |  5 |                                                                                                   3 |
| 100 |  6 |                                                                                                   4 |
| 200 |  7 |                                                                                                   0 |
| 200 |  8 |                                                                                                   1 |
| 200 |  9 |                                                                                                   2 |
| 200 | 10 |                                                                                                   3 |
| 200 | 11 |                                                                                                   4 |
| 300 | 12 |                                                                                                   0 |
| 400 | 13 |                                                                                                   0 |
+-----+----+-----------------------------------------------------------------------------------------------------+
```

---
title: CONTAINS
source: https://docs.snowflake.com/en/sql-reference/functions/contains.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# CONTAINS

Returns true if `expr1` contains `expr2`. Both expressions must be text or binary expressions.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

## Syntax

```sqlsyntax
CONTAINS( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   The string to search in.

`expr2`
:   The string to search for.

## Returns

Returns a BOOLEAN or NULL:

* Returns TRUE if `expr2` is found inside `expr1`.
* Returns FALSE if `expr2` is not found inside `expr1`.
* Returns NULL if either input expression is NULL.

## Usage notes

For comparisons that match a string against more than one specified pattern, you can use the following functions:

* [ILIKE ANY](ilike_any.md)
* [LIKE ALL](like_all.md)
* [LIKE ANY](like_any.md)

## Collation details

The [collation specifications](../collation.md) of all input arguments must be compatible.

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

These examples use the CONTAINS function.

### Determine whether column values contain a string

Create a table with a single column that contains string values.

```sqlexample
CREATE OR REPLACE TABLE strings_test (s VARCHAR);

INSERT INTO strings_test values
  ('coffee'),
  ('ice tea'),
  ('latte'),
  ('tea'),
  (NULL);

SELECT * from strings_test;
```

```output
+---------+
| S       |
|---------|
| coffee  |
| ice tea |
| latte   |
| tea     |
| NULL    |
+---------+
```

Determine whether the values in column `s` contain the string `te`:

```sqlexample
SELECT * FROM strings_test WHERE CONTAINS(s, 'te');
```

```output
+---------+
| S       |
|---------|
| ice tea |
| latte   |
| tea     |
+---------+
```

### Use CONTAINS with collation

In the following example, CONTAINS returns different results for the same argument
values with different collation specifications.

```sqlexample
SELECT CONTAINS(COLLATE('ñ', 'en-ci-ai'), 'n'),
       CONTAINS(COLLATE('ñ', 'es-ci-ai'), 'n');
```

```output
+-----------------------------------------+-----------------------------------------+
| CONTAINS(COLLATE('Ñ', 'EN-CI-AI'), 'N') | CONTAINS(COLLATE('Ñ', 'ES-CI-AI'), 'N') |
|-----------------------------------------+-----------------------------------------|
| True                                    | False                                   |
+-----------------------------------------+-----------------------------------------+
```

---
title: CONVERT_TIMEZONE
source: https://docs.snowflake.com/en/sql-reference/functions/convert_timezone.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# CONVERT_TIMEZONE

Converts a timestamp to another time zone.

## Syntax

```sqlsyntax
CONVERT_TIMEZONE( <source_tz> , <target_tz> , <source_timestamp_ntz> )

CONVERT_TIMEZONE( <target_tz> , <source_timestamp> )
```

## Arguments

`source_tz`
:   String specifying the time zone for the input timestamp. Required for timestamps with no time zone (i.e. TIMESTAMP_NTZ).

`target_tz`
:   String specifying the time zone to which the input timestamp is converted.

`source_timestamp_ntz`
:   For the 3-argument version, string specifying the timestamp to convert (must be TIMESTAMP_NTZ).

`source_timestamp`
:   For the 2-argument version, string specifying the timestamp to convert (can be any timestamp variant, including TIMESTAMP_NTZ).

## Returns

Returns a value of type TIMESTAMP_NTZ, TIMESTAMP_TZ, or NULL:

* For the 3-argument version, returns a value of type TIMESTAMP_NTZ.
* For the 2-argument version, returns a value of type TIMESTAMP_TZ.
* If any argument is NULL, returns NULL.

## Usage notes

* The display format for timestamps in the output is determined by the
  [timestamp output format](../date-time-input-output.md) for the current
  session and the data type of the returned timestamp value.
* For the 3-argument version, the “wallclock” time in the result represents the same moment in time as the input “wallclock”
  in the input time zone, but in the target time zone.
* For the 2-argument version, the `source_timestamp` argument typically includes the time zone. If the value
  is of type TIMESTAMP_TZ, the time zone is taken from its value. Otherwise, the current session time zone is used.
* For `source_tz` and `target_tz`, you can specify a [time zone name](https://data.iana.org/time-zones/tzdb-2025b/zone1970.tab) or a [link name](https://data.iana.org/time-zones/tzdb-2025b/backward) from release
  2025b of the [IANA Time Zone Database](https://www.iana.org/time-zones) (for example, `America/Los_Angeles`, `Europe/London`, `UTC`,
  `Etc/GMT`, and so on).

  > **Note:**
  > + Time zone names are case-sensitive and must be enclosed in single quotes (e.g. `'UTC'`).
  > + Snowflake does not support the majority of timezone [abbreviations](https://en.wikipedia.org/wiki/List_of_time_zone_abbreviations) (e.g. `PDT`, `EST`, etc.) because a
  >   given abbreviation might refer to one of several different time zones. For example, `CST` might refer to Central
  >   Standard Time in North America (UTC-6), Cuba Standard Time (UTC-5), and China Standard Time (UTC+8).

## Examples

To use the default [timestamp output format](../date-time-input-output.md)
for the timestamps returned in the examples, unset the TIMESTAMP_OUTPUT_FORMAT parameter in the current session:

```sqlexample
ALTER SESSION UNSET TIMESTAMP_OUTPUT_FORMAT;
```

### Examples that specify a source time zone

The following examples use the 3-argument version of the CONVERT_TIMEZONE function and specify a `source_tz`
value. These examples return TIMESTAMP_NTZ values.

Convert a “wallclock” time in Los Angeles to the matching “wallclock” time in New York:

```sqlexample
SELECT CONVERT_TIMEZONE(
  'America/Los_Angeles',
  'America/New_York',
  '2024-01-01 14:00:00'::TIMESTAMP_NTZ
) AS conv;
```

```output
+-------------------------+
| CONV                    |
|-------------------------|
| 2024-01-01 17:00:00.000 |
+-------------------------+
```

Convert a “wallclock” time in Warsaw to the matching “wallclock” time in UTC:

```sqlexample
SELECT CONVERT_TIMEZONE(
  'Europe/Warsaw',
  'UTC',
  '2024-01-01 00:00:00'::TIMESTAMP_NTZ
) AS conv;
```

```output
+-------------------------+
| CONV                    |
|-------------------------|
| 2023-12-31 23:00:00.000 |
+-------------------------+
```

### Examples that do not specify a source time zone

The following examples use the 2-argument version of the CONVERT_TIMEZONE function. These examples return
TIMESTAMP_TZ values. Therefore, the returned values include an offset that shows the difference between
the timestamp’s time zone and Coordinated Universal Time (UTC). For example, the `America/Los_Angeles`
time zone has an offset of `-0700` to show that it is seven hours behind UTC.

Convert a string specifying a TIMESTAMP_TZ value to a different time zone:

```sqlexample
SELECT CONVERT_TIMEZONE(
  'America/Los_Angeles',
  '2024-04-05 12:00:00 +02:00'
) AS time_in_la;
```

```output
+-------------------------------+
| TIME_IN_LA                    |
|-------------------------------|
| 2024-04-05 03:00:00.000 -0700 |
+-------------------------------+
```

Show the current “wallclock” time in different time zones:

```sqlexample
SELECT
  CURRENT_TIMESTAMP() AS now_in_la,
  CONVERT_TIMEZONE('America/New_York', CURRENT_TIMESTAMP()) AS now_in_nyc,
  CONVERT_TIMEZONE('Europe/Paris', CURRENT_TIMESTAMP()) AS now_in_paris,
  CONVERT_TIMEZONE('Asia/Tokyo', CURRENT_TIMESTAMP()) AS now_in_tokyo;
```

```output
+-------------------------------+-------------------------------+-------------------------------+-------------------------------+
| NOW_IN_LA                     | NOW_IN_NYC                    | NOW_IN_PARIS                  | NOW_IN_TOKYO                  |
|-------------------------------+-------------------------------+-------------------------------+-------------------------------|
| 2024-06-12 08:52:53.114 -0700 | 2024-06-12 11:52:53.114 -0400 | 2024-06-12 17:52:53.114 +0200 | 2024-06-13 00:52:53.114 +0900 |
+-------------------------------+-------------------------------+-------------------------------+-------------------------------+
```

---
title: COPY_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/copy_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# COPY_HISTORY

This table function can be used to query Snowflake data loading history along various dimensions within the last 14 days.
The function returns load activity for both [COPY INTO <table>](../sql/copy-into-table.md) statements and
continuous data loading using [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). The table function avoids the 10,000 row limitation
of the [LOAD_HISTORY view](../info-schema/load_history.md). The results can be filtered using SQL predicates.

You can also view data loading details in Snowsight. See [Monitor data loading activity by using Copy History](../../user-guide/data-load-monitor.md).

## Syntax

```sqlsyntax
COPY_HISTORY(
      TABLE_NAME => '<string>'
       , START_TIME => <constant_expr>
      [, END_TIME => <constant_expr> ]
      [, PIPE_NAME => '<string>' ] )
```

## Arguments

**Required:**

`TABLE_NAME => 'string'`
:   A string specifying a table name.

`START_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 14 days, marking the start of the time range for retrieving load events.

**Optional:**

`END_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 14 days, marking the end of the time range for retrieving load events.

    Default: [CURRENT_TIMESTAMP](current_timestamp.md).

`PIPE_NAME => 'string'`
:   A string specifying a pipe name.

## Usage notes

* For bulk data loads, this function returns results for a role that has MONITOR privilege on your Snowflake account,
  or a role with USAGE privilege on schema and database and any privilege on table.
* For Snowpipe data loads, this function returns results for a role that has MONITOR privilege on your Snowflake account,
  or a role with USAGE privilege on schema and database that contains the pipe and any privilege on table.
  In addition, if MONITOR on pipe is not available, pipe name, pipe table name, pipe schema name and pipe catalog name are masked as NULL.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* This view returns a limit of 14 days of copy history. To avoid this limitation, use the [COPY_HISTORY view](../account-usage/copy_history.md) (Account Usage).
* The function only includes COPY INTO commands that executed to completion, with or without errors.
* Dropping or recreating a table object removes the historical data for bulk data loads (COPY INTO *<table>* statements) into the table.
* Dropping or recreating a pipe object removes the historical data for Snowpipe data loads using the pipe.

* The COPY_HISTORY view shows copy history only after the latest truncate operation on the target table. This applies to the COPY_HISTORY views before and after
  [replication](../../user-guide/account-replication-intro.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_NAME | TEXT | Name of the source file and relative path to the file. |
| STAGE_LOCATION | TEXT | Name of the stage where the source file is located. |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Date and time of when the file finished loading. |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file. |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file; `NULL` if STATUS is `Load in progress`. |
| FILE_SIZE | NUMBER | Size of the source file loaded (in bytes). |
| FIRST_ERROR_MESSAGE | TEXT | First error of the source file. |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error. |
| FIRST_ERROR_CHARACTER_POS | NUMBER | Position of the first error character. |
| FIRST_ERROR_COLUMN_NAME | TEXT | Column name of the first error. |
| ERROR_COUNT | NUMBER | Number of error rows in the source file. |
| ERROR_LIMIT | NUMBER | If the number of errors reaches this limit, then abort. |
| STATUS | TEXT | Status: `Load in progress`, `Loaded`, `Load failed`, `Partially loaded`, or `Load skipped`. |
| TABLE_CATALOG_NAME | TEXT | Name of the database in which the target table resides. |
| TABLE_SCHEMA_NAME | TEXT | Name of the schema in which the target table resides. |
| TABLE_NAME | TEXT | Name of the target table. |
| PIPE_CATALOG_NAME | TEXT | Name of the database in which the pipe resides. |
| PIPE_SCHEMA_NAME | TEXT | Name of the schema in which the pipe resides. |
| PIPE_NAME | TEXT | Name of the pipe defining the load parameters; `NULL` for COPY statement loads. |
| PIPE_RECEIVED_TIME | TIMESTAMP_LTZ | Date and time when the INSERT request for the file loaded through the pipe was received; `NULL` for COPY statement loads. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Examples

Retrieve details about all loading activity in the last hour:

> ```sqlexample
> select *
> from table(information_schema.copy_history(TABLE_NAME=>'MYTABLE', START_TIME=> DATEADD(hours, -1, CURRENT_TIMESTAMP())));
> ```

---
title: CORR
source: https://docs.snowflake.com/en/sql-reference/functions/corr.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md) (General)

# CORR

Returns the correlation coefficient for non-null pairs in a group. It is computed for non-null pairs using the following formula:

> `COVAR_POP(y, x) / (STDDEV_POP(x) * STDDEV_POP(y))`

Where `x` is the independent variable and `y` is the dependent variable.

See also:
:   [COVAR_POP](covar_pop.md) , [STDDEV_POP](stddev_pop.md)

## Syntax

Syntax when used as an aggregate function:

```sqlsyntax
CORR( y , x )
```

Syntax when used as a window function:

```sqlsyntax
CORR( y , x ) OVER ( [ PARTITION BY <expr3> ] )
```

## Usage notes

* DISTINCT is not supported for this function.
* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k int, v decimal(10,2), v2 decimal(10, 2));
INSERT INTO aggr VALUES(1, 10, NULL);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, NULL), (2, 30, 35);

SELECT * FROM aggr;
```

```output
+---+-------+-------+
| K |     V |    V2 |
|---+-------+-------|
| 1 | 10.00 |  NULL |
| 2 | 10.00 | 11.00 |
| 2 | 20.00 | 22.00 |
| 2 | 25.00 |  NULL |
| 2 | 30.00 | 35.00 |
+---+-------+-------+
```

```sqlexample
SELECT k, CORR(v, v2) FROM aggr GROUP BY k;
```

```output
+---+--------------+
| K |  CORR(V, V2) |
|---+--------------|
| 1 |         NULL |
| 2 | 0.9988445981 |
+---+--------------+
```

---
title: CORTEX_SEARCH_DATA_SCAN
source: https://docs.snowflake.com/en/sql-reference/functions/cortex_search_data_scan.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# CORTEX_SEARCH_DATA_SCAN

This table function returns the data indexed by a [Cortex Search service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md),
including the columns defined in the source query and the computed vector embeddings for the search column.

## Syntax

```sqlsyntax
CORTEX_SEARCH_DATA_SCAN(
      SERVICE_NAME => '<string>' )
```

## Arguments

**Required:**

`SERVICE_NAME => 'string'`
:   The name of a Cortex Search service.

    You can specify any of the following:

    * Unqualified name (`service_name`)
    * Partially qualified name (`schema_name.service_name`)
    * Fully qualified name (`database_name.schema_name.service_name`)

    For more information on object name resolution, refer to [Object Name Resolution](../name-resolution.md).

## Output

The function returns all the columns specified in the source query and the embeddings for the search column. The embedding column is of [VECTOR data type](../data-types-vector.md) and is named `_GENERATED_EMBEDDINGS_{MODEL_NAME}`.

The order of the columns is the same as the order of the columns in the source query with the embedding column appended at the end.

## Usage notes

* Requires OPERATE privilege for Cortex Search. Refer to [Access control privileges](../../user-guide/security-access-control-privileges.md) for more details.

## Examples

Suppose you have a Cortex Search service named `transcript_search_service` defined as follows:

```sqlexample
CREATE OR REPLACE CORTEX SEARCH SERVICE transcript_search_service
  ON transcript_text
  ATTRIBUTES region
  WAREHOUSE = cortex_search_wh
  TARGET_LAG = '1 day'
  AS (
    SELECT
        transcript_text,
        region,
        agent_id,
    FROM support_transcripts
);
```

For instructions about creating a Cortex Search service, see [Cortex Search Overview](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

You can use the table function to retrieve the contents for the Cortex Search service `transcript_search_service`:

```sqlexample
SELECT
  *
FROM
  TABLE (
    CORTEX_SEARCH_DATA_SCAN (
      SERVICE_NAME => 'transcript_search_service'
    )
  );
```

```output
+ ---------------------------------------------------------- + --------------- + -------- + ------------------------------ +
|                      transcript_text                       |     region      | agent_id | _GENERATED_EMBEDDINGS_MY_MODEL |
| ---------------------------------------------------------- | --------------- | -------- | ------------------------------ |
| 'My internet has been down since yesterday, can you help?' | 'North America' | 'AG1001' | [0.1, 0.2, 0.3, 0.4]           |
| 'I was overcharged for my last bill, need an explanation.' | 'Europe'        | 'AG1002' | [0.1, 0.2, 0.3, 0.4]           |
+ ---------------------------------------------------------- + --------------- + -------- + ------------------------------ +
```

---
title: CORTEX_SEARCH_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/cortex_search_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# CORTEX_SEARCH_REFRESH_HISTORY

This table function returns information about each refresh (completed and running) of [Cortex Search services](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

This table function returns all refreshes that are in progress as well as all refreshes that have a DATA_TIMESTAMP within 7 days
of the current time.

## Syntax

```sqlsyntax
CORTEX_SEARCH_REFRESH_HISTORY(
  [ NAME => '<string>' ]
  [ , DATA_TIMESTAMP_START => <constant_expr> ]
  [ , DATA_TIMESTAMP_END => <constant_expr> ]
  [ , RESULT_LIMIT => <integer> ]
)
```

## Arguments

All the arguments are optional.
If no arguments are provided, 100 refreshes from all Cortex Search services in the account will be returned.

`NAME => string`
:   The name of a Cortex Search service.

    Names must be single-quoted and are case insensitive.

    You can specify the unqualified name (`service_name`),
    the partially qualified name (`schema_name.service_name`),
    or the fully qualified name (`database_name.schema_name.service_name`).

    For more information on object name resolution, refer to [Object name resolution](../name-resolution.md).

    The function returns the refreshes for this service.

`DATA_TIMESTAMP_START => constant_expr` , . `DATA_TIMESTAMP_END => constant_expr`
:   Time range (in TIMESTAMP_LTZ format) during which the refreshes occurred.

    * If neither a start time nor an end time is specified, the default range will be the past day.
    * If an end time is not specified, [CURRENT_TIMESTAMP](current_timestamp.md) is used as the end of the range.
    * If a start time is not specified, the range starts 1 day prior to the start of DATA_TIMESTAMP_END.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function. If the number of matching rows is greater than
    this limit, the refreshes that finished most recently (and those that are still running) are returned, up to the specified
    limit.

    To apply a filter on the results, also specify a large enough RESULT_LIMIT limit value for the filter to be applied on all
    Cortex Search services.

    Range: `1` to `10000`

    Default: `100`.

## Output

The function returns the following columns.

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the Cortex Search service. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the Cortex Search service. |
| DATABASE_NAME | TEXT | Name of the database that contains the Cortex Search service. |
| STATE | TEXT | Status of the refresh for the Cortex Search service. The status can be one of the following:   * EXECUTING: refresh in progress. * SUCCEEDED: refresh completed successfully. * FAILED: refresh failed during execution. * CANCELLED: refresh was canceled before execution. |
| DATA_TIMESTAMP | TIMESTAMP_LTZ | Transactional timestamp when the refresh was evaluated. (This might be slightly before the actual time of the refresh.) All data, in base objects, that arrived before this timestamp is currently included in the Cortex Search service. |
| REFRESH_START_TIME | TIMESTAMP_LTZ | Time when the refresh job started. |
| REFRESH_END_TIME | TIMESTAMP_LTZ | Time when the refresh completed. |
| INDEX_PREPROCESSING_DURATION | NUMBER | Duration of the index preprocessing phase in milliseconds. |
| INDEX_PREPROCESSING_QUERY_ID | TEXT | ID of the query that performed the index preprocessing. |
| INDEX_PREPROCESSING_STATISTICS | OBJECT | Contains the following properties for index preprocessing:   * `compilationTimeMs`: Time spent compiling the query in milliseconds. * `executionTimeMs`: Time spent executing the query in milliseconds. * `queuedTimeMs`: Time spent queued before execution in milliseconds. * `numInsertedRows`: The number of inserted rows. * `numDeletedRows`: The number of rows that were deleted. * `numCopiedRows`: The number of rows that were copied unchanged. * `numAddedPartitions`: The number of added partitions. * `numRemovedPartitions`: The number of removed partitions. |
| INDEXING_DURATION | NUMBER | Duration of the indexing phase in milliseconds. |
| INDEXING_QUERY_ID | TEXT | ID of the query that performed the indexing. |
| REFRESH_ACTION | TEXT | One of:   * NO_DATA - no new data in base tables. * FULL - full refresh of the Cortex Search service. * INCREMENTAL - incremental refresh of the Cortex Search service. |
| REFRESH_TRIGGER | TEXT | One of:   * SCHEDULED - normal background refresh to keep the service up to date. * MANUAL - user manually triggered refresh using ALTER CORTEX SEARCH SERVICE. * CREATION - refresh performed during the creation DDL statement. |
| TARGET_LAG_SEC | NUMBER | Describes the target lag value for the Cortex Search service at the time the refresh occurred. |
| WAREHOUSE | TEXT | Name of the warehouse used for the refresh operation. |
| ERROR | TEXT | Error message if the refresh failed, otherwise NULL. |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Find failed Cortex Search service refreshes during the past week:

```sqlexample
SELECT
  data_timestamp,
  database_name,
  schema_name,
  name,
  state,
  error,
  refresh_trigger
FROM
  TABLE (
    INFORMATION_SCHEMA.CORTEX_SEARCH_REFRESH_HISTORY (
      DATA_TIMESTAMP_START => DATEADD(WEEK, -1, CURRENT_TIMESTAMP())
    )
  )
ORDER BY
  data_timestamp DESC
LIMIT 10;
```

Find recent manual refreshes for a specific Cortex Search service:

```sqlexample
SELECT
  data_timestamp,
  refresh_start_time,
  refresh_end_time,
  refresh_action,
  state
FROM
  TABLE (
    INFORMATION_SCHEMA.CORTEX_SEARCH_REFRESH_HISTORY (
      NAME => 'MYSVC',
      DATA_TIMESTAMP_START => DATEADD(DAY, -7, CURRENT_TIMESTAMP()),
      RESULT_LIMIT => 20
    )
  )
WHERE
  refresh_trigger = 'MANUAL'
ORDER BY
  data_timestamp DESC;
```

Analyze refresh performance for a Cortex Search service:

```sqlexample
SELECT
  name,
  data_timestamp,
  index_preprocessing_duration,
  indexing_duration,
  TIMEDIFF(SECOND, refresh_start_time, refresh_end_time) AS total_refresh_duration_sec,
  index_preprocessing_statistics:numInsertedRows AS rows_processed
FROM
  TABLE (
    INFORMATION_SCHEMA.CORTEX_SEARCH_REFRESH_HISTORY (
      NAME => 'MYSVC',
      DATA_TIMESTAMP_START => DATEADD(DAY, -30, CURRENT_TIMESTAMP())
    )
  )
WHERE
  state = 'SUCCEEDED'
ORDER BY
  data_timestamp DESC;
```

---
title: COS
source: https://docs.snowflake.com/en/sql-reference/functions/cos.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# COS

Computes the cosine of its argument; the argument should be expressed in
radians.

## Syntax

```sqlsyntax
COS( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The value must be in
    radians, not degrees. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT COS(0), COS(PI()/3), COS(RADIANS(90));
```

```output
+--------+-------------+------------------+
| COS(0) | COS(PI()/3) | COS(RADIANS(90)) |
|--------+-------------+------------------|
|      1 |         0.5 |  6.123233996e-17 |
+--------+-------------+------------------+
```

---
title: COSH
source: https://docs.snowflake.com/en/sql-reference/functions/cosh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# COSH

Computes the hyperbolic cosine of its argument.

## Syntax

```sqlsyntax
COSH( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT COSH(1.5);
```

```output
+-------------+
|   COSH(1.5) |
|-------------|
| 2.352409615 |
+-------------+
```

---
title: COT
source: https://docs.snowflake.com/en/sql-reference/functions/cot.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# COT

Computes the cotangent of its argument; the argument should be expressed in
radians.

## Syntax

```sqlsyntax
COT( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT COT(0), COT(PI()/3), COT(RADIANS(90));
```

```output
+--------+--------------+------------------+
| COT(0) |  COT(PI()/3) | COT(RADIANS(90)) |
|--------+--------------+------------------|
|    inf | 0.5773502692 |  6.123233996e-17 |
+--------+--------------+------------------+
```

---
title: COUNT
source: https://docs.snowflake.com/en/sql-reference/functions/count.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# COUNT

Returns either the number of non-NULL records for the specified columns, or the total number of records.

See also:
:   [COUNT_IF](count_if.md), [MAX](max.md), [MIN](min.md) , [SUM](sum.md)

## Syntax

**Aggregate function**

```sqlsyntax
COUNT( [ DISTINCT ] <expr1> [ , <expr2> ... ] )

COUNT(*)

COUNT(<alias>.*)
```

**Window function**

```sqlsyntax
COUNT( [ DISTINCT ] <expr1> [ , <expr2> ... ] ) OVER (
                                                     [ PARTITION BY <expr3> ]
                                                     [ ORDER BY <expr4> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                                     )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   A column name, which can be a qualified name (for example, database.schema.table.column_name).

`expr2`
:   You can include additional column name(s) if you wish. For example, you
    could count the number of distinct combinations of last name and first name.

`expr3`
:   The column to partition on, if you want the result to be split into multiple
    windows.

`expr4`
:   The column to order each window on. Note that this is separate from any
    ORDER BY clause to order the final result set.

`*`
:   Returns the total number of records.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    If you specify an unqualified and unfiltered wildcard (`*`), the function returns the total number of records, including
    records with NULL values.

    If you specify a wildcard with the ILIKE or EXCLUDE keyword for filtering, the function excludes records with NULL values.

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

`alias.*`
:   Returns the number of records that don’t contain any NULL values. For an example, see Examples.

## Returns

Returns a value of type NUMBER.

## Usage notes

* This function treats [JSON null](../../user-guide/semistructured-considerations.md) (VARIANT NULL) as SQL NULL.
* For more information about NULL values and aggregate functions, see
  [Aggregate functions and NULL values](../functions-aggregation.md).
* When this function is called as an aggregate function:

  + If the `DISTINCT` keyword is used, it applies to all columns. For example,
    `DISTINCT col1, col2, col3` means to return the number of different
    combinations of columns `col1`, `col2`, and `col3`. For example, assume the data is:

    ```sqlexample
    1, 1, 1
    1, 1, 1
    1, 1, 1
    1, 1, 2
    ```

    In this case, the function returns `2`, because that’s the number of distinct combinations of values in the three columns.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

* To return the number of rows that match a condition, use [COUNT_IF](count_if.md).
* When possible, use the COUNT function on tables and views without a [row access policy](../../user-guide/security-row-intro.md).
  The query with this function is faster and more accurate on tables or views without a row access policy. The reasons for the performance
  difference include:

  + Snowflake maintains statistics on tables and views, and this optimization allows simple queries to run faster.
  + When a row access policy is set on a table or view and the COUNT function is used in a query, Snowflake must scan each row and
    determine whether the user is allowed to view the row.

## Examples

The following examples use the COUNT function on data with NULL values.

Create a table and insert values:

```sqlexample
CREATE TABLE basic_example (i_col INTEGER, j_col INTEGER);

INSERT INTO basic_example VALUES
  (11, 101), (11, 102), (11, NULL), (12, 101), (NULL, 101), (NULL, 102);
```

Query the table:

```sqlexample
SELECT *
  FROM basic_example
  ORDER BY i_col;
```

```output
+-------+-------+
| I_COL | J_COL |
|-------+-------|
|    11 |   101 |
|    11 |   102 |
|    11 |  NULL |
|    12 |   101 |
|  NULL |   101 |
|  NULL |   102 |
+-------+-------+
```

```sqlexample
SELECT COUNT(*) AS "All",
    COUNT(* ILIKE 'i_c%') AS "ILIKE",
    COUNT(* EXCLUDE i_col) AS "EXCLUDE",
    COUNT(i_col) AS "i_col",
    COUNT(DISTINCT i_col) AS "DISTINCT i_col",
    COUNT(j_col) AS "j_col",
    COUNT(DISTINCT j_col) AS "DISTINCT j_col"
  FROM basic_example;
```

```output
+-----+-------+---------+-------+----------------+-------+----------------+
| All | ILIKE | EXCLUDE | i_col | DISTINCT i_col | j_col | DISTINCT j_col |
|-----+-------+---------+-------+----------------+-------+----------------|
|   6 |     4 |       5 |     4 |              2 |     5 |              2 |
+-----+-------+---------+-------+----------------+-------+----------------+
```

The `All` column in this output shows that when an unqualified and unfiltered wildcard is specified
for COUNT, the function returns the total number of rows in the table, including rows with NULL values. The other
columns in the output show that when a column or a wildcard with filtering is specified, the function excludes
rows with NULL values.

The next query uses the COUNT function with the GROUP BY clause:

```sqlexample
SELECT i_col, COUNT(*), COUNT(j_col)
  FROM basic_example
  GROUP BY i_col
  ORDER BY i_col;
```

```output
+-------+----------+--------------+
| I_COL | COUNT(*) | COUNT(J_COL) |
|-------+----------+--------------|
|    11 |        3 |            2 |
|    12 |        1 |            1 |
|  NULL |        2 |            2 |
+-------+----------+--------------+
```

The following example shows that `COUNT(alias.*)` returns the number of rows that don’t contain any NULL values.
The `basic_example` table has a total of six rows, but three rows have at least one NULL value, and the other three rows
have no NULL values.

```sqlexample
SELECT COUNT(n.*) FROM basic_example AS n;
```

```output
+------------+
| COUNT(N.*) |
|------------|
|          3 |
+------------+
```

The following example shows that [JSON null](../../user-guide/semistructured-considerations.md) (VARIANT NULL) is treated as SQL NULL by
the COUNT function.

Create the table and insert data that contains both SQL NULL and JSON null values:

```sqlexample
CREATE OR REPLACE TABLE count_example_with_variant_column (
  i_col INTEGER,
  j_col INTEGER,
  v VARIANT);
```

```sqlexample
BEGIN WORK;

INSERT INTO count_example_with_variant_column (i_col, j_col, v)
  VALUES (NULL, 10, NULL);
INSERT INTO count_example_with_variant_column (i_col, j_col, v)
  SELECT 1, 11, PARSE_JSON('{"Title": null}');
INSERT INTO count_example_with_variant_column (i_col, j_col, v)
  SELECT 2, 12, PARSE_JSON('{"Title": "O"}');
INSERT INTO count_example_with_variant_column (i_col, j_col, v)
  SELECT 3, 12, PARSE_JSON('{"Title": "I"}');

COMMIT WORK;
```

In this SQL code, note the following:

* The first INSERT INTO statement inserts a SQL NULL for both a VARIANT column and a non-VARIANT column.
* The second INSERT INTO statement inserts a JSON null (VARIANT NULL).
* The last two INSERT INTO statements insert non-NULL VARIANT values.

Show the data:

```sqlexample
SELECT i_col, j_col, v, v:Title
  FROM count_example_with_variant_column
  ORDER BY i_col;
```

```output
+-------+-------+-----------------+---------+
| I_COL | J_COL | V               | V:TITLE |
|-------+-------+-----------------+---------|
|     1 |    11 | {               | null    |
|       |       |   "Title": null |         |
|       |       | }               |         |
|     2 |    12 | {               | "O"     |
|       |       |   "Title": "O"  |         |
|       |       | }               |         |
|     3 |    12 | {               | "I"     |
|       |       |   "Title": "I"  |         |
|       |       | }               |         |
|  NULL |    10 | NULL            | NULL    |
+-------+-------+-----------------+---------+
```

Show that the COUNT function treats both the NULL and the JSON null (VARIANT NULL) values
as NULLs. There are four rows in the table. One has a SQL NULL and the other has a
JSON null. Both those rows are excluded from the count, so the count is `2`.

```sqlexample
SELECT COUNT(v:Title)
  FROM count_example_with_variant_column;
```

```output
+----------------+
| COUNT(V:TITLE) |
|----------------|
|              2 |
+----------------+
```

---
title: COUNT_IF
source: https://docs.snowflake.com/en/sql-reference/functions/count_if.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# COUNT_IF

Returns the number of records that satisfy a condition or NULL if no records satisfy the condition.

See also:
:   [COUNT](count.md)

## Syntax

**Aggregate function**

```sqlsyntax
COUNT_IF( <condition> )
```

**Window function**

```sqlsyntax
COUNT_IF( <condition> )
    OVER ( [ PARTITION BY <expr1> ] [ ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`condition`
:   The condition is an expression that should evaluate to a BOOLEAN value (True, False, or NULL)

`expr1`
:   The column to partition on, if you want the result to be split into multiple
    windows.

`expr2`
:   The column to order each window on. Note that this is separate from the ORDER BY clause that sorts the final result set.

## Returns

If the function does not return NULL, the data type of the returned value is NUMBER.

## Usage notes

* When this function is called as a window function with an ORDER BY clause, you must specify a window frame. If
  you do not specify a window frame, the following default frame is used:

  `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

  For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).

## Examples

The examples in this section demonstrate how to use the `COUNT_IF` function.

The following statements set up a table for use in the examples:

```sqlexample
CREATE TABLE basic_example (i_col INTEGER, j_col INTEGER);

INSERT INTO basic_example VALUES
  (11, 101), (11, 102), (11, NULL), (12, 101), (NULL, 101), (NULL, 102);
```

```sqlexample
SELECT *
  FROM basic_example
  ORDER BY i_col;
```

```output
+-------+-------+
| I_COL | J_COL |
|-------+-------|
|    11 |   101 |
|    11 |   102 |
|    11 |  NULL |
|    12 |   101 |
|  NULL |   101 |
|  NULL |   102 |
+-------+-------+
```

The following example passes in `TRUE` for the condition, which returns the count of all rows in the table:

```sqlexample
SELECT COUNT_IF(TRUE)
  FROM basic_example;
```

```output
+----------------+
| COUNT_IF(TRUE) |
|----------------|
|              6 |
+----------------+
```

The following example returns the number of rows where the value in `J_COL` is greater than the value in `I_COL`:

```sqlexample
SELECT COUNT_IF(j_col > i_col)
  FROM basic_example;
```

```output
+-------------------------+
| COUNT_IF(J_COL > I_COL) |
|-------------------------|
|                       3 |
+-------------------------+
```

Note that in the example above, the count does not include rows with NULL values. As explained in
[Ternary logic](../ternary-logic.md), when any operand for a comparison operator is NULL, the result is NULL, which does not
satisfy the condition specified by `COUNT_IF`.

The following example returns the number of rows that do not contain any NULL values.

```sqlexample
SELECT COUNT_IF(i_col IS NOT NULL AND j_col IS NOT NULL)
  FROM basic_example;
```

```output
+---------------------------------------------------+
| COUNT_IF(I_COL IS NOT NULL AND J_COL IS NOT NULL) |
|---------------------------------------------------|
|                                                 3 |
+---------------------------------------------------+
```

---
title: COUNT_TOKENS (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/count_tokens-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# COUNT_TOKENS (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_COUNT_TOKENS](ai_count_tokens.md) is the latest version of this function.
> Use AI_COUNT_TOKENS for the latest functionality.
> You can continue to use COUNT_TOKENS (SNOWFLAKE.CORTEX).

Returns the number of tokens in a prompt for the large language model or the task-specific function specified in the argument. This
function does not support fine-tuned models.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.COUNT_TOKENS( <model_name> , <input_text> )
```

## Arguments

**Required:**

`model_name`
:   Name of the model you want to base the token count on. Specify one of the following values:

    * `deepseek-r1`
    * `e5-base-v2`
    * `e5-large-v2`
    * `llama3-70b`
    * `llama3-8b`
    * `llama3.1-405b`
    * `llama3.1-70b`
    * `llama3.1-8b`
    * `llama3.3-70b`
    * `llama4-maverick`
    * `llama4-scout`
    * `mistral-7b`
    * `mistral-large`
    * `mistral-large2`
    * `mixtral-8x7b`
    * `nv-embed-qa-4`
    * `snowflake-arctic-embed-l-v2.0`
    * `snowflake-arctic-embed-m-v1.5`
    * `snowflake-arctic-embed-m`
    * `snowflake-arctic`
    * `snowflake-llama-3.1-405b`
    * `snowflake-llama-3.3-70b`
    * `voyage-multilingual-2`

`input_text`
:   Input text to count the tokens in.

## Returns

Returns an [INT , INTEGER , BIGINT , SMALLINT , TINYINT , BYTEINT](../data-types-numeric.md) type that is the number of tokens in the input text based on the model or function specified.

## Usage notes

* If a function name is specified, the token count is based on the model used by the function.
* Use lowercase letters in function names.

> **Note:**
>
> COUNT_TOKENS does not account for the managed system prompt that is automatically added to the beginning of the input
> text when using a Cortex [Cortex AI functions](../../user-guide/snowflake-cortex/aisql.md). As a result, the value
> returned by COUNT_TOKENS is lower than the actual number of tokens processed by these functions.

## Examples

The following example returns the token count for the specified prompt using the `llama3.1-70b` model:

```sqlexample
SELECT SNOWFLAKE.CORTEX.COUNT_TOKENS( 'llama3.1-70b', 'what is a large language model?' );
```

```output
+---+
| 6 |
+---+
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: COVAR_POP
source: https://docs.snowflake.com/en/sql-reference/functions/covar_pop.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md) (General)

# COVAR_POP

Returns the population covariance for non-null pairs in a group. It is computed for non-null pairs using the following formula:

> `(SUM(x*y) - SUM(x) * SUM(y) / COUNT(*)) / COUNT(*)`

Where `x` is the independent variable and `y` is the dependent variable.

See also:
:   [COVAR_SAMP](covar_samp.md) , [COUNT](count.md) , [SUM](sum.md)

## Syntax

**Aggregate function**

```sqlsyntax
COVAR_POP( y , x )
```

**Window function**

```sqlsyntax
COVAR_POP( y , x ) OVER ( [ PARTITION BY <expr1> ] )
```

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k int, v decimal(10,2), v2 decimal(10, 2));
INSERT INTO aggr VALUES(1, 10, NULL);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, NULL), (2, 30, 35);

SELECT * FROM aggr;
```

```output
+---+-------+-------+
| K |     V |    V2 |
|---+-------+-------|
| 1 | 10.00 |  NULL |
| 2 | 10.00 | 11.00 |
| 2 | 20.00 | 22.00 |
| 2 | 25.00 |  NULL |
| 2 | 30.00 | 35.00 |
+---+-------+-------+
```

```sqlexample
SELECT k, COVAR_POP(v, v2) FROM aggr GROUP BY k;
```

```output
+---+------------------+
| K | COVAR_POP(V, V2) |
|---+------------------|
| 1 |             NULL |
| 2 |               80 |
+---+------------------+
```

---
title: COVAR_SAMP
source: https://docs.snowflake.com/en/sql-reference/functions/covar_samp.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md) (General)

# COVAR_SAMP

Returns the sample covariance for non-null pairs in a group. It is computed for non-null pairs using the following formula:

> `(SUM(x*y) - SUM(x) * SUM(y) / COUNT(*)) / (COUNT(*) - 1)`

Where `x` is the independent variable and `y` is the dependent variable.

See also:
:   [COVAR_POP](covar_pop.md) , [COUNT](count.md) , [SUM](sum.md)

## Syntax

**Aggregate function**

```sqlsyntax
COVAR_SAMP( y , x )
```

**Window function**

```sqlsyntax
COVAR_SAMP( y , x ) OVER ( [ PARTITION BY <expr1> ] )
```

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k int, v decimal(10,2), v2 decimal(10, 2));
INSERT INTO aggr VALUES(1, 10, NULL);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, NULL), (2, 30, 35);

SELECT k, COVAR_SAMP(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-------------------+
| K | COVAR_SAMP(V, V2) |
|---+-------------------|
| 1 |              NULL |
| 2 |               120 |
+---+-------------------+
```

---
title: CUME_DIST
source: https://docs.snowflake.com/en/sql-reference/functions/cume_dist.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (Ranking)

# CUME_DIST

Finds the cumulative distribution of a value with regard to other values within the same window partition.

## Syntax

```sqlsyntax
CUME_DIST() OVER ( [ PARTITION BY <partition_expr> ]
  ORDER BY <order_expr> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`partition_expr`
:   This is the optional expression to use to group rows into partitions.

`order_expr`
:   This expression specifies the order of the rows within each partition.

## Returns

The data type of the returned value is DOUBLE.

## Usage notes

The CUME_DIST function does not support explicit window frames.

## Examples

```sqlexample
SELECT
    symbol,
    exchange,
    CUME_DIST() OVER (PARTITION BY exchange ORDER BY price) AS cume_dist
  FROM trades;
```

```output
+------+--------+------------+
|symbol|exchange|CUME_DIST   |
+------+--------+------------+
|SPY   |C       |0.3333333333|
|AAPL  |C       |         1.0|
|AAPL  |C       |         1.0|
|YHOO  |N       |0.1666666667|
|QQQ   |N       |         0.5|
|QQQ   |N       |         0.5|
|SPY   |N       |0.8333333333|
|SPY   |N       |0.8333333333|
|AAPL  |N       |         1.0|
|YHOO  |Q       |0.3333333333|
|YHOO  |Q       |0.3333333333|
|MSFT  |Q       |0.6666666667|
|MSFT  |Q       |0.6666666667|
|QQQ   |Q       |         1.0|
|QQQ   |Q       |         1.0|
|YHOO  |P       |         0.2|
|MSFT  |P       |         0.6|
|MSFT  |P       |         0.6|
|SPY   |P       |         0.8|
|AAPL  |P       |         1.0|
+------+--------+------------+
```

---
title: CUMULATIVE_PRIVACY_LOSSES
source: https://docs.snowflake.com/en/sql-reference/functions/cumulative_privacy_losses.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# CUMULATIVE_PRIVACY_LOSSES

Returns the privacy budgets associated with a specific [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md).

For more information about viewing privacy budgets, see [View a privacy budget](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md).

## Syntax

```sqlsyntax
SNOWFLAKE.DATA_PRIVACY.CUMULATIVE_PRIVACY_LOSSES( '<privacy_policy>' )
```

## Arguments

`'privacy_policy'`
:   Specifies the fully-qualified name of the privacy policy. A privacy policy is a schema-level object.

## Output

| Column | Data type | Description |
| --- | --- | --- |
| `database_name` | VARCHAR | Database that contains the privacy policy. |
| `schema_name` | VARCHAR | Schema that contains the privacy policy. |
| `policy_name` | VARCHAR | Name of the privacy policy. |
| `budget_name` | VARCHAR | Name of the privacy budget in the privacy policy. |
| `consumer_id` | VARCHAR | Organization and account where users executed queries that incurred privacy loss. |
| `budget_spent` | FLOAT | Cumulative privacy loss since the last time the [privacy budget was refreshed](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md). |

## Usage notes

A privacy budget only appears if analysts associated with the privacy budget have incurred privacy loss or if an administrator has
[reset the privacy budget](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md).

## Examples

View the privacy budgets that are associated with the `my_policy_privacy` policy:

```sqlexample
SELECT *
  FROM TABLE(SNOWFLAKE.DATA_PRIVACY.CUMULATIVE_PRIVACY_LOSSES(
    'my_policy_db.my_policy_schema.my_policy_privacy'));
```

---
title: CURRENT_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/current_account.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_ACCOUNT

Returns the [account locator](../../user-guide/admin-account-identifier.md) used by the user’s current session.

> **Note:**
>
> If you want to find the [account name](../../user-guide/admin-account-identifier.md) rather than the account locator, use
> [CURRENT_ACCOUNT_NAME](current_account_name.md) instead. The preferred account identifier (`orgname-account_name`) uses
> the account name, not the account locator.

## Syntax

```sqlsyntax
CURRENT_ACCOUNT()
```

## Arguments

None.

## Returns

The data type of the returned value is `VARCHAR`.

## Examples

This shows how to call the `CURRENT_ACCOUNT` function:

> ```sqlexample
> SELECT CURRENT_ACCOUNT();
> ```
>
> Output:
>
> ```sqlexample
> +-------------------+
> | CURRENT_ACCOUNT() |
> |-------------------|
> | XY12345           |
> +-------------------+
> ```

---
title: CURRENT_ACCOUNT_NAME
source: https://docs.snowflake.com/en/sql-reference/functions/current_account_name.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_ACCOUNT_NAME

Returns the name of the current account.

The preferred [account identifier](../../user-guide/admin-account-identifier.md) for the account consists of this account name along with the organization of the account (`orgname-account_name`).

## Syntax

```sqlsyntax
CURRENT_ACCOUNT_NAME()
```

## Arguments

None.

## Returns

Returns the name of the current account.

The data type of the returned value is `VARCHAR`.

## Example

This shows how to call the CURRENT_ACCOUNT_NAME function:

> ```sqlexample
> SELECT CURRENT_ACCOUNT_NAME();
> ```
>
> Output:
>
> ```output
> +-----------------------------+
> | CURRENT_ACCOUNT_NAME()      |
> |-----------------------------|
> | my_account1                 |
> +-----------------------------+
> ```

---
title: CURRENT_AVAILABLE_ROLES
source: https://docs.snowflake.com/en/sql-reference/functions/current_available_roles.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md)

# CURRENT_AVAILABLE_ROLES

Returns a list of all account-level roles granted to the current user. The list includes all roles that are granted
directly to the user plus all account-level roles lower in the hierarchies of these roles.

See also:
:   [CURRENT_ROLE](current_role.md) , [CURRENT_SECONDARY_ROLES](current_secondary_roles.md) , [IS_ROLE_IN_SESSION](is_role_in_session.md)

## Syntax

```sqlsyntax
CURRENT_AVAILABLE_ROLES()
```

## Arguments

None.

## Returns

Returns a string (VARCHAR) that is a JSON-encoded list of available account-level roles. The returned value can be
passed to the [PARSE_JSON](parse_json.md) function to get a VARIANT that contains a list of all the
available roles.

## Usage notes

* This function returns a list of account-level roles only when queried by a user. This function is not supported in service contexts that
  don’t have an active user. For example, [tasks](../../user-guide/tasks-intro.md) are executed by a system service that is not associated
  with a user. Thus, when this function is queried within a task, it returns an empty list (`[]`).
* This function does not return the names of database roles, application roles, or class instance roles.
* This function does not account for role activation in a session.

  For example, if specifying this function in the conditions of a [masking policy](../../user-guide/security-column-intro.md) or a
  [row access policy](../../user-guide/security-row-intro.md), the policy might inadvertently restrict access.

  If role activation and role hierarchy is necessary in the policy conditions, use [IS_ROLE_IN_SESSION](is_role_in_session.md).

## Examples

Return the list of roles granted to the current user:

> ```sqlexample
> SELECT CURRENT_AVAILABLE_ROLES();
>
> +----------------------------------------------------------+
> | ROW | CURRENT_AVAILABLE_ROLES()                          |
> +-----+----------------------------------------------------+
> |  1  | [ "PUBLIC", "ANALYST", "DATA_ADMIN", "DATA_USER" ] |
> +-----+----------------------------------------------------+
> ```

Use the PARSE_JSON function to return a VARIANT and the [FLATTEN](flatten.md) function to obtain a single row for each role:

> ```sqlexample
> SELECT INDEX,VALUE,THIS FROM TABLE(FLATTEN(input => PARSE_JSON(CURRENT_AVAILABLE_ROLES())));
>
> +-----+-------+------------------------+---------------------------+
> | ROW | INDEX | VALUE                  | THIS                      |
> +-----+-------+------------------------+---------------------------+
> |   1 |     0 | "PUBLIC"               | [                         |
> |     |       |                        |   "PUBLIC",               |
> |     |       |                        |   "ANALYST",              |
> |     |       |                        |   "DATA_ADMIN",           |
> |     |       |                        |   "DATA_USER"             |
> |     |       |                        | ]                         |
> +-----+-------+------------------------+---------------------------+
> |   2 |     1 | "ANALYST"              | [                         |
> |     |       |                        |   "PUBLIC",               |
> |     |       |                        |   "ANALYST",              |
> |     |       |                        |   "DATA_ADMIN",           |
> |     |       |                        |   "DATA_USER"             |
> |     |       |                        | ]                         |
> +-----+-------+------------------------+---------------------------+
> |   3 |     2 | "DATA_ADMIN"           | [                         |
> |     |       |                        |   "PUBLIC",               |
> |     |       |                        |   "ANALYST",              |
> |     |       |                        |   "DATA_ADMIN",           |
> |     |       |                        |   "DATA_USER"             |
> |     |       |                        | ]                         |
> +-----+-------+------------------------+---------------------------+
> |   4 |     3 | "DATA_USER"            | [                         |
> |     |       |                        |   "PUBLIC",               |
> |     |       |                        |   "ANALYST",              |
> |     |       |                        |   "DATA_ADMIN",           |
> |     |       |                        |   "DATA_USER"             |
> |     |       |                        | ]                         |
> +-----+-------+------------------------+---------------------------+
> ```

---
title: CURRENT_CLIENT
source: https://docs.snowflake.com/en/sql-reference/functions/current_client.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_CLIENT

Returns the version of the client from which the function was called. If called from an application using the JDBC or ODBC driver to connect to Snowflake, returns the version of the driver.

## Syntax

```sqlsyntax
CURRENT_CLIENT()
```

## Usage notes

* The Worksheet in the Snowflake web interface connects to Snowflake directly through the interface; it doesn’t use the JDBC or ODBC driver. As such, calling CURRENT_CLIENT in the Worksheet returns a
  different value than calling the function from a client application.

## Examples

Call CURRENT_CLIENT from within SnowSQL:

> ```sqlexample
> SELECT CURRENT_CLIENT();
>
> +------------------+
> | CURRENT_CLIENT() |
> |------------------|
> | SnowSQL 1.1.18   |
> +------------------+
> ```

Call CURRENT_CLIENT from within the Worksheet in Snowsight:

> ```sqlexample
> SELECT CURRENT_CLIENT();
> ```
>
> Results
>
> |  |  |
> | --- | --- |
> | row# | CURRENT_CLIENT() |
> | 1 | Snowflake UI 1434236365 |

---
title: CURRENT_DATABASE
source: https://docs.snowflake.com/en/sql-reference/functions/current_database.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_DATABASE

Returns the name of the current database, which varies depending on where you call the function:

* If you call this function outside of a policy, UDF, or view, it returns the database that is in use for the current session.
* If you call this function in the body of a policy, for example a masking policy, it returns the database that contains the table or view
  that is protected by the policy.
* If you call this function in the handler code of a UDF, it returns the database that contains the UDF.
* If you call this function in the definition of a view, it returns the database that contains the view.

## Syntax

```sqlsyntax
CURRENT_DATABASE()
```

## Arguments

None.

## Usage notes

None.

## Examples

Show the current warehouse, database, and schema:

> ```sqlexample
> SELECT CURRENT_WAREHOUSE(), CURRENT_DATABASE(), CURRENT_SCHEMA();
> ```
>
> Output:
>
> ```sqlexample
> +---------------------+--------------------+------------------+
> | CURRENT_WAREHOUSE() | CURRENT_DATABASE() | CURRENT_SCHEMA() |
> |---------------------+--------------------+------------------|
> | DEV_WAREHOUSE       | TEST_DATABASE      | UDF_TEST_SCHEMA  |
> +---------------------+--------------------+------------------+
> ```

---
title: CURRENT_DATE
source: https://docs.snowflake.com/en/sql-reference/functions/current_date.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_DATE

Returns the current date of the system.

## Syntax

```sqlsyntax
CURRENT_DATE()

CURRENT_DATE
```

## Arguments

None.

## Returns

The function returns a value of type [DATE](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned date is
  in the time zone for the session.
* The display format for dates in the output is determined by the [DATE_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `YYYY-MM-DD`).
* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := CURRENT_DATE();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).

## Examples

Show the current date, time, and timestamp:

```sqlexample
SELECT CURRENT_DATE(), CURRENT_TIME(), CURRENT_TIMESTAMP();
```

```output
+----------------+----------------+-------------------------------+
| CURRENT_DATE() | CURRENT_TIME() | CURRENT_TIMESTAMP()           |
|----------------+----------------+-------------------------------|
| 2024-04-18     | 07:47:37       | 2024-04-18 07:47:37.084 -0700 |
+----------------+----------------+-------------------------------+
```

---
title: CURRENT_IP_ADDRESS
source: https://docs.snowflake.com/en/sql-reference/functions/current_ip_address.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md)

# CURRENT_IP_ADDRESS

Returns the IP address of the client that submitted the request.

## Syntax

```sqlsyntax
CURRENT_IP_ADDRESS()
```

## Arguments

None.

## Examples

Return the current IP address of the client that is connected to Snowflake:

> ```sqlexample
> select current_ip_address();
>
> +----------------------+
> | CURRENT_IP_ADDRESS() |
> +----------------------+
> | 192.0.2.255          |
> +----------------------+
> ```

---
title: CURRENT_ORGANIZATION_NAME
source: https://docs.snowflake.com/en/sql-reference/functions/current_organization_name.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_ORGANIZATION_NAME

Returns the name of the organization to which the current account belongs.

## Syntax

```sqlsyntax
CURRENT_ORGANIZATION_NAME()
```

## Arguments

None.

## Returns

The data type of the returned value is `VARCHAR`.

## Example

This shows how to call the CURRENT_ORGANIZATION_NAME function:

> ```sqlexample
> SELECT CURRENT_ORGANIZATION_NAME();
> ```
>
> Output:
>
> ```output
> +-----------------------------+
> | CURRENT_ORGANIZATION_NAME() |
> |-----------------------------|
> | bazco                       |
> +-----------------------------+
> ```

---
title: CURRENT_ORGANIZATION_USER
source: https://docs.snowflake.com/en/sql-reference/functions/current_organization_user.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_ORGANIZATION_USER

Returns the name of the user currently logged into the system, but only if the user is an
[organization user](../../user-guide/organization-users.md).

## Syntax

```sqlsyntax
CURRENT_ORGANIZATION_USER()
```

## Arguments

None.

## Returns

If the current user is an organization user, returns a value of type VARCHAR.

If the current user is not an organization user, returns NULL.

## Usage notes

* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := CURRENT_ORGANIZATION_USER();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).
* Unlike the [CURRENT_USER](current_user.md) context function, this function can return a user when it’s called from a data
  sharing consumer account.

## Examples

```sqlexample
SELECT CURRENT_ORGANIZATION_USER();
```

```output
+-----------------------------+
| CURRENT_ORGANIZATION_USER() |
|-----------------------------|
| TSMITH                      |
+-----------------------------+
```

---
title: CURRENT_REGION
source: https://docs.snowflake.com/en/sql-reference/functions/current_region.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_REGION

Returns the name of the region for the account where the current user is logged in.

For organizations that have accounts in multiple [region groups](../../user-guide/admin-account-identifier.md), returns `region_group.region`.

## Syntax

```sqlsyntax
CURRENT_REGION()
```

## Arguments

None.

## Examples

Show the current region:

> ```sqlexample
> SELECT CURRENT_REGION();
> ```
>
> Output:
>
> ```sqlexample
> +------------------+
> | CURRENT_REGION() |
> |------------------|
> | AWS_US_WEST_2    |
> +------------------+
> ```

Show the current region when the current user is logged into an account in an organization that spans multiple region groups:

> ```sqlexample
> SELECT CURRENT_REGION();
> ```
>
> Output:
>
> ```sqlexample
> +----------------------+
> | CURRENT_REGION()     |
> |----------------------|
> | PUBLIC.AWS_US_WEST_2 |
> +----------------------+
> ```

---
title: CURRENT_ROLE
source: https://docs.snowflake.com/en/sql-reference/functions/current_role.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_ROLE

Returns the name of the [primary role](../../user-guide/security-access-control-overview.md) in use for the current session when the primary role is an account-level role or NULL if the role in use for the current session is a database role.

To specify a different role for the session, execute the [USE ROLE](../sql/use-role.md)
command.

## Syntax

```sqlsyntax
CURRENT_ROLE()
```

## Arguments

None.

## Usage notes

* Granting access on a [secure UDF](../../developer-guide/secure-udf-procedure.md) or [secure view](../../user-guide/views-secure.md) that
  contains this function to a share is allowed. When the secure UDF or secure view is accessed from the data sharing consumer account, this
  function always returns a NULL value.
* Snowflake returns a NULL value if this function is used in a [masking policy](../../user-guide/security-column-intro.md) or
  [row access policy](../../user-guide/security-row-intro.md) that is assigned to a shared table or view.

## Examples

This demonstrates `CURRENT_ROLE()`:

```sqlexample
SELECT CURRENT_ROLE();
```

Output:

> ```sqlexample
> +----------------+
> | CURRENT_ROLE() |
> |----------------|
> | SYSADMIN       |
> +----------------+
> ```

---
title: CURRENT_ROLE_TYPE
source: https://docs.snowflake.com/en/sql-reference/functions/current_role_type.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_ROLE_TYPE

Calling the CURRENT_ROLE_TYPE function returns `ROLE` if the current active (primary) role in the session is an account role. Calling the
CURRENT_ROLE_TYPE function from a session running inside a Snowflake native application returns `APPLICATION_INSTANCE`.

## Syntax

```sqlsyntax
CURRENT_ROLE_TYPE()
```

## Arguments

None.

## Usage notes

The primary role in a session cannot be a database role. Therefore, this functions will never return `DATABASE_ROLE`.

None.

## Examples

```sqlexample
SELECT CURRENT_ROLE_TYPE();

+---------------------+
| CURRENT_ROLE_TYPE() |
|---------------------|
| ROLE                |
+---------------------+
```

---
title: CURRENT_SCHEMA
source: https://docs.snowflake.com/en/sql-reference/functions/current_schema.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_SCHEMA

Returns the name of the current schema, which varies depending on where you call the function:

* If you call this function outside of a policy, UDF, or view, it returns the schema that is in use for the current session.
* If you call this function in the body of a policy, for example a masking policy, it returns the schema that contains the table or view
  that is protected by the policy.
* If you call this function in the handler code of a UDF, it returns the schema that contains the UDF.
* If you call this function in the definition of a view, it returns the schema that contains the view.

## Syntax

```sqlsyntax
CURRENT_SCHEMA()
```

## Arguments

None.

## Usage notes

* Do not confuse this function with the similarly named function [CURRENT_SCHEMAS](current_schemas.md).

## Examples

Show the current warehouse, database, and schema:

> ```sqlexample
> SELECT CURRENT_WAREHOUSE(), CURRENT_DATABASE(), CURRENT_SCHEMA();
> ```
>
> Output:
>
> ```sqlexample
> +---------------------+--------------------+------------------+
> | CURRENT_WAREHOUSE() | CURRENT_DATABASE() | CURRENT_SCHEMA() |
> |---------------------+--------------------+------------------|
> | DEV_WAREHOUSE       | TEST_DATABASE      | UDF_TEST_SCHEMA  |
> +---------------------+--------------------+------------------+
> ```

---
title: CURRENT_SCHEMAS
source: https://docs.snowflake.com/en/sql-reference/functions/current_schemas.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_SCHEMAS

Returns active search path schemas.

For more information about search path, see [Object name resolution](../name-resolution.md).

## Syntax

```sqlsyntax
CURRENT_SCHEMAS()
```

## Arguments

None.

## Usage notes

Do not confuse this function with the similarly named function
[CURRENT_SCHEMA](current_schema.md).

## Examples

Show the schemas that will be searched if a table or other database object
is referenced without a schema name:

> ```sqlexample
> SELECT CURRENT_SCHEMAS();
> ```
>
> Output:
>
> ```sqlexample
> +-----------------------------------------+
> | CURRENT_SCHEMAS()                       |
> |-----------------------------------------|
> | ["TEST_DB1.BILLING", "TEST_DB1.PUBLIC"] |
> +-----------------------------------------+
> ```

---
title: CURRENT_SECONDARY_ROLES
source: https://docs.snowflake.com/en/sql-reference/functions/current_secondary_roles.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_SECONDARY_ROLES

Returns the [secondary roles](../../user-guide/security-access-control-overview.md) in use for the current session.

To activate a different set of secondary roles for the session, execute the [USE SECONDARY ROLES](../sql/use-secondary-roles.md) command.

## Syntax

```sqlsyntax
CURRENT_SECONDARY_ROLES()
```

## Arguments

None.

## Returns

Returns a string (VARCHAR) that is a JSON-encoded object containing the following name/value pairs:

`roles`
:   Contains a list of the activated secondary roles. This list includes only those roles that are directly granted to the user; roles lower
    in the hierarchy of these roles are not listed.

`value`
:   Contains a list of the requested secondary roles, either those requested with the current user’s `DEFAULT_SECONDARY_ROLES` property or
    with the USE SECONDARY ROLES command.

## Usage notes

Granting access on a secure UDF or secure view that contains CURRENT_SECONDARY_ROLES to a share is allowed. When the
secure UDF or secure view is accessed from the data-sharing consumer account, CURRENT_SECONDARY_ROLES always returns a
NULL value.

## Examples

The current user has `DEFAULT_SECONDARY_ROLES=('ALL')`. Custom roles `role1`, `role2`, and `role3` are granted
to the current user and are active as secondary roles:

```sqlexample
SELECT CURRENT_SECONDARY_ROLES();
```

```output
+------------------------------------------------------+
|           CURRENT_SECONDARY_ROLES()                  |
+------------------------------------------------------+
| {"roles":"ROLE1,ROLE2,ROLE3","value":"ALL"}          |
+------------------------------------------------------+
```

---
title: CURRENT_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/current_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_SESSION

Returns a unique system identifier for the Snowflake session corresponding to the present connection. This will generally be a system-generated alphanumeric string. It is NOT derived from the user name or user account.

## Syntax

```sqlsyntax
CURRENT_SESSION()
```

## Returns

The data type of the returned value is VARCHAR.

## Examples

```sqlexample
SELECT CURRENT_SESSION();
-------------------+
 CURRENT_SESSION() |
-------------------+
 34359980038       |
-------------------+
```

---
title: CURRENT_STATEMENT
source: https://docs.snowflake.com/en/sql-reference/functions/current_statement.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_STATEMENT

Returns the SQL text of the statement that is currently executing.

## Syntax

```sqlsyntax
CURRENT_STATEMENT()
```

## Arguments

None.

## Examples

This shows a simple example of using the `CURRENT_STATEMENT` function:

> ```sqlexample
> SELECT 2.71, CURRENT_STATEMENT();
> ```
>
> Output:
>
> ```sqlexample
> +------+-----------------------------------+
> | 2.71 | CURRENT_STATEMENT()               |
> |------+-----------------------------------|
> | 2.71 | SELECT 2.71, CURRENT_STATEMENT(); |
> +------+-----------------------------------+
> ```

---
title: CURRENT_TASK_GRAPHS
source: https://docs.snowflake.com/en/sql-reference/functions/current_task_graphs.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# CURRENT_TASK_GRAPHS

Returns the status of a *graph* run that is currently scheduled or is executing. A graph is currently
defined as a single scheduled task or a [task graph](../../user-guide/tasks-graphs.md) composed of a scheduled root task and one or more child
tasks. For the purposes of this function, *root task* refers to either the single
scheduled task or the root task in a task graph.

This function returns details for graph runs that are currently executing or are next scheduled to run within the next 8 days. To retrieve
the details for graph runs that have completed in the past 60 minutes, query the [COMPLETE_TASK_GRAPHS](complete_task_graphs.md) table function.

The function returns the graph run details for your entire Snowflake account or a specified root task.

## Syntax

```sqlsyntax
CURRENT_TASK_GRAPHS(
      [ RESULT_LIMIT => <integer> ]
      [, ROOT_TASK_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function. Note that the results are returned in descending SCHEDULED_TIME
    order. If the number of matching rows is greater than the result limit, the graph executions with the most recent scheduled timestamp are
    returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `1000`

`ROOT_TASK_NAME => string`
:   A case-insensitive string specifying the name of the root task. Only non-qualified task names are supported. Only graph runs for the
    specified task are returned. Note that if multiple tasks have the same name, the function returns the graph runs for each of these tasks.

## Usage notes

* To view a task graph within this function, the invoking role requires at least one of the following privileges:

  + OWNERSHIP privilege on the task (that is, the task owner).
  + MONITOR or OPERATE privileges on the task.
  + The global MONITOR EXECUTION privilege.
  + The ACCOUNTADMIN role.

  The role must also have the USAGE privilege on the database and schema that store the task, otherwise the DATABASE_NAME and SCHEMA_NAME values in the output are NULL.
* When the CURRENT_TASK_GRAPHS function is queried, its task name and result limit arguments are applied first
  followed by the WHERE and LIMIT clause, respectively, if specified. In addition, the CURRENT_TASK_GRAPHS function returns records in
  descending SCHEDULED_TIME order.

  In practice, if you have many task graphs running in your account, the results returned by the function could include only scheduled tasks,
  especially if the RESULT_LIMIT value is relatively low.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_NAME | TEXT | Name of the root task. |
| DATABASE_NAME | TEXT | Name of the database that contains the graph. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the graph. |
| STATE | TEXT | State of the graph run:   * `SCHEDULED`: The root task is scheduled at a future time. * `EXECUTING`: At least one task run in the graph is still executing, or the root task ran successfully to completion and one or more child tasks are scheduled.   Note that if the state of the root task run is SKIPPED, the function does not return a row for the run. |
| SCHEDULED_FROM | TEXT | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| FIRST_ERROR_TASK_NAME | TEXT | Name of the first task in the graph that returned an error; returns NULL if no task produced an error. |
| FIRST_ERROR_CODE | NUMBER | Error code of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| FIRST_ERROR_MESSAGE | TEXT | Error message of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the root task is/was scheduled to start running. Note that we make a best effort to ensure absolute precision, but only guarantee that tasks do not execute *before* the scheduled time. |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the root task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| NEXT_SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the standalone or root task (in a [task graph](../../user-guide/tasks-graphs.md)) is next scheduled to start running, assuming the current run of the standalone task or [task graph](../../user-guide/tasks-graphs.md) started at the SCHEDULED_TIME time completes in time. |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a [task graph](../../user-guide/tasks-graphs.md). This ID matches the ID column value in the SHOW TASKS output for the same task. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the [task graph](../../user-guide/tasks-graphs.md) that was run, or is scheduled to be run. |
| RUN_ID | NUMBER | Time when the standalone or root task in a [task graph](../../user-guide/tasks-graphs.md) is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system might reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run before retry. You can use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| CONFIG | TEXT | Displays the graph level configuration used during the graph run if explicitly set. Otherwise displays NULL. |
| GRAPH_RUN_GROUP_ID | TEXT | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Examples

Retrieve the 1000 most recent graph runs (still running, or scheduled in the future) in the account. Note that the maximum
number of rows returned by the function is limited to 1000 by default. To change the number of rows returned, modify the RESULT_LIMIT
argument value:

> ```sqlexample
> select *
>   from table(information_schema.current_task_graphs())
>   order by scheduled_time;
> ```

Retrieve the 10 most recent graph runs for a specified task (still running or scheduled in the future):

> ```sqlexample
> select *
>   from table(information_schema.current_task_graphs(
>     result_limit => 10,
>     root_task_name=>'MYTASK'));
> ```

---
title: CURRENT_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/current_time.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_TIME

Returns the current time for the system.

Aliases:
:   [LOCALTIME](localtime.md)

## Syntax

```sqlsyntax
CURRENT_TIME( [ <fract_sec_precision> ] )

CURRENT_TIME
```

## Arguments

`fract_sec_precision`
:   This optional argument indicates the precision with which to report the
    time. For example, a value of 3 says to use 3 digits after the decimal
    point (i.e. to specify the time with a precision of milliseconds).

    The default precision is 9 (nanoseconds).

    Valid values range from 0 - 9. However, most platforms do not support true
    nanosecond precision; the precision that you get might be less than the
    precision you specify. In practice, precision is usually approximately
    milliseconds (3 digits) at most.

    > **Note:**
    >
    > * Fractional seconds are only displayed if they have been explicitly set in the [TIME_OUTPUT_FORMAT](../parameters.md) parameter for the session (e.g. `'HH24:MI:SS.FF'`).

## Returns

Returns a value of type [TIME](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned time is
  in the time zone for the session.
* The display format for times in the output is determined by the [TIME_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `HH24:MI:SS`).
* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := <function_name>();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual
  warehouse) because the queries might be serviced by different compute resources (in the warehouse).

## Examples

Set the time output format to `HH24:MI:SS.FF`, then return the current time with fractional seconds precision first set to 2, then 4, and then the default (9):

```sqlexample
ALTER SESSION SET TIME_OUTPUT_FORMAT = 'HH24:MI:SS.FF';

SELECT CURRENT_TIME(2);
```

```output
+-----------------+
| CURRENT_TIME(2) |
|-----------------|
| 15:35:51.24     |
+-----------------+
```

```sqlexample
SELECT CURRENT_TIME(4);
```

```output
+-----------------+
| CURRENT_TIME(4) |
|-----------------|
| 15:36:53.5570   |
+-----------------+
```

```sqlexample
SELECT CURRENT_TIME;
```

```output
+--------------------+
| CURRENT_TIME       |
|--------------------|
| 15:37:29.644000000 |
+--------------------+
```

---
title: CURRENT_TIMESTAMP
source: https://docs.snowflake.com/en/sql-reference/functions/current_timestamp.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_TIMESTAMP

Returns the current timestamp for the system in the local time zone.

Aliases:
:   [LOCALTIMESTAMP](localtimestamp.md) , [GETDATE](getdate.md) , [SYSTIMESTAMP](systimestamp.md)

## Syntax

```sqlsyntax
CURRENT_TIMESTAMP( [ <fract_sec_precision> ] )

CURRENT_TIMESTAMP
```

## Arguments

`fract_sec_precision`
:   This optional argument indicates the precision with which to report the
    time. For example, a value of 3 says to use 3 digits after the decimal
    point (that is, to specify the time with a precision of milliseconds).

    The default precision is 9 (nanoseconds).

    Valid values range from 0 - 9. However, most platforms do not support true
    nanosecond precision; the precision that you get might be less than the
    precision you specify. In practice, precision is usually approximately
    milliseconds (3 digits) at most.

    > **Note:**
    >
    > Fractional seconds are only displayed if they have been explicitly set in the [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) parameter for the session (e.g. `'YYYY-MM-DD HH24:MI:SS.FF'`).

## Returns

Returns the current system time. The data type of the returned value is
[TIMESTAMP_LTZ](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned timestamp is in the time zone for the session.
* The setting of the [TIMESTAMP_TYPE_MAPPING](../parameters.md) parameter does not affect the return value.
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual warehouse) because the queries might be serviced by different compute resources (in the warehouse).

* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := CURRENT_TIMESTAMP();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).
* The aliases SYSTIMESTAMP and GETDATE differ from CURRENT_TIMESTAMP in the following ways:

  + They do not support the `fract_sec_precision` argument.
  + These functions must be called with parentheses.

## Examples

The examples in this section use the timestamp output format `YYYY-MM-DD HH24:MI:SS.FF`. To configure
your session to use the same output format, run the following statement:

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
```

### Call the CURRENT_TIMESTAMP function with different precision values

Return the current timestamp with fractional seconds precision set to `2`:

```sqlexample
SELECT CURRENT_TIMESTAMP(2);
```

```output
+------------------------+
| CURRENT_TIMESTAMP(2)   |
|------------------------|
| 2024-04-17 15:41:38.29 |
+------------------------+
```

Return the current timestamp with fractional seconds precision set to `4`:

```sqlexample
SELECT CURRENT_TIMESTAMP(4);
```

```output
+--------------------------+
| CURRENT_TIMESTAMP(4)     |
|--------------------------|
| 2024-04-17 15:42:14.2100 |
+--------------------------+
```

Return the current timestamp with fractional seconds precision set to the default (`9`):

```sqlexample
SELECT CURRENT_TIMESTAMP;
```

```output
+-------------------------------+
| CURRENT_TIMESTAMP             |
|-------------------------------|
| 2024-04-17 15:42:55.130000000 |
+-------------------------------+
```

### Call the CURRENT_TIMESTAMP function with different TIMEZONE settings

Set the [TIMEZONE](../parameters.md) parameter to `America/New_York` and call the CURRENT_TIMESTAMP function:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/New_York';

SELECT CURRENT_TIMESTAMP(2);
```

```output
+------------------------+
| CURRENT_TIMESTAMP(2)   |
|------------------------|
| 2025-08-11 14:16:43.57 |
+------------------------+
```

Set the TIMEZONE parameter to `America/Los_Angeles` and call the CURRENT_TIMESTAMP function:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';

SELECT CURRENT_TIMESTAMP(2);
```

```output
+------------------------+
| CURRENT_TIMESTAMP(2)   |
|------------------------|
| 2025-08-11 11:17:18.19 |
+------------------------+
```

---
title: CURRENT_TRANSACTION
source: https://docs.snowflake.com/en/sql-reference/functions/current_transaction.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_TRANSACTION

Returns the transaction id of an open transaction in the current session.

See also:
:   [LAST_TRANSACTION](last_transaction.md) , [DESCRIBE TRANSACTION](../sql/desc-transaction.md)

## Syntax

```sqlsyntax
CURRENT_TRANSACTION()
```

## Arguments

None.

## Examples

This shows the transaction ID of the current transaction:

> ```sqlexample
> SELECT CURRENT_TRANSACTION();
> ```
>
> Output:
>
> ```sqlexample
> +-----------------------+
> | CURRENT_TRANSACTION() |
> |-----------------------|
> | 1661899308790000000   |
> +-----------------------+
> ```

---
title: CURRENT_USER
source: https://docs.snowflake.com/en/sql-reference/functions/current_user.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# CURRENT_USER

Returns the name of the user currently logged into the system.

## Syntax

```sqlsyntax
CURRENT_USER()

CURRENT_USER
```

## Arguments

None.

## Returns

This function returns a value of type VARCHAR.

## Usage notes

* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := CURRENT_USER();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).
* Granting access on a [secure UDF](../../developer-guide/secure-udf-procedure.md) or [secure view](../../user-guide/views-secure.md) that
  contains this function to a share is allowed. When the secure UDF or secure view is accessed from the data sharing consumer account, this
  function always returns a NULL value.
* Snowflake returns a NULL value if this function is used in a [masking policy](../../user-guide/security-column-intro.md) or
  [row access policy](../../user-guide/security-row-intro.md) that is assigned to a shared table or view.

## Examples

This example calls the CURRENT_USER function:

```sqlexample
SELECT CURRENT_USER();
```

```output
+----------------+
| CURRENT_USER() |
|----------------|
| TSMITH         |
+----------------+
```

---
title: CURRENT_VERSION
source: https://docs.snowflake.com/en/sql-reference/functions/current_version.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# CURRENT_VERSION

Returns the current Snowflake version.

See also:
:   [CURRENT_CLIENT](current_client.md)

## Syntax

```sqlsyntax
CURRENT_VERSION()
```

## Arguments

None.

## Returns

The data type of the returned value is VARCHAR.

The returned value contains four fields:

> ```sqlexample
> <major_version>.<minor_version>.<patch_version>  <internal_identifier>
> ```
>
> `major_version`
> :   Major version numbers change annually. For example, the major version for all releases in 2023 is 7. For all releases in 2022, the major version is 6.
>
> `minor_version`
> :   Minor version numbers change for each weekly release.
>
> `patch_version`
> :   Patch version numbers represent minor changes within a weekly release.
>
> `internal_identifier`
> :   This field is for internal use only.
>
> For example, for version 7.32.1, the major version is 7, the minor version is 32, and the patch version is 1.

## Usage notes

* This function returns version number information for Snowflake. To retrieve information about client versions,
  see [CURRENT_CLIENT](current_client.md).

## Examples

This shows the version of Snowflake on which the query is run:

> ```sqlexample
> SELECT CURRENT_VERSION();
> ```
>
> Output:
>
> ```sqlexample
> +-------------------+
> | CURRENT_VERSION() |
> |-------------------|
> | 7.32.1            |
> +-------------------+
> ```

---
title: CURRENT_WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/functions/current_warehouse.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# CURRENT_WAREHOUSE

Returns the name of the warehouse in use for the current session.

To specify a different warehouse for the session, execute the [USE WAREHOUSE](../sql/use-warehouse.md)
command.

## Syntax

```sqlsyntax
CURRENT_WAREHOUSE()
```

## Arguments

None.

## Examples

Show the current warehouse, database, and schema:

> ```sqlexample
> SELECT CURRENT_WAREHOUSE(), CURRENT_DATABASE(), CURRENT_SCHEMA();
> ```
>
> Output:
>
> ```sqlexample
> +---------------------+--------------------+------------------+
> | CURRENT_WAREHOUSE() | CURRENT_DATABASE() | CURRENT_SCHEMA() |
> |---------------------+--------------------+------------------|
> | DEV_WAREHOUSE       | TEST_DATABASE      | UDF_TEST_SCHEMA  |
> +---------------------+--------------------+------------------+
> ```

---
title: DATA_AGENT_RUN (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/data_agent_run-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# DATA_AGENT_RUN (SNOWFLAKE.CORTEX)

Runs a [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md) and returns the response as JSON.

You can use this function to run a Cortex Agent, which orchestrates across both structured and unstructured data sources to deliver insights. This includes planning tasks, using tools to execute these tasks, and generating responses.

> **Note:**
>
> `SNOWFLAKE.CORTEX.DATA_AGENT_RUN` is a utility wrapper around the [Cortex Agents Run API](../../user-guide/snowflake-cortex/cortex-agents-run.md).
> For most application integrations, Snowflake recommends calling the **streaming REST API** directly.

See also:
:   [CREATE AGENT](../sql/create-agent.md) , [SHOW AGENTS](../sql/show-agents.md) , [DESCRIBE AGENT](../sql/desc-agent.md) , [DROP AGENT](../sql/drop-agent.md)

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.DATA_AGENT_RUN( '<agent_name>', <request_body> )
```

## Arguments

`'agent_name'`
:   Fully qualified name of the agent to run, in the form `database.schema.agent_name`.

`request_body`
:   JSON request body to send to the agent. This value must be a string (for example, a `$$...$$` literal).

    The following fields are supported in the request body:

    | Field | Type | Description |
    | --- | --- | --- |
    | `thread_id` | integer | The thread ID for the conversation. If thread_id is used, then parent_message_id must be passed as well. |
    | `parent_message_id` | integer | The ID of the parent message in the thread. If this is the first message, parent_message_id should be 0. |
    | `messages` | array of [Message](../../user-guide/snowflake-cortex/cortex-agents-run.md) | If thread_id and parent_message_id are passed in the request, messages includes the current user message in the conversation. Else, messages includes the conversation history and the current message. Messages contains both user queries and assistant responses in chronological order. |
    | `stream` | boolean | Whether to return a streaming response (`text/event-stream`) or a non-streaming JSON response (`application/json`). If true, the response will be streamed as Server-Sent Events. If false, the response will be returned as JSON. |
    | `tool_choice` | [ToolChoice](../../user-guide/snowflake-cortex/cortex-agents-run.md) | Configures how the agent should select and use tools during the interaction. Controls whether tool use is automatic, required, or whether specific tools should be used. |

    **Example**

    ```json
    {
      "thread_id": 0,
      "parent_message_id": 0,
      "messages": [
        {
          "role": "user",
          "content": [
            {
              "type": "text",
              "text": "What is the total revenue for 2023?"
            }
          ]
        }
      ],
      "stream": false,
      "tool_choice": {
        "type": "auto",
        "name": [
          "analyst_tool",
          "search_tool"
        ]
      }
    }
    ```

> **Important:**
>
> The `stream` field is ignored. A non-streaming response is always returned.

## Returns

Returns a JSON string containing the agent’s response.

## Access control requirements

To run an agent, you must use a role that can access Cortex Agents and the agent object you’re calling.
For details, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-agents.md).

## Usage notes

* The function returns a JSON string. Pass this string to [TRY_PARSE_JSON](try_parse_json.md) to convert the response to a VARIANT value.

## Examples

Run an agent and parse the response JSON:

```sqlexample
SELECT
  TRY_PARSE_JSON(
    SNOWFLAKE.CORTEX.DATA_AGENT_RUN(
      'MY_DB.MY_SCHEMA.MY_AGENT',
      $${
        "parent_message_id": 1234,
        "thread_id": 5678,
        "messages": [
          {
            "role": "user",
            "content": [
              { "type": "text", "text": "What are some types of products?" }
            ]
          }
        ]
      }$$
    )
  ) AS resp;
```

Sample return value:

```json
{
  "role": "assistant",
  "content": [
    {
      "thinking": {
        "text": "\n...\n"
      },
      "type": "thinking"
    },
    {
      "tool_use": {
        "input": {
          "...": "..."
        },
        "name": "<tool_name>",
        "tool_use_id": "<tool_use_id>",
        "type": "<tool_type>"
      },
      "type": "tool_use"
    },
    {
      "text": "Based on the data available, there are two main types of products...",
      "type": "text"
    }
  ],
  "metadata": {
    "run_id": "<run_id>"
  }
}
```

---
title: DATA_METRIC_FUNCTION_EXPECTATIONS
source: https://docs.snowflake.com/en/sql-reference/functions/data_metric_function_expectations.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATA_METRIC_FUNCTION_EXPECTATIONS

Returns information about the [expectations](../../user-guide/data-quality-expectations.md) that exist in the account.

## Syntax

```sqlsyntax
DATA_METRIC_FUNCTION_EXPECTATIONS(
  [ METRIC_NAME => '<string>' ]
  [, REF_ENTITY_NAME => '<string>' ]
  [, REF_ENTITY_DOMAIN => '<string>' ]
)
```

## Arguments

`METRIC_NAME => 'string'`
:   Specifies the name of a system or custom data metric function (DMF). This function returns expectations that were added to the
    associations between objects and the specified DMF.

`REF_ENTITY_NAME => 'string'`
:   Specifies the name of an object with which DMFs are associated. Returns expectations that were added to DMF associations with the object.
    If specified, you must also specify `REF_ENTITY_DOMAIN`.

    The entire object name must be enclosed in single quotes.

    If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
    case/characters. The double quotes must be enclosed within the single quotes, such as `'"table_name"'`.

`REF_ENTITY_DOMAIN => 'string'`
:   The object type of `REF_ENTITY_NAME`.

    * If the object is any type of table, use `table` as the argument value.
    * If the object is a view or materialized view, use `view` as the argument value.

## Output

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `metric_database_name` | VARCHAR | Database where the DMF exists. |
| `metric_schema_name` | VARCHAR | Schema where the DMF exists. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_signature` | VARCHAR | Signature of the DMF. |
| `metric_data_type` | VARCHAR | Data type returned by the DMF. |
| `ref_entity_database_name` | VARCHAR | Database of the object associated with the DMF. |
| `ref_entity_schema_name` | VARCHAR | Schema of the object associated with the DMF. |
| `ref_entity_name` | VARCHAR | Name of the object associated with the DMF. |
| `ref_entity_domain` | VARCHAR | Type of object associated with the DMF. |
| `ref_arguments` | ARRAY | Arguments passed to the DMF. |
| `ref_id` | VARCHAR | System-generated identifier. |
| `expectation_id` | VARCHAR | System-generated identifier of the expectation. |
| `expectation_name` | VARCHAR | Name given to the expectation by the user when it was added to the DMF association. |
| `expectation_expression` | VARCHAR | Boolean expression of the expectation. See [Defining what meets the expectation](../../user-guide/data-quality-expectations.md). |

## Examples

Return expectations that exist for a specific object.

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      REF_ENTITY_NAME => 'my_table',
      REF_ENTITY_DOMAIN => 'table'));
```

Return expectations that exist for a specific DMF.

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      METRIC_NAME => 'SNOWFLAKE.CORE.NULL_COUNT'));
```

Return expectations that exist for a specific association between an object and a DMF.

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_EXPECTATIONS(
      METRIC_NAME => 'SNOWFLAKE.CORE.NULL_COUNT',
      REF_ENTITY_NAME => 'my_table',
      REF_ENTITY_DOMAIN => 'table'));
```

---
title: DATA_METRIC_FUNCTION_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/data_metric_function_references.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATA_METRIC_FUNCTION_REFERENCES

Returns a row for each object that has the specified data metric function assigned to the object or returns a row for each data
metric function assigned to the specified object.

See also:
:   [DATA_METRIC_FUNCTION_REFERENCES view](../account-usage/data_metric_function_references.md) (Account Usage view)

## Syntax

```sqlsyntax
DATA_METRIC_FUNCTION_REFERENCES(
  METRIC_NAME => '<string>' )

DATA_METRIC_FUNCTION_REFERENCES(
  REF_ENTITY_NAME => '<string>' ,
  REF_ENTITY_DOMAIN => '<string>'
  )
```

## Arguments

`METRIC_NAME => 'string'`
:   Specifies the name of the data metric function.

    * The entire data metric name must be enclosed in single quotes.
    * If the data metric name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, such as `'"<metric_name>"'`.

`REF_ENTITY_NAME => 'string'`
:   The name of the object, such as `table_name`, `view_name`, or `external_table_name`, on which the data metric function is added.

    * The entire object name must be enclosed in single quotes.
    * If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, such as `'"<table_name>"'`.

`REF_ENTITY_DOMAIN => 'string'`
:   The object type, such as table or materialized view, on which the data metric function is added.

    Use `'TABLE'` for all [supported table types](../../user-guide/data-quality-intro.md).

## Returns

The function returns the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `metric_database_name` | VARCHAR | The database that stores the data metric function. |
| `metric_schema_name` | VARCHAR | The schema that stores the data metric function. |
| `metric_name` | VARCHAR | The name of the data metric function. |
| `argument_signature` | VARCHAR | The type signature of the metrics arguments. |
| `data_type` | VARCHAR | The return data type of the data metric function. |
| `ref_database_name` | VARCHAR | The database name that contains the object on which the data metric function is added. |
| `ref_schema_name` | VARCHAR | The schema name that contains the object on which the data metric function is added. |
| `ref_entity_name` | VARCHAR | The name of the table or view on which the data metric function is set. |
| `ref_entity_domain` | VARCHAR | The object type (table, view) on which the data metric function is set. |
| `ref_arguments` | ARRAY | Identifies the reference arguments used to evaluate the rule. |
| `ref_id` | VARCHAR | A unique identifier for the association of the data metric function to the table or view. |
| `schedule` | VARCHAR | The schedule to run the data metric function on the table or view. The value for the schedule is always the most recent and effective schedule. |
| `schedule_status` | VARCHAR | The status of the metrics association. One of the following:  `STARTED`  The data metric association on the table or view is scheduled to run.  `STARTED_AND_PENDING_SCHEDULE_UPDATE`  A change to the data metric schedule occurred and the new schedule is not yet effective. Allow Snowflake to update the schedule and synchronize the schedule with the data metric function. This value is temporary until the updates are complete.  If you unset the schedule with an ALTER TABLE or ALTER VIEW command, this value remains until a new schedule is set.  `SUSPENDED`  The data metric association on the table or view is not scheduled to run. This value also occurs when the role in use that calls the function does not have the OWNERSHIP privilege on the table.  For a full list of possible values, see Usage notes: Suspended statuses. |
| `data_quality_notification_status` | VARCHAR | Indicates whether notifications are being sent when there is an expectation violation or an anomaly in data quality. Possible values are:   * `ENABLED` — Notifications are turned on for the database that contains the object *and* no one has turned off notifications   at the object level. * `DISABLED` — Notifications aren’t being sent for data quality issues uncovered by the DMF. * `ERROR_INSUFFICIENT_PRIVILEGE` — Notifications aren’t being sent because the database owner doesn’t have the required   privileges. For a list of the required privileges, see [Grant privileges](../../user-guide/data-quality-notifications.md). |
| `anomaly_detection_status` | VARCHAR | Indicates whether [anomaly detection](../../user-guide/data-quality-anomaly.md) is enabled for the association between the DMF and the object. If the value is `TRAINING_IN_PROGRESS`, see [About the training period](../../user-guide/data-quality-anomaly.md). |
| `anomaly_detection_sensitivity_level` | VARCHAR | The sensitivity level of anomaly detection. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md). |
| `use_role` | VARCHAR | Access control role with which the DMF runs. For more information, see [Required privilege on the table or view](../../user-guide/data-quality-access-control.md). |
| `exclude_table_types` | VARCHAR | Reserved for future use. |

## Access control requirements

Results are returned based on the privileges granted to the role executing the query.

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

* Any supported privilege on the data metric function.

  + For system DMFs, the role can be granted the DATA_METRIC_USER database role.
* The SELECT privilege on the table or view.

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function must
  use the fully-qualified object name. For more details, see [Snowflake Information Schema](../info-schema.md).
* Choose one syntax variation to execute a query. Mixing arguments results in errors and query failure.

  The argument values for `REF_ENTITY_NAME` and `REF_ENTITY_DOMAIN` must be included together otherwise the query fails.
* Snowflake returns errors if the specified object name does not exist or if the query operator is not authorized to view any data metric
  function on the object. Snowflake can return a result set of data metric associations if the operator is allowed to view a subset of the
  data metric associations.
* Unsupported object types listed as the `REF_ENTITY_DOMAIN`, such as `'stream'`, return errors.

## Usage notes: Suspended statuses

When the DMF association is suspended, the status can be one of the following:

`SUSPENDED_TABLE_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`
:   One of the following:

    * The table is dropped.
    * The schema or database that contains the table is dropped.
    * The schema or database that contains the table cannot be resolved by the table owner role.

      “Resolved” means the role that calls the function does not have the appropriate privileges on the schema or database that
      contains the table.

`SUSPENDED_DATA_METRIC_FUNCTION_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`
:   One of the following:

    * The DMF is dropped.
    * The schema or database that contains the DMF is dropped.
    * The schema or database that contains the DMF cannot be resolved by the table owner role.

`SUSPENDED_TABLE_COLUMN_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`
:   One of the following:

    * The target table column is dropped.
    * The schema or database that contains the column is dropped.
    * The schema or database that contains the column cannot be resolved by the table owner role.

`SUSPENDED_INSUFFICIENT_PRIVILEGE_TO_EXECUTE_DATA_METRIC_FUNCTION`
:   The table owner role does not have the EXECUTE DATA METRIC FUNCTION privilege.

`SUSPENDED_ACTIVE_EVENT_TABLE_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`
:   The event table is not set at the account level.

## Examples

To return a row for each DMF assigned to the table named `hr.tables.empl_info`, execute the following:

> ```sqlexample
> USE DATABASE governance;
> USE SCHEMA INFORMATION_SCHEMA;
> SELECT *
>   FROM TABLE(
>     INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_REFERENCES(
>       REF_ENTITY_NAME => 'hr.tables.empl_info',
>       REF_ENTITY_DOMAIN => 'table'
>     )
>   );
> ```

To return a row for each object (table or view) that has the DMF named `count_positive_numbers` set on that table or
view, execute the following:

> ```sqlexample
> USE DATABASE governance;
> USE SCHEMA INFORMATION_SCHEMA;
> SELECT *
>   FROM TABLE(
>     INFORMATION_SCHEMA.DATA_METRIC_FUNCTION_REFERENCES(
>       METRIC_NAME => 'governance.dmfs.count_positive_numbers'
>     )
>   );
> ```

---
title: DATA_METRIC_SCHEDULED_TIME (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_data_metric_schedule_time.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# DATA_METRIC_SCHEDULED_TIME (system data metric function)

Returns the timestamp for when a DMF is scheduled to run or the current timestamp if the function is called manually.

You can use this DMF to define custom metrics to measure the freshness of your data or to define incremental metrics in
conjunction with DMFs that already exist.

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.DATA_METRIC_SCHEDULED_TIME()
```

## Arguments

None.

## Returns

The function returns a scalar value with a TIMESTAMP_LTZ data type.

## Usage notes

Calling this function manually in a SELECT query returns the same value as the [CURRENT_TIMESTAMP](current_timestamp.md) function.

## Example

Create a custom data metric function to determine the data freshness on a table in the last hour:

> ```sqlexample
> CREATE OR REPLACE DATA METRIC FUNCTION data_freshness_hour(
>   ARG_T TABLE (ARG_C TIMESTAMP_LTZ))
>   RETURNS NUMBER AS
>   'SELECT TIMEDIFF(
>      minute,
>      MAX(ARG_C),
>      SNOWFLAKE.CORE.DATA_METRIC_SCHEDULED_TIME())
>    FROM ARG_T';
> ```

Call the data metric function manually:

> ```sqlexample
> SELECT data_freshness_hour(SELECT last_updated FROM hr.tables.empl_info) < 60;
> ```
>
> The statement returns `True` if there are no updates to the table in the last hour (60 minutes).
>
> The statement returns `False` if there were updates to the table that took place more than one hour ago.

---
title: DATA_QUALITY_MONITORING_EXPECTATION_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/data_quality_monitoring_expectation_status.md
section: SQL Functions
---

Categories:
:   [LOCAL schema](../local.md) , [Table functions](../functions-table.md)

# DATA_QUALITY_MONITORING_EXPECTATION_STATUS

For a specified object, returns a row for every time a data metric function (DMF) with an
[expectation](../../user-guide/data-quality-expectations.md) was run. You can obtain the status of the expectation in each row.

See also:
:   [DATA_QUALITY_MONITORING_EXPECTATION_STATUS view](../local/data_quality_monitoring_expectation_status.md) (LOCAL view)

## Syntax

```sqlsyntax
DATA_QUALITY_MONITORING_EXPECTATION_STATUS(
  REF_ENTITY_NAME => '<string>' ,
  REF_ENTITY_DOMAIN => '<string>'
  )
```

## Arguments

`REF_ENTITY_NAME => 'string'`
:   The name of the table object on which the data metric function with an expectation is set. The name must be fully qualified.

    * The entire object name must be enclosed in single quotes.
    * If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, such as `'"table_name"'`.

`REF_ENTITY_DOMAIN => 'string'`
:   The object type on which the data metric function with an expectation is set.

    If the object is a kind of table, use `'TABLE'` as the argument value.

    If the object is a view or materialized view, use `'VIEW'` as the argument value.

    For a list of supported object types on which a data metric function can be set, see [Supported table kinds](../../user-guide/data-quality-intro.md).

## Output

The function returns rows with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `scheduled_time` | TIMESTAMP_LTZ | The time the DMF is scheduled to run based on the schedule that you set for the table or view. |
| `change_commit_time` | TIMESTAMP_LTZ | The time the DMF trigger operation occurred, or `None` if the DMF is not scheduled to run by a trigger operation.  For information about the trigger operation, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md). |
| `measurement_time` | TIMESTAMP_LTZ | The time at which the metric was evaluated. |
| `table_id` | NUMBER | Internal/system-generated identifier of the table that is associated with the DMF. |
| `table_name` | VARCHAR | Name of the table that is associated with the DMF. |
| `table_schema` | VARCHAR | Name of the schema name that contains the table that is associated with the DMF. |
| `table_database` | VARCHAR | Name of the database that contains the table that is associated with the DMF. |
| `metric_id` | NUMBER | Internal/system-generated identifier of the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_return_type` | VARCHAR | Return type of the DMF. |
| `arguments_ids` | ARRAY | Array of the identifiers of the DMF arguments. Array elements are in the same order as the arguments. |
| `arguments_types` | ARRAY | Array of the domain/type of each argument. Array elements are in the same order as the arguments.  Currently only supports COLUMN type arguments. |
| `arguments_names` | ARRAY | Array of the names of the DMF arguments. For column arguments, each element is the name of a column. Array elements are in the same order as the arguments. |
| `reference_id` | VARCHAR | The ID to uniquely identify the metric entity reference, known as the association ID. |
| `value` | VARIANT | The result of the DMF evaluation. |
| `expectation_name` | VARCHAR | Name that was given to the expectation when it was added to the association between the DMF and the object. |
| `expectation_id` | VARCHAR | System-generated identifier. |
| `expectation_expression` | VARCHAR | Boolean expression of the expectation. See [Defining what meets the expectation](../../user-guide/data-quality-expectations.md). |
| `expectation_violated` | BOOLEAN | If TRUE, the expectation was violated. An expectation is violated when the `expectation_expression` evaluates to FALSE.  A NULL value indicates the evaluation of the expectation failed. |

## Access control requirements

To access this function, the role in use must have the SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role, at a minimum. For other
application role options, see [Viewing data quality results](../../user-guide/data-quality-access-control.md). Use the [GRANT APPLICATION ROLE](../sql/grant-application-role.md)
command to grant the application role to a role.

To view results, the role in use must also have the following privileges:

* The SELECT or OWNERSHIP privileges on the object (table or view) to which the data metric function is assigned.
* The USAGE or OWNERSHIP privileges on the data metric function.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Usage notes

Errors occur if the specified object name does not exist or if the query operator is not authorized to view any data metric function on
the object. Unsupported object types specified in the REF_ENTITY_DOMAIN argument, such as `'STREAM'`, also return errors.

## Examples

Return a row for each data metric function with an expectation that is assigned to the table named `my_table`:

```sqlexample
SELECT *
  FROM TABLE(SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_EXPECTATION_STATUS(
    REF_ENTITY_NAME => 'my_db.sch1.my_table',
    REF_ENTITY_DOMAIN => 'TABLE'));
```

---
title: DATA_QUALITY_MONITORING_RESULTS
source: https://docs.snowflake.com/en/sql-reference/functions/data_quality_monitoring_results.md
section: SQL Functions
---

Categories:
:   [LOCAL schema](../local.md) , [Table functions](../functions-table.md)

# DATA_QUALITY_MONITORING_RESULTS

Returns a row for each data metric function assigned to the specified object, which includes the evaluation result and other metadata of
the data metric function on the object.

See also:
:   [DATA_QUALITY_MONITORING_RESULTS view](../local/data_quality_monitoring_results.md) (LOCAL view)

## Syntax

```sqlsyntax
DATA_QUALITY_MONITORING_RESULTS(
  REF_ENTITY_NAME => '<string>' ,
  REF_ENTITY_DOMAIN => '<string>'
  )
```

## Arguments

`REF_ENTITY_NAME => 'string'`
:   The name of the table object on which the data metric function is set.

    * The entire object name must be enclosed in single quotes.
    * If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, such as `'"<table_name>"'`.

`REF_ENTITY_DOMAIN => 'string'`
:   The object type on which the data metric function is set.

    If the object is a kind of table, use `'TABLE'` as the argument value.

    If the object is a view or materialized view, use `'VIEW'` as the argument value.

    For a list of supported object types on which a data metric function can be set, see [Supported table kinds](../../user-guide/data-quality-intro.md).

## Returns

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `scheduled_time` | TIMESTAMP_LTZ | The time the DMF is scheduled to run based on the schedule that you set for the table or view. |
| `change_commit_time` | TIMESTAMP_LTZ | The time the DMF trigger operation occurred, or `None` if the DMF is not scheduled to run by a trigger operation.  For information about the trigger operation, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md). |
| `measurement_time` | TIMESTAMP_LTZ | The time at which the metric was evaluated. |
| `table_id` | NUMBER | Internal/system-generated identifier of the table that the DMF is associated with. |
| `table_name` | VARCHAR | Name of the table that the DMF is associated with. |
| `table_schema` | VARCHAR | Name of the schema that contains the table that the DMF is associated with. |
| `table_database` | VARCHAR | Name of the database that contains the table that the DMF is associated with. |
| `metric_id` | NUMBER | Internal/system-generated identifier of the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_return_type` | VARCHAR | Return type of the DMF. |
| `arguments_ids` | ARRAY | Array of the identifiers of the DMF arguments. Array elements are in the same order as the arguments. |
| `arguments_types` | ARRAY | Array of the domain/type of each DMF argument. Array elements are in the same order as the arguments.  Currently only supports COLUMN type arguments. |
| `arguments_names` | ARRAY | Array of the names of the DMF arguments. For column arguments, each element is the name of a column. Array elements are in the same order as the arguments. |
| `reference_id` | VARCHAR | The ID to uniquely identify the metric entity reference, known as the association ID. |
| `value` | VARIANT | The result of the DMF evaluation. |

## Access control requirements

To determine which privileges and roles you need to call this function, see [Viewing data quality results](../../user-guide/data-quality-access-control.md).

## Usage notes

Errors occur if the specified object name does not exist or if the query operator is not authorized to view any data metric function on
the object. Unsupported object types listed as the REF_ENTITY_DOMAIN, such as `'stream'`, also return errors.

## Examples

Return a row for each data metric function assigned to the table named `my_table`:

> ```sqlexample
> USE DATABASE SNOWFLAKE;
> USE SCHEMA LOCAL;
> SELECT *
>   FROM TABLE(SNOWFLAKE.LOCAL.DATA_QUALITY_MONITORING_RESULTS(
>     REF_ENTITY_NAME => 'my_db.my_schema.my_table',
>     REF_ENTITY_DOMAIN => 'table'));
> ```

---
title: DATA_TRANSFER_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/data_transfer_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATA_TRANSFER_HISTORY

This table function can be used to query the history of data transferred from Snowflake tables into a different cloud storage provider’s network (i.e. from Snowflake on AWS, Google Cloud Platform, or Microsoft Azure into the other cloud provider’s network) and/or geographical region within a specified date range. The function returns the history for your entire Snowflake account.

> **Note:**
>
> This function returns data transfer activity within the last 14 days.

## Syntax

```sqlsyntax
DATA_TRANSFER_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range, within the last 2 weeks, for which to retrieve the data transfer history:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to show the previous 10 minutes of data transfer history).
      For example, if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

    History is displayed in increments of 5 minutes, 1 hour, or 24 hours (depending on the length of the specified range).

    If the range falls outside the last 15 days, an error is returned.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the data transfer took place. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the data transfer took place. |
| SOURCE_CLOUD | TEXT | Name of the cloud provider where the data transfer originated: Amazon Web Services, Google Cloud Platform, or Microsoft Azure. |
| SOURCE_REGION | TEXT | Region where the data transfer originated. |
| TARGET_CLOUD | TEXT | Name of the cloud provider where the data was sent: AWS, Google Cloud Platform, or Microsoft Azure. |
| TARGET_REGION | TEXT | Region where the data was sent. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred during the START_TIME and END_TIME window. |
| TRANSFER_TYPE | VARCHAR | Type of operation that caused transfer. [COPY](../sql/copy-into-location.md), [EXTERNAL_ACCESS](../../developer-guide/external-network-access/external-network-access-overview.md), [EXTERNAL_FUNCTION](../external-functions.md), [REPLICATION](../../user-guide/account-replication-intro.md). |

## Examples

Retrieve the data transfer history for a 30 minute range, in 5 minute periods, for your account:

> ```sqlexample
> select *
>   from table(mydb.information_schema.data_transfer_history(
>     date_range_start=>to_timestamp_tz('2017-10-24 12:00:00.000 -0700'),
>     date_range_end=>to_timestamp_tz('2017-10-24 12:30:00.000 -0700')));
> ```

Retrieve the data transfer history for the last 12 hours, in 1 hour periods, for your account:

> ```sqlexample
> select *
>   from table(information_schema.data_transfer_history(
>     date_range_start=>dateadd('hour',-12,current_timestamp())));
> ```

Retrieve the data transfer history for the last 14 days, in 1 day periods, for your account:

> ```sqlexample
> select *
>   from table(information_schema.data_transfer_history(
>     date_range_start=>dateadd('day',-14,current_date()),
>     date_range_end=>current_date()));
> ```

---
title: DATABASE_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/database_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATABASE_REFRESH_HISTORY

Returns the refresh history for a secondary database.

> **Note:**
>
> This function returns database refresh activity within the last 14 days.

See also:
:   [DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB](database_refresh_progress.md)

## Syntax

```sqlsyntax
DATABASE_REFRESH_HISTORY( '<secondary_db_name>' )
```

## Arguments

`secondary_db_name`
:   Name of the secondary database. This argument is optional if the secondary database is the active database in the current session.

    Note that the entire name must be enclosed in single quotes.

## Usage notes

* Only returns results for account administrators (users with the ACCOUNTADMIN role).
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* Following is the list of phases in the order processed:

  1. SECONDARY_UPLOADING_INVENTORY
  2. PRIMARY_UPLOADING_METADATA
  3. PRIMARY_UPLOADING_DATA
  4. SECONDARY_DOWNLOADING_METADATA
  5. SECONDARY_DOWNLOADING_DATA
  6. COMPLETED / FAILED / CANCELED

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| CURRENT_PHASE | TEXT | Current replication phase. For the list of phases, see the usage notes. |
| START_TIME | NUMBER | Time when the replication operation began. Format is epoch time. |
| END_TIME | NUMBER | Time when the replication operation finished, if applicable. Format is epoch time. |
| JOB_UUID | TEXT | Query ID for the secondary database refresh job. |
| COPY_BYTES | NUMBER | Number of bytes copied during the replication operation. |
| OBJECT_COUNT | NUMBER | Number of database objects copied during the replication operation. |

## Examples

Retrieve the database refresh history for the database that is currently active in the user session:

> ```sqlexample
> select *
> from table(information_schema.database_refresh_history());
> ```

---
title: DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB
source: https://docs.snowflake.com/en/sql-reference/functions/database_refresh_progress.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB

The DATABASE_REFRESH_PROGRESS family of functions can be used to query the status of a database refresh along various dimensions:

* DATABASE_REFRESH_PROGRESS returns a JSON object indicating the current refresh status for a secondary database by name.
* DATABASE_REFRESH_PROGRESS_BY_JOB returns a JSON object indicating the current refresh status for a secondary database by refresh query.

Each function is optimized for querying along the specified dimension.

> **Note:**
>
> * DATABASE_REFRESH_PROGRESS only returns the database refresh activity for the most recent database refresh if it occurred within the last
>   14 days.
> * DATABASE_REFRESH_PROGRESS_BY_JOB returns database refresh activity within the last 14 days.

See also:
:   [DATABASE_REFRESH_HISTORY](database_refresh_history.md)

## Syntax

```sqlsyntax
DATABASE_REFRESH_PROGRESS( '<secondary_db_name>' )

DATABASE_REFRESH_PROGRESS_BY_JOB( '<query_id>' )
```

## Arguments

`secondary_db_name`
:   Name of the secondary database. This argument is optional if the secondary database is the active database in the current session.

    Note that the entire name must be enclosed in single quotes.

`query_id`
:   ID of the database refresh query. The query ID can be obtained from the History  page in the web interface.

## Usage notes

* Only returns results for account administrators (users with the ACCOUNTADMIN role).
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* Following is the list of phases in the order processed:

  1. SECONDARY_UPLOADING_INVENTORY
  2. PRIMARY_UPLOADING_METADATA
  3. PRIMARY_UPLOADING_DATA
  4. SECONDARY_DOWNLOADING_METADATA
  5. SECONDARY_DOWNLOADING_DATA
  6. COMPLETED / FAILED / CANCELED

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| PHASE_NAME | TEXT | Name of the replication phases completed (or in progress) so far. For the list of phases, see the usage notes. |
| RESULT | TEXT | Status of the replication phase. Valid statuses are `EXECUTING`, `SUCCEEDED`, `CANCELLED`, `FAILED`. |
| START_TIME | NUMBER | Time when the replication phase began. Format is epoch time. |
| END_TIME | NUMBER | Time when the phase finished, if applicable. Format is epoch time. |
| DETAILS | VARIANT | Returned by the DATABASE_REFRESH_PROGRESS function only. A JSON object that provides detailed information for the following phases: . - **Primary uploading data**: The timestamp of the current snapshot of the primary database. . - **Primary uploading data** and **Secondary downloading data**: Total number of bytes in the database refresh as well as the number of bytes copied so far in the phase. . - **Secondary downloading metadata**: The number of tables, table columns, and all database objects (including tables and table columns) in the latest snapshot of the primary database. |

## Examples

Retrieve the current progress of the database refresh for the `mydb1` database:

> ```sqlexample
> select *
> from table(information_schema.database_refresh_progress(mydb1));
> ```

Retrieve the current progress of a database refresh by query ID:

> ```sqlexample
> select *
> from table(information_schema.database_refresh_progress_by_job('012a3b45-1234-a12b-0000-1aa200012345'));
> ```

---
title: DATABASE_REPLICATION_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/database_replication_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATABASE_REPLICATION_USAGE_HISTORY

This table function can be used to query the replication history for a specified database within a specified date range. The information
returned by the function includes the database name, credits consumed, and bytes transferred for replication.

> **Note:**
>
> This function returns database replication usage activity within the last 14 days.

## Syntax

```sqlsyntax
DATABASE_REPLICATION_USAGE_HISTORY(
  [ DATE_RANGE_START => <constant_expr> ]
  [ , DATE_RANGE_END => <constant_expr> ]
  [ , DATABASE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range to display the database replication history:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default
      is to show the previous 10 minutes of history).

    For example, if `DATE_RANGE_END` is CURRENT_DATE, then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

`DATABASE_NAME => 'string'`
:   Database name. If specified, only shows the history for the specified database.

    If a name is not specified, then the results include the data for each database replicated within the specified time range.

## Output

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| DATABASE_NAME | TEXT | Name of the database. |
| CREDITS_USED | TEXT | Number of credits billed for database replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for database replication during the START_TIME and END_TIME window. |

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the replication history for a 30 minute range for your account:

> ```sqlexample
> select database_name, credits_used, bytes_transferred
>   from table(information_schema.database_replication_usage_history(
>     date_range_start=>'2023-03-28 12:00:00.000 +0000',
>     date_range_end=>'2023-03-28 12:30:00.000 +0000'));
> ```

Retrieve the history for the last 12 hours for your account:

> ```sqlexample
> select database_name, credits_used, bytes_transferred
>   from table(information_schema.database_replication_usage_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the history for the past week for your account:

> ```sqlexample
> select start_time, end_time, database_name, credits_used, bytes_transferred
>   from table(information_schema.database_replication_usage_history(
>     date_range_start=>dateadd(d, -7, current_date),
>     date_range_end=>current_date));
> ```

Retrieve the replication history for the past week for database `mydb` in your account:

> ```sqlexample
> select start_time, end_time, database_name, credits_used, bytes_transferred
>   from table(information_schema.database_replication_usage_history(
>     date_range_start=>dateadd(d, -7, current_date),
>     date_range_end=>current_date,
>     database_name=>'mydb'));
> ```

---
title: DATABASE_STORAGE_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/database_storage_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DATABASE_STORAGE_USAGE_HISTORY

This table function can be used to query the average daily storage usage, in bytes, for a single database (or all the databases in your account) within a specified date range. The results include:

* All data stored in tables and materialized views in the database(s).
* All historical data maintained in Fail-safe for the database(s).

> **Note:**
>
> This function returns storage usage within the last 6 months.

See also:
:   [STAGE_STORAGE_USAGE_HISTORY](stage_storage_usage_history.md) , [WAREHOUSE_METERING_HISTORY](warehouse_metering_history.md)

## Syntax

```sqlsyntax
DATABASE_STORAGE_USAGE_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ]
      [, DATABASE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date range, within the last 6 months, for which to retrieve database storage usage:

    * If an end date is not specified, [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, `DATE_RANGE_END` is used as the start of the range (that is, the default is one day of storage usage).

    If the range falls outside the last 6 months, an error is returned.

`DATABASE_NAME => 'string'`
:   The name of the database to retrieve storage usage history for. Note that the database name must be enclosed in single quotes. Also, if the database name contains any spaces, mixed-case characters,
    or special characters, the name must be double-quoted within the single quotes (for example, `'"My DB"'` vs `'mydb'`).

    If no database is specified, data is returned for all the databases in your account.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* To call an Information Schema table function, your session must have an INFORMATION_SCHEMA schema in use *or* the function name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Date of this storage usage record. |
| DATABASE_NAME | TEXT | Name of the database. |
| AVERAGE_DATABASE_BYTES | NUMBER | Number of bytes of database storage used, including bytes currently in Time Travel. |
| AVERAGE_FAILSAFE_BYTES | NUMBER | Number of bytes of Fail-safe storage used. |

If a database has been dropped and its data retention period has passed (that is, the database cannot be recovered using Time Travel), then the database name is reported as `DROPPED_id`, where `id` is an internally-generated identifier. This ID can be used to match entries across rows returned by the table function.

## Examples

Retrieve average daily storage usage for the past 10 days, per database, for all databases in your account:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.DATABASE_STORAGE_USAGE_HISTORY(DATEADD('days',-10,CURRENT_DATE()),CURRENT_DATE()));
```

---
title: DATASKETCHES_HLL
source: https://docs.snowflake.com/en/sql-reference/functions/datasketches_hll.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window functions](../functions-window.md)

# DATASKETCHES_HLL

Returns an approximation of the distinct cardinality of the input (that is, `DATASKETCHES_HLL(col1)`
returns an approximation of `COUNT(DISTINCT col1)`).

This function is a version of the [HLL](hll.md) HyperLogLog function that can read binary sketches
in the format used by Apache DataSketches. For more information, see the
[Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

See also:
:   [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md) , [DATASKETCHES_HLL_COMBINE](datasketches_hll_combine.md) , [DATASKETCHES_HLL_ESTIMATE](datasketches_hll_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
DATASKETCHES_HLL( [ DISTINCT ] <expr1> [ , <max_log_k> ] )
```

**Window function**

```sqlsyntax
DATASKETCHES_HLL( [ DISTINCT ] <expr1> [ , <max_log_k> ] )
  OVER ( [ PARTITION BY <expr2> ] )
```

## Required arguments

`expr1`
:   The expression for which you want to know the number of distinct values.

## Optional arguments

`max_log_k`
:   The maximum value, in log2, of K to initialize the datasketches HLL object. Specify an INTEGER value between 4 and 21, inclusive.
    For more information, see the [Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

    Default: 12

`expr2`
:   The optional expression used to group rows into partitions.

## Returns

The function returns a value of type DOUBLE.

If the input is empty, the output is `0.0`.

## Usage notes

* DISTINCT is supported syntactically, but has no effect.
* The function supports arguments that are values of the following data types:

  + [String & binary data types](../data-types-text.md) (for example, VARCHAR and BINARY).

    For example, the following function calls are supported:

    ```sqlexample
    SELECT DATASKETCHES_HLL_ACCUMULATE(1::TEXT);
    ```

    ```sqlexample
    SELECT DATASKETCHES_HLL_ACCUMULATE(TO_BINARY(HEX_ENCODE(1), 'HEX'));
    ```
  + [Data types for floating-point numbers](../data-types-numeric.md) (for example, FLOAT and DOUBLE)

    The DataSketches library casts these values to DOUBLE values.
  + [Data types for fixed-point numbers](../data-types-numeric.md) (for example, INTEGER and NUMERIC).

    The function only supports numeric types with a scale of 0. However, you can cast numeric values with a scale
    other than 0 to a data types for a floating-point number.

    The DataSketches library casts these values in the range of a 64-bit signed INTEGER to a 64-bit signed INTEGER value.

    The DataSketches library doesn’t directly cast INTEGER values exceeding the 64-bit signed INTEGER range (such as 128-bit
    integer values). However, Snowflake still supports these values by automatically converting them to DOUBLE values, which
    DataSketches supports. This behavior is identical to the behavior of the `datasketches-python` library.

  Values of other data types aren’t supported. For example, VARIANT and ARRAY values aren’t supported.
* For information about NULL values and aggregate functions, see [Aggregate functions and NULL values](../functions-aggregation.md).
* When this function is called as a window function, it doesn’t support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE datasketches_demo(v INT, g INT);

INSERT INTO datasketches_demo SELECT 1, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 5, 2;
```

The following examples use the data in the table.

### Return the estimated cardinality of grouped data in a column

Use the DATASKETCHES_HLL function to approximate the distinct cardinality of the data in column `v`
grouped by the values in column `g`.

```sqlexample
SELECT g,
       DATASKETCHES_HLL(v),
       COUNT(DISTINCT v)
  FROM datasketches_demo GROUP BY g;
```

```output
+---+---------------------+-------------------+
| G | DATASKETCHES_HLL(V) | COUNT(DISTINCT V) |
|---+---------------------+-------------------|
| 1 |         2.000000005 |                 2 |
| 2 |         3.000000015 |                 3 |
+---+---------------------+-------------------+
```

The output shows that for value `1` in column `g`, there are about two distinct values in column `v`
(that is, `1` and `2`). For value `2` in column `g`, there are about three distinct values in column `v`
(that is, `1`, `4`, and `5`). The `COUNT(DISTINCT v))` call returns exact number of distinct
values instead of an estimate.

If you use the [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md) function to create binary sketches from the grouped data,
the [DATASKETCHES_HLL_ESTIMATE](datasketches_hll_estimate.md) function returns the same results for the accumulated sketches. For an
example, see [Return the cardinality estimate for accumulated binary sketches](datasketches_hll_estimate.md).

### Return the estimated cardinality of all data in a column

Use the DATASKETCHES_HLL function to approximate the distinct cardinality of all of the data in column `v`.

```sqlexample
SELECT DATASKETCHES_HLL(v),
       COUNT(DISTINCT v)
  FROM datasketches_demo;
```

```output
+---------------------+-------------------+
| DATASKETCHES_HLL(V) | COUNT(DISTINCT V) |
|---------------------+-------------------|
|          4.00000003 |                 4 |
+---------------------+-------------------+
```

The output shows that there are about four distinct values in column `v` (that is, `1`, `2`, `4`, and `5`).
The `COUNT(DISTINCT v))` call returns exact number of distinct values instead of an estimate.

If you use the [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md) function to create binary sketches from the grouped data, and
then use the [DATASKETCHES_HLL_COMBINE](datasketches_hll_combine.md) function to combine the sketches into one unified sketch,
the [DATASKETCHES_HLL_ESTIMATE](datasketches_hll_estimate.md) function returns the same results for the unified sketch. For an
example, see [Return the cardinality estimate for combined binary sketches](datasketches_hll_estimate.md).

---
title: DATASKETCHES_HLL_ACCUMULATE
source: https://docs.snowflake.com/en/sql-reference/functions/datasketches_hll_accumulate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# DATASKETCHES_HLL_ACCUMULATE

Returns the sketch at the end of aggregation.

This function is a version of the [HLL](hll.md) HyperLogLog function that can read binary sketches
in the format used by Apache DataSketches. For more information, see the
[Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

[DATASKETCHES_HLL](datasketches_hll.md) discards its intermediate state when the final cardinality estimate is returned.
In advanced use cases, such as incremental cardinality estimation during bulk loading, you might want to keep
the intermediate state. The intermediate state can later be combined (merged) with other intermediate states,
or can be exported to external tools.

In contrast to [DATASKETCHES_HLL](datasketches_hll.md), DATASKETCHES_HLL_ACCUMULATE doesn’t return a cardinality estimate.
Instead, it skips the final estimation step and returns the algorithm state itself. For more information,
see [Estimating the Number of Distinct Values](../../user-guide/querying-approximate-cardinality.md).

See also:
:   [DATASKETCHES_HLL_COMBINE](datasketches_hll_combine.md) , [DATASKETCHES_HLL_ESTIMATE](datasketches_hll_estimate.md)

## Syntax

```sqlsyntax
DATASKETCHES_HLL_ACCUMULATE( [ DISTINCT ] <expr> [ , <max_log_k> ] )
```

## Required arguments

`expr`
:   The expression for which you want to estimate cardinality (number of
    distinct values). This is typically a column name, but can be a more
    general expression.

## Optional arguments

`max_log_k`
:   The maximum value, in log2, of K for this union. Specify an INTEGER value between 4 and 21, inclusive.
    For more information, see the [Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

    Default: 12

## Returns

The function returns a BINARY value that is compatible with the Apache Datasketches library.

## Usage notes

* DISTINCT is supported syntactically, but has no effect.
* The function supports arguments that are values of the following data types:

  + [String & binary data types](../data-types-text.md) (for example, VARCHAR and BINARY).

    For example, the following function calls are supported:

    ```sqlexample
    SELECT DATASKETCHES_HLL_ACCUMULATE(1::TEXT);
    ```

    ```sqlexample
    SELECT DATASKETCHES_HLL_ACCUMULATE(TO_BINARY(HEX_ENCODE(1), 'HEX'));
    ```
  + [Data types for floating-point numbers](../data-types-numeric.md) (for example, FLOAT and DOUBLE)

    The DataSketches library casts these values to DOUBLE values.
  + [Data types for fixed-point numbers](../data-types-numeric.md) (for example, INTEGER and NUMERIC).

    The function only supports numeric types with a scale of 0. However, you can cast numeric values with a scale
    other than 0 to a data types for a floating-point number.

    The DataSketches library casts these values in the range of a 64-bit signed INTEGER to a 64-bit signed INTEGER value.

    The DataSketches library doesn’t directly cast INTEGER values exceeding the 64-bit signed INTEGER range (such as 128-bit
    integer values). However, Snowflake still supports these values by automatically converting them to DOUBLE values, which
    DataSketches supports. This behavior is identical to the behavior of the `datasketches-python` library.

  Values of other data types aren’t supported. For example, VARIANT and ARRAY values aren’t supported.

## Examples

Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE datasketches_demo(v INT, g INT);

INSERT INTO datasketches_demo SELECT 1, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 5, 2;
```

Use the DATASKETCHES_HLL_ACCUMULATE function to create two binary sketches for the data in column `v`,
grouped by the values `1` and `2` in column `g`:

```sqlexample
SELECT g,
       DATASKETCHES_HLL_ACCUMULATE(v) AS accumulated_sketches
  FROM datasketches_demo
  GROUP BY g;
```

```output
+---+------------------------------------------+
| G | ACCUMULATED_SKETCHES                     |
|---+------------------------------------------|
| 1 | 0201070C030802002BF2FB06862FF90D         |
| 2 | 0201070C030803002BF2FB0681BC5D067B65E608 |
+---+------------------------------------------+
```

---
title: DATASKETCHES_HLL_COMBINE
source: https://docs.snowflake.com/en/sql-reference/functions/datasketches_hll_combine.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# DATASKETCHES_HLL_COMBINE

Combines (merges) input sketches into a single output sketch.

This function is a version of the [HLL](hll.md) HyperLogLog function that can read binary sketches
in the format used by Apache DataSketches. For more information, see the
[Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

This function allows scenarios where the [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md) function is run over
horizontal partitions of the same table, producing an algorithm sketch for each table
partition. These sketches can later be combined using this function, producing the same output
sketch as a single run of [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md) over the entire table.

See also:
:   [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md)

## Syntax

```sqlsyntax
DATASKETCHES_HLL_COMBINE( [ DISTINCT ]  <state> [ , <max_log_k> ] )
```

## Required arguments

`state`
:   An expression that contains state information generated
    by a call to [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md).

## Optional arguments

`max_log_k`
:   The maximum value, in log2, of K for this union. Specify an INTEGER value between 4 and 21, inclusive.
    For more information, see the [Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

    Default: 12

## Returns

The function returns a BINARY value that is compatible with the Apache Datasketches library.

## Usage notes

DISTINCT is supported syntactically, but has no effect.

## Examples

Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE datasketches_demo(v INT, g INT);

INSERT INTO datasketches_demo SELECT 1, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 5, 2;
```

The following example performs the following actions:

1. The DATASKETCHES_HLL_ACCUMULATE function creates two binary sketches for the data in column `v`,
   grouped by the values `1` and `2` in column `g`.
2. The DATASKETCHES_HLL_COMBINE function combines these binary sketches.

```sqlexample
WITH
  accumulated AS (
    SELECT g,
           DATASKETCHES_HLL_ACCUMULATE(v) AS accumulated_sketches
      FROM datasketches_demo
      GROUP BY g)
SELECT DATASKETCHES_HLL_COMBINE(accumulated_sketches) AS combined_sketches
  FROM accumulated;
```

```output
+--------------------------------------------------+
| COMBINED_SKETCHES                                |
|--------------------------------------------------|
| 0201070C030804002BF2FB06862FF90D81BC5D067B65E608 |
+--------------------------------------------------+
```

You can see values of the accumulated sketches in the example in [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md).

---
title: DATASKETCHES_HLL_ESTIMATE
source: https://docs.snowflake.com/en/sql-reference/functions/datasketches_hll_estimate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window function syntax and usage](../functions-window-syntax.md)

# DATASKETCHES_HLL_ESTIMATE

Returns the cardinality estimate for the given sketch.

This function is a version of the [HLL](hll.md) HyperLogLog function that can read binary sketches
in the format used by Apache DataSketches. For more information, see the
[Apache DataSketches documentation](https://datasketches.apache.org/docs/HLL/HllSketches.html).

A sketch produced by the [DATASKETCHES_HLL_COMBINE](datasketches_hll_combine.md) function can be used
to compute a cardinality estimate using the DATASKETCHES_HLL_ESTIMATE function.

## Syntax

```sqlsyntax
DATASKETCHES_HLL_ESTIMATE( <binary_sketch> )
```

## Arguments

`binary_sketch`
:   An expression that contains sketch information in binary format.

## Returns

The function returns a value of type DOUBLE.

If the input is empty, the output is `0.0`.

> **Note:**
>
> This function returns a value of a different type than the [HLL_ESTIMATE](hll_estimate.md) function,
> which returns an INTEGER value.

## Examples

Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE datasketches_demo(v INT, g INT);

INSERT INTO datasketches_demo SELECT 1, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 2, 1;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 1, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 4, 2;
INSERT INTO datasketches_demo SELECT 5, 2;
```

The following examples use the data in the table.

### Return the cardinality estimate for accumulated binary sketches

The following example performs the following actions:

1. The DATASKETCHES_HLL_ACCUMULATE function creates two binary sketches for the data in column `v`,
   grouped by the values `1` and `2` in column `g`
2. The DATASKETCHES_HLL_ESTIMATE function returns the cardinality estimate for each accumulated sketch.

```sqlexample
WITH
  accumulated AS (
    SELECT g,
           DATASKETCHES_HLL_ACCUMULATE(v) AS accumulated_sketches
      FROM datasketches_demo
      GROUP BY g)
SELECT g, DATASKETCHES_HLL_ESTIMATE(accumulated_sketches) AS accumulated_estimate
  FROM accumulated;
```

```output
+---+----------------------+
| G | ACCUMULATED_ESTIMATE |
|---+----------------------|
| 1 |          2.000000005 |
| 2 |          3.000000015 |
+---+----------------------+
```

You can see values of the accumulated sketches in the example in [DATASKETCHES_HLL_ACCUMULATE](datasketches_hll_accumulate.md).

### Return the cardinality estimate for combined binary sketches

The following example performs the following actions:

1. The DATASKETCHES_HLL_ACCUMULATE function creates two binary sketches for the data in column `v`,
   grouped by the values `1` and `2` in column `g`
2. The DATASKETCHES_HLL_COMBINE function combines these binary sketches to unify them.
3. The DATASKETCHES_HLL_ESTIMATE function returns the cardinality estimate for the unified sketch.

```sqlexample
WITH
  accumulated AS (
    SELECT g,
           DATASKETCHES_HLL_ACCUMULATE(v) AS accumulated_sketches
      FROM datasketches_demo
      GROUP BY g),
  combined AS (
    SELECT DATASKETCHES_HLL_COMBINE(accumulated_sketches) AS unified
      FROM accumulated)
SELECT DATASKETCHES_HLL_ESTIMATE(unified) AS unified_estimate
  FROM combined;
```

```output
+------------------+
| UNIFIED_ESTIMATE |
|------------------|
|       4.00000003 |
+------------------+
```

You can see value of the combined sketches in the example in [DATASKETCHES_HLL_COMBINE](datasketches_hll_combine.md).

---
title: DATE_FROM_PARTS
source: https://docs.snowflake.com/en/sql-reference/functions/date_from_parts.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DATE_FROM_PARTS

Creates a date from individual numeric components that represent the year,
month, and day of the month.

Aliases:
:   DATEFROMPARTS

## Syntax

```sqlsyntax
DATE_FROM_PARTS( <year>, <month>, <day> )
```

## Arguments

`year`
:   The integer expression to use as a year for building a date.

`month`
:   The integer expression to use as a month for building a date, with
    January represented as 1, and December as 12.

`day`
:   The integer expression to use as a day for building a date, usually in
    the 1-31 range.

## Usage notes

DATE_FROM_PARTS is typically used to handle values in “normal” ranges
(e.g. months 1-12, days 1-31), but it also handles values from outside these
ranges. This allows, for example, choosing the N-th day in a year, which can
be used to simplify some computations.

Year, month, and day values can be negative (e.g. to calculate a date N months
prior to a specific date). The behavior of negative numbers is not entirely
intuitive; see the Examples section for details.

## Examples

Components in normal ranges:

> ```sqlexample
> SELECT DATE_FROM_PARTS(1977, 8, 7);
> +-----------------------------+
> | DATE_FROM_PARTS(1977, 8, 7) |
> |-----------------------------|
> | 1977-08-07                  |
> +-----------------------------+
> ```

Components outside normal ranges:

> * 100th day (from January 1, 2010)
> * 24 months (from January 1, 2010)
>
> ```sqlexample
> SELECT DATE_FROM_PARTS(2010, 1, 100), DATE_FROM_PARTS(2010, 1 + 24, 1);
> +-------------------------------+----------------------------------+
> | DATE_FROM_PARTS(2010, 1, 100) | DATE_FROM_PARTS(2010, 1 + 24, 1) |
> |-------------------------------+----------------------------------|
> | 2010-04-10                    | 2012-01-01                       |
> +-------------------------------+----------------------------------+
> ```

Components with zero or negative numbers:

> ```sqlexample
> SELECT DATE_FROM_PARTS(2004, 1, 1),   -- January 1, 2004, as expected.
>        DATE_FROM_PARTS(2004, 0, 1),   -- This is one month prior to DATE_FROM_PARTS(2004, 1, 1), so it's December 1, 2003.
>                                       -- This is NOT a synonym for January 1, 2004.
>        DATE_FROM_PARTS(2004, -1, 1)   -- This is two months (not one month) before DATE_FROM_PARTS(2004, 1, 1), so it's November 1, 2003.
>        ;
> +-----------------------------+-----------------------------+------------------------------+
> | DATE_FROM_PARTS(2004, 1, 1) | DATE_FROM_PARTS(2004, 0, 1) | DATE_FROM_PARTS(2004, -1, 1) |
> |-----------------------------+-----------------------------+------------------------------|
> | 2004-01-01                  | 2003-12-01                  | 2003-11-01                   |
> +-----------------------------+-----------------------------+------------------------------+
> ```
>
> ```sqlexample
> SELECT DATE_FROM_PARTS(2004, 2, 1),   -- February 1, 2004, as expected.
>        DATE_FROM_PARTS(2004, 2, 0),   -- This is one day prior to DATE_FROM_PARTS(2004, 2, 1), so it's January 31, 2004.
>        DATE_FROM_PARTS(2004, 2, -1);  -- Two days prior to DATE_FROM_PARTS(2004, 2, 1) so it's January 30, 2004.
> +-----------------------------+-----------------------------+------------------------------+
> | DATE_FROM_PARTS(2004, 2, 1) | DATE_FROM_PARTS(2004, 2, 0) | DATE_FROM_PARTS(2004, 2, -1) |
> |-----------------------------+-----------------------------+------------------------------|
> | 2004-02-01                  | 2004-01-31                  | 2004-01-30                   |
> +-----------------------------+-----------------------------+------------------------------+
> ```
>
> ```sqlexample
> SELECT DATE_FROM_PARTS(2004, -1, -1);  -- Two months and two days prior to DATE_FROM_PARTS(2004, 1, 1), so it's October 30, 2003.
> +-------------------------------+
> | DATE_FROM_PARTS(2004, -1, -1) |
> |-------------------------------|
> | 2003-10-30                    |
> +-------------------------------+
> ```

---
title: DATE_PART
source: https://docs.snowflake.com/en/sql-reference/functions/date_part.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DATE_PART

Extracts the specified date or time part from a date, time, or timestamp.

Alternatives:
:   [EXTRACT](extract.md) , [HOUR / MINUTE / SECOND](hour-minute-second.md) , [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](year.md)

## Syntax

```sqlsyntax
DATE_PART( <date_or_time_part> , <date_interval_time_or_timestamp_expr> )
```

```sqlsyntax
DATE_PART( <date_or_time_part> FROM <date_interval_time_or_timestamp_expr> )
```

## Arguments

`date_or_time_part`
:   The unit of time. Must be one of the values listed in [Supported date and time parts](../functions-date-time.md) (for example, `month`).
    The value can be a string literal or can be unquoted (for example, `'month'` or `month`).

    * When `date_or_time_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md) session parameter.
    * When `date_or_time_part` is `dayofweek` or `yearofweek` (or any of their variations), the output is controlled by the [WEEK_OF_YEAR_POLICY](../parameters.md) and [WEEK_START](../parameters.md) session parameters.

    For more information, including examples, see [Calendar weeks and weekdays](../functions-date-time.md).

`date_interval_time_or_timestamp_expr`
:   A date, an interval, a time, or a timestamp, or an expression that can be evaluated to one of those data types.

## Returns

Returns a value of NUMBER data type.

## Usage notes

* When `date_interval_time_or_timestamp_expr` is a year-month interval value, the supported
  `date_or_time_part` values are `year` and `month`.
* When `date_interval_time_or_timestamp_expr` is a day-time interval value, the supported
  `date_or_time_part` values are `day`, `hour`, `minute`, `second`, and `nanosecond`.
* Currently, when `date_interval_time_or_timestamp_expr` is a DATE value, the following `date_or_time_part`
  values aren’t supported:

  + `epoch_millisecond`
  + `epoch_microsecond`
  + `epoch_nanosecond`

  Other [date and time parts](../functions-date-time.md) (including `epoch_second`) are supported.

> **Tip:**
>
> To extract a full DATE or TIME value instead of a single part from a TIMESTAMP value, you can cast the
> TIMESTAMP value to a DATE or TIME value, respectively. For example:
>
> ```sqlexample
> SELECT '2025-04-08T23:39:20.123-07:00'::TIMESTAMP::DATE AS full_date_value;
> ```
>
> ```output
> +-----------------+
> | FULL_DATE_VALUE |
> |-----------------|
> | 2025-04-08      |
> +-----------------+
> ```
>
> ```sqlexample
> SELECT '2025-04-08T23:39:20.123-07:00'::TIMESTAMP::TIME AS full_time_value;
> ```
>
> ```output
> +-----------------+
> | FULL_TIME_VALUE |
> |-----------------|
> | 23:39:20        |
> +-----------------+
> ```

## Examples

This shows a simple example of extracting part of a DATE:

```sqlexample
SELECT DATE_PART(quarter, '2024-04-08'::DATE);
```

```output
+----------------------------------------+
| DATE_PART(QUARTER, '2024-04-08'::DATE) |
|----------------------------------------|
|                                      2 |
+----------------------------------------+
```

This shows an example of extracting part of a TIMESTAMP:

```sqlexample
SELECT TO_TIMESTAMP(
  '2024-04-08T23:39:20.123-07:00') AS "TIME_STAMP1",
  DATE_PART(year, "TIME_STAMP1") AS "EXTRACTED YEAR";
```

```output
+-------------------------+----------------+
| TIME_STAMP1             | EXTRACTED YEAR |
|-------------------------+----------------|
| 2024-04-08 23:39:20.123 |           2024 |
+-------------------------+----------------+
```

This shows an example of converting a TIMESTAMP to the number of seconds since
the beginning of the [Unix epoch](https://en.wikipedia.org/wiki/Unix_time) (midnight January 1, 1970):

```sqlexample
SELECT TO_TIMESTAMP(
  '2024-04-08T23:39:20.123-07:00') AS "TIME_STAMP1",
  DATE_PART(epoch_second, "TIME_STAMP1") AS "EXTRACTED EPOCH SECOND";
```

```output
+-------------------------+------------------------+
| TIME_STAMP1             | EXTRACTED EPOCH SECOND |
|-------------------------+------------------------|
| 2024-04-08 23:39:20.123 |             1712619560 |
+-------------------------+------------------------+
```

This shows an example of converting a TIMESTAMP to the number of milliseconds since
the beginning of the [Unix epoch](https://en.wikipedia.org/wiki/Unix_time) (midnight January 1, 1970):

```sqlexample
SELECT TO_TIMESTAMP(
  '2024-04-08T23:39:20.123-07:00') AS "TIME_STAMP1",
  DATE_PART(epoch_millisecond, "TIME_STAMP1") AS "EXTRACTED EPOCH MILLISECOND";
```

```output
+-------------------------+-----------------------------+
| TIME_STAMP1             | EXTRACTED EPOCH MILLISECOND |
|-------------------------+-----------------------------|
| 2024-04-08 23:39:20.123 |               1712619560123 |
+-------------------------+-----------------------------+
```

---
title: DATE_TRUNC
source: https://docs.snowflake.com/en/sql-reference/functions/date_trunc.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DATE_TRUNC

Truncates a DATE, TIME, or TIMESTAMP value to the specified precision. For example,
truncating a timestamp down to the quarter returns the timestamp corresponding
to midnight of the first day of the original timestamp’s quarter.

This function provides an alternative syntax for [TRUNCATE, TRUNC](trunc2.md) by reversing the
two arguments.

Truncation is not the same as extraction. For example:

* Truncating a timestamp down to the quarter using this function returns the timestamp corresponding
  to midnight of the first day of the quarter for the input timestamp.
* Extracting the quarter date part from a timestamp using the [EXTRACT](extract.md) function returns the
  quarter number of the year in the timestamp.

Alternatives:
:   [TRUNCATE, TRUNC](trunc2.md)

See also:
:   [DATE_PART](date_part.md) , [EXTRACT](extract.md)

## Syntax

```sqlsyntax
DATE_TRUNC( <date_or_time_part>, <date_or_time_expr> )
```

## Arguments

`date_or_time_part`
:   This argument must be one of the values listed in [Supported date and time parts](../functions-date-time.md).

`date_or_time_expr`
:   This argument must evaluate to a date, time, or timestamp.

## Returns

The returned value is the same type as the input value.

For example, if the input value is a TIMESTAMP, then the returned value is a TIMESTAMP.

## Usage notes

* When `date_or_time_part` is `week` (or any of its variations), the output is controlled
  by the [WEEK_START](../parameters.md) session parameter. For more details, including examples, see
  [Calendar weeks and weekdays](../functions-date-time.md).
* For TIME values, you can’t specify a `date_or_time_part` that is outside the scope of the TIME type.
  For example, you can truncate a TIMESTAMP value to a `day`, `week`, `year`, and so on because the TIMESTAMP type
  encodes date/times with the required precision. However, trying to truncate a TIME value to a `day`, `week`, `year`,
  and so on causes an error.

## Examples

The DATE_TRUNC function examples use the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE test_date_trunc (
 mydate DATE,
 mytime TIME,
 mytimestamp TIMESTAMP);

INSERT INTO test_date_trunc VALUES (
  '2024-05-09',
  '08:50:48',
  '2024-05-09 08:50:57.891 -0700');

SELECT * FROM test_date_trunc;
```

```output
+------------+----------+-------------------------+
| MYDATE     | MYTIME   | MYTIMESTAMP             |
|------------+----------+-------------------------|
| 2024-05-09 | 08:50:48 | 2024-05-09 08:50:57.891 |
+------------+----------+-------------------------+
```

The following examples show date truncation. In all cases, the returned value
is of the same data type as the input value, but with zeros for the portions,
such as fractional seconds, that were truncated.

Truncate a date down to the year, month, and day:

```sqlexample
SELECT mydate AS "DATE",
       DATE_TRUNC('year', mydate) AS "TRUNCATED TO YEAR",
       DATE_TRUNC('month', mydate) AS "TRUNCATED TO MONTH",
       DATE_TRUNC('week', mydate) AS "TRUNCATED TO WEEK",
       DATE_TRUNC('day', mydate) AS "TRUNCATED TO DAY"
  FROM test_date_trunc;
```

```output
+------------+-------------------+--------------------+-------------------+------------------+
| DATE       | TRUNCATED TO YEAR | TRUNCATED TO MONTH | TRUNCATED TO WEEK | TRUNCATED TO DAY |
|------------+-------------------+--------------------+-------------------+------------------|
| 2024-05-09 | 2024-01-01        | 2024-05-01         | 2024-05-06        | 2024-05-09       |
+------------+-------------------+--------------------+-------------------+------------------+
```

Truncate a time down to the minute:

```sqlexample
SELECT mytime AS "TIME",
       DATE_TRUNC('minute', mytime) AS "TRUNCATED TO MINUTE"
  FROM test_date_trunc;
```

```output
+----------+---------------------+
| TIME     | TRUNCATED TO MINUTE |
|----------+---------------------|
| 08:50:48 | 08:50:00            |
+----------+---------------------+
```

Truncate a TIMESTAMP down to the hour, minute, and second:

```sqlexample
SELECT mytimestamp AS "TIMESTAMP",
       DATE_TRUNC('hour', mytimestamp) AS "TRUNCATED TO HOUR",
       DATE_TRUNC('minute', mytimestamp) AS "TRUNCATED TO MINUTE",
       DATE_TRUNC('second', mytimestamp) AS "TRUNCATED TO SECOND"
  FROM test_date_trunc;
```

```output
+-------------------------+-------------------------+-------------------------+-------------------------+
| TIMESTAMP               | TRUNCATED TO HOUR       | TRUNCATED TO MINUTE     | TRUNCATED TO SECOND     |
|-------------------------+-------------------------+-------------------------+-------------------------|
| 2024-05-09 08:50:57.891 | 2024-05-09 08:00:00.000 | 2024-05-09 08:50:00.000 | 2024-05-09 08:50:57.000 |
+-------------------------+-------------------------+-------------------------+-------------------------+
```

Contrast the DATE_TRUNC function with the [EXTRACT](extract.md) function:

```sqlexample
SELECT DATE_TRUNC('quarter', mytimestamp) AS "TRUNCATED",
       EXTRACT('quarter', mytimestamp) AS "EXTRACTED"
  FROM test_date_trunc;
```

```output
+-------------------------+-----------+
| TRUNCATED               | EXTRACTED |
|-------------------------+-----------|
| 2024-04-01 00:00:00.000 |         2 |
+-------------------------+-----------+
```

---
title: DATEADD
source: https://docs.snowflake.com/en/sql-reference/functions/dateadd.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DATEADD

Adds the specified value for the specified date or time part to a date, time, or timestamp.

Aliases:
:   [TIMEADD](timeadd.md) , [TIMESTAMPADD](timestampadd.md)

See also:
:   [ADD_MONTHS](add_months.md)

## Syntax

```sqlsyntax
DATEADD( <date_or_time_part>, <value>, <date_or_time_expr> )
```

## Arguments

`date_or_time_part`
:   This indicates the units of time that you want to add. For example if you
    want to add two days, then specify `day`. This unit of measure must
    be one of the values listed in [Supported date and time parts](../functions-date-time.md).

`value`
:   This is the number of units of time that you want to add. For example,
    if the units of time is `day`, and you want to add two days, specify `2`.
    If you want to subtract two days, specify `-2`.

`date_or_time_expr`
:   `date_or_time_expr` must evaluate to a date, time, or timestamp.
    This is the date, time, or timestamp to which you want to add.
    For example, if you want to add two days to August 1, 2024, then specify
    `'2024-08-01'::DATE`.

    If the data type is TIME, then the `date_or_time_part`
    must be in units of hours or smaller, not days or bigger.

    If the input data type is DATE, and the `date_or_time_part` is hours
    or smaller, the input value will not be rejected, but instead will be
    treated as a TIMESTAMP with hours, minutes, seconds, and fractions of
    a second all initially set to 0 (e.g. midnight on the specified date).

## Returns

If `date_or_time_expr` is a time, then the return data type is a time.

If `date_or_time_expr` is a timestamp, then the return data type is a timestamp.

If `date_or_time_expr` is a date:

> * If `date_or_time_part` is `day` or larger (for example, `month`, `year`),
>   the function returns a DATE value.
> * If `date_or_time_part` is smaller than a day (for example, `hour`, `minute`,
>   `second`), the function returns a TIMESTAMP_NTZ value, with `00:00:00.000` as the starting
>   time for the date.

## Usage notes

When `date_or_time_part` is `year`, `quarter`, or `month` (or any of their variations),
if the result month has fewer days than the original day of the month, the result day of the month might
be different from the original day.

## Examples

The TIMEADD and TIMESTAMPADD functions are aliases for the DATEADD function. You can use any of these three
functions in the examples to return the same results.

Add years to a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, 2, TO_DATE('2022-05-08')) AS date_plus_two_years;
```

```output
+---------------+---------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS |
|---------------+---------------------|
| 2022-05-08    | 2024-05-08          |
+---------------+---------------------+
```

Subtract years from a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, -2, TO_DATE('2022-05-08')) AS date_minus_two_years;
```

```output
+---------------+----------------------+
| ORIGINAL_DATE | DATE_MINUS_TWO_YEARS |
|---------------+----------------------|
| 2022-05-08    | 2020-05-08           |
+---------------+----------------------+
```

Add two years and two hours to a date. First, set the timestamp output format, create a table,
and insert data:

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF9';
CREATE TABLE datetest (d date);
INSERT INTO datetest VALUES ('2022-04-05');
```

Run a query that adds two years and two hours to a date:

```sqlexample
SELECT d AS original_date,
       DATEADD(year, 2, d) AS date_plus_two_years,
       TO_TIMESTAMP(d) AS original_timestamp,
       DATEADD(hour, 2, d) AS timestamp_plus_two_hours
  FROM datetest;
```

```output
+---------------+---------------------+-------------------------+--------------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS | ORIGINAL_TIMESTAMP      | TIMESTAMP_PLUS_TWO_HOURS |
|---------------+---------------------+-------------------------+--------------------------|
| 2022-04-05    | 2024-04-05          | 2022-04-05 00:00:00.000 | 2022-04-05 02:00:00.000  |
+---------------+---------------------+-------------------------+--------------------------+
```

Add a month to a date in a month with the same or more days than the
resulting month. For example, if the date is January 31, adding a month should not
return February 31.

```sqlexample
SELECT DATEADD(month, 1, '2023-01-31'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-02-28          |
+---------------------+
```

Add a month to a date in a month with fewer days than the resulting month.
Adding a month to February 28 returns March 28.

```sqlexample
SELECT DATEADD(month, 1, '2023-02-28'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-03-28          |
+---------------------+
```

Add hours to a time:

```sqlexample
SELECT TO_TIME('05:00:00') AS original_time,
       DATEADD(hour, 3, TO_TIME('05:00:00')) AS time_plus_three_hours;
```

```output
+---------------+-----------------------+
| ORIGINAL_TIME | TIME_PLUS_THREE_HOURS |
|---------------+-----------------------|
| 05:00:00      | 08:00:00              |
+---------------+-----------------------+
```

---
title: DATEDIFF
source: https://docs.snowflake.com/en/sql-reference/functions/datediff.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DATEDIFF

Calculates the difference between two date, time, or timestamp expressions based on the date or time part requested.
The function returns the result of subtracting the second argument from the third argument.

> **Note:**
>
> Difference calculations compare the specified date or time part, not the complete date or time. For example, the month
> difference between November 28, 2024 and December 5, 2024 is 1, because the difference between the two months November
> and December, both in 2024, is 1. To reflect the fact that the difference between the two dates is less than a full
> month, calculate the difference in days instead.

You can also use the minus sign (`-`) to calculate the difference between two dates by subtracting one date from
another.

To add units of time to a date, time, or timestamp (for example, add two days to a date) or subtract units of time
from them, you can use the [DATEADD](dateadd.md), [TIMEADD](timeadd.md), or [TIMESTAMPADD](timestampadd.md) function.

See also:
:   [TIMEDIFF](timediff.md) , [TIMESTAMPDIFF](timestampdiff.md)

## Syntax

**For DATEDIFF:**

```sqlsyntax
DATEDIFF( <date_or_time_part>, <date_or_time_expr1>, <date_or_time_expr2> )
```

**For minus sign:**

```sqlsyntax
<date_expr2> - <date_expr1>
```

## Arguments

**For DATEDIFF:**

`date_or_time_part`
:   The unit of time. Must be one of the values listed in [Supported date and time parts](../functions-date-time.md) (for example, `month`).
    The value can be a string literal or can be unquoted (for example, `'month'` or `month`).

`date_or_time_expr1`, `date_or_time_expr2`
:   The values to compare. Must be a date, a time, a timestamp, or an expression that can be evaluated to
    a date, a time, or a timestamp. The value `date_or_time_expr1` is subtracted from
    `date_or_time_expr2`.

**For minus sign:**

`date_expr1`, `date_expr2`
:   The values to compare. Must be a date, or an expression that can be evaluated to a date. The value `date_expr1` is
    subtracted from `date_expr2`.

## Returns

**For DATEDIFF:**

Returns an integer representing the difference in the number of units (seconds, days, and so on) between `date_or_time_expr2` and
`date_or_time_expr1`.

Returns NULL if any argument is NULL.

**For minus sign:**

Returns an integer representing the number of days difference between `date_expr2` and
`date_expr1`. (The units are always days.)

Returns an error if `date_expr2` or `date_expr1` is NULL.

## Usage notes

**For both DATEDIFF and minus sign:**

* Output values can be negative, for example, -12 days.

**For DATEDIFF:**

* The function supports units of years, quarters, months, weeks, days, hours, minutes, seconds, milliseconds, microseconds, and nanoseconds.
* If `date_or_time_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md) session parameter. For more details, including examples, see
  [Calendar weeks and weekdays](../functions-date-time.md).
* The unit (for example, `month`) used to calculate the difference determines which parts of the DATE, TIME, or TIMESTAMP field are
  evaluated. So, the unit determines the precision of the result.

  Smaller units are not used, so values are not rounded. For example, even though the difference between January 1, 2021 and
  February 28, 2021 is closer to two months than to one month, the following returns one month:

  ```sqlexample
  DATEDIFF(month, '2021-01-01'::DATE, '2021-02-28'::DATE)
  ```

  For a DATE value:

  > + `year` uses only the year and disregards all the other parts.
  > + `month` uses the month and year.
  > + `day` uses the entire date.

  For a TIME value:

  > + `hour` uses only the hour and disregards all the other parts.
  > + `minute` uses the hour and minute.
  > + `second` uses the hour, minute, and second, but not the fractional seconds.
  > + `millisecond` uses the hour, minute, second, and first three digits of the fractional seconds. Fractional
  >   seconds are not rounded. For example, `DATEDIFF(milliseconds, '2024-02-20 21:18:41.0000', '2024-02-20 21:18:42.1239')` returns 1.123 seconds,
  >   not 1.124 seconds.
  > + `microsecond` uses the hour, minute, second, and first six digits of the fractional seconds. Fractional
  >   seconds are not rounded.
  > + `nanosecond` uses the hour, minute, second, and all nine digits of the fractional seconds.

  For a TIMESTAMP value:

  > The rules match the rules for DATE and TIME data types above. Only the specified unit and larger units are used.

**For minus sign:**

* `date_expr1` and `date_expr2` must both be dates. Times and timestamps are not allowed.

## Examples

Calculate the difference in years between two timestamps:

```sqlexample
SELECT DATEDIFF(year,
                '2020-04-09 14:39:20'::TIMESTAMP,
                '2023-05-08 23:39:20'::TIMESTAMP)
  AS diff_years;
```

```output
+------------+
| DIFF_YEARS |
|------------|
|          3 |
+------------+
```

Calculate the difference in hours between two timestamps:

```sqlexample
SELECT DATEDIFF(hour,
               '2023-05-08T23:39:20.123-07:00'::TIMESTAMP,
               DATEADD(year, 2, ('2023-05-08T23:39:20.123-07:00')::TIMESTAMP))
  AS diff_hours;
```

```output
+------------+
| DIFF_HOURS |
|------------|
|      17544 |
+------------+
```

Demonstrate how date parts affect DATEDIFF calculations; also, demonstrate use of the minus sign for date
subtraction:

```sqlexample
SELECT column1 date_1, column2 date_2,
       DATEDIFF(year, column1, column2) diff_years,
       DATEDIFF(month, column1, column2) diff_months,
       DATEDIFF(day, column1, column2) diff_days,
       column2::DATE - column1::DATE AS diff_days_via_minus
  FROM VALUES
       ('2015-12-30', '2015-12-31'),
       ('2015-12-31', '2016-01-01'),
       ('2016-01-01', '2017-12-31'),
       ('2016-08-23', '2016-09-07');
```

```output
+------------+------------+------------+-------------+-----------+---------------------+
| DATE_1     | DATE_2     | DIFF_YEARS | DIFF_MONTHS | DIFF_DAYS | DIFF_DAYS_VIA_MINUS |
|------------+------------+------------+-------------+-----------+---------------------|
| 2015-12-30 | 2015-12-31 |          0 |           0 |         1 |                   1 |
| 2015-12-31 | 2016-01-01 |          1 |           1 |         1 |                   1 |
| 2016-01-01 | 2017-12-31 |          1 |          23 |       730 |                 730 |
| 2016-08-23 | 2016-09-07 |          0 |           1 |        15 |                  15 |
+------------+------------+------------+-------------+-----------+---------------------+
```

Demonstrate how time parts affect DATEDIFF calculations:

```sqlexample
ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT = 'DY, DD MON YYYY HH24:MI:SS';
```

```sqlexample
SELECT column1 timestamp_1, column2 timestamp_2,
       DATEDIFF(hour, column1, column2) diff_hours,
       DATEDIFF(minute, column1, column2) diff_minutes,
       DATEDIFF(second, column1, column2) diff_seconds
  FROM VALUES
       ('2016-01-01 01:59:59'::TIMESTAMP, '2016-01-01 02:00:00'::TIMESTAMP),
       ('2016-01-01 01:00:00'::TIMESTAMP, '2016-01-01 01:59:00'::TIMESTAMP),
       ('2016-01-01 01:00:59'::TIMESTAMP, '2016-01-01 02:00:00'::TIMESTAMP);
```

```output
+---------------------------+---------------------------+------------+--------------+--------------+
| TIMESTAMP_1               | TIMESTAMP_2               | DIFF_HOURS | DIFF_MINUTES | DIFF_SECONDS |
|---------------------------+---------------------------+------------+--------------+--------------|
| Fri, 01 Jan 2016 01:59:59 | Fri, 01 Jan 2016 02:00:00 |          1 |            1 |            1 |
| Fri, 01 Jan 2016 01:00:00 | Fri, 01 Jan 2016 01:59:00 |          0 |           59 |         3540 |
| Fri, 01 Jan 2016 01:00:59 | Fri, 01 Jan 2016 02:00:00 |          1 |           60 |         3541 |
+---------------------------+---------------------------+------------+--------------+--------------+
```

Use the [CURRENT_TIMESTAMP](current_timestamp.md) function with the DATEDIFF function to
calculate the difference in years, months, and days between a specified timestamp and the
current timestamp:

```sqlexample
SELECT column1 specified_timestamp,
       column2 timestamp_now,
       DATEDIFF(year, column1, column2) diff_years,
       DATEDIFF(month, column1, column2) diff_months,
       DATEDIFF(day, column1, column2) diff_days,
       column2::DATE - column1::DATE AS diff_days_via_minus
  FROM VALUES
    ('2012-08-23 09:00:00.000 -0700', CURRENT_TIMESTAMP);
```

```output
+-------------------------------+-------------------------------+------------+-------------+-----------+---------------------+
| SPECIFIED_TIMESTAMP           | TIMESTAMP_NOW                 | DIFF_YEARS | DIFF_MONTHS | DIFF_DAYS | DIFF_DAYS_VIA_MINUS |
|-------------------------------+-------------------------------+------------+-------------+-----------+---------------------|
| 2012-08-23 09:00:00.000 -0700 | 2024-09-04 17:21:12.189 -0700 |         12 |         145 |      4395 |                4395 |
+-------------------------------+-------------------------------+------------+-------------+-----------+---------------------+
```

---
title: DAYNAME
source: https://docs.snowflake.com/en/sql-reference/functions/dayname.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# DAYNAME

Extracts the three-letter day-of-week name from the specified date or timestamp.

> **Note:**
>
> To return the full name for the day of the week instead of the three-letter day-of-week name,
> you can use the EXTRACT function, the DECODE function, and the `dayofweek` part. See
> [EXTRACT](extract.md) for an example.

## Syntax

```sqlsyntax
DAYNAME( <date_or_timestamp_expr> )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

## Returns

Returns a value of VARCHAR data type.

## Examples

Use the [TO_DATE](to_date.md) function to get the abbreviation for the day of the week of April 1, 2024:

```sqlexample
SELECT DAYNAME(TO_DATE('2024-04-01')) AS DAY;
```

```output
+-----+
| DAY |
|-----|
| Mon |
+-----+
```

Use the [TO_TIMESTAMP_NTZ](to_timestamp.md) function to get the abbreviation for the day of the week of April 2, 2024:

```sqlexample
SELECT DAYNAME(TO_TIMESTAMP_NTZ('2024-04-02 10:00')) AS DAY;
```

```output
+-----+
| DAY |
|-----|
| Tue |
+-----+
```

Get the abbreviation for the day of the week for each day from January 1, 2024, to January 8, 2024:

```sqlexample
CREATE OR REPLACE TABLE dates (d DATE);
```

```sqlexample
INSERT INTO dates (d) VALUES
  ('2024-01-01'::DATE),
  ('2024-01-02'::DATE),
  ('2024-01-03'::DATE),
  ('2024-01-04'::DATE),
  ('2024-01-05'::DATE),
  ('2024-01-06'::DATE),
  ('2024-01-07'::DATE),
  ('2024-01-08'::DATE);
```

```sqlexample
SELECT d, DAYNAME(d)
  FROM dates
  ORDER BY d;
```

```output
+------------+------------+
| D          | DAYNAME(D) |
|------------+------------|
| 2024-01-01 | Mon        |
| 2024-01-02 | Tue        |
| 2024-01-03 | Wed        |
| 2024-01-04 | Thu        |
| 2024-01-05 | Fri        |
| 2024-01-06 | Sat        |
| 2024-01-07 | Sun        |
| 2024-01-08 | Mon        |
+------------+------------+
```

---
title: DBT_PROJECT_EXECUTION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/dbt_project_execution_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DBT_PROJECT_EXECUTION_HISTORY

Returns the execution history of [dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

Call this function to get metadata and results from past dbt Project executions within seven days of the current time. Optionally, specify the values to filter the results by.

Use this function with the following system functions to access dbt artifacts and logs programmatically:

* [SYSTEM$GET_DBT_LOG](system_get_dbt_log.md)
* [SYSTEM$LOCATE_DBT_ARCHIVE](system_locate_dbt_archive.md)
* [SYSTEM$LOCATE_DBT_ARTIFACTS](system_locate_dbt_artifacts.md)

For more information, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

See also:
:   [CREATE DBT PROJECT](../sql/create-dbt-project.md), [EXECUTE DBT PROJECT](../sql/execute-dbt-project.md)

## Syntax

```sqlsyntax
DBT_PROJECT_EXECUTION_HISTORY (
  [ OBJECT_NAME => '<name>' ]
  [ , OBJECT_TYPE = { WORKSPACE | DBT PROJECT }]
  [ , START_TIME_RANGE_START => <start_time> ]
  [ , START_TIME_RANGE_END => <end_time>  ]
  [ , RESULT_LIMIT = <integer> ]
  [ , COMMAND = <dbt_command> ]
  [ , USER_NAME = <user_name> ]
  [ , DATABASE = <db_name> ]
  [ , SCHEMA = <schema_name> ]
)
```

## Arguments

`OBJECT_NAME = <name>`
:   Name of the workspace or dbt project that the run belongs to.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`OBJECT_TYPE = { WORKSPACE | DBT PROJECT }`
:   The type of the object, WORKSPACE or DBT PROJECT, the run belongs to.

`START_TIME_RANGE_START | START_TIME_RANGE_END = timestamp`
:   Timestamp to filter a range of dbt project runs.

`RESULT_LIMIT = integer`
:   An integer specifying the maximum number of rows returned by the function, from 1 - 10,000 inclusive.

    Default: 100

`COMMAND = dbt_command`
:   Specifies the [dbt command](https://docs.getdbt.com/reference/dbt-commands) used to execute the dbt project.

`USER_NAME = user_name`
:   Name of the user that initiated the dbt project object run.

`DATABASE = db_name`
:   Return only records for the specified database.

`SCHEMA = schema_name`
:   Return only records for the specified schema.

## Output

The function returns the following columns.

To view these columns, you must use a role with the MONITOR privilege.

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | TEXT | ID of the query. |
| QUERY_START_TIME | TIMESTAMP_LTZ | The time the query started. |
| QUERY_END_TIME | TIMESTAMP_LTZ | The time the query ended. |
| USER_NAME | TEXT | The user that created the dbt Project. |
| OBJECT_NAME | TEXT | Name of the workspace or dbt Project the run belonged to. |
| OBJECT_TYPE | TEXT | Type of object, such as WORKSPACE or DBT PROJECT. |
| DATABASE_NAME | TEXT | Database of the object. |
| SCHEMA_NAME | TEXT | Schema of the object. |
| COMMAND | TEXT | The command that was run for the object. |
| ARGS | TEXT | The arguments that were used in the run for the object. |
| ERROR_CODE | NUMBER | If applicable, the error code for the run. |
| ERROR_MESSAGE | TEXT | If applicable, error message stating why the run failed. |
| WAREHOUSE | TEXT | Warehouse used for the object. |
| STATE | TEXT | State of run, such as HANDLED_ERROR or SUCCESS. |
| DBT_VERSION | TEXT | The specific version used for this run. For example, `1.9.4`. |
| DBT_SNOWFLAKE_VERSION | TEXT | The specific dbt Projects on Snowflake version with patch version used for this run. For example, `1.9.4`. |

## Access control requirements

This table function includes only runs from workspaces and dbt Projects in which you have the following privileges:

* OWNERSHIP, READ, or WRITE on workspaces
* OWNERSHIP, USAGE, or MONITOR on dbt Projects

## Usage notes

* Use the exact dbt Project name (case-sensitive if created with quotes). If no row matches (wrong dbt Project name or no runs yet), you might get an `Inputs may not be null.` error.

## Examples

The following example audits which engine version was used for recent runs:

```sqlexample
SELECT
    query_start_time,
    query_id,
    dbt_version
FROM
  TABLE (
    INFORMATION_SCHEMA.DBT_PROJECT_EXECUTION_HISTORY (
     OBJECT_NAME => 'finance_analytics'
    )
  );
```

For detailed examples of using the DBT_PROJECT_EXECUTION_HISTORY table function with system functions to access dbt artifacts and logs programmatically,
see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

---
title: DECODE
source: https://docs.snowflake.com/en/sql-reference/functions/decode.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# DECODE

Compares the select expression to each search expression in order. As soon as
a search expression matches the selection expression, the corresponding result
expression is returned.

> **Note:**
>
> DECODE in Snowflake is different from the DECODE function in PostgreSQL,
> which converts data into different encodings.

## Syntax

```sqlsyntax
DECODE( <expr> , <search1> , <result1> [ , <search2> , <result2> ... ] [ , <default> ] )
```

## Arguments

`expr`
:   This is the “select expression”. The “search expressions” are compared to
    this select expression, and if there is a match then `DECODE`
    returns the result that corresponds to that search expression. The select
    expression is typically a column, but can be a subquery, literal, or other
    expression.

`searchN`
:   The search expressions indicate the values to compare to the select
    expression. If one of these search expressions matches, the function returns
    the corresponding `result`. If more than one search expression would
    match, only the first match’s result is returned.

`resultN`
:   The results are the values that will be returned if one of the search
    expressions matches the select expression.

`default`
:   If an optional default is specified, and if none of the search expressions
    match the select expression, then `DECODE` returns this default value.

## Usage notes

* Note that, contrary to [CASE](case.md), a NULL value in the select expression
  matches a NULL value in the search expressions.
* The `expr` can include set operators, such as `UNION`,
  `INTERSECT`, `EXCEPT`, and `MINUS`. When using set operators,
  make sure that data types are compatible. For details, see the
  [General usage notes](../operators-query.md) in the
  [Set operators](../operators-query.md) topic.

## Collation details

* The collation specifications of the select expression and the search expressions must all be compatible.
* The value returned from the function retains the collation specification of the result with the
  highest-[precedence](../collation.md) collation.

## Examples

Create a table and insert rows:

> ```sqlexample
> CREATE TABLE d (column1 INTEGER);
> INSERT INTO d (column1) VALUES
>     (1),
>     (2),
>     (NULL),
>     (4);
> ```

Example with a default value `'other'` (note that NULL equals NULL):

> ```sqlexample
> SELECT column1, decode(column1,
>                        1, 'one',
>                        2, 'two',
>                        NULL, '-NULL-',
>                        'other'
>                       ) AS decode_result
>     FROM d;
> +---------+---------------+
> | COLUMN1 | DECODE_RESULT |
> |---------+---------------|
> |       1 | one           |
> |       2 | two           |
> |    NULL | -NULL-        |
> |       4 | other         |
> +---------+---------------+
> ```

Example without a default value (note that the non-matching value returns NULL):

> ```sqlexample
> SELECT column1, decode(column1,
>                        1, 'one',
>                        2, 'two',
>                        NULL, '-NULL-'
>                        ) AS decode_result
>     FROM d;
> +---------+---------------+
> | COLUMN1 | DECODE_RESULT |
> |---------+---------------|
> |       1 | one           |
> |       2 | two           |
> |    NULL | -NULL-        |
> |       4 | NULL          |
> +---------+---------------+
> ```

---
title: DECOMPRESS_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/decompress_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Compression/Decompression)

# DECOMPRESS_BINARY

Decompresses the compressed `BINARY` input parameter.

See also:
:   [COMPRESS](compress.md) , [DECOMPRESS_STRING](decompress_string.md)

## Syntax

```sqlsyntax
DECOMPRESS_BINARY(<input>, <method>)
```

## Arguments

**Required:**

`input`
:   A `BINARY` value (or expression) with data that was compressed using one
    of the compression methods specified in [COMPRESS](compress.md).

    If you attempt to decompress a compressed string, rather than a
    compressed `BINARY` value, you do not get an error; instead, the function
    returns a `BINARY` value. See the Usage Notes below for details.

`method`
:   The compression method originally used to compress the `input`.
    See [COMPRESS](compress.md) for a list of compression
    methods.

    The `DECOMPRESS_BINARY` method, unlike the `COMPRESS` method, does
    not require you to specify the compression level. If you do specify
    the compression level, `DECOMPRESS_BINARY` ignores it and uses the actual
    compression level.

## Returns

The data type of the returned value is `BINARY`.

## Usage notes

* If the compression method is unknown or invalid, the query fails.
* The compression method name (e.g. `ZLIB`) is case-insensitive.
* The `DECOMPRESS_BINARY` function can decompress data that was
  originally in string format. However, the output of `DECOMPRESS_BINARY`
  is still `BINARY`, not string. For example,
  `SELECT DECOMPRESS_BINARY(COMPRESS('Hello', 'SNAPPY), 'SNAPPY')` returns a
  `BINARY` value; if you display that value, it is shown as
  `48656C6C6F`, which is the hexadecimal representation of ‘Hello’.
  To avoid confusion, Snowflake recommends decompressing string data
  by using [DECOMPRESS_STRING](decompress_string.md) rather than `DECOMPRESS_BINARY`.

## Returns

A `BINARY` value with decompressed data.

## Examples

This shows a simple example of decompressing `BINARY` data that contains
a compressed value.

```sqlexample
SELECT DECOMPRESS_BINARY(TO_BINARY('0920536E6F77666C616B65', 'HEX'), 'SNAPPY');
+-------------------------------------------------------------------------+
| DECOMPRESS_BINARY(TO_BINARY('0920536E6F77666C616B65', 'HEX'), 'SNAPPY') |
|-------------------------------------------------------------------------|
| 536E6F77666C616B65                                                      |
+-------------------------------------------------------------------------+
```

---
title: DECOMPRESS_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/decompress_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Compression/Decompression)

# DECOMPRESS_STRING

Decompresses the compressed `BINARY` input parameter to a string.

See also:
:   [COMPRESS](compress.md) , [DECOMPRESS_BINARY](decompress_binary.md)

## Syntax

```sqlsyntax
DECOMPRESS_STRING(<input>, <method>)
```

## Arguments

**Required:**

`input`
:   A `BINARY` value (or expression) with data that was compressed using one
    of the compression methods specified in [COMPRESS](compress.md).

`method`
:   The compression method originally used to compress the `input`.
    See [COMPRESS](compress.md) for a list of compression
    methods.

    The `DECOMPRESS_STRING` method, unlike the `COMPRESS` method, does
    not require you to specify the compression level. If you do specify
    the compression level, `DECOMPRESS_STRING` ignores it and uses the
    actual compression level.

## Returns

A string with decompressed data.

## Usage notes

* If the compression method is unknown or invalid, the query fails.
* The compression method name (e.g. `ZLIB`) is case insensitive.
* If you use `DECOMPRESS_STRING` to decompress a compressed `BINARY`
  value, rather than a compressed string value, you do
  not necessarily get an error; instead, the function attempts to treat
  the `BINARY` value as a sequence of hexadecimal digits and then attempts
  to convert those hexadecimal digits into printable characters. Snowflake
  recommends that you use the [DECOMPRESS_BINARY](decompress_binary.md) function to decompress
  compressed data if the original data was `BINARY`.

## Examples

This shows how to compress a string and then decompress back to the original
value.

```sqlexample
SELECT COMPRESS('Snowflake', 'SNAPPY');
+---------------------------------+
| COMPRESS('SNOWFLAKE', 'SNAPPY') |
|---------------------------------|
| 0920536E6F77666C616B65          |
+---------------------------------+
```

```sqlexample
SELECT DECOMPRESS_STRING(TO_BINARY('0920536E6F77666C616B65', 'HEX'), 'SNAPPY');
+-------------------------------------------------------------------------+
| DECOMPRESS_STRING(TO_BINARY('0920536E6F77666C616B65', 'HEX'), 'SNAPPY') |
|-------------------------------------------------------------------------|
| Snowflake                                                               |
+-------------------------------------------------------------------------+
```

---
title: DECRYPT
source: https://docs.snowflake.com/en/sql-reference/functions/decrypt.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# DECRYPT

Decrypts a BINARY value using a VARCHAR passphrase.

See also:
:   [ENCRYPT](encrypt.md) , [ENCRYPT_RAW](encrypt_raw.md) , [DECRYPT_RAW](decrypt_raw.md) , [TRY_DECRYPT](try_decrypt.md) , [TRY_DECRYPT_RAW](try_decrypt_raw.md)

## Syntax

```sqlsyntax
DECRYPT( <value_to_decrypt> , <passphrase> ,
         [ [ <additional_authenticated_data> , ] <encryption_method> ]
       )
```

## Arguments

**Required:**

`value_to_decrypt`
:   The BINARY value to decrypt.

`passphrase`
:   The passphrase to use to encrypt/decrypt the data. The passphrase is a VARCHAR.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the [ENCRYPT](encrypt.md) function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

## Returns

Returns the decrypted value as a BINARY value. If the original value before encryption was VARCHAR, you must
explicitly convert the returned BINARY back to VARCHAR. For example:

```sqlexample
... TO_VARCHAR(DECRYPT(ENCRYPT('secret', 'key'), 'key'), 'utf-8') ...
```

For more complete examples, see the Examples below.

## Usage notes

* To decrypt data encrypted by `ENCRYPT()`, use `DECRYPT()`. Do not use `DECRYPT_RAW()`.
* To decrypt data encrypted by `ENCRYPT_RAW()`, use `DECRYPT_RAW()`. Do not use `DECRYPT()`.
* The function’s parameters are masked for security. Sensitive information such as the following is
  not visible in the query log and is not visible to Snowflake:

  + The string or binary value to encrypt or decrypt.
  + The passphrase or key.
* The functions use a FIPS-compliant cryptographic library to effectively perform the encryption and decryption.
* The passphrase or key used to decrypt a piece of data must be the same as the passphrase or key used to encrypt that
  data.

* The passphrase can be of arbitrary length, even 0 (the empty string). However, Snowflake
  strongly recommends using a passphrase that is at least 8 bytes.
* Snowflake recommends that the passphrase follow general best practices for passwords, such as using a mix of
  uppercase letters, lowercase letters, numbers, and punctuation.
* The passphrase is not used directly to encrypt/decrypt the input. Instead, the passphrase is used to derive an
  encryption/decryption key, which is always the same for the same passphrase. Snowflake uses the
  <https://en.wikipedia.org/wiki/PBKDF2> key-derivation function with a Snowflake-internal seed to compute the
  encryption/decryption key from the given passphrase.

  Because of this key derivation, the encrypt/decrypt function cannot be used to:

  + Decrypt data that was externally encrypted.
  + Encrypt data that will be externally decrypted.

  To do either of these, use [ENCRYPT_RAW](encrypt_raw.md) or [DECRYPT_RAW](decrypt_raw.md).

## Examples

The code below shows a simple example of encryption and decryption:

> ```sqlexample
> SET passphrase='poiuqewjlkfsd';
> ```
>
> ```sqlexample
> SELECT
>     TO_VARCHAR(
>         DECRYPT(
>             ENCRYPT('Patient tested positive for COVID-19', $passphrase),
>             $passphrase),
>         'utf-8')
>         AS decrypted
>     ;
> +--------------------------------------+
> | DECRYPTED                            |
> |--------------------------------------|
> | Patient tested positive for COVID-19 |
> +--------------------------------------+
> ```

This example decrypts a BINARY value with a simple passphrase. In this example, binary values are shown in
[hexadecimal](../binary-input-output.md) format. The value of the encrypted data
might vary due to the randomness of the initialization vector (described briefly in [ENCRYPT](encrypt.md)).

> ```sqlexample
> ALTER SESSION SET BINARY_OUTPUT_FORMAT='hex';
> ```
>
> ```sqlexample
> CREATE TABLE binary_table (
>     binary_column BINARY,
>     encrypted_binary_column BINARY);
> INSERT INTO binary_table (binary_column)
>     SELECT (TO_BINARY(HEX_ENCODE('Hello')));
> UPDATE binary_table
>     SET encrypted_binary_column = ENCRYPT(binary_column, 'SamplePassphrase');
> ```
>
> ```sqlexample
> SELECT 'Hello' as original_value,
>        binary_column,
>        hex_decode_string(to_varchar(binary_column)) as decoded,
>        -- encrypted_binary_column,
>        decrypt(encrypted_binary_column, 'SamplePassphrase') as decrypted,
>        hex_decode_string(to_varchar(decrypt(encrypted_binary_column, 'SamplePassphrase'))) as decrypted_and_decoded
>     FROM binary_table;
> +----------------+---------------+---------+------------+-----------------------+
> | ORIGINAL_VALUE | BINARY_COLUMN | DECODED | DECRYPTED  | DECRYPTED_AND_DECODED |
> |----------------+---------------+---------+------------+-----------------------|
> | Hello          | 48656C6C6F    | Hello   | 48656C6C6F | Hello                 |
> +----------------+---------------+---------+------------+-----------------------+
> ```

This example shows how to use an alternative mode (`CBC`) as part of the specifier for the encryption method.
This encryption method also specifies a padding rule (`PKCS`). In this example, the AAD parameter is NULL.

> ```sqlexample
> select encrypt(to_binary(hex_encode('secret!')), 'sample_passphrase', NULL, 'aes-cbc/pad:pkcs') as encrypted_data;
> ```

This example shows how to use the AAD:

```sqlexample
SELECT
    TO_VARCHAR(
        DECRYPT(
            ENCRYPT('penicillin', $passphrase, 'John Dough AAD', 'aes-gcm'),
            $passphrase, 'John Dough AAD', 'aes-gcm'),
        'utf-8')
        AS medicine
    ;
+------------+
| MEDICINE   |
|------------|
| penicillin |
+------------+
```

If you pass the wrong AAD, decryption fails:

```sqlexample
SELECT
    DECRYPT(
        ENCRYPT('penicillin', $passphrase, 'John Dough AAD', 'aes-gcm'),
        $passphrase, 'wrong patient AAD', 'aes-gcm') AS medicine
    ;
```

```none
100311 (22023): Decryption failed. Check encrypted data, key, AAD, or AEAD tag.
```

---
title: DECRYPT_RAW
source: https://docs.snowflake.com/en/sql-reference/functions/decrypt_raw.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# DECRYPT_RAW

Decrypts a BINARY value using a BINARY key.

See also:
:   [ENCRYPT](encrypt.md) , [ENCRYPT_RAW](encrypt_raw.md) , [DECRYPT](decrypt.md) , [TRY_DECRYPT](try_decrypt.md) , [TRY_DECRYPT_RAW](try_decrypt_raw.md)

## Syntax

```sqlsyntax
DECRYPT_RAW( <value_to_decrypt> , <key> , <iv> ,
         [ [ [ <additional_authenticated_data> , ] <encryption_method> , ] <aead_tag> ]
       )
```

## Arguments

**Required:**

`value_to_decrypt`
:   The binary value to decrypt.

`key`
:   The key to use to encrypt/decrypt the data. The key must be a BINARY value. The key can be any value as long as the
    length is correct. For example, for AES128, the key must be 128 bits (16 bytes), and for AES256, the key must be
    256 bits (32 bytes).

    The key used to encrypt the value must be used to decrypt the value.

`iv`
:   This parameter contains the Initialization Vector (IV) to use to encrypt and decrypt this piece of
    data. The IV must be a BINARY value of a specific length:

    * For GCM, this field must be 96 bits (12 bytes). While the GCM encryption method allows this field to be a different
      size, Snowflake currently only supports 96 bits.
    * For CCM, this should be 56 bits (7 bytes).
    * For ECB, this parameter is unneeded.
    * For all other supported encryption modes, this should be 128 bits (16 bytes).

    This value is used to initialize the first encryption round. You should never use the same IV and key combination
    more than once, especially for encryption modes like GCM.

    If this parameter is set to NULL, the implementation will choose a new pseudo-random IV during each call.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the [ENCRYPT](encrypt.md) function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

`aead_tag`
:   This BINARY value is needed for AEAD-enabled decryption modes to check the authenticity and confidentiality of the
    encrypted data. Use the AEAD tag that was returned by the ENCRYPT_RAW function. An example below shows how to
    access and use this value.

## Returns

The function returns the decrypted value. The data type of the returned value is BINARY.

## Usage notes

* To decrypt data encrypted by `ENCRYPT()`, use `DECRYPT()`. Do not use `DECRYPT_RAW()`.
* To decrypt data encrypted by `ENCRYPT_RAW()`, use `DECRYPT_RAW()`. Do not use `DECRYPT()`.
* The function’s parameters are masked for security. Sensitive information such as the following is
  not visible in the query log and is not visible to Snowflake:

  + The string or binary value to encrypt or decrypt.
  + The passphrase or key.
* The functions use a FIPS-compliant cryptographic library to effectively perform the encryption and decryption.
* The passphrase or key used to decrypt a piece of data must be the same as the passphrase or key used to encrypt that
  data.

* When extracting fields (ciphertext, initialization vector, or tag) from an encrypted binary value, use:

  ```sqlexample
  as_binary(get(encrypted_value, '<field_name>'))
  ```

  For example, use:

  ```sqlexample
  as_binary(get(encrypted_value, 'ciphertext'))
  ```

  Do not use `encrypted_value:field_name::binary`. The field-access operator `:` converts the extracted field
  value to a string; however, because the source is BINARY, that string is not always a valid UTF-8 string.

## Examples

This example shows encryption and decryption.

> For readability, set the BINARY_OUTPUT_FORMAT to HEX:
>
> ```sqlexample
> ALTER SESSION SET BINARY_OUTPUT_FORMAT='HEX';
> ```
>
> Create a table and load it.
>
> > **Caution:**
> >
> > To simplify this example, the encryption/decryption key is stored in the table with the value that has
> > been encrypted. This is insecure; the key should never be stored as an unencrypted value in the table
> > that stores the encrypted data.
>
> ```sqlexample
> CREATE OR REPLACE TABLE binary_table (
>     encryption_key BINARY,   -- DO NOT STORE REAL ENCRYPTION KEYS THIS WAY!
>     initialization_vector BINARY(12), -- DO NOT STORE REAL IV'S THIS WAY!!
>     binary_column BINARY,
>     encrypted_binary_column VARIANT,
>     aad_column BINARY);
>
> INSERT INTO binary_table (encryption_key,
>                           initialization_vector,
>                           binary_column,
>                           aad_column)
>     SELECT SHA2_BINARY('NotSecretEnough', 256),
>             SUBSTR(TO_BINARY(HEX_ENCODE('AlsoNotSecretEnough'), 'HEX'), 0, 12),
>             TO_BINARY(HEX_ENCODE('Bonjour'), 'HEX'),
>             TO_BINARY(HEX_ENCODE('additional data'), 'HEX')
>     ;
> ```
>
> Encrypt:
>
> ```sqlexample
> UPDATE binary_table SET encrypted_binary_column =
>     ENCRYPT_RAW(binary_column,
>         encryption_key,
>         initialization_vector,
>         aad_column,
>         'AES-GCM');
> +------------------------+-------------------------------------+
> | number of rows updated | number of multi-joined rows updated |
> |------------------------+-------------------------------------|
> |                      1 |                                   0 |
> +------------------------+-------------------------------------+
> ```
>
> This shows the corresponding call to the `DECRYPT_RAW()` function. The initialization vector (IV)
> is taken from the encrypted value; you do not need to store the initialization vector separately. Similarly,
> the AEAD tag is also read from the encrypted value.
>
> > **Caution:**
> >
> > To simplify this example, the encryption/decryption key is read from the table with the value that has
> > been encrypted. This is insecure; the key should never be stored as an unencrypted value in the table
> > that stores the encrypted data.
>
> ```sqlexample
> SELECT 'Bonjour' as original_value,
>        binary_column,
>        hex_decode_string(to_varchar(binary_column)) as decoded,
>        encrypted_binary_column,
>        decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                   encryption_key,
>                   as_binary(get(encrypted_binary_column, 'iv')),
>                   aad_column,
>                   'AES-GCM',
>                   as_binary(get(encrypted_binary_column, 'tag')))
>            as decrypted,
>        hex_decode_string(to_varchar(decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                   encryption_key,
>                   as_binary(get(encrypted_binary_column, 'iv')),
>                   aad_column,
>                   'AES-GCM',
>                   as_binary(get(encrypted_binary_column, 'tag')))
>                   ))
>            as decrypted_and_decoded
>     FROM binary_table;
> +----------------+----------------+---------+---------------------------------------------+----------------+-----------------------+
> | ORIGINAL_VALUE | BINARY_COLUMN  | DECODED | ENCRYPTED_BINARY_COLUMN                     | DECRYPTED      | DECRYPTED_AND_DECODED |
> |----------------+----------------+---------+---------------------------------------------+----------------+-----------------------|
> | Bonjour        | 426F6E6A6F7572 | Bonjour | {                                           | 426F6E6A6F7572 | Bonjour               |
> |                |                |         |   "ciphertext": "CA2F4A383F6F55",           |                |                       |
> |                |                |         |   "iv": "416C736F4E6F745365637265",         |                |                       |
> |                |                |         |   "tag": "91F28FBC6A2FE9B213D1C44B8D75D147" |                |                       |
> |                |                |         | }                                           |                |                       |
> +----------------+----------------+---------+---------------------------------------------+----------------+-----------------------+
> ```
>
> The previous example duplicated a long call to `DECRYPT_RAW()`. You can use a WITH clause to reduce
> the duplication:
>
> ```sqlexample
> WITH
>     decrypted_but_not_decoded as (
>         decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                       encryption_key,
>                       as_binary(get(encrypted_binary_column, 'iv')),
>                       aad_column,
>                       'AES-GCM',
>                       as_binary(get(encrypted_binary_column, 'tag')))
>     )
> SELECT 'Bonjour' as original_value,
>        binary_column,
>        hex_decode_string(to_varchar(binary_column)) as decoded,
>        encrypted_binary_column,
>        decrypted_but_not_decoded,
>        hex_decode_string(to_varchar(decrypted_but_not_decoded))
>            as decrypted_and_decoded
>     FROM binary_table;
> +----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------+
> | ORIGINAL_VALUE | BINARY_COLUMN  | DECODED | ENCRYPTED_BINARY_COLUMN                     | DECRYPTED_BUT_NOT_DECODED | DECRYPTED_AND_DECODED |
> |----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------|
> | Bonjour        | 426F6E6A6F7572 | Bonjour | {                                           | 426F6E6A6F7572            | Bonjour               |
> |                |                |         |   "ciphertext": "CA2F4A383F6F55",           |                           |                       |
> |                |                |         |   "iv": "416C736F4E6F745365637265",         |                           |                       |
> |                |                |         |   "tag": "91F28FBC6A2FE9B213D1C44B8D75D147" |                           |                       |
> |                |                |         | }                                           |                           |                       |
> +----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------+
> ```

---
title: DEGREES
source: https://docs.snowflake.com/en/sql-reference/functions/degrees.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# DEGREES

Converts radians to degrees.

## Syntax

```sqlsyntax
DEGREES( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

Show the number of degrees for 1/3 of a radian, 1 radian, and 3 radians:

```sqlexample
SELECT DEGREES(PI()/3), DEGREES(PI()), DEGREES(3 * PI()), DEGREES(1);
```

```output
+-----------------+---------------+-------------------+--------------+
| DEGREES(PI()/3) | DEGREES(PI()) | DEGREES(3 * PI()) |   DEGREES(1) |
|-----------------+---------------+-------------------+--------------|
|              60 |           180 |               540 | 57.295779513 |
+-----------------+---------------+-------------------+--------------+
```

---
title: DENSE_RANK
source: https://docs.snowflake.com/en/sql-reference/functions/dense_rank.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# DENSE_RANK

Returns the rank of a value within a group of values, without gaps in the ranks.

The rank value starts at 1 and continues up sequentially.

If two values are the same, they have the same rank.

## Syntax

```sqlsyntax
DENSE_RANK() OVER ( [ PARTITION BY <expr1> ]
  ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

None.

The function itself takes no arguments because it returns the rank (relative position) of the current row
within the window, which is ordered by `<expr2>`. The ordering of the window determines the rank, so there
is no need to pass an additional parameter to the DENSE_RANK function.

## Usage notes

* `expr1`
  The column or expression to partition the window by.

  For example, suppose that within each state or province, you want to rank
  farmers in order by the amount of corn they produced. In this case, you
  partition by state.

  If you want only a single group (e.g. you want to rank all farmers in the U.S.
  regardless of which state they live in), then omit the PARTITION BY clause.
* `expr2`
  The column or expression to order (rank) by.

  For example, if you’re ranking farmers to see who produced the most corn
  (within their state), then you would use the `bushels_produced` column. For details,
  see Examples (in this topic).
* Tie values result in the same rank value, but unlike [RANK](rank.md), they do not result in gaps in the sequence.

## Examples

Create a table and data:

```sqlexample
CREATE OR REPLACE TABLE corn_production (farmer_id INTEGER, state VARCHAR, bushels FLOAT);

INSERT INTO corn_production (farmer_id, state, bushels) VALUES
  (1, 'Iowa', 100),
  (2, 'Iowa', 110),
  (3, 'Kansas', 120),
  (4, 'Kansas', 130);
```

Show farmers’ corn production in descending order, along with the rank of each
individual farmer’s production (highest = `1`):

```sqlexample
SELECT state, bushels,
    RANK() OVER (ORDER BY bushels DESC),
    DENSE_RANK() OVER (ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+-------------------------------------+-------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (ORDER BY BUSHELS DESC) |
|--------+---------+-------------------------------------+-------------------------------------------|
| Kansas |     130 |                                   1 |                                         1 |
| Kansas |     120 |                                   2 |                                         2 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     100 |                                   4 |                                         4 |
+--------+---------+-------------------------------------+-------------------------------------------+
```

Within each state, show farmers’ corn production in descending order, along with the rank of each
individual farmer’s production (highest = `1`):

```sqlexample
SELECT state, bushels,
    RANK() OVER (PARTITION BY state ORDER BY bushels DESC),
    DENSE_RANK() OVER (PARTITION BY state ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+--------------------------------------------------------+--------------------------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (PARTITION BY STATE ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (PARTITION BY STATE ORDER BY BUSHELS DESC) |
|--------+---------+--------------------------------------------------------+--------------------------------------------------------------|
| Iowa   |     110 |                                                      1 |                                                            1 |
| Iowa   |     100 |                                                      2 |                                                            2 |
| Kansas |     130 |                                                      1 |                                                            1 |
| Kansas |     120 |                                                      2 |                                                            2 |
+--------+---------+--------------------------------------------------------+--------------------------------------------------------------+
```

The query and output below show how tie values are handled by the RANK and DENSE_RANK functions. Note that for DENSE_RANK,
the ranks are `1`, `2`, `3`, `3`, `4`. Unlike with the output from the RANK function, the rank `4` is not skipped
because there was a tie for rank `3`.

```sqlexample
INSERT INTO corn_production (farmer_id, state, bushels) VALUES
  (5, 'Iowa', 110);

SELECT state, bushels,
    RANK() OVER (ORDER BY bushels DESC),
    DENSE_RANK() OVER (ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+-------------------------------------+-------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (ORDER BY BUSHELS DESC) |
|--------+---------+-------------------------------------+-------------------------------------------|
| Kansas |     130 |                                   1 |                                         1 |
| Kansas |     120 |                                   2 |                                         2 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     100 |                                   5 |                                         4 |
+--------+---------+-------------------------------------+-------------------------------------------+
```

---
title: DIV0
source: https://docs.snowflake.com/en/sql-reference/functions/div0.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md)

# DIV0

Performs division like the division operator (`/`), but returns 0 when the divisor is 0 (rather than reporting an error).

See also:
:   [DIV0NULL](div0null.md)

## Syntax

```sqlsyntax
DIV0( <dividend> , <divisor> )
```

## Arguments

`dividend`
:   Numeric expression that evaluates to the value that you want to divide.

`divisor`
:   Numeric expression that evaluates to the value that you want to divide by.

## Returns

The quotient. If the divisor is 0, the function returns 0.

## Examples

As shown in the following example, the DIV0 function performs division like the division operator (`/`):

> ```sqlexample
> SELECT 1/2;
> +----------+
> |      1/2 |
> |----------|
> | 0.500000 |
> +----------+
> SELECT DIV0(1, 2);
> +------------+
> | DIV0(1, 2) |
> |------------|
> |   0.500000 |
> +------------+
> ```

Unlike the division operator, DIV0 returns a 0 (rather than reporting an error) when the divisor is 0.

> ```sqlexample
> select 1/0;
> 100051 (22012): Division by zero
> ```
>
> ```sqlexample
> SELECT DIV0(1, 0);
> +------------+
> | DIV0(1, 0) |
> |------------|
> |   0.000000 |
> +------------+
> ```

---
title: DIV0NULL
source: https://docs.snowflake.com/en/sql-reference/functions/div0null.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md)

# DIV0NULL

Performs division like the division operator (`/`), but returns 0 when the divisor is 0 or NULL (rather than reporting an
error or returning NULL).

See also:
:   [DIV0](div0.md)

## Syntax

```sqlsyntax
DIV0NULL( <dividend> , <divisor> )
```

## Arguments

`dividend`
:   Numeric expression that evaluates to the value that you want to divide.

`divisor`
:   Numeric expression that evaluates to the value that you want to divide by.

## Returns

The quotient. If the divisor is 0 or NULL, the function returns 0.

## Examples

As shown in the following example, the DIV0NULL function performs division like the division operator (`/`):

```sqlexample
SELECT 1/2;

+----------+
|      1/2 |
|----------|
| 0.500000 |
+----------+
```

```sqlexample
SELECT DIV0NULL(1, 2);

+----------------+
| DIV0NULL(1, 2) |
|----------------|
|       0.500000 |
+----------------+
```

Unlike the division operator, DIV0NULL returns a 0 (rather than reporting an error or returning NULL) when the divisor is 0 or
NULL.

```sqlexample
SELECT 1/0;
100051 (22012): Division by zero
```

```sqlexample
SELECT DIV0NULL(1, 0);

+----------------+
| DIV0NULL(1, 0) |
|----------------|
|       0.000000 |
+----------------+
```

```sqlexample
SELECT 1/NULL;

+--------+
| 1/NULL |
|--------|
|   NULL |
+--------+
```

```sqlexample
SELECT DIV0NULL(1, NULL);

+-------------------+
| DIV0NULL(1, NULL) |
|-------------------|
|          0.000000 |
+-------------------+
```

---
title: DP_INTERVAL_HIGH
source: https://docs.snowflake.com/en/sql-reference/functions/dp_interval_high.md
section: SQL Functions
---

Categories:
:   [Differential privacy functions](../functions-differential-privacy.md)

# DP_INTERVAL_HIGH

Returns the upper bound of the [noise interval](../../user-guide/diff-privacy/differential-privacy-analyst.md), which is used by differential privacy to help
analysts determine how much noise has been introduced into query results.

By default, there is a 95% confidence level that the actual value is equal to or smaller than the upper bound.

See also:
:   [DP_INTERVAL_LOW](dp_interval_low.md)

## Syntax

```sqlsyntax
DP_INTERVAL_HIGH( <aggregated_column> )
```

## Arguments

`aggregated_column`
:   Alias of a column that has been aggregated by the query.

## Returns

Returns an integer that is the upper bound of the noise interval.

If the table is not protected by differential privacy, returns NULL.

## Examples

To return the upper bound of the noise interval for the aggregation of the column `num_claims`:

```sqlexample
SELECT SUM(num_claims) AS sum_claims,
  DP_INTERVAL_HIGH(sum_claims)
  FROM t1;
```

---
title: DP_INTERVAL_LOW
source: https://docs.snowflake.com/en/sql-reference/functions/dp_interval_low.md
section: SQL Functions
---

Categories:
:   [Differential privacy functions](../functions-differential-privacy.md)

# DP_INTERVAL_LOW

Returns the lower bound of the [noise interval](../../user-guide/diff-privacy/differential-privacy-analyst.md), which is used by differential privacy to help
analysts determine how much noise has been introduced into query results.

By default, there is a 95% confidence level that the actual value is equal to or larger than the lower bound.

See also:
:   [DP_INTERVAL_HIGH](dp_interval_high.md)

## Syntax

```sqlsyntax
DP_INTERVAL_LOW( <aggregated_column> )
```

## Arguments

`aggregated_column`
:   Alias of a column that has been aggregated by the query.

## Returns

Returns an integer that is the lower bound of the noise interval.

If the table is not protected by differential privacy, returns NULL.

## Examples

To return the lower bound of the noise interval for the aggregation of the column `num_claims`:

```sqlexample
SELECT SUM(num_claims) AS sum_claims,
  DP_INTERVAL_LOW(sum_claims)
  FROM t1;
```

---
title: DUPLICATE_COUNT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_duplicate_count.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# DUPLICATE_COUNT (system data metric function)

Returns the count of column values that have duplicates, including NULL values. If you specify more than one column argument, returns the
number of rows where the combination of the specified columns is duplicated.

If you want to specify more than one column argument, you can’t call the function directly. For an example of associating the function with
a table so you can specify multiple column arguments, see Examples.

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.DUPLICATE_COUNT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects one or more columns.

## Allowed data types

The columns projected by the `query` must have one of the following data types:

* DATE
* FLOAT
* NUMBER
* TIMESTAMP_LTZ
* TIMESTAMP_NTZ
* TIMESTAMP_TZ
* VARCHAR

## Returns

The function returns a scalar value with a NUMBER data type.

## Example

Determine the number of duplicate US Social Security numbers in the `SSN` column:

```sqlexample
SELECT SNOWFLAKE.CORE.DUPLICATE_COUNT(
  SELECT
    ssn
  FROM hr.tables.empl_info
);
```

Associate the DMF with a table to determine the number of duplicates based on the combination of the `first_name` and `last_name`
columns:

```sqlexample
ALTER TABLE t
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.DUPLICATE_COUNT
    ON (first_name, last_name);
```

---
title: DYNAMIC_TABLE_GRAPH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/dynamic_table_graph_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DYNAMIC_TABLE_GRAPH_HISTORY

This table function returns information on all [dynamic tables](../../user-guide/dynamic-tables-about.md) in the current account.
This information includes the dependencies between dynamic tables and on base tables.
A common use is to identify all dynamic tables that are part of a pipeline.

In the output of this function, each row represents a dynamic table.
The VALID_FROM and VALID_TO columns specify the range of time over which the description of a dynamic table was valid
(i.e., accurately described the dynamic table).

Changes to a dynamic table such as altering the TARGET_LAG result in the creation of new entries.

This table function provides only descriptions with a VALID_TO value within 7 days of the current time.

## Syntax

```sqlsyntax
DYNAMIC_TABLE_GRAPH_HISTORY(
  [ AS_OF => <constant_expr> ]
  [ , HISTORY_START => <constant_expr> [ , HISTORY_END => <constant_expr> ] ]
)
```

## Arguments

All arguments are optional. If no arguments are provided, only the most recent description of existing dynamic tables are returned. Specify `constant_expr` in [TIMESTAMP_LTZ format](../data-types-datetime.md).

`AS_OF => constant_expr`
:   Time at which to return the state of the graph. You can specify a time that corresponds to a value in
    the REFRESH_VERSION column in the output of the [DYNAMIC_TABLE_REFRESH_HISTORY](dynamic_table_refresh_history.md) function.

`HISTORY_START => constant_expr` , . `HISTORY_END => constant_expr`
:   Date/time range of the dynamic table refresh history.
    HISTORY_START specifies the earliest date/time, inclusive, to return data.
    HISTORY_END, which must be specified with HISTORY_START, specifies the end date/time for returning data.

## Output

The function returns the following columns.

To view these columns, you must use a role with the MONITOR privilege. Otherwise, the function only returns a value for `NAME`,
`SCHEMA_NAME`, `DATABASE_NAME`, and `QUALIFIED_NAME`. For more information about dynamic table privileges, see
[Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md).

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the dynamic table. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the dynamic table. |
| DATABASE_NAME | TEXT | Name of the database that contains the dynamic table. |
| QUALIFIED_NAME | TEXT | Fully qualified name of the dynamic table as it appears in the graph of dynamic tables. You can use this to join the output with the output of the [DYNAMIC_TABLE_REFRESH_HISTORY](dynamic_table_refresh_history.md) function. |
| INPUTS | ARRAY of OBJECTs | Each OBJECT represents a table, view, or dynamic table that serves as the input to this dynamic table, and consists of:   * `name` (TEXT): fully qualified name. * `kind` (TEXT): type of input (TABLE, VIEW, or DYNAMIC TABLE). * `insideRefreshBoundary` (BOOLEAN): only present for VIEW and DYNAMIC TABLE inputs when the value is TRUE, indicating the   input is wrapped in [DYNAMIC_TABLE_REFRESH_BOUNDARY()](../../user-guide/dynamic-tables-refresh-boundary.md). Inputs inside a refresh   boundary are not refreshed together with this dynamic table. |
| TARGET_LAG_TYPE | TEXT | One of:   * USER_DEFINED - Determined by the TARGET_LAG parameter specified for the dynamic table. * DOWNSTREAM - Indicates a dynamic table with a DOWNSTREAM TARGET_LAG. Refer to [Understanding dynamic table initialization and refresh](../../user-guide/dynamic-tables-refresh.md) for more information. |
| TARGET_LAG_SEC | NUMBER | The target lag time in seconds of this dynamic table. This is the value that was specified in the TARGET_LAG parameter of the dynamic table. |
| QUERY_TEXT | TEXT | The SELECT statement for this dynamic table. |
| VALID_FROM | TIMESTAMP_LTZ | The description of the dynamic table is valid after this time. |
| VALID_TO | TIMESTAMP_LTZ | If present, the description of the dynamic table is valid up to this time. If null, the description is still accurate. |
| SCHEDULING_STATE | OBJECT | OBJECT consisting of:   * `state` (TEXT): Scheduling state (ACTIVE or SUSPENDED). * `reason_code` (TEXT): Optional reason for the reason if the state is not ACTIVE. * `reason_message` (TEXT): Text description of the reason the dynamic table is not active.   Only applies if the state is not active. * `suspended_on` (TIMESTAMP_LTZ): Optional timestamp when the dynamic table was suspended. * `resumed_on` (TIMESTAMP_LTZ): Optional timestamp when it was last resumed if dynamic table is ACTIVE. |
| ALTER_TRIGGER | ARRAY | Describes why a new entry is created in the DYNAMIC_TABLE_GRAPH_HISTORY function. Can be one of the following:   * NONE (backwards-compatible) * CREATE_DYNAMIC_TABLE * ALTER_TARGET_LAG * SUSPEND * RESUME * REPLICATION_REFRESH * ALTER_WAREHOUSE |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully qualified. For more information, see [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the graph history of each dynamic table in the account, its properties, and its dependencies on other tables and dynamic tables:

> ```sqlexample
> SELECT
>   name,
>   inputs,
>   target_lag_type,
>   target_lag_sec,
>   scheduling_state,
>   alter_trigger
> FROM
>   TABLE (
>     INFORMATION_SCHEMA.DYNAMIC_TABLE_GRAPH_HISTORY ()
>   )
> ORDER BY
>   name;
> ```
>
> ```output
> +--------------------+---------------------------------------------------+-----------------+----------------+---------------------------------------------+------------------+
> | NAME               |[] INPUTS                                          | TARGET_LAG_TYPE | TARGET_LAG_SEC | [] SCHEDULING_STATE                         | [] ALTER_TRIGGER |
> |--------------------+---------------------------------------------------+-----------------+----------------+---------------------------------------------|------------------+
> | MY_DYNAMIC_TABLE_1 | [                                                 | USER_DEFINED    | 300            | {                                           | [                |
> |                    |  {                                                |                 |                |   "resumed_on": "2024-03-01 10:29:02.066 Z",|   "RESUME"       |
> |                    |    "kind": "DYNAMIC_TABLE",                       |                 |                |   "state": "ACTIVE"                         | ]                |
> |                    |    "name": "MY_QUALIFIED_NAME.MY_DYNAMIC_TABLE_2" |                 |                | }                                           |                  |
> |                    |  }                                                |                 |                |                                             |                  |
> |                    | ]                                                 |                 |                |                                             |                  |
> +--------------------+---------------------------------------------------+-----------------+----------------+---------------------------------------------+------------------+
> ```

---
title: DYNAMIC_TABLE_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/dynamic_table_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DYNAMIC_TABLE_REFRESH_HISTORY

This table function returns information about each refresh (completed and running) of [dynamic tables](../../user-guide/dynamic-tables-about.md).

This table function returns all refreshes that are in progress as well as all refreshes that have a DATA_TIMESTAMP within 7 days
of the current time.

## Syntax

```sqlsyntax
DYNAMIC_TABLE_REFRESH_HISTORY(
  [ DATA_TIMESTAMP_START => <constant_expr> ]
  [ , DATA_TIMESTAMP_END => <constant_expr> ]
  [ , RESULT_LIMIT => <integer> ]
  [ , NAME => '<string>' ]
  [ , NAME_PREFIX => '<string>' ]
  [ , ERROR_ONLY => { TRUE | FALSE } ]
)
```

## Arguments

All the arguments are optional.
If no arguments are provided, 100 refreshes from all dynamic tables in the account will be returned.

`DATA_TIMESTAMP_START => constant_expr` , . `DATA_TIMESTAMP_END => constant_expr`
:   Time range (in TIMESTAMP_LTZ format) during which the refreshes occurred.

    * If neither a start version nor an end version is specified, the default range will be the past day.
    * If an end version is not specified, [CURRENT_TIMESTAMP](current_timestamp.md) is used as the end of the range.
    * If a start version is not specified, the range starts 1 day prior to the start of DATE_TIMESTAMP_END.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function. If the number of matching rows is greater than
    this limit, the refreshes that finished most recently (and those that are still running) are returned, up to the specified
    limit.

    To apply a filter on the results, also specify a large enough RESULT_LIMIT limit value for the filter to be applied on all
    dynamic tables.

    Range: `1` to `10000`

    Default: `100`.

`NAME => string`
:   The name of a dynamic table.

    Names must be single-quoted and are case insensitive.

    You can specify the unqualified name (`dynamic_table_name`),
    the partially qualified name (`schema_name.dynamic_table_name`),
    or the fully qualified name (`database_name.schema_name.dynamic_table_name`).

    For more information on object name resolution, refer to [Object name resolution](../name-resolution.md).

    The function returns the refreshes for this table.

`NAME_PREFIX => string`
:   A prefix for dynamic tables.

    Name prefixes must be single-quoted and are case insensitive.

    The function returns refreshes for tables with names that start with this prefix.

    You can use this argument to return the refreshes for dynamic tables in a specific database or schema.

`ERROR_ONLY => TRUE | FALSE`
:   When set to TRUE, this function returns only refreshes that failed or were cancelled.

## Output

The function returns the following columns.

To view these columns, you must use a role with the MONITOR privilege. For more information, see
[Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md).

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the dynamic table. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the dynamic table. |
| DATABASE_NAME | TEXT | Name of the database that contains the dynamic table. |
| STATE | TEXT | Status of the refresh for the dynamic table. The status can be one of the following:   * SCHEDULED: refresh scheduled, but not yet executed. * EXECUTING: refresh in progress. * SUCCEEDED: refresh completed successfully. * FAILED: refresh failed during execution. * CANCELLED: refresh was canceled before execution. * UPSTREAM_FAILED: refresh not performed due to an upstream failed refresh. * SKIPPED: refresh not performed because an upstream dynamic table refresh was skipped or the scheduler deferred the   refresh to maximize time within target lag. |
| STATE_CODE | TEXT | Code representing the current state of the refresh. |
| STATE_MESSAGE | TEXT | Description of the current state of the refresh. |
| QUERY_ID | TEXT | ID of the SQL statement that produced the results for the dynamic table. |
| DATA_TIMESTAMP | TIMESTAMP_LTZ | Transactional timestamp when the refresh was evaluated. (This might be slightly before the actual time of the refresh.) All data, in base objects, that arrived before this timestamp is currently included in the dynamic table. |
| REFRESH_START_TIME | TIMESTAMP_LTZ | Time when the refresh job started. |
| REFRESH_END_TIME | TIMESTAMP_LTZ | Time when the refresh completed. |
| COMPLETION_TARGET | TIMESTAMP_LTZ | Time by which this refresh should complete to keep lag under the TARGET_LAG parameter for the dynamic table. This is equal to the DATA_TIMESTAMP of the last refresh + TARGET_LAG. |
| QUALIFIED_NAME | TEXT | Fully qualified name of the dynamic table as it appears in the graph of dynamic tables. You can use this to join the output with the output of the [DYNAMIC_TABLE_GRAPH_HISTORY](dynamic_table_graph_history.md) function. |
| LAST_COMPLETED_DEPENDENCY | OBJECT | Contains the following properties:   * `qualified_name`: The qualified name of the latest dependency to become available. * `data_timestamp`: The refresh version of that dependency. |
| STATISTICS | OBJECT | Contains the following properties:   * `numInsertedRows`: The number of inserted rows. * `numDeletedRows`: The number of rows that were deleted. * `numCopiedRows`: The number of rows that were copied unchanged. * `numAddedPartitions`: The number of added partitions. * `numRemovedPartitions` : The number of removed partitions. * `queuedTimeMs`: The time (in milliseconds) spent in the queued state. * `compilationTimeMs`: The time (in milliseconds) spent compiling the refresh query. * `executionTimeMs`: The time (in milliseconds) spent executing the refresh query.  For successful refreshes, this column includes both the row/partition statistics and the time distribution   information. For example:  ```json   {     "numAddedPartitions": 1,     "numCopiedRows": 0,     "numDeletedRows": 25,     "numInsertedRows": 36,     "numRemovedPartitions": 1,     "queuedTimeMs": 123,     "compilationTimeMs": 456,     "executionTimeMs": 789   }   ```  For failed refreshes, this column is populated with the time distribution information only. For example:  ```json   {     "queuedTimeMs": 123,     "compilationTimeMs": 456,     "executionTimeMs": 789   }   ```  **Note:** Because a JSON object is an unordered set of keys and values, the order of properties in the output may   vary from the examples above.  For example, if an UPDATE statement updates 1 row in a partition with 10 rows, the row/partition metrics show   1 row inserted, 1 deleted, and 9 copied. Additionally, 1 partition is removed and 1 partition added. |
| REFRESH_ACTION | TEXT | One of:   * NO_DATA - no new data in base tables. Doesn’t apply to the initial refresh of newly created dynamic tables regardless of whether or not the base tables have data. * REINITIALIZE - base table changed or source table of a cloned dynamic table was refreshed during clone. * FULL - Full refresh, because dynamic table contains query elements that are not incrementalizable (see SHOW DYNAMIC TABLE refresh_mode_reason) or because full refresh was cheaper than incremental refresh. * INCREMENTAL - normal incremental refresh. |
| REFRESH_TRIGGER | TEXT | One of:   * SCHEDULED - normal background refresh to meet target lag or downstream target lag. * MANUAL - user/task used ALTER DYNAMIC TABLE <name> REFRESH * CREATION - refresh performed during the creation DDL statement, triggered by the creation of the dynamic table or any consumer dynamic tables. |
| TARGET_LAG_SEC | NUMBER | Describes the target lag value for the dynamic tables at the time the refresh occurred. |
| GRAPH_HISTORY_VALID_FROM | TIMESTAMP_NTZ | Encodes the VALID_FROM timestamp of the DYNAMIC_TABLE_GRAPH_HISTORY table function when the refresh occurred to clarify which version of a dynamic table a specific refresh corresponds to. This value can also be NULL if the corresponding dynamic table hasn’t been created. |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the refreshes that failed or were canceled:

> ```sqlexample
> SELECT
>   name,
>   state,
>   state_code,
>   state_message,
>   query_id,
>   data_timestamp,
>   refresh_start_time,
>   refresh_end_time
> FROM
>   TABLE (
>     INFORMATION_SCHEMA.DYNAMIC_TABLE_REFRESH_HISTORY (
>       NAME_PREFIX => 'MYDB.MYSCHEMA.', ERROR_ONLY => TRUE
>     )
>   )
> ORDER BY
>   name,
>   data_timestamp;
> ```

---
title: DYNAMIC_TABLES
source: https://docs.snowflake.com/en/sql-reference/functions/dynamic_tables.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DYNAMIC_TABLES

This table function returns metadata about [dynamic tables](../../user-guide/dynamic-tables-about.md), including aggregate lag metrics and the status of the most recent refreshes, within 7 days
of the current time.

## Syntax

```sqlsyntax
DYNAMIC_TABLES (
  [ NAME => '<string>' ]
  [ , REFRESH_DATA_TIMESTAMP_START => <constant_expr> ]
  [ , RESULT_LIMIT => <integer> ]
  [ , INCLUDE_CONNECTED => { TRUE | FALSE } ]
)
```

## Arguments

All the arguments are optional.
If no arguments are provided, 100 refreshes from all dynamic tables in the account will be returned.

`NAME => 'string'`
:   The name of a dynamic table.

    Names must be single-quoted and are case insensitive.

    You can specify the unqualified name (`dynamic_table_name`),
    the partially qualified name (`schema_name.dynamic_table_name`),
    or the fully qualified name (`database_name.schema_name.dynamic_table_name`).

    For more information on object name resolution, refer to [Object name resolution](../name-resolution.md).

    The function returns the metadata for this table.

`REFRESH_DATA_TIMESTAMP_START => constant_expr`
:   Time (in TIMESTAMP_LTZ format) for computing metrics related to dynamic table target lag. Includes all refreshes with LATEST_DATA_TIMESTAMP greater than or equal to REFRESH_DATA_TIMESTAMP_START.

    Default: All refreshes in refresh history are retained for 7 days.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function.

    By default, the function returns 100 rows and the results are sorted by the dynamic table’s last completed refresh state in the following
    order, unless specified otherwise using the RESULT_LIMIT argument.

    1. FAILED
    2. UPSTREAM_FAILED
    3. SKIPPED
    4. SUCCEEDED
    5. CANCELED

    To sort by a different order, you must provide a large enough RESULT_LIMIT value (for example, the maximum value of a signed integer). As
    long as RESULT_LIMIT exceeds the total number of dynamic tables in the account, the results can be sorted using an ORDER BY clause.

    To apply a filter on the results, also specify a large enough RESULT_LIMIT value for the filter to be applied on all dynamic tables.

    **Examples**:

    The following example sorts by a different order of `name` and returns 100 rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) ORDER BY name ASC LIMIT 100 ;
    ```

    The following example sorts by a different order of `name` and returns all rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) ORDER BY name ASC ;
    ```

    The following example filters for all dynamic tables with 1-minute target lag, uses the default sort, and returns all rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) WHERE TARGET_LAG_SEC = 60 ;
    ```

    Range: `1` to `10000`

    Default: `100`.

`INCLUDE_CONNECTED => { TRUE | FALSE }`
:   When set to TRUE, the function returns metadata for all dynamic tables connected to the dynamic table specified by the NAME argument.

    You must specify the NAME argument, you must not specify the RESULT_LIMIT argument.

    Default: `FALSE`

## Output

The function returns the following columns.

To view these columns, you must use a role with the MONITOR privilege. Otherwise, the function only returns a value for `NAME`,
`SCHEMA_NAME`, `DATABASE_NAME`, and `QUALIFIED_NAME`. For more information about dynamic table privileges, see
[Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md).

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the dynamic table. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the dynamic table. |
| DATABASE_NAME | TEXT | Name of the database that contains the dynamic table. |
| QUALIFIED_NAME | TEXT | Fully qualified name of the dynamic table. |
| TARGET_LAG_SEC | NUMBER | Target lag time in seconds of the dynamic table. This is the value that was specified in the TARGET_LAG parameter of the dynamic table. |
| TARGET_LAG_TYPE | TEXT | The type of target lag. Can be one of the following:   * USER_DEFINED: Determined by the TARGET_LAG parameter specified for the dynamic table. * DOWNSTREAM: Includes a dynamic table with a DOWNSTREAM target lag. |
| SCHEDULING_STATE | OBJECT | OBJECT consisting of:   * STATE (TEXT): Scheduling state (RUNNING or SUSPENDED). * REASON_CODE (TEXT): Specifies the code for the reason why the dynamic table is not running. * REASON_MESSAGE (TEXT): Text description of the reason the dynamic table is not running. Only applies if the dynamic table is not in the RUNNING state. * SUSPENDED_ON (TIMESTAMP_LTZ): Timestamp when the dynamic table was suspended. Only applies if the dynamic table is in the SUSPENDED state. * RESUMED_ON (TIMESTAMP_LTZ): Timestamp when the dynamic table was last resumed. Only applies if dynamic table is in the RUNNING state. |
| MEAN_LAG_SEC | NUMBER | The mean lag time (in seconds) of refreshes for this dynamic table. |
| MAXIMUM_LAG_SEC | NUMBER | The maximum lag time in seconds of refreshes for this dynamic table. |
| TIME_ABOVE_TARGET_LAG_SEC | NUMBER | The time in seconds in the retention period or since the last configuration change, when the actual lag was more than the defined target lag. |
| TIME_WITHIN_TARGET_LAG_RATIO | NUMBER | The ratio of time in the retention period or since the last configuration change, when actual lag is within the target lag. |
| LATEST_DATA_TIMESTAMP | TIMESTAMP_LTZ | Data timestamp of the last successful refresh. |
| LAST_COMPLETED_REFRESH_STATE | TEXT | Status of the last terminated refresh for the dynamic table. Can be one of the following:   * SUCCEEDED: Refresh completed successfully. * FAILED: Refresh failed during execution. * UPSTREAM_FAILED: Refresh not performed due to an upstream failed refresh. * CANCELLED: Refresh was canceled before execution. |
| LAST_COMPLETED_REFRESH_STATE_CODE | TEXT | Code representing the current state of the refresh.  If the LAST_COMPLETED_REFRESH_STATE is FAILED, this column shows the error code associated with the failure. |
| LAST_COMPLETED_REFRESH_STATE_MESSAGE | TEXT | Description of the current state of the refresh.  If the LAST_COMPLETED_REFRESH_STATE is FAILED, this column shows the error message associated with the failure. |
| EXECUTING_REFRESH_QUERY_ID | TEXT | If present, this represents the query ID of the refresh job. If null, there is no refresh job in progress. |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the names, lag information, and data timestamp of the last successful refresh for all dynamic tables connected with the specified dynamic table.

```sqlexample
SELECT
  name,
  target_lag_sec,
  mean_lag_sec,
  latest_data_timestamp
FROM
  TABLE (
    INFORMATION_SCHEMA.DYNAMIC_TABLES (
      NAME => 'mydb.myschema.mydt',
      INCLUDE_CONNECTED => TRUE
    )
  )
ORDER BY
  target_lag_sec
```

---
title: EDITDISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/editdistance.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# EDITDISTANCE

Computes the Levenshtein distance between two input strings. It is the number
of single-character insertions, deletions, or substitutions needed to convert
one string to another.

> **Note:**
>
> Unlike some other metrics (for example, Damerau-Levenshtein distance), character
> transpositions aren’t considered.

## Syntax

```sqlsyntax
EDITDISTANCE( <string_expr1> , <string_expr2> [ , <max_distance> ] )
```

## Arguments

**Required:**

`string_expr1`, . `string_expr2`
:   The input strings.

**Optional:**

`max_distance`
:   Integer expression that specifies the maximum distance to compute.

    When the distance between the strings exceeds this number, the function stops computing the distance and just returns the
    maximum distance.

    Specifying this argument has the same effect as calling
    `LEAST( EDITDISTANCE( string_expr1, string_expr2 ), max_distance )`.

    If you specify a negative number (that is, `-n`), the function uses `0` as the maximum distance and returns `0`.

## Usage notes

* The execution time of the EDITDISTANCE function is proportional to the product of the lengths of the input strings.
* For better performance, Snowflake recommends using input strings not longer than 4096 characters.

  Input strings longer than 128 MB might result in an error.

  You can also use the optional `max_distance` argument to set an upper bound for the distance computed.

## Collation details

No impact.
In languages where the alphabet contains digraphs or trigraphs (such as “Dz” and “Dzs” in Hungarian), each character in each digraph and trigraph is treated as an independent character, not as part of a single multi-character letter.

The result is based solely on the characters in the strings, not on the collation specifications of the strings.

## Examples

The following example computes the distance between the strings in the columns `s` and `t` in the table `ed`.

The last two columns use the `max_distance` argument to specify the maximum distance to compute:

* When `max_distance` is `3`, the function returns `3` if the distance between the strings is greater than or equal to
  3 (as shown below).
* If `max_distance` is a negative number (for example, `-1`, as shown below), the function uses `0` as the maximum distance
  and returns `0`.

```sqlexample
SELECT s,
       t,
       EDITDISTANCE(s, t),
       EDITDISTANCE(t, s),
       EDITDISTANCE(s, t, 3),
       EDITDISTANCE(s, t, -1)
  FROM ed;
```

```output
+----------------+-----------------+--------------------+--------------------+-----------------------+------------------------+
|      S         |        T        | EDITDISTANCE(S, T) | EDITDISTANCE(T, S) | EDITDISTANCE(S, T, 3) | EDITDISTANCE(S, T, -1) |
|----------------+-----------------+--------------------+--------------------+-----------------------+------------------------|
|                |                 | 0                  | 0                  | 0                     | 0                      |
| Gute nacht     | Ich weis nicht  | 8                  | 8                  | 3                     | 0                      |
| Ich weiß nicht | Ich wei? nicht  | 1                  | 1                  | 1                     | 0                      |
| Ich weiß nicht | Ich weiss nicht | 2                  | 2                  | 2                     | 0                      |
| Ich weiß nicht | [NULL]          | [NULL]             | [NULL]             | [NULL]                | [NULL]                 |
| Snowflake      | Oracle          | 7                  | 7                  | 3                     | 0                      |
| święta         | swieta          | 2                  | 2                  | 2                     | 0                      |
| [NULL]         |                 | [NULL]             | [NULL]             | [NULL]                | [NULL]                 |
| [NULL]         | [NULL]          | [NULL]             | [NULL]             | [NULL]                | [NULL]                 |
+----------------+-----------------+--------------------+--------------------+-----------------------+------------------------+
```

The next example returns `FALSE` if the distance between two strings is at least 2. Because `max_distance` is
specified as `2`, the function stops calculating the distance once the distance is determined to be at least 2. (The actual
distance between the input strings is 6.)

```sqlexample
SELECT EDITDISTANCE('future', 'past', 2) < 2;
```

```output
+---------------------------------------+
| EDITDISTANCE('FUTURE', 'PAST', 2) < 2 |
|---------------------------------------|
| False                                 |
+---------------------------------------+
```

---
title: EMAIL_INTEGRATION_CONFIG
source: https://docs.snowflake.com/en/sql-reference/functions/email_integration_config.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Integration Configuration)

# EMAIL_INTEGRATION_CONFIG

Returns a JSON object that specifies the email notification integration, recipients, and subject line to use for an email
notification. This is a helper function that you use to construct an integration configuration object for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md) ,
    [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) ,
    [INTEGRATION](integration.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
  '<email_integration_name>',
  '<subject>',
  <array_of_email_addresses_for_to_line> )
```

```sqlsyntax
SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
  '<email_integration_name>',
  '<subject>',
  <array_of_email_addresses_for_to_line>,
  <array_of_email_addresses_for_cc_line>,
  <array_of_email_addresses_for_bcc_line> )
```

## Arguments

`'email_integration_name'`
:   Name of the email notification integration to use.

`'subject'`
:   Subject of the email message.

    The subject cannot exceed 256 characters in length.

`array_of_email_addresses_for_to_line` . `array_of_email_addresses_for_cc_line` . `array_of_email_addresses_for_bcc_line`
:   ARRAYs of the email addresses to include in the “To:”, “Cc:”, and “Bcc:” lines of the message.

    You must specify email addresses of users in the current account. These users must
    [verify their email addresses](../../user-guide/notifications/email-notifications.md).

    If the ALLOWED_RECIPIENTS property is set to a list of email addresses in the
    [email notification integration](../../user-guide/notifications/email-notifications.md), the email addresses must be in that list.

    Call the [ARRAY_CONSTRUCT](array_construct.md) function to construct each ARRAY.

    > **Note:**
    >
    > You cannot send an email notification if you only specify the “Bcc:” line.

## Returns

A JSON-formatted string that specifies a notification integration for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure to send.

For example, suppose that you pass in the notification integration name `'my_email_int'` with the following subject line and
list of email addresses for the “To:” line:

```sqlexample
SELECT SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
  'my_email_int',
  'Updates',
   ARRAY_CONSTRUCT('person_a@example.com', 'person_b@example.com')
)
```

The function returns the following JSON-formatted string:

```json
'{"my_email_int":{"subject":"Updates","toAddress":["person_a@example.com","person_b@example.com"]}}'
```

The following example sends the same notification with an additional list of email addresses for the “Cc:” line. Note that this
example passes NULL for the “Bcc:” addresses to exclude the `bccAddress` property from the returned object.

```sqlexample
SELECT SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
  'my_email_int',
  'Updates',
   ARRAY_CONSTRUCT('person_a@example.com', 'person_b@example.com'),
   ARRAY_CONSTRUCT('cc_person_a@example.com'),
   NULL
)
```

The function returns the following JSON-formatted string:

```json
'{"my_email_int":{"subject":"Updates","toAddress":["person_a@example.com","person_b@example.com"],"ccAddress":["cc_person_a@snowflake.com"]}}'
```

The following example sends the same notification with an additional list of email addresses for the “Bcc:” line:

```sqlexample
SELECT SNOWFLAKE.NOTIFICATION.EMAIL_INTEGRATION_CONFIG(
  'my_email_int',
  'Updates',
   ARRAY_CONSTRUCT('person_a@example.com', 'person_b@example.com'),
   ARRAY_CONSTRUCT('cc_person_a@example.com'),
   ARRAY_CONSTRUCT('bcc_person_b@example.com')
)
```

The function returns the following JSON-formatted string:

```json
'{"my_email_int":{"subject":"Updates","toAddress":["person_a@example.com","person_b@example.com"],"ccAddress":["cc_person_a@example.com"],"bccAddress":["bcc_person_b@example.com"]}}'
```

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/embed_text_1024-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_EMBED](ai_embed.md) is the latest version of this function.
> Use AI_EMBED for the latest functionality.
> You can continue to use EMBED_TEXT_1024 (SNOWFLAKE.CORTEX).

Creates a vector embedding of 1024 dimensions from text.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.EMBED_TEXT_1024( <model>, <text> )
```

## Arguments

`model`
:   A string specifying the vector embedding model to be used to generate the embedding. This must be one of the following values.

    > * `snowflake-arctic-embed-l-v2.0`
    > * `snowflake-arctic-embed-l-v2.0-8k`
    > * `nv-embed-qa-4`
    > * `multilingual-e5-large`
    > * `voyage-multilingual-2`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`text`
:   The text for which an embedding should be calculated.

## Returns

A vector embedding of type VECTOR.

## Access control requirements

You must use a role that has been granted the SNOWFLAKE.CORTEX_USER database role *or* the SNOWFLAKE.CORTEX_EMBED_USER
database role to call this function. See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on granting one of
these privileges.

You must also have the USAGE privilege on the SNOWFLAKE.CORTEX schema to call this function.

## Example

In this example, a vector embedding is generated for the phrase `hello world` using the `snowflake-arctic-embed-l-v2.0` model:

```sqlexample
SELECT SNOWFLAKE.CORTEX.EMBED_TEXT_1024('snowflake-arctic-embed-l-v2.0', 'hello world');
```

In this example, a vector embedding is generated for the Spanish phrase `hola mundo` using the `snowflake-arctic-embed-l-v2.0` model:

```sqlexample
SELECT SNOWFLAKE.CORTEX.EMBED_TEXT_1024('snowflake-arctic-embed-l-v2.0', 'hola mundo');
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: EMBED_TEXT_768 (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/embed_text-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# EMBED_TEXT_768 (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_EMBED](ai_embed.md) is the latest version of this function.
> Use AI_EMBED for the latest functionality.
> You can continue to use EMBED_TEXT_768 (SNOWFLAKE.CORTEX).

Creates a vector embedding of 768 dimensions from English-language text.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.EMBED_TEXT_768( <model>, <text> )
```

## Arguments

`model`
:   A string specifying the vector embedding model to be used to generate the embedding. This must be one of the following values.

    > * `snowflake-arctic-embed-m-v1.5`
    > * `snowflake-arctic-embed-m`
    > * `e5-base-v2`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`text`
:   The text for which an embedding should be calculated.

## Returns

A vector embedding of type VECTOR.

## Access control requirements

You must use a role that has been granted the SNOWFLAKE.CORTEX_USER database role *or* the SNOWFLAKE.CORTEX_EMBED_USER
database role to call this function. See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on granting one of
these privileges.

You must also have the USAGE privilege on the SNOWFLAKE.CORTEX schema to call this function.

## Examples

In this example, a vector embedding is generated for the phrase `hello world` using the `snowflake-arctic-embed-m-v1.5` model:

```sqlexample
SELECT SNOWFLAKE.CORTEX.EMBED_TEXT_768('snowflake-arctic-embed-m-v1.5', 'hello world');
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: ENCRYPT
source: https://docs.snowflake.com/en/sql-reference/functions/encrypt.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# ENCRYPT

Encrypts a VARCHAR or BINARY value using a VARCHAR passphrase.

See also:
:   [ENCRYPT_RAW](encrypt_raw.md) , [DECRYPT](decrypt.md) , [DECRYPT_RAW](decrypt_raw.md) , [TRY_DECRYPT](try_decrypt.md) , [TRY_DECRYPT_RAW](try_decrypt_raw.md)

## Syntax

```sqlsyntax
ENCRYPT( <value_to_encrypt> , <passphrase> ,
         [ [ <additional_authenticated_data> , ] <encryption_method> ]
       )
```

## Arguments

**Required:**

`value_to_encrypt`
:   The VARCHAR or BINARY value to encrypt.

`passphrase`
:   The passphrase to use to encrypt/decrypt the data. The passphrase is always a VARCHAR, regardless of whether the
    `value_to_encrypt` is VARCHAR or BINARY.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the ENCRYPT function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

## Returns

The data type of the returned value is BINARY.

Although only a single value is returned, that value contains two or three concatenated fields:

* The first field is an initialization vector (IV). The IV is generated randomly using a CTR-DRBG random number
  generator. Both encryption and decryption use the IV.
* The second field is the ciphertext (encrypted value) of the `value_to_encrypt`.
* If the encryption mode is AEAD-enabled, then the returned value also contains a third field, which is the AEAD tag.

The IV and tag size depend on the encryption mode.

## Usage notes

* To decrypt data encrypted by `ENCRYPT()`, use `DECRYPT()`. Do not use `DECRYPT_RAW()`.
* To decrypt data encrypted by `ENCRYPT_RAW()`, use `DECRYPT_RAW()`. Do not use `DECRYPT()`.
* The function’s parameters are masked for security. Sensitive information such as the following is
  not visible in the query log and is not visible to Snowflake:

  + The string or binary value to encrypt or decrypt.
  + The passphrase or key.
* The functions use a FIPS-compliant cryptographic library to effectively perform the encryption and decryption.
* The passphrase or key used to decrypt a piece of data must be the same as the passphrase or key used to encrypt that
  data.

* The passphrase can be of arbitrary length, even 0 (the empty string). However, Snowflake
  strongly recommends using a passphrase that is at least 8 bytes.
* Snowflake recommends that the passphrase follow general best practices for passwords, such as using a mix of
  uppercase letters, lowercase letters, numbers, and punctuation.
* The passphrase is not used directly to encrypt/decrypt the input. Instead, the passphrase is used to derive an
  encryption/decryption key, which is always the same for the same passphrase. Snowflake uses the
  <https://en.wikipedia.org/wiki/PBKDF2> key-derivation function with a Snowflake-internal seed to compute the
  encryption/decryption key from the given passphrase.

  Because of this key derivation, the encrypt/decrypt function cannot be used to:

  + Decrypt data that was externally encrypted.
  + Encrypt data that will be externally decrypted.

  To do either of these, use [ENCRYPT_RAW](encrypt_raw.md) or [DECRYPT_RAW](decrypt_raw.md).
* Because the initialization vector is always regenerated randomly, calling `ENCRYPT()` with the same
  `value_to_encrypt` and `passphrase` does not return the same result every time. If you need to
  generate the same output for the same `value_to_encrypt` and `passphrase`, consider using
  [ENCRYPT_RAW](encrypt_raw.md) and specifying the initialization vector.

## Examples

This example encrypts a VARCHAR with a simple passphrase.

> ```sqlexample
> SELECT encrypt('Secret!', 'SamplePassphrase');
> ```
>
> The output is text that is not easy for humans to read.

The code below shows a simple example of encryption and decryption:

> ```sqlexample
> SET passphrase='poiuqewjlkfsd';
> ```
>
> ```sqlexample
> SELECT
>     TO_VARCHAR(
>         DECRYPT(
>             ENCRYPT('Patient tested positive for COVID-19', $passphrase),
>             $passphrase),
>         'utf-8')
>         AS decrypted
>     ;
> +--------------------------------------+
> | DECRYPTED                            |
> |--------------------------------------|
> | Patient tested positive for COVID-19 |
> +--------------------------------------+
> ```

This example uses a BINARY value for the `value_to_encrypt` and for the authenticated data.

> ```sqlexample
> SELECT encrypt(to_binary(hex_encode('Secret!')), 'SamplePassphrase', to_binary(hex_encode('Authenticated Data')));
> ```
>
> The output is:
>
> ```sqlexample
> 6E1361E297C22969345F978A45205E3E98EB872844E3A0F151713894C273FAEF50C365S
> ```

This example shows how to use an alternative mode (`CBC`) as part of the specifier for the encryption method.
This encryption method also specifies a padding rule (`PKCS`). In this example, the AAD parameter is NULL.

> ```sqlexample
> SELECT encrypt(to_binary(hex_encode('secret!')), 'sample_passphrase', NULL, 'aes-cbc/pad:pkcs') as encrypted_data;
> ```

This example shows how to use the AAD:

```sqlexample
SELECT
    TO_VARCHAR(
        DECRYPT(
            ENCRYPT('penicillin', $passphrase, 'John Dough AAD', 'aes-gcm'),
            $passphrase, 'John Dough AAD', 'aes-gcm'),
        'utf-8')
        AS medicine
    ;
+------------+
| MEDICINE   |
|------------|
| penicillin |
+------------+
```

If you pass the wrong AAD, decryption fails:

```sqlexample
SELECT
    DECRYPT(
        ENCRYPT('penicillin', $passphrase, 'John Dough AAD', 'aes-gcm'),
        $passphrase, 'wrong patient AAD', 'aes-gcm') AS medicine
    ;
```

```none
100311 (22023): Decryption failed. Check encrypted data, key, AAD, or AEAD tag.
```

---
title: ENCRYPT_RAW
source: https://docs.snowflake.com/en/sql-reference/functions/encrypt_raw.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# ENCRYPT_RAW

Encrypts a BINARY value using a BINARY key.

See also:
:   [ENCRYPT](encrypt.md) , [DECRYPT](decrypt.md) , [DECRYPT_RAW](decrypt_raw.md) , [TRY_DECRYPT](try_decrypt.md) , [TRY_DECRYPT_RAW](try_decrypt_raw.md)

## Syntax

```sqlsyntax
ENCRYPT_RAW( <value_to_encrypt> , <key> , <iv> ,
         [ [ <additional_authenticated_data> , ] <encryption_method> ]
       )
```

## Arguments

**Required:**

`value_to_encrypt`
:   The binary value to encrypt.

`key`
:   The key to use to encrypt/decrypt the data. The key must be a BINARY value. The key can be any value as long as the
    length is correct. For example, for AES128, the key must be 128 bits (16 bytes), and for AES256, the key must be
    256 bits (32 bytes).

    The key used to encrypt the value must be used to decrypt the value.

`iv`
:   This parameter contains the Initialization Vector (IV) to use to encrypt and decrypt this piece of
    data. The IV must be a BINARY value of a specific length:

    * For GCM, this field must be 96 bits (12 bytes). While the GCM encryption method allows this field to be a different
      size, Snowflake currently only supports 96 bits.
    * For CCM, this should be 56 bits (7 bytes).
    * For ECB, this parameter is unneeded.
    * For all other supported encryption modes, this should be 128 bits (16 bytes).

    This value is used to initialize the first encryption round. You should never use the same IV and key combination
    more than once, especially for encryption modes like GCM.

    If this parameter is set to NULL, the implementation will choose a new pseudo-random IV during each call.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the [ENCRYPT](encrypt.md) function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

## Returns

The function returns the encrypted value. The data type of the returned value is VARIANT.

Although only a single value is returned, that value contains two or three fields:

* The first field is the initialization vector (IV). Both encryption and decryption use the IV.
* The second field is the ciphertext (encrypted value) of the `value_to_encrypt`.
* If the encryption mode is AEAD-enabled, then the returned value also contains a third field, which is the AEAD tag.

The IV and tag size depend on the encryption mode.

All 3 fields within the VARIANT are of type BINARY.

## Usage notes

* To decrypt data encrypted by `ENCRYPT()`, use `DECRYPT()`. Do not use `DECRYPT_RAW()`.
* To decrypt data encrypted by `ENCRYPT_RAW()`, use `DECRYPT_RAW()`. Do not use `DECRYPT()`.
* The function’s parameters are masked for security. Sensitive information such as the following is
  not visible in the query log and is not visible to Snowflake:

  + The string or binary value to encrypt or decrypt.
  + The passphrase or key.
* The functions use a FIPS-compliant cryptographic library to effectively perform the encryption and decryption.
* The passphrase or key used to decrypt a piece of data must be the same as the passphrase or key used to encrypt that
  data.

## Examples

This example shows encryption and decryption.

> For readability, set the BINARY_OUTPUT_FORMAT to HEX:
>
> ```sqlexample
> ALTER SESSION SET BINARY_OUTPUT_FORMAT='HEX';
> ```
>
> Create a table and load it.
>
> > **Caution:**
> >
> > To simplify this example, the encryption/decryption key is stored in the table with the value that has
> > been encrypted. This is insecure; the key should never be stored as an unencrypted value in the table
> > that stores the encrypted data.
>
> ```sqlexample
> CREATE OR REPLACE TABLE binary_table (
>     encryption_key BINARY,   -- DO NOT STORE REAL ENCRYPTION KEYS THIS WAY!
>     initialization_vector BINARY(12), -- DO NOT STORE REAL IV'S THIS WAY!!
>     binary_column BINARY,
>     encrypted_binary_column VARIANT,
>     aad_column BINARY);
>
> INSERT INTO binary_table (encryption_key,
>                           initialization_vector,
>                           binary_column,
>                           aad_column)
>     SELECT SHA2_BINARY('NotSecretEnough', 256),
>             SUBSTR(TO_BINARY(HEX_ENCODE('AlsoNotSecretEnough'), 'HEX'), 0, 12),
>             TO_BINARY(HEX_ENCODE('Bonjour'), 'HEX'),
>             TO_BINARY(HEX_ENCODE('additional data'), 'HEX')
>     ;
> ```
>
> Encrypt:
>
> ```sqlexample
> UPDATE binary_table SET encrypted_binary_column =
>     ENCRYPT_RAW(binary_column,
>         encryption_key,
>         initialization_vector,
>         aad_column,
>         'AES-GCM');
> +------------------------+-------------------------------------+
> | number of rows updated | number of multi-joined rows updated |
> |------------------------+-------------------------------------|
> |                      1 |                                   0 |
> +------------------------+-------------------------------------+
> ```
>
> This shows the corresponding call to the `DECRYPT_RAW()` function. The initialization vector (IV)
> is taken from the encrypted value; you do not need to store the initialization vector separately. Similarly,
> the AEAD tag is also read from the encrypted value.
>
> > **Caution:**
> >
> > To simplify this example, the encryption/decryption key is read from the table with the value that has
> > been encrypted. This is insecure; the key should never be stored as an unencrypted value in the table
> > that stores the encrypted data.
>
> ```sqlexample
> SELECT 'Bonjour' as original_value,
>        binary_column,
>        hex_decode_string(to_varchar(binary_column)) as decoded,
>        encrypted_binary_column,
>        decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                   encryption_key,
>                   as_binary(get(encrypted_binary_column, 'iv')),
>                   aad_column,
>                   'AES-GCM',
>                   as_binary(get(encrypted_binary_column, 'tag')))
>            as decrypted,
>        hex_decode_string(to_varchar(decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                   encryption_key,
>                   as_binary(get(encrypted_binary_column, 'iv')),
>                   aad_column,
>                   'AES-GCM',
>                   as_binary(get(encrypted_binary_column, 'tag')))
>                   ))
>            as decrypted_and_decoded
>     FROM binary_table;
> +----------------+----------------+---------+---------------------------------------------+----------------+-----------------------+
> | ORIGINAL_VALUE | BINARY_COLUMN  | DECODED | ENCRYPTED_BINARY_COLUMN                     | DECRYPTED      | DECRYPTED_AND_DECODED |
> |----------------+----------------+---------+---------------------------------------------+----------------+-----------------------|
> | Bonjour        | 426F6E6A6F7572 | Bonjour | {                                           | 426F6E6A6F7572 | Bonjour               |
> |                |                |         |   "ciphertext": "CA2F4A383F6F55",           |                |                       |
> |                |                |         |   "iv": "416C736F4E6F745365637265",         |                |                       |
> |                |                |         |   "tag": "91F28FBC6A2FE9B213D1C44B8D75D147" |                |                       |
> |                |                |         | }                                           |                |                       |
> +----------------+----------------+---------+---------------------------------------------+----------------+-----------------------+
> ```
>
> The previous example duplicated a long call to `DECRYPT_RAW()`. You can use a WITH clause to reduce
> the duplication:
>
> ```sqlexample
> WITH
>     decrypted_but_not_decoded as (
>         decrypt_raw(as_binary(get(encrypted_binary_column, 'ciphertext')),
>                       encryption_key,
>                       as_binary(get(encrypted_binary_column, 'iv')),
>                       aad_column,
>                       'AES-GCM',
>                       as_binary(get(encrypted_binary_column, 'tag')))
>     )
> SELECT 'Bonjour' as original_value,
>        binary_column,
>        hex_decode_string(to_varchar(binary_column)) as decoded,
>        encrypted_binary_column,
>        decrypted_but_not_decoded,
>        hex_decode_string(to_varchar(decrypted_but_not_decoded))
>            as decrypted_and_decoded
>     FROM binary_table;
> +----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------+
> | ORIGINAL_VALUE | BINARY_COLUMN  | DECODED | ENCRYPTED_BINARY_COLUMN                     | DECRYPTED_BUT_NOT_DECODED | DECRYPTED_AND_DECODED |
> |----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------|
> | Bonjour        | 426F6E6A6F7572 | Bonjour | {                                           | 426F6E6A6F7572            | Bonjour               |
> |                |                |         |   "ciphertext": "CA2F4A383F6F55",           |                           |                       |
> |                |                |         |   "iv": "416C736F4E6F745365637265",         |                           |                       |
> |                |                |         |   "tag": "91F28FBC6A2FE9B213D1C44B8D75D147" |                           |                       |
> |                |                |         | }                                           |                           |                       |
> +----------------+----------------+---------+---------------------------------------------+---------------------------+-----------------------+
> ```

---
title: ENDSWITH
source: https://docs.snowflake.com/en/sql-reference/functions/endswith.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# ENDSWITH

Returns TRUE if the first expression ends with the second expression. Both expressions must be text or binary expressions.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

## Syntax

```sqlsyntax
ENDSWITH( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   The string to search in.

`expr2`
:   The string to search for at the end of `expr1`.

## Returns

Returns a BOOLEAN or NULL:

* Returns TRUE if `expr2` ends with `expr1`.
* Returns FALSE if `expr2` does not end with `expr1`.
* Returns NULL if either input expression is NULL.

## Collation details

The [collation specifications](../collation.md) of all input arguments must be compatible.

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

These examples use the ENDSWITH function.

### Determine whether column values contain a string

Create a table with a single column that contains string values.

```sqlexample
CREATE OR REPLACE TABLE strings_test (s VARCHAR);

INSERT INTO strings_test values
  ('coffee'),
  ('ice tea'),
  ('latte'),
  ('tea'),
  (NULL);

SELECT * from strings_test;
```

```output
+---------+
| S       |
|---------|
| coffee  |
| ice tea |
| latte   |
| tea     |
| NULL    |
+---------+
```

Determine whether the values in column `s` end with the string `te`:

```sqlexample
SELECT * FROM strings_test WHERE ENDSWITH(s, 'te');
```

```output
+-------+
| S     |
|-------|
| latte |
+-------+
```

### Use ENDSWITH with collation

In the following example, ENDSWITH returns different results for the same argument
values with different collation specifications.

```sqlexample
SELECT ENDSWITH(COLLATE('nñ', 'en-ci-ai'), 'n'),
       ENDSWITH(COLLATE('nñ', 'es-ci-ai'), 'n');
```

```output
+------------------------------------------+------------------------------------------+
| ENDSWITH(COLLATE('NÑ', 'EN-CI-AI'), 'N') | ENDSWITH(COLLATE('NÑ', 'ES-CI-AI'), 'N') |
|------------------------------------------+------------------------------------------|
| True                                     | False                                    |
+------------------------------------------+------------------------------------------+
```

---
title: ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/entity_sentiment-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_SENTIMENT](ai_sentiment.md) is the latest version of this function.
> Use AI_SENTIMENT for the latest functionality.
> You can continue to use ENTITY_SENTIMENT (SNOWFLAKE.CORTEX).

Returns sentiment scores for English-language text, including overall sentiment and specific sentiment for specified entities.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.ENTITY_SENTIMENT(<text> [, <entities> ])
```

## Arguments

`text`
:   A string containing the text for which sentiment scores should be calculated.

`entities`
:   An array containing up to ten entities or aspects for which sentiment scores should be calculated. Each entity is a
    string. For example, if scoring sentiment from a restaurant review, the `entities` array might be `['cost',
    'quality', 'waiting time']`. Entities may be a maximum of 30 characters long.

    This argument is optional. If you do not provide it, the function will return only the overall sentiment.

## Returns

An OBJECT containing a `categories` field. `categories` is an ARRAY of category records. Each category includes these fields:

* `name`: The name of the category.
* `sentiment`: The sentiment of the category: positive, negative, neutral, mixed, or unknown, as a string.

Additionally, an `overall` category contains the overall sentiment of the text.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Example

In this example, a table named `reviews` contains a column named `review_content` containing the text of movie reviews
submitted by users. The query returns a sentiment for several entities from each review.

```sqlexample
SELECT SNOWFLAKE.CORTEX.ENTITY_SENTIMENT(review_content,
    ['concept', 'performance', 'script', 'cinematography', 'soundtrack']),
        review_content FROM reviews LIMIT 10;
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: ESTIMATE_REMAINING_DP_AGGREGATES
source: https://docs.snowflake.com/en/sql-reference/functions/estimate_remaining_dp_aggregates.md
section: SQL Functions
---

Categories:
:   [Differential privacy functions](../functions-differential-privacy.md) , [Table functions](../functions-table.md)

# ESTIMATE_REMAINING_DP_AGGREGATES

Returns the estimated number of aggregation functions that can be run before the limit of a privacy budget is reached. The number of
remaining aggregates is *estimated*. The actual number of aggregate functions allowed before reaching the privacy budget limit might vary in
practice, depending on various factors.

This function is useful for both implementing differential privacy and querying privacy-protected tables:

* Analysts can use this function to estimate roughly how much privacy budget they have left in a budget window.
* Privacy policy owners can use this function to
  [fine-tune their privacy budget settings](../../user-guide/diff-privacy/differential-privacy-admin-adjust.md) so the limit of a privacy budget is
  appropriate for every user.

The privacy budget is calculated per aggregate function, not per query. So the query
`SELECT SUM(age), COUNT(age) FROM T GROUP BY STATE;` incurs twice as much privacy loss as the query `SELECT SUM(age) FROM T;` (that is,
the query ‘costs’ twice as much). In general, all aggregate functions cost the same: the value of the `MAX_BUDGET_PER_AGGREGATE`
parameter in the body of the privacy policy. Note that a GROUP BY clause is *not* considered an aggregation function, and does not incur
privacy loss.

The function also returns the budget spent (that is, the current cumulative privacy loss), but Snowflake recommends using the function to
focus on the estimated remaining budget rather than the budget spent. The budget spent is not a linear function (number of aggregations \*
cost per aggregation), but rather a *sub-linear* function. This means that the total cost of additional aggregations decreases with use
during a budget window. This is why the estimated number of remaining aggregates is larger than the formula (remaining budget of privacy
loss) / (privacy loss per function).

## Syntax

```sqlsyntax
SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('<table_name>')
```

## Arguments

`table_name`
:   The name of the table protected by a differential privacy policy. The function returns privacy budget data based on the queries that you
    have run against this table since the last budget refresh.

## Output

The function returns a table with the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `NUMBER_OF_REMAINING_DP_AGGREGATES` | INT | The estimated number of remaining aggregate functions that an analyst can call before exceeding the privacy budget limit. |
| `BUDGET_LIMIT` | DECIMAL | The current limit of the privacy budget protecting the specified table, as defined in the privacy policy.  To adjust the privacy budget limit, see [Set privacy settings for a privacy budget](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md). |
| `BUDGET_WINDOW` | STRING | The refresh period of the privacy budget, that is, how often the cumulative privacy loss is reset to 0. Defined in the privacy policy protecting the table.  To adjust the budget window, see [Modify the refresh period](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md). |
| `BUDGET_SPENT` | DECIMAL | The cumulative privacy loss incurred by the current user using the current role during the current budget window. |

## Access control privileges

You need the following privileges to run this function:

* SELECT privilege on the specified table.
* USAGE privilege on the database and schema of the specified table.

## Usage notes

* Estimates are based on the queries run by the user who is executing the function. A query is associated with a privacy budget based on
  several conditions, so be sure the environment you use to execute this function is exactly the same as the one used to execute the queries
  (for example, user, role, and account).
* If you’re running a query that uses multiple tables, you should run ESTIMATE_REMAINING_DP_AGGREGATES once per table, then use the lowest
  `NUMBER_OF_REMAINING_DP_AGGREGATES` value as the estimated usage cap.
* Empty output indicates that the table is not protected by differential privacy (that is, does not have a privacy policy assigned to it).

## Examples

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.DATA_PRIVACY.ESTIMATE_REMAINING_DP_AGGREGATES('my_table'));
```

```output
+-----------------------------------+--------------+---------------+--------------+
| NUMBER_OF_REMAINING_DP_AGGREGATES | BUDGET_LIMIT | BUDGET_WINDOW | BUDGET_SPENT |
|-----------------------------------+--------------+---------------+--------------|
|                 994               |     233      |     WEEKLY    |     1.8      |
+-----------------------------------+--------------+---------------+--------------+
```

For an extended example that shows how to use the ESTIMATE_REMAINING_DP_AGGREGATES function to see the effects of queries, see
[Tracking privacy budget spending](../../user-guide/diff-privacy/differential-privacy-analyst.md).

---
title: EXECUTE_AI_EVALUATION
source: https://docs.snowflake.com/en/sql-reference/functions/execute_ai_evaluation.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# EXECUTE_AI_EVALUATION

Start or get the status of a Cortex Agent evaluation run.

For more information on Cortex Agent evaluations, see [Cortex Agent evaluations](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

See also:
:   [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](get_ai_record_trace-snowflake-local.md) , [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](get_ai_evaluation_data-snowflake-local.md) , [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](get_ai_observability_logs-snowflake-local.md)

## Syntax

```sqlsyntax
EXECUTE_AI_EVALUATION( <evaluation_job> , <run_parameters> , <config_file_path> )
```

## Arguments

`evaluation_job`
:   One of the following values:

    > * ‘START’: Starts an evaluation
    > * ‘STATUS’: Retrieves the status of an evaluation

`run_parameters`
:   A SQL [OBJECT](../data-types-semistructured.md) value that contains the following key:

    > * `run_name`: The name of the run to perform the `evaluation_job` operation on.

`config_file_path`
:   A stage file path pointing to an agent evaluation configuration. This path can’t be a signed URL. For the full configuration YAML specification, see [Agent Evaluation YAML specification](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

## Returns

The return value of this function depends on the `evaluation_job`:

> * ‘START’ returns a single string message, indicating whether the SQL execution succeeded or failed.
> * ‘STATUS’ returns a table containing information on the current state of the evaluation run.

The table returned by the ‘STATUS’ evaluation job has the following columns:

| Name | Type | Description |
| --- | --- | --- |
| RUN_NAME | VARCHAR | The name of the evaluation run. |
| AGENT_NAME | VARCHAR | The (unqualified) name of the agent being evaluated. |
| AGENT_TYPE | VARCHAR | The type of agent being evaluated. |
| STATUS | VARCHAR | The current status of the evaluation run. |
| STATUS_DETAILS | ARRAY | An array of error messages that occured during this run. |

Values in the STATUS column are one of:

Run status

| Status | Description |
| --- | --- |
| CREATED | The run has been created but not started. |
| INVOCATION_IN_PROGRESS | The run invocation is in the process of generating the output and the traces. |
| INVOCATION_COMPLETED | The run invocation completed with all outputs and traces created. |
| INVOCATION_PARTIALLY_COMPLETED | The run invocation is partially completed due to failures in application invocation and trace generation. |
| COMPUTATION_IN_PROGRESS | The metric computation is in progress. |
| COMPLETED | The metric computation is completed with detailed outputs and traces. |
| PARTIALLY_COMPLETED | The run is partially completed due to failures during the metric computation. |
| CANCELLED | The run has been cancelled. |

## Access control requirements

For the full access control requirements to conduct a Cortex Agent evaluation, see [Cortex Agent evaluatons – Access control requirements](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

## Examples

The following example starts a run called `run-1` using the agent evaluation configuration from `@eval_db.eval_schema.metrics/agent_evaluation_config.yaml`:

```sqlexample
CALL EXECUTE_AI_EVALUATION(
  'START',
  OBJECT_CONSTRUCT('run_name', 'run-1'),
  '@eval_db.eval_schema.metrics/agent_evaluation_config.yaml'
);
```

The following example queries the status of the evaluation run `run-1` using the agent configuration from `@eval_db.eval_schema.metrics/agent_evaluation_config.yaml`:

```sqlexample
CALL EXECUTE_AI_EVALUATION(
  'STATUS',
  OBJECT_CONSTRUCT('run_name', 'run-1'),
  '@eval_db.eval_schema.metrics/agent_evaluation_config.yaml'
);
```

---
title: EXP
source: https://docs.snowflake.com/en/sql-reference/functions/exp.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# EXP

Computes Euler’s number `e` raised to a floating-point value.

## Syntax

```sqlsyntax
EXP( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT or DECFLOAT.

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

> ```sqlexample
> SELECT EXP(1), EXP(LN(10));
> -------------+-------------+
>    EXP(1)    | EXP(LN(10)) |
> -------------+-------------+
>  2.718281828 | 10          |
> -------------+-------------+
> ```

---
title: EXPLAIN_GRANTABLE_PRIVILEGES
source: https://docs.snowflake.com/en/sql-reference/functions/explain_grantable_privileges.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# EXPLAIN_GRANTABLE_PRIVILEGES

Returns a JSON string representing all grantable privileges for each object type in Snowflake.
This function provides comprehensive information about which privileges can be granted on different
object types, including the available grant types for each privilege.

See also:
:   [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md) , [GRANT CALLER](../sql/grant-caller.md) ,

## Syntax

```sqlsyntax
EXPLAIN_GRANTABLE_PRIVILEGES(
  [ grantee => '<grantee_type>' ]
  [, object_type => '<object_type_name>' ]
  [, grant_type => '<grant_type_name>' ])
```

## Arguments

All arguments are optional and use named parameter syntax:

`grantee => 'grantee_type'`
:   Filter results by grantee type. Valid values:

    * `ROLE`
    * `APPLICATION`

    Default: `ROLE`

    The grantee type determines which privileges are available. For example, applications cannot
    have individual ownership of objects.

`object_type => 'object_type_name'`
:   Filter results to a single object type. Accepts the singular form of the object type name
    (for example, `'DATABASE'`, `'TABLE'`, `'SCHEMA'`). The text is case-insensitive.

`grant_type => 'grant_type_name'`
:   Filter results to privileges that support a specific grant type. Valid values:

    * `'INDIVIDUAL'` — Grants on individual objects. See [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md).
    * `'ALL'` — Bulk grants on all current objects (for example, `GRANT ... ON ALL TABLES IN SCHEMA`).
      See [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md).
    * `'FUTURE'` — Bulk grants on future objects (for example, `GRANT ... ON FUTURE TABLES IN SCHEMA`).
      See [Future grants on database or schema objects](../sql/grant-privilege.md).
    * `'INHERITED'` — Bulk grants on both current and future objects in a container (combines `ALL` and `FUTURE`).
      See [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md).
    * `'CALLER'` — [Caller grants](../../developer-guide/restricted-callers-rights.md) on individual objects.
      See [GRANT CALLER](../sql/grant-caller.md).
    * `'INHERITED_CALLER'` — Bulk caller grants on all current and future objects in a container
      (for example, `GRANT INHERITED CALLER ... ON ALL TABLES IN SCHEMA`).
      See [GRANT CALLER](../sql/grant-caller.md).

    The text is case-insensitive.

## Returns

The function returns a VARCHAR containing a JSON array. Each element in the array is a JSON object that represents
an object type and has the following structure:

```json
{
  "parent": "<parent_object_type>",
  "singular": "<singular_name>",
  "plural": "<plural_name>",
  "privileges": {
    "<privilege_name>": ["<grant_type>", /* ... additional grant types */],
    /* ... additional privileges */
  }
}
```

**JSON Fields:**

* `parent` — The parent object type in the object hierarchy (for example, SCHEMA is the parent of TABLE).
  The string is empty for top-level objects like ACCOUNT.
* `singular` — The singular form of the object type name (for example, DATABASE). Used for individual grants.
* `plural` — The plural form of the object type name (for example, DATABASES). Used for bulk grants.
* `privileges` — A map where each key is a privilege name and each value is an array of grant type
  names indicating how that privilege can be granted.

## Usage notes

* All arguments must be constant expressions. You cannot pass column values or other non-constant expressions.
* If no arguments are provided, the function returns all grantable privileges for roles across all object types.

## Examples

The following examples call the EXPLAIN_GRANTABLE_PRIVILEGES function:

### Get all grantable privileges for roles

Return all object types and their grantable privileges for roles:

```sqlexample
CALL EXPLAIN_GRANTABLE_PRIVILEGES();
```

### Get privileges for a specific object type

Return only the privileges for the `'DATABASE'` object type:

```sqlexample
CALL EXPLAIN_GRANTABLE_PRIVILEGES(object_type => 'DATABASE');
```

Example output:

```json
[
  {
    "parent": "ACCOUNT",
    "singular": "DATABASE",
    "plural": "DATABASES",
    "privileges": {
      "APPLYBUDGET": ["ALL", "FUTURE", "INDIVIDUAL", "INHERITED"],
      "CREATE SCHEMA": ["INDIVIDUAL"],
      "IMPORTED PRIVILEGES": ["INDIVIDUAL"],
      "MODIFY": ["ALL", "FUTURE", "INDIVIDUAL", "INHERITED"],
      "MONITOR": ["ALL", "FUTURE", "INDIVIDUAL", "INHERITED"],
      "OWNERSHIP": ["INDIVIDUAL"],
      "REFERENCE_USAGE": ["ALL", "FUTURE", "INDIVIDUAL", "INHERITED"],
      "USAGE": ["ALL", "FUTURE", "INDIVIDUAL", "INHERITED"]
    }
  }
]
```

### Filter by grantee type

Return privileges available for applications:

```sqlexample
CALL EXPLAIN_GRANTABLE_PRIVILEGES(grantee => 'APPLICATION');
```

Applications can’t have individual ownership, so `OWNERSHIP` only shows grant types
such as `'ALL'`, `'FUTURE'`, and `'INHERITED'`.

---
title: EXPLAIN_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/explain_json.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# EXPLAIN_JSON

This function converts an EXPLAIN plan from JSON to a table. The output is the same as the output of the command `EXPLAIN USING TABULAR <statement>`.

See also:
:   [SYSTEM$EXPLAIN_PLAN_JSON](system_explain_plan_json.md) , [SYSTEM$EXPLAIN_JSON_TO_TEXT](system_explain_json_to_text.md)

## Syntax

```sqlsyntax
EXPLAIN_JSON( <explain_output_in_json_format> )
```

## Arguments

`explain_output_in_json_format`
:   A string, or an expression that evaluates to a string, containing EXPLAIN output as a JSON-compatible string.
    Typically, this input is the output of the function SYSTEM$EXPLAIN_PLAN_JSON.
    If a literal string is used, it should be surrounded by single quote characters `'`.

## Returns

The function returns a table containing the EXPLAIN output as an ordered set of rows.

The output of this function is equivalent to the output of `EXPLAIN USING TABULAR <sql_statement>`.

## Usage notes

* The input must be a constant expression. You cannot call this function on a column, for example.
* If a string literal is passed as input, the delimiter around the string can be either a single quote `'` or a
  double dollar sign `$$`. If the string literal contains single quotes (and does not contain double dollar
  signs), then delimiting the string with double dollar signs avoids the need to escape the embedded single quote
  characters inside the string.
* The output table can be processed using the [RESULT_SCAN](result_scan.md) function.
* This function converts EXPLAIN information from JSON to tabular format.
  Often, the JSON value is produced directly or indirectly from the [SYSTEM$EXPLAIN_PLAN_JSON](system_explain_plan_json.md) function.
  For example, the output from SYSTEM$EXPLAIN_PLAN_JSON could be stored in a table, then displayed later using this
  EXPLAIN_JSON function.
* Because the output is tabular, this function is classified as a [table function](../functions-table.md).

## Examples

The following example shows how to use this function:

> ```sqlexample
> SELECT * FROM TABLE(
>     EXPLAIN_JSON(
>         SYSTEM$EXPLAIN_PLAN_JSON(
>            'SELECT Z1.ID, Z2.ID FROM Z1, Z2 WHERE Z2.ID = Z1.ID')
>         )
>     );
> +------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------+
> | step | id   | parentOperators | operation   | objects                      | alias | expressions              | partitionsTotal | partitionsAssigned | bytesAssigned |
> |------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------|
> | NULL | NULL |          NULL | GlobalStats | NULL                         | NULL  | NULL                     |               2 |                  2 |          1024 |
> |    1 |    0 |            NULL | Result      | NULL                         | NULL  | Z1.ID, Z2.ID             |            NULL |               NULL |          NULL |
> |    1 |    1 |             [0] | InnerJoin   | NULL                         | NULL  | joinKey: (Z2.ID = Z1.ID) |            NULL |               NULL |          NULL |
> |    1 |    2 |             [1] | TableScan   | TESTDB.TEMPORARY_DOC_TEST.Z2 | NULL  | ID                       |               1 |                  1 |           512 |
> |    1 |    3 |             [1] | JoinFilter  | NULL                         | NULL  | joinKey: (Z2.ID = Z1.ID) |            NULL |               NULL |          NULL |
> |    1 |    4 |             [3] | TableScan   | TESTDB.TEMPORARY_DOC_TEST.Z1 | NULL  | ID                       |               1 |                  1 |           512 |
> +------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------+
> ```

---
title: EXPLAIN_PRIVILEGES
source: https://docs.snowflake.com/en/sql-reference/functions/explain_privileges.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# EXPLAIN_PRIVILEGES

Returns a JSON string that explains which privileges are required to execute a SQL statement.
This function analyzes the authorization requirements for a given SQL statement and returns
them in a structured format showing the required privileges, object types, and object names.

See also:
:   [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md),
    [SHOW GRANTS](../sql/show-grants.md)

## Syntax

```sqlsyntax
EXPLAIN_PRIVILEGES(
  statement => '<sql_statement>'
  [, missing_only => <boolean> ]
  [, for_role => '<role_name>' ])
```

## Arguments

`statement => 'sql_statement'`
:   A string containing the SQL statement to analyze. The statement is analyzed
    to determine which privileges are required to execute it.

`missing_only => boolean`
:   Boolean value that controls the output mode:

    * `false` - Returns all privileges required to execute the statement, regardless of
      whether the current user or specified role has them.
    * `true` — Returns only the privileges that are missing (not currently held by the current
      user or specified role). If all required privileges are present, returns
      `{"authorized": true}`.

    Default: `false`

`for_role => 'role_name'`
:   The name of a role to check privileges for. This argument is used only when
    `missing_only => true`. Returns all privileges missing for the role (and its granted roles)
    to execute the statement.

## Returns

The function returns a VARCHAR value containing a JSON object that describes the required privileges
in a hierarchical structure. The JSON can contain the following node types:

**Permission Node** — Represents a single privilege requirement:

```json
{
  "privilege": "<privilege_name>",
  "objectType": "<object_type>",
  "objectName": "<fully_qualified_object_name>"
}
```

* `privilege` — The name of the required privilege (for example, USAGE, SELECT, OWNERSHIP).
  The special value `<ANY>` indicates that any privilege on the object is sufficient.
* `objectType` — The type of object (for example, DATABASE, TABLE, SCHEMA, ACCOUNT).
* `objectName` — The fully qualified name of the object.

**AND Node** — All contained privileges are required:

```json
{
  "allOf": [
    /* ... permissions or nodes */
  ]
}
```

**OR Node** — At least one of the contained privileges is required:

```json
{
  "oneOf": [
    /* ... permissions or nodes */
  ]
}
```

**Decision Node** — Indicates authorization status

```json
{
  "authorized": true
}
```

* `authorized: true` — All required privileges are present.
* `authorized: false` — Statement cannot be authorized with privilege grants.

## Access control requirements

You must have privileges to refer to the object by name in the SQL statement. Most commonly, this requirement is satisfied by having at
least one privilege on the object. The RESOLVE ALL ON ACCOUNT privilege also meets this requirement.

## Usage notes

* The `statement` argument must be a constant expression. You cannot pass column values or other
  non-constant expressions.
* Multi-statement SQL is not supported. The function accepts only a single SQL statement.
* Some SQL statements are not supported for privilege analysis (for example, GRANT, REVOKE,
  USE ROLE, USE SECONDARY ROLES).
* Some SQL statements have privilege checks that are not supported for privilege analysis. These
  checks will be omitted from the output.
* Some indirect privilege checks are not supported for privilege analysis. These checks will be
  omitted from the output. For example RESOLVE ALL ON ACCOUNT is not included as an option to
  resolve a database.
* When an object cannot be resolved the function returns an error indicating that the statement
  requires access to all objects.
* The privilege `<ANY>` means any privilege on the object is sufficient (for example, for USAGE
  checks where OWNERSHIP would also suffice).

## Examples

The following examples call the EXPLAIN_GRANTABLE_PRIVILEGES function:

### Explain privileges for a DESC command

Show all privileges required to describe a schema:

```sqlexample
CALL EXPLAIN_PRIVILEGES(statement => 'DESC SCHEMA mydb.myschema');
```

Example output:

```json
{
  "allOf": [
    {
      "privilege": "<ANY>",
      "objectType": "DATABASE",
      "objectName": "MYDB"
    },
    {
      "privilege": "MONITOR",
      "objectType": "SCHEMA",
      "objectName": "MYDB.MYSCHEMA"
    }
  ]
}
```

This output indicates that you need any privilege on the database `MYDB` AND the `MONITOR` privilege
on the schema `MYDB.MYSCHEMA`.

### Check only missing privileges

Check what privileges are missing for the current user:

```sqlexample
CALL EXPLAIN_PRIVILEGES(
  statement => 'DROP TABLE mydb.myschema.mytable',
  missing_only => true);
```

If you have all required privileges, returns:

```json
{
  "authorized": true
}
```

If you’re missing privileges, returns only the missing ones:

```json
{
  "allOf": [
    {
      "privilege": "OWNERSHIP",
      "objectType": "TABLE",
      "objectName": "MYDB.MYSCHEMA.MYTABLE"
    }
  ]
}
```

### Check missing privileges for a specific role

Check what privileges a specific role is missing:

```sqlexample
CALL EXPLAIN_PRIVILEGES(
  statement => 'SELECT * FROM mydb.myschema.mytable',
  missing_only => true,
  for_role => 'analyst_role');
```

Determines whether the `analyst_role` (including privileges from its granted roles) has
the necessary privileges to execute the SELECT statement and, if not, returns the
missing privileges.

---
title: EXTERNAL_FUNCTIONS_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/external_functions_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# EXTERNAL_FUNCTIONS_HISTORY

This table function retrieves the history of external functions called by Snowflake for your entire Snowflake account.

> **Note:**
>
> This function can return results only for activity within the last 14 days.

## Syntax

```sqlsyntax
EXTERNAL_FUNCTIONS_HISTORY(
      [ DATE_RANGE_START => <constant_date_expression> ]
      [, DATE_RANGE_END => <constant_date_expression> ]
      [, FUNCTION_SIGNATURE => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_date_expression` , . `DATE_RANGE_END => constant_date_expression`
:   The date/time range, within the last 2 weeks, for which to retrieve the history:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to show the previous 10 minutes of history). For example,
      if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

    History is displayed in increments of 5 minutes, 1 hour, or 24 hours (depending on the length of the specified range).

    If the range falls outside the last 15 days, an error is returned.

`FUNCTION_SIGNATURE => string`
:   A string specifying an external function name and the data types of the arguments to the function. (The data types
    distinguish among overloaded function names.) Only information about that function is returned.

    Put the signature inside single quotes, for example:

    > ```sqlexample
    > function_signature => 'mydb.public.myfunction(integer, varchar)'
    > ```

    Note that the argument data types, but not the argument names, are specified.

    If no signature is specified, then the output includes the total for all external functions in use within the time
    range, and the following columns in the results display NULL:

    * FUNCTION_NAME.
    * ARGUMENTS.
    * FUNCTION_ENDPOINT_URL.
    * SOURCE_CLOUD.
    * SOURCE_REGION.
    * TARGET_CLOUD.
    * TARGET_REGION.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use
  or the function name EXTERNAL_FUNCTIONS_HISTORY must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* The output column named ARGUMENTS includes not only the argument data types, but also the return data type.
  The input parameter named FUNCTION_SIGNATURE should include the data types of the arguments, but not the return data
  type.
* For troubleshooting tips, see Symptom: EXTERNAL_FUNCTIONS_HISTORY returns “…invalid identifier…”.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range for which to return history. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range for which to return history. |
| NAME | TEXT | Name of the function for which to return history. |
| ARGUMENTS | TEXT | Data types of the arguments and of the return value. The data types of the arguments distinguish between overloaded function names. |
| FUNCTION_ENDPOINT_URL | TEXT | HTTPS endpoint that the function calls. This is typically a proxy service. |
| SOURCE_CLOUD | TEXT | Cloud platform from which rows were sent (e.g. `GCP`, `Azure`, or `AWS`). |
| SOURCE_REGION | TEXT | Region from which rows were sent (e.g. `eu-west-1`). |
| TARGET_CLOUD | TEXT | Cloud platform to which rows were sent (e.g. `GCP`, `Azure`, or `AWS`). |
| TARGET_REGION | TEXT | Region to which rows were sent (e.g. `eu-west-1`). |
| INVOCATIONS | NUMBER | Number of times that the remote service was called during the START_TIME and END_TIME window. This includes retries (e.g. due to temporary network problems). |
| SENT_ROWS | NUMBER | Number of rows sent to the external endpoint during the START_TIME and END_TIME window. |
| RECEIVED_ROWS | NUMBER | Number of rows received from the external endpoint during the START_TIME and END_TIME window. |
| SENT_BYTES | NUMBER | Number of bytes sent to the external endpoint during the START_TIME and END_TIME window. |
| RECEIVED_BYTES | NUMBER | Number of bytes received from the external endpoint during the START_TIME and END_TIME window. |

## Examples

Retrieve the history for a 30 minute range, in 5 minute periods, for your account:

> ```sqlexample
> select *
>   from table(information_schema.external_functions_history(
>     date_range_start => to_timestamp_ltz('2020-05-24 12:00:00.000'),
>     date_range_end => to_timestamp_ltz('2020-05-24 12:30:00.000')));
> ```

Retrieve the history for the last 12 hours, in 1 hour periods, for a single external function in your account:

> ```sqlexample
> select *
>   from table(information_schema.external_functions_history(
>     date_range_start => dateadd('hour', -12, current_timestamp()),
>     function_signature => 'mydb.public.myfunction(integer, varchar)'));
> ```

Retrieve the history for the last 14 days, in 1 day periods, for your account:

> ```sqlexample
> select *
>   from table(information_schema.external_functions_history(
>     date_range_start => dateadd('day', -14, current_date()),
>     date_range_end => current_date()));
> ```

Retrieve the history for the last 14 days, in 1 day periods, for a specified function in your account:

> ```sqlexample
> select *
>   from table(information_schema.external_functions_history(
>     date_range_start => dateadd('day', -14, current_date()),
>     date_range_end => current_date(),
>     function_signature => 'mydb.public.myfunction(integer, varchar)'));
> ```

## Troubleshooting

### Symptom: EXTERNAL_FUNCTIONS_HISTORY returns “…invalid identifier…”

Possible Cause:
:   You might not have put the function signature in single quotes. For example, the following
    is wrong because it does not include the quotes:

    ```sqlexample
    select *
      from table(information_schema.external_functions_history(
        function_signature => mydb.public.myfunction(integer, varchar)));
    ```

Possible Solution:
:   Correct this by adding quotation marks around the function signature:

    ```sqlexample
    select *
      from table(information_schema.external_functions_history(
        function_signature => 'mydb.public.myfunction(integer, varchar)'));
    ```

### Symptom: EXTERNAL_FUNCTIONS_HISTORY returns only one row of output, and many of the columns are NULL

Possible Cause:
:   You probably did not include a function signature. If you do not specify a function signature, then
    EXTERNAL_FUNCTION_HISTORY() returns the aggregate values for columns such as INVOCATIONS, SENT ROWS, etc., and
    returns NULL for columns such as the function name, the argument lists, etc.

Possible Solution:
:   If you intended to get information for one function, then include a function signature.

    If you intended to get information for all functions, then the NULL values for some columns are correct,
    and you do not need to fix the query.

---
title: EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/external_table_registration_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY

This table function can be used to query information about the metadata history for an external table, including:

* Files added or removed automatically as part of a metadata refresh.
* Any errors found when refreshing the metadata.

## Syntax

```sqlsyntax
EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY (
      TABLE_NAME => '<string>'
      [, START_TIME => <constant_expr> ] )
```

## Arguments

**Required:**

`TABLE_NAME => 'string'`
:   A string specifying an external table name.

**Optional:**

`START_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 30 days, marking the start of the time range for retrieving metadata update events.

    > **Note:**
    >
    > * If no start time is specified, the function returns all update events within the last 30 days.
    > * If the start time falls outside the last 30 days, the function returns results within the last 30 days.
    > * If the start time is not a timestamp, it is ignored.

## Usage notes

* Returns results for the external table owner (i.e. the role with the OWNERSHIP privilege on the external table), or a higher role,
  or a role that has the USAGE privilege on the database and schema that contain an external table and any privilege on the external
  table.
* The table function cannot retrieve metadata about staged data files until the external table is refreshed (i.e. synched) to include the data files in its metadata.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| JOB_CREATED_TIME | TIMESTAMP_LTZ | Timestamp when the operation occurred |
| FILE_NAME | TEXT | Name of the staged source file and relative path to the file |
| OPERATION_STATUS | TEXT | Status: REGISTERED_NEW, REGISTERED_UPDATE, REGISTER_SKIPPED, REGISTER_FAILED, UNREGISTERED, or UNREGISTER_FAILED |
| MESSAGE | TEXT | Message accompanying the operation status |
| FILE_SIZE | NUMBER | Size of the file (in bytes) added to the external table |
| LAST_MODIFIED | TIMESTAMP_LTZ | Timestamp when the file was last updated in the stage |

## Examples

Retrieve the metadata stored for all data files referenced by the `mytable` external table:

> ```sqlexample
> select *
> from table(information_schema.external_table_file_registration_history(TABLE_NAME=>'MYTABLE'));
> ```

Retrieve the registration events for external table `mydb.public.external_table_name` that started within the last hour:

> ```sqlexample
> select *
>   from table(information_schema.external_table_file_registration_history(
>     start_time=>dateadd('hour',-1,current_timestamp()),
>     table_name=>'mydb.public.external_table_name'));
> ```

Retrieve the registration events for external table `mydb.public.external_table_name` starting at midnight on April 25, 2022:

> ```sqlexample
> select *
>   from table(information_schema.external_table_file_registration_history(
>     start_time=>cast('2022-04-25' as timestamp),
>     table_name=>'mydb.public.external_table_name'));
> ```

---
title: EXTERNAL_TABLE_FILES
source: https://docs.snowflake.com/en/sql-reference/functions/external_table_files.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# EXTERNAL_TABLE_FILES

This table function can be used to query information about the staged data files included in the metadata for a specified [external table](../../user-guide/tables-external-intro.md).

## Syntax

```sqlsyntax
EXTERNAL_TABLE_FILES(
      TABLE_NAME => '<string>' )
```

## Arguments

**Required:**

`TABLE_NAME => 'string'`
:   A string specifying an external table name.

## Usage notes

* Returns results for the external table owner (i.e. the role with the OWNERSHIP privilege on the external table), or a higher role,
  or a role that has the USAGE privilege on the database and schema that contain an external table and any privilege on the external
  table.
* The table function cannot retrieve metadata about staged data files until the external table is refreshed (i.e. synched) to include the data files in its metadata.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_NAME | TEXT | Name of source file and relative path to the staged file |
| REGISTERED_ON | TIMESTAMP_LTZ | Timestamp when the file metadata was added to an external table (i.e. when the external table metadata was refreshed with the file details) |
| FILE_SIZE | NUMBER | Size of the file (in bytes) |
| LAST_MODIFIED | TIMESTAMP_LTZ | Timestamp when the file was last updated in the stage |
| ETAG | HEX | ETag header for the file |
| MD5 | HEX | MD5 checksum for the file |

## Examples

Retrieve the metadata stored for all data files referenced by the `mytable` external table:

> ```sqlexample
> select *
> from table(information_schema.external_table_files(TABLE_NAME=>'MYTABLE'));
> ```

---
title: EXTRACT
source: https://docs.snowflake.com/en/sql-reference/functions/extract.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# EXTRACT

Extracts the specified date or time part from a date, interval, time, or timestamp.

> **Tip:**
>
> To extract the date from a timestamp, use the [TO_DATE](to_date.md) function.

Alternatives:
:   [DATE_PART](date_part.md) , [HOUR / MINUTE / SECOND](hour-minute-second.md) , [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](year.md)

## Syntax

```sqlsyntax
EXTRACT( <date_or_time_part> FROM <date_interval_time_or_timestamp_expr> )
```

```sqlsyntax
EXTRACT( <date_or_time_part> , <date_interval_time_or_timestamp_expr> )
```

## Arguments

`date_or_time_part`
:   The unit of time. Must be one of the values listed in [Supported date and time parts](../functions-date-time.md) (for example, `month`).
    The value can be a string literal or can be unquoted (for example, `'month'` or `month`).

    * When `date_or_time_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md) session parameter.
    * When `date_or_time_part` is `dayofweek` or `yearofweek` (or any of their variations), the output is controlled by the [WEEK_OF_YEAR_POLICY](../parameters.md) and [WEEK_START](../parameters.md) session parameters.

    For more information, including examples, see [Calendar weeks and weekdays](../functions-date-time.md).

`date_interval_time_or_timestamp_expr`
:   A date, an interval, a time, or a timestamp, or an expression that can be evaluated to one of those data types.

## Returns

Returns a value of NUMBER data type.

## Usage notes

* When `date_interval_time_or_timestamp_expr` is a year-month interval value, the supported
  `date_or_time_part` values are `year` and `month`.
* When `date_interval_time_or_timestamp_expr` is a day-time interval value, the supported
  `date_or_time_part` values are `day`, `hour`, `minute`, `second`, and `nanosecond`.
* Currently, when `date_interval_time_or_timestamp_expr` is a DATE value, the following `date_or_time_part`
  values aren’t supported:

  + `epoch_millisecond`
  + `epoch_microsecond`
  + `epoch_nanosecond`

  Other [date and time parts](../functions-date-time.md) (including `epoch_second`) are supported.

## Examples

Specify the `year` part to extract the year from a timestamp:

```sqlexample
SELECT EXTRACT(year FROM TO_TIMESTAMP('2024-04-10T23:39:20.123-07:00')) AS YEAR;
```

```output
+------+
| YEAR |
|------|
| 2024 |
+------+
```

Use EXTRACT with the [DECODE](decode.md) function and the `dayofweek` part to return the full name of the
current day of the week:

```sqlexample
SELECT DECODE(EXTRACT(dayofweek FROM SYSTIMESTAMP()),
  1, 'Monday',
  2, 'Tuesday',
  3, 'Wednesday',
  4, 'Thursday',
  5, 'Friday',
  6, 'Saturday',
  7, 'Sunday') AS DAYOFWEEK;
```

```output
+-----------+
| DAYOFWEEK |
|-----------|
| Thursday  |
+-----------+
```

> **Note:**
>
> The output depends on the value returned by the [SYSTIMESTAMP](systimestamp.md) function when you run the query. Also, you can use the
> [DAYNAME](dayname.md) function to extract the three-letter day-of-week name from the specified date or timestamp.

---
title: EXTRACT_ANSWER (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/extract_answer-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# EXTRACT_ANSWER (SNOWFLAKE.CORTEX)

Extracts an answer to a given question from a text document. The document may be a plain-English document or a string
representation of a semi-structured (JSON) data object.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.EXTRACT_ANSWER(
    <source_document>, <question>)
```

## Arguments

`source_document`
:   A string containing the plain-text or JSON document that contains the answer to the question.

`question`
:   A string containing the question to be answered.

## Returns

A string containing an answer to the given question.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on granting this privilege.

## Example

In this example, `review_content` is a column from the `reviews` table:. To extract an answer from each row
of the table:

```sqlexample
SELECT SNOWFLAKE.CORTEX.EXTRACT_ANSWER(review_content,
    'What dishes does this review mention?')
FROM reviews LIMIT 10;
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: EXTRACT_SEMANTIC_CATEGORIES
source: https://docs.snowflake.com/en/sql-reference/functions/extract_semantic_categories.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# EXTRACT_SEMANTIC_CATEGORIES

> **Note:**
>
> EXTRACT_SEMANTIC_CATEGORIES is a legacy function. Snowflake recommends using other methods of
> implementing [sensitive data classification](../../user-guide/classify-intro.md).

Returns a set of categories (semantic and privacy) for each supported column in the specified table or view. To return the categories for
a column, the column must use a [data type](../../user-guide/classify-intro.md) that supports classification and
does not contain all NULL values.

The categories are derived from the metadata and data contained in the columns, as well as the metadata about the columns and data. The
privacy categories rely on the generated semantic categories, if any.

## Syntax

```sqlsyntax
EXTRACT_SEMANTIC_CATEGORIES( '<object_name>' [ , <max_rows_to_scan> ] )
```

## Arguments

**Required:**

`object_name`
:   The name of the table, external table, view, or materialized view containing the columns to be classified. If a database and
    schema is not in use in the current session, the name must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

**Optional:**

`max_rows_to_scan`
:   The sample size of rows to use for determining the classification categories in the specified table/view.

    Valid values: `1` to `10000`

    Default: `10000`

## Returns

As a representative example, the JSON object has the following structure:

```sqljson
{
  "valid_value_ratio": 1.0,
  "recommendation": {
    "semantic_category": "PASSPORT",
    "privacy_category": "IDENTIFIER",
    "confidence": "HIGH",
    "coverage": 0.7,
    "details": [
      {
        "semantic_category": "US_PASSPORT",
        "coverage": 0.7
      },
      {
        "semantic_category": "CA_PASSPORT",
        "coverage": 0.1
      }
    ]
  },
  "alternates": [
    {
      "semantic_category": "NATIONAL_IDENTIFIER",
      "privacy_category": "IDENTIFIER",
      "confidence": "LOW",
      "coverage": 0.3,
      "details": [
        {
          "semantic_category": "US_SSN",
          "privacy_category": "IDENTIFIER",
          "coverage": 0.3
        }
      ]
    }
  ]
}
```

Where:

`valid_value_ratio`
:   Specifies the ratio of valid values in the sample size. Invalid values include NULL, an empty string, and a string with more than 256
    characters.

`recommendation`
:   Specifies information about each tag and value. This information includes:

    `semantic_category`
    :   Specifies the semantic category tag value.

        For possible tag values, see [Native semantic categories of sensitive data classification](../../user-guide/classify-native.md).

    `privacy_category`
    :   Specifies the privacy category tag value.

        The possible values are `IDENTIFIER`, `QUASI-IDENTIFIER` and `SENSITIVE`.

    `confidence`
    :   Specifies one of the following values: `HIGH`, `MEDIUM`, or `LOW`. This value indicates the relative confidence that Snowflake has based upon the column sampling process and how the column data aligns with how Snowflake classifies data.

    `coverage`
    :   Specifies the percent of sampled cell values that match the rules for a particular category.

    `details`
    :   Specifies fields and values that can specify a geographical tag value for the SEMANTIC_CATEGORY tag.

`alternates`
:   Specifies information about each tag and value to consider other than the recommended tag.

## Usage notes

* The function requires a running warehouse. The warehouse can affect performance and cost.
* This function is no longer being updated to coincide with additional enhancements to
  [Data Classification](../../user-guide/classify-intro.md).

## Examples

Extract the semantic and privacy categories for the `my_db.my_schema.hr_data` table using the default (`10000`) for the number of
rows to scan:

> ```sqlexample
> USE ROLE data_engineer;
>
> USE WAREHOUSE classification_wh;
>
> SELECT EXTRACT_SEMANTIC_CATEGORIES('my_db.my_schema.hr_data');
> ```

Same as the previous example, but limited to scanning only 5000 rows in the table:

> ```sqlexample
> USE ROLE data_engineer;
>
> SELECT EXTRACT_SEMANTIC_CATEGORIES('my_db.my_schema.hr_data', 5000);
> ```

Same as the first example, but stores results in a table:

> ```sqlexample
> USE ROLE data_engineer;
>
> CREATE OR REPLACE TABLE classification_results(v VARIANT) AS
>   SELECT EXTRACT_SEMANTIC_CATEGORIES('my_db.my_schema.hr_data');
> ```
>
> Once the results are stored in a table, you can revise them before using
> [ASSOCIATE_SEMANTIC_CATEGORY_TAGS](../stored-procedures/associate_semantic_category_tags.md) to apply them.

---
title: FACTORIAL
source: https://docs.snowflake.com/en/sql-reference/functions/factorial.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# FACTORIAL

Computes the factorial of its input. The input argument must be an integer expression in the range of `0` to `33`.

## Syntax

```sqlsyntax
FACTORIAL( <integer_expr> )
```

## Examples

```sqlexample
SELECT FACTORIAL(0), FACTORIAL(1), FACTORIAL(5), FACTORIAL(10);

+--------------+--------------+--------------+---------------+
| FACTORIAL(0) | FACTORIAL(1) | FACTORIAL(5) | FACTORIAL(10) |
|--------------+--------------+--------------+---------------|
|            1 |            1 |          120 |       3628800 |
+--------------+--------------+--------------+---------------+
```

---
title: FILTER
source: https://docs.snowflake.com/en/sql-reference/functions/filter.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Higher-order)

# FILTER

Filters an [array](../data-types-semistructured.md) based on the logic in a lambda expression.

See also:
:   [Use lambda functions on data with Snowflake higher-order functions](../../user-guide/querying-semistructured.md)

## Syntax

```sqlsyntax
FILTER( <array> , <lambda_expression> )
```

## Arguments

`array`
:   The array that contains the elements to be filtered. The array can be semi-structured or structured.

`lambda_expression`
:   A [lambda expression](../../user-guide/querying-semistructured.md) that defines the filter
    condition on each array element.

    The lambda expression must have only one argument specified in the following syntax:

    ```sqlsyntax
    <arg> [ <datatype> ] -> <expr>
    ```

## Returns

The return type of this function is an array of the same type as the input array. The returned array contains the elements
for which the filter condition returns TRUE.

If either argument is NULL, the function returns NULL without reporting an error.

## Usage notes

* When the data type for the lambda argument is explicitly specified, the array element is coerced into the specified type
  before lambda invocation. For information about coercion, see [Data type conversion](../data-type-conversion.md).
* If the filter condition evaluates to NULL, the corresponding array element is filtered out.

## Examples

The following examples use the FILTER function.

### Filter for array elements greater than a value

Use the FILTER function to return objects in an array that have a value greater than or equal to 50:

```sqlexample
SELECT FILTER(
  [
    {'name':'Pat', 'value': 50},
    {'name':'Terry', 'value': 75},
    {'name':'Dana', 'value': 25}
  ],
  a -> a:value >= 50
) AS "Filter >= 50";
```

```output
+----------------------+
| Filter >= 50         |
|----------------------|
| [                    |
|   {                  |
|     "name": "Pat",   |
|     "value": 50      |
|   },                 |
|   {                  |
|     "name": "Terry", |
|     "value": 75      |
|   }                  |
| ]                    |
+----------------------+
```

### Filter for array elements that are not NULL

Use the FILTER function to return array elements that are not NULL:

```sqlexample
SELECT FILTER([1, NULL, 3, 5, NULL], a -> a IS NOT NULL) AS "Not NULL Elements";
```

```output
+-------------------+
| Not NULL Elements |
|-------------------|
| [                 |
|   1,              |
|   3,              |
|   5               |
| ]                 |
+-------------------+
```

### Filter for array elements in a table that are greater than or equal to a value

Assume you have a table named `orders` with the columns `order_id`, `order_date`, and `order_detail`. The
`order_detail` column is an array of the line items, their purchase quantity, and subtotal. The table contains
two rows of data. The following SQL statement creates this table and inserts the rows:

```sqlexample
CREATE OR REPLACE TABLE orders AS
  SELECT 1 AS order_id, '2024-01-01' AS order_date, [
    {'item':'UHD Monitor', 'quantity':3, 'subtotal':1500},
    {'item':'Business Printer', 'quantity':1, 'subtotal':1200}
  ] AS order_detail
  UNION
  SELECT 2 AS order_id, '2024-01-02' AS order_date, [
    {'item':'Laptop', 'quantity':5, 'subtotal':7500},
    {'item':'Noise-canceling Headphones', 'quantity':5, 'subtotal':1000}
  ] AS order_detail;

SELECT * FROM orders;
```

```output
+----------+------------+-------------------------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL                              |
|----------+------------+-------------------------------------------|
|        1 | 2024-01-01 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "UHD Monitor",                |
|          |            |     "quantity": 3,                        |
|          |            |     "subtotal": 1500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Business Printer",           |
|          |            |     "quantity": 1,                        |
|          |            |     "subtotal": 1200                      |
|          |            |   }                                       |
|          |            | ]                                         |
|        2 | 2024-01-02 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "Laptop",                     |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 7500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Noise-canceling Headphones", |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 1000                      |
|          |            |   }                                       |
|          |            | ]                                         |
+----------+------------+-------------------------------------------+
```

Use the FILTER function to return orders with subtotals that are greater than or equal to 1500:

```sqlexample
SELECT order_id,
       order_date,
       FILTER(o.order_detail, i -> i:subtotal >= 1500) AS order_detail_gt_equal_1500
  FROM orders o;
```

```output
+----------+------------+----------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL_GT_EQUAL_1500 |
|----------+------------+----------------------------|
|        1 | 2024-01-01 | [                          |
|          |            |   {                        |
|          |            |     "item": "UHD Monitor", |
|          |            |     "quantity": 3,         |
|          |            |     "subtotal": 1500       |
|          |            |   }                        |
|          |            | ]                          |
|        2 | 2024-01-02 | [                          |
|          |            |   {                        |
|          |            |     "item": "Laptop",      |
|          |            |     "quantity": 5,         |
|          |            |     "subtotal": 7500       |
|          |            |   }                        |
|          |            | ]                          |
+----------+------------+----------------------------+
```

### Reference a table column in a lambda expression to filter array elements in table data

Create a table with one column of type ARRAY and another column of type INT:

```sqlexample
CREATE OR REPLACE TABLE filter_column_ref_demo AS
  SELECT [ 10, 15, 20 ] AS col1, 18 AS col2
  UNION
  SELECT [ 30, 50, 70 ] AS col1, 40 AS col2;

SELECT * FROM filter_column_ref_demo;
```

```output
+-------+------+
| COL1  | COL2 |
|-------+------|
| [     |   18 |
|   10, |      |
|   15, |      |
|   20  |      |
| ]     |      |
| [     |   40 |
|   30, |      |
|   50, |      |
|   70  |      |
| ]     |      |
+-------+------+
```

Use the FILTER function to return the values of array element values in each row that are lower
than the value in `col2`:

```sqlexample
SELECT FILTER(col1, v -> v < col2) AS filter_col_ref
  FROM filter_column_ref_demo;
```

```output
+----------------+
| FILTER_COL_REF |
|----------------|
| [              |
|   10,          |
|   15           |
| ]              |
| [              |
|   30           |
| ]              |
+----------------+
```

---
title: FINETUNE ('CANCEL') (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/finetune-cancel.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# FINETUNE ('CANCEL') (SNOWFLAKE.CORTEX)

Cancels the specified fine-tuning job from the current schema.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.FINETUNE(
  'CANCEL',
  '<finetune_job_id>'
)
```

## Parameters

`'CANCEL'`
:   Specifies that you want to cancel a fine-tuning job.

`finetune_job_id`
:   The ID of the fine-tuning job that was generated when you created the job.

## Output

| Column | Type | Description |
| --- | --- | --- |
| SNOWFLAKE.CORTEX.FINETUNE | [STRING](../data-types-text.md) | Message that the job was canceled. |

## Access control requirements

For access requirements, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-finetuning.md).

## Examples

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CANCEL',
  'ft_194bbea4-1208-42f3-88c6-cfb202086125'
);
```

```output
Canceled Cortex Fine-tuning job: ft_194bbea4-1208-42f3-88c6-cfb202086125
```

---
title: FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/finetune-create.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)

Creates a fine-tuning job. The tuned model is saved to the model registry of the schema.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  '<name>',
  '<base_model>',
  '<training_data_query>'
  [
    , '<validation_data_query>'
    [, '<options>' ]
  ]
)
```

## Required parameters

`'CREATE'`
:   Specifies that you want to create a fine-tuning job.

`'name'`
:   The identifier of the fine-tuned model that is saved to the model registry. This must be unique to the model registry it is saved to. If
    more than one model attempts to save using the same name, a suffix is appended to the name of the latter one to make it unique.

    Letters, underscores, decimal digits (0-9) are allowed in the identifier.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`'base_model'`
:   A string specifying the base model to fine-tune. This must be one of the following values:

    * `'llama3-8b'`
    * `'llama3-70b'`
    * `'llama3.1-8b'`
    * `'llama3.1-70b'`
    * `'mistral-7b'`
    * `'mixtral-8x7b'`

    For more information see [Models available to fine-tune](../../user-guide/snowflake-cortex/cortex-finetuning.md).

`'training_data_query'`
:   The SQL query to get the training data. The result must include `prompt` and `completion` columns.

## Optional parameters

`'validation_data_query'`
:   The SQL query to get the validation data. The result must include `prompt` and `completion` columns.
    If a query for validation data is not specified, your training data is automatically split into training and validation data.

`'options'`
:   A string representation of a JSON object containing zero or more of the following options that affect the training
    hyperparameters. For example: `'{"max_epochs": 3}'`

    * `max_epochs`: A value from 1 to 10 (inclusive) that controls the number of epochs to train the model for.

      Default: automatically determined by the system

## Returns

| Column | Type | Description |
| --- | --- | --- |
| FINETUNE | [STRING](../data-types-text.md) | When the tuning job is created, a generated unique job ID is returned. |

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | DATABASE | The database that the training (and validation) data are queried from. |
| CREATE MODEL or OWNERSHIP | SCHEMA | The schema that the model is saved to. |

## Examples

Example with validation data:

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  'my_tuned_model',
  'mistral-7b',
  'SELECT prompt, completion FROM train',
  'SELECT prompt, completion FROM validation'
);
```

Example without validation data:

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'CREATE',
  'my_tuned_model',
  'mistral-7b',
  'SELECT prompt, completion FROM train'
);
```

The output is the job ID of the fine-tuning job, such as:

```output
ft_6556e15c-8f12-4d94-8cb0-87e6f2fd2299
```

---
title: FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/finetune-describe.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)

Describes the properties of a fine-tuning job. If the job completes successfully, additional details about the job are returned, including
the final model name. Use this name when using the [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md) function to make
an inference on your fine-tuned model.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.FINETUNE(
  'DESCRIBE',
  '<finetune_job_id>'
)
```

## Parameters

`'DESCRIBE'`
:   Specifies that you want to get the properties of the provided fine-tuning job.

`finetune_job_id`
:   The ID of the fine-tuning job that was generated when you created the job.

## Output

| Column | Type | Description |
| --- | --- | --- |
| SNOWFLAKE.CORTEX.FINETUNE | [OBJECT](../data-types-semistructured.md) | An object containing the job status, progress and the tuning job ID. If the job status is `SUCCESS`, additional job information is returned.  `id`  Unique ID for the tuning job.  `status`  The status is one of the following:   * PENDING * IN_PROGRESS * SUCCESS * ERROR * CANCELLED  `progress`  A number between zero and one that indicates the percentage of the job completed with 1.0 being 100%.  `error`  If the job has a status of `ERROR`, an object that contains the error message.  `base_model`  The name of the base model used for the fine-tuning job.  `created_on`  The timestamp of when the job was created.  `finished_on`  If the job has a status of `SUCCESS`, the timestamp of when the job finished.  `model`  If the job has a status of `SUCCESS`, the fine-tuned model name. Use this name when calling the [COMPLETE](complete-snowflake-cortex.md) function for inference.  `training_data`  The query used to retrieve the training data.  `trained_tokens`  If the job has a status of `SUCCESS`, the number of tokens used for training. This is calculated by the following formula:  ```none trained tokens = number of input tokens  * number of epochs trained ```  `training_result`  If the job has a status of `SUCCESS`, the training result of the fine-tuning job.  `validation_data`  The query used to retrieve the validation data.  `options`  An [object](../data-types-semistructured.md) containing zero or more of options that affect the training hyperparameters. |

## Access control requirements

For access requirements, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-finetuning.md).

## Examples

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE(
  'DESCRIBE',
  'ft_6556e15c-8f12-4d94-8cb0-87e6f2fd2299'
);
```

An example output for a successful job:

```output
{
  "base_model":"mistral-7b",
  "created_on":1717004388348,
  "finished_on":1717004691577,
  "id":"ft_6556e15c-8f12-4d94-8cb0-87e6f2fd2299",
  "model":"mydb.myschema.my_tuned_model",
  "progress":1.0,
  "status":"SUCCESS",
  "training_data":"SELECT prompt, completion FROM train",
  "trained_tokens":2670734,
  "training_result":{"validation_loss":1.0138969421386719,"training_loss":0.6477728401547047},
  "validation_data":"SELECT prompt, completion FROM validation",
  "options":{"max_epochs":3}
}
```

---
title: FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/finetune-show.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)

Lists all the fine-tuning jobs in the current account.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.FINETUNE('SHOW')
```

## Parameters

`'SHOW'`
:   Specifies that you want a list of the fine-tuning jobs in the current account.

## Output

A list of all the fine-tuning jobs in the current account.

| Column | Type | Description |
| --- | --- | --- |
| SNOWFLAKE.CORTEX.FINETUNE(‘SHOW’) | [ARRAY](../data-types-semistructured.md) | An array of objects containing the job ID and the job status.  The status is one of the following:  * PENDING * IN_PROGRESS * SUCCESS * ERROR * CANCELLED |

## Access control requirements

For access requirements, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-finetuning.md).

## Usage notes

* The returned fine-tuning jobs are not permanent and may be garbage collected periodically.

## Examples

```sqlexample
SELECT SNOWFLAKE.CORTEX.FINETUNE('SHOW');
```

```output
[{"id":"ft_9544250a-20a9-42b3-babe-74f0a6f88f60","status":"SUCCESS","base_model":"llama3.1-8b","created_on":1730835118114},
{"id":"ft_354cf617-2fd1-4ffa-a3f9-190633f42a25","status":"ERROR","base_model":"llama3.1-8b","created_on":1730834536632}]
```

---
title: FINETUNE (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/finetune-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# FINETUNE (SNOWFLAKE.CORTEX)

This function lets you create and manage large language models customized for your specific task.

## Syntax

```sqlsyntax
FINETUNE (
  { 'CREATE' | 'SHOW' | 'DESCRIBE' | 'CANCEL' }
  ...
  )
```

The syntax varies considerably between the different commands. For specific syntax, usage notes, and examples, see:

* [FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)](finetune-create.md)
* [FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)](finetune-describe.md)
* [FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)](finetune-show.md)
* [FINETUNE ('CANCEL') (SNOWFLAKE.CORTEX)](finetune-cancel.md)

## Access control requirements

For access requirements, see [Access control requirements](../../user-guide/snowflake-cortex/cortex-finetuning.md).

---
title: FIRST_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/first_value.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# FIRST_VALUE

Returns the first value within an ordered group of values.

See also:
:   [LAST_VALUE](last_value.md) , [NTH_VALUE](nth_value.md)

## Syntax

```sqlsyntax
FIRST_VALUE( <expr> ) [ { IGNORE | RESPECT } NULLS ]
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2>  [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr`
:   The expression that determines the return value.

`expr1`
:   The expression by which to partition the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    PARTITION BY column_1, column_2
    ```

`expr2`
:   The expression by which to order the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    ORDER BY column_3, column_4
    ```

`{ IGNORE | RESPECT } NULLS`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `IGNORE NULLS` returns the first non-NULL value.
    * `RESPECT NULLS` returns a NULL value if it is the first value in the expression.

    Default: `RESPECT NULLS`

## Usage notes

* This function is a rank-related function, so it must specify a window. A window clause consists of the following subclauses:

  > + `PARTITION BY expr1` subclause (optional).
  > + `ORDER BY expr2` subclause (required). For details about additional supported ordering options (sort order, ordering
  >   of NULL values, and so on), see the documentation for the [ORDER BY](../constructs/order-by.md) clause, which follows
  >   the same rules.
  > + `window_frame` subclause (optional).
* The order of rows in a window (and thus the result of the query) is fully deterministic only if the keys in the ORDER BY clause
  make each row unique. Consider the following example:

  ```sqlexample
  ... OVER (PARTITION BY p ORDER BY o COLLATE 'lower') ...
  ```

  The query result can vary if any partition contains values of column `o` that are identical, or would be identical
  in a case-insensitive comparison.
* The ORDER BY clause inside the OVER clause controls the order of rows only within the window, not the order of rows in the output
  of the entire query. To control output order, use a separate ORDER BY clause at the outermost level of the query.

* The optional `window_frame` specifies the subset of rows within the window for which the function is calculated. If no `window_frame` is specified, the default is the entire window:

  > `ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING`

  Note that this deviates from the ANSI standard, which specifies the following default for window frames:

  > `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Examples

This example shows a query that uses the FIRST_VALUE function to find the cheapest menu item
in each category. The query contains two ORDER BY clauses: one to control the order of rows
in each partition, and one to sort the output of the full query. To create and load the table that is used in this example, see [Create and load the menu_items table](stddev.md).

```sqlexample
SELECT menu_category, menu_item_name, menu_price_usd,
       FIRST_VALUE(menu_item_name) OVER (PARTITION BY menu_category ORDER BY menu_price_usd) AS cheapest_item
  FROM menu_items
  WHERE menu_category IN ('Beverage', 'Dessert', 'Snack')
  ORDER BY menu_category, menu_price_usd
  LIMIT 12;
```

```output
+---------------+--------------------+----------------+---------------+
| MENU_CATEGORY | MENU_ITEM_NAME     | MENU_PRICE_USD | CHEAPEST_ITEM |
|---------------+--------------------+----------------+---------------|
| Beverage      | Bottled Water      |           2.00 | Bottled Water |
| Beverage      | Iced Tea           |           3.00 | Bottled Water |
| Beverage      | Bottled Soda       |           3.00 | Bottled Water |
| Beverage      | Lemonade           |           3.50 | Bottled Water |
| Dessert       | Popsicle           |           3.00 | Popsicle      |
| Dessert       | Ice Cream Sandwich |           4.00 | Popsicle      |
| Dessert       | Mango Sticky Rice  |           5.00 | Popsicle      |
| Dessert       | Sugar Cone         |           6.00 | Popsicle      |
| Dessert       | Waffle Cone        |           6.00 | Popsicle      |
| Dessert       | Two Scoop Bowl     |           7.00 | Popsicle      |
| Snack         | Spring Mix Salad   |           6.00 | Fried Pickles |
| Snack         | Fried Pickles      |           6.00 | Fried Pickles |
+---------------+--------------------+----------------+---------------+
```

The following example also uses the `menu_items` table to compare three related functions: FIRST_VALUE,
[NTH_VALUE](nth_value.md), and [LAST_VALUE](last_value.md):

* The query creates a sliding window frame that is three rows wide, which contains:

  + The row that precedes the current row.
  + The current row.
  + The row that follows the current row.
* The `2` in the call `NTH_VALUE(menu_price_usd, 2)` specifies the second row in the window frame
  (which, in this case, is also the current row).
* When the current row is the very first row in the window frame, there is no preceding row to reference, so
  FIRST_VALUE returns a NULL for that row.
* Frame boundaries sometimes extend beyond the rows in a partition, but non-existent rows are not included in window function
  calculations. For example, when the current row is the very first row in the partition and the window frame is
  `ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING`, there is no preceding row to reference, so the FIRST_VALUE function returns the
  value of the first row in the partition.
* The results never match for all three functions, given the data in the table. These functions select the *first*,
  *last*, or *nth* value for each row in the frame, and the selection of values applies separately to each partition.

```sqlexample
SELECT menu_category, menu_item_name, menu_price_usd,
       FIRST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS first_val,
       NTH_VALUE(menu_price_usd, 2) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS nth_val,
       LAST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS last_val
  FROM menu_items
  WHERE menu_category = 'Dessert'
  ORDER BY menu_price_usd;
```

```output
+---------------+--------------------+----------------+-----------+---------+----------+
| MENU_CATEGORY | MENU_ITEM_NAME     | MENU_PRICE_USD | FIRST_VAL | NTH_VAL | LAST_VAL |
|---------------+--------------------+----------------+-----------+---------+----------|
| Dessert       | Popsicle           |           3.00 |      3.00 |    4.00 |     4.00 |
| Dessert       | Ice Cream Sandwich |           4.00 |      3.00 |    4.00 |     5.00 |
| Dessert       | Mango Sticky Rice  |           5.00 |      4.00 |    5.00 |     6.00 |
| Dessert       | Sugar Cone         |           6.00 |      6.00 |    6.00 |     7.00 |
| Dessert       | Waffle Cone        |           6.00 |      5.00 |    6.00 |     6.00 |
| Dessert       | Two Scoop Bowl     |           7.00 |      6.00 |    7.00 |     7.00 |
+---------------+--------------------+----------------+-----------+---------+----------+
```

This example demonstrates the difference between IGNORE NULLS and RESPECT NULLS. The sample
data includes rows where the cost value is NULL. With the default RESPECT NULLS behavior, if
the first row in the ordered partition has a NULL value, FIRST_VALUE returns NULL. With IGNORE
NULLS, FIRST_VALUE skips NULL values and returns the first non-NULL value.

```sqlexample
SELECT item_name, item_cost, item_price,
       FIRST_VALUE(item_cost) RESPECT NULLS
         OVER (ORDER BY item_price) AS first_cost_respect,
       FIRST_VALUE(item_cost) IGNORE NULLS
         OVER (ORDER BY item_price) AS first_cost_ignore
  FROM VALUES
    ('Pretzel', NULL, 3.00),
    ('Corn Dog', NULL, 4.00),
    ('Hot Dog', 1.50, 5.00),
    ('Sandwich', 2.50, 6.00)
  AS menu(item_name, item_cost, item_price)
  ORDER BY item_price;
```

```output
+-----------+-----------+------------+--------------------+-------------------+
| ITEM_NAME | ITEM_COST | ITEM_PRICE | FIRST_COST_RESPECT | FIRST_COST_IGNORE |
|-----------+-----------+------------+--------------------+-------------------|
| Pretzel   |      NULL |       3.00 |               NULL |              1.50 |
| Corn Dog  |      NULL |       4.00 |               NULL |              1.50 |
| Hot Dog   |      1.50 |       5.00 |               NULL |              1.50 |
| Sandwich  |      2.50 |       6.00 |               NULL |              1.50 |
+-----------+-----------+------------+--------------------+-------------------+
```

---
title: FL_GET_CONTENT_TYPE
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_content_type.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_CONTENT_TYPE

Returns the content type (also known as the MIME type) of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_CONTENT_TYPE( <file_expression> )

FL_GET_CONTENT_TYPE( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

Aa VARCHAR value with the MIME type of the file, for example `'image/png'` for a PNG image file.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table
    SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_CONTENT_TYPE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_CONTENT_TYPE(F) |
|------------------------|
| image/png              |
+------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table
  SELECT object_construct('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.jpg', 'ETAG', '<ETAG value>',
      'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_CONTENT_TYPE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_CONTENT_TYPE(F) |
|------------------------|
| image/jpg              |
+------------------------+
```

---
title: FL_GET_ETAG
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_etag.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_ETAG

Returns the content hash (ETAG) of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_ETAG( <file_expression> )

FL_GET_ETAG( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A VARCHAR value with the ETAG of the file.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_ETAG(f) FROM file_table;
```

```output
+-----------------------------------+
| FL_GET_ETAG(F)                    |
|-----------------------------------|
| <ETAG value>                      |
+-----------------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.jpg', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_ETAG(f) FROM file_table;
```

```output
+-----------------------------------+
| FL_GET_ETAG(F)                    |
|-----------------------------------|
| <ETAG value>                      |
+-----------------------------------+
```

---
title: FL_GET_FILE_TYPE
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_file_type.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_FILE_TYPE

Returns the file type (modality) of a [FILE](../data-types-unstructured.md). This is a more general classification than
the content type (see [FL_GET_CONTENT_TYPE](fl_get_content_type.md)).

## Syntax

Use one of the following:

```
FL_GET_FILE_TYPE( <file_expression> )

FL_GET_FILE_TYPE( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

One of following values as a VARCHAR:

* `document`
* `video`
* `audio`
* `image`
* `compressed`
* `unknown`

> **Tip:**
>
> To test if a file is of a particular type, use one of the `FL_IS` functions:
>
> * [FL_IS_AUDIO](fl_is_audio.md)
> * [FL_IS_COMPRESSED](fl_is_compressed.md)
> * [FL_IS_DOCUMENT](fl_is_document.md)
> * [FL_IS_IMAGE](fl_is_image.md)
> * [FL_IS_VIDEO](fl_is_video.md)

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE FILE_TABLE(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_FILE_TYPE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_FILE_TYPE(F)    |
|------------------------|
| image                  |
+------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'document.pdf', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'application/pdf');

SELECT FL_GET_FILE_TYPE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_FILE_TYPE(F)    |
|------------------------|
| document               |
+------------------------+
```

---
title: FL_GET_LAST_MODIFIED
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_last_modified.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_LAST_MODIFIED

Returns the last modified date of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_LAST_MODIFIED( <file_expression> )

FL_GET_LAST_MODIFIED( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A TIMESTAMP value with the date the file was last modified.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_LAST_MODIFIED(f) FROM file_table;
```

```output
+-------------------------------+
| FL_GET_LAST_MODIFIED(F)       |
|-------------------------------|
| Wed, 11 Dec 2024 20:24:00 GMT |
+-------------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.jpg', 'ETAG', '<ETAG value>',
    'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_LAST_MODIFIED(f) FROM file_table;
```

```output
+-------------------------------+
| FL_GET_LAST_MODIFIED(F)       |
|-------------------------------|
| Wed, 11 Dec 2024 20:24:00 GMT |
+-------------------------------+
```

---
title: FL_GET_RELATIVE_PATH
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_relative_path.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_RELATIVE_PATH

Returns the relative path of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_RELATIVE_PATH( <file_expression> )

FL_GET_RELATIVE_PATH( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

The relative path of the file within its stage as a VARCHAR.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_RELATIVE_PATH(f) FROM file_table;
```

```output
+-------------------------+
| FL_GET_RELATIVE_PATH(F) |
|-------------------------|
| image.png               |
+-------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.jpg', 'ETAG', '<ETAG value>',
    'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_RELATIVE_PATH(f) FROM file_table;
```

```output
+-------------------------+
| FL_GET_RELATIVE_PATH(F) |
|-------------------------|
| image.png               |
+-------------------------+
```

---
title: FL_GET_SCOPED_FILE_URL
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_scoped_file_url.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_SCOPED_FILE_URL

Returns the scoped URL of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_SCOPED_FILE_URL( <file_expression> )

FL_GET_SCOPED_FILE_URL( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

The scoped URL of the file as a VARCHAR.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_SCOPED_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_SCOPED_FILE_URL(f) FROM file_table;
```

```output
+--------------------------------------------------------------------------------------------------------------------+
| FL_GET_SCOPED_FILE_URL(F)                                                                                          |
|--------------------------------------------------------------------------------------------------------------------|
| https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV  |
+--------------------------------------------------------------------------------------------------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('SCOPED_FILE_URL', 'https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV',
  'ETAG', '<ETAG value>', 'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_SCOPED_FILE_URL(f) FROM file_table;
```

```output
+--------------------------------------------------------------------------------------------------------------------+
| FL_GET_SCOPED_FILE_URL(F)                                                                                          |
|--------------------------------------------------------------------------------------------------------------------|
| https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV  |
+--------------------------------------------------------------------------------------------------------------------+
```

---
title: FL_GET_SIZE
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_size.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_SIZE

Returns the size, in bytes, of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_SIZE( <file_expression> )

FL_GET_SIZE( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

The size of the file in bytes as an INTEGER.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_SIZE(f) FROM file_table;
```

```output
+-------------------+
| FL_GET_SIZE(F)    |
|-------------------|
| 105859            |
+-------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'document.pdf', 'ETAG', '<ETAG value>', 'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'application/pdf');

SELECT FL_GET_SIZE(f) FROM file_table;
```

```output
+-------------------+
| FL_GET_SIZE(F)    |
|-------------------|
| 105859            |
+-------------------+
```

---
title: FL_GET_STAGE
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_stage.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_STAGE

Returns the stage name of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_STAGE( <file_expression> )

FL_GET_STAGE( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

The stage of the file as a VARCHAR.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table select TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_STAGE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_STAGE(F)        |
|------------------------|
| MYSTAGE                |
+------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.jpg', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_STAGE(f) FROM file_table;
```

```output
+------------------------+
| FL_GET_STAGE(F)        |
|------------------------|
| MYSTAGE                |
+------------------------+
```

---
title: FL_GET_STAGE_FILE_URL
source: https://docs.snowflake.com/en/sql-reference/functions/fl_get_stage_file_url.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_GET_STAGE_FILE_URL

Returns the stage URL of a [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_GET_STAGE_FILE_URL( <file_expression> )

FL_GET_STAGE_FILE_URL( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

The URL of the file as a VARCHAR.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_GET_STAGE_FILE_URL(f) FROM file_table;
```

```output
+-------------------------------------------------------------------------------------------+
| FL_GET_STAGE_FILE_URL(F)                                                                  |
|-------------------------------------------------------------------------------------------|
| https://snowflake.account.snowflakecomputing.com/api/files/TEST/PUBLIC/MYSTAGE/image.png  |
+-------------------------------------------------------------------------------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE_FILE_URL', 'https://snowflake.account.snowflakecomputing.com/api/files/TEST/PUBLIC/MYSTAGE/image.png',
  'ETAG', '<ETAG value>', 'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/jpg');

SELECT FL_GET_STAGE_FILE_URL(f) FROM file_table;
```

```output
+-------------------------------------------------------------------------------------------+
| FL_GET_STAGE_FILE_URL(F)                                                                  |
|-------------------------------------------------------------------------------------------|
| https://snowflake.account.snowflakecomputing.com/api/files/TEST/PUBLIC/MYSTAGE/image.png  |
+-------------------------------------------------------------------------------------------+
```

---
title: FL_IS_AUDIO
source: https://docs.snowflake.com/en/sql-reference/functions/fl_is_audio.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_IS_AUDIO

Checks if the input is an audio [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_IS_AUDIO( <file_expression> )

FL_IS_AUDIO( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A BOOLEAN indicating whether the file is an audio file.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
insert into file_table select to_file(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_IS_AUDIO(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_AUDIO(F)    |
|-------------------|
| False             |
+-------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);

INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'music.mpeg', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'audio/mpeg');

SELECT FL_IS_AUDIO(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_AUDIO(F)    |
|-------------------|
| True              |
+-------------------+
```

---
title: FL_IS_COMPRESSED
source: https://docs.snowflake.com/en/sql-reference/functions/fl_is_compressed.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_IS_COMPRESSED

Checks if the input is a compressed [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_IS_COMPRESSED( <file_expression> )

FL_IS_COMPRESSED( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A BOOLEAN indicating whether the file is compressed.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_IS_COMPRESSED(f) FROM file_table;
```

```output
+---------------------+
| FL_IS_COMPRESSED(F) |
|---------------------|
| False               |
+---------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'document.pdf.gz', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'application/gzip');

SELECT FL_IS_COMPRESSED(f) FROM file_table;
```

```output
+---------------------+
| FL_IS_COMPRESSED(F) |
|---------------------|
| True                |
+---------------------+
```

---
title: FL_IS_DOCUMENT
source: https://docs.snowflake.com/en/sql-reference/functions/fl_is_document.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_IS_DOCUMENT

Checks if the input is a document [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_IS_DOCUMENT( <file_expression> )

FL_IS_DOCUMENT( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A BOOLEAN indicating whether the file is a document.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_IS_DOCUMENT(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_DOCUMENT(F) |
|-------------------|
| False             |
+-------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'document.pdf', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'application/pdf');

SELECT FL_IS_DOCUMENT(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_DOCUMENT(F) |
|-------------------|
| True              |
+-------------------+
```

---
title: FL_IS_IMAGE
source: https://docs.snowflake.com/en/sql-reference/functions/fl_is_image.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_IS_IMAGE

Checks if the input is an image [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_IS_IMAGE( <file_expression> )

FL_IS_IMAGE( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A BOOLEAN indicating whether the file is an image.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_IS_IMAGE(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_IMAGE(F)    |
|-------------------|
| True              |
+-------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'document.pdf', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'application/pdf');

SELECT FL_IS_IMAGE(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_IMAGE(F)    |
|-------------------|
| False             |
+-------------------+
```

---
title: FL_IS_VIDEO
source: https://docs.snowflake.com/en/sql-reference/functions/fl_is_video.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# FL_IS_VIDEO

Checks if the input is a video [FILE](../data-types-unstructured.md).

## Syntax

Use one of the following:

```
FL_IS_VIDEO( <file_expression> )

FL_IS_VIDEO( <variant_expression> )
```

## Arguments

`file_expression`
:   The argument must be an expression of type FILE.

`variant_expression`
:   The argument must be an OBJECT representing a FILE.

## Returns

A BOOLEAN indicating whether the file is a video.

## Examples

Example using an input FILE:

```sqlexample
CREATE TABLE file_table(f FILE);
INSERT INTO file_table SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));

SELECT FL_IS_VIDEO(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_VIDEO(F)    |
|-------------------|
| False             |
+-------------------+
```

Example using an input OBJECT:

```sqlexample
CREATE TABLE file_table(f OBJECT);
INSERT INTO file_table SELECT OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'movie.mp4', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'FILE_TYPE', 'video/mp4');

SELECT FL_IS_VIDEO(f) FROM file_table;
```

```output
+-------------------+
| FL_IS_VIDEO(F)    |
|-------------------|
| True              |
+-------------------+
```

---
title: FLATTEN
source: https://docs.snowflake.com/en/sql-reference/functions/flatten.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# FLATTEN

Flattens (explodes) compound values into multiple rows.

FLATTEN is a table function that takes a VARIANT, OBJECT, or ARRAY column and produces a lateral view — an inline view that contains
correlations to other tables that precede it in the [FROM](../constructs/from.md) clause.

FLATTEN can be used to convert semi-structured data to a relational representation.

## Syntax

```sqlsyntax
FLATTEN( INPUT => <expr> [ , PATH => <constant_expr> ]
                         [ , OUTER => TRUE | FALSE ]
                         [ , RECURSIVE => TRUE | FALSE ]
                         [ , MODE => 'OBJECT' | 'ARRAY' | 'BOTH' ] )
```

## Arguments

**Required:**

`INPUT => expr`
:   The expression that will be flattened into rows. The expression must be of data type VARIANT, OBJECT, or ARRAY.

**Optional:**

`PATH => constant_expr`
:   The path to the element within a VARIANT data structure that needs to be flattened. Can be a zero-length string (that is, an empty path) if the
    outermost element is to be flattened.

    Default: Zero-length string (empty path)

`OUTER => TRUE | FALSE`
:   * If `FALSE`, any input rows that can’t be expanded, either because they can’t be accessed in the path or because they have zero fields or entries, are completely omitted from the output.
    * If `TRUE`, exactly one row is generated for zero-row expansions (with NULL in the KEY, INDEX, and VALUE columns).

    Default: `FALSE`

    > **Note:**
    >
    > A zero-row expansion of an empty compound displays NULL in the `THIS` output column, distinguishing it from an attempt to expand a non-existing or wrong kind of compound.

`RECURSIVE => TRUE | FALSE`
:   * If `FALSE`, only the element referenced by `PATH` is expanded.
    * If `TRUE`, the expansion is performed for all sub-elements recursively.

    Default: `FALSE`

`MODE => 'OBJECT' | 'ARRAY' | 'BOTH'`
:   Specifies whether only objects, arrays, or both should be flattened.

    Default: `BOTH`

## Output

The returned rows consist of a fixed set of columns:

```output
+-----+------+------+-------+-------+------+
| SEQ |  KEY | PATH | INDEX | VALUE | THIS |
|-----+------+------+-------+-------+------|
```

SEQ:
:   A unique sequence number associated with the input record; the sequence is not guaranteed to be gap-free or ordered in any particular way.

KEY:
:   For maps or objects, this column contains the key to the exploded value.

PATH:
:   The path to the element within a data structure that needs to be flattened.

INDEX:
:   The index of the element, if it is an array; otherwise NULL.

VALUE:
:   The value of the element of the flattened array/object.

THIS:
:   The element being flattened (useful in recursive flattening).

> **Note:**
>
> The columns of the original (correlated) table that was used as the source of data for FLATTEN are also accessible. If a single row from the original table resulted in multiple rows in the flattened view, the values in this input row are replicated to match the number of rows produced by FLATTEN.

## Usage notes

* For single-level arrays, `TABLE(FLATTEN(...))` and `LATERAL FLATTEN(...)` produce the same
  result. For nested data structures where you need to chain multiple FLATTEN calls, use
  [LATERAL](../constructs/join-lateral.md) so that each subsequent FLATTEN can
  reference the output of the previous one.
* For information about using this function with [structured types](../data-types-structured.md), see
  [Using the FLATTEN function with values of structured types](../data-types-structured.md).

## Examples

See also [Example: Using a lateral join with the FLATTEN table function](../../user-guide/lateral-join-using.md) and [Using FLATTEN to Filter the Results in a WHERE Clause](../../user-guide/querying-semistructured.md).

The following simple example flattens one record (note that the middle element of the array is missing):

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('[1, ,77]'))) f;
```

```output
+-----+------+------+-------+-------+------+
| SEQ |  KEY | PATH | INDEX | VALUE | THIS |
|-----+------+------+-------+-------+------|
|   1 | NULL | [0]  |     0 |     1 | [    |
|     |      |      |       |       |   1, |
|     |      |      |       |       |   ,  |
|     |      |      |       |       |   77 |
|     |      |      |       |       | ]    |
|   1 | NULL | [2]  |     2 |    77 | [    |
|     |      |      |       |       |   1, |
|     |      |      |       |       |   ,  |
|     |      |      |       |       |   77 |
|     |      |      |       |       | ]    |
+-----+------+------+-------+-------+------+
```

The next two queries show the effect of the PATH parameter:

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('{"a":1, "b":[77,88]}'), OUTER => TRUE)) f;
```

```output
+-----+-----+------+-------+-------+-----------+
| SEQ | KEY | PATH | INDEX | VALUE | THIS      |
|-----+-----+------+-------+-------+-----------|
|     |     |      |       |       |   "a": 1, |
|     |     |      |       |       |   "b": [  |
|     |     |      |       |       |     77,   |
|     |     |      |       |       |     88    |
|     |     |      |       |       |   ]       |
|     |     |      |       |       | }         |
|   1 | b   | b    |  NULL | [     | {         |
|     |     |      |       |   77, |   "a": 1, |
|     |     |      |       |   88  |   "b": [  |
|     |     |      |       | ]     |     77,   |
|     |     |      |       |       |     88    |
|     |     |      |       |       |   ]       |
|     |     |      |       |       | }         |
+-----+-----+------+-------+-------+-----------+
```

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('{"a":1, "b":[77,88]}'), PATH => 'b')) f;
```

```output
+-----+------+------+-------+-------+-------+
| SEQ |  KEY | PATH | INDEX | VALUE | THIS  |
|-----+------+------+-------+-------+-------|
|   1 | NULL | b[0] |     0 |    77 | [     |
|     |      |      |       |       |   77, |
|     |      |      |       |       |   88  |
|     |      |      |       |       | ]     |
|   1 | NULL | b[1] |     1 |    88 | [     |
|     |      |      |       |       |   77, |
|     |      |      |       |       |   88  |
|     |      |      |       |       | ]     |
+-----+------+------+-------+-------+-------+
```

The next two queries show the effect of the OUTER parameter:

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('[]'))) f;
```

```output
+-----+-----+------+-------+-------+------+
| SEQ | KEY | PATH | INDEX | VALUE | THIS |
|-----+-----+------+-------+-------+------|
+-----+-----+------+-------+-------+------+
```

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('[]'), OUTER => TRUE)) f;
```

```output
+-----+------+------+-------+-------+------+
| SEQ |  KEY | PATH | INDEX | VALUE | THIS |
|-----+------+------+-------+-------+------|
|   1 | NULL |      |  NULL |  NULL | []   |
+-----+------+------+-------+-------+------+
```

The next two queries show the effect of the RECURSIVE parameter:

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('{"a":1, "b":[77,88], "c": {"d":"X"}}'))) f;
```

```output
+-----+-----+------+-------+------------+--------------+
| SEQ | KEY | PATH | INDEX | VALUE      | THIS         |
|-----+-----+------+-------+------------+--------------|
|   1 | a   | a    |  NULL | 1          | {            |
|     |     |      |       |            |   "a": 1,    |
|     |     |      |       |            |   "b": [     |
|     |     |      |       |            |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
|   1 | b   | b    |  NULL | [          | {            |
|     |     |      |       |   77,      |   "a": 1,    |
|     |     |      |       |   88       |   "b": [     |
|     |     |      |       | ]          |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
|   1 | c   | c    |  NULL | {          | {            |
|     |     |      |       |   "d": "X" |   "a": 1,    |
|     |     |      |       | }          |   "b": [     |
|     |     |      |       |            |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
+-----+-----+------+-------+------------+--------------+
```

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('{"a":1, "b":[77,88], "c": {"d":"X"}}'),
                            RECURSIVE => TRUE )) f;
```

```output
+-----+------+------+-------+------------+--------------+
| SEQ | KEY  | PATH | INDEX | VALUE      | THIS         |
|-----+------+------+-------+------------+--------------|
|   1 | a    | a    |  NULL | 1          | {            |
|     |      |      |       |            |   "a": 1,    |
|     |      |      |       |            |   "b": [     |
|     |      |      |       |            |     77,      |
|     |      |      |       |            |     88       |
|     |      |      |       |            |   ],         |
|     |      |      |       |            |   "c": {     |
|     |      |      |       |            |     "d": "X" |
|     |      |      |       |            |   }          |
|     |      |      |       |            | }            |
|   1 | b    | b    |  NULL | [          | {            |
|     |      |      |       |   77,      |   "a": 1,    |
|     |      |      |       |   88       |   "b": [     |
|     |      |      |       | ]          |     77,      |
|     |      |      |       |            |     88       |
|     |      |      |       |            |   ],         |
|     |      |      |       |            |   "c": {     |
|     |      |      |       |            |     "d": "X" |
|     |      |      |       |            |   }          |
|     |      |      |       |            | }            |
|   1 | NULL | b[0] |     0 | 77         | [            |
|     |      |      |       |            |   77,        |
|     |      |      |       |            |   88         |
|     |      |      |       |            | ]            |
|   1 | NULL | b[1] |     1 | 88         | [            |
|     |      |      |       |            |   77,        |
|     |      |      |       |            |   88         |
|     |      |      |       |            | ]            |
|   1 | c    | c    |  NULL | {          | {            |
|     |      |      |       |   "d": "X" |   "a": 1,    |
|     |      |      |       | }          |   "b": [     |
|     |      |      |       |            |     77,      |
|     |      |      |       |            |     88       |
|     |      |      |       |            |   ],         |
|     |      |      |       |            |   "c": {     |
|     |      |      |       |            |     "d": "X" |
|     |      |      |       |            |   }          |
|     |      |      |       |            | }            |
|   1 | d    | c.d  |  NULL | "X"        | {            |
|     |      |      |       |            |   "d": "X"   |
|     |      |      |       |            | }            |
+-----+------+------+-------+------------+--------------+
```

The following example shows the effect of the MODE parameter:

```sqlexample
SELECT * FROM TABLE(FLATTEN(INPUT => PARSE_JSON('{"a":1, "b":[77,88], "c": {"d":"X"}}'),
                            RECURSIVE => TRUE, MODE => 'OBJECT' )) f;
```

```output
+-----+-----+------+-------+------------+--------------+
| SEQ | KEY | PATH | INDEX | VALUE      | THIS         |
|-----+-----+------+-------+------------+--------------|
|   1 | a   | a    |  NULL | 1          | {            |
|     |     |      |       |            |   "a": 1,    |
|     |     |      |       |            |   "b": [     |
|     |     |      |       |            |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
|   1 | b   | b    |  NULL | [          | {            |
|     |     |      |       |   77,      |   "a": 1,    |
|     |     |      |       |   88       |   "b": [     |
|     |     |      |       | ]          |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
|   1 | c   | c    |  NULL | {          | {            |
|     |     |      |       |   "d": "X" |   "a": 1,    |
|     |     |      |       | }          |   "b": [     |
|     |     |      |       |            |     77,      |
|     |     |      |       |            |     88       |
|     |     |      |       |            |   ],         |
|     |     |      |       |            |   "c": {     |
|     |     |      |       |            |     "d": "X" |
|     |     |      |       |            |   }          |
|     |     |      |       |            | }            |
|   1 | d   | c.d  |  NULL | "X"        | {            |
|     |     |      |       |            |   "d": "X"   |
|     |     |      |       |            | }            |
+-----+-----+------+-------+------------+--------------+
```

The following example explodes an array that is nested within another array. Create the following table:

```sqlexample
CREATE OR REPLACE TABLE persons AS
  SELECT column1 AS id, PARSE_JSON(column2) as c
    FROM values
      (12712555,
       '{ name:  { first: "John", last: "Smith"},
         contact: [
         { business:[
           { type: "phone", content:"555-1234" },
           { type: "email", content:"j.smith@example.com" } ] } ] }'),
      (98127771,
       '{ name:  { first: "Jane", last: "Doe"},
         contact: [
         { business:[
           { type: "phone", content:"555-1236" },
           { type: "email", content:"j.doe@example.com" } ] } ] }') v;
```

The following query uses multiple LATERAL FLATTEN calls. LATERAL is required here because the second
FLATTEN references the output of the first (`f.value:business`) call. Without LATERAL, the second FLATTEN
could not access columns from the first call.

```sqlexample
SELECT id as "ID",
    f.value AS "Contact",
    f1.value:type AS "Type",
    f1.value:content AS "Details"
  FROM persons p,
    LATERAL FLATTEN(INPUT => p.c, PATH => 'contact') f,
    LATERAL FLATTEN(INPUT => f.value:business) f1;
```

```output
+----------+-----------------------------------------+---------+-----------------------+
|       ID | Contact                                 | Type    | Details               |
|----------+-----------------------------------------+---------+-----------------------|
| 12712555 | {                                       | "phone" | "555-1234"            |
|          |   "business": [                         |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "555-1234",            |         |                       |
|          |       "type": "phone"                   |         |                       |
|          |     },                                  |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "j.smith@example.com", |         |                       |
|          |       "type": "email"                   |         |                       |
|          |     }                                   |         |                       |
|          |   ]                                     |         |                       |
|          | }                                       |         |                       |
| 12712555 | {                                       | "email" | "j.smith@example.com" |
|          |   "business": [                         |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "555-1234",            |         |                       |
|          |       "type": "phone"                   |         |                       |
|          |     },                                  |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "j.smith@example.com", |         |                       |
|          |       "type": "email"                   |         |                       |
|          |     }                                   |         |                       |
|          |   ]                                     |         |                       |
|          | }                                       |         |                       |
| 98127771 | {                                       | "phone" | "555-1236"            |
|          |   "business": [                         |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "555-1236",            |         |                       |
|          |       "type": "phone"                   |         |                       |
|          |     },                                  |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "j.doe@example.com",   |         |                       |
|          |       "type": "email"                   |         |                       |
|          |     }                                   |         |                       |
|          |   ]                                     |         |                       |
|          | }                                       |         |                       |
| 98127771 | {                                       | "email" | "j.doe@example.com"   |
|          |   "business": [                         |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "555-1236",            |         |                       |
|          |       "type": "phone"                   |         |                       |
|          |     },                                  |         |                       |
|          |     {                                   |         |                       |
|          |       "content": "j.doe@example.com",   |         |                       |
|          |       "type": "email"                   |         |                       |
|          |     }                                   |         |                       |
|          |   ]                                     |         |                       |
|          | }                                       |         |                       |
+----------+-----------------------------------------+---------+-----------------------+
```

---
title: FLOOR
source: https://docs.snowflake.com/en/sql-reference/functions/floor.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# FLOOR

Returns values from `input_expr` rounded to the nearest equal or smaller integer, or to the nearest equal or smaller value with the specified number of places after the decimal point.

See also:
:   [CEIL](ceil.md) , [ROUND](round.md) , [TRUNCATE , TRUNC](trunc.md)

## Syntax

```sqlsyntax
FLOOR( <input_expr> [, <scale_expr> ] )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type should be one of the numeric data types, such as DECFLOAT,
    FLOAT, or NUMBER.

`scale_expr`
:   The number of digits the output should include after the decimal point.

    The default `scale_expr` is zero, meaning that the function removes all digits after the decimal point.

    For information about negative scales, see the Usage Notes below.

## Returns

The return type is based on the input type:

* If the input expression is a FLOAT, the returned type is a FLOAT.
* If the input expression is DECFLOAT, the returned type is DECFLOAT.
* If the input expression is a NUMBER, the returned type is a NUMBER.

  + If the input scale is constant:

    - If the input scale is positive, the returned type has a scale equal to the input scale and has a precision large enough to
      encompass any possible result.
    - If the input scale is negative, the returned type has a scale of 0.
  + If the input scale isn’t constant, the returned type’s scale is the same as the input expression’s.

If the scale is zero, then the value is effectively an INTEGER.

For example:

* The data type returned by FLOOR(3.14::FLOAT, 1) is FLOAT.
* The NUMBER returned by FLOOR(3.14, 1) has scale 1 and precision at least 3.
* The NUMBER returned by FLOOR(-9.99, 0) has scale 0 and precision at least 2.
* The NUMBER returned by FLOOR(33.33, -1) has scale 0 and precision at least 3.

## Usage notes

* If `scale_expr` is negative, then it specifies the number of places before the decimal point to
  which to adjust the number. For example, if the scale is -2, then the result is a multiple of 100.
* If `scale_expr` is larger than the input expression scale, the function does not have any effect.
* If either the `input_expr` or the `scale_expr` is NULL, then the result is NULL.
* When negative numbers are rounded down, the value is further from 0. For example, FLOOR(-1.1) is -2, not -1.
* If rounding the number downward brings the number outside of the range of values of the data type, an error is returned.

## Examples

This example demonstrates the function without the `scale_expr`
parameter:

> ```sqlexample
> SELECT FLOOR(135.135), FLOOR(-975.975);
> +----------------+-----------------+
> | FLOOR(135.135) | FLOOR(-975.975) |
> |----------------+-----------------|
> |            135 |            -976 |
> +----------------+-----------------+
> ```

This example demonstrates the function with the `scale_expr` parameter,
including with the scale set to negative numbers:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE test_floor (n FLOAT, scale INTEGER);
> > INSERT INTO test_floor (n, scale) VALUES
> >    (-975.975, -1),
> >    (-975.975,  0),
> >    (-975.975,  2),
> >    ( 135.135, -2),
> >    ( 135.135,  0),
> >    ( 135.135,  1),
> >    ( 135.135,  3),
> >    ( 135.135, 50),
> >    ( 135.135, NULL)
> >    ;
> > ```
>
> Output:
>
> > ```sqlexample
> > SELECT n, scale, FLOOR(n, scale)
> >   FROM test_floor
> >   ORDER BY n, scale;
> > +----------+-------+-----------------+
> > |        N | SCALE | FLOOR(N, SCALE) |
> > |----------+-------+-----------------|
> > | -975.975 |    -1 |        -980     |
> > | -975.975 |     0 |        -976     |
> > | -975.975 |     2 |        -975.98  |
> > |  135.135 |    -2 |         100     |
> > |  135.135 |     0 |         135     |
> > |  135.135 |     1 |         135.1   |
> > |  135.135 |     3 |         135.135 |
> > |  135.135 |    50 |         135.135 |
> > |  135.135 |  NULL |            NULL |
> > +----------+-------+-----------------+
> > ```

---
title: FRESHNESS (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_freshness.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# FRESHNESS (system data metric function)

Returns how much time in seconds has elapsed since a table was last modified.

When a column argument is specified, the time period is calculated by comparing the current run of the function with the maximum value of a
timestamp column. If the scheduled time to run the function is different than the time it actually ran, then the scheduled time is used for
the comparison.

When no column is specified, the time period is calculated by comparing the current run of the function with the last time a
[DML command](../sql-dml.md) acted on the table. If the scheduled time to run the function is different than the time it
actually ran, then the scheduled time is used for the comparison.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.FRESHNESS( [ <query> ] )
```

## Arguments

`query`
:   If specified, the query must project a single timestamp column.

    If you don’t want to specify a column, you must [associate the function with a table](../../user-guide/data-quality-working.md) rather than call
    it directly.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* DATE
* TIMESTAMP_LTZ
* TIMESTAMP_TZ

## Returns

The function returns a scalar value with a NUMBER data type.

## Usage notes

* You must specify a column argument if you want to associate this function with a view or external table.
* This function can be called directly only if you specify a query that projects a timestamp column. If you want to associate the function
  with a table or view so it runs at regular intervals with or without a column argument, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Example

Associate the function with the table `t1` to determine how long it’s been since the last DML operation on the table:

```sqlexample
ALTER TABLE t1
  ADD DATA METRIC FUNCTION SNOWFLAKE.CORE.FRESHNESS on ();
```

Call the function directly to determine the freshness of the data, 300 seconds or 5 minutes, in the table by measuring the
`TIMESTAMP` column:

```sqlexample
SELECT SNOWFLAKE.CORE.FRESHNESS(
  SELECT
    timestamp
  FROM hr.tables.empl_info
) < 300;
```

```output
+---------------------------------------------------------------------+
| SNOWFLAKE.CORE.FRESHNESS(SELECT timestamp FROM hr.tables.empl_info) |
+---------------------------------------------------------------------+
| True                                                                |
+---------------------------------------------------------------------+
```

---
title: GENERATE_COLUMN_DESCRIPTION
source: https://docs.snowflake.com/en/sql-reference/functions/generate_column_description.md
section: SQL Functions
---

Categories:
:   [Metadata functions](../functions-metadata.md)

# GENERATE_COLUMN_DESCRIPTION

Generates a list of columns from a set of staged files that contain semi-structured data using the
[INFER_SCHEMA](infer_schema.md) function output.

The output from this function can be used as input when manually creating a table, external table, Apache Iceberg™ table, or view (using the appropriate
[CREATE <object>](../sql/create.md) command) based on the column definitions of the staged files.

Alternatively, the [CREATE TABLE](../sql/create-table.md) or [CREATE ICEBERG TABLE](../sql/create-iceberg-table.md) command with the USING TEMPLATE clause can be used to create a new table with the
column definitions derived from the same INFER_SCHEMA function output.

## Syntax

```sqlsyntax
GENERATE_COLUMN_DESCRIPTION( <expr> , '<string>' )
```

## Arguments

`expr`
:   Output of the INFER_SCHEMA function formatted as an array.

`'string'`
:   Type of object that could be created from the column list. The appropriate formatting for this type is applied to the output.

    Possible values are `table`, `external_table`, or `view`.

## Returns

The function returns the list of columns in a set of staged files, which can be
used as input when creating an object of the type identified in the second argument.

## Examples

Detect, format, and output the set of column definitions in a set of Parquet files staged in the `mystage` stage. The output columns are
formatted for creating a table.

This example builds on an example in the [INFER_SCHEMA](infer_schema.md) topic:

```sqlexample
-- Create a file format that sets the file type as Parquet.
CREATE FILE FORMAT my_parquet_format
  TYPE = parquet;

-- Query the GENERATE_COLUMN_DESCRIPTION function.
SELECT GENERATE_COLUMN_DESCRIPTION(ARRAY_AGG(OBJECT_CONSTRUCT(*)), 'table') AS COLUMNS
  FROM TABLE (
    INFER_SCHEMA(
      LOCATION=>'@mystage',
      FILE_FORMAT=>'my_parquet_format'
    )
  );

+--------------------+
| COLUMN_DESCRIPTION |
|--------------------|
| "country" VARIANT, |
| "continent" TEXT   |
+--------------------+

-- The function output can be used to define the columns in a table.
CREATE TABLE mytable ("country" VARIANT, "continent" TEXT);
```

Same as the previous example, but generates a set of columns formatted for creating an external table:

```sqlexample
-- Query the GENERATE_COLUMN_DESCRIPTION function.
SELECT GENERATE_COLUMN_DESCRIPTION(ARRAY_AGG(OBJECT_CONSTRUCT(*)), 'external_table') AS COLUMNS
  FROM TABLE (
    INFER_SCHEMA(
      LOCATION=>'@mystage',
      FILE_FORMAT=>'my_parquet_format'
    )
  );

+---------------------------------------------+
| COLUMN_DESCRIPTION                          |
|---------------------------------------------|
| "country" VARIANT AS ($1:country::VARIANT), |
| "continent" TEXT AS ($1:continent::TEXT)    |
+---------------------------------------------+
```

Same as the previous examples, but generates a set of columns formatted for creating an Iceberg table:

```sqlexample
-- Create a file format that sets the file type as Parquet.
CREATE OR REPLACE FILE FORMAT my_parquet_format
  TYPE = PARQUET
  USE_VECTORIZED_SCANNER = TRUE;

-- Query the GENERATE_COLUMN_DESCRIPTION function.
SELECT GENERATE_COLUMN_DESCRIPTION(ARRAY_AGG(OBJECT_CONSTRUCT(*)), 'table') AS COLUMNS
  FROM TABLE (
    INFER_SCHEMA(
      LOCATION=>'@my_int_stage',
      FILE_FORMAT=>'my_parquet_format',
      KIND => 'ICEBERG'
    )
  );

+---------------------------------------------+
| COLUMN_DESCRIPTION                          |
|---------------------------------------------|
| "id" INT NOT NULL,                          |
| "custnum" INT NOT NULL                      |
+---------------------------------------------+
```

Same as the previous examples, but generates a set of columns formatted for creating a view:

```sqlexample
-- Query the GENERATE_COLUMN_DESCRIPTION function.
SELECT GENERATE_COLUMN_DESCRIPTION(ARRAY_AGG(OBJECT_CONSTRUCT(*)), 'view') AS COLUMNS
  FROM TABLE (
    INFER_SCHEMA(
      LOCATION=>'@mystage',
      FILE_FORMAT=>'my_parquet_format'
    )
  );

+--------------------+
| COLUMN_DESCRIPTION |
|--------------------|
| "country" ,        |
| "continent"        |
+--------------------+
```

> **Note:**
>
> Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger
> than 128 MB. Avoid using `*` for larger result sets, and only use the required columns, `COLUMN NAME`,
> `TYPE`, and `NULLABLE`, for the query. You can include the optional column `ORDER_ID` when using
> `WITHIN GROUP (ORDER BY order_id)`.

---
title: GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER
source: https://docs.snowflake.com/en/sql-reference/functions/generate_postgres_access_token_for_user.md
section: SQL Functions
---

# GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER

Generates a short-lived access token for a Snowflake user to use as a password when logging into a Snowflake Postgres instance that has
the AUTHENTICATION_AUTHORITY attribute set to POSTGRES_OR_SNOWFLAKE.

Short-lived access tokens generated with this function have a 15-minute lifetime. Once expired they can no longer be used to
establish new connections to the Snowflake Postgres instance.

See [Snowflake Token Authentication for Snowflake Postgres](../../user-guide/snowflake-postgres/postgres-token-auth.md) for more details.

## Syntax

```sqlsyntax
GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER('<snowflake_postgres_instance_name>', '<postgres_username>')
```

## Arguments

`snowflake_postgres_instance_name`
:   Specifies the Snowflake Postgres instance name to generate the short-lived access token for. If the given instance does not exist
    or the executing user does not have ownership or the USAGE permission on the instance, the function execution will fail.

    This argument is case-insensitive. Use double-quotes if case-sensitivity is needed.

`postgres_username`
:   Specifies the Postgres username to generate the short-lived access token for. This argument is not validated, which allows for creating
    unusable tokens for Postgres users that do not exist or are not mapped to a Snowflake user. Valid tokens will be usable if a mapping is
    subsequently created.

    This argument is case-sensitive.

## Returns

Returns a short-lived access token that has a 15-minute lifetime.

## Access control requirements

Execution of this function for a given Snowflake Postgres instance can only be done by the instance’s owner or users granted the USAGE
permission for it.

## Examples

Snowflake user Casey can generate a short-lived access token to use when logging into the `reporting_server` with a Postgres user
named `reporting_user` with:

```sqlexample
SELECT GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER('reporting_server', 'reporting_user');
```

If the instance’s name was created case-sensitive as `Reporting_server` then double-quotes are needed for the instance name:

```sqlexample
SELECT GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER('"Reporting_server"', 'reporting_user');
```

---
title: GENERATOR
source: https://docs.snowflake.com/en/sql-reference/functions/generator.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# GENERATOR

Creates rows of data based either on a specified number of rows, a specified generation period (in seconds), or both. This system-defined table function enables synthetic row generation.

Note that it is possible to generate virtual tables with 0 columns but possibly many rows. Such virtual tables are useful for queries whose [SELECT](../sql/select.md) clause consists entirely
of data-generating functions.

## Syntax

```sqlsyntax
GENERATOR( ROWCOUNT => <count> [ , TIMELIMIT => <sec> ] )

GENERATOR( [ TIMELIMIT => <sec> ] )
```

## Usage notes

* `count` and `sec` must be non-negative integer constants.
* If only the `ROWCOUNT` argument is specified, the resulting table will contain `count` rows.
* If only the `TIMELIMIT` argument is specified, the query runs for `sec` seconds, generating as many
  rows as possible within the time frame. The exact row count depends on the system speed and is not entirely
  deterministic.
* If both the `ROWCOUNT` and `TIMELIMIT` arguments are specified, then:

  + If the `ROWCOUNT` is reached before the `TIMELIMIT`, the resulting table will contain `count`
    rows.
  + If the `TIMELIMIT` is reached before the `ROWCOUNT`, the table will contain the number of rows
    generated within the time frame. The exact row count depends on the system speed and is not entirely deterministic.
* If `ROWCOUNT` or `TIMELIMIT` is null, it will be ignored. So `generator(ROWCOUNT => null)` generates 0 rows.
* If both parameters (`ROWCOUNT` and `TIMELIMIT`) are omitted, the GENERATOR function returns 0 rows.
* The content of the rows is determined by the functions in the projection
  clause, not by the GENERATOR function itself. For more details, see
  the Examples section below. See also the description(s) of the specific
  functions (e.g. SEQ()), that you plan to use in the projection clause;
  not all valid functions produce sequences without gaps.

## Examples

> **Note:**
>
> These examples generate sequences that can have gaps. For examples that generate sequences without gaps, refer to
> [SEQ1 / SEQ2 / SEQ4 / SEQ8](seq1.md) and [ROW_NUMBER](row_number.md).

This example uses the GENERATOR function to generate 10 rows. The content
of the rows is determined by the functions in the projection clause:

* The [SEQ4()](seq1.md) column generates a
  sequence of 4-byte integers, starting with 0.
* The [UNIFORM(…)](uniform.md) column generates
  values in the range between the first parameter (1) and the second
  parameter (10), based on either a function or a constant passed as the third
  parameter.

This example includes an optional “seed” for the RANDOM() function so that the
output is consistent:

> ```sqlexample
> SELECT seq4(), uniform(1, 10, RANDOM(12))
>   FROM TABLE(GENERATOR(ROWCOUNT => 10)) v
>   ORDER BY 1;
> +--------+----------------------------+
> | SEQ4() | UNIFORM(1, 10, RANDOM(12)) |
> |--------+----------------------------|
> |      0 |                          7 |
> |      1 |                          2 |
> |      2 |                          5 |
> |      3 |                          9 |
> |      4 |                          6 |
> |      5 |                          9 |
> |      6 |                          9 |
> |      7 |                          5 |
> |      8 |                          3 |
> |      9 |                          8 |
> +--------+----------------------------+
> ```

This example is similar to the preceding example, except that it passes a
constant rather than a function as the third parameter to the `UNIFORM`
function. The result is that the output for the `UNIFORM` column is the
same for every row.

> ```sqlexample
> SELECT seq4(), uniform(1, 10, 42)
>   FROM TABLE(GENERATOR(ROWCOUNT => 10)) v
>   ORDER BY 1;
> +--------+--------------------+
> | SEQ4() | UNIFORM(1, 10, 42) |
> |--------+--------------------|
> |      0 |                 10 |
> |      1 |                 10 |
> |      2 |                 10 |
> |      3 |                 10 |
> |      4 |                 10 |
> |      5 |                 10 |
> |      6 |                 10 |
> |      7 |                 10 |
> |      8 |                 10 |
> |      9 |                 10 |
> +--------+--------------------+
> ```

If you omit both the `ROWCOUNT` and `TIMELIMIT` parameters, the output is 0 rows:

> ```sqlexample
> SELECT seq4(), uniform(1, 10, RANDOM(12))
>   FROM TABLE(GENERATOR()) v
>   ORDER BY 1;
> +--------+----------------------------+
> | SEQ4() | UNIFORM(1, 10, RANDOM(12)) |
> |--------+----------------------------|
> +--------+----------------------------+
> ```

The following example uses the `TIMELIMIT` parameter without the `ROWCOUNT` parameter.

```sqlexample
SELECT COUNT(seq4()) FROM TABLE(GENERATOR(TIMELIMIT => 10)) v;

+---------------+
| COUNT(SEQ4()) |
|---------------|
|    3615440896 |
+---------------+
```

---
title: GET
source: https://docs.snowflake.com/en/sql-reference/functions/get.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# GET

Extracts a value from an [ARRAY](../data-types-semistructured.md) or an [OBJECT](../data-types-semistructured.md) (or a [VARIANT](../data-types-semistructured.md) that
contains an ARRAY or OBJECT).

The function returns NULL if either of the arguments is NULL.

Note that this function should not be confused with the [GET](../sql/get.md) DML command.

See also:
:   [GET_IGNORE_CASE](get_ignore_case.md) , [GET_PATH , :](get_path.md)

## Syntax

**ARRAY (or VARIANT containing an ARRAY)**

```sqlsyntax
GET( <array> , <index> )

GET( <variant> , <index> )
```

**OBJECT (or VARIANT containing an OBJECT)**

```sqlsyntax
GET( <object> , <field_name> )

GET( <variant> , <field_name> )
```

**MAP**

```sqlsyntax
GET( <map> , <key> )
```

## Arguments

`array`
:   An expression that evaluates to an [ARRAY](../data-types-semistructured.md).

`index`
:   An expression that evaluates to an INTEGER. This specifies the position of the element to retrieve from the ARRAY. The
    position is 0-based, not 1-based.

    If the index points outside of the array boundaries, or if the indexed element does not exist (in a sparse array):

    * If `array` is a semi-structured ARRAY, this function returns NULL.
    * If `array` is a structured ARRAY, an error occurs.

`variant`
:   An expression that evaluates to a [VARIANT](../data-types-semistructured.md) that contains either an ARRAY or an OBJECT.

`object`
:   An expression that evaluates to an [OBJECT](../data-types-semistructured.md) that contains key-value pairs.

`field_name`
:   An expression that evaluates to a VARCHAR. This specifies the key in a key-value pair for which you want to retrieve the value.

    `field_name` must not be an empty string.

    If `object` is a [structured OBJECT](../data-types-structured.md), you must specify a constant for
    `field_name`.

    If `object` does not contain the specified key:

    * If `object` is a semi-structured OBJECT, the function returns NULL.
    * If `object` is a structured OBJECT, an error occurs.

`map`
:   An expression that evaluates to a [MAP](../data-types-structured.md).

`key`
:   The key in a key-value pair for which you want to retrieve the value.

    If `map` does not contain the specified key, the function returns NULL.

## Returns

* The returned value is the specified element of the ARRAY, or the value that corresponds to the specified key of a key-value
  pair in the OBJECT.
* If the input object is a semi-structured OBJECT, ARRAY, or VARIANT value, the function returns a VARIANT value. The data type
  of the value is VARIANT because:

  + In an ARRAY value, each element is of type VARIANT.
  + In an OBJECT value, the value in each key-value pair is of type VARIANT.
* If the input object is a [structured OBJECT, structured ARRAY, or MAP](../data-types-structured.md),
  the function returns a value of the type specified for the object.

  For example, if the type of the input object is ARRAY(NUMBER), the function returns a NUMBER value.

## Usage notes

* GET applies case-sensitive matching to `field_name`. For case-insensitive matching, use [GET_IGNORE_CASE](get_ignore_case.md).
* If the first parameter is of type VARIANT:

  + If the second parameter is of type VARCHAR (e.g. a `field_name`), the function returns NULL if `variant`
    does not contain an OBJECT.
  + If the second parameter is of type INTEGER (e.g. an `index`), the function returns NULL if `variant`
    does not contain an ARRAY.

## Examples

Create a table with sample data:

> ```sqlexample
> CREATE TABLE vartab (a ARRAY, o OBJECT, v VARIANT);
> INSERT INTO vartab (a, o, v)
>   SELECT
>     ARRAY_CONSTRUCT(2.71, 3.14),
>     OBJECT_CONSTRUCT('Ukraine', 'Kyiv'::VARIANT,
>                      'France',  'Paris'::VARIANT),
>     TO_VARIANT(OBJECT_CONSTRUCT('weatherStationID', 42::VARIANT,
>                      'timestamp', '2022-03-07 14:00'::TIMESTAMP_LTZ::VARIANT,
>                      'temperature', 31.5::VARIANT,
>                      'sensorType', 'indoor'::VARIANT))
>     ;
> ```
>
> ```sqlexample
> SELECT a, o, v FROM vartab;
> +---------+----------------------+-------------------------------------------------+
> | A       | O                    | V                                               |
> |---------+----------------------+-------------------------------------------------|
> | [       | {                    | {                                               |
> |   2.71, |   "France": "Paris", |   "sensorType": "indoor",                       |
> |   3.14  |   "Ukraine": "Kyiv"  |   "temperature": 31.5,                          |
> | ]       | }                    |   "timestamp": "2022-03-07 14:00:00.000 -0800", |
> |         |                      |   "weatherStationID": 42                        |
> |         |                      | }                                               |
> +---------+----------------------+-------------------------------------------------+
> ```

Extract the first element of an ARRAY:

> ```sqlexample
> SELECT GET(a, 0) FROM vartab;
> +-----------+
> | GET(A, 0) |
> |-----------|
> | 2.71      |
> +-----------+
> ```

Given the name of a country, extract the name of the capital city of that country from an OBJECT containing country names and
capital cities:

> ```sqlexample
> SELECT GET(o, 'Ukraine') FROM vartab;
> +-------------------+
> | GET(O, 'UKRAINE') |
> |-------------------|
> | "Kyiv"            |
> +-------------------+
> ```

Extract the temperature from a VARIANT that contains an OBJECT:

> ```sqlexample
> SELECT GET(v, 'temperature') FROM vartab;
> +-----------------------+
> | GET(V, 'TEMPERATURE') |
> |-----------------------|
> | 31.5                  |
> +-----------------------+
> ```

For more detailed examples, see [Querying Semi-structured Data](../../user-guide/querying-semistructured.md).

For examples of using GET with XMLGET, see the Examples and Usage Notes sections in [XMLGET](xmlget.md).

---
title: GET_ABSOLUTE_PATH
source: https://docs.snowflake.com/en/sql-reference/functions/get_absolute_path.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# GET_ABSOLUTE_PATH

Retrieves the absolute path of a staged file using the stage name and path of the file relative to its location in the stage as inputs.

## Syntax

```sqlsyntax
GET_ABSOLUTE_PATH( @<stage_name> , '<relative_file_path>' )
```

## Arguments

`stage_name`
:   Name of the internal or external stage where the file is stored.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a stage
    > named `"my stage"`).

`relative_file_path`
:   Path and filename of the file relative to its location in the stage.

## Returns

Absolute path of the file in cloud storage.

## Usage notes

* This SQL function returns a value for any role that has the following privilege on the stage:

  External stage:
  :   USAGE

  Internal stage:
  :   READ

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

Retrieve the absolute path of a bitmap format image file in an external stage:

```sqlexample
SELECT GET_ABSOLUTE_PATH(@images_stage, 'us/yosemite/half_dome.jpg');

+------------------------------------------------------------------------------------------+
| GET_ABSOLUTE_PATH(@IMAGES_STAGE, 'US/YOSEMITE/HALF_DOME.JPG')                            |
+------------------------------------------------------------------------------------------+
| s3://photos/national_parks/us/yosemite/half_dome.jpg                                     |
+------------------------------------------------------------------------------------------+
```

---
title: GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)
source: https://docs.snowflake.com/en/sql-reference/functions/get_ai_evaluation_data-snowflake-local.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Cortex Agents)

# GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)

Retrieves evaluation data for a Cortex Agent evaluation run.

Call this function to inspect all recorded traces for an evaluatuon run. For more information on Cortex Agent evaluations, see [Cortex Agent evaluations](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

See also:
:   [EXECUTE_AI_EVALUATION](execute_ai_evaluation.md) , [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](get_ai_record_trace-snowflake-local.md) , [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](get_ai_observability_logs-snowflake-local.md)

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.GET_AI_EVALUATION_DATA( <database> , <schema> , <agent_name> , <agent_type>, <run_name> )
```

## Arguments

`database`
:   Name of the database containing the agent.

`schema`
:   Name of the schema containing the agent.

`agent_name`
:   Name of the agent to retrieve a record for.

`agent_type`
:   The string constant `CORTEX AGENT`. This value is case-insensitive.

`run_name`
:   Name of the run to retrieve full evaluation data for.

## Returns

A table containing information for the specified evaluation, with the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| RECORD_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation record. |
| INPUT_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation input. |
| REQUEST_ID | VARCHAR | The unique identifier assigned by Snowflake for this request. |
| TIMESTAMP | TIMESTAMP_TZ | The time (in UTC) at which the request was made. |
| DURATION_MS | INT | The amount of time, in milliseconds, that it took for the agent to return a response. |
| INPUT | VARCHAR | The query string used as input for this evaluation record. |
| OUTPUT | VARCHAR | The response returned by the Cortex Agent for this evaluation record. |
| ERROR | VARCHAR | Information about any errors that occurred during the request. |
| GROUND_TRUTH | VARCHAR | The ground truth information used to evaluate this record’s Cortex Agent output. |
| METRIC_NAME | VARCHAR | The name of the metric evaluated for this record. |
| EVAL_AGG_SCORE | NUMBER | The evaluation score assigned for this record. |
| METRIC_TYPE | VARCHAR | The type of metric being evaluated. For built-in metrics, the value is `system`. For custom metrics, the value is `custom`. |
| METRIC_STATUS | VARIANT | A map containing information about the agent’s HTTP response for this record, with the following keys:  * `status`: The HTTP status code of the response. * `message`: The HTTP message sent in the status response. |
| METRIC_CALLS | ARRAY | An array of VARIANT values that contain information about the computed metric. Each array entry contains the metric’s criteria, an explanation of the metric score, and metadata. The keys of each entry are:  * `criteria`: The criteria used by an LLM judge to evaluate response correctness. * `explanation`: An explanation of why the score was assigned. * `full_metadata`: A VARIANT value that contains metadata and information about this metric’s processing by the LLM judge. The keys of this map include:  + `completion_tokens`: The number of output tokens generated by the LLM for this metric evaluation call.   + `guard_tokens`: The number of tokens consumed by Cortex Guard for this metric evaluation call.   + `normalized_score`: The original evaluation score normalized to the range [0.0, 1.0], rounded to two decimal places.   + `original_score`: The original score assigned by this metric evaluation for the record.   + `prompt_tokens`: The number of tokens taken up by the prompt provided to the LLM judge.   + `total_tokens`: The total number of tokens used by the LLM judge for this computation. |
| TOTAL_INPUT_TOKENS | INT | The total number of tokens used to process the input query. |
| TOTAL_OUTPUT_TOKENS | INT | The total number of output tokens produced by the Cortex Agent. |
| LLM_CALL_COUNT | INT | Counts the number of times any LLM was called, either by the agent or an evaluation judge. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CORTEX_USER | Database role |  |
| USAGE | Cortex Agent |  |
| MONITOR | Cortex Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For the full access control permissions required by Cortex Agent evaluations, see [Cortex Agent evaluations – Access control requirements](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

## Examples

The following example displays the full evaluation details for a run called `run-1`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_EVALUATION_DATA(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT',
  'run-1')
);
```

---
title: GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)
source: https://docs.snowflake.com/en/sql-reference/functions/get_ai_observability_logs-snowflake-local.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Cortex Agents)

# GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)

Retrieve log data for a Cortex Agent observability event, such as a warning or failure.

Call this function to retrieve information about what events occurred during a Cortex Agent evaluation run. For more information, see [Cortex Agent evaluations](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

See also:
:   [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](get_ai_record_trace-snowflake-local.md) , [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](get_ai_evaluation_data-snowflake-local.md) , [EXECUTE_AI_EVALUATION](execute_ai_evaluation.md)

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.GET_AI_OBSERVABILITY_LOGS( <database>, <schema>, <agent_name>, <agent_type> )
```

## Arguments

`database`
:   Name of the database containing the agent.

`schema`
:   Name of the schema containing the agent.

`agent_name`
:   Name of the agent to retrieve a record for.

`agent_type`
:   The type of agent to retrieve evaluation data for. Use the string constant `CORTEX AGENT`. This value is case-insensitive.

## Returns

For details on the information contained in AI Observability events, see [Observability data](../../user-guide/snowflake-cortex/ai-observability/reference.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CORTEX_USER | Database role |  |
| USAGE | Cortex Agent |  |
| MONITOR | Cortex Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For the full access control permissions required by Cortex Agent evaluations, see [Cortex Agent evaluations – Access control requirements](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

## Examples

The following example checks for errors and warnings for a run called `run-1`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_OBSERVABILITY_LOGS(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT')
)
  WHERE TRUE
    AND (record:"severity_text"='ERROR' or record:"severity_text"='WARN')
    AND record_attributes:"snow.ai.observability.run.name"='run-1';
```

---
title: GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)
source: https://docs.snowflake.com/en/sql-reference/functions/get_ai_record_trace-snowflake-local.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Cortex Agents)

# GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)

Retrieve a single trace record from a Cortex Agent evaluation run.

Call this function when you want to inspect a single record from a complete Cortex Agent evaluation. For more information, see [Cortex Agent evaluations](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

See also:
:   [EXECUTE_AI_EVALUATION](execute_ai_evaluation.md) , [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](get_ai_evaluation_data-snowflake-local.md) , [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](get_ai_observability_logs-snowflake-local.md)

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.GET_AI_RECORD_TRACE( <database> , <schema> , <agent_name> , <agent_type> , <record_id> )
```

## Arguments

`database`
:   Name of the database containing the agent.

`schema`
:   Name of the schema containing the agent.

`agent_name`
:   Name of the agent to retrieve a record for.

`agent_type`
:   The string constant `CORTEX AGENT`. This value is case-insensitive.

`record_id`
:   The record identifier to retrieve.

## Returns

A table containing information for the requested trace, with the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| RECORD_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation record. |
| INPUT_ID | VARCHAR | The unique identifier assigned by Snowflake for this evaluation input. |
| REQUEST_ID | VARCHAR | The unique identifier assigned by Snowflake for this request. |
| TIMESTAMP | TIMESTAMP_TZ | The time (in UTC) at which the request was made. |
| DURATION_MS | INT | The amount of time, in milliseconds, that it took for the agent to return a response. |
| INPUT | VARCHAR | The query string used as input for this evaluation record. |
| OUTPUT | VARCHAR | The response returned by the Cortex Agent for this evaluation record. |
| ERROR | VARCHAR | Information about any errors that occurred during the request. |
| GROUND_TRUTH | VARCHAR | The ground truth information used to evaluate this record’s Cortex Agent output. |
| METRIC_NAME | VARCHAR | The name of the metric evaluated for this record. |
| EVAL_AGG_SCORE | NUMBER | The evaluation score assigned for this record. |
| METRIC_TYPE | VARCHAR | The type of metric being evaluated. For built-in metrics, the value is `system`. For custom metrics, the value is `custom`. |
| METRIC_STATUS | VARIANT | A map containing information about the agent’s HTTP response for this record, with the following keys:  * `status`: The HTTP status code of the response. * `message`: The HTTP message sent in the status response. |
| METRIC_CALLS | ARRAY | An array of VARIANT values that contain information about the computed metric. Each array entry contains the metric’s criteria, an explanation of the metric score, and metadata. The keys of each entry are:  * `criteria`: The criteria used by an LLM judge to evaluate response correctness. * `explanation`: An explanation of why the score was assigned. * `full_metadata`: A VARIANT value that contains metadata and information about this metric’s processing by the LLM judge. The keys of this map include:  + `completion_tokens`: The number of output tokens generated by the LLM for this metric evaluation call.   + `guard_tokens`: The number of tokens consumed by Cortex Guard for this metric evaluation call.   + `normalized_score`: The original evaluation score normalized to the range [0.0, 1.0], rounded to two decimal places.   + `original_score`: The original score assigned by this metric evaluation for the record.   + `prompt_tokens`: The number of tokens taken up by the prompt provided to the LLM judge.   + `total_tokens`: The total number of tokens used by the LLM judge for this computation. |
| TOTAL_INPUT_TOKENS | INT | The total number of tokens used to process the input query. |
| TOTAL_OUTPUT_TOKENS | INT | The total number of output tokens produced by the Cortex Agent. |
| LLM_CALL_COUNT | INT | Counts the number of times any LLM was called, either by the agent or an evaluation judge. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CORTEX_USER | Database role |  |
| USAGE | Cortex Agent |  |
| MONITOR | Cortex Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For the full access control permissions required by Cortex Agent evaluations, see [Cortex Agent evaluations – Access control requirements](../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

## Examples

The following example displays the trace for the record `9346efc3-5dd6-4038-9b1a-72ca3d3b768c`, where the agent is named `evaluated_agent` stored on the schema `eval_db.eval_schema`:

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.LOCAL.GET_AI_RECORD_TRACE(
  'eval_db',
  'eval_schema',
  'evaluated_agent',
  'CORTEX AGENT',
  '9346efc3-5dd6-4038-9b1a-72ca3d3b768c'
));
```

---
title: GET_ANACONDA_PACKAGES_REPODATA
source: https://docs.snowflake.com/en/sql-reference/functions/get_anaconda_packages_repodata.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# GET_ANACONDA_PACKAGES_REPODATA

Returns a list of third-party packages that are available from Anaconda.
For more information, see [Packages policies](../../developer-guide/udf/python/packages-policy.md).

## Syntax

```sqlsyntax
SNOWFLAKE.SNOWPARK.GET_ANACONDA_PACKAGES_REPODATA( '<architecture>' )
```

## Arguments

`architecture`
:   String specifying the architecture, which can be:
    `linux-64`, `linux-aarch64`, or `noarch`.

## Returns

Returns a JSON string that contains the contents of `repodata.json`, which is an index of the packages in a subdir. A subdir represents a particular archtecture. Each subdir will have its own repodata.

For more information, see the [Conda documentation](https://docs.conda.io/projects/conda-build/en/stable/concepts/generating-index.html#repodata-json)

## Access control requirements

You must use the ACCOUNTADMIN role to call this function.

## Examples

The following example gets the list of third-party packages from Anaconda for `linux-64`.

```sqlexample
USE ROLE ACCOUNTADMIN;

select SNOWFLAKE.SNOWPARK.GET_ANACONDA_PACKAGES_REPODATA('linux-64');
```

---
title: GET_CONDITION_QUERY_UUID
source: https://docs.snowflake.com/en/sql-reference/functions/get_condition_query_uuid.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Alerts)

# GET_CONDITION_QUERY_UUID

Returns the query ID for the SQL statement executed for the condition of an [alert](../../user-guide/alerts.md). In the action for
an alert, you can call this function to
[check the results of the statement for the condition](../../user-guide/alerts.md).

## Syntax

```sqlsyntax
SNOWFLAKE.ALERT.GET_CONDITION_QUERY_UUID()
```

## Arguments

None.

## Returns

The query ID for the SQL statement for the condition of the alert.

## Usage notes

* This function is defined in the ALERT schema of the SNOWFLAKE database.

  To call this function, you must use a role that is granted the
  [SNOWFLAKE database role](../snowflake-db-roles.md) ALERT_VIEWER. For example, to call the function as a user
  with the role alert_role, execute:

  ```sqlexample
  GRANT DATABASE ROLE snowflake.alert_viewer TO ROLE alert_role;
  ```
* This function can only be called from within an [alert](../../user-guide/alerts.md).

## Examples

Refer to [Checking the results of the SQL statement for the condition in the alert action](../../user-guide/alerts.md).

---
title: GET_CONFIGURATION_VALUE (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/get_configuration_value.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# GET_CONFIGURATION_VALUE (SYS_CONTEXT function)

Returns the current value for the specified configuration.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$APPLICATION' ,
  'GET_CONFIGURATION_VALUE' ,
  '<config_name>' ,
)
```

## Arguments

`'SNOWFLAKE$APPLICATION'`
:   Specifies that you want to call a function to return context information about the application in which the function is called.

`'GET_CONFIGURATION_VALUE'`
:   Calls the GET_CONFIGURATION_VALUE function.

`'config_name'`
:   Specifies the name of the configuration to get the value for.

## Returns

The function returns the current value of the configuration.

## Usage notes

* This function can only be used by an app.
* For a configuration definition of type `APPLICATION_NAME`, the value returned is the current
  name of the application stored in the specified configuration.

---
title: GET_CONTACTS
source: https://docs.snowflake.com/en/sql-reference/functions/get_contacts.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# GET_CONTACTS

Returns the [contacts](../../user-guide/contacts-using.md) associated with an object.

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.GET_CONTACTS (
  '<object_name>',
  '<object_type>'
  [ , '<contact_name>' ]
)
```

## Required arguments

`'object_name'`
:   Name of an object that can have contacts associated with it.

`'object_type'`
:   Type of the specified object. Possible values are DATABASE, SCHEMA, and TABLE (for all table-like objects contained in a database and
    schema).

    For a list of supported object types, see [Supported objects](../../user-guide/contacts-using.md).

## Optional arguments

`'contact_name'`
:   Name of a contact. If a contact is specified, the function doesn’t return information about other contacts that are associated with the
    specified object.

## Output

Returns a table, where each row has the following columns:

Title

| Column | Data type | Description |
| --- | --- | --- |
| `purpose` | VARCHAR | Describes the relationship between the contact and the specified object. The purpose helps you distinguish between contacts associated with the object so you can reach the right person for assistance. For example, an ACCESS_APPROVAL purpose indicates that the contact can help you get access to the object. |
| `email_distribution_list` | VARCHAR | Email addresses that can be used to contact someone about the object. |
| `url` | VARCHAR | A URL that can be used to contact someone about the object. |
| `user` | VARCHAR | User who can be contacted about the object. |
| `level` | VARCHAR | Type of object with which the contact was associated. You can use the level to determine where within the object hierarchy the contact was associated. Possible values include DATABASE, SCHEMA, and TABLE (for all table-like objects contained in a database and schema). |

> **Note:**
>
> The name of the contact object is intentionally omitted from the output of this function.

## Access control requirements

You must have the CORE_VIEWER database role to call this function.

## Usage notes

If a contact object includes a list of users, this function returns a separate row for each user in the list.

## Examples

Return a row for each contact associated with the table `t1`.

```sqlexample
SELECT * FROM TABLE(SNOWFLAKE.CORE.GET_CONTACTS('t1', 'TABLE'));
```

---
title: GET_DDL
source: https://docs.snowflake.com/en/sql-reference/functions/get_ddl.md
section: SQL Functions
---

Categories:
:   [Metadata functions](../functions-metadata.md)

# GET_DDL

Returns a DDL statement that can be used to recreate the specified object. For databases and schemas, GET_DDL is recursive
(that is, it returns the DDL statements for recreating all supported objects within the specified database/schema).

GET_DDL currently supports the following object types:

* Cortex Agents (see [CREATE AGENT](../sql/create-agent.md))
* Alerts (see [CREATE ALERT](../sql/create-alert.md))
* Databases (see [CREATE DATABASE](../sql/create-database.md)), including [catalog-linked databases](../sql/create-database-catalog-linked.md).
* Data metric functions (see [CREATE DATA METRIC FUNCTION](../sql/create-data-metric-function.md))
* Contacts (see [CREATE CONTACT](../sql/create-contact.md))
* dbt project objects (see [CREATE DBT PROJECT](../sql/create-dbt-project.md))
* Dynamic tables (see [CREATE DYNAMIC TABLE](../sql/create-dynamic-table.md))
* Event tables (see [CREATE EVENT TABLE](../sql/create-event-table.md))
* External tables (see [CREATE EXTERNAL TABLE](../sql/create-external-table.md))
* File formats (see [CREATE FILE FORMAT](../sql/create-file-format.md))
* Hybrid tables (see [CREATE HYBRID TABLE](../sql/create-hybrid-table.md))
* Apache Iceberg™ tables (see [CREATE ICEBERG TABLE](../sql/create-iceberg-table.md))
* Notebooks (see [CREATE NOTEBOOK](../sql/create-notebook.md))
* Online feature tables (see [CREATE ONLINE FEATURE TABLE](../sql/create-online-feature-table.md))
* Pipes (see [CREATE PIPE](../sql/create-pipe.md))
* Policies (see [CREATE AGGREGATION POLICY](../sql/create-aggregation-policy.md) , [CREATE AUTHENTICATION POLICY](../sql/create-authentication-policy.md) , [CREATE JOIN POLICY](../sql/create-join-policy.md) ,
  [CREATE MASKING POLICY](../sql/create-masking-policy.md) , [CREATE PASSWORD POLICY](../sql/create-password-policy.md) , [CREATE PRIVACY POLICY](../sql/create-privacy-policy.md) ,
  [CREATE PROJECTION POLICY](../sql/create-projection-policy.md) , [CREATE ROW ACCESS POLICY](../sql/create-row-access-policy.md) , [CREATE SESSION POLICY](../sql/create-session-policy.md),
  [CREATE STORAGE LIFECYCLE POLICY](../sql/create-storage-lifecycle-policy.md))
* Schemas (see [CREATE SCHEMA](../sql/create-schema.md))
* Semantic views (see [CREATE SEMANTIC VIEW](../sql/create-semantic-view.md))
* Sequences (see [CREATE SEQUENCE](../sql/create-sequence.md))
* Storage integrations (see [CREATE STORAGE INTEGRATION](../sql/create-storage-integration.md))
* Stored procedures (see [CREATE PROCEDURE](../sql/create-procedure.md))
* Streams (see [CREATE STREAM](../sql/create-stream.md))
* Tables (see [CREATE TABLE](../sql/create-table.md))
* Tags (see [CREATE TAG](../sql/create-tag.md))
* Tasks (see [CREATE TASK](../sql/create-task.md))
* UDFs, including external functions (see [CREATE FUNCTION](../sql/create-function.md))
* User-defined types (see [CREATE TYPE](../sql/create-type.md))
* Views (see [CREATE VIEW](../sql/create-view.md))
* Warehouses (see [CREATE WAREHOUSE](../sql/create-warehouse.md))

## Syntax

```sqlsyntax
GET_DDL( '<object_type>' , '[<namespace>.]<object_name>' [ , <use_fully_qualified_names_for_recreated_objects> ] )
```

## Arguments

**Required:**

`'object_type'`
:   Specifies the type of object for which the DDL is returned. Valid values (corresponding to the supported object types) are:

    * CORTEX_AGENT
    * CONTACT
    * DATABASE
    * DYNAMIC_TABLE
    * EVENT_TABLE
    * FILE_FORMAT
    * FUNCTION (for UDFs, including data metric functions and external functions)
    * ICEBERG_TABLE
    * INTEGRATION (storage)
    * PIPE
    * POLICY (aggregation, authentication, join, masking, password, projection, row access, session, and storage lifecycle policies)
    * PROCEDURE (for stored procedures)
    * SCHEMA
    * SEMANTIC VIEW
    * SEQUENCE
    * STREAM
    * TABLE (for tables, external tables, and hybrid tables)
    * TAG (object tagging)
    * TASK
    * TYPE
    * VIEW (for views and materialized views)
    * WAREHOUSE

`'namespace.object_name'`
:   Specifies the fully-qualified name of the object for which the DDL is returned.

    Namespace is the database and/or schema in which the object resides:

    * Not used for databases.
    * For schemas, takes the form of `database`.
    * For schema objects (tables, views, streams, tasks, sequences, file formats, pipes, policies, and UDFs), takes the form of
      `database.schema` or `schema`.

    Namespace is optional if a database and schema are currently in use within the user session; otherwise, it is required.

**Optional:**

`use_fully_qualified_names_for_recreated_objects`
:   If TRUE, the generated DDL statements use fully qualified names for the objects to be recreated.

    Default: FALSE.

    > **Note:**
    >
    > This does not affect the names of other objects referenced in the DDL statement (e.g. the name of a table referenced in
    > a view definition).

## Returns

Returns a string (a VARCHAR value) containing the text of the DDL statement that created the object.

For UDFs and stored procedures, the output might be slightly different from the original DDL. For example, if the UDF or stored
procedure contains JavaScript code, the delimiter characters around the JavaScript code might be different.

In addition, note that the DDL statement returned by the function might include default values for properties. For example, even
if the original CREATE PROCEDURE statement did not specify EXECUTE AS OWNER, the DDL statement returned by the function includes
EXECUTE AS OWNER, which is the default.

## Access control requirements

* For [semantic views](../../user-guide/views-semantic/overview.md), you must use a role that has been
  [granted the REFERENCES or OWNERSHIP privilege on the semantic view](../../user-guide/views-semantic/sql.md).

## Usage notes

The following notes apply to all supported objects:

* `object_type` and `object_name` (including `namespace` if specified) must be enclosed in single quotes.
* For `object_type`, `TABLE` and `VIEW` are interchangeable. If a `TABLE` object type is specified, and the object specified by name is a view, the function returns the DDL for
  the view and vice-versa.
* If `object_type` is `FUNCTION` (i.e. UDF) and the UDF has arguments, you must include the argument data types as part of the function name, in the form of
  `'function_name( [ arg_data_type [ , ... ] ] )'`, where `function_name` is the name of the function and `arg_data_type` is the data type of the argument.
* If `object_type` is `PROCEDURE` and the stored procedure has arguments, then you must include the
  argument data types as part of the function name, in the form of
  `'procedure_name( [ arg_data_type [ , ... ] ] )'`.
* Querying this function for most Snowflake object types requires the same minimum permissions needed to view the object (using [DESCRIBE <object>](../sql/desc.md) or [SHOW <objects>](../sql/show.md)).
  Snowflake restricts viewing special objects such as secure views to the owner, which is the role with the OWNERSHIP privilege on the object.
* When the returned DDL statement includes data type specifications, this function replaces data type aliases in the original
  statement with standard Snowflake data type names by default. If you want the returned DDL statement to include the data type
  aliases in the original statement, set the [ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS](../parameters.md) parameter to TRUE.

For Iceberg tables:

* If you specify a `TABLE` object that’s an Iceberg table, the function returns the DDL for the Iceberg table.
* If [BASE_LOCATION](../sql/create-iceberg-table-snowflake.md) was specified in the original CREATE ICEBERG TABLE statement,
  the function returns the original user input. Otherwise,
  the function returns the Snowflake-constructed file path (including the random 8-character string).
  For more information, see [Data and metadata directories](../../user-guide/tables-iceberg-storage.md).

For catalog-linked database:

* The output includes the LINKED_CATALOG options.
* For ALLOWED_NAMESPACES and BLOCKED_NAMESPACES, Snowflake doesn’t store nested namespaces if the set already contains the parent namespace.
  For example, if you create a database and specify `ALLOWED_NAMESPACES = ('ns1', 'ns1.ns2', 'ns1.ns3')`, Snowflake returns `ALLOWED_NAMESPACES = ('ns1')` in the GET_DDL output.
  The same applies for BLOCKED_NAMESPACES.

The following notes are specific to view objects. The query result always:

* Returns lowercase SQL text for `create or replace view`, even if the casing in the original SQL statement used to create the
  view was uppercase or mixed case.
* Includes the OR REPLACE clause.
* Includes the SECURE property, if the view is secure.
* Excludes the COPY GRANTS view parameter, even if the original CREATE VIEW statement specifies the COPY GRANTS parameter.
* Generates the column list.

  If a masking policy is set on a column, the result specifies the masking policy for the column.
* Removes in-line SQL comments before the view body (that is, before AS). For example, in the following code, the comment
  immediately prior to the AS clause is removed:

  ```sqlexample
  CREATE VIEW view_t1
    -- GET_DDL() removes this comment.
    AS SELECT * FROM t1;
  ```

The following notes apply specifically to table and view objects with a tag or policy:

* The role executing the GET_DDL query must have the global APPLY MASKING POLICY, APPLY ROW ACCESS POLICY, APPLY AGGREGATION POLICY, APPLY JOIN POLICY,
  APPLY PROJECTION POLICY, APPLY STORAGE LIFECYCLE POLICY, or APPLY TAG privilege and the USAGE privilege on the database and schema containing the policy or tag.
  Otherwise, Snowflake replaces the policy with `#UNKNOWN_POLICY` and the tag with `#UNKNOWN_TAG='#UNKNOWN_VALUE`. This text
  indicates that the column or the object is protected by a policy and a tag is set on the object or column. If this text is not removed
  prior to recreating the object, the CREATE OR REPLACE *<object>* statement fails.

  If this text is present in the GET_DDL query result, prior to recreating the object, consult with your internal governance administrator
  to determine which policies and tags are necessary for the columns or object. Finally, edit the GET_DDL query result and then recreate
  the object.

  Without the mentioned privileges, this table function does not return the corresponding row for the policy and tag assignments in the
  output of calling the function.
* When multiple tags are set on the object or column, the GET_DDL output sorts the tags alphabetically by tag name.
* Dropping a tag removes the tag from the GET_DDL output.
* If a tag is set on the table or view, the GET_DDL output for the table or view includes the tag assignments in the CREATE OR REPLACE
  statement.
* If a masking policy, row access policy, or storage lifecycle policy is set, the GET_DDL output includes the policy assignments using the WITH keyword.

When a tag is set on the database or the schema, the GET_DDL output includes:

* An ALTER DATABASE statement when the tag is set on the database.
* An ALTER DATABASE statement and an ALTER SCHEMA statement when the tag is set on both the database and schema.
* An ALTER SCHEMA statement when the tag is set on the schema.
* A CREATE OR REPLACE statement to generate the tag, if the tag exists in the database or schema.

The following apply to storage integrations:

* The command always returns the CREATE OR REPLACE STORAGE INTEGRATION syntax.
* If a STORAGE_AWS_EXTERNAL_ID was not specified during storage integration creation, this command returns the ID that was automatically
  generated during storage integration creation.

## Collation details

* Collation information is included in the input.

## Examples

The following examples demonstrate how to use this function to retrieve the DDL statement for an object:

* Cortex Agents
* Views
* Semantic views
* Schemas
* UDFs and stored procedures
* Masking policies
* Storage integrations
* Warehouses
* Hybrid tables

### Cortex Agents

Return the DDL used to create a Cortex Agent named `my_agent`:

```sqlexample
SELECT GET_DDL('CORTEX_AGENT', 'my_agent');

+------------------------------------------------------------------------+
| GET_DDL('CORTEX_AGENT', 'MY_AGENT')                                    |
+------------------------------------------------------------------------+
| CREATE OR REPLACE AGENT my_agent                                       |
| COMMENT = 'Test agent'                                                 |
| PROFILE = '{"display_name": "Test Agent", "color": "blue"}'            |
| FROM SPECIFICATION                                                     |
| $$                                                                     |
| models:                                                                |
|   orchestration: "llama3-8b"                                           |
| instructions:                                                          |
|   response: "You are a helpful test agent"                             |
|   system: "Respond in a friendly and concise manner"                   |
| tools:                                                                 |
|   - tool_spec:                                                         |
|       type: "cortex_analyst_text_to_sql"                               |
|       name: "Analyst1"                                                 |
| tool_resources:                                                        |
|   Analyst1:                                                            |
|     semantic_view: "db.schema.semantic_view"                           |
| $$;                                                                    |
+------------------------------------------------------------------------+
```

### Views

Return the DDL used to create a view named `books_view`:

```sqlexample
SELECT GET_DDL('VIEW', 'books_view');
+-----------------------------------------------------------------------------+
| GET_DDL('VIEW', 'BOOKS_VIEW')                                               |
|-----------------------------------------------------------------------------|
|                                                                             |
| CREATE OR REPLACE VIEW BOOKS_VIEW as select title, author from books_table; |
|                                                                             |
+-----------------------------------------------------------------------------+
```

### Semantic views

See [Getting the SQL statement for a semantic view](../../user-guide/views-semantic/sql.md).

### Schemas

Return the DDL used to create a schema named `books_schema` and the objects in the schema (the table `books_table`
and the view `books_view`):

```sqlexample
SELECT GET_DDL('SCHEMA', 'books_schema');
+-----------------------------------------------------------------------------+
| GET_DDL('SCHEMA', 'BOOKS_SCHEMA')                                           |
|-----------------------------------------------------------------------------|
| CREATE OR REPLACE SCHEMA BOOKS_SCHEMA;                                      |
|                                                                             |
| CREATE OR REPLACE TABLE BOOKS_TABLE (                                       |
| 	ID NUMBER(38,0),                                                          |
| 	TITLE VARCHAR(255),                                                       |
| 	AUTHOR VARCHAR(255)                                                       |
| );                                                                          |
|                                                                             |
| CREATE OR REPLACE VIEW BOOKS_VIEW as select title, author from books_table; |
|                                                                             |
+-----------------------------------------------------------------------------+
```

Return the DDL that uses fully-qualified names for the objects to be recreated:

```sqlexample
SELECT GET_DDL('SCHEMA', 'books_schema', true);
+---------------------------------------------------------------------------------------------------+
| GET_DDL('SCHEMA', 'BOOKS_SCHEMA', TRUE)                                                           |
|---------------------------------------------------------------------------------------------------|
| CREATE OR REPLACE SCHEMA BOOKS_DB.BOOKS_SCHEMA;                                                   |
|                                                                                                   |
| CREATE OR REPLACE TABLE BOOKS_DB.BOOKS_SCHEMA.BOOKS_TABLE (                                       |
| 	ID NUMBER(38,0),                                                                                |
| 	TITLE VARCHAR(255),                                                                             |
| 	AUTHOR VARCHAR(255)                                                                             |
| );                                                                                                |
|                                                                                                   |
| CREATE OR REPLACE VIEW BOOKS_DB.BOOKS_SCHEMA.BOOKS_VIEW as select title, author from books_table; |
|                                                                                                   |
+---------------------------------------------------------------------------------------------------+
```

> **Note:**
>
> As demonstrated in the example above, the DDL statement doesn’t use a fully-qualified name for the table used to create the
> view. To resolve the name of this table, Snowflake uses the name of the database and the name of the schema for the view.

### UDFs and stored procedures

Return the DDL used to create a UDF named `multiply` that has two arguments with the data type NUMBER:

```sqlexample
SELECT GET_DDL('FUNCTION', 'multiply(number, number)');

+--------------------------------------------------+
| GET_DDL('FUNCTION', 'MULTIPLY(NUMBER, NUMBER)')  |
+--------------------------------------------------+
| CREATE OR REPLACE "MULTIPLY"(A NUMBER, B NUMBER) |
| RETURNS NUMBER(38,0)                             |
| COMMENT='multiply two numbers'                   |
| AS 'a * b';                                      |
+--------------------------------------------------+
```

Return the DDL to create a stored procedure named `stproc_1` that has one argument with the data type FLOAT:

```sqlexample
SELECT GET_DDL('procedure', 'stproc_1(float)');
+---------------------------------------------------+
| GET_DDL('PROCEDURE', 'STPROC_1(FLOAT)')           |
|---------------------------------------------------|
| CREATE OR REPLACE PROCEDURE "STPROC_1"("F" FLOAT) |
| RETURNS FLOAT                                     |
| LANGUAGE JAVASCRIPT                               |
| EXECUTE AS OWNER                                  |
| AS '                                              |
| ''return F;''                                     |
| ';                                                |
+---------------------------------------------------+
```

### Masking policies

Return the DDL to create a masking policy named `employee_ssn_mask` to mask social security numbers. Masked values are seen unless the user’s current role is `payroll`.

```sqlexample
SELECT GET_DDL('POLICY', 'employee_ssn_mask');

+----------------------------------------------------------------------------+
|                   GET_DDL('POLICY', 'EMPLOYEE_SSN_MASK')                   |
+----------------------------------------------------------------------------+
| CREATE MASKING POLICY employee_ssn_mask AS (val string) RETURNS string ->  |
| case                                                                       |
|   when current_role() in ('PAYROLL')                                       |
|   then val                                                                 |
|   else '******'                                                            |
| end;                                                                       |
+----------------------------------------------------------------------------+
```

### Storage integrations

Return the DDL to create a storage integration named `s3_int` that creates an external AWS stage.

```sqlexample
SELECT GET_DDL('INTEGRATION', s3_int);

+----------------------------------------------------------------------------+
| GET_DDL('INTEGRATION', 's3_int')                                           |
|----------------------------------------------------------------------------|
| CREATE OR REPLACE STORAGE INTEGRATION s3_int                               |
|   TYPE = EXTERNAL_STAGE                                                    |
|   STORAGE_PROVIDER = 'S3'                                                  |
|   STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'           |
|   STORAGE_AWS_EXTERNAL_ID='ACCOUNT_SFCRole=2_kztjogs3W9S18I+iWapHpIz/wq4=' |
|   ENABLED = TRUE                                                           |
|   STORAGE_ALLOWED_LOCATIONS = ('s3://mybucket1/path1/');                   |
+----------------------------------------------------------------------------+
```

### Warehouses

Suppose that you execute the following statement to create a warehouse named `my_wh`:

```sqlexample
CREATE OR REPLACE WAREHOUSE my_wh
  WAREHOUSE_SIZE=LARGE
  INITIALLY_SUSPENDED=TRUE;
```

The following call to the GET_DDL function returns the DDL statement to recreate this warehouse:

```sqlexample
SELECT GET_DDL('WAREHOUSE', 'my_wh');
```

```output
+-------------------------------------------+
| GET_DDL('WAREHOUSE', 'MY_WH')             |
|-------------------------------------------|
| create or replace warehouse MY_WH         |
| with                                      |
|     warehouse_type='STANDARD'             |
|     warehouse_size='Large'                |
|     max_cluster_count=1                   |
|     min_cluster_count=1                   |
|     scaling_policy=STANDARD               |
|     auto_suspend=600                      |
|     auto_resume=TRUE                      |
|     initially_suspended=TRUE              |
|     enable_query_acceleration=FALSE       |
|     query_acceleration_max_scale_factor=8 |
|     max_concurrency_level=8               |
|     statement_queued_timeout_in_seconds=0 |
|     statement_timeout_in_seconds=172800   |
| ;                                         |
+-------------------------------------------+
```

Note that the statement returned by the GET_DDL function includes default values for the properties not specified in the CREATE
WAREHOUSE statement. For example, the CREATE WAREHOUSE statement did not specify the AUTO_RESUME property, so the returned
statement includes AUTO_RESUME=TRUE, which is the default value for this property.

### Hybrid tables

The following example shows the DDL that is returned for a hybrid table named `ht_weather`, which has a PRIMARY KEY
constraint on the `id` column.

```sqlexample
CREATE OR REPLACE HYBRID TABLE ht_weather
 (id INT PRIMARY KEY,
  start_time TIMESTAMP,
  precip NUMBER(3,2),
  city VARCHAR(20),
  county VARCHAR(20));
```

Note that the first argument to the function uses the `TABLE` type for hybrid tables.

```sqlexample
SELECT GET_DDL('TABLE','ht_weather');
```

The PRIMARY KEY constraint takes an out-of-line position in the output, after the column definitions.
See also [Constraints in GET_DDL](../constraints-overview.md).

```output
+---------------------------------------------+
| GET_DDL('TABLE','HT_WEATHER')               |
|---------------------------------------------|
| create or replace HYBRID TABLE HT_WEATHER ( |
|   ID NUMBER(38,0) NOT NULL,                 |
|   START_TIME TIMESTAMP_NTZ(9),              |
|   PRECIP NUMBER(3,2),                       |
|   CITY VARCHAR(20),                         |
|   COUNTY VARCHAR(20),                       |
|   primary key (ID)                          |
| );                                          |
+---------------------------------------------+
```

---
title: GET_IGNORE_CASE
source: https://docs.snowflake.com/en/sql-reference/functions/get_ignore_case.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# GET_IGNORE_CASE

Extracts a field value from an object; returns NULL if either of the arguments is NULL.

> **Note:**
>
> This function is similar to [GET](get.md) but applies case-insensitive matching to field names.

See also:
:   [GET](get.md)

## Syntax

**OBJECT (or VARIANT containing an OBJECT)**

```sqlsyntax
GET_IGNORE_CASE( <object> , <field_name> )

GET_IGNORE_CASE( <variant> , <field_name> )
```

**MAP**

```sqlsyntax
GET_IGNORE_CASE( <map> , <key> )
```

## Arguments

`variant`
:   An expression that evaluates to a [VARIANT](../data-types-semistructured.md) that contains either an ARRAY or an OBJECT.

`object`
:   An expression that evaluates to an [OBJECT](../data-types-semistructured.md) that contains key-value pairs.

`field_name`
:   An expression that evaluates to a VARCHAR. This specifies the key in a key-value pair for which you want to retrieve the value.

    `field_name` must not be an empty string.

    If `object` is a [structured OBJECT](../data-types-structured.md), you must specify a constant for
    `field_name`.

    If `object` does not contain the specified key:

    * If `object` is a semi-structured OBJECT, the function returns NULL.
    * If `object` is a structured OBJECT, an error occurs.

`map`
:   An expression that evaluates to a [MAP](../data-types-structured.md).

`key`
:   The key in a key-value pair for which you want to retrieve the value.

    If `map` does not contain the specified key, the function returns NULL.

## Returns

* The returned value is the specified element of the ARRAY, or the value that corresponds to the specified key of a key-value
  pair in the OBJECT.
* If the input object is a semi-structured OBJECT, ARRAY, or VARIANT value, the function returns a VARIANT value. The data type
  of the value is VARIANT because:

  + In an ARRAY value, each element is of type VARIANT.
  + In an OBJECT value, the value in each key-value pair is of type VARIANT.
* If the input object is a [structured OBJECT, structured ARRAY, or MAP](../data-types-structured.md),
  the function returns a value of the type specified for the object.

  For example, if the type of the input object is ARRAY(NUMBER), the function returns a NUMBER value.

## Usage notes

* This function returns the first exact match it finds. If the function only finds ambiguous (case-insensitive) matches, it returns the value for one of the matches; however, no guarantee can be made on which ambiguous field name is matched first.
* GET_IGNORE_CASE is a binary function that can be called in the following ways:

  + `object` is an OBJECT value and `field_name` is a string value, which can be a constant or an expression.

    This variation of GET_IGNORE_CASE extracts the value of the field with the provided name from the object value.
  + `v` is a VARIANT value and `field_name` is a string value, which can be a constant or an expression.

    Works similarly to GET_IGNORE_CASE with `object`, but additionally checks that `v` contains an object value (and returns NULL if `v` does not contain an object).

## Examples

Extract a field value from an object. The function returns the value for the exact match:

```sqlexample
SELECT GET_IGNORE_CASE(TO_OBJECT(PARSE_JSON('{"aa":1, "aA":2, "Aa":3, "AA":4}')),'aA') as output;

+--------+
| OUTPUT |
|--------|
| 2      |
+--------+
```

Extract a field value from an object. The function cannot find an exact match and so returns one of the ambiguous matches:

```sqlexample
SELECT GET_IGNORE_CASE(TO_OBJECT(PARSE_JSON('{"aa":1, "aA":2, "Aa":3}')),'AA') as output;

+--------+
| OUTPUT |
|--------|
| 3      |
+--------+
```

For more detailed examples, see [Querying Semi-structured Data](../../user-guide/querying-semistructured.md).

---
title: GET_JOB_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/get_job_history.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# GET_JOB_HISTORY

Returns the job history for [Snowpark Container Services jobs](../../developer-guide/snowpark-container-services/working-with-services.md) that ran within the specified time range. The function returns both the running and deleted job.

See also:
:   [Run a job service](../../developer-guide/snowpark-container-services/working-with-services.md)

## Syntax

```sqlsyntax
SNOWFLAKE.SPCS.GET_JOB_HISTORY(
  [ CREATED_TIME_START => <constant_expr> ],
  [ CREATED_TIME_END => <constant_expr> ],
  [ RESULT_LIMIT = <integer> ])
```

## Arguments

`CREATED_TIME_START => constant_expr`
:   Start time, in TIMESTAMP_LTZ format — for example, ‘2024-04-05 01:02:03’ — for the time range when jobs were created to retrieve the job history. For available functions to construct data, time, and timestamp data, see [Date & time functions](../functions-date-time.md).

    Default: 14 days from the current timestamp.

`CREATED_TIME_END => constant_expr`
:   End time, in TIMESTAMP_LTZ format, for the time range to retrieve the job history.

    Default: Current timestamp.

`RESULT_LIMIT => integer`
:   Maximum number of rows to return.

    If the number of matching rows exceeds the specified limit, only the jobs with the most recent timestamps are returned, up to the specified limit.

    Range: 1 to 10000

    Default: 100

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `QUERY_ID` | VARCHAR | ID of the EXECUTE JOB SERVICE SQL statement. |
| `ID` | NUMBER | Internal/system-generated identifier for the job. |
| `NAME` | VARCHAR | Name of the job. |
| `DATABASE_NAME` | VARCHAR | Name of the database in which the job is created. |
| `SCHEMA_NAME` | VARCHAR | Name of the schema in which the job is created. |
| `CREATED_TIME` | TIMESTAMP_LTZ | Time when the job was created. |
| `COMPLETED_TIME` | TIMESTAMP_LTZ | Time when the job was completed. |
| `DELETED_TIME` | TIMESTAMP_LTZ | Time when the job was deleted. |
| `STATUS` | VARCHAR | Staus of the job. |
| `MESSAGE` | VARCHAR | Additional information about the job status. |
| `INSTANCE_STATUSES` | OBJECT | Key-value pairs that describe job instances and containers. |
| `COMPUTE_POOL_NAME` | VARCHAR | Name of the compute pool where the job was run. |
| `OWNER` | VARCHAR | Role that owns the job. |
| `OWNER_ROLE_TYPE` | VARCHAR | Type of role that owns the job, either ROLE or DATABASE_ROLE. |
| `PARAMETERS` | OBJECT | Key-value pairs that describe the parameters that were specified when the job was created. |
| `MANAGING_OBJECT` | OBJECT | Key-value pairs that describe the managing object. NULL if the job isn’t managed by Snowflake. |

## Access control requirements

The PUBLIC role has the privilege to use this function.

Everyone can call this function, but the output depends on the current role.
The output only includes the jobs that are owned by the current role.

## Examples

* Returns the job history of all jobs created by the current role within the last 14 days
  (the default `CREATED_TIME_START` value).

  ```sqlexample
  SELECT * FROM TABLE(SNOWFLAKE.SPCS.GET_JOB_HISTORY(());
  ```

  The following example output shows only one job:

  ```output
  +--------------------------------------+-----+-------------+---------------+-------------+-------------------------------+-------------------------------+--------------+--------+-----------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------------------+-----------+-----------------+-----------------+-----------------+
  | QUERY_ID                             |  ID | NAME        | DATABASE_NAME | SCHEMA_NAME | CREATED_TIME                  | COMPLETED_TIME                | DELETED_TIME | STATUS | MESSAGE                     | INSTANCE_STATUSES                                                                                                                                               | COMPUTE_POOL_NAME     | OWNER     | OWNER_ROLE_TYPE | PARAMETERS      | MANAGING_OBJECT |
  |--------------------------------------+-----+-------------+---------------+-------------+-------------------------------+-------------------------------+--------------+--------+-----------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------------------+-----------+-----------------+-----------------+-----------------|
  | 01bd46d2-0004-be62-0000-ff07016490a6 | 131 | MY_TEST_JOB | TUTORIAL_DB   | DATA_SCHEMA | 2025-06-25 17:50:00.728 -0700 | 2025-06-25 17:50:10.515 -0700 | NULL         | DONE   | Job completed successfully. | {                                                                                                                                                               | TUTORIAL_COMPUTE_POOL | TEST_ROLE | ROLE            | {               | NULL            |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "failedInstances": 0,                                                                                                                                         |                       |           |                 |   "ASYNC": true |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "instances": [                                                                                                                                                |                       |           |                 | }               |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |     {                                                                                                                                                           |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |       "containers": [                                                                                                                                           |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |         {                                                                                                                                                       |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "containerName": "main",                                                                                                                              |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "image": "org-account.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_job_image:latest",                               |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "imageSha256": "sha256:ff07f19f233cfe76a889e39d9d7098d528312acc789f1c0cf929556a56c61a9a",                                                             |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "lastExitCode": 0,                                                                                                                                    |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "message": "Completed successfully",                                                                                                                  |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "restartCount": 0,                                                                                                                                    |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "startTime": "",                                                                                                                                      |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |           "status": "DONE"                                                                                                                                      |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |         }                                                                                                                                                       |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |       ],                                                                                                                                                        |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |       "instanceId": "0"                                                                                                                                         |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |     }                                                                                                                                                           |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   ],                                                                                                                                                            |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "pendingInstances": 0,                                                                                                                                        |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "runningInstances": 0,                                                                                                                                        |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "succeededInstances": 1,                                                                                                                                      |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             |   "totalInstances": 1                                                                                                                                           |                       |           |                 |                 |                 |
  |                                      |     |             |               |             |                               |                               |              |        |                             | }                                                                                                                                                               |                       |           |                 |                 |                 |
  +--------------------------------------+-----+-------------+---------------+-------------+-------------------------------+-------------------------------+--------------+--------+-----------------------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------+-----------------------+-----------+-----------------+-----------------+-----------------+
  ```
* Returns the job history of up to 10 jobs that are owned by the current role that ran within the last three days.

  ```sqlexample
  SELECT *
   FROM TABLE(snowflake.spcs.get_job_history(
              result_limit => 10,
              created_time_start => dateadd('day', -3, current_timestamp())
    ));
  ```
* Retrieves up to 10 jobs that ran between three days ago and one day ago, not including today.

  ```sqlexample
  SELECT * FROM TABLE(SNOWFLAKE.SPCS.GET_JOB_HISTORY(
  RESULT_LIMIT => 10,
  CREATED_TIME_START => DATEADD('day', -3, CURRENT_TIMESTAMP()),
  CREATED_TIME_END => DATEADD('day', -1, CURRENT_TIMESTAMP())));
  ```

---
title: GET_LINEAGE (SNOWFLAKE.CORE)
source: https://docs.snowflake.com/en/sql-reference/functions/get_lineage-snowflake-core.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# GET_LINEAGE (SNOWFLAKE.CORE)

Given a Snowflake object, returns data lineage information upstream or downstream from that object. Upstream means the
path of objects that led to the creation of the object; downstream means the path of objects that were created
from the object.

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.GET_LINEAGE(
    '<object_name>',
    '<object_domain>',
    '<direction>',
    [ <distance>, ]
    [ '<object_version>' ]
)
```

## Arguments

**Required:**

`'object_name'`
:   Name of the object for which data lineage information is retrieved. Use the fully qualified name if the object is in a
    schema different from the current schema in the session.

`'object_domain'`
:   The domain of the object. Supported domains are ‘COLUMN’, ‘TABLE’ (which includes all table-like objects including
    views and dynamic tables), ‘SEMANTIC_VIEW’ (for [semantic views](../../user-guide/views-semantic/overview.md)), and ‘STAGE’.

    For ML lineage, use `TABLE` for feature views (which are dynamic tables and views internally), ‘DATASET’, or ‘MODULE’ for
    models.

`'direction'`
:   The direction for which the lineage should be retained. Supported directions are ‘UPSTREAM’ and ‘DOWNSTREAM’.

**Optional:**

`distance`
:   The number of levels of lineage to retrieve. The maximum is 5; this is also the default.

`'object_version'`
:   For versioned objects, such as datasets and models, the version of the object for which lineage is retrieved. If not
    specified, the default version is used.

## Output

The output is a table with one row per object relationship in the lineage path (that is, an edge in the lineage graph).
Relationships are between objects designated as source and target in each row. The table includes the following columns:

| Column | Type | Description |
| --- | --- | --- |
| `SOURCE_OBJECT_DATABASE` | VARCHAR | The database that contains the source object. |
| `SOURCE_OBJECT_SCHEMA` | VARCHAR | The schema that contains the source object. |
| `SOURCE_OBJECT_NAME` | VARCHAR | The unqualified name of the source object. |
| `SOURCE_OBJECT_DOMAIN` | VARCHAR | The domain of the target object. Possible values are ‘COLUMN’, ‘TABLE’, ‘SEMANTIC_VIEW’, ‘DATASET’, ‘MODULE’ (for ML models), and ‘STAGE’. |
| `SOURCE_OBJECT_VERSION` | VARCHAR | The version of the source object, for versioned objects such as datasets and models. NULL if the source object is not versioned. |
| `SOURCE_COLUMN_NAME` | VARCHAR | The name of the source column, if the source object is a column. NULL if the source object is not a column. |
| `SOURCE_STATUS` | VARCHAR | The status of the source object. Possible values are ‘ACTIVE’ and ‘MASKED’. |
| `TARGET_OBJECT_DATABASE` | VARCHAR | The database that contains the target object. |
| `TARGET_OBJECT_SCHEMA` | VARCHAR | The schema that contains the target object. |
| `TARGET_OBJECT_NAME` | VARCHAR | The unqualified name of the target object. |
| `TARGET_OBJECT_DOMAIN` | VARCHAR | The domain of the target object. Possible values are ‘COLUMN’, ‘TABLE’, ‘SEMANTIC_VIEW’, ‘DATASET’, ‘MODULE’ (for ML models), and ‘STAGE’. |
| `TARGET_OBJECT_VERSION` | VARCHAR | The version of the target object, for versioned objects such as datasets and models. NULL if the target object is not versioned. |
| `TARGET_COLUMN_NAME` | VARCHAR | The name of the target column, if the target object is a column. NULL if the target object is not a column. |
| `TARGET_STATUS` | VARCHAR | The status of the target object. Possible values are ‘ACTIVE’ and ‘MASKED’. |
| `DISTANCE` | INTEGER | The distance of the target object from the source object in the lineage path. A direct relationship has a distance of 1. |
| `PROCESS` | VARIANT | Provides details about how lineage between the source object and target object was established. For example, it might include the query ID of a SQL query or the name of a stored procedure that moved data from the source object to the target object. |

## Usage notes

* You will receive an error message if the object does not exist, if the object is not accessible to the current user,
  if the object does not support data lineage, or if the object is not in the specified domain.
* The output table contains no rows if no lineage information is available for the specified object; this is not an error.
* `GET_LINEAGE` returns at most 10 million rows, each row representing an edge (relationship) in the lineage graph.
  If there are more than 10 million rows in the output, the function silently truncates output to 10 million rows.
* For limitations and considerations that apply to using this function, see
  [Lineage limitations and considerations](../../user-guide/ui-snowsight-lineage.md).

## Example

Assume you have created a table named TABLE_B from TABLE_A using CREATE TABLE AS SELECT, then created a table named
TABLE_C from TABLE_B in a similar manner. The following SQL query retrieves two steps of downstream lineage from
TABLE_A:

```sqlexample
SELECT
    DISTANCE,
    SOURCE_OBJECT_DOMAIN,
    SOURCE_OBJECT_DATABASE,
    SOURCE_OBJECT_SCHEMA,
    SOURCE_OBJECT_NAME,
    SOURCE_STATUS,
    TARGET_OBJECT_DOMAIN,
    TARGET_OBJECT_DATABASE,
    TARGET_OBJECT_SCHEMA,
    TARGET_OBJECT_NAME,
    TARGET_STATUS,
FROM TABLE (SNOWFLAKE.CORE.GET_LINEAGE('my_database.sch.table_a', 'TABLE', 'DOWNSTREAM', 2));
```

The output is similar to the following:

```output
+----------+----------------------+------------------------+----------------------+--------------------+---------------+----------------------+------------------------+----------------------+--------------------+---------------+
| DISTANCE | SOURCE_OBJECT_DOMAIN | SOURCE_OBJECT_DATABASE | SOURCE_OBJECT_SCHEMA | SOURCE_OBJECT_NAME | SOURCE_STATUS | TARGET_OBJECT_DOMAIN | TARGET_OBJECT_DATABASE | TARGET_OBJECT_SCHEMA | TARGET_OBJECT_NAME | TARGET_STATUS |
|----------+----------------------+------------------------+----------------------+--------------------+---------------+----------------------+------------------------+----------------------+--------------------+---------------|
|        1 | TABLE                | MY_DATABASE            | SCH                  | TABLE_A            | ACTIVE        | TABLE                | MY_DATABASE            | SCH                  | TABLE_B            | ACTIVE        |
|        2 | TABLE                | MY_DATABASE            | SCH                  | TABLE_B            | ACTIVE        | TABLE                | MY_DATABASE            | SCH                  | TABLE_C            | ACTIVE        |
+----------+----------------------+------------------------+----------------------+--------------------+---------------+----------------------+------------------------+----------------------+--------------------+---------------+
```

---
title: GET_OBJECT_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/get_object_references.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Object Modeling)

# GET_OBJECT_REFERENCES

Returns a list of objects that a specified object references. Input is currently limited to the name of a view.

The following table identifies which types of database objects are currently returned in the output:

| Object Type | Returned in Output? |
| --- | --- |
| Tables | Yes |
| Views (including secure views) | Yes |
| Materialized views | No |
| Named stages (internal or external) | No |
| Streams | No |
| User-defined functions (UDF) / user-defined table functions (UDTF) | No |

## Syntax

```sqlsyntax
GET_OBJECT_REFERENCES(
  DATABASE_NAME => '<string>'
  , SCHEMA_NAME => '<string>'
  , OBJECT_NAME => '<string>' )
```

## Arguments

`DATABASE_NAME => 'string'`
:   Name of the database in which the schema and object reside.

`SCHEMA_NAME => 'string'`
:   Name of the schema in which the object resides.

`OBJECT_NAME => 'string'`
:   Name of the object. Currently limited to the name of a view (secure or not secure).

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_NAME | TEXT | Name of the database that contains the queried object. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the queried object. |
| OBJECT_NAME | TEXT | Name of the queried object. |
| REFERENCED_DATABASE_NAME | TEXT | Name of the database containing an object that the queried object references. |
| REFERENCED_SCHEMA_NAME | TEXT | Name of the schema containing an object that the queried object references. |
| REFERENCED_OBJECT_NAME | TEXT | Name of an object that the queried object references. |
| REFERENCED_OBJECT_TYPE | TEXT | Type of object identified in the REFERENCED_OBJECT_NAME column. Values include TABLE or VIEW. |

## Usage notes

* This function requires the following privileges:

  + SELECT on the view. To obtain references for a view, the role in use or a role granted to the role in use must have the SELECT
    privilege on the view. For details, refer to [Table privileges](../../user-guide/security-access-control-privileges.md) and [View privileges](../../user-guide/security-access-control-privileges.md).
  + OWNERSHIP on the secure view. If the dependency chain references any secure view, the role in use or a role granted to the role in
    use must have the OWNERSHIP privilege on the secure view. Otherwise, Snowflake returns this error message:

    ```none
    Insufficient privileges to operate on view '<view_name>'
    ```
* The `DATABASE_NAME`, `SCHEMA_NAME`, and `OBJECT_NAME` values must be enclosed in single quotes. Also, if any of these names contains any spaces, mixed-case characters, or special characters, the name must be double-quoted within the single quotes (e.g. `'"My DB"'` vs `'mydb'`).
* If the view references stages, UDFs, or materialized views, this function returns an error, rather than returning
  a list of referenced tables and views.

## Examples

Return the list of references for a view:

> ```sqlexample
> -- create a database
> create or replace database ex1_gor_x;
> use database ex1_gor_x;
> use schema PUBLIC;
>
> -- create a set of tables
> create or replace table x_tab_a (mycol int not null);
> create or replace table x_tab_b (mycol int not null);
> create or replace table x_tab_c (mycol int not null);
>
> -- create views with increasing complexity of references
> create or replace view x_view_d as
> select * from x_tab_a
> join x_tab_b
> using ( mycol );
>
> create or replace view x_view_e as
> select x_tab_b.* from x_tab_b, x_tab_c
> where x_tab_b.mycol=x_tab_c.mycol;
>
> --create a second database
> create or replace database ex1_gor_y;
> use database ex1_gor_y;
> use schema PUBLIC;
>
> -- create a table in the second database
> create or replace table y_tab_a (mycol int not null);
>
> -- create more views with increasing levels of references
> create or replace view y_view_b as
> select * from ex1_gor_x.public.x_tab_a
> join y_tab_a
> using ( mycol );
>
> create or replace view y_view_c as
> select b.* from ex1_gor_x.public.x_tab_b b, ex1_gor_x.public.x_tab_c c
> where b.mycol=c.mycol;
>
> create or replace view y_view_d as
> select * from ex1_gor_x.public.x_view_e;
>
> create or replace view y_view_e as
> select e.* from ex1_gor_x.public.x_view_e e, y_tab_a
> where e.mycol=y_tab_a.mycol;
>
> create or replace view y_view_f as
> select e.* from ex1_gor_x.public.x_view_e e, ex1_gor_x.public.x_tab_c c, y_tab_a
> where e.mycol=y_tab_a.mycol
> and e.mycol=c.mycol;
>
> -- retrieve the references for the last view created
> select * from table(get_object_references(database_name=>'ex1_gor_y', schema_name=>'public', object_name=>'y_view_f'));
>
> +---------------+-------------+-----------+--------------------------+------------------------+------------------------+------------------------+
> | DATABASE_NAME | SCHEMA_NAME | VIEW_NAME | REFERENCED_DATABASE_NAME | REFERENCED_SCHEMA_NAME | REFERENCED_OBJECT_NAME | REFERENCED_OBJECT_TYPE |
> |---------------+-------------+-----------+--------------------------+------------------------+------------------------+------------------------|
> | EX1_GOR_Y     | PUBLIC      | Y_VIEW_F  | EX1_GOR_Y                | PUBLIC                 | Y_TAB_A                | TABLE                  |
> | EX1_GOR_Y     | PUBLIC      | Y_VIEW_F  | EX1_GOR_X                | PUBLIC                 | X_TAB_B                | TABLE                  |
> | EX1_GOR_Y     | PUBLIC      | Y_VIEW_F  | EX1_GOR_X                | PUBLIC                 | X_TAB_C                | TABLE                  |
> | EX1_GOR_Y     | PUBLIC      | Y_VIEW_F  | EX1_GOR_X                | PUBLIC                 | X_VIEW_E               | VIEW                   |
> +---------------+-------------+-----------+--------------------------+------------------------+------------------------+------------------------+
> ```

---
title: GET_PATH , :
source: https://docs.snowflake.com/en/sql-reference/functions/get_path.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# GET_PATH , `:`

Extracts a value from semi-structured data using a path name.

GET_PATH is a variation of [GET](get.md); it takes a [VARIANT](../data-types-semistructured.md), [OBJECT](../data-types-semistructured.md),
or [ARRAY](../data-types-semistructured.md) column name as the first argument, and extracts the value of the field or the
element according to the path name provided as the second argument.

## Syntax

```sqlsyntax
GET_PATH( <column_identifier> , '<path_name>' )

<column_identifier>:<path_name>

:( <column_identifier> , '<path_name>' )
```

## Arguments

`column_identifier`
:   An expression that evaluates to a VARIANT, OBJECT, or ARRAY column.

`path_name`
:   An expression that evaluates to a VARCHAR value. This value specifies the path to the field or element
    that you want to extract.

    For [structured types](../data-types-structured.md), you must specify a string constant.

## Returns

* The returned value is the specified element of the ARRAY, or the value that corresponds to the specified key of a key-value
  pair in the OBJECT.
* If the input object is a semi-structured OBJECT, ARRAY, or VARIANT value, the function returns a VARIANT value. The data type
  of the value is VARIANT because:

  + In an ARRAY value, each element is of type VARIANT.
  + In an OBJECT value, the value in each key-value pair is of type VARIANT.
* If the input object is a [structured OBJECT, structured ARRAY, or MAP](../data-types-structured.md),
  the function returns a value of the type specified for the object.

  For example, if the type of the input object is ARRAY(NUMBER), the function returns a NUMBER value.

## Usage notes

* GET_PATH is equivalent to a chain of [GET](get.md) functions. It returns NULL if the path name doesn’t correspond to any element.
* The path name syntax is standard JavaScript notation; it consists of a concatenation of field names (identifiers) preceded by
  periods (for example, `.`) and index operators (for example, `[<index>]`):

  + The first field name doesn’t require the leading period to be specified.
  + The index values in the index operators can be non-negative decimal numbers (for arrays) or single or double-quoted
    string literals (for object fields).

  For more details, see [Querying Semi-structured Data](../../user-guide/querying-semistructured.md).
* GET_PATH also supports a syntactic shortcut using the `:` character as the extraction operator that separates the column
  name (which can contain periods) from the path specifier.

  To maintain syntactic consistency, the path notation also supports SQL-style double-quoted identifiers, and use of `:` as path separators.

  When the `:` operator is used, any integer or string sub-expressions can be included within `[]`.

## Examples

Create a table with a VARIANT column and insert data. Use the [PARSE_JSON](parse_json.md) function to insert the VARIANT data.
The VARIANT values contain nested ARRAY values and OBJECT values.

```sqlexample
CREATE OR REPLACE TABLE get_path_demo(
  id INTEGER,
  v  VARIANT);

INSERT INTO get_path_demo (id, v)
  SELECT 1,
         PARSE_JSON('{
           "array1" : [
             {"id1": "value_a1", "id2": "value_a2", "id3": "value_a3"}
           ],
           "array2" : [
             {"id1": "value_b1", "id2": "value_b2", "id3": "value_b3"}
           ],
           "object_outer_key1" : {
             "object_inner_key1a": "object_x1",
             "object_inner_key1b": "object_x2"
           }
         }');

INSERT INTO get_path_demo (id, v)
  SELECT 2,
         PARSE_JSON('{
           "array1" : [
             {"id1": "value_c1", "id2": "value_c2", "id3": "value_c3"}
           ],
           "array2" : [
             {"id1": "value_d1", "id2": "value_d2", "id3": "value_d3"}
           ],
           "object_outer_key1" : {
             "object_inner_key1a": "object_y1",
             "object_inner_key1b": "object_y2"
           }
         }');

SELECT * FROM get_path_demo;
```

```output
+----+----------------------------------------+
| ID | V                                      |
|----+----------------------------------------|
|  1 | {                                      |
|    |   "array1": [                          |
|    |     {                                  |
|    |       "id1": "value_a1",               |
|    |       "id2": "value_a2",               |
|    |       "id3": "value_a3"                |
|    |     }                                  |
|    |   ],                                   |
|    |   "array2": [                          |
|    |     {                                  |
|    |       "id1": "value_b1",               |
|    |       "id2": "value_b2",               |
|    |       "id3": "value_b3"                |
|    |     }                                  |
|    |   ],                                   |
|    |   "object_outer_key1": {               |
|    |     "object_inner_key1a": "object_x1", |
|    |     "object_inner_key1b": "object_x2"  |
|    |   }                                    |
|    | }                                      |
|  2 | {                                      |
|    |   "array1": [                          |
|    |     {                                  |
|    |       "id1": "value_c1",               |
|    |       "id2": "value_c2",               |
|    |       "id3": "value_c3"                |
|    |     }                                  |
|    |   ],                                   |
|    |   "array2": [                          |
|    |     {                                  |
|    |       "id1": "value_d1",               |
|    |       "id2": "value_d2",               |
|    |       "id3": "value_d3"                |
|    |     }                                  |
|    |   ],                                   |
|    |   "object_outer_key1": {               |
|    |     "object_inner_key1a": "object_y1", |
|    |     "object_inner_key1b": "object_y2"  |
|    |   }                                    |
|    | }                                      |
+----+----------------------------------------+
```

Extract the `id3` value from `array2` in each row:

```sqlexample
SELECT id,
       GET_PATH(
         v,
         'array2[0].id3') AS id3_in_array2
  FROM get_path_demo;
```

```output
+----+---------------+
| ID | ID3_IN_ARRAY2 |
|----+---------------|
|  1 | "value_b3"    |
|  2 | "value_d3"    |
+----+---------------+
```

Use the `:` operator to extract the same `id3` value from `array2` in each row:

```sqlexample
SELECT id,
       v:array2[0].id3 AS id3_in_array2
  FROM get_path_demo;
```

```output
+----+---------------+
| ID | ID3_IN_ARRAY2 |
|----+---------------|
|  1 | "value_b3"    |
|  2 | "value_d3"    |
+----+---------------+
```

This example is the same as the previous example, but uses SQL-style double-quoted identifiers:

```sqlexample
SELECT id,
       v:"array2"[0]."id3" AS id3_in_array2
  FROM get_path_demo;
```

```output
+----+---------------+
| ID | ID3_IN_ARRAY2 |
|----+---------------|
|  1 | "value_b3"    |
|  2 | "value_d3"    |
+----+---------------+
```

Extract the `object_inner_key1a` value from the nested OBJECT value in each row:

```sqlexample
SELECT id,
       GET_PATH(
         v,
         'object_outer_key1:object_inner_key1a') AS object_inner_key1A_values
  FROM get_path_demo;
```

```output
+----+---------------------------+
| ID | OBJECT_INNER_KEY1A_VALUES |
|----+---------------------------|
|  1 | "object_x1"               |
|  2 | "object_y1"               |
+----+---------------------------+
```

Use the `:` operator to extract the same `object_inner_key1a` values:

```sqlexample
SELECT id,
       v:object_outer_key1.object_inner_key1a AS object_inner_key1a_values
  FROM get_path_demo;
```

```output
+----+---------------------------+
| ID | OBJECT_INNER_KEY1A_VALUES |
|----+---------------------------|
|  1 | "object_x1"               |
|  2 | "object_y1"               |
+----+---------------------------+
```

This example is the same as the previous example, but uses SQL-style double-quoted identifiers:

```sqlexample
SELECT id,
       v:"object_outer_key1":"object_inner_key1a" AS object_inner_key1a_values
  FROM get_path_demo;
```

```output
+----+---------------------------+
| ID | OBJECT_INNER_KEY1A_VALUES |
|----+---------------------------|
|  1 | "object_x1"               |
|  2 | "object_y1"               |
+----+---------------------------+
```

---
title: GET_PRESIGNED_URL
source: https://docs.snowflake.com/en/sql-reference/functions/get_presigned_url.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# GET_PRESIGNED_URL

Generates a pre-signed URL to a file on a stage using the stage name and relative file path as inputs.

Access files in a stage using any of the following methods:

* Navigate to the pre-signed URL directly in a web browser.
* Retrieve a pre-signed URL in Snowsight. Click on the pre-signed URL in the results table.
* Send the pre-signed URL in a request to the REST API for file support.

> **Note:**
>
> When calling this function for files in an external stage that references Microsoft Azure cloud storage: This function returns
> output only when the Azure container that stores the blob object is accessed using a storage integration; querying the function
> fails if the container is accessed using a shared access signature (SAS) token you generate.
>
> The GET_PRESIGNED_URL function requires Azure Active Directory authentication to create the user delegation SAS token. For this purpose,
> a storage integration object stores a generated service principal for your Azure cloud storage. The Snowflake service principal is
> granted a role that includes the `Microsoft.Storage/storageAccounts/blobServices/generateUserDelegationKey` permission (or *action*).
> Both the `Storage Blob Data Reader` and `Storage Blob Data Contributor` roles include this permission. For more information about
> this permission, see the
> [Microsoft documentation](https://docs.microsoft.com/en-us/rest/api/storageservices/get-user-delegation-key#authorization).
>
> For more information about accessing an Azure container, see [Configure an Azure container for loading data](../../user-guide/data-load-azure-config.md).

> **Note:**
>
> For Microsoft Fabric OneLake stages, pre-signed URLs have a maximum expiration time of 60 minutes (3600 seconds) because of user
> delegation key constraints in Microsoft Fabric. If you specify a longer expiration time, the function returns an error.

## Syntax

```sqlsyntax
GET_PRESIGNED_URL( @<stage_name> , '<relative_file_path>' , [ <expiration_time> ] )
```

## Arguments

`stage_name`
:   Name of the internal or external stage where the file is stored.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a stage
    > named `"my stage"`).

`relative_file_path`
:   Path and filename of the file relative to its location on the stage.

`expiration_time`
:   Length of time (in seconds) after which the short term access token expires.

    Default value: `3600` (60 minutes).

    Maximum value: If the stage uses an AWS IAM role (`AWS_ROLE`) to securely connect to your S3 bucket,
    the maximum expiration time is `3600` (60 minutes).

    For Microsoft Fabric OneLake stages, the maximum expiration time
    is `3600` (60 minutes). Otherwise, the maximum expiration
    time is `604800` (7 days).

## Returns

Pre-signed URL of the staged file.

> **Note:**
>
> This SQL function generates a pre-signed URL for the file path that you specify, even if the file does not exist on the stage.
> To ensure that the generated URL returns the expected file, open the URL in a web browser. If the file does not exist,
> the browser returns a `NoSuchKey` error in XML format.

## Usage notes

* Server-side encryption is required on the internal or external stage. For details, see [CREATE STAGE](../sql/create-stage.md).
* This SQL function returns a value for any role that has the following privilege on the stage:

  External stage:
  :   USAGE

  Internal stage:
  :   READ

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

### Querying the function

```sqlexample
SELECT GET_PRESIGNED_URL(@images_stage, 'us/yosemite/half_dome.jpg', 3600);

+================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================-------+
| GET_PRESIGNED_URL(@IMAGES_STAGE, 'US/YOSEMITE/HALF_DOME.JPG', 3600)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       |
|================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================-------|
| http://myaccount.s3.amazonaws.com/national_parks/us/yosemite/half_dome.jpg?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxAus-west-xxxxxxxxxaws1_request&X-Amz-Date=20200625T162738Z&X-Amz-Expires=3600&X-Amz-Security-Token=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx-Amz-SignedHeaders=host&X-Amz-Signature=xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx   |
+================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================================-------+
```

### Loading metadata for an image file and retrieving the pre-signed URL

Use the API for your cloud storage service to generate a list of JSON documents that contain the metadata extracted from the images.

For example, suppose the JSON document for one bitmap image file is as follows:

```sqljson
{
  "file_url": "s3://photos/national_parks/us/yosemite/half_dome.jpg",
  "image_format": "jpeg",
  "dimensions": {"x" : 1024, "y" : 768},
  "tags":[
    "rock",
    "cliff",
    "valley"
  ],
  "dominant_color": "gray"
}
```

Create a table for the image metadata, load the metadata into the table, and generate the pre-signed URL for the image:

```sqlexample
-- Create a table to store the file metadata

  CREATE TABLE images_table
  (
      file_url string,
      image_format string,
      dimensions_X number,
      dimensions_Y number,
      tags array,
      dominant_color string,
      relative_path string
  );

-- Load the metadata from the JSON document into the table.

COPY INTO images_table
  FROM
  (SELECT $1:file_url::STRING, $1:image_format::STRING, $1:size::NUMBER, $1:tags, $1:dominant_color::STRING, GET_RELATIVE_PATH(@images_stage, $1:file_url)
  FROM
  @images_stage/image_metadata.json)
  FILE_FORMAT = (type = json);

-- Create a view that queries the pre-signed URL for an image as well as the image metadata stored in a table.
CREATE VIEW image_catalog AS
(
  SELECT
   size,
   get_presigned_url(@images_stage, relative_path) AS presigned_url,
   tags
  FROM
    images_table
);
```

---
title: GET_PYTHON_PROFILER_OUTPUT (SNOWFLAKE.CORE)
source: https://docs.snowflake.com/en/sql-reference/functions/get_python_profiler_output.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# GET_PYTHON_PROFILER_OUTPUT (SNOWFLAKE.CORE)

Returns output containing a report generated by the [Python code profiler](../../developer-guide/stored-procedure/python/procedure-python-profiler.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.GET_PYTHON_PROFILER_OUTPUT(<query_id>)
```

## Arguments

`query_id`
:   Query ID of the stored procedure query for which profiling was enabled.

## Returns

A string of type VARCHAR with the report generated by the [code profiler](../../developer-guide/stored-procedure/python/procedure-python-profiler.md).

## Access control requirements

You must use the ACCOUNTADMIN role to call this function.

## Examples

When the profiler is set to profile memory, rather than time, the setting looks something like the following.

```output
Handler Name: main
Python Runtime Version: 3.8
Modules Profiled: ['main_module']
File: _udf_code.py
Function: main at line 4

Line #   Mem usage    Increment  Occurrences  Line Contents
=============================================================
    4    245.3 MiB    245.3 MiB           1   def main(session, last_n, total):
    5                                             # create sample dataset to emulate id + elapsed time
    6    245.8 MiB      0.5 MiB           1       session.sql('''
    7                                                 CREATE OR REPLACE TABLE sample_query_history (query_id INT, elapsed_time FLOAT)''').collect()
    8    245.8 MiB      0.0 MiB           2       session.sql('''
    9                                             INSERT INTO sample_query_history
    10                                             SELECT
    11                                             seq8() AS query_id,
    12                                             uniform(0::float, 100::float, random()) as elapsed_time
    13    245.8 MiB      0.0 MiB           1       FROM table(generator(rowCount => {0}));'''.format(total)).collect()
    14
    15                                             # get the mean of the last n query elapsed time
    16    245.8 MiB      0.0 MiB           3       df = session.table('sample_query_history').select(
    17    245.8 MiB      0.0 MiB           1           funcs.col('query_id'),
    18    245.8 MiB      0.0 MiB           2           funcs.col('elapsed_time')).limit(last_n)
    19
    20    327.9 MiB     82.1 MiB           1       pandas_df = df.to_pandas()
    21    328.9 MiB      1.0 MiB           1       mean_time = pandas_df.loc[:, 'ELAPSED_TIME'].mean()
    22    320.9 MiB     -8.0 MiB           1       del pandas_df
    23    320.9 MiB      0.0 MiB           1       return mean_time
```

---
title: GET_QUERY_OPERATOR_STATS
source: https://docs.snowflake.com/en/sql-reference/functions/get_query_operator_stats.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Query Information) ,
    [Table functions](../functions-table.md)

# GET_QUERY_OPERATOR_STATS

Returns statistics about individual query operators within a query that has completed. You can run this function for any
completed query that was executed in the past 14 days.

You can use this information to understand the structure of a query and identify query operators — for example, the join operator —
that cause performance problems.

For example, you can use this information to determine which operators are consuming the most resources. As another example, you
can use this function to identify joins that have more output rows than input rows, which can be a sign of an
[“exploding” join](../../user-guide/ui-snowsight-activity.md); for example, an unintended Cartesian product.

These statistics are also available in the [query profile](../../user-guide/ui-snowsight-activity.md) tab in Snowsight.
The `GET_QUERY_OPERATOR_STATS()` function makes the same information available via a programmatic interface.

For more information about finding problematic query operators,
see [Common query problems identified by Query Profile](../../user-guide/ui-snowsight-activity.md).

## Syntax

```sqlsyntax
GET_QUERY_OPERATOR_STATS( <query_id> )
```

## Arguments

`query_id`
:   The ID of a query. You can use:

    * A string literal (a string surrounded by single quotes).
    * A [session variable](../session-variables.md) containing a query ID.
    * The return value from a call to the [LAST_QUERY_ID](last_query_id.md) function.

## Returns

The GET_QUERY_OPERATOR_STATS function is a [table function](../functions-table.md). It returns rows with
statistics about each query operator in the query. For more information, see the
Usage notes and Output sections below.

## Usage notes

* This function returns statistics only for queries that have completed.
* You must have OPERATE or MONITOR privileges on the warehouse where you ran the query.
* This function provides detailed statistics about each query operator used in the specified query. The following list
  shows the possible query operators:

  + Aggregate: Groups inputs and computes aggregate functions.
  + CartesianJoin: A specialized type of join.
  + Delete: Removes a record from a table.
  + ExternalFunction: Represents processing by an external function.
  + ExternalScan: Represents access to data stored in stage objects.
  + Filter: Represents an operation that filters the rows.
  + Flatten: Processes VARIANT records, possibly flattening them on a specified path.
  + Generator: Generates records by using the TABLE([GENERATOR(…)](generator.md)) construct.
  + GroupingSets: Represents constructs, such as GROUPING SETS, ROLLUP, and CUBE.
  + Insert: Adds a record to a table either through an INSERT or COPY operation.
  + InternalObject: Represents access to an internal data object; for example, in an [Information Schema](../info-schema.md)
    or the result of a previous query.
  + Join: Combines two inputs on a given condition.
  + JoinFilter: Special filtering operation that removes tuples that can be identified as not possibly matching the condition of a
    Join further in the query plan.
  + Merge: Performs a MERGE operation on a table.
  + Pivot: Transforms unique values from a column into multiple columns and does any necessary aggregation.
  + Result: Returns the query result.
  + Sort: Orders input on a given expression.
  + SortWithLimit: Produces a part of the input sequence after sorting, typically a result of an
    `ORDER BY ... LIMIT ... OFFSET ...` construct.
  + TableScan: Represents access to a single table.
  + UnionAll: Concatenates two inputs.
  + Unload: Represents a COPY operation that exports data from a table to a file in a stage.
  + Unpivot: Rotates a table by transforming columns into rows.
  + Update: Updates a record in a table.
  + ValuesClause: List of values provided with the VALUES clause.
  + WindowFunction: Computes window functions.
  + WithClause: Precedes the body of the SELECT statement, and defines one or more CTEs.
  + WithReference: Instance of a WITH clause.
* The information is returned as a table. Each row in the table corresponds to one operator. The row contains the execution
  breakdown and the query statistics for that operator.

  The row may also list operator attributes (these depend on the
  type of operator).

  Statistics that break down query execution time are expressed as a percentage of the total query execution time.

  For more information about specific statistics, see Output (in this topic).
* Because this function is a table function, you must use it in a [FROM](../constructs/from.md) clause and you must wrap
  it in `TABLE()`. For example:

  ```sqlexample
  SELECT * FROM TABLE(GET_QUERY_OPERATOR_STATS(last_query_id()));
  ```

* For each individual execution of a specific query (i.e. a specific UUID), this function is deterministic; it returns the same
  values each time.

  However, for different executions of the same query text, this function can return different runtime statistics. The statistics
  depend on many factors. The following factors can have a major impact on the execution and therefore on the statistics returned by
  this function:

  + The volume of data.
  + The availability of [materialized views](../../user-guide/views-materialized.md), and the changes (if any) to the data since
    those materialized views were last refreshed.
  + The presence or absence of [clustering](../../user-guide/tables-clustering-keys.md).
  + The presence or absence of previously-cached data.
  + The size of the virtual warehouse.

  The values can also be affected by factors outside the user’s query and data. These factors are usually small. The factors
  include:

  + Virtual warehouse initialization time.
  + Latency with external functions.

## Output

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | The query ID, which is an internal, system-generated identifier for the SQL statement. |
| STEP_ID | NUMBER(38, 0) | Identifier of the step in the query plan. |
| OPERATOR_ID | NUMBER(38, 0) | The operator’s identifier. This is unique within the query. Values start at 0. |
| PARENT_OPERATORS | ARRAY containing one or more NUMBER(38, 0) | Identifiers of the parent operators for this operator, or NULL if this is the final operator in the query plan (which is usually the Result operator). |
| OPERATOR_TYPE | VARCHAR | The type of query operator; for example, `TableScan` or `Filter`. |
| OPERATOR_STATISTICS | VARIANT containing an OBJECT | Statistics about the operator (for example, the number of output rows from the operator). |
| EXECUTION_TIME_BREAKDOWN | VARIANT containing an OBJECT | Information about the execution time of the operator. |
| OPERATOR_ATTRIBUTES | VARIANT containing an OBJECT | Information about the operator. This information depends on the operator type. |

If there is no information for the specific column for the operator, the value is NULL.

Three of these columns contain [OBJECTs](../data-types-semistructured.md). Each object contains key/value pairs. The tables below
describe the keys in these objects.

### OPERATOR_STATISTICS

The fields in the OBJECTs for the `OPERATOR_STATISTICS` column provide additional information about the operator. The
information can include:

| Key | Nested key (if applicable) | Data type | Description |
| --- | --- | --- | --- |
| `dml` |  |  | Statistics for Data Manipulation Language (DML) queries. |
|  | `number_of_rows_inserted` | DOUBLE | Number of rows inserted into a table or tables. |
|  | `number_of_rows_updated` | DOUBLE | Number of rows updated in a table. |
|  | `number_of_rows_deleted` | DOUBLE | Number of rows deleted from a table. |
|  | `number_of_rows_unloaded` | DOUBLE | Number of rows unloaded during data export. |
| `extension_functions` |  |  | Information about calls to extension functions. If the value of a field is zero, then the field is not displayed. |
|  | `Java UDF handler load time` | DOUBLE | Amount of time for the Java UDF handler to load. |
|  | `Total Java UDF handler invocations` | DOUBLE | Number of times the Java UDF handler is invoked. |
|  | `Max Java UDF handler execution time` | DOUBLE | Maximum amount of time for the Java UDF handler to execute. |
|  | `Avg Java UDF handler execution time` | DOUBLE | Average amount of time to execute the Java UDF handler. |
|  | `Java UDTF process() invocations` | DOUBLE | Number of times the Java UDTF [process method](../../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked. |
|  | `Java UDTF process() execution time` | DOUBLE | Amount of time to execute the Java UDTF process. |
|  | `Avg Java UDTF process() execution time` | DOUBLE | Average amount of time to execute the Java UDTF process. |
|  | `Java UDTF's constructor invocations` | DOUBLE | Number of times the Java UDTF [constructor](../../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked. |
|  | `Java UDTF's constructor execution time` | DOUBLE | Amount of time to execute the Java UDTF constructor. |
|  | `Avg Java UDTF's constructor execution time` | DOUBLE | Average amount of time to execute the Java UDTF constructor. |
|  | `Java UDTF endPartition() invocations` | DOUBLE | Number of times the Java UDTF [endPartition method](../../developer-guide/udf/java/udf-java-tabular-functions.md) was invoked. |
|  | `Java UDTF endPartition() execution time` | DOUBLE | Amount of time to execute the Java UDTF endPartition method. |
|  | `Avg Java UDTF endPartition() execution time` | DOUBLE | Average amount of time to execute the Java UDTF `endPartition` method. |
|  | `Max Java UDF dependency download time` | DOUBLE | Maximum amount of time to download the Java UDF dependencies. |
|  | `Max JVM memory usage` | DOUBLE | Peak memory usage as reported by the JVM. |
|  | `Java UDF inline code compile time in ms` | DOUBLE | Compile time for the Java UDF inline code. |
|  | `Total Python UDF handler invocations` | DOUBLE | Number of times the Python UDF handler was invoked. |
|  | `Total Python UDF handler execution time` | DOUBLE | Total execution time for the Python UDF handler. |
|  | `Avg Python UDF handler execution time` | DOUBLE | Average amount of time to execute the Python UDF handler. |
|  | `Python sandbox max memory usage` | DOUBLE | Peak memory usage by the Python sandbox environment. |
|  | `Avg Python env creation time: Download and install packages` | DOUBLE | Average amount of time to create the Python environment, including downloading and installing packages. |
|  | `Conda solver time` | DOUBLE | Amount of time to run the Conda solver to solve Python packages. |
|  | `Conda env creation time` | DOUBLE | Amount of time to create the Python environment. |
|  | `Python UDF initialization time` | DOUBLE | Amount of time to initialize the Python UDF. |
|  | `Number of external file bytes read for UDFs` | DOUBLE | Number of external file bytes read for UDFs. |
|  | `Number of external files accessed for UDFs` | DOUBLE | Number of external files accessed for UDFs. |
| `external_functions` |  |  | Information about calls to external functions. If the value of a field — for example `retries_due_to_transient_errors` — is zero, then the field is not displayed. |
|  | `total_invocations` | DOUBLE | Number of times that an external function was called. This number can be different from the number of external function calls in the text of the SQL statement because of the number of batches that rows are divided into, the number of retries if there are transient network problems, and so on. |
|  | `rows_sent` | DOUBLE | Number of rows sent to external functions. |
|  | `rows_received` | DOUBLE | Number of rows received back from external functions. |
|  | `bytes_sent (x-region)` | DOUBLE | Number of bytes sent to external functions. If the key includes `(x-region)`, the data was sent across regions, which can impact billing. |
|  | `bytes_received (x-region)` | DOUBLE | Number of bytes received from external functions. If the key includes `(x-region)`, the data was sent across regions, which can impact billing. |
|  | `retries_due_to_transient_errors` | DOUBLE | Number of retries because of transient errors. |
|  | `average_latency_per_call` | DOUBLE | Average amount of time per invocation (call) in milliseconds between the time Snowflake sent the data and received the returned data. |
|  | `http_4xx_errors` | INTEGER | Total number of HTTP requests that returned a 4xx status code. |
|  | `http_5xx_errors` | INTEGER | Total number of HTTP requests that returned a 5xx status code. |
|  | `average_latency` | DOUBLE | Average latency for successful HTTP requests. |
|  | `avg_throttle_latency_overhead` | DOUBLE | Average overhead per successful request because of a slowdown caused by throttling (HTTP 429). |
|  | `batches_retried_due_to_throttling` | DOUBLE | Number of batches that were retried because of HTTP 429 errors. |
|  | `latency_per_successful_call_(p50)` | DOUBLE | 50th percentile latency for successful HTTP requests. 50 percent of all successful requests took less than this time to complete. |
|  | `latency_per_successful_call_(p90)` | DOUBLE | 90th percentile latency for successful HTTP requests. 90 percent of all successful requests took less than this time to complete. |
|  | `latency_per_successful_call_(p95)` | DOUBLE | 95th percentile latency for successful HTTP requests. 95 percent of all successful requests took less than this time to complete. |
|  | `latency_per_successful_call_(p99)` | DOUBLE | 99th percentile latency for successful HTTP requests. 99 percent of all successful requests took less than this time to complete. |
| `input_rows` |  | INTEGER | Number of input rows. This can be missing for an operator with no input edges from other operators. |
| `io` |  |  | Information about the I/O (input/output) operations performed during the query. |
|  | `scan_progress` | DOUBLE | Percentage of data scanned for a given table so far. |
|  | `bytes_scanned` | DOUBLE | Number of bytes scanned so far. |
|  | `percentage_scanned_from_cache` | DOUBLE | Percentage of data scanned from the local disk cache. |
|  | `bytes_written` | DOUBLE | Bytes written; for example, when loading into a table. |
|  | `bytes_written_to_result` | DOUBLE | Bytes written to a result object.  For example, `SELECT * FROM ...` would produce a set of results in tabular format representing each field in the selection.  In general, the results object represents whatever is produced as a result of the query, and `bytes_written_to_result` represents the size of the returned result. |
|  | `bytes_read_from_result` | DOUBLE | Bytes read from a result object. |
|  | `external_bytes_scanned` | DOUBLE | Bytes read from an external object; for example, a stage. |
| `network` | `network_bytes` | DOUBLE | Amount of data sent over the network. |
| `output_rows` |  | INTEGER | Number of output rows. This can be missing for the operator that returns the results to the user; which is usually the RESULT operator. |
| `pruning` |  |  | Information on table pruning. |
|  | `partitions_pruned_by_snowflake_optima` | DOUBLE | Number of partitions pruned by Snowflake Optima. |
|  | `partitions_scanned` | DOUBLE | Number of partitions scanned so far. |
|  | `partitions_total` | DOUBLE | Total number of partitions in a given table. |
| `spilling` |  |  | Information about disk usage for operations in which intermediate results do not fit in memory. |
|  | `bytes_spilled_remote_storage` | DOUBLE | Volume of data spilled to remote disk. |
|  | `bytes_spilled_local_storage` | DOUBLE | Volume of data spilled to local disk. |
| `search_optimization` |  |  | Information about queries that use the [search optimization service](../../user-guide/search-optimization-service.md). |
|  | `partitions_pruned_by_search_optimization` | DOUBLE | Number of partitions pruned by search optimization. |
|  | `partitions_pruned_by_search_optimization_and_snowflake_optima` | DOUBLE | Number of partitions pruned by search optimization and Snowflake Optima. |

### EXECUTION_TIME_BREAKDOWN

The fields in the OBJECTs for the `EXECUTION_TIME_BREAKDOWN` column are shown below.

| Key | Data type | Description |
| --- | --- | --- |
| `overall_percentage` | DOUBLE | Percentage of the total query time spent by this operator. |
| `initialization` | DOUBLE | Time spent setting up query processing. |
| `processing` | DOUBLE | Time spent processing the data by the CPU. |
| `synchronization` | DOUBLE | Time spent synchronizing activities between participating processes. |
| `local_disk_io` | DOUBLE | Time during which processing was blocked while waiting for local disk access. |
| `remote_disk_io` | DOUBLE | Time during which processing was blocked while waiting for remote disk access. |
| `network_communication` | DOUBLE | Time during which processing was waiting for network data transfer. |

### OPERATOR_ATTRIBUTES

Each output row describes one operator in the query.
The following table shows the possible types of operators; for example, the Filter operator.
For each type of operator, the table shows the possible attributes; for example, the expression used to filter the rows.

The operator attributes are stored in the `OPERATOR_ATTRIBUTES` column, which is of type VARIANT and contains an
[OBJECT](../data-types-semistructured.md). The OBJECT contains key/value pairs. Each key corresponds to one attribute of the operator.

| Operator name | Key | Data type | Description |
| --- | --- | --- | --- |
| `Aggregate` |  |  |  |
|  | `functions` | ARRAY of VARCHAR | List of functions computed. |
|  | `grouping_keys` | ARRAY of VARCHAR | Group-by expression. |
| `CartesianJoin` |  |  |  |
|  | `additional_join_condition` | VARCHAR | Non-equality join expression. |
|  | `equality_join_condition` | VARCHAR | Equality join expression. |
|  | `join_type` | VARCHAR | Type of join (INNER). |
| `Delete` | `table_name` | VARCHAR | Name of updated table. |
| `ExternalScan` |  |  |  |
|  | `stage_name` | VARCHAR | The name of the stage from which the data is read. |
|  | `stage_type` | VARCHAR | The type of the stage. |
| `Filter` | `filter_condition` | VARCHAR | The expression used to filter data. |
| `Flatten` | `input` | VARCHAR | Input expression used to flatten data. |
| `Generator` |  |  |  |
|  | `row_count` | NUMBER | Value of the input parameter ROWCOUNT. |
|  | `time_limit` | NUMBER | Value of the input parameter TIMELIMIT. |
| `GroupingSets` |  |  |  |
|  | `functions` | ARRAY of VARCHAR | List of functions computed. |
|  | `key_sets` | ARRAY of VARCHAR | List of grouping sets. |
| `Insert` |  |  |  |
|  | `input_expression` | VARCHAR | Which expressions are inserted. |
|  | `table_names` | ARRAY of VARCHAR | List of table names to which records are added. |
| `InternalObject` | `object_name` | VARCHAR | Name of the accessed object. |
| `Join` |  |  |  |
|  | `additional_join_condition` | VARCHAR | Non-equality join expression. |
|  | `equality_join_condition` | VARCHAR | Equality join expression. |
|  | `join_type` | VARCHAR | Type of join (INNER, OUTER, LEFT JOIN, etc.). |
| `JoinFilter` | `join_id` | NUMBER | Operator id of the join used to identify tuples that can be filtered out. |
| `Merge` | `table_name` | VARCHAR | Name of updated table. |
| `Pivot` |  |  |  |
|  | `grouping_keys` | ARRAY of VARCHAR | Remaining columns on which the results are aggregated. |
|  | `pivot_column` | ARRAY of VARCHAR | Resulting columns of pivot values. |
| `Result` | `expressions` | ARRAY of VARCHAR | List of expressions produced. |
| `Sort` | `sort_keys` | ARRAY of VARCHAR | Expression defining the sorting order. |
| `SortWithLimit` |  |  |  |
|  | `offset` | NUMBER | Position in the ordered sequence from which produced tuples are emitted. |
|  | `rows` | NUMBER | Number of rows produced. |
|  | `sort_keys` | ARRAY of VARCHAR | Expression defining the sorting order. |
| `TableScan` |  |  |  |
|  | `columns` | ARRAY of VARCHAR | List of scanned columns. |
|  | `extracted_variant_paths` | ARRAY of VARCHAR | List of paths extracted from variant columns. |
|  | `table_alias` | VARCHAR | Alias of table being accessed. |
|  | `table_name` | VARCHAR | Name of table being accessed. |
| `Unload` | `location` | VARCHAR | Stage where data is saved. |
| `Unpivot` | `expressions` | ARRAY of VARCHAR | Output columns of the unpivot query. |
| `Update` | `table_name` | VARCHAR | Name of updated table. |
| `ValuesClause` |  |  |  |
|  | `value_count` | NUMBER | Number of produced values. |
|  | `values` | VARCHAR | List of values. |
| `WindowFunction` | `functions` | ARRAY of VARCHAR | List of functions computed. |
| `WithClause` | `name` | VARCHAR | Alias of WITH clause. |

If an operator is not listed, no attributes are produced, and the value is reported as `{}`.

> **Note:**
>
> * The following operators do not have any operator attributes and therefore are not included in the
>   table of `OPERATOR_ATTRIBUTES`:
>
>   + `UnionAll`
>   + `ExternalFunction`

## Examples

The following examples call the GET_QUERY_OPERATOR_STATS function.

### Retrieving data about a single query

This example shows the statistics for a SELECT that joins two small tables.

Run the SELECT statement:

```sqlexample
SELECT x1.i, x2.i
  FROM x1 INNER JOIN x2 ON x2.i = x1.i
  ORDER BY x1.i, x2.i;
```

Get the query ID:

```sqlexample
SET lqid = (SELECT LAST_QUERY_ID());
```

Call GET_QUERY_OPERATOR_STATS() to get statistics about the individual query operators in the query:

```sqlexample
SELECT * FROM TABLE(GET_QUERY_OPERATOR_STATS($lqid));
```

```output
+--------------------------------------+---------+-------------+--------------------+---------------+-----------------------------------------+-----------------------------------------------+----------------------------------------------------------------------+
| QUERY_ID                             | STEP_ID | OPERATOR_ID | PARENT_OPERATORS   | OPERATOR_TYPE | OPERATOR_STATISTICS                     | EXECUTION_TIME_BREAKDOWN                      | OPERATOR_ATTRIBUTES                                                  |
|--------------------------------------+---------+-------------+--------------------+---------------+-----------------------------------------+-----------------------------------------------+----------------------------------------------------------------------|
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           0 |               NULL | Result        | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "input_rows": 64                      |   "overall_percentage": 0.000000000000000e+00 |   "expressions": [                                                   |
|                                      |         |             |                    |               | }                                       | }                                             |     "X1.I",                                                          |
|                                      |         |             |                    |               |                                         |                                               |     "X2.I"                                                           |
|                                      |         |             |                    |               |                                         |                                               |   ]                                                                  |
|                                      |         |             |                    |               |                                         |                                               | }                                                                    |
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           1 |              [ 0 ] | Sort          | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "input_rows": 64,                     |   "overall_percentage": 0.000000000000000e+00 |   "sort_keys": [                                                     |
|                                      |         |             |                    |               |   "output_rows": 64                     | }                                             |     "X1.I ASC NULLS LAST",                                           |
|                                      |         |             |                    |               | }                                       |                                               |     "X2.I ASC NULLS LAST"                                            |
|                                      |         |             |                    |               |                                         |                                               |   ]                                                                  |
|                                      |         |             |                    |               |                                         |                                               | }                                                                    |
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           2 |              [ 1 ] | Join          | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "input_rows": 128,                    |   "overall_percentage": 0.000000000000000e+00 |   "equality_join_condition": "(X2.I = X1.I)",                        |
|                                      |         |             |                    |               |   "output_rows": 64                     | }                                             |   "join_type": "INNER"                                               |
|                                      |         |             |                    |               | }                                       |                                               | }                                                                    |
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           3 |              [ 2 ] | TableScan     | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "io": {                               |   "overall_percentage": 0.000000000000000e+00 |   "columns": [                                                       |
|                                      |         |             |                    |               |     "bytes_scanned": 1024,              | }                                             |     "I"                                                              |
|                                      |         |             |                    |               |     "percentage_scanned_from_cache": 1, |                                               |   ],                                                                 |
|                                      |         |             |                    |               |     "scan_progress": 1                  |                                               |   "table_name": "MY_DB.MY_SCHEMA.X2" |
|                                      |         |             |                    |               |   },                                    |                                               | }                                                                    |
|                                      |         |             |                    |               |   "output_rows": 64,                    |                                               |                                                                      |
|                                      |         |             |                    |               |   "pruning": {                          |                                               |                                                                      |
|                                      |         |             |                    |               |     "partitions_scanned": 1,            |                                               |                                                                      |
|                                      |         |             |                    |               |     "partitions_total": 1               |                                               |                                                                      |
|                                      |         |             |                    |               |   }                                     |                                               |                                                                      |
|                                      |         |             |                    |               | }                                       |                                               |                                                                      |
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           4 |              [ 2 ] | JoinFilter    | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "input_rows": 64,                     |   "overall_percentage": 0.000000000000000e+00 |   "join_id": "2"                                                     |
|                                      |         |             |                    |               |   "output_rows": 64                     | }                                             | }                                                                    |
|                                      |         |             |                    |               | }                                       |                                               |                                                                      |
| 01a8f330-0507-3f5b-0000-43830248e09a |       1 |           5 |              [ 4 ] | TableScan     | {                                       | {                                             | {                                                                    |
|                                      |         |             |                    |               |   "io": {                               |   "overall_percentage": 0.000000000000000e+00 |   "columns": [                                                       |
|                                      |         |             |                    |               |     "bytes_scanned": 1024,              | }                                             |     "I"                                                              |
|                                      |         |             |                    |               |     "percentage_scanned_from_cache": 1, |                                               |   ],                                                                 |
|                                      |         |             |                    |               |     "scan_progress": 1                  |                                               |   "table_name": "MY_DB.MY_SCHEMA.X1" |
|                                      |         |             |                    |               |   },                                    |                                               | }                                                                    |
|                                      |         |             |                    |               |   "output_rows": 64,                    |                                               |                                                                      |
|                                      |         |             |                    |               |   "pruning": {                          |                                               |                                                                      |
|                                      |         |             |                    |               |     "partitions_scanned": 1,            |                                               |                                                                      |
|                                      |         |             |                    |               |     "partitions_total": 1               |                                               |                                                                      |
|                                      |         |             |                    |               |   }                                     |                                               |                                                                      |
|                                      |         |             |                    |               | }                                       |                                               |                                                                      |
+--------------------------------------+---------+-------------+--------------------+---------------+-----------------------------------------+-----------------------------------------------+----------------------------------------------------------------------+
```

### Identifying “exploding” join operators

The following example shows how to use GET_QUERY_OPERATOR_STATS to examine a complicated query. This example looks for operators
within a query that produce many more rows than were input to that operator.

This is the query to be analyzed:

```sqlexample
SELECT *
  FROM t1
    JOIN t2 ON t1.a = t2.a
    JOIN t3 ON t1.b = t3.b
    JOIN t4 ON t1.c = t4.c;
```

Get the query ID of the previous query:

```sqlexample
SET lid = LAST_QUERY_ID();
```

The following query shows the ratio of output rows to input rows for each of the join operators in the query:

```sqlexample
SELECT  operator_id,
        operator_attributes,
        operator_statistics:output_rows / operator_statistics:input_rows AS row_multiple
  FROM TABLE(GET_QUERY_OPERATOR_STATS($lid))
  WHERE operator_type = 'Join'
  ORDER BY step_id, operator_id;
```

```output
+---------+-------------+--------------------------------------------------------------------------+---------------+
| STEP_ID | OPERATOR_ID | OPERATOR_ATTRIBUTES                                                      | ROW_MULTIPLE  |
+---------+-------------+--------------------------------------------------------------------------+---------------+
|       1 |           1 | {  "equality_join_condition": "(T4.C = T1.C)",   "join_type": "INNER"  } |  49.969249692 |
|       1 |           3 | {  "equality_join_condition": "(T3.B = T1.B)",   "join_type": "INNER"  } | 116.071428571 |
|       1 |           5 | {  "equality_join_condition": "(T2.A = T1.A)",   "join_type": "INNER"  } |  12.20657277  |
+---------+-------------+--------------------------------------------------------------------------+---------------+
```

After you identify the exploding joins, you can review each join condition to verify that the condition is correct.

---
title: GET_RELATIVE_PATH
source: https://docs.snowflake.com/en/sql-reference/functions/get_relative_path.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# GET_RELATIVE_PATH

Extracts the path of a staged file relative to its location in the stage using the stage name and absolute file path in cloud storage as inputs.

## Syntax

```sqlsyntax
GET_RELATIVE_PATH( @<stage_name> , '<absolute_file_path>' )
```

## Arguments

`stage_name`
:   Name of the internal or external stage where the file is stored.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a stage
    > named `"my stage"`).

`absolute_file_path`
:   Stage location, including the path and filename, of the file in cloud storage.

## Returns

Path of the file relative to the stage location.

## Usage notes

* This SQL function returns a value for any role that has the following privilege on the stage:

  External stage:
  :   USAGE

  Internal stage:
  :   READ

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

Retrieve the relative path of a bitmap format image file in an external stage, where the `@images_stage` stage references the
`s3://photos/national_parks/` bucket and path:

```sqlexample
SELECT GET_RELATIVE_PATH(@images_stage, 's3://photos/national_parks/us/yosemite/half_dome.jpg');
+================================================================================---------------------+
| GET_RELATIVE_PATH(@IMAGES_STAGE, 'S3://PHOTOS/NATIONAL_PARKS/US/YOSEMITE/HALF_DOME.JPG')  |
+================================================================================---------------------+
| us/yosemite/half_dome.jpg                                                                 |
+================================================================================---------------------+
```

---
title: GET_STAGE_LOCATION
source: https://docs.snowflake.com/en/sql-reference/functions/get_stage_location.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md)

# GET_STAGE_LOCATION

Retrieves the URL for an external or internal named stage using the stage name as the input.

## Syntax

```sqlsyntax
GET_STAGE_LOCATION( @<stage_name> )
```

## Arguments

`stage_name`
:   Name of an external or internal named stage.

    > **Note:**
    >
    > If the stage name includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a stage
    > named `"my stage"`).

## Returns

URL of the cloud storage location in the stage definition.

## Usage notes

* This SQL function returns a value for any role that has the USAGE privilege on the stage.

* If files downloaded from an internal stage are corrupted, verify with the stage creator that `ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE')` is set for the stage.

## Examples

Retrieve the URL for an external stage:

```sqlexample
CREATE STAGE images_stage URL = 's3://photos/national_parks/us/yosemite/';

SELECT GET_STAGE_LOCATION(@images_stage);

+----------------------------------------------------------+
| GET_STAGE_LOCATION(@IMAGES_STAGE)                        |
+----------------------------------------------------------+
| s3://photos/national_parks/us/yosemite/                  |
+----------------------------------------------------------+
```

---
title: GETBIT
source: https://docs.snowflake.com/en/sql-reference/functions/getbit.md
section: SQL Functions
---

Categories:
:   [Bitwise expression functions](../expressions-byte-bit.md)

# GETBIT

Given an INTEGER value, returns the value of a bit at a specified position.

## Syntax

```sqlsyntax
GETBIT( <integer_expr>, <bit_position> )
```

## Arguments

`integer_expr`
:   This expression must evaluate to a data type that can be cast to an INTEGER value.

`bit_position`
:   The position of the bit (starting from 0 for the least significant bit up to 127 for the most significant bit) for which
    to retrieve the value.

## Returns

The function returns the value of the bit (0 or 1) at the specified position.

## Examples

The following example returns the values of the bits at positions 100, 3, 2, 1, and 0 for an INTEGER value.

```sqlexample
SELECT GETBIT(11, 100), GETBIT(11, 3), GETBIT(11, 2), GETBIT(11, 1), GETBIT(11, 0);
```

```output
+-----------------+---------------+---------------+---------------+---------------+
| GETBIT(11, 100) | GETBIT(11, 3) | GETBIT(11, 2) | GETBIT(11, 1) | GETBIT(11, 0) |
|-----------------+---------------+---------------+---------------+---------------|
|               0 |             1 |             0 |             1 |             1 |
+-----------------+---------------+---------------+---------------+---------------+
```

---
title: GETDATE
source: https://docs.snowflake.com/en/sql-reference/functions/getdate.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# GETDATE

Returns the current timestamp for the system in the local time zone.

Alias for [CURRENT_TIMESTAMP](current_timestamp.md).

## Syntax

```sqlsyntax
GETDATE()
```

## Arguments

None. This function must be called with parentheses.

## Returns

Returns the current system time. The data type of the returned value is
[TIMESTAMP_LTZ](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned timestamp is in the time zone for the session.
* The setting of the [TIMESTAMP_TYPE_MAPPING](../parameters.md) parameter does not affect the return value.
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual warehouse) because the queries might be serviced by different compute resources (in the warehouse).

* This function does not support the `fract_sec_precision` argument that is supported by
  the [CURRENT_TIMESTAMP](current_timestamp.md) function.

## Examples

Show the current system timestamp:

```sqlexample
SELECT GETDATE();
```

```output
+-------------------------------+
| GETDATE()                     |
|-------------------------------|
| 2024-04-17 15:44:20.960000000 |
+-------------------------------+
```

---
title: GETVARIABLE
source: https://docs.snowflake.com/en/sql-reference/functions/getvariable.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# GETVARIABLE

Returns the value associated with a SQL variable name.

See also:
:   [Session variable functions](../session-variables.md)

## Syntax

```sqlsyntax
GETVARIABLE( '<name>' )
```

## Arguments

`name`
:   The name of the SQL variable.

    You must specify the name in uppercase letters, even if you used lowercase letters when defining the variable.

## Returns

The data type of the return value is VARCHAR.

## Usage notes

This function uses the result cache for the current session if you call the function more than once in the same session. The result cache
applies wherever you call this function, including the body of policy objects, such as a row access policy.

## Examples

This example shows how to use this function and other ways of getting the value of the variable:

> ```sqlexample
> SET MY_LOCAL_VARIABLE= 'my_local_variable_value';
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | Statement executed successfully. |
> +----------------------------------+
> SELECT
>     GETVARIABLE('MY_LOCAL_VARIABLE'),
>     SESSION_CONTEXT('MY_LOCAL_VARIABLE'),
>     $MY_LOCAL_VARIABLE;
> +----------------------------------+--------------------------------------+-------------------------+
> | GETVARIABLE('MY_LOCAL_VARIABLE') | SESSION_CONTEXT('MY_LOCAL_VARIABLE') | $MY_LOCAL_VARIABLE      |
> |----------------------------------+--------------------------------------+-------------------------|
> | my_local_variable_value          | my_local_variable_value              | my_local_variable_value |
> +----------------------------------+--------------------------------------+-------------------------+
> ```

When variables are created with the SET command, the variable names are forced to all upper case. The functions
GETVARIABLE and SESSION_CONTEXT must pass the uppercase version of the function name. The “$” notation
works with either uppercase or lowercase variable names.

> ```sqlexample
> SET var_2 = 'value_2';
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | Statement executed successfully. |
> +----------------------------------+
> SELECT
>     GETVARIABLE('var_2'),
>     GETVARIABLE('VAR_2'),
>     SESSION_CONTEXT('var_2'),
>     SESSION_CONTEXT('VAR_2'),
>     $var_2,
>     $VAR_2;
> +----------------------+----------------------+--------------------------+--------------------------+---------+---------+
> | GETVARIABLE('VAR_2') | GETVARIABLE('VAR_2') | SESSION_CONTEXT('VAR_2') | SESSION_CONTEXT('VAR_2') | $VAR_2  | $VAR_2  |
> |----------------------+----------------------+--------------------------+--------------------------+---------+---------|
> | NULL                 | value_2              | NULL                     | value_2                  | value_2 | value_2 |
> +----------------------+----------------------+--------------------------+--------------------------+---------+---------+
> ```

---
title: GREATEST
source: https://docs.snowflake.com/en/sql-reference/functions/greatest.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# GREATEST

Returns the largest value from a list of expressions. GREATEST supports all data types, including VARIANT.

See also:
:   [GREATEST_IGNORE_NULLS](greatest_ignore_nulls.md)

## Syntax

```sqlsyntax
GREATEST( <expr1> [ , <expr2> ... ] )
```

## Arguments

`exprN`
:   The arguments must include at least one expression. All the expressions
    should be of the same type or compatible types.

## Returns

The first argument determines the return type:

* If the first type is numeric, then the return type is ‘widened’
  according to the numeric types in the list of all arguments.
* If the first type is not numeric, then all other arguments must be
  convertible to the first type.

If any argument is NULL, returns NULL.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

The following examples use the GREATEST function:

```sqlexample
CREATE TABLE test_table_1_greatest (
  col_1 INTEGER,
  col_2 INTEGER,
  col_3 INTEGER,
  col_4 FLOAT);
INSERT INTO test_table_1_greatest (col_1, col_2, col_3, col_4) VALUES
  (1, 2,    3,  4.00),
  (2, 4,   -1, -2.00),
  (3, 6, NULL, 13.45);
```

```sqlexample
SELECT col_1,
       col_2,
       col_3,
       GREATEST(col_1, col_2, col_3) AS greatest
  FROM test_table_1_greatest
  ORDER BY col_1;
```

```output
+-------+-------+-------+----------+
| COL_1 | COL_2 | COL_3 | GREATEST |
|-------+-------+-------+----------|
|     1 |     2 |     3 |        3 |
|     2 |     4 |    -1 |        4 |
|     3 |     6 |  NULL |     NULL |
+-------+-------+-------+----------+
```

```sqlexample
SELECT col_1,
       col_4,
       GREATEST(col_1, col_4) AS greatest
  FROM test_table_1_greatest
  ORDER BY col_1;
```

```output
+-------+-------+----------+
| COL_1 | COL_4 | GREATEST |
|-------+-------+----------|
|     1 |  4    |     4    |
|     2 | -2    |     2    |
|     3 | 13.45 |    13.45 |
+-------+-------+----------+
```

---
title: GREATEST_IGNORE_NULLS
source: https://docs.snowflake.com/en/sql-reference/functions/greatest_ignore_nulls.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# GREATEST_IGNORE_NULLS

Returns the largest non-NULL value from a list of expressions. GREATEST_IGNORE_NULLS supports all data types,
including VARIANT.

See also:
:   [GREATEST](greatest.md)

## Syntax

```sqlsyntax
GREATEST_IGNORE_NULLS( <expr1> [ , <expr2> ... ] )
```

## Arguments

`exprN`
:   The arguments must include at least one expression. All the expressions
    should be of the same type or compatible types.

## Returns

The first argument determines the return type:

* If the first type is numeric, then the return type is ‘widened’
  according to the numeric types in the list of all arguments.
* If the first type is not numeric, then all other arguments must be
  convertible to the first type.

If all arguments are NULL, returns NULL.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

Create a table and insert some values:

```sqlexample
CREATE TABLE test_greatest_ignore_nulls (
  col_1 INTEGER,
  col_2 INTEGER,
  col_3 INTEGER,
  col_4 FLOAT);

INSERT INTO test_greatest_ignore_nulls (col_1, col_2, col_3, col_4) VALUES
  (1, 2,    3,  4.25),
  (2, 4,   -1,  NULL),
  (3, 6, NULL,  -2.75);
```

Run a SELECT statement that returns the greatest non-null value in each row of the table:

```sqlexample
SELECT col_1,
       col_2,
       col_3,
       col_4,
       GREATEST_IGNORE_NULLS(col_1, col_2, col_3, col_4) AS greatest_ignore_nulls
 FROM test_greatest_ignore_nulls
 ORDER BY col_1;
```

```output
+-------+-------+-------+-------+-----------------------+
| COL_1 | COL_2 | COL_3 | COL_4 | GREATEST_IGNORE_NULLS |
|-------+-------+-------+-------+-----------------------|
|     1 |     2 |     3 |  4.25 |                  4.25 |
|     2 |     4 |    -1 |  NULL |                  4    |
|     3 |     6 |  NULL | -2.75 |                  6    |
+-------+-------+-------+-------+-----------------------+
```

---
title: GROUPING
source: https://docs.snowflake.com/en/sql-reference/functions/grouping.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)

# GROUPING

Describes which of a list of expressions are grouped in a row produced by a [GROUP BY](../constructs/group-by.md) query.

Aliases:
:   [GROUPING_ID](grouping_id.md)

## Syntax

```sqlsyntax
GROUPING( <expr1> [ , <expr2> , ... ] )
```

## Usage notes

GROUPING is not an aggregate function, but rather a utility function that can be used alongside aggregation, to determine the level of aggregation a row was generated for:

* GROUPING(`expr`) returns 0 for a row that is grouped on `expr`, and 1 for a row that is not grouped on `expr`.
* GROUPING(`expr1`, `expr2` , … , `exprN`) returns the integer representation of a bit-vector containing GROUPING(`expr1`) , GROUPING(`expr2`) , … , GROUPING(`exprN`).

## Examples

Group by sets:

> Create and populate a table with values:
>
> > ```sqlexample
> > CREATE OR REPLACE TABLE aggr2(col_x int, col_y int, col_z int);
> > INSERT INTO aggr2 VALUES(1, 2, 1), (1, 2, 3);
> > INSERT INTO aggr2 VALUES(2, 1, 10), (2, 2, 11), (2, 2, 3);
> > ```
>
> Show the values in the table:
>
> > ```sqlexample
> > SELECT * FROM aggr2 ORDER BY col_x, col_y, col_z;
> > +-------+-------+-------+
> > | COL_X | COL_Y | COL_Z |
> > |-------+-------+-------|
> > |     1 |     2 |     1 |
> > |     1 |     2 |     3 |
> > |     2 |     1 |    10 |
> > |     2 |     2 |     3 |
> > |     2 |     2 |    11 |
> > +-------+-------+-------+
> > ```
>
> Output:
>
> > ```sqlexample
> > SELECT col_x, col_y, sum(col_z),
> >        grouping(col_x), grouping(col_y), grouping(col_x, col_y)
> >     FROM aggr2 GROUP BY GROUPING SETS ((col_x), (col_y), ())
> >     ORDER BY 1, 2;
> > +-------+-------+------------+-----------------+-----------------+------------------------+
> > | COL_X | COL_Y | SUM(COL_Z) | GROUPING(COL_X) | GROUPING(COL_Y) | GROUPING(COL_X, COL_Y) |
> > |-------+-------+------------+-----------------+-----------------+------------------------|
> > |     1 |  NULL |          4 |               0 |               1 |                      1 |
> > |     2 |  NULL |         24 |               0 |               1 |                      1 |
> > |  NULL |     1 |         10 |               1 |               0 |                      2 |
> > |  NULL |     2 |         18 |               1 |               0 |                      2 |
> > |  NULL |  NULL |         28 |               1 |               1 |                      3 |
> > +-------+-------+------------+-----------------+-----------------+------------------------+
> > ```

---
title: GROUPING_ID
source: https://docs.snowflake.com/en/sql-reference/functions/grouping_id.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)

# GROUPING_ID

Describes which of a list of expressions are grouped in a row produced by a [GROUP BY](../constructs/group-by.md) query.

Alias for [GROUPING](grouping.md).

## Syntax

```sqlsyntax
GROUPING_ID( <expr1> [ , <expr2> , ... ] )
```

## Usage notes

GROUPING_ID is not an aggregate function, but rather a utility function that can be used alongside aggregation, to determine the level of aggregation a row was generated for:

* GROUPING_ID(`expr`) returns 0 for a row that is grouped on `expr`, and 1 for a row that is not grouped on `expr`.
* GROUPING_ID(`expr1`, `expr2` , … , `exprN`) returns the integer representation of a bit-vector containing GROUPING_ID(`expr1`) , GROUPING_ID(`expr2`) , … , GROUPING_ID(`exprN`).

## Examples

The examples use the following table and data:

> ```sqlexample
> CREATE OR REPLACE TABLE aggr2(col_x int, col_y int, col_z int);
> INSERT INTO aggr2 VALUES (1, 2, 1),
>                          (1, 2, 3);
> INSERT INTO aggr2 VALUES (2, 1, 10),
>                          (2, 2, 11),
>                          (2, 2, 3);
> ```

This example groups on col_x. Calling `GROUPING_ID(col_x)` returns 0, indicating that col_x is indeed one of
the grouping columns.

> ```sqlexample
> SELECT col_x, sum(col_z), GROUPING_ID(col_x)
>     FROM aggr2
>     GROUP BY col_x
>     ORDER BY col_x;
> +-------+------------+--------------------+
> | COL_X | SUM(COL_Z) | GROUPING_ID(COL_X) |
> |-------+------------+--------------------|
> |     1 |          4 |                  0 |
> |     2 |         24 |                  0 |
> +-------+------------+--------------------+
> ```

This query groups by sets:

> ```sqlexample
> SELECT col_x, col_y, sum(col_z),
>        GROUPING_ID(col_x),
>        GROUPING_ID(col_y),
>        GROUPING_ID(col_x, col_y)
>     FROM aggr2
>     GROUP BY GROUPING SETS ((col_x), (col_y), ())
>     ORDER BY col_x ASC, col_y DESC;
> +-------+-------+------------+--------------------+--------------------+---------------------------+
> | COL_X | COL_Y | SUM(COL_Z) | GROUPING_ID(COL_X) | GROUPING_ID(COL_Y) | GROUPING_ID(COL_X, COL_Y) |
> |-------+-------+------------+--------------------+--------------------+---------------------------|
> |     1 |  NULL |          4 |                  0 |                  1 |                         1 |
> |     2 |  NULL |         24 |                  0 |                  1 |                         1 |
> |  NULL |  NULL |         28 |                  1 |                  1 |                         3 |
> |  NULL |     2 |         18 |                  1 |                  0 |                         2 |
> |  NULL |     1 |         10 |                  1 |                  0 |                         2 |
> +-------+-------+------------+--------------------+--------------------+---------------------------+
> ```

---
title: H3_CELL_TO_BOUNDARY
source: https://docs.snowflake.com/en/sql-reference/functions/h3_cell_to_boundary.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_CELL_TO_BOUNDARY

Returns the [GEOGRAPHY](../data-types-geospatial.md) object representing the boundary of an
[H3](../data-types-geospatial.md) cell.

## Syntax

```sqlsyntax
H3_CELL_TO_BOUNDARY( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

## Returns

Returns a GEOGRAPHY object that represents the boundary of the H3 cell with the specified ID.

## Examples

The following example returns the GEOGRAPHY object that represents the boundary of the H3 cell containing the Brandenburg Gate.
The example specifies the H3 cell ID as an INTEGER value.

```sqlexample
SELECT H3_CELL_TO_BOUNDARY(613036919424548863);
```

```output
+-----------------------------------------+
| H3_CELL_TO_BOUNDARY(613036919424548863) |
|-----------------------------------------|
| {                                       |
|   "coordinates": [                      |
|     [                                   |
|       [                                 |
|         1.337146281884266e+01,          |
|         5.251934565725256e+01           |
|       ],                                |
|       [                                 |
|         1.336924966147084e+01,          |
|         5.251510220405509e+01           |
|       ],                                |
|       [                                 |
|         1.337455447449988e+01,          |
|         5.251214028989955e+01           |
|       ],                                |
|       [                                 |
|         1.338207263166664e+01,          |
|         5.251342164903257e+01           |
|       ],                                |
|       [                                 |
|         1.338428664751681e+01,          |
|         5.251766506194694e+01           |
|       ],                                |
|       [                                 |
|         1.337898164779325e+01,          |
|         5.252062715603375e+01           |
|       ],                                |
|       [                                 |
|         1.337146281884266e+01,          |
|         5.251934565725256e+01           |
|       ]                                 |
|     ]                                   |
|   ],                                    |
|   "type": "Polygon"                     |
| }                                       |
+-----------------------------------------+
```

The following example specifies the hexadecimal value of H3 cell ID as a VARCHAR to return the same coordinates as the previous
example.

```sqlexample
SELECT H3_CELL_TO_BOUNDARY('881F1D4887FFFFF');
```

```output
+----------------------------------------+
| H3_CELL_TO_BOUNDARY('881F1D4887FFFFF') |
|----------------------------------------|
| {                                      |
|   "coordinates": [                     |
|     [                                  |
|       [                                |
|         1.337146281884266e+01,         |
|         5.251934565725256e+01          |
|       ],                               |
|       [                                |
|         1.336924966147084e+01,         |
|         5.251510220405509e+01          |
|       ],                               |
|       [                                |
|         1.337455447449988e+01,         |
|         5.251214028989955e+01          |
|       ],                               |
|       [                                |
|         1.338207263166664e+01,         |
|         5.251342164903257e+01          |
|       ],                               |
|       [                                |
|         1.338428664751681e+01,         |
|         5.251766506194694e+01          |
|       ],                               |
|       [                                |
|         1.337898164779325e+01,         |
|         5.252062715603375e+01          |
|       ],                               |
|       [                                |
|         1.337146281884266e+01,         |
|         5.251934565725256e+01          |
|       ]                                |
|     ]                                  |
|   ],                                   |
|   "type": "Polygon"                    |
| }                                      |
+----------------------------------------+
```

---
title: H3_CELL_TO_CHILDREN
source: https://docs.snowflake.com/en/sql-reference/functions/h3_cell_to_children.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_CELL_TO_CHILDREN

Returns an [array](../data-types-semistructured.md) of the INTEGER IDs of the children of an
[H3](../data-types-geospatial.md) cell for a given resolution.

See also:
:   [H3_CELL_TO_CHILDREN_STRING](h3_cell_to_children_string.md) , [H3_CELL_TO_PARENT](h3_cell_to_parent.md)

## Syntax

```sqlsyntax
H3_CELL_TO_CHILDREN( <cell_id> , <target_resolution> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)).

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of the INTEGER values of the IDs of the children of an H3 cell at the specified target resolution.

## Examples

The following example returns an array of the IDs of the children of the H3 cell with the ID `613036919424548863`:

```sqlexample
SELECT H3_CELL_TO_CHILDREN(613036919424548863, 9);
```

```output
+--------------------------------------------+
| H3_CELL_TO_CHILDREN(613036919424548863, 9) |
|--------------------------------------------|
| [                                          |
|   617540519050084351,                      |
|   617540519050346495,                      |
|   617540519050608639,                      |
|   617540519050870783,                      |
|   617540519051132927,                      |
|   617540519051395071,                      |
|   617540519051657215                       |
| ]                                          |
+--------------------------------------------+
```

---
title: H3_CELL_TO_CHILDREN_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/h3_cell_to_children_string.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_CELL_TO_CHILDREN_STRING

Returns an [array](../data-types-semistructured.md) of the VARCHAR values containing the hexadecimal IDs of the children of an
[H3](../data-types-geospatial.md) cell for a given resolution.

See also:
:   [H3_CELL_TO_CHILDREN](h3_cell_to_children.md) , [H3_CELL_TO_PARENT](h3_cell_to_parent.md)

## Syntax

```sqlsyntax
H3_CELL_TO_CHILDREN_STRING( <cell_id> , <target_resolution> )
```

## Arguments

`cell_id`
:   A VARCHAR that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)) in hexadecimal format.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of the VARCHAR values of the hexadecimal IDs of the children of an H3 cell at the specified target resolution.

## Examples

The following example returns an array of the IDs (in hexadecimal format) of the children of the H3 cell with the ID
`881F1D4887FFFFF` (in hexadecimal format):

```sqlexample
SELECT H3_CELL_TO_CHILDREN_STRING('881F1D4887FFFFF', 9);
```

```output
+--------------------------------------------------+
| H3_CELL_TO_CHILDREN_STRING('881F1D4887FFFFF', 9) |
|--------------------------------------------------|
| [                                                |
|   "891f1d48863ffff",                             |
|   "891f1d48867ffff",                             |
|   "891f1d4886bffff",                             |
|   "891f1d4886fffff",                             |
|   "891f1d48873ffff",                             |
|   "891f1d48877ffff",                             |
|   "891f1d4887bffff"                              |
| ]                                                |
+--------------------------------------------------+
```

---
title: H3_CELL_TO_PARENT
source: https://docs.snowflake.com/en/sql-reference/functions/h3_cell_to_parent.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_CELL_TO_PARENT

Returns the ID of the parent of an [H3](../data-types-geospatial.md) cell for a given resolution. The ID is returned as
an INTEGER value (if an INTEGER value was provided as the input ID) or as a VARCHAR containing the hexadecimal ID (if the
hexadecimal ID was provided as the input ID).

See also:
:   [H3_CELL_TO_CHILDREN](h3_cell_to_children.md) , [H3_CELL_TO_CHILDREN_STRING](h3_cell_to_children_string.md)

## Syntax

```sqlsyntax
H3_CELL_TO_PARENT( <cell_id> , <target_resolution> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cell.

    Specifying any other INTEGER value results in an error.

## Returns

Returns the ID of the H3 parent cell at the specified target resolution. The ID is in one of the following formats:

* If `cell_id` is an INTEGER value, the function returns the ID as an INTEGER value.
* If `cell_id` is a VARCHAR value containing the hexadecimal ID, the function returns the hexadecimal ID as a VARCHAR
  value.

## Examples

The following example returns the H3 cell ID for the parent of the H3 cell with the ID `613036919424548863` (specified as an
INTEGER value):

```sqlexample
SELECT H3_CELL_TO_PARENT(613036919424548863, 7);
```

```output
+------------------------------------------+
| H3_CELL_TO_PARENT(613036919424548863, 7) |
|------------------------------------------|
|                       608533319805566975 |
+------------------------------------------+
```

The following example returns the H3 cell ID for the parent of the H3 cell with the ID `881F1D4887FFFFF` (specified as a
VARCHAR value):

```sqlexample
SELECT H3_CELL_TO_PARENT('881F1D4887FFFFF', 7);
```

```output
+-----------------------------------------+
| H3_CELL_TO_PARENT('881F1D4887FFFFF', 7) |
|-----------------------------------------|
|  871F1D488FFFFFF                        |
+-----------------------------------------+
```

---
title: H3_CELL_TO_POINT
source: https://docs.snowflake.com/en/sql-reference/functions/h3_cell_to_point.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_CELL_TO_POINT

Returns the [GEOGRAPHY](../data-types-geospatial.md) object representing the Point that is the centroid of an
[H3](../data-types-geospatial.md) cell.

See also:
:   [H3_POINT_TO_CELL](h3_point_to_cell.md) , [H3_POINT_TO_CELL_STRING](h3_point_to_cell_string.md)

## Syntax

```sqlsyntax
H3_CELL_TO_POINT( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

## Returns

Returns a GEOGRAPHY object for the Point that represents the centroid of the H3 cell with the specified ID.

## Examples

The following example returns the GEOGRAPHY object for the Point that represents the centroid of the H3 cell containing the
Brandenburg Gate. The example specifies the H3 cell ID as an INTEGER value.

```sqlexample
SELECT H3_CELL_TO_POINT(613036919424548863);
```

```output
+--------------------------------------+
| H3_CELL_TO_POINT(613036919424548863) |
|--------------------------------------|
| {                                    |
|   "coordinates": [                   |
|     1.337676791184706e+01,           |
|     5.251638386722465e+01            |
|   ],                                 |
|   "type": "Point"                    |
| }                                    |
+--------------------------------------+
```

The following example specifies the hexadecimal value of the H3 cell ID as a VARCHAR to return the same coordinates as the previous
example.

```sqlexample
SELECT H3_CELL_TO_POINT('881F1D4887FFFFF');
```

```output
+-------------------------------------+
| H3_CELL_TO_POINT('881F1D4887FFFFF') |
|-------------------------------------|
| {                                   |
|   "coordinates": [                  |
|     1.337676791184706e+01,          |
|     5.251638386722465e+01           |
|   ],                                |
|   "type": "Point"                   |
| }                                   |
+-------------------------------------+
```

---
title: H3_COMPACT_CELLS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_compact_cells.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_COMPACT_CELLS

Returns an [array](../data-types-semistructured.md) of [VARIANT](../data-types-semistructured.md) values that
contain the INTEGER IDs of fewer, larger [H3](../data-types-geospatial.md) cells that cover
the same area as the H3 cells in the input. For information about compacted cells, see [Indexing](https://h3geo.org/docs/highlights/indexing/).

## Syntax

```sqlsyntax
H3_COMPACT_CELLS( <array_of_cell_ids> )
```

## Arguments

`array_of_cell_ids`
:   An array of VARIANT values that contain the INTEGER values that represent H3 cell IDs ([indexes](https://h3geo.org/docs/core-library/h3Indexing)).

## Returns

Returns a value of the ARRAY data type or NULL.

* If the input is an array of INTEGER values, returns an array that consists of VARIANT values that represent a
  compacted set of H3 cells. The VARIANT values contain the INTEGER values that represent H3 cell IDs.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* All of the INTEGER values in the input must represent valid H3 cells.
* All of the H3 cells in the input must have the same resolution.
* The H3 cells in the input must cover unique areas without overlapping. Duplicate H3 cells are not allowed.

## Examples

The following example compacts a set of H3 cells, returning cells at a lower resolution that represent the same area.

```sqlexample
SELECT H3_COMPACT_CELLS(
  [
    622236750562230271,
    622236750562263039,
    622236750562295807,
    622236750562328575,
    622236750562361343,
    622236750562394111,
    622236750562426879,
    622236750558396415
  ]
) AS compacted;
```

```output
+-----------------------+
| COMPACTED             |
|-----------------------|
| [                     |
|   622236750558396415, |
|   617733150935089151  |
| ]                     |
+-----------------------+
```

---
title: H3_COMPACT_CELLS_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_compact_cells_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_COMPACT_CELLS_STRINGS

Returns an [array](../data-types-semistructured.md) of [VARIANT](../data-types-semistructured.md) values that contain
the VARCHAR hexadecimal IDs of fewer, larger [H3](../data-types-geospatial.md) cells that cover the
same area as the H3 cells in the input. For information about compacted cells, see [Indexing](https://h3geo.org/docs/highlights/indexing/).

## Syntax

```sqlsyntax
H3_COMPACT_CELLS_STRINGS( <array_of_cell_ids> )
```

## Arguments

`array_of_cell_ids`
:   An array of VARIANT values that contain the VARCHAR hexadecimal values that represent H3 cell IDs ([indexes](https://h3geo.org/docs/core-library/h3Indexing)).

## Returns

Returns a value of the ARRAY data type or NULL.

* If the input is an array of VARCHAR hexadecimal values, returns an array that consists of VARIANT values that represent a
  compacted set of H3 cells. The VARIANT values contain the VARCHAR hexadecimal values that represent H3 cell IDs.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* All of the VARCHAR hexadecimal values in the input must represent valid H3 cells.
* All of the H3 cells in the input must have the same resolution.
* The H3 cells in the input must cover unique areas without overlapping. Duplicate H3 cells are not allowed.

## Examples

The following example compacts a set of H3 cells, returning cells at a lower resolution that represent the same area.

```sqlexample
SELECT H3_COMPACT_CELLS_STRINGS(
  [
    '8a2a10705507fff',
    '8a2a1070550ffff',
    '8a2a10705517fff',
    '8a2a1070551ffff',
    '8a2a10705527fff',
    '8a2a1070552ffff',
    '8a2a10705537fff',
    '8a2a10705cdffff'
    ]
  ) AS compacted;
```

```output
+----------------------+
| COMPACTED            |
|----------------------|
| [                    |
|   "8a2a10705cdffff", |
|   "892a1070553ffff"  |
| ]                    |
+----------------------+
```

---
title: H3_COVERAGE
source: https://docs.snowflake.com/en/sql-reference/functions/h3_coverage.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_COVERAGE

Returns an [array](../data-types-semistructured.md) of IDs (as INTEGER values) identifying the minimal set of
[H3](../data-types-geospatial.md) cells that completely cover a shape (specified by a
[GEOGRAPHY](../data-types-geospatial.md) object).

See also:
:   [H3_COVERAGE_STRINGS](h3_coverage_strings.md) , [H3_POLYGON_TO_CELLS](h3_polygon_to_cells.md)

## Syntax

```sqlsyntax
H3_COVERAGE( <geography_expression> , <target_resolution> )
```

## Arguments

`geography_expression`
:   A GEOGRAPHY object.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an ARRAY of INTEGER values for the IDs of the minimal set of H3 cells that have completely cover the specified input
shape.

## Usage notes

* The function uses spherical approximation, which treats points on the Earth’s surface as if they were connected by arcs, rather
  than straight lines. If you need a planar approximation, use [H3_POLYGON_TO_CELLS](h3_polygon_to_cells.md) instead.
* A cell is included in the result set if its boundary intersects the input shape.
* When you apply [FLATTEN](flatten.md) to the ARRAY returned by the function,
  [cast](../data-type-conversion.md) each value explicitly to an integer.

## Examples

The following example returns an ARRAY of the IDs that identify the minimal set of H3 cells that completely cover the specified
Polygon.

```sqlexample
SELECT H3_COVERAGE(
  TO_GEOGRAPHY(
    'POLYGON((-122.481889 37.826683,-122.479487 37.808548,-122.474150 37.808904,-122.476510 37.826935,-122.481889 37.826683))'),
  8) AS set_of_h3_cells_covering_polygon;
```

```output
+----------------------------------+
| SET_OF_H3_CELLS_COVERING_POLYGON |
|----------------------------------|
| [                                |
|   613196571542028287,            |
|   613196571548319743,            |
|   613196571598651391,            |
|   613196571539931135,            |
|   613196571560902655,            |
|   613196571550416895             |
| ]                                |
+----------------------------------+
```

---
title: H3_COVERAGE_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_coverage_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_COVERAGE_STRINGS

Returns an [array](../data-types-semistructured.md) of hexadecimal IDs (as VARCHAR values) identifying
the minimal set of [H3](../data-types-geospatial.md) cells that completely cover a shape
(specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

See also:
:   [H3_COVERAGE](h3_coverage.md) , [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md)

## Syntax

```sqlsyntax
H3_COVERAGE_STRINGS( <geography_expression> , <target_resolution> )
```

## Arguments

`geography_expression`
:   A GEOGRAPHY object.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an ARRAY of VARCHAR values for the hexadecimal IDs of the minimal set of H3 cells that completely cover the
specified input shape.

## Usage notes

* The function uses spherical approximation, which treats points on the Earth’s surface as if they were connected by arcs, rather
  than straight lines. If you need a planar approximation, use [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md) instead.
* A cell is included in the result set if its boundary intersects the input shape.

## Examples

The following example returns an ARRAY of the hexadecimal IDs that identify the minimal set of H3 cells that completely cover the
specified Polygon.

```sqlexample
SELECT H3_COVERAGE_STRINGS(
  TO_GEOGRAPHY(
    'POLYGON((-122.481889 37.826683,-122.479487 37.808548,-122.474150 37.808904,-122.476510 37.826935,-122.481889 37.826683))'),
  8) AS set_of_h3_cells_covering_polygon;
```

```output
+----------------------------------+
| SET_OF_H3_CELLS_COVERING_POLYGON |
|----------------------------------|
| [                                |
|   "882830870bfffff",             |
|   "8828308703fffff",             |
|   "8828308739fffff",             |
|   "8828308709fffff",             |
|   "8828308701fffff",             |
|   "8828308715fffff"              |
| ]                                |
|----------------------------------|
```

---
title: H3_GET_RESOLUTION
source: https://docs.snowflake.com/en/sql-reference/functions/h3_get_resolution.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_GET_RESOLUTION

Returns the resolution of an [H3](../data-types-geospatial.md) cell.

## Syntax

```sqlsyntax
H3_GET_RESOLUTION( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

## Returns

Returns an INTEGER value between 0 and 15, which represents the resolution of the input H3 cell.

## Examples

The following example returns the resolution of the H3 cell with the ID `617540519050084351`. The example specifies the H3
cell ID as an INTEGER value.

```sqlexample
SELECT H3_GET_RESOLUTION(617540519050084351);
```

```output
+---------------------------------------+
| H3_GET_RESOLUTION(617540519050084351) |
|---------------------------------------|
|                                     9 |
+---------------------------------------+
```

The following example specifies the hexadecimal value of H3 cell ID (`89283087033ffff`) as a VARCHAR to return the resolution
of the cell.

```sqlexample
SELECT H3_GET_RESOLUTION('89283087033ffff');
```

```output
+--------------------------------------+
| H3_GET_RESOLUTION('89283087033FFFF') |
|--------------------------------------|
|                                    9 |
+--------------------------------------+
```

---
title: H3_GRID_DISK
source: https://docs.snowflake.com/en/sql-reference/functions/h3_grid_disk.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_GRID_DISK

Returns an [array](../data-types-semistructured.md) of the IDs of the [H3](../data-types-geospatial.md) cells that
are within the k-distance from the specified cell. The IDs in the returned ARRAY are INTEGER values (if an INTEGER
value was provided as the input ID) or VARCHAR values containing the hexadecimal IDs (if a hexadecimal ID was provided
as the input ID).

## Syntax

```sqlsyntax
H3_GRID_DISK( <cell_id> , <k_value> )
```

## Arguments

`cell_id`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

`k_value`
:   An INTEGER that represents the grid distance. You must specify a non-negative value.

## Returns

Returns an ARRAY of the IDs of H3 cells that are within the distance `k_value` from the cell specified by
`cell_id`. The IDs are in one of the following formats:

* If `cell_id` is an INTEGER value, the function returns the IDs as INTEGER values.
* If `cell_id` is a VARCHAR value containing the hexadecimal ID, the function returns the hexadecimal IDs as VARCHAR
  values.

## Examples

The following example returns an ARRAY of the IDs of H3 cells within the grid distance of 1 from the cell with the ID
`617540519050084351` (specified as an INTEGER value).

```sqlexample
SELECT H3_GRID_DISK(617540519050084351, 1);
```

```output
+-------------------------------------+
| H3_GRID_DISK(617540519050084351, 1) |
|-------------------------------------|
| [                                   |
|   617540519050084351,               |
|   617540519051657215,               |
|   617540519050608639,               |
|   617540519050870783,               |
|   617540519050346495,               |
|   617540519051395071,               |
|   617540519051132927                |
| ]                                   |
+-------------------------------------+
```

The following example returns an ARRAY of the IDs of H3 cells within the grid distance of 1 from the cell with the ID
`891f1d48863ffff` (specified as a VARCHAR value).

```sqlexample
SELECT H3_GRID_DISK('891f1d48863ffff', 1);
```

```output
+------------------------------------+
| H3_GRID_DISK('891F1D48863FFFF', 1) |
|------------------------------------|
| [                                  |
|   "891f1d48863ffff",               |
|   "891f1d4887bffff",               |
|   "891f1d4886bffff",               |
|   "891f1d4886fffff",               |
|   "891f1d48867ffff",               |
|   "891f1d48877ffff",               |
|   "891f1d48873ffff"                |
| ]                                  |
+------------------------------------+
```

---
title: H3_GRID_DISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/h3_grid_distance.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_GRID_DISTANCE

Returns the distance between two [H3](../data-types-geospatial.md) cells specified by their IDs.

## Syntax

```sqlsyntax
H3_GRID_DISTANCE( <cell_id_1> , <cell_id_2> )
```

## Arguments

`cell_id_1`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

`cell_id_2`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

## Returns

Returns the INTEGER value that represents the distance in grid cells between the two H3 cells.

## Usage notes

The two input cell IDs must use the same resolution.

## Examples

The following example returns the distance (in terms of the number of grid cells) between two H3 cells. The example specifies the
H3 cell IDs as INTEGER values.

```sqlexample
SELECT H3_GRID_DISTANCE(617540519103561727, 617540519052967935);
```

```output
+----------------------------------------------------------+
| H3_GRID_DISTANCE(617540519103561727, 617540519052967935) |
|----------------------------------------------------------|
|                                                        5 |
+----------------------------------------------------------+
```

The following example specifies the hexadecimal values of the H3 cell IDs as VARCHAR values:

```sqlexample
SELECT H3_GRID_DISTANCE('891f1d48b93ffff', '891f1d4888fffff');
```

```output
+--------------------------------------------------------+
| H3_GRID_DISTANCE('891F1D48B93FFFF', '891F1D4888FFFFF') |
|--------------------------------------------------------|
|                                                      5 |
+--------------------------------------------------------+
```

---
title: H3_GRID_PATH
source: https://docs.snowflake.com/en/sql-reference/functions/h3_grid_path.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_GRID_PATH

Returns an [array](../data-types-semistructured.md) of the IDs of the [H3](../data-types-geospatial.md) cells that represent
the line between two cells. The IDs in the returned ARRAY are INTEGER values (if INTEGER values were provided as the input IDs)
or VARCHAR values containing the hexadecimal IDs (if hexadecimal IDs were provided as the input IDs).

## Syntax

```sqlsyntax
H3_GRID_PATH( <cell_id_1> , <cell_id_2> )
```

## Arguments

`cell_id_1`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

`cell_id_2`
:   An INTEGER that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR that represents the cell ID in hexadecimal format.

## Returns

Returns an ARRAY of the IDs of H3 cells that represent the line between the cells specified by `cell_id_1` and
`cell_id_2`. The IDs are in one of the following formats:

* If `cell_id_1` and `cell_id_2` are INTEGER values, the function returns the IDs as INTEGER values.
* If `cell_id_1` and `cell_id_2` are VARCHAR values containing the hexadecimal IDs, the function returns the
  hexadecimal IDs as VARCHAR values.

## Usage notes

The two input cell IDs must use the same resolution.

## Examples

The following example returns an ARRAY of the IDs of H3 cells that represent the line between the cells with the IDs
`617540519103561727` and `617540519052967935` (both specified as INTEGER values).

```sqlexample
SELECT H3_GRID_PATH(617540519103561727, 617540519052967935);
```

```output
+------------------------------------------------------+
| H3_GRID_PATH(617540519103561727, 617540519052967935) |
|------------------------------------------------------|
| [                                                    |
|   617540519103561727,                                |
|   617540519046414335,                                |
|   617540519047462911,                                |
|   617540519044055039,                                |
|   617540519045103615,                                |
|   617540519052967935                                 |
| ]                                                    |
+------------------------------------------------------+
```

The following example returns an ARRAY of the IDs of H3 cells that represent the line between the cells with the IDs
`891f1d48b93ffff` and `891f1d4888fffff` (both specified as VARCHAR values).

```sqlexample
SELECT H3_GRID_PATH('891f1d48b93ffff', '891f1d4888fffff');
```

```output
+----------------------------------------------------+
| H3_GRID_PATH('891F1D48B93FFFF', '891F1D4888FFFFF') |
|----------------------------------------------------|
| [                                                  |
|   "891f1d48b93ffff",                               |
|   "891f1d4882bffff",                               |
|   "891f1d4883bffff",                               |
|   "891f1d48807ffff",                               |
|   "891f1d48817ffff",                               |
|   "891f1d4888fffff"                                |
| ]                                                  |
+----------------------------------------------------+
```

---
title: H3_INT_TO_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/h3_int_to_string.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_INT_TO_STRING

Converts the INTEGER value of an [H3](../data-types-geospatial.md) cell ID to hexadecimal format.

See also:
:   [H3_STRING_TO_INT](h3_string_to_int.md)

## Syntax

```sqlsyntax
H3_INT_TO_STRING( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER value that represents the cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)).

## Returns

Returns the H3 cell ID in hexadecimal format.

## Examples

The following example converts the INTEGER value of an H3 cell ID to hexadecimal format.

```sqlexample
SELECT H3_INT_TO_STRING(617700171168612351);
```

```output
+------------------------------------------------+
|          H3_INT_TO_STRING(617700171168612351)  |
|------------------------------------------------|
|                                89283087033FFFF |
+------------------------------------------------+
```

---
title: H3_IS_PENTAGON
source: https://docs.snowflake.com/en/sql-reference/functions/h3_is_pentagon.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_IS_PENTAGON

Returns TRUE if the boundary of an [H3](../data-types-geospatial.md) cell represents a pentagon.

## Syntax

```sqlsyntax
H3_IS_PENTAGON( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

## Returns

Returns a BOOLEAN or NULL.

* The value is TRUE if the input represents a pentagon. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following example specifies an integer that does not represent a pentagon.

```sqlexample
SELECT H3_IS_PENTAGON(613036919424548863);
```

```output
+------------------------------------+
| H3_IS_PENTAGON(613036919424548863) |
|------------------------------------|
| False                              |
+------------------------------------+
```

The following example specifies a hexadecimal string that represents a pentagon.

```sqlexample
SELECT H3_IS_PENTAGON('804dfffffffffff');
```

```output
+-----------------------------------+
| H3_IS_PENTAGON('804DFFFFFFFFFFF') |
|-----------------------------------|
| True                              |
+-----------------------------------+
```

---
title: H3_IS_VALID_CELL
source: https://docs.snowflake.com/en/sql-reference/functions/h3_is_valid_cell.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_IS_VALID_CELL

Returns TRUE if the input represents a valid [H3](../data-types-geospatial.md) cell.

## Syntax

```sqlsyntax
H3_IS_VALID_CELL( <cell_id> )
```

## Arguments

`cell_id`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

## Returns

Returns a BOOLEAN or NULL.

* The value is TRUE if the input represents a valid H3 cell. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following example specifies an integer that represents a valid H3 cell.

```sqlexample
SELECT H3_IS_VALID_CELL(613036919424548863);
```

```output
+--------------------------------------+
| H3_IS_VALID_CELL(613036919424548863) |
|--------------------------------------|
| True                                 |
+--------------------------------------+
```

The following example specifies a string that does not represent a valid H3 cell.

```sqlexample
SELECT H3_IS_VALID_CELL('Invalid Cell');
```

```output
+----------------------------------+
| H3_IS_VALID_CELL('INVALID CELL') |
|----------------------------------|
| False                            |
+----------------------------------+
```

---
title: H3_LATLNG_TO_CELL
source: https://docs.snowflake.com/en/sql-reference/functions/h3_latlng_to_cell.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_LATLNG_TO_CELL

Returns the INTEGER value of the [H3](../data-types-geospatial.md) cell ID for a given latitude, longitude, and
resolution.

See also:
:   [H3_LATLNG_TO_CELL_STRING](h3_latlng_to_cell_string.md)

## Syntax

```sqlsyntax
H3_LATLNG_TO_CELL( <latitude> , <longitude> , <target_resolution> )
```

## Arguments

`latitude`
:   A FLOAT that represents the latitude.

    Values outside the standard latitude range are wrapped to the range [-90, 90].

`longitude`
:   A FLOAT that represents the longitude.

    Values outside the standard longitude range are wrapped to the range [-180, 180].

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cell.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an INTEGER value that corresponds to the H3 cell ID for the given location and resolution.

## Usage notes

* Specifying NaN or Inf values for any input argument results in an error.

## Examples

The following example returns the H3 cell ID for the Brandenburg Gate at resolution 8.

```sqlexample
SELECT H3_LATLNG_TO_CELL(52.516262, 13.377704, 8);
```

```output
+--------------------------------------------+
| H3_LATLNG_TO_CELL(52.516262, 13.377704, 8) |
|--------------------------------------------|
|                         613036919424548863 |
+--------------------------------------------+
```

The following example specifies a `longitude` value (`373.377704`) that is outside of the traditional longitude range
(-180 to 180). The function interprets this value as `13.377704` (373.377704 modulo 180).

```sqlexample
SELECT H3_LATLNG_TO_CELL(52.516262, 373.377704, 8);
```

```output
+---------------------------------------------+
| H3_LATLNG_TO_CELL(52.516262, 373.377704, 8) |
|---------------------------------------------|
|                          613036919424548863 |
+---------------------------------------------+
```

The following example demonstrates that you cannot specify a resolution outside of 0 through 15.

```sqlexample
SELECT H3_LATLNG_TO_CELL(52.516262, 373.377704, 18);
```

```output
100410 (P0000): Invalid H3 resolution value: 18. Resolution must be between 0 (coarsest) and 15 (finest).
```

---
title: H3_LATLNG_TO_CELL_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/h3_latlng_to_cell_string.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_LATLNG_TO_CELL_STRING

Returns the [H3](../data-types-geospatial.md) cell ID in hexadecimal format (as a VARCHAR value) for a given latitude,
longitude, and resolution.

See also:
:   [H3_LATLNG_TO_CELL](h3_latlng_to_cell.md)

## Syntax

```sqlsyntax
H3_LATLNG_TO_CELL_STRING( <latitude> , <longitude> , <target_resolution> )
```

## Arguments

`latitude`
:   A FLOAT that represents the latitude.

    Values outside the standard latitude range are wrapped to the range [-90, 90].

`longitude`
:   A FLOAT that represents the longitude.

    Values outside the standard longitude range are wrapped to the range [-180, 180].

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cell.

    Specifying any other INTEGER value results in an error.

## Returns

Returns a VARCHAR value that corresponds to the hexadecimal H3 cell ID for the given location and resolution.

## Usage notes

* Specifying NaN or Inf values for any input argument results in an error.

## Examples

The following example returns the hexadecimal H3 cell ID for the Brandenburg Gate at resolution 8.

```sqlexample
SELECT H3_LATLNG_TO_CELL_STRING(52.516262, 13.377704, 8);
```

```output
+---------------------------------------------------+
| H3_LATLNG_TO_CELL_STRING(52.516262, 13.377704, 8) |
|---------------------------------------------------|
|  881F1D4887FFFFF                                  |
+---------------------------------------------------+
```

The following example specifies a `longitude` value (`373.377704`) that is outside of the traditional longitude range
(-180 to 180). The function interprets this value as `13.377704` (373.377704 modulo 180).

```sqlexample
SELECT H3_LATLNG_TO_CELL_STRING(52.516262, 373.377704, 8);
```

```output
+---------------------------------------------------+
| H3_LATLNG_TO_CELL_STRING(52.516262, 13.377704, 8) |
|---------------------------------------------------|
|  881F1D4887FFFFF                                  |
+---------------------------------------------------+
```

The following example demonstrates that you cannot specify a resolution outside of 0 through 15.

```sqlexample
SELECT H3_LATLNG_TO_CELL_STRING(52.516262, 373.377704, 18);
```

```output
100410 (P0000): Invalid H3 resolution value: 18. Resolution must be between 0 (coarsest) and 15 (finest).
```

---
title: H3_POINT_TO_CELL
source: https://docs.snowflake.com/en/sql-reference/functions/h3_point_to_cell.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_POINT_TO_CELL

Returns the INTEGER value of an [H3](../data-types-geospatial.md) cell ID for a Point (specified by a
[GEOGRAPHY](../data-types-geospatial.md) object) at a given resolution.

See also:
:   [H3_POINT_TO_CELL_STRING](h3_point_to_cell_string.md) , [H3_CELL_TO_POINT](h3_cell_to_point.md)

## Syntax

```sqlsyntax
H3_POINT_TO_CELL( <geography_point> , <target_resolution> )
```

## Arguments

`geography_point`
:   A GEOGRAPHY object that represents a Point.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cell.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an INTEGER value that corresponds to the H3 cell ID for the given location and resolution.

## Examples

The following example returns the H3 cell ID for the Brandenburg Gate at resolution 8.

```sqlexample
SELECT H3_POINT_TO_CELL(ST_POINT(13.377704, 52.516262), 8);
```

```output
+-----------------------------------------------------+
| H3_POINT_TO_CELL(ST_POINT(13.377704, 52.516262), 8) |
|-----------------------------------------------------|
|                                  613036919424548863 |
+-----------------------------------------------------+
```

The following example demonstrates that you cannot specify a resolution outside of 0 through 15.

```sqlexample
SELECT H3_POINT_TO_CELL(ST_POINT(13.377704, 52.516262), 18);
```

```output
100410 (P0000): Invalid H3 resolution value: 18. Resolution must be between 0 (coarsest) and 15 (finest).
```

---
title: H3_POINT_TO_CELL_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/h3_point_to_cell_string.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_POINT_TO_CELL_STRING

Returns the hexadecimal value of an [H3](../data-types-geospatial.md) cell ID for a Point (specified by a
[GEOGRAPHY](../data-types-geospatial.md) object) at a given resolution.

See also:
:   [H3_POINT_TO_CELL](h3_point_to_cell.md) , [H3_CELL_TO_POINT](h3_cell_to_point.md)

## Syntax

```sqlsyntax
H3_POINT_TO_CELL_STRING( <geography_point> , <target_resolution> )
```

## Arguments

`geography_point`
:   A GEOGRAPHY object that represents a Point.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cell.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an VARCHAR value that corresponds to the hexadecimal H3 cell ID for the given location and resolution.

## Examples

The following example returns the hexadecimal H3 cell ID for the Brandenburg Gate at resolution 8.

```sqlexample
SELECT H3_POINT_TO_CELL_STRING(ST_POINT(13.377704, 52.516262), 8);
```

```output
+------------------------------------------------------------+
| H3_POINT_TO_CELL_STRING(ST_POINT(13.377704, 52.516262), 8) |
|------------------------------------------------------------|
|  881F1D4887FFFFF                                           |
+------------------------------------------------------------+
```

The following example demonstrates that you cannot specify a resolution outside of 0 through 15.

```sqlexample
SELECT H3_POINT_TO_CELL_STRING(ST_POINT(13.377704, 52.516262), 18);
```

```output
100410 (P0000): Invalid H3 resolution value: 18. Resolution must be between 0 (coarsest) and 15 (finest).
```

---
title: H3_POLYGON_TO_CELLS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_polygon_to_cells.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_POLYGON_TO_CELLS

Returns an [array](../data-types-semistructured.md) of INTEGER values of the IDs of [H3](../data-types-geospatial.md)
cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

See also:
:   [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md) , [H3_COVERAGE](h3_coverage.md)

## Syntax

```sqlsyntax
H3_POLYGON_TO_CELLS( <geography_polygon> , <target_resolution> )
```

## Arguments

`geography_polygon`
:   A GEOGRAPHY object that represents a Polygon.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of INTEGER values for the IDs of the H3 cells that have centroids contained in the specified input Polygon.

## Usage notes

* The function uses planar approximation, which treats points on the Earth’s surface as if they were connected by straight lines,
  rather than curved arcs. If you need a spherical approximation, use [H3_COVERAGE](h3_coverage.md) instead.
* A cell is considered to be within the Polygon if its centroid is contained by the Polygon.
* When you apply [FLATTEN](flatten.md) to the array returned by the function,
  [cast](../data-type-conversion.md) each value explicitly to an integer.

## Examples

The following example returns an ARRAY of the IDs of H3 cells that have centroids contained in the specified Polygon.

```sqlexample
SELECT H3_POLYGON_TO_CELLS(
  TO_GEOGRAPHY(
    'POLYGON((-122.481889 37.826683,-122.479487 37.808548,-122.474150 37.808904,-122.476510 37.826935,-122.481889 37.826683))'
  ),
  9) AS h3_cells_in_polygon;
```

```output
+-----------------------+
| H3_CELLS_IN_POLYGON   |
|-----------------------|
| [                     |
|   617700171176476671, |
|   617700171168874495, |
|   617700171177525247, |
|   617700171167563775, |
|   617700171225497599, |
|   617700171188011007, |
|   617700171168350207, |
|   617700171168612351, |
|   617700171167825919  |
| ]                     |
+-----------------------+
```

---
title: H3_POLYGON_TO_CELLS_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_polygon_to_cells_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_POLYGON_TO_CELLS_STRINGS

Returns an [array](../data-types-semistructured.md) of VARCHAR values of the hexadecimal IDs of
[H3](../data-types-geospatial.md) cells that have centroids contained by a Polygon
(specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

See also:
:   H3_POLYGON_TO_CELLS_STRINGS , [H3_COVERAGE_STRINGS](h3_coverage_strings.md)

## Syntax

```sqlsyntax
H3_POLYGON_TO_CELLS_STRINGS( <geography_polygon> , <target_resolution> )
```

## Arguments

`geography_polygon`
:   A GEOGRAPHY object that represents a Polygon.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of VARCHAR values for the hexadecimal IDs of the H3 cells that have centroids contained in the specified input
Polygon.

## Usage notes

* The function uses planar approximation, which treats points on the Earth’s surface as if they were connected by straight lines,
  rather than curved arcs. If you need a spherical approximation, use [H3_COVERAGE_STRINGS](h3_coverage_strings.md) instead.
* A cell is considered to be within the Polygon if its centroid is contained by the Polygon.

## Examples

The following example returns an ARRAY of VARCHAR values representing the hexadecimal IDs of H3 cells that have centroids
contained in the specified Polygon.

```sqlexample
SELECT H3_POLYGON_TO_CELLS_STRINGS(
  TO_GEOGRAPHY(
    'POLYGON((-122.481889 37.826683,-122.479487 37.808548,-122.474150 37.808904,-122.476510 37.826935,-122.481889 37.826683))'),
  9) AS h3_cells_in_polygon;
```

```output
+----------------------+
| H3_CELLS_IN_POLYGON  |
|----------------------|
| [                    |
|   "8928308715bffff", |
|   "89283087397ffff", |
|   "89283087023ffff", |
|   "892830870abffff", |
|   "89283087027ffff", |
|   "89283087033ffff", |
|   "8928308702fffff", |
|   "892830870bbffff", |
|   "89283087037ffff"  |
| ]                    |
+----------------------+
```

---
title: H3_STRING_TO_INT
source: https://docs.snowflake.com/en/sql-reference/functions/h3_string_to_int.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_STRING_TO_INT

Converts an [H3](../data-types-geospatial.md) cell ID in hexadecimal format to an INTEGER value.

See also:
:   [H3_INT_TO_STRING](h3_int_to_string.md)

## Syntax

```sqlsyntax
H3_STRING_TO_INT( <cell_id> )
```

## Arguments

`cell_id`
:   A VARCHAR that represents the cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)) in hexadecimal format.

## Returns

Returns an INTEGER value that represents the H3 cell ID.

## Examples

The following example converts an H3 cell ID from hexadecimal format to an INTEGER value.

```sqlexample
SELECT H3_STRING_TO_INT('89283087033FFFF');
```

```output
+------------------------------------------------+
|            H3_STRING_TO_INT('89283087033FFFF') |
|------------------------------------------------|
|                             617700171168612351 |
+------------------------------------------------+
```

---
title: H3_TRY_COVERAGE
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_coverage.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_COVERAGE

A special version of [H3_COVERAGE](h3_coverage.md) that returns NULL if an error occurs when it
attempts to return an [array](../data-types-semistructured.md) of IDs (as INTEGER values) identifying the minimal
set of [H3](../data-types-geospatial.md) cells that completely cover a shape (specified by a
[GEOGRAPHY](../data-types-geospatial.md) object).

## Syntax

```sqlsyntax
H3_TRY_COVERAGE( <geography_expression> , <target_resolution> )
```

## Arguments

`geography_expression`
:   A GEOGRAPHY object.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of INTEGER values or NULL.

* If the function can perform a successful calculation, returns an array of INTEGER values for the IDs of
  the minimal set of H3 cells that completely cover the specified input shape.
* If the function cannot perform a successful calculation, returns NULL without reporting an error.

## Usage notes

See [H3_COVERAGE](h3_coverage.md) for the usage notes.

## Examples

The following example attempts to return an array of IDs that identify the minimal set of [H3](../data-types-geospatial.md)
cells that completely cover a shape (specified by a [GEOGRAPHY](../data-types-geospatial.md) object). Because the array with
the cells that cover the given hexagon at the given resolution exceeds the allowed size limit, the function returns NULL.

```sqlexample
SELECT H3_TRY_COVERAGE(
  TO_GEOGRAPHY('POLYGON((-108.959 40.948,
                         -109.015 37.077,
                         -102.117 36.956,
                         -102.134 40.953,
                         -108.959 40.948))'
              ), 15) AS set_of_h3_cells_covering_polygon;
```

```output
+----------------------------------+
| SET_OF_H3_CELLS_COVERING_POLYGON |
|----------------------------------|
| NULL                             |
+----------------------------------+
```

For examples that successfully return an array of IDs, see [H3_COVERAGE](h3_coverage.md).

---
title: H3_TRY_COVERAGE_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_coverage_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_COVERAGE_STRINGS

A special version of [H3_COVERAGE_STRINGS](h3_coverage_strings.md) that returns NULL if an error
occurs when it attempts to return an [array](../data-types-semistructured.md) of hexadecimal IDs (as VARCHAR values)
identifying the minimal set of [H3](../data-types-geospatial.md) cells that completely cover a shape
(specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

## Syntax

```sqlsyntax
H3_TRY_COVERAGE_STRINGS( <geography_expression> , <target_resolution> )
```

## Arguments

`geography_expression`
:   A GEOGRAPHY object.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of VARCHAR values or NULL.

* If the function can perform a successful calculation, returns an array of VARCHAR values for the hexadecimal
  IDs of the minimal set of H3 cells that completely cover the specified input shape.
* If the function cannot perform a successful calculation, returns NULL without reporting an error.

## Usage notes

See [H3_COVERAGE_STRINGS](h3_coverage_strings.md) for the usage notes.

## Examples

The following example attempts to return an array of IDs that identify the minimal set of [H3](../data-types-geospatial.md)
cells that completely cover a shape (specified by a [GEOGRAPHY](../data-types-geospatial.md) object). Because the array with
the cells that cover the given hexagon at the given resolution exceeds the allowed size limit, the function returns NULL.

```sqlexample
SELECT H3_TRY_COVERAGE_STRINGS(
  TO_GEOGRAPHY('POLYGON((-108.959 40.948,
                         -109.015 37.077,
                         -102.117 36.956,
                         -102.134 40.953,
                         -108.959 40.948))'
              ), 15) AS set_of_h3_cells_covering_polygon;
```

```output
+----------------------------------+
| SET_OF_H3_CELLS_COVERING_POLYGON |
|----------------------------------|
| NULL                             |
+----------------------------------+
```

For examples that successfully return an array of IDs, see [H3_COVERAGE_STRINGS](h3_coverage_strings.md).

---
title: H3_TRY_GRID_DISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_grid_distance.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_GRID_DISTANCE

A special version of [H3_GRID_DISTANCE](h3_grid_distance.md) that returns NULL if an error occurs when it
attempts to return the distance between two [H3](../data-types-geospatial.md) cells.

## Syntax

```sqlsyntax
H3_TRY_GRID_DISTANCE( <cell_id_1> , <cell_id_2> )
```

## Arguments

`cell_id_1`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

`cell_id_2`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

## Returns

Returns an INTEGER value or NULL.

* If the function can perform a successful calculation, returns the INTEGER value that represents the distance in grid
  cells between the two H3 cells.
* If the grid distance cannot be calculated (for example, when two cells belong to non-neighboring
  [base cells](https://h3geo.org/docs/library/index/cell/)), returns NULL without reporting an error.

## Usage notes

See [H3_GRID_DISTANCE](h3_grid_distance.md) for the usage notes.

## Examples

The following example attempts to calculate the distance between two cells. Because the cells belong to non-neighboring
base cells, the function fails to calculate the distance and returns NULL.

```sqlexample
SELECT H3_TRY_GRID_DISTANCE(582046271372525567, 581883543651614719);
```

```output
+--------------------------------------------------------------+
| H3_TRY_GRID_DISTANCE(582046271372525567, 581883543651614719) |
|--------------------------------------------------------------|
|                                                         NULL |
+--------------------------------------------------------------+
```

For examples that successfully calculate the distance between two H3 cells, see [H3_GRID_DISTANCE](h3_grid_distance.md).

---
title: H3_TRY_GRID_PATH
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_grid_path.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_GRID_PATH

A special version of [H3_GRID_PATH](h3_grid_path.md) that returns NULL if an error occurs when it
attempts to return an array of VARIANT values that contain the IDs of the
[H3](../data-types-geospatial.md) cells that represent the line between two cells.

## Syntax

```sqlsyntax
H3_TRY_GRID_PATH( <cell_id_1> , <cell_id_2> )
```

## Arguments

`cell_id_1`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

`cell_id_2`
:   An INTEGER value that represents the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)), or a VARCHAR value that represents the cell ID
    in hexadecimal format.

## Returns

Returns a value of the ARRAY data type or NULL.

* If the function performs a successful calculation, returns an array of VARIANT values that contain the IDs of H3 cells
  that represent the line between the cells specified by `cell_id_1` and `cell_id_2`. For information about
  the format of the IDs, see [H3_GRID_PATH](h3_grid_path.md).
* If the line cannot be calculated (for example, when two cells belong to non-neighboring
  [base cells](https://h3geo.org/docs/library/index/cell/)), returns NULL without reporting an error.

## Usage notes

See [H3_GRID_PATH](h3_grid_path.md) for the usage notes.

## Examples

The following example attempts to return a line between two cells. Because the cells belong to non-neighboring
base cells, the function fails to return the line and returns NULL.

```sqlexample
SELECT H3_TRY_GRID_PATH('813d7ffffffffff', '81343ffffffffff');
```

```output
+--------------------------------------------------------+
| H3_TRY_GRID_PATH('813D7FFFFFFFFFF', '81343FFFFFFFFFF') |
|--------------------------------------------------------|
| NULL                                                   |
+--------------------------------------------------------+
```

For examples that successfully calculate the path between two H3 cells, see [H3_GRID_PATH](h3_grid_path.md).

---
title: H3_TRY_POLYGON_TO_CELLS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_polygon_to_cells.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_POLYGON_TO_CELLS

A special version of [H3_POLYGON_TO_CELLS](h3_polygon_to_cells.md) that returns NULL if an error
occurs when it attempts to return an [array](../data-types-semistructured.md) of INTEGER values of the IDs of
[H3](../data-types-geospatial.md) cells that have centroids contained by a Polygon
(specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

## Syntax

```sqlsyntax
H3_TRY_POLYGON_TO_CELLS( <geography_polygon> , <target_resolution> )
```

## Arguments

`geography_polygon`
:   A GEOGRAPHY object that represents a Polygon.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of INTEGER values or NULL.

* If the function can perform a successful calculation, returns an array of INTEGER values for the IDs of the H3 cells that
  have centroids contained in the specified input Polygon.
* If the function cannot perform a successful calculation, returns NULL without reporting an error.

## Usage notes

See [H3_POLYGON_TO_CELLS](h3_polygon_to_cells.md) for the usage notes.

## Examples

The following example attempts to return an array of INTEGER values of the IDs of [H3](../data-types-geospatial.md)
cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](../data-types-geospatial.md) object).
Because the array with the cells that cover the given hexagon at the given resolution exceeds the allowed size limit, the function
returns NULL.

```sqlexample
SELECT H3_TRY_POLYGON_TO_CELLS(
  TO_GEOGRAPHY('POLYGON((-108.959 40.948,
                         -109.015 37.077,
                         -102.117 36.956,
                         -102.134 40.953,
                         -108.959 40.948))'
              ), 15) AS h3_cells_in_polygon;
```

```output
+---------------------+
| H3_CELLS_IN_POLYGON |
|---------------------|
| NULL                |
+---------------------+
```

For examples that successfully return an array of IDs, see [H3_POLYGON_TO_CELLS](h3_polygon_to_cells.md).

---
title: H3_TRY_POLYGON_TO_CELLS_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_try_polygon_to_cells_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_TRY_POLYGON_TO_CELLS_STRINGS

A special version of [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md) that returns NULL if an error
occurs when it attempts to return an [array](../data-types-semistructured.md) of VARCHAR values of the
hexadecimal IDs of [H3](../data-types-geospatial.md) cells that have centroids contained by
a Polygon (specified by a [GEOGRAPHY](../data-types-geospatial.md) object).

## Syntax

```sqlsyntax
H3_TRY_POLYGON_TO_CELLS_STRINGS( <geography_polygon> , <target_resolution> )
```

## Arguments

`geography_polygon`
:   A GEOGRAPHY object that represents a Polygon.

`target_resolution`
:   An INTEGER between 0 and 15 (inclusive) that specifies the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns an array of VARCHAR values or NULL.

* If the function can perform a successful calculation, returns an array of VARCHAR values for the hexadecimal IDs
  of the H3 cells that have centroids contained in the specified input Polygon.
* If the function cannot perform a successful calculation, returns NULL without reporting an error.

## Usage notes

See [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md) for the usage notes.

## Examples

The following example attempts to return an array of VARCHAR values of the hexadecimal IDs of
[H3](../data-types-geospatial.md) cells that have centroids contained by a Polygon (specified by
a [GEOGRAPHY](../data-types-geospatial.md) object). Because the array with the cells that cover
the given hexagon at the given resolution exceeds the allowed size limit, the function returns NULL.

```sqlexample
SELECT H3_TRY_POLYGON_TO_CELLS_STRINGS(
  TO_GEOGRAPHY('POLYGON((-108.959 40.948,
                         -109.015 37.077,
                         -102.117 36.956,
                         -102.134 40.953,
                         -108.959 40.948))'
              ), 15) AS h3_cells_in_polygon;
```

```output
+---------------------+
| H3_CELLS_IN_POLYGON |
|---------------------|
| NULL                |
+---------------------+
```

For examples that successfully return an array of IDs, see [H3_POLYGON_TO_CELLS_STRINGS](h3_polygon_to_cells_strings.md).

---
title: H3_UNCOMPACT_CELLS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_uncompact_cells.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_UNCOMPACT_CELLS

Returns an [array](../data-types-semistructured.md) of [VARIANT](../data-types-semistructured.md) values
that contain the INTEGER IDs of [H3](../data-types-geospatial.md) cells at the specified
resolution that cover the same area as the H3 cells in the input.

## Syntax

```sqlsyntax
H3_UNCOMPACT_CELLS( <array_of_cell_ids> , <target_resolution> )
```

## Arguments

`array_of_cell_ids`
:   An array of VARIANT values that contain INTEGER values that represent H3 cell IDs ([indexes](https://h3geo.org/docs/core-library/h3Indexing)).

`target_resolution`
:   An INTEGER value between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns a value of the ARRAY data type or NULL.

* If the input is an array of VARIANT values that contain INTEGER values, returns an array of VARIANT values
  that contain the INTEGER values that represent the set of H3 cells at the specified resolution.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* All of the INTEGER values in the input must represent valid H3 cells.
* The input cells cannot have a higher resolution than the resolution specified in the
  `target_resolution` argument.

## Examples

The following example returns an uncompacted set of H3 cells that represent valid H3 cell IDs
and a target resolution of `10`.

```sqlexample
SELECT H3_UNCOMPACT_CELLS(
  [
    622236750558396415,
    617733150935089151
  ],
  10
) AS uncompacted;
```

```output
+-----------------------+
| UNCOMPACTED           |
|-----------------------|
| [                     |
|   622236750558396415, |
|   622236750562230271, |
|   622236750562263039, |
|   622236750562295807, |
|   622236750562328575, |
|   622236750562361343, |
|   622236750562394111, |
|   622236750562426879  |
| ]                     |
+-----------------------+
```

---
title: H3_UNCOMPACT_CELLS_STRINGS
source: https://docs.snowflake.com/en/sql-reference/functions/h3_uncompact_cells_strings.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# H3_UNCOMPACT_CELLS_STRINGS

Returns an [array](../data-types-semistructured.md) of [VARIANT](../data-types-semistructured.md) values
that contain the VARCHAR hexadecimal IDs of [H3](../data-types-geospatial.md)
cells at the specified resolution that cover the same area as the H3 cells in the input.

## Syntax

```sqlsyntax
H3_UNCOMPACT_CELLS_STRINGS( <array_of_cell_ids> , <target_resolution> )
```

## Arguments

`array_of_cell_ids`
:   An array of VARIANT values that contain VARCHAR hexadecimal values that represent H3 cell IDs ([indexes](https://h3geo.org/docs/core-library/h3Indexing)).

`target_resolution`
:   An INTEGER value between 0 and 15 (inclusive) specifying the H3 [resolution](https://h3geo.org/docs/core-library/restable) that you want to use for the returned H3 cells.

    Specifying any other INTEGER value results in an error.

## Returns

Returns a value of the ARRAY data type or NULL.

* If the input is an array of VARIANT values that contain VARCHAR hexadecimal values, returns an array of VARIANT values
  that contain the VARCHAR hexadecimal values that represent the set of H3 cells at the specified resolution.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* All of the VARCHAR hexadecimal values in the input must represent valid H3 cells.
* The input cells cannot have a higher resolution than the resolution specified in the
  `target_resolution` argument.

## Examples

The following example returns an uncompacted set of H3 cells that represent valid H3 cell IDs
and a target resolution of `10`.

```sqlexample
SELECT H3_UNCOMPACT_CELLS_STRINGS(
  [
    '8a2a1072339ffff',
    '892a1072377ffff'
  ],
  10
) AS uncompacted;
```

```output
+----------------------+
| UNCOMPACTED          |
|----------------------|
| [                    |
|   "8a2a1072339ffff", |
|   "8a2a10723747fff", |
|   "8a2a1072374ffff", |
|   "8a2a10723757fff", |
|   "8a2a1072375ffff", |
|   "8a2a10723767fff", |
|   "8a2a1072376ffff", |
|   "8a2a10723777fff"  |
| ]                    |
+----------------------+
```

---
title: HASH
source: https://docs.snowflake.com/en/sql-reference/functions/hash.md
section: SQL Functions
---

Categories:
:   [Hash functions](../functions-hash-scalar.md)

# HASH

Returns a signed 64-bit hash value. Note that HASH never returns NULL, even for NULL inputs.

Possible uses for the HASH function include:

* Convert skewed data values to values that are likely to be more randomly or more evenly distributed.

  For example, you can hash a group of highly skewed values and generate a set of values that are more likely to be randomly distributed or evenly distributed.
* Put data in buckets. Because hashing can convert skewed data values to closer-to-evenly distributed values, you can use hashing to help take skewed values and
  create approximately evenly-sized buckets.

  If hashing alone is not sufficient to get the number of distinct buckets that you want, you can combine hashing with the [ROUND](round.md) or [WIDTH_BUCKET](width_bucket.md)
  functions.

> **Note:**
>
> HASH is a proprietary function that accepts a variable number of input expressions of arbitrary types and returns a signed value. It is not a
> cryptographic hash function and should not be used as such.
>
> Cryptographic hash functions have a few properties which this function does not, for example:
>
> * The cryptographic hashing of a value cannot be inverted to find the original value.
> * Given a value, it is infeasible to find another value with the same cryptographic hash.
>
> For cryptographic purposes, use the SHA families of functions (in [String & binary functions](../functions-string.md)).

See also:
:   [HASH_AGG](hash_agg.md)

## Syntax

```sqlsyntax
HASH( <expr> [ , <expr> ... ] )

HASH(*)
```

## Arguments

`expr`
:   The expression can be a general expression of any Snowflake data type.

`*`
:   Returns a single hashed value based on all columns in each record,
    including records with NULL values.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

## Returns

Returns a signed 64-bit value as NUMBER(19,0).

HASH never returns NULL, even for NULL inputs.

## Usage notes

* HASH is stable in the sense that it guarantees:

  + Any two values of type NUMBER that compare equally will hash to the same hash value, even if the
    respective types have different precision and/or scale.
  + Any two values of type FLOAT that can be converted to NUMBER(38, 0) without loss of precision will
    hash to the same value. For example, the following all return the same hash value:

    - `HASH(10::NUMBER(38,0))`
    - `HASH(10::NUMBER(5,3))`
    - `HASH(10::FLOAT)`
  + Any two values of type TIMESTAMP_TZ that compare equally will hash to the same hash value, even if
    the timestamps are from different time zones.
  + This guarantee also applies to NUMBER, FLOAT, and TIMESTAMP_TZ values within a VARIANT column.
  + Note that this guarantee does not apply to other combinations of types, even if implicit conversions exist
    between the types. For example, with overwhelming probability, the following will not return the same hash values
    even though `10 = '10'` after implicit conversion:

    - `HASH(10)`
    - `HASH('10')`
* `HASH(*)` means to create a single hashed value based on all columns in the row.
* Do not use HASH to create unique keys. HASH has a finite resolution of 64 bits, and is guaranteed to return
  non-unique values if more than 2^64 values are entered (e.g. for a table with more than 2^64 rows). In practice, if
  the input is on the order of 2^32 rows (approximately 4 billion rows) or more, the function is reasonably likely
  to return at least one duplicate value.

## Collation details

No impact.

* Two strings that are identical but have different collation specifications have the same hash value. In other words,
  only the string, not the collation specification, affects the hash value.
* Two strings that are different, but compare equal according to a collation, might have a different hash value. For
  example, two strings that are identical using punctuation-insensitive collation will normally have different hash
  values because only the string, not the collation specification, affects the hash value.

## Examples

```sqlexample
SELECT HASH(SEQ8()) FROM TABLE(GENERATOR(rowCount=>10));
```

```output
+----------------------+
|         HASH(SEQ8()) |
|----------------------|
| -6076851061503311999 |
| -4730168494964875235 |
| -3690131753453205264 |
| -7287585996956442977 |
| -1285360004004520191 |
|  4801857165282451853 |
| -2112898194861233169 |
|  1885958945512144850 |
| -3994946021335987898 |
| -3559031545629922466 |
+----------------------+
```

```sqlexample
SELECT HASH(10), HASH(10::number(38,0)), HASH(10::number(5,3)), HASH(10::float);
```

```output
+---------------------+------------------------+-----------------------+---------------------+
|            HASH(10) | HASH(10::NUMBER(38,0)) | HASH(10::NUMBER(5,3)) |     HASH(10::FLOAT) |
|---------------------+------------------------+-----------------------+---------------------|
| 1599627706822963068 |    1599627706822963068 |   1599627706822963068 | 1599627706822963068 |
+---------------------+------------------------+-----------------------+---------------------+
```

```sqlexample
SELECT HASH(10), HASH('10');
```

```output
+---------------------+---------------------+
|            HASH(10) |          HASH('10') |
|---------------------+---------------------|
| 1599627706822963068 | 3622494980440108984 |
+---------------------+---------------------+
```

```sqlexample
SELECT HASH(null), HASH(null, null), HASH(null, null, null);
```

```output
+---------------------+--------------------+------------------------+
|          HASH(NULL) |   HASH(NULL, NULL) | HASH(NULL, NULL, NULL) |
|---------------------+--------------------+------------------------|
| 8817975702393619368 | 953963258351104160 |    2941948363845684412 |
+---------------------+--------------------+------------------------+
```

The example below shows that even if the table contains multiple columns, `HASH(*)` returns a single value per row.

```sqlexample
CREATE TABLE orders (order_ID INTEGER, customer_ID INTEGER, order_date ...);

...

SELECT HASH(*) FROM orders LIMIT 10;
```

```output
+-----------------------+
|        HASH(*)        |
|-----------------------|
|  -3527903796973745449 |
|  6296330861892871310  |
|  6918165900200317484  |
|  -2762842444336053314 |
|  -2340602249668223387 |
|  5248970923485160358  |
|  -5807737826218607124 |
|  428973568495579456   |
|  2583438210124219420  |
|  4041917286051184231  |
+ ----------------------+
```

---
title: HASH_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/hash_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) , [Window functions](../functions-window.md)

# HASH_AGG

Returns an aggregate signed 64-bit hash value over the (unordered) set of input rows. HASH_AGG never returns NULL, even if no input is provided. Empty input “hashes” to `0`.

One use for aggregate hash functions is to detect changes to a set of values without comparing the individual old and new values. HASH_AGG can compute a single hash value
based on many inputs; almost any change to one of the inputs is likely to result in a change to the output of the HASH_AGG function. Comparing two lists of values typically
requires sorting both lists, but HASH_AGG produces the same value regardless of the order of the inputs. Because the values don’t need to be sorted for HASH_AGG,
performance is typically much faster.

> **Note:**
>
> HASH_AGG is *not* a cryptographic hash function and should not be used as such.
>
> For cryptographic purposes, use the SHA family of functions (in [String & binary functions](../functions-string.md)).

See also:
:   [HASH](hash.md)

## Syntax

**Aggregate function**

```sqlsyntax
HASH_AGG( [ DISTINCT ] <expr> [ , <expr2> ... ] )

HASH_AGG(*)
```

**Window function**

```sqlsyntax
HASH_AGG( [ DISTINCT ] <expr> [ , <expr2> ... ] ) OVER ( [ PARTITION BY <expr3> ] )

HASH_AGG(*) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`exprN`
:   The expression can be a general expression of any Snowflake data type, except
    [GEOGRAPHY](../data-types-geospatial.md) and [GEOMETRY](../data-types-geospatial.md).

`expr2`
:   You can include additional expressions.

`expr3`
:   The column to partition on, if you want the result to be split into multiple
    windows.

`*`
:   Returns an aggregated hash value over all columns for all records, including records with
    NULL values. You can specify the wildcard for both the aggregate function and the window
    function.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

## Returns

Returns a signed 64-bit value as NUMBER(19,0).

HASH_AGG never returns NULL, even for NULL inputs.

## Usage notes

* HASH_AGG computes a “fingerprint” over an entire table, query result, or window. Any change to the input will
  influence the result of HASH_AGG with overwhelming probability. This can be used to quickly detect changes to table
  contents or query results.

  Note that it is possible, though very unlikely, that two different input tables will produce the same result for HASH_AGG. If you need to make sure that two tables or query results that
  produce the same HASH_AGG result really contain the same data, you must still compare the data for equality (for example, by using the MINUS operator). For more details, see
  [Set operators](../operators-query.md).
* HASH_AGG is *not* order-sensitive (that is, the order of rows in an input table or query result does not influence the result of HASH_AGG). However, changing the order of input columns
  *does* change the result.
* HASH_AGG hashes individual input rows using the [HASH](hash.md) function. The salient features of this function carry over to HASH_AGG. In particular, HASH_AGG is “stable” in the sense
  that any two rows that compare as equal and have compatible types are guaranteed to hash to the same value (that is, they influence the result of HASH_AGG in the same way).

  For example, changing the scale and precision of a column that is part of some table doesn’t change the result of HASH_AGG over that table. See [HASH](hash.md) for details.
* In contrast to most other aggregate functions, HASH_AGG doesn’t ignore NULL inputs (that is, NULL inputs influence the result of HASH_AGG).
* For both the aggregate function and the window function, duplicate rows, including duplicate all-NULL rows,
  influence the result. The DISTINCT keyword can be used to suppress the effect of duplicate rows.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Collation details

* Two strings that are identical but have different collation specifications have the same hash value. In other words,
  only the string, not the collation specification, affects the hash value.
* Two strings that are different, but compare as equal according to a collation, might have a different hash value. For
  example, two strings that are identical using punctuation-insensitive collation normally have different hash
  values because only the string, not the collation specification, affects the hash value.

## Examples

This example shows that NULLs are not ignored:

```sqlexample
SELECT HASH_AGG(NULL), HASH_AGG(NULL, NULL), HASH_AGG(NULL, NULL, NULL);
```

```output
+----------------------+----------------------+----------------------------+
|       HASH_AGG(NULL) | HASH_AGG(NULL, NULL) | HASH_AGG(NULL, NULL, NULL) |
|----------------------+----------------------+----------------------------|
| -5089618745711334219 |  2405106413361157177 |       -5970411136727777524 |
+----------------------+----------------------+----------------------------+
```

This example shows that empty input hashes to `0`:

```sqlexample
SELECT HASH_AGG(NULL) WHERE 0 = 1;
```

```output
+----------------+
| HASH_AGG(NULL) |
|----------------|
|              0 |
+----------------+
```

Use HASH_AGG(\*) to conveniently aggregate over all input columns:

```sqlexample
SELECT HASH_AGG(*) FROM orders;
```

```output
+---------------------+
|     HASH_AGG(*)     |
|---------------------|
| 1830986524994392080 |
+---------------------+
```

This example shows that grouped aggregation is supported:

```sqlexample
SELECT YEAR(o_orderdate), HASH_AGG(*)
  FROM ORDERS GROUP BY 1 ORDER BY 1;
```

```output
+-------------------+----------------------+
| YEAR(O_ORDERDATE) |     HASH_AGG(*)      |
|-------------------+----------------------|
| 1992              | 4367993187952496263  |
| 1993              | 7016955727568565995  |
| 1994              | -2863786208045652463 |
| 1995              | 1815619282444629659  |
| 1996              | -4747088155740927035 |
| 1997              | 7576942849071284554  |
| 1998              | 4299551551435117762  |
+-------------------+----------------------+
```

This example suppresses duplicate rows using DISTINCT (duplicate rows influence results of HASH_AGG):

```sqlexample
SELECT YEAR(o_orderdate), HASH_AGG(o_custkey, o_orderdate)
  FROM orders GROUP BY 1 ORDER BY 1;
```

```output
+-------------------+----------------------------------+
| YEAR(O_ORDERDATE) | HASH_AGG(O_CUSTKEY, O_ORDERDATE) |
|-------------------+----------------------------------|
| 1992              | 5686635209456450692              |
| 1993              | -6250299655507324093             |
| 1994              | 6630860688638434134              |
| 1995              | 6010861038251393829              |
| 1996              | -767358262659738284              |
| 1997              | 6531729365592695532              |
| 1998              | 2105989674377706522              |
+-------------------+----------------------------------+
```

```sqlexample
SELECT YEAR(o_orderdate), HASH_AGG(DISTINCT o_custkey, o_orderdate)
  FROM orders GROUP BY 1 ORDER BY 1;
```

```output
+-------------------+-------------------------------------------+
| YEAR(O_ORDERDATE) | HASH_AGG(DISTINCT O_CUSTKEY, O_ORDERDATE) |
|-------------------+-------------------------------------------|
| 1992              | -8416988862307613925                      |
| 1993              | 3646533426281691479                       |
| 1994              | -7562910554240209297                      |
| 1995              | 6413920023502140932                       |
| 1996              | -3176203653000722750                      |
| 1997              | 4811642075915950332                       |
| 1998              | 1919999828838507836                       |
+-------------------+-------------------------------------------+
```

This example computes the number of days on which the corresponding sets of customers with orders with status not equal `'F'` and status not equal `'P'`, respectively, are identical:

```sqlexample
SELECT COUNT(DISTINCT o_orderdate) FROM orders;
```

```output
+-----------------------------+
| COUNT(DISTINCT O_ORDERDATE) |
|-----------------------------|
| 2406                        |
+-----------------------------+
```

```sqlexample
SELECT COUNT(o_orderdate)
  FROM (SELECT o_orderdate, HASH_AGG(DISTINCT o_custkey)
    FROM orders
    WHERE o_orderstatus <> 'F'
    GROUP BY 1
    INTERSECT
      SELECT o_orderdate, HASH_AGG(DISTINCT o_custkey)
        FROM orders
        WHERE o_orderstatus <> 'P'
        GROUP BY 1);
```

```output
+--------------------+
| COUNT(O_ORDERDATE) |
|--------------------|
| 1143               |
+--------------------+
```

The query doesn’t account for the possibility of hash collisions, so the actual number of days might be slightly lower.

---
title: HAVERSINE
source: https://docs.snowflake.com/en/sql-reference/functions/haversine.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# HAVERSINE

Calculates the great-circle distance in kilometers between two points on the
Earth’s surface, using the [Haversine formula](https://en.wikipedia.org/wiki/Haversine_formula).
The two points are specified by their latitude and longitude in decimal degrees.

> **Note:**
>
> Snowflake recommends using the [ST_DISTANCE](st_distance.md) function instead of the HAVERSINE function.
> The ST_DISTANCE function performs the calculation using values of geospatial types, which
> enables you to store geospatial data and use the [geospatial functions](../functions-geospatial.md)
> on the data. In addition, join predicates that use the ST_DISTANCE function perform better than join predicates
> that use the HAVERSINE function.

## Syntax

```sqlsyntax
HAVERSINE( <lat1>, <lon1>, <lat2>, <lon2> )
```

## Arguments

`lat1`
:   The latitude of the first point in decimal degrees.

`lon1`
:   The longitude of the first point in decimal degrees.

`lat2`
:   The latitude of the second point in decimal degrees.

`lon2`
:   The longitude of the second point in decimal degrees.

## Returns

This function returns a value of type FLOAT.

## Examples

The following example returns the geospatial distance in kilometers between New York and Los Angeles:

```sqlexample
SELECT HAVERSINE(
    40.7127,
    -74.0059,
    34.0500,
    -118.2500
  ) AS distance_in_kilometers;
```

```output
+------------------------+
| DISTANCE_IN_KILOMETERS |
|------------------------|
|         3936.385096389 |
+------------------------+
```

The following example is the same as the previous example, but it returns the geospatial distance
in meters instead of kilometers by multiplying the result by 1000:

```sqlexample
SELECT HAVERSINE(
    40.7127,
    -74.0059,
    34.0500,
    -118.2500
  ) * 1000 AS distance_in_meters;
```

```output
+--------------------+
| DISTANCE_IN_METERS |
|--------------------|
|   3936385.09638929 |
+--------------------+
```

---
title: HEX_DECODE_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/hex_decode_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# HEX_DECODE_BINARY

Decodes a hex-encoded string to a binary.

See also:
:   [TRY_HEX_DECODE_BINARY](try_hex_decode_binary.md)

## Syntax

```sqlsyntax
HEX_DECODE_BINARY(<input>)
```

## Arguments

`input`
:   A string expression containing only hexadecimal digits. Typically, this
    input string is generated by calling the function
    [HEX_ENCODE](hex_encode.md).

## Returns

A `BINARY` value that can, for example, be inserted into a column of type
`BINARY`.

## Examples

Start with a string; encode it as characters representing hexadecimal digits;
then convert those hex digit characters to BINARY using `HEX_DECODE_BINARY`:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE binary_table (v VARCHAR, b BINARY);
> > INSERT INTO binary_table (v, b)
> >     SELECT 'HELLO', HEX_DECODE_BINARY(HEX_ENCODE('HELLO'));
> > ```
>
> Now retrieve the BINARY value and display it as the original string (in
> the 3rd column of the output):
>
> > ```sqlexample
> > SELECT v, b, HEX_DECODE_STRING(TO_VARCHAR(b)) FROM binary_table;
> > +-------+------------+----------------------------------+
> > | V     | B          | HEX_DECODE_STRING(TO_VARCHAR(B)) |
> > |-------+------------+----------------------------------|
> > | HELLO | 48454C4C4F | HELLO                            |
> > +-------+------------+----------------------------------+
> > ```

Decode a hex-encoded binary (output by MD5_BINARY):

```sqlexample
SELECT HEX_DECODE_BINARY(HEX_ENCODE(MD5_BINARY('Snowflake')));

--------------------------------------------------------+
 HEX_DECODE_BINARY(HEX_ENCODE(MD5_BINARY('SNOWFLAKE'))) |
--------------------------------------------------------+
 EDF1439075A83A447FB8B630DDC9C8DE                       |
--------------------------------------------------------+
```

---
title: HEX_DECODE_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/hex_decode_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# HEX_DECODE_STRING

Decodes a hex-encoded string to a string.

See also:
:   [TRY_HEX_DECODE_STRING](try_hex_decode_string.md)

## Syntax

```sqlsyntax
HEX_DECODE_STRING(<input>)
```

## Arguments

`input`
:   A hex-encoded string expression. Typically the input was created by a
    call to [HEX_ENCODE](hex_encode.md).

## Returns

The returned value is a string (VARCHAR).

## Examples

The following decodes a sequence of hexadecimal digits into the corresponding
word:

```sqlexample
SELECT HEX_DECODE_STRING('536E6F77666C616B65');

-----------------------------------------+
 HEX_DECODE_STRING('536E6F77666C616B65') |
-----------------------------------------+
 Snowflake                               |
-----------------------------------------+
```

The hexadecimal digits A-F can be uppercase or lowercase. The following
statement uses lowercase letters but produces the same result as the
preceding statement:

```sqlexample
SELECT HEX_DECODE_STRING('536e6f77666c616b65');

-----------------------------------------+
 HEX_DECODE_STRING('536E6F77666C616B65') |
-----------------------------------------+
 Snowflake                               |
-----------------------------------------+
```

This shows another example of using `HEX_DECODE_STRING`:

> Create a table and data:
>
> > ```sqlexample
> > CREATE TABLE binary_table (v VARCHAR, b BINARY);
> > INSERT INTO binary_table (v, b)
> >     SELECT 'HELLO', HEX_DECODE_BINARY(HEX_ENCODE('HELLO'));
> > ```
>
> Now run a query to show that we can retrieve the data:
>
> > ```sqlexample
> > SELECT v, b, HEX_DECODE_STRING(TO_VARCHAR(b)) FROM binary_table;
> > +-------+------------+----------------------------------+
> > | V     | B          | HEX_DECODE_STRING(TO_VARCHAR(B)) |
> > |-------+------------+----------------------------------|
> > | HELLO | 48454C4C4F | HELLO                            |
> > +-------+------------+----------------------------------+
> > ```

---
title: HEX_ENCODE
source: https://docs.snowflake.com/en/sql-reference/functions/hex_encode.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# HEX_ENCODE

Encodes the input using hexadecimal (also ‘hex’ or ‘base16’) encoding.
The result is comprised of 16 different symbols: The numbers ‘0’ to ‘9’ as
well as the letters ‘A’ to ‘F’ (or ‘a’ to ‘f’, see below).

See also:
:   [HEX_DECODE_BINARY](hex_decode_binary.md) , [HEX_DECODE_STRING](hex_decode_string.md)

## Syntax

```sqlsyntax
HEX_ENCODE(<input> [, <case>])
```

## Arguments

**Required:**

`input`
:   A binary or string expression to be encoded.

**Optional:**

`case`
:   This optional boolean argument controls the case of the letters
    (‘A’, ‘B’, ‘C’, ‘D’, ‘E’ and ‘F’) used in the encoding.
    The default value is `1` and indicates that uppercase
    letters are used. The value `0` indicates that lowercase
    letters are used. All other values are illegal and result
    in an error.

## Returns

This returns a string that contains only hexadecimal digits.

## Examples

Encode a string:

```sqlexample
SELECT HEX_ENCODE('Snowflake');

-------------------------+
 HEX_ENCODE('SNOWFLAKE') |
-------------------------+
 536E6F77666C616B65      |
-------------------------+
```

Encode a string using lowercase letters:

```sqlexample
SELECT HEX_ENCODE('Snowflake',0);

---------------------------+
 HEX_ENCODE('SNOWFLAKE',0) |
---------------------------+
 536e6f77666c616b65        |
---------------------------+
```

---
title: HLL
source: https://docs.snowflake.com/en/sql-reference/functions/hll.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) , [Window functions](../functions-window.md)

# HLL

Uses HyperLogLog to return an approximation of the distinct cardinality of the input (i.e. `HLL(col1, col2, ... )` returns an approximation of `COUNT(DISTINCT col1, col2, ... )`).

For more information about HyperLogLog, see [Estimating the Number of Distinct Values](../../user-guide/querying-approximate-cardinality.md).

Aliases:
:   [APPROX_COUNT_DISTINCT](approx_count_distinct.md).

See also:
:   [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_COMBINE](hll_combine.md) , [HLL_ESTIMATE](hll_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL( [ DISTINCT ] <expr1> [ , ... ] )

HLL(*)
```

**Window function**

```sqlsyntax
HLL( [ DISTINCT ] <expr1> [ , ... ] ) OVER ( [ PARTITION BY <expr2> ] )

HLL(*) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This is the expression for which you want to know the number of distinct values.

`expr2`
:   This is the optional expression used to group rows into partitions.

## Returns

The data type of the returned value is INTEGER.

## Usage notes

* `DISTINCT` can be included as an argument, but has no effect.
* For information about NULL values and aggregate functions, see [Aggregate functions and NULL values](../functions-aggregation.md).
* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

This example shows how to use HLL and its alias APPROX_COUNT_DISTINCT. This example calls
both `COUNT(DISTINCT i)` and `APPROX_COUNT_DISTINCT(i)` to emphasize
that the results of these two functions do not always match exactly.

The exact output from the following query might vary because APPROX_COUNT_DISTINCT() returns an approximation, not an exact value.

```sqlexample
SELECT COUNT(i), COUNT(DISTINCT i), APPROX_COUNT_DISTINCT(i), HLL(i)
  FROM sequence_demo;
```

```output
+----------+-------------------+--------------------------+--------+
| COUNT(I) | COUNT(DISTINCT I) | APPROX_COUNT_DISTINCT(I) | HLL(I) |
|----------+-------------------+--------------------------+--------|
|     1024 |              1024 |                     1007 |   1007 |
+----------+-------------------+--------------------------+--------+
```

---
title: HLL_ACCUMULATE
source: https://docs.snowflake.com/en/sql-reference/functions/hll_accumulate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) ,
    [Window functions](../functions-window-syntax.md) (Cardinality Estimation)

# HLL_ACCUMULATE

Returns the HyperLogLog state at the end of aggregation.

For more information about HyperLogLog, see [Estimating the Number of Distinct Values](../../user-guide/querying-approximate-cardinality.md).

[HLL](hll.md) discards its intermediate state when the final cardinality estimate is returned. In advanced use cases, such as incremental cardinality estimation during bulk loading, one may want to keep the intermediate state. The
intermediate state can later be combined (merged) with other intermediate states, or can be exported to external tools.

In contrast to [HLL](hll.md), HLL_ACCUMULATE does not return a cardinality estimate. Instead, it skips the final estimation step and returns the algorithm state itself. The state is a binary of at most 4096 Bytes. For more information,
see [Estimating the Number of Distinct Values](../../user-guide/querying-approximate-cardinality.md).

See also:
:   [HLL_COMBINE](hll_combine.md) , [HLL_ESTIMATE](hll_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL_ACCUMULATE( [ DISTINCT ] <expr> )

HLL_ACCUMULATE(*)
```

**Window function**

```sqlsyntax
HLL_ACCUMULATE( [ DISTINCT ] <expr> ) OVER ( [ PARTITION BY <expr1> ] )

HLL_ACCUMULATE(*) OVER ( [ PARTITION BY <expr1> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr`
:   The expression for which you want to estimate cardinality (number of
    distinct values). This is typically a column name, but can be a more
    general expression.

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).
* DISTINCT is supported syntactically, but has no effect.

## Examples

This shows one step towards estimating the number of distinct postal codes in
province(s) of Canada. In this step, we calculate the approximate number of
distinct postal codes in Manitoba and store an internal representation
of the “state” of the calculation, which we can later combine with similar
information for other provinces:

```sqlexample
CREATE TABLE temporary_hll_state_for_manitoba AS
  SELECT HLL_ACCUMULATE(postal_code) AS h_a_p_c
    FROM postal_data
    WHERE province = 'Manitoba';
```

Here is another example. This example shows how to use the three related functions
HLL_ACCUMULATE, HLL_ESTIMATE, and HLL_COMBINE.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq92;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq92.nextval, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate cardinality information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT HLL_ACCUMULATE(c1) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT HLL_ACCUMULATE(c1) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT HLL_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate cardinality of the combined set of rows:

```sqlexample
SELECT HLL_ESTIMATE(c1)
  FROM combined_resultstate;
```

```output
+------------------+
| HLL_ESTIMATE(C1) |
|------------------|
|               12 |
+------------------+
```

---
title: HLL_COMBINE
source: https://docs.snowflake.com/en/sql-reference/functions/hll_combine.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) ,
    [Window functions](../functions-window-syntax.md) (Cardinality Estimation)

# HLL_COMBINE

Combines (merges) input states into a single output state.

This allows scenarios where [HLL_ACCUMULATE](hll_accumulate.md) is run over horizontal
partitions of the same table, producing an algorithm state for each table
partition. These states can later be combined using HLL_COMBINE,
producing the same output state as a single run of [HLL_ACCUMULATE](hll_accumulate.md)
over the entire table.

See also:
:   [HLL](hll.md) , [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_ESTIMATE](hll_estimate.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL_COMBINE( [ DISTINCT ] <state> )
```

**Window function**

```sqlsyntax
HLL_COMBINE( [ DISTINCT ] <state> ) OVER ( [ PARTITION BY <expr1> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [HLL_ACCUMULATE](hll_accumulate.md).

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).
* DISTINCT is supported syntactically, but has no effect.
* The output of this function is not fully deterministic. Running this
  function on the same inputs might return different results at different
  times. The differences are typically small and are consistent with the fact
  that the HLL_\* functions are approximation functions.

## Examples

This example shows how to use the three related functions
HLL_ACCUMULATE, HLL_ESTIMATE, and HLL_COMBINE.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq92;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq92.nextval, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate cardinality information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT HLL_ACCUMULATE(c1) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT HLL_ACCUMULATE(c1) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT HLL_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate cardinality of the combined set of rows:

```sqlexample
SELECT HLL_ESTIMATE(c1)
  FROM combined_resultstate;
```

```output
+------------------+
| HLL_ESTIMATE(C1) |
|------------------|
|               12 |
+------------------+
```

---
title: HLL_ESTIMATE
source: https://docs.snowflake.com/en/sql-reference/functions/hll_estimate.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) ,
    [Window functions](../functions-window-syntax.md) (Cardinality Estimation)

# HLL_ESTIMATE

Returns the cardinality estimate for the given HyperLogLog state.

A HyperLogLog state produced by [HLL_ACCUMULATE](hll_accumulate.md) and [HLL_COMBINE](hll_combine.md) can be used to compute a cardinality estimate using the HLL_ESTIMATE function.

Thus, HLL_ESTIMATE(HLL_ACCUMULATE(…)) is equivalent to HLL(…).

See also:
:   [HLL](hll.md) , [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_COMBINE](hll_combine.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL_ESTIMATE( <state> )
```

**Window function**

```sqlsyntax
HLL_ESTIMATE( <state> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`state`
:   An expression that contains state information generated
    by a call to [HLL_ACCUMULATE](hll_accumulate.md) or [HLL_COMBINE](hll_combine.md).

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).

## Examples

This example shows how to use the three related functions
HLL_ACCUMULATE, HLL_ESTIMATE, and HLL_COMBINE.

Create a simple table and data:

```sqlexample
CREATE OR REPLACE SEQUENCE seq92;
CREATE OR REPLACE TABLE sequence_demo (c1 INTEGER DEFAULT seq92.nextval, dummy SMALLINT);
INSERT INTO sequence_demo (dummy) VALUES (0);

INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
```

Create a table that contains the “state” that represents the current
approximate cardinality information for the table named `sequence_demo`:

```sqlexample
CREATE OR REPLACE TABLE resultstate1 AS (
  SELECT HLL_ACCUMULATE(c1) AS rs1
    FROM sequence_demo);
```

Now create a second table and add data. (In a more realistic situation,
the user could have loaded more data into the first table and divided the
data into non-overlapping sets based on the time that the data was loaded.)

```sqlexample
CREATE OR REPLACE TABLE test_table2 (c1 INTEGER);
INSERT INTO test_table2 (c1) SELECT c1 + 4 FROM sequence_demo;
```

Get the “state” information for just the new data.

```sqlexample
CREATE OR REPLACE TABLE resultstate2 AS
  (SELECT HLL_ACCUMULATE(c1) AS rs1
     FROM test_table2);
```

Combine the “state” information for the two batches of rows:

```sqlexample
CREATE OR REPLACE TABLE combined_resultstate (c1) AS
  SELECT HLL_COMBINE(rs1) AS apc1
    FROM (
      SELECT rs1 FROM resultstate1
      UNION ALL
      SELECT rs1 FROM resultstate2
    );
```

Get the approximate cardinality of the combined set of rows:

```sqlexample
SELECT HLL_ESTIMATE(c1)
  FROM combined_resultstate;
```

```output
+------------------+
| HLL_ESTIMATE(C1) |
|------------------|
|               12 |
+------------------+
```

---
title: HLL_EXPORT
source: https://docs.snowflake.com/en/sql-reference/functions/hll_export.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) ,
    [Window functions](../functions-window-syntax.md) (Cardinality Estimation)

# HLL_EXPORT

Converts input in BINARY format to OBJECT format.

The HyperLogLog states operated on by HLL_ACCUMULATE, HLL_COMBINE, and HLL_ESTIMATE are in a proprietary binary format that may change in future versions of Snowflake. For long-term storage of HyperLogLog states, and for integration
with external tools, Snowflake supports converting states from the BINARY format to an OBJECT (which can be printed and exported as JSON), and vice versa.

See also:
:   [HLL](hll.md) , [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_ESTIMATE](hll_estimate.md) , [HLL_IMPORT](hll_import.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL_EXPORT( <binary_expr> )
```

**Window function**

```sqlsyntax
HLL_EXPORT( <binary_expr> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`binary_expr`
:   An expression that evaluates to a HyperLogLog state in BINARY format.

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).

## Examples

```sqlexample
SELECT HLL(o_orderdate), HLL_ESTIMATE(HLL_IMPORT(HLL_EXPORT(HLL_ACCUMULATE(o_orderdate))))
FROM orders;

------------------+-------------------------------------------------------------------+
 HLL(O_ORDERDATE) | HLL_ESTIMATE(HLL_IMPORT(HLL_EXPORT(HLL_ACCUMULATE(O_ORDERDATE)))) |
------------------+-------------------------------------------------------------------+
 2398             | 2398                                                              |
------------------+-------------------------------------------------------------------+
```

---
title: HLL_IMPORT
source: https://docs.snowflake.com/en/sql-reference/functions/hll_import.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Cardinality Estimation) ,
    [Window functions](../functions-window-syntax.md) (Cardinality Estimation)

# HLL_IMPORT

Converts input in OBJECT format to BINARY format.

The HyperLogLog states operated on by HLL_ACCUMULATE, HLL_COMBINE, and HLL_ESTIMATE are in a proprietary binary format that may change in future versions of Snowflake. For long-term storage of HyperLogLog states, and for integration
with external tools, Snowflake supports using HLL_IMPORT to convert states from an OBJECT format to BINARY, and vice versa.

See also:
:   [HLL](hll.md) , [HLL_ACCUMULATE](hll_accumulate.md) , [HLL_ESTIMATE](hll_estimate.md) , [HLL_EXPORT](hll_export.md)

## Syntax

**Aggregate function**

```sqlsyntax
HLL_IMPORT( <obj> )
```

**Window function**

```sqlsyntax
HLL_IMPORT( <obj> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`obj`
:   An expression that evaluates to a HyperLogLog state in OBJECT format.

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).

## Examples

See examples for [HLL_EXPORT](hll_export.md).

---
title: HOUR / MINUTE / SECOND
source: https://docs.snowflake.com/en/sql-reference/functions/hour-minute-second.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# HOUR / MINUTE / SECOND

Extracts the corresponding time part from a time, interval, or timestamp value.

These functions are alternatives to using the [DATE_PART](date_part.md) (or [EXTRACT](extract.md)) function with the
equivalent time part (see [Supported date and time parts](../functions-date-time.md)).

See also:
:   [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](year.md)

## Syntax

```sqlsyntax
HOUR( <time_interval_or_timestamp_expr> )

MINUTE( <time_interval_or_timestamp_expr> )

SECOND( <time_interval_or_timestamp_expr> )
```

## Arguments

`time_interval_or_timestamp_expr`
:   A time, an interval, or a timestamp, or an expression that can be evaluated to one of those data types.
    An interval argument must be a day and time interval, not a year and month interval.

## Returns

This function returns a value of type NUMBER.

## Usage notes

| Function name | Time part extracted from time, interval, or timestamp | Possible values |
| --- | --- | --- |
| HOUR | Hour of the specified day | 0 to 23 |
| MINUTE | Minute of the specified hour | 0 to 59 |
| SECOND | Second of the specified minute | 0 to 59 |

> **Tip:**
>
> To extract a full TIME value from a TIMESTAMP value instead of a part, you can cast the
> TIMESTAMP value to a TIME value. For example:
>
> ```sqlexample
> SELECT '2025-04-08T23:39:20.123-07:00'::TIMESTAMP::TIME AS full_time_value;
> ```
>
> ```output
> +-----------------+
> | FULL_TIME_VALUE |
> |-----------------|
> | 23:39:20        |
> +-----------------+
> ```

## Examples

This example demonstrates the HOUR, MINUTE, and SECOND functions:

```sqlexample
SELECT '2025-04-08T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       HOUR(tstamp) AS "HOUR",
       MINUTE(tstamp) AS "MINUTE",
       SECOND(tstamp) AS "SECOND";
```

```output
+-------------------------+------+--------+--------+
| TSTAMP                  | HOUR | MINUTE | SECOND |
|-------------------------+------+--------+--------|
| 2025-04-08 23:39:20.123 |   23 |     39 |     20 |
+-------------------------+------+--------+--------+
```

For more examples, see [Working with date and time values](../date-time-examples.md).

---
title: ICEBERG_TABLE_FILES
source: https://docs.snowflake.com/en/sql-reference/functions/iceberg_table_files.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# ICEBERG_TABLE_FILES

Returns information about the data files registered to an externally managed Apache Iceberg™ table at a specified
point in time.

See also:
:   [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) , [Metadata and retention for Apache Iceberg™ tables](../../user-guide/tables-iceberg-metadata.md) , [ALTER ICEBERG TABLE … REFRESH](../sql/alter-iceberg-table-refresh.md)

## Syntax

```sqlsyntax
ICEBERG_TABLE_FILES(
  TABLE_NAME => '<table_name>'
  [, AT => '<timestamp_ltz>']
)
```

## Arguments

**Required**

`TABLE_NAME => 'table_name'`
:   The name of the [externally managed Iceberg table](../../user-guide/tables-iceberg.md)
    for which you want to retrieve the data file information.

**Optional**

`AT => 'timestamp_ltz'`
:   Specifies an exact date and time to use for retrieving the file information. The value must be explicitly cast to a
    TIMESTAMP_LTZ data type. For information, see [Date & time data types](../data-types-datetime.md).

    If not specified, the function returns information about the table files for the current
    [snapshot](../../user-guide/tables-iceberg.md).

## Output

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| REGISTERED_ON | TIMESTAMP_LTZ | The timestamp of when the Parquet file was registered. |
| FILE_NAME | TEXT | The full path to the registered file. |
| FILE_SIZE | NUMBER | The size of the file (in bytes). |
| ROW_COUNT | NUMBER | The number of rows in the file. |
| ROW_COUNT_GROUP | NUMBER | The number of row groups in the file. |
| MD5 | N/A | This field returns a placeholder value and should not be used. This field might be deprecated in a future release. |
| ETAG | N/A | This field returns a placeholder value and should not be used. This field might be deprecated in a future release. |
| LAST_MODIFIED_ON | N/A | This field returns a placeholder value and should not be used. This field might be deprecated in a future release. |

> **Note:**
>
> The ETAG, MD5, and LAST_MODIFIED_ON fields return a placeholder value and should not be used. These fields might be deprecated in a future release.

## Examples

Retrieve information about the Parquet data files for the *current snapshot*
registered to an externally managed Iceberg table named `my_iceberg_table`:

```sqlexample
SELECT *
  FROM TABLE(
    INFORMATION_SCHEMA.ICEBERG_TABLE_FILES(
      TABLE_NAME => 'my_iceberg_table'
    )
  );
```

Output:

```output
+-------------------------------------------------------+--------------------------------+------------+--------------------------------+------------+------------------+-----------------------------------+-----------------------------------+
| FILE_NAME                                             | REGISTERED_ON                  | FILE_SIZE  | LAST_MODIFIED_ON               | ROW_COUNT  | ROW_GROUP_COUNT  | ETAG                              | MD5                               |
| data/87/snow_D9zlAoeipII_AODxT1uXDxg_0_1_003.parquet  | 1969-12-31 16:00:00.000 -0800  | 27136      | 1969-12-31 16:00:00.000 -0800  | 30000      | 1                | NULL                              | NULL                              |
| data/08/snow_D9zlAoeipII_AODxT1uXDxg_0_1_006.parquet  | 1969-12-31 16:00:00.000 -0800  | 45568      | 1969-12-31 16:00:00.000 -0800  | 45000      | 1                | NULL                              | NULL                              |
| data/94/snow_D9zlAoeipII_AODxT1uXDxg_0_1_008.parquet  | 1969-12-31 16:00:00.000 -0800  | 45056      | 1969-12-31 16:00:00.000 -0800  | 45000      | 1                | NULL                              | NULL                              |
| data/24/snow_D9zlAoeipII_AODxT1uXDxg_0_1_004.parquet  | 1969-12-31 16:00:00.000 -0800  | 27136      | 1969-12-31 16:00:00.000 -0800  | 30000      | 1                | NULL                              | NULL                              |
+-------------------------------------------------------+--------------------------------+------------+--------------------------------+------------+------------------+-----------------------------------+-----------------------------------+
```

Retrieve information about the Parquet data files for a table named `my_iceberg_table`
at a specified time and day:

```sqlexample
SELECT file_name, file_size, row_count, row_group_count, etag, md5
  FROM TABLE(
    INFORMATION_SCHEMA.ICEBERG_TABLE_FILES(
      TABLE_NAME => 'my_iceberg_table',
      AT => CAST('2024-12-09 11:02:00' AS TIMESTAMP_LTZ)
    )
  );
```

Output:

```output
+------------------------------------------------------+-----------+-----------+-----------------+----------------------------------+----------------------------------+
| FILE_NAME                                            | FILE_SIZE | ROW_COUNT | ROW_GROUP_COUNT | ETAG                             | MD5                              |
|------------------------------------------------------+-----------+-----------+-----------------+----------------------------------+----------------------------------|
| data/87/snow_D9zlAoeipII_AODxT1uXDxg_0_1_003.parquet | 27136     | 30000     | 1               | NULL                             | NULL                             |
| data/08/snow_D9zlAoeipII_AODxT1uXDxg_0_1_006.parquet | 45568     | 45000     | 1               | NULL                             | NULL                             |
| data/94/snow_D9zlAoeipII_AODxT1uXDxg_0_1_008.parquet | 45056     | 45000     | 1               | NULL                             | NULL                             |
| data/24/snow_D9zlAoeipII_AODxT1uXDxg_0_1_004.parquet | 27136     | 30000     | 1               | NULL                             | NULL                             |
+------------------------------------------------------+-----------+-----------+-----------------+----------------------------------+----------------------------------+
4 Row(s) produced. Time Elapsed: 1.502s
```

---
title: ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/iceberg_table_snapshot_refresh_history.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY

Returns metadata and [snapshot](../../user-guide/tables-iceberg.md) information about the most recent
refresh history for a specified externally managed Apache Iceberg™ table.

> **Note:**
>
> Snowflake version 9.16 added Delta-based table support for this function.
> The function only displays Delta-based table refresh data from version 9.16 and later.

See also:
:   [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) , [Metadata and retention for Apache Iceberg™ tables](../../user-guide/tables-iceberg-metadata.md) , [ALTER ICEBERG TABLE … REFRESH](../sql/alter-iceberg-table-refresh.md)

## Syntax

```sqlsyntax
ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY(
  TABLE_NAME => '<table_name>'
)
```

## Arguments

`TABLE_NAME => 'table_name'`
:   The name of the [externally managed Iceberg table](../../user-guide/tables-iceberg.md)
    for which you want to retrieve the snapshot refresh history.

## Output

The function returns the following columns:

| Column name | Data type | Description | Delta-based table note |
| --- | --- | --- | --- |
| REFRESHED_ON | TIMESTAMP_LTZ | The timestamp when the table was last refreshed. |  |
| METADATA_FILE_NAME | TEXT | The full path to the metadata file. | The full path to the commit or checkpoint file. |
| SNAPSHOT_ID | TEXT | The snapshot ID of the last refresh. | The resulting commit ID of the last refresh. |
| SEQUENCE_NUMBER | TEXT | The sequence number of the last refresh; NULL for Iceberg v1. | Not applicable for Delta-based tables; displays as NULL. |
| ICEBERG_SCHEMA_ID | TEXT | The schema ID of the refresh (from metadata). | Not applicable for Delta-based tables; displays as NULL. |
| QUERY_ID | TEXT | The ID of the query that performed the refresh. For tables that use [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md), this column contains a sentinel value, which indicates that the refresh was automated. |  |
| IS_CURRENT_SNAPSHOT | BOOLEAN | TRUE if the table is refreshed on this snapshot; otherwise, FALSE. | TRUE if the table is refreshed on this version (commit); otherwise, FALSE. |
| SNAPSHOT_SUMMARY | VARIANT | The Iceberg snapshot summary from the `metadata.json` file. NULL if not present in the metadata file. | Not applicable for Delta-based tables; displays as NULL. |

## Examples

Retrieve information for the current version of an externally managed Iceberg table named `my_iceberg_table`:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY(
    TABLE_NAME => 'my_iceberg_table'
  ));
```

Output:

```output
+-------------------------------+----------------------------------------------------------------------------------+---------------------+-----------------+-------------------+--------------------------------------+---------------------+---------------------------------+
| REFRESHED_ON                  | METADATA_FILE_NAME                                                               | SNAPSHOT_ID         | SEQUENCE_NUMBER | ICEBERG_SCHEMA_ID | QUERY_ID                             | IS_CURRENT_SNAPSHOT | SNAPSHOT_SUMMARY                |
|-------------------------------+----------------------------------------------------------------------------------+---------------------+-----------------+-------------------+--------------------------------------+---------------------+---------------------------------|
| 2024-12-09 11:00:50.506 -0800 | s3://my-bucket/metadata/00000-e3bf7230-283f-4626-a770-fe97a3ca239e.metadata.json | NULL                | NULL            | 0                 | 01b8ebb4-0002-3a10-0000-012903c7e42a | False               | NULL                            |
| 2024-12-09 11:01:35.543 -0800 | s3://my-bucket/metadata/00001-bf116652-b5b0-479a-947e-6c799e4ca124.metadata.json | 6201065399847600377 | NULL            | 0                 | 01b8ebb5-0002-3a14-0000-012903c7f336 | True                | {                               |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "added-data-files": "4",      |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "added-files-size": "144896", |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "added-records": "150000",    |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "manifests-created": "1",     |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "manifests-kept": "0",        |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "manifests-replaced": "0",    |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "total-data-files": "4",      |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "total-files-size": "144896", |
|                               |                                                                                  |                     |                 |                   |                                      |                     |   "total-records": "150000"     |
|                               |                                                                                  |                     |                 |                   |                                      |                     | }                               |
+-------------------------------+----------------------------------------------------------------------------------+---------------------+-----------------+-------------------+--------------------------------------+---------------------+---------------------------------+
```

---
title: IFF
source: https://docs.snowflake.com/en/sql-reference/functions/iff.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# IFF

Returns one of two values depending on whether a Boolean expression evaluates to true or false.
This function is similar to a single-level `if-then-else` expression. It is similar to [CASE](case.md),
but only allows a single condition. You can use it to add conditional logic to SQL statements.

## Syntax

```sqlsyntax
IFF( <condition> , <expr1> , <expr2> )
```

## Arguments

`condition`
:   The condition is an expression that should evaluate to a BOOLEAN value
    (TRUE, FALSE, or NULL).

    If `condition` evaluates to TRUE, returns `expr1`, otherwise
    returns `expr2`.

`expr1`
:   A general expression. The function returns this value if the `condition`
    is true.

`expr2`
:   A general expression. The function returns this value if the `condition`
    is not true (that is, if it is false or NULL).

## Returns

This function can return a value of any type. The function can return NULL if the value of the
expression that is returned is NULL.

## Usage notes

The `condition` can include a SELECT statement containing set
operators, such as UNION, INTERSECT, and EXCEPT (MINUS). When using set operators,
make sure that data types are compatible. For details, see the [General usage notes](../operators-query.md)
in the [Set operators](../operators-query.md) topic.

## Collation details

The value returned from the function retains the collation specification of the
highest-[precedence](../collation.md) collation
of the `expr1` and `expr2` arguments.

## Examples

The following examples use the `IFF` function.

Return `expr1` because the condition evaluates to true:

```sqlexample
SELECT IFF(TRUE, 'true', 'false');
```

```output
+----------------------------+
| IFF(TRUE, 'TRUE', 'FALSE') |
|----------------------------|
| true                       |
+----------------------------+
```

Return `expr2` because the condition evaluates to false:

```sqlexample
SELECT IFF(FALSE, 'true', 'false');
```

```output
+-----------------------------+
| IFF(FALSE, 'TRUE', 'FALSE') |
|-----------------------------|
| false                       |
+-----------------------------+
```

Return `expr2` because the condition evaluates to NULL:

```sqlexample
SELECT IFF(NULL, 'true', 'false');
```

```output
+----------------------------+
| IFF(NULL, 'TRUE', 'FALSE') |
|----------------------------|
| false                      |
+----------------------------+
```

Return NULL because the value of the expression returned is NULL:

```sqlexample
SELECT IFF(TRUE, NULL, 'false');
```

```output
+--------------------------+
| IFF(TRUE, NULL, 'FALSE') |
|--------------------------|
| NULL                     |
+--------------------------+
```

Return `expr1` (`integer`) if the value is an integer, or return
`expr2` (`non-integer`) if the value is not an integer:

```sqlexample
SELECT value, IFF(value::INT = value, 'integer', 'non-integer')
  FROM ( SELECT column1 AS value
           FROM VALUES(1.0), (1.1), (-3.1415), (-5.000), (NULL) )
  ORDER BY value DESC;
```

```output
+---------+---------------------------------------------------+
|   VALUE | IFF(VALUE::INT = VALUE, 'INTEGER', 'NON-INTEGER') |
|---------+---------------------------------------------------|
|    NULL | non-integer                                       |
|  1.1000 | non-integer                                       |
|  1.0000 | integer                                           |
| -3.1415 | non-integer                                       |
| -5.0000 | integer                                           |
+---------+---------------------------------------------------+
```

Return `expr1` (`High`) if the value is greater than 50, or return
`expr2` (`Low`) if the value is 50 or lower (or NULL):

```sqlexample
SELECT value, IFF(value > 50, 'High', 'Low')
FROM ( SELECT column1 AS value
         FROM VALUES(22), (63), (5), (99), (NULL) );
```

```output
+-------+--------------------------------+
| VALUE | IFF(VALUE > 50, 'HIGH', 'LOW') |
|-------+--------------------------------|
|    22 | Low                            |
|    63 | High                           |
|     5 | Low                            |
|    99 | High                           |
|  NULL | Low                            |
+-------+--------------------------------+
```

---
title: IFNULL
source: https://docs.snowflake.com/en/sql-reference/functions/ifnull.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# IFNULL

If `expr1` is NULL, returns `expr2`, otherwise returns `expr1`.

Aliases:
:   [NVL](nvl.md)

## Syntax

```sqlsyntax
IFNULL( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A general expression.

`expr2`
:   A general expression.

## Usage notes

* Snowflake performs [implicit conversion](../data-type-conversion.md) of arguments to make
  them compatible. For example, if one of the input expressions is a numeric type, the return type
  is also a numeric type. That is, `SELECT IFNULL('17', 1);` first converts the VARCHAR value `'17'`
  to the NUMBER value `17`, and then returns the first non-NULL value.

  When conversion isn’t possible, implicit conversion fails. For example, `SELECT IFNULL('foo', 1);`
  returns an error because the VARCHAR value `'foo'` can’t be converted to a NUMBER value.

  We recommend passing in arguments of the same type or explicitly converting arguments if needed.

* When implicit conversion converts a non-numeric value to a numeric value, the result is a value
  of type NUMBER(18,5).

  For numeric string arguments that aren’t constants, if NUMBER(18,5) isn’t sufficient to represent
  the numeric value, then [cast](../data-type-conversion.md) the argument to a type that
  can represent the value.

* Either expression can include a `SELECT` statement containing set
  operators, such as `UNION`, `INTERSECT`, `EXCEPT`, and `MINUS`.
  When using set operators, make sure that data types are compatible. For
  details, see the [General usage notes](../operators-query.md) in the
  [Set operators](../operators-query.md) topic.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Returns

Returns the data type of the returned expression.

If both expressions are NULL, returns NULL.

## Examples

Create a table that contains contact information for suppliers:

```sqlexample
CREATE TABLE IF NOT EXISTS suppliers (
  supplier_id INT PRIMARY KEY,
  supplier_name VARCHAR(30),
  phone_region_1 VARCHAR(15),
  phone_region_2 VARCHAR(15));
```

The table contains the phone number for each supplier in two different regions. The phone number can
be NULL for a region.

Insert values into the table:

```sqlexample
INSERT INTO suppliers(supplier_id, supplier_name, phone_region_1, phone_region_2)
  VALUES(1, 'Company_ABC', NULL, '555-01111'),
        (2, 'Company_DEF', '555-01222', NULL),
        (3, 'Company_HIJ', '555-01333', '555-01444'),
        (4, 'Company_KLM', NULL, NULL);
```

The following SELECT statement uses the IFNULL function to
retrieve the `phone_region_1` and `phone_region_2` values.

This example shows the following results for the IFNULL function:

* The `IF_REGION_1_NULL` column contains the value in `phone_region_1` or, if that value is NULL, the
  value in `phone_region_2`.
* The `IF_REGION_2_NULL` column contains the value in `phone_region_2` or, if that value is NULL, the
  value in `phone_region_1`.
* If both `phone_region_1` and `phone_region_2` are NULL, the function returns NULL.

```sqlexample
SELECT supplier_id,
       supplier_name,
       phone_region_1,
       phone_region_2,
       IFNULL(phone_region_1, phone_region_2) IF_REGION_1_NULL,
       IFNULL(phone_region_2, phone_region_1) IF_REGION_2_NULL
  FROM suppliers
  ORDER BY supplier_id;
```

```output
+-------------+---------------+----------------+----------------+------------------+------------------+
| SUPPLIER_ID | SUPPLIER_NAME | PHONE_REGION_1 | PHONE_REGION_2 | IF_REGION_1_NULL | IF_REGION_2_NULL |
|-------------+---------------+----------------+----------------+------------------+------------------|
|           1 | Company_ABC   | NULL           | 555-01111      | 555-01111        | 555-01111        |
|           2 | Company_DEF   | 555-01222      | NULL           | 555-01222        | 555-01222        |
|           3 | Company_HIJ   | 555-01333      | 555-01444      | 555-01333        | 555-01444        |
|           4 | Company_KLM   | NULL           | NULL           | NULL             | NULL             |
+-------------+---------------+----------------+----------------+------------------+------------------+
```

---
title: ILIKE ANY
source: https://docs.snowflake.com/en/sql-reference/functions/ilike_any.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# ILIKE ANY

Performs a case-insensitive comparison to match a string against any of one or more specified patterns.
Use this function in a WHERE clause to filter for matches. For case-sensitive matching, use LIKE ANY
instead.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [[ NOT ] LIKE](like.md) , [[ NOT ] ILIKE](ilike.md) , [LIKE ANY](like_any.md)

## Syntax

```sqlsyntax
<subject> ILIKE ANY (<pattern1> [, <pattern2> ... ] ) [ ESCAPE <escape_char> ]
```

## Arguments

**Required:**

`subject`
:   The string to compare to the pattern(s).

`pattern#`
:   The pattern(s) that the string is to be compared to. You must specify at least one pattern.

**Optional:**

`escape_char`
:   Character(s) inserted in front of a wildcard character to indicate that the wildcard should
    be interpreted as a regular character rather than as a wildcard.

## Returns

Returns a BOOLEAN value or NULL:

* Returns TRUE if there is a match.
* Returns FALSE if there isn’t a match.
* Returns NULL if any argument is NULL.

## Usage notes

* To include single quotes or other special characters in pattern matching, you can use a
  [backslash escape sequence](../data-types-text.md).
* NULL does not match NULL. In other words, if the subject is NULL and one of the patterns is NULL,
  that is not considered a match.
* You can use the [NOT](../operators-logical.md) logical operator before the `subject`
  to perform a case-sensitive comparison that returns TRUE if it does not match any of the specified patterns.
* SQL wildcards are supported in `pattern`:

  + An underscore (`_`) matches any single character.
  + A percent sign (`%`) matches any sequence of zero or more characters.
* Wildcards in `pattern` include newline characters (`n`) in `subject` as matches.
* The pattern is considered a match if the pattern matches the entire input string (subject). To match a sequence
  anywhere within a string, start and end the pattern with `%` (e.g. `%something%`).

* If the function is used with a subquery, the subquery should return a single row.

  For example, the following should be used only if the subquery returns a single row:

  ```sqlexample
  SELECT ...
    WHERE x ILIKE ANY (SELECT ...)
  ```

* If you require more complex pattern matching than this function supports, you can use a
  [regular expression function](../functions-regexp.md) instead.

## Collation details

Only the `upper`, `lower`, and `trim` collation specifications are supported. Combinations with `upper`,
`lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale
combinations (for example, `en-upper`).

## Examples

Create a table that contains some strings:

```sqlexample
CREATE OR REPLACE TABLE ilike_example(name VARCHAR(20));
INSERT INTO ilike_example VALUES
    ('jane doe'),
    ('Jane Doe'),
    ('JANE DOE'),
    ('John Doe'),
    ('John Smith');
```

This query shows how to use patterns with wildcards (`%`) to find matches:

```sqlexample
SELECT *
  FROM ilike_example
  WHERE name ILIKE ANY ('jane%', '%SMITH')
  ORDER BY name;
```

```output
+------------+
| NAME       |
|------------|
| JANE DOE   |
| Jane Doe   |
| John Smith |
| jane doe   |
+------------+
```

For examples of how to escape wildcard characters, see [LIKE ANY](like_any.md).

---
title: INFER_SCHEMA
source: https://docs.snowflake.com/en/sql-reference/functions/infer_schema.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# INFER_SCHEMA

Automatically detects the file metadata schema in a set of staged data files that contain semi-structured data and retrieves the column
definitions.

The [GENERATE_COLUMN_DESCRIPTION](generate_column_description.md) function builds on the INFER_SCHEMA function output to simplify the
creation of new tables, external tables, or views (using the appropriate [CREATE <object>](../sql/create.md) command) based on the column
definitions of the staged files.

You can execute the [CREATE TABLE](../sql/create-table.md), [CREATE EXTERNAL TABLE](../sql/create-external-table.md), or [CREATE ICEBERG TABLE](../sql/create-iceberg-table.md)
command with the USING TEMPLATE clause to create a new table or external table with the column definitions derived from the
INFER_SCHEMA function output.

> **Note:**
>
> This function supports Apache Parquet, Apache Avro, ORC, JSON, and CSV files.

## Syntax

```sqlsyntax
INFER_SCHEMA(
  LOCATION => '{ internalStage | externalStage }'
  , FILE_FORMAT => '<file_format_name>'
  , FILES => ( '<file_name>' [ , '<file_name>' ] [ , ... ] )
  , IGNORE_CASE => TRUE | FALSE
  , MAX_FILE_COUNT => <num>
  , MAX_RECORDS_PER_FILE => <num>
  , KIND => '<kind_name>'
)
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>][/<filename>]
>   | @~[/<path>][/<filename>]
> ```
>
> ```sqlsyntax
> externalStage ::=
>   @[<namespace>.]<ext_stage_name>[/<path>][/<filename>]
> ```

## Arguments

`LOCATION => '...'`
:   Name of the internal or external stage where the files are stored. Optionally include a path to one or more files in the cloud storage
    location; otherwise, the INFER_SCHEMA function scans files in all subdirectories in the stage:

    |  |  |
    | --- | --- |
    | `@[namespace.]int_stage_name[/path][/filename]` | Files are in the specified named internal stage. |
    | `@[namespace.]ext_stage_name[/path][/filename]` | Files are in the specified named external stage. |
    | `@~[/path][/filename]` | Files are in the stage for the current user. |

    > **Note:**
    >
    > This SQL function supports named stages (internal or external) and user stages only. It does not support table stages.

`FILES => ( 'file_name' [ , 'file_name' ] [ , ... ] )`
:   Specifies a list of one or more files (separated by commas) in a set of staged files that contain semi-structured data. The files must already have been staged in either the Snowflake internal location or external location specified in the command. If any of the specified files cannot be found, the query will be aborted.

    The maximum number of files names that can be specified is 1000.

    > > **Note:**
    > >
    > > For external stages only (Amazon S3, Google Cloud Storage, or Microsoft Azure), the file path is set by concatenating the URL in the stage definition and the list of resolved file names.
    > >
    > > However, Snowflake doesn’t insert a separator implicitly between the path and file names. You must explicitly include a separator (`/`) either at the end of the URL in the stage
    > > definition or at the beginning of each file name specified in this parameter.

`FILE_FORMAT => 'file_format_name'`
:   Name of the file format object that describes the data contained in the staged files. For more information, see
    [CREATE FILE FORMAT](../sql/create-file-format.md).

`IGNORE_CASE => TRUE | FALSE`
:   Specifies whether column names detected from stage files are treated as case sensitive. By default, the value is FALSE, which means that Snowflake preserves the case of alphabetic characters when retrieving column names. If you specify the value as TRUE, column names are treated as case-insensitive and all column names are retrieved as uppercase letters.

`MAX_FILE_COUNT => num`
:   Specifies the maximum number of files scanned from stage. This option is recommended for large number of files that have identical schema across files. This option cannot determine which files are scanned. If you want to scan specific files, use the `FILES` option instead.

`MAX_RECORDS_PER_FILE => num`
:   Specifies the maximum number of records scanned per file. This option only applies to CSV and JSON files. We recommend that you use this option for large files. This option might affect the accuracy of schema detection.

`KIND => 'kind_name'`
:   Specifies the kind of file metadata schema that can be scanned from the stage. By default, the value is `STANDARD`, which means that
    the file metadata schema that can be scanned from the stage is for Snowflake tables and the output is Snowflake data types. If you specify
    the value as `ICEBERG`, the schema is for Apache Iceberg tables and the output is Iceberg data types.

    > **Note:**
    >
    > If you’re inferring Parquet files to create Iceberg tables, we strongly recommend that you set `KIND => 'ICEBERG'`. Otherwise, the
    > column definitions returned by the function might be incorrect.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| COLUMN_NAME | TEXT | Name of a column in the staged files. |
| TYPE | TEXT | Data type of the column. |
| NULLABLE | BOOLEAN | Specifies whether rows in the column can store NULL instead of a value. Currently, the inferred nullability of a column can apply to one data file but not others in the scanned set. |
| EXPRESSION | TEXT | Expression of the column in the format `$1:COLUMN_NAME::TYPE` (primarily for external tables). If IGNORE_CASE is specified as TRUE, the expression of the column will be in the format `GET_IGNORE_CASE ($1, COLUMN_NAME)::TYPE`. |
| FILENAMES | TEXT | Names of the files that contain the column. |
| ORDER_ID | NUMBER | Column order in the staged files. |

## Usage notes

* For CSV files, you can define column names by using the file format option `PARSE_HEADER = [ TRUE | FALSE ]`.

  > + If the option is set to TRUE, the first row headers will be used to determine column names.
  > + The default value FALSE will return column names as c\*, where \* is the position of the column. The SKIP_HEADER option is not supported with PARSE_HEADER = TRUE.
  > + The PARSE_HEADER option isn’t supported for external tables.
* For both CSV and JSON files, the following file format options are currently not supported: DATE_FORMAT, TIME_FORMAT, and TIMESTAMP_FORMAT.
* The JSON TRIM_SPACE file format option is not supported.
* The scientific annotations (e.g. 1E2) in JSON files are retrieved as REAL data type.
* All the variations of timestamp data types are retrieved as TIMESTAMP_NTZ without any time zone information.
* For both CSV and JSON files, all columns are identified as NULLABLE.
* For both `KIND => 'STANDARD'` and `KIND => 'ICEBERG'`, when the specified file in the stage contains nested data types, only the
  first level of nesting is supported; deeper levels aren’t supported.
* Apache Iceberg™ version 3 (v3) tables aren’t supported.

## Examples

### Snowflake column definitions

Retrieve the Snowflake column definitions for Parquet files in the `mystage` stage:

```sqlexample
-- Create a file format that sets the file type as Parquet.
CREATE FILE FORMAT my_parquet_format
  TYPE = parquet;

-- Query the INFER_SCHEMA function.
SELECT *
  FROM TABLE(
    INFER_SCHEMA(
      LOCATION=>'@mystage'
      , FILE_FORMAT=>'my_parquet_format'
      )
    );

+-------------+---------+----------+---------------------+--------------------------+----------+
| COLUMN_NAME | TYPE    | NULLABLE | EXPRESSION          | FILENAMES                | ORDER_ID |
|-------------+---------+----------+---------------------+--------------------------|----------+
| continent   | TEXT    | True     | $1:continent::TEXT  | geography/cities.parquet | 0        |
| country     | VARIANT | True     | $1:country::VARIANT | geography/cities.parquet | 1        |
| COUNTRY     | VARIANT | True     | $1:COUNTRY::VARIANT | geography/cities.parquet | 2        |
+-------------+---------+----------+---------------------+--------------------------+----------+
```

Similar to the previous example, but specify a single Parquet file in the `mystage` stage:

```sqlexample
-- Query the INFER_SCHEMA function.
SELECT *
  FROM TABLE(
    INFER_SCHEMA(
      LOCATION=>'@mystage/geography/cities.parquet'
      , FILE_FORMAT=>'my_parquet_format'
      )
    );

+-------------+---------+----------+---------------------+--------------------------+----------+
| COLUMN_NAME | TYPE    | NULLABLE | EXPRESSION          | FILENAMES                | ORDER_ID |
|-------------+---------+----------+---------------------+--------------------------|----------+
| continent   | TEXT    | True     | $1:continent::TEXT  | geography/cities.parquet | 0        |
| country     | VARIANT | True     | $1:country::VARIANT | geography/cities.parquet | 1        |
| COUNTRY     | VARIANT | True     | $1:COUNTRY::VARIANT | geography/cities.parquet | 2        |
+-------------+---------+----------+---------------------+--------------------------+----------+
```

Retrieve the Snowflake column definitions for Parquet files in the `mystage` stage with IGNORE_CASE specified as TRUE. In the returned output, all column names are retrieved as uppercase letters.

```sqlexample
-- Query the INFER_SCHEMA function.
SELECT *
  FROM TABLE(
    INFER_SCHEMA(
      LOCATION=>'@mystage'
      , FILE_FORMAT=>'my_parquet_format'
      , IGNORE_CASE=>TRUE
      )
    );

+-------------+---------+----------+----------------------------------------+--------------------------+----------+
| COLUMN_NAME | TYPE    | NULLABLE | EXPRESSION                             | FILENAMES                | ORDER_ID |
|-------------+---------+----------+---------------------+---------------------------------------------|----------+
| CONTINENT   | TEXT    | True     | GET_IGNORE_CASE ($1, CONTINENT)::TEXT  | geography/cities.parquet | 0        |
| COUNTRY     | VARIANT | True     | GET_IGNORE_CASE ($1, COUNTRY)::VARIANT | geography/cities.parquet | 1        |
+-------------+---------+----------+---------------------+---------------------------------------------+----------+
```

Retrieve the Snowflake column definitions for JSON files in the `mystage` stage:

```sqlexample
-- Create a file format that sets the file type as JSON.
CREATE FILE FORMAT my_json_format
  TYPE = json;

-- Query the INFER_SCHEMA function.
SELECT *
  FROM TABLE(
    INFER_SCHEMA(
      LOCATION=>'@mystage/json/'
      , FILE_FORMAT=>'my_json_format'
      )
    );

+-------------+---------------+----------+---------------------------+--------------------------+----------+
| COLUMN_NAME | TYPE          | NULLABLE | EXPRESSION                | FILENAMES                | ORDER_ID |
|-------------+---------------+----------+---------------------------+--------------------------|----------+
| col_bool    | BOOLEAN       | True     | $1:col_bool::BOOLEAN      | json/schema_A_1.json     | 0        |
| col_date    | DATE          | True     | $1:col_date::DATE         | json/schema_A_1.json     | 1        |
| col_ts      | TIMESTAMP_NTZ | True     | $1:col_ts::TIMESTAMP_NTZ  | json/schema_A_1.json     | 2        |
+-------------+---------------+----------+---------------------------+--------------------------+----------+
```

Creates a table using the detected schema from staged JSON files.

> ```sqlexample
> CREATE TABLE mytable
>   USING TEMPLATE (
>     SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
>       FROM TABLE(
>         INFER_SCHEMA(
>           LOCATION=>'@mystage/json/',
>           FILE_FORMAT=>'my_json_format'
>         )
>       ));
> ```

> **Note:**
>
> Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger than 128 MB. We recommend that you avoid using `*` for larger result sets, and only use the required columns, `COLUMN NAME`, `TYPE`, and `NULLABLE`, for the query. Optional column `ORDER_ID` can be included when using `WITHIN GROUP (ORDER BY order_id)`.

Retrieve the column definitions for CSV files in the `mystage` stage and load the CSV files using MATCH_BY_COLUMN_NAME:

```sqlexample
-- Create a file format that sets the file type as CSV.
CREATE FILE FORMAT my_csv_format
  TYPE = csv
  PARSE_HEADER = true;

-- Query the INFER_SCHEMA function.
SELECT *
  FROM TABLE(
    INFER_SCHEMA(
      LOCATION=>'@mystage/csv/'
      , FILE_FORMAT=>'my_csv_format'
      )
    );

+-------------+---------------+----------+---------------------------+--------------------------+----------+
| COLUMN_NAME | TYPE          | NULLABLE | EXPRESSION                | FILENAMES                | ORDER_ID |
|-------------+---------------+----------+---------------------------+--------------------------|----------+
| col_bool    | BOOLEAN       | True     | $1:col_bool::BOOLEAN      | json/schema_A_1.csv      | 0        |
| col_date    | DATE          | True     | $1:col_date::DATE         | json/schema_A_1.csv      | 1        |
| col_ts      | TIMESTAMP_NTZ | True     | $1:col_ts::TIMESTAMP_NTZ  | json/schema_A_1.csv      | 2        |
+-------------+---------------+----------+---------------------------+--------------------------+----------+

-- Load the CSV file using MATCH_BY_COLUMN_NAME.
COPY INTO mytable FROM @mystage/csv/
  FILE_FORMAT = (
    FORMAT_NAME= 'my_csv_format'
  )
  MATCH_BY_COLUMN_NAME=CASE_INSENSITIVE;
```

### Iceberg column definitions

Retrieve the Iceberg column definitions for Parquet files on the `mystage` stage:

```sqlexample
-- Create a file format that sets the file type as Parquet.
  CREATE OR REPLACE FILE FORMAT my_parquet_format
    TYPE = PARQUET
    USE_VECTORIZED_SCANNER = TRUE;

-- Query the INFER_SCHEMA function.
SELECT *
FROM TABLE(
  INFER_SCHEMA(
    LOCATION=>'@mystage'
    , FILE_FORMAT=>'my_parquet_format'
    , KIND => 'ICEBERG'
    )
  );
```

Output:

```output
+-------------+---------+----------+---------------------+--------------------------+----------+
| COLUMN_NAME | TYPE    | NULLABLE | EXPRESSION          | FILENAMES                | ORDER_ID |
|-------------+---------+----------+---------------------+--------------------------|----------+
| id          | INT     | False    | $1:id::INT          | sales/customers.parquet   | 0       |
| custnum     | INT     | False    | $1:custnum::INT     | sales/customers.parquet   | 1       |
+-------------+---------+----------+---------------------+--------------------------+----------+
```

Creates an Apache Iceberg™ table by using the detected schema from staged Parquet files.

```sqlexample
 -- Create a file format that sets the file type as Parquet.
 CREATE OR REPLACE FILE FORMAT my_parquet_format
   TYPE = PARQUET
   USE_VECTORIZED_SCANNER = TRUE;

-- Create an Iceberg table.
CREATE ICEBERG TABLE myicebergtable
  USING TEMPLATE (
    SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
    WITHIN GROUP (ORDER BY order_id)
      FROM TABLE(
        INFER_SCHEMA(
          LOCATION=>'@mystage',
          FILE_FORMAT=>'my_parquet_format',
          KIND => 'ICEBERG'
        )
      ))
... {rest of the ICEBERG options}
;
```

> **Note:**
>
> Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger than 128 MB. We
> recommend avoiding the use of `*` for larger result sets, and only using the required columns, `COLUMN NAME`, `TYPE`, and
> `NULLABLE`, for the query. Optional column `ORDER_ID` can be included when using `WITHIN GROUP (ORDER BY order_id)`.

---
title: INITCAP
source: https://docs.snowflake.com/en/sql-reference/functions/initcap.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Case Conversion)

# INITCAP

Returns the input string with the first letter of each word in uppercase and the subsequent letters in lowercase.

## Syntax

```sqlsyntax
INITCAP( <expr> [ , '<delimiters>' ] )
```

## Arguments

`expr`
:   The string expression.

`'delimiters'`
:   A string of one or more characters that INITCAP uses as separators for words in the input expression:

    * If `delimiters` isn’t specified, any of the following characters in the input expressions are
      treated as word separators:

      ```output
      <whitespace> ! ? @ " ^ # $ & ~ _ , . : ; + - * % / | \ [ ] ( ) { } < >
      ```
    * If `delimiters` is specified, the specified value overrides all of the characters listed above.

    Supports any UTF-8 characters, including whitespace characters, and is case-sensitive.

    Must be enclosed in single quotes, for example `', '` (delimiters in this example are `,` and blank spaces).

    When specified as an empty string (that is, `''`), INITCAP ignores all delimiters, including whitespace characters,
    in the input expression. The input expression is treated as a single, continuous word. The resulting output is
    a string with the first character capitalized (if the first character is a letter) and all other letters in lowercase.

## Returns

This function returns a value of type VARCHAR.

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

This example provides various outputs in different languages using the default delimiters:

```sqlexample
SELECT v, INITCAP(v) FROM testinit;
```

```output
+---------------------------------+---------------------------------+
| C1                              | INITCAP(C1)                     |
|---------------------------------+---------------------------------|
| The Quick Gray Fox              | The Quick Gray Fox              |
| the sky is blue                 | The Sky Is Blue                 |
| OVER the River 2 Times          | Over The River 2 Times          |
| WE CAN HANDLE THIS              | We Can Handle This              |
| HelL0_hi+therE                  | Hell0_Hi+There                  |
| νησί του ποταμού                | Νησί Του Ποταμού                |
| ÄäÖößÜü                         | Ääöößüü                         |
| Hi,are?you!there                | Hi,Are?You!There                |
| to je dobré                     | To Je Dobré                     |
| ÉéÀàè]çÂâ ÊêÎÔô ÛûËÏ ïÜŸÇç ŒœÆæ | Ééààè]Çââ Êêîôô Ûûëï Ïüÿçç Œœææ |
| ĄąĆ ćĘęŁ łŃńÓ óŚśŹźŻż           | Ąąć Ćęęł Łńńó Óśśźźżż           |
| АаБб ВвГгД дЕеЁёЖ жЗзИиЙй       | Аабб Ввггд Дееёёж Жззиийй       |
| ХхЦц ЧчШш ЩщЪъ ЫыЬь ЭэЮ юЯя     | Ххцц Ччшш Щщъъ Ыыьь Ээю Юяя     |
| NULL                            | NULL                            |
+---------------------------------+---------------------------------+
```

These examples specify delimiters using the `delimiters` argument:

```sqlexample
SELECT INITCAP('this is the new Frame+work', '') AS initcap_result;
```

```output
+----------------------------+
| INITCAP_RESULT             |
|----------------------------|
| This is the new frame+work |
+----------------------------+
```

```sqlexample
SELECT INITCAP('iqamqinterestedqinqthisqtopic','q') AS initcap_result;
```

```output
+-------------------------------+
| INITCAP_RESULT                |
|-------------------------------|
| IqAmqInterestedqInqThisqTopic |
+-------------------------------+
```

```sqlexample
SELECT INITCAP('lion☂fRog potato⨊cLoUD', '⨊☂') AS initcap_result;
```

```output
+------------------------+
| INITCAP_RESULT         |
|------------------------|
| Lion☂Frog potato⨊Cloud |
+------------------------+
```

```sqlexample
SELECT INITCAP('apple is僉sweetandballIsROUND', '僉a b') AS initcap_result;
```

```output
+-------------------------------+
| INITCAP_RESULT                |
|-------------------------------|
| aPple Is僉SweetaNdbaLlisround |
+-------------------------------+
```

---
title: INSERT
source: https://docs.snowflake.com/en/sql-reference/functions/insert.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# INSERT

Replaces a substring of the specified length, starting at the specified
position, with a new string or binary value.

This function should not be confused with the [INSERT](../sql/insert.md) DML command.

## Syntax

```sqlsyntax
INSERT( <base_expr>, <pos>, <len>, <insert_expr> )
```

## Arguments

`base_expr`
:   The string or BINARY expression for which you want to insert/replace
    characters.

`pos`
:   The offset at which to start inserting characters. This is 1-based,
    not 0-based. In other words, the first character in the string is
    considered to be at position 1, not position 0. For example, to insert
    at the beginning of the string, set `pos` to 1.

    Valid values are between 1 and one more than the length of the string
    (inclusive).

    Setting `pos` to one more than the length of the string
    makes the operation equivalent to an append. (This also requires that the
    `len` parameter be 0 because you should not try to delete any
    characters past the last character.)

`len`
:   The number of characters (starting at `pos`) that you want
    to replace. Valid values range from 0 to the number of characters between
    `pos` and the end of the string. If this is 0, it means add the
    new characters without deleting any existing characters.

`insert_expr`
:   The string to insert into the `base_expr`. If this string
    is empty, and if `len` is greater than zero, then effectively the
    operation becomes a delete (some characters are deleted, and none are added).

## Usage notes

* The `base_expr` and `insert_expr` should be the same data
  type; either both should be string (e.g. VARCHAR) or both should be binary.
* If any of the arguments are NULL, the returned value is NULL.

## Returns

Returns a string or BINARY that is equivalent to making a copy of
`base_expr`, deleting `len` characters starting at
`pos`, and then inserting `insert_expr` at `pos`.

Note that the original input `base_expr` is not changed; the function
returns a separate (modified) copy.

## Examples

This is a simple example:

> ```sqlexample
> SELECT INSERT('abc', 1, 2, 'Z') as STR;
> +-----+
> | STR |
> |-----|
> | Zc  |
> +-----+
> ```

This example shows that the length of the replacement string can be different
from the length of the substring being replaced:

> ```sqlexample
> SELECT INSERT('abcdef', 3, 2, 'zzz') as STR;
> +---------+
> | STR     |
> |---------|
> | abzzzef |
> +---------+
> ```

This shows what happens when the replacement string is empty (the function deletes the
specified number of characters starting at the start position, and does not
add any characters):

> ```sqlexample
> SELECT INSERT('abc', 2, 1, '') as STR;
> +-----+
> | STR |
> |-----|
> | ac  |
> +-----+
> ```

This uses `INSERT` as an append operation, by adding characters immediately
after the last character in the original string:

> ```sqlexample
> SELECT INSERT('abc', 4, 0, 'Z') as STR;
> +------+
> | STR  |
> |------|
> | abcZ |
> +------+
> ```

The following all return NULL because at least one of the arguments is NULL:

> ```sqlexample
> SELECT INSERT(NULL, 1, 2, 'Z') as STR;
> +------+
> | STR  |
> |------|
> | NULL |
> +------+
> ```
>
> ```sqlexample
> SELECT INSERT('abc', NULL, 2, 'Z') as STR;
> +------+
> | STR  |
> |------|
> | NULL |
> +------+
> ```
>
> ```sqlexample
> SELECT INSERT('abc', 1, NULL, 'Z') as STR;
> +------+
> | STR  |
> |------|
> | NULL |
> +------+
> ```
>
> ```sqlexample
> SELECT INSERT('abc', 1, 2, NULL) as STR;
> +------+
> | STR  |
> |------|
> | NULL |
> +------+
> ```

---
title: INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/functions/integration.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Integration Configuration)

# INTEGRATION

Returns a JSON object that specifies the notification integration to use to send a message. This is a helper function that you
use to construct an integration configuration object for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md) ,
    [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) ,
    [EMAIL_INTEGRATION_CONFIG](email_integration_config.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.INTEGRATION( '<integration_name>' )
```

## Arguments

`'integration_name'`
:   Name of the notification integration to use.

## Returns

A JSON-formatted string that specifies a notification integration for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure to send.

For example, if you pass in the notification integration name `'my_queue_int'`, the function returns:

```json
'{"my_queue_int":{}}'
```

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: INTERPOLATE_BFILL, INTERPOLATE_FFILL, INTERPOLATE_LINEAR
source: https://docs.snowflake.com/en/sql-reference/functions/interpolate_bfill.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (General)

# INTERPOLATE_BFILL, INTERPOLATE_FFILL, INTERPOLATE_LINEAR

Updates rows in a time-series data set to gap-fill missing values based on surrounding values.

You can call the following interpolation window functions:

* INTERPOLATE_BFILL: Gap-fills rows based on the next observed row.
* INTERPOLATE_FFILL: Gap-fills rows based on the previously observed row.
* INTERPOLATE_LINEAR: Gap-fills rows based on the linear interpolation of previous and next values. This function
  only supports numeric values.

These functions have the same [window function syntax](../functions-window-syntax.md). They don’t
support explicit window frames.

## Syntax

```sqlsyntax
INTERPOLATE_BFILL( <expr> )
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

```sqlsyntax
INTERPOLATE_FFILL( <expr> )
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

```sqlsyntax
INTERPOLATE_LINEAR( <expr> )
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`expr`
:   An expression that defines the column that you want to gap-fill.

    The INTERPOLATE_LINEAR input expression must be a numeric data type.

    The INTERPOLATE_BFILL and INTERPOLATE_FFILL input expressions do not support [geospatial data types](../data-types-geospatial.md).

## Parameters

`OVER`
:   Standard window function OVER clause. See [Window function syntax and usage](../functions-window-syntax.md). For the interpolation functions, the
    PARTITION BY clause is optional, but the ORDER BY clause is required. You can’t specify an explicit window frame.

    The INTERPOLATE_LINEAR function can have only one ORDER BY expression, and it must be a numeric, DATE, or TIMESTAMP expression (including all TIMESTAMP variants).

## Returns

These functions return the same data type as the data type of the input expression.

## Usage notes

* When you use INTERPOLATE window functions with the [RESAMPLE](../constructs/resample.md) clause, include the columns to partition by in both PARTITION BY clauses: RESAMPLE (PARTITION BY) and INTERPOLATE (PARTITION BY). This approach ensures that:

  + RESAMPLE generates rows with non-NULL values for the partition columns.
  + INTERPOLATE functions operate within the correct partitions.
  + Any WHERE clause filters preserve the generated rows for the partitions you want to keep.

  For examples of using INTERPOLATE with RESAMPLE, see [Filling gaps in time-series data](../../user-guide/querying-time-series-data.md).

## Examples

The following examples show how to use the interpolation functions in simple queries.

### Example with two interpolation functions

The following example returns resampled `temperature` values and two different interpolated `temperature` values in the same query. (The table `march_temps_every_five_mins` was created earlier in this topic.)

```sqlexample
SELECT observed,
    temperature,
    INTERPOLATE_BFILL(temperature) OVER (PARTITION BY city, county ORDER BY observed) bfill_temp,
    INTERPOLATE_FFILL(temperature) OVER (PARTITION BY city, county ORDER BY observed) ffill_temp,
    city,
    county
  FROM march_temps_every_five_mins
  ORDER BY observed;
```

```output
+-------------------------+-------------+------------+------------+------------------+----------------+
| OBSERVED                | TEMPERATURE | BFILL_TEMP | FFILL_TEMP | CITY             | COUNTY         |
|-------------------------+-------------+------------+------------+------------------+----------------|
| 2025-03-15 09:45:00.000 |        NULL |         48 |       NULL | Big Bear City    | San Bernardino |
| 2025-03-15 09:49:00.000 |          48 |         48 |         48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |        NULL |         49 |         48 | Big Bear City    | San Bernardino |
| 2025-03-15 09:50:00.000 |          44 |         44 |         44 | South Lake Tahoe | El Dorado      |
| 2025-03-15 09:55:00.000 |          49 |         49 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 09:55:00.000 |          46 |         46 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:00:00.000 |        NULL |         51 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:00:00.000 |        NULL |         52 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:05:00.000 |        NULL |         51 |         49 | Big Bear City    | San Bernardino |
| 2025-03-15 10:05:00.000 |        NULL |         52 |         46 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:10:00.000 |          51 |         51 |         51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:10:00.000 |          52 |         52 |         52 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:15:00.000 |        NULL |         54 |         51 | Big Bear City    | San Bernardino |
| 2025-03-15 10:15:00.000 |          54 |         54 |         54 | South Lake Tahoe | El Dorado      |
| 2025-03-15 10:18:00.000 |          54 |         54 |         54 | Big Bear City    | San Bernardino |
+-------------------------+-------------+------------+------------+------------------+----------------+
```

The `bfill_temp` column returns a meaningful value for every row, but `ffill_temp` returns NULL
for the first row. The INTERPOLATE_FFILL function requires a previous value in order to return a non-NULL result.
The INTERPOLATE_BFILL function only requires a next value.

### Example of an expected error for an explicit window frame

The following query returns an error because the interpolation functions do not support explicit window frames:

```sqlexample
SELECT observed, temperature,
    INTERPOLATE_BFILL(temperature)
      OVER (PARTITION BY city, county ORDER BY observed ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) bfill_temp,
    city, county
  FROM march_temps_every_five_mins
  ORDER BY observed;
```

```output
002303 (0A000): SQL compilation error: error line 1 at position 111
Sliding window frame unsupported for function INTERPOLATE_BFILL
```

---
title: INVOKER_ROLE
source: https://docs.snowflake.com/en/sql-reference/functions/invoker_role.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# INVOKER_ROLE

Returns the name of the account-level role of the object executing the query or NULL if the name of the role is a database role.

See also:
:   [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md)

## Syntax

```sqlsyntax
INVOKER_ROLE()
```

## Arguments

None.

## Usage notes

* If using the INVOKER_ROLE function with [masking policy](../../user-guide/security-column-intro.md), verify that your Snowflake account is Enterprise Edition or higher.
* The following table summarizes the relationship between the query context and the role the function evaluates.

  | Context | Evaluated role |
  | --- | --- |
  | User | [CURRENT_ROLE](current_role.md) |
  | Table | CURRENT_ROLE. |
  | View | View owner role. |
  | UDF | UDF owner role. |
  | Stored procedure with caller’s right | CURRENT_ROLE. |
  | Stored procedure with owner’s right | Stored procedure owner role. |
  | Task | Task owner role. |
  | Stream | The role that queries a given [stream](../../user-guide/streams-intro.md). |
* The following diagram shows the relationship of a query performer, roles in Snowflake, and masking policies on tables or views.

  Where:

  + `R0, R1, R2, R3`
    :   Are roles in Snowflake.
  + `P1, P2, P3`
    :   Are masking policies in Snowflake.
  + `V1, V2`
    :   Are views in Snowflake.
  + `T`
    :   Is a table in Snowflake.

  Based on this diagram, the values of CURRENT_ROLE and INVOKER_ROLE in a query are as follows:

  | Policy | CURRENT_ROLE | INVOKER_ROLE |
  | --- | --- | --- |
  | P1 | R3 | R1 |
  | P2 | R3 | R2 |
  | P3 | R3 | R3 |

## Examples

The following examples show how to use the INVOKER_ROLE in a masking policy SQL expression.

Return NULL for unauthorized users:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY mask_string AS
> (val string) RETURNS string ->
> CASE
>   WHEN INVOKER_ROLE() IN ('ANALYST') THEN val
>   ELSE NULL
> END;
> ```

Return a static masked value for unauthorized users:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY mask_string AS
> (val string) RETURNS string ->
> CASE
>   WHEN INVOKER_ROLE() IN ('ANALYST') THEN val
>   ELSE '********'
> END;
> ```

Return a hash value using SHA2 , SHA2_HEX for unauthorized users:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY mask_string AS
> (val string) RETURNS string ->
> CASE
>   WHEN INVOKER_ROLE() IN ('ANALYST') THEN val
>   ELSE SHA2(val)
> END;
> ```

---
title: INVOKER_SHARE
source: https://docs.snowflake.com/en/sql-reference/functions/invoker_share.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# INVOKER_SHARE

Returns the name of the share that directly accessed the table or view where the INVOKER_SHARE function is invoked, otherwise the function returns NULL.

## Syntax

```sqlsyntax
INVOKER_SHARE()
```

## Arguments

None.

## Usage notes

* If using the INVOKER_SHARE function with [masking policy](../../user-guide/security-column-intro.md), verify that your Snowflake account is Enterprise Edition or higher.
* Use the INVOKER_SHARE function in a policy that is attached to a table or view that is directly invoked by a share.
* If the INVOKER_SHARE function is used inside a [User-defined functions overview](../../developer-guide/udf/udf-overview.md) within a masking policy directly attached to a table or view, INVOKER_SHARE returns NULL because the context of the INVOKER_SHARE function is the UDF owner, not the share.
* To help determine if a table or view was directly or indirectly invoked by a share, consider using the [CURRENT_ACCOUNT](current_account.md) function in a masking policy. This function returns the Snowflake account for the user’s current session, which can help determine if the table or view is invoked from a data sharing consumer account.

## Examples

Consider a data sharing provider account that has a masking policy set on a column of a secure view. There are two different shares that
can access the secure view to support two different data sharing consumers.

The data sharing provider creates the following policy to use UDFs to identify which share is being accessed. If a user in the data sharing
consumer account attempts to query the data through either share, they see data based on how the UDFs are written, otherwise a fixed masked
value is seen.

> ```sqlexample
> create or replace masking policy mask_share
> as (val string) returns string ->
> case
>   when invoker_share() in ('SHARE1') then mask1_function(val)
>   when invoker_share() in ('SHARE2') then mask2_function(val)
>   else '***MASKED***'
> end;
> ```

---
title: IS [ NOT ] DISTINCT FROM
source: https://docs.snowflake.com/en/sql-reference/functions/is-distinct-from.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# IS [ NOT ] DISTINCT FROM

Compares whether two expressions are equal (or not equal). The function is NULL-safe, meaning it treats NULLs as known values for comparing equality. Note that this is different from the EQUAL
[comparison operator](../operators-comparison.md) (`=`), which treats NULLs as unknown values.

See also:
:   [[ NOT ] EQUAL_NULL](equal_null.md)

## Syntax

```sqlsyntax
<expr1> IS [ NOT ] DISTINCT FROM <expr2>
```

## Usage notes

* The value returned depends on whether any of the inputs are NULL values:

  Returns TRUE:
  :   `<null> IS NOT DISTINCT FROM <null>`

      `<null> IS DISTINCT FROM <not_null>`

      `<not_null> IS DISTINCT FROM <null>`

  Returns FALSE:
  :   `<null> IS DISTINCT FROM <null>`

      `<null> IS NOT DISTINCT FROM <not_null>`

      `<not_null> IS NOT DISTINCT FROM <null>`

  Otherwise:

  > `<expr1> IS DISTINCT FROM <expr2>` is equivalent to `<expr1> != <expr2>`
  >
  > `<expr1> IS NOT DISTINCT FROM <expr2>` is equivalent to `<expr1> = <expr2>`

For more details, see the examples below.

## Examples

Create a table with simple data:

> ```sqlexample
> CREATE OR REPLACE TABLE x (i number);
> INSERT INTO x values
>     (1),
>     (2),
>     (null);
> ```

Show the Cartesian product generated by joining the table to itself without a filter:

> ```sqlexample
> SELECT x1.i x1_i, x2.i x2_i
>     FROM x x1, x x2
>     ORDER BY x1.i, x2.i;
> +------+------+
> | X1_I | X2_I |
> |------+------|
> |    1 |    1 |
> |    1 |    2 |
> |    1 | NULL |
> |    2 |    1 |
> |    2 |    2 |
> |    2 | NULL |
> | NULL |    1 |
> | NULL |    2 |
> | NULL | NULL |
> +------+------+
> ```

Return rows that contain:

> * Only equal values for both columns.
> * Only equal values or NULL values for both columns.
>
> ```sqlexample
> SELECT x1.i x1_i, x2.i x2_i
>     FROM x x1, x x2
>     WHERE x1.i=x2.i;
> +------+------+
> | X1_I | X2_I |
> |------+------|
> |    1 |    1 |
> |    2 |    2 |
> +------+------+
> ```
>
> ```sqlexample
> SELECT x1.i x1_i, x2.i x2_i
>     FROM x x1, x x2
>     WHERE x1.i IS NOT DISTINCT FROM x2.i
>     ORDER BY x1.i;
> +------+------+
> | X1_I | X2_I |
> |------+------|
> |    1 |    1 |
> |    2 |    2 |
> | NULL | NULL |
> +------+------+
> ```

Illustrate all possible outcomes for:

> * EQUAL `=` and NOT EQUAL `<>`
> * IS NOT DISTINCT FROM and IS DISTINCT FROM
>
> ```sqlexample
> SELECT x1.i x1_i,
>        x2.i x2_i,
>        x1.i=x2.i,
>        iff(x1.i=x2.i, 'Selected', 'Not') "SELECT IF X1.I=X2.I",
>        x1.i<>x2.i,
>        iff(not(x1.i=x2.i), 'Selected', 'Not') "SELECT IF X1.I<>X2.I"
>     FROM x x1, x x2;
> +------+------+-----------+---------------------+------------+----------------------+
> | X1_I | X2_I | X1.I=X2.I | SELECT IF X1.I=X2.I | X1.I<>X2.I | SELECT IF X1.I<>X2.I |
> |------+------+-----------+---------------------+------------+----------------------|
> |    1 |    1 | True      | Selected            | False      | Not                  |
> |    1 |    2 | False     | Not                 | True       | Selected             |
> |    1 | NULL | NULL      | Not                 | NULL       | Not                  |
> |    2 |    1 | False     | Not                 | True       | Selected             |
> |    2 |    2 | True      | Selected            | False      | Not                  |
> |    2 | NULL | NULL      | Not                 | NULL       | Not                  |
> | NULL |    1 | NULL      | Not                 | NULL       | Not                  |
> | NULL |    2 | NULL      | Not                 | NULL       | Not                  |
> | NULL | NULL | NULL      | Not                 | NULL       | Not                  |
> +------+------+-----------+---------------------+------------+----------------------+
> ```
>
> ```sqlexample
> SELECT x1.i x1_i, x2.i x2_i,
>                x1.i IS NOT DISTINCT FROM x2.i, iff(x1.i IS NOT DISTINCT FROM x2.i, 'Selected', 'Not') "SELECT IF X1.I IS NOT DISTINCT FROM X2.I",
>                x1.i IS DISTINCT FROM x2.i, iff(x1.i IS DISTINCT FROM x2.i, 'Selected', 'Not') "SELECT IF X1.I IS DISTINCT FROM X2.I"
>         FROM x x1, x x2
>         ORDER BY x1.i, x2.i;
> +------+------+--------------------------------+------------------------------------------+----------------------------+--------------------------------------+
> | X1_I | X2_I | X1.I IS NOT DISTINCT FROM X2.I | SELECT IF X1.I IS NOT DISTINCT FROM X2.I | X1.I IS DISTINCT FROM X2.I | SELECT IF X1.I IS DISTINCT FROM X2.I |
> |------+------+--------------------------------+------------------------------------------+----------------------------+--------------------------------------|
> |    1 |    1 | True                           | Selected                                 | False                      | Not                                  |
> |    1 |    2 | False                          | Not                                      | True                       | Selected                             |
> |    1 | NULL | False                          | Not                                      | True                       | Selected                             |
> |    2 |    1 | False                          | Not                                      | True                       | Selected                             |
> |    2 |    2 | True                           | Selected                                 | False                      | Not                                  |
> |    2 | NULL | False                          | Not                                      | True                       | Selected                             |
> | NULL |    1 | False                          | Not                                      | True                       | Selected                             |
> | NULL |    2 | False                          | Not                                      | True                       | Selected                             |
> | NULL | NULL | True                           | Selected                                 | False                      | Not                                  |
> +------+------+--------------------------------+------------------------------------------+----------------------------+--------------------------------------+
> ```

---
title: IS [ NOT ] NULL
source: https://docs.snowflake.com/en/sql-reference/functions/is-null.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# IS [ NOT ] NULL

Determines whether an expression is NULL or is not NULL.

## Syntax

```sqlsyntax
<expr> IS [ NOT ] NULL
```

## Returns

Returns a BOOLEAN.

* When IS NULL is specified, the value is TRUE if the expression is NULL. Otherwise, returns FALSE.
* When IS NOT NULL is specified, the value is TRUE if the expression is not NULL. Otherwise, returns FALSE.

## Examples

Create the `test_is_not_null` table and load the data:

```sqlexample
CREATE OR REPLACE TABLE test_is_not_null (id NUMBER, col1 NUMBER, col2 NUMBER);
INSERT INTO test_is_not_null (id, col1, col2) VALUES
  (1, 0, 5),
  (2, 0, NULL),
  (3, NULL, 5),
  (4, NULL, NULL);
```

Show the data in the `test_is_not_null` table:

```sqlexample
SELECT *
  FROM test_is_not_null
  ORDER BY id;
```

```output
+----+------+------+
| ID | COL1 | COL2 |
|----+------+------|
|  1 |    0 |    5 |
|  2 |    0 | NULL |
|  3 | NULL |    5 |
|  4 | NULL | NULL |
+----+------+------+
```

Use IS NOT NULL to return the rows for which the values in `col1` are not NULL:

```sqlexample
SELECT *
  FROM test_is_not_null
  WHERE col1 IS NOT NULL
  ORDER BY id;
```

```output
+----+------+------+
| ID | COL1 | COL2 |
|----+------+------|
|  1 |    0 |    5 |
|  2 |    0 | NULL |
+----+------+------+
```

Use IS NULL to return the rows for which the values in `col2` are NULL:

```sqlexample
SELECT *
  FROM test_is_not_null
  WHERE col2 IS NULL
  ORDER BY id;
```

```output
+----+------+------+
| ID | COL1 | COL2 |
|----+------+------|
|  2 |    0 | NULL |
|  4 | NULL | NULL |
+----+------+------+
```

Use a combination of IS NOT NULL and IS NULL to return the rows for which *either* of
the following conditions is met:

* The values in `col1` are not NULL.
* The values in `col2` are NULL.

```sqlexample
SELECT *
  FROM test_is_not_null
  WHERE col1 IS NOT NULL OR col2 IS NULL
  ORDER BY id;
```

```output
+----+------+------+
| ID | COL1 | COL2 |
|----+------+------|
|  1 |    0 |    5 |
|  2 |    0 | NULL |
|  4 | NULL | NULL |
+----+------+------+
```

Use a combination of IS NOT NULL and IS NULL to return the rows for which *both* of
the following conditions are met:

* The values in `col1` are not NULL.
* The values in `col2` are NULL.

```sqlexample
SELECT *
  FROM test_is_not_null
  WHERE col1 IS NOT NULL AND col2 IS NULL
  ORDER BY id;
```

```output
+----+------+------+
| ID | COL1 | COL2 |
|----+------+------|
|  2 |    0 | NULL |
+----+------+------+
```

---
title: IS_<object_type>
source: https://docs.snowflake.com/en/sql-reference/functions/is.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_*<object_type>*

This family of functions serves as Boolean predicates that can be used to determine the data type of a value stored in a VARIANT column:

* [IS_ARRAY](is_array.md)
* [IS_BINARY](is_binary.md)
* [IS_BOOLEAN](is_boolean.md)
* [IS_CHAR , IS_VARCHAR](is_char-varchar.md)
* [IS_DATE , IS_DATE_VALUE](is_date-value.md)
* [IS_DECIMAL](is_decimal.md)
* [IS_DOUBLE , IS_REAL](is_double-real.md)
* [IS_INTEGER](is_integer.md)
* [IS_NULL_VALUE](is_null_value.md)
* [IS_OBJECT](is_object.md)
* [IS_TIME](is_time.md)
* [IS_TIMESTAMP_\*](is_timestamp.md)

See also:
:   [AS_<object_type>](as.md) , [TYPEOF](typeof.md)

## General usage notes

* All the functions are unary, taking a VARIANT expression as the only argument.
* All the functions return FALSE if the input is SQL NULL or the VARIANT expression contains NULL.

## Examples

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Count all rows in `vartab` table where the VARIANT column `v` contains a string value:

```sqlexample
SELECT COUNT(*) FROM vartab WHERE IS_VARCHAR(v);
```

```output
+----------+
| COUNT(*) |
|----------|
|        1 |
+----------+
```

Select rows in `vartab` table where the VARIANT column `v` contains the specified data type:

```sqlexample
SELECT * FROM vartab WHERE IS_NULL_VALUE(v);
```

```output
+---+------+
| N | V    |
|---+------|
| 1 | null |
+---+------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_BOOLEAN(v);
```

```output
+---+------+
| N | V    |
|---+------|
| 3 | true |
+---+------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_INTEGER(v);
```

```output
+---+-----+
| N | V   |
|---+-----|
| 4 | -17 |
+---+-----+
```

```sqlexample
SELECT * FROM vartab WHERE IS_DECIMAL(v);
```

```output
+---+--------+
| N | V      |
|---+--------|
| 4 | -17    |
| 5 | 123.12 |
+---+--------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_DOUBLE(v);
```

```output
+---+-----------------------+
| N | V                     |
|---+-----------------------|
| 4 | -17                   |
| 5 | 123.12                |
| 6 | 1.912000000000000e+02 |
+---+-----------------------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_VARCHAR(v);
```

```output
+---+------------------------+
| N | V                      |
|---+------------------------|
| 7 | "Om ara pa ca na dhih" |
+---+------------------------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_ARRAY(v);
```

```output
+---+-------------+
| N | V           |
|---+-------------|
| 8 | [           |
|   |   -1,       |
|   |   12,       |
|   |   289,      |
|   |   2188,     |
|   |   false,    |
|   |   undefined |
|   | ]           |
+---+-------------+
```

```sqlexample
SELECT * FROM vartab WHERE IS_OBJECT(v);
```

```output
+---+---------------+
| N | V             |
|---+---------------|
| 9 | {             |
|   |   "x": "abc", |
|   |   "y": false, |
|   |   "z": 10     |
|   | }             |
+---+---------------+
```

---
title: IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_application_role_activated.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if an application role is activated in the specified context.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$APPLICATION' ,
  'IS_APPLICATION_ROLE_ACTIVATED' ,
  '<context>' ,
  '<app_role>'
)
```

## Arguments

`'SNOWFLAKE$APPLICATION'`
:   Specifies that you want to call a function to return context information about the application in which the function is called.

`'IS_APPLICATION_ROLE_ACTIVATED'`
:   Calls the IS_APPLICATION_ROLE_ACTIVATED function.

`'context'`
:   Specifies the execution context that you want to check. You can specify one of the following values:

    * `SESSION`: Checks if the application role is in the role hierarchy of the current session’s primary or secondary roles.
      The function returns `'TRUE'` if the role is in the role hierarchy.
    * `ACTIVE`: Checks if the application role is in the role hierarchy in the context of the current call.

      For example, in a call to an owner’s rights stored procedure, the procedure is executed by the owner’s role. The function
      returns `'TRUE'` if the application role is in the role hierarchy of the owner’s role.

`'app_role'`
:   Specifies the application role to check.

    Do not qualify the role name with the name of the application. The function automatically determines the application name from
    the context in which the function is called.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the application role is activated in the context specified by `context`.
* `'FALSE'` if the application role is not activated in that context or if the application role is not valid.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'IS_APPLICATION_ROLE_ACTIVATED', 'SESSION', 'my_app_role')::BOOLEAN = TRUE;
```

## Usage notes

## Examples

The following example returns `TRUE` if the application role `my_app_role` is in the role hierarchy of the session’s primary
or secondary roles:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'IS_APPLICATION_ROLE_ACTIVATED', 'SESSION', 'my_app_role');
```

---
title: IS_APPLICATION_ROLE_IN_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/is_application_role_in_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_APPLICATION_ROLE_IN_SESSION

Verifies whether the application role is activated in the consumer’s current session.

See also:
:   [IS_ROLE_IN_SESSION](is_role_in_session.md), [IS_DATABASE_ROLE_IN_SESSION](is_database_role_in_session.md)

## Syntax

```sqlsyntax
IS_APPLICATION_ROLE_IN_SESSION( '<string_literal>' )
```

## Arguments

`'string_literal'`
:   The application role name.

## Returns

* `TRUE` when the specified role name is activated in the consumer’s current session.

  The function always uses the consumer’s current session and returns `TRUE` when the application role is granted to the consumer using the function.

  The function does not return `TRUE` when the application calls the function because application roles are owned but not granted to the app.
* `FALSE` when the specified application role name is not activated in the consumer’s current session.

## Usage notes

* This function is only supported when called from within a Snowflake Native App. It
  does not work when called by a user outside an app.
* If you’re using the IS_APPLICATION_ROLE_IN_SESSION function with a
  [masking policy](../../user-guide/security-column-intro.md) or a
  [row access policy](../../user-guide/security-row-intro.md), your Snowflake must be Enterprise Edition or higher.
* Only one role name can be passed as an argument
* This function can’t be used in a materialized view definition because Snowflake cannot determine what data to
  materialize.

## Examples

Verify if the specified application role is in the current session:

```sqlexample
SELECT IS_APPLICATION_ROLE_IN_SESSION('ANALYST');
```

```output
+-------------------------------------------+
| IS_APPLICATION_ROLE_IN_SESSION('ANALYST') |
+-------------------------------------------+
| FALSE                                     |
+-------------------------------------------+
```

---
title: IS_ARRAY
source: https://docs.snowflake.com/en/sql-reference/functions/is_array.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_ARRAY

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains an [ARRAY](../data-types-semistructured.md) value.

See also:
:   [IS_<object_type>](is.md) , [IS_OBJECT](is_object.md)

## Syntax

```sqlsyntax
IS_ARRAY( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains an ARRAY value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

The following examples use the IS_ARRAY function.

### Use the IS_ARRAY function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the ARRAY values in the data by using the IS_ARRAY function in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_ARRAY(v);
```

```output
+---+-------------+
| N | V           |
|---+-------------|
| 8 | [           |
|   |   -1,       |
|   |   12,       |
|   |   289,      |
|   |   2188,     |
|   |   false,    |
|   |   undefined |
|   | ]           |
+---+-------------+
```

### Use the IS_ARRAY function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains ARRAY values in the data by using the IS_ARRAY function in a SELECT list:

```sqlexample
SELECT IS_ARRAY(array1),
       IS_ARRAY(array2),
       IS_ARRAY(boolean1)
  FROM multiple_types;
```

```output
+------------------+------------------+--------------------+
| IS_ARRAY(ARRAY1) | IS_ARRAY(ARRAY2) | IS_ARRAY(BOOLEAN1) |
|------------------+------------------+--------------------|
| True             | True             | False              |
+------------------+------------------+--------------------+
```

---
title: IS_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/is_binary.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_BINARY

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a [binary string](../data-types-text.md) value.

See also:
:   [IS_<object_type>](is.md)

## Syntax

```sqlsyntax
IS_BINARY( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a BINARY value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

Return all of the BINARY values in a VARIANT column.

> **Note:**
>
> The output format for BINARY values is set using the [BINARY_OUTPUT_FORMAT](../parameters.md) parameter.
> The default setting is `HEX`.

Create and load a table with a BINARY value in a VARIANT column:

```sqlexample
CREATE OR REPLACE TABLE varbin (v VARIANT);

INSERT INTO varbin SELECT TO_VARIANT(TO_BINARY('snow', 'utf-8'));
```

Show the BINARY values in the data by using the IS_BINARY function in a WHERE clause:

```sqlexample
SELECT v AS hex_encoded_binary_value
  FROM varbin
  WHERE IS_BINARY(v);
```

```output
+--------------------------+
| HEX_ENCODED_BINARY_VALUE |
|--------------------------|
| "736E6F77"               |
+--------------------------+
```

---
title: IS_BOOLEAN
source: https://docs.snowflake.com/en/sql-reference/functions/is_boolean.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_BOOLEAN

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a [BOOLEAN](../data-types-logical.md) value.

See also:
:   [IS_<object_type>](is.md)

## Syntax

```sqlsyntax
IS_BOOLEAN( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a BOOLEAN value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following examples use the IS_BOOLEAN function.

### Use the IS_BOOLEAN function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the BOOLEAN values in the data by using the IS_BOOLEAN function in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_BOOLEAN(v);
```

```output
+---+------+
| N | V    |
|---+------|
| 3 | true |
+---+------+
```

### Use the IS_BOOLEAN function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains BOOLEAN values in the data by using the IS_BOOLEAN function in a SELECT list:

```sqlexample
SELECT IS_BOOLEAN(boolean1),
       IS_BOOLEAN(array1)
  FROM multiple_types;
```

```output
+----------------------+--------------------+
| IS_BOOLEAN(BOOLEAN1) | IS_BOOLEAN(ARRAY1) |
|----------------------+--------------------|
| True                 | False              |
+----------------------+--------------------+
```

---
title: IS_CHAR , IS_VARCHAR
source: https://docs.snowflake.com/en/sql-reference/functions/is_char-varchar.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_CHAR , IS_VARCHAR

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a [string value](../data-types-text.md).

These functions are synonymous.

See also:
:   [IS_<object_type>](is.md)

## Syntax

```sqlsyntax
IS_CHAR( <variant_expr> )

IS_VARCHAR( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a string value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following examples use the IS_VARCHAR function.

### Use the IS_VARCHAR function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the string values in the data by using the IS_VARCHAR function in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_VARCHAR(v);
```

```output
+---+------------------------+
| N | V                      |
|---+------------------------|
| 7 | "Om ara pa ca na dhih" |
+---+------------------------+
```

### Use the IS_VARCHAR function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains string values in the data by using the IS_VARCHAR function in a SELECT list:

```sqlexample
SELECT IS_VARCHAR(varchar1),
       IS_VARCHAR(boolean1)
  FROM multiple_types;
```

```output
+----------------------+----------------------+
| IS_VARCHAR(VARCHAR1) | IS_VARCHAR(BOOLEAN1) |
|----------------------+----------------------|
| True                 | False                |
+----------------------+----------------------+
```

---
title: IS_CONFIGURATION_SET (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_configuration_set.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_CONFIGURATION_SET (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if the specified configuration has a value set,
that is, the configuration’s status is `DONE`. Returns `FALSE` if the
configuration does not have a value set, that is, the configuration’s status is `PENDING`.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$APPLICATION' ,
  'IS_CONFIGURATION_SET' ,
  '<config_name>' ,
)
```

## Arguments

`'SNOWFLAKE$APPLICATION'`
:   Specifies that you want to call a function to return context information about the app in which the function is called.

`'IS_CONFIGURATION_SET'`
:   Calls the IS_CONFIGURATION_SET function.

`'config_name'`
:   Specifies the name of the configuration to check.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the configuration has a value set.
* `'FALSE'` if the configuration does not have a value set.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'IS_CONFIGURATION_SET', 'my_config_name')::BOOLEAN = TRUE;
```

## Usage notes

* This function can only be used by an app.

---
title: IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_database_role_activated.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if a database role is activated in the current session.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md)
    [IS_ROLE_ACTIVATED (SYS_CONTEXT function)](is_role_activated.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$SESSION' ,
  'IS_DATABASE_ROLE_ACTIVATED' ,
  '<database_role>'
)
```

## Arguments

`'SNOWFLAKE$SESSION'`
:   Specifies that you want to call a function to return context information about the current session.

`'IS_DATABASE_ROLE_ACTIVATED'`
:   Calls the IS_DATABASE_ROLE_ACTIVATED function.

`'database_role'`
:   Specifies the database role to check. The name can be fully qualified or relative.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the current user’s active primary role or secondary roles in the session inherits the privileges of the specified database
  role.
* `'FALSE'` if the specified database role isn’t in the user’s active role hierarchy, or if the database role doesn’t exist.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_DATABASE_ROLE_ACTIVATED', 'my_db_role')::BOOLEAN = TRUE;
```

## Usage notes

* This function isn’t supported in governance policies (such as masking policies, row access policies, or projection policies)
  applied to shared tables. Shared objects can’t access consumer session state.
* If you don’t specify a fully qualified name, the function resolves the database context of the database role as follows:

  + **Queries:** Session database (the database currently in use).
  + **Body of a data protection policy:** Database containing the protected table or view.
  + **Sharing:** Database in the consumer account.
* This function can’t be used in materialized view definitions because the function isn’t deterministic.

## Examples

Check a database role in the current database using a relative name:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_DATABASE_ROLE_ACTIVATED', 'ANALYST_ROLE');
```

```output
+-------------------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_DATABASE_ROLE_ACTIVATED', 'ANA...  |
+-------------------------------------------------------------------------+
| TRUE                                                                    |
+-------------------------------------------------------------------------+
```

Check a database role in a different database using a fully qualified name:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_DATABASE_ROLE_ACTIVATED', 'DB2.READER_ROLE');
```

```output
+-------------------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_DATABASE_ROLE_ACTIVATED', 'DB ...  |
+-------------------------------------------------------------------------+
| TRUE                                                                    |
+-------------------------------------------------------------------------+
```

---
title: IS_DATABASE_ROLE_IN_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/is_database_role_in_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_DATABASE_ROLE_IN_SESSION

Verifies whether the database role is in the user’s active primary or secondary role hierarchy for the current session or if the specified
column contains a database role that is in the user’s active primary or secondary role hierarchy for the current session.

See also:
:   [IS_ROLE_IN_SESSION](is_role_in_session.md), [IS_APPLICATION_ROLE_IN_SESSION](is_application_role_in_session.md)

## Syntax

**Literal — specify a database role directly:**

```sqlsyntax
IS_DATABASE_ROLE_IN_SESSION( '<string_literal>' )
```

**Nonliteral — specify a column:**

> ```sqlsyntax
> IS_DATABASE_ROLE_IN_SESSION( <column_name> )
> ```

## Arguments

`'string_literal'`
:   The name of the database role.

    Specify the relative name of the database role. The function evaluates to `False` if you specify the fully qualified name.

`column_name`
:   The column name in a table or view.

## Returns

`True`
:   * For a literal argument (database role name), the current user’s active
      [primary role or secondary roles](../../user-guide/security-access-control-overview.md) in the session inherits the privileges of the
      specified database role.
    * For a nonliteral argument (column name), Snowflake evaluates each row in the table and returns a row that contains a value that
      specifies a database role in the user’s current session. Each row corresponds to a database role name that originates from the
      database in use or the specified database in a query.

`False`
:   * For a literal argument, the specified database role is not in the role hierarchy of the current user’s primary role or secondary
      roles.
    * For a nonliteral argument, Snowflake does not return a row if the database role is not in the table column for the database in use or
      the database specified in a query.
    * Specifying the fully qualified name of a database role in the format `database_name.database_role_name`. Use the relative
      name instead, `database_role_name`.

## Usage notes

These notes only apply to the IS_DATABASE_ROLE_IN_SESSION function:

* Use this table to predict the evaluation of the function when the function argument is a string literal:

  | Context | Evaluation |
  | --- | --- |
  | Query. | Session database. |
  | Table or view definition with WHERE clause. | Depends on the following:  + If you have a database in use and you use the relative name of the table or view, the context is the database in use   (session database). + If you specify the fully-qualified name of the table or view, the context is the database that contains the table or view. |
  | Protected table or view. | Database containing the table or view. |
  | Owner’s Rights stored procedure. | Database containing the stored procedure. |
  | Caller’s Rights stored procedure. | Session database. |
  | UDF | Database containing the protected table or view.  If the UDF is not called in a policy, the function evaluates to the database that contains the UDF. |
* A database role becomes active in the role hierarchy when the database containing the database role is in use or when querying a table in
  the same database that contains the database role.
* When you specify this function in the `body` of a data access policy, such as a masking or row access policy, the function uses
  the database and schema of the protected table.

  For example, if you add a row access policy to the `hr.tables.empl_info` table, the function searches for its argument, the
  database role name or the column name, in the `hr` database because that database contains the protected table.
* You should avoid query structures that require Snowflake to create an inline view. In this context, an inline view is a temporary view
  that Snowflake creates to determine the query result. For example, if your query calls this function and you specify a WITH clause at the
  beginning of the query or specify a subquery in the FROM clause, Snowflake returns an error:

  ```output
  Could not resolve the database for IS_DATABASE_ROLE_IN_SESSION({})
  ```

  Where `{}` is a placeholder for the function argument in your query. The reason for the error is that Snowflake does not have
  enough information to evaluate the context of the function argument. To resolve the error message, simplify your query, such as removing
  the WITH clause or removing the subquery in the FROM clause.
* When the [user property](../sql/create-user.md) `DEFAULT_SECONDARY_ROLES` value is `ALL`, the function returns
  `True` if any account role granted to the user inherits the privileges of the specified database role.
* When using this function in the condition of a masking policy or row access policy that protects shared data, ensure the database
  containing the policy and the policy-protected data are both shared to the consumer account. The policy and the policy-protected data can
  be in the same database or in different databases. For details, see [Share data protected by a policy](../../user-guide/data-sharing-policy-protected-data.md).

These notes apply to both the IS_DATABASE_ROLE_IN_SESSION and IS_ROLE_IN_SESSION functions:

* Use one syntax.
* Name syntax:

  + Only one role name can be passed as an argument.
  + The argument must be a string and use the same casing as how the role is stored in Snowflake. For details, see
    [Identifier requirements](../identifiers-syntax.md).
* Column syntax:

  + Only one column can be passed as an argument.
  + The column must have a [STRING](../data-types-text.md) data type.
  + Specify the column as one of the following:

    - `column_name`
    - `table_name.column_name`
    - `schema_name.table_name.column_name`
    - `database_name.schema_name.table_name.column_name`
* Virtual columns:

  A virtual column, which contains the result of a calculated value from an expression rather than the calculated value being stored in the
  table, is not supported.

  ```sqlexample
  SELECT IS_ROLE_IN_SESSION(UPPER(authz_role)) FROM t1;
  ```

  A virtual column is supported only when the expression has an alias for the column name:

  ```sqlexample
  CREATE VIEW v2 AS
  SELECT
    authz_role,
    UPPER(authz_role) AS upper_authz_role
  FROM t2;

  SELECT IS_ROLE_IN_SESSION(upper_authz_role) FROM v2;
  ```
* Policies:

  If you use these functions with a [masking policy](../../user-guide/security-column-intro.md) or
  [row access policy](../../user-guide/security-row-intro.md), verify that your Snowflake account is Enterprise Edition or higher.

  Snowflake recommends using this function when the policy conditions need to evaluate role hierarchy and inherited privileges.
* Result cache:

  If you use this function in a masking policy or row access policy and neither the policy nor the table or column protected by the policy
  change from a previous query, you can use the [RESULT_SCAN](result_scan.md) function to return the results of a query on
  the protected table. The result cache applies when using the nonliteral syntax only.
* These functions cannot be used in the materialized view definition because the functions are not deterministic and Snowflake cannot
  determine what data to materialize.

## Examples

Verify if the privileges granted to a specified role are inherited by the current role in the session:

```sqlexample
SELECT IS_DATABASE_ROLE_IN_SESSION('R1');
```

```output
+-----------------------------------+
| IS_DATABASE_ROLE_IN_SESSION('R1') |
+-----------------------------------+
| True                              |
+-----------------------------------+
```

Return database role values for the column named ROLE_NAME:

> ```sqlexample
> SELECT *
> FROM myb.s1.t1
> WHERE IS_DATABASE_ROLE_IN_SESSION(role_name);
> ```

For additional examples related to secure data sharing, see [Share data protected by a policy](../../user-guide/data-sharing-policy-protected-data.md).

---
title: IS_DATE , IS_DATE_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/is_date-value.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_DATE , IS_DATE_VALUE

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a [DATE](../data-types-datetime.md) value.

IS_DATE and IS_DATE_VALUE are synonymous.

See also:
:   [IS_<object_type>](is.md) , [IS_TIME](is_time.md) , [IS_TIMESTAMP_\*](is_timestamp.md)

## Syntax

```sqlsyntax
IS_DATE( <variant_expr> )

IS_DATE_VALUE( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a DATE value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

Return all of the DATE values in a VARIANT column.

> **Note:**
>
> The output format for date values is set using the [DATE_OUTPUT_FORMAT](../parameters.md) parameter.
> The default setting is `YYYY-MM-DD`.

Create and load a table with various date and time values in a VARIANT column:

```sqlexample
CREATE OR REPLACE TABLE vardttm (v VARIANT);
```

```sqlexample
INSERT INTO vardttm SELECT TO_VARIANT(TO_DATE('2024-02-24'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIME('20:57:01.123456789+07:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP('2023-02-24 12:00:00.456'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_LTZ('2022-02-24 13:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_NTZ('2021-02-24 14:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_TZ('2020-02-24 15:00:00.123 +01:00'));
```

Use the [TYPEOF](typeof.md) function in a query to show the data types of the values stored in the VARIANT column `v`:

```sqlexample
SELECT v, TYPEOF(v) AS type FROM vardttm;
```

```output
+---------------------------------+---------------+
| V                               | TYPE          |
|---------------------------------+---------------|
| "2024-02-24"                    | DATE          |
| "20:57:01"                      | TIME          |
| "2023-02-24 12:00:00.456"       | TIMESTAMP_NTZ |
| "2022-02-24 04:00:00.123 -0800" | TIMESTAMP_LTZ |
| "2021-02-24 14:00:00.123"       | TIMESTAMP_NTZ |
| "2020-02-24 15:00:00.123 +0100" | TIMESTAMP_TZ  |
+---------------------------------+---------------+
```

Show the DATE values in the data by using the IS_DATE function in a WHERE clause. Only the DATE value
is returned in the output. The TIME and TIMESTAMP values aren’t returned.

```sqlexample
SELECT v FROM vardttm WHERE IS_DATE(v);
```

```output
+--------------+
| V            |
|--------------|
| "2024-02-24" |
+--------------+
```

---
title: IS_DECIMAL
source: https://docs.snowflake.com/en/sql-reference/functions/is_decimal.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_DECIMAL

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a
[fixed-point number or integer](../data-types-numeric.md) value.

See also:
:   [IS_<object_type>](is.md) , [IS_DOUBLE , IS_REAL](is_double-real.md) , [IS_INTEGER](is_integer.md)

## Syntax

```sqlsyntax
IS_DECIMAL( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a fixed-point number or integer value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following examples use the IS_DECIMAL function.

### Use the IS_DECIMAL function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the fixed-point number and integer values in the data by using the IS_DECIMAL function
in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_DECIMAL(v);
```

```output
+---+--------+
| N | V      |
|---+--------|
| 4 | -17    |
| 5 | 123.12 |
+---+--------+
```

### Use the IS_DECIMAL function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains fixed-point number or integer values in the data by using the
IS_DECIMAL function in a SELECT list:

```sqlexample
SELECT IS_DECIMAL(decimal1),
       IS_DECIMAL(double1),
       IS_DECIMAL(integer1)
  FROM multiple_types;
```

```output
+----------------------+---------------------+----------------------+
| IS_DECIMAL(DECIMAL1) | IS_DECIMAL(DOUBLE1) | IS_DECIMAL(INTEGER1) |
|----------------------+---------------------+----------------------|
| True                 | False               | True                 |
+----------------------+---------------------+----------------------+
```

---
title: IS_DOUBLE , IS_REAL
source: https://docs.snowflake.com/en/sql-reference/functions/is_double-real.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_DOUBLE , IS_REAL

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains a
[floating-point number, fixed-point number, or integer](../data-types-numeric.md) value.

These functions are synonymous.

See also:
:   [IS_<object_type>](is.md) , [IS_DECIMAL](is_decimal.md) , [IS_INTEGER](is_integer.md)

## Syntax

```sqlsyntax
IS_DOUBLE( <variant_expr> )

IS_REAL( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a floating-point number, a fixed-point number, or an integer value.
  Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following examples use the IS_DOUBLE function.

### Use the IS_DOUBLE function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the floating-point numbers, fixed-point numbers, and integers in the data by using the
IS_DOUBLE function in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_DOUBLE(v);
```

```output
+---+-----------------------+
| N | V                     |
|---+-----------------------|
| 4 | -17                   |
| 5 | 123.12                |
| 6 | 1.912000000000000e+02 |
+---+-----------------------+
```

### Use the IS_DOUBLE function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains floating-point numbers, fixed-point numbers, or integers in
the data by using the IS_DOUBLE function in a SELECT list:

```sqlexample
SELECT IS_DOUBLE(boolean1),
       IS_DOUBLE(decimal1),
       IS_DOUBLE(double1),
       IS_DOUBLE(integer1)
  FROM multiple_types;
```

```output
+---------------------+---------------------+--------------------+---------------------+
| IS_DOUBLE(BOOLEAN1) | IS_DOUBLE(DECIMAL1) | IS_DOUBLE(DOUBLE1) | IS_DOUBLE(INTEGER1) |
|---------------------+---------------------+--------------------+---------------------|
| False               | True                | True               | True                |
+---------------------+---------------------+--------------------+---------------------+
```

---
title: IS_GRANTED_TO_INVOKER_ROLE
source: https://docs.snowflake.com/en/sql-reference/functions/is_granted_to_invoker_role.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_GRANTED_TO_INVOKER_ROLE

Returns TRUE if the role returned by the INVOKER_ROLE function inherits the privileges of the specified role in the argument based on the
context in which the function is called.

The INVOKER_ROLE function only identifies and returns the account role of the object executing a SQL statement. Database roles are not
supported.

## Syntax

```sqlsyntax
IS_GRANTED_TO_INVOKER_ROLE( '<string_literal>' )
```

## Arguments

`'string_literal'`
:   The name of the role.

## Usage notes

* If using the IS_GRANTED_TO_INVOKER_ROLE function with [masking policy](../../user-guide/security-column-intro.md) or a
  [row access policy](../../user-guide/security-row-intro.md), verify that your Snowflake account is Enterprise Edition or higher.
* Only one role name can be passed as an argument.
* The following table summarizes the context in which you can call the function and the role hierarchy Snowflake evaluates.

  | Context | Evaluated role |
  | --- | --- |
  | User | [CURRENT_ROLE](current_role.md) |
  | Table | CURRENT_ROLE. |
  | View | View owner role. |
  | UDF | UDF owner role. |
  | Stored procedure with caller’s right | CURRENT_ROLE. |
  | Stored procedure with owner’s right | Stored procedure owner role. |
  | Task | Task owner role. |
  | Stream | The role that queries a given [stream](../../user-guide/streams-intro.md). |
* If prefer to evaluate the role hierarchy for the current session, call [IS_ROLE_IN_SESSION](is_role_in_session.md) instead.

## Example

Call the function directly:

> ```sqlexample
> IS_GRANTED_TO_INVOKER_ROLE('ANALYST')
>
> --------------------------------------+
> IS_GRANTED_TO_INVOKER_ROLE('ANALYST') |
> --------------------------------------+
>                 TRUE                  |
> --------------------------------------+
> ```

Specify the function in the masking policy body:

```sqlexample
CREATE OR REPLACE MASKING POLICY mask_string AS
(val string) RETURNS string ->
CASE
  WHEN IS_GRANTED_TO_INVOKER_ROLE('ANALYST') then val
  ELSE '*******'
END;
```

---
title: IS_GROUP_ACTIVATED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_group_activated.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_GROUP_ACTIVATED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if the role representing an [organization user group](../../user-guide/organization-users.md) is
activated in a given context.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [IS_GROUP_IMPORTED (SYS_CONTEXT function)](is_group_imported.md) ,
    [IS_USER_IMPORTED (SYS_CONTEXT function)](is_user_imported.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ORGANIZATION' ,
  'IS_GROUP_ACTIVATED' ,
  '<context>' ,
  '<group_name>'
)
```

## Arguments

`'SNOWFLAKE$ORGANIZATION'`
:   Specifies that you want to call a function to return context information about the current organization.

`'IS_GROUP_ACTIVATED'`
:   Calls the IS_GROUP_ACTIVATED function.

`'context'`
:   Specifies the execution context that you want to check. You can specify one of the following values:

    * `SESSION`: Checks if the organization group role is in the role hierarchy of the current session’s primary or secondary
      roles. The function returns `'TRUE'` if the role is in the role hierarchy.
    * `ACTIVE`: Checks if the organization group role is in the role hierarchy in the context of the current call.

      For example, in a call to an owner’s rights stored procedure, the procedure is executed by the owner’s role. The function
      returns `'TRUE'` if the organization group role is in the role hierarchy of the owner’s role.

`'group_name'`
:   Specifies the name of the organization user group to check.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the organization user group role is activated in the context specified by `context`.
* `'FALSE'` if the organization user group role is not activated in that context or if the group is not a valid organization
  user group.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_GROUP_ACTIVATED', 'SESSION', 'my_group_name')::BOOLEAN = TRUE;
```

## Usage notes

## Examples

The following example returns `'TRUE'` if the role for the organization user group `my_group_name` is in the role hierarchy
of the session’s primary or secondary roles:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_GROUP_ACTIVATED', 'SESSION', 'my_group_name');
```

---
title: IS_GROUP_IMPORTED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_group_imported.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_GROUP_IMPORTED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if the specified group is an [organization user group](../../user-guide/organization-users.md) that
was imported into the current account.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](is_group_activated.md) ,
    [IS_USER_IMPORTED (SYS_CONTEXT function)](is_user_imported.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ORGANIZATION' ,
  'IS_GROUP_IMPORTED' ,
  '<group_name>'
)
```

## Arguments

`'SNOWFLAKE$ORGANIZATION'`
:   Specifies that you want to call a function to return context information about the current organization.

`'IS_GROUP_IMPORTED'`
:   Calls the IS_GROUP_IMPORTED function.

`'group_name'`
:   Specifies the name of the organization user group to check.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the organization user group was imported into the current account.
* `'FALSE'` if the organization user group was not imported into the current account or is not a valid organization user group.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_GROUP_IMPORTED', 'my_group_name')::BOOLEAN = TRUE;
```

## Usage notes

## Examples

The following example returns `'TRUE'` if the group `my_group_name` is an organization user group that was imported into the
current account:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_GROUP_IMPORTED', 'my_group_name');
```

---
title: IS_INSTANCE_ROLE_IN_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/is_instance_role_in_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_INSTANCE_ROLE_IN_SESSION

Verifies whether the user’s active primary or secondary role hierarchy for the session inherits the specified instance role.

See also:
:   [Instance roles](../snowflake-db-classes.md) , [IS_DATABASE_ROLE_IN_SESSION](is_database_role_in_session.md) , [IS_ROLE_IN_SESSION](is_role_in_session.md)

## Syntax

```sqlsyntax
IS_INSTANCE_ROLE_IN_SESSION( '<instance_name>' , '<instance_role_name>' )
```

## Arguments

`'instance_name'`
:   Specifies the name of the instance.

`'instance_role_name'`
:   Specifies the name of the instance role.

## Returns

* `TRUE` if the current user’s active [primary role or secondary roles](../../user-guide/security-access-control-overview.md) in the session
  inherit the specified instance role.

  When the `DEFAULT_SECONDARY_ROLES` value is `ALL`, any role granted to the user inherits the privileges of the
  specified instance role.
* `FALSE` if the specified instance role is not in the role hierarchy of the user’s current primary or secondary roles.

## Examples

Verify whether the current role for the session inherits the specified instance role:

> ```sqlexample
> USE ROLE my_role;
>
> SELECT IS_INSTANCE_ROLE_IN_SESSION('my_db.my_schema.my_anomaly_detector', 'user');
> ```
>
> ```output
> +----------------------------------------------------------------------------+
> | IS_INSTANCE_ROLE_IN_SESSION('MY_DB.MY_SCHEMA.MY_ANOMALY_DETECTOR', 'USER') |
> +----------------------------------------------------------------------------+
> | TRUE                                                                       |
> +----------------------------------------------------------------------------+
> ```

---
title: IS_INTEGER
source: https://docs.snowflake.com/en/sql-reference/functions/is_integer.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_INTEGER

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains an [integer](../data-types-numeric.md) value.

See also:
:   [IS_<object_type>](is.md) , [IS_DECIMAL](is_decimal.md) , [IS_DOUBLE , IS_REAL](is_double-real.md)

## Syntax

```sqlsyntax
IS_INTEGER( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains an integer. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

The following examples use the IS_INTEGER function.

### Use the IS_INTEGER function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the integers in the data by using the IS_INTEGER function in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_INTEGER(v);
```

```output
+---+-----+
| N | V   |
|---+-----|
| 4 | -17 |
+---+-----+
```

### Use the IS_INTEGER function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains integers in the data by using the IS_INTEGER function
in a SELECT list:

```sqlexample
SELECT IS_INTEGER(decimal1),
       IS_INTEGER(double1),
       IS_INTEGER(integer1)
  FROM multiple_types;
```

```output
+----------------------+---------------------+----------------------+
| IS_INTEGER(DECIMAL1) | IS_INTEGER(DOUBLE1) | IS_INTEGER(INTEGER1) |
|----------------------+---------------------+----------------------|
| False                | False               | True                 |
+----------------------+---------------------+----------------------+
```

---
title: IS_NULL_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/is_null_value.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_NULL_VALUE

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument is a [JSON null](../../user-guide/semistructured-considerations.md) value.

> **Important:**
>
> The JSON null value is distinct from the SQL NULL value.
>
> This function returns TRUE only for JSON null values, not SQL NULL values.
> The difference is shown in the first and third rows in
> the output for the example below.
>
> A missing JSON value is converted to a SQL NULL value, for which
> IS_NULL_VALUE returns NULL. The 4th column in
> the output for the example below
> shows this.

This function is different from the [IS [ NOT ] NULL](is-null.md) function.

See also:
:   [IS_<object_type>](is.md)

## Syntax

```sqlsyntax
IS_NULL_VALUE( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

This function returns a value of type BOOLEAN or NULL:

* Returns TRUE for a JSON null value.
* Returns FALSE for a non-null JSON value.
* Returns NULL for a SQL NULL value.

## Examples

This example uses the IS_NULL_VALUE function. First, create a table with a VARIANT column:

```sqlexample
CREATE OR REPLACE TABLE test_is_null_value_function (
  variant_value VARIANT);
```

Insert a string value into the column using the [PARSE_JSON](parse_json.md) function:

```sqlexample
INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON('"string value"'));
```

> **Note:**
>
> The PARSE_JSON function returns a VARIANT value.

Insert a JSON null value into the column:

```sqlexample
INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON('null'));
```

Insert an empty object into the column:

```sqlexample
INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON('{}'));
```

Insert two rows with JSON name/value pairs into the VARIANT column :

```sqlexample
INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON('{"x": null}'));

INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON('{"x": "foo"}'));
```

Insert a NULL into the column:

```sqlexample
INSERT INTO test_is_null_value_function (variant_value)
  (SELECT PARSE_JSON(NULL));
```

Query the table:

```sqlexample
SELECT variant_value,
       variant_value:x value_of_x,
       IS_NULL_VALUE(variant_value) is_variant_value_a_json_null,
       IS_NULL_VALUE(variant_value:x) is_x_a_json_null,
       IS_NULL_VALUE(variant_value:y) is_y_a_json_null
  FROM test_is_null_value_function;
```

```output
+----------------+------------+------------------------------+------------------+------------------+
| VARIANT_VALUE  | VALUE_OF_X | IS_VARIANT_VALUE_A_JSON_NULL | IS_X_A_JSON_NULL | IS_Y_A_JSON_NULL |
|----------------+------------+------------------------------+------------------+------------------|
| "string value" | NULL       | False                        | NULL             | NULL             |
| null           | NULL       | True                         | NULL             | NULL             |
| {}             | NULL       | False                        | NULL             | NULL             |
| {              | null       | False                        | True             | NULL             |
|   "x": null    |            |                              |                  |                  |
| }              |            |                              |                  |                  |
| {              | "foo"      | False                        | False            | NULL             |
|   "x": "foo"   |            |                              |                  |                  |
| }              |            |                              |                  |                  |
| NULL           | NULL       | NULL                         | NULL             | NULL             |
+----------------+------------+------------------------------+------------------+------------------+
```

In the query results:

* The `variant_value` column shows six rows of inserted VARIANT values.
* The `value_of_x` column shows the JSON value for the name `x` in each row.
* The `is_variant_value_a_json_null` column returns the results of the IS_NULL_VALUE function
  for the VARIANT value in each row.
* The `is_x_a_json_null` column returns the results of the IS_NULL_VALUE function
  for the name `x` in each row. Rows without an `x` name return NULL.
* The `is_y_a_json_null` column returns the results of the IS_NULL_VALUE function
  for the name `y` in each row. Because there is no matching `y` name in any
  row, all of the rows return NULL.

---
title: IS_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/is_object.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_OBJECT

Returns TRUE if its [VARIANT](../data-types-semistructured.md) argument contains an [OBJECT](../data-types-semistructured.md) value.

See also:
:   [IS_<object_type>](is.md) , [IS_ARRAY](is_array.md)

## Syntax

```sqlsyntax
IS_OBJECT( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains an OBJECT value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

The following examples use the IS_OBJECT function.

### Use the IS_OBJECT function in a WHERE clause

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

Show the OBJECT values in the data by using the IS_OBJECT function
in a WHERE clause:

```sqlexample
SELECT * FROM vartab WHERE IS_OBJECT(v);
```

```output
+---+---------------+
| N | V             |
|---+---------------|
| 9 | {             |
|   |   "x": "abc", |
|   |   "y": false, |
|   |   "z": 10     |
|   | }             |
+---+---------------+
```

### Use the IS_OBJECT function in a SELECT list

Create and fill the `multiple_types` table. The INSERT statement uses the [TO_VARIANT](to_variant.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the columns.

```sqlexample
CREATE OR REPLACE TABLE multiple_types (
  array1 VARIANT,
  array2 VARIANT,
  boolean1 VARIANT,
  varchar1 VARIANT,
  varchar2 VARIANT,
  decimal1 VARIANT,
  double1 VARIANT,
  integer1 VARIANT,
  object1 VARIANT);

INSERT INTO multiple_types
    (array1, array2, boolean1, varchar1, varchar2,
     decimal1, double1, integer1, object1)
  SELECT
    TO_VARIANT(TO_ARRAY('Example')),
    TO_VARIANT(ARRAY_CONSTRUCT('Array-like', 'example')),
    TO_VARIANT(TRUE),
    TO_VARIANT('X'),
    TO_VARIANT('I am a real character'),
    TO_VARIANT(1.23::DECIMAL(6, 3)),
    TO_VARIANT(3.21::DOUBLE),
    TO_VARIANT(15),
    TO_VARIANT(TO_OBJECT(PARSE_JSON('{"Tree": "Pine"}')));
```

Query the data using the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT TYPEOF(array1),
       TYPEOF(array2),
       TYPEOF(boolean1),
       TYPEOF(varchar1),
       TYPEOF(varchar2),
       TYPEOF(decimal1),
       TYPEOF(double1),
       TYPEOF(integer1),
       TYPEOF(object1)
  FROM multiple_types;
```

```output
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
| TYPEOF(ARRAY1) | TYPEOF(ARRAY2) | TYPEOF(BOOLEAN1) | TYPEOF(VARCHAR1) | TYPEOF(VARCHAR2) | TYPEOF(DECIMAL1) | TYPEOF(DOUBLE1) | TYPEOF(INTEGER1) | TYPEOF(OBJECT1) |
|----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------|
| ARRAY          | ARRAY          | BOOLEAN          | VARCHAR          | VARCHAR          | DECIMAL          | DOUBLE          | INTEGER          | OBJECT          |
+----------------+----------------+------------------+------------------+------------------+------------------+-----------------+------------------+-----------------+
```

Show whether a column contains OBJECT values in the data by using the
IS_OBJECT function in a SELECT list:

```sqlexample
SELECT IS_OBJECT(array1),
       IS_OBJECT(boolean1),
       IS_OBJECT(object1)
  FROM multiple_types;
```

```output
+-------------------+---------------------+--------------------+
| IS_OBJECT(ARRAY1) | IS_OBJECT(BOOLEAN1) | IS_OBJECT(OBJECT1) |
|-------------------+---------------------+--------------------|
| False             | False               | True               |
+-------------------+---------------------+--------------------+
```

---
title: IS_ORGANIZATION_USER
source: https://docs.snowflake.com/en/sql-reference/functions/is_organization_user.md
section: SQL Functions
---

Categories:
:   [Organization user and organization user group functions](../functions-organization-users.md)

# IS_ORGANIZATION_USER

Returns TRUE if the argument is a Snowflake user who is an [organization user](../../user-guide/organization-users.md).

## Syntax

```sqlsyntax
IS_ORGANIZATION_USER( <exp> )
```

## Arguments

`exp`
:   Expression that resolves to the name of a Snowflake user object.

    If passing in the literal name of a user, surround it with single quotes (for example, `'joe'`).

## Returns

Returns TRUE if the argument is an organization user.

## Usage notes

In data sharing contexts, this function returns NULL if it is called from a consumer account that exists in a *different organization* than
the provider account. Calling the function from a consumer account in the *same organization* as the provider returns TRUE or FALSE.

## Examples

Determine if the user `joe` in the current account is an organization user:

```sqlexample
SELECT IS_ORGANIZATION_USER('joe');
```

Determine if the current user in the session is an organization user:

```sqlexample
SELECT IS_ORGANIZATION_USER(CURRENT_USER());
```

---
title: IS_ORGANIZATION_USER_GROUP
source: https://docs.snowflake.com/en/sql-reference/functions/is_organization_user_group.md
section: SQL Functions
---

Categories:
:   [Organization user and organization user group functions](../functions-organization-users.md)

# IS_ORGANIZATION_USER_GROUP

Returns TRUE if the specified [role](../../user-guide/security-access-control-overview.md) was created when an administrator added an
[organization user group](../../user-guide/organization-users.md) to the account.

## Syntax

```sqlsyntax
IS_ORGANIZATION_USER_GROUP( '<role>' )
```

## Arguments

`'role'`
:   Role in the current account.

## Returns

Returns TRUE if the specified role was created from or linked to an organization user group.

## Usage notes

In data sharing contexts, this function returns NULL if it is called from a consumer account that exists in a *different organization* than
the provider account. Calling the function from a consumer account in the *same organization* as the provider returns TRUE or FALSE.

## Examples

Determine if the role `data_stewards` in the current account was created from an organization user group.

```sqlexample
SELECT IS_ORGANIZATION_USER_GROUP('data_stewards');
```

---
title: IS_ORGANIZATION_USER_GROUP_IN_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/is_organization_user_group_in_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_ORGANIZATION_USER_GROUP_IN_SESSION

Assuming a role was imported from an [organization user group](../../user-guide/organization-users.md), verifies whether the role is in the user’s
active primary or secondary role hierarchy for the session.

The function returns FALSE if the specified role is not linked to an organization user group.

See also:
:   [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md)

## Syntax

> ```sqlsyntax
> IS_ORGANIZATION_USER_GROUP_IN_SESSION( '<string_literal>' )
> ```

## Arguments

`'string_literal'`
:   The name of an role.

## Returns

`TRUE`
:   The current user’s active [primary role or secondary roles](../../user-guide/security-access-control-overview.md) in the session inherit the
    privileges of the specified role.

    When the `DEFAULT_SECONDARY_ROLES` value is `ALL`, any role granted to the user inherits the privileges of the
    specified role.

    The specified role can be the current primary role or secondary role (that is, the roles returned by
    [CURRENT_ROLE](current_role.md) or [CURRENT_SECONDARY_ROLES](current_secondary_roles.md), respectively) or any role
    lower in the role hierarchy.

`FALSE`
:   Either of the following:

    * The specified role is a local role that is not linked to an organization user group.
    * The specified role is either higher in the role hierarchy of the current primary or secondary roles
      or is not in the role hierarchy at all.

`NULL`
:   In a data sharing consumer account, this function returns NULL if referencing a shared object (e.g. secure UDF or secure view), such
    as in a masking policy condition. This behavior prevents exposing the role hierarchy in a data sharing consumer account.

## Usage notes

The IS_ORGANIZATION_USER_GROUP_IN_SESSION function is similar to the [IS_DATABASE_ROLE_IN_SESSION](is_database_role_in_session.md) and
[IS_ROLE_IN_SESSION](is_role_in_session.md) functions. The following usage notes apply to all of these context functions:

* Use one syntax.
* Name syntax:

  + Only one role name can be passed as an argument.
  + The argument must be a string and use the same casing as how the role is stored in Snowflake. For details, see
    [Identifier requirements](../identifiers-syntax.md).
* Column syntax:

  + Only one column can be passed as an argument.
  + The column must have a [STRING](../data-types-text.md) data type.
  + Specify the column as one of the following:

    - `column_name`
    - `table_name.column_name`
    - `schema_name.table_name.column_name`
    - `database_name.schema_name.table_name.column_name`
* Virtual columns:

  A virtual column, which contains the result of a calculated value from an expression rather than the calculated value being stored in the
  table, is not supported.

  ```sqlexample
  SELECT IS_ROLE_IN_SESSION(UPPER(authz_role)) FROM t1;
  ```

  A virtual column is supported only when the expression has an alias for the column name:

  ```sqlexample
  CREATE VIEW v2 AS
  SELECT
    authz_role,
    UPPER(authz_role) AS upper_authz_role
  FROM t2;

  SELECT IS_ROLE_IN_SESSION(upper_authz_role) FROM v2;
  ```
* Policies:

  If you use these functions with a [masking policy](../../user-guide/security-column-intro.md) or
  [row access policy](../../user-guide/security-row-intro.md), verify that your Snowflake account is Enterprise Edition or higher.

  Snowflake recommends using this function when the policy conditions need to evaluate role hierarchy and inherited privileges.
* Result cache:

  If you use this function in a masking policy or row access policy and neither the policy nor the table or column protected by the policy
  change from a previous query, you can use the [RESULT_SCAN](result_scan.md) function to return the results of a query on
  the protected table. The result cache applies when using the nonliteral syntax only.
* These functions cannot be used in the materialized view definition because the functions are not deterministic and Snowflake cannot
  determine what data to materialize.

## Examples

The following example returns TRUE if the following is true:

* Role `analyst` was created or linked when an organization user group was added to the account.
* The privileges granted to the `analyst` role are inherited by the current role in the session.

```sqlexample
SELECT IS_ORGANIZATION_USER_GROUP_IN_SESSION('ANALYST');
```

---
title: IS_ROLE_ACTIVATED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_role_activated.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_ROLE_ACTIVATED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if an account role is activated in the current session.

See also:
:   [IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)](is_database_role_activated.md)
    [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$SESSION' ,
  'IS_ROLE_ACTIVATED' ,
  '<role>'
)
```

## Arguments

`'SNOWFLAKE$SESSION'`
:   Specifies that you want to call a function to return context information about the current session.

`'IS_ROLE_ACTIVATED'`
:   Calls the IS_ROLE_ACTIVATED function.

`'role'`
:   Specifies the account role to check.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the account role is activated in the current session.
* `'FALSE'` if the account role is not activated or if the account role is not valid.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_ROLE_ACTIVATED', 'my_role')::BOOLEAN = TRUE;
```

## Usage notes

## Examples

The following example returns `'TRUE'` if the role `my_role` is in the role hierarchy of the session’s primary or secondary
roles:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_ROLE_ACTIVATED', 'my_role');
```

---
title: IS_ROLE_IN_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/is_role_in_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session Object)

# IS_ROLE_IN_SESSION

Verifies whether the specified account role is in the currently active primary or secondary role hierarchy.

This function looks only at the *currently* active set of roles, not at the roles activated in the session. The currently
active roles can differ from the session roles, for example, when executing an owner’s rights procedure or a Streamlit.

See also:
:   [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md)

## Syntax

**Literal — specify a role directly:**

> ```sqlsyntax
> IS_ROLE_IN_SESSION( '<string_literal>' )
> ```

**Expression — specify a role expression:**

> ```sqlsyntax
> IS_ROLE_IN_SESSION( <expr> )
> ```

**Column — specify a column:**

> ```sqlsyntax
> IS_ROLE_IN_SESSION( <column_name> )
> ```

## Arguments

`'string_literal'`
:   The name of the role.

`expr`
:   An expression that returns the name of the role.

`column_name`
:   The column name in a table or view that contains the name of the role.

## Returns

`TRUE`:
:   * For a string literal or expression argument, the current user’s active
      [primary role or secondary roles](../../user-guide/security-access-control-overview.md) in the session inherit the privileges of the specified
      role.

      When the `DEFAULT_SECONDARY_ROLES` value is `ALL`, any role granted to the user inherits the privileges of the
      specified role.

      The specified role can be the current primary role or secondary role (that is, the roles returned by
      [CURRENT_ROLE](current_role.md) or [CURRENT_SECONDARY_ROLES](current_secondary_roles.md), respectively) or any role
      lower in the role hierarchy.
    * For a column argument, Snowflake evaluates each row and returns a row that contains a value that specifies an active primary or
      secondary role in the user’s current session. Each row corresponds to a role name that the active primary or secondary role can see.

`FALSE`
:   * For a string literal or expression argument, the specified role is either higher in the role hierarchy of the current primary or
      secondary roles, or the role is not in the role hierarchy at all.
    * For a nonliteral argument, Snowflake evaluates each row. If a row contains a role name that is either higher in the role hierarchy
      of the current primary or secondary roles or is not in the role hierarchy at all, Snowflake does not return this row. In this case,
      Snowflake only returns rows containing the role names the active primary or secondary role can see (if any).

`NULL`
:   * This function returns NULL when used in a shared object, such as a secure view, when accessed through a data sharing consumer account.
      This behavior prevents exposing the role hierarchy in a data sharing consumer account.

## Usage notes

* Use one syntax.
* Name syntax:

  + Only one role name can be passed as an argument.
  + The argument must be a string and use the same casing as how the role is stored in Snowflake. For details, see
    [Identifier requirements](../identifiers-syntax.md).
* Column syntax:

  + Only one column can be passed as an argument.
  + The column must have a [STRING](../data-types-text.md) data type.
  + Specify the column as one of the following:

    - `column_name`
    - `table_name.column_name`
    - `schema_name.table_name.column_name`
    - `database_name.schema_name.table_name.column_name`
* Virtual columns:

  A virtual column, which contains the result of a calculated value from an expression rather than the calculated value being stored in the
  table, is not supported.

  ```sqlexample
  SELECT IS_ROLE_IN_SESSION(UPPER(authz_role)) FROM t1;
  ```

  A virtual column is supported only when the expression has an alias for the column name:

  ```sqlexample
  CREATE VIEW v2 AS
  SELECT
    authz_role,
    UPPER(authz_role) AS upper_authz_role
  FROM t2;

  SELECT IS_ROLE_IN_SESSION(upper_authz_role) FROM v2;
  ```
* Policies:

  If you use these functions with a [masking policy](../../user-guide/security-column-intro.md) or
  [row access policy](../../user-guide/security-row-intro.md), verify that your Snowflake account is Enterprise Edition or higher.

  Snowflake recommends using this function when the policy conditions need to evaluate role hierarchy and inherited privileges.
* Result cache:

  If you use this function in a masking policy or row access policy and neither the policy nor the table or column protected by the policy
  change from a previous query, you can use the [RESULT_SCAN](result_scan.md) function to return the results of a query on
  the protected table. The result cache applies when using the nonliteral syntax only.
* These functions cannot be used in the materialized view definition because the functions are not deterministic and Snowflake cannot
  determine what data to materialize.

## Examples

Verify if the privileges granted to a specified role are inherited by the current role in the session:

> ```sqlexample
> SELECT IS_ROLE_IN_SESSION('ANALYST');
>
> +-------------------------------+
> | IS_ROLE_IN_SESSION('ANALYST') |
> |-------------------------------|
> | True                          |
> +-------------------------------+
> ```

Return active primary or secondary role values for the column named ROLE_NAME:

> ```sqlexample
> SELECT *
> FROM d1.s1.t1
> WHERE IS_ROLE_IN_SESSION(t1.role_name);
> ```

Specify a role directly in a masking policy condition:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY allow_analyst AS (val string)
> RETURNS string ->
> CASE
>   WHEN IS_ROLE_IN_SESSION('ANALYST') THEN val
>   ELSE '*******'
> END;
> ```

Specify a role expression in a masking policy condition:

> ```sqlexample
> CREATE OR REPLACE MASKING POLICY allow_tag_role AS (val string)
> RETURNS string ->
> CASE
>   WHEN IS_ROLE_IN_SESSION(SYSTEM$GET_TAG_ON_CURRENT_TABLE('D1.S1.ALLOWED_ROLE')) THEN val
>   ELSE '*******'
> END;
> ```

Specify the column named AUTHZ_ROLE (that is, the authorized role) in a row access policy and then set the policy on the table column:

> Create the policy:
>
> > ```sqlexample
> > CREATE OR REPLACE ROW ACCESS POLICY rap_authz_role AS (authz_role string)
> > RETURNS boolean ->
> > IS_ROLE_IN_SESSION(authz_role);
> > ```
>
> Add the policy to a table:
>
> > ```sqlexample
> > ALTER TABLE allowed_roles
> >   ADD ROW ACCESS POLICY rap_authz_role ON (authz_role);
> > ```

Specify the column named AUTHZ_ROLE in a row access policy that uses a mapping table to lookup the authorized role
in a mapping table column named ROLE_NAME. After creating the policy, add the policy to the table containing the AUTHZ_ROLE column:

> Create the policy:
>
> > ```sqlexample
> > CREATE OR REPLACE ROW ACCESS POLICY rap_authz_role_map AS (authz_role string)
> > RETURNS boolean ->
> > EXISTS (
> >   SELECT 1 FROM mapping_table m
> >   WHERE authz_role = m.key and IS_ROLE_IN_SESSION(m.role_name)
> > );
> > ```
>
> Add the policy to a table:
>
> > ```sqlexample
> > ALTER TABLE allowed_roles
> >   ADD ROW ACCESS POLICY rap_authz_role_map ON (authz_role);
> > ```

---
title: IS_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/is_time.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_TIME

Verifies whether a [VARIANT](../data-types-semistructured.md) argument contains a [TIME](../data-types-datetime.md) value.

See also:
:   [IS_<object_type>](is.md) , [IS_DATE , IS_DATE_VALUE](is_date-value.md) , [IS_TIMESTAMP_\*](is_timestamp.md)

## Syntax

```sqlsyntax
IS_TIME( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a TIME value. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

Return all of the TIME values in a VARIANT column.

> **Note:**
>
> The output format for TIME values is set using the [TIME_OUTPUT_FORMAT](../parameters.md) parameter. The default setting is `HH24:MI:SS`.

Create and load a table with various date and TIME values in a VARIANT column:

```sqlexample
CREATE OR REPLACE TABLE vardttm (v VARIANT);
```

```sqlexample
INSERT INTO vardttm SELECT TO_VARIANT(TO_DATE('2024-02-24'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIME('20:57:01.123456789+07:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP('2023-02-24 12:00:00.456'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_LTZ('2022-02-24 13:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_NTZ('2021-02-24 14:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_TZ('2020-02-24 15:00:00.123 +01:00'));
```

Use the [TYPEOF](typeof.md) function in a query to show the data types of the values stored in the VARIANT column `v`:

```sqlexample
SELECT v, TYPEOF(v) AS type FROM vardttm;
```

```output
+---------------------------------+---------------+
| V                               | TYPE          |
|---------------------------------+---------------|
| "2024-02-24"                    | DATE          |
| "20:57:01"                      | TIME          |
| "2023-02-24 12:00:00.456"       | TIMESTAMP_NTZ |
| "2022-02-24 04:00:00.123 -0800" | TIMESTAMP_LTZ |
| "2021-02-24 14:00:00.123"       | TIMESTAMP_NTZ |
| "2020-02-24 15:00:00.123 +0100" | TIMESTAMP_TZ  |
+---------------------------------+---------------+
```

Show the TIME values in the data by using the IS_TIME function in a WHERE clause. Only the TIME value
is returned in the output. The DATE and TIMESTAMP values aren’t returned.

```sqlexample
SELECT v FROM vardttm WHERE IS_TIME(v);
```

```output
+------------+
| V          |
|------------|
| "20:57:01" |
+------------+
```

---
title: IS_TIMESTAMP_*
source: https://docs.snowflake.com/en/sql-reference/functions/is_timestamp.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# IS_TIMESTAMP_\*

Verifies whether a [VARIANT](../data-types-semistructured.md) argument contains the respective
[timestamp](../data-types-datetime.md) value:

* IS_TIMESTAMP_LTZ (value with local time zone).
* IS_TIMESTAMP_NTZ (value with no time zone).
* IS_TIMESTAMP_TZ (value with time zone).

See also:
:   [IS_<object_type>](is.md) , [IS_DATE , IS_DATE_VALUE](is_date-value.md) , [IS_TIME](is_time.md)

## Syntax

```sqlsyntax
IS_TIMESTAMP_LTZ( <variant_expr> )

IS_TIMESTAMP_NTZ( <variant_expr> )

IS_TIMESTAMP_TZ( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression that evaluates to a value of type VARIANT.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if the VARIANT value contains a timestamp. Otherwise, returns FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Examples

Show all timestamps in a VARIANT column, with the output using the time zone specified for the session.

> **Note:**
>
> The output format for the time zone is set using a parameter:
>
> * The [TIMESTAMP_LTZ_OUTPUT_FORMAT](../parameters.md) parameter sets the format for TIMESTAMP_LTZ values.
> * The [TIMESTAMP_NTZ_OUTPUT_FORMAT](../parameters.md) parameter sets the format for TIMESTAMP_NTZ values.
> * The [TIMESTAMP_TZ_OUTPUT_FORMAT](../parameters.md) parameter sets the format for TIMESTAMP_TZ values.

In these examples, the local time zone is US Pacific Standard Time (-08:00 relative to GMT/UCT).

Create and load a table with various date and time values in a VARIANT column:

```sqlexample
CREATE OR REPLACE TABLE vardttm (v VARIANT);
```

```sqlexample
INSERT INTO vardttm SELECT TO_VARIANT(TO_DATE('2024-02-24'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIME('20:57:01.123456789+07:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP('2023-02-24 12:00:00.456'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_LTZ('2022-02-24 13:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_NTZ('2021-02-24 14:00:00.123 +01:00'));
INSERT INTO vardttm SELECT TO_VARIANT(TO_TIMESTAMP_TZ('2020-02-24 15:00:00.123 +01:00'));
```

Use the [TYPEOF](typeof.md) function in a query to show the data types of the values stored in the VARIANT column `v`:

```sqlexample
SELECT v, TYPEOF(v) AS type FROM vardttm;
```

```output
+---------------------------------+---------------+
| V                               | TYPE          |
|---------------------------------+---------------|
| "2024-02-24"                    | DATE          |
| "20:57:01"                      | TIME          |
| "2023-02-24 12:00:00.456"       | TIMESTAMP_NTZ |
| "2022-02-24 04:00:00.123 -0800" | TIMESTAMP_LTZ |
| "2021-02-24 14:00:00.123"       | TIMESTAMP_NTZ |
| "2020-02-24 15:00:00.123 +0100" | TIMESTAMP_TZ  |
+---------------------------------+---------------+
```

Show the TIMESTAMP_NTZ values in the data by using the IS_TIMESTAMP_NTZ function in a WHERE clause:

```sqlexample
SELECT * FROM vardttm WHERE IS_TIMESTAMP_NTZ(v);
```

```output
+---------------------------+
| V                         |
|---------------------------|
| "2023-02-24 12:00:00.456" |
| "2021-02-24 14:00:00.123" |
+---------------------------+
```

Show the TIMESTAMP_LTZ values in the data by using the IS_TIMESTAMP_LTZ function in a WHERE clause:

```sqlexample
SELECT * FROM vardttm WHERE IS_TIMESTAMP_LTZ(v);
```

```output
+---------------------------------+
| V                               |
|---------------------------------|
| "2022-02-24 04:00:00.123 -0800" |
+---------------------------------+
```

Show the TIMESTAMP_TZ values in the data by using the IS_TIMESTAMP_TZ function in a WHERE clause:

```sqlexample
SELECT * FROM vardttm WHERE IS_TIMESTAMP_TZ(v);
```

```output
+---------------------------------+
| V                               |
|---------------------------------|
| "2020-02-24 15:00:00.123 +0100" |
+---------------------------------+
```

---
title: IS_USER_IMPORTED (SYS_CONTEXT function)
source: https://docs.snowflake.com/en/sql-reference/functions/is_user_imported.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# IS_USER_IMPORTED (SYS_CONTEXT function)

Returns the VARCHAR value `'TRUE'` if the specified user is an [organization user](../../user-guide/organization-users.md) that
was imported into the current account.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](is_group_activated.md) ,
    [IS_GROUP_IMPORTED (SYS_CONTEXT function)](is_group_imported.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ORGANIZATION' ,
  'IS_USER_IMPORTED' ,
  '<user_name>'
)
```

## Arguments

`'SNOWFLAKE$ORGANIZATION'`
:   Specifies that you want to call a function to return context information about the current organization.

`'IS_USER_IMPORTED'`
:   Calls the IS_USER_IMPORTED function.

`'user_name'`
:   Specifies the name of the user to check.

## Returns

The function returns one of the following VARCHAR values:

* `'TRUE'` if the user is an organization user that was imported into the current account.
* `'FALSE'` if the user is not an organization user, was not imported into the current account, or is not a valid user.

To compare this return value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return
value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_USER_IMPORTED', 'my_user_name')::BOOLEAN = TRUE;
```

## Usage notes

## Examples

The following example returns `'TRUE'` if the user `my_user_name` is an organization user that was imported into the current
account:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION', 'IS_USER_IMPORTED', 'my_user_name');
```

---
title: JAROWINKLER_SIMILARITY
source: https://docs.snowflake.com/en/sql-reference/functions/jarowinkler_similarity.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# JAROWINKLER_SIMILARITY

Computes the [Jaro-Winkler similarity](https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance) between two input strings.
The function returns an integer between 0 and 100, where 0 indicates no similarity and 100 indicates an exact match.

> **Note:**
>
> * The similarity computation is case-insensitive.
> * The computation is sensitive to all formatting characters, including white space characters.
> * The default [scaling factor](https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance#Jaro%E2%80%93Winkler_Similarity)
>   of 0.1 is used for the computation.

## Syntax

```sqlsyntax
JAROWINKLER_SIMILARITY( <string_expr1> , <string_expr2> )
```

## Arguments

**Required:**

`string_expr1`, . `string_expr2`
:   The input strings.

## Usage notes

* When the function compares short strings, the execution time is proportional to the product of the lengths of the input strings.
* When the function compares long strings, the execution time is proportional to the length of the longer string.

## Collation details

No impact.
In languages where the alphabet contains digraphs or trigraphs (such as “Dz” and “Dzs” in Hungarian), each character in each digraph and trigraph is treated as an independent character, not as part of a single multi-character letter.

The result is based solely on the characters in the strings, not on the collation specifications of the strings.

## Examples

The following example computes the similarity between the strings in the columns `s` and `t` in the table `ed`.

```sqlexample
SELECT s, t, JAROWINKLER_SIMILARITY(s, t), JAROWINKLER_SIMILARITY(t, s) FROM ed;

----------------+-----------------+------------------------------+------------------------------+
      S         |        T        | JAROWINKLER_SIMILARITY(S, T) | JAROWINKLER_SIMILARITY(T, S) |
----------------+-----------------+------------------------------+------------------------------+
                |                 | 0                            | 0                            |
 Gute nacht     | Ich weis nicht  | 56                           | 56                           |
 Ich weiß nicht | Ich wei? nicht  | 98                           | 98                           |
 Ich weiß nicht | Ich weiss nicht | 97                           | 97                           |
 Ich weiß nicht | [NULL]          | [NULL]                       | [NULL]                       |
 Snowflake      | Oracle          | 61                           | 61                           |
 święta         | swieta          | 77                           | 77                           |
 [NULL]         |                 | [NULL]                       | [NULL]                       |
 [NULL]         | [NULL]          | [NULL]                       | [NULL]                       |
----------------+-----------------+------------------------------+------------------------------+
```

---
title: JSON_EXTRACT_PATH_TEXT
source: https://docs.snowflake.com/en/sql-reference/functions/json_extract_path_text.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# JSON_EXTRACT_PATH_TEXT

Parses the first argument as a JSON string and returns the value of the element pointed to by the path in the second
argument. This is equivalent to `TO_VARCHAR(GET_PATH(PARSE_JSON(JSON), PATH))`

## Syntax

```sqlsyntax
JSON_EXTRACT_PATH_TEXT( <column_identifier> , '<path_name>' )
```

## Arguments

`column_identifier`
:   The name of the column with the data that you want to extract.

`path_name`
:   A string that contains the path to the element that you want to extract.

## Returns

The data type of the returned value is VARCHAR.

## Usage notes

* The function returns NULL if the path name does not correspond to any element.
* The path name syntax is standard JavaScript notation; it consists of a concatenation of field names (identifiers)
  preceded by periods (e.g. `.`) and index operators (e.g. `[<index>]`):

  + The first field name does not require the leading period to be specified.
  + The index values in the index operators can be non-negative integers (for arrays) or single or
    double-quoted string literals (for object fields).

  For more details, see [Querying Semi-structured Data](../../user-guide/querying-semistructured.md).
* To maintain syntactic consistency, the path notation also supports SQL-style double-quoted identifiers, and use of
  `:` as path separators.

## Examples

Create a table and insert values:

> ```sqlexample
> CREATE TABLE demo1 (id INTEGER, json_data VARCHAR);
> INSERT INTO demo1 SELECT
>    1, '{"level_1_key": "level_1_value"}';
> INSERT INTO demo1 SELECT
>    2, '{"level_1_key": {"level_2_key": "level_2_value"}}';
> INSERT INTO demo1 SELECT
>    3, '{"level_1_key": {"level_2_key": ["zero", "one", "two"]}}';
> ```

Use JSON_EXTRACT_PATH_TEXT to extract a value from a simple 1-level string:

> ```sqlexample
> SELECT
>         TO_VARCHAR(GET_PATH(PARSE_JSON(json_data), 'level_1_key'))
>             AS OLD_WAY,
>         JSON_EXTRACT_PATH_TEXT(json_data, 'level_1_key')
>             AS JSON_EXTRACT_PATH_TEXT
>     FROM demo1
>     ORDER BY id;
> +--------------------------------------+--------------------------------------+
> | OLD_WAY                              | JSON_EXTRACT_PATH_TEXT               |
> |--------------------------------------+--------------------------------------|
> | level_1_value                        | level_1_value                        |
> | {"level_2_key":"level_2_value"}      | {"level_2_key":"level_2_value"}      |
> | {"level_2_key":["zero","one","two"]} | {"level_2_key":["zero","one","two"]} |
> +--------------------------------------+--------------------------------------+
> ```

Use JSON_EXTRACT_PATH_TEXT to extract a value from a 2-level string using a 2-level path:

> ```sqlexample
> SELECT
>         TO_VARCHAR(GET_PATH(PARSE_JSON(json_data), 'level_1_key.level_2_key'))
>             AS OLD_WAY,
>         JSON_EXTRACT_PATH_TEXT(json_data, 'level_1_key.level_2_key')
>             AS JSON_EXTRACT_PATH_TEXT
>     FROM demo1
>     ORDER BY id;
> +----------------------+------------------------+
> | OLD_WAY              | JSON_EXTRACT_PATH_TEXT |
> |----------------------+------------------------|
> | NULL                 | NULL                   |
> | level_2_value        | level_2_value          |
> | ["zero","one","two"] | ["zero","one","two"]   |
> +----------------------+------------------------+
> ```

This example contains an array:

> ```sqlexample
> SELECT
>       TO_VARCHAR(GET_PATH(PARSE_JSON(json_data), 'level_1_key.level_2_key[1]'))
>           AS OLD_WAY,
>       JSON_EXTRACT_PATH_TEXT(json_data, 'level_1_key.level_2_key[1]')
>           AS JSON_EXTRACT_PATH_TEXT
>     FROM demo1
>     ORDER BY id;
> +---------+------------------------+
> | OLD_WAY | JSON_EXTRACT_PATH_TEXT |
> |---------+------------------------|
> | NULL    | NULL                   |
> | NULL    | NULL                   |
> | one     | one                    |
> +---------+------------------------+
> ```

---
title: KURTOSIS
source: https://docs.snowflake.com/en/sql-reference/functions/kurtosis.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md)

# KURTOSIS

Returns the sample excess kurtosis of non-NULL records. If all records inside a group are NULL, the function returns NULL.

The following formula is used to compute the sample excess kurtosis:

\[(n \* (n+1))/((n-1) \* (n-2) \* (n-3)) \* (n \* m_4/(k_2)^2) - 3 \* (n-1)^2 / ((n-2) \* (n-3))\]

where:

* \(n\) denotes the number of non-NULL records.
* \(m_4\) denotes the sample fourth central moment.
* \(k_2\) denotes the symmetric unbiased estimator of the variance.

## Syntax

**Aggregate function**

```sqlsyntax
KURTOSIS( <expr> )
```

**Window function**

```sqlsyntax
KURTOSIS( <expr> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr`
:   An expression that evaluates to a numeric data type (such as INTEGER, FLOAT, DECIMAL).

`expr2`
:   An expression that defines the individual groups or windows.

## Returns

Returns DOUBLE if the input data type is DOUBLE/FLOAT.

Returns DECIMAL if the input data type is another numeric data type.

## Usage notes

* For inputs with fewer than four records, KURTOSIS returns NULL.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Create a table and insert some rows:

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));

INSERT INTO aggr VALUES
  (1, 10, null),
  (2, 10, 12),
  (2, 20, 22),
  (2, 25, null),
  (2, 30, 35);
```

Select all the data from the table:

```sqlexample
SELECT * FROM aggr
  ORDER BY k, v;
```

```output
+---+-------+-------+
| K |     V |    V2 |
|---+-------+-------|
| 1 | 10.00 |  NULL |
| 2 | 10.00 | 12.00 |
| 2 | 20.00 | 22.00 |
| 2 | 25.00 |  NULL |
| 2 | 30.00 | 35.00 |
+---+-------+-------+
```

Return the KURTOSIS value for each column:

```sqlexample
SELECT KURTOSIS(k), KURTOSIS(v), KURTOSIS(v2)
  FROM aggr;
```

```output
+----------------+-----------------+--------------+
|    KURTOSIS(K) |     KURTOSIS(V) | KURTOSIS(V2) |
|----------------+-----------------+--------------|
| 5.000000000000 | -2.324218750000 |         NULL |
+----------------+-----------------+--------------+
```

---
title: LAG
source: https://docs.snowflake.com/en/sql-reference/functions/lag.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# LAG

Accesses data in a previous row in the same result set without having to join the table to itself.

See also:
:   [LEAD](lead.md)

## Syntax

```sqlsyntax
LAG ( <expr> [ , <offset> , <default> ] ) [ { IGNORE | RESPECT } NULLS ]
    OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`expr`
:   The expression to be returned based on the specified offset.

`offset`
:   The number of rows backward from the current row from which to obtain a value. For example, an `offset` of 2 returns
    the `expr` value with an interval of 2 rows.

    Note that setting a negative offset has the same effect as using the [LEAD](lead.md) function.

    Default is 1.

`default`
:   The expression to return when the offset goes out of the bounds of the window. Supports any expression whose type is compatible with `expr`.

    Default is NULL.

`{ IGNORE | RESPECT } NULLS`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `IGNORE NULLS` excludes any row whose expression evaluates to NULL when offset rows are counted.
    * `RESPECT NULLS` includes any row whose expression evaluates to NULL when offset rows are counted.

    Default: `RESPECT NULLS`

## Usage notes

* The PARTITION BY clause partitions the result set produced by the FROM clause into partitions to which the function is applied.
  For more information, see [Window function syntax and usage](../functions-window-syntax.md).
* The ORDER BY clause orders the data within each partition.

## Examples

Create the table and load the data:

```sqlexample
CREATE OR REPLACE TABLE sales(
  emp_id INTEGER,
  year INTEGER,
  revenue DECIMAL(10,2));
```

```sqlexample
INSERT INTO sales VALUES
  (0, 2010, 1000),
  (0, 2011, 1500),
  (0, 2012, 500),
  (0, 2013, 750);
INSERT INTO sales VALUES
  (1, 2010, 10000),
  (1, 2011, 12500),
  (1, 2012, 15000),
  (1, 2013, 20000);
INSERT INTO sales VALUES
  (2, 2012, 500),
  (2, 2013, 800);
```

This query shows the difference between this year’s revenue and the previous year’s revenue:

```sqlexample
SELECT emp_id, year, revenue,
       revenue - LAG(revenue, 1, 0) OVER (PARTITION BY emp_id ORDER BY year) AS diff_to_prev
  FROM sales
  ORDER BY emp_id, year;
```

```output
+--------+------+----------+--------------+
| EMP_ID | YEAR |  REVENUE | DIFF_TO_PREV |
|--------+------+----------+--------------|
|      0 | 2010 |  1000.00 |      1000.00 |
|      0 | 2011 |  1500.00 |       500.00 |
|      0 | 2012 |   500.00 |     -1000.00 |
|      0 | 2013 |   750.00 |       250.00 |
|      1 | 2010 | 10000.00 |     10000.00 |
|      1 | 2011 | 12500.00 |      2500.00 |
|      1 | 2012 | 15000.00 |      2500.00 |
|      1 | 2013 | 20000.00 |      5000.00 |
|      2 | 2012 |   500.00 |       500.00 |
|      2 | 2013 |   800.00 |       300.00 |
+--------+------+----------+--------------+
```

Create another table and load the data:

```sqlexample
CREATE OR REPLACE TABLE t1 (
  col_1 NUMBER,
  col_2 NUMBER);
```

```sqlexample
INSERT INTO t1 VALUES
  (1, 5),
  (2, 4),
  (3, NULL),
  (4, 2),
  (5, NULL),
  (6, NULL),
  (7, 6);
```

This query shows how the IGNORE NULLS clause affects the output.
All rows (except the first) contain non-NULL values even if the preceding row contained NULL.
If the preceding row contained NULL, then the current row uses the most recent non-NULL value.

```sqlexample
SELECT col_1,
       col_2,
       LAG(col_2) IGNORE NULLS OVER (ORDER BY col_1)
  FROM t1
  ORDER BY col_1;
```

```output
+-------+-------+-----------------------------------------------+
| COL_1 | COL_2 | LAG(COL_2) IGNORE NULLS OVER (ORDER BY COL_1) |
|-------+-------+-----------------------------------------------|
|     1 |     5 |                                          NULL |
|     2 |     4 |                                             5 |
|     3 |  NULL |                                             4 |
|     4 |     2 |                                             4 |
|     5 |  NULL |                                             2 |
|     6 |  NULL |                                             2 |
|     7 |     6 |                                             2 |
+-------+-------+-----------------------------------------------+
```

---
title: LAST_DAY
source: https://docs.snowflake.com/en/sql-reference/functions/last_day.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# LAST_DAY

Returns the last day of the specified date part for a date or timestamp. This function is commonly used to return the
last day of the month for a date or timestamp.

See also:
:   [NEXT_DAY](next_day.md) , [PREVIOUS_DAY](previous_day.md)

## Syntax

```sqlsyntax
LAST_DAY( <date_or_timetamp_expr> [ , <date_part> ] )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

`date_part`
:   The date part for which the last day is returned. Possible values are `year`, `quarter`, `month`,
    or `week` (or any of their supported variations). For more information, see [Supported date and time parts](../functions-date-time.md).

    When `date_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md)
    session parameter. For more details, including examples, see [Calendar weeks and weekdays](../functions-date-time.md).

    For more information, including examples, see [Calendar weeks and weekdays](../functions-date-time.md).

    Default: `month`

## Returns

This function returns a value of type DATE, even if `date_or_timetamp_expr` is a timestamp.

## Examples

Return the last day of the month for the specified date (from a timestamp):

```sqlexample
SELECT TO_DATE('2025-05-08T23:39:20.123-07:00') AS "DATE",
       LAST_DAY("DATE") AS "LAST DAY OF MONTH";
```

```output
+------------+-------------------+
| DATE       | LAST DAY OF MONTH |
|------------+-------------------|
| 2025-05-08 | 2025-05-31        |
+------------+-------------------+
```

Return the last day of the year for the specified date (from a timestamp):

```sqlexample
SELECT TO_DATE('2024-05-08T23:39:20.123-07:00') AS "DATE",
       LAST_DAY("DATE", 'year') AS "LAST DAY OF YEAR";
```

```output
+------------+------------------+
| DATE       | LAST DAY OF YEAR |
|------------+------------------|
| 2024-05-08 | 2024-12-31       |
+------------+------------------+
```

---
title: LAST_QUERY_ID
source: https://docs.snowflake.com/en/sql-reference/functions/last_query_id.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# LAST_QUERY_ID

Returns the ID of a specified query in the current session. If no query is specified, the most recent
query is returned.

> **Tip:**
>
> Instead of using this function with the [RESULT_SCAN](result_scan.md) function to process the results of a
> previous command, you can use the [pipe operator](../operators-flow.md) (`->>`). That way,
> you can run the command and process its result set in a single step.

## Syntax

```sqlsyntax
LAST_QUERY_ID( [ <num> ] )
```

## Arguments

`num`
:   Specifies the query to return, based on the position of the query (within the session).

    Default: `-1`

## Usage notes

* Positive numbers start with the first query that was run in the session. For example:

  + `LAST_QUERY_ID(1)` returns the first query.
  + `LAST_QUERY_ID(2)` returns the second query.
  + `LAST_QUERY_ID(6)` returns the sixth query.
* Negative numbers start with the most recent query in the session. For example:

  + `LAST_QUERY_ID(-1)` returns the most recent query (equivalent to `LAST_QUERY_ID()`).
  + `LAST_QUERY_ID(-2)` returns the second most recent query.
* The last LAST_QUERY_ID function considers all statements that were run within the current session,
  including child statements (for example, statements that were executed as part of a stored procedure,
  anonymous block, or [pipe operator](../operators-flow.md) statement). If you want to
  get the query ID of a statement based only on its position in a series of statements, consider using
  the pipe operator. For more complex use cases, we recommend using the
  [global variable SQLID](../../developer-guide/snowflake-scripting/query-id.md) in Snowflake Scripting blocks.

## Examples

Return the ID for the most recent query:

```sqlexample
SELECT LAST_QUERY_ID();
```

Return the ID for the first query that was run in the session:

```sqlexample
SELECT LAST_QUERY_ID(1);
```

---
title: LAST_SUCCESSFUL_SCHEDULED_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/last_successful_scheduled_time.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md) (Alerts)

# LAST_SUCCESSFUL_SCHEDULED_TIME

Returns the timestamp representing the scheduled time for the most recent successful evaluation of the alert condition, where no
errors occurred when executing the action. (In the [alert history](../../user-guide/alerts.md), these are the alerts with the
STATE CONDITION_FALSE or TRIGGERED.) Refer to [Specifying timestamps based on alert schedules](../../user-guide/alerts.md).

## Syntax

```sqlsyntax
SNOWFLAKE.ALERT.LAST_SUCCESSFUL_SCHEDULED_TIME()
```

## Arguments

None.

## Returns

TIMESTAMP_LTZ value that represents when the most recent successful evaluation of the alert condition was scheduled, or NULL
if there are no recent successful evaluations of the alert condition.

## Usage notes

* This function is defined in the ALERT schema of the SNOWFLAKE database.

  To call this function, you must use a role that is granted the
  [SNOWFLAKE database role](../snowflake-db-roles.md) ALERT_VIEWER. For example, to call the function as a user
  with the role alert_role, execute:

  ```sqlexample
  GRANT DATABASE ROLE snowflake.alert_viewer TO ROLE alert_role;
  ```
* This function can only be called from within an [alert](../../user-guide/alerts.md).

## Examples

Refer to [Specifying timestamps based on alert schedules](../../user-guide/alerts.md).

---
title: LAST_TRANSACTION
source: https://docs.snowflake.com/en/sql-reference/functions/last_transaction.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# LAST_TRANSACTION

Returns the transaction ID of the last transaction that was either committed or rolled back in the current session.

See also:
:   [CURRENT_TRANSACTION](current_transaction.md) , [DESCRIBE TRANSACTION](../sql/desc-transaction.md)

## Syntax

```sqlsyntax
LAST_TRANSACTION()
```

## Arguments

None

## Examples

This example calls the `LAST_TRANSACTION` function:

> ```sqlexample
> SELECT LAST_TRANSACTION();
> ```
>
> Output:
>
> ```sqlexample
> +---------------------+
> | LAST_TRANSACTION()  |
> |---------------------|
> | 1661899308790000000 |
> +---------------------+
> ```

---
title: LAST_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/last_value.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# LAST_VALUE

Returns the last value within an ordered group of values.

See also:
:   [FIRST_VALUE](first_value.md) , [NTH_VALUE](nth_value.md)

## Syntax

```sqlsyntax
LAST_VALUE( <expr> ) [ { IGNORE | RESPECT } NULLS ]
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr`
:   The expression that determines the return value.

`expr1`
:   The expression by which to partition the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    PARTITION BY column_1, column_2
    ```

`expr2`
:   The expression by which to order the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    ORDER BY column_3, column_4
    ```

`{ IGNORE | RESPECT } NULLS`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `IGNORE NULLS` returns the last non-NULL value.
    * `RESPECT NULLS` returns a NULL value if it is the last value in the expression.

    Default: `RESPECT NULLS`

## Usage notes

* This function is a rank-related function, so it must specify a window. A window clause consists of the following subclauses:

  > + `PARTITION BY expr1` subclause (optional).
  > + `ORDER BY expr2` subclause (required). For details about additional supported ordering options (sort order, ordering
  >   of NULL values, and so on), see the documentation for the [ORDER BY](../constructs/order-by.md) clause, which follows
  >   the same rules.
  > + `window_frame` subclause (optional).
* The order of rows in a window (and thus the result of the query) is fully deterministic only if the keys in the ORDER BY clause
  make each row unique. Consider the following example:

  ```sqlexample
  ... OVER (PARTITION BY p ORDER BY o COLLATE 'lower') ...
  ```

  The query result can vary if any partition contains values of column `o` that are identical, or would be identical
  in a case-insensitive comparison.
* The ORDER BY clause inside the OVER clause controls the order of rows only within the window, not the order of rows in the output
  of the entire query. To control output order, use a separate ORDER BY clause at the outermost level of the query.

* The optional `window_frame` specifies the subset of rows within the window for which the function is calculated.
  If no window frame is specified, the default frame is the entire window:

  `ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING`

  This behavior *differs* from the ANSI standard, which specifies the following default for window frames:

  `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Examples

The first example returns LAST_VALUE results for `column2` partitioned by `column1`:

```sqlexample
SELECT
    column1,
    column2,
    LAST_VALUE(column2) OVER (PARTITION BY column1 ORDER BY column2) AS column2_last
  FROM VALUES
    (1, 10), (1, 11), (1, 12),
    (2, 20), (2, 21), (2, 22);
```

```output
+---------+---------+--------------+
| COLUMN1 | COLUMN2 | COLUMN2_LAST |
|---------+---------+--------------|
|       1 |      10 |           12 |
|       1 |      11 |           12 |
|       1 |      12 |           12 |
|       2 |      20 |           22 |
|       2 |      21 |           22 |
|       2 |      22 |           22 |
+---------+---------+--------------+
```

The following example returns the results of three related functions: [FIRST_VALUE](first_value.md),
[NTH_VALUE](nth_value.md), and LAST_VALUE.

* The query creates a sliding window frame that is three rows wide, which contains:

  + The row that precedes the current row.
  + The current row.
  + The row that follows the current row.
* The `2` in the call `NTH_VALUE(menu_price_usd, 2)` specifies the second row in the window frame
  (which, in this case, is also the current row).
* When the current row is the very first row in the window frame, there is no preceding row to reference, so
  FIRST_VALUE returns a NULL for that row.
* Frame boundaries sometimes extend beyond the rows in a partition, but non-existent rows are not included in window function
  calculations. For example, when the current row is the very first row in the partition and the window frame is
  `ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING`, there is no preceding row to reference, so the FIRST_VALUE function returns the
  value of the first row in the partition.
* The results never match for all three functions, given the data in the table. These functions select the *first*,
  *last*, or *nth* value for each row in the frame, and the selection of values applies separately to each partition.

```sqlexample
SELECT menu_category, menu_item_name, menu_price_usd,
       FIRST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS first_val,
       NTH_VALUE(menu_price_usd, 2) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS nth_val,
       LAST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS last_val
  FROM menu_items
  WHERE menu_category = 'Dessert'
  ORDER BY menu_price_usd;
```

```output
+---------------+--------------------+----------------+-----------+---------+----------+
| MENU_CATEGORY | MENU_ITEM_NAME     | MENU_PRICE_USD | FIRST_VAL | NTH_VAL | LAST_VAL |
|---------------+--------------------+----------------+-----------+---------+----------|
| Dessert       | Popsicle           |           3.00 |      3.00 |    4.00 |     4.00 |
| Dessert       | Ice Cream Sandwich |           4.00 |      3.00 |    4.00 |     5.00 |
| Dessert       | Mango Sticky Rice  |           5.00 |      4.00 |    5.00 |     6.00 |
| Dessert       | Sugar Cone         |           6.00 |      6.00 |    6.00 |     7.00 |
| Dessert       | Waffle Cone        |           6.00 |      5.00 |    6.00 |     6.00 |
| Dessert       | Two Scoop Bowl     |           7.00 |      6.00 |    7.00 |     7.00 |
+---------------+--------------------+----------------+-----------+---------+----------+
```

---
title: LEAD
source: https://docs.snowflake.com/en/sql-reference/functions/lead.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# LEAD

Accesses data in a subsequent row in the same result set without having to join the table to itself.

See also:
:   [LAG](lag.md)

## Syntax

```sqlsyntax
LEAD ( <expr> [ , <offset> , <default> ] ) [ { IGNORE | RESPECT } NULLS ]
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`expr`
:   The string expression to be returned.

`offset`
:   The number of rows forward from the current row from which to obtain a value. For example, an `offset` of 2 returns the
    `expr` value with an interval of 2 rows.

    Note that setting a negative offset has the same effect as using the [LAG](lag.md) function.

    Default is 1. If `IGNORE NULLS` is specified, maximum is 1,000,000.

`default`
:   The expression to return when the offset goes out of the bounds of the window. Supports any expression whose type is compatible with `expr`.

    Default is NULL.

`{ IGNORE | RESPECT } NULLS`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `IGNORE NULLS` excludes any row whose expression evaluates to NULL when offset rows are counted.
    * `RESPECT NULLS` includes any row whose expression evaluates to NULL when offset rows are counted.

    Default: `RESPECT NULLS`

## Usage notes

* The PARTITION BY clause partitions the result set produced by the FROM clause into partitions to which the function is applied.
  For more information, see [Window function syntax and usage](../functions-window-syntax.md).
* The ORDER BY clause orders the data within each partition.

## Examples

```sqlexample
CREATE OR REPLACE TABLE sales(
  emp_id INTEGER,
  year INTEGER,
  revenue DECIMAL(10,2));

INSERT INTO sales VALUES
  (0, 2010, 1000),
  (0, 2011, 1500),
  (0, 2012, 500),
  (0, 2013, 750);
INSERT INTO sales VALUES
  (1, 2010, 10000),
  (1, 2011, 12500),
  (1, 2012, 15000),
  (1, 2013, 20000);
INSERT INTO sales VALUES
  (2, 2012, 500),
  (2, 2013, 800);

SELECT emp_id,
       year,
       revenue,
       LEAD(revenue) OVER (PARTITION BY emp_id ORDER BY year) - revenue AS diff_to_next
  FROM sales
  ORDER BY emp_id, year;
```

```output
+--------+------+----------+--------------+
| EMP_ID | YEAR |  REVENUE | DIFF_TO_NEXT |
|--------+------+----------+--------------|
|      0 | 2010 |  1000.00 |       500.00 |
|      0 | 2011 |  1500.00 |     -1000.00 |
|      0 | 2012 |   500.00 |       250.00 |
|      0 | 2013 |   750.00 |         NULL |
|      1 | 2010 | 10000.00 |      2500.00 |
|      1 | 2011 | 12500.00 |      2500.00 |
|      1 | 2012 | 15000.00 |      5000.00 |
|      1 | 2013 | 20000.00 |         NULL |
|      2 | 2012 |   500.00 |       300.00 |
|      2 | 2013 |   800.00 |         NULL |
+--------+------+----------+--------------+
```

```sqlexample
CREATE OR REPLACE TABLE t1 (
  c1 NUMBER,
  c2 NUMBER);

INSERT INTO t1 VALUES
  (1,5),
  (2,4),
  (3,NULL),
  (4,2),
  (5,NULL),
  (6,NULL),
  (7,6);

SELECT c1,
       c2,
       LEAD(c2) IGNORE NULLS OVER (ORDER BY c1)
  FROM t1;
```

```output
+----+------+------------------------------------------+
| C1 | C2   | LEAD(C2) IGNORE NULLS OVER (ORDER BY C1) |
|----+------+------------------------------------------|
|  1 |  5   |                                        4 |
|  2 |  4   |                                        2 |
|  3 | NULL |                                        2 |
|  4 |  2   |                                        6 |
|  5 | NULL |                                        6 |
|  6 | NULL |                                        6 |
|  7 |  6   |                                     NULL |
+----+------+------------------------------------------+
```

---
title: LEAST
source: https://docs.snowflake.com/en/sql-reference/functions/least.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# LEAST

Returns the smallest value from a list of expressions. LEAST supports all data types, including VARIANT.

See also:
:   [LEAST_IGNORE_NULLS](least_ignore_nulls.md)

## Syntax

```sqlsyntax
LEAST(( <expr1> [ , <expr2> ... ] )
```

## Arguments

`exprN`
:   The arguments must include at least one expression. All the expressions
    should be of the same type or compatible types.

## Returns

The first argument determines the return type:

* If the first type is numeric, then the return type is ‘widened’
  according to the numeric types in the list of all arguments.
* If the first type is not numeric, then all other arguments must be
  convertible to the first type.

If any argument is NULL, returns NULL.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

The following examples use the LEAST function:

```sqlexample
SELECT LEAST(1, 3, 0, 4);
```

```output
+-------------------+
| LEAST(1, 3, 0, 4) |
|-------------------|
|                 0 |
+-------------------+
```

```sqlexample
SELECT col_1,
       col_2,
       col_3,
       LEAST(col_1, col_2, col_3) AS least
  FROM (SELECT 1 AS col_1, 2 AS col_2, 3 AS col_3
    UNION ALL
    SELECT 2, 4, -1
    UNION ALL
    SELECT 3, 6, NULL);
```

```output
+-------+-------+-------+-------+
| COL_1 | COL_2 | COL_3 | LEAST |
|-------+-------+-------+-------|
|     1 |     2 |     3 |     1 |
|     2 |     4 |    -1 |    -1 |
|     3 |     6 |  NULL |  NULL |
+-------+-------+-------+-------+
```

---
title: LEAST_IGNORE_NULLS
source: https://docs.snowflake.com/en/sql-reference/functions/least_ignore_nulls.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# LEAST_IGNORE_NULLS

Returns the smallest non-NULL value from a list of expressions. LEAST_IGNORE_NULLS supports all data types,
including VARIANT.

See also:
:   [LEAST](least.md)

## Syntax

```sqlsyntax
LEAST_IGNORE_NULLS( <expr1> [ , <expr2> ... ] )
```

## Arguments

`exprN`
:   The arguments must include at least one expression. All the expressions
    should be of the same type or compatible types.

## Returns

The first argument determines the return type:

* If the first type is numeric, then the return type is ‘widened’
  according to the numeric types in the list of all arguments.
* If the first type is not numeric, then all other arguments must be
  convertible to the first type.

If all arguments are NULL, returns NULL.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Examples

Create a table and insert some values:

```sqlexample
CREATE TABLE test_least_ignore_nulls (
  col_1 INTEGER,
  col_2 INTEGER,
  col_3 INTEGER,
  col_4 FLOAT);

INSERT INTO test_least_ignore_nulls (col_1, col_2, col_3, col_4) VALUES
  (1, 2,    3,  4.25),
  (2, 4,   -1,  NULL),
  (3, 6, NULL,  -2.75);
```

Run a SELECT statement that returns the lowest non-null value in each row of the table:

```sqlexample
SELECT col_1,
       col_2,
       col_3,
       col_4,
       LEAST_IGNORE_NULLS(col_1, col_2, col_3, col_4) AS least_ignore_nulls
 FROM test_least_ignore_nulls
 ORDER BY col_1;
```

```output
+-------+-------+-------+-------+--------------------+
| COL_1 | COL_2 | COL_3 | COL_4 | LEAST_IGNORE_NULLS |
|-------+-------+-------+-------+--------------------|
|     1 |     2 |     3 |  4.25 |               1    |
|     2 |     4 |    -1 |  NULL |              -1    |
|     3 |     6 |  NULL | -2.75 |              -2.75 |
+-------+-------+-------+-------+--------------------+
```

---
title: LEFT
source: https://docs.snowflake.com/en/sql-reference/functions/left.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# LEFT

Returns a leftmost substring of its input.

`LEFT(STR, N)` is equivalent to `SUBSTR(STR, 1, N)`.

See also:
:   [RIGHT](right.md) , [SUBSTR , SUBSTRING](substr.md)

## Syntax

```sqlsyntax
LEFT( <string_expr> , <length_expr> )
```

## Arguments

`string_expr`
:   An expression that evaluates to a VARCHAR or BINARY value.

`length_expr`
:   An expression that evaluates to an integer. It specifies:

    * The number of UTF-8 characters to return if the input is a VARCHAR value.
    * The number of bytes to return if the input is a BINARY value.

    Specify a length that is greater than or equal to zero. If the length is a negative number, the function returns an
    empty string.

## Returns

The data type of the returned value is the same as the data type of the `string_expr` (VARCHAR or BINARY).

If any of the inputs are NULL, NULL is returned.

## Usage notes

If `length_expr` is greater than the length of `expr`, then the function returns `expr`.

## Collation details

* Collation applies to VARCHAR inputs. Collation doesn’t apply if the input data type of the first parameter
  is BINARY.
* No impact. Although collation is accepted syntactically, collations don’t affect processing. For example,
  two-character and three-character letters in languages (for example, “dzs” in Hungarian or “ch” in Czech)
  are still counted as two or three characters (not one character) for the length argument.
* The collation of the result is the same as the collation of the input. This can be useful if the returned value is passed to another function as part of nested function calls.

## Examples

The following examples use the LEFT function.

### Basic example

```sqlexample
SELECT LEFT('ABCDEF', 3);
```

```output
+-------------------+
| LEFT('ABCDEF', 3) |
|-------------------|
| ABC               |
+-------------------+
```

### Returning substrings for email, phone, and date strings

The following examples return substrings for customer information in a table.

Create the table and insert data:

```sqlexample
CREATE OR REPLACE TABLE customer_contact_example (
    cust_id INT,
    cust_email VARCHAR,
    cust_phone VARCHAR,
    activation_date VARCHAR)
  AS SELECT
    column1,
    column2,
    column3,
    column4
  FROM
    VALUES
      (1, 'some_text@example.com', '800-555-0100', '20210320'),
      (2, 'some_other_text@example.org', '800-555-0101', '20240509'),
      (3, 'some_different_text@example.net', '800-555-0102', '20191017');

SELECT * from customer_contact_example;
```

```output
+---------+---------------------------------+--------------+-----------------+
| CUST_ID | CUST_EMAIL                      | CUST_PHONE   | ACTIVATION_DATE |
|---------+---------------------------------+--------------+-----------------|
|       1 | some_text@example.com           | 800-555-0100 | 20210320        |
|       2 | some_other_text@example.org     | 800-555-0101 | 20240509        |
|       3 | some_different_text@example.net | 800-555-0102 | 20191017        |
+---------+---------------------------------+--------------+-----------------+
```

Use the [POSITION](position.md) function with the LEFT function to extract the username from email addresses.
This example finds the position of `@` in each string and subtracts one to return the username:

```sqlexample
SELECT cust_id,
       cust_email,
       LEFT(cust_email, POSITION('@' IN cust_email) - 1) AS username
  FROM customer_contact_example;
```

```output
+---------+---------------------------------+---------------------+
| CUST_ID | CUST_EMAIL                      | USERNAME            |
|---------+---------------------------------+---------------------|
|       1 | some_text@example.com           | some_text           |
|       2 | some_other_text@example.org     | some_other_text     |
|       3 | some_different_text@example.net | some_different_text |
+---------+---------------------------------+---------------------+
```

> **Tip:**
>
> You can use the POSITION function to find the position of other characters, such as an empty
> character (`' '`) or an underscore (`_`).

In the `cust_phone` column in the table, the area code is always the first three characters. Extract
the area code from phone numbers:

```sqlexample
SELECT cust_id,
       cust_phone,
       LEFT(cust_phone, 3) AS area_code
  FROM customer_contact_example;
```

```output
+---------+--------------+-----------+
| CUST_ID | CUST_PHONE   | AREA_CODE |
|---------+--------------+-----------|
|       1 | 800-555-0100 | 800       |
|       2 | 800-555-0101 | 800       |
|       3 | 800-555-0102 | 800       |
+---------+--------------+-----------+
```

In the `activation_date` column in the table, the date is always in the format `YYYYMMDD`. Extract the year
from these strings:

```sqlexample
SELECT cust_id,
       activation_date,
       LEFT(activation_date, 4) AS year
  FROM customer_contact_example;
```

```output
+---------+-----------------+------+
| CUST_ID | ACTIVATION_DATE | YEAR |
|---------+-----------------+------|
|       1 | 20210320        | 2021 |
|       2 | 20240509        | 2024 |
|       3 | 20191017        | 2019 |
+---------+-----------------+------+
```

---
title: LENGTH, LEN
source: https://docs.snowflake.com/en/sql-reference/functions/length.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# LENGTH, LEN

Returns the length of an input [string or binary](../data-types-text.md) value. For strings,
the length is the number of characters, and UTF-8 characters are counted as a single character. For binary,
the length is the number of bytes.

## Syntax

```sqlsyntax
LENGTH( <expression> )

LEN( <expression> )
```

## Arguments

`expression`
:   The input expression must be a string or binary value.

## Returns

The returned data type is INTEGER (more precisely, NUMBER(18, 0)).

## Collation details

* No impact.
  In languages in which one character is one letter and vice versa, the LENGTH function behaves the same with and without
  collation.
* In languages where the alphabet contains digraphs or trigraphs (such as “Dz” and “Dzs” in Hungarian), each character in each digraph and trigraph is treated as an independent character, not as part of a single multi-character letter.
  For example, although Hungarian treats “dz” as a single letter, Snowflake returns `2` for `LENGTH(COLLATE('dz', 'hu'))`.

## Examples

Create a table and insert VARCHAR values:

```sqlexample
CREATE OR REPLACE TABLE length_function_demo (s VARCHAR);

INSERT INTO length_function_demo VALUES
  (''),
  ('Joyeux Noël'),
  ('Merry Christmas'),
  ('Veselé Vianoce'),
  ('Wesołych Świąt'),
  ('圣诞节快乐'),
  (NULL);
```

Query the table using the LENGTH function:

```sqlexample
SELECT s, LENGTH(s) FROM length_function_demo;
```

```output
+-----------------+-----------+
| S               | LENGTH(S) |
|-----------------+-----------|
|                 |         0 |
| Joyeux Noël     |        11 |
| Merry Christmas |        15 |
| Veselé Vianoce  |        14 |
| Wesołych Świąt  |        14 |
| 圣诞节快乐        |         5 |
| NULL            |      NULL |
+-----------------+-----------+
```

For the next example, create a table and insert BINARY data:

```sqlexample
CREATE OR REPLACE TABLE binary_demo_table (
  v VARCHAR,
  b_hex BINARY,
  b_base64 BINARY,
  b_utf8 BINARY);

INSERT INTO binary_demo_table (v) VALUES ('hello');

UPDATE binary_demo_table SET
  b_hex    = TO_BINARY(HEX_ENCODE(v), 'HEX'),
  b_base64 = TO_BINARY(BASE64_ENCODE(v), 'BASE64'),
  b_utf8   = TO_BINARY(v, 'UTF-8');

SELECT * FROM binary_demo_table;
```

```output
+-------+------------+------------+------------+
| V     | B_HEX      | B_BASE64   | B_UTF8     |
|-------+------------+------------+------------|
| hello | 68656C6C6F | 68656C6C6F | 68656C6C6F |
+-------+------------+------------+------------+
```

Query the table using the LENGTH function:

```sqlexample
SELECT v, LENGTH(v),
       TO_VARCHAR(b_hex, 'HEX') AS b_hex, LENGTH(b_hex),
       TO_VARCHAR(b_base64, 'BASE64') AS b_base64, LENGTH(b_base64),
       TO_VARCHAR(b_utf8, 'UTF-8') AS b_utf8, LENGTH(b_utf8)
  FROM binary_demo_table;
```

```output
+-------+-----------+------------+---------------+----------+------------------+--------+----------------+
| V     | LENGTH(V) | B_HEX      | LENGTH(B_HEX) | B_BASE64 | LENGTH(B_BASE64) | B_UTF8 | LENGTH(B_UTF8) |
|-------+-----------+------------+---------------+----------+------------------+--------+----------------|
| hello |         5 | 68656C6C6F |             5 | aGVsbG8= |                5 | hello  |              5 |
+-------+-----------+------------+---------------+----------+------------------+--------+----------------+
```

---
title: LIKE ALL
source: https://docs.snowflake.com/en/sql-reference/functions/like_all.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# LIKE ALL

Performs a case-sensitive comparison to match a string against all of one or more specified patterns.
Use this function in a WHERE clause to filter for matches.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [[ NOT ] LIKE](like.md)

## Syntax

```sqlsyntax
<subject> LIKE ALL (<pattern1> [, <pattern2> ... ] ) [ ESCAPE <escape_char> ]
```

## Arguments

**Required:**

`subject`
:   The string to compare to the pattern(s).

`pattern#`
:   The pattern(s) that the string is to be compared to. You must specify at least one pattern.

**Optional:**

`escape_char`
:   Character(s) inserted in front of a wildcard character to indicate that the wildcard should
    be interpreted as a regular character rather than as a wildcard.

## Returns

Returns a BOOLEAN value or NULL:

* Returns TRUE if there is a match.
* Returns FALSE if there isn’t a match.
* Returns NULL if any argument is NULL.

## Usage notes

* To include single quotes or other special characters in pattern matching, you can use a
  [backslash escape sequence](../data-types-text.md).
* NULL does not match NULL. In other words, if the subject is NULL and one of the patterns is NULL,
  that is not considered a match.
* You can use the [NOT](../operators-logical.md) logical operator before the `subject`
  to perform a case-sensitive comparison that returns TRUE if it does not match any of the specified patterns.
* SQL wildcards are supported in `pattern`:

  + An underscore (`_`) matches any single character.
  + A percent sign (`%`) matches any sequence of zero or more characters.
* Wildcards in `pattern` include newline characters (`n`) in `subject` as matches.
* The pattern is considered a match if the pattern matches the entire input string (subject). To match a sequence
  anywhere within a string, start and end the pattern with `%` (e.g. `%something%`).

* If the function is used with a subquery, the subquery should return a single row.

  For example, the following should be used only if the subquery returns
  a single row:

  ```sqlexample
  SELECT ...
    WHERE x LIKE ALL (SELECT ...)
  ```

* If you require more complex pattern matching than this function supports, you can use a
  [regular expression function](../functions-regexp.md) instead.

## Collation details

Only the `upper`, `lower`, and `trim` collation specifications are supported. Combinations with `upper`,
`lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale
combinations (for example, `en-upper`).

## Examples

Create a table that contains some strings:

```sqlexample
CREATE OR REPLACE TABLE like_all_example(name VARCHAR(20));
INSERT INTO like_all_example VALUES
    ('John  Dddoe'),
    ('Joe   Doe'),
    ('John_do%wn'),
    ('Joe down'),
    ('Tom   Doe'),
    ('Tim down'),
    (null);
```

This query shows how to use patterns with wildcards (`%`) to find matches:

```sqlexample
SELECT *
  FROM like_all_example
  WHERE name LIKE ALL ('%Jo%oe%','J%e')
  ORDER BY name;
```

```output
+-------------+
| NAME        |
|-------------|
| Joe   Doe   |
| John  Dddoe |
+-------------+
```

This query shows that all patterns need to match for a successful result:

```sqlexample
SELECT *
  FROM like_all_example
  WHERE name LIKE ALL ('%Jo%oe%','J%n')
  ORDER BY name;
```

```output
+------+
| NAME |
|------|
+------+
```

This query shows how to use an escape character to indicate that characters that are usually wild cards (`_` and `%`)
should be treated as literals.

```sqlexample
SELECT *
  FROM like_all_example
  WHERE name LIKE ALL ('%J%h%^_do%', 'J%^%wn') ESCAPE '^'
  ORDER BY name;
```

```output
+------------+
| NAME       |
|------------|
| John_do%wn |
+------------+
```

---
title: LIKE ANY
source: https://docs.snowflake.com/en/sql-reference/functions/like_any.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# LIKE ANY

Performs a case-sensitive comparison to match a string against any of one or more specified patterns.
Use this function in a WHERE clause to filter for matches. For case-insensitive matching, use ILIKE ANY
instead.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [[ NOT ] LIKE](like.md) , [ILIKE ANY](ilike_any.md)

## Syntax

```sqlsyntax
<subject> LIKE ANY (<pattern1> [, <pattern2> ... ] ) [ ESCAPE <escape_char> ]
```

## Arguments

**Required:**

`subject`
:   The string to compare to the pattern(s).

`pattern#`
:   The pattern(s) that the string is to be compared to. You must specify at least one pattern.

**Optional:**

`escape_char`
:   Character(s) inserted in front of a wildcard character to indicate that the wildcard should
    be interpreted as a regular character rather than as a wildcard.

## Returns

Returns a BOOLEAN value or NULL:

* Returns TRUE if there is a match.
* Returns FALSE if there isn’t a match.
* Returns NULL if any argument is NULL.

## Usage notes

* To include single quotes or other special characters in pattern matching, you can use a
  [backslash escape sequence](../data-types-text.md).
* NULL does not match NULL. In other words, if the subject is NULL and one of the patterns is NULL,
  that is not considered a match.
* You can use the [NOT](../operators-logical.md) logical operator before the `subject`
  to perform a case-sensitive comparison that returns TRUE if it does not match any of the specified patterns.
* SQL wildcards are supported in `pattern`:

  + An underscore (`_`) matches any single character.
  + A percent sign (`%`) matches any sequence of zero or more characters.
* Wildcards in `pattern` include newline characters (`n`) in `subject` as matches.
* The pattern is considered a match if the pattern matches the entire input string (subject). To match a sequence
  anywhere within a string, start and end the pattern with `%` (e.g. `%something%`).

* If the function is used with a subquery, the subquery should return a single row.

  For example, the following should be used only if the subquery returns a single row:

  ```sqlexample
  SELECT ...
    WHERE x LIKE ANY (SELECT ...)
  ```

* If you require more complex pattern matching than this function supports, you can use a
  [regular expression function](../functions-regexp.md) instead.

## Collation details

Only the `upper`, `lower`, and `trim` collation specifications are supported. Combinations with `upper`,
`lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale
combinations (for example, `en-upper`).

## Examples

Create a table that contains some strings:

```sqlexample
CREATE OR REPLACE TABLE like_example(name VARCHAR(20));
INSERT INTO like_example VALUES
    ('John  Dddoe'),
    ('Joe   Doe'),
    ('John_down'),
    ('Joe down'),
    ('Tom   Doe'),
    ('Tim down'),
    (null);
```

This query shows how to use patterns with wildcards (`%`) to find matches:

```sqlexample
SELECT *
  FROM like_example
  WHERE name LIKE ANY ('%Jo%oe%','T%e')
  ORDER BY name;
```

```output
+-------------+
| NAME        |
|-------------|
| Joe   Doe   |
| John  Dddoe |
| Tom   Doe   |
+-------------+
```

This query shows how to use an escape character to indicate that a character that is usually a wild card (`_`) should be
treated as a literal.

```sqlexample
SELECT *
  FROM like_example
  WHERE name LIKE ANY ('%J%h%^_do%', 'T%^%e') ESCAPE '^'
  ORDER BY name;
```

```output
+-----------+
| NAME      |
|-----------|
| John_down |
+-----------+
```

---
title: LISTAGG
source: https://docs.snowflake.com/en/sql-reference/functions/listagg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# LISTAGG

Returns the concatenated input values, separated by the `delimiter` string.

## Syntax

**Aggregate function**

```sqlsyntax
LISTAGG( [ DISTINCT ] <expr1> [, <delimiter> ] )
    [ WITHIN GROUP ( <orderby_clause> ) ]
```

**Window function**

```sqlsyntax
LISTAGG( [ DISTINCT ] <expr1> [, <delimiter> ] )
    [ WITHIN GROUP ( <orderby_clause> ) ]
    OVER ( [ PARTITION BY <expr2> ] )
```

## Required arguments

`expr1`
:   An expression (typically a column name) that determines the values to be put into the list.
    The expression must evaluate to a string, or to a data type that can be
    [cast](../data-type-conversion.md) to string.

`OVER()`
:   The OVER clause is required when the function is being used as a window function.
    For details, see [Window function syntax and usage](../functions-window-syntax.md).

## Optional arguments

`DISTINCT`
:   Removes duplicate values from the list.

`delimiter`
:   A string, or an expression that evaluates to a string. Typically, this value is
    a single-character string. The string should be surrounded by single
    quotes, as shown in the examples below.

    If no `delimiter` is specified, an empty string is used as
    the `delimiter`.

    The `delimiter` must be a constant.

`WITHIN GROUP orderby_clause`
:   One or more expressions (typically column names) that determine the order of the values for
    each group in the list.

    The WITHIN GROUP (ORDER BY) syntax supports the same parameters as the
    [ORDER BY](../constructs/order-by.md) clause in a SELECT statement.

`PARTITION BY expr2`
:   Window function sub-clause that specifies an expression (typically a column name).
    This expression defines partitions that group the input rows before the function is applied.
    For details, see [Window function syntax and usage](../functions-window-syntax.md).

## Returns

Returns a string that includes all of the non-NULL input values, separated by the `delimiter`.

This function does not return a list or an array. It returns a single string that contains all
of the non-NULL input values.

## Usage notes

* If you do not specify WITHIN GROUP (ORDER BY), the order of elements within each list is unpredictable.
  (An ORDER BY clause outside the WITHIN GROUP clause applies to the order of the output rows, not to the order
  of the list elements within a row.)
* If you specify a number for an expression in WITHIN GROUP (ORDER BY), this number is parsed as a numeric
  constant, not as the ordinal position of a column in the SELECT list. Therefore, do not specify numbers
  as WITHIN GROUP (ORDER BY) expressions.
* If you specify DISTINCT and WITHIN GROUP, both must refer to the same column. For example:

  ```sqlexample
  SELECT LISTAGG(DISTINCT O_ORDERKEY) WITHIN GROUP (ORDER BY O_ORDERKEY) ...;
  ```

  If you specify different columns for DISTINCT and WITHIN GROUP, an error occurs:

  ```sqlexample
  SELECT LISTAGG(DISTINCT O_ORDERKEY) WITHIN GROUP (ORDER BY O_ORDERSTATUS) ...;
  ```

  ```output
  SQL compilation error: [ORDERS.O_ORDERSTATUS] is not a valid order by expression
  ```

  You must either specify the same column for DISTINCT and WITHIN GROUP or omit DISTINCT.
* Regarding NULL or empty input values:

  + If the input is empty, an empty string is returned.
  + If all input expressions evaluate to NULL, the output is an empty string.
  + If some but not all input expressions evaluate to NULL, the output contains
    all non-NULL values and excludes the NULL values.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Collation details

* The collation of the result is the same as the collation of the input.
* Elements inside the list are ordered according to collations, if the ORDER BY sub-clause specifies an expression
  with collation.
* The `delimiter` cannot use a collation specification.
* Specifying collation inside ORDER BY does not impact the collation of the result. For example, the statement below
  contains two ORDER BY clauses, one for LISTAGG and one for the query results. Specifying collation inside
  the first one does not affect the collation of the second one. If you need to collate the output in both ORDER BY
  clauses, you must specify collation explicitly in both clauses.

  ```sqlexample
  SELECT LISTAGG(x, ', ') WITHIN GROUP (ORDER BY last_name COLLATE 'es')
    FROM table1
    ORDER BY last_name;
  ```

## Examples

These examples use the LISTAGG function.

### Using the LISTAGG function to concatenate values in query results

The following examples use the LISTAGG function to concatenate values in the results of
queries on orders data.

> **Note:**
>
> These examples query the [TPC-H sample data](../../user-guide/sample-data-tpch.md). Before
> running the queries, execute the following SQL statement:
>
> ```sqlexample
> USE SCHEMA snowflake_sample_data.tpch_sf1;
> ```

This example lists the distinct `o_orderkey` values for orders with a `o_totalprice` greater than
`520000` and uses and empty string for the `delimiter`:

```sqlexample
SELECT LISTAGG(DISTINCT o_orderkey, ' ')
  FROM orders
  WHERE o_totalprice > 520000;
```

```output
+-------------------------------------------------+
| LISTAGG(DISTINCT O_ORDERKEY, ' ')               |
|-------------------------------------------------|
| 2232932 1750466 3043270 4576548 4722021 3586919 |
+-------------------------------------------------+
```

This example lists the distinct `o_orderstatus` values for orders with a `o_totalprice` greater than
`520000` and uses a vertical bar for the `delimiter`:

```sqlexample
SELECT LISTAGG(DISTINCT o_orderstatus, '|')
  FROM orders
  WHERE o_totalprice > 520000;
```

```output
+--------------------------------------+
| LISTAGG(DISTINCT O_ORDERSTATUS, '|') |
|--------------------------------------|
| O|F                                  |
+--------------------------------------+
```

This example lists the `o_orderstatus` and `o_clerk` values of each order with a `o_totalprice` greater than
`520000` grouped by `o_orderstatus`. The query uses a comma for the `delimiter`:

```sqlexample
SELECT o_orderstatus,
   LISTAGG(o_clerk, ', ')
     WITHIN GROUP (ORDER BY o_totalprice DESC)
  FROM orders
  WHERE o_totalprice > 520000
  GROUP BY o_orderstatus;
```

```output
+---------------+---------------------------------------------------+
| O_ORDERSTATUS | LISTAGG(O_CLERK, ', ')                            |
|               |      WITHIN GROUP (ORDER BY O_TOTALPRICE DESC)    |
|---------------+---------------------------------------------------|
| O             | Clerk#000000699, Clerk#000000336, Clerk#000000245 |
| F             | Clerk#000000040, Clerk#000000230, Clerk#000000924 |
+---------------+---------------------------------------------------+
```

### Using collation with the LISTAGG function

The following examples show [collation](../collation.md) with the LISTAGG function.
The examples use the following data:

```sqlexample
CREATE OR REPLACE TABLE collation_demo (
  spanish_phrase VARCHAR COLLATE 'es');
```

```sqlexample
INSERT INTO collation_demo (spanish_phrase) VALUES
  ('piña colada'),
  ('Pinatubo (Mount)'),
  ('pint'),
  ('Pinta');
```

Note the difference in output order with the different
collation specifications. This query uses the `es` collation specification:

```sqlexample
SELECT LISTAGG(spanish_phrase, '|')
    WITHIN GROUP (ORDER BY COLLATE(spanish_phrase, 'es')) AS es_collation
  FROM collation_demo;
```

```output
+-----------------------------------------+
| ES_COLLATION                            |
|-----------------------------------------|
| Pinatubo (Mount)|pint|Pinta|piña colada |
+-----------------------------------------+
```

This query uses the `utf8` collation specification:

```sqlexample
SELECT LISTAGG(spanish_phrase, '|')
    WITHIN GROUP (ORDER BY COLLATE(spanish_phrase, 'utf8')) AS utf8_collation
  FROM collation_demo;
```

```output
+-----------------------------------------+
| UTF8_COLLATION                          |
|-----------------------------------------|
| Pinatubo (Mount)|Pinta|pint|piña colada |
+-----------------------------------------+
```

---
title: LISTING_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/listing_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# LISTING_REFRESH_HISTORY

Returns the past 14 days of refresh history for a cross-cloud auto-fulfillment listing.
The information returned contains replication details for refresh events where the listing is
synchronized to a specified target region.

This function is available to providers of listings who have any privilege on the specified listing.

## Syntax

```sqlsyntax
LISTING_REFRESH_HISTORY(
  LISTING_NAME => '<listing_name>'
  [ , SNOWFLAKE_REGION => '<snowflake_region>' ]
  [ , REGION_GROUP => '<region_group>' ] )
```

## Arguments

**Required**

`LISTING_NAME => 'listing_name'`
:   SQL identifier of a cross-cloud auto-fulfillment listing in this account. The SQL identifier for
    listings can be found in the name column returned by show listings in data exchange <exchange_name>.
    Similarly, the SQL identifier for data exchanges can be found in the name column returned by
    `show data exchanges`.

**Optional**

`SNOWFLAKE_REGION => 'snowflake_region'`
:   The Snowflake region group to which the listing is replicated, where you can view the refresh history for that replication. This follows
    the same formatting as the column `snowflake_region` returned by [SHOW REGIONS](../sql/show-regions.md). If no region is specified, the
    history for all target regions is displayed.

`REGION_GROUP =>  'region_group'`
:   The Snowflake region group to which the listing is replicated, for which you can view the refresh history.

    `PUBLIC` by default. This argument only needs to be specified if the target region being monitored
    is in a US government or Virtual Private Snowflake region.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| LISTING_NAME | TEXT | Name of the cross-cloud auto-fulfillment listing in this account. |
| SNOWFLAKE_REGION | TEXT | Name of the Snowflake region the listing is replicated to. For example, `aws_us_east_1`. |
| REGION_GROUP | TEXT | Name of the Snowflake region group the listing is replicated to. For example, PUBLIC. |
| PHASE | TEXT | Current phase in the replication operation, represented as one phase out of a total of X phases. For example, 2/6. |
| PHASE_NAME | TEXT | Name of the replication phases completed (or in progress) so far.  For the list of phases, see usage notes. |
| PROGRESS | TEXT | The current replication progress as a percentage. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication phase began. |
| END_TIME | TIMESTAMP_LTZ | Time when the phase finished, if applicable.  NULL if the phase is in progress or is the terminating phase (`COMPLETED/FAILED/CANCELED`). |
| JOB_UUID | TEXT | Query ID for the refresh job. |
| TOTAL_BYTES | VARIANT | A JSON object that provides detailed information about refreshed databases:   * `totalBytesToReplicate`: Total number of bytes expected to be replicated. * `bytesUploaded`: Actual number of bytes uploaded. * `bytesDownloaded`: Actual number of bytes downloaded. * `bytesSkipped`: Number of bytes skipped during a refresh when Egress Cost Optimizer is enabled. * `databases`: List of JSON objects containing the following fields for each member database:    + `name`: Name of the database.   + `totalBytesToReplicate`: Total bytes expected to be replicated for the database. |
| OBJECT_COUNT | VARIANT | A JSON object that provides detailed information about refreshed objects:   * `totalObjects`: Total number of objects in the replication or failover group. * `completedObjects`: Total number of objects completed. * `objectTypes`: List of JSON objects containing the following fields for each type:    + `objectType`: Type of object (for example users, roles, grants, warehouses, schemas, tables, columns, etc).   + `totalObjects`: Total number of objects of this type in the replication or failover group.   + `completedObjects`: Total number of objects of this type that were completed. |
| PRIMARY_SNAPSHOT_TIMESTAMP | TIMESTAMP_LTZ | Timestamp when the primary snapshot was created. |
| ERROR | VARIANT | NULL if the refresh operation is successful. If the refresh operation fails, returns a JSON object that provides detailed information about the error:   * `errorCode`: Error code of the failure. * `errorMessage`: Error message of the failure. |

## Usage notes

* Only returns rows for a role with any privilege on the listing.
* Only returns rows for a listing in the current account.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be
  fully-qualified.

  For more information, see [Information Schema](../info-schema.md).

* Phase list in the order processed:

  1. SECONDARY_SYNCHRONIZING_MEMBERSHIP
  2. SECONDARY_UPLOADING_INVENTORY
  3. PRIMARY_UPLOADING_METADATA
  4. PRIMARY_UPLOADING_DATA
  5. SECONDARY_DOWNLOADING_METADATA
  6. SECONDARY_DOWNLOADING_DATA
  7. COMPLETED / FAILED / CANCELED

* The output will also include the history of other listings that reference the same database, as they are refreshed together. If the input
  is an application listing, it contains the history of all application listings in the given region.
* In the PRIMARY_UPLOADING_DATA and SECONDARY_DOWNLOADING_DATA phases, the `totalBytesToReplicate` value is estimated prior to the
  replication operation. This value may differ from the `totalBytesToUpload` or `totalBytesToDownload` value in the respective
  phase.

  For example, if during the PRIMARY_UPLOADING_DATA phase, a previous replication operation uploaded some bytes but was canceled before the
  operation completed, those bytes would not be uploaded again. In that case, `totalBytesToUpload` would be lower
  than `totalBytesToReplicate`.

## Examples

Retrieve the history for the listing `my_listing` refreshing to AWS US East-1, a public cloud region.

> ```sqlexample
> select * from table(information_schema.listing_refresh_history(listing_name=>'my_listing',snowflake_region=>'AWS_US_EAST_1'))
> ```

---
title: LN
source: https://docs.snowflake.com/en/sql-reference/functions/ln.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Logarithmic)

# LN

Returns the natural logarithm of a numeric expression.

## Syntax

```sqlsyntax
LN(<expr>)
```

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

```sqlexample
SELECT x, ln(x) FROM tab;

--------+-------------+
   X    |    LN(X)    |
--------+-------------+
 1      | 0           |
 10     | 2.302585093 |
 100    | 4.605170186 |
 [NULL] | [NULL]      |
--------+-------------+
```

---
title: LOCALTIME
source: https://docs.snowflake.com/en/sql-reference/functions/localtime.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# LOCALTIME

Returns the current time for the system.

ANSI-compliant alias for [CURRENT_TIME](current_time.md).

## Syntax

```sqlsyntax
LOCALTIME()

LOCALTIME
```

## Arguments

None.

## Returns

Returns a value of type [TIME](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned time is
  in the time zone for the session.
* The display format for times in the output is determined by the [TIME_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `HH24:MI:SS`).
* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := <function_name>();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual
  warehouse) because the queries might be serviced by different compute resources (in the warehouse).

## Examples

Show the current local time and local timestamp:

```sqlexample
SELECT LOCALTIME(), LOCALTIMESTAMP();
```

```output
+-------------+-------------------------------+
| LOCALTIME() | LOCALTIMESTAMP()              |
|-------------+-------------------------------|
| 15:32:45    | 2024-04-17 15:32:45.775 -0700 |
+-------------+-------------------------------+
```

---
title: LOCALTIMESTAMP
source: https://docs.snowflake.com/en/sql-reference/functions/localtimestamp.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# LOCALTIMESTAMP

Returns the current timestamp for the system in the local time zone.

ANSI-compliant alias for [CURRENT_TIMESTAMP](current_timestamp.md).

## Syntax

```sqlsyntax
LOCALTIMESTAMP( [ <fract_sec_precision> ] )

LOCALTIMESTAMP
```

## Arguments

`fract_sec_precision`
:   This optional argument indicates the precision with which to report the
    time. For example, a value of 3 says to use 3 digits after the decimal
    point (that is, to specify the time with a precision of milliseconds).

    The default precision is 9 (nanoseconds).

    Valid values range from 0 - 9. However, most platforms do not support true
    nanosecond precision; the precision that you get might be less than the
    precision you specify. In practice, precision is usually approximately
    milliseconds (3 digits) at most.

    > **Note:**
    >
    > Fractional seconds are only displayed if they have been explicitly set in the [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) parameter for the session (e.g. `'YYYY-MM-DD HH24:MI:SS.FF'`).

## Returns

Returns the current system time. The data type of the returned value is
[TIMESTAMP_LTZ](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned timestamp is in the time zone for the session.
* The setting of the [TIMESTAMP_TYPE_MAPPING](../parameters.md) parameter does not affect the return value.
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual warehouse) because the queries might be serviced by different compute resources (in the warehouse).

* To comply with the ANSI standard, this function can be called without parentheses in SQL statements.

  However, if you are setting a [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md)
  to an expression that calls the function (for example, `my_var := LOCALTIMESTAMP();`), you must include the
  parentheses. For more information, see [the usage notes for context functions](../functions-context.md).

## Examples

Show the current local time and local timestamp:

```sqlexample
SELECT LOCALTIME(), LOCALTIMESTAMP();
```

```output
+-------------+-------------------------------+
| LOCALTIME() | LOCALTIMESTAMP()              |
|-------------+-------------------------------|
| 07:58:09    | 2024-04-18 07:58:09.848 -0700 |
+-------------+-------------------------------+
```

---
title: LOG
source: https://docs.snowflake.com/en/sql-reference/functions/log.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Logarithmic)

# LOG

Returns the logarithm of a numeric expression.

See also:
:   [natural log (ln)](ln.md)

## Syntax

```sqlsyntax
LOG(<base>, <expr>)
```

## Arguments

`base`
:   The “base” to use (e.g. 10 for base 10 arithmetic).

    This can be of any numeric data type (INTEGER, fixed-point, or floating
    point).

    `base` should be greater than 0.

    `base` should not be exactly 1.0.

`expr`
:   The value for which you want to know the log.

    This can be of any numeric data type (INTEGER, fixed-point, or floating
    point).

    `expr` should be greater than 0.

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* If `base` is 1 or less than or equal to 0, an error is returned.
* If `expr` is less than or equal to 0, an error is returned.

## Examples

```sqlexample
SELECT x, y, log(x, y) FROM tab;

--------+--------+-------------+
   X    |   Y    |  LOG(X, Y)  |
--------+--------+-------------+
 2      | 0.5    | -1          |
 2      | 1      | 0           |
 2      | 8      | 3           |
 2      | 16     | 4           |
 10     | 10     | 1           |
 10     | 20     | 1.301029996 |
 10     | [NULL] | [NULL]      |
 [NULL] | 10     | [NULL]      |
 [NULL] | [NULL] | [NULL]      |
--------+--------+-------------+
```

---
title: LOGIN_HISTORY , LOGIN_HISTORY_BY_USER
source: https://docs.snowflake.com/en/sql-reference/functions/login_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# LOGIN_HISTORY , LOGIN_HISTORY_BY_USER

The LOGIN_HISTORY family of table functions can be used to query login attempts by Snowflake users along various dimensions:

* LOGIN_HISTORY returns login events within a specified time range.
* LOGIN_HISTORY_BY_USER returns login events of a specified user within a specified time range.

Each function is optimized for querying along the specified dimension. The results can be further filtered using SQL predicates.

> **Note:**
>
> These functions return login activity within the last 7 days.

## Syntax

```sqlsyntax
LOGIN_HISTORY(
      [  TIME_RANGE_START => <constant_expr> ]
      [, TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ] )

LOGIN_HISTORY_BY_USER(
      [  USER_NAME => '<string>' ]
      [, TIME_RANGE_START => <constant_expr> ]
      [, TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ] )
```

## Arguments

All the arguments are optional.

`TIME_RANGE_START => constant_expr` , . `TIME_RANGE_END => constant_expr`
:   Time range (in TIMESTAMP_LTZ format), within the last 7 days, in which the login event occurred.

    If `TIME_RANGE_END` is not specified, the function returns the most recent login events.

    If the time range does not fall within the last 7 days, an error is returned.

`USER_NAME => 'string'`
:   Applies only to LOGIN_HISTORY_BY_USER

    A string specifying a user name or [CURRENT_USER](current_user.md). Only login events for the specified user are returned. Note that the login name must be enclosed in single quotes. Also, if the
    login name contains any spaces, mixed-case characters, or special characters, the name must be double-quoted within the single quotes (e.g. `'"User 1"'` vs `'user1'`).

    Default: [CURRENT_USER](current_user.md)

`RESULT_LIMIT => num`
:   A number specifying the maximum number of rows returned by the function.

    If the number of matching rows is greater than this limit, the login events with the most recent timestamp are returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `100`.

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Time of the event occurrence. |
| EVENT_ID | NUMBER | Event’s unique id. |
| EVENT_TYPE | VARCHAR | Event type, such as LOGIN for authentication events. |
| USER_NAME | VARCHAR | User associated with this event. |
| CLIENT_IP | VARCHAR | IP address where the request originated from. |
| REPORTED_CLIENT_TYPE | VARCHAR | Reported type of the client software, such as JDBC_DRIVER, ODBC_DRIVER, etc. This information is not authenticated. |
| REPORTED_CLIENT_VERSION | VARCHAR | Reported version of the client software. This information is not authenticated. |
| FIRST_AUTHENTICATION_FACTOR | VARCHAR | Method used to authenticate the user (the first factor in multi factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR | VARCHAR | The second factor in multi factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| IS_SUCCESS | VARCHAR | Whether the user’s request was successful or not. |
| ERROR_CODE | NUMBER | Error code, if the request was not successful. |
| ERROR_MESSAGE | VARCHAR | Error message returned to the user, if the request was not successful. |
| RELATED_EVENT_ID | NUMBER | Reserved for future use. |
| CONNECTION | VARCHAR | Name of the connection used by the client, or NULL if the client is not using a connection URL. Connection is a Snowflake object that is a part of [Client Redirect](../../user-guide/client-redirect.md). It represents a connection URL that you can use to fail over to another account for business continuity and disaster recovery. . , NOTE: If a client authenticates through an identity provider (IdP) that is configured with the account URL rather than the connection URL, the IdP directs the client to the account URL after authentication is complete. The CONNECTION column for this login event is NULL. See [Authentication and Client Redirect](../../user-guide/client-redirect.md). |
| CLIENT_PRIVATE_LINK_ID | VARCHAR | If the user logged in using [private connectivity](../../user-guide/private-connectivity-inbound.md), specifies the identifier of the endpoint from which the request originated. |
| FIRST_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](../account-usage/credentials.md) used to authenticate the user (the first factor in multi-factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](../account-usage/credentials.md) used for the second factor in multi-factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| LOGIN_DETAILS | VARCHAR | Displays details for each login event, including the malicious IP protection category name, the risk category, and the blocking status. |

For details about the error codes/messages for login attempts that were unsuccessful due to invalid SAML responses, see [Federated authentication and SSO troubleshooting](../../user-guide/errors-saml.md).

## Examples

Retrieve up to the last 100 login events of the current user:

> ```sqlexample
> select *
> from table(information_schema.login_history_by_user())
> order by event_timestamp;
> ```

Retrieve up to the last 1000 login events of the specified user:

> ```sqlexample
> select *
> from table(information_schema.login_history_by_user(USER_NAME => 'USER1', result_limit => 1000))
> order by event_timestamp;
> ```

Retrieve up to 100 login events of every user your current role is allowed to monitor in the last hour:

> ```sqlexample
> select *
> from table(information_schema.login_history(TIME_RANGE_START => dateadd('hours',-1,current_timestamp()),current_timestamp()))
> order by event_timestamp;
> ```

---
title: LOWER
source: https://docs.snowflake.com/en/sql-reference/functions/lower.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Case Conversion)

# LOWER

Returns the input string with all characters converted to lowercase.

## Syntax

```sqlsyntax
LOWER( <expr> )
```

## Arguments

`expr`
:   The string expression.

## Returns

This function returns a value of type VARCHAR.

## Examples

Convert strings in several different languages and character sets to lowercase:

```sqlexample
SELECT v, LOWER(v) FROM lu;
```

```output
+----------------------------------+----------------------------------+
|                v                 |             lower(v)             |
+----------------------------------+----------------------------------+
|                                  |                                  |
| The Quick Gray Fox               | the quick gray fox               |
| LAUGHING ALL THE WAY             | laughing all the way             |
| OVER the River 2 Times           | over the river 2 times           |
| UuVvWwXxYyZz                     | uuvvwwxxyyzz                     |
| ÁáÄäÉéÍíÓóÔôÚúÝý                 | ááääééííóóôôúúýý                 |
| ÄäÖößÜü                          | ääöößüü                          |
| ÉéÀàÈèÙùÂâÊêÎîÔôÛûËëÏïÜüŸÿÇçŒœÆæ | ééààèèùùââêêîîôôûûëëïïüüÿÿççœœææ |
| ĄąĆćĘęŁłŃńÓóŚśŹźŻż               | ąąććęęłłńńóóśśźźżż               |
| ČčĎďĹĺĽľŇňŔŕŠšŤťŽž               | ččďďĺĺľľňňŕŕššťťžž               |
| АаБбВвГгДдЕеЁёЖжЗзИиЙй           | ааббввггддееёёжжззиийй           |
| КкЛлМмНнОоПпРрСсТтУуФф           | ккллммннооппррссттууфф           |
| ХхЦцЧчШшЩщЪъЫыЬьЭэЮюЯя           | ххццччшшщщъъыыььээююяя           |
| [NULL]                           | [NULL]                           |
+----------------------------------+----------------------------------+
```

LOWER supports [collation](../collation.md) specifications. This LOWER example
specifies collation with the `tr` (Turkish) locale:

```sqlexample
SELECT LOWER('I' COLLATE 'tr');
```

```output
+-------------------------+
| LOWER('I' COLLATE 'TR') |
|-------------------------|
| ı                       |
+-------------------------+
```

---
title: LPAD
source: https://docs.snowflake.com/en/sql-reference/functions/lpad.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# LPAD

Left-pads a string with characters from another string, or left-pads a binary value with bytes from another binary value.

The argument (`base`) is left-padded to length `length_expr` with characters/bytes from the `pad` argument.

See also:
:   [RPAD](rpad.md)

## Syntax

```sqlsyntax
LPAD( <base>, <length_expr> [, <pad>] )
```

## Arguments

`base`
:   A VARCHAR or BINARY value.

`length_expr`
:   An expression that evaluates to an integer. It specifies:

    * The number of UTF-8 characters to return if the input is VARCHAR.
    * The number of bytes to return if the input is BINARY.

`pad`
:   A VARCHAR or BINARY value. The type must match the data type of the `base` argument.
    Characters (or bytes) from this argument are used to pad the `base`.

## Returns

The data type of the returned value is the same as the data type of the `base` input value (VARCHAR or BINARY).

## Usage notes

* If the `base` argument is longer than `length_expr`, it is truncated to length `length_expr`.
* The `pad` argument can be multiple characters/bytes long. The `pad`
  argument is repeated in the result until the desired length (`length_expr`) is
  reached, truncating any superfluous characters/bytes in the `pad` argument.
  If the `pad` argument is empty, no padding is inserted, but the result is
  still truncated to length `length_expr`.
* When `base` is a string, the default `pad` string is `' '` (a single blank space). When
  `base` is a binary value, the `pad` argument must be provided explicitly.

## Collation details

* Collation applies to VARCHAR inputs. Collation doesn’t apply if the input data type of the first argument
  is BINARY.
* No impact.
  Although collation is accepted syntactically, collations have no impact on processing. For example, languages with
  two-character and three-character letters (for example, “dzs” in Hungarian, “ch” in Czech) still count
  those as two or three characters (not one character) for the length argument.
* The collation of the result is the same as the collation of the input. This can be useful if the returned value is passed to another function as part of nested function calls.
* Currently, Snowflake allows the `base` and `pad` arguments to have different collation specifiers.
  However, the individual collation specifiers can’t both be retained because the returned value has only one
  collation specifier. Snowflake recommends that you avoid using `pad` strings that have a different
  collation from the `base` string.

## Examples

The LPAD function can pad a string with characters on the left so that the values conform to a
specific format. The following example assumes that the `id` values in a column should be eight
characters long and padded with zeros on the left to meet this standard.

Create a table with an `id` column and insert values:

```sqlexample
CREATE OR REPLACE TABLE demo_lpad_ids (id VARCHAR);

INSERT INTO demo_lpad_ids VALUES
  ('5'),
  ('50'),
  ('500');
```

Run a query using the LPAD function so that values in the output meet the standard:

```sqlexample
SELECT id, LPAD(id, 8, '0') AS padded_ids
  FROM demo_lpad_ids;
```

```output
+-----+------------+
| ID  | PADDED_IDS |
|-----+------------|
| 5   | 00000005   |
| 50  | 00000050   |
| 500 | 00000500   |
+-----+------------+
```

The following additional examples use the LPAD function to pad VARCHAR and BINARY data on the left.

Create and fill a table:

```sqlexample
CREATE OR REPLACE TABLE padding_example (v VARCHAR, b BINARY);

INSERT INTO padding_example (v, b)
  SELECT
    'Hi',
    HEX_ENCODE('Hi');

INSERT INTO padding_example (v, b)
  SELECT
    '-123.00',
    HEX_ENCODE('-123.00');

INSERT INTO padding_example (v, b)
  SELECT
    'Twelve Dollars',
    TO_BINARY(HEX_ENCODE('Twelve Dollars'), 'HEX');
```

Query the table to show the data:

```sqlexample
SELECT * FROM padding_example;
```

```output
+----------------+------------------------------+
| V              | B                            |
|----------------+------------------------------|
| Hi             | 4869                         |
| -123.00        | 2D3132332E3030               |
| Twelve Dollars | 5477656C766520446F6C6C617273 |
+----------------+------------------------------+
```

This example demonstrates left-padding of VARCHAR values using the LPAD function, with the
results limited to 10 characters:

```sqlexample
SELECT v,
       LPAD(v, 10, ' ') AS pad_with_blank,
       LPAD(v, 10, '$') AS pad_with_dollar_sign
  FROM padding_example
  ORDER BY v;
```

```output
+----------------+----------------+----------------------+
| V              | PAD_WITH_BLANK | PAD_WITH_DOLLAR_SIGN |
|----------------+----------------+----------------------|
| -123.00        |    -123.00     | $$$-123.00           |
| Hi             |         Hi     | $$$$$$$$Hi           |
| Twelve Dollars | Twelve Dol     | Twelve Dol           |
+----------------+----------------+----------------------+
```

This example demonstrates left-padding of BINARY values using the LPAD function, with the
results limited to 10 bytes:

```sqlexample
SELECT b,
       LPAD(b, 10, TO_BINARY(HEX_ENCODE(' '))) AS pad_with_blank,
       LPAD(b, 10, TO_BINARY(HEX_ENCODE('$'))) AS pad_with_dollar_sign
  FROM padding_example
  ORDER BY b;
```

```output
+------------------------------+----------------------+----------------------+
| B                            | PAD_WITH_BLANK       | PAD_WITH_DOLLAR_SIGN |
|------------------------------+----------------------+----------------------|
| 2D3132332E3030               | 2020202D3132332E3030 | 2424242D3132332E3030 |
| 4869                         | 20202020202020204869 | 24242424242424244869 |
| 5477656C766520446F6C6C617273 | 5477656C766520446F6C | 5477656C766520446F6C |
+------------------------------+----------------------+----------------------+
```

This example shows left-padding when multiple characters are used and when
the padding isn’t an even multiple of the length of the multi-character
string used for padding:

```sqlexample
SELECT LPAD('123.50', 19, '*_');
```

```output
+--------------------------+
| LPAD('123.50', 19, '*_') |
|--------------------------|
| *_*_*_*_*_*_*123.50      |
+--------------------------+
```

The output shows that 19 characters were returned, and the last `*` character doesn’t have
an accompanying `_` character.

---
title: LTRIM
source: https://docs.snowflake.com/en/sql-reference/functions/ltrim.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# LTRIM

Removes leading characters, including whitespace, from a string.

> **Note:**
>
> To remove characters in a string, you can use the [REPLACE](replace.md) function.

See also:
:   [RTRIM](rtrim.md) , [TRIM](trim.md)

## Syntax

```sqlsyntax
LTRIM( <expr> [, <characters> ] )
```

## Arguments

`expr`
:   The string expression to be trimmed.

`characters`
:   One or more characters to remove from the left side of `expr`.

    The default value is `' '` (a single blank space character).
    If no characters are specified, only blank spaces are removed.

## Returns

This function returns a value of VARCHAR data type or NULL. If either argument is NULL, returns NULL.

## Usage notes

* You can specify the characters in `characters` in any order.
* A specification of `' '` in `characters` does not remove other whitespace
  characters (such as tabulation characters, end-of-line characters, and so on). Explicitly
  specify these characters to remove them.

* When `characters` is specified, you must explicitly specify the characters
  to remove whitespace. For example, `' $.'` removes all leading blank spaces, dollar
  signs, and periods from the input string.

## Collation details

[Collation](../collation.md) is supported when the optional second argument is omitted, or when it
contains only whitespace.

The collation specification of the returned value is the same as the collation specification of the first argument.

## Examples

Remove leading `0` and `#` characters from a string:

```sqlexample
SELECT LTRIM('#000000123', '0#');
```

```output
+---------------------------+
| LTRIM('#000000123', '0#') |
|---------------------------|
| 123                       |
+---------------------------+
```

The remaining examples use the following table data. Also, the queries enclose the strings
in `>` and `<` characters to help you visualize the whitespace.

```sqlexample
CREATE OR REPLACE TABLE test_ltrim_function(column1 VARCHAR);

INSERT INTO test_ltrim_function VALUES ('  #Leading Spaces');
```

Remove leading whitespace from a string. This example does not specify the second
`characters` argument because the default is blank spaces.

```sqlexample
SELECT CONCAT('>', CONCAT(column1, '<')) AS original_value,
       CONCAT('>', CONCAT(LTRIM(column1), '<')) AS trimmed_value
  FROM test_ltrim_function;
```

```output
+---------------------+-------------------+
| ORIGINAL_VALUE      | TRIMMED_VALUE     |
|---------------------+-------------------|
| >  #Leading Spaces< | >#Leading Spaces< |
+---------------------+-------------------+
```

Remove leading whitespace and `#` from a string. This example specifies the second
`characters` argument because it removes other characters in addition to
blank spaces.

```sqlexample
SELECT CONCAT('>', CONCAT(column1, '<')) AS original_value,
       CONCAT('>', CONCAT(LTRIM(column1, ' #'), '<')) AS trimmed_value
  FROM test_ltrim_function;
```

```output
+---------------------+------------------+
| ORIGINAL_VALUE      | TRIMMED_VALUE    |
|---------------------+------------------|
| >  #Leading Spaces< | >Leading Spaces< |
+---------------------+------------------+
```

---
title: MAP_CAT
source: https://docs.snowflake.com/en/sql-reference/functions/map_cat.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_CAT

Returns the concatenatation of two [MAP](../data-types-structured.md) values.

## Syntax

```sqlsyntax
MAP_CAT( <map1> , <map2> )
```

## Arguments

`map1`
:   The source MAP.

`map2`
:   The MAP to be appended to `map1`.

## Returns

The return type of this function is the type of `map1`. `map2` is coerced into the `map1` type following the coercion rules. For information about coercion rules, see [Implicit casting a value (coercion)](../data-types-structured.md).

## Usage notes

* If both `map1` and `map2` have a value with the same key, then the output map contains the value from `map2`.
* If either argument is NULL, the function returns NULL without reporting any error.

## Examples

Create two MAPs and concatenate them:

```sqlexample
SELECT MAP_CAT(
  {'map1key1':'map1value1','map1key2':'map1value2'}::MAP(VARCHAR,VARCHAR),
  {'map2key1':'map2value1','map2key2':'map2value2'}::MAP(VARCHAR,VARCHAR))
  AS concatenated_maps;
```

```output
+-----------------------------+
| CONCATENATED_MAPS           |
|-----------------------------|
| {                           |
|   "map1key1": "map1value1", |
|   "map1key2": "map1value2", |
|   "map2key1": "map2value1", |
|   "map2key2": "map2value2"  |
| }                           |
+-----------------------------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Concatenate the two MAP columns `attrs` and `defaults`:

```sqlexample
SELECT id, MAP_CAT(attrs, defaults) AS merged
  FROM demo_maps;
```

```output
+----+----------------------+
| ID | MERGED               |
|----+----------------------|
|  1 | {                    |
|    |   "brand": "Acme",   |
|    |   "color": "red",    |
|    |   "currency": "USD", |
|    |   "size": "L"        |
|    | }                    |
|  2 | {                    |
|    |   "brand": "ZenCo",  |
|    |   "color": "blue",   |
|    |   "currency": "EUR", |
|    |   "size": "M"        |
|    | }                    |
+----+----------------------+
```

The output contains the keys and values from both maps. The output also shows that when both
`map1` in the `attr` column and `map2` in the `defaults` column have a value
with the same key, then the output map contains the value from `map2`. That is,
size `L` is in the output for row `1` instead of size `M`.

---
title: MAP_CONTAINS_KEY
source: https://docs.snowflake.com/en/sql-reference/functions/map_contains_key.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_CONTAINS_KEY

Determines whether the specified [MAP](../data-types-structured.md) contains the specified key.

## Syntax

```sqlsyntax
MAP_CONTAINS_KEY( <key> , <map> )
```

## Arguments

`key`
:   The key to find.

`map`
:   The map to be searched.

## Returns

* Returns TRUE if the specified map contains the specified key.
* Returns FALSE if the specified map doesn’t contain the specified key.

## Usage notes

* The type of the key expression must match the type of the map’s key. If the type is VARCHAR, the types can be different lengths.
* For NULL input, the output is NULL.

## Examples

The function searches for the `k1` key and finds it in the map:

```sqlexample
SELECT MAP_CONTAINS_KEY(
  'k1',{'k1':'v1','k2':'v2','k3':'v3'}::MAP(VARCHAR,VARCHAR))
  AS contains_key;
```

```output
+--------------+
| CONTAINS_KEY |
|--------------|
| True         |
+--------------+
```

The function searches for the `k1` key and doesn’t find it in the map:

```sqlexample
SELECT MAP_CONTAINS_KEY(
  'k1',{'ka':'va','kb':'vb','kc':'vc'}::MAP(VARCHAR,VARCHAR))
  AS contains_key;
```

```output
+--------------+
| CONTAINS_KEY |
|--------------|
| False        |
+--------------+
```

A SELECT statement passes in a key that uses a different type than the key in the map:

```sqlexample
SELECT MAP_CONTAINS_KEY(
  'k1',{'1':'va','2':'vb','3':'vc'}::MAP(NUMBER,VARCHAR))
  AS contains_key;
```

```output
001065 (22023): SQL compilation error:
Function MAP_CONTAINS_KEY cannot be used with arguments of types VARCHAR(2) and MAP(NUMBER(38,0), VARCHAR(134217728))
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Determine whether the map in the `attrs` column contains the key in the `ins_key` column:

```sqlexample
SELECT id, MAP_CONTAINS_KEY(ins_key, attrs) AS has_key
  FROM demo_maps;
```

```output
+----+---------+
| ID | HAS_KEY |
|----+---------|
|  1 | False   |
|  2 | True    |
+----+---------+
```

The output shows the following:

* The map in the `attrs` column in row `1` doesn’t contain the key (`material`)
  in the `ins_key` column.
* The map in the `attrs` column in row `2` contains the key (`brand`)
  in the `ins_key` column.

---
title: MAP_DELETE
source: https://docs.snowflake.com/en/sql-reference/functions/map_delete.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_DELETE

Returns a [MAP](../data-types-structured.md) based on an existing MAP with one or more keys removed.

## Syntax

```sqlsyntax
MAP_DELETE( <map>, <key1> [, <key2>, ... ] )
```

## Arguments

`map`
:   The map that contains the key to remove.

`keyN`
:   The key to be omitted from the returned map.

## Returns

Returns a MAP that contains the contents of the input (source) map with one or more keys removed.

## Usage notes

* The type of the key expression must match the type of the map’s key. If the type is VARCHAR,
  then the types can be different lengths.
* Key values that aren’t found in the map are ignored.

## Examples

Remove two key-value pairs from a map containing three key-value pairs:

```sqlexample
SELECT MAP_DELETE({'a':1,'b':2,'c':3}::MAP(VARCHAR,NUMBER),'a','b');
```

```output
+--------------------------------------------------------------+
| MAP_DELETE({'A':1,'B':2,'C':3}::MAP(VARCHAR,NUMBER),'A','B') |
|--------------------------------------------------------------|
| {                                                            |
|   "c": 3                                                     |
| }                                                            |
+--------------------------------------------------------------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Remove the keys in the `del_key1` and `del_key2` columns from the MAP values in the `attrs` column:

```sqlexample
SELECT id, MAP_DELETE(attrs, del_key1, del_key2) AS attrs_after_delete
  FROM demo_maps;
```

```output
+----+---------------------+
| ID | ATTRS_AFTER_DELETE  |
|----+---------------------|
|  1 | {                   |
|    |   "color": "red"    |
|    | }                   |
|  2 | {                   |
|    |   "brand": "ZenCo", |
|    |   "color": "blue"   |
|    | }                   |
+----+---------------------+
```

---
title: MAP_ENTRIES
source: https://docs.snowflake.com/en/sql-reference/functions/map_entries.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_ENTRIES

Returns an ARRAY value of key-value pair objects for each entry in a [MAP](../data-types-structured.md) value.

## Syntax

```sqlsyntax
MAP_ENTRIES( <map> )
```

## Arguments

`map`
:   The input MAP value.

## Returns

Returns an ARRAY value where each element is an [OBJECT](../data-types-semistructured.md) with a `key` field and a
`value` field corresponding to an entry in the input MAP value.

If `map` is NULL, the function returns NULL.

If `map` is empty, the function returns an empty ARRAY value.

The order of the entries in the returned ARRAY value is undefined.

## Usage notes

* The function accepts exactly one argument. Calling the function with no arguments or more than one argument
  results in an error.

## Examples

Return the entries in a MAP value as key-value pair objects:

```sqlexample
SELECT MAP_ENTRIES({'a': 1, 'b': 2}::MAP(VARCHAR, INT)) AS entries;
```

```output
+-----------------+
| ENTRIES         |
|-----------------|
| [               |
|   {             |
|     "key": "a", |
|     "value": 1  |
|   },            |
|   {             |
|     "key": "b", |
|     "value": 2  |
|   }             |
| ]               |
+-----------------+
```

Return an empty ARRAY value for an empty MAP value:

```sqlexample
SELECT MAP_ENTRIES({}::MAP(VARCHAR, INT)) AS entries;
```

```output
+---------+
| ENTRIES |
|---------|
| []      |
+---------+
```

Return the entries in a MAP where the values are of type ARRAY:

```sqlexample
SELECT MAP_ENTRIES({'a': [1, 2, 3], 'b': [4, 5]}::MAP(VARCHAR, ARRAY(INT))) AS entries;
```

```output
+-----------------+
| ENTRIES         |
|-----------------|
| [               |
|   {             |
|     "key": "a", |
|     "value": [  |
|       1,        |
|       2,        |
|       3         |
|     ]           |
|   },            |
|   {             |
|     "key": "b", |
|     "value": [  |
|       4,        |
|       5         |
|     ]           |
|   }             |
| ]               |
+-----------------+
```

Return NULL for a NULL input:

```sqlexample
SELECT MAP_ENTRIES(NULL::MAP(VARCHAR, INT)) AS entries;
```

```output
+---------+
| ENTRIES |
|---------|
| NULL    |
+---------+
```

---
title: MAP_INSERT
source: https://docs.snowflake.com/en/sql-reference/functions/map_insert.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_INSERT

Returns a new [MAP](../data-types-structured.md) consisting of the input MAP with
a new key-value pair inserted. That is, an existing key is updated with a new value.

## Syntax

```sqlsyntax
MAP_INSERT( <map> , <key> , <value> [ , <updateFlag> ] )
```

## Arguments

`map`
:   The source map into which the new key-value pair is inserted.

`key`
:   The new key to insert into the map. Must be different from all existing keys in the map, unless `updateFlag` is set to
    TRUE.

`value`
:   The value associated with the key.

**Optional**

`updateFlag`
:   A Boolean flag that, when set to TRUE, specifies the input value is used to update an existing value for a
    key in the map, rather than inserting a new key-value pair.

    The default is FALSE.

## Returns

Returns a MAP consisting of the input MAP with a new key-value pair inserted or an existing key
updated with a new value.

## Usage notes

* The type of the key expression must match the type of the map’s key. If the type is VARCHAR,
  then the types can be different lengths.
* The function supports [JSON null](../../user-guide/semistructured-considerations.md) values, but not SQL NULL values or keys:

  + If `key` is any string other than NULL and `value` is a JSON null (for example,
    `PARSE_JSON('NULL')`), then the key-value pair is inserted into the returned map.
  + If `key` is any string other than NULL and `value` is a SQL NULL (for example,
    `NULL`), then the value is converted to JSON null, and the key-value pair is inserted into the
    returned map.
  + If `key` is a SQL NULL, the key-value pair is omitted from the returned map.
* If `updateFlag` is set to TRUE, then the existing input `key` is updated
  to the input `value`. If `updateFlag` is omitted or set to FALSE, and the
  input key already exists in the map, then an error is returned.
* If `updateFlag` is set to TRUE, but the corresponding key does not already exist in
  the map, then the key-value pair is added.

## Examples

Insert a third key-value pair into a map containing two key-value pairs:

```sqlexample
SELECT MAP_INSERT({'a':1,'b':2}::MAP(VARCHAR,NUMBER),'c',3);
```

```output
+------------------------------------------------------+
| MAP_INSERT({'A':1,'B':2}::MAP(VARCHAR,NUMBER),'C',3) |
|------------------------------------------------------|
| {                                                    |
|   "a": 1,                                            |
|   "b": 2,                                            |
|   "c": 3                                             |
| }                                                    |
+------------------------------------------------------+
```

Insert two new key-value pairs, while omitting one key-value pair, into an empty map:

* `Key_One` consists of a JSON null value.
* `Key_Two` consists of a SQL NULL value, which is converted to a JSON null value.
* `Key_Three` consists of a string containing “null”.

```sqlexample
SELECT MAP_INSERT(MAP_INSERT(MAP_INSERT({}::MAP(VARCHAR,VARCHAR),
  'Key_One', PARSE_JSON('NULL')), 'Key_Two', NULL), 'Key_Three', 'null');
```

```output
+---------------------------------------------------------------------------+
| MAP_INSERT(MAP_INSERT(MAP_INSERT({}::MAP(VARCHAR,VARCHAR),                |
|    'KEY_ONE', PARSE_JSON('NULL')), 'KEY_TWO', NULL), 'KEY_THREE', 'NULL') |
|---------------------------------------------------------------------------|
| {                                                                         |
|   "Key_One": null,                                                        |
|   "Key_Three": "null",                                                    |
|   "Key_Two": null                                                         |
| }                                                                         |
+---------------------------------------------------------------------------+
```

Update an existing key-value pair (`"k1": 100`) with a new value (`"string-value"`):

```sqlexample
SELECT MAP_INSERT({'k1':100}::MAP(VARCHAR,VARCHAR), 'k1', 'string-value', TRUE) AS map;
```

```output
+------------------------+
| MAP                    |
|------------------------|
| {                      |
|   "k1": "string-value" |
| }                      |
+------------------------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Using the keys in the `ins_key` column and the values in the `ins_val` column, insert
or update key-value pairs in the maps in the `attrs` column:

```sqlexample
SELECT id, MAP_INSERT(attrs, ins_key, ins_val, TRUE) AS attrs_insert_or_update
  FROM demo_maps;
```

```output
+----+-------------------------+
| ID | ATTRS_INSERT_OR_UPDATE  |
|----+-------------------------|
|  1 | {                       |
|    |   "brand": "Acme",      |
|    |   "color": "red",       |
|    |   "material": "cotton", |
|    |   "size": "M"           |
|    | }                       |
|  2 | {                       |
|    |   "brand": "ZC",        |
|    |   "color": "blue"       |
|    | }                       |
+----+-------------------------+
```

---
title: MAP_KEYS
source: https://docs.snowflake.com/en/sql-reference/functions/map_keys.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_KEYS

Returns the keys in a [MAP](../data-types-structured.md).

## Syntax

```sqlsyntax
MAP_KEYS( <map> )
```

## Arguments

`map`
:   The input map.

## Returns

Returns a structured ARRAY containing the keys in the MAP. The order of the keys is undefined.

## Examples

List the keys in a map:

```sqlexample
SELECT MAP_KEYS({'a':1,'b':2,'c':3}::MAP(VARCHAR,NUMBER))
  AS map_keys;
```

```output
+----------+
| MAP_KEYS |
|----------|
| [        |
|   "a",   |
|   "b",   |
|   "c"    |
| ]        |
+----------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Return the keys in the MAP values in the `attrs` column:

```sqlexample
SELECT id, MAP_KEYS(attrs) AS attr_keys
  FROM demo_maps;
```

```output
+----+------------+
| ID | ATTR_KEYS  |
|----+------------|
|  1 | [          |
|    |   "brand", |
|    |   "color", |
|    |   "size"   |
|    | ]          |
|  2 | [          |
|    |   "brand", |
|    |   "color"  |
|    | ]          |
+----+------------+
```

---
title: MAP_PICK
source: https://docs.snowflake.com/en/sql-reference/functions/map_pick.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_PICK

Returns a new [MAP](../data-types-structured.md) containing the specified key-value pairs from an existing MAP.

To identify the key-value pairs to include in the new map, pass in the keys as arguments, or pass in an array containing the keys.

If a specified key isn’t present in the input map, the key is ignored.

## Syntax

```sqlsyntax
MAP_PICK( <map>, <key1> [, <key2>, ... ] )

MAP_PICK( <map>, <array> )
```

## Arguments

`map`
:   The input map.

`key1,key2`
:   One or more keys that identify the key-value pairs to be included in the returned map.

`array`
:   An array of keys that identify the key-value pairs to be included in the returned map. You can specify a semi-structured ARRAY
    or a structured ARRAY.

## Returns

Returns a new MAP containing some of the key-value pairs from an existing MAP.

## Examples

Create a new map that contains two of the three key-value pairs from an existing map:

```sqlexample
SELECT MAP_PICK({'a':1,'b':2,'c':3}::MAP(VARCHAR,NUMBER),'a', 'b')
  AS new_map;
```

```output
+-----------+
| NEW_MAP   |
|-----------|
| {         |
|   "a": 1, |
|   "b": 2  |
| }         |
+-----------+
```

In the previous example, the keys are passed as arguments to MAP_PICK. You can also use an array to specify the keys:

```sqlexample
SELECT MAP_PICK({'a':1,'b':2,'c':3}::MAP(VARCHAR,NUMBER), ['a', 'b'])
  AS new_map;
```

```output
+-----------+
| NEW_MAP   |
|-----------|
| {         |
|   "a": 1, |
|   "b": 2  |
| }         |
+-----------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Using the keys in the `keep_keys` column, return new MAP values from the MAP values in
the `attrs` column:

```sqlexample
SELECT id, MAP_PICK(attrs, keep_keys) AS attrs_subset
  FROM demo_maps;
```

```output
+----+--------------------+
| ID | ATTRS_SUBSET       |
|----+--------------------|
|  1 | {                  |
|    |   "brand": "Acme", |
|    |   "color": "red"   |
|    | }                  |
|  2 | {                  |
|    |   "brand": "ZenCo" |
|    | }                  |
+----+--------------------+
```

---
title: MAP_SIZE
source: https://docs.snowflake.com/en/sql-reference/functions/map_size.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Map)

# MAP_SIZE

Returns the size of a [MAP](../data-types-structured.md).

## Syntax

```sqlsyntax
MAP_SIZE( <map> )
```

## Arguments

`map`
:   The input map.

## Returns

Returns the number of entries in the map.

## Examples

Determine the number of entries in a map:

```sqlexample
SELECT MAP_SIZE({'a':1,'b':2,'c':3}::MAP(VARCHAR,NUMBER))
  AS map_size;
```

```output
+----------+
| MAP_SIZE |
|----------|
|        3 |
+----------+
```

Create a temporary table that contains MAP values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_maps(
  id INTEGER,
  attrs MAP(VARCHAR, VARCHAR),
  defaults MAP(VARCHAR, VARCHAR),
  keep_keys ARRAY(VARCHAR),
  ins_key VARCHAR,
  ins_val VARCHAR,
  update_existing BOOLEAN,
  del_key1 VARCHAR,
  del_key2 VARCHAR);

INSERT INTO demo_maps SELECT
  1,
  {'color':'red','size':'M','brand':'Acme'}::MAP(VARCHAR, VARCHAR),
  {'currency':'USD','size':'L'}::MAP(VARCHAR, VARCHAR),
  ['color','brand']::ARRAY(VARCHAR),
  'material',
  'cotton',
  TRUE,
  'size',
  'brand';

INSERT INTO demo_maps SELECT
  2,
  {'color':'blue','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  {'currency':'EUR','size':'M','brand':'ZenCo'}::MAP(VARCHAR, VARCHAR),
  ['brand','currency']::ARRAY(VARCHAR),
  'brand',
  'ZC',
  FALSE,
  'currency',
  'material';
```

Query the table to show the data:

```sqlexample
SELECT * FROM demo_maps;
```

```output
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
| ID | ATTRS               | DEFAULTS             | KEEP_KEYS    | INS_KEY  | INS_VAL | UPDATE_EXISTING | DEL_KEY1 | DEL_KEY2 |
|----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------|
|  1 | {                   | {                    | [            | material | cotton  | True            | size     | brand    |
|    |   "brand": "Acme",  |   "currency": "USD", |   "color",   |          |         |                 |          |          |
|    |   "color": "red",   |   "size": "L"        |   "brand"    |          |         |                 |          |          |
|    |   "size": "M"       | }                    | ]            |          |         |                 |          |          |
|    | }                   |                      |              |          |         |                 |          |          |
|  2 | {                   | {                    | [            | brand    | ZC      | False           | currency | material |
|    |   "brand": "ZenCo", |   "brand": "ZenCo",  |   "brand",   |          |         |                 |          |          |
|    |   "color": "blue"   |   "currency": "EUR", |   "currency" |          |         |                 |          |          |
|    | }                   |   "size": "M"        | ]            |          |         |                 |          |          |
|    |                     | }                    |              |          |         |                 |          |          |
+----+---------------------+----------------------+--------------+----------+---------+-----------------+----------+----------+
```

Return the number of entries in the MAP values in the `attrs` column:

```sqlexample
SELECT id, MAP_SIZE(attrs) AS attr_count
  FROM demo_maps;
```

```output
+----+------------+
| ID | ATTR_COUNT |
|----+------------|
|  1 |          3 |
|  2 |          2 |
+----+------------+
```

---
title: MATERIALIZED_VIEW_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/materialized_view_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# MATERIALIZED_VIEW_REFRESH_HISTORY

This table function is used for querying the [materialized views](../../user-guide/views-materialized.md) refresh history for a specified materialized view within a specified date range. The information returned by the function includes the view name and credits consumed each time a materialized view is refreshed.

## Syntax

```sqlsyntax
MATERIALIZED_VIEW_REFRESH_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , MATERIALIZED_VIEW_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range to display the materialized view maintenance history.
    For example, if you specify that the start date is 2019-04-03 and the end date is 2019-04-05, then you get data for
    April 3, April 4, and April 5. (The endpoints are included.)

    * If neither a start date nor an end date is specified, the default will be the last 12 hours.
    * If an end date is not specified, but a start date is specified, then [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If a start date is not specified, but an end date is specified, then the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

`MATERIALIZED_VIEW_NAME => string`
:   Materialized view name. If specified, only shows the history for the specified materialized view. The name can include the schema name and the database
    name.

    If a name is not specified, then the results includes the data for each materialized
    view maintained within the specified time range.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.

  > **Note:**
  >
  > A role with the MONITOR USAGE privilege can view per-object credit usage, but not object names. The role must also be granted SELECT on an object in order for its name to be returned by this function. If the role does not have sufficient privileges to see the object name, the object name might be displayed with a substitute name such as “unknown_#”, where “#” represents one or more digits.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be
  fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* The history is displayed in increments of 1 hour.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | TEXT | Number of credits billed for materialized view maintenance during the START_TIME and END_TIME window. |
| MATERIALIZED_VIEW_NAME | TEXT | Name of the materialized view. |

## Examples

Retrieve the refresh history for a one-hour range for your account:

> ```sqlexample
> select *
>   from table(information_schema.materialized_view_refresh_history(
>     date_range_start=>'2019-05-22 19:00:00.000',
>     date_range_end=>'2019-05-22 20:00:00.000'));
> ```
>
> Here is sample output:
>
> ```sqlexample
> +-------------------------------+-------------------------------+--------------+-----------------------------------------+
> | START_TIME                    | END_TIME                      | CREDITS_USED | MATERIALIZED_VIEW_NAME                  |
> |-------------------------------+-------------------------------+--------------+-----------------------------------------|
> | 2019-05-22 19:00:00.000 -0700 | 2019-05-22 20:00:00.000 -0700 |  0.223276651 | TEST_DB.TEST_SCHEMA.MATERIALIZED_VIEW_1 |
> +-------------------------------+-------------------------------+--------------+-----------------------------------------+
> ```

Retrieve the history for the last 12 hours for your account:

> ```sqlexample
> select *
>   from table(information_schema.materialized_view_refresh_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the history for the past week for your account:

> ```sqlexample
> select *
>   from table(information_schema.materialized_view_refresh_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date));
> ```

Retrieve the maintenance history for the past week for a specified materialized view in your account:

> ```sqlexample
> select *
>   from table(information_schema.materialized_view_refresh_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date,
>     materialized_view_name=>'mydb.myschema.my_materialized_view'));
> ```

---
title: MAX
source: https://docs.snowflake.com/en/sql-reference/functions/max.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# MAX

Returns the maximum value for the records within `expr`. NULL values are ignored unless all the records are NULL, in which case a NULL value is returned.

See also:
:   [COUNT](count.md) , [SUM](sum.md) , [MIN](min.md)

## Syntax

**Aggregate function**

```sqlsyntax
MAX( <expr> )
```

**Window function**

```sqlsyntax
MAX( <expr> ) [ OVER ( [ PARTITION BY <expr1> ] [ ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ] ) ]
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Returns

The data type of the returned value is the same as the data type of the input values.

## Usage Notes

* For compatibility with other systems, you can specify the DISTINCT keyword as an argument for the function,
  but it does not have any effect.
* If the function is called as a window function, the window can include an optional `window_frame`.
  The `window_frame` (either cumulative or sliding) specifies the subset of rows within the window for which
  the summed values are returned. If no `window_frame` is specified, the default is the following
  cumulative window frame (in accordance with the ANSI standard for window functions):

  > `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

  For more details about window frames, including syntax and examples, see [Usage notes for window frames](../functions-window-syntax.md).

## Collation Details

* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result is the same as the collation of the input.

## Examples

The following examples demonstrate how to use the MAX function.

Create a table and data:

```sqlexample
CREATE OR REPLACE TABLE sample_table (k CHAR(4), d CHAR(4));

INSERT INTO sample_table VALUES
  ('1', '1'), ('1', '5'), ('1', '3'),
  ('2', '2'), ('2', NULL),
  ('3', NULL),
  (NULL, '7'), (NULL, '1');
```

Display the data:

```sqlexample
SELECT k, d
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+
| K    | D    |
|------+------|
| 1    | 1    |
| 1    | 3    |
| 1    | 5    |
| 2    | 2    |
| 2    | NULL |
| 3    | NULL |
| NULL | 1    |
| NULL | 7    |
+------+------+
```

Use the MAX function to retrieve the largest
value in the column named `d`:

```sqlexample
SELECT MAX(d)
  FROM sample_table;
```

```output
+--------+
| MAX(D) |
|--------|
| 7      |
+--------+
```

Combine the GROUP BY clause with the MAX function
to retrieve the largest values in each group (where each
group is based on the value of column `k`):

```sqlexample
SELECT k, MAX(d)
  FROM sample_table
  GROUP BY k
  ORDER BY k;
```

```output
+------+--------+
| K    | MAX(D) |
|------+--------|
| 1    | 5      |
| 2    | 2      |
| 3    | NULL   |
| NULL | 7      |
+------+--------+
```

Use a PARTITION BY clause to break the data into groups based on the
value of `k`. This is similar to, but not identical to, using
GROUP BY. In particular, GROUP BY produces one output
row per group, while PARTITION BY produces one output row per input
row.

```sqlexample
SELECT k, d, MAX(d) OVER (PARTITION BY k)
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+------------------------------+
| K    | D    | MAX(D) OVER (PARTITION BY K) |
|------+------+------------------------------|
| 1    | 1    | 5                            |
| 1    | 3    | 5                            |
| 1    | 5    | 5                            |
| 2    | 2    | 2                            |
| 2    | NULL | 2                            |
| 3    | NULL | NULL                         |
| NULL | 1    | 7                            |
| NULL | 7    | 7                            |
+------+------+------------------------------+
```

Use a windowing ORDER BY clause to create a sliding window two rows wide,
and output the highest value within that window. (Remember that ORDER BY in
the windowing clause is separate from ORDER BY at the statement level.)
This example uses a single partition, so there is no PARTITION BY clause
in the OVER() clause.

```sqlexample
SELECT k, d, MAX(d) OVER (ORDER BY k, d ROWS BETWEEN 1 PRECEDING AND CURRENT ROW)
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+----------------------------------------------------------------------+
| K    | D    | MAX(D) OVER (ORDER BY K, D ROWS BETWEEN 1 PRECEDING AND CURRENT ROW) |
|------+------+----------------------------------------------------------------------|
| 1    | 1    | 1                                                                    |
| 1    | 3    | 3                                                                    |
| 1    | 5    | 5                                                                    |
| 2    | 2    | 5                                                                    |
| 2    | NULL | 2                                                                    |
| 3    | NULL | NULL                                                                 |
| NULL | 1    | 1                                                                    |
| NULL | 7    | 7                                                                    |
+------+------+----------------------------------------------------------------------+
```

---
title: MAX (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_max.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# MAX (system data metric function)

Returns the maximum value for the specified column in a table.

The MAX system data metric function is optimized to calculate the maximum value for a single column and provides greater performance when
compared to calling the [MAX](max.md) function.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.MAX(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* FLOAT
* NUMBER

## Returns

The function returns either a NUMBER or FLOAT value.

## Example

Measure the maximum value for the `salary` column in a table:

```sqlexample
SELECT SNOWFLAKE.CORE.MAX(
  SELECT
    salary
  FROM hr.tables.empl_info);
```

```output
+------------------------------------------------------------+
| SNOWFLAKE.CORE.MAX(SELECT salary FROM hr.tables.empl_info) |
+------------------------------------------------------------+
| 325000                                                     |
+------------------------------------------------------------+
```

---
title: MAX_BY
source: https://docs.snowflake.com/en/sql-reference/functions/max_by.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)

# MAX_BY

Finds the row(s) containing the maximum value for a column and returns the value of another column in that row.

For example, if a table contains the columns `employee_id` and `salary`, `MAX_BY(employee_id, salary)` returns the value
of the `employee_id` column for the row that has the highest value in the `salary` column.

If multiple rows contain the specified maximum value, the function is non-deterministic.

To return values for multiple rows, specify the optional `maximum_number_of_values_to_return` argument. With this
additional argument:

* The function returns an [ARRAY](../data-types-semistructured.md) containing the values of a column for the rows with the highest
  values of a specified column.
* The values in the ARRAY are sorted by their corresponding values in the column containing the maximum values.
* If multiple rows contain these highest values, the function is non-deterministic.

For example, `MAX_BY(employee_id, salary, 5)` returns an ARRAY of values of the `employee_id` column for the five rows
containing the highest values in the `salary` column. The IDs in the ARRAY are sorted by the corresponding values in the
`salary` column.

See also:
:   [MAX](max.md)

## Syntax

```sqlsyntax
MAX_BY( <col_to_return>, <col_containing_maximum> [ , <maximum_number_of_values_to_return> ] )
```

## Arguments

**Required:**

`col_to_return`
:   Column containing the value to return.

`col_containing_maximum`
:   Column containing the maximum value.

**Optional:**

`maximum_number_of_values_to_return`
:   Constant integer specifying the maximum number of values to return. You must specify a positive number. The maximum number that
    you can specify is `1000`.

## Returns

* If `maximum_number_of_values_to_return` is not specified, the function returns a value of the same type as
  `col_to_return`.
* If `maximum_number_of_values_to_return` is specified, the function returns an ARRAY containing values of the same type
  as `col_to_return`. The values in the ARRAY are sorted by their corresponding `col_containing_maximum` values.

  For example, `MAX_BY(employee_id, salary, 5)` returns the IDs of the employees with the highest five salaries, sorted by
  `salary` (in descending order).

## Usage notes

* The function ignores NULL values in `col_containing_maximum`.
* If all values in `col_containing_maximum` are NULL, the function returns NULL (regardless of whether the optional
  `maximum_number_of_values_to_return` argument is specified).

## Examples

The following examples demonstrate how to use the MAX_BY function.

To run these examples, execute the following statements to set up the table and data for the examples:

```sqlexample
CREATE OR REPLACE TABLE employees(employee_id NUMBER, department_id NUMBER, salary NUMBER);

INSERT INTO employees VALUES
  (1001, 10, 10000),
  (1020, 10, 9000),
  (1030, 10, 8000),
  (900, 20, 15000),
  (2000, 20, NULL),
  (2010, 20, 15000),
  (2020, 20, 8000);
```

Execute the following statement to view the contents of this table:

```sqlexample
SELECT * FROM employees;
```

```output
+-------------+---------------+--------+
| EMPLOYEE_ID | DEPARTMENT_ID | SALARY |
|-------------+---------------+--------|
|        1001 |            10 |  10000 |
|        1020 |            10 |   9000 |
|        1030 |            10 |   8000 |
|         900 |            20 |  15000 |
|        2000 |            20 |   NULL |
|        2010 |            20 |  15000 |
|        2020 |            20 |   8000 |
+-------------+---------------+--------+
```

The following example returns the ID of the employee with the highest salary:

```sqlexample
SELECT MAX_BY(employee_id, salary) FROM employees;
```

```output
+-----------------------------+
| MAX_BY(EMPLOYEE_ID, SALARY) |
|-----------------------------|
|                         900 |
+-----------------------------+
```

Note the following:

* Because more than one row contains the maximum value for the `salary` column, the function is non-deterministic and might
  return the employee ID for a different row in subsequent executions.
* The function ignores the NULL value in the `salary` column when determining the rows with the maximum values.

The following example returns an ARRAY containing the IDs of the employees with the three highest salaries:

```sqlexample
SELECT MAX_BY(employee_id, salary, 3) from employees;
```

```output
+--------------------------------+
| MAX_BY(EMPLOYEE_ID, SALARY, 3) |
|--------------------------------|
| [                              |
|   900,                         |
|   2010,                        |
|   1001                         |
| ]                              |
+--------------------------------+
```

As shown in the example, the values in the ARRAY are sorted by their corresponding values in the `salary` column. So,
MAX_BY returns the IDs of employees sorted by their salary in descending order.

If more than one of these rows contain the same value in the `salary` column, the order of the returned values for that salary
is non-deterministic.

See also [Using the MIN_BY and MAX_BY aggregate functions](../../user-guide/querying-time-series-data.md).

---
title: MD5 , MD5_HEX
source: https://docs.snowflake.com/en/sql-reference/functions/md5.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Checksum)

# MD5 , MD5_HEX

Returns a 32-character hex-encoded string containing the 128-bit MD5
message digest.

These functions are synonymous.

See also:
:   [MD5_BINARY](md5_binary.md), [MD5_NUMBER_LOWER64](md5_number_lower64.md), [MD5_NUMBER_UPPER64](md5_number_upper64.md)

## Syntax

```sqlsyntax
MD5(<msg>)

MD5_HEX(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

Returns a 32-character hex-encoded string.

## Usage notes

* Although the MD5\* functions were originally developed as cryptographic functions, they are now
  obsolete for cryptography and should not be used for that purpose. They can be used for other purposes
  (for example, as “checksum” functions to detect accidental data corruption).

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
SELECT md5('Snowflake');

----------------------------------+
         MD5('SNOWFLAKE')         |
----------------------------------+
 edf1439075a83a447fb8b630ddc9c8de |
----------------------------------+
```

---
title: MD5_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/md5_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Checksum)

# MD5_BINARY

Returns a 16-byte `BINARY` value containing the 128-bit MD5 message digest.

See also:
:   [MD5 , MD5_HEX](md5.md), [MD5_NUMBER_LOWER64](md5_number_lower64.md), [MD5_NUMBER_UPPER64](md5_number_upper64.md)

## Syntax

```sqlsyntax
MD5_BINARY(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

Returns a 16-byte `BINARY` value containing the MD5 message digest.

## Usage notes

* Although the MD5\* functions were originally developed as cryptographic functions, they are now
  obsolete for cryptography and should not be used for that purpose. They can be used for other purposes
  (for example, as “checksum” functions to detect accidental data corruption).

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

The example below shows a simple example of using the function. Note that
although the output is a 16-byte binary string, by default SNOWSQL displays
binary values as a series of hexadecimal digits, so the output below appears
as 32 hexadecimal digits, not as 16 one-byte characters.

> ```sqlexample
> SELECT md5_binary('Snowflake');
> +----------------------------------+
> | MD5_BINARY('SNOWFLAKE')          |
> |----------------------------------|
> | EDF1439075A83A447FB8B630DDC9C8DE |
> +----------------------------------+
> ```

This example demonstrates using the function to insert into a table that
contains a column of type `BINARY`.

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE binary_demo (b BINARY);
> > INSERT INTO binary_demo (b) SELECT MD5_BINARY('Snowflake');
> > ```
>
> Output:
>
> > ```sqlexample
> > SELECT TO_VARCHAR(b, 'HEX') AS hex_representation
> >     FROM binary_demo;
> > +----------------------------------+
> > | HEX_REPRESENTATION               |
> > |----------------------------------|
> > | EDF1439075A83A447FB8B630DDC9C8DE |
> > +----------------------------------+
> > ```

---
title: MD5_NUMBER — Obsoleted
source: https://docs.snowflake.com/en/sql-reference/functions/md5_number.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Checksum)

# MD5_NUMBER — *Obsoleted*

Returns the 128-bit MD5 message digest interpreted as a signed 128-bit big
endian number. This representation is useful for maximally efficient storage
and comparison of MD5 digests.

See also:
:   [MD5 , MD5_HEX](md5.md), [MD5_BINARY](md5_binary.md), [MD5_NUMBER_LOWER64](md5_number_lower64.md), [MD5_NUMBER_UPPER64](md5_number_upper64.md)

## Syntax

```sqlsyntax
MD5_NUMBER(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

A signed integer (`NUMERIC(38, 0)`).

This integer can be outside the range stored by `NUMERIC(38, 0)`, so this function has been obsoleted.

## Usage notes

Although the `MD5`, `MD5_BINARY`, and `MD5_NUMBER` functions
were originally developed as cryptographic functions, they are now
obsolete for cryptography and should not be used for that purpose.
They can be used for other purposes, for example as “checksum”
functions to detect accidental data corruption.

## Examples

```sqlexample
SELECT md5_number('Snowflake');

-----------------------------------------+
         MD5_NUMBER('SNOWFLAKE')         |
-----------------------------------------+
 -24002618010294540563082926240470284066 |
-----------------------------------------+
```

---
title: MD5_NUMBER_LOWER64
source: https://docs.snowflake.com/en/sql-reference/functions/md5_number_lower64.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Checksum)

# MD5_NUMBER_LOWER64

Calculates the 128-bit MD5 message digest, interprets it as a signed 128-bit big endian number, and returns the lower 64 bits of
the number as an unsigned integer. This representation is useful for maximally efficient storage and comparison of MD5 digests.

See also:
:   [MD5 , MD5_HEX](md5.md), [MD5_BINARY](md5_binary.md), [MD5_NUMBER_UPPER64](md5_number_upper64.md)

## Syntax

```sqlsyntax
MD5_NUMBER_LOWER64(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

A 64 bit unsigned integer that represents the lower 64 bits of the message digest.

## Usage notes

* Although the MD5\* functions were originally developed as cryptographic functions, they are now
  obsolete for cryptography and should not be used for that purpose. They can be used for other purposes
  (for example, as “checksum” functions to detect accidental data corruption).

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
select md5_number_lower64('Snowflake');

+---------------------------------+
| MD5_NUMBER_LOWER64('SNOWFLAKE') |
|---------------------------------|
|             9203306159527282910 |
+---------------------------------+
```

---
title: MD5_NUMBER_UPPER64
source: https://docs.snowflake.com/en/sql-reference/functions/md5_number_upper64.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Checksum)

# MD5_NUMBER_UPPER64

Calculates the 128-bit MD5 message digest, interprets it as a signed 128-bit big endian number, and returns the upper 64 bits of
the number as an unsigned integer. This representation is useful for maximally efficient storage and comparison of MD5 digests.

See also:
:   [MD5 , MD5_HEX](md5.md), [MD5_BINARY](md5_binary.md), [MD5_NUMBER_LOWER64](md5_number_lower64.md)

## Syntax

```sqlsyntax
MD5_NUMBER_UPPER64(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

A 64 bit unsigned integer that represents the upper 64 bits of the message digest.

## Usage notes

* Although the MD5\* functions were originally developed as cryptographic functions, they are now
  obsolete for cryptography and should not be used for that purpose. They can be used for other purposes
  (for example, as “checksum” functions to detect accidental data corruption).

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
select md5_number_upper64('Snowflake');

+---------------------------------+
| MD5_NUMBER_UPPER64('SNOWFLAKE') |
|---------------------------------|
|            17145559544104499780 |
+---------------------------------+
```

---
title: MEDIAN
source: https://docs.snowflake.com/en/sql-reference/functions/median.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# MEDIAN

Determines the median of a set of values.

## Syntax

**Aggregate function**

```sqlsyntax
MEDIAN( <expr> )
```

**Window function**

```sqlsyntax
MEDIAN( <expr> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Argument

`expr`
:   The expression must evaluate to a numeric data type (INTEGER, FLOAT,
    DECIMAL, or equivalent).

## Returns

Returns a FLOAT or DECIMAL (fixed-point) number, depending upon the
input.

## Usage notes

* If the number of non-NULL values is an odd number greater than or equal to 1,
  this returns the median (“center”) value of the non-NULL values.
* If the number of non-NULL values is an even number, this returns a value
  equal to the average of the two center values. For example, if the
  values are 1, 3, 5, and 20, then this returns 4 (the average of 3 and 5).
* If all values are NULL, this returns NULL.
* If the number of non-NULL values is 0, this returns NULL.
* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

This shows how to use the function.

Create an empty table.

```sqlexample
CREATE OR REPLACE TABLE aggr (k INT, v DECIMAL(10,2));
```

Get the MEDIAN value for column v. The function returns NULL because
there are no rows.

```sqlexample
SELECT MEDIAN(v)
  FROM aggr;
```

```output
+------------+
| MEDIAN (V) |
|------------|
|       NULL |
+------------+
```

Insert some rows:

```sqlexample
INSERT INTO aggr VALUES (1, 10), (1, 20), (1, 21);
INSERT INTO aggr VALUES (2, 10), (2, 20), (2, 25), (2, 30);
INSERT INTO aggr VALUES (3, NULL);
```

Get the MEDIAN value for each group. Note that because the number of
values in group k = 2 is an even number, the returned value for that group
is the mid-point between the two middle numbers.

```sqlexample
SELECT k, MEDIAN(v)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+-----------+
| K | MEDIAN(V) |
|---+-----------|
| 1 |  20.00000 |
| 2 |  22.50000 |
| 3 |      NULL |
+---+-----------+
```

---
title: MIN
source: https://docs.snowflake.com/en/sql-reference/functions/min.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# MIN

Returns the minimum value for the records within `expr`. NULL values are ignored unless all the records are NULL, in which case a NULL value is returned.

See also:
:   [COUNT](count.md) , [SUM](sum.md) , [MAX](max.md)

## Syntax

**Aggregate function**

```sqlsyntax
MIN( <expr> )
```

**Window function**

```sqlsyntax
MIN( <expr> ) [ OVER ( [ PARTITION BY <expr1> ] [ ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ] ) ]
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Returns

The data type of the returned value is the same as the data type of the input values.

## Usage Notes

* For compatibility with other systems, you can specify the DISTINCT keyword as an argument for the function,
  but it does not have any effect.
* If the function is called as a window function, the window can include an optional `window_frame`.
  The `window_frame` (either cumulative or sliding) specifies the subset of rows within the window for which
  the summed values are returned. If no `window_frame` is specified, the default is the following
  cumulative window frame (in accordance with the ANSI standard for window functions):

  > `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

  For more details about window frames, including syntax and examples, see [Usage notes for window frames](../functions-window-syntax.md).

## Collation Details

* The comparisons follow the collation based on the input arguments’ collations and precedences.
* The collation of the result is the same as the collation of the input.

## Examples

The following examples demonstrate how to use the MIN function.

Create a table and data:

```sqlexample
CREATE OR REPLACE TABLE sample_table (k CHAR(4), d CHAR(4));

INSERT INTO sample_table VALUES
  ('1', '1'), ('1', '5'), ('1', '3'),
  ('2', '2'), ('2', NULL),
  ('3', NULL),
  (NULL, '7'), (NULL, '1');
```

Display the data:

```sqlexample
SELECT k, d
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+
| K    | D    |
|------+------|
| 1    | 1    |
| 1    | 3    |
| 1    | 5    |
| 2    | 2    |
| 2    | NULL |
| 3    | NULL |
| NULL | 1    |
| NULL | 7    |
+------+------+
```

Use the MIN function to retrieve the smallest value in the column named `d`:

```sqlexample
SELECT MIN(d)
  FROM sample_table;
```

```output
+--------+
| MIN(D) |
|--------|
| 1      |
+--------+
```

Combine the GROUP BY clause with the MIN function
to retrieve the smallest values in each group (where each
group is based on the value of column `k`):

```sqlexample
SELECT k, MIN(d)
  FROM sample_table
  GROUP BY k
  ORDER BY k;
```

```output
+------+--------+
| K    | MIN(D) |
|------+--------|
| 1    | 1      |
| 2    | 2      |
| 3    | NULL   |
| NULL | 1      |
+------+--------+
```

Use a PARTITION BY clause to break the data into groups based on the
value of `k`. This is similar to, but not identical to, using
GROUP BY. In particular, GROUP BY produces one output
row per group, while PARTITION BY produces one output row per input
row.

```sqlexample
SELECT k, d, MIN(d) OVER (PARTITION BY k)
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+------------------------------+
| K    | D    | MIN(D) OVER (PARTITION BY K) |
|------+------+------------------------------|
| 1    | 1    | 1                            |
| 1    | 3    | 1                            |
| 1    | 5    | 1                            |
| 2    | 2    | 2                            |
| 2    | NULL | 2                            |
| 3    | NULL | NULL                         |
| NULL | 1    | 1                            |
| NULL | 7    | 1                            |
+------+------+------------------------------+
```

Use an ORDER BY clause to create a sliding window two rows wide,
and output the lowest value within that window. (Remember that ORDER BY in
the OVER clause is separate from ORDER BY at the statement level.)
This example uses a single partition, so there is no PARTITION BY clause
in the OVER clause.

```sqlexample
SELECT k, d, MIN(d) OVER (ORDER BY k, d ROWS BETWEEN 1 PRECEDING AND CURRENT ROW)
  FROM sample_table
  ORDER BY k, d;
```

```output
+------+------+----------------------------------------------------------------------+
| K    | D    | MIN(D) OVER (ORDER BY K, D ROWS BETWEEN 1 PRECEDING AND CURRENT ROW) |
|------+------+----------------------------------------------------------------------|
| 1    | 1    | 1                                                                    |
| 1    | 3    | 1                                                                    |
| 1    | 5    | 3                                                                    |
| 2    | 2    | 2                                                                    |
| 2    | NULL | 2                                                                    |
| 3    | NULL | NULL                                                                 |
| NULL | 1    | 1                                                                    |
| NULL | 7    | 1                                                                    |
+------+------+----------------------------------------------------------------------+
```

---
title: MIN (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_min.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# MIN (system data metric function)

Returns the minimum value for the specified column in a table.

The MIN system data metric function is optimized to calculate the minimum value for a single column and provides greater performance when
compared to calling the [MIN](min.md) function.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.MIN(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* FLOAT
* NUMBER

## Returns

The function returns either a NUMBER or FLOAT value.

## Example

Measure the minimum value for the SALARY column in a table:

```sqlexample
SELECT SNOWFLAKE.CORE.MIN(
  SELECT
    salary
  FROM hr.tables.empl_info
);
```

```output
+------------------------------------------------------------+
| SNOWFLAKE.CORE.MIN(SELECT salary FROM hr.tables.empl_info) |
+------------------------------------------------------------+
| 60000                                                      |
+------------------------------------------------------------+
```

---
title: MIN_BY
source: https://docs.snowflake.com/en/sql-reference/functions/min_by.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)

# MIN_BY

Finds the row(s) containing the minimum value for a column and returns the value of another column in that row.

For example, if a table contains the columns `employee_id` and `salary`, `MIN_BY(employee_id, salary)` returns the value
of the `employee_id` column for the row that has the lowest value in the `salary` column.

If multiple rows contain the specified minimum value, the function is non-deterministic.

To return values for multiple rows, specify the optional `maximum_number_of_values_to_return` argument. With this
additional argument:

* The function returns an [ARRAY](../data-types-semistructured.md) containing the values of a column for the rows with the lowest
  values of a specified column.
* The values in the ARRAY are sorted by their corresponding values in the column containing the minimum values.
* If multiple rows contain these lowest values, the function is non-deterministic.

For example, `MIN_BY(employee_id, salary, 5)` returns an ARRAY of values of the `employee_id` column for the five rows
containing the lowest values in the `salary` column. The IDs in the ARRAY are sorted by the corresponding values in the
`salary` column.

See also:
:   [MIN](min.md)

## Syntax

```sqlsyntax
MIN_BY( <col_to_return>, <col_containing_mininum> [ , <maximum_number_of_values_to_return> ] )
```

## Arguments

**Required:**

`col_to_return`
:   Column containing the value to return.

`col_containing_mininum`
:   Column containing the minimum value.

**Optional:**

`maximum_number_of_values_to_return`
:   Constant integer specifying the maximum number of values to return. You must specify a positive number. The maximum number that
    you can specify is `1000`.

## Returns

* If `maximum_number_of_values_to_return` is not specified, the function returns a value of the same type as
  `col_to_return`.
* If `maximum_number_of_values_to_return` is specified, the function returns an ARRAY containing values of the same type
  as `col_to_return`. The values in the ARRAY are sorted by their corresponding `col_containing_mininum` values.

  For example, `MIN_BY(employee_id, salary, 5)` returns the IDs of the employees with the lowest five salaries, sorted by
  `salary` (in ascending order).

## Usage notes

* The function ignores NULL values in `col_containing_mininum`.
* If all values in `col_containing_mininum` are NULL, the function returns NULL (regardless of whether the optional
  `maximum_number_of_values_to_return` argument is specified).

## Examples

The following examples demonstrate how to use the MIN_BY function.

To run these examples, execute the following statements to set up the table and data for the examples:

```sqlexample
CREATE OR REPLACE TABLE employees(employee_id NUMBER, department_id NUMBER, salary NUMBER);

INSERT INTO employees VALUES
  (1001, 10, 10000),
  (1020, 10, 9000),
  (1030, 10, 8000),
  (900, 20, 15000),
  (2000, 20, NULL),
  (2010, 20, 15000),
  (2020, 20, 8000);
```

Execute the following statement to view the contents of this table:

```sqlexample
SELECT * FROM employees;
```

```output
+-------------+---------------+--------+
| EMPLOYEE_ID | DEPARTMENT_ID | SALARY |
|-------------+---------------+--------|
|        1001 |            10 |  10000 |
|        1020 |            10 |   9000 |
|        1030 |            10 |   8000 |
|         900 |            20 |  15000 |
|        2000 |            20 |   NULL |
|        2010 |            20 |  15000 |
|        2020 |            20 |   8000 |
+-------------+---------------+--------+
```

The following example returns the ID of the employee with the lowest salary:

```sqlexample
SELECT MIN_BY(employee_id, salary) FROM employees;
```

```output
+-----------------------------+
| MIN_BY(EMPLOYEE_ID, SALARY) |
|-----------------------------|
|                        1030 |
+-----------------------------+
```

Note the following:

* Because more than one row contains the minimum value for the `salary` column, the function is non-deterministic and might
  return the employee ID for a different row in subsequent executions.
* The function ignores the NULL value in the `salary` column when determining the rows with the minimum values.

The following example returns an ARRAY containing the IDs of the employees with the three lowest salaries:

```sqlexample
SELECT MIN_BY(employee_id, salary, 3) FROM employees;

+--------------------------------+
| MIN_BY(EMPLOYEE_ID, SALARY, 3) |
|--------------------------------|
| [                              |
|   1030,                        |
|   2020,                        |
|   1020                         |
| ]                              |
+--------------------------------+
```

As shown in the example, the values in the ARRAY are sorted by their corresponding values in the `salary` column. So,
MIN_BY returns the IDs of employees sorted by their salary in ascending order.

If more than one of these rows contain the same value in the `salary` column, the order of the returned values for that salary
is non-deterministic.

See also [Using the MIN_BY and MAX_BY aggregate functions](../../user-guide/querying-time-series-data.md).

---
title: MINHASH
source: https://docs.snowflake.com/en/sql-reference/functions/minhash.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Similarity Estimation) ,
    [Window functions](../functions-window-syntax.md) (Similarity Estimation)

# MINHASH

Returns a MinHash state containing an array of size `k` constructed by applying `k` number of different hash functions to the input rows and keeping the minimum of each hash function. This MinHash state can
then be input to the [APPROXIMATE_SIMILARITY](approximate_similarity.md) function to estimate the similarity with one or more other MinHash states.

For more information about MinHash states, see [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

See also:
:   [MINHASH_COMBINE](minhash_combine.md)

## Syntax

**Aggregate function**

```sqlsyntax
MINHASH( <k> , [ DISTINCT ] expr+ )

MINHASH( <k> , * )
```

**Window function**

```sqlsyntax
MINHASH( <k> , [ DISTINCT ] expr+ ) OVER ( [ PARTITION BY <expr1> ] )

MINHASH( <k> , * ) OVER ( [ PARTITION BY <expr1> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`k`
:   The number of hash functions to create. The larger the value, the better the approximation;
    however, this value has a linear impact on the computation time for estimating similarity
    using APPROXIMATE_SIMILARITY. The suggested value is 100. The maximum value is 1024.

`expr`
:   One or more expressions (typically column names) that determine the values to hash.

`*`
:   Hash all columns in the input rows.

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).
* DISTINCT can be included as an argument, but has no effect.

## Examples

```sqlexample
USE SCHEMA snowflake_sample_data.tpch_sf1;

SELECT MINHASH(5, *) FROM orders;

+----------------------+
| MINHASH(5, *)        |
|----------------------|
| {                    |
|   "state": [         |
|     78678383574307,  |
|     586952033158539, |
|     525995912623966, |
|     508991839383217, |
|     492677003405678  |
|   ],                 |
|   "type": "minhash", |
|   "version": 1       |
| }                    |
+----------------------+
```

Here is a more extensive example, showing the three related functions
MINHASH, MINHASH_COMBINE and APPROXIMATE_SIMILARITY. This
example creates 3 tables (`ta`, `tb`, and `tc`), two of which (`ta` and `tb`) are
similar, and two of which (`ta` and `tc`) are completely dissimilar.

Create and populate tables with values:

```sqlexample
CREATE TABLE ta (i INTEGER);
CREATE TABLE tb (i INTEGER);
CREATE TABLE tc (i INTEGER);

INSERT INTO ta (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (10);
INSERT INTO tb (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (11);
INSERT INTO tc (i) VALUES (-1), (-20), (-300), (-4000);
```

Calculate minhash info for the initial set of data:

```sqlexample
CREATE TABLE minhash_a_1 (mh) AS SELECT MINHASH(100, i) FROM ta;
CREATE TABLE minhash_b (mh) AS SELECT MINHASH(100, i) FROM tb;
CREATE TABLE minhash_c (mh) AS SELECT MINHASH(100, i) FROM tc;
```

Add more data to one of the tables:

```sqlexample
INSERT INTO ta (i) VALUES (12);
```

Demonstrate the MINHASH_COMBINE function:

```sqlexample
CREATE TABLE minhash_a_2 (mh) AS SELECT MINHASH(100, i) FROM ta WHERE i > 10;

CREATE TABLE minhash_a (mh) AS
  SELECT MINHASH_COMBINE(mh)
    FROM (
      (SELECT mh FROM minhash_a_1)
      UNION ALL
      (SELECT mh FROM minhash_a_2)
    );
```

This query shows the approximate similarity of the two similar tables
(`ta` and `tb`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_b)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                        0.75 |
+-----------------------------+
```

This query shows the approximate similarity of the two very different tables
(`ta` and `tc`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_c)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                           0 |
+-----------------------------+
```

---
title: MINHASH_COMBINE
source: https://docs.snowflake.com/en/sql-reference/functions/minhash_combine.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Similarity Estimation) ,
    [Window functions](../functions-window-syntax.md) (Similarity Estimation)

# MINHASH_COMBINE

Combines input MinHash states into a single MinHash output state. This Minhash state can then be input to the [APPROXIMATE_SIMILARITY](approximate_similarity.md) function to estimate the similarity with other MinHash states.

This allows use cases in which MINHASH is run over horizontal rowsets of the same table, producing a MinHash state for each rowset. These states can then be combined using MINHASH_COMBINE, producing the same
output state as a single run of MINHASH over the entire table.

For more information about MinHash states, see [Estimating Similarity of Two or More Sets](../../user-guide/querying-approximate-similarity.md).

See also:
:   [MINHASH](minhash.md)

## Syntax

**Aggregate function**

```sqlsyntax
MINHASH_COMBINE( [ DISTINCT ] <state> )
```

**Window function**

```sqlsyntax
MINHASH_COMBINE( [ DISTINCT ] <state> ) OVER ( [ PARTITION BY <expr> ] )
```

For details about the OVER clause, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`state`
:   An expression that contains MinHash state information generated by a call to [MINHASH](minhash.md).
    Input MinHash states must have arrays of equal length.

## Usage notes

* This function can be used as an [aggregate function](../functions-aggregation.md) or
  a [window function](../functions-window-syntax.md).
* DISTINCT can be included as an argument, but has no effect.

## Examples

```sqlexample
USE SCHEMA snowflake_sample_data.tpch_sf1;

SELECT MINHASH_COMBINE(mh) FROM
    (
      (SELECT MINHASH(5, c2) mh FROM orders WHERE c2 <= 10000)
        UNION
      (SELECT MINHASH(5, c2) mh FROM orders WHERE c2 > 10000 AND c2 <= 20000)
        UNION
      (SELECT MINHASH(5, C2) mh FROM orders WHERE c2 > 20000)
    );

+-----------------------+
| MINHASH_COMBINE(MH)   |
|-----------------------|
| {                     |
|   "state": [          |
|     628914288006793,  |
|     1071764954434168, |
|     991489123966035,  |
|     2395105834644106, |
|     680224867834949   |
|   ],                  |
|   "type": "minhash",  |
|   "version": 1        |
| }                     |
+-----------------------+
```

Here is a more extensive example, showing the three related functions
MINHASH, MINHASH_COMBINE and APPROXIMATE_SIMILARITY. This
example creates 3 tables (`ta`, `tb`, and `tc`), two of which (`ta` and `tb`) are
similar, and two of which (`ta` and `tc`) are completely dissimilar.

Create and populate tables with values:

```sqlexample
CREATE TABLE ta (i INTEGER);
CREATE TABLE tb (i INTEGER);
CREATE TABLE tc (i INTEGER);

INSERT INTO ta (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (10);
INSERT INTO tb (i) VALUES (1), (2), (3), (4), (5), (6), (7), (8), (9), (11);
INSERT INTO tc (i) VALUES (-1), (-20), (-300), (-4000);
```

Calculate minhash info for the initial set of data:

```sqlexample
CREATE TABLE minhash_a_1 (mh) AS SELECT MINHASH(100, i) FROM ta;
CREATE TABLE minhash_b (mh) AS SELECT MINHASH(100, i) FROM tb;
CREATE TABLE minhash_c (mh) AS SELECT MINHASH(100, i) FROM tc;
```

Add more data to one of the tables:

```sqlexample
INSERT INTO ta (i) VALUES (12);
```

Demonstrate the MINHASH_COMBINE function:

```sqlexample
CREATE TABLE minhash_a_2 (mh) AS SELECT MINHASH(100, i) FROM ta WHERE i > 10;

CREATE TABLE minhash_a (mh) AS
  SELECT MINHASH_COMBINE(mh)
    FROM (
      (SELECT mh FROM minhash_a_1)
      UNION ALL
      (SELECT mh FROM minhash_a_2)
    );
```

This query shows the approximate similarity of the two similar tables
(`ta` and `tb`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_b)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                        0.75 |
+-----------------------------+
```

This query shows the approximate similarity of the two very different tables
(`ta` and `tc`):

```sqlexample
SELECT APPROXIMATE_SIMILARITY(mh)
  FROM (
    (SELECT mh FROM minhash_a)
    UNION ALL
    (SELECT mh FROM minhash_c)
  );
```

```output
+-----------------------------+
| APPROXIMATE_SIMILARITY (MH) |
|-----------------------------|
|                           0 |
+-----------------------------+
```

---
title: MOD
source: https://docs.snowflake.com/en/sql-reference/functions/mod.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# MOD

Returns the remainder of input `expr1` divided by input `expr2`.

Equivalent to the modulo [arithmetic operator](../operators-arithmetic.md) (for example, `expr1 % expr2`).

## Syntax

```sqlsyntax
MOD( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A numeric expression.

`expr2`
:   A numeric expression.

## Returns

Returns either an integer or a fixed-point decimal number.

## Usage notes

* Both `expr1` and `expr2` must be numeric expressions.
  They aren’t required to be integers.
* The returned value is the remainder from truncation-based division (rounding towards zero), not floor-based
  division (rounding down). Therefore, if `expr1` is negative, the returned value is negative. This
  behavior is different from some programming languages (such as Python), but consistent with standard SQL. For
  more information, see the [Modulo Wikipedia page](https://en.wikipedia.org/wiki/Modulo).

## Examples

The following example shows usage of the `MOD()` function on both integer
and non-integer values:

> ```sqlexample
> SELECT MOD(3, 2) AS mod1, MOD(4.5, 1.2) AS mod2;
> ```
>
> Output:
>
> ```sqlexample
> +------+------+
> | MOD1 | MOD2 |
> +------+------+
> |    1 |  0.9 |
> +------+------+
> ```

---
title: MODE
source: https://docs.snowflake.com/en/sql-reference/functions/mode.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# MODE

Returns the most frequent value for the values within `expr1`. NULL values are ignored. If all the values are
NULL, or there are 0 rows, then the function returns NULL.

## Syntax

**Aggregate function**

```sqlsyntax
MODE( <expr1> )
```

**Window function**

```sqlsyntax
MODE( <expr1> ) OVER ( [ PARTITION BY <expr2> ] )
```

## Arguments

`expr1`
:   This expression produces the values that are searched to find the most frequent value. The expression can be of any of the following data types:

    > * BINARY
    > * BOOLEAN
    > * DATE
    > * FLOAT
    > * INTEGER
    > * NUMBER
    > * TIMESTAMP (TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ)
    > * VARCHAR
    > * VARIANT

    This function does not support the following data types:

    > * ARRAY
    > * GEOGRAPHY
    > * OBJECT

`expr2`
:   The optional expression on which to partition the data into groups. The output contains the most frequent
    value for each group/partition.

## Returns

The data type of the returned value is identical to the data type of the input expression.

## Usage notes

* If there is a tie for most frequent value (two or more values occur as frequently as each other, and
  more frequently than any other value), MODE returns one of those values.
* DISTINCT is not supported for this function.
* Even if NULL is the most frequent value, the function does not return NULL (unless all values are NULL).

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

The following code demonstrates the use of the MODE function:

Create a table and data:

```sqlexample
CREATE OR REPLACE TABLE aggr (k INT, v DECIMAL(10,2));
```

Get the MODE value for column `v`. The function returns NULL because there are no rows.

```sqlexample
SELECT MODE(v)
  FROM aggr;
```

```output
+---------+
| MODE(V) |
|---------|
|    NULL |
+---------+
```

Insert some rows:

```sqlexample
INSERT INTO aggr (k, v) VALUES
  (1, 10),
  (1, 10),
  (1, 10),
  (1, 10),
  (1, 20),
  (1, 21);
```

The MODE function returns the most frequent value `10`:

```sqlexample
SELECT MODE(v)
  FROM aggr;
```

```output
+---------+
| MODE(V) |
|---------|
|   10.00 |
+---------+
```

Insert some more rows:

```sqlexample
INSERT INTO aggr (k, v) VALUES
  (2, 20),
  (2, 20),
  (2, 25),
  (2, 30);
```

Now there are two most frequent values. The MODE function selects the value `10`:

```sqlexample
SELECT MODE(v)
  FROM aggr;
```

```output
+---------+
| MODE(V) |
|---------|
|   10.00 |
+---------+
```

Insert a row with a NULL value:

```sqlexample
INSERT INTO aggr (k, v) VALUES (3, NULL);
```

Get the MODE value for each group. Note that because group `k = 3` only contains NULL values, the returned
value for that group is NULL.

```sqlexample
SELECT k, MODE(v)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+---------+
| K | MODE(V) |
|---+---------|
| 1 |   10.00 |
| 2 |   20.00 |
| 3 |    NULL |
+---+---------+
```

The MODE function can also be used as a basic window function with an OVER clause:

```sqlexample
SELECT k, v, MODE(v) OVER (PARTITION BY k)
  FROM aggr
  ORDER BY k, v;
```

```output
+---+-------+-------------------------------+
| K |     V | MODE(V) OVER (PARTITION BY K) |
|---+-------+-------------------------------|
| 1 | 10.00 |                         10.00 |
| 1 | 10.00 |                         10.00 |
| 1 | 10.00 |                         10.00 |
| 1 | 10.00 |                         10.00 |
| 1 | 20.00 |                         10.00 |
| 1 | 21.00 |                         10.00 |
| 2 | 20.00 |                         20.00 |
| 2 | 20.00 |                         20.00 |
| 2 | 25.00 |                         20.00 |
| 2 | 30.00 |                         20.00 |
| 3 |  NULL |                          NULL |
+---+-------+-------------------------------+
```

---
title: MODEL_MONITOR_DRIFT_METRIC
source: https://docs.snowflake.com/en/sql-reference/functions/model-monitor-drift-metric.md
section: SQL Functions
---

Categories:
:   [Model monitor functions](../functions-model-monitors.md)

# MODEL_MONITOR_DRIFT_METRIC

Gets drift metrics from a [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md). Each model monitor monitors one machine learning model.

See also:
:   [Querying monitoring results](../../developer-guide/snowflake-ml/model-registry/model-observability.md) for more information.

## Syntax

```sqlsyntax
MODEL_MONITOR_DRIFT_METRIC(
  <model_monitor_name>, <drift_metric_name>, <column_name>
  [ , <granularity> [ , <start_time>  [ , <end_time> [ , <extra_args> ] ] ] ]
)
```

## Arguments

**Required:**

`model_monitor_name`
:   Name of the model monitor used to compute the metric.

    Valid values: A string that’s the name of the model monitor. It can be a simple or fully qualified name.

`drift_metric_name`
:   Name of the metric.

    Valid values:

    * `'JENSEN_SHANNON'`
    * `'DIFFERENCE_OF_MEANS'`
    * `'WASSERSTEIN'`
    * `'POPULATION_STABILITY_INDEX'`

`column_name`
:   Name of the column used to compute drift.

    Valid values: Any string that exists as a feature column, prediction column, or actual column in the model monitor.

**Optional:**

`granularity`
:   Granularity of the time range being queried. Default value is `1 DAY`.

    Valid values:

    * `'<num> DAY'`
    * `'<num> WEEK'`
    * `'<num> MONTH'`
    * `'<num> QUARTER'`
    * `'<num> YEAR'`
    * `'ALL'`
    * `NULL`

`start_time`
:   Start of the time range used to compute the metric. The default value is 60 days before the current time, and is calculated each time you call the function.

    Valid values: A timestamp expression or `NULL`.

`end_time`
:   End of the time range used to compute the metric. The default value is the current time, and is calculated each time you call the function.

    Valid values: A timestamp expression or `NULL`.

`extra_args`
:   Additional arguments for segment-specific queries. This parameter is optional - if not provided, the query returns metrics for all data (non-segment query).

    Valid values: A string in JSON format specifying segment column and value pairs: `'{"SEGMENTS": [{"column": "<segment_column_name>", "value": "<segment_value>"}]}'`

    > **Note:**
    >
    > Currently, segment queries support only 1 segment column:value pair per query. You cannot query multiple segments simultaneously in a single function call.

    For more information about segments, see [ML Observability: Monitoring model behavior over time](../../developer-guide/snowflake-ml/model-registry/model-observability.md).

## Returns

| Column | Description | Example values |
| --- | --- | --- |
| `EVENT_TIMESTAMP` | Timestamp at the start of the time range. | `2024-01-01 00:00:00.000` |
| `METRIC_VALUE` | Value of the metric within the specified time range. | `5` |
| `COL_COUNT_USED` | Number of records used to compute the metric. | `100` |
| `COL_COUNT_UNUSED` | Number of records excluded from the metric computation. | `10` |
| `BASELINE_COL_COUNT_USED` | Number of records used to compute the metric. | `10` |
| `BASELINE_COL_COUNT_UNUSED` | Number of records excluded from the metric computation. | `0` |
| `METRIC_NAME` | Name of the drift metric that has been computed. | `DIFFERENCE_OF_MEANS` |
| `COLUMN_NAME` | Name of the column for which the drift metric has been computed. | `FEATURE_NAME` |
| `SEGMENT_COLUMN` | Name of the segment column for which the metric is computed (or NULL for non-segment queries). | `CUSTOMER_TIER` |
| `SEGMENT_VALUE` | Segment value for which the metric is computed (or NULL for non-segment queries). | `PREMIUM` |

## Usage Notes

The model monitor must have a baseline set for the drift metric to be computed.

You might run into errors if you:

* Don’t set a baseline for the model monitor.
* Requested a numerical drift metric for a non-numeric feature.
* Use a drift metric that doesn’t exist in the model monitor.

If values you’ve specified for `column_name` or `model_monitor_name` are case-sensitive, or contain special characters or spaces, enclose them in double quotes.
You must enclose the double quotes within single quotes, such as `'"<model_monitor_name>"'`.

If double-quotes are not provided in these two fields, the `column_name` or `model_monitor_name` will be assumed to be case-insensitive.

To minimize potential impact from schema changes, update your queries to explicitly select only the necessary columns instead of using a wildcard (\*).

## Examples

The following example gets the differences of means drift metric for `MY_MONITOR` over a one-day period:

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_DRIFT_METRIC(
'MY_MONITOR', 'DIFFERENCE_OF_MEANS', 'MODEL_PREDICTION', '1 DAY', TO_TIMESTAMP_TZ('2024-01-01'), TO_TIMESTAMP_TZ('2024-01-02'))
)
```

The following example gets the Jensen-Shannon drift metric for `MY_MONITOR` over the last 30 days:

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_DRIFT_METRIC(
'MY_MONITOR', 'JENSEN_SHANNON', 'MODEL_PREDICTION', '1 DAY', DATEADD('DAY', -30, CURRENT_DATE()), CURRENT_DATE())
)
```

---
title: MODEL_MONITOR_PERFORMANCE_METRIC
source: https://docs.snowflake.com/en/sql-reference/functions/model-monitor-performance-metric.md
section: SQL Functions
---

Categories:
:   [Model monitor functions](../functions-model-monitors.md)

# MODEL_MONITOR_PERFORMANCE_METRIC

Gets performance metrics from a [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md). Each model monitor monitors one machine learning model.

See also:
:   [Querying monitoring results](../../developer-guide/snowflake-ml/model-registry/model-observability.md) for more information.

## Syntax

```sqlsyntax
MODEL_MONITOR_PERFORMANCE_METRIC(<model_monitor_name>, <performance_metric_name>,
    [, <granularity> [, <start_time>  [, <end_time> [, <extra_args> ] ] ] ] )
```

## Arguments

**Required:**

`MODEL_MONITOR_NAME`
:   Name of the model monitor used to compute the metric.

    Valid values:

    A string that’s the name of the model monitor. It can be a simple or fully qualified name.

`METRIC_NAME`
:   Name of the performance metric.

    Valid values if the model monitor is attached to a regression model:

    > * `'RMSE'`
    > * `'MAE'`
    > * `'MAPE'`
    > * `'MSE'`

    Valid values if the model monitor is attached to a binary classification model:

    > * `'ROC_AUC'`
    > * `'CLASSIFICATION_ACCURACY'`
    > * `'PRECISION'`
    > * `'RECALL'`
    > * `'F1_SCORE'`

**Optional:**

`GRANULARITY`
:   Granularity of the time range being queried. The default value is `1 DAY`.

    Valid values:

    > * `'<num> DAY'`
    > * `'<num> WEEK'`
    > * `'<num> MONTH'`
    > * `'<num> QUARTER'`
    > * `'<num> YEAR'`
    > * `'ALL'`
    > * `NULL`

`START_TIME`
:   Start of the time range used to compute the metric. The default value is 60 days before the current time, and is calculated each time you call the function.

    Valid values:

    > A timestamp expression or `NULL`.

`END_TIME`
:   End of the time range used to compute the metric. The default value is the current time, and is calculated each time you call the function.

    Valid values:

    > A timestamp expression or `NULL`.

`EXTRA_ARGS`
:   Additional arguments for segment-specific queries. This parameter is optional - if not provided, the query returns metrics for all data (non-segment query).

    Valid values: A string in JSON format specifying segment column and value pairs: `'{"SEGMENTS": [{"column": "<segment_column_name>", "value": "<segment_value>"}]}'`

    > **Note:**
    >
    > Currently, segment queries support only 1 segment column:value pair per query. You cannot query multiple segments simultaneously in a single function call.

    For more information about segments, see [ML Observability: Monitoring model behavior over time](../../developer-guide/snowflake-ml/model-registry/model-observability.md).

## Returns

| **Column** | **Description** | **Example values** |
| --- | --- | --- |
| `EVENT_TIMESTAMP` | Timestamp at the start of the time range. | `2024-01-01 00:00:00.000` |
| `METRIC_VALUE` | Value of the metric within the specified time range. | `0.5` |
| `COUNT_USED` | Number of records used to compute the metric. | `100` |
| `COUNT_UNUSED` | Number of records excluded from the metric computation. | `10` |
| `METRIC_NAME` | Name of the metric that has been computed. | `ROC_AUC` |
| `SEGMENT_COLUMN` | Name of the segment column for which the metric is computed (or NULL for non-segment queries). | `CUSTOMER_TIER` |
| `SEGMENT_VALUE` | Segment value for which the metric is computed (or NULL for non-segment queries). | `PREMIUM` |

## Usage Notes

If value you’ve specified for `model_monitor_name` is case-sensitive or contains special characters or spaces, enclose it in double quotes.
You must enclose the double quotes within single quotes. For example, `'"<example_model_monitor_name>"'`.

If you don’t use double-quotes, the `model_monitor_name` is assumed to be case-insensitive.

To minimize potential impact from schema changes, update your queries to explicitly select only the necessary columns instead of using a wildcard (\*).

### General requirements

* The model monitor must be associated with a model that supports the requested metric type.
* The model monitor must contain the necessary data for each metric type, as described below.

### Metric requirements

The following are the required columns to get regression metrics:

* RMSE: Requires the `prediction_score` and `actual_score` columns
* MAE: Requires the `prediction_score` and `actual_score` columns
* MAPE: Requires the `prediction_score` and `actual_score` columns

The following are the required columns to get binary classification metrics:

* ROC_AUC: Requires the `prediction_score` and `actual_class` columns
* CLASSIFICATION_ACCURACY: Requires the `prediction_class` and `actual_class` columns
* PRECISION: Requires the `prediction_class` and `actual_class` columns
* RECALL: Requires the `prediction_class` and `actual_class` columns
* F1_SCORE: Requires the `prediction_class` and `actual_class` columns

The following are the required columns to get multiclass classification metrics:

* CLASSIFICATION_ACCURACY: Requires the `prediction_class` and `actual_class` columns
* MACRO_AVERAGE_PRECISION: Requires the `prediction_class` and `actual_class` columns
* MACRO_AVERAGE_RECALL: Requires the `prediction_class` and `actual_class` columns
* MICRO_AVERAGE_PRECISION: Requires the `prediction_class` and `actual_class` columns
* MICRO_AVERAGE_RECALL: Requires the `prediction_class` and `actual_class` columns

> **Note:**
>
> For binary classification, you can use micro-average precision and recall metrics similarly to how you use classification accuracy in multi-class classification.

### Error cases

You might run into errors if you do the following:

* Request an accuracy metric without setting the corresponding prediction or actual column.
* Fail to provide data in the `actual_score` or `actual_class` column.

## Examples

The following example gets the Root Mean Square Error (RMSE) over a one-day period from the model monitor.

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_PERFORMANCE_METRIC(
'MY_MONITOR', 'RMSE', '1 DAY', TO_TIMESTAMP_TZ('2024-01-01'), TO_TIMESTAMP_TZ('2024-01-02'))
)
```

The following example gets the Root Mean Square Error (RMSE) over the last 30 days from the model monitor:

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_PERFORMANCE_METRIC(
'MY_MONITOR', 'RMSE', '1 DAY', DATEADD('DAY', -30, CURRENT_DATE()), CURRENT_DATE())
)
```

---
title: MODEL_MONITOR_STAT_METRIC
source: https://docs.snowflake.com/en/sql-reference/functions/model-monitor-stat-metric.md
section: SQL Functions
---

Categories:
:   [Model monitor functions](../functions-model-monitors.md)

# MODEL_MONITOR_STAT_METRIC

Gets count metrics from a [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md). Each model monitor monitors one machine learning model.

See also:
:   [Querying monitoring results](../../developer-guide/snowflake-ml/model-registry/model-observability.md) for more information.

## Syntax

```sqlsyntax
MODEL_MONITOR_STAT_METRIC(<model_monitor_name>, <stat_metric_name>, <column_name>
    [, <granularity> [, <start_time>  [, <end_time> [, <extra_args> ] ] ] ] )
```

## Arguments

**Required:**

`MODEL_MONITOR_NAME`
:   Name of the model monitor used to compute the metric.

    Valid values:

    A string that’s the name of the model monitor. It can be a simple or fully qualified name.

`METRIC_NAME`
:   Name of the metric.

    Valid values:

    > * `'COUNT'`
    > * `'COUNT_NULL'`

`COLUMN_NAME`
:   Name of the column used to compute the count.

    Valid values:

    Any string that exists as a feature column, prediction column, or actual column in the model monitor.

**Optional:**

`GRANULARITY`
:   Granularity of the time range being queried. The default value is `1 DAY`.

    Valid values:

    > * `'<num> DAY'`
    > * `'<num> WEEK'`
    > * `'<num> MONTH'`
    > * `'<num> QUARTER'`
    > * `'<num> YEAR'`
    > * `'ALL'`
    > * `NULL`

`START_TIME`
:   Start of the time range used to compute the metric. The default value is 60 days before the current time, and is calculated each time you call the function.

    Valid values:

    > A timestamp expression or `NULL`.

`END_TIME`
:   End of the time range used to compute the metric. The default value is the current time, and is calculated each time you call the function.

    Valid values:

    > A timestamp expression or `NULL`.

`EXTRA_ARGS`
:   Additional arguments for segment-specific queries. This parameter is optional - if not provided, the query returns metrics for all data (non-segment query).

    Valid values: A string in JSON format specifying segment column and value pairs: `'{"SEGMENTS": [{"column": "<segment_column_name>", "value": "<segment_value>"}]}'`

    > **Note:**
    >
    > Currently, segment queries support only 1 segment column:value pair per query. You cannot query multiple segments simultaneously in a single function call.

    For more information about segments, see [ML Observability: Monitoring model behavior over time](../../developer-guide/snowflake-ml/model-registry/model-observability.md).

## Returns

| **Column** | **Description** |
| --- | --- |
| `EVENT_TIMESTAMP` | Timestamp at the start of the time range. |
| `METRIC_VALUE` | Value of the metric within the specified time range. |
| `METRIC_NAME` | Name of the metric that has been computed. |
| `COLUMN_NAME` | Name of the column for which the stat metric has been computed. |
| `SEGMENT_COLUMN` | Name of the segment column for which the metric is computed (or NULL for non-segment queries). |
| `SEGMENT_VALUE` | Segment value for which the metric is computed (or NULL for non-segment queries). |

## Usage Notes

The model monitor must have the column being used to calculate the metric.

If the values you’ve specified for `column_name` or `model_monitor_name` are case-sensitive or contain special characters or spaces, enclose them in double quotes.
You must enclose the double quotes within single quotes. For example, `'"<example_model_monitor_name>"'`.

If double-quotes are not provided in these two fields, the `column_name` or `model_monitor_name` are assumed to be case-insensitive.

To minimize potential impact from schema changes, update your queries to explicitly select only the necessary columns instead of using a wildcard (\*).

## Examples

The following example gets count metrics for the specified model monitor and time range:

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_STAT_METRIC(
'MY_MONITOR', 'COUNT', 'MODEL_PREDICTION', '1 DAY', TO_TIMESTAMP_TZ('2024-01-01')
, TO_TIMESTAMP_TZ('2024-01-02'))
)
```

The following example gets count metric for `MY_MONITOR` over the last 30 days:

```sqlexample
SELECT * FROM TABLE(MODEL_MONITOR_STAT_METRIC(
'MY_MONITOR', 'COUNT', 'MODEL_PREDICTION', '1 DAY', DATEADD('DAY', -30, CURRENT_DATE()), CURRENT_DATE())
)
```

---
title: MONTHNAME
source: https://docs.snowflake.com/en/sql-reference/functions/monthname.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# MONTHNAME

Returns the three-letter month name for the specified date or timestamp.

## Syntax

```sqlsyntax
MONTHNAME( <date_or_timestamp_expr> )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

## Returns

This function returns a value of type VARCHAR.

## Usage notes

To return the full month name instead of the three-letter month name, you can use the
[TO_CHAR](to_char.md) function with the [TO_DATE](to_date.md) or [TO_TIMESTAMP](to_timestamp.md)
function. The following example uses the TO_CHAR and TO_DATE functions to return the full month name for
the date `2025-01-01`:

```sqlexample
SELECT TO_CHAR(TO_DATE('2025-01-01'), 'MMMM') AS full_month_name;
```

```output
+-----------------+
| FULL_MONTH_NAME |
|-----------------|
| January         |
+-----------------+
```

## Examples

The following examples use the MONTHNAME function.

Return the three-letter month name of a date:

```sqlexample
SELECT MONTHNAME(TO_DATE('2025-01-01')) AS month;
```

```output
+-------+
| MONTH |
|-------|
| Jan   |
+-------+
```

Return the three-letter month name of a timestamp:

```sqlexample
SELECT MONTHNAME(TO_TIMESTAMP('2025-04-03 10:00')) AS month;
```

```output
+-------+
| MONTH |
|-------|
| Apr   |
+-------+
```

Return the three-letter month name of DATE values in a column.

First, create a table with a DATE column and insert various DATE values:

```sqlexample
CREATE OR REPLACE TABLE monthname_function_demo (d DATE);

INSERT INTO monthname_function_demo (d) VALUES
  ('2024-01-01'::DATE),
  ('2024-02-02'::DATE),
  ('2024-03-03'::DATE),
  ('2024-04-04'::DATE),
  ('2024-05-05'::DATE),
  ('2024-06-06'::DATE),
  ('2024-07-07'::DATE),
  ('2024-08-08'::DATE),
  ('2024-09-09'::DATE),
  ('2024-10-10'::DATE),
  ('2024-11-11'::DATE),
  ('2024-12-12'::DATE);
```

Use the MONTHNAME function in a query to return the three-letter month name of each
value in the `d` column:

```sqlexample
SELECT d,
       MONTHNAME(d) AS month
  FROM monthname_function_demo;
```

```output
+------------+-------+
| D          | MONTH |
|------------+-------|
| 2024-01-01 | Jan   |
| 2024-02-02 | Feb   |
| 2024-03-03 | Mar   |
| 2024-04-04 | Apr   |
| 2024-05-05 | May   |
| 2024-06-06 | Jun   |
| 2024-07-07 | Jul   |
| 2024-08-08 | Aug   |
| 2024-09-09 | Sep   |
| 2024-10-10 | Oct   |
| 2024-11-11 | Nov   |
| 2024-12-12 | Dec   |
+------------+-------+
```

---
title: MONTHS_BETWEEN
source: https://docs.snowflake.com/en/sql-reference/functions/months_between.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# MONTHS_BETWEEN

Returns the number of months between two DATE or TIMESTAMP values.

For example, `MONTHS_BETWEEN('2020-02-01'::DATE, '2020-01-01'::DATE)` returns 1.0.

See also:
:   [DATEDIFF](datediff.md)

## Syntax

```sqlsyntax
MONTHS_BETWEEN( <date_expr1> , <date_expr2> )
```

## Arguments

`date_expr1`
:   The date to subtract from.

`date_expr2`
:   The date to subtract.

## Returns

A FLOAT representing the number of months between the two dates.

The number is calculated as described below:

* The integer portion of the FLOAT is calculated using the year and month parts of the input values.
* In most situations, the fractional portion is calculated using the day and time parts of the input values.
  (When calculating the fraction of a month, the function considers each month to be 31 days long.)

  However, there are two exceptions:

  > + If the days of the month are the same (e.g. February 28 and March 28), the fractional portion is zero,
  >   even if one or both input values are timestamps and the times differ.
  > + If the days of the month are both the last day of the month (e.g. February 28 and March 31), the fractional
  >   portion is zero, even if the days of the month are not the same.
  >
  > For example, the function considers each of the following pairs of dates/timestamps to be exactly 1.0 months apart:
  >
  > | Date/Timestamp 1 | Date/Timestamp 2 | Notes |
  > | --- | --- | --- |
  > | 2019-03-01 02:00:00 | 2019-02-01 13:00:00 | Same day of each month. |
  > | 2019-03-28 | 2019-02-28 | Same day of each month. |
  > | 2019-03-31 | 2019-02-28 | Last day of each month. |
  > | 2019-03-31 01:00:00 | 2019-02-28 13:00:00 | Last day of each month. |

## Usage notes

* If date (or timestamp) d1 represents an earlier point in time than d2, then `MONTHS_BETWEEN(d1, d2)`
  returns a negative value; otherwise it returns a positive value. More generally, swapping
  the inputs reverses the sign: `MONTHS_BETWEEN(d1, d2)` = `-MONTHS_BETWEEN(d2, d1)`.
* You can use a DATE value for one input parameter and a TIMESTAMP for the other.
* If you use one or more TIMESTAMP values but do not want fractional differences based on time of day, then cast your
  TIMESTAMP expressions to DATE.
* If you only want integer values, you can truncate, round, or cast the value. For example:

  ```sqlexample
  SELECT
      ROUND(MONTHS_BETWEEN('2019-03-31 12:00:00'::TIMESTAMP,
                           '2019-02-28 00:00:00'::TIMESTAMP)) AS MonthsBetween1;
  +----------------+
  | MONTHSBETWEEN1 |
  |----------------|
  |              1 |
  +----------------+
  ```
* If any input is NULL, the result is NULL.

## Examples

This example shows differences in whole months. The first pair of dates have the same day of the month (the 15th).
The second pair of dates are both the last days in their respective months (February 28th and March 31st).

> ```sqlexample
> SELECT
>     MONTHS_BETWEEN('2019-03-15'::DATE,
>                    '2019-02-15'::DATE) AS MonthsBetween1,
>     MONTHS_BETWEEN('2019-03-31'::DATE,
>                    '2019-02-28'::DATE) AS MonthsBetween2;
> +----------------+----------------+
> | MONTHSBETWEEN1 | MONTHSBETWEEN2 |
> |----------------+----------------|
> |       1.000000 |       1.000000 |
> +----------------+----------------+
> ```

The next example shows differences in fractional months.

> * For the first column, the function is passed two dates.
> * For the second column, the function is passed two timestamps
>   that represent the same two dates as were used for the first column, but with different times.
>   The difference in the second column is larger than the first column due to the differences in time.
> * For the third column, the function is passed two timestamps that represent
>   the same day of their respective months. This causes the function to ignore
>   any time differences between the timestamps, so the fractional part is 0.
>
> ```sqlexample
> SELECT
>     MONTHS_BETWEEN('2019-03-01'::DATE,
>                    '2019-02-15'::DATE) AS MonthsBetween1,
>     MONTHS_BETWEEN('2019-03-01 02:00:00'::TIMESTAMP,
>                    '2019-02-15 01:00:00'::TIMESTAMP) AS MonthsBetween2,
>     MONTHS_BETWEEN('2019-02-15 02:00:00'::TIMESTAMP,
>                    '2019-02-15 01:00:00'::TIMESTAMP) AS MonthsBetween3
>     ;
> +----------------+----------------+----------------+
> | MONTHSBETWEEN1 | MONTHSBETWEEN2 | MONTHSBETWEEN3 |
> |----------------+----------------+----------------|
> |       0.548387 |       0.549731 |       0.000000 |
> +----------------+----------------+----------------+
> ```

The fact that the function returns an integer number of months both when the days of the
month are the same (e.g. February 28 and March 28) and when the days of the month are the last day of the month
(e.g. February 28 and March 31) can lead to unintuitive behavior; specifically, increasing the first date
in the pair does not always increase the output value.
In this example, as the first date increases from March 28th to March 30th and then to March 31st, the
difference increases from 1.0 to a larger number and then decreases back to 1.0.

> * For the first column, the input dates represent the same day in different months, so
>   the function returns `0` for the fractional part of the result.
> * For the second column, the input dates represent different days in different months (and are not both the last
>   day of the month), so the function calculates the fractional part of the result.
> * For the third column, the input dates represent the last days in each of two different months, so
>   the function again returns `0` for the fractional part of the result.
>
> ```sqlexample
> SELECT
>     MONTHS_BETWEEN('2019-03-28'::DATE,
>                    '2019-02-28'::DATE) AS MonthsBetween1,
>     MONTHS_BETWEEN('2019-03-30'::DATE,
>                    '2019-02-28'::DATE) AS MonthsBetween2,
>     MONTHS_BETWEEN('2019-03-31'::DATE,
>                    '2019-02-28'::DATE) AS MonthsBetween3
>     ;
> +----------------+----------------+----------------+
> | MONTHSBETWEEN1 | MONTHSBETWEEN2 | MONTHSBETWEEN3 |
> |----------------+----------------+----------------|
> |       1.000000 |       1.064516 |       1.000000 |
> +----------------+----------------+----------------+
> ```

This example shows that reversing the order of the parameters reverses the sign of the result:

> ```sqlexample
> SELECT
>     MONTHS_BETWEEN('2019-03-01'::DATE,
>                    '2019-02-01'::DATE) AS MonthsBetween1,
>     MONTHS_BETWEEN('2019-02-01'::DATE,
>                    '2019-03-01'::DATE) AS MonthsBetween2
>     ;
> +----------------+----------------+
> | MONTHSBETWEEN1 | MONTHSBETWEEN2 |
> |----------------+----------------|
> |       1.000000 |      -1.000000 |
> +----------------+----------------+
> ```

---
title: NETWORK_RULE_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/network_rule_references.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# NETWORK_RULE_REFERENCES

Returns a row for each object with which the specified network rule is associated or returns a row for each network rule associated
with the specified container.

See also:
:   [NETWORK_RULE_REFERENCES view](../account-usage/network_rule_references.md) (Account Usage View)

## Syntax

```sqlsyntax
NETWORK_RULE_REFERENCES(
  NETWORK_RULE_NAME => '<string>'
)

NETWORK_RULE_REFERENCES(
  CONTAINER_NAME => '<container_name>' ,
  CONTAINER_TYPE => { 'INTEGRATION' | 'NETWORK_POLICY' }
)
```

## Arguments

`NETWORK_RULE_NAME => 'string'`
:   Specifies the identifier for the [network rule](../sql/create-network-rule.md).

    * The entire network rule name must be enclosed in single quotes.
    * If the network rule name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, such as `'"name"'`.

`CONTAINER_NAME => 'container_name'`
:   Specifies the name of the external access integration or network policy to which the network rule is associated.

    * The entire network rule name must be enclosed in single quotes.
    * If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quote, such as `'"<name>"'`.

`CONTAINER_TYPE => { 'INTEGRATION' | 'NETWORK_POLICY' }`
:   Specifies the object type (domain) to which the network rule is associated.

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `container_name` | VARCHAR | The name of the container to which the network policy is associated. |
| `container_type` | VARCHAR | One of the following: `NETWORK_POLICY` or `INTEGRATION`. |
| `network_rule_name` | VARCHAR | Name of the network rule. |
| `action_type` | VARCHAR | One of the following: `ALLOW` or `BLOCK`. |
| `database_name` | VARCHAR | Name of the database that contains the network rule. |
| `schema_name` | VARCHAR | Name of the schema that contains the network rule. |

## Usage notes

Use one syntax or the other. Do not mix arguments.

## Examples

Returns a row for each object to which the specified network rule is associated:

> ```sqlexample
> USE ROLE network_admin;
> USE DATABASE securitydb;
> SELECT *
>   FROM TABLE(
>     securitydb.INFORMATION_SCHEMA.NETWORK_RULE_REFERENCES(
>       NETWORK_RULE_NAME => 'securitydb.myrules.cloud_rule'
>     )
>   );
> ```

Returns a row for each network rule associated to the specified container:

> ```sqlexample
> USE ROLE network_admin;
> USE DATABASE securitydb;
> SELECT *
>   FROM TABLE(
>     securitydb.INFORMATION_SCHEMA.NETWORK_RULE_REFERENCES(
>       CONTAINER_NAME => 'my_network_policy' ,
>       CONTAINER_TYPE => 'NETWORK_POLICY'
>     )
>   );
> ```

---
title: NEXT_DAY
source: https://docs.snowflake.com/en/sql-reference/functions/next_day.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# NEXT_DAY

Returns the date of the first specified day of week (DOW) that occurs after the input date.

See also:
:   [LAST_DAY](last_day.md) , [PREVIOUS_DAY](previous_day.md)

## Syntax

```sqlsyntax
NEXT_DAY( <date_or_timetamp_expr> , <dow_string> )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

`dow_string`
:   Specifies the day of week used to calculate the date for the next day. The value can be a string literal or an expression that returns a string. The string
    must start with the first two characters (case-insensitive) of the day name:

    > * `su` (Sunday)
    > * `mo` (Monday)
    > * `tu` (Tuesday)
    > * `we` (Wednesday)
    > * `th` (Thursday)
    > * `fr` (Friday)
    > * `sa` (Saturday)

    Any leading spaces and trailing characters, including spaces, in the string are ignored.

## Returns

This function returns a value of type DATE, even if `date_or_timetamp_expr` is a timestamp.

## Examples

Return the date of the next Friday that occurs after the current date:

```sqlexample
SELECT CURRENT_DATE() AS "Today's Date",
       NEXT_DAY("Today's Date", 'Friday') AS "Next Friday";
```

```output
+--------------+-------------+
| Today's Date | Next Friday |
|--------------+-------------|
| 2025-05-06   | 2025-05-09  |
+--------------+-------------+
```

Your output will be different because the example uses the [CURRENT_DATE](current_date.md) function.

---
title: NORMAL
source: https://docs.snowflake.com/en/sql-reference/functions/normal.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# NORMAL

Generates a normally-distributed pseudo-random floating point number with specified
`mean` and `stddev` (standard deviation).

## Syntax

```sqlsyntax
NORMAL( <mean> , <stddev> , <gen> )
```

## Arguments

`mean`
:   A constant specifying the value that the output values should be centered on.

`stddev`
:   A constant specifying the width of one standard deviation.

    For example, if you specify a mean of 0.0 and a standard deviation of 1.0,
    approximately 68.2% of returned values from multiple calls will be between
    -1.0 and +1.0 (i.e. within one standard deviation of the mean).

    Similarly, if you choose a mean of 5.0 and a standard deviation of 2, then
    approximately 68.2% of values will be between 3.0 and 7.0.

`gen`
:   An expression that serves as a raw source of uniform random numbers,
    typically the `RANDOM` function. For more information, see the Data
    Generation Functions [Usage notes](../functions-data-generation.md).

## Returns

Returns a random floating-point number. The accumulated results of a large
number of repeated calls approximate a normal distribution.

## Usage notes

This function is related to, but different from, the
[RANDOM](random.md) function, both in the ranges
of the values returned and their distribution.

* `RANDOM` generates random 64-bit integers in a uniform distribution.
  It accepts an optional seed that allows random sequences to be repeated.

  When `RANDOM` is called a large number of times, the results are more or less
  evenly distributed over the range of possible values. For example, the number of
  results with values between 1000 and 2000 is similar to the number of
  values between 2000 and 3000.
* `NORMAL` generates random integer or floating-point numbers centered on the
  specified mean, with the specified standard deviation.

  When `NORMAL` is called a large number of times, the distribution of the
  results is likely to approximate a “normal” curve (a “bell-shaped curve”).
  The center of the curve and its “breadth” are influenced by the `mean`
  and `stddev` parameters. Values closer to the specified mean are more
  likely to occur than values far from the mean.

## Examples

This shows typical usage with a mean of 0 and a standard deviation of 1:

> ```sqlexample
> SELECT normal(0, 1, random()) FROM table(generator(rowCount => 5));
>
> +------------------------+
> | NORMAL(0, 1, RANDOM()) |
> |------------------------|
> |           0.227384164  |
> |           0.9945290748 |
> |          -0.2045078571 |
> |          -1.594607893  |
> |          -0.8213296842 |
> +------------------------+
> ```

This shows that if the `gen` parameter is a constant, then the
output is a constant:

> ```sqlexample
> SELECT normal(0, 1, 1234) FROM table(generator(rowCount => 5));
>
> +--------------------+
> | NORMAL(0, 1, 1234) |
> |--------------------|
> |      -0.6604156716 |
> |      -0.6604156716 |
> |      -0.6604156716 |
> |      -0.6604156716 |
> |      -0.6604156716 |
> +--------------------+
> ```

---
title: NOTIFICATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/notification_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# NOTIFICATION_HISTORY

This table function can be used to query the history of notifications sent through Snowflake. These notifications include:

* [Notifications about errors in tasks](../../user-guide/tasks-errors.md).
* [Notifications about errors in Snowpipe](../../user-guide/data-load-snowpipe-errors.md).
* [Notifications sent by calling SYSTEM$SEND_EMAIL or SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../../user-guide/notifications/about-notifications.md).

The rows returned represent:

* Requests that are being processed.
* Failed attempts at sending notifications.
* Notifications that were sent successfully.

The STATUS column indicates what each row represents. See
Examples of output from the function.

## Syntax

```sqlsyntax
NOTIFICATION_HISTORY(
  [ START_TIME => <constant_expr> ]
  [, END_TIME => <constant_expr> ]
  [, INTEGRATION_NAME => '<string>' ]
  [, RESULT_LIMIT => <integer> ] )
```

## Arguments

All the arguments are optional.

`START_TIME=> constant_expr` , . `END_TIME=> constant_expr`
:   Time range (in TIMESTAMP_LTZ format) when the notification is sent out.

    * If START_TIME is not specified, the range starts 24 hours prior to the END_TIME.
    * If END_TIME is not specified, the default is [CURRENT_TIMESTAMP](current_timestamp.md).

    The maximum time range is 14 days.

`INTEGRATION_NAME => 'string'`
:   The fully qualified name of the integration that is tied with the notification. If you omit this argument, the function returns
    all notifications.

    Default: An empty string.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function.

    Range: `1` to `10000`

    Default: `100`

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED | TIMESTAMP_LTZ | Timestamp when the notification was created. |
| PROCESSED | TIMESTAMP_LTZ | Timestamp of the last attempt to send the notification. |
| MESSAGE_SOURCE | VARCHAR | Type of object or feature that generated the notification. Valid values include:   * `BUDGET` (for [notifications from budgets](../../user-guide/budgets.md)) * `TASK` (for [notifications from tasks](../../user-guide/tasks-errors.md)) * `SNOWPIPE` (for [notifications from Snowpipe](../../user-guide/data-load-snowpipe-errors.md)) * `STORED_PROCEDURE` (for email notifications sent by   [calling the SYSTEM$SEND_EMAIL or SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure](../../user-guide/notifications/about-notifications.md)) |
| INTEGRATION_NAME | VARCHAR | Name of the [integration used for this notification](../sql/create-notification-integration.md). |
| STATUS | VARCHAR | Status of the notification. Valid values are:   * `QUEUED`: The request to send the notification is being processed. * `SUCCESS`: The notification was sent successfully. * `RETRIABLE_FAILURE`: The attempt to send the notification failed, and the system will attempt to send the   notification again. * `FAILURE`: Multiple attempts to send the notification failed, and there will be no more attempts to send the   notification. |
| ERROR_MESSAGE | VARCHAR | If the notification failed, provides details about why the notification failed.  **Note:** For webhook notifications, this column contains the body of the HTTP response, which might contain sensitive data. Before using this data, make sure to sanitize it. |
| ID | VARCHAR | Unique ID of a request to send a notification.  If Snowflake fails to send a notification and attempts to send the notification again, the function returns a row for each attempt. Each row for an attempt has the same value in the ID column but a different value in the ATTEMPT column. |
| ATTEMPT | INTEGER | Number of the attempt made to send the notification. |
| MESSAGE_SOURCE_INFO | OBJECT | Object containing information about the source of the notification. The fields in this object depend on the type of the source.   * For notifications for budgets, the object contains the following fields:    + `budget_id`: Identifier for the budget.   + `budget_name`: The name of the budget. * For error notifications for tasks, the object contains the following fields:    + `name`: The name of the task   + `graph_run_group_id`: Identifier for the graph run.   + `attempt_number`: Integer representing the number of the attempt to run this task. * For error notifications for Snowpipe, the object contains the `pipe_name` field, which specifies the name of the pipe. * For notifications sent by calling the SYSTEM$SEND_SNOWFLAKE_NOTIFICATION or SYSTEM$SEND_EMAIL stored procedure, the   object contains the `query_id` field, which specifies the ID of the statement that called the stored procedure. |

## Usage notes

* Returns results only for the ACCOUNTADMIN role, the integration owner (i.e. the role with the OWNERSHIP privilege on the
  integration) or a role with the USAGE privilege on the integration.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the
  function name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Examples

The following sections contain examples of calling the function and examples of output from the function:

* Examples of calling the function
* Examples of output from the function

### Examples of calling the function

The following examples demonstrate how to call this function:

* Retrieving the most recent notifications
* Retrieving notifications by time and integration name

#### Retrieving the most recent notifications

Retrieve the most recent notifications that were created in the past 24 hours.

```sqlexample
SELECT * FROM TABLE(INFORMATION_SCHEMA.NOTIFICATION_HISTORY());
```

#### Retrieving notifications by time and integration name

Retrieve the most recent notifications that were created in the past hour and sent using the integration named `my_integration`.

```sqlexample
SELECT * FROM TABLE(INFORMATION_SCHEMA.NOTIFICATION_HISTORY(
  START_TIME=>DATEADD('hour',-1,CURRENT_TIMESTAMP()),
  END_TIME=>CURRENT_TIMESTAMP(),
  RESULT_LIMIT=>100,
  INTEGRATION_NAME=>'my_integration'));
```

### Examples of output from the function

The following examples explain the output returned by this function for notification requests in different states:

* Example of the output when two attempts fail and a third attempt is in progress
* Example of the output when two attempts fail and a third attempt succeeds

#### Example of the output when two attempts fail and a third attempt is in progress

This example selects a subset of the columns in the output:

```sqlexample
SELECT id, attempt, created, processed, status
  FROM TABLE(INFORMATION_SCHEMA.NOTIFICATION_HISTORY());
```

The output includes the rows that represent the attempts to send one notification. In the output:

* The ID column identifies the notification that is being sent.
* The first two attempts to send the notification have failed, but the system can attempt to send the notification again (as
  indicated by the value `RETRIABLE_FAILURE` in the STATUS column).
* A third attempt is being processed, as indicated by the value `QUEUED` in the STATUS column.

```output
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
|   ID              |   ATTEMPT   |   CREATED                         |   PROCESSED                       |   STATUS              |
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
|   10ae695e-93c3   |   3         |   2023-12-05 15:10:15.194 -0800   |   NULL                            |   QUEUED              |
|   10ae695e-93c3   |   2         |   2023-12-05 15:10:15.194 -0800   |   2023-12-05 15:11:21.443 -0800   |   RETRIABLE_FAILURE   |
|   10ae695e-93c3   |   1         |   2023-12-05 15:10:15.194 -0800   |   2023-12-05 15:10:21.443 -0800   |   RETRIABLE_FAILURE   |
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
```

#### Example of the output when two attempts fail and a third attempt succeeds

This example selects a subset of the columns in the output:

```sqlexample
SELECT id, attempt, created, processed, status
  FROM TABLE(INFORMATION_SCHEMA.NOTIFICATION_HISTORY());
```

The output includes the rows that represent the attempts to send one notification. In the output:

* The ID column identifies the notification that is being sent.
* The first two attempts to send the notification have failed, but the system can attempt to send the notification again (as
  indicated by the value `RETRIABLE_FAILURE` in the STATUS column).
* A third attempt succeeded, as indicated by the value `SUCCESS` in the STATUS column.

```output
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
|   ID              |   ATTEMPT   |   CREATED                         |   PROCESSED                       |   STATUS              |
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
|   10ae695e-93c3   |   3         |   2023-12-05 15:10:15.194 -0800   |   2023-12-05 15:12:21.443 -0800   |   SUCCESS             |
|   10ae695e-93c3   |   2         |   2023-12-05 15:10:15.194 -0800   |   2023-12-05 15:11:21.443 -0800   |   RETRIABLE_FAILURE   |
|   10ae695e-93c3   |   1         |   2023-12-05 15:10:15.194 -0800   |   2023-12-05 15:10:21.443 -0800   |   RETRIABLE_FAILURE   |
+-------------------+-------------+-----------------------------------+-----------------------------------+-----------------------+
```

---
title: NTH_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/nth_value.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# NTH_VALUE

Returns the nth value (up to 1000) within an ordered group of values.

See also:
:   [FIRST_VALUE](first_value.md) , [LAST_VALUE](last_value.md)

## Syntax

```sqlsyntax
NTH_VALUE( <expr> , <n> ) [ FROM { FIRST | LAST } ] [ { IGNORE | RESPECT } NULLS ]
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`n`
:   This specifies which value of N to use when looking for the Nth value.

`expr`
:   The expression that determines the return value.

`expr1`
:   The expression by which to partition the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    PARTITION BY column_1, column_2
    ```

`expr2`
:   The expression by which to order the rows. You can specify a single expression or a comma-separated list of expressions.
    For example:

    ```sqlexample
    ORDER BY column_3, column_4
    ```

`FROM { FIRST | LAST }`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `FROM FIRST` starts from the beginning of the ordered list and moves forward.
    * `FROM LAST` starts from the end of the ordered list and moves backward.

    Default: `FROM FIRST`

`{ IGNORE | RESPECT } NULLS`
:   Whether to ignore or respect NULL values when an `expr` contains NULL values:

    * `IGNORE NULLS` skips NULL values in the expression.
    * `RESPECT NULLS` returns a NULL value if it is the nth value in the expression.

    Default: `RESPECT NULLS`

## Usage notes

* Input value `n` can’t be greater than 1000.

* This function is a rank-related function, so it must specify a window. A window clause consists of the following subclauses:

  > + `PARTITION BY expr1` subclause (optional).
  > + `ORDER BY expr2` subclause (required). For details about additional supported ordering options (sort order, ordering
  >   of NULL values, and so on), see the documentation for the [ORDER BY](../constructs/order-by.md) clause, which follows
  >   the same rules.
  > + `window_frame` subclause (optional).
* The order of rows in a window (and thus the result of the query) is fully deterministic only if the keys in the ORDER BY clause
  make each row unique. Consider the following example:

  ```sqlexample
  ... OVER (PARTITION BY p ORDER BY o COLLATE 'lower') ...
  ```

  The query result can vary if any partition contains values of column `o` that are identical, or would be identical
  in a case-insensitive comparison.
* The ORDER BY clause inside the OVER clause controls the order of rows only within the window, not the order of rows in the output
  of the entire query. To control output order, use a separate ORDER BY clause at the outermost level of the query.

* The optional `window_frame` (cumulative or sliding) specifies the subset of rows within the window for which the function
  is calculated. If no `window_frame` is specified, the default is the entire window:

  > `ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING`

  Note that this deviates from the ANSI standard, which specifies the following default for window frames:

  > `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).

## Examples

```sqlexample
SELECT column1,
       column2,
       NTH_VALUE(column2, 2) OVER (PARTITION BY column1 ORDER BY column2) AS column2_2nd
  FROM VALUES
    (1, 10), (1, 11), (1, 12),
    (2, 20), (2, 21), (2, 22);
```

```output
+---------+---------+-------------+
| COLUMN1 | COLUMN2 | COLUMN2_2ND |
|---------+---------+-------------|
|       1 |      10 |          11 |
|       1 |      11 |          11 |
|       1 |      12 |          11 |
|       2 |      20 |          21 |
|       2 |      21 |          21 |
|       2 |      22 |          21 |
+---------+---------+-------------+
```

The following example returns the results of three related functions: [FIRST_VALUE](first_value.md),
NTH_VALUE, and [LAST_VALUE](last_value.md).

* The query creates a sliding window frame that is three rows wide, which contains:

  + The row that precedes the current row.
  + The current row.
  + The row that follows the current row.
* The `2` in the call `NTH_VALUE(menu_price_usd, 2)` specifies the second row in the window frame
  (which, in this case, is also the current row).
* When the current row is the very first row in the window frame, there is no preceding row to reference, so
  FIRST_VALUE returns a NULL for that row.
* Frame boundaries sometimes extend beyond the rows in a partition, but non-existent rows are not included in window function
  calculations. For example, when the current row is the very first row in the partition and the window frame is
  `ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING`, there is no preceding row to reference, so the FIRST_VALUE function returns the
  value of the first row in the partition.
* The results never match for all three functions, given the data in the table. These functions select the *first*,
  *last*, or *nth* value for each row in the frame, and the selection of values applies separately to each partition.

```sqlexample
SELECT menu_category, menu_item_name, menu_price_usd,
       FIRST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS first_val,
       NTH_VALUE(menu_price_usd, 2) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS nth_val,
       LAST_VALUE(menu_price_usd) OVER (PARTITION BY menu_category ORDER BY menu_price_usd
         ROWS BETWEEN 1 PRECEDING AND 1 FOLLOWING) AS last_val
  FROM menu_items
  WHERE menu_category = 'Dessert'
  ORDER BY menu_price_usd;
```

```output
+---------------+--------------------+----------------+-----------+---------+----------+
| MENU_CATEGORY | MENU_ITEM_NAME     | MENU_PRICE_USD | FIRST_VAL | NTH_VAL | LAST_VAL |
|---------------+--------------------+----------------+-----------+---------+----------|
| Dessert       | Popsicle           |           3.00 |      3.00 |    4.00 |     4.00 |
| Dessert       | Ice Cream Sandwich |           4.00 |      3.00 |    4.00 |     5.00 |
| Dessert       | Mango Sticky Rice  |           5.00 |      4.00 |    5.00 |     6.00 |
| Dessert       | Sugar Cone         |           6.00 |      6.00 |    6.00 |     7.00 |
| Dessert       | Waffle Cone        |           6.00 |      5.00 |    6.00 |     6.00 |
| Dessert       | Two Scoop Bowl     |           7.00 |      6.00 |    7.00 |     7.00 |
+---------------+--------------------+----------------+-----------+---------+----------+
```

---
title: NTILE
source: https://docs.snowflake.com/en/sql-reference/functions/ntile.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# NTILE

Divides an ordered data set equally into the number of buckets specified by `constant_value`. Buckets are sequentially numbered 1 through `constant_value`.

## Syntax

```sqlsyntax
NTILE( <constant_value> ) OVER ( [ PARTITION BY <expr1> ]
  ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] )
```

## Arguments

`constant_value`
:   The desired number of buckets; must be a positive integer value.

`expr1`
:   If you wish to partition the data into groups, specify the criterion
    (usually a column) to partition by. For example, you might partition by
    province.

`expr2`
:   The expression (usually a column) by which to order the rows in the window.
    For example, you might order by timestamp.

## Usage notes

If the data is partitioned, then the data is divided into buckets equally
within each partition. For example, if the number of buckets is 3, and if
the data is partitioned by province, then approximately 1/3 of the
rows for each province are put into each bucket.

If the statement has an ORDER BY clause for the output, as well as an ORDER BY
clause for the NTILE function, the two operate independently; the ORDER BY
for the NTILE function influences which rows are assigned to each bucket,
while the ORDER BY for the output determines the order in which the output
rows are shown.

## Examples

```sqlexample
SELECT
    exchange,
    symbol,
    NTILE(4) OVER (PARTITION BY exchange ORDER BY shares) AS ntile_4
  FROM trades
  ORDER BY exchange, NTILE_4;
```

```output
+--------+------+-------+
|exchange|symbol|NTILE_4|
+--------+------+-------+
|C       |SPY   |      1|
|C       |AAPL  |      2|
|C       |AAPL  |      3|
|N       |SPY   |      1|
|N       |AAPL  |      1|
|N       |SPY   |      2|
|N       |QQQ   |      2|
|N       |QQQ   |      3|
|N       |YHOO  |      4|
|Q       |MSFT  |      1|
|Q       |YHOO  |      1|
|Q       |MSFT  |      2|
|Q       |YHOO  |      2|
|Q       |QQQ   |      3|
|Q       |QQQ   |      4|
|P       |AAPL  |      1|
|P       |YHOO  |      1|
|P       |MSFT  |      2|
|P       |SPY   |      3|
|P       |MSFT  |      4|
+--------+------+-------+
```

---
title: NULL_COUNT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_null_count.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# NULL_COUNT (system data metric function)

Returns the total number of NULL values for the specified column in a table.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.NULL_COUNT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* DATE
* FLOAT
* NUMBER
* TIMESTAMP_LTZ
* TIMESTAMP_NTZ
* TIMESTAMP_TZ
* VARCHAR

## Returns

The function returns a scalar value with a NUMBER data type.

## Usage notes

When you call a system DMF manually, you don’t need to specify whichever allowed data type you are using. You only need to specify the
query for the column that you want to measure. Snowflake matches the allowed data type for the function with the data type for the column.

## Example

Measure the number of NULL values for the SSN column (that is, US Social Security number):

```sqlexample
SELECT SNOWFLAKE.CORE.NULL_COUNT(
  SELECT
    ssn
  FROM hr.tables.empl_info
);
```

```output
+----------------------------------------------------------------+
| SNOWFLAKE.CORE.NULL_COUNT(SELECT ssn FROM hr.tables.empl_info) |
+----------------------------------------------------------------+
| 5                                                              |
+----------------------------------------------------------------+
```

---
title: NULL_PERCENT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_null_percent.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# NULL_PERCENT (system data metric function)

Returns the percentage of columns values that are NULL for the specified column in a table.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.NULL_PERCENT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* DATE
* FLOAT
* NUMBER
* TIMESTAMP_LTZ
* TIMESTAMP_NTZ
* TIMESTAMP_TZ
* VARCHAR

## Returns

The function returns a NUMBER value.

## Example

Measure the percent of NULL values for the SSN column (i.e. US social security number):

```sqlexample
SELECT SNOWFLAKE.CORE.NULL_PERCENT(
  SELECT
    ssn
  FROM hr.tables.empl_info
);
```

```output
+----------------------------------------------------------------+
| SNOWFLAKE.CORE.NULL_COUNT(SELECT ssn FROM hr.tables.empl_info) |
+----------------------------------------------------------------+
| 1                                                              |
+----------------------------------------------------------------+
```

---
title: NULLIF
source: https://docs.snowflake.com/en/sql-reference/functions/nullif.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# NULLIF

Returns NULL if `expr1` is equal to `expr2`, otherwise returns `expr1`.

## Syntax

```sqlsyntax
NULLIF( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   Any general expression of any data type.

`expr2`
:   Any general expression that evaluates to the same data type as `expr1`.

## Returns

The data type of the returned value is the data type of `expr1`.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The collation of the result is the same as the collation of the first input.

## Examples

> ```sqlexample
> SELECT a, b, NULLIF(a,b) FROM i;
>
> --------+--------+-------------+
>    a    |   b    | nullif(a,b) |
> --------+--------+-------------+
>  0      | 0      | [NULL]      |
>  0      | 1      | 0           |
>  0      | [NULL] | 0           |
>  1      | 0      | 1           |
>  1      | 1      | [NULL]      |
>  1      | [NULL] | 1           |
>  [NULL] | 0      | [NULL]      |
>  [NULL] | 1      | [NULL]      |
>  [NULL] | [NULL] | [NULL]      |
> --------+--------+-------------+
> ```

---
title: NULLIFZERO
source: https://docs.snowflake.com/en/sql-reference/functions/nullifzero.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# NULLIFZERO

Returns NULL if the argument evaluates to `0`; otherwise, returns the argument.

## Syntax

```sqlsyntax
NULLIFZERO( <expr> )
```

## Arguments

`expr`
:   The input should be an expression that evaluates to a numeric value.

## Returns

If the value of the input expression is `0`, this returns NULL.
Otherwise, this returns the value of the input expression.

The data type of the return value is `NUMBER(p, s)` (if the input is a
[fixed-point number](../data-types-numeric.md)) or `DOUBLE` (if the
input is a [floating point number](../data-types-numeric.md)).

For fixed-point numbers, the exact values of ‘p’ (precision) and ‘s’ (scale) depend upon the input expression. For example,
if the input expression is 3.14159, then the data type of the output value will be `NUMBER(7, 5)`.

## Examples

The following examples show the output of the function for various input values:

> ```sqlexample
> SELECT NULLIFZERO(0);
> +---------------+
> | NULLIFZERO(0) |
> |---------------|
> |          NULL |
> +---------------+
> ```
>
> ```sqlexample
> SELECT NULLIFZERO(52);
> +----------------+
> | NULLIFZERO(52) |
> |----------------|
> |             52 |
> +----------------+
> ```
>
> ```sqlexample
> SELECT NULLIFZERO(3.14159);
> +---------------------+
> | NULLIFZERO(3.14159) |
> |---------------------|
> |             3.14159 |
> +---------------------+
> ```

---
title: NVL
source: https://docs.snowflake.com/en/sql-reference/functions/nvl.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# NVL

If `expr1` is NULL, returns `expr2`, otherwise returns `expr1`.

Aliases:
:   [IFNULL](ifnull.md)

## Syntax

```sqlsyntax
NVL( <expr1> , <expr2> )
```

## Arguments

`expr1`
:   A general expression.

`expr2`
:   A general expression.

## Usage notes

* Snowflake performs [implicit conversion](../data-type-conversion.md) of arguments to make
  them compatible. For example, if one of the input expressions is a numeric type, the return type
  is also a numeric type. That is, `SELECT NVL('17', 1);` first converts the VARCHAR value `'17'`
  to the NUMBER value `17`, and then returns the first non-NULL value.

  When conversion isn’t possible, implicit conversion fails. For example, `SELECT NVL('foo', 1);`
  returns an error because the VARCHAR value `'foo'` can’t be converted to a NUMBER value.

  We recommend passing in arguments of the same type or explicitly converting arguments if needed.

* When implicit conversion converts a non-numeric value to a numeric value, the result is a value
  of type NUMBER(18,5).

  For numeric string arguments that aren’t constants, if NUMBER(18,5) isn’t sufficient to represent
  the numeric value, then [cast](../data-type-conversion.md) the argument to a type that
  can represent the value.

* Either expression can include a `SELECT` statement containing set
  operators, such as `UNION`, `INTERSECT`, `EXCEPT`, and `MINUS`.
  When using set operators, make sure that data types are compatible. For
  details, see the [General usage notes](../operators-query.md) in the
  [Set operators](../operators-query.md) topic.

## Collation details

* The [collation specifications](../collation.md) of all input arguments must be compatible.
* The collation of the result of the function is the highest-[precedence](../collation.md) collation of the inputs.

## Returns

Returns the data type of the returned expression.

If both expressions are NULL, returns NULL.

## Examples

Create a table that contains contact information for suppliers:

```sqlexample
CREATE TABLE IF NOT EXISTS suppliers (
  supplier_id INT PRIMARY KEY,
  supplier_name VARCHAR(30),
  phone_region_1 VARCHAR(15),
  phone_region_2 VARCHAR(15));
```

The table contains the phone number for each supplier in two different regions. The phone number can
be NULL for a region.

Insert values into the table:

```sqlexample
INSERT INTO suppliers(supplier_id, supplier_name, phone_region_1, phone_region_2)
  VALUES(1, 'Company_ABC', NULL, '555-01111'),
        (2, 'Company_DEF', '555-01222', NULL),
        (3, 'Company_HIJ', '555-01333', '555-01444'),
        (4, 'Company_KLM', NULL, NULL);
```

The following SELECT statement uses the NVL function to
retrieve the `phone_region_1` and `phone_region_2` values.

This example shows the following results for the NVL function:

* The `IF_REGION_1_NULL` column contains the value in `phone_region_1` or, if that value is NULL, the
  value in `phone_region_2`.
* The `IF_REGION_2_NULL` column contains the value in `phone_region_2` or, if that value is NULL, the
  value in `phone_region_1`.
* If both `phone_region_1` and `phone_region_2` are NULL, the function returns NULL.

```sqlexample
SELECT supplier_id,
       supplier_name,
       phone_region_1,
       phone_region_2,
       NVL(phone_region_1, phone_region_2) IF_REGION_1_NULL,
       NVL(phone_region_2, phone_region_1) IF_REGION_2_NULL
  FROM suppliers
  ORDER BY supplier_id;
```

```output
+-------------+---------------+----------------+----------------+------------------+------------------+
| SUPPLIER_ID | SUPPLIER_NAME | PHONE_REGION_1 | PHONE_REGION_2 | IF_REGION_1_NULL | IF_REGION_2_NULL |
|-------------+---------------+----------------+----------------+------------------+------------------|
|           1 | Company_ABC   | NULL           | 555-01111      | 555-01111        | 555-01111        |
|           2 | Company_DEF   | 555-01222      | NULL           | 555-01222        | 555-01222        |
|           3 | Company_HIJ   | 555-01333      | 555-01444      | 555-01333        | 555-01444        |
|           4 | Company_KLM   | NULL           | NULL           | NULL             | NULL             |
+-------------+---------------+----------------+----------------+------------------+------------------+
```

---
title: NVL2
source: https://docs.snowflake.com/en/sql-reference/functions/nvl2.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# NVL2

Returns values depending on whether the first input is NULL:

* If `expr1` is NOT NULL, then NVL2 returns `expr2`.
* If `expr1` is NULL, then NVL2 returns `expr3`.

## Syntax

```sqlsyntax
NVL2( <expr1> , <expr2> , <expr3> )
```

## Arguments

`expr1`
:   The expression to be checked to see whether it is NULL.

`expr2`
:   If `expr1` is not NULL, this expression will be evaluated and
    its value will be returned.

`expr3`
:   If `expr1` is NULL, this expression will be evaluated and
    its value will be returned.

## Usage notes

* All three expressions should have the same (or compatible) data type.

## Collation details

* The collation specification for `expr1` is ignored because all that matters about this expression is
  whether it is NULL or not.
* The collation specifications for `expr2` and `expr3` must be compatible.
* The value returned from the function is the
  highest-[precedence](../collation.md) collation of `expr2` and
  `expr3`.

## Examples

If `a` is not null, then return `b`, else return `c`:

> ```sqlexample
> SELECT a, b, c, NVL2(a, b, c) FROM i2;
>
> --------+--------+--------+---------------+
>    A    |   B    |   C    | NVL2(A, B, C) |
> --------+--------+--------+---------------+
>  0      | 5      | 3      | 5             |
>  0      | 5      | [NULL] | 5             |
>  0      | [NULL] | 3      | [NULL]        |
>  0      | [NULL] | [NULL] | [NULL]        |
>  [NULL] | 5      | 3      | 3             |
>  [NULL] | 5      | [NULL] | [NULL]        |
>  [NULL] | [NULL] | 3      | 3             |
>  [NULL] | [NULL] | [NULL] | [NULL]        |
> --------+--------+--------+---------------+
> ```

---
title: OBJECT_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/object_agg.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Semi-structured Data) , [Window functions](../functions-window.md) (General) , [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_AGG

Returns one OBJECT per group. For each (`key`, `value`) input pair, where `key`
must be a VARCHAR and `value` must be a VARIANT, the resulting OBJECT contains
a `key`:`value` field.

Aliases:
:   OBJECTAGG

## Syntax

**Aggregate function**

```sqlsyntax
OBJECT_AGG(<key>, <value>)
```

**Window function**

```sqlsyntax
OBJECT_AGG(<key>, <value>) OVER ( [ PARTITION BY <expr2> ] )
```

## Usage notes

* Input tuples with NULL `key` and/or `value` are ignored.
* Duplicate keys within a group result in a `Duplicate field key 'key'` error.
* The DISTINCT keyword is supported, but it only filters out duplicate
  rows where both `key` and `value` are equal.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE objectagg_example(g NUMBER, k VARCHAR(30), v VARIANT);
INSERT INTO objectagg_example SELECT 0, 'name', 'Joe'::VARIANT;
INSERT INTO objectagg_example SELECT 0, 'age', 21::VARIANT;
INSERT INTO objectagg_example SELECT 1, 'name', 'Sue'::VARIANT;
INSERT INTO objectagg_example SELECT 1, 'zip', 94401::VARIANT;

SELECT * FROM objectagg_example;
```

```output
+---+------+-------+
| G |  K   |   V   |
|---+------+-------|
| 0 | name | "Joe" |
| 0 | age  | 21    |
| 1 | name | "Sue" |
| 1 | zip  | 94401 |
+---+------+-------+
```

This example uses OBJECT_AGG as an aggregate function:

```sqlexample
SELECT OBJECT_AGG(k, v) FROM objectagg_example GROUP BY g;
```

```output
+-------------------+
| OBJECT_AGG(K, V)  |
|-------------------|
| {                 |
|  "name": "Sue",   |
|   "zip": 94401    |
| }                 |
| {                 |
|  "age": 21,       |
|  "name": "Joe"    |
| }                 |
+-------------------+
```

```sqlexample
SELECT seq, key, value
  FROM (SELECT object_agg(k, v) o FROM objectagg_example GROUP BY g),
    LATERAL FLATTEN(input => o);
```

```output
+-----+------+-------+
| SEQ | KEY  | VALUE |
|-----+------+-------|
|   1 | name | "Sue" |
|   1 | zip  | 94401 |
|   2 | age  | 21    |
|   2 | name | "Joe" |
+-----+------+-------+
```

---
title: OBJECT_CONSTRUCT
source: https://docs.snowflake.com/en/sql-reference/functions/object_construct.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_CONSTRUCT

Returns an [OBJECT](../data-types-semistructured.md) constructed from the arguments.

See also:
:   [OBJECT_CONSTRUCT_KEEP_NULL](object_construct_keep_null.md)

## Syntax

```sqlsyntax
OBJECT_CONSTRUCT( [<key>, <value> [, <key>, <value> , ...]] )

OBJECT_CONSTRUCT(*)
```

## Arguments

`key`
:   The key in a key-value pair. Each key is a VARCHAR value.

`value`
:   The value that is associated with the key. The value can be any data type.

`*`
:   When invoked with an asterisk (wildcard), the OBJECT value is constructed from the
    specified data using the attribute names as keys and the associated values as values.
    See the examples below.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    You can also specify the wildcard in an [object constant](../data-types-semistructured.md).

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

## Returns

Returns a value of type [OBJECT](../data-types-semistructured.md).

## Usage notes

* If the key or value is NULL — that is, SQL NULL — the key-value pair is
  omitted from the resulting object. A key-value pair consisting of a
  string that isn’t NULL as the key and a JSON null as the value — that is,
  `PARSE_JSON('NULL')` — isn’t omitted. For more information, see
  [NULL values](../../user-guide/semistructured-considerations.md).
* The constructed object does not necessarily preserve the original order of the key-value pairs.
* In many contexts, you can use an [OBJECT constant](../data-types-semistructured.md) (also called an *OBJECT literal*) instead of
  the OBJECT_CONSTRUCT function.

## Examples

The following examples call the OBJECT_CONSTRUCT function:

### Construct a simple object

This example shows how to construct a simple object:

```sqlexample
SELECT OBJECT_CONSTRUCT('a', 1, 'b', 'BBBB', 'c', NULL);
```

```output
+--------------------------------------------------+
| OBJECT_CONSTRUCT('A', 1, 'B', 'BBBB', 'C', NULL) |
|--------------------------------------------------|
| {                                                |
|   "a": 1,                                        |
|   "b": "BBBB"                                    |
| }                                                |
+--------------------------------------------------+
```

### Construct objects by using the wildcard (\*) character

This example uses the wildcard character (`*`) to get the attribute name and the value from the FROM clause:

```sqlexample
CREATE OR REPLACE TABLE demo_table_1 (province VARCHAR, created_date DATE);
INSERT INTO demo_table_1 (province, created_date) VALUES
  ('Manitoba', '2024-01-18'::DATE),
  ('Alberta', '2024-01-19'::DATE);
```

```sqlexample
SELECT province, created_date
  FROM demo_table_1
  ORDER BY province;
```

```output
+----------+--------------+
| PROVINCE | CREATED_DATE |
|----------+--------------|
| Alberta  | 2024-01-19   |
| Manitoba | 2024-01-18   |
+----------+--------------+
```

```sqlexample
SELECT OBJECT_CONSTRUCT(*) AS oc
  FROM demo_table_1
  ORDER BY oc['PROVINCE'];
```

```output
+---------------------------------+
| OC                              |
|---------------------------------|
| {                               |
|   "CREATED_DATE": "2024-01-19", |
|   "PROVINCE": "Alberta"         |
| }                               |
| {                               |
|   "CREATED_DATE": "2024-01-18", |
|   "PROVINCE": "Manitoba"        |
| }                               |
+---------------------------------+
```

This example uses `*` and includes the ILIKE keyword to filter the output:

```sqlexample
SELECT OBJECT_CONSTRUCT(* ILIKE 'prov%') AS oc
  FROM demo_table_1
  ORDER BY oc['PROVINCE'];
```

```output
+--------------------------+
| OC                       |
|--------------------------|
| {                        |
|   "PROVINCE": "Alberta"  |
| }                        |
| {                        |
|   "PROVINCE": "Manitoba" |
| }                        |
+--------------------------+
```

This example uses `*` and includes the EXCLUDE keyword to filter the output:

```sqlexample
SELECT OBJECT_CONSTRUCT(* EXCLUDE province) AS oc
  FROM demo_table_1
  ORDER BY oc['PROVINCE'];
```

```output
+--------------------------------+
| OC                             |
|--------------------------------|
| {                              |
|   "CREATED_DATE": "2024-01-18" |
| }                              |
| {                              |
|   "CREATED_DATE": "2024-01-19" |
| }                              |
+--------------------------------+
```

This example is equivalent to the previous example, but it uses an object constant instead of
the OBJECT_CONSTRUCT function:

```sqlexample
SELECT {* EXCLUDE province} AS oc
  FROM demo_table_1
  ORDER BY oc['PROVINCE'];
```

```output
+--------------------------------+
| OC                             |
|--------------------------------|
| {                              |
|   "CREATED_DATE": "2024-01-18" |
| }                              |
| {                              |
|   "CREATED_DATE": "2024-01-19" |
| }                              |
+--------------------------------+
```

This is another example using `*`. In this case, attribute names are not specified, so Snowflake
uses `COLUMN1`, `COLUMN2`, and so on:

```sqlexample
SELECT OBJECT_CONSTRUCT(*) FROM VALUES(1,'x'), (2,'y');
```

```output
+---------------------+
| OBJECT_CONSTRUCT(*) |
|---------------------|
| {                   |
|   "COLUMN1": 1,     |
|   "COLUMN2": "x"    |
| }                   |
| {                   |
|   "COLUMN1": 2,     |
|   "COLUMN2": "y"    |
| }                   |
+---------------------+
```

### Construct objects by using a SQL NULL and a JSON null

This example constructs an object by using SQL NULL and the string `'null'`:

```sqlexample
SELECT OBJECT_CONSTRUCT(
  'Key_One', PARSE_JSON('NULL'),
  'Key_Two', NULL,
  'Key_Three', 'null') AS obj;
```

```output
+-----------------------+
| OBJ                   |
|-----------------------|
| {                     |
|   "Key_One": null,    |
|   "Key_Three": "null" |
| }                     |
+-----------------------+
```

For more information, see [NULL values](../../user-guide/semistructured-considerations.md).

### Construct objects by using expressions

OBJECT_CONSTRUCT supports expressions and queries to add, modify, or omit values from the JSON object.

```sqlexample
SELECT OBJECT_CONSTRUCT(
    'foo', 1234567,
    'dataset_size', (SELECT COUNT(*) FROM demo_table_1),
    'distinct_province', (SELECT COUNT(DISTINCT province) FROM demo_table_1),
    'created_date_seconds', extract(epoch_seconds, created_date)
  )  AS json_object
  FROM demo_table_1;
```

```output
+---------------------------------------+
| JSON_OBJECT                           |
|---------------------------------------|
| {                                     |
|   "created_date_seconds": 1705536000, |
|   "dataset_size": 2,                  |
|   "distinct_province": 2,             |
|   "foo": 1234567                      |
| }                                     |
| {                                     |
|   "created_date_seconds": 1705622400, |
|   "dataset_size": 2,                  |
|   "distinct_province": 2,             |
|   "foo": 1234567                      |
| }                                     |
+---------------------------------------+
```

### Construct nested OBJECT values

The following example creates a table and inserts OBJECT values with two levels of nesting:

```sqlexample
CREATE OR REPLACE TABLE sample_nested_object (
  id INTEGER,
  nested_object OBJECT);

INSERT INTO sample_nested_object (id, nested_object)
  SELECT 1,
         OBJECT_CONSTRUCT(
           'outer_key1', OBJECT_CONSTRUCT('inner_key1A', 'example1', 'inner_key1B', 'example2'),
           'outer_key2', OBJECT_CONSTRUCT('inner_key2', 5)
         );

INSERT INTO sample_nested_object (id, nested_object)
  SELECT 2,
         OBJECT_CONSTRUCT(
           'outer_key1', OBJECT_CONSTRUCT('inner_key1A', 'example3', 'inner_key1B', 'example4'),
           'outer_key2', OBJECT_CONSTRUCT('inner_key2', 7)
         );

SELECT * FROM sample_nested_object;
```

```output
+----+--------------------------------+
| ID | NESTED_OBJECT                  |
+----+--------------------------------+
| 1  | {                              |
|    |   "outer_key1": {              |
|    |     "inner_key1A": "example1", |
|    |     "inner_key1B": "example2"  |
|    |   },                           |
|    |   "outer_key2": {              |
|    |     "inner_key2": 5            |
|    |   }                            |
|    | }                              |
| 2  | {                              |
|    |   "outer_key1": {              |
|    |     "inner_key1A": "example3", |
|    |     "inner_key1B": "example4"  |
|    |   },                           |
|    |   "outer_key2": {              |
|    |     "inner_key2": 7            |
|    |   }                            |
|    | }                              |
+----+--------------------------------+
```

---
title: OBJECT_CONSTRUCT_KEEP_NULL
source: https://docs.snowflake.com/en/sql-reference/functions/object_construct_keep_null.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_CONSTRUCT_KEEP_NULL

Returns an [OBJECT](../data-types-semistructured.md) constructed from the arguments
that retains key-values pairs with NULL values.

See also:
:   [OBJECT_CONSTRUCT](object_construct.md)

## Syntax

```sqlsyntax
OBJECT_CONSTRUCT_KEEP_NULL( [<key>, <value> [, <key>, <value> , ...]] )

OBJECT_CONSTRUCT_KEEP_NULL(*)
```

## Arguments

`key`
:   The key in a key-value pair. Each key is a VARCHAR value.

`value`
:   The value that is associated with the key. The value can be any data type.

`*`
:   When invoked with an asterisk (wildcard), the OBJECT value is constructed from the
    specified data using the attribute names as keys and the associated values as values.
    See the examples below.

    When you pass a wildcard to the function, you can qualify the wildcard with the name or alias for the table.
    For example, to pass in all of the columns from the table named `mytable`, specify the following:

    ```sqlexample
    (mytable.*)
    ```

    You can also use the ILIKE and EXCLUDE keywords for filtering:

    * ILIKE filters for column names that match the specified pattern. Only one
      pattern is allowed. For example:

      ```sqlexample
      (* ILIKE 'col1%')
      ```
    * EXCLUDE filters out column names that don’t match the specified column or columns. For example:

      ```sqlexample
      (* EXCLUDE col1)

      (* EXCLUDE (col1, col2))
      ```

    Qualifiers are valid when you use these keywords. The following example uses the ILIKE keyword to
    filter for all of the columns that match the pattern `col1%` in the table `mytable`:

    ```sqlexample
    (mytable.* ILIKE 'col1%')
    ```

    The ILIKE and EXCLUDE keywords can’t be combined in a single function call.

    For this function, the ILIKE and EXCLUDE keywords are valid only in a SELECT list or GROUP BY clause.

    For more information about the ILIKE and EXCLUDE keywords, see the “Parameters” section in [SELECT](../sql/select.md).

## Returns

The data type of the returned value is [OBJECT](../data-types-semistructured.md).

## Usage notes

* If the key is NULL (i.e. SQL NULL), the key-value pair is omitted from the resulting object. However,
  if the value is NULL, then the key-value pair is kept.
* The constructed object does not necessarily preserve the original order of the key-value pairs.

## Examples

This example shows the difference between OBJECT_CONSTRUCT and OBJECT_CONSTRUCT_KEEP_NULL:

```sqlexample
SELECT OBJECT_CONSTRUCT('key_1', 'one', 'key_2', NULL) AS WITHOUT_KEEP_NULL,
       OBJECT_CONSTRUCT_KEEP_NULL('key_1', 'one', 'key_2', NULL) AS KEEP_NULL_1,
       OBJECT_CONSTRUCT_KEEP_NULL('key_1', 'one', NULL, 'two') AS KEEP_NULL_2;
```

```output
+-------------------+-------------------+------------------+
| WITHOUT_KEEP_NULL | KEEP_NULL_1       | KEEP_NULL_2      |
|-------------------+-------------------+------------------|
| {                 | {                 | {                |
|   "key_1": "one"  |   "key_1": "one", |   "key_1": "one" |
| }                 |   "key_2": null   | }                |
|                   | }                 |                  |
+-------------------+-------------------+------------------+
```

The following example also shows the difference between OBJECT_CONSTRUCT and OBJECT_CONSTRUCT_KEEP NULL, but this example
uses a small table (which is shown prior to the query):

```sqlexample
CREATE TABLE demo_table_1_with_nulls (province VARCHAR, created_date DATE);
INSERT INTO demo_table_1_with_nulls (province, created_date) VALUES
  ('Manitoba', '2024-01-18'::DATE),
  ('British Columbia', NULL),
  ('Alberta', '2024-01-19'::DATE),
  (NULL, '2024-01-20'::DATE);
```

```sqlexample
SELECT *
  FROM demo_table_1_with_nulls
  ORDER BY province;
```

```output
+------------------+--------------+
| PROVINCE         | CREATED_DATE |
|------------------+--------------|
| Alberta          | 2024-01-19   |
| British Columbia | NULL         |
| Manitoba         | 2024-01-18   |
| NULL             | 2024-01-20   |
+------------------+--------------+
```

```sqlexample
SELECT OBJECT_CONSTRUCT(*) AS oc,
       OBJECT_CONSTRUCT_KEEP_NULL(*) AS oc_keep_null
  FROM demo_table_1_with_nulls
  ORDER BY oc_keep_null['PROVINCE'];
```

```output
+----------------------------------+----------------------------------+
| OC                               | OC_KEEP_NULL                     |
|----------------------------------+----------------------------------|
| {                                | {                                |
|   "CREATED_DATE": "2024-01-19",  |   "CREATED_DATE": "2024-01-19",  |
|   "PROVINCE": "Alberta"          |   "PROVINCE": "Alberta"          |
| }                                | }                                |
| {                                | {                                |
|   "PROVINCE": "British Columbia" |   "CREATED_DATE": null,          |
| }                                |   "PROVINCE": "British Columbia" |
|                                  | }                                |
| {                                | {                                |
|   "CREATED_DATE": "2024-01-18",  |   "CREATED_DATE": "2024-01-18",  |
|   "PROVINCE": "Manitoba"         |   "PROVINCE": "Manitoba"         |
| }                                | }                                |
| {                                | {                                |
|   "CREATED_DATE": "2024-01-20"   |   "CREATED_DATE": "2024-01-20",  |
| }                                |   "PROVINCE": null               |
|                                  | }                                |
+----------------------------------+----------------------------------+
```

For examples that use the closely-related function OBJECT_CONSTRUCT, see [OBJECT_CONSTRUCT](object_construct.md).

---
title: OBJECT_DELETE
source: https://docs.snowflake.com/en/sql-reference/functions/object_delete.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_DELETE

Returns an object containing the contents of the input (that is, source) object with
one or more keys removed.

## Syntax

```sqlsyntax
OBJECT_DELETE( <object>, <key1> [, <key2>, ... ] )
```

## Arguments

`object`
:   The source object.

`key1`, `key2`
:   Keys to be omitted from the returned object.

## Returns

This function returns a value of type OBJECT.

## Usage notes

For [structured OBJECTs](../data-types-structured.md):

* For the arguments that are keys, you must specify constants.
* If the specified key isn’t part of the OBJECT type definition, the call fails. For example, the following call fails because
  the OBJECT value doesn’t contain the specified key `zip_code`:

  ```sqlexample
  SELECT OBJECT_DELETE( {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR), 'zip_code' );
  ```

  ```output
  093201 (23001): Function OBJECT_DELETE: expected structured object to contain field zip_code but it did not.
  ```
* The function returns a structured OBJECT value. The type of the OBJECT value excludes the deleted key. For example, suppose that you
  remove the `city` key:

  ```sqlexample
  SELECT
    OBJECT_DELETE(
      {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
      'city'
    ) AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  The function returns an OBJECT value of the type `OBJECT(state VARCHAR)`, which doesn’t include the `city` key.

  ```output
  +-----------------+----------------------------+
  | NEW_OBJECT      | SYSTEM$TYPEOF(NEW_OBJECT)  |
  |-----------------+----------------------------|
  | {               | OBJECT(state VARCHAR)[LOB] |
  |   "state": "CA" |                            |
  | }               |                            |
  +-----------------+----------------------------+
  ```
* If the function removes all keys from the OBJECT value, the function returns an empty structured OBJECT value of the type `OBJECT()`.

  ```sqlexample
  SELECT
    OBJECT_DELETE(
      {'state':'CA'}::OBJECT(state VARCHAR),
      'state'
    ) AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  ```output
  +------------+---------------------------+
  | NEW_OBJECT | SYSTEM$TYPEOF(NEW_OBJECT) |
  |------------+---------------------------|
  | {}         | OBJECT()[LOB]             |
  +------------+---------------------------+
  ```

  When the type of a structured OBJECT value includes key-value pairs, the names and types of those pairs are included in parentheses
  in the type (for example, OBJECT(city VARCHAR)). Because an empty structured OBJECT value contains no key-value pairs, the
  parentheses are empty.

## Examples

This query returns an object that excludes the keys `a` and `b` from the source object:

```sqlexample
SELECT OBJECT_DELETE(OBJECT_CONSTRUCT('a', 1, 'b', 2, 'c', 3), 'a', 'b') AS object_returned;
```

```output
+-----------------+
| OBJECT_RETURNED |
|-----------------|
| {               |
|   "c": 3        |
| }               |
+-----------------+
```

Create a table and insert rows with OBJECT values. This example uses [OBJECT constants](../data-types-semistructured.md)
in the INSERT statements.

```sqlexample
CREATE OR REPLACE TABLE object_delete_example (
  id INTEGER,
  ov OBJECT);

INSERT INTO object_delete_example (id, ov)
  SELECT
    1,
    {
      'employee_id': 1001,
      'employee_date_of_birth': '12-10-2003',
      'employee_contact':
        {
          'city': 'San Mateo',
          'state': 'CA',
          'phone': '800-555-0100'
        }
    };

INSERT INTO object_delete_example (id, ov)
  SELECT
    2,
    {
      'employee_id': 1002,
      'employee_date_of_birth': '01-01-1990',
      'employee_contact':
        {
          'city': 'Seattle',
          'state': 'WA',
          'phone': '800-555-0101'
        }
    };
```

Query the table to see the data:

```sqlexample
SELECT * FROM object_delete_example;
```

```output
+----+-------------------------------------------+
| ID | OV                                        |
|----+-------------------------------------------|
|  1 | {                                         |
|    |   "employee_contact": {                   |
|    |     "city": "San Mateo",                  |
|    |     "phone": "800-555-0100",              |
|    |     "state": "CA"                         |
|    |   },                                      |
|    |   "employee_date_of_birth": "12-10-2003", |
|    |   "employee_id": 1001                     |
|    | }                                         |
|  2 | {                                         |
|    |   "employee_contact": {                   |
|    |     "city": "Seattle",                    |
|    |     "phone": "800-555-0101",              |
|    |     "state": "WA"                         |
|    |   },                                      |
|    |   "employee_date_of_birth": "01-01-1990", |
|    |   "employee_id": 1002                     |
|    | }                                         |
+----+-------------------------------------------+
```

To delete the `employee_date_of_birth` key from the query output, execute the following query:

```sqlexample
SELECT id,
       OBJECT_DELETE(ov, 'employee_date_of_birth') AS contact_without_date_of_birth
  FROM object_delete_example;
```

```output
+----+-------------------------------+
| ID | CONTACT_WITHOUT_DATE_OF_BIRTH |
|----+-------------------------------|
|  1 | {                             |
|    |   "employee_contact": {       |
|    |     "city": "San Mateo",      |
|    |     "phone": "800-555-0100",  |
|    |     "state": "CA"             |
|    |   },                          |
|    |   "employee_id": 1001         |
|    | }                             |
|  2 | {                             |
|    |   "employee_contact": {       |
|    |     "city": "Seattle",        |
|    |     "phone": "800-555-0101",  |
|    |     "state": "WA"             |
|    |   },                          |
|    |   "employee_id": 1002         |
|    | }                             |
+----+-------------------------------+
```

To query the `employee_contact` nested object, remove the `phone` key from it, and
return only the nested inner key-value pairs, execute the following query:

```sqlexample
SELECT id,
       OBJECT_DELETE(ov:"employee_contact", 'phone') AS contact_without_phone
  FROM object_delete_example;
```

```output
+----+------------------------+
| ID | CONTACT_WITHOUT_PHONE  |
|----+------------------------|
|  1 | {                      |
|    |   "city": "San Mateo", |
|    |   "state": "CA"        |
|    | }                      |
|  2 | {                      |
|    |   "city": "Seattle",   |
|    |   "state": "WA"        |
|    | }                      |
+----+------------------------+
```

To query the `employee_contact` nested object, remove the `phone` key from it, and
return the full object instead of just the nested inner key-value pairs, run a query
that performs the following actions:

* Call the [OBJECT_INSERT](object_insert.md) function and specify the `ov` column
  for the first argument. The function starts with the whole object in each row.
* For the second argument in the OBJECT_INSERT call, specify `employee_contact` for the existing key to update.
* For the third argument in the OBJECT_INSERT call, call the OBJECT_DELETE function to remove the `phone` key from the nested object.
* For the last argument in the OBJECT_INSERT call, specify `true` to replace the old object with the new one.

Execute the following query to perform these actions:

```sqlexample
SELECT id,
       OBJECT_INSERT(
         ov,
         'employee_contact',
         OBJECT_DELETE(
           ov:employee_contact,
           'phone'
         ),
         true
      ) AS full_object_without_phone
  FROM object_delete_example;
```

```output
+----+-------------------------------------------+
| ID | FULL_OBJECT_WITHOUT_PHONE                 |
|----+-------------------------------------------|
|  1 | {                                         |
|    |   "employee_contact": {                   |
|    |     "city": "San Mateo",                  |
|    |     "state": "CA"                         |
|    |   },                                      |
|    |   "employee_date_of_birth": "12-10-2003", |
|    |   "employee_id": 1001                     |
|    | }                                         |
|  2 | {                                         |
|    |   "employee_contact": {                   |
|    |     "city": "Seattle",                    |
|    |     "state": "WA"                         |
|    |   },                                      |
|    |   "employee_date_of_birth": "01-01-1990", |
|    |   "employee_id": 1002                     |
|    | }                                         |
+----+-------------------------------------------+
```

---
title: OBJECT_INSERT
source: https://docs.snowflake.com/en/sql-reference/functions/object_insert.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_INSERT

Returns an [OBJECT](../data-types-semistructured.md) value consisting of the input OBJECT value with a new key-value pair inserted
(or an existing key updated with a new value).

## Syntax

```sqlsyntax
OBJECT_INSERT( <object> , <key> , <value> [ , <updateFlag> ] )
```

## Arguments

**Required:**

`object`
:   The source OBJECT value into which the new key-value pair is inserted or in which an existing key-value pair is updated.

`key`
:   The new key to be inserted into the OBJECT value or an existing key whose value is being updated. The specified key must
    be different from all existing keys in the OBJECT value, unless `updateFlag` is set to TRUE.

`value`
:   The value associated with the key.

**Optional:**

`updateFlag`
:   A Boolean flag that, when set to TRUE, specifies that the input value updates the value of an existing key in the
    OBJECT value, rather than inserting a new key-value pair.

    The default is FALSE.

## Returns

This function returns a value that has the OBJECT data type.

## Usage notes

* The function supports [JSON null](../../user-guide/semistructured-considerations.md) values, but not SQL NULL values or keys:

  + If `key` is any string other than NULL and `value` is a JSON null (for example, `PARSE_JSON('null')`),
    the key-value pair is inserted into the returned OBJECT value.
  + If either `key` or `value` is a SQL NULL, the key-value pair is omitted from the returned OBJECT value.
* If the optional `updateFlag` argument is set to TRUE, the existing input `key` is updated to the input `value`.
  If `updateFlag` is omitted or set to FALSE, calling this function with an input key that already exists in the OBJECT value results
  in an error.
* If the update flag is set to TRUE, but the corresponding key doesn’t already
  exist in the OBJECT value, then the key-value pair is added.
* For [structured OBJECT values](../data-types-structured.md):

  + For the arguments that are keys, you must specify constants.
  + When the `updateFlag` argument is FALSE (when you are inserting a new key-value pair):

    - If you specify a key that already exists in the OBJECT value, an error occurs.

      ```sqlexample
      SELECT OBJECT_INSERT(
        {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
        'city',
        'San Jose',
        false
      );
      ```

      ```output
      093202 (23001): Function OBJECT_INSERT:
        expected structured object to not contain field city but it did.
      ```
    - The function returns a structured OBJECT value. The type of the OBJECT value includes the newly inserted key. For example, suppose that
      you add the `zipcode` key with the VARCHAR value `94402`:

      ```sqlexample
      SELECT
        OBJECT_INSERT(
          {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
          'zip_code',
          94402::VARCHAR,
          false
        ) AS new_object,
        SYSTEM$TYPEOF(new_object) AS type;
      ```

      ```output
      +------------------------+---------------------------------------------------------------------+
      | NEW_OBJECT             | TYPE                                                                |
      |------------------------+---------------------------------------------------------------------|
      | {                      | OBJECT(city VARCHAR, state VARCHAR, zip_code VARCHAR NOT NULL)[LOB] |
      |   "city": "San Mateo", |                                                                     |
      |   "state": "CA",       |                                                                     |
      |   "zip_code": "94402"  |                                                                     |
      | }                      |                                                                     |
      +------------------------+---------------------------------------------------------------------+
      ```

      The type of the inserted value determines the type added to the OBJECT type definition. In this case, the value for
      `zipcode` is a value cast to a VARCHAR, so the type of `zipcode` is VARCHAR.
  + When the `updateFlag` argument is TRUE (when you are replacing an existing key-value pair):

    - If you specify a key that doesn’t exist in the OBJECT value, an error occurs.
    - The function returns a structured OBJECT value of the same type.
    - The type of the inserted value is [coerced](../data-types-structured.md) to the type of the existing key.

## Examples

The following examples call the OBJECT_INSERT function:

### Add and update key-value pairs

The examples use the following table:

```sqlexample
CREATE OR REPLACE TABLE object_insert_examples (object_column OBJECT);

INSERT INTO object_insert_examples (object_column)
  SELECT OBJECT_CONSTRUCT('a', 'value1', 'b', 'value2');

SELECT * FROM object_insert_examples;
```

```output
+------------------+
| OBJECT_COLUMN    |
|------------------|
| {                |
|   "a": "value1", |
|   "b": "value2"  |
| }                |
+------------------+
```

#### Add a new key-value pair to an OBJECT value

Insert a third key-value pair into an OBJECT value that has two key-value pairs:

```sqlexample
UPDATE object_insert_examples
  SET object_column = OBJECT_INSERT(object_column, 'c', 'value3');

SELECT * FROM object_insert_examples;
```

```output
+------------------+
| OBJECT_COLUMN    |
|------------------|
| {                |
|   "a": "value1", |
|   "b": "value2", |
|   "c": "value3"  |
| }                |
+------------------+
```

Insert two new key-value pairs into the OBJECT value, while omitting one key-value pair:

> * `d` consists of a JSON null value.
> * `e` consists of a SQL NULL value and is, therefore, omitted.
> * `f` consists of a string containing “null”.

```sqlexample
UPDATE object_insert_examples
  SET object_column = OBJECT_INSERT(object_column, 'd', PARSE_JSON('null'));

UPDATE object_insert_examples
  SET object_column = OBJECT_INSERT(object_column, 'e', NULL);

UPDATE object_insert_examples
  SET object_column = OBJECT_INSERT(object_column, 'f', 'null');

SELECT * FROM object_insert_examples;
```

```output
+------------------+
| OBJECT_COLUMN    |
|------------------|
| {                |
|   "a": "value1", |
|   "b": "value2", |
|   "c": "value3", |
|   "d": null,     |
|   "f": "null"    |
| }                |
+------------------+
```

#### Update a key-value pair in an OBJECT value

Update an existing key-value pair (`"b": "value2"`) in the OBJECT value with a new value (`"valuex"`):

```sqlexample
UPDATE object_insert_examples
  SET object_column = OBJECT_INSERT(object_column, 'b', 'valuex', TRUE);

SELECT * FROM object_insert_examples;
```

```output
+------------------+
| OBJECT_COLUMN    |
|------------------|
| {                |
|   "a": "value1", |
|   "b": "valuex", |
|   "c": "value3", |
|   "d": null,     |
|   "f": "null"    |
| }                |
+------------------+
```

### Add and update nested OBJECT values

The examples use the following table with nested OBJECT values:

```sqlexample
CREATE OR REPLACE TABLE sample_nested_object (
  id INTEGER,
  nested_object OBJECT);

INSERT INTO sample_nested_object (id, nested_object)
  SELECT 1,
         OBJECT_CONSTRUCT(
           'outer_key1', OBJECT_CONSTRUCT('inner_key1A', 'example1', 'inner_key1B', 'example2'),
           'outer_key2', OBJECT_CONSTRUCT('inner_key2', 5)
         );

INSERT INTO sample_nested_object (id, nested_object)
  SELECT 2,
         OBJECT_CONSTRUCT(
           'outer_key1', OBJECT_CONSTRUCT('inner_key1A', 'example3', 'inner_key1B', 'example4'),
           'outer_key2', OBJECT_CONSTRUCT('inner_key2', 7)
         );

SELECT * FROM sample_nested_object;
```

```output
+----+--------------------------------+
| ID | NESTED_OBJECT                  |
+----+--------------------------------+
| 1  | {                              |
|    |   "outer_key1": {              |
|    |     "inner_key1A": "example1", |
|    |     "inner_key1B": "example2"  |
|    |   },                           |
|    |   "outer_key2": {              |
|    |     "inner_key2": 5            |
|    |   }                            |
|    | }                              |
| 2  | {                              |
|    |   "outer_key1": {              |
|    |     "inner_key1A": "example3", |
|    |     "inner_key1B": "example4"  |
|    |   },                           |
|    |   "outer_key2": {              |
|    |     "inner_key2": 7            |
|    |   }                            |
|    | }                              |
+----+--------------------------------+
```

#### Add new nested key-value pairs to the nested OBJECT values

The following example adds new nested key-value pairs to the nested OBJECT values in the table. It uses
a [CASE](case.md) expression to specify the added key-value pair for
each row:

```sqlexample
UPDATE sample_nested_object
  SET nested_object = OBJECT_INSERT(
    nested_object,
    'outer_key1',
     OBJECT_INSERT(
       nested_object:outer_key1,
       'inner_key1C',
       CASE
         WHEN id = 1 THEN 'added_value_1'
         WHEN id = 2 THEN 'added_value_2'
       END,
       TRUE
      ),
    TRUE);

SELECT * FROM sample_nested_object;
```

```output
+----+------------------------------------+
| ID | NESTED_OBJECT                      |
|----+------------------------------------|
|  1 | {                                  |
|    |   "outer_key1": {                  |
|    |     "inner_key1A": "example1",     |
|    |     "inner_key1B": "example2",     |
|    |     "inner_key1C": "added_value_1" |
|    |   },                               |
|    |   "outer_key2": {                  |
|    |     "inner_key2": 5                |
|    |   }                                |
|    | }                                  |
|  2 | {                                  |
|    |   "outer_key1": {                  |
|    |     "inner_key1A": "example3",     |
|    |     "inner_key1B": "example4",     |
|    |     "inner_key1C": "added_value_2" |
|    |   },                               |
|    |   "outer_key2": {                  |
|    |     "inner_key2": 7                |
|    |   }                                |
|    | }                                  |
+----+------------------------------------+
```

#### Update key-value pairs in the nested OBJECT values

The following example updates nested key-value pairs in the OBJECT values in the table:

```sqlexample
UPDATE sample_nested_object
  SET nested_object = OBJECT_INSERT(
    nested_object,
    'outer_key2',
    OBJECT_INSERT(
      nested_object:outer_key2,
      'inner_key2',
      CASE
        WHEN id = 1 THEN 6
        WHEN id = 2 THEN 8
      END,
      TRUE),
    TRUE);

SELECT * FROM sample_nested_object;
```

```output
+----+------------------------------------+
| ID | NESTED_OBJECT                      |
|----+------------------------------------|
|  1 | {                                  |
|    |   "outer_key1": {                  |
|    |     "inner_key1A": "example1",     |
|    |     "inner_key1B": "example2",     |
|    |     "inner_key1C": "added_value_1" |
|    |   },                               |
|    |   "outer_key2": {                  |
|    |     "inner_key2": 6                |
|    |   }                                |
|    | }                                  |
|  2 | {                                  |
|    |   "outer_key1": {                  |
|    |     "inner_key1A": "example3",     |
|    |     "inner_key1B": "example4",     |
|    |     "inner_key1C": "added_value_2" |
|    |   },                               |
|    |   "outer_key2": {                  |
|    |     "inner_key2": 8                |
|    |   }                                |
|    | }                                  |
+----+------------------------------------+
```

---
title: OBJECT_KEYS
source: https://docs.snowflake.com/en/sql-reference/functions/object_keys.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_KEYS

Returns an array containing the list of keys in the top-most level of the input object.

## Syntax

```sqlsyntax
OBJECT_KEYS( <object> )
```

## Arguments

`object`
:   The value for which you want the keys. The input value must be one of the following:

    * An [OBJECT](../data-types-semistructured.md).
    * A [VARIANT](../data-types-semistructured.md) that contains a value of type OBJECT.

## Returns

The function returns an [ARRAY](../data-types-semistructured.md) containing the keys.

If `object` is a [structured OBJECT](../data-types-structured.md), the function returns an ARRAY(VARCHAR).

## Usage notes

* If the object contains nested objects (e.g. objects within objects), this returns only the keys from the top-most level.

## Examples

### Basic example

The next example shows OBJECT_KEYS working with both an [OBJECT](../data-types-semistructured.md) and a
[VARIANT](../data-types-semistructured.md) that contains a value of type OBJECT.

> Create a table that contains columns of types [OBJECT](../data-types-semistructured.md) and
> [VARIANT](../data-types-semistructured.md).
>
> ```sqlexample
> CREATE TABLE objects_1 (id INTEGER, object1 OBJECT, variant1 VARIANT);
> ```
>
> INSERT values:
>
> ```sqlexample
> INSERT INTO objects_1 (id, object1, variant1)
>   SELECT
>     1,
>     OBJECT_CONSTRUCT('a', 1, 'b', 2, 'c', 3),
>     TO_VARIANT(OBJECT_CONSTRUCT('a', 1, 'b', 2, 'c', 3))
>     ;
> ```
>
> Retrieve the keys from both the OBJECT and the VARIANT:
>
> ```sqlexample
> SELECT OBJECT_KEYS(object1), OBJECT_KEYS(variant1)
>     FROM objects_1
>     ORDER BY id;
> +----------------------+-----------------------+
> | OBJECT_KEYS(OBJECT1) | OBJECT_KEYS(VARIANT1) |
> |----------------------+-----------------------|
> | [                    | [                     |
> |   "a",               |   "a",                |
> |   "b",               |   "b",                |
> |   "c"                |   "c"                 |
> | ]                    | ]                     |
> +----------------------+-----------------------+
> ```

### Example of nested objects

This example shows that if the object contains nested objects, only the keys from the top-most level are returned.

> ```sqlexample
> SELECT OBJECT_KEYS (
>            PARSE_JSON (
>                '{
>                     "level_1_A": {
>                                  "level_2": "two"
>                                  },
>                     "level_1_B": "one"
>                     }'
>                )
>            ) AS keys
>     ORDER BY 1;
> +----------------+
> | KEYS           |
> |----------------|
> | [              |
> |   "level_1_A", |
> |   "level_1_B"  |
> | ]              |
> +----------------+
> ```

---
title: OBJECT_PICK
source: https://docs.snowflake.com/en/sql-reference/functions/object_pick.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# OBJECT_PICK

Returns a new [OBJECT](../data-types-semistructured.md) containing some of the key-value pairs from an existing
object.

To identify the key-value pairs to include in the new object, pass in the keys as arguments, or pass in an array containing
the keys.

If a specified key is not present in the input object, the key is ignored.

## Syntax

```sqlsyntax
OBJECT_PICK( <object>, <key1> [, <key2>, ... ] )

OBJECT_PICK( <object>, <array> )
```

## Arguments

`object`
:   The input object.

`key1`, `key2`
:   One or more keys identifying the key-value pairs that should be included in the returned object.

`array`
:   Array of keys identifying the key-value pairs that should be included in the returned object.

## Returns

Returns a new OBJECT containing the specified key-value pairs.

## Usage notes

For structured OBJECTs:

* For the arguments that are keys, you must specify constants.
* You can’t pass in an ARRAY of keys as the second argument. You must specify each key as a separate argument.
* The function returns a structured OBJECT value. The type of the OBJECT value includes the keys in the order in which they are specified.

  For example, suppose that you select the `state` and `city` keys in that order:

  ```sqlexample
  SELECT
    OBJECT_PICK(
      {'city':'San Mateo','state':'CA','zip_code':94402}::OBJECT(city VARCHAR,state VARCHAR,zip_code DOUBLE),
      'state',
      'city') AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  The function returns an OBJECT value of the type `OBJECT(state VARCHAR, city VARCHAR)`.

  ```output
  +-----------------------+------------------------------------------+
  | NEW_OBJECT            | SYSTEM$TYPEOF(NEW_OBJECT)                |
  |-----------------------+------------------------------------------|
  | {                     | OBJECT(state VARCHAR, city VARCHAR)[LOB] |
  |   "state": "CA",      |                                          |
  |   "city": "San Mateo" |                                          |
  | }                     |                                          |
  +-----------------------+------------------------------------------+
  ```

## Examples

The following example calls OBJECT_PICK to create a new object that contains two of the three key-value pairs from an existing
object:

> ```sqlexample
> SELECT OBJECT_PICK(
>     OBJECT_CONSTRUCT(
>         'a', 1,
>         'b', 2,
>         'c', 3
>     ),
>     'a', 'b'
> ) AS new_object;
> +------------+
> | NEW_OBJECT |
> |------------|
> | {          |
> |   "a": 1,  |
> |   "b": 2   |
> | }          |
> +------------+
> ```

In the example above, the keys are passed as arguments to OBJECT_PICK. You can also use an array to specify the keys,
as shown below:

> ```sqlexample
> SELECT OBJECT_PICK(
>     OBJECT_CONSTRUCT(
>         'a', 1,
>         'b', 2,
>         'c', 3
>     ),
>     ARRAY_CONSTRUCT('a', 'b')
> ) AS new_object;
> +------------+
> | NEW_OBJECT |
> |------------|
> | {          |
> |   "a": 1,  |
> |   "b": 2   |
> | }          |
> +------------+
> ```

---
title: OCTET_LENGTH
source: https://docs.snowflake.com/en/sql-reference/functions/octet_length.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# OCTET_LENGTH

Returns the length of a string or binary value in bytes. This will be the same as LENGTH for ASCII strings and greater than
LENGTH for strings using Unicode code points. For binary, this is always the same as LENGTH.

## Syntax

```sqlsyntax
OCTET_LENGTH(<string_or_binary>)
```

## Arguments

`string_or_binary`
:   The string or binary value for which the length is returned.

## Examples

```sqlexample
SELECT OCTET_LENGTH('abc'), OCTET_LENGTH('\u0392'), OCTET_LENGTH(X'A1B2');

---------------------+------------------------+-----------------------+
 OCTET_LENGTH('ABC') | OCTET_LENGTH('\U0392') | OCTET_LENGTH(X'A1B2') |
---------------------+------------------------+-----------------------+
 3                   | 2                      | 2                     |
---------------------+------------------------+-----------------------+
```

---
title: ONLINE_FEATURE_TABLE_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/online-feature-table-refresh-history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# ONLINE_FEATURE_TABLE_REFRESH_HISTORY

This table function returns information about each refresh (completed and running) of [online feature tables](../sql/create-online-feature-table.md).

This table function returns all refreshes that are in progress as well as all refreshes that have a REFRESH_START_TIME within 7 days of the current time.

See also:
:   [CREATE ONLINE FEATURE TABLE](../sql/create-online-feature-table.md) , [ALTER ONLINE FEATURE TABLE](../sql/alter-online-feature-table.md), [DESCRIBE ONLINE FEATURE TABLE](../sql/desc-online-feature-table.md) , [DROP ONLINE FEATURE TABLE](../sql/drop-online-feature-table.md) , [SHOW ONLINE FEATURE TABLES](../sql/show-online-feature-tables.md)

## Syntax

```sqlsyntax
ONLINE_FEATURE_TABLE_REFRESH_HISTORY(
  [ REFRESH_START_TIMESTAMP => <constant_expr> ]
  [ , REFRESH_END_TIMESTAMP => <constant_expr> ]
  [ , RESULT_LIMIT => <integer> ]
  [ , NAME => '<string>' ]
  [ , NAME_PREFIX => '<string>' ]
  [ , ERROR_ONLY => { TRUE | FALSE } ]
)
```

## Arguments

All the arguments are optional. If no arguments are provided, 100 refreshes from all online feature tables in the account will be returned.

`REFRESH_START_TIMESTAMP => constant_expr` , `REFRESH_END_TIMESTAMP => constant_expr`
:   Time range (in TIMESTAMP_LTZ format) during which the refreshes started. If an end version is not specified, CURRENT_TIMESTAMP is used as the end of the range.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function. If the number of matching rows is greater than this limit, the refreshes that finished most recently (and those that are still running) are returned, up to the specified limit.

    Range: 1 to 10000

    Default: 100.

`NAME => 'string'`
:   The name of an online feature table.

    You can specify the unqualified name (`online_feature_table_name`), the partially qualified name (`schema_name.online_feature_table_name`), or the fully qualified name (`database_name.schema_name.online_feature_table_name`).

    > For more information on object name resolution, see [Object name resolution](../name-resolution.md).

    The function returns the refreshes for this table.

`NAME_PREFIX => 'string'`
:   A prefix for online feature tables.

    The function returns refreshes for tables with names that start with this prefix.

    You can use this argument to return the refreshes for online feature tables in a specific database or schema.

`ERROR_ONLY => { TRUE | FALSE }`
:   When set to TRUE, this function returns only refreshes that failed or were cancelled.

    Default: FALSE

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Data type | Description |
| --- | --- | --- |
| `NAME` | TEXT | Name of the online feature table. |
| `SCHEMA_NAME` | TEXT | Name of the schema that contains the online feature table. |
| `DATABASE_NAME` | TEXT | Name of the database that contains the online feature table. |
| `QUALIFIED_NAME` | TEXT | Fully qualified name of the online feature table. |
| `STATE` | TEXT | Status of the refresh for the online feature table. The status can be one of the following:   * `EXECUTING`: refresh in progress. * `SUCCEEDED`: refresh completed successfully. * `FAILED`: refresh failed during execution. * `CANCELLED`: refresh was canceled before completion. |
| `REFRESH_START_TIME` | TIMESTAMP_LTZ | Time when the refresh job started. |
| `REFRESH_END_TIME` | TIMESTAMP_LTZ | Time when the refresh completed. |
| `REFRESH_TRIGGER` | TEXT | One of:   * `SCHEDULED`: normal background refresh to meet target lag. * `MANUAL`: user/task ran `ALTER ONLINE FEATURE TABLE <name> REFRESH` command. * `CREATION`: refresh performed during the creation DDL statement, triggered by the creation of the online feature table. |
| `REFRESH_ACTION` | TEXT | One of:   * `NO_DATA`: no new data in base tables. Doesn’t apply to the initial refresh of newly created online feature tables regardless of whether or not the base tables have data. * `REINITIALIZE`: base table changed. * `FULL`: Full refresh, because refresh mode of the online feature table is set to FULL. * `INCREMENTAL`: normal incremental refresh. |
| `QUERY_ID` | TEXT | ID of the SQL statement that produced the results for the online feature table. |
| `STATE_CODE` | TEXT | Code representing the current state of the refresh. |
| `STATE_MESSAGE` | TEXT | Description of the current state of the refresh. |

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Online feature table | Role that has the MONITOR privilege on the online feature table. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This function is available in the INFORMATION_SCHEMA.
* The information returned by this function is up to date. Online Feature Table refresh history in the ACCOUNT_USAGE.ONLINE_FEATURE_TABLE_REFRESH_HISTORY view may lag by up to 3 hours.

## Examples

The following example returns the refresh history for all online feature tables in the account:

```sqlexample
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ONLINE_FEATURE_TABLE_REFRESH_HISTORY());
```

The following example returns the refresh history for a specific online feature table named `my_feature_table`:

```sqlexample
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ONLINE_FEATURE_TABLE_REFRESH_HISTORY(
  NAME => 'my_feature_table'
));
```

The following example returns only failed refreshes from the last 24 hours:

```sqlexample
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ONLINE_FEATURE_TABLE_REFRESH_HISTORY(
  REFRESH_START_TIMESTAMP => CURRENT_TIMESTAMP - INTERVAL '1 DAY',
  ERROR_ONLY => TRUE
));
```

The following example returns refreshes for online feature tables with names starting with `feature_` and limits the results to 50 rows:

```sqlexample
SELECT *
FROM TABLE(INFORMATION_SCHEMA.ONLINE_FEATURE_TABLE_REFRESH_HISTORY(
  NAME_PREFIX => 'feature_',
  RESULT_LIMIT => 50
));
```

---
title: PARSE_DOCUMENT (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/parse_document-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# PARSE_DOCUMENT (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_PARSE_DOCUMENT](ai_parse_document.md) is the latest version of this function.
> Use AI_PARSE_DOCUMENT for the latest functionality.
> You can continue to use PARSE_DOCUMENT (SNOWFLAKE.CORTEX).

Returns the extracted content from a document on a Snowflake stage as a JSON-formatted string. This
function supports two types of extraction, Optical Character Recognition (OCR), and layout. For more
information, see [Parsing documents with AI_PARSE_DOCUMENT](../../user-guide/snowflake-cortex/parse-document.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.PARSE_DOCUMENT( '@<stage>', '<path>', [ <options> ] )
```

## Arguments

**Required:**

`stage`
:   Name of the Snowflake stage.

`path`
:   Relative path to the document on the Snowflake stage.

**Optional:**

`options`
:   An OBJECT value that contains options for parsing documents. The supported keys are shown below. All are optional.

    * `'mode'`: Specifies the parsing mode. The supported modes are:

      + `'OCR'`: The function extracts text only. This is the default mode.
      + `'LAYOUT'`: The function extracts layout as well as text, including structural content such as tables.
    * `'page_split'`: If set to TRUE, the function splits the output of the function to return content per page.
      Only PDF, PowerPoint (`.pptx`), and Word (`.docx`) documents are supported.
      Documents in other formats return an error. The default is FALSE.

## Returns

A JSON object (as a string) that contains the extracted data and associated metadata. The `options` argument
determines the structure of the returned object.

> **Tip:**
>
> To use the output in SQL, convert it to an OBJECT value using the [PARSE_JSON](parse_json.md) function.

If the `'page_split'` option is set, the output has the following structure:

> * `"pages"`: An array of JSON objects, each containing text extracted from the document. If the document has only
>   one page, the output still contains a `"pages"` array (which contains a single object). Each page has the following fields:
>
>   > + `"content"`: Plain text (in OCR mode) or Markdown-formatted text (in LAYOUT mode).
>   > + `"index"`: The page index in the file, starting at 0. Page numbers and formats specified in the document are ignored.
>
> > * `"errorInformation"`: Contains error information if document can’t be parsed.
> > * `"metadata"`: Contains metadata about the document, such as page count.
>
> > **Note:**
> >
> > The `"pages"` and `"metadata"` fields are present in the output when parsing succeeds.
> > `"errorInformation"` is present only if parsing fails.

If `'page_split'` is FALSE or is not present, the output has the following structure:

> > * `"content"`: Plain text (in OCR mode) or Markdown-formatted text (in LAYOUT mode).
> > * `"errorInformation"`: Contains error information if the document can’t be parsed.
> > * `"metadata"`: Contains metadata about the document, such as page count.
>
> > **Note:**
> >
> > The `"content"` and `"metadata"` fields are present in the output when parsing succeeds.
> > `"errorInformation"` is present only if parsing fails.

## Examples

### OCR mode

```sqlexample
SELECT TO_VARCHAR(
    SNOWFLAKE.CORTEX.PARSE_DOCUMENT(
        '@PARSE_DOCUMENT.DEMO.documents',
        'document_1.pdf',
        {'mode': 'OCR'})
    ) AS OCR;
```

Output:

```output
{
    "content": "content of the document"
}
```

### LAYOUT mode

This example parses a document with a table shown in the following screenshot:

```sqlexample
SELECT
  TO_VARCHAR (
    SNOWFLAKE.CORTEX.PARSE_DOCUMENT (
        '@PARSE_DOCUMENT.DEMO.documents',
        'document_1.pdf',
        {'mode': 'LAYOUT'} ) ) AS LAYOUT;
```

Output:

```output
{
  "content": "# This is PARSE DOCUMENT example
     Example table:
     |Header|Second header|Third Header|
     |:---:|:---:|:---:|
     |First row header|Data in first row|Data in first row|
     |Second row header|Data in second row|Data in second row|

     Some more text."
 }
```

### Split pages

This example splits a multi-page document into separate pages, which are processed separately using the `'OCR'` mode.

```sqlexample
SELECT
  TO_VARCHAR (
    SNOWFLAKE.CORTEX.PARSE_DOCUMENT (
        '@PARSE_DOCUMENT.DEMO.documents',
        'document_1.pdf',
        {'mode': 'OCR', 'page_split': TRUE} ) ) AS MULTIPAGE;
```

Output:

```output
{
  "pages": [
    {
      "content": "content of the first page",
      "index": 0
    },
    {
      "content": "content of the second page",
      "index": 1
    },
    {
      "content": "content of the third page",
      "index": 2
    }
  ],
  "metadata": {
    "pageCount": 3
  }
}
```

---
title: PARSE_IP
source: https://docs.snowflake.com/en/sql-reference/functions/parse_ip.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# PARSE_IP

Returns a JSON object consisting of all the components from a valid INET (Internet Protocol) or CIDR (Classless Internet Domain Routing) IPv4 or IPv6 string.

## Syntax

```sqlsyntax
PARSE_IP(<expr>, '<type>' [, <permissive>])
```

## Arguments

**Required:**

`expr`
:   A string expression.

`type`
:   A string that identifies the type of IP address. Supports either `INET` or `CIDR`; the value is
    case-insensitive.

**Optional:**

`permissive`
:   Flag that determines how parse errors are handled:

    * If set to 0, parse errors cause the function to fail.
    * If set to 1, parse errors result in an object with the `error`
      field set to the respective error message (and no other fields set).

    Default value is 0.

## Returns

[OBJECT](../data-types-semistructured.md).

## Usage notes

* The function parses an IP address and returns a JSON object.

  The following elements are always returned:

  > `family`
  > :   Numeric value. `4` (IPv4) or `6` (IPv6).
  >
  > `ip_type`
  > :   String value. `inet` or `cidr` from the input.
  >
  > `host`
  > :   String value. Host address from the input expression.
  >
  > `ip_fields`
  > :   Array of 4 numeric fields, each a value between 0 and 4,294,967,295 (2^32 - 1), inclusive. The bit values from
  >     this array are mapped to the raw bits in the host address.
  >
  >     + IPv4 addresses: Displays only the rightmost 32 bits of the host address.
  >     + IPv6 addresses: Displays each of the 32-bit fields that map to the raw 128-bit host address from left to right.

  If a subnet mask is input, the results include `network_prefix_length`, a numeric value that identifies the length of the subnet mask.

  The following elements are returned for IPv4 addresses:

  > `ipv4`
  > :   Numeric IP address that matches the first field in `ip_fields`.
  >
  > `ipv4_range_start`
  > :   Numeric start address of the network, displayed when a subnet mask is included in the input.
  >
  > `ipv4_range_end`
  > :   Numeric end address of the network, displayed when a subnet mask is included in the input.

  The following elements are returned for IPv6 addresses:

  > `hex_ipv6`
  > :   IP address expressed as a fully padded, fixed-size hexadecimal value.
  >
  > `hex_ipv6_range_start`
  > :   Fully padded fixed-size hexadecimal start address of the network, displayed when a subnet mask is included in the input.
  >
  > `hex_ipv6_range_end`
  > :   Fully padded fixed-size hexadecimal end address of the network, displayed when a subnet mask is included in the input.

  The `snowflake$type` element is reserved for internal Snowflake usage.
* For IP address range calculations or subnet mask searches, query the individual JSON elements directly. See the examples, below.
* When inputting a subnet mask, Snowflake recommends storing the function output in a VARIANT column and querying against the generated elements for better performance. See the examples.

## Examples

> ```sqlexample
> SELECT column1, PARSE_IP(column1, 'INET') FROM VALUES('192.168.242.188/24'), ('192.168.243.189/24');
> --------------------+-----------------------------------+
>  COLUMN1            | PARSE_IP(COLUMN1, 'INET')         |
> --------------------+-----------------------------------|
>  192.168.242.188/24 | {                                 |
>                     |   "family": 4,                    |
>                     |   "host": "192.168.242.188",      |
>                     |   "ip_fields": [                  |
>                     |     3232297660,                   |
>                     |     0,                            |
>                     |     0,                            |
>                     |     0                             |
>                     |   ],                              |
>                     |   "ip_type": "inet",              |
>                     |   "ipv4": 3232297660,             |
>                     |   "ipv4_range_end": 3232297727,   |
>                     |   "ipv4_range_start": 3232297472, |
>                     |   "netmask_prefix_length": 24,    |
>                     |   "snowflake$type": "ip_address"  |
>                     | }                                 |
>  192.168.243.189/24 | {                                 |
>                     |   "family": 4,                    |
>                     |   "host": "192.168.243.189",      |
>                     |   "ip_fields": [                  |
>                     |     3232297917,                   |
>                     |     0,                            |
>                     |     0,                            |
>                     |     0                             |
>                     |   ],                              |
>                     |   "ip_type": "inet",              |
>                     |   "ipv4": 3232297917,             |
>                     |   "ipv4_range_end": 3232297983,   |
>                     |   "ipv4_range_start": 3232297728, |
>                     |   "netmask_prefix_length": 24,    |
>                     |   "snowflake$type": "ip_address"  |
>                     | }                                 |
> --------------------+-----------------------------------+
> ```
>
> ```sqlexample
> SELECT PARSE_IP('fe80::20c:29ff:fe2c:429/64', 'INET');
>
> ----------------------------------------------------------------+
>   PARSE_IP('FE80::20C:29FF:FE2C:429/64', 'INET')                |
> ----------------------------------------------------------------|
>   {                                                             |
>     "family": 6,                                                |
>     "hex_ipv6": "FE80000000000000020C29FFFE2C0429",             |
>     "hex_ipv6_range_end": "FE80000000000000FFFFFFFFFFFFFFFF",   |
>     "hex_ipv6_range_start": "FE800000000000000000000000000000", |
>     "host": "fe80::20c:29ff:fe2c:429",                          |
>     "ip_fields": [                                              |
>       4269801472,                                               |
>       0,                                                        |
>       34351615,                                                 |
>       4264297513                                                |
>     ],                                                          |
>     "ip_type": "inet",                                          |
>     "netmask_prefix_length": 64,                                |
>     "snowflake$type": "ip_address"                              |
>   }                                                             |
> ----------------------------------------------------------------+
> ```
>
> ```sqlexample
> WITH
> lookup AS (
>   SELECT column1 AS tag, PARSE_IP(column2, 'INET') AS obj FROM VALUES('San Francisco', '192.168.242.0/24'), ('New York', '192.168.243.0/24')
> ),
> entries AS (
>   SELECT PARSE_IP(column1, 'INET') AS ipv4 FROM VALUES('192.168.242.188/24'), ('192.168.243.189/24')
> )
> SELECT lookup.tag, entries.ipv4:host, entries.ipv4
> FROM lookup, entries
> WHERE lookup.tag = 'San Francisco'
> AND entries.IPv4:ipv4 BETWEEN lookup.obj:ipv4_range_start AND lookup.obj:ipv4_range_end;
>
> ---------------+-------------------+-----------------------------------+
>  TAG           | ENTRIES.IPV4:HOST | IPV4                              |
> ---------------+-------------------+-----------------------------------|
>  San Francisco | "192.168.242.188" | {                                 |
>                |                   |   "family": 4,                    |
>                |                   |   "host": "192.168.242.188",      |
>                |                   |   "ip_fields": [                  |
>                |                   |     3232297660,                   |
>                |                   |     0,                            |
>                |                   |     0,                            |
>                |                   |     0                             |
>                |                   |   ],                              |
>                |                   |   "ip_type": "inet",              |
>                |                   |   "ipv4": 3232297660,             |
>                |                   |   "ipv4_range_end": 3232297727,   |
>                |                   |   "ipv4_range_start": 3232297472, |
>                |                   |   "netmask_prefix_length": 24,    |
>                |                   |   "snowflake$type": "ip_address"  |
>                |                   | }                                 |
> ---------------+-------------------+-----------------------------------+
> ```
>
> ```sqlexample
> CREATE OR REPLACE TABLE ipv6_lookup (tag String, obj VARIANT);
>
> -----------------------------------------+
>  status                                  |
> -----------------------------------------|
>  Table IPV6_LOOKUP successfully created. |
> -----------------------------------------+
>
> INSERT INTO ipv6_lookup
>     SELECT column1 AS tag, parse_ip(column2, 'INET') AS obj
>     FROM VALUES('west', 'fe80:12:20c:29ff::/64'), ('east', 'fe80:12:1:29ff::/64');
>
> -------------------------+
>  number of rows inserted |
> -------------------------|
>                        2 |
> -------------------------+
>
> CREATE OR REPLACE TABLE ipv6_entries (obj VARIANT);
> ------------------------------------------+
>  status                                   |
> ------------------------------------------|
>  Table IPV6_ENTRIES successfully created. |
> ------------------------------------------+
>
> INSERT INTO ipv6_entries
>     SELECT parse_ip(column1, 'INET') as obj
>     FROM VALUES
>         ('fe80:12:20c:29ff:fe2c:430:370:2/64'),
>         ('fe80:12:20c:29ff:fe2c:430:370:00F0/64'),
>         ('fe80:12:20c:29ff:fe2c:430:370:0F00/64'),
>         ('fe80:12:20c:29ff:fe2c:430:370:F000/64'),
>         ('fe80:12:20c:29ff:fe2c:430:370:FFFF/64'),
>         ('fe80:12:1:29ff:fe2c:430:370:FFFF/64'),
>         ('fe80:12:1:29ff:fe2c:430:370:F000/64'),
>         ('fe80:12:1:29ff:fe2c:430:370:0F00/64'),
>         ('fe80:12:1:29ff:fe2c:430:370:00F0/64'),
>         ('fe80:12:1:29ff:fe2c:430:370:2/64');
>
> -------------------------+
>  number of rows inserted |
> -------------------------|
>                       10 |
> -------------------------+
>
> SELECT lookup.tag, entries.obj:host
>     FROM ipv6_lookup AS lookup, ipv6_entries AS entries
>     WHERE lookup.tag = 'east'
>     AND entries.obj:hex_ipv6 BETWEEN lookup.obj:hex_ipv6_range_start AND lookup.obj:hex_ipv6_range_end;
>
> ------+------------------------------------+
>  TAG  | ENTRIES.OBJ:HOST                   |
> ------+------------------------------------|
>  east | "fe80:12:1:29ff:fe2c:430:370:FFFF" |
>  east | "fe80:12:1:29ff:fe2c:430:370:F000" |
>  east | "fe80:12:1:29ff:fe2c:430:370:0F00" |
>  east | "fe80:12:1:29ff:fe2c:430:370:00F0" |
>  east | "fe80:12:1:29ff:fe2c:430:370:2"    |
> ------+------------------------------------+
> ```

---
title: PARSE_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/parse_json.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# PARSE_JSON

Interprets an input string as a JSON document, producing a [VARIANT](../data-types-semistructured.md) value.

You can use the PARSE_JSON function when you have input data in JSON format. This function can convert
data from JSON format to [ARRAY](../data-types-semistructured.md) or [OBJECT](../data-types-semistructured.md) data and store that
data directly in a VARIANT value. You can then analyze or manipulate the data.

By default, the function doesn’t allow duplicate keys in the JSON object, but you can set the
`'parameter'` argument to allow duplicate keys.

See also:
:   [TRY_PARSE_JSON](try_parse_json.md)

## Syntax

```sqlsyntax
PARSE_JSON( <expr> [ , '<parameter>' ] )
```

## Arguments

**Required:**

`expr`
:   An expression of string type (for example, VARCHAR) that holds valid JSON information.

**Optional:**

`'parameter'`
:   String constant that specifies the parameter used to search for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `d` | Allow duplicate keys in JSON objects. If a JSON object contains a duplicate key, the returned object has a single instance of that key with the last value specified for that key. |
    | `s` | Don’t allow duplicate keys in JSON objects (strict). This value is the default. |

## Returns

Returns a value of type VARIANT that contains a JSON document.

If the input is NULL, the function returns NULL.

This function doesn’t return a [structured type](../data-types-structured.md).

## Usage notes

* This function supports an input expression with a maximum size of 64 MB compressed.
* If the PARSE_JSON function is called with an empty string, or with a string containing only whitespace characters, then
  the function returns NULL (rather than raising an error), even though an empty string isn’t valid JSON. This behavior allows
  processing to continue rather than aborting if some inputs are empty strings.
* If the input is NULL, the output is also NULL. However, if the input string is `'null'`, then it is interpreted as a
  [JSON null](../../user-guide/semistructured-considerations.md) value so that the result isn’t SQL NULL, but instead a valid VARIANT value containing `null`.
  See the example below.
* When parsing decimal numbers, PARSE_JSON attempts to preserve the exactness of the representation by treating 123.45 as NUMBER(5,2),
  not as a DOUBLE value. However, numbers that use scientific notation (for example, 1.2345e+02), or numbers that cannot be stored as fixed-point
  decimals due to range or scale limitations, are stored as DOUBLE values. Because JSON does not represent values such as TIMESTAMP, DATE,
  TIME, or BINARY natively, these values must be represented as strings.
* In JSON, an object (also called a “dictionary” or a “hash”) is an unordered set of
  key-value pairs.

* TO_JSON and PARSE_JSON are (almost) converse or reciprocal functions.

  + The PARSE_JSON function takes a string as input and returns a JSON-compatible [VARIANT](../data-types-semistructured.md).
  + The TO_JSON function takes a JSON-compatible VARIANT and returns a string.

  The following is (conceptually) true if X is a string containing valid JSON:

  > `X = TO_JSON(PARSE_JSON(X));`

  For example, the following is (conceptually) true:

  > `'{"pi":3.14,"e":2.71}' = TO_JSON(PARSE_JSON('{"pi":3.14,"e":2.71}'))`

  However, the functions are not perfectly reciprocal because:

  + Empty strings, and strings with only whitespace, are not handled reciprocally. For example, the return value of
    `PARSE_JSON('')` is NULL, but the return value of `TO_JSON(NULL)` is NULL, not the reciprocal `''`.
  + The order of the key-value pairs in the string produced by TO_JSON is not predictable.
  + The string produced by TO_JSON can have less whitespace than the string passed to PARSE_JSON.

  For example, the following are equivalent JSON, but not equivalent strings:

  + `{"pi": 3.14, "e": 2.71}`
  + `{"e":2.71,"pi":3.14}`

## Examples

The following examples use the PARSE_JSON function.

### Storing values of different data types in a VARIANT column

This example stores different types of data in a VARIANT column by calling PARSE_JSON to parse strings.

Create and fill a table. The INSERT statement uses PARSE_JSON to insert VARIANT values in the `v` column
of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the [TYPEOF](typeof.md) function to show the data types of
the values stored in the VARIANT values.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

### Insert a JSON object with duplicate keys in a VARIANT value

Try to insert a JSON object with duplicate keys in a VARIANT value:

```sqlexample
INSERT INTO vartab
SELECT column1 AS n, PARSE_JSON(column2) AS v
  FROM VALUES (10, '{ "a" : "123", "b" : "456", "a": "789"} ')
     AS vals;
```

An error is returned because duplicate keys aren’t allowed by default:

```output
100069 (22P02): Error parsing JSON: duplicate object attribute "a", pos 31
```

Insert a JSON object with duplicate keys in a VARIANT value, and specify the `d` parameter to allow
duplicates:

```sqlexample
INSERT INTO vartab
SELECT column1 AS n, PARSE_JSON(column2, 'd') AS v
  FROM VALUES (10, '{ "a" : "123", "b" : "456", "a": "789"} ')
     AS vals;
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       1 |
+-------------------------+
```

A query on the table shows that only the value of the last duplicate key was inserted:

```sqlexample
SELECT v
  FROM vartab
  WHERE n = 10;
```

```output
+---------------+
| V             |
|---------------|
| {             |
|   "a": "789", |
|   "b": "456"  |
| }             |
+---------------+
```

### Handling NULL values with the PARSE_JSON and TO_JSON functions

The following example shows how PARSE_JSON and TO_JSON handle NULL values:

```sqlexample
SELECT TO_JSON(NULL), TO_JSON('null'::VARIANT),
       PARSE_JSON(NULL), PARSE_JSON('null');
```

```output
+---------------+--------------------------+------------------+--------------------+
| TO_JSON(NULL) | TO_JSON('NULL'::VARIANT) | PARSE_JSON(NULL) | PARSE_JSON('NULL') |
|---------------+--------------------------+------------------+--------------------|
| NULL          | "null"                   | NULL             | null               |
+---------------+--------------------------+------------------+--------------------+
```

### Comparing PARSE_JSON and TO_JSON

The following examples demonstrate the relationship between the PARSE_JSON and TO_JSON functions.

This example creates a table with a VARCHAR column and a VARIANT column. The INSERT statement inserts
a VARCHAR value, and the UPDATE statement generates a JSON value that corresponds with that VARCHAR value.

```sqlexample
CREATE OR REPLACE TABLE jdemo2 (
  varchar1 VARCHAR,
  variant1 VARIANT);

INSERT INTO jdemo2 (varchar1) VALUES ('{"PI":3.14}');

UPDATE jdemo2 SET variant1 = PARSE_JSON(varchar1);
```

This query shows that TO_JSON and PARSE_JSON are conceptually reciprocal functions:

```sqlexample
SELECT varchar1,
       PARSE_JSON(varchar1),
       variant1,
       TO_JSON(variant1),
       PARSE_JSON(varchar1) = variant1,
       TO_JSON(variant1) = varchar1
  FROM jdemo2;
```

```output
+-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------+
| VARCHAR1    | PARSE_JSON(VARCHAR1) | VARIANT1     | TO_JSON(VARIANT1) | PARSE_JSON(VARCHAR1) = VARIANT1 | TO_JSON(VARIANT1) = VARCHAR1 |
|-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------|
| {"PI":3.14} | {                    | {            | {"PI":3.14}       | True                            | True                         |
|             |   "PI": 3.14         |   "PI": 3.14 |                   |                                 |                              |
|             | }                    | }            |                   |                                 |                              |
+-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------+
```

However, the functions are not exactly reciprocal. Differences in whitespace or in the order of key-value
pairs can prevent the output from matching the input. For example:

```sqlexample
SELECT TO_JSON(PARSE_JSON('{"b":1,"a":2}')),
       TO_JSON(PARSE_JSON('{"b":1,"a":2}')) = '{"b":1,"a":2}',
       TO_JSON(PARSE_JSON('{"b":1,"a":2}')) = '{"a":2,"b":1}';
```

```output
+--------------------------------------+--------------------------------------------------------+--------------------------------------------------------+
| TO_JSON(PARSE_JSON('{"B":1,"A":2}')) | TO_JSON(PARSE_JSON('{"B":1,"A":2}')) = '{"B":1,"A":2}' | TO_JSON(PARSE_JSON('{"B":1,"A":2}')) = '{"A":2,"B":1}' |
|--------------------------------------+--------------------------------------------------------+--------------------------------------------------------|
| {"a":2,"b":1}                        | False                                                  | True                                                   |
+--------------------------------------+--------------------------------------------------------+--------------------------------------------------------+
```

### Comparing PARSE_JSON and TO_VARIANT

Although both the PARSE_JSON function and the [TO_VARIANT](to_variant.md) function can take a string and return
a VARIANT value, they are not equivalent. The following example creates a table with two VARIANT
columns. Then, it uses PARSE_JSON to insert a value into one column and TO_VARIANT to
insert a value into the other column.

```sqlexample
CREATE OR REPLACE TABLE jdemo3 (
  variant1 VARIANT,
  variant2 VARIANT);

INSERT INTO jdemo3 (variant1, variant2)
  SELECT
    PARSE_JSON('{"PI":3.14}'),
    TO_VARIANT('{"PI":3.14}');
```

The query below shows that the functions returned VARIANT values that
store values of different data types.

```sqlexample
SELECT variant1,
       TYPEOF(variant1),
       variant2,
       TYPEOF(variant2),
       variant1 = variant2
  FROM jdemo3;
```

```output
+--------------+------------------+-----------------+------------------+---------------------+
| VARIANT1     | TYPEOF(VARIANT1) | VARIANT2        | TYPEOF(VARIANT2) | VARIANT1 = VARIANT2 |
|--------------+------------------+-----------------+------------------+---------------------|
| {            | OBJECT           | "{\"PI\":3.14}" | VARCHAR          | False               |
|   "PI": 3.14 |                  |                 |                  |                     |
| }            |                  |                 |                  |                     |
+--------------+------------------+-----------------+------------------+---------------------+
```

---
title: PARSE_URL
source: https://docs.snowflake.com/en/sql-reference/functions/parse_url.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# PARSE_URL

Returns an [OBJECT](../data-types-semistructured.md) value that consists of all the components (fragment,
host, parameters, path, port, query, scheme) in a valid input URL/URI.

## Syntax

```sqlsyntax
PARSE_URL(<string>, [<permissive>])
```

## Arguments

**Required:**

`string`
:   String to parse.

**Optional:**

`permissive`
:   Flag that determines how parse errors are handled:

    * If set to `0`, parse errors cause the function to fail.
    * If set to `1`, parse errors result in an object with the `error`
      field set to the respective error message (and no other fields set).

    Default value is `0`.

## Returns

The function returns a value of type OBJECT.

If any input argument is NULL, the function returns NULL.

When an OBJECT value is returned, it contains the following key-value pairs:

| Key | Value |
| --- | --- |
| `fragment` | An anchor that points to a location. |
| `host` | The domain (address of a website or server). |
| `parameters` | Values passed to the website or server. |
| `path` | A resource’s location. |
| `port` | The port (connection endpoint for a process or service). |
| `query` | A query string passed to the website or server. |
| `scheme` | The protocol. |

## Examples

The following examples use the PARSE_URL function.

### Parse URLs in table data

Create a table and insert rows:

```sqlexample
CREATE OR REPLACE TABLE parse_url_test (id INT, sample_url VARCHAR);

INSERT INTO parse_url_test VALUES
  (1, 'mailto:abc@xyz.com'),
  (2, 'https://www.snowflake.com/'),
  (3, 'http://USER:PASS@EXAMPLE.INT:4345/HELLO.PHP?USER=1'),
  (4, NULL);

SELECT * FROM parse_url_test;
```

```output
+----+----------------------------------------------------+
| ID | SAMPLE_URL                                         |
|----+----------------------------------------------------|
|  1 | mailto:abc@xyz.com                                 |
|  2 | https://www.snowflake.com/                         |
|  3 | http://USER:PASS@EXAMPLE.INT:4345/HELLO.PHP?USER=1 |
|  4 | NULL                                               |
+----+----------------------------------------------------+
```

The following query shows the results of PARSE_URL for the sample URLs:

```sqlexample
SELECT PARSE_URL(sample_url) FROM parse_url_test;
```

```output
+------------------------------------+
| PARSE_URL(SAMPLE_URL)              |
|------------------------------------|
| {                                  |
|   "fragment": null,                |
|   "host": null,                    |
|   "parameters": null,              |
|   "path": "abc@xyz.com",           |
|   "port": null,                    |
|   "query": null,                   |
|   "scheme": "mailto"               |
| }                                  |
| {                                  |
|   "fragment": null,                |
|   "host": "www.snowflake.com",     |
|   "parameters": null,              |
|   "path": "",                      |
|   "port": null,                    |
|   "query": null,                   |
|   "scheme": "https"                |
| }                                  |
| {                                  |
|   "fragment": null,                |
|   "host": "USER:PASS@EXAMPLE.INT", |
|   "parameters": {                  |
|     "USER": "1"                    |
|   },                               |
|   "path": "HELLO.PHP",             |
|   "port": "4345",                  |
|   "query": "USER=1",               |
|   "scheme": "http"                 |
| }                                  |
| NULL                               |
+------------------------------------+
```

This query shows the host for each sample URL:

```sqlexample
SELECT PARSE_URL(sample_url):host FROM parse_url_test;
```

```output
+----------------------------+
| PARSE_URL(SAMPLE_URL):HOST |
|----------------------------|
| null                       |
| "www.snowflake.com"        |
| "USER:PASS@EXAMPLE.INT"    |
| NULL                       |
+----------------------------+
```

Return the rows where the port is `4345`:

```sqlexample
SELECT *
  FROM parse_url_test
  WHERE PARSE_URL(sample_url):port = '4345';
```

```output
+----+----------------------------------------------------+
| ID | SAMPLE_URL                                         |
|----+----------------------------------------------------|
|  3 | http://USER:PASS@EXAMPLE.INT:4345/HELLO.PHP?USER=1 |
+----+----------------------------------------------------+
```

Return the rows where the host is `www.snowflake.com`:

```sqlexample
SELECT *
  FROM parse_url_test
  WHERE PARSE_URL(sample_url):host = 'www.snowflake.com';
```

```output
+----+----------------------------+
| ID | SAMPLE_URL                 |
|----+----------------------------|
|  2 | https://www.snowflake.com/ |
+----+----------------------------+
```

### Parse invalid URLs

Parse an invalid URL that is missing the scheme. Set the `permissive`
argument to `0` to indicate that the function fails if the input
is invalid:

```sqlexample
SELECT PARSE_URL('example.int/hello.php?user=12#nofragment', 0);
```

```output
100139 (22000): Error parsing URL: scheme not specified
```

Parse an invalid URL, with the `permissive` argument set to `1` to
indicate that the function returns an OBJECT value that contains the error
message:

```sqlexample
SELECT PARSE_URL('example.int/hello.php?user=12#nofragment', 1);
```

```output
+----------------------------------------------------------+
| PARSE_URL('EXAMPLE.INT/HELLO.PHP?USER=12#NOFRAGMENT', 1) |
|----------------------------------------------------------|
| {                                                        |
|   "error": "scheme not specified"                        |
| }                                                        |
+----------------------------------------------------------+
```

---
title: PARSE_XML
source: https://docs.snowflake.com/en/sql-reference/functions/parse_xml.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# PARSE_XML

Interprets an input string as an [XML](../../user-guide/semistructured-data-formats.md) document, producing an [OBJECT](../data-types-semistructured.md) value.
If the input is NULL, the output is NULL.

See also:
:   [CHECK_XML](check_xml.md), [TO_XML](to_xml.md), [XMLGET](xmlget.md)

## Syntax

```sqlsyntax
PARSE_XML( <string_containing_xml> [ , <disable_auto_convert> ] )
```

```sqlsyntax
PARSE_XML( STR => <string_containing_xml>
  [ , DISABLE_AUTO_CONVERT => <disable_auto_convert> ] )
```

## Arguments

**Required:**

`string_containing_xml` . OR . `STR => string_containing_xml`
:   Specify an expression that evaluates to a VARCHAR value that contains valid XML.

**Optional:**

`disable_auto_convert` . OR . `DISABLE_AUTO_CONVERT => disable_auto_convert`
:   A Boolean expression that specifies whether or not the function should attempt to convert numeric and Boolean values in
    `string_containing_xml` to Snowflake data types. (For details about this conversion, see Usage Notes below.)

    * If you don’t want the function to convert these values, set this argument to `TRUE`. This setting
      has an effect that is similar to the `DISABLE_AUTO_CONVERT` parameter in [CREATE FILE FORMAT](../sql/create-file-format.md).
    * If you want the function to convert these values, set this argument to `FALSE` or omit this argument.

    Default: `FALSE`

## Returns

The data type of the returned value is OBJECT. The OBJECT contains an internal representation of the XML.

## Usage notes

* When you mix arguments by position and by name, all of the positional arguments must come before
  all of the named arguments.
* When you specify an argument by name, you can’t use double quotes around the argument name.

* The content of every element in XML documents is text. PARSE_XML attempts to convert some XML data from text to native
  (Snowflake SQL). For more information, see [SQL data types reference](../../sql-reference-data-types.md).

  + NUMERIC and BOOLEAN:

    PARSE_XML attempts to convert obviously numeric and Boolean values to the native representation in a way that printing
    these values back produces textually identical results. For example, when parsing decimal numbers, PARSE_XML attempts
    to preserve exactness of the representation by treating 123.45 as NUMBER(5,2), not as a DOUBLE. However:

    - Numbers in scientific notation (i.e. 1.2345e+02) or numbers that cannot be stored as fixed-point decimals due to range or
      scale limitations are stored as DOUBLE.
    - If the content of an XML element is a number with digits after the decimal point, then PARSE_XML might truncate trailing zeros.

    If you do not want the function to perform this conversion, pass `TRUE` for the `disable_auto_convert` argument.
  + TIMESTAMP, DATE, TIME, BINARY:

    Because XML doesn’t represent values such as TIMESTAMP, DATE, TIME, or BINARY natively, these have to be represented as strings
    in XML. PARSE_XML doesn’t automatically recognize these values. They are retained as strings, so convert
    the values from strings to native SQL data types if needed.
* XML attributes are an unordered collection of name/value pairs. The PARSE_XML function doesn’t necessarily
  preserve order. For example, converting text to XML and back to text might result in a string that contains the
  original information in a different order.
* You might see changes in whitespace between elements when converting from string to XML.
* When PARSE_XML is used to insert [numeric values](../data-types-numeric.md) into a VARIANT column that are a
  mix of integers (for example, INT or INTEGER) and values in decimal notation (for example, NUMBER or FLOAT), the function
  might add trailing zeros to the values.

  The following example uses PARSE_XML to insert a mix of integer values and values in decimal notation into a VARIANT column:

  ```sqlexample
  CREATE OR REPLACE TABLE test_xml_table(xmlcol VARIANT);

  INSERT INTO test_xml_table (
    SELECT PARSE_XML($1) FROM VALUES
      ('<c>3.1</c>'),
      ('<e>2</e>'),
      ('<b>0.123</b>'));
  ```

  Query the table:

  ```sqlexample
  SELECT * FROM test_xml_table;
  ```

  ```output
  +--------------+
  | XMLCOL       |
  |--------------|
  | <c>3.100</c> |
  | <e>2.000</e> |
  | <b>0.123</b> |
  +--------------+
  ```

  The output shows that trailing zeros were added to the values in the first two rows.

## Examples

The following example demonstrates how to use the PARSE_XML function to convert a string of XML to an OBJECT that can be inserted
into an OBJECT column:

```sqlexample
CREATE OR REPLACE TABLE xtab (v OBJECT);

INSERT INTO xtab SELECT PARSE_XML(column1) AS v
  FROM VALUES ('<a/>'), ('<a attr="123">text</a>'), ('<a><b>X</b><b>Y</b></a>');

SELECT * FROM xtab;
```

```output
+------------------------+
| V                      |
|------------------------|
| <a></a>                |
| <a attr="123">text</a> |
| <a>                    |
|   <b>X</b>             |
|   <b>Y</b>             |
| </a>                   |
+------------------------+
```

The following example demonstrates the differences between using and disabling the conversion of numeric values. In this example,
when the conversion isn’t disabled, the function interprets a number in scientific notation as a DOUBLE.

```sqlexample
SELECT PARSE_XML('<test>22257e111</test>'), PARSE_XML('<test>22257e111</test>', TRUE);
```

```output
+-------------------------------------+-------------------------------------------+
| PARSE_XML('<TEST>22257E111</TEST>') | PARSE_XML('<TEST>22257E111</TEST>', TRUE) |
|-------------------------------------+-------------------------------------------|
| <test>2.225700000000000e+115</test> | <test>22257e111</test>                    |
+-------------------------------------+-------------------------------------------+
```

The following example demonstrates how to specify the arguments to the function by name:

```sqlexample
SELECT PARSE_XML(STR => '<test>22257e111</test>', DISABLE_AUTO_CONVERT => TRUE);
```

```output
+--------------------------------------------------------------------------+
| PARSE_XML(STR => '<TEST>22257E111</TEST>', DISABLE_AUTO_CONVERT => TRUE) |
|--------------------------------------------------------------------------|
| <test>22257e111</test>                                                   |
+--------------------------------------------------------------------------+
```

---
title: PERCENT_RANK
source: https://docs.snowflake.com/en/sql-reference/functions/percent_rank.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (Ranking)

# PERCENT_RANK

Returns the relative rank of a value within a group of values, specified as a percentage ranging from 0.0 to 1.0.

## Syntax

```sqlsyntax
PERCENT_RANK()
  OVER ( [ PARTITION BY <expr1> ] ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <fixedRangeFrame> ] )
```

Where:

> ```sqlsyntax
> fixedRangeFrame ::=
>     {
>        RANGE UNBOUNDED PRECEDING
>      | RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
>      | RANGE BETWEEN CURRENT ROW AND UNBOUNDED FOLLOWING
>     }
> ```

## Usage notes

* `expr1` specifies the column (or expression) to partition by.

  For example, suppose that within each state or province, you want to rank
  farmers in order by the amount of corn they produced. In this case, you
  partition by state.

  If you want only a single group (e.g. you want to rank all farmers in the U.S.
  regardless of which state they live in), then omit the `PARTITION BY` clause.
* `expr2` specifies the column (or expression) that you want to rank by.

  For example, if you’re ranking farmers to see who produced the most corn
  (within their state), then you would use the `bushels_produced` column. For details,
  see Examples (in this topic).
* PERCENT_RANK is calculated as:

  > If n is 1:
  >
  > > `PERCENT_RANK = 0`
  >
  > If n is greater than 1:
  >
  > > `PERCENT_RANK = (r - 1) / (n - 1)`
  >
  > where `r` is the [RANK](rank.md) of the row and `n` is the number of rows in the window partition.
* Values range from 0.0 to 1.0. You can multiply by 100 to get a true percent.
* PERCENT_RANK supports range-based window frames with fixed boundaries only. For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).

## Examples

```sqlexample
SELECT
    exchange,
    symbol,
    PERCENT_RANK() OVER (PARTITION BY exchange ORDER BY price) AS percent_rank
  FROM trades;
```

```output
+--------+------+------------+
|exchange|symbol|PERCENT_RANK|
+--------+------+------------+
|C       |SPY   |         0.0|
|C       |AAPL  |         0.5|
|C       |AAPL  |         1.0|
|N       |YHOO  |         0.0|
|N       |QQQ   |         0.2|
|N       |QQQ   |         0.4|
|N       |SPY   |         0.6|
|N       |SPY   |         0.6|
|N       |AAPL  |         1.0|
|Q       |YHOO  |         0.0|
|Q       |YHOO  |         0.2|
|Q       |MSFT  |         0.4|
|Q       |MSFT  |         0.6|
|Q       |QQQ   |         0.8|
|Q       |QQQ   |         1.0|
|P       |YHOO  |         0.0|
|P       |MSFT  |        0.25|
|P       |MSFT  |         0.5|
|P       |SPY   |        0.75|
|P       |AAPL  |         1.0|
+--------+------+------------+
```

---
title: PERCENTILE_CONT
source: https://docs.snowflake.com/en/sql-reference/functions/percentile_cont.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# PERCENTILE_CONT

Return a percentile value based on a continuous distribution of the input
column (specified in `order_by_expr`). If no input row lies exactly
at the desired percentile, the result is calculated using linear interpolation
of the two nearest input values. NULL values are ignored in the calculation.

See also:
:   [PERCENTILE_DISC](percentile_disc.md)

## Syntax

**Aggregate function**

```sqlsyntax
PERCENTILE_CONT( <percentile> ) WITHIN GROUP (ORDER BY <order_by_expr>)
```

**Window function**

```sqlsyntax
PERCENTILE_CONT( <percentile> )
  WITHIN GROUP (ORDER BY <order_by_expr>)
  OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`percentile`
:   The percentile of the value that you want to find. The percentile must be a
    constant between 0.0 and 1.0. For example, if you want to find the value
    at the 90th percentile, specify 0.9.

`order_by_expr`
:   The expression (typically a column name) by which to order the values. For
    example, if you want to want to find the student whose math SAT score is at
    the 90th percentile, then specify the column containing the math SAT score.

    Note that this is also implicitly the column from which the returned value
    is chosen. For example, if you order by math SAT scores, then the result
    is one of the math SAT scores. You cannot order by one column and get
    a percentile value for a different column.

`expr3`
:   This is the optional expression used to group rows into partitions.

## Returns

Returns the value that is at the specified percentile. If no input row lies
exactly at the desired percentile, the result is calculated using linear
interpolation of the two nearest input values.

> **Note:**
>
> If a group contains only one value, then that value will be returned
> for any specified percentile (e.g. both percentile 0.0 and
> percentile 1.0 will return that one row).

## Usage notes

* The `percentile` argument to the function must be a constant.
* DISTINCT is not supported for this function.
* The function PERCENTILE_CONT interpolates between the two closest
  values, while the function PERCENTILE_DISC chooses the closest value
  rather than interpolating.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

The following example shows the values at the 25th percentile (0.25) within
various groups:

Create and populate a table with values:

```sqlexample
CREATE OR REPLACE TABLE aggr (k INT, v DECIMAL(10,2));

INSERT INTO aggr (k, v) VALUES
  (0,  0),
  (0, 10),
  (0, 20),
  (0, 30),
  (0, 40),
  (1, 10),
  (1, 20),
  (2, 10),
  (2, 20),
  (2, 25),
  (2, 30),
  (3, 60),
  (4, NULL);
```

Run a query and show the output (note that some values are exact and some are interpolated):

```sqlexample
SELECT k, PERCENTILE_CONT(0.25) WITHIN GROUP (ORDER BY v)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+-------------------------------------------------+
| K | PERCENTILE_CONT(0.25) WITHIN GROUP (ORDER BY V) |
|---+-------------------------------------------------|
| 0 |                                        10.00000 |
| 1 |                                        12.50000 |
| 2 |                                        17.50000 |
| 3 |                                        60.00000 |
| 4 |                                            NULL |
+---+-------------------------------------------------+
```

---
title: PERCENTILE_DISC
source: https://docs.snowflake.com/en/sql-reference/functions/percentile_disc.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window functions](../functions-window.md)

# PERCENTILE_DISC

Returns a percentile value based on a discrete distribution of the input
column (specified in `order_by_expr`). The returned value is that
whose row has the smallest CUME_DIST value that is greater than or equal to
the given percentile. NULL values are ignored in the calculation.

See also:
:   [PERCENTILE_CONT](percentile_cont.md)

## Syntax

**Aggregate function**

```sqlsyntax
PERCENTILE_DISC( <percentile> ) WITHIN GROUP (ORDER BY <order_by_expr> )
```

**Window function**

```sqlsyntax
PERCENTILE_DISC( <percentile> )
  WITHIN GROUP (ORDER BY <order_by_expr> )
  OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`percentile`
:   The percentile of the value that you want to find. The percentile must be a
    constant between 0.0 and 1.0. For example, if you want to find the value
    at the 90th percentile, specify 0.9.

`order_by_expr`
:   The expression (typically a column name) by which to order the values. For
    example, if you want to want to find the student whose math SAT score is at
    the 90th percentile, then specify the column containing the math SAT score.

    Note that this is also implicitly the column from which the returned value
    is chosen. For example, if you order by math SAT scores, then the result
    is one of the math SAT scores. You cannot order by one column and get
    a percentile value for a different column.

`expr3`
:   This is the optional expression used to group rows into partitions.

## Returns

Returns the value that is at the specified percentile.

## Usage notes

* The `percentile` argument to the function must be a constant.
* DISTINCT is not supported for this function.
* The function PERCENTILE_CONT interpolates between the two closest
  values, while the function PERCENTILE_DISC chooses the closest value
  rather than interpolating.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

The following example shows the values at the 25th percentile (0.25) within
various groups:

Create and populate a table with values:

```sqlexample
CREATE OR REPLACE TABLE aggr (k INT, v DECIMAL(10,2));

INSERT INTO aggr (k, v) VALUES
  (0,  0),
  (0, 10),
  (0, 20),
  (0, 30),
  (0, 40),
  (1, 10),
  (1, 20),
  (2, 10),
  (2, 20),
  (2, 25),
  (2, 30),
  (3, 60),
  (4, NULL);
```

Run a query and show the output:

```sqlexample
SELECT k, PERCENTILE_DISC(0.25) WITHIN GROUP (ORDER BY v)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+-------------------------------------------------+
| K | PERCENTILE_DISC(0.25) WITHIN GROUP (ORDER BY V) |
|---+-------------------------------------------------|
| 0 |                                           10.00 |
| 1 |                                           10.00 |
| 2 |                                           10.00 |
| 3 |                                           60.00 |
| 4 |                                            NULL |
+---+-------------------------------------------------+
```

---
title: PI
source: https://docs.snowflake.com/en/sql-reference/functions/pi.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# PI

Returns the value of pi as a floating-point value.

## Syntax

```sqlsyntax
PI()
```

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT PI();
```

```output
+-------------+
|        PI() |
|-------------|
| 3.141592654 |
+-------------+
```

---
title: PIPE_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/pipe_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# PIPE_USAGE_HISTORY

This table function can be used to query the history of data loaded into Snowflake tables using [Snowpipe](../../user-guide/data-load-snowpipe-intro.md) within a specified date range. The function returns the history of data loaded and credits billed for your entire Snowflake account.

> **Note:**
>
> This function returns pipe activity within the last 14 days.

## Syntax

```sqlsyntax
PIPE_USAGE_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ]
      [, PIPE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range, within the last 2 weeks, for which to retrieve the data load history:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the
      default is to show the previous 10 minutes of data load history). For example,
      if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

    History is displayed in increments of 5 minutes, 1 hour, or 24 hours (depending on the length of the specified range).

    If the range falls outside the last 15 days, an error is returned.

`PIPE_NAME => string`
:   A string specifying a pipe. Only data loads that use the specified pipe are returned.

    If a pipe name is not specified, then the PIPE_NAME column in the results displays NULL. Each row includes the totals for all pipes in use within the time range.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* Occasionally, the data compaction and maintenance process can consume Snowflake credits. For example, the returned results might show that you consumed credits with 0 BYTES_INSERTED and 0 FILES_INSERTED. This means that your data is not being loaded, but the data compaction and maintenance process has consumed some credits.
* Snowflake bills for auto-refresh notifications in external tables and directory tables on internal named stages or external stages at
  a rate equivalent to the Snowpipe file charge. You can estimate charges incurred by your external table and directory table auto-refresh
  notifications by querying the PIPE_USAGE_HISTORY function or examining the Account Usage [PIPE_USAGE_HISTORY view](../account-usage/pipe_usage_history.md). Note that the auto-refresh pipes will be listed under a NULL pipe
  name. You can also view your external table auto-refresh notification history at the table-level/stage-level granularity by using the
  Information Schema table function [AUTO_REFRESH_REGISTRATION_HISTORY](auto_refresh_registration_history.md).

  To avoid charges for auto-refresh notifications, perform a manual refresh for external tables and directory tables. For external tables, the
  ALTER EXTERNAL TABLE <name> REFRESH … statement can be used to manually synchronize your external table to external storage. For directory
  tables, the ALTER STAGE <name> REFRESH … statement can be used to manually synchronize the directory to external storage.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which data loads took place. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which data loads took place. |
| PIPE_NAME | TEXT | Name of the pipe used for a data load. Displays NULL if no pipe name is specified in the query. Each row includes the totals for all pipes in use within the time range. |
| CREDITS_USED | TEXT | Number of credits billed for Snowpipe data loads during the START_TIME and END_TIME window. |
| BYTES_INSERTED | NUMBER | Number of bytes loaded during the START_TIME and END_TIME window. |
| FILES_INSERTED | NUMBER | Number of files loaded during the START_TIME and END_TIME window. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Examples

Retrieve the data load history from a specific 30-minute range, in 5-minute periods, for all pipes in your account:

> ```sqlexample
> select *
>   from table(information_schema.pipe_usage_history(
>     date_range_start=>to_timestamp_tz('2017-10-24 12:00:00.000 -0700'),
>     date_range_end=>to_timestamp_tz('2017-10-24 12:30:00.000 -0700')));
> ```

Retrieve the data load history from the last 14 days, in 1-day periods, for all pipes in your account:

> ```sqlexample
> select *
>   from table(information_schema.pipe_usage_history(
>     date_range_start=>dateadd('day',-14,current_date()),
>     date_range_end=>current_date()));
> ```

Retrieve the data load history from the last 12 hours, in 1-hour periods, for a specified pipe in your account:

> ```sqlexample
> select *
>   from table(information_schema.pipe_usage_history(
>     date_range_start=>dateadd('hour',-12,current_timestamp()),
>     pipe_name=>'mydb.public.mypipe'));
> ```

Retrieve the data load history from the last 14 days, in 1-day periods, for a specified pipe in your account:

> ```sqlexample
> select *
>   from table(information_schema.pipe_usage_history(
>     date_range_start=>dateadd('day',-14,current_date()),
>     date_range_end=>current_date(),
>     pipe_name=>'mydb.public.mypipe'));
> ```

---
title: POLICY_CONTEXT
source: https://docs.snowflake.com/en/sql-reference/functions/policy_context.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md)

# POLICY_CONTEXT

Simulates the results of a query based upon the value of one or more context functions, which lets you determine how policies affect query
results. Context functions return a value based on the current context of a query: for example, who is executing the query or the account
from which the query is being executed. Policy bodies often use context functions to determine which value to return from the policy.

This function evaluates the following policies to determine the query results:

* [Masking policies](../../user-guide/security-column-intro.md)
* [Row access policies](../../user-guide/security-row-intro.md)
* [Aggregation policies](../../user-guide/aggregation-policies.md)
* [Join policies](../../user-guide/join-policies.md)
* [Projection policies](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
EXECUTE USING
POLICY_CONTEXT( <arg_1> => '<string_literal>' [ , <arg_2> => '<string_literal>' , ... , <arg_n> => '<string_literal>' ] )
AS
SELECT <query>
```

## Arguments

`arg_1 => 'string_literal'`
:   Specifies a context function and its value as a string.

    Required. You must specify at least one function and its value.

    Snowflake supports the following context functions and their values as arguments:

    * [CURRENT_USER](current_user.md)
    * [CURRENT_ROLE](current_role.md)
    * [CURRENT_AVAILABLE_ROLES](current_available_roles.md)
    * [CURRENT_ACCOUNT](current_account.md)

    To determine the format to use as a string value, execute a query using the function. For example:

    > ```sqlexample
    > SELECT CURRENT_USER();
    >
    > +----------------+
    > | CURRENT_USER() |
    > |----------------|
    > | JSMITH         |
    > +----------------+
    > ```

    The string value should be `'JSMITH'`.

    Note that if specifying CURRENT_AVAILABLE_ROLES and multiple role values, such as `ROLE1` and `ROLE2`, enclose the list of roles in square brackets as follows:

    > `['ROLE1', 'ROLE2']`

`arg_2 => 'string_literal' , ... , arg_n => 'string_literal'`
:   Specifies a comma-separated list of a context function and its value as a string.

    Optional.

`query`
:   Specifies the SQL expression to query one or more tables or views.

    Required.

## Usage notes

* This function requires the following:

  + At least one argument that specifies a supported context function and its value.
  + If a table is protected by a policy, the specified user or role must be granted the following privileges:

    - OWNERSHIP on the table or view, and
    - The APPLY privilege for the policy, either at the account-level or on the policy itself:

      * APPLY MASKING POLICY on ACCOUNT or APPLY on MASKING POLICY `policy_name`
      * APPLY ROW ACCESS POLICY on ACCOUNT or APPLY on ROW ACCESS POLICY `policy_name`
      * APPLY AGGREGATION POLICY on ACCOUNT or APPLY on AGGREGATION POLICY `policy_name`
      * APPLY JOIN POLICY on ACCOUNT or APPLY on JOIN POLICY `policy_name`
      * APPLY PROJECTION POLICY on ACCOUNT or APPLY on PROJECTION POLICY `policy_name`
* Snowflake returns an error message if any of the following conditions are true:

  + Using one or more unsupported functions as an argument. Snowflake only supports the functions listed in the Arguments section.
  + Not specifying a function string value properly, including using a string for a value that does not exist
    (e.g. no account, user, or role).
  + The SELECT `query` expression does not query a table or view properly (e.g. not specifying a table or view at all).
  + Certain data sharing uses cases (see the next bullet).
* Data sharing:

  + A data sharing consumer cannot use this function to simulate query results on tables or views that were made available by the data
    sharing provider.

    Additionally, if the consumer `query` expression includes a table or view made available through
    [Secure Data Sharing](../../user-guide/data-sharing-intro.md) and another table or view in the consumer account not associated with the
    data sharing provider account (i.e. their own table or view), Snowflake returns an error message.
  + A data sharing provider account can simulate how a data sharing consumer account views tables or views made available through a share.

    To do this, the data sharing provider specifies the consumer account name as the argument. For example:

    ```sqlexample
    execute using policy_context(current_account => '<consumer_account_name>') ... ;
    ```
* The result depends on the following:

  + The masking policy or projection policy that is set on a column, if any.
  + The row access policy, aggregation policy, or join policy that is set on the table or view, if any.
  + The policy definition(s).
  + The `query` expression.
  + The privileges granted to roles.
  + The roles granted to users (including role hierarchy).
  + The arguments in this function.
  > **Important:**
  >
  > If the result from this function is not what you expected:
  >
  > + Consult with your internal policy administrator to determine which tables, views, and columns are protected by policies, and
  >   to better understand the body definitions of those policies. This administrator might have a custom role like `POLICY_ADMIN`,
  >   `MASKING_ADMIN`, or `RAP_ADMIN`.
  > + Double-check the:
  >
  >   - Function string values.
  >   - `SELECT` `query` expression.
  >   - Privileges [granted to roles](../sql/grant-privilege.md)
  >     (e.g. SELECT on table or view, USAGE on parent database and schema) and the corresponding
  >     [privilege inheritance](../../user-guide/security-access-control-overview.md).
  >   - [Role hierarchy](../../user-guide/security-access-control-configure.md), especially if specifying the CURRENT_AVAILABLE_ROLES function and its values
  >     as an argument for this function.
  >
  > Update the SQL statement using this function, as needed, and try again.

## Examples

Simulate the effect of the PUBLIC system role querying the table `empl_info`:

> ```sqlexample
> EXECUTE USING POLICY_CONTEXT(CURRENT_ROLE => 'PUBLIC')
>   AS SELECT * FROM empl_info;
> ```

---
title: POLICY_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/policy_references.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# POLICY_REFERENCES

Returns a row for each object that has the specified policy assigned to the object or returns a row for each policy assigned to the
specified object.

See also: [POLICY_REFERENCES view](../account-usage/policy_references.md) (Account Usage View)

## Syntax

For network policy objects only:

> ```sqlsyntax
> POLICY_REFERENCES(
>       POLICY_NAME => '<string>' ,
>       POLICY_KIND => 'NETWORK_POLICY'
>       )
> ```

For other policy objects:

> ```sqlsyntax
> POLICY_REFERENCES(
>       POLICY_NAME => '<string>'
>       )
> ```

For all policy objects:

> ```sqlsyntax
> POLICY_REFERENCES(
>     REF_ENTITY_NAME => '<string>' ,
>     REF_ENTITY_DOMAIN => '<string>'
>     )
> ```

## Arguments

`POLICY_NAME => 'string'`
:   Specifies the policy name.

    * The entire policy name must be enclosed in single quotes.
    * If the policy name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes (i.e. `'"<policy_name>"'`).

    Currently, Snowflake supports the following policies when specifying the policy name as an argument:

    * [aggregation policies](../../user-guide/aggregation-policies.md)
    * [authentication policy](../../user-guide/authentication-policies.md)
    * [masking policy](../../user-guide/security-column-intro.md)
    * [network policy](../../user-guide/network-policies.md)
    * [packages policy](../../developer-guide/udf/python/packages-policy.md)
    * [password policy](../../user-guide/password-authentication.md)
    * [projection policies](../../user-guide/projection-policies.md)
    * [row access policy](../../user-guide/security-row-intro.md)
    * [session policy](../../user-guide/session-policies.md)
    * [storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md)

`POLICY_KIND => 'NETWORK_POLICY'`
:   Use this argument only when the `POLICY_NAME` value is a network policy. Do not use this argument when you specify the name of
    other kinds of policies.

`REF_ENTITY_NAME => 'string'`
:   The name of the object, such as the table name, view name, external table name, or username, on which the policy is set.

    * The entire object name must be enclosed in single quotes.
    * If the object name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes (i.e. `'"<table_name>"'`).

`REF_ENTITY_DOMAIN => 'string'`
:   The object type on which the policy is set.

    If the object is an external table, use `'TABLE'` as the argument value.

    If the object is a materialized view, use `'VIEW'` as the argument value.

    The supported domains are:

    * `'ACCOUNT'`
    * `'INTEGRATION'`
    * `'TABLE'`
    * `'TAG'`
    * `'USER'`
    * `'VIEW'`

## Returns

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The database in which the policy is set. |
| POLICY_SCHEMA | VARCHAR | The schema in which the policy is set. |
| POLICY_NAME | VARCHAR | The name of the policy. |
| POLICY_KIND | VARCHAR(17) | The type of policy in Snowflake. |
| REF_DATABASE_NAME | VARCHAR | The name of the database containing an object that the queried object references. |
| REF_SCHEMA_NAME | VARCHAR | The name of the schema containing an object that the queried object references. |
| REF_ENTITY_NAME | VARCHAR | The name of the object (i.e. table_name, view_name, external_table_name) on which the policy is set. |
| REF_ENTITY_DOMAIN | VARCHAR | The object type (i.e. table, view) on which the policy is set. |
| REF_COLUMN_NAME | VARCHAR | The column name on which the policy is set. |
| REF_ARG_COLUMN_NAMES | VARCHAR | Returns NULL for rows in the query result in which a masking policy is set. |
| TAG_DATABASE | VARCHAR | The name of the database containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_SCHEMA | VARCHAR | The name of the schema containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_NAME | VARCHAR | The name of the tag that has a policy assigned to it or NULL if a policy is not assigned to the tag. |
| POLICY_STATUS | VARCHAR | Specifies the status of the policy, which can be one of four possible values: `ACTIVE`, `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`, `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`, or `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`. |

Note the following for the POLICY_STATUS column:

> `ACTIVE`
> :   Specifies that the column (i.e. REF_COLUMN_NAME) is only associated with a single policy.
>
> `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`
> :   Specifies that multiple masking policies are assigned to the same column.
>
> `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`
> :   Specifies that the policy (i.e. POLICY_NAME) is a conditional masking policy and the table (i.e. REF_ENTITY_NAME) does not have a
>     column with the same name.
>
> `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`
> :   Specifies that the policy is a conditional masking policy and the table has a column with the same name but a different data type than
>     the data type in the masking policy signature.

## Usage notes

* Results are returned based on the privileges granted to the role executing the query:

  + If the role has the global APPLY MASKING POLICY privilege, Snowflake returns all masking policy associations in the query result.
  + If the role has the global APPLY ROW ACCESS POLICY privilege, Snowflake returns all row access policy associations in the query result.
  + If the role has the APPLY privilege on a given policy (e.g. APPLY on MASKING POLICY), Snowflake returns associations of that policy
    only for objects that are owned by the role executing the query.
  + If the role has the APPLY or OWNERSHIP privilege on the policy, but not OWNERSHIP on the table or view, Snowflake does not show policy
    associations in the query result. Having the SELECT privilege on the table or view is not enough.
  + If the role does not have any policy permissions but has the OWNERSHIP privilege on the table, Snowflake returns an error message and
    does not show policy associations.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function must
  use the fully-qualified object name. For more details, see [Snowflake Information Schema](../info-schema.md).
* Choose one syntax variation to execute a query. Mixing arguments results in errors and query failure.

  The arguments `ref_entity_name` and `ref_entity_domain` must be included together otherwise the query fails.
* Snowflake returns errors if the specified object name does not exist or if the query operator is not authorized to view any policy on the
  object. Snowflake can return a result set of policy associations if the operator is allowed to view a subset of the policy
  associations. Unsupported object types listed as the `ref_entity_domain` (e.g. `'stream'`) also return errors.
* Snowflake does not return a result set if the query operator does not have either the APPLY or OWNERSHIP privileges on the policy.

## Examples

Return a row for each object, such as a table or view, that has the masking policy named `ssn_mask` set on column:

> ```sqlexample
> use database my_db;
> use schema information_schema;
> select *
>   from table(information_schema.policy_references(policy_name => 'my_db.my_schema.ssn_mask'));
> ```

Return a row for each policy assigned to the table named `my_table`:

> ```sqlexample
> use database my_db;
> use schema information_schema;
> select *
>   from table(information_schema.policy_references(ref_entity_name => 'my_db.my_schema.my_table', ref_entity_domain => 'table'));
> ```

---
title: POSITION
source: https://docs.snowflake.com/en/sql-reference/functions/position.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# POSITION

Searches for the first occurrence of the first argument in the second argument and, if successful, returns the position (1-based) of the first argument in the second argument.

If you need to find the position beyond the first occurrence (for example, the third occurrence), you can use the [REGEXP_INSTR](regexp_instr.md) function.

Aliases:
:   [CHARINDEX](charindex.md)

    Note that the CHARINDEX function does not support one of the syntax variations that POSITION supports.

## Syntax

```sqlsyntax
POSITION( <expr1>, <expr2> [ , <start_pos> ] )

POSITION( <expr1> IN <expr2> )
```

## Arguments

**Required:**

`expr1`
:   A string or binary expression representing the value to look for.

`expr2`
:   A string or binary expression representing the value to search.

**Optional:**

`start_pos`
:   A number indicating the position at which to start the search (with `1` representing the start of `expr2`).

    Default: `1`

## Returns

This function returns a value of type NUMBER.

If any argument is NULL, the function returns NULL.

## Usage notes

* If the string or binary value is not found, the function returns `0`.
* If the specified optional `start_pos` is beyond the end of the second argument (the string to
  search), the function returns `0`.
* If the first argument is empty (for example, an empty string), the function returns `1`.
* The data types of the first two arguments must be the same (either two strings or two binary values).

## Collation details

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

The following examples use the POSITION function.

### VARCHAR expressions

Find the first occurrence of ‘an’ in ‘banana’:

```sqlexample
SELECT POSITION('an', 'banana', 1);
```

```output
+-----------------------------+
| POSITION('AN', 'BANANA', 1) |
|-----------------------------|
|                           2 |
+-----------------------------+
```

Find the first occurrence of ‘an’ in ‘banana’ at or after position 3. This search finds the second occurrence of ‘an’.

```sqlexample
SELECT POSITION('an', 'banana', 3);
```

```output
+-----------------------------+
| POSITION('AN', 'BANANA', 3) |
|-----------------------------|
|                           4 |
+-----------------------------+
```

Search for various characters, including unicode characters, in strings:

```sqlexample
SELECT n, h, POSITION(n IN h) FROM pos;
```

```output
+--------+---------------------+------------------+
| N      | H                   | POSITION(N IN H) |
|--------+---------------------+------------------|
|        |                     |                1 |
|        | sth                 |                1 |
| 43     | 41424344            |                5 |
| a      | NULL                |             NULL |
| dog    | catalog             |                0 |
| log    | catalog             |                5 |
| lésine | le péché, la lésine |               14 |
| nicht  | Ich weiß nicht      |               10 |
| sth    |                     |                0 |
| ☃c     | ☃a☃b☃c☃d            |                5 |
| ☃☃     | bunch of ☃☃☃☃       |               10 |
| ❄c     | ❄a☃c❄c☃             |                5 |
| NULL   | a                   |             NULL |
| NULL   | NULL                |             NULL |
+--------+---------------------+------------------+
```

### BINARY expressions

Because the values below are hexadecimal representations, a single BINARY byte is represented as two hex
digits.

In this example, the returned value is `3` because ‘EF’ matches the 3rd
byte (the first byte is ‘AB’; the second byte is ‘CD’, and the third byte
is ‘EF’):

```sqlexample
SELECT POSITION(X'EF', X'ABCDEF');
```

```output
+----------------------------+
| POSITION(X'EF', X'ABCDEF') |
|----------------------------|
|                          3 |
+----------------------------+
```

In this example, there is no match. Although the sequence ‘BC’ appears to be
in the value being searched, the ‘B’ is the second nybble of the first
byte, and the ‘C’ is the first nybble of the second byte. No byte actually
contains ‘BC’, so the returned value is `0` (not found).

```sqlexample
SELECT POSITION(X'BC', X'ABCD');
```

```output
+--------------------------+
| POSITION(X'BC', X'ABCD') |
|--------------------------|
|                        0 |
+--------------------------+
```

---
title: POW, POWER
source: https://docs.snowflake.com/en/sql-reference/functions/pow.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# POW, POWER

Returns a number (x) raised to the specified power (y).

## Syntax

```sqlsyntax
POW(x, y)

POWER (x, y)
```

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

```sqlexample
SELECT x, y, pow(x, y) FROM tab;

-----+-----+-------------+
  X  |  Y  |  POW(X, Y)  |
-----+-----+-------------+
 0.1 | 2   | 0.01        |
 2   | 3   | 8           |
 2   | 0.5 | 1.414213562 |
 2   | -1  | 0.5         |
-----+-----+-------------+
```

---
title: PREVIOUS_DAY
source: https://docs.snowflake.com/en/sql-reference/functions/previous_day.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# PREVIOUS_DAY

Returns the date of the first specified day of week (DOW) that occurs before the input date.

See also:
:   [LAST_DAY](last_day.md) , [NEXT_DAY](next_day.md)

## Syntax

```sqlsyntax
PREVIOUS_DAY( <date_or_timetamp_expr> , <dow> )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

`dow_string`
:   Specifies the day of week used to calculate the date for the previous day. The value can be a string literal or an expression that returns a string. The string
    must start with the first two characters (case-insensitive) of the day name:

    > * `su` (Sunday)
    > * `mo` (Monday)
    > * `tu` (Tuesday)
    > * `we` (Wednesday)
    > * `th` (Thursday)
    > * `fr` (Friday)
    > * `sa` (Saturday)

    Any leading spaces and trailing characters, including spaces, in the string are ignored.

## Returns

This function returns a value of type DATE, even if `date_or_timetamp_expr` is a timestamp.

## Examples

Return the date of the previous Friday that occurred before the current date:

```sqlexample
SELECT CURRENT_DATE() AS "Today's Date",
       PREVIOUS_DAY("Today's Date", 'Friday') AS "Previous Friday";
```

```output
+--------------+-----------------+
| Today's Date | Previous Friday |
|--------------+-----------------|
| 2025-05-06   | 2025-05-02      |
+--------------+-----------------+
```

Your output will be different because the example uses the [CURRENT_DATE](current_date.md) function.

---
title: PROMPT
source: https://docs.snowflake.com/en/sql-reference/functions/prompt.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# PROMPT

The PROMPT function constructs a structured OBJECT containing a template string and a list of arguments. This object is
useful for dynamically formatting messages, constructing structured prompts, or storing formatted data for further
processing, such as by Cortex AI functions.

## Syntax

```sqlsyntax
SELECT PROMPT('<template_string>', <expr_1> [ , <expr_2>, ... ] )
    FROM <table>;
```

## Arguments

**Required:**

`template_string`
:   A string containing numbered placeholders like `{0}` where the number is at least 0 and less than the number of expressions specified.
    The first expression is substituted for `{0}`, the second for `{1}`, and so on.

`expr_1 [ , expr_2, ... ]`
:   Expressions whose values will eventually be substituted into the template string in place of the numbered
    placeholders. These can be column names or other expressions. Values can be of any type coercible to a string (for
    example, VARCHAR, NUMBER, etc.), or FILE.

## Returns

A SQL OBJECT with the following structure:

```sqlexample
{
  'template': '<template_string>',
  'args': ARRAY(<value_1>, <value_2>, ...)
}
```

The `args` array contains the value of the expressions specified in the PROMPT function call.

## Usage notes

* PROMPT does not perform any string formatting itself. It is intended to construct an object to be consumed by Cortex AI functions.
* It is an error to use a placeholder in the template string that does not have a corresponding expression, but it is not an error
  to provide expressions that are not used in the template string.

## Examples

### Basic usage

```sqlsyntax
SELECT PROMPT('Hello, {0}! Today is {1}.', 'Alice', 'Monday');
```

Output:

```output
{
    'template': 'Hello, {0}! Today is {1}.',
    'args': ['Alice', 'Monday']
}
```

### Use with Cortex AI_FILTER

```sqlexample
WITH reviews AS (
    SELECT 'Wow... Loved this place.' AS review, 5 AS rating
    UNION ALL
    SELECT 'Crust is not good.', 2 AS rating
)
SELECT * FROM reviews
WHERE AI_FILTER(PROMPT('The reviewer enjoyed the restaurant: {0}, Rating: {1}', review, rating));
```

### Use with Cortex COMPLETE and a FILE column

```sqlexample
AI_COMPLETE('claude-4-sonnet',
    PROMPT('Classify the input image {0} in no more than 2 words. Respond in JSON', img_file)) AS image_classification
FROM image_table;
```

See [AI_COMPLETE (Prompt object)](ai_complete-prompt-object.md) for more examples.

---
title: QUERY_ACCELERATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/query_acceleration_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# QUERY_ACCELERATION_HISTORY

The QUERY_ACCELERATION_HISTORY function is used for querying the [query acceleration service](../../user-guide/query-acceleration-service.md)
history within a specified date range. The information returned includes the credits used for the query acceleration service at the
warehouse level for a given time frame.

## Syntax

```sqlsyntax
QUERY_ACCELERATION_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , WAREHOUSE_NAME => '<string>' ] )
```

## Parameters

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range to display the query acceleration history.

    For example, if you specify that the start date is 2019-05-03 and the end date 2019-05-05, you will get data for May 3, May 4, and May 5.
    (The endpoints are included.)

    * If neither a start date nor an end date is specified, the default will be the last 12 hours.
    * If an end date is not specified, but a start date is specified, then [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If a start date is not specified, but an end date is specified, then the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

`WAREHOUSE_NAME => string`
:   Warehouse name. If specified, only shows the history for the specified warehouse.

    If a warehouse name is not specified, then the results will include history for each warehouse using the query acceleration service.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the service was in use. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the service was in use. |
| CREDITS_USED | NUMBER | Number of credits used by the service. |
| WAREHOUSE_NAME | TEXT | Name of the warehouse where usage occurred. |
| NUM_FILES_SCANNED | NUMBER | Number of files scanned by the service. |
| NUM_BYTES_SCANNED | NUMBER | Number of bytes scanned by the service. |

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

---
title: QUERY_HISTORY , QUERY_HISTORY_BY_*
source: https://docs.snowflake.com/en/sql-reference/functions/query_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# QUERY_HISTORY , QUERY_HISTORY_BY_\*

You can use the QUERY_HISTORY family of table functions to query Snowflake query history along various dimensions:

* QUERY_HISTORY returns queries within a specified time range.
* QUERY_HISTORY_BY_SESSION returns queries within a specified session and time range.
* QUERY_HISTORY_BY_USER returns queries submitted by a specified user within a specified time range.
* QUERY_HISTORY_BY_WAREHOUSE returns queries executed by a specified warehouse within a specified time range.

Each function is optimized for querying along the specified dimension. The results can be further filtered using SQL predicates.

See also:

> [QUERY_HISTORY view](../account-usage/query_history.md) (Account Usage)
> [Monitor query activity with Query History](../../user-guide/ui-snowsight-activity.md) (Snowsight dashboard)

## Syntax

```sqlsyntax
QUERY_HISTORY(
      [ END_TIME_RANGE_START => <constant_expr> ]
      [, END_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ]
      [, INCLUDE_CLIENT_GENERATED_STATEMENT => <boolean_expr> ] )

QUERY_HISTORY_BY_SESSION(
      [ SESSION_ID => <constant_expr> ]
      [, END_TIME_RANGE_START => <constant_expr> ]
      [, END_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ]
      [, INCLUDE_CLIENT_GENERATED_STATEMENT => <boolean_expr> ] )

QUERY_HISTORY_BY_USER(
      [ USER_NAME => '<string>' ]
      [, END_TIME_RANGE_START => <constant_expr> ]
      [, END_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ]
      [, INCLUDE_CLIENT_GENERATED_STATEMENT => <boolean_expr> ] )

QUERY_HISTORY_BY_WAREHOUSE(
      [ WAREHOUSE_NAME => '<string>' ]
      [, END_TIME_RANGE_START => <constant_expr> ]
      [, END_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <num> ]
      [, INCLUDE_CLIENT_GENERATED_STATEMENT => <boolean_expr> ] )
```

## Arguments

All the arguments are optional.

`END_TIME_RANGE_START => constant_expr` , . `END_TIME_RANGE_END => constant_expr`
:   Time range (in TIMESTAMP_LTZ format), within the last 7 days, in which the query completed running:

    * If `END_TIME_RANGE_END` is not specified, the function returns all queries, including those that are still running.
    * If `END_TIME_RANGE_END` is [CURRENT_TIMESTAMP](current_timestamp.md), the function returns only those queries that have completed.

    If the time range does not fall within the last 7 days, an error is returned.

    > **Note:**
    >
    > If no start or end time is specified, the most recent queries are returned, up to the specified limit.

`SESSION_ID => constant_expr`
:   Applies only to QUERY_HISTORY_BY_SESSION

    The numeric identifier for a session or [CURRENT_SESSION](current_session.md). Only queries from the specified session are returned.

    Default: [CURRENT_SESSION](current_session.md)

`USER_NAME => 'string'`
:   Applies only to QUERY_HISTORY_BY_USER

    A string specifying a user login name or [CURRENT_USER](current_user.md). Only queries run by the specified user are returned. Note that the login name must be enclosed in single quotes. Also, if the
    login name contains any spaces, mixed-case characters, or special characters, the name must be double-quoted within the single quotes (e.g. `'"User 1"'` vs `'user1'`).
    You cannot specify `SYSTEM` (`USER_NAME =>'SYSTEM'`), which is a background service rather than a user. However, you can filter on `user_name='SYSTEM'` when you run queries against QUERY_HISTORY table functions.

    Default: [CURRENT_USER](current_user.md)

`WAREHOUSE_NAME => 'string'`
:   Applies only to QUERY_HISTORY_BY_WAREHOUSE

    A string specifying a warehouse name or [CURRENT_WAREHOUSE](current_warehouse.md). Only queries executed by that warehouse are returned. Note that the warehouse name must be enclosed in single quotes. Also, if the
    warehouse name contains any spaces, mixed-case characters, or special characters, the name must be double-quoted within the single quotes (e.g. `'"My Warehouse"'` vs `'mywarehouse'`).

    Default: [CURRENT_WAREHOUSE](current_warehouse.md)

`RESULT_LIMIT => num`
:   A number specifying the maximum number of rows returned by the function:

    If the number of matching rows is greater than this limit, the queries with the most recent end time (or those that are still executing) are returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `100`.

    > **Note:**
    >
    > When you select from a QUERY_HISTORY table function, the time range and RESULT_LIMIT arguments
    > are applied *first*, followed by the WHERE clause. To apply a filter on a larger range of
    > queries, increase the RESULT_LIMIT value.

`INCLUDE_CLIENT_GENERATED_STATEMENT => 'boolean_expr'`
:   Specifies whether client-generated statements are included in table function queries (given the value of the `is_client_generated_statement` column).

    Default: `FALSE`.

    The ACCOUNT_USAGE [QUERY_HISTORY view](../account-usage/query_history.md) also contains an `is_client_generated_statement` column, but queries of this view return all statements, whether or not they are client-generated. If necessary, you can filter the query result.

## Usage notes

* Returns queries run by the current user. Also returns queries run by any user when the executing role, or a higher role in a hierarchy, has either of the following privileges:

  + The MONITOR or OPERATE privilege on the user-managed warehouses where the queries were run.
  + The MONITOR or OPERATE privilege on the task. Exception: If the task executes an owner’s right stored procedure or UDF, the role requires at least MONITOR privilege on the warehouse on which the task executed to view the stored procedure query and the UDF query.
  + The MONITOR EXECUTION privilege on the account in which the task resides.
  + Exceptions: Neither [stored procedures](../../developer-guide/stored-procedure/stored-procedures-overview.md) nor [user-defined functions (UDFs)](../../developer-guide/udf/udf-overview.md) can run this query.

  For more information, see [Virtual warehouse privileges](../../user-guide/security-access-control-privileges.md).
* When you call an Information Schema table function, your session must use the INFORMATION_SCHEMA, *or* the function name must be fully-qualified. For more information, see [Snowflake Information Schema](../info-schema.md).
* The values for the columns `external_function_total_invocations`, `external_function_total_sent_rows`,
  `external_function_total_received_rows`, `external_function_total_sent_bytes`, and `external_function_total_received_bytes`
  are affected by many factors, including:

  + The number of external functions in the SQL statement.
  + The number of rows per batch sent to each remote service.
  + The number of retries due to transient errors (e.g. because a response was not received within the expected time).
* Canceled queries are identified by their `error_message` text (`SQL execution canceled`), not by their `execution_status` value.
* When you select from a QUERY_HISTORY table function, the function arguments (time range, RESULT_LIMIT)
  are applied first to retrieve rows, followed by any WHERE and LIMIT clauses in your query.
  For example, if RESULT_LIMIT is set to 100 (the default), the WHERE clause applies only to the most recent
  100 queries. To search a larger range of queries before filtering, increase the RESULT_LIMIT value.

### Query retry columns

A query might need to be retried one or more times in order to successfully complete. There can be multiple causes that result in a query
retry. Some of these causes are *actionable*, that is, a user can make changes to reduce or eliminate query retries for a specific query.
For example, if a query is retried due to an out of memory error, modifying warehouse settings might resolve the issue.

Some query retries are caused by a fault that is not actionable. That is, there is no change a user can make to prevent the
query retry. For example, a network outage might result in a query retry. In this case, there is no change to the query or to the
warehouse that executes it that can prevent the query retry.

The QUERY_RETRY_TIME, QUERY_RETRY_CAUSE, and FAULT_HANDLING_TIME columns can help you optimize queries that are retried and better
understand fluctuations in query performance.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| `query_id` | VARCHAR | The statement’s unique id. |
| `query_text` | VARCHAR | Text of the SQL statement. |
| `database_name` | VARCHAR | Database that was specified in the context of the query at compilation. |
| `schema_name` | VARCHAR | Schema that was specified in the context of the query at compilation. |
| `query_type` | VARCHAR | DML, query, etc. If the query is currently running, or the query failed, then the query type may be UNKNOWN. |
| `session_id` | NUMBER | Session that executed the statement. |
| `authn_event_id` | NUMBER | ID for the event for the authentication of the user for this query. This ID corresponds to the value in the `event_id` column in the [LOGIN_HISTORY](../account-usage/login_history.md) view. ^ |
| `user_name` | VARCHAR | User who issued the query. |
| `user_type` | VARCHAR | The type of user executing the query. It’s the same as the `type` column in the [USERS view](../account-usage/users.md). If a Snowpark Container Services service executes the query, the user type is SNOWFLAKE_SERVICE (see [Access service user query history](../../developer-guide/snowpark-container-services/spcs-execute-sql.md)). |
| `user_database_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |
| `user_schema_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |
| `role_name` | VARCHAR | Role that was active in the session at the time of the query. |
| `warehouse_name` | VARCHAR | Warehouse that the query executed on, if any. |
| `warehouse_size` | VARCHAR | Size of the warehouse when this statement executed. |
| `warehouse_type` | VARCHAR | Type of the warehouse when this statement executed. |
| `cluster_number` | NUMBER | The cluster (in a multi-cluster warehouse) that this statement executed on. |
| `query_tag` | VARCHAR | Query tag set for this statement through the QUERY_TAG session parameter. |
| `execution_status` | VARCHAR | Execution status for the query: resuming_warehouse, running, queued, blocked, success, failed_with_error, or failed_with_incident. |
| `error_code` | NUMBER | Error code, if the query returned an error |
| `error_message` | VARCHAR | Error message, if the query returned an error |
| `start_time` | TIMESTAMP_LTZ | Statement start time |
| `end_time` | TIMESTAMP_LTZ | Statement end time. If the query is still running, the `end_time` is the UNIX epoch timestamp (“1970-01-01 00:00:00”), adjusted for the local time zone. E.g. for Pacific Standard Time, this would be “1969-12-31 16:00:00.000 -0800”. |
| `total_elapsed_time` | NUMBER | Elapsed time (in milliseconds) |
| `bytes_scanned` | NUMBER | Number of bytes scanned by this statement. |
| `rows_produced` | NUMBER | Number of rows produced by this statement. |
| `compilation_time` | NUMBER | Compilation time (in milliseconds) |
| `execution_time` | NUMBER | Execution time (in milliseconds) |
| `queued_provisioning_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for the warehouse compute resources to provision, due to warehouse creation, resume, or resize. |
| `queued_repair_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for compute resources in the warehouse to be repaired. |
| `queued_overload_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, due to the warehouse being overloaded by the current query workload. |
| `transaction_blocked_time` | NUMBER | Time (in milliseconds) spent blocked by a concurrent DML. |
| `outbound_data_transfer_cloud` | VARCHAR | Target cloud provider for statements that unload data to another region and/or cloud. |
| `outbound_data_transfer_region` | VARCHAR | Target region for statements that unload data to another region and/or cloud. |
| `outbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in statements that unload data to another region and/or cloud. |
| `inbound_data_transfer_cloud` | VARCHAR | Source cloud provider for statements that load data from another region and/or cloud. |
| `inbound_data_transfer_region` | VARCHAR | Source region for statements that load data from another region and/or cloud. |
| `inbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in a replication operation from another account. The source account could be in the same region or a different region than the current account. |
| `list_external_file_time` | NUMBER | Time (in milliseconds) spent listing external files. |
| `credits_used_cloud_services` | NUMBER | Number of credits used for cloud services. |
| `release_version` | VARCHAR | Release version in the format of `major_release.minor_release.patch_release`. |
| `external_function_total_invocations` | NUMBER | The aggregate number of times that this query called remote services. For important details, see the Usage Notes. |
| `external_function_total_sent_rows` | NUMBER | The total number of rows that this query sent in all calls to all remote services. |
| `external_function_total_received_rows` | NUMBER | The total number of rows that this query received from all calls to all remote services. |
| `external_function_total_sent_bytes` | NUMBER | The total number of bytes that this query sent in all calls to all remote services. |
| `external_function_total_received_bytes` | NUMBER | The total number of bytes that this query received from all calls to all remote services. |
| `is_client_generated_statement` | BOOLEAN | Indicates whether the query was client-generated. |
| `query_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| `query_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| `query_parameterized_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| `query_parameterized_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
| `transaction_id` | NUMBER | [ID of the transaction](../transactions.md) that contains the statement or `0` if the statement is not executed within a transaction. |
| `query_acceleration_bytes_scanned` | NUMBER | Number of bytes scanned by the [query acceleration service](../../user-guide/query-acceleration-service.md). |
| `query_acceleration_partitions_scanned` | NUMBER | Number of partitions scanned by the query acceleration service. |
| `query_acceleration_upper_limit_scale_factor` | NUMBER | Upper limit [scale factor](../../user-guide/query-acceleration-service.md) that a query would have benefited from. |
| `bytes_written_to_result` | NUMBER | Number of bytes written to a result object. For example, `SELECT * FROM ...` would produce a set of results in tabular format representing each field in the selection. . . In general, the results object represents whatever is produced as a result of the query, and `bytes_written_to_result` represents the size of the returned result. |
| `rows_written_to_result` | NUMBER | Number of rows written to a result object. For CREATE TABLE AS SELECT (CTAS) and all DML operations, this result is `1`. |
| `rows_inserted` | NUMBER | Number of rows inserted by the query. |
| `query_retry_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by actionable errors. For more information, see Query retry columns. |
| `query_retry_cause` | VARCHAR | Error that caused the query to retry. If there is no query retry, the field is NULL. For more information, see Query retry columns. |
| `fault_handling_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by errors that are *not* actionable. For more information, see Query retry columns. |
| `bind_values` | ARRAY | Bind values in serialized form. If the query contains no bind values, then this column contains an empty array. If the array is too large or the [ALLOW_BIND_VALUES_ACCESS](../parameters.md) parameter is set to `FALSE`, this column contains NULL. For more information, see [Retrieve bind variable values](../bind-variables.md). |

The potential values for the `query_type` column include:

* CREATE_USER
* CREATE_ROLE
* CREATE_NETWORK_POLICY
* ALTER_ROLE
* ALTER_NETWORK_POLICY
* ALTER_ACCOUNT
* DROP_SEQUENCE
* DROP_USER
* DROP_ROLE
* DROP_NETWORK_POLICY
* RENAME_NETWORK_POLICY
* REVOKE

## Examples

Retrieve up to the last 100 queries run in the current session:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY_BY_SESSION())
  ORDER BY start_time;
```

Retrieve up to the last 100 queries run by the current user (or run by any user on any warehouse on which the current user has the MONITOR privilege):

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY())
  ORDER BY start_time;
```

Retrieve up to the last 100 queries run in the past hour by the current user (or run by any user on any warehouse on which the current user has the MONITOR privilege):

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY(DATEADD('hours',-1,CURRENT_TIMESTAMP()),CURRENT_TIMESTAMP()))
  ORDER BY start_time;
```

Retrieve all queries run by the current user (or run by any user on any warehouse on which the current user has the MONITOR privilege) within a specified 30-minute block of time in the past 7 days:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY(
    END_TIME_RANGE_START=>TO_TIMESTAMP_LTZ('2017-12-4 12:00:00.000 -0700'),
    END_TIME_RANGE_END=>TO_TIMESTAMP_LTZ('2017-12-4 12:30:00.000 -0700')));
```

Retrieve the number of client-generated statements that were run against a warehouse named `my_xsmall_wh`:

```sqlexample
SELECT COUNT(*)
  FROM TABLE(INFORMATION_SCHEMA.QUERY_HISTORY_BY_WAREHOUSE(
    WAREHOUSE_NAME => 'my_xsmall_wh',
    INCLUDE_CLIENT_GENERATED_STATEMENT => TRUE));
```

---
title: RADIANS
source: https://docs.snowflake.com/en/sql-reference/functions/radians.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# RADIANS

Converts degrees to radians.

## Syntax

```sqlsyntax
RADIANS( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

Show the results of calling the RADIANS function:

```sqlexample
SELECT RADIANS(0), RADIANS(60), RADIANS(180), RADIANS(360), RADIANS(720);
```

```output
+------------+-------------+--------------+--------------+--------------+
| RADIANS(0) | RADIANS(60) | RADIANS(180) | RADIANS(360) | RADIANS(720) |
|------------+-------------+--------------+--------------+--------------|
|          0 | 1.047197551 |  3.141592654 |  6.283185307 | 12.566370614 |
+------------+-------------+--------------+--------------+--------------+
```

---
title: RANDOM
source: https://docs.snowflake.com/en/sql-reference/functions/random.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# RANDOM

Each call returns a pseudo-random 64-bit integer.

> **Tip:**
>
> To generate a random integer in a specified range, use the RANDOM function with the
> [UNIFORM](uniform.md) function.

## Syntax

```sqlsyntax
RANDOM([seed])
```

## Arguments

**Optional:**

`seed`
:   The seed is an integer. Different seeds cause RANDOM to produce different output values.

    If no seed is provided, a random seed is chosen in a platform-specific manner.

## Usage notes

* If a SQL statement calls RANDOM with the same seed for each row, then RANDOM returns a different value for each row,
  even though the seed is the same.
* If a SQL statement calls RANDOM more than once with the same seed for the same row,
  then RANDOM returns the same value for each call for that row. For example, the following returns
  the same value twice for each row: `SELECT RANDOM(42), RANDOM(42) FROM table1`.

  See the example below.
* If a statement that calls RANDOM is executed more than once, there is no guarantee that RANDOM will
  generate the same set of values each time. This is true whether or not you specify a seed.

  Even if the same statement is called with the same data, RANDOM can produce different values. For example, this can
  occur when:

  + The number of worker threads is different.
  + The rows are processed in a different order.
* Random values are not necessarily unique values. Although duplicates are rare for a small number of calls,
  the odds of duplicates go up as the number of calls goes up. If you need unique values, consider using
  a sequence ([SEQ1 / SEQ2 / SEQ4 / SEQ8](seq1.md)) rather than a call to
  RANDOM. Choose a sequence with enough bits that it is unlikely to wrap around.
* Because the output is a finite integer and the values are generated by an algorithm rather than truly
  randomly, the function eventually “wraps around” and starts repeating sequences of values. However, the “period”
  (number of calls before wrapping) is extremely large: 2^19937 - 1.
* The output is only pseudo-random; the output can be predicted given enough
  information (including the algorithm and the seed).
* RANDOM implements a 64-bit
  [Mersenne twister](http://en.wikipedia.org/wiki/Mersenne_twister)
  algorithm known as MT19937-64.
* Generating pseudo-random numbers is somewhat expensive computationally;
  large numbers of calls to this function can consume significant resources.

## Examples

The following examples demonstrate how to use the RANDOM function. The values displayed in the output below might differ from
the values returned when you run these examples yourself.

The following example calls RANDOM without a seed. The output for each row is different.

```sqlexample
SELECT RANDOM() FROM TABLE(GENERATOR(ROWCOUNT => 3));
```

```output
+----------------------+
|             RANDOM() |
|----------------------|
|  -962378740685764490 |
|  2115408279841266588 |
| -3473099493125344079 |
+----------------------+
```

The following example calls RANDOM with the same seed for each row. Although the seed is a constant, the
output for each row is still different.

```sqlexample
SELECT RANDOM(4711) FROM TABLE(GENERATOR(ROWCOUNT => 3));
```

```output
+----------------------+
|         RANDOM(4711) |
|----------------------|
| -3581185414942383166 |
|  1570543588041465562 |
| -6684111782596764647 |
+----------------------+
```

The following example calls RANDOM multiple times within a single statement and doesn’t use a seed.
RANDOM returns different values within each row, as well as different values for different rows:

```sqlexample
SELECT RANDOM(), RANDOM() FROM TABLE(GENERATOR(ROWCOUNT => 3));
```

```output
+----------------------+----------------------+
|             RANDOM() |             RANDOM() |
|----------------------+----------------------|
|  3150854865719208303 | -5331309978450480587 |
| -8117961043441270292 |   738998101727879972 |
|  6683692108700370630 |  7526520486590420231 |
+----------------------+----------------------+
```

The following example calls RANDOM multiple times within a single statement and uses the same seed for each of
those calls. RANDOM returns the same value within each row, but different values for different rows:

```sqlexample
SELECT RANDOM(4711), RANDOM(4711) FROM TABLE(GENERATOR(ROWCOUNT => 3));
```

```output
+----------------------+----------------------+
|         RANDOM(4711) |         RANDOM(4711) |
|----------------------+----------------------|
| -3581185414942383166 | -3581185414942383166 |
|  1570543588041465562 |  1570543588041465562 |
| -6684111782596764647 | -6684111782596764647 |
+----------------------+----------------------+
```

---
title: RANDSTR
source: https://docs.snowflake.com/en/sql-reference/functions/randstr.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# RANDSTR

Returns a random string of specified `length`.

## Syntax

```sqlsyntax
RANDSTR( <length> , <gen> )
```

## Usage notes

* Individual characters are chosen uniformly at random from the following pool of characters: 0-9, a-z, A-Z.
* The value for the generator expression, `gen`, is used as the seed for this uniform random distribution. For more information about generator expressions, see [Usage notes](../functions-data-generation.md).

## Examples

```sqlexample
SELECT randstr(5, random()) FROM table(generator(rowCount => 5));

+----------------------+
| RANDSTR(5, RANDOM()) |
|----------------------|
| rM6ep                |
| nsWJ0                |
| IQi5H                |
| VBNvY                |
| wjk6y                |
+----------------------+
```

```sqlexample
SELECT randstr(5, 1234) FROM table(generator(rowCount => 5));

+------------------+
| RANDSTR(5, 1234) |
|------------------|
| E5tav            |
| E5tav            |
| E5tav            |
| E5tav            |
| E5tav            |
+------------------+
```

```sqlexample
SELECT randstr(abs(random()) % 10, random()) FROM table(generator(rowCount => 5));

+---------------------------------------+
| RANDSTR(ABS(RANDOM()) % 10, RANDOM()) |
|---------------------------------------|
| e                                     |
| iR                                    |
| qRwWl7W6                              |
|                                       |
| Yg                                    |
+---------------------------------------+
```

---
title: RANK
source: https://docs.snowflake.com/en/sql-reference/functions/rank.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (Ranking)

# RANK

Returns the rank of a value within an ordered group of values.

The rank value starts at 1 and continues up sequentially.

If two values are the same, they have the same rank.

## Syntax

```sqlsyntax
RANK() OVER ( [ PARTITION BY <expr1> ]
  ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

The function itself takes no arguments because it returns the rank (relative position) of the current row
within the window, which is ordered by `<expr2>`. The ordering of the window determines the rank, so there
is no need to pass an additional parameter to the RANK function.

## Usage notes

* `expr1`
  The column or expression to partition the window by.

  For example, suppose that within each state or province, you want to rank
  farmers in order by the amount of corn they produced. In this case, you
  partition by state.

  If you want only a single group (e.g. you want to rank all farmers in the U.S.
  regardless of which state they live in), then omit the PARTITION BY clause.
* `expr2`
  The column or expression to order (rank) by.

  For example, if you’re ranking farmers to see who produced the most corn
  (within their state), then you would use the `bushels_produced` column. For details,
  see Examples (in this topic).
* Tie values result in the same rank value; however, gaps in the sequence result from the tie values.

  For example, if the first three rows return `1`, RANK skips `2` and `3` and assigns `4` to the next row in the group.
* To avoid gaps, use the [DENSE_RANK](dense_rank.md) function instead.

## Examples

Create a table and data:

```sqlexample
CREATE OR REPLACE TABLE corn_production (farmer_id INTEGER, state VARCHAR, bushels FLOAT);

INSERT INTO corn_production (farmer_id, state, bushels) VALUES
  (1, 'Iowa', 100),
  (2, 'Iowa', 110),
  (3, 'Kansas', 120),
  (4, 'Kansas', 130);
```

Show farmers’ corn production in descending order, along with the rank of each
individual farmer’s production (highest = `1`):

```sqlexample
SELECT state, bushels,
    RANK() OVER (ORDER BY bushels DESC),
    DENSE_RANK() OVER (ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+-------------------------------------+-------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (ORDER BY BUSHELS DESC) |
|--------+---------+-------------------------------------+-------------------------------------------|
| Kansas |     130 |                                   1 |                                         1 |
| Kansas |     120 |                                   2 |                                         2 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     100 |                                   4 |                                         4 |
+--------+---------+-------------------------------------+-------------------------------------------+
```

Within each state, show farmers’ corn production in descending order, along with the rank of each
individual farmer’s production (highest = `1`):

```sqlexample
SELECT state, bushels,
    RANK() OVER (PARTITION BY state ORDER BY bushels DESC),
    DENSE_RANK() OVER (PARTITION BY state ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+--------------------------------------------------------+--------------------------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (PARTITION BY STATE ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (PARTITION BY STATE ORDER BY BUSHELS DESC) |
|--------+---------+--------------------------------------------------------+--------------------------------------------------------------|
| Iowa   |     110 |                                                      1 |                                                            1 |
| Iowa   |     100 |                                                      2 |                                                            2 |
| Kansas |     130 |                                                      1 |                                                            1 |
| Kansas |     120 |                                                      2 |                                                            2 |
+--------+---------+--------------------------------------------------------+--------------------------------------------------------------+
```

The query and output below show how tie values are handled by the RANK and DENSE_RANK functions. Note that for DENSE_RANK,
the ranks are `1`, `2`, `3`, `3`, `4`. Unlike with the output from the RANK function, the rank `4` is not skipped because there was a tie for rank `3`.

```sqlexample
INSERT INTO corn_production (farmer_id, state, bushels) VALUES
  (5, 'Iowa', 110);

SELECT state, bushels,
    RANK() OVER (ORDER BY bushels DESC),
    DENSE_RANK() OVER (ORDER BY bushels DESC)
  FROM corn_production;
```

```output
+--------+---------+-------------------------------------+-------------------------------------------+
| STATE  | BUSHELS | RANK() OVER (ORDER BY BUSHELS DESC) | DENSE_RANK() OVER (ORDER BY BUSHELS DESC) |
|--------+---------+-------------------------------------+-------------------------------------------|
| Kansas |     130 |                                   1 |                                         1 |
| Kansas |     120 |                                   2 |                                         2 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     110 |                                   3 |                                         3 |
| Iowa   |     100 |                                   5 |                                         4 |
+--------+---------+-------------------------------------+-------------------------------------------+
```

---
title: RATIO_TO_REPORT
source: https://docs.snowflake.com/en/sql-reference/functions/ratio_to_report.md
section: SQL Functions
---

Categories:
:   [Window functions](../functions-window.md) (General)

# RATIO_TO_REPORT

Returns the ratio of a value within a group to the sum of the values within the group. If `expr1` evaluates
to null or the sum of `expr1` within the group evaluates to 0, then RATIO_TO_REPORT returns null.

## Syntax

```sqlsyntax
RATIO_TO_REPORT( <expr1> ) [ OVER ( [ PARTITION BY <expr2> ]
  [ ORDER BY <expr3> ] [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] ) ]
```

## Arguments

`expr1`
:   This is an expression that evaluates to a numeric data type (INTEGER, FLOAT, DECIMAL, etc.).

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition. Note that for this function, the order within
    the partition does not affect the output.

    In this function, as in all window functions, this ORDER BY does not control the order of the entire query output.

## Usage notes

* RATIO_TO_REPORT is calculated as:

  > value of `expr1` argument for the current row / sum of `expr1` argument for the partition
* The ORDER BY clause within the OVER clause is allowed in this function for syntactic consistency with other
  window functions but does not affect the calculation. Snowflake recommends not including the ORDER BY
  clause when using this function.

## Examples

This simple example shows the percentage of a store chain’s profit that was generated by each individual store:

```sqlexample
CREATE TABLE store_profit (
  store_id INTEGER,
  province VARCHAR,
  profit NUMERIC(11, 2));

INSERT INTO store_profit (store_id, province, profit) VALUES
  (1, 'Ontario', 300),
  (2, 'Saskatchewan', 250),
  (3, 'Ontario', 450),
  (4, 'Ontario', NULL)  -- hasn't opened yet, so no profit yet.
  ;
```

```sqlexample
SELECT store_id, profit,
    100 * RATIO_TO_REPORT(profit) OVER () AS percent_profit
  FROM store_profit
  ORDER BY store_id;
```

```output
+----------+--------+----------------+
| STORE_ID | PROFIT | PERCENT_PROFIT |
|----------+--------+----------------|
|        1 | 300.00 |    30.00000000 |
|        2 | 250.00 |    25.00000000 |
|        3 | 450.00 |    45.00000000 |
|        4 |   NULL |           NULL |
+----------+--------+----------------+
```

This example shows the percentage of profit within each province that was generated by each store in that province:

```sqlexample
SELECT province, store_id, profit,
    100 * RATIO_TO_REPORT(profit) OVER (PARTITION BY province) AS percent_profit
  FROM store_profit
  ORDER BY province, store_id;
```

```output
+--------------+----------+--------+----------------+
| PROVINCE     | STORE_ID | PROFIT | PERCENT_PROFIT |
|--------------+----------+--------+----------------|
| Ontario      |        1 | 300.00 |    40.00000000 |
| Ontario      |        3 | 450.00 |    60.00000000 |
| Ontario      |        4 |   NULL |           NULL |
| Saskatchewan |        2 | 250.00 |   100.00000000 |
+--------------+----------+--------+----------------+
```

---
title: REDUCE
source: https://docs.snowflake.com/en/sql-reference/functions/reduce.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Higher-order)

# REDUCE

Reduces an [array](../data-types-semistructured.md) to a single value based on the logic in a lambda expression.

The REDUCE function takes an array, an initial accumulator value, and a lambda function. It applies the lambda
function to each element of the array, updating the accumulator with each result. After processing all elements,
REDUCE returns the final accumulator value.

See also:
:   [Use lambda functions on data with Snowflake higher-order functions](../../user-guide/querying-semistructured.md)

## Syntax

```sqlsyntax
REDUCE( <array> , <init> , <lambda_expression> )
```

## Arguments

`array`
:   The array that contains the elements to be reduced. The array can be semi-structured or structured.

`init`
:   The initial accumulator value.

`lambda_expression`
:   A [lambda expression](../../user-guide/querying-semistructured.md) that defines the reduce
    logic on each array element.

    The lambda expression must be specified in the following syntax:

    ```sqlsyntax
    <acc> [ <datatype> ] , <value> [ <datatype> ] -> <expr>
    ```

    The `acc` argument is the accumulator, and the `value` argument is the current element
    being processed in the array.

## Returns

This function can return a value of any data type.

If the input array is empty, then the function returns the initial value of the accumulator.

The function returns NULL in these cases:

* The input array is NULL.
* The initial value of the accumulator is NULL.
* The lambda function returns NULL.

## Usage notes

* When the data type for a lambda `value` argument is explicitly specified, the array element is coerced into the specified type
  before lambda invocation. For information about coercion, see [Data type conversion](../data-type-conversion.md).
* Type checking enforces that the initial value of the accumulator, the accumulator lambda argument, and the return value
  of the lambda execution all have the same logical and physical types. If [casting](../data-type-conversion.md)
  is used to meet this requirement, the largest physical type of the three is used.
* The `value` argument can have intermediate NULL values. For an example, see Skip NULL values in an array.

## Examples

The following examples use the REDUCE function.

### Calculate the sum of the values in an array

Use the REDUCE function to return the sum of the values in an array and specify `0` for the initial
accumulator value:

```sqlexample
SELECT REDUCE([1,2,3],
              0,
              (acc, val) -> acc + val
       ) AS sum_of_values;
```

```output
+---------------+
| SUM_OF_VALUES |
|---------------|
|             6 |
+---------------+
```

This example is the same as the previous example, but it specifies a structured array of type INT:

```sqlexample
SELECT REDUCE([1,2,3]::ARRAY(INT),
              0,
              (acc, val) -> acc + val
       ) AS sum_of_values_structured;
```

```output
+--------------------------+
| SUM_OF_VALUES_STRUCTURED |
|--------------------------|
|                        6 |
+--------------------------+
```

Use the REDUCE function to return the sum of the values in an array and specify `10` for the initial
accumulator value:

```sqlexample
SELECT REDUCE([1,2,3],
              10,
              (acc, val) -> acc + val
       ) AS sum_of_values_plus_10;
```

```output
+-----------------------+
| SUM_OF_VALUES_PLUS_10 |
|-----------------------|
|                    16 |
+-----------------------+
```

### Calculate the sum of the square of each value in an array

Use the REDUCE function to return the sum of the square of each value in the array, and specify `0`
for the initial accumulator value:

```sqlexample
SELECT REDUCE([1,2,3],
              0,
              (acc, val) -> acc + val * val
       ) AS sum_of_squares;
```

```output
+----------------+
| SUM_OF_SQUARES |
|----------------|
|             14 |
+----------------+
```

### Skip NULL values in an array

In this example, the `array` argument includes NULL values. When this array is passed to
the REDUCE function, the accumulator will have intermediate NULL values.

Use the REDUCE function to return the sum of the values in the array, and use the
[ZEROIFNULL](zeroifnull.md) function in the logic of the lambda expression to skip
NULL values in the array. The lambda expression uses the ZEROIFNULL function to process each value
in the array using the following logic:

* If `val` is NULL, then the result of the lambda expression is `acc + 0`.
* If `val` is not NULL, then the result of the lambda expression is `acc + val`.

Run the query:

```sqlexample
SELECT REDUCE([1,NULL,2,NULL,3,4],
              0,
              (acc, val) -> acc + ZEROIFNULL(val))
  AS sum_of_values_skip_null;
```

```output
+-------------------------+
| SUM_OF_VALUES_SKIP_NULL |
|-------------------------|
|                      10 |
+-------------------------+
```

### Generate string values

Use the REDUCE function to return a list of string values by concatenating each value
in the array:

```sqlexample
SELECT REDUCE(['a', 'b', 'c'],
              '',
              (acc, val) -> acc || ' ' || val
       ) AS string_values;
```

```output
+---------------+
| STRING_VALUES |
|---------------|
|  a b c        |
+---------------+
```

### Use an array for the accumulator

Use the REDUCE function along with the [ARRAY_PREPEND](array_prepend.md) function in the logic
of the lambda expression to return an array that reverses the order of the input array:

```sqlexample
SELECT REDUCE([1, 2, 3, 4],
              [],
              (acc, val) -> ARRAY_PREPEND(acc, val)
       ) AS reverse_order;
```

```output
+---------------+
| REVERSE_ORDER |
|---------------|
| [             |
|   4,          |
|   3,          |
|   2,          |
|   1           |
| ]             |
+---------------+
```

### Use conditional logic

Use the REDUCE function along with the [IFF](iff.md) function in the logic
of the lambda expression to perform an action based on conditional logic similar to an `if-then`
expression. This example uses the following logic in the lambda expression:

* If the array value is less than seven, then square it and add it to the accumulator.
* If the array value is greater than or equal to seven, then add it to the accumulator without
  squaring it.

```sqlexample
SELECT REDUCE([5,10,15],
              0,
              (acc, val) -> IFF(val < 7, acc + val * val, acc + val)
       ) AS conditional_logic;
```

```output
+-------------------+
| CONDITIONAL_LOGIC |
|-------------------|
|                50 |
+-------------------+
```

### Reduce an array of elements in a table to a single value

Assume you have a table named `orders` with the columns `order_id`, `order_date`, and `order_detail`. The
`order_detail` column is an array of the line items, their purchase quantity, and subtotal. The table contains
two rows of data. The following SQL statement creates this table and inserts the rows:

```sqlexample
CREATE OR REPLACE TABLE orders AS
  SELECT 1 AS order_id, '2024-01-01' AS order_date, [
    {'item':'UHD Monitor', 'quantity':3, 'subtotal':1500},
    {'item':'Business Printer', 'quantity':1, 'subtotal':1200}
  ] AS order_detail
  UNION
  SELECT 2 AS order_id, '2024-01-02' AS order_date, [
    {'item':'Laptop', 'quantity':5, 'subtotal':7500},
    {'item':'Noise-canceling Headphones', 'quantity':5, 'subtotal':1000}
  ] AS order_detail;

SELECT * FROM orders;
```

```output
+----------+------------+-------------------------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL                              |
|----------+------------+-------------------------------------------|
|        1 | 2024-01-01 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "UHD Monitor",                |
|          |            |     "quantity": 3,                        |
|          |            |     "subtotal": 1500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Business Printer",           |
|          |            |     "quantity": 1,                        |
|          |            |     "subtotal": 1200                      |
|          |            |   }                                       |
|          |            | ]                                         |
|        2 | 2024-01-02 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "Laptop",                     |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 7500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Noise-canceling Headphones", |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 1000                      |
|          |            |   }                                       |
|          |            | ]                                         |
+----------+------------+-------------------------------------------+
```

Use the REDUCE function to return the subtotal sum for all items in each order:

```sqlexample
SELECT order_id,
       order_date,
       REDUCE(o.order_detail,
              0,
              (acc, val) -> acc + val:subtotal
       ) AS subtotal_sum
  FROM orders o;
```

```output
+----------+------------+--------------+
| ORDER_ID | ORDER_DATE | SUBTOTAL_SUM |
|----------+------------+--------------|
|        1 | 2024-01-01 |         2700 |
|        2 | 2024-01-02 |         8500 |
+----------+------------+--------------+
```

Use the REDUCE function to return a list of the items sold in each order:

```sqlexample
SELECT order_id,
       order_date,
       REDUCE(o.order_detail,
              '',
              (acc, val) -> val:item || '\n' || acc
       ) AS items_sold
  FROM orders o;
```

```output
+----------+------------+-----------------------------+
| ORDER_ID | ORDER_DATE | ITEMS_SOLD                  |
|----------+------------+-----------------------------|
|        1 | 2024-01-01 | Business Printer            |
|          |            | UHD Monitor                 |
|          |            |                             |
|        2 | 2024-01-02 | Noise-canceling Headphones  |
|          |            | Laptop                      |
|          |            |                             |
+----------+------------+-----------------------------+
```

### Reference a table column in a lambda expression to reduce array elements in table data

Create a table with one column of type ARRAY and another column of type INT:

```sqlexample
CREATE OR REPLACE TABLE reduce_column_ref_demo AS
  SELECT [ 1, 2, 3 ] AS col1, 0 AS col2
  UNION
  SELECT [ 1, 2, 3 ] AS col1, 10 AS col2;

SELECT * FROM reduce_column_ref_demo;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
| [    |    0 |
|   1, |      |
|   2, |      |
|   3  |      |
| ]    |      |
| [    |   10 |
|   1, |      |
|   2, |      |
|   3  |      |
| ]    |      |
+------+------+
```

Use the REDUCE function to return the sum of the values in the array in each row by adding the value
in `col2` to the accumulator value:

```sqlexample
SELECT REDUCE(col1,
              10,
              (acc, val) -> (acc + col2) + val
       ) AS reduce_col_ref
  FROM reduce_column_ref_demo;
```

```output
+----------------+
| REDUCE_COL_REF |
|----------------|
|             16 |
|             46 |
+----------------+
```

---
title: REGEXP_COUNT
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_count.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_COUNT

Returns the number of times that a [pattern](../functions-regexp.md) occurs in a string.

## Syntax

```sqlsyntax
REGEXP_COUNT( <subject> ,
              <pattern>
                [ , <position>
                  [ , <parameters> ]
                ]
)
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`position`
:   Number of characters from the beginning of the string where the function starts searching for matches.
    The value must be a positive integer.

    Default: `1` (the search for a match starts at the first character on the left)

`parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

## Returns

Returns a value of type NUMBER. Returns NULL if any argument is NULL.

## Usage notes

See the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The following example counts occurrences of the word `was`. You can use the `\b` metacharacter to indicate
a word boundary. In the following example, matching begins at the first character in the string `w` and
ends at the last character in the string `s`, and so doesn’t match words that contain the string (such
as `washing`):

```sqlexample
SELECT REGEXP_COUNT('It was the best of times, it was the worst of times',
                    '\\bwas\\b',
                    1) AS result;
```

```output
+--------+
| RESULT |
|--------|
|      2 |
+--------+
```

The following example uses the `i` parameter for case-insensitive matching of the character `e`:

```sqlexample
SELECT REGEXP_COUNT('Excelence', 'e', 1, 'i') AS e_in_excelence;
```

```output
+----------------+
| E_IN_EXCELENCE |
|----------------|
|              4 |
+----------------+
```

The following example illustrates overlapping occurrences. Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE overlap (id NUMBER, a STRING);
INSERT INTO overlap VALUES (1,',abc,def,ghi,jkl,');
INSERT INTO overlap VALUES (2,',abc,,def,,ghi,,jkl,');

SELECT * FROM overlap;
```

```output
+----+----------------------+
| ID | A                    |
|----+----------------------|
|  1 | ,abc,def,ghi,jkl,    |
|  2 | ,abc,,def,,ghi,,jkl, |
+----+----------------------+
```

Run a query that uses REGEXP_COUNT to count the number of times that the following pattern
is found in each row: a punctuation mark followed by digits and letters, followed by a
punctuation mark.

```sqlexample
SELECT id,
       REGEXP_COUNT(a,
                    '[[:punct:]][[:alnum:]]+[[:punct:]]',
                    1,
                    'i') AS result
  FROM overlap;
```

```output
+----+--------+
| ID | RESULT |
|----+--------|
|  1 |      2 |
|  2 |      4 |
+----+--------+
```

The remaining examples use the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE regexp_count_demo (dt DATE, messages VARCHAR);

INSERT INTO regexp_count_demo (dt, messages) VALUES
  ('10-AUG-2025','ER-6842,LG-230,LG-150,ER-3379,ER-6210'),
  ('11-AUG-2025','LG-272,LG-605,LG-683,ER-5577'),
  ('12-AUG-2025','ER-2207,LG-551,LG-826,ER-6842');

SELECT * FROM regexp_count_demo;
```

```output
+------------+---------------------------------------+
| DT         | MESSAGES                              |
|------------+---------------------------------------|
| 2025-08-10 | ER-6842,LG-230,LG-150,ER-3379,ER-6210 |
| 2025-08-11 | LG-272,LG-605,LG-683,ER-5577          |
| 2025-08-12 | ER-2207,LG-551,LG-826,ER-6842         |
+------------+---------------------------------------+
```

The following query returns the total number of messages for each day by searching for the delimiter (`,`) and
adding one to the total:

```sqlexample
SELECT dt,
       REGEXP_COUNT(messages, ',') + 1 AS message_count
  FROM regexp_count_demo;
```

```output
+------------+---------------+
| DT         | MESSAGE_COUNT |
|------------+---------------|
| 2025-08-10 |             5 |
| 2025-08-11 |             4 |
| 2025-08-12 |             4 |
+------------+---------------+
```

Assume that errors always begin with `ER` followed by a hyphen and a four-digit number. The following
query counts the number of errors for each day:

```sqlexample
SELECT dt,
       REGEXP_COUNT(messages, '\\bER-[0-9]{4}') AS number_of_errors
  FROM regexp_count_demo;
```

```output
+------------+------------------+
| DT         | NUMBER_OF_ERRORS |
|------------+------------------|
| 2025-08-10 |                3 |
| 2025-08-11 |                1 |
| 2025-08-12 |                2 |
+------------+------------------+
```

---
title: REGEXP_INSTR
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_instr.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_INSTR

Returns the position of the specified occurrence of the regular expression pattern in the string subject.

See also [String functions (regular expressions)](../functions-regexp.md).

## Syntax

```sqlsyntax
REGEXP_INSTR( <subject> , <pattern> [ , <position> [ , <occurrence> [ , <option> [ , <regexp_parameters> [ , <group_num> ] ] ] ] ] )
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`position`
:   Number of characters from the beginning of the string where the function starts searching for matches.
    The value must be a positive integer.

    Default: `1` (the search for a match starts at the first character on the left)

`occurrence`
:   Specifies the first occurrence of the pattern from which to start returning matches.

    The function skips the first `occurrence - 1` matches. For example, if there are 5 matches and
    you specify `3` for the `occurrence` argument, the function ignores the first two matches and
    returns the third, fourth, and fifth matches.

    Default: `1`

`option`
:   Specifies whether to return the offset of the first character of the match (`0`) or the offset of the first character following the end of the match (`1`).

    Default: `0`

`regexp_parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

    > **Note:**
    >
    > By default, REGEXP_INSTR returns the begin or end character offset for the entire matching part of the subject.
    > However, if the `e` (for “extract”) parameter is specified, REGEXP_INSTR returns the begin or end
    > character offset for the part of the subject that matches the first sub-expression in the pattern.
    > If `e` is specified but a `group_num` is not also specified, then the `group_num`
    > defaults to 1 (the first group). If there is no sub-expression in the pattern, REGEXP_INSTR behaves as
    > if `e` was not set. For examples that use `e`, see Examples in this topic.

`group_num`
:   The `group_num` parameter specifies which group to extract. Groups are specified by using parentheses in
    the regular expression.

    If a `group_num` is specified, Snowflake allows extraction even if the `e` option was not
    also specified. The `e` option is implied.

    Snowflake supports up to 1024 groups.

    For examples that use `group_num`, see Examples of capture groups in this topic.

## Returns

Returns a value of type NUMBER.

If no match is found, returns `0`.

## Usage notes

* Positions are 1-based, not 0-based. For example, the position of the letter “M” in “MAN” is 1, not 0.
* For additional usage notes, see the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The following examples use the REGEXP_INSTR function.

### Basic examples

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE demo1 (id INT, string1 VARCHAR);
INSERT INTO demo1 (id, string1) VALUES
  (1, 'nevermore1, nevermore2, nevermore3.');
```

Search for a matching string. In this case, the string is `nevermore` followed by a single decimal digit
(for example, `nevermore1`). The example uses the [REGEXP_SUBSTR](regexp_substr.md) function to show the matching
substring:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'nevermore\\d') AS substring,
       REGEXP_INSTR( string1, 'nevermore\\d') AS position
  FROM demo1
  ORDER BY id;
```

```output
+----+-------------------------------------+------------+----------+
| ID | STRING1                             | SUBSTRING  | POSITION |
|----+-------------------------------------+------------+----------|
|  1 | nevermore1, nevermore2, nevermore3. | nevermore1 |        1 |
+----+-------------------------------------+------------+----------+
```

Search for a matching string, but starting at the fifth character in the string, rather than at the first character in the
string:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'nevermore\\d', 5) AS substring,
       REGEXP_INSTR( string1, 'nevermore\\d', 5) AS position
  FROM demo1
  ORDER BY id;
```

```output
+----+-------------------------------------+------------+----------+
| ID | STRING1                             | SUBSTRING  | POSITION |
|----+-------------------------------------+------------+----------|
|  1 | nevermore1, nevermore2, nevermore3. | nevermore2 |       13 |
+----+-------------------------------------+------------+----------+
```

Search for a matching string, but look for the third match rather than the first match:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'nevermore\\d', 1, 3) AS substring,
       REGEXP_INSTR( string1, 'nevermore\\d', 1, 3) AS position
  FROM demo1
  ORDER BY id;
```

```output
+----+-------------------------------------+------------+----------+
| ID | STRING1                             | SUBSTRING  | POSITION |
|----+-------------------------------------+------------+----------|
|  1 | nevermore1, nevermore2, nevermore3. | nevermore3 |       25 |
+----+-------------------------------------+------------+----------+
```

This query is nearly identical the previous query, but this one shows how to use the `option` argument to
indicate whether you want the position of the matching expression, or the position of the first character after the
matching expression:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'nevermore\\d', 1, 3) AS substring,
       REGEXP_INSTR( string1, 'nevermore\\d', 1, 3, 0) AS start_position,
       REGEXP_INSTR( string1, 'nevermore\\d', 1, 3, 1) AS after_position
  FROM demo1
  ORDER BY id;
```

```output
+----+-------------------------------------+------------+----------------+----------------+
| ID | STRING1                             | SUBSTRING  | START_POSITION | AFTER_POSITION |
|----+-------------------------------------+------------+----------------+----------------|
|  1 | nevermore1, nevermore2, nevermore3. | nevermore3 |             25 |             35 |
+----+-------------------------------------+------------+----------------+----------------+
```

This query shows that if you search for an occurrence beyond the last actual occurrence, the position returned is 0:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'nevermore', 1, 4) AS substring,
       REGEXP_INSTR( string1, 'nevermore', 1, 4) AS position
  FROM demo1
  ORDER BY id;
```

```output
+----+-------------------------------------+-----------+----------+
| ID | STRING1                             | SUBSTRING | POSITION |
|----+-------------------------------------+-----------+----------|
|  1 | nevermore1, nevermore2, nevermore3. | NULL      |        0 |
+----+-------------------------------------+-----------+----------+
```

### Examples of capture groups

This section shows how to use the “group” feature of regular expressions.

The first few examples in this section don’t use capture groups. The section starts with some simple examples,
then continues with examples that use capture groups.

These examples use the strings created below:

```sqlexample
CREATE OR REPLACE TABLE demo2 (id INT, string1 VARCHAR);

INSERT INTO demo2 (id, string1) VALUES
    (2, 'It was the best of times, it was the worst of times.'),
    (3, 'In    the   string   the   extra   spaces  are   redundant.'),
    (4, 'A thespian theater is nearby.');

SELECT * FROM demo2;
```

```output
+----+-------------------------------------------------------------+
| ID | STRING1                                                     |
|----+-------------------------------------------------------------|
|  2 | It was the best of times, it was the worst of times.        |
|  3 | In    the   string   the   extra   spaces  are   redundant. |
|  4 | A thespian theater is nearby.                               |
+----+-------------------------------------------------------------+
```

The strings have the following characteristics:

* The string with an `id` of `2` has multiple occurrences of the word “the”.
* The string with an `id` of `3` has multiple occurrences of the word “the” with extra blank spaces
  between the words.
* The string with an `id` of `4` has the character sequence “the” inside multiple words (“thespian”
  and “theater”), but without the word “the” by itself.

This example looks for the first occurrence of the word `the`, followed by one or more non-word characters (for example,
the whitespace separating words), followed by one or more word characters.

“Word characters” include not only the letters a-z and A-Z, but also the
underscore (“_”) and the decimal digits 0-9, but not whitespace, punctuation, and so on.

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'the\\W+\\w+') AS substring,
       REGEXP_INSTR(string1, 'the\\W+\\w+') AS position
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------------------------------------------------------+--------------+----------+
| ID | STRING1                                                     | SUBSTRING    | POSITION |
|----+-------------------------------------------------------------+--------------+----------|
|  2 | It was the best of times, it was the worst of times.        | the best     |        8 |
|  3 | In    the   string   the   extra   spaces  are   redundant. | the   string |        7 |
|  4 | A thespian theater is nearby.                               | NULL         |        0 |
+----+-------------------------------------------------------------+--------------+----------+
```

Starting from position 1 of the string, look for the second occurrence of the word `the`,
followed by one or more non-word characters, followed by one or more word characters.

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'the\\W+\\w+', 1, 2) AS substring,
       REGEXP_INSTR(string1, 'the\\W+\\w+', 1, 2) AS position
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------------------------------------------------------+-------------+----------+
| ID | STRING1                                                     | SUBSTRING   | POSITION |
|----+-------------------------------------------------------------+-------------+----------|
|  2 | It was the best of times, it was the worst of times.        | the worst   |       34 |
|  3 | In    the   string   the   extra   spaces  are   redundant. | the   extra |       22 |
|  4 | A thespian theater is nearby.                               | NULL        |        0 |
+----+-------------------------------------------------------------+-------------+----------+
```

This example is similar to the preceding example, but adds capture groups. Rather than returning the position of the
entire match, this query returns the position of only the group, which is the portion of the substring that matches the
part of the regular expression in parentheses. In this case, the returned value is the position of the word
after the second occurrence of the word `the`.

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'the\\W+(\\w+)', 1, 2,    'e', 1) AS substring,
       REGEXP_INSTR( string1, 'the\\W+(\\w+)', 1, 2, 0, 'e', 1) AS position
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------------------------------------------------------+-----------+----------+
| ID | STRING1                                                     | SUBSTRING | POSITION |
|----+-------------------------------------------------------------+-----------+----------|
|  2 | It was the best of times, it was the worst of times.        | worst     |       38 |
|  3 | In    the   string   the   extra   spaces  are   redundant. | extra     |       28 |
|  4 | A thespian theater is nearby.                               | NULL      |        0 |
+----+-------------------------------------------------------------+-----------+----------+
```

If you specify the `'e'` (extract) parameter, but don’t specify the `group_num`, then the `group_num`
defaults to `1`:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'the\\W+(\\w+)', 1, 2,    'e') AS substring,
       REGEXP_INSTR( string1, 'the\\W+(\\w+)', 1, 2, 0, 'e') AS position
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------------------------------------------------------+-----------+----------+
| ID | STRING1                                                     | SUBSTRING | POSITION |
|----+-------------------------------------------------------------+-----------+----------|
|  2 | It was the best of times, it was the worst of times.        | worst     |       38 |
|  3 | In    the   string   the   extra   spaces  are   redundant. | extra     |       28 |
|  4 | A thespian theater is nearby.                               | NULL      |        0 |
+----+-------------------------------------------------------------+-----------+----------+
```

If you specify a `group_num`, Snowflake assumes that you want to extract, even if you didn’t specify
`'e'` (extract) as one of the parameters:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'the\\W+(\\w+)', 1, 2,    '', 1) AS substring,
       REGEXP_INSTR( string1, 'the\\W+(\\w+)', 1, 2, 0, '', 1) AS position
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------------------------------------------------------+-----------+----------+
| ID | STRING1                                                     | SUBSTRING | POSITION |
|----+-------------------------------------------------------------+-----------+----------|
|  2 | It was the best of times, it was the worst of times.        | worst     |       38 |
|  3 | In    the   string   the   extra   spaces  are   redundant. | extra     |       28 |
|  4 | A thespian theater is nearby.                               | NULL      |        0 |
+----+-------------------------------------------------------------+-----------+----------+
```

This example shows how to retrieve the position of second word from the first, second, and third matches of
a two-word pattern in which the first word is `A`. This also shows that trying to go beyond the last
pattern causes Snowflake to return 0.

Create a table and insert data:

```sqlexample
CREATE TABLE demo3 (id INT, string1 VARCHAR);
INSERT INTO demo3 (id, string1) VALUES
  (5, 'A MAN A PLAN A CANAL');
```

Run the query:

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 1,    'e', 1) AS substring1,
       REGEXP_INSTR( string1, 'A\\W+(\\w+)', 1, 1, 0, 'e', 1) AS position1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 2,    'e', 1) AS substring2,
       REGEXP_INSTR( string1, 'A\\W+(\\w+)', 1, 2, 0, 'e', 1) AS position2,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 3,    'e', 1) AS substring3,
       REGEXP_INSTR( string1, 'A\\W+(\\w+)', 1, 3, 0, 'e', 1) AS position3,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 4,    'e', 1) AS substring4,
       REGEXP_INSTR( string1, 'A\\W+(\\w+)', 1, 4, 0, 'e', 1) AS position4
  FROM demo3;
```

```output
+----+----------------------+------------+-----------+------------+-----------+------------+-----------+------------+-----------+
| ID | STRING1              | SUBSTRING1 | POSITION1 | SUBSTRING2 | POSITION2 | SUBSTRING3 | POSITION3 | SUBSTRING4 | POSITION4 |
|----+----------------------+------------+-----------+------------+-----------+------------+-----------+------------+-----------|
|  5 | A MAN A PLAN A CANAL | MAN        |         3 | PLAN       |         9 | CANAL      |        16 | NULL       |         0 |
+----+----------------------+------------+-----------+------------+-----------+------------+-----------+------------+-----------+
```

This example shows how to retrieve the position of first, second, and third groups within the first occurrence of the pattern.
In this case, the returned values are the positions of the individual letters of the word `MAN`.

```sqlexample
SELECT id,
       string1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1,    'e', 1) AS substring1,
       REGEXP_INSTR( string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 0, 'e', 1) AS position1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1,    'e', 2) AS substring2,
       REGEXP_INSTR( string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 0, 'e', 2) AS position2,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1,    'e', 3) AS substring3,
       REGEXP_INSTR( string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 0, 'e', 3) AS position3
  FROM demo3;
```

```output
+----+----------------------+------------+-----------+------------+-----------+------------+-----------+
| ID | STRING1              | SUBSTRING1 | POSITION1 | SUBSTRING2 | POSITION2 | SUBSTRING3 | POSITION3 |
|----+----------------------+------------+-----------+------------+-----------+------------+-----------|
|  5 | A MAN A PLAN A CANAL | M          |         3 | A          |         4 | N          |         5 |
+----+----------------------+------------+-----------+------------+-----------+------------+-----------+
```

### Additional examples

The following example matches occurrences of the word `was`. Matching begins at the first character in the string
and returns the position in the string of the character following the first occurrence:

```sqlexample
SELECT REGEXP_INSTR('It was the best of times, it was the worst of times',
                    '\\bwas\\b',
                    1,
                    1) AS result;
```

```output
+--------+
| RESULT |
|--------|
|      4 |
+--------+
```

The following example returns the offset of the first character of the part of the string that matches the
pattern. Matching begins at the first character in the string and returns the first occurrence of the pattern:

```sqlexample
SELECT REGEXP_INSTR('It was the best of times, it was the worst of times',
                    'the\\W+(\\w+)',
                    1,
                    1,
                    0) AS result;
```

```output
+--------+
| RESULT |
|--------|
|      8 |
+--------+
```

The following example is the same as the previous example, but uses the `e` parameter to return the
character offset for the part of the subject that matches the first subexpression in the pattern (the
first set of word characters after `the`):

```sqlexample
SELECT REGEXP_INSTR('It was the best of times, it was the worst of times',
                    'the\\W+(\\w+)',
                    1,
                    1,
                    0,
                    'e') AS result;
```

```output
+--------+
| RESULT |
|--------|
|     12 |
+--------+
```

The following example matches occurrences of words ending in `st` preceded by two or more alphabetic characters
(case-insensitive). Matching begins at the fifteenth character in the string and returns the position in the string of
the character following the first occurrence (the beginning of `worst`):

```sqlexample
SELECT REGEXP_INSTR('It was the best of times, it was the worst of times',
                    '[[:alpha:]]{2,}st',
                    15,
                    1) AS result;
```

```output
+--------+
| RESULT |
|--------|
|     38 |
+--------+
```

To run the next set of examples, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE message(body VARCHAR(255));
INSERT INTO message VALUES
  ('Hellooo World'),
  ('How are you doing today?'),
  ('the quick brown fox jumps over the lazy dog'),
  ('PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS');
```

Return the offset of the first character in the first match that contains a
lowercase `o`:

```sqlexample
SELECT body,
       REGEXP_INSTR(body, '\\b\\S*o\\S*\\b') AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               |      1 |
| How are you doing today?                    |      1 |
| the quick brown fox jumps over the lazy dog |     11 |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     |      0 |
+---------------------------------------------+--------+
```

Return the offset of the first character in the first match that contains a
lowercase `o`, starting at the third character in the subject:

```sqlexample
SELECT body,
       REGEXP_INSTR(body, '\\b\\S*o\\S*\\b', 3) AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               |      3 |
| How are you doing today?                    |      9 |
| the quick brown fox jumps over the lazy dog |     11 |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     |      0 |
+---------------------------------------------+--------+
```

Return the offset of the first character in the third match that contains a
lowercase `o`, starting at the third character in the subject:

```sqlexample
SELECT body, REGEXP_INSTR(body, '\\b\\S*o\\S*\\b', 3, 3) AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               |      0 |
| How are you doing today?                    |     19 |
| the quick brown fox jumps over the lazy dog |     27 |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     |      0 |
+---------------------------------------------+--------+
```

Return the offset of the last character in the third match that contains a
lowercase `o`, starting at the third character in the subject:

```sqlexample
SELECT body, REGEXP_INSTR(body, '\\b\\S*o\\S*\\b', 3, 3, 1) AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               |      0 |
| How are you doing today?                    |     24 |
| the quick brown fox jumps over the lazy dog |     31 |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     |      0 |
+---------------------------------------------+--------+
```

Return the offset of the last character in the third match that contains a
lowercase `o`, starting at the third character in the subject, with case-insensitive matching:

```sqlexample
SELECT body, REGEXP_INSTR(body, '\\b\\S*o\\S*\\b', 3, 3, 1, 'i') AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               |      0 |
| How are you doing today?                    |     24 |
| the quick brown fox jumps over the lazy dog |     31 |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     |     35 |
+---------------------------------------------+--------+
```

---
title: REGEXP_LIKE
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_like.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_LIKE

Performs a comparison to determine whether a string matches a specified pattern. Both inputs
must be text expressions.

REGEXP_LIKE is similar to the [LIKE](like.md) function, but with
[POSIX extended regular expressions](http://en.wikipedia.org/wiki/Regular_expression#POSIX_basic_and_extended)
instead of SQL LIKE pattern syntax. REGEXP_LIKE supports more complex matching conditions than LIKE.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

Aliases:
:   [RLIKE](rlike.md) (1st syntax)

## Syntax

```sqlsyntax
REGEXP_LIKE( <subject> , <pattern> [ , <parameters> ] )
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

## Returns

Returns a BOOLEAN value or NULL:

* Returns TRUE if there is a match.
* Returns FALSE if there isn’t a match.
* Returns NULL if any argument is NULL.

## Usage Notes

* The function implicitly anchors a pattern at both ends (for example, `''` automatically becomes `'^$'`, and `'ABC'`
  automatically becomes `'^ABC$'`). For example, to match any string starting with `ABC`, the pattern is `'ABC.*'`.
* The backslash character (`\`) is the escape character. For more information, see [Specifying regular expressions in single-quoted string constants](../functions-regexp.md).
* For more usage notes, see the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation Details

Arguments with collation specifications currently aren’t supported.

## Examples

The following examples use the REGEXP_LIKE function:

* Run basic regular expression queries on strings
* Run regular expression queries on strings with special characters

For additional examples of regular expressions, see [REGEXP](regexp.md).

### Run basic regular expression queries on strings

Create a table with names of cities:

```sqlexample
CREATE OR REPLACE TABLE cities(city VARCHAR(20));

INSERT INTO cities VALUES
  ('Sacramento'),
  ('San Francisco'),
  ('San Luis Obispo'),
  ('San Jose'),
  ('Santa Barbara'),
  ('Palo Alto'),
  (NULL);
```

You can use `.*` as a wildcard to match as many characters as possible. The following example matches the
pattern `Fran` anywhere in the string value:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, '.*Fran.*');
```

```output
+---------------+
| CITY          |
|---------------|
| San Francisco |
+---------------+
```

The following example uses the `i` parameter for case-insensitive matching:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, '.*fran.*', 'i');
```

```output
+---------------+
| CITY          |
|---------------|
| San Francisco |
+---------------+
```

To find a pattern that matches the beginning of a string value, run a query that uses a wildcard:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, 'san.*', 'i');
```

```output
+-----------------+
| CITY            |
|-----------------|
| San Francisco   |
| San Luis Obispo |
| San Jose        |
| Santa Barbara   |
+-----------------+
```

To run a case-sensitive query with a wildcard, omit the `i` parameter:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, 'san.*');
```

```output
+------+
| CITY |
|------|
+------+
```

You can use the `\w+` metacharacter to match one word and `\s` metacharacter to match one whitespace character, such
as a space or a tab. The following query searches for the values that include one word, followed by a whitespace
character, followed by one word:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, '\\w+\\s\\w+');
```

```output
+---------------+
| CITY          |
|---------------|
| San Francisco |
| San Jose      |
| Santa Barbara |
| Palo Alto     |
+---------------+
```

The output for the query doesn’t include `San Luis Obispo` because that value has three words with
a space between the first and second words instead of only two words with a space in between them.

In a regular expression, you can often use an uppercase metacharacter to negate the meaning of a lowercase metacharacter. For
example, run a query that searches for the values that don’t include a whitespace character between two words by using the
`\S` metacharacter:

```sqlexample
SELECT * FROM cities WHERE REGEXP_LIKE(city, '\\w+\\S\\w+');
```

```output
+------------+
| CITY       |
|------------|
| Sacramento |
+------------+
```

### Run regular expression queries on strings with special characters

The examples in this section search for values with special characters, which are characters other than
a-z, A-Z, underscore (“_”), or decimal digit.

To search for a metacharacter, escape the metacharacter. For more information, see
[Specifying regular expressions in single-quoted string constants](../functions-regexp.md).

Create a table, and then insert some values with special characters:

```sqlexample
CREATE OR REPLACE TABLE regex_special_characters(v VARCHAR(20));

INSERT INTO regex_special_characters VALUES
  ('Snow'),
  ('Sn.ow'),
  ('Sn@ow'),
  ('Sn$ow'),
  ('Sn\\ow');
```

The first inserted value doesn’t contain special characters.

To show the data, query the table:

```sqlexample
SELECT * FROM regex_special_characters;
```

```output
+-------+
| V     |
|-------|
| Snow  |
| Sn.ow |
| Sn@ow |
| Sn$ow |
| Sn\ow |
+-------+
```

You can search for any special character by using the `\W` Perl backslash-sequence, which searches
for characters that aren’t “word” characters. For example, the following query searches for the values
in the table that have special characters:

```sqlexample
SELECT *
  FROM regex_special_characters
  WHERE REGEXP_LIKE(v, '.*\\W.*');
```

```output
+-------+
| V     |
|-------|
| Sn.ow |
| Sn@ow |
| Sn$ow |
| Sn\ow |
+-------+
```

To [search for metacharacters](../functions-regexp.md) in a single-quoted string constant, you must
escape the metacharacter with two backslashes. For example, the following query searches for the values that
contain the `$` metacharacter:

```sqlexample
SELECT *
  FROM regex_special_characters
  WHERE REGEXP_LIKE(v, '.*\\$.*');
```

```output
+-------+
| V     |
|-------|
| Sn$ow |
+-------+
```

If you search for a backslash, an additional backslash escape character is required. For example, the following
query searches for the values that contain the `\` or the `.` metacharacter:

```sqlexample
SELECT *
  FROM regex_special_characters
  WHERE REGEXP_LIKE(v, '.*(\\.|\\\\).*');
```

```output
+-------+
| V     |
|-------|
| Sn.ow |
| Sn\ow |
+-------+
```

---
title: REGEXP_REPLACE
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_replace.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_REPLACE

Returns the subject with the specified pattern — or all occurrences of the pattern — either removed
or replaced by a replacement string.

## Syntax

```sqlsyntax
 REGEXP_REPLACE( <subject> ,
                 <pattern>
                   [ , <replacement>
                     [ , <position>
                       [ , <occurrence>
                         [ , <parameters> ]
                       ]
                     ]
                   ]
)
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`replacement`
:   String that replaces the substrings matched by the pattern. If an empty string is specified, the function removes all matched patterns and returns the resulting string.

    Default: `''` (empty string).

`position`
:   Number of characters from the beginning of the string where the function starts searching for matches.
    The value must be a positive integer.

    Default: `1` (the search for a match starts at the first character on the left)

`occurrence`
:   Specifies which occurrence of the pattern to replace. If `0` is specified, all occurrences are replaced.

    Default: `0` (all occurrences)

`parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

## Returns

Returns a value of type VARCHAR.

If no matches are found, returns the original subject.

Returns NULL if any argument is NULL.

## Usage notes

* The replacement string can contain backreferences to capture groups; for example, sub-expressions of the pattern. A capture group is a regular expression that is enclosed within parentheses (`( )`). The maximum number of capture groups is nine.

  Backreferences match expressions inside a capture group. Backreferences have the form `n` where `n` is a value from 0 to 9, inclusive, which refers to the matching instance of the capture group. For more information, see Examples (in this topic).
* Parentheses (`( )`) and square brackets (`[ ]`) currently must be double-escaped to parse them as literal strings.

  The example below shows how to remove parentheses:

  ```sqlexample
  SELECT REGEXP_REPLACE('Customers - (NY)','\\(|\\)','') AS customers;
  ```

  ```output
  +----------------+
  | CUSTOMERS      |
  |----------------|
  | Customers - NY |
  +----------------+
  ```
* For additional usage notes, see the [General usage notes](../functions-regexp.md) for regular expression functions.

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The following example replaces all spaces in the string with nothing (that is, all spaces are removed):

```sqlexample
SELECT REGEXP_REPLACE('It was the best of times, it was the worst of times',
                      '( ){1,}',
                      '') AS result;
```

```output
+------------------------------------------+
| RESULT                                   |
|------------------------------------------|
| Itwasthebestoftimes,itwastheworstoftimes |
+------------------------------------------+
```

The following example matches the string `times` and replaces it with the string `days`. Matching begins at the first
character in the string and replaces the second occurrence of the substring:

```sqlexample
SELECT REGEXP_REPLACE('It was the best of times, it was the worst of times',
                      'times',
                      'days',
                      1,
                      2) AS result;
```

```output
+----------------------------------------------------+
| RESULT                                             |
|----------------------------------------------------|
| It was the best of times, it was the worst of days |
+----------------------------------------------------+
```

The following example uses backreferences to rearrange the string `firstname middlename lastname` as
`lastname, firstname middlename` and insert a comma between `lastname` and `firstname`:

```sqlexample
SELECT REGEXP_REPLACE('firstname middlename lastname',
                      '(.*) (.*) (.*)',
                      '\\3, \\1 \\2') AS name_sort;
```

```output
+--------------------------------+
| NAME_SORT                      |
|--------------------------------|
| lastname, firstname middlename |
+--------------------------------+
```

The remaining examples use the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE regexp_replace_demo(body VARCHAR(255));

INSERT INTO regexp_replace_demo values
  ('Hellooo World'),
  ('How are you doing today?'),
  ('the quick brown fox jumps over the lazy dog'),
  ('PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS');
```

The following example inserts the character `*` between every character of the subject, including the beginning and the end,
using an empty group (`()`), which finds a match between any two characters:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '()', '*') AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+-----------------------------------------------------------------------------------------+
| BODY                                        | REPLACED                                                                                |
|---------------------------------------------+-----------------------------------------------------------------------------------------|
| Hellooo World                               | *H*e*l*l*o*o*o* *W*o*r*l*d*                                                             |
| How are you doing today?                    | *H*o*w* *a*r*e* *y*o*u* *d*o*i*n*g* *t*o*d*a*y*?*                                       |
| the quick brown fox jumps over the lazy dog | *t*h*e* *q*u*i*c*k* *b*r*o*w*n* *f*o*x* *j*u*m*p*s* *o*v*e*r* *t*h*e* *l*a*z*y* *d*o*g* |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | *P*A*C*K* *M*Y* *B*O*X* *W*I*T*H* *F*I*V*E* *D*O*Z*E*N* *L*I*Q*U*O*R* *J*U*G*S*         |
+---------------------------------------------+-----------------------------------------------------------------------------------------+
```

The following example removes all of the vowels by replacing them with nothing, regardless of their order or case:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '[aeiou]', '', 1, 0, 'i') AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+----------------------------------+
| BODY                                        | REPLACED                         |
|---------------------------------------------+----------------------------------|
| Hellooo World                               | Hll Wrld                         |
| How are you doing today?                    | Hw r y dng tdy?                  |
| the quick brown fox jumps over the lazy dog | th qck brwn fx jmps vr th lzy dg |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PCK MY BX WTH FV DZN LQR JGS     |
+---------------------------------------------+----------------------------------+
```

The following example removes all words that contain the lowercase letter `o` from the subject by matching
a word boundary (`\b`), followed by zero or more word characters (`\S`), the letter `o`, and then zero or more
word characters until the next word boundary:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '\\b(\\S*)o(\\S*)\\b') AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+-----------------------------------------+
| BODY                                        | REPLACED                                |
|---------------------------------------------+-----------------------------------------|
| Hellooo World                               |                                         |
| How are you doing today?                    |  are   ?                                |
| the quick brown fox jumps over the lazy dog | the quick   jumps  the lazy             |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS |
+---------------------------------------------+-----------------------------------------+
```

The following example replaces all words that contain the lowercase letter `o`, swapping
the letters in front of and behind the first instance of `o`, and replacing the `o` with the character
sequence `@@`:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '\\b(\\S*)o(\\S*)\\b', '\\2@@\\1') AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+-------------------------------------------------+
| BODY                                        | REPLACED                                        |
|---------------------------------------------+-------------------------------------------------|
| Hellooo World                               | @@Helloo rld@@W                                 |
| How are you doing today?                    | w@@H are u@@y ing@@d day@@t?                    |
| the quick brown fox jumps over the lazy dog | the quick wn@@br x@@f jumps ver@@ the lazy g@@d |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS         |
+---------------------------------------------+-------------------------------------------------+
```

The following example is the same as the previous example, but the replacement starts at position `3` in the subject:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '\\b(\\S*)o(\\S*)\\b', '\\2@@\\1', 3) AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+-------------------------------------------------+
| BODY                                        | REPLACED                                        |
|---------------------------------------------+-------------------------------------------------|
| Hellooo World                               | He@@lloo rld@@W                                 |
| How are you doing today?                    | How are u@@y ing@@d day@@t?                     |
| the quick brown fox jumps over the lazy dog | the quick wn@@br x@@f jumps ver@@ the lazy g@@d |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS         |
+---------------------------------------------+-------------------------------------------------+
```

The following example is the same as the previous example, but only the third occurrence is replaced, starting at position
`3` in the subject:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '\\b(\\S*)o(\\S*)\\b', '\\2@@\\1', 3, 3) AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+----------------------------------------------+
| BODY                                        | REPLACED                                     |
|---------------------------------------------+----------------------------------------------|
| Hellooo World                               | Hellooo World                                |
| How are you doing today?                    | How are you doing day@@t?                    |
| the quick brown fox jumps over the lazy dog | the quick brown fox jumps ver@@ the lazy dog |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS      |
+---------------------------------------------+----------------------------------------------+
```

The following example is the same as the previous example, but it uses case-insensitive matching:

```sqlexample
SELECT body,
       REGEXP_REPLACE(body, '\\b(\\S*)o(\\S*)\\b', '\\2@@\\1', 3, 3, 'i') AS replaced
  FROM regexp_replace_demo;
```

```output
+---------------------------------------------+----------------------------------------------+
| BODY                                        | REPLACED                                     |
|---------------------------------------------+----------------------------------------------|
| Hellooo World                               | Hellooo World                                |
| How are you doing today?                    | How are you doing day@@t?                    |
| the quick brown fox jumps over the lazy dog | the quick brown fox jumps ver@@ the lazy dog |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | PACK MY BOX WITH FIVE DOZEN R@@LIQU JUGS     |
+---------------------------------------------+----------------------------------------------+
```

---
title: REGEXP_SUBSTR
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_substr.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_SUBSTR

Returns the substring that matches a [regular expression](../functions-regexp.md)
within a string.

## Syntax

```sqlsyntax
REGEXP_SUBSTR( <subject> ,
               <pattern>
                 [ , <position>
                   [ , <occurrence>
                     [ , <regex_parameters>
                       [ , <group_num> ]
                     ]
                   ]
                 ]
)
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`position`
:   Number of characters from the beginning of the string where the function starts searching for matches.
    The value must be a positive integer.

    Default: `1` (the search for a match starts at the first character on the left)

`occurrence`
:   Specifies the first occurrence of the pattern from which to start returning matches.

    The function skips the first `occurrence - 1` matches. For example, if there are 5 matches and
    you specify `3` for the `occurrence` argument, the function ignores the first two matches and
    returns the third, fourth, and fifth matches.

    Default: `1`

`regex_parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

    > **Note:**
    >
    > By default, REGEXP_SUBSTR returns the entire matching part of the subject.
    > However, if the `e` (for “extract”) parameter is specified, REGEXP_SUBSTR returns the
    > part of the subject that matches the first group in the pattern.
    > If `e` is specified but a `group_num` is not also specified, then the `group_num`
    > defaults to 1 (the first group). If there is no sub-expression in the pattern, REGEXP_SUBSTR behaves as
    > if `e` was not set. For examples that use `e`, see Examples in this topic.

`group_num`
:   Specifies which group to extract. Groups are specified by using parentheses in
    the regular expression.

    If a `group_num` is specified, Snowflake allows extraction even if the `'e'` option was not
    also specified. The `'e'` is implied.

    Snowflake supports up to 1024 groups.

    For examples that use `group_num`, see the Examples in this topic.

## Returns

The function returns a value of type VARCHAR that is the matching substring.

The function returns NULL in the following cases:

* No match is found.
* Any argument is NULL.

## Usage notes

For additional information on using regular expressions, see [String functions (regular expressions)](../functions-regexp.md).

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The documentation of the [REGEXP_INSTR](regexp_instr.md) function contains many examples that use both REGEXP_SUBSTR and
REGEXP_INSTR. You might want to look at those examples, too.

These examples use the strings created below:

```sqlexample
CREATE OR REPLACE TABLE demo2 (id INT, string1 VARCHAR);

INSERT INTO demo2 (id, string1) VALUES
    (2, 'It was the best of times, it was the worst of times.'),
    (3, 'In    the   string   the   extra   spaces  are   redundant.'),
    (4, 'A thespian theater is nearby.');

SELECT * FROM demo2;
```

```output
+----+-------------------------------------------------------------+
| ID | STRING1                                                     |
|----+-------------------------------------------------------------|
|  2 | It was the best of times, it was the worst of times.        |
|  3 | In    the   string   the   extra   spaces  are   redundant. |
|  4 | A thespian theater is nearby.                               |
+----+-------------------------------------------------------------+
```

The strings have the following characteristics:

* The string with an `id` of `2` has multiple occurrences of the word “the”.
* The string with an `id` of `3` has multiple occurrences of the word “the” with extra blank spaces
  between the words.
* The string with an `id` of `4` has the character sequence “the” inside multiple words (“thespian”
  and “theater”), but without the word “the” by itself.

The following examples call the REGEXP_SUBSTR function:

* Calling the REGEXP_SUBSTR function in a SELECT list
* Calling the REGEXP_SUBSTR function in a WHERE clause

### Calling the REGEXP_SUBSTR function in a SELECT list

Call the REGEXP_SUBSTR function in a SELECT list to extract or display values that match a pattern.

This example looks for first occurrence of the word `the`, followed by one or more non-word characters — for example,
the whitespace separating words — followed by one or more word characters.

“Word characters” include not only the letters a-z and A-Z, but also the
underscore (“_”) and the decimal digits 0-9, but not whitespace, punctuation, and so on.

```sqlexample
SELECT id,
       REGEXP_SUBSTR(string1, 'the\\W+\\w+') AS result
  FROM demo2
  ORDER BY id;
```

```output
+----+--------------+
| ID | RESULT       |
|----+--------------|
|  2 | the best     |
|  3 | the   string |
|  4 | NULL         |
+----+--------------+
```

Starting from position 1 of the string, look for the second occurrence of the word `the`,
followed by one or more non-word characters, followed by one or more word characters.

```sqlexample
SELECT id,
       REGEXP_SUBSTR(string1, 'the\\W+\\w+', 1, 2) AS result
  FROM demo2
  ORDER BY id;
```

```output
+----+-------------+
| ID | RESULT      |
|----+-------------|
|  2 | the worst   |
|  3 | the   extra |
|  4 | NULL        |
+----+-------------+
```

Starting from position 1 of the string, look for the second occurrence of the word `the`,
followed by one or more non-word characters, followed by one or more word characters.

Rather than returning the entire match, return only the “group” (for example, the portion of the substring that matches the
part of the regular expression in parentheses). In this case, the returned value should be the word after “the”.

```sqlexample
SELECT id,
       REGEXP_SUBSTR(string1, 'the\\W+(\\w+)', 1, 2, 'e', 1) AS result
  FROM demo2
  ORDER BY id;
```

```output
+----+--------+
| ID | RESULT |
|----+--------|
|  2 | worst  |
|  3 | extra  |
|  4 | NULL   |
+----+--------+
```

This example shows how to retrieve the second word from the first, second, and third matches of
a two-word pattern in which the first word is `A`. This example also shows that trying to
go beyond the last pattern causes Snowflake to return NULL.

First, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_regexp_substr (string1 VARCHAR);;
INSERT INTO test_regexp_substr (string1) VALUES ('A MAN A PLAN A CANAL');
```

Run the query:

```sqlexample
SELECT REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 1, 'e', 1) AS result1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 2, 'e', 1) AS result2,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 3, 'e', 1) AS result3,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w+)', 1, 4, 'e', 1) AS result4
  FROM test_regexp_substr;
```

```output
+---------+---------+---------+---------+
| RESULT1 | RESULT2 | RESULT3 | RESULT4 |
|---------+---------+---------+---------|
| MAN     | PLAN    | CANAL   | NULL    |
+---------+---------+---------+---------+
```

This example shows how to retrieve the first, second, and third groups within the first occurrence of the pattern.
In this case, the returned values are the individual letters of the word `MAN`.

```sqlexample
SELECT REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 1) AS result1,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 2) AS result2,
       REGEXP_SUBSTR(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 3) AS result3
  FROM test_regexp_substr;
```

```output
+---------+---------+---------+
| RESULT1 | RESULT2 | RESULT3 |
|---------+---------+---------|
| M       | A       | N       |
+---------+---------+---------+
```

Here are some additional examples.

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE message(body VARCHAR(255));

INSERT INTO message VALUES
  ('Hellooo World'),
  ('How are you doing today?'),
  ('the quick brown fox jumps over the lazy dog'),
  ('PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS');
```

Return the first match that contains a lowercase `o` by matching a word boundary (`\b`),
followed by zero or more word characters (`\S`), the letter `o`, and then zero or more
word characters until the next word boundary:

```sqlexample
SELECT body,
       REGEXP_SUBSTR(body, '\\b\\S*o\\S*\\b') AS result
  FROM message;
```

```output
+---------------------------------------------+---------+
| BODY                                        | RESULT  |
|---------------------------------------------+---------|
| Hellooo World                               | Hellooo |
| How are you doing today?                    | How     |
| the quick brown fox jumps over the lazy dog | brown   |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | NULL    |
+---------------------------------------------+---------+
```

Return the first match that contains a lowercase `o`, starting at the third character
in the subject:

```sqlexample
SELECT body,
       REGEXP_SUBSTR(body, '\\b\\S*o\\S*\\b', 3) AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               | llooo  |
| How are you doing today?                    | you    |
| the quick brown fox jumps over the lazy dog | brown  |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | NULL   |
+---------------------------------------------+--------+
```

Return the third match that contains a lowercase `o`, starting at the third character
in the subject:

```sqlexample
SELECT body,
       REGEXP_SUBSTR(body, '\\b\\S*o\\S*\\b', 3, 3) AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               | NULL   |
| How are you doing today?                    | today  |
| the quick brown fox jumps over the lazy dog | over   |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | NULL   |
+---------------------------------------------+--------+
```

Return the third match that contains a lowercase `o`, starting at the third character in
the subject, with case-insensitive matching:

```sqlexample
SELECT body,
       REGEXP_SUBSTR(body, '\\b\\S*o\\S*\\b', 3, 3, 'i') AS result
  FROM message;
```

```output
+---------------------------------------------+--------+
| BODY                                        | RESULT |
|---------------------------------------------+--------|
| Hellooo World                               | NULL   |
| How are you doing today?                    | today  |
| the quick brown fox jumps over the lazy dog | over   |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | LIQUOR |
+---------------------------------------------+--------+
```

This example shows that you can explicitly omit any regular expression parameters by specifying empty string.

```sqlexample
SELECT body,
       REGEXP_SUBSTR(body, '(H\\S*o\\S*\\b).*', 1, 1, '') AS result
  FROM message;
```

```output
+---------------------------------------------+--------------------------+
| BODY                                        | RESULT                   |
|---------------------------------------------+--------------------------|
| Hellooo World                               | Hellooo World            |
| How are you doing today?                    | How are you doing today? |
| the quick brown fox jumps over the lazy dog | NULL                     |
| PACK MY BOX WITH FIVE DOZEN LIQUOR JUGS     | NULL                     |
+---------------------------------------------+--------------------------+
```

The following example illustrates overlapping occurrences. First, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE overlap (
  id NUMBER,
  a STRING);

INSERT INTO overlap VALUES (1, ',abc,def,ghi,jkl,');
INSERT INTO overlap VALUES (2, ',abc,,def,,ghi,,jkl,');

SELECT * FROM overlap;
```

```output
+----+----------------------+
| ID | A                    |
|----+----------------------|
|  1 | ,abc,def,ghi,jkl,    |
|  2 | ,abc,,def,,ghi,,jkl, |
+----+----------------------+
```

Run a query that finds the second occurrence of the following pattern in each row: a punctuation mark
followed by digits and letters, followed by a punctuation mark.

```sqlexample
SELECT id,
       REGEXP_SUBSTR(a,'[[:punct:]][[:alnum:]]+[[:punct:]]', 1, 2) AS result
  FROM overlap;
```

```output
+----+--------+
| ID | RESULT |
|----+--------|
|  1 | ,ghi,  |
|  2 | ,def,  |
+----+--------+
```

The following example creates a JSON object from an Apache HTTP Server access log using pattern matching and concatenation.
First, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_regexp_log (logs VARCHAR);

INSERT INTO test_regexp_log (logs) VALUES
  ('127.0.0.1 - - [10/Jan/2018:16:55:36 -0800] "GET / HTTP/1.0" 200 2216'),
  ('192.168.2.20 - - [14/Feb/2018:10:27:10 -0800] "GET /cgi-bin/try/ HTTP/1.0" 200 3395');

SELECT * from test_regexp_log
```

```output
+-------------------------------------------------------------------------------------+
| LOGS                                                                                |
|-------------------------------------------------------------------------------------|
| 127.0.0.1 - - [10/Jan/2018:16:55:36 -0800] "GET / HTTP/1.0" 200 2216                |
| 192.168.2.20 - - [14/Feb/2018:10:27:10 -0800] "GET /cgi-bin/try/ HTTP/1.0" 200 3395 |
+-------------------------------------------------------------------------------------+
```

Run a query:

```sqlexample
SELECT '{ "ip_addr":"'
       || REGEXP_SUBSTR (logs,'\\b\\d{1,3}\.\\d{1,3}\.\\d{1,3}\.\\d{1,3}\\b')
       || '", "date":"'
       || REGEXP_SUBSTR (logs,'([\\w:\/]+\\s[+\-]\\d{4})')
       || '", "request":"'
       || REGEXP_SUBSTR (logs,'\"((\\S+) (\\S+) (\\S+))\"', 1, 1, 'e')
       || '", "status":"'
       || REGEXP_SUBSTR (logs,'(\\d{3}) \\d+', 1, 1, 'e')
       || '", "size":"'
       || REGEXP_SUBSTR (logs,'\\d{3} (\\d+)', 1, 1, 'e')
       || '"}' as Apache_HTTP_Server_Access
  FROM test_regexp_log;
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------+
| APACHE_HTTP_SERVER_ACCESS                                                                                                               |
|-----------------------------------------------------------------------------------------------------------------------------------------|
| { "ip_addr":"127.0.0.1", "date":"10/Jan/2018:16:55:36 -0800", "request":"GET / HTTP/1.0", "status":"200", "size":"2216"}                |
| { "ip_addr":"192.168.2.20", "date":"14/Feb/2018:10:27:10 -0800", "request":"GET /cgi-bin/try/ HTTP/1.0", "status":"200", "size":"3395"} |
+-----------------------------------------------------------------------------------------------------------------------------------------+
```

### Calling the REGEXP_SUBSTR function in a WHERE clause

Call the REGEXP_SUBSTR function in a WHERE clause to filter for rows that contain values that match a pattern.
By using the function, you can avoid multiple OR conditions.

The following example queries the `demo2` table you created previously to return rows that include either
the string `best` or the string `thespian`. Add `IS NOT NULL` to the condition to return rows that
match the pattern. That is, the rows where the REGEXP_SUBSTR function didn’t return `NULL`:

```sqlexample
SELECT id, string1
  FROM demo2
  WHERE REGEXP_SUBSTR(string1, '(best|thespian)') IS NOT NULL;
```

```output
+----+------------------------------------------------------+
| ID | STRING1                                              |
|----+------------------------------------------------------|
|  2 | It was the best of times, it was the worst of times. |
|  4 | A thespian theater is nearby.                        |
+----+------------------------------------------------------+
```

You can use AND conditions to find rows that match multiple patterns. For example, the following query returns
rows that include either the string `best` or the string `thespian` and start with the string `It`:

```sqlexample
SELECT id, string1
  FROM demo2
  WHERE REGEXP_SUBSTR(string1, '(best|thespian)') IS NOT NULL
    AND REGEXP_SUBSTR(string1, '^It') IS NOT NULL;
```

```output
+----+------------------------------------------------------
| ID | STRING1                                              |
|----+------------------------------------------------------|
|  2 | It was the best of times, it was the worst of times. |
+----+------------------------------------------------------+
```

---
title: REGEXP_SUBSTR_ALL
source: https://docs.snowflake.com/en/sql-reference/functions/regexp_substr_all.md
section: SQL Functions
---

Categories:
:   [String functions (regular expressions)](../functions-regexp.md)

# REGEXP_SUBSTR_ALL

Returns an [ARRAY](../data-types-semistructured.md) that contains all substrings that match a
[regular expression](../functions-regexp.md) within a string.

Aliases:
:   REGEXP_EXTRACT_ALL

## Syntax

```sqlsyntax
REGEXP_SUBSTR_ALL( <subject> ,
                   <pattern>
                     [ , <position>
                       [ , <occurrence>
                         [ , <regex_parameters>
                           [ , <group_num> ]
                         ]
                       ]
                     ]
)
```

## Arguments

**Required:**

`subject`
:   The string to search for matches.

`pattern`
:   Pattern to match.

    For guidelines on specifying patterns, see [String functions (regular expressions)](../functions-regexp.md).

**Optional:**

`position`
:   Number of characters from the beginning of the string where the function starts searching for matches.
    The value must be a positive integer.

    Default: `1` (the search for a match starts at the first character on the left)

`occurrence`
:   Specifies the first occurrence of the pattern from which to start returning matches.

    The function skips the first `occurrence - 1` matches. For example, if there are 5 matches and
    you specify `3` for the `occurrence` argument, the function ignores the first two matches and
    returns the third, fourth, and fifth matches.

    Default: `1`

`regex_parameters`
:   String of one or more characters that specifies the parameters used for searching for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `c` | Case-sensitive matching |
    | `i` | Case-insensitive matching |
    | `m` | Multi-line mode |
    | `e` | Extract submatches |
    | `s` | Single-line mode POSIX wildcard character `.` matches `\n` |

    Default: `c`

    For more information, see [Specifying the parameters for the regular expression](../functions-regexp.md).

    > **Note:**
    >
    > By default, REGEXP_SUBSTR_ALL returns the entire matching part of the subject.
    > However, if the `e` parameter is specified, REGEXP_SUBSTR_ALL returns the
    > part of the subject that matches the first group in the pattern.
    > If `e` is specified but a `group_num` is not also specified, then the `group_num`
    > defaults to 1 (the first group). If there is no sub-expression in the pattern, REGEXP_SUBSTR_ALL behaves as
    > if `e` was not set. For examples that use `e`, see Examples in this topic.

`group_num`
:   Specifies which group to extract. Groups are specified by using parentheses in
    the regular expression.

    If a `group_num` is specified, Snowflake allows extraction even if the `'e'` option was not
    also specified. The `'e'` is implied.

    Snowflake supports up to 1024 groups.

    For examples that use `group_num`, see the Examples in this topic.

## Returns

The function returns a value of type ARRAY. The array contains an element for each matching substring.

The function returns an empty array if no match is found.

The function returns NULL in the following cases:

* Any argument is NULL.
* You specify `group_num` and the pattern doesn’t specify a grouping with that number. For example, if the
  pattern specifies only one group (for example, `a(b)c`), and you use `2` as `group_num`, the function returns
  NULL.

## Usage notes

For additional information on using regular expressions, see [String functions (regular expressions)](../functions-regexp.md).

## Collation details

Arguments with collation specifications currently aren’t supported.

## Examples

The pattern in the following example matches a lowercase “a” followed by a digit. The example returns an ARRAY that contains all
of the matches:

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', 'a[[:digit:]]') AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| [       |
|   "a1", |
|   "a2", |
|   "a3", |
|   "a4", |
|   "a6"  |
| ]       |
+---------+
```

The following example starts finding matches from the second character in the string (`2`):

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', 'a[[:digit:]]', 2) AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| [       |
|   "a2", |
|   "a3", |
|   "a4", |
|   "a6"  |
| ]       |
+---------+
```

The following example starts returning matches from the third occurrence of the pattern in the string (`3`):

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', 'a[[:digit:]]', 1, 3) AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| [       |
|   "a3", |
|   "a4", |
|   "a6"  |
| ]       |
+---------+
```

The following example performs a case-insensitive match (`i`):

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', 'a[[:digit:]]', 1, 1, 'i') AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| [       |
|   "a1", |
|   "a2", |
|   "a3", |
|   "a4", |
|   "A5", |
|   "a6"  |
| ]       |
+---------+
```

The following example performs a case-insensitive match and returns the part of the string that matches the first group (`ie`):

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', '(a)([[:digit:]])', 1, 1, 'ie') AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| [       |
|   "a",  |
|   "a",  |
|   "a",  |
|   "a",  |
|   "A",  |
|   "a"   |
| ]       |
+---------+
```

The following example demonstrates that the function returns an empty array when no matches are found:

```sqlexample
SELECT REGEXP_SUBSTR_ALL('a1_a2a3_a4A5a6', 'b') AS matches;
```

```output
+---------+
| MATCHES |
|---------|
| []      |
+---------+
```

This example shows how to retrieve each second word in a string from the first, second, and third
matches of a two-word pattern in which the first word is `A`.

First, create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_regexp_substr_all (string1 VARCHAR);;
INSERT INTO test_regexp_substr_all (string1) VALUES ('A MAN A PLAN A CANAL');
```

Run the query:

```sqlexample
SELECT REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w+)', 1, 1, 'e', 1) AS result1,
       REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w+)', 1, 2, 'e', 1) AS result2,
       REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w+)', 1, 3, 'e', 1) AS result3
  FROM test_regexp_substr_all;
```

```output
+-----------+-----------+-----------+
| RESULT1   | RESULT2   | RESULT3   |
|-----------+-----------+-----------|
| [         | [         | [         |
|   "MAN",  |   "PLAN", |   "CANAL" |
|   "PLAN", |   "CANAL" | ]         |
|   "CANAL" | ]         |           |
| ]         |           |           |
+-----------+-----------+-----------+
```

This example shows how to retrieve the first, second, and third groups within each occurrence of the pattern
in a string. In this case, the returned values are each individual letter of each matched word in each group.

```sqlexample
SELECT REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 1) AS result1,
       REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 2) AS result2,
       REGEXP_SUBSTR_ALL(string1, 'A\\W+(\\w)(\\w)(\\w)', 1, 1, 'e', 3) AS result3
  FROM test_regexp_substr_all;
```

```output
+---------+---------+---------+
| RESULT1 | RESULT2 | RESULT3 |
|---------+---------+---------|
| [       | [       | [       |
|   "M",  |   "A",  |   "N",  |
|   "P",  |   "L",  |   "A",  |
|   "C"   |   "A"   |   "N"   |
| ]       | ]       | ]       |
+---------+---------+---------+
```

---
title: REGR_AVGX
source: https://docs.snowflake.com/en/sql-reference/functions/regr_avgx.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window functions](../functions-window.md)

# REGR_AVGX

Returns the average of the independent variable for non-null pairs in a group, where `x` is the independent variable and `y` is the dependent variable.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_AVGX(y, x)
```

**Window function**

```sqlsyntax
REGR_AVGX(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.
* In order for a row to be included in the average, BOTH the x and y values
  must be non-NULL.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k int, v decimal(10,2), v2 decimal(10, 2));
INSERT INTO aggr VALUES(1, 10, NULL);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, NULL), (2, 30, 35);
```

```sqlexample
SELECT k, REGR_AVGX(v, v2) FROM aggr GROUP BY k;

---+------------------+
 k | regr_avgx(v, v2) |
---+------------------+
 1 | [NULL]           |
 2 | 22.666666667     |
---+------------------+
```

---
title: REGR_AVGY
source: https://docs.snowflake.com/en/sql-reference/functions/regr_avgy.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window functions](../functions-window.md)

# REGR_AVGY

Returns the average of the dependent variable for non-null pairs in a group, where `x` is the independent variable and `y` is the dependent variable.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_AVGY(y, x)
```

**Window function**

```sqlsyntax
REGR_AVGY(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.
* In order for a row to be included in the average, BOTH the x and y values
  must be non-NULL.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
create or replace table aggr(k int, v decimal(10,2), v2 decimal(10, 2));
insert into aggr values(1, 10, null);
insert into aggr values(2, 10, 11), (2, 20, 22), (2, 25,null), (2, 30, 35);
```

```sqlexample
select k, regr_avgy(v, v2) from aggr group by k;

---+------------------+
 k | regr_avgy(v, v2) |
---+------------------+
 1 | [NULL]           |
 2 | 20               |
---+------------------+
```

---
title: REGR_COUNT
source: https://docs.snowflake.com/en/sql-reference/functions/regr_count.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_COUNT

Returns the number of non-null number pairs in a group.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_COUNT(y, x)
```

**Window function**

```sqlsyntax
REGR_COUNT(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

Show the number of pairs in each group, and the number of those pairs in which
neither member is NULL.

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, COUNT(*), REGR_COUNT(v, v2) FROM aggr GROUP BY k;
```

```output
+---+----------+-------------------+
| k | count(*) | regr_count(v, v2) |
|---+----------+-------------------|
| 1 |      1   |            0      |
| 2 |      4   |            3      |
+---+----------+-------------------+
```

---
title: REGR_INTERCEPT
source: https://docs.snowflake.com/en/sql-reference/functions/regr_intercept.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_INTERCEPT

Returns the intercept of the univariate linear regression line for non-null pairs in a group. It is computed for non-null pairs using the following
formula:

> `AVG(y)-REGR_SLOPE(y,x)*AVG(x)`

Where `x` is the independent variable and `y` is the dependent variable.

## Syntax

**Aggregation function**

```sqlsyntax
REGR_INTERCEPT(y, x)
```

**Window function**

```sqlsyntax
REGR_INTERCEPT(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, REGR_INTERCEPT(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-----------------------+
| k | regr_intercept(v, v2) |
|---+-----------------------|
| 1 | [NULL]                |
| 2 | 1.154734411           |
+---+-----------------------+
```

---
title: REGR_R2
source: https://docs.snowflake.com/en/sql-reference/functions/regr_r2.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_R2

Returns the coefficient of determination for non-null pairs in a group. It is computed for non-null pairs using the following formula:

```sqlexample
NULL                 if VAR_POP(x) = 0, else
1                    if VAR_POP(y) = 0 and VAR_POP(x) <> 0, else
POWER(CORR(y,x), 2)
```

Where `x` is the independent variable and `y` is the dependent variable.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_R2(y, x)
```

**Window function**

```sqlsyntax
REGR_R2(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);
```

```sqlexample
SELECT k, REGR_R2(v, v2) FROM aggr GROUP BY k;
```

```output
+---+----------------+
| k | regr_r2(v, v2) |
|---+----------------+
| 1 | [NULL]         |
| 2 | 0.9976905312   |
+---+----------------+
```

---
title: REGR_SLOPE
source: https://docs.snowflake.com/en/sql-reference/functions/regr_slope.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_SLOPE

Returns the slope of the linear regression line for non-null pairs in a group. It is computed for non-null pairs using the following formula:

> `COVAR_POP(x,y) / VAR_POP(x)`

Where `x` is the independent variable and `y` is the dependent variable.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_SLOPE(y, x)
```

**Window function**

```sqlsyntax
REGR_SLOPE(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, REGR_SLOPE(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-------------------+
| k | regr_slope(v, v2) |
|---+-------------------|
| 1 | [NULL]            |
| 2 | 0.831408776       |
+---+-------------------+
```

---
title: REGR_SXX
source: https://docs.snowflake.com/en/sql-reference/functions/regr_sxx.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_SXX

Returns REGR_COUNT(y, x) \* VAR_POP(x) for non-null pairs.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_SXX(y, x)
```

**Window function**

```sqlsyntax
REGR_SXX(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, REGR_SXX(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-----------------+
| k | regr_sxx(v, v2) |
|---+-----------------|
| 1 | [NULL]          |
| 2 | 288.666666667   |
+---+-----------------+
```

---
title: REGR_SXY
source: https://docs.snowflake.com/en/sql-reference/functions/regr_sxy.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_SXY

Returns REGR_COUNT(expr1, expr2) \* COVAR_POP(expr1, expr2) for non-null
pairs.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_SXY(y, x)
```

**Window function**

```sqlsyntax
REGR_SXY(y, x) OVER ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, REGR_SXY(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-----------------+
| k | regr_sxy(v, v2) |
+---+-----------------+
| 1 | [NULL]          |
| 2 | 240             |
+---+-----------------+
```

---
title: REGR_SYY
source: https://docs.snowflake.com/en/sql-reference/functions/regr_syy.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (Linear Regression) , [Window function syntax and usage](../functions-window-syntax.md)

# REGR_SYY

Returns REGR_COUNT(y, x) \* VAR_POP(y) for non-null pairs.

## Syntax

**Aggregate function**

```sqlsyntax
REGR_SYY(y, x)
```

**Window function**

```sqlsyntax
REGR_SYY(y, x) ( [ PARTITION BY <expr3> ] )
```

## Arguments

`y`
:   The dependent variable. This must be an expression that can be evaluated to a numeric type.

`x`
:   The independent variable. This must be an expression that can be evaluated to a numeric type.

`expr3`
:   This is the optional expression used to group rows into partitions.

> **Important:**
>
> Note the order of the arguments; the dependent variable is first.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* DISTINCT is not supported for this function.

* When this function is called as a window function, it does not support:

  + An ORDER BY clause within the OVER clause.
  + Explicit window frames.

## Examples

```sqlexample
CREATE OR REPLACE TABLEE aggr(k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));
INSERT INTO aggr VALUES(1, 10, null);
INSERT INTO aggr VALUES(2, 10, 11), (2, 20, 22), (2, 25, null), (2, 30, 35);

SELECT k, REGR_SYY(v, v2) FROM aggr GROUP BY k;
```

```output
+---+-----------------+
| k | regr_syy(v, v2) |
|---+-----------------|
| 1 | [NULL]          |
| 2 | 200             |
+---+-----------------+
```

---
title: REGR_VALX
source: https://docs.snowflake.com/en/sql-reference/functions/regr_valx.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# REGR_VALX

Returns NULL if the first argument is NULL; otherwise, returns the second argument.

Note that REGR_VALX is a NULL-preserving function, while the more commonly-used [NVL](nvl.md) is a NULL-replacing function.

## Syntax

```sqlsyntax
REGR_VALX( <y> , <x> )
```

## Arguments

`y`:
:   An expression that evaluates to type FLOAT or DECFLOAT or that can be cast to type FLOAT or DECFLOAT.

`x`:
:   An expression that evaluates to type FLOAT or DECFLOAT or that can be cast to type FLOAT or DECFLOAT.

> **Important:**
>
> Note the order of the arguments; y precedes x.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

Basic example:

> ```sqlexample
> SELECT REGR_VALX(NULL, 10), REGR_VALX(1, NULL), REGR_VALX(1, 10);
> +---------------------+--------------------+------------------+
> | REGR_VALX(NULL, 10) | REGR_VALX(1, NULL) | REGR_VALX(1, 10) |
> |---------------------+--------------------+------------------|
> |                NULL |               NULL |               10 |
> +---------------------+--------------------+------------------+
> ```

This example is similar to the preceding example, but shows more clearly that the convention is to pass the `Y`
value first. It also shows the difference between REGR_VALX and REGR_VALY:

> ```sqlexample
> CREATE TABLE xy (col_x DOUBLE, col_y DOUBLE);
> INSERT INTO xy (col_x, col_y) VALUES
>     (1.0, 2.0),
>     (3.0, NULL),
>     (NULL, 6.0);
> ```
>
> ```sqlexample
> SELECT col_y, col_x, REGR_VALX(col_y, col_x), REGR_VALY(col_y, col_x)
>     FROM xy;
> +-------+-------+-------------------------+-------------------------+
> | COL_Y | COL_X | REGR_VALX(COL_Y, COL_X) | REGR_VALY(COL_Y, COL_X) |
> |-------+-------+-------------------------+-------------------------|
> |     2 |     1 |                       1 |                       2 |
> |  NULL |     3 |                    NULL |                    NULL |
> |     6 |  NULL |                    NULL |                    NULL |
> +-------+-------+-------------------------+-------------------------+
> ```

---
title: REGR_VALY
source: https://docs.snowflake.com/en/sql-reference/functions/regr_valy.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# REGR_VALY

Returns NULL if the second argument is NULL; otherwise, returns the first argument.

Note that REGR_VALY is a NULL-preserving function, while the more commonly-used [NVL](nvl.md) is a NULL-replacing function.

## Syntax

```sqlsyntax
REGR_VALY( <y> , <x> )
```

## Arguments

`y`:
:   An expression that evaluates to type FLOAT or DECFLOAT or that can be cast to type FLOAT or DECFLOAT.

`x`:
:   An expression that evaluates to type FLOAT or DECFLOAT or that can be cast to type FLOAT or DECFLOAT.

> **Important:**
>
> Note the order of the arguments; y precedes x.

## Returns

If any of the input expressions is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

Basic example:

> ```sqlexample
> SELECT REGR_VALY(NULL, 10), REGR_VALY(1, NULL), REGR_VALY(1, 10);
> +---------------------+--------------------+------------------+
> | REGR_VALY(NULL, 10) | REGR_VALY(1, NULL) | REGR_VALY(1, 10) |
> |---------------------+--------------------+------------------|
> |                NULL |               NULL |                1 |
> +---------------------+--------------------+------------------+
> ```

This example is similar to the preceding example, but shows more clearly that the convention is to pass the `Y`
value first. It also shows the difference between REGR_VALX and REGR_VALY:

> ```sqlexample
> CREATE TABLE xy (col_x DOUBLE, col_y DOUBLE);
> INSERT INTO xy (col_x, col_y) VALUES
>     (1.0, 2.0),
>     (3.0, NULL),
>     (NULL, 6.0);
> ```
>
> ```sqlexample
> SELECT col_y, col_x, REGR_VALX(col_y, col_x), REGR_VALY(col_y, col_x)
>     FROM xy;
> +-------+-------+-------------------------+-------------------------+
> | COL_Y | COL_X | REGR_VALX(COL_Y, COL_X) | REGR_VALY(COL_Y, COL_X) |
> |-------+-------+-------------------------+-------------------------|
> |     2 |     1 |                       1 |                       2 |
> |  NULL |     3 |                    NULL |                    NULL |
> |     6 |  NULL |                    NULL |                    NULL |
> +-------+-------+-------------------------+-------------------------+
> ```

---
title: REPEAT
source: https://docs.snowflake.com/en/sql-reference/functions/repeat.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# REPEAT

Builds a string by repeating the input for the specified number of
times.

## Syntax

```sqlsyntax
REPEAT(<input>, <n>)
```

## Arguments

`input`
:   The input string from which the output string is built.

`n`
:   The number of times the input string should be repeated. The minimum
    valid number is 0 (which results in an empty string).

## Examples

```sqlexample
SELECT REPEAT('xy', 5);

-----------------+
 REPEAT('XY', 5) |
-----------------+
 xyxyxyxyxy      |
-----------------+
```

---
title: REPLACE
source: https://docs.snowflake.com/en/sql-reference/functions/replace.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# REPLACE

Removes all occurrences of a specified substring, and optionally replaces them with another substring.

## Syntax

```sqlsyntax
REPLACE( <subject> , <pattern> [ , <replacement> ] )
```

## Arguments

`subject`
:   The subject is the string in which to do the replacements. Typically,
    this is a column, but it can be a literal.

`pattern`
:   This is the substring that you want to replace. Typically, this is a literal,
    but it can be a column or expression. Note that this is not a “regular
    expression”; if you want to use regular expressions to search for a
    pattern, use the [REGEXP_REPLACE](regexp_replace.md) function.

`replacement`
:   This is the value used as a replacement for the `pattern`. If this
    is omitted, or is an empty string, then the `REPLACE` function simply
    deletes all occurrences of the `pattern`.

## Returns

The returned value is the string after all replacements have been done.

## Usage notes

* If `replacement` is not specified, `subject` is returned with all occurrences of `pattern` removed.
* If `replacement` is specified, `subject` is returned with all occurrences of `pattern` replaced by `replacement`.
* If any of the arguments is a NULL, the result is also a NULL.

> **Note:**
>
> Only occurrences in the original `subject` are considered. A `pattern` that occurs in the result is not removed/replaced.

## Collation details

The [collation specifications](../collation.md) of all input arguments must be compatible.

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

Replace the string `down` with the string `up`:

> ```sqlexample
> SELECT REPLACE('down', 'down', 'up');
> ```
>
> ```output
> +-------------------------------+
> | REPLACE('DOWN', 'DOWN', 'UP') |
> |-------------------------------|
> | up                            |
> +-------------------------------+
> ```

Replace the substring `Athens` in the string `Vacation in Athens` with the substring
`Rome`:

> ```sqlexample
> SELECT REPLACE('Vacation in Athens', 'Athens', 'Rome');
> ```
>
> ```output
> +-------------------------------------------------+
> | REPLACE('VACATION IN ATHENS', 'ATHENS', 'ROME') |
> |-------------------------------------------------|
> | Vacation in Rome                                |
> +-------------------------------------------------+
> ```

Replace the substring `bc` in the string `abcd` with an empty substring:

> ```sqlexample
> SELECT REPLACE('abcd', 'bc');
> ```
>
> ```output
> +-----------------------+
> | REPLACE('ABCD', 'BC') |
> |-----------------------|
> | ad                    |
> +-----------------------+
> ```

Replace the values in a table with new values.

> Create and populate a table:
>
> > ```sqlexample
> > CREATE OR REPLACE TABLE replace_example(
> >   subject VARCHAR(10),
> >   pattern VARCHAR(10),
> >   replacement VARCHAR(10));
> >
> > INSERT INTO replace_example VALUES
> >   ('old car', 'old car', 'new car'),
> >   ('sad face', 'sad', 'happy'),
> >   ('snowman', 'snow', 'fire');
> > ```
>
> Replace strings in a value with a specified replacement:
>
> > ```sqlexample
> > SELECT subject,
> >        pattern,
> >        replacement,
> >        REPLACE(subject, pattern, replacement) AS new
> >   FROM replace_example
> >   ORDER BY subject;
> > ```
> >
> > ```output
> > +----------+---------+-------------+------------+
> > | SUBJECT  | PATTERN | REPLACEMENT | NEW        |
> > |----------+---------+-------------+------------|
> > | old car  | old car | new car     | new car    |
> > | sad face | sad     | happy       | happy face |
> > | snowman  | snow    | fire        | fireman    |
> > +----------+---------+-------------+------------+
> > ```
>
> The output shows the following replacements:
>
> * The string `old car` was replaced by the string `new car`.
> * In the string `sad face`, the substring `sad` was replaced by the substring `happy` to create the new string
>   `happy face`.
> * In the string `snowman`, the substring `snow` was replaced by the substring `fire` to create the new string
>   `fireman`.

---
title: REPLICATION_GROUP_DANGLING_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/replication_group_dangling_references.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# REPLICATION_GROUP_DANGLING_REFERENCES

Detects cases where an object that’s referenced in a replication group or failover group isn’t actually replicated to the secondary
account. Snowflake refers to these types of references as *dangling references*.

After you use this function to detect dangling references in your replication configuration, you can
rearrange your replication groups or failover groups so that all of the referenced objects are included.
Or, you can modify your SQL object hierarchy so that the referenced objects are part of a container
such as a database or schema that’s included in the replication groups or failover groups.

If you use multiple replication groups or failover groups, you might also specify the order of
refresh operations to make sure that any objects that are required to resolve dangling references
are replicated to the secondary account before the objects that refer to them.

> **Important:**
>
> Pay special attention to any TRUE values in the IS_BLOCKING_REFRESH column.
> Both refresh and failover operations can’t proceed until you resolve those
> references.

See also:
:   [Replication and references across replication groups](../../user-guide/account-replication-considerations.md)

## Syntax

```sqlsyntax
REPLICATION_GROUP_DANGLING_REFERENCES( '<replication_or_failover_group_name>' )
```

## Arguments

`'replication_or_failover_group_name'`
:   Name of the replication group or failover group to check for dangling references.
    The entire name must be enclosed in single quotes.

## Output

The function returns the following columns.

| Column Name | Data Type | Description |
| --- | --- | --- |
| REFERENCED_ENTITY_DOMAIN | VARCHAR | The domain of the entity referred to by the dangling reference. |
| REFERENCED_ENTITY_NAME | VARCHAR | The fully qualified name of the entity referred to by the dangling reference. |
| REFERENCING_ENTITY_DOMAIN | VARCHAR | The domain of the entity in the replication group with a dangling reference, for example, `Table`. |
| REFERENCING_ENTITY_NAME | VARCHAR | The fully qualified name of the entity in the replication group with a dangling reference. |
| REFERENCING_ENTITY_GROUPS | VARCHAR | A comma-separated list of all replication groups that contain the referencing entity, or NULL if no group contains that entity. |
| IS_BLOCKING_REFRESH | BOOLEAN | If TRUE, replication refreshes and failovers will fail until this reference is resolved. If FALSE, Snowflake can perform those operations despite the dangling reference. |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema
  in use or the function name must be fully-qualified.
  For more information, see [Snowflake Information Schema](../info-schema.md).
* You can run this function from any account in your organization.
  The replication group or failover group that you specify must exist in the account that calls the function.
  That is, you specify the group name that’s used in the cloud service provider region where you call the function.

  + If the function is called using a replication group or failover group on the primary account,
    it reports dangling references if their corresponding referred objects aren’t replicated to
    *all* the secondary accounts.
  + If the function is called using a replication group or failover group on the secondary account,
    it reports dangling references if their corresponding referred objects aren’t replicated to
    the specific secondary where the function was called.
* For information about how to deal with dangling references in replication groups and failover groups,
  see [Replication and references across replication groups](../../user-guide/account-replication-considerations.md).

## Examples

To check for dangling references in the failover group `myfg`,
run the following statement from your primary or secondary account.

```sqlexample
SELECT *
  FROM TABLE(
      INFORMATION_SCHEMA.REPLICATION_GROUP_DANGLING_REFERENCES('myfg')
  );
```

---
title: REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL
source: https://docs.snowflake.com/en/sql-reference/functions/replication_group_refresh_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL

You can use the REPLICATION_GROUP_REFRESH_HISTORY family of table functions to query the replication history for
one secondary replication or failover group, or all such groups.

By default (when no date-range arguments are provided), these functions return data for the last 12 hours.
You can use the optional `DATE_RANGE_START` and `DATE_RANGE_END` arguments to query a custom range
within the 14-day retention window.

See also:
:   [REPLICATION_GROUP_REFRESH_HISTORY view](../account-usage/replication_group_refresh_history.md)

## Syntax

```sqlsyntax
REPLICATION_GROUP_REFRESH_HISTORY(
      '<secondary_group_name>'
      [ , DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ] )

REPLICATION_GROUP_REFRESH_HISTORY_ALL(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ] )
```

## Arguments

`'secondary_group_name'`
:   Name of the secondary group. The entire name must be enclosed in single quotes.
    Required for REPLICATION_GROUP_REFRESH_HISTORY. Not used with REPLICATION_GROUP_REFRESH_HISTORY_ALL.

The following arguments are optional for both functions.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range for which to return replication refresh history.

    * If neither a start date nor an end date is specified, the default is the last 12 hours.
    * If a start date is specified but no end date, [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If an end date is specified but no start date, the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

    Data is retained for 14 days. If the requested range extends beyond the 14-day retention window,
    the function returns an error.

## Output

The function returns the following columns. REPLICATION_GROUP_REFRESH_HISTORY_ALL has additional
columns that are the first two columns in the result set.

| Column Name | Data Type | Description |
| --- | --- | --- |
| GROUP_NAME | TEXT | Specifies which secondary replication or failover group corresponds to this row in the result set. Only applies to REPLICATION_GROUP_REFRESH_HISTORY_ALL. |
| GROUP_TYPE | TEXT | Specifies whether the group corresponding to this row in the result set is a failover group or a replication group. The value is either `FAILOVER` or `REPLICATION`. Only applies to REPLICATION_GROUP_REFRESH_HISTORY_ALL. |
| PHASE_NAME | TEXT | Current phase in the replication operation. For the list of phases, see the Usage Notes. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication operation began. |
| END_TIME | TIMESTAMP_LTZ | Time when the replication operation finished, if applicable. `NULL` if it is in progress. |
| JOB_UUID | TEXT | Query ID for the refresh job. |
| TOTAL_BYTES | VARIANT | A JSON object that provides detailed information about refreshed databases:   * `totalBytesToReplicate`: Total number of bytes expected to be replicated. * `bytesUploaded`: Actual number of bytes uploaded. * `bytesDownloaded`: Actual number of bytes downloaded. * `databases`: List of JSON objects containing the following fields for each member database:    + `name`: Name of the database.   + `totalBytesToReplicate`: Total bytes expected to be replicated for the database. |
| OBJECT_COUNT | VARIANT | A JSON object that provides detailed information about refreshed objects:   * `totalObjects`: Total number of objects in the replication or failover group. * `completedObjects`: Total number of objects completed. * `objectTypes`: List of JSON objects containing the following fields for each type:    + `objectType`: Type of object (for example users, roles, grants, warehouses, schemas, tables, columns, etc).   + `totalObjects`: Total number of objects of this type in the replication or failover group.   + `completedObjects`: Total number of objects of this type that were completed. |
| PRIMARY_SNAPSHOT_TIMESTAMP | TIMESTAMP_LTZ | Timestamp when the primary snapshot was created. |
| ERROR | VARIANT | NULL if the refresh operation is successful. If the refresh operation fails, returns a JSON object that provides detailed information about the error:   * `errorCode`: Error code of the failure. * `errorMessage`: Error message of the failure. |

## Usage notes

* When no `DATE_RANGE_START` or `DATE_RANGE_END` arguments are provided, the functions return data for
  the last 12 hours. To retrieve data beyond the last 12 hours, specify the date range explicitly.
  Data is available for up to 14 days.
* Only returns rows for a role with any privilege on the replication or failover group.
* Only returns rows for a secondary replication or failover group in the current account.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* The following is the list of phases in the order processed:

  | # | Phase name | Description |
  | --- | --- | --- |
  | 1 | `SECONDARY_SYNCHRONIZING_MEMBERSHIP` | The secondary replication or failover group receives information from the primary group about the objects included in the group, and updates its membership metadata. |
  | 2 | `SECONDARY_UPLOADING_INVENTORY` | The secondary replication or failover group sends an inventory of its objects in the target account to the primary group. |
  | 3 | `PRIMARY_UPLOADING_METADATA` | The primary replication or failover group creates a snapshot of metadata in the source account and sends it to the secondary group. |
  | 4 | `PRIMARY_UPLOADING_DATA` | The primary replication or failover group copies the files the secondary group needs to reconcile any deltas between the objects in the source and target accounts. |
  | 5 | `SECONDARY_DOWNLOADING_METADATA` | The secondary replication or failover group applies the snapshot of the metadata that was sent by the primary. The metadata updates are not applied atomically and instead applied over time. |
  | 6 | `SECONDARY_DOWNLOADING_DATA` | The secondary replication or failover group copies the files sent by the primary group to the target account. |
  | 7 | `COMPLETED` / `FAILED` / `CANCELED` | Refresh operation status. |

## Examples

To retrieve the refresh history for secondary group `myfg`,
execute the following statement.

```sqlexample
SELECT phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM TABLE(
      INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY('myfg')
  );
```

To retrieve the refresh history for the last 12 hours (default) for all failover groups and replication groups,
execute the following statement:

```sqlexample
SELECT phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM TABLE(
      INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY_ALL()
  );
```

To retrieve the refresh history for the last 7 days for all groups:

```sqlexample
SELECT phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM TABLE(
      INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY_ALL(
          DATE_RANGE_START => DATEADD(D, -7, CURRENT_DATE),
          DATE_RANGE_END => CURRENT_DATE)
  );
```

To retrieve the refresh history for a specific date range for secondary group `myfg`:

```sqlexample
SELECT phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM TABLE(
      INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_HISTORY(
          'myfg',
          DATE_RANGE_START => '2025-04-01',
          DATE_RANGE_END => '2025-04-07')
  );
```

---
title: REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL
source: https://docs.snowflake.com/en/sql-reference/functions/replication_group_refresh_progress.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL

You can use the REPLICATION_GROUP_REFRESH_PROGRESS family of table functions to query the status of refresh operations
for replication or failover groups:

* REPLICATION_GROUP_REFRESH_PROGRESS returns a JSON object indicating the refresh status for a secondary replication or failover group by
  name.
* REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB returns a JSON object indicating the refresh status for a secondary replication
  or failover group by query ID.
* REPLICATION_GROUP_REFRESH_PROGRESS_ALL returns a JSON object indicating the refresh status for all the secondary replication
  and failover groups.

> **Note:**
>
> * REPLICATION_GROUP_REFRESH_PROGRESS only returns the replication or failover group refresh activity for the most recent refresh if it
>   occurred within the last 14 days.
> * REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB and REPLICATION_GROUP_REFRESH_PROGRESS_ALL return replication or failover group
>   refresh activity within the last 14 days. By default (when no date-range arguments are provided),
>   REPLICATION_GROUP_REFRESH_PROGRESS_ALL returns data for the last 12 hours. Use the optional
>   `DATE_RANGE_START` and `DATE_RANGE_END` arguments to query a custom range within the 14-day
>   retention window.

## Syntax

```sqlsyntax
REPLICATION_GROUP_REFRESH_PROGRESS( '<secondary_group_name>' )

REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB( '<query_id>' )

REPLICATION_GROUP_REFRESH_PROGRESS_ALL(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ] )
```

## Arguments

`'secondary_group_name'`
:   Name of the secondary replication or failover group. Note that the entire name must be enclosed in single quotes.

`'query_id'`
:   ID of the replication group refresh query. The query ID can be obtained from the History  page in the web
    interface.

The following arguments are optional for REPLICATION_GROUP_REFRESH_PROGRESS_ALL.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range for which to return replication refresh progress.

    * If neither a start date nor an end date is specified, the default is the last 12 hours.
    * If a start date is specified but no end date, [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If an end date is specified but no start date, the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

    Data is retained for 14 days. If the requested range extends beyond the 14-day retention window,
    the function returns an error.

## Output

The function returns the following columns. REPLICATION_GROUP_REFRESH_PROGRESS_ALL has additional
columns that are the first two columns in the result set.

| Column Name | Data Type | Description |
| --- | --- | --- |
| GROUP_NAME | TEXT | Specifies which secondary replication or failover group corresponds to this row in the result set. Only applies to REPLICATION_GROUP_REFRESH_PROGRESS_ALL. |
| GROUP_TYPE | TEXT | Specifies whether the group corresponding to this row in the result set is a failover group or a replication group. The value is either `FAILOVER` or `REPLICATION`. Only applies to REPLICATION_GROUP_REFRESH_PROGRESS_ALL. |
| PHASE_NAME | TEXT | Name of the replication phases completed (or in progress) so far. For the list of phases, see the usage notes. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication phase began. |
| END_TIME | TIMESTAMP_LTZ | Time when the phase finished, if applicable. `NULL` if the phase is in progress or is the terminating phase (`COMPLETED`/`FAILED`/`CANCELED`). |
| PROGRESS | TEXT | * `PRIMARY_UPLOADING_DATA`: Percentage of total bytes replicated. * `SECONDARY_DOWNLOADING_METADATA`: Percentage of the total number of objects replicated. * `SECONDARY_DOWNLOADING_DATA`: Percentage of total bytes replicated.   Empty for remaining phases |
| DETAILS | VARIANT | * For phase `PRIMARY_UPLOADING_METADATA`:    + `primarySnapshotTimestamp`: Time when the primary snapshot was created. Format is epoch time. * For phase `PRIMARY_UPLOADING_DATA`:    + `totalBytesToReplicate`: Total number of bytes expected to be uploaded.   + `totalBytesToUpload`: Total number of bytes to required to be uploaded.   + `bytesUploaded`: Total number of bytes uploaded so far.   + `databases`: List of JSON objects containing the following fields for each member database:  - `name`: Database name.     - `totalBytesToReplicate`: Total bytes expected to be uploaded for the database. * For phase `SECONDARY_DOWNLOADING_DATA`:    + `totalBytesToReplicate`: Total number of bytes expected to be downloaded.   + `totalBytesToDownload`: Actual number of bytes required to be downloaded.   + `bytesDownloaded`: Actual number of bytes downloaded so far.   + `databases`: List of JSON objects containing the following fields for each member database:      - `name`: Database name.     - `totalBytesToReplicate`: Total bytes expected to be downloaded for the database. * For phase `SECONDARY_DOWNLOADING_METADATA`:    + `totalObjects`: Total number of objects to download.   + `completedObjects`: Total number of objects downloaded so far.   + `objectTypes`: List of JSON objects containing the following fields for each object type:      - `objectType`: Type of object (for example, users, roles, grants, warehouses, schemas, tables, columns, etc).     - `totalObjects`: Total number of objects of this type.     - `completedObjects`: Number of completed objects of this type. * For phase `FAILED`:  + `errorCode`: Error code of the failure.   + `errorMessage`: Error message of the failure. |

## Usage notes

* When no `DATE_RANGE_START` or `DATE_RANGE_END` arguments are provided,
  REPLICATION_GROUP_REFRESH_PROGRESS_ALL returns data for the last 12 hours.
  To retrieve data beyond the last 12 hours, specify the date range explicitly.
  Data is available for up to 14 days.
* Only returns rows for a role with any privilege on the replication or failover group.
* Only returns rows for a secondary replication or failover group in the current account.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

* Following is the list of phases in the order processed:

  | # | Phase name | Description |
  | --- | --- | --- |
  | 1 | `SECONDARY_SYNCHRONIZING_MEMBERSHIP` | The secondary replication or failover group receives information from the primary group about the objects included in the group, and updates its membership metadata. |
  | 2 | `SECONDARY_UPLOADING_INVENTORY` | The secondary replication or failover group sends an inventory of its objects in the target account to the primary group. |
  | 3 | `PRIMARY_UPLOADING_METADATA` | The primary replication or failover group creates a snapshot of metadata in the source account and sends it to the secondary group. |
  | 4 | `PRIMARY_UPLOADING_DATA` | The primary replication or failover group copies the files the secondary group needs to reconcile any deltas between the objects in the source and target accounts. |
  | 5 | `SECONDARY_DOWNLOADING_METADATA` | The secondary replication or failover group applies the snapshot of the metadata that was sent by the primary. The metadata updates are not applied atomically and instead applied over time. |
  | 6 | `SECONDARY_DOWNLOADING_DATA` | The secondary replication or failover group copies the files sent by the primary group to the target account. |
  | 7 | `COMPLETED` / `FAILED` / `CANCELED` | Refresh operation status. |
* In the `PRIMARY_UPLOADING_DATA` and `SECONDARY_DOWNLOADING_DATA` phases, the `totalBytesToReplicate` value is estimated prior
  to the replication operation. This value may differ from the `totalBytesToUpload` or `totalBytesToDownload` value in
  the respective phase.

  For example, if during the `PRIMARY_UPLOADING_DATA` phase, a previous replication operation uploaded some
  bytes but was canceled before the operation completed, those bytes would not be uploaded again. In that case, `totalBytesToUpload`
  would be lower than `totalBytesToReplicate`.

## Examples

To retrieve the current refresh progress for replication group `rg1`, execute the following
statement:

```sqlexample
SELECT phase_name, start_time, end_time, progress, details
  FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS('rg1'));
```

To retrieve the replication group refresh progress by query ID, replace the query ID in
the example and execute the following statement:

```sqlexample
SELECT phase_name, start_time, end_time, progress, details
  FROM TABLE(
    INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB(
      '012a3b45-1234-a12b-0000-1aa200012345'));
```

To retrieve the refresh progress for the last 12 hours (default) for all failover groups and replication groups,
execute the following statement:

```sqlexample
SELECT phase_name, start_time, end_time, progress, details
  FROM TABLE(INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS_ALL());
```

To retrieve the refresh progress for the last 7 days for all groups:

```sqlexample
SELECT phase_name, start_time, end_time, progress, details
  FROM TABLE(
    INFORMATION_SCHEMA.REPLICATION_GROUP_REFRESH_PROGRESS_ALL(
        DATE_RANGE_START => DATEADD(D, -7, CURRENT_DATE),
        DATE_RANGE_END => CURRENT_DATE));
```

---
title: REPLICATION_GROUP_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/replication_group_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# REPLICATION_GROUP_USAGE_HISTORY

Returns the replication usage history for secondary replication or failover groups within the last 14 days.

## Syntax

```sqlsyntax
REPLICATION_GROUP_USAGE_HISTORY(
   [ DATE_RANGE_START => <constant_expr> ]
   [, DATE_RANGE_END => <constant_expr> ]
   [, REPLICATION_GROUP_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range, within the last 2 weeks, for which to retrieve the data load history:

    * If an end date is not specified, then [CURRENT_TIMESTAMP](current_timestamp.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 12 hours prior to the `DATE_RANGE_END`

`REPLICATION_GROUP_NAME => string`
:   A string specifying a replication or failover group. Only replication operations for the specified group are returned.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| REPLICATION_GROUP_NAME | TEXT | Name of the replication group. |
| CREDITS_USED | TEXT | Number of credits billed for replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for replication during the START_TIME and END_TIME window. |

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* Returns results only for a secondary replication or failover group in the current account.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the replication usage history for the last 7 days:

> ```sqlexample
> SELECT START_TIME, END_TIME, REPLICATION_GROUP_NAME, CREDITS_USED, BYTES_TRANSFERRED
>   FROM TABLE(information_schema.replication_group_usage_history(date_range_start=>dateadd('day', -7, current_date())));
> ```

Retrieve the replication usage history for the last 7 days for replication group `myrg`:

> ```sqlexample
> SELECT START_TIME, END_TIME, REPLICATION_GROUP_NAME, CREDITS_USED, BYTES_TRANSFERRED
>   FROM TABLE(information_schema.replication_group_usage_history(
>     date_range_start => dateadd('day', -7, current_date()),
>     replication_group_name => 'myrg'
> ));
> ```

---
title: REPLICATION_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/replication_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# REPLICATION_USAGE_HISTORY

This table function can be used to query the replication history for a specified database within a specified date range. The information returned by the function includes the database name, credits consumed and bytes transferred for replication.

> **Note:**
>
> This function returns replication usage activity within the last 14 days.

## Syntax

```sqlsyntax
REPLICATION_USAGE_HISTORY(
  [ DATE_RANGE_START => <constant_expr> ]
  [ , DATE_RANGE_END => <constant_expr> ]
  [ , DATABASE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range to display the database replication history:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to show the previous 10 minutes of history).

    For example, if `DATE_RANGE_END` is CURRENT_DATE, then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

`DATABASE_NAME => 'string'`
:   Database name. If specified, only shows the history for the specified database.

    If a name is not specified, then the results include the data for each database replicated within the specified time range.

## Output

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| DATABASE_NAME | TEXT | Name of the database. |
| CREDITS_USED | TEXT | Number of credits billed for database replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for database replication during the START_TIME and END_TIME window. |

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve the replication history for a 30 minute range for your account:

> ```sqlexample
> select *
>   from table(information_schema.replication_usage_history(
>     date_range_start=>'2019-02-10 12:00:00.000 +0000',
>     date_range_end=>'2019-02-10 12:30:00.000 +0000'));
> ```

Retrieve the history for the last 12 hours for your account:

> ```sqlexample
> select *
>   from table(information_schema.replication_usage_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the history for the past week for your account:

> ```sqlexample
> select *
>   from table(information_schema.replication_usage_history(
>     date_range_start=>dateadd(d, -7, current_date),
>     date_range_end=>current_date));
> ```

Retrieve the replication history for the past week for a specified database in your account:

> ```sqlexample
> select *
>   from table(information_schema.replication_usage_history(
>     date_range_start=>dateadd(d, -7, current_date),
>     date_range_end=>current_date,
>     database_name=>'mydb'));
> ```

---
title: REST_EVENT_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/rest_event_history.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# REST_EVENT_HISTORY

Returns a list of SCIM REST API requests made to Snowflake over a specified time interval.

## Syntax

```sqlsyntax
REST_EVENT_HISTORY(
      REST_SERVICE_TYPE => 'scim'
      [, TIME_RANGE_START => <constant_expr> ]
      [, TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <integer> ] )
```

## Arguments

**Required:**

`REST_SERVICE_TYPE => 'scim'`
:   The type of REST API service. Currently, Snowflake only supports `SCIM`.

**Optional:**

`TIME_RANGE_START => <constant_expr>`, . `TIME_RANGE_END => <constant_expr>`
:   Time range (in TIMESTAMP_LTZ format), within the last 7 days, in which the login event occurred.

    * If `TIME_RANGE_START` is not specified, all logs from the last seven days are returned.
    * If `TIME_RANGE_END` is not specified, all logs are returned.

    If the time range does not fall within the last 7 days, an error is returned.

    For more information on functions that you can use, see [Date & time functions](../functions-date-time.md).

`RESULT_LIMIT => <integer>`
:   A number specifying the maximum number of rows returned by the function.

    If the number of matching rows is greater than this limit, the queries with the most recent end time (or those that are still executing) are returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `100`.

## Usage notes

* Currently, the REST_EVENT_HISTORY table function can only be used with [SCIM](../../user-guide/scim-intro.md).
* Only account administrators (i.e. users with the ACCOUNTADMIN role) can obtain query results.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Time of the event occurrence. |
| EVENT_ID | NUMBER | The unique identifier for the request. |
| EVENT_TYPE | TEXT | The REST API event category. Currently, `SCIM` is the only possible value. |
| ENDPOINT | TEXT | The endpoint in the API request (e.g. `scim/v2/Users/<id>`). |
| METHOD | TEXT | The HTTP method used in the request. |
| STATUS | TEXT | The HTTP status result of the request. |
| ERROR_CODE | TEXT | Error code, if the request was not successful. |
| DETAILS | TEXT | A description of the result of the API request in JSON format. |
| CLIENT_IP | TEXT | The IP address where the request originated from. |
| ACTOR_NAME | TEXT | The name of the actor making the request. |
| ACTOR_DOMAIN | TEXT | The domain (i.e. security integration) in which the request was made. |
| RESOURCE_NAME | TEXT | The name of the object making the request. |
| RESOURCE_DOMAIN | TEXT | The object type (e.g. user) making the request. |

## Examples

Return the SCIM REST API requests made in the last five minutes, up to 200 requests.

> ```sqlexample
> use role accountadmin;
> use database my_db;
> use schema information_schema;
> select *
>   from table(rest_event_history(
>       rest_service_type => 'scim',
>       time_range_start => dateadd('minutes',-5,current_timestamp()),
>       time_range_end => current_timestamp(),
>       200))
>   order by event_timestamp;
> ```

---
title: RESULT_SCAN
source: https://docs.snowflake.com/en/sql-reference/functions/result_scan.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# RESULT_SCAN

Returns the result set of a previous command (within 24 hours of when you ran the query) as if the result was a table.
This function is particularly useful if you want to process the output from any of the following operations:

* [SHOW](../sql/show.md) or [DESC[RIBE]](../sql/desc.md) command that you ran.
* Query that you ran on metadata or account usage information, such as [Snowflake Information Schema](../info-schema.md)
  or [Account Usage](../account-usage.md).
* The result of a stored procedure that you [called](../sql/call.md).

  As an alternative to using RESULT_SCAN, you can call a stored procedure that returns tabular data in the
  [FROM clause of a SELECT statement](../../developer-guide/stored-procedure/stored-procedures-selecting-from.md).

The command or query can be from the current session or any of your other sessions, including past sessions, as long as the 24 hour period hasn’t elapsed. This period isn’t adjustable. For more information, see [Using Persisted Query Results](../../user-guide/querying-persisted-results.md).

> **Tip:**
>
> You can use the [pipe operator](../operators-flow.md) (`->>`) instead of this function to process
> the results of a previous command.

See also:
:   [DESCRIBE RESULT](../sql/desc-result.md) (Account & Session DDL)

## Syntax

```sqlsyntax
RESULT_SCAN ( [ { '<query_id>' | <query_index>  | LAST_QUERY_ID() } ] )
```

## Arguments

`'query_id'` or `query_index` or `LAST_QUERY_ID()`
:   A specification of a query that you ran within the last 24 hours in any session, an integer index of a query in the
    current session, or the [LAST_QUERY_ID](last_query_id.md) function, which returns the ID of a query within your current session.

    Snowflake query IDs are unique strings that resemble `01b71944-0001-b181-0000-0129032279f6`.

    Query indexes are relative to the first query in the current session (if positive) or to the most recent query (if
    negative). For example, `RESULT_SCAN(-1)` is equivalent to `RESULT_SCAN(LAST_QUERY_ID())`.

    This argument is optional. If it is omitted, the default is `RESULT_SCAN(-1)`, which returns the result set of
    the most recent command.

## Usage notes

* If the original query was run manually, only the user who ran the original query can use the RESULT_SCAN function to process
  the output of the query. Even a user with the ACCOUNTADMIN privilege can’t access the results of another user’s query by calling
  RESULT_SCAN.
* If the original query was run by using [a task](../../user-guide/tasks-intro.md), the role that owns the task, instead of a specific user,
  triggered and ran the query. If a user or a task is operating with the same role, they can use RESULT_SCAN to access the query results.
* Snowflake stores all query results for 24 hours. This function only returns results for queries that were run within this time period.
* Result sets don’t have any metadata associated with them, so processing large results might be slower than if you were querying an actual table.
* The query containing the RESULT_SCAN can include clauses, such as filters and ORDER BY clauses, that weren’t
  in the original query. You can use these clauses to narrow down or modify the result set.
* A RESULT_SCAN isn’t guaranteed to return rows in the same order as the original query returned the rows. You can
  include an ORDER BY clause with the RESULT_SCAN query to specify a specific order.
* To retrieve the ID for a specific query, use any of the following methods:

  Snowsight:
  :   In either of the following locations, click the provided link to display or copy the ID:

      + In Worksheets under Projects, after running a query, the Query Details include a link for the ID.
      + In Query History under Monitoring, each query includes the ID as a link.

  SQL:
  :   Call one of the following functions:

      + [QUERY_HISTORY , QUERY_HISTORY_BY_\*](query_history.md) table function.
      + [LAST_QUERY_ID](last_query_id.md) function (if the query was run in the current session).

        For example:

        ```sqlexample
        SELECT LAST_QUERY_ID(-2);
        ```

        This is equivalent to using [LAST_QUERY_ID](last_query_id.md) as the input for RESULT_SCAN.
* If RESULT_SCAN processes query output that contained duplicate column names (for example, a query that joined
  two tables that have overlapping column names), then RESULT_SCAN references the duplicate columns with modified
  names, appending `_1`, `_2`, and so on to the original name. For an example, see the following Examples section.
* Timestamps in Parquet files that are queried by using the vectorized scanner sometimes display the time in a different time zone. Use the
  [CONVERT_TIMEZONE](convert_timezone.md) function to convert to a standard time zone for all timestamp data.

## Collation details

When `RESULT_SCAN` returns the results of the previous statement, `RESULT_SCAN` preserves the
collation specification(s) of the values that it returns.

## Examples

The following examples use the RESULT_SCAN function.

### Simple examples

Retrieve all values greater than `1` from the result of your most recent query in the current session:

```sqlexample
SELECT $1 AS value FROM VALUES (1), (2), (3);
```

```output
+-------+
| VALUE |
|-------|
|     1 |
|     2 |
|     3 |
+-------+
```

```sqlexample
SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())) WHERE value > 1;
```

```output
+-------+
| VALUE |
|-------|
|     2 |
|     3 |
+-------+
```

Retrieve all values from your second most recent query in the current session:

```sqlexample
SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID(-2)));
```

Retrieve all values from your first query in the current session:

```sqlexample
SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID(1)));
```

Retrieve the values from the `c2` column in the result of the specified query:

```sqlexample
SELECT c2 FROM TABLE(RESULT_SCAN('ce6687a4-331b-4a57-a061-02b2b0f0c17c'));
```

### Examples using DESCRIBE and SHOW commands

Process the result of a [DESCRIBE USER](../sql/desc-user.md) command to retrieve
particular fields of interest, such as the user’s default role. Because the
output column names from the DESC USER command were generated
in lowercase, the commands use [double-quoted identifiers](../identifiers-syntax.md)
for the column names in the query to ensure that the column names in the query
match the column names in the output that was scanned.

```sqlexample
DESC USER jessicajones;
SELECT "property", "value" FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
  WHERE "property" = 'DEFAULT_ROLE';
```

Process the result of a [SHOW TABLES](../sql/show-tables.md) command to extract empty tables that are older than 21 days. The SHOW command generates lowercase column names, so the command quotes the names to use matching case:

```sqlexample
SHOW TABLES;
SELECT "database_name", "schema_name", "name" as "table_name", "rows", "created_on"
  FROM table(RESULT_SCAN(LAST_QUERY_ID()))
  WHERE "rows" = 0 AND "created_on" < DATEADD(day, -21, CURRENT_TIMESTAMP())
  ORDER BY "created_on";
```

Process the result of a [SHOW TABLES](../sql/show-tables.md) command to extract the tables in descending order of size.
The following example also shows how to use a UDF to show table size in a slightly more human-readable format:

```sqlexample
-- Show byte counts with suffixes such as "KB", "MB", and "GB".
CREATE OR REPLACE FUNCTION NiceBytes(NUMBER_OF_BYTES INTEGER)
RETURNS VARCHAR
AS
$$
CASE
  WHEN NUMBER_OF_BYTES < 1024
    THEN NUMBER_OF_BYTES::VARCHAR
  WHEN NUMBER_OF_BYTES >= 1024 AND NUMBER_OF_BYTES < 1048576
    THEN (NUMBER_OF_BYTES / 1024)::VARCHAR || 'KB'
  WHEN NUMBER_OF_BYTES >= 1048576 AND NUMBER_OF_BYTES < (POW(2, 30))
    THEN (NUMBER_OF_BYTES / 1048576)::VARCHAR || 'MB'
  ELSE
    (NUMBER_OF_BYTES / POW(2, 30))::VARCHAR || 'GB'
END
$$
;
SHOW TABLES;
-- Show all of my tables in descending order of size.
SELECT "database_name", "schema_name", "name" as "table_name", NiceBytes("bytes") AS "size"
  FROM table(RESULT_SCAN(LAST_QUERY_ID()))
  ORDER BY "bytes" DESC;
```

### Examples using a stored procedure

Stored procedure calls return a value. However, this value can’t be processed directly because you can’t embed a
stored procedure call in another statement. To work around this limitation, you can use RESULT_SCAN to process the
value returned by a stored procedure. A simplified example is below:

First, create a procedure that returns a “complicated” value (in this case, a string that contains
JSON-compatible data) that can be processed after it has been returned from the CALL.

```sqlexample
CREATE OR REPLACE PROCEDURE return_json()
  RETURNS VARCHAR
  LANGUAGE JavaScript
  AS
  $$
    return '{"keyA": "ValueA", "keyB": "ValueB"}';
  $$
  ;
```

Call the procedure:

```sqlexample
CALL return_json();
```

```output
+--------------------------------------+
| RETURN_JSON                          |
|--------------------------------------|
| {"keyA": "ValueA", "keyB": "ValueB"} |
+--------------------------------------+
```

The next three steps extract the data from the result set.

Get the first (and only) column:

```sqlexample
SELECT $1 AS output_col FROM table(RESULT_SCAN(LAST_QUERY_ID()));
```

```output
+--------------------------------------+
| OUTPUT_COL                           |
|--------------------------------------|
| {"keyA": "ValueA", "keyB": "ValueB"} |
+--------------------------------------+
```

Convert the output from a VARCHAR value to a VARIANT value:

```sqlexample
SELECT PARSE_JSON(output_col) AS json_col FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

```output
+---------------------+
| JSON_COL            |
|---------------------|
| {                   |
|   "keyA": "ValueA", |
|   "keyB": "ValueB"  |
| }                   |
+---------------------+
```

Extract the value that corresponds to the key `keyB`:

```sqlexample
SELECT json_col:keyB FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

```output
+---------------+
| JSON_COL:KEYB |
|---------------|
| "ValueB"      |
+---------------+
```

The following example shows a more compact way to extract the same data that was extracted in the previous example. This example has
fewer statements, but is harder to read:

```sqlexample
CALL return_json();
```

```output
+--------------------------------------+
| RETURN_JSON                          |
|--------------------------------------|
| {"keyA": "ValueA", "keyB": "ValueB"} |
+--------------------------------------+
```

```sqlexample
SELECT JSON_COL:keyB
 FROM (
      SELECT PARSE_JSON($1::VARIANT) AS json_col
        FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
      );
```

```output
+---------------+
| JSON_COL:KEYB |
|---------------|
| "ValueB"      |
+---------------+
```

The output from the CALL uses the function name as the column name. You can use that column name in
the query. The following example shows one additional compact version, in which the column is referenced by name instead
of the column number:

```sqlexample
CALL return_json();
```

```output
+--------------------------------------+
| RETURN_JSON                          |
|--------------------------------------|
| {"keyA": "ValueA", "keyB": "ValueB"} |
+--------------------------------------+
```

```sqlexample
SELECT json_col:keyB
  FROM (
       SELECT PARSE_JSON(RETURN_JSON::VARIANT) AS json_col
         FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
       );
```

```output
+---------------+
| JSON_COL:KEYB |
|---------------|
| "ValueB"      |
+---------------+
```

### Example with duplicate column names

The following example shows that RESULT_SCAN effectively references alternate column names when there are duplicate
column names in the original query:

Create two tables that have at least one column with the same name:

```sqlexample
CREATE TABLE employees (id INT);

CREATE TABLE dependents (id INT, employee_id INT);
```

Load data into the two tables:

```sqlexample
INSERT INTO employees (id) VALUES (11);

INSERT INTO dependents (id, employee_id) VALUES (101, 11);
```

Now run a query for which the output will contain two columns with the same name:

```sqlexample
SELECT *
  FROM employees INNER JOIN dependents
    ON dependents.employee_ID = employees.id
  ORDER BY employees.id, dependents.id;
```

```output
+----+-----+-------------+
| ID |  ID | EMPLOYEE_ID |
|----+-----+-------------|
| 11 | 101 |          11 |
+----+-----+-------------+
```

Now call RESULT_SCAN to process the results of that query. If different columns that have the same name in the
results, RESULT_SCAN uses the original name for the first column and assigns the second column a modified name
that is unique. To make the name unique, RESULT_SCAN appends the suffix `_n` to the name, where
`n` is the next number available that produces a name that is different from the names of the previous
columns.

```sqlexample
SELECT id, id_1, employee_id
  FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
  WHERE id_1 = 101;
```

```output
+----+------+-------------+
| ID | ID_1 | EMPLOYEE_ID |
|----+------+-------------|
| 11 |  101 |          11 |
+----+------+-------------+
```

---
title: REVERSE
source: https://docs.snowflake.com/en/sql-reference/functions/reverse.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# REVERSE

Reverses the order of characters in a string, or of bytes in a binary value.

The returned value is the same length as the input, but with the characters/bytes in reverse order. If `subject` is NULL, the result is also NULL.

## Syntax

```sqlsyntax
REVERSE(<subject>)
```

## Collation details

* No impact.
* The collation of the result is the same as the collation of the input.
* In languages where the alphabet contains digraphs or trigraphs (such as “Dz” and “Dzs” in Hungarian), each character in each digraph and trigraph is treated as an independent character, not as part of a single multi-character letter.

  For example, languages with 2-character and 3-character letters (e.g. “dzs” in Hungarian, “ch” in Czech)
  are reversed based on the individual characters, not the letters. See the Examples section below for an example.

## Examples

This example reverses a string:

> ```sqlexample
> SELECT REVERSE('Hello, world!');
> +--------------------------+
> | REVERSE('HELLO, WORLD!') |
> |--------------------------|
> | !dlrow ,olleH            |
> +--------------------------+
> ```

This example reverses a date:

> ```sqlexample
> SELECT '2019-05-22'::DATE, REVERSE('2019-05-22'::DATE) AS reversed;
> +--------------------+------------+
> | '2019-05-22'::DATE | REVERSED   |
> |--------------------+------------|
> | 2019-05-22         | 22-50-9102 |
> +--------------------+------------+
> ```

The following shows that in languages where a single letter is composed of multiple characters, `REVERSE`
reverses based on characters, not letters:

> ```sqlexample
> CREATE TABLE strings (s1 VARCHAR COLLATE 'en', s2 VARCHAR COLLATE 'hu');
> INSERT INTO strings (s1, s2) VALUES ('dzsa', COLLATE('dzsa', 'hu'));
> ```
>
> ```sqlexample
> SELECT s1, s2, REVERSE(s1), REVERSE(s2)
>     FROM strings;
> +------+------+-------------+-------------+
> | S1   | S2   | REVERSE(S1) | REVERSE(S2) |
> |------+------+-------------+-------------|
> | dzsa | dzsa | aszd        | aszd        |
> +------+------+-------------+-------------+
> ```

---
title: RIGHT
source: https://docs.snowflake.com/en/sql-reference/functions/right.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# RIGHT

Returns a rightmost substring of its input.

`RIGHT(STR, N)` is equivalent to `SUBSTR(STR, LENGTH(STR)-N+1, N)`.

See also:
:   [LEFT](left.md) , [SUBSTR , SUBSTRING](substr.md)

## Syntax

```sqlsyntax
RIGHT( <string_expr> , <length_expr> )
```

## Arguments

`string_expr`
:   An expression that evaluates to a VARCHAR or BINARY value.

`length_expr`
:   An expression that evaluates to an integer. It specifies:

    * The number of UTF-8 characters to return if the input is a VARCHAR value.
    * The number of bytes to return if the input is a BINARY value.

    Specify a length that is greater than or equal to zero. If the length is a negative number, the function returns an
    empty string.

## Returns

The data type of the returned value is the same as the data type of the `string_expr` (VARCHAR or BINARY).

If any of the inputs are NULL, NULL is returned.

## Usage notes

If `length_expr` is greater than the length of `expr`, then the function returns `expr`.

## Collation details

* Collation applies to VARCHAR inputs. Collation doesn’t apply if the input data type of the first parameter
  is BINARY.
* No impact. Although collation is accepted syntactically, collations don’t affect processing. For example,
  two-character and three-character letters in languages (for example, “dzs” in Hungarian or “ch” in Czech)
  are still counted as two or three characters (not one character) for the length argument.
* The collation of the result is the same as the collation of the input. This can be useful if the returned value is passed to another function as part of nested function calls.

## Examples

The following examples use the RIGHT function.

### Basic example

```sqlexample
SELECT RIGHT('ABCDEFG', 3);
```

```output
+---------------------+
| RIGHT('ABCDEFG', 3) |
|---------------------|
| EFG                 |
+---------------------+
```

### Returning substrings for email, phone, and date strings

The following examples return substrings for customer information in a table.

Create the table and insert data:

```sqlexample
CREATE OR REPLACE TABLE customer_contact_example (
    cust_id INT,
    cust_email VARCHAR,
    cust_phone VARCHAR,
    activation_date VARCHAR)
  AS SELECT
    column1,
    column2,
    column3,
    column4
  FROM
    VALUES
      (1, 'some_text@example.com', '800-555-0100', '20210320'),
      (2, 'some_other_text@example.org', '800-555-0101', '20240509'),
      (3, 'some_different_text@example.net', '800-555-0102', '20191017');

SELECT * from customer_contact_example;
```

```output
+---------+---------------------------------+--------------+-----------------+
| CUST_ID | CUST_EMAIL                      | CUST_PHONE   | ACTIVATION_DATE |
|---------+---------------------------------+--------------+-----------------|
|       1 | some_text@example.com           | 800-555-0100 | 20210320        |
|       2 | some_other_text@example.org     | 800-555-0101 | 20240509        |
|       3 | some_different_text@example.net | 800-555-0102 | 20191017        |
+---------+---------------------------------+--------------+-----------------+
```

Use the [LENGTH](length.md) and [POSITION](position.md) functions with the RIGHT function to extract the domains from
email addresses. This example first finds the length of the input string and then subtracts the position
of `@` in each string to determine the length of the domain:

```sqlexample
SELECT cust_id,
       cust_email,
       RIGHT(cust_email, LENGTH(cust_email) - (POSITION('@' IN cust_email))) AS domain
  FROM customer_contact_example;
```

```output
+---------+---------------------------------+-------------+
| CUST_ID | CUST_EMAIL                      | DOMAIN      |
|---------+---------------------------------+-------------|
|       1 | some_text@example.com           | example.com |
|       2 | some_other_text@example.org     | example.org |
|       3 | some_different_text@example.net | example.net |
+---------+---------------------------------+-------------+
```

> **Tip:**
>
> You can use the POSITION function to find the position of other characters, such as an empty
> character (`' '`) or an underscore (`_`).

In the `cust_phone` column in the table, the area code is always the first three characters. Extract
the phone numbers without the area codes:

```sqlexample
SELECT cust_id,
       cust_phone,
       RIGHT(cust_phone, 8) AS phone_without_area_code
  FROM customer_contact_example;
```

```output
+---------+--------------+-------------------------+
| CUST_ID | CUST_PHONE   | PHONE_WITHOUT_AREA_CODE |
|---------+--------------+-------------------------|
|       1 | 800-555-0100 | 555-0100                |
|       2 | 800-555-0101 | 555-0101                |
|       3 | 800-555-0102 | 555-0102                |
+---------+--------------+-------------------------+
```

In the `activation_date` column in the table, the date is always in the format `YYYYMMDD`. Extract day from
these strings:

```sqlexample
SELECT cust_id,
       activation_date,
       RIGHT(activation_date, 2) AS day
  FROM customer_contact_example;
```

```output
+---------+-----------------+-----+
| CUST_ID | ACTIVATION_DATE | DAY |
|---------+-----------------+-----|
|       1 | 20210320        | 20  |
|       2 | 20240509        | 09  |
|       3 | 20191017        | 17  |
+---------+-----------------+-----+
```

---
title: ROUND
source: https://docs.snowflake.com/en/sql-reference/functions/round.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# ROUND

Returns rounded values for `input_expr`.

See also:
:   [CEIL](ceil.md) , [FLOOR](floor.md) , [TRUNCATE , TRUNC](trunc.md)

## Syntax

```sqlsyntax
ROUND( <input_expr> [ , <scale_expr> [ , '<rounding_mode>' ] ] )
```

```sqlsyntax
ROUND( EXPR => <input_expr> ,
       SCALE => <scale_expr>
       [ , ROUNDING_MODE => '<rounding_mode>'  ] )
```

## Arguments

**Required:**

`input_expr` . OR . `EXPR => input_expr`
:   The value or expression to operate on. The data type must be one of the numeric data types, such as DECFLOAT,
    FLOAT, or NUMBER.

    If you specify the `EXPR =>` named argument, you must also specify the `SCALE =>` named argument.

**Optional:**

`scale_expr` . OR . `SCALE => scale_expr`
:   The number of digits the output includes after the decimal point.

    The default `scale_expr` is zero, meaning that the function removes all digits after the decimal point.

    For information about negative numbers, see Usage notes.

    If you specify the `SCALE =>` named argument, you must specify `EXPR =>` as the preceding named argument.

`'rounding_mode'` . OR . `ROUNDING_MODE => 'rounding_mode'`
:   The rounding mode to use. You can specify one of the following values:

    * `HALF_AWAY_FROM_ZERO`. This mode rounds the value [half away from zero](https://en.wikipedia.org/wiki/Rounding#Rounding_half_away_from_zero).
    * `HALF_TO_EVEN`. This mode rounds the value [half to even](https://en.wikipedia.org/wiki/Rounding#Rounding_half_to_even).

    Default: `HALF_AWAY_FROM_ZERO`

    If you specify the `ROUNDING_MODE =>` named argument, you must specify both `EXPR =>` and `SCALE =>` as preceding named arguments.

    > **Note:**
    >
    > If you specify either value for the `rounding_mode` argument, the data type of `input_expr` must be
    > [one of the data types for a fixed-point number](../data-types-numeric.md).
    >
    > [Data types for floating point numbers](../data-types-numeric.md) (for example, FLOAT) aren’t supported
    > with this argument.

## Returns

The return type is based on the input type:

* If the input expression is a FLOAT, the returned type is a FLOAT.
* If the input expression is DECFLOAT, the returned type is DECFLOAT.
* If the input expression is a NUMBER, the returned type is a NUMBER.

  + If the input scale is constant:

    - If the input scale is positive, the returned type has a scale equal to the input scale and has a precision large enough to
      encompass any possible result.
    - If the input scale is negative, the returned type has a scale of 0.
  + If the input scale isn’t constant, the returned type’s scale is the same as the input expression’s.

If the scale is zero, then the value is effectively an INTEGER.

For example:

* The data type returned by `ROUND(3.14::FLOAT, 1)` is FLOAT.
* The NUMBER returned by `ROUND(3.14, 1)` has scale 1 and precision at least 3.
* The NUMBER returned by `ROUND(-9.99, 0)` has scale 0 and precision at least 2.
* The NUMBER returned by `ROUND(33.33, -1)` has scale 0 and precision at least 3.

If either the `input_expr` or the `scale_expr` is NULL, the function returns NULL.

## Usage notes

* When you mix arguments by position and by name, all of the positional arguments must come before
  all of the named arguments.
* When you specify an argument by name, you can’t use double quotes around the argument name.

* If `scale_expr` is negative, it specifies the number of places before the decimal point to
  which to adjust the number. For example, if the scale is -2, the result is a multiple of 100.
* If `scale_expr` is larger than the input expression scale, the function doesn’t have any effect.
* By default, half-points are rounded away from zero for decimals. For example, -0.5 is rounded to -1.0.

  To change the rounding mode to round the value [half to even](https://en.wikipedia.org/wiki/Rounding#Rounding_half_to_even) (for example, to round -0.5 to 0), specify
  `'HALF_TO_EVEN'` for the `rounding_mode` argument.

  > **Note:**
  >
  > If you specify the `rounding_mode` argument, the data type of the `input_expr` argument must be
  > [one of the data types for a fixed-point number](../data-types-numeric.md).
* Floating point numbers are approximate values. A floating point number might not round as expected.
* If rounding brings the number outside of the range of values of the data type, the function returns an error.

## Examples

This following example shows a simple use of ROUND, with the default number of decimal places (0):

```sqlexample
SELECT ROUND(135.135), ROUND(-975.975);
```

```output
+----------------+-----------------+
| ROUND(135.135) | ROUND(-975.975) |
|----------------+-----------------|
|            135 |            -976 |
+----------------+-----------------+
```

The next example queries the data in the following table:

```sqlexample
CREATE TABLE test_ceiling (n FLOAT, scale INTEGER);

INSERT INTO test_ceiling (n, scale) VALUES
  (-975.975, -1),
  (-975.975,  0),
  (-975.975,  2),
  ( 135.135, -2),
  ( 135.135,  0),
  ( 135.135,  1),
  ( 135.135,  3),
  ( 135.135, 50),
  ( 135.135, NULL);
```

Query the table and use a range of values for the `scale_expr` argument:

```sqlexample
SELECT n, scale, ROUND(n, scale)
  FROM test_ceiling
  ORDER BY n, scale;
```

```output
+----------+-------+-----------------+
|        N | SCALE | ROUND(N, SCALE) |
|----------+-------+-----------------|
| -975.975 |    -1 |        -980     |
| -975.975 |     0 |        -976     |
| -975.975 |     2 |        -975.98  |
|  135.135 |    -2 |         100     |
|  135.135 |     0 |         135     |
|  135.135 |     1 |         135.1   |
|  135.135 |     3 |         135.135 |
|  135.135 |    50 |         135.135 |
|  135.135 |  NULL |            NULL |
+----------+-------+-----------------+
```

The next two examples show the difference between using the default rounding mode (`'HALF_AWAY_FROM_ZERO'`) and the rounding
mode `'HALF_TO_EVEN'`. Both examples call the ROUND function twice, first with the default rounding behavior, then with `'HALF_TO_EVEN'`.

The first example uses a positive input value of 2.5:

```sqlexample
SELECT ROUND(2.5, 0), ROUND(2.5, 0, 'HALF_TO_EVEN');
```

```output
+---------------+-------------------------------+
| ROUND(2.5, 0) | ROUND(2.5, 0, 'HALF_TO_EVEN') |
|---------------+-------------------------------|
|             3 |                             2 |
+---------------+-------------------------------+
```

The second example uses a negative input value of -2.5:

```sqlexample
SELECT ROUND(-2.5, 0), ROUND(-2.5, 0, 'HALF_TO_EVEN');
```

```output
+----------------+--------------------------------+
| ROUND(-2.5, 0) | ROUND(-2.5, 0, 'HALF_TO_EVEN') |
|----------------+--------------------------------|
|             -3 |                             -2 |
+----------------+--------------------------------+
```

The next two examples demonstrate how to specify the arguments to the function by name, rather than by position:

```sqlexample
SELECT ROUND(
  EXPR => -2.5,
  SCALE => 0) AS named_arguments;
```

```output
+-----------------+
| NAMED_ARGUMENTS |
|-----------------|
|              -3 |
+-----------------+
```

```sqlexample
SELECT ROUND(
  EXPR => -2.5,
  SCALE => 0,
  ROUNDING_MODE => 'HALF_TO_EVEN') AS named_with_rounding_mode;
```

```output
+--------------------------+
| NAMED_WITH_ROUNDING_MODE |
|--------------------------|
|                       -2 |
+--------------------------+
```

The next example shows that FLOAT values aren’t always stored exactly. As you can see below, in some cases .005 is
rounded to .01, while in other cases it is rounded to 0. The difference isn’t in the rounding; the difference is
actually in the underlying representation of the floating point number, because 1.005 is stored as a number very slightly
smaller than 1.005 (approximately 1.004999). The DECIMAL value, however is stored as an exact number, and is rounded
to .01 as expected in all cases.

Create and load a table:

```sqlexample
CREATE OR REPLACE TEMP TABLE rnd1(f float, d DECIMAL(10, 3));

INSERT INTO rnd1 (f, d) VALUES
  ( -10.005,  -10.005),
  (  -1.005,   -1.005),
  (   1.005,    1.005),
  (  10.005,   10.005);
```

Show examples of the difference between rounded FLOAT values and rounded DECIMAL values:

```sqlexample
SELECT f,
       ROUND(f, 2),
       d,
       ROUND(d, 2)
  FROM rnd1
  ORDER BY 1;
```

```output
+---------+-------------+---------+-------------+
|       F | ROUND(F, 2) |       D | ROUND(D, 2) |
|---------+-------------+---------+-------------|
| -10.005 |      -10.01 | -10.005 |      -10.01 |
|  -1.005 |       -1    |  -1.005 |       -1.01 |
|   1.005 |        1    |   1.005 |        1.01 |
|  10.005 |       10.01 |  10.005 |       10.01 |
+---------+-------------+---------+-------------+
```

---
title: ROW_COUNT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_row_count.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# ROW_COUNT (system data metric function)

Returns the total number of rows in a table.

## Syntax

Not applicable.

## Returns

The function returns a scalar value with a NUMBER data type.

## Usage notes

You can’t call this function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

---
title: ROW_NUMBER
source: https://docs.snowflake.com/en/sql-reference/functions/row_number.md
section: SQL Functions
---

Categories:
:   [Window function syntax and usage](../functions-window-syntax.md) (Ranking)

# ROW_NUMBER

Returns a unique row number for each row within a window partition.

The row number starts at 1 and continues up sequentially.

## Syntax

```sqlsyntax
ROW_NUMBER() OVER (
  [ PARTITION BY <expr1> [, <expr2> ... ] ]
  ORDER BY <expr3> [ , <expr4> ... ] [ { ASC | DESC } [ NULLS { FIRST | LAST } ] ]
  )
```

## Arguments

None.

## Usage notes

* `expr1` and `expr2` specify the column(s) or expression(s)
  to partition by. You can partition by 0, 1, or more expressions.

  For example, suppose that you are selecting data across multiple states
  (or provinces) and you want row numbers from 1 to N within each
  state; in that case, you can partition by the state.

  If you want only a single group, then omit the PARTITION BY clause.
* `expr3` and `expr4` specify the column(s) or expression(s) to
  use to determine the order of the rows. You can order by 1 or more
  expressions.

  For example, if want to list farmers in order by production of corn, then
  use the `bushels_produced` column. For details,
  see Examples (in this topic).

## Examples

The query below shows how to assign row numbers within partitions. In this
case, the partitions are stock exchanges (for example, “N” for “NASDAQ”).

```sqlexample
SELECT
    symbol,
    exchange,
    shares,
    ROW_NUMBER() OVER (PARTITION BY exchange ORDER BY shares) AS row_number
  FROM trades;
```

```output
+------+--------+------+----------+
|SYMBOL|EXCHANGE|SHARES|ROW_NUMBER|
+------+--------+------+----------+
|SPY   |C       |   250|         1|
|AAPL  |C       |   250|         2|
|AAPL  |C       |   300|         3|
|SPY   |N       |   100|         1|
|AAPL  |N       |   300|         2|
|SPY   |N       |   500|         3|
|QQQ   |N       |   800|         4|
|QQQ   |N       |  2000|         5|
|YHOO  |N       |  5000|         6|
+------+--------+------+----------+
```

---
title: RPAD
source: https://docs.snowflake.com/en/sql-reference/functions/rpad.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# RPAD

Right-pads a string with characters from another string, or right-pads a binary value with bytes from another binary value.

The argument (`base`) is padded to length `length_expr` with characters/bytes from the `pad` argument.

See also:
:   [LPAD](lpad.md)

## Syntax

```sqlsyntax
RPAD( <base>, <length_expr> [, <pad>] )
```

## Arguments

`base`
:   A VARCHAR or BINARY value.

`length_expr`
:   An expression that evaluates to an integer. It specifies:

    * The number of UTF-8 characters to return if the input is VARCHAR.
    * The number of bytes to return if the input is BINARY.

`pad`
:   A VARCHAR or BINARY value. The type must match the data type of the `base` argument.
    Characters (or bytes) from this argument are used to pad the `base`.

## Returns

The data type of the returned value is the same as the data type of the `base` input value (VARCHAR or BINARY).

## Usage notes

* If the `base` argument is longer than `length_expr`, it is truncated to length `length_expr`.
* The `pad` argument can be multiple characters/bytes long. The `pad`
  argument is repeated in the result until the desired length (`length_expr`) is
  reached, truncating any superfluous characters/bytes in the `pad` argument.
  If the `pad` argument is empty, no padding is inserted, but the result is
  still truncated to length `length_expr`.
* When `base` is a string, the default `pad` string is `' '` (a single blank space). When
  `base` is a binary value, the `pad` argument must be provided explicitly.

## Collation details

* Collation applies to VARCHAR inputs. Collation doesn’t apply if the input data type of the first argument
  is BINARY.
* No impact.
  Although collation is accepted syntactically, collations have no impact on processing. For example, languages with
  two-character and three-character letters (for example, “dzs” in Hungarian, “ch” in Czech) still count
  those as two or three characters (not one character) for the length argument.
* The collation of the result is the same as the collation of the input. This can be useful if the returned value is passed to another function as part of nested function calls.
* Currently, Snowflake allows the `base` and `pad` arguments to have different collation specifiers.
  However, the individual collation specifiers can’t both be retained because the returned value has only one
  collation specifier. Snowflake recommends that you avoid using `pad` strings that have a different
  collation from the `base` string.

## Examples

These examples use the RPAD function to pad VARCHAR and BINARY data on the right.

Create and fill a table:

```sqlexample
CREATE OR REPLACE TABLE padding_example (v VARCHAR, b BINARY);

INSERT INTO padding_example (v, b)
  SELECT
    'Hi',
    HEX_ENCODE('Hi');

INSERT INTO padding_example (v, b)
  SELECT
    '-123.00',
    HEX_ENCODE('-123.00');

INSERT INTO padding_example (v, b)
  SELECT
    'Twelve Dollars',
    TO_BINARY(HEX_ENCODE('Twelve Dollars'), 'HEX');
```

Query the table to show the data:

```sqlexample
SELECT * FROM padding_example;
```

```output
+----------------+------------------------------+
| V              | B                            |
|----------------+------------------------------|
| Hi             | 4869                         |
| -123.00        | 2D3132332E3030               |
| Twelve Dollars | 5477656C766520446F6C6C617273 |
+----------------+------------------------------+
```

This example demonstrates right-padding of VARCHAR values using the RPAD function, with the
results limited to 10 characters:

```sqlexample
SELECT v,
       RPAD(v, 10, '_') AS pad_with_underscore,
       RPAD(v, 10, '$') AS pad_with_dollar_sign
  FROM padding_example
  ORDER BY v;
```

```output
+----------------+---------------------+----------------------+
| V              | PAD_WITH_UNDERSCORE | PAD_WITH_DOLLAR_SIGN |
|----------------+---------------------+----------------------|
| -123.00        | -123.00___          | -123.00$$$           |
| Hi             | Hi________          | Hi$$$$$$$$           |
| Twelve Dollars | Twelve Dol          | Twelve Dol           |
+----------------+---------------------+----------------------+
```

This example demonstrates right-padding of BINARY values using the RPAD function, with the
results limited to 10 bytes:

```sqlexample
SELECT b,
       RPAD(b, 10, TO_BINARY(HEX_ENCODE('_'))) AS pad_with_underscore,
       RPAD(b, 10, TO_BINARY(HEX_ENCODE('$'))) AS pad_with_dollar_sign
  FROM padding_example
  ORDER BY b;
```

```output
+------------------------------+----------------------+----------------------+
| B                            | PAD_WITH_UNDERSCORE  | PAD_WITH_DOLLAR_SIGN |
|------------------------------+----------------------+----------------------|
| 2D3132332E3030               | 2D3132332E30305F5F5F | 2D3132332E3030242424 |
| 4869                         | 48695F5F5F5F5F5F5F5F | 48692424242424242424 |
| 5477656C766520446F6C6C617273 | 5477656C766520446F6C | 5477656C766520446F6C |
+------------------------------+----------------------+----------------------+
```

This example shows right-padding when multiple characters are used and when
the padding isn’t an even multiple of the length of the multi-character
string used for padding:

```sqlexample
SELECT RPAD('123.50', 19, '*_');
```

```output
+--------------------------+
| RPAD('123.50', 19, '*_') |
|--------------------------|
| 123.50*_*_*_*_*_*_*      |
+--------------------------+
```

The output shows that 19 characters were returned, and the last `*` character doesn’t have
an accompanying `_` character.

---
title: RTRIM
source: https://docs.snowflake.com/en/sql-reference/functions/rtrim.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# RTRIM

Removes trailing characters, including whitespace, from a string.

> **Note:**
>
> To remove characters in a string, you can use the [REPLACE](replace.md) function.

See also:
:   [LTRIM](ltrim.md) , [TRIM](trim.md)

## Syntax

```sqlsyntax
RTRIM(<expr> [, <characters> ])
```

## Arguments

`expr`
:   The string expression to be trimmed.

`characters`
:   One or more characters to remove from the right side of `expr`:

    The default value is `' '` (a single blank space character).
    If no characters are specified, only blank spaces are removed.

## Returns

This function returns a value of VARCHAR data type or NULL. If either argument is NULL, returns NULL.

## Usage notes

* You can specify the characters in `characters` in any order.
* A specification of `' '` in `characters` does not remove other whitespace
  characters (such as tabulation characters, end-of-line characters, and so on). Explicitly
  specify these characters to remove them.

* When `characters` is specified, you must explicitly specify the characters
  to remove whitespace. For example, `' $.'` removes all trailing blank spaces, dollar
  signs, and periods from the input string.

## Collation details

[Collation](../collation.md) is supported when the optional second argument is omitted, or when it
contains only whitespace.

The collation specification of the returned value is the same as the collation specification of the first argument.

## Examples

Remove trailing `0` and `.` characters from a string:

```sqlexample
SELECT RTRIM('$125.00', '0.');
```

```output
+------------------------+
| RTRIM('$125.00', '0.') |
|------------------------|
| $125                   |
+------------------------+
```

The remaining examples use the following table data. Also, the queries enclose the strings
in `>` and `<` characters to help you visualize the whitespace.

```sqlexample
CREATE OR REPLACE TABLE test_rtrim_function(column1 VARCHAR);

INSERT INTO test_rtrim_function VALUES ('Trailing Spaces#  ');
```

Remove trailing whitespace from a string. This example does not specify the second
`characters` argument because the default is blank spaces.

```sqlexample
SELECT CONCAT('>', CONCAT(column1, '<')) AS original_value,
       CONCAT('>', CONCAT(RTRIM(column1), '<')) AS trimmed_value
  FROM test_rtrim_function;
```

```output
+----------------------+--------------------+
| ORIGINAL_VALUE       | TRIMMED_VALUE      |
|----------------------+--------------------|
| >Trailing Spaces#  < | >Trailing Spaces#< |
+----------------------+--------------------+
```

Remove leading whitespace and `#` from a string. This example specifies the second
`characters` argument because it removes other characters in addition to
blank spaces.

```sqlexample
SELECT CONCAT('>', CONCAT(column1, '<')) AS original_value,
       CONCAT('>', CONCAT(RTRIM(column1, '# '), '<')) AS trimmed_value
  FROM test_rtrim_function;
```

```output
+----------------------+-------------------+
| ORIGINAL_VALUE       | TRIMMED_VALUE     |
|----------------------+-------------------|
| >Trailing Spaces#  < | >Trailing Spaces< |
+----------------------+-------------------+
```

---
title: RTRIMMED_LENGTH
source: https://docs.snowflake.com/en/sql-reference/functions/rtrimmed_length.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# RTRIMMED_LENGTH

Returns the length of its argument, minus trailing whitespace, but including leading whitespace.

## Syntax

```sqlsyntax
RTRIMMED_LENGTH( <string_expr> )
```

## Usage notes

* Equivalent to `{fn LENGTH(str)}` in ODBC.
* Not equivalent to [LENGTH, LEN](length.md) in Snowflake.

## Examples

```sqlexample
SELECT RTRIMMED_LENGTH(' ABCD ');

+---------------------------+
| RTRIMMED_LENGTH(' ABCD ') |
|---------------------------|
|                         5 |
+---------------------------+
```

---
title: SANITIZE_WEBHOOK_CONTENT
source: https://docs.snowflake.com/en/sql-reference/functions/sanitize_webhook_content.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Message Sanitization)

# SANITIZE_WEBHOOK_CONTENT

Removes placeholders (for example, the SNOWFLAKE_WEBHOOK_SECRET placeholder, which specifies a secret) from the body of a
notification message to be sent.

Placeholders like SNOWFLAKE_WEBHOOK_SECRET are used in notification integrations. When you
[create a notification integration](../../user-guide/notifications/webhook-notifications.md), you can use placeholders to indicate where
you want the content inserted into the request. For example, you can use the SNOWFLAKE_WEBHOOK_SECRET placeholder to insert the
secret into the HTTP headers or body of the request.

The [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure replaces these placeholders in
the integration parameters with actual values. The stored procedure also replaces the placeholders if specified directly in the
message string passed to the function. If the placeholder is for a secret, this might unintentionally make the secret available
to others. For example, if this message is sent to a Slack webhook, the message containing the secret might be posted to a Slack
channel.

To avoid this situation, pass the message to SANITIZE_WEBHOOK_CONTENT to remove any placeholders from the message before passing
the message to SYSTEM$SEND_SNOWFLAKE_NOTIFICATION.

See also:
:   [Sending webhook notifications](../../user-guide/notifications/webhook-notifications.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.SANITIZE_WEBHOOK_CONTENT( <message> )
```

## Arguments

`message`
:   A VARCHAR value containing the message to sanitize.

## Returns

Returns a VARCHAR value with placeholders replaced with the string `REDACTED`.

## Examples

See [Sending a notification to a webhook](../../user-guide/notifications/webhook-notifications.md).

---
title: SCHEDULED_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/scheduled_time.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md) (Alerts)

# SCHEDULED_TIME

Returns the timestamp representing the scheduled time of the current alert. Refer to [Specifying timestamps based on alert schedules](../../user-guide/alerts.md).

## Syntax

```sqlsyntax
SNOWFLAKE.ALERT.SCHEDULED_TIME()
```

## Arguments

None.

## Returns

TIMESTAMP_LTZ value that represents the scheduled time of the current alert.

## Usage notes

* This function is defined in the ALERT schema of the SNOWFLAKE database.

  To call this function, you must use a role that is granted the
  [SNOWFLAKE database role](../snowflake-db-roles.md) ALERT_VIEWER. For example, to call the function as a user
  with the role alert_role, execute:

  ```sqlexample
  GRANT DATABASE ROLE snowflake.alert_viewer TO ROLE alert_role;
  ```
* This function can only be called from within an [alert](../../user-guide/alerts.md).

## Examples

Refer to [Specifying timestamps based on alert schedules](../../user-guide/alerts.md).

---
title: SEARCH_IP
source: https://docs.snowflake.com/en/sql-reference/functions/search_ip.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Full-Text Search)

# SEARCH_IP

Searches for valid IPv4 and IPv6 addresses in specified character-string columns from one or more tables, including fields
in VARIANT, OBJECT, and ARRAY columns. The search is based on a single IP address or a range of IP addresses
that you specify. If an IP address in the column or field matches a specified IP address or is in a specified range,
then the function returns TRUE.

For more information about using this function, see [Using full-text search](../../user-guide/querying-with-search-functions.md).

## Syntax

```sqlsyntax
SEARCH_IP( <search_data>, '<search_string>' )
```

## Arguments

`search_data`
:   The data you want to search, expressed as a comma-delimited list of string literals, column names, or
    [paths](../../user-guide/querying-semistructured.md) to fields in VARIANT columns. The search data can
    also be a single literal string, which can be useful when you are testing the function.

    You can specify the wildcard character (`*`), where `*` expands to all qualifying columns in all of the
    tables that are in scope for the function. Qualifying columns are those that have VARCHAR (text), VARIANT,
    ARRAY, and OBJECT data types. VARIANT, ARRAY, and OBJECT data is converted to text for searching. You can
    also use the ILIKE and EXCLUDE keywords for filtering.

    For more information about this argument, see the `search_data` description for the
    [SEARCH](search.md) function.

`'search_string'`
:   A VARCHAR string that contains one of the following addresses:

    * A complete and valid IP address in standard IPv4 or IPv6 format, such as `192.0.2.1` or
      `2001:0db8:85a3:0000:0000:8a2e:0370:7334`.
    * A valid IP address in standard IPv4 or IPv6 format with a Classless Inter-Domain Routing (CIDR) range,
      such as `192.0.2.1/24` or `2001:db8:85a3::/64`.
    * A valid IP address in standard IPv4 or IPv6 format with leading zeros, such as `192.000.002.001`
      instead of `192.0.2.1` or `2001:0db8:85a3:0333:4444:8a2e:0370:7334` instead of
      `2001:db8:85a3:333:4444:8a2e:370:7334`. The function accepts up to three digits for each part of an IPv4
      address, and up to four digits for each part of an IPv6 address.
    * A valid compressed IPv6 address, such as `2001:db8:85a3:0:0:0:0:0` or `2001:db8:85a3::` instead of
      `2001:db8:85a3:0000:0000:0000:0000:0000`.
    * An IPv6 dual address that combines an IPv6 and an IPv4 address, such as `2001:db8:85a3::192.0.2.1`.

    This argument must be a literal string. Specify one pair of single quotes around the string.

    The following types of arguments aren’t supported:

    * Column names
    * Empty strings
    * More than one IP address
    * Partial IPv4 and IPv6 addresses

## Returns

Returns a BOOLEAN:

* Returns TRUE if a valid IP address is specified in `search_string` and a matching IP address is found in
  `search_data`.
* Returns TRUE if a valid IP address with a CIDR range is specified in `search_string` and an IP address in
  the specified range is found in `search_data`.
* Returns NULL if either of these arguments is NULL.
* Otherwise, returns FALSE.

## Usage notes

* The SEARCH_IP function operates only on VARCHAR (text), VARIANT, ARRAY, and OBJECT data. The function returns an error if the
  `search_data` argument doesn’t contain data of these data types. When the `search_data` argument includes
  data of both supported data types and unsupported data types, the function searches the data of the supported data
  types and silently ignores the data of the unsupported data types. For examples, see Examples of expected error cases.
* The function returns an error if the `search_string` argument isn’t a valid IP address. For examples, see
  Examples of expected error cases.
* You can add a FULL_TEXT search optimization on columns that are the target of SEARCH_IP function calls by using an ALTER TABLE command
  that specifies the ENTITY_ANALYZER. For example:

  ```sqlexample
  ALTER TABLE ipt ADD SEARCH OPTIMIZATION ON FULL_TEXT(
    ipv4_source,
    ANALYZER => 'ENTITY_ANALYZER');
  ```

  The ENTITY_ANALYZER recognizes only the entities (for example, IP addresses). Therefore, the search access path is typically
  much smaller than FULL_TEXT search optimization with a different analyzer.

  For more information, see [enable FULL_TEXT search optimization](../../user-guide/search-optimization/enabling.md).

## Examples

The following examples use the SEARCH_IP function:

* Search for matching IP addresses in VARCHAR columns
* Search for matching IP addresses in a VARIANT column
* Search for matching IP addresses in long strings of text
* Examples of expected error cases

### Search for matching IP addresses in VARCHAR columns

The following examples show how to use the SEARCH_IP function to query VARCHAR (text) columns.

First, create a table named `ipt` that has two columns that store IPv4 addresses and one column that
stores IPv6 addresses:

```sqlexample
CREATE OR REPLACE TABLE ipt(
  id INT,
  ipv4_source VARCHAR(20),
  ipv4_target VARCHAR(20),
  ipv6_target VARCHAR(40));
```

Insert two rows into the table:

```sqlexample
INSERT INTO ipt VALUES(
  1,
  '192.0.2.146',
  '203.0.113.5',
  '2001:0db8:85a3:0000:0000:8a2e:0370:7334');

INSERT INTO ipt VALUES(
  2,
  '192.0.2.111',
  '192.000.002.146',
  '2001:db8:1234::5678');
```

Query the table:

```sqlexample
SELECT * FROM ipt;
```

```output
+----+-------------+-----------------+-----------------------------------------+
| ID | IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET                             |
|----+-------------+-----------------+-----------------------------------------|
|  1 | 192.0.2.146 | 203.0.113.5     | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 |
|  2 | 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678                     |
+----+-------------+-----------------+-----------------------------------------+
```

The following sections run queries that use the SEARCH_IP function on this table data:

* Search for matching IP addresses by using the function in a SELECT list
* Search for matching IP addresses by using the function in the WHERE clause
* Enable FULL_TEXT search optimization on VARCHAR columns

#### Search for matching IP addresses by using the function in a SELECT list

Run a query that uses the SEARCH_IP function in the SELECT list and searches
the three VARCHAR columns in the table:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target,
       SEARCH_IP((ipv4_source, ipv4_target, ipv6_target), '192.0.2.146') AS "Match found?"
  FROM ipt
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+-----------------------------------------+--------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET                             | Match found? |
|-------------+-----------------+-----------------------------------------+--------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678                     | True         |
| 192.0.2.146 | 203.0.113.5     | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 | True         |
+-------------+-----------------+-----------------------------------------+--------------+
```

Notice that `search_data` `192.000.002.146` is a match for `search_string`
`192.0.2.146`, even though `192.000.002.146` has leading zeros.

Run a query that searches for IPv6 addresses that match `2001:0db8:85a3:0000:0000:8a2e:0370:7334`:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target,
       SEARCH_IP((ipv6_target), '2001:0db8:85a3:0000:0000:8a2e:0370:7334') AS "Match found?"
  FROM ipt
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+-----------------------------------------+--------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET                             | Match found? |
|-------------+-----------------+-----------------------------------------+--------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678                     | False        |
| 192.0.2.146 | 203.0.113.5     | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 | True         |
+-------------+-----------------+-----------------------------------------+--------------+
```

The following query is the same as the previous query, but it excludes the leading zeros and zero segments
in the `search_string`:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target,
       SEARCH_IP((ipv6_target), '2001:db8:85a3::8a2e:370:7334') AS "Match found?"
  FROM ipt
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+-----------------------------------------+--------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET                             | Match found? |
|-------------+-----------------+-----------------------------------------+--------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678                     | False        |
| 192.0.2.146 | 203.0.113.5     | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 | True         |
+-------------+-----------------+-----------------------------------------+--------------+
```

The following query shows that a `search_string` with a CIDR range for IPv4 addresses:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       SEARCH_IP((ipv4_source, ipv4_target), '192.0.2.1/20') AS "Match found?"
  FROM ipt
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+--------------+
| IPV4_SOURCE | IPV4_TARGET     | Match found? |
|-------------+-----------------+--------------|
| 192.0.2.111 | 192.000.002.146 | True         |
| 192.0.2.146 | 203.0.113.5     | True         |
+-------------+-----------------+--------------+
```

The following query shows that a `search_string` with leading zeros returns `True` for
IPv4 addresses that omit the leading zeros:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       SEARCH_IP((ipv4_source, ipv4_target), '203.000.113.005') AS "Match found?"
  FROM ipt
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+--------------+
| IPV4_SOURCE | IPV4_TARGET     | Match found? |
|-------------+-----------------+--------------|
| 192.0.2.111 | 192.000.002.146 | False        |
| 192.0.2.146 | 203.0.113.5     | True         |
+-------------+-----------------+--------------+
```

#### Search for matching IP addresses by using the function in the WHERE clause

The following query uses the function in the WHERE clause and searches the `ipv4_target` column only.

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target
  FROM ipt
  WHERE SEARCH_IP(ipv4_target, '203.0.113.5')
  ORDER BY ipv4_source;
```

```output
+-------------+-------------+-----------------------------------------+
| IPV4_SOURCE | IPV4_TARGET | IPV6_TARGET                             |
|-------------+-------------+-----------------------------------------|
| 192.0.2.146 | 203.0.113.5 | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 |
+-------------+-------------+-----------------------------------------+
```

When the function is used in the WHERE clause and there is no match, no values are returned:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target
  FROM ipt
  WHERE SEARCH_IP(ipv4_target, '203.0.113.1')
  ORDER BY ipv4_source;
```

```output
+-------------+-------------+-------------+
| IPV4_SOURCE | IPV4_TARGET | IPV6_TARGET |
|-------------+-------------+-------------|
+-------------+-------------+-------------+
```

The following query uses the function in the WHERE clause and searches the `ipv6_target` column only.

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target
  FROM ipt
  WHERE SEARCH_IP(ipv6_target, '2001:db8:1234::5678')
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+---------------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET         |
|-------------+-----------------+---------------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678 |
+-------------+-----------------+---------------------+
```

You can use the `*` character (or `table.*`) as the first argument to the SEARCH function, as shown in the following example.
The search operates on all of the qualifying columns in the table that you are selecting from:

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target
  FROM ipt
  WHERE SEARCH_IP((*), '192.0.2.146')
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+-----------------------------------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET                             |
|-------------+-----------------+-----------------------------------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678                     |
| 192.0.2.146 | 203.0.113.5     | 2001:0db8:85a3:0000:0000:8a2e:0370:7334 |
+-------------+-----------------+-----------------------------------------+
```

You can also use the ILIKE and EXCLUDE keywords for filtering. For more information about
these keywords, see [SELECT](../sql/select.md).

The following search uses the ILIKE keyword to search only in columns that end with the string `_target`.

```sqlexample
SELECT ipv4_source,
       ipv4_target,
       ipv6_target
  FROM ipt
  WHERE SEARCH_IP(* ILIKE '%_target', '192.0.2.146')
  ORDER BY ipv4_source;
```

```output
+-------------+-----------------+---------------------+
| IPV4_SOURCE | IPV4_TARGET     | IPV6_TARGET         |
|-------------+-----------------+---------------------|
| 192.0.2.111 | 192.000.002.146 | 2001:db8:1234::5678 |
+-------------+-----------------+---------------------+
```

#### Enable FULL_TEXT search optimization on VARCHAR columns

To [enable FULL_TEXT search optimization](../../user-guide/search-optimization/enabling.md) for the columns in the
`ipt` table, run the following ALTER TABLE command:

```sqlexample
ALTER TABLE ipt ADD SEARCH OPTIMIZATION ON FULL_TEXT(
  ipv4_source,
  ipv4_target,
  ipv6_target,
  ANALYZER => 'ENTITY_ANALYZER');
```

> **Note:**
>
> The columns you specify must be VARCHAR or VARIANT columns. Columns with other data types aren’t supported.

### Search for matching IP addresses in a VARIANT column

The following examples show how to use the SEARCH_IP function to query VARIANT columns.

The following example uses the SEARCH_IP function to search a path to a field in a VARIANT column. Create a table
named `iptv` and insert two rows:

```sqlexample
CREATE OR REPLACE TABLE iptv(ip1 VARIANT);
INSERT INTO iptv(ip1)
  SELECT PARSE_JSON(' { "ipv1": "203.0.113.5", "ipv2": "203.0.113.5" } ');
INSERT INTO iptv(ip1)
  SELECT PARSE_JSON(' { "ipv1": "192.0.2.146", "ipv2": "203.0.113.5" } ');
```

Run the following search queries. The first query searches the `ipv1` field only. The
second searches `ipv1` and `ipv2`.

```sqlexample
SELECT * FROM iptv
  WHERE SEARCH_IP((ip1:"ipv1"), '203.0.113.5');
```

```output
+--------------------------+
| IP1                      |
|--------------------------|
| {                        |
|   "ipv1": "203.0.113.5", |
|   "ipv2": "203.0.113.5"  |
| }                        |
+--------------------------+
```

```sqlexample
SELECT * FROM iptv
  WHERE SEARCH_IP((ip1:"ipv1",ip1:"ipv2"), '203.0.113.5');
```

```output
+--------------------------+
| IP1                      |
|--------------------------|
| {                        |
|   "ipv1": "203.0.113.5", |
|   "ipv2": "203.0.113.5"  |
| }                        |
| {                        |
|   "ipv1": "192.0.2.146", |
|   "ipv2": "203.0.113.5"  |
| }                        |
+--------------------------+
```

To [enable FULL_TEXT search optimization](../../user-guide/search-optimization/enabling.md) for this `ip1` VARIANT
column and its fields, run the following ALTER TABLE command:

```sqlexample
ALTER TABLE iptv ADD SEARCH OPTIMIZATION ON FULL_TEXT(
  ip1:"ipv1",
  ip1:"ipv2",
  ANALYZER => 'ENTITY_ANALYZER');
```

> **Note:**
>
> The columns you specify must be VARCHAR or VARIANT columns. Columns with other data types aren’t supported.

### Search for matching IP addresses in long strings of text

Create a table named `ipt_log` and insert rows:

```sqlexample
CREATE OR REPLACE TABLE ipt_log(id INT, ip_request_log VARCHAR(200));
INSERT INTO ipt_log VALUES(1, 'Connection from IP address 192.0.2.146 succeeded.');
INSERT INTO ipt_log VALUES(2, 'Connection from IP address 203.0.113.5 failed.');
INSERT INTO ipt_log VALUES(3, 'Connection from IP address 192.0.2.146 dropped.');
```

Search for log entries in the `ip_request_log` column that include the `192.0.2.146` IP address:

```sqlexample
SELECT * FROM ipt_log
  WHERE SEARCH_IP(ip_request_log, '192.0.2.146')
  ORDER BY id;
```

```output
+----+---------------------------------------------------+
| ID | IP_REQUEST_LOG                                    |
|----+---------------------------------------------------|
|  1 | Connection from IP address 192.0.2.146 succeeded. |
|  3 | Connection from IP address 192.0.2.146 dropped.   |
+----+---------------------------------------------------+
```

### Examples of expected error cases

The following examples show queries that return expected syntax errors.

The following example fails because `5` isn’t a supported data type for the `search_string` argument:

```sqlexample
SELECT SEARCH_IP(ipv4_source, 5) FROM ipt;
```

```output
001045 (22023): SQL compilation error:
argument needs to be a string: '1'
```

The following example fails because the `search_string` argument isn’t a valid IP address.

```sqlexample
SELECT SEARCH_IP(ipv4_source, '1925.0.2.146') FROM ipt;
```

```output
0000937 (22023): SQL compilation error: error line 1 at position 30
invalid argument for function [SEARCH_IP(IPT.IPV4_SOURCE, '1925.0.2.146')] unexpected argument [1925.0.2.146] at position 1,
```

The following example fails because the `search_string` argument is an empty string.

```sqlexample
SELECT SEARCH_IP(ipv4_source, '') FROM ipt;
```

```output
000937 (22023): SQL compilation error: error line 1 at position 30
invalid argument for function [SEARCH_IP(IPT.IPV4_SOURCE, '')] unexpected argument [] at position 1,
```

The following example fails because no columns with supported data types are specified for the `search_data` argument.

```sqlexample
SELECT SEARCH_IP(id, '192.0.2.146') FROM ipt;
```

```output
001173 (22023): SQL compilation error: error line 1 at position 7: Expected non-empty set of columns supporting full-text search.
```

The following example succeeds because a column with a supported data type is specified for the `search_data`
argument. The function ignores the `id` column because it isn’t a supported data type:

```sqlexample
SELECT SEARCH_IP((id, ipv4_source), '192.0.2.146') FROM ipt;
```

```output
+---------------------------------------------+
| SEARCH_IP((ID, IPV4_SOURCE), '192.0.2.146') |
|---------------------------------------------|
| True                                        |
| False                                       |
+---------------------------------------------+
```

---
title: SEARCH_OPTIMIZATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/search_optimization_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# SEARCH_OPTIMIZATION_HISTORY

This table function is used for querying the [search optimization service](../../user-guide/search-optimization-service.md) maintenance history for a specified table within a specified date range. The information returned by the function includes the table name and credits consumed each time a search optimization maintenance operation occurred.

## Syntax

```sqlsyntax
SEARCH_OPTIMIZATION_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , TABLE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date/time range for which to display the history.
    For example, if you specify that the start date is 2019-04-03 and the end date is 2019-04-05, then you get data for
    April 3, April 4, and April 5. (The endpoints are included.)

    * If neither a start date nor an end date is specified, the default is the last 12 hours.
    * If an end date is not specified, but a start date is specified, then [CURRENT_DATE](current_date.md)
      at midnight is used as the end of the range.
    * If a start date is not specified, but an end date is specified, then the range starts 12 hours prior to the start
      of `DATE_RANGE_END`.

`TABLE_NAME => string`
:   The table name. If specified, only shows the history for the specified table. The name can include the schema name and the database
    name.

    If a name is not specified, then the results include the data for each table that has search optimization for
    which maintenance occurred within the specified time range.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE
  global privilege.

  > **Note:**
  >
  > A role with the MONITOR USAGE privilege can view per-object credit usage, but not object names. The role must
  > also be granted SELECT on an object in order for the object’s name to be returned by this function. If the role
  > does not have sufficient privileges to see the object name, the object name might be displayed with a substitute
  > name such as “unknown_#”, where “#” represents one or more digits.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be
  fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* The history is displayed in increments of 1 hour.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | TEXT | Number of credits billed for search index maintenance during the START_TIME and END_TIME window. |
| TABLE_NAME | TEXT | Name of the table. |

## Examples

Retrieve the history for a one-hour range for your account:

> ```sqlexample
> select *
>   from table(information_schema.search_optimization_history(
>     date_range_start=>'2019-05-22 19:00:00.000',
>     date_range_end=>'2019-05-22 20:00:00.000'));
> ```
>
> Here is sample output:
>
> ```sqlexample
> +-------------------------------+-------------------------------+--------------+----------------------------------+
> | START_TIME                    | END_TIME                      | CREDITS_USED | TABLE_NAME                       |
> |-------------------------------+-------------------------------+--------------+----------------------------------|
> | 2019-05-22 19:00:00.000 -0700 | 2019-05-22 20:00:00.000 -0700 |  0.223276651 | TEST_DB.TEST_SCHEMA.TEST_TABLE_1 |
> +-------------------------------+-------------------------------+--------------+----------------------------------+
> ```

Retrieve the history for the last 12 hours for your account:

> ```sqlexample
> select *
>   from table(information_schema.search_optimization_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the history for the past week for a specified table:

> ```sqlexample
> select *
>   from table(information_schema.search_optimization_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date,
>     table_name=>'mydb.myschema.my_table')
>     );
> ```

Retrieve the maintenance history for the past week for all tables in your account:

> ```sqlexample
> select *
>   from table(information_schema.search_optimization_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date)
>     );
> ```

---
title: SEARCH_PREVIEW (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/search_preview-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# SEARCH_PREVIEW (SNOWFLAKE.CORTEX)

Given a Cortex Search service name, and a query, returns a response from the specified service.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<service_name>',
    '<query_parameters_object>'
)
```

## Arguments

`service_name`
:   Name of your Cortex Search service. Use the fully qualified name if the service is in a schema different from the current session.

`query_parameters_object`
:   A [STRING](../data-types-text.md) that contains a JSON object that specifies the query parameters for invoking the service.

    | Key | Type | Description | Default |
    | --- | --- | --- | --- |
    | `query` | String | Your search query, to search over the text column in the service. | This is required. |
    | `columns` | Array | A comma-separated list of columns to return for each relevant result in the response. These columns must be included in the source query for the service. | Search column that was specified when the service was created. |
    | `filter` | Object | A filter object for filtering results based on data in the `ATTRIBUTES` columns. For detailed syntax, see Filter syntax. | Empty object |
    | `limit` | Integer | Maximum number of results to return in the response. | 10 |

## Filter syntax

Cortex Search supports filtering on the ATTRIBUTES columns specified in the
[CREATE CORTEX SEARCH SERVICE](../sql/create-cortex-search.md) command.

Cortex Search supports five matching operators:

* [TEXT](../data-types-text.md) or [NUMERIC](../data-types-numeric.md) equality: `@eq`
* [ARRAY](../data-types-semistructured.md) contains: `@contains`
* [NUMERIC](../data-types-numeric.md) or [DATE/TIMESTAMP](../data-types-datetime.md) greater than or equal to: `@gte`
* [NUMERIC](../data-types-numeric.md) or [DATE/TIMESTAMP](../data-types-datetime.md) less than or equal to: `@lte`
* [primary key](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) equality: `@primarykey`

These matching operators can be composed with various logical operators:

* `@and`
* `@or`
* `@not`

The following usage notes apply:

* Matching against `NaN` (‘not a number’) values in the source query are handled as described in [Special values](../data-types-numeric.md).
* [Fixed-point](../data-types-numeric.md) numeric values with more than 19 digits (not including leading zeroes) do not work with `@eq`, `@gte`, or `@lte` and will not be returned by these operators.

  + For example, if there is a large value in the source query, using `@eq` to match that exact value will return no results.
  + These large values could still be returned by the overall filter with the use of `@not` (e.g. while `@eq X` will return no values for some large X, `@not @eq Y` will return it).
* `TIMESTAMP` and `DATE` filters accept values of the form: `YYYY-MM-DD` and, for timezone aware dates: `YYYY-MM-DD+HH:MM`. If the timezone offset is not specified, the date is interpreted in UTC.
* `@primarykey` is only supported for services configured with a [primary key](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). The value of the filter must be a JSON object mapping every primary key column to its corresponding value (or `NULL`).

These operators can be combined into a single filter object.

## Example

* Filtering on rows where string-like column `string_col` is equal to value `value`.

  ```javascript
  { "@eq": { "string_col": "value" } }
  ```
* Filtering to a row with the specified primary key,

  ```javascript
  { "@primarykey": { "region": "us-west-1", "agent_id": "abc123" } }
  ```
* Filtering on rows where ARRAY column `array_col` contains value `value`.

  ```javascript
  { "@contains": { "array_col": "arr_value" } }
  ```
* Filtering on rows where NUMERIC column `numeric_col` is between 10.5 and 12.5 (inclusive):

  ```javascript
  { "@and": [
    { "@gte": { "numeric_col": 10.5 } },
    { "@lte": { "numeric_col": 12.5 } }
  ]}
  ```
* Filtering on rows where TIMESTAMP column `timestamp_col` is between `2024-11-19` and `2024-12-19` (inclusive).

  ```javascript
  { "@and": [
    { "@gte": { "timestamp_col": "2024-11-19" } },
    { "@lte": { "timestamp_col": "2024-12-19" } }
  ]}
  ```
* Composing filters with logical operators:

  ```javascript
  // Rows where the "array_col" column contains "arr_value" and the "string_col" column equals "value":
  {
      "@and": [
        { "@contains": { "array_col": "arr_value" } },
        { "@eq": { "string_col": "value" } }
      ]
  }

  // Rows where the "string_col" column does not equal "value"
  {
    "@not": { "@eq": { "string_col": "value" } }
  }

  // Rows where the "array_col" column contains at least one of "val1", "val2", or "val3"
  {
    "@or": [
        { "@contains": { "array_col": "val1" } },
        { "@contains": { "array_col": "val1" } },
        { "@contains": { "array_col": "val1" } }
    ]
  }
  ```

## Returns

Returns an [OBJECT](../data-types-semistructured.md) that contains the result of your query from your Cortex Search service and a unique
request ID. See example output in Examples.

## Usage notes

* This function is designed for testing and validation, and incurs more latency than using the REST or Python APIs. Use other methods to serve search queries in an end-user application that requires low latency.
* This function only operates on constant arguments. It does not accept table columns as input.
* This function truncates search results if they exceed 300kB. The REST surface allows responses up to 10MB.

## Examples

This example queries a service named `sample_service` with a `test query`.
The example returns five results (at most) and includes the data from the `col1` and `col2` columns.

```sqlexample
SELECT
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW (
      'mydb.mysch.sample_service',
      '{
          "query": "test query",
          "columns": ["col1", "col2"],
          "limit": 3
      }'
  );
```

```output
{
  "results":[
      {"col1":"text", "col2":"text"},
      {"col1":"text", "col2":"text"},
      {"col1":"text", "col2":"text"}
  ],
  "request_id":"a27d1d85-e02c-4730-b320-74bf94f72d0d"
}
```

---
title: SENTIMENT (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/sentiment-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# SENTIMENT (SNOWFLAKE.CORTEX)

Returns an overall sentiment score for the given English-language input text.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.SENTIMENT(<text>)
```

## Arguments

`text`
:   A string containing the text for which a sentiment score should be calculated.

## Returns

A floating-point number from -1 to 1 (inclusive) indicating the model’s level of certainty of any detected sentiment. A
score close to 0 indicates that the function could not determine a clear sentiment in the text; this result can be
considered neutral. A score close to 1 indicates positive sentiment, while a score close to -1 indicates negative
sentiment. The chart below provides guidance on how to interpret the sentiment scores:

| Sentiment | Sentiment Score |
| --- | --- |
| Positive | 0.5 to 1 |
| Neutral | -0.5 to 0.5 |
| Negative | -0.5 to -1 |

The result *does not* indicate the intensity of sentiment, but the polarity (positive, neutral, or negative) and certainty.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Examples

The following example uses SENTIMENT to get the sentiment classification of a food service review, which we can infer
as modestly positive, given the score of 0.54.

```sqlexample
SELECT SNOWFLAKE.CORTEX.SENTIMENT('A tourist\'s delight, in low urban light,
  Recommended gem, a pizza night sight. Swift arrival, a pleasure so right,
  Yet, pockets felt lighter, a slight pricey bite. 💰🍕🚀');
```

Response:

```output
0.5424458
```

In the following example, a table named `reviews` contains a column named `review_content` containing the text of reviews
submitted by users. The query returns a sentiment score for each review.

```sqlexample
SELECT SNOWFLAKE.CORTEX.SENTIMENT(review_content), review_content FROM reviews LIMIT 10;
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: SEQ1 / SEQ2 / SEQ4 / SEQ8
source: https://docs.snowflake.com/en/sql-reference/functions/seq1.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# SEQ1 / SEQ2 / SEQ4 / SEQ8

Returns a sequence of monotonically increasing integers, with wrap-around. Wrap-around occurs after the largest representable integer of the integer width (1, 2, 4, or 8 byte).

> **Important:**
>
> This function uses sequences to produce a unique set of increasing integers, but does not necessarily produce a gap-free sequence. When operating on a large quantity of data, gaps can
> appear in a sequence. If a fully ordered, gap-free sequence is required, consider using the [ROW_NUMBER](row_number.md) window function.
>
> For more details about sequences in Snowflake, see [Using Sequences](../../user-guide/querying-sequences.md).

## Syntax

```sqlsyntax
SEQ1( [0|1] )

SEQ2( [0|1] )

SEQ4( [0|1] )

SEQ8( [0|1] )
```

## Usage notes

* If the optional sign argument is 0, the sequence continues at 0 after wrap-around. If the optional sign argument is 1, the sequence continues at the smallest representable number based
  on the given integer width.
* The default sign argument is 0.

## Examples

These are basic examples of using sequences:

> ```sqlexample
> SELECT seq8() FROM table(generator(rowCount => 5));
>
> +--------+
> | SEQ8() |
> |--------|
> |      0 |
> |      1 |
> |      2 |
> |      3 |
> |      4 |
> +--------+
> ```
>
> ```sqlexample
> SELECT * FROM (SELECT seq2(0), seq1(1) FROM table(generator(rowCount => 132))) ORDER BY seq2(0) LIMIT 7 OFFSET 125;
>
> +---------+---------+
> | SEQ2(0) | SEQ1(1) |
> |---------+---------|
> |     125 |     125 |
> |     126 |     126 |
> |     127 |     127 |
> |     128 |    -128 |
> |     129 |    -127 |
> |     130 |    -126 |
> |     131 |    -125 |
> +---------+---------+
> ```

This example shows how to use ROW_NUMBER to generate a sequence without gaps:

> ```sqlexample
> SELECT ROW_NUMBER() OVER (ORDER BY seq4())
>     FROM TABLE(generator(rowcount => 10));
> +-------------------------------------+
> | ROW_NUMBER() OVER (ORDER BY SEQ4()) |
> |-------------------------------------|
> |                                   1 |
> |                                   2 |
> |                                   3 |
> |                                   4 |
> |                                   5 |
> |                                   6 |
> |                                   7 |
> |                                   8 |
> |                                   9 |
> |                                  10 |
> +-------------------------------------+
> ```

---
title: SERVERLESS_ALERT_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/serverless_alert_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# SERVERLESS_ALERT_HISTORY

This table function is used for querying the [serverless alert](../../user-guide/alerts.md) usage history. The
information returned by the function includes the alert name and credits consumed by the execution of each alert.

See also:
:   [SERVERLESS_ALERT_HISTORY view (ACCOUNT_USAGE)](../account-usage/serverless_alert_history.md)

## Syntax

```sqlsyntax
SERVERLESS_ALERT_HISTORY(
  [ DATE_RANGE_START => <constant_expr> ]
  [ , DATE_RANGE_END => <constant_expr> ]
  [ , ALERT_NAME => '<string>' ] )
```

## Arguments

All of the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   Date/time range of the usage window:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (that is, the
      default is to show the previous 10 minutes of the usage history). For example, if `DATE_RANGE_END`
      is [CURRENT_DATE](current_date.md), then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

`ALERT_NAME => string`
:   The name of the alert for which to retrieve usage history. Only the usage data for the specified alert is returned.

    Note that the alert name must be enclosed in single quotes. Also, if the alert name contains any spaces, mixed-case characters,
    or special characters, the name must be double-quoted within the single quotes (e.g. `'"My Alert"'` vs `'myalert'`).

## Usage notes

* The table function returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR
  USAGE global privilege.

  > **Note:**
  >
  > A role with the MONITOR USAGE privilege can view per-object credit usage, but not object names.
* When you call an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the
  function name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* The history is displayed in increments of 1 hour.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| ALERT_NAME | TEXT | Name of the alert. |
| CREDITS_USED | TEXT | Number of credits billed for serverless alert usage during the START_TIME and END_TIME window. |

## Examples

Retrieve the usage history for a one-hour range for your account:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.SERVERLESS_ALERT_HISTORY(
    DATE_RANGE_START=>'2024-10-08 19:00:00.000 -0700',
    DATE_RANGE_END=>'2024-10-08 20:00:00.000 -0700'));
```

Sample output:

```output
+-------------------------------+-------------------------------+------------+--------------+
| START_TIME                    | END_TIME                      | ALERT_NAME | CREDITS_USED |
|-------------------------------+-------------------------------+------------+--------------|
| 2024-10-08 04:16:22.000 -0700 | 2024-10-08 05:16:22.000 -0700 | A1         |  0.000286714 |
| 2024-10-08 05:16:22.000 -0700 | 2024-10-08 06:16:22.000 -0700 | A1         |  0.007001568 |
+-------------------------------+-------------------------------+------------+--------------+
```

Retrieve the history for the last 12 hours for your account:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.SERVERLESS_ALERT_HISTORY(
    DATE_RANGE_START=>DATEADD(H, -12, CURRENT_TIMESTAMP)));
```

Retrieve the history for the past week for your account:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.SERVERLESS_ALERT_HISTORY(
    DATE_RANGE_START=>DATEADD(D, -7, CURRENT_DATE),
    DATE_RANGE_END=>CURRENT_DATE));
```

Retrieve the usage history for the past week for a specified alert in your account:

```sqlexample
SELECT *
  FROM TABLE(INFORMATION_SCHEMA.SERVERLESS_ALERT_HISTORY(
    DATE_RANGE_START=>DATEADD(D, -7, CURRENT_DATE),
    DATE_RANGE_END=>CURRENT_DATE,
    ALERT_NAME=>'my_database.my_schema.my_alert'));
```

---
title: SERVERLESS_TASK_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/serverless_task_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# SERVERLESS_TASK_HISTORY

This table function is used for querying the [serverless task](../../user-guide/tasks-intro.md) usage history. The information returned
by the function includes the task name and credits consumed by runs of each task.

## Syntax

```sqlsyntax
SERVERLESS_TASK_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , TASK_NAME => '<string>' ] )
```

## Arguments

All of the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   Date/time range of the usage window:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to
      show the previous 10 minutes of the usage history). For example, if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default
      `DATE_RANGE_START` is 11:50 PM on the previous day.

`TASK_NAME => string`
:   The name of the task to retrieve usage history for. Only the usage data for the specified task is returned.

    Note that the task name must be enclosed in single quotes. Also, if the task name contains any spaces, mixed-case characters, or special
    characters, the name must be double-quoted within the single quotes (e.g. `'"My Task"'` vs `'mytask'`).

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.

  > **Note:**
  >
  > A role with the MONITOR USAGE privilege can view per-object credit usage, but not object names.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be
  fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).
* The history is displayed in increments of 1 hour.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| TASK_NAME | TEXT | Name of the task. |
| CREDITS_USED | TEXT | Number of credits billed for serverless task usage during the START_TIME and END_TIME window. |

## Examples

Retrieve the usage history for a one-hour range for your account:

> ```sqlexample
> select *
>   from table(information_schema.serverless_task_history(
>     date_range_start=>'2021-10-08 19:00:00.000',
>     date_range_end=>'2021-10-08 20:00:00.000'));
> ```
>
> Sample output:
>
> ```sqlexample
> +-------------------------------+-------------------------------+-----------+--------------+
> | START_TIME                    | END_TIME                      | TASK_NAME | CREDITS_USED |
> |-------------------------------+-------------------------------+-----------+--------------|
> | 2021-10-08 04:16:22.000 -0700 | 2021-10-08 05:16:22.000 -0700 | T1        |  0.000286714 |
> | 2021-10-08 05:16:22.000 -0700 | 2021-10-08 06:16:22.000 -0700 | T1        |  0.007001568 |
> +-------------------------------+-------------------------------+-----------+--------------+
> ```

Retrieve the history for the last 12 hours for your account:

> ```sqlexample
> select *
>   from table(information_schema.serverless_task_history(
>     date_range_start=>dateadd(H, -12, current_timestamp)));
> ```

Retrieve the history for the past week for your account:

> ```sqlexample
> select *
>   from table(information_schema.serverless_task_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date));
> ```

Retrieve the usage history for the past week for a specified task in your account:

> ```sqlexample
> select *
>   from table(information_schema.serverless_task_history(
>     date_range_start=>dateadd(D, -7, current_date),
>     date_range_end=>current_date,
>     task_name=>'mydb.myschema.mytask'));
> ```

---
title: SET_SYS_CONTEXT
source: https://docs.snowflake.com/en/sql-reference/functions/set_sys_context.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (Session)

# SET_SYS_CONTEXT

Sets a value for a specified key in a specified namespace that can be retrieved later using
[SYS_CONTEXT](sys_context.md).

This function has two modes of operation:

* **Immutable session attributes** (`SNOWFLAKE$SESSION_ATTRIBUTES` namespace): Sets custom
  session attributes that are immutable once set and persist for the duration of the session.
  Useful for tracking metadata about a session, such as application context, user attributes, or
  audit information.
* **Session variables** (other namespaces): Behaves like the [SET](../sql/set.md)
  command, setting session variables that can be updated. Returns the previous value of the
  variable.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)](sys_context_snowflake_session_attributes.md) ,
    [SET](../sql/set.md)

## Syntax

```sqlsyntax
CALL SET_SYS_CONTEXT( '<namespace>', '<key>', '<value>' )
```

## Arguments

`'namespace'`
:   The namespace in which to store the key-value pair. Supported namespaces:

    * `SNOWFLAKE$SESSION_ATTRIBUTES` — Stores immutable custom session attributes. Attribute
      names are **case-sensitive**.
    * Any other string (or NULL) — Treats the namespace as a prefix for a session variable name,
      similar to the [SET](../sql/set.md) command. Namespaces are
      **case-sensitive**.

`'key'`
:   The name of the attribute or variable to set. All key names are **case-sensitive**.

`'value'`
:   The value to assign. The value must be a string or an expression that evaluates to a string.

## Returns

The function returns a VARCHAR value:

* For the `SNOWFLAKE$SESSION_ATTRIBUTES` namespace: Always returns NULL (because immutable
  attributes cannot have a previous value). If the attribute has already been set in the current
  session, the function raises an error instead.
* For other namespaces: Returns the previous value of the session variable, or NULL if the
  variable did not previously exist. This matches the behavior of the
  [SET](../sql/set.md) command.

## Access control requirements

No special privileges are required to set custom session attributes. Any user can set attributes
in their own session.

## Usage notes

**For SNOWFLAKE$SESSION_ATTRIBUTES namespace (immutable attributes):**

* Attributes are **immutable**. Once an attribute is set, any attempt to set it again (even to
  the same value) will result in an error.
* Attribute names are **case-sensitive**. `app_context`, `App_Context`, and `APP_CONTEXT`
  are treated as three different attributes.
* Attributes are **session-scoped**. They persist for the duration of the session and are not
  visible to other sessions.
* To retrieve attribute values, use [SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)](sys_context_snowflake_session_attributes.md):
  `SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', '<key>')`.

**For other namespaces (session variables):**

* Variable names are **case-sensitive**. `user_id` and `USER_ID` are treated as different
  variables.
* Variables can be **updated**. Setting a variable that already exists returns the previous value
  and updates it with the new value.
* The namespace (if provided) is used as a prefix: `SET_SYS_CONTEXT('myns', 'mykey', 'val')`
  creates a variable named `myns.mykey`.
* Variables can be retrieved using `SYS_CONTEXT('<namespace>', '<key>')` with the exact case
  used when setting the variable.

**General notes:**

* If you are specifying the function call in a double-quoted string in a shell, escape the `$`
  character with a backslash (`\`) so that `$session_attributes` is not interpreted as a
  shell variable.

## Examples

**Example 1: Immutable session attributes (SNOWFLAKE$SESSION_ATTRIBUTES namespace)**

Set a custom attribute to track the application context:

```sqlexample
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context', 'production');
```

Retrieve the attribute value (note: attribute names are case-sensitive):

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context');
```

```output
+---------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context')   |
|---------------------------------------------------------------|
| production                                                    |
+---------------------------------------------------------------+
```

Once an attribute is set, attempting to change it results in an error:

```sqlexample
-- This will fail because the attribute is immutable
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context', 'development');
```

```output
SQL compilation error: Cannot overwrite context value: app_context
```

Attribute names are case-sensitive:

```sqlexample
-- This succeeds because it's a different attribute name (different case)
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'APP_CONTEXT', 'staging');

SELECT SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context') AS lower_case,
       SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'APP_CONTEXT') AS upper_case;
```

```output
+------------+------------+
| LOWER_CASE | UPPER_CASE |
|------------+------------|
| production | staging    |
+------------+------------+
```

**Example 2: Session variables (other namespaces)**

Set a session variable with a namespace prefix:

```sqlexample
CALL SET_SYS_CONTEXT('myapp', 'user_id', '12345');
```

```output
+---------------------------------------------------+
| SET_SYS_CONTEXT('myapp', 'user_id', '12345')     |
|---------------------------------------------------|
| NULL                                              |
+---------------------------------------------------+
```

The variable is stored with the exact case provided: `myapp.user_id`. Retrieve it:

```sqlexample
SELECT SYS_CONTEXT('myapp', 'user_id');
```

```output
+----------------------------------+
| SYS_CONTEXT('myapp', 'user_id')  |
|----------------------------------|
| 12345                            |
+----------------------------------+
```

Update the variable (returns the previous value):

```sqlexample
CALL SET_SYS_CONTEXT('myapp', 'user_id', '67890');
```

```output
+---------------------------------------------------+
| SET_SYS_CONTEXT('myapp', 'user_id', '67890')     |
|---------------------------------------------------|
| 12345                                             |
+---------------------------------------------------+
```

---
title: SHA1 , SHA1_HEX
source: https://docs.snowflake.com/en/sql-reference/functions/sha1.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Cryptographic Hash)

# SHA1 , SHA1_HEX

Returns a 40-character hex-encoded string containing the 160-bit SHA-1
message digest.

These functions are synonymous.

## Syntax

```sqlsyntax
SHA1(<msg>)

SHA1_HEX(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

The data type of the returned value is VARCHAR.

## Usage notes

* The SHA1 family of functions is provided primarily for backwards compatibility with other systems.
  For more secure encryption, Snowflake recommends using the SHA2 family of functions.

* Do not use this function to encrypt a message that you need to decrypt. This function has no corresponding decryption function.
  (The length of the output is independent of the length of the input. The output does not necessarily have enough bits to hold
  all of the information from the input, so it is not possible to write a function that can decrypt all possible valid inputs.)

  This function is intended for other purposes, such as calculating a checksum to detect data corruption.

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
SELECT sha1('Snowflake');

------------------------------------------+
            SHA1('SNOWFLAKE')             |
------------------------------------------+
 fda76b0bcc1e87cf259b1d1e3271d76f590fb5dd |
------------------------------------------+
```

The data type of the output is string (`VARCHAR`) and can be stored in a
`VARCHAR` column:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE sha_table(
> >     v VARCHAR,
> >     v_as_sha1 VARCHAR,
> >     v_as_sha1_hex VARCHAR,
> >     v_as_sha1_binary BINARY,
> >     v_as_sha2 VARCHAR,
> >     v_as_sha2_hex VARCHAR,
> >     v_as_sha2_binary BINARY
> >     );
> > INSERT INTO sha_table(v) VALUES ('AbCd0');
> > UPDATE sha_table SET
> >     v_as_sha1 = SHA1(v),
> >     v_as_sha1_hex = SHA1_HEX(v),
> >     v_as_sha1_binary = SHA1_BINARY(v),
> >     v_as_sha2 = SHA2(v),
> >     v_as_sha2_hex = SHA2_HEX(v),
> >     v_as_sha2_binary = SHA2_BINARY(v)
> >     ;
> > ```
>
> Here are the query and output:
>
> > ```sqlexample
> > SELECT v, v_as_sha1, v_as_sha1_hex
> >   FROM sha_table
> >   ORDER BY v;
> > +-------+------------------------------------------+------------------------------------------+
> > | V     | V_AS_SHA1                                | V_AS_SHA1_HEX                            |
> > |-------+------------------------------------------+------------------------------------------|
> > | AbCd0 | 9ddb991863d53b35a52c490db256207c776ab8d8 | 9ddb991863d53b35a52c490db256207c776ab8d8 |
> > +-------+------------------------------------------+------------------------------------------+
> > ```

---
title: SHA1_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/sha1_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Cryptographic Hash)

# SHA1_BINARY

Returns a 20-byte binary containing the 160-bit SHA-1 message digest.

## Syntax

```sqlsyntax
SHA1_BINARY(<msg>)
```

## Arguments

`msg`
:   A string expression, the message to be hashed.

## Returns

The data type of the returned value is BINARY.

## Usage notes

* The SHA1 family of functions is provided primarily for backwards compatibility with other systems.
  For more secure encryption, Snowflake recommends using the SHA2 family of functions.

* Do not use this function to encrypt a message that you need to decrypt. This function has no corresponding decryption function.
  (The length of the output is independent of the length of the input. The output does not necessarily have enough bits to hold
  all of the information from the input, so it is not possible to write a function that can decrypt all possible valid inputs.)

  This function is intended for other purposes, such as calculating a checksum to detect data corruption.

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
SELECT sha1_binary('Snowflake');

------------------------------------------+
         SHA1_BINARY('SNOWFLAKE')         |
------------------------------------------+
 FDA76B0BCC1E87CF259B1D1E3271D76F590FB5DD |
------------------------------------------+
```

The data type of the output is `BINARY` and can be stored in a `BINARY`
column:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE sha_table(
> >     v VARCHAR,
> >     v_as_sha1 VARCHAR,
> >     v_as_sha1_hex VARCHAR,
> >     v_as_sha1_binary BINARY,
> >     v_as_sha2 VARCHAR,
> >     v_as_sha2_hex VARCHAR,
> >     v_as_sha2_binary BINARY
> >     );
> > INSERT INTO sha_table(v) VALUES ('AbCd0');
> > UPDATE sha_table SET
> >     v_as_sha1 = SHA1(v),
> >     v_as_sha1_hex = SHA1_HEX(v),
> >     v_as_sha1_binary = SHA1_BINARY(v),
> >     v_as_sha2 = SHA2(v),
> >     v_as_sha2_hex = SHA2_HEX(v),
> >     v_as_sha2_binary = SHA2_BINARY(v)
> >     ;
> > ```
>
> Here are the query and output (note that for display, the output is
> implicitly cast to a user-readable form, which in this case is a string of
> hexadecimal digits):
>
> > ```sqlexample
> > SELECT v, v_as_sha1_binary
> >   FROM sha_table
> >   ORDER BY v;
> > +-------+------------------------------------------+
> > | V     | V_AS_SHA1_BINARY                         |
> > |-------+------------------------------------------|
> > | AbCd0 | 9DDB991863D53B35A52C490DB256207C776AB8D8 |
> > +-------+------------------------------------------+
> > ```

---
title: SHA2 , SHA2_HEX
source: https://docs.snowflake.com/en/sql-reference/functions/sha2.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Cryptographic Hash)

# SHA2 , SHA2_HEX

Returns a hex-encoded string containing the N-bit SHA-2 message digest,
where N is the specified output digest size.

These functions are synonymous.

## Syntax

```sqlsyntax
SHA2( <msg> [, <digest_size>] )

SHA2_HEX( <msg> [, <digest_size>] )
```

## Arguments

**Required:**

`msg`
:   A string expression, the message to be hashed

**Optional:**

`digest_size`
:   Size (in bits) of the output, corresponding to the
    specific SHA-2 function used to encrypt the string:

    > 224 = SHA-224
    >
    > 256 = SHA-256 (Default)
    >
    > 384 = SHA-384
    >
    > 512 = SHA-512

    SHA-512/224 and SHA-512/256 are not supported.

## Returns

The data type of the returned value is VARCHAR.

## Usage notes

* Do not use this function to encrypt a message that you need to decrypt. This function has no corresponding decryption function.
  (The length of the output is independent of the length of the input. The output does not necessarily have enough bits to hold
  all of the information from the input, so it is not possible to write a function that can decrypt all possible valid inputs.)

  This function is intended for other purposes, such as calculating a checksum to detect data corruption.

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
SELECT sha2('Snowflake', 224);

----------------------------------------------------------+
                  SHA2('SNOWFLAKE', 224)                  |
----------------------------------------------------------+
 6267d3d7a59929e6864dd4b737d98e3ef8569d9f88a7466647838532 |
----------------------------------------------------------+
```

The data type of the output is string (`VARCHAR`) and can be stored in a
`VARCHAR` column:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE sha_table(
> >     v VARCHAR,
> >     v_as_sha1 VARCHAR,
> >     v_as_sha1_hex VARCHAR,
> >     v_as_sha1_binary BINARY,
> >     v_as_sha2 VARCHAR,
> >     v_as_sha2_hex VARCHAR,
> >     v_as_sha2_binary BINARY
> >     );
> > INSERT INTO sha_table(v) VALUES ('AbCd0');
> > UPDATE sha_table SET
> >     v_as_sha1 = SHA1(v),
> >     v_as_sha1_hex = SHA1_HEX(v),
> >     v_as_sha1_binary = SHA1_BINARY(v),
> >     v_as_sha2 = SHA2(v),
> >     v_as_sha2_hex = SHA2_HEX(v),
> >     v_as_sha2_binary = SHA2_BINARY(v)
> >     ;
> > ```
>
> Here are the query and output:
>
> > ```sqlexample
> > SELECT v, v_as_sha2, v_as_sha2_hex
> >   FROM sha_table
> >   ORDER BY v;
> > +-------+------------------------------------------------------------------+------------------------------------------------------------------+
> > | V     | V_AS_SHA2                                                        | V_AS_SHA2_HEX                                                    |
> > |-------+------------------------------------------------------------------+------------------------------------------------------------------|
> > | AbCd0 | e1d8ba27889d6782008f495473278c4f071995c5549a976e4d4f93863ce93643 | e1d8ba27889d6782008f495473278c4f071995c5549a976e4d4f93863ce93643 |
> > +-------+------------------------------------------------------------------+------------------------------------------------------------------+
> > ```

---
title: SHA2_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/sha2_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Cryptographic Hash)

# SHA2_BINARY

Returns a binary containing the N-bit SHA-2 message digest,
where N is the specified output digest size.

## Syntax

```sqlsyntax
SHA2_BINARY(<msg> [, <digest_size>])
```

## Arguments

**Required:**

`msg`
:   A string expression, the message to be hashed

**Optional:**

`digest_size`
:   Size (in bits) of the output, corresponding to the
    specific SHA-2 function used to encrypt the string:

    > 224 = SHA-224
    >
    > 256 = SHA-256 (Default)
    >
    > 384 = SHA-384
    >
    > 512 = SHA-512

    SHA-512/224 and SHA-512/256 are not supported.

## Returns

The data type of the returned value is BINARY.

## Usage notes

* Do not use this function to encrypt a message that you need to decrypt. This function has no corresponding decryption function.
  (The length of the output is independent of the length of the input. The output does not necessarily have enough bits to hold
  all of the information from the input, so it is not possible to write a function that can decrypt all possible valid inputs.)

  This function is intended for other purposes, such as calculating a checksum to detect data corruption.

  If you need to encrypt and decrypt data, use the following functions:

  + [ENCRYPT](encrypt.md) and [DECRYPT](decrypt.md)
  + [ENCRYPT_RAW](encrypt_raw.md) and [DECRYPT_RAW](decrypt_raw.md)

## Examples

```sqlexample
SELECT sha2_binary('Snowflake', 384);

--------------------------------------------------------------------------------------------------+
                                   SHA2_BINARY('SNOWFLAKE', 384)                                  |
--------------------------------------------------------------------------------------------------+
 736BD8A53845348830B1EE63A8CD3972F031F13B111F66FFDEC2271A7AE709662E503A0CA305BD50DA8D1CED48CD45D9 |
--------------------------------------------------------------------------------------------------+
```

The data type of the output is `BINARY` and can be stored in a
`BINARY` column:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE sha_table(
> >     v VARCHAR,
> >     v_as_sha1 VARCHAR,
> >     v_as_sha1_hex VARCHAR,
> >     v_as_sha1_binary BINARY,
> >     v_as_sha2 VARCHAR,
> >     v_as_sha2_hex VARCHAR,
> >     v_as_sha2_binary BINARY
> >     );
> > INSERT INTO sha_table(v) VALUES ('AbCd0');
> > UPDATE sha_table SET
> >     v_as_sha1 = SHA1(v),
> >     v_as_sha1_hex = SHA1_HEX(v),
> >     v_as_sha1_binary = SHA1_BINARY(v),
> >     v_as_sha2 = SHA2(v),
> >     v_as_sha2_hex = SHA2_HEX(v),
> >     v_as_sha2_binary = SHA2_BINARY(v)
> >     ;
> > ```
>
> Here are the query and output (note that for display, the output is
> implicitly cast to a user-readable form, which in this case is a string of
> hexadecimal digits):
>
> > ```sqlexample
> > SELECT v, v_as_sha2_binary
> >   FROM sha_table
> >   ORDER BY v;
> > +-------+------------------------------------------------------------------+
> > | V     | V_AS_SHA2_BINARY                                                 |
> > |-------+------------------------------------------------------------------|
> > | AbCd0 | E1D8BA27889D6782008F495473278C4F071995C5549A976E4D4F93863CE93643 |
> > +-------+------------------------------------------------------------------+
> > ```

---
title: SHOW_PYTHON_PACKAGES_DEPENDENCIES
source: https://docs.snowflake.com/en/sql-reference/functions/show_python_packages_dependencies.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SHOW_PYTHON_PACKAGES_DEPENDENCIES

Returns a list of the dependencies and their versions for the Python packages that were specified.

> **Note:**
>
> This function only works for Anaconda (Conda) packages. To resolve dependencies for packages from Artifact Repository (PyPI) or to work with packages from both Anaconda and Artifact Repository, use the [SYSTEM$RESOLVE_PYTHON_PACKAGES](system_resolve_python_packages.md) function instead.

For more information, see [Packages policies](../../developer-guide/udf/python/packages-policy.md).

## Syntax

```sqlsyntax
SNOWFLAKE.SNOWPARK.SHOW_PYTHON_PACKAGES_DEPENDENCIES( '<Python_runtime_version>', '<packages_list>' )
```

## Arguments

`Python_runtime_version`
:   String specifying the version of the Python runtime.

`packages_list`
:   ARRAY of strings that specify the list of packages to check.

    You can use an [ARRAY constant](../data-types-semistructured.md) to specify this list.

## Returns

Returns a JSON array that contains the dependencies and their versions.
Each element in the array is a string in the following format: `<package_name>==<version_name>`.

## Access control requirements

You must use the ACCOUNTADMIN role to call this function.

## Examples

The following example returns a list of the dependencies of the `numpy` Python package with the Python 3.10 runtime.

```sqlexample
USE ROLE ACCOUNTADMIN;

select SNOWFLAKE.SNOWPARK.SHOW_PYTHON_PACKAGES_DEPENDENCIES('3.10', ['numpy']);
```

The result is a list of the dependencies and their versions.

```output
['_libgcc_mutex==0.1', '_openmp_mutex==5.1', 'blas==1.0', 'ca-certificates==2023.05.30', 'intel-openmp==2021.4.0',
'ld_impl_linux-64==2.38', 'ld_impl_linux-aarch64==2.38', 'libffi==3.4.4', 'libgcc-ng==11.2.0', 'libgfortran-ng==11.2.0',
'libgfortran5==11.2.0', 'libgomp==11.2.0', 'libopenblas==0.3.21', 'libstdcxx-ng==11.2.0', 'mkl-service==2.4.0',
'mkl==2021.4.0', 'mkl_fft==1.3.1', 'mkl_random==1.2.2', 'ncurses==6.4', 'numpy-base==1.24.3', 'numpy==1.24.3',
'openssl==3.0.10', 'python==3.10', 'readline==8.2', 'six==1.16.0', 'sqlite==3.41.2', 'tk==8.6.12', 'xz==5.4.2', 'zlib==1.2.13']
```

## See also

* [SYSTEM$RESOLVE_PYTHON_PACKAGES](system_resolve_python_packages.md) - Returns dependencies for both Anaconda and Artifact Repository packages (no special privileges required)
* [Packages policies](../../developer-guide/udf/python/packages-policy.md) - Packages policies for Python

---
title: SIGN
source: https://docs.snowflake.com/en/sql-reference/functions/sign.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# SIGN

Returns the sign of its argument:

* -1 if the argument is negative.
* 1 if it is positive.
* 0 if it is 0.

## Syntax

```sqlsyntax
SIGN( <expr> )
```

## Examples

```sqlexample
SELECT SIGN(5), SIGN(-1.35e-10), SIGN(0);

---------+-----------------+---------+
 SIGN(5) | SIGN(-1.35E-10) | SIGN(0) |
---------+-----------------+---------+
 1       | -1              | 0       |
---------+-----------------+---------+
```

---
title: SIN
source: https://docs.snowflake.com/en/sql-reference/functions/sin.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# SIN

Computes the sine of its argument; the argument should be expressed in
radians.

## Syntax

```sqlsyntax
SIN( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The value must be in
    radians, not degrees. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT SIN(0), SIN(PI()/3), SIN(RADIANS(90));
```

```output
+--------+--------------+------------------+
| SIN(0) |  SIN(PI()/3) | SIN(RADIANS(90)) |
|--------+--------------+------------------|
|      0 | 0.8660254038 |                1 |
+--------+--------------+------------------+
```

---
title: SINH
source: https://docs.snowflake.com/en/sql-reference/functions/sinh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# SINH

Computes the hyperbolic sine of its argument.

## Syntax

```sqlsyntax
SINH( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT SINH(1.5);
```

```output
+-------------+
|   SINH(1.5) |
|-------------|
| 2.129279455 |
+-------------+
```

---
title: SKEW
source: https://docs.snowflake.com/en/sql-reference/functions/skew.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General)

# SKEW

Returns the sample skewness of non-NULL records. If all records inside a group are NULL, the function returns NULL.

The following formula is used to compute the sample skewness:

\[(n^2)/((n-1) \* (n-2)) \* (m_3/(k_2)^(1.5))\]

where:

* \(n\) denotes the number of non-null records.
* \(m_3\) denotes the sample third central moment.
* \(k_2\) denotes the symmetric unbiased estimator of the variance.

Intuitively, skew describes how asymmetric the underlying distribution is.

## Syntax

```sqlsyntax
SKEW( <expr> )
```

## Arguments

`expr`
:   This is an expression that evaluates to a numeric data type (INTEGER, FLOAT, DECIMAL, etc.).

## Returns

This function returns a value of type DOUBLE.

## Usage notes

* For inputs with fewer than three records, SKEW returns NULL.

## Examples

Create a table and load the data:

> ```sqlexample
> create or replace table aggr(k int, v decimal(10,2), v2 decimal(10, 2));
>
> insert into aggr values
>     (1, 10, null),
>     (2, 10, null),
>     (2, 20, 22),
>     (2, 25, null),
>     (2, 30, 35);
> ```

Display the data:

> ```sqlexample
> select *
>     from aggr
>     order by k, v;
> +---+-------+-------+
> | K |     V |    V2 |
> |---+-------+-------|
> | 1 | 10.00 |  NULL |
> | 2 | 10.00 |  NULL |
> | 2 | 20.00 | 22.00 |
> | 2 | 25.00 |  NULL |
> | 2 | 30.00 | 35.00 |
> +---+-------+-------+
> ```

Query the data:

> ```sqlexample
> select SKEW(K), SKEW(V), SKEW(V2)
>     from aggr;
> +--------------+---------------+----------+
> |      SKEW(K) |       SKEW(V) | SKEW(V2) |
> |--------------+---------------+----------|
> | -2.236069766 | 0.05240788515 |     NULL |
> +--------------+---------------+----------+
> ```

---
title: SOUNDEX
source: https://docs.snowflake.com/en/sql-reference/functions/soundex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md)

# SOUNDEX

Returns a string that contains a phonetic representation of the input string.

You can use this function to determine whether two strings (e.g. the family names `Levine` and `Lavine`, the words `to`
and `too`, etc.) have similar pronounciations in the English language.

This function uses the [Soundex phonetic algorithm](https://en.wikipedia.org/wiki/Soundex), which is described in [Soundex System](https://www.archives.gov/research/census/soundex). Note, however, that Snowflake
provides no special handling for surname prefixes (e.g. “Van”, “De”, “La”, etc.).

`SOUNDEX('Pfister')` returns `P236`. Because the first two letters (`P` and `f`) are adjacent and share the same
Soundex code number (`1`), the function ignores the Soundex code number for the second letter.

Some database systems (e.g. Teradata) use a variant that retains the Soundex code number for the second letter when the first and
second letters use the same number. For that variant, the string for `Pfister` is `P123` (not `P236`). To use that variant,
call the [SOUNDEX_P123](soundex_p123.md) function instead.

See also:
:   [SOUNDEX_P123](soundex_p123.md)

## Syntax

```sqlsyntax
SOUNDEX( <varchar_expr> )
```

## Arguments

`varchar_expr`
:   The string for which a representation of the pronunciation is returned. The string should use the Latin or Unicode character set.

## Returns

The returned value is a VARCHAR that contains the phonetic representation of the input string. In other words, the return value
is a string (not a sound) that represents the pronunciation (not the spelling) of the input string.

Note the following:

* The returned value starts with a letter that represents the first letter in the string followed by 3 digits (e.g. `s400`,
  `c130`).

  For more information about how the return value is calculated, see the [Soundex phonetic algorithm](https://en.wikipedia.org/wiki/Soundex) (in Wikipedia).
* As mentioned earlier, if you want to use the variant that retains the Soundex code number for the second letter when the first
  and second letters use the same number, call the [SOUNDEX_P123](soundex_p123.md) function instead.

## Usage notes

* Because the function returns only four characters (one letter and three digits), the output is primarily determined by the
  first few syllables of the input, rather than the entire string.

  For example, the following statement compares three strings and returns the same SOUNDEX value for each string because, even
  though they have completely different spellings and meanings, they start with phonetically similar syllables:

  > ```sqlexample
  > SELECT SOUNDEX('I love rock and roll music.'),
  >        SOUNDEX('I love rocks and gemstones.'),
  >        SOUNDEX('I leave a rock wherever I go.');
  > +----------------------------------------+--------------------------+------------------------------------------+
  > | SOUNDEX('I LOVE ROCK AND ROLL MUSIC.') | SOUNDEX('I LOVE ROCKS.') | SOUNDEX('I LEAVE A ROCK WHEREVER I GO.') |
  > |----------------------------------------+--------------------------+------------------------------------------|
  > | I416                                   | I416                     | I416                                     |
  > +----------------------------------------+--------------------------+------------------------------------------+
  > ```

## Examples

The following query returns SOUNDEX values for two names that are spelled differently, but are typically pronounced similarly:

> ```sqlexample
> SELECT SOUNDEX('Marks'), SOUNDEX('Marx');
> +------------------+-----------------+
> | SOUNDEX('MARKS') | SOUNDEX('MARX') |
> |------------------+-----------------|
> | M620             | M620            |
> +------------------+-----------------+
> ```

The following query demonstrates how to use SOUNDEX to find potentially related rows in different tables:

> Create and load the tables:
>
> ```sqlexample
> CREATE TABLE sounding_board (v VARCHAR);
> CREATE TABLE sounding_bored (v VARCHAR);
> INSERT INTO sounding_board (v) VALUES ('Marsha');
> INSERT INTO sounding_bored (v) VALUES ('Marcia');
> ```
>
> Look for related records without SOUNDEX:
>
> ```sqlexample
> SELECT *
>     FROM sounding_board AS board, sounding_bored AS bored
>     WHERE bored.v = board.v;
> +---+---+
> | V | V |
> |---+---|
> +---+---+
> ```
>
> Look for related records using SOUNDEX:
>
> ```sqlexample
> SELECT *
>     FROM sounding_board AS board, sounding_bored AS bored
>     WHERE SOUNDEX(bored.v) = SOUNDEX(board.v);
> +--------+--------+
> | V      | V      |
> |--------+--------|
> | Marsha | Marcia |
> +--------+--------+
> ```

---
title: SOUNDEX_P123
source: https://docs.snowflake.com/en/sql-reference/functions/soundex_p123.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md)

# SOUNDEX_P123

Returns a string that contains a phonetic representation of the input string, and retains the Soundex code number for the second
letter when the first and second letters use the same number.

This function is similar to the [SOUNDEX](soundex.md) function except for cases in which the first and second letters of the input string
use the same Soundex code number. In those cases, the SOUNDEX function ignores the number for the second letter, while the
SOUNDEX_P123 function preserves the number for the second letter. This variant of the Soundex algorithm is used by some database
systems (e.g. Teradata).

For example, for the input string `Pfister`, the first two letters (`P` and `f`) are adjacent and share the same Soundex
code number (`1`).

* `SOUNDEX('Pfister')` ignores the Soundex code number for the second letter (`1`) and returns `P236`.
* `SOUNDEX_P123('Pfister')` preserves the Soundex code number for the second letter and returns `P123`.

See also:
:   [SOUNDEX](soundex.md)

## Syntax

```sqlsyntax
SOUNDEX_P123( <varchar_expr> )
```

## Arguments

`varchar_expr`
:   The string for which a representation of the pronunciation is returned. The string should use the Latin or Unicode character set.

## Returns

The returned value is a VARCHAR that contains the phonetic representation of the input string. In other words, the return value
is a string (not a sound) that represents the pronunciation (not the spelling) of the input string.

As mentioned earlier, if the first and second letters use the same Soundex code, the function retains the Soundex code number for
the second letter.

For additional information, see [Returns](soundex.md) in the documentation for the [SOUNDEX](soundex.md) function.

## Usage notes

See [Usage notes](soundex.md) in the documentation for the [SOUNDEX](soundex.md) function.

## Examples

The following example demonstrates the differences in the return values of the [SOUNDEX](soundex.md) function and the SOUNDEX_P123
function:

> ```sqlexample
> SELECT SOUNDEX('Pfister'),
>        SOUNDEX_P123('Pfister'),
>        SOUNDEX('LLoyd'),
>        SOUNDEX_P123('Lloyd');
> +--------------------+-------------------------+------------------+-----------------------+
> | SOUNDEX('Pfister') | SOUNDEX_P123('Pfister') | SOUNDEX('Lloyd') | SOUNDEX_P123('Lloyd') |
> |--------------------+-------------------------+------------------+-----------------------|
> | P236               | P123                    | L300             | L430                  |
> +--------------------+-------------------------+------------------+-----------------------+
> ```

---
title: SPACE
source: https://docs.snowflake.com/en/sql-reference/functions/space.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# SPACE

Builds a string consisting of the specified number of blank spaces.

## Syntax

```sqlsyntax
SPACE(<n>)
```

## Arguments

`n`
:   The number of blank spaces used to build the string.

## Examples

```sqlexample
SELECT SPACE(3);
```

---
title: SPLIT
source: https://docs.snowflake.com/en/sql-reference/functions/split.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# SPLIT

Splits a given string with a given separator and returns the result in an array of strings.

Contiguous split strings in the source string, or the presence of a split string at the beginning
or end of the source string, results in an empty string in the output. An empty separator string results
in an array containing only the source string. If either parameter is a NULL, a NULL is returned.

You can use functions and constructs that operate on [arrays](../data-types-semistructured.md) on the result,
such as [FLATTEN](flatten.md), [ARRAY_SIZE](array_size.md), and [access by index position](../data-types-semistructured.md).

See also:
:   [SPLIT_PART](split_part.md)

## Syntax

```sqlsyntax
SPLIT(<string>, <separator>)
```

## Arguments

`string`
:   Text to be split into parts.

`separator`
:   Text to split string by.

## Returns

The data type of the returned value is ARRAY.

## Collation details

This function doesn’t support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

The values in the output array don’t include a collation specification and therefore don’t support further
collation operations.

## Examples

Split the localhost IP address `127.0.0.1` into an array consisting of each of the four parts:

```sqlexample
SELECT SPLIT('127.0.0.1', '.');
```

```output
+-------------------------+
| SPLIT('127.0.0.1', '.') |
|-------------------------|
| [                       |
|   "127",                |
|   "0",                  |
|   "0",                  |
|   "1"                   |
| ]                       |
+-------------------------+
```

Access the first element in the returned array by index position:

```sqlexample
SELECT SPLIT('127.0.0.1', '.')[0];
```

```output
+----------------------------+
| SPLIT('127.0.0.1', '.')[0] |
|----------------------------|
| "127"                      |
+----------------------------+
```

Split a string that contains vertical lines as separators, which returns output
that contains empty strings:

```sqlexample
SELECT SPLIT('|a||', '|');
```

```output
+--------------------+
| SPLIT('|A||', '|') |
|--------------------|
| [                  |
|   "",              |
|   "a",             |
|   "",              |
|   ""               |
| ]                  |
+--------------------+
```

Use the result of SPLIT to generate multiple records from a single string using the LATERAL FLATTEN construct.
[FLATTEN](flatten.md) is a table function that takes a VARIANT, OBJECT, or ARRAY column and produces a lateral view
(that is, an inline view that contains correlation referring to other tables that precede it in the FROM clause):

```sqlexample
CREATE TABLE split_test_names(first_name VARCHAR, children VARCHAR);

INSERT INTO split_test_names values
  ('Mark', 'Marky,Mike,Maria'),
  ('John', 'Johnny,Jane');

SELECT * FROM split_test_names;
```

```output
+------------+------------------+
| FIRST_NAME | CHILDREN         |
|------------+------------------|
| Mark       | Marky,Mike,Maria |
| John       | Johnny,Jane      |
+------------+------------------+
```

```sqlexample
SELECT first_name, C.value::STRING AS childname
  FROM split_test_names,
    LATERAL FLATTEN(INPUT=>SPLIT(children, ',')) C;
```

```output
+------------+-----------+
| FIRST_NAME | CHILDNAME |
|------------+-----------|
| Mark       | Marky     |
| Mark       | Mike      |
| Mark       | Maria     |
| John       | Johnny    |
| John       | Jane      |
+------------+-----------+
```

---
title: SPLIT_PART
source: https://docs.snowflake.com/en/sql-reference/functions/split_part.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# SPLIT_PART

Splits a given string at a specified character and returns the requested part.

To return all characters after a specified character, you can use the [POSITION](position.md)
and [SUBSTR](substr.md) functions. For an example, see
[Returning substrings for email, phone, and date strings](substr.md).

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [SPLIT](split.md), [STRTOK](strtok.md)

## Syntax

```sqlsyntax
SPLIT_PART(<string>, <delimiter>, <partNumber>)
```

## Arguments

`string`
:   Text to be split into parts.

`delimiter`
:   Text representing the delimiter to split by. The entire delimiter string is treated as a single delimiter,
    even if it contains multiple characters. This behavior differs from [STRTOK](strtok.md), which treats each
    character in the delimiter as a separate delimiter.

`partNumber`
:   Requested part of the split, which is 1-based so that the first token is token number 1,
    not token number 0.

    If the value is negative, the parts are counted backward from the end of the string.

## Returns

This function returns a value of type VARCHAR.

If any argument is NULL, the function returns NULL.

## Usage notes

* If the `partNumber` is out of range, the returned value is an empty string.
* If the string starts or is terminated with the delimiter, the system
  considers empty space before or after the delimiter, respectively, as a
  valid part of the split result. For an example, see the Examples section below.
  This means SPLIT_PART can return empty strings as parts, unlike [STRTOK](strtok.md), which never returns empty strings.
* If the `partNumber` is 0, it is treated as 1. In other words, it gets the first element of the split.
  To avoid confusion over whether indexes are 1-based or 0-based, Snowflake recommends avoiding the use of 0
  as a synonym for 1.
* If the delimiter is an empty string, then after the split, the returned value is the input string (the
  string isn’t split).

## Collation details

The [collation specifications](../collation.md) of all input arguments must be compatible.

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

The following examples call the SPLIT_PART function:

### Demonstrate the parts returned for different part number values

The following example shows the portions returned by different `partNumber` values:

```sqlexample
SELECT column1 part_number_value, column2 portion
  FROM VALUES
    (0, SPLIT_PART('11.22.33', '.',  0)),
    (1, SPLIT_PART('11.22.33', '.',  1)),
    (2, SPLIT_PART('11.22.33', '.',  2)),
    (3, SPLIT_PART('11.22.33', '.',  3)),
    (4, SPLIT_PART('11.22.33', '.',  4)),
    (-1, SPLIT_PART('11.22.33', '.',  -1)),
    (-2, SPLIT_PART('11.22.33', '.',  -2)),
    (-3, SPLIT_PART('11.22.33', '.',  -3)),
    (-4, SPLIT_PART('11.22.33', '.',  -4));
```

```output
+-------------------+---------+
| PART_NUMBER_VALUE | PORTION |
|-------------------+---------|
|                 0 | 11      |
|                 1 | 11      |
|                 2 | 22      |
|                 3 | 33      |
|                 4 |         |
|                -1 | 33      |
|                -2 | 22      |
|                -3 | 11      |
|                -4 |         |
+-------------------+---------+
```

### Return the first and last part of an IP address

The following example returns the first and last parts of the localhost IP address `127.0.0.1`:

```sqlexample
SELECT SPLIT_PART('127.0.0.1', '.', 1) AS first_part,
       SPLIT_PART('127.0.0.1', '.', -1) AS last_part;
```

```output
+------------+-----------+
| FIRST_PART | LAST_PART |
|------------+-----------|
| 127        | 1         |
+------------+-----------+
```

### Demonstrate the delimiter as the first character

The following example returns the first and second parts of a string of characters that are separated by vertical bars. The
delimiter is the first part of the input string, so the first element after the split is an empty string.

```sqlexample
SELECT SPLIT_PART('|a|b|c|', '|', 1) AS first_part,
       SPLIT_PART('|a|b|c|', '|', 2) AS last_part;
```

```output
+------------+-----------+
| FIRST_PART | LAST_PART |
|------------+-----------|
|            | a         |
+------------+-----------+
```

### Demonstrate a multi-character delimiter

The following example shows a multi-character delimiter:

```sqlexample
SELECT SPLIT_PART('aaa--bbb-BBB--ccc', '--', 2) AS multi_character_delimiter;
```

```output
+---------------------------+
| MULTI_CHARACTER_DELIMITER |
|---------------------------|
| bbb-BBB                   |
+---------------------------+
```

### Demonstrate the delimiter as an empty string

The following example shows that if the delimiter is an empty string, then after the split, there is still only one
string:

```sqlexample
SELECT column1 part_number_value, column2 portion
  FROM VALUES
    (1, SPLIT_PART('user@snowflake.com', '',  1)),
    (-1, SPLIT_PART('user@snowflake.com', '', -1)),
    (2, SPLIT_PART('user@snowflake.com', '',  2)),
    (-2, SPLIT_PART('user@snowflake.com', '', -2));
```

```output
+-------------------+--------------------+
| PART_NUMBER_VALUE | PORTION            |
|-------------------+--------------------|
|                 1 | user@snowflake.com |
|                -1 | user@snowflake.com |
|                 2 |                    |
|                -2 |                    |
+-------------------+--------------------+
```

### Demonstrate differences between STRTOK and SPLIT_PART

This example demonstrates the difference between STRTOK and SPLIT_PART when using repeated delimiters.
STRTOK treats each character in the delimiter string `'|-'` as a separate delimiter, splitting at every
`'|'` and `'-'` character. In contrast, SPLIT_PART treats the entire delimiter string `'|-'`
as a single delimiter, so it only splits where that exact sequence appears:

```sqlexample
SELECT STRTOK('data1||data2|-data3---data4', '|-', 1) AS strtok_1,
       STRTOK('data1||data2|-data3---data4', '|-', 2) AS strtok_2,
       STRTOK('data1||data2|-data3---data4', '|-', 3) AS strtok_3,
       STRTOK('data1||data2|-data3---data4', '|-', 4) AS strtok_4,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 1) AS split_part_1,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 2) AS split_part_2,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 3) AS split_part_3;
```

```output
+----------+----------+----------+----------+-----------------+--------------+--------------+
| STRTOK_1 | STRTOK_2 | STRTOK_3 | STRTOK_4 | SPLIT_PART_1    | SPLIT_PART_2 | SPLIT_PART_3 |
|----------+----------+----------+----------+-----------------+--------------+--------------|
| data1    | data2    | data3    | data4    | data1||data2    | data3---data4|              |
+----------+----------+----------+----------+-----------------+--------------+--------------+
```

---
title: SPLIT_TEXT_MARKDOWN_HEADER (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/split_text_markdown_header-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# SPLIT_TEXT_MARKDOWN_HEADER (SNOWFLAKE.CORTEX)

The SPLIT_TEXT_MARKDOWN_HEADER function splits a Markdown-formatted document into structured text chunks
based on header levels. The function returns an array of objects, where each object contains the text
chunk and the associated headers under which that chunk falls.

This function is useful for preserving document structure when chunking content for embedding,
retrieval-augmented generation (RAG), or search indexing.

The function first segments the input text using the specified Markdown headers, and then recursively
splits each segment using default plain text separators (e.g., `["nn", "n", " ", ""]`) to produce chunks
of the desired size.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.SPLIT_TEXT_MARKDOWN_HEADER (
  '<text_to_split>',
  '<headers_to_split_on>',
  <chunk_size>,
  [ <overlap> ]
)
```

## Arguments

**Required:**

`'text_to_split'`
:   A Markdown-formatted string to be split.

`'headers_to_split_on'`
:   A key-value map in which the keys are Markdown header syntax (e.g., `#`, `##`) and the values are metadata field names (e.g., `header_1`, `header_2`) to label the chunks. For example:

    ```json
    {
      "#": "header_1",
      "##": "header_2"
    }
    ```

    This configuration will split the document on `#` and `##` headers. In the output, `header_1` and `header_2` fields will contain the corresponding header text values.

`chunk_size`
:   An integer specifying the maximum number of characters in each chunk. The value must be greater than zero.

**Optional:**

`overlap`
:   An integer specifying the number of characters to overlap between consecutive chunks. Defaults to 0 if not provided.

    Overlap is useful for maintaining context across chunks, which can improve performance in embedding and retrieval tasks.

## Returns

Returns an array of objects. Each object has the following structure:

* `chunk`: A string containing the extracted text.
* `headers`: A dictionary containing the Markdown header values under which the chunk is nested. Keys match those provided in the `headers_to_split_on` map.

## Examples

### Simple usage

The following example splits a Markdown string on both `#` and `##` headers, produces chunks of up to 12 characters, and applies a 5-character overlap between chunks.

```sqlexample
SELECT SNOWFLAKE.CORTEX.SPLIT_TEXT_MARKDOWN_HEADER(
  '# HEADER 1\nthis is text in header 1\n## HEADER 2\nthis is a subheading',
  OBJECT_CONSTRUCT('#', 'header_1', '##', 'header_2'),
  12,
  5
);
```

```output
[
  {
    "chunk": "this is text",
    "headers": {
      "header_1": "HEADER 1"
    }
  },
  {
    "chunk": "text in",
    "headers": {
      "header_1": "HEADER 1"
    }
  },
  {
    "chunk": "in header 1",
    "headers": {
      "header_1": "HEADER 1"
    }
  },
  {
    "chunk": "this is a",
    "headers": {
      "header_1": "HEADER 1",
      "header_2": "HEADER 2"
    }
  },
  {
    "chunk": "subheading",
    "headers": {
      "header_1": "HEADER 1",
      "header_2": "HEADER 2"
    }
  }
]
```

### Example with Markdown formatting and flattening of results into rows

The following example creates a table `markdown_docs` containing a short Markdown document in each row, then
calls the SPLIT_TEXT_MARKDOWN_HEADER function to segment each document on markdown headers ‘#’ and ‘##’. The function
then splits each segment into chunks of 20 characters each, with an overlap of 5 characters between chunks.

```sqlexample
CREATE OR REPLACE TABLE markdown_docs (doc VARCHAR);

INSERT INTO markdown_docs VALUES
('# Product Overview\nOur system is a high-performance data processing engine.\n\n## Architecture\nIt uses a distributed design optimized for analytics.\n\n## Key Benefits\n- Scalable\n- Cost-efficient\n- Secure'),
('# User Guide\nThis guide describes how to install and use the product.\n\n## Installation\nFollow the steps below to install.\n\n## Usage\nOnce installed, use the CLI or UI for operations.'),
('# FAQ\nHere are answers to commonly asked questions.\n\n## Pricing\nWe offer flexible pricing models.\n\n## Support\nContact our 24/7 support team anytime.');

SELECT
    c.value['chunk']::varchar as chunk,
    c.value['headers']::object as headers,
FROM
    markdown_docs,
    LATERAL FLATTEN(
        SNOWFLAKE.CORTEX.SPLIT_TEXT_MARKDOWN_HEADER(
        doc,
        OBJECT_CONSTRUCT('#', 'header_1', '##', 'header_2'),
        20,
        5
    )
    ) c;
```

---
title: SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/split_text_recursive_character-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)

The SPLIT_TEXT_RECURSIVE_CHARACTER function splits a string into shorter stings, recursively, for preprocessing
text to be used with text embedding or search indexing functions. The function returns an array of text chunks, where the
chunks are derived from the original text based on the input parameters provided.

The splitting algorithm attempts to split text on separators in the order they are provided, either implicitly as defaults based on the
format, or explicitly in the `separators` argument. Splitting is then applied to each chunk that is longer than the specified
`chunk_size`, recursively, until all chunks no longer than the specified `chunk_size`.

For example, if format is set to `'none'`, the algorithm first splits on the “\n\n” sequences, which represent
paragraph breaks in most formats. Within any resulting chunk that is still longer than `chunk_size` characters, the
function splits on the “\n” characters, which represents line breaks. This process repeats until each of the chunks is less
than `chunk_size` characters.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.SPLIT_TEXT_RECURSIVE_CHARACTER (
  '<text_to_split>',
  '<format>',
  <chunk_size>,
  [ <overlap> ],
  [ <separators> ]
)
```

## Arguments

**Required:**

`'text_to_split'`
:   The text to split.

`'format'`
:   The format of your input text, which determines the default separators in the splitting algorithm. Must be one of the following:

    > * `none`: No format-specific separators. Only the separators in the `separators` field are used for splitting.
    > * `markdown`: Separates on headers, code blocks, and tables, in addition to any separators in the separators field.

`chunk_size`
:   An integer specifying the maximum number of characters in each chunk. The value must be greater than zero.

**Optional:**

`overlap`
:   An integer that specifies the number of characters to overlap between consecutive chunks. By default, chunks have no overlap.
    If `overlap` is specified, it must be smaller than the `chunk_size` argument.

    Overlap is useful for ensuring that each chunk has some context about the previous chunk. This can help improve the quality of search
    results or other processing.

`separators`
:   An ordered list of character sequences to use as boundaries when determining where to split the text, in addition to
    any separators dictated by the `format` parameter. The last item in this list should be a general separator, such
    as an empty string (which allows a split to be made between any two characters), so that the algorithm is guaranteed to
    be able to split the text into chunks of the desired size.

    Default: [”\n\n”, “\n”, “ “, “”], meaning a paragraph break, a line break, a space, and between any two characters (the empty string).

## Returns

Returns an array of strings that contains text chunks extracted from the input string.

## Examples

### Simple usage

The following example directly calls the SPLIT_TEXT_RECURSIVE_CHARACTER function with the input text `hello world are you here`.
The function splits the text into chunks of 15 characters each, with an overlap of 10 characters between chunks.

```sqlexample
SELECT SNOWFLAKE.CORTEX.SPLIT_TEXT_RECURSIVE_CHARACTER (
   'hello world are you here',
   'none',
   15,
   10
);
```

```output
['hello world are', 'world are you', 'are you here']
```

### Example with Markdown formatting and flattening of chunks array into rows

The following example creates a table `sample_documents` containing a short Markdown document in each row, then
calls the SPLIT_TEXT_RECURSIVE_CHARACTER function to split each document. The function splits the text into chunks of 25
characters each, with an overlap of 10 characters between chunks.

```sqlexample
-- Create sample markdown data table
CREATE OR REPLACE TABLE sample_documents (
   doc_id INT AUTOINCREMENT, -- Monotonically increasing integer
   document STRING
);

-- Insert sample data
INSERT INTO sample_documents (document)
VALUES
   ('### Heading 1\\nThis is a sample markdown document. It contains a list:\\n- Item 1\\n- Item 2\\n- Item 3\\n'),
   ('## Subheading\\nThis markdown contains a link [example](http://example.com) and some \**bold*\* text.'),
   ('### Heading 2\\nHere is a code snippet:\\n```\\ncode_block_here()\\n```\\nAnd some more regular text.'),
   ('## Another Subheading\\nMarkdown example with _italic_ text and a [second link](http://example.com).'),
   ('### Heading 3\\nText with an ordered list:\\n1. First item\\n2. Second item\\n3. Third item\\nMore text follows here.');

-- split text
SELECT
   doc_id,
   c.value
FROM
   sample_documents,
   LATERAL FLATTEN( input => SNOWFLAKE.CORTEX.SPLIT_TEXT_RECURSIVE_CHARACTER (
      document,
      'markdown',
      25,
      10
   )) c;
```

---
title: SPLIT_TO_TABLE
source: https://docs.snowflake.com/en/sql-reference/functions/split_to_table.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General) , [Table functions](../functions-table.md)

# SPLIT_TO_TABLE

This table function splits a string (based on a specified delimiter) and flattens the results into rows.

See also:
:   [SPLIT](split.md)

## Syntax

```sqlsyntax
SPLIT_TO_TABLE(<string>, <delimiter>)
```

## Arguments

`string`
:   Text to be split.

`delimiter`
:   Text to split the string by.

## Output

This function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| SEQ | NUMBER | A unique sequence number associated with the input record. The sequence is not guaranteed to be gap-free or ordered. in any particular way. |
| INDEX | NUMBER | The one-based index of the element. |
| VALUE | VARCHAR | The value of the element of the flattened array. |

> **Note:**
>
> The query can also access the columns of the original (correlated) table that served as the source of data for this function. If a single row
> from the original table resulted in multiple rows in the flattened view, the values in this input row are replicated to match the number of
> rows produced by this function.

## Examples

Here is a simple example on constant input.

```sqlexample
SELECT table1.value
  FROM TABLE(SPLIT_TO_TABLE('a.b', '.')) AS table1
  ORDER BY table1.value;
```

```output
+-------+
| VALUE |
|-------|
| a     |
| b     |
+-------+
```

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE splittable (v VARCHAR);
INSERT INTO splittable (v) VALUES ('a.b.c'), ('d'), ('');
SELECT * FROM splittable;
```

```output
+-------+
| V     |
|-------|
| a.b.c |
| d     |
|       |
+-------+
```

You can use the [LATERAL](../constructs/join-lateral.md) keyword with the SPLIT_TO_TABLE function
so that the function executes on each row of the `splittable` table as a correlated table:

```sqlexample
SELECT *
  FROM splittable, LATERAL SPLIT_TO_TABLE(splittable.v, '.')
  ORDER BY SEQ, INDEX;
```

```output
+-------+-----+-------+-------+
| V     | SEQ | INDEX | VALUE |
|-------+-----+-------+-------|
| a.b.c |   1 |     1 | a     |
| a.b.c |   1 |     2 | b     |
| a.b.c |   1 |     3 | c     |
| d     |   2 |     1 | d     |
|       |   3 |     1 |       |
+-------+-----+-------+-------+
```

Create another table that contains authors in one column and some of their book titles in another column, separated
by commas:

```sqlexample
CREATE OR REPLACE TABLE authors_books_test (author VARCHAR, titles VARCHAR);
INSERT INTO authors_books_test (author, titles) VALUES
  ('Nathaniel Hawthorne', 'The Scarlet Letter , The House of the Seven Gables,The Blithedale Romance'),
  ('Herman Melville', 'Moby Dick,The Confidence-Man');
SELECT * FROM authors_books_test;
```

```output
+---------------------+---------------------------------------------------------------------------+
| AUTHOR              | TITLES                                                                    |
|---------------------+---------------------------------------------------------------------------|
| Nathaniel Hawthorne | The Scarlet Letter , The House of the Seven Gables,The Blithedale Romance |
| Herman Melville     | Moby Dick,The Confidence-Man                                              |
+---------------------+---------------------------------------------------------------------------+
```

Use the LATERAL keyword and the SPLIT_TO_TABLE function to run a query that returns a separate row for each title.
In addition, use the [TRIM](trim.md) function to remove leading and trailing spaces from the titles. Note that the
SELECT list includes the fixed `value` column that is returned by the function:

```sqlexample
SELECT author, TRIM(value) AS title
  FROM authors_books_test, LATERAL SPLIT_TO_TABLE(titles, ',')
  ORDER BY author;
```

```output
+---------------------+-------------------------------+
| AUTHOR              | TITLE                         |
|---------------------+-------------------------------|
| Herman Melville     | Moby Dick                     |
| Herman Melville     | The Confidence-Man            |
| Nathaniel Hawthorne | The Scarlet Letter            |
| Nathaniel Hawthorne | The House of the Seven Gables |
| Nathaniel Hawthorne | The Blithedale Romance        |
+---------------------+-------------------------------+
```

---
title: SQRT
source: https://docs.snowflake.com/en/sql-reference/functions/sqrt.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# SQRT

Returns the square-root of a non-negative numeric expression.

## Syntax

```sqlsyntax
SQRT(expr)
```

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Examples

```sqlexample
SELECT x, sqrt(x) FROM tab;

--------+-------------+
   x    |   sqrt(x)   |
--------+-------------+
 0      | 0           |
 2      | 1.414213562 |
 10     | 3.16227766  |
 [NULL] | [NULL]      |
--------+-------------+
```

---
title: SQUARE
source: https://docs.snowflake.com/en/sql-reference/functions/square.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Exponent and Root)

# SQUARE

Returns the square of a numeric expression (i.e. a numeric expression multiplied by itself).

## Syntax

```sqlsyntax
SQUARE(expr)
```

## Returns

If the input expression is of type DECFLOAT, the returned type is DECFLOAT. Otherwise, the
returned type is FLOAT.

## Usage notes

* More efficient than the expression x\*x, so square(x) is preferred
  when a floating-point result is acceptable.

## Examples

```sqlexample
SELECT column1, square(column1)
FROM (values (0), (1), (-2), (3.15), (null)) v;

---------+-----------------+
 column1 | square(column1) |
---------+-----------------+
 0       | 0               |
 1       | 1               |
 -2      | 4               |
 3.15    | 9.9225          |
 [NULL]  | [NULL]          |
---------+-----------------+
```

---
title: ST_AREA
source: https://docs.snowflake.com/en/sql-reference/functions/st_area.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_AREA

Returns the area of the Polygon(s) in a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_AREA( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value, which represents the area:

* For GEOGRAPHY input values, the area is in square meters.
* For GEOMETRY input values, the area is computed with the same units used to define the input coordinates.

## Usage notes

* If `geography_expression` is not a Polygon, MultiPolygon, or GeometryCollection containing polygons, ST_AREA returns 0.
* If `geography_expression` is a GeometryCollection, ST_AREA returns the sum of the areas of the polygons in the collection.

## Examples

### GEOGRAPHY examples

This uses the ST_AREA function with GEOGRAPHY objects to calculate the area of Earth’s surface 1 degree on each side with the
bottom of the area on the equator:

> ```sqlexample
> SELECT ST_AREA(TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))')) AS area;
> +------------------+
> |             AREA |
> |------------------|
> | 12364036567.0764 |
> +------------------+
> ```

### GEOMETRY examples

The following example calls the ST_AREA function with GEOMETRY objects that represent a Point, LineString, and Polygon.

> ```sqlexample
> SELECT ST_AREA(g), ST_ASWKT(g)
> FROM (SELECT TO_GEOMETRY(column1) as g
>   from values ('POINT(1 1)'),
>               ('LINESTRING(0 0, 1 1)'),
>               ('POLYGON((0 0, 0 1, 1 1, 1 0, 0 0))'));
> ```
>
> ```none
> +------------+--------------------------------+
> | ST_AREA(G) | ST_ASWKT(G)                    |
> |------------+--------------------------------|
> |          0 | POINT(1 1)                     |
> |          0 | LINESTRING(0 0,1 1)            |
> |          1 | POLYGON((0 0,0 1,1 1,1 0,0 0)) |
> +------------+--------------------------------+
> ```

---
title: ST_ASEWKB
source: https://docs.snowflake.com/en/sql-reference/functions/st_asewkb.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ASEWKB

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
binary representation of that value in
[EWKB (extended well-known binary)](../data-types-geospatial.md) format.

See also:
:   [ST_ASWKB](st_aswkb.md)

## Syntax

```sqlsyntax
ST_ASEWKB( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

A value of type BINARY.

## Usage notes

* For GEOGRAPHY objects, the SRID in the return value is always 4326. See
  the [note on EWKT handling](../data-types-geospatial.md).
* To return the output in WKB format, use [ST_ASWKB](st_aswkb.md) instead.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_ASEWKB function. For the EWKB output, it is assumed that the [BINARY_OUTPUT_FORMAT](../parameters.md)
parameter is set to `HEX` (the default value for the parameter).

> ```sqlexample
> create table geospatial_table (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table values
>     (1, 'POINT(-122.35 37.55)'), (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
> ```
>
> ```sqlexample
> select st_asewkb(g)
>     from geospatial_table
>     order by id;
> +--------------------------------------------------------------------------------------------+
> | ST_ASEWKB(G)                                                                               |
> |--------------------------------------------------------------------------------------------|
> | 0101000020E61000006666666666965EC06666666666C64240                                         |
> | 0102000020E610000002000000CDCCCCCCCC0C5FC00000000000004540713D0AD7A3005EC01F85EB51B8FE4440 |
> +--------------------------------------------------------------------------------------------+
> ```

### GEOMETRY examples

The example below demonstrates how to use the ST_ASEWKB function. The example returns the EWKB representations of two geometries
that have different SRIDs.

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_table (g GEOMETRY);
> INSERT INTO geometry_table VALUES
>   ('SRID=4326;POINT(-122.35 37.55)'),
>   ('SRID=0;LINESTRING(0.75 0.75, -10 20)');
>
> SELECT ST_ASEWKB(g) FROM geometry_table;
> ```
>
> ```none
> +--------------------------------------------------------------------------------------------+
> | ST_ASEWKB(G)                                                                               |
> |--------------------------------------------------------------------------------------------|
> | 0101000020E61000006666666666965EC06666666666C64240                                         |
> | 01020000200000000002000000000000000000E83F000000000000E83F00000000000024C00000000000003440 |
> +--------------------------------------------------------------------------------------------+
> ```

---
title: ST_ASEWKT
source: https://docs.snowflake.com/en/sql-reference/functions/st_asewkt.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ASEWKT

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
text (VARCHAR) representation of that value in [EWKT (extended well-known text)](../data-types-geospatial.md)
format.

See also:
:   [ST_ASWKT](st_aswkt.md)

## Syntax

```sqlsyntax
ST_ASEWKT( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

A VARCHAR.

## Usage notes

* For GEOGRAPHY objects, the SRID in the return value is always 4326. See the
  [note on EWKT handling](../data-types-geospatial.md).
* To return the output in WKT format, use [ST_ASWKT](st_aswkt.md) instead.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_ASEWKT function:

> ```sqlexample
> create table geospatial_table (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table values
>     (1, 'POINT(-122.35 37.55)'), (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
> ```
>
> ```sqlexample
> select st_asewkt(g)
>     from geospatial_table
>     order by id;
> +-----------------------------------------------+
> | ST_ASEWKT(G)                                  |
> |-----------------------------------------------|
> | SRID=4326;POINT(-122.35 37.55)                |
> | SRID=4326;LINESTRING(-124.2 42,-120.01 41.99) |
> +-----------------------------------------------+
> ```

### GEOMETRY examples

The example below demonstrates how to use the ST_ASEWKT function. The example returns the EWKT representations of two geometries
that have different SRIDs.

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_table (g GEOMETRY);
> INSERT INTO geometry_table VALUES
>   ('SRID=4326;POINT(-122.35 37.55)'),
>   ('SRID=0;LINESTRING(0.75 0.75, -10 20)');
>
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';
> SELECT ST_ASEWKT(g) FROM geometry_table;
> ```
>
> ```none
> +-------------------------------------+
> | ST_ASEWKT(G)                        |
> |-------------------------------------|
> | SRID=4326;POINT(-122.35 37.55)      |
> | SRID=0;LINESTRING(0.75 0.75,-10 20) |
> +-------------------------------------+
> ```

---
title: ST_ASGEOJSON
source: https://docs.snowflake.com/en/sql-reference/functions/st_asgeojson.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ASGEOJSON

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
[GeoJSON](../data-types-geospatial.md) representation of that value.

## Syntax

```sqlsyntax
ST_ASGEOJSON( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

An OBJECT in [GeoJSON](../data-types-geospatial.md) format.

## Usage notes

For GEOMETRY objects:

* The returned GEOMETRY object uses the same coordinate system as the input GEOMETRY object.

  Note that the GeoJSON specification requires that geometry be in the WGS84 coordinate system (SRID = 4326). However, the
  ST_ASGEOJSON function does not enforce this.
* The function does not add the SRID or any other CRS information to the output.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_ASGEOJSON function:

> ```sqlexample
> create table geospatial_table (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table values
>     (1, 'POINT(-122.35 37.55)'), (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
> ```
>
> ```sqlexample
> select st_asgeojson(g)
>     from geospatial_table
>     order by id;
> +------------------------+
> | ST_ASGEOJSON(G)        |
> |------------------------|
> | {                      |
> |   "coordinates": [     |
> |     -122.35,           |
> |     37.55              |
> |   ],                   |
> |   "type": "Point"      |
> | }                      |
> | {                      |
> |   "coordinates": [     |
> |     [                  |
> |       -124.2,          |
> |       42               |
> |     ],                 |
> |     [                  |
> |       -120.01,         |
> |       41.99            |
> |     ]                  |
> |   ],                   |
> |   "type": "LineString" |
> | }                      |
> +------------------------+
> ```
>
> Casting the VARIANT output to VARCHAR results in the following:
>
> ```sqlexample
> select st_asgeojson(g)::varchar
>     from geospatial_table
>     order by id;
> +-------------------------------------------------------------------+
> | ST_ASGEOJSON(G)::VARCHAR                                          |
> |-------------------------------------------------------------------|
> | {"coordinates":[-122.35,37.55],"type":"Point"}                    |
> | {"coordinates":[[-124.2,42],[-120.01,41.99]],"type":"LineString"} |
> +-------------------------------------------------------------------+
> ```

### GEOMETRY examples

The following example demonstrates the ST_ASGEOJSON function with a GEOMETRY object as input:

> ```sqlexample
> SELECT ST_ASGEOJSON(TO_GEOMETRY('SRID=4326;LINESTRING(389866 5819003, 390000 5830000)')) AS geojson;
> ```
>
> ```none
> +------------------------+
> | GEOJSON                |
> |------------------------|
> |{                       |
> |  "coordinates": [      |
> |    [                   |
> |      389866,           |
> |      5819003           |
> |    ],                  |
> |    [                   |
> |      390000,           |
> |      5830000           |
> |    ]                   |
> |  ],                    |
> |  "type": "LineString"  |
> |}                       |
> +------------------------+
> ```

---
title: ST_ASWKB , ST_ASBINARY
source: https://docs.snowflake.com/en/sql-reference/functions/st_aswkb.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ASWKB , ST_ASBINARY

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
binary representation of that value in
[WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) format.

See also:
:   [ST_ASEWKB](st_asewkb.md)

## Syntax

Use one of the following:

```sqlsyntax
ST_ASWKB( <geography_or_geometry_expression> )

ST_ASBINARY( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

A value of type BINARY.

## Usage notes

* ST_ASBINARY is an alias for ST_ASWKB.
* To return the output in EWKB format, use [ST_ASEWKB](st_asewkb.md) instead.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_ASWKB function. For the WKB output, it is assumed that the [BINARY_OUTPUT_FORMAT](../parameters.md)
parameter is set to `HEX` (the default value for the parameter).

> ```sqlexample
> create table geospatial_table (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table values
>     (1, 'POINT(-122.35 37.55)'), (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
> ```
>
> ```sqlexample
> select st_aswkb(g)
>     from geospatial_table
>     order by id;
> +------------------------------------------------------------------------------------+
> | ST_ASWKB(G)                                                                        |
> |------------------------------------------------------------------------------------|
> | 01010000006666666666965EC06666666666C64240                                         |
> | 010200000002000000CDCCCCCCCC0C5FC00000000000004540713D0AD7A3005EC01F85EB51B8FE4440 |
> +------------------------------------------------------------------------------------+
> ```

### GEOMETRY examples

The example below demonstrates how to use the ST_ASEWKB function. The example returns the EWKB representations of two geometries.

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_table (g GEOMETRY);
> INSERT INTO geometry_table VALUES
>   ('POINT(-122.35 37.55)'), ('LINESTRING(0.75 0.75, -10 20)');
>
> SELECT ST_ASWKB(g) FROM geometry_table;
> ```
>
> ```none
> +------------------------------------------------------------------------------------+
> | ST_ASWKB(G)                                                                        |
> |------------------------------------------------------------------------------------|
> | 01010000006666666666965EC06666666666C64240                                         |
> | 010200000002000000000000000000E83F000000000000E83F00000000000024C00000000000003440 |
> +------------------------------------------------------------------------------------+
> ```

---
title: ST_ASWKT , ST_ASTEXT
source: https://docs.snowflake.com/en/sql-reference/functions/st_aswkt.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ASWKT , ST_ASTEXT

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
text (VARCHAR) representation of that value in
[WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) format.

See also:
:   [ST_ASEWKT](st_asewkt.md)

## Syntax

Use one of the following:

```sqlsyntax
ST_ASWKT( <geography_or_geometry_expression> )

ST_ASTEXT( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

A VARCHAR.

## Usage notes

* ST_ASTEXT is an alias for ST_ASWKT.
* To return the output in EWKT format, use [ST_ASEWKT](st_asewkt.md) instead.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_ASWKT function:

> ```sqlexample
> create table geospatial_table (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table values
>     (1, 'POINT(-122.35 37.55)'), (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
> ```
>
> ```sqlexample
> select st_astext(g)
>     from geospatial_table
>     order by id;
> +-------------------------------------+
> | ST_ASTEXT(G)                        |
> |-------------------------------------|
> | POINT(-122.35 37.55)                |
> | LINESTRING(-124.2 42,-120.01 41.99) |
> +-------------------------------------+
> ```
>
> ```sqlexample
> select st_aswkt(g)
>     from geospatial_table
>     order by id;
> +-------------------------------------+
> | ST_ASWKT(G)                         |
> |-------------------------------------|
> | POINT(-122.35 37.55)                |
> | LINESTRING(-124.2 42,-120.01 41.99) |
> +-------------------------------------+
> ```

### GEOMETRY examples

The example below demonstrates how to use the ST_ASEWKT function. The example returns the EWKT representations of two geometries.

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_table (g GEOMETRY);
> INSERT INTO geometry_table VALUES
>   ('POINT(-122.35 37.55)'), ('LINESTRING(0.75 0.75, -10 20)');
>
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
> SELECT ST_ASWKT(g) FROM geometry_table;
> ```
>
> ```none
> +------------------------------+
> | ST_ASWKT(G)                  |
> |------------------------------|
> | POINT(-122.35 37.55)         |
> | LINESTRING(0.75 0.75,-10 20) |
> +------------------------------+
> ```

---
title: ST_AZIMUTH
source: https://docs.snowflake.com/en/sql-reference/functions/st_azimuth.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_AZIMUTH

Given a Point that represents the origin (the location of the observer) and a specified Point, returns the azimuth in radians.
Both Points must be either [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md)
objects.

The [azimuth](https://en.wikipedia.org/wiki/Azimuth) is the angle between the two Points when the observer at the origin is facing the north (for GEOGRAPHY objects) or
the Y-axis (for GEOMETRY objects). The angle is positive in the clockwise direction and is:

* 0 for a line segment pointing north.
* π/2 for a line segment pointing east.
* π for a line segment pointing south.
* 3π/2 for a line segment pointing west.

If the two Points are the same location, the function returns NULL.

For GEOGRAPHY objects, on spherical Earth,
[the formula described here](https://en.wikipedia.org/wiki/Azimuth#In_geodesy) is used to determine the azimuth.

> **Caution:**
>
> Systems using an elliptical Earth model use
> [a more complex algorithm for Azimuth](https://en.wikipedia.org/wiki/Azimuth#In_geodesy), which occasionally yields
> significantly different results.

## Syntax

```sqlsyntax
ST_AZIMUTH( <geography_expression_for_origin> , <geography_expression_for_target> )
ST_AZIMUTH( <geometry_expression_for_origin> , <geometry_expression_for_target> )
```

## Arguments

`geography_expression_for_origin`
:   A GEOGRAPHY object that is a Point representing the origin (where the observer is located).

`geography_expression_for_target`
:   A GEOGRAPHY object that is a Point for which you want to calculate the azimuth.

`geometry_expression_for_origin`
:   A GEOMETRY object that is a Point representing the origin (where the observer is located).

`geometry_expression_for_target`
:   A GEOMETRY object that is a Point for which you want to calculate the azimuth.

## Returns

Returns a value of type REAL that is the azimuth in radians.

## Usage notes

* If one of the input geospatial objects is not a Point, the function reports an error.
* Returns NULL if one or both input points are NULL.

## Examples

### GEOGRAPHY examples

The following example returns the azimuth in radians for an origin Point (0, 1) and a target Point (0, 0):

> ```sqlexample
> SELECT ST_AZIMUTH(
>     TO_GEOGRAPHY('POINT(0 1)'),
>     TO_GEOGRAPHY('POINT(0 0)')
> );
> +---------------------------------+
> |                     ST_AZIMUTH( |
> |     TO_GEOGRAPHY('POINT(0 1)'), |
> |      TO_GEOGRAPHY('POINT(0 0)') |
> |                               ) |
> |---------------------------------|
> |                     3.141592654 |
> +---------------------------------+
> ```

The following example returns the azimuth in degrees for an origin Point (0, 1) and a target Point (1, 2):

> ```sqlexample
> SELECT DEGREES(ST_AZIMUTH(
>     TO_GEOGRAPHY('POINT(0 1)'),
>     TO_GEOGRAPHY('POINT(1 2)')
> ));
> +---------------------------------+
> |             DEGREES(ST_AZIMUTH( |
> |     TO_GEOGRAPHY('POINT(0 1)'), |
> |      TO_GEOGRAPHY('POINT(1 2)') |
> |                              )) |
> |---------------------------------|
> |                    44.978182941 |
> +---------------------------------+
> ```

### GEOMETRY examples

The following example returns the azimuth in radians for an origin Point (0, 1) and a target Point (0, 0):

```sqlexample
SELECT ST_AZIMUTH(
    TO_GEOMETRY('POINT(0 1)', TO_GEOMETRY('POINT(0 0)')
);

+------------------------------------------------------------------+
| ST_AZIMUTH(TO_GEOMETRY('POINT(0 1)'), TO_GEOMETRY('POINT(0 0)')) |
|------------------------------------------------------------------|
| 3.141592654                                                      |
+------------------------------------------------------------------+
```

The following example returns the azimuth in degrees for an origin Point (0, 1) and a target Point (0.707, 0.707):

```sqlexample
SELECT ST_AZIMUTH(
    TO_GEOMETRY('POINT(0 0)', TO_GEOMETRY(0.707 0.707')
);

+-------------------------------------------------------------------------+
| ST_AZIMUTH(TO_GEOMETRY('POINT(0 0)'), TO_GEOMETRY('POINT(0.707 0.707')) |
|-------------------------------------------------------------------------|
| 0.7853981634                                                            |
+-------------------------------------------------------------------------+
```

---
title: ST_BUFFER
source: https://docs.snowflake.com/en/sql-reference/functions/st_buffer.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_BUFFER

Returns a [GEOMETRY](../data-types-geospatial.md) object that represents a MultiPolygon containing
the points within a specified distance of the input GEOMETRY object. The returned object effectively
represents a “buffer” around the input object.

You can also “shrink” the input object by specifying a negative value for the distance.

## Syntax

```sqlsyntax
ST_BUFFER( <geometry_expression> , <distance> )
```

## Arguments

`geometry_expression`
:   The argument must be an expression of type GEOMETRY.

`distance`
:   The distance from the GEOMETRY object. To “shrink” the object, you can specify a negative value for the distance.

    The units depend on the [spatial reference system identifier (SRID)](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) of the GEOMETRY object. For example,
    [ESPG:4326](https://epsg.io/4326) units are degrees, while [ESPG:25855](https://epsg.io/25833)
    units are meters.

## Returns

Returns a GEOMETRY object.

## Usage notes

* SRIDs are based on the [EPSG standard](https://epsg.org/home.html) (v10.082). For example, the SRID 4326 corresponds to the authority EPSG with the code
  4326.
* ST_BUFFER uses eight segments to approximate a quarter circle.
* If `distance` is a negative value, the returned object is smaller than the input object. You can use this to
  remove small irregularities from the shape.
* For LineStrings, the endcap and join styles are always round.
* LineStrings are always buffered on both sides.

## Examples

Before executing the examples, set the [GEOMETRY_OUTPUT_FORMAT](../parameters.md) parameter to `WKT`:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
```

The following example returns a Polygon around a Point with a radius of one:

```sqlexample
SELECT ST_BUFFER(TO_GEOMETRY('POINT(0 0)'), 1) AS geom;
```

```output
+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| GEOM                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
|-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| MULTIPOLYGON(((1 0,0.9807852804 -0.195090322,0.9238795325 -0.3826834324,0.8314696123 -0.555570233,0.7071067812 -0.7071067812,0.555570233 -0.8314696123,0.3826834324 -0.9238795325,0.195090322 -0.9807852804,6.123233996e-17 -1,-0.195090322 -0.9807852804,-0.3826834324 -0.9238795325,-0.555570233 -0.8314696123,-0.7071067812 -0.7071067812,-0.8314696123 -0.555570233,-0.9238795325 -0.3826834324,-0.9807852804 -0.195090322,-1 7.657137398e-16,-0.9807852804 0.195090322,-0.9238795325 0.3826834324,-0.8314696123 0.555570233,-0.7071067812 0.7071067812,-0.555570233 0.8314696123,-0.3826834324 0.9238795325,-0.195090322 0.9807852804,2.480838239e-15 1,0.195090322 0.9807852804,0.3826834324 0.9238795325,0.555570233 0.8314696123,0.7071067812 0.7071067812,0.8314696123 0.555570233,0.9238795325 0.3826834324,0.9807852804 0.195090322,1 0))) |
+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

The following example uses a negative value for `distance` to remove small irregularities (such as spikes) from the shape.
The [TO_GEOMETRY](to_geometry.md) call passes in TRUE as the second argument, which allows the function to create a GEOMETRY
object for [an invalid shape](../data-types-geospatial.md).

```sqlexample
SELECT ST_BUFFER(TO_GEOMETRY('SRID=2261;POLYGON((
  1540792.21541900 290472.63529214, 1547018.61770388 302537.02285369,
  1546965.96550151 302752.51514772, 1547018.61770388 302537.02285369,
  1549532.42729914 301257.07398027, 1543327.42218339 289322.60923536,
  1540792.21541900 290472.63529214))', True), -1e-08) AS geom;
```

```output
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| GEOM                                                                                                                                                                                        |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| MULTIPOLYGON(((1543327.42218339 289322.609235373,1540792.21541901 290472.635292145,1547018.61770388 302537.022853677,1549532.42729913 301257.073980266,1543327.42218339 289322.609235373))) |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: ST_CENTROID
source: https://docs.snowflake.com/en/sql-reference/functions/st_centroid.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_CENTROID

Returns the Point representing the geometric center of a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_CENTROID( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a GEOGRAPHY or GEOMETRY object for the Point that represents geometric center of the input object.

## Usage notes

* Returns NULL if the input is NULL.
* If the input object is a GeometryCollection that contains different types of objects (Polygons, LineStrings, and Points),
  ST_CENTROID uses the type with the [highest dimension](st_dimension.md) to determine the geometric
  center. For example:

  + If the GeometryCollection contains Polygons, LineStrings, and Points, ST_CENTROID uses the Polygons and ignores the
    LineStrings and Points in the collection.
  + If the GeometryCollection contains LineStrings and Points, ST_CENTROID uses the LineStrings and ignores the Points in the
    collection.

* For GEOMETRY objects, the returned GEOMETRY object has the same SRID as the input.

## Examples

### GEOGRAPHY examples

The following example returns the Point that represents the geometric center of a LineString.

> ```sqlexample
> SELECT ST_CENTROID(
>     TO_GEOGRAPHY(
>         'LINESTRING(0 0, 0 -2)'
>     )
> ) as center_of_linestring;
> +----------------------+
> | CENTER_OF_LINESTRING |
> |----------------------|
> | POINT(0 -1)          |
> +----------------------+
> ```

The following example returns the Point that represents the geometric center of a Polygon.

> ```sqlexample
> SELECT ST_CENTROID(
>     TO_GEOGRAPHY(
>         'POLYGON((10 10, 10 20, 20 20, 20 10, 10 10))'
>     )
> ) as center_of_polygon;
> +------------------------+
> | CENTER_OF_POLYGON      |
> |------------------------|
> | POINT(15 15.014819855) |
> +------------------------+
> ```

The following example returns the Point that represents the geometric center of a GeometryCollection. This collection contains a
Polygon, LineString, and Point. ST_CENTROID only uses the Polygon (and ignores the LineString and Point) when determining the
geometric center.

> ```sqlexample
> SELECT ST_CENTROID(
>     TO_GEOGRAPHY(
>         'GEOMETRYCOLLECTION(POLYGON((10 10, 10 20, 20 20, 20 10, 10 10)), LINESTRING(0 0, 0 -2), POINT(50 -50))'
>     )
> ) as center_of_collection_with_polygons;
> +------------------------------------+
> | CENTER_OF_COLLECTION_WITH_POLYGONS |
> |------------------------------------|
> | POINT(15 15.014819855)             |
> +------------------------------------+
> ```

### GEOMETRY examples

The following example computes the centroid of a simple rectangular Polygon. Note how the result differs from the result when
using ST_CENTROID with a GEOGRAPHY object

> ```sqlexample
> SELECT ST_CENTROID(TO_GEOMETRY('POLYGON((10 10, 10 20, 20 20, 20 10, 10 10))'));
> ```
>
> ```none
> +--------------------------------------------------------------------------+
> | ST_CENTROID(TO_GEOMETRY('POLYGON((10 10, 10 20, 20 20, 20 10, 10 10))')) |
> |--------------------------------------------------------------------------|
> | POINT(15 15)                                                             |
> +--------------------------------------------------------------------------+
> ```

---
title: ST_COLLECT
source: https://docs.snowflake.com/en/sql-reference/functions/st_collect.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_COLLECT

There are two forms of ST_COLLECT:

* Scalar: This function combines two [GEOGRAPHY](../data-types-geospatial.md) objects into one.
* Aggregate: This function combines all the GEOGRAPHY objects in a column into one GEOGRAPHY object.

## Syntax

```sqlsyntax
Scalar:

    ST_COLLECT( <geography_expression_1> , <geography_expression_2> )

Aggregate:

    ST_COLLECT( <geography_expression_1> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* If g1 and g2 are both Point objects, the result is a MultiPoint object containing the two Points. Similarly,
  if g1 and g2 are both LineString objects, the result is a MultiLineString object. Etc.
* If g1 and g2 are different types of geospatial objects, or if at least one of the input GEOGRAPHY objects is a
  collection (e.g. MultiLineString, GeometryCollection, or FeatureCollection), then the result is a GeometryCollection
  containing both input objects.

## Examples

The queries below show both scalar and aggregate uses of the ST_COLLECT function.

> Create and load the table:
>
> ```sqlexample
> CREATE TABLE geo3 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> INSERT INTO geo3 (g1, g2) VALUES
>     ( 'POINT(-180 -90)', 'POINT(-45 -45)' ),
>     ( 'POINT(   0   0)', 'POINT(-60 -60)' ),
>     ( 'POINT(+180 +90)', 'POINT(+45 +45)' );
> ```
>
> This calls ST_COLLECT as a scalar function to create a MultiPoint value that contains both points in the same row:
>
> ```sqlexample
> -- Scalar function:
> SELECT ST_COLLECT(g1, g2) FROM geo3;
> +------------------------+
> | ST_COLLECT(G1, G2)     |
> |------------------------|
> | {                      |
> |   "coordinates": [     |
> |     [                  |
> |       -180,            |
> |       -90              |
> |     ],                 |
> |     [                  |
> |       -45,             |
> |       -45              |
> |     ]                  |
> |   ],                   |
> |   "type": "MultiPoint" |
> | }                      |
> | {                      |
> |   "coordinates": [     |
> |     [                  |
> |       0,               |
> |       0                |
> |     ],                 |
> |     [                  |
> |       -60,             |
> |       -60              |
> |     ]                  |
> |   ],                   |
> |   "type": "MultiPoint" |
> | }                      |
> | {                      |
> |   "coordinates": [     |
> |     [                  |
> |       180,             |
> |       90               |
> |     ],                 |
> |     [                  |
> |       45,              |
> |       45               |
> |     ]                  |
> |   ],                   |
> |   "type": "MultiPoint" |
> | }                      |
> +------------------------+
> ```
>
> This calls ST_COLLECT as an aggregate function to create a MultiPoint value that contains all the points in the
> same column:
>
> ```sqlexample
> -- Aggregate function:
> SELECT ST_COLLECT(g1), ST_COLLECT(g2) FROM geo3;
> +------------------------+------------------------+
> | ST_COLLECT(G1)         | ST_COLLECT(G2)         |
> |------------------------+------------------------|
> | {                      | {                      |
> |   "coordinates": [     |   "coordinates": [     |
> |     [                  |     [                  |
> |       -180,            |       -45,             |
> |       -90              |       -45              |
> |     ],                 |     ],                 |
> |     [                  |     [                  |
> |       0,               |       -60,             |
> |       0                |       -60              |
> |     ],                 |     ],                 |
> |     [                  |     [                  |
> |       180,             |       45,              |
> |       90               |       45               |
> |     ]                  |     ]                  |
> |   ],                   |   ],                   |
> |   "type": "MultiPoint" |   "type": "MultiPoint" |
> | }                      | }                      |
> +------------------------+------------------------+
> ```
>
> This calls ST_COLLECT first as an aggregate function on each column to create MultiPoint values that contains all
> the points in each column, and then calls ST_COLLECT on those two MultiPoint values to create a GeometryCollection
> that contains all the points in both columns. The resulting GeometryCollection is hierarchical.
>
> ```sqlexample
> -- Aggregate and then Collect:
> SELECT ST_COLLECT(ST_COLLECT(g1), ST_COLLECT(g2)) FROM geo3;
> +--------------------------------------------+
> | ST_COLLECT(ST_COLLECT(G1), ST_COLLECT(G2)) |
> |--------------------------------------------|
> | {                                          |
> |   "geometries": [                          |
> |     {                                      |
> |       "coordinates": [                     |
> |         [                                  |
> |           -180,                            |
> |           -90                              |
> |         ],                                 |
> |         [                                  |
> |           0,                               |
> |           0                                |
> |         ],                                 |
> |         [                                  |
> |           180,                             |
> |           90                               |
> |         ]                                  |
> |       ],                                   |
> |       "type": "MultiPoint"                 |
> |     },                                     |
> |     {                                      |
> |       "coordinates": [                     |
> |         [                                  |
> |           -45,                             |
> |           -45                              |
> |         ],                                 |
> |         [                                  |
> |           -60,                             |
> |           -60                              |
> |         ],                                 |
> |         [                                  |
> |           45,                              |
> |           45                               |
> |         ]                                  |
> |       ],                                   |
> |       "type": "MultiPoint"                 |
> |     }                                      |
> |   ],                                       |
> |   "type": "GeometryCollection"             |
> | }                                          |
> +--------------------------------------------+
> ```

---
title: ST_CONTAINS
source: https://docs.snowflake.com/en/sql-reference/functions/st_contains.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_CONTAINS

Returns TRUE if a [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object is
completely inside another object of the same type.

More strictly, object g1 contains object g2 if and only if no points of g2 lie in the exterior of g1, and at least one
point of the interior of B lies in the interior of A. There are certain subtleties in this definition that are not
immediately obvious. For more details on what “contains” means, see the
[Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

Although ST_COVERS and ST_CONTAINS might seem similar, the two functions have subtle differences. For details on the differences
between “covers” and “contains”, see the
[Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [ST_WITHIN](st_within.md) , [ST_COVERS](st_covers.md) , [ST_COVEREDBY](st_coveredby.md)

## Syntax

```sqlsyntax
ST_CONTAINS( <geography_expression_1> , <geography_expression_2> )

ST_CONTAINS( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geography_expression_2`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_1`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_2`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

## Returns

BOOLEAN.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_CONTAINS function:

> ```sqlexample
> create table geospatial_table_01 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> insert into geospatial_table_01 (g1, g2) values
>     ('POLYGON((0 0, 3 0, 3 3, 0 3, 0 0))', 'POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))');
> ```
>
> ```sqlexample
> SELECT ST_CONTAINS(g1, g2)
>     FROM geospatial_table_01;
> +---------------------+
> | ST_CONTAINS(G1, G2) |
> |---------------------|
> | True                |
> +---------------------+
> ```

### GEOMETRY examples

The query below shows several examples of using ST_CONTAINS. Note how ST_CONTAINS determines that:

* The Polygon contains itself.
* The Polygon does not contain the LineString that is on its border.

  ```sqlexample
  SELECT ST_CONTAINS(poly, poly_inside),
        ST_CONTAINS(poly, poly),
        ST_CONTAINS(poly, line_on_boundary),
        ST_CONTAINS(poly, line_inside)
    FROM (SELECT
      TO_GEOMETRY('POLYGON((-2 0, 0 2, 2 0, -2 0))') AS poly,
      TO_GEOMETRY('POLYGON((-1 0, 0 1, 1 0, -1 0))') AS poly_inside,
      TO_GEOMETRY('LINESTRING(-1 1, 0 2, 1 1)') AS line_on_boundary,
      TO_GEOMETRY('LINESTRING(-2 0, 0 0, 0 1)') AS line_inside);
  ```

  ```none
  +--------------------------------+------------------------+------------------------------------+-------------------------------+
  | ST_CONTAINS(POLY, POLY_INSIDE) | ST_CONTAINS(POLY,POLY) | ST_CONTAINS(POLY,LINE_ON_BOUNDARY) | ST_CONTAINS(POLY,LINE_INSIDE) |
  |--------------------------------+------------------------+------------------------------------+-------------------------------|
  | True                           | True                   | False                              | True                          |
  +--------------------------------+------------------------+------------------------------------+-------------------------------+
  ```

---
title: ST_COVEREDBY
source: https://docs.snowflake.com/en/sql-reference/functions/st_coveredby.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_COVEREDBY

Returns TRUE if no point in one geospatial object is outside another geospatial object. In other words:

* [GEOGRAPHY](../data-types-geospatial.md) object `g1` is outside GEOGRAPHY object `g2`.
* [GEOMETRY](../data-types-geospatial.md) object `g1` is outside GEOMETRY object `g2`.

This is equivalent to `ST_COVERS(g2, g1)`.

Although ST_COVEREDBY and ST_WITHIN might seem similar, the two functions have subtle differences. For details on the differences
between “covered by” and “within”, see the
[Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [ST_COVERS](st_covers.md) , [ST_WITHIN](st_within.md)

## Syntax

```sqlsyntax
ST_COVEREDBY( <geography_expression_1> , <geography_expression_2> )

ST_COVEREDBY( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geography_expression_2`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_1`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_2`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

## Returns

BOOLEAN.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_COVEREDBY function:

> ```sqlexample
> create table geospatial_table_01 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> insert into geospatial_table_01 (g1, g2) values
>     ('POLYGON((0 0, 3 0, 3 3, 0 3, 0 0))', 'POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))');
> ```
>
> ```sqlexample
> SELECT ST_COVEREDBY(g1, g2)
>     FROM geospatial_table_01;
> +----------------------+
> | ST_COVEREDBY(G1, G2) |
> |----------------------|
> | False                |
> +----------------------+
> ```

---
title: ST_COVERS
source: https://docs.snowflake.com/en/sql-reference/functions/st_covers.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_COVERS

Returns TRUE if no point in one geospatial object is outside of another geospatial object. In other words:

* [GEOGRAPHY](../data-types-geospatial.md) object `g2` is outside GEOGRAPHY object `g1`.
* [GEOMETRY](../data-types-geospatial.md) object `g2` is outside GEOMETRY object `g1`.

ST_COVERS is similar to, but subtly different from, ST_CONTAINS. For details on the differences between “covers” and “contains”,
see the [Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

Although ST_COVERS and ST_CONTAINS might seem similar, the two functions have subtle differences. For details on the differences
between “covers” and “contains”, see the
[Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [ST_CONTAINS](st_contains.md) , [ST_COVEREDBY](st_coveredby.md)

## Syntax

```sqlsyntax
ST_COVERS( <geography_expression_1> , <geography_expression_2> )

ST_COVERS( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geography_expression_2`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_1`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_2`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

## Returns

BOOLEAN.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_COVERS function:

> ```sqlexample
> create table geospatial_table_01 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> insert into geospatial_table_01 (g1, g2) values
>     ('POLYGON((0 0, 3 0, 3 3, 0 3, 0 0))', 'POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))');
> ```
>
> ```sqlexample
> SELECT ST_COVERS(g1, g2)
>     FROM geospatial_table_01;
> +-------------------+
> | ST_COVERS(G1, G2) |
> |-------------------|
> | True              |
> +-------------------+
> ```

### GEOMETRY examples

The query below shows several examples of using ST_COVERS. Note how the Polygon covers (but does not
[contain](st_contains.md)) a LineString on its border.

> ```sqlexample
> SELECT ST_COVERS(poly, poly_inside),
>        ST_COVERS(poly, poly),
>        ST_COVERS(poly, line_on_boundary),
>        ST_COVERS(poly, line_inside)
>   FROM (SELECT TO_GEOMETRY('POLYGON((-2 0, 0 2, 2 0, -2 0))') AS poly,
>                TO_GEOMETRY('POLYGON((-1 0, 0 1, 1 0, -1 0))') AS poly_inside,
>                TO_GEOMETRY('LINESTRING(-1 1, 0 2, 1 1)') AS line_on_boundary,
>                TO_GEOMETRY('LINESTRING(-2 0, 0 0, 0 1)') AS line_inside);
> ```
>
> ```none
> +------------------------------+----------------------+----------------------------------+-----------------------------+
> | ST_COVERS(POLY, POLY_INSIDE) | ST_COVERS(POLY,POLY) | ST_COVERS(POLY,LINE_ON_BOUNDARY) | ST_COVERS(POLY,LINE_INSIDE) |
> |------------------------------+----------------------+----------------------------------+-----------------------------|
> | True                         | True                 | True                             | True                        |
> +------------------------------+----------------------+----------------------------------+-----------------------------+
> ```

---
title: ST_DIFFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/st_difference.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_DIFFERENCE

Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the points in the first object that are not in the
second object (i.e. the difference between the two objects).

See also:
:   [ST_INTERSECTION](st_intersection.md) , [ST_UNION](st_union.md) , [ST_SYMDIFFERENCE](st_symdifference.md)

## Syntax

```sqlsyntax
ST_DIFFERENCE( <geography_expression_1> , <geography_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

## Returns

The function returns a value of type GEOGRAPHY.

If all points of `geography_expression_1` are in `geography_expression_2` (i.e. the difference is an empty set of
points), the function returns NULL.

## Usage notes

* If any vertex of one input object is on the boundary of the other input object (excluding the vertices), the output might not be
  accurate.
* The function is not guaranteed to produce normalized and/or minimal results. For example, an output could consist of a
  LineString containing several Points that actually forms just one straight segment.

## Examples

The following example returns a GEOGRAPHY object that represents the difference between two input GEOGRAPHY objects:

> ```sqlexample
> ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
>
> SELECT ST_DIFFERENCE(
>   TO_GEOGRAPHY('POLYGON((0 0, 1 0, 2 1, 1 2, 2 3, 1 4, 0 4, 0 0))'),
>   TO_GEOGRAPHY('POLYGON((3 0, 3 4, 2 4, 1 3, 2 2, 1 1, 2 0, 3 0))'))
> AS difference_between_objects;
> ```

This example produces the following output:

> ```none
> +-------------------------------------------------------------------------------------------------------------+
> | DIFFERENCE_BETWEEN_OBJECTS                                                                                  |
> |-------------------------------------------------------------------------------------------------------------|
> | POLYGON((1 1,1.5 1.500171359,1 2,1.5 2.500285599,1 3,1.5 3.500399839,1 4,0 4,0 0,1 0,1.5 0.5000571198,1 1)) |
> +-------------------------------------------------------------------------------------------------------------+
> ```

The following images illustrate the differences in the areas that represent the input and output objects:

| Input | Output |
| --- | --- |
|  |  |

---
title: ST_DIMENSION
source: https://docs.snowflake.com/en/sql-reference/functions/st_dimension.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_DIMENSION

Given a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md), return the
“dimension” of the value. The dimension of a GEOGRAPHY or GEOMETRY value is:

| Geospatial Object Type | Dimension |
| --- | --- |
| Point / MultiPoint | 0 |
| LineString / MultiLineString | 1 |
| Polygon / MultiPolygon | 2 |
| GeometryCollection | The dimension of the collection is equal to the maximum dimension of all the values inside the collection.  For example, if a GeometryCollection contains a Point (dimension 0) and a LineString (dimension 1), the dimension of the GeometryCollection is 1. |
| Feature | The dimension of the Feature is the same as the dimension of the geospatial object in the Feature. |
| FeatureCollection | The rule is the same as for GeometryCollection. |

The returned values (0, 1, 2) correspond to the common meaning of the word “dimension”: a polygon is a two-dimensional
object, a line is a one-dimensional object, and a point is a zero-dimensional object.

## Syntax

```sqlsyntax
ST_DIMENSION( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

A value of type INTEGER.

## Usage notes

* If the function is passed NULL, the function returns NULL.
* For GEOGRAPHY objects:

  + If the function is passed a GeometryCollection containing at least one NULL element and no non-NULL elements, the function
    returns 0.
  + If the function is passed a GeometryCollection containing at least one NULL element and at least one non-NULL element, the
    function returns the maximum dimension of the non-NULL elements.

  Note that some other systems return different values for NULL inputs.

## Examples

### GEOGRAPHY examples

The following example demonstrates the ST_DIMENSION function:

> ```sqlexample
> create table geospatial_table_02 (id INTEGER, g GEOGRAPHY);
> insert into geospatial_table_02 values
>     (1, 'POINT(-122.35 37.55)'),
>     (2, 'MULTIPOINT((-122.35 37.55), (0.00 -90.0))'),
>     (3, 'LINESTRING(-124.20 42.00, -120.01 41.99)'),
>     (4, 'LINESTRING(-124.20 42.00, -120.01 41.99, -122.5 42.01)'),
>     (5, 'MULTILINESTRING((-124.20 42.00, -120.01 41.99, -122.5 42.01), (10.0 0.0, 20.0 10.0, 30.0 0.0))'),
>     (6, 'POLYGON((-124.20 42.00, -120.01 41.99, -121.1 42.01, -124.20 42.00))'),
>     (7, 'MULTIPOLYGON(((-124.20 42.00, -120.01 41.99, -121.1 42.01, -124.20 42.0)), ((20.0 20.0, 40.0 20.0, 40.0 40.0, 20.0 40.0, 20.0 20.0)))')
>     ;
> ```
>
> ```sqlexample
> select st_dimension(g) as dimension, st_aswkt(g)
>     from geospatial_table_02
>     order by dimension, id;
> +-----------+----------------------------------------------------------------------------------------------------+
> | DIMENSION | ST_ASWKT(G)                                                                                        |
> |-----------+----------------------------------------------------------------------------------------------------|
> |         0 | POINT(-122.35 37.55)                                                                               |
> |         0 | MULTIPOINT((-122.35 37.55),(0 -90))                                                                |
> |         1 | LINESTRING(-124.2 42,-120.01 41.99)                                                                |
> |         1 | LINESTRING(-124.2 42,-120.01 41.99,-122.5 42.01)                                                   |
> |         1 | MULTILINESTRING((-124.2 42,-120.01 41.99,-122.5 42.01),(10 0,20 10,30 0))                          |
> |         2 | POLYGON((-124.2 42,-120.01 41.99,-121.1 42.01,-124.2 42))                                          |
> |         2 | MULTIPOLYGON(((-124.2 42,-120.01 41.99,-121.1 42.01,-124.2 42)),((20 20,40 20,40 40,20 40,20 20))) |
> +-----------+----------------------------------------------------------------------------------------------------+
> ```

### GEOMETRY examples

The following example demonstrates the ST_DIMENSION function:

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_shapes (g GEOMETRY);
> INSERT INTO geometry_shapes VALUES
>     ('POINT(66 12)'),
>     ('MULTIPOINT((45 21), (12 54))'),
>     ('LINESTRING(40 60, 50 50, 60 40)'),
>     ('MULTILINESTRING((1 1, 32 17), (33 12, 73 49, 87.1 6.1))'),
>     ('POLYGON((17 17, 17 30, 30 30, 30 17, 17 17))'),
>     ('MULTIPOLYGON(((-10 0,0 10,10 0,-10 0)),((-10 40,10 40,0 20,-10 40)))'),
>     ('GEOMETRYCOLLECTION(POLYGON((-10 0,0 10,10 0,-10 0)),LINESTRING(40 60, 50 50, 60 40), POINT(99 11))')
>     ;
>
> SELECT ST_DIMENSION(g), ST_ASWKT(g) FROM geometry_shapes;
> ```
>
> ```none
> +-----------------+-------------------------------------------------------------------------------------------------+
> | ST_DIMENSION(G) | ST_ASWKT(G)                                                                                     |
> |-----------------+-------------------------------------------------------------------------------------------------|
> |               0 | POINT(66 12)                                                                                    |
> |               0 | MULTIPOINT((45 21),(12 54))                                                                     |
> |               1 | LINESTRING(40 60,50 50,60 40)                                                                   |
> |               1 | MULTILINESTRING((1 1,32 17),(33 12,73 49,87.1 6.1))                                             |
> |               2 | POLYGON((17 17,17 30,30 30,30 17,17 17))                                                        |
> |               2 | MULTIPOLYGON(((-10 0,0 10,10 0,-10 0)),((-10 40,10 40,0 20,-10 40)))                            |
> |               2 | GEOMETRYCOLLECTION(POLYGON((-10 0,0 10,10 0,-10 0)),LINESTRING(40 60,50 50,60 40),POINT(99 11)) |
> +-----------------+-------------------------------------------------------------------------------------------------+
> ```

---
title: ST_DISJOINT
source: https://docs.snowflake.com/en/sql-reference/functions/st_disjoint.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_DISJOINT

Returns TRUE if the two [GEOGRAPHY](../data-types-geospatial.md) objects or the two
[GEOMETRY](../data-types-geospatial.md) objects are disjoint (i.e. do not share any portion of space). ST_DISJOINT is
equivalent to NOT [ST_INTERSECTS(expr1, expr2)](st_intersects.md).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

See also:
:   [ST_INTERSECTS](st_intersects.md)

## Syntax

```sqlsyntax
ST_DISJOINT( <geography_expression_1> , <geography_expression_2> )

ST_DISJOINT( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

`geometry_expression_1`
:   A GEOMETRY object.

`geometry_expression_2`
:   A GEOMETRY object.

## Returns

BOOLEAN.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

### GEOGRAPHY examples

The following examples use the ST_DISJOINT function to determine if two geospatial objects are disjoint:

> ```sqlexample
> -- These two polygons are disjoint and do not intersect.
> SELECT ST_DISJOINT(
>     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'),
>     TO_GEOGRAPHY('POLYGON((3 3, 5 3, 5 5, 3 5, 3 3))')
>     );
> +---------------------------------------------------------+
> | ST_DISJOINT(                                            |
> |     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'), |
> |     TO_GEOGRAPHY('POLYGON((3 3, 5 3, 5 5, 3 5, 3 3))')  |
> |     )                                                   |
> |---------------------------------------------------------|
> | True                                                    |
> +---------------------------------------------------------+
> ```
>
> ```sqlexample
> -- These two polygons intersect and are not disjoint.
> SELECT ST_DISJOINT(
>     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'),
>     TO_GEOGRAPHY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')
>     );
> +---------------------------------------------------------+
> | ST_DISJOINT(                                            |
> |     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'), |
> |     TO_GEOGRAPHY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')  |
> |     )                                                   |
> |---------------------------------------------------------|
> | False                                                   |
> +---------------------------------------------------------+
> ```

---
title: ST_DISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/st_distance.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_DISTANCE

Returns the minimum great circle distance between two [GEOGRAPHY](../data-types-geospatial.md) or the minimum Euclidean distance
between two [GEOMETRY](../data-types-geospatial.md) objects.

## Syntax

```sqlsyntax
ST_DISTANCE( <geography_or_geometry_expression_1> , <geography_or_geometry_expression_2> )
```

## Arguments

`geography_or_geometry_expression_1`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

`geography_or_geometry_expression_2`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a FLOAT value, which represents the distance, or NULL:

* For GEOGRAPHY input values, the distance is in meters.
* For GEOMETRY input values, the distance is computed with the same units used to define the input coordinates.
* Returns NULL if one or more input points are NULL.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

The following examples use the ST_DISTANCE function.

### GEOGRAPHY examples

Show the distance in meters between two points 1 degree apart along the equator (approximately 111 kilometers or
69 miles).

```sqlexample
WITH d AS
  ( ST_DISTANCE(ST_MAKEPOINT(0, 0), ST_MAKEPOINT(1, 0)) )
SELECT d / 1000 AS kilometers, d / 1609 AS miles;
```

```output
+---------------+--------------+
|    KILOMETERS |        MILES |
|---------------+--------------|
| 111.195101177 | 69.108204585 |
+---------------+--------------+
```

Show the output of the ST_DISTANCE function when one or more input values are NULL:

```sqlexample
SELECT ST_DISTANCE(ST_MAKEPOINT(0, 0), ST_MAKEPOINT(NULL, NULL)) AS null_input;
```

```output
+------------+
| NULL_INPUT |
|------------|
|       NULL |
+------------+
```

### GEOMETRY examples

The following example compares the distance calculated for GEOGRAPHY and GEOMETRY input objects.

```sqlexample
SELECT ST_DISTANCE(TO_GEOMETRY('POINT(0 0)'), TO_GEOMETRY('POINT(1 1)')) AS geometry_distance,
  ST_DISTANCE(TO_GEOGRAPHY('POINT(0 0)'), TO_GEOGRAPHY('POINT(1 1)')) AS geography_distance;
```

```output
+-------------------+--------------------+
| GEOMETRY_DISTANCE | GEOGRAPHY_DISTANCE |
|-------------------+--------------------|
|       1.414213562 |   157249.628092508 |
+-------------------+--------------------+
```

For additional examples, see [Examples comparing the GEOGRAPHY and GEOMETRY data types](../data-types-geospatial.md).

---
title: ST_DWITHIN
source: https://docs.snowflake.com/en/sql-reference/functions/st_dwithin.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_DWITHIN

Returns TRUE if the minimum great circle distance between two points (two [GEOGRAPHY](../data-types-geospatial.md) objects) is
within the specified distance. Otherwise, returns FALSE.

If the parameters are GEOGRAPHY values that are not points (e.g. lines or polygons), this returns TRUE or FALSE based on the
minimum great circle distance between the two closest points of the two values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

## Syntax

```sqlsyntax
ST_DWITHIN( <geography_expression_1> , <geography_expression_2> , <distance_in_meters> )
```

## Arguments

`geography_expression_1`
:   The argument must be an expression of type GEOGRAPHY.

`geography_expression_2`
:   The argument must be an expression of type GEOGRAPHY.

`distance_in_meters`
:   The argument must be an expression of type REAL. The distance is in meters.

## Returns

Returns a BOOLEAN.

## Usage notes

* Returns NULL if any input is NULL.

## Examples

This returns TRUE because the distance in meters between two points 1 degree apart along the equator is less than
150,000 meters:

> ```sqlexample
> SELECT ST_DWITHIN (ST_MAKEPOINT(0, 0), ST_MAKEPOINT(1, 0), 150000);
> +-------------------------------------------------------------+
> | ST_DWITHIN (ST_MAKEPOINT(0, 0), ST_MAKEPOINT(1, 0), 150000) |
> |-------------------------------------------------------------|
> | True                                                        |
> +-------------------------------------------------------------+
> ```

---
title: ST_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/st_endpoint.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ENDPOINT

Returns the last Point in a LineString.

See also:
:   [ST_POINTN](st_pointn.md) , [ST_STARTPOINT](st_startpoint.md)

## Syntax

```sqlsyntax
ST_ENDPOINT( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY that represents a LineString.

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY that contains the last Point of the specified LineString.

## Usage notes

* If `geography_or_geometry_expression` is not a LineString, the function reports an error.

## Examples

### GEOGRAPHY examples

The following query returns the last Point in a LineString:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKT';
SELECT ST_ENDPOINT(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'));

+-------------------------------------------------------------+
| ST_ENDPOINT(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)')) |
|-------------------------------------------------------------|
| POINT(4 4)                                                  |
+-------------------------------------------------------------+
```

### GEOMETRY examples

The following query returns the last Point in a LineString:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
SELECT ST_ENDPOINT(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'));

+------------------------------------------------------------+
| ST_ENDPOINT(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)')) |
|------------------------------------------------------------|
| POINT(4 4)                                                 |
+------------------------------------------------------------+
```

---
title: ST_ENVELOPE
source: https://docs.snowflake.com/en/sql-reference/functions/st_envelope.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ENVELOPE

Returns the minimum bounding box (a rectangular “envelope”) that encloses a specified
[GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_ENVELOPE( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md)
that represents the minimum bounding box around the input object.

## Usage notes

* For GEOGRAPHY objects:

  + If `geography_expression` is a LineString that represents a meridian arc (an arc along a line of longitude),
    ST_ENVELOPE returns that LineString.
  + If `geography_expression` is a LineString that represents an arc on a parallel (an arc along a line of latitude)
    other than the equator, ST_ENVELOPE returns a Polygon that represents the bounding box for the arc.
  + If `geography_expression` is a single Point, ST_ENVELOPE returns that Point.
* For GEOMETRY objects:

  + In degenerate cases (e.g. where the input is a point or a vertical or horizontal line), the function may return a geometry of
    lower dimension (i.e. a Point or LineString).
  > + For GEOMETRY objects, the returned GEOMETRY object has the same SRID as the input.

## Examples

### GEOGRAPHY examples

The following example returns the minimum bounding box for a polygon:

> ```sqlexample
> SELECT ST_ENVELOPE(
>     TO_GEOGRAPHY(
>         'POLYGON((-122.306067 37.55412, -122.32328 37.561801, -122.325879 37.586852, -122.306067 37.55412))'
>     )
> ) as minimum_bounding_box_around_polygon;
> +-----------------------------------------------------------------------------------------------------------------------+
> | MINIMUM_BOUNDING_BOX_AROUND_POLYGON                                                                                   |
> |-----------------------------------------------------------------------------------------------------------------------|
> | POLYGON((-122.325879 37.55412,-122.306067 37.55412,-122.306067 37.586852,-122.325879 37.586852,-122.325879 37.55412)) |
> +-----------------------------------------------------------------------------------------------------------------------+
> ```

The following example passes in a LineString that represents a meridian arc. The function returns the same LineString, rather
than a Polygon.

> ```sqlexample
> SELECT ST_ENVELOPE(
>     TO_GEOGRAPHY(
>         'LINESTRING(-122.32328 37.561801, -122.32328 37.562001)'
>     )
> ) as minimum_bounding_box_around_meridian_arc;
> +-------------------------------------------------------+
> | MINIMUM_BOUNDING_BOX_AROUND_MERIDIAN_ARC              |
> |-------------------------------------------------------|
> | LINESTRING(-122.32328 37.561801,-122.32328 37.562001) |
> +-------------------------------------------------------+
> ```

The following example passes in a LineString that represents an arc on a parallel that is not the equator. The function
returns a Polygon that represents the bounding box:

> ```sqlexample
> SELECT ST_ENVELOPE(
>     TO_GEOGRAPHY(
>         'LINESTRING(-122.32328 37.561801,-122.32351 37.561801)'
>     )
> ) as minimum_bounding_box_around_arc_along_parallel;
> +---------------------------------------------------------------------------------------------------------------------+
> | MINIMUM_BOUNDING_BOX_AROUND_ARC_ALONG_PARALLEL                                                                      |
> |---------------------------------------------------------------------------------------------------------------------|
> | POLYGON((-122.32351 37.561801,-122.32328 37.561801,-122.32328 37.561801,-122.32351 37.561801,-122.32351 37.561801)) |
> +---------------------------------------------------------------------------------------------------------------------+
> ```

The following example passes in a single Point. The function returns the same Point:

> ```sqlexample
> SELECT ST_ENVELOPE(
>     TO_GEOGRAPHY(
>         'POINT(-122.32328 37.561801)'
>     )
> ) as minimum_bounding_box_around_point;
> +-----------------------------------+
> | MINIMUM_BOUNDING_BOX_AROUND_POINT |
> |-----------------------------------|
> | POINT(-122.32328 37.561801)       |
> +-----------------------------------+
> ```

### GEOMETRY examples

---
title: ST_GEOGFROMGEOHASH
source: https://docs.snowflake.com/en/sql-reference/functions/st_geogfromgeohash.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOGFROMGEOHASH

Returns a [GEOGRAPHY](../data-types-geospatial.md) object for the polygon that represents the boundaries of a
[geohash](st_geohash.md).

The optional `precision` argument specifies the precision to use for the input geohash.
For example, passing `5` for `precision` specifies that the function should use the first 5 characters of the input geohash.

See also:
:   [ST_GEOHASH](st_geohash.md), [ST_GEOGPOINTFROMGEOHASH](st_geogpointfromgeohash.md)

## Syntax

```sqlsyntax
ST_GEOGFROMGEOHASH( <geohash> [, <precision> ] )
```

## Arguments

**Required:**

`geohash`
:   The argument must be a geohash.

**Optional:**

`precision`
:   The number of characters to use from the input geohash. For example, passing `5` for `precision` causes
    the function to use the first 5 characters in the geohash.

    You can specify a value from `1` to `20`.

    By default, `precision` is `20`, which causes the function to use up to the first 20 characters of the geohash.

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

## Examples

The following example returns the GEOGRAPHY object for a geohash:

> ```sqlexample
> SELECT ST_GEOGFROMGEOHASH('9q9j8ue2v71y5zzy0s4q')
>     AS geography_from_geohash,
>     ST_AREA(ST_GEOGFROMGEOHASH('9q9j8ue2v71y5zzy0s4q'))
>     AS area_of_geohash;
> +---------------------------------+-----------------+
> | GEOGRAPHY_FROM_GEOHASH          | AREA_OF_GEOHASH |
> |---------------------------------+-----------------|
> | {                               |  5.48668572e-16 |
> |   "coordinates": [              |                 |
> |     [                           |                 |
> |       [                         |                 |
> |         -1.223061000000001e+02, |                 |
> |         3.755416199999996e+01   |                 |
> |       ],                        |                 |
> |       [                         |                 |
> |         -1.223061000000001e+02, |                 |
> |         3.755416200000012e+01   |                 |
> |       ],                        |                 |
> |       [                         |                 |
> |         -1.223060999999998e+02, |                 |
> |         3.755416200000012e+01   |                 |
> |       ],                        |                 |
> |       [                         |                 |
> |         -1.223060999999998e+02, |                 |
> |         3.755416199999996e+01   |                 |
> |       ],                        |                 |
> |       [                         |                 |
> |         -1.223061000000001e+02, |                 |
> |         3.755416199999996e+01   |                 |
> |       ]                         |                 |
> |     ]                           |                 |
> |   ],                            |                 |
> |   "type": "Polygon"             |                 |
> | }                               |                 |
> +---------------------------------+-----------------+
> ```

The following example returns the GEOGRAPHY object for a less precise geohash. The function uses the first 6 characters from the input geohash:

> ```sqlexample
> SELECT ST_GEOGFROMGEOHASH('9q9j8ue2v71y5zzy0s4q', 6)
>     AS geography_from_less_precise_geohash,
>     ST_AREA(ST_GEOGFROMGEOHASH('9q9j8ue2v71y5zzy0s4q', 6))
>     AS area_of_geohash;
> +-------------------------------------+-----------------+
> | GEOGRAPHY_FROM_LESS_PRECISE_GEOHASH | AREA_OF_GEOHASH |
> |-------------------------------------+-----------------|
> | {                                   | 591559.75661851 |
> |   "coordinates": [                  |                 |
> |     [                               |                 |
> |       [                             |                 |
> |         -1.223107910156250e+02,     |                 |
> |         3.755126953125000e+01       |                 |
> |       ],                            |                 |
> |       [                             |                 |
> |         -1.223107910156250e+02,     |                 |
> |         3.755676269531250e+01       |                 |
> |       ],                            |                 |
> |       [                             |                 |
> |         -1.222998046875000e+02,     |                 |
> |         3.755676269531250e+01       |                 |
> |       ],                            |                 |
> |       [                             |                 |
> |         -1.222998046875000e+02,     |                 |
> |         3.755126953125000e+01       |                 |
> |       ],                            |                 |
> |       [                             |                 |
> |         -1.223107910156250e+02,     |                 |
> |         3.755126953125000e+01       |                 |
> |       ]                             |                 |
> |     ]                               |                 |
> |   ],                                |                 |
> |   "type": "Polygon"                 |                 |
> | }                                   |                 |
> +-------------------------------------+-----------------+
> ```

---
title: ST_GEOGPOINTFROMGEOHASH
source: https://docs.snowflake.com/en/sql-reference/functions/st_geogpointfromgeohash.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOGPOINTFROMGEOHASH

Returns a [GEOGRAPHY](../data-types-geospatial.md) object for the Point that represents the center of a
[geohash](st_geohash.md).

See also:
:   [ST_GEOHASH](st_geohash.md), [ST_GEOGFROMGEOHASH](st_geogfromgeohash.md)

## Syntax

```sqlsyntax
ST_GEOGPOINTFROMGEOHASH( <geohash> )
```

## Arguments

`geohash`
:   The argument must be a geohash.

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md) that represents the Point that is
the center of the geohash.

## Examples

The following example returns the GEOGRAPHY object for the Point at the center of a geohash:

> ```sqlexample
> SELECT ST_GEOGPOINTFROMGEOHASH('9q9j8ue2v71y5zzy0s4q')
>     AS geography_center_point_of_geohash;
> +-----------------------------------+
> | GEOGRAPHY_CENTER_POINT_OF_GEOHASH |
> |-----------------------------------|
> | {                                 |
> |   "coordinates": [                |
> |     -1.223060999999999e+02,       |
> |     3.755416200000003e+01         |
> |   ],                              |
> |   "type": "Point"                 |
> | }                                 |
> +-----------------------------------+
> ```

---
title: ST_GEOGRAPHYFROMWKB
source: https://docs.snowflake.com/en/sql-reference/functions/st_geographyfromwkb.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOGRAPHYFROMWKB

Parses a
[WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) or
[EWKB (extended well-known binary)](../data-types-geospatial.md) input and returns a value of type
[GEOGRAPHY](../data-types-geospatial.md).

Aliases:
:   ST_GEOGFROMWKB , ST_GEOGRAPHYFROMEWKB , ST_GEOGFROMEWKB

See also:
:   [TO_GEOGRAPHY](to_geography.md)

## Syntax

```sqlsyntax
ST_GEOGRAPHYFROMWKB( <varchar_or_binary_expression> [ , <allow_invalid> ] )

ST_GEOGFROMWKB( <varchar_or_binary_expression> [ , <allow_invalid> ] )

ST_GEOGRAPHYFROMEWKB( <varchar_or_binary_expression> [ , <allow_invalid> ] )

ST_GEOGFROMEWKB( <varchar_or_binary_expression> [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_or_binary_expression`
:   The argument must be a string or binary expression in WKB or EWKB that represents a valid geospatial object.

    A string expression must be in hexadecimal format (without a leading `0x`).

**Optional:**

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as WKB or EWKB.
* Issues an error if the input format is EWKB and the SRID is not 4326.
  See the [note on EWKT and EWKB handling](../data-types-geospatial.md).

## Examples

The following example returns the GEOGRAPHY object for a geospatial object described in WKT format:

> ```sqlexample
> -- Set the output format to WKT
> alter session set GEOGRAPHY_OUTPUT_FORMAT='WKT';
> ```
>
> ```sqlexample
> select ST_GEOGRAPHYFROMWKB('01010000006666666666965EC06666666666C64240');
> +-------------------------------------------------------------------+
> | ST_GEOGRAPHYFROMWKB('01010000006666666666965EC06666666666C64240') |
> |-------------------------------------------------------------------|
> | POINT(-122.35 37.55)                                              |
> +-------------------------------------------------------------------+
> ```

The following example returns the GEOGRAPHY object for a geospatial object described in EWKT format:

> ```sqlexample
> -- Set the output format to EWKT
> alter session set GEOGRAPHY_OUTPUT_FORMAT='EWKT';
> ```
>
> ```sqlexample
> select ST_GEOGRAPHYFROMEWKB('0101000020E61000006666666666965EC06666666666C64240');
> +----------------------------------------------------------------------------+
> | ST_GEOGRAPHYFROMEWKB('0101000020E61000006666666666965EC06666666666C64240') |
> |----------------------------------------------------------------------------|
> | SRID=4326;POINT(-122.35 37.55)                                             |
> +----------------------------------------------------------------------------+
> ```

---
title: ST_GEOGRAPHYFROMWKT
source: https://docs.snowflake.com/en/sql-reference/functions/st_geographyfromwkt.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOGRAPHYFROMWKT

Parses a
[WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) or
[EWKT (extended well-known text)](../data-types-geospatial.md) input and returns a value of type
[GEOGRAPHY](../data-types-geospatial.md).

Aliases:
:   ST_GEOGFROMWKT , ST_GEOGRAPHYFROMEWKT , ST_GEOGFROMEWKT , ST_GEOGRAPHYFROMTEXT , ST_GEOGFROMTEXT

See also:
:   [TO_GEOGRAPHY](to_geography.md)

## Syntax

```sqlsyntax
ST_GEOGRAPHYFROMWKT( <varchar_expression> [ , <allow_invalid> ] )

ST_GEOGFROMWKT( <varchar_expression> [ , <allow_invalid> ] )

ST_GEOGRAPHYFROMEWKT( <varchar_expression> [ , <allow_invalid> ] )

ST_GEOGFROMEWKT( <varchar_expression> [ , <allow_invalid> ] )

ST_GEOGRAPHYFROMTEXT( <varchar_expression> [ , <allow_invalid> ] )

ST_GEOGFROMTEXT( <varchar_expression> [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression in WKT or EWKT that represents a valid geospatial object.

**Optional:**

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as WKT or EWKT.
* Issues an error if the input format is EWKT and the SRID is not 4326.
  See the [note on EWKT and EWKB handling](../data-types-geospatial.md).

* For the coordinates in WKT, EWKT, and GeoJSON, longitude appears before latitude (for example, `POINT(lon lat)`).

## Examples

The following example returns the GEOGRAPHY object for a geospatial object described in WKT format:

> ```sqlexample
> -- Set the output format to WKT
> alter session set GEOGRAPHY_OUTPUT_FORMAT='WKT';
> ```
>
> ```sqlexample
> select ST_GEOGRAPHYFROMWKT('POINT(-122.35 37.55)');
> ```
>
> ```output
> +---------------------------------------------+
> | ST_GEOGRAPHYFROMWKT('POINT(-122.35 37.55)') |
> |---------------------------------------------|
> | POINT(-122.35 37.55)                        |
> +---------------------------------------------+
> ```

The following example returns the GEOGRAPHY object for a geospatial object with a Z coordinate described in WKT format:

> ```sqlexample
> -- Set the output format to WKT
> alter session set GEOGRAPHY_OUTPUT_FORMAT='WKT';
> ```
>
> ```sqlexample
> select ST_GEOGRAPHYFROMWKT('POINTZ(-122.35 37.55 30)');
> ```
>
> ```output
> +-------------------------------------------------+
> | ST_GEOGRAPHYFROMWKT('POINTZ(-122.35 37.55 30)') |
> |-------------------------------------------------|
> | POINTZ(-122.35 37.55 30)                        |
> +-------------------------------------------------+
> ```

The following example returns the GEOGRAPHY object for a geospatial object described in EWKT format:

> ```sqlexample
> -- Set the output format to EWKT
> alter session set GEOGRAPHY_OUTPUT_FORMAT='EWKT';
> ```
>
> ```sqlexample
> select ST_GEOGRAPHYFROMEWKT('SRID=4326;POINT(-122.35 37.55)');
> ```
>
> ```output
> +--------------------------------------------------------+
> | ST_GEOGRAPHYFROMEWKT('SRID=4326;POINT(-122.35 37.55)') |
> |--------------------------------------------------------|
> | SRID=4326;POINT(-122.35 37.55)                         |
> +--------------------------------------------------------+
> ```

---
title: ST_GEOHASH
source: https://docs.snowflake.com/en/sql-reference/functions/st_geohash.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_GEOHASH

Returns the [geohash](https://en.wikipedia.org/wiki/Geohash) for a [GEOGRAPHY](../data-types-geospatial.md)
or [GEOMETRY](../data-types-geospatial.md) object. A geohash is a short base32 string that identifies a great circle
rectangle containing a location in the world.

The number of characters in a geohash determines precision. Removing characters
from the end of a geohash results in a geohash that is less precise and that identifies a
larger rectangular area.

ST_GEOHASH returns a geohash that is 20 characters long.
The optional `precision` argument specifies the precision of the returned geohash.
For example, passing `5` for `precision` returns a shorter geohash (5 characters long) that is less precise.

> **Note:**
>
> For a geospatial object that is not a point, the function might return a geohash of less precision, regardless of the default or
> specified value for `precision`.
>
> In these cases, precision is determined by the bounding box of the geospatial object. ST_GEOHASH first determines the geohashes
> of the lower left and upper right corners of the bounding box and then returns the prefix that is common to these two geohashes.

See also:
:   [ST_GEOGFROMGEOHASH](st_geogfromgeohash.md), [ST_GEOGPOINTFROMGEOHASH](st_geogpointfromgeohash.md)

## Syntax

```sqlsyntax
ST_GEOHASH( <geography_expression> [, <precision> ] )

ST_GEOHASH( <geometry_expression> [, <precision> ] )
```

## Arguments

**Required:**

`geography_expression`
:   The argument must be an expression of type GEOGRAPHY.

`geometry_expression`
:   The argument must be an expression of type GEOMETRY with the SRID 4326.

**Optional:**

`precision`
:   The number of characters to use in the geohash. You can specify a value from `1` to `20`.

    By default, `precision` is `20`, which produces a geohash that is 20 characters long.

## Returns

Returns the geohash (a value of type STRING) for the specified object.

If the object is a Polygon and the two points of the bounding box do not share the same geohash prefix, the function might return
an empty string.

## Examples

The following example returns the geohash for a GEOGRAPHY point:

```sqlexample
SELECT ST_GEOHASH(
  TO_GEOGRAPHY('POINT(-122.306100 37.554162)'))
  AS geohash_of_point_a;
```

```output
+----------------------+
| GEOHASH_OF_POINT_A   |
|----------------------|
| 9q9j8ue2v71y5zzy0s4q |
+----------------------+
```

The following example returns a geohash for the same GEOGRAPHY point with less precision:

```sqlexample
SELECT ST_GEOHASH(
  TO_GEOGRAPHY('POINT(-122.306100 37.554162)'),
  5) AS less_precise_geohash_a;
```

```output
+------------------------+
| LESS_PRECISE_GEOHASH_A |
|------------------------|
| 9q9j8                  |
+------------------------+
```

The following example returns the geohash for a GEOMETRY point:

```sqlexample
SELECT ST_GEOHASH(
  TO_GEOMETRY('POINT(-122.306100 37.554162)', 4326))
  AS geohash_of_point_a;
```

```output
+----------------------+
| GEOHASH_OF_POINT_A   |
|----------------------|
| 9q9j8ue2v71y5zzy0s4q |
+----------------------+
```

The following example shows two geohashes that share the same prefix, which indicates that the two GEOGRAPHY points are near to each other.

```sqlexample
SELECT
  ST_GEOHASH(
    TO_GEOGRAPHY('POINT(-122.306100 37.554162)'))
    AS geohash_of_point_a,
  ST_GEOHASH(
    TO_GEOGRAPHY('POINT(-122.323111 37.562333)'))
    AS geohash_of_point_b;
```

```output
+----------------------+----------------------+
| GEOHASH_OF_POINT_A   | GEOHASH_OF_POINT_B   |
|----------------------+----------------------|
| 9q9j8ue2v71y5zzy0s4q | 9q9j8qp02yms1tpjesmc |
+----------------------+----------------------+
```

```sqlexample
SELECT
  ST_GEOHASH(
    TO_GEOGRAPHY('POINT(-122.306100 37.554162)'),
    5) AS less_precise_geohash_a,
  ST_GEOHASH(
    TO_GEOGRAPHY('POINT(-122.323111 37.562333)'),
    5) AS less_precise_geohash_b;
```

```output
+------------------------+------------------------+
| LESS_PRECISE_GEOHASH_A | LESS_PRECISE_GEOHASH_B |
|------------------------+------------------------|
| 9q9j8                  | 9q9j8                  |
+------------------------+------------------------+
```

The following example returns the geohash for a polygon. The lower left and upper right corners of the bounding box of this polygon
are the same two GEOGRAPHY points used in the previous examples. As shown in this example, ST_GEOHASH returns the prefix common to the
geohashes of the lower left and upper right corners of the bounding box.

```sqlexample
SELECT
  ST_GEOHASH(
    TO_GEOGRAPHY(
      'POLYGON((-122.306100 37.554162, -122.306100 37.562333, -122.323111 37.562333, -122.323111 37.554162, -122.306100 37.554162))'
    )
  ) AS geohash_of_polygon;
```

```output
+--------------------+
| GEOHASH_OF_POLYGON |
|--------------------|
| 9q9j8              |
+--------------------+
```

---
title: ST_GEOMETRYFROMWKB
source: https://docs.snowflake.com/en/sql-reference/functions/st_geometryfromwkb.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOMETRYFROMWKB

Parses a
[WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) or EWKB
(extended well-known binary) input and returns a value of type [GEOMETRY](../data-types-geospatial.md).

Aliases:
:   ST_GEOMFROMWKB , ST_GEOMETRYFROMEWKB , ST_GEOMFROMEWKB

See also:
:   [TO_GEOMETRY](to_geometry.md)

## Syntax

```sqlsyntax
ST_GEOMETRYFROMWKB( <varchar_or_binary_expression> [ , <srid> ]  [ , <allow_invalid> ] )

ST_GEOMFROMWKB( <varchar_or_binary_expression> [ , <srid> ]  [ , <allow_invalid> ] )

ST_GEOMETRYFROMEWKB( <varchar_or_binary_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMFROMEWKB( <varchar_or_binary_expression> [ , <srid> ] [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_or_binary_expression`
:   The argument must be a string or binary expression in WKB or EWKB that represents a valid geospatial object.

    A string expression must be in hexadecimal format (without a leading `0x`).

**Optional:**

`srid`
:   The integer value of the SRID to use.

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOMETRY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as WKB or EWKB.
* For WKB input, if the `srid` argument is not specified, the resulting GEOMETRY object has the SRID set to 0.

## Examples

The following example returns the GEOMETRY object for a geospatial object described in EWKB format:

> ```sqlexample
> -- Set the geometry output format to EWKT
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';
>
> SELECT ST_GEOMETRYFROMEWKB('0101000020797F000066666666A9CB17411F85EBC19E325641');
> ```
>
> ```none
> +---------------------------------------------------------------------------+
> | ST_GEOMETRYFROMEWKB('0101000020797F000066666666A9CB17411F85EBC19E325641') |
> |---------------------------------------------------------------------------|
> | SRID=32633;POINT(389866.35 5819003.03)                                    |
> +---------------------------------------------------------------------------+
> ```

In the next example, the input is in WKB format, which does not specify the SRID:

> ```sqlexample
> -- Set the geometry output format to EWKT
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';
>
> SELECT ST_GEOMETRYFROMEWKB('010100000066666666A9CB17411F85EBC19E325641');
> ```
>
> ```none
> +-------------------------------------------------------------------+
> | ST_GEOMETRYFROMEWKB('010100000066666666A9CB17411F85EBC19E325641') |
> |-------------------------------------------------------------------|
> | SRID=0;POINT(389866.35 5819003.03)                                |
> +-------------------------------------------------------------------+
> ```

---
title: ST_GEOMETRYFROMWKT
source: https://docs.snowflake.com/en/sql-reference/functions/st_geometryfromwkt.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# ST_GEOMETRYFROMWKT

Parses a
[WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) or EWKT (extended well-known
text) input and returns a value of type [GEOMETRY](../data-types-geospatial.md).

Aliases:
:   ST_GEOMFROMWKT , ST_GEOMETRYFROMEWKT , ST_GEOMFROMEWKT , ST_GEOMETRYFROMTEXT , ST_GEOMFROMTEXT

See also:
:   [TO_GEOMETRY](to_geometry.md)

## Syntax

```sqlsyntax
ST_GEOMETRYFROMWKT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMFROMWKT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMETRYFROMEWKT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMFROMEWKT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMETRYFROMTEXT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

ST_GEOMFROMTEXT( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression in WKT or EWKT that represents a valid geospatial object.

**Optional:**

`srid`
:   The integer value of the SRID to use.

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOMETRY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as WKT or EWKT.
* For WKT input, if the `srid` argument is not specified, the resulting GEOMETRY object has the SRID set to 0.

## Examples

The following example returns the GEOMETRY object for a geospatial object described in EWKT format:

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT ST_GEOMETRYFROMEWKT('SRID=32633;POINT(389866.35 5819003.03)');
```

```output
+---------------------------------------------------------------+
| ST_GEOMETRYFROMEWKT('SRID=32633;POINT(389866.35 5819003.03)') |
|---------------------------------------------------------------|
| SRID=32633;POINT(389866.35 5819003.03)                        |
+---------------------------------------------------------------+
```

The following example returns the GEOMETRY object for a geospatial object with a Z coordinate described in EWKT format:

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT ST_GEOMETRYFROMEWKT('SRID=32633;POINTZ(389866.35 5819003.03 30)');
```

```output
+-------------------------------------------------------------------+
| ST_GEOMETRYFROMEWKT('SRID=32633;POINTZ(389866.35 5819003.03 30)') |
|-------------------------------------------------------------------|
| SRID=32633;POINTZ(389866.35 5819003.03 30)                        |
+-------------------------------------------------------------------+
```

In the next example, the input is in WKT format, and the function call specifies the SRID to use:

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT ST_GEOMETRYFROMWKT('POINT(389866.35 5819003.03)', 4326);
```

```output
+----------------------------------------------------------+
| ST_GEOMETRYFROMWKT('POINT(389866.35 5819003.03)', 4326)  |
|----------------------------------------------------------|
| SRID=4326;POINT(389866.35 5819003.03)                    |
+----------------------------------------------------------+
```

---
title: ST_GEOMFROMGEOHASH
source: https://docs.snowflake.com/en/sql-reference/functions/st_geomfromgeohash.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_GEOMFROMGEOHASH

Returns a [GEOMETRY](../data-types-geospatial.md) object for the polygon that represents the boundaries of a
[geohash](https://en.wikipedia.org/wiki/Geohash).

The number of characters in a geohash determines precision. Removing characters
from the end of a geohash results in a geohash that is less precise and that identifies a
larger rectangular area.

The optional `precision` argument specifies the precision to use for the input geohash. For example, passing `5`
for `precision` specifies that the function uses the first 5 characters of the input geohash.

See also:
:   [ST_GEOHASH](st_geohash.md), [ST_GEOMPOINTFROMGEOHASH](st_geompointfromgeohash.md)

## Syntax

```sqlsyntax
ST_GEOMFROMGEOHASH( <geohash> [, <precision> ] )
```

## Arguments

**Required:**

`geohash`
:   The argument must be a geohash.

**Optional:**

`precision`
:   The number of characters to use in the geohash. You can specify a value from `1` to `20`.

    By default, `precision` is `20`, which produces a geohash that is 20 characters long.

## Returns

Returns a value of type [GEOMETRY](../data-types-geospatial.md).

## Examples

The following example returns the GEOMETRY object for a geohash:

```sqlexample
SELECT ST_GEOMFROMGEOHASH('9q9j8ue2v71y5zzy0s4q')
  AS geometry_from_geohash,
  ST_AREA(ST_GEOMFROMGEOHASH('9q9j8ue2v71y5zzy0s4q'))
  AS area_of_geohash;
```

```output
+---------------------------------+-----------------+
| GEOMETRY_FROM_GEOHASH           | AREA_OF_GEOHASH |
|---------------------------------+-----------------|
| {                               | 5.492996255e-26 |
|   "coordinates": [              |                 |
|     [                           |                 |
|       [                         |                 |
|         -1.223061000000001e+02, |                 |
|         3.755416199999996e+01   |                 |
|       ],                        |                 |
|       [                         |                 |
|         -1.223061000000001e+02, |                 |
|         3.755416200000012e+01   |                 |
|       ],                        |                 |
|       [                         |                 |
|         -1.223060999999998e+02, |                 |
|         3.755416200000012e+01   |                 |
|       ],                        |                 |
|       [                         |                 |
|         -1.223060999999998e+02, |                 |
|         3.755416199999996e+01   |                 |
|       ],                        |                 |
|       [                         |                 |
|         -1.223061000000001e+02, |                 |
|         3.755416199999996e+01   |                 |
|       ]                         |                 |
|     ]                           |                 |
|   ],                            |                 |
|   "type": "Polygon"             |                 |
| }                               |                 |
+---------------------------------+-----------------+
```

The following example returns the GEOMETRY object for a less precise geohash. The function uses the first 6 characters from the input geohash:

```sqlexample
SELECT ST_GEOMFROMGEOHASH('9q9j8ue2v71y5zzy0s4q', 6)
  AS geometry_from_less_precise_geohash,
  ST_AREA(ST_GEOMFROMGEOHASH('9q9j8ue2v71y5zzy0s4q', 6))
  AS area_of_geohash;
```

```output
+------------------------------------+-----------------+
| GEOMETRY_FROM_LESS_PRECISE_GEOHASH | AREA_OF_GEOHASH |
|------------------------------------+-----------------|
| {                                  | 6.034970284e-05 |
|   "coordinates": [                 |                 |
|     [                              |                 |
|       [                            |                 |
|         -1.223107910156250e+02,    |                 |
|         3.755126953125000e+01      |                 |
|       ],                           |                 |
|       [                            |                 |
|         -1.223107910156250e+02,    |                 |
|         3.755676269531250e+01      |                 |
|       ],                           |                 |
|       [                            |                 |
|         -1.222998046875000e+02,    |                 |
|         3.755676269531250e+01      |                 |
|       ],                           |                 |
|       [                            |                 |
|         -1.222998046875000e+02,    |                 |
|         3.755126953125000e+01      |                 |
|       ],                           |                 |
|       [                            |                 |
|         -1.223107910156250e+02,    |                 |
|         3.755126953125000e+01      |                 |
|       ]                            |                 |
|     ]                              |                 |
|   ],                               |                 |
|   "type": "Polygon"                |                 |
| }                                  |                 |
+------------------------------------+-----------------+
```

---
title: ST_GEOMPOINTFROMGEOHASH
source: https://docs.snowflake.com/en/sql-reference/functions/st_geompointfromgeohash.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_GEOMPOINTFROMGEOHASH

Returns a [GEOMETRY](../data-types-geospatial.md) object for the point that represents center of a
[geohash](https://en.wikipedia.org/wiki/Geohash).

See also:
:   [ST_GEOHASH](st_geohash.md), [ST_GEOMFROMGEOHASH](st_geomfromgeohash.md)

## Syntax

```sqlsyntax
ST_GEOMPOINTFROMGEOHASH( <geohash> )
```

## Arguments

`geohash`
:   The argument must be a geohash.

## Returns

Returns a value of type [GEOMETRY](../data-types-geospatial.md) that represents the point that is the
center of the geohash.

## Examples

The following example returns the GEOMETRY object for the point at the center of a geohash:

```sqlexample
SELECT ST_GEOMPOINTFROMGEOHASH('9q9j8ue2v71y5zzy0s4q')
  AS geometry_center_point_of_geohash;
```

```output
+----------------------------------+
| GEOMETRY_CENTER_POINT_OF_GEOHASH |
|----------------------------------|
| {                                |
|   "coordinates": [               |
|     -1.223061000000001e+02,      |
|     3.755416199999996e+01        |
|   ],                             |
|   "type": "Point"                |
| }                                |
+----------------------------------+
```

---
title: ST_HAUSDORFFDISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/st_hausdorffdistance.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_HAUSDORFFDISTANCE

Returns the discrete [Hausdorff distance](https://en.wikipedia.org/wiki/Hausdorff_distance) between two
[GEOGRAPHY](../data-types-geospatial.md) objects.

The Hausdorff distance indicates how similar the two objects are. Two objects are considered to be similar if each point in
one object is close to a point in the other object. The Hausdorff distance is the greatest distance between a point in one
object and a point in the other object.

ST_HAUSDORFFDISTANCE returns the discrete Hausdorff distance, which is calculated by comparing only the vertices (discrete
points) and not arbitrary points along the edge.

## Syntax

```sqlsyntax
ST_HAUSDORFFDISTANCE( <geography_expression_1> , <geography_expression_2> )
```

## Arguments

`geography_expression_1`
:   The argument must be an expression of type GEOGRAPHY.

`geography_expression_2`
:   The argument must be an expression of type GEOGRAPHY.

## Returns

Returns a value of type REAL that represents the discrete Hausdorff distance in degrees.

## Usage notes

* Returns NULL if one or more input points are NULL.

## Examples

This example returns the Hausdorff distance between two points (point `0 0` and point `0 1`):

> ```sqlexample
> SELECT ST_HAUSDORFFDISTANCE(ST_POINT(0, 0), ST_POINT(0, 1));
> +------------------------------------------------------+
> | ST_HAUSDORFFDISTANCE(ST_POINT(0, 0), ST_POINT(0, 1)) |
> |------------------------------------------------------|
> |                                                    1 |
> +------------------------------------------------------+
> ```

The next example compares three Polygons (`a`, `b`, and `c`).

The distance between the farthest points in `a` and `c` (point `0 1` and point `0 3`) is greater than the distance
between the farthest points in `a` and `b` (point `1 0` and point `2 0`).

As a result, the value returned by ST_HAUSDORFFDISTANCE is smaller for `a` and `c`. This indicates that `a` and `c`
are more similar than `a` and `b`.

> ```sqlexample
> WITH
>     a AS (TO_GEOGRAPHY('POLYGON((-1 0, 0 1, 1 0, 0 -1, -1 0))')),
>     b AS (TO_GEOGRAPHY('POLYGON((-1 0, 0 1, 2 0, 0 -1, -1 0))')),
>     c AS (TO_GEOGRAPHY('POLYGON((-1 0, 0 3, 1 0, 0 -1, -1 0))'))
> SELECT
>     ST_HAUSDORFFDISTANCE(a, b) as distance_between_a_and_b,
>     ST_HAUSDORFFDISTANCE(a, c) as distance_between_a_and_c;
> +--------------------------+--------------------------+
> | DISTANCE_BETWEEN_A_AND_B | DISTANCE_BETWEEN_A_AND_C |
> |--------------------------+--------------------------|
> |                        1 |                        2 |
> +--------------------------+--------------------------+
> ```

---
title: ST_INTERPOLATE
source: https://docs.snowflake.com/en/sql-reference/functions/st_interpolate.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_INTERPOLATE

Given an input [GEOGRAPHY](../data-types-geospatial.md) object, returns an interpolated object that is within a specified
tolerance.

You can call this function when you need to see how GEOGRAPHY objects look like in the planar coordinate system (for example,
when using visualization tools for geospatial data).

## Syntax

```sqlsyntax
ST_INTERPOLATE( <geography_expression> [ , <tolerance> ] )
```

## Arguments

**Required:**

`geography_expression`
:   The GEOGRAPHY object to interpolate.

**Optional:**

`tolerance`
:   The maximum [Hausdorff distance](https://en.wikipedia.org/wiki/Hausdorff_distance) in meters between the original object and
    its [planar (Mercator) projection](https://en.wikipedia.org/wiki/Mercator_projection).

    Default: 10 meters

## Returns

The function returns a value of type GEOGRAPHY.

## Examples

The following statements return an interpolated object within tolerance of 1000 meters. The statement returns output in WKT
format.

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';

SELECT TO_GEOGRAPHY(
  'POLYGON((2.365837 48.862456,-76.992874 39.009046,-16.091194 18.013997,2.365837 48.862456))')
    AS input_object,
  ST_INTERPOLATE(
    TO_GEOGRAPHY(
      'POLYGON((2.365837 48.862456,-76.992874 39.009046,-16.091194 18.013997,2.365837 48.862456))'
    ),
    1000
  ) AS interpolated_object;
```

```output
+--------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| INPUT_OBJECT                                                                               | INTERPOLATED_OBJECT                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 |
|--------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| POLYGON((2.365837 48.862456,-76.992874 39.009046,-16.091194 18.013997,2.365837 48.862456)) | POLYGON((2.365837 48.862456,0.2362767764 49.398305615,-1.996262883 49.906689161,-4.332160104 50.382437182,-6.770456901 50.820131017,-9.308629958 51.214188609,-11.942383026 51.558974127,-13.293232976 51.711146148,-14.665482532 51.848930511,-16.058033739 51.971669906,-17.469660151 52.078731821,-18.899009042 52.169513959,-20.344605456 52.243449667,-21.804858156 52.300013293,-23.278067492 52.338725392,-24.762435158 52.359157656,-26.256075742 52.3609375,-27.757029917 52.343752197,-29.263279078 52.307352477,-30.772761157 52.251555506,-32.283387321 52.176247197,-33.793059226 52.081383774,-35.299686463 51.966992581,-36.801203842 51.833172099,-38.295588157 51.680091182,-39.780874093 51.507987534,-41.25516898 51.317165449,-42.716666126 51.107992889,-44.16365652 50.880897942,-45.594538759 50.636364763,-47.007827085 50.374929055,-48.402157513 50.097173201,-49.776292048 49.803721135,-51.129121057 49.495233035,-52.459663914 49.172399933,-53.767068032 48.835938324,-55.050606468 48.486584847,-57.543783773 47.752218706,-59.935729311 46.975406135,-62.224659081 46.162267318,-64.410251688 45.318811613,-66.493415436 44.450833283,-68.47605821 43.563829764,-70.360871429 42.662941503,-72.151135223 41.752911448,-73.850548486 40.838061719,-75.463084888 39.922284722,-76.992874 39.009046,-75.054371602 38.900776338,-73.070188895 38.756302935,-71.04361197 38.573354324,-68.978491512 38.349817237,-66.879224949 38.083791708,-64.750721339 37.773647264,-62.598348624 37.418078058,-60.427863866 37.016154514,-58.24532822 36.567368929,-56.057009424 36.07167252,-53.86927558 35.52950163,-51.688484598 34.941791231,-49.5208741 34.309974475,-47.372456469 33.635967742,-45.248923381 32.922141467,-43.155563394 32.171277799,-41.097195205 31.386516865,-39.078118124 30.57129403,-37.102080178 29.729270906,-35.172263312 28.864263107,-33.291284269 27.980167693,-31.461209158 27.080893062,-29.683579328 26.170293681,-27.959446009 25.252111561,-26.289411197 24.329925855,-24.673672456 23.407111409,-23.112069576 22.486806569,-21.604131345 21.571890097,-18.746079719 19.768360293,-16.091194 18.013997,-14.452640773 21.830752143,-12.663169134 25.760435024,-11.706340645 27.75379918,-10.704942742 29.757834797,-9.656443607 31.765812338,-8.558194149 33.770740816,-7.407434508 35.765476568,-6.20130448 37.742837326,-4.936858776 39.695716684,-3.61108818 41.617194045,-2.2209478 43.500635601,-0.7633937119 45.339782667,0.7645706485 47.128824761,2.365837 48.862456)) |
+--------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

The following images visualize the original and interpolated GEOGRAPHY objects from the example above in the Mercator projection
([EPSG:3857](https://epsg.io/3857)).

| Original | Interpolated |
| --- | --- |
|  |  |

---
title: ST_INTERSECTION
source: https://docs.snowflake.com/en/sql-reference/functions/st_intersection.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_INTERSECTION

Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the shape containing the set of points that are
common to both input objects (i.e. the intersection of the two objects).

See also:
:   [ST_INTERSECTION_AGG](st_intersection_agg.md) , [ST_UNION](st_union.md) , [ST_DIFFERENCE](st_difference.md) , [ST_SYMDIFFERENCE](st_symdifference.md)

## Syntax

```sqlsyntax
ST_INTERSECTION( <geography_expression_1> , <geography_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* If any vertex of one input object is on the boundary of the other input object (excluding the vertices), the output might or
  might not include that vertex point.

  For example, suppose that `geography_expression_1` is `POINT(1 1)` and `geography_expression_2` is
  `LINESTRING(1 0, 1 2)`. In this case, `geography_expression_1` is on the boundary of `geography_expression_2`
  but is not a vertex of it.

  In this example, the expected output is `POINT(1 1)`, but the actual output might be an empty geography (represented by NULL).

  To help to detect and work around these cases, one potential idea is to use [ST_DWITHIN](st_dwithin.md) to
  determine if the minimum distance between the two input objects is `0`. For example, you can check if a point lies on top of a
  LineString by checking if the minimum distance between the two objects is zero:

  > ```sqlexample
  > SELECT TO_GEOGRAPHY('POLYGON((0 0, 1 0, 2 1, 1 2, 2 3, 1 4, 0 4, 0 0))') AS polygon,
  >        TO_GEOGRAPHY('POINT(0 2)') AS point,
  >        ST_DWITHIN(polygon, point, 0) AS point_is_on_top_of_polygon,
  >        ST_INTERSECTION(polygon, point);
  > ```

  This statement produces the following output:

  > ```none
  > +--------------------------------------------+------------+----------------------------+---------------------------------+
  > | POLYGON                                    | POINT      | POINT_IS_ON_TOP_OF_POLYGON | ST_INTERSECTION(POLYGON, POINT) |
  > |--------------------------------------------+------------+----------------------------+---------------------------------|
  > | POLYGON((0 0,1 0,2 1,1 2,2 3,1 4,0 4,0 0)) | POINT(0 2) | True                       | NULL                            |
  > +--------------------------------------------+------------+----------------------------+---------------------------------+
  > ```

  The function is not guaranteed to produce normalized and/or minimal results. For example, an output could consist of a
  LineString containing several points that actually forms just one straight segment.

## Examples

The following example returns a GEOGRAPHY object that represents the intersection of two input GEOGRAPHY objects:

> ```sqlexample
> ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
>
> SELECT ST_INTERSECTION(
>   TO_GEOGRAPHY('POLYGON((0 0, 1 0, 2 1, 1 2, 2 3, 1 4, 0 4, 0 0))'),
>   TO_GEOGRAPHY('POLYGON((3 0, 3 4, 2 4, 1 3, 2 2, 1 1, 2 0, 3 0))'))
> AS intersection_of_objects;
> ```

This example produces the following output:

> ```none
> +-----------------------------------------------------------------------------------------------------------------------------------------+
> | INTERSECTION_OF_OBJECTS                                                                                                                 |
> |-----------------------------------------------------------------------------------------------------------------------------------------|
> | MULTIPOLYGON(((1.5 0.5000571198,2 1,1.5 1.500171359,1 1,1.5 0.5000571198)),((1.5 2.500285599,2 3,1.5 3.500399839,1 3,1.5 2.500285599))) |
> +-----------------------------------------------------------------------------------------------------------------------------------------+
> ```

The following images illustrate the differences in the areas that represent the input and output objects:

| Input | Output |
| --- | --- |
|  |  |

---
title: ST_INTERSECTION_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/st_intersection_agg.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_INTERSECTION_AGG

Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the shape containing the combined set of points that are
common to the shapes represented by the objects in the column (that is, the intersection of the shapes).

See also:
:   [ST_INTERSECTION](st_intersection.md) , [ST_UNION_AGG](st_union_agg.md)

## Syntax

```sqlsyntax
ST_INTERSECTION_AGG( <geography_column> )
```

## Arguments

`geography_column`
:   A GEOGRAPHY column.

## Returns

The function returns a value of type GEOGRAPHY.

## Examples

Create a table with a GEOMETRY column and insert data:

```sqlexample
CREATE OR REPLACE TABLE st_intersection_agg_demo_table (g GEOGRAPHY);

INSERT INTO st_intersection_agg_demo_table VALUES
  ('POLYGON((10 10, 11 11, 11 10, 10 10))'),
  ('POLYGON((10 10, 11 10, 10 11, 10 10))'),
  ('POLYGON((10.5 10.5, 10 10, 11 10, 10.5 10.5))');
```

Use the ST_INTERSECTION_AGG function to return a GEOGRAPHY object that represents the intersection of
the shapes represented by the objects in the GEOGRAPHY column:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';

SELECT ST_INTERSECTION_AGG(g) AS intersection_of_shapes
  FROM st_intersection_agg_demo_table;
```

```output
+--------------------------------------------+
| INTERSECTION_OF_SHAPES                     |
|--------------------------------------------|
| POLYGON((10.5 10.5,10 10,11 10,10.5 10.5)) |
+--------------------------------------------+
```

---
title: ST_INTERSECTS
source: https://docs.snowflake.com/en/sql-reference/functions/st_intersects.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_INTERSECTS

Returns TRUE if the two [GEOGRAPHY](../data-types-geospatial.md) objects or the two
[GEOMETRY](../data-types-geospatial.md) objects intersect (i.e. share any portion of space).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [ST_DISJOINT](st_disjoint.md)

## Syntax

```sqlsyntax
ST_INTERSECTS( <geography_expression_1> , <geography_expression_2> )

ST_INTERSECTS( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

`geometry_expression_1`
:   A GEOMETRY object.

`geometry_expression_2`
:   A GEOMETRY object.

## Returns

BOOLEAN.

## Usage notes

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_INTERSECTS function:

> ```sqlexample
> SELECT ST_INTERSECTS(
>     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'),
>     TO_GEOGRAPHY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')
>     );
> +---------------------------------------------------------+
> | ST_INTERSECTS(                                          |
> |     TO_GEOGRAPHY('POLYGON((0 0, 2 0, 2 2, 0 2, 0 0))'), |
> |     TO_GEOGRAPHY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')  |
> |     )                                                   |
> |---------------------------------------------------------|
> | True                                                    |
> +---------------------------------------------------------+
> ```

### GEOMETRY examples

This shows a simple use of the ST_INTERSECTS function:

> ```sqlexample
> SELECT ST_INTERSECTS(
>   TO_GEOMETRY('POLYGON((0 0, 0 2, 2 2, 2 0, 0 0))'),
>   TO_GEOMETRY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')
> );
> ```
>
> ```none
> +------------------------------------------------------+
> | ST_INTERSECTS(                                       |
> |   TO_GEOMETRY('POLYGON((0 0, 0 2, 2 2, 2 0, 0 0))'), |
> |   TO_GEOMETRY('POLYGON((1 1, 3 1, 3 3, 1 3, 1 1))')  |
> | )                                                    |
> |------------------------------------------------------|
> | True                                                 |
> +------------------------------------------------------+
> ```

---
title: ST_ISVALID
source: https://docs.snowflake.com/en/sql-reference/functions/st_isvalid.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_ISVALID

Returns TRUE if the specified [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object represents a
[valid shape](../data-types-geospatial.md). Examples of
invalid shapes include shapes with self-intersections and spikes.

## Syntax

```sqlsyntax
ST_ISVALID( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a BOOLEAN value.

## Usage notes

* ST_ISVALID only checks for the validity of a shape. It doesn’t modify data. When constructing objects from
  spatial formats (such as WKT, WKB, EWKT, EWKB, or GeoJSON), conversion functions (for example, [TO_GEOGRAPHY](to_geography.md),
  [TO_GEOMETRY](to_geometry.md), [ST_GEOGRAPHYFROMWKT](st_geographyfromwkt.md), or [ST_GEOMETRYFROMWKT](st_geometryfromwkt.md)) parse input and by default
  attempt to validate or repair shapes. If a conversion function can’t repair a shape, it returns
  an error unless you accept invalid shapes.
* To ingest data that might be invalid (for example, data that you plan to correct later), specify TRUE for the
  additional `allow_invalid` argument when you call the conversion function to allow an invalid shape.
  You can then use the ST_ISVALID function to flag invalid rows in a table.
* Some geospatial functions might return an error or unusable results when given invalid shapes. Use the
  ST_ISVALID function to check validity. You can correct invalid shapes before performing spatial analytics.
* When shapes are invalid, simple corrections include buffering with a small positive or negative distance
  (for example, to remove tiny spikes or resolve self-intersections) and then rechecking validity using the
  ST_ISVALID function.

## Examples

The following examples use the ST_ISVALID function.

Determine whether a polygon is a valid shape:

```sqlexample
SELECT ST_ISVALID(
    TO_GEOGRAPHY('POLYGON((-93.086 37.557,-86.699 37.497,-93.198 35.123,-93.086 37.557))')
  ) AS is_valid;
```

```output
+----------+
| IS_VALID |
|----------|
| True     |
+----------+
```

```sqlexample
SELECT ST_ISVALID(
    TO_GEOGRAPHY( 'POLYGON((-92.799 37.601,-88.240 37.617,-92.733 36.198,-88.305 36.171,-92.799 37.601))', TRUE)
  ) AS is_valid;
```

```output
+----------+
| IS_VALID |
|----------|
| False    |
+----------+
```

Correct an invalid shape by using the [ST_BUFFER](st_buffer.md) function to add small buffer:

```sqlexample
WITH g AS (
  SELECT TO_GEOMETRY('POLYGON((0 0, 2 2, 2 0, 0 2, 0 0))', TRUE) AS geom
)
SELECT ST_ISVALID(geom) AS is_valid_before_buffer,
  ST_ISVALID(ST_BUFFER(geom, -0.001)) AS is_valid_after_buffer
  FROM g;
```

```output
+------------------------+-----------------------+
| IS_VALID_BEFORE_BUFFER | IS_VALID_AFTER_BUFFER |
|------------------------+-----------------------|
| False                  | True                  |
+------------------------+-----------------------+
```

---
title: ST_LENGTH
source: https://docs.snowflake.com/en/sql-reference/functions/st_length.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_LENGTH

Returns the great circle length of the LineString(s) in a [GEOGRAPHY](../data-types-geospatial.md) object or the Euclidean
length of the LineString(s) in a [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_LENGTH( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value, which represents the length:

* For GEOGRAPHY input values, the length is in meters.
* For GEOMETRY input values, the length is computed with the same units used to define the input coordinates.

## Usage notes

* If `geography_or_geometry_expression` is not a LineString, MultiLineString, or GeometryCollection containing linestrings, ST_LENGTH returns 0.
* If `geography_or_geometry_expression` is a GeometryCollection, ST_LENGTH returns the sum of the lengths of the linestrings in the collection.
* If you want the perimeter length of a polygon, use the [ST_PERIMETER](st_perimeter.md) function instead.

## Examples

### GEOGRAPHY examples

This shows the length in meters of one degree of arc at the equator:

> ```sqlexample
> SELECT ST_LENGTH(TO_GEOGRAPHY('LINESTRING(0.0 0.0, 1.0 0.0)'));
> +---------------------------------------------------------+
> | ST_LENGTH(TO_GEOGRAPHY('LINESTRING(0.0 0.0, 1.0 0.0)')) |
> |---------------------------------------------------------|
> |                                        111195.101177484 |
> +---------------------------------------------------------+
> ```

### GEOMETRY examples

The following example demonstrates how to use the ST_LENGTH function.

> ```sqlexample
> SELECT ST_LENGTH(g), ST_ASWKT(g)
> FROM (SELECT TO_GEOMETRY(column1) AS g
>   FROM VALUES ('POINT(1 1)'),
>               ('LINESTRING(0 0, 1 1)'),
>               ('POLYGON((0 0, 0 1, 1 1, 1 0, 0 0))'));
> ```
>
> ```none
> +--------------+--------------------------------+
> | ST_LENGTH(G) | ST_ASWKT(G)                    |
> |--------------+--------------------------------|
> |  0           | POINT(1 1)                     |
> |  1.414213562 | LINESTRING(0 0,1 1)            |
> |  0           | POLYGON((0 0,0 1,1 1,1 0,0 0)) |
> +--------------+--------------------------------+
> ```

---
title: ST_MAKEGEOMPOINT , ST_GEOMPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/st_makegeompoint.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_MAKEGEOMPOINT , ST_GEOMPOINT

Constructs a [GEOMETRY](../data-types-geospatial.md) object that represents a Point with the specified longitude and latitude.

See also:
:   [TO_GEOMETRY](to_geometry.md)

## Syntax

```sqlsyntax
ST_MAKEGEOMPOINT( <longitude> , <latitude> )
```

## Arguments

`longitude`
:   A REAL that represents the longitude.

`latitude`
:   A REAL that represents the latitude.

## Returns

The function returns a value of type GEOMETRY.

## Usage notes

* ST_GEOMPOINT is an alias for ST_MAKEGEOMPOINT.

## Examples

For examples, see [Examples comparing the GEOGRAPHY and GEOMETRY data types](../data-types-geospatial.md). The examples use the
ST_GEOMPOINT alias.

---
title: ST_MAKELINE
source: https://docs.snowflake.com/en/sql-reference/functions/st_makeline.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_MAKELINE

Constructs a [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object that represents
a line connecting the points in the input objects.

See also:
:   [TO_GEOGRAPHY](to_geography.md) , [TO_GEOMETRY](to_geometry.md)

## Syntax

```sqlsyntax
ST_MAKELINE( <geography_expression_1> , <geography_expression_2> )

ST_MAKELINE( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object containing the points to connect. This object must be a Point, MultiPoint, or LineString.

`geography_expression_2`
:   A GEOGRAPHY object containing the points to connect. This object must be a Point, MultiPoint, or LineString.

`geometry_expression_1`
:   A GEOMETRY object containing the points to connect. This object must be a Point, MultiPoint, or LineString.

`geometry_expression_2`
:   A GEOMETRY object containing the points to connect. This object must be a Point, MultiPoint, or LineString.

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY. The value is a LineString that connects all of the points specified by
the input GEOGRAPHY or GEOMETRY objects.

## Usage notes

* If an input GEOGRAPHY object contains multiple points, ST_MAKELINE connects all of the points specified in the object.
* ST_MAKELINE connects the points in the order in which they are specified in the input.

* For GEOMETRY objects, the function reports an error if the two input GEOMETRY objects have different SRIDs.

* For GEOMETRY objects, the returned GEOMETRY object has the same SRID as the input.

## Examples

### GEOGRAPHY examples

The examples in this section display output in WKT format:

> ```sqlexample
> alter session set GEOGRAPHY_OUTPUT_FORMAT='WKT';
> ```

The following example uses ST_MAKELINE to construct a LineString that connects two Points:

> ```sqlexample
> SELECT ST_MAKELINE(
>                    TO_GEOGRAPHY('POINT(37.0 45.0)'),
>                    TO_GEOGRAPHY('POINT(38.5 46.5)')
>                   ) AS line_between_two_points;
> +-----------------------------+
> | LINE_BETWEEN_TWO_POINTS     |
> |-----------------------------|
> | LINESTRING(37 45,38.5 46.5) |
> +-----------------------------+
> ```

The following example constructs a LineString that connects a Point with the points in a MultiPoint:

> ```sqlexample
> SELECT ST_MAKELINE(
>                    TO_GEOGRAPHY('POINT(-122.306067 37.55412)'),
>                    TO_GEOGRAPHY('MULTIPOINT((-122.32328 37.561801), (-122.325879 37.586852))')
>                   ) AS line_between_point_and_multipoint;
> +-----------------------------------------------------------------------------+
> | LINE_BETWEEN_POINT_AND_MULTIPOINT                                           |
> |-----------------------------------------------------------------------------|
> | LINESTRING(-122.306067 37.55412,-122.32328 37.561801,-122.325879 37.586852) |
> +-----------------------------------------------------------------------------+
> ```

As demonstrated by the output of the example, ST_MAKELINE connects the points in the order in which they are specified in the input.

The following example constructs a LineString that connects the points in a MultiPoint with another LineString:

> ```sqlexample
> SELECT ST_MAKELINE(
>                    TO_GEOGRAPHY('MULTIPOINT((-122.32328 37.561801), (-122.325879 37.586852))'),
>                    TO_GEOGRAPHY('LINESTRING(-122.306067 37.55412, -122.496691 37.495627)')
>                   ) AS line_between_multipoint_and_linestring;
> +---------------------------------------------------------------------------------------------------+
> | LINE_BETWEEN_MULTIPOINT_AND_LINESTRING                                                            |
> |---------------------------------------------------------------------------------------------------|
> | LINESTRING(-122.32328 37.561801,-122.325879 37.586852,-122.306067 37.55412,-122.496691 37.495627) |
> +---------------------------------------------------------------------------------------------------+
> ```

### GEOMETRY examples

The examples in this section display output in WKT format:

> ```sqlexample
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
> ```

The first example constructs a line between two Points:

> ```sqlexample
> SELECT ST_MAKELINE(
>   TO_GEOMETRY('POINT(1.0 2.0)'),
>   TO_GEOMETRY('POINT(3.5 4.5)')) AS line_between_two_points;
> ```
>
> ```none
> +-------------------------+
> | LINE_BETWEEN_TWO_POINTS |
> |-------------------------|
> | LINESTRING(1 2,3.5 4.5) |
> +-------------------------+
> ```

The next example demonstrates creating a LineString that connects points in a MultiPoint with a Point

> ```sqlexample
> SELECT ST_MAKELINE(
>   TO_GEOMETRY('POINT(1.0 2.0)'),
>   TO_GEOMETRY('MULTIPOINT(3.5 4.5, 6.1 7.9)')) AS line_from_point_and_multipoint;
> ```
>
> ```none
> +---------------------------------+
> | LINE_FROM_POINT_AND_MULTIPOINT  |
> |---------------------------------|
> | LINESTRING(1 2,3.5 4.5,6.1 7.9) |
> +---------------------------------+
> ```

The following example constructs a LineString that connects the points in a MultiPoint with another LineString:

> ```sqlexample
> SELECT ST_MAKELINE(
>   TO_GEOMETRY('LINESTRING(1.0 2.0, 10.1 5.5)'),
>   TO_GEOMETRY('MULTIPOINT(3.5 4.5, 6.1 7.9)')) AS line_from_linestring_and_multipoint;
> ```
>
> ```none
> +------------------------------------------+
> | LINE_FROM_LINESTRING_AND_MULTIPOINT      |
> |------------------------------------------|
> | LINESTRING(1 2,10.1 5.5,3.5 4.5,6.1 7.9) |
> +------------------------------------------+
> ```

---
title: ST_MAKEPOINT , ST_POINT
source: https://docs.snowflake.com/en/sql-reference/functions/st_makepoint.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_MAKEPOINT , ST_POINT

Constructs a [GEOGRAPHY](../data-types-geospatial.md) object that represents a point with the specified longitude
and latitude.

See also:
:   [TO_GEOGRAPHY](to_geography.md)

## Syntax

```sqlsyntax
ST_MAKEPOINT( <longitude> , <latitude> )
```

## Arguments

`longitude`
:   A REAL that represents the longitude.

`latitude`
:   A REAL that represents the latitude.

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* ST_POINT is an alias for ST_MAKEPOINT.

## Examples

This shows a simple use of the ST_MAKEPOINT function:

> ```sqlexample
> SELECT ST_MAKEPOINT(37.5, 45.5);
> +--------------------------+
> | ST_MAKEPOINT(37.5, 45.5) |
> |--------------------------|
> | POINT(37.5 45.5)         |
> +--------------------------+
> ```

---
title: ST_MAKEPOLYGON , ST_POLYGON
source: https://docs.snowflake.com/en/sql-reference/functions/st_makepolygon.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_MAKEPOLYGON , ST_POLYGON

Constructs a [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object that represents
a Polygon without holes. The function uses the specified LineString as the outer loop.

This function corrects the orientation of the loop to prevent the creation of Polygons that span more than half of the globe. In
contrast, [ST_MAKEPOLYGONORIENTED](st_makepolygonoriented.md) doesn’t attempt to correct the orientation of the loop.

See also:
:   [TO_GEOGRAPHY](to_geography.md) , [TO_GEOMETRY](to_geometry.md) , [ST_MAKEPOLYGONORIENTED](st_makepolygonoriented.md)

## Syntax

```sqlsyntax
ST_MAKEPOLYGON( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   A GEOGRAPHY or GEOMETRY object that represents a LineString in which the last point is the same as the first (i.e. a
    loop).

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY.

## Usage notes

* The lines of the Polygon must form a loop. In other words, the last Point in the sequence of Points defining the LineString
  must be the same Point as the first Point in the sequence.
* ST_POLYGON is an alias for ST_MAKEPOLYGON.

* For GEOMETRY objects, the returned GEOMETRY object has the same SRID as the input.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_MAKEPOLYGON function. The sequence of points below defines a great circle
rectangular area 1 degree wide and 2 degrees high, with the lower left corner of the polygon starting at the
equator (latitude) and Greenwich (longitude). The last point in the sequence is the same as the first point,
which completes the loop.

> ```sqlexample
> SELECT ST_MAKEPOLYGON(
>    TO_GEOGRAPHY('LINESTRING(0.0 0.0, 1.0 0.0, 1.0 2.0, 0.0 2.0, 0.0 0.0)')
>    ) AS polygon1;
> +--------------------------------+
> | POLYGON1                       |
> |--------------------------------|
> | POLYGON((0 0,1 0,1 2,0 2,0 0)) |
> +--------------------------------+
> ```

### GEOMETRY examples

This shows a simple use of the ST_MAKEPOLYGON function.

> ```sqlexample
> SELECT ST_MAKEPOLYGON(
>   TO_GEOMETRY('LINESTRING(0.0 0.0, 1.0 0.0, 1.0 2.0, 0.0 2.0, 0.0 0.0)')
>   ) AS polygon;
> ```
>
> ```none
> +--------------------------------+
> | POLYGON                        |
> |--------------------------------|
> | POLYGON((0 0,1 0,1 2,0 2,0 0)) |
> +--------------------------------+
> ```

---
title: ST_MAKEPOLYGONORIENTED
source: https://docs.snowflake.com/en/sql-reference/functions/st_makepolygonoriented.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_MAKEPOLYGONORIENTED

Constructs a [GEOGRAPHY](../data-types-geospatial.md) object that represents a Polygon without holes. The function uses the
specified LineString as the outer loop.

This function does not attempt to correct the orientation of the loop, thus allowing for the creation of Polygons that span more
than half of the globe. In contrast, [ST_MAKEPOLYGON](st_makepolygon.md) inverts the orientation of
those large shapes.

See also:
:   [TO_GEOGRAPHY](to_geography.md), [ST_MAKEPOLYGON](st_makepolygon.md)

## Syntax

```sqlsyntax
ST_MAKEPOLYGONORIENTED( <geography_expression> )
```

## Arguments

`geography_expression`
:   A GEOGRAPHY object that represents a LineString in which the last point is the same as the first (i.e. a loop).

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* The lines of the Polygon must form a loop. In other words, the last Point in the sequence of Points defining the LineString
  must be the same Point as the first Point in the sequence.
* As you follow along the loop, the inside of the Polygon should be on the left, and the outside of the Polygon should be on the
  right.

## Examples

The following example demonstrates how to use the ST_MAKEPOLYGONORIENTED function. The sequence of Points below defines a
great circle rectangular area one degree wide and two degrees high. The lower left corner of the Polygon starts at the equator (latitude)
and Greenwich (longitude). The last Point in the sequence is the same as the first Point, which completes the loop.

The example passes the GEOGRAPHY object for the Polygon to the [ST_AREA](st_area.md) function to return the area of the Polygon.

```sqlexample
SELECT ST_AREA(
  ST_MAKEPOLYGONORIENTED(
    TO_GEOGRAPHY('LINESTRING(0.0 0.0, 1.0 0.0, 1.0 2.0, 0.0 2.0, 0.0 0.0)')
  )
) AS area_of_polygon;

+------------------+
|  AREA_OF_POLYGON |
|------------------|
| 24724306355.5504 |
+------------------+
```

The following example is the same shape but has the opposite orientation. As indicated by the difference in the area of the
Polygon, the Polygon represents the entire globe except for that previous shape.

```sqlexample
SELECT ST_AREA(
  ST_MAKEPOLYGONORIENTED(
    TO_GEOGRAPHY('LINESTRING(0.0 0.0, 0.0 2.0, 1.0 2.0, 1.0 0.0, 0.0 0.0)')
  )
) AS area_of_polygon;

+-----------------+
| AREA_OF_POLYGON |
|-----------------|
| 510041348811633 |
+-----------------+
```

---
title: ST_NPOINTS , ST_NUMPOINTS
source: https://docs.snowflake.com/en/sql-reference/functions/st_npoints.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_NPOINTS , ST_NUMPOINTS

Returns the number of points in a [GEOGRAPHY](../data-types-geospatial.md) or [GEOGRAPHY](../data-types-geospatial.md)
object.

## Syntax

```sqlsyntax
ST_NPOINTS( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a value of type INTEGER.

## Usage notes

* Each Polygon loop lists the starting point twice (once as the start, once as the end). ST_NPOINTS
  counts both occurrences. For example, given a triangular Polygon, ST_NPOINTS returns 4, not 3.
* ST_NUMPOINTS is an alias for ST_NPOINTS.

  > **Note:**
  >
  > In some other systems, ST_NUMPOINTS behaves differently from ST_NPOINTS and returns the number of points for
  > LineString / MultiLineString objects only.

## Examples

### GEOGRAPHY examples

This shows the number of points in a simple Polygon.

> ```sqlexample
> create table geospatial_table_01 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> insert into geospatial_table_01 (g1, g2) values
>     ('POLYGON((0 0, 3 0, 3 3, 0 3, 0 0))', 'POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))');
> ```
>
> ```sqlexample
> SELECT ST_NPOINTS(g1)
>     FROM geospatial_table_01;
> +----------------+
> | ST_NPOINTS(G1) |
> |----------------|
> |              5 |
> +----------------+
> ```

### GEOMETRY examples

The following example demonstrates how to use the ST_NPOINTS function.

> ```sqlexample
> CREATE OR REPLACE TABLE geometry_shapes (g GEOMETRY);
> INSERT INTO geometry_shapes VALUES
>     ('POINT(66 12)'),
>     ('MULTIPOINT((45 21), (12 54))'),
>     ('LINESTRING(40 60, 50 50, 60 40)'),
>     ('MULTILINESTRING((1 1, 32 17), (33 12, 73 49, 87.1 6.1))'),
>     ('POLYGON((17 17, 17 30, 30 30, 30 17, 17 17))'),
>     ('MULTIPOLYGON(((-10 0,0 10,10 0,-10 0)),((-10 40,10 40,0 20,-10 40)))'),
>     ('GEOMETRYCOLLECTION(POLYGON((-10 0,0 10,10 0,-10 0)),LINESTRING(40 60, 50 50, 60 40), POINT(99 11))')
>     ;
>
> SELECT ST_NPOINTS(g), ST_ASWKT(g) FROM geometry_shapes;
> ```
>
> ```none
> +---------------+-------------------------------------------------------------------------------------------------+
> | ST_NPOINTS(G) | ST_ASWKT(G)                                                                                     |
> |---------------+-------------------------------------------------------------------------------------------------|
> |             1 | POINT(66 12)                                                                                    |
> |             2 | MULTIPOINT((45 21),(12 54))                                                                     |
> |             3 | LINESTRING(40 60,50 50,60 40)                                                                   |
> |             5 | MULTILINESTRING((1 1,32 17),(33 12,73 49,87.1 6.1))                                             |
> |             5 | POLYGON((17 17,17 30,30 30,30 17,17 17))                                                        |
> |             8 | MULTIPOLYGON(((-10 0,0 10,10 0,-10 0)),((-10 40,10 40,0 20,-10 40)))                            |
> |             8 | GEOMETRYCOLLECTION(POLYGON((-10 0,0 10,10 0,-10 0)),LINESTRING(40 60,50 50,60 40),POINT(99 11)) |
> +---------------+-------------------------------------------------------------------------------------------------+
> ```

---
title: ST_PERIMETER
source: https://docs.snowflake.com/en/sql-reference/functions/st_perimeter.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_PERIMETER

Returns the length of the perimeter of the polygon(s) in a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_PERIMETER( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value, which represents the length:

* For GEOGRAPHY objects, the length is in meters.
* For GEOMETRY objects, the length is computed with the same unit used to define the coordinates.

## Usage notes

* If `geography_or_geometry_expression` is not a Polygon, MultiPolygon, or GeometryCollection containing Polygons, ST_PERIMETER returns 0.
* If `geography_or_geometry_expression` is a GeometryCollection, ST_PERIMETER returns the sum of the perimeters of the Polygons in the collection.
* Use this function (rather than ST_LENGTH) to get the perimeter of a Polygon.

## Examples

### GEOGRAPHY examples

This calculates the length of the perimeter of a polygon that is one degree of arc on each edge and has one
edge on the equator:

> ```sqlexample
> SELECT ST_PERIMETER(TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))'));
> +------------------------------------------------------------------+
> | ST_PERIMETER(TO_GEOGRAPHY('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))')) |
> |------------------------------------------------------------------|
> |                                                 444763.468727621 |
> +------------------------------------------------------------------+
> ```

### GEOMETRY examples

The following example demonstrates how to use the ST_PERIMETER function.

> ```sqlexample
> SELECT ST_PERIMETER(g), ST_ASWKT(g)
> FROM (SELECT TO_GEOMETRY(column1) AS g
>   FROM VALUES ('POINT(1 1)'),
>               ('LINESTRING(0 0, 1 1)'),
>               ('POLYGON((0 0, 0 1, 1 1, 1 0, 0 0))'));
> ```
>
> ```none
> +-----------------+--------------------------------+
> | ST_PERIMETER(G) | ST_ASWKT(G)                    |
> |-----------------+--------------------------------|
> |               0 | POINT(1 1)                     |
> |               0 | LINESTRING(0 0,1 1)            |
> |               4 | POLYGON((0 0,0 1,1 1,1 0,0 0)) |
> +-----------------+--------------------------------+
> ```

---
title: ST_POINTN
source: https://docs.snowflake.com/en/sql-reference/functions/st_pointn.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_POINTN

Returns a Point at a specified index in a LineString.

See also:
:   [ST_ENDPOINT](st_endpoint.md) , [ST_STARTPOINT](st_startpoint.md)

## Syntax

```sqlsyntax
ST_POINTN( <geography_or_geometry_expression> , <index> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY that represents a LineString.

`index`
:   The index of the Point to return. The index must be an integer.

    A negative index is interpreted as the offset from the end of the LineString. For example, `-1` is interpreted as the last
    Point in the LineString, `-2` is interpreted as the second to the last Point, etc.

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY that contains the Point at the specified index of the LineString.

## Usage notes

* If `geography_or_geometry_expression` is not a LineString, the function reports an error.
* If `index` is out of bounds (e.g. exceeds the number of Points in the LineString), the function reports an error.

## Examples

### GEOGRAPHY examples

The following query returns the second Point in a LineString:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKT';
SELECT ST_POINTN(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), 2);

+--------------------------------------------------------------+
| ST_POINTN(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), 2) |
|--------------------------------------------------------------|
| POINT(2 2)                                                   |
+--------------------------------------------------------------+
```

The following query uses a negative index to return the second Point from the end of a LineString:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKT';
SELECT ST_POINTN(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), -2);

+---------------------------------------------------------------+
| ST_POINTN(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), -2) |
|---------------------------------------------------------------|
| POINT(3 3)                                                    |
+---------------------------------------------------------------+
```

### GEOMETRY examples

The following query returns the second Point in a LineString:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
SELECT ST_POINTN(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), 2);

+-------------------------------------------------------------+
| ST_POINTN(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), 2) |
|-------------------------------------------------------------|
| POINT(2 2)                                                  |
+-------------------------------------------------------------+
```

The following query uses a negative index to return the second Point from the end of a LineString:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
SELECT ST_POINTN(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), -2);

+--------------------------------------------------------------+
| ST_POINTN(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'), -2) |
|--------------------------------------------------------------|
| POINT(3 3)                                                   |
+--------------------------------------------------------------+
```

---
title: ST_SETSRID
source: https://docs.snowflake.com/en/sql-reference/functions/st_setsrid.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_SETSRID

Returns a [GEOMETRY](../data-types-geospatial.md) object that has its SRID (spatial reference system identifier) set to the
specified value.

Use this function to change the SRID without affecting the coordinates of the object. If you also need to
[change the coordinates to match the new SRS (spatial reference system)](../data-types-geospatial.md), use
[ST_TRANSFORM](st_transform.md) instead.

## Syntax

```sqlsyntax
ST_SETSRID( <geometry_expression> , <srid> )
```

## Arguments

`geometry_expression`
:   The argument must be an expression of type GEOMETRY.

`srid`
:   The SRID to set in the returned GEOMETRY object.

## Returns

The function returns a value of type GEOMETRY.

## Usage notes

## Examples

The following example creates and returns a GEOMETRY object that uses the SRID 4326:

> ```sqlexample
> ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';
>
> SELECT ST_SETSRID(TO_GEOMETRY('POINT(13 51)'), 4326);
>
> +-----------------------------------------------+
> | ST_SETSRID(TO_GEOMETRY('POINT(13 51)'), 4326) |
> |-----------------------------------------------|
> | SRID=4326;POINT(13 51)                        |
> +-----------------------------------------------+
> ```

---
title: ST_SIMPLIFY
source: https://docs.snowflake.com/en/sql-reference/functions/st_simplify.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_SIMPLIFY

Given an input [GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object that represents
a Line or Polygon, returns a simpler approximation of the object. The function identifies and removes selected vertices, resulting
in a similar object that has fewer vertices.

For example, if the input object is a Polygon with 50 vertices, ST_SIMPLIFY can return a simpler Polygon with only 20 of those
vertices.

When simplifying an object, the function removes a vertex only if the distance between that vertex and the edge resulting from
the removal of that vertex is within the specified tolerance.

## Syntax

```sqlsyntax
ST_SIMPLIFY( <geography_expression>, <tolerance> [ , <preserve_collapsed> ] )
ST_SIMPLIFY( <geometry_expression>, <tolerance> )
```

## Arguments

**Required:**

`geography_expression` . OR . `geometry_expression`
:   The GEOGRAPHY or GEOMETRY object to simplify.

    Depending on the type of the GEOGRAPHY or GEOMETRY object, ST_SIMPLIFY has the following effect:

    | Type of Object | Effect of ST_SIMPLIFY |
    | --- | --- |
    | LineString, MultiLineString, Polygon, or MultiPolygon | ST_SIMPLIFY applies the simplification algorithm |
    | Point or MultiPoint | ST_SIMPLIFY has no effect. |
    | GeometryCollection or FeatureCollection | For GEOGRAPHY objects, ST_SIMPLIFY applies the simplification algorithm to each object in the collection. . . For GEOMETRY objects, ST_SIMPLIFY does not support these types. |

`tolerance`
:   The maximum distance used by the simplification algorithm. Depending on the data type of the object, the following units are used for
    the operation:

    * GEOGRAPHY - Distance is interpreted in meters.
    * GEOMETRY - Distance is interpreted in the units of the object’s SRID (spatial reference system identifier). For example, the
      distance is interpreted in degrees for EPSG:4326, in meters for many projected SRIDs, or in feet for some local SRIDs.

      If the distance exceeds this tolerance for a candidate vertex, ST_SIMPLIFY keeps that vertex in the simplified object.

**Optional:**

`preserve_collapsed`
:   (For GEOGRAPHY objects only) If `TRUE`, retains objects that would otherwise be too small given the tolerance.

    For example, when `preserve_collapsed` is `FALSE` and `tolerance` is `10` (meters), a 1m long line
    is reduced to a point in the simplified object. When `preserve_collapsed` is `TRUE`, the line is preserved in
    the simplified object.

    Default: `FALSE`.

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY.

## Examples

### GEOGRAPHY examples

The examples in this section display output in WKT format:

> ```sqlexample
> alter session set GEOGRAPHY_OUTPUT_FORMAT='WKT';
> ```

The following example returns a simplified LineString that has fewer vertices than the original LineString.
In the simplified object, a vertex is omitted if the distance between the vertex and the edge that replaces the vertex
is less than 1000 meters.

> ```sqlexample
> SELECT ST_SIMPLIFY(
>     TO_GEOGRAPHY('LINESTRING(-122.306067 37.55412, -122.32328 37.561801, -122.325879 37.586852)'),
>     1000);
> +----------------------------------------------------------------------------------------------------+
> | ST_SIMPLIFY(                                                                                       |
> |     TO_GEOGRAPHY('LINESTRING(-122.306067 37.55412, -122.32328 37.561801, -122.325879 37.586852)'), |
> |     1000)                                                                                          |
> |----------------------------------------------------------------------------------------------------|
> | LINESTRING(-122.306067 37.55412,-122.325879 37.586852)                                             |
> +----------------------------------------------------------------------------------------------------+
> ```

### GEOMETRY examples

The examples in this section display output in WKT format:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
```

The following example returns a simplified LineString that has fewer vertices than the original LineString. In the simplified
object, a vertex is omitted if the distance between the vertex and the edge that replaces the vertex is less than 500 meters.

```sqlexample
SELECT ST_SIMPIFY(
  TO_GEOMETRY('LINESTRING(1100 1100, 2500 2100, 3100 3100, 4900 1100, 3100 1900)'),
  500);

+----------------------------------------------------------------------------------------------------+
| ST_SIMPLIFY(TO_GEOMETRY('LINESTRING(1100 1100, 2500 2100, 3100 3100, 4900 1100, 3100 1900)'), 500) |
|----------------------------------------------------------------------------------------------------|
| LINESTRING(1100 1100,3100 3100,4900 1100,3100 1900)                                                |
+----------------------------------------------------------------------------------------------------+
```

The following example simplifies an ellipse that has 36 initial vertices to a shape with 16 or 10 vertices, depending on the
`tolerance` argument:

```sqlexample
SELECT ST_NUMPOINTS(geom) AS numpoints_before,
  ST_NUMPOINTS(ST_Simplify(geom, 0.5)) AS numpoints_simplified_05,
  ST_NUMPOINTS(ST_Simplify(geom, 1)) AS numpoints_simplified_1
  FROM
  (SELECT ST_BUFFER(to_geometry('LINESTRING(0 0, 1 1)'), 10) As geom);

+------------------+-------------------------+------------------------+
| NUMPOINTS_BEFORE | NUMPOINTS_SIMPLIFIED_05 | NUMPOINTS_SIMPLIFIED_1 |
|------------------+-------------------------+------------------------|
|               36 |                      16 |                     10 |
+------------------+-------------------------+------------------------+
```

---
title: ST_SRID
source: https://docs.snowflake.com/en/sql-reference/functions/st_srid.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_SRID

Returns the SRID (spatial reference system identifier) of a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

Currently, for any value of the GEOGRAPHY type, only SRID 4326 is supported and is returned.

## Syntax

```sqlsyntax
ST_SRID( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a value of type NUMBER(4,0).

## Usage notes

* Returns NULL if the input is NULL.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_SRID function:

> ```sqlexample
> SELECT ST_SRID(ST_MAKEPOINT(37.5, 45.5));
> +-----------------------------------+
> | ST_SRID(ST_MAKEPOINT(37.5, 45.5)) |
> |-----------------------------------|
> |                              4326 |
> +-----------------------------------+
> ```

This shows use of the ST_SRID function with NULL values:

> ```sqlexample
> SELECT ST_SRID(ST_MAKEPOINT(NULL, NULL)), ST_SRID(NULL);
> +-----------------------------------+---------------+
> | ST_SRID(ST_MAKEPOINT(NULL, NULL)) | ST_SRID(NULL) |
> |-----------------------------------+---------------|
> |                              NULL |          NULL |
> +-----------------------------------+---------------+
> ```

---
title: ST_STARTPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/st_startpoint.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_STARTPOINT

Returns the first Point in a LineString.

See also:
:   [ST_ENDPOINT](st_endpoint.md) , [ST_POINTN](st_pointn.md)

## Syntax

```sqlsyntax
ST_STARTPOINT( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY that represents a LineString.

## Returns

The function returns a value of type GEOGRAPHY or GEOMETRY that contains the first Point of the specified LineString.

## Usage notes

* If `geography_or_geometry_expression` is not a LineString, the function reports an error.

## Examples

### GEOGRAPHY examples

The following query returns the first Point in a LineString:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKT';
SELECT ST_STARTPOINT(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)'));

+---------------------------------------------------------------+
| ST_STARTPOINT(TO_GEOGRAPHY('LINESTRING(1 1, 2 2, 3 3, 4 4)')) |
|---------------------------------------------------------------|
| POINT(1 1)                                                    |
+---------------------------------------------------------------+
```

### GEOMETRY examples

The following query returns the first Point in a LineString:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='WKT';
SELECT ST_STARTPOINT(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)'));

+--------------------------------------------------------------+
| ST_STARTPOINT(TO_GEOMETRY('LINESTRING(1 1, 2 2, 3 3, 4 4)')) |
|--------------------------------------------------------------|
| POINT(1 1)                                                   |
+--------------------------------------------------------------+
```

---
title: ST_SYMDIFFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/st_symdifference.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_SYMDIFFERENCE

Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the set of points from both input objects that are
not part of the intersection of the objects (i.e. the
[symmetric difference](https://en.wikipedia.org/wiki/Symmetric_difference) of the two objects).

See also:
:   [ST_INTERSECTION](st_intersection.md) , [ST_UNION](st_union.md) , [ST_DIFFERENCE](st_difference.md)

## Syntax

```sqlsyntax
ST_SYMDIFFERENCE( <geography_expression_1> , <geography_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

## Returns

The function returns a value of type GEOGRAPHY.

If `geography_expression_1` and `geography_expression_2` are equal (i.e. the symmetric difference is an empty set
of points), the function returns NULL.

## Usage notes

* If any vertex of one input object is on the boundary of the other input object (excluding the vertices), the output might not be
  accurate.
* The function is not guaranteed to produce normalized and/or minimal results. For example, an output could consist of a
  LineString containing several Points that actually forms just one straight segment.

## Examples

The following example returns a GEOGRAPHY object that represents the symmetric difference between two input GEOGRAPHY objects:

> ```sqlexample
> ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
>
> SELECT ST_SYMDIFFERENCE(
>   TO_GEOGRAPHY('POLYGON((0 0, 1 0, 2 1, 1 2, 2 3, 1 4, 0 4, 0 0))'),
>   TO_GEOGRAPHY('POLYGON((3 0, 3 4, 2 4, 1 3, 2 2, 1 1, 2 0, 3 0))')
> ) AS symmetric_difference_between_objects;
> ```

This example produces the following output:

> ```none
> +-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | SYMMETRIC_DIFFERENCE_BETWEEN_OBJECTS                                                                                                                                                                                    |
> |-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> | MULTIPOLYGON(((1 1,1.5 1.500171359,1 2,1.5 2.500285599,1 3,1.5 3.500399839,1 4,0 4,0 0,1 0,1.5 0.5000571198,1 1)),((3 0,3 4,2 4,1.5 3.500399839,2 3,1.5 2.500285599,2 2,1.5 1.500171359,2 1,1.5 0.5000571198,2 0,3 0))) |
> +-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> ```

The following images illustrate the differences in the areas that represent the input and output objects:

| Input | Output |
| --- | --- |
|  |  |

---
title: ST_TRANSFORM
source: https://docs.snowflake.com/en/sql-reference/functions/st_transform.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_TRANSFORM

Converts a [GEOMETRY](../data-types-geospatial.md) object from one
[spatial reference system (SRS)](https://en.wikipedia.org/wiki/Spatial_reference_system) to another.

Use this function to
[change the SRID and the coordinates of the object to match the new SRS (spatial reference system)](../data-types-geospatial.md).
If you just need to change the SRID without changing the coordinates (e.g. if the SRID was incorrect), use [ST_SETSRID](st_setsrid.md)
instead.

## Syntax

```sqlsyntax
ST_TRANSFORM( <geometry_expression> [ , <from_srid> ] , <to_srid> );
```

## Arguments

**Required:**

`geometry_expression`
:   The argument must be of type GEOMETRY.

`to_srid`
:   The [spatial reference system identifier (SRID)](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) that identifies the SRS to use. The function transforms the input GEOMETRY
    object to a new object that uses this SRS.

**Optional:**

`from_srid`
:   The SRID identifying the current SRS of the input GEOMETRY object.

    If this argument is omitted, the function uses the SRID specified in the input GEOMETRY object.

## Returns

The function returns a [GEOMETRY](../data-types-geospatial.md) object that uses the SRS identified by `to_srid`.

## Usage notes

* SRIDs are based on the [EPSG standard](https://epsg.org/home.html) (v10.082). For example, the SRID 4326 corresponds to the authority EPSG with the code
  4326.
* Make sure that either the input GEOMETRY has the correct SRID set or that you specify the `from_srid` argument.
* Currently, the function does not support datum grid files. All transformations are performed using the static parameters of the
  datum without any grid file correction.
* If `geometry_expression`, `from_srid`, or `to_srid` are NULL, this function returns NULL.
* If `from_srid` or `to_srid` cannot be resolved to a valid SRID, an error occurs.

## Examples

The following example transforms a POINT GEOMETRY object from EPSG:32633 (WGS 84 / UTM zone 33N) to EPSG:3857 (Web Mercator).

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT
  ST_TRANSFORM(
    ST_GEOMFROMWKT('POINT(389866.35 5819003.03)', 32633),
    3857
  ) AS transformed_geom;
```

```output
+---------------------------------------------------------------+
| transformed_geom                                              |
|---------------------------------------------------------------|
| SRID=3857;POINT(1489140.093765644 6892872.198680112)          |
+---------------------------------------------------------------+
```

If the source SRID is not set correctly in the GEOMETRY object, you can specify the SRID in the `to_srid` argument of the
function. For example, to transform a POINT GEOMETRY object from EPSG:4326 (WGS84) to EPSG:28992 (Amersfoort / RD New):

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT
  ST_TRANSFORM(
    ST_GEOMFROMWKT('POINT(4.500212 52.161170)'),
    4326,
    28992
  ) AS transformed_geom;
```

```output
+---------------------------------------------------------------+
| transformed_geom                                              |
|---------------------------------------------------------------|
| SRID=28992;POINT (94308.66600006013 464038.16881095537)       |
+---------------------------------------------------------------+
```

---
title: ST_UNION
source: https://docs.snowflake.com/en/sql-reference/functions/st_union.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_UNION

Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the combined set of shapes for both objects (i.e.
the union of the two shapes).

See also:
:   [ST_UNION_AGG](st_union_agg.md) , [ST_INTERSECTION](st_intersection.md) , [ST_DIFFERENCE](st_difference.md) , [ST_SYMDIFFERENCE](st_symdifference.md)

## Syntax

```sqlsyntax
ST_UNION( <geography_expression_1> , <geography_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object.

`geography_expression_2`
:   A GEOGRAPHY object.

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* If any vertex of one input object is on the boundary of the other input object (excluding the vertices), some points in the
  union might be represented more than once in the output,

  For example, in the following statement:

  > ```sqlexample
  > SELECT ST_UNION(
  >   TO_GEOGRAPHY('POINT(1 1)'),
  >   TO_GEOGRAPHY('LINESTRING(1 0, 1 2)')
  > );
  > ```

  `POINT(1 1)` is on the boundary of `LINESTRING(1 0, 1 2)` but is not a vertex of it.

  In this example, ST_UNION is not guaranteed to produce minimal output. The expected output should be the input linestring:

  > ```none
  > LINESTRING(1 0, 1 2)
  > ```

  But the actual output might be:

  > ```none
  > GEOMETRYCOLLECTION(POINT(1 1),LINESTRING(1 0,1 1,1 2))
  > ```

  where `POINT (1,1)` is represented twice in the output: once as the point itself and once as a point within the LineString.

## Examples

The following example returns a GEOGRAPHY object that represents the union of two input GEOGRAPHY objects:

> ```sqlexample
> ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
>
> SELECT ST_UNION(
>   TO_GEOGRAPHY('POLYGON((0 0, 1 0, 2 1, 1 2, 2 3, 1 4, 0 4, 0 0))'),
>   TO_GEOGRAPHY('POLYGON((3 0, 3 4, 2 4, 1 3, 2 2, 1 1, 2 0, 3 0))')
> ) AS union_of_objects;
> ```

This example produces the following output:

> ```none
> +-------------------------------------------------------------------------------------------------------------------------------------------+
> | UNION_OF_OBJECTS                                                                                                                          |
> |-------------------------------------------------------------------------------------------------------------------------------------------|
> | POLYGON((3 0,3 4,2 4,1.5 3.500399839,1 4,0 4,0 0,1 0,1.5 0.5000571198,2 0,3 0),(1.5 1.500171359,1 2,1.5 2.500285599,2 2,1.5 1.500171359)) |
> +-------------------------------------------------------------------------------------------------------------------------------------------+
> ```

The following images illustrate the differences in the areas that represent the input and output objects:

| Input | Output |
| --- | --- |
|  |  |

---
title: ST_UNION_AGG
source: https://docs.snowflake.com/en/sql-reference/functions/st_union_agg.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_UNION_AGG

Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the combined set of points that are in
at least one of the shapes represented by the objects in the column (that is, the union of the shapes).

See also:
:   [ST_UNION](st_union.md) , [ST_INTERSECTION_AGG](st_intersection_agg.md)

## Syntax

```sqlsyntax
ST_UNION_AGG( <geography_column> )
```

## Arguments

`geography_column`
:   A GEOGRAPHY column.

## Returns

The function returns a value of type GEOGRAPHY.

## Examples

Create a table with a GEOMETRY column and insert data:

```sqlexample
CREATE OR REPLACE TABLE st_union_agg_demo_table (g GEOGRAPHY);

INSERT INTO st_union_agg_demo_table VALUES
  ('POINT(1 1)'),
  ('POINT(0 1)'),
  ('LINESTRING(0 0, 0 1)'),
  ('LINESTRING(0 0, 0 2)'),
  ('POLYGON((10 10, 11 11, 11 10, 10 10))'),
  ('POLYGON((10 10, 11 11, 11 10, 10 10))');
```

Use the ST_UNION_AGG function to return a GEOGRAPHY object that represents the combined set of points that are in
at least one of the shapes represented by the objects in the GEOGRAPHY column:

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';

SELECT ST_UNION_AGG(g) AS union_of_shapes
  FROM st_union_agg_demo_table;
```

```output
+-------------------------------------------------------------------------------------------+
| UNION_OF_SHAPES                                                                           |
|-------------------------------------------------------------------------------------------|
| GEOMETRYCOLLECTION(POINT(1 1),LINESTRING(0 0,0 1,0 2),POLYGON((11 10,11 11,10 10,11 10))) |
+-------------------------------------------------------------------------------------------+
```

---
title: ST_WITHIN
source: https://docs.snowflake.com/en/sql-reference/functions/st_within.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_WITHIN

Returns true if the first geospatial object is fully contained by the second geospatial object. In other words:

* The first [GEOGRAPHY](../data-types-geospatial.md) object `g1` is fully contained by the second GEOGRAPHY object
  `g2`.
* The first [GEOMETRY](../data-types-geospatial.md) object `g1` is fully contained by the second GEOMETRY object
  `g2`.

Calling `ST_WITHIN(g1, g2)` is equivalent to calling `ST_CONTAINS(g2, g1)`.

Although ST_COVEREDBY and ST_WITHIN might seem similar, the two functions have subtle differences. For details on the differences
between “covered by” and “within”, see the
[Dimensionally Extended 9-Intersection Model (DE-9IM)](https://en.wikipedia.org/wiki/DE-9IM).

> **Note:**
>
> This function does not support using a GeometryCollection or FeatureCollection as input values.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

See also:
:   [ST_CONTAINS](st_contains.md) , [ST_COVEREDBY](st_coveredby.md)

## Syntax

```sqlsyntax
ST_WITHIN( <geography_expression_1> , <geography_expression_2> )

ST_WITHIN( <geometry_expression_1> , <geometry_expression_2> )
```

## Arguments

`geography_expression_1`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geography_expression_2`
:   A GEOGRAPHY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_1`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

`geometry_expression_2`
:   A GEOMETRY object that is not a GeometryCollection or FeatureCollection.

## Returns

BOOLEAN.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_WITHIN function:

> ```sqlexample
> create table geospatial_table_01 (g1 GEOGRAPHY, g2 GEOGRAPHY);
> insert into geospatial_table_01 (g1, g2) values
>     ('POLYGON((0 0, 3 0, 3 3, 0 3, 0 0))', 'POLYGON((1 1, 2 1, 2 2, 1 2, 1 1))');
> ```
>
> ```sqlexample
> SELECT ST_WITHIN(g1, g2)
>     FROM geospatial_table_01;
> +-------------------+
> | ST_WITHIN(G1, G2) |
> |-------------------|
> | False             |
> +-------------------+
> ```

---
title: ST_X
source: https://docs.snowflake.com/en/sql-reference/functions/st_x.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_X

Returns the longitude (X coordinate) of a Point represented by a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_X( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of the type GEOGRAPHY or GEOMETRY and must contain a Point.

## Returns

Returns a REAL value.

## Usage notes

* Issues an error if the argument is not a Point.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_X and ST_Y functions with VARCHAR data:

> ```sqlexample
> SELECT ST_X(ST_MAKEPOINT(37.5, 45.5)), ST_Y(ST_MAKEPOINT(37.5, 45.5));
> +--------------------------------+--------------------------------+
> | ST_X(ST_MAKEPOINT(37.5, 45.5)) | ST_Y(ST_MAKEPOINT(37.5, 45.5)) |
> |--------------------------------+--------------------------------|
> |                           37.5 |                           45.5 |
> +--------------------------------+--------------------------------+
> ```

This shows use of the ST_X and ST_Y functions with NULL values:

> ```sqlexample
> SELECT
>     ST_X(ST_MAKEPOINT(NULL, NULL)), ST_X(NULL),
>     ST_Y(ST_MAKEPOINT(NULL, NULL)), ST_Y(NULL)
>     ;
> +--------------------------------+------------+--------------------------------+------------+
> | ST_X(ST_MAKEPOINT(NULL, NULL)) | ST_X(NULL) | ST_Y(ST_MAKEPOINT(NULL, NULL)) | ST_Y(NULL) |
> |--------------------------------+------------+--------------------------------+------------|
> |                           NULL |       NULL |                           NULL |       NULL |
> +--------------------------------+------------+--------------------------------+------------+
> ```

---
title: ST_XMAX
source: https://docs.snowflake.com/en/sql-reference/functions/st_xmax.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_XMAX

Returns the maximum longitude (X coordinate) of all points contained in the specified
[GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_XMAX( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value.

## Usage notes

* If the geospatial object is on the [antimeridian](https://en.wikipedia.org/wiki/180th_meridian) or crosses it, the function
  returns 180.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_XMIN, ST_XMAX, ST_YMIN, and ST_YMAX functions:

> ```sqlexample
> CREATE or replace TABLE extreme_point_collection (id INTEGER, g GEOGRAPHY);
> INSERT INTO extreme_point_collection (id, g)
>     SELECT column1, TO_GEOGRAPHY(column2) FROM VALUES
>         (1, 'POINT(-180 0)'),
>         (2, 'POINT(180 0)'),
>         (3, 'LINESTRING(-179 0, 179 0)'),
>         (4, 'LINESTRING(-60 30, 60 30)'),
>         (5, 'LINESTRING(-60 -30, 60 -30)');
> ```
>
> ```sqlexample
> SELECT
>     g,
>     ST_XMIN(g),
>     ST_XMAX(g),
>     ST_YMIN(g),
>     ST_YMAX(g)
>   FROM extreme_point_collection
>   ORDER BY id;
> +----------------------------+------------+------------+-------------------+-------------------+
> | G                          | ST_XMIN(G) | ST_XMAX(G) |        ST_YMIN(G) |        ST_YMAX(G) |
> |----------------------------+------------+------------+-------------------+-------------------|
> | POINT(-180 0)              |       -180 |        180 |   0               |   0               |
> | POINT(180 0)               |       -180 |        180 |   0               |   0               |
> | LINESTRING(-179 0,179 0)   |       -180 |        180 |  -6.883275617e-14 |   6.883275617e-14 |
> | LINESTRING(-60 30,60 30)   |        -60 |         60 |  30               |  49.106605351     |
> | LINESTRING(-60 -30,60 -30) |        -60 |         60 | -49.106605351     | -30               |
> +----------------------------+------------+------------+-------------------+-------------------+
> ```

---
title: ST_XMIN
source: https://docs.snowflake.com/en/sql-reference/functions/st_xmin.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_XMIN

Returns the minimum longitude (X coordinate) of all points contained in the specified
[GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_XMIN( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value.

## Usage notes

* If the geospatial object is on the [antimeridian](https://en.wikipedia.org/wiki/180th_meridian) or crosses it, the function
  returns -180.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_XMIN, ST_XMAX, ST_YMIN, and ST_YMAX functions:

> ```sqlexample
> CREATE or replace TABLE extreme_point_collection (id INTEGER, g GEOGRAPHY);
> INSERT INTO extreme_point_collection (id, g)
>     SELECT column1, TO_GEOGRAPHY(column2) FROM VALUES
>         (1, 'POINT(-180 0)'),
>         (2, 'POINT(180 0)'),
>         (3, 'LINESTRING(-179 0, 179 0)'),
>         (4, 'LINESTRING(-60 30, 60 30)'),
>         (5, 'LINESTRING(-60 -30, 60 -30)');
> ```
>
> ```sqlexample
> SELECT
>     g,
>     ST_XMIN(g),
>     ST_XMAX(g),
>     ST_YMIN(g),
>     ST_YMAX(g)
>   FROM extreme_point_collection
>   ORDER BY id;
> +----------------------------+------------+------------+-------------------+-------------------+
> | G                          | ST_XMIN(G) | ST_XMAX(G) |        ST_YMIN(G) |        ST_YMAX(G) |
> |----------------------------+------------+------------+-------------------+-------------------|
> | POINT(-180 0)              |       -180 |        180 |   0               |   0               |
> | POINT(180 0)               |       -180 |        180 |   0               |   0               |
> | LINESTRING(-179 0,179 0)   |       -180 |        180 |  -6.883275617e-14 |   6.883275617e-14 |
> | LINESTRING(-60 30,60 30)   |        -60 |         60 |  30               |  49.106605351     |
> | LINESTRING(-60 -30,60 -30) |        -60 |         60 | -49.106605351     | -30               |
> +----------------------------+------------+------------+-------------------+-------------------+
> ```

---
title: ST_Y
source: https://docs.snowflake.com/en/sql-reference/functions/st_y.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_Y

Returns the latitude (Y coordinate) of a Point represented by a [GEOGRAPHY](../data-types-geospatial.md) or
[GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_Y( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of the type GEOGRAPHY or GEOMETRY and must contain a Point.

## Returns

Returns a REAL value.

## Usage notes

* Issues an error if the argument is not a Point.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_X and ST_Y functions with VARCHAR data:

> ```sqlexample
> SELECT ST_X(ST_MAKEPOINT(37.5, 45.5)), ST_Y(ST_MAKEPOINT(37.5, 45.5));
> +--------------------------------+--------------------------------+
> | ST_X(ST_MAKEPOINT(37.5, 45.5)) | ST_Y(ST_MAKEPOINT(37.5, 45.5)) |
> |--------------------------------+--------------------------------|
> |                           37.5 |                           45.5 |
> +--------------------------------+--------------------------------+
> ```

This shows use of the ST_X and ST_Y functions with NULL values:

> ```sqlexample
> SELECT
>     ST_X(ST_MAKEPOINT(NULL, NULL)), ST_X(NULL),
>     ST_Y(ST_MAKEPOINT(NULL, NULL)), ST_Y(NULL)
>     ;
> +--------------------------------+------------+--------------------------------+------------+
> | ST_X(ST_MAKEPOINT(NULL, NULL)) | ST_X(NULL) | ST_Y(ST_MAKEPOINT(NULL, NULL)) | ST_Y(NULL) |
> |--------------------------------+------------+--------------------------------+------------|
> |                           NULL |       NULL |                           NULL |       NULL |
> +--------------------------------+------------+--------------------------------+------------+
> ```

---
title: ST_YMAX
source: https://docs.snowflake.com/en/sql-reference/functions/st_ymax.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_YMAX

Returns the maximum latitude (Y coordinate) of all points contained in the specified
[GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_YMAX( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value.

## Usage notes

* The function takes into account the curvature of the edges toward the poles.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_XMIN, ST_XMAX, ST_YMIN, and ST_YMAX functions:

> ```sqlexample
> CREATE or replace TABLE extreme_point_collection (id INTEGER, g GEOGRAPHY);
> INSERT INTO extreme_point_collection (id, g)
>     SELECT column1, TO_GEOGRAPHY(column2) FROM VALUES
>         (1, 'POINT(-180 0)'),
>         (2, 'POINT(180 0)'),
>         (3, 'LINESTRING(-179 0, 179 0)'),
>         (4, 'LINESTRING(-60 30, 60 30)'),
>         (5, 'LINESTRING(-60 -30, 60 -30)');
> ```
>
> ```sqlexample
> SELECT
>     g,
>     ST_XMIN(g),
>     ST_XMAX(g),
>     ST_YMIN(g),
>     ST_YMAX(g)
>   FROM extreme_point_collection
>   ORDER BY id;
> +----------------------------+------------+------------+-------------------+-------------------+
> | G                          | ST_XMIN(G) | ST_XMAX(G) |        ST_YMIN(G) |        ST_YMAX(G) |
> |----------------------------+------------+------------+-------------------+-------------------|
> | POINT(-180 0)              |       -180 |        180 |   0               |   0               |
> | POINT(180 0)               |       -180 |        180 |   0               |   0               |
> | LINESTRING(-179 0,179 0)   |       -180 |        180 |  -6.883275617e-14 |   6.883275617e-14 |
> | LINESTRING(-60 30,60 30)   |        -60 |         60 |  30               |  49.106605351     |
> | LINESTRING(-60 -30,60 -30) |        -60 |         60 | -49.106605351     | -30               |
> +----------------------------+------------+------------+-------------------+-------------------+
> ```

---
title: ST_YMIN
source: https://docs.snowflake.com/en/sql-reference/functions/st_ymin.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md)

# ST_YMIN

Returns the minimum latitude (Y coordinate) of all points contained in the specified
[GEOGRAPHY](../data-types-geospatial.md) or [GEOMETRY](../data-types-geospatial.md) object.

## Syntax

```sqlsyntax
ST_YMIN( <geography_or_geometry_expression> )
```

## Arguments

`geography_or_geometry_expression`
:   The argument must be an expression of type GEOGRAPHY or GEOMETRY.

## Returns

Returns a REAL value.

## Usage notes

* The function takes into account the curvature of the edges toward the poles.

## Examples

### GEOGRAPHY examples

This shows a simple use of the ST_XMIN, ST_XMAX, ST_YMIN, and ST_YMAX functions:

> ```sqlexample
> CREATE or replace TABLE extreme_point_collection (id INTEGER, g GEOGRAPHY);
> INSERT INTO extreme_point_collection (id, g)
>     SELECT column1, TO_GEOGRAPHY(column2) FROM VALUES
>         (1, 'POINT(-180 0)'),
>         (2, 'POINT(180 0)'),
>         (3, 'LINESTRING(-179 0, 179 0)'),
>         (4, 'LINESTRING(-60 30, 60 30)'),
>         (5, 'LINESTRING(-60 -30, 60 -30)');
> ```
>
> ```sqlexample
> SELECT
>     g,
>     ST_XMIN(g),
>     ST_XMAX(g),
>     ST_YMIN(g),
>     ST_YMAX(g)
>   FROM extreme_point_collection
>   ORDER BY id;
> +----------------------------+------------+------------+-------------------+-------------------+
> | G                          | ST_XMIN(G) | ST_XMAX(G) |        ST_YMIN(G) |        ST_YMAX(G) |
> |----------------------------+------------+------------+-------------------+-------------------|
> | POINT(-180 0)              |       -180 |        180 |   0               |   0               |
> | POINT(180 0)               |       -180 |        180 |   0               |   0               |
> | LINESTRING(-179 0,179 0)   |       -180 |        180 |  -6.883275617e-14 |   6.883275617e-14 |
> | LINESTRING(-60 30,60 30)   |        -60 |         60 |  30               |  49.106605351     |
> | LINESTRING(-60 -30,60 -30) |        -60 |         60 | -49.106605351     | -30               |
> +----------------------------+------------+------------+-------------------+-------------------+
> ```

---
title: STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/stage_directory_file_registration_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY

This table function can be used to query information about the metadata history for a directory table, including:

* Files added or removed automatically as part of a metadata refresh.
* Any errors found when refreshing the metadata.

## Syntax

```sqlsyntax
STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY (
      STAGE_NAME => '<string>'
      [, START_TIME => <constant_expr> ] )
```

## Arguments

**Required:**

`STAGE_NAME => 'string'`
:   A string specifying the name of a stage that has a directory table.

**Optional:**

`START_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 14 days, marking the start of the time range for retrieving metadata update events.

    > **Note:**
    >
    > * If no start time is specified, the function returns all update events within the last 14 days.
    > * If the start time falls outside the last 14 days, the function returns empty results.

## Usage notes

* Returns results for the stage owner (i.e. the role with the OWNERSHIP privilege on the stage), or a higher role,
  or a role that has the USAGE privilege on the database and schema that contain a stage with a directory
  table and any privilege on the stage.
* The table function cannot retrieve metadata about staged data files until the directory table is refreshed
  (i.e. synched) to include the data files in its metadata.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in
  use or the function name must be fully-qualified. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| JOB_CREATED_TIME | TIMESTAMP_LTZ | Timestamp when the operation occurred. |
| FILE_NAME | TEXT | Name of the staged source file and relative path to the file. |
| OPERATION_STATUS | TEXT | Status: REGISTERED_NEW, REGISTERED_UPDATE, REGISTER_SKIPPED, REGISTER_FAILED, UNREGISTERED, or UNREGISTER_FAILED. |
| MESSAGE | TEXT | Message accompanying the operation status. |
| FILE_SIZE | NUMBER | Size of the file (in bytes) added to the directory table. |
| LAST_MODIFIED | TIMESTAMP_LTZ | Timestamp when the file was last updated in the stage. |

## Examples

Retrieve the metadata stored for all data files referenced by the `mystage` stage:

> ```sqlexample
> SELECT *
>   FROM TABLE(information_schema.stage_directory_file_registration_history(
>   STAGE_NAME=>'MYSTAGE'));
> ```

Retrieve the registration events for the directory table on the `mydb.public.mystage` stage that started within the last hour:

> ```sqlexample
> SELECT *
>   FROM TABLE(information_schema.stage_directory_file_registration_history(
>     START_TIME=>DATEADD('hour',-1,current_timestamp()),
>     STAGE_NAME=>'mydb.public.mystage'));
> ```

---
title: STAGE_STORAGE_USAGE_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/stage_storage_usage_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# STAGE_STORAGE_USAGE_HISTORY

This table function can be used to query the average daily data storage usage, in bytes, for all the Snowflake stages in your account within a specified date range. The output will include storage for:

* Named internal stages.
* Default staging areas (for tables and users).

> **Note:**
>
> This function returns stage storage usage within the last 6 months.

See also:
:   [DATABASE_STORAGE_USAGE_HISTORY](database_storage_usage_history.md) , [WAREHOUSE_METERING_HISTORY](warehouse_metering_history.md)

## Syntax

```sqlsyntax
STAGE_STORAGE_USAGE_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date range, within the last 6 months, for which to retrieve stage storage usage:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then `DATE_RANGE_END` is used as the start of the range (i.e. the default is one day of storage usage).

    If the range falls outside the last 6 months, an error is returned.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this storage usage record |
| AVERAGE_STAGE_BYTES | NUMBER | Number of bytes of stage storage used |

## Examples

Retrieve average daily storage usage for the past 10 days for all internal stages in your account:

```sqlexample
select *
from table(information_schema.stage_storage_usage_history(dateadd('days',-10,current_date()),current_date()));
```

---
title: STARTSWITH
source: https://docs.snowflake.com/en/sql-reference/functions/startswith.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# STARTSWITH

Returns true if `expr1` starts with `expr2`. Both expressions must be text or binary expressions.

> **Tip:**
>
> You can use the search optimization service to improve the performance of queries that call this function.
> For details, see [Search optimization service](../../user-guide/search-optimization-service.md).

## Syntax

```sqlsyntax
STARTSWITH( <expr1> , <expr2> )
```

## Returns

Returns a BOOLEAN. The value is TRUE if `expr1` starts with `expr2`. Returns NULL if either
input expression is NULL. Otherwise, returns FALSE.

## Collation details

The [collation specifications](../collation.md) of all input arguments must be compatible.

This function does not support the following collation specifications:

* `pi` (punctuation-insensitive).
* `cs-ai` (case-sensitive, accent-insensitive).

## Examples

```sqlexample
select * from strings;

---------+
    S    |
---------+
 coffee  |
 ice tea |
 latte   |
 tea     |
 [NULL]  |
---------+

select * from strings where startswith(s, 'te');

-----+
  S  |
-----+
 tea |
-----+
```

---
title: STDDEV (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_stddev.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# STDDEV (system data metric function)

Returns the standard deviation value for the specified column in a table.

The STDDEV system data metric function is optimized to calculate the standard deviation for a single column and provides greater
performance when compared to calling the [STDDEV](stddev.md) function.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.STDDEV(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* FLOAT
* NUMBER

## Returns

The function returns a NUMBER value.

## Example

Measure the standard deviation value for the `salary` column in a table:

```sqlexample
SELECT SNOWFLAKE.CORE.STDDEV(
  SELECT
    salary
  FROM hr.tables.empl_info
);
```

```output
+------------------------------+
|       SNOWFLAKE.CORE.STDDEV( |
|                       SELECT |
|                       SALARY |
|     FROM HR.TABLES.EMPL_INFO |
|                            ) |
|------------------------------|
|               8407.615595399 |
+------------------------------+
```

---
title: STDDEV, STDDEV_SAMP
source: https://docs.snowflake.com/en/sql-reference/functions/stddev.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# STDDEV, STDDEV_SAMP

Returns the sample standard deviation (square root of sample variance) of non-NULL values. STDDEV and STDDEV_SAMP are aliases
for the same function.

See also [STDDEV_POP](stddev_pop.md), which returns the population standard deviation (square root of variance).

## Syntax

**Aggregate function**

```sqlsyntax
{ STDDEV | STDDEV_SAMP } ( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
{ STDDEV | STDDEV_SAMP } ( [ DISTINCT ] <expr1> ) OVER (
                                                       [ PARTITION BY <expr2> ]
                                                       [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                                       )
```

For details about `window_frame` syntax, see [Usage notes for window frames](../functions-window-syntax.md).

## Arguments

`expr1`
:   An expression that evaluates to a numeric value. This is the expression on which the standard deviation is calculated.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition.

## Returns

The data type of the returned value is DOUBLE.

If all records inside a group are NULL, this function returns NULL.

## Usage notes

* For single-record inputs, STDDEV and STDDEV_SAMP both return NULL. This is different from the Oracle behavior, where STDDEV_SAMP returns NULL for a single record and STDDEV returns 0.
* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast cannot be performed, an error is returned.
* When this function is called as a window function and the OVER clause contains an ORDER BY clause:

  + The DISTINCT keyword is prohibited and results in a SQL compilation error.
  + A window frame must be specified. If you do not specify a window frame, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

  For more details about window frames, including syntax and examples, see [Usage notes for window frames](../functions-window-syntax.md).

## Aggregate function examples

The following example calculates the standard deviation for a small sample of integers:

```sqlexample
CREATE TABLE t1 (c1 INTEGER);
INSERT INTO t1 (c1) VALUES
  (6),
  (10),
  (14);
SELECT STDDEV(c1) FROM t1;
```

```output
+----------+
| STDDEV() |
|----------|
|        4 |
+----------+
```

Note that the function STDDEV_SAMP returns the same result:

```sqlexample
SELECT STDDEV_SAMP(c1) FROM t1;
```

```output
+-----------------+
| STDDEV_SAMP(C1) |
|-----------------|
|               4 |
+-----------------+
```

The following example uses a small table named `menu_items`, which lists items for sale from a food
truck. If you would like to create and load this table, see Create and load the menu_items table.

To find the sample standard deviation for both the cost of goods sold (COGS) and the sale price for the
`Dessert` rows, run this query:

```sqlexample
SELECT menu_category, STDDEV(menu_cogs_usd) AS stddev_cogs, STDDEV(menu_price_usd) AS stddev_price
  FROM menu_items
  WHERE menu_category = 'Dessert'
  GROUP BY 1;
```

```output
+---------------+-------------+--------------+
| MENU_CATEGORY | STDDEV_COGS | STDDEV_PRICE |
|---------------+-------------+--------------|
| Dessert       |  1.00519484 |  1.471960144 |
+---------------+-------------+--------------+
```

## Window function example

The following example also uses the `menu_items` table (see Create and load the menu_items table)
but calls the STDDEV function as a window function.

The window function partitions rows by the `menu_category` column. Therefore, the standard deviation is
calculated once for each category, and that value is repeated in the result for each row in the group.
In this example, the rows must be grouped by both the menu category and the cost of goods sold.

```sqlexample
SELECT menu_category, menu_cogs_usd,
  STDDEV(menu_cogs_usd) OVER(PARTITION BY menu_category) AS stddev_cogs
  FROM menu_items
  GROUP BY 1, 2
  ORDER BY menu_category;
```

The following output is a partial result set for this query (the first 15 rows):

```output
+---------------+---------------+--------------+
| MENU_CATEGORY | MENU_COGS_USD |  STDDEV_COGS |
|---------------+---------------+--------------|
| Beverage      |          0.50 | 0.1258305738 |
| Beverage      |          0.65 | 0.1258305738 |
| Beverage      |          0.75 | 0.1258305738 |
| Dessert       |          1.25 | 1.054751155  |
| Dessert       |          3.00 | 1.054751155  |
| Dessert       |          1.00 | 1.054751155  |
| Dessert       |          2.50 | 1.054751155  |
| Dessert       |          0.50 | 1.054751155  |
| Main          |          4.50 | 3.444051572  |
| Main          |          2.40 | 3.444051572  |
| Main          |          1.50 | 3.444051572  |
| Main          |         11.00 | 3.444051572  |
| Main          |          8.00 | 3.444051572  |
| Main          |          NULL | 3.444051572  |
| Main          |         12.00 | 3.444051572  |
...
```

## Create and load the menu_items table

To create and insert rows into the `menu_items` table that is used in some function examples,
run the following SQL commands. (This table contains 60 rows. It is based on, but not identical to,
the `menu` table in the
[Tasty Bytes sample database](https://quickstarts.snowflake.com/guide/tasty_bytes_introduction/index.html#0).)

```sqlexample
CREATE OR REPLACE TABLE menu_items(
  menu_id INT NOT NULL,
  menu_category VARCHAR(20),
  menu_item_name VARCHAR(50),
  menu_cogs_usd NUMBER(7,2),
  menu_price_usd NUMBER(7,2));
```

```sqlexample
INSERT INTO menu_items VALUES(1,'Beverage','Bottled Soda',0.500,3.00);
INSERT INTO menu_items VALUES(2,'Beverage','Bottled Water',0.500,2.00);
INSERT INTO menu_items VALUES(3,'Main','Breakfast Crepe',5.00,12.00);
INSERT INTO menu_items VALUES(4,'Main','Buffalo Mac & Cheese',6.00,10.00);
INSERT INTO menu_items VALUES(5,'Main','Chicago Dog',4.00,9.00);
INSERT INTO menu_items VALUES(6,'Main','Chicken Burrito',3.2500,12.500);
INSERT INTO menu_items VALUES(7,'Main','Chicken Pot Pie Crepe',6.00,15.00);
INSERT INTO menu_items VALUES(8,'Main','Combination Curry',9.00,15.00);
INSERT INTO menu_items VALUES(9,'Main','Combo Fried Rice',5.00,11.00);
INSERT INTO menu_items VALUES(10,'Main','Combo Lo Mein',6.00,13.00);
INSERT INTO menu_items VALUES(11,'Main','Coney Dog',5.00,10.00);
INSERT INTO menu_items VALUES(12,'Main','Creamy Chicken Ramen',8.00,17.2500);
INSERT INTO menu_items VALUES(13,'Snack','Crepe Suzette',4.00,9.00);
INSERT INTO menu_items VALUES(14,'Main','Fish Burrito',3.7500,12.500);
INSERT INTO menu_items VALUES(15,'Snack','Fried Pickles',1.2500,6.00);
INSERT INTO menu_items VALUES(16,'Snack','Greek Salad',4.00,11.00);
INSERT INTO menu_items VALUES(17,'Main','Gyro Plate',8.00,12.00);
INSERT INTO menu_items VALUES(18,'Main','Hot Ham & Cheese',7.00,11.00);
INSERT INTO menu_items VALUES(19,'Dessert','Ice Cream Sandwich',1.00,4.00);
INSERT INTO menu_items VALUES(20,'Beverage','Iced Tea',0.7500,3.00);
INSERT INTO menu_items VALUES(21,'Main','Italian',6.00,11.00);
INSERT INTO menu_items VALUES(22,'Main','Lean Beef Tibs',6.00,13.00);
INSERT INTO menu_items VALUES(23,'Main','Lean Burrito Bowl',3.500,12.500);
INSERT INTO menu_items VALUES(24,'Main','Lean Chicken Tibs',5.00,11.00);
INSERT INTO menu_items VALUES(25,'Main','Lean Chicken Tikka Masala',10.00,17.00);
INSERT INTO menu_items VALUES(26,'Beverage','Lemonade',0.6500,3.500);
INSERT INTO menu_items VALUES(27,'Main','Lobster Mac & Cheese',10.00,15.00);
INSERT INTO menu_items VALUES(28,'Dessert','Mango Sticky Rice',1.2500,5.00);
INSERT INTO menu_items VALUES(29,'Main','Miss Piggie',2.600,6.00);
INSERT INTO menu_items VALUES(30,'Main','Mothers Favorite',4.500,12.00);
INSERT INTO menu_items VALUES(31,'Main','New York Dog',4.00,8.00);
INSERT INTO menu_items VALUES(32,'Main','Pastrami',8.00,11.00);
INSERT INTO menu_items VALUES(33,'Dessert','Popsicle',0.500,3.00);
INSERT INTO menu_items VALUES(34,'Main','Pulled Pork Sandwich',7.00,12.00);
INSERT INTO menu_items VALUES(35,'Main','Rack of Pork Ribs',11.2500,21.00);
INSERT INTO menu_items VALUES(36,'Snack','Seitan Buffalo Wings',4.00,7.00);
INSERT INTO menu_items VALUES(37,'Main','Spicy Miso Vegetable Ramen',7.00,17.2500);
INSERT INTO menu_items VALUES(38,'Snack','Spring Mix Salad',2.2500,6.00);
INSERT INTO menu_items VALUES(39,'Main','Standard Mac & Cheese',3.00,8.00);
INSERT INTO menu_items VALUES(40,'Dessert','Sugar Cone',2.500,6.00);
INSERT INTO menu_items VALUES(41,'Main','Tandoori Mixed Grill',11.00,18.00);
INSERT INTO menu_items VALUES(42,'Main','The Classic',4.00,12.00);
INSERT INTO menu_items VALUES(43,'Main','The King Combo',12.00,20.00);
INSERT INTO menu_items VALUES(44,'Main','The Kitchen Sink',6.00,14.00);
INSERT INTO menu_items VALUES(45,'Main','The Original',1.500,5.00);
INSERT INTO menu_items VALUES(46,'Main','The Ranch',2.400,6.00);
INSERT INTO menu_items VALUES(47,'Main','The Salad of All Salads',6.00,12.00);
INSERT INTO menu_items VALUES(48,'Main','Three Meat Plate',10.00,17.00);
INSERT INTO menu_items VALUES(49,'Main','Three Taco Combo Plate',7.00,11.00);
INSERT INTO menu_items VALUES(50,'Main','Tonkotsu Ramen',7.00,17.2500);
INSERT INTO menu_items VALUES(51,'Main','Two Meat Plate',9.00,14.00);
INSERT INTO menu_items VALUES(52,'Dessert','Two Scoop Bowl',3.00,7.00);
INSERT INTO menu_items VALUES(53,'Main','Two Taco Combo Plate',6.00,9.00);
INSERT INTO menu_items VALUES(54,'Main','Veggie Burger',5.00,9.00);
INSERT INTO menu_items VALUES(55,'Main','Veggie Combo',4.00,9.00);
INSERT INTO menu_items VALUES(56,'Main','Veggie Taco Bowl',6.00,10.00);
INSERT INTO menu_items VALUES(57,'Dessert','Waffle Cone',2.500,6.00);
INSERT INTO menu_items VALUES(58,'Main','Wonton Soup',2.00,6.00);
INSERT INTO menu_items VALUES(59,'Main','Mini Pizza',null,null);
INSERT INTO menu_items VALUES(60,'Main','Large Pizza',null,null);
```

---
title: STDDEV_POP
source: https://docs.snowflake.com/en/sql-reference/functions/stddev_pop.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# STDDEV_POP

Returns the population standard deviation (square root of variance) of non-NULL values.

See also [STDDEV](stddev.md), which returns the sample standard deviation (square root of variance).

## Syntax

**Aggregate function**

```sqlsyntax
STDDEV_POP( [ DISTINCT ] <expr1>)
```

**Window function**

```sqlsyntax
STDDEV_POP( [ DISTINCT ] <expr1> ) OVER (
                                        [ PARTITION BY <expr2> ]
                                        [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                        )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   An expression that evaluates to a numeric value. This is the expression on which the standard deviation is calculated.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition.

## Returns

The data type of the returned value is DOUBLE.

If all records inside a group are NULL, this function returns NULL.

## Usage notes

* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.
* When this function is called as a window function and the OVER clause contains an ORDER BY clause:

  + The DISTINCT keyword is prohibited and results in a SQL compilation error.
  + A window frame must be specified. If you do not specify a window frame, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see
    [Window function syntax and usage](../functions-window-syntax.md).

## Aggregate function examples

The following example calculates the standard deviation for a small population of integers:

> ```sqlexample
> CREATE TABLE t1 (c1 INTEGER);
> INSERT INTO t1 (c1) VALUES
>     (6),
>    (10),
>    (14)
>    ;
> SELECT STDDEV_POP(c1) FROM t1;
> ```
>
> ```output
> +----------------+
> | STDDEV_POP(C1) |
> |----------------|
> |    3.265986375 |
> +----------------+
> ```

Note that the functions STDDEV and STDDEV_SAMP do not return the same result as STDDEV_POP.

The following example assumes that you have a table named `menu` that lists food items for sale in a cafe.
The following output shows the 6 rows in the table that belong to the `Dessert` category. Other rows also exist
for other categories, such as `Main` and `Beverage`.

> ```output
> +---------+--------------------+---------------+-------------------+----------------+
> | MENU_ID | MENU_ITEM_NAME     | ITEM_CATEGORY | COST_OF_GOODS_USD | SALE_PRICE_USD |
> |---------+--------------------+---------------+-------------------+----------------|
> |   10002 | Sugar Cone         | Dessert       |            2.5000 |         6.0000 |
> |   10003 | Waffle Cone        | Dessert       |            2.5000 |         6.0000 |
> |   10004 | Two Scoop Bowl     | Dessert       |            3.0000 |         7.0000 |
> |   10008 | Ice Cream Sandwich | Dessert       |            1.0000 |         4.0000 |
> |   10009 | Mango Sticky Rice  | Dessert       |            1.2500 |         5.0000 |
> |   10010 | Popsicle           | Dessert       |            0.5000 |         3.0000 |
> +---------+--------------------+---------------+-------------------+----------------+
> ```

To find the population standard deviation for the cost of goods sold and the sale price (for the `Dessert` rows
only), run this query:

> ```sqlexample
> SELECT item_category, STDDEV_POP(cost_of_goods_usd) stddev_cogs, STDDEV_POP(sale_price_usd) stddev_price
>   FROM menu
>   WHERE item_category='Dessert'
>   GROUP BY 1;
> ```
>
> ```output
> +---------------+--------------+--------------+
> | ITEM_CATEGORY |  STDDEV_COGS | STDDEV_PRICE |
> |---------------+--------------+--------------|
> | Dessert       | 0.9176131477 |  1.343709625 |
> +---------------+--------------+--------------+
> ```

## Window function example

The following example uses the same `menu` table but runs the STDDEV_POP function as a window function.

The window function partitions rows by the `item_category` column. Therefore, the standard deviation is
calculated once for each item category, and that value is repeated in the result for each row in the group.
In this example, the rows must be grouped by both the item category and the cost of goods sold.
(Note that the 6 `Dessert` rows are now grouped into 5 rows because two rows have the same cost of goods value.)

> ```sqlexample
> SELECT item_category, cost_of_goods_usd, STDDEV_POP(cost_of_goods_usd) OVER(PARTITION BY item_category) stddev_cogs
>   FROM menu
>   GROUP BY 1,2
>   ORDER BY item_category;
> ```
>
> ```output
> +---------------+-------------------+--------------+
> | ITEM_CATEGORY | COST_OF_GOODS_USD |  STDDEV_COGS |
> |---------------+-------------------+--------------|
> | Beverage      |            0.5000 | 0.1027402334 |
> | Beverage      |            0.7500 | 0.1027402334 |
> | Beverage      |            0.6500 | 0.1027402334 |
> | Dessert       |            2.5000 | 0.9433981132 |
> | Dessert       |            3.0000 | 0.9433981132 |
> | Dessert       |            1.0000 | 0.9433981132 |
> | Dessert       |            0.5000 | 0.9433981132 |
> | Dessert       |            1.2500 | 0.9433981132 |
> | Main          |            4.5000 | 3.352193642  |
> | Main          |            8.0000 | 3.352193642  |
> | Main          |            2.0000 | 3.352193642  |
> | Main          |            3.5000 | 3.352193642  |
> ...
> ```

---
title: STORAGE_LIFECYCLE_POLICY_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/storage_lifecycle_policy_history.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) (Information Schema)

# STORAGE_LIFECYCLE_POLICY_HISTORY

Returns execution history for [storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md)
in your account within the last 14 days.

Use this table function to query the most recent policy executions (completed or still running), in descending order by
execution end time. For more information about monitoring storage lifecycle policies, see
[Monitor storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-monitoring.md).

See also:
:   [CREATE STORAGE LIFECYCLE POLICY](../sql/create-storage-lifecycle-policy.md) , [ALTER STORAGE LIFECYCLE POLICY](../sql/alter-storage-lifecycle-policy.md) , [DROP STORAGE LIFECYCLE POLICY](../sql/drop-storage-lifecycle-policy.md)

## Syntax

**By object**

```sqlsyntax
STORAGE_LIFECYCLE_POLICY_HISTORY(
  REF_ENTITY_NAME => '<string>',
  REF_ENTITY_DOMAIN => '<string>'
  [, TIME_RANGE_START => <constant_expr> ]
  [, TIME_RANGE_END => <constant_expr> ]
  [, RESULT_LIMIT => <integer> ] )
```

**By storage lifecycle policy**

```sqlsyntax
STORAGE_LIFECYCLE_POLICY_HISTORY(
  POLICY_NAME => '<string>'
  [, TIME_RANGE_START => <constant_expr> ]
  [, TIME_RANGE_END => <constant_expr> ]
  [, RESULT_LIMIT => <integer> ] )
```

## Arguments

> **Note:**
>
> Specify one of the following options when you call the function:
>
> * REF_ENTITY_NAME and REF_ENTITY_DOMAIN: Retrieves the execution history for all storage lifecycle policies attached to
>   an object (table).
> * POLICY_NAME: Retrieves the execution history for a particular storage lifecycle policy specified by name.

**Required:**

`REF_ENTITY_NAME => 'string'`
:   The identifier for the object (table) that the execution occurred on; for example, the name of the
    table that the storage lifecycle policy is attached to.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive. For more information, see [Identifier requirements](../identifiers-syntax.md).

`REF_ENTITY_DOMAIN => 'string'`
:   The object type to which the storage lifecycle policy is attached:

    * `'Table'`: Specifies that the storage lifecycle policy is attached to a table.

`POLICY_NAME => 'string'`
:   The identifier of a storage lifecycle policy to retrieve execution history for.
    If you don’t specify a policy name, you must specify values for REF_ENTITY_NAME and REF_ENTITY_DOMAIN.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

**Optional:**

`TIME_RANGE_START => constant_expr`, . `TIME_RANGE_END => constant_expr`
:   Time range, within the last 14 days, in which the policy execution occurred.

    If neither parameter is specified, the function returns rows (up to the RESULT_LIMIT) for the latest policy executions in descending
    order by END_TIME.

`RESULT_LIMIT => integer`
:   The maximum number of rows returned by the function.

    Range: `1` to `1000`

    Default: `1000`.

## Returns

The function returns execution history records for storage lifecycle policies. Each record contains information about the
policy execution, including the policy name, associated table, execution state, start and end times, and execution results.

For detailed column descriptions, see Output.

## Output

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The name of the database that contains the policy. |
| POLICY_SCHEMA | VARCHAR | The name of the schema that contains the policy. |
| POLICY_NAME | VARCHAR | The name of the policy. |
| REF_ENTITY_DB | VARCHAR | The name of the database that contains the object that the policy is attached to. |
| REF_ENTITY_SCHEMA | VARCHAR | The name of the schema that contains the object that the policy is attached to. |
| REF_ENTITY_NAME | VARCHAR | The name of the object that the policy is attached to. |
| REF_ENTITY_DOMAIN | VARCHAR | The domain (type) of the object that the policy is attached to; for example, `Table`. |
| STATE | VARCHAR | The state of the policy execution: `QUEUED`, `EXECUTING`, `SUCCEEDED`, or `FAILED`. |
| START_TIME | TIMESTAMP_LTZ | Earliest timestamp of when any task in the policy execution started. |
| END_TIME | TIMESTAMP_LTZ | Latest timestamp of when any task in the policy execution completed. |
| EXECUTION_RESULT | VARIANT | JSON object that contains the results of the jobs run during the policy execution. For more information, see EXECUTION_RESULT fields. |
| POLICY_BODY | VARCHAR | The body of the storage lifecycle policy. |

### EXECUTION_RESULT fields

The `EXECUTION_RESULT` column is a JSON object that includes nested objects for each task type in the policy execution:

* `EXPIRE`: Contains results for expiration operations (permanently deleting rows).
* `ARCHIVE`: Contains results for archiving operations (moving rows to archive storage).
* `EXPIRE_ARCHIVE`: Contains results for expiration operations (permanently deleting rows from archive storage).

Each nested object can contain the following fields, where specific fields apply to specific task types:

| Field name | Description |
| --- | --- |
| START_TIME | Individual task start timestamp. |
| END_TIME | Individual task end timestamp |
| STATE | Individual task status: `SUCCEEDED` or `FAILED`. |
| ROWS_EXPIRED | (EXPIRE task only) The number of rows permanently deleted from active storage. |
| ROWS_ARCHIVED | (ARCHIVE task only) The number of rows archived to storage. |
| ROWS_EXPIRED_FROM_ARCHIVE | (EXPIRE_ARCHIVE task only) The number of rows permanently deleted from archive storage. |
| ERROR_MESSAGE_CODE | The code identifying the type of error encountered during task execution. |
| ERROR_MESSAGE | A detailed error message. |

Example `EXECUTION_RESULT` body:

```output
EXECUTION_RESULT =
{
  “EXPIRE”: {
    “start_time”: "Thu, 27 Jun 2024 02:57:57 -0700",
    “end_time”: "Thu, 27 Jun 2024 02:58:01 -0700",
    “state”: "SUCCEEDED",
    “rows_expired”: 100,
    “error_message_code”: null,
    “error_message”: null
  }
}
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY STORAGE LIFECYCLE POLICY | Global | If the role that calls the function has this privilege, Snowflake returns all policy executions related to all policies and their associated tables in the Snowflake account. |
| APPLY | Storage lifecycle policy | To view the executions for a policy, a role must also have the OWNERSHIP privilege on the table(s) associated with the policy. This privilege is not required if a role has the global APPLY STORAGE LIFECYCLE POLICY privilege. |
| OWNERSHIP | Table | To view the executions for a policy, a role must also have the APPLY privilege on the policy associated with the table. This privilege is not required if a role has the global APPLY STORAGE LIFECYCLE POLICY privilege. |

## Usage notes

* Results are returned based on the privileges granted to the role that executes the query:

  + If the role has the global APPLY STORAGE LIFECYCLE POLICY privilege, Snowflake returns all policy executions related to any policy and
    table associations in the account.
  + If the role has the APPLY privilege on a specific storage lifecycle policy, Snowflake returns executions for that policy
    only for objects that are owned by the role that calls the function.
  + If the role has either the APPLY privilege or the OWNERSHIP privilege on the policy,
    but does *not* have the OWNERSHIP privilege on the table that the policy is attached to,
    Snowflake doesn’t show policy executions for the policy in the results.
  + If the role has no policy privileges, but has the OWNERSHIP privilege on the table that a policy is attached to, Snowflake returns an error
    message and doesn’t return any policy executions.

## Examples

Specify the `REF_ENTITY_NAME` and `REF_ENTITY_DOMAIN` arguments to
retrieve the storage lifecycle policy history for
a table named `t1`:

```sqlexample
SELECT * FROM
  TABLE (
    INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY(
      REF_ENTITY_NAME => 'my_db.my_schema.t1',
      REF_ENTITY_DOMAIN => 'Table'
    )
  );
```

Retrieve the storage lifecycle policy history for
each table that has the policy named `slp` associated with it, and
limit the results to 100 rows:

```sqlexample
SELECT * FROM
  TABLE(
    INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY(
      POLICY_NAME => 'my_db.my_schema.slp',
      RESULT_LIMIT => 100
    )
  );
```

Retrieve the 100 most recent executions for a specified policy, scheduled within the last hour:

```sqlexample
SELECT * FROM
TABLE(
  INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY(
    POLICY_NAME => 'my_db.my_schema.slp',
    TIME_RANGE_START => DATEADD('HOUR', -1, CURRENT_TIMESTAMP()),
    RESULT_LIMIT => 100
  )
);
```

Retrieve the policy execution history for a given table within a 30-minute time range:

```sqlexample
SELECT * FROM
TABLE (
  INFORMATION_SCHEMA.STORAGE_LIFECYCLE_POLICY_HISTORY(
    REF_ENTITY_NAME => 'my_db.my_schema.t1',
    REF_ENTITY_DOMAIN => 'Table',
    TIME_RANGE_START => TO_TIMESTAMP_LTZ('2024-07-08 12:00:00.000 -0700'),
    TIME_RANGE_END => TO_TIMESTAMP_LTZ('2024-07-08 12:30:00.000 -0700')
  )
);
```

---
title: STRIP_NULL_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/strip_null_value.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# STRIP_NULL_VALUE

Converts a [JSON null](../../user-guide/semistructured-considerations.md) value to a SQL NULL value. All other variant values are passed unchanged.

## Syntax

```sqlsyntax
STRIP_NULL_VALUE( <variant_expr> )
```

## Arguments

`variant_expr`
:   An expression of type VARIANT.

## Returns

* If the expression contains a JSON null value, the function returns a SQL NULL.
* If the expression does not contain a JSON null value, the function returns the input value.

## Examples

```sqlexample
CREATE OR REPLACE TABLE mytable
(
  SRC Variant
);

INSERT INTO mytable
  SELECT PARSE_JSON(column1)
  FROM VALUES
  ('{
  "a": "1",
  "b": "2",
  "c": null
  }')
  , ('{
  "a": "1",
  "b": "2",
  "c": "3"
  }');

SELECT STRIP_NULL_VALUE(src:c) FROM mytable;
```

```output
+-------------------------+
| STRIP_NULL_VALUE(SRC:C) |
|-------------------------|
| NULL                    |
| "3"                     |
+-------------------------+
```

---
title: STRTOK
source: https://docs.snowflake.com/en/sql-reference/functions/strtok.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# STRTOK

Tokenizes a given string and returns the requested part.

See also:
:   [SPLIT_PART](split_part.md)

## Syntax

```sqlsyntax
STRTOK(<string> [,<delimiter>] [,<partNumber>])
```

## Arguments

**Required:**

`string`
:   Text to be tokenized.

**Optional:**

`delimiter`
:   Text representing the set of delimiters to tokenize on. Each character in the delimiter string is a separate delimiter.
    For example, if the delimiter is `'@.'`, then both `'@'` and `'.'` are treated as delimiters. This
    behavior differs from [SPLIT_PART](split_part.md), which treats the entire delimiter as a single delimiter string.

    If the delimiter is empty, and the `string` is empty, then the function returns NULL. If the
    delimiter is empty, and the `string` is non-empty, then the whole string will be treated as one token.

    Default: A single space character

`partNumber`
:   Requested token, which is 1-based so that the first token is token number 1, not token number 0.
    If the token number is out of range, then NULL is returned.

    Default: 1

## Returns

The data type of the returned value is VARCHAR.

If the requested part doesn’t exist or any argument is NULL, then NULL is returned.

## Usage notes

Similar to Linux strtok(), STRTOK never returns an empty string as a token.
This behavior differs from [SPLIT_PART](split_part.md), which can return empty strings
as parts when the input string starts or ends with the delimiter, or when
there are consecutive delimiters.

## Examples

The following examples call the STRTOK function:

### Return the first token in a string

The following simple example calls STRTOK to return the first token in a string:

```sqlexample
SELECT STRTOK('a.b.c', '.', 1);
```

```output
+-------------------------+
| STRTOK('A.B.C', '.', 1) |
|-------------------------|
| a                       |
+-------------------------+
```

### Use multiple delimiters to return different tokens

The following example shows how to use multiple delimiters to return the first, second, and third tokens
when the delimiters are `@` and `.`:

```sqlexample
SELECT STRTOK('user@snowflake.com', '@.', 1);
```

```output
+---------------------------------------+
| STRTOK('USER@SNOWFLAKE.COM', '@.', 1) |
|---------------------------------------|
| user                                  |
+---------------------------------------+
```

```sqlexample
SELECT STRTOK('user@snowflake.com', '@.', 2);
```

```output
+---------------------------------------+
| STRTOK('USER@SNOWFLAKE.COM', '@.', 2) |
|---------------------------------------|
| snowflake                             |
+---------------------------------------+
```

```sqlexample
SELECT STRTOK('user@snowflake.com', '@.', 3);
```

```output
+---------------------------------------+
| STRTOK('USER@SNOWFLAKE.COM', '@.', 3) |
|---------------------------------------|
| com                                   |
+---------------------------------------+
```

### Demonstrate indexing past the last possible token in the string

The following example demonstrates what happens when you index past the last possible token in the string:

```sqlexample
SELECT STRTOK('user@snowflake.com.', '@.', 4);
```

```output
+----------------------------------------+
| STRTOK('USER@SNOWFLAKE.COM.', '@.', 4) |
|----------------------------------------|
| NULL                                   |
+----------------------------------------+
```

### Demonstrate how the first element can be past the end of the string

In this example, the input string is empty, and there are no elements. So, the first
element is past the end of the string, and the function returns NULL instead of an empty string:

```sqlexample
SELECT STRTOK('', '', 1);
```

```output
+-------------------+
| STRTOK('', '', 1) |
|-------------------|
| NULL              |
+-------------------+
```

### Call STRTOK with an empty delimiter

Here is an example with an empty delimiter string:

```sqlexample
SELECT STRTOK('a.b', '', 1);
```

```output
+----------------------+
| STRTOK('A.B', '', 1) |
|----------------------|
| a.b                  |
+----------------------+
```

### Demonstrate NULL values for arguments

The following examples specify NULL values for each of the arguments:

```sqlexample
SELECT STRTOK(NULL, '.', 1);
```

```output
+----------------------+
| STRTOK(NULL, '.', 1) |
|----------------------|
| NULL                 |
+----------------------+
```

```sqlexample
SELECT STRTOK('a.b', NULL, 1);
```

```output
+------------------------+
| STRTOK('A.B', NULL, 1) |
|------------------------|
| NULL                   |
+------------------------+
```

```sqlexample
SELECT STRTOK('a.b', '.', NULL);
```

```output
+--------------------------+
| STRTOK('A.B', '.', NULL) |
|--------------------------|
| NULL                     |
+--------------------------+
```

### Demonstrate differences between STRTOK and SPLIT_PART

This example demonstrates the difference between STRTOK and SPLIT_PART when using repeated delimiters.
STRTOK treats each character in the delimiter string `'|-'` as a separate delimiter, splitting at every
`'|'` and `'-'` character. In contrast, SPLIT_PART treats the entire delimiter string `'|-'`
as a single delimiter, so it only splits where that exact sequence appears:

```sqlexample
SELECT STRTOK('data1||data2|-data3---data4', '|-', 1) AS strtok_1,
       STRTOK('data1||data2|-data3---data4', '|-', 2) AS strtok_2,
       STRTOK('data1||data2|-data3---data4', '|-', 3) AS strtok_3,
       STRTOK('data1||data2|-data3---data4', '|-', 4) AS strtok_4,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 1) AS split_part_1,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 2) AS split_part_2,
       SPLIT_PART('data1||data2|-data3---data4', '|-', 3) AS split_part_3;
```

```output
+----------+----------+----------+----------+-----------------+--------------+--------------+
| STRTOK_1 | STRTOK_2 | STRTOK_3 | STRTOK_4 | SPLIT_PART_1    | SPLIT_PART_2 | SPLIT_PART_3 |
|----------+----------+----------+----------+-----------------+--------------+--------------|
| data1    | data2    | data3    | data4    | data1||data2    | data3---data4|              |
+----------+----------+----------+----------+-----------------+--------------+--------------+
```

---
title: STRTOK_SPLIT_TO_TABLE
source: https://docs.snowflake.com/en/sql-reference/functions/strtok_split_to_table.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General) , [Table functions](../functions-table.md)

# STRTOK_SPLIT_TO_TABLE

Tokenizes a string with the given set of delimiters and flattens the results into rows.

See also:
:   [STRTOK](strtok.md), [STRTOK_TO_ARRAY](strtok_to_array.md)

## Syntax

```sqlsyntax
STRTOK_SPLIT_TO_TABLE(<string> [,<delimiter_list>])
```

## Arguments

**Required:**

`string`
:   Text to be tokenized.

**Optional:**

`delimiter_list`
:   Optional set of delimiters. The default value is a single space character.

## Output

This function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| SEQ | NUMBER | A unique sequence number associated with the input record. The sequence is not guaranteed to be gap-free or ordered. in any particular way. |
| INDEX | NUMBER | The one-based index of the element. |
| VALUE | VARCHAR | The value of the element of the flattened array. |

> **Note:**
>
> The query can also access the columns of the original (correlated) table that served as the source of data for this function. If a single row
> from the original table resulted in multiple rows in the flattened view, the values in this input row are replicated to match the number of
> rows produced by this function.

## Examples

Here is a simple example on constant input.

```sqlexample
SELECT table1.value
  FROM TABLE(STRTOK_SPLIT_TO_TABLE('a.b', '.')) AS table1
  ORDER BY table1.value;
```

```output
+-------+
| VALUE |
|-------|
| a     |
| b     |
+-------+
```

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE splittable_strtok (v VARCHAR);
INSERT INTO splittable_strtok (v) VALUES ('a b'), ('cde'), ('f|g'), ('');
SELECT * FROM splittable_strtok;
```

```output
+-----+
| V   |
|-----|
| a b |
| cde |
| f|g |
|     |
+-----+
```

You can use the [LATERAL](../constructs/join-lateral.md) keyword with the STRTOK_SPLIT_TO_TABLE function
so that the function executes on each row of the `splittable_strtok` table as a correlated table:

```sqlexample
SELECT *
  FROM splittable_strtok, LATERAL STRTOK_SPLIT_TO_TABLE(splittable_strtok.v, ' ')
  ORDER BY SEQ, INDEX;
```

```output
+-----+-----+-------+-------+
| V   | SEQ | INDEX | VALUE |
|-----+-----+-------+-------|
| a b |   1 |     1 | a     |
| a b |   1 |     2 | b     |
| cde |   2 |     1 | cde   |
| f|g |   3 |     1 | f|g   |
+-----+-----+-------+-------+
```

This example is the same as the preceding, except that it specifies multiple delimiters:

```sqlexample
SELECT *
  FROM splittable_strtok, LATERAL STRTOK_SPLIT_TO_TABLE(splittable_strtok.v, ' |')
  ORDER BY SEQ, INDEX;
```

```output
+-----+-----+-------+-------+
| V   | SEQ | INDEX | VALUE |
|-----+-----+-------+-------|
| a b |   1 |     1 | a     |
| a b |   1 |     2 | b     |
| cde |   2 |     1 | cde   |
| f|g |   3 |     1 | f     |
| f|g |   3 |     2 | g     |
+-----+-----+-------+-------+
```

Create another table that contains authors in one column and some of their book titles in another column. In the table
data, the book titles might be separated by a comma or a semi-colon:

```sqlexample
CREATE OR REPLACE TABLE authors_books_test2 (author VARCHAR, titles VARCHAR);
INSERT INTO authors_books_test2 (author, titles) VALUES
  ('Nathaniel Hawthorne', 'The Scarlet Letter ; The House of the Seven Gables;The Blithedale Romance'),
  ('Herman Melville', 'Moby Dick,The Confidence-Man');
SELECT * FROM authors_books_test2;
```

```output
+---------------------+---------------------------------------------------------------------------+
| AUTHOR              | TITLES                                                                    |
|---------------------+---------------------------------------------------------------------------|
| Nathaniel Hawthorne | The Scarlet Letter ; The House of the Seven Gables;The Blithedale Romance |
| Herman Melville     | Moby Dick,The Confidence-Man                                              |
+---------------------+---------------------------------------------------------------------------+
```

Use the LATERAL keyword and the SPLIT_TO_TABLE function to run a query that returns a separate row for each title.
In addition, use the [TRIM](trim.md) function to remove leading and trailing spaces from the titles. Note that the SELECT
list includes the fixed `value` column that is returned by the function:

```sqlexample
SELECT author, TRIM(value) AS title
  FROM authors_books_test2, LATERAL STRTOK_SPLIT_TO_TABLE(titles, ',;')
  ORDER BY author;
```

```output
+---------------------+-------------------------------+
| AUTHOR              | TITLE                         |
|---------------------+-------------------------------|
| Herman Melville     | Moby Dick                     |
| Herman Melville     | The Confidence-Man            |
| Nathaniel Hawthorne | The Scarlet Letter            |
| Nathaniel Hawthorne | The House of the Seven Gables |
| Nathaniel Hawthorne | The Blithedale Romance        |
+---------------------+-------------------------------+
```

---
title: STRTOK_TO_ARRAY
source: https://docs.snowflake.com/en/sql-reference/functions/strtok_to_array.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General) , [Semi-structured and structured data functions](../functions-semistructured.md) (Conversion/Casting)

# STRTOK_TO_ARRAY

Tokenizes the given string using the given set of delimiters and returns the tokens as an [ARRAY](../data-types-semistructured.md)
value.

## Syntax

```sqlsyntax
STRTOK_TO_ARRAY( <string> [ , <delimiter> ] )
```

## Arguments

**Required:**

`string`
:   Text to be tokenized.

**Optional:**

`delimiter`
:   Set of delimiters.

    Default: A single space character.

## Returns

This function returns a value of type ARRAY or NULL.

The function returns an empty array if tokenization produces no tokens.

If either argument is a NULL or [JSON null](../../user-guide/semistructured-considerations.md) value, the function returns NULL.

## Examples

The following example uses the STRTOK_TO_ARRAY function to split a string into an array:

```sqlexample
SELECT STRTOK_TO_ARRAY('a.b.c', '.') AS string_to_array;
```

```output
+-----------------+
| STRING_TO_ARRAY |
|-----------------|
| [               |
|   "a",          |
|   "b",          |
|   "c"           |
| ]               |
+-----------------+
```

The following example tokenizes on multiple delimiters (`.` and `@`):

```sqlexample
SELECT STRTOK_TO_ARRAY('user@snowflake.com', '.@') AS multiple_delimiters;
```

```output
+---------------------+
| MULTIPLE_DELIMITERS |
|---------------------|
| [                   |
|   "user",           |
|   "snowflake",      |
|   "com"             |
| ]                   |
+---------------------+
```

---
title: SUBSTR , SUBSTRING
source: https://docs.snowflake.com/en/sql-reference/functions/substr.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Matching/Comparison)

# SUBSTR , SUBSTRING

Returns the portion of the [string or binary](../data-types-text.md) value
from `base_expr`, starting from the character/byte specified by `start_expr`,
with optionally limited length.

These functions are synonymous.

See also:
:   [LEFT](left.md) , [RIGHT](right.md)

## Syntax

```sqlsyntax
SUBSTR( <base_expr>, <start_expr> [ , <length_expr> ] )

SUBSTRING( <base_expr>, <start_expr> [ , <length_expr> ] )
```

## Arguments

`base_expr`
:   An expression that evaluates to a VARCHAR or BINARY value.

`start_expr`
:   An expression that evaluates to an integer. It specifies the offset from which the substring starts. The offset is measured in:

    * The number of UTF-8 characters if the input is a VARCHAR value.
    * The number of bytes if the input is a BINARY value.

    The start position is 1-based, not 0-based. For example, `SUBSTR('abc', 1, 1)` returns `a`, not `b`.

`length_expr`
:   An expression that evaluates to an integer. It specifies:

    * The number of UTF-8 characters to return if the input is VARCHAR.
    * The number of bytes to return if the input is BINARY.

    Specify a length that is greater than or equal to zero. If the length is a negative number, the function returns an
    empty string.

## Returns

The data type of the returned value is the same as the data type of the `base_expr` (VARCHAR or BINARY).

If any of the inputs are NULL, NULL is returned.

## Usage notes

* If `length_expr` is specified, up to `length_expr` characters/bytes are
  returned. If `length_expr` isn’t specified, all the characters until the end of the string or
  binary value are returned.
* The values in `start_expr` start from 1:

  > + If 0 is specified, it is treated as 1.
  > + If a negative value is specified, the starting position is computed as
  >   the `start_expr` characters/bytes from the end of the string or binary
  >   value. If the position is outside of the range of a string or binary
  >   value, an empty value is returned.

## Collation details

* Collation applies to VARCHAR inputs. Collation doesn’t apply if the input data type of the first parameter
  is BINARY.
* No impact. Although collation is accepted syntactically, collations don’t affect processing. For example,
  two-character and three-character letters in languages (for example, “dzs” in Hungarian or “ch” in Czech)
  are still counted as two or three characters (not one character) for the length argument.
* The collation of the result is the same as the collation of the input. This can be useful if the returned value is passed to another function as part of nested function calls.

## Examples

The following examples use the SUBSTR function.

### Basic example

The following example uses the SUBSTR function to return the portion of the string that starts at the
ninth character and limits the length of the returned value to three characters:

```sqlexample
SELECT SUBSTR('testing 1 2 3', 9, 3);
```

```output
+-------------------------------+
| SUBSTR('TESTING 1 2 3', 9, 3) |
|-------------------------------|
| 1 2                           |
+-------------------------------+
```

### Specifying different start and length values

The following example shows the substrings returned for the same `base_expr` when different
values are specified for `start_expr` and `length_expr`:

```sqlexample
CREATE OR REPLACE TABLE test_substr (
    base_value VARCHAR,
    start_value INT,
    length_value INT)
  AS SELECT
    column1,
    column2,
    column3
  FROM
    VALUES
      ('mystring', -1, 3),
      ('mystring', -3, 3),
      ('mystring', -3, 7),
      ('mystring', -5, 3),
      ('mystring', -7, 3),
      ('mystring', 0, 3),
      ('mystring', 0, 7),
      ('mystring', 1, 3),
      ('mystring', 1, 7),
      ('mystring', 3, 3),
      ('mystring', 3, 7),
      ('mystring', 5, 3),
      ('mystring', 5, 7),
      ('mystring', 7, 3),
      ('mystring', NULL, 3),
      ('mystring', 3, NULL);

SELECT base_value,
       start_value,
       length_value,
       SUBSTR(base_value, start_value, length_value) AS substring
  FROM test_substr;
```

```output
+------------+-------------+--------------+-----------+
| BASE_VALUE | START_VALUE | LENGTH_VALUE | SUBSTRING |
|------------+-------------+--------------+-----------|
| mystring   |          -1 |            3 | g         |
| mystring   |          -3 |            3 | ing       |
| mystring   |          -3 |            7 | ing       |
| mystring   |          -5 |            3 | tri       |
| mystring   |          -7 |            3 | yst       |
| mystring   |           0 |            3 | mys       |
| mystring   |           0 |            7 | mystrin   |
| mystring   |           1 |            3 | mys       |
| mystring   |           1 |            7 | mystrin   |
| mystring   |           3 |            3 | str       |
| mystring   |           3 |            7 | string    |
| mystring   |           5 |            3 | rin       |
| mystring   |           5 |            7 | ring      |
| mystring   |           7 |            3 | ng        |
| mystring   |        NULL |            3 | NULL      |
| mystring   |           3 |         NULL | NULL      |
+------------+-------------+--------------+-----------+
```

### Returning substrings for email, phone, and date strings

The following examples return substrings for customer information in a table.

Create the table and insert data:

```sqlexample
CREATE OR REPLACE TABLE customer_contact_example (
    cust_id INT,
    cust_email VARCHAR,
    cust_phone VARCHAR,
    activation_date VARCHAR)
  AS SELECT
    column1,
    column2,
    column3,
    column4
  FROM
    VALUES
      (1, 'some_text@example.com', '800-555-0100', '20210320'),
      (2, 'some_other_text@example.org', '800-555-0101', '20240509'),
      (3, 'some_different_text@example.net', '800-555-0102', '20191017');

SELECT * from customer_contact_example;
```

```output
+---------+---------------------------------+--------------+-----------------+
| CUST_ID | CUST_EMAIL                      | CUST_PHONE   | ACTIVATION_DATE |
|---------+---------------------------------+--------------+-----------------|
|       1 | some_text@example.com           | 800-555-0100 | 20210320        |
|       2 | some_other_text@example.org     | 800-555-0101 | 20240509        |
|       3 | some_different_text@example.net | 800-555-0102 | 20191017        |
+---------+---------------------------------+--------------+-----------------+
```

Use the [POSITION](position.md) function with the SUBSTR function to extract the domains from email addresses.
This example finds the position of `@` in each string and starts from the next character by adding
one:

```sqlexample
SELECT cust_id,
       cust_email,
       SUBSTR(cust_email, POSITION('@' IN cust_email) + 1) AS domain
  FROM customer_contact_example;
```

```output
+---------+---------------------------------+-------------+
| CUST_ID | CUST_EMAIL                      | DOMAIN      |
|---------+---------------------------------+-------------|
|       1 | some_text@example.com           | example.com |
|       2 | some_other_text@example.org     | example.org |
|       3 | some_different_text@example.net | example.net |
+---------+---------------------------------+-------------+
```

> **Tip:**
>
> You can use the POSITION function to find the position of other characters, such as an empty
> character (`' '`) or an underscore (`_`).

In the `cust_phone` column in the table, the area code is always the first three characters. Extract
the area code from phone numbers:

```sqlexample
SELECT cust_id,
       cust_phone,
       SUBSTR(cust_phone, 1, 3) AS area_code
  FROM customer_contact_example;
```

```output
+---------+--------------+-----------+
| CUST_ID | CUST_PHONE   | AREA_CODE |
|---------+--------------+-----------|
|       1 | 800-555-0100 | 800       |
|       2 | 800-555-0101 | 800       |
|       3 | 800-555-0102 | 800       |
+---------+--------------+-----------+
```

Remove the area code from phone numbers:

```sqlexample
SELECT cust_id,
       cust_phone,
       SUBSTR(cust_phone, 5) AS phone_without_area_code
  FROM customer_contact_example;
```

```output
+---------+--------------+-------------------------+
| CUST_ID | CUST_PHONE   | PHONE_WITHOUT_AREA_CODE |
|---------+--------------+-------------------------|
|       1 | 800-555-0100 | 555-0100                |
|       2 | 800-555-0101 | 555-0101                |
|       3 | 800-555-0102 | 555-0102                |
+---------+--------------+-------------------------+
```

In the `activation_date` column in the table, the date is always in the format `YYYYMMDD`. Extract the year,
month, and day from these strings:

```sqlexample
SELECT cust_id,
       activation_date,
       SUBSTR(activation_date, 1, 4) AS year,
       SUBSTR(activation_date, 5, 2) AS month,
       SUBSTR(activation_date, 7, 2) AS day
  FROM customer_contact_example;
```

```output
+---------+-----------------+------+-------+-----+
| CUST_ID | ACTIVATION_DATE | YEAR | MONTH | DAY |
|---------+-----------------+------+-------+-----|
|       1 | 20210320        | 2021 | 03    | 20  |
|       2 | 20240509        | 2024 | 05    | 09  |
|       3 | 20191017        | 2019 | 10    | 17  |
+---------+-----------------+------+-------+-----+
```

---
title: SUM
source: https://docs.snowflake.com/en/sql-reference/functions/sum.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# SUM

Returns the sum of non-NULL records for `expr`. You can use the DISTINCT keyword to compute the sum of unique
non-null values. If all records inside a group are NULL, the function returns NULL.

See also:
:   [COUNT](count.md) , [MAX](max.md) , [MIN](min.md)

## Syntax

**Aggregate function**

```sqlsyntax
SUM( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
SUM( [ DISTINCT ] <expr1> ) OVER (
                                 [ PARTITION BY <expr2> ]
                                 [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                 )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   This is an expression that evaluates to a numeric data type (INTEGER, FLOAT, DECIMAL, etc.).

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition. (This does not control the order of the
    entire query output.)

## Usage notes

* Numeric values are summed into an equivalent or larger data type.
* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

## Examples

```sqlexample
CREATE OR REPLACE TABLE sum_example (k INT, d DECIMAL(10,5),
  s1 VARCHAR(10), s2 VARCHAR(10));

INSERT INTO sum_example VALUES
  (1, 1.1, '1.1', 'one'),
  (1, 10, '10', 'ten'),
  (2, 2.2, '2.2', 'two'),
  (2, NULL, NULL, 'null'),
  (3, NULL, NULL, 'null'),
  (NULL, 9, '9.9', 'nine');

SELECT *
  FROM sum_example;
```

```output
+------+----------+------+------+
|    K |        D | S1   | S2   |
|------+----------+------+------|
|    1 |  1.10000 | 1.1  | one  |
|    1 | 10.00000 | 10.0 | ten  |
|    2 |  2.20000 | 2.2  | two  |
|    2 |     NULL | NULL | null |
|    3 |     NULL | NULL | null |
| NULL |  9.00000 | 9.9  | nine |
+------+----------+------+------+
```

```sqlexample
SELECT SUM(d), SUM(s1)
  FROM sum_example;
```

```output
+----------+---------+
|   SUM(D) | SUM(S1) |
|----------+---------|
| 22.30000 |    23.2 |
+----------+---------+
```

```sqlexample
SELECT k, SUM(d), SUM(s1)
  FROM sum_example
  GROUP BY k;
```

```output
+------+----------+---------+
|    K |   SUM(D) | SUM(S1) |
|------+----------+---------|
|    1 | 11.10000 |    11.1 |
|    2 |  2.20000 |     2.2 |
|    3 |     NULL |    NULL |
| NULL |  9.00000 |     9.9 |
+------+----------+---------+
```

```sqlexample
SELECT SUM(s2)
  FROM sum_example;
```

```output
100038 (22018): Numeric value 'one' is not recognized
```

The script below shows the use of this function (and some other aggregate window functions):

```sqlexample
CREATE OR REPLACE TABLE example_cumulative (p INT, o INT, i INT);

INSERT INTO example_cumulative VALUES
  (  0, 1, 10), (0, 2, 20), (0, 3, 30),
  (100, 1, 10), (100, 2, 30), (100, 2, 5), (100, 3, 11), (100, 3, 120),
  (200, 1, 10000), (200, 1, 200), (200, 1, 808080), (200, 2, 33333), (200, 3, NULL), (200, 3, 4),
  (300, 1, NULL), (300, 1, NULL);
```

```sqlexample
SELECT
    p, o, i,
    COUNT(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS count_i_rows_pre,
    SUM(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS sum_i_rows_pre,
    AVG(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS avg_i_rows_pre,
    MIN(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS min_i_rows_pre,
    MAX(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) AS max_i_rows_pre
  FROM example_cumulative
  ORDER BY p, o;
```

```output
+-----+---+--------+------------------+----------------+----------------+----------------+----------------+
|   P | O |      I | COUNT_I_ROWS_PRE | SUM_I_ROWS_PRE | AVG_I_ROWS_PRE | MIN_I_ROWS_PRE | MAX_I_ROWS_PRE |
|-----+---+--------+------------------+----------------+----------------+----------------+----------------|
|   0 | 1 |     10 |                1 |             10 |         10.000 |             10 |             10 |
|   0 | 2 |     20 |                2 |             30 |         15.000 |             10 |             20 |
|   0 | 3 |     30 |                3 |             60 |         20.000 |             10 |             30 |
| 100 | 1 |     10 |                1 |             10 |         10.000 |             10 |             10 |
| 100 | 2 |     30 |                2 |             40 |         20.000 |             10 |             30 |
| 100 | 2 |      5 |                3 |             45 |         15.000 |              5 |             30 |
| 100 | 3 |     11 |                4 |             56 |         14.000 |              5 |             30 |
| 100 | 3 |    120 |                5 |            176 |         35.200 |              5 |            120 |
| 200 | 1 |  10000 |                1 |          10000 |      10000.000 |          10000 |          10000 |
| 200 | 1 |    200 |                2 |          10200 |       5100.000 |            200 |          10000 |
| 200 | 1 | 808080 |                3 |         818280 |     272760.000 |            200 |         808080 |
| 200 | 2 |  33333 |                4 |         851613 |     212903.250 |            200 |         808080 |
| 200 | 3 |   NULL |                4 |         851613 |     212903.250 |            200 |         808080 |
| 200 | 3 |      4 |                5 |         851617 |     170323.400 |              4 |         808080 |
| 300 | 1 |   NULL |                0 |           NULL |           NULL |           NULL |           NULL |
| 300 | 1 |   NULL |                0 |           NULL |           NULL |           NULL |           NULL |
+-----+---+--------+------------------+----------------+----------------+----------------+----------------+
```

---
title: SUMMARIZE (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/summarize-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# SUMMARIZE (SNOWFLAKE.CORTEX)

Summarizes the given English-language input text.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.SUMMARIZE(<text>)
```

## Arguments

`text`
:   A string containing the English text from which a summary should be generated.

## Returns

A string containing a summary of the original text.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Example

In this example, a table named `reviews` contains a column named `review_content` containing the text of reviews
submitted by users. The query returns a summary of each review.

```sqlexample
SELECT SNOWFLAKE.CORTEX.SUMMARIZE(review_content) FROM reviews LIMIT 10;
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: SYS_CONTEXT
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT

Returns information about the context in which the function is called.

See also:
:   [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](sys_context_snowflake_environment.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](sys_context_snowflake_organization_session.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)](sys_context_snowflake_session_attributes.md)

## Syntax

**Syntax for retrieving properties:**

```sqlsyntax
SYS_CONTEXT(
  '<namespace>' ,
  '<property>'
)
```

**Syntax for calling functions:**

```sqlsyntax
SYS_CONTEXT(
  '<namespace>' ,
  '<function>' , '<argument>' [ , ... ]
)
```

## Arguments

`'namespace'`
:   Namespace of the property that you want to retrieve or the function that you want to call. You can specify one of the following
    namespaces:

    | Namespace | Description |
    | --- | --- |
    | [SNOWFLAKE$APPLICATION](sys_context_snowflake_application.md) | Properties and functions providing context around the application in which the function is called. |
    | [SNOWFLAKE$ENVIRONMENT](sys_context_snowflake_environment.md) | Properties providing context around the environment in which the function is called. These properties include information about:   * The client, driver, or library that is used to call the function. * The account associated with the session in which the function is called. * The region of that account. |
    | [SNOWFLAKE$ORGANIZATION](sys_context_snowflake_organization.md) | Functions providing context around the current organization. |
    | [SNOWFLAKE$ORGANIZATION_SESSION](sys_context_snowflake_organization_session.md) | Properties providing context around the session in which the function is called, when the current account is in an organization. |
    | [SNOWFLAKE$SESSION](sys_context_snowflake_session.md) | Properties and functions providing context around the session in which the function is called. |
    | [SNOWFLAKE$SESSION_ATTRIBUTES](sys_context_snowflake_session_attributes.md) | Custom key-value attributes set for the current session using [SET_SYS_CONTEXT](set_sys_context.md). |

`'property'`
:   Name of the property that you want to retrieve. The properties that you can specify depend on the namespace. See the
    documentation for a namespace for the list of properties that you can specify.

`'function'`
:   Name of the function that you want to call. The functions that you can call depend on the namespace. See the
    documentation for a namespace for the list of functions that you can call.

`'argument' [ , ... ]`
:   Arguments to pass to the function that you want to call.

## Returns

The function returns a VARCHAR value or NULL.

* The return value depends on the property that you are retrieving or the function that you are calling.

  See the documentation for each namespace for information about the properties and
  return values of functions in that namespace.
* The function returns NULL if:

  + The namespace is not accessible from within the context of the function call. For example, attempting to access properties in
    the SNOWFLAKE$APPLICATION namespace returns NULL if you are calling the function outside of application code.
  + The value of the property or the return value of the function call is NULL or non-existent.

Some properties and functions return Boolean values as the string `TRUE` or `FALSE`. To compare this return value against the
BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the return value to BOOLEAN. For example:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_ROLE_ACTIVATED', 'MY_CUSTOM_ROLE')::BOOLEAN = TRUE;
```

```output
+-----------------------------------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION', 'IS_ROLE_ACTIVATED', 'MY_CUSTOM_ROLE')::BOOLEAN = TRUE |
|-----------------------------------------------------------------------------------------|
| True                                                                                    |
+-----------------------------------------------------------------------------------------+
```

## Access control requirements

See the documentation for each namespace for information about the access control
requirements for the properties and functions in that namespace.

## Usage notes

See the documentation for each namespace for usage notes for the properties and
functions in that namespace.

## Examples

See the documentation for each namespace for examples of retrieving the properties and
calling the functions in that namespace.

---
title: SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_application.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)

Returns information about the context in which a statement is executed within a
[Snowflake Native App](../../developer-guide/native-apps/native-apps-about.md).

You can call this function in the following contexts:

* A stored procedure or Streamlit app that is configured to use
  [owner’s rights](../../developer-guide/native-apps/restricted-callers-rights.md) and is within or owned by a Snowflake Native App.
* A UDF, view, or policy that is owned by a Snowflake Native App.
* A UDF, view, or policy that is part of the [shared data content](../../developer-guide/native-apps/preparing-data-content.md) of a
  Snowflake Native App.

In any other context, the function returns NULL.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](sys_context_snowflake_environment.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](sys_context_snowflake_organization_session.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md) ,
    [IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)](is_application_role_activated.md)

## Syntax

**Syntax for retrieving properties:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$APPLICATION' ,
  '<property>'
)
```

**Syntax for calling functions:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$APPLICATION' ,
  '<function>' , '<argument>' [ , ... ]
)
```

## Arguments

`'SNOWFLAKE$APPLICATION'`
:   Specifies that you want to retrieve a property or call a function to return context information about the application in which
    the function is called.

`'property'`
:   Name of the property that you want to retrieve. You can specify the following properties:

    | Property | Description |
    | --- | --- |
    | `NAME` | Name of the application. |
    | `CURRENT_VERSION` | Current version of the application in which the current SQL statement is executed.  The value of the `CURRENT_VERSION` property can differ from the `INSTALLED_VERSION` property in the following situations:   * The SQL statement is executed in a [setup script](../../developer-guide/native-apps/creating-setup-script.md) that   upgrades the application to a new version.  In this case, `CURRENT_VERSION` is the new version, and `INSTALLED_VERSION` is the currently installed version that   is being upgraded. * A long-running procedure or query started executing before an upgrade completed.  In this case, `CURRENT_VERSION` is the version when the procedure or query started executing, and   `INSTALLED_VERSION` is the version after the upgrade completed. |
    | `CURRENT_PATCH` | Current patch number of the application in which the current SQL statement is executed. |
    | `INSTALLED_VERSION` | Installed version of the application in which the current SQL statement is executed. |
    | `INSTALLED_PATCH` | Installed patch number of the application in which the current SQL statement is executed. |
    | `IS_DEV_MODE` | `TRUE` if the application is in [development mode](../../developer-guide/native-apps/installing-testing-application.md); otherwise, `FALSE`.  To compare this value against the BOOLEAN value TRUE or FALSE, [cast](../data-type-conversion.md) the value to BOOLEAN. For example:  ```sqlexample SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'IS_DEV_MODE')::BOOLEAN = TRUE; ``` |

`'function'`
:   Name of the function that you want to call. You can call the following functions:

    * [IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)](is_application_role_activated.md)

`'argument' [ , ... ]`
:   Arguments to pass to the function that you want to call.

## Returns

The function returns a VARCHAR value or NULL:

* The return value depends on
  the property that you are retrieving or
  the function that you are calling.
* If you call SYS_CONTEXT with the SNOWFLAKE$APPLICATION namespace outside of
  any of the supported contexts, the function returns NULL.

## Usage notes

* If you are specifying the function call in a double-quoted string in a shell, escape the `$` character with a backslash
  (`\`) so that `$APPLICATION` is not interpreted as a shell variable.

  For example, if you are using Snowflake CLI and you are
  [specifying the SQL statement as a command-line argument](../../developer-guide/snowflake-cli/sql/execute-sql.md) in double
  quotes:

  ```bash
  snow sql --query "SELECT SYS_CONTEXT('SNOWFLAKE\$APPLICATION', 'NAME');"
  ```

## Examples

The following example returns the current version of the application:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$APPLICATION', 'CURRENT_VERSION');
```

---
title: SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_environment.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)

Returns information about the environment (the client, current account, and current region) in which the function is called.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](sys_context_snowflake_organization_session.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ENVIRONMENT' ,
  '<property>'
)
```

## Arguments

`'SNOWFLAKE$ENVIRONMENT'`
:   Specifies that you want to retrieve a property to return context information about the environment in which the function is
    called.

`'property'`
:   Name of the property that you want to retrieve. You can specify the following properties:

    | Property | Description |
    | --- | --- |
    | `CLIENT` | Name and version of the client, driver, or library used to call the function.  If this function is called in Snowsight, the function returns the name and version of the Go Snowflake Driver.  If this function is called in Snowflake CLI, the function returns the name and version of the Snowflake Connector for Python.  The value of this property is the same as the return value of the [CURRENT_CLIENT](current_client.md) function. |
    | `ACCOUNT` | The [account locator](../../user-guide/admin-account-identifier.md) of the account for the current session.  The value of this property is the same as the return value of the [CURRENT_ACCOUNT](current_account.md) function. |
    | `REGION` | The name of the [region](../../user-guide/intro-regions.md) of the account for the current session.  For organizations that have accounts in multiple [region groups](../../user-guide/admin-account-identifier.md), the value of the property is `region_group.region`.  The value of this property is the same as the return value of the [CURRENT_REGION](current_region.md) function. |

## Returns

The function returns a VARCHAR value.

## Usage notes

* If you are specifying the function call in a double-quoted string in a shell, escape the `$` character with a backslash
  (`\`) so that `$ENVIRONMENT` is not interpreted as a shell variable.

  For example, if you are using Snowflake CLI and you are
  [specifying the SQL statement as a command-line argument](../../developer-guide/snowflake-cli/sql/execute-sql.md) in double
  quotes:

  ```bash
  snow sql --query "SELECT SYS_CONTEXT('SNOWFLAKE\$ENVIRONMENT', 'CLIENT');"
  ```

## Examples

The following example returns the name and version of the client used to execute the command:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ENVIRONMENT', 'CLIENT');
```

The following example returns the account locator of the account for the current session:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ENVIRONMENT', 'ACCOUNT');
```

The following example returns the region of the account for the current session:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ENVIRONMENT', 'REGION');
```

---
title: SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_organization.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)

Returns information about the current organization.

You can call this function in any account in the organization. In any other context, the function returns NULL.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](sys_context_snowflake_environment.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](sys_context_snowflake_organization_session.md) ,
    [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](sys_context_snowflake_session.md) ,
    [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](is_group_activated.md) ,
    [IS_GROUP_IMPORTED (SYS_CONTEXT function)](is_group_imported.md) ,
    [IS_USER_IMPORTED (SYS_CONTEXT function)](is_user_imported.md)

## Syntax

**Syntax for calling functions:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ORGANIZATION' ,
  '<function>' , '<argument>' [ , ... ]
)
```

## Arguments

`'SNOWFLAKE$ORGANIZATION'`
:   Specifies that you want to retrieve a property or call a function to return context information about the current organization.

`'function'`
:   Name of the function that you want to call. You can call the following functions:

    * [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](is_group_activated.md)
    * [IS_GROUP_IMPORTED (SYS_CONTEXT function)](is_group_imported.md)
    * [IS_USER_IMPORTED (SYS_CONTEXT function)](is_user_imported.md)

`'argument' [ , ... ]`
:   Arguments to pass to the function that you want to call.

## Returns

The function returns a VARCHAR value or NULL:

* The return value depends on
  the function that you are calling.
* If you call SYS_CONTEXT with the SNOWFLAKE$ORGANIZATION namespace outside of
  any of the supported contexts, the function returns NULL.

## Usage notes

* If you are specifying the function call in a double-quoted string in a shell, escape the `$` character with a backslash
  (`\`) so that `$ORGANIZATION` is not interpreted as a shell variable.

  For example, if you are using Snowflake CLI and you are
  [specifying the SQL statement as a command-line argument](../../developer-guide/snowflake-cli/sql/execute-sql.md) in double
  quotes:

  ```bash
  snow sql --query "SELECT SYS_CONTEXT('SNOWFLAKE\$ORGANIZATION', 'IS_USER_IMPORTED', 'my_user_name');"
  ```

## Examples

See the following topics:

* [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](is_group_activated.md)
* [IS_GROUP_IMPORTED (SYS_CONTEXT function)](is_group_imported.md)
* [IS_USER_IMPORTED (SYS_CONTEXT function)](is_user_imported.md)

---
title: SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_organization_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)

Returns information about the session in which the function is called and the current organization user.

You can call this function in the following contexts:

* You can call this function directly in the current session.
* You can run a caller’s rights executable (for example, a caller’s rights stored procedure) that calls this function.
* You can run an owner’s rights executable (for example, an owner’s rights stored procedure) that calls this function, provided
  that:

  + The owner role has been granted the READ SESSION privilege on the account.
  + The account containing the owner role is the same organization as the current account for the session.

In any other context, the function returns NULL.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](sys_context_snowflake_environment.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md)

## Syntax

**Syntax for retrieving properties:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$ORGANIZATION_SESSION' ,
  '<property>'
)
```

## Arguments

`'SNOWFLAKE$ORGANIZATION_SESSION'`
:   Specifies that you want to retrieve a property or call a function to return information about the session in which the function
    is called, when the current account is in an organization.

`'property'`
:   Name of the property that you want to retrieve. You can specify the following properties:

    | Property | Description |
    | --- | --- |
    | `PRINCIPAL_NAME` | Name of the principal (the [organization user](../../user-guide/organization-users.md)) that started the session.  If the current user is not an organization user, the value of this property is NULL. |

## Returns

The function returns a VARCHAR value or NULL:

* The return value depends on
  the property that you are retrieving.
* If you call SYS_CONTEXT with the SNOWFLAKE$ORGANIZATION_SESSION namespace outside of
  any of the supported contexts, the function returns NULL.

## Usage notes

* If you are specifying the function call in a double-quoted string in a shell, escape the `$` character with a backslash
  (`\`) so that `$ORGANIZATION_SESSION` is not interpreted as a shell variable.

  For example, if you are using Snowflake CLI and you are
  [specifying the SQL statement as a command-line argument](../../developer-guide/snowflake-cli/sql/execute-sql.md) in double
  quotes:

  ```bash
  snow sql --query "SELECT SYS_CONTEXT('SNOWFLAKE\$ORGANIZATION_SESSION', 'PRINCIPAL_NAME');"
  ```

## Examples

The following example returns the name of the organization user for the current session:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$ORGANIZATION_SESSION', 'PRINCIPAL_NAME');
```

```output
+-----------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$ORGANIZATION_SESSION', 'PRINCIPAL_NAME') |
|-----------------------------------------------------------------|
| my_organization_user_name                                       |
+-----------------------------------------------------------------+
```

---
title: SYS_CONTEXT (SNOWFLAKE$SESSION namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_session.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$SESSION namespace)

Returns information about the session in which the function is called.

You can call this function in the following contexts:

* You can call this function directly in the current session.
* You can run a caller’s rights executable (for example, a caller’s rights stored procedure) that calls this function.
* You can run an owner’s rights executable (for example, an owner’s rights stored procedure) that calls this function, provided
  that the owner role has been granted the READ SESSION privilege on the account.

In any other context, the function returns NULL.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](sys_context_snowflake_application.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](sys_context_snowflake_environment.md) ,
    [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](sys_context_snowflake_organization.md)

## Syntax

**Syntax for retrieving properties:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$SESSION' ,
  '<property>'
)
```

**Syntax for calling functions:**

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$SESSION' ,
  '<function>' , '<argument>' [ , ... ]
)
```

## Arguments

`'SNOWFLAKE$SESSION'`
:   Specifies that you want to retrieve a property or call a function to return information about the session in which the function
    is called.

`'property'`
:   Name of the property that you want to retrieve. You can specify the following properties:

    | Property | Description |
    | --- | --- |
    | `PRINCIPAL_NAME` | Name of the principal (the user, [task](../../user-guide/tasks-intro.md), or [SPCS service](../../developer-guide/snowpark-container-services/overview.md)) that started the session. The name depends on the value of the `PRINCIPAL_TYPE` property:   * If `PRINCIPAL_TYPE` is one of the following values, the value of the `PRINCIPAL_NAME` property is the name of the   user:    + `USER`   + `USER_PERSON`   + `USER_SERVICE`   + `USER_LEGACY_SERVICE` * If `PRINCIPAL_TYPE` is `TASK`, the value is the name of the task. * If `PRINCIPAL_TYPE` is `SNOWSERVICE`, the value is the name of the SPCS service. |
    | `PRINCIPAL_TYPE` | Type of the principal that started the session. This property can have one of the following values:   * `USER` or `USER_suffix`, if a user started the session. `suffix` depends on the type of the user:    + If the user object has no TYPE property, the value is `USER`.   + If the TYPE property is `PERSON`, the value is `USER_PERSON`.   + If the TYPE property is `SERVICE`, the value is `USER_SERVICE`.   + If the TYPE property is `LEGACY_SERVICE`, the value is `USER_LEGACY_SERVICE`. * `TASK`, if a [task](../../user-guide/tasks-intro.md) started the session. * `SNOWSERVICE`, if an [SPCS service](../../developer-guide/snowpark-container-services/overview.md) started the session. |
    | `PRINCIPAL_EMAIL` | Email address that is associated with the principal. If there is no associated email address, the value of this property is NULL. |
    | `PRINCIPAL_DATABASE` | Name of the database containing the object for the principal. For example, if the principal is a task, the value of this property is the name of the database that contains the task.  If the principal is an account-level object (such as a user), the value of this property is NULL. |
    | `PRINCIPAL_SCHEMA` | Name of the schema containing the object for the principal. For example, if the principal is a task, the value of this property is the name of the schema that contains the task.  If the principal is an account-level object (such as a user), the value of this property is NULL. |
    | `ID` | Identifier for the session in which the function was called. |
    | `ROLE` | Primary role for the session in which the function was called. |
    | `ROLE_TYPE` | Type of the primary role. This property can have one of the following values:   * `ROLE`, if the primary role is an account role. |
    | `ROLE_DATABASE` | Name of the database that contains the database role, if the primary role is a database role. |
    | `SECONDARY_ROLES` | JSON array of the account-level roles activated as secondary roles in the session. The activated roles include roles that are hierarchically under the requested role. For example, suppose that the user executed:  ```sqlexample USE SECONDARY ROLES ACCOUNTADMIN; ```  The JSON array for this property includes the ACCOUNTADMIN role and the SECURITYADMIN, SYSADMIN, and USERADMIN roles, which are under the ACCOUNTADMIN role. |
    | `WANTED_SECONDARY_ROLES` | JSON array of the account-level roles requested by the user. For example, suppose that the user executed:  ```sqlexample USE SECONDARY ROLES ACCOUNTADMIN; ```  The JSON array for this property just includes the ACCOUNTADMIN role. |
    | `DATABASE` | Current database in use for the session, if the role that called the function has privileges to access the database. |
    | `SCHEMA` | Current schema in use for the session, if the role that called the function has privileges to access the schema. |
    | `SCHEMAS` | Current [search path](../name-resolution.md) of schemas for the session, if the role that called the function has privileges to access the current database. |
    | `WAREHOUSE` | Current warehouse in use for the session. |

`'function'`
:   Name of the function that you want to call. You can call the following functions:

    * [IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)](is_database_role_activated.md)
    * [IS_ROLE_ACTIVATED (SYS_CONTEXT function)](is_role_activated.md)

`'argument' [ , ... ]`
:   Arguments to pass to the function that you want to call.

## Returns

The function returns a VARCHAR value or NULL:

* The return value depends on
  the property that you are retrieving or
  the function that you are calling.
* If you call SYS_CONTEXT with the SNOWFLAKE$SESSION namespace outside of
  any of the supported contexts, the function returns NULL.

## Usage notes

* If you are specifying the function call in a double-quoted string in a shell, escape the `$` character with a backslash
  (`\`) so that `$SESSION` is not interpreted as a shell variable.

  For example, if you are using Snowflake CLI and you are
  [specifying the SQL statement as a command-line argument](../../developer-guide/snowflake-cli/sql/execute-sql.md) in double
  quotes:

  ```bash
  snow sql --query "SELECT SYS_CONTEXT('SNOWFLAKE\$SESSION', 'PRINCIPAL_NAME');"
  ```

## Examples

The following examples demonstrate how to retrieve context information about the session:

* Retrieving information about the principal
* Retrieving information about roles
* Retrieving the current database, schema, search path, and warehouse

### Retrieving information about the principal

The following example returns the name and type of the principal that called the function:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'PRINCIPAL_NAME') AS name,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'PRINCIPAL_TYPE') AS type,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'PRINCIPAL_EMAIL') AS email;
```

```output
+--------------+-------------+---------------------+
| NAME         | TYPE        | EMAIL               |
|--------------+-------------+---------------------|
| MY_USER_NAME | USER_PERSON | my.user@example.com |
+--------------+-------------+---------------------+
```

### Retrieving information about roles

The following example returns the name and type of the primary role in the session where the function was called:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'ROLE') AS role,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'ROLE_TYPE') AS type;
```

```output
+---------+------+
| ROLE    | TYPE |
|---------+------|
| MY_ROLE | ROLE |
+---------+------+
```

The following example uses the ACCOUNTADMIN role as a secondary role. The example then returns the list of requested secondary
roles in the session (ACCOUNTADMIN) and the list of account-level roles that are activated as secondary roles in the session.

The list of activated roles includes roles that are hierarchically under the requested role. Because the ACCOUTADMIN role is
activated, the list includes SECURITYADMIN, SYSADMIN, and USERADMIN, which are under the ACCOUNTADMIN role.

```sqlexample
USE SECONDARY ROLES ACCOUNTADMIN;

SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'WANTED_SECONDARY_ROLES') AS requested_roles,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'SECONDARY_ROLES') AS requested_roles_with_child_roles;
```

```output
+------------------+---------------------------------------------------------+
| REQUESTED_ROLES  | REQUESTED_ROLES_WITH_CHILD_ROLES                        |
|------------------+---------------------------------------------------------|
| ["ACCOUNTADMIN"] | ["ACCOUNTADMIN","SECURITYADMIN","SYSADMIN","USERADMIN"] |
+------------------+---------------------------------------------------------+
```

### Retrieving the current database, schema, search path, and warehouse

The following example returns the current database, schema, and warehouse in use for the session:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'DATABASE') AS database,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'SCHEMA') AS schema,
  SYS_CONTEXT('SNOWFLAKE$SESSION', 'WAREHOUSE') AS warehouse;
```

```output
+----------+--------+--------------+
| DATABASE | SCHEMA | WAREHOUSE    |
|----------+--------+--------------|
| MY_DB    | PUBLIC | MY_WAREHOUSE |
+----------+--------+--------------+
```

The following example returns a JSON array that contains the search path for the session:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION', 'SCHEMAS');
```

```output
+---------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION', 'SCHEMAS') |
|---------------------------------------------|
| ["MY_DB.MY_SCHEMA","MY_DB.PUBLIC"]          |
+---------------------------------------------+
```

The following example returns a row for each element in the search path:

```sqlexample
SELECT value::VARCHAR AS path_element
  FROM TABLE(
    FLATTEN(INPUT => PARSE_JSON(SYS_CONTEXT('SNOWFLAKE$SESSION', 'SCHEMAS'))));
```

```output
+-----------------------+
| PATH_ELEMENT          |
|-----------------------|
| BOOKS_DB.BOOKS_SCHEMA |
| BOOKS_DB.PUBLIC       |
+-----------------------+
```

---
title: SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)
source: https://docs.snowflake.com/en/sql-reference/functions/sys_context_snowflake_session_attributes.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)

Returns a custom session attribute that was set using [SET_SYS_CONTEXT](set_sys_context.md) in the
`SNOWFLAKE$SESSION_ATTRIBUTES` namespace.

Custom session attributes are immutable once set and persist for the duration of the session.
They are useful for tracking metadata about a session, such as application context, user
attributes, or audit information.

See also:
:   [SYS_CONTEXT](sys_context.md) ,
    [SET_SYS_CONTEXT](set_sys_context.md)

## Syntax

```sqlsyntax
SYS_CONTEXT(
  'SNOWFLAKE$SESSION_ATTRIBUTES' ,
  '<key>'
)
```

## Arguments

`'SNOWFLAKE$SESSION_ATTRIBUTES'`
:   Specifies that you want to retrieve a custom session attribute.

`'key'`
:   The name of the custom attribute to retrieve. Attribute names are **case-sensitive**.

## Returns

The function returns a VARCHAR value:

* The value of the specified attribute if it has been set in the current session using
  [SET_SYS_CONTEXT](set_sys_context.md).
* NULL if the attribute has not been set.

## Access control requirements

No special privileges are required to retrieve custom session attributes. Any user can retrieve
attributes from their own session.

## Usage notes

* Attributes must be set using [SET_SYS_CONTEXT](set_sys_context.md) before they can be retrieved.
* Attribute names are **case-sensitive**. `app_context` and `APP_CONTEXT` are treated as
  different attributes.
* Attributes are session-scoped and are not visible to other sessions.
* If you are specifying the function call in a double-quoted string in a shell, escape the `$`
  character with a backslash (`\`) so that `$session_attributes` is not interpreted as a
  shell variable.

## Examples

The following example sets a custom attribute and then retrieves it:

```sqlexample
-- Set a custom session attribute
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context', 'production');

-- Retrieve the custom attribute
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context');
```

```output
+---------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'app_context')   |
|---------------------------------------------------------------|
| production                                                    |
+---------------------------------------------------------------+
```

Retrieving an attribute that has not been set returns NULL:

```sqlexample
SELECT SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'nonexistent_attr');
```

```output
+------------------------------------------------------------------+
| SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'nonexistent_attr') |
|------------------------------------------------------------------|
| NULL                                                             |
+------------------------------------------------------------------+
```

Attribute names are case-sensitive:

```sqlexample
-- Set attributes with different cases
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'mykey', 'lowercase');
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'MyKey', 'mixedcase');
CALL SET_SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'MYKEY', 'uppercase');

-- Each is a distinct attribute
SELECT
  SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'mykey') AS lower,
  SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'MyKey') AS mixed,
  SYS_CONTEXT('SNOWFLAKE$SESSION_ATTRIBUTES', 'MYKEY') AS upper;
```

```output
+-----------+-----------+-----------+
| LOWER     | MIXED     | UPPER     |
|-----------+-----------+-----------|
| lowercase | mixedcase | uppercase |
+-----------+-----------+-----------+
```

---
title: SYSDATE
source: https://docs.snowflake.com/en/sql-reference/functions/sysdate.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYSDATE

Returns the current timestamp for the system in the UTC time zone.

See also:
:   [CURRENT_TIMESTAMP](current_timestamp.md)

## Syntax

```sqlsyntax
SYSDATE()
```

## Arguments

None.

## Returns

Returns the current timestamp in the UTC time zone.

The data type of the returned value is [TIMESTAMP_NTZ](../data-types-datetime.md).

## Usage notes

* Despite the name, this returns a TIMESTAMP_NTZ, not a DATE. To control the output format, use the session
  parameter TIMESTAMP_NTZ_OUTPUT_FORMAT.
* This function is similar to CURRENT_TIMESTAMP, except that:

  + It returns the current timestamp in the UTC time zone, whereas CURRENT_TIMESTAMP returns the timestamp in the
    local time zone.
  + Its return value is TIMESTAMP_NTZ, whereas CURRENT_TIMESTAMP returns TIMESTAMP_LTZ.
  + It requires parentheses (`SYSDATE()`), whereas CURRENT_TIMESTAMP can be called without parentheses.
  + It does not support a parameter to specify the precision of fractional seconds.
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual
  warehouse) because the queries might be serviced by different compute resources (in the warehouse).

## Examples

Set the time output format to `YYYY-MM-DD HH24:MI:SS.FF4`, then return the SYSDATE and CURRENT_TIMESTAMP.
Note the difference in the hour field due to the difference in time zone.

```sqlexample
ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF4';
ALTER SESSION SET TIMESTAMP_LTZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF4';

ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';

SELECT SYSDATE(), CURRENT_TIMESTAMP();
```

```output
+--------------------------+--------------------------+
| SYSDATE()                | CURRENT_TIMESTAMP()      |
|--------------------------+--------------------------|
| 2024-04-17 22:47:54.3520 | 2024-04-17 15:47:54.3520 |
+--------------------------+--------------------------+
```

---
title: SYSTEM$ABORT_SESSION
source: https://docs.snowflake.com/en/sql-reference/functions/system_abort_session.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ABORT_SESSION

Aborts the specified session.

## Syntax

```sqlsyntax
SYSTEM$ABORT_SESSION( <session_id> )
```

## Arguments

`session_id`
:   Identifier for the session to abort. To obtain the ID for a session, log into the web interface as an account administrator (user with the ACCOUNTADMIN role) and go to:

    > Account  » Sessions

## Examples

```sqlexample
SELECT SYSTEM$ABORT_SESSION(1065153868222);

+-------------------------------------+
| SYSTEM$ABORT_SESSION(1065153868222) |
|-------------------------------------|
| session [1065153868222] terminated. |
+-------------------------------------+
```

---
title: SYSTEM$ABORT_TRANSACTION
source: https://docs.snowflake.com/en/sql-reference/functions/system_abort_transaction.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ABORT_TRANSACTION

Aborts the specified transaction, if it is running. If the transaction has already been committed or rolled back, then the state of the
transaction is not altered.

For more information, see [Transactions](../transactions.md).

## Syntax

```sqlsyntax
SYSTEM$ABORT_TRANSACTION(<transaction_id>)
```

## Arguments

`transaction_id`
:   Identifier for the transaction to abort. To obtain transaction IDs,
    you can use the [SHOW TRANSACTIONS](../sql/show-transactions.md) or
    [SHOW LOCKS](../sql/show-locks.md) commands.

## Usage notes

* This function is supported for explicit/multi-statement transactions only. Autocommit transactions can be aborted by aborting the associated job.
* Note that DDL statements, including “CREATE TABLE AS SELECT …” will implicitly commit an open transaction. After the implicit commit finishes, the previously open transaction cannot be aborted.
* Transactions can be aborted only by the user who started the transaction or an account administrator.

## Examples

> ```sqlexample
> SHOW LOCKS IN ACCOUNT;
>
> --------------+--------+---------------+---------------------------------+---------+---------------------------------+--------------------------------------+
>    session    | table  |  transaction  |     transaction_started_on      | status  |           acquired_on           |               query_id               |
> --------------+--------+---------------+---------------------------------+---------+---------------------------------+--------------------------------------+
>  103079321618 | ORDERS | 1442254688149 | Mon, 14 Sep 2015 11:18:08 -0700 | HOLDING | Mon, 14 Sep 2015 11:18:16 -0700 | 6a478582-9e8c-4603-b5bf-89b14c042e1a |
>  103079325702 | ORDERS | 1442255439400 | Mon, 14 Sep 2015 11:30:39 -0700 | WAITING | [NULL]                          | 82fea8a6-a679-4de1-b6e9-7a80905831cf |
> --------------+--------+---------------+---------------------------------+---------+---------------------------------+--------------------------------------+
>
> SELECT SYSTEM$ABORT_TRANSACTION(1442254688149);
>
> -----------------------------------------+
>  SYSTEM$ABORT_TRANSACTION(1442254688149) |
> -----------------------------------------+
>  Aborted transaction id: 1442254688149   |
> -----------------------------------------+
> ```

---
title: SYSTEM$ACTIVATE_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_activate_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ACTIVATE_CMK_INFO

Activates Tri-Secret Secure in your account, optionally with private connectivity, by using the customer-managed key (CMK) information
that you registered for your account.

This system function performs the following actions:

* Configures your account to use Tri-Secret Secure with the registered CMK.
* Creates a new composed account master key.
* Registers your account with the rekeying background service.
* Optionally, enables private connectivity on an active CMK, without rekeying.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$ACTIVATE_CMK_INFO( [ <option> ] )
```

## Arguments

**Required:**

None.

**Optional:**

`option`
:   You can specify one of the following values:

    `REKEY_SAME_CMK`
    :   Allows rekeying with the active CMK.

    `UPDATE_PRIVATELINK`
    :   Updates the privatelink status from the registered CMK.

## Returns

Success or error messages.

## Access control requirements

Only users that are granted the MODIFY privilege on the account can call this function.
The MODIFY privilege is typically granted only to the ACCOUNTADMIN role.

## Usage notes

The background service generates email messages that notify the account administrator about rekeying and Tri-Secret Secure activation status.

## Examples

Activate Tri-Secret Secure for your Snowflake account:

```sqlexample
SELECT SYSTEM$ACTIVATE_CMK_INFO();
```

Rekey with your current CMK:

```sqlexample
SELECT SYSTEM$ACTIVATE_CMK_INFO('REKEY_SAME_CMK');
```

Update private connectivity enablement on the CMK that is registered for use with Tri-Secret Secure and the active CMK, without rekeying.

```sqlexample
SELECT SYSTEM$ACTIVATE_CMK_INFO('UPDATE_PRIVATELINK');
```

---
title: SYSTEM$ACTIVATE_CMK_INFO_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_activate_cmk_info_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ACTIVATE_CMK_INFO_POSTGRES

Activates Snowflake Postgres Tri-Secret Secure in your account by using the CMK (customer-managed key) information that you
registered for your account.

This system function performs the following actions:

* Configures your account to use Snowflake Postgres Tri-Secret Secure with the registered CMK.
* Snowflake Postgres instances created after the registered CMK is activated will use it.
* Snowflake Postgres instances that were created before the registered CMK is activated will not be rekeyed to use the new CMK, but will
  continue working with the prior CMK or no CMK.
* Snowflake Postgres replicas and forks will always inherit the CMK configuration from the parent Snowflake Postgres primary instance.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$ACTIVATE_CMK_INFO_POSTGRES()
```

## Returns

Success or error messages.

## Access control requirements

Only users that are granted the MODIFY privilege on the account can call this function.
The MODIFY privilege is typically granted only to the ACCOUNTADMIN role.

## Examples

Activate Snowflake Postgres Tri-Secret Secure for your Snowflake account:

```sqlexample
SELECT SYSTEM$ACTIVATE_CMK_INFO_POSTGRES();
```

---
title: SYSTEM$ADD_EVENT (for Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/functions/system_add_event.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ADD_EVENT (for Snowflake Scripting)

Add an event for trace.

Use SYSTEM$ADD_EVENT to add an event when using trace events from a handler written in Snowflake Scripting.

For more information, refer to [Emitting trace events in Snowflake Scripting](../../developer-guide/logging-tracing/tracing-snowflake-scripting.md).

## Syntax

```sqlsyntax
SYSTEM$ADD_EVENT('<name>', '<object>');
```

## Arguments

`'name'`
:   The name of the event to add.

`'object'`
:   An object containing name-value pairs representing the attributes to add.

## Examples

Code in the following example uses the SYSTEM$ADD_EVENT function to add an event named `name_a` and an event named `name_b`.
With `name_b`, it associates two attributes, `score` and `pass`. The code also sets two attributes for the span,
`key1` and `key2`.

```sqlexample
CREATE OR REPLACE PROCEDURE pi_proc()
  RETURNS DOUBLE
  LANGUAGE SQL
  AS $$
  BEGIN
    -- Add an event without attributes
    SYSTEM$ADD_EVENT('name_a');

    -- Add an event with attributes
    LET attr := {'score': 89, 'pass': TRUE};
    SYSTEM$ADD_EVENT('name_b', attr);

    -- Set attributes for the span
    SYSTEM$SET_SPAN_ATTRIBUTES({'key1': 'value1', 'key2': TRUE});

    RETURN 3.14;
  END;
  $$;
```

---
title: SYSTEM$ADD_REFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_add_reference.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ADD_REFERENCE

Called by a Snowflake Native App to associate a consumer reference string to a reference definition. The app can use this
association to access the consumer object. The reference string passed to this system function is the value returned by the
[SYSTEM$REFERENCE](system_reference.md) function, which represents a consumer object.

For information about using this function in an app, see
[Request references and object-level privileges from consumers](../../developer-guide/native-apps/requesting-refs.md).

This function supports both single and multi-valued references. The function returns an
error if an association has already been created using the same value specified by
`reference_name`.

## Syntax

```sqlsyntax
SYSTEM$ADD_REFERENCE('<reference_name>', '<reference_string>')
```

## Arguments

`'reference_name'`
:   The name of the reference as specified in the `manifest.yml` file of the app.

`'reference_string'`
:   The system-generated ID of the reference to the object in the consumer account.

---
title: SYSTEM$ALLOWLIST
source: https://docs.snowflake.com/en/sql-reference/functions/system_allowlist.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ALLOWLIST

Returns host names and port numbers to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall.
The output of this function can then be passed into [SnowCD](../../user-guide/snowcd.md).

Typically, Snowflake customers use a firewall to prevent unauthorized access. By default, your firewall might block access to Snowflake. To
update your firewall’s allowed list, you need to know the host names and port numbers for the URL for your
[Snowflake account](../../user-guide/admin-account-identifier.md), stages, and other hosts used by Snowflake.

For more details about the allowed listing for the Snowflake clients you use, see [Allowing Host names](../../user-guide/hostname-allowlist.md).

## Syntax

```sqlsyntax
SYSTEM$ALLOWLIST()
```

## Arguments

None.

## Returns

The data type of the returned value is VARIANT. The value is an array of JSON structures. Each JSON structure contains three
key/value pairs:

`type`
:   Snowflake supports the following types:

    `SNOWFLAKE_DEPLOYMENT`
    :   Host name and port number information for your Snowflake account.

    `SNOWFLAKE_DEPLOYMENT_REGIONLESS`
    :   Host name and port number information for your [organization](../../user-guide/organizations.md).

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md).

    `STAGE`
    :   Location (such as Amazon S3, Google Cloud Storage, or Microsoft Azure) where files that the Snowflake client can read or write are stored.

    `SNOWSQL_REPO`
    :   Endpoint accessed by SnowSQL to perform automatic downloads or upgrades.

    `OUT_OF_BAND_TELEMETRY`
    :   The hosts to which drivers report metrics and out-of-band incidents such as OCSP issues.

    `CLIENT_FAILOVER`
    :   Host name and port number for the connection URL for [Client Redirect](../../user-guide/client-redirect.md). Note that each row in the query
        output that specifies this value refers to either the primary connection or the secondary connection depending on how the connection
        URLs were configured.

    `CRL_DISTRIBUTION_POINT`
    :   Host name and port number for certificate revocation list (CRL) distribution endpoints.

    `OCSP_CACHE`
    :   Snowflake-provided alternative source of OCSP certificate information in case the primary OCSP responder cannot be reached. Most of the
        latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CACHE_REGIONLESS`
    :   Snowflake-provided alternative source of OCSP certificate information for your [organization](../../user-guide/organizations.md). Most of
        the latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CLIENT_FAILOVER`
    :   Snowflake-provided alternative source of OCSP certificate information for [Client Redirect](../../user-guide/client-redirect.md).

    `DUO_SECURITY`
    :   The host name for the Duo Security service that is used with [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md) while authenticating to Snowflake.

    `OCSP_RESPONDER`
    :   Host name to contact to verify that the OCSP TLS certificate has not been revoked.

        Note that this value is not necessary when configuring private connectivity to the Snowflake service ; follow the instructions in the
        corresponding topic to select the OCSP value to add to your allowlist.

    `SNOWSIGHT_DEPLOYMENT_REGIONLESS`
    :   Host name and port number for your [organization](../../user-guide/organizations.md) to access Snowsight.

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

    `SNOWSIGHT_DEPLOYMENT`
    :   Host name and port number to access [Snowsight](../../user-guide/ui-snowsight.md) for your Snowflake account.

`host`
:   Specifies the full host name for `type`, for example: `"xy12345.east-us-2.azure.snowflakecomputing.com"`, `"ocsp.snowflakecomputing.com"`.

`port`
:   Specifies the port number for `type`, for example: `443`, `80`.

## Usage notes

* The output might include multiple entries for certain types (e.g. `STAGE`, `OCSP_RESPONDER`).
* Occasionally, Snowflake cannot resolve the socket connection from the client that calls the function, and the statement that calls the
  function fails with one of the following error messages:

  > ```output
  > SYSTEM$ALLOWLIST: Fail to get SSL context
  > SYSTEM$ALLOWLIST: SSLContext init failed
  > SYSTEM$ALLOWLIST: Could not find host in OCSP dumping
  > SYSTEM$ALLOWLIST: Peer unverified
  > SYSTEM$ALLOWLIST: Connection failure
  > ```

  Additionally, Snowflake returns an empty list for the OCSP fields in the function output. To troubleshoot, you can wait a few minutes and
  rerun the statement if the network connection is transient. If the issue persists, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Examples

To call the function:

> ```sqlexample
> SELECT SYSTEM$ALLOWLIST();
> ```
>
> Sample output:
>
> ```sqljson
> [
>   {"type":"SNOWFLAKE_DEPLOYMENT",    "host":"xy12345.snowflakecomputing.com",                 "port":443},
>   {"type":"STAGE",                   "host":"sfc-customer-stage.s3.us-west-2.amazonaws.com",  "port":443},
>   ...
>   {"type":"SNOWSQL_REPO",            "host":"sfc-repo.snowflakecomputing.com",                "port":443},
>   ...
>   {"type":"CRL_DISTRIBUTION_POINT",  "host":"crl.r2m01.amazontrust.com",                       "port":80},
>   ...
>   {"type":"OCSP_CACHE",              "host":"ocsp.snowflakecomputing.com",                     "port":80},
>   {"type":"OCSP_RESPONDER",          "host":"o.ss2.us",                                        "port":80},
>   ...
> ]
> ```
>
> In this sample output, note the following:
>
> * For readability, whitespace and newline characters have been added. In addition, some entries have been omitted.
> * The region ID (`us-west-2`) in some of the host names indicates the account is in the US West region; however, the region ID
>   is not utilized in the host name for `SNOWFLAKE_DEPLOYMENT`.

To extract the information into tabular output rather than JSON, use the [FLATTEN](flatten.md) function in conjunction with the [PARSE_JSON](parse_json.md)
function:

> ```sqlexample
> SELECT t.VALUE:type::VARCHAR as type,
>        t.VALUE:host::VARCHAR as host,
>        t.VALUE:port as port
> FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$ALLOWLIST()))) AS t;
> ```
>
> Sample output:
>
> ```none
> +------------------------+---------------------------------------------------+------+
> | TYPE                   | HOST                                              | PORT |
> |------------------------+---------------------------------------------------+------|
> | SNOWFLAKE_DEPLOYMENT   | xy12345.snowflakecomputing.com                    | 443  |
> | STAGE                  | sfc-customer-stage.s3.us-west-2.amazonaws.com     | 443  |
>   ...
> | SNOWSQL_REPO           | sfc-repo.snowflakecomputing.com                   | 443  |
>   ...
> | CRL_DISTRIBUTION_POINT | crl.r2m01.amazontrust.com                         | 80   |
>   ...
> | OCSP_CACHE             | ocsp.snowflakecomputing.com                       | 80   |
> | OCSP_RESPONDER         | ocsp.sca1b.amazontrust.com                        | 80   |
>   ...
> +------------------------+---------------------------------------------------+------+
> ```

---
title: SYSTEM$ALLOWLIST_PRIVATELINK
source: https://docs.snowflake.com/en/sql-reference/functions/system_allowlist_privatelink.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ALLOWLIST_PRIVATELINK

Returns host names and port numbers for [AWS PrivateLink](https://aws.amazon.com/privatelink/),
[Azure Private Link](https://azure.microsoft.com/en-us/services/private-link/), and
[Google Cloud Private Service Connect](https://cloud.google.com/vpc/docs/configure-private-service-connect-services) deployments to add
to your firewall’s allowed list so that you can access Snowflake from behind your firewall. These features provide private connectivity to
the Snowflake service on each supported cloud platform.

The output of this function can then be passed into [SnowCD](../../user-guide/snowcd.md) to diagnose and troubleshoot your network connection
to Snowflake.

Typically, Snowflake customers use a firewall to prevent unauthorized access. By default, your firewall might block access to Snowflake. To
update your firewall’s allowed list, you need to know the host names and port numbers for the URL associated with your Snowflake
[account identifier](../../user-guide/admin-account-identifier.md), stages, and other hosts used by Snowflake.

For more details about allowed lists for the Snowflake clients you use, see [Allowing Host names](../../user-guide/hostname-allowlist.md).

## Syntax

```sqlsyntax
SYSTEM$ALLOWLIST_PRIVATELINK()
```

## Arguments

None.

## Returns

The data type of the returned value is `VARIANT`. The value is an array of JSON structures. Each JSON structure contains three key/value
pairs:

`type`
:   Snowflake supports the following types:

    `SNOWFLAKE_DEPLOYMENT`
    :   Host name and port number information for your Snowflake account.

    `SNOWFLAKE_DEPLOYMENT_REGIONLESS`
    :   Host name and port number information for your [organization](../../user-guide/organizations.md).

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md).

    `STAGE`
    :   Location (such as Amazon S3, Google Cloud Storage, or Microsoft Azure) where files that the Snowflake client can read or write are stored.

    `SNOWSQL_REPO`
    :   Endpoint accessed by SnowSQL to perform automatic downloads or upgrades.

    `OUT_OF_BAND_TELEMETRY`
    :   The hosts to which drivers report metrics and out-of-band incidents such as OCSP issues.

    `CLIENT_FAILOVER`
    :   Host name and port number for the connection URL for [Client Redirect](../../user-guide/client-redirect.md). Note that each row in the query
        output that specifies this value refers to either the primary connection or the secondary connection depending on how the connection
        URLs were configured.

    `CRL_DISTRIBUTION_POINT`
    :   Host name and port number for certificate revocation list (CRL) distribution endpoints.

    `OCSP_CACHE`
    :   Snowflake-provided alternative source of OCSP certificate information in case the primary OCSP responder cannot be reached. Most of the
        latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CACHE_REGIONLESS`
    :   Snowflake-provided alternative source of OCSP certificate information for your [organization](../../user-guide/organizations.md). Most of
        the latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CLIENT_FAILOVER`
    :   Snowflake-provided alternative source of OCSP certificate information for [Client Redirect](../../user-guide/client-redirect.md).

    `DUO_SECURITY`
    :   The host name for the Duo Security service that is used with [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md) while authenticating to Snowflake.

    `OCSP_RESPONDER`
    :   Host name to contact to verify that the OCSP TLS certificate has not been revoked.

        Note that this value is not necessary when configuring private connectivity to the Snowflake service ; follow the instructions in the
        corresponding topic to select the OCSP value to add to your allowlist.

    `SNOWSIGHT_DEPLOYMENT_REGIONLESS`
    :   Host name and port number for your [organization](../../user-guide/organizations.md) to access Snowsight.

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

    `SNOWSIGHT_DEPLOYMENT`
    :   Host name and port number to access [Snowsight](../../user-guide/ui-snowsight.md) for your Snowflake account.

`host`
:   Specifies the full host name for `type`, for example: `"xy12345.east-us-2.azure.snowflakecomputing.com"`, `"ocsp.snowflakecomputing.com"`.

`port`
:   Specifies the port number for `type`, for example: `443`, `80`.

## Usage notes

The output may include multiple entries for certain types (`STAGE`, etc.).

## Examples

To call the function:

> ```sqlexample
> SELECT SYSTEM$ALLOWLIST_PRIVATELINK();
> ```
>
> Sample output:
>
> ```sqljson
> [
>   {"type":"SNOWFLAKE_DEPLOYMENT",  "host":"xy12345.us-west-2.privatelink.snowflakecomputing.com",            "port":443},
>   {"type":"STAGE",                 "host":"sfc-ss-ds2-customer-stage.s3.us-west-2.amazonaws.com",            "port":443},
>   ...
>   {"type":"SNOWSQL_REPO",           "host":"sfc-repo.snowflakecomputing.com",                                "port":443},
>   ...
>   {"type":"OUT_OF_BAND_TELEMETRY",  "host":"client-telemetry.snowflakecomputing.com",                        "port":443},
>   {"type":"CRL_DISTRIBUTION_POINT", "host":"crl.r2m01.amazontrust.com",                                      "port":80},
>   ...
>   {"type":"OCSP_CACHE",             "host":"ocsp.station00752.us-west-2.privatelink.snowflakecomputing.com", "port":80}
> ]
> ```
>
> In this sample output, note the following:
>
> * For readability, whitespace and newline characters have been added. In addition, some entries have been omitted.
> * The region ID (`us-west-2`) in some of the host names indicates the account is in the US West region ; however, the region ID is not utilized in the host name for `SNOWFLAKE_DEPLOYMENT`.

To extract the information into tabular output rather than JSON, use the [FLATTEN](flatten.md) function in conjunction with the [PARSE_JSON](parse_json.md) function:

> ```sqlexample
> SELECT t.VALUE:type::VARCHAR as type,
>        t.VALUE:host::VARCHAR as host,
>        t.VALUE:port as port
> FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$ALLOWLIST_PRIVATELINK()))) AS t;
> ```
>
> Sample output:
>
> ```none
> +------------------------+---------------------------------------------------+------+
> | TYPE                   | HOST                                              | PORT |
> +------------------------+---------------------------------------------------+------+
> | SNOWFLAKE_DEPLOYMENT   | xy12345.snowflakecomputing.com                    | 443  |
> | STAGE                  | sfc-customer-stage.s3.us-west-2.amazonaws.com     | 443  |
>   ...
> | SNOWSQL_REPO           | sfc-repo.snowflakecomputing.com                   | 443  |
>   ...
> | CRL_DISTRIBUTION_POINT | crl.r2m01.amazontrust.com                         | 80   |
>   ...
> | OCSP_CACHE             | ocsp.snowflakecomputing.com                       | 80   |
>   ...
> +------------------------+---------------------------------------------------+------+
> ```

---
title: SYSTEM$APP_COMPATIBILITY_CHECK
source: https://docs.snowflake.com/en/sql-reference/functions/system_app_compatibility_check.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$APP_COMPATIBILITY_CHECK

Returns the [Snowflake edition](../../user-guide/intro-editions.md) of the consumer account
where an app is installed.

> **Note:**
>
> This function can only be called by a Snowflake Native App.

## Syntax

```sqlsyntax
SYSTEM$APP_COMPATIBILITY_CHECK()
```

## Returns

Returns a VARCHAR value containing a JSON object. This object has the following
structure:

```json
{
   "ACCOUNT_EDITION": "<service_level>"
}
```

Possible values for `service_level` are:

* `STANDARD`
* `PREMIER`
* `PREMIER_PLUS_1`
* `PREMIER_PLUS_2`
* `ENTERPRISE`
* `BUSINESS_CRITICAL`
* `VPS`

## Usage notes

* Providers can use this function to determine the Snowflake edition of the account where
  the app is installed. For example, providers can call this function from the setup script
  to check for the edition during installation.

## Examples

Determine the Snowflake edition for a consumer account:

```sqlexample
SELECT SYSTEM$APP_COMPATIBILITY_CHECK();
```

```json
{
  "ACCOUNT_EDITION": "STANDARD"
}
```

This indicates that the consumer account is a Standard Edition account.

---
title: SYSTEM$APPLICATION_GET_LOG_LEVEL
source: https://docs.snowflake.com/en/sql-reference/functions/system_application_get_log_level.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$APPLICATION_GET_LOG_LEVEL

Returns the log level for the specified object. The following objects are supported:

* Functions
* Schemas
* Stored procedures
* Versioned schemas

## Syntax

```sqlsyntax
SYSTEM$APPLICATION_GET_LOG_LEVEL( '<schema_name>.<object_name>' )
```

## Arguments

`'schema_name.object_name'`
:   The name of schema (or versioned schema) and object you want to determine the log
    level for.

## Usage notes

* This function can only be called by a Snowflake Native App and must be run as the APP_PRIMARY
  role.

---
title: SYSTEM$APPLICATION_GET_METRIC_LEVEL
source: https://docs.snowflake.com/en/sql-reference/functions/system_application_get_metric_level.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$APPLICATION_GET_METRIC_LEVEL

Returns the metric level for the specified object. The following objects are supported:

* Functions
* Schemas
* Stored procedures
* Versioned schemas

## Syntax

```sqlsyntax
SYSTEM$APPLICATION_GET_METRIC_LEVEL( '<schema_name>.<object_name>' )
```

## Arguments

`'schema_name.object_name'`
:   The name of schema (or versioned schema) and object you want to determine the log
    level for.

## Usage notes

* This function can only be called by a Snowflake Native App and must be run as the APP_PRIMARY
  role.

---
title: SYSTEM$APPLICATION_GET_TRACE_LEVEL
source: https://docs.snowflake.com/en/sql-reference/functions/system_application_get_trace_level.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$APPLICATION_GET_TRACE_LEVEL

Returns the trace level for the specified object. The following objects are supported:

* Functions
* Schemas
* Stored procedures
* Versioned schemas

## Syntax

```sqlsyntax
SYSTEM$APPLICATION_GET_TRACE_LEVEL( '<schema_name>.<object_name>' )
```

## Arguments

`'schema_name.object_name'`
:   The name of schema (or versioned schema) and object you want to determine the log
    level for.

## Usage notes

* This function can only be called by a Snowflake Native App and must be run as the APP_PRIMARY
  role.

## Examples

```sqlexample
SELECT SYSTEM$APPLICATION_GET_TRACE_LEVEL('my_schema');
```

---
title: SYSTEM$AUTHORIZE_PRIVATELINK
source: https://docs.snowflake.com/en/sql-reference/functions/system_authorize_privatelink.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$AUTHORIZE_PRIVATELINK

Enables private connectivity to the Snowflake service for the current account.

See also:
:   [SYSTEM$REVOKE_PRIVATELINK](system_revoke_privatelink.md) , [SYSTEM$GET_PRIVATELINK](system_get_privatelink.md) ,
    [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](system_get_privatelink_authorized_endpoints.md)

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$AUTHORIZE_PRIVATELINK( '<aws_id>' , '<federated_token>' )
> ```

**Azure:**

> ```sqlsyntax
> SYSTEM$AUTHORIZE_PRIVATELINK( '<private-endpoint-resource-id>' , '<federated_token>' )
> ```

**GCP**

> ```sqlsyntax
> SYSTEM$AUTHORIZE_PRIVATELINK( '<gcp_project_id>' , '<access_token>' )
> ```

## Arguments

`'aws_id'`
:   The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.

`'private-endpoint-resource-id'`
:   The identifier that uniquely identifies your Snowflake account in Microsoft Azure (Azure) as a string.

`'federated_token'`
:   The federated token value that contains access credentials for a federated user as a string.

    To obtain this value, execute the appropriate command for the cloud platform that hosts your Snowflake account. Use the command-line tool
    provided by the platform:

    * For Snowflake on AWS:

      ```bash
      aws sts get-federation-token --name sam
      ```
    * For Snowflake on Azure:

      ```bash
      az account get-access-token --subscription <SubscriptionID>
      ```

      Where:

      + `SubscriptionID`
        :   The unique identifier for your subscription. For example:

            > `13c...`

            To obtain this value, execute the following Azure CLI command in your command-line environment:

            > ```bash
            > az account list --output table
            > ```
            >
            > Note the output value in the `SubscriptionID` column, which is truncated in this example:
            >
            > > ```text
            > > Name     CloudName   SubscriptionId                        State    IsDefault
            > > -------  ----------  ------------------------------------  -------  ----------
            > > MyCloud  AzureCloud  13c....                               Enabled  True
            > > ```

`'gcp_project_id'`
:   The identifier that uniquely identifies your Google Cloud (GCP) project, as a string.

`'access_token'`
:   The access token value that contains access credentials for a Google Cloud user as a string.

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can be used with Snowflake accounts on AWS or Azure; Google Cloud Platform (GCP) is not currently supported.
* Call the [SYSTEM$GET_PRIVATELINK](system_get_privatelink.md) function to verify whether your Snowflake account is authorized
  to use private connectivity to the Snowflake service.
* Call the [SYSTEM$REVOKE_PRIVATELINK](system_revoke_privatelink.md) function disable your Snowflake account to use private
  connectivity to the Snowflake service.

## Examples

Enable AWS PrivateLink for your Snowflake account on AWS. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$AUTHORIZE_PRIVATELINK(
>     '185...',
>     '{
>       "Credentials": {
>           "AccessKeyId": "ASI...",
>           "SecretAccessKey": "enw...",
>           "SessionToken": "Fwo...",
>           "Expiration": "2021-01-07T19:06:23+00:00"
>       },
>       "FederatedUser": {
>           "FederatedUserId": "185...:sam",
>           "Arn": "arn:aws:sts::185...:federated-user/sam"
>       },
>       "PackedPolicySize": 0
>   }'
>   );
> ```

Enable Azure Private Link for your Snowflake account on Azure. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$AUTHORIZE_PRIVATELINK(
>   '/subscriptions/26d.../resourcegroups/sf-1/providers/microsoft.network/privateendpoints/test-self-service',
>   'eyJ...');
> ```

Enable Google Cloud Private Service Connect for your Snowflake account on GCP:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$AUTHORIZE_PRIVATELINK(
>   'my-gcp-project-id',
>   'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
>   );
> ```

---
title: SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_authorize_snowflake_managed_storage_volume_privatelink_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS

Authorizes Snowflake to access the private endpoint for
[Azure private endpoints for Snowflake-managed storage volumes](../../user-guide/private-managed-volumes-azure.md) for the current account.

See also:
:   [SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](system_revoke_snowflake_managed_storage_volume_privatelink_access.md)

## Syntax

```sqlsyntax
SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS( '<private_endpoint_resource_id>' )
```

## Arguments

`'private_endpoint_resource_id'`
:   The unique identifier for the Azure Private Endpoint.

    For instructions on how to obtain this value, see
    [Configuring private endpoints to access Snowflake-managed storage volumes](../../user-guide/private-managed-volumes-azure.md).

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can call this function.
* This function is supported for Snowflake accounts on Microsoft Azure only.

## Examples

Authorize Snowflake to access an Azure private endpoint for a Snowflake-managed storage volume:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS(
>   '/subscriptions/subId/resourceGroups/rg1/providers/Microsoft.Network/privateEndpoints/pe1');
> ```

---
title: SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_authorize_stage_privatelink_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS

Authorizes Snowflake to access the private endpoint for [Azure private endpoints for internal stages](../../user-guide/private-internal-stages-azure.md) and
[Google Private Service Connect endpoints for internal stages](../../user-guide/private-internal-stages-gcp.md) for the current account.

See also:
:   [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](system_revoke_stage_privatelink_access.md)

## Syntax

**Azure**

```sqlsyntax
SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS( '<private_endpoint_resource_id>' )
```

**Google Cloud**

```sqlsyntax
SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS( '<google_cloud_vpc_network_name>' )
```

## Arguments

`'private_endpoint_resource_id'`
:   The unique identifier for the Azure Private Endpoint.

`'google_cloud_vpc_network_name'`
:   The fully qualified path value for the Google Cloud VPC Network.

    This value is from the Google Cloud VPC network path that Snowflake uses to limit access to your internal stage through the cloud provider’s internal network and avoid using the public internet.

    For instructions on how to obtain this value on Azure, see [Configuring private endpoints to access Snowflake internal stages](../../user-guide/private-internal-stages-azure.md); for Google Cloud, see [Configure private endpoints to access Snowflake internal stages](../../user-guide/private-internal-stages-gcp.md).

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can call this function.
* This function is not supported for Snowflake accounts on
  Amazon Web Services (AWS).

## Examples

**Azure**

Authorize Snowflake to access an Azure private endpoint:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS('/subscriptions/subId/resourceGroups/rg1/providers/Microsoft.Network/privateEndpoints/pe1');
> ```

**Google Cloud**

Authorize Snowflake to access a Google Private Service Connect endpoint:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS('projects/vpc_network_name/global/networks/network_name');
> ```

---
title: SYSTEM$AUTO_REFRESH_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_auto_refresh_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$AUTO_REFRESH_STATUS

Returns the automated refresh status for an externally managed [Iceberg table](../../user-guide/tables-iceberg.md).

> **Note:**
>
> To return this refresh status for all applicable externally managed Apache Iceberg™ tables for which you have access privileges, run the [SHOW ICEBERG TABLES](../sql/show-iceberg-tables.md)
> command and see the `auto_refresh_status` column in the output.

## Syntax

```sqlsyntax
SYSTEM$AUTO_REFRESH_STATUS('<table_name>')
```

## Arguments

`'table_name'`
:   The name of the Iceberg table for which you want to retrieve the current automated refresh status.

    If using the fully qualified name, enclose the entire name in single quotes, including the database and schema.
    If the table name is case-sensitive or includes any special characters or spaces, you must use double quotes.
    Enclose the double quotes within the single quotes, for example, `'"Table_Name"'`.

## Returns

The function returns a JSON object containing the following name/value pairs:

```sqljson
{
  "executionState":"<value>",
  "invalidExecutionStateReason":"<value>",
  "pendingSnapshotCount":"<value>",
  "oldestSnapshotTime":"<value>",
  "currentSnapshotId":"<value>",
  "currentSnapshotSummary":"<value>",
  "lastSnapshotTime":"<value>",
  "lastUpdatedTime":"<value>",
  "currentMetadataFile":"<value>",
  "currentSchemaId":"<value>"
}
```

Where:

> `executionState`
> :   Current execution state of the pipe that Snowflake uses to automate metadata refreshes for the table.
>
>     Values:
>
>     * `RUNNING`: Automated refresh is running as expected. This status doesn’t indicate whether Snowflake is actively processing
>       event messages for the pipe.
>     * `STALLED`: Automated refresh encountered an error and is attempting to recover.
>     * `STOPPED`: Automated refresh encountered an unrecoverable error and is stopped unless you take further action. For more information,
>       see [Error recovery](../../user-guide/tables-iceberg-auto-refresh.md).
>     * `ICEBERG_TABLE_NOT_INITIALIZED`: Automated refresh isn’t initialized because an error occurred when Snowflake attempted to create the table.
>       To run automated refresh, you must resolve the error, and then [enable automated refresh for the table](../../user-guide/tables-iceberg-auto-refresh.md).
>       This execution state only occurs for tables in a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).
>
> `invalidExecutionStateReason`
> :   Error message associated with a `STALLED` or `STOPPED` execution state.
>
> `pendingSnapshotCount`
> :   Number of snapshots queued for automated refresh.
>
> `oldestSnapshotTime`
> :   Earliest timestamp among queued snapshots. Snowflake sets the timestamp for a snapshot when the snapshot is added to the queue.
>
> `currentSnapshotId`
> :   ID of the current snapshot that Snowflake is tracking. This represents the snapshot that the current table data corresponds to.
>
> `currentSnapshotSummary`
> :   The Iceberg snapshot summary from the `metadata.json` file. NULL if not present in the metadata file.
>
> `lastSnapshotTime`
> :   Creation timestamp for the current snapshot according to Iceberg metadata.
>     This timestamp corresponds to when the current snapshot was generated in the external catalog.
>
> `lastUpdatedTime`
> :   Timestamp that indicates when Snowflake successfully processed the current snapshot.
>     The difference between this value and the `lastSnapshotTime` indicates the latency between when snapshots
>     are created in the external catalog and when Snowflake successfully refreshes the table metadata.
>
>     To decrease the latency, adjust the `REFRESH_INTERVAL_SECONDS` parameter for the catalog integration associated with the table.
>
> `currentMetadataFile`
> :   The full path to the current metadata file.
>
> `currentSchemaId`
> :   ID of the current schema.

## Usage notes

* Calling this function requires a role that has the OWNERSHIP privilege on the Iceberg table.
* For Delta-based tables, note the following:

  + In the context of this function and automated refresh in Snowflake, the term “snapshot” refers to a Delta commit.
  + The function doesn’t return a value for `lastSnapshotTime`.

## Examples

Retrieve the automated refresh status for the table `my_iceberg_table` in the schema `db1.schema1`:

```sqlexample
SELECT SYSTEM$AUTO_REFRESH_STATUS('db1.schema1.my_iceberg_table');
```

---
title: SYSTEM$BEGIN_DEBUG_APPLICATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_begin_debug_application.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BEGIN_DEBUG_APPLICATION

Enables [session debug mode](../../developer-guide/native-apps/installing-testing-application.md) for a Snowflake Native App.

## Syntax

```sqlsyntax
SYSTEM$BEGIN_DEBUG_APPLICATION( '<app_name>' [ , <execution_mode>] )
```

## Arguments

`'app_name'`
:   The name of the app on which session debug mode is being enabled.

`execution_mode =`
:   The behavior of commands run during session debug mode. Possible values are:

    * `'AS_APPLICATION'` (DEFAULT)

      All statements are executed as using the same privileges as the app. This mimics the
      behavior of the app in the consumer account.
    * `'AS_SETUP_SCRIPT'`

      All statements are executed using the same privileges as the setup script of the app. This
      allows providers to test the setup script using session debug mode.

## Usage notes

* Providers can use this function to enable session debug mode on an app created using development mode.
  This allows providers to test the behavior of the app and setup script.

## Examples

The following example shows how to set the execution mode to `AS_APPLICATION`:

```sqlexample
SELECT SYSTEM$BEGIN_DEBUG_APPLICATION( 'hello_snowflake_app', execution_mode ='AS_APPLICATION')
```

The following example show how to set the execution mode to `AS_SETUP_SCRIPT`:

```sqlexample
SELECT SYSTEM$BEGIN_DEBUG_APPLICATION( 'hello_snowflake_app', execution_mode = 'AS_SETUP_SCRIPT')
```

---
title: SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_behavior_change_bundle_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS

Returns the status of the specified [behavior change release bundle](../../release-notes/behavior-change-policy.md) for the current account.

See also:
:   [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](system_enable_behavior_change_bundle.md),
    [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](system_disable_behavior_change_bundle.md),
    [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](system_show_active_behavior_change_bundles.md)

## Syntax

```sqlsyntax
SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS( '<bundle_name>' )
```

## Arguments

`bundle_name`
:   Name of the behavior change bundle, specified as a string. To obtain the name for a bundle, see
    [Behavior change announcements](../../release-notes/behavior-changes.md).

## Returns

Returns one of the following VARCHAR values:

* `ENABLED` (if the specified bundle is enabled for the current account)
* `DISABLED` (if the specified bundle is disabled for the current account)
* `RELEASED` (if the specified bundle is
  [generally enabled](../../release-notes/behavior-change-policy.md) for the current account and thus permanently enabled)

## Examples

The following example returns the status of the `2020_08` behavior change bundle for the current account.

```sqlexample
SELECT SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS('2020_08');
```

```output
+-------------------------------------------------+
| SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS('2020_08') |
|-------------------------------------------------|
| DISABLED                                        |
+-------------------------------------------------+
```

---
title: SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_block_internal_stages_public_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS

Prevents all public traffic from accessing the internal stage of the current Snowflake account on Microsoft Azure.

This function uses settings for the internal stage’s Azure storage account to block public IP addresses. For details on which Azure settings
are affected, refer to [Blocking public access — Recommended](../../user-guide/private-internal-stages-azure.md).

> **Important:**
>
> Confirm that traffic via private connectivity is successfully reaching the internal stage before blocking public access. Blocking
> public access without configuring private connectivity can cause unintended disruptions, including interference with managed services like
> Azure Data Factory.

See also:
:   [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_unblock_internal_stages_public_access.md), [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](system_internal_stages_public_access_status.md)

## Syntax

> ```sqlsyntax
> SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to internal stages is blocked. Private link is required to connect to internal stages of this account. | Indicates that the function successfully blocked public access. |
| Network config is not found, Please contact support | Indicates that there is a problem with the system parameters. |
| Azure Error when attempting to block public access to internal stages. Please contact Snowflake support. | Indicates that the function was unable to change the Azure settings in order to block public access. |

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud are not supported.

## Examples

Block all public traffic trying to access the internal stage of an Azure account.

> ```sqlexample
> USE ROLE accountadmin;
>
> SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS();
> ```

---
title: SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION
source: https://docs.snowflake.com/en/sql-reference/functions/system_block_internal_stages_public_access_with_exception.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION

Prevents public traffic from accessing the internal stage of the current Snowflake account on Microsoft Azure, while allowing
access from specified IP addresses or CIDR blocks.

This function is similar to [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_block_internal_stages_public_access.md). Instead of blocking
all public IP addresses, this function maintains an allowlist of IP addresses or CIDR blocks that are still permitted to access
the internal stage.

Calling SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION when an allowlist already exists replaces the existing allowlist with the
new one.

For more information, see [Blocking public access with IP allowlist exceptions](../../user-guide/private-internal-stages-azure.md).

> **Important:**
>
> Confirm that traffic via private connectivity is successfully reaching the internal stage before blocking public access.
> Blocking public access without configuring private connectivity can cause unintended disruptions, including interference with
> managed services like Microsoft Azure Data Factory.

See also:
:   [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_unblock_internal_stages_public_access.md),
    [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_block_internal_stages_public_access.md),
    [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](system_internal_stages_public_access_status.md)

## Syntax

```sqlsyntax
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION( '<ip_address_or_cidr_range>' [ , '<ip_address_or_cidr_range>' , ... ] )
```

## Arguments

`'ip_address_or_cidr_range'`
:   A string that specifies one of the following:

    * A single IP address, such as `'100.0.0.1'`.
    * A range of IP addresses using Classless Inter-Domain Routing (CIDR) notation:

      `ip_address/prefix_length`

      For example, `'1.2.3.0/24'` or `'101.0.0.0/31'`.

    IP addresses or CIDR ranges specified in this argument are allowed to access the internal stage. Specify multiple values as
    separate, comma-separated arguments.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to internal stages is blocked. Private link is required to connect to internal stages of this account. Exceptions: <ip_or_cidr_list> | Indicates that the function successfully blocked public access and set the specified IP allowlist. |
| Network config is not found, Please contact support | Indicates that there is a problem with the system parameters. |
| Microsoft Azure Error when attempting to block public access to internal stages. Please contact Snowflake support. | Indicates that the function was unable to change the Microsoft Azure settings in order to block public access. |

## Usage notes

* Only account administrators, that is users with the ACCOUNTADMIN role can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Microsoft Azure only. Amazon Web Services and Google Cloud are not supported.
* Calling this function replaces any existing IP allowlist. To modify the allowlist, call the function again with the complete
  updated list.

## Examples

Block public access while allowing specific IP addresses and CIDR blocks:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION(100.0.0.1, '1.2.3.4/24, 101.0.0.0/31');
```

---
title: SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_block_snowflake_managed_storage_volume_public_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS

Prevents all public traffic from accessing the Snowflake-managed storage volume of the current Snowflake account on Microsoft
Azure.

This function uses settings for the managed storage volume’s Azure storage account to block public IP addresses. For details on
which Azure settings are affected, refer to [Blocking public access](../../user-guide/private-managed-volumes-azure.md).

> **Important:**
>
> Confirm that traffic via private connectivity is successfully reaching the managed storage volume before blocking
> public access. Blocking public access without configuring private connectivity can cause unintended disruptions.

See also:
:   [SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_unblock_snowflake_managed_storage_volume_public_access.md),
    [SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS](system_snowflake_managed_storage_volume_public_access_status.md)

## Syntax

> ```sqlsyntax
> SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to Snowflake-managed storage volumes is blocked. | Indicates that the function successfully blocked public access. |
| Network config is not found, Please contact support | Indicates that there is a problem with the system parameters. |
| No interop volumes configured on account | Indicates that there are no Snowflake-managed storage volumes configured for the account. |
| Azure Error when attempting to block public access to Snowflake-managed storage volumes. Please contact Snowflake support. | Indicates that the function was unable to change the Azure settings in order to block public access. |

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud are not supported.

## Examples

Block all public traffic trying to access the Snowflake-managed storage volume of an Azure account.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS();
> ```

---
title: SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION
source: https://docs.snowflake.com/en/sql-reference/functions/system_block_snowflake_managed_storage_volume_public_access_with_exception.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION

Prevents public traffic from accessing the Snowflake-managed storage volume of the current Snowflake account on Microsoft Azure, while
allowing access from specified IP addresses or CIDR blocks.

This function is similar to
[SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_block_snowflake_managed_storage_volume_public_access.md). Instead of blocking all
public IP addresses, this function maintains an allowlist of IP addresses or CIDR blocks that are still permitted to access the
Snowflake-managed storage volume.

Calling SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION when an allowlist already exists replaces the
existing allowlist with the new one.

For more information, see [Blocking public access](../../user-guide/private-managed-volumes-azure.md).

> **Important:**
>
> Confirm that traffic via private connectivity is successfully reaching the Snowflake-managed storage volume before blocking
> public access. Blocking public access without configuring private connectivity can cause unintended disruptions.

See also:
:   [SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_unblock_snowflake_managed_storage_volume_public_access.md),
    [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_block_snowflake_managed_storage_volume_public_access.md),
    [SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS](system_snowflake_managed_storage_volume_public_access_status.md)

## Syntax

```sqlsyntax
SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION( '<ip_address_or_cidr_range>' [ , '<ip_address_or_cidr_range>' , ... ] )
```

## Arguments

`'ip_address_or_cidr_range'`
:   A string that specifies one of the following:

    * A single IP address, such as `'100.0.0.1'`.
    * A range of IP addresses using Classless Inter-Domain Routing (CIDR) notation:

      `ip_address/prefix_length`

      For example, `'1.2.3.0/24'` or `'101.0.0.0/31'`.

    IP addresses or CIDR ranges specified in this argument are allowed to access the Snowflake-managed storage volume. Specify multiple values
    as separate, comma-separated arguments.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to Snowflake-managed storage volumes is blocked. | Indicates that the function successfully blocked public access. |
| Network config is not found, Please contact support | Indicates that there is a problem with the system parameters. |
| No interop volumes configured on account | Indicates that there are no Snowflake-managed storage volumes configured for the account. |
| Azure Error when attempting to block public access to Snowflake-managed storage volumes. Please contact Snowflake support. | Indicates that the function was unable to change the Azure settings in order to block public access. |

## Usage notes

* Only account administrators, that is users with the ACCOUNTADMIN role can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Microsoft Azure only. Amazon Web Services and Google Cloud are not supported.
* Calling this function replaces any existing IP allowlist. To modify the allowlist, call the function again with the complete
  updated list.

## Examples

Block public access while allowing specific IP addresses and CIDR blocks:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION('100.0.0.1', '1.2.3.4/24', '101.0.0.0/31');
```

---
title: SYSTEM$CANCEL_ALL_QUERIES
source: https://docs.snowflake.com/en/sql-reference/functions/system_cancel_all_queries.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$CANCEL_ALL_QUERIES

Cancels all active/running queries in the specified session.

See also:
:   [SYSTEM$CANCEL_QUERY](system_cancel_query.md)

## Syntax

```sqlsyntax
SYSTEM$CANCEL_ALL_QUERIES( <session_id> )
```

## Arguments

`session_id`
:   Identifier for the session for which to cancel all queries. To obtain the ID for a session, log into the web interface as an account administrator (user with the ACCOUNTADMIN role) and go to:

    > Account  » Sessions

## Usage notes

* A user can cancel their own running SQL operations using this SQL function. Canceling running operations executed by another user
  requires a role with one of the following privileges:

  + OWNERSHIP on the user who executed the operation.
  + OPERATE or OWNERSHIP on the warehouse that is running the operation (if applicable).

  Note that the ACCOUNTADMIN role is not necessarily granted any of these privileges.
* This function is not intended for canceling queries for a particular warehouse or user. Instead, use:

  > + [ALTER WAREHOUSE … ABORT ALL QUERIES](../sql/alter-warehouse.md)
  > + [ALTER USER … ABORT ALL QUERIES](../sql/alter-user.md)

## Examples

```sqlexample
SELECT SYSTEM$CANCEL_ALL_QUERIES(1065153872298);

+------------------------------------------+
| SYSTEM$CANCEL_ALL_QUERIES(1065153872298) |
|------------------------------------------|
| 1 cancelled.                             |
+------------------------------------------+
```

For a more detailed, working example, see [Canceling Statements](../../user-guide/querying-cancel-statements.md).

---
title: SYSTEM$CANCEL_QUERY
source: https://docs.snowflake.com/en/sql-reference/functions/system_cancel_query.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$CANCEL_QUERY

Cancels the specified query (or statement) if it is currently active/running.

See also:
:   [SYSTEM$CANCEL_ALL_QUERIES](system_cancel_all_queries.md)

## Syntax

```sqlsyntax
SYSTEM$CANCEL_QUERY( <query_id> )
```

## Arguments

`query_id`
:   Identifier for the query to cancel. To obtain the ID for a query executed within the last 14 days, log into the web interface and go to the History  page.

## Usage notes

* A user can cancel their own running SQL operations using this SQL function. Canceling running operations executed by another user
  requires a role with one of the following privileges:

  + OWNERSHIP on the user who executed the operation.
  + OPERATE or OWNERSHIP on the warehouse that is running the operation (if applicable).
  + ACCOUNTADMIN role.
* For a query run by a [task](../../user-guide/tasks-intro.md), canceling running operations requires a role with one of the following privileges:

  + OPERATE or OWNERSHIP on the task that is running the operation.
  + ACCOUNTADMIN role.
* Snowflake query IDs are UUID text strings with hyphens, which are special characters, so the strings must be escaped by using single quotes.
* This function is not intended for canceling queries for a particular warehouse or user. Instead, use:

  > + [ALTER WAREHOUSE … ABORT ALL QUERIES](../sql/alter-warehouse.md)
  > + [ALTER USER … ABORT ALL QUERIES](../sql/alter-user.md)

## Examples

```sqlexample
SELECT SYSTEM$CANCEL_QUERY('d5493e36-5e38-48c9-a47c-c476f2111ce5');

+-------------------------------------------------------------+
| SYSTEM$CANCEL_QUERY('D5493E36-5E38-48C9-A47C-C476F2111CE5') |
|-------------------------------------------------------------|
| query [d5493e36-5e38-48c9-a47c-c476f2111ce5] terminated.    |
+-------------------------------------------------------------+
```

---
title: SYSTEM$CATALOG_LINK_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_catalog_link_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CATALOG_LINK_STATUS

Returns the link status for a specified [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).

See also:
:   [Use a catalog-linked database for Apache Iceberg™ tables](../../user-guide/tables-iceberg-catalog-linked-database.md)

## Syntax

```sqlsyntax
SYSTEM$CATALOG_LINK_STATUS( '<catalog_linked_db_name>' )
```

## Arguments

`'catalog_linked_db_name'`
:   Specifies the name of the catalog-linked database that you want to check the status of.

## Returns

The function returns a JSON object containing the following name/value pairs:

```sqljson
{
  "executionState":"<value>",
  "failedExecutionStateReason":"<value>",
  "failedExecutionStateErrorCode":"<value>",
  "lastLinkAttemptStartTime":"<value>",
  "failureDetails":[
    {
      "qualifiedEntityName":"<value>",
      "entityDomain":"<value>",
      "operation":"<value>",
      "errorCode":"<value>",
      "errorMessage":"<value>"
    },
    { ... },
    ...
  ]
}
```

Where:

> `executionState`
> :   Current execution state of the linking operation that Snowflake uses to connect to your Iceberg catalog.
>
>     Values:
>
>     * `RUNNING`: The next table discovery sync is scheduled or executing; does not guarantee that all tables have successfully synced.
>     * `FAILED`: The linking operation encountered an error and was unsuccessful.
>
>       If the linking operation fails, resolve the error first. Snowflake then automatically schedules the next table discovery sync, unless
>       discovery has been suspended for the catalog-linked database. If you suspended discovery, run
>       [ALTER DATABASE … RESUME DISCOVERY](../sql/alter-database-catalog-linked.md) after you resolve the error to resume discovery.
>
>       For example:
>
>       ```sqlexample
>       ALTER DATABASE IF EXISTS my_linked_db RESUME DISCOVERY;
>       ```
>
> `failedExecutionStateReason`
> :   Error message associated with a `FAILED` execution state. Doesn’t appear in the function output if the last sync attempt
>     was successful.
>
> `failedExecutionStateErrorCode`
> :   Error code associated with a `FAILED` execution state. Does not appear in the function output if the last sync attempt
>     was successful.
>
> `lastLinkAttemptStartTime`
> :   Timestamp that indicates when Snowflake last started the process of discovering and syncing changes in the remote catalog.
>
> `failureDetails`
> :   An array of objects that provide details about entities (for example, tables) in the remote catalog that Snowflake can’t sync.
>     Each object has the following fields:
>
>     `qualifiedEntityName`
>     :   The qualified name of the entity in the remote catalog, relative to the catalog name.
>
>         For example, `namespace_level_1.namespace_level_2.table_name`.
>
>         Type: String
>
>     `entityDomain`
>     :   The entity domain in the remote catalog; for example, TABLE.
>
>         Type: String
>
>     `operation`
>     :   The operation in Snowflake associated with the sync; for example, `CREATE` a table or schema, `DROP`.
>
>         * If the operation is `CATALOG_CONNECTION`, there was an error when Snowflake attempted to connect to the remote catalog.
>         * If the operation is `DISCOVERY`, there was an error when Snowflake attempted to discover the tables or namespaces in your remote
>           catalog. To see which table or namespace caused the error, see `entityDomain`, which will either be `TABLE` or `NAMESPACE`.
>
>         Type: String
>
>     `errorCode`
>     :   Error code associated with the failure.
>
>         Type: String
>
>     `errorMessage`
>     :   Error code associated with the failure.
>
>         Type: String

## Access control requirements

A [role](http://docs.snowflake.com/user-guide/security-access-control-overview#roles) used to execute this SQL command must have either of the following
[privileges](http://docs.snowflake.com/user-guide/security-access-control-overview#privileges) at a minimum:

| Privilege | Object |
| --- | --- |
| OWNERSHIP | The target catalog-linked database. |
| MONITOR | The target catalog-linked database. |

## Usage notes

* The `failureDetails` field returns information about DROP SCHEMA and DROP ICEBERG TABLE failures.
* Returns results as long as you use a role with a privilege on the specified catalog-linked database.
  For more information, see [Database privileges](../../user-guide/security-access-control-privileges.md).

## Examples

Retrieve the link status for a catalog-linked database named `my_cld`:

```sqlexample
SELECT SYSTEM$CATALOG_LINK_STATUS('my_cld');
```

Output:

```output
{
  "executionState": "RUNNING",
  "lastLinkAttemptStartTime": "2025-02-14T01:35:01.71Z",
  "failureDetails": [
    {
      "qualifiedEntityName": "my_namespace.table_1",
      "entityDomain": "TABLE",
      "operation": "CREATE",
      "errorCode": "0040000",
      "errorMessage": "An internal error occurred. Please contact Snowflake support."
    },
    {
      "qualifiedEntityName": "my_namespace.table_2",
      "entityDomain": "TABLE",
      "operation": "CREATE",
      "errorCode": "0040000",
      "errorMessage": "An internal error occurred. Please contact Snowflake support."
    }
  ]
}
```

---
title: SYSTEM$CKE_HASH_FUNCTION
source: https://docs.snowflake.com/en/sql-reference/functions/system_cke_hash_function.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System information)

# SYSTEM$CKE_HASH_FUNCTION

Analyzes [Cortex Knowledge Extensions (CKE)](../../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md) usage by mapping `hashedDocumentIds` back to your original document primary keys in the Cortex Search Service. This is required because Snowflake shares only hashed IDs for privacy.

This function returns the hashed document identifier, which maps to the HASHED_DOC_ID in the [LISTING_ACCESS_HISTORY view](../data-sharing-usage/listing-access-history.md).

See also:
:   [SYSTEM$ENCODE_CKE_PRIMARY_KEY](system_encode_cke_primary_key.md)

## Syntax

```sqlsyntax
SYSTEM$CKE_HASH_FUNCTION( '<hash_version>', '<encoded_primary_key>' )
```

## Arguments

`hash_version`
:   The version of the hash function used, provided in the [LISTING_ACCESS_HISTORY view](../data-sharing-usage/listing-access-history.md) view.

`encoded_primary_key`
:   The encoded primary keys returned when you call the [SYSTEM$ENCODE_CKE_PRIMARY_KEY](system_encode_cke_primary_key.md) function.

## Returns

Returns the hashed encoded primary keys specified by the hash version.

## Examples

The following example retrieves the hash version and uses the SYSTEM$CKE_HASH_FUNCTION to compute a hashed document ID for every primary
key. In the following example, `cke_document_daily_access` is a view created from the [LISTING_ACCESS_HISTORY view](../data-sharing-usage/listing-access-history.md):

```sqlexample
WITH
  encoded_primary_keys AS
  (
    SELECT pkCol1,
          pkCol2,
          SYSTEM$ENCODE_CKE_PRIMARY_KEY(pkCol1, pkCol2) AS encoded_primary_key
      FROM your_cortex_search_table
  )
,
  hash_versions AS
  (
    SELECT DISTINCT(hash_version) AS hash_version
      FROM cke_document_daily_access
  )
SELECT pkCol1,
      pkCol2,
      hash_version,
      SYSTEM$CKE_HASH_FUNCTION(hash_version, encoded_primary_key) AS hashed_doc_id
  FROM encoded_primary_keys
  CROSS JOIN hash_versions;
```

---
title: SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_cleanup_database_role_grants.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS

Revokes privileges on dropped objects from the share and grants the database role to the share.

## Syntax

```sqlsyntax
SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS( '<database_role_name>' , '<share_name>' )
```

## Arguments

`'database_role_name'`
:   The name of the database role.

    If the identifier is not fully qualified in the form of `db_name.database_role_name`, the command uses the database role
    in the current database for the session.

`'share_name'`
:   The name of the share.

## Access control requirements

To call this function, the active role must have the global [MANAGE GRANTS privilege](../../user-guide/security-access-control-privileges.md).

## Usage notes

None.

## Example

Call the function:

> ```sqlexample
> SELECT SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS('mydb.dbr1' , 'myshare');
> ```

---
title: SYSTEM$CLIENT_VERSION_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_client_version_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CLIENT_VERSION_INFO

Returns version information for Snowflake clients and drivers.

See also:
:   [Client versions & support policy](../../release-notes/requirements.md)

## Syntax

```sqlsyntax
SYSTEM$CLIENT_VERSION_INFO()
```

## Arguments

None

## Returns

Return a string containing a JSON array of objects. Each object contains information about a specific client and driver, such as SnowSQL, the JDBC driver, and so on.

Each JSON object contains the following name/value pairs:

```json
{
  "clientId": "DOTNETDriver",
  "clientAppId": ".NET",
  "minimumSupportedVersion": "2.0.9",
  "minimumNearingEndOfSupportVersion": "2.0.11",
  "recommendedVersion": "2.1.5",
  "deprecatedVersions": [],
  "_customSupportedVersions_": []
},
```

Where:

> `clientId`
> :   Internal ID of the client or driver. Possible values include:
>
>     * `DOTNETDriver`
>     * `GO`
>     * `JDBC`
>     * `JSDriver`
>     * `ODBC`
>     * `PHP_PDO`
>     * `PythonConnector`
>     * `SnowSQL`
>     * `SQLAPI`
>
> `clientAppId`
> :   Name of the client or driver. Possible values include:
>
>     * `.NET`
>     * `GO`
>     * `JDBC`
>     * `JavaScript`
>     * `ODBC`
>     * `PDO`
>     * `PythonConnector`
>     * `SnowSQL`
>     * `SQLAPI`
>
> `minimumSupportedVersion`
> :   Oldest version of the client or driver officially supported.
>
> `minimumNearingEndOfSupportVersion`
> :   Version of the client or driver that reaches end-of-support (EOS) at the start of the next quarter.
>
> `recommendedVersion`
> :   Current version of the client or driver.
>
> `deprecatedVersions`, `_customSupportedVersions_`
> :   Internal use only.

## Usage notes

* If you prefer not to process JSON, you can use the [PARSE_JSON](parse_json.md) and [LATERAL FLATTEN](../../user-guide/lateral-join-using.md) function to convert the JSON to columnar output.
* You can also use the WHERE clause to return the information for a specific client or driver (`clientId`).

## Examples

The following example retrieves the version information for all Snowflake clients and drivers. Note that the output has been manually formatted for better readability.

```sqlexample
SELECT SYSTEM$CLIENT_VERSION_INFO();
```

```output
[
  {
    "clientId": "DOTNETDriver",
    "clientAppId": ".NET",
    "minimumSupportedVersion": "2.0.9",
    "minimumNearingEndOfSupportVersion": "2.0.11",
    "recommendedVersion": "2.1.5",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "GO",
    "clientAppId": "Go",
    "minimumSupportedVersion": "1.6.6",
    "minimumNearingEndOfSupportVersion": "1.6.9",
    "recommendedVersion": "1.7.1",
    "deprecatedVersions": [],
    "_customSupportedVersions_": [
      "1.1.5"
    ]
  },
  {
    "clientId": "JDBC",
    "clientAppId": "JDBC",
    "minimumSupportedVersion": "3.13.14",
    "minimumNearingEndOfSupportVersion": "3.13.18",
    "recommendedVersion": "3.14.4",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "JSDriver",
    "clientAppId": "JavaScript",
    "minimumSupportedVersion": "1.6.6",
    "minimumNearingEndOfSupportVersion": "1.6.9",
    "recommendedVersion": "1.9.2",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "ODBC",
    "clientAppId": "ODBC",
    "minimumSupportedVersion": "2.24.5",
    "minimumNearingEndOfSupportVersion": "2.24.7",
    "recommendedVersion": "3.1.4",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "PHP_PDO",
    "clientAppId": "PDO",
    "minimumSupportedVersion": "1.2.0",
    "minimumNearingEndOfSupportVersion": "1.2.1",
    "recommendedVersion": "2.0.1",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "PythonConnector",
    "clientAppId": "PythonConnector",
    "minimumSupportedVersion": "2.7.3",
    "minimumNearingEndOfSupportVersion": "2.7.7",
    "recommendedVersion": "3.6.0",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "SnowSQL",
    "clientAppId": "SnowSQL",
    "minimumSupportedVersion": "1.2.21",
    "minimumNearingEndOfSupportVersion": "1.2.21",
    "recommendedVersion": "1.2.31",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  },
  {
    "clientId": "SQLAPI",
    "clientAppId": "SQLAPI",
    "minimumSupportedVersion": "1.0.0",
    "minimumNearingEndOfSupportVersion": "",
    "recommendedVersion": "",
    "deprecatedVersions": [],
    "_customSupportedVersions_": []
  }
]
```

The following example returns the version information for all clients as a rowset:

```sqlexample
WITH output AS (
  SELECT
    PARSE_JSON(SYSTEM$CLIENT_VERSION_INFO()) a
)
SELECT
    value:clientAppId::STRING AS client_app_id,
    value:minimumSupportedVersion::STRING AS minimum_version,
    value:minimumNearingEndOfSupportVersion::STRING AS near_end_of_support_version,
    value:recommendedVersion::STRING AS recommended_version
  FROM output r,
    LATERAL FLATTEN(INPUT => r.a, MODE =>'array');
```

```output
+-----------------+-----------------+-----------------------------+---------------------+
| CLIENT_APP_ID   | MINIMUM_VERSION | NEAR_END_OF_SUPPORT_VERSION | RECOMMENDED_VERSION |
|-----------------+-----------------+-----------------------------+---------------------|
| .NET            | 2.0.9           | 2.0.11                      | 2.1.5               |
| Go              | 1.6.6           | 1.6.9                       | 1.7.1               |
| JDBC            | 3.13.14         | 3.13.18                     | 3.14.4              |
| JavaScript      | 1.6.6           | 1.6.9                       | 1.9.2               |
| ODBC            | 2.23.5          | 2.24.7                      | 3.1.4               |
| PDO             | 1.2.0           | 1.2.1                       | 2.0.1               |
| PythonConnector | 2.7.3           | 2.7.7                       | 3.6.0               |
| SnowSQL         | 1.2.21          | 1.2.21                      | 1.2.31              |
| SQLAPI          | 1.0.0           |                             |                     |
+-----------------+-----------------+-----------------------------+---------------------+
```

The following example returns the version information for the JDBC driver as a rowset:

```sqlexample
WITH output AS (
  SELECT
    PARSE_JSON(SYSTEM$CLIENT_VERSION_INFO()) a
)
SELECT
    value:clientId::STRING AS client_id,
    value:minimumSupportedVersion::STRING AS minimum_version,
    value:minimumNearingEndOfSupportVersion::STRING AS near_end_of_support_version,
    value:recommendedVersion::STRING AS recommended_version
  FROM output r,
    LATERAL FLATTEN(INPUT => r.a, MODE =>'array')
  WHERE client_id = 'JDBC';
```

```output
+-----------+-----------------+-----------------------------+---------------------+
| CLIENT_ID | MINIMUM_VERSION | NEAR_END_OF_SUPPORT_VERSION | RECOMMENDED_VERSION |
|-----------+-----------------+-----------------------------+---------------------|
| JDBC      | 3.13.14         | 3.13.18                     | 3.14.4              |
+-----------+-----------------+-----------------------------+---------------------+
```

---
title: SYSTEM$CLIENT_VULNERABILITY_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_client_vulnerability_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CLIENT_VULNERABILITY_INFO

Returns details about common vulnerabilities and exposures (CVE) fixes and related vulnerabilities for Snowflake clients and drivers.

See also:
:   [SYSTEM$CLIENT_VERSION_INFO](system_client_version_info.md)

## Syntax

```sqlsyntax
SYSTEM$CLIENT_VULNERABILITY_INFO()
```

## Arguments

None

## Returns

Return a string containing a JSON array of objects. Each object contains information about a specific client and driver, such as SnowSQL, the JDBC driver, and so on.

Each JSON object contains the following structure:

```json
{
  "clientId": "GO",
  "vulnerabilities": [
    {
      "cve": "CVE-2023-34231",
      "severity": "high",
      "maxAffected": "1.6.18"
    },
    {
      "cve": "CVE-2025-46327",
      "severity": "low",
      "minAffected": "1.7.0",
      "maxAffected": "1.13.2"
    }
  ]
}
```

Where:

`clientId`
:   Internal ID of the client or driver. Possible values include:

    * `DOTNETDriver`
    * `GO`
    * `JDBC`
    * `JSDriver` (Node.js)
    * `ODBC`
    * `PHP_PDO`
    * `PythonConnector`
    * `SnowSQL`
    * `SQLAPI`

`vulnerabilities`
:   Array of vulnerabilities affecting the client or driver. Each vulnerability is represented as an object with the following name/value pairs:

    * `cve` is the CVE identifier for the vulnerability.
    * `severity` is the severity level of the vulnerability. Possible values include: `none`, `low`, `medium`, `high`, and `critical`.
    * `minAffected` is the minimum version of the client or driver that contains this vulnerability. This field is optional, as some vulnerabilities might occur in the first version of a client or driver.
    * `maxAffected` is the maximum version that contains this vulnerability.

## Usage notes

None

## Examples

The following example calls the SYSTEM$CLIENT_VERSION_INFO and SYSTEM$CLIENT_VULNERABIITY_INFO system functions. The example parses the JSON strings returned by these functions and presents the data in tabular form.

```sqlexample
-- CLIENT VERSION INFO

SELECT
      value:clientAppId::VARCHAR clientAppId
    , value:clientId::VARCHAR clientId
    , value:minimumNearingEndOfSupportVersion::VARCHAR minimumNearingEndOfSupportVersion
    , value:minimumSupportedVersion::VARCHAR minimumSupportedVersion
    , value:recommendedVersion::VARCHAR recommendedVersion
    , value:deprecatedVersions deprecatedVersions
    , value:_customSupportedVersions_ customSupportedVersions
FROM
    TABLE(FLATTEN(PARSE_JSON(SYSTEM$CLIENT_VERSION_INFO())));

-- CLIENT VULNERABILITY INFO

SELECT
    c:clientId::VARCHAR clientId
    , f.value:cve::VARCHAR cve
    , f.value:maxAffected::VARCHAR maxAffected
    , f.value:minAffected::VARCHAR minAffected
    , f.value:severity::VARCHAR severity
FROM
    (
        SELECT value c
        FROM TABLE(FLATTEN(PARSE_JSON(SYSTEM$CLIENT_VULNERABILITY_INFO())))
    ) c,
    lateral flatten(input => c, path => 'vulnerabilities' ) f;
```

---
title: SYSTEM$CLUSTERING_DEPTH
source: https://docs.snowflake.com/en/sql-reference/functions/system_clustering_depth.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CLUSTERING_DEPTH

Computes the average depth of the table according to the specified columns (or the clustering key defined for the table). The average depth of a populated table (i.e. a table containing
data) is always `1` or more. The smaller the average depth, the better clustered the table is with regards to the specified columns.

For more information about micro-partitions and clustering keys, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

See also:
:   [SYSTEM$CLUSTERING_INFORMATION](system_clustering_information.md)

## Syntax

```sqlsyntax
SYSTEM$CLUSTERING_DEPTH( '<table_name>' , '( <col1> [ , <col2> ... ] )' [ , '<predicate>' ] )
```

## Arguments

`table_name`
:   Table for which you want to calculate the clustering depth.

`col1 [ , col2 ... ]`
:   Column(s) in the table used to calculate the clustering depth:

    * For a table with no clustering key, this argument is required. If this argument is omitted, an error is returned.
    * For a table with a clustering key, this argument is optional; if the argument is omitted, Snowflake uses the defined clustering key to calculate the depth.

    > **Note:**
    >
    > You can use this argument to calculate the depth for any columns in the table, regardless of the clustering key defined for the table.

`predicate`
:   Clause that filters the range of values in the columns on which to calculate the clustering depth. Note that `predicate` does not utilize a WHERE keyword at the beginning of the clause.

## Usage notes

* All arguments are strings (i.e. they must be enclosed in single quotes).
* If `predicate` contains a string, the string must be enclosed in single quotes, which then must be escaped using single quotes. For example:

  > `SYSTEM$CLUSTERING_DEPTH( ... , 'col1 = 100 and col2 = ''A''' )`

## Examples

Calculate the clustering depth for a table using the clustering key defined for the table:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS');
>
> +----------------------------------------+
> | SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS') |
> |----------------------------------------+
> | 2.4865                                 |
> +----------------------------------------+
> ```

Calculate the clustering depth for a table using two columns in the table:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS', '(C2, C9)');
>
> +----------------------------------------------------+
> | SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS', '(C2, C9)') |
> +----------------------------------------------------+
> | 23.1351                                            |
> +----------------------------------------------------+
> ```

Same as the previous example, but with a predicate on one of the columns:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS', '(C2, C9)', 'C2 = 25');
>
> +----------------------------------------------------+
> | SYSTEM$CLUSTERING_DEPTH('TPCH_ORDERS', '(C2, C9)') |
> +----------------------------------------------------+
> | 11.2452                                            |
> +----------------------------------------------------+
> ```

---
title: SYSTEM$CLUSTERING_INFORMATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_clustering_information.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CLUSTERING_INFORMATION

Returns clustering information, including average clustering depth, for a table based on one or more columns in the table.

See also:
:   [SYSTEM$CLUSTERING_DEPTH](system_clustering_depth.md)

## Syntax

```sqlsyntax
SYSTEM$CLUSTERING_INFORMATION( '<table_name>'
    [ , { '( <expr1> [ , <expr2> ... ] )' | <number_of_errors> } ] )
```

## Arguments

`table_name`
:   Table for which you want to return clustering information.

`(expr1 [ , expr2 ... ])`
:   Column names or expressions for which clustering information is returned:

    * For a table with no clustering key, this argument is required. If this argument is omitted, an error is returned.
    * For a table with a clustering key, this argument is optional; if the argument is omitted, Snowflake uses the defined clustering key to return clustering information.

    Even if only one column name or expression is passed, it must be inside parentheses.

    > **Note:**
    >
    > You can use this argument to return clustering information for any columns in the table, regardless of whether a clustering key is defined for the table.
    >
    > In other words, you can use this to help you decide what clustering to use in the future.

`number_of_errors`
:   Number of clustering errors returned by the function. If this argument is omitted, the 10 most recent errors are returned.

## Usage notes

* The second argument of the function specifies a column name/expression or a number of errors. You cannot include both arguments
  in a single function call.
* The table name, column name, and expression are strings, and should be enclosed in single quotes.

## Returns

The function returns a value of type VARCHAR.

The returned string is in JSON format and contains the following name/value pairs:

`cluster_by_keys`
:   Columns in table used to return clustering information; can be any columns in the table.

`notes`
:   This column can contain suggestions to make clustering more efficient. For example, this field might contain a warning
    if the cardinality of the clustering column is extremely high.

    This column can be empty.

    For more information about how to cluster efficiently, see [Strategies for Selecting Clustering Keys](../../user-guide/tables-clustering-keys.md).

`total_partition_count`
:   Total number of micro-partitions that comprise the table.

`total_constant_partition_count`
:   Total number of micro-partitions for which the value of the specified columns have reached a constant state (i.e. the micro-partitions will not benefit significantly from reclustering). The number
    of constant micro-partitions in a table has an impact on pruning for queries. The higher the number, the more micro-partitions can be pruned from queries executed on the table, which has a
    corresponding impact on performance.

`average_overlaps`
:   Average number of overlapping micro-partitions for each micro-partition in the table. A high number indicates the table is not well-clustered.

`average_depth`
:   Average overlap depth of each micro-partition in the table. A high number indicates the table is not well-clustered.

    This value is also returned by [SYSTEM$CLUSTERING_DEPTH](system_clustering_depth.md).

`partition_depth_histogram`
:   A histogram depicting the distribution of overlap depth for each micro-partition in the table. The histogram contains buckets with widths:

    * `0` to `16` with increments of `1`.
    * For buckets larger than `16`, increments of twice the width of the previous bucket (e.g. `32`, `64`, `128`, …).

`clustering_errors`
:   An array of JSON objects, each with a `timestamp` and `error` name/value pair. The `error` describes why automatic
    clustering was not able to recluster data.

    By default, the 10 most recent errors are returned in the array. To return more or fewer errors, specify a number as the second argument
    of the function.

For more information about micro-partition overlap and depth, and their impact on query pruning, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Examples

Return the 5 most recent clustering errors:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_INFORMATION('t1', 5);
> ```

Return the clustering information for a table using two columns in the table:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_INFORMATION('test2', '(col1, col3)');
> ```
>
> ```output
> +--------------------------------------------------------------------+
> | SYSTEM$CLUSTERING_INFORMATION('TEST2', '(COL1, COL3)')             |
> |--------------------------------------------------------------------|
> | {                                                                  |
> |   "cluster_by_keys" : "LINEAR(COL1, COL3)",                        |
> |   "total_partition_count" : 1156,                                  |
> |   "total_constant_partition_count" : 0,                            |
> |   "average_overlaps" : 117.5484,                                   |
> |   "average_depth" : 64.0701,                                       |
> |   "partition_depth_histogram" : {                                  |
> |     "00000" : 0,                                                   |
> |     "00001" : 0,                                                   |
> |     "00002" : 3,                                                   |
> |     "00003" : 3,                                                   |
> |     "00004" : 4,                                                   |
> |     "00005" : 6,                                                   |
> |     "00006" : 3,                                                   |
> |     "00007" : 5,                                                   |
> |     "00008" : 10,                                                  |
> |     "00009" : 5,                                                   |
> |     "00010" : 7,                                                   |
> |     "00011" : 6,                                                   |
> |     "00012" : 8,                                                   |
> |     "00013" : 8,                                                   |
> |     "00014" : 9,                                                   |
> |     "00015" : 8,                                                   |
> |     "00016" : 6,                                                   |
> |     "00032" : 98,                                                  |
> |     "00064" : 269,                                                 |
> |     "00128" : 698                                                  |
> |   },                                                               |
> |   "clustering_errors" : [ {                                        |
> |      "timestamp" : "2023-04-03 17:50:42 +0000",                    |
> |      "error" : "(003325) Clustering service has been disabled.\n"  |
> |      }                                                             |
> |   ]                                                                |
> | }                                                                  |
> +--------------------------------------------------------------------+
> ```
>
> This example indicates that the `test2` table is not well-clustered for the following reasons:
>
> * Zero (`0`) constant micro-partitions out of `1156` total micro-partitions.
> * High average of overlapping micro-partitions.
> * High average of overlap depth across micro-partitions.
> * Most of the micro-partitions are grouped at the lower-end of the histogram, with the majority of micro-partitions having an overlap depth between `64` and `128`.
> * Automatic clustering was previously disabled.

## Limitations

If a table has more than 2 million partitions:

* The results of the function are based on a subset of the table’s partitions.
* The value of the output’s `total_partition_count` field is 2 million.

---
title: SYSTEM$CLUSTERING_RATIO — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_clustering_ratio.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CLUSTERING_RATIO — *Deprecated*

Calculates the clustering ratio for a table, based on one or more columns in the table. The ratio is a number from `0` to `100`. The higher the ratio, the better clustered the table is.

The clustering ratio for a table can be calculated using any columns in the table or columns that have been explicitly defined as a clustering key for the table. A clustering key can be defined for a table
using either [CREATE TABLE](../sql/create-table.md) or [ALTER TABLE](../sql/alter-table.md).

For more information about clustering ratio and clustering keys, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Syntax

```sqlsyntax
SYSTEM$CLUSTERING_RATIO( '<table_name>' , '( <col1> [ , <col2> ... ] )' [ , '<predicate>' ] )
```

## Arguments

`table_name`
:   Table for which you want to calculate the clustering ratio.

`col1 [ , col2 ... ]`
:   Column(s) in the table used to calculate the clustering ratio:

    * For a table with no clustering key, this argument is required. If this argument is omitted, an error is returned.
    * For a table with a clustering key, this argument is optional; if the argument is omitted, Snowflake uses the defined clustering key to calculate the ratio.

    > **Note:**
    >
    > You can use this argument to calculate the ratio for any columns in the table, regardless of the clustering key defined for the table.

`predicate`
:   Clause that filters the range of values in the columns on which to calculate the clustering ratio. Note that `predicate` does not utilize a WHERE keyword at the beginning of the clause.

## Usage notes

* All arguments are strings (i.e. they must be enclosed in single quotes).
* If `predicate` contains a string, the string must be enclosed in single quotes, which then must be escaped using single quotes. For example:

  > `SYSTEM$CLUSTERING_RATIO( ... , 'col1 = 100 and col2 = ''A''' )`

## Examples

Calculate the clustering ratio for a table using two columns in the table:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_RATIO('t2', '(col1, col3)');
>
> +-------------------------------+
> | SYSTEM$CLUSTERING_RATIO('T2') |
> |-------------------------------|
> |                          77.1 |
> +-------------------------------+
> ```

Calculate the clustering ratio for a table using two columns in the table and a predicate on one of the columns:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_RATIO('t2', '(col1, col2)', 'col1 = ''A''');
>
> +-------------------------------+
> | SYSTEM$CLUSTERING_RATIO('T2') |
> |-------------------------------|
> |                          87.7 |
> +-------------------------------+
> ```

Calculate the clustering ratio for a table using the clustering key defined for the table:

> ```sqlexample
> SELECT SYSTEM$CLUSTERING_RATIO('t1');
>
> +-------------------------------+
> | SYSTEM$CLUSTERING_RATIO('T1') |
> |-------------------------------|
> |                         100.0 |
> +-------------------------------+
> ```

---
title: SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_commit_move_organization_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT

Finalizes the process of moving an [organization account](../../user-guide/organization-accounts.md) from one region to another.

The process of moving the organization account began when the [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](system_initiate_move_organization_account.md) was
called.

See also:
:   [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](system_initiate_move_organization_account.md) , [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](system_show_move_organization_account_status.md)

## Syntax

```sqlsyntax
SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT( <grace_period> )
```

## Arguments

`grace_period`
:   Specifies the number of days after which the organization account in the original region (that is, the source region) will be deleted.

## Access control requirements

Only users with the GLOBALORGADMIN role can call this function.

## Usage notes

* You are automatically logged out of Snowflake immediately after calling this function.
* Until the process of finalizing the move completes (usually within a few minutes), you cannot sign in to the organization account in the
  source region nor the organization account in the target region.
* When the finalization process completes, the name of the organization account in the new region changes from the temporary name that was
  specified by the [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](system_initiate_move_organization_account.md) function to the original name of the
  organization account.
* To check the status of the finalization process, call the [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](system_show_move_organization_account_status.md)
  function.

## Examples

Delete the original organization account 14 days after the move is finalized:

```sqlexample
SELECT SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT(14);
```

---
title: SYSTEM$CONVERT_PIPES_SQS_TO_SNS
source: https://docs.snowflake.com/en/sql-reference/functions/system_convert_pipes_sqs_to_sns.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CONVERT_PIPES_SQS_TO_SNS

Convert pipes using Amazon SQS (Simple Queue Service) notifications to the Amazon Simple Notification Service (SNS) service for
an S3 bucket.

For more information, see [Automating Snowpipe for Amazon S3](../../user-guide/data-load-snowpipe-auto-s3.md).

## Syntax

```sqlsyntax
SYSTEM$CONVERT_PIPES_SQS_TO_SNS( '<bucket_name>, '<sns_topic_arn>' )
```

## Arguments

`bucket_name`
:   Name of the S3 bucket.

`sns_topic_arn`
:   ARN of Amazon SNS topic.

## Access control requirements

Only account administrators can execute this function.

## Usage notes

* Before you call this function, update the access policy for your topic with the following permissions:

  + Allow the Snowflake IAM user to subscribe the SQS queue that is in your *target* account
    to your topic.
  + Allow Amazon S3 to publish event notifications from your bucket to the SNS topic.

  For instructions, see [Step 1: Subscribe the Snowflake SQS Queue to the SNS Topic](../../user-guide/data-load-snowpipe-auto-s3.md).
* Call this function *before* you update your S3 bucket to send notifications to the SNS topic.
* To prevent any data loss, Snowpipe will continue to consume messages from the SQS queue.
* The S3 bucket and SNS topic must be in the same AWS region.

## Examples

Convert all notifications from bucket `my_s3_bucket`:

```sqlexample
SELECT SYSTEM$CONVERT_PIPES_SQS_TO_SNS(
   'my_s3_bucket', 'arn:aws:sns:us-east-2:111122223333:sns_topic');
```

---
title: SYSTEM$CREATE_BILLING_EVENT
source: https://docs.snowflake.com/en/sql-reference/functions/system_create_billing_event.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$CREATE_BILLING_EVENT

Creates a [billable event](../../developer-guide/native-apps/adding-custom-event-billing.md) that tracks consumer usage of an installed
monetized application. If you need to exceed the one event per minute frequency limitation of this system function, use [SYSTEM$CREATE_BILLING_EVENTS](system_create_billing_events.md). This system function can only be called from an application installed in a consumer account.

## Syntax

```sqlsyntax
SYSTEM$CREATE_BILLING_EVENT(
 '<class>',
 '<subclass>',
 <start_timestamp>,
 <timestamp>,
 <base_charge>,
 '<objects>',
 '<additional_info>'
 )
```

## Arguments

**Required:**

`'class'`
:   Identifier for the custom event class.

    Type: STRING

    The identifier has the following requirements:

    * Must start with a letter (A-Z) or an underscore (“_”).
    * Must contain only letters, underscores, decimal digits (0-9), and dollar signs (“$”).
    * Length cannot exceed 64 characters.
    * Must not start with `SNOWFLAKE_`. `SNOWFLAKE_` is reserved for internal identifiers.

    The class name is stored and resolved as uppercase characters. Class name comparisons are case-insensitive.

`timestamp`
:   Specifies the timestamp (UTC) when the event was created as a Unix timestamp in milliseconds.

    Type: Integer

`base_charge`
:   Specifies the amount in US dollars to charge for the billable event. The value must be greater than zero, less than 99,999.99, and must not exceed two decimal places of precision. For example, `1.00` or `0.07`.

    Type: DOUBLE

**Optional:**

`'subclass'`
:   Identifier for the custom event subclass. This field is only used by the provider.

    Type: STRING

    The identifier has the same naming requirements as the `class` argument.

`start_timestamp`
:   Specifies the start time (UTC) of the event as a Unix timestamp in milliseconds.

    Type: INTEGER

    Use to set the start time in cases where providers want to emit an event based on a time range; otherwise set to the same
    value used for the `TIMESTAMP` argument.

`'objects'`
:   A JSON string array containing fully qualified object names that apply to this event.

    Type: STRING

    The maximum size is 4 KB.

`'additional_info'`
:   A JSON string of key-value pairs the provider can use to send additional info.

    > Type: STRING
    >
    > The maximum size is 4 KB.

## Returns

This function returns the following status messages:

> | Status message | Description |
> | --- | --- |
> | Success | Indicates the billable event was successfully created. |
> | Invalid parameter: `<PARAM_NAME>`. | Indicates an unsupported parameter was passed to the function. |
> | Only callable from within an application. | Indicates the function was called from outside an application. |
> | Payload length exceeds the limit of 9000 characters. | Indicates the call to the function exceeds the character limit. |
> | Too many calls. At most 1 call is allowed per 10 millisecond window. | Indicates an application made too many calls to this system function within a specific period. |

## Usage notes

This system function can only be called from within a stored procedure in the setup script of an application created using the
Snowflake Native App Framework.

## Examples

See [Billable event examples](../../developer-guide/native-apps/adding-custom-event-billing.md).

---
title: SYSTEM$CREATE_BILLING_EVENTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_create_billing_events.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$CREATE_BILLING_EVENTS

Creates multiple [billable events](../../developer-guide/native-apps/adding-custom-event-billing.md) that track consumer usage of installed
monetized applications. Use this system function when you need to exceed the one event per minute frequency limitation of [SYSTEM$CREATE_BILLING_EVENT](system_create_billing_event.md). This system function can only be called from an application installed in a consumer account.

## Syntax

```sqlsyntax
SYSTEM$CREATE_BILLING_EVENTS('<json_array_of_events>')
```

## Arguments

`'json_array_of_events'`
:   A STRING containing a JSON array of objects. Each object specifies a billing event.

    Each JSON object contains the following key-value pairs:

    ```json
    {
      "class": "my_class",
      "subclass": "my_subclass",
      "start_timestamp": 1730825611,
      "timestamp": 1730826611,
      "base_charge": 1.00,
      "objects": "[\"my_schema.my_udf\"]",
      "additional_info": "my_additional_info"
    }
    ```

    The following table describes these key-value pairs:

    > | Key-value pair | Type | Description |
    > | --- | --- | --- |
    > | `"class"` | STRING | Identifier for the custom event class. |
    > | `"subclass"` | STRING | Identifier for the custom event subclass. This field is only used by the provider. |
    > | `"start_timestamp"` | INTEGER | The start time (UTC) of the event as a Unix timestamp in milliseconds. |
    > | `"timestamp"` | INTEGER | The timestamp (UTC) when the event was created as a Unix timestamp in milliseconds. |
    > | `"base_charge"` | DOUBLE | The amount in US dollars to charge for the billable event. The value must be greater than zero, less than 99,999.99, and must not exceed two decimal places of precision. For example, `1.00` or `0.07`. |
    > | `"objects"` | STRING | A JSON string array containing fully qualified object names that apply to the event. |
    > | `"additional_info"` | STRING | A JSON string of key-value pairs the provider can use to send additional info. |

## Returns

This function returns the following status messages:

> | Status message | Description |
> | --- | --- |
> | Success | Indicates the billable event was successfully created. |
> | Invalid parameter: `<PARAM_NAME>`. | Indicates an unsupported parameter was passed to the function. |
> | Only callable from within an application. | Indicates the function was called from outside an application. |
> | Payload length exceeds the limit of 9000 characters. | Indicates the call to the function exceeds the character limit. |
> | Number of events exceeds the limit of 100. | Indicates the maximum number of billable events has been reached for a single call. The specified custom event class is not used in this determination. |
> | Too many calls. At most 1 call is allowed per 10 millisecond window. | Indicates an application made too many calls to this system function within a specific period. |

## Usage notes

This system function can only be called from within a stored procedure in the setup script of an application created using the
Snowflake Native App Framework.

## Examples

See [Billable event examples](../../developer-guide/native-apps/adding-custom-event-billing.md).

---
title: SYSTEM$CURRENT_USER_TASK_NAME
source: https://docs.snowflake.com/en/sql-reference/functions/system_current_user_task_name.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$CURRENT_USER_TASK_NAME

Returns the name of the task currently executing when invoked from the statement or stored procedure defined by the task.

## Syntax

```sqlsyntax
SYSTEM$CURRENT_USER_TASK_NAME()
```

## Arguments

None.

## Examples

Insert the name of the current task into a table along with the current time:

> ```sqlexample
> CREATE TASK mytask
>   WAREHOUSE = mywh,
>   SCHEDULE = '5 MINUTE'
> AS
>   INSERT INTO mytable(ts, task) VALUES(CURRENT_TIMESTAMP, SYSTEM$CURRENT_USER_TASK_NAME());
>
> SELECT * FROM mytable;
>
> +-------------------------+------------------------------------+
> | TS                      | TASK                               |
> |-------------------------+------------------------------------|
> | 2018-11-15 07:41:33.463 | MYDB.PUBLIC.MYTASK                 |
> +-------------------------+------------------------------------+
> ```

---
title: SYSTEM$DATA_METRIC_SCAN
source: https://docs.snowflake.com/en/sql-reference/functions/system_data_metric_scan.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md), [Table functions](../functions-table.md)

# SYSTEM$DATA_METRIC_SCAN

Returns the rows identified by a [data quality metric](../../user-guide/data-quality-intro.md) as containing data that fails a data quality
check. For example, if you use the NULL_COUNT data metric function as an argument, the function returns the rows in the table that contain a
NULL value in a specific column.

## Syntax

```sqlsyntax
SYSTEM$DATA_METRIC_SCAN(
  REF_ENTITY_NAME  => '<object>'
  , METRIC_NAME  => '<data_metric_function>'
  , ARGUMENT_NAME => '<column> [ , <column> ... ]'
  [ , ARGUMENT_EXPRESSION => '<boolean-expression>' ]
  [ , AT_TIMESTAMP => '<timestamp>' ] )
```

## Arguments

**Required:**

`REF_ENTITY_NAME => 'object'`
:   Name of the table or view on which the specified data metric function will run. The function returns rows from this object.

`METRIC_NAME => 'data_metric_function'`
:   Name of the system data metric that you want to run to evaluate the specified table or view. Only the following system functions are
    supported:

    > * SNOWFLAKE.CORE.ACCEPTED_VALUES
    > * SNOWFLAKE.CORE.BLANK_COUNT
    > * SNOWFLAKE.CORE.BLANK_PERCENT
    > * SNOWFLAKE.CORE.DUPLICATE_COUNT
    > * SNOWFLAKE.CORE.NULL_COUNT
    > * SNOWFLAKE.CORE.NULL_PERCENT

`ARGUMENT_NAME => 'column [ , column ... ]'`
:   Name of the columns in the specified table or view that are being passed as arguments to the specified data metric function.

**Optional:**

`ARGUMENT_EXPRESSION => 'boolean-expression'`
:   Required if the specified data metric function is [ACCEPTED_VALUES](dmf_accepted_values.md). Disallowed for all other DMFs.

    Specifies a Boolean expression used to evaluate whether a record passes or fails the ACCEPTED_VALUES data quality check. The
    SYSTEM$DATA_METRIC_SCAN function returns records that do *not* match the Boolean expression. The expression can include the following
    operators and functions:

    * [Comparison operators](../operators-comparison.md)
    * [Logical operators](../operators-logical.md)
    * [[ NOT ] LIKE](like.md)
    * [[ NOT ] IN](in.md)
    * [IS [ NOT ] NULL](is-null.md)

    The column in the Boolean expression must be the same column specified in the ARGUMENT_NAME argument.

    If the ACCEPTED_VALUES DMF is [associated with the object](../../user-guide/data-quality-working.md) specified by REF_ENTITY_NAME, the
    SYSTEM$DATA_METRIC_SCAN function ignores the Boolean expression that was specified when ACCEPTED_VALUES was associated with the
    object.

`AT_TIMESTAMP => 'timestamp'`
:   Timestamp that is being passed as an argument to check the results of a DMF evaluation on the table or view in the past.

## Returns

Rows from the specified table or view.

## Access control privileges

Executing this function requires the following privileges:

* SELECT on the specified table.
* USAGE on the specified data metric function.

## Usage notes

* This function does not support user-defined metrics.
* If the specified table is protected by a policy, such as a masking policy or row access policy, the function might return unexpected or
  incomplete data because results depend on the user’s role when executing the function.

## Examples

Given that the SNOWFLAKE.CORE.NULL_COUNT system metric returns the total number of NULL values in a particular column, the following returns
the rows of the `employeesTable` table that have NULL values in the `SSN` column.

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME  => 'governance.sch.employeesTable',
    METRIC_NAME  => 'snowflake.core.null_count',
    ARGUMENT_NAME => 'SSN'
  ));
```

Given that the SNOWFLAKE.CORE.DUPLICATE_COUNT system metric returns the count of duplicate values, the following returns
the rows of the `employeesTable` table that had duplicate values in both the `first_name` and `last_name` columns.

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME  => 'governance.sch.employeesTable',
    METRIC_NAME  => 'snowflake.core.duplicate_count',
    ARGUMENT_NAME => 'first_name, last_name'
  ));
```

Return the rows where the value of the `age` column is *not* equal to five (that is, the rows that *don’t* match the condition specified by
ARGUMENT_EXPRESSION).

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$DATA_METRIC_SCAN(
    REF_ENTITY_NAME  => 'governance.sch.employeesTable',
    METRIC_NAME  => 'snowflake.core.accepted_values',
    ARGUMENT_NAME => 'age',
    ARGUMENT_EXPRESSION => 'age = 5'
  ));
```

---
title: SYSTEM$DATABASE_REFRESH_HISTORY — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_database_refresh_history.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$DATABASE_REFRESH_HISTORY — *Deprecated*

Returns a JSON object showing the refresh history for a secondary database.

> **Note:**
>
> This function returns database refresh activity within the last 14 days.

## Syntax

```sqlsyntax
SYSTEM$DATABASE_REFRESH_HISTORY( '<secondary_db_name>' )
```

## Arguments

`secondary_db_name`
:   Name of the secondary database. This argument is optional if the secondary database is the active database in the current session.

    Note that the entire name must be enclosed in single quotes.

## Output

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| startTimeUTC | NUMBER | Time when the replication operation began. Format is epoch time. |
| endTimeUTC | NUMBER | Time when the replication operation finished, if applicable. Format is epoch time. |
| currentPhase | TEXT | Current replication phase. For the list of phases, see the usage notes. |
| jobUUID | TEXT | Query ID for the secondary database refresh job. |
| copy_bytes | NUMBER | Number of bytes copied during the replication operation. |
| object_count | NUMBER | Number of database objects copied during the replication operation. |

## Usage notes

* Only returns results for account administrators (users with the ACCOUNTADMIN role).
* Following is the list of phases in the order processed:

  1. SECONDARY_UPLOADING_INVENTORY
  2. PRIMARY_UPLOADING_METADATA
  3. PRIMARY_UPLOADING_DATA
  4. SECONDARY_DOWNLOADING_METADATA
  5. SECONDARY_DOWNLOADING_DATA
  6. COMPLETED / FAILED / CANCELED

## Examples

The following example retrieves the refresh history for the `mydb` secondary database. The results are returned in a JSON object:

> ```sqlexample
> SELECT SYSTEM$DATABASE_REFRESH_HISTORY('mydb');
> ```

The following example retrieves the same details as in the previous example, but the results are flattened into relational form:

> ```sqlexample
> SELECT
>     to_timestamp_ltz(value:startTimeUTC::numeric,3) AS "start_time"
>     , to_timestamp_ltz(value:endTimeUTC::numeric,3) AS "end_time"
>     , value:currentPhase::string AS "phase"
>   , value:jobUUID::string AS "query_ID"
>   , value:copy_bytes::integer AS "bytes_transferred"
> FROM TABLE(flatten(INPUT=> PARSE_JSON(SYSTEM$DATABASE_REFRESH_HISTORY('mydb'))));
> ```

---
title: SYSTEM$DATABASE_REFRESH_PROGRESS , SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_database_refresh_progress.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$DATABASE_REFRESH_PROGRESS , SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB — *Deprecated*

The SYSTEM$DATABASE_REFRESH_PROGRESS family of functions can be used to query the status of a database refresh along various dimensions:

* SYSTEM$DATABASE_REFRESH_PROGRESS returns a JSON object indicating the current refresh status for a secondary database by name.
* SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB returns a JSON object indicating the current refresh status for a secondary database by refresh query.

> **Note:**
>
> These functions return database refresh activity within the last 14 days.

## Syntax

```sqlsyntax
SYSTEM$DATABASE_REFRESH_PROGRESS( '<secondary_db_name>' )

SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB( '<query_id>' )
```

## Arguments

`secondary_db_name`
:   Name of the secondary database. This argument is optional if the secondary database is the active database in the current session.

    Note that the entire name must be enclosed in single quotes.

`query_id`
:   ID of the database refresh query. The query ID can be obtained from the History  page in the web interface.

## Output

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| phaseName | TEXT | Name of the replication phases completed (or in progress) so far. For the list of phases, see the usage notes. |
| resultName | TEXT | Status of the replication phase. |
| startTimeUTC | NUMBER | Time when the replication phase began. Format is epoch time. |
| endTimeUTC | NUMBER | Time when the phase finished, if applicable. Format is epoch time. |
| details | VARIANT | A separate JSON object that shows the total number of bytes in the data refresh as well as the number of bytes copied so far in the phase. If the refresh statement previously failed or was cancelled and was initiated again, the object indicates the number of bytes skipped in the second attempt. The `details` object is included in the `Copying Primary Data` and `Copying Replica Data` phase information. |

## Usage notes

* Only returns results for account administrators (users with the ACCOUNTADMIN role).
* Following is the list of phases in the order processed:

  1. SECONDARY_UPLOADING_INVENTORY
  2. PRIMARY_UPLOADING_METADATA
  3. PRIMARY_UPLOADING_DATA
  4. SECONDARY_DOWNLOADING_METADATA
  5. SECONDARY_DOWNLOADING_DATA
  6. COMPLETED / FAILED / CANCELED

## Examples

The following example retrieves the current refresh status for the specified secondary database. The results are returned in a JSON object:

> ```sqlexample
> SELECT SYSTEM$DATABASE_REFRESH_PROGRESS('mydb');
> ```

The following example retrieves the same details as in the previous example, but the results are separated into relational columns and the timestamps are cast as TIMESTAMP_LTZ:

> ```sqlexample
> SELECT value:phaseName::string AS "Phase",
>   value:resultName::string AS "Result",
>   TO_TIMESTAMP_LTZ(value:startTimeUTC::numeric,3) AS "startTime",
>   TO_TIMESTAMP_LTZ(value:endTimeUTC::numeric,3) AS "endTime",
>   value:details AS "details"
>   FROM table(flatten(INPUT=> PARSE_JSON(SYSTEM$DATABASE_REFRESH_PROGRESS('mydb1'))));
> ```

The following example retrieves the status for the specified database refresh query. The results are returned in a JSON object:

> ```sqlexample
> SELECT SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB('4cbd7187-51f6-446c-9814-92d7f57d939b');
> ```

The following example retrieves the same details as in the previous example, but the results are separated into relational columns and the timestamps are cast as TIMESTAMP_LTZ:

> ```sqlexample
> SELECT value:phaseName::string AS "Phase",
>   value:resultName::string AS "Result",
>   TO_TIMESTAMP_LTZ(value:startTimeUTC::numeric,3) AS "startTime",
>   TO_TIMESTAMP_LTZ(value:endTimeUTC::numeric,3) AS "endTime",
>   value:details AS "details"
>   FROM TABLE(FLATTEN(input=> PARSE_JSON(SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB('4cbd7187-51f6-446c-9814-92d7f57d939b'))));
> ```

---
title: SYSTEM$DEACTIVATE_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_deactivate_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$DEACTIVATE_CMK_INFO

De-activates Tri-Secret Secure in your account.

This system function:

* Configures your account to stop using Tri-Secret Secure.
* Creates a new account master key.
* Retires the composed account master key.
* Registers your account with the rekeying background service.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$DEACTIVATE_CMK_INFO()
```

## Arguments

None.

## Returns

Success or error messages.

## Access control requirements

Only users granted the MODIFY privilege on the account can call this function. The MODIFY privilege on an account is typically granted only
to the ACCOUNTADMIN role.

## Usage notes

The background service generates email messages that notify the account administrator when Tri-Secret Secure is deactivated.

## Examples

Deactivate Tri-Secret Secure for your Snowflake account:

```sqlexample
SELECT SYSTEM$DEACTIVATE_CMK_INFO();
```

---
title: SYSTEM$DECODE_PAT
source: https://docs.snowflake.com/en/sql-reference/functions/system_decode_pat.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$DECODE_PAT

Returns information about a [programmatic access token](../../user-guide/programmatic-access-tokens.md), given the secret for the
token. This information includes the name of the token, the state of the token, and the user associated with the token.

You can call this function if you need to disable a programmatic access token and you want to know which user is associated with
the token.

## Syntax

```sqlsyntax
SYSTEM$DECODE_PAT( '<secret_for_programmatic_access_token>' )
```

## Arguments

`'secret_for_programmatic_access_token'`
:   Secret for the programmatic access token.

## Returns

Returns a VARCHAR value containing the token information in a JSON object. The JSON object has the following fields:

| Field | Description |
| --- | --- |
| `STATE` | State of the programmatic access token. This field contains one of the following values:   * `ACTIVE`: The programmatic access token can be used to authenticate and has not expired yet. * `EXPIRED`: The programmatic access token cannot be used to authenticate because the expiration date has passed. * `DISABLED`: The programmatic access token is [disabled](../../user-guide/programmatic-access-tokens.md) because user login access is disabled or   the user is locked out of logging in. |
| `PAT_NAME` | Name of the programmatic access token. |
| `USER_NAME` | Name of the user associated with the programmatic access token. |

## Examples

The following example returns information about the programmatic access token with the secret `abC...Y5Z`:

```sqlexample
SELECT SYSTEM$DECODE_PAT('abC...Y5Z');
```

```output
+------------------------------------------------------------------------+
| SYSTEM$DECODE_PAT('☺☺☺...☺☺☺')                                         |
|------------------------------------------------------------------------|
| {"STATE":"ACTIVE","PAT_NAME":"MY_EXAMPLE_TOKEN","USER_NAME":"MY_USER"} |
+------------------------------------------------------------------------+
```

---
title: SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/system_deprovision_privatelink_endpoint.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT

Deprovisions a private connectivity endpoint in the Snowflake VPC or VNet to prevent Snowflake from connecting to an external service by using
private connectivity. The endpoint can be a service endpoint or a resource endpoint depending on the cloud platform that hosts your
Snowflake account.

If you call this function and specify the wrong private connectivity endpoint, call the [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](system_restore_privatelink_endpoint.md) system
function to restore the endpoint within a seven day period. After seven days, the endpoint is deleted and cannot be recovered; you
will need to recreate the endpoint with the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](system_provision_privatelink_endpoint.md) system function.

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>  '<provider_service_name>' )
> ```

**Azure:**

> ```sqlsyntax
> SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>  '<provider_resource_id>'
>  [, '<subresource>' ]
> )
> ```

**Google Cloud**

```sqlsyntax
 SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
   '<service_attachment_id>'
);
```

## Arguments

**AWS**

`provider_service_name`
:   Specifies the external service or resource endpoint to restore. For example, `com.amazonaws.us-west-2.execute-api` for the Amazon API
    Gateway or `com.amazonaws.us-west-2.s3` for Amazon S3.

**Azure**

`'provider_resource_id'`
:   Specifies the fully-qualified identifier for the resource in your VPC or VNet.

`'subresource'`
:   Specifies the name of the subresource of the Azure resource.

    This argument is not required for [Azure Private Link Service](https://learn.microsoft.com/en-us/azure/private-link/private-link-service-overview) and Azure API Management Service.

    For all supported values, see the [Sub-resource table](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview#private-link-resource).

**Google Cloud**

`'target_service_id'`
:   Specifies the ID of the service attachment in your VPC network or the regional Google API.

## Returns

Returns a status message stating that the endpoint, with its identifier, is deprovisioned successfully.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Usage notes

* An error message occurs if a private connectivity endpoint is not associated with the specified arguments.

## Examples

**AWS:**

> Deprovision a private endpoint with external access to Amazon S3:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT('com.amazonaws.us-west-2.s3');
```

**Azure:**

> Deprovision a private endpoint to prevent Snowflake on Microsoft Azure from connecting to the Microsoft Azure API Management service in your
> Microsoft Azure VNet:
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>   '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
>   'Gateway'
>   );
> ```
>
> ```output
> Private endpoint with id "/subscriptions/e48379a7-2fc4-473e-b071-f94858cc83f5/resourcegroups/test_rg/providers/microsoft.network/privateendpoints/5ef8fd34-07db-4583-b0dd-0e2360398ed3" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
> ```
>
> Deprovision a private endpoint to prevent Snowflake on Microsoft Azure from connecting to an external service using external network access:
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>   '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/leorg1/providers/Microsoft.Sql/servers/myserver/databases/testdb',
>   'sqlServer'
>   );
> ```
>
> ```output
> "Resource Endpoint with id "/subscriptions/f0abb333-1b05-47c6-8c31-dd36d2512fd1/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" deprovisioned successfully"
> ```
>
> Deprovision a private endpoint to prevent Snowflake from connecting to an external stage for Microsoft Azure:
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>   '/subscriptions/cb72345g5-d347-4sdc-r3ee-70d234551a78/resourceGroups/rg-db-dev/providers/Microsoft.Storage/storageAccounts/dbasdfffext',
>   'blob'
> );
> ```
>
> ```output
> "Resource Endpoint with id "/subscriptions/57faea9a-20c2-4d35-b283-9c0c1e9593d8/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" deprovisioned successfully"
> ```

**Google Cloud**

> Deprovision a private endpoint to prevent Snowflake on Google Cloud from connecting to the service attachment in your Google Cloud VPC
> Network:
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>   'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment'
>   );
> ```
>
> ```output
> Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
> ```
>
> Deprovision a private endpoint to prevent Snowflake on Google Cloud from connecting to a regional Google service endpoint (CloudKMS):
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>  'cloudkms.us-east4.rep.googleapis.com'
>  );
> ```
>
> ```output
> Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
> ```
>
> Deprovision a private endpoint to prevent Snowflake from connecting to an external stage for Google Cloud:
>
> ```sqlexample
> SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT(
>  'storage.us-east4.rep.googleapis.com'
>  );
> ```
>
> ```output
> Private endpoint with id "abcd0000000000000001" successfully marked for deletion. Before it is fully deleted in 7-8 days, it can be restored.
> ```

---
title: SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS
source: https://docs.snowflake.com/en/sql-reference/functions/system_deprovision_privatelink_endpoint_tss.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS

Deprovisions a private connectivity endpoint in the Snowflake VPC or VNet to prevent Snowflake from connecting to an external key management service (KMS) resource
using private connectivity. The endpoint can be a service endpoint or a resource endpoint depending on the cloud platform that hosts your
Snowflake account.

If you call this function and mistakenly remove an endpoint, call the [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS](system_restore_privatelink_endpoint_tss.md)
system function to restore the endpoint within seven days. After seven days, the endpoint is deleted and can’t be recovered;
you will need to recreate the endpoint with the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](system_provision_privatelink_endpoint_tss.md).

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<provider_service_name>'
  )
```

**Azure:**

```sqlsyntax
SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<provider_resource_id>'
  )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<target_service_id>'
  )
```

## Arguments

**AWS:**

`provider_service_name`
:   Specifies the external KMS resource endpoint.

**Azure:**

`provider_resource_id`
:   Specifies the fully-qualified identifier for the resource in your VPC or VNet.

**Google Cloud:**

`target_service_id`
:   Specifies the service attachment ID or regional Google API endpoint.

## Returns

Returns a status message stating that the endpoint, with its identifier, is deprovisioned successfully.

## Access control requirements

Only users granted the MODIFY privilege on the account can call this function.
The MODIFY privilege is typically granted only to the ACCOUNTADMIN role.

## Usage notes

An error message occurs if a private connectivity endpoint is not associated with the specified arguments.

## Examples

**AWS:**

Deprovision a private endpoint with external access to the AWS KMS:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS('com.amazonaws.us-west-2.s3');
```

**Azure:**

Deprovision a private endpoint to prevent Snowflake from connecting to an external key vault on Microsoft Azure for Tri-Secret Secure:

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS(
  '/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/providers/Microsoft.KeyVault/vaults/TriSecretVault', 'trisecretvault.vault.azure.net'
);
```

```output
"Resource Endpoint with id "/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/privatelink-test/providers/Microsoft.KeyVault/vaults/TriSecretVault/privateEndpoints/" deprovisioned successfully"
```

**Google Cloud:**

```sqlexample
SELECT SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS(
  'cloudkms.us-west2.rep.googleapis.com'
);
```

```output
Private endpoint with id 'abcd0000000000001234' successfully marked for deletion. It may be restored within 7 days of deprovisioning.
```

---
title: SYSTEM$DEREGISTER_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_deregister_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DEREGISTER_CMK_INFO

Cancels registration of your currently-registered customer-managed key (CMK) for use with Tri-Secret Secure.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$DEREGISTER_CMK_INFO();
```

## Arguments

None.

## Returns

Returns a status message to system administrators stating that registration of your current CMK is cancelled.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

De-register your CMK for your Snowflake account:

```sqlexample
SELECT SYSTEM$DEREGISTER_CMK_INFO();
```

---
title: SYSTEM$DEREGISTER_CMK_INFO_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_deregister_cmk_info_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DEREGISTER_CMK_INFO_POSTGRES

Cancels registration of your currently-registered customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure.

## Syntax

```sqlsyntax
SYSTEM$DEREGISTER_CMK_INFO_POSTGRES();
```

## Arguments

None.

## Returns

Returns a status message to system administrators stating that registration of your current CMK is cancelled.

## Arguments

None.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

De-register your CMK for your Snowflake account:

```sqlexample
SELECT SYSTEM$DEREGISTER_CMK_INFO_POSTGRES();
```

---
title: SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY
source: https://docs.snowflake.com/en/sql-reference/functions/system_desc_iceberg_access_identity.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY

Returns information about the Snowflake service principal for a specified external cloud provider
in an account.

See also:
:   [Configure replication for Snowflake-managed Apache Iceberg™ tables](../../user-guide/tables-iceberg-replication.md)

## Syntax

```sqlsyntax
SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY(
  '<cloud_storage_provider>' [ , '<account_name>' ] )
```

## Required arguments

`'cloud_storage_provider'`
:   Specifies the cloud provider to retrieve service principal information for. You can specify one of the following values for this argument:

    * `'S3'`
    * `'GCS'`
    * `'AZURE'`

## Optional arguments

`'account_name'`
:   Optionally specifies the name of the Snowflake account for which you want to retrieve the service principal information. If specified, you
    must use the value in the `account_name` column returned by the [SHOW REPLICATION ACCOUNTS](../sql/show-replication-accounts.md) command.

    If not specified, the function returns information for the current account.

## Returns

The function returns a JSON object containing the following name/value pairs:

**S3**

```sqljson
{
  "STORAGE_PROVIDER":"S3",
  "STORAGE_AWS_IAM_USER_ARN":"<iam_user_arn>"
}
```

Where:

> `STORAGE_PROVIDER`
> :   The cloud storage provider.
>
> `STORAGE_AWS_IAM_USER_ARN`
> :   The ARN for the AWS IAM user that was created automatically for your Snowflake account.

**GCS**

```sqljson
{
  "STORAGE_PROVIDER":"GCS",
  "STORAGE_GCP_SERVICE_ACCOUNT":"<service_account_identifier>"
}
```

Where:

> `STORAGE_PROVIDER`
> :   The cloud storage provider.
>
> `STORAGE_GCP_SERVICE_ACCOUNT`
> :   The ID for the GCS service account that was created automatically for your Snowflake account.

**AZURE**

```sqljson
{
  "STORAGE_PROVIDER":"AZURE",
  "AZURE_MULTI_TENANT_APP_NAME":"<client_app_name>",
  "AZURE_CONSENT_URL_TEMPLATE":"https://login.microsoftonline.com/<your_tenant_id>/oauth2/authorize?client_id=..."
}
```

Where:

> `STORAGE_PROVIDER`
> :   The cloud storage provider.
>
> `AZURE_MULTI_TENANT_APP_NAME`
> :   Name of the Snowflake client application created for your Snowflake account.
>
> `AZURE_CONSENT_URL_TEMPLATE`
> :   Template URL to the Microsoft permissions request page. You must replace `your_tenant_id` with the ID for your
>     tenant that the storage location belongs to.
>
>     To find your tenant ID, log into the Azure portal and click Azure Active Directory » Properties.
>     The tenant ID is displayed in the Tenant ID field.

## Usage notes

Only returns results for account administrators (users with the ACCOUNTADMIN role).

## Examples

Retrieve the service principal for Azure:

```sqlexample
SELECT SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY('AZURE', 'MY_TARGET_SNOWFLAKE_ACCOUNT');
```

---
title: SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE
source: https://docs.snowflake.com/en/sql-reference/functions/system_disable_behavior_change_bundle.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE

Disables the behavior changes included in the specified [behavior change release bundle](../../release-notes/behavior-change-policy.md)
for the current account.

You can call this function for a particular bundle at the beginning of the
[testing period](../../release-notes/behavior-change-policy.md) for that bundle and becomes unavailable after the
[opt-out period](../../release-notes/behavior-change-policy.md) for that bundle.

An error occurs in either of the following cases:

* You call this function before the testing period for that bundle begins.
* You call this function after the opt-out period for that bundle ends.

If you call this function to disable a behavior change bundle during the testing period, the bundle remains disabled until you
enable it again or until the end of the opt-out period. Snowflake does not override this setting at the beginning of the opt-out
period, when the bundle becomes enabled by default.

See also:
:   [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](system_enable_behavior_change_bundle.md),
    [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](system_behavior_change_bundle_status.md),
    [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](system_show_active_behavior_change_bundles.md)

## Syntax

```sqlsyntax
SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE( '<bundle_name>' )
```

## Arguments

`bundle_name`
:   Name of the behavior change bundle, specified as a string. To obtain the name for a bundle, see
    [Behavior change announcements](../../release-notes/behavior-changes.md).

## Returns

Returns the VARCHAR value `DISABLED` if the function successfully disables the behavior changes.

## Examples

The following example disables the `2020_08` behavior change bundle for the current account.

```sqlexample
SELECT SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE('2020_08');
```

```output
+--------------------------------------------------+
| SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE('2020_08') |
|--------------------------------------------------|
| DISABLED                                         |
+--------------------------------------------------+
```

---
title: SYSTEM$DISABLE_DATABASE_REPLICATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_disable_database_replication.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DISABLE_DATABASE_REPLICATION

Disable replication for a primary database and any secondary databases linked to it.

If a database was previously enabled for replication using ALTER DATABASE … ENABLE REPLICATION TO ACCOUNTS, database replication
must be disabled before the database can be added to a [replication or failover group](../../user-guide/account-replication-intro.md).

## Syntax

```sqlsyntax
SYSTEM$DISABLE_DATABASE_REPLICATION('<db_name>');
```

## Arguments

`db_name`
:   Specifies the identifier for the database.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL function.
* This function must be executed from the source account with the primary database.

## Examples

Disable replication for primary database `mydb` and any linked secondary databases.

Executed from the source account:

```sqlexample
SELECT SYSTEM$DISABLE_DATABASE_REPLICATION('mydb');
```

---
title: SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_disable_global_data_sharing_for_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT

Disables Cross-Cloud Auto-Fulfillment on an account.

See also:
:   [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](system_is_global_data_sharing_enabled_for_account.md) , [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](system_enable_global_data_sharing_for_account.md), [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md)

## Syntax

```sqlsyntax
SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT( '<account_name>' )
```

## Arguments

`account_name`
:   Specifies the account on which to disable Cross-Cloud Auto-Fulfillment. To learn more about Snowflake account identifiers and how to locate them, see [Account identifiers](../../user-guide/admin-account-identifier.md).

## Returns

Returns the VARCHAR value `Statement executed successfully` if the function successfully disables Cross-Cloud Auto-Fulfillment on the account.

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this function.

## Examples

The following example disables Cross-Cloud Auto-Fulfillment on the account named `my_account`:

```sqlexample
SELECT SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('my_account');
```

```output
+--------------------------------------------------------------------+
| SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('my_account') |
|--------------------------------------------------------------------|
| Statement executed successfully                                    |
+--------------------------------------------------------------------+
```

---
title: SYSTEM$DISABLE_PREVIEW_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_disable_preview_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DISABLE_PREVIEW_ACCESS

Disables access to [open preview](../../release-notes/preview-features.md) and private preview features.

See also:

> [SYSTEM$GET_PREVIEW_ACCESS_STATUS](system_get_preview_access_status.md), [SYSTEM$ENABLE_PREVIEW_ACCESS](system_enable_preview_access.md)

## Syntax

```sqlsyntax
SYSTEM$DISABLE_PREVIEW_ACCESS()
```

## Arguments

None.

## Returns

Returns a VARCHAR status message that preview features have been disabled:

```output
+----------------------------------------------------------------+
| SYSTEM$DISABLE_PREVIEW_ACCESS()                                |
+----------------------------------------------------------------+
| Preview access has been successfully disabled for this account |
+----------------------------------------------------------------+
```

## Access control requirements

* Only account administrators (users with the ACCOUNTADMIN role) can execute this function.

## Usage notes

* Applies to both private and open preview features.
* This is an all-or-nothing setting that affects all users and all previews within an account.
* Any user in the account who is using a preview feature will lose access to that feature immediately after SYSTEM$DISABLE_PREVIEW_ACCESS is executed.
* Snowflake Marketplace products, which are managed separately through [IMPORTED PRIVILEGES](../../user-guide/data-exchange-marketplace-privileges.md), are not covered as part of this capability.
* Client-side libraries (such as Snowpark API) are not covered as part of this capability.

## Examples

Disable preview features:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$DISABLE_PREVIEW_ACCESS();
```

---
title: SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY
source: https://docs.snowflake.com/en/sql-reference/functions/system_disable_privatelink_access_only.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY

Unblocks connections for inbound network traffic that are routed over the public internet.

## Syntax

```sqlsyntax
SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY()
```

## Arguments

None.

## Returns

Returns a VARCHAR message that inbound connections can use the public internet.

## Access control requirements

Only account administrators — users with the ACCOUNTADMIN role — can run this function.

## Example

Restore public access for inbound network traffic to your Snowflake account:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY();
```

---
title: SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE
source: https://docs.snowflake.com/en/sql-reference/functions/system_enable_behavior_change_bundle.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE

Enables behavior changes included in the specified [behavior change release bundle](../../release-notes/behavior-change-policy.md) for the
current account.

By default, behavior change bundles are not enabled during the pre-announcement period. Use this function to test behavior changes before they are enabled for your account.

See also:
:   [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](system_disable_behavior_change_bundle.md),
    [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](system_behavior_change_bundle_status.md)
    [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](system_show_active_behavior_change_bundles.md)

## Syntax

```sqlsyntax
SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE( '<bundle_name>' )
```

## Arguments

`bundle_name`
:   Name of the behavior change bundle, specified as a string. To obtain the name for a bundle, see
    [Behavior change announcements](../../release-notes/behavior-changes.md).

## Returns

Returns the VARCHAR value `ENABLED` if the function successfully enables the behavior changes.

## Usage notes

* You cannot call this function from within a stored procedure or user-defined function (UDF).

## Examples

The following example enables the `2020_08` behavior change bundle for the current account.

```sqlexample
SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2020_08');
```

```output
+-------------------------------------------------+
| SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2020_08') |
|-------------------------------------------------|
| ENABLED                                         |
+-------------------------------------------------+
```

---
title: SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_enable_global_data_sharing_for_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT

Enables Cross-Cloud Auto-Fulfillment on an account. Cross-Cloud Auto-Fulfillment allows you to automatically provide the share or application package attached to your listing to other Snowflake consumer regions.

See also:
:   [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](system_is_global_data_sharing_enabled_for_account.md) , [SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](system_disable_global_data_sharing_for_account.md), [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md)

## Syntax

```sqlsyntax
SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT( '<account_name>' )
```

## Arguments

`account_name`
:   Specifies the account on which to enable Cross-Cloud Auto-Fulfillment. To learn more about Snowflake account identifiers and how to locate them, see [Account identifiers](../../user-guide/admin-account-identifier.md).

## Returns

Returns the VARCHAR value `Statement executed successfully` if the function successfully enables Cross-Cloud Auto-Fulfillment on the account.

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this function.

## Examples

The following example enables Cross-Cloud Auto-Fulfillment on the account named `my_account`:

```sqlexample
SELECT SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('my_account');
```

```output
+--------------------------------------------------------------------+
| SYSTEM$SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT('my_account') |
|--------------------------------------------------------------------|
| Statement executed successfully                                    |
+--------------------------------------------------------------------+
```

---
title: SYSTEM$ENABLE_PREVIEW_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_enable_preview_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ENABLE_PREVIEW_ACCESS

Enables access to [open preview](../../release-notes/preview-features.md) features.

See also:

> [SYSTEM$GET_PREVIEW_ACCESS_STATUS](system_get_preview_access_status.md), [SYSTEM$DISABLE_PREVIEW_ACCESS](system_disable_preview_access.md)

## Syntax

```sqlsyntax
SYSTEM$ENABLE_PREVIEW_ACCESS()
```

## Arguments

None.

## Returns

Returns a VARCHAR status message that open preview features have been enabled:

```output
+---------------------------------------------------------------+
| SELECT SYSTEM$ENABLE_PREVIEW_ACCESS();                        |
+---------------------------------------------------------------+
| Preview access has been successfully enabled for this account |
+---------------------------------------------------------------+
```

## Access control requirements

* Only account administrators (users with the ACCOUNTADMIN role) can execute this function.

## Usage notes

* This is an all-or-nothing setting that affects all users and all previews within an account.
* SYSTEM$ENABLE_PREVIEW_ACCESS only can enable [open preview features](../../release-notes/preview-features.md).

  [Contact Snowflake Support](../../user-guide/contacting-support.md) to enable or re-enable private preview features.
* Snowflake Marketplace products, which are managed separately through [IMPORTED PRIVILEGES](../../user-guide/data-exchange-marketplace-privileges.md),
  are not covered as part of this capability.
* Client-side libraries (such as Snowpark API) are not covered as part of this capability.
* For customers who have not agreed to the Snowflake [Preview Terms of Service](https://www.snowflake.com/legal/preview-terms-of-service/) (“Preview Terms”),
  enabling preview features may not be possible.

  To agree to Preview Terms, contact your account representative or [Snowflake Support](../../user-guide/contacting-support.md) for assistance.

## Examples

Enable preview features:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$ENABLE_PREVIEW_ACCESS();
```

---
title: SYSTEM$ENCODE_CKE_PRIMARY_KEY
source: https://docs.snowflake.com/en/sql-reference/functions/system_encode_cke_primary_key.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System information)

# SYSTEM$ENCODE_CKE_PRIMARY_KEY

Takes one or more [primary key](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) columns from a [Cortex Knowledge Extensions (CKE)](../../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md) document and converts them into an encoded representation.

The encoded primary key is used as an input for further hashing, which anonymizes document identifiers in access history tables. This process helps protect customer data by ensuring that Snowflake stores hashed values derived from the encoded primary key instead of plain-text document IDs.

See also:
:   [SYSTEM$CKE_HASH_FUNCTION](system_cke_hash_function.md)

## Syntax

```sqlsyntax
SYSTEM$ENCODE_CKE_PRIMARY_KEY(
  '<pk_column_name>'
  [ , '<additional_pk_column_name>' ]
  [ , '<additional_pk_column_name>' ]
  [ , '<additional_pk_column_name>' ]
  [ , '<additional_pk_column_name>' ]
)
```

## Arguments

**Required:**

`pk_column_name`
:   The primary key column name.

**Optional:**

`additional_pk_column_name`
:   Additional primary key column names.

    You can specify up to four additional primary key column names as separate arguments.

## Returns

Returns a length-encoded string from the combined primary keys. This serves as the unique document ID.

## Examples

The following example returns the encoded primary key for the primary key columns pkCol1 and pkCol2:

```sqlexample
SELECT ["pkCol1", "pkCol2"], SYSTEM$ENCODE_CKE_PRIMARY_KEY('primary_key_col_1' , 'primary_key_col_2') AS encoded_primary_key
  FROM your_cortex_search_service_table;
```

---
title: SYSTEM$END_DEBUG_APPLICATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_end_debug_application.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$END_DEBUG_APPLICATION

Disables [session debug mode](../../developer-guide/native-apps/installing-testing-application.md) for a Snowflake Native App.

## Syntax

```sqlsyntax
SYSTEM$END_DEBUG_APPLICATION()
```

---
title: SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY
source: https://docs.snowflake.com/en/sql-reference/functions/system_enforce_privatelink_access_only.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY

Enforces the behavior that successful connections to your Snowflake account use only your private endpoints.
Blocks connections for inbound network traffic that are routed over the public internet.

## Syntax

```sqlsyntax
SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY()
```

## Arguments

None.

## Returns

Returns a VARCHAR message that successful inbound connections now use only private endpoints.

## Access control requirements

Only account administrators — users with the ACCOUNTADMIN role — can run this function.

## Example

To enforce the behavior that successful connections to your Snowflake account use only your private endpoints:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY();
```

---
title: SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_estimate_automatic_clustering_costs.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS

Returns estimated costs associated with enabling [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) for a table. This
function is used to do the following:

* Estimate the cost of clustering a table for the first time.
* Estimate the cost of changing the cluster key of a table.
* Estimate, when possible, the cost associated with maintaining the table after it’s clustered around the specified key.
  Sometimes, a table might need more DML history to estimate future maintenance costs.

> **Important:**
>
> The cost estimates returned by the SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS function are best efforts. The actual realized costs can vary by up to 100% (or, in rare cases, several times) from the estimated costs.

## Syntax

```sqlsyntax
SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS( '<table_name>' ,
 [ '( <expr1> [ , <expr2> ... ] )' ] )
```

## Arguments

‘`table_name`’
:   Name of the table for which you want to return the estimated cost of clustering.

‘`(expr1 [ , expr2 ... ])`’
:   The proposed cluster key for the table is where each expression resolves to a table column. The function estimates the cost of
    clustering the table using these columns as the cluster key.

Even if only one column name or expression is passed, it must be inside parentheses.

This argument is required for a table with no clustering key. An error is returned if the argument is omitted.

This argument is optional for a table with a clustering key. If the argument is omitted, the function estimates the cost of
clustering the table using the table’s current clustering key.

## Returns

A value of type VARCHAR. The returned string is in JSON format and contains the following name/value pairs:

`warning`
:   Indicates whether conditions might affect the cost estimation accuracy or the impact of choosing a cluster key.

`reportTime`
:   Date when the function’s output was generated.

`clusteringKey`
:   Columns that make up the cluster key.

`initial`
:   Describes the predicted cost of clustering the table around the specified cluster key.
    The estimated cost of maintaining the table once it is clustered is not included.
    The `initial` JSON object contains the following name/value pairs.

    `unit`
    :   Indicates the units in which the initial cost is expressed.

    `value`
    :   Indicates the cost to cluster the table, expressed in `unit`.

    `comment`
    :   Interprets the initial cost of clustering.

`maintenance`
:   Describes the predicted costs of maintaining a well-clustered table after it is initially clustered.
    This prediction is based on recent DML activity because a table is reclustered as it changes.

    An empty object indicates that Snowflake was unable to provide a maintenance cost estimate. In most cases, Snowflake is unable to provide
    a maintenance estimate because the table did not have enough DML history available or did not have enough supported DML types in the
    past week to accurately predict costs.

    `unit`
    :   Indicates the units in which the cost is expressed.

    `value`
    :   Indicates how much it will cost to maintain the table after its initial clustering, expressed in `units` per day.

    `comment`
    :   Includes costs-incurring period and the time frame upon which the estimate is based.

## Access control requirements

The privileges needed to estimate costs are the same as those required to read the table and change the cluster key. You need the
following privileges:

* SELECT and INSERT privileges on the table, or OWNERSHIP privilege on the table.
* USAGE or OWNERSHIP privilege on the parent schema and database.

## Usage notes

* The cost estimates returned by the SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS function are based on sampling a subset of
  micro-partitions from your table and capturing the clustering execution time. Depending on sampled specific micro-partitions
  and the system speed, the cost estimates might differ between function executions.
* For the best possible accuracy, you can run SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS multiple times and average the results.
  The function uses sample clustering jobs and collects their execution time. The provided cost estimate might fluctuate depending
  on the system speed.
  Running the function multiple times and averaging the results can produce a more accurate cost estimate.
* The most common reason for an inaccurate maintenance cost estimate is that the past DML patterns based on the estimate did not
  match future DML patterns.
* Snowflake is able to provide a one-time cost estimate in the vast majority of cases and a maintenance cost estimate in some cases. If the
  function is unable to provide a maintenance cost estimate, Snowflake includes a reason in the output.

## Examples

Return the estimated costs associated with defining columns `day` and `tenantId` as the cluster key for table `myTable`.

```sqlexample
SELECT SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS('myTable', '(day, tenantId)');
```

```output
{
  "reportTime": "Fri, 12 Jul 2024 01:06:18 GMT",
  "clusteringKey": "LINEAR(day, tenantId)",
  "initial": {
    "unit": "Credits",
    "value": 98.2,
    "comment": "Total upper bound of one-time cost"
  },
  "maintenance": {
    "unit": "Credits",
    "value": 10.0,
    "comment": "Daily maintenance cost estimate provided based on DML history from the
    past seven days."
  }
}
```

---
title: SYSTEM$ESTIMATE_QUERY_ACCELERATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_estimate_query_acceleration.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$ESTIMATE_QUERY_ACCELERATION

For a previously executed query, this function returns a JSON object that specifies if the query is eligible to benefit from the
[query acceleration service](../../user-guide/query-acceleration-service.md). If the query is eligible for query acceleration, the output
includes the estimated query execution time for different query acceleration scale factors.

See also:
:   [QUERY_ACCELERATION_ELIGIBLE view](../account-usage/query_acceleration_eligible.md)

## Syntax

```sqlsyntax
SYSTEM$ESTIMATE_QUERY_ACCELERATION( '<query_id>' )
```

## Parameters

`query_id`
:   Query ID. Query ID must be for a query executed within the last 14 days; otherwise, the `status` is `invalid`.

## Output

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| `estimatedQueryTimes` | Object that contains the estimated query execution time in seconds for different query acceleration scale factors. If the `status` for the query is not `eligible` for query acceleration, this object is empty. |
| `ineligibleReason` | Explanation of why Snowflake didn’t use QAS for the query. For example, if the query doesn’t perform any table scans, or if it doesn’t scan a large enough amount of data to make QAS worthwhile, the reason is listed as `NO_LARGE_ENOUGH_SCAN`. |
| `originalQueryTime` | Execution time of the original query in seconds. |
| `queryUUID` | Query ID. |
| `status` | One of the following values that indicates whether or not the query is eligible to benefit from the query acceleration service:   |  |  | | --- | --- | | `eligible` | The query can benefit from query acceleration. | | `ineligible` | The query cannot benefit from query acceleration. | | `accelerated` | The query has already been accelerated. | | `invalid` | The query with the specified ID was not found. | |
| `upperLimitScaleFactor` | Number of the highest query acceleration scale factor in the `estimatedQueryTimes` object. If the `status` for the query is not `eligible` for query acceleration, this field is set to `0`. |

In the `estimatedQueryTimes` object, each name / value pair specifies a query acceleration [scale factor](../sql/create-warehouse.md) and the estimated query execution time at that scale factor.

The following example lists the estimated query execution time for the scale factors `1`, `2`, `4` and
`8`:

```sqljson
...
"estimatedQueryTimes" : {
  "1" : 171,
  "2": 152,
  "4": 133,
  "8": 120
}
...
```

## Usage notes

* Estimated query times are for analysis purposes only and are not guaranteed.
* Estimated query times are calculated based on the assumption that the query is serviced by all the compute resources allocated by the
  query acceleration service based on scale factor.
* Estimated query times do not factor in concurrency.

## Examples

For example queries, see [Identifying queries with the SYSTEM$ESTIMATE_QUERY_ACCELERATION function](../../user-guide/query-acceleration-service.md).

---
title: SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_estimate_search_optimization_costs.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS

Returns the estimated costs of adding [search optimization](../../user-guide/search-optimization-service.md) to a given table and
configuring specific columns for search optimization.

> **Important:**
>
> Cost estimates returned by the SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS function are best efforts. The actual realized
> costs can vary by up to 50% (or, in rare cases, by several times) from the estimated costs.
>
> * Build and storage cost estimates are based on sampling a subset of the rows in the table
> * Maintenance cost estimates are based on recent create, delete, and update activity in the table

## Syntax

```sqlsyntax
SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS('<table_name>' [ , '<search_method_with_target>' ])
```

## Arguments

**Required:**

`table_name`
:   Table for which you want to estimate the search optimization costs.

    If the table name is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the function looks for the table in the current schema for the session.

    The entire name must be enclosed in single quotes.

**Optional:**

`search_method_with_target`
:   Specifies a [search method and target](../sql/alter-table-event-table.md) for a
    [column configuration](../../user-guide/search-optimization/enabling.md) similar to what can be
    specified in the ON clause of the [ALTER TABLE](../sql/alter-table.md) … [ADD SEARCH
    OPTIMIZATION](../sql/alter-table.md) command.

    This entire argument must be enclosed in single quotes. Within this string, use double quotes around column names
    [where required](../identifiers-syntax.md).

## Output

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| `tableName` | Name of the table. |
| `searchOptimizationEnabled` | `true` if search optimization is enabled for the table or any columns in it; `false` otherwise. |
| `costPositions` | Array of objects that describe the predicted costs of adding search optimization to the table or its columns. |

Each object in the `costPositions` array represents a different type of cost estimate:

```sqljson
...
"costPositions" : [
  {
    "name" : "BuildCosts",
    ...
  }, {
    "name" : "StorageCosts",
    ...
  }, {
    "name" : "Benefit",
    ...
  }, {
    "name" : "MaintenanceCosts",
    ...
  }
]
...
```

The `name` property identifies the type of cost represented by the object. `name` can be one of the following:

| `name` of object in `costPositions` | Description |
| --- | --- |
| `BuildCosts` | This object describes the predicted costs of building the search access path for the table. If search optimization has already been added to the table or to all the specified columns, this object contains no cost information. |
| `StorageCosts` | This object describes the predicted amount of storage space (in TB) needed for the search access path for the table. |
| `Benefit` | This object appears only when the table has search optimization enabled. It does not contain information at this time. |
| `MaintenanceCosts` | This object describes the predicted costs of maintaining the search access path for the table when rows are inserted, deleted, or modified. If the table has been created recently, no cost information is reported. |

Each object in the `costPositions` array can have the following properties:

| Property | Description |
| --- | --- |
| `name` | Name that identifies the type of cost information represented by this object. |
| `costs` | Object that describes the predicted costs in terms of the following properties: |
| `value` | Amount of the predicted cost. |
| `unit` | Unit of measure for the cost (e.g., “Credits” for compute costs, “TB” for storage costs, etc.). |
| `perTimeUnit` | For maintenance costs, the unit of time that the estimated cost covers (for example, `"MONTH"` for the cost per month). |
| `computationMethod` | Method used to estimate the costs, if multiple methods are available. |
| `comment` | Additional information about the estimated cost. |

## Usage notes

* The `searchOptimizationEnabled` property is `true` when the table or any column in it has search optimization enabled.
* For the build cost, this function returns an approximation based on building search access paths for a sample of the data in the
  specified table.
* For the maintenance cost, this function bases the estimates on recent changes made to the table (the changes to bytes over
  time).
* In order to call the function, you must have a warehouse in use. If no warehouse is currently in use, the function reports the
  following error:

  ```none
  No active warehouse selected in the current session.
  Select an active warehouse with the 'use warehouse' command.
  ```

  The [warehouse size](../../user-guide/warehouses-overview.md) has no effect on the performance of this function, so you can use an
  X-Small warehouse.
* Because the function uses a warehouse, you are billed for warehouse usage for this function.
* The function can take somewhere in the range of 20 seconds to 10 minutes to complete. Using a larger warehouse does
  not result in faster execution.

## Examples

The following example shows the estimated costs of adding search optimization to a table:

> ```sqlexample
> SELECT SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS('table_without_search_opt')
>   AS estimate_for_table_without_search_optimization;
> ```
>
> ```output
> +---------------------------------------------------------------------------+
> | ESTIMATE_FOR_TABLE_WITHOUT_SEARCH_OPTIMIZATION                            |
> |---------------------------------------------------------------------------|
> | {                                                                         |
> |   "tableName" : "TABLE_WITHOUT_SEARCH_OPT",                               |
> |   "searchOptimizationEnabled" : false,                                    |
> |   "costPositions" : [ {                                                   |
> |     "name" : "BuildCosts",                                                |
> |     "costs" : {                                                           |
> |       "value" : 11.279,                                                   |
> |       "unit" : "Credits"                                                  |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "StorageCosts",                                              |
> |     "costs" : {                                                           |
> |       "value" : 0.070493,                                                 |
> |       "unit" : "TB"                                                       |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "MaintenanceCosts",                                          |
> |     "costs" : {                                                           |
> |       "value" : 30.296,                                                   |
> |       "unit" : "Credits",                                                 |
> |       "perTimeUnit" : "MONTH"                                             |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "Estimated from historic change rate over last ~11 days." |
> |   } ]                                                                     |
> | }                                                                         |
> +---------------------------------------------------------------------------+
> ```

The following example shows the output of this function for a table that already has search optimization enabled. You
can see that no build cost information is available in this case. Also, the `Benefit` property is included (but
it does not contain any information).

> ```sqlexample
> SELECT SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS('table_with_search_opt')
>   AS estimate_for_table_with_search_optimization;
> ```
>
> ```output
> +---------------------------------------------------------------------------+
> | ESTIMATE_FOR_TABLE_WITH_SEARCH_OPTIMIZATION                               |
> |---------------------------------------------------------------------------|
> | {                                                                         |
> |   "tableName" : "TABLE_WITH_SEARCH_OPT",                                  |
> |   "searchOptimizationEnabled" : true,                                     |
> |   "costPositions" : [ {                                                   |
> |     "name" : "BuildCosts",                                                |
> |     "computationMethod" : "NotAvailable",                                 |
> |     "comment" : "Search optimization is already enabled."                 |
> |   }, {                                                                    |
> |     "name" : "StorageCosts",                                              |
> |     "costs" : {                                                           |
> |       "value" : 0.052048,                                                 |
> |       "unit" : "TB"                                                       |
> |     },                                                                    |
> |     "computationMethod" : "Measured"                                      |
> |   }, {                                                                    |
> |     "name" : "Benefit",                                                   |
> |     "computationMethod" : "NotAvailable",                                 |
> |     "comment" : "Currently not supported."                                |
> |   }, {                                                                    |
> |     "name" : "MaintenanceCosts",                                          |
> |     "costs" : {                                                           |
> |       "value" : 30.248,                                                   |
> |       "unit" : "Credits",                                                 |
> |       "perTimeUnit" : "MONTH"                                             |
> |     },                                                                    |
> |     "computationMethod" : "EstimatedUpperBound",                          |
> |     "comment" : "Estimated from historic change rate over last ~11 days." |
> |   } ]                                                                     |
> | }                                                                         |
> +---------------------------------------------------------------------------+
> ```

The following example shows the output of this function for estimating search optimization on three specific columns of
a table using the EQUALITY search method (that is, the estimate is for enabling search optimization only for equality
comparisons on these columns). Neither the table nor any of its columns already have any type of search optimization enabled.

> ```sqlexample
> SELECT SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS('table_without_search_opt', 'EQUALITY(C1, C2, C3)')
>   AS estimate_for_columns_without_search_optimization;
> ```
>
> ```output
> +---------------------------------------------------------------------------+
> | ESTIMATE_FOR_COLUMNS_WITHOUT_SEARCH_OPTIMIZATION                          |
> |---------------------------------------------------------------------------|
> | {                                                                         |
> |   "tableName" : "TABLE_WITHOUT_SEARCH_OPT",                               |
> |   "searchOptimizationEnabled" : false,                                    |
> |   "costPositions" : [ {                                                   |
> |     "name" : "BuildCosts",                                                |
> |     "costs" : {                                                           |
> |       "value" : 10.527,                                                   |
> |       "unit" : "Credits"                                                  |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "StorageCosts",                                              |
> |     "costs" : {                                                           |
> |       "value" : 0.040323,                                                 |
> |       "unit" : "TB"                                                       |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "MaintenanceCosts",                                          |
> |     "costs" : {                                                           |
> |       "value" : 22.821,                                                   |
> |       "unit" : "Credits",                                                 |
> |       "perTimeUnit" : "MONTH"                                             |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "Estimated from historic change rate over last ~7 days."  |
> |   } ]                                                                     |
> | }                                                                         |
> +---------------------------------------------------------------------------+
> ```

If a similar query is run on a table where search optimization is already enabled for any of the specified columns, the
output includes a build cost estimate that covers adding search optimization to the specified columns where it is
not already enabled. This is different from the earlier example where we were estimating search optimization on a whole
table where search optimization was already enabled, which resulted in no build cost estimate since there was no build
work to be done.

The storage estimate here includes only the actual search access path size for the columns where search optimization is
already enabled.

The maintenance estimate covers all of the specified columns regardless of whether they already have search optimization
enabled.

> ```sqlexample
> SELECT SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS('table_with_search_opt', 'EQUALITY(C1, C2, C3)')
>   AS estimate_for_columns_with_search_optimization;
> ```
>
> ```output
> +---------------------------------------------------------------------------+
> | ESTIMATE_FOR_COLUMNS_WITH_SEARCH_OPTIMIZATION                             |
> |---------------------------------------------------------------------------|
> | {                                                                         |
> |   "tableName" : "TABLE_WITH_SEARCH_OPT",                                  |
> |   "searchOptimizationEnabled" : true,                                     |
> |   "costPositions" : [ {                                                   |
> |     "name" : "BuildCosts",                                                |
> |     "costs" : {                                                           |
> |       "value" : 8.331,                                                    |
> |       "unit" : "Credits"                                                  |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "StorageCosts",                                              |
> |     "costs" : {                                                           |
> |       "value" : 0.040323,                                                 |
> |       "unit" : "TB"                                                       |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "estimated via sampling"                                  |
> |   }, {                                                                    |
> |     "name" : "Benefit",                                                   |
> |     "computationMethod" : "NotAvailable",                                 |
> |     "comment" : "Currently not supported."                                |
> |   }, {                                                                    |
> |     "name" : "MaintenanceCosts",                                          |
> |     "costs" : {                                                           |
> |       "value" : 22.821,                                                   |
> |       "unit" : "Credits",                                                 |
> |       "perTimeUnit" : "MONTH"                                             |
> |     },                                                                    |
> |     "computationMethod" : "Estimated",                                    |
> |     "comment" : "Estimated from historic change rate over last ~7 days."  |
> |   } ]                                                                     |
> | }                                                                         |
> +---------------------------------------------------------------------------+
> ```

---
title: SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS
source: https://docs.snowflake.com/en/sql-reference/functions/system_evaluate_data_quality_expectations.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md), [Table functions](../functions-table.md)

# SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS

Returns the [expectations](../../user-guide/data-quality-expectations.md) for associations between data metric functions (DMFs) and a table,
including whether an expectation is currently violated.

## Syntax

```sqlsyntax
SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS(
  REF_ENTITY_NAME  => '<object>'
  [ , SKIP_SUSPENDED_DMF => { TRUE | FALSE } ] )
```

## Arguments

`REF_ENTITY_NAME => 'object'`
:   Name of the table or view that has at least one DMF with one or more expectations. Must be fully qualified.

`SKIP_SUSPENDED_DMF => { TRUE | FALSE }`
:   If set to TRUE, the function doesn’t return expectations that are defined for associations between the `object` and suspended
    DMFs. A suspended DMF doesn’t run on the object’s specified schedule.

    Default: TRUE

## Returns

Returns a table with the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `expectation_name` | VARCHAR | Name that the user assigned the expectation when adding it to the association between the DMF and the table. |
| `expectation_id` | NUMBER | System-generated identifier. |
| `expectation_expression` | VARCHAR | Boolean expression of the expectation. See [Defining what meets the expectation](../../user-guide/data-quality-expectations.md). |
| `arguments` | ARRAY | Columns with which the DMF is associated. |
| `value` | VARIANT | The result of the DMF evaluation. |
| `expectation_violated` | BOOLEAN | If TRUE, the expectation was violated. An expectation is violated when the `expectation_expression` evaluates to FALSE. |

## Access control privileges

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| SELECT | Table or view |  |
| USAGE | Data metric function (DMF) |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Example

Return the expectations for the associations between DMFs AND table `t1`. The DMFs are executed to determine if the expectations are
currently violated.

```sqlexample
SELECT *
  FROM TABLE(SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS(
      REF_ENTITY_NAME => 'my_db.sch.t1'));
```

---
title: SYSTEM$EXPLAIN_JSON_TO_TEXT
source: https://docs.snowflake.com/en/sql-reference/functions/system_explain_json_to_text.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$EXPLAIN_JSON_TO_TEXT

This function converts EXPLAIN output from JSON to formatted text.

See also:
:   [SYSTEM$EXPLAIN_PLAN_JSON](system_explain_plan_json.md) , [EXPLAIN_JSON](explain_json.md)

## Syntax

```sqlsyntax
SYSTEM$EXPLAIN_JSON_TO_TEXT( <explain_output_in_json_format> )
```

## Arguments

`explain_output_in_json_format`
:   A string, or an expression that evaluates to a string, containing EXPLAIN output as a JSON-compatible string.
    If the input is a string, the string should be enclosed in single quotes `'`.

## Returns

The function returns a VARCHAR containing the EXPLAIN output as text that has been formatted to be relatively easy for
humans to read.

## Usage notes

* This function converts EXPLAIN information from JSON to formatted text.
  Often, the JSON value is produced directly or indirectly from the [SYSTEM$EXPLAIN_PLAN_JSON](system_explain_plan_json.md) function.
  For example, the output from SYSTEM$EXPLAIN_PLAN_JSON could be stored in a table, then displayed later using this
  SYSTEM$EXPLAIN_JSON_TO_TEXT function.
* If a string literal is passed as input, the delimiter around the string can be either a single quote `'` or a
  double dollar sign `$$`. If the string literal contains single quotes (and does not contain double dollar
  signs), then delimiting the string with double dollar signs avoids the need to escape the embedded single quote
  characters inside the string.

## Examples

The example(s) below use these tables:

> ```sqlexample
> CREATE TABLE Z1 (ID INTEGER);
> CREATE TABLE Z2 (ID INTEGER);
> CREATE TABLE Z3 (ID INTEGER);
> ```

If you want to store the EXPLAIN output in JSON format, but display it as formatted text, you can call
`SYSTEM$EXPLAIN_JSON_TO_TEXT()` as shown below:

> First, get EXPLAIN output in JSON format and store it in a table:
>
> > ```sqlexample
> > SET QUERY_10 = 'SELECT Z1.ID, Z2.ID FROM Z1, Z2 WHERE Z2.ID = Z1.ID';
> > CREATE TABLE json_explain_output_for_analysis (
> >     ID INTEGER,
> >     query VARCHAR,
> >     explain_plan VARCHAR
> >     );
> > INSERT INTO json_explain_output_for_analysis (ID, query, explain_plan)
> >     SELECT
> >         1,
> >         $QUERY_10 AS query,
> >         SYSTEM$EXPLAIN_PLAN_JSON($QUERY_10) AS explain_plan;
> > ```
>
> The JSON looks like the output shown below:
>
> > ```sqlexample
> > SELECT query, explain_plan FROM json_explain_output_for_analysis;
> > +-----------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > | QUERY                                               | EXPLAIN_PLAN                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
> > |-----------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> > | SELECT Z1.ID, Z2.ID FROM Z1, Z2 WHERE Z2.ID = Z1.ID | {"GlobalStats":{"partitionsTotal":2,"partitionsAssigned":2,"bytesAssigned":1024},"Operations":[[{"id":0,"operation":"Result","expressions":["Z1.ID","Z2.ID"]},{"id":1,"parentOperators":[0],"operation":"InnerJoin","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":2,"parentOperators":[1],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z2"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512},{"id":3,"parentOperators":[1],"operation":"JoinFilter","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":4,"parentOperators":[3],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z1"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512}]]} |
> > +-----------------------------------------------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > ```
>
> After you have stored the JSON in a table, you can pass the JSON to the SYSTEM$EXPLAIN_JSON_TO_TEXT function to
> convert it to a more human-readable text format by calling SYSTEM$EXPLAIN_JSON_TO_TEXT:
>
> > ```sqlexample
> > SELECT SYSTEM$EXPLAIN_JSON_TO_TEXT(explain_plan)
> >     FROM json_explain_output_for_analysis
> >     WHERE json_explain_output_for_analysis.ID = 1;
> > +------------------------------------------------------------------------------------------------------------------------------------+
> > | SYSTEM$EXPLAIN_JSON_TO_TEXT(EXPLAIN_PLAN)                                                                                          |
> > |------------------------------------------------------------------------------------------------------------------------------------|
> > | GlobalStats:                                                                                                                       |
> > | 	bytesAssigned=1024                                                                                                                                                                                                                                                                         |
> > | 	partitionsAssigned=2                                                                                                                                                                                                                                                                         |
> > | 	partitionsTotal=2                                                                                                                                                                                                                                                                         |
> > | Operations:                                                                                                                        |
> > | 1:0     ->Result  Z1.ID, Z2.ID                                                                                                     |
> > | 1:1          ->InnerJoin  joinKey: (Z2.ID = Z1.ID)                                                                                 |
> > | 1:2               ->TableScan  TESTDB.TEMPORARY_DOC_TEST.Z2  ID  {partitionsTotal=1, partitionsAssigned=1, bytesAssigned=512}      |
> > | 1:3               ->JoinFilter  joinKey: (Z2.ID = Z1.ID)                                                                           |
> > | 1:4                    ->TableScan  TESTDB.TEMPORARY_DOC_TEST.Z1  ID  {partitionsTotal=1, partitionsAssigned=1, bytesAssigned=512} |
> > |                                                                                                                                    |
> > +------------------------------------------------------------------------------------------------------------------------------------+
> > ```

---
title: SYSTEM$EXPLAIN_PLAN_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/system_explain_plan_json.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$EXPLAIN_PLAN_JSON

Given the text of a SQL statement, this function generates the EXPLAIN plan in JSON.

See also:
:   [SYSTEM$EXPLAIN_JSON_TO_TEXT](system_explain_json_to_text.md) , [EXPLAIN_JSON](explain_json.md)

## Syntax

```sqlsyntax
SYSTEM$EXPLAIN_PLAN_JSON( { <sql_statement_expression> | <sql_query_id_expression> } )
```

## Arguments

`sql_statement_expression`
:   A string, or an expression that evaluates to a string, containing the SQL statement for which you want the EXPLAIN
    plan.
    If a literal string is used, it should be surrounded by single quote characters `'`.

`sql_query_id_expression`
:   A string, or an expression that evaluates to a string, containing the query ID for which you want the EXPLAIN plan.
    If a literal string is used, it should be surrounded by single quote characters `'`.

    Snowflake retains historical data for query IDs executed within the previous 14 days. If you specify the query ID
    for a query executed more than 14 days in the past, an error is returned. For more information, see
    [Monitor query activity with Query History](../../user-guide/ui-snowsight-activity.md).

## Returns

The function returns a VARCHAR containing the EXPLAIN output in JSON-compatible format.

## Usage notes

* If a string literal is passed as input, the delimiter around the string can be either a single quote `'` or a
  double dollar sign `$$`. If the string literal contains single quotes (and does not contain double dollar
  signs), then delimiting the string with double dollar signs avoids the need to escape the embedded single quote
  characters inside the string.
* SQL statements that would fail if they were run standalone can’t be used as arguments to this function.
  For example, if a CREATE TABLE statement is specified, it can’t be run again (the table name already exists).
  The system function fails with an error when it attempts to recompile the statement.
* To post-process the output of this command, you can:

  + Use the [RESULT_SCAN](result_scan.md) function, which treats the output as a table that can be
    queried.
  + Insert the JSON-formatted output into a table for analysis later.
    If you store the output in JSON format, you can use the function
    [SYSTEM$EXPLAIN_JSON_TO_TEXT](system_explain_json_to_text.md) or
    [EXPLAIN_JSON](explain_json.md) to convert the JSON to a more human readable format (either tabular
    or formatted text).

## Examples

These examples use the tables shown below:

```sqlexample
CREATE TABLE Z1 (ID INTEGER);
CREATE TABLE Z2 (ID INTEGER);
CREATE TABLE Z3 (ID INTEGER);
```

This example uses a literal string that contains an SQL statement as the input argument:

```sqlsyntax
SELECT SYSTEM$EXPLAIN_PLAN_JSON(
  'SELECT Z1.ID, Z2.ID FROM Z1, Z2 WHERE Z2.ID = Z1.ID'
  ) AS explain_plan;
```

```output
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| EXPLAIN_PLAN                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      |
|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| {"GlobalStats":{"partitionsTotal":2,"partitionsAssigned":2,"bytesAssigned":1024},"Operations":[[{"id":0,"operation":"Result","expressions":["Z1.ID","Z2.ID"]},{"id":1,"parentOperators":[0],"operation":"InnerJoin","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":2,"parentOperators":[1],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z2"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512},{"id":3,"parentOperators":[1],"operation":"JoinFilter","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":4,"parentOperators":[3],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z1"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512}]]} |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

Use `$$` to delimit queries that contain single quotes:

```sqlexample
SELECT SYSTEM$EXPLAIN_PLAN_JSON(
    $$ SELECT symptom, IFNULL(diagnosis, '(not yet diagnosed)') FROM medical $$
    );
```

The code below shows how to look at the EXPLAIN plan for a query that you already executed.

Run the query:

```sqlexample
SELECT Z1.ID, Z2.ID FROM Z1, Z2 WHERE Z2.ID = Z1.ID;
```

Run EXPLAIN on the query, calling `LAST_QUERY_ID()` to look up the query ID:

```sqlexample
SELECT SYSTEM$EXPLAIN_PLAN_JSON(LAST_QUERY_ID()) AS explain_plan;
```

---
title: SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW
source: https://docs.snowflake.com/en/sql-reference/functions/system_export_tds_from_semantic_view.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW

Returns a [semantic view](../../user-guide/views-semantic/overview.md) in Tableau Data Source (TDS) format.

## Syntax

```sqlsyntax
SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW( '<semantic_view_name>' )
```

## Arguments

`'semantic_view_name'`
:   Name of the semantic view to export.

    If the semantic view is not in the current schema and database, specify the
    [fully-qualified name of the view](../name-resolution.md) (for example, `my_db.my_schema.my_semantic_view`).

## Returns

Returns a VARCHAR value containing the semantic view in TDS format.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

For details about the conversion process and limitations with the conversion, see [Exporting a semantic view to a Tableau Data Source (TDS) file](../../user-guide/views-semantic/sql.md).

## Examples

The following statement returns the semantic view `my_sv` in TDS format:

```sqlexample
SELECT SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW('my_sv');
```

```output
+------------------------------------------------------------------------+
| SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW('MY_SV')                          |
|------------------------------------------------------------------------|
| <?xml version="1.0" encoding="UTF-8"?>                                 |
| <!--Tableau compatibility notice:                                      |
| ... -->                                                                |
| <datasource xmlns:user="http://www.tableausoftware.com/xml/user" ... > |
| ...                                                                    |
+------------------------------------------------------------------------+
```

---
title: SYSTEM$EXTERNAL_TABLE_PIPE_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_external_table_pipe_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$EXTERNAL_TABLE_PIPE_STATUS

Retrieves a JSON representation of the current refresh status for the internal (hidden) pipe object associated with an external table.

Automatically refreshing the metadata for an external table relies internally on Snowpipe, which receives event notifications when changes occur in the monitored cloud storage. For more information, see [Introduction to external tables](../../user-guide/tables-external-intro.md).

## Syntax

```sqlsyntax
SYSTEM$EXTERNAL_TABLE_PIPE_STATUS( '<external_table_name>' )
```

## Arguments

`external_table_name`
:   External table for which you want to retrieve the current automatic refresh status.

## Usage notes

* This function only returns results for the external table owner (i.e. the role that has the OWNERSHIP privilege on the external table).
* `external_table_name` is a string so it must be enclosed in single quotes:

  + Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified), i.e. `'<db>.<schema>.<external_table_name>'`.
  + If the external table name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<external_table_name>"'`.

## Output

The function returns a JSON object containing the following name/value pairs (if applicable to the current pipe status):

> {“executionState”:”<value>”,”oldestPendingFilePath”:”<value>”,”oldestFileTimestamp”:<value>,”pendingFileCount”:<value>,”lastPipeFaultTimestamp”:”<value>”,”notificationChannelName”:”<value>”,”numOutstandingMessagesOnChannel”:<value>,”lastReceivedMessageTimestamp”:”<value>”,”lastForwardedMessageTimestamp”:”<value>”,”error”:<value>,”fault”:<value>,”lastPulledFromChannelTimestamp”:”<value>”,”lastForwardedFilePath”:”<value>”}

Where:

> `executionState`
> :   Current execution state of the pipe. The value could be any one of the following:
>
>     * `RUNNING` (i.e. everything is normal; Snowflake may or may not be actively processing event messages for this pipe)
>     * `STOPPED_CLONED` (i.e. the pipe is contained by a database or schema clone)
>     * `STOPPED_FEATURE_DISABLED`
>     * `STOPPED_STAGE_DROPPED`
>     * `STOPPED_FILE_FORMAT_DROPPED`
>     * `STOPPED_NOTIFICATION_INTEGRATION_DROPPED`
>     * `STOPPED_MISSING_PIPE`
>     * `STOPPED_MISSING_TABLE` (the target table defined in the pipe definition was dropped)
>     * `STALLED_COMPILATION_ERROR`
>     * `STALLED_INITIALIZATION_ERROR`
>     * `STALLED_EXECUTION_ERROR`
>     * `STALLED_INTERNAL_ERROR`
>     * `PAUSED`
>     * `PAUSED_BY_SNOWFLAKE_ADMIN`
>     * `PAUSED_BY_ACCOUNT_ADMIN`
>
> `oldestPendingFilePath`
> :   Path to the oldest data file currently queued for a metadata refresh operation. The timestamp when the file was added to the queue is returned in the `oldestFileTimestamp` property.
>
> `oldestFileTimestamp`
> :   Earliest timestamp among data files currently queued for a metadata refresh operation (if applicable), where the timestamp is set when the file is added to the queue.
>
> `pendingFileCount`
> :   Number of files currently being processed by the pipe. This value decreases as the external table metadata is refreshed. When this value is `0`, no metadata refreshes are queued for this pipe.
>
> `lastPipeFaultTimestamp`
> :   Timestamp when an internal Snowflake process error was last detected.
>
> `notificationChannelName`
> :   Amazon SQS queue or Microsoft Azure storage queue associated with the pipe.
>
> `numOutstandingMessagesOnChannel`
> :   Number of messages in the queue that have been queued but not received yet.
>
> `lastReceivedMessageTimestamp`
> :   Timestamp of the last message received from the queue.
>
> `lastForwardedMessageTimestamp`
> :   Timestamp of the last applicable event message with a matching path/prefix that was forwarded to the pipe.
>
> `channelErrorMessage`
> :   Error message produced when attempting to read messages from the associated cloud messaging service queue.
>
> `lastErrorRecordTimestamp`
> :   Timestamp of last channel error message (i.e. error message reported in the `channelErrorMessage` value).
>
> `error`
> :   Error message produced when the pipe was last compiled for execution (if applicable); often caused by problems accessing the necessary objects (i.e. table, stage, file format) due to permission problems or dropped objects.
>
> `fault`
> :   Most recent internal Snowflake process error (if applicable). Used primarily by Snowflake for debugging purposes.
>
> `lastPulledFromChannelTimestamp`
> :   Timestamp when Snowpipe last pulled event notifications for the pipe from the cloud messaging service queue.
>
> `lastForwardedFilePath`
> :   Path of the data file identified in the last applicable event message that was forwarded to the pipe.

## Examples

Retrieve the automatic refresh status for an external table with a case-insensitive name:

> ```sqlexample
> SELECT SYSTEM$EXTERNAL_TABLE_PIPE_STATUS('mydb.myschema.exttable');
>
> +---------------------------------------------------------------+
> | SYSTEM$EXTERNAL_TABLE_PIPE_STATUS('MYDB.MYSCHEMA.EXTTABLE')   |
> |---------------------------------------------------------------|
> | {"executionState":"RUNNING","pendingFileCount":0}             |
> +---------------------------------------------------------------+
> ```

Retrieve the status for a pipe with a case-sensitive name:

> ```sqlexample
> SELECT SYSTEM$EXTERNAL_TABLE_PIPE_STATUS('mydb.myschema."extTable"');
>
> +---------------------------------------------------------------+
> | SYSTEM$EXTERNAL_TABLE_PIPE_STATUS('MYDB.MYSCHEMA."extTable"') |
> |---------------------------------------------------------------|
> | {"executionState":"RUNNING","pendingFileCount":0}             |
> +---------------------------------------------------------------+
> ```

---
title: SYSTEM$FINISH_OAUTH_FLOW
source: https://docs.snowflake.com/en/sql-reference/functions/system_finish_oauth_flow.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$FINISH_OAUTH_FLOW

Sets the OAUTH_REFRESH_TOKEN parameter value of the secret passed as an argument in the [SYSTEM$START_OAUTH_FLOW](system_start_oauth_flow.md)
call that began the OAuth flow.

This function completes the OAuth client flow begun with SYSTEM$START_OAUTH_FLOW.

## Syntax

```sqlsyntax
SYSTEM$FINISH_OAUTH_FLOW( '<query_string>' )
```

## Arguments

`'query_string'`
:   Query string from the URL in the browser after completing user authentication and providing OAuth consent.

## Usage notes

Use this function to set the refresh token of an OAuth2 secret you’re using to authenticate with a service provider. This function finishes an
OAuth flow that must begin with your call to [SYSTEM$START_OAUTH_FLOW](system_start_oauth_flow.md).

You must execute this function immediately after – and in the same session as – SYSTEM$START_OAUTH_FLOW. This ensures that the user who is
finishing the flow is the same as the user who started it.

## Examples

```sqlexample
SELECT SYSTEM$FINISH_OAUTH_FLOW('state=252462476&authz_code=54264262');
```

---
title: SYSTEM$GENERATE_SAML_CSR
source: https://docs.snowflake.com/en/sql-reference/functions/system_generate_saml_csr.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GENERATE_SAML_CSR

Generates a certificate signing request (CSR) with the subject set to the subject of the certificate stored in the [SAML2 integration](../sql/create-security-integration-saml2.md) and can specify the `DN` to be used in the CSR.

## Syntax

```sqlsyntax
SYSTEM$GENERATE_SAML_CSR( <name> , <DN> )
```

## Arguments

`name`
:   The name of the SAML2 security integration to generate the CSR.

    Required.

`DN`
:   The distinguished name to be used the CSR. Note that a DN is a string of relative DNs separated by commas. For example:

    > `'cn=juser, ou=dev, ou=people, o=eng, dc=com'`

    Optional.

    If missing, the DN of the current certificate will be used. If using the self-signed certificate, the value will be the account alias, if set, or the account name.

## Usage notes

None.

## Example

To generate a CSR with the subject set to the subject of the current certificate stored in the SAML2 integration, execute the function with the `name` parameter only. For example:

> ```sqlexample
> select system$generate_saml_csr('my_idp');
>
> --------------------------------------------------------------------------------------------------+
> SYSTEM$GENERATE_SAML_CSR('MY_IDP')                                                                |
> --------------------------------------------------------------------------------------------------+
> -----BEGIN NEW CERTIFICATE REQUEST-----                                                           |
> MIICWzCCAUMCAQAwFjEUMBIGA1UEAxMLVEVTVEFDQ09VTlQwggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIBAQDCRpyZ  |
> ...                                                                                               |
> -----END NEW CERTIFICATE REQUEST-----                                                             |
> --------------------------------------------------------------------------------------------------+
> ```
>
> > **Note:**
> >
> > The current certificate refers to the value of the `SAML2_SNOWFLAKE_X509_CERT` in the SAML2 integration (row 7 after executing a [DESCRIBE INTEGRATION](../sql/desc-integration.md) statement on the SAML2 integration).
> >
> > This certificate value could be the self-signed certificate or a certificate uploaded previously using an [ALTER SECURITY INTEGRATION](../sql/alter-security-integration-saml2.md) statement as shown in [Manage Your SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

To generate a CSR with the CSR’s subject set to a given value, execute the function with both the `name` and `DN` parameters. For example:

> ```sqlexample
> select system$generate_saml_csr('my_idp', 'cn=juser, ou=dev, ou=people, o=eng, dc=com');
>
> --------------------------------------------------------------------------------------------------+
> SYSTEM$GENERATE_SAML_CSR('MY_IDP')                                                                |
> --------------------------------------------------------------------------------------------------+
> -----BEGIN NEW CERTIFICATE REQUEST-----                                                           |
> MIICWzCCAUMCAQAwFjEUMBIGA1UEAxMLVEVTVEFDQ09VTlQwggEiMA0GCSqGSIb3DQEBAQUAA4IBDwAwggEKAoIBAQDCRpyZ  |
> ...                                                                                               |
> -----END NEW CERTIFICATE REQUEST-----                                                             |
> --------------------------------------------------------------------------------------------------+
> ```

You can then upload the certificate for that private key using the CSR generated by the function into Snowflake.

---
title: SYSTEM$GENERATE_SCIM_ACCESS_TOKEN
source: https://docs.snowflake.com/en/sql-reference/functions/system_generate_scim_access_token.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GENERATE_SCIM_ACCESS_TOKEN

Returns a new SCIM access token that is valid for six months.

## Syntax

```sqlsyntax
SYSTEM$GENERATE_SCIM_ACCESS_TOKEN('<integration_name>')
```

## Arguments

`<integration_name>`
:   Name of the security integration where `TYPE = SCIM`. Note that the integration name is case-sensitive, must be uppercase, and be enclosed in single quotes.

    For more information, see [CREATE SECURITY INTEGRATION](../sql/create-security-integration-scim.md).

## Usage notes

* Generating a new SCIM access token does not invalidate an existing token. To invalidate an access token,
  you must delete the entire SCIM security integration using the [DROP INTEGRATION](../sql/drop-integration.md) command. At that point, you can
  recreate the security integration using the [CREATE SECURITY INTEGRATION](../sql/create-security-integration-scim.md) command, and then use this
  function to generate a new token.
* There is no limit to the number of SCIM access tokens that you can generate.

## Output

The function returns the SCIM access token as a string.

## Examples

The following example retrieves the SCIM access token for the specified integration:

> ```sqlexample
> SELECT SYSTEM$GENERATE_SCIM_ACCESS_TOKEN('OKTA_PROVISIONING');
> ```

---
title: SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_all_default_columns_overrides.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES

Returns the list of columns that were set by previous calls to
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md) and
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md).

For more information, see [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md) ,
    [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_get_default_columns_override_for_show_command.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_unset_default_columns_override_for_show_command.md) ,
    [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md) ,
    [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_get_default_columns_override_for_system_object.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_unset_default_columns_override_for_system_object.md)

## Syntax

```sqlsyntax
SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES()
```

## Arguments

None.

## Returns

Returns a VARCHAR value (a string) in JSON format. The string is a JSON array that contains an object for each SHOW command and
Snowflake view that has an overridden list of columns.

If the object represents the overridden list of default columns for a SHOW command, the object contains the following name/value
pairs:

| Name | Description |
| --- | --- |
| `isShowCommand` | Indicates if the object represents the list of columns for a SHOW command. In this case, the value is `true`. |
| `showCommandType` | Type of the object for the SHOW command. For example, for SHOW NOTIFICATION INTEGRATIONS, the value is `"NOTIFICATION INTEGRATIONS"`. |
| `serializedDefaultColumns` | Comma-separated list of columns specified in a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND call. The column names are in uppercase. |

If the object represents the overridden list of default columns for a Snowflake view, the object contains the following
name/value pairs:

| Name | Description |
| --- | --- |
| `domain` | Type of the object. In this case, the value is `"VIEW"`. |
| `isShowCommand` | Indicates if the object represents the list of columns for a SHOW command. In this case, the value is `false`. |
| `dbName` | Name of the database containing the view. For INFORMATION_SCHEMA views, the value is an empty string (`""`). |
| `schemaName` | Name of the schema containing the view. |
| `objectName` | Name of the view. |
| `serializedDefaultColumns` | Comma-separated list of columns specified in a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT call. The column names are in uppercase. |

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Examples

The following example returns the list of columns specified by previous calls to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND and SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT:

```sqlexample
SELECT SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES();
```

```output
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES()                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| [{"domain":"VIEW","isShowCommand":false,"dbName":"","schemaName":"INFORMATION_SCHEMA","objectName":"DATABASES","serializedDefaultColumns":"DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,RETENTION_TIME,TYPE,OWNER_ROLE_TYPE"},{"domain":"VIEW","isShowCommand":false,"dbName":"SNOWFLAKE","schemaName":"ACCOUNT_USAGE","objectName":"DATABASES","serializedDefaultColumns":"DATABASE_ID,DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,DELETED,RETENTION_TIME,RESOURCE_GROUP,TYPE,OWNER_ROLE_TYPE,OBJECT_VISIBILITY"},{"isShowCommand":true,"showCommandType":"NOTIFICATION INTEGRATIONS","serializedDefaultColumns":"name,type,category,enabled,comment,created_on"}] |
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$GET_ALL_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_all_references.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_ALL_REFERENCES

Iterates through all associations for a reference and returns information about the associations.

## Syntax

```sqlsyntax
SYSTEM$GET_ALL_REFERENCES('<reference_name>', [, <include_details> = True | False ])
```

## Arguments

**Required**

`'reference_name'`
:   The name of the reference.

`include_details = True | False`
:   Determines the type of information returned by the function.
    For more information, see Returns.

## Returns

* If the `include_details` parameter is set to `True`, returns a
  VARCHAR containing a JSON object that contains an array of the following name/value pairs:

  ```json
  {
    "alias": "<value>",
    "database": "<value>",
    "schema": "<value>",
    "name": "<value>"
  }
  ```

  Where:

  > + alias: The system-generated alias for the reference.
  > + database: The parent database name of the consumer object, if the object resides in a
  >   database. Otherwise, null.
  > + schema: The parent schema of the consumer object, if the object resides in a schema.
  >   Otherwise, null.
  > + name: The name of the consumer object.
* If the `include_details` parameter is set to `False`, returns an array of
  system-generated aliases:

  + If the reference is not associated with an object, returns an empty list.
  + If the reference, is associated with an object, returns all associations for a multi-valued
    references.
  + If the reference is a single-valued reference, returns 0.

---
title: SYSTEM$GET_AWS_SNS_IAM_POLICY
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_aws_sns_iam_policy.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_AWS_SNS_IAM_POLICY

Returns an AWS IAM policy statement that must be added to the Amazon SNS topic policy in order to grant the Amazon SQS messaging queue created by Snowflake to subscribe to the topic.

This function is used when automating Snowpipe using SQS notifications for S3 events. To avoid conflicts with existing SQS queues for the same *endpoint* (i.e. S3 bucket), creating an SNS topic for the bucket and subscribing all SQS queues to this topic enables SNS to publish event notifications for the bucket to multiple subscribers.

## Syntax

```sqlsyntax
SYSTEM$GET_AWS_SNS_IAM_POLICY( '<sns_topic_arn>' )
```

## Arguments

`sns_topic_arn`
:   Amazon Resource Name (ARN) of the SNS topic for your S3 bucket. The function returns an IAM policy for Snowflake’s SQS queue to subscribe to this topic.

## Usage notes

* All arguments are strings (i.e. they must be enclosed in single quotes).

## Examples

Return an IAM policy for a specified SNS topic ARN:

> ```sqlexample
> select system$get_aws_sns_iam_policy('arn:aws:sns:us-west-2:001234567890:s3_mybucket');
>
> +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | SYSTEM$GET_AWS_SNS_IAM_POLICY('ARN:AWS:SNS:US-WEST-2:001234567890:S3_MYBUCKET')                                                                                                                                                                   |
> +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | {"Version":"2012-10-17","Statement":[{"Sid":"1","Effect":"Allow","Principal":{"AWS":"arn:aws:iam::123456789001:user/vj4g-a-abcd1234"},"Action":["sns:Subscribe"],"Resource":["arn:aws:sns:us-west-2:001234567890:s3_mybucket"]}]}                 |
> +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> ```

---
title: SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_catalog_linked_database_config.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG

Returns the configuration parameters set on the specified [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md). The output is in JSON format.

## Syntax

```sqlsyntax
SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG('<catalog_linked_database_name>');
```

## Arguments

`catalog_linked_database_name`
:   The name of the catalog-linked database you want to get the configuration for.

    Specify it as a string literal enclosed in single quotes.

## Returns

The function returns a string that contains a JSON object with the database’s configuration parameters.

| Field | Description |
| --- | --- |
| `catalog_integration` | Name of the catalog integration used by the catalog-linked database. |
| `catalog_name` | Name of the catalog namespace in the external catalog. Returns `null` if not specified. |
| `external_volume` | Name of the external volume used for table storage. |
| `sync_interval_seconds` | Interval (in seconds) that Snowflake polls the remote catalog to detect changes. |
| `namespace_mode` | Mode for handling namespaces. Possible values: `FLATTEN_NESTED_NAMESPACE`, `HIERARCHICAL`. |
| `namespace_flatten_delimiter` | Delimiter used when flattening nested namespaces. Only applicable when `namespace_mode` is `FLATTEN_NESTED_NAMESPACE`. |
| `allowed_write_operations` | Types of write operations allowed on the catalog-linked database. Possible values: `NONE`, `ALL`. |
| `catalog_case_sensitivity` | Case sensitivity setting for the catalog. Possible values: `CASE_SENSITIVE`, `CASE_INSENSITIVE`. |
| `is_suspended` | Whether the catalog-linked database synchronization is suspended. Returns `true` if suspended, `false` otherwise. |
| `allowed_namespaces` | List of namespaces that are allowed to be synced. Returns `null` if all namespaces are allowed. |
| `blocked_namespaces` | List of namespaces that are blocked from being synced. Returns `null` if no namespaces are blocked. |

For a sample output, see Examples.

## Access control requirements

A role used to execute this operation must have the MONITOR, USAGE, OWNERSHIP, or ALL privilege.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Get the configuration for a catalog-linked database named `my_db`:

```sqlexample
SELECT SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG('my_db');
```

An example output:

```json
{
  "catalog_integration": "TEST_GET_CLD_CONFIG_EBEC9E22_44BD_4945_A4C3_A402CCBB86AF_CAT",
  "catalog_name": null,
  "external_volume": "EXVOL_GET_CLD_CONFIG",
  "sync_interval_seconds": 600,
  "namespace_mode": "FLATTEN_NESTED_NAMESPACE",
  "namespace_flatten_delimiter": "_",
  "allowed_write_operations": "NONE",
  "catalog_case_sensitivity": "CASE_INSENSITIVE",
  "is_suspended": false,
  "allowed_namespaces": ["'ns1'", "'ns2'"],
  "blocked_namespaces": ["'blocked_ns1'"]
}
```

---
title: SYSTEM$GET_CLASSIFICATION_RESULT
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_classification_result.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CLASSIFICATION_RESULT

Returns the classification result of the specified object.

## Syntax

```sqlsyntax
SELECT SYSTEM$GET_CLASSIFICATION_RESULT( '<object_name>' )
```

## Arguments

`object_name`
:   The name of the table, external table, view, or materialized view containing the columns to be classified. If a database and schema are
    not in use in the current session, the name must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

## Returns

Returns a JSON object in the following format. For example:

```sqljson
{
  "classification_profile_config": {
    "classification_profile_name": "db1.sch.sensitive_data_detection_profile"
  },
  "classification_result": {
    "col1_name": {
      "alternates": [],
      "recommendation": {
        "confidence": "HIGH",
        "coverage": 1,
        "details": [],
        "privacy_category": "QUASI_IDENTIFIER",
        "semantic_category": "DATE_OF_BIRTH",
        "tags": [
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.semantic_category",
            "tag_value": "DATE_OF_BIRTH"
          },
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.privacy_category",
            "tag_value": "QUASI_IDENTIFIER"
          }
        ]
      },
      "valid_value_ratio": 1
    }
  }
}
```

**Possible fields**:

`classification_profile_config`
:   If automatic classification is configured, contains the fully qualified name of the configuration profile that was used to generate the
    classification results.

`classification_result`
:   Provides details about each column that was classified.

`object_path_results`
:   When a column contains semi-structured data with sensitive fields, the `object_path_results` key lists the fields that were
    classified into a native or custom semantic category. For more information, see [View classification results for JSON columns](../../user-guide/classify-results.md).

`alternates`
:   Provides information about each tag and value to consider other than the recommended tag.

`recommendation`
:   Provides information about each tag and value as the primary choice based on the classification process.

These values can appear in both the alternates and recommendation:

> `classifier_name`
> :   The fully-qualified name of the custom classification instance that was used to tag the classified column.
>
>     This field only appears when using a custom classification instance as the source of the tag to set on a column.
>
> `confidence`
> :   Provides one of the following values: `HIGH`, `MEDIUM`, or `LOW`. This value indicates the relative confidence that Snowflake
>     has based upon the column sampling process and how the column data aligns with how Snowflake classifies data.
>
> `coverage`
> :   Provides the percent of sampled cell values that match the rules for a particular category.
>
> `details`
> :   Provides fields and values related to geography-specific classification. The `semantic_category` field contains the
>     [semantic subcategory](../../user-guide/classify-native.md) for a locale.
>
> `privacy_category`
> :   Provides the privacy category.
>
>     The possible values are `IDENTIFIER`, `QUASI-IDENTIFIER` and `SENSITIVE`.
>
> `semantic_category`
> :   Provides the semantic category. For a list of native semantic categories, see [Native semantic categories of sensitive data classification](../../user-guide/classify-native.md).
>
>     If the value is `MULTIPLE`, then sensitive data was found in semi-structured data. Inspect the `object_path_results` field
>     of the results object for a detailed breakdown of which native and custom semantic categories were found during classification. For more information, see [View classification results for JSON columns](../../user-guide/classify-results.md).
>
> `tags`
> :   Provides information about the tags that were applied to the column as a result of the classification process.
>
> `valid_value_ratio`
> :   Provides the ratio of how many values in the sample size are valid.
>
>     * For structured data, invalid values include NULL, an empty string, and a string with more than 256 characters.
>     * For semi-structured data, invalid values include NULL and an empty string.

## Examples

Return the sensitive data classification result for a table:

> ```sqlexample
> SELECT SYSTEM$GET_CLASSIFICATION_RESULT('hr.tables.empl_info');
> ```

---
title: SYSTEM$GET_CMK_AKV_CONSENT_URL
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_akv_consent_url.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_AKV_CONSENT_URL

Returns a consent URL to the Azure Key Vault account related to customer-managed keys.

You can use the consent URL for your customer-managed keys with [Tri-Secret Secure](../../user-guide/security-encryption-tss.md) for Snowflake accounts on Microsoft Azure.

See also:
:   [Customer-managed keys](../../user-guide/security-encryption-manage.md)

## Syntax

> ```sqlsyntax
> SYSTEM$GET_CMK_AKV_CONSENT_URL( '<account_identifier>' , '<tenant_id>' )
> ```

## Arguments

`'account_identifier'`
:   Specifies the [account identifier](../../user-guide/admin-account-identifier.md) for your Snowflake account on Azure.

    Required.

`'tenant_id'`
:   Specifies the unique identifier for the [tenant](https://docs.microsoft.com/en-us/azure/key-vault/general/basic-concepts) in your Azure
    subscription. This value is in the GUID/UUID format, such as `b3ddabe4-e5ed-4e71-8827-0cefb99af240`.

    Required.

    To locate this value, follow the instructions in [How to find your Azure Active Directory tenant ID](https://docs.microsoft.com/en-us/azure/active-directory/fundamentals/active-directory-how-to-find-tenant).

## Usage notes

* This function is for use in Snowflake accounts on Microsoft Azure only.
* Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role that is granted the global MONITOR SECURITY privilege can
  call this function.

## Examples

Return the consent URL to the Azure Key Vault account related to customer-managed keys, where `my-account`
is the Snowflake account identifier in the [account name format](../../user-guide/admin-account-identifier.md) for your Snowflake account on Azure and
`b3ddabe4-e5ed-4e71-8827-0cefb99af240` is the tenant identifier for your Azure subscription:

> ```sqlexample
> SELECT SYSTEM$GET_CMK_AKV_CONSENT_URL('my-account' , 'b3ddabe4-e5ed-4e71-8827-0cefb99af240');
> ```
>
> Returns:
>
> ```output
> https://login.microsoftonline.com/tenantId/oauth2/authorize?client_id=myClientId&response_type=code
> ```

---
title: SYSTEM$GET_CMK_CONFIG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_config.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_CONFIG

Returns configuration information for use with customer-managed keys (CMKs) and Tri-Secret Secure.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

Amazon Web Services and Google Cloud Platform:

```sqlsyntax
SYSTEM$GET_CMK_CONFIG()
```

Microsoft Azure:

```sqlsyntax
SYSTEM$GET_CMK_CONFIG( '<tenant_id>' )
```

## Arguments

`tenant_id`
:   Specifies the unique identifier for the Azure Key Vault
    [tenant](https://docs.microsoft.com/en-us/azure/key-vault/general/basic-concepts) in your Microsoft Azure subscription.

    This value is in the GUID format, such as `b3ddabe4-e5ed-4e71-8827-0cefb99af240`. You can find this value by logging into the Portal
    and navigating to Key Vault » Overview. Select the Directory ID value.

## Returns

The output depends on the cloud platform that hosts your Snowflake account:

* For Amazon Web Services, a snippet of the statement identifier (`Sid`) for the CMK policy:

  ```output
  {"Sid": "Allow use of the key by Snowflake","Effect": "Allow","Principal": {"AWS": "my-arn:name/TRISECRETTEST"},"Action": ["kms:Decrypt","kms:GenerateDataKeyWithoutPlaintext"],"Resource": "arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59"},
  ```
* For Microsoft Azure, a consent URL and the name of the Snowflake service principal:

  ```output
  Consent url is: https://login.microsoftonline.com/tenantId/oauth2/authorize?client_id=c03edcfb-19f9-435f-92fa-e8ec9e24f40e&response_type=code and Snowflake Service Principal name is: trisec_cmk_azure"
  ```
* For Google Cloud Platform, a gcloud command:

  ```output
  gcloud kms keys add-iam-policy-binding TriSecretGCPKey --project my-env --location us-west1 --keyring TriSecretTest --member serviceAccount:site-trisecret@my-env.iam.serviceaccount.com --role roles/cloudkms.cryptoKeyEncrypterDecrypter
  ```

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege can call this function.

## Examples

Obtain the configuration information for the CMK for your Snowflake account on Microsoft Azure:

> ```sqlexample
> SELECT SYSTEM$GET_CMK_CONFIG('b3ddabe4-e5ed-4e71-8827-0cefb99af240');
> ```

---
title: SYSTEM$GET_CMK_CONFIG_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_config_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_CONFIG_POSTGRES

Returns configuration information for use with customer-managed keys (CMKs) and Snowflake Postgres Tri-Secret Secure.

## Syntax

Amazon Web Services:

```sqlsyntax
SYSTEM$GET_CMK_CONFIG_POSTGRES()
```

Microsoft Azure:

```sqlsyntax
SYSTEM$GET_CMK_CONFIG_POSTGRES( '<tenant_id>' )
```

## Arguments

`'tenant_id'`
:   Specifies the unique identifier for the Azure Key Vault
    [tenant](https://docs.microsoft.com/en-us/azure/key-vault/general/basic-concepts) in your Microsoft Azure subscription.

    This value is in the GUID format, such as `b3ddabe4-e5ed-4e71-8827-0cefb99af240`. You can find this value by logging into the Portal
    and navigating to Key Vault » Overview. Select the Directory ID value.

## Returns

The output depends on the cloud platform that hosts your Snowflake account:

* For Amazon Web Services, a snippet of the statement identifier (`Sid`) for the CMK policy:

  ```output
  {"Sid": "Allow use of the key by Snowflake","Effect": "Allow","Principal": {"AWS": "my-arn:name/TRISECRETTEST"},"Action": ["kms:Decrypt","kms:GenerateDataKeyWithoutPlaintext"],"Resource": "arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59"},
  ```
* For Microsoft Azure, use the Azure CLI to create service principals in your tenant for Snowflake multi-tenant apps that need to access the CMK:

  ```output
  az ad sp create --id appId1
  az ad sp create --id appId2
  ```

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege on the account can call this function.

## Examples

Obtain the configuration information for the CMK for your Snowflake account on Microsoft Azure:

```sqlexample
SELECT SYSTEM$GET_CMK_CONFIG_POSTGRES('b3ddabe4-e5ed-4e71-8827-0cefb99af240');
```

---
title: SYSTEM$GET_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_INFO

Returns the status of your customer-managed key (CMK) for use with Tri-Secret Secure.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$GET_CMK_INFO( [ '<ssa_account_name>' ] )
```

## Arguments

**Required:**

None.

**Optional:**

`ssa_account_name`
:   A string that specifies the name of SSA account name for which you want to retrieve the CMK status.

## Returns

Returns a status message indicating the state of your CMK. The output includes the values that you specified when calling
[SYSTEM$REGISTER_CMK_INFO](system_register_cmk_info.md). If you have enabled private connectivity, the status message returned by SYSTEM$GET_CMK_INFO includes
whether your CMK is privately connected.

The following messages are possible, using CMKs on Amazon Web Services as a representative example:

* Your CMK is registered, but not yet enabled, to use Tri-Secret Secure:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is pre-registered for Tri-Secret Secure.
  ```
* Your CMK is activated and in use with Tri-Secret Secure:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated for Tri-Secret Secure.
  ```
* You have an active CMK, but you just pre-registered a new key:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated with Tri-Secret Secure, but
  CMK with ARN: arn:aws:kms:us-west-2:481048248138:key/e08cb6c0-7c09-4f37-8e55-e395a12fe965
  is pre-registered for Tri-Secret Secure.
  ```
* You have an active key, but have not registered any CMK to use Tri-Secret Secure:

  ```output
  CMK info has not been pre-registered in this account yet, but
  CMK arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated with Tri-Secret Secure.
  ```
* You have not registered any CMK to use Tri-Secret Secure:

  ```output
  CMK info has not been pre-registered in this account yet.
  ```
* Your active CMK is registered with private connectivity *enabled*.

  ```output
  CMK with ARN: arn:aws:kms:us-east-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
  with PrivateLink enabled is activated for Tri-Secret Secure.
  ```
* Your active CMK is registered with private connectivity *not enabled*.

  ```output
  CMK with ARN: arn:aws:kms:us-east-2:111122223333:key/1234abcd-12ab-34cd-56ef-1234567890ab
  is activated for Tri-Secret Secure.
  ```

## Access control requirements

* Only users with the ACCOUNTADMIN role or with a role that is granted the MONITOR SECURITY privilege can call this
  function.
* Only users with the GLOBALORGADMIN role or ORGADMIN role can specify an SSA account name.

## Examples

Obtain the status of the CMK for your Snowflake account:

> ```sqlexample
> SELECT SYSTEM$GET_CMK_INFO();
> ```

Obtain the status of the CMK for a specific SSA account:

> ```sqlexample
> SELECT SYSTEM$GET_CMK_INFO('AUTO_FULFILLMENT_AREA$PUBLIC_AZURE_EASTUS2');
> ```

---
title: SYSTEM$GET_CMK_INFO_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_info_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_INFO_POSTGRES

Returns the status of your customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure. Information is returned only for currently
registered and activated keys.

## Syntax

```sqlsyntax
SYSTEM$GET_CMK_INFO_POSTGRES()
```

## Returns

Returns a status message indicating the state of your CMK. The output includes the values that you specified when calling
[SYSTEM$REGISTER_CMK_INFO_POSTGRES](system_register_cmk_info_postgres.md).

The following messages are possible, using CMKs on Amazon Web Services as a representative example:

* Your CMK is registered, but not yet enabled, to use Snowflake Postgres Tri-Secret Secure:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is pre-registered for Tri-Secret Secure.
  ```
* Your CMK is activated and in use with Snowflake Postgres Tri-Secret Secure:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated for Tri-Secret Secure.
  ```
* You have an active CMK, but you just pre-registered a new key:

  ```output
  CMK with ARN: arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated with Tri-Secret Secure, but
  CMK with ARN: arn:aws:kms:us-west-2:481048248138:key/e08cb6c0-7c09-4f37-8e55-e395a12fe965
  is pre-registered for Tri-Secret Secure.
  ```
* You have an active key, but have not registered any CMK to use Snowflake Postgres Tri-Secret Secure:

  ```output
  CMK info has not been pre-registered in this account yet, but
  CMK arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59
  is activated with Tri-Secret Secure.
  ```
* You have not registered any CMK to use Snowflake Postgres Tri-Secret Secure:

  ```output
  CMK info has not been pre-registered in this account yet.
  ```

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege on the account
can call this function.

## Examples

Obtain the status CMK for your Snowflake account:

```sqlexample
SELECT SYSTEM$GET_CMK_INFO_POSTGRES();
```

---
title: SYSTEM$GET_CMK_KMS_KEY_POLICY
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_cmk_kms_key_policy.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_CMK_KMS_KEY_POLICY

Returns an ARRAY containing a snippet of the AWS Key Management Service policy information related to customer-managed keys.

You can use this policy information for your customer-managed keys with [Tri-Secret Secure](../../user-guide/security-encryption-tss.md) for Snowflake accounts on Amazon Web
Services.

See also:
:   [Tri-Secret Secure in Snowflake](../../user-guide/security-encryption-tss.md)

## Syntax

> ```sqlsyntax
> SYSTEM$GET_CMK_KMS_KEY_POLICY()
> ```

## Arguments

None.

## Usage notes

* This function is for use in Snowflake accounts on Amazon Web Services only.
* Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege can call
  this function.

## Examples

Return a snippet of the AWS KMS policy for use with customer-managed keys:

> ```sqlexample
> SELECT SYSTEM$GET_CMK_KMS_KEY_POLICY();
> ```

---
title: SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_compute_pool_pending_maintenance.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE

Retrieves information about pending Snowflake [maintenance actions for compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) in the current account.

See also:
:   [Snowpark Container Services: Working with compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md)

## Syntax

```sqlsyntax
SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE()
```

## Returns

* Returns a JSON object that provides an indication of whether maintenance is required and the upcoming maintenance window timeline. The JSON fields are:

  + `maintenanceRequired`. Boolean field that provides an indication of whether maintenance is required.
  + `start`. Start time of the maintenance window.
  + `end`. End time of the maintenance window.
* If there are no running compute pools in the Snowflake account, the function returns “No running Snowpark Container Services found.”
* If there is no scheduled maintenance window, the function returns “No pending maintenance actions.”

## Usage notes

* All roles have privilege to access this function.

## Examples

```sqlexample
SELECT SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE();
```

Sample output:

```output
+---------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE()                                                           |
|---------------------------------------------------------------------------------------------------------|
| {"maintenanceRequired":false,"maintenanceWindow":{"start":"2025-02-27T23:00","end":"2025-02-28T00:00"}} |
+---------------------------------------------------------------------------------------------------------+
```

This output indicates that no maintenance is scheduled for the next maintenance window. If maintenance is required, `maintenanceRequired` is set to true.

---
title: SYSTEM$GET_DBT_LOG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_dbt_log.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_DBT_LOG

Returns logs for the specified run for a dbt Projects on Snowflake.

Use this function with the [DBT_PROJECT_EXECUTION_HISTORY](dbt_project_execution_history.md) function to access dbt artifacts and logs programmatically.

## Syntax

```sqlsyntax
SYSTEM$GET_DBT_LOG ( '<query_id>' )
```

## Arguments

`query_id`
:   Query ID of the run that you want logs for.

## Returns

The function returns the last 1,000 lines of the `dbt.log` file. For full logs, download the archive ZIP file.

For more information and examples, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

## Access control requirements

This function includes only runs from workspaces and dbt Projects in which you have the following privileges:

* OWNERSHIP, READ, or WRITE on workspaces
* OWNERSHIP, USAGE, or MONITOR on dbt Projects

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This system function works only on dbt project objects; it isn’t available for workspaces.
* Query IDs generated from CREATE DBT PROJECT or ALTER DBT PROJECT … ADD VERSION aren’t supported for this system function.
* Direct querying of file content (for example, [Query examples](../../user-guide/querying-stage.md)) isn’t supported.
* If `query_id` is NULL or not a dbt execution, you’ll get an error.
* dbt project results are available for up to 14 days.
* Logs might be unavailable if a run times out, is canceled, or fails before files are uploaded. In such cases, runs appear as `UNHANDLED ERROR` in dbt history, and these entries might not include logs.
* You can’t use this function to get logs for runs that are in progress because the logs file is only available after the run in complete.

## Examples

The following example looks up the most recent dbt Project execution for `MY_DBT_PROJECT` using DBT_PROJECT_EXECUTION_HISTORY and then fetches the dbt run logs for that execution using
SYSTEM$GET_DBT_LOG, so you can inspect what happened during the run.

```sqlexample
--Look up the most recent dbt Project execution
SET latest_query_id = (SELECT query_id
  FROM TABLE(INFORMATION_SCHEMA.DBT_PROJECT_EXECUTION_HISTORY())
  WHERE OBJECT_NAME = 'MY_DBT_PROJECT'
  ORDER BY query_end_time DESC LIMIT 1);

--Get the dbt run logs for the most recent dbt Project execution
SELECT SYSTEM$GET_DBT_LOG($latest_query_id);
```

```output
============================== 15:14:53.100781 | 46d19186-61b8-4442-8339-53c771083f16 ==============================
[0m15:14:53.100781 [info ] [Dummy-1   ]: Running with dbt=1.9.4
...
[0m15:14:58.198545 [debug] [Dummy-1   ]: Command `cli run` succeeded at 15:14:58.198121 after 5.19 seconds
```

For more information, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

---
title: SYSTEM$GET_DEBUG_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_debug_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_DEBUG_STATUS

Returns the [session debug mode](../../developer-guide/native-apps/installing-testing-application.md) status of the current session.

## Syntax

```sqlsyntax
SYSTEM$GET_DEBUG_STATUS()
```

---
title: SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_default_columns_override_for_show_command.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND

Returns the list of columns that were set by a previous call to
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md).

For more information, see [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_unset_default_columns_override_for_show_command.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  '<object_type>'
)
```

## Arguments

`'object_type'`
:   Type of object for the SHOW command. For example, for the SHOW TABLES command, specify `'TABLES'`. For the SHOW NOTIFICATION
    INTEGRATIONS command, specify `'NOTIFICATION INTEGRATIONS'`.

## Returns

Returns a VARCHAR value containing a comma-separated list of the columns specified by the previous call to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND. The column names are in lowercase.

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was not called or if
[SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_unset_default_columns_override_for_show_command.md) was called to clear the list of columns, the function returns an
empty string.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Examples

The following example returns the list of columns specified by a previous call to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND for the SHOW TABLES command:

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'TABLES'
);
```

```output
+-------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND( |
|   'TABLES'                                            |
| )                                                     |
|-------------------------------------------------------|
| name,database_name,kind,comment                       |
+-------------------------------------------------------+
```

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was not called or if
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was called to clear the list, the function returns an empty string:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'TABLES'
);

SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'TABLES'
);
```

```output
+-------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND( |
|   'TABLES'                                            |
| )                                                     |
|-------------------------------------------------------|
|                                                       |
+-------------------------------------------------------+
```

---
title: SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_default_columns_override_for_system_object.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT

Returns the list of columns that were set by a previous call to
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md) for the specified Snowflake view (for
example, for a specific [ACCOUNT_USAGE view](../account-usage.md) or
[INFORMATION_SCHEMA view](../info-schema.md)).

For more information, see [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_unset_default_columns_override_for_system_object.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  '<object_type>',
  '<database_name>',
  '<schema_name>',
  '<object_name>'
)
```

## Arguments

`'object_type'`
:   Type of the object. You must specify `'VIEW'` for this argument.

`'database_name'`
:   Name of the database that contains the object. You must specify `'SNOWFLAKE'` or, for INFORMATION_SCHEMA views, an empty
    string.

`'schema_name'`
:   Name of the schema that contains the object. You must specify the name of a schema in the
    [SNOWFLAKE database](../snowflake-db.md) or `'INFORMATION_SCHEMA'`.

`'object_name'`
:   Name of the object.

## Returns

Returns a VARCHAR value containing a comma-separated list of the columns specified by the previous call to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT. The column names are in uppercase.

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was not called or if
[SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_unset_default_columns_override_for_system_object.md) was called to clear the list of columns, the function returns an
empty string.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Usage notes

* You must have a database in use (for example, by running [USE DATABASE](../sql/use-database.md)) in order to call this function.
  If no database is in use, the function call fails.

## Examples

The following example returns the list of columns specified by a previous call to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT for the
[TABLES view in the ACCOUNT_USAGE schema](../account-usage/tables.md):

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'TABLES'
);
```

```output
+--------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT( |
|   'VIEW',                                              |
|   'SNOWFLAKE',                                         |
|   'ACCOUNT_USAGE',                                     |
|   'TABLES'                                             |
| )                                                      |
|--------------------------------------------------------|
| TABLE_NAME,TABLE_SCHEMA,TABLE_TYPE                     |
+--------------------------------------------------------+
```

The following example returns the list of columns specified by a previous call to
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT for the
[TABLES view in the INFORMATION_SCHEMA schema](../info-schema/tables.md):

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'ACCOUNT_USAGE',
  'TABLES'
);
```

```output
+--------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT( |
|   'VIEW',                                              |
|   '',                                                  |
|   'INFORMATION_SCHEMA',                                |
|   'TABLES'                                             |
| )                                                      |
|--------------------------------------------------------|
| TABLE_NAME,TABLE_SCHEMA,TABLE_TYPE                     |
+--------------------------------------------------------+
```

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was not called or if
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was called to clear the list, the function returns an empty string:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'TABLES'
);

SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'TABLES'
);
```

```output
+--------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT( |
|   'VIEW',                                              |
|   'SNOWFLAKE',                                         |
|   'ACCOUNT_USAGE',                                     |
|   'TABLES'                                             |
| )                                                      |
|--------------------------------------------------------|
```

---
title: SYSTEM$GET_DIRECTORY_TABLE_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_directory_table_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_DIRECTORY_TABLE_STATUS

Returns a list of records that contain the [directory table](../../user-guide/data-load-dirtables.md) consistency status for
stages in your account. Consistency status indicates whether a directory table on a replicated stage has information about (is consistent with) all of the replicated files on the stage.

See also:
:   [Stage, pipe, and load history replication](../../user-guide/account-replication-stages-pipes-load-history.md) , [Directory tables](../../user-guide/data-load-dirtables.md)

## Syntax

```sqlsyntax
SYSTEM$GET_DIRECTORY_TABLE_STATUS( [ '<stage_name>' ] )
```

## Arguments

**Optional:**

`'stage_name'`
:   Stage for which you want to retrieve the directory table consistency status. When you specify a stage name, the function returns a list
    with a single record for the directory table on that stage.

## Returns

Returns a list of directory table consistency records for each stage in your account. The list contains a maximum of 10,000 records.
If you specify a `'stage_name'` argument, the function returns a list with a single record for the directory table on that stage.

The records are in JSON format and contain the following name/value pairs:

```Output
{
  "stage" : "STAGE1",
  "status" : "INCONSISTENT"
}
```

Where:

> `stage`
> :   The stage on which the directory table is enabled.
>
> `status`
> :   Consistency status for the directory table. `CONSISTENT` if the directory table is fully consistent with the replicated content
>     on the stage; `INCONSISTENT` otherwise. A status of `INCONSISTENT` means that Snowflake cannot verify consistency,
>     and that the directory table might be missing information about some files that exist on the stage.

## Usage notes

* To call this function, you must use a role that is granted or inherits the READ privilege on the stage(s) for which you want to
  retrieve consistency status.
* To update the consistency status from `INCONSISTENT` to `CONSISTENT`, perform a full refresh using the
  [ALTER STAGE … REFRESH](../sql/alter-stage.md) command.

## Examples

The following example retrieves a list of consistency status records for the stages in the account:

> ```sqlexample
> SELECT SYSTEM$GET_DIRECTORY_TABLE_STATUS();
> ```
>
> Output:
>
> ```output
> [
>   {
>     "stage" : "STAGE1",
>     "status" : "CONSISTENT"
>   },
>   {
>     "stage" : "STAGE2",
>     "status" : "INCONSISTENT"
>   }
> ]
> ```

The following example retrieves a consistency status record for a stage named `stage1`:

> ```sqlexample
> SELECT SYSTEM$GET_DIRECTORY_TABLE_STATUS('stage1');
> ```
>
> Output:
>
> ```output
> [
>   {
>     "stage" : "STAGE1",
>     "status" : "CONSISTENT"
>   }
> ]
> ```

---
title: SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_gcp_kms_cmk_grant_access_cmd.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD

Returns a Google Cloud gcloud command to obtain policy information for the Google Cloud Key Management Service for use with
customer-managed keys.

You can use the policy information for your customer-managed keys with [Tri-Secret Secure](../../user-guide/security-encryption-tss.md) for Snowflake accounts on Google Cloud
Platform.

See also:
:   [Understanding Encryption Key Management in Snowflake](../../user-guide/security-encryption-manage.md)

## Syntax

> ```sqlsyntax
> SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD()
> ```

## Arguments

None.

## Usage notes

* This function is for use in Snowflake accounts on Google Cloud Platform only.
* Only account administrators (i.e. users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege can call
  this function.

## Examples

Return the gcloud command to obtain GCP KMS policy information for customer-managed keys. Note that the example
gcloud command does not contain real option values, some of which are replaced by angle brackets.

For details about the gcloud command, see the documentation for [gcloud kms](https://cloud.google.com/sdk/gcloud/reference/kms).

> ```sqlexample
> select SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD();
> ```

Returns:

> ```bash
> gcloud kms keys add-iam-policy-binding <key-name> --project <project-id> --location <location> --keyring <key-ring> --member serviceAccount:<service-account-email> --role roles/cloudkms.cryptoKeyEncrypterDecrypter
> ```

---
title: SYSTEM$GET_HASH_FOR_APPLICATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_hash_for_application.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_HASH_FOR_APPLICATION

Returns the hash value for a Snowflake Native App or query ID.

## Syntax

```sqlsyntax
SYSTEM$GET_HASH_FOR_APPLICATION( '<app_name>' [ , '<query_id>' ] )
```

## Arguments

**Required**

`'app_name'`
:   The name of the app whose hash value you want to return.

**Optional:**

`'query_id'`
:   The query ID whose hash value you want to return.

## Returns

Returns a signed 64-bit hash value. If a query ID is passed as
an argument to this function, this function returns the hash value of the query
ID. Otherwise, it returns the hash value for the app.

## Examples

The following example returns the hash value for the app ‘hello_snowflake_app’:

```sqlsyntax
SELECT SYSTEM$GET_HASH_FOR_APPLICATION('hello_snowflake_app');
```

```output
+--------------------------------------------------------+
| SYSTEM$GET_HASH_FOR_APPLICATION('HELLO_SNOWFLAKE_APP') |
|--------------------------------------------------------|
| a1b2c3d4e5fg+1234567890+1234
+--------------------------------------------------------+
```

The following example returns the hash value for a query id associated with the app ‘hello_snowflake_app’:

```sqlsyntax
SELECT SYSTEM$GET_HASH_FOR_APPLICATION('hello_snowflake_app', 'abcd1234-12345-WXYZ-0000-0987654321');
```

```output
+------------------------------------------------------------------------------------------------+
| SYSTEM$GET_HASH_FOR_APPLICATION('HELLO_SNOWFLAKE_APP', '<app_id>') |
|------------------------------------------------------------------------------------------------|
| a1b2c3d4e5fg+1234567890+1234                                                                   |
+------------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$GET_ICEBERG_TABLE_INFORMATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_iceberg_table_information.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_ICEBERG_TABLE_INFORMATION

Returns the location of the root metadata file and status of the latest snapshot for an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md).

The SYSTEM$GET_ICEBERG_TABLE_INFORMATION function works differently according to table type:

* For Snowflake-managed Iceberg tables or Delta-based tables, calling the function generates metadata for data manipulation language (DML)
  operations or other table updates that have occurred since Snowflake last generated metadata for the table.

  If there are no updates, the function returns the location of the latest metadata file,
  but does not generate new metadata.
* For other externally managed Iceberg tables, the function returns information about the latest refreshed snapshot.

## Syntax

```sqlsyntax
SYSTEM$GET_ICEBERG_TABLE_INFORMATION('<iceberg_table_name>')
```

## Arguments

`'iceberg_table_name'`
:   The name of the Iceberg table for which you want to retrieve information. The table name is a string, so it must be enclosed in single
    quotes.

    * If the Iceberg table name is fully qualified, such as `'<db>.<schema>.<iceberg_table_name>'`,
      the entire name must be enclosed in single quotes, including the database and schema.
    * If the Iceberg table name is case-sensitive or includes any special characters or spaces,
      double quotes are required to process the case/characters.
      The double quotes must be enclosed within the single quotes, for example, `'"<case_sensitive_iceberg_table_name>"'`.

## Returns

The function returns a JSON object containing the following name/value pairs:

> {“metadataLocation”:”<value>”,”status”:”<value>”}

Where:

> `metadataLocation`
> :   Location of the root metadata file updated or retrieved by the function.
>
> `status`
> :   Status of the operation. This field returns a success or failure message.

## Usage notes

* Calling this function requires a role that has the OWNERSHIP privilege on the Iceberg table.

## Examples

Generate a snapshot for the Iceberg table `it1` in the schema `db1.schema1`:

```sqlexample
SELECT SYSTEM$GET_ICEBERG_TABLE_INFORMATION('db1.schema1.it1');
```

Output:

```output
+-----------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_ICEBERG_TABLE_INFORMATION('DB1.SCHEMA1.IT1')                                                   |
|-----------------------------------------------------------------------------------------------------------|
| {"metadataLocation":"s3://mybucket/metadata/v1.metadata.json","status":"success"}                         |
+-----------------------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_instance_family_placement_groups.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS

Returns the list of placement groups supported for the specified
[instance family](../../developer-guide/snowpark-container-services/working-with-compute-pool.md)
for [Snowpark Container Services compute pool nodes](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

## Syntax

```sqlsyntax
SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS( '<instance_family>' )
```

## Arguments

`'instance_family'`
:   Instance family.

## Returns

Returns a VARCHAR value that contains the supported placement groups
formatted as a JSON array.

## Usage notes

* The returned list of placement group names is specific to your Snowflake account and the specified
  instance family. For more information, see [Compute pool placement](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).
* Results don’t guarantee capacity. You might still run into insufficient capacity errors in a placement
  group even if an instance family is supported there.

## Examples

The following function returns the supported placement groups for the `GPU_NV_L` instance family:

```sqlexample
SELECT SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS('GPU_NV_L');
```

Example output:

```output
+--------------------------------------------------------------+
| SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS('GPU_NV_L')      |
|--------------------------------------------------------------|
| ["A","B","C","D"]                                            |
+--------------------------------------------------------------+
```

The `GPU_NV_L` instance family is available in the following placement
groups: `A`, `B`, `C` and `D`.

---
title: SYSTEM$GET_LOGIN_FAILURE_DETAILS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_login_failure_details.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_LOGIN_FAILURE_DETAILS

Returns a JSON object that represents an unsuccessful login attempt associated with External OAuth, SAML, or key pair authentication. The
JSON object contains the error associated with the failed login attempt.

## Syntax

```sqlsyntax
SYSTEM$GET_LOGIN_FAILURE_DETAILS('<uuid>')
```

## Arguments

`uuid`
:   A string representing a UUID. The UUID appears after the error message that is returned from a failed login event associated with External
    OAuth, SAML, or key pair authentication.

## Returns

Returns the following elements in a JSON object:

| Key | Data Type | Value Description |
| --- | --- | --- |
| clientIP | STRING | The IP address from where the failed login request originated. For example, `"10.211.55.1"`. |
| clientType | STRING | The client software reported by the client. For example, `"JDBC_DRIVER"`. This value is not verified. If the client does not report this value, then this value is `"OTHER"`. |
| clientVersion | STRING | The version of the client software reported by the client. For example, `"2.9.0"`. This value is not verified. If the client does not report this value, the this value is `null`. |
| username | STRING | The username associated with the failed login event. If the system cannot find the username, or the error occurred before the system found the username, then this value is `null`. |
| errorCode | STRING | The error associated with the failed login event. For a description of the error, refer to External OAuth errors, SAML errors, or JWT token errors. If the error is OVERFLOW_FAILURE_EVENTS_ELIDED, then the number of failed login attempts is too high. |
| timestamp | NUMBER | The date and time, in Unix timestamp format, when the failed login event occurred. |

## Usage notes

Only administrators that have a MONITOR privilege assigned to their role can use this function.

## Error descriptions

This section provides descriptions for errors returned by the SYSTEM$GET_LOGIN_FAILURE_DETAILS function.

### External OAuth errors

| Error | Description |
| --- | --- |
| EXTERNAL_OAUTH_INVALID_SIGNATURE | Invalid signature algorithm or issue validating signature. |
| EXTERNAL_OAUTH_MISSING_ISSUER | Cannot extract issuer (an `iss` claim) from the access token. |
| EXTERNAL_OAUTH_JWS_INVALID_TYPE | Invalid type of access token. |
| EXTERNAL_OAUTH_JWS_INVALID_FORMAT | Malformed access token. |
| EXTERNAL_OAUTH_ACCESS_TOKEN_ISSUER_NOT_FOUND | Cannot find security integration associated with the issuer. |
| EXTERNAL_OAUTH_ACCESS_TOKEN_EXPIRED | Access token expired. |
| EXTERNAL_OAUTH_MISSING_AUDIENCE | Cannot extract audience (an `aud` claim) from the access token. |
| EXTERNAL_OAUTH_AUDIENCE_VALIDATION_FAILED | Audience of the access token does not match any of the audiences defined in the security integration. |
| EXTERNAL_OAUTH_ACCESS_TOKEN_ISSUER_NOT_ENABLED | Security integration is disabled. |
| EXTERNAL_OAUTH_JWS_CANT_RETRIEVE_PUBLIC_KEY | Cannot retrieve the public key from the authorization server to validate the access token. |
| EXTERNAL_OAUTH_USER_CLAIM_MISSING | Cannot extract user mapping claim from the access token. |
| EXTERNAL_OAUTH_ACCESS_TOKEN_NOT_YET_VALID | Token is not valid yet. A timestamp with a `iat` or `nbf` claim indicates the token is valid in the future. |

### SAML errors

| Error Code | Error | Description |
| --- | --- | --- |
| 390133 | SAML_RESPONSE_INVALID | The SAML response was invalid for an unspecified reason, although it is most likely malformed (this is also used if there is an error on parsing). |
| 390165 | SAML_RESPONSE_INVALID_SIGNATURE | The SAML response contains an invalid Signature. |
| 390166 | SAML_RESPONSE_INVALID_DIGEST_METHOD | The SAML response contains an invalid “DigestMethod” attribute or omits it entirely. |
| 390167 | SAML_RESPONSE_INVALID_SIGNATURE_METHOD | The SAML response contains an invalid “SignatureMethod” or omits it entirely. |
| 390168 | SAML_RESPONSE_INVALID_DESTINATION | The “Destination” attribute in the SAML response does not match a valid destination URL on the account. |
| 390169 | SAML_RESPONSE_INVALID_AUDIENCE | The SAML response does not contain exactly one audience or the audience URL does not match what we expect the audience URL to be. |
| 390170 | SAML_RESPONSE_INVALID_MISSING_INRESPONSETO | The “InResponseTo” attribute in the SAML assertion is missing. |
| 390171 | SAML_RESPONSE_INVALID_RECIPIENT_MISMATCH | The “Recipient” attribute does not match a valid destination URL. |
| 390172 | SAML_RESPONSE_INVALID_NOTONORAFTER_VALIDATION | This typically indicates that the time in which the SAML assertion is valid has expired. |
| 390173 | SAML_RESPONSE_INVALID_NOTBEFORE_VALIDATION | This typically indicates that the time in which the SAML assertion is valid has not yet come. |
| 390174 | SAML_RESPONSE_INVALID_USERNAMES_MISMATCH | The login names do not match during re-authentication. |
| 390175 | SAML_RESPONSE_INVALID_SESSIONID_MISSING | During re-authentication, we were unable to find a session corresponding to the user. |
| 390176 | SAML_RESPONSE_INVALID_ACCOUNTS_MISMATCH | During re-authentication, the names of the accounts were found to not match. |
| 390177 | SAML_RESPONSE_INVALID_BAD_CERT | The x.509 certificate contained in the SAML response is either malformed or does not match the expected certificate. |
| 390178 | SAML_RESPONSE_INVALID_PROOF_KEY_MISMATCH | The proof keys do not match with respect to the authentication request ID. |
| 390179 | SAML_RESPONSE_INVALID_INTEGRATION_MISCONFIGURATION | The SAML IdP configuration is invalid. |
| 390180 | SAML_RESPONSE_INVALID_REQUEST_PAYLOAD | During authentication, using an invalid payload or using an invalid federated OAuth connection string. |
| 390181 | SAML_RESPONSE_INVALID_MISSING_SUBJECT_CONFIRMATION_BEARER | The Subject confirmation with Bearer method is missing and cannot be validated. |
| 390182 | SAML_RESPONSE_INVALID_MISSING_SUBJECT_CONFIRMATION_DATA | The Subject confirmation data is missing in the assertion. |
| 390183 | SAML_RESPONSE_INVALID_CONDITIONS | The SAML assertion is not valid for a reason that is different than the preceding conditions in this table. |
| 390184 | SAML_RESPONSE_INVALID_ISSUER | The SAML Response contained an issuer/entityID value different from the one configured in the SAML IDP Configuration. |

### JWT token errors

The following errors are associated with the JWT token used for [key pair authentication](../../user-guide/key-pair-auth.md).

| Error Code | Error | Description |
| --- | --- | --- |
| 394307 | JWT_TOKEN_ACCOUNT_MISMATCH | The Snowflake account obtained from the token is not the same as the account in the request’s URL. |
| 390144 | JWT_TOKEN_INVALID | There is a general issue with the JWT token. For possible solutions, see [Common Errors and Solutions](../../user-guide/key-pair-auth-troubleshooting.md). |
| 394300 | JWT_TOKEN_INVALID_USER_IN_ISSUER | The user name specified in the issuer does not exist in the Snowflake account. For possible solutions, see [Common Errors and Solutions](../../user-guide/key-pair-auth-troubleshooting.md). |
| 394301 | JWT_TOKEN_MISSING_ISSUE_OR_EXPIRATION_TIME | The JWT token does not contain an issue time or an expiration time. |
| 394302 | JWT_TOKEN_INVALID_ISSUE_TIME | The JWT token was received by Snowflake more than 60 seconds after the issue time. For possible solutions, see [Common Errors and Solutions](../../user-guide/key-pair-auth-troubleshooting.md). |
| 394303 | JWT_TOKEN_INVALID_EXPIRATION_TIME | The JWT token is expired. |
| 394304 | JWT_TOKEN_INVALID_PUBLIC_KEY_FINGERPRINT_MISMATCH | There is a mismatch between the public key fingerprint specified in the issuer and the one stored for the user in Snowflake. For possible solutions, see [Common Errors and Solutions](../../user-guide/key-pair-auth-troubleshooting.md). |
| 394305 | JWT_TOKEN_INVALID_ALGORITHM | The JWT token was not signed with the RS256 algorithm. |
| 394306 | JWT_TOKEN_INVALID_SIGNATURE | Snowflake could not verify the signature provided by the JWT token. It is possible that the JWT was signed with a private key that is not paired with the provided public key. It is also possible that the JWT signature is corrupt or has been modified. |

## Examples

The following example teaches you how to use the SYSTEM$GET_LOGIN_FAILURE_DETAILS function with a UUID from a failed login attempt
associated with External OAuth, SAML, or key pair authentication:

1. Find the UUID in the error message:

   > ```output
   > Invalid  OAuth access token. [0ce9eb56-821d-4ca9-a774-04ae89a0cf5a]
   > ```
2. Use the UUID as an argument to the SYSTEM$GET_LOGIN_FAILURE_DETAILS function, and extract the error using the [JSON_EXTRACT_PATH_TEXT](json_extract_path_text.md) function:

   > ```sqlexample
   > SELECT JSON_EXTRACT_PATH_TEXT(SYSTEM$GET_LOGIN_FAILURE_DETAILS('0ce9eb56-821d-4ca9-a774-04ae89a0cf5a'), 'errorCode');
   > ```
3. Find the error description in the External OAuth errors or SAML errors tables.

---
title: SYSTEM$GET_PREDECESSOR_RETURN_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_predecessor_return_value.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_PREDECESSOR_RETURN_VALUE

Retrieves the return value for the predecessor task in a [task graph](../../user-guide/tasks-graphs.md). The return value is explicitly set by the predecessor task by calling the [SYSTEM$SET_RETURN_VALUE](system_set_return_value.md) function.

## Syntax

```sqlsyntax
SYSTEM$GET_PREDECESSOR_RETURN_VALUE('<task_name>')
```

## Arguments

`'task_name'`
:   Identifier for the predecessor task that sets the return value to be retrieved.

    * If the task has multiple predecessor tasks that are enabled, this argument is required.
    * If the task has only one predecessor task that is enabled, the argument is optional.
      If this argument is omitted, the function retrieves the return value for the only enabled predecessor task.
    * If the immediate predecessor task name does not match the requested task name, but an ancestor predecessor does match the task name,
      the return value of the matching ancestor is returned.
    * The task name argument should not include the database name or schema name. All tasks in a graph are required to be within the same schema, so there should be no need to reference a task in a different schema. For example, you should use `MYTASK` as an input to this function, instead of using `MYDATABASE.MYSCHEMA.MYTASK`.

## Usage notes

* Task names are case sensitive.
* When a task name is specified, it must match an enabled predecessor, otherwise the call will fail.

## Examples

See complete examples for this function in [SYSTEM$SET_RETURN_VALUE](system_set_return_value.md).

---
title: SYSTEM$GET_PREVIEW_ACCESS_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_preview_access_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_PREVIEW_ACCESS_STATUS

Determine if access to all preview features is enabled or disabled.

See also:

> [SYSTEM$DISABLE_PREVIEW_ACCESS](system_disable_preview_access.md), [SYSTEM$ENABLE_PREVIEW_ACCESS](system_enable_preview_access.md)

## Syntax

```sqlsyntax
SYSTEM$GET_PREVIEW_ACCESS_STATUS()
```

## Arguments

None.

## Returns

Returns a VARCHAR status message representing whether preview features are enabled or disabled as shown below:

* Enabled:

  ```output
  +--------------------------------------------+
  | SYSTEM$GET_PREVIEW_ACCESS_STATUS()         |
  +--------------------------------------------+
  | Preview access is ENABLED for this account |
  +--------------------------------------------+
  ```
* Disabled:

  ```output
  +---------------------------------------------+
  | SYSTEM$GET_PREVIEW_ACCESS_STATUS()          |
  |---------------------------------------------|
  | Preview access is DISABLED for this account |
  +---------------------------------------------+
  ```

## Access control requirements

The SYSTEM$GET_PREVIEW_ACCESS_STATUS function can be executed by any user in the account and does not require special privileges.

## Examples

Display the current state of preview features.

```sqlexample
SELECT SYSTEM$GET_PREVIEW_ACCESS_STATUS();
```

---
title: SYSTEM$GET_PRIVATELINK
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$GET_PRIVATELINK

Verifies whether your current account is authorized for private connectivity to the Snowflake service.

Returns:
:   Boolean

    `TRUE`: The current Snowflake account is authorized (i.e. enabled) to use private connectivity to the Snowflake service.

    `FALSE`: The current Snowflake account is unauthorized (i.e. disabled) to use private connectivity to the Snowflake service.

See also:
:   [SYSTEM$AUTHORIZE_PRIVATELINK](system_authorize_privatelink.md) , [SYSTEM$REVOKE_PRIVATELINK](system_revoke_privatelink.md) ,
    [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](system_get_privatelink_authorized_endpoints.md)

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$GET_PRIVATELINK( '<aws_id>' , '<federated_token>' )
> ```

**Azure:**

> ```sqlsyntax
> SYSTEM$GET_PRIVATELINK( '<private-endpoint-resource-id>' , '<federated_token>' )
> ```

**GCP**

> ```sqlsyntax
> SYSTEM$GET_PRIVATELINK( '<gcp_project_id>' , '<access_token>' )
> ```

## Arguments

`'aws_id'`
:   The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.

`'private-endpoint-resource-id'`
:   The identifier that uniquely identifies the private endpoint in Microsoft Azure (Azure) as a string.

`'federated_token'`
:   The federated token value that contains access credentials for a federated user as a string.

    To obtain this value, execute the appropriate command for the cloud platform that hosts your Snowflake account. Use the command-line tool
    provided by the platform:

    * For Snowflake on AWS:

      ```bash
      aws sts get-federation-token --name sam
      ```
    * For Snowflake on Azure:

      ```bash
      az account get-access-token --subscription <SubscriptionID>
      ```

      Where:

      + `SubscriptionID`
        :   The unique identifier for your subscription. For example:

            > `13c...`

            To obtain this value, execute the following Azure CLI command in your command-line environment:

            > ```bash
            > az account list --output table
            > ```
            >
            > Note the output value in the `SubscriptionID` column, which is truncated in this example:
            >
            > > ```text
            > > Name     CloudName   SubscriptionId                        State    IsDefault
            > > -------  ----------  ------------------------------------  -------  ----------
            > > MyCloud  AzureCloud  13c...                                Enabled  True
            > > ```

`'gcp_project_id'`
:   The identifier that uniquely identifies your Google Cloud (GCP) project, as a string.

`'access_token'`
:   The access token value that contains access credentials for a Google Cloud user as a string.

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can be used with Snowflake accounts on AWS or Azure; Google Cloud Platform (GCP) is not currently supported.
* Call the [SYSTEM$AUTHORIZE_PRIVATELINK](system_authorize_privatelink.md) function to enable your Snowflake account to use private
  connectivity to the Snowflake service.
* Call the [SYSTEM$REVOKE_PRIVATELINK](system_revoke_privatelink.md) function to disable your Snowflake account to use private
  connectivity to the Snowflake service.

## Examples

Verify whether AWS PrivateLink is authorized for your Snowflake account on AWS. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$GET_PRIVATELINK(
>     '185...',
>     '{
>       "Credentials": {
>           "AccessKeyId": "ASI...",
>           "SecretAccessKey": "enw...",
>           "SessionToken": "Fwo...",
>           "Expiration": "2021-01-07T19:06:23+00:00"
>       },
>       "FederatedUser": {
>           "FederatedUserId": "185...:sam",
>           "Arn": "arn:aws:sts::185...:federated-user/sam"
>       },
>       "PackedPolicySize": 0
>   }'
>   );
> ```

Verify whether Azure Private Link is authorized for your Snowflake account on Azure. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$GET_PRIVATELINK(
>   '/subscriptions/26d.../resourcegroups/sf-1/providers/microsoft.network/privateendpoints/test-self-service',
>   'eyJ...');
> ```

Verify whether Google Cloud Private Service Connect is authorized for your Snowflake account on GCP:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$GET_PRIVATELINK(
>   'my-gcp-project-id',
>   'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
>   );
> ```

---
title: SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_authorized_endpoints.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS

Returns a list of the authorized endpoints for your current account to use with private connectivity to the Snowflake service.

The endpoint value in the command output can be used as the value for the `aws_id` or the
`private-endpoint-resource-id` when using these functions:

* [SYSTEM$GET_PRIVATELINK](system_get_privatelink.md)
* [SYSTEM$AUTHORIZE_PRIVATELINK](system_authorize_privatelink.md)
* [SYSTEM$REVOKE_PRIVATELINK](system_revoke_privatelink.md)

## Syntax

> ```sqlsyntax
> SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS()
> ```

## Arguments

None

## Returns

Returns a list of JSON objects that show key-value pairs where a key represents the `endpoint Id Type`, and a value represents the
`endpoint Id`. For Azure, SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS returns two values, an endpoint ID and a Link Identifier.

**AWS:**

> `endpoint Id Type`
> :   A string label that represents the type of AWS endpoint.
>
> `endpoint Id`
> :   The AWS account ID which has been authorized to connect to the Snowflake endpoint service.

**Azure:**

> `endpoint Id Type`
> :   A string value that represents the type of Azure endpoint.
>
> `endpoint Id`
> :   The Azure resource ID authorized to connect to the Snowflake privatelink service.
>
> `link Identifier`
> :   The link ID of the endpoint that is associated with Azure resource ID.

**GCP:**

> `endpoint Id Type`
> :   A string value that represents the type of Google Cloud endpoint.
>
> `endpoint Id`
> :   The Google Cloud project ID authorized to create the private service connect endpoint to the Snowflake service attachment.

## Usage notes

* Only account administrators (that is. users with the ACCOUNTADMIN role) can execute this function.
* This function can be used with Snowflake accounts on Amazon Web Services (AWS), Microsoft Azure (Azure), and Google Cloud.

## Examples

**AWS**

Returns the authorized endpoints for your Snowflake account to use with AWS PrivateLink for your Snowflake account on AWS:

> ```sqlexample
> use role accountadmin;
> select system$get_privatelink_authorized_endpoints();
> ```

You can optionally use the following command to flatten the query result. For example:

> ```sqlexample
> select
>   value: endpointId
> from
>   table(
>     flatten(
>       input => parse_json(system$get_privatelink_authorized_endpoints())
>     )
>   );
> ```
>
> Returns (endpoints for a Snowflake account on AWS):
>
> > ```none
> > +----------------------+---------------------+
> > | KEY:ENDPOINT ID TYPE |   VALUE:ENDPOINT ID |
> > +----------------------+---------------------+
> > |  "123456789012"      |    "123456789012"   |
> > +----------------------+---------------------+
> > ```

---
title: SYSTEM$GET_PRIVATELINK_CONFIG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_config.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_PRIVATELINK_CONFIG

Returns a JSON representation of the Snowflake account information necessary to facilitate the self-service configuration of private
connectivity to the Snowflake service or Snowflake internal stages.

## Syntax

```sqlsyntax
SYSTEM$GET_PRIVATELINK_CONFIG()
```

## Arguments

None.

## Returns

The function returns a JSON object containing the following name/value pairs based on the cloud platform where your Snowflake account is
located:

**AWS**

> ```sqljson
> {
>   "regionless-snowsight-privatelink-url": "<privatelink_org_snowsight_url>",
>   "privatelink-account-name": "<account_identifier>",
>   "privatelink-connection-ocsp-urls": "<client_redirect_ocsp_url_list>",
>   "snowsight-privatelink-url": "<privatelink_region_snowsight_url>",
>   "privatelink-internal-stage": "<privatelink_stage_endpoint>",
>   "privatelink-account-url": "<privatelink_account_url>",
>   "privatelink-connection-urls": "<privatelink_connection_url_list>",
>   "regionless-privatelink-account-url": "<privatelink_org_account_url>",
>   "privatelink-ocsp-url": "<privatelink_ocsp_url>",
>   "privatelink-vpce-id": "<aws_vpce_id>",
>   "privatelink-account-principal": "<aws_principal_arn>",
>   "regionless-privatelink-ocsp-url": "<privatelink_org_ocsp_url>",
>   "app-service-privatelink-url": "<privatelink_streamlit_url>",
>   "privatelink-dashed-urls-for-duo": "<privatelink_duo_url_list>"
> }
> ```

**Microsoft Azure**

> ```sqljson
> {
>   "regionless-snowsight-privatelink-url": "<privatelink_org_snowsight_url>",
>   "privatelink-account-name": "<account_identifier>",
>   "privatelink-connection-ocsp-urls": "<client_redirect_ocsp_url_list>",
>   "snowsight-privatelink-url": "<privatelink_region_snowsight_url>",
>   "privatelink-internal-stage": "<privatelink_stage_endpoint>",
>   "privatelink-snowflake-managed-storage-volume-nfs": "<privatelink_volume_nfs_endpoint>",
>   "privatelink-snowflake-managed-storage-volume-fs": "<privatelink_volume_fs_endpoint>",
>   "privatelink-account-url":"<privatelink_account_url>",
>   "privatelink-connection-urls": "<privatelink_connection_url_list>",
>   "regionless-privatelink-account-url": "<privatelink_org_account_url>",
>   "privatelink-ocsp-url": "<privatelink_ocsp_url>",
>   "privatelink-pls-id": "<azure_privatelink_service_id>",
>   "regionless-privatelink-ocsp-url": "<privatelink_org_ocsp_url>",
>   "privatelink-dashed-urls-for-duo": "<privatelink_duo_url_list>"
> }
> ```

**Google Cloud Platform**

> ```sqljson
> {
>   "regionless-snowsight-privatelink-url": "<privatelink_org_snowsight_url>",
>   "privatelink-account-name": "<account_identifier>",
>   "privatelink-connection-ocsp-urls": "<client_redirect_ocsp_url_list>",
>   "snowsight-privatelink-url": "<privatelink_region_snowsight_url>",
>   "privatelink-account-url": "<privatelink_account_url>",
>   "privatelink-connection-urls": "<privatelink_connection_url_list>",
>   "regionless-privatelink-account-url": "<privatelink_org_account_url>",
>   "privatelink-ocsp-url": "<privatelink_ocsp_url>",
>   "privatelink-gcp-service-attachment": "<snowflake_service_endpoint>",
>   "regionless-privatelink-ocsp-url": "<privatelink_org_ocsp_url>",
>   "privatelink-dashed-urls-for-duo": "<privatelink_duo_url_list>"
> }
> ```

Where:

> `regionless-snowsight-privatelink-url`
> :   The URL for your [organization](../../user-guide/organizations.md) to access Snowsight using private connectivity to the Snowflake
>     service.
>
>     Use this URL to create a canonical name (i.e. CNAME) for DNS resolution. This URL should match the output for the
>     `SNOWSIGHT_DEPLOYMENT_REGIONLESS` (i.e. `TYPE`) from the [SYSTEM$ALLOWLIST_PRIVATELINK](system_allowlist_privatelink.md)
>     function.
>
>     For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).
>
> `privatelink-account-name`
> :   The identifier for your Snowflake account.
>
>     Use this value with clients for [Applications and tools for connecting to Snowflake](../../guides-overview-connecting.md).
>
>     For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md).
>
> `privatelink-connection-ocsp-urls`
> :   The list of OCSP URLs for use with [Redirecting client connections](../../user-guide/client-redirect.md).
>
>     The list of values should match the output for `OCSP_CLIENT_FAILOVER` from the SYSTEM$ALLOWLIST_PRIVATELINK function.
>
> `snowsight-privatelink-url`
> :   The URL containing the [cloud region](../../user-guide/intro-regions.md) to access Snowsight and the Snowflake Marketplace using
>     private connectivity to the Snowflake service.
>
>     Use this URL to create a canonical name (i.e. CNAME) for DNS resolution. This URL should match the output for the
>     `SNOWSIGHT_DEPLOYMENT` (i.e. `TYPE`) from the [SYSTEM$ALLOWLIST_PRIVATELINK](system_allowlist_privatelink.md) function.
>
>     For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).
>
> `privatelink-internal-stage`
> :   The endpoint to connect to your Snowflake internal stage using AWS PrivateLink or Azure Private Link.
>
>     Use this value with private connectivity to Snowflake internal stages.
>
>     The visibility of this key and the corresponding value in the query result depends on the
>     [ENABLE_INTERNAL_STAGES_PRIVATELINK](../parameters.md) parameter setting. The default setting for this parameter is `FALSE`. You must set
>     this parameter to `TRUE` prior to executing this system function to obtain the internal stage endpoint in the query result.
>
> `privatelink-snowflake-managed-storage-volume-nfs`
> :   The endpoint to connect to your non failsafe Snowflake-managed storage volume using Azure Private Link.
>
>     Use this value with private connectivity to Snowflake-managed storage volumes for Apache Iceberg tables.
>
>     The visibility of this key and the corresponding value in the query result depends on the
>     [ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK](../parameters.md) parameter setting. The default setting for this parameter is
>     `FALSE`. You must set this parameter to `TRUE` prior to executing this system function to obtain the endpoint in the query
>     result.
>
> `privatelink-snowflake-managed-storage-volume-fs`
> :   The endpoint to connect to your failsafe Snowflake-managed storage volume using Azure Private Link.
>
>     Use this value with private connectivity to Snowflake-managed storage volumes for Apache Iceberg tables.
>
>     The visibility of this key and the corresponding value in the query result depends on the
>     [ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK](../parameters.md) parameter setting. The default setting for this parameter is
>     `FALSE`. You must set this parameter to `TRUE` prior to executing this system function to obtain the endpoint in the query
>     result.
>
> `privatelink-account-url`
> :   The URL to connect to your Snowflake account using AWS PrivateLink, Azure Private Link, or Google Cloud Private Service Connect.
>
>     Use this value to create a canonical name (i.e. CNAME) for DNS resolution. This URL should match the output from
>     [SYSTEM$ALLOWLIST_PRIVATELINK](system_allowlist_privatelink.md).
>
>     For more information on URL formats, see [Account identifiers](../../user-guide/admin-account-identifier.md).
>
> `privatelink-connection-urls`
> :   The list of connection URLs for [Client Redirect](../../user-guide/client-redirect.md).
>
>     Use these URLs to create a canonical name (i.e. CNAME) for DNS resolution. These URL should match the output for
>     `CLIENT_FAILOVER` (i.e. `TYPE`) from the [SYSTEM$ALLOWLIST_PRIVATELINK](system_allowlist_privatelink.md) function.
>
> `regionless-privatelink-account-url`
> :   The private connectivity URL that includes your organization name and account name.
>
>     This value matches the output value of `SNOWFLAKE_DEPLOYMENT_REGIONLESS` in the
>     [SYSTEM$ALLOWLIST_PRIVATELINK](system_allowlist_privatelink.md) function.
>
> `privatelink-ocsp-url`
> :   The OCSP URL corresponding to your Snowflake account identifier that uses AWS PrivateLink, Microsoft Azure Private Link, or Google
>     Cloud Private Service Connect.
>
>     Use this value to create a canonical name (i.e. CNAME) for DNS resolution.
>
> `privatelink-vpce-id`
> :   The AWS VPCE ID for your account identifier.
>
>     Use this value to create an AWS VPC endpoint (i.e. VPCE).
>
> `privatelink-account-principal`
> :   The AWS principal ARN to allow for outbound private connections to your VPC endpoint services.
>
>     Use this value to set the
>     [allowed principal](https://docs.aws.amazon.com/vpc/latest/privatelink/configure-endpoint-service.html#add-remove-permissions)
>     of your endpoint service, which allows Snowflake to connect to your endpoint service via
>     [AWS PrivateLink](../../user-guide/private-manage-endpoints-aws.md).
>
> `privatelink-pls-id`
> :   The Microsoft Azure Private Link Service ID for your account identifier in the format of an alias. For example:
>
>     > `sf-pvlinksvc-azurecentralus.<unique_identifier>.centralus.azure.privatelinkservice`
>     >
>     > Where the `<unique_identifier>` is in GUID/UUID format.
>
>     Use this value to create an Azure Private Link private endpoint. If you receive an error while creating the private endpoint, contact
>     [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and ask for the resource ID that is associated with this alias value.
>
> `privatelink-gcp-service-attachment`
> :   The endpoint for the Snowflake service when using Google Cloud Private Service Connect.
>
>     Use this value when creating a forwarding rule to route the Private Service Connect endpoint in your VPC to the Snowflake service.
>
> `"regionless-privatelink-ocsp-url`
> :   The OCSP URL for your [account identifier](../../user-guide/admin-account-identifier.md).
>
>     The value is recorded as follows:
>
>     `"ocsp.org_name-account_name.privatelink.snowflakecomputing.com"`
>
>     Where:
>
>     * `org_name` is the name of your Snowflake organization.
>     * `account_name` is the unique name of your account within your organization.
>
> `app-service-privatelink-url`
> :   The PrivateLink endpoint URL used to route traffic to Snowflake-hosted app services, such as Streamlit or Notebooks.
>
> `privatelink-dashed-urls-for-duo`
> :   The list of dashed variant URLs is shown only when the hostname contains an underscore. These URLs are used for Duo Multi-Factor Authentication.

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* For Snowflake accounts on Microsoft Azure, if you call the function and the query time is greater than one minute, please contact
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Examples

Retrieve the JSON information for your Snowflake account on AWS:

> ```sqlexample
> SELECT SYSTEM$GET_PRIVATELINK_CONFIG();
> ```

You can optionally run the following command to flatten the JSON output. The following output is an example for a Snowflake account on
Microsoft Azure:

> ```sqlexample
> select key, value from table(flatten(input=>parse_json(SYSTEM$GET_PRIVATELINK_CONFIG())));
>
> +--------------------------------------+--------------------------------------+
> | KEY                                  | VALUE                                |
> +--------------------------------------+--------------------------------------+
> | regionless-snowsight-privatelink-url | "<privatelink_org_snowsight_url>"    |
> |--------------------------------------+--------------------------------------|
> | privatelink-account-name             | "<account_identifier>"               |
> |--------------------------------------+--------------------------------------|
> | privatelink-connection-ocsp-urls     | "<client_redirect_ocsp_url_list>"    |
> |--------------------------------------+--------------------------------------|
> | snowsight-privatelink-url            | "<privatelink_region_snowsight_url>" |
> |--------------------------------------+--------------------------------------|
> | privatelink-internal-stage           | "<privatelink_stage_endpoint>"       |
> |--------------------------------------+--------------------------------------|
> | privatelink-snowflake-managed-       | "<privatelink_volume_nfs_endpoint>"  |
> | storage-volume-nfs                   |                                      |
> |--------------------------------------+--------------------------------------|
> | privatelink-snowflake-managed-       | "<privatelink_volume_fs_endpoint>"   |
> | storage-volume-fs                    |                                      |
> |--------------------------------------+--------------------------------------|
> | privatelink-account-url              | "<privatelink_account_url>"          |
> |--------------------------------------+--------------------------------------|
> | privatelink-connection-urls          | "<privatelink_connection_url_list>"  |
> |--------------------------------------+--------------------------------------|
> | privatelink-pls-id                   | "<azure_private_link_service_id>"    |
> |--------------------------------------+--------------------------------------|
> | regionless-privatelink-account-url   | "<privatelink_org_account_url>"      |
> |--------------------------------------+--------------------------------------|
> | privatelink-ocsp-url                 | "<privatelink_ocsp_url>"             |
> |--------------------------------------+--------------------------------------|
> | regionless-privatelink-ocsp-url      | "<privatelink_org_ocsp_url>"         |
> +--------------------------------------+--------------------------------------+
> ```

---
title: SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_endpoint_registrations.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS

Returns the registered private endpoints that can route your connection to the Snowflake service.

## Syntax

```sqlsyntax
SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS()
```

## Arguments

None.

## Returns

Returns a list of JSON objects, with each JSON object specifying a registered private connectivity endpoint. A string containing an
empty JASON array (`"[]"`) is returned if the account doesn’t have any registered private connectivity endpoints to the Snowflake Service.

Where:

> `consumerEndpointId`
> :   Specifies the AWS account id containing the registered VPC endpoint, or the Azure resource group identifier
>     containing the registered private endpoint.
>
> `consumerEndpointType`
> :   Specifies the type of registered private connectivity endpoint.
>
> `pinnedConsumerEndpointId`
> :   Specifies the private connectivity endpoint identifier that is registered with Snowflake.
>
> `providerServiceEndpoint`
> :   Specifies the identifier for the private connectivity service endpoint in the Snowflake VPC.

## Usage notes

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

Return the registered private connectivity endpoints that route your connection to the Snowflake service:

**AWS:**

```sqlexample
 use role accountadmin;

SELECT SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS();
```

```json
[
  {
    "consumerEndpointId": "148896251...",
    "consumerEndpointType": "Aws Id",
    "pinnedConsumerEndpointId": "vpce-0be92fc5953c0...",
    "providerServiceEndpoint": "vpce-svc-0dcda6d2e9d14..."
  }
]
```

**Azure:**

```sqlexample
 use role accountadmin;

SELECT SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS();
```

```json
[
  {
    "consumerEndpointId": "/subscriptions/a92a429f-83ba-4249.../..../snowflake-private-link",
    "consumerEndpointType": "Azure Endpoint Connection Id",
    "pinnedConsumerEndpointId": "184549...",
    "providerServiceEndpoint": "sf-pvlinksvc-azcanadacentral.70f..."
  }
]
```

---
title: SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_privatelink_endpoints_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO

Returns the status of all private connectivity endpoints that you provision. The endpoint can be a service endpoint or a resource endpoint
depending on the cloud platform that hosts your Snowflake account.

## Syntax

```sqlsyntax
SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO()
```

## Returns

Returns a JSON object with the following fields:

**AWS:**

> `provider_service_name`
> :   Name of the service or resource.
>
> `snowflake_endpoint_name`
> :   The VPC endpoint ID in your Snowflake account. This field contains a temporary name while the endpoint is
>     being created. After the endpoint is created, and `endpoint_state` changes to `CREATED`, then this name changes.
>
> `endpoint_state`
> :   The endpoint state in Snowflake. This field can contain one of the following states:
>
>     * `PENDING_CREATION`: The endpoint is still being created.
>     * `CREATED`: Indicates that Snowflake received a response from the cloud provider that the endpoint was successfully created and
>       is ready to use.
>     * `FAILED`: The endpoint is in an unexpected state on the cloud provider, and cannot be used.
>     * `PENDING_DELETION`: The endpoint is on the deletion queue, but can be restored.
>     * `DELETING`: The endpoint is being deleted on the cloud provider and cannot be restored.
>
> `host`
> :   Hostname used to connect to the service.
>
> `status`
> :   The endpoint provisioning status on AWS. This field can contain one of the following statuses:
>
>     * `Pending`: The endpoint is still being created.
>     * `Available`: The endpoint is created and ready to use.

**Azure:**

> `provider_resource_id`
> :   Azure Resource ID of the resource that the endpoint connects to.
>
> `subresource`
> :   Subresource of the Azure resource that the endpoint connects to.
>
> `snowflake_resource_id`
> :   Azure Resource ID of the private endpoint that connects to the Azure resource.
>
> `host`
> :   Hostname used to connect to the resource.
>
> `endpoint_state`
> :   The endpoint state in Snowflake. This field can contain one of the following states:
>
>     * `PENDING_CREATION`: The endpoint is still being created.
>     * `CREATED`: Indicates that Snowflake received a response from the cloud provider that the endpoint was successfully created and
>       is ready to use.
>     * `FAILED`: The endpoint is in an unexpected state on the cloud provider, and cannot be used.
>     * `PENDING_DELETION`: The endpoint is on the deletion queue, but can be restored.
>     * `DELETING`: The endpoint is being deleted on the cloud provider and cannot be restored.
>
> `status`
> :   The endpoint provisioning status on Microsoft Azure. Use this field to determine if Microsoft Azure has approved the private endpoint connection to the
>     resource. This field can contain one of the following statuses:
>
>     > * `APPROVED`
>     > * `PENDING`
>     > * `DISCONNECTED`
>     > * `REJECTED`

**Google Cloud:**

> `provider_resource_id`
> :   The resource ID (service attachment ID) that the private connectivity endpoint connects to.
>
> `snowflake_resource_id`
> :   The identifier of the private connectivity endpoint.
>
> `host`
> :   The hostname to use when accessing the provider service or resource that uses this endpoint.
>
> `endpoint_state`
> :   The state of the endpoint on the Snowflake side.
>
> `status`
> :   The connection status on Google Cloud. NO CONNECTION might appear shortly after creating the private connectivity endpoint, because the
>     cloud provider takes time to complete the connection setup. This field can contain one of the following statuses:
>
>     * `ACCEPTED`
>     * `NO CONNECTION`
>     * `REJECTED`

## Usage notes

* This function can take approximately five minutes to run because it depends on the process to retrieve the private connectivity
  endpoints in the cloud platform that are outside of Snowflake.

## Examples

**AWS:**

> List all PrivateLink endpoints with external access to Amazon S3, execute the following SQL statement:

SQLReturned value

```sqlexample
SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
```

```json
[
  {
    "provider_service_name": "com.amazonaws.us-west-2.s3",
    "snowflake_endpoint_name": "vpce-123456789012abcdea",
    "endpoint_state": "CREATED",
    "host": "*.s3.us-west-2.amazonaws.com",
    "status": "Available"
  },
  ...
```

For your Snowflake account on Amazon Web Services, return the private connectivity endpoint for a specific resource identifier:

**Azure:**

> For your Snowflake account on Microsoft Azure, list the private connectivity endpoints that you provisioned and the service names that
> each endpoint is associated with:
>
> SQLReturned value
>
> ```sqlexample
> SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
> ```
>
> ```json
>   [
>      {
>         "provider_resource_id": "/subscriptions/11111111-2222-3333-4444-5555555555/...",
>         "subresource": "sqlServer",
>         "snowflake_resource_id": "/subscriptions/fa57a1f0-b4e6-4847-9c00-95f39520f...",
>         "host": "testdb.database.windows.net",
>         "endpoint_state": "CREATED",
>         "status": "Approved",
>      }
>   ]
> ```

**Google Cloud**

> For your Snowflake account on Google Cloud, list the private connectivity endpoints that you provisioned and the service names that
> each endpoint is associated with:
>
> > SQLReturned value
> >
> > ```sqlexample
> > SELECT SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO();
> > ```
> >
> > ```json
> >   [
> >      {
> >         "provider_resource_id": "projects/my-project/regions/us-east4/serviceAttachments/...",
> >         "snowflake_resource_id": "abcd0000000000000001",
> >         "host": "my-service.com",
> >         "endpoint_state": "CREATED",
> >         "status": "ACCEPTED",
> >      }
> >   ]
> > ```

---
title: SYSTEM$GET_PURCHASE_ATTRIBUTES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_purchase_attributes.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_PURCHASE_ATTRIBUTES

Identifies the behavior of a listing at runtime.

## Syntax

```sqlsyntax
SYSTEM$GET_PURCHASE_ATTRIBUTES()
```

## Arguments

None

## Returns

The function returns a value of type VARCHAR.

The returned string is in JSON format and contains the following name/value pairs:

`pricing_plan_identifier`
:   The identifier for the pricing plan associated with the listing.

`discount`
:   The pricing plan discount.

`offer_name`
:   The name of the private offer associated with the listing.

## Examples

```sqlexample
SELECT SYSTEM$GET_PURCHASE_ATTRIBUTES();
```

```output
+-----------------------------------------------------------------------------------------+
| SYSTEM$GET_PURCHASE_ATTRIBUTES()                                                        |
|-----------------------------------------------------------------------------------------|
| {"pricing_plan_identifier":"TESTPLAN","discount":10.0,"offer_name":"TESTOFFER_WELE_RO"} |
+-----------------------------------------------------------------------------------------+
```

---
title: SYSTEM$GET_REFERENCED_OBJECT_ID_HASH
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_referenced_object_id_hash.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_REFERENCED_OBJECT_ID_HASH

Returns the hash of the entity ID of the consumer object. This is the identifier of the entity resolved originally when a reference was created.

This function is useful for an app to determine whether the object bound to a reference has changed. The app can save the value and then compare the current value to the previously known value.

## Syntax

```sqlsyntax
SYSTEM$GET_REFERENCED_OBJECT_ID_HASH('<reference_name>'[, '<alias>'])
```

## Arguments

**Required**

`'reference_name'`
:   The name of the reference as specified in the `manifest.yml` file of the app.

`'reference_string'`
:   The system-generated ID of the reference to the object in the consumer account.

---
title: SYSTEM$GET_RESULTSET_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_resultset_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Query Information)

# SYSTEM$GET_RESULTSET_STATUS

Returns the status of a [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md) in a Snowflake Scripting
stored procedure.

This function can be useful for getting the status an
[asynchronous child job](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md)
that is running for a RESULTSET.

## Syntax

```sqlsyntax
SYSTEM$GET_RESULTSET_STATUS( <resultset_name> )
```

## Arguments

`resultset_name`
:   The name of the RESULTSET.

## Returns

This function returns the status of the RESULTSET in a value of type VARCHAR.
The following status values are possible:

| Status | Description |
| --- | --- |
| RUNNING | The query is still running. |
| SUCCESS | The query finished successfully. |
| ABORTING | The query is in the process of being aborted on the server side. |
| FAILED_WITH_ERROR | The query finished unsuccessfully due to an error in the query. |
| FAILED_WITH_INCIDENT | The query finished unsuccessfully due to an incident on the server side. |
| ABORTED | The query was aborted on the server side. |
| QUEUED | The query is queued for execution (that is, hasn’t yet started running), typically because it is waiting for resources. |
| DISCONNECTED | The session’s connection is broken. The query’s state will change to FAILED_WITH_ERROR soon. |
| RESUMING_WAREHOUSE | The warehouse is starting up, and the query isn’t yet running. |
| QUEUED_REPAIRING_WAREHOUSE | The warehouse is being repaired, and the query is queued for execution. |
| RESTARTED | The query restarted. |
| BLOCKED | The query is waiting on a lock held by another statement. |

## Usage notes

This function can only be called in a [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

## Examples

The following example calls SYSTEM$GET_RESULTSET_STATUS twice to return the status of an asynchronous
child job that is running for a RESULTSET. The example calls the function while the asynchronous child job is
running and after it completes.

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  status2 VARCHAR DEFAULT 'invalid';
BEGIN
  LET res RESULTSET := ASYNC (SELECT SYSTEM$WAIT(3));
  LET status VARCHAR := SYSTEM$GET_RESULTSET_STATUS(res);

  AWAIT res;
  status2 := SYSTEM$GET_RESULTSET_STATUS(res);
  RETURN [status, status2];
END;
$$;
```

```output
+------------------+
| GET_QUERY_STATUS |
+------------------+
| [                |
|   "RUNNING",     |
|   "SUCCESS"      |
| ]                |
+------------------+
```

---
title: SYSTEM$GET_SERVICE_DNS_DOMAIN
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_service_dns_domain.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_SERVICE_DNS_DOMAIN

Given a schema name, returns that schema’s DNS namespace hash as a string.

See also:
:   [Working with Services](../../developer-guide/snowpark-container-services/working-with-services.md)

## Syntax

```sqlsyntax
SYSTEM$GET_SERVICE_DNS_DOMAIN( <schema_name> )
```

## Arguments

`schema_name`
:   Schema name. If the schema is not in the current database, specify the fully qualified name of the schema.

## Returns

Returns the schema’s DNS namespace hash as a string.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Schema |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

If TUTORIAL_DB is the current database, then both of the following return the same result. This is the same DNS domain that appears in the DNS name (as reported by [SHOW SERVICES](../sql/show-services.md)) for any service in the DATA_SCHEMA schema.

```sqlexample
SELECT SYSTEM$GET_SERVICE_DNS_DOMAIN('DATA_SCHEMA');
SELECT SYSTEM$GET_SERVICE_DNS_DOMAIN('TUTORIAL_DB.DATA_SCHEMA');
```

Example output:

```output
+----------------------------------------------+
| SYSTEM$GET_SERVICE_DNS_DOMAIN('DATA_SCHEMA') |
|----------------------------------------------|
| k3m6.svc.spcs.internal                       |
+----------------------------------------------+
```

---
title: SYSTEM$GET_SERVICE_LOGS
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_service_logs.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_SERVICE_LOGS

Retrieves local logs from a
[Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md) container.

See also:
:   [Publishing and accessing container logs](../../developer-guide/snowpark-container-services/monitoring-services.md)

## Syntax

```sqlsyntax
SYSTEM$GET_SERVICE_LOGS( <service_name>, <instance_id>, <container_name>
   [, <number_of_most_recent_lines> ] [, <retrieve_previous_logs> ])
```

## Arguments

**Required:**

`service_name`
:   Service name.

`instance_id`
:   ID of the service instance, starting with 0.

`container_name`
:   Container name as specified in the service specification file.

**Optional:**

`number_of_most_recent_lines`
:   Number of trailing log lines to retrieve.

    Default: Up to 100 KB of the most recent log lines.

`retrieve_previous_logs`
:   If TRUE, the function retrieves logs from a previously terminated container. You can specify this parameter only if the container has been restarted at least once.

    Default: false (retrieve logs from the currently running container).

## Returns

Returns a string consisting of newline-separated log entries from the specified service container.

## Usage notes

* The current role must have the MONITOR privilege on the service to access the container logs.
* The function returns a container log as a string. You can use the [SPLIT_TO_TABLE](split_to_table.md) function to
  convert the string into a table containing one row for each newline-separated entry.

## Examples

### Retrieving logs from the current container

The following statement retrieves the last 10 log lines from the instance 0 of the “echo_service” service that is running in
the “echo” container:

```sqlexample
SELECT SYSTEM$GET_SERVICE_LOGS('TUTORIAL_DB.data_schema.echo_service', 0, 'echo', 10);
```

You can also follow [Tutorial 1: Create a Snowpark Container Services Service](../../developer-guide/snowpark-container-services/tutorials/tutorial-1.md) to start a service and execute the
preceding command to get the service log from a container.

The function returns a string consisting of newline-separated log entries. You can convert this string into a table using the
[SPLIT_TO_TABLE](split_to_table.md) function and the TABLE() keyword (see [Table functions](../functions-table.md)).

```sqlexample
SELECT value AS log_line
  FROM TABLE(
    SPLIT_TO_TABLE(SYSTEM$GET_SERVICE_LOGS('echo_service', 0, 'echo'), '\n')
  )
```

You can further apply a filter to retrieve only specific log entries. The WHERE clause in the following SELECT statement uses the
[CONTAINS](contains.md) function to retrieve only the log lines containing a specific date string:

```sqlexample
SELECT value AS log_line
  FROM TABLE(
   SPLIT_TO_TABLE(SYSTEM$GET_SERVICE_LOGS('echo_service', '0', 'echo'), '\n')
  )
  WHERE (CONTAINS(log_line, '06/Jun/2023 02:44:'))
  ORDER BY log_line;
```

The following sample output shows three log entry rows retrieved:

```output
+-----------------------------------------------------------------------------------------------------+
| LOG_LINE                                                                                            |
|-----------------------------------------------------------------------------------------------------|
| 10.16.9.193 - - [06/Jun/2023 02:44:04] "GET /healthcheck HTTP/1.1" 200 -                            |
| 10.16.9.193 - - [06/Jun/2023 02:44:09] "GET /healthcheck HTTP/1.1" 200 -                            |
| 10.16.9.193 - - [06/Jun/2023 02:44:14] "GET /healthcheck HTTP/1.1" 200 -                            |
+-----------------------------------------------------------------------------------------------------+
```

### Retrieving logs from a previously terminated container

The following statement retrieves the last 10 log lines from the previously terminated instance of the “echo_service” service that is running in the “echo” container. Here we assume the container restarted at least once:

```sqlexample
SELECT SYSTEM$GET_SERVICE_LOGS('TUTORIAL_DB.data_schema.echo_service', 0, 'echo', 10, true);
```

---
title: SYSTEM$GET_SERVICE_STATUS — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_service_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_SERVICE_STATUS — *Deprecated*

Retrieves the status of a
[Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md).

## Syntax

```sqlsyntax
SYSTEM$GET_SERVICE_STATUS( [ <db_name>.<schema_name>. ]<service_name> [ , <timeout_secs> ]  )
```

## Arguments

**Required:**

`service_name`
:   Service name. If you omit the `db_name` and `schema_name`, the function uses the current database and schema.

**Optional:**

`timeout_secs`
:   Number of seconds to wait for the service to reach a steady state (for example, READY) before returning the status. If the
    service does not reach a steady state within the specified time, Snowflake returns the current state.

    If not specified, Snowflake returns the current state immediately.

    Default: 0 seconds

## Returns

Returns status information in a JSON array with one JSON object for each container in each service instance. The JSON fields are:

* `status`. Service container status. Currently supported status values include: PENDING, READY, FAILED and UNKNOWN.
* `message`. Provides details about the specific status. For example, when the status is PENDING, this field describes why.
* `containerName`. Container name.
* `instanceId`. Service instance ID.
* `serviceName`. Service name.
* `image`. URL of the image that is running.
* `restartCount`. Number of times Snowflake restarted the container. A higher restart count can indicate an unhealthy
  service. For example, if your service code crashes, the container can exit. Snowflake then tries to restart the container.
  In this case, to investigate, you can access the container log using these options:

  + Use the [SYSTEM$GET_SERVICE_LOGS](system_get_service_logs.md) function for live logs (the container is running).
  + Use Event tables for persistent logs (useful when the container is no longer running).
* `startTime`. Time when the container started.

## Usage notes

* The current role must have the MONITOR privilege on the service to get the status information.

## Examples

The following function retrieves status information for the “echo_service” service. The function specifies a 5-second timeout:

```sqlexample
SELECT SYSTEM$GET_SERVICE_STATUS('echo_service', 5);
```

Example outputs:

* **Running one service instance that has one container.** The function returns the container information as shown:

  ```json
  [
   {
      "status":"READY",
      "message":"Running",
      "containerName":"echo",
      "instanceId":"0",
      "serviceName":"ECHO_SERVICE",
      "image":"<account>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:tutorial",
      "restartCount":0,
      "startTime":"2023-01-01T00:00:00Z"
   }
  ]
  ```

  `instanceId` is the service instance ID. If you have two instances of this service running, the array includes two
  objects in the output, providing container status for two separate service instances (the `instanceId` will be 0 and 1).
* **Running one service instance that has three containers (as defined in the service specification).** The function returns an
  array with three objects (one for each container):

  ```json
  [
    {
    "status":"READY",
    "message":"Running",
    "containerName":"echo-1",
    "instanceId":"0",
    "serviceName":"ECHO_SERVICE",
    "image":"<account>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image_x:tutorial",
    "restartCount":0,
    "startTime":"2023-01-01T00:00:00Z"
    },
    {
    "status":"READY",
    "message":"Running",
    "containerName":"echo-2",
    "instanceId":"0",
    "serviceName":"ECHO_SERVICE",
    "image":"<account>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image_y:tutorial",
    "restartCount":0,
    "startTime":"2023-01-01T00:00:00Z"
    },
    {
    "status":"READY",
    "message":"Running",
    "containerName":"echo-3",
    "instanceId":"0",
    "serviceName":"ECHO_SERVICE",
    "image":"<account>.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image_z:tutorial",
    "restartCount":0,
    "startTime":"2023-01-01T00:00:00Z"
    }
  ]
  ```

Because all these containers belong to the same service instance, the `instanceId` will be 0 for all containers.

---
title: SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_snowflake_egress_ip_ranges.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES

Support for this feature is available only for external access on AWS.

Returns a list of egress IP address ranges (as Classless Inter-Domain Routing (CIDR) IP addresses) that you can use to represent
Snowflake in a server’s IP allowlist.

Use this function to obtain a list of egress IP address ranges with which to allow Snowflake traffic on external servers. You
can add IP addresses from the list to the allowlist on an external server from which Snowflake makes requests.

For example, you can allow requests by user-defined functions (UDFs) deployed on Snowflake to access resources on an external server.
To do this, you add Snowflake egress IP addresses to the network firewall for your server.

Addresses in the returned list expire. You can automate refreshes from the list as described in
[Securing ingress of Snowflake requests with egress IP addresses](../../user-guide/egress-ip/network-egress.md).

## Syntax

```sqlsyntax
SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES()
```

## Returns

Returns JSON containing a list of CIDR IP addresses, effective date, and an expiration date for each address. The following example shows what the
return value looks like:

```sqlexample
SELECT SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES();
```

```json
{
  "ipv4_prefix": "153.45.151.0/24",
  "effective": "2025-06-30T23:59:59Z",
  "expires": "2026-08-30T23:59:59Z"
}
```

## Usage notes

Keep in mind the following about the returned list of CIDR IP address ranges:

* Each IP address expires. The returned list includes both the IP address and its expiration date and time. To allow continued access
  from Snowflake, automate refreshing your allowlist with addresses that have not yet expired.

  For more information, see [Securing ingress of Snowflake requests with egress IP addresses](../../user-guide/egress-ip/network-egress.md).
* Addresses are scoped to the region of your Snowflake deployment. Addresses for one region differ from those for another region.
* Addresses are shared among Snowflake accounts in the region. In other words, they’re not unique to a Snowflake account.

---
title: SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_snowflake_platform_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO

Returns platform information for the cloud provider that hosts your Snowflake account.
The function returns different values, depending on your cloud provider:

* For Amazon Web Services (AWS) and Microsoft Azure,
  the function returns the Amazon Virtual Private Cloud (Amazon VPC) IDs or Azure Virtual Network (VNet) IDs.

  A cloud administrator in your company can specify VPC IDs in trust policies. Doing so allows Snowflake to connect to
  the following resources, and denies requests that originate from outside of the virtual network:

  + Your cloud storage.
  + Your [proxy service](../external-functions-introduction.md) for your
    [external function](../external-functions.md).

  This security restriction can limit traffic to your cloud storage or proxy service on the same cloud platform.
* For Google Cloud, the function returns the project ID and Google Workspace customer ID associated with the Snowflake service account.

  A cloud administrator can use this information to update the domain restriction constraint in an organization policy.

For more information, see the following information for your cloud platform:

AWS:
:   [Allowing the Virtual Private Cloud IDs](../../user-guide/data-load-s3-allow.md)

GCS:
:   [Allow access to Google Cloud Storage](../../user-guide/data-load-gcs-allow.md)

Azure:
:   [Allow the VNet subnet IDs](../../user-guide/data-load-azure-allow.md)

## Syntax

```sqlsyntax
SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO()
```

## Arguments

None.

## Usage notes

Only returns results for account administrators (users with the ACCOUNTADMIN role).

## Examples

Query the IDs of the virtual network in which your Snowflake account is located:

> ```sqlexample
> SELECT SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO();
> ```

---
title: SYSTEM$GET_TABLE_ARCHIVE_METADATA
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_table_archive_metadata.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_TABLE_ARCHIVE_METADATA

Returns metadata about the archived data for a table,
without requiring data retrieval from the archive tier.

See also:
:   [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md),
    [Retrieve archived data](../../user-guide/storage-management/storage-lifecycle-policies-retrieving-archived-data.md)

## Syntax

```sqlsyntax
SYSTEM$GET_TABLE_ARCHIVE_METADATA( '<table_name>' )
```

## Arguments

`'table_name'`
:   The name of the table with archived data. The table must have had data archived to the COOL or COLD tier,
    usually by a storage lifecycle policy.

## Returns

Returns a TEXT value containing JSON with metadata about the archived data. The JSON structure includes:

* `rowCount`: The number of rows in the archive.
* `columns`: An object containing metadata for each column:

  + `column_id`: The column ID (as shown in the COLUMNS view).
  + `data_type`: Column data type
  + `min`: The minimum value for the column, or `null` if not applicable.
  + `max`: The maximum value for the column, or `null` if not applicable.

> **Note:**
>
> The `min` and `max` values are `null` for TEXT, OBJECT, ARRAY, and VARIANT data types.

The output also includes the archived timestamp column (`METADATA$STORAGE_LIFECYCLE_POLICY_ARCHIVED_TIMESTAMP`),
which indicates when each row was archived.

**Example output:**

```json
{
  "rowCount": 2304,
  "columns": {
    "CUSTOMER_ID": {
      "column_id": 10283,
      "data_type": "fixed",
      "min": -23,
      "max": 54032
    },
    "CUSTOMER_NAME": {
      "column_id": 10284,
      "data_type": "text",
      "min": null,
      "max": null
    },
    "METADATA$STORAGE_LIFECYCLE_POLICY_ARCHIVED_TIMESTAMP": {
      "data_type": "timestampltz",
      "min": "2025-01-02T03:04:05.6789Z",
      "max": "2025-11-12T13:14:15.1617Z"
    }
  }
}
```

## Usage notes

* The table owner or an account administrator (a user with the ACCOUNTADMIN role) who has access
  to the table can execute this function.
* Use this function to inspect archived data metadata without incurring the cost of retrieving data
  from the archive tier.
* The `column_id` field helps distinguish columns when a column has been dropped and a new column
  with the same name has been added later.
* To retrieve the actual archived data, use the
  [CREATE TABLE … FROM ARCHIVE OF](../sql/create-table.md) command.

## Examples

Retrieve metadata about archived data for a table:

```sqlexample
SELECT SYSTEM$GET_TABLE_ARCHIVE_METADATA('my_database.my_schema.my_table');
```

Parse the JSON output to extract specific information:

```sqlexample
SELECT PARSE_JSON(SYSTEM$GET_TABLE_ARCHIVE_METADATA('my_database.my_schema.my_table')):rowCount AS archived_row_count;
```

---
title: SYSTEM$GET_TAG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_tag.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$GET_TAG

Returns the tag value associated with the specified Snowflake object or column. Returns NULL if a tag is not set on the specified
Snowflake object or column.

## Syntax

```sqlsyntax
SYSTEM$GET_TAG( '<tag_name>' , '<obj_name>' , '<obj_domain>' )
```

## Arguments

`'tag_name'`
:   The name of the tag as a string.

    The name is the `key` in the key-value pair of the tag. For example, in the tag `cost_center = 'sales'`, `cost_center` is the
    key-name of the tag. For this argument, use `'cost_center'`.

`'obj_name'`
:   The name of the object as a string.

    For example, if a table name is `my_table`, use `'my_table'` as the name of the object.

    To specify a column, use the format `<table_name>.<column_name>`. For example, `my_table.revenue`.

    For more information, see [Object identifiers](../identifiers.md).

`'object_domain'`
:   Domain of the reference object, such as a table or view, if the tag association is on the object. For columns, the domain is `COLUMN`
    if the tag association is on a column.

    Use one of the following values:

    > * `'ACCOUNT'`
    > * `'ALERT'`
    > * `'BACKUP POLICY'`
    > * `'BACKUP SET'`
    > * `'COLUMN'`
    > * `'COMPUTE POOL'`
    > * `'CORTEX AGENT'`
    > * `'DATABASE'`
    > * `'DATABASE ROLE'`
    > * `'FAILOVER GROUP'`
    > * `'FUNCTION'`
    > * `'INTEGRATION'`
    > * `'INSTANCE'`
    > * `'NETWORK POLICY'`
    > * `'PROCEDURE'`
    > * `'REPLICATION GROUP'`
    > * `'ROLE'`
    > * `'SCHEMA'`
    > * `'SHARE'`
    > * `'SNAPSHOT POLICY'` (deprecated; prefer `'BACKUP POLICY'`)
    > * `'SNAPSHOT SET'` (deprecated; prefer `'BACKUP SET'`)
    > * `'SNOWFLAKE INTELLIGENCE'`
    > * `'STAGE'`
    > * `'STREAM'`
    > * `'TABLE'`: Use this for all table-like objects such as views, materialized views, and external tables.
    > * `'TASK'`
    > * `'USER'`
    > * `'WAREHOUSE'`

## Usage notes

* Using this function requires:

  + The privileges to run a [DESCRIBE <object>](../sql/desc.md) operation on the specified object name.
  + USAGE on the database and schema in which the tag exists.

    For more information, see [Tag Privilege & DDL Summary](../../user-guide/object-tagging/work.md).
  + IMPORTED PRIVILEGES on the shared SNOWFLAKE database if you specify a [system classification tag](../../user-guide/classify-intro.md).

## Examples

Returns `NULL` if a tag is not associated to the specified object:

> ```sqlexample
> select system$get_tag('cost_center', 'my_table', 'table');
>
> +-----------------------------------------------------+
> | SYSTEM$GET_TAG('COST_CENTER', 'MY_TABLE', 'TABLE')  |
> +-----------------------------------------------------+
> | NULL                                                |
> +-----------------------------------------------------+
> ```

Returns the tag value for the specified table. The tag value is the string component of the `key = 'value'` pair in the tag:

> ```sqlexample
> select system$get_tag('cost_center', 'my_table', 'table');
>
> -----------------------------------------------------+
> | SYSTEM$GET_TAG('COST_CENTER', 'MY_TABLE', 'TABLE') |
> +----------------------------------------------------+
> | sales                                              |
> +----------------------------------------------------+
> ```

Returns the tag value for the specified column:

> ```sqlexample
> select system$get_tag('fiscal_quarter', 'my_table.revenue', 'column');
>
> +----------------------------------------------------------------+
> | SYSTEM$GET_TAG('FISCAL_QUARTER', 'MY_TABLE.REVENUE', 'COLUMN') |
> +----------------------------------------------------------------+
> | Q1                                                             |
> +----------------------------------------------------------------+
> ```

---
title: SYSTEM$GET_TAG_ALLOWED_VALUES
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_tag_allowed_values.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_TAG_ALLOWED_VALUES

Returns a comma-separated list of string values that can be set on a [supported object](../../user-guide/object-tagging/introduction.md), or NULL
to indicate the tag key does not have any specified string values and accepts all [possible](../../user-guide/object-tagging/introduction.md) string
values.

See also:
:   [Set a list of allowed tag values](../../user-guide/object-tagging/work.md) , [TAGS view](../account-usage/tags.md)

## Syntax

```sqlsyntax
SYSTEM$GET_TAG_ALLOWED_VALUES('<name>')
```

## Arguments

`name`
:   The fully-qualified name of the tag key as a string.

## Usage notes

* The role that calls this function must have either the USAGE privilege on the parent database and schema of the tag or the global APPLY
  TAG on ACCOUNT permission.
* Snowflake returns NULL when you pass the SNOWFLAKE.CORE.SEMANTIC_CATEGORY system tag as an argument in the function because there is not
  an allowed values constraint with this tag.

## Examples

Query the allowed tag values for the tag key named `cost_center`, which resides in the database named `governance` and the schema named
`tags`:

> ```sqlexample
> select system$get_tag_allowed_values('governance.tags.cost_center');
> ```

---
title: SYSTEM$GET_TAG_ON_CURRENT_COLUMN
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_tag_on_current_column.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$GET_TAG_ON_CURRENT_COLUMN

Returns the tag string value assigned to the column based upon the specified tag or NULL if a tag is not assigned to the specified column.

When the body of a [masking policy](../../user-guide/security-column-intro.md) or [projection policy](../../user-guide/projection-policies.md)
includes this function, the value of a tag assigned to a column can determine the return value of the policy assigned to that column.

## Syntax

```sqlsyntax
SYSTEM$GET_TAG_ON_CURRENT_COLUMN( '<tag_name>' )
```

## Arguments

`'tag_name'`
:   Identifier for the tag as a string.

    For example, if the tag is named `cost_center` use `'cost_center'` as the argument.

## Usage notes

* Currently, this function can only be used in a masking policy or projection policy condition to dynamically evaluate the tag string value
  set on a column.

  Snowflake returns an error while using the function in either a SELECT query, a row access policy, a view, or a user-defined function
  (UDF).
* Note that this function applies to all table-like objects (e.g. views).
* The tag must exist when calling this system function; otherwise, Snowflake returns the following error message:

  ```none
  Tag '<tag_name>' does not exist or not authorized.
  ```

## Examples

Masking policy
:   For a contextual example on how to use this function with a masking policy, see [Example 2: Protect column data based on the column tag string value](../../user-guide/tag-based-masking-policies.md).

Projection policy
:   When the following projection policy is assigned to a column, the value of the `tags.accounting_col` tag on that column must be
    `public` in order to project the column.

```sqlexample
CREATE PROJECTION POLICY mypolicy
AS () RETURNS PROJECTION_CONSTRAINT ->
CASE
  WHEN SYSTEM$GET_TAG_ON_CURRENT_COLUMN('tags.accounting_col') = 'public'
    THEN PROJECTION_CONSTRAINT(ALLOW => true)
  ELSE PROJECTION_CONSTRAINT(ALLOW => false)
END;
```

---
title: SYSTEM$GET_TAG_ON_CURRENT_TABLE
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_tag_on_current_table.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$GET_TAG_ON_CURRENT_TABLE

Returns the tag string value assigned to the table based upon the specified tag or NULL if a tag is not assigned to the specified table.

Use this function in the [masking policy](../../user-guide/security-column-intro.md) conditions or the
[row access policy](../../user-guide/security-row-intro.md) conditions.

## Syntax

```sqlsyntax
SYSTEM$GET_TAG_ON_CURRENT_TABLE( '<tag_name>' )
```

## Arguments

`'tag_name'`
:   Identifier for the tag as a string.

    For example, if the tag is named `cost_center` use `'cost_center'` as the argument.

## Usage notes

* Currently, this function can only be used in a masking policy or row access policy condition to dynamically evaluate the tag string value
  set on a table.

  Snowflake returns an error while using the function in a SELECT query, view, materialized view, or a user-defined function (UDF).
* Note that this function applies to all table-like objects (e.g. views).
* The tag must exist when calling this system function; otherwise, Snowflake returns the following error message:

  ```none
  Tag '<tag_name>' does not exist or not authorized.
  ```

## Examples

For a contextual example on how to use this function, see [Example 3: Protect a table based on the table tag string value](../../user-guide/tag-based-masking-policies.md).

---
title: SYSTEM$GET_TASK_GRAPH_CONFIG
source: https://docs.snowflake.com/en/sql-reference/functions/system_get_task_graph_config.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$GET_TASK_GRAPH_CONFIG

Returns information from a [task graph](../../user-guide/tasks-graphs.md) configuration.

For information about storing configuration values in a task graph, see [CREATE TASK … CONFIG](../sql/create-task.md).

## Syntax

```sqlsyntax
SYSTEM$GET_TASK_GRAPH_CONFIG( [<configuration_path>] )
```

## Arguments

`configuration_path`
:   Optional path of the configuration value to return.

    Uses the same syntax as Snowflake queries for semi-structured data.
    See [GET_PATH](get_path.md) for more information.

## Examples

The following example creates a task that defines a configuration and then uses
the SYSTEM$GET_TASK_GRAPH_CONFIG function to retrieve values from the configuration.

```sqlexample
CREATE OR REPLACE TASK root_task_with_config
  WAREHOUSE = mywarehouse
  SCHEDULE = '10 m'
  CONFIG = $${"output_dir": "/temp/test_directory/", "learning_rate": 0.1}$$
  AS
  BEGIN
    LET OUTPUT_DIR STRING := SYSTEM$GET_TASK_GRAPH_CONFIG('output_dir')::string;
    LET LEARNING_RATE DECIMAL := SYSTEM$GET_TASK_GRAPH_CONFIG('learning_rate')::DECIMAL;
    ...
  END;
```

### Example: Pass configuration information to another task in a task graph

You can pass configuration information by using a JSON object
that other tasks in a task graph can read.

Use the syntax CREATE/ALTER TASK … CONFIG to set, unset,
or modify the configuration information in the root task.
Then, use the SYSTEM$GET_TASK_GRAPH_CONFIG function to retrieve it.

The following example shows how you can use a JSON object to pass configuration information and
store it in a table:

```sqlexample
CREATE OR REPLACE TASK my_task_root
  SCHEDULE = '1 MINUTE'
  USER_TASK_TIMEOUT_MS = 60000
  CONFIG = $${"environment":"production", "dir":"/my_prod_directory/"}$$
  AS SELECT 1;

CREATE OR REPLACE TASK my_child_task
  USER_TASK_TIMEOUT_MS = 600000
  AFTER my_task_root
  AS
    BEGIN
      LET value := (SELECT SYSTEM$GET_TASK_GRAPH_CONFIG('dir'));
      CREATE TABLE IF NOT EXISTS my_table(name VARCHAR, value VARCHAR);
      INSERT INTO my_table VALUES('my_task_root dir',:value);
    END;
```

---
title: SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER
source: https://docs.snowflake.com/en/sql-reference/functions/system_global_account_set_parameter.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER

Enables replication and failover features for a specified account in an [organization](../../user-guide/organizations.md).

After an [organization administrator](../../user-guide/organization-administrators.md) has called this function, the following features are enabled for the
account:

* [Replication](../../user-guide/account-replication-intro.md)
* [Client Redirect](../../user-guide/client-redirect.md)

Call the SQL function once for each account in your organization for which you are enabling replication and failover features. This
includes each account that you intend to contain a primary or secondary
[replication or failover group](../../user-guide/account-replication-intro.md), database, or
[connection](../../user-guide/client-redirect.md).

## Syntax

```sqlsyntax
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER('<account_identifier>',
  'ENABLE_ACCOUNT_DATABASE_REPLICATION', 'true');
```

## Arguments

`<account_identifier>`
:   Identifier of an account for which you are enabling replication. The preferred format for the identifier is
    `organization_name.account_name`. Though the legacy `account_locator` format is also supported, its use is discouraged as it
    can cause unexpected results when an organization has multiple accounts with the same locator (in different regions).

    Retrieve the set of accounts in your organization using the [SHOW ACCOUNTS](../sql/show-accounts.md) command, which returns
    details about each account, including the organization name, account name, and account locator.

## Usage notes

* Only [organization administrators](../../user-guide/organization-administrators.md) can call this SQL function.
* Multiple accounts can be enabled for replication from the same organization administrator account.
* When replication is enabled for an account using this SQL function,
  the [SHOW REPLICATION ACCOUNTS](../sql/show-replication-accounts.md) output includes the account.
* If you have more than one account with the same account locator in different regions, to enable replication, you must use
  `organization_name.account_name` as the account identifier.

## Examples

The following example enables replication for the `account1` and `account2` accounts in the `myorg` organization:

```sqlexample
SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER('myorg.account1',
  'ENABLE_ACCOUNT_DATABASE_REPLICATION', 'true');

SELECT SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER('myorg.account2',
  'ENABLE_ACCOUNT_DATABASE_REPLICATION', 'true');
```

---
title: SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_hold_privilege_on_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT

Indicates if a privilege has been granted to a Snowflake Native App. For example, providers
can use this function in the setup script to check if the app has the necessary
privileges to create an object.

> **Note:**
>
> This system function can only be called by a Snowflake Native App.

## Syntax

```sqlsyntax
SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT('<privilege_name>')
```

## Arguments

`'privilege_name'`
:   The name of the privilege.

## Returns

* Returns TRUE if the app has been granted the specified privilege. Otherwise,
  returns FALSE.

## Examples

Check if the app has been granted the CREATE COMPUTE POOL privilege:

```sqlexample
SELECT SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT('CREATE COMPUTE POOL');
```

Check if the app has been granted the IMPORTED PRIVILEGES ON SNOWFLAKE DB privilege:

```sqlexample
SELECT SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT('IMPORTED PRIVILEGES ON SNOWFLAKE DB');
```

---
title: SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_initiate_move_organization_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT

Starts the process of moving an [organization account](../../user-guide/organization-accounts.md) to a new region.

See also:
:   [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](system_commit_move_organization_account.md) , [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](system_show_move_organization_account_status.md)

## Syntax

```sqlsyntax
SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT(
    '<temp_name>' ,
    '<region>' ,
    { 'ALL' | '<object> [, <object> ...]' } )
```

## Arguments

`'temp_name'`
:   Specifies a temporary account name by which the organization account in the new region can be identified until the move is finalized. The
    name must start with a letter and can only contain uppercase letters, numbers, and underscores.

    The name of the organization account in the new region changes from this temporary account name to the name of the original organization
    account when the [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](system_commit_move_organization_account.md) function finishes successfully.

`'region'`
:   [Snowflake Region ID](../../user-guide/admin-account-identifier.md) of the region where the organization account will be moved.

`{ 'ALL' | 'object [, object ...]' }`
:   List of objects that will be moved to the organization account in its new region. Because Snowflake uses replication groups to move the
    objects, you can only move objects that are supported by replication groups, which varies depending on your Snowflake edition. For a list
    of objects that can be moved, see [Replicated objects](../../user-guide/account-replication-intro.md).

    To move all objects that can be replicated, specify `ALL`.

## Access control requirements

Only users with the GLOBALORGADMIN role can call this function.

## Usage notes

* You cannot sign in to the organization account in the new region until the initiation process is complete. To check the status of the
  process, call the [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](system_show_move_organization_account_status.md) function.
* After the initiation process completes, you can sign in to the organization account in the new region using its temporary name, but cannot
  execute any SQL statement other than SELECT, USE, and SHOW.

## Examples

```sqlexample
SELECT SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT('TEMP_ACCT', 'aws_us_west_2', 'ALL');
```

---
title: SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_internal_stages_public_access_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS

Checks to see whether public IP addresses are allowed to access the internal stage of the current Snowflake account on Microsoft Azure.

See also:
:   [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_block_internal_stages_public_access.md) , [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_unblock_internal_stages_public_access.md)

## Syntax

> ```sqlsyntax
> SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to internal stages is blocked | Indicates that the Azure settings that control access to the internal stage are currently blocking all public IP addresses. |
| Public Access to internal stages is unblocked | Indicates that at least some public IP addresses can access the internal stage. |

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud are not supported.

## Examples

> ```sqlexample
> USE ROLE accountadmin;
>
> SELECT SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS();
> ```
>
> ```output
> Public Access to internal stages is blocked
> ```

---
title: SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_application_all_mandatory_telemetry_event_definitions_enabled.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED

Indicates that the AUTHORIZE_TELEMETRY_EVENT_SHARING property has been set on the app.

## Syntax

```sqlsyntax
SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED
```

## Returns

* Returns `TRUE` if the AUTHORIZE_TELEMETRY_EVENT_SHARING property is set
  on the app. This indicates that event sharing is allowed in the consumer account.
  Otherwise, returns `FALSE`.

  For more information, see [Determine information about event sharing in the consumer account](../../developer-guide/native-apps/event-develop.md).

---
title: SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_application_authorized_for_telemetry_event_sharing.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING

Indicates that the AUTHORIZE_TELEMETRY_EVENT_SHARING has been set on the app.

## Syntax

```sqlsyntax
SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING()
```

## Returns

* Returns `TRUE` if the AUTHORIZE_TELEMETRY_EVENT_SHARING property is
  set on the app. This indicates that event sharing is allowed in the
  consumer account. Otherwise, returns `FALSE`.

  For more information, see [Determine information about event sharing in the consumer account](../../developer-guide/native-apps/event-develop.md).

---
title: SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_application_installed_from_same_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT

Shows if an app is installed on the same account as the application package it is based on.

See also:
:   [SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER](system_is_application_sharing_events_with_provider.md)

For more information about event sharing, see [Use logging and event tracing for an app](../../developer-guide/native-apps/event-about.md).

## Syntax

```sqlsyntax
SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT()
```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| TRUE | Indicates if an app is installed on the same account as the application package it is based on. |
| FALSE | Indicates if an app is not installed on the same account as the application package it is based on. |

## Access control requirements

* These system functions can only be called from within an app.

## Examples

```sqlexample
SELECT SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT();
```

---
title: SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_application_sharing_events_with_provider.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER

Shows if event sharing is enabled.

See also:
:   [SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT](system_is_application_installed_from_same_account.md)

For more information about event sharing, see [Use logging and event tracing for an app](../../developer-guide/native-apps/event-about.md).

## Syntax

```sqlsyntax
SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER()
```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| TRUE | Indicates that event sharing is enabled on the app and the app has an active event table. |
| FALSE | Indicates that event sharing is not enabled on the app. |

## Access control requirements

* These system functions can only be called from within an app.

## Examples

```sqlexample
SELECT SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER();
```

---
title: SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_global_data_sharing_enabled_for_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT

Specifies whether Cross-Cloud Auto-Fulfillment is enabled or disabled on an account.

See also:
:   [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](system_enable_global_data_sharing_for_account.md), [SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](system_disable_global_data_sharing_for_account.md), [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md)

## Syntax

```sqlsyntax
SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT( '<account_name>' )
```

## Arguments

`account_name`
:   Specifies the account on which you want to determine if Cross-Cloud Auto-Fulfillment is enabled or disabled. To learn more about Snowflake account identifiers and how to locate them, see [Account identifiers](../../user-guide/admin-account-identifier.md).

## Returns

Returns one of the following Boolean values:

* `TRUE` (if Cross-Cloud Auto-Fulfillment is enabled for the current account)
* `FALSE` (if Cross-Cloud Auto-Fulfillment is disabled for the current account)

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this function.

## Examples

The following example determines if Cross-Cloud Auto-Fulfillment is enabled on the account named `my_account`:

```sqlexample
SELECT SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT('my_account');
```

```output
+------------------------------------------------------------------------+
| SYSTEM$SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT('my_account') |
|------------------------------------------------------------------------|
| TRUE                                                                   |
+------------------------------------------------------------------------+
```

---
title: SYSTEM$IS_LISTING_PURCHASED
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_listing_purchased.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_LISTING_PURCHASED

Returns TRUE if the consumer account querying data has purchased the listing, otherwise returns FALSE. If an account is trialing the listing,
the function returns FALSE. Use this system function in a secure view to manage access to the data in a share and display certain data only
to paying customers.

This function infers the listing associated with the database that contains the view and determines whether the account running the query
has purchased the listing.

## Syntax

```sqlsyntax
SYSTEM$IS_LISTING_PURCHASED()
```

## Arguments

None.

## Returns

The function returns a value of type BOOLEAN.

## Example

Create a secure view that selects all columns in a table. The view returns rows only when queried within a consumer account that has
purchased a paid listing:

```sqlexample
CREATE SECURE VIEW paid_view
  AS
  SELECT
    *
  FROM
    paid_table
  WHERE
    SYSTEM$IS_LISTING_PURCHASED();
```

Consumers trialing the paid listing see no rows in this view.

For additional examples, see [Prepare shares for a paid listing](../../collaboration/provider-listings-preparing.md).

---
title: SYSTEM$IS_LISTING_TRIAL
source: https://docs.snowflake.com/en/sql-reference/functions/system_is_listing_trial.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$IS_LISTING_TRIAL

Limits the functionality of a Snowflake Native App based on whether a consumer is trialing the application as part of a [Limited trial listings](../../collaboration/collaboration-listings-about.md) or has access to the full data product.

Returns TRUE if the consumer account is trialing the data product as part of a limited trial listing, otherwise returns FALSE.

Use this system function in a secure view, secure UDF, or Streamlit app to manage access to the functionality
of your Snowflake Native App and display certain output only to consumers with access to the full data product.

> **Caution:**
>
> Do not use this system function to limit access to functionality for consumers trialing a paid listing.
> Instead, use [SYSTEM$IS_LISTING_PURCHASED](system_is_listing_purchased.md).

This function infers the listing associated with the application package that contains the secure view, secure UDF, or Streamlit app,
and determines whether the account running the query is trialing the listing as part of a limited trial listing.
For more details, see [Limit functionality of your Snowflake Native App for trial consumers](../../collaboration/provider-listings-preparing.md).

## Syntax

```sqlsyntax
SYSTEM$IS_LISTING_TRIAL()
```

## Arguments

None.

## Returns

The function returns a value of type BOOLEAN.

## Examples

In this example, create a secure view that returns a subset of rows to trial consumers, but returns all rows to consumers with full access
to your data product. You can control the output of the secure view using this system function and the value of a data column to determine
which data to show to which consumers.

In this example, create a secure view `limited_functionality_view` with your data from a table named `exclusive_access_table`.
In that table, define a BOOLEAN type column, `is_trial`, where some rows of data have `is_trial` set to `TRUE` to indicate that the
data in those rows should be shown to trial consumers. Other rows have `is_trial` set to `FALSE`,
indicating that the data in those rows should be shown only to consumers with full access to your Snowflake Native App.

This example view is set up to return all rows only when it is queried by a consumer account with full access to your Snowflake Native App,
otherwise it returns only the rows where `is_trial` is set to `TRUE`.

```sqlexample
CREATE SECURE VIEW limited_functionality_view
  AS
  SELECT
    *
  FROM
    exclusive_access_table
  WHERE
    is_trial
    OR
    SYSTEM$IS_LISTING_TRIAL() = TRUE;
```

See more examples and details in [Limit functionality of your Snowflake Native App for trial consumers](../../collaboration/provider-listings-preparing.md).

---
title: SYSTEM$LAST_CHANGE_COMMIT_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/system_last_change_commit_time.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LAST_CHANGE_COMMIT_TIME

Returns a token that can be used to detect whether a database table or view changed between two calls to the function.
If the token returned by a call is different from the token returned by a separate call, then the table or view
changed between the two calls, typically due to a DML operation (e.g. an INSERT).

If the specified database object is a view, then at least one of the database objects referenced by the view changed.

## Syntax

```sqlsyntax
SYSTEM$LAST_CHANGE_COMMIT_TIME( '<object_name>'  )
```

## Arguments

`object_name`
:   Specifies the table or view.

## Returns

The data type of the returned value is NUMBER with a scale of 0.

## Usage notes

* The value can be used in applications such as BI tools to determine whether the underlying table data has changed.
  This can be useful for applications that display dashboards and need to figure out whether the dashboard needs to be
  updated based on new data in the table.
* For each DML operation performed on the specified table or underlying tables in the specified view, the returned
  value increases.
* The value returned by the function is typically an approximation of the time that the database object was
  last changed, expressed as the UTC timestamp in nanoseconds since the beginning of the epoch (i.e. since midnight
  January 1, 1970). However, the values are only approximations, in part because the precision and skew of the
  results can vary.

  > **Note:**
  >
  > Snowflake recommends using this value only as a change indicator and strongly discourages users from treating this
  > value as a timestamp.

## Examples

```sqlexample
CALL SYSTEM$LAST_CHANGE_COMMIT_TIME('mytable');

+--------------------------------+
| SYSTEM$LAST_CHANGE_COMMIT_TIME |
|--------------------------------|
|            1661920053987000000 |
+--------------------------------+
```

```sqlexample
SELECT SYSTEM$LAST_CHANGE_COMMIT_TIME('mytable');

+--------------------------------+
| SYSTEM$LAST_CHANGE_COMMIT_TIME |
|--------------------------------|
|            1661920118648000000 |
+--------------------------------+

INSERT INTO mytable VALUES (2,100), (3,300);

SELECT SYSTEM$LAST_CHANGE_COMMIT_TIME('mytable');

+--------------------------------+
| SYSTEM$LAST_CHANGE_COMMIT_TIME |
|--------------------------------|
|            1661920131893000000 |
+--------------------------------+
```

---
title: SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME
source: https://docs.snowflake.com/en/sql-reference/functions/system_link_account_objects_by_name.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME

Adds a global identifier to account objects in the target (current) account that were created using scripts
and that match objects with the same names in the source account.

Global identifiers are only added to account objects that are included in a replication or failover group for the
following object types:

* `RESOURCE_MONITOR`
* `ROLE`
* `USER`
* `WAREHOUSE`

For more information, refer to [Apply global IDs to objects created by scripts in target accounts](../../user-guide/account-replication-config.md).

## Syntax

```sqlsyntax
SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME('<group_name>')
```

## Arguments

`group_name`
:   Specifies the identifier for the replication or failover group.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL function.
* To retain account objects that exist only in the target account, replicate them
  manually in the source account before executing this function.

## Examples

```sqlexample
SELECT SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME('myfg');
```

---
title: SYSTEM$LINK_ORGANIZATION_USER
source: https://docs.snowflake.com/en/sql-reference/functions/system_link_organization_user.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$LINK_ORGANIZATION_USER

Links an [organization user](../../user-guide/organization-users.md) with a user that already exists in the regular account.

When an account administrator adds an organization user group to a regular account, a conflict arises when an organization user in the
group corresponds to a person who already has a user object in the account. This function resolves the conflict and allows the user to be
managed as an organization user going forward.

## Syntax

```sqlsyntax
SYSTEM$LINK_ORGANIZATION_USER( '<local_user>', '<org_user>' )
```

## Arguments

`'local_user'`
:   Name of a user object that exists in the regular account.

`'org_user'`
:   Name of the organization user that corresponds to the same person as `local_user`.

## Usage notes

Linking an organization user to a local user object replaces the EMAIL property of the local user with the EMAIL property of the
organization user.

## Example

```sqlexample
SELECT SYSTEM$LINK_ORGANIZATION_USER('jloeb', 'jloeb');
```

---
title: SYSTEM$LINK_ORGANIZATION_USER_GROUP
source: https://docs.snowflake.com/en/sql-reference/functions/system_link_organization_user_group.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$LINK_ORGANIZATION_USER_GROUP

Links an [organization user group](../../user-guide/organization-users.md) with an access control role that already exists in the regular account.

When an account administrator adds an organization user group to a regular account, a conflict arises if there is an existing role with the
same name as the group. This function resolves the conflict and allows the role to be managed as an organization user group going forward.

## Syntax

```sqlsyntax
SYSTEM$LINK_ORGANIZATION_USER_GROUP( <name> )
```

## Arguments

`name`
:   Name of an organization user group. This matches the name of an existing access control role.

## Usage notes

You can’t link an organization user group to a role that is granted to other roles.

## Examples

```sqlexample
SELECT SYSTEM$LINK_ORGANIZATION_USER_GROUP('marketing_team');
```

---
title: SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES
source: https://docs.snowflake.com/en/sql-reference/functions/system_list_application_restricted_features.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES

Returns a JSON object containing a list of restricted features that the consumer has
allowed a Snowflake Native App to use.

> **Note:**
>
> Currently, only [external and Apache Iceberg™ tables](../../developer-guide/native-apps/preparing-data-content.md) are supported.

## Syntax

```sqlsyntax
SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES( '<app_name>' )
```

## Arguments

`app_name`
:   Name of the Snowflake Native App.

    > **Note:**
    >
    > This argument is ignored when the system function is called by the app.

## Returns

Returns a JSON-formatted string which lists all restricted feature settings allowed for the app.
The JSON-formatted string has the following structure:

```json
"{""external_data"":{""allowed_cloud_providers"":""all""}}"
```

## Usage notes

* When an app runs this system function, the `app_name` parameter is not required and is ignored if provided.
  In this context, all the apps restricted features are listed.
* When a provider or consumer runs this system function, `app_name` parameter is required and
  lists the restricted features of the app and whether they are enabled or not.

## Examples

To call the function:

```sqlexample
SELECT SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES('hello_snowflake_app');
```

Sample output:

```json
[
    {"external_data":{"allowed_cloud_providers":"all"}}
]
```

---
title: SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG
source: https://docs.snowflake.com/en/sql-reference/functions/system_list_iceberg_tables_from_catalog.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG

Lists tables in a remote Apache Iceberg™ REST catalog (including [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview)).

See also:
:   * [Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake](../../user-guide/tables-iceberg-open-catalog.md)
    * [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md)
    * [CREATE ICEBERG TABLE (Iceberg REST catalog)](../sql/create-iceberg-table-rest.md)

## Syntax

```sqlsyntax
SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG( '<catalog_integration_name>'
  [ , '<parent_namespace>', <levels> ] )
```

## Arguments

**Required:**

`catalog_integration_name`
:   Identifier for the catalog integration for [Iceberg REST](../sql/create-catalog-integration-rest.md) or
    [Snowflake Open Catalog](../../user-guide/tables-iceberg-configure-catalog-integration-open-catalog.md).

**Optional:**

`parent_namespace`
:   The identifier of the namespace from which to start listing tables. To retrieve
    results for the 0th namespace level in the catalog, specify an empty string (`''`).

    Default: The default namespace for the catalog integration (`CATALOG_NAMESPACE`), if specified. If you don’t specify a default
    namespace at the catalog integration level, the default is the 0th namespace level in the catalog. To list tables when the default is the
    0th namespace, you must specify an empty string (`CATALOG_NAMESPACE`) and the `<levels>` parameter.

`levels`
:   Specifies the number of levels to traverse in the namespace hierarchy for listing tables.

    For example:

    * If set to 0, the function returns all of the tables recursively, relative to the `parent_namespace`.
    * If set to 1, the function returns all of the tables within the `parent_namespace`.
    * If set to *n*, the function returns tables up to *n* levels deep, relative to the `parent_namespace`.

    Default: 1

## Returns

Returns a JSON-formatted string which lists tables in the Iceberg REST catalog for the specified
namespace and number of levels.

The JSON-formatted string has the following structure:

```json
[
  {
    "namespace": "<namespace_identifier>",
    "name": "<table_name>"
  },
  {
    "namespace": "<namespace_identifier>",
    "name": "<table_name_n>"
  },
]
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration (catalog) |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

List only the tables in the default catalog namespace:

```sqlexample
SELECT SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG('myCatalogIntegration');
```

List *every* table in the catalog:

```sqlexample
SELECT SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG('myCatalogIntegration', '', 0);
```

List all of the tables recursively under the `db1` namespace:

```sqlexample
SELECT SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG('myCatalogIntegration', 'db1', 0);
```

List all of the tables three levels under the `db1` namespace:

```sqlexample
SELECT SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG('myCatalogIntegration', 'db1', 3);
```

---
title: SYSTEM$LIST_NAMESPACES_FROM_CATALOG
source: https://docs.snowflake.com/en/sql-reference/functions/system_list_namespaces_from_catalog.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LIST_NAMESPACES_FROM_CATALOG

Lists the namespaces in a remote Apache Iceberg™ REST catalog (including [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview)).

See also:
:   * [Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake](../../user-guide/tables-iceberg-open-catalog.md)
    * [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md)

## Syntax

```sqlsyntax
SYSTEM$LIST_NAMESPACES_FROM_CATALOG( '<catalog_integration_name>'
  [ , '<parent_namespace>', <levels> ] )
```

## Arguments

**Required:**

`catalog_integration_name`
:   Identifier for the catalog integration for [Iceberg REST](../sql/create-catalog-integration-rest.md) or
    [Snowflake Open Catalog](../../user-guide/tables-iceberg-configure-catalog-integration-open-catalog.md).

**Optional:**

`parent_namespace`
:   The identifier of the namespace from which to start listing namespaces.

    If `CATALOG_NAMESPACE` is defined at the catalog integration level, to retrieve results for the 0th namespace level in the
    catalog, specify an empty string (`''`).

    If `CATALOG_NAMESPACE` is only defined at the table level, the results for the 0th namespace level are returned by default, so
    you don’t need to specify an empty string (`''`).

    Default:

    > * If `CATALOG_NAMESPACE` is defined at the catalog integration level, the namespace for the catalog integration.
    > * If `CATALOG_NAMESPACE` is only defined at the table level, you retrieve results for the 0th namespace level in the catalog.

`levels`
:   Specifies the number of levels to traverse in the namespace hierarchy for listing child namespaces.

    For example:

    * If set to 0, the function returns all of the namespaces, recursively, relative to the `parent_namespace`.
    * If set to 1, the function returns all of the namespaces one level under the `parent_namespace`.
    * If set to *n*, the function returns namespaces up to *n* levels deep, relative to the `parent_namespace`.

    Default: 1

## Returns

Returns a JSON-formatted string which lists namespaces in the Iceberg REST catalog for the specified
parent namespace and number of levels.

The JSON-formatted string has the following structure:

```json
[
  "<namespace_identifier>",
  "<namespace_identifier_n>"
]
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration (catalog) |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

List only the namespaces directly under the default namespace of the catalog integration:

```sqlexample
SELECT SYSTEM$LIST_NAMESPACES_FROM_CATALOG('my_catalog_integration');
```

List all namespaces recursively in the catalog:

```sqlexample
SELECT SYSTEM$LIST_NAMESPACES_FROM_CATALOG('my_catalog_integration', '', 0);
```

List only the namespaces one level under (directly under) the ‘’db1’’ namespace:

```sqlexample
SELECT SYSTEM$LIST_NAMESPACES_FROM_CATALOG('my_catalog_integration', 'db1');
```

List the namespaces three levels under the ‘’db1’’ namespace:

```sqlexample
SELECT SYSTEM$LIST_NAMESPACES_FROM_CATALOG('my_catalog_integration', 'db1', 3);
```

---
title: SYSTEM$LOCATE_DBT_ARCHIVE
source: https://docs.snowflake.com/en/sql-reference/functions/system_locate_dbt_archive.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LOCATE_DBT_ARCHIVE

Returns the URL from which you can retrieve zipped dbt run artifacts for a specified dbt project.

Use this function with the [DBT_PROJECT_EXECUTION_HISTORY](dbt_project_execution_history.md) function to access dbt artifacts and logs programmatically.

## Syntax

```sqlsyntax
SYSTEM$LOCATE_DBT_ARCHIVE ( '<query_id>' )
```

## Arguments

`query_id`
:   The query ID of the dbt project run whose files you want to locate.

## Returns

This function returns the URL from which you can retrieve the zipped contents of the results of a specified dbt Project.

For more information and examples, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

## Access control requirements

This function includes only runs from workspaces and dbt Projects in which you have the following privileges:

* OWNERSHIP, READ, or WRITE on workspaces
* OWNERSHIP, USAGE, or MONITOR on dbt Projects

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This system function works only on dbt project objects; it isn’t available for workspaces.
* Query IDs generated from CREATE DBT PROJECT or ALTER DBT PROJECT … ADD VERSION aren’t supported for this system function.
* Direct querying of file content (for example, [Query examples](../../user-guide/querying-stage.md)) isn’t supported.
* If `query_id` is NULL or not a dbt execution, you’ll get an error.
* dbt project results are available for up to 14 days.
* Files might be unavailable if a run times out, is canceled, or fails before they are uploaded. In such cases, runs appear as `UNHANDLED ERROR` in dbt history.
* You can’t use this function to get logs for runs that are in progress because the logs file is only available after the run in complete.

## Examples

The following example returns the `snow://` URL of the zipped artifacts (for example, `dbt_artifacts.zip`) for the specified execution.

You can use this URL with GET to download the ZIP file (or COPY FILES to move it to your own stage). For the folder path instead of the ZIP, use
[SYSTEM$LOCATE_DBT_ARTIFACTS](system_locate_dbt_artifacts.md).

```sqlexample
SELECT SYSTEM$LOCATE_DBT_ARCHIVE($latest_query_id);
```

For more information, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

---
title: SYSTEM$LOCATE_DBT_ARTIFACTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_locate_dbt_artifacts.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LOCATE_DBT_ARTIFACTS

Returns the location of artifacts from a specified dbt Project run (for example, `manifest.json`).

Use this function with the [DBT_PROJECT_EXECUTION_HISTORY](dbt_project_execution_history.md) function to access dbt artifacts and logs programmatically.

## Syntax

```sqlsyntax
SYSTEM$LOCATE_DBT_ARTIFACTS ( '<query_id>' )
```

## Arguments

`query_id`
:   The query ID of the dbt project run whose files you want to locate.

## Returns

The function returns the file path for dbt Project artifacts from a run (for example, `snow://dbt/DBTEST.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf3f5a-010b-4d87-0000-53493abb7cce/`).

For more information and examples, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

## Access control requirements

This function includes only runs from workspaces and dbt Projects in which you have the following privileges:

* OWNERSHIP, READ, or WRITE on workspaces
* OWNERSHIP, USAGE, or MONITOR on dbt Projects

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This system function works only on dbt project objects; it isn’t available for workspaces.
* Query IDs generated from CREATE DBT PROJECT or ALTER DBT PROJECT … ADD VERSION aren’t supported for this system function.
* Direct querying of file content (for example, [Query examples](../../user-guide/querying-stage.md)) isn’t supported.
* If `query_id` is NULL or not a dbt execution, you’ll get an error.
* dbt project results are available for up to 14 days.
* Files might be unavailable if a run times out, is canceled, or fails before they are uploaded. In such cases, runs appear as `UNHANDLED ERROR` in dbt history.
* You can’t use this function to get logs for runs that are in progress because the logs file is only available after the run in complete.

## Examples

To view the stage path where Snowflake stored the dbt Project run’s artifacts (that is, the results folder for that execution), use the SYSTEM$LOCATE_DBT_ARTIFACTS function, as shown in the following
example. You can then use that path with `GET` or `COPY FILES` or the Snowflake CLI to download things like `manifest.json`, compiled SQL, or logs.

```sqlexample
--Look up the most recent dbt Project execution
SET latest_query_id = (SELECT query_id
  FROM TABLE(INFORMATION_SCHEMA.DBT_PROJECT_EXECUTION_HISTORY())
  WHERE OBJECT_NAME = 'MY_DBT_PROJECT'
  ORDER BY query_end_time DESC LIMIT 1);

--Get the dbt run logs for the most recent dbt Project execution
SELECT SYSTEM$GET_DBT_LOG($latest_query_id);
```

```output
============================== 15:14:53.100781 | 46d19186-61b8-4442-8339-53c771083f16 ==============================
[0m15:14:53.100781 [info ] [Dummy-1   ]: Running with dbt=1.9.4
...
[0m15:14:58.198545 [debug] [Dummy-1   ]: Command `cli run` succeeded at 15:14:58.198121 after 5.19 seconds
```

```sqlexample
--Get the location of the dbt Project archive ZIP file (see all files)
SELECT SYSTEM$LOCATE_DBT_ARTIFACTS($latest_query_id);
```

```output
+-------------------------------------------------------------------------------------------------+
| SYSTEM$LOCATE_DBT_ARTIFACTS($LATEST_QUERY_ID)                                                   |
+-------------------------------------------------------------------------------------------------+
| snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01c01096-010c-0ccb-0000-a99506bd199e/ |
+-------------------------------------------------------------------------------------------------+
```

```sqlexample
--List all the files of the retrieved dbt run
ls 'snow://dbt/TESTDBT.PUBLIC.MY_DBT_PROJECT/results/query_id_01bf3f5a-010b-4d87-0000-53493abb7cce/';
```

You can also create a fresh internal stage, locate the Snowflake-managed path for the specified dbt Project run’s artifacts, and copy those artifacts into your stage for retrieval, as shown in the
following example:

```sqlexample
CREATE OR REPLACE STAGE my_dbt_stage ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
```

For more information, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md).

---
title: SYSTEM$LOG, SYSTEM$LOG_<level> (for Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/functions/system_log.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$LOG, SYSTEM$LOG_<level> (for Snowflake Scripting)

Logs a message at the specified severity level.

## Syntax

```sqlsyntax
SYSTEM$LOG('<level>', <message>);

SYSTEM$LOG_TRACE(<message>);
SYSTEM$LOG_DEBUG(<message>);
SYSTEM$LOG_INFO(<message>);
SYSTEM$LOG_WARN(<message>);
SYSTEM$LOG_ERROR(<message>);
SYSTEM$LOG_FATAL(<message>);
```

## Arguments

`'level'`
:   The severity level at which to log the message. You can specify one of the following strings:

    * ‘trace’
    * ‘debug’
    * ‘info’
    * ‘warn’
    * ‘error’
    * ‘fatal’

`message`
:   An expression that resolves to the message to log. If the message is not a string, the function converts the message to a string.

## Examples

Code in the following example uses the SYSTEM$LOG function to log messages at each of the supported levels. Note that a message logged
from code that processes an input row will be logged *for every row* processed by the handler. If the handler is executed in a large table,
this can result in a large number of messages in the event table.

```sqlexample
-- The following calls are equivalent.
-- Both log information-level messages.
SYSTEM$LOG('info', 'Information-level message');
SYSTEM$LOG_INFO('Information-level message');

-- The following calls are equivalent.
-- Both log error messages.
SYSTEM$LOG('error', 'Error message');
SYSTEM$LOG_ERROR('Error message');

-- The following calls are equivalent.
-- Both log warning messages.
SYSTEM$LOG('warning', 'Warning message');
SYSTEM$LOG_WARN('Warning message');

-- The following calls are equivalent.
-- Both log debug messages.
SYSTEM$LOG('debug', 'Debug message');
SYSTEM$LOG_DEBUG('Debug message');

-- The following calls are equivalent.
-- Both log trace messages.
SYSTEM$LOG('trace', 'Trace message');
SYSTEM$LOG_TRACE('Trace message');

-- The following calls are equivalent.
-- Both log fatal messages.
SYSTEM$LOG('fatal', 'Fatal message');
SYSTEM$LOG_FATAL('Fatal message');
```

---
title: SYSTEM$MIGRATE_SAML_IDP_REGISTRATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_migrate_saml_idp_registration.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$MIGRATE_SAML_IDP_REGISTRATION

Migrates an existing SAML identity provider (i.e. IdP) configuration as defined by the account parameter [SAML_IDENTITY_PROVIDER](../parameters.md) to a security integration.

If the account parameter SAML_IDENTITY_PROVIDER is present, SYSTEM$MIGRATE_SAML_IDP_REGISTRATION creates a new security integration using the data in the SAML_IDENTITY_PROVIDER parameter.

If the SAML_IDENTITY_PROVIDER account parameter is not present, the function fails. If this occurs, create a security integration where `TYPE = SAML2` as shown in [CREATE SECURITY INTEGRATION](../sql/create-security-integration-saml2.md).

## Syntax

```sqlsyntax
SYSTEM$MIGRATE_SAML_IDP_REGISTRATION( '<integration_name>', '<issuer>' )
```

## Arguments

`integration_name`
:   Name of the new SAML2 security integration that will be created by the function.

    Note that the entire name must be enclosed in single quotes.

    Required.

`issuer`
:   The EntityID / Issuer of the IdP.

    The entire name must be enclosed in single quotes.

    Required if not specified in the SAML_IDENTITY_PROVIDER parameter as the `Issuer` attribute.

    > **Important:**
    >
    > If the SAML_IDENTITY_PROVIDER parameter does not contain a value for `Issuer`, use your IdP’s metadata to locate the exact
    > value. Depending on the IdP, you might be able to locate the `issuer` value through the user interface administrator settings,
    > a URL your IdP provides, or by downloading the SAML federation metadata XML to a local file.
    >
    > As a representative example, the following references detail how to locate the `issuer` value for Okta and Microsoft Entra ID:
    >
    > * [Okta SAML Settings](https://developer.okta.com/docs/guides/build-sso-integration/saml2/specify-your-settings/)
    > * [Microsoft Entra ID integration with Snowflake](https://docs.microsoft.com/en-us/azure/active-directory/saas-apps/snowflake-tutorial)

## Examples

The commands below provide an example of how you can migrate an existing IdP configuration:

```sqlexample
SELECT SYSTEM$MIGRATE_SAML_IDP_REGISTRATION('my_fed_integration', 'http://my_idp.com');
```

Output:

```output
+---------------------------------------------------------------------------------+
| SYSTEM$MIGRATE_SAML_IDP_REGISTRATION('MY_FED_INTEGRATION', 'HTTP://MY_IDP.COM') |
+---------------------------------------------------------------------------------+
| SUCCESS : [MY_FED_INTEGRATION] Fed SAML integration created                     |
+---------------------------------------------------------------------------------+
```

To view details about your migrated IdP, you can use the `DESCRIBE` command:

```sqlexample
DESC INTEGRATION my_fed_integration;
```

---
title: SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS
source: https://docs.snowflake.com/en/sql-reference/functions/system_opt_in_internal_stage_network_logs.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS

Starts record collection of network access attempts to internal stage locations for this account. You can view these records in the
[INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](../account-usage/internal_stage_network_access_history.md).

See also:
:   [SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS](system_opt_out_internal_stage_network_logs.md)

## Syntax

```sqlsyntax
SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS()
```

## Arguments

None.

## Returns

Returns a VARCHAR status message, which states that record collection of network access attempts to internal stage locations has been enabled.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can execute this function.

## Usage notes

Latency between running this function and record collection is up to 6 hours.

## Example

Start record collection of network access attempts to internal stage locations for this account:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS();
```

```output
+-------------------------------------------------------------------+
| Record collection has been successfully enabled for this account. |
+-------------------------------------------------------------------+
```

---
title: SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS
source: https://docs.snowflake.com/en/sql-reference/functions/system_opt_out_internal_stage_network_logs.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS

Stops record collection of network access attempts to internal stage locations for this account. You can view these records in the
[INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](../account-usage/internal_stage_network_access_history.md).

See also:
:   [SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS](system_opt_in_internal_stage_network_logs.md)

## Syntax

```sqlsyntax
SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS()
```

## Arguments

None.

## Returns

Returns a VARCHAR status message, which states that record collection of network access attempts to internal stage locations has ended.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can execute this function.

## Usage notes

Latency between running this function and stopping record collection is up to 6 hours.

## Example

Stop record collection of network access attempts to internal stage locations for this account:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS();
```

```output
+--------------------------------------------------------------------+
| Record collection has been successfully disabled for this account. |
+--------------------------------------------------------------------+
```

---
title: SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY
source: https://docs.snowflake.com/en/sql-reference/functions/system_opt_out_malicious_ip_protection_by_category.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY

Disables [Malicious IP Protection](../../user-guide/malicious-ip-protection.md) for one or more curated IP categories in the current account.
Use this function to allow traffic that would otherwise be blocked based on its category. For example, you can opt out of blocking IP
addresses that are categorized as low-risk and opt out of blocking the addresses for a specific user.

## Syntax

```sqlsyntax
SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY( '<category>,<category>' [ , '<user_name>' ] )
```

## Arguments

**Required:**

`'category'`
:   A case-insensitive string that identifies the curated IP category you want to opt out.

    Allowed values:

    * `'ANONYMOUS_VPN'`
    * `'ANONYMOUS_PROXIES'`
    * `'MALICIOUS_BEHAVIOR'`
    * `'TOR_EXITS'`
    * `''`

    For definitions of these categories, see [Malicious IP Protection](../../user-guide/malicious-ip-protection.md).

**Optional:**

`'user_name'`
:   A string that specifies the user name to opt out of Malicious IP Protection for the specified category.
    If no user is provided, the opt-out applies to all users in the account.

## Returns

Returns a VARCHAR status message that indicates that Malicious IP Protection was disabled for the specified category
and user, if provided, in the account.

## Access control requirements

Only account administrators, which are users with the ACCOUNTADMIN role, can execute this function.

## Usage notes

* Changes can take time to propagate across the Snowflake control plane.
* Opting out reduces protection for the selected traffic category. Use this function with discretion and review it regularly.
* Each call of this function overwrites results of the previous call.
* To re-enable blocking for all categories, pass in `''` as the `category` argument.
* To review blocked connection attempts, query [LOGIN_HISTORY view](../account-usage/login_history.md). Find rows with
  `IS_SUCCESS = 'NO'`, `ERROR_CODE = 390422`, and `ERROR_MESSAGE = 'INCOMING_REQUEST_BLOCKED'`. For those rows, review the output in the
  LOGIN_DETAILS column.

## Examples

Disable protection for all IPs in the `ANONYMOUS_VPN` and `MALICIOUS_BEHAVIOR` categories:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('ANONYMOUS_VPN,MALICIOUS_BEHAVIOR');
```

```output
+-------------------------------------------------------------------------------------------------+
| Successfully set malicious IP protection opt-out categories to ANONYMOUS_VPN,MALICIOUS_BEHAVIOR |
+-------------------------------------------------------------------------------------------------+
```

Disable protection for a specific user in the `ANONYMOUS_VPN` category:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('ANONYMOUS_VPN', 'JSMITH');
```

```output
+----------------------------------------------------------------------------------------------+
| Successfully set malicious IP protection opt-out categories to ANONYMOUS_VPN for user JSMITH |
+----------------------------------------------------------------------------------------------+
```

Re-enable protection for a specific user:

```sqlexample
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY('', 'JSMITH');
```

```output
+---------------------------------------------------------------------------------+
| Successfully cleared malicious IP protection opt-out categories for user JSMITH |
+---------------------------------------------------------------------------------+
```

---
title: SYSTEM$PIPE_FORCE_RESUME
source: https://docs.snowflake.com/en/sql-reference/functions/system_pipe_force_resume.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$PIPE_FORCE_RESUME

Forces a pipe paused using [ALTER PIPE](../sql/alter-pipe.md) to resume. This is necessary in either of the following scenarios:

* The pipe owner transfers ownership of the pipe to another role while the pipe is paused.
* The paused pipe is allowed to become stale.

  A pipe is considered stale when it is paused for longer than the limited retention period for event messages received for the pipe
  (14 days by default). As each notification reaches the end of this period, Snowflake schedules it to be dropped from the internal
  metadata. If the pipe is later resumed, Snowpipe may process notifications older than 14 days on a best effort basis. Snowflake cannot
  guarantee that these older notifications are processed.

  This scenario only pertains to pipe objects that leverage cloud messaging to trigger data loads (i.e. where `AUTO_INGEST = TRUE` in
  the pipe definition).

Executing this function resumes the specified pipe.

To determine how many files are queued, query [SYSTEM$PIPE_STATUS](system_pipe_status.md).

For more information, see [Snowpipe](../../user-guide/data-load-snowpipe-intro.md).

## Syntax

```sqlsyntax
SYSTEM$PIPE_FORCE_RESUME( '<pipe_name>' , '[ STALENESS_CHECK_OVERRIDE ] [ , OWNERSHIP_TRANSFER_CHECK_OVERRIDE ]')
```

## Arguments

`pipe_name`
:   Pipe to resume running.

`STALENESS_CHECK_OVERRIDE`
:   Specifies to resume a stale pipe. A pipe is considered stale when it is paused for longer than the limited retention period for event
    messages received for the pipe (14 days by default).

    > **Note:**
    >
    > This argument only pertains to pipe objects that leverage cloud messaging to trigger data loads.

`OWNERSHIP_TRANSFER_CHECK_OVERRIDE`
:   Specifies to resume a pipe after ownership of the pipe was transferred to another role.

    > **Note:**
    >
    > To ensure backward compatibility, passing `pipe_name` as the only input is syntactically equivalent to passing both
    > `pipe_name` and `OWNERSHIP_TRANSFER_CHECK_OVERRIDE`.

If both `STALENESS_CHECK_OVERRIDE` and `OWNERSHIP_TRANSFER_CHECK_OVERRIDE` are required, these arguments can be input in either
order.

## Usage notes

* Only the pipe owner (i.e. the role with the OWNERSHIP privilege on the pipe) or a role with the OPERATE privilege on the pipe can call this SQL
  function:

  SQL operations on schema objects also require the USAGE privilege on the database and schema that contain the object.
* `pipe_name` is a string so it must be enclosed in single quotes:

  + Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified), i.e. `'<db>.<schema>.<pipe_name>'`.
  + If the pipe name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<pipe_name>"'`.

## Examples

Force a pipe with a case-insensitive name to resume:

> ```sqlexample
> SELECT SYSTEM$PIPE_FORCE_RESUME('mydb.myschema.mypipe');
> ```

Force a pipe with a case-sensitive name to resume:

> ```sqlexample
> SELECT SYSTEM$PIPE_FORCE_RESUME('mydb.myschema."myPipe"');
> ```

Force a stale pipe to resume after its ownership was transferred to another role:

> ```sqlexample
> SELECT SYSTEM$PIPE_FORCE_RESUME('mydb.myschema.stalepipe','staleness_check_override, ownership_transfer_check_override');
> ```

---
title: SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL
source: https://docs.snowflake.com/en/sql-reference/functions/system_pipe_rebinding_with_notification_channel.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL

Retries the notification channel binding process when a replicated pipe has not been successfully bound to a notification channel during replication time. Binding can be unsuccessful for one of the following reasons:

* The cloud messaging is not correctly set up in the secondary deployment during replication. For example, a notification integration with the same name is not created manually, or SNS policy is not set to allow subscription, etc.
* There is a cloud provider error when Snowpipe tries to bind the pipe to the notification channel.
* The pipe and its source stage are in different replication groups, and the stage is not replicated when the pipe is replicated.

You can also retry the notification binding by refreshing the replication group or database. However, if the primary account is down, or a failover has already completed, the only option is to call this system function.

For more information, see [Snowpipe](../../user-guide/data-load-snowpipe-intro.md) and [Stage, pipe, and load history replication](../../user-guide/account-replication-stages-pipes-load-history.md).

## Syntax

```sqlsyntax
SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL( '<pipe_name>' )
```

## Arguments

`'pipe_name'`
:   The name of the pipe that needs to go through the rebind notification process.

## Access control requirements

* Only the pipe owner (that is, the role with the OWNERSHIP privilege on the pipe) or a role with the OPERATE privilege on the pipe can call this SQL
  function.

  Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Usage notes

* `pipe_name` is a string so it must be enclosed in single quotes:

  + Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully qualified), that is, `'db.schema.pipe_name'`.
  + If the pipe name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, that is, `'"pipe_name"'`.

## Examples

Retries the notification channel binding process for `mypipe`:

```sqlexample
SELECT SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL('mydb.myschema.mypipe');
```

---
title: SYSTEM$PIPE_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_pipe_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$PIPE_STATUS

Retrieves a JSON representation of the current status of a pipe.

For more information, see [Snowpipe](../../user-guide/data-load-snowpipe-intro.md).

## Syntax

```sqlsyntax
SYSTEM$PIPE_STATUS( '<pipe_name>' )
```

## Arguments

`pipe_name`
:   Pipe for which you want to retrieve the current status.

## Usage notes

* Returns results only for the pipe owner (the role with the OWNERSHIP privilege on the pipe) or a role with the MONITOR privilege on
  the pipe.
* `pipe_name` is a string so it must be enclosed in single quotes:

  + Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully qualified): `'<db>.<schema>.<pipe_name>'`.
  + If the pipe name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<pipe_name>"'`.
* The `oldestPendingFilePath` and `oldestFileTimestamp` fields are not available in the JSON output if `pendingFileCount` is `0`, as these fields only appear when there are files queued for ingestion.

## Output

The function returns a JSON object containing the following name/value pairs (if applicable to the current pipe status):

> {“executionState”:”<value>”,”oldestPendingFilePath”:”<value>”,”oldestFileTimestamp”:<value>,”pendingFileCount”:<value>,”lastPipeErrorTimestamp”:”<value>”,”lastPipeFaultTimestamp”:”<value>”,”lastIngestedTimestamp”:”<value>”,”lastIngestedFilePath”:”<value>”,”notificationChannelName”:”<value>”,”numOutstandingMessagesOnChannel”:<value>,”lastReceivedMessageTimestamp”:”<value>”,”lastForwardedMessageTimestamp”:”<value>”,”error”:<value>,”fault”:<value>,”lastPulledFromChannelTimestamp”:”<value>”,”lastForwardedFilePath”:”<value>”,”loadHistoryRemainingEntriesToSync”:”<value>”, “oldestPendingHistoryRefreshJobCreationTime”:”<value>”, “pendingHistoryRefreshJobsCount”:”<value>”}

Where:

> `executionState`
> :   Current execution state of the pipe. The value could be any one of the following:
>
>     * `FAILING_OVER` (the pipe is in the process of failing over from primary to secondary account)
>     * `PAUSED`
>     * `READ_ONLY` (the pipe or the target table is in a secondary read-only database.) A pipe in a secondary database is
>       read only until you promote the secondary database to primary. For more information, see [Pipes in secondary databases](../../user-guide/account-replication-stages-pipes-load-history.md).
>     * `RUNNING` (everything is normal; Snowflake may or may not be actively processing files for this pipe)
>     * `STOPPED_BY_SNOWFLAKE_ADMIN` (the pipe is stopped by [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). The pipe will not accept new files for ingestion. )
>     * `STOPPED_CLONED` (the pipe is contained by a database or schema clone)
>     * `STOPPED_FEATURE_DISABLED`
>     * `STOPPED_STAGE_ALTERED` (the pipe is stopped because the underlying stage location has been altered.)
>     * `STOPPED_STAGE_DROPPED`
>     * `STOPPED_FILE_FORMAT_DROPPED`
>     * `STOPPED_NOTIFICATION_INTEGRATION_DROPPED`
>     * `STOPPED_MISSING_PIPE`
>     * `STOPPED_MISSING_TABLE` (the target table defined in the pipe definition is dropped)
>     * `STALLED_COMPILATION_ERROR`
>     * `STALLED_INITIALIZATION_ERROR`
>     * `STALLED_EXECUTION_ERROR`
>     * `STALLED_INTERNAL_ERROR`
>     * `STALLED_STAGE_PERMISSION_ERROR` (an external stage permission error is detected.)
>
> `oldestPendingFilePath`
> :   Path to the oldest data file currently queued for processing. The timestamp when the file was added to the queue is returned in the existing oldestFileTimestamp property.
>
> `oldestFileTimestamp`
> :   Earliest timestamp among data files currently queued (if applicable), where the timestamp is set when the file is added to the queue.
>
> `pendingFileCount`
> :   Number of files queued for loading by the pipe.
>
>     This count can increase even if a pipe is paused. Depending on the `AUTO_INGEST` setting for the pipe, the number of queued
>     files can increase as follows:
>
>     `AUTO_INGEST = TRUE`
>     :   Files added to the cloud storage bucket or container trigger new file event notifications for the pipe.
>
>         Note that if a paused pipe becomes [stale](../../user-guide/data-load-snowpipe-manage.md), the `pendingFileCount` count ignores
>         any event notifications older than the limited retention period.
>
>     `AUTO_INGEST = FALSE`
>     :   Calls to the `insertFiles` REST endpoint trigger files to be queued for loading by the pipe.
>
> `lastPipeErrorTimestamp`
> :   Timestamp when compiling the COPY INTO <table> statement in the pipe definition for execution last produced an error.
>
> `lastPipeFaultTimestamp`
> :   Timestamp when an internal Snowflake process error was last detected.
>
> `lastIngestedTimestamp`
> :   Timestamp when the most recent file was loaded successfully by Snowpipe into the destination table.
>
> `lastIngestedFilePath`
> :   Path of the file loaded at the timestamp specified in lastIngestedTimestamp.
>
> `notificationChannelName`
> :   Amazon SQS queue or Microsoft Azure storage queue associated with the pipe.
>
> `numOutstandingMessagesOnChannel`
> :   Number of messages in the queue that have been queued but not received yet.
>
> `lastReceivedMessageTimestamp`
> :   Timestamp of the last message received from the queue. Note that this message might not apply to the specific pipe (e.g., if the path/prefix associated with the message does not match the path/prefix in the pipe definition). In addition, only messages triggered by created data objects are consumed by auto-ingest pipes.
>
> `lastForwardedMessageTimestamp`
> :   Timestamp of the last “create object” event message with a matching path/prefix that was forwarded to the pipe.
>
> `channelErrorMessage`
> :   Error message produced when attempting to read messages from the associated Google Cloud Pub/Sub queue or Microsoft Azure Event
>     Grid storage queue.
>
> `lastErrorRecordTimestamp`
> :   Timestamp of last channel error message (i.e. error message reported in the `channelErrorMessage` value).
>
> `error`
> :   Error message produced when the pipe was last compiled for execution (if applicable); often caused by problems accessing the necessary objects (i.e. table, stage, file format) due to permission problems or dropped objects.
>
> `fault`
> :   Most recent internal Snowflake process error (if applicable). Used primarily by Snowflake for debugging purposes.
>
> `lastPulledFromChannelTimestamp`
> :   Timestamp when Snowpipe last pulled “create object” event notifications for the pipe from the Amazon Simple Queue Service (SQS) queue, Google Pub/Sub queue, or Microsoft Azure storage queue.
>
>     This value applies to auto-ingest Snowpipe loads only.
>
> `lastForwardedFilePath`
> :   Path of the data file identified in the last “create object” event message that was forwarded to the pipe.
>
> `loadHistoryRemainingEntriesToSync`
> :   Number of remaining load history entries to be replicated. When a pipe fails over, load history entries might continue to be replicated for the pipe, ensuring that changes from the last refresh operation are up to date. You can use this field to monitor the progress of load history replication for a pipe.
>
> `oldestPendingHistoryRefreshJobCreationTime`
> :   Timestamp of the oldest pending refresh job creation time, displayed only when a pending job exists.
>
> `pendingHistoryRefreshJobsCount`
> :   Total number of pending history refresh jobs for non-table-owned pipes. Displays `0` if no jobs are pending.

## Examples

Retrieve the status for a pipe with a case-insensitive name:

> ```sqlexample
> SELECT SYSTEM$PIPE_STATUS('mydb.myschema.mypipe');
>
> +---------------------------------------------------+
> | SYSTEM$PIPE_STATUS('MYDB.MYSCHEMA.MYPIPE')        |
> |---------------------------------------------------|
> | {"executionState":"RUNNING","pendingFileCount":0} |
> +---------------------------------------------------+
> ```

Retrieve the status for a pipe with a case-sensitive name:

> ```sqlexample
> SELECT SYSTEM$PIPE_STATUS('mydb.myschema."myPipe"');
>
> +---------------------------------------------------+
> | SYSTEM$PIPE_STATUS('MYDB.MYSCHEMA."MYPIPE"')      |
> |---------------------------------------------------|
> | {"executionState":"RUNNING","pendingFileCount":0} |
> +---------------------------------------------------+
> ```

---
title: SYSTEM$PROVISION_PRIVATELINK_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/system_provision_privatelink_endpoint.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$PROVISION_PRIVATELINK_ENDPOINT

Provisions a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external service by using private
connectivity. The endpoint can be a service endpoint or a resource endpoint depending on the cloud platform that hosts your Snowflake
account.

> **Note:**
>
> If the Snowflake account is in an Azure government region, the provider resource ID must be the ID of a resource in a government
> subscription. For more information about government regions for Snowflake customers, see [U.S. SnowGov Regions](../../user-guide/intro-regions.md).

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
>   '<provider_service_name>',
>   '<host_name>'
> )
> ```

**Azure:**

> ```sqlsyntax
> SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
>   '<provider_resource_id>',
>   '<host_name>',
>   [, '<subresource>' ]
> )
> ```

**Google Cloud:**

> ```sqlsyntax
> SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
>   '<target_service_id>',
>   '<host_name>'
> )
> ```

## Arguments

**AWS:**

`'provider_service_name'`
:   Specifies the external service or resource to connect to. For example, `com.amazonaws.us-west-2.execute-api` for the Amazon API
    Gateway or `com.amazonaws.us-west-2.s3` for Amazon S3.

    > **Note:**
    >
    > When you connect to a VPC endpoint service in a region that is different from the Snowflake region, ensure that the VPC endpoint service supports the
    > Snowflake region.

    For information about retrieving this value from AWS, see [Provision private connectivity endpoints](../../user-guide/private-manage-endpoints-aws.md).

`'host_name'`
:   Specifies the fully-qualified host name to access the resource in your VPC or VNet.

    This value doesn’t contain any port numbers and must match what you specified in the Snowflake object that lets you to connect to the
    external service.

    > Examples include `bedrock-runtime.us-west-2.amazonaws.com` and `*.s3.us-west-2.amazonaws.com`.
    >
    > When using private connectivity for external stages and external volumes, the `host_name` must use a wildcard instead of
    > specifying a specific AWS S3 bucket.
    >
    > For information about retrieving this value from AWS, see [Provision private connectivity endpoints](../../user-guide/private-manage-endpoints-aws.md).

**Azure:**

`'provider_resource_id'`
:   Specifies the fully qualified identifier for the resource in your VPC or VNet.

`'host_name'`
:   Specifies the fully qualified host name to access the resource in your VPC or VNet.

For examples of the host name for outbound private connectivity for external functions, see the following topics:

> * [Azure Portal](../external-functions-creating-azure-ui-private-connect.md)
> * [Azure ARM template](../external-functions-creating-azure-template-private-connect.md).

`'subresource'`
:   Specifies the name of the subresource of the Azure resource.

    This argument isn’t required for [Azure Private Link Service](https://learn.microsoft.com/en-us/azure/private-link/private-link-service-overview) and Azure API Management Service.

    For all supported values, see the [Sub-resource table](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview#private-link-resource).

**Google Cloud:**

`'target_service_id'`
:   Specifies the service attachment ID (to a custom service), or regional Google API endpoint to connect to.

`'host_name'`
:   Specifies the fully qualified host name to access the resource.

> **Note:**
>
> When the target service ID is a regional Google API endpoint, the host name value should match the target service ID value.

## Returns

Returns a status message that the endpoint was provisioned successfully or details and instructions about why the endpoint was not
provisioned successfully.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Usage notes

* You can modify only the host name of an existing private connectivity endpoint. To modify any other properties, you must deprovision the
  endpoint, then provision a new one. For more information about changing a host name, see [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](system_set_privatelink_endpoint_hostname.md).
* This function can take approximately 5 minutes to execute because it depends on the process to provision the private connectivity
  endpoint in the cloud platform (outside of Snowflake).
* For details about private endpoint limits, see [Scaling considerations](../../user-guide/private-connectivity-outbound.md).

## Examples

AWS:
:   Set up outbound private connectivity to an external S3 service:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      'com.amazonaws.us-west-2.s3',
      '*.s3.us-west-2.amazonaws.com'
    );
    ```

    For more AWS examples, see the following guides:

    * [Set up private connectivity to an external Amazon S3 service](../../developer-guide/external-network-access/creating-using-private-aws.md)
    * [Set up private connectivity to an external Amazon Bedrock service](../../developer-guide/external-network-access/creating-using-private-aws.md)

Microsoft Azure:
:   Provision a private endpoint to allow Snowflake on Microsoft Azure to connect to the Microsoft Azure API Management service in your Microsoft Azure VNet:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
      'aztest1-external-function-api.azure.net',
      'Gateway'
      );
    ```

    ```output
    Private endpoint with ID "/subscriptions/e48379a7-2fc4-473e-b071-f94858cc83f5/resourcegroups/test_rg/providers/microsoft.network/privateendpoints/32bd3122-bfbd-417d-8620-1a02fd68fcf8" to resource "/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api" has been provisioned successfully. Please note down the endpoint ID and approve the connection from it on the Azure portal.
    ```

    Provision a private endpoint to allow Snowflake on Microsoft Azure to connect to an external service using external network access:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/leorg1/providers/Microsoft.Sql/servers/myserver',
      'testdb.database.windows.net',
      'sqlServer'
      );
    ```

    ```output
    "Resource Endpoint with id "/subscriptions/f0abb333-1b05-47c6-8c31-dd36d2512fd1/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" provisioned successfully"
    ```

    Provision a private endpoint to allow Snowflake to connect to an external stage for Microsoft Azure:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      '/subscriptions/cc2909f2-ed22-4c89-8e5d-bdc40e5eac26/resourceGroups/mystorage/providers/Microsoft.Storage/storageAccounts/storagedemo',
      'storagedemo.blob.core.windows.net',
      'blob'
    );
    ```

    ```output
    "Resource Endpoint with id "/subscriptions/57faea9a-20c2-4d35-b283-9c0c1e9593d8/resourceGroups/privatelink-test/providers/Microsoft.Network/privateEndpoints/external-network-access-pe" provisioned successfully"
    ```

Google Cloud:
:   Connect to a published service:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      'projects/my-project/regions/us-west2/serviceAttachments/my-http-server',
      'my-http-server.com'
    );
    ```

    After creating the endpoint, the connection must be accepted on Google Cloud by the resource provider.

    Provision a private endpoint to allow Snowflake on Google Cloud to connect to a service attachment in your Google Cloud VPC Network:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment',
      'my-service.com'
      );
    ```

    ```output
    Private endpoint with ID "abcd0000000000000001" to resource "projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment"
    was provisioned successfully. Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
    ```

    Provision a private endpoint to allow Snowflake on Google Cloud to connect to the regional Cloud Key Management Service (Cloud KMS) endpoint:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      'cloudkms.us-east4.rep.googleapis.com',
      'cloudkms.us-east4.rep.googleapis.com'
      );
    ```

    ```output
    Private endpoint with ID "abcd0000000000000001" to resource "cloudkms.us-east4.rep.googleapis.com" was provisioned successfully.
    Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
    ```

    Provision a private endpoint to allow Snowflake to connect to an external stage for Google Cloud:

    ```sqlexample
    SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
      'storage.us-east4.rep.googleapis.com',
      'storage.us-east4.rep.googleapis.com'
    );
    ```

    ```output
    Private endpoint with ID "abcd0000000000000001" to resource "storage.us-east4.rep.googleapis.com" was provisioned successfully.
    Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
    ```

---
title: SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS
source: https://docs.snowflake.com/en/sql-reference/functions/system_provision_privatelink_endpoint_tss.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS

Provisions a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to a key management service (KMS) by using
private connectivity. The endpoint can be a service endpoint or resource endpoint, depending on the cloud platform that hosts your Snowflake account.

> **Note:**
>
> If the Snowflake account is in an Azure government region, the provider resource ID must be the ID of a resource in a government
> subscription. For more information about government regions for Snowflake customers, see [U.S. SnowGov Regions](../../user-guide/intro-regions.md).

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<provider_service_name>',
  '<host_name>'
  )
```

**Azure:**

```sqlsyntax
SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<provider_resource_id>',
  '<host_name>'
  )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  '<target_service_id>',
  '<host_name>'
  )
```

## Arguments

**AWS:**

`provider_service_name`
:   Specifies the KMS service in AWS to connect to.

    For information about retrieving this value from AWS, see [Provision private connectivity endpoints](../../user-guide/private-manage-endpoints-aws.md).

**Azure:**

`provider_resource_id`
:   Specifies the fully qualified identifier for the Azure Key Vault in your VPC or VNet.

**Google Cloud:**

`target_service_id`
:   Specifies the KMS service in Google Cloud to connect to.

`host_name`
:   Specifies the fully-qualified hostname to access the KMS resource in your VPC, VNet, or PSC network.

    This value does not contain any port numbers and must match what you specified in the Snowflake object that enables you to connect to the
    KMS.

## Returns

Returns a status message that the endpoint was provisioned successfully or details and instructions about why the endpoint was not
provisioned successfully.

## Access control requirements

Only users granted the MODIFY privilege on the account can call this function.
The MODIFY privilege is typically granted only to the ACCOUNTADMIN role.

## Usage notes

* You cannot modify an existing private connectivity endpoint; you must deprovision the endpoint, then provision a new one. To deprovision
  the endpoint, call the [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS](system_deprovision_privatelink_endpoint_tss.md) system function.
* This function can take approximately 5 minutes to execute because it depends on the process to provision the private connectivity
  endpoint in the cloud platform (outside of Snowflake).
* For details about private endpoint limits, see [Scaling considerations](../../user-guide/private-connectivity-outbound.md).

## Examples

**AWS:**

Set up outbound private connectivity to an external KMS resource:

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  'com.amazonaws.us-west-2.kms',
  'kms.us-west-2.amazonaws.com'
);
```

```output
Private endpoint with ID "vpce-0123456789abcdef0" to resource "com.amazonaws.us-west-2.kms" has been provisioned successfully.
Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

**Azure:**

Provision a private endpoint on Microsoft Azure for TSS

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  '/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/providers/Microsoft.KeyVault/vaults/TriSecretVault',
  'trisecretvault.vault.azure.net'
);
```

```output
Private endpoint with ID "/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/prod-snowplex-rg/providers/Microsoft.Network/privateEndpoints/12345678-90ab-cdef-1234-567890abcdef"
to resource "/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/providers/Microsoft.KeyVault/vaults/TriSecretVault"
has been provisioned successfully.

 Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

**Google Cloud:**

```sqlexample
SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS(
  'cloudkms.us-west2.rep.googleapis.com',
  'cloudkms.us-west2.rep.googleapis.com'
);
```

```output
Private endpoint with ID "abcd0000000000001234" to resource "cloudkms.us-west2.rep.googleapis.com" has been provisioned successfully.
Please note the Private Endpoint ID and approve the corresponding connection request in the cloud provider console.
```

---
title: SYSTEM$QUERY_REFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_query_reference.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$QUERY_REFERENCE

Returns a [query reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md) that you can pass to a stored procedure.
Within the stored procedure, when you execute the query, the query is performed using the role of the user who created the query
reference.

> **Note:**
>
> As an alternative to calling this function, you can use the TABLE keyword, if you want the reference to be valid for the scope
> of the call (rather than for the entire session). See [Using the TABLE keyword to create a reference to a table, view, or query](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

See also:
:   [SYSTEM$REFERENCE](system_reference.md)

## Syntax

```sqlsyntax
SYSTEM$QUERY_REFERENCE('<select_statement>', [ , <use_session_scope> ] )
```

## Arguments

**Required**

`select_statement`
:   The SELECT statement to pass to the stored procedure. This must be a statement that serves as an inline view.

    Note that if the SELECT statement contains any single quotes or other special characters (e.g. newlines), you must
    [escape those characters with backslashes](../data-types-text.md).

**Optional**

`use_session_scope`
:   If `TRUE`, specifies that the query reference should be valid for the duration for the session. If this is `FALSE`
    or omitted, the query reference is valid within the context in which it was created. See [Specifying the scope of the reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

    Default value: `FALSE`

## Returns

A query reference that represents the specified SELECT statement.

## Usage notes

## Examples

See [Using query references](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

---
title: SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW
source: https://docs.snowflake.com/en/sql-reference/functions/system_read_yaml_from_semantic_view.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW

Returns the
[specification of a semantic model (in YAML format)](../../user-guide/views-semantic/sql.md)
for a [semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../stored-procedures/system_create_semantic_view_from_yaml.md)

## Syntax

```sqlsyntax
SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW( '<semantic_view_name>' )
```

## Arguments

`'semantic_view_name'`
:   Name of the semantic view.

    If the semantic view is a different schema or database from the current schema or database, specify the
    [partial or fully qualified name](../name-resolution.md) (for example, `my_schema.my_semantic_view` or
    `my_db.my_schema.my_semantic_view`).

## Returns

Returns a VARCHAR value containing the
[specification for the semantic model in YAML format](../../user-guide/views-semantic/sql.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

If the name of the database, schema, or view is a [double-quoted identifier](../identifiers-syntax.md) (for example, if
the name contains spaces), you must include double quotes around the name. For example:

```sqlexample
SELECT SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW(
  '"my database"."my schema"."my semantic view"'
);
```

## Examples

The following example returns the YAML specification for the semantic view named `tpch_analysis` in the database `my_db` and
schema `my_schema`:

```sqlexample
SELECT SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW(
  'my_db.my_schema.tpch_rev_analysis'
);
```

```output
+-------------------------------------------------------------+
| READ_YAML_FROM_SEMANTIC_VIEW                                |
|-------------------------------------------------------------|
| name: TPCH_REV_ANALYSIS                                     |
| description: Semantic view for revenue analysis             |
| tables:                                                     |
|   - name: CUSTOMERS                                         |
|     description: Main table for customer data               |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: CUSTOMER                                       |
|     primary_key:                                            |
|       columns:                                              |
|         - C_CUSTKEY                                         |
|     dimensions:                                             |
|       - name: CUSTOMER_NAME                                 |
|         synonyms:                                           |
|           - customer name                                   |
|         description: Name of the customer                   |
|         expr: customers.c_name                              |
|         data_type: VARCHAR(25)                              |
|       - name: C_CUSTKEY                                     |
|         expr: C_CUSTKEY                                     |
|         data_type: VARCHAR(134217728)                       |
|   - name: LINE_ITEMS                                        |
|     description: Line items in orders                       |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: LINEITEM                                       |
|     primary_key:                                            |
|       columns:                                              |
|         - L_ORDERKEY                                        |
|         - L_LINENUMBER                                      |
|     dimensions:                                             |
|       - name: L_ORDERKEY                                    |
|         expr: L_ORDERKEY                                    |
|         data_type: VARCHAR(134217728)                       |
|       - name: L_LINENUMBER                                  |
|         expr: L_LINENUMBER                                  |
|         data_type: VARCHAR(134217728)                       |
|     facts:                                                  |
|       - name: DISCOUNTED_PRICE                              |
|         description: Extended price after discount          |
|         expr: l_extendedprice * (1 - l_discount)            |
|         data_type: "NUMBER(25,4)"                           |
|       - name: LINE_ITEM_ID                                  |
|         expr: "CONCAT(l_orderkey, '-', l_linenumber)"       |
|         data_type: VARCHAR(134217728)                       |
|   - name: ORDERS                                            |
|     synonyms:                                               |
|       - sales orders                                        |
|     description: All orders table for the sales domain      |
|     base_table:                                             |
|       database: SNOWFLAKE_SAMPLE_DATA                       |
|       schema: TPCH_SF1                                      |
|       table: ORDERS                                         |
|     primary_key:                                            |
|       columns:                                              |
|         - O_ORDERKEY                                        |
|     dimensions:                                             |
|       - name: ORDER_DATE                                    |
|         description: Date when the order was placed         |
|         expr: o_orderdate                                   |
|         data_type: DATE                                     |
|       - name: ORDER_YEAR                                    |
|         description: Year when the order was placed         |
|         expr: YEAR(o_orderdate)                             |
|         data_type: "NUMBER(4,0)"                            |
|       - name: O_ORDERKEY                                    |
|         expr: O_ORDERKEY                                    |
|         data_type: VARCHAR(134217728)                       |
|       - name: O_CUSTKEY                                     |
|         expr: O_CUSTKEY                                     |
|         data_type: VARCHAR(134217728)                       |
|     facts:                                                  |
|       - name: COUNT_LINE_ITEMS                              |
|         expr: COUNT(line_items.line_item_id)                |
|         data_type: "NUMBER(18,0)"                           |
|     metrics:                                                |
|       - name: AVERAGE_LINE_ITEMS_PER_ORDER                  |
|         description: Average number of line items per order |
|         expr: AVG(orders.count_line_items)                  |
|       - name: ORDER_AVERAGE_VALUE                           |
|         description: Average order value across all orders  |
|         expr: AVG(orders.o_totalprice)                      |
| relationships:                                              |
|   - name: LINE_ITEM_TO_ORDERS                               |
|     left_table: LINE_ITEMS                                  |
|     right_table: ORDERS                                     |
|     relationship_columns:                                   |
|       - left_column: L_ORDERKEY                             |
|         right_column: O_ORDERKEY                            |
|   - name: ORDERS_TO_CUSTOMERS                               |
|     left_table: ORDERS                                      |
|     right_table: CUSTOMERS                                  |
|     relationship_columns:                                   |
|       - left_column: O_CUSTKEY                              |
|         right_column: C_CUSTKEY                             |
|                                                             |
+-------------------------------------------------------------+
```

---
title: SYSTEM$REFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_reference.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$REFERENCE

Returns a [reference](../references.md) to an object (a table, view, or function). When
you execute SQL actions on a reference to an object, the actions are performed using the role of the user who created the
reference.

> **Note:**
>
> As an alternative to calling this function, you can use the TABLE keyword, if you need to create a reference to an object that
> you don’t plan to modify (for example, if you are passing in a table that the stored procedure will query) and you want that
> reference to be valid for the scope of the call (rather than for the entire session). See
> [Using the TABLE keyword to create a reference to a table, view, or query](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

See also:
:   [SYSTEM$QUERY_REFERENCE](system_query_reference.md)

## Syntax

```sqlsyntax
SYSTEM$REFERENCE('<object_type>', '<object_identifier>',
  [ , '<reference_scope>' [ , '<privilege>' [ , '<privilege>' ... ] ] ] )
```

## Arguments

**Required**

`'object_type'`
:   Type of the object. You can specify one of the following values:

    * `api_integration`
    * `compute_pool`
    * `database`
    * `external_access_integration`
    * `external_table`
    * `external_volume`
    * `function`
    * `materialized_view`
    * `policy`
    * `pipe`
    * `procedure`
    * `row_access_policy`
    * `secret`
    * `schema`
    * `table`
    * `tag`
    * `task`
    * `view`
    * `warehouse`

`'object_identifier'`
:   Identifier for the object. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

**Optional**

`'reference_scope'`
:   Specifies the scope of the reference.

    If `'CALL'` or omitted, specifies that the reference is valid within the context in which it was created.
    See [Specifying the scope of the reference](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

    If `'SESSION'`, specifies that the reference should be valid for the duration for the session.

    If `'PERSISTENT'`, specifies that the reference should be valid until the object is dropped. See
    [persistent references](../references.md).

    Note: If you need to specify the `'privilege'` argument, the `'reference_scope'` argument is required.

    Valid values:

    * `'CALL'`
    * `'SESSION'`
    * `'PERSISTENT'`

    Default value: `'CALL'`

`'privilege'`
:   Additional [privilege](../../user-guide/security-access-control-privileges.md) that is needed to perform an SQL action on the
    object.

    For example, suppose that you are passing the reference for a table to a stored procedure that inserts rows into that table.
    Specify `'INSERT'` to confer the INSERT privilege on that table to the stored procedure.

    For a list of supported objects and privileges, see [Supported object types and privileges for references](../references.md).

    To specify more than one additional privilege, pass each privilege name as an additional argument to the function. For example,
    to confer the INSERT, UPDATE, and TRUNCATE privileges:

    ```sqlexample
    CALL myprocedure( SYSTEM$REFERENCE('TABLE', 'table_with_different_owner', 'SESSION', 'INSERT'. 'UPDATE', 'TRUNCATE'));
    ```

    Note that you cannot specify OWNERSHIP or ALL as privileges.

## Returns

A serialized string representation of the reference that can be used as an identifier.

## Usage notes

The `'object_type'` argument must match the type of the object specified by `object_identifier`.

## Troubleshooting

The following scenarios can help you troubleshoot issues that can occur.

|  |  |
| --- | --- |
| Error | ```output 505028 (42601): Object type <object_type> does not match the specified type <type_of_the_specified_object> for reference creation ``` |
| Cause | If you try to create a reference using the SYSTEM$REFERENCE function and the `object_type` argument does not match the type of the object specified by `object_identifier`, the function fails. For example, if the `object_type` argument is TABLE, but `object_identifier` resolves to an object type other than TABLE (for example, VIEW), the function fails. |
| Solution | Verify that the type of the object specified by `object_identifier` matches the `object_type` argument. For a list of supported object types, see [Supported object types and privileges for references](../references.md). |

## Examples

See [Background: The problem with passing objects and queries to stored procedures](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

---
title: SYSTEM$REGISTER_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_register_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REGISTER_CMK_INFO

Registers your customer-managed key (CMK) for use with Tri-Secret Secure.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$REGISTER_CMK_INFO( '<cmk_arn>' [ , '<privatelink_enabled>' ] )
```

**Azure:**

```sqlsyntax
SYSTEM$REGISTER_CMK_INFO( '<vault_uri>' , '<key_name>' [ , '<privatelink_enabled>' ] )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$REGISTER_CMK_INFO( '<project_id>' , '<location>', '<key_ring>' , '<key_name>' [ , '<privatelink_enabled>' ] )
```

## Arguments

**Required:**

**AWS**

`cmk_arn`
:   Specifies the Amazon Web Services resource number (ARN) that specifies the customer-managed key (CMK) for use with Tri-Secret Secure.

**Azure**

`vault_uri`
:   Specifies the Microsoft Azure unique endpoint identifier for your Azure Key Vault.

`key_name`
:   Specifies the name for your CMK in Microsoft Azure.

**Google Cloud**

`project_id`
:   Specifies the unique identifier for your project in Google Cloud.

`location`
:   Specifies the Google Cloud region that hosts your Snowflake account.

`key_ring`
:   Specifies the key ring for your CMK in Google Cloud.

`key_name`
:   Specifies the name for your CMK in Google Cloud.

**Optional:**

`privatelink_enabled`

> Specify whether or not to use your private connectivity endpoint for Tri-Secret Secure by passing in one of the following values:
>
> > **Important:**
> >
> > If you omit this argument or pass in an empty string, Snowflake doesn’t use a private connectivity endpoint for Tri-Secret Secure.
>
> `'TRUE'`
> :   Specifies that Snowflake uses the provisioned private connectivity endpoint for Tri-Secret Secure.
>
> `'FALSE'` (default)
> :   Specifies that Snowflake doesn’t use a private connectivity endpoint for Tri-Secret Secure.
>
> `''`
> :   Empty string. Same behavior as `'FALSE'`.

## Returns

Returns a status message stating that the registration is complete.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

Register your CMK for your Snowflake account on Amazon Web Services:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59');
```

Register your CMK for your Snowflake account on Microsoft Azure:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('https://trisecretsite.vault.azure.net/', 'trisecretazkey');
```

Register your CMK for your Snowflake account on Google Cloud:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('my-env', 'us-west1', 'trisecrettest', 'trisecretgcpkey');
```

Register your CMK with a privatelink endpoint for your Snowflake account on Amazon Web Services:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59', 'true');
```

Register your CMK with a privatelink endpoint for your Snowflake account on Microsoft Azure:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('https://trisecretsite.vault.azure.net/', 'trisecretazkey', 'true');
```

Register your CMK with a privatelink endpoint for your Snowflake account on Google Cloud:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO('my-env', 'us-west1', 'trisecrettest', 'trisecretgcpkey', 'true');
```

---
title: SYSTEM$REGISTER_CMK_INFO_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_register_cmk_info_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REGISTER_CMK_INFO_POSTGRES

Registers your customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure.

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$REGISTER_CMK_INFO_POSTGRES( '<cmk_arn>' )
```

**Azure:**

```sqlsyntax
SYSTEM$REGISTER_CMK_INFO_POSTGRES( '<vault_uri>' , '<key_name>' )
```

## Arguments

`cmk_arn`
:   Specifies the Amazon Web Services resource number (ARN) that specifies the customer-managed key (CMK) for use with Tri-Secret Secure.

`vault_uri`
:   Specifies the Microsoft Azure unique endpoint identifier for your Azure Key Vault.

`key_name`
:   Specifies the name for your CMK in Microsoft Azure.

`project_id`
:   Specifies the unique identifier for your project in Google Cloud Platform.

`location`
:   Specifies the Google Cloud Platform region that hosts your Snowflake account.

`key_ring`
:   Specifies the key ring for your CMK in Google Cloud Platform.

`key_name`
:   Specifies the name of your CMK.

## Returns

Returns a status message stating that the registration is complete.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

Register your CMK for your Snowflake account on Amazon Web Services:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO_POSTGRES('arn:aws:kms:us-west-2:736112632310:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59');
```

Register your CMK for your Snowflake account on Microsoft Azure:

```sqlexample
SELECT SYSTEM$REGISTER_CMK_INFO_POSTGRES('https://trisecretsite.vault.azure.net/', 'trisecretazkey');
```

---
title: SYSTEM$REGISTER_PRIVATELINK_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/system_register_privatelink_endpoint.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REGISTER_PRIVATELINK_ENDPOINT

Registers a private connectivity endpoint to route your connection to the Snowflake service.

## Syntax

**AWS**

```sqlsyntax
SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
  '<aws_private_endpoint_vpce_id>',
  '<aws_account_id>',
  '<token>',
  [ <delay_time> ]
  )
```

**Azure**

```sqlsyntax
SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
  '<azure_private_endpoint_link_id>',
  '<azure_private_endpoint_resource_id>',
  '<token>',
  [ <delay_time> ]
  )
```

## Required arguments

**AWS**

`aws_private_endpoint_vpce_id`
:   Specifies the identifier for your Amazon Web Services (AWS) virtual private cloud endpoint (AWS VPCEID).

    To obtain the AWS VPCEID value, navigate through the AWS console or use the following command:

    ```bash
    aws ec2 describe-vpc-endpoints
    ```

`aws_account_id`
:   The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.

    To obtain the AWS account ID value, navigate through the AWS console or use the following command:

    ```bash
    aws sts get-caller-identity
    ```

**Azure**

`azure_private_endpoint_link_id`
:   Specifies the identifier for your Microsoft Azure (Azure) virtual private cloud endpoint link (Azure LinkID).

    To obtain the Azure LinkID value:

    Run the [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](system_get_privatelink_authorized_endpoints.md) system function.

`azure_private_endpoint_resource_id`
:   The identifier that uniquely identifies your Snowflake account in Microsoft Azure (Azure) as a string.

    To obtain the Azure private endpoint resource Id, use the following command:

    ```bash
    az network private-endpoint list --resource-group my_resource_group
    ```

`token`
:   Specifies an access token to verify ownership of the private connectivity endpoint.

    To obtain the token, you must have the corresponding read or describe privilege on the private connectivity endpoint at a minimum.
    For more information, see:

    * [AWS endpoint policies](https://docs.aws.amazon.com/vpc/latest/privatelink/vpc-endpoints-access.html)
    * [Azure private endpoint privileges](https://learn.microsoft.com/en-us/azure/private-link/rbac-permissions#private-endpoint)

    To obtain the token, use the following commands:

    * For Snowflake on AWS:

      ```bash
      aws sts get-federation-token --name snowflake --policy '{ "Version": "2012-10-17", "Statement"
      : [ { "Effect": "Allow", "Action": ["ec2:DescribeVpcEndpoints"], "Resource": ["*"] } ] }'
      ```
    * For Snowflake on Azure:

      ```bash
      az account get-access-token --subscription <subscription_id>
      ```

    For more information about limiting the scope of an access token, see:

    * For Snowflake on AWS: [Managing access token scope on Amazon Web Services](../../user-guide/pin-private-endpoints.md)
    * For Snowflake on Azure: [Managing access token scope on Microsoft Azure](../../user-guide/pin-private-endpoints.md)

## Optional arguments

`delay_time`
:   Specifies the number of minutes to wait before enforcing the private endpoint registration.

    Range: 0 to 1440 minutes (24 hours)

    0 minutes: The registration is enforced immediately.

    Default: 60 (1 hour)

    For more information about the delay time and enforcement, see [Manage enforcement with the delay time argument](../../user-guide/pin-private-endpoints.md).

## Returns

Returns a status message about the registration of the private connectivity endpoint.

If you specify a delay time, the function returns a message stating when the registration will be enforced, with a reminder that when you
pin multiple accounts to the same private endpoint the enforcement is based on the earliest registration.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can call this function.
* You can register multiple private connectivity endpoints for your Snowflake account.

## Examples

Call the SYSTEM$REGISTER_PRIVATELINK_ENDPOINT system function to register the VPC endpoint with your
Snowflake account. The `token` arguments contain truncated values and the delay time unit is minutes:

**AWS**

```sqlexample
SELECT SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
  'vpce-0c1...',
  '123.....',
  '{
    "Credentials": {
      "AccessKeyId": "ASI...",
      "SecretAccessKey": "alD...",
      "SessionToken": "IQo...",
      "Expiration": "2024-12-10T08:20:20+00:00"
    },
    "FederatedUser": {
      "FederatedUserId": "0123...:snowflake",
      "Arn": "arn:aws:sts::174...:federated-user/snowflake"
    },
    "PackedPolicySize": 9,
    }',
  120
  );
```

**Azure**

```sqlexample
SELECT SYSTEM$REGISTER_PRIVATELINK_ENDPOINT(
  '123....',
  '/subscriptions/0cc51670-.../resourceGroups/dbsec_test_rg/providers/Microsoft.Network/
  privateEndpoints/...',
  'eyJ...',
  120
);
```

---
title: SYSTEM$REGISTRY_LIST_IMAGES — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_registry_list_images.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$REGISTRY_LIST_IMAGES — *Deprecated*

Lists images in an [image repository](../../developer-guide/snowpark-container-services/working-with-registry-repository.md).

See also:
:   [Working with an Image Registry and Repository](../../developer-guide/snowpark-container-services/working-with-registry-repository.md)

## Syntax

```sqlsyntax
SYSTEM$REGISTRY_LIST_IMAGES( '/<dbName>/<schemaName>/<repositoryName>' )
```

## Arguments

**Required:**

`dbName`
:   Name of the database in which the repository is created.

`schemaName`
:   Name of the database in which the repository is created.

`repositoryName`
:   Name of the image repository.

## Returns

Returns a JSON object listing all the images.

## Usage notes

* You need the read permission on the repository to get a list of images.

## Examples

This function retrieves a list of images from the `/tutorial_db/data_schema/tutorial_repository` repository.

```sqlexample
SELECT SYSTEM$REGISTRY_LIST_IMAGES('/tutorial_db/data_schema/tutorial_repository');
```

Sample output showing a list of two images in the repository:

```output
+-----------------------------------------------------------------------------+
| SYSTEM$REGISTRY_LIST_IMAGES('/TUTORIAL_DB/DATA_SCHEMA/TUTORIAL_REPOSITORY') |
|-----------------------------------------------------------------------------|
| {"images":["my_echo_service_image","my_job_image"]}                         |
+-----------------------------------------------------------------------------+
```

---
title: SYSTEM$REMOVE_ALL_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/system_remove_all_references.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REMOVE_ALL_REFERENCES

Deletes all associations to the reference.

## Syntax

```sqlsyntax
SYSTEM$REMOVE_ALL_REFERENCES('<reference_name>')
```

## Arguments

**Required**

`'reference_name'`
:   The name of the reference as specified in the `manifest.yml` file of the app.

---
title: SYSTEM$REMOVE_REFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_remove_reference.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REMOVE_REFERENCE

Remove an association from the reference to an object in the consumer account and returns
a unique system-generated alias for the reference.

This function supports both single and multi-valued references. For multi-valued references,
an alias to the reference is required. This alias is used to remove a single association. To
remove all associations of a multi-valued reference, use [SYSTEM$REMOVE_ALL_REFERENCES](system_remove_all_references.md).

## Syntax

```sqlsyntax
SYSTEM$REMOVE_REFERENCE('<reference_name>'[, '<alias>'])
```

## Arguments

**Required**

`'reference_name'`
:   The name of the reference as specified in the `manifest.yml` file of the app.

`'reference_string'`
:   The system-generated ID of the reference to the object in the consumer account.

---
title: SYSTEM$REPORT_HEALTH_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_report_health_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$REPORT_HEALTH_STATUS

Sends [application health information](../../developer-guide/native-apps/monitoring.md) from a consumer app to the provider account.

## Syntax

```sqlsyntax
SYSTEM$REPORT_HEALTH_STATUS( '<status>' )
```

## Arguments

`'status'`
:   A string literal of type VARCHAR that indicates the health status of the
    application. You can specify one of the following values:

* `'OK'`: The consumer instance is healthy.
* `'FAILED'`: The consumer instance is in an error state.
* `'PAUSED'`: The consumer manually paused the app.

## Usage notes

* This function is intended to be called by consumer applications. Your application
  should call this function periodically to report its health status to the
  provider account.
* Your application logic determines what health status to report based on its own
  monitoring and error handling.
* The health status reported by this function is visible to the provider account
  via the GET_HEALTH_STATUS function. You
  should call GET_HEALTH_STATUS periodically
  from the provider account to monitor the health of consumer instances. If you
  use a task or monitored task to call this function, ensure that the application
  has the correct privileges to run the task. Consider setting up alerts to
  notify you when a consumer instance reports a `FAILED` status, a
  `PAUSED` status, or stops reporting its status.
* Snowflake only retains the most recent health status reported by each
  consumer instance of the application.
* To avoid excessive load on Snowflake, this function is rate limited. If the
  function is called again within 55 minutes from the same consumer instance, it
  will return `false` to indicate that the status report was not accepted.
* For more information about monitoring application health from the provider side,
  see [Use monitoring for an app](../../developer-guide/native-apps/monitoring.md).

## Return value

* This function returns TRUE if the health status was successfully reported.
* This function returns FALSE if the status report failed due to being
  rate limited.

---
title: SYSTEM$RESOLVE_PYTHON_PACKAGES
source: https://docs.snowflake.com/en/sql-reference/functions/system_resolve_python_packages.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System functions)

# SYSTEM$RESOLVE_PYTHON_PACKAGES

Returns a list of the resolved dependencies and their versions for the Python packages that were specified.
This function works with packages from both Anaconda and Artifact Repository (PyPI).

## Syntax

```sqlsyntax
SYSTEM$RESOLVE_PYTHON_PACKAGES( '<python_version>', '<package_spec_string>', ['<artifact_repository_name>'] )
```

## Arguments

`python_version`
:   String specifying the version of the Python runtime (e.g., ‘3.12’).

`package_spec_string`
:   Package specifications in PACKAGES clause format (e.g., `$$('numpy>=1.20.0', 'pandas==1.3.0')$$`).
    Use `$$()$$` to return only base packages (Python runtime and its dependencies).

`artifact_repository_name`
:   Optional. String specifying the artifact repository name (e.g., ‘snowflake.snowpark.pypi_shared_repository’).
    If not provided or empty, uses the default Anaconda repository.

## Returns

Returns a JSON array that contains the resolved packages and their dependencies.
Each element in the array is a string in the following format: `<package_name>==<version_name>`.
The result always includes base packages (e.g., Python runtime and system libraries).

## Access control requirements

This function can be called by any user. No special privileges are required.

## Usage notes

* Unlike [SHOW_PYTHON_PACKAGES_DEPENDENCIES](show_python_packages_dependencies.md), which only works with Anaconda packages,
  `SYSTEM$RESOLVE_PYTHON_PACKAGES` works with packages from both Anaconda and Artifact Repository (PyPI).
* The function creates a temporary UDF internally to resolve package dependencies and automatically cleans it up.
* Use this function when you need to determine all dependencies for packages that will be included in a packages policy.

## Examples

**Example 1: Resolve packages from Anaconda**

The following example returns a list of the dependencies of the `numpy` and `pandas` Python packages
with the Python 3.12 runtime from the default Anaconda repository:

```sqlexample
SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$('numpy>=1.20.0', 'pandas==1.3.0')$$);
```

The result is a list of the dependencies and their versions:

```output
["_libgcc_mutex==0.1", "_openmp_mutex==5.1", "blas==1.0", "ca-certificates==2024.9.24",
"intel-openmp==2023.1.0", "libffi==3.4.4", "libgcc-ng==11.2.0", "numpy==1.24.3",
"pandas==1.5.3", "python==3.12.20", "readline==8.2", "sqlite==3.45.3", ...]
```

**Example 2: Resolve packages from Artifact Repository (PyPI)**

The following example resolves the `scikit-learn` package from a PyPI artifact repository:

```sqlexample
SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$('scikit-learn')$$, 'snowflake.snowpark.pypi_shared_repository');
```

**Example 3: Get base packages only**

The following example returns only the base packages for Python 3.12:

```sqlexample
SELECT SYSTEM$RESOLVE_PYTHON_PACKAGES('3.12', $$()$$);
```

The result contains the Python runtime and system dependencies:

```output
["_libgcc_mutex==0.1", "ca-certificates==2024.9.24", "libffi==3.4.4",
"openssl==3.0.15", "python==3.12.20", "readline==8.2", ...]
```

## See also

* [SHOW_PYTHON_PACKAGES_DEPENDENCIES](show_python_packages_dependencies.md) - Returns dependencies for Anaconda packages only (requires ACCOUNTADMIN role)
* [Packages policies](../../developer-guide/udf/python/packages-policy.md) - Packages policies for Python
* [Using third-party packages](../../developer-guide/udf/python/udf-python-packages.md) - Using Python packages in UDFs

---
title: SYSTEM$RESTORE_PRIVATELINK_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/system_restore_privatelink_endpoint.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$RESTORE_PRIVATELINK_ENDPOINT

Restores a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external service
using private connectivity. The endpoint can be a service endpoint or a resource endpoint depending on the cloud platform that hosts your
Snowflake account.

You can restore a private endpoint within 7 days of deprovisioning it. After 7 days, the endpoint cannot be restored and you need to
recreate the endpoint with the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](system_provision_privatelink_endpoint.md) system function.

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
>   '<provider_service_name>' )
> ```

**Azure:**

```sqlsyntax
SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
  '<provider_resource_id>'
  [, '<subresource>' ]
  )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
   '<service_attachment_id>'
);
```

## Arguments

**AWS**

`provider_service_name`
:   Specifies the external service or resource endpoint to restore. For example, `com.amazonaws.us-west-2.execute-api` for the Amazon API
    Gateway or `com.amazonaws.us-west-2.s3` for Amazon S3.

**Azure**

`'provider_resource_id'`
:   Specifies the fully-qualified identifier for the resource in your VPC or VNet.

`'subresource'`
:   Specifies the name of the subresource of the Azure resource.

    This argument is not required for [Azure Private Link Service](https://learn.microsoft.com/en-us/azure/private-link/private-link-service-overview) and Azure API Management Service.

    For all supported values, see the [Sub-resource table](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview#private-link-resource).

**Google Cloud**

`'target_service_id'`
:   Specifies the ID of the service attachment in your VPC network or the regional Google API.

## Returns

Returns a status message stating that the endpoint, with its identifier, is restored successfully.

If unsuccessful, returns an error. For example, if the provided argument is not a valid existing endpoint. If you do not know the endpoint
name, you can use the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](system_get_privatelink_endpoints_info.md) system function to list all endpoints in your
Snowflake account.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can call this function.
* An error message occurs if a private connectivity endpoint is not associated with the specified arguments.

## Examples

**AWS:**

> Restore a private endpoint with external access to Amazon S3:

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT('com.amazonaws.us-west-2.s3');
```

**Azure:**

> Restore a private endpoint to allow Snowflake on Microsoft Azure to connect to the Azure API Management service in your Azure VNet:
>
> ```sqlexample
> SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
>   '/subscriptions/11111111-2222-3333-4444-5555555555/resourceGroups/my_rg/providers/Microsoft.Sql/servers/my_db_server',
>   'sqlServer'
> );
> ```
>
> ```output
> Private endpoint with id ''/subscriptions/66666666-7777-8888-9999-0000000000/resourcegroups/rg/providers/microsoft.network/privateendpoints/00000000-1111-2222-3333-4444444444'' restored successfully.
> ```

**Google Cloud:**

> Restore a private endpoint to allow Snowflake on Google Cloud to connect to the Google API Management service in your Google Cloud VPC Network:
>
> ```sqlexample
> SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT(
>   'projects/my-project/regions/us-east4/serviceAttachments/my-service-attachment'
> );
> ```
>
> ```output
> Private endpoint with id ''abcd0000000000000001'' restored successfully.
> ```

---
title: SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS
source: https://docs.snowflake.com/en/sql-reference/functions/system_restore_privatelink_endpoint_tss.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS

Restores a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external key management service (KMS) resource
by using private connectivity. The endpoint can be a service endpoint or a resource endpoint, depending on the cloud platform that hosts your
Snowflake account.

You can restore a private endpoint within 7 days of deprovisioning it. After 7 days, the endpoint cannot be restored and you need to
recreate the endpoint with the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](system_provision_privatelink_endpoint_tss.md) system function.

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  '<provider_service_name>'
  )
```

**Azure:**

```sqlsyntax
SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  '<provider_resource_id>'
  )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  '<target_service_id>'
  )
```

## Arguments

**AWS:**

`provider_service_name`
:   Specifies the external KMS resource endpoint to restore.

**Azure:**

`provider_resource_id`
:   Specifies the fully-qualified identifier for the resource in your VPC or VNet.

**Google Cloud:**

`target_service_id`
:   Specifies the service attachment ID (to a custom service), or regional Google API endpoint to connect to.

## Returns

Returns a status message stating that the endpoint, with its identifier, is restored successfully.

If unsuccessful, returns an error — for example, if the provided argument is not a valid existing endpoint. If you do not know the endpoint
name, you can use the [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](system_get_privatelink_endpoints_info.md) system function to list all endpoints in your
Snowflake account.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Usage notes

An error message occurs if a private connectivity endpoint is not associated with the specified arguments.

## Examples

**AWS:**

Restore a private endpoint with external access to an AWS key store.

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  'com.amazonaws.us-west-2.s3'
);
```

**Azure:**

Restore a private endpoint to allow Snowflake on Microsoft Azure to connect to the Azure key vault in your Azure VNet:

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  '/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/providers/Microsoft.KeyVault/vaults/TriSecretVault'
);
```

```output
"Resource Endpoint with id "/subscriptions/12345678-90ab-cdef-1234-567890abcdef/resourceGroups/myvault/privatelink-test/providers/Microsoft.KeyVault/vaults/TriSecretVault/privateEndpoints/" restored successfully.
```

**Google Cloud:**

```sqlexample
SELECT SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS(
  'cloudkms.us-west2.rep.googleapis.com'
);
```

```output
Private endpoint with id 'abcd0000000000001234' restored successfully.
```

---
title: SYSTEM$REVOKE_PRIVATELINK
source: https://docs.snowflake.com/en/sql-reference/functions/system_revoke_privatelink.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REVOKE_PRIVATELINK

Disables private connectivity to the Snowflake service for the current account.

See also:
:   [SYSTEM$AUTHORIZE_PRIVATELINK](system_authorize_privatelink.md) , [SYSTEM$GET_PRIVATELINK](system_get_privatelink.md) ,
    [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](system_get_privatelink_authorized_endpoints.md)

## Syntax

**AWS:**

> ```sqlsyntax
> SYSTEM$REVOKE_PRIVATELINK( '<aws_id>' , '<federated_token>' )
> ```

**Azure:**

> ```sqlsyntax
> SYSTEM$REVOKE_PRIVATELINK( '<private-endpoint-resource-id>' , '<federated_token>' )
> ```

**GCP**

> ```sqlsyntax
> SYSTEM$REVOKE_PRIVATELINK( '<gcp_project_id>' , '<access_token>' )
> ```

## Arguments

`'aws_id'`
:   The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.

`'private-endpoint-resource-id'`
:   The identifier that uniquely identifies the private endpoint in Microsoft Azure (Azure) as a string.

`'federated_token'`
:   The federated token value that contains access credentials for a federated user as a string.

    To obtain this value, execute the appropriate command for the cloud platform that hosts your Snowflake account. Use the command-line tool
    provided by the platform:

    * For Snowflake on AWS:

      ```bash
      aws sts get-federation-token --name sam
      ```
    * For Snowflake on Azure:

      ```bash
      az account get-access-token --subscription <SubscriptionID>
      ```

      Where:

      + `SubscriptionID`
        :   The unique identifier for your subscription. For example:

            > `13c...`

            To obtain this value, execute the following Azure CLI command in your command-line environment:

            > ```bash
            > az account list --output table
            > ```
            >
            > Note the output value in the `SubscriptionID` column, which is truncated in this example:
            >
            > > ```text
            > > Name     CloudName   SubscriptionId                        State    IsDefault
            > > -------  ----------  ------------------------------------  -------  ----------
            > > MyCloud  AzureCloud  13c...                                Enabled  True
            > > ```

`'gcp_project_id'`
:   The identifier that uniquely identifies your Google Cloud (GCP) project, as a string.

`'access_token'`
:   The access token value that contains access credentials for a Google Cloud user as a string.

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can be used with Snowflake accounts on AWS or Azure; Google Cloud Platform (GCP) is not currently supported.
* Call the [SYSTEM$GET_PRIVATELINK](system_get_privatelink.md) function to verify whether your Snowflake account is authorized
  to use private connectivity to the Snowflake service.
* Call the [SYSTEM$AUTHORIZE_PRIVATELINK](system_authorize_privatelink.md) function to enable your Snowflake account to use private
  connectivity to the Snowflake service.

## Examples

Disable AWS PrivateLink for your Snowflake account on AWS. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$REVOKE_PRIVATELINK(
>     '185...',
>     '{
>       "Credentials": {
>           "AccessKeyId": "ASI...",
>           "SecretAccessKey": "enw...",
>           "SessionToken": "Fwo...",
>           "Expiration": "2021-01-07T19:06:23+00:00"
>       },
>       "FederatedUser": {
>           "FederatedUserId": "185...:sam",
>           "Arn": "arn:aws:sts::185...:federated-user/sam"
>       },
>       "PackedPolicySize": 0
>   }'
>   );
> ```

Disable Azure Private Link for your Snowflake account on Azure. Note that the values are truncated in this example.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$REVOKE_PRIVATELINK(
>   '/subscriptions/26d.../resourcegroups/sf-1/providers/microsoft.network/privateendpoints/test-self-service',
>   'eyJ...');
> ```

Disable Google Cloud Private Service Connect for your Snowflake account on GCP:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> select SYSTEM$REVOKE_PRIVATELINK(
>   'my-gcp-project-id',
>   'ya29.a0AcM612zT4pJaXdYfwgY8aiMoDE9W_xkqQ20coFTB1TJcImKDPo...'
>   );
> ```

---
title: SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_revoke_snowflake_managed_storage_volume_privatelink_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS

Revokes the authorization for Snowflake to access the private endpoint for
[Azure private endpoints for Snowflake-managed storage volumes](../../user-guide/private-managed-volumes-azure.md) for the current account.

See also:
:   [SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](system_authorize_snowflake_managed_storage_volume_privatelink_access.md),
    [Revoking private endpoints to access Snowflake-managed storage volumes](../../user-guide/private-managed-volumes-azure.md)

## Syntax

```sqlsyntax
SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS( '<private_endpoint_resource_id>' )
```

## Arguments

`'private_endpoint_resource_id'`
:   The unique identifier for the Azure Private Endpoint.

    For instructions on how to obtain this value, see
    [Configuring private endpoints to access Snowflake-managed storage volumes](../../user-guide/private-managed-volumes-azure.md).

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can call this function.
* This function is supported for Snowflake accounts on Microsoft Azure only.

## Examples

Revoke access to an Azure private endpoint for a Snowflake-managed storage volume:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS(
>   '/subscriptions/subId/resourceGroups/rg1/providers/Microsoft.Network/privateEndpoints/pe1');
> ```

---
title: SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_revoke_stage_privatelink_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS

Revokes the authorization for Snowflake to access the private endpoint for [Azure private endpoints for internal stages](../../user-guide/private-internal-stages-azure.md)
and [Google Private Service Connect endpoints for internal stages](../../user-guide/private-internal-stages-gcp.md) for the current account.

See also:
:   [SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS](system_authorize_stage_privatelink_access.md) , [Revoking private endpoints to access Snowflake internal stages](../../user-guide/private-internal-stages-azure.md)

## Syntax

**Azure**

```sqlsyntax
SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS( '<private_endpoint_resource_id>' )
```

**Google Cloud**

```sqlsyntax
SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS( '<google_cloud_vpc_network_name>' )
```

## Arguments

`'private_endpoint_resource_id'`
:   The unique identifier for the Azure Private Endpoint.

`'google_cloud_vpc_network_name'`
:   The fully qualified path value for the Google Cloud VPC Network.

    This value is the Google Cloud VPC network path that Snowflake uses to limit access to your internal stage through the cloud provider’s internal network and avoid using the public internet.

    For instructions on how to obtain these values on Azure, see [Configuring private endpoints to access Snowflake internal stages](../../user-guide/private-internal-stages-azure.md); for Google Cloud, see [Configure private endpoints to access Snowflake internal stages](../../user-guide/private-internal-stages-gcp.md).

## Usage notes

* Only account administrators—that is, users with the ACCOUNTADMIN role—can call this function.
* This function is not supported for Snowflake accounts on
  Amazon Web Services (AWS).

## Examples

Revoke Snowflake to access the Private Endpoint on Azure:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS('/subscriptions/subId/resourceGroups/rg1/providers/Microsoft.Network/privateEndpoints/pe1');
> ```

Revoke Snowflake to access the private endpoint on Google Cloud:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS('projects/vpc_network_name/global/networks/network_name');
> ```

---
title: SYSTEM$SAP_BDC_LIST_SHARES
source: https://docs.snowflake.com/en/sql-reference/functions/system_sap_bdc_list_shares.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SAP_BDC_LIST_SHARES

Lists Data Products shared by SAP® Business Data Cloud with the enrolled catalog integration.

See also:
:   [CREATE CATALOG INTEGRATION (SAP® Business Data Cloud)](../sql/create-catalog-integration-sap.md)

## Syntax

```sqlsyntax
SYSTEM$SAP_BDC_LIST_SHARES( '<catalog_integration_name>' )
```

## Arguments

`catalog_integration_name`
:   Identifier for the catalog integration for [Iceberg REST](../sql/create-catalog-integration-rest.md) or
    [Snowflake Open Catalog](../../user-guide/tables-iceberg-configure-catalog-integration-open-catalog.md).

## Returns

Returns a JSON-formatted array of strings that lists the Data Products shared by SAP® Business Data Cloud with the enrolled catalog integration.

The JSON-formatted string has the following structure:

```json
[
  "usid:[guid]:ns:[namespace]:r:[dataproduct1]:v:[version]",
  "usid:[guid]:ns:[namespace]:r:[dataproduct2]:v:[version]",
]
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration (catalog) |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

List the Data Products that currently shared from SAP® BDC to Snowflake with an enrolled catalog integration. Note that when new Data Products are shared, they are automatically available in the return value. When previously shared Data Products are unshared, they are automatically removed from the return value.

```sqlexample
SELECT SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG('myCatalogIntegration');
SELECT SYSTEM$SAP_BDC_LIST_SHARES('my-sap-bdc-catalog-int');
```

Which should produce results similar to:

```output
["usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:cashflow:v:1",
 "usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:generalledgeraccount:v:1",
 "usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:salesorder:v:1",
 "usid:0c7785a5-951f-4f3c-9f9f-9df3a5524d84:ns:sap.s4com:r:profitcenter:v:1"]
```

Where `cashflow`, `generalledgeraccount`, `salesorder`, and `profitcenter`
are the Data Products shared from SAP® BDC to Snowflake with the enrolled catalog integration `my-sap-bdc-catalog-int`.

---
title: SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH
source: https://docs.snowflake.com/en/sql-reference/functions/system_schedule_async_replication_group_refresh.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH

Starts a refresh operation for a replication group or a failover group, in the background.
You can call this function in a stored procedure to begin one or more refresh operations
and continue doing work while the refreshes are in progress.

See also:
:   [Replication groups and failover groups](../../user-guide/account-replication-intro.md),
    [ALTER REPLICATION GROUP](../sql/alter-replication-group.md),
    [ALTER FAILOVER GROUP](../sql/alter-failover-group.md),
    [REPLICATION_GROUP_REFRESH_HISTORY view](../organization-usage/replication_group_refresh_history.md)

## Syntax

```sqlsyntax
SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH(<replication_group_name>)
SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH(<failover_group_name>)
```

## Arguments

`'replication_group_name'` or `'failover_group_name'`
:   The name of the replication group or failover group to refresh.

## Usage notes

* This function has the same effect as an
  ALTER REPLICATION GROUP … REFRESH or ALTER FAILOVER GROUP … REFRESH command,
  but doesn’t wait for the operation to complete.
* Only account administrators (that is, users with the ACCOUNTADMIN role) can execute this function.
* This function must be executed from the secondary account.

## Examples

Start refreshing two failover groups simultaneously:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH('failover_group_1');
> SELECT SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH('failover_group_2');
> ```

---
title: SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG
source: https://docs.snowflake.com/en/sql-reference/functions/system_send_notifications_to_catalog.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG

Sends a notification to [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) to update Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) in Open Catalog with the latest table changes, and
returns whether the notification was sent successfully along with an error code and error message for the failure, if applicable.

Notifications are a mechanism for keeping Snowflake-managed Iceberg tables that are synced to Open Catalog updated with the latest table
changes. When tables are synced to Open Catalog, notifications are continuously sent to them. However, if notifications aren’t being
sent to a table, you can call this function and use the error message it returns to diagnose the reason for the sync failure.

## Syntax

```sqlsyntax
SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG( '<domain>' , '<entity_name>' [ , '<notification_type>'] [ , '<catalog_sync_integration_name>'] )
```

## Arguments

**Required:**

`domain`
:   The domain at which to send the notification. You can specify one of the following domains:

    * `DATABASE`
    * `SCHEMA`
    * `TABLE`

    For example, if you want to send a notification to tables under a certain schema, specify `SCHEMA`.

`entity_name`
:   The name of an entity for the given `domain`. Depending on the given domain, `entity_name` specifies the name of a
    database, schema, or table.

**Optional:**

`notification_type`
:   The type of notification to send to Open Catalog. You can specify one of the following types of notifications:

    * `UPDATE`: Updates the state of the table in Open Catalog. If the table doesn’t yet exist, Open Catalog, creates the table.
    * `DROP`: Drops the table from Open Catalog if it exists.

    Default: `UPDATE`

`catalog_sync_integration_name`
:   The name of a catalog integration for Open Catalog to which you want to scope the notifications. The notifications are only sent to a given
    table if the `CATALOG_SYNC` parameter for the table is set to this catalog integration.

    > **Important:**
    >
    > If you need to specify a value for `catalog_sync_integration_name`, you can’t leave `notification_type` empty to use
    > its default value. In other words, if you need to specify a value for `catalog_sync_integration_name` instead of using the
    > default, you must first specify `UPDATE` or `DROP` for `notification_type`.

    Default: If the argument is not specified, notifications are sent to all the tables in the domain specified by the required arguments,
    regardless of their catalog sync integration. For example, if you specify `SCHEMA` for `domain` and `schema1` for `entity_name`
    and use the default for `catalog_sync_integration_name`, all tables under `schema1` are notified. This argument is used to limit
    the scope of notifications to a single catalog sync integration.

## Returns

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| TABLENAME | Table name that the notification was sent to. It’s presented as the fully qualified table name (Database.Schema.Table). |
| NOTIFICATIONSTATUS | Status of the notification. Returns `TRUE` if the notification was sent successfully to Open Catalog or `FALSE` if it wasn’t sent successfully. |
| ERRORCODE | Error code for the send notification failure. If the notification was sent successfully, this field is empty. |
| ERRORMESSAGE | Error message describing why the notification failed. If the notification was sent successfully, this field is empty. |

## Usage Notes

`domain`, `entity_name`, `notification_type`, and `catalog_sync_integration_name` are all a string data
type, so each must be enclosed in single quotes.

## Examples

Send an `UPDATE` notification to any Snowflake-managed Iceberg table in Open Catalog that is under the `testSchema` schema in Snowflake and is
synced to Open Catalog.

```sqlexample
SELECT VALUE[0]::STRING AS tableName,
       VALUE[1]::BOOLEAN notificationStatus,
       VALUE[2]::STRING errorCode,
       VALUE[3]::STRING errorMessage
  FROM TABLE(FLATTEN(PARSE_JSON(
    SELECT SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG(
      'SCHEMA',
      'testSchema'))));
```

Send a `DROP` notification to any Snowflake-managed Iceberg table in Open Catalog that is named `icebergTable` and is synced to
Open Catalog through the `my_catalog_sync_integration` catalog integration.

```sqlexample
SELECT VALUE[0]::STRING AS tableName,
       VALUE[1]::BOOLEAN notificationStatus,
       VALUE[2]::STRING errorCode,
       VALUE[3]::STRING errorMessage
   FROM TABLE(FLATTEN(PARSE_JSON(
     SELECT SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG(
       'TABLE',
       'icebergTable',
       'DROP',
       'my_catalog_sync_integration'))));
```

---
title: SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_application_restricted_feature_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS

Enables a restricted feature for a Snowflake Native App. Currently, only external and Apache Iceberg™ tables are
supported.

## Syntax

```sqlsyntax
SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS(
  '<app_name>',
  '<type>',
  '<parameters>'
)
```

## Arguments

`app_name`
:   Name of the Snowflake Native App.

`type`
:   The type of restricted feature. Currently only `EXTERNAL_DATA` is supported.

`parameters`
:   A JSON object that contains configuration parameters for the restricted feature. Currently,
    only JSON objects of the following format are supported:

    ```json
    {"allowed_cloud_providers" : "all"}
    ```

    The supported values for `allowed_cloud_providers` are `all` and `none`.

## Returns

A JSON object containing a list of external features whose value the consumer has set. The JSON
object has the following structure:

```sqljson
"{""external_data"":{""allowed_cloud_providers"":""all""}}"
```

## Examples

To call the function:

```sqlexample
SELECT SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS('hello_snowflake_app', 'external_data', '{"allowed_cloud_providers" : "none"}');
```

Sample output:

```output
"SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS('EXTERNAL_DATA_DEMO_APP', 'EXTERNAL_DATA', '{""ALLOWED_CLOUD_PROVIDERS"" : ""NONE""}')"
"{""external_data"":{""allowed_cloud_providers"":""none""}}"
```

---
title: SYSTEM$SET_CATALOG_INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_catalog_integration.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SET_CATALOG_INTEGRATION

Replaces the catalog integration associated with an externally managed [Apache Iceberg™ table](../../user-guide/tables-iceberg.md).

Use this function to update a table to work with an Iceberg REST catalog integration, which supports a wider range of Iceberg features, such as
[write support for externally managed Iceberg tables](../../user-guide/tables-iceberg-externally-managed-writes.md). You might also use this function to roll back to
the original Glue catalog integration, if needed.

You can also use this function to migrate your table from one [Iceberg REST catalog integration](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md)
to another.

## Syntax

```sqlsyntax
SYSTEM$SET_CATALOG_INTEGRATION(
  '<table_name>' ,
  '<new_catalog_integration_name>'
)
```

## Arguments

`'table_name'`
:   Name of the Iceberg table whose catalog integration you want to replace.

`'new_catalog_integration_name'`
:   Name of the catalog integration that you want to migrate the given `table_name` to.

## Returns

The function returns a status message that the catalog integration for the table is successfully migrated. For an example, see
Examples.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| OWNERSHIP | Table whose catalog integration is being replaced. |
| USAGE | Current catalog integration. |
| USAGE | Target catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You can only replace the catalog integration for externally managed Iceberg tables in a standard Snowflake database.
  You can’t replace the catalog integration for Iceberg tables in a catalog-linked database or replace the catalog integration for any
  other type of Iceberg table.
* The type of the current catalog integration associated with the table restricts the types of catalog integrations that you can use as a
  replacement. The following table lists the supported transitions when you replace one type of catalog integration with another:

  | Current catalog integration type | New catalog integration type | Notes |
  | --- | --- | --- |
  | [AWS Glue](../sql/create-catalog-integration-glue.md) | [AWS Glue Iceberg REST](../sql/create-catalog-integration-rest.md) |  |
  | AWS Glue Iceberg REST | AWS Glue | Fall back to a catalog integration that uses an AWS Glue catalog source. |
  | [Iceberg REST](../sql/create-catalog-integration-rest.md) | Iceberg REST | Migrate the table to an alternative catalog integration. |

  No other transition combinations are supported.
* `table_name` and `new_catalog_integration_name` are string literals, so you must include the values in single quotes.
* Both the current and target catalog integrations must point to the same external catalog.
* Both the current and target catalog integrations can’t have credential vending enabled.

## Examples

Replace the AWS Glue catalog integration associated with an Iceberg table named `glue_table` with an AWS Glue
Iceberg REST catalog integration named `glue_rest_catalog_int`:

```sqlexample
SELECT SYSTEM$SET_CATALOG_INTEGRATION('glue_table', 'glue_rest_catalog_int');
```

Sample output:

```none
+------------------------------------------------------------------------------------------------------------------------------+
|                                                SYSTEM$SET_CATALOG_INTEGRATION                                                |
+------------------------------------------------------------------------------------------------------------------------------+
| Catalog integration for table GLUE_TABLE has been migrated from 'GLUE_CATALOG_INTEGRATION' to 'GLUE_REST_CATALOG_INT'        |
+------------------------------------------------------------------------------------------------------------------------------+
```

## Troubleshooting

If the function fails, it returns an error response. Common error messages include:

| Error Message | Situation and Solution |
| --- | --- |
| SYSTEM$SET_CATALOG_INTEGRATION does not support transitioning from catalog integration ‘[CURRENT_CATALOG_INTEGRATION]’ to ‘[TARGET_CATALOG_INTEGRATION]’ due to unsupported type combination | The current or target catalog integration provided doesn’t match the catalog integration types that are supported. For the supported types, see the usage notes. |
| SYSTEM$SET_CATALOG_INTEGRATION cannot transition from ‘[CURRENT_CATALOG_INTEGRATION]’ to ‘[TARGET_CATALOG_INTEGRATION]’ due to incompatible catalog integration configurations | The given catalog integrations are of the supported type but don’t align with one of the supported transition combinations. For the supported transition combinations, see the usage notes. |
| Currently doesn’t support performing transition when catalog integration ‘[CATALOG_INTEGRATION]’ has credential vending enabled | The provided catalog integration has credential vending enabled. Provide a catalog integration with credential vending disabled and try again. |
| SYSTEM$SET_CATALOG_INTEGRATION can only be used on unmanaged Iceberg tables | The table provided isn’t an externally managed Iceberg table. Provide an externally managed Iceberg table and try again. |

---
title: SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_default_columns_override_for_show_command.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND

Controls the columns that should be returned when the specified [SHOW <objects>](../sql/show.md) command is executed.

You can call this function if the introduction of new columns in a SHOW COMMAND introduces a problem with a script or code that
depends on a fixed number or order of columns in the results. See [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_get_default_columns_override_for_show_command.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_unset_default_columns_override_for_show_command.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  '<object_type>',
  '<list_of_columns>'
)
```

## Arguments

`'object_type'`
:   Type of object for the SHOW command. For example, for the SHOW TABLES command, specify `'TABLES'`. For the SHOW NOTIFICATION
    INTEGRATIONS command, specify `'NOTIFICATION INTEGRATIONS'`.

`list_of_columns`
:   Comma-separated or space-separated list of columns that should be returned in the output of the SHOW command.

    You can specify the column names in uppercase, lowercase, or mixed case.

    To return all columns, specify an empty string or call
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_unset_default_columns_override_for_show_command.md).

## Returns

Returns TRUE if the operation was successful.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Examples

The following example configures the [SHOW TABLES](../sql/show-tables.md) command to return only the `name`, `database_name`,
`kind`, and `comment` columns:

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'TABLES',
  'name, database_name, kind, comment'
);
```

Executing the SHOW TABLES command returns only the specified columns:

```sqlexample
SHOW TABLES;
```

```output
+------------------+---------------+-------+---------+
| name             | database_name | kind  | comment |
|------------------+---------------+-------+---------|
| DEPARTMENT_TABLE | MY_DB         | TABLE |         |
| EMPLOYEE_TABLE   | MY_DB         | TABLE |         |
+------------------+---------------+-------+---------+
```

Executing the SHOW TERSE TABLES command returns only the specified columns except for `comment`, which isn’t normally returned
when you specify TERSE:

```sqlexample
SHOW TERSE TABLES;
```

```output
+------------------+-------+---------------+
| name             | kind  | database_name |
|------------------+-------+---------------|
| DEPARTMENT_TABLE | TABLE | MY_DB         |
| EMPLOYEE_TABLE   | TABLE | MY_DB         |
+------------------+-------+---------------+
```

---
title: SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_default_columns_override_for_system_object.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT

Controls the columns that should be returned when you select all columns (`SELECT *`) from the specified Snowflake view (for
example, from a specific [ACCOUNT_USAGE view](../account-usage.md) or
[INFORMATION_SCHEMA view](../info-schema.md)).

> **Note:**
>
> This function does not affect queries that select specific columns from the view.

You can call this function if the introduction of new columns in a Snowflake view introduces a problem with a script or code that
selects all columns and depends on a fixed number or order of columns in the results. See
[Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_get_default_columns_override_for_system_object.md) ,
    [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_unset_default_columns_override_for_system_object.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  '<object_type>',
  '<database_name>',
  '<schema_name>',
  '<object_name>',
  '<list_of_columns>'
)
```

## Arguments

`'object_type'`
:   Type of the object. You must specify `'VIEW'` for this argument.

`'database_name'`
:   Name of the database that contains the object. You must specify `'SNOWFLAKE'` or, for INFORMATION_SCHEMA views, an empty
    string.

`'schema_name'`
:   Name of the schema that contains the object. You must specify the name of a schema in the
    [SNOWFLAKE database](../snowflake-db.md) or `'INFORMATION_SCHEMA'`.

`'object_name'`
:   Name of the object.

`list_of_columns`
:   Comma-separated or space-separated list of columns that should be returned when you select all columns from this view.

    You can specify the column names in uppercase, lowercase, or mixed case.

    To return all columns, specify an empty string or call SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT.

## Returns

Returns TRUE if the operation was successful.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Usage notes

* You must have a database in use (for example, by running [USE DATABASE](../sql/use-database.md)) in order to call this function.
  If no database is in use, the function call fails.

## Examples

The following example configures queries that select all columns from the [TABLES view](../account-usage/tables.md) view in the
ACCOUNT_USAGE schema to return only the `table_name`, `table_schema`, and `table_type` columns:

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'TABLES',
  'table_name, table_schema, table_type'
);
```

Selecting all columns from that view returns only the specified columns:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TABLES;
```

```output
+------------+---------------------+------------+
| TABLE_NAME | TABLE_SCHEMA        | TABLE_TYPE |
|------------+---------------------+------------|
| MY_TABLE   | MY_SCHEMA           | BASE TABLE |
+------------+---------------------+------------+
```

The following example configures queries that select all columns from the [TABLES view](../info-schema/tables.md) view in the
INFORMATION_SCHEMA schema to return only the `table_name`, `table_schema`, and `table_type` columns:

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'INFORMATION_SCHEMA',
  'TABLES',
  'table_name, table_schema, table_type'
);
```

Selecting all columns from that view returns only the specified columns:

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.TABLES;
```

```output
+--------------+------------+------------+
| TABLE_SCHEMA | TABLE_NAME | TABLE_TYPE |
|--------------+------------+------------|
| MY_SCHEMA    | MY_TABLE   | BASE TABLE |
+--------------+------------+------------+
```

---
title: SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_event_sharing_account_for_region.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION

Sets the event account for a region.

See also:
:   [SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION](system_unset_event_sharing_account_for_region.md)

## Syntax

```sqlsyntax
SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION( '<snowflake_region>' , '<region_group>' , '<account_name>' )
```

## Arguments

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2, AWS_US_EAST_1`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`. Refer to
    [Region groups](../../user-guide/admin-account-identifier.md) for details.

`account_name`
:   Specifies the account name. If another account is already set as the events account in the
    specified region, calling this function changes the events account to be the account
    specified here.

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this SQL function.

## Examples

```sqlexample
SELECT SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION('aws_us_west_2', 'public', 'myaccount');
```

---
title: SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_privatelink_endpoint_hostname.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME

Modifies only the host name of an existing [private connectivity endpoint](../../user-guide/private-connectivity-outbound.md).

> **Note:**
>
> If the Snowflake account is in an Azure government region, the provider resource ID must be the ID of a resource in a government
> subscription. For more information about government regions for Snowflake customers, see [U.S. SnowGov Regions](../../user-guide/intro-regions.md).

## Syntax

**AWS:**

```sqlsyntax
SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME( '<provider_service_name>' , '<host_name>' )
```

**Azure:**

```sqlsyntax
SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME( '<provider_resource_id>' , '<host_name>' , [ , '<subresource>' ] )
```

**Google Cloud:**

```sqlsyntax
SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME( '<target_service_id>' , '<host_name>' )
```

## Arguments

**AWS:**

`'provider_service_name'`
:   Specifies the external service or resource to connect to. For example, `com.amazonaws.us-west-2.execute-api` for the Amazon API
    Gateway or `com.amazonaws.us-west-2.s3` for Amazon S3.

    For information about retrieving this value from AWS, see [Provision private connectivity endpoints](../../user-guide/private-manage-endpoints-aws.md).

`'host_name'`
:   Specifies the new fully-qualified host name that should be used to access the resource in your VPC or VNet.

    This value doesn’t contain any port numbers and must match what you specified in the Snowflake object that you use to connect to the
    external service.

    Examples include `bedrock-runtime.us-west-2.amazonaws.com` and `*.s3.us-west-2.amazonaws.com`.

    When you use private connectivity for external stages and external volumes, the `host_name` must use a wildcard instead of
    specifying an AWS S3 bucket.

    For information about retrieving this value from AWS, see [Provision private connectivity endpoints](../../user-guide/private-manage-endpoints-aws.md).

**Azure:**

`'provider_resource_id'`
:   Specifies the fully qualified identifier for the resource in your VPC or VNet.

`'host_name'`
:   Specifies new the fully qualified host name to access the resource in your VPC or VNet.

    For examples of the host name for outbound private connectivity for external functions, see the following topics:

    * [Azure Portal](../external-functions-creating-azure-ui-private-connect.md)
    * [Azure ARM template](../external-functions-creating-azure-template-private-connect.md).

`'subresource'`
:   Specifies the name of the subresource of the Azure resource.

    This argument isn’t required for [Azure Private Link Service](https://learn.microsoft.com/en-us/azure/private-link/private-link-service-overview) and Azure API Management Service.

    For all supported values, see the [Sub-resource table](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview#private-link-resource).

**Google Cloud:**

`'target_service_id'`
:   Specifies the service attachment ID (to a custom service), or regional Google API endpoint to connect to.

`'host_name'`
:   Specifies the new fully qualified host name to access the resource.

## Returns

Returns a status message that the host name for the private connectivity endpoint was updated successfully.

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Usage notes

* You can only modify the host name of an existing private connectivity endpoint.

## Examples

**AWS:**
:   Update the hostname of a private endpoint to allow Snowflake on Amazon Web Services to connect to the VPCE service in your Amazon Web Services VPC:

    ```sqlexample
    SELECT SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME(
      'com.amazonaws.vpce.us-west-2.vpce-svc-01234567890abcdef',
      'my-new-service-name.com'
      );
    ```

    ```output
    Successfully set the host name of the privatelink endpoint ``com.amazonaws.vpce.us-west-2.vpce-svc-01234567890abcdef`` to ``my-new-service-name.com``
    ```

**Azure:**
:   Update the host name of a private endpoint to allow Snowflake on Microsoft Azure to connect to the Microsoft Azure API Management service in your Microsoft Azure VNet:

    ```sqlexample
    SELECT SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME(
      '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
      'my-new-custom-api-endpoint.net',
      'Gateway'
      );
    ```

    ```output
    Successfully set the host name of the privatelink endpoint ``/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api`` to ``my-new-custom-api-endpoint.net``
    ```

**Google Cloud:**
:   Update the host name of a private endpoint to allow Snowflake on Google Cloud Platform to connect to the service attachment in your Google Cloud Platform VPC network:

    ```sqlexample
    SELECT SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME(
      'projects/my-project/regions/us-west2/serviceAttachments/my-http-server',
      'my-new-custom-api-endpoint.com'
      );
    ```

    ```output
    Successfully set the host name of the privatelink endpoint ``projects/my-project/regions/us-west2/serviceAttachments/my-http-server`` to ``my-new-custom-api-endpoint.com``
    ```

---
title: SYSTEM$SET_REFERENCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_reference.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SET_REFERENCE

Called by a Snowflake Native App to associate a consumer reference string to a reference definition.
The app can use this association to access the consumer object. The reference string passed to this system function is the value returned by the
[SYSTEM$REFERENCE](system_reference.md) function, which represents a consumer object.

This function only supports a single-valued reference. If an association has already been created using the same reference name, the existing association is overwritten.

## Syntax

```sqlsyntax
SYSTEM$SET_REFERENCE('<reference_name>', '<reference_string>')
```

## Arguments

**Required**

`'reference_name'`
:   The name of the reference as specified in the `manifest.yml` file of the app.

`'reference_string'`
:   The system-generated ID of the reference to the object in the consumer account.

---
title: SYSTEM$SET_RETURN_VALUE
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_return_value.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SET_RETURN_VALUE

Explicitly sets the return value for a task.

In a [task graph](../../user-guide/tasks-graphs.md), a task can call this function to set a return value.
Another task that identifies this task as the predecessor task (using the `AFTER` keyword in the task definition)
can retrieve the return value set by the predecessor task using [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](system_get_predecessor_return_value.md).

## Syntax

```sqlsyntax
SYSTEM$SET_RETURN_VALUE( '<string_expression>' )
```

The value for the `string_expression` argument can be a string literal or a variable; for example, `SYSTEM$SET_RETURN_VALUE(:VARIABLE)`.

## Arguments

`string_expression`
:   The string to set as the return value. The string size must be <= 10 kB (when encoded in UTF8).

## Examples

Create a task that sets a return value. Create a second, child task that runs after the predecessor task has completed.
The child task retrieves the return value set by the predecessor task (by calling [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](system_get_predecessor_return_value.md)) and inserts it into a table row:

> ```sqlexample
> -- Create a table to store the return values.
> CREATE OR REPLACE TABLE return_values_table (str VARCHAR);
>
> -- Create a task that sets the return value for the task.
> CREATE TASK set_return_value_task
>   WAREHOUSE = return_task_wh
>   SCHEDULE = '1 MINUTE'
>   AS
>     CALL SYSTEM$SET_RETURN_VALUE('The quick brown fox jumps over the lazy dog');
>
> -- Create a task that identifies the first task as the predecessor task and retrieves the return value set for that task.
> CREATE TASK get_return_value_task
>   WAREHOUSE = return_task_wh
>   AFTER set_return_value_task
>   AS
>     INSERT INTO return_values_table VALUES(SYSTEM$GET_PREDECESSOR_RETURN_VALUE());
>
>
> -- Note that if there are multiple predecessor tasks that are enabled, you must specify the name of the task to retrieve the return value for that task.
> CREATE TASK get_return_value_by_pred_task
>   WAREHOUSE = return_task_wh
>   AFTER set_return_value_task
>   AS
>     INSERT INTO return_values_table VALUES(SYSTEM$GET_PREDECESSOR_RETURN_VALUE('get_return_value_task'));
>
> -- Resume task (using ALTER TASK ... RESUME).
> -- Wait for task to run on schedule.
>
> SELECT DISTINCT(str) FROM return_values_table;
> +-----------------------------------------------+
> |                      STR                      |
> +-----------------------------------------------+
> |  The quick brown fox jumps over the lazy dog  |
> +-----------------------------------------------+
>
> SELECT DISTINCT(RETURN_VALUE)
>   FROM TABLE(information_schema.task_history())
>   WHERE RETURN_VALUE IS NOT NULL;
>
>
> +-----------------------------------------------+
> |                  RETURN_VALUE                 |
> +-----------------------------------------------+
> |  The quick brown fox jumps over the lazy dog  |
> +-----------------------------------------------+
> ```

### Example 2: Call by using a separate stored procedure

Similar to the first example, but set the return value for the task and retrieve it by calling separate stored procedures:

```sqlexample
-- Create a table to store the return values.
CREATE OR REPLACE TABLE return_values_sp (str VARCHAR);

-- Create a stored procedure that sets the return value for the task.
CREATE OR REPLACE PROCEDURE set_return_value_sp()
RETURNS STRING
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS $$
var stmt = snowflake.createStatement({sqlText:`CALL SYSTEM$SET_RETURN_VALUE('The quick brown fox jumps over the lazy dog');`});
  var res = stmt.execute();
$$;

-- Create a stored procedure that inserts the return value for the predecessor task into the 'return_values_sp' table.
CREATE OR REPLACE PROCEDURE get_return_value_sp()
RETURNS STRING
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS $$
var stmt = snowflake.createStatement({sqlText:`INSERT INTO return_values_sp VALUES(SYSTEM$GET_PREDECESSOR_RETURN_VALUE());`});
var res = stmt.execute();
$$;

-- Create a task that calls the set_return_value_sp stored procedure.
CREATE TASK set_return_value_t
WAREHOUSE=warehouse1
SCHEDULE='1 MINUTE'
AS
  CALL set_return_value_sp();

-- Create a task that calls the get_return_value stored procedure.
CREATE TASK get_return_value_t
WAREHOUSE=warehouse1
AFTER set_return_value_t
AS
  CALL get_return_value_sp();

-- Resume task.
-- Wait for task to run on schedule.

SELECT DISTINCT(str) FROM return_values_sp;
+-----------------------------------------------+
|                      STR                      |
+-----------------------------------------------+
|  The quick brown fox jumps over the lazy dog  |
+-----------------------------------------------+

SELECT DISTINCT(RETURN_VALUE)
  FROM TABLE(information_schema.task_history())
  WHERE RETURN_VALUE IS NOT NULL;

+-----------------------------------------------+
|                  RETURN_VALUE                 |
+-----------------------------------------------+
|  The quick brown fox jumps over the lazy dog  |
+-----------------------------------------------+
```

### Example 3: Use a variable to set the return value

The following example demonstrates how to dynamically generate a return value based on the task’s execution and set the return value by using a variable. In this example, the task loads data from a stream into a landing table and sets the return value to indicate the number of rows loaded:

```sqlexample
CREATE OR REPLACE TASK load_raw_data
WAREHOUSE = 'WH'
WHEN
    SYSTEM$STREAM_HAS_DATA('NEW_WEATHER_DATA')
AS
    DECLARE
        rows_loaded NUMBER;
        result_string VARCHAR;
    BEGIN
        INSERT INTO raw_weather_data ( -- our landing table
            row_id)
        SELECT
            row_id
        FROM
            new_weather_data  -- our source stream
        ;

        -- to see the number of rows loaded in the UI
        rows_loaded := (SELECT $1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())));
        result_string := :rows_loaded || ' rows loaded into RAW_WEATHER_DATA';
        -- show result string as task return value
        CALL SYSTEM$SET_RETURN_VALUE(:result_string);
    END;
```

---
title: SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_row_timestamp_on_all_supported_tables.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES

Use this system function to bulk enable row timestamps on existing tables.

This function adds the row timestamp column to all existing eligible tables within the container and ensures newly created tables automatically
have row timestamp enabled.

To successfully execute the function, you need MODIFY privileges on the container you’re invoking the function on.

After row timestamps are enabled, tables expose the METADATA$ROW_LAST_COMMIT_TIME column, which returns the timestamp when each row was last
modified. This enables change tracking, incremental processing, and time-travel queries based on row modification time. For more information, see
[Use row timestamps to measure latency in your pipelines](../../user-guide/data-engineering/row-timestamps.md).

## Syntax

```sqlsyntax
SELECT SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES('<level>', '<qualified_name>')

- The first argument is level: one of :code:`schema`, :code:`database`, or :code:`account`.
- The second argument is the fully qualified name of the container.
```

## Arguments

**Required**

`'level'`
:   Container level. Can be one of the following: `account`, `database`, `schema`.

`'qualified_name'`
:   The fully qualified name of the container. For example, `my_db.myschema` for schema level.

## Examples

The following example demonstrates how to bulk-enable row timestamps for all supported tables within a specific schema using a system function. It
also verifies that the feature is applied to existing tables and sets the schema-level default to ensure all future tables automatically include
the METADATA$ROW_LAST_COMMIT_TIME column.

```sqlexample
CREATE OR REPLACE DATABASE my_db;
CREATE OR REPLACE SCHEMA my_schema;
USE DATABASE my_db;
USE SCHEMA my_schema;

CREATE OR REPLACE TABLE my_table (id INT, v STRING);
CREATE OR REPLACE TRANSIENT TABLE my_transient_table (id INT, v STRING);
CREATE OR REPLACE TEMP TABLE my_temp_table (id INT, v STRING);

SELECT SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES(
  'schema',
  'my_db.my_schema'
);

-- System function sets the container default so that new tables will get row timestamp going forward
SHOW PARAMETERS LIKE 'ROW_TIMESTAMP_DEFAULT' IN SCHEMA my_db.my_schema;

INSERT INTO my_table VALUES (1, 'a'), (2, 'b');
INSERT INTO my_transient_table VALUES (10, 'x');
INSERT INTO my_temp_table VALUES (100, 'tmp');

SELECT ID, METADATA$ROW_LAST_COMMIT_TIME FROM my_table ORDER BY ID;

SELECT ID, METADATA$ROW_LAST_COMMIT_TIME FROM my_transient_table ORDER BY ID;

SELECT ID, METADATA$ROW_LAST_COMMIT_TIME FROM my_temp_table ORDER BY ID;
```

---
title: SYSTEM$SET_SPAN_ATTRIBUTES (for Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/functions/system_set_span_attributes.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SET_SPAN_ATTRIBUTES (for Snowflake Scripting)

Sets attribute name and value associated with a span containing trace events.

Use SYSTEM$SET_SPAN_ATTRIBUTES to set the attribute name and value for a span when using trace events from a handler written in
Snowflake Scripting.

For more information, refer to [Emitting trace events in Snowflake Scripting](../../developer-guide/logging-tracing/tracing-snowflake-scripting.md).

## Syntax

```sqlsyntax
SYSTEM$SET_SPAN_ATTRIBUTES('<object>');
```

## Arguments

`'object'`
:   An object containing name-value pairs representing the attributes to add.

## Examples

Code in the following example uses the SYSTEM$ADD_EVENT function to add an event named `name_a` and an event named `name_b`.
With `name_b`, it associates two attributes, `score` and `pass`. The code also uses SYSTEM$SET_SPAN_ATTRIBUTES to
set two attributes for the span, `attr1` and `attr2`.

```sqlexample
create procedure MYPROC()
returns double
language sql
as
$$
begin
    -- Add an event without attributes
    SYSTEM$ADD_EVENT('name_a');

    -- Add an event with attributes
    let attr := {'score':89, 'pass':true};
    SYSTEM$ADD_EVENT('name_b', attr);

    -- Set attributes for the span
    SYSTEM$SET_SPAN_ATTRIBUTES('{'attr1':'value1', 'attr2':true}');

    return 3.14;
end;
$$
;
```

---
title: SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_active_behavior_change_bundles.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES

Returns an array of the currently available [behavior change release bundles](../../release-notes/behavior-change-policy.md), the default
state of each bundle, and the actual state of the bundle for the current account.

See also:
:   [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](system_enable_behavior_change_bundle.md),
    [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](system_disable_behavior_change_bundle.md),
    [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](system_behavior_change_bundle_status.md)

## Syntax

```sqlsyntax
SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES()
```

## Arguments

None.

## Returns

Returns a VARCHAR value that contains an array of objects that represent the currently available behavior change bundles.
Each object contains the following keys, which describe the status of the bundle:

| Key | Description of value |
| --- | --- |
| `name` | Name of the behavior change bundle |
| `isDefault` | `true` if the associated bundle should be enabled by default for the current account; `false` otherwise. |
| `isEnabled` | `true` if the associated bundle is actually enabled by default for the current account; `false` otherwise. |

## Usage notes

* Calling [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](system_enable_behavior_change_bundle.md) or [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](system_disable_behavior_change_bundle.md) changes the value of
  `isEnabled` for the specified bundle.
* [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](system_behavior_change_bundle_status.md) returns the same information as this function for a
  specific bundle.

## Examples

The following example returns information about the current behavior change bundles.

```sqlexample
SELECT SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES();
```

```output
+--------------------------------------------------------------------------------------------------------------+
| SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES()                                                                 |
|--------------------------------------------------------------------------------------------------------------|
| [{"name":"2023_08","isDefault":true,"isEnabled":true},{"name":"2024_01","isDefault":false,"isEnabled":true}] |
+--------------------------------------------------------------------------------------------------------------+
```

The following example uses the [PARSE_JSON](parse_json.md) function to return the array as a VARIANT and then uses the [FLATTEN](flatten.md)
function to present the bundle information in a tabular format.

```sqlexample
SELECT
    bundles.VALUE:name::VARCHAR AS bundle_name,
    bundles.VALUE:isDefault::BOOLEAN AS is_enabled_by_default,
    bundles.VALUE:isEnabled::BOOLEAN AS is_actually_enabled_in_account
  FROM
    TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES())))
    AS bundles;
```

```output
+-------------+-----------------------+--------------------------------+
| BUNDLE_NAME | IS_ENABLED_BY_DEFAULT | IS_ACTUALLY_ENABLED_IN_ACCOUNT |
|-------------+-----------------------+--------------------------------|
| 2023_08     | True                  | True                           |
| 2024_01     | False                 | True                           |
+-------------+-----------------------+--------------------------------+
```

---
title: SYSTEM$SHOW_BUDGET_SHARED_RESOURCE_CANDIDATES
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_budget_shared_resource_candidates.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_BUDGET_SHARED_RESOURCE_CANDIDATES

Returns the list of resources that can be added as shared resources to a [budget](../../user-guide/budgets.md).

For more information about configuring a budget to track consumption by shared resources, see [Using budgets for AI features (shared resources)](../../user-guide/budgets/budget-shared-resources.md).

## Syntax

```sqlsyntax
SYSTEM$SHOW_BUDGET_SHARED_RESOURCE_CANDIDATES( '<domain>' )
```

## Arguments

`'domain'`
:   Specifies a type of resource. The function returns all resources of the specified type that can be added to a budget as shared resources.

    Currently, the only supported value is `'AI_FUNCTION'`, which lists all AI functions that can be added as shared resources to a budget.

## Returns

The function returns an array of objects. Each object contains the following keys:

| Key | Data type | Description |
| --- | --- | --- |
| `id` | NUMBER | Internal identifier for the resource. |
| `name` | VARCHAR | Name of the resource (for example, the AI function name). |
| `domain` | VARCHAR | Type of resource (for example, `AI_FUNCTION`). |

## Examples

List the AI functions that can be added as shared resources to a budget:

```sqlexample
CALL SYSTEM$SHOW_BUDGET_SHARED_RESOURCE_CANDIDATES('AI_FUNCTION');
```

---
title: SYSTEM$SHOW_BUDGETS_FOR_RESOURCE
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_budgets_for_resource.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$SHOW_BUDGETS_FOR_RESOURCE

Returns a string containing a list of the [budgets](../../user-guide/budgets.md) that track a specified resource (for example,
a table or a schema).

See also:
:   [<budget_name>!GET_LINKED_RESOURCES](../classes/budget/methods/get_linked_resources.md)

## Syntax

```sqlsyntax
SYSTEM$SHOW_BUDGETS_FOR_RESOURCE( '<resource_domain>' , '<resource_name>' )
```

## Arguments

`'resource_domain'`
:   Domain of the resource. You can specify one of the following values:

    * `compute_pool`
    * `database`
    * `materialized_view`
    * `pipe`
    * `schema`
    * `table`
    * `task`
    * `warehouse`

`'resource_name'`
:   Name of the resource (for example, the name of the table).

## Returns

Returns a VARCHAR value containing the comma-delimited list of the fully qualified names of the budgets for the resource.
The list is surrounded by square brackets.

If there are no budgets tracking the specified resource, the function returns a string containing an empty pair of square brackets
(`[]`).

## Usage notes

The output of this function includes budgets that include the resource because of any of the following reasons:

* The resource was added directly to the budget.
* The resource has the tag/value combination that was added to the budget.
* The resource belongs to an object (for example, a database) that was added to the budget.

## Examples

The following example returns the list of budgets that track the schema named `my_db.my_schema`:

```sqlexample
SELECT SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('SCHEMA', 'my_db.my_schema');
```

```output
+---------------------------------------------------------------+
| SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('SCHEMA', 'MY_DB.MY_SCHEMA') |
|---------------------------------------------------------------|
| [BUDGETS_DB.BUDGETS_SCHEMA.MY_BUDGET]                         |
+---------------------------------------------------------------+
```

The following example returns the list of budgets that track the table named `my_db.my_schema.my_table`. In this example, the
table is not tracked by any budget, so the function returns an empty list.

```sqlexample
SELECT SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('TABLE', 'my_db.my_schema.my_table');
```

```output
+-----------------------------------------------------------------------+
| SYSTEM$SHOW_BUDGETS_FOR_RESOURCE('TABLE', 'MY_DB.MY_SCHEMA.MY_TABLE') |
|-----------------------------------------------------------------------|
| []                                                                    |
+-----------------------------------------------------------------------+
```

---
title: SYSTEM$SHOW_BUDGETS_IN_ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_budgets_in_account.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_BUDGETS_IN_ACCOUNT

Returns the [budgets](../../user-guide/budgets.md) in the account for which you have access privileges.

See also:
:   [CREATE BUDGET](../classes/budget/commands/create-budget.md)

## Syntax

```sqlsyntax
SYSTEM$SHOW_BUDGETS_IN_ACCOUNT()
```

## Returns

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE | TEXT | Name of the database to which the budget instance belongs. |
| SCHEMA | TEXT | Name of the schema to which the budget instance belongs. |
| CREATED_ON | NUMBER | UTC timestamp when the budget instance was created. |
| ID | NUMBER | Internal/system identifier for the budget instance. |
| CURRENT_VERSION | TEXT | Budget class version used to create the budget instance. |
| COMMENT | TEXT | Comment for the budget instance. |
| NAME | TEXT | Name of the budget instance. |

## Usage notes

The results include budgets for which the role executing the function has been granted any privileges.

## Examples

The following example retrieves the budgets in the account:

```sqlexample
SELECT SYSTEM$SHOW_BUDGETS_IN_ACCOUNT();
```

---
title: SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_dynamic_tables_created_for_resharing.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING

When a consumer of a listing reshares the listing’s data into another region, Snowflake creates hidden dynamic tables to enable listing auto-fulfillment in the target region. This system function returns information about the hidden dynamic tables that Snowflake creates under the *outgoing* view to materialize imported data for cross-region resharing.

Use this function to:

* Identify which imported objects have backing dynamic tables for a given outgoing view.
* Inspect the most recent refresh times for those dynamic tables (for debugging or cost/health analysis).

See also:
:   [Resharing listings](../../collaboration/reshare-listings.md)

## Syntax

```sqlsyntax
SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING( '<view_name>' )
```

## Arguments

`'view_name'`
:   The name of the outgoing view attached to a listing or share whose imported data is being auto-materialized into hidden dynamic tables for
    resharing.

    You can pass a fully qualified view name, for example:

    ```sqlexample
    SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING(
      'RESHARER_DB.PUBLIC.SHARED_VIEW'
    );
    ```

## Returns

Returns a JSON string containing an array of objects. Each object represents a hidden dynamic table created under the specified view for
resharing:

| Field | Type | Description |
| --- | --- | --- |
| dtName | STRING | The fully qualified name of the hidden dynamic table nested under the outgoing view (for example, `_<id>_IMPORTED_DB.SCHEMA.TABLE_DT_FOR_RESHARING`). |
| dtSourceObject | STRING | The fully qualified name of the imported object (for example, `IMPORTED_DB.SCHEMA.TABLE`) that is being materialized into this dynamic table for resharing. This corresponds to the original imported entity referenced in the view definition. |
| dtRefreshStartTimeMillis | NUMBER | The epoch timestamp in milliseconds when the most recent refresh of this dynamic table started. Null if no refresh has occurred. Convert with `TO_TIMESTAMP_LTZ(value:dtRefreshStartTimeMillis::number, 3)`. |
| dtRefreshEndTimeMillis | NUMBER | The epoch timestamp in milliseconds when the most recent refresh of this dynamic table completed. Null if no refresh has occurred. Convert with `TO_TIMESTAMP_LTZ(value:dtRefreshEndTimeMillis::number, 3)`. |
| status | STRING | The status of the most recent refresh. Null if no refresh has occurred. Possible values: `SCHEDULED`, `EXECUTING`, `SUCCEEDED`, `FAILED`, `CANCELLED`, `UPSTREAM_FAILED`. For descriptions of each status, see the [DYNAMIC_TABLE_REFRESH_HISTORY](dynamic_table_refresh_history.md) output. |

## Usage notes

* In the following scenarios, dynamic tables aren’t created and the function returns no rows:

  + The view doesn’t reference any imported databases.
  + The view uses imported data that is not eligible for resharing.
  + The view hasn’t yet been processed by the listing auto-fulfillment.
* This function is intended for observability and debugging.

## Examples

The following example retrieves the dynamic tables created for a reshared view:

```sqlexample
SELECT * FROM TABLE(FLATTEN(input =>
  PARSE_JSON(
    SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING(
      'RESHARER_DB.PUBLIC.SHARED_VIEW'
    )
  )
));
```

To get a readable table with proper timestamps:

```sqlexample
SELECT
  value:dtName::STRING AS dt_name,
  value:dtSourceObject::STRING AS dt_source_object,
  TO_TIMESTAMP_LTZ(value:dtRefreshStartTimeMillis::NUMBER, 3) AS dt_refresh_start_time,
  TO_TIMESTAMP_LTZ(value:dtRefreshEndTimeMillis::NUMBER, 3) AS dt_refresh_end_time,
  value:status::STRING AS status
FROM TABLE(FLATTEN(input =>
  PARSE_JSON(
    SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING(
      'RESHARER_DB.PUBLIC.SHARED_VIEW'
    )
  )
));
```

Sample output:

```output
+----------------------------------------------------------+----------------------------+-------------------------------+-------------------------------+-------------------+
| DT_NAME                                                  | DT_SOURCE_OBJECT           | DT_REFRESH_START_TIME         | DT_REFRESH_END_TIME           | STATUS            |
+----------------------------------------------------------+----------------------------+-------------------------------+-------------------------------+-------------------+
| _12345_IMPORTED_DB.PUBLIC.TABLE_A_DT_FOR_RESHARING       | IMPORTED_DB.PUBLIC.TABLE_A | 2026-03-19 10:00:00.000 -0700 | 2026-03-19 10:00:05.000 -0700 | SUCCEEDED |
| _12345_IMPORTED_DB.PUBLIC.VIEW_B_DT_FOR_RESHARING        | IMPORTED_DB.PUBLIC.VIEW_B  | 2026-03-19 10:00:01.000 -0700 | 2026-03-19 10:00:04.000 -0700 | SUCCEEDED |
+----------------------------------------------------------+----------------------------+-------------------------------+-------------------------------+-------------------+
```

---
title: SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_event_sharing_accounts.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS

Shows event accounts in a provider organization.

This system function returns a string in JSON format containing a list of event accounts within the organization.
Because the metadata takes some time to propagate to all regions, this function might experience some delay when
showing latest events account after the user sets or unsets an events account for the organization.

## Syntax

```sqlsyntax
SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS()
```

## Arguments

None.

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this SQL function.

## Examples

```sqlexample
SELECT SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS();
```

---
title: SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_move_organization_account_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS

Returns the status of an attempt to move an [organization account](../../user-guide/organization-accounts.md).

See also:
:   [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](system_initiate_move_organization_account.md) , [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](system_commit_move_organization_account.md)

## Syntax

```sqlsyntax
SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS( )
```

## Arguments

None.

## Returns

The following are the possible statuses:

| Code | Status |
| --- | --- |
| 060050 | Move of the current organization account has been initiated. |
| 060051 | Created a new organization account as the destination for migrating the existing organization account. |
| 060052 | Objects are being replicated from the current organization account to the target organization account. Target organization account is currently locked and not ready for use. |
| 060053 | Initial replication of objects is complete and the target organization account is ready to be reviewed. If you are ready to proceed with the move please run SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT(<GRACE_PERIOD_IN_DAYS>). |
| 060054 | Commit of organization account move in progress. |
| 060055 | The move has been completed successfully. The original organization account is locked and will be deleted in x days. |
| 060056 | The organization account move failed. |
| 060057 | Cannot fetch status of organization account move. |

## Access control requirements

Only users with the GLOBALORGADMIN role can call this function.

## Usage notes

Only shows the status of the latest attempt to move the organization account.

## Example

```sqlexample
SELECT SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS();
```

---
title: SYSTEM$SHOW_OAUTH_CLIENT_SECRETS
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_oauth_client_secrets.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_OAUTH_CLIENT_SECRETS

Returns the client secrets in a string. The client ID and a client secret must be included in the authorization header to the OAuth token endpoint.

## Syntax

```sqlsyntax
SYSTEM$SHOW_OAUTH_CLIENT_SECRETS( '<integration_name>' )
```

## Arguments

`integration_name`
:   Name of the integration. Note that the integration name is case-sensitive and must be uppercase and enclosed in single quotes.

## Output

The function returns the following elements in a JSON object:

| Column Name | Data Type | Description |
| --- | --- | --- |
| oauth_client_secret_2 | BASE64 | Secondary client secret for the specified integration. Snowflake supports two client secrets to allow for uninterrupted rotation. |
| oauth_client_secret | BASE64 | Client secret for the specified integration |
| oauth_client_id | STRING | Client ID in Snowflake |

## Examples

The following example retrieves the client secret for the specified integration:

> ```sqlexample
> select system$show_oauth_client_secrets('MYINT');
> ```

---
title: SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES
source: https://docs.snowflake.com/en/sql-reference/functions/system_show_sensitive_data_monitored_entities.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES

Returns a JSON array of databases or schemas that are associated with a classification profile, which indicates that objects in these
entities are monitored by [sensitive data classification](../../user-guide/classify-intro.md).

## Syntax

```sqlsyntax
SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES( [ '<entity_type>' ] )
```

## Arguments

`'entity_type'`
:   Optional. A string specifying the type of entity to return. Possible values are `DATABASE` and `SCHEMA`.

    If omitted, returns all entities monitored by sensitive data classification.

## Returns

A JSON string containing an array of monitored entities and their associated classification profiles. Each object in the array contains
the following fields:

* `name`: Name of the monitored entity (that is, a database or schema).
* `type`: Type of the entity (DATABASE or SCHEMA).
* `profile_name`: Fully qualified name of the associated classification profile.

## Usage notes

* Only objects associated with a classification profile are shown.
* The current role must have access to both the entity and the classification profile associated with it for the entity to be included in
  the output.

## Examples

Show all databases that are monitored by sensitive data classification:

```sqlexample
SELECT SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES('DATABASE');
```

```output
[
{"name":"TESTDB","type":"DATABASE","profile_name":"TESTDB.TESTSCHEMA.MY_CLASSIFICATION_PROFILE"},
{"name":"TEST","type":"DATABASE","profile_name":"TEST.PUBLIC.TEST_PROFILE"}
]
```

Show all schemas that are monitored by sensitive data classification:

```sqlexample
SELECT SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES('SCHEMA');
```

```output
[
{"name":"TESTDB.TESTSCHEMA","type":"SCHEMA","profile_name":"TESTDB.TESTSCHEMA.MY_CLASSIFICATION_PROFILE"}
]
```

Show all entities (databases and schemas) that are monitored by sensitive data classification:

```sqlexample
SELECT SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES();
```

```output
[
{"name":"TESTDB","type":"DATABASE","profile_name":"TESTDB.TESTSCHEMA.MY_CLASSIFICATION_PROFILE"},
{"name":"TESTDB.TESTSCHEMA","type":"SCHEMA","profile_name":"TESTDB.TESTSCHEMA.TEST_PROFILE"}
]
```

---
title: SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS
source: https://docs.snowflake.com/en/sql-reference/functions/system_snowflake_managed_storage_volume_public_access_status.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Information)

# SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS

Checks to see whether public IP addresses are allowed to access the Snowflake-managed storage volume of the current Snowflake
account on Microsoft Azure.

See also:
:   [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_block_snowflake_managed_storage_volume_public_access.md),
    [SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_unblock_snowflake_managed_storage_volume_public_access.md)

## Syntax

> ```sqlsyntax
> SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to Snowflake-managed storage volumes is blocked | Indicates that the Azure settings that control access to the Snowflake-managed storage volume are currently blocking all public IP addresses. |
| Public Access to Snowflake-managed storage volumes is unblocked | Indicates that at least some public IP addresses can access the Snowflake-managed storage volume. |
| No interop volumes configured on account | Indicates that there are no Snowflake-managed storage volumes configured for the account. |

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud are not supported.

## Examples

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS();
> ```
>
> ```output
> Public Access to Snowflake-managed storage volumes is blocked
> ```

---
title: SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN
source: https://docs.snowflake.com/en/sql-reference/functions/system_snowpipe_streaming_update_channel_offset_token.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN

Updates the offset token for a particular channel used by Snowpipe Streaming with a new offset token.

For more information about channels and offset tokens, see [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

See also:

> [SHOW CHANNELS](../sql/show-channels.md)

## Syntax

```sqlsyntax
SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN('<dbName>.<schemaName>.<tableName>', '<channelName>', '<new_offset_token>')
```

## Arguments

`dbName`
:   Name of the database in which the channel is stored.

`schemaName`
:   Name of the schema in which the channel is stored.

`tableName`
:   Name of the table where the channel is mapped to.

`channelName`
:   Name of the channel.

`new_offset_token`
:   The new offset token.

## Usage notes

* The role that executes this function must have at least the INSERT privilege on the table where the channel is mapped to.

## Examples

Updates the offset token for `mychannel` in `mydb.myschema.mytable` with a `<new_offset_token>`:

> ```sqlexample
> show channels;
> select SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN('mydb.myschema.mytable', 'mychannel', '<new_offset_token>');
> show channels;
> ```

---
title: SYSTEM$START_OAUTH_FLOW
source: https://docs.snowflake.com/en/sql-reference/functions/system_start_oauth_flow.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$START_OAUTH_FLOW

Initiates the OAUTH client flow, returning a URL you use in a browser to complete the OAuth consent process.

## Syntax

```sqlsyntax
SYSTEM$START_OAUTH_FLOW( '<database_name.schema_name.secret_name>' )
```

## Arguments

`'database_name.schema_name.secret_name'`
:   The name of the OAuth2 secret specifying authentication information for the API to access with OAuth.

## Usage notes

Use this function to begin a flow that results in an OAuth refresh token added to the secret you pass to this function as an argument.

As an intermediate step, this function returns an authorization URL you can in a browser to complete the OAuth consent process.

After executing this function and using the URL it returns, immediately execute [SYSTEM$FINISH_OAUTH_FLOW](system_finish_oauth_flow.md)
in the same session to have Snowflake add a refresh token to the secret you specified.

The [secret](../sql/create-secret.md) in this function’s argument must include:

* A TYPE parameter specifying a value of `oauth2`.
* An API_AUTHENTICATION parameter specifying a [security integration](../sql/create-security-integration-api-auth.md)
  containing details (such as OAuth client ID, secret, authorization endpoint, and token endpoint) about the service provider for which
  access is being granted.

---
title: SYSTEM$START_USER_EMAIL_VERIFICATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_start_user_email_verification.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$START_USER_EMAIL_VERIFICATION

Starts the [email verification process for a user](../../user-guide/notifications/email-notifications.md). The system sends a verification email to the user’s email address.

## Syntax

```sqlsyntax
SYSTEM$START_USER_EMAIL_VERIFICATION( '<user_name>' )
```

## Arguments

`'user_name'`
:   Name of the user.

## Access control requirements

Only the specified user or the role with the OWNERSHIP privilege on that user can call this function.

## Usage notes

* `user_name` is a string literal that must be enclosed in single quotes.

  If the user name is [case-sensitive or includes any special characters or spaces](../identifiers-syntax.md), you must enclose the name in double quotes, and then enclose the resulting double-quoted name in single quotes. For example:

  ```sqlexample
  SELECT SYSTEM$START_USER_EMAIL_VERIFICATION(
    '"Case-Sensitive UserName"');
  ```

## Examples

Start email verification for a user when the user name follows the rules for [unquoted object identifiers](../identifiers-syntax.md):

```sqlexample
SELECT SYSTEM$START_USER_EMAIL_VERIFICATION('user_name');
```

Start email verification for a user with a case-sensitive name:

```sqlexample
SELECT SYSTEM$START_USER_EMAIL_VERIFICATION('"UserName"');
```

---
title: SYSTEM$STREAM_BACKLOG
source: https://docs.snowflake.com/en/sql-reference/functions/system_stream_backlog.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md) , [System functions](../functions-system.md) (Information)

# SYSTEM$STREAM_BACKLOG

Returns the set of table versions between the current [offset](../../user-guide/streams-intro.md) for a specified stream and the
current timestamp. This function accepts any stream type as input (e.g. table, external table, or view) with the exception of streams on
directory tables.

For each table version, the function provides the estimated number of change data capture (CDC) records that comprise the table version,
as well as the DML operation (INSERT, UPDATE, DELETE, TRUNCATE) associated with the table version.

Use this function to analyze the volume of CDC records generated for each stream, enabling you to estimate the compute resources required
for a task to process the records.

## Syntax

```sqlsyntax
SYSTEM$STREAM_BACKLOG('<stream_name>')
```

## Arguments

`stream_name`
:   The name of the stream to query.

    * Note that the entire name must be enclosed in single quotes, including the database and schema, if the name is fully-qualified,
      (i.e. `'<db>.<schema>.<stream_name>'`).
    * If the stream name is case-sensitive or includes any special characters or spaces, double quotes are required to process the
      case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<stream_name>"'`.

## Usage notes

N/A

## Examples

Retrieve the current set of unconsumed table versions for stream `db1.schema1.s1`:

> ```sqlexample
> SELECT * FROM TABLE(SYSTEM$STREAM_BACKLOG('db1.schema1.s1'));
> ```

---
title: SYSTEM$STREAM_GET_TABLE_TIMESTAMP
source: https://docs.snowflake.com/en/sql-reference/functions/system_stream_get_table_timestamp.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$STREAM_GET_TABLE_TIMESTAMP

Returns the timestamp in nanoseconds of the latest table version at or before the current offset for the specified stream. When the stream is
queried (or consumed), the records returned include all transactions committed after this table version and before the current time.

> **Note:**
>
> This function was created primarily as a means to “bootstrap” a stream (i.e. return the set of records inserted between the period when the table was created (at table version `t0`) and the specified stream was created). Since this function was introduced, [CREATE STREAM](../sql/create-stream.md) and [SELECT](../sql/select.md) statements that include the [CHANGES](../constructs/changes.md) clause now support Time Travel using the [AT | BEFORE](../constructs/at-before.md) clause. These options provide greater flexibility for querying historical table records.

## Syntax

```sqlsyntax
SYSTEM$STREAM_GET_TABLE_TIMESTAMP('<stream_name>')
```

## Arguments

`stream_name`
:   The name of the stream to query.

    * Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified), i.e. `'<db>.<schema>.<stream_name>'`.
    * If the stream name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<stream_name>"'`.

## Usage notes

* This function returns an error when the input is a stream on a view.

  To create a stream at or before the current offset for an existing stream, we recommend providing the existing stream name as input to
  the AT | BEFORE clause for simplicity and maximum compatibility with existing streams:

  ```sqlsyntax
  CREATE STREAM ... AT ( STREAM => '<stream-name>' )
  ```

## Examples

Query the timestamp for the current offset for a stream:

```sqlexample
create table MYTABLE1 (id int);

create table MYTABLE2(id int);

create or replace stream MYSTREAM on table MYTABLE1;

insert into MYTABLE1 values (1);

-- consume the stream
begin;
insert into MYTABLE2 select id from MYSTREAM;
commit;

-- return the current offset for the stream
select system$stream_get_table_timestamp('MYSTREAM');
```

---
title: SYSTEM$STREAM_HAS_DATA
source: https://docs.snowflake.com/en/sql-reference/functions/system_stream_has_data.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$STREAM_HAS_DATA

Indicates whether a specified stream contains change data capture (CDC) records.

## Syntax

```sqlsyntax
SYSTEM$STREAM_HAS_DATA('<stream_name>')
```

## Arguments

`stream_name`
:   The name of the stream to query.

    * Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified), i.e. `'<db>.<schema>.<stream_name>'`.
    * If the stream name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<stream_name>"'`.

## Usage notes

* This function is intended to be used in the WHEN expression in the definition of tasks. If the specified stream contains no change data, the task skips the current run. This check can help avoid starting or resuming a warehouse unnecessarily. However, note that the function is designed to avoid false negatives (i.e. returning a false value even when the stream contains change data); however, the function is not guaranteed to avoid false positives (i.e. returning a true value when the stream contains no change data).
* This function performs a diff of the table version metadata (between the stream offset and the current transactional time) to determine whether the stream contains CDC records. If the DML activity for the table during that period consisted of the same set of rows being inserted, optionally updated, and deleted, returning to the original table state, then it is possible this function could return a TRUE value even though the stream contains no CDC records.
* When the input is a view stream, the returned value is `TRUE` when change data capture (CDC) records for the
  underlying tables change. The function performs a diff on the version metadata for the underlying tables rather than for the
  view itself. The result is a false positive when the query in the source view definition does not reference the rows in the underlying
  tables that have changed. The rate of false positives increases as a view becomes more selective.

  When this function is referenced in the optional `WHEN` parameter in a task definition, the higher false positive rate means that
  tasks may run when a view stream is empty more often than when a table stream is the input for the function. However, this check still
  avoids task runs when there is no change in the underlying table data.
* Calling this function on a stream prevents it from becoming stale, provided the stream is empty and the SYSTEM$STREAM_HAS_DATA function
  returns `FALSE`.
* When this function returns TRUE, you must consume the stream in a DML operation, whether it’s a false positive or actual
  change data. If you don’t consume the stream, this function keeps returning `TRUE`, and tasks that use this function in their WHEN
  clause won’t skip execution. This results in unnecessary task runs and warehouse charges.

  To consume the stream efficiently when the result is a false positive — for example, querying the stream returns no records —
  use a statement like the following example:

  ```sqlexample
  CREATE TEMPORARY TABLE _unused_table AS SELECT * FROM my_stream WHERE 1=0;
  ```

  This statement counts as a DML operation that consumes the stream, because `CREATE TABLE AS SELECT` is a DML transaction. The
  `WHERE 1=0` clause filters out all data, so nothing gets processed or stored. This operation advances the stream offset, and
  `SYSTEM$STREAM_HAS_DATA` returns `FALSE` until new changes occur.

  Alternatively, run your regular data processing logic — INSERT, UPDATE, MERGE, or other DML statements — on the stream.
  This also consumes the stream and advances its offset, even when the stream contains no change records.

## Examples

> ```sqlexample
> create table MYTABLE1 (id int);
>
> create table MYTABLE2(id int);
>
> create stream MYSTREAM on table MYTABLE1;
>
> insert into MYTABLE1 values (1);
>
> -- returns true because the stream contains change tracking information
> select system$stream_has_data('MYSTREAM');
>
> +----------------------------------------+
> | SYSTEM$STREAM_HAS_DATA('MYSTREAM')     |
> |----------------------------------------|
> | True                                   |
> +----------------------------------------+
>
>  -- consume the stream
> begin;
> insert into MYTABLE2 select id from MYSTREAM;
> commit;
>
> -- returns false because the stream was consumed
> select system$stream_has_data('MYSTREAM');
>
> +----------------------------------------+
> | SYSTEM$STREAM_HAS_DATA('MYSTREAM')     |
> |----------------------------------------|
> | False                                  |
> +----------------------------------------+
> ```

---
title: SYSTEM$SUPPORTED_DBT_VERSIONS
source: https://docs.snowflake.com/en/sql-reference/functions/system_supported_dbt_versions.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$SUPPORTED_DBT_VERSIONS

Returns a JSON array containing the versions that Snowflake supports for dbt Projects.

For more information, see [Supported dbt Core versions for dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake-dbt-core-versions.md).

## Syntax

```sqlsyntax
SYSTEM$SUPPORTED_DBT_VERSIONS()
```

## Arguments

None.

## Returns

Returns a JSON array containing the versions that Snowflake supports for dbt Projects.

## Examples

To view supported versions for your dbt projects, run the following SQL command:

```sqlexample
SELECT SYSTEM$SUPPORTED_DBT_VERSIONS();
```

```output
[{"dbt_version":"1.9.4","type":"dbt Core"},{"dbt_version":"1.10.15","type":"dbt Core"}]
```

---
title: SYSTEM$TASK_DEPENDENTS_ENABLE
source: https://docs.snowflake.com/en/sql-reference/functions/system_task_dependents_enable.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$TASK_DEPENDENTS_ENABLE

Recursively resumes a specified task and all its dependent tasks. This function allows the owner of a [task graph](../../user-guide/tasks-graphs.md)
(like the role with the OWNERSHIP privilege on the tasks) to resume the tasks by executing a single SQL statement rather than resuming each task individually (using [ALTER TASK](../sql/alter-task.md) … RESUME).

For more information about tasks, see [Introduction to tasks](../../user-guide/tasks-intro.md).

## Syntax

```sqlsyntax
SYSTEM$TASK_DEPENDENTS_ENABLE( '<task_name>' )
```

## Arguments

`task_name`
:   Name of a task in a simple task graph. It does not need to be a root task.

## Usage notes

* `task_name` is a string so it must be enclosed in single quotes:

  + Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully qualified), that is, `'<db>.<schema>.<task_name>'`.
  + If the task name is case sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, that is, `'"<task_name>"'`.
  + Accounts are currently limited to a maximum of 30,000 resumed tasks (that is, tasks in a `Started` state) .

## Examples

Resume a specified task and all its dependent tasks in a tree where the specified task has a case-insensitive name:

> ```sqlexample
> SELECT SYSTEM$TASK_DEPENDENTS_ENABLE('mydb.myschema.mytask');
> ```

Resume a specified task and all its dependent tasks in a tree where the specified task has a case-sensitive name:

> ```sqlexample
> SELECT SYSTEM$TASK_DEPENDENTS_ENABLE('mydb.myschema."myTask"');
> ```

---
title: SYSTEM$TASK_RUNTIME_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_task_runtime_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md)

# SYSTEM$TASK_RUNTIME_INFO

Returns information about the current task run. If this function is called outside of a task run, it fails with an error.

## Syntax

```sqlsyntax
SYSTEM$TASK_RUNTIME_INFO('<arg_name>')
```

## Arguments

`'arg_name'`
:   Specifies the type of information to return. You can specify one of the following values:

    | Value | Description |
    | --- | --- |
    | `'CURRENT_TASK_NAME'` | Returns the name of the current task. |
    | `'CURRENT_ROOT_TASK_NAME'` | Returns the name of the root task in the current task graph. |
    | `'CURRENT_ROOT_TASK_UUID'` | Returns a universally unique identifier (UUID) that represents the root task in the current task graph. |
    | `'CURRENT_TASK_GRAPH_RUN_GROUP_ID'` | Returns a universally unique identifier (UUID) that represents the current graph run group. |
    | `'CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP'` | Returns the original scheduled timestamp of the root task in the current graph run group.  For graphs that are retried, the returned value is the original scheduled timestamp of the initial graph run in the current group. |
    | `'LAST_SUCCESSFUL_TASK_GRAPH_RUN_GROUP_ID'` | Returns a universally unique identifier (UUID) that represents the latest successful graph run group.  The value is consistent throughout the graph run group and is determined when the root task of the initial graph run starts. |
    | `'LAST_SUCCESSFUL_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP'` | Returns the original scheduled timestamp of the root task in the latest successful graph run group.  The value is consistent throughout the graph run group and is determined when the root task of the initial graph run starts. |

## Returns

Returns a STRING or TEXT with requested information.

## Usage notes

* We recommend using SELECT instead of CALL for SYSTEM$TASK_RUNTIME_INFO, because SELECT SYSTEM$TASK_RUNTIME_INFO automatically converts datatypes, while CALL SYSTEM$TASK_RUNTIME_INFO doesn’t.

## Examples

Use CURRENT_TASK_GRAPH_RUN_GROUP_ID with CURRENT_ROOT_TASK_NAME for debugging and creating a unique output directory or file:

> ```sqlexample
> CREATE OR REPLACE TASK my_task ...
>   AS
>   ...
>
>   -- Inside Python UDF
>
>   query_result = session.sql("""select
>         SYSTEM$TASK_RUNTIME_INFO('CURRENT_ROOT_TASK_NAME')
>         AS root_name,
>         SYSTEM$TASK_RUNTIME_INFO('CURRENT_TASK_GRAPH_RUN_GROUP_ID')
>         AS run_id""").collect()
>   current_root_task_name, current_graph_run_id = result.ROOT_NAME, result.RUN_ID
>
>   -- Logging information here
>
>   logger.debug(f"start training for {current_root_task_name} at run {current_graph_run_id}")
>
>   -- Create a unique output directory to store intermediate information
>
>   output_dir_name = f"{current_root_task_name}/{current_graph_run_id}/preprocessing.out"
>   with open(output_dir_name, "rw+") as f:
>     ....
> ...;
> ```

Use CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP with LAST_SUCCESSFUL_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP to process data from streaming input source:

> ```sqlexample
> CREATE OR REPLACE TASK my_task ...
>   AS
>   ...
>   INSERT INTO my_output_table
>     SELECT * FROM my_source_table
>       WHERE TRUE
>         ...
>         AND TIMESTAMP BETWEEN
>           COALESCE(
>             SYSTEM$TASK_RUNTIME_INFO('LAST_SUCCESSFUL_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP')::timestamp_ltz,
>             '2023-07-01'
>           ) AND SYSTEM$TASK_RUNTIME_INFO('CURRENT_TASK_GRAPH_ORIGINAL_SCHEDULED_TIMESTAMP')::timestamp_ltz
>    ...;
> ```

Use LAST_SUCCESSFUL_TASK_GRAPH_RUN_GROUP_ID to generate a unique output directory and log lines:

> ```sqlexample
> CREATE OR REPLACE TASK my_task ...
>   AS
>   ...
>
>   -- Inside Python UDF
>
>   query_result = session.sql("select
>       SYSTEM$TASK_RUNTIME_INFO('CURRENT_ROOT_TASK_NAME') AS root_name, SYSTEM$TASK_RUNTIME_INFO('LAST_SUCCESSFUL_TASK_GRAPH_RUN_GROUP_ID') AS last_run_id").collect()
>   current_root_task_name, last_graph_run_id = query_result.ROOT_NAME,query_result.LAST_RUN_ID
>   logger.log(f"graph name: {current_root_task_name}, last successful run: {last_graph_run_id}")
>   ...;
> ```

---
title: SYSTEM$TRIGGER_LISTING_REFRESH
source: https://docs.snowflake.com/en/sql-reference/functions/system_trigger_listing_refresh.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$TRIGGER_LISTING_REFRESH

Triggers a one-time, on-demand data refresh for a provider’s databases or listings, accessible to all consumers. The refresh job begins immediately upon triggering and can be tracked using the [LISTING_REFRESH_HISTORY](listing_refresh_history.md) function. Consumers can track the refresh using the [AVAILABLE_LISTING_REFRESH_HISTORY](available_listing_refresh_history.md) function. You can trigger a listing refresh even if you have already set up a scheduled refresh or interval-based refresh.

> **Note:**
>
> A completed trigger listing refresh will skip the next interval-based refresh.

For details on the refresh types available for your listings, see [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md).

See also:
:   [LISTING_REFRESH_HISTORY](listing_refresh_history.md)

## Syntax

```sqlsyntax
SYSTEM$TRIGGER_LISTING_REFRESH( '<type>' , '<name>' )
```

## Arguments

**Required:**

`'type'`
:   Type of dataset to refresh (`LISTING` or `DATABASE`). Note that the dataset type must be enclosed in single quotes.

`'name'`
:   Name of the listing or database. Note that the entire name must be enclosed in single quotes.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE LISTING AUTO FULFILLMENT | Account | This privilege grants the ability to publish listings to remote regions. |
| USAGE | Listing or database |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For share-based data product listings, the database associated with the listing is replicated and refreshed across all regions managed by
  auto-fulfillment.
* Application and application package data product listings refresh according to the value of the [LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE](../parameters.md)
  parameter set on the account. All listings using this schedule are refreshed simultaneously.

## Examples

```sqlexample
SELECT SYSTEM$TRIGGER_LISTING_REFRESH('DATABASE', 'MY_DATABASE');
```

---
title: SYSTEM$TYPEOF
source: https://docs.snowflake.com/en/sql-reference/functions/system_typeof.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$TYPEOF

Returns a string representing the SQL data type associated with an
expression.

See also:
:   [TYPEOF](typeof.md)

## Syntax

```sqlsyntax
SYSTEM$TYPEOF( <expr> )
```

## Arguments

`expr`
:   The argument can be a column name or a general expression.

## Returns

Returns a VARCHAR value that contains the data type of the input expression, for example, BOOLEAN, NUMBER, ARRAY, OBJECT, etc.

## Usage notes

* If TYPEOF is executed without the SYSTEM$ prefix (i.e. as a regular
  function rather than a system function), it returns different
  results (see [TYPEOF](typeof.md)).

## Examples

```sqlexample
SELECT SYSTEM$TYPEOF(NULL);
```

```output
+---------------------+
| SYSTEM$TYPEOF(NULL) |
|---------------------|
| NULL[LOB]           |
+---------------------+
```

```sqlexample
SELECT SYSTEM$TYPEOF(1);
```

```output
+------------------+
| SYSTEM$TYPEOF(1) |
|------------------|
| NUMBER(1,0)[SB1] |
+------------------+
```

```sqlexample
SELECT SYSTEM$TYPEOF(1e10);
```

```output
+---------------------+
| SYSTEM$TYPEOF(1E10) |
|---------------------|
| NUMBER(11,0)[SB8]   |
+---------------------+
```

```sqlexample
SELECT SYSTEM$TYPEOF(10000);
```

```output
+----------------------+
| SYSTEM$TYPEOF(10000) |
|----------------------|
| NUMBER(5,0)[SB2]     |
+----------------------+
```

```sqlexample
SELECT SYSTEM$TYPEOF('something');
```

```output
+----------------------------+
| SYSTEM$TYPEOF('SOMETHING') |
|----------------------------|
| VARCHAR(9)[LOB]            |
+----------------------------+
```

```sqlexample
SELECT SYSTEM$TYPEOF(CONCAT('every', 'body'));
```

```output
+----------------------------------------+
| SYSTEM$TYPEOF(CONCAT('EVERY', 'BODY')) |
|----------------------------------------|
| VARCHAR(9)[LOB]                        |
+----------------------------------------+
```

---
title: SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_unblock_internal_stages_public_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS

Allows traffic from public IP addresses to access the internal stage of the current Snowflake account on Microsoft Azure.

This function reverses the Azure settings on the internal stage’s Azure storage account that were made when
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS was executed. For details about these Azure settings, refer to [Unblocking public access](../../user-guide/private-internal-stages-azure.md).

See also:
:   [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](system_block_internal_stages_public_access.md), [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](system_internal_stages_public_access_status.md)

## Syntax

> ```sqlsyntax
> SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to internal stages is unblocked | Indicates that the function successfully unblocked public access. |
| Azure Error when attempting to unblock public access to internal stages. Please contact Snowflake support. | Indicates that the function was unable to change the Azure settings in order to unblock public access. |

## Usage notes

* Only account administrators (i.e. users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud Platform are not supported.

## Examples

Allow public IP addresses to access the Azure internal stage.

> ```sqlexample
> USE ROLE accountadmin;
>
> SELECT SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS();
> ```

---
title: SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS
source: https://docs.snowflake.com/en/sql-reference/functions/system_unblock_snowflake_managed_storage_volume_public_access.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS

Allows traffic from public IP addresses to access the Snowflake-managed storage volume of the current Snowflake account on
Microsoft Azure.

This function reverses the Azure settings on the managed storage volume’s Azure storage account that were made when
SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS was executed. For details about these Azure settings, refer to
[Blocking public access](../../user-guide/private-managed-volumes-azure.md).

See also:
:   [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](system_block_snowflake_managed_storage_volume_public_access.md),
    [SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS](system_snowflake_managed_storage_volume_public_access_status.md)

## Syntax

> ```sqlsyntax
> SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS()
> ```

## Arguments

None.

## Returns

This function returns the following status messages:

| Status Message | Description |
| --- | --- |
| Public Access to Snowflake-managed storage volumes is unblocked | Indicates that the function successfully unblocked public access. |
| Azure Error when attempting to unblock public access to Snowflake-managed storage volumes. Please contact Snowflake support. | Indicates that the function was unable to change the Azure settings in order to unblock public access. |
| No interop volumes configured on account | Indicates that there are no Snowflake-managed storage volumes configured for the account. |

## Usage notes

* Only account administrators (that is, users with the ACCOUNTADMIN role) can execute this function.
* This function can take a few minutes to finish executing.
* This function can be used with Snowflake accounts on Azure only. AWS and Google Cloud are not supported.

## Examples

Allow public IP addresses to access the Azure Snowflake-managed storage volume.

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> SELECT SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS();
> ```

---
title: SYSTEM$UNLINK_ORGANIZATION_USER
source: https://docs.snowflake.com/en/sql-reference/functions/system_unlink_organization_user.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNLINK_ORGANIZATION_USER

Unlinks a user object from an [organization user](../../user-guide/organization-users.md) so it can be managed as a local user going forward.

## Syntax

```sqlsyntax
SYSTEM$UNLINK_ORGANIZATION_USER( '<user_name>' )
```

## Arguments

`'user_name'`
:   Name of a user object that was imported from an organization user.

## Examples

```sqlexample
SELECT SYSTEM$UNLINK_ORGANIZATION_USER('jloeb');
```

---
title: SYSTEM$UNLINK_ORGANIZATION_USER_GROUP
source: https://docs.snowflake.com/en/sql-reference/functions/system_unlink_organization_user_group.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNLINK_ORGANIZATION_USER_GROUP

Unlinks an access control role from an [organization user group](../../user-guide/organization-users.md) so it can be managed as a local role going
forward.

## Syntax

```sqlsyntax
SYSTEM$UNLINK_ORGANIZATION_USER_GROUP( '<role>' )
```

## Arguments

`'role'`
:   Name of an access control role that is linked to an organization user group.

## Usage notes

When you unlink an organization user group, user objects that were added to the regular account when the group was imported are also
unlinked.

## Examples

```sqlexample
SELECT SYSTEM$UNLINK_ORGANIZATION_USER_GROUP('marketing_team');
```

---
title: SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT
source: https://docs.snowflake.com/en/sql-reference/functions/system_unregister_privatelink_endpoint.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT

Unregisters a private connectivity endpoint to route your connection to the Snowflake service.

## Syntax

**AWS**

```sqlsyntax
SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT(
  '<aws_private_endpoint_vpce_id>',
  '<aws_account_id>',
  '<token>'
  )
```

**Azure**

```sqlsyntax
SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT(
  '<azure_private_endpoint_link_id>',
  '<azure_private_endpoint_resource_id>',
  '<token>'
  )
```

## Arguments

**AWS**

`aws_private_endpoint_vpce_id`
:   Specifies the identifier for your Amazon Web Services (AWS) virtual private cloud endpoint (AWS VPCEID).

    To obtain the AWS VPCEID value, navigate through the AWS console or use the following command:

    ```bash
    aws ec2 describe-vpc-endpoints
    ```

`aws_account_id`
:   The 12-digit identifier that uniquely identifies your Amazon Web Services (AWS) account, as a string.

    To obtain the AWS account ID value, navigate through the AWS console or use the following command:

    ```bash
    aws sts get-caller-identity
    ```

**Azure**

`azure_private_endpoint_link_id`
:   Specifies the identifier for your Microsoft Azure (Azure) virtual private cloud endpoint link (Azure LinkID).

    To obtain the Azure LinkID value:

    Run the [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](system_get_privatelink_authorized_endpoints.md) system function.

`azure_private_endpoint_resource_id`
:   The identifier that uniquely identifies your Snowflake account in Microsoft Azure (Azure) as a string.

    To obtain the Azure private endpoint resource Id, use the following command:

    ```bash
    az network private-endpoint list --resource-group my_resource_group
    ```

`token`
:   Specifies an access token to verify ownership of the private connectivity endpoint.

    To obtain the token, you must have the corresponding read or describe privilege on the private connectivity endpoint at a minimum.
    For more information, see:

    * [AWS endpoint policies](https://docs.aws.amazon.com/vpc/latest/privatelink/vpc-endpoints-access.html)
    * [Azure private endpoint privileges](https://learn.microsoft.com/en-us/azure/private-link/rbac-permissions#private-endpoint)

    To obtain the token, use the following commands:

    * For Snowflake on AWS:

      ```bash
      aws sts get-federation-token --name snowflake --policy '{ "Version": "2012-10-17", "Statement"
      : [ { "Effect": "Allow", "Action": ["ec2:DescribeVpcEndpoints"], "Resource": ["*"] } ] }'
      ```
    * For Snowflake on Azure:

      ```bash
      az account get-access-token --subscription <subscription_id>
      ```

    For more information about limiting the scope of an access token, see:

    * For Snowflake on AWS: [Managing access token scope on Amazon Web Services](../../user-guide/pin-private-endpoints.md)
    * For Snowflake on Azure: [Managing access token scope on Microsoft Azure](../../user-guide/pin-private-endpoints.md)

## Returns

Returns a status message about the registration of the private connectivity endpoint.

## Usage notes

Only account administrators (users with the ACCOUNTADMIN role) can call this function.

## Examples

Unregister a VPC endpoint for your Snowflake account. Note that the `AccessKeyId`, `SecretAccessKey`, and
`SessionToken` values are truncated:

**AWS**

> ```sqlexample
> SELECT SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT(
>   'vpce-0c1...',
>   '174...',
>   '{
>     "Credentials": {
>       "AccessKeyId": "ASI...",
>       "SecretAccessKey": "aFP...",
>       "SessionToken": "Fwo...",
>       "Expiration": "2024-04-26 05:49:09+00:00"
>     },
>     "FederatedUser": {
>       "FederatedUserId": "0123...:snowflake",
>       "Arn": "arn:aws:sts::174...:federated-user/sam"
>     },
>     "PackedPolicySize": 9
>   }'
> );
> ```

**Azure**

```sqlexample
SELECT SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT(
  '123...',
  '/subscriptions/0cc51670-.../resourceGroups/dbsec_test_rg/providers/Microsoft.Network/
  privateEndpoints/...',
  'eyJ...'
  );
```

---
title: SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND
source: https://docs.snowflake.com/en/sql-reference/functions/system_unset_default_columns_override_for_show_command.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND

Clears the list of columns specified by a previous call to
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md) for a type of object.

For more information, see [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_set_default_columns_override_for_show_command.md) ,
    [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](system_get_default_columns_override_for_show_command.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  '<object_type>'
)
```

## Arguments

`'object_type'`
:   Type of object for the SHOW command. For example, for the SHOW TABLES command, specify `'TABLES'`. For the SHOW NOTIFICATION
    INTEGRATIONS command, specify `'NOTIFICATION INTEGRATIONS'`.

## Returns

Returns TRUE if the operation was successful.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Examples

The following example clears the list of columns set by a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND call for
the [SHOW TABLES](../sql/show-tables.md) command:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'TABLES'
);
```

---
title: SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/system_unset_default_columns_override_for_system_object.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (Control)

# SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT

Clears the list of columns specified by a previous call to
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md) for the specified Snowflake view (for
example, for a specific [ACCOUNT_USAGE view](../account-usage.md) or
[INFORMATION_SCHEMA view](../info-schema.md)).

For more information, see [Handling new columns in SHOW command output and Snowflake views](../../release-notes/behavior-changes-new-columns.md).

See also:
:   [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_set_default_columns_override_for_system_object.md) ,
    [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](system_get_default_columns_override_for_system_object.md) ,
    [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](system_get_all_default_columns_overrides.md)

## Syntax

```sqlsyntax
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  '<object_type>',
  '<database_name>',
  '<schema_name>',
  '<object_name>'
)
```

## Arguments

`'object_type'`
:   Type of the object. You must specify `'VIEW'` for this argument.

`'database_name'`
:   Name of the database that contains the object. You must specify `'SNOWFLAKE'` or, for INFORMATION_SCHEMA views, an empty
    string.

`'schema_name'`
:   Name of the schema that contains the object. You must specify the name of a schema in the
    [SNOWFLAKE database](../snowflake-db.md) or `'INFORMATION_SCHEMA'`.

`'object_name'`
:   Name of the object.

## Returns

Returns TRUE if the operation was successful.

## Access control requirements

Only account administrators (users who have been granted the ACCOUNTADMIN role) can call this function.

## Usage notes

* You must have a database in use (for example, by running [USE DATABASE](../sql/use-database.md)) in order to call this function.
  If no database is in use, the function call fails.

## Examples

The following example clears the list of columns set by a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT call for
the [TABLES view in the ACCOUNT_USAGE schema](../account-usage/tables.md):

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'TABLES'
);
```

The following example clears the list of columns set by a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT call for
the [TABLES view in the INFORMATION_SCHEMA schema](../info-schema/tables.md):

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'INFORMATION_SCHEMA',
  'TABLES'
);
```

---
title: SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION
source: https://docs.snowflake.com/en/sql-reference/functions/system_unset_event_sharing_account_for_region.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION

Unsets the events account for a region.

See also:
:   [SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION](system_set_event_sharing_account_for_region.md)

## Syntax

```sqlsyntax
SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION( '<snowflake_region>' , '<region_group>' , '<account_name>' )
```

## Arguments

`snowflake_region`
:   Specifies the region where the account is located, for example: `AWS_US_WEST_2, AWS_US_EAST_1`.

`region_group`
:   Specifies the region group, for example: `PUBLIC`.

`account_name`
:   Specifies the account name.

## Access control requirements

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute this SQL function.

## Examples

```sqlexample
SELECT SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION('aws_us_west_2', 'public', 'myaccount');
```

---
title: SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS
source: https://docs.snowflake.com/en/sql-reference/functions/system_user_task_cancel_ongoing_executions.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS

Cancels a run of the specified task that the system has already started to process (that is, a run with an EXECUTING state in the [TASK_HISTORY](task_history.md) output).

## Syntax

```sqlsyntax
SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS( '<task_name>' )
```

## Arguments

`task_name`
:   Name of the task.

## Usage notes

* Only the task owner (that is, the role with the OWNERSHIP privilege on the task) or a role with the OPERATE privilege on the task can call this function.
* `task_name` is a string so it must be enclosed in single quotes:

  + The entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified); for example: `'<db>.<schema>.<task_name>'`.
  + If the task name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case or characters. The double quotes must be enclosed within the single quotes; for example: `'"<task_name>"'`.
* This function returns a success message before the current run of the specified task is actually canceled.
* If the current run of the specified task is almost completed, this function might not cancel the run.
* This function only cancels the current run of the specified task. Additional tasks in a [task graph](../../user-guide/tasks-graphs.md) that includes this
  task might also be running. To cancel these runs, you must call this function and specify the name of each additional child task separately.
* If a task is replaced using CREATE OR REPLACE TASK, this function will not be able to cancel the ongoing executions of the previous task.

  To stop an ongoing task run after you replace it with CREATE OR REPLACE TASK:

  1. Find the query ID of the ongoing run; for example:

     ```sqlexample
     select name, query_id, state, scheduled_time, error_message
     from table(information_schema.task_history(task_name => 'my_task'));
     ```
  2. Cancel the query using the [SYSTEM$CANCEL_QUERY](system_cancel_query.md) function with the query ID, for example:

     ```sqlexample
     select system$cancel_query('query_id');
     ```
  3. Monitor the task run for a few seconds until the cancel completes, for example:

     ```sqlexample
     select name, query_id, state, scheduled_time, error_message
     from table(information_schema.task_history(task_name => 'my_task'));
     ```
* To check if a task run has been cancelled or completed, or if any child tasks are currently running, query the
  [TASK_HISTORY](task_history.md) function.
* To prevent future runs of the task from starting, we recommend first suspending the task (using [ALTER TASK … SUSPEND](../sql/alter-task.md)) and then executing this function.

  Note that if the task is not suspended when this function is executed, it currently takes several minutes for the Snowflake cloud services to begin scheduling executions of this task again.

## Examples

Cancel the current run of a task with a case-insensitive name:

> ```sqlexample
> SELECT SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('mydb.myschema.mytask');
> ```

Cancel the current run of a task with a case-sensitive name:

> ```sqlexample
> SELECT SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('mydb.myschema."myTask"');
> ```

The following example shows a successful cancellation of a task run:

```sqlexample
SELECT SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('my_task');

+------------------------------------------------------------------------------------+
| SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('my_task')                              |
|------------------------------------------------------------------------------------|
| Marked 1 task runs for cancellation. It may take a few seconds for cancellation to |
| complete. Query ids canceled: [2036a04c-9c46-4c6b-b354-67a44b5e0b50]               |
+------------------------------------------------------------------------------------+
```

The following example shows that the task has no currently running executions, so the function doesn’t cancel any runs:

```sqlexample
SELECT SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('my_task');

+------------------------------------------------------------------------------------+
| SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS('my_task') |
|------------------------------------------------------------------------------------|
| Task MY_TASK has no currently running executions. If the task was dropped or       |
| replaced after a previous execution started, use SYSTEM$CANCEL_QUERY along with    |
| the query id to cancel the run.                                                    |
|------------------------------------------------------------------------------------|
```

---
title: SYSTEM$VALIDATE_STORAGE_INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_validate_storage_integration.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VALIDATE_STORAGE_INTEGRATION

Validates the configuration for a specified storage integration.
The function attempts to use the storage integration to write, read, list, or delete a file for a storage location that you specify by path.

For more information about configuring storage integrations, see:

* [Snowflake storage integration for AWS](../../user-guide/data-load-s3-config-storage-integration.md)
* [Snowflake storage integration for Google Cloud Storage](../../user-guide/data-load-gcs-config.md)
* [Snowflake storage integration for Microsoft Azure](../../user-guide/data-load-azure-config.md)

See also:
:   [CREATE STORAGE INTEGRATION](../sql/create-storage-integration.md), [ALTER STORAGE INTEGRATION](../sql/alter-storage-integration.md)

## Syntax

```sqlsyntax
SYSTEM$VALIDATE_STORAGE_INTEGRATION( '<storage_integration_name>', '<storage_path>', '<test_file_name>', '<validate_action>' )
```

## Arguments

`storage_integration_name`
:   Name of the storage integration to test.

    Storage integration names are case-sensitive.

`storage_path`
:   The full path to a storage location that you want to validate.
    The storage path must be a URL in the `STORAGE_ALLOWED_LOCATIONS` list for the storage integration.

    **Amazon S3**

    > `'s3://bucket/path/'`
    >
    > > * The `s3` prefix refers to S3 storage in public AWS regions. The `s3gov` prefix refers to S3 storage in
    > >   [government regions](../../user-guide/intro-regions.md).
    > > * `bucket` is the name of an S3 bucket that stores your data files.
    > > * `path` is an optional path or directory in the bucket.

    **Google Cloud Storage**

    > `'gcs://bucket/path/'`
    >
    > > * `bucket` is the name of a GCS bucket that stores your data files.
    > > * `path` is an optional path or directory in the bucket.

    **Microsoft Azure**

    > `'azure://account.blob.core.windows.net/container/path/'`
    >
    > > * `account` is the name of the Azure storage account.
    > > * `container` is the name of an Azure blob storage container that stores your data files.
    > > * `path` is an optional path or directory in the bucket.

`test_file_name`
:   The name of the file to use in storage integration validation.

`validate_action`
:   The validation action to perform.

    Values:
    :   * `read` - Validates that Snowflake can read from the storage location. This action fails if the file doesn’t exist.
        * `write` - Validate that Snowflake can write to the storage location. This action fails if the file already exists.
        * `list` - Validates that Snowflake can list the files in the storage location.
        * `delete` - Validates that Snowflake can delete files in the storage location.
        * `all` - Validates all possible actions in the storage location.

## Returns

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| `status` | The status of the validation test. Returns a status of `success` if all actions completed successfully; returns `failure` if any action didn’t complete as expected. |
| `actions` | Array of objects that contain the requested validation action (`READ`, `DELETE`, `LIST`, `WRITE`) and status. |

```sqljson
{
  "status" : "success",
  "actions" : {
    "READ" : {
      "status" : "success"
    },
    "DELETE" : {
      "status" : "success"
    },
    "LIST" : {
      "status" : "success"
    },
    "WRITE" : {
      "status" : "success"
    }
  }
}
```

## Examples

The following example validates the configuration of the storage integration `example_integration` for all validation actions. The
example returns a successful result in JSON.

```sqlexample
SELECT
  SYSTEM$VALIDATE_STORAGE_INTEGRATION(
    'example_integration',
    's3://example_bucket/test_path/'',
    'validate_all.txt', 'all');
```

Output:

```output
+----------------------------+
|           RESULT           |
+----------------------------+
| {                          |
|   "status" : "success",    |
|   "actions" : {            |
|     "READ" : {             |
|       "status" : "success" |
|     },                     |
|     "DELETE" : {           |
|       "status" : "success" |
|     },                     |
|     "LIST" : {             |
|       "status" : "success" |
|     },                     |
|     "WRITE" : {            |
|       "status" : "success" |
|     }                      |
|   }                        |
| }                          |
+----------------------------+
```

The following example shows the result when the storage integration doesn’t have `read` permissions.

```sqlexample
SELECT
  SYSTEM$VALIDATE_STORAGE_INTEGRATION(
    'example_integration',
    'gcs://example_bucket/test_path/'',
    'read_fail.txt', 'all');
```

Output:

```output
+----------------------------------------------------------------------------------------------------------------+
|                                                     RESULT                                                     |
+----------------------------------------------------------------------------------------------------------------+
| {                                                                                                              |
|   "status" : "failure",                                                                                        |
|   "actions" : {                                                                                                |
|     "READ" : {                                                                                                 |
|       "message" : "Access Denied (Status Code: 403; Error Code: AccessDenied)",                                |
|       "status" : "failure"                                                                                     |
|     },                                                                                                         |
|     "DELETE" : {                                                                                               |
|       "status" : "success"                                                                                     |
|     },                                                                                                         |
|     "LIST" : {                                                                                                 |
|       "status" : "success"                                                                                     |
|     },                                                                                                         |
|     "WRITE" : {                                                                                                |
|       "status" : "success"                                                                                     |
|     }                                                                                                          |
|   },                                                                                                           |
|   "message" : "Some of the integration checks failed. Check the Snowflake documentation for more information." |
| }                                                                                                              |
+----------------------------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$VERIFY_CATALOG_INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/functions/system_verify_catalog_integration.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VERIFY_CATALOG_INTEGRATION

Verifies the configuration for a specified catalog integration for Apache Iceberg™ REST.

To check whether you’ve correctly configured authorization and access control with your Iceberg REST catalog,
the function attempts to use the catalog integration to interact with your catalog server.

See also:
:   [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../sql/create-catalog-integration-rest.md) , [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md)

## Syntax

```sqlsyntax
SYSTEM$VERIFY_CATALOG_INTEGRATION( '<rest_catalog_integration_name>' )
```

## Arguments

`rest_catalog_integration_name`
:   Name of the [Iceberg REST catalog integration](../sql/create-catalog-integration-rest.md) to test.

    Catalog integration names are case sensitive.

## Returns

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| `success` | Specifies whether verification was successful; `true` if successful, otherwise `false`. |
| `errorCode` | Error code of the failure (if verification fails). |
| `errorMessage` | A detailed error message (if verification fails). |

```sqljson
{
  "success" : false,
  "errorCode" : "004140",
  "errorMessage" : "SQL Execution Error: Failed to access the REST endpoint of catalog integration CAT_INT_VERIFICATION with error: Unable to process: Unable to find warehouse my_warehouse. Check the accessibility of the REST catalog URI or warehouse."
}
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Catalog integration |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example statement creates a REST catalog integration
using an invalid OAuth client secret (this runs without error):

```sqlexample
CREATE CATALOG INTEGRATION my_rest_cat_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'default'
  REST_CONFIG = (
    CATALOG_URI = 'https://abc123.us-west-2.aws.myapi.com/polaris/api/catalog'
    CATALOG_NAME = 'my_catalog_name'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = '123AbC ...'
    OAUTH_CLIENT_SECRET = '1365910abIncorrectSecret ...'
    OAUTH_ALLOWED_SCOPES = ('all-apis', 'sql')
  )
  ENABLED = TRUE;
```

Use the system function to verify the catalog integration, expecting failure:

```sqlexample
SELECT SYSTEM$VERIFY_CATALOG_INTEGRATION('my_rest_cat_int');
```

Output:

```output
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
|                                                                                                              SYSTEM$VERIFY_CATALOG_INTEGRATION('MY_REST_CAT_INT')                                                                                                               |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| {                                                                                                                                                                                                                                                                               |
|  "success" : false,                                                                                                                                                                                                                                                             |                                                                                                                                                                                                                                                                    |
|   "errorCode" : "004155",                                                                                                                                                                                                                                                       |
|   "errorMessage" : "SQL Execution Error: Failed to perform OAuth client credential flow for the REST Catalog integration MY_REST_CAT_INT due to error: SQL execution error: OAuth2 Access token request failed with error 'unauthorized_client:The client is not authorized'.." |
| }                                                                                                                                                                                                                                                                               |
+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$VERIFY_CMK_INFO
source: https://docs.snowflake.com/en/sql-reference/functions/system_verify_cmk_info.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VERIFY_CMK_INFO

Verifies your customer-managed key (CMK) configuration and returns a message about the registered CMK.

See also:
:   [Understanding CMK self-registration with support activation of Tri-Secret Secure](../../user-guide/security-encryption-tss.md)

## Syntax

```sqlsyntax
SYSTEM$VERIFY_CMK_INFO( [ '<ssa_account_name>' ] )
```

## Arguments

**Required:**

None.

**Optional:**

`ssa_account_name`
:   A string that specifies the SSA account name for which you want to verify the CMK configuration.

## Returns

Returns a successful status message or, as shown in the following example outputs, information about the unsuccessful verification:

* **AWS**:

  ```output
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  |                                                                                                                                                                                               SYSTEM$VERIFY_CMK_INFO()                                                                                                                                                                                               |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | Verification failed due to an exception with message: Access is denied to the customer managed key (CMK) for this account. This could be because: 1) the CMK access permissions granted to Snowflake have been revoked OR 2) the CMK is disabled OR 3) the CMK is scheduled for deletion OR 4) the CMK specified is wrong. CMK ARN used: arn:aws:kms:us-west-2:736112632311:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59 |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  ```
* **Azure:**:

  ```output
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  |                                                                                                                                                     SYSTEM$VERIFY_CMK_INFO()                                                                                                                                                     |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | Verification failed due to an exception with message: Error received from the customer managed key (CMK) provider caused by user: 'Your request cannot be completed because of the failure of an external dependency. Please try again later.'. CMK KEY URI used: https://trisecretsite.vault.azure.net/keys/TriSecretAZKeyWrong |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  ```
* **Google Cloud**:

  ```output
  +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  |                                                                                                                                                                                                                   SYSTEM$VERIFY_CMK_INFO()                                                                                                                                                                                                                    |
  +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | Verification failed due to an exception with message: Access is denied to the customer managed key (CMK) for this account. This could be because: 1) the CMK access permissions granted to Snowflake have been revoked OR 2) the CMK is disabled OR 3) the CMK is scheduled for deletion OR 4) the CMK specified is wrong. CMK resource ID used: projects/my-env/locations/us-west2/keyRings/TriSecretTest/cryptoKeys/TriSecretGCPKey                         |
  +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  ```

## Access control requirements

* Only users with the ACCOUNTADMIN role or a with role that is granted the MONITOR SECURITY privilege can call this
  function.
* Only users with the GLOBALORGADMIN role or ORGADMIN role can specify an SSA account name.

## Examples

Verify the status of the CMK for your Snowflake account:

> ```sqlexample
> SELECT SYSTEM$VERIFY_CMK_INFO();
> ```

Verify the status of the CMK for a specific SSA account:

> ```sqlexample
> SELECT SYSTEM$VERIFY_CMK_INFO('AUTO_FULFILLMENT_AREA$PUBLIC_AZURE_EASTUS2');
> ```

---
title: SYSTEM$VERIFY_CMK_INFO_POSTGRES
source: https://docs.snowflake.com/en/sql-reference/functions/system_verify_cmk_info_postgres.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VERIFY_CMK_INFO_POSTGRES

Verifies your customer-managed key (CMK) configuration for Snowflake Postgres Tri-Secret Secure and returns a message about the registered CMK.

## Syntax

```sqlsyntax
SYSTEM$VERIFY_CMK_INFO_POSTGRES()
```

## Returns

Returns a successful status message or information about the unsuccessful verification:

* **AWS**:

  ```output
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  |                                                                                                                                                                                               SYSTEM$VERIFY_CMK_INFO_POSTGRES()                                                                                                                                                                                               |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | Verification failed due to an exception with message: Access is denied to the customer managed key (CMK) for this account. This could be because: 1) the CMK access permissions granted to Snowflake have been revoked OR 2) the CMK is disabled OR 3) the CMK is scheduled for deletion OR 4) the CMK specified is wrong. CMK ARN used: arn:aws:kms:us-west-2:736112632311:key/ceab36e4-f0e5-4b46-9a78-86e8f17a0f59 |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  ```
* **Azure:**:

  ```output
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  |                                                                                                                                                     SYSTEM$VERIFY_CMK_INFO_POSTGRES()                                                                                                                                                     |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  | Verification failed due to an exception with message: Error received from the customer managed key (CMK) provider caused by user: 'Your request cannot be completed because of the failure of an external dependency. Please try again later.'. CMK KEY URI used: https://trisecretsite.vault.azure.net/keys/TriSecretAZKeyWrong |
  +----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
  ```

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) or a role that is granted the MONITOR SECURITY privilege on the account
can call this function.

## Examples

Obtain the status of the CMK for your Snowflake account:

> ```sqlexample
> SELECT SYSTEM$VERIFY_CMK_INFO_POSTGRES();
> ```

---
title: SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN
source: https://docs.snowflake.com/en/sql-reference/functions/system_verify_ext_oauth_token.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN

Determines whether your [External OAuth](../../user-guide/oauth-ext-overview.md) access token is valid or has expired and needs to be regenerated.

## Syntax

```sqlsyntax
SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN( '<access_token>' )
```

## Arguments

`access_token`
:   The External OAuth access token generated by your OAuth 2.0 server.

## Output

The function returns a JSON object stating the validation result with a reason. The query result should never display the token itself. For example, an invalid token should return a masked token in the result to ensure that sensitive information is not exposed unnecessarily in Snowflake.

| Column Name | Data Type | Description |
| --- | --- | --- |
| Validation Result | String | A valid token returns `Passed`. . An invalid token returns `Failed`. |
| Reason | String | A valid token returns the Issuer URL and the user. . An invalid token states the problem with the access token (e.g. `EXTERNAL_OAUTH_JWS_INVALID_FORMAT`). |

## Examples

The following example returns a valid External OAuth token result:

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN('<access_token>');

+-----------------------------------------------------------------------------------------------+
| Token Validation finished.{"Validation Result":"Passed","Issuer":"<URL>","User":"<username>"} |
+-----------------------------------------------------------------------------------------------+
```

---
title: SYSTEM$VERIFY_EXTERNAL_VOLUME
source: https://docs.snowflake.com/en/sql-reference/functions/system_verify_external_volume.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$VERIFY_EXTERNAL_VOLUME

Verifies the configuration for a specified [external volume](../../user-guide/tables-iceberg-configure-external-volume.md).

For external volumes with write access, Snowflake attempts the following additional operations to verify the configuration:

* Write a test file.
* Read the test file.
* List the files in the storage location.
* Delete the test file.

See also:
:   [Storage for Apache Iceberg™ tables](../../user-guide/tables-iceberg-storage.md) , [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md) , [CREATE EXTERNAL VOLUME](../sql/create-external-volume.md)

## Syntax

```sqlsyntax
SYSTEM$VERIFY_EXTERNAL_VOLUME('<external_volume_name>')
```

## Arguments

`external_volume_name`
:   Name of the external volume to verify. If the identifier contains spaces or special characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive.

## Returns

The function returns a JSON object with the properties described below:

| Property | Description |
| --- | --- |
| `success` | The status of the verification test. Returns `TRUE` if all actions finished; returns `FALSE` if any action didn’t finish as expected. |
| `storageLocationSelectionResult` | The result of selecting an [active storage location](../../user-guide/tables-iceberg-storage.md) for the external volume. Returns `TRUE` if Snowflake can successfully select an active location; otherwise, returns `FALSE`. |
| `storageLocationName` | The name of the active storage location. |
| `servicePrincipalProperties` | The properties of the Snowflake service principal for the cloud provider of the active storage location. |
| `location` | The `BASE_URL` of the active storage location. |
| `storageAccount` | For Azure, the storage account of the active storage location. |
| `region` | The region of the active storage location. |
| `writeResult` | The result of writing a test file to the active storage location. Skipped for read-only external volumes. |
| `readResult` | The result of reading a test file from the active storage location. Skipped for read-only external volumes. |
| `listResult` | The result of listing the contents of the active storage location. Skipped for read-only external volumes. |
| `deleteResult` | The result of deleting a test file written to the active storage location. Skipped for read-only external volumes. |
| `awsRoleArnValidationResult` | For Amazon S3, returns the result of validating the Amazon Resource Name (ARN) for the IAM role used by the external volume. |
| `azureGetUserDelegationKeyResult` | For Azure, returns the result of getting a user delegation key. |

### Result values

Return properties that indicate a result can have the following values:

| Result value | Description |
| --- | --- |
| `PASSED` | The operation succeeded. |
| `SKIPPED` | The operation isn’t applicable for the specified external volume. For example, the read, write, list, and delete operations are skipped for read-only external volumes. |
| `<error_message>` | A detailed error message. |

### Example output

```json
{
  "success": true,
  "storageLocationSelectionResult": "PASSED",
  "storageLocationName": "my-azure-westus-1",
  "servicePrincipalProperties": "AZURE_MULTI_TENANT_APP_NAME: powerful-azure-ad-auth-test-snowflake-app_...; AZURE_CONSENT_URL: https://login.microsoftonline.com...",
  "location": "azure://myStorageAccount.blob.core.windows.net/myStorageLocation/",
  "storageAccount": "myStorageAccount",
  "region": "westus",
  "writeResult": "PASSED",
  "readResult": "PASSED",
  "listResult": "PASSED",
  "deleteResult": "PASSED",
  "awsRoleArnValidationResult": "SKIPPED",
  "azureGetUserDelegationKeyResult": "PASSED"
}
```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | External volume |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For Amazon S3 external volumes:

  If you receive the following error, your account administrator must activate AWS STS in the Snowflake deployment region.
  For instructions, see
  [Manage AWS STS in an AWS Region](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_credentials_temp_enable-regions.html)
  in the AWS documentation.

  ```output
  Error assuming AWS_ROLE:
  STS is not activated in this region for account:<external volume id>. Your account administrator can activate STS in this region using the IAM Console.
  ```

## Examples

Verify an external volume named `my_s3_external_volume`:

```sqlexample
SELECT SYSTEM$VERIFY_EXTERNAL_VOLUME('my_s3_external_volume');
```

---
title: SYSTEM$WAIT
source: https://docs.snowflake.com/en/sql-reference/functions/system_wait.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$WAIT

Waits for the specified amount of time before proceeding.

## Syntax

```sqlsyntax
SYSTEM$WAIT( amount [ , time_unit ] )
```

## Arguments

**Required:**

`amount`
:   Number specifying the amount of time to wait as determined by `time_unit`.

**Optional:**

`time_unit`
:   Time unit for `amount`. Accepted values are DAYS, HOURS, MINUTES, SECONDS, MILLISECONDS, MICROSECONDS, NANOSECONDS.
    The unit should be in single quotes (see Examples below).

    Default: SECONDS

## Usage notes

* Most systems do not have clocks that have nanosecond precision. As a result:

  > + The actual wait time might not be exactly the same as the specified wait time.
  > + The reported wait time might not be exact.
* SYSTEM$WAIT checks periodically for cancellation. If a user cancels a query while it is waiting, there might be a delay between the
  time the query is cancelled and the time the cancellation takes effect.
* If the wait period exceeds the compilation timeout, the query is not cancelled automatically. After the wait, the query resumes normally.

## Examples

> ```sqlexample
> CALL SYSTEM$WAIT(10);
>
> -------------------+
>     SYSTEM$WAIT    |
> -------------------+
>  waited 10 seconds |
> -------------------+
> ```
>
> ```sqlexample
> CALL SYSTEM$WAIT(2, 'MINUTES');
>
> -------------------+
>     SYSTEM$WAIT    |
> -------------------+
>  waited 2 minutes  |
> -------------------+
> ```

---
title: SYSTEM$WAIT_FOR_SERVICES
source: https://docs.snowflake.com/en/sql-reference/functions/system_wait_for_services.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$WAIT_FOR_SERVICES

Waits for one or more [Snowpark Container Services services](../../developer-guide/snowpark-container-services/working-with-services.md) to reach the READY state (or becomes upgraded) before returning.

* All services with names passed to the system function have READY status.
* Any of the named services has the FAILED status.
* The pause duration has reached the specified time duration, in seconds.

You might use this function, for example, in a Native App scenario to pause the native app (with containers) setup script to allow for the services to upgrade correctly. For more information, see [Upgrade an app with containers](../../developer-guide/native-apps/update-app-upgrade.md).

See also:
:   [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md)

## Syntax

```sqlsyntax
SYSTEM$WAIT_FOR_SERVICES( <seconds_to_pause>, '<service_name>' [, ...] )
```

## Arguments

`seconds_to_pause`
:   Number of seconds to pause.

`service_name [ , ... ]`
:   Names of one or more services to wait for.

## Returns

‘OK’ or fails in case of timeout.

## Usage notes

* The current role must have the MONITOR privilege on the services listed in the command.

## Examples

The following statement causes the setup script to pause until one of the following occurs:

* All the three named services passed to the system function have the READY status.
* Any of the named services has the FAILED status.
* 600 seconds have passed.

```sqlexample
SELECT SYSTEM$WAIT_FOR_SERVICES(600, 'service-name-1', 'service-name-2', 'service-name-3');
```

---
title: SYSTEM$WHITELIST — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_whitelist.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$WHITELIST — *Deprecated*

Returns hostnames and port numbers to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall.
The output of this function can then be passed into [SnowCD](../../user-guide/snowcd.md).

Typically, Snowflake customers use a firewall to prevent unauthorized access. By default, your firewall might block access to Snowflake. To
update your firewall’s allowed list, you need to know the hostnames and port numbers for the URL for your
[Snowflake account](../../user-guide/admin-account-identifier.md), stages, and other hosts used by Snowflake.

For more details about the allowed listing for the Snowflake clients you use, see [Allowing Host names](../../user-guide/hostname-allowlist.md).

## Syntax

```sqlsyntax
SYSTEM$WHITELIST()
```

## Arguments

None.

## Returns

The data type of the returned value is VARIANT. The value is an array of JSON structures. Each JSON structure contains three
key/value pairs:

`type`
:   Snowflake supports the following types:

    `SNOWFLAKE_DEPLOYMENT`
    :   Host name and port number information for your Snowflake account.

    `SNOWFLAKE_DEPLOYMENT_REGIONLESS`
    :   Host name and port number information for your [organization](../../user-guide/organizations.md).

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md).

    `STAGE`
    :   Location (such as Amazon S3, Google Cloud Storage, or Microsoft Azure) where files that the Snowflake client can read or write are stored.

    `SNOWSQL_REPO`
    :   Endpoint accessed by SnowSQL to perform automatic downloads or upgrades.

    `OUT_OF_BAND_TELEMETRY`
    :   The hosts to which drivers report metrics and out-of-band incidents such as OCSP issues.

    `CLIENT_FAILOVER`
    :   Host name and port number for the connection URL for [Client Redirect](../../user-guide/client-redirect.md). Note that each row in the query
        output that specifies this value refers to either the primary connection or the secondary connection depending on how the connection
        URLs were configured.

    `CRL_DISTRIBUTION_POINT`
    :   Host name and port number for certificate revocation list (CRL) distribution endpoints.

    `OCSP_CACHE`
    :   Snowflake-provided alternative source of OCSP certificate information in case the primary OCSP responder cannot be reached. Most of the
        latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CACHE_REGIONLESS`
    :   Snowflake-provided alternative source of OCSP certificate information for your [organization](../../user-guide/organizations.md). Most of
        the latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CLIENT_FAILOVER`
    :   Snowflake-provided alternative source of OCSP certificate information for [Client Redirect](../../user-guide/client-redirect.md).

    `DUO_SECURITY`
    :   The host name for the Duo Security service that is used with [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md) while authenticating to Snowflake.

    `OCSP_RESPONDER`
    :   Host name to contact to verify that the OCSP TLS certificate has not been revoked.

        Note that this value is not necessary when configuring private connectivity to the Snowflake service ; follow the instructions in the
        corresponding topic to select the OCSP value to add to your allowlist.

    `SNOWSIGHT_DEPLOYMENT_REGIONLESS`
    :   Host name and port number for your [organization](../../user-guide/organizations.md) to access Snowsight.

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

    `SNOWSIGHT_DEPLOYMENT`
    :   Host name and port number to access [Snowsight](../../user-guide/ui-snowsight.md) for your Snowflake account.

`host`
:   Specifies the full host name for `type`, for example: `"xy12345.east-us-2.azure.snowflakecomputing.com"`, `"ocsp.snowflakecomputing.com"`.

`port`
:   Specifies the port number for `type`, for example: `443`, `80`.

## Usage notes

* The output may include multiple entries for certain types (e.g. `STAGE`, `OCSP_RESPONDER`).

## Examples

To call the function:

> ```sqlexample
> SELECT SYSTEM$WHITELIST();
> ```
>
> Sample output:
>
> ```sqljson
> [
>   {"type":"SNOWFLAKE_DEPLOYMENT", "host":"xy12345.snowflakecomputing.com",                 "port":443},
>   {"type":"STAGE",                "host":"sfc-customer-stage.s3.us-west-2.amazonaws.com",  "port":443},
>   ...
>   {"type":"SNOWSQL_REPO",         "host":"sfc-repo.snowflakecomputing.com",                "port":443},
>   ...
>   {"type":"OCSP_CACHE",           "host":"ocsp.snowflakecomputing.com",                    "port":80}
>   {"type":"OCSP_RESPONDER",       "host":"o.ss2.us",                                       "port":80},
>   ...
> ]
> ```
>
> In this sample output, note the following:
>
> * For readability, whitespace and newline characters have been added. In addition, some entries have been omitted.
> * The region ID (`us-west-2`) in some of the hostnames indicates the account is in the US West region; however, the region ID
>   is not utilized in the hostname for `SNOWFLAKE_DEPLOYMENT`.

To extract the information into tabular output rather than JSON, use the [FLATTEN](flatten.md) function in conjunction with the [PARSE_JSON](parse_json.md)
function:

> ```sqlexample
> SELECT t.VALUE:type::VARCHAR as type,
>        t.VALUE:host::VARCHAR as host,
>        t.VALUE:port as port
> FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$WHITELIST()))) AS t;
> ```
>
> Sample output:
>
> ```none
> +-----------------------+---------------------------------------------------+------+
> | TYPE                  | HOST                                              | PORT |
> |-----------------------+---------------------------------------------------+------|
> | SNOWFLAKE_DEPLOYMENT  | xy12345.snowflakecomputing.com                    | 443  |
> | STAGE                 | sfc-customer-stage.s3.us-west-2.amazonaws.com     | 443  |
>   ...
> | SNOWSQL_REPO          | sfc-repo.snowflakecomputing.com                   | 443  |
>   ...
> | OCSP_CACHE            | ocsp.snowflakecomputing.com                       | 80   |
> | OCSP_RESPONDER        | ocsp.sca1b.amazontrust.com                        | 80   |
>   ...
> +-----------------------+---------------------------------------------------+------+
> ```

---
title: SYSTEM$WHITELIST_PRIVATELINK — Deprecated
source: https://docs.snowflake.com/en/sql-reference/functions/system_whitelist_privatelink.md
section: SQL Functions
---

Categories:
:   [System functions](../functions-system.md) (System Information)

# SYSTEM$WHITELIST_PRIVATELINK — *Deprecated*

Returns hostnames and port numbers for [AWS PrivateLink](https://aws.amazon.com/privatelink/),
[Azure Private Link](https://azure.microsoft.com/en-us/services/private-link/), and
[Google Cloud Private Service Connect](https://cloud.google.com/vpc/docs/configure-private-service-connect-services) deployments to add
to your firewall’s allowed list so that you can access Snowflake from behind your firewall. These features provide private connectivity to
the Snowflake service on each supported cloud platform.

The output of this function can then be passed into [SnowCD](../../user-guide/snowcd.md) to diagnose and troubleshoot your network connection
to Snowflake.

Typically, Snowflake customers use a firewall to prevent unauthorized access. By default, your firewall might block access to Snowflake. To
update your firewall’s allowed list, you need to know the hostnames and port numbers for the URL associated with your Snowflake
[account identifier](../../user-guide/admin-account-identifier.md), stages, and other hosts used by Snowflake.

For more details about allowed lists for the Snowflake clients you use, see [Allowing Host names](../../user-guide/hostname-allowlist.md).

## Syntax

```sqlsyntax
SYSTEM$WHITELIST_PRIVATELINK()
```

## Arguments

None.

## Returns

The data type of the returned value is `VARIANT`. The value is an array of JSON structures. Each JSON structure contains three key/value
pairs:

`type`
:   Snowflake supports the following types:

    `SNOWFLAKE_DEPLOYMENT`
    :   Host name and port number information for your Snowflake account.

    `SNOWFLAKE_DEPLOYMENT_REGIONLESS`
    :   Host name and port number information for your [organization](../../user-guide/organizations.md).

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md).

    `STAGE`
    :   Location (such as Amazon S3, Google Cloud Storage, or Microsoft Azure) where files that the Snowflake client can read or write are stored.

    `SNOWSQL_REPO`
    :   Endpoint accessed by SnowSQL to perform automatic downloads or upgrades.

    `OUT_OF_BAND_TELEMETRY`
    :   The hosts to which drivers report metrics and out-of-band incidents such as OCSP issues.

    `CLIENT_FAILOVER`
    :   Host name and port number for the connection URL for [Client Redirect](../../user-guide/client-redirect.md). Note that each row in the query
        output that specifies this value refers to either the primary connection or the secondary connection depending on how the connection
        URLs were configured.

    `CRL_DISTRIBUTION_POINT`
    :   Host name and port number for certificate revocation list (CRL) distribution endpoints.

    `OCSP_CACHE`
    :   Snowflake-provided alternative source of OCSP certificate information in case the primary OCSP responder cannot be reached. Most of the
        latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CACHE_REGIONLESS`
    :   Snowflake-provided alternative source of OCSP certificate information for your [organization](../../user-guide/organizations.md). Most of
        the latest versions of the Snowflake clients access the OCSP cache rather than connecting directly to the OCSP responder.

    `OCSP_CLIENT_FAILOVER`
    :   Snowflake-provided alternative source of OCSP certificate information for [Client Redirect](../../user-guide/client-redirect.md).

    `DUO_SECURITY`
    :   The host name for the Duo Security service that is used with [Multi-factor authentication (MFA)](../../user-guide/security-mfa.md) while authenticating to Snowflake.

    `OCSP_RESPONDER`
    :   Host name to contact to verify that the OCSP TLS certificate has not been revoked.

        Note that this value is not necessary when configuring private connectivity to the Snowflake service ; follow the instructions in the
        corresponding topic to select the OCSP value to add to your allowlist.

    `SNOWSIGHT_DEPLOYMENT_REGIONLESS`
    :   Host name and port number for your [organization](../../user-guide/organizations.md) to access Snowsight.

        For more information, see [Account identifiers](../../user-guide/admin-account-identifier.md) and [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

    `SNOWSIGHT_DEPLOYMENT`
    :   Host name and port number to access [Snowsight](../../user-guide/ui-snowsight.md) for your Snowflake account.

`host`
:   Specifies the full host name for `type`, for example: `"xy12345.east-us-2.azure.snowflakecomputing.com"`, `"ocsp.snowflakecomputing.com"`.

`port`
:   Specifies the port number for `type`, for example: `443`, `80`.

## Usage notes

* The output may include multiple entries for certain types (`STAGE`, etc.).

## Examples

To call the function:

> ```sqlexample
> SELECT SYSTEM$WHITELIST_PRIVATELINK();
> ```
>
> Sample output:
>
> ```sqljson
> [
>   {"type":"SNOWFLAKE_DEPLOYMENT", "host":"xy12345.us-west-2.privatelink.snowflakecomputing.com","port":443},
>   {"type":"STAGE",                "host":"sfc-ss-ds2-customer-stage.s3.us-west-2.amazonaws.com","port":443},
>   ...
>   {"type":"SNOWSQL_REPO",         "host":"sfc-repo.snowflakecomputing.com",                     "port":443},
>   ...
>   {"type":"OUT_OF_BAND_TELEMETRY","host":"client-telemetry.snowflakecomputing.com","port":443},
>   {"type":"OCSP_CACHE",           "host":"ocsp.station00752.us-west-2.privatelink.snowflakecomputing.com","port":80}
> ]
> ```
>
> In this sample output, note the following:
>
> * For readability, whitespace and newline characters have been added. In addition, some entries have been omitted.
> * The region ID (`us-west-2`) in some of the hostnames indicates the account is in the US West region ; however, the region ID is not utilized in the hostname for `SNOWFLAKE_DEPLOYMENT`.

To extract the information into tabular output rather than JSON, use the [FLATTEN](flatten.md) function in conjunction with the [PARSE_JSON](parse_json.md) function:

> ```sqlexample
> SELECT t.VALUE:type::VARCHAR as type,
>        t.VALUE:host::VARCHAR as host,
>        t.VALUE:port as port
> FROM TABLE(FLATTEN(input => PARSE_JSON(SYSTEM$WHITELIST_PRIVATELINK()))) AS t;
> ```
>
> Sample output:
>
> ```none
> +-----------------------+---------------------------------------------------+------+
> | TYPE                  | HOST                                              | PORT |
> +-----------------------+---------------------------------------------------+------+
> | SNOWFLAKE_DEPLOYMENT  | xy12345.snowflakecomputing.com                    | 443  |
> | STAGE                 | sfc-customer-stage.s3.us-west-2.amazonaws.com     | 443  |
>   ...
> | SNOWSQL_REPO          | sfc-repo.snowflakecomputing.com                   | 443  |
>   ...
> | OCSP_CACHE            | ocsp.snowflakecomputing.com                       | 80   |
>   ...
> +-----------------------+---------------------------------------------------+------+
> ```

---
title: SYSTIMESTAMP
source: https://docs.snowflake.com/en/sql-reference/functions/systimestamp.md
section: SQL Functions
---

Categories:
:   [Context functions](../functions-context.md) (General)

# SYSTIMESTAMP

Returns the current timestamp for the system.

See also:
:   [CURRENT_TIMESTAMP](current_timestamp.md)

## Syntax

```sqlsyntax
SYSTIMESTAMP()
```

## Arguments

None. This function must be called with parentheses.

## Returns

Returns the current system time in the local time zone. The data type of the returned value is
[TIMESTAMP_LTZ](../data-types-datetime.md).

## Usage notes

* The setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned timestamp is in the time zone for the session.
* The setting of the [TIMESTAMP_TYPE_MAPPING](../parameters.md) parameter does not affect the return value.
* Do not use the returned value for precise time ordering between concurrent queries (processed by the same virtual warehouse) because the queries might be serviced by different compute resources (in the warehouse).

* This function does not support the `fract_sec_precision` argument that is supported by
  the [CURRENT_TIMESTAMP](current_timestamp.md) function.

## Examples

Show the current system timestamp:

```sqlexample
SELECT SYSTIMESTAMP();
```

```output
+--------------------------+
| SYSTIMESTAMP()           |
|--------------------------|
| 2024-04-17 15:49:34.0800 |
+--------------------------+
```

---
title: TAG_REFERENCES
source: https://docs.snowflake.com/en/sql-reference/functions/tag_references.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# TAG_REFERENCES

Returns a table in which each row displays an association between a tag and value.

The associated tag and value are the result of a direct association to an object or through [tag inheritance](../../user-guide/object-tagging/inheritance.md).

## Syntax

```sqlsyntax
TAG_REFERENCES( '<object_name>' , '<object_domain>' )
```

## Arguments

`'object_name'`
:   Name of the referenced object if the tag association is on the object.

`'object_domain'`
:   Domain of the reference object, such as a table or view, if the tag association is on the object. For columns, the domain is `COLUMN`
    if the tag association is on a column.

    Use one of the following values:

    > * `'ACCOUNT'`
    > * `'ALERT'`
    > * `'BACKUP POLICY'`
    > * `'BACKUP SET'`
    > * `'COLUMN'`
    > * `'COMPUTE POOL'`
    > * `'CORTEX AGENT'`
    > * `'DATABASE'`
    > * `'DATABASE ROLE'`
    > * `'FAILOVER GROUP'`
    > * `'FUNCTION'`
    > * `'INTEGRATION'`
    > * `'INSTANCE'`
    > * `'NETWORK POLICY'`
    > * `'PROCEDURE'`
    > * `'REPLICATION GROUP'`
    > * `'ROLE'`
    > * `'SCHEMA'`
    > * `'SHARE'`
    > * `'SNAPSHOT POLICY'` (deprecated; prefer `'BACKUP POLICY'`)
    > * `'SNAPSHOT SET'` (deprecated; prefer `'BACKUP SET'`)
    > * `'SNOWFLAKE INTELLIGENCE'`
    > * `'STAGE'`
    > * `'STREAM'`
    > * `'TABLE'`: Use this for all table-like objects such as views, materialized views, and external tables.
    > * `'TASK'`
    > * `'USER'`
    > * `'WAREHOUSE'`

## Usage notes

* Results are only returned for a role that has access to the specified object.

  To view references for [system tags](../../user-guide/classify-intro.md) associated with sensitive data classification, use a role
  with IMPORTED PRIVILEGES on the shared SNOWFLAKE database.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function
  must use the fully-qualified object name. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| TAG_DATABASE | TEXT | The database in which the tag is set. |
| TAG_SCHEMA | TEXT | The schema in which the tag is set. |
| TAG_NAME | TEXT | The name of the tag. This is the `key` in the `key = 'value'` pair of the tag. |
| TAG_VALUE | TEXT | The value of the tag. This is the `'value'` in the `key = 'value'` pair of the tag. |
| APPLY_METHOD | TEXT | Specifies how the tag got assigned to the object. Possible values include the following:   * `CLASSIFIED`: The tag was automatically applied to a column that was classified as containing sensitive data. See [About tag mapping](../../user-guide/classify-auto.md). * `INHERITED`: The object inherited the tag from an object higher up in the Snowflake securable object hierarchy. See [Tag inheritance](../../user-guide/object-tagging/inheritance.md). * `MANUAL`: Someone manually set the tag on the object using a CREATE <object> or ALTER <object> command. See [Set a tag](../../user-guide/object-tagging/work.md). * `PROPAGATED`: The tag was automatically propagated from one object to another. See [Automatic tag propagation with user-defined tags](../../user-guide/object-tagging/propagation.md). * `NULL`: Legacy record. * `NONE`: Legacy record. |
| LEVEL | TEXT | The object domain on which the tag is set. |
| OBJECT_DATABASE | TEXT | Database name of the referenced object for database and schema objects. If the object is not a database or schema object, the value is empty. |
| OBJECT_SCHEMA | TEXT | Schema name of the referenced object (for schema objects). If the referenced object is not a schema object (e.g. warehouse), this value is empty. |
| OBJECT_NAME | TEXT | Name of the reference object if the tag association is on the object. |
| DOMAIN | TEXT | Domain of the reference object (e.g. table, view) if the tag association is on the object. If the tag association is on a column, the domain is COLUMN. |
| COLUMN_NAME | TEXT | Name of the referenced column; not applicable if the tag association is not a column. |

## Examples

Retrieve the list of tags associated with the table `my_table`:

> ```sqlexample
> select *
>   from table(my_db.information_schema.tag_references('my_table', 'table'));
> ```

Retrieve the list of tags associated on the column `result`:

> ```sqlexample
> select *
>   from table(my_db.information_schema.tag_references('my_table.result', 'COLUMN'));
> ```

---
title: TAG_REFERENCES_ALL_COLUMNS
source: https://docs.snowflake.com/en/sql-reference/functions/tag_references_all_columns.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# TAG_REFERENCES_ALL_COLUMNS

Returns a table in which each row displays the tag name and tag value assigned to a specific column.

This function returns every tag set on every column in a given table or view, whether the tag is directly assigned to a column or through
[tag inheritance](../../user-guide/object-tagging/inheritance.md).

## Syntax

```sqlsyntax
TAG_REFERENCES_ALL_COLUMNS( '<object_name>' , '<object_domain>' )
```

## Arguments

`'object_name'`
:   Name of the referenced object if the tag association is on the object.

    This argument supports the names for tables and views.

`'object_domain'`
:   Domain of the referenced object.

    Snowflake supports one domain for this function: `TABLE`.

    Note that the domain `TABLE` must be used for all objects that contain columns, even if the object name is a view
    (i.e. view, materialized view).

## Usage notes

* Results are only returned for a role that has access to the specified object.

  To view references for [system tags](../../user-guide/classify-intro.md) associated with sensitive data classification, use a role
  with IMPORTED PRIVILEGES on the shared SNOWFLAKE database.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function
  must use the fully-qualified object name. For more details, see [Snowflake Information Schema](../info-schema.md).

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| TAG_DATABASE | TEXT | The database in which the tag is set. |
| TAG_SCHEMA | TEXT | The schema in which the tag is set. |
| TAG_NAME | TEXT | The name of the tag. This is the `key` in the `key = 'value'` pair of the tag. |
| TAG_VALUE | TEXT | The value of the tag. This is the `'value'` in the `key = 'value'` pair of the tag. |
| APPLY_METHOD | TEXT | Specifies how the tag got assigned to the object. Possible values include the following:   * `CLASSIFIED`: The tag was automatically applied to a column that was classified as containing sensitive data. See [About tag mapping](../../user-guide/classify-auto.md). * `INHERITED`: The object inherited the tag from an object higher up in the Snowflake securable object hierarchy. See [Tag inheritance](../../user-guide/object-tagging/inheritance.md). * `MANUAL`: Someone manually set the tag on the object using a CREATE <object> or ALTER <object> command. See [Set a tag](../../user-guide/object-tagging/work.md). * `PROPAGATED`: The tag was automatically propagated from one object to another. See [Automatic tag propagation with user-defined tags](../../user-guide/object-tagging/propagation.md). * `NULL`: Legacy record. * `NONE`: Legacy record. |
| LEVEL | TEXT | The object domain on which the tag is set. |
| OBJECT_DATABASE | TEXT | The database name containing the table or view. |
| OBJECT_SCHEMA | TEXT | The schema name containing the table or view. |
| OBJECT_NAME | TEXT | The name of the table or view. |
| DOMAIN | TEXT | This value should be `COLUMN` since this function returns all tags set on all columns in the table or view. |
| COLUMN_NAME | TEXT | The name of the column that the tag is set on. |

## Examples

Retrieve the list of tags that are assigned to every column in the table `my_table`:

> ```sqlexample
> select *
>   from table(my_db.information_schema.tag_references_all_columns('my_table', 'table'));
> ```

---
title: TAG_REFERENCES_WITH_LINEAGE
source: https://docs.snowflake.com/en/sql-reference/functions/tag_references_with_lineage.md
section: SQL Functions
---

Categories:
:   [Account Usage table functions](../account-usage.md) , [Table functions](../functions-table.md)

# TAG_REFERENCES_WITH_LINEAGE

Returns a table in which each row displays an association between the specified tag and the Snowflake object to which the tag is associated.

The associated tag and Snowflake object are the result of both a direct association to an object and
[tag inheritance](../../user-guide/object-tagging/inheritance.md).

## Syntax

```sqlsyntax
TAG_REFERENCES_WITH_LINEAGE( '<name>' )
```

## Arguments

`'name'`
:   The fully qualified name of the tag.

    The fully qualified name must specify the parent tag database and tag schema for the tag in the following format:

    > `<tag_database>.<tag_schema>.<tag_name>`

## Usage notes

* Results are only returned for the role that has access to the specified object.
* This function doesn’t support system tags used by sensitive data classification.
* When calling an Account Usage table function, the session must have an Account Usage schema in use. For more details, see
  [Account Usage](../account-usage.md).
* Similar to the Account Usage views, please account for latency when calling this table function. The expected latency for this table
  function is similar to the latency for the [TAG_REFERENCES](../account-usage.md) view.

## Output

The function returns the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| TAG_DATABASE | TEXT | The database in which the tag is set. |
| TAG_SCHEMA | TEXT | The schema in which the tag is set. |
| TAG_ID | NUMBER | Internal/system-generated identifier for the tag. |
| TAG_NAME | TEXT | The name of the tag. This is the `key` in the `key = 'value'` pair of the tag. |
| TAG_VALUE | TEXT | The value of tag. This is the `'value'` in the `key = 'value'` pair of the tag. |
| LEVEL | TEXT | The object domain on which the tag is set. |
| OBJECT_DATABASE | TEXT | Database name of the referenced object for database and schema objects. If the object is not a database or schema object, the value is empty. |
| OBJECT_SCHEMA | TEXT | Schema name of the referenced object (for schema objects). If the referenced object is not a schema object (e.g. warehouse), this value is empty. |
| OBJECT_ID | NUMBER | Internal/system-generated identifier for the object. |
| OBJECT_NAME | TEXT | Name of the referenced object if the tag association is on the object. |
| OBJECT_DELETED | TIMESTAMP_LTZ | Date and time when the associated object or column was dropped, or if the parent object is dropped. |
| DOMAIN | TEXT | Domain of the reference object (e.g. table, view) if the tag association is on the object. If the tag association is on a column, the domain is COLUMN. |
| COLUMN_ID | NUMBER | Internal/system-generated identifier for the column. |
| COLUMN_NAME | TEXT | Name of the referenced column; not applicable if the tag association is not a column. |
| APPLY_METHOD | TEXT | Specifies how the tag got assigned to the object. Possible values include the following:   * `CLASSIFIED`: The tag was automatically applied to a column that was classified as containing sensitive data. See [About tag mapping](../../user-guide/classify-auto.md). * `INHERITED`: The object inherited the tag from an object higher up in the Snowflake securable object hierarchy. See [Tag inheritance](../../user-guide/object-tagging/inheritance.md). * `MANUAL`: Someone manually set the tag on the object using a CREATE <object> or ALTER <object> command. See [Set a tag](../../user-guide/object-tagging/work.md). * `PROPAGATED`: The tag was automatically propagated from one object to another. See [Automatic tag propagation with user-defined tags](../../user-guide/object-tagging/propagation.md). * `NULL`: Legacy record. * `NONE`: Legacy record. |

## Examples

Retrieve the list of tag associations for the `cost_center` tag:

```sqlexample
select *
  from table(snowflake.account_usage.tag_references_with_lineage('MY_DB.MY_SCHEMA.COST_CENTER'));
```

> **Note:**
>
> The tag name must be written in uppercase letters.

---
title: TAN
source: https://docs.snowflake.com/en/sql-reference/functions/tan.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# TAN

Computes the tangent of its argument; the argument should be expressed in
radians.

## Syntax

```sqlsyntax
TAN( <input_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The value must be in
    radians, not degrees. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT TAN(0), TAN(PI()/3), TAN(RADIANS(90));
```

```output
+--------+-------------+----------------------+
| TAN(0) | TAN(PI()/3) |     TAN(RADIANS(90)) |
|--------+-------------+----------------------|
|      0 | 1.732050808 | 1.63312393531954e+16 |
+--------+-------------+----------------------+
```

---
title: TANH
source: https://docs.snowflake.com/en/sql-reference/functions/tanh.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Trigonometric)

# TANH

Computes the hyperbolic tangent of its argument.

## Syntax

```sqlsyntax
TANH( <real_expr> )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be FLOAT.

## Returns

This function returns a value of type FLOAT.

## Examples

```sqlexample
SELECT TANH(1.5);
```

```output
+--------------+
|    TANH(1.5) |
|--------------|
| 0.9051482536 |
+--------------+
```

---
title: TASK_DEPENDENTS
source: https://docs.snowflake.com/en/sql-reference/functions/task_dependents.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# TASK_DEPENDENTS

This table function returns the list of child [tasks](../../user-guide/tasks-intro.md) for a given root task in a
[task graph](../../user-guide/tasks-graphs.md).

## Syntax

```sqlsyntax
TASK_DEPENDENTS(
      TASK_NAME => '<string>'
      [, RECURSIVE => <Boolean> ] )
```

## Arguments

`TASK_NAME => 'string'`
:   A string specifying a task. The function returns the specified root task as the first entry, followed by the list of child tasks.

    * Note that the entire name must be enclosed in single quotes, including the database and schema (if the name is fully-qualified), i.e. `'<db>.<schema>.<task_name>'`.
    * If the task name is case-sensitive or includes any special characters or spaces, double quotes are required to process the case/characters. The double quotes must be enclosed within the single quotes, i.e. `'"<task_name>"'`.

`RECURSIVE => Boolean`
:   Specifies whether to limit the output to include only direct child tasks or to include all recursive child tasks.

    Values:
    :   `TRUE`: Returns all recursive child tasks (children, grandchildren, etc.) in the output.

        `FALSE`: Returns only direct child tasks in the output.

    Default: `TRUE`.

## Usage notes

* Only returns rows for a task owner (i.e. the role with the OWNERSHIP privilege on a task) or a role with either the MONITOR or OPERATE privilege on a task.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

The function output provides table properties and metadata in the following columns:

```sqlexample
| created_on | name | database_name | schema_name | owner | comment | warehouse | schedule | predecessors | state | definition | condition |
```

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the task was created. |
| `name` | Name of the task. |
| `database_name` | Database for the schema for the task. |
| `schema_name` | Schema for the task. |
| `owner` | Role that owns the task (i.e. has the OWNERSHIP privilege on the task) |
| `comment` | Comment for the task. |
| `warehouse` | Warehouse that provides the required resources to run the task. |
| `schedule` | Schedule for running the task. Displays NULL if no schedule is specified. |
| `predecessors` | JSON array of any tasks identified in the AFTER parameter for the task (i.e. predecessor tasks). When run successfully to completion, these tasks trigger the current task. Individual task names in the array are fully-qualified (i.e. include the container database and schema names). . . Displays an empty array if the task has no predecessor. |
| `state` | ‘Started’ or ‘Suspended’ based on the current state of the task. |
| `definition` | SQL statements executed when the task runs. |
| `condition` | Condition specified in the WHEN clause for the task. |

## Examples

Retrieve the list of direct child tasks for the `mydb.myschema.mytask` task:

> ```sqlexample
> select *
>   from table(information_schema.task_dependents(task_name => 'mydb.myschema.mytask', recursive => false));
> ```

---
title: TASK_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/task_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# TASK_HISTORY

You can use this table function to query the history of [task](../../user-guide/tasks-intro.md) usage within a specified date range.
The function returns the history of task usage for your entire Snowflake account, a specified task, or task graph.

This function can return all executions run in the past seven days or the next scheduled execution within the next eight days.

## Syntax

```sqlsyntax
TASK_HISTORY(
      [ SCHEDULED_TIME_RANGE_START => <constant_expr> ]
      [, SCHEDULED_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <integer> ]
      [, TASK_NAME => '<string>' ]
      [, ERROR_ONLY => { TRUE | FALSE } ]
      [, ROOT_TASK_ID => '<string>'] )
```

## Arguments

All the arguments are optional.

`SCHEDULED_TIME_RANGE_START => constant_expr` , . `SCHEDULED_TIME_RANGE_END => constant_expr`
:   Time range (in [TIMESTAMP_LTZ format](../data-types-datetime.md)), within the last 7 days, in which the task execution was scheduled. If the time range does not fall
    within the last 7 days, an error is returned.

    * If `SCHEDULED_TIME_RANGE_END` is not specified, the function returns those tasks that have already completed, are currently
      running, or are scheduled in the future.
    * If `SCHEDULED_TIME_RANGE_END` is [CURRENT_TIMESTAMP](current_timestamp.md), the function returns those tasks that have
      already completed or are currently running. Note that a task that is executed immediately before the current time might still be
      identified as scheduled.
    * To query only those tasks that have already completed or are currently running, include `WHERE query_id IS NOT NULL` as a filter.
      The QUERY_ID column in the TASK_HISTORY output is populated only when a task has started running.

    > **Note:**
    >
    > If no start or end time is specified, the most recent tasks are returned, up to the specified RESULT_LIMIT value.

`RESULT_LIMIT => integer`
:   A number specifying the maximum number of rows returned by the function.

    If the number of matching rows is greater than this limit, the task executions with the most recent timestamp are returned, up to the specified limit.

    Range: `1` to `10000`

    Default: `100`.

`TASK_NAME => string`
:   A case-insensitive string specifying a task. Only non-qualified task names are supported. Only executions of the specified task are returned. Note that if multiple tasks have the same name, the function returns the history for each of these tasks.

`ERROR_ONLY => TRUE | FALSE`
:   When set to TRUE, this function returns only task runs that failed or were cancelled.

    Default: `FALSE`.

`ROOT_TASK_ID =>string`
:   Unique identifier for the root task in a task graph. This ID matches the ID column value in the SHOW TASKS output for the same task.
    Specify the ROOT_TASK_ID to show the history of the root task and any child tasks that are part of the task graph.

## Usage notes

* To view a task graph within this function, the invoking role requires at least one of the following privileges:

  + OWNERSHIP privilege on the task (that is, the task owner).
  + MONITOR or OPERATE privileges on the task.
  + The global MONITOR EXECUTION privilege.
  + The ACCOUNTADMIN role.

  The role must also have the USAGE privilege on the database and schema that store the task, otherwise the DATABASE_NAME and SCHEMA_NAME values in the output are NULL.
* This function returns a maximum of 10,000 rows, set in the `RESULT_LIMIT` argument value. The default value is `100`. To avoid
  this limitation, use the [TASK_HISTORY view](../account-usage/task_history.md) (Account Usage).
* Note that when the TASK_HISTORY function is queried, its task name, time range, and result limit arguments are applied first
  followed by the WHERE and LIMIT clause, respectively, if specified. In addition, the TASK_HISTORY function returns records in descending
  SCHEDULED_TIME order. Tasks in a SUCCEEDED, FAILED, or CANCELLED state are usually scheduled earlier, so they are generally returned
  later in the search results.
* In practice, if you have many tasks running in your account, the results returned by the function could include fewer than expected
  completed tasks or only scheduled tasks. To query the history of tasks that have already run, use a combination of
  the `SCHEDULED_TIME_RANGE_START => constant_expr` and `SCHEDULED_TIME_RANGE_END => constant_expr` arguments.
* When calling an information schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name
  must be fully qualified. For more information, see [Snowflake Information Schema](../info-schema.md).
* Tasks run during a cloud services failure might appear as duplicate entries in the results of this function. During a cloud services
  failure, Snowflake might rerun a task causing the task to have two UUIDs with different task SCHEDULED_TIME.
  [TASK_HISTORY view](../account-usage/task_history.md) only displays the final UUID of the rerun task.
* All tasks in a task graph run show the same task history output.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | TEXT | ID of the SQL statement executed by the task. Can be joined with the QUERY_HISTORY view for additional details about the execution of the statement or stored procedure. |
| NAME | TEXT | Name of the task. |
| DATABASE_NAME | TEXT | Name of the database that contains the task. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the task. |
| QUERY_TEXT | TEXT | Text of the SQL statement. |
| CONDITION_TEXT | TEXT | Text of WHEN condition the task evaluates when determining whether to run. |
| STATE | TEXT | Status of the task:   * `SCHEDULED`: scheduled for execution. * `EXECUTING`: currently executing. * `SUCCEEDED`: execution successful. * `FAILED`: execution failed. The timed-out tasks always have a `FAILED` state in the task history. * [FAILED_AND_AUTO_SUSPENDED](../../user-guide/tasks-intro.md): task failed, and was automatically suspended. * `CANCELLED`: execution cancelled. * `SKIPPED`: indicates that a task run began, but the optional `WHEN` parameter in the task definition returned a FALSE value; therefore, the run did not resume the warehouse (if the task uses customer-managed compute resources) or execute the SQL code in the task definition. |
| ERROR_CODE | NUMBER | Error code, if the statement returned an error. |
| ERROR_MESSAGE | TEXT | Error message, if the statement returned an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the task is/was scheduled to start running. Tasks start with a brief queueing period before they begin to run. For more information, see [Task duration](../../user-guide/tasks-intro.md). |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the task definition started to run, or NULL if SCHEDULED_TIME is in the future or the current scheduled run has not started yet. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| NEXT_SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the standalone or root task (in a [task graph](../../user-guide/tasks-graphs.md)) is next scheduled to start running, assuming the current run of the standalone task or task graph started at the SCHEDULED_TIME time completes in time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the task completed, or NULL if SCHEDULED_TIME is in the future or if the task is still running. |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a task graph. This ID matches the ID column value in the SHOW TASKS output for the same task. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the task graph that was run, or is scheduled to be run. Each incremental increase in the value represents one or more modifications to tasks in the task graph. If the root task is recreated (using CREATE OR REPLACE TASK), then the version number restarts from 1. |
| RUN_ID | NUMBER | Time when the standalone or root task in a [task graph](../../user-guide/tasks-graphs.md) is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system might reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run before retry. You can use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| RETURN_VALUE | TEXT | Value set for the predecessor task in a task graph. The return value is explicitly set by calling the [SYSTEM$SET_RETURN_VALUE](system_set_return_value.md) function by the predecessor task. |
| SCHEDULED_FROM | TEXT | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| CONFIG | TEXT | Configuration that the task execution used. This includes dynamic configurations specified with [EXECUTE TASK … USING CONFIG](../sql/execute-task.md). If no configuration is set, the column displays NULL. |
| QUERY_HASH | TEXT | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| QUERY_PARAMETERIZED_HASH | TEXT | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_PARAMETERIZED_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
| GRAPH_RUN_GROUP_ID | TEXT | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Examples

Retrieve the 100 most recent task executions (completed, still running, or scheduled in the future) in the account. Note that the maximum
number of rows returned by the function is limited to 100 by default:

> ```sqlexample
> SELECT *
>   FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.TASK_HISTORY())
>   ORDER BY SCHEDULED_TIME;
> ```

Retrieve the execution history for tasks in the account within a specified 30-minute block of time within a specific 7-day period:

> ```sqlexample
> SELECT *
>   FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.TASK_HISTORY(
>     SCHEDULED_TIME_RANGE_START=>TO_TIMESTAMP_LTZ('2024-11-9 12:00:00.000 -0700'),
>     SCHEDULED_TIME_RANGE_END=>TO_TIMESTAMP_LTZ('2024-11-9 12:30:00.000 -0700')));
> ```

Retrieve the 10 most recent executions of a specified task (completed, still running, or scheduled in the future) scheduled within the last hour:

> ```sqlexample
> SELECT *
>   FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.TASK_HISTORY(
>     SCHEDULED_TIME_RANGE_START=>DATEADD('hour',-1,current_timestamp()),
>     RESULT_LIMIT => 10,
>     TASK_NAME=>'mytask'));
> ```
>
> > **Note:**
> >
> > To retrieve only tasks that are completed or still running, filter the query using `WHERE query_id IS NOT NULL`. Note that this filter is applied after `RESULT_LIMIT` already reduces the results returned, so the query could return 9 tasks if 1 task was scheduled but had not started yet.

Retrieve the execution history of all tasks in the task graph of the specified root task.

> ```sqlexample
> SELECT *
>   FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.TASK_HISTORY(ROOT_TASK_ID=>'d4b89013-c942-465c-bcb8-e7037a932b04'));
> ```

Retrieve the execution history of all tasks in the task graph of the most recently queried root task:

> ```sqlexample
> DESC TASK my_task
> SET task_id=(SELECT "id" FROM TABLE(RESULT_SCAN(LAST_QUERY_ID())));
> SELECT *
>   FROM TABLE(SNOWFLAKE.INFORMATION_SCHEMA.TASK_HISTORY(ROOT_TASK_ID=>$task_id));
> ```

---
title: TEXT_HTML
source: https://docs.snowflake.com/en/sql-reference/functions/text_html.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Message Construction)

# TEXT_HTML

Returns a JSON object that specifies the HTML message to use for a notification. This is a helper function that you use to
construct a message object for the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md) ,
    [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) ,
    [TEXT_PLAIN](text_plain.md) ,
    [APPLICATION_JSON](application_json.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.TEXT_HTML( '<message>' )
```

## Arguments

`'message'`
:   Content of the message to send.

## Returns

A JSON-formatted string that specifies a message for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure to send.

For example:

```json
'{"text/html":"<p>A message</p>"}'
```

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: TEXT_PLAIN
source: https://docs.snowflake.com/en/sql-reference/functions/text_plain.md
section: SQL Functions
---

Categories:
:   [Notification functions](../functions-notification.md) (Message Construction)

# TEXT_PLAIN

Returns a JSON object that specifies the plain text message to use for a notification. This is a helper function that you use to
construct a message object for the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md) ,
    [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) ,
    [TEXT_HTML](text_html.md) ,
    [APPLICATION_JSON](application_json.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NOTIFICATION.TEXT_PLAIN( '<message>' )
```

## Arguments

`'message'`
:   Content of the message to send.

## Returns

A JSON-formatted string that specifies a message for the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure to send.

For example:

```json
'{"text/plain":"A message"}'
```

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: TIME_FROM_PARTS
source: https://docs.snowflake.com/en/sql-reference/functions/time_from_parts.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIME_FROM_PARTS

Creates a time from individual numeric components.

Aliases:
:   TIMEFROMPARTS

## Syntax

```sqlsyntax
TIME_FROM_PARTS( <hour>, <minute>, <second> [, <nanoseconds>] )
```

## Arguments

**Required:**

`hour`
:   An integer expression to use as an hour for building a time,
    usually in the 0-23 range.

`minute`
:   An integer expression to use as a minute for building a time,
    usually in the 0-59 range.

`second`
:   An integer expression to use as a second for building a time,
    usually in the 0-59 range.

**Optional:**

`nanoseconds`
:   A 9-digit integer expression to use as a nanosecond for building
    a time.

## Usage notes

TIME_FROM_PARTS is typically used to handle values in “normal” ranges
(e.g. hours 0-23, minutes 0-59), but it also handles values from outside
these ranges. This allows, for example, choosing the N-th minute in a day,
which can be used to simplify some computations.

## Examples

```sqlexample
ALTER SESSION SET TIME_OUTPUT_FORMAT='HH24:MI:SS.FF9';
```

Components in normal ranges:

> ```sqlexample
> select time_from_parts(12, 34, 56, 987654321);
>
> ----------------------------------------+
>  TIME_FROM_PARTS(12, 34, 56, 987654321) |
> ----------------------------------------+
>  12:34:56.987654321                     |
> ----------------------------------------+
> ```

Components outside normal ranges:

* 100th minute (from midnight)
* 12345 seconds (from noon)

  > ```sqlexample
  > select time_from_parts(0, 100, 0), time_from_parts(12, 0, 12345);
  >
  > ----------------------------+-------------------------------+
  >  TIME_FROM_PARTS(0, 100, 0) | TIME_FROM_PARTS(12, 0, 12345) |
  > ----------------------------+-------------------------------+
  >  01:40:00.000000000         | 15:25:45.000000000            |
  > ----------------------------+-------------------------------+
  > ```

---
title: TIME_SLICE
source: https://docs.snowflake.com/en/sql-reference/functions/time_slice.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIME_SLICE

Calculates the beginning or end of a “slice” of time, where the length of the slice is a multiple of a standard unit of time
(minute, hour, day, etc.).

This function can be used to calculate the start and end times of fixed-width “buckets” into which data can be categorized.

See also:
:   [DATE_TRUNC](date_trunc.md)

## Syntax

```sqlsyntax
TIME_SLICE( <date_or_time_expr> , <slice_length> , <date_or_time_part> [ , <start_or_end> ] )
```

## Arguments

**Required:**

`date_or_time_expr`
:   The function returns the start or end of the slice that contains this date or time. The expression must
    be of type DATE or TIMESTAMP_NTZ.

`slice_length`
:   This indicates the width of the slice (i.e. how many units of
    time are contained in the slice). For example, if the unit is MONTH and the `slice_length`
    is 2, then each slice is 2 months wide. The `slice_length` must be an integer
    greater than or equal to 1.

`date_or_time_part`
:   Time unit for the slice length. The value must be a string containing one of the values listed
    below:

    * If input expression is a DATE: YEAR, QUARTER, MONTH, WEEK, DAY.
    * If input expression is a TIMESTAMP_NTZ: YEAR, QUARTER, MONTH, WEEK, DAY, HOUR, MINUTE, SECOND.

    The values are case-insensitive.

**Optional:**

`start_or_end`
:   This is an optional constant parameter that determines whether the start or end of the slice should be returned.

    Supported values are ‘START’ or ‘END’. The values are case-insensitive.

    The default value is ‘START’.

## Returns

The data type of the return value is identical to the data type of the input `date_or_time_expr`
(i.e. either TIMESTAMP_NTZ or DATE).

## Usage notes

* All slices are aligned relative to midnight January 1, 1970 (1970-01-01 00:00:00).

  Most slices start on an integer multiple of the slice length relative to January 1, 1970. For example, if you choose
  a slice length of 15 years, then each slice will start on one of the following boundaries:

  + January 1, 1970.
  + January 1, 1985.
  + January 1, 2000.
  + January 1, 2015.
  + Etc.

  Dates prior to January 1, 1970 are also valid; for example, a 15-year slice can start on January 1, 1955.

  The one exception is that, for slices measured in weeks, the starts of the slices are aligned with the beginning of
  the week that contains January 1, 1970. January 1, 1970 was a Thursday. So, for example, if your
  [WEEK_START](../parameters.md) session parameter specifies that your calendar weeks start on Monday, and if your slices
  are 2 weeks, then your slices will start on one of the following boundaries:

  + December 29, 1969 (Monday).
  + January 12, 1970 (Monday).
  + January 25, 1970 (Monday).
  + Etc.

  If your calendar weeks start on Sunday, then your slices will start on:

  + December 28, 1969 (Sunday).
  + January 11, 1970 (Sunday).
  + January 25, 1970 (Sunday).
  + Etc.

  For more details about how calendar weeks are handled, including examples, see [Calendar weeks and weekdays](../functions-date-time.md).
* Although the parameters to TIME_SLICE must be of type DATE or TIMESTAMP_NTZ, you can use casting to process
  TIMESTAMP_LTZ values. For TIMESTAMP_LTZ values, cast the input to TIMESTAMP_NTZ first and then cast back
  to TIMESTAMP_LTZ. However, in this case, slices crossing daylight saving time boundaries can be either one hour
  longer or one hour shorter than slices that do not cross daylight saving time boundaries.
* The end of each slice is the same as the beginning of the following slice. For example, if the slice is
  2 months and the start of the slice is 2019-01-01, then the end of the slice will be 2019-03-01, not
  2019-02-28. In other words, the slice contains dates or timestamps greater than or equal to the start
  and less than (but not equal to) the end.

## Examples

Find the start and end of a 4-month slice containing a date:

> ```sqlexample
> SELECT '2019-02-28'::DATE AS "DATE",
>        TIME_SLICE("DATE", 4, 'MONTH', 'START') AS "START OF SLICE",
>        TIME_SLICE("DATE", 4, 'MONTH', 'END') AS "END OF SLICE";
> +------------+----------------+--------------+
> | DATE       | START OF SLICE | END OF SLICE |
> |------------+----------------+--------------|
> | 2019-02-28 | 2019-01-01     | 2019-05-01   |
> +------------+----------------+--------------+
> ```

Find the start of 8-hour slices corresponding to two timestamps:

> ```sqlexample
> SELECT '2019-02-28T01:23:45.678'::TIMESTAMP_NTZ AS "TIMESTAMP 1",
>        '2019-02-28T12:34:56.789'::TIMESTAMP_NTZ AS "TIMESTAMP 2",
>        TIME_SLICE("TIMESTAMP 1", 8, 'HOUR') AS "SLICE FOR TIMESTAMP 1",
>        TIME_SLICE("TIMESTAMP 2", 8, 'HOUR') AS "SLICE FOR TIMESTAMP 2";
> +-------------------------+-------------------------+-------------------------+-------------------------+
> | TIMESTAMP 1             | TIMESTAMP 2             | SLICE FOR TIMESTAMP 1   | SLICE FOR TIMESTAMP 2   |
> |-------------------------+-------------------------+-------------------------+-------------------------|
> | 2019-02-28 01:23:45.678 | 2019-02-28 12:34:56.789 | 2019-02-28 00:00:00.000 | 2019-02-28 08:00:00.000 |
> +-------------------------+-------------------------+-------------------------+-------------------------+
> ```

Group data into “buckets” based on the date or timestamp (e.g. group data into buckets that are two weeks wide):

> This example uses the table and data created below:
>
> ```sqlexample
> CREATE TABLE accounts (ID INT, billing_date DATE, balance_due NUMBER(11, 2));
>
> INSERT INTO accounts (ID, billing_date, balance_due) VALUES
>     (1, '2018-07-31', 100.00),
>     (2, '2018-08-01', 200.00),
>     (3, '2018-08-25', 400.00);
> ```
>
> This query shows the bucketed data:
>
> ```sqlexample
> SELECT
>        TIME_SLICE(billing_date, 2, 'WEEK', 'START') AS "START OF SLICE",
>        TIME_SLICE(billing_date, 2, 'WEEK', 'END')   AS "END OF SLICE",
>        COUNT(*) AS "NUMBER OF LATE BILLS",
>        SUM(balance_due) AS "SUM OF MONEY OWED"
>     FROM accounts
>     WHERE balance_due > 0    -- bill hasn't yet been paid
>     GROUP BY "START OF SLICE", "END OF SLICE";
> +----------------+--------------+----------------------+-------------------+
> | START OF SLICE | END OF SLICE | NUMBER OF LATE BILLS | SUM OF MONEY OWED |
> |----------------+--------------+----------------------+-------------------|
> | 2018-07-23     | 2018-08-06   |                    2 |            300.00 |
> | 2018-08-20     | 2018-09-03   |                    1 |            400.00 |
> +----------------+--------------+----------------------+-------------------+
> ```
>
> Note that the GROUP BY clause needs both the start of the slice and the end of the slice because the compiler
> expects the GROUP BY clause to contain all non-aggregate expressions of the projection clause.

---
title: TIMEADD
source: https://docs.snowflake.com/en/sql-reference/functions/timeadd.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIMEADD

Adds the specified value for the specified date or time part to a date, time, or timestamp.

Alias for [DATEADD](dateadd.md).

## Syntax

```sqlsyntax
TIMEADD( <date_or_time_part> , <value> , <date_or_time_expr> )
```

## Arguments

`date_or_time_part`
:   This indicates the units of time that you want to add. For example if you
    want to add two days, then specify `day`. This unit of measure must
    be one of the values listed in [Supported date and time parts](../functions-date-time.md).

`value`
:   This is the number of units of time that you want to add. For example,
    if the units of time is `day`, and you want to add two days, specify `2`.
    If you want to subtract two days, specify `-2`.

`date_or_time_expr`
:   `date_or_time_expr` must evaluate to a date, time, or timestamp.
    This is the date, time, or timestamp to which you want to add.
    For example, if you want to add two days to August 1, 2024, then specify
    `'2024-08-01'::DATE`.

    If the data type is TIME, then the `date_or_time_part`
    must be in units of hours or smaller, not days or bigger.

    If the input data type is DATE, and the `date_or_time_part` is hours
    or smaller, the input value will not be rejected, but instead will be
    treated as a TIMESTAMP with hours, minutes, seconds, and fractions of
    a second all initially set to 0 (e.g. midnight on the specified date).

## Returns

If `date_or_time_expr` is a time, then the return data type is a time.

If `date_or_time_expr` is a timestamp, then the return data type is a timestamp.

If `date_or_time_expr` is a date:

> * If `date_or_time_part` is `day` or larger (for example, `month`, `year`),
>   the function returns a DATE value.
> * If `date_or_time_part` is smaller than a day (for example, `hour`, `minute`,
>   `second`), the function returns a TIMESTAMP_NTZ value, with `00:00:00.000` as the starting
>   time for the date.

## Usage notes

When `date_or_time_part` is `year`, `quarter`, or `month` (or any of their variations),
if the result month has fewer days than the original day of the month, the result day of the month might
be different from the original day.

## Examples

The TIMEADD and TIMESTAMPADD functions are aliases for the DATEADD function. You can use any of these three
functions in the examples to return the same results.

Add years to a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, 2, TO_DATE('2022-05-08')) AS date_plus_two_years;
```

```output
+---------------+---------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS |
|---------------+---------------------|
| 2022-05-08    | 2024-05-08          |
+---------------+---------------------+
```

Subtract years from a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, -2, TO_DATE('2022-05-08')) AS date_minus_two_years;
```

```output
+---------------+----------------------+
| ORIGINAL_DATE | DATE_MINUS_TWO_YEARS |
|---------------+----------------------|
| 2022-05-08    | 2020-05-08           |
+---------------+----------------------+
```

Add two years and two hours to a date. First, set the timestamp output format, create a table,
and insert data:

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF9';
CREATE TABLE datetest (d date);
INSERT INTO datetest VALUES ('2022-04-05');
```

Run a query that adds two years and two hours to a date:

```sqlexample
SELECT d AS original_date,
       DATEADD(year, 2, d) AS date_plus_two_years,
       TO_TIMESTAMP(d) AS original_timestamp,
       DATEADD(hour, 2, d) AS timestamp_plus_two_hours
  FROM datetest;
```

```output
+---------------+---------------------+-------------------------+--------------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS | ORIGINAL_TIMESTAMP      | TIMESTAMP_PLUS_TWO_HOURS |
|---------------+---------------------+-------------------------+--------------------------|
| 2022-04-05    | 2024-04-05          | 2022-04-05 00:00:00.000 | 2022-04-05 02:00:00.000  |
+---------------+---------------------+-------------------------+--------------------------+
```

Add a month to a date in a month with the same or more days than the
resulting month. For example, if the date is January 31, adding a month should not
return February 31.

```sqlexample
SELECT DATEADD(month, 1, '2023-01-31'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-02-28          |
+---------------------+
```

Add a month to a date in a month with fewer days than the resulting month.
Adding a month to February 28 returns March 28.

```sqlexample
SELECT DATEADD(month, 1, '2023-02-28'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-03-28          |
+---------------------+
```

Add hours to a time:

```sqlexample
SELECT TO_TIME('05:00:00') AS original_time,
       DATEADD(hour, 3, TO_TIME('05:00:00')) AS time_plus_three_hours;
```

```output
+---------------+-----------------------+
| ORIGINAL_TIME | TIME_PLUS_THREE_HOURS |
|---------------+-----------------------|
| 05:00:00      | 08:00:00              |
+---------------+-----------------------+
```

---
title: TIMEDIFF
source: https://docs.snowflake.com/en/sql-reference/functions/timediff.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIMEDIFF

Calculates the difference between two date, time, or timestamp expressions based on the specified date or time part.
The function returns the result of subtracting the second argument from the third argument.

Alternative for [DATEDIFF](datediff.md).

## Syntax

```sqlsyntax
TIMEDIFF( <date_or_time_part> , <date_or_time_expr1> , <date_or time_expr2> )
```

## Arguments

`date_or_time_part`
:   The unit of time. Must be one of the values listed in [Supported date and time parts](../functions-date-time.md) (for example, `month`).
    The value can be a string literal or can be unquoted (for example, `'month'` or `month`).

`date_or_time_expr1`, `date_or_time_expr2`
:   The values to compare. Must be a date, a time, a timestamp, or an expression that can be evaluated to
    a date, a time, or a timestamp. The value `date_or_time_expr1` is subtracted from
    `date_or_time_expr2`.

## Returns

Returns an integer representing the number of units (seconds, days, etc.) difference between `date_or_time_expr2` and
`date_or_time_expr1`.

## Usage notes

* Output values can be negative, for example, -12 days.

* The function supports units of years, quarters, months, weeks, days, hours, minutes, seconds, milliseconds, microseconds, and nanoseconds.
* If `date_or_time_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md) session parameter. For more details, including examples, see
  [Calendar weeks and weekdays](../functions-date-time.md).
* The unit (for example, `month`) used to calculate the difference determines which parts of the DATE, TIME, or TIMESTAMP field are
  evaluated. So, the unit determines the precision of the result.

  Smaller units are not used, so values are not rounded. For example, even though the difference between January 1, 2021 and
  February 28, 2021 is closer to two months than to one month, the following returns one month:

  ```sqlexample
  DATEDIFF(month, '2021-01-01'::DATE, '2021-02-28'::DATE)
  ```

  For a DATE value:

  > + `year` uses only the year and disregards all the other parts.
  > + `month` uses the month and year.
  > + `day` uses the entire date.

  For a TIME value:

  > + `hour` uses only the hour and disregards all the other parts.
  > + `minute` uses the hour and minute.
  > + `second` uses the hour, minute, and second, but not the fractional seconds.
  > + `millisecond` uses the hour, minute, second, and first three digits of the fractional seconds. Fractional
  >   seconds are not rounded. For example, `DATEDIFF(milliseconds, '2024-02-20 21:18:41.0000', '2024-02-20 21:18:42.1239')` returns 1.123 seconds,
  >   not 1.124 seconds.
  > + `microsecond` uses the hour, minute, second, and first six digits of the fractional seconds. Fractional
  >   seconds are not rounded.
  > + `nanosecond` uses the hour, minute, second, and all nine digits of the fractional seconds.

  For a TIMESTAMP value:

  > The rules match the rules for DATE and TIME data types above. Only the specified unit and larger units are used.

## Examples

This shows the result of subtracting two dates, in which the second is
two years later than the first:

> ```sqlexample
> SELECT TIMEDIFF(YEAR, '2017-01-01', '2019-01-01') AS Years;
> +-------+
> | YEARS |
> |-------|
> |     2 |
> +-------+
> ```

This shows that the value is truncated rather than rounded. The difference is closer
to 12 months than to 11, but Snowflake calculates the difference as 11 months:

> ```sqlexample
> SELECT TIMEDIFF(MONTH, '2017-01-1', '2017-12-31') AS Months;
> +--------+
> | MONTHS |
> |--------|
> |     11 |
> +--------+
> ```

There are additional examples in [DATEDIFF](datediff.md).

---
title: TIMESTAMP_FROM_PARTS
source: https://docs.snowflake.com/en/sql-reference/functions/timestamp_from_parts.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIMESTAMP_FROM_PARTS

Creates a timestamp from individual numeric components. If no time zone is in effect, the function can be used to create a timestamp from a date expression and a time expression.

Aliases:
:   TIMESTAMPFROMPARTS

Variations (and Aliases):
:   TIMESTAMP_LTZ_FROM_PARTS , TIMESTAMPLTZFROMPARTS

    TIMESTAMP_NTZ_FROM_PARTS , TIMESTAMPNTZFROMPARTS

    TIMESTAMP_TZ_FROM_PARTS , TIMESTAMPTZFROMPARTS

## Syntax

```sqlsyntax
TIMESTAMP_FROM_PARTS( <year>, <month>, <day>, <hour>, <minute>, <second> [, <nanosecond> ] [, <time_zone> ] )

TIMESTAMP_FROM_PARTS( <date_expr>, <time_expr> )
```

```sqlsyntax
TIMESTAMP_LTZ_FROM_PARTS( <year>, <month>, <day>, <hour>, <minute>, <second> [, <nanosecond>] )
```

```sqlsyntax
TIMESTAMP_NTZ_FROM_PARTS( <year>, <month>, <day>, <hour>, <minute>, <second> [, <nanosecond>] )

TIMESTAMP_NTZ_FROM_PARTS( <date_expr>, <time_expr> )
```

```sqlsyntax
TIMESTAMP_TZ_FROM_PARTS( <year>, <month>, <day>, <hour>, <minute>, <second> [, <nanosecond>] [, <time_zone>] )
```

> **Note:**
>
> The date and time expression version of TIMESTAMP_FROM_PARTS is only valid when the [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter is set to TIMESTAMP_NTZ.

## Arguments

**Required:**

`year`
:   An integer expression to use as a year for building a timestamp.

`month`
:   An integer expression to use as a month for building a timestamp, with January represented as `1`, and December as `12`.

`day`
:   An integer expression to use as a day for building a timestamp, usually in the `1`-`31` range.

`hour`
:   An integer expression to use as an hour for building a timestamp, usually in the `0`-`23` range.

`minute`
:   An integer expression to use as a minute for building a timestamp, usually in the `0`-`59` range.

`second`
:   An integer expression to use as a second for building a timestamp, usually in the `0`-`59` range.

`date_expr` , `time_expr`
:   Specifies the date and time expressions to use for building a timestamp where `date_expr` provides the year, month, and day for the timestamp and `time_expr` provides the hour,
    minute, second, and nanoseconds within the day. Only valid for:

    * TIMESTAMP_FROM_PARTS (when the [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter is set to TIMESTAMP_NTZ)
    * TIMESTAMP_NTZ_FROM_PARTS

**Optional:**

`nanoseconds`
:   An integer expression to use as a nanosecond for building a timestamp, usually in the `0`-`999999999` range.

`time_zone`
:   A string expression to use as a time zone for building a timestamp (e.g. `America/Los_Angeles`). Only valid for:

    * TIMESTAMP_FROM_PARTS (when the [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter is set to TIMESTAMP_TZ)
    * TIMESTAMP_TZ_FROM_PARTS

## Usage notes

* TIMESTAMP_FROM_PARTS variations are typically used to handle values in the “normal” value ranges (e.g. months `1`-`12`, days `1`-`31`, hours `0`-`23`, etc.); however, they can also
  handle values from outside these ranges. This allows choosing the Nth day in a year or Nth second in a day, which can be useful for simplifying some computations.
* TIMESTAMP_FROM_PARTS is equivalent to the variation specified by the [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter (default is TIMESTAMP_NTZ).

## Examples

Set the session variables that control output format and time zone:

> ```sqlexample
> ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT='YYYY-MM-DD HH24:MI:SS.FF9 TZH:TZM';
> ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT='YYYY-MM-DD HH24:MI:SS.FF9 TZH:TZM';
> ALTER SESSION SET TIMEZONE='America/New_York';
> ```

Using `TIMESTAMP_LTZ_FROM_PARTS`:

> ```sqlexample
> SELECT TIMESTAMP_LTZ_FROM_PARTS(2013, 4, 5, 12, 00, 00);
> +--------------------------------------------------+
> | TIMESTAMP_LTZ_FROM_PARTS(2013, 4, 5, 12, 00, 00) |
> |--------------------------------------------------|
> | 2013-04-05 12:00:00.000000000 -0400              |
> +--------------------------------------------------+
> ```

Using `TIMESTAMP_NTZ_FROM_PARTS`:

> ```sqlexample
> select timestamp_ntz_from_parts(2013, 4, 5, 12, 00, 00, 987654321);
> +-------------------------------------------------------------+
> | TIMESTAMP_NTZ_FROM_PARTS(2013, 4, 5, 12, 00, 00, 987654321) |
> |-------------------------------------------------------------|
> | 2013-04-05 12:00:00.987654321                               |
> +-------------------------------------------------------------+
> ```

Using `TIMESTAMP_NTZ_FROM_PARTS` with a date and time rather than with
year, month, day, hour, etc.:

> ```sqlexample
> select timestamp_ntz_from_parts(to_date('2013-04-05'), to_time('12:00:00'));
> +----------------------------------------------------------------------+
> | TIMESTAMP_NTZ_FROM_PARTS(TO_DATE('2013-04-05'), TO_TIME('12:00:00')) |
> |----------------------------------------------------------------------|
> | 2013-04-05 12:00:00.000000000                                        |
> +----------------------------------------------------------------------+
> ```

Using `TIMESTAMP_TZ_FROM_PARTS` with a session-default time zone (‘America/New_York’/-0400):

> ```sqlexample
> select timestamp_tz_from_parts(2013, 4, 5, 12, 00, 00);
> +-------------------------------------------------+
> | TIMESTAMP_TZ_FROM_PARTS(2013, 4, 5, 12, 00, 00) |
> |-------------------------------------------------|
> | 2013-04-05 12:00:00.000000000 -0400             |
> +-------------------------------------------------+
> ```

Using `TIMESTAMP_TZ_FROM_PARTS` with a specified time zone (‘America/Los_Angeles’/-0700); note also the use of 0 as the nanoseconds argument:

> ```sqlexample
> select timestamp_tz_from_parts(2013, 4, 5, 12, 00, 00, 0, 'America/Los_Angeles');
> +---------------------------------------------------------------------------+
> | TIMESTAMP_TZ_FROM_PARTS(2013, 4, 5, 12, 00, 00, 0, 'AMERICA/LOS_ANGELES') |
> |---------------------------------------------------------------------------|
> | 2013-04-05 12:00:00.000000000 -0700                                       |
> +---------------------------------------------------------------------------+
> ```

Handling values outside normal ranges (subtracting 1 hour by specifying -3600 seconds):

> ```sqlexample
> select timestamp_from_parts(2013, 4, 5, 12, 0, -3600);
> +------------------------------------------------+
> | TIMESTAMP_FROM_PARTS(2013, 4, 5, 12, 0, -3600) |
> |------------------------------------------------|
> | 2013-04-05 11:00:00.000000000                  |
> +------------------------------------------------+
> ```

---
title: TIMESTAMPADD
source: https://docs.snowflake.com/en/sql-reference/functions/timestampadd.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIMESTAMPADD

Adds the specified value for the specified date or time part to a date, time, or timestamp.

Alias for [DATEADD](dateadd.md).

## Syntax

```sqlsyntax
TIMESTAMPADD( <date_or_time_part> , <time_value> , <date_or_time_expr> )
```

## Arguments

`date_or_time_part`
:   This indicates the units of time that you want to add. For example if you
    want to add two days, then specify `day`. This unit of measure must
    be one of the values listed in [Supported date and time parts](../functions-date-time.md).

`value`
:   This is the number of units of time that you want to add. For example,
    if the units of time is `day`, and you want to add two days, specify `2`.
    If you want to subtract two days, specify `-2`.

`date_or_time_expr`
:   `date_or_time_expr` must evaluate to a date, time, or timestamp.
    This is the date, time, or timestamp to which you want to add.
    For example, if you want to add two days to August 1, 2024, then specify
    `'2024-08-01'::DATE`.

    If the data type is TIME, then the `date_or_time_part`
    must be in units of hours or smaller, not days or bigger.

    If the input data type is DATE, and the `date_or_time_part` is hours
    or smaller, the input value will not be rejected, but instead will be
    treated as a TIMESTAMP with hours, minutes, seconds, and fractions of
    a second all initially set to 0 (e.g. midnight on the specified date).

## Returns

If `date_or_time_expr` is a time, then the return data type is a time.

If `date_or_time_expr` is a timestamp, then the return data type is a timestamp.

If `date_or_time_expr` is a date:

> * If `date_or_time_part` is `day` or larger (for example, `month`, `year`),
>   the function returns a DATE value.
> * If `date_or_time_part` is smaller than a day (for example, `hour`, `minute`,
>   `second`), the function returns a TIMESTAMP_NTZ value, with `00:00:00.000` as the starting
>   time for the date.

## Usage notes

When `date_or_time_part` is `year`, `quarter`, or `month` (or any of their variations),
if the result month has fewer days than the original day of the month, the result day of the month might
be different from the original day.

## Examples

The TIMEADD and TIMESTAMPADD functions are aliases for the DATEADD function. You can use any of these three
functions in the examples to return the same results.

Add years to a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, 2, TO_DATE('2022-05-08')) AS date_plus_two_years;
```

```output
+---------------+---------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS |
|---------------+---------------------|
| 2022-05-08    | 2024-05-08          |
+---------------+---------------------+
```

Subtract years from a date:

```sqlexample
SELECT TO_DATE('2022-05-08') AS original_date,
       DATEADD(year, -2, TO_DATE('2022-05-08')) AS date_minus_two_years;
```

```output
+---------------+----------------------+
| ORIGINAL_DATE | DATE_MINUS_TWO_YEARS |
|---------------+----------------------|
| 2022-05-08    | 2020-05-08           |
+---------------+----------------------+
```

Add two years and two hours to a date. First, set the timestamp output format, create a table,
and insert data:

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF9';
CREATE TABLE datetest (d date);
INSERT INTO datetest VALUES ('2022-04-05');
```

Run a query that adds two years and two hours to a date:

```sqlexample
SELECT d AS original_date,
       DATEADD(year, 2, d) AS date_plus_two_years,
       TO_TIMESTAMP(d) AS original_timestamp,
       DATEADD(hour, 2, d) AS timestamp_plus_two_hours
  FROM datetest;
```

```output
+---------------+---------------------+-------------------------+--------------------------+
| ORIGINAL_DATE | DATE_PLUS_TWO_YEARS | ORIGINAL_TIMESTAMP      | TIMESTAMP_PLUS_TWO_HOURS |
|---------------+---------------------+-------------------------+--------------------------|
| 2022-04-05    | 2024-04-05          | 2022-04-05 00:00:00.000 | 2022-04-05 02:00:00.000  |
+---------------+---------------------+-------------------------+--------------------------+
```

Add a month to a date in a month with the same or more days than the
resulting month. For example, if the date is January 31, adding a month should not
return February 31.

```sqlexample
SELECT DATEADD(month, 1, '2023-01-31'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-02-28          |
+---------------------+
```

Add a month to a date in a month with fewer days than the resulting month.
Adding a month to February 28 returns March 28.

```sqlexample
SELECT DATEADD(month, 1, '2023-02-28'::DATE) AS date_plus_one_month;
```

```output
+---------------------+
| DATE_PLUS_ONE_MONTH |
|---------------------|
| 2023-03-28          |
+---------------------+
```

Add hours to a time:

```sqlexample
SELECT TO_TIME('05:00:00') AS original_time,
       DATEADD(hour, 3, TO_TIME('05:00:00')) AS time_plus_three_hours;
```

```output
+---------------+-----------------------+
| ORIGINAL_TIME | TIME_PLUS_THREE_HOURS |
|---------------+-----------------------|
| 05:00:00      | 08:00:00              |
+---------------+-----------------------+
```

---
title: TIMESTAMPDIFF
source: https://docs.snowflake.com/en/sql-reference/functions/timestampdiff.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TIMESTAMPDIFF

Calculates the difference between two date, time, or timestamp expressions based on the specified date or time part.
The function returns the result of subtracting the second argument from the third argument.

Alternative for [DATEDIFF](datediff.md).

## Syntax

```sqlsyntax
TIMESTAMPDIFF( <date_or_time_part> , <date_or_time_expr1> , <date_or_time_expr2> )
```

## Arguments

`date_or_time_part`
:   The unit of time. Must be one of the values listed in [Supported date and time parts](../functions-date-time.md) (for example, `month`).
    The value can be a string literal or can be unquoted (for example, `'month'` or `month`).

`date_or_time_expr1`, `date_or_time_expr2`
:   The values to compare. Must be a date, a time, a timestamp, or an expression that can be evaluated to
    a date, a time, or a timestamp. The value `date_or_time_expr1` is subtracted from
    `date_or_time_expr2`.

## Returns

Returns an integer representing the number of units (seconds, days, etc.) difference between `date_or_time_expr2` and
`date_or_time_expr1`.

## Usage notes

* Output values can be negative, for example, -12 days.

* The function supports units of years, quarters, months, weeks, days, hours, minutes, seconds, milliseconds, microseconds, and nanoseconds.
* If `date_or_time_part` is `week` (or any of its variations), the output is controlled by the [WEEK_START](../parameters.md) session parameter. For more details, including examples, see
  [Calendar weeks and weekdays](../functions-date-time.md).
* The unit (for example, `month`) used to calculate the difference determines which parts of the DATE, TIME, or TIMESTAMP field are
  evaluated. So, the unit determines the precision of the result.

  Smaller units are not used, so values are not rounded. For example, even though the difference between January 1, 2021 and
  February 28, 2021 is closer to two months than to one month, the following returns one month:

  ```sqlexample
  DATEDIFF(month, '2021-01-01'::DATE, '2021-02-28'::DATE)
  ```

  For a DATE value:

  > + `year` uses only the year and disregards all the other parts.
  > + `month` uses the month and year.
  > + `day` uses the entire date.

  For a TIME value:

  > + `hour` uses only the hour and disregards all the other parts.
  > + `minute` uses the hour and minute.
  > + `second` uses the hour, minute, and second, but not the fractional seconds.
  > + `millisecond` uses the hour, minute, second, and first three digits of the fractional seconds. Fractional
  >   seconds are not rounded. For example, `DATEDIFF(milliseconds, '2024-02-20 21:18:41.0000', '2024-02-20 21:18:42.1239')` returns 1.123 seconds,
  >   not 1.124 seconds.
  > + `microsecond` uses the hour, minute, second, and first six digits of the fractional seconds. Fractional
  >   seconds are not rounded.
  > + `nanosecond` uses the hour, minute, second, and all nine digits of the fractional seconds.

  For a TIMESTAMP value:

  > The rules match the rules for DATE and TIME data types above. Only the specified unit and larger units are used.

## Examples

For example(s), see [DATEDIFF](datediff.md). (DATEDIFF, TIMEDIFF, and TIMESTAMPDIFF all use the same basic format.)

---
title: TO_ARRAY
source: https://docs.snowflake.com/en/sql-reference/functions/to_array.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# TO_ARRAY

Converts the input expression to an [ARRAY](../data-types-semistructured.md) value.

## Syntax

```sqlsyntax
TO_ARRAY( <expr> )
```

## Arguments

`expr`
:   An expression of any data type.

## Returns

This function returns a value of type ARRAY or NULL:

> * If the input is an ARRAY, or a VARIANT containing an ARRAY value, the value is returned unchanged.
> * If `expr` is a NULL or [JSON null](../../user-guide/semistructured-considerations.md) value, the function returns NULL.
> * For any other value, the value returned is a single-element array that contains this value.

## Usage notes

To create an array that contains more than one element, you can use [ARRAY_CONSTRUCT](array_construct.md)
or [STRTOK_TO_ARRAY](strtok_to_array.md).

## Examples

Create a table, and insert data by calling the TO_ARRAY function:

```sqlexample
CREATE OR REPLACE TABLE array_demo_2 (
  ID INTEGER,
  array1 ARRAY,
  array2 ARRAY);

INSERT INTO array_demo_2 (ID, array1, array2)
  SELECT 1, TO_ARRAY(1), TO_ARRAY(3);

SELECT * FROM array_demo_2;
```

```output
+----+--------+--------+
| ID | ARRAY1 | ARRAY2 |
|----+--------+--------|
|  1 | [      | [      |
|    |   1    |   3    |
|    | ]      | ]      |
+----+--------+--------+
```

Execute a query that shows the single-element arrays created during the insert and
the result of calling ARRAY_CAT to concatenate the two arrays:

```sqlexample
SELECT array1, array2, ARRAY_CAT(array1, array2)
  FROM array_demo_2;
```

```output
+--------+--------+---------------------------+
| ARRAY1 | ARRAY2 | ARRAY_CAT(ARRAY1, ARRAY2) |
|--------+--------+---------------------------|
| [      | [      | [                         |
|   1    |   3    |   1,                      |
| ]      | ]      |   3                       |
|        |        | ]                         |
+--------+--------+---------------------------+
```

This example demonstrates that TO_ARRAY converts a string input expression to an array with a
single element, even when the input expression includes delimiters (such as commas):

```sqlexample
SELECT TO_ARRAY('snowman,snowball,snowcone') AS to_array_result;
```

```output
+-------------------------------+
| TO_ARRAY_RESULT               |
|-------------------------------|
| [                             |
|   "snowman,snowball,snowcone" |
| ]                             |
+-------------------------------+
```

To convert the same string input expression into an array with multiple elements, you can use the
[STRTOK_TO_ARRAY](strtok_to_array.md) function:

```sqlexample
SELECT STRTOK_TO_ARRAY('snowman,snowball,snowcone', ',') AS strtok_to_array_result;
```

```output
+------------------------+
| STRTOK_TO_ARRAY_RESULT |
|------------------------|
| [                      |
|   "snowman",           |
|   "snowball",          |
|   "snowcone"           |
| ]                      |
+------------------------+
```

---
title: TO_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/to_binary.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_BINARY

Converts the input expression to a binary value. For NULL input, the output is NULL.

See also:

* [TRY_TO_BINARY](try_to_binary.md).
* [Binary input and output](../binary-input-output.md).

## Syntax

```sqlsyntax
TO_BINARY( <string_expr> [, '<format>'] )
TO_BINARY( <variant_expr> )
```

## Returns

The return type is BINARY.

## Arguments

**Required:**

`string_expr`
:   A string expression.

**Optional:**

`format`
:   The binary format for conversion: HEX, BASE64, or UTF-8 (see [Binary input and output](../binary-input-output.md)). The default is the value of the
    BINARY_INPUT_FORMAT session parameter. If this parameter is not set, the
    default is HEX.

## Returns

Returns a value of type BINARY.

## Examples

These examples show the output when `TO_BINARY` is called.

This example shows how to convert a `VARCHAR` to `BINARY` and then get it
back in its original form (`VARCHAR`).

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE binary_test (v VARCHAR, b BINARY);
> > INSERT INTO binary_test(v) VALUES ('SNOW');
> > ```
>
> Convert the `VARCHAR` to `BINARY`:
>
> > ```sqlexample
> > UPDATE binary_test SET b = TO_BINARY(HEX_ENCODE(v), 'HEX');
> > ```
>
> Run a query and show the output:
>
> > ```sqlexample
> > SELECT v, HEX_DECODE_STRING(TO_VARCHAR(b, 'HEX')) FROM binary_test;
> > +------+-----------------------------------------+
> > | V    | HEX_DECODE_STRING(TO_VARCHAR(B, 'HEX')) |
> > |------+-----------------------------------------|
> > | SNOW | SNOW                                    |
> > +------+-----------------------------------------+
> > ```

This example shows how to convert a string of UTF-8 characters into
`BINARY`. Note that by default SNOWSQL shows `BINARY` values as a string
of hexadecimal digits, not in UTF-8 and not in the internal `BINARY` format.

> ```sqlexample
> SELECT TO_BINARY('SNOW', 'utf-8');
> +----------------------------+
> | TO_BINARY('SNOW', 'UTF-8') |
> |----------------------------|
> | 534E4F57                   |
> +----------------------------+
> ```

This example is the same as the preceding example, except that this example explicitly
converts the output to hexadecimal digits so that it is more obvious that the
output is a string containing hexadecimal digits:

> ```sqlexample
> SELECT TO_VARCHAR(TO_BINARY('SNOW', 'utf-8'), 'HEX');
> +-----------------------------------------------+
> | TO_VARCHAR(TO_BINARY('SNOW', 'UTF-8'), 'HEX') |
> |-----------------------------------------------|
> | 534E4F57                                      |
> +-----------------------------------------------+
> ```

---
title: TO_BOOLEAN
source: https://docs.snowflake.com/en/sql-reference/functions/to_boolean.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_BOOLEAN

Converts the input text or numeric expression to a [BOOLEAN](../data-types-logical.md) value.

See also:
:   [TRY_TO_BOOLEAN](try_to_boolean.md)

## Syntax

```sqlsyntax
TO_BOOLEAN( <string_or_numeric_expr> )
```

## Arguments

`string_or_numeric_expr`
:   A string expression or numeric expression that can be evaluated to a BOOLEAN value.

## Returns

Returns a BOOLEAN value or NULL.

* Returns TRUE if `string_or_numeric_expr` evaluates to TRUE.
* Returns FALSE if `string_or_numeric_expr` evaluates to FALSE.
* If the input is NULL, returns NULL without reporting an error.

## Usage notes

* For a string expression:

  + `'true'`, `'t'`, `'yes'`, `'y'`, `'on'`, `'1'` return TRUE.
  + `'false'`, `'f'`, `'no'`, `'n'`, `'off'`, `'0'` return FALSE.
  + All other strings return an error.

  The evaluations of the strings are case-insensitive.
* For a numeric expression:

  + `0` returns FALSE.
  + All non-zero numeric values return TRUE.
  + When converting from the [FLOAT](../data-types-numeric.md) data type, non-numeric values, such as
    `NaN` (not a number) and `INF` (infinity), return an error.

## Examples

The following examples use the TO_BOOLEAN function.

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE test_boolean(
  b BOOLEAN,
  n NUMBER,
  s STRING);

INSERT INTO test_boolean VALUES
  (true, 1, 'yes'),
  (false, 0, 'no'),
  (null, null, null);

SELECT * FROM test_boolean;
```

```output
+-------+------+------+
| B     |    N | S    |
|-------+------+------|
| True  |    1 | yes  |
| False |    0 | no   |
| NULL  | NULL | NULL |
+-------+------+------+
```

Convert a text string to a BOOLEAN value:

```sqlexample
SELECT s, TO_BOOLEAN(s) FROM test_boolean;
```

```output
+------+---------------+
| S    | TO_BOOLEAN(S) |
|------+---------------|
| yes  | True          |
| no   | False         |
| NULL | NULL          |
+------+---------------+
```

Convert a number to a BOOLEAN value:

```sqlexample
SELECT n, TO_BOOLEAN(n) FROM test_boolean;
```

```output
+------+---------------+
|    N | TO_BOOLEAN(N) |
|------+---------------|
|    1 | True          |
|    0 | False         |
| NULL | NULL          |
+------+---------------+
```

---
title: TO_CHAR , TO_VARCHAR
source: https://docs.snowflake.com/en/sql-reference/functions/to_char.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_CHAR , TO_VARCHAR

Converts the input expression to a string. For NULL input, the output is NULL.

These functions are synonymous.

## Syntax

```sqlsyntax
TO_CHAR( <expr> )
TO_CHAR( <numeric_expr> [, '<format>' ] )
TO_CHAR( <date_or_time_expr> [, '<format>' ] )
TO_CHAR( <binary_expr> [, '<format>' ] )

TO_VARCHAR( <expr> )
TO_VARCHAR( <numeric_expr> [, '<format>' ] )
TO_VARCHAR( <date_or_time_expr> [, '<format>' ] )
TO_VARCHAR( <binary_expr> [, '<format>' ] )
```

## Arguments

**Required:**

`expr`
:   An expression of any data type.

`numeric_expr`
:   A numeric expression.

`date_or_time_expr`
:   An expression of type DATE, TIME, or TIMESTAMP.

`binary_expr`
:   An expression of type BINARY or VARBINARY.

**Optional:**

`format`
:   The format of the output string:

    * For `numeric_expr`, specifies the SQL format model used to
      interpret the numeric expression. For more information, see
      [SQL format models](../sql-format-models.md).
    * For `date_or_time_expr`, specifies the expected format to parse
      or produce a string. For more information, see [Date and time formats in conversion functions](../functions-conversion.md).

      The default is the current value of the following session
      parameters:

      > + [DATE_OUTPUT_FORMAT](../parameters.md) (for DATE inputs)
      > + [TIME_OUTPUT_FORMAT](../parameters.md) (for TIME inputs)
      > + [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) (for TIMESTAMP inputs)
    * For `binary_expr`, specifies the format in which to produce
      the string (e.g. ‘HEX’, ‘BASE64’ or ‘UTF-8’).

      For more information, see
      [Overview of supported binary formats](../binary-input-output.md).

## Returns

This function returns a value of VARCHAR data type or NULL.

## Usage notes

* For VARIANT, ARRAY, or OBJECT inputs, the output is the string containing
  a JSON document or JSON elementary value (unless VARIANT or OBJECT
  contains an XML tag, in which case the output is a string containing
  an XML document):

  + A string stored in VARIANT is preserved as is (i.e. it is not converted to
    a JSON string).
  + A JSON **null** value is converted to a string containing the word “null”.

## Examples

The following examples convert numbers, timestamps, and dates to strings.

### Examples that convert numbers

Convert numeric values to strings in the specified [formats](../sql-format-models.md):

```sqlexample
CREATE OR REPLACE TABLE convert_numbers_to_strings(column1 NUMBER);

INSERT INTO convert_numbers_to_strings VALUES
  (-12.391),
  (0),
  (-1),
  (0.10),
  (0.01),
  (3987),
  (1.111);

SELECT column1 AS orig_value,
       TO_CHAR(column1, '">"$99.0"<"') AS D2_1,
       TO_CHAR(column1, '">"B9,999.0"<"') AS D4_1,
       TO_CHAR(column1, '">"TME"<"') AS TME,
       TO_CHAR(column1, '">"TM9"<"') AS TM9,
       TO_CHAR(column1, '">"0XXX"<"') AS X4,
       TO_CHAR(column1, '">"S0XXX"<"') AS SX4
  FROM convert_numbers_to_strings;
```

```output
+------------+----------+------------+-------------+------------+--------+---------+
| ORIG_VALUE | D2_1     | D4_1       | TME         | TM9        | X4     | SX4     |
|------------+----------+------------+-------------+------------+--------+---------|
|    -12.391 | >-$12.4< | >   -12.4< | >-1.2391E1< | >-12.391<  | >FFF4< | >-000C< |
|      0.000 | >  $0.0< | >      .0< | >0E0<       | >0.000<    | >0000< | >+0000< |
|     -1.000 | > -$1.0< | >    -1.0< | >-1E0<      | >-1.000<   | >FFFF< | >-0001< |
|      0.100 | >  $0.1< | >      .1< | >1E-1<      | >0.100<    | >0000< | >+0000< |
|      0.010 | >  $0.0< | >      .0< | >1E-2<      | >0.010<    | >0000< | >+0000< |
|   3987.000 | > $##.#< | > 3,987.0< | >3.987E3<   | >3987.000< | >0F93< | >+0F93< |
|      1.111 | >  $1.1< | >     1.1< | >1.111E0<   | >1.111<    | >0001< | >+0001< |
+------------+----------+------------+-------------+------------+--------+---------+
```

The output illustrates how the values are converted to strings based on the specified formats:

* The `>` and `<` symbols are string literals that are included in the output. They make it easier
  to see where spaces are inserted.
* The `D2_1` column shows the values with a `$` printed before the digits.

  + For the `3987` value, there are more digits in the integer part of the number than there are digit positions
    in the format, so all digits are printed as `#` to indicate overflow.
  + For the `0.10`, `0.01`, and `1.111` values, there are more digits in the fractional part of the number
    than there are digit positions in the format, so the fractional values are truncated.
* The `D4_1` column shows that zero values are represented as spaces in the integer parts of the
  numbers.

  + For the `0`, `0.10`, and `0.01` values, a space replaces the zero before the separator.
  + For the `0.10`, `0.01`, and `1.111` values, there are more digits in the fractional part of
    the number than there are digit positions in the format, so the fractional values are truncated.
* The `TME` column shows the values in scientific notation.
* The `TM9` column shows the values as integers or decimal fractions, based on the value of the number.
* The `X4` column shows the values as hexadecimal digits without the fractional parts.
* The `SX4` column shows the values as hexadecimal digits of the absolute value of the numbers and
  includes the numeric sign (`+` or `-`).

This example converts a logarithmic value to a string:

```sqlexample
SELECT TO_VARCHAR(LOG(3,4));
```

```output
+----------------------+
| TO_VARCHAR(LOG(3,4)) |
|----------------------|
| 1.261859507          |
+----------------------+
```

### Examples that convert timestamps and dates

Convert a TIMESTAMP value to a string in the specified format:

```sqlexample
SELECT TO_VARCHAR('2024-04-05 01:02:03'::TIMESTAMP, 'mm/dd/yyyy, hh24:mi hours');
```

```output
+---------------------------------------------------------------------------+
| TO_VARCHAR('2024-04-05 01:02:03'::TIMESTAMP, 'MM/DD/YYYY, HH24:MI HOURS') |
|---------------------------------------------------------------------------|
| 04/05/2024, 01:02 hours                                                   |
+---------------------------------------------------------------------------+
```

Convert a DATE value to a string in the default format:

```sqlexample
SELECT TO_VARCHAR('03-April-2024'::DATE);
```

```output
+-----------------------------------+
| TO_VARCHAR('03-APRIL-2024'::DATE) |
|-----------------------------------|
| 2024-04-03                        |
+-----------------------------------+
```

Convert a DATE value to a string in the specified format:

```sqlexample
SELECT TO_VARCHAR('03-April-2024'::DATE, 'yyyy.mm.dd');
```

```output
+-------------------------------------------------+
| TO_VARCHAR('03-APRIL-2024'::DATE, 'YYYY.MM.DD') |
|-------------------------------------------------|
| 2024.04.03                                      |
+-------------------------------------------------+
```

---
title: TO_DATE , DATE
source: https://docs.snowflake.com/en/sql-reference/functions/to_date.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Date & time functions](../functions-date-time.md)

# TO_DATE , DATE

Converts an input expression to a date:

* For a VARCHAR expression, the result of converting the string to a date.
* For a TIMESTAMP expression, the date from the timestamp.
* For a VARIANT expression:

  > + If the VARIANT contains a string, a string conversion is performed.
  > + If the VARIANT contains a date, the date value is preserved as is.
  > + If the VARIANT contains a JSON null value, the output is NULL.
* For NULL input, the output is NULL.

For all other values, a conversion error is generated.

See also:
:   [TRY_TO_DATE](try_to_date.md)

## Syntax

```sqlsyntax
TO_DATE( <string_expr> [, <format> ] )
TO_DATE( <timestamp_expr> )
TO_DATE( '<integer>' )
TO_DATE( <variant_expr> )

DATE( <string_expr> [, <format> ] )
DATE( <timestamp_expr> )
DATE( '<integer>' )
DATE( <variant_expr> )
```

## Arguments

**Required:**

One of:

> `string_expr`
> :   String from which to extract a date. For example: `'2024-01-31'`.
>
> `timestamp_expr`
> :   A TIMESTAMP expression. The DATE portion of the TIMESTAMP value is extracted.
>
> `'integer'`
> :   An expression that evaluates to a string containing an integer. For example: `'15000000'`. Depending
>     on the magnitude of the string, it can be interpreted as seconds, milliseconds, microseconds, or
>     nanoseconds. For details, see the Usage notes for this function.
>
> `variant_expr`
> :   An expression of type VARIANT.
>
>     The VARIANT must contain one of the following:
>
>     * A string from which to extract a date.
>     * A date.
>     * A string containing an integer that represents the number of seconds or milliseconds.
>
>     Although TO_DATE accepts a TIMESTAMP value, it does not accept a TIMESTAMP value inside a VARIANT.

**Optional:**

`format`
:   Date format specifier for `string_expr` or
    [AUTO](../date-time-input-output.md),
    which specifies that Snowflake automatically detects the format to use. For more information,
    see [Date and time formats in conversion functions](../functions-conversion.md).

    The default is the current value of the [DATE_INPUT_FORMAT](../parameters.md)
    session parameter (default `AUTO`).

## Returns

The data type of the returned value is DATE. If the input is NULL, returns NULL.

## Usage notes

* The display format for dates in the output is determined by the [DATE_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `YYYY-MM-DD`).
* If the format of the input parameter is a string that contains an integer:

  + After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
    microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

    - If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
      a number of seconds.
    - If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
      as milliseconds.
    - If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
      treated as microseconds.
    - If the value is greater than or equal to 31536000000000000, then the value is
      treated as nanoseconds.
  + If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
    one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
    nanoseconds.

## Examples

The following examples use the TO_DATE and DATE functions.

### Basic example

```sqlexample
SELECT TO_DATE('2024-05-10'), DATE('2024-05-10');
```

```output
+-----------------------+--------------------+
| TO_DATE('2024-05-10') | DATE('2024-05-10') |
|-----------------------+--------------------|
| 2024-05-10            | 2024-05-10         |
+-----------------------+--------------------+
```

### Example that extracts the date from a timestamp

The TO_DATE function accepts TIMESTAMP values and strings in TIMESTAMP format, but discards the time
information (hours, minutes, and so on).

Create and load the table:

```sqlexample
CREATE OR REPLACE TABLE date_from_timestamp(ts TIMESTAMP);

INSERT INTO date_from_timestamp(ts)
  VALUES (TO_TIMESTAMP('2024.10.02 04:00:00', 'YYYY.MM.DD HH:MI:SS'));
```

Query the TIMESTAMP value in the table:

```sqlexample
SELECT ts FROM date_from_timestamp;
```

```output
+-------------------------+
| TS                      |
|-------------------------|
| 2024-10-02 04:00:00.000 |
+-------------------------+
```

Query the TIMESTAMP value in the table using the TO_DATE function:

```sqlexample
SELECT TO_DATE(ts) FROM date_from_timestamp;
```

```output
+-------------+
| TO_DATE(TS) |
|-------------|
| 2024-10-02  |
+-------------+
```

### Examples that use different input formats

The following examples use the TO_DATE and DATE functions with different input format
specifications. The date format in the returned output is determined by the
setting of the [DATE_OUTPUT_FORMAT](../parameters.md) session parameter.

```sqlexample
SELECT TO_DATE('2024.05.10', 'YYYY.MM.DD'), DATE('2024.05.10', 'YYYY.MM.DD');
```

```output
+-------------------------------------+----------------------------------+
| TO_DATE('2024.05.10', 'YYYY.MM.DD') | DATE('2024.05.10', 'YYYY.MM.DD') |
|-------------------------------------+----------------------------------|
| 2024-05-10                          | 2024-05-10                       |
+-------------------------------------+----------------------------------+
```

```sqlexample
SELECT TO_DATE('2024-05-10', 'AUTO'), DATE('2024-05-10', 'AUTO');
```

```output
+-------------------------------+----------------------------+
| TO_DATE('2024-05-10', 'AUTO') | DATE('2024-05-10', 'AUTO') |
|-------------------------------+----------------------------|
| 2024-05-10                    | 2024-05-10                 |
+-------------------------------+----------------------------+
```

```sqlexample
SELECT TO_DATE('05/10/2024', 'MM/DD/YYYY'), DATE('05/10/2024', 'MM/DD/YYYY');
```

```output
+-------------------------------------+----------------------------------+
| TO_DATE('05/10/2024', 'MM/DD/YYYY') | DATE('05/20/2024', 'MM/DD/YYYY') |
|-------------------------------------+----------------------------------|
| 2024-05-10                          | 2024-05-20                       |
+-------------------------------------+----------------------------------+
```

### Examples that use different output formats

The following examples show the results of queries when the [DATE_OUTPUT_FORMAT](../parameters.md)
session parameter is set to `DD-MON-YYYY`:

```sqlexample
ALTER SESSION SET DATE_OUTPUT_FORMAT = 'DD-MON-YYYY';
```

```sqlexample
SELECT TO_DATE('2024-05-10', 'YYYY-MM-DD'), DATE('2024-05-10', 'YYYY-MM-DD');
```

```output
+-------------------------------------+----------------------------------+
| TO_DATE('2024-05-10', 'YYYY-MM-DD') | DATE('2024-05-10', 'YYYY-MM-DD') |
|-------------------------------------+----------------------------------|
| 10-May-2024                         | 10-May-2024                      |
+-------------------------------------+----------------------------------+
```

```sqlexample
SELECT TO_DATE('05/10/2024', 'MM/DD/YYYY'), DATE('05/10/2024', 'MM/DD/YYYY');
```

```output
+-------------------------------------+----------------------------------+
| TO_DATE('05/10/2024', 'MM/DD/YYYY') | DATE('05/10/2024', 'MM/DD/YYYY') |
|-------------------------------------+----------------------------------|
| 10-May-2024                         | 10-May-2024                      |
+-------------------------------------+----------------------------------+
```

### Examples that use a string that contains an integer

When the input is a string that contains an integer, the magnitude of that integer affects whether it is interpreted
as seconds, milliseconds, etc. The following example shows how the function chooses the units to use (seconds, milliseconds,
microseconds, or nanoseconds), based on the magnitude of the value.

Create and load the table:

```sqlexample
CREATE OR REPLACE TABLE demo1 (
  description VARCHAR,
  value VARCHAR -- string rather than bigint
);

INSERT INTO demo1 (description, value) VALUES
  ('Seconds',      '31536000'),
  ('Milliseconds', '31536000000'),
  ('Microseconds', '31536000000000'),
  ('Nanoseconds',  '31536000000000000');
```

Pass the strings to the function:

```sqlexample
SELECT description,
       value,
       TO_TIMESTAMP(value),
       TO_DATE(value)
  FROM demo1
  ORDER BY value;
```

```output
+--------------+-------------------+-------------------------+----------------+
| DESCRIPTION  | VALUE             | TO_TIMESTAMP(VALUE)     | TO_DATE(VALUE) |
|--------------+-------------------+-------------------------+----------------|
| Seconds      | 31536000          | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Milliseconds | 31536000000       | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Microseconds | 31536000000000    | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Nanoseconds  | 31536000000000000 | 1971-01-01 00:00:00.000 | 1971-01-01     |
+--------------+-------------------+-------------------------+----------------+
```

---
title: TO_DECFLOAT
source: https://docs.snowflake.com/en/sql-reference/functions/to_decfloat.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_DECFLOAT

Converts an expression to a decimal floating-point number ([DECFLOAT](../data-types-numeric.md)).

See also:
:   [TRY_TO_DECFLOAT](try_to_decfloat.md)

## Syntax

```sqlsyntax
TO_DECFLOAT( <expr> [ , '<format>' ] )
```

## Arguments

**Required:**

`expr`
:   An expression of a numeric, character, or Boolean type.

**Optional:**

`'format'`
:   If the expression evaluates to a string, the function accepts
    an optional format model. For more information, see
    [SQL format models](../sql-format-models.md). The format model
    specifies the format of the input string, not the format of the
    output value.

## Returns

This function returns a value of DECFLOAT data type.

If `expr` is NULL, the function returns NULL.

## Usage notes

The special values `'NaN'` (not a number), `'inf'` (infinity),
and `'-inf'` (negative infinity) aren’t supported.

## Examples

After you create a table with columns of different data types, call the TO_DECFLOAT
function to convert the values in each of those columns:

```sqlexample
CREATE OR REPLACE TABLE to_decfloat_demo (d DECIMAL(7, 2), v VARCHAR);
INSERT INTO to_decfloat_demo (d, v) SELECT 1.1, '2.2';
SELECT TO_DECFLOAT(d), TO_DECFLOAT(v) FROM to_decfloat_demo;
```

```output
+----------------+----------------+
| TO_DECFLOAT(D) | TO_DECFLOAT(V) |
|----------------+----------------|
| 1.1            | 2.2            |
+----------------+----------------+
```

---
title: TO_DECIMAL , TO_NUMBER , TO_NUMERIC
source: https://docs.snowflake.com/en/sql-reference/functions/to_decimal.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_DECIMAL , TO_NUMBER , TO_NUMERIC

Converts an input expression to a fixed-point number.

These functions are synonymous.

See also:
:   [TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC](try_to_decimal.md)

## Syntax

```sqlsyntax
TO_DECIMAL( <expr> [, '<format>' ] [, <precision> [, <scale> ] ] )

TO_NUMBER( <expr> [, '<format>' ] [, <precision> [, <scale> ] ] )

TO_NUMERIC( <expr> [, '<format>' ] [, <precision> [, <scale> ] ] )
```

## Arguments

**Required:**

`expr`
:   An expression of a numeric, character, or variant type.

**Optional:**

`format`
:   The SQL format model used to parse the input `expr` and return. For more
    information, see [SQL format models](../sql-format-models.md).

`precision`
:   The maximum number of decimal digits in the resulting number, from 1
    to 38. In Snowflake, precision isn’t used to determine the
    number of bytes that are needed to store the number and doesn’t have any effect
    on efficiency, so the default is the maximum (38).

`scale`
:   The number of fractional decimal digits (from 0 to `precision` - 1).
    0 indicates no fractional digits; that is, an integer number. The default scale
    is 0.

## Returns

The function returns a value of type NUMBER with the following defaults:

* If the `precision` isn’t specified, then it defaults to 38.
* If the `scale` isn’t specified, then it defaults to 0.

For NULL input, returns NULL.

## Usage notes

* For fixed-point numbers:

  + Numbers with different scales are converted by either adding zeros to the right (if the scale needs to be increased) or by
    reducing the number of fractional digits by rounding (if the scale needs to be decreased).
  + Note that casts of fixed-point numbers to fixed-point numbers that increase scale might fail.
* For floating-point numbers:

  + Numbers are converted if they are within the representable range, given the scale.
  + The conversion between binary and decimal fractional numbers is not precise. This might result in loss of precision or
    out-of-range errors.
  + Values of infinity and NaN (not-a-number) result in conversion errors.
* Strings are converted as decimal, integer, fractional, or floating-point numbers.

  + For fractional input, the precision is deduced as the number of digits after the point.
  + For floating-point input, omitting the mantissa or exponent is allowed and is interpreted as 0. Thus, `E` is parsed as 0.
* For VARIANT input:

  + If the variant contains a fixed-point or a floating-point numeric value, an appropriate numeric conversion is performed.
  + If the variant contains a string, a string conversion is performed.
  + If the variant contains a Boolean value, the result is 0 or 1 (for false and true, correspondingly).
  + If the variant contains JSON `null` value, the output is NULL.

## Examples

Create a table with a VARCHAR column, then retrieve the string values from the table and pass those values
to the TO_NUMBER function with different `precision` and `scale` values.

```sqlexample
CREATE OR REPLACE TABLE number_conv(expr VARCHAR);
INSERT INTO number_conv VALUES ('12.3456'), ('98.76546');

SELECT expr,
       TO_NUMBER(expr),
       TO_NUMBER(expr, 10, 1),
       TO_NUMBER(expr, 10, 8)
  FROM number_conv;
```

The query returns the following output:

```output
+----------+-----------------+------------------------+------------------------+
| EXPR     | TO_NUMBER(EXPR) | TO_NUMBER(EXPR, 10, 1) | TO_NUMBER(EXPR, 10, 8) |
|----------+-----------------+------------------------+------------------------|
| 12.3456  |              12 |                   12.3 |            12.34560000 |
| 98.76546 |              99 |                   98.8 |            98.76546000 |
+----------+-----------------+------------------------+------------------------+
```

Try a query on the same table using the TO_NUMBER function to return a number with the `precision` of `10`
and the scale of `9`.

```sqlexample
SELECT expr, TO_NUMBER(expr, 10, 9) FROM number_conv;
```

With the `precision` argument set to `10`, the maximal number of decimal digits in the results is 10.
Because both values in the table have two digits before the decimal point and `scale` is set to `9`,
the query returns an error because the results would return 11 digits.

```output
100039 (22003): Numeric value '12.3456' is out of range
```

Use different [format elements](../sql-format-models.md) with the TO_DECIMAL
function in a query:

```sqlexample
SELECT column1,
       TO_DECIMAL(column1, '99.9') as D0,
       TO_DECIMAL(column1, '99.9', 9, 5) as D5,
       TO_DECIMAL(column1, 'TM9', 9, 5) as TD5
  FROM VALUES ('1.0'), ('-12.3'), ('0.0'), ('- 0.1');
```

The query returns the following output:

```output
+---------+-----+-----------+-----------+
| COLUMN1 |  D0 |        D5 |       TD5 |
|---------+-----+-----------+-----------|
| 1.0     |   1 |   1.00000 |   1.00000 |
| -12.3   | -12 | -12.30000 | -12.30000 |
| 0.0     |   0 |   0.00000 |   0.00000 |
| - 0.1   |   0 |  -0.10000 |  -0.10000 |
+---------+-----+-----------+-----------+
```

The output shows that the `TM9` text-minimal format element prints precisely the number of
digits in the fractional part based on the specified scale. For more information, see
[Text-minimal numeric formats](../sql-format-models.md).

Convert a number that uses a comma to separate groups of digits:

```sqlexample
SELECT column1,
       TO_DECIMAL(column1, '9,999.99', 6, 2) as convert_number
  FROM VALUES ('3,741.72');
```

The query returns the following output:

```output
+----------+----------------+
| COLUMN1  | CONVERT_NUMBER |
|----------+----------------|
| 3,741.72 |        3741.72 |
+----------+----------------+
```

Convert a currency value that uses a comma to separate groups of digits:

```sqlexample
SELECT column1,
       TO_DECIMAL(column1, '$9,999.99', 6, 2) as convert_currency
  FROM VALUES ('$3,741.72');
```

The query returns the following output:

```output
+-----------+------------------+
| COLUMN1   | CONVERT_CURRENCY |
|-----------+------------------|
| $3,741.72 |          3741.72 |
+-----------+------------------+
```

Use the [X format element](../sql-format-models.md) with the TO_DECIMAL function
to convert a hexadecimal value to a decimal value:

```sqlexample
SELECT TO_DECIMAL('ae5', 'XXX');
```

The query returns the following output:

```output
+--------------------------+
| TO_DECIMAL('AE5', 'XXX') |
|--------------------------|
|                     2789 |
+--------------------------+
```

The number of digits in the format element must be equal to or greater than the number of digits in the
expression. For example, try to run the following query:

```sqlexample
SELECT TO_DECIMAL('ae5', 'XX');
```

The query returns an error:

```output
100140 (22007): Can't parse 'ae5' as number with format 'XX'
```

---
title: TO_DOUBLE
source: https://docs.snowflake.com/en/sql-reference/functions/to_double.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_DOUBLE

Converts an expression to a double-precision floating-point number.

For NULL input, the result is NULL.

See also:
:   [TRY_TO_DOUBLE](try_to_double.md)

## Syntax

```sqlsyntax
TO_DOUBLE( <expr> [, '<format>' ] )
```

## Arguments

`expr`
:   An expression of a numeric, character, or variant type.

`format`
:   If the expression evaluates to a string, then the function accepts
    an optional format model. Format models are described at
    [SQL format models](../sql-format-models.md). The format model
    specifies the format of the input string, not the format of the
    output value.

## Returns

This function returns a value of FLOAT data type.

If `expr` is NULL, the function returns NULL.

## Usage notes

* Fixed-point numbers are converted to floating point; the conversion
  cannot fail, but might result in loss of precision.
* Strings are converted as decimal integer or fractional numbers,
  scientific notation and special values (**nan**, **inf**, **infinity**)
  are accepted.
* For VARIANT input:

  > + If the variant contains a fixed-point value, the numeric conversion
  >   will be performed.
  > + If the variant contains a floating-point value, the value will be
  >   preserved unchanged.
  > + If the variant contains a string, a string conversion will be
  >   performed.
  > + If the variant contains a Boolean value, the result will be 0 or 1
  >   (for false and true, correspondingly).
  > + If the variant contains JSON **null** value, the output will be
  >   NULL.
* Conversion of decimal fractions to binary float and back is not precise
  (that is, printing of a floating-point number converted from decimal representation
  might produce a slightly different number). If precise representation of decimal
  fractions is required, use fixed-point numbers.

## Examples

After creating a table with columns of different data types, this script calls
TO_DOUBLE on each of those columns:

```sqlexample
CREATE OR REPLACE TABLE double_demo (d DECIMAL(7, 2), v VARCHAR, o VARIANT);
INSERT INTO double_demo (d, v, o) SELECT 1.1, '2.2', TO_VARIANT(3.14);
SELECT TO_DOUBLE(d), TO_DOUBLE(v), TO_DOUBLE(o) FROM double_demo;
```

```output
+--------------+--------------+--------------+
| TO_DOUBLE(D) | TO_DOUBLE(V) | TO_DOUBLE(O) |
|--------------+--------------+--------------|
|          1.1 |          2.2 |         3.14 |
+--------------+--------------+--------------+
```

The following example shows that converting from a binary float back to a number is not precise:

```sqlexample
SELECT TO_DOUBLE(1.1)::NUMBER(38, 18);
```

```output
+--------------------------------+
| TO_DOUBLE(1.1)::NUMBER(38, 18) |
|--------------------------------|
|           1.100000000000000089 |
+--------------------------------+
```

---
title: TO_FILE
source: https://docs.snowflake.com/en/sql-reference/functions/to_file.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# TO_FILE

Constructs a value of type [FILE](../data-types-unstructured.md) from a file location or from metadata.

## Syntax

Use one of the following:

```
TO_FILE( <stage_name>, <relative_path> )

TO_FILE( <file_url> )

TO_FILE( <metadata> )
```

## Arguments

Specify the file by providing:

* Both `stage_name` and `relative_path`
* `file_url`
* `metadata`

Only one of these methods can be used at a time.

`stage_name`
:   The name of the stage where the file is located, as a string, in the form `'@stage_name'`.

`relative_path`
:   The path to the file on the stage specified by `stage_name` as a string.

`file_url`
:   A valid stage or scoped file URL as a string.

`metadata`
:   An OBJECT containing the required FILE attributes. A FILE must have CONTENT_TYPE, SIZE, ETAG, and LAST_MODIFIED fields.
    It must also specify the file’s location in one of the following ways:

    * Both STAGE and RELATIVE_PATH
    * STAGE_FILE_URL
    * SCOPED_FILE_URL

## Returns

A [FILE](../data-types-unstructured.md) that represents the staged file.

## Usage notes

Raises an error when:

* The supplied URL is not valid.
* The file is on a stage that the user lacks privileges to access.
* The supplied metadata doesn’t contain the required FILE fields.

## Examples

### Creating FILE objects using TO_FILE

A simple use of the TO_FILE function with a stage name and relative path:

```sqlexample
SELECT TO_FILE('@mystage', 'image.png');
```

Result:

```output
+-----------------------------------------------------+
| TO_FILE('@MYSTAGE', 'IMAGE.PNG')                    |
|-----------------------------------------------------|
| {                                                   |
|   "CONTENT_TYPE": "image/png",                      |
|   "ETAG": "2859efde6e26491810f619668280a2ce",       |
|   "LAST_MODIFIED": "Thu, 18 Sep 2025 09:02:00 GMT", |
|   "RELATIVE_PATH": "image.png",                     |
|   "SIZE": 23698,                                    |
|   "STAGE": "@MYDB.MYSCHEMA.MYSTAGE"                 |
| }                                                   |
+-----------------------------------------------------+
```

A simple use of the TO_FILE function with a staged file URL:

```sqlexample
SELECT TO_FILE(BUILD_STAGE_FILE_URL('@mystage', 'image.png'));
```

Result:

```output
+--------------------------------------------------------------------------------------------------------------------+
| TO_FILE(BUILD_STAGE_FILE_URL('@MYSTAGE', 'IMAGE.PNG'))                                                             |
|--------------------------------------------------------------------------------------------------------------------|
| {                                                                                                                  |
|   "CONTENT_TYPE": "image/png",                                                                                     |
|   "ETAG": "..."                                                                                                    |
|   "LAST_MODIFIED": "Wed, 11 Dec 2024 20:24:00 GMT",                                                                |
|   "RELATIVE_PATH": "image.png",                                                                                    |
|   "SIZE": 105859,                                                                                                  |
|   "STAGE": "@MYDB.MYSCHEMA.MYSTAGE",                                                                               |
|   "STAGE_FILE_URL": "https://snowflake.account.snowflakecomputing.com/api/files/MYDB/MYSCHEMA/MYSTAGE/image.png"   |
| }                                                                                                                  |
+--------------------------------------------------------------------------------------------------------------------+
```

Or use the FILE_URL from a file in the directory of your stage:

```sqlexample
SELECT TO_FILE(FILE_URL) FROM DIRECTORY(@mystage) LIMIT 1;
```

```output
+--------------------------------------------------------------------------------------------------------------------+
| TO_FILE(FILE_URL)                                                                                                  |
|--------------------------------------------------------------------------------------------------------------------|
| {                                                                                                                  |
|   "CONTENT_TYPE": "image/png",                                                                                     |
|   "ETAG": "..."                                                                                                    |
|   "LAST_MODIFIED": "Wed, 11 Dec 2024 20:24:00 GMT",                                                                |
|   "RELATIVE_PATH": "image.png",                                                                                    |
|   "SIZE": 105859,                                                                                                  |
|   "STAGE": "@MYDB.MYSCHEMA.MYSTAGE",                                                                               |
|   "STAGE_FILE_URL": "https://snowflake.account.snowflakecomputing.com/api/files/MYDB/MYSCHEMA/MYSTAGE/image.png"   |
| }                                                                                                                  |
+--------------------------------------------------------------------------------------------------------------------+
```

This example uses TO_FILE function directly with a scoped file URL:

```sqlexample
SELECT TO_FILE(`https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV`);

+------------------------------------------------------------------------------------------------------------------------------------------------+
| TO_FILE(https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV                      |
|------------------------------------------------------------------------------------------------------------------------------------------------|
| {                                                                                                                                              |
|   "CONTENT_TYPE": "image/png",                                                                                                                 |
|   "ETAG": "..."                                                                                                                                |
|   "LAST_MODIFIED": "Wed, 11 Dec 2024 20:24:00 GMT",                                                                                            |
|   "SCOPED_FILE_URL": "https://snowflake.account.snowflakecomputing.com/api/files/01ba4df2-0100-0001-0000-00040002e2b6/299017/Y6JShH6KjV",      |
|   "SIZE": 105859                                                                                                                               |
| }                                                                                                                                              |
+-----------------------------------------------------------------------------------------------------------------------------------------------+|
```

This shows an example of constructing a FILE from an object containing the required metadata:

```sqlexample
SELECT TO_FILE(OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.png', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/png'));

+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| TO_FILE(OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'IMAGE.PNG', 'ETAG', '<ETAG value>', 'LAST_MODIFIED', 'WED, 11 DEC 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'IMAGE/PNG')) |
|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| {                                                                                                                                                                                                  |
|   "CONTENT_TYPE": "image/png",                                                                                                                                                                     |
|   "ETAG": "<ETAG value>>"                                                                                                                                                                          |
|   "LAST_MODIFIED": "Wed, 11 Dec 2024 20:24:00 GMT",                                                                                                                                                |
|   "RELATIVE_PATH": "image.png",                                                                                                                                                                    |
|   "SIZE": 105859,                                                                                                                                                                                  |
|   "STAGE": "@MYDB.MYSCHEMA.MYSTAGE"                                                                                                                                                                |
| }                                                                                                                                                                                                  |
+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

### Adding FILE to a table

The following example demonstrates how to create FILE and store it in a table, then perform various operations using that
column, including saving and loading from Parquet, SnowPipe, datasets, materialized views, dynamic tables, and cloning
with time travel.

Creating a table with a FILE column:

```sqlexample
CREATE OR REPLACE TABLE sample_table (a INT, f FILE NOT NULL);
DESCRIBE TABLE sample_table;
INSERT INTO sample_table SELECT 1, TO_FILE('@mystage', 'image.png');
INSERT INTO sample_table SELECT 1, TO_FILE('@mystage', relative_path) FROM DIRECTORY('@mystage');
SELECT * FROM sample_table WHERE fl_get_file_type(f) = 'image';
```

To write a table containing a FILE column to a stage as a Parquet file and load it back:

```sqlexample
-- Write to stage as Parquet
CREATE OR REPLACE STAGE test_stage_parquet;

CREATE OR REPLACE FILE FORMAT parquet_format
  TYPE = 'PARQUET'
  USE_LOGICAL_TYPE = TRUE;

COPY INTO @test_stage_parquet/file_copy.parquet FROM sample_table
  FILE_FORMAT = (FORMAT_NAME = parquet_format) HEADER = TRUE ->> SELECT "rows_unloaded" FROM $1;
ALTER STAGE test_stage_parquet SET DIRECTORY = (ENABLE=TRUE);
ALTER STAGE test_stage_parquet REFRESH;

-- Read Parquet files back from stage
SELECT * FROM @TEST_STAGE_PARQUET/file_copy.parquet_0_0_0.snappy.parquet(FILE_FORMAT => parquet_format);
SELECT * FROM @TEST_STAGE_PARQUET (PATTERN => '.*.parquet', FILE_FORMAT => parquet_format);
```

Create a dataset from a Parquet file:

```sqlexample
CREATE OR REPLACE DATASET mydataset;
ALTER DATASET mydataset ADD VERSION 'v1' FROM
  (SELECT * FROM @TEST_STAGE_PARQUET/file_copy.parquet_0_0_0.snappy.parquet(
    FILE_FORMAT => my_parquet_format))
  COMMENT = 'test dataset';
```

Copy Parquet files into a table:

```sqlexample
CREATE OR REPLACE TABLE t1_copy_parquet (a INT, f OBJECT NOT NULL);

COPY INTO t1_copy_parquet
  FROM @test_stage_parquet
  FILE_FORMAT = (FORMAT_NAME = parquet_format)
  MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE;

SELECT FL_GET_STAGE(f), FL_GET_RELATIVE_PATH(f) FROM t1_copy_parquet;
```

Create a Snowpipe:

```sqlexample
CREATE OR REPLACE TABLE t1_copy_parquet_snowpipe (f OBJECT NOT NULL);
CREATE OR REPLACE PIPE test_pipe AS
  COPY INTO t1_copy_parquet_snowpipe
  FROM @test_stage_parquet
FILE_FORMAT = (FORMAT_NAME = my_parquet_format)
MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE;

ALTER PIPE TEST_PIPE REFRESH;
```

Create a materialized view or a dynamic table from the table:

```sqlexample
CREATE OR REPLACE MATERIALIZED VIEW MV AS SELECT * FROM SAMPLE_TABLE;

CREATE OR REPLACE DYNAMIC TABLE sample_dynamic_table
  WAREHOUSE = my_warehouse
  TARGET_LAG = '60 minutes'
AS SELECT f FROM sample_table;
```

Store files in an array in a table column:

```sqlexample
CREATE OR REPLACE TABLE files_array_table(files ARRAY);
INSERT INTO files_array_table SELECT ARRAY_CONSTRUCT(TO_FILE('@mystage', 'image.png'));
CREATE OR REPLACE TABLE files_array_table_copy(files ARRAY);
INSERT INTO files_array_table_copy SELECT files[0] FROM files_array_table;
```

### Examples of errors

These examples illustrate common mistakes in using TO_FILE that result in the function raising an error.

The following example constructs a FILE from a metadata object but omits a required field:

```sqlexample
SELECT TO_FILE(OBJECT_CONSTRUCT('RELATIVE_PATH', 'image.png', 'ETAG', '<ETAG value>',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/png'));
```

```output
Invalid file metadata. Must provide (STAGE and RELATIVE_PATH), SCOPED_FILE_URL, or STAGE_FILE_URL.
```

The following example is similar, but omits the ETAG field, which is requred.

```sqlexample
SELECT TO_FILE(OBJECT_CONSTRUCT('STAGE', 'MYSTAGE', 'RELATIVE_PATH', 'image.png',
  'LAST_MODIFIED', 'Wed, 11 Dec 2024 20:24:00 GMT', 'SIZE', 105859, 'CONTENT_TYPE', 'image/png'));
```

```output
Invalid file metadata. Missing required fields: ETAG.
```

The following example shows attempts to GROUP BY, ORDER BY, and CLUSTER BY a FILE column, which is not supported
because FILE values cannot be compared.

```sqlexample
SELECT f, count(*) FROM sample_table GROUP BY f;
-- Expressions of type FILE cannot be used as GROUP BY keys

SELECT * FROM sample_table ORDER by f;
-- Expressions of type FILE cannot be used as ORDER BY keys

CREATE OR REPLACE TABLE cluster_to_file (a int, url string) CLUSTER BY (to_file(url));
-- Unsupported type 'FILE' for clustering keys
```

This final example uses an incorrect stage name, specifically a slash at the end of the stage name.
Snowflake already adds a slash between the stage name and relative path, so this results in
two slashes, and the combined stage path does not specify any file.

```sqlexample
SELECT TO_FILE('@mystage/', 'image.png');
```

```output
Remote file '@mystage//image.png' was not found. There are several potential causes.
The file might not exist. The required credentials may be missing or invalid. If you
are running a copy command, please make sure files are not deleted when they are
being loaded or files are not being loaded into two different tables concurrently
with auto purge option.
```

## Known limitations

* TO_FILE cannot be used in INSERT INTO TABLE <t> VALUES clause. Use INSERT INTO TABLE <t> SELECT instead.

---
title: TO_GEOGRAPHY
source: https://docs.snowflake.com/en/sql-reference/functions/to_geography.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# TO_GEOGRAPHY

Parses an input and returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

See also:
:   [TRY_TO_GEOGRAPHY](try_to_geography.md) , [ST_GEOGRAPHYFROMWKB](st_geographyfromwkb.md) , [ST_GEOGRAPHYFROMWKT](st_geographyfromwkt.md)

## Syntax

Use one of the following:

```sqlsyntax
TO_GEOGRAPHY( <varchar_expression> [ , <allow_invalid> ] )

TO_GEOGRAPHY( <binary_expression> [ , <allow_invalid> ] )

TO_GEOGRAPHY( <variant_expression> [ , <allow_invalid> ] )

TO_GEOGRAPHY( <geometry_expression> [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression that represents a valid geometric object in one of the following formats:

    * WKT (well-known text).
    * WKB (well-known binary) in hexadecimal format (without a leading `0x`).
    * EWKT (extended well-known text).
    * EWKB (extended well-known binary) in hexadecimal format (without a leading `0x`).
    * GeoJSON.

`binary_expression`
:   The argument must be a binary expression in WKB or EWKB format.

`variant_expression`
:   The argument must be an OBJECT in GeoJSON format.

`geometry_expression`
:   The argument must be an expression of type GEOMETRY with the SRID 4326.

**Optional:**

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as one of the supported formats (WKT, WKB, EWKT, EWKB, GeoJSON).
* Issues an error if the input format is EWKT or EWKB and the SRID is not 4326.
  See the [note on EWKT and EWKB handling](../data-types-geospatial.md).
* To construct a GEOGRAPHY object from WKT or EWKT input, you can also use [ST_GEOGRAPHYFROMWKT](st_geographyfromwkt.md).
* To construct a GEOGRAPHY object from WKB or EWKB input, you can also use [ST_GEOGRAPHYFROMWKB](st_geographyfromwkb.md).

* For the coordinates in WKT, EWKT, and GeoJSON, longitude appears before latitude (for example, `POINT(lon lat)`).

## Examples

This shows a simple use of the TO_GEOGRAPHY function with VARCHAR data:

> ```sqlexample
> select TO_GEOGRAPHY('POINT(-122.35 37.55)');
> ```
>
> ```output
> +--------------------------------------+
> | TO_GEOGRAPHY('POINT(-122.35 37.55)') |
> |--------------------------------------|
> | POINT(-122.35 37.55)                 |
> +--------------------------------------+
> ```

The following example returns the GEOGRAPHY object for a geospatial object with a Z coordinate described in WKT format:

> ```sqlexample
> select TO_GEOGRAPHY('POINTZ(-122.35 37.55 30)');
> ```
>
> ```output
> +------------------------------------------+
> | TO_GEOGRAPHY('POINTZ(-122.35 37.55 30)') |
> |------------------------------------------|
> | POINTZ(-122.35 37.55 30)                 |
> +------------------------------------------+
> ```

---
title: TO_GEOMETRY
source: https://docs.snowflake.com/en/sql-reference/functions/to_geometry.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# TO_GEOMETRY

Parses an input and returns a value of type [GEOMETRY](../data-types-geospatial.md).

See also:
:   [TRY_TO_GEOMETRY](try_to_geometry.md) , [ST_GEOMETRYFROMWKB](st_geometryfromwkb.md) , [ST_GEOMETRYFROMWKT](st_geometryfromwkt.md)

## Syntax

Use one of the following:

```sqlsyntax
TO_GEOMETRY( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

TO_GEOMETRY( <binary_expression> [ , <srid> ] [ , <allow_invalid> ] )

TO_GEOMETRY( <variant_expression> [ , <srid> ] [ , <allow_invalid> ] )

TO_GEOMETRY( <geography_expression> [ , <srid> ] [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression that represents a valid geometric object in one of the following formats:

    * WKT (well-known text)
    * WKB (well-known binary) in hexadecimal format (without a leading `0x`)
    * EWKT (extended well-known text)
    * EWKB (extended well-known binary) in hexadecimal format (without a leading `0x`)
    * GeoJSON

`binary_expression`
:   The argument must be a binary expression in WKB or EWKB format.

`variant_expression`
:   The argument must be an OBJECT in GeoJSON format.

`geography_expression`
:   The argument must be an expression of type GEOGRAPHY.

**Optional:**

`srid`
:   The integer value of the SRID to use.

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type [GEOMETRY](../data-types-geospatial.md).

## Usage notes

* Issues an error if the input cannot be parsed as one of the supported formats (WKT, WKB, EWKT, EWKB, GeoJSON).
* For GeoJSON, WKT, and WKB input, if the `srid` argument is not specified, the resulting GEOMETRY object has the SRID
  set to 0.
* To construct a GEOMETRY object from WKT or EWKT input, you can also use [ST_GEOMETRYFROMWKT](st_geometryfromwkt.md).
* To construct a GEOMETRY object from WKB or EWKB input, you can also use [ST_GEOMETRYFROMWKB](st_geometryfromwkb.md).

## Examples

The following example shows how to use the TO_GEOMETRY function to convert an object represented in WKT to a GEOMETRY object. The
example doesn’t specify the `srid` argument, and the SRID isn’t specified in the input representation of the object, so
the SRID is set to `0`.

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT TO_GEOMETRY('POINT(1820.12 890.56)');
```

```output
+--------------------------------------+
| TO_GEOMETRY('POINT(1820.12 890.56)') |
|--------------------------------------|
| SRID=0;POINT(1820.12 890.56)         |
+--------------------------------------+
```

The following example converts an object represented in EWKT to a GEOMETRY object. The input EKWT specifies the SRID to use:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT TO_GEOMETRY('SRID=4326;POINT(1820.12 890.56)');
```

```output
+------------------------------------------------+
| TO_GEOMETRY('SRID=4326;POINT(1820.12 890.56)') |
|------------------------------------------------|
| SRID=4326;POINT(1820.12 890.56)                |
+------------------------------------------------+
```

The following example demonstrates how to specify the SRID as the `srid` input argument:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT TO_GEOMETRY('POINT(1820.12 890.56)', 4326);
```

```output
+--------------------------------------------+
| TO_GEOMETRY('POINT(1820.12 890.56)', 4326) |
|--------------------------------------------|
| SRID=4326;POINT(1820.12 890.56)            |
+--------------------------------------------+
```

The following example returns the GEOMETRY object for a geospatial object with a Z coordinate described in EWKT format:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT TO_GEOMETRY('SRID=32633;POINTZ(389866.35 5819003.03 30)');
```

```output
+-----------------------------------------------------------+
| TO_GEOMETRY('SRID=32633;POINTZ(389866.35 5819003.03 30)') |
|-----------------------------------------------------------|
| SRID=32633;POINTZ(389866.35 5819003.03 30)                |
+-----------------------------------------------------------+
```

For examples that convert a GEOGRAPHY object to a GEOMETRY object, see [Converting between GEOGRAPHY and GEOMETRY](../data-types-geospatial.md).

The next examples use the TO_GEOMETRY function in queries on data in a table.

Create a temporary table and insert rows with GEOMETRY values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_to_geometry AS
SELECT
  1                                                     AS id,
  'POINT(10 20)'                                        AS wkt_col,         -- VARCHAR (WKT)
  'SRID=32633;POINT(500000.0 4649776.22)'               AS ewkt_col,        -- VARCHAR (EWKT)
  ST_ASWKB(TO_GEOMETRY('LINESTRING(0 0, 1 1)'))         AS wkb_bin_col,     -- BINARY (WKB)
  PARSE_JSON('{"type":"Point","coordinates":[10,20]}')  AS geojson_col,     -- VARIANT (GeoJSON)
  TO_GEOGRAPHY('POINT(-122.35 37.55)')                  AS geog_col,        -- GEOGRAPHY
  'POLYGON((0 0,2 2,2 0,0 2,0 0))'                      AS invalid_wkt_col, -- invalid shape
  0                                                     AS srid0,           -- SRID columns to show positional args
  3857                                                  AS srid_col,
  TRUE                                                  AS allow_true,      -- allow_invalid flags from columns
  FALSE                                                 AS allow_false
UNION ALL
SELECT
  2,
  'LINESTRING(0 0, 10 10)',
  'SRID=32633;POINT(389866.35 5819003.03)',
  ST_ASWKB(TO_GEOMETRY('POINT(2 3)')),
  PARSE_JSON('{"type":"LineString","coordinates":[[0,0],[1,1]]}'),
  TO_GEOGRAPHY('LINESTRING(-124.2 42,-120.01 41.99)'),
  'POLYGON((0 0,1 1,1 0,0 1,0 0))',
  0,
  3857,
  TRUE,
  FALSE;
```

This table has columns with data types that the TO_GEOMETRY function accepts as inputs in the following formats:

* VARCHAR (WKT/WKB and hex/EWKT/EWKB/GeoJSON)
* BINARY (WKB/EWKB)
* VARIANT (GeoJSON object)
* GEOGRAPHY

Optional `srid` and `allow_invalid` values can follow any of these formats. The [ST_ASWKB , ST_ASBINARY](st_aswkb.md) function
generates valid WKB BINARY values.

The following example converts VARCHAR values in the `wkt_col` column to GEOMETRY values by using the default
SRID of `0`:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(wkt_col) AS g
  FROM demo_to_geometry;
```

```output
+----+------------------------------+
| ID | G                            |
|----+------------------------------|
|  1 | SRID=0;POINT(10 20)          |
|  2 | SRID=0;LINESTRING(0 0,10 10) |
+----+------------------------------+
```

The following example converts VARCHAR values in the `wkt_col` column to GEOMETRY values by using the
SRID value in the `srid_col` column:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(wkt_col, srid_col) AS g
  FROM demo_to_geometry;
```

```output
+----+----------------------------------+
| ID | G                                |
|----+----------------------------------|
|  1 | SRID=3857;POINT(10 20)           |
|  2 | SRID=3857;LINESTRING(0 0,10 10)  |
+----+----------------------------------+
```

The following example converts VARCHAR values in the `ewkt_col` column to GEOMETRY values, with the SRID value
embedded in the `ewkt_col` column value:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(ewkt_col) AS g
  FROM demo_to_geometry;
```

```output
+----+--------------------------------------------+
| ID | G                                          |
|----+--------------------------------------------|
|  1 | SRID=32633;POINT(500000 4649776.22)        |
|  2 | SRID=32633;POINT(389866.35 5819003.03)     |
+----+--------------------------------------------+
```

The following example converts BINARY values in the `wkb_bin_col` column to GEOMETRY values:

```sqlexample
ALTER SESSION SET BINARY_OUTPUT_FORMAT='HEX';

SELECT id, TO_GEOMETRY(wkb_bin_col) AS g
  FROM demo_to_geometry;
```

```output
+----+----------------------------+
| ID | G                          |
|----+----------------------------|
|  1 | SRID=0;LINESTRING(0 0,1 1) |
|  2 | SRID=0;POINT(2 3)          |
+----+----------------------------+
```

The following example converts VARIANT values in the `geojson_col` column to GEOMETRY values
by using the SRID value in the `srid_col` column:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(geojson_col, srid_col) AS g
  FROM demo_to_geometry;
```

```output
+----+--------------------------------+
| ID | G                              |
|----+--------------------------------|
|  1 | SRID=3857;POINT(10 20)         |
|  2 | SRID=3857;LINESTRING(0 0,1 1)  |
+----+--------------------------------+
```

The following example converts GEOGRAPHY values in the `geog_col` column to GEOMETRY values
by using the SRID value in the `srid_col` column:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(geog_col, srid_col) AS g
  FROM demo_to_geometry;
```

```output
+----+-----------------------------------------------+
| ID | G                                             |
|----+-----------------------------------------------|
|  1 | SRID=4326;POINT(-122.35 37.55)                |
|  2 | SRID=4326;LINESTRING(-124.2 42,-120.01 41.99) |
+----+-----------------------------------------------+
```

The following example converts VARCHAR values in the `invalid_wkt_col` column to GEOMETRY values by using
the SRID value in the `srid0` column (`0`) and the `allow_invalid` value in the `allow_true`
column:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TO_GEOMETRY(invalid_wkt_col, srid0, allow_true) AS g
  FROM demo_to_geometry;
```

The output includes shapes that aren’t valid:

```output
+----+---------------------------------------+
| ID | G                                     |
|----+---------------------------------------|
|  1 | SRID=0;POLYGON((0 0,2 2,2 0,0 2,0 0)) |
|  2 | SRID=0;POLYGON((0 0,1 1,1 0,0 1,0 0)) |
+----+---------------------------------------+
```

---
title: TO_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/to_json.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# TO_JSON

Converts a [VARIANT](../data-types-semistructured.md) value to a string containing the JSON representation of the value.

## Syntax

```sqlsyntax
TO_JSON( <expr> )
```

## Arguments

`expr`
:   An expression of type VARIANT that holds valid JSON information.

## Returns

Returns a value of type VARCHAR.

If the input is NULL, the function returns NULL.

## Usage notes

* If the input is NULL, the output is also NULL. If the input is a VARIANT that contains [JSON null](../../user-guide/semistructured-considerations.md),
  then the returned value is the string `"null"` (i.e. the word “null” surrounded by double quotes). See the example below.
* A JSON object (also called a “dictionary” or a “hash”) is an
  unordered set of key-value pairs. When TO_JSON produces a
  string, the order of the key-value pairs in that string is not predictable.

* TO_JSON and PARSE_JSON are (almost) converse or reciprocal functions.

  + The PARSE_JSON function takes a string as input and returns a JSON-compatible [VARIANT](../data-types-semistructured.md).
  + The TO_JSON function takes a JSON-compatible VARIANT and returns a string.

  The following is (conceptually) true if X is a string containing valid JSON:

  > `X = TO_JSON(PARSE_JSON(X));`

  For example, the following is (conceptually) true:

  > `'{"pi":3.14,"e":2.71}' = TO_JSON(PARSE_JSON('{"pi":3.14,"e":2.71}'))`

  However, the functions are not perfectly reciprocal because:

  + Empty strings, and strings with only whitespace, are not handled reciprocally. For example, the return value of
    `PARSE_JSON('')` is NULL, but the return value of `TO_JSON(NULL)` is NULL, not the reciprocal `''`.
  + The order of the key-value pairs in the string produced by TO_JSON is not predictable.
  + The string produced by TO_JSON can have less whitespace than the string passed to PARSE_JSON.

  For example, the following are equivalent JSON, but not equivalent strings:

  + `{"pi": 3.14, "e": 2.71}`
  + `{"e":2.71,"pi":3.14}`

## Examples

The following examples use the TO_JSON function.

### Inserting VARIANT values and converting them to strings with a query

Create and fill a table. The INSERT statement uses the PARSE_JSON function to insert
a VARIANT value in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE jdemo1 (v VARIANT);
INSERT INTO jdemo1 SELECT PARSE_JSON('{"food":"bard"}');
```

Query the data and use the TO_JSON function to convert the VARIANT value to a string.

```sqlexample
SELECT v, v:food, TO_JSON(v) FROM jdemo1;
```

```output
+------------------+--------+-----------------+
| V                | V:FOOD | TO_JSON(V)      |
|------------------+--------+-----------------|
| {                | "bard" | {"food":"bard"} |
|   "food": "bard" |        |                 |
| }                |        |                 |
+------------------+--------+-----------------+
```

### Handling NULL values with the PARSE_JSON and TO_JSON functions

The following example shows how PARSE_JSON and TO_JSON handle NULL values:

```sqlexample
SELECT TO_JSON(NULL), TO_JSON('null'::VARIANT),
       PARSE_JSON(NULL), PARSE_JSON('null');
```

```output
+---------------+--------------------------+------------------+--------------------+
| TO_JSON(NULL) | TO_JSON('NULL'::VARIANT) | PARSE_JSON(NULL) | PARSE_JSON('NULL') |
|---------------+--------------------------+------------------+--------------------|
| NULL          | "null"                   | NULL             | null               |
+---------------+--------------------------+------------------+--------------------+
```

### Comparing PARSE_JSON and TO_JSON

The following examples demonstrate the relationship between the PARSE_JSON and TO_JSON functions.

This example creates a table with a VARCHAR column and a VARIANT column. The INSERT statement inserts
a VARCHAR value, and the UPDATE statement generates a JSON value that corresponds with that VARCHAR value.

```sqlexample
CREATE OR REPLACE TABLE jdemo2 (
  varchar1 VARCHAR,
  variant1 VARIANT);

INSERT INTO jdemo2 (varchar1) VALUES ('{"PI":3.14}');

UPDATE jdemo2 SET variant1 = PARSE_JSON(varchar1);
```

This query shows that TO_JSON and PARSE_JSON are conceptually reciprocal functions:

```sqlexample
SELECT varchar1,
       PARSE_JSON(varchar1),
       variant1,
       TO_JSON(variant1),
       PARSE_JSON(varchar1) = variant1,
       TO_JSON(variant1) = varchar1
  FROM jdemo2;
```

```output
+-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------+
| VARCHAR1    | PARSE_JSON(VARCHAR1) | VARIANT1     | TO_JSON(VARIANT1) | PARSE_JSON(VARCHAR1) = VARIANT1 | TO_JSON(VARIANT1) = VARCHAR1 |
|-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------|
| {"PI":3.14} | {                    | {            | {"PI":3.14}       | True                            | True                         |
|             |   "PI": 3.14         |   "PI": 3.14 |                   |                                 |                              |
|             | }                    | }            |                   |                                 |                              |
+-------------+----------------------+--------------+-------------------+---------------------------------+------------------------------+
```

However, the functions are not exactly reciprocal. Differences in whitespace or in the order of key-value
pairs can prevent the output from matching the input. For example:

```sqlexample
SELECT TO_JSON(PARSE_JSON('{"b":1,"a":2}')),
       TO_JSON(PARSE_JSON('{"b":1,"a":2}')) = '{"b":1,"a":2}',
       TO_JSON(PARSE_JSON('{"b":1,"a":2}')) = '{"a":2,"b":1}';
```

```output
+--------------------------------------+--------------------------------------------------------+--------------------------------------------------------+
| TO_JSON(PARSE_JSON('{"B":1,"A":2}')) | TO_JSON(PARSE_JSON('{"B":1,"A":2}')) = '{"B":1,"A":2}' | TO_JSON(PARSE_JSON('{"B":1,"A":2}')) = '{"A":2,"B":1}' |
|--------------------------------------+--------------------------------------------------------+--------------------------------------------------------|
| {"a":2,"b":1}                        | False                                                  | True                                                   |
+--------------------------------------+--------------------------------------------------------+--------------------------------------------------------+
```

---
title: TO_OBJECT
source: https://docs.snowflake.com/en/sql-reference/functions/to_object.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Array/Object)

# TO_OBJECT

Converts the input value to an [OBJECT](../data-types-semistructured.md):

* For a [VARIANT](../data-types-semistructured.md) value containing an OBJECT, returns the OBJECT.
* For NULL input, or for a VARIANT value containing only [JSON null](../../user-guide/semistructured-considerations.md), returns NULL.
* For an OBJECT, returns the OBJECT itself.
* For all other input values, reports an error.

## Syntax

```sqlsyntax
TO_OBJECT( <expr> )
```

## Arguments

`expr`
:   An expression that evaluates to a VARIANT that contains an OBJECT.

## Returns

The data type of the returned value is OBJECT.

## Examples

This demonstrates simple usage of the TO_OBJECT function:

> Create a table and insert a value of type VARIANT. (The function [PARSE_JSON](parse_json.md) returns a VARIANT.)
>
> > ```sqlexample
> > CREATE TABLE t1 (vo VARIANT);
> > INSERT INTO t1 (vo)
> >     SELECT PARSE_JSON('{"a":1}');
> > ```
>
> Call the TO_OBJECT function:
>
> > ```sqlexample
> > SELECT TO_OBJECT(vo) from t1;
> > +---------------+
> > | TO_OBJECT(VO) |
> > |---------------|
> > | {             |
> > |   "a": 1      |
> > | }             |
> > +---------------+
> > ```

---
title: TO_QUERY
source: https://docs.snowflake.com/en/sql-reference/functions/to_query.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# TO_QUERY

Returns a result set based on SQL text and an optional set of arguments that are passed to the SQL text if
it is parameterized. The function compiles the SQL text as the definition of a subquery in the FROM clause.
When writing an application or a stored procedure, you can call this function to construct a SQL statement.

> **Note:**
>
> This function can include user input in query statements, which has potential security risks, such as
> SQL injection. If inputs to the function come from external sources, make sure they are validated.
> For more information, see [SQL injection](../../developer-guide/stored-procedure/stored-procedures-usage.md).

See also:
:   [Constructing SQL at runtime](../../user-guide/querying-construct-at-runtime.md)

## Syntax

```sqlsyntax
TO_QUERY( SQL => '<string>' [ , <arg> => '<value>' [, <arg> => '<value>' ...] ] )
```

## Arguments

**Required**

`SQL => 'string'`
:   String representation of the subquery.

**Optional**

`arg => 'value'`
:   [Bind variables](../bind-variables.md) passed to the SQL `string`.

## Returns

Returns the result set produced by the execution of the specified SQL text or NULL. If any argument is NULL,
the function returns NULL without reporting any error.

## Usage notes

* All arguments must be one of the following:

  + Constant strings.
  + [SQL variables](../session-variables.md) or
    [Snowflake Scripting variables](../../developer-guide/snowflake-scripting/variables.md) that resolve to strings.
* If you need to convert a string passed in an argument to a different data type, you can use a
  [conversion function](../functions-conversion.md) in the SQL `string` to
  convert the string to another data type.
* The set of columns defining the result set is derived from the SELECT list of the compiled SQL statement.
* The function is valid only in the FROM clause of a SQL statement.

## Examples

Create a table and insert data into it.

```sqlexample
CREATE OR REPLACE TABLE to_query_example (
  deptno NUMBER(2),
  dname  VARCHAR(14),
  loc    VARCHAR(13))
AS SELECT
  column1,
  column2,
  column3
FROM
  VALUES
    (10, 'ACCOUNTING', 'NEW YORK'),
    (20, 'RESEARCH',   'DALLAS'  ),
    (30, 'SALES',      'CHICAGO' ),
    (40, 'OPERATIONS', 'BOSTON'  );
```

The examples use the data in this table.

### Using TO_QUERY in SQL statements

First, set a session variable (SQL variable) for the table name:

```sqlexample
SET table_name = 'to_query_example';
```

The examples use the session variable and [IDENTIFIER()](../identifier-literal.md) to
identify the table.

Using IDENTIFIER() to identify database objects is a best practice because it can make
code more reusable and help to prevent [SQL injection](../../developer-guide/stored-procedure/stored-procedures-usage.md) risks.

The following example uses the TO_QUERY function to return all of the data in the `to_query_example` table:

```sqlexample
SELECT * FROM TABLE(TO_QUERY('SELECT * FROM IDENTIFIER($table_name)'));
```

```output
+--------+------------+----------+
| DEPTNO | DNAME      | LOC      |
|--------+------------+----------|
|     10 | ACCOUNTING | NEW YORK |
|     20 | RESEARCH   | DALLAS   |
|     30 | SALES      | CHICAGO  |
|     40 | OPERATIONS | BOSTON   |
+--------+------------+----------+
```

The following example uses the TO_QUERY function to return all of the values in the `deptno` column of
the `to_query_example` table:

```sqlexample
SELECT deptno FROM TABLE(TO_QUERY('SELECT * FROM IDENTIFIER($table_name)'));
```

```output
+--------+
| DEPTNO |
|--------|
|     10 |
|     20 |
|     30 |
|     40 |
+--------+
```

The following example uses the TO_QUERY function to pass an argument to a SQL statement so that it returns the row
where `deptno` equals `10` in the `to_query_example` table:

```sqlexample
SELECT * FROM TABLE(
  TO_QUERY(
    'SELECT * FROM IDENTIFIER($table_name)
    WHERE deptno = TO_NUMBER(:dno)', dno => '10'
    )
  );
```

```output
+--------+------------+----------+
| DEPTNO | DNAME      | LOC      |
|--------+------------+----------|
|     10 | ACCOUNTING | NEW YORK |
+--------+------------+----------+
```

The following example is the same as the previous example, but it uses a session variable to pass
the `deptno` value to the TO_QUERY function:

```sqlexample
SET dept = '10';

SELECT * FROM TABLE(
  TO_QUERY(
    'SELECT * FROM IDENTIFIER($table_name)
    WHERE deptno = TO_NUMBER(:dno)', dno => $dept
    )
  );
```

```output
+--------+------------+----------+
| DEPTNO | DNAME      | LOC      |
|--------+------------+----------|
|     10 | ACCOUNTING | NEW YORK |
+--------+------------+----------+
```

The following example uses the TO_QUERY function to pass two arguments to a SQL statement so that it returns the rows
where `deptno` equals `10` or `dname` equals `SALES` in the `to_query_example` table:

```sqlexample
SELECT * FROM TABLE(
  TO_QUERY(
    'SELECT * FROM IDENTIFIER($table_name)
    WHERE deptno = TO_NUMBER(:dno) OR dname = :dnm',
    dno => '10', dnm => 'SALES'
    )
  );
```

```output
+--------+------------+----------+
| DEPTNO | DNAME      | LOC      |
|--------+------------+----------|
|     10 | ACCOUNTING | NEW YORK |
|     30 | SALES      | CHICAGO  |
+--------+------------+----------+
```

### Using TO_QUERY in stored procedures

The following example uses the TO_QUERY function in a stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results_tq(query VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
DECLARE
  res RESULTSET DEFAULT (SELECT COUNT(*) FROM TABLE(TO_QUERY(:query)));
BEGIN
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_num_results_tq(query VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET DEFAULT (SELECT COUNT(*) FROM TABLE(TO_QUERY(:query)));
BEGIN
  RETURN TABLE(res);
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL get_num_results_tq('SELECT * FROM to_query_example');
```

```output
+----------+
| COUNT(*) |
|----------|
|        4 |
+----------+
```

```sqlexample
CALL get_num_results_tq('SELECT * FROM to_query_example WHERE deptno = 20');
```

```output
+----------+
| COUNT(*) |
|----------|
|        1 |
+----------+
```

The following example uses the TO_QUERY function in a stored procedure with a bind variable:

```sqlexample
CREATE OR REPLACE PROCEDURE get_results_tqbnd(dno VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
DECLARE
  res RESULTSET DEFAULT (SELECT * FROM TABLE(
    TO_QUERY(
      'SELECT * FROM to_query_example
      WHERE deptno = TO_NUMBER(:dnoval)', dnoval => :dno
    )
  ));
BEGIN
  RETURN TABLE(res);
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE get_results_tqbnd(dno VARCHAR)
RETURNS TABLE ()
LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET DEFAULT (SELECT * FROM TABLE(
    TO_QUERY(
      'SELECT * FROM to_query_example
      WHERE deptno = TO_NUMBER(:dnoval)', dnoval => :dno
    )
  ));
BEGIN
  RETURN TABLE(res);
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL get_results_tqbnd('40');
```

```output
+--------+------------+--------+
| DEPTNO | DNAME      | LOC    |
|--------+------------+--------|
|     40 | OPERATIONS | BOSTON |
+--------+------------+--------+
```

Call the stored procedure using a session variable:

```sqlexample
SET dept = '20';

CALL get_results_tqbnd($dept);
```

```output
+--------+----------+--------+
| DEPTNO | DNAME    | LOC    |
|--------+----------+--------|
|     20 | RESEARCH | DALLAS |
+--------+----------+--------+
```

---
title: TO_TIME , TIME
source: https://docs.snowflake.com/en/sql-reference/functions/to_time.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Date & time functions](../functions-date-time.md)

# TO_TIME , TIME

Converts an input expression into a time.

See also:
:   [TRY_TO_TIME](try_to_time.md)

## Syntax

```sqlsyntax
TO_TIME( <string_expr> [, <format> ] )
TO_TIME( <timestamp_expr> )
TO_TIME( '<integer>' )
TO_TIME( <variant_expr> )

TIME( <string_expr> )
TIME( <timestamp_expr> )
TIME( '<integer>' )
TIME( <variant_expr> )
```

## Arguments

**Required:**

`string_expr` or `timestamp_expr` or `'integer'` or `variant_expr`
:   Expression to be converted into a time:

    * For `string_expr`, the string to convert to a time.
    * For `timestamp_expr`, the timestamp to convert to a time. The function returns the time portion of the input value.
    * For `'integer'`, a string containing an integer to convert to a time. The integer is treated as a number of seconds, milliseconds,
      microseconds, or nanoseconds after the start of the Unix epoch. See the Usage Notes.

      For this timestamp, the function gets the number of seconds after the start of the Unix epoch. The function performs a
      [modulo operation](https://en.wikipedia.org/wiki/Modulo_operation) to get the remainder from dividing this number by the
      number of seconds in a day (`86400`):

      > `number_of_seconds % 86400`

      The function interprets this remainder as the number of seconds after midnight.

      For example, suppose that the value is `'31536002789'`.

      1. Based on the magnitude of this value, the function uses milliseconds as the unit of time and determines that the value
         represents `1971-01-01 00:00:02.789`.
      2. The function gets the number of seconds after the Unix epoch for this value (`31536002`).
      3. The function gets the remainder from dividing that number by the number of seconds in a day (`31536002 % 86400`).
      4. The function uses the remainder (`2`) as the number of seconds after midnight. The resulting time is `00:00:02`.
    * For `variant_expr`:

      > + If the VARIANT contains a string in TIME format (such as `HH:MI:SS`), a string conversion is performed.
      > + If the VARIANT contains a string in INTEGER format, a string conversion is performed and the value is
      >   treated as the number of seconds since midnight (modulus 86400 if necessary).
      > + If the VARIANT contains a JSON null value, the output is NULL.

    For all other values, a conversion error is generated.

**Optional:**

`format`
:   Time format specifier for `string_expr` or
    [AUTO](../date-time-input-output.md),
    which specifies that Snowflake automatically detects the format to use. For more information,
    see [Date and time formats in conversion functions](../functions-conversion.md).

    Default: The current value of the [TIME_INPUT_FORMAT](../parameters.md)
    session parameter (default AUTO)

## Returns

The data type of the returned value is TIME. If the input is NULL, returns NULL.

## Usage notes

* The display format for times in the output is determined by the [TIME_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `HH24:MI:SS`).
* If the format of the input parameter is a string that contains an integer, the unit of measurement for the value (seconds,
  microseconds, milliseconds, or nanoseconds) is determined as follows:

> * After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
>   microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).
>
>   + If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
>     a number of seconds.
>   + If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
>     as milliseconds.
>   + If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
>     treated as microseconds.
>   + If the value is greater than or equal to 31536000000000000, then the value is
>     treated as nanoseconds.
> * If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
>   one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
>   nanoseconds.

* Unlike the TO_TIME function, the TIME function does not support the optional `format` parameter.

## Examples

These examples use the TO_TIME and TIME functions.

```sqlexample
SELECT TO_TIME('13:30:00'), TIME('13:30:00');
```

```output
+---------------------+------------------+
| TO_TIME('13:30:00') | TIME('13:30:00') |
|---------------------+------------------|
| 13:30:00            | 13:30:00         |
+---------------------+------------------+
```

```sqlexample
SELECT TO_TIME('13:30:00.000'), TIME('13:30:00.000');
```

```output
+-------------------------+----------------------+
| TO_TIME('13:30:00.000') | TIME('13:30:00.000') |
|-------------------------+----------------------|
| 13:30:00                | 13:30:00             |
+-------------------------+----------------------+
```

This example shows how to use the TO_TIME function to process field separators
other than the default colons. The example uses the period character as
the separator between hours and minutes, and between minutes and seconds:

```sqlexample
SELECT TO_TIME('11.15.00', 'HH24.MI.SS');
```

```output
+-----------------------------------+
| TO_TIME('11.15.00', 'HH24.MI.SS') |
|-----------------------------------|
| 11:15:00                          |
+-----------------------------------+
```

This example demonstrates how the TO_TIME function interprets a string containing an integer:

```sqlexample
CREATE OR REPLACE TABLE demo1_time (
  description VARCHAR,
  value VARCHAR -- string rather than bigint
);

INSERT INTO demo1_time (description, value) VALUES
  ('Seconds',      '31536001'),
  ('Milliseconds', '31536002400'),
  ('Microseconds', '31536003600000'),
  ('Nanoseconds',  '31536004900000000');
```

```sqlexample
SELECT description,
       value,
       TO_TIMESTAMP(value),
       TO_TIME(value)
  FROM demo1_time
  ORDER BY value;
```

```output
+--------------+-------------------+-------------------------+----------------+
| DESCRIPTION  | VALUE             | TO_TIMESTAMP(VALUE)     | TO_TIME(VALUE) |
|--------------+-------------------+-------------------------+----------------|
| Seconds      | 31536001          | 1971-01-01 00:00:01.000 | 00:00:01       |
| Milliseconds | 31536002400       | 1971-01-01 00:00:02.400 | 00:00:02       |
| Microseconds | 31536003600000    | 1971-01-01 00:00:03.600 | 00:00:03       |
| Nanoseconds  | 31536004900000000 | 1971-01-01 00:00:04.900 | 00:00:04       |
+--------------+-------------------+-------------------------+----------------+
```

---
title: TO_TIMESTAMP / TO_TIMESTAMP_*
source: https://docs.snowflake.com/en/sql-reference/functions/to_timestamp.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Date & time functions](../functions-date-time.md)

# TO_TIMESTAMP / TO_TIMESTAMP_\*

Converts an input expression into the corresponding timestamp:

* TO_TIMESTAMP_LTZ (timestamp with local time zone)
* TO_TIMESTAMP_NTZ (timestamp with no time zone)
* TO_TIMESTAMP_TZ (timestamp with time zone)

> **Note:**
>
> TO_TIMESTAMP maps to one of the other timestamp functions, based on the
> [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter. The parameter default is
> TIMESTAMP_NTZ, so TO_TIMESTAMP maps to TO_TIMESTAMP_NTZ by default.

See also:
:   [TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*](try_to_timestamp.md) ,

    [AS_TIMESTAMP_\*](as_timestamp.md) , [IS_TIMESTAMP_\*](is_timestamp.md) ,

    [TO_DATE , DATE](to_date.md) , [TO_TIME , TIME](to_time.md)

## Syntax

```sqlsyntax
timestampFunction ( <numeric_expr> [ , <scale> ] )

timestampFunction ( <date_expr> )

timestampFunction ( <timestamp_expr> )

timestampFunction ( <string_expr> [ , <format> ] )

timestampFunction ( '<integer>' )

timestampFunction ( <variant_expr> )
```

Where:

> ```sqlsyntax
> timestampFunction ::=
>     TO_TIMESTAMP | TO_TIMESTAMP_LTZ | TO_TIMESTAMP_NTZ | TO_TIMESTAMP_TZ
> ```

## Arguments

**Required:**

One of:

> `numeric_expr`
> :   A number of seconds (if scale = 0 or is absent) or fractions of a second (e.g. milliseconds or nanoseconds)
>     since the start of the Unix epoch (1970-01-01 00:00:00 UTC). If a non-integer decimal expression is input, the
>     scale of the result is inherited.
>
> `date_expr`
> :   A date to be converted into a timestamp.
>
> `timestamp_expr`
> :   A timestamp to be converted into another timestamp (e.g. convert TIMESTAMP_LTZ to TIMESTAMP_NTZ).
>
> `string_expr`
> :   A string from which to extract a timestamp, for example `'2019-01-31 01:02:03.004'`.
>
> `'integer'`
> :   An expression that evaluates to a string containing an integer, for example `'15000000'`. Depending
>     on the magnitude of the string, it can be interpreted as seconds, milliseconds, microseconds, or
>     nanoseconds. For details, see the Usage Notes.
>
> `variant_expr`
> :   An expression of type VARIANT. The VARIANT must contain one of the following:
>
>     * A string from which to extract a timestamp.
>     * A timestamp.
>     * An integer that represents the number of seconds, milliseconds, microseconds, or nanoseconds.
>     * A string containing an integer that represents the number of seconds, milliseconds, microseconds, or nanoseconds.
>
>     Although TO_TIMESTAMP accepts a DATE value, it does not accept a DATE inside a VARIANT.

**Optional:**

`format`
:   Format specifier (only for `string_expr`). For more information, see [Date and time formats in conversion functions](../functions-conversion.md).

    The default value is the current value of the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter (default
    [AUTO](../date-time-input-output.md)).

`scale`
:   Scale specifier (only for `numeric_expr`). If specified, defines the scale of the numbers provided. For example:

    * For seconds, scale = `0`.
    * For milliseconds, scale = `3`.
    * For microseconds, scale = `6`.
    * For nanoseconds, scale = `9`.

    Default: `0`

## Returns

The data type of the returned value is one of the TIMESTAMP data
types. By default, the data type is TIMESTAMP_NTZ. You can change
this by setting the session parameter [TIMESTAMP_TYPE_MAPPING](../parameters.md).

If the input is NULL, then the result is NULL.

## Usage notes

* This family of functions returns timestamp values, specifically:

  + For `string_expr`: A timestamp represented by a given string. If the string does not have a time component, midnight is used.
  + For `date_expr`: A timestamp representing midnight of a given day is used, according to the specific timestamp mapping (NTZ/LTZ/TZ) semantics.
  + For `timestamp_expr`: A timestamp with possibly different mapping than the source timestamp.
  + For `numeric_expr`: A timestamp representing the number of seconds (or fractions of a second) provided by the user. UTC time is always used to build the result.
  + For `variant_expr`:

    - If the VARIANT contains a JSON null value, the result is NULL.
    - If the VARIANT contains a timestamp value of the same kind as the result, this value is preserved as is.
    - If the VARIANT contains a timestamp value of a different kind, the conversion is done in the same way as from `timestamp_expr`.
    - If the VARIANT contains a string, conversion from a string value is performed (using automatic format).
    - If the VARIANT contains a number, conversion from `numeric_expr` is performed.

      > **Note:**
      >
      > When an INTEGER value is cast directly to TIMESTAMP_NTZ, the integer is treated as the number of seconds
      > since the beginning of the Linux epoch, and the local time zone is not taken into account. However, if the
      > INTEGER value is stored inside a VARIANT value, for example as shown below, then the conversion is indirect,
      > and is affected by the local time zone, even though the final result is TIMESTAMP_NTZ:
      >
      > ```sqlexample
      > SELECT TO_TIMESTAMP(31000000);
      > SELECT TO_TIMESTAMP(PARSE_JSON(31000000));
      > SELECT PARSE_JSON(31000000)::TIMESTAMP_NTZ;
      > ```
      >
      > The timestamp returned by the first query is different from the time returned by the second and
      > third queries.
      >
      > To convert independently of the local time zone, add an explicit cast to integer in the expression, as shown
      > below:
      >
      > ```sqlexample
      > SELECT TO_TIMESTAMP(31000000);
      > SELECT TO_TIMESTAMP(PARSE_JSON(31000000)::INT);
      > SELECT PARSE_JSON(31000000)::INT::TIMESTAMP_NTZ;
      > ```
      >
      > The timestamp returned by all three queries is the same. This applies whether casting to TIMESTAMP_NTZ or calling the
      > function TO_TIMESTAMP_NTZ. It also applies when calling TO_TIMESTAMP when the TIMESTAMP_TYPE_MAPPING parameter
      > is set to TIMESTAMP_NTZ.
      >
      > For an example with output, see the examples at the end of this topic.
  + If conversion is not possible, an error is returned.
* For timestamps with time zones, the setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned
  timestamp is in the time zone for the session.
* The display format for timestamps in the output is determined by the timestamp output format that corresponds with the
  function ([TIMESTAMP_OUTPUT_FORMAT](../parameters.md), [TIMESTAMP_LTZ_OUTPUT_FORMAT](../parameters.md), [TIMESTAMP_NTZ_OUTPUT_FORMAT](../parameters.md),
  or [TIMESTAMP_TZ_OUTPUT_FORMAT](../parameters.md)).
* If the format of the input parameter is a string that contains an integer:

  + After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
    microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

    - If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
      a number of seconds.
    - If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
      as milliseconds.
    - If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
      treated as microseconds.
    - If the value is greater than or equal to 31536000000000000, then the value is
      treated as nanoseconds.
  + If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
    one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
    nanoseconds.

* When you use the TO_TIMESTAMP_NTZ or TRY_TO_TIMESTAMP_NTZ function to convert a timestamp with time zone information, the time zone
  information is lost. If the timestamp is then converted back to a timestamp with time zone information (by using
  the TO_TIMESTAMP_TZ function for example), the time zone information is not recoverable.

## Examples

This example shows that TO_TIMESTAMP_TZ creates a timestamp that contains a time
zone from the session, but the value from TO_TIMESTAMP_NTZ does not have a
time zone:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';
```

```sqlexample
SELECT TO_TIMESTAMP_TZ('2024-04-05 01:02:03');
```

```output
+----------------------------------------+
| TO_TIMESTAMP_TZ('2024-04-05 01:02:03') |
|----------------------------------------|
| 2024-04-05 01:02:03.000 -0700          |
+----------------------------------------+
```

```sqlexample
SELECT TO_TIMESTAMP_NTZ('2024-04-05 01:02:03');
```

```output
+-----------------------------------------+
| TO_TIMESTAMP_NTZ('2024-04-05 01:02:03') |
|-----------------------------------------|
| 2024-04-05 01:02:03.000                 |
+-----------------------------------------+
```

The following examples show how different formats can influence the parsing of an ambiguous date.
Assume that the [TIMESTAMP_TZ_OUTPUT_FORMAT](../parameters.md) is not set, so the
[TIMESTAMP_OUTPUT_FORMAT](../parameters.md) is used and is set to the default
(`YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM`).

This example shows the results when the input format is `mm/dd/yyyy hh24:mi:ss` (month/day/year):

```sqlexample
SELECT TO_TIMESTAMP_TZ('04/05/2024 01:02:03', 'mm/dd/yyyy hh24:mi:ss');
```

```output
+-----------------------------------------------------------------+
| TO_TIMESTAMP_TZ('04/05/2024 01:02:03', 'MM/DD/YYYY HH24:MI:SS') |
|-----------------------------------------------------------------|
| 2024-04-05 01:02:03.000 -0700                                   |
+-----------------------------------------------------------------+
```

This example shows the results when the input format is `dd/mm/yyyy hh24:mi:ss` (day/month/year):

```sqlexample
SELECT TO_TIMESTAMP_TZ('04/05/2024 01:02:03', 'dd/mm/yyyy hh24:mi:ss');
```

```output
+-----------------------------------------------------------------+
| TO_TIMESTAMP_TZ('04/05/2024 01:02:03', 'DD/MM/YYYY HH24:MI:SS') |
|-----------------------------------------------------------------|
| 2024-05-04 01:02:03.000 -0700                                   |
+-----------------------------------------------------------------+
```

This example shows how to use a numeric input that represents approximately 40
years from midnight January 1, 1970 (the start of the Unix epoch). The scale
is not specified, so the default scale of `0` (seconds) is used.

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF9 TZH:TZM';
```

```sqlexample
SELECT TO_TIMESTAMP_NTZ(40 * 365.25 * 86400);
```

```output
+---------------------------------------+
| TO_TIMESTAMP_NTZ(40 * 365.25 * 86400) |
|---------------------------------------|
| 2010-01-01 00:00:00.000               |
+---------------------------------------+
```

This example is similar to the preceding example, but provides the value as milliseconds
by specifying a scale value of `3`:

```sqlexample
SELECT TO_TIMESTAMP_NTZ(40 * 365.25 * 86400 * 1000 + 456, 3);
```

```output
+-------------------------------------------------------+
| TO_TIMESTAMP_NTZ(40 * 365.25 * 86400 * 1000 + 456, 3) |
|-------------------------------------------------------|
| 2010-01-01 00:00:00.456                               |
+-------------------------------------------------------+
```

This example shows how the results change when different scale values are specified for the same
numeric value:

```sqlexample
SELECT TO_TIMESTAMP(1000000000, 0) AS "Scale in seconds",
       TO_TIMESTAMP(1000000000, 3) AS "Scale in milliseconds",
       TO_TIMESTAMP(1000000000, 6) AS "Scale in microseconds",
       TO_TIMESTAMP(1000000000, 9) AS "Scale in nanoseconds";
```

```output
+-------------------------+-------------------------+-------------------------+-------------------------+
| Scale in seconds        | Scale in milliseconds   | Scale in microseconds   | Scale in nanoseconds    |
|-------------------------+-------------------------+-------------------------+-------------------------|
| 2001-09-09 01:46:40.000 | 1970-01-12 13:46:40.000 | 1970-01-01 00:16:40.000 | 1970-01-01 00:00:01.000 |
+-------------------------+-------------------------+-------------------------+-------------------------+
```

This example shows how the function determines the units to use (seconds, milliseconds, microseconds, or nanoseconds)
when the input is a string that contains an integer, based on the magnitude of the value.

Create and load the table with strings containing integers within different ranges:

```sqlexample
CREATE OR REPLACE TABLE demo1 (
  description VARCHAR,
  value VARCHAR -- string rather than bigint
);

INSERT INTO demo1 (description, value) VALUES
  ('Seconds',      '31536000'),
  ('Milliseconds', '31536000000'),
  ('Microseconds', '31536000000000'),
  ('Nanoseconds',  '31536000000000000');
```

Pass the strings to the function:

```sqlexample
SELECT description,
       value,
       TO_TIMESTAMP(value),
       TO_DATE(value)
  FROM demo1
  ORDER BY value;
```

```output
+--------------+-------------------+-------------------------+----------------+
| DESCRIPTION  | VALUE             | TO_TIMESTAMP(VALUE)     | TO_DATE(VALUE) |
|--------------+-------------------+-------------------------+----------------|
| Seconds      | 31536000          | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Milliseconds | 31536000000       | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Microseconds | 31536000000000    | 1971-01-01 00:00:00.000 | 1971-01-01     |
| Nanoseconds  | 31536000000000000 | 1971-01-01 00:00:00.000 | 1971-01-01     |
+--------------+-------------------+-------------------------+----------------+
```

The following example casts values to TIMESTAMP_NTZ. The example shows the difference in
behavior between using an integer and using a variant that contains an integer:

```sqlexample
SELECT 0::TIMESTAMP_NTZ, PARSE_JSON(0)::TIMESTAMP_NTZ, PARSE_JSON(0)::INT::TIMESTAMP_NTZ;
```

```output
+-------------------------+------------------------------+-----------------------------------+
| 0::TIMESTAMP_NTZ        | PARSE_JSON(0)::TIMESTAMP_NTZ | PARSE_JSON(0)::INT::TIMESTAMP_NTZ |
|-------------------------+------------------------------+-----------------------------------|
| 1970-01-01 00:00:00.000 | 1969-12-31 16:00:00.000      | 1970-01-01 00:00:00.000           |
+-------------------------+------------------------------+-----------------------------------+
```

The returned timestamps match for an integer and for a variant cast to an integer in the
first and third columns, but the returned timestamp is different for the variant that is not
cast to an integer in the second column. For more information, see
Usage notes.

This same behavior applies when calling the TO_TIMESTAMP_NTZ function:

```sqlexample
SELECT TO_TIMESTAMP_NTZ(0), TO_TIMESTAMP_NTZ(PARSE_JSON(0)), TO_TIMESTAMP_NTZ(PARSE_JSON(0)::INT);
```

```output
+-------------------------+---------------------------------+--------------------------------------+
| TO_TIMESTAMP_NTZ(0)     | TO_TIMESTAMP_NTZ(PARSE_JSON(0)) | TO_TIMESTAMP_NTZ(PARSE_JSON(0)::INT) |
|-------------------------+---------------------------------+--------------------------------------|
| 1970-01-01 00:00:00.000 | 1969-12-31 16:00:00.000         | 1970-01-01 00:00:00.000              |
+-------------------------+---------------------------------+--------------------------------------+
```

---
title: TO_UUID
source: https://docs.snowflake.com/en/sql-reference/functions/to_uuid.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_UUID

Converts the input expression to a [UUID](../data-types-uuid.md) value.

See also:

* [TRY_TO_UUID](try_to_uuid.md)

## Syntax

```sqlsyntax
TO_UUID( <string_expr> )
```

## Arguments

`string_expr`
:   A string expression in UUID format.

## Returns

The return type is [UUID](../data-types-uuid.md).

For NULL input, the output is NULL.

## Examples

The following example converts a string to a UUID value:

```sqlexample
SELECT TO_UUID('c73d9175-0a1d-48c6-8d30-df165461328b');
```

```output
+-------------------------------------------------+
| TO_UUID('C73D9175-0A1D-48C6-8D30-DF165461328B') |
|-------------------------------------------------|
| c73d9175-0a1d-48c6-8d30-df165461328b            |
+-------------------------------------------------+
```

---
title: TO_VARIANT
source: https://docs.snowflake.com/en/sql-reference/functions/to_variant.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TO_VARIANT

Converts any value to a [VARIANT](../data-types-semistructured.md) value or NULL (if input is NULL).

## Syntax

```sqlsyntax
TO_VARIANT( <expr> )
```

## Arguments

`expr`
:   An expression of any data type.

## Usage notes

* The `TO_VARIANT` function cannot be used directly in an INSERT statement.
  Instead, use `INSERT INTO ... SELECT...`. The Examples section
  shows how to do this.

## Examples

Use TO_VARIANT and [PARSE_JSON](parse_json.md) to insert VARIANT values into a table. The PARSE_JSON function
returns a VARIANT value.

```sqlexample
CREATE OR REPLACE TABLE to_variant_example (
  v_varchar   VARIANT,
  v_number    VARIANT,
  v_timestamp VARIANT,
  v_array     VARIANT,
  v_object    VARIANT);

INSERT INTO to_variant_example (v_varchar, v_number, v_timestamp, v_array, v_object)
  SELECT
    TO_VARIANT('Skiing is fun!'),
    TO_VARIANT(3.14),
    TO_VARIANT('2024-01-25 01:02:03'),
    TO_VARIANT(ARRAY_CONSTRUCT('San Mateo', 'Seattle', 'Berlin')),
    PARSE_JSON(' { "key1": "value1", "key2": "value2" } ');

SELECT * FROM to_variant_example;
```

```output
+------------------+----------+-----------------------+----------------+---------------------+
| V_VARCHAR        | V_NUMBER | V_TIMESTAMP           | V_ARRAY        | V_OBJECT            |
|------------------+----------+-----------------------+----------------+---------------------|
| "Skiing is fun!" | 3.14     | "2024-01-25 01:02:03" | [              | {                   |
|                  |          |                       |   "San Mateo", |   "key1": "value1", |
|                  |          |                       |   "Seattle",   |   "key2": "value2"  |
|                  |          |                       |   "Berlin"     | }                   |
|                  |          |                       | ]              |                     |
+------------------+----------+-----------------------+----------------+---------------------+
```

---
title: TO_XML
source: https://docs.snowflake.com/en/sql-reference/functions/to_xml.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Semi-structured and structured data functions](../functions-semistructured.md) (Cast)

# TO_XML

Converts a [VARIANT](../data-types-semistructured.md) to a VARCHAR that contains an [XML](../../user-guide/semistructured-data-formats.md) representation
of the value. If the input is NULL, the result is also NULL.

See also:
:   [CHECK_XML](check_xml.md), [PARSE_XML](parse_xml.md), [XMLGET](xmlget.md)

## Syntax

```sqlsyntax
TO_XML( <expression> )
```

## Arguments

`expression`
:   An expression that evaluates to a VARIANT or that can be [cast](../data-type-conversion.md) to a VARIANT.

## Returns

The data type of the returned value is VARCHAR.

## Usage notes

* Common uses for this function include:

  + Generating a string that contains an XML-formatted value that matches an originally inserted XML-formatted value.
  + Converting a semi-structured value (which doesn’t necessarily need to have been formatted as XML originally) into an
    XML-formatted value. For example, you can use TO_XML to generate an XML-compatible representation of a value that was
    originally formatted as JSON.
* If the input `expression` doesn’t evaluate to a VARIANT, Snowflake implicitly
  [casts](../data-type-conversion.md) the result of the expression to a VARIANT. Because all other Snowflake data types
  can be cast to VARIANT, this means that a value of any data type can be passed to TO_XML and converted to an XML-formatted string.
  (The [GEOGRAPHY](../data-types-geospatial.md) data type is a partial exception; to call TO_XML with a value of
  type GEOGRAPHY, you must explicitly cast the GEOGRAPHY value to VARIANT.)
* If the value didn’t originate as XML, then Snowflake generates XML-compatible tags. These tags may use the `type` attribute to
  specify the Snowflake data type of the tag’s contents. Below are examples of tags generated by Snowflake.

  The outermost tag pair is similar to the following:

  ```xml
  <SnowflakeData type="OBJECT"> </SnowflakeData>
  ```

  The data type specified in the `type` attribute of the tag can vary.

  For an OBJECT, each key-value pair’s tags are based on the key. For example:

  ```xml
  <key1 type="VARCHAR">value1</key1>
  ```

  For an ARRAY, each element of the array is in a tag pair similar to:

  ```xml
  <e type="VARCHAR"> </e>
  ```

  Here is a complete example of the XML for a simple OBJECT that contains two key-value pairs:

  ```xml
  <SnowflakeData type="OBJECT">
      <key1 type="VARCHAR">value1</key1>
      <key2 type="VARCHAR">value2</key2>
  </SnowflakeData>
  ```

  Here is a complete example of the XML for a simple ARRAY that contains two VARCHAR values:

  ```xml
  <SnowflakeData type="ARRAY">
      <e type="VARCHAR">v1</e>
      <e type="VARCHAR">v2</e>
  </SnowflakeData>
  ```

## Examples

This example shows how to use the function if you’ve loaded XML-formatted data into an OBJECT by calling
[PARSE_XML](parse_xml.md).

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE xml_02 (x OBJECT);

INSERT INTO xml_02 (x)
  SELECT PARSE_XML('<note> <body>Sample XML</body> </note>');
```

Call the TO_XML and TO_VARCHAR functions:

```sqlexample
SELECT x, TO_VARCHAR(x), TO_XML(x) FROM xml_02;
```

```output
+---------------------------+--------------------------------------+--------------------------------------+
| X                         | TO_VARCHAR(X)                        | TO_XML(X)                            |
|---------------------------+--------------------------------------+--------------------------------------|
| <note>                    | <note><body>Sample XML</body></note> | <note><body>Sample XML</body></note> |
|   <body>Sample XML</body> |                                      |                                      |
| </note>                   |                                      |                                      |
+---------------------------+--------------------------------------+--------------------------------------+
```

You can also call the TO_XML function with data that did not originate as XML-formatted data, as shown in the examples below.

The following example creates a simple OBJECT and then generates the corresponding XML. The XML output contains information about the
data types of the values in the key-value pairs, as well as the data type of the overall value (OBJECT).

```sqlexample
CREATE OR REPLACE TABLE xml_03 (object_col_1 OBJECT);

INSERT INTO xml_03 (object_col_1)
  SELECT OBJECT_CONSTRUCT('key1', 'value1', 'key2', 'value2');
```

```sqlexample
SELECT object_col_1, TO_XML(object_col_1)
  FROM xml_03;
```

```output
+---------------------+-------------------------------------------------------------------------------------------------------------------+
| OBJECT_COL_1        | TO_XML(OBJECT_COL_1)                                                                                              |
|---------------------+-------------------------------------------------------------------------------------------------------------------|
| {                   | <SnowflakeData type="OBJECT"><key1 type="VARCHAR">value1</key1><key2 type="VARCHAR">value2</key2></SnowflakeData> |
|   "key1": "value1", |                                                                                                                   |
|   "key2": "value2"  |                                                                                                                   |
| }                   |                                                                                                                   |
+---------------------+-------------------------------------------------------------------------------------------------------------------+
```

The following example creates a simple ARRAY and then generates the corresponding XML. The XML output contains information about the
data types of the array elements, as well as the data type of the overall value (ARRAY).

```sqlexample
CREATE OR REPLACE TABLE xml_04 (array_col_1 ARRAY);

INSERT INTO xml_04 (array_col_1)
  SELECT ARRAY_CONSTRUCT('v1', 'v2');
```

```sqlexample
SELECT array_col_1, TO_XML(array_col_1)
  FROM xml_04;
```

```output
+-------------+----------------------------------------------------------------------------------------------+
| ARRAY_COL_1 | TO_XML(ARRAY_COL_1)                                                                          |
|-------------+----------------------------------------------------------------------------------------------|
| [           | <SnowflakeData type="ARRAY"><e type="VARCHAR">v1</e><e type="VARCHAR">v2</e></SnowflakeData> |
|   "v1",     |                                                                                              |
|   "v2"      |                                                                                              |
| ]           |                                                                                              |
+-------------+----------------------------------------------------------------------------------------------+
```

The following example inserts data that is in JSON format and then generates the corresponding XML.

```sqlexample
CREATE OR REPLACE TABLE xml_05 (json_col_1 VARIANT);

INSERT INTO xml_05 (json_col_1)
  SELECT PARSE_JSON(' { "key1": ["a1", "a2"] } ');
```

```sqlexample
SELECT json_col_1,
       TO_JSON(json_col_1),
       TO_XML(json_col_1)
  FROM xml_05;
```

```output
+-------------+----------------------+-------------------------------------------------------------------------------------------------------------------------+
| JSON_COL_1  | TO_JSON(JSON_COL_1)  | TO_XML(JSON_COL_1)                                                                                                      |
|-------------+----------------------+-------------------------------------------------------------------------------------------------------------------------|
| {           | {"key1":["a1","a2"]} | <SnowflakeData type="OBJECT"><key1 type="ARRAY"><e type="VARCHAR">a1</e><e type="VARCHAR">a2</e></key1></SnowflakeData> |
|   "key1": [ |                      |                                                                                                                         |
|     "a1",   |                      |                                                                                                                         |
|     "a2"    |                      |                                                                                                                         |
|   ]         |                      |                                                                                                                         |
| }           |                      |                                                                                                                         |
+-------------+----------------------+-------------------------------------------------------------------------------------------------------------------------+
```

---
title: TRANSFORM
source: https://docs.snowflake.com/en/sql-reference/functions/transform.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Higher-order)

# TRANSFORM

Transforms an [array](../data-types-semistructured.md) based on the logic in a lambda expression.

See also:
:   [Use lambda functions on data with Snowflake higher-order functions](../../user-guide/querying-semistructured.md)

## Syntax

```sqlsyntax
TRANSFORM( <array> , <lambda_expression> )
```

## Arguments

`array`
:   The array that contains the elements to be transformed. The array can be semi-structured or structured.

`lambda_expression`
:   A [lambda expression](../../user-guide/querying-semistructured.md) that defines the transformation
    logic on each array element.

    The lambda expression must have only one argument specified in the following syntax:

    ```sqlsyntax
    <arg> [ <datatype> ] -> <expr>
    ```

## Returns

The return type of this function is a semi-structured or structured array of the lambda expression result.

If either argument is NULL, the function returns NULL without reporting an error.

## Usage notes

* When the data type for the lambda argument is explicitly specified, the array element is coerced into the specified type
  before lambda invocation. For information about coercion, see [Data type conversion](../data-type-conversion.md).
* When there is no data type specified for the lambda argument, its data type is derived from the input array as follows:

  + If the input array is semi-structured, the data type of the lambda argument is [VARIANT](../data-types-semistructured.md).
  + If the input array is structured, the data type of the lambda argument is the data type of the array element.
* For semi-structured array input, a semi-structured array is returned. For structured array input, a structured array
  of the lambda expression result type is returned.

## Examples

The following examples use the TRANSFORM function.

### Multiply each element in an array by a value

Use the TRANSFORM function to multiply each element in an array by two:

```sqlexample
SELECT TRANSFORM([1, 2, 3], a INT -> a * 2) AS "Multiply by Two";
```

```output
+-----------------+
| Multiply by Two |
|-----------------|
| [               |
|   2,            |
|   4,            |
|   6             |
| ]               |
+-----------------+
```

This example is the same as the previous example, but it specifies a structured array of type INT:

```sqlexample
SELECT TRANSFORM([1, 2, 3]::ARRAY(INT), a INT -> a * 2) AS "Multiply by Two (Structured)";
```

```output
+------------------------------+
| Multiply by Two (Structured) |
|------------------------------|
| [                            |
|   2,                         |
|   4,                         |
|   6                          |
| ]                            |
+------------------------------+
```

### Return values in an array with added text

Use the TRANSFORM function to return the value of each object in an array, and add text to each one:

```sqlexample
SELECT TRANSFORM(
    [
      {'name':'Pat', 'value': 50},
      {'name':'Terry', 'value': 75},
      {'name':'Dana', 'value': 25}
    ],
    c -> c:value || ' is the number'
  ) AS "Return Values";
```

```output
+-----------------------+
| Return Values         |
|-----------------------|
| [                     |
|   "50 is the number", |
|   "75 is the number", |
|   "25 is the number"  |
| ]                     |
+-----------------------+
```

### Transform array elements in table data

Assume you have a table named `orders` with the columns `order_id`, `order_date`, and `order_detail`. The
`order_detail` column is an array of the line items, their purchase quantity, and subtotal. The table contains
two rows of data. The following SQL statement creates this table and inserts the rows:

```sqlexample
CREATE OR REPLACE TABLE orders AS
  SELECT 1 AS order_id, '2024-01-01' AS order_date, [
    {'item':'UHD Monitor', 'quantity':3, 'subtotal':1500},
    {'item':'Business Printer', 'quantity':1, 'subtotal':1200}
  ] AS order_detail
  UNION
  SELECT 2 AS order_id, '2024-01-02' AS order_date, [
    {'item':'Laptop', 'quantity':5, 'subtotal':7500},
    {'item':'Noise-canceling Headphones', 'quantity':5, 'subtotal':1000}
  ] AS order_detail;

SELECT * FROM orders;
```

```output
+----------+------------+-------------------------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL                              |
|----------+------------+-------------------------------------------|
|        1 | 2024-01-01 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "UHD Monitor",                |
|          |            |     "quantity": 3,                        |
|          |            |     "subtotal": 1500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Business Printer",           |
|          |            |     "quantity": 1,                        |
|          |            |     "subtotal": 1200                      |
|          |            |   }                                       |
|          |            | ]                                         |
|        2 | 2024-01-02 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "Laptop",                     |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 7500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Noise-canceling Headphones", |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 1000                      |
|          |            |   }                                       |
|          |            | ]                                         |
+----------+------------+-------------------------------------------+
```

Use the TRANSFORM function to add a `unit_price` element to each array in the `orders` table:

```sqlexample
SELECT order_id,
       order_date,
       TRANSFORM(o.order_detail, i -> OBJECT_INSERT(
         i,
         'unit_price',
         (i:subtotal / i:quantity)::NUMERIC(10,2)
         )
       ) AS order_detail_with_unit_price
  FROM orders o;
```

```output
+----------+------------+-------------------------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL_WITH_UNIT_PRICE              |
|----------+------------+-------------------------------------------|
|        1 | 2024-01-01 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "UHD Monitor",                |
|          |            |     "quantity": 3,                        |
|          |            |     "subtotal": 1500,                     |
|          |            |     "unit_price": 500                     |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Business Printer",           |
|          |            |     "quantity": 1,                        |
|          |            |     "subtotal": 1200,                     |
|          |            |     "unit_price": 1200                    |
|          |            |   }                                       |
|          |            | ]                                         |
|        2 | 2024-01-02 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "Laptop",                     |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 7500,                     |
|          |            |     "unit_price": 1500                    |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Noise-canceling Headphones", |
|          |            |     "quantity": 5,                        |
|          |            |     "subtotal": 1000,                     |
|          |            |     "unit_price": 200                     |
|          |            |   }                                       |
|          |            | ]                                         |
+----------+------------+-------------------------------------------+
```

Use the TRANSFORM function along with the [OBJECT_DELETE](object_delete.md) function in the logic of the
lambda expression to delete the `quantity` element in each array from the `orders` table:

```sqlexample
SELECT order_id,
       order_date,
       TRANSFORM(o.order_detail, i -> OBJECT_DELETE(
         i,
         'quantity'
         )
       ) AS order_detail_without_quantity
  FROM orders o;
```

```output
+----------+------------+-------------------------------------------+
| ORDER_ID | ORDER_DATE | ORDER_DETAIL_WITHOUT_QUANTITY             |
|----------+------------+-------------------------------------------|
|        1 | 2024-01-01 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "UHD Monitor",                |
|          |            |     "subtotal": 1500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Business Printer",           |
|          |            |     "subtotal": 1200                      |
|          |            |   }                                       |
|          |            | ]                                         |
|        2 | 2024-01-02 | [                                         |
|          |            |   {                                       |
|          |            |     "item": "Laptop",                     |
|          |            |     "subtotal": 7500                      |
|          |            |   },                                      |
|          |            |   {                                       |
|          |            |     "item": "Noise-canceling Headphones", |
|          |            |     "subtotal": 1000                      |
|          |            |   }                                       |
|          |            | ]                                         |
+----------+------------+-------------------------------------------+
```

### Reference a table column in a lambda expression to transform array elements in table data

Create a table with one column of type ARRAY and another column of type INT:

```sqlexample
CREATE OR REPLACE TABLE transform_column_ref_demo AS
  SELECT [ 1, 2, 3 ] AS col1, 10 AS col2
  UNION
  SELECT [ 4, 5, 6 ] AS col1, -1 AS col2
  UNION
  SELECT [ 7, 8, 9 ] AS col1, NULL AS col2;

SELECT * FROM transform_column_ref_demo;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
| [    |   10 |
|   1, |      |
|   2, |      |
|   3  |      |
| ]    |      |
| [    |   -1 |
|   4, |      |
|   5, |      |
|   6  |      |
| ]    |      |
| [    | NULL |
|   7, |      |
|   8, |      |
|   9  |      |
| ]    |      |
+------+------+
```

Use the TRANSFORM function to add the value in `col2` to the value of each array element in each row:

```sqlexample
SELECT TRANSFORM(col1, v INT -> v + col2) AS transform_col_ref
  FROM transform_column_ref_demo;
```

```output
+-------------------+
| TRANSFORM_COL_REF |
|-------------------|
| [                 |
|   11,             |
|   12,             |
|   13              |
| ]                 |
| [                 |
|   3,              |
|   4,              |
|   5               |
| ]                 |
| [                 |
|   undefined,      |
|   undefined,      |
|   undefined       |
| ]                 |
+-------------------+
```

---
title: TRANSLATE
source: https://docs.snowflake.com/en/sql-reference/functions/translate.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# TRANSLATE

Replaces characters in a string. Specifically, given a string, a set of characters to replace, and
the characters to substitute for the original characters, TRANSLATE makes the specified substitutions.

> **Attention:**
>
> This function doesn’t translate between languages. See the [TRANSLATE (SNOWFLAKE.CORTEX)](translate-snowflake-cortex.md) function
> for translating text between natural languages.

## Syntax

```sqlsyntax
TRANSLATE( <subject>, <sourceAlphabet>, <targetAlphabet> )
```

## Arguments

`subject`
:   A string expression that is translated. If a character in `subject` isn’t
    in `sourceAlphabet`, the character is added to the result without any translation.

`sourceAlphabet`
:   A string with all characters that are modified by
    this function. Each character is either translated to the corresponding
    character in the `targetAlphabet` or omitted in the result. A character is
    omitted in the result if the `targetAlphabet` has no corresponding character
    (that is, has fewer characters than the `sourceAlphabet`).

`targetAlphabet`
:   A string with all characters that are used to replace characters from the
    `sourceAlphabet`.

    If `targetAlphabet` is longer than `sourceAlphabet`, Snowflake reports the
    following error:

    ```output
    String '(target alphabet)' is too long and would be truncated.
    ```

## Returns

This function returns a value of type VARCHAR.

## Collation details

Arguments with collation specifications currently aren’t supported. Collation specifications are ignored without returning an error.

## Examples

Translate the character `ñ` to `n`:

```sqlexample
SELECT TRANSLATE('peña','ñ','n') AS translation;
```

```output
+-------------+
| TRANSLATION |
|-------------|
| pena        |
+-------------+
```

Translate `X` to `c`, `Y` to `e`, `Z` to `f`, and remove `❄` characters:

```sqlexample
SELECT TRANSLATE('❄a❄bX❄dYZ❄','XYZ❄','cef') AS translation;
```

```output
+-------------+
| TRANSLATION |
|-------------|
| abcdef      |
+-------------+
```

---
title: TRANSLATE (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/translate-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# TRANSLATE (SNOWFLAKE.CORTEX)

> **Note:**
>
> [AI_TRANSLATE](ai_translate.md) is the latest version of this function.
> Use AI_TRANSLATE for the latest functionality.
> You can continue to use TRANSLATE (SNOWFLAKE.CORTEX).

Translates the given input text from one supported language to another.

> **Attention:**
>
> This function does not transform a string given a search string and a replacement string. See the [TRANSLATE](translate.md) function if that
> functionality is what you’re looking for.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.TRANSLATE(
    <text>, <source_language>, <target_language>)
```

## Arguments

`text`
:   A string containing the text to be translated.

`source_language`
:   A string specifying the language code for the language the text is currently in. See Usage notes for a list of
    supported language codes. If the source language code is an empty string, `''`, the source language is
    automatically detected.

`target_language`
:   A string specifying the language code into which the text should be translated. See Usage notes for a list of
    supported language codes.

## Returns

A string containing a translation of the original text into the target language.

## Usage notes

The following languages are supported by the TRANSLATE function. Use the corresponding language code for the source and
target language.

The TRANSLATE model also supports a mix of two different languages in the text being translated (for example,
“Spanglish”). In this case, specify an empty string (`''`) as the source language to auto-detect the languages
used in the source text.

| Language | Code |
| --- | --- |
| Chinese | `'zh'` |
| Dutch | `'nl'` |
| English | `'en'` |
| French: | `'fr'` |
| German | `'de'` |
| Hindi | `'hi'` |
| Italian | `'it'` |
| Japanese | `'ja'` |
| Korean | `'ko'` |
| Polish | `'pl'` |
| Portuguese | `'pt'` |
| Russian | `'ru'` |
| Spanish | `'es'` |
| Swedish | `'sv'` |

The TRANSLATE function produces its best results when either the source or target language is English (for example,
English to Spanish or German to English). Results for other language pairs, such as German to Spanish, might be less
accurate.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Examples

The following example translates each row of a table from English to German (in this example, `review_content` is
a column from the `reviews` table):

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRANSLATE(review_content, 'en', 'de') FROM reviews LIMIT 10;
```

The following example translates a fictitious product review from English to Spanish:

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRANSLATE(
  'Hit the slopes with Snowflake\'s latest innovation - "Skii Headphones" designed to keep your ears warm and your soul ablaze. Engineered specifically for snow weather, these rugged headphones combine crystal-clear sound with thermally-insulated ear cups to keep the chill out and the beats in. Whether you\'re carving through powder or cruising down groomers, Skii Headphones will fuel your mountain adventures with vibrant sound and unrelenting passion. Stay warm, stay fired up, and shred the mountain with Snowflake Skii Headphones',
'en','es');
```

The result of this query is:

```output
Sube a las pistas con la última innovación de Snowflake: "Skii Headphones", diseñados para mantener tus oídos calientes y tu alma encendida. Diseñados específicamente para el clima de nieve, estos audífonos resistentes combinan un sonido cristalino con copas de oído aisladas térmicamente para mantener el frío fuera y los ritmos dentro. Ya sea que estés esculpiendo en polvo o deslizándote por pistas preparadas, los Skii Headphones alimentarán tus aventuras en la montaña con un sonido vibrante y una pasión incesante. Mantente caliente, mantente encendido y arrasa la montaña con los Skii Headphones de Snowflake.
```

The following example translates a call transcript from German to English:

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRANSLATE
  ('Kunde: Hallo
    Agent: Hallo, ich hoffe, es geht Ihnen gut. Um Ihnen am besten helfen zu können, teilen Sie bitte Ihren Vor- und Nachnamen und den Namen der Firma, von der aus Sie anrufen.
    Kunde: Ja, hier ist Thomas Müller von SkiPisteExpress.
    Agent: Danke Thomas, womit kann ich Ihnen heute helfen?
    Kunde: Also wir haben die XtremeX Helme in Größe M bestellt, die wir speziell für die kommende Wintersaison benötigen. Jedoch sind alle Schnallen der Helme defekt, und keiner schließt richtig.
    Agent: Ich verstehe, dass das ein Problem für Ihr Geschäft sein kann. Lassen Sie mich überprüfen, was mit Ihrer Bestellung passiert ist. Um zu bestätigen: Ihre Bestellung endet mit der Nummer 56682?
    Kunde: Ja, das ist meine Bestellung.
    Agent: Ich sehe das Problem. Entschuldigen Sie die Unannehmlichkeiten. Ich werde sofort eine neue Lieferung mit reparierten Schnallen für Sie vorbereiten, die in drei Tagen bei Ihnen eintreffen sollte. Ist das in Ordnung für Sie?
    Kunde: Drei Tage sind ziemlich lang, ich hatte gehofft, diese Helme früher zu erhalten. Gibt es irgendeine Möglichkeit, die Lieferung zu beschleunigen?
    Agent: Ich verstehe Ihre Dringlichkeit. Ich werde mein Bestes tun, um die Lieferung auf zwei Tage zu beschleunigen. Wie kommst du damit zurecht?
    Kunde: Das wäre großartig, ich wäre Ihnen sehr dankbar.
    Agent: Kein Problem, Thomas. Ich kümmere mich um die eilige Lieferung. Danke für Ihr Verständnis und Ihre Geduld.
    Kunde: Vielen Dank für Ihre Hilfe. Auf Wiedersehen!
    Agent: Bitte, gerne geschehen. Auf Wiedersehen und einen schönen Tag noch!'
,'de','en');
```

The result is:

```output
Customer: Hello
Agent: Hello, I hope you are well. To best assist you, please share your first and last name and the name of the company you are calling from.
Customer: Yes, this is Thomas Müller from SkiPisteExpress.
Agent: Thank you, Thomas, what can I help you with today?
Customer: So, we ordered the XtremeX helmets in size M, which we specifically need for the upcoming winter season. However, all the buckles on the helmets are defective and none of them close properly.
Agent: I understand that this can be a problem for your business. Let me check what happened with your order. To confirm: your order ends with the number 56682?
Customer: Yes, that's my order.
Agent: I see the issue. I apologize for the inconvenience. I will prepare a new delivery with repaired buckles for you immediately, which should arrive in three days. Is that okay for you?
Customer: Three days is quite a long time; I was hoping to receive these helmets sooner. Is there any way to expedite the delivery?
Agent: I understand your urgency. I will do my best to expedite the delivery to two days. How does that sound?
Customer: That would be great, I would be very grateful.
Agent: No problem, Thomas. I will take care of the urgent delivery. Thank you for your understanding and patience.
Customer: Thank you very much for your help. Goodbye!
Agent: You're welcome. Goodbye and have a nice day!
```

Finally, the following example illustrates translating text from two different languages (in this case English and Spanish, or “Spanglish”) to English.
Note that the specification of the source language is the empty string.

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRANSLATE ('Voy a likear tus fotos en Insta.', '', 'en')
```

This query results in:

```output
I'm going to like your photos on Insta.
```

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: TRIM
source: https://docs.snowflake.com/en/sql-reference/functions/trim.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# TRIM

Removes leading and trailing characters from a string.

> **Note:**
>
> To remove characters in a string, you can use the [REPLACE](replace.md) function.

See also:
:   [LTRIM](ltrim.md) , [RTRIM](rtrim.md) , [String & binary data types](../data-types-text.md)

## Syntax

```sqlsyntax
TRIM( <expr> [, <characters> ] )
```

## Arguments

`expr`
:   A string expression to be trimmed.

`characters`
:   One or more characters to remove from the left and right side of `expr`.

    The default value is `' '` (a single blank space character).
    If no characters are specified, only blank spaces are removed.

## Returns

This function returns a value of VARCHAR data type or NULL. If either argument is NULL, returns NULL.

## Usage notes

* You can specify the characters in `characters` in any order.
* A specification of `' '` in `characters` does not remove other whitespace
  characters (such as tabulation characters, end-of-line characters, and so on). Explicitly
  specify these characters to remove them.

* To remove whitespace, the characters must be explicitly included in the
  argument. For example, `' $.'` removes all leading and trailing blank
  spaces, dollar signs, and periods from the input string.

## Collation details

[Collation](../collation.md) is supported when the optional second argument is omitted, or when it
contains only whitespace.

The collation specification of the returned value is the same as the collation specification of the first argument.

## Examples

Remove leading and trailing `*` and `-` characters from a string:

```sqlexample
SELECT '*-*ABC-*-' AS original,
       TRIM('*-*ABC-*-', '*-') AS trimmed;
```

```output
+-----------+---------+
| ORIGINAL  | TRIMMED |
|-----------+---------|
| *-*ABC-*- | ABC     |
+-----------+---------+
```

Remove a trailing new line from a string. This example uses the [CONCAT](concat.md) function to enclose
the strings in `>` and `<` characters to help you visualize the whitespace.

```sqlexample
SELECT CONCAT('>', CONCAT('ABC\n', '<')) AS original,
       CONCAT('>', CONCAT(TRIM('ABC\n', '\n'), '<')) AS trimmed;
```

```output
+----------+---------+
| ORIGINAL | TRIMMED |
|----------+---------|
| >ABC     | >ABC<   |
| <        |         |
+----------+---------+
```

Remove leading and trailing whitespace from a string. This example encloses
the strings in `>` and `<` characters to help you visualize the whitespace.
It also shows that the function returns NULL for NULL input.

```sqlexample
CREATE OR REPLACE TABLE test_trim_function(column1 VARCHAR);

INSERT INTO test_trim_function VALUES ('  Leading Spaces'), ('Trailing Spaces  '), (NULL);

SELECT CONCAT('>', CONCAT(column1, '<')) AS original_values,
       CONCAT('>', CONCAT(TRIM(column1), '<')) AS trimmed_values
  FROM test_trim_function;
```

```output
+---------------------+-------------------+
| ORIGINAL_VALUES     | TRIMMED_VALUES    |
|---------------------+-------------------|
| >  Leading Spaces<  | >Leading Spaces<  |
| >Trailing Spaces  < | >Trailing Spaces< |
| NULL                | NULL              |
+---------------------+-------------------+
```

---
title: TRUNCATE , TRUNC
source: https://docs.snowflake.com/en/sql-reference/functions/trunc.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md) (Rounding and Truncation)

# TRUNCATE , TRUNC

Rounds the input expression down to the nearest (or equal) value closer to zero.
Depending on the value you specify as the scale parameter, the transformation can remove:

* All the digits after the decimal point, producing an integer value. This is the default
  and most common use of TRUNC for numbers.
* Some of the significant digits after the decimal point, producing a less precise value.
* All the significant digits after the decimal point and some significant digits
  to the left of the decimal point, producing a value that is a multiple of 10, 100, or other power of 10.

The TRUNCATE and TRUNC functions are synonymous.

> **Note:**
>
> TRUNC is overloaded. It can also be used with date/time values to [truncate dates, times, and timestamps](trunc2.md)
> to a specified part. The numeric TRUNC has one required and one optional parameter. The date/time TRUNC has two required parameters.

See also:
:   [CEIL](ceil.md) , [FLOOR](floor.md) , [ROUND](round.md)

## Syntax

```sqlsyntax
TRUNCATE( <input_expr> [ , <scale_expr> ] )

TRUNC( <input_expr> [ , <scale_expr> ] )
```

## Arguments

`input_expr`
:   The value or expression to operate on. The data type must be one of the numeric data types, such as DECFLOAT,
    FLOAT, or NUMBER.

`scale_expr`
:   The number of digits to include after the decimal point.

    The default `scale_expr` is zero, meaning that the function removes all digits after the decimal point.

    For information about negative scales, see the Usage notes below.

## Returns

* If the input is a NUMBER value, the data type of the returned value is NUMBER(precision, scale).

  If the input scale was greater than or equal to zero, then the output scale generally matches the input scale.

  If the input scale was negative, then the output scale is 0.

  For example:

  + The data type returned by `TRUNCATE(3.14, 1)` is `NUMBER(4, 1)`.
  + The data type returned by `TRUNCATE(3.14, 0)` is `NUMBER(4, 0)`.
  + The data type returned by `TRUNCATE(33.33, -1)` is `NUMBER(5, 0)`.

  If the scale is zero, then the value is effectively an integer.
* If the input is a FLOAT value, the data type of the returned value is FLOAT.
* If the input is a DECFLOAT value, the data type of the returned value is DECFLOAT.

## Usage notes

* If `scale_expr` is negative, then it specifies the number of places before the decimal point to
  which to adjust the number. For example, if the scale is -2, then the result is a multiple of 100.
* If `scale_expr` is larger than the input expression scale, the function does not have any effect.
* If either the `input_expr` or the `scale_expr` is NULL, then the result is NULL.
* Truncation is performed towards 0, not towards the smaller number. For example, `TRUNCATE(-9.6)` results in `-9`, not `-10`.

## Examples

The following examples demonstrate the TRUNC function for numeric values.
For examples of truncating dates, times, and timestamps, see [the date/time form of TRUNC](trunc2.md).

The examples use data from this sample table. The table contains two different decimal numbers,
-975.975 and 135.135, along with different values to use for the scale parameter with the TRUNC function.

```sqlexample
CREATE TABLE numeric_trunc_demo (n FLOAT, scale INTEGER);
INSERT INTO numeric_trunc_demo (n, scale) VALUES
   (-975.975, -1), (-975.975,  0), (-975.975,  2),
   ( 135.135, -2), ( 135.135,  0), ( 135.135,  1),
   ( 135.135,  3), ( 135.135, 50), ( 135.135, NULL);
```

When you don’t specify a scale parameter, the default behavior for TRUNC
with a numeric parameter is to return the integer value that’s equal to
the parameter or closer to zero. Specifying a scale parameter of 0
does the same thing.

```sqlexample
SELECT DISTINCT n, TRUNCATE(n)
  FROM numeric_trunc_demo ORDER BY n;
```

```output
+----------+-------------+
|        N | TRUNCATE(N) |
|----------+-------------|
| -975.975 |        -975 |
|  135.135 |         135 |
+----------+-------------+
```

The following example shows the results of calling the TRUNC function with
zero, positive, or negative scale parameters applied to a positive and a negative
number.

* Specifying a zero scale parameter removes all the digits after the decimal point, producing an integer value.
* Specifying a positive scale parameter leaves the specified number of significant digits after the decimal point.
* Specifying a negative scale parameter turns that many digits into zeroes to the left of the decimal point.
* Specifying a scale that is greater than +38 or less than -38 is the same as specifying +38 or -38.

```sqlexample
SELECT n, scale, TRUNC(n, scale)
  FROM numeric_trunc_demo ORDER BY n, scale;
```

```output
+----------+-------+-----------------+
|        N | SCALE | TRUNC(N, SCALE) |
|----------+-------+-----------------|
| -975.975 |    -1 |        -970     |
| -975.975 |     0 |        -975     |
| -975.975 |     2 |        -975.97  |
|  135.135 |    -2 |         100     |
|  135.135 |     0 |         135     |
|  135.135 |     1 |         135.1   |
|  135.135 |     3 |         135.135 |
|  135.135 |    50 |         135.135 |
|  135.135 |  NULL |            NULL |
+----------+-------+-----------------+
```

---
title: TRUNCATE, TRUNC
source: https://docs.snowflake.com/en/sql-reference/functions/trunc2.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# TRUNCATE, TRUNC

Truncates a DATE, TIME, or TIMESTAMP value to the specified precision. For example,
truncating a timestamp down to the quarter returns the timestamp corresponding
to midnight of the first day of the original timestamp’s quarter.

This function provides an alternative syntax for [DATE_TRUNC](date_trunc.md) by reversing the
two arguments.

The TRUNCATE and TRUNC functions are synonymous.

Truncation is not the same as extraction. For example:

* Truncating a timestamp down to the quarter using this function returns the timestamp corresponding
  to midnight of the first day of the quarter for the input timestamp.
* Extracting the quarter date part from a timestamp using the [EXTRACT](extract.md) function returns the
  quarter number of the year in the timestamp.

> **Note:**
>
> TRUNC is overloaded. It can also be used with numeric values to [reduce the number of significant digits](trunc.md),
> such as by truncating a decimal value to an integer. The numeric TRUNC has one required and one optional parameter.
> The date/time TRUNC has two required parameters.

Alternatives:
:   [DATE_TRUNC](date_trunc.md)

See also:
:   [DATE_PART](date_part.md) , [EXTRACT](extract.md)

## Syntax

```sqlsyntax
TRUNC( <date_or_time_expr>, <date_or_time_part> )
```

## Arguments

`date_or_time_expr`
:   This argument must evaluate to a date, time, or timestamp.

`date_or_time_part`
:   This argument must be one of the values listed in [Supported date and time parts](../functions-date-time.md).

## Returns

The returned value is the same type as the input value.

For example, if the input value is a TIMESTAMP, then the returned value is a TIMESTAMP.

## Usage notes

* When `date_or_time_part` is `week` (or any of its variations), the output is controlled
  by the [WEEK_START](../parameters.md) session parameter. For more details, including examples, see
  [Calendar weeks and weekdays](../functions-date-time.md).
* For TIME values, you can’t specify a `date_or_time_part` that is outside the scope of the TIME type.
  For example, you can truncate a TIMESTAMP value to a `day`, `week`, `year`, and so on because the TIMESTAMP type
  encodes date/times with the required precision. However, trying to truncate a TIME value to a `day`, `week`, `year`,
  and so on causes an error.

## Examples

The following examples demonstrate the TRUNC or TRUNCATE function for date/time values.
For examples of truncating numeric values, see [the numeric form of TRUNC](trunc.md).

The function examples use the data in the following table:

```sqlexample
CREATE OR REPLACE TABLE test_date_trunc (
 mydate DATE,
 mytime TIME,
 mytimestamp TIMESTAMP);

INSERT INTO test_date_trunc VALUES (
  '2024-05-09',
  '08:50:48',
  '2024-05-09 08:50:57.891 -0700');

SELECT * FROM test_date_trunc;
```

```output
+------------+----------+-------------------------+
| MYDATE     | MYTIME   | MYTIMESTAMP             |
|------------+----------+-------------------------|
| 2024-05-09 | 08:50:48 | 2024-05-09 08:50:57.891 |
+------------+----------+-------------------------+
```

The following examples show date truncation. In all cases, the returned value
is of the same data type as the input value, but with zeros for the portions,
such as fractional seconds, that were truncated.

Truncate a DATE value down to the year, month, and day:

```sqlexample
SELECT mydate AS "DATE",
       TRUNC(mydate, 'year') AS "TRUNCATED TO YEAR",
       TRUNC(mydate, 'month') AS "TRUNCATED TO MONTH",
       TRUNC(mydate, 'day') AS "TRUNCATED TO DAY"
  FROM test_date_trunc;
```

```output
+------------+-------------------+--------------------+------------------+
| DATE       | TRUNCATED TO YEAR | TRUNCATED TO MONTH | TRUNCATED TO DAY |
|------------+-------------------+--------------------+------------------|
| 2024-05-09 | 2024-01-01        | 2024-05-01         | 2024-05-09       |
+------------+-------------------+--------------------+------------------+
```

Truncate a TIME value down to the minute:

```sqlexample
SELECT mytime AS "TIME",
       TRUNCATE(mytime, 'minute') AS "TRUNCATED TO MINUTE"
  FROM test_date_trunc;
```

```output
+----------+---------------------+
| TIME     | TRUNCATED TO MINUTE |
|----------+---------------------|
| 08:50:48 | 08:50:00            |
+----------+---------------------+
```

Truncate a TIMESTAMP value down to the hour, minute, and second:

```sqlexample
SELECT mytimestamp AS "TIMESTAMP",
       TRUNCATE(mytimestamp, 'hour') AS "TRUNCATED TO HOUR",
       TRUNCATE(mytimestamp, 'minute') AS "TRUNCATED TO MINUTE",
       TRUNCATE(mytimestamp, 'second') AS "TRUNCATED TO SECOND"
  FROM test_date_trunc;
```

```output
+-------------------------+-------------------------+-------------------------+-------------------------+
| TIMESTAMP               | TRUNCATED TO HOUR       | TRUNCATED TO MINUTE     | TRUNCATED TO SECOND     |
|-------------------------+-------------------------+-------------------------+-------------------------|
| 2024-05-09 08:50:57.891 | 2024-05-09 08:00:00.000 | 2024-05-09 08:50:00.000 | 2024-05-09 08:50:57.000 |
+-------------------------+-------------------------+-------------------------+-------------------------+
```

Contrast the TRUNC function with the [EXTRACT](extract.md) function:

```sqlexample
SELECT TRUNC(mytimestamp, 'quarter') AS "TRUNCATED",
       EXTRACT('quarter', mytimestamp) AS "EXTRACTED"
  FROM test_date_trunc;
```

```output
+-------------------------+-----------+
| TRUNCATED               | EXTRACTED |
|-------------------------+-----------|
| 2024-04-01 00:00:00.000 |         2 |
+-------------------------+-----------+
```

---
title: TRY_BASE64_DECODE_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/try_base64_decode_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# TRY_BASE64_DECODE_BINARY

A special version of [BASE64_DECODE_BINARY](base64_decode_binary.md) that
returns a NULL value if an error occurs during decoding.

## Syntax

```sqlsyntax
TRY_BASE64_DECODE_BINARY(<input> [, <alphabet>])
```

## Arguments

`input`
:   The base64-encoded string to convert to BINARY data type.

`alphabet`
:   A string consisting of up to three ASCII characters:

    * The first two characters in the string specify the last two characters (indexes 62 and 63) in the alphabet used to encode the input:

      + `A` to `Z` (indexes 0-25).
      + `a` to `z` (indexes 26-51).
      + `0` to `9` (indexes 52-61).
      + `+` and `/` (indexes 62, 63).

      Defaults: `+` and `/`
    * The third character in the string specifies the character used for padding.

      Default: `=`

## Returns

This returns a `BINARY` value. The value can be inserted into a column of
type `BINARY`, for example.

## Usage notes

For more information about base64 format, see [base64](../binary-input-output.md).

## Examples

This shows how to use the function `TRY_BASE64_DECODE_BINARY`. The function
is used in the `INSERT` statement to decode a base64-encoded string
into a BINARY field; the function is not used in the `SELECT`
statement.

> Create a table and insert data:
>
> > ```sqlexample
> > CREATE TABLE base64 (v VARCHAR, base64_encoded_varchar VARCHAR, b BINARY);
> > INSERT INTO base64 (v, base64_encoded_varchar, b)
> >    SELECT 'HELP', BASE64_ENCODE('HELP'),
> >       TRY_BASE64_DECODE_BINARY(BASE64_ENCODE('HELP'));
> > ```
>
> Now run a query to show that we can retrieve the data intact:
>
> > ```sqlexample
> > SELECT v, base64_encoded_varchar,
> >     -- Convert binary -> base64-encoded-string
> >     TO_VARCHAR(b, 'BASE64'),
> >     -- Convert binary back to original value
> >     TO_VARCHAR(b, 'UTF-8')
> >   FROM base64;
> > +------+------------------------+-------------------------+------------------------+
> > | V    | BASE64_ENCODED_VARCHAR | TO_VARCHAR(B, 'BASE64') | TO_VARCHAR(B, 'UTF-8') |
> > |------+------------------------+-------------------------+------------------------|
> > | HELP | SEVMUA==               | SEVMUA==                | HELP                   |
> > +------+------------------------+-------------------------+------------------------+
> > ```

---
title: TRY_BASE64_DECODE_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/try_base64_decode_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# TRY_BASE64_DECODE_STRING

A special version of [BASE64_DECODE_STRING](base64_decode_string.md) that
returns a NULL value if an error occurs during decoding.

`BASE64_DECODE_STRING` and `TRY_BASE64_DECODE_STRING` are “reciprocal”
(or “converse”) functions of `BASE64_ENCODE`.

## Syntax

```sqlsyntax
TRY_BASE64_DECODE_STRING(<input> [, <alphabet>])
```

## Arguments

`input`
:   The base64-encoded string to decode to a normal string.

`alphabet`
:   A string consisting of up to three ASCII characters:

    * The first two characters in the string specify the last two characters (indexes 62 and 63) in the alphabet used to encode the input:

      + `A` to `Z` (indexes 0-25).
      + `a` to `z` (indexes 26-51).
      + `0` to `9` (indexes 52-61).
      + `+` and `/` (indexes 62, 63).

      Defaults: `+` and `/`
    * The third character in the string specifies the character used for padding.

      Default: `=`

## Returns

A string.

## Usage notes

For more information about base64 format, see [base64](../binary-input-output.md).

## Examples

This shows how to use the function and demonstrates that
`TRY_BASE64_DECODE_STRING` is the converse of `BASE64_ENCODE`:

> ```sqlexample
> SELECT TRY_BASE64_DECODE_STRING(BASE64_ENCODE('HELLO'));
> +--------------------------------------------------+
> | TRY_BASE64_DECODE_STRING(BASE64_ENCODE('HELLO')) |
> |--------------------------------------------------|
> | HELLO                                            |
> +--------------------------------------------------+
> ```

This shows a more realistic example:

> Create a table and data:
>
> > ```sqlexample
> > CREATE TABLE base64 (v VARCHAR, base64_string VARCHAR, garbage VARCHAR);
> > INSERT INTO base64 (v, base64_string, garbage)
> >   SELECT 'HELLO', BASE64_ENCODE('HELLO'), '127';
> > ```
>
> Query the data using the `TRY_BASE64_DECODE_STRING` function:
>
> > ```sqlexample
> > SELECT v, base64_string, TRY_BASE64_DECODE_STRING(base64_string), TRY_BASE64_DECODE_STRING(garbage) FROM base64;
> > +-------+---------------+-----------------------------------------+-----------------------------------+
> > | V     | BASE64_STRING | TRY_BASE64_DECODE_STRING(BASE64_STRING) | TRY_BASE64_DECODE_STRING(GARBAGE) |
> > |-------+---------------+-----------------------------------------+-----------------------------------|
> > | HELLO | SEVMTE8=      | HELLO                                   | NULL                              |
> > +-------+---------------+-----------------------------------------+-----------------------------------+
> > ```

---
title: TRY_CAST
source: https://docs.snowflake.com/en/sql-reference/functions/try_cast.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_CAST

A special version of [CAST , ::](cast.md) that is available for a subset of data type conversions. It performs the same operation (i.e. converts a value of one data type into another data type), but returns a NULL value instead of
raising an error when the conversion can not be performed.

For more information, see [Error-handling conversion functions](../functions-conversion.md).

## Syntax

```sqlsyntax
TRY_CAST( <source_string_expr> AS <target_data_type> )
```

## Usage notes

* Only works for string expressions.
* `target_data_type` must be one of the following:

  + VARCHAR (or any of its synonyms)
  + NUMBER (or any of its synonyms)
  + DOUBLE
  + BOOLEAN
  + DATE
  + An interval variation
  + TIME
  + TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ

## Examples

The following code samples show how to use the `TRY_CAST` function with
valid and invalid values:

> ```sqlexample
> SELECT TRY_CAST('05-Mar-2016' AS TIMESTAMP);
> +--------------------------------------+
> | TRY_CAST('05-MAR-2016' AS TIMESTAMP) |
> |--------------------------------------|
> | 2016-03-05 00:00:00.000              |
> +--------------------------------------+
> ```
>
> ```sqlexample
> SELECT TRY_CAST('05/16' AS TIMESTAMP);
> +--------------------------------+
> | TRY_CAST('05/16' AS TIMESTAMP) |
> |--------------------------------|
> | NULL                           |
> +--------------------------------+
> ```
>
> ```sqlexample
> SELECT TRY_CAST('ABCD' AS CHAR(2));
> +-----------------------------+
> | TRY_CAST('ABCD' AS CHAR(2)) |
> |-----------------------------|
> | NULL                        |
> +-----------------------------+
> ```
>
> ```sqlexample
> SELECT TRY_CAST('ABCD' AS VARCHAR(10));
> +---------------------------------+
> | TRY_CAST('ABCD' AS VARCHAR(10)) |
> |---------------------------------|
> | ABCD                            |
> +---------------------------------+
> ```

---
title: TRY_COMPLETE (SNOWFLAKE.CORTEX)
source: https://docs.snowflake.com/en/sql-reference/functions/try_complete-snowflake-cortex.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (AI Functions)

# TRY_COMPLETE (SNOWFLAKE.CORTEX)

Performs the same operation as the [COMPLETE](complete-snowflake-cortex.md) function
but returns NULL instead of raising an error when the operation cannot be performed.

## Syntax

```sqlsyntax
SNOWFLAKE.CORTEX.TRY_COMPLETE( <model>, <prompt_or_history> [ , <options> ] )
```

## Arguments

**Required:**

`model`
:   A string specifying the model to be used. Specify one of the following values.

    * `claude-4-opus`
    * `claude-4-sonnet`
    * `claude-3-7-sonnet`
    * `claude-3-5-sonnet`
    * `deepseek-r1`
    * `llama3-8b`
    * `llama3-70b`
    * `llama3.1-8b`
    * `llama3.1-70b`
    * `llama3.1-405b`
    * `llama3.3-70b`
    * `llama4-maverick`
    * `llama4-scout`
    * `mistral-large`
    * `mistral-large2`
    * `mistral-7b`
    * `mixtral-8x7b`
    * `openai-gpt-4.1`
    * `openai-gpt-5`
    * `openai-gpt-5-chat`
    * `openai-gpt-5-mini`
    * `openai-gpt-5-nano`
    * `openai-gpt-5.1`
    * `openai-o4-mini`
    * `snowflake-arctic`
    * `snowflake-llama-3.1-405b`
    * `snowflake-llama-3.3-70b`

    Supported models might have different [costs](../../user-guide/snowflake-cortex/aisql.md).

`prompt_or_history`
:   The prompt or conversation history to be used to generate a completion.

    If `options` is not present, the prompt given must be a string.

    If `options` is present, the argument must be an [array](../data-types-semistructured.md) of objects representing a
    conversation in chronological order. Each [object](../data-types-semistructured.md) must contain a `role` key and a
    `content` key. The `content` value is a prompt or a response, depending on the role. The role must be one of the
    following.

> | `role` value | `content` value |
> | --- | --- |
> | `'system'` | An initial plain-English prompt to the language model to provide it with background information and instructions for a response style. For example, “Respond in the style of a pirate.” The model does not generate a response to a system prompt. Only one system prompt may be provided, and if it is present, it must be the first in the array. |
> | `'user'` | A prompt provided by the user. Must follow the system prompt (if there is one) or an assistant response. |
> | `'assistant'` | A response previously provided by the language model. Must follow a user prompt. Past responses can be used to provide a stateful conversational experience; see Usage Notes. |

**Optional:**

`options`
:   An [object](../data-types-semistructured.md) containing zero or more of the following options that affect the model’s
    hyperparameters. See [LLM Settings](https://www.promptingguide.ai/introduction/settings).

    * `temperature`: A value from 0 to 1 (inclusive) that controls the randomness of the output of the language model. A
      higher temperature (for example, 0.7) results in more diverse and random output, while a lower temperature (such as
      0.2) makes the output more deterministic and focused.

      Default: 0
    * `top_p`: A value from 0 to 1 (inclusive) that controls the randomness and diversity of the language model,
      generally used as an alternative to `temperature`. The difference is that `top_p` restricts the set of possible tokens
      that the model outputs, while `temperature` influences which tokens are chosen at each step.

      Default: 0
    * `max_tokens`: Sets the maximum number of output tokens in the response. Small values can result in truncated responses.

      Default: 4096
      Maximum allowed value: 8192
    * `guardrails`: Filters potentially unsafe and harmful responses from a language model using [Cortex Guard](../../user-guide/snowflake-cortex/aisql.md).
      Either TRUE or FALSE.

      Default: FALSE
    * `response_format`: A [JSON schema](https://json-schema.org/) that the response should follow. This is a SQL
      sub-object, not a string. If `response_format` is not specified, the response is a string containing either the
      response or a serialized JSON object containing the response and information about it.

      For more information, see [AI_COMPLETE structured outputs](../../user-guide/snowflake-cortex/complete-structured-outputs.md).

    Specifying the `options` argument, even if it is an empty object (`{}`), affects how the `prompt` argument is
    interpreted and how the response is formatted.

## Returns

When the `options` argument is not specified, returns a string containing the response.

When the `options` argument is given, and this object contains the `response_format` key, returns a string
representation of a JSON object adhering to the specified JSON schema.

When the `options` argument is given, and this object *does not* contain the `response_format` key, returns a
string representation of a JSON object containing the following keys.

* `"choices"`: An array of the model’s responses. (Currently, only one response is provided.) Each response is
  an object containing a `"messages"` key whose value is the model’s response to the latest prompt.
* `"created"`: UNIX timestamp (seconds since midnight, January 1, 1970) when the response was generated.
* `"model"`: The name of the model that created the response.
* `"usage"`: An object recording the number of tokens consumed and generated by this completion. Includes
  the following sub-keys:

  + `"completion_tokens"`: The number of tokens in the generated response.
  + `"prompt_tokens"`: The number of tokens in the prompt.
  + `"total_tokens"`: The total number of tokens consumed, which is the sum of the other two values.

## Access control requirements

Users must use a role that has been granted the [SNOWFLAKE.CORTEX_USER database role](../snowflake-db-roles.md).
See [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md) for more information on this privilege.

## Usage notes

TRY_COMPLETE does not retain any state from one call to the next. To use the TRY_COMPLETE function to provide a stateful,
conversational experience, pass all previous user prompts and model responses in the conversation as part of the `prompt_or_history`
array (see [Templates for Chat Models](https://huggingface.co/docs/transformers/en/chat_templating#templates-for-chat-models)).
Keep in mind that the number of tokens processed increases for each “round,” and costs increase proportionally.

## Examples

The following examples use the TRY_COMPLETE function in various use cases.

### Generating a single response

To generate a single response:

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRY_COMPLETE('snowflake-arctic', 'What are large language models?');
```

### Controlling temperature and tokens

This example illustrates the use of the function’s `options` argument to control the inference hyperparameters in a
single response. Note that in this form of the function, the prompt must be provided as an array, since this form
supports multiple prompts and responses.

```sqlexample
SELECT SNOWFLAKE.CORTEX.TRY_COMPLETE(
    'deepseek-r1',
    [
        {
            'role': 'user',
            'content': 'how does a snowflake get its unique pattern?'
        }
    ],
    {
        'temperature': 0.7,
        'max_tokens': 10
    }
);
```

The response is a JSON object containing the message from the language model and other information. Note that the response
is truncated as instructed in the `options` argument.

```json
{
    "choices": [
        {
            "messages": " The unique pattern on a snowflake is"
        }
    ],
    "created": 1708536426,
    "model": "deepseek-r1",
    "usage": {
        "completion_tokens": 10,
        "prompt_tokens": 22,
        "total_tokens": 32
    }
}
```

For additional examples, see the [COMPLETE (SNOWFLAKE.CORTEX)](complete-snowflake-cortex.md) reference.

## Legal notices

Refer to [Snowflake AI and ML](../../guides-overview-ai-features.md).

---
title: TRY_DECRYPT
source: https://docs.snowflake.com/en/sql-reference/functions/try_decrypt.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# TRY_DECRYPT

A special version of [DECRYPT](decrypt.md) that returns a NULL
value if an error occurs during decryption.

See also:
:   [ENCRYPT](encrypt.md) , [ENCRYPT_RAW](encrypt_raw.md) , [DECRYPT](decrypt.md) , [DECRYPT_RAW](decrypt_raw.md) , [TRY_DECRYPT_RAW](try_decrypt_raw.md)

## Syntax

```sqlsyntax
TRY_DECRYPT( <value_to_decrypt> , <passphrase> ,
         [ [ <additional_authenticated_data> , ] <encryption_method> ]
       )
```

## Arguments

**Required:**

`value_to_decrypt`
:   The BINARY value to decrypt.

`passphrase`
:   The passphrase to use to encrypt/decrypt the data. The passphrase is a VARCHAR.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the [ENCRYPT](encrypt.md) function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

## Returns

Returns the decrypted value as a BINARY value or a NULL value if any runtime
error occurs during decryption.

## Usage notes and examples

See the [DECRYPT](decrypt.md) function for the usage notes and examples.

---
title: TRY_DECRYPT_RAW
source: https://docs.snowflake.com/en/sql-reference/functions/try_decrypt_raw.md
section: SQL Functions
---

Categories:
:   [Encryption functions](../functions-encryption.md)

# TRY_DECRYPT_RAW

A special version of [DECRYPT_RAW](decrypt_raw.md) that returns a NULL
value if an error occurs during decryption.

See also:
:   [ENCRYPT](encrypt.md) , [ENCRYPT_RAW](encrypt_raw.md) , [DECRYPT](decrypt.md) , [TRY_DECRYPT](try_decrypt.md) , [DECRYPT_RAW](decrypt_raw.md)

## Syntax

```sqlsyntax
TRY_DECRYPT_RAW( <value_to_decrypt> , <key> , <iv> ,
         [ [ [ <additional_authenticated_data> , ] <encryption_method> , ] <aead_tag> ]
       )
```

## Arguments

**Required:**

`value_to_decrypt`
:   The binary value to decrypt.

`key`
:   The key to use to encrypt/decrypt the data. The key must be a BINARY value. The key can be any value as long as the
    length is correct. For example, for AES128, the key must be 128 bits (16 bytes), and for AES256, the key must be
    256 bits (32 bytes).

    The key used to encrypt the value must be used to decrypt the value.

`iv`
:   This parameter contains the Initialization Vector (IV) to use to encrypt and decrypt this piece of
    data. The IV must be a BINARY value of a specific length:

    * For GCM, this field must be 96 bits (12 bytes). While the GCM encryption method allows this field to be a different
      size, Snowflake currently only supports 96 bits.
    * For CCM, this should be 56 bits (7 bytes).
    * For ECB, this parameter is unneeded.
    * For all other supported encryption modes, this should be 128 bits (16 bytes).

    This value is used to initialize the first encryption round. You should never use the same IV and key combination
    more than once, especially for encryption modes like GCM.

    If this parameter is set to NULL, the implementation will choose a new pseudo-random IV during each call.

**Optional:**

`additional_authenticated_data`
:   Additional authenticated data (AAD) is additional data whose confidentiality and authenticity is assured during the
    decryption process. However, this AAD is not encrypted and is not included as a field in the returned value from the
    ENCRYPT or ENCRYPT_RAW function.

    If AAD is passed to the encryption function (ENCRYPT or ENCRYPT_RAW), then the same AAD must be passed to the
    decryption function (DECRYPT or DECRYPT_RAW). If the AAD passed to the decryption function does not match the
    AAD passed to the encryption function, then decryption fails.

    The difference between the AAD and the `passphrase` is that the passphrase is intended to be kept
    secret (otherwise, the encryption is essentially worthless) while the AAD can be left public. The AAD helps
    authenticate that a public piece of information and an encrypted value are associated with each other. The
    examples section in the [ENCRYPT](encrypt.md) function includes an example showing the behavior
    when the AAD matches and the behavior when it doesn’t match.

    For ENCRYPT_RAW and DECRYPT_RAW, the data type of the AAD should be BINARY.
    For ENCRYPT and DECRYPT, the data type of the AAD can be either VARCHAR or BINARY, and does not need to match
    the data type of the value that was encrypted.

    AAD is supported only by AEAD-enabled encryption modes like GCM (default).

`encryption_method`
:   This string specifies the method to use for encrypting/decrypting the data. This string contains subfields:

    ```none
    <algorithm>-<mode> [ /pad: <padding> ]
    ```

    The `algorithm` is currently limited to:

    > * `'AES'`: When a passphrase is passed (e.g. to ENCRYPT), the function uses AES-256 encryption (256 bits). When a key
    >   is passed (e.g. to ENCRYPT_RAW), the function uses 128, 192, or 256-bit encryption, depending upon the key
    >   length.

    The `algorithm` is case-insensitive.

    The `mode` specifies which block cipher mode should be used to encrypt messages.
    The following table shows which modes are supported, and which of those modes support padding:

    | Mode | Padding | Description |
    | --- | --- | --- |
    | `'ECB'` | Yes | Encrypt every block individually with the key. This mode is generally discouraged and is included only for compatibility with external implementations. |
    | `'CBC'` | Yes | The encrypted block is XORed with the previous block. |
    | `'GCM'` | No | Galois/Counter Mode is a high-performance encryption mode that is AEAD-enabled. AEAD additionally assures the authenticity and confidentiality of the encrypted data by generating an AEAD tag. Moreover, AEAD supports AAD (additional authenticated data). |
    | `'CTR'` | No | Counter mode. |
    | `'OFB'` | No | Output feedback. The ciphertext is XORed with the plaintext of a block. |
    | `'CFB'` | No | Cipher feedback is a combination of OFB and CBC. |

    The `mode` is case-insensitive.

    The `padding` specifies how to pad messages whose length is not a multiple of the block size. Padding is
    applicable only for ECB and CBC modes; padding is ignored for other modes. The possible values for padding are:

    > * `'PKCS'`: Uses PKCS5 for block padding.
    > * `'NONE'`: No padding. The user needs to take care of the padding when using ECB or CBC mode.

    The `padding` is case-insensitive.

    Default setting: `'AES-GCM'`.

    If the `mode` is not specified, GCM is used.

    If the `padding` is not specified, PKCS is used.

`aead_tag`
:   This BINARY value is needed for AEAD-enabled decryption modes to check the authenticity and confidentiality of the
    encrypted data. Use the AEAD tag that was returned by the ENCRYPT_RAW function. An example below shows how to
    access and use this value.

## Returns

The function returns the decrypted value or a NULL value if any runtime error occurs during decryption. The data type of the
returned value is BINARY.

## Usage notes and examples

See the [DECRYPT_RAW](decrypt_raw.md) function for the usage notes and examples.

---
title: TRY_HEX_DECODE_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/try_hex_decode_binary.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# TRY_HEX_DECODE_BINARY

A special version of [HEX_DECODE_BINARY](hex_decode_binary.md) that
returns a NULL value if an error occurs during decoding.

## Syntax

```sqlsyntax
TRY_HEX_DECODE_BINARY(<input>)
```

## Arguments

`input`
:   A string expression containing only hexadecimal digits. Typically, this
    input string is generated by calling the function
    [HEX_ENCODE](hex_encode.md).

## Returns

A `BINARY` value that can, for example, be inserted into a column of type
`BINARY`.

## Examples

This shows how to use the function `TRY_HEX_DECODE_BINARY` (note that
the function is used in the `INSERT` statement to decode into
a BINARY field; the function is not used in the `SELECT` statement):

> Create a table and data:
>
> > ```sqlexample
> > CREATE TABLE hex (v VARCHAR, b BINARY);
> > INSERT INTO hex (v, b)
> >    SELECT 'ABab',
> >      -- Convert string -> hex-encoded string -> binary.
> >      TRY_HEX_DECODE_BINARY(HEX_ENCODE('ABab'));
> > ```
>
> Now run a query to show that we can retrieve the data intact:
>
> > ```sqlexample
> > SELECT v, b,
> >     -- Convert binary -> hex-encoded-string -> string.
> >     TRY_HEX_DECODE_STRING(TO_VARCHAR(b))
> >   FROM hex;
> > ```
>
> Output:
>
> > ```sqlexample
> > +------+----------+--------------------------------------+
> > | V    | B        | TRY_HEX_DECODE_STRING(TO_VARCHAR(B)) |
> > |------+----------+--------------------------------------|
> > | ABab | 41426162 | ABab                                 |
> > +------+----------+--------------------------------------+
> > ```

---
title: TRY_HEX_DECODE_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/try_hex_decode_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Encoding/Decoding)

# TRY_HEX_DECODE_STRING

A special version of [HEX_DECODE_STRING](hex_decode_string.md) that
returns a NULL value if an error occurs during decoding.

## Syntax

```sqlsyntax
TRY_HEX_DECODE_STRING(<input>)
```

## Arguments

`input`
:   A hex-encoded string expression. Typically the input was created by a
    call to [HEX_ENCODE](hex_encode.md).

## Returns

The returned value is a string (VARCHAR).

## Examples

This shows how to use the function:

> Create a table and data:
>
> > ```sqlexample
> > CREATE TABLE hex (v VARCHAR, hex_string VARCHAR, garbage VARCHAR);
> > INSERT INTO hex (v, hex_string, garbage)
> >   SELECT 'AaBb', HEX_ENCODE('AaBb'), '127';
> > ```
>
> Now run the query:
>
> > ```sqlexample
> > SELECT v, hex_string, TRY_HEX_DECODE_STRING(hex_string), TRY_HEX_DECODE_STRING(garbage) FROM hex;
> > ```
>
> Output:
>
> > ```sqlexample
> > +------+------------+-----------------------------------+--------------------------------+
> > | V    | HEX_STRING | TRY_HEX_DECODE_STRING(HEX_STRING) | TRY_HEX_DECODE_STRING(GARBAGE) |
> > |------+------------+-----------------------------------+--------------------------------|
> > | AaBb | 41614262   | AaBb                              | NULL                           |
> > +------+------------+-----------------------------------+--------------------------------+
> > ```

---
title: TRY_PARSE_JSON
source: https://docs.snowflake.com/en/sql-reference/functions/try_parse_json.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Parsing)

# TRY_PARSE_JSON

A special version of [PARSE_JSON](parse_json.md) that
returns a NULL value if an error occurs during parsing.

## Syntax

```sqlsyntax
TRY_PARSE_JSON( <expr> [ , '<parameter>' ] )
```

## Arguments

**Required:**

`expr`
:   An expression of string type (for example, VARCHAR) that holds valid JSON information.

**Optional:**

`'parameter'`
:   String constant that specifies the parameter used to search for matches. Supported values:

    | Parameter | Description |
    | --- | --- |
    | `d` | Allow duplicate keys in JSON objects. If a JSON object contains a duplicate key, the returned object has a single instance of that key with the last value specified for that key. |
    | `s` | Don’t allow duplicate keys in JSON objects (strict). This value is the default. |

## Returns

Returns a value of type VARIANT that contains a JSON document.

If the input is NULL or if an error occurs during parsing, the function returns NULL.

This function doesn’t return a [structured type](../data-types-structured.md).

## Usage notes

See [PARSE_JSON](parse_json.md) for the usage notes.

## Examples

This shows an example of storing different types of data in a VARIANT column by calling TRY_PARSE_JSON to parse
strings that contain values that can be parsed as JSON:

Create and fill a table.

```sqlexample
CREATE OR REPLACE TEMPORARY TABLE vartab (ID INTEGER, v VARCHAR);

INSERT INTO vartab (id, v) VALUES
  (1, '[-1, 12, 289, 2188, FALSE,]'),
  (2, '{ "x" : "abc", "y" : FALSE, "z": 10} '),
  (3, '{ "bad" : "json", "missing" : TRUE, "close_brace": 10 ');
```

Query the data, using TRY_PARSE_JSON. Note that the value for the third line is NULL. If the query used
PARSE_JSON rather than TRY_PARSE_JSON, it would fail.

```sqlexample
SELECT ID, TRY_PARSE_JSON(v)
  FROM vartab
  ORDER BY ID;
```

```output
+----+-------------------+
| ID | TRY_PARSE_JSON(V) |
|----+-------------------|
|  1 | [                 |
|    |   -1,             |
|    |   12,             |
|    |   289,            |
|    |   2188,           |
|    |   false,          |
|    |   undefined       |
|    | ]                 |
|  2 | {                 |
|    |   "x": "abc",     |
|    |   "y": false,     |
|    |   "z": 10         |
|    | }                 |
|  3 | NULL              |
+----+-------------------+
```

See [PARSE_JSON](parse_json.md) for more examples.

---
title: TRY_TO_BINARY
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_binary.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_BINARY

A special version of [TO_BINARY](to_binary.md) that performs the same operation (i.e. converts an input expression to a binary value),
but with error handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error).

For more information, see:

* [Error-handling conversion functions](../functions-conversion.md).
* [TO_BINARY](to_binary.md).
* [Binary input and output](../binary-input-output.md).

## Syntax

```sqlsyntax
TRY_TO_BINARY( <string_expr> [, '<format>'] )
```

## Arguments

**Required:**

`string_expr`
:   A string expression.

**Optional:**

`format`
:   The binary format for conversion: HEX, BASE64, or UTF-8 (see [Binary input and output](../binary-input-output.md)). The default is the value of the
    BINARY_INPUT_FORMAT session parameter. If this parameter is not set, the
    default is HEX.

## Returns

Returns a value of type BINARY.

## Usage notes

* Only works for string expressions.
* If `format` is specified but is not HEX, BASE64, or UTF-8, the result will be a NULL value.

## Examples

This shows how to use the `TRY_TO_BINARY` function when loading
hex-encoded strings into a BINARY column:

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE strings (v VARCHAR, hex_encoded_string VARCHAR, b BINARY);
> > INSERT INTO strings (v) VALUES
> >     ('01'),
> >     ('A B'),
> >     ('Hello'),
> >     (NULL);
> > UPDATE strings SET hex_encoded_string = HEX_ENCODE(v);
> > UPDATE strings SET b = TRY_TO_BINARY(hex_encoded_string, 'HEX');
> > ```
>
> Query the table, calling TRY_TO_BINARY():
>
> > ```sqlexample
> > SELECT v, hex_encoded_string, TO_VARCHAR(b, 'UTF-8')
> >   FROM strings
> >   ORDER BY v
> >   ;
> > +-------+--------------------+------------------------+
> > | V     | HEX_ENCODED_STRING | TO_VARCHAR(B, 'UTF-8') |
> > |-------+--------------------+------------------------|
> > | 01    | 3031               | 01                     |
> > | A B   | 412042             | A B                    |
> > | Hello | 48656C6C6F         | Hello                  |
> > | NULL  | NULL               | NULL                   |
> > +-------+--------------------+------------------------+
> > ```

---
title: TRY_TO_BOOLEAN
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_boolean.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_BOOLEAN

A special version of [TO_BOOLEAN](to_boolean.md) that performs the same operation
(that is, converts an input expression to a Boolean value), but with error-handling
support. If the conversion can’t be performed, TRY_TO_BOOLEAN returns a NULL value
instead of raising an error.

For more information, see [Error-handling conversion functions](../functions-conversion.md).

## Syntax

```sqlsyntax
TRY_TO_BOOLEAN( <string_expr> )
```

## Arguments

`string_expr`
:   A string expression that can be evaluated to a BOOLEAN value.

## Returns

This function returns a value of type [BOOLEAN](../data-types-logical.md).

## Usage notes

The input argument must be a string expression. The function evaluates the string expression
in the following way:

* `'true'`, `'t'`, `'yes'`, `'y'`, `'on'`, `'1'` return TRUE.
* `'false'`, `'f'`, `'no'`, `'n'`, `'off'`, `'0'` return FALSE.
* All other strings return NULL.

The evaluations of the strings are case-insensitive.

## Examples

This example uses the TRY_TO_BOOLEAN function:

```sqlexample
SELECT TRY_TO_BOOLEAN('True')  AS "T",
       TRY_TO_BOOLEAN('False') AS "F",
       TRY_TO_BOOLEAN('Not valid')  AS "N";
```

```output
+------+-------+------+
| T    | F     | N    |
|------+-------+------|
| True | False | NULL |
+------+-------+------+
```

For more examples, see [TO_BOOLEAN](to_boolean.md).

---
title: TRY_TO_DATE
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_date.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md) , [Date & time functions](../functions-date-time.md)

# TRY_TO_DATE

A special version of the [TO_DATE](to_date.md) function
that performs the same operation (i.e. converts an input expression to a date), but
with error-handling support (i.e. if the conversion cannot be performed, it returns a
NULL value instead of raising an error).

For more information, see [Error-handling conversion functions](../functions-conversion.md).

See also:
:   [TO_DATE , DATE](to_date.md)

## Syntax

```sqlsyntax
TRY_TO_DATE( <string_expr> [, <format> ] )
TRY_TO_DATE( '<integer>' )
```

## Arguments

**Required:**

One of:

> `string_expr`
> :   String from which to extract a date. For example: `'2024-01-31'`.
>
> `'integer'`
> :   An expression that evaluates to a string containing an integer. For example: `'15000000'`. Depending
>     on the magnitude of the string, it can be interpreted as seconds, milliseconds, microseconds, or
>     nanoseconds. For details, see the Usage notes for this function.

**Optional:**

`format`
:   Date format specifier for `string_expr` or
    [AUTO](../date-time-input-output.md),
    which specifies that Snowflake should automatically detect the format to use. For more information,
    see [Date and time formats in conversion functions](../functions-conversion.md).

    The default is the current value of the [DATE_INPUT_FORMAT](../parameters.md)
    session parameter (default `AUTO`).

## Returns

The data type of the returned value is DATE.

## Usage notes

* The display format for dates in the output is determined by the [DATE_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `YYYY-MM-DD`).
* If the format of the input parameter is a string that contains an integer:

  + After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
    microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

    - If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
      a number of seconds.
    - If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
      as milliseconds.
    - If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
      treated as microseconds.
    - If the value is greater than or equal to 31536000000000000, then the value is
      treated as nanoseconds.
  + If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
    one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
    nanoseconds.

## Examples

The following example uses the TRY_TO_DATE function:

```sqlexample
SELECT
  TRY_TO_DATE('2024-05-10') AS valid_date,
  TRY_TO_DATE('Invalid') AS invalid_date;
```

```output
+------------+--------------+
| VALID_DATE | INVALID_DATE |
|------------+--------------|
| 2024-05-10 | NULL         |
+------------+--------------+
```

See [TO_DATE , DATE](to_date.md) for examples that convert an input expression to a date.

---
title: TRY_TO_DECFLOAT
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_decfloat.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_DECFLOAT

A special version of [TO_DECFLOAT](to_decfloat.md) that performs the same
operation — that is, converts an input expression to a [DECFLOAT](../data-types-numeric.md) —
but with error-handling support. If the conversion can’t be performed, it returns a NULL
value instead of raising an error.

For more information, see [Error-handling conversion functions](../functions-conversion.md).

## Syntax

```sqlsyntax
TRY_TO_DECFLOAT( <string_expr> [ , '<format>' ] )
```

## Arguments

**Required:**

`expr`
:   An expression of a numeric, character, or Boolean type.

**Optional:**

`'format'`
:   If the expression evaluates to a string, the function accepts
    an optional format model. For more information, see
    [SQL format models](../sql-format-models.md). The format model
    specifies the format of the input string, not the format of the
    output value.

## Usage notes

The special values `'NaN'` (not a number), `'inf'` (infinity),
and `'-inf'` (negative infinity) aren’t supported.

## Returns

This function returns a value of DECFLOAT data type.

If there is a conversion error, the function returns NULL.

## Examples

This example uses the TRY_TO_DECFLOAT function:

```sqlexample
SELECT TRY_TO_DECFLOAT('3.1415926'), TRY_TO_DECFLOAT('Invalid');
```

```output
+------------------------------+----------------------------+
| TRY_TO_DECFLOAT('3.1415926') | TRY_TO_DECFLOAT('INVALID') |
|------------------------------+----------------------------|
| 3.1415926                    | NULL                       |
+------------------------------+----------------------------+
```

For additional examples, see [TO_DECFLOAT](to_decfloat.md).

---
title: TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_decimal.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC

A special version of [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](to_decimal.md) that performs the same operation
of converting an input expression to a fixed-point number, but has error-handling support so that
the function returns NULL if the conversion can’t be performed.

These functions are synonymous.

For more information, see [Error-handling conversion functions](../functions-conversion.md).

## Syntax

```sqlsyntax
TRY_TO_DECIMAL( <string_expr> [, '<format>' ] [, <precision> [, <scale> ] ] )

TRY_TO_NUMBER( <string_expr> [, '<format>' ] [, <precision> [, <scale> ] ] )

TRY_TO_NUMERIC( <string_expr> [, '<format>' ] [, <precision> [, <scale> ] ] )
```

## Arguments

**Required:**

`string_expr`
:   An expression of type VARCHAR.

**Optional:**

`format`
:   The SQL format model used to parse the input `expr` and return. For more
    information, see [SQL format models](../sql-format-models.md).

`precision`
:   The maximal number of decimal digits in the resulting number; from 1
    to 38. In Snowflake, precision is not used to determine the
    number of bytes that are needed to store the number and doesn’t have any effect
    on efficiency, so the default is the maximum (38).

`scale`
:   The number of fractional decimal digits (from 0 to `precision` - 1).
    0 indicates no fractional digits (i.e. an integer number). The default scale
    is 0.

## Returns

The function returns a value of type NUMBER with the following defaults:

* If the `precision` isn’t specified, then it defaults to 38.
* If the `scale` isn’t specified, then it defaults to 0.

If the conversion can’t be performed or the input is NULL, returns NULL.

## Usage notes

The input must be a string expression.

## Examples

The following example fails because the last column (`dec_with_range_error`)
doesn’t store enough significant digits to hold the value that it is asked
to hold:

```sqlexample
SELECT column1 AS orig_string,
       TO_DECIMAL(column1) AS dec,
       TO_DECIMAL(column1, 10, 2) AS dec_with_scale,
       TO_DECIMAL(column1, 4, 2) AS dec_with_range_err
  FROM VALUES ('345.123');
```

```output
100039 (22003): Numeric value '345.123' is out of range
```

The following query is the same as the preceding query, except that it uses
TRY_TO_DECIMAL rather than TO_DECIMAL, so it converts the
out-of-range value to NULL:

```sqlexample
SELECT column1 AS orig_string,
       TRY_TO_DECIMAL(column1) AS dec,
       TRY_TO_DECIMAL(column1, 10, 2) AS dec_with_scale,
       TRY_TO_DECIMAL(column1, 4, 2) AS dec_with_range_err
  FROM VALUES ('345.123');
```

```output
+-------------+-----+----------------+--------------------+
| ORIG_STRING | DEC | DEC_WITH_SCALE | DEC_WITH_RANGE_ERR |
|-------------+-----+----------------+--------------------|
| 345.123     | 345 |         345.12 |               NULL |
+-------------+-----+----------------+--------------------+
```

The following example fails because the input string contains a dollar sign (`$`) and
a comma to separate groups of digits, not just digits and decimal points. However,
the format specifier for the last column doesn’t tell the TO_DECIMAL function
to expect the dollar sign and comma:

```sqlexample
SELECT column1 AS orig_string,
       TO_DECIMAL(column1, '$9,999.00') AS num,
       TO_DECIMAL(column1, '$9,999.00', 6, 2) AS num_with_scale,
       TO_DECIMAL(column1, 6, 2) AS num_with_format_err
  FROM VALUES ('$7,543.21');
```

```output
100038 (22018): Numeric value '$7,543.21' is not recognized
```

The following query is the same as the preceding query, except that it uses
TRY_TO_DECIMAL rather than TO_DECIMAL, so it converts the input
to NULL:

```sqlexample
SELECT column1 AS orig_string,
       TRY_TO_DECIMAL(column1, '$9,999.00') AS num,
       TRY_TO_DECIMAL(column1, '$9,999.00', 6, 2) AS num_with_scale,
       TRY_TO_DECIMAL(column1, 6, 2) AS num_with_format_err
  FROM VALUES ('$7,543.21');
```

```output
+-------------+------+----------------+---------------------+
| ORIG_STRING |  NUM | NUM_WITH_SCALE | NUM_WITH_FORMAT_ERR |
|-------------+------+----------------+---------------------|
| $7,543.21   | 7543 |        7543.21 |                NULL |
+-------------+------+----------------+---------------------+
```

The following example fails because the input expression contains characters that aren’t digits:

```sqlexample
SELECT column1 AS orig_string,
       TO_DECIMAL(column1) AS num
  FROM VALUES ('aaa');
```

```output
100038 (22018): Numeric value 'aaa' is not recognized
```

The following query is the same as the preceding query, except that it uses TRY_TO_DECIMAL rather than TO_DECIMAL,
so it converts the input to NULL:

```sqlexample
SELECT column1 AS orig_string,
       TRY_TO_DECIMAL(column1) AS num
  FROM VALUES ('aaa');
```

```output
+-------------+------+
| ORIG_STRING | NUM  |
|-------------+------|
| aaa         | NULL |
+-------------+------+
```

You can perform the conversion if you specify the [X format element](../sql-format-models.md)
with the TO_DECIMAL or TRY_TO_DECIMAL function to convert a hexadecimal value to a decimal value:

```sqlexample
SELECT column1 AS orig_string,
       TO_DECIMAL(column1, 'XXX') AS to_decimal_num,
       TRY_TO_DECIMAL(column1, 'XXX') AS try_to_decimal_num
  FROM VALUES ('aaa');
```

```output
+-------------+----------------+--------------------+
| ORIG_STRING | TO_DECIMAL_NUM | TRY_TO_DECIMAL_NUM |
|-------------+----------------+--------------------|
| aaa         |           2730 |               2730 |
+-------------+----------------+--------------------+
```

For additional examples, see [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](to_decimal.md).

---
title: TRY_TO_DOUBLE
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_double.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_DOUBLE

A special version of [TO_DOUBLE](to_double.md) that performs the same operation (that is,
converts an input expression to a double-precision floating-point number), but
with error-handling support (that is, if the conversion can’t be performed, it
returns a NULL value instead of raising an error).

For more information, see [Error-handling conversion functions](../functions-conversion.md).

## Syntax

```sqlsyntax
TRY_TO_DOUBLE( <string_expr> [, '<format>' ] )
```

## Arguments

`expr`
:   An expression of a character type.

`format`
:   If the expression evaluates to a string, then the function accepts
    an optional format model. Format models are described at
    [SQL format models](../sql-format-models.md). The format model
    specifies the format of the input string, not the format of the
    output value.

## Usage notes

* The function only accepts string expressions.
* Strings are converted as decimal integer or fractional numbers,
  scientific notation and special values (**nan**, **inf**, **infinity**)
  are accepted.

## Returns

This function returns a value of FLOAT data type.

If there is a conversion error, the function returns NULL.

## Examples

This example uses the TRY_TO_DOUBLE function:

```sqlexample
SELECT TRY_TO_DOUBLE('3.1415926'), TRY_TO_DOUBLE('Invalid');
```

```output
+----------------------------+--------------------------+
| TRY_TO_DOUBLE('3.1415926') | TRY_TO_DOUBLE('INVALID') |
|----------------------------+--------------------------|
|                  3.1415926 |                     NULL |
+----------------------------+--------------------------+
```

For additional examples, see [TO_DOUBLE](to_double.md).

---
title: TRY_TO_FILE
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_file.md
section: SQL Functions
---

Categories:
:   [File functions](../functions-file.md) (AI Functions)

# TRY_TO_FILE

A version of [TO_FILE](to_file.md) that returns NULL instead of raising an error.

## Syntax

Use one of the following:

```
TRY_TO_FILE( <stage_name>, <relative_path> )

TRY_TO_FILE( <file_url> )

TRY_TO_FILE( <metadata> )
```

## Arguments

Specify the file by providing:

* Both `stage_name` and `relative_path`
* `file_url`
* `metadata`

Only one of these methods can be used at a time.

`stage_name`
:   The name of the stage where the file is located, as a string, in the form `'@stage_name'`.

`relative_path`
:   The path to the file on the stage specified by `stage_name` as a string.

`file_url`
:   A valid stage or scoped file URL as a string.

`metadata`
:   An OBJECT containing the required FILE attributes. A FILE must have CONTENT_TYPE, SIZE, ETAG, and LAST_MODIFIED fields.
    It must also specify the file’s location in one of the following ways:

    * Both STAGE and RELATIVE_PATH
    * STAGE_FILE_URL
    * SCOPED_FILE_URL

## Returns

A [FILE](../data-types-unstructured.md), or NULL.

## Usage notes

Returns NULL when:

* The supplied URL is not validL.
* The file is on a stage that the user lacks privileges to access.
* The supplied metadata doesn’t contain the required FILE fields.

## Examples

Unlike TO_FILE, which raises an error on invalid arguments, TRY_TO_FILE returns NULL in this situation.
It otherwise works exactly like [TO_FILE](to_file.md).

The example below illustrates the behavior of TRY_TO_FILE when given an invalid file path, assuming that
the file `image.png` exists on the stage but the other two files do not.

```sqlexample
SELECT
    TRY_TO_FILE('@mystage/image.png'),
    TRY_TO_FILE('@mystage/incorrect_file1.jpg'),
    TRY_TO_FILE('@mystage', 'incorrect_file2.png');
```

Result:

```output
+-----------------------------------------------------+---------------------------------------------+------------------------------------------------+
| TRY_TO_FILE('@MYSTAGE/IMAGE.PNG')                   | TRY_TO_FILE('@MYSTAGE/INCORRECT_FILE1.JPG') | TRY_TO_FILE('@MYSTAGE', 'INCORRECT_FILE2.PNG') |
|-----------------------------------------------------|---------------------------------------------|------------------------------------------------|
| {                                                   | NULL                                        | NULL                                           |
|   "CONTENT_TYPE": "image/png",                      |                                             |                                                |
|   "ETAG": "2859efde6e26491810f619668280a2ce",       |                                             |                                                |
|   "LAST_MODIFIED": "Thu, 18 Sep 2025 09:02:00 GMT", |                                             |                                                |
|   "RELATIVE_PATH": "image.png",                     |                                             |                                                |
|   "SIZE": 23698,                                    |                                             |                                                |
|   "STAGE": "@MYDB.MYSCHEMA.MYSTAGE"                 |                                             |                                                |
| }                                                   |                                             |                                                |
+-----------------------------------------------------+---------------------------------------------+------------------------------------------------+
```

For more examples of creating FILE objects from valid inputs, see [TO_FILE examples](to_file.md).

---
title: TRY_TO_GEOGRAPHY
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_geography.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# TRY_TO_GEOGRAPHY

Parses an input and returns a value of type [GEOGRAPHY](../data-types-geospatial.md).

This function is identical to [TO_GEOGRAPHY](to_geography.md) except that it returns
NULL when TO_GEOGRAPHY would return an error.

See also:
:   [TO_GEOGRAPHY](to_geography.md)

## Syntax

Use one of the following:

```sqlsyntax
TRY_TO_GEOGRAPHY( <varchar_expression> [ , <allow_invalid> ] )

TRY_TO_GEOGRAPHY( <binary_expression> [ , <allow_invalid> ] )

TRY_TO_GEOGRAPHY( <variant_expression> [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression that represents a valid geometric object in one of the following formats:

    * WKT (well-known text).
    * WKB (well-known binary) in hexadecimal format (without a leading `0x`).
    * EWKT (extended well-known text).
    * EWKB (extended well-known binary) in hexadecimal format (without a leading `0x`).
    * GeoJSON.

`binary_expression`
:   The argument must be a binary expression in WKB or EWKB format.

`variant_expression`
:   The argument must be an OBJECT in GeoJSON format.

**Optional:**

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type GEOGRAPHY.

## Usage notes

* Returns NULL if the input cannot be parsed as the appropriate supported format (WKT, WKB, EWKT, EWKB, GeoJSON).
* Returns NULL if the input format is EWKT or EWKB and the SRID is not 4326.
  See the [note on EWKT and EWKB handling](../data-types-geospatial.md).

* For the coordinates in WKT, EWKT, and GeoJSON, longitude appears before latitude (for example, `POINT(lon lat)`).

## Examples

This shows a simple use of the TRY_TO_GEOGRAPHY function with VARCHAR data:

> ```sqlexample
> select TRY_TO_GEOGRAPHY('Not a valid input for this data type.');
> +-----------------------------------------------------------+
> | TRY_TO_GEOGRAPHY('NOT A VALID INPUT FOR THIS DATA TYPE.') |
> |-----------------------------------------------------------|
> | NULL                                                      |
> +-----------------------------------------------------------+
> ```

---
title: TRY_TO_GEOMETRY
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_geometry.md
section: SQL Functions
---

Categories:
:   [Geospatial functions](../functions-geospatial.md), [Conversion functions](../functions-conversion.md)

# TRY_TO_GEOMETRY

Parses an input and returns a value of type [GEOMETRY](../data-types-geospatial.md).

This function is identical to [TO_GEOMETRY](to_geometry.md) except that it returns NULL
when TO_GEOMETRY would return an error.

See also:
:   [TO_GEOMETRY](to_geometry.md)

## Syntax

Use one of the following:

```sqlsyntax
TRY_TO_GEOMETRY( <varchar_expression> [ , <srid> ] [ , <allow_invalid> ] )

TRY_TO_GEOMETRY( <binary_expression> [ , <srid> ] [ , <allow_invalid> ] )

TRY_TO_GEOMETRY( <variant_expression> [ , <srid> ] [ , <allow_invalid> ] )
```

## Arguments

**Required:**

`varchar_expression`
:   The argument must be a string expression that represents a valid geometric object in one of the following formats:

    * WKT (well-known text).
    * WKB (well-known binary) in hexadecimal format (without a leading `0x`).
    * EWKT (extended well-known text).
    * EWKB (extended well-known binary) in hexadecimal format (without a leading `0x`).
    * GeoJSON.

`binary_expression`
:   The argument must be a binary expression in WKB or EWKB format.

`variant_expression`
:   The argument must be an OBJECT in GeoJSON format.

**Optional:**

`srid`
:   The integer value of the SRID to use.

`allow_invalid`
:   If TRUE, specifies that the function returns a GEOGRAPHY or GEOMETRY object, even when the input shape isn’t valid and
    can’t be repaired. For more information, see [Specifying how invalid geospatial shapes are handled](../data-types-geospatial.md).

## Returns

The function returns a value of type GEOMETRY or NULL when TO_GEOMETRY would return an error.

## Usage notes

* Returns NULL if the input can’t be parsed as the appropriate supported format (WKT, WKB, EWKT, EWKB, GeoJSON).
* For GeoJSON, WKT, and WKB input, if the `srid` argument is not specified, the resulting GEOMETRY object has the SRID
  set to 0.

## Examples

This shows a simple use of the TRY_TO_GEOMETRY function with VARCHAR data:

```sqlexample
SELECT TRY_TO_GEOMETRY('INVALID INPUT');
```

```none
+----------------------------------+
| TRY_TO_GEOMETRY('INVALID INPUT') |
|----------------------------------|
| NULL                             |
+----------------------------------+
```

Create a temporary table and insert rows with GEOMETRY values:

```sqlexample
CREATE OR REPLACE TEMP TABLE demo_to_geometry AS
SELECT
  1                                                     AS id,
  'POINT(10 20)'                                        AS wkt_col,         -- VARCHAR (WKT)
  'SRID=32633;POINT(500000.0 4649776.22)'               AS ewkt_col,        -- VARCHAR (EWKT)
  ST_ASWKB(TO_GEOMETRY('LINESTRING(0 0, 1 1)'))         AS wkb_bin_col,     -- BINARY (WKB)
  PARSE_JSON('{"type":"Point","coordinates":[10,20]}')  AS geojson_col,     -- VARIANT (GeoJSON)
  TO_GEOGRAPHY('POINT(-122.35 37.55)')                  AS geog_col,        -- GEOGRAPHY
  'POLYGON((0 0,2 2,2 0,0 2,0 0))'                      AS invalid_wkt_col, -- invalid shape
  0                                                     AS srid0,           -- SRID columns to show positional args
  3857                                                  AS srid_col,
  TRUE                                                  AS allow_true,      -- allow_invalid flags from columns
  FALSE                                                 AS allow_false
UNION ALL
SELECT
  2,
  'LINESTRING(0 0, 10 10)',
  'SRID=32633;POINT(389866.35 5819003.03)',
  ST_ASWKB(TO_GEOMETRY('POINT(2 3)')),
  PARSE_JSON('{"type":"LineString","coordinates":[[0,0],[1,1]]}'),
  TO_GEOGRAPHY('LINESTRING(-124.2 42,-120.01 41.99)'),
  'POLYGON((0 0,1 1,1 0,0 1,0 0))',
  0,
  3857,
  TRUE,
  FALSE;
```

This table has columns with data types that the TO_GEOMETRY function accepts as inputs in the following formats:

* VARCHAR (WKT/WKB and hex/EWKT/EWKB/GeoJSON)
* BINARY (WKB/EWKB)
* VARIANT (GeoJSON object)
* GEOGRAPHY

Optional `srid` and `allow_invalid` values can follow any of these formats. The [ST_ASWKB , ST_ASBINARY](st_aswkb.md) function
generates valid WKB BINARY values.

The following example tries to convert VARCHAR values in the `invalid_wkt_col` column to GEOMETRY values,
but the shapes aren’t valid:

```sqlexample
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT id, TRY_TO_GEOMETRY(invalid_wkt_col) AS g_or_null
  FROM demo_to_geometry;
```

```output
+----+-----------+
| ID | G_OR_NULL |
|----+-----------|
|  1 | NULL      |
|  2 | NULL      |
+----+-----------+
```

---
title: TRY_TO_TIME
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_time.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_TIME

A special version of [TO_TIME , TIME](to_time.md) that performs the same operation (i.e.
converts an input expression into a time), but with error-handling
support (i.e. if the conversion cannot be performed, it returns a NULL value
instead of raising an error).

For more information, see [Error-handling conversion functions](../functions-conversion.md).

See also:
:   [TO_TIME , TIME](to_time.md)

## Syntax

```sqlsyntax
TRY_TO_TIME( <string_expr> [, <format> ] )
TRY_TO_TIME( '<integer>' )
```

## Arguments

**Required:**

One of:

> `string_expr`
> :   A string that can be converted to a valid time.
>
> `'integer'`
> :   An expression that evaluates to a string containing an integer, for example `'15000000'`. Depending
>     on the magnitude of the string, it can be interpreted as seconds, milliseconds, microseconds, or
>     nanoseconds. For details, see the Usage Notes.

**Optional:**

`format`
:   Format specifier for `string_expr` or
    [AUTO](../date-time-input-output.md).
    For more information, see [Date and time formats in conversion functions](../functions-conversion.md).

    The default is the current value of the [TIME_INPUT_FORMAT](../parameters.md)
    session parameter (default AUTO).

## Returns

The data type of the returned value is TIME.

## Usage notes

* The display format for times in the output is determined by the [TIME_OUTPUT_FORMAT](../parameters.md)
  session parameter (default `HH24:MI:SS`).
* If the format of the input parameter is a string that contains an integer, the unit of measurement for the value (seconds,
  microseconds, milliseconds, or nanoseconds) is determined as follows:

  + After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
    microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

    - If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
      a number of seconds.
    - If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
      as milliseconds.
    - If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
      treated as microseconds.
    - If the value is greater than or equal to 31536000000000000, then the value is
      treated as nanoseconds.
  + If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
    one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
    nanoseconds.

## Examples

This example uses TRY_TO_TIME:

```sqlexample
SELECT TRY_TO_TIME('12:30:00'), TRY_TO_TIME('Invalid');
```

```output
+-------------------------+------------------------+
| TRY_TO_TIME('12:30:00') | TRY_TO_TIME('INVALID') |
|-------------------------+------------------------|
| 12:30:00                | NULL                   |
+-------------------------+------------------------+
```

See [TO_TIME , TIME](to_time.md) for examples that convert an input expression to a time.

---
title: TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_*
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_timestamp.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*

A special version of [TO_TIMESTAMP / TO_TIMESTAMP_\*](to_timestamp.md) that performs the same operation (i.e. converts an input expression into a timestamp), but with error-handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error).

For more information, see [Error-handling conversion functions](../functions-conversion.md).

> **Note:**
>
> TRY_TO_TIMESTAMP maps to one of the other timestamp functions, based on the
> [TIMESTAMP_TYPE_MAPPING](../parameters.md) session parameter. The parameter default
> is TIMESTAMP_NTZ so TRY_TO_TIMESTAMP maps to TRY_TO_TIMESTAMP_NTZ by default.

See also:
:   [TO_TIMESTAMP / TO_TIMESTAMP_\*](to_timestamp.md)

## Syntax

```sqlsyntax
timestampFunction ( <string_expr> [, <format> ] )
timestampFunction ( '<integer>' )
```

Where:

> ```sqlsyntax
> timestampFunction ::=
>     TRY_TO_TIMESTAMP | TRY_TO_TIMESTAMP_LTZ | TRY_TO_TIMESTAMP_NTZ | TRY_TO_TIMESTAMP_TZ
> ```

## Arguments

**Required:**

One of:

> `string_expr`
> :   A string that can be evaluated to a TIMESTAMP (TIMESTAMP_NTZ, TIMESTAMP_LTZ, or TIMESTAMP_TZ).
>
> `'integer'`
> :   An expression that evaluates to a string containing an integer, for example `'15000000'`. Depending
>     on the magnitude of the string, it can be interpreted as seconds, milliseconds, microseconds, or
>     nanoseconds. For details, see the Usage Notes.

**Optional:**

`format`
:   Format specifier for `string_expr` or
    [AUTO](../date-time-input-output.md).
    For more information, see [Date and time formats in conversion functions](../functions-conversion.md).

    The default is the current value of the [TIMESTAMP_INPUT_FORMAT](../parameters.md)
    session parameter (default AUTO).

## Returns

The data type of the returned value is one of the TIMESTAMP data
types. By default, the data type is TIMESTAMP_NTZ. You can change
this by setting the session parameter [TIMESTAMP_TYPE_MAPPING](../parameters.md).

## Usage notes

* For timestamps with time zones, the setting of the [TIMEZONE](../parameters.md) parameter affects the return value. The returned
  timestamp is in the time zone for the session.
* The display format for timestamps in the output is determined by the timestamp output format that corresponds with the
  function ([TIMESTAMP_OUTPUT_FORMAT](../parameters.md), [TIMESTAMP_LTZ_OUTPUT_FORMAT](../parameters.md), [TIMESTAMP_NTZ_OUTPUT_FORMAT](../parameters.md),
  or [TIMESTAMP_TZ_OUTPUT_FORMAT](../parameters.md)).
* If the format of the input parameter is a string that contains an integer:

  + After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
    microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

    - If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
      a number of seconds.
    - If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
      as milliseconds.
    - If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
      treated as microseconds.
    - If the value is greater than or equal to 31536000000000000, then the value is
      treated as nanoseconds.
  + If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
    one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
    nanoseconds.

* When you use the TO_TIMESTAMP_NTZ or TRY_TO_TIMESTAMP_NTZ function to convert a timestamp with time zone information, the time zone
  information is lost. If the timestamp is then converted back to a timestamp with time zone information (by using
  the TO_TIMESTAMP_TZ function for example), the time zone information is not recoverable.

## Examples

This example uses TRY_TO_TIMESTAMP:

```sqlexample
SELECT TRY_TO_TIMESTAMP('2024-01-15 12:30:00'), TRY_TO_TIMESTAMP('Invalid');
```

```output
+-----------------------------------------+-----------------------------+
| TRY_TO_TIMESTAMP('2024-01-15 12:30:00') | TRY_TO_TIMESTAMP('INVALID') |
|-----------------------------------------+-----------------------------|
| 2024-01-15 12:30:00.000                 | NULL                        |
+-----------------------------------------+-----------------------------+
```

See [TO_TIMESTAMP / TO_TIMESTAMP_\*](to_timestamp.md) for examples that convert an input expression to a timestamp.

---
title: TRY_TO_UUID
source: https://docs.snowflake.com/en/sql-reference/functions/try_to_uuid.md
section: SQL Functions
---

Categories:
:   [Conversion functions](../functions-conversion.md)

# TRY_TO_UUID

A special version of [TO_UUID](to_uuid.md) that performs the same operation
— that is, converts an input expression to a [UUID](../data-types-uuid.md) value —
but with error handling support. If the conversion can’t be performed, it returns a NULL value
instead of raising an error.

For more information, see the following topics:

* [Error-handling conversion functions](../functions-conversion.md)
* [TO_UUID](to_uuid.md)

## Syntax

```sqlsyntax
TRY_TO_UUID( <string_expr> )
```

## Arguments

`string_expr`
:   A string expression in UUID format.

## Returns

Returns a value of type [UUID](../data-types-uuid.md) or NULL when
TO_UUID would return an error.

## Examples

The following example returns NULL because the input string isn’t a UUID:

```sqlexample
SELECT TRY_TO_UUID('not a uuid');
```

```output
+--------------------------------------+
| TRY_TO_UUID('NOT A UUID')            |
|--------------------------------------|
| NULL                                 |
+--------------------------------------+
```

For examples that convert an input expression to a UUID value, see [TO_UUID](to_uuid.md).

---
title: TYPEOF
source: https://docs.snowflake.com/en/sql-reference/functions/typeof.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Type Predicates)

# TYPEOF

Returns the type of a value stored in a [VARIANT](../data-types-semistructured.md) column.

See also:
:   [IS_<object_type>](is.md) , [SYSTEM$TYPEOF](system_typeof.md)

## Syntax

```sqlsyntax
TYPEOF( <expr> )
```

## Arguments

`expr`
:   The argument can be a column name or a general expression of type VARIANT. If necessary, you can
    [cast](cast.md) the `expr` to a VARIANT.

## Returns

Returns a VARCHAR value that contains the data type of the input expression, such as BOOLEAN, DECIMAL, ARRAY,
OBJECT, and so on.

## Usage notes

* The returned string might be DECIMAL even if the input is an exact integer, due to optimizations that change the
  physical storage type of the input.

* This function doesn’t support a [structured type](../data-types-structured.md) as an input argument.

## Examples

Create and fill the `vartab` table. The INSERT statement uses the [PARSE_JSON](parse_json.md) function to insert
[VARIANT](../data-types-semistructured.md) values in the `v` column of the table.

```sqlexample
CREATE OR REPLACE TABLE vartab (n NUMBER(2), v VARIANT);

INSERT INTO vartab
  SELECT column1 AS n, PARSE_JSON(column2) AS v
    FROM VALUES (1, 'null'),
                (2, null),
                (3, 'true'),
                (4, '-17'),
                (5, '123.12'),
                (6, '1.912e2'),
                (7, '"Om ara pa ca na dhih"  '),
                (8, '[-1, 12, 289, 2188, false,]'),
                (9, '{ "x" : "abc", "y" : false, "z": 10} ')
       AS vals;
```

Query the data. The query uses the TYPEOF function to show the data types of
the values stored in the VARIANT column.

```sqlexample
SELECT n, v, TYPEOF(v)
  FROM vartab
  ORDER BY n;
```

```output
+---+------------------------+------------+
| N | V                      | TYPEOF(V)  |
|---+------------------------+------------|
| 1 | null                   | NULL_VALUE |
| 2 | NULL                   | NULL       |
| 3 | true                   | BOOLEAN    |
| 4 | -17                    | INTEGER    |
| 5 | 123.12                 | DECIMAL    |
| 6 | 1.912000000000000e+02  | DOUBLE     |
| 7 | "Om ara pa ca na dhih" | VARCHAR    |
| 8 | [                      | ARRAY      |
|   |   -1,                  |            |
|   |   12,                  |            |
|   |   289,                 |            |
|   |   2188,                |            |
|   |   false,               |            |
|   |   undefined            |            |
|   | ]                      |            |
| 9 | {                      | OBJECT     |
|   |   "x": "abc",          |            |
|   |   "y": false,          |            |
|   |   "z": 10              |            |
|   | }                      |            |
+---+------------------------+------------+
```

The following example uses the TYPEOF function to determine the data type of a value by
[casting](cast.md) the value to a VARIANT.

Create and populate a table:

```sqlexample
CREATE OR REPLACE TABLE typeof_cast(status VARCHAR, time TIMESTAMP);

INSERT INTO typeof_cast VALUES('check in', '2024-01-17 19:00:00.000 -0800');
```

Query the table using the TYPEOF function by casting each value to a VARIANT:

```sqlexample
SELECT status,
       TYPEOF(status::VARIANT) AS "TYPE OF STATUS",
       time,
       TYPEOF(time::VARIANT) AS "TYPE OF TIME"
  FROM typeof_cast;
```

```output
+----------+----------------+-------------------------+---------------+
| STATUS   | TYPE OF STATUS | TIME                    | TYPE OF TIME  |
|----------+----------------+-------------------------+---------------|
| check in | VARCHAR        | 2024-01-17 19:00:00.000 | TIMESTAMP_NTZ |
+----------+----------------+-------------------------+---------------+
```

---
title: UNICODE
source: https://docs.snowflake.com/en/sql-reference/functions/unicode.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General)

# UNICODE

Returns the Unicode code point for the first Unicode character in a string. If the string is empty, a value of `0` is returned.

See also:

> [ASCII](ascii.md) , [CHAR](chr.md)

## Syntax

```sqlsyntax
UNICODE( <input> )
```

## Arguments

`input`
:   The string for which the Unicode code point for the first character in the string is returned.

## Examples

This example demonstrates the function behavior for single ASCII and Unicode characters, as well as special cases, such as multi-character strings, empty strings,
and `NULL` values. It also demonstrates how the UNICODE and [CHAR](chr.md) functions interact:

```sqlexample
SELECT column1, UNICODE(column1), CHAR(UNICODE(column1))
FROM values('a'), ('\u2744'), ('cde'), (''), (null);

+---------+------------------+------------------------+
| COLUMN1 | UNICODE(COLUMN1) | CHAR(UNICODE(COLUMN1)) |
|---------+------------------+------------------------|
| a       |               97 | a                      |
| ❄       |            10052 | ❄                      |
| cde     |               99 | c                      |
|         |                0 |                        |
| NULL    |             NULL | NULL                   |
+---------+------------------+------------------------+
```

---
title: UNIFORM
source: https://docs.snowflake.com/en/sql-reference/functions/uniform.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# UNIFORM

Generates a uniformly-distributed pseudo-random number in the inclusive
range [`min`, `max`].

## Syntax

```sqlsyntax
UNIFORM( <min> , <max> , <gen> )
```

## Arguments

`min`
:   A constant specifying the minimum value (inclusive) of the generated number.

`max`
:   A constant specifying the maximum value (inclusive) of the generated number.

`gen`
:   An expression that serves as a raw source of uniform random numbers,
    typically the [RANDOM](random.md) function. For more information, see the Data
    Generation Functions [Usage notes](../functions-data-generation.md).

## Returns

If either or both of `min` or `max` is a floating-point number,
UNIFORM returns a floating-point number. If both `min` and
`max` are integers, UNIFORM returns an integer.

## Usage notes

This function is related to, but different from, the [RANDOM](random.md) function. Both
functions generate uniform distributions, but there are differences in the ranges of
the values returned.

* RANDOM generates pseudo-random 64-bit integers. It accepts an optional
  seed that allows sequences to be repeated.
* UNIFORM generates random integer or floating-point numbers in the
  specified range.

## Examples

The following examples demonstrate how to use the UNIFORM function. The values displayed in the output below might differ from
the values returned when you run these examples yourself.

This example generates five random integers in the range of 1 to 10 (inclusive):

```sqlexample
SELECT UNIFORM(1, 10, RANDOM()) FROM TABLE(GENERATOR(ROWCOUNT => 5));
```

```output
+--------------------------+
| UNIFORM(1, 10, RANDOM()) |
|--------------------------|
|                        6 |
|                        1 |
|                        8 |
|                        5 |
|                        6 |
+--------------------------+
```

This example generates five floating-point numbers in the range of 0 to 1 (inclusive):

```sqlexample
SELECT UNIFORM(0::FLOAT, 1::FLOAT, RANDOM()) FROM TABLE(GENERATOR(ROWCOUNT => 5));
```

```output
+---------------------------------------+
| UNIFORM(0::FLOAT, 1::FLOAT, RANDOM()) |
|---------------------------------------|
|                         0.1180758313  |
|                         0.4945805484  |
|                         0.7113092833  |
|                         0.06170806767 |
|                         0.01635235156 |
+---------------------------------------+
```

This example shows that if the `gen` argument is a constant, then the output is a constant:

```sqlexample
SELECT UNIFORM(1, 10, 1234) FROM TABLE(GENERATOR(ROWCOUNT => 5));
```

```output
+----------------------+
| UNIFORM(1, 10, 1234) |
|----------------------|
|                    7 |
|                    7 |
|                    7 |
|                    7 |
|                    7 |
+----------------------+
```

---
title: UNIQUE_COUNT (system data metric function)
source: https://docs.snowflake.com/en/sql-reference/functions/dmf_unique_count.md
section: SQL Functions
---

Categories:
:   [Data metric functions](../functions-data-metric.md)

# UNIQUE_COUNT (system data metric function)

Returns the total number of unique non-NULL values for the specified columns in a table.

This topic provides the syntax for calling the function directly. To learn how to associate the function with a table or view so it
runs at regular intervals, see [Associate a DMF](../../user-guide/data-quality-working.md).

## Syntax

```sqlsyntax
SNOWFLAKE.CORE.UNIQUE_COUNT(<query>)
```

## Arguments

`query`
:   Specifies a SQL query that projects a single column.

## Allowed data types

The column projected by the `query` must have one of the following data types:

* DATE
* FLOAT
* NUMBER
* TIMESTAMP_LTZ
* TIMESTAMP_NTZ
* TIMESTAMP_TZ
* VARCHAR

## Returns

The function returns a scalar value with a NUMBER data type.

## Usage notes

When you call a system DMF manually, you don’t need to specify whichever allowed data type you are using. You only need to specify the
query for the column that you want to measure. Snowflake matches the allowed data type for the function with the data type for the column.

## Example

Measure the number of unique non-NULL values for the SSN column (that is, US Social Security number):

```sqlexample
SELECT SNOWFLAKE.CORE.UNIQUE_COUNT(
  SELECT
    ssn
  FROM hr.tables.empl_info
);
```

```output
+------------------------------------------------------------------+
| SNOWFLAKE.CORE.UNIQUE_COUNT(SELECT ssn FROM hr.tables.empl_info) |
+------------------------------------------------------------------+
| 42                                                               |
+------------------------------------------------------------------+
```

---
title: UPPER
source: https://docs.snowflake.com/en/sql-reference/functions/upper.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (Case Conversion)

# UPPER

Returns the input string with all characters converted to uppercase.

## Syntax

```sqlsyntax
UPPER( <expr> )
```

## Arguments

`expr`
:   The string expression.

## Returns

This function returns a value of type VARCHAR.

## Examples

```sqlexample
SELECT v, UPPER(v) FROM lu;
```

```output
+----------------------------------+----------------------------------+
|                v                 |             upper(v)             |
+----------------------------------+----------------------------------+
|                                  |                                  |
| 1č2Щ3ß4Ę!-?abc@                  | 1Č2Щ3SS4Ę!-?ABC@                 |
| AaBbCcDdEeFfGgHhIiJj             | AABBCCDDEEFFGGHHIIJJ             |
| KkLlMmNnOoPpQqRrSsTt             | KKLLMMNNOOPPQQRRSSTT             |
| UuVvWwXxYyZz                     | UUVVWWXXYYZZ                     |
| ÁáÄäÉéÍíÓóÔôÚúÝý                 | ÁÁÄÄÉÉÍÍÓÓÔÔÚÚÝÝ                 |
| ÄäÖößÜü                          | ÄÄÖÖSSÜÜ                         |
| ÉéÀàÈèÙùÂâÊêÎîÔôÛûËëÏïÜüŸÿÇçŒœÆæ | ÉÉÀÀÈÈÙÙÂÂÊÊÎÎÔÔÛÛËËÏÏÜÜŸŸÇÇŒŒÆÆ |
| ĄąĆćĘęŁłŃńÓóŚśŹźŻż               | ĄĄĆĆĘĘŁŁŃŃÓÓŚŚŹŹŻŻ               |
| ČčĎďĹĺĽľŇňŔŕŠšŤťŽž               | ČČĎĎĹĹĽĽŇŇŔŔŠŠŤŤŽŽ               |
| АаБбВвГгДдЕеЁёЖжЗзИиЙй           | ААББВВГГДДЕЕЁЁЖЖЗЗИИЙЙ           |
| КкЛлМмНнОоПпРрСсТтУуФф           | ККЛЛММННООППРРССТТУУФФ           |
| ХхЦцЧчШшЩщЪъЫыЬьЭэЮюЯя           | ХХЦЦЧЧШШЩЩЪЪЫЫЬЬЭЭЮЮЯЯ           |
| [NULL]                           | [NULL]                           |
+----------------------------------+----------------------------------+
```

UPPER supports [collation](../collation.md) specifications. This UPPER example
specifies collation with the `tr` (Turkish) locale:

```sqlexample
SELECT UPPER('i' COLLATE 'tr');
```

```output
+-------------------------+
| UPPER('I' COLLATE 'TR') |
|-------------------------|
| İ                       |
+-------------------------+
```

---
title: UUID_STRING
source: https://docs.snowflake.com/en/sql-reference/functions/uuid_string.md
section: SQL Functions
---

Categories:
:   [String & binary functions](../functions-string.md) (General) , [Data generation functions](../functions-data-generation.md)

# UUID_STRING

Generates either a version 4 (random) or version 5 (named) RFC 4122-compliant universally unique identifier (UUID)
as a formatted string.

## Syntax

```sqlsyntax
UUID_STRING()

UUID_STRING( '<uuid>' , '<name>' )
```

## Arguments

`'uuid'`
:   A valid UUID string. This value is the namespace used to generate the returned UUID.

`'name'`
:   The name used to generate the returned UUID.

## Returns

This function returns a 128-bit value, formatted as a string (VARCHAR data type).

## Usage notes

UUID_STRING supports generating two versions of UUIDs, both compliant with RFC 4122:

* A version 4 (random) UUID is returned when no arguments are provided to the function. For random-number generation, the
  64-bit [Mersenne twister](http://en.wikipedia.org/wiki/Mersenne_twister) known as MT19937-64 is used.
* A version 5 (named) UUID can be produced by providing a `uuid` string (known as the namespace) as the first
  argument and a `name` string as the second argument.

## Examples

Generate a random UUID:

```sqlexample
SELECT UUID_STRING();
```

```output
+--------------------------------------+
| UUID_STRING()                        |
|--------------------------------------|
| d47f4e30-306f-4940-8921-c154094df1a1 |
+--------------------------------------+
```

Generate a named UUID:

```sqlexample
SELECT UUID_STRING('fe971b24-9572-4005-b22f-351e9c09274d','foo');
```

```output
+-----------------------------------------------------------+
| UUID_STRING('FE971B24-9572-4005-B22F-351E9C09274D','FOO') |
|-----------------------------------------------------------|
| dc0b6f65-fca6-5b4b-9d37-ccc3fde1f3e2                      |
+-----------------------------------------------------------+
```

Create a table and insert random UUIDs:

```sqlexample
CREATE OR REPLACE TABLE uuid_insert_test(random_uuid VARCHAR(36), test VARCHAR(10));

INSERT INTO uuid_insert_test (random_uuid, test) SELECT UUID_STRING(), 'test1';
INSERT INTO uuid_insert_test (random_uuid, test) SELECT UUID_STRING(), 'test2';
INSERT INTO uuid_insert_test (random_uuid, test) SELECT UUID_STRING(), 'test3';
INSERT INTO uuid_insert_test (random_uuid, test) SELECT UUID_STRING(), 'test4';
INSERT INTO uuid_insert_test (random_uuid, test) SELECT UUID_STRING(), 'test5';

SELECT * FROM uuid_insert_test;
```

```output
+--------------------------------------+-------+
| RANDOM_UUID                          | TEST  |
|--------------------------------------+-------|
| 7745a0cf-d136-406b-9289-38072d242871 | test1 |
| 8c31e031-a6bf-479d-9abb-b7909f298ba1 | test2 |
| e65d5641-01c0-4126-b80d-c5ae6d4848be | test3 |
| bd02bf4e-fa5d-498d-8a9a-d38200f1ca30 | test4 |
| 4df2a34e-ad65-46b4-a51a-3eb9394aeb83 | test5 |
+--------------------------------------+-------+
```

---
title: VALIDATE
source: https://docs.snowflake.com/en/sql-reference/functions/validate.md
section: SQL Functions
---

Categories:
:   [Table functions](../functions-table.md)

# VALIDATE

Validates the files loaded in a past execution of the [COPY INTO <table>](../sql/copy-into-table.md) command and returns all the errors encountered during the load, rather than just the first error.

## Syntax

```sqlsyntax
VALIDATE( [<namespace>.]<table_name> , JOB_ID => { '<query_id>' | '_last' } )
```

## Arguments

`[namespace.]table_name`
:   Specifies the fully-qualified name of the table that was the target of the load.

    Namespace is the database and/or schema in which the table resides, in the form of `database_name.schema_name` or `schema_name`. It is optional if a database and schema
    are currently in use within the user session; otherwise, it is required.

`JOB_ID => query_id | _last`
:   The ID for the [COPY INTO <table>](../sql/copy-into-table.md) command to be validated:

    * The ID can be obtained from the Query ID column in the Query History page in Snowsight. The specified query ID must have been for the specified target table.
    * If `_last` is specified instead of `query_id`, the function validates the last load executed during the current session, regardless of the specified target table.

## Usage notes

* The validation returns no results for COPY statements that specify `ON_ERROR = ABORT_STATEMENT` (default value).
* Validation fails if:

  > + [SELECT](../sql/select.md) statements are used to transform data during a [COPY INTO <table>](../sql/copy-into-table.md) operation.
  > + The current user does not have access to `table_name`.
  > + The current user is not the user who executed `query_id` and does not have access control privileges on this user.
  > + The copy history metadata has expired. For more information, refer to [Load metadata](../../user-guide/data-load-considerations-load.md).
* If new files have been added to the stage used by `query_id` since the load was executed, the new files added are ignored during the validation.
* If files have been removed from the stage used by `query_id` since the load was executed, the files removed are reported as missing.

## Examples

Return errors for the last executed COPY command:

> ```sqlexample
> SELECT * FROM TABLE(VALIDATE(t1, JOB_ID => '_last'));
> ```

Return errors by specifying a query ID obtained from the Query History page in Snowsight or the Query History page in Snowsight:

> ```sqlexample
> SELECT * FROM TABLE(VALIDATE(t1, JOB_ID=>'5415fa1e-59c9-4dda-b652-533de02fdcf1'));
> ```

Same query as above, but save the results to a table for future reference:

> ```sqlexample
> CREATE OR REPLACE TABLE save_copy_errors AS SELECT * FROM TABLE(VALIDATE(t1, JOB_ID=>'5415fa1e-59c9-4dda-b652-533de02fdcf1'));
> ```

---
title: VALIDATE_PIPE_LOAD
source: https://docs.snowflake.com/en/sql-reference/functions/validate_pipe_load.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# VALIDATE_PIPE_LOAD

This table function can be used to validate data files processed by [Snowpipe](../../user-guide/data-load-snowpipe-intro.md) within a specified time range. The function returns details about any errors encountered during an attempted data load into Snowflake tables.

> **Note:**
>
> This function returns pipe activity within the last 14 days.

## Syntax

```sqlsyntax
VALIDATE_PIPE_LOAD(
      PIPE_NAME => '<string>'
       , START_TIME => <constant_expr>
      [, END_TIME => <constant_expr> ] )
```

## Arguments

`PIPE_NAME => string`
:   A string specifying a pipe. The function returns results for the specified pipe only.

`START_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 14 days, marking the start of the time range for retrieving error events.

**Optional:**

`END_TIME => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format), within the last 14 days, marking the end of the time range for retrieving error events.

## Usage notes

* Returns results only for the pipe owner (i.e. the role with the OWNERSHIP privilege on the pipe) or a role with the following minimum permissions:

  | Privilege | Object | Notes |
  | --- | --- | --- |
  | MONITOR | Pipe | Alternatively, the global MONITOR EXECUTION privilege is supported. |
  | USAGE | Stage in the pipe definition | External stages only |
  | READ | Stage in the pipe definition | Internal stages only |
  | SELECT | Table in the pipe definition |  |
  | INSERT | Table in the pipe definition |  |

  SQL operations on schema objects also require the USAGE privilege on the database and schema that contain the object.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* If Snowpipe encountered no errors while processing data files within the specified time range, the function returns no results.
* If the COPY statement in the pipe description includes a query to further transform the data during the load (i.e. a COPY transformation), then the function currently returns a user error.
* If the specified date range falls outside the last 15 days, an error is returned.

## Output

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ERROR | TEXT | First error in the source file. |
| FILE | TEXT | Name of the source file where the error was encountered. |
| LINE | NUMBER | Number of the line in the source file where the error was encountered. |
| CHARACTER | NUMBER | Position of the character where the error was encountered. |
| BYTE_OFFSET | NUMBER | Byte offset to the character where the error was encountered. |
| CATEGORY | TEXT | Category of the operation when the error was produced. |
| CODE | NUMBER | ID for the error message displayed in the ERROR column. |
| SQL_STATE | NUMBER | SQL state code. |
| COLUMN_NAME | TEXT | Name and order of the column that contained the error. |
| ROW_NUMBER | NUMBER | Number of the row in the source file where the error was encountered. |
| ROW_START_LINE | NUMBER | Number of the first line of the row where the error was encountered. |
| REJECTED_RECORD | TEXT | Record that contained the error. |

## Examples

Validate any loads for the `mypipe` pipe within the previous hour:

> ```sqlexample
> select * from table(validate_pipe_load(
>   pipe_name=>'MY_DB.PUBLIC.MYPIPE',
>   start_time=>dateadd(hour, -1, current_timestamp())));
> ```

---
title: VAR_POP
source: https://docs.snowflake.com/en/sql-reference/functions/var_pop.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# VAR_POP

Returns the population variance of non-NULL records in a group. If all records inside a group are NULL, a NULL is returned.

Aliases:
:   [VARIANCE_POP](variance_pop.md)

## Syntax

**Aggregate function**

```sqlsyntax
VAR_POP( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
VAR_POP( [ DISTINCT ] <expr1> ) OVER (
                                     [ PARTITION BY <expr2> ]
                                     [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                     )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   The `expr1` should evaluate to one of the numeric data types.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition.

## Returns

The data type of the returned value is `NUMBER(<precision>, <scale>)`. The scale depends upon the values being processed.

## Usage notes

* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

## Examples

This example shows how to use the VAR_POP function:

Create and fill a table:

```sqlexample
CREATE TABLE aggr (k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));

INSERT INTO aggr VALUES
  (1, 10, NULL),
  (2, 10, 11),
  (2, 20, 22),
  (2, 25, NULL),
  (2, 30, 35);
```

Query the table:

```sqlexample
SELECT k, VAR_POP(v), VAR_POP(v2)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+---------------+---------------+
| K |    VAR_POP(V) |   VAR_POP(V2) |
|---+---------------+---------------|
| 1 |  0.0000000000 |          NULL |
| 2 | 54.6875000000 | 96.2222222222 |
+---+---------------+---------------+
```

---
title: VAR_SAMP
source: https://docs.snowflake.com/en/sql-reference/functions/var_samp.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# VAR_SAMP

Returns the sample variance of non-NULL records in a group. If all records inside a group are NULL, a NULL is returned.

Aliases:
:   [VARIANCE , VARIANCE_SAMP](variance.md)

## Syntax

**Aggregate function**

```sqlsyntax
VAR_SAMP( [DISTINCT] <expr1> )
```

**Window function**

```sqlsyntax
VAR_SAMP( <expr1> ) OVER (
                         [ PARTITION BY <expr2> ]
                         [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                         )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   The `expr1` should evaluate to one of the numeric data types.

`expr2`
:   This is the expression to partition by.

`expr3`
:   This is the expression to order by within each partition.

## Returns

The data type of the returned value is `NUMBER(<precision>, <scale>)`. The scale depends upon the values being processed.

## Usage notes

* For single-record inputs, VAR_SAMP, VARIANCE, and VARIANCE_SAMP all return NULL. This is different from the Oracle behavior,
  where VAR_SAMP returns NULL for a single record and VARIANCE returns 0.
* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.
* When this function is called as a window function:

  + The syntax allows the DISTINCT keyword, but it is ignored.
  + If you do not specify a window frame, the following implied window frame is used:

    > `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see
    [Window function syntax and usage](../functions-window-syntax.md).

## Examples

This example shows how to use the VAR_SAMP function:

Create and fill a table:

```sqlexample
CREATE TABLE aggr (k INT, v DECIMAL(10,2), v2 DECIMAL(10, 2));

INSERT INTO aggr VALUES
  (1, 10, NULL),
  (2, 10, 11),
  (2, 20, 22),
  (2, 25, NULL),
  (2, 30, 35);
```

Query the table:

```sqlexample
SELECT k, VAR_SAMP(v), VAR_SAMP(v2)
  FROM aggr
  GROUP BY k
  ORDER BY k;
```

```output
+---+---------------+----------------+
| K |   VAR_SAMP(V) |   VAR_SAMP(V2) |
|---+---------------+----------------|
| 1 |          NULL |           NULL |
| 2 | 72.9166666667 | 144.3333333333 |
+---+---------------+----------------+
```

---
title: VARIANCE , VARIANCE_SAMP
source: https://docs.snowflake.com/en/sql-reference/functions/variance.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# VARIANCE , VARIANCE_SAMP

Returns the sample variance of non-NULL records in a group. If all records inside a group are NULL, a NULL is returned.

Aliases:
:   [VAR_SAMP](var_samp.md)

## Syntax

**Aggregate function**

```sqlsyntax
VARIANCE( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
VARIANCE( [ DISTINCT ] <expr1> ) OVER (
                                      [ PARTITION BY <expr2> ]
                                      [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                      )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   The `expr1` should evaluate to one of the numeric data types.

`expr2`
:   This is the expression to partition by.

`expr3`
:   This is the expression to order by within each partition.

## Returns

The data type of the returned value is `NUMBER(<precision>, <scale>)`. The scale depends upon the values being processed.

## Usage notes

* For single-record inputs, VAR_SAMP, VARIANCE, and VARIANCE_SAMP all return NULL. This is different from the Oracle behavior,
  where VAR_SAMP returns NULL for a single record and VARIANCE returns 0.
* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

## Examples

For examples, see [VAR_SAMP](var_samp.md).

---
title: VARIANCE_POP
source: https://docs.snowflake.com/en/sql-reference/functions/variance_pop.md
section: SQL Functions
---

Categories:
:   [Aggregate functions](../functions-aggregation.md) (General) , [Window function syntax and usage](../functions-window-syntax.md) (General)

# VARIANCE_POP

Returns the population variance of non-NULL records in a group. If all records inside a group are NULL, a NULL is returned.

Aliases:
:   [VAR_POP](var_pop.md)

## Syntax

**Aggregate function**

```sqlsyntax
VARIANCE_POP( [ DISTINCT ] <expr1> )
```

**Window function**

```sqlsyntax
VARIANCE_POP( [ DISTINCT ] <expr1> ) OVER (
                                          [ PARTITION BY <expr2> ]
                                          [ ORDER BY <expr3> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ <window_frame> ] ]
                                          )
```

For detailed `window_frame` syntax, see [Window function syntax and usage](../functions-window-syntax.md).

## Arguments

`expr1`
:   The `expr1` should evaluate to one of the numeric data types.

`expr2`
:   This is the optional expression to partition by.

`expr3`
:   This is the optional expression to order by within each partition.

## Returns

The data type of the returned value is `NUMBER(<precision>, <scale>)`. The scale depends upon the values being processed.

## Usage notes

* When passed a VARCHAR expression, this function implicitly casts the input to floating point values. If the cast
  cannot be performed, an error is returned.

* When this function is called as a window function with an OVER clause that contains an ORDER BY clause:

  + A window frame is required. If no window frame is specified explicitly, the following implied window frame is used:

    `RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`

    For more information about window frames, including syntax, usage notes, and examples, see [Window function syntax and usage](../functions-window-syntax.md).
  + Using the keyword DISTINCT inside the window function is prohibited and results in a compile-time error.

## Examples

For examples, see [VAR_POP](var_pop.md).

---
title: VECTOR_AVG
source: https://docs.snowflake.com/en/sql-reference/functions/vector_avg.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md) , [Aggregate functions](../functions-aggregation.md)

# VECTOR_AVG

Computes the element-wise average of [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. Returns a vector where
each element is the average of the corresponding elements across all input vectors. The output is always VECTOR(FLOAT, N) regardless of input type.

See also:
:   [VECTOR_SUM](vector_sum.md) , [VECTOR_MIN](vector_min.md) , [VECTOR_MAX](vector_max.md) , [AVG](avg.md), [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_AVG( <vector_column> )
```

## Arguments

`vector_column`
:   A column containing [VECTOR](../data-types-vector.md) values. All vectors in the column must have the same element type and dimension.

## Returns

Returns a VECTOR(FLOAT, N) value where N is the dimension of the input vectors. Each element in the result vector is the average of the corresponding elements across all input vectors.

## Usage notes

* NULL values are ignored in the aggregation.
* If all values in the group are NULL, the function returns NULL.
* All input vectors in the column must have the same dimension and element type.
* The output is always VECTOR(FLOAT, N) regardless of the input’s type. For information on floating-point numbers in Snowflake, see [Floating-point data types](../data-types-numeric.md).
* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example demonstrates computing the element-wise average of vectors:

```sqlexample
CREATE OR REPLACE TABLE vector_data (
  id INT,
  category VARCHAR,
  embedding VECTOR(FLOAT, 3)
);

INSERT INTO vector_data
SELECT 1, 'A', [2.0, 4.0, 6.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 2, 'A', [4.0, 8.0, 12.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 3, 'B', [1.0, 2.0, 3.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 4, 'B', [3.0, 6.0, 9.0]::VECTOR(FLOAT, 3);

-- Compute average for each category
SELECT category, VECTOR_AVG(embedding) AS avg_vector
  FROM vector_data
  GROUP BY category
  ORDER BY category;
```

```output
+----------+------------------+
| CATEGORY | AVG_VECTOR       |
+----------+------------------+
| A        | [3.0, 6.0, 9.0] |
| B        | [2.0, 4.0, 6.0] |
+----------+------------------+
```

This example shows scalar aggregation (no GROUP BY):

```sqlexample
SELECT VECTOR_AVG(embedding) AS overall_avg
  FROM vector_data;
```

```output
+------------------+
| OVERALL_AVG      |
+------------------+
| [2.5, 5.0, 7.5]  |
+------------------+
```

This example shows how integer vectors are converted to float output:

```sqlexample
CREATE OR REPLACE TABLE int_vector_data (
  id INT,
  vec VECTOR(INT, 2)
);

INSERT INTO int_vector_data
SELECT 1, [1, 3]::VECTOR(INT, 2)
UNION ALL SELECT 2, [2, 4]::VECTOR(INT, 2);

SELECT VECTOR_AVG(vec) AS avg_result
  FROM int_vector_data;
```

```output
+-------------+
| AVG_RESULT  |
+-------------+
| [1.5, 3.5]  |
+-------------+
```

---
title: VECTOR_COSINE_SIMILARITY
source: https://docs.snowflake.com/en/sql-reference/functions/vector_cosine_similarity.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_COSINE_SIMILARITY

Computes the cosine similarity between two [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md).

Cosine similarity is based on the angle between two vectors in a multi-dimensional space; the magnitude of the vectors is not
considered. The cosine similarity value is the inner product of the vectors divided by the product of their lengths. The cosine
similarity is always in the interval `[-1, 1]`. For example, identical vectors have a cosine similarity of `1`, two
orthogonal vectors have a similarity of `0`, and two opposite vectors have a similarity of `-1`.

See also:
:   [VECTOR_INNER_PRODUCT](vector_inner_product.md) , [VECTOR_L1_DISTANCE](vector_l1_distance.md) , [VECTOR_L2_DISTANCE](vector_l2_distance.md) , [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_COSINE_SIMILARITY( <vector>, <vector> )
```

## Arguments

`vector`
:   The [VECTOR](../data-types-vector.md) value to calculate the angle from.

`vector`
:   The VECTOR value to calculate the angle to.

## Returns

Returns a [FLOAT](../data-types-numeric.md) value in the interval `[-1, 1]`, which indicates the
cosine similarity between the two input vectors.

## Usage notes

* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example calls the VECTOR_COSINE_SIMILARITY function to find the vector closest to `[1,2,3]`.

```sqlexample
SELECT a, VECTOR_COSINE_SIMILARITY(a, [1,2,3]::VECTOR(FLOAT, 3)) AS similarity
  FROM vectors
  ORDER BY similarity DESC
  LIMIT 1;
```

```output
+-------------------------+
| [1, 2.2, 3] | 0.9990... |
+-------------------------+
```

---
title: VECTOR_INNER_PRODUCT
source: https://docs.snowflake.com/en/sql-reference/functions/vector_inner_product.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_INNER_PRODUCT

Computes the inner product of two [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md).

The inner product (also known as the dot or scalar product) multiplies two vectors. The result represents the combined direction
of the two vectors. Similar vectors result in larger inner products than dissimilar ones.

See also:
:   [VECTOR_COSINE_SIMILARITY](vector_cosine_similarity.md) , [VECTOR_L1_DISTANCE](vector_l1_distance.md) , [VECTOR_L2_DISTANCE](vector_l2_distance.md) , [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_INNER_PRODUCT( <vector>, <vector> )
```

## Arguments

`vector`
:   First [VECTOR](../data-types-vector.md) value.

`vector`
:   Second VECTOR value.

## Returns

Returns a REAL that is the inner product of the two vectors given as inputs.

## Usage notes

* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example uses the VECTOR_INNER_PRODUCT function to determine which vectors in the table
are closest to each other between columns `a` and `b`:

```sqlexample
CREATE TABLE vectors (a VECTOR(FLOAT, 3), b VECTOR(FLOAT, 3));
INSERT INTO vectors SELECT [1.1,2.2,3]::VECTOR(FLOAT,3), [1,1,1]::VECTOR(FLOAT,3);
INSERT INTO vectors SELECT [1,2.2,3]::VECTOR(FLOAT,3), [4,6,8]::VECTOR(FLOAT,3);

-- Compute the pairwise inner product between columns a and b
SELECT VECTOR_INNER_PRODUCT(a, b) FROM vectors;
```

```output
+------+
| 6.3  |
|------|
| 41.2 |
+------+
```

---
title: VECTOR_L1_DISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/vector_l1_distance.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_L1_DISTANCE

Computes the L1 distance between two [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md).

L1 distance, also known as the Taxicab or Manhattan distance, is a measure of
the distance between two points in a vector space. The distance is calculated by
taking the sum of the absolute value of the differences of vector elements. The
result is a value of zero or higher. If the distance is zero, the vectors
are identical. The larger the distance, the farther apart the vectors are.

See also:
:   [VECTOR_INNER_PRODUCT](vector_inner_product.md) , [VECTOR_L2_DISTANCE](vector_l2_distance.md) , [VECTOR_COSINE_SIMILARITY](vector_cosine_similarity.md) , [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_L1_DISTANCE( <vector>, <vector> )
```

## Arguments

`vector`
:   The [VECTOR](../data-types-vector.md) value to calculate the distance from.

`vector`
:   The VECTOR value to calculate the distance to.

## Returns

Returns the L1 distance between the two input vectors as a [FLOAT](../data-types-numeric.md) value.

## Usage notes

* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example uses the VECTOR_L1_DISTANCE function to determine which vectors in
the table are closest to each other between columns `a` and `b`:

```sqlexample
CREATE TABLE vectors (a VECTOR(FLOAT, 3), b VECTOR(FLOAT, 3));
INSERT INTO vectors SELECT [1.1,2.2,3]::VECTOR(FLOAT,3), [1,1,1]::VECTOR(FLOAT,3);
INSERT INTO vectors SELECT [1,2.2,3]::VECTOR(FLOAT,3), [4,6,8]::VECTOR(FLOAT,3);

SELECT VECTOR_L1_DISTANCE(a, b) FROM vectors;
```

```output
+--------------+
| 3.300000191  |
|--------------|
| 11.800000191 |
+--------------+
```

---
title: VECTOR_L2_DISTANCE
source: https://docs.snowflake.com/en/sql-reference/functions/vector_l2_distance.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_L2_DISTANCE

Computes the L2 distance between two [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md).

L2 distance, also known as the Euclidean distance, is a measure of the distance between two vectors in a vector space. The
distance is calculated by taking the square root of the sum of the squared differences of vector elements. The distance can be
a value of zero or higher. If the distance is zero, the vectors are identical. A larger distance indicates that the vectors are farther apart.

See also:
:   [VECTOR_INNER_PRODUCT](vector_inner_product.md) , [VECTOR_COSINE_SIMILARITY](vector_cosine_similarity.md) , [VECTOR_L1_DISTANCE](vector_l1_distance.md) , [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_L2_DISTANCE( <vector>, <vector> )
```

## Arguments

`vector`
:   The [VECTOR](../data-types-vector.md) value to calculate the distance from.

`vector`
:   The VECTOR value to calculate the distance to.

## Returns

Returns the distance between the two input vectors as a [FLOAT](../data-types-numeric.md) value.

## Usage notes

* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example uses the VECTOR_L2_DISTANCE function to determine which vectors in the table
are closest to each other between columns `a` and `b`:

```sqlexample
CREATE TABLE vectors (a VECTOR(FLOAT, 3), b VECTOR(FLOAT, 3));
INSERT INTO vectors SELECT [1.1,2.2,3]::VECTOR(FLOAT,3), [1,1,1]::VECTOR(FLOAT,3);
INSERT INTO vectors SELECT [1,2.2,3]::VECTOR(FLOAT,3), [4,6,8]::VECTOR(FLOAT,3);

-- Compute the pairwise inner product between columns a and b
SELECT VECTOR_L2_DISTANCE(a, b) FROM vectors;
```

```output
+------+
| 2.3  |
|------|
| 6.95 |
+------+
```

---
title: VECTOR_MAX
source: https://docs.snowflake.com/en/sql-reference/functions/vector_max.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md) , [Aggregate functions](../functions-aggregation.md)

# VECTOR_MAX

Computes the element-wise maximum of [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. Returns a vector where
each element is the maximum of the corresponding elements across all input vectors.

See also:
:   [VECTOR_SUM](vector_sum.md) , [VECTOR_MIN](vector_min.md) , [VECTOR_AVG](vector_avg.md) , [MAX](max.md), [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_MAX( <vector_column> )
```

## Arguments

`vector_column`
:   A column containing [VECTOR](../data-types-vector.md) values. All vectors in the column must have the same element type and dimension.

## Returns

Returns a VECTOR value with the same element type and dimension as the input vectors. Each element in the result vector is the maximum of the corresponding elements across all input vectors.

## Usage notes

* NULL values are ignored in the aggregation.
* If all values in the group are NULL, the function returns NULL.
* All input vectors in the column must have the same dimension and element type.
* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example demonstrates computing the element-wise maximum of vectors:

```sqlexample
CREATE OR REPLACE TABLE vector_data (
  id INT,
  category VARCHAR,
  embedding VECTOR(FLOAT, 3)
);

INSERT INTO vector_data
SELECT 1, 'A', [1.5, 8.0, 3.2]::VECTOR(FLOAT, 3)
UNION ALL SELECT 2, 'A', [4.1, 2.5, 6.7]::VECTOR(FLOAT, 3)
UNION ALL SELECT 3, 'B', [2.0, 1.0, 4.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 4, 'B', [3.0, 2.0, 1.0]::VECTOR(FLOAT, 3);

-- Compute maximum for each category
SELECT category, VECTOR_MAX(embedding) AS max_vector
  FROM vector_data
  GROUP BY category
  ORDER BY category;
```

```output
+----------+------------------+
| CATEGORY | MAX_VECTOR       |
+----------+------------------+
| A        | [4.1, 8.0, 6.7] |
| B        | [3.0, 2.0, 4.0] |
+----------+------------------+
```

This example shows scalar aggregation (no GROUP BY):

```sqlexample
SELECT VECTOR_MAX(embedding) AS overall_max
  FROM vector_data;
```

```output
+------------------+
| OVERALL_MAX      |
+------------------+
| [4.1, 8.0, 6.7]  |
+------------------+
```

---
title: VECTOR_MIN
source: https://docs.snowflake.com/en/sql-reference/functions/vector_min.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md) , [Aggregate functions](../functions-aggregation.md)

# VECTOR_MIN

Computes the element-wise minimum of [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. Returns a vector where
each element is the minimum of the corresponding elements across all input vectors.

See also:
:   [VECTOR_SUM](vector_sum.md) , [VECTOR_MAX](vector_max.md) , [VECTOR_AVG](vector_avg.md) , [MIN](min.md), [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_MIN( <vector_column> )
```

## Arguments

`vector_column`
:   A column containing [VECTOR](../data-types-vector.md) values. All vectors in the column must have the same element type and dimension.

## Returns

Returns a VECTOR value with the same element type and dimension as the input vectors. Each element in the result vector is the minimum of the corresponding elements across all input vectors.

## Usage notes

* NULL values are ignored in the aggregation.
* If all values in the group are NULL, the function returns NULL.
* All input vectors in the column must have the same dimension and element type.
* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example demonstrates computing the element-wise minimum of vectors:

```sqlexample
CREATE OR REPLACE TABLE vector_data (
  id INT,
  category VARCHAR,
  embedding VECTOR(FLOAT, 3)
);

INSERT INTO vector_data
SELECT 1, 'A', [1.5, 8.0, 3.2]::VECTOR(FLOAT, 3)
UNION ALL SELECT 2, 'A', [4.1, 2.5, 6.7]::VECTOR(FLOAT, 3)
UNION ALL SELECT 3, 'B', [2.0, 1.0, 4.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 4, 'B', [3.0, 2.0, 1.0]::VECTOR(FLOAT, 3);

-- Compute minimum for each category
SELECT category, VECTOR_MIN(embedding) AS min_vector
  FROM vector_data
  GROUP BY category
  ORDER BY category;
```

```output
+----------+------------------+
| CATEGORY | MIN_VECTOR       |
+----------+------------------+
| A        | [1.5, 2.5, 3.2] |
| B        | [2.0, 1.0, 1.0] |
+----------+------------------+
```

This example shows scalar aggregation (no GROUP BY):

```sqlexample
SELECT VECTOR_MIN(embedding) AS overall_min
  FROM vector_data;
```

```output
+------------------+
| OVERALL_MIN      |
+------------------+
| [1.5, 1.0, 1.0]  |
+------------------+
```

---
title: VECTOR_NORMALIZE
source: https://docs.snowflake.com/en/sql-reference/functions/vector_normalize.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_NORMALIZE

Normalizes a [VECTOR](../data-types-vector.md) in the L2 vector space, giving its elements values in the range of [0,1] and giving it a magnitude of 1.

See also:
:   [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md), [VECTOR_TRUNCATE](vector_truncate.md)

## Syntax

```sqlsyntax
VECTOR_NORMALIZE( <vector> )
```

## Arguments

`vector`
:   A single VECTOR value to normalize.

## Returns

Returns a VECTOR normalized to the L2 space, with values of type [FLOAT](../data-types-numeric.md).

## Usage notes

* Returns NULL when the input is NULL.
* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example demonstrates normalizing the vector `[1, 2, 3]`:

```sqlexample
SELECT VECTOR_NORMALIZE([1, 2, 3]::VECTOR(INT, 3));
```

```output
[0.267261, 0.534522, 0.801784]
```

This example shows how to re-normalize a truncated vector. The original vector is produced by [EMBED_TEXT_768](../../user-guide/snowflake-cortex/vector-embeddings.md) with the `snowflake-arctic-embed-m-v1.5` model, and then truncated to 256 elements. The truncated vector is then normalized:

```sqlexample
VECTOR_NORMALIZE(
    VECTOR_TRUNCATE(
        SNOWFLAKE.CORTEX.EMBED_TEXT_768(
            'snowflake-arctic-embed-m-v1.5',
            'Analytical databases are typically column-oriented rather than row-oriented'
        ),
        256
    )
);
```

---
title: VECTOR_SUM
source: https://docs.snowflake.com/en/sql-reference/functions/vector_sum.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md) , [Aggregate functions](../functions-aggregation.md)

# VECTOR_SUM

Computes the element-wise sum of [vectors](../../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. Returns a vector where
each element is the sum of the corresponding elements across all input vectors.

See also:
:   [VECTOR_MIN](vector_min.md) , [VECTOR_MAX](vector_max.md) , [VECTOR_AVG](vector_avg.md) , [SUM](sum.md), [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md)

## Syntax

```sqlsyntax
VECTOR_SUM( <vector_column> )
```

## Arguments

`vector_column`
:   A column containing [VECTOR](../data-types-vector.md) values. All vectors in the column must have the same element type and dimension.

## Returns

Returns a VECTOR value with the same element type and dimension as the input vectors. Each element in the result vector is the sum of the corresponding elements across all input vectors.

## Usage notes

* NULL values are ignored in the aggregation.
* If all values in the group are NULL, the function returns NULL.
* All input vectors in the column must have the same dimension and element type.
* Vector functions are optimized in a way that can reduce floating point precision. This function’s results have a margin of error up to `1e-4`.

## Examples

This example demonstrates computing the element-wise sum of vectors:

```sqlexample
CREATE OR REPLACE TABLE vector_data (
  id INT,
  category VARCHAR,
  embedding VECTOR(FLOAT, 3)
);

INSERT INTO vector_data
SELECT 1, 'A', [1.0, 2.0, 3.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 2, 'A', [4.0, 5.0, 6.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 3, 'B', [2.0, 1.0, 4.0]::VECTOR(FLOAT, 3)
UNION ALL SELECT 4, 'B', [3.0, 2.0, 1.0]::VECTOR(FLOAT, 3);

-- Compute sum for each category
SELECT category, VECTOR_SUM(embedding) AS sum_vector
  FROM vector_data
  GROUP BY category
  ORDER BY category;
```

```output
+----------+------------------+
| CATEGORY | SUM_VECTOR       |
+----------+------------------+
| A        | [5.0, 7.0, 9.0]  |
| B        | [5.0, 3.0, 5.0]  |
+----------+------------------+
```

This example shows scalar aggregation (no GROUP BY):

```sqlexample
SELECT VECTOR_SUM(embedding) AS total_sum
  FROM vector_data;
```

```output
+--------------------+
| TOTAL_SUM          |
+--------------------+
| [10.0, 10.0, 14.0] |
+--------------------+
```

---
title: VECTOR_TRUNCATE
source: https://docs.snowflake.com/en/sql-reference/functions/vector_truncate.md
section: SQL Functions
---

Categories:
:   [Vector functions](../functions-vector.md)

# VECTOR_TRUNCATE

Truncates a [VECTOR](../data-types-vector.md) to a smaller dimension.

This function can also be called through the alias VECTOR_TRUNC.

See also:
:   [Vector Embeddings](../../user-guide/snowflake-cortex/vector-embeddings.md), [VECTOR_NORMALIZE](vector_normalize.md)

## Syntax

```sqlsyntax
VECTOR_TRUNCATE( <vector>, <dimension> )
```

## Arguments

`vector`
:   A single [VECTOR](../data-types-vector.md) value to truncate.

`dimension`
:   The number of elements that should be in the returned vector.

## Returns

Returns a VECTOR value with the same values and types for the first `dimension` entries, with the remainder discarded.

## Usage notes

* Returns NULL when any input is NULL.
* Using a `dimension` larger than the number of dimensions in the `vector` causes an error.
* Truncated vectors are not normalized.

## Examples

This example demonstrates truncating a 3-dimensional vector into a 2-dimensional vector:

```sqlexample
SELECT VECTOR_TRUNCATE([1, 2, 3]::VECTOR(INT, 3), 2);
```

```output
[1,2]
```

This example demonstrates truncating a vector produced by [EMBED_TEXT_768](../../user-guide/snowflake-cortex/vector-embeddings.md) for the text “Analytical databases are typically column-oriented rather than row-oriented” with the `snowflake-arctic-embed-m-v1.5` model from 768 elements to 256 elements:

```sqlexample
SELECT VECTOR_TRUNCATE(
    SNOWFLAKE.CORTEX.EMBED_TEXT_768(
        'snowflake-arctic-embed-m-v1.5',
        'Analytical databases are typically column-oriented rather than row-oriented'
    ),
    256)
;
```

---
title: WAREHOUSE_LOAD_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/warehouse_load_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# WAREHOUSE_LOAD_HISTORY

This table function can be used to query the activity history (defined as the “query load”) for a single warehouse within a specified date range.

> **Note:**
>
> This function returns warehouse activity within the last 14 days.

> **Note:**
>
> Specifying a date value that is within one minute of the current timestamp can produce inaccurate results.

See also:
:   [WAREHOUSE_METERING_HISTORY](warehouse_metering_history.md)

## Syntax

```sqlsyntax
WAREHOUSE_LOAD_HISTORY(
      [ DATE_RANGE_START => <constant_expr> ]
      [, DATE_RANGE_END => <constant_expr> ]
      [, WAREHOUSE_NAME => '<string>' ] )
```

## Arguments

All the arguments are optional.

`DATE_RANGE_START => constant_expr` , . `DATE_RANGE_END => constant_expr`
:   The date range, within the last 14 days, for which to retrieve warehouse load history data:

    * If an end date is not specified, then [CURRENT_DATE](current_date.md) is used as the end of the range.
    * If a start date is not specified, then the range starts 10 minutes prior to the start of `DATE_RANGE_END` (i.e. the default is to show the previous 10 minutes of load history). For example,
      if `DATE_RANGE_END` is [CURRENT_DATE](current_date.md), then the default `DATE_RANGE_START` is 11:50 PM on the previous day.

    If the range falls outside the last 15 days, an error is returned.

    > **Note:**
    >
    > If the selected period is less than 8 hours, load is shown in 5-second intervals; otherwise, 5-minute intervals are used.

`WAREHOUSE_NAME => 'string'`
:   The name of the warehouse to retrieve usage load history for. Note that the warehouse name must be enclosed in single quotes. Also, if the warehouse name contains any spaces, mixed-case characters,
    or special characters, the name must be double-quoted within the single quotes (e.g. `'"My Warehouse"'` vs `'mywarehouse'`).

    Default: [CURRENT_WAREHOUSE](current_warehouse.md)

## Usage notes

* To get results from this function, one of the following roles or privileges are required:

  + The ACCOUNTADMIN role can get results from this function as it has all of the global account permissions.
  + A role with the MONITOR USAGE global privilege on the ACCOUNT can query this function for any warehouses in the account.
  + A role with the MONITOR privilege on the WAREHOUSE can query this function for the warehouse it has permissions on.
  + A role with the OWNERSHIP privilege on the WAREHOUSE has all permissions on the warehouse including MONITOR.

  For more details, see [Access control privileges](../../user-guide/security-access-control-privileges.md).
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Output

> **Note:**
>
> For the output columns of this function, the query load value is the ratio of the total execution time (in seconds) of all queries in a specific state in an interval by the total time (in seconds) for that interval.
>
> For example, if 276 seconds was the total time for 4 queries in a 5 minute (300 second) interval, then the query load value is 276 / 300 = 0.92.

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The start of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The end of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| AVG_RUNNING | NUMBER(38,2) | Query load value for queries executed. |
| AVG_QUEUED_LOAD | NUMBER(38,2) | Query load value for queries queued because the warehouse was overloaded. |
| AVG_QUEUED_PROVISIONING | NUMBER(38,2) | Query load value for queries queued because the warehouse was being provisioned. |
| AVG_BLOCKED | NUMBER(38,2) | Query load value for queries blocked by a transaction lock. |

## Examples

Retrieve the load history for the last hour, in 5-second intervals, for the warehouse currently in use for your session:

> ```sqlexample
> use warehouse mywarehouse;
>
> select *
> from table(information_schema.warehouse_load_history(date_range_start=>dateadd('hour',-1,current_timestamp())));
> ```

Retrieve the load history for the last 14 days, in 5-minute intervals, for the warehouse currently in use for your session:

> ```sqlexample
> use warehouse mywarehouse;
>
> select *
> from table(information_schema.warehouse_load_history(date_range_start=>dateadd('day',-14,current_date()), date_range_end=>current_date()));
> ```

---
title: WAREHOUSE_METERING_HISTORY
source: https://docs.snowflake.com/en/sql-reference/functions/warehouse_metering_history.md
section: SQL Functions
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# WAREHOUSE_METERING_HISTORY

This table function can be used in queries to return the hourly credit usage for a single warehouse (or all the warehouses in your account) within a specified date range.

> **Note:**
>
> This function returns credit usage within the last 6 months. However, if you are querying multiple warehouses over a lengthy time period,
> it might not return a complete data set. To obtain a complete data set, use the
> [ACCOUNT_USAGE view](../account-usage/warehouse_metering_history.md) instead.

See also:
:   [WAREHOUSE_LOAD_HISTORY](warehouse_load_history.md)

## Syntax

```sqlsyntax
WAREHOUSE_METERING_HISTORY(
      DATE_RANGE_START => <constant_expr>
      [ , DATE_RANGE_END => <constant_expr> ]
      [ , WAREHOUSE_NAME => '<string>' ] )
```

## Arguments

**Required:**

`DATE_RANGE_START => constant_expr`
:   The starting date, within the last 6 months, for which warehouse usage is returned.

**Optional:**

`DATE_RANGE_END => constant_expr`
:   The ending date, within the last 6 months, for which warehouse usage is returned.

    Default: [CURRENT_DATE](current_date.md) is used.

`WAREHOUSE_NAME => 'string'`
:   The name of the warehouse to retrieve credit usage for. Note that the warehouse name must be enclosed in single quotes. Also, if the warehouse name any spaces, mixed-case characters,
    or special characters, the name must be double-quoted within the single quotes (e.g. `'"My Warehouse"'` vs `'mywarehouse'`).

    Default: All warehouses that ran during the specified date range.

## Usage notes

* Returns results only for the ACCOUNTADMIN role or any role that has been explicitly granted the MONITOR USAGE global privilege.
* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA schema in use or the function name must be fully-qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).
* The order and structure of the arguments depends on whether the argument keywords (e.g. `DATE_RANGE_START`) are included:

  + The keywords are not required if the arguments are specified in order.
  + If the argument keywords are included, the arguments can be specified in any order.

## Output

The function returns the following columns, ordered by WAREHOUSE_NAME and START_TIME:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The beginning of the hour in which this warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The end of the hour in which this warehouse usage took place. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| CREDITS_USED | NUMBER | Number of credits billed for this warehouse in this hour. |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used for the warehouse in the hour. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER | Number of credits used for cloud services in the hour. |

## Examples

Retrieve hourly warehouse usage over the past 10 days for all warehouses that ran during this time period:

> ```sqlexample
> select *
> from table(information_schema.warehouse_metering_history(dateadd('days',-10,current_date())));
> ```

Retrieve hourly warehouse usage for the `testingwh` warehouse on a specified date:

> ```sqlexample
> select *
> from table(information_schema.warehouse_metering_history('2017-10-23', '2017-10-23', 'testingwh'));
> ```

---
title: WIDTH_BUCKET
source: https://docs.snowflake.com/en/sql-reference/functions/width_bucket.md
section: SQL Functions
---

Categories:
:   [Numeric functions](../functions-numeric.md)

# WIDTH_BUCKET

Constructs equi-width histograms, in which the histogram range is divided into intervals of identical size, and returns the bucket number into which the value of an expression falls, after
it has been evaluated. The function returns an integer value or null (if any input is null).

## Syntax

```sqlsyntax
WIDTH_BUCKET( <expr> , <min_value> , <max_value> , <num_buckets> )
```

## Arguments

`expr`
:   The expression for which the histogram is created. This expression must evaluate to a numeric value or to a value that can be implicitly converted to a numeric value.

    The value must be within the range of `-(2^53 - 1)` to `2^53 - 1` (inclusive).

`min_value` and `max_value`
:   The low and high end points of the acceptable range for the expression. The end points must also evaluate to numeric values and not be equal.

    The low and high end points must be within the range of `-(2^53 - 1)` to `2^53 - 1` (inclusive). In addition, the difference
    between these points must be less than `2^53` (i.e. `abs(max_value - min_value) < 2^53`).

`num_buckets`
:   The desired number of buckets; must be a positive integer value. A value from the expression is assigned to each bucket, and the function then returns the corresponding bucket number.

    When an expression falls outside the range, the function returns:

    * `0` if the expression is less than `min_value`.
    * `num_buckets + 1` if the expression is greater than or equal to `max_value`.

## Example

Create a four-bucket histogram on the `price` column for homes sold in the price range of $200 - 600k,
ordered by sales date. The function returns the bucket number (`SALES GROUP`) for each value in the set.

> Create and fill a table:
>
> > ```sqlexample
> > CREATE TABLE home_sales (
> >     sale_date DATE,
> >     price NUMBER(11, 2)
> >     );
> > INSERT INTO home_sales (sale_date, price) VALUES
> >     ('2013-08-01'::DATE, 290000.00),
> >     ('2014-02-01'::DATE, 320000.00),
> >     ('2015-04-01'::DATE, 399999.99),
> >     ('2016-04-01'::DATE, 400000.00),
> >     ('2017-04-01'::DATE, 470000.00),
> >     ('2018-04-01'::DATE, 510000.00);
> > ```
>
> Query the table, calling WIDTH_BUCKET():
>
> > ```sqlexample
> > SELECT
> >     sale_date,
> >     price,
> >     WIDTH_BUCKET(price, 200000, 600000, 4) AS "SALES GROUP"
> >   FROM home_sales
> >   ORDER BY sale_date;
> > +------------+-----------+-------------+
> > | SALE_DATE  |     PRICE | SALES GROUP |
> > |------------+-----------+-------------|
> > | 2013-08-01 | 290000.00 |           1 |
> > | 2014-02-01 | 320000.00 |           2 |
> > | 2015-04-01 | 399999.99 |           2 |
> > | 2016-04-01 | 400000.00 |           3 |
> > | 2017-04-01 | 470000.00 |           3 |
> > | 2018-04-01 | 510000.00 |           4 |
> > +------------+-----------+-------------+
> > ```

---
title: XMLGET
source: https://docs.snowflake.com/en/sql-reference/functions/xmlget.md
section: SQL Functions
---

Categories:
:   [Semi-structured and structured data functions](../functions-semistructured.md) (Extraction)

# XMLGET

Extracts an [XML](../../user-guide/semistructured-data-formats.md) element object (often referred to as simply a *tag*) from the content of
the outer XML element based on the name and instance number of the specified tag.

(Note that an XML tag is not the same as a Snowflake [data governance tag](../../user-guide/object-tagging/introduction.md).)

See also:
:   [CHECK_XML](check_xml.md), [PARSE_XML](parse_xml.md), [TO_XML](to_xml.md)

## Syntax

```sqlsyntax
XMLGET( <expression> , <tag_name> [ , <instance_number> ] )
```

## Arguments

`expression`
:   The expression from which to extract the element.

    The expression must evaluate to an [OBJECT](../data-types-semistructured.md) (or a VARIANT containing an OBJECT). The OBJECT must contain
    valid XML in the internal format that Snowflake supports. Typically, that means that the OBJECT was produced by one of the
    following:

    * Calling the [PARSE_XML](parse_xml.md) function.
    * Loading the data (e.g. via the [COPY INTO <table>](../sql/copy-into-table.md) command) and specifying that the data is in XML
      format.

    The XMLGET function does not operate directly on a VARCHAR expression even if that VARCHAR contains valid XML text.

`tag_name`
:   The name of an XML tag stored in the `expression`.

`instance_number`
:   If the XML contains multiple instances of `tag_name`, then use `instance_number` to specify which instance to
    retrieve. Like an array index, the `instance_number` is 0-based, not 1-based.

    `instance_number` can be omitted, in which case the default value 0 is used.

## Returns

The data type of the returned value is [OBJECT](../data-types-semistructured.md).

The function returns NULL in the following cases:

* If any argument of XMLGET is NULL.
* If the tag instance isn’t found.

See the Usage Notes for more details.

## Usage notes

* The result of XMLGET isn’t the content of the tag (that is, the text between the tags), but the entire element (the opening tag,
  content, and closing tag). From the returned OBJECT value, you can extract the tag name, the tag’s attribute values, and the contents
  of the element (including nested tags) by using the [GET](get.md) function:

  + To extract attribute values, use `GET(tag, '@attrname')`.
  + To extract the content, use `GET(tag, '$')`.
  + To extract the tag name, use `GET(tag, '@')`.
* You can extract nested tags by nesting XMLGET function calls. For example:

  ```sqlexample
  SELECT XMLGET(XMLGET(my_xml_column, 'my_tag'), 'my_inner_tag') ...;
  ```
* Positions of the inner tags in the content can be obtained by using `GET(tag, 'inner-tag-name')`. If the content contains
  multiple elements, the positions are represented as an array.
* You can’t use XMLGET to extract the outermost element. To get the outermost element, select the `expression`
  itself.

## Examples

The following example creates a table with an OBJECT that contains XML, then uses the XMLGET function to extract elements from
that OBJECT.

```sqlexample
CREATE OR REPLACE TABLE xml_demo (id INTEGER, object_col OBJECT);

INSERT INTO xml_demo (id, object_col)
  SELECT 1001,
    PARSE_XML('<level1> 1 <level2> 2 <level3> 3A </level3> <level3> 3B </level3> </level2> </level1>');
```

```sqlexample
SELECT object_col,
       XMLGET(object_col, 'level2'),
       XMLGET(XMLGET(object_col, 'level2'), 'level3', 1)
  FROM xml_demo;
```

```output
+-------------------------+------------------------------+---------------------------------------------------+
| OBJECT_COL              | XMLGET(OBJECT_COL, 'LEVEL2') | XMLGET(XMLGET(OBJECT_COL, 'LEVEL2'), 'LEVEL3', 1) |
|-------------------------+------------------------------+---------------------------------------------------|
| <level1>                | <level2>                     | <level3>3B</level3>                               |
|   1                     |   2                          |                                                   |
|   <level2>              |   <level3>3A</level3>        |                                                   |
|     2                   |   <level3>3B</level3>        |                                                   |
|     <level3>3A</level3> | </level2>                    |                                                   |
|     <level3>3B</level3> |                              |                                                   |
|   </level2>             |                              |                                                   |
| </level1>               |                              |                                                   |
+-------------------------+------------------------------+---------------------------------------------------+
```

This example shows how to use GET with XMLGET to retrieve the content of an element. In the example, the `level2` tag
contains three items (text and two nested tags), so GET returns these items in an [ARRAY](../data-types-semistructured.md). The
nested tags are represented by OBJECTs (key-value pairs). The `@` property contains the nested tag name and the `$` property
contains the nested tag contents.

```sqlexample
SELECT object_col,
       GET(XMLGET(object_col, 'level2'), '$') AS content_of_element
  FROM xml_demo;
```

```output
+-------------------------+--------------------+
| OBJECT_COL              | CONTENT_OF_ELEMENT |
|-------------------------+--------------------|
| <level1>                | [                  |
|   1                     |   2,               |
|   <level2>              |   {                |
|     2                   |     "$": "3A",     |
|     <level3>3A</level3> |     "@": "level3"  |
|     <level3>3B</level3> |   },               |
|   </level2>             |   {                |
| </level1>               |     "$": "3B",     |
|                         |     "@": "level3"  |
|                         |   }                |
|                         | ]                  |
+-------------------------+--------------------+
```

This example shows how to use GET with XMLGET to retrieve an attribute of a tag.

```sqlexample
INSERT INTO xml_demo (id, object_col)
  SELECT 1002,
      PARSE_XML('<level1> 1 <level2 an_attribute="my attribute"> 2 </level2> </level1>');
```

```sqlexample
SELECT object_col,
       GET(XMLGET(object_col, 'level2'), '@an_attribute') AS attribute
  FROM xml_demo
  WHERE ID = 1002;
```

```output
+--------------------------------------------------+----------------+
| OBJECT_COL                                       | ATTRIBUTE      |
|--------------------------------------------------+----------------|
| <level1>                                         | "my attribute" |
|   1                                              |                |
|   <level2 an_attribute="my attribute">2</level2> |                |
| </level1>                                        |                |
+--------------------------------------------------+----------------+
```

> **Note:**
>
> For more examples of queries that use the XMLGET function, see [Examples of working with XML](../../user-guide/semistructured-data-formats.md).

---
title: YEAR* / DAY* / WEEK* / MONTH / QUARTER
source: https://docs.snowflake.com/en/sql-reference/functions/year.md
section: SQL Functions
---

Categories:
:   [Date & time functions](../functions-date-time.md)

# YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER

Extracts the corresponding date part from a date or timestamp.

These functions are alternatives to using the [DATE_PART](date_part.md) (or [EXTRACT](extract.md)) function with the
equivalent date part (see [Supported date and time parts](../functions-date-time.md)).

See also:
:   [HOUR / MINUTE / SECOND](hour-minute-second.md)

## Syntax

```sqlsyntax
YEAR( <date_interval_or_timestamp_expr> )

YEAROFWEEK( <date_or_timestamp_expr> )
YEAROFWEEKISO( <date_or_timestamp_expr> )

DAY( <date_interval_or_timestamp_expr> )

DAYOFMONTH( <date_or_timestamp_expr> )
DAYOFWEEK( <date_or_timestamp_expr> )
DAYOFWEEKISO( <date_or_timestamp_expr> )
DAYOFYEAR( <date_or_timestamp_expr> )

WEEK( <date_or_timestamp_expr> )

WEEKOFYEAR( <date_or_timestamp_expr> )
WEEKISO( <date_or_timestamp_expr> )

MONTH( <date_interval_or_timestamp_expr> )

QUARTER( <date_or_timestamp_expr> )
```

## Arguments

`date_or_timestamp_expr`
:   A date or a timestamp, or an expression that can be evaluated to a date or a timestamp.

`date_interval_or_timestamp_expr`
:   A date, an interval, or a timestamp, or an expression that can be evaluated to a date, an interval, or a timestamp.

    When an interval value is passed to the function, the YEAR and MONTH functions support a year-month interval,
    or an expression that can be evaluated to a year-month interval. The DAY function supports a day-time interval,
    or an expression that can be evaluated to a day-time interval.

## Returns

This function returns a value of type NUMBER.

## Usage notes

| Function name | Date part extracted from input date or timestamp | Possible values |
| --- | --- | --- |
| YEAR | Year | Any valid year (for example, 2025) |
| YEAROFWEEK [1] | Year that the extracted week belongs to | Any valid year (for example, 2025) |
| YEAROFWEEKISO | Year that the extracted week belongs to using [ISO semantics](../functions-date-time.md) | Any valid year (for example, 2025) |
| DAY , DAYOFMONTH | Day (number) of the month | 1 to 31 |
| DAYOFWEEK [1] | Day (number) of the week dictated by [session parameters](../functions-date-time.md) | 0 to 7 |
| DAYOFWEEKISO | Day (number) of the week using [ISO semantics](../functions-date-time.md) | 1 to 7 |
| DAYOFYEAR | Day (number) of the year | 1 to 366 |
| WEEK , WEEKOFYEAR [1] | Week (number) of the year | 1 to 54 |
| WEEKISO | Week (number) of the year using [ISO semantics](../functions-date-time.md) | 1 to 53 |
| MONTH | Month (number) of the year | 1 to 12 |
| QUARTER | Quarter (number) of the year | 1 to 4 |

[1] Results dictated by the values set for the WEEK_OF_YEAR_POLICY and/or WEEK_START session parameters.

For details about ISO semantics and the parameter, see [Calendar weeks and weekdays](../functions-date-time.md).

## Examples

The following example demonstrates the use of the functions YEAR, QUARTER, MONTH, DAY, DAYOFWEEK,
and DAYOFYEAR:

```sqlexample
SELECT '2025-04-11T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       YEAR(tstamp) AS "YEAR",
       QUARTER(tstamp) AS "QUARTER OF YEAR",
       MONTH(tstamp) AS "MONTH",
       DAY(tstamp) AS "DAY",
       DAYOFMONTH(tstamp) AS "DAY OF MONTH",
       DAYOFYEAR(tstamp) AS "DAY OF YEAR";
```

```output
+-------------------------+------+-----------------+-------+-----+--------------+-------------+
| TSTAMP                  | YEAR | QUARTER OF YEAR | MONTH | DAY | DAY OF MONTH | DAY OF YEAR |
|-------------------------+------+-----------------+-------+-----+--------------+-------------|
| 2025-04-11 23:39:20.123 | 2025 |               2 |     4 |  11 |           11 |         101 |
+-------------------------+------+-----------------+-------+-----+--------------+-------------+
```

The following example demonstrates the use of the functions WEEK, WEEKISO, WEEKOFYEAR, YEAROFWEEK, and
YEAROFWEEKISO. The session parameter [WEEK_OF_YEAR_POLICY](../parameters.md) is set to `1`, so that the first week
of the year is the week that contains January 1st of that year.

```sqlexample
ALTER SESSION SET WEEK_OF_YEAR_POLICY = 1;
```

```sqlexample
SELECT '2016-01-02T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       WEEK(tstamp) AS "WEEK",
       WEEKISO(tstamp) AS "WEEK ISO",
       WEEKOFYEAR(tstamp) AS "WEEK OF YEAR",
       YEAROFWEEK(tstamp) AS "YEAR OF WEEK",
       YEAROFWEEKISO(tstamp) AS "YEAR OF WEEK ISO";
```

```output
+-------------------------+------+----------+--------------+--------------+------------------+
| TSTAMP                  | WEEK | WEEK ISO | WEEK OF YEAR | YEAR OF WEEK | YEAR OF WEEK ISO |
|-------------------------+------+----------+--------------+--------------+------------------|
| 2016-01-02 23:39:20.123 |    1 |       53 |            1 |         2016 |             2015 |
+-------------------------+------+----------+--------------+--------------+------------------+
```

The following example also demonstrates the use of the functions WEEK, WEEKISO, WEEKOFYEAR, YEAROFWEEK, and
YEAROFWEEKISO. The session parameter WEEK_OF_YEAR_POLICY is set to indicate that the first week
of the year is the first week of the year that contains at least four days from that year. In this example,
the week December 27, 2015 through January 2, 2016 is considered the last week of 2015, not the first week
of 2016. Even though the week contains Friday January 1, 2016, less than half of the week is in 2016.

```sqlexample
ALTER SESSION SET WEEK_OF_YEAR_POLICY = 0;
```

```sqlexample
SELECT '2016-01-02T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       WEEK(tstamp) AS "WEEK",
       WEEKISO(tstamp) AS "WEEK ISO",
       WEEKOFYEAR(tstamp) AS "WEEK OF YEAR",
       YEAROFWEEK(tstamp) AS "YEAR OF WEEK",
       YEAROFWEEKISO(tstamp) AS "YEAR OF WEEK ISO";
```

```output
+-------------------------+------+----------+--------------+--------------+------------------+
| TSTAMP                  | WEEK | WEEK ISO | WEEK OF YEAR | YEAR OF WEEK | YEAR OF WEEK ISO |
|-------------------------+------+----------+--------------+--------------+------------------|
| 2016-01-02 23:39:20.123 |   53 |       53 |           53 |         2015 |             2015 |
+-------------------------+------+----------+--------------+--------------+------------------+
```

The following example demonstrates the use of the functions DAYOFWEEK and DAYOFWEEKISO.
The session parameter [WEEK_START](../parameters.md) is set to indicate that the week starts on Sunday.

```sqlexample
ALTER SESSION SET WEEK_START = 7;
```

The timestamp in the following query is for April 5, 2025, which was a Saturday. The DAYOFWEEK function
returns `7` for Saturday, because the first day of the week is set to Sunday. The DAYOFWEEKISO function
returns `6` because the first day of the week using ISO semantics is Monday. For more information about ISO
semantics and the WEEK_START parameter, see [Calendar weeks and weekdays](../functions-date-time.md).

```sqlexample
SELECT '2025-04-05T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       DAYOFWEEK(tstamp) AS "DAY OF WEEK",
       DAYOFWEEKISO(tstamp) AS "DAY OF WEEK ISO";
```

```output
+-------------------------+-------------+-----------------+
| TSTAMP                  | DAY OF WEEK | DAY OF WEEK ISO |
|-------------------------+-------------+-----------------|
| 2025-04-05 23:39:20.123 |           7 |               6 |
+-------------------------+-------------+-----------------+
```

The following example also demonstrates the use of the functions DAYOFWEEK and DAYOFWEEKISO.
The session parameter WEEK_START is set to indicate that the week starts on Monday.

```sqlexample
ALTER SESSION SET WEEK_START = 1;
```

```sqlexample
SELECT '2025-04-05T23:39:20.123-07:00'::TIMESTAMP AS tstamp,
       DAYOFWEEK(tstamp) AS "DAY OF WEEK",
       DAYOFWEEKISO(tstamp) AS "DAY OF WEEK ISO";
```

```output
+-------------------------+-------------+-----------------+
| TSTAMP                  | DAY OF WEEK | DAY OF WEEK ISO |
|-------------------------+-------------+-----------------|
| 2025-04-05 23:39:20.123 |           6 |               6 |
+-------------------------+-------------+-----------------+
```

For more examples, see [Working with date and time values](../date-time-examples.md).

For more detailed examples of the week-related functions (DAYOFWEEK, WEEK, WEEKOFYEAR, YEAROFWEEK, and so on),
see [Calendar weeks and weekdays](../functions-date-time.md).

---
title: ZEROIFNULL
source: https://docs.snowflake.com/en/sql-reference/functions/zeroifnull.md
section: SQL Functions
---

Categories:
:   [Conditional expression functions](../expressions-conditional.md)

# ZEROIFNULL

Returns 0 if its argument is null; otherwise, returns its argument.

## Syntax

```sqlsyntax
ZEROIFNULL( <expr> )
```

## Arguments

`expr`
:   The input should be an expression that evaluates to a numeric value (or NULL).

## Returns

If the value of the input expressions is NULL, this returns 0.
Otherwise, this returns the value of the input expression.

The data type of the return value is `NUMBER(p, s)`. The exact values of ‘p’ (precision) and ‘s’ (scale) depend
upon the input expression. For example, if the input expression is 3.14159, then the data type of the output value
will be `NUMBER(7, 5)`.

## Examples

The following example shows the output of the function for various input values:

> ```sqlexample
> SELECT column1, ZEROIFNULL(column1)
>     FROM VALUES (1), (null), (5), (0), (3.14159);
> +---------+---------------------+
> | COLUMN1 | ZEROIFNULL(COLUMN1) |
> |---------+---------------------|
> | 1.00000 |             1.00000 |
> |    NULL |             0.00000 |
> | 5.00000 |             5.00000 |
> | 0.00000 |             0.00000 |
> | 3.14159 |             3.14159 |
> +---------+---------------------+
> ```

---
title: ZIPF
source: https://docs.snowflake.com/en/sql-reference/functions/zipf.md
section: SQL Functions
---

Categories:
:   [Data generation functions](../functions-data-generation.md)

# ZIPF

Returns a Zipf-distributed integer, for `N` elements and characteristic exponent `s`.

## Syntax

```sqlsyntax
ZIPF( <s> , <N> , <gen> )
```

## Usage notes

* The computational cost of choosing a single random number is logarithmic in the argument `N`. More importantly, the memory cost is linear for `N`. Because of this, the argument
  `N` is limited to the inclusive range `[1, 16777215]`.
* `gen` specifies the generator expression for the function. For more information, see [Usage notes](../functions-data-generation.md).
* The first two arguments (`s` and `N`) must be constants.

## Examples

```sqlexample
SELECT zipf(1, 10, random()) FROM table(generator(rowCount => 10));

+-----------------------+
| ZIPF(1, 10, RANDOM()) |
|-----------------------|
|                     9 |
|                     7 |
|                     1 |
|                     8 |
|                     8 |
|                     2 |
|                     3 |
|                     8 |
|                     2 |
|                     5 |
+-----------------------+
```

```sqlexample
SELECT zipf(1, 10, 1234) FROM table(generator(rowCount => 10));

+-------------------+
| ZIPF(1, 10, 1234) |
|-------------------|
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
|                 4 |
+-------------------+
```

## SQL Commands

DDL and DML command reference (CREATE, ALTER, DROP, SELECT, INSERT, and more).

---
title: ALTER <object>
source: https://docs.snowflake.com/en/sql-reference/sql/alter.md
section: SQL Commands
---

# ALTER *<object>*

Modifies the metadata of an account-level or database object, or the parameters for a session.

See also:
:   [CREATE <object>](create.md) , [DESCRIBE <object>](desc.md) , [SHOW <objects>](show.md)

## ALTER commands

For specific syntax, usage notes, and examples, see:

**Account and Session Operations:**

> * [ALTER ACCOUNT](alter-account.md) (account administrators only)
> * [ALTER SESSION](alter-session.md) (all users)

**Account Objects:**

> * [ALTER APPLICATION](alter-application.md)
> * [ALTER APPLICATION PACKAGE](alter-application-package.md)
> * [ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](alter-application-package-release-directive.md)
> * [ALTER APPLICATION PACKAGE … VERSION](alter-application-package-version.md)
> * [ALTER APPLICATION ROLE](alter-application-role.md)
> * [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md)
> * [ALTER CATALOG INTEGRATION](alter-catalog-integration.md)
> * [ALTER COMPUTE POOL](alter-compute-pool.md)
> * [ALTER CONNECTION](alter-connection.md)
> * [ALTER DATABASE](alter-database.md)
> * [ALTER DATABASE (catalog-linked)](alter-database-catalog-linked.md)
> * [ALTER DATABASE ROLE](alter-database-role.md)
> * [ALTER DYNAMIC TABLE](alter-dynamic-table.md)
> * [ALTER EXTERNAL ACCESS INTEGRATION](alter-external-access-integration.md)
> * [ALTER EXTERNAL VOLUME](alter-external-volume.md)
> * [ALTER FAILOVER GROUP](alter-failover-group.md)
> * [ALTER FEATURE POLICY](alter-feature-policy.md)
> * [ALTER NETWORK POLICY](alter-network-policy.md)
> * [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md)
> * [ALTER ORGANIZATION PROFILE](alter-organization-profile.md)
> * [ALTER POSTGRES INSTANCE](alter-postgres-instance.md)
> * [ALTER REPLICATION GROUP](alter-replication-group.md)
> * [ALTER RESOURCE MONITOR](alter-resource-monitor.md)
> * [ALTER SECURITY INTEGRATION](alter-security-integration.md)
> * [ALTER SHARE](alter-share.md)
> * [ALTER STORAGE INTEGRATION](alter-storage-integration.md)
> * [ALTER ROLE](alter-role.md)
> * [ALTER USER](alter-user.md)
> * [ALTER WAREHOUSE](alter-warehouse.md)

**Database Objects:**

> * [ALTER AGENT](alter-agent.md)
> * [ALTER AGGREGATION POLICY](alter-aggregation-policy.md)
> * [ALTER ALERT](alter-alert.md)
> * [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md)
> * [ALTER BACKUP POLICY](alter-backup-policy.md)
> * [ALTER BACKUP SET](alter-backup-set.md)
> * [ALTER CONTACT](alter-contact.md)
> * [ALTER CORTEX SEARCH SERVICE](alter-cortex-search.md)
> * [ALTER DATASET](alter-dataset.md)
> * [ALTER DATASET … ADD VERSION](alter-dataset-add-version.md)
> * [ALTER DATASET … DROP VERSION](alter-dataset-drop-version.md)
> * [ALTER DBT PROJECT](alter-dbt-project.md)
> * [ALTER DCM PROJECT](alter-dcm-project.md)
> * [ALTER EXPERIMENT](alter-experiment.md)
> * [ALTER EXTERNAL TABLE](alter-external-table.md)
> * [ALTER FILE FORMAT](alter-file-format.md)
> * [ALTER FUNCTION](alter-function.md)
> * [ALTER FUNCTION (DMF)](alter-function-dmf.md)
> * [ALTER GIT REPOSITORY](alter-git-repository.md)
> * [ALTER ICEBERG TABLE](alter-iceberg-table.md)
> * [ALTER JOIN POLICY](alter-join-policy.md)
> * [ALTER LISTING](alter-listing.md)
> * [ALTER MAINTENANCE POLICY](alter-maintenance-policy.md)
> * [ALTER MASKING POLICY](alter-masking-policy.md)
> * [ALTER MATERIALIZED VIEW](alter-materialized-view.md)
> * [ALTER MODEL](alter-model.md)
> * [ALTER MODEL … ADD VERSION](alter-model-add-version.md)
> * [ALTER MODEL … DROP VERSION](alter-model-drop-version.md)
> * [ALTER MODEL … MODIFY VERSION](alter-model-modify-version.md)
> * [ALTER MODEL MONITOR](alter-model-monitor.md)
> * [ALTER NETWORK RULE](alter-network-rule.md)
> * [ALTER NOTEBOOK](alter-notebook.md)
> * [ALTER OPENFLOW DATA PLANE](alter-oflow-data-plane.md)
> * [ALTER ONLINE FEATURE TABLE](alter-online-feature-table.md)
> * [ALTER PACKAGES POLICY](alter-packages-policy.md)
> * [ALTER PASSWORD POLICY](alter-password-policy.md)
> * [ALTER PIPE](alter-pipe.md)
> * [ALTER PRIVACY POLICY](alter-privacy-policy.md)
> * [ALTER PROCEDURE](alter-procedure.md)
> * [ALTER PROJECTION POLICY](alter-projection-policy.md)
> * [ALTER ROW ACCESS POLICY](alter-row-access-policy.md)
> * [ALTER SCHEMA](alter-schema.md)
> * [ALTER SECRET](alter-secret.md)
> * [ALTER SEMANTIC VIEW](alter-semantic-view.md)
> * [ALTER SEQUENCE](alter-sequence.md)
> * [ALTER SERVICE](alter-service.md)
> * [ALTER SESSION POLICY](alter-session-policy.md)
> * [ALTER SNAPSHOT](alter-snapshot.md)
> * [ALTER SNAPSHOT POLICY — Deprecated](alter-snapshot-policy.md) (deprecated; prefer [ALTER BACKUP POLICY](alter-backup-policy.md))
> * [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md) (deprecated; prefer [ALTER BACKUP SET](alter-backup-set.md))
> * [ALTER STAGE](alter-stage.md)
> * [ALTER STORAGE LIFECYCLE POLICY](alter-storage-lifecycle-policy.md)
> * [ALTER STREAM](alter-stream.md)
> * [ALTER STREAMLIT](alter-streamlit.md)
> * [ALTER TABLE](alter-table.md)
> * [ALTER TABLE (event tables)](alter-table-event-table.md)
> * [ALTER TAG](alter-tag.md)
> * [ALTER TASK](alter-task.md)
> * [ALTER TYPE](alter-type.md)
> * [ALTER VIEW](alter-view.md)

**Classes**:

> * [ALTER BUDGET](../classes/budget/commands/alter-budget.md)
> * [ALTER SNOWFLAKE.ML.CLASSIFICATION](../classes/classification/commands/alter-classification.md)

---
title: ALTER ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-account.md
section: SQL Commands
---

# ALTER ACCOUNT

Modifies an account. The ALTER ACCOUNT command has two purposes:

* Allows account administrators (that is, users with the ACCOUNTADMIN role) to modify [parameters](../parameters.md) and
  other settings at the account level. For example, the account administrator can set the resource monitor or enable a security feature for
  an account. For these actions, the account administrator executes ALTER ACCOUNT from the account being modified.
* Allows [organization administrators](../../user-guide/organization-administrators.md) to modify core characteristics of an account. For example, the
  organization administrator can rename an account. For these actions, the organization administrator executes ALTER ACCOUNT from a
  different account than the one being modified.

> **Note:**
>
> While ALTER ACCOUNT is primarily executed by account administrators and organization administrators, users with the SECURITYADMIN
> role can use it to set the network policy for the account.

## Syntax

The syntax for ALTER ACCOUNT varies depending on whether you are modifying the current account or a different account.

### Altering the current account

```sqlsyntax
ALTER ACCOUNT SET { [ accountProperties ] | [ accountParams ] | [ objectParams ] | [ sessionParams ] }

ALTER ACCOUNT UNSET <param_name> [ , ... ]

ALTER ACCOUNT SET RESOURCE_MONITOR = <monitor_name>

ALTER ACCOUNT ADD ORGANIZATION USER GROUP <group_name>
ALTER ACCOUNT REMOVE ORGANIZATION USER GROUP <group_name>

ALTER ACCOUNT SET { AUTHENTICATION | SESSION } POLICY <policy_name> [ { FOR ALL PERSON USERS | FOR ALL SERVICE USERS } ] [ FORCE ]

ALTER ACCOUNT UNSET { AUTHENTICATION | SESSION } POLICY [ { FOR ALL PERSON USERS | FOR ALL SERVICE USERS } ]

ALTER ACCOUNT SET FEATURE POLICY <policy_name> FOR ALL APPLICATIONS [ FORCE ]

ALTER ACCOUNT UNSET FEATURE POLICY FOR ALL APPLICATIONS

ALTER ACCOUNT SET MAINTENANCE POLICY <policy_name> [ FORCE ] FOR ALL APPLICATIONS

ALTER ACCOUNT UNSET MAINTENANCE POLICY FOR ALL APPLICATIONS

ALTER ACCOUNT SET { PACKAGES | PASSWORD } POLICY <policy_name> [ FORCE ]

ALTER ACCOUNT UNSET { PACKAGES | PASSWORD } POLICY

ALTER ACCOUNT SET CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ]

ALTER ACCOUNT UNSET CONTACT <purpose>

ALTER ACCOUNT SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER ACCOUNT UNSET TAG <tag_name> [ , <tag_name> ... ]
```

Where:

```sqlsyntax
accountProperties ::=
    LOGIN_IDP_REDIRECT = ( <interface> = <security_integration> [ , ... ] )
    OBJECT_VISIBILITY = { <object_visibility_spec> | PRIVILEGED }
```

```sqlsyntax
accountParams ::=
  ALLOW_ID_TOKEN = TRUE | FALSE
  ALLOWED_SPCS_WORKLOAD_TYPES = { '<list_of_workload_types>' | 'ALL' }
  CLIENT_ENCRYPTION_KEY_SIZE = <integer>
  CORTEX_ENABLED_CROSS_REGION = { 'DISABLED' | 'ANY_REGION' | '<list_of_regions>' }
  DISALLOWED_SPCS_WORKLOAD_TYPES = { '<list_of_workload_types>' | 'ALL' }
  DISABLE_USER_PRIVILEGE_GRANTS = TRUE | FALSE
  DEFAULT_DBT_VERSION = { '<version>' }
  ENABLE_EGRESS_COST_OPTIMIZER = TRUE | FALSE
  ENABLE_INTERNAL_STAGES_PRIVATELINK = TRUE | FALSE
  ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK = TRUE | FALSE
  ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES = TRUE | FALSE
  ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME = TRUE | FALSE
  ENABLE_NOTEBOOK_CREATION_IN_PERSONAL_DB = TRUE | FALSE
  ENABLE_SPCS_BLOCK_STORAGE_SNOWFLAKE_FULL_ENCRYPTION_ENFORCEMENT = TRUE | FALSE
  EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST = TRUE | FALSE
  INITIAL_REPLICATION_SIZE_LIMIT_IN_TB = <num>
  LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE = <schedule>
  LLM_INFERENCE_PARSE_DOCUMENT_PRESIGNED_URL_EXPIRY_SECONDS = <integer>
  NETWORK_POLICY = <string>
  OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST = TRUE | FALSE
  PERIODIC_DATA_REKEYING = TRUE | FALSE
  READ_CONSISTENCY_MODE = 'SESSION' | 'GLOBAL'
  REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_CREATION = TRUE | FALSE
  REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_OPERATION = TRUE | FALSE
  SAML_IDENTITY_PROVIDER = <json_object>
  SQL_TRACE_QUERY_TEXT = ON | OFF
  SSO_LOGIN_PAGE = TRUE | FALSE
  USE_WORKSPACES_FOR_SQL = { 'always' | 'never' }
```

```sqlsyntax
objectParams ::=
  BASE_LOCATION_PREFIX = '<string>'
  CATALOG = <catalog_integration_name>
  CATALOG_SYNC = '<snowflake_open_catalog_integration_name>'
  CORTEX_MODELS_ALLOWLIST = {'<list_of_models>' | 'ALL' | 'NONE'}
  DATA_RETENTION_TIME_IN_DAYS = <integer>
  DEFAULT_DDL_COLLATION = '<collation_specification>'
  DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU = <compute_pool_name>
  DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU = <compute_pool_name>
  DEFAULT_STREAMLIT_COMPUTE_POOL = <compute_pool_name>
  DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE = <warehouse_name>
  ENABLE_DATA_COMPACTION = { TRUE | FALSE }
  ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }
  ENABLE_TAG_PROPAGATION_EVENT_LOGGING = TRUE | FALSE
  ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR = TRUE | FALSE
  ENABLE_UNREDACTED_SECURE_OBJECT_ERROR = TRUE | FALSE
  EVENT_TABLE = <string>
  EXTERNAL_VOLUME = <external_volume_name>
  ICEBERG_VERSION_DEFAULT = <integer>
  LOG_LEVEL = <string>
  MAX_CONCURRENCY_LEVEL = <num>
  MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer>
  METRIC_LEVEL = <string>
  NETWORK_POLICY = <string>
  PIPE_EXECUTION_PAUSED = TRUE | FALSE
  PREVENT_UNLOAD_TO_INLINE_URL = TRUE | FALSE
  PREVENT_UNLOAD_TO_INTERNAL_STAGES = TRUE | FALSE
  REPLACE_INVALID_CHARACTERS = TRUE | FALSE
  STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
  STATEMENT_TIMEOUT_IN_SECONDS = <num>
  STORAGE_SERIALIZATION_POLICY = COMPATIBLE | OPTIMIZED
  TRACE_LEVEL = <string>
```

```sqlsyntax
sessionParams ::=
  ABORT_DETACHED_QUERY = TRUE | FALSE
  AUTOCOMMIT = TRUE | FALSE
  BINARY_INPUT_FORMAT = <string>
  BINARY_OUTPUT_FORMAT = <string>
  DATE_INPUT_FORMAT = <string>
  DATE_OUTPUT_FORMAT = <string>
  DEFAULT_NULL_ORDERING = <string>
  ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS = TRUE | FALSE
  ERROR_ON_NONDETERMINISTIC_MERGE = TRUE | FALSE
  ERROR_ON_NONDETERMINISTIC_UPDATE = TRUE | FALSE
  JSON_INDENT = <num>
  LOCK_TIMEOUT = <num>
  OPT_OUT_ERROR_LOGGING = TRUE | FALSE
  QUERY_TAG = <string>
  ROWS_PER_RESULTSET = <num>
  S3_STAGE_VPCE_DNS_NAME = <string>
  SEARCH_PATH = <string>
  SIMULATED_DATA_SHARING_CONSUMER = <string>
  STATEMENT_TIMEOUT_IN_SECONDS = <num>
  STRICT_JSON_OUTPUT = TRUE | FALSE
  TIMESTAMP_DAY_IS_ALWAYS_24H = TRUE | FALSE
  TIMESTAMP_INPUT_FORMAT = <string>
  TIMESTAMP_LTZ_OUTPUT_FORMAT = <string>
  TIMESTAMP_NTZ_OUTPUT_FORMAT = <string>
  TIMESTAMP_OUTPUT_FORMAT = <string>
  TIMESTAMP_TYPE_MAPPING = <string>
  TIMESTAMP_TZ_OUTPUT_FORMAT = <string>
  TIMEZONE = <string>
  TIME_INPUT_FORMAT = <string>
  TIME_OUTPUT_FORMAT = <string>
  TRANSACTION_DEFAULT_ISOLATION_LEVEL = <string>
  TWO_DIGIT_CENTURY_START = <num>
  UNSUPPORTED_DDL_ACTION = <string>
  USE_CACHED_RESULT = TRUE | FALSE
  WEEK_OF_YEAR_POLICY = <num>
  WEEK_START = <num>
```

> **Note:**
>
> For readability, the complete list of session parameters that can be set for an account is not included here. For a complete list of all session
> parameters, with their descriptions, as well as account and object parameters, see [Parameters](../parameters.md).

### Altering a different account

```sqlsyntax
ALTER ACCOUNT <name> SET IS_ORG_ADMIN = { TRUE | FALSE }

ALTER ACCOUNT <name> RENAME TO <new_name> [ SAVE_OLD_URL = { TRUE | FALSE } ]

ALTER ACCOUNT <name> DROP OLD URL

ALTER ACCOUNT <name> DROP OLD ORGANIZATION URL
```

## Account properties

You can set the following properties for the current account.

`SET property`
:   Specifies a property to set for your account:

> `LOGIN_IDP_REDIRECT = ( interface = security_integration [ , ... ] )`
> :   Specifies a mapping between Snowflake interfaces and
>     [SAML security integrations](../../user-guide/admin-security-fed-auth-security-integration.md). SAML security integrations are used to
>     implement single sign-on (SSO) authentication. If an interface is mapped to a SAML security integration, then users who access the
>     interface are redirected to the third-party identity provider (IdP) to authenticate; they never see the Snowflake login screen.
>
>     If you don’t want interface users automatically redirected to an IdP, specify `interface = NULL`. Possible interfaces are:
>
>     `DEFAULT = security_integration`
>     :   Specifies the default security integration. Unless overridden by another interface-to-integration mapping, users are automatically
>         directed to the integration’s IdP when they access any Snowflake interface. Use this mapping to define the security integration
>         for Snowsight.
>
>     `SNOWFLAKE_INTELLIGENCE = security_integration`
>     :   Specifies the security integration used to redirect unauthenticated users to an IdP when they access
>         [Snowflake Intelligence](../../user-guide/snowflake-cortex/snowflake-intelligence.md). This overrides the DEFAULT mapping
>         for Snowflake Intelligence. For more information, see [Redirect users to your identity provider](../../user-guide/snowflake-cortex/snowflake-intelligence/deploy-agents.md).
>
>     `STREAMLIT = security_integration`
>     :   Specifies the security integration used to redirect unauthenticated users to an IdP when they access
>         Streamlit in Snowflake app-viewer URLs. This overrides the DEFAULT mapping for Streamlit app-viewer URLs. For more information,
>         see [Redirect app viewers to your identity provider](../../developer-guide/streamlit/object-management/security.md).
>
>     `SPCS = security_integration`
>     :   Specifies the security integration used to redirect unauthenticated users to an IdP when they access
>         SPCS ingress endpoints. This overrides the DEFAULT mapping for SPCS ingress endpoints. For more information,
>         see [Ingress and your Identity Provider (IdP) considerations](../../developer-guide/snowpark-container-services/service-network-communications.md).
>
>     Default: Empty list `( )`
>
> `OBJECT_VISIBILITY = {object_visibility_spec | PRIVILEGED }`
> :   [Preview Feature](../../release-notes/preview-features.md) — Open
>
>     Available to all accounts.
>
>     Specifies the visibility of objects in the account, which controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md)
>     and enables users without explicit access privileges to find objects and request access.
>
>     * A YAML specification describing the visibility in one of the following formats:
>
>       ```sqlexample-yaml
>       $$
>       organization_targets:
>         - all_accounts_including_external
>       $$
>       ```
>
>       Or
>
>       ```sqlexample-yaml
>       $$
>       organization_targets:
>         - account: <account_name_1>
>         - account: <account_name_2>
>         - ...
>         - organization_user_group: <org_user_group_1>
>         - organization_user_group: <org_user_group_2>
>       $$
>       ```
>
>       In the syntax above:
>
>       + `all_accounts_including_external`: Specifies that all users in all accounts in the organization can see the object. This includes
>         all accounts within the organization, even those to which external parties may have been given access, such as
>         [reader accounts](../../user-guide/data-sharing-reader-create.md).
>       + `account: account_name`: Specifies that all users in the specified account can see the object. You can specify multiple accounts.
>         Note that `account` is the account name, not the account locator. You must specify only the account name, excluding the organization name.09-22
>       + `organization_user_group: org_user_group`: Specifies that the specified [organization user group](../../user-guide/organization-users.md) can
>         see the object in all accounts in the organization where the [organization user group has been imported](../../user-guide/organization-users.md).
>     * `PRIVILEGED`: Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object.
>       This is the default behavior in Snowflake.
>
>     For examples, see [Make database objects discoverable in Universal Search](../../user-guide/ui-snowsight/object-visibility-universal-search.md).
>
>     Default: `'PRIVILEGED'`

`UNSET property`
:   Reverts the specified account property to its default.

## Parameters for altering the current account

Use the following parameters when modifying the current account.

For more information about setting parameters at the account level, see [Parameter management](../../user-guide/admin-account-management.md). For details about a particular parameter, see [Parameters](../parameters.md).

`SET ...`
:   Specifies one (or more) account, session, object parameters, and object properties to set for your account (separated by blank spaces, commas, or new lines):

    * Account parameters cannot be changed by any other users.
    * Session and object parameters set at the account level serve only as defaults and can be changed by other users.

    For descriptions of the parameters you can set for your account, see [Parameters](../parameters.md).

`UNSET ...`
:   Specifies one (or more) account, session, and object parameters to unset for your account, which resets them to the system defaults.

    You can reset multiple properties with a single ALTER statement; however, each property must be separated by a comma. When resetting a
    property, specify only the name; specifying a value for the property will return an error.

`SET RESOURCE_MONITOR resource_monitor_name`
:   Special parameter that specifies the name of the resource monitor used to control all virtual warehouses created in the account.

    > **Important:**
    >
    > Setting a resource monitor at the account level does not impact any of the Snowflake-provided warehouses that Snowflake uses
    > for Snowpipe, automatic reclustering, or materialized views. The credits consumed by these warehouses do not count towards the
    > credit quota for an account-level resource monitor.
    >
    > For more details, see [Working with resource monitors](../../user-guide/resource-monitors.md).

`ADD ORGANIZATION USER GROUP group_name`
:   Imports an [organization user group](../../user-guide/organization-users.md) into the account. Organization users in the group are added to the
    account as user objects.

`REMOVE ORGANIZATION USER GROUP group_name`
:   Removes an [organization user group](../../user-guide/organization-users.md) from the account.

`SET { AUTHENTICATION | SESSION } POLICY policy_name [ { FOR ALL PERSON USERS | FOR ALL SERVICE USERS } ] [ FORCE ]`
:   Specifies the [authentication policy](../../user-guide/authentication-policies.md) or
    [session policy](../../user-guide/session-policies.md) for the account.

    The `FOR ALL PERSON USERS` clause applies the policy to users with their TYPE property set to NULL or PERSON.

    The `FOR ALL SERVICE USERS` clause applies the policy to users with their TYPE property set to SERVICE or
    LEGACY_SERVICE.

    If you don’t specify `FOR ALL SERVICE USERS` or `FOR ALL PERSON USERS`, then the policy applies to all users in the account.

    If you explicitly set a policy on a specific user or a specific user type, then that policy takes precedence over a policy applied to `FOR ALL SERVICE USERS` or `FOR ALL PERSON USERS`.

    If you specify FORCE, then policies you set on specific users or specific user types are overridden. You can use
    this if you don’t want to unset policies.

    If a policy is already set on the current account, you can use FORCE to set the policy without having to unset the
    existing policy first.

`SET FEATURE POLICY policy_name FOR ALL APPLICATIONS [ FORCE ]`
:   Specifies the feature policy to set for the account. If a feature policy
    is already set on the current account, you can use FORCE to set the feature policy
    without having to unset the feature policy first.

`UNSET FEATURE POLICY FOR ALL APPLICATIONS`
:   Unsets the feature policy for the account.

    If you already set a policy on the current account, then you can specify FORCE to set the policy without needing to unset an
    existing policy first.

`SET MAINTENANCE POLICY policy_name [ FORCE ] FOR ALL APPLICATIONS`
:   Specifies the [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md) to apply to all applications in the account. If a maintenance policy is already set on
    the account, you can use FORCE to set the maintenance policy without having to unset the
    maintenance policy first.

`UNSET MAINTENANCE POLICY FOR ALL APPLICATIONS`
:   Removes the maintenance policy from all applications in the account. When a maintenance policy is removed from all applications in an account,
    the account-level maintenance policy, if it exists, is applied.

`UNSET { AUTHENTICATION | SESSION } POLICY [ FOR ALL PERSON USERS | FOR ALL SERVICE USERS ]`
:   Unsets the [authentication policy](../../user-guide/authentication-policies.md) or
    [session policy](../../user-guide/session-policies.md) for the account.

    Specifying `FOR ALL SERVICE USERS` or `FOR ALL PERSON USERS` narrows the scope of the command; the policy is unset from the
    specified user type only instead of all users in the account.

`SET PACKAGES | PASSWORD POLICY policy_name [ FORCE ]`
:   Specifies the [packages policy](../../developer-guide/udf/python/packages-policy.md) or
    [password policy](../../user-guide/password-authentication.md) for the account.

    If you already set a policy on the current account, then you can specify FORCE to set the policy without needing to unset an
    existing policy first.

`UNSET { PACKAGES | PASSWORD } POLICY`
:   Unsets the [packages policy](../../developer-guide/udf/python/packages-policy.md) or
    [password policy](../../user-guide/password-authentication.md) for the account.

`SET CONTACT purpose = contact [ , purpose = contact ... ]`
:   Associate the account with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

`UNSET CONTACT {purpose}`
:   Removes the contact that was added to the account for the specified purpose.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Parameters for altering a different account

Use the following parameters when using the current account to modify a different account. Only [organization administrators](../../user-guide/organization-administrators.md) can use these parameters.

`name`
:   Specifies the name of the account that is being modified.

`SET`
:   Specifies an account property to set for the account.

    `IS_ORG_ADMIN = { TRUE | FALSE }`
    :   Sets an account property that determines whether the ORGADMIN role is enabled in the account.

        > **Note:**
        >
        > Using the ORGADMIN role in a regular account is being phased out. Organization administrators should use the
        > [organization account](../../user-guide/organization-accounts.md) to complete organization-level tasks.

        To enable the ORGADMIN role for an account, specify `SET IS_ORG_ADMIN = TRUE`.

        You cannot set the property to `FALSE` from the current account. As a workaround, enable the role in a different account,
        and then switch to that account before executing the ALTER ACCOUNT command.

        By default, the ORGADMIN role can be enabled in a maximum of 8 accounts. If your organization requires more accounts with the ORGADMIN
        role, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

`RENAME TO new_name`
:   Changes the name of an account to the specified name.

    The new name should conform with all the [requirements for account identifiers](../../user-guide/admin-account-identifier.md).

    Organization administrators cannot rename an account while they are logged in to it, so they must log in to a different account before
    executing the ALTER ACCOUNT command. If your organization consists of a single account that needs to be renamed, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    `SAVE_OLD_URL = { TRUE | FALSE }`
    :   Optional parameter used in conjunction with `RENAME TO` that preserves the [account URL](../../user-guide/organizations-connect.md) used to
        access Snowflake prior to renaming. By default, Snowflake saves the original URL, which means you can access the account with either
        the old URL or the URL that contains the new account name. When set to `FALSE`, you must use the new URL to access the account.

        Default:
        :   TRUE

`DROP OLD URL`
:   Removes the original [account URL](../../user-guide/organizations-connect.md) of an account that was renamed. Once the old URL is dropped, you must
    access the account with the URL that contains the new account name.

    If an account has an old account URL because it was moved to another organization, had its organization renamed, or was part of an
    organization that was merged, use the ALTER ACCOUNT … DROP OLD ORGANIZATION URL instead.

`DROP OLD ORGANIZATION URL`
:   Removes the original [account URL](../../user-guide/organizations-connect.md) of an account after one of the following occurs:

    * Account moved to another organization
    * Account had its organization renamed.
    * Account was part of an organization that was merged with another organization.

    If an account has an old account URL because the account, not the organization, was renamed, use the ALTER ACCOUNT … DROP OLD URL
    command instead.

## Usage notes

* Account parameters can be set only at the account level.
* Session and object parameters that are set using this command serve only as defaults:

  > + User parameters can be overridden at the individual user level.
  > + Session parameters can be overridden at the individual user and session level.
  > + Object parameters can be overridden at the individual object level.
* Setting a resource monitor at the account level controls the credit usage for all virtual warehouses created in the account, but does not impact
  the credit usage for any of the Snowflake-provided warehouses. For more details, see [Working with resource monitors](../../user-guide/resource-monitors.md).

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Associate a network policy named `mypolicy` with your account:

> ```sqlexample
> ALTER ACCOUNT SET NETWORK_POLICY = mypolicy;
> ```

Disable user privilege grants:

> ```sqlexample
> ALTER ACCOUNT SET DISABLE_USER_PRIVILEGE_GRANTS = TRUE;
> ```

Remove the network policy association from your account:

> ```sqlexample
> ALTER ACCOUNT UNSET NETWORK_POLICY;
> ```

Set the packages policy at the account level.

> ```sqlexample
> ALTER ACCOUNT SET PACKAGES POLICY packages_policy_prod_1 FORCE;
> ```
>
> > **Note:**
> >
> > If a packages policy is already set on the current account, you can use FORCE to set the packages policy without
> > having to unset the packages policy first.

Unset the packages policy.

> ```sqlexample
> ALTER ACCOUNT UNSET PACKAGES POLICY;
> ```

---
title: ALTER AGENT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-agent.md
section: SQL Commands
---

# ALTER AGENT

Modifies the properties or specification for an existing [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md).

See also:
:   [CREATE AGENT](create-agent.md), [DESCRIBE AGENT](desc-agent.md), [DROP AGENT](drop-agent.md), [SHOW AGENTS](show-agents.md)

## Syntax

```sqlsyntax
ALTER AGENT <name> SET
  [ COMMENT = '<string>' ]
  [ PROFILE = '<string>' ]

ALTER AGENT <name> MODIFY LIVE VERSION SET SPECIFICATION = <specification>
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the agent to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`SET ...`
:   Sets one or more specified properties or parameters for the agent:

    `COMMENT = comment`
    :   Specifies the description of the agent.

    `PROFILE = string`
    :   Specifies the agent profile information, such as display name, avatar, and color. Format the string as follows:

        ```none
        '{"display_name": "<display_name>", "avatar": "<avatar>", "color": "<color>"}'
        ```

        The following table describes the key-value pairs in the string:

        | Key | Type | Description |
        | --- | --- | --- |
        | `display_name` | String | Display name for the agent. |
        | `avatar` | String | Avatar image file name or identifier. |
        | `color` | String | Color theme for the agent (such as “blue”, “green”, “red”) |

`MODIFY LIVE VERSION SET SPECIFICATION specification`
:   Specifies the VARCHAR value containing the replacement settings for an agent as either a YAML or JSON object:

    * [Dollar-quoted literal](../data-types-text.md): $$ … $$
    * [Single-quoted string](../data-types-text.md): ‘…’

    The maximum length of the specification object is 100,000 bytes.

    > **Important:**
    >
    > The new specification completely replaces the existing one. Fields that are not included in the new specification are removed.

    The YAML object should have the following structure:

    ```yaml
    models:
      orchestration: <model_name>

    orchestration:
      budget:
          seconds: <number_of_seconds>
          tokens: <number_of_tokens>

    instructions:
      response: '<response_instructions>'
      orchestration: '<orchestration_instructions>'
      system: '<system_instructions>'
      sample_questions:
          - question: '<sample_question>'
            answer: '<sample_answer>'
          ...

    tools:
      - tool_spec:
          type: '<tool_type>'
          name: '<tool_name>'
          description: '<tool_description>'
          input_schema:
              type: 'object'
              properties:
                <property_name>:
                  type: '<property_type>'
                  description: '<property_description>'
              required: <required_property_names>
      ...

    tool_resources:
      <tool_name>:
        <resource_key>: '<resource_value>'
        ...
      ...
    ```

    The JSON object should have the following structure:

    ```none
    {
      "models": {
        "orchestration": "<model_name>"
      },
      "orchestration": {
        "budget": {
          "seconds": <number_of_seconds>,
          "tokens": <number_of_tokens>
        }
      },
      "instructions": {
        "response": "<response_instructions>",
        "orchestration": "<orchestration_instructions>",
        "system": "<system_instructions>",
        "sample_questions": [
          {
            "question": "<sample_question>",
            "answer": "<sample_answer>"
          }
        ]
      },
      "tools": [
        {
          "tool_spec": {
            "type": "<tool_type>",
            "name": "<tool_name>",
            "description": "<tool_description>",
            "input_schema": {
              "type": "object",
              "properties": {
                "<property_name>": {
                  "type": "<property_type>",
                  "description": "<property_description>"
                }
              },
              "required": ["<required_property_names>"]
            }
          }
        }
      ],
      "tool_resources": {
        "<tool_name>": {
          "<resource_key>": "<resource_value>"
        }
      }
    }
    ```

    The following table describes the key-value pairs in this object:

    | Key | Type | Description |
    | --- | --- | --- |
    | `models` | [ModelConfig](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
    | `orchestration` | [OrchestrationConfig](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional orchestration configuration, including budget constraints (e.g., seconds, tokens). |
    | `instructions` | [AgentInstructions](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | Optional instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
    | `tools` | array of [Tool](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional list of tools available for the agent to use. Each tool includes a `tool_spec` with type, name, description, and input schema. Tools may have a corresponding configuration in `tool_resources`. |
    | `tool_resources` | map of [ToolResource](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MODIFY | Agent | Required to modify the agent properties or specification.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When modifying a live version’s specification, the new specification completely replaces the existing one.
  Fields that are not included in the new specification are removed.
* Both YAML and JSON formats are supported for specifications.
* Invalid specification fields result in an error.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Update the comment for an agent:

```sqlexample
ALTER AGENT my_support_agent SET COMMENT = 'Customer support agent for product inquiries';
```

Update the profile for an agent:

```sqlexample
ALTER AGENT my_support_agent SET PROFILE = '{"display_name": "Support Bot", "avatar": "bot-icon.png"}';
```

Update both the comment and profile together:

```sqlexample
ALTER AGENT my_support_agent
  SET COMMENT = 'Production support agent',
      PROFILE = '{"display_name": "Customer Assistant", "avatar": "assistant.png"}';
```

Update the live version specification using YAML format:

```sqlexample-yaml
ALTER AGENT my_support_agent
  MODIFY LIVE VERSION SET SPECIFICATION =
  $$
  models:
    orchestration: claude-4-sonnet

  orchestration:
    budget:
      seconds: 30
      tokens: 50000

  instructions:
    system: "You are a helpful customer support assistant."
    response: "Always be concise and accurate."
    sample_questions:
      - question: "What is the status of my order?"
        answer: "I can help you check your order status. Please provide your order number."
  $$;
```

Update the live version specification using JSON format:

```sqlexample
ALTER AGENT my_support_agent
  MODIFY LIVE VERSION SET SPECIFICATION = '{"models":{"orchestration":"claude-4-sonnet"},"orchestration":{"budget":{"seconds":45,"tokens":80000}}}';
```

---
title: ALTER AGGREGATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-aggregation-policy.md
section: SQL Commands
---

# ALTER AGGREGATION POLICY

Replaces the existing rules or comment of an [aggregation policy](../../user-guide/aggregation-policies.md). Also allows you to rename an
aggregation policy.

See also:
:   [Aggregation policy DDL reference](../../user-guide/aggregation-policies.md)

## Syntax

```sqlsyntax
ALTER AGGREGATION POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER AGGREGATION POLICY [ IF EXISTS ] <name> SET BODY -> <expression>

ALTER AGGREGATION POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER AGGREGATION POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER AGGREGATION POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER AGGREGATION POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the aggregation policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the aggregation policy; must be unique for your schema. The new identifier cannot be used if the
    identifier is already in place for a different aggregation policy.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the aggregation policy:

    `BODY -> expression`
    :   SQL expression that determines the restrictions of an aggregation policy.

        To define the constraints of the aggregation policy, use the SQL expression to call one or more of the following functions:

        NO_AGGREGATION_CONSTRAINT
        :   When the policy body returns a value from this function, queries can return data from an aggregation-constrained table or view
            without restriction. For example, the body of the policy could call this function when an administrator needs to obtain unaggregated
            results from the aggregation-constrained table or view.

            Call NO_AGGREGATION_CONSTRAINT without an argument.

        AGGREGATION_CONSTRAINT
        :   When the policy body returns a value from this function, queries must aggregate data in order to return results. Use the
            MIN_GROUP_SIZE argument to specify how many records must be included in each aggregation group.

            The syntax of the AGGREGATION_CONSTRAINT function is:

            ```sqlsyntax
            AGGREGATION_CONSTRAINT ( MIN_GROUP_SIZE => <integer_expression> )
            ```

            Where:

            `MIN_GROUP_SIZE => integer_expression`
            :   Specifies how many rows or [entities](../../user-guide/aggregation-policies-entity-privacy.md) must be included in the groups returned by
                a query against the aggregation-constrained table or view.

                There is a difference between passing a `1` and a `0` as the argument to the function. Both require results to be aggregated.

                * Passing a `1` also requires that each aggregation group contain at least one record from the aggregation-constrained table. So for
                  outer joins, at least one record from the aggregation-constrained table must match a record from an unprotected table.
                * Passing a `0` allows the query to return groups that consist entirely of records from another table. So for outer joins between an
                  aggregation-constrained table and an unprotected table, a group could consist of records from the unprotected table that do not match
                  any records in the aggregation-constrained table.

        The body of a policy cannot reference user-defined functions, tables, or views.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the aggregation policy.

        Default: No value

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset, by resetting them to their defaults, for the aggregation policy:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Aggregation policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on aggregation policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* If you want to update an existing aggregation policy and need to see the current body of the policy, run the
  [DESCRIBE AGGREGATION POLICY](desc-aggregation-policy.md) command. You can also use the [GET_DDL](../functions/get_ddl.md) function to
  obtain the full definition of the aggregation policy, including its body.
* Moving an aggregation policy to a [managed access schema](../../user-guide/security-access-control-configure.md)
  (using the ALTER AGGREGATION POLICY … RENAME TO syntax) is prohibited unless the aggregation policy owner
  (i.e. the role that has the OWNERSHIP privilege on the aggregation policy) also owns the target schema.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Change the SQL expression of the aggregation policy to require a minimum group size of 2 rows in all circumstances:

> ```sqlexample
> ALTER AGGREGATION POLICY my_policy SET BODY -> AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE=>2);
> ```

Rename an aggregation policy:

> ```sqlexample
> ALTER AGGREGATION POLICY my_policy RENAME TO agg_policy_table1;
> ```

---
title: ALTER ALERT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-alert.md
section: SQL Commands
---

# ALTER ALERT

Modifies the properties of an existing alert and suspends or resumes an existing [alert](../../user-guide/alerts.md).

See also:
:   [CREATE ALERT](create-alert.md) , [DESCRIBE ALERT](desc-alert.md), [DROP ALERT](drop-alert.md) , [SHOW ALERTS](show-alerts.md) , [EXECUTE ALERT](execute-alert.md)

## Syntax

```sqlsyntax
ALTER ALERT [ IF EXISTS ] <name> { RESUME | SUSPEND };

ALTER ALERT [ IF EXISTS ] <name> SET
  [ WAREHOUSE = <string> ]
  [ SCHEDULE = '{ <number> MINUTE | USING CRON <expr> <time_zone> }' ]
  [ COMMENT = '<string_literal>' ]

ALTER ALERT [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER ALERT [ IF EXISTS ] <name> UNSET
  [ WAREHOUSE ]
  [ COMMENT ]

ALTER ALERT <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER ALERT [ IF EXISTS ] <name> MODIFY CONDITION EXISTS (<condition>)

ALTER ALERT [ IF EXISTS ] <name> MODIFY ACTION <action>
```

## Parameters

`name`
:   Identifier for the alert to alter. If the identifier contains spaces or special characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`{ RESUME | SUSPEND }`
:   Specifies the action to perform on the alert:

    * `RESUME` makes a suspended alert active.
    * `SUSPEND` puts the alert into a “Suspended” state.

    If the alert schedule is set to an interval (i.e. `num MINUTE`), then to avoid ambiguity, the *base interval time* for
    the schedule is reset to the current time when the alert is resumed.

    The base interval time starts the interval counter from the current clock time. For example, if an alert is created with
    `10 MINUTE` and the alert is resumed at 9:03 AM, then the alert runs at 9:13 AM, 9:23 AM, and so on. Note that we make a best
    effort to ensure absolute precision, but only guarantee that alerts do not execute before their set interval occurs
    (e.g., in the current example, the alert could first run at 9:14 AM, but will definitely not run at 9:12 AM).

`SET ...`
:   Specifies one (or more) properties to set for the alert (separated by blank spaces, commas, or new lines).

    `WAREHOUSE = warehouse_name`
    :   Specifies the [virtual warehouse](../../user-guide/warehouses.md) that provides compute resources for executing this alert.

        > **Note:**
        >
        > For [serverless alerts](../../user-guide/alerts.md), do not set this property.

    `SCHEDULE ...`
    :   Specifies the schedule for periodically evaluating the condition for the alert on a schedule.

        When you create an alert, omitting this parameter or setting it to NULL creates an
        [alert on new data](../../user-guide/alerts.md).

        For alerts on a schedule, you can specify the schedule in one of the following ways:

        * `USING CRON expr time_zone`

          Specifies a cron expression and time zone for periodically evaluating the condition for the alert. Supports a subset of
          standard cron utility syntax.

          The cron expression consists of the following fields:

          ```bash
          # __________ minute (0-59)
          # | ________ hour (0-23)
          # | | ______ day of month (1-31, or L)
          # | | | ____ month (1-12, JAN-DEC)
          # | | | | _ day of week (0-6, SUN-SAT, or L)
          # | | | | |
          # | | | | |
            * * * * *
          ```

          The following special characters are supported:

          | Special Character | Description |
          | --- | --- |
          | `*` | Wildcard. When specified for a given field, the alert runs at every unit of time for that field.  For example, `*` in the month field specifies that the alert runs every month. |
          | `L` | Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a given month. In the day-of-month field, it specifies the last day of the month. |
          | `/n` | Indicates the `n`th instance of a given unit of time. Each quanta of time is computed independently.  For example, if `4/3` is specified in the month field, then the evaluation of the condition is scheduled for April, July and October (i.e. every 3 months, starting with the 4th month of the year).  The same schedule is maintained in subsequent years. That is, the condition is not scheduled to be evaluated in January (3 months after the October run). |

          > **Note:**
          > + The cron expression currently evaluates against the specified time zone only. Altering the
          >   [TIMEZONE](../parameters.md) parameter value for the account (or setting the value at the user or session level) does not
          >   change the time zone for the alert.
          > + The cron expression defines all valid times for the evaluation of the condition for the alert. Snowflake attempts
          >   to evaluate the condition based on this schedule; however, any valid run time is skipped if a previous run has not
          >   completed before the next valid run time starts.
          > + When both a specific day of month and day of week are included in the cron expression, then the evaluation of the
          >   condition is scheduled on days satisfying either the day of month or day of week. For example,
          >   `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'` schedules an evaluation at 0AM on any 10th to 20th day of the month
          >   and also on any Tuesday or Thursday outside of those dates.
        * `num MINUTE`

          Specifies an interval (in minutes) of wait time inserted between evaluations of the alert. Accepts positive integers only.

          Also supports `num M` syntax.

          To avoid ambiguity, a *base interval time* is set when the alert is resumed (using
          ALTER ALERT … RESUME).

          The base interval time starts the interval counter from the current clock time. For example, if an alert is created with
          `10 MINUTE` and the alert is resumed at 9:03 AM, then the condition for the alert is evaluated at 9:13 AM, 9:23 AM, and so
          on. Note that we make a best effort to ensure absolute precision, but only guarantee that conditions are not evaluated
          before their set interval occurs (e.g. in the current example, the condition could be evaluated first at 9:14 AM but
          definitely not at 9:12 AM).

          > **Note:**
          >
          > The maximum supported value is `11520` (8 days). Alerts that have a greater `num MINUTE` value never have their
          > conditions evaluated.

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the alert.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the alert, which resets them back to their defaults:

    * `WAREHOUSE`
    * `COMMENT`
    * `TAG tag_key [ , tag_key ... ]`

`MODIFY CONDITION EXISTS (condition)`
:   Specifies the SQL statement that should represent the condition for the alert. You can use the following commands:

    * [SELECT](select.md)
    * [SHOW <objects>](show.md)
    * [CALL](call.md)

    If the statement returns one or more rows, the action for the alert is executed.

`MODIFY ACTION action`
:   Specifies the SQL statement that should be executed if the condition returns one or more rows.

    To send a notification, you can
    [call the SYSTEM$SEND_EMAIL or SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure](../../user-guide/notifications/about-notifications.md).

## Access control requirements

Executing this SQL command requires [roles](../../user-guide/security-access-control-overview.md) with the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

* To resume an alert:

  + The role executing ALTER ALERT must have either the OPERATE or OWNERSHIP privilege on the alert.
  + The role with the OWNERSHIP privilege on the alert must also have the following privileges:

    - The global EXECUTE ALERT privilege.
    - The global EXECUTE MANAGED ALERT privilege, if the alert is a [serverless alert](../../user-guide/alerts.md).
    - The USAGE privilege on the warehouse, if the [alert uses a specified warehouse](../../user-guide/alerts.md).
* To suspend an alert, the role executing ALTER ALERT must have either the OPERATE or OWNERSHIP privilege on the alert.
* To modify the properties of the alert, the role executing ALTER ALERT must have the OWNERSHIP privilege on the alert.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You cannot change an [alert on new data](../../user-guide/alerts.md) to an
  [alert on a schedule](../../user-guide/alerts.md). Similarly, you cannot change an alert on a schedule to an alert
  on new data.
* When an alert is resumed, Snowflake verifies that the role with the OWNERSHIP privilege on the alert also has the USAGE
  privilege on the warehouse assigned to the alert, as well as the global EXECUTE ALERT privilege; if not, an error is produced.
* Only account administrators (users with the ACCOUNTADMIN role) can grant the EXECUTE ALERT privilege to a role. For ease of use,
  we recommend creating a custom role (e.g. alert_admin) and assigning the EXECUTE ALERT privilege to this role. Any role that can
  grant privileges (e.g. SECURITYADMIN or any role with the MANAGE GRANTS privilege) can then grant this custom role to any alert
  owner role to allow altering their own alerts. For instructions for creating custom roles and role hierarchies, see
  [Configuring access control](../../user-guide/security-access-control-configure.md).

* When you execute CREATE ALERT or ALTER ALERT, some validation checks are not performed on the statements in the condition and
  action, including:

  + The resolution of the identifiers for objects.
  + The resolution of the data types of expressions.
  + The verification of the number and types of arguments in a function call.

  The CREATE ALERT and ALTER ALERT commands do not fail if the SQL statement for a condition or action specifies an invalid
  identifier, incorrect data type, incorrect number and types of function arguments, etc. Instead, the failure occurs when the
  alert executes.

  To check for failures in an existing alert, use the [ALERT_HISTORY](../functions/alert_history.md) table function.

  To avoid these types of failures, before you specify the conditions and actions for alerts, verify the SQL expressions and
  statements for those conditions and actions.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

See [Suspending and resuming an alert](../../user-guide/alerts.md).

---
title: ALTER API INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-api-integration.md
section: SQL Commands
---

# ALTER API INTEGRATION

Modifies the properties of an existing API integration.

See also:
:   [CREATE API INTEGRATION](create-api-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
ALTER [ API ] INTEGRATION [ IF EXISTS ] <name> SET
  [ API_AWS_ROLE_ARN = '<iam_role>' ]
  [ AZURE_AD_APPLICATION_ID = '<azure_application_id>' ]
  [ API_KEY = '<api_key>' ]
  [ ENABLED = { TRUE | FALSE } ]
  [ API_ALLOWED_PREFIXES = ('<...>') ]
  [ API_BLOCKED_PREFIXES = ('<...>') ]
  [ ALLOWED_AUTHENTICATION_SECRETS = ( { <secret_name> [, <secret_name>, ... ] } ) | all | none ]
  [ COMMENT = '<string_literal>' ]

ALTER [ API ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ API ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ API ] INTEGRATION [ IF EXISTS ] <name>  UNSET {
                                                      API_KEY              |
                                                      ENABLED              |
                                                      API_BLOCKED_PREFIXES |
                                                      COMMENT
                                                      }
                                                      [ , ... ]
```

## Parameters

`name`
:   The identifier of the integration to alter. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for API integration (separated by blank spaces, commas, or new lines):

    `ENABLED = TRUE | FALSE`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` allows the integration to run.
        * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `ALLOWED_AUTHENTICATION_SECRETS = <secret_name> [, <secret_name> ... ] | all | none`
    :   Specifies the secrets that UDF or procedure handler code can use when accessing the Git repository at the API_ALLOWED_PREFIXES value. You
        specify a secret from this list when specifying Git credentials with the [GIT_CREDENTIALS parameter](create-git-repository.md).

        This parameter’s value must be one of the following:

        * One or more fully-qualified Snowflake secret names to allow any of the listed secrets.
        * (Default) `all` to allow any secret.
        * `none` to allow no secrets.

        For reference information about secrets, refer to [CREATE SECRET](create-secret.md).

    `API_AWS_ROLE_ARN = '<iam_role>'`
    :   The `iam_role` is the ARN (Amazon resource name) of a cloud platform role.

        This parameter applies only if the API_PROVIDER is set to `aws_api_gateway`.

    `AZURE_AD_APPLICATION_ID = '<azure_application_id>'`
    :   The “Application (client) id” of the Azure AD (Active Directory) app for your remote service.

        This parameter applies only if the API_PROVIDER is set to `azure_api_management`.

    `API_KEY = '<api_key>'`
    :   The [API key](../external-functions-security.md) (also called a “subscription key”).

    `API_ALLOWED_PREFIXES = ('<...>')`
    :   Explicitly limits external functions that use the integration to reference one or more HTTPS proxy
        service endpoints (e.g. Amazon AWS API Gateway) and resources within those proxies. Supports a comma-separated
        list of URLs, which are treated as prefixes (for details, see below).

        Each URL in `API_ALLOWED_PREFIXES = (...)` is treated as a prefix. For example, if you specify:

        `https://xyz.amazonaws.com/production/`

        that means all resources under

        `https://xyz.amazonaws.com/production/`

        are allowed. For example the following is allowed:

        `https://xyz.amazonaws.com/production/ml1`

        To maximize security, you should restrict allowed locations as narrowly as practical.

    `API_BLOCKED_PREFIXES = ('<...>')`
    :   Lists the endpoints and resources in the HTTPS proxy service that are not allowed to be called from Snowflake.

        The possible values for locations follow the same rules as for `API_ALLOWED_PREFIXES` above.

        API_BLOCKED_PREFIXES takes precedence over API_ALLOWED_PREFIXES. If a prefix matches both, then it is blocked.
        In other words, Snowflake allows all values that match API_ALLOWED_PREFIXES except values that also
        match API_BLOCKED_PREFIXES.

        If a value is outside API_ALLOWED_PREFIXES, you do not need to explicitly block it.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`

    > String (literal) that specifies a comment for the integration.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the API integration, which resets them back to their defaults:

    * `ENABLED`
    * `API_BLOCKED_PREFIXES`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* The API_PROVIDER cannot be changed.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

```sqlexample
ALTER API INTEGRATION myint SET ENABLED = TRUE;
```

---
title: ALTER APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application.md
section: SQL Commands
---

# ALTER APPLICATION

Modifies the properties of an installed Snowflake Native App. Use ALTER APPLICATION to upgrade an app to a
specific version or patch. This command is also used to set other properties for an app.

See also:
:   [CREATE APPLICATION](create-application.md), [DESCRIBE APPLICATION](desc-application.md), [DROP APPLICATION](drop-application.md), [SHOW APPLICATIONS](show-applications.md)

## Syntax

```sqlsyntax
ALTER APPLICATION [ IF EXISTS ] <name> SET
  [ COMMENT = '<string-literal>' ]
  [ SHARE_EVENTS_WITH_PROVIDER = { TRUE | FALSE } ]
  [ DEBUG_MODE = { TRUE | FALSE } ]

ALTER APPLICATION [ IF EXISTS ] <name> UNSET
  [ COMMENT ]
  [ SHARE_EVENTS_WITH_PROVIDER ]
  [ DEBUG_MODE ]

ALTER APPLICATION [ IF EXISTS ] <name> RENAME TO <new_app_name>

ALTER APPLICATION <name> SET FEATURE POLICY <policy_name> [ FORCE ]

ALTER APPLICATION <name> UNSET FEATURE POLICY;

ALTER APPLICATION <name> SET MAINTENANCE POLICY <policy_name> [ FORCE ]

ALTER APPLICATION <name> UNSET MAINTENANCE POLICY

ALTER APPLICATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER APPLICATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER APPLICATION <name> SET SHARED TELEMETRY EVENTS ('<event_definition' [ , <event_definition>, ...])

ALTER APPLICATION <name> SET AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE }

ALTER APPLICATION <name> UNSET REFERENCES [ ( '<reference_name>' [ , '<reference_alias>' ] ) ]

ALTER APPLICATION <name> UPGRADE

ALTER APPLICATION <name> UPGRADE USING VERSION <version_name> [ PATCH <patch_num> ]

ALTER APPLICATION <name> UPGRADE USING <path_to_stage>
```

## Parameters

`name`
:   Specifies the identifier for the app being altered. If the identifier contains
    spaces, special characters, or mixed-case characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET`
:   Specifies one (or more) properties to set for the app (separated by blank spaces, commas, or new lines). For more details
    about the properties you can set, see [CREATE APPLICATION](create-application.md).

    `COMMENT = '{string}'`
    :   Adds a comment or overwrites an existing comment for the app.

    `DEBUG_MODE = { TRUE | FALSE }`
    :   Enables or disables debug mode for the installed app.

        * `TRUE` enables debug mode for the installed app.
        * `FALSE` disables debug mode for the installed app.

        You can only set `DEBUG_MODE` on the app if the following conditions are met:

        > * The installed app is in the same account as the application package.
        > * The installed app must have been created in development mode.
        >
        >   Development mode is installed with an explicit stage, version, or patch.
        > * You have OWNERSHIP privileges on the installed app and your role has been granted
        >   the DEVELOP privilege on the application package used to create the installed app.

    `SHARE_EVENTS_WITH_PROVIDER = { TRUE | FALSE }`
    :   Specifies whether to share logs and event data with the provider.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET`
:   Specifies one (or more) properties and/or session parameters to unset for the app, which resets them to the defaults.

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be
    separated by a comma. When resetting a property/parameter, specify only the name; specifying a value for the
    property/parameter will return an error.

    * `COMMENT`
    * `DEBUG_MODE`
      Disables debug mode for the installed app. This clause is semantically the same as setting `DEBUG_MODE = FALSE`.
    * `TAG tag_name [ , tag_name ... ]`
    * `REFERENCES[ ( 'reference_name' [, 'reference_alias' ] ) ]`

      [Unsets a persistent reference](../references.md) for an app. If no arguments are passed,
      unsets all persistent references set for the app.

`RENAME TO new_app_name`
:   Specifies a new identifier for the app. This identifier must be unique for
    your account.

`SET FEATURE POLICY policy_name [ FORCE ]`
:   Specifies the feature policy to apply to the app. If a feature policy is already set on
    the app, you can use FORCE to set the feature policy without having to unset the
    feature policy first.

`UNSET FEATURE POLICY`
:   Removes the feature policy from the app. When a feature policy is removed from an app
    the account-level feature policy, if it exists, is applied.

`SET MAINTENANCE POLICY policy_name [ FORCE ]`
:   Specifies the [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md) to apply to the app. If a maintenance policy is already set on
    the app, you can use FORCE to set the maintenance policy without having to unset the
    maintenance policy first.

`UNSET MAINTENANCE POLICY`
:   Removes the maintenance policy from the app. When a maintenance policy is removed from an app,
    the account-level maintenance policy, if it exists, is applied.

`SET SHARED TELEMETRY EVENTS ( 'event_definition' [ , event_definition, ... ] )`
:   Specifies the optional event definition to enable for an app.

`SET AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE }`
:   When set to TRUE, enables all required event definitions for an app. However, optional event definitions
    remain disabled. Use the SET SHARED TELEMETRY EVENTS clause to set optional event definitions for an app.

    > **Caution:**
    >
    > After setting this value to TRUE, you cannot reset the value back to FALSE if there are required event
    > definitions in the app.

`UNSET REFERENCES[ ( 'reference_name' [ , 'reference_alias' ] ) ]`
:   Removes the specified references from the app.

`UPGRADE`
:   Upgrades the app if the provider has published a new version or patch for the app.

    An app is automatically upgraded when the provider sets the release directive of the app. However, this command may be used to
    begin the upgrade immediately without waiting for automatic upgrade to take place. This command may only be used on apps
    that were not created in development mode. Apps in development mode are installed from a listing or without specifying a stage
    or version, and are primarily intended to test the upgrade process.

`UPGRADE USING VERSION version_name [ PATCH patch_num ]`
:   Upgrades the app to the specified version. If `patch_num` is not specified,
    the latest patch is used. This command is only valid for apps that were installed by
    specifying a version and patch.

`UPGRADE USING path_to_stage`
:   Upgrades the app using files on a named stage at the path specified by `path_to_stage`.

    This clause applies only if you installed the app from a named stage.

## Usage notes

* If you do not specify values for optional parameters, values for these parameters are taken from the `manifest.yml` file. If you
  specify values in both the manifest and when running the command, values specified in the command take precedence.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

---
title: ALTER APPLICATION DROP CONFIGURATION DEFINITION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-drop-configuration-definition.md
section: SQL Commands
---

# ALTER APPLICATION DROP CONFIGURATION DEFINITION

Deletes the [app configuration definition](../../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App.

> **Note:**
>
> This command can only be used by a Snowflake Native App.

See also:
:   [ALTER APPLICATION SET CONFIGURATION DEFINITION](alter-application-set-configuration-definition.md)

## Syntax

```sqlsyntax
ALTER APPLICATION DROP CONFIGURATION DEFINITION {config};
```

## Parameters

`config`
:   Identifier for the app configuration definition.

---
title: ALTER APPLICATION DROP SPECIFICATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-drop-app-spec.md
section: SQL Commands
---

# ALTER APPLICATION DROP SPECIFICATION

Drops an app specification from an app.

> **Note:**
>
> This command can only be used by a Snowflake Native App.

See also:
:   [ALTER APPLICATION SET SPECIFICATION](alter-application-set-app-spec.md), [ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION](alter-application-sequence-number.md)

## Syntax

```sqlsyntax
ALTER APPLICATION DROP SPECIFICATION <app_spec_name>;
```

## Parameters

`app_spec_name`
:   The name of the app specification.

---
title: ALTER APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-package.md
section: SQL Commands
---

# ALTER APPLICATION PACKAGE

Modifies the properties of an existing application package.

See also:
:   [CREATE APPLICATION PACKAGE](create-application-package.md), [DROP APPLICATION PACKAGE](drop-application-package.md), [SHOW APPLICATION PACKAGES](show-application-packages.md),
    [SHOW VERSIONS IN APPLICATION PACKAGE](show-versions.md), [SHOW RELEASE DIRECTIVES](show-release-directives.md)

## Syntax

```sqlsyntax
ALTER APPLICATION PACKAGE [ IF EXISTS ] <name> SET
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COMMENT = <string-literal> ]
  [ DISTRIBUTION = { INTERNAL | EXTERNAL } ]
  [ MULTIPLE_INSTANCES = TRUE ]
  [ ENABLE_RELEASE_CHANNELS = TRUE ]
  [ LISTING_AUTO_REFRESH = { TRUE | FALSE } ]
  [ AUTOMATIC_APPLICATION_MAINTENANCE = { TRUE | FALSE } ]

ALTER APPLICATION PACKAGE [ IF EXISTS ] <name> UNSET
  [ DATA_RETENTION_TIME_IN_DAYS ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS ]
  [ DEFAULT_DDL_COLLATION ]
  [ COMMENT  = <string-literal> ]
  [ DISTRIBUTION = { INTERNAL | EXTERNAL } ]

ALTER APPLICATION PACKAGE <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER APPLICATION PACKAGE <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the application package to alter. If the identifier contains
    spaces, special characters, or mixed-case characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one (or more) properties to set for the application package (separated by blank spaces, commas, or new lines):

    `DATA_RETENTION_TIME_IN_DAYS = num`
    :   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the database, as well as specifying the
        default Time Travel retention time for all schemas created in the database.

        The value you can specify depends on the Snowflake Edition you are using:

        * Standard Edition: `0` or `1`
        * Enterprise Edition (or higher): `0` to `90`

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in the database
        to prevent streams on the tables from becoming stale.

        For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `DEFAULT_DDL_COLLATION = 'collation_specification'`
    :   Specifies a default [collation specification](../collation.md) for:

        * Any new columns added to existing tables in the database.
        * All columns in new tables added to the database.

        Setting the parameter does not change the collation specification for any existing columns.

        For more information about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the database.

    `DISTRIBUTION = { INTERNAL | EXTERNAL }`
    :   Specifies the type of listing a provider can create when using the application package as the data product of a listing.

        * `INTERNAL` indicates that a provider can only create a private listing within the same organization
          where the application package was created. The automated security scan is not performed
          when the DISTRIBUTION property is set to INTERNAL.
        * `EXTERNAL` indicates that a provider can create listings outside the same organization where
          the application package was created.

        See [Run the automated security scan](../../developer-guide/native-apps/security-run-scan.md) for information on setting the DISTRIBUTION property and
        the automated security scan.

        > **Note:**
        >
        > Setting the `DISTRIBUTION` parameter to `EXTERNAL` triggers an automated security review for each
        > active version and patch defined in the application package.
        >
        > The following restrictions apply until the automated security review has a status of `APPROVED`:
        >
        > * You cannot set a release directive for a version or patch.
        > * You cannot publish a listing for the application package.

    `LISTING_AUTO_REFRESH = TRUE | FALSE`
    :   When set to TRUE, initiates replication to all remote regions when there is a change to the release directive of the application package. When a release directive changes, the application package does not wait for the Cross-Cloud Auto-Fulfillment schedule.

    `MULTIPLE_INSTANCES = TRUE`
    :   Enables the consumer to install multiple instances of an app from the application package. This property cannot be
        set for application packages that are included in a trial or paid listing.

        When multiple instances are allowed, consumers can install a maximum of 10 instances of an app in their account.

        > **Caution:**
        >
        > After setting this property to true, it cannot be set to `FALSE` or unset later.

    `ENABLE_RELEASE_CHANNELS = TRUE`
    :   Enables [release channels](../../developer-guide/native-apps/release-channels.md) for the application package.

        > **Caution:**
        >
        > After setting this property to `TRUE`, it cannot be set to `FALSE` or unset later.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    `AUTOMATIC_APPLICATION_MAINTENANCE = { TRUE | FALSE }`
    :   When set to TRUE, aligns Snowpark Container Services compute pool node software upgrades with the consumer’s maintenance
        window. The application upgrades first, then any compute pool node maintenance follows within the
        same maintenance window.

        For more information, see [Consumer-controlled maintenance policies: Provider guide](../../developer-guide/native-apps/consumer-maintenance-policies-provider.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the application package, which resets
    them to the defaults:

    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `EXTERNAL_VOLUME`
    * `CATALOG`
    * `DEFAULT_DDL_COLLATION`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`
    * `LISTING_AUTO_REFRESH`
    * `AUTOMATIC_APPLICATION_MAINTENANCE`

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by a
    comma. When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Usage notes

* If you do not specify the values for the optional properties, the command uses the values specified in the
  manifest file of the app.

  If you specify values for the properties in the command and in the manifest file of the app, the values specified in the command take precedence.
* If two versions are active (e.g. if the current version has not finished rolling out), adding a new version results in an error.
* New versions are added with a default patch number of 0.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package SET
  COMMENT = 'Altered the Hello Snowflake app.';
```

```output
+-------------------------------------------+
| status                                    |
|-------------------------------------------|
| Statement executed successfully.          |
+-------------------------------------------+
```

---
title: ALTER APPLICATION PACKAGE … MODIFY RELEASE CHANNEL
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-package-release-channel.md
section: SQL Commands
---

# ALTER APPLICATION PACKAGE … MODIFY RELEASE CHANNEL

Modifies the release channels defined for an existing application package. Use this command
to modify a release channel, change the version or patch assigned to a release channel, or
set the release directive for a release channel.

> **Note:**
>
> The syntax in this topic only applies to application packages that use release channels. For more information, see
> [Publish an app using release channels](../../developer-guide/native-apps/release-channels.md). To set the release directive
> for an application package that does not use release channels, see
> [ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](alter-application-package-release-directive.md).

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md) , [ALTER APPLICATION PACKAGE … VERSION](alter-application-package-version.md),
    [ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](alter-application-package-release-directive.md)
    [SHOW RELEASE DIRECTIVES](show-release-directives.md)

## Syntax

```sqlsyntax
ALTER APPLICATION PACKAGE <name>
  MODIFY RELEASE CHANNEL <release_channel>
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = <version_identifier>
  PATCH = <patch_num>
  [ UPGRADE_AFTER = '<timestamp>' ]
  [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
  [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name>
  MODIFY RELEASE CHANNEL <release_channel>
  SET RELEASE DIRECTIVE <release_directive>
  ACCOUNTS = ( <organization_name>.<account_name> [ , <organization_name>.<account_name> , ... ] )
  VERSION = <version_identifier>
  PATCH = <patch_num>
  [ UPGRADE_AFTER = '<timestamp>' ]
  [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
  [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name>
 MODIFY RELEASE CHANNEL <release_channel>
 MODIFY RELEASE DIRECTIVE <release_directive>
 VERSION = <version_identifier>
 PATCH = <patch_num>
 [ UPGRADE_AFTER = '<timestamp>' ]
 [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
 [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name>
  MODIFY RELEASE CHANNEL <release_channel>
  UNSET RELEASE DIRECTIVE <release_directive>
```

## Parameters

`name`
:   Specifies the identifier for the application package. If the identifier contains spaces, special characters, or mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`MODIFY RELEASE CHANNEL release_channel`
:   Specifies the release channel that this release directive applies to. If not specified, the release directive applies to all release channels.

    The supported values are:

    * ALPHA
    * QA
    * DEFAULT

    For more information about release channels, see [Publish an app using release channels](../../developer-guide/native-apps/release-channels.md).

`VERSION = version_identifier` . `PATCH = patch_num`
:   Modifies the version and patch level of the specified custom release directive.

`SET`
:   Specifies one or more properties to set for the application package, separated by blank spaces, commas, or new lines. For more details
    about the properties you can set, see [CREATE APPLICATION](create-application.md).

    `DEFAULT RELEASE DIRECTIVE VERSION = version_identifier PATCH = patch_num`
    :   Sets the version and patch level of the application package that should be installed for consumers by default.

    `RELEASE DIRECTIVE release_directive` . `ACCOUNTS = ( organization_name.account_name [ , organization_name.account_name , ... ] )` . `VERSION = version_identifier` . `PATCH = patch_num`
    :   Creates a custom release directive for the specified accounts.

        Use the ACCOUNTS clause to specify the list of accounts that this release directive applies to.

        Use the VERSION and PATCH clauses to specify the version identifier and patch number to be installed for these accounts.

`UPGRADE_AFTER = 'timestamp'`
:   Specifies the date and time when the automated upgrade process begins. Consumers can manually
    upgrade an app to a new version or patch before this date.

    This value can be any valid date and time format.

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

`UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE }`
:   When set to TRUE, upgrades respect consumer maintenance policies. Instead of upgrading immediately,
    the upgrade is delayed until the consumer’s next maintenance window or until the upgrade deadline
    is reached, whichever comes first.

    When this parameter is set to TRUE, the UPGRADE_DEADLINE parameter is required.

    You can’t set the UPGRADE_AFTER and UPGRADE_IN_MAINTENANCE_WINDOW parameters at the same time.
    If you try to set both, the command fails with an error.

    For more information, see [Consumer-controlled maintenance policies: Provider guide](../../developer-guide/native-apps/consumer-maintenance-policies-provider.md).

`UPGRADE_DEADLINE = 'timestamp'`
:   Required when UPGRADE_IN_MAINTENANCE_WINDOW is set to TRUE. Specifies the deadline by which the
    upgrade must be completed. After this time, the system automatically upgrades the application
    regardless of the consumer’s maintenance policy.

    Set the deadline to a date and time that allows sufficient time for consumers to complete the
    upgrade within their maintenance windows.

`UNSET`
:   Specifies one or more properties and/or session parameters to unset for the application package, which resets them to the defaults.

    `UNSET RELEASE DIRECTIVE release_directive`
    :   Removes the specified custom release directive from the application package.

## Usage notes

* Modifying the release directive requires the OWNERSHIP privilege on the application or the global MANAGE VERSIONS privilege.
* If you do not specify the values for the optional properties, the command uses the values specified in the application
  manifest file.
* If you specify values for the properties in the command and in the application manifest file, the values specified in
  the command take precedence.

## Examples

The following example adds version `V1` to the default release channel:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL DEFAULT
  ADD VERSION V1;
```

```output
+---------------------------------------------------------------------------------------------------------+
| status                                                                                                  |
|---------------------------------------------------------------------------------------------------------|
| Version V1 added to release channel DEFAULT in application package my_app_package                       |
+---------------------------------------------------------------------------------------------------------+
```

The following example modifies the default release directive of the default release channel to set the version to
`V1` and the patch to `0`:

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL DEFAULT
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = V1
  PATCH=0;
```

```output
+---------------------------------------------------------------------------------------------------------+
| status                                                                                                  |
|---------------------------------------------------------------------------------------------------------|
| Version V1 added to release channel DEFAULT in application package my_app_package                       |
+---------------------------------------------------------------------------------------------------------+
```

```sqlexample
ALTER APPLICATION PACKAGE my_app_package
  MODIFY RELEASE CHANNEL ALPHA
  ADD ACCOUNTS=(PM.CONNECTORS);
```

```output
+---------------------------------------------------------------------------------------+---------+-------+
| status                                                                                | version | patch |
|---------------------------------------------------------------------------------------+---------+-------|
| TBD                                                                                   |         |       |
+---------------------------------------------------------------------------------------+---------+-------+
```

---
title: ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-package-release-directive.md
section: SQL Commands
---

# ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE

Modifies the properties of an existing application package. Use this command to modify a release directive to a new version or patch.

> **Note:**
>
> The syntax described in this topic is only applicable to application packages that do not use release channels. To modify the release directive of an application package that uses release channels, see [ALTER APPLICATION PACKAGE … MODIFY RELEASE CHANNEL](alter-application-package-release-channel.md).

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md) , [ALTER APPLICATION PACKAGE … VERSION](alter-application-package-version.md)

## Syntax

```sqlsyntax
ALTER APPLICATION PACKAGE <name>
  MODIFY RELEASE DIRECTIVE <release_directive>
  VERSION = <version_identifier>
  PATCH = <patch_num>
  [ UPGRADE_AFTER = '<timestamp>' ]
  [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
  [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name>
  SET DEFAULT RELEASE DIRECTIVE
  VERSION = <version_identifier>
  PATCH = <patch_num>
  [ UPGRADE_AFTER = '<timestamp>' ]
  [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
  [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name>
  SET RELEASE DIRECTIVE <release_directive>
  ACCOUNTS = ( <organization_name>.<account_name> [ , <organization_name>.<account_name> , ... ] )
  VERSION = <version_identifier>
  PATCH = <patch_num>
  [ UPGRADE_AFTER = '<timestamp>' ]
  [ UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE } ]
  [ UPGRADE_DEADLINE = '<timestamp>' ]

ALTER APPLICATION PACKAGE <name> UNSET RELEASE DIRECTIVE <release_directive>
```

## Parameters

`name`
:   Specifies the identifier for the application package. If the identifier contains spaces, special characters, or mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`MODIFY RELEASE DIRECTIVE release_directive` . `VERSION = version_identifier` . `PATCH = patch_num`
:   Modifies the version and patch level of the specified custom release directive.

`SET`
:   Specifies one (or more) properties to set for the application package (separated by blank spaces, commas, or new lines). For more details
    about the properties you can set, see [CREATE APPLICATION](create-application.md).

    `DEFAULT RELEASE DIRECTIVE VERSION = version_identifier PATCH = patch_num`
    :   Sets the version and patch level of the application package that should be installed for consumers by default.

    `RELEASE DIRECTIVE release_directive` . `ACCOUNTS = ( organization_name.account_name [ , organization_name.account_name , ... ] )` . `VERSION = version_identifier` . `PATCH = patch_num`
    :   Creates a custom release directive for the specified accounts.

        Use the ACCOUNTS clause to specify the list of accounts to which this release directive applies.

        Use the VERSION and PATCH clauses to specify the version identifier and patch number to be installed for these accounts.

`UPGRADE_AFTER = 'timestamp'`
:   Specifies the date and time when the automated upgrade process begins. Consumers can manually
    upgrade an app to a new version or patch before this date.

    This value can be any valid date and time format.

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

`UPGRADE_IN_MAINTENANCE_WINDOW = { TRUE | FALSE }`
:   When set to TRUE, upgrades respect consumer maintenance policies. Instead of upgrading immediately,
    the upgrade is delayed until the consumer’s next maintenance window or until the upgrade deadline
    is reached, whichever comes first.

    When this parameter is set to TRUE, the UPGRADE_DEADLINE parameter is required.

    You can’t set the UPGRADE_AFTER and UPGRADE_IN_MAINTENANCE_WINDOW parameters at the same time.
    If you try to set both, the command fails with an error.

    For more information, see [Consumer-controlled maintenance policies: Provider guide](../../developer-guide/native-apps/consumer-maintenance-policies-provider.md).

`UPGRADE_DEADLINE = 'timestamp'`
:   Required when UPGRADE_IN_MAINTENANCE_WINDOW is set to TRUE. Specifies the deadline by which the
    upgrade must be completed. After this time, the system automatically upgrades the application
    regardless of the consumer’s maintenance policy.

    Set the deadline to a date and time that allows sufficient time for consumers to complete the
    upgrade within their maintenance windows.

`UNSET`
:   Specifies one (or more) properties and/or session parameters to unset for the application package, which resets them to the defaults.

    `UNSET RELEASE DIRECTIVE release_directive`
    :   Removes the specified custom release directive from the application package.

## Usage notes

* Modifying the release directive requires the OWNERSHIP privilege on the application or the global MANAGE VERSIONS privilege.
* If you do not specify the values for the optional properties, the command uses the values specified in the application manifest file.

  If you specify values for the properties in the command and in the application manifest file, the values specified in the command take
  precedence.

---
title: ALTER APPLICATION PACKAGE … VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-package-version.md
section: SQL Commands
---

# ALTER APPLICATION PACKAGE … VERSION

Modifies the versioning of an existing application package in the Snowflake Native App Framework.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md) , [ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](alter-application-package-release-directive.md)

## Syntax

```sqlsyntax
ALTER APPLICATION PACKAGE <name> ADD VERSION [ <version_identifier> ]
  USING <path_to_version_directory> [ LABEL = '<display_label>' ]

ALTER APPLICATION PACKAGE <name> DROP VERSION <version_identifier>

ALTER APPLICATION PACKAGE <name> ADD PATCH [<patch_number>] FOR VERSION [<version_identifier>]
  USING <path_to_version_directory> [ LABEL = '<display_label>' ]
```

## Parameters

`name`
:   Specifies the identifier for the application package to alter. If the identifier contains
    spaces, special characters, or mixed-case characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`ADD VERSION [ version_identifier ] USING path_to_version_directory`
:   Adds a version or patch using the application files located in the path to a stage location specified by
    `path_to_version_directory`.

    You can specify an identifier for this version using `version_identifier`. If you do
    not specify a `version_identifier` in the manifest file, you must specify a
    `version_identifier` as part of this command. If you specify `version_identifier`
    as part of this command, it takes precedence over `version_identifier` specified
    in the manifest file.

`[ LABEL = 'display_label' ]`
:   You can use the LABEL clause to specify a label for this new version. This label is displayed
    to the consumer. If you omit the LABEL clause, the label specified in the `manifest.yml`
    file is used.

`DROP VERSION version_identifier`
:   Drops the version with the specified version name.

    Drops a version with the specified version identifier. A version may only be dropped when
    there are no release directives that are referring to it. Dropping is an asynchronous
    process and completes when all application instances have successfully upgraded from the
    older version and no longer have code running on the dropping version.

    Use the [APPLICATION_STATE view](../data-sharing-usage/application-state-view.md) view to monitor
    the state of the application instances. Use the [SHOW VERSIONS IN APPLICATION PACKAGE](show-versions.md) command to monitor the
    status of the dropped version.

`ADD PATCH patch_number` `FOR VERSION version_identifier` . `USING path_to_version_directory [ LABEL = 'display_label' ]`
:   Adds a patch for the specified version (`version_identifier`) using the application files located in the specified path to a
    stage location (`path_to_version_directory`).

    You can use the LABEL clause to specify a label for this new patch. This label is displayed to the consumer. If you omit the LABEL
    clause, the label specified in the `manifest.yml` file is used.

## Usage notes

* Version identifiers have a maximum limit of 30 characters.
* A single version can have up to 130 patches.
* Modifying the version requires a role with the OWNERSHIP privilege on the application or the global MANAGE VERSIONS privilege.
* If you do not specify the values for the optional properties, the command uses the values specified in the
  application manifest file.

  If you specify values for the properties in the command and in the application manifest file, the values
  specified in the command take precedence.
* If two versions are active, for example, if the current version has not finished rolling out, adding
  a new version results in an error.

## Examples

```sqlexample
ALTER APPLICATION PACKAGE hello_snowflake_package
  ADD VERSION v1_1
  USING '@hello_snowflake_code.core.hello_snowflake_stage';
```

```output
+---------------------------------------------------------------------------------------+---------+-------+
| status                                                                                | version | patch |
|---------------------------------------------------------------------------------------+---------+-------|
| Version 'v1_1' of application package 'hello_snowflake_package' created successfully. | v1_1    |     0 |
+---------------------------------------------------------------------------------------+---------+-------+
```

---
title: ALTER APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-role.md
section: SQL Commands
---

# ALTER APPLICATION ROLE

Modifies the properties for an existing application role.

See also:
:   [CREATE APPLICATION ROLE](create-application-role.md), [GRANT APPLICATION ROLE](grant-application-role.md),
    [REVOKE APPLICATION ROLE](revoke-application-role.md), [SHOW APPLICATION ROLES](show-application-roles.md)

## Syntax

```sqlsyntax
ALTER APPLICATION ROLE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER APPLICATION ROLE [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER APPLICATION ROLE [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the application role. If the identifier contains spaces or
    special characters, the entire string must be enclosed in double quotes. Identifiers enclosed
    in double quotes are also case-sensitive.

`RENAME TO new_name`
:   Specifies the new identifier for the application role. The identifier must be unique
    for within the application.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    Note that when specifying the fully-qualified name of the application role, you cannot specify a
    different application. The name of the application, `application_name`, must remain the same.
    Only the `application_role_name` can change during a rename operation.

`SET ...`
:   Specifies the properties to set for the application role:

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the application role.

`UNSET ...`
:   Specifies the properties to unset for the application role, which resets them to the defaults.

    * `COMMENT`

## Usage notes

* This command can only be run in the context of an application created using the Native
  Apps Framework.
* Only the application role owner (i.e. the role with the OWNERSHIP privilege on the application
  role), or a higher role, can run this command.
* Renaming an application role is only allowed within the same application.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

```sqlexample
ALTER APPLICATION ROLE app_role RENAME TO new_app_role;
```

```sqlexample
ALTER APPLICATION ROLE app_role SET
  COMMENT = 'Application role for the Hello Snowflake application.';
```

```sqlexample
ALTER APPLICATION ROLE app_role UNSET COMMENT;
```

---
title: ALTER APPLICATION SET CONFIGURATION DEFINITION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-set-configuration-definition.md
section: SQL Commands
---

# ALTER APPLICATION SET CONFIGURATION DEFINITION

Creates or updates an [app configuration](../../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App.

> **Note:**
>
> This command can only be used by a Snowflake Native App.

See also:
:   [ALTER APPLICATION DROP CONFIGURATION DEFINITION](alter-application-drop-configuration-definition.md)

## Syntax

```sqlsyntax
ALTER APPLICATION SET CONFIGURATION DEFINITION <config>
  TYPE = {APPLICATION_NAME | STRING}
  LABEL = '<label>'
  DESCRIPTION = '<description>'
  APPLICATION_ROLES = ( <app_role1> [ , <app_role2> ... ] );
```

## Parameters

`config`
:   Identifier for the app configuration.

`TYPE`
:   Specifies the type of app configuration. Supported values are:

    * `APPLICATION_NAME`
    * `STRING`

`LABEL = 'label'`
:   Specifies a label for the app specification to be displayed in the Snowsight.

`DESCRIPTION = 'description'`
:   Specifies a description of the app specification. Snowflake recommends
    including information about the app specification type and why it is
    required by the app.

`APPLICATION_ROLES = ( <app_role1> [ , <app_role2> ... ] )`
:   Specifies the application roles that will have access to the app configuration object.

## Usage notes

* This command can only be used by a Snowflake Native App.
* When creating a configuration definition for the server app name for inter-app communication, you must set the `LABEL` and `DESCRIPTION` parameters to the same values as the `LABEL` and `DESCRIPTION` parameters of the associated `APPLICATION SPECIFICATION` object.

---
title: ALTER APPLICATION SET CONFIGURATION VALUE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-set-configuration-value.md
section: SQL Commands
---

# ALTER APPLICATION SET CONFIGURATION VALUE

Sets a value in an [app configuration definition](../../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App.

See also:
:   [ALTER APPLICATION SET CONFIGURATION DEFINITION](alter-application-set-configuration-definition.md), [ALTER APPLICATION DROP CONFIGURATION DEFINITION](alter-application-drop-configuration-definition.md)

## Syntax

```sqlsyntax
ALTER APPLICATION <app> SET CONFIGURATION <config> VALUE = '<value>';
```

## Parameters

`app`
:   Identifier for the Snowflake Native App that contains the configuration.

`config`
:   Identifier for the app configuration definition.

`VALUE = 'value'`
:   Specifies the value to set for the app configuration definition.

## Usage notes

* This command can only be used by a consumer. This command cannot be used by the Snowflake Native App itself.
* For a configuration definition of type `APPLICATION_NAME`, the value must be the name of a Snowflake Native App that is installed in the current account.
* In order to set a configuration, the current role must be granted an application role that has access to the configuration (that is, one of the application roles specified in the `APPLICATION_ROLES` field in the `ALTER APPLICATION SET CONFIGURATION DEFINITION` command).

---
title: ALTER APPLICATION SET SPECIFICATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-set-app-spec.md
section: SQL Commands
---

# ALTER APPLICATION SET SPECIFICATION

Creates or updates an [app specification](../../developer-guide/native-apps/requesting-app-specs.md) for a Snowflake Native App.

> **Note:**
>
> This command can only be used by a Snowflake Native App.

See also:
:   [ALTER APPLICATION](alter-application.md),
    [ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION](alter-application-sequence-number.md), [ALTER APPLICATION DROP SPECIFICATION](alter-application-drop-app-spec.md)

## Syntax

### External access integration

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
  TYPE = EXTERNAL_ACCESS
  LABEL = '<label>'
  DESCRIPTION = '<description>'
  { HOST_PORTS | PRIVATE_HOST_PORTS } = ( '<value>' [, '<value>', ... ] )
```

### Security integration (CLIENT_CREDENTIALS)

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
    TYPE = SECURITY_INTEGRATION
    LABEL = '<string_literal>'
    DESCRIPTION = '<string_literal>'
    OAUTH_TYPE = 'CLIENT_CREDENTIALS'
    OAUTH_TOKEN_ENDPOINT = '<string_literal>'
    OAUTH_ALLOWED_SCOPES = ( '<scope>' [ , '<scope>' ... ] );
```

### Security integration (AUTHORIZATION_CODE)

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
  TYPE = SECURITY_INTEGRATION
  LABEL = '<string_literal>'
  DESCRIPTION = '<string_literal>'
  OAUTH_TYPE = 'AUTHORIZATION_CODE'
  OAUTH_TOKEN_ENDPOINT = '<string_literal>'
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_ALLOWED_SCOPES = ( '<scope>' [ , '<scope>' ... ] ) ];
```

### Security integration (JWT_BEARER)

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
  TYPE = SECURITY_INTEGRATION
  LABEL = '<string_literal>'
  DESCRIPTION = '<string_literal>'
  OAUTH_TYPE = 'JWT_BEARER'
  OAUTH_TOKEN_ENDPOINT = '<string_literal>'
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_ALLOWED_SCOPES = ( '<scope>' [ , '<scope>' ... ] ) ];
```

### Listing

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
  TYPE = LISTING
  LABEL = '<string_literal>'
  DESCRIPTION = '<string_literal>'
  TARGET_ACCOUNTS = '<account_list>'
  LISTING = <listing_name>
  [ AUTO_FULFILLMENT_REFRESH_SCHEDULE = '<schedule>' ]
```

### Inter-App Communication

```sqlsyntax
ALTER APPLICATION SET SPECIFICATION <app_spec_name>
  TYPE = CONNECTION
        LABEL = '<label>'
        DESCRIPTION = '<description>'
        SERVER_APPLICATION = <server_app>
        SERVER_APPLICATION_ROLES = ( <app_role1> [ , <app_role2> ... ] );
```

## General parameters

`app_spec_name`
:   Identifier for the [app specification](../../developer-guide/native-apps/requesting-app-specs.md).

`TYPE = {EXTERNAL_ACCESS | SECURITY_INTEGRATION | LISTING | CONNECTION}}`
:   Specifies the type of app specification. Supported values are:

    * [EXTERNAL_ACCESS](../../developer-guide/external-network-access/creating-using-external-network-access.md)
    * [SECURITY_INTEGRATION](create-security-integration-api-auth.md)
    * [LISTING](../../developer-guide/native-apps/requesting-app-specs-listing.md)
    * [CONNECTION](../../developer-guide/native-apps/inter-app-communication.md)

    > **Important:**
    >
    > The type of an app specification cannot be changed once it has been created. Attempting to
    > alter the type will result in an error.

`LABEL = 'label'`
:   Specifies a label for the app specification. This label is the name of the
    app specification that is visible to the consumer. Each app specification must
    have a unique label.

    > **Note:**
    >
    > Changing only the label will not trigger a new approval request. To require consumer
    > approval, you must also change the app specification definition (such as HOST_PORTS,
    > OAUTH_TOKEN_ENDPOINT, or TARGET_ACCOUNTS).

`DESCRIPTION = 'description'`
:   Specifies a description of the app specification. Snowflake recommends
    including information about the app specification type and why it is
    required by the app.

    > **Note:**
    >
    > Changing only the description will not trigger a new approval request. To require consumer
    > approval, you must also change the app specification definition (such as HOST_PORTS,
    > OAUTH_TOKEN_ENDPOINT, or TARGET_ACCOUNTS).

## External access integration parameters

`HOST_PORTS | PRIVATE_HOST_PORTS = ( 'value' [ , 'value', ... ] )`
:   Specifies a list of host ports or private host ports that the app can connect to.
    These ports are used by external access integrations.

## Security integration parameters - CLIENT_CREDENTIALS

`OAUTH_TYPE = 'CLIENT_CREDENTIALS'`
:   Specifies the type of security integration for external API Authentication. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_TOKEN_ENDPOINT = 'string_literal'`
:   Specifies the token endpoint used by the client to obtain an access token by presenting its authorization
    grant or refresh token. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_ALLOWED_SCOPES = ( 'scope' [  , 'scope' ... ]  )`
:   Specifies a comma-separated list of scopes, with single quotes surrounding each scope, to use when making
    a request from the OAuth by a role with USAGE on the integration during the OAuth client credentials
    flow. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_ACCESS_TOKEN_VALIDITY = integer`
:   Specifies the default lifetime of the OAuth access token (in seconds) issued by an OAuth server. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

## Security integration parameters - AUTHORIZATION_CODE

`OAUTH_TYPE = 'AUTHORIZATION_CODE'`
:   Specifies the type of security integration for external API Authentication. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_TOKEN_ENDPOINT = 'string_literal'`
:   Specifies the token endpoint used by the client to obtain an access token by presenting its authorization
    grant or refresh token. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_AUTHORIZATION_ENDPOINT = 'string_literal'`
:   Specifies the URL for authenticating to the external service. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_ACCESS_TOKEN_VALIDITY = integer`
:   Specifies the default lifetime of the OAuth access token (in seconds) issued by an OAuth server. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_REFRESH_TOKEN_VALIDITY = integer`
:   Specifies the default lifetime of the OAuth refresh token (in seconds) issued by an OAuth server. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

## Security integration parameters - JWT_BEARER

`OAUTH_TYPE = 'JWT_BEARER'`
:   Specifies the type of security integration for external API Authentication. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_TOKEN_ENDPOINT = 'string_literal'`
:   Specifies the token endpoint used by the client to obtain an access token by presenting its authorization
    grant or refresh token. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_AUTHORIZATION_ENDPOINT = 'string_literal'`
:   Specifies the URL for authenticating to the external service. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

`OAUTH_REFRESH_TOKEN_VALIDITY = integer`
:   Specifies the default lifetime of the OAuth refresh token (in seconds) issued by an OAuth server. See
    [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) for more information.

## Listing parameters

`TARGET_ACCOUNTS = 'account_list'`
:   Specifies a single-quoted string of target accounts, separated by commas, with
    no spaces. Each account must be specified in the format
    `OrgName.AccountName`; for example:
    `'ProviderOrg.ProviderAccount,PartnerOrg.PartnerAccount'`. When the
    specification is approved, these accounts are added to the listing. When
    declined, all accounts are removed from the listing.

`LISTING = listing_name`
:   Specifies the identifier of the external listing created by the app. The listing must already exist
    and must have been created by the app with a share attached. After the listing is set in an app specification,
    the listing name cannot be changed.

`AUTO_FULFILLMENT_REFRESH_SCHEDULE = 'schedule'`
:   Optional. Specifies the refresh schedule for cross-region data sharing. This parameter is required
    when sharing data across regions. The value can be specified in two formats:

    * `num MINUTE`: Number of minutes, with a minimum of 10 minutes and
      a maximum of 11,520 minutes (eight days).
    * `USING CRON expression time_zone`: Cron expression with time zone
      for the refresh.

## Inter-app communication parameters

`SERVER_APPLICATION = server_app`
:   The name of the server application to be connected to. The following operations
    are not supported:

    * Updating this setting for an existing specification.
    * More than one specification targeting the same server application.

`SERVER_APPLICATION_ROLES = ( app_role1 [ , app_role2 ... ] )`
:   Specifies a comma-separated list of application roles in the server application
    to be granted to this application.

## Usage notes

* To use this command, providers must ensure that the manifest file of the app
  uses `manifest_version: 2`.

## Examples

Create an app specification for external access:

```sqlexample
ALTER APPLICATION SET SPECIFICATION eai_spec
  TYPE = EXTERNAL_ACCESS
  LABEL = 'External API Access'
  DESCRIPTION = 'Connect to external weather API'
  HOST_PORTS = ('api.weather.com:443', 'api.openweather.org:443');
```

Create an app specification for OAuth security integration:

```sqlexample
ALTER APPLICATION SET SPECIFICATION oauth_spec
  TYPE = SECURITY_INTEGRATION
  LABEL = 'OAuth Integration'
  DESCRIPTION = 'Connect to Microsoft Graph API'
  OAUTH_TYPE = 'CLIENT_CREDENTIALS'
  OAUTH_TOKEN_ENDPOINT = 'https://login.microsoftonline.com/YOUR_TENANT_ID/oauth2/v2.0/token'
  OAUTH_ALLOWED_SCOPES = ('https://graph.microsoft.com/.default');
```

Create an app specification for data sharing through a listing:

```sqlexample
ALTER APPLICATION SET SPECIFICATION shareback_spec
  TYPE = LISTING
  LABEL = 'Telemetry Data Sharing'
  DESCRIPTION = 'Share telemetry and usage data with provider'
  TARGET_ACCOUNTS = 'ProviderOrg.ProviderAccount,PartnerOrg.PartnerAccount'
  LISTING = telemetry_listing
  AUTO_FULFILLMENT_REFRESH_SCHEDULE = '720 MINUTE';
```

---
title: ALTER APPLICATION UNSET CONFIGURATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-unset-configuration.md
section: SQL Commands
---

# ALTER APPLICATION UNSET CONFIGURATION

Unsets an [app configuration definition](../../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App.

See also:
:   [ALTER APPLICATION SET CONFIGURATION DEFINITION](alter-application-set-configuration-definition.md), [ALTER APPLICATION DROP CONFIGURATION DEFINITION](alter-application-drop-configuration-definition.md)

## Syntax

```sqlsyntax
ALTER APPLICATION <app> UNSET CONFIGURATION <config>;
```

## Parameters

`app`
:   Identifier for the Snowflake Native App that contains the configuration.

`config`
:   The name of the app configuration definition to unset.

## Usage notes

* This command can only be used by a consumer. This command cannot be used by the Snowflake Native App itself.
* After unsetting a configuration, the app’s status is updated to `PENDING`.
* To unset a configuration, the current role must be granted an application role that has access to the configuration (that is, one of the application roles specified in the `APPLICATION_ROLES` field in the `ALTER APPLICATION SET CONFIGURATION DEFINITION` command).

---
title: ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-application-sequence-number.md
section: SQL Commands
---

# ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION

Approves or declines an [app specification](../../developer-guide/native-apps/requesting-app-specs.md)
using the specified sequence number.

See also:
:   [ALTER APPLICATION SET SPECIFICATION](alter-application-set-app-spec.md), [ALTER APPLICATION DROP SPECIFICATION](alter-application-drop-app-spec.md)

## Syntax

```sqlsyntax
ALTER APPLICATION <app_name>
  { APPROVE | DECLINE } SPECIFICATION <spec_name>
  SEQUENCE_NUMBER = <sequence_num>;
```

## Parameters

`app_name`
:   Specifies the identifier for the app being altered. If the identifier contains spaces, special characters, or
    mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double
    quotes are also case-sensitive.

`APPROVE | DECLINE SPECIFICATION spec_name`
:   Approves or declines the specified app specification.

`SEQUENCE_NUMBER = sequence_num`
:   Specifies the sequence number of the app specification to approve. The sequence number represents a
    version id of the app specification. The sequence number starts at 1 when the specification is created.
    The value is incremented each time the provider updates the app specification. Use
    [SHOW SPECIFICATIONS](show-specifications.md) or
    [DESCRIBE SPECIFICATION](desc-specification.md) commands to determine the current sequence number of
    the app.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE APPLICATION SPECIFICATIONS | Account | Allows a role to approve or decline app specifications for any app in their account. Only the SECURITYADMIN and ACCOUNTADMIN system roles have the MANAGE APPLICATION SPECIFICATIONS privilege; however, the privilege can be granted to custom roles. |

---
title: ALTER AUTHENTICATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-authentication-policy.md
section: SQL Commands
---

# ALTER AUTHENTICATION POLICY

Modifies the properties of an [authentication policy](../../user-guide/authentication-policies.md).

See also:
:   [CREATE AUTHENTICATION POLICY](create-authentication-policy.md), [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md), [DROP AUTHENTICATION POLICY](drop-authentication-policy.md), [SHOW AUTHENTICATION POLICIES](show-authentication-policies.md)

## Syntax

```sqlsyntax
ALTER AUTHENTICATION POLICY <name> RENAME TO <new_name>

ALTER AUTHENTICATION POLICY [ IF EXISTS ] <name> SET
  [ AUTHENTICATION_METHODS = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_TYPES = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_POLICY = ( <client_type> = ( MINIMUM_VERSION = '<version>' ) [ , ... ] ) ]
  [ SECURITY_INTEGRATIONS = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ MFA_ENROLLMENT = { 'REQUIRED' | 'REQUIRED_PASSWORD_ONLY' } ]
  [ MFA_POLICY= ( <list_of_properties> ) ]
  [ PAT_POLICY = ( <list_of_properties> ) ]
  [ WORKLOAD_IDENTITY_POLICY = ( <list_of_properties> ) ]
  [ COMMENT = '<string_literal>' ]

ALTER AUTHENTICATION POLICY [ IF EXISTS ] <name> UNSET
  [ AUTHENTICATION_METHODS ]
  [ CLIENT_TYPES ]
  [ CLIENT_POLICY ]
  [ SECURITY_INTEGRATIONS ]
  [ MFA_ENROLLMENT ]
  [ MFA_POLICY ]
  [ PAT_POLICY ]
  [ WORKLOAD_IDENTITY_POLICY ]
  [ COMMENT ]
  [ DCM PROJECT ]
```

## Parameters

`name`
:   Specifies the identifier for the authentication policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO ...`
:   Specifies a new name for an existing authentication policy.

`SET ...`
:   Specifies one or more properties to set for the authentication policy, separated by blank spaces, commas, or new lines.

    `AUTHENTICATION_METHODS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Changes the authentication methods that are allowed during login. This parameter accepts one or more of the following values:

        > **Caution:**
        >
        > Restricting by authentication method can have unintended consequences, such as blocking driver connections or third-party
        > integrations.

        `ALL`
        :   Allow all authentication methods.

        `SAML`
        :   Allows [SAML2 security integrations](../../user-guide/admin-security-fed-auth-security-integration.md). If `SAML` is
            present, an SSO login option appears. If `SAML` is not present, an SSO login option does not appear.

        `PASSWORD`
        :   Allows users to authenticate using username and password.

        `OAUTH`
        :   Allows [External OAuth](../../user-guide/oauth-ext-overview.md).

        `KEYPAIR`
        :   Allows [Key pair authentication](../../user-guide/key-pair-auth.md).

        `PROGRAMMATIC_ACCESS_TOKEN`
        :   Allows users to authenticate with a [programmatic access token](../../user-guide/programmatic-access-tokens.md).

        `WORKLOAD_IDENTITY`
        :   Allows users to authenticate through [workload identity federation](../../user-guide/workload-identity-federation.md).

        Default: `ALL`.

    `CLIENT_TYPES = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Changes which clients can authenticate with Snowflake.

        If a client tries to connect, and the client is not one of the valid `CLIENT_TYPES` values listed below, then the login attempt fails.

        If you set `MFA_ENROLLMENT` to `REQUIRED`, then you must include `SNOWFLAKE_UI` in the `CLIENT_TYPES` list to allow
        users to enroll in MFA.

        If you want to exclude `SNOWFLAKE_UI` from the `CLIENT_TYPES` list, then you must set `MFA_ENROLLMENT` to
        `OPTIONAL`.

        The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs.

        This property accepts one or more of the following values:

        `ALL`
        :   Allow all clients to authenticate.

        `SNOWFLAKE_UI`
        :   [Snowsight](../../user-guide/ui-snowsight-gs.md), the Snowflake web interface.

            > **Caution:**
            >
            > If `SNOWFLAKE_UI` is not included in the `CLIENT_TYPES` list while `MFA_ENROLLMENT` is set to `REQUIRED`, or `MFA_ENROLLMENT` is unspecified, MFA enrollment doesn’t work.

        `DRIVERS`
        :   Drivers allow access to Snowflake from applications written in
            [supported languages](../../developer-guide/drivers.md). For example, the [Go](../../developer-guide/golang/go-driver.md),
            [JDBC](../../developer-guide/jdbc/jdbc.md), [.NET](../../developer-guide/dotnet/dotnet-driver.md) drivers, and
            [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

            > **Caution:**
            >
            > If `DRIVERS` is not included in the `CLIENT_TYPES` list, automated ingestion may stop working.

        `SNOWFLAKE_CLI`
        :   A [command-line client](../../developer-guide/snowflake-cli/index.md) for connecting to Snowflake and for managing developer-centric workloads and SQL operations.

        `SNOWSQL`
        :   A [command-line client](../../user-guide/snowsql.md) for connecting to Snowflake.

        If a client tries to connect, and the client is not one of the valid `CLIENT_TYPES`, then the login attempt fails. If
        `CLIENT_TYPES` is unset, any client can connect.

        Default: `ALL`.

    `CLIENT_POLICY = client_type = ( MINIMUM_VERSION = 'version' )`
    :   Specifies a policy within the authentication policy that sets the minimum version allowed for each specified client type.

        If CLIENT_TYPES is empty, contains `ALL`, or contains `DRIVERS`, the CLIENT_POLICY parameter accepts one or more of the following driver clients (and a specific version string). For any driver client that is not specified, the policy implicitly allows any
        version of that client.

        If CLIENT_TYPES contains another value, such as `SNOWFLAKE_CLI`, and does not also contain `DRIVERS`, specifying any of the following client types results in an error. You can’t create (or alter) an authentication policy such that the CLIENT_TYPES and CLIENT_POLICY parameters aren’t compatible.

        `client_type`
        :   One or more valid client type values. This is a different set of values from those that the CLIENT_TYPES parameter accepts. Do not use single quotes for these values.

            * `JDBC_DRIVER` (Snowflake JDBC Driver)
            * `ODBC_DRIVER` (Snowflake ODBC Driver)
            * `PYTHON_DRIVER` (Snowflake Python Driver)
            * `JAVASCRIPT_DRIVER` (Snowflake Javascript Driver)
            * `C_DRIVER` (Libsnowflakeclient C Driver)
            * `GO_DRIVER` (Snowflake Go Driver)
            * `PHP_DRIVER` (Snowflake PHP PDO Driver)
            * `DOTNET_DRIVER` (Snowflake .NET Driver)
            * `SQL_API` (SQL API)
            * `SNOWPIPE_STREAMING_CLIENT_SDK` (Snowpipe Streaming Client SDK)
            * `PY_CORE` (Snowflake Python Core Driver)
            * `SPROC_PYTHON` (Snowflake Python Stored Procedure)
            * `PYTHON_SNOWPARK` (Snowflake Python Snowpark Driver)
            * `SQL_ALCHEMY` (Snowflake SQLAlchemy)
            * `SNOWPARK` (Snowpark)
            * `SNOWFLAKE_CLIENT` (Snowflake Client SDK)

        `'version'`
        :   The minimum accepted version for each specified client type: a sequence of three digits delimited by periods and enclosed by single quotation marks.
            For example: `'1.0.0'` or `'3.14.1'`. Authentication attempts with lower client versions are blocked when this policy is in effect for an account or a user.

        The CLIENT_POLICY property of an authentication policy is a best-effort method to block user logins based on specific client versions. It should not be used as the sole control to establish a security boundary.

    `SECURITY_INTEGRATIONS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Changes the security integrations that the authentication policy is associated with. This parameter has no effect when `SAML`
        or `OAUTH` are not in the `AUTHENTICATION_METHODS` list.

        All values in the `SECURITY_INTEGRATIONS` list must be compatible with the values in the `AUTHENTICATION_METHODS` list. For
        example, if `SECURITY_INTEGRATIONS` contains a SAML security integration, and `AUTHENTICATION_METHODS` contains
        `OAUTH`, then you cannot create the authentication policy.

        `ALL`
        :   Allow all security integrations.

        Default: `ALL`.

    `MFA_ENROLLMENT = { 'REQUIRED' | 'REQUIRED_PASSWORD_ONLY' }`
    :   Determines whether a user must enroll in multi-factor authentication. If this value is used, then
        the `CLIENT_TYPES` parameter must include `SNOWFLAKE_UI`, because Snowsight is the only place users can
        [enroll in multi-factor authentication (MFA)](../../user-guide/ui-snowsight-profile.md).

        `REQUIRED`
        :   Human users who are using password or single-sign on (SSO) authentication must enroll in MFA.

        `REQUIRED_PASSWORD_ONLY`
        :   All human users who are using password authentication must enroll in MFA, regardless of the client they are using. Users using SSO
            authentication are not required to enroll.

    `MFA_POLICY= ( list_of_properties )`
    :   Specifies the policies that affect how multi-factor authentication (MFA) is enforced. Set this to a space-delimited list of one or more
        of the following properties and values:

        `ALLOWED_METHODS = ( { 'ALL' | 'PASSKEY' | 'TOTP' | 'OTP' | 'DUO' } [ , { 'PASSKEY' | 'TOTP' | 'OTP' | 'DUO' } ... ] )`
        :   Specifies the multi-factor authentication (MFA) methods that users can use as a second factor of authentication. You can specify more than one method as a comma-delimited list.

            `ALL`
            :   Users can use a passkey, an authenticator app, or Duo as their second factor of authentication.

            `PASSKEY`
            :   Users can use a passkey as their second factor of authentication.

            `TOTP`
            :   Users can use an authenticator app as their second factor of authentication.

            `OTP`
            :   User can use a one-time passcode as their second factor of authentication. For more information, see [Setting up administrators for break glass access](../../user-guide/security-mfa.md).

            `DUO`
            :   Users can use Duo as their second factor of authentication.

            Default: `ALL`.

        `ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION = { 'ALL' | 'NONE' }`
        :   Specifies whether multi-factor authentication (MFA) is required when users authenticate with single sign-on (SSO). To require MFA, specify
            `ALL`.

            Default: `NONE`

    `PAT_POLICY = ( list_of_properties )`
    :   Specifies the policies for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md). Set this to a
        space-delimited list of one or more of the following properties and values:

        `DEFAULT_EXPIRY_IN_DAYS = number_of_days`
        :   Specifies the default expiration time (in days) for a programmatic access token. You can specify a value from 1 to the
            maximum expiration time (which you can specify by setting MAX_EXPIRY_IN_DAYS).

            The default expiration time is 15 days.

            For more information, see [Setting the default expiration time](../../user-guide/programmatic-access-tokens.md).

        `MAX_EXPIRY_IN_DAYS = number_of_days`
        :   Specifies the maximum number of days that can be set for the expiration time for a programmatic access token. You can specify
            a value from the default expiration time (which you can specify by setting DEFAULT_EXPIRY_IN_DAYS) to 365.

            The default maximum expiration time is 365 days.

            > **Note:**
            >
            > If there are existing programmatic access tokens with expiration times that exceed the new maximum expiration time, attempts to
            > authenticate with those tokens will fail.
            >
            > For example, suppose that you generate a programmatic access token named `my_token` with the expiration time of 7 days. If you
            > later change the maximum expiration time for all tokens to 2 days, authenticating with `my_token` will fail because the
            > expiration time of the token exceeds the new maximum expiration time.

            For more information, see [Setting the maximum expiration time](../../user-guide/programmatic-access-tokens.md).

        `NETWORK_POLICY_EVALUATION = { ENFORCED_REQUIRED | ENFORCED_NOT_REQUIRED | NOT_ENFORCED }`
        :   Specifies how network policy requirements are handled for programmatic access tokens.

            By default, a user must be subject to a [network policy](../../user-guide/network-policies.md) with one or more
            [network rules](../../user-guide/network-rules.md) to generate or use programmatic access tokens:

            * Service users (with TYPE=SERVICE) must be subject to a network policy to generate and use programmatic access tokens.
            * Human users (with TYPE=PERSON) must be subject to a network policy to use programmatic access tokens.

            To override this behavior, set this property to one of the following values:

            `ENFORCED_REQUIRED` (default behavior)
            :   The user must be subject to a network policy to generate and use programmatic access tokens.

                If the user is subject to a network policy, the network policy is enforced during authentication.

            `ENFORCED_NOT_REQUIRED`
            :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

                If the user is subject to a network policy, the network policy is enforced during authentication.

            `NOT_ENFORCED`
            :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

                If the user is subject to a network policy, the network policy is not enforced during authentication.

        `REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = { TRUE | FALSE }`
        :   If TRUE, when you generate a programmatic access token for a service user, you must restrict the use of that token to a
            specific role.

            If you set this parameter to FALSE, you can generate a programmatic access token for a service user without restricting that
            token to a specific role.

            Changing REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS from FALSE back to TRUE invalidates any programmatic access tokens for
            service users that were generated without the role restriction.

            Default value: TRUE

        The following example of the PAT_POLICY clause specifies the following policy:

        * By default, programmatic access tokens expire in 30 days.
        * Programmatic access tokens have a maximum expiration time of 365 days.
        * You can generate a programmatic access token for a user if the user is not subject to a network policy requirement. Any
          network policy that the user is subject to is still enforced.
        * When you generate a programmatic access token for a service user, you do not need to restrict to token to use a specific role.

        ```sqlexample
        PAT_POLICY=(
          DEFAULT_EXPIRY_IN_DAYS=30
          MAX_EXPIRY_IN_DAYS=365
          NETWORK_POLICY_EVALUATION = ENFORCED_NOT_REQUIRED
          REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
        );
        ```

    `WORKLOAD_IDENTITY_POLICY = ( list_of_properties )`
    :   Specifies the policies for [workload identity federation](../../user-guide/workload-identity-federation.md). Set this to a
        space-delimited list that contains one or more of the following properties and values:

        `ALLOWED_PROVIDERS = ( { ALL | AWS | AZURE | GCP | OIDC } [ , { AWS | AZURE | GCP | OIDC } ... ] )`
        :   Specifies the workload identity providers allowed by the authentication policy during workload identity authentication.
            If this parameter is omitted, all workload identity providers are allowed.

            `ALL`
            :   Users can authenticate with any supported and configured workload identity provider.

            `AWS`
            :   Users can authenticate with an AWS IAM role or user.

            `AZURE`
            :   Users can authenticate with an Azure Entra ID access token.

            `GCP`
            :   Users can authenticate with a Google-signed ID token.

            `OIDC`
            :   Users can authenticate with an ID token from a configured OIDC provider.

        `ALLOWED_AWS_ACCOUNTS = ( 'string_literal' [ , 'string_literal' , ... ] )`
        :   Specifies the list of AWS account IDs allowed by the authentication policy during workload identity authentication of type `AWS`.

            By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `AWS`, then the ARN can reference any AWS account.
            If this parameter is set, then only ARNs from the specified AWS account IDs are allowed to authenticate.

            Each element must be a 12-digit string representing the AWS account ID.

            For more information, see [View AWS account identifiers](https://docs.aws.amazon.com/accounts/latest/reference/manage-acct-identifiers.html).

        `ALLOWED_AZURE_ISSUERS = ( 'string_literal' [ , 'string_literal' , ... ] )`
        :   Specifies the list of Azure Entra ID issuers allowed by the authentication policy during workload identity authentication of type `AZURE`.

            By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `AZURE`, then the issuer can be any Entra ID tenant.
            If this parameter is set, then only Azure tokens from the specified issuers are allowed to authenticate.

            Each element must be a valid Authority URL with following format:

            * `https://login.microsoftonline.com/tenantId/v2.0`

        `ALLOWED_OIDC_ISSUERS = ( 'string_literal' [ , 'string_literal' , ... ] )`
        :   Specifies the list of OIDC issuers allowed by the authentication policy during workload identity authentication of type `OIDC`.

            By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `OIDC`, then the issuer can be any valid OIDC issuer.
            If this parameter is set, then only tokens from the specified OIDC issuers are allowed to authenticate.

            Each element must be a valid HTTPS URL that contains scheme, host, and optionally, port number and path components but no query or fragment
            components. The URL must not contain spaces, and it must not exceed 2048 characters in length.

        For example:

        ```sqlexample
        WORKLOAD_IDENTITY_POLICY=(
          ALLOWED_PROVIDERS = (AWS, AZURE, GCP, OIDC)
          ALLOWED_AWS_ACCOUNTS = ('123456789012', '210987654321')
          ALLOWED_AZURE_ISSUERS = ('https://login.microsoftonline.com/8c7832f5-de56-4d9f-ba94-3b2c361abe6b/v2.0',
            'https://login.microsoftonline.com/9ebd1ec9-9a78-4429-8f53-5cf870a812d1/v2.0')
          ALLOWED_OIDC_ISSUERS = ('https://my.custom.oidc.issuer/', 'https://another.custom/oidc/issuer')
        );
        ```

    `COMMENT = 'string_literal'`
    :   Changes the comment for the authentication policy.

`UNSET ...`
:   Specifies the properties to unset for the authentication policy, which resets them to their defaults.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the authentication policy from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the authentication policy and the DCM project without dropping the authentication policy. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Authentication policy | Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you want to update an existing authentication policy and need to see the definition of the policy, run the
  [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.

## Examples

Alter the list of allowed clients on an authentication policy:

```sqlexample
ALTER AUTHENTICATION POLICY restrict_client_types_policy
  SET CLIENT_TYPES = ('SNOWFLAKE_UI', 'SNOWSQL');
```

---
title: ALTER BACKUP POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-backup-policy.md
section: SQL Commands
---

# ALTER BACKUP POLICY

Modifies the properties of a [backup](../../user-guide/backups.md) policy. The following changes are supported:

* Rename the policy.
* Add or update the comment for the policy.
* Change the schedule and expiration settings for the policy. The schedule determines how often Snowflake
  automatically makes a backup and adds the resulting backup to the backup set that’s governed by the policy.
  The expiration period determines how long each backup is retained before Snowflake automatically deletes it from the
  associated backup set.
* Unset properties of the policy, so that they revert back to their default values.

See also:
:   [CREATE BACKUP POLICY](create-backup-policy.md),
    [DROP BACKUP POLICY](drop-backup-policy.md),
    [SHOW BACKUP POLICIES](show-backup-policies.md)

## Syntax

```sqlsyntax
ALTER BACKUP POLICY <name> RENAME TO <new_name>

ALTER BACKUP POLICY <name> SET
  [ COMMENT = '<string_literal>' ]
  [ SCHEDULE = '{ <num> MINUTE | <num> HOUR | USING CRON <expr> <time_zone> }' ]
  [ EXPIRE_AFTER_DAYS = <days_integer> ]

ALTER BACKUP POLICY <name> UNSET { COMMENT | SCHEDULE | EXPIRE_AFTER_DAYS }

ALTER BACKUP POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER BACKUP POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the backup policy.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies a new identifier for the backup policy; must be unique for your account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET...`
:   Specifies one or more properties to set for the backup policy (separated by blank spaces, commas, or new lines):

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the backup policy.

    `SCHEDULE = '{ num MINUTE | num HOUR | USING CRON expr time_zone }'`
    :   Specifies the schedule for creating backups of an object.

        > **Note:**
        >
        > The minimum schedule for backups is 60 minutes or 1 hour.
        >
        > Every policy must include a SCHEDULE clause, an EXPIRE_AFTER_DAYS clause, or both.

        * `USING CRON expr time_zone`
          :   Specifies a cron expression and time zone for the point in time a backup of an object is created. Supports a subset of
              standard cron utility syntax.

              For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
              (in Wikipedia).

              The cron expression consists of the following fields:

              ```output
              # __________ minute (0-59)
              # | ________ hour (0-23)
              # | | ______ day of month (1-31, or L)
              # | | | ____ month (1-12, JAN-DEC)
              # | | | | __ day of week (0-6, SUN-SAT, or L)
              # | | | | |
              # | | | | |
                * * * * *
              ```

              The following special characters are supported:

              `*`
              :   Wildcard. Specifies any occurrence of the field.

              `L`
              :   Stands for “last”. When used in the day-of-week field, it lets you specify constructs such as “the last Friday” (“5L”) of a
                  given month. In the day-of-month field, it specifies the last day of the month.

              `/n`
              :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
                  specified in the month field, then the backup is scheduled for April, July and October (that is, every 3 months, starting with the 4th
                  month of the year). The same schedule is maintained in subsequent years. That is, the backup is not scheduled to run in
                  January (3 months after the October run).

              > **Note:**
              > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
              >   for the account (or setting the value at the user or session level) does not change the time zone for the backup.
              > + The cron expression defines all valid run times for the backup. Snowflake attempts to create a backup based on
              >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
              > + When both a specific day of month and day of week are included in the cron expression, then the backup is scheduled on days
              >   satisfying either the day of the month or the day of the week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
              >   schedules a backup at 0AM (midnight) on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
        * `num MINUTE` or `num MINUTES`
          :   Specifies an interval (in minutes) of wait time between backups. Accepts positive integers only.

              Also supports `num M` syntax.
        * `num HOUR` or `num HOURS`
          :   Specifies an interval (in hours) of wait time between backups. Accepts positive integers only.

              Also supports `num H` syntax.

        To avoid ambiguity, a *base interval time* is set in the following circumstances:

        * When the object is created (using CREATE BACKUP SET … WITH BACKUP POLICY).
        * When a different interval is set (using ALTER BACKUP SET … APPLY BACKUP POLICY or
          ALTER BACKUP POLICY … SET SCHEDULE).

        The base interval time starts the interval counter from the current clock time. For example, if an
        INTERVAL value of `10 MINUTES` is set and the scheduled backup is enabled at 9:03 AM, then the next backup
        is created at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to ensure absolute
        precision, but only guarantee that a backup does not execute before the set interval occurs
        (that is, in the current example, the backup could first run at 9:14 AM, but will definitely not run
        at 9:12 AM).

    `EXPIRE_AFTER_DAYS = days_integer`
    :   > Specifies the number of days until the backup expires. Snowflake automatically deletes expired backups.
        > If this parameter isn’t specified, backups remain in the backup set until they are manually deleted from the set.
        >
        > * Minimum value: `1`
        > * Maximum value: `3653` (roughly 10 years) if you don’t specify the `SCHEDULE` clause.
        >
        > > **Note:**
        > >
        > > If the policy has a retention lock, you can increase the EXPIRE_AFTER_DAYS value, but you can’t decrease that value.
        > >
        > > Every policy must include a SCHEDULE clause, an EXPIRE_AFTER_DAYS clause, or both.

        `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
        :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

            The tag value is always a string, and the maximum number of characters for the tag value is 256.

            For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET...`
:   Unset one of the following properties for the backup policy. The property reverts to its default value.

    * COMMENT
    * `TAG tag_name [ , tag_name ... ]`
    * SCHEDULE
    * EXPIRE_AFTER_DAYS

    > **Note:**
    >
    > You can unset the SCHEDULE property, or the EXPIRE_AFTER_DAYS property, but not both.
    > For example, you might keep the EXPIRE_AFTER_DAYS property when you don’t intend to create new backups,
    > but you want existing backups to expire after a certain time.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| OWNERSHIP | The role used to modify a backup policy must have the OWNERSHIP privilege on the backup policy. |
| APPLY BACKUP RETENTION LOCK | The role used to modify a backup policy with a retention lock must have this privilege on the account. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Regarding metadata:

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename the backup policy `hourly_backup_policy` to `daily_backup_policy`:

```sqlexample
ALTER BACKUP POLICY hourly_backup_policy
  RENAME TO daily_backup_policy;
```

Add a comment to backup policy `hourly_backup_policy`:

```sqlexample
ALTER BACKUP POLICY hourly_backup_policy
  SET COMMENT = 'hourly backup expires in 90 days';
```

Change schedule for backup policy `every_two_hours`:

```sqlexample
ALTER BACKUP POLICY every_two_hours SET SCHEDULE = '120 MINUTE';
```

Revert the EXPIRE_AFTER_DAYS property back to its default value:

```sqlexample
ALTER BACKUP POLICY sample_backup_policy UNSET EXPIRE_AFTER_DAYS;
```

---
title: ALTER BACKUP SET
source: https://docs.snowflake.com/en/sql-reference/sql/alter-backup-set.md
section: SQL Commands
---

# ALTER BACKUP SET

Modifies the properties for a [backup](../../user-guide/backups.md) set.
This operation can be one of the following:

* Taking a new backup that becomes part of the backup set.
* Removing an old backup from the backup set.
* Suspending or resuming the scheduled backups and scheduled backup deletion
  that are specified by the backup policy.
* Applying a backup policy to a backup set that doesn’t already have a policy.
* Adding or removing a legal hold for a specific backup within the backup set.
* Renaming the backup set.
* Specifying or removing a comment for the backup set.

See also:
:   [CREATE BACKUP SET](create-backup-set.md),
    [DROP BACKUP SET](drop-backup-set.md),
    [SHOW BACKUP SETS](show-backup-sets.md)

## Syntax

```sqlsyntax
ALTER BACKUP SET <name> ADD BACKUP

ALTER BACKUP SET <name> APPLY BACKUP POLICY <policy_name> [ FORCE ]

ALTER BACKUP SET <name> SUSPEND BACKUP [ { CREATION | EXPIRATION } ] POLICY

ALTER BACKUP SET <name> RESUME BACKUP [ { CREATION | EXPIRATION } ] POLICY

ALTER BACKUP SET <name> DELETE BACKUP IDENTIFIER '<backup_id>'

ALTER BACKUP SET <name> MODIFY BACKUP IDENTIFIER '<backup_id>' { ADD | REMOVE } LEGAL HOLD

ALTER BACKUP SET <name> RENAME TO <new_name>

ALTER BACKUP SET <name> SET COMMENT = '<string_literal>'

ALTER BACKUP SET <name> UNSET COMMENT

ALTER BACKUP SET <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER BACKUP SET <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the backup set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ADD BACKUP`
:   Manually create a backup in the set. If the backup policy doesn’t include a schedule for
    taking new backups, this is how you make a new backup of the table, schema, or database that’s
    included in the backup set. You can also make new backups in the backup set at any time even
    when backups happen on a regular schedule.

`APPLY BACKUP POLICY policy_name [ FORCE ]`
:   Specifies the backup policy to attach to the backup set.

    The FORCE option overwrites an existing policy on a backup set. You can only use this option if the old
    policy doesn’t have a retention lock.

    > **Important:**
    >
    > Applying a backup policy with a retention lock to a backup set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a backup set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a backup set with a long expiration period, to avoid unexpected storage charges
    > for undeletable backup sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all backups, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`SUSPEND BACKUP [ { CREATION | EXPIRATION } ] POLICY`
:   Suspend a backup policy in the backup set.
    You can suspend the entire backup policy, or only creation or expiration operations.
    When you specify SUSPEND BACKUP POLICY without the CREATION or EXPIRATION keywords, Snowflake
    suspends both the creation and expiration aspects of the policy.
    For more information, see [Suspend a backup policy on a backup set](../../user-guide/backups.md).

`RESUME BACKUP [ { CREATION | EXPIRATION } ] POLICY`
:   Resume a suspended backup policy in the set.
    You can resume the entire backup policy, or only creation or expiration operations.
    When you specify RESUME BACKUP POLICY without the CREATION or EXPIRATION keywords, Snowflake
    resumes both the creation and expiration aspects of the policy.
    For more information, see [Resume a backup policy on a backup set](../../user-guide/backups.md).

`DELETE BACKUP IDENTIFIER 'backup_id'`
:   Delete a backup in the backup set by ID.
    The backup ID is a UUID value, in the format returned by
    the [UUID_STRING](../functions/uuid_string.md) function.
    Snowflake only allows deleting the oldest backup from the backup set.
    For more information, see [Delete a backup from a backup set](../../user-guide/backups.md).

`MODIFY BACKUP IDENTIFIER 'backup_id' { ADD | REMOVE } LEGAL HOLD`
:   Adds or removes a legal hold from a specified backup within the backup set.
    For more information about legal holds for WORM backups, see [Legal hold](../../user-guide/backups.md).
    For examples of using this clause, see [Add and remove legal holds](../../user-guide/backups.md).

`RENAME TO new_name`
:   Specifies a new identifier for the backup set; must be unique for your account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET COMMENT = 'string_literal'`
:   Associate a comment with the backup set.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the backup set, which resets them to the defaults:

    * `property_name`
    * `param_name`

      + `COMMENT`
      + `TAG tag_name [ , tag_name ... ]`

    You can reset multiple properties/parameters with a single ALTER statement; however, each
    property/parameter must be separated by a comma. Also, when resetting a
    property/parameter, you only specify the name; no value is required.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Description |
| --- | --- |
| OWNERSHIP | The role used to modify a backup set must have the OWNERSHIP privilege on the backup set. |
| APPLY BACKUP RETENTION LOCK | If the backup policy includes a retention lock, the owner role of the backup set must have this privilege on the account. |
| APPLY LEGAL HOLD | This account privilege grants the ability to add or remove a legal hold from a backup. This privilege is only needed for the ADD LEGAL HOLD and REMOVE LEGAL HOLD clauses. By default, the ACCOUNTADMIN role has this privilege. |
| APPLY | The owner role of the backup set must have this privilege on the backup policy, either directly or through the role hierarchy. |

APPLY on the backup policy and APPLY BACKUP RETENTION LOCK on the account must
be granted to the owner role of the backup set (that is, the role that holds
OWNERSHIP of the backup set), either directly or indirectly through role
hierarchy inheritance. Background snapshot creation jobs run under the backup
set’s owner role, so that role must be able to use the policy after it is
applied.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the backup policy has a retention lock applied to it, and there are any
> unexpired backups in the backup set, then you can’t delete the backup set.
> In that case, you must wait for all the backups in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a backup policy.

## Examples

Manually add a backup to backup set `t1_backups`:

```sqlexample
ALTER BACKUP SET t1_backups
  ADD BACKUP;
```

Update the backup policy for backup set `t1_backups`:

```sqlexample
ALTER BACKUP SET t1_backups
  APPLY BACKUP POLICY daily_backup_policy;
```

Suspend a backup policy on the backup set `t1_backup`:

```sqlexample
ALTER BACKUP SET t1_backups
  SUSPEND BACKUP POLICY;
```

Resume a backup policy on the backup set `t1_backups`:

```sqlexample
ALTER BACKUP SET t1_backups
  RESUME BACKUP POLICY;
```

Rename the backup set `t1_backups` to `table1_backups`:

```sqlexample
ALTER BACKUP SET t1_backups
  RENAME TO table1_backups;
```

To find the backup identifier to use with the ADD LEGAL HOLD
and REMOVE LEGAL HOLD clauses, you typically use the SHOW BACKUPS
command to list the eligible backups and their creation times.
The following example shows how you might list the appropriate
backups, add a legal hold to one specific backup, and later
remove that legal hold. Substitute your own role name, backup set name,
and backup identifier.

```sqlexample
USE ROLE my_legal_hold_role; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW BACKUPS IN BACKUP SET my_db_backup_set
  ->> SELECT "created_on", "backup_id" FROM $1 WHERE "is_under_legal_hold" = 'N';
ALTER BACKUP SET my_db_backup_set
  MODIFY BACKUP IDENTIFIER '790d1ee4-88b2-451f-9ccc-eacd1e93a134'
  ADD LEGAL HOLD;

USE ROLE my_legal_hold_role; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW BACKUPS IN BACKUP SET my_db_backup_set
  ->> SELECT "created_on", "backup_id" FROM $1 WHERE "is_under_legal_hold" = 'Y';
ALTER BACKUP SET my_db_backup_set
  MODIFY BACKUP IDENTIFIER '790d1ee4-88b2-451f-9ccc-eacd1e93a134'
  REMOVE LEGAL HOLD;
```

---
title: ALTER CATALOG INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-catalog-integration.md
section: SQL Commands
---

# ALTER CATALOG INTEGRATION

Modifies the properties of an existing [catalog integration](../../user-guide/tables-iceberg.md).

See also:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md) , [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md), [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md)

## Syntax

```sqlsyntax
ALTER CATALOG INTEGRATION [ IF EXISTS ] <name> SET
  REST_AUTHENTICATION = (
    restAuthenticationParams
  )
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

The `restAuthenticationParams` are as follows, depending on your authentication method:

**OAuth**

```sqlsyntax
restAuthenticationParams (for OAuth) ::=

  OAUTH_CLIENT_SECRET = '<oauth_client_secret>'
```

**Bearer token**

```sqlsyntax
restAuthenticationParams (for Bearer token) ::=

  BEARER_TOKEN = '<bearer_token>'
```

## Parameters

`name`
:   Specifies the identifier for the catalog integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Sets one or more specified properties or parameters to set for the catalog integration:

    `REFRESH_INTERVAL_SECONDS = value`
    :   Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates
        for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

        For Delta-based tables, specifies the number of seconds that Snowflake waits between attempts to poll your external cloud storage for
        new metadata.

        Values: 30 to 86400, inclusive

        Default: 30 seconds

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

### REST authentication parameters (restAuthenticationParams)

**OAuth**

> `OAUTH_CLIENT_SECRET = oauth_client_secret`
> :   Your OAuth2 client secret.

**Bearer token**

> `BEARER_TOKEN = bearer_token`
> :   The bearer token for your identity provider. You can alternatively specify a personal access token (PAT).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration (catalog) | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the refresh interval for automated refresh to 30 seconds:

```sqlexample
ALTER CATALOG INTEGRATION myCatalogIntegration SET REFRESH_INTERVAL_SECONDS = 30;
```

---
title: ALTER COMPUTE POOL
source: https://docs.snowflake.com/en/sql-reference/sql/alter-compute-pool.md
section: SQL Commands
---

# ALTER COMPUTE POOL

Modifies the properties of an existing
[compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

See also:
:   [CREATE COMPUTE POOL](create-compute-pool.md) , [DESCRIBE COMPUTE POOL](desc-compute-pool.md), [DROP COMPUTE POOL](drop-compute-pool.md) , [SHOW COMPUTE POOLS](show-compute-pools.md)

## Syntax

```sqlsyntax
ALTER COMPUTE POOL [ IF EXISTS ] <name> { SUSPEND | RESUME }

ALTER COMPUTE POOL [ IF EXISTS ] <name> STOP ALL  [ OF TYPE <workload_type> [ , ... ] ]

ALTER COMPUTE POOL [ IF EXISTS ] <name> SET [ MIN_NODES = <num> ]
                                            [ MAX_NODES = <num> ]
                                            [ AUTO_RESUME = { TRUE | FALSE } ]
                                            [ AUTO_SUSPEND_SECS = <num> ]
                                            [ PLACEMENT_GROUP = '<placement_group_name>' ]
                                            [ INSTANCE_FAMILY = <instance_family_name> ]
                                            [ TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ]
                                            [ COMMENT = '<string_literal>' ]

ALTER COMPUTE POOL [ IF EXISTS ] <name> UNSET { AUTO_SUSPEND_SECS |
                                                AUTO_RESUME       |
                                                PLACEMENT_GROUP   |
                                                COMMENT
                                              }
                                              [ , ... ]
```

## Parameters

`name`
:   Specifies the identifier for the compute pool to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`{ SUSPEND | RESUME }`
:   Suspends a compute pool or resumes a previously suspended compute pool. When you suspend a compute pool, Snowflake suspends all services in that compute pool,
    but the jobs continue to run until they reach a terminal state (DONE or FAILED), after which the compute pool nodes are released.

`STOP ALL  OF TYPE workload_type [ , ... ]`
:   Drops all services and cancels jobs executing in the compute pool. Snowflake then removes all the containers from the compute pool. If the optional `OF TYPE` clause is specified, Snowflake only stops the services of the specified workload types. For a list of available workload types, see [ALLOWED_SPCS_WORKLOAD_TYPES](../parameters.md).

    The filter is case-insensitive.

`SET ...`
:   Sets one or more specified properties or parameters for the compute pool:

    `MIN_NODES = num`
    :   Specifies the minimum number of compute pool nodes.

    `MAX_NODES = num`
    :   Specifies the maximum number of compute pool nodes.

    `AUTO_RESUME = { TRUE | FALSE }`
    :   Specifies whether to automatically resume a compute pool when a service or job is submitted to it. If AUTO_RESUME is FALSE,
        you need to explicitly resume the compute pool (using ALTER COMPUTE POOL <name> RESUME) before you can start a service or
        job on the compute pool.

    `AUTO_SUSPEND_SECS = num`
    :   Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. Inactivity means
        no services and no jobs running on any node in the compute pool.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `PLACEMENT_GROUP = placement_group_name`
    :   Identifies the [placement group of the compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). Use the [SHOW COMPUTE POOLS](show-compute-pools.md)
        and [DESCRIBE COMPUTE POOL](desc-compute-pool.md)
        commands to review the assignment of the compute pool into placement groups.

        You can also set `placement_group` to `DISTRIBUTED`. In this case, Snowflake attempts to distribute compute pool nodes across all available placement groups to maintain an even distribution across multiple placement groups so that the groups are more fault tolerant. For more information, see [Compute pool placement](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    `INSTANCE_FAMILY = instance_family_name`
    :   Identifies the type of machine you want to provision for the nodes in the compute pool. The machine type determines the amount of compute resources in the compute pool and, therefore, the number of credits consumed while
        the compute pool is running. For a list of available instance family names, see [instance families](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

        INSTANCE_FAMILY can be altered only when a compute pool is fully suspended. Upon resuming, Snowflake uses the new instance type to provision the compute pool.

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the compute pool.

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset for the compute pool,
    which resets them to the defaults. For more information, see
    [CREATE COMPUTE POOL](create-compute-pool.md):

    * `AUTO_SUSPEND_SECS`
    * `AUTO_RESUME`
    * `PLACEMENT_GROUP`: The placement group can only be unset when the compute pool is fully suspended.
    * `COMMENT`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OPERATE | Compute pool | To suspend or resume a compute pool, the role requires these permissions. |
| MODIFY | Compute pool | To alter the compute pool and set properties, the role requires this permission. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example sets the MAX_NODES and AUTO_RESUME properties for a compute pool:

```sqlexample
ALTER COMPUTE POOL tutorial_compute_pool SET
  MAX_NODES = 5
  AUTO_RESUME = FALSE
```

The following example sets the “CPU_X64_S” as the INSTANCE_FAMILTY for a compute pool. Because the compute pool must be stopped to change the instance family, the compute pool is first suspended:

```sqlexample
ALTER COMPUTE POOL tutorial_compute_pool SUSPEND;
ALTER COMPUTE POOL tutorial_compute_pool SET
  INSTANCE_FAMILY = CPU_X64_S;
ALTER COMPUTE POOL tutorial_compute_pool RESUME;
```

---
title: ALTER CONNECTION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-connection.md
section: SQL Commands
---

# ALTER CONNECTION

Modifies the properties for an existing [connection](../../user-guide/client-redirect.md).

See also:
:   [CREATE CONNECTION](create-connection.md) , [DROP CONNECTION](drop-connection.md) , [SHOW CONNECTIONS](show-connections.md)

## Syntax

```sqlsyntax
ALTER CONNECTION [ IF EXISTS ] <name> ENABLE FAILOVER TO ACCOUNTS <organization_name>.<account_name> [ , <organization_name>.<account_name> ... ]
                        [ IGNORE EDITION CHECK ]

ALTER CONNECTION [ IF EXISTS ] <name> DISABLE FAILOVER [ TO ACCOUNTS <organization_name>.<account_name> [ , <organization_name>.<account_name> ... ] ]

ALTER CONNECTION [ IF EXISTS ] <name> PRIMARY

ALTER CONNECTION [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER CONNECTION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Identifier for the connection to alter.

`ENABLE FAILOVER TO ACCOUNTS organization_name.account_name [ , organization_name.account_name ... ]`
:   Specifies a comma-separated list of accounts in your organization where a secondary connection for this primary connection can be
    promoted to serve as the primary connection. Include your organization name for each account in the list.

    Each account in the list must be located in a different region than the account with the primary connection. Otherwise,
    the command fails.

`DISABLE FAILOVER [ TO ACCOUNTS organization_name.account_name [ , organization_name.account_name ... ] ]`
:   Disables failover for this primary connection, meaning no secondary connection for this primary connection can be promoted to serve as the primary
    connection.

    To disable failover to selected accounts (rather than to all accounts), specify a comma-delimited list of those accounts.

`PRIMARY`
:   Promote connection to serve as primary connection.

`SET ...`
:   Specifies the properties to set for the connection:

    `COMMENT = 'string'`
    :   Adds a comment or overwrites an existing comment for the connection.

`UNSET ...`
:   Specifies the properties to unset for the connection, which resets them to the defaults.

    Currently, the only property you can unset is `COMMENT`, which removes the comment, if one exists, for the connection.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.
* If private connectivity to the Snowflake service is enabled for your Snowflake account, your network administrator must update the
  DNS CNAME record for your connection URL when a connection is promoted to serve as the primary connection. For more information, see
  [Configuring the DNS settings for private connectivity to the Snowflake service](../../user-guide/client-redirect.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Allow accounts `myaccount2` and `myaccount3` in the `myorg` organization to each store a secondary connection for the
`myconnection` connection:

```sqlexample
ALTER CONNECTION myconnection ENABLE FAILOVER TO ACCOUNTS myorg.myaccount2, myorg.myaccount3;
```

Add a comment for a connection:

```sqlexample
ALTER CONNECTION myconnection SET COMMENT = 'New comment for connection';
```

Promote a secondary connection to primary connection:

```sqlexample
ALTER CONNECTION myconnection PRIMARY;
```

---
title: ALTER CONTACT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-contact.md
section: SQL Commands
---

# ALTER CONTACT

Modifies the properties of an existing [contact](../../user-guide/contacts-using.md).

See also:
:   [CREATE CONTACT](create-contact.md) , [DROP CONTACT](drop-contact.md) , [SHOW CONTACTS](show-contacts.md)

## Syntax

```sqlsyntax
ALTER CONTACT [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER CONTACT [ IF EXISTS ] <name> SET
  [ {
    USERS = ( '<user_name>' [ , '<user_name>' ... ] )
    | EMAIL_DISTRIBUTION_LIST = '<email>'
    | URL = '<url>'
    } ]
  [ COMMENT = '<string_literal>' ]
```

## Parameters

`name`
:   Specifies the identifier for the contact to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the contact to `new_name`. The new identifier must be unique for the schema.

    For more information about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When a contact is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Sets one of the following parameters for the contact:

    `USERS = ( 'user_name' [ , 'user_name' ... ] )`
    :   Comma-delimited list of Snowflake users who can be contacted, specified by the name of their user objects.

        If the user name is case-sensitive or includes any special characters or spaces, double quotes are required. The double quotes must be
        enclosed within the single quotes. For example, if the user is `joe@example.com`, you must specify `'"joe@example.com"'`.

    `EMAIL_DISTRIBUTION_LIST = 'email'`
    :   A valid email address, which can be a distribution list.

    `URL = 'url'`
    :   A URL that can be used to contact people about an object.

    `COMMENT = '<string_literal>'`
    :   A user-defined string. Specifies a comment for the contact.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY | Contact |  |
| OWNERSHIP | Contact | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

```sqlexample
ALTER CONTACT my_contact SET EMAIL_DISTRIBUTION_LIST = 'support@example.com';
```

---
title: ALTER CORTEX SEARCH SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-cortex-search.md
section: SQL Commands
---

# ALTER CORTEX SEARCH SERVICE

Suspends, resumes, or modifies the properties of an existing [Cortex Search service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Syntax

```sqlsyntax
ALTER CORTEX SEARCH SERVICE [ IF EXISTS ] <name>
  { SUSPEND | RESUME } [ { INDEXING | SERVING } ]

ALTER CORTEX SEARCH SERVICE [ IF EXISTS ] <name> REFRESH

ALTER CORTEX SEARCH SERVICE [ IF EXISTS ] <name> SET
  [ TARGET_LAG = { '<num> { seconds | minutes | hours | days }' } ]
  [ WAREHOUSE = <warehouse_name> ]
  [ PRIMARY KEY = ( <col_name> [, ... ] ) ]
  [ FULL_INDEX_BUILD_INTERVAL_DAYS = <num> ]
  [ REQUEST_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]

ALTER CORTEX SEARCH SERVICE [ IF EXISTS ] <name> UNSET
  [ PRIMARY KEY ]

ALTER CORTEX SEARCH SERVICE <name>
  ADD SCORING PROFILE [ IF NOT EXISTS ] <profile_name>
  <scoring_profile>

ALTER CORTEX SEARCH SERVICE <name>
  DROP SCORING PROFILE [ IF EXISTS ] <profile_name>
```

## Parameters

`name`
:   Specifies the identifier for the Cortex Search service to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`{ SUSPEND | RESUME } ...`
:   Suspends or resumes the indexing, serving, or both for a Cortex Search service. You can specify one of the following keywords to indicate
    which layer to suspend or resume:

    `INDEXING`
    :   The target that indicates the indexing layer of the Cortex Search Service. For more details, see Usage Notes.

    `SERVING`
    :   The target that indicates the serving layer of the Cortex Search Service. For more details, see Usage Notes.

        If you do not specify either keyword, both the indexing and serving layers are suspended or resumed. The OPERATE privilege is required to suspend or resume a Cortex Search service.

`REFRESH`
:   Triggers a manual refresh of the Cortex Search Service. The indexing service immediately checks for changes to the source data and processes
    any new or changed rows.

`SET ...`
:   Sets one or more specified properties or parameters to set for the Cortex Search service:

    `TARGET_LAG = 'num { seconds | minutes | hours | days }'`
    :   Specifies the maximum amount of time that the Cortex Search service content should lag behind updates to the base tables specified in the source query.

    `WAREHOUSE = warehouse_name`
    :   Specifies the warehouse to use for running the source query, building the search index, and keeping it refreshed per the TARGET_LAG target.

    `FULL_INDEX_BUILD_INTERVAL_DAYS = num`
    :   Specifies the target interval, in days, between full index rebuilds for a Cortex Search service with
        [primary keys](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) defined. This property is only applicable to services that have primary keys set.

        This value is a soft target. Full index rebuilds may occur more frequently than the specified interval to optimize
        serving performance based on factors such as service target lag, change rate in the service source data, and overall
        service size.

    `REQUEST_LOGGING = { TRUE | FALSE }`
    :   Enables or disables request logging for the Cortex Search Service. When enabled, the service records
        information about search requests, which you can query for monitoring and analysis purposes.
        For more information, see [Monitor Cortex Search requests](../../user-guide/snowflake-cortex/cortex-search/cortex-search-monitor.md).

        Default: `FALSE`

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the search service.

    `PRIMARY KEY = (column_name, column_name, ...)`
    :   Modifies the set of [primary key columns](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) for the Cortex Search service.
        The combination of values in the designated columns must be unique for each row; rows with duplicate primary key
        values are ignored in the resulting search index. Primary key columns must be of the
        [TEXT](../data-types-text.md) data type. Changes to primary keys take effect after the next change
        to the source data.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the Cortex Search service.

    `PRIMARY KEY`
    :   Removes any [primary key columns](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) that were previously set for the Cortex Search service.

`ADD SCORING PROFILE profile_name [ IF NOT EXISTS ] scoring_profile`
:   Adds a named scoring profile to the Cortex Search service. Scoring profiles define custom ranking behavior for search
    results. For more information, see [Named scoring profiles](../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

    `profile_name`
    :   Specifies the name of the scoring profile to add. If a profile with the specified name already exists, an error occurs unless
        you specify IF NOT EXISTS. To modify an existing profile, drop it using DROP SCORING PROFILE first.

    `scoring_profile`
    :   The scoring profile definition in JSON string format. The schema is the same as a scoring configuration specified directly in a search query
        using the `scoring_config` parameter. See [Numeric boosts and time decays](../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md) for syntax and examples.

`DROP SCORING PROFILE [ IF EXISTS ] profile_name`
:   Drops a named scoring profile from the Cortex Search service.

    `profile_name`
    :   The name of the scoring profile to drop.

## Access Control Requirements

| Privilege | Object |
| --- | --- |
| OWNERSHIP | Cortex Search service you want to modify any properties on. |
| OPERATE | Cortex Search service you want to perform one of the following on:   * Suspend search indexing * Resume search indexing * Refresh search index * Set or change query warehouse * Set or change target lag |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage Notes

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

### INDEXING and SERVING states

Cortex Search services have two distinct processes that can be in either the RUNNING or SUSPENDED state: INDEXING and SERVING.

* INDEXING is the target that indicates the indexing layer of the Cortex Search Service. When in the RUNNING state, changes in base tables
  referenced by the service’s source query will prompt refreshes of the materialized data stored as part of the search index. These
  refreshes incur cost in the form of warehouse compute and vector embeddings. When in the SUSPENDED state, changes in base tables will
  not trigger refreshes, nor will they be reflected in the queryable data of the Cortex Search Service.

  > **Note:**
  >
  > If indexing is suspended for longer than the data retention period of the source tables, the service may be unable to detect changes in
  > the source data when indexing is resumed and could require recreation. For more information, see [DATA_RETENTION_TIME_IN_DAYS](../parameters.md).
* SERVING is the target that indicates the serving layer of the Cortex Search Service. This target must be in the RUNNING state for the
  service to be queryable. When in the suspended state, the Cortex Search Service will not incur billing
  in the form of Cortex Search’s serving costs.

For detailed cost considerations, see [Cost considerations](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

The INDEXING and SERVING layers of the Cortex Search Service can be managed independently. For instance, if SERVING is in the running
state while INDEXING is suspended, you can still query the service. However, the service will not reflect any changes in the base data
regardless of the TARGET_LAG until INDEXING is resumed and a refresh is completed successfully.

Conversely, if INDEXING is running while SERVING is suspended, the index will continue to refresh. When SERVING is resumed, the loaded
index that becomes queryable will reflect the most up-to-date source data.

When neither the SERVING nor INDEXING keywords are specified, both targets will be impacted by the specified action.

### Manual refreshes

When you manually refresh a Cortex Search Service, the service immediately checks for changes in its source data and updates the
index as needed. Trigger a manual refresh of your Cortex Search Service when you need the most up-to-date results possible — for
example, when you have just added or updated important documents and want these to be available immediately to users. You can
also use a manual refresh to ensure that results are always current at specific times, such as at the start of business.

You can trigger a manual refresh of a Cortex Search Service using the ALTER CORTEX SEARCH SERVICE … REFRESH command or in Snowsight.

### Primary keys

Altering the primary key columns of a Cortex Search Service affects only future refreshes.
That is, changes to primary keys go into effect after the next change to the source data.

## Examples

The following example changes warehouse used by the Cortex Search service named `mysvc` to `my_new_wh`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SET WAREHOUSE = my_new_wh;
```

The following example sets the comment field of the Cortex Search service named `mysvc` to `new_comment`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SET COMMENT = 'new_comment';
```

The following example changes the target refresh lag of the Cortex Search service named `mysvc` to `1 hour`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SET TARGET_LAG = '1 hour';
```

The following example sets the primary key columns of the Cortex Search service named `mysvc` to `region` and `agent_id`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SET PRIMARY KEY = (region, agent_id);
```

The following example clears the primary key columns of the Cortex Search service named `mysvc`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc UNSET PRIMARY KEY;
```

The following example enables request logging for a Cortex Search service named `mysvc`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SET REQUEST_LOGGING = TRUE;
```

The following example suspends serving for a Cortex Search service named `mysvc`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc SUSPEND SERVING;
```

The following example manually refreshes a Cortex Search service named `mysvc`:

```sqlexample
ALTER CORTEX SEARCH SERVICE mysvc REFRESH;
```

---
title: ALTER DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-database.md
section: SQL Commands
---

# ALTER DATABASE

Modifies the properties for an existing database.

Database modifications include the following:

* Changing the name of the database or changing the Time Travel data retention period (if you are using Snowflake Enterprise Edition or higher).
* Enabling and managing database replication and failover.

See also:
:   [CREATE DATABASE](create-database.md) , [DESCRIBE DATABASE](desc-database.md) , [DROP DATABASE](drop-database.md) , [SHOW DATABASES](show-databases.md) , [UNDROP DATABASE](undrop-database.md)

## Syntax

```sqlsyntax
ALTER DATABASE [ IF EXISTS ] <name> RENAME TO <new_db_name>

ALTER DATABASE [ IF EXISTS ] <name> SWAP WITH <target_db_name>

ALTER DATABASE [ IF EXISTS ] <name> SET [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
                                        [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
                                        [ EXTERNAL_VOLUME = <external_volume_name> ]
                                        [ CATALOG = <catalog_integration_name> ]
                                        [ ICEBERG_VERSION_DEFAULT = <integer> ]
                                        [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
                                        [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
                                        [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
                                        [ DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU = '<compute_pool_name>' ]
                                        [ DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU = '<compute_pool_name>' ]
                                        [ OBJECT_VISIBILITY = { <object_visibility_spec> | PRIVILEGED } ]
                                        [ LOG_LEVEL = '<log_level>' ]
                                        [ METRIC_LEVEL = '<metric_level>' ]
                                        [ TRACE_LEVEL = '<trace_level>' ]
                                        [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
                                        [ EVENT_TABLE = <event_table_name> ]
                                        [ COMMENT = '<string_literal>' ]
                                        [ CATALOG_SYNC = '<snowflake_open_catalog_integration_name>' ]
                                        [ REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' } ]
                                        [ BASE_LOCATION_PREFIX = '<string>' ]
                                        [ DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE = <warehouse_name> ]
                                        [ CLASSIFICATION_PROFILE = '<profile_name>' ]
                                        [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
                                        [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
                                        [ DATA_QUALITY_MONITORING_SETTINGS = <yaml_spec> ]

ALTER DATABASE <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER DATABASE <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER DATABASE [ IF EXISTS ] <name> UNSET { DATA_RETENTION_TIME_IN_DAYS         |
                                            MAX_DATA_EXTENSION_TIME_IN_DAYS     |
                                            EXTERNAL_VOLUME                     |
                                            CATALOG                             |
                                            ICEBERG_VERSION_DEFAULT             |
                                            ENABLE_ICEBERG_MERGE_ON_READ        |
                                            DEFAULT_DDL_COLLATION               |
                                            DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU   |
                                            DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU   |
                                            OBJECT_VISIBILITY                   |
                                            STORAGE_SERIALIZATION_POLICY        |
                                            EVENT_TABLE = <event_table_name>    |
                                            COMMENT                             |
                                            CATALOG_SYNC                        |
                                            REPLICABLE_WITH_FAILOVER_GROUPS     |
                                            BASE_LOCATION_PREFIX                |
                                            DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE|
                                            CLASSIFICATION_PROFILE              |
                                            CONTACT <purpose>                   |
                                            ENABLE_DATA_COMPACTION              |
                                            DCM PROJECT
                                          }
                                          [ , ... ]
```

## Database replication and failover syntax

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

**Database Replication**

```sqlsyntax
ALTER DATABASE <name> ENABLE REPLICATION TO ACCOUNTS <account_identifier> [ , <account_identifier> ... ] [ IGNORE EDITION CHECK ]

ALTER DATABASE <name> DISABLE REPLICATION [ TO ACCOUNTS <account_identifier> [ , <account_identifier> ... ] ]

ALTER DATABASE <name> REFRESH
```

**Database Failover**

```sqlsyntax
ALTER DATABASE <name> ENABLE FAILOVER TO ACCOUNTS <account_identifier> [ , <account_identifier> ... ]

ALTER DATABASE <name> DISABLE FAILOVER [ TO ACCOUNTS <account_identifier> [ , <account_identifier> ... ] ]

ALTER DATABASE <name> PRIMARY
```

## Parameters

`name`
:   Specifies the identifier for the database to alter. If the identifier contains spaces, special characters, or mixed-case characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_db_name`
:   Specifies the new identifier for the database; must be unique for your account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    When an object is renamed, other objects that reference it must be updated with the new name.

`SWAP WITH target_db_name`
:   Swaps all objects (schemas, tables, views, etc.) and metadata, including identifiers, between the two specified databases. Also swaps all access
    control privileges granted on the databases and objects they contain. `SWAP WITH` essentially performs a rename of both databases as a
    single operation.

`SET ...`
:   Specifies one (or more) properties to set for the database (separated by blank spaces, commas, or new lines):

    `DATA_RETENTION_TIME_IN_DAYS = num`
    :   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the database, as well as specifying the
        default Time Travel retention time for all schemas created in the database.

        The value you can specify depends on the Snowflake Edition you are using:

        * Standard Edition: `0` or `1`
        * Enterprise Edition (or higher): `0` to `90`

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in the database
        to prevent streams on the tables from becoming stale.

        For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `EXTERNAL_VOLUME = external_volume_name`
    :   Object parameter that specifies the default external volume to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

        For more information about this parameter, see [EXTERNAL_VOLUME](../parameters.md).

    `CATALOG = catalog_integration_name`
    :   Object parameter that specifies the default catalog integration to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

        For more information about this parameter, see [CATALOG](../parameters.md).

    `ICEBERG_VERSION_DEFAULT = integer`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the version of the Apache Iceberg™ table specification that Iceberg tables conform to.

        Values:
        :   `2`: New tables conform with Iceberg version 2.

            `3`: New tables conform with Iceberg version 3.

        > **Caution:**
        >
        > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
        > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
        > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
        > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

        Default:
        :   `2`

        For more information about this parameter, see [ICEBERG_VERSION_DEFAULT](../parameters.md).

    `ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

        Values:
        :   `TRUE`: New tables use merge-on-read behavior.

            `FALSE`: New tables use copy-on-write behavior.

        Default:
        :   `TRUE`

        For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
        and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

    `REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
    :   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results for an
        [Iceberg table](create-iceberg-table.md).
        You can only set this parameter for tables that use an external Iceberg catalog.

        * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
        * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
          characters in a Parquet data file.

        Default: `FALSE`

    `DEFAULT_DDL_COLLATION = 'collation_specification'`
    :   Specifies a default [collation specification](../collation.md) for:

        * Any new columns added to existing tables in the database.
        * All columns in new tables added to the database.

        Setting the parameter does not change the collation specification for any existing columns.

        For more information about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

    `DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU = compute_pool_name`
    :   CPU compute pool name that overrides the default CPU compute pool Snowflake provisioned in your account for running Notebooks. For more information, see [System compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    `DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU = compute_pool_name`
    :   GPU compute pool name that overrides the default GPU compute pool Snowflake provisioned in your account for running Notebooks. For more information, see [System compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    `OBJECT_VISIBILITY = {object_visibility_spec | PRIVILEGED }`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the visibility of objects in the database, which controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md)
        and enables users without explicit access privileges to find objects and request access.

        * A YAML specification describing the visibility in one of the following formats:

          ```sqlexample-yaml
          $$
          organization_targets:
            - all_accounts_including_external
          $$
          ```

          Or

          ```sqlexample-yaml
          $$
          organization_targets:
            - account: <account_name_1>
            - account: <account_name_2>
            - ...
            - organization_user_group: <org_user_group_1>
            - organization_user_group: <org_user_group_2>
          $$
          ```

          In the syntax above:

          + `all_accounts_including_external`: Specifies that all users in all accounts in the organization can see the object. This includes
            all accounts within the organization, even those to which external parties may have been given access, such as
            [reader accounts](../../user-guide/data-sharing-reader-create.md).
          + `account: account_name`: Specifies that all users in the specified account can see the object. You can specify multiple accounts.
            Note that `account` is the account name, not the account locator. You must specify only the account name, excluding the organization name.09-22
          + `organization_user_group: org_user_group`: Specifies that the specified [organization user group](../../user-guide/organization-users.md) can
            see the object in all accounts in the organization where the [organization user group has been imported](../../user-guide/organization-users.md).
        * `PRIVILEGED`: Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object.
          This is the default behavior in Snowflake.

        For examples, see [Make database objects discoverable in Universal Search](../../user-guide/ui-snowsight/object-visibility-universal-search.md).

        Default: `'PRIVILEGED'`

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
        the specified level (and at more severe levels) are ingested.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `METRIC_LEVEL = 'metric_level'`
    :   Specifies whether metrics data should be ingested and made available in the active event table.

        For more information, see [METRIC_LEVEL](../parameters.md) and [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `TRACE_LEVEL = 'trace_level'`
    :   Controls how trace events are ingested into the event table.

        For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting the trace level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
    :   Specifies the storage serialization policy for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) that use Snowflake as the catalog.

        * `COMPATIBLE`: Snowflake performs encoding and compression of data files that ensures interoperability with third-party compute engines.
        * `OPTIMIZED`: Snowflake performs encoding and compression of data files that ensures the best table performance within Snowflake.

        Default: `OPTIMIZED`

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `EVENT_TABLE = event_table_name`
    :   Specifies the fully-qualified name of the event table that should collect telemetry data from objects in the database, such as
        procedures and UDFs.

        For more information, see [Associate an event table with an object](../../developer-guide/logging-tracing/event-table-setting-up.md).

        Associating an event table with a database is available in [Enterprise Edition or higher](../../user-guide/intro-editions.md).

    `CLASSIFICATION_PROFILE = 'profile_name'`
    :   Sets a classification profile on the database to implement [sensitive data classification](../../user-guide/classify-auto.md)
        for all of the tables and views in the database.

        Specify `profile_name` as a fully qualified name of a classification profile (that is, an instance of the
        CLASSIFICATION_PROFILE class).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the database.

    `CATALOG_SYNC = 'snowflake_open_catalog_integration_name'`
    :   Specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).
        If specified, Snowflake syncs Snowflake-managed Apache Iceberg™ tables in the database with an external catalog in your Snowflake Open Catalog account. For more
        information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

        For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

        Default: No value

    `REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' }`
    :   Specifies if all the schemas in the database are eligible for replication.
        You can set this property to `NO` for a database, and then allow some schemas
        to be replicated by setting the equivalent property to `YES` for those schemas.

        For more information about this parameter, see [Schema-level replication for failover groups](../../user-guide/account-replication-config.md).

        Default: `'YES'`

    `DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE`
    :   Specifies the default warehouse to be used when creating a notebook using SQL.

    `BASE_LOCATION_PREFIX = 'string'`
    :   Specifies a prefix for Snowflake to use in the write path for Snowflake-managed Apache Iceberg™ tables.
        For more information,
        see [data and metadata directories for Iceberg tables](../../user-guide/tables-iceberg-storage.md) and
        [BASE_LOCATION_PREFIX](../parameters.md) in the Snowflake Parameters topic.

        Default: No value

    `ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
    :   Specifies whether Snowflake should enable data compaction on Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

        * `TRUE`: Snowflake performs data compaction on the tables.
        * `FALSE`: Snowflake doesn’t perform data compaction on the tables.

        Default: `TRUE`

        For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

    `DATA_QUALITY_MONITORING_SETTINGS = yaml_spec`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts that are Enterprise Edition (or higher).

        To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Specifies settings that control whether notifications are sent when data quality issues are detected in the database. Set the property
        to a [dollar-quoted](../data-types-text.md) YAML specification in the following format:

        ```sqlexample-yaml
        $$
        notification:
          enabled: <boolean>
          email_recipients: [ <list_of_emails> ]
          integrations:
            - <notification_integration>
          cooldown_hours: <integer>
          metadata_included: <boolean>
        $$
        ```

        In the syntax above:

        * `enabled`: If `true`, notifications are sent when there is a data quality issue.
        * `email_recipients`: An array of email addresses, where each address is enclosed in single quotes.
        * `integrations`: Specifies a list of [notification integrations](create-notification-integration.md) that
          provide an interface between Snowflake and a third-party messaging service that sends the notifications.
        * `cooldown_hours`: Number of hours that Snowflake waits before sending another notification. Valid values are between `1` (one hour) and `720` (30 days), inclusive.
        * `metadata_included`: If `true`, the notification includes metadata that identifies which object within the database had
          the data quality issue. If `false`, the notification is sent, but it doesn’t identify which object had the issue.

        For more information about setting this parameter, including an example, see [Configure database settings for data quality notifications](../../user-guide/data-quality-notifications.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the database, which resets them to the defaults:

    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `EXTERNAL_VOLUME`
    * `CATALOG`
    * `ICEBERG_VERSION_DEFAULT`
    * `ENABLE_ICEBERG_MERGE_ON_READ`
    * `DEFAULT_DDL_COLLATION`
    * `TAG tag_name [ , tag_name ... ]`
    * `DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU`
    * `DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU`
    * `STORAGE_SERIALIZATION_POLICY`
    * `EVENT_TABLE = event_table_name`
    * `COMMENT`
    * `CATALOG_SYNC`
    * `REPLICABLE_WITH_FAILOVER_GROUPS`
    * `BASE_LOCATION_PREFIX`
    * `DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE`
    * `CLASSIFICATION_PROFILE`
    * `CONTACT purpose`
    * `ENABLE_DATA_COMPACTION`

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by a
    comma. When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

    You cannot unset the CONTACT property with other properties in the same statement.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the database from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the database and the DCM project without dropping the database. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Database replication and failover parameters

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

`ENABLE REPLICATION TO ACCOUNTS account_identifier [ , account_identifier ... ]`
:   Promotes a local database to serve as a primary database for replication. A primary database can be replicated in one or more accounts,
    allowing users in those accounts to query objects in each *secondary* (i.e. replica) database.

    Alternatively, modify an existing primary database to add to or remove from the list of accounts that can store a replica of the database.

    Provide a comma-separated list of accounts in your organization that can store a replica of this database.

    `account_identifier`
    :   Unique identifier of the account. The preferred identifier is `organization_name.account_name`. To view the list of accounts
        enabled for replication in your organization, query [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).

        Though the legacy account locator can also be used as the account identifier, its use is discouraged as it may not work in the future.
        For more information about using the account locator as an account identifier, see Database Replication and Failover Usage Notes.

    `IGNORE EDITION CHECK`
    :   Allows replicating data to accounts on lower editions in either of the following scenarios:

        * The primary database is in a Business Critical (or higher) account but one or more of the accounts approved for replication are on lower
          editions. Business Critical Edition is intended for Snowflake accounts with extremely sensitive data.
        * The primary database is in a Business Critical (or higher) account and a signed business associate agreement is in place to store PHI data
          in the account per HIPAA and [HITRUST](../../user-guide/intro-cloud-platforms.md) regulations, but no such agreement is in place for one or more of the
          accounts approved for replication, regardless if they are Business Critical (or higher) accounts.

        Both scenarios are prohibited by default in an effort to help prevent account administrators for Business Critical (or higher) accounts from
        inadvertently replicating sensitive data to accounts on lower editions.

`DISABLE REPLICATION [ TO ACCOUNTS account_identifier [ , account_identifier ... ] ]`
:   Disables replication for this primary database, meaning no replica of this database (i.e. secondary database) in another account can be refreshed.
    Any secondary databases remain linked to the primary database, but requests to refresh a secondary database are denied.

    Note that disabling replication for a primary database does not prevent it from being replicated to the same account; therefore, the database
    continues to be listed in the [SHOW REPLICATION DATABASES](show-replication-databases.md) output.

    Optionally provide a comma-separated list of accounts in your organization to disable replication for this database only in the specified
    accounts.

    `account_identifier`
    :   Unique identifier of the account. The preferred identifier is `organization_name.account_name`. To view the list of accounts
        enabled for replication in your organization, query [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).

        Though the legacy account locator can also be used as the account identifier, its use is discouraged as it may not work in the future.
        For more information about using the account locator as an account identifier, see Database Replication and Failover Usage Notes.

`REFRESH`
:   Refreshes a secondary database from a snapshot of its primary database. A snapshot includes changes to the objects and data.

`ENABLE FAILOVER TO ACCOUNTS account_identifier [ , account_identifier ... ]`
:   Specifies a comma-separated list of accounts in your organization where a replica of this primary database can be promoted to serve as the
    primary database.

    `account_identifier`
    :   Unique identifier of the account. The preferred identifier is `organization_name.account_name`. To view the list of accounts
        enabled for replication in your organization, query [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).

        Though the legacy account locator can also be used as the account identifier, its use is discouraged as it may not work in the future.
        For more information about using the account locator as an account identifier, see Database Replication and Failover Usage Notes.

`DISABLE FAILOVER [ TO ACCOUNTS account_identifier [ , account_identifier ... ] ]`
:   Disables failover for this primary database, meaning no replica of this database (i.e. secondary database) can be promoted to serve as the
    primary database.

    Optionally provide a comma-separated list of accounts in your organization to disable failover for this database only in the specified
    accounts.

    `account_identifier`
    :   Unique identifier of the account. The preferred identifier is `organization_name.account_name`. To view the list of accounts
        enabled for replication in your organization, query [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).

        Though the legacy account locator can also be used as the account identifier, its use is discouraged as it may not work in the future.
        For more information about using the account locator as an account identifier, see Database Replication and Failover Usage Notes.

`PRIMARY`
:   Promotes the specified secondary (replica) database to serve as the primary database. When promoted, the database becomes writeable. At the same
    time, the previous primary database becomes a read-only secondary database.

## Usage notes

* To rename a database, the role used to perform the operation must have the CREATE DATABASE global privilege and OWNERSHIP privilege on
  the database.
* To swap two databases, the role used to perform the operation must have OWNERSHIP privileges on both databases.
* To update a comment, the role used to perform the operation must be granted or inherit the MODIFY privilege on the database.
* To specify the default version of the Apache Iceberg™ specification that Iceberg tables conform to, you must use a role that has been granted the OWNERSHIP privilege on the database.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Database replication and failover usage notes

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

* Only account administrators (users with the ACCOUNTADMIN role) can enable and manage database replication and failover.
* A default 10 TB size limit is applied when a primary database is initially replicated to a secondary database. To change or remove the size limit,
  set the [INITIAL_REPLICATION_SIZE_LIMIT_IN_TB](../parameters.md) parameter at the account level.

  Note that there is currently no default size limit applied to subsequent refreshes of a secondary database.
* The preferred method of identifying an account uses the organization name and account name as the account
  identifier. If you decide to use the legacy account locator instead, see [Account identifiers for replication and failover](../../user-guide/admin-account-identifier.md).

## General examples

Rename database `db1` to `db2`:

> ```sqlexample
> ALTER DATABASE IF EXISTS db1 RENAME TO db2;
> ```

## Database replication examples

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

Use a replication or failover group to replicate and failover a single database. For examples, see one of the following:

* [Create a failover group to enable replication and failover for a database](create-failover-group.md).
* [Replicate a single database](create-replication-group.md).

---
title: ALTER DATABASE (catalog-linked)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-database-catalog-linked.md
section: SQL Commands
---

# ALTER DATABASE (catalog-linked)

Modifies the properties for an existing [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).

Database modifications include the following actions:

* Enabling or turning off automatic discovery.
* Changing the allowed and blocked namespaces.
* Changing the time interval that Snowflake should use for automatically discovering schemas and tables in your remote catalog.
* Changing whether your remote catalog is read only or writable.

## Syntax

```sqlsyntax
ALTER DATABASE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER DATABASE [ IF EXISTS ] <name> SUSPEND DISCOVERY

ALTER DATABASE [ IF EXISTS ] <name> RESUME DISCOVERY

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  ADD ( '<namespace>' [ , ... ] ) TO ALLOWED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  REMOVE ( '<namespace>' [ , ... ] ) FROM ALLOWED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  UNSET ALLOWED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  ADD ( '<namespace>' [ , ... ] ) TO BLOCKED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  REMOVE ( '<namespace>' [ , ... ] ) FROM BLOCKED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  UNSET BLOCKED_NAMESPACES

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  SET SYNC_INTERVAL_SECONDS = <value>

ALTER DATABASE [ IF EXISTS ] <name> UPDATE LINKED_CATALOG
  SET ALLOWED_WRITE_OPERATIONS = { NONE | ALL }

ALTER DATABASE [ IF EXISTS ] <name> SET [ BASE_LOCATION_PREFIX = '<string>' ]
                                        [ COMMENT = '<string_literal>' ]
                                        [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
                                        [ ICEBERG_VERSION_DEFAULT = <integer> ]
                                        [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]

ALTER DATABASE [ IF EXISTS ] <name> UNSET { BASE_LOCATION_PREFIX         |
                                            COMMENT                      |
                                            CONTACT                      |
                                            ICEBERG_VERSION_DEFAULT      |
                                            ENABLE_ICEBERG_MERGE_ON_READ
                                          }
```

## Parameters

`name`
:   Specifies the identifier for the catalog-linked database to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the catalog-linked database to `new_name`. The new identifier must be unique for the account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    When an object is renamed, other objects that reference it must be updated with the new name.

`SUSPEND DISCOVERY`
:   Suspends automatic discovery. You might want to suspend automatic discovery to prevent consuming unnecessary credits or
    resources if an underlying issue is preventing Snowflake from discovering the tables in your remote catalog. For example,
    you might want to suspend automatic discovery because there is an underlying issue with missing permissions or a misconfiguration.
    After you resolve the issue, run ALTER DATABASE … RESUME DISCOVERY to resume discovery.

    To confirm that automatic discovery is suspended, call the [SYSTEM$CATALOG_LINK_STATUS](../functions/system_catalog_link_status.md) function and
    verify that the `executionState` field is set to `SUSPENDED`. If you suspend automatic discovery but an automatic discovery task is
    currently running, the execution state won’t change to suspended until the task is complete.

    > **Note:**
    >
    > Suspending automatic discovery doesn’t turn off automated refresh. To turn off automated refresh for an existing
    > Iceberg table, see [Enable or turn off automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

`RESUME DISCOVERY`
:   Resumes automatic discovery. You might want to resume discovery for the following reasons:

    * You suspended discovery to resolve an issue and now you’re ready to resume discovery.
    * You want to force an immediate discovery run to ensure that recent changes, such as fixed permissions, are picked up.

    To confirm that automatic discovery is resumed, call the [SYSTEM$CATALOG_LINK_STATUS](../functions/system_catalog_link_status.md) function, and then
    verify that the `executionState` field is set to `RUNNING`.

`UPDATE LINKED_CATALOG`
:   Updates the properties that apply to catalog-linked databases. You can set the following properties:

    `ADD ( 'namespace1' [ , 'namespace2' ,  ... ] ) TO ALLOWED_NAMESPACES`
    :   Specifies one or more namespaces in your remote catalog to limit the scope of automatic discovery. Snowflake syncs the specified
        namespaces and all namespaces and tables that are nested under them.

        * If you created a catalog-linked database with an empty ALLOWED_NAMESPACES list, Snowflake syncs *all* of the namespaces and tables from the
          remote catalog.

          If you later alter the database by specifying the ALLOWED_NAMESPACES parameter to only allow a specific list of namespaces,
          Snowflake updates the catalog-linked database to only retain those namespaces you allow. All the other namespaces and tables are
          dropped from the catalog-linked database.
        * If you created a catalog-linked database with a list of ALLOWED_NAMESPACES, Snowflake only creates those allowed namespaces in
          the catalog-linked database.

          If you later alter the database to add namespaces to the ALLOWED_NAMESPACES list, Snowflake only creates the
          newly added namespaces and retains the existing allowed namespaces. If you remove namespaces from the ALLOWED_NAMESPACES list,
          Snowflake only drops the newly removed namespaces from the catalog-linked database and retains all of the remaining allowed namespaces.

        If a nested namespace is in the ALLOWED_NAMESPACES list but you set the
        NAMESPACE_MODE parameter to IGNORE_NESTED_NAMESPACE, Snowflake doesn’t sync the nested namespace or any schemas and tables under it.

    `REMOVE ( 'namespace1' [ , 'namespace2' ,  ... ] ) FROM ALLOWED_NAMESPACES`
    :   Specifies one or more namespaces in your remote catalog to remove from your list of allowed namespaces.

    `UNSET ALLOWED_NAMESPACES`
    :   Unsets your list of allowed namespaces to the default, which is all namespaces are allowed.

    `ADD ( 'namespace1' [ , 'namespace2' ,  ... ] ) TO BLOCKED_NAMESPACES`
    :   Specifies one or more namespaces in your remote catalog to block for automatic discovery.

        Snowflake blocks the specified namespaces and all namespaces and tables that are nested under them.

        If you specify both ALLOWED_NAMESPACES and BLOCKED_NAMESPACES, the BLOCKED_NAMESPACES list takes precedence.
        For example, if `ns1.ns2` is allowed, but `ns1` is blocked, then Snowflake won’t sync `ns1.ns2`.

    `REMOVE ( 'namespace1' [ , 'namespace2' ,  ... ] ) FROM BLOCKED_NAMESPACES`
    :   Specifies one or more namespaces in your remote catalog to remove from your list of blocked namespaces.

    `UNSET BLOCKED_NAMESPACES`
    :   Unsets your list of blocked namespaces to the default, which is zero namespaces are blocked.

    `SET SYNC_INTERVAL_SECONDS = value`
    :   Specifies the time interval in seconds that Snowflake should use for automatically discovering schemas and tables in your remote catalog.
        You can reduce your credit consumption by setting a longer time interval.

        Values: 30 to 86400 (1 day), inclusive

        Default: 30 seconds

    `SET ALLOWED_WRITE_OPERATIONS = { NONE | ALL }`
    :   Specifies whether your catalog-linked database is read-only or writable.

        * `NONE`: Your catalog-linked database is read-only.

          When your catalog-linked database is read only, any operation that you run that requires committing to the catalog fails. For
          example, DROP ICEBERG TABLE.
        * `ALL`: Your catalog-linked database is writable.

          > **Warning:**
          >
          > When your catalog-linked database has write permissions enabled, Snowflake propagates table drops to the remote catalog, which removes
          > the table and data from both systems.

        Default: `ALL`

`SET ...`
:   Specifies one or more properties or parameters to set for the catalog-linked database, separated by blank spaces, commas, or new lines:

    `BASE_LOCATION_PREFIX = 'string'`
    :   Specifies a prefix for Snowflake to use in the write path for externally managed Apache Iceberg™ tables.
        For more information,
        see [Data and metadata directories for Iceberg tables](../../user-guide/tables-iceberg-storage.md) and
        [BASE_LOCATION_PREFIX](../parameters.md).

        Default: No value

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the catalog-linked database.

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `ICEBERG_VERSION_DEFAULT = integer`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the version of the Apache Iceberg™ table specification that Iceberg tables conform to.

        Values:
        :   `2`: New tables conform with Iceberg version 2.

            `3`: New tables conform with Iceberg version 3.

        > **Caution:**
        >
        > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
        > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
        > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
        > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

        Default:
        :   `2`

        For more information about this parameter, see [ICEBERG_VERSION_DEFAULT](../parameters.md).

    `ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

        Values:
        :   `TRUE`: New tables use merge-on-read behavior.

            `FALSE`: New tables use copy-on-write behavior.

        Default:
        :   `TRUE`

        For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
        and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

`UNSET ...`
:   Specifies one or more properties or parameters to unset for the database, which resets them to the defaults:

    * `BASE_LOCATION_PREFIX`
    * `COMMENT`
    * `CONTACT`
    * `ICEBERG_VERSION_DEFAULT`
    * `ENABLE_ICEBERG_MERGE_ON_READ`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | The catalog-linked database being modified. | Required to suspend or resume automatic table discovery. |
| OWNERSHIP or MODIFY | The catalog-linked database being modified. | Required for all other operations. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Reset the list of allowed namespaces for a catalog-linked database named `my_linked_db` to the default.

```sqlexample
ALTER DATABASE IF EXISTS my_linked_db UPDATE LINKED_CATALOG
  UNSET ALLOWED_NAMESPACES;
```

Add `my_namespace` to the list of allowed namespaces for a catalog-linked database named `my_linked_db`.

```sqlexample
ALTER DATABASE IF EXISTS my_linked_db UPDATE LINKED_CATALOG
 ADD ('my_namespace') TO ALLOWED_NAMESPACES;
```

---
title: ALTER DATABASE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-database-role.md
section: SQL Commands
---

# ALTER DATABASE ROLE

Modifies the properties for an existing database role.

Currently, the only supported operations are renaming a database role or adding/overwriting/removing a comment for a database role.

See also:
:   [CREATE DATABASE ROLE](create-database-role.md) , [DROP DATABASE ROLE](drop-database-role.md) , [SHOW DATABASE ROLES](show-database-roles.md)

## Syntax

```sqlsyntax
ALTER DATABASE ROLE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER DATABASE ROLE [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER DATABASE ROLE [ IF EXISTS ] <name> UNSET COMMENT

ALTER DATABASE ROLE [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER DATABASE ROLE [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER DATABASE ROLE [ IF EXISTS ] <name> UNSET DCM PROJECT
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) for the database role; must be unique in the database in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified in the form of `db_name.database_role_name`, the command looks for the database role
    in the current database for the session.

`RENAME TO new_name`
:   Specifies the new identifier for the database role; must be unique for your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    Note that when specifying the fully-qualified name of the database role, you cannot specify a different database. The name of
    the database, `db_name`, must remain the same. Only the `database_role_name` can change during a rename operation.

`SET ...`
:   Specifies the properties to set for the database role:

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the database role.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the properties to unset for the database role, which resets them to the defaults.

    * `COMMENT`
    * `TAG tag_name [ , tag_name ... ]`

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the database role from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the database role and the DCM project without dropping the database role. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Access control privileges

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Database role | Only the database role owner (i.e. the database role with the OWNERSHIP privilege on the database role), or a higher role, can execute this command.  The owner role does not inherit any permissions granted to the owned database role. To inherit permissions from a database role, that database role must be granted to another role, creating a parent-child relationship in a role hierarchy. |
| APPLY | Tag | Enables setting a tag on a database role. |

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename database role `dr1` to `dbr2` in database `d1`:

> ```sqlexample
> ALTER DATABASE ROLE d1.dr1 RENAME TO d1.dbr2;
> ```

Add a comment for database role `d1.dbr2`:

> ```sqlexample
> ALTER DATABASE ROLE d1.dbr2 SET COMMENT = 'New comment for database role';
> ```

---
title: ALTER DATASET
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dataset.md
section: SQL Commands
---

# ALTER DATASET

Modifies a dataset by adding or dropping dataset versions. When you add a version, you can specify properties such as partitioning, comments, or custom metadata.

The following are the command variants:

* [ALTER DATASET … ADD VERSION](alter-dataset-add-version.md)
* [ALTER DATASET … DROP VERSION](alter-dataset-drop-version.md)

---
title: ALTER DATASET … ADD VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dataset-add-version.md
section: SQL Commands
---

# ALTER DATASET … ADD VERSION

Adds a version to a dataset. When you add a version, you can specify properties such as partitioning, comments, or custom metadata.

See also:
:   [ALTER DATASET](alter-dataset.md) , [ALTER DATASET … DROP VERSION](alter-dataset-drop-version.md)

## Syntax

```sqlsyntax
ALTER DATASET <name> ADD VERSION <version_name>
  FROM <select_statement>
  [ PARTITION BY <string_expr> ]
  [ COMMENT = <string_literal> ]
  [ METADATA = <json_string_literal> ]
```

## Parameters

`name`
:   The name of the dataset that you’re altering.

`ADD VERSION version_name`
:   The name of the new dataset version that you’re creating.

`FROM select_statement`
:   The SQL statement that defines the data for the new dataset version.

`PARTITION BY string_expr`
:   The partitioning expression for the new dataset version.

`COMMENT = string_literal`
:   A comment for the new dataset version.

`METADATA = json_string_literal`
:   A JSON string containing metadata for the new dataset version.
    The following is an example of a JSON string.

    ```json
    {"source": "my_table", "job_id": "123"}
    ```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Dataset | Provides the privilege to both read and modify the dataset. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example adds version `v1` to the `abc` dataset with partitioning:

```sqlexample
ALTER DATASET abc
ADD VERSION 'v1' FROM (
    SELECT seq4() as ID, uniform(1, 10, random(721)) as PART
    FROM TABLE(GENERATOR(ROWCOUNT => 100000)) v)
PARTITION BY PART
COMMENT = 'Initial version'
METADATA = '{"source":"some_table","created_by":"analyst1"}';
```

---
title: ALTER DATASET … DROP VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dataset-drop-version.md
section: SQL Commands
---

# ALTER DATASET … DROP VERSION

Drops a dataset version.

See also:
:   [ALTER DATASET](alter-dataset.md) , [ALTER DATASET … ADD VERSION](alter-dataset-add-version.md)

## Syntax

```sqlsyntax
ALTER DATASET [ IF EXISTS ] <name> DROP VERSION <version_name>
```

## Parameters

`name`
:   The name of the dataset that you’re dropping.

`DROP VERSION version_name`
:   The name of the dataset version that you’re dropping.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Dataset | Provides the privilege to both read and modify the dataset. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example drops version `v1` of the `my_dataset` dataset:

```sqlexample
ALTER DATASET my_dataset
DROP VERSION 'v1';
```

---
title: ALTER DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dbt-project.md
section: SQL Commands
---

# ALTER DBT PROJECT

Modifies the properties of an existing [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

See also:
:   [CREATE DBT PROJECT](create-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [DESCRIBE DBT PROJECT](desc-dbt-project.md), [DROP DBT PROJECT](drop-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md)

## Syntax

```sqlsyntax
ALTER DBT PROJECT [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER DBT PROJECT <name> ADD VERSION [ <version_name_alias> ]
  FROM '<source_location>'

ALTER DBT PROJECT [ IF EXISTS ] <name> SET
  [ DBT_VERSION = '<version_number>' ]
  [ DEFAULT_TARGET = '<default_target>'' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [, ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER DBT PROJECT [ IF EXISTS ] <name> UNSET
  [ DBT_VERSION ]
  [ DEFAULT_TARGET ]
  [ EXTERNAL_ACCESS_INTEGRATIONS ]
  [ COMMENT ]
```

## Parameters

`name`
:   Specifies the identifier for the dbt project object to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the dbt project object to `new_name`. The new identifier must be unique for the schema.

    For more information about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`ADD VERSION [ version_name_alias ]`
:   Creates a new version by incrementally increasing the current version identifier by one; for example, from `version$2` to `version$3`.

    The `version name alias` is optional and is a custom identifier that corresponds to the newly created version identifier. The `version name alias` must follow [Identifier requirements](../identifiers-syntax.md).

    `FROM 'source_location'`
    :   A string that specifies the location of the source files and version for the dbt project from which the version will be created.

        The dbt project source files can be in any one of the following locations:

        > * **A Git repository stage**, for example:
        >
        >   `'@my_db.my_schema.my_git_repository_stage/branches/my_branch/path/to/dbt_project_or_projects_parent'`
        >
        >   For more information about creating a Git repository object in Snowflake that connects a Git repository to a workspace for dbt Projects on Snowflake, see [Create a workspace connected to your Git repository](../../user-guide/tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md). For more information about creating and managing a Git repository object and stage without using a workspace, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md) and [CREATE GIT REPOSITORY](create-git-repository.md).
        > * **An existing dbt project stage**, for example:
        >
        >   `'snow://dbt/my_db.my_schema.my_existing_dbt_project_object/versions/last'`
        >
        >   The version specifier is required and can be `last` (as shown in the previous example), `first`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](../../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).
        > * **An internal named stage**, for example:
        >
        >   `'@my_db.my_schema.my_internal_named_stage/path/to/dbt_projects_or_projects_parent'`
        >
        >   Internal user stages and table stages aren’t supported.
        > * **A workspace for dbt on Snowflake**, for example:
        >
        >   `'snow://workspace/user$.public."my_workspace_name"/versions/live/path/to/dbt_projects_or_projects_parent'`
        >
        >   We recommend enclosing the workspace name in double quotes because workspace names are case-sensitive and can contain special characters.
        >
        >   The version specifier is required and can be `last`, `first`, `live`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](../../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).

`SET ...`
:   Sets one or more specified properties or parameters for the dbt project object:

    `DBT_VERSION = 'version_number'`
    :   Specifies a version for the dbt Project.

        If no value is specified, the system uses version 1.9.4 by default.

    `DEFAULT_TARGET = default_target`
    :   Specifies the profile used for compilation and subsequent runs (for example, `prod`) of the dbt project object. This parameter can be overridden by using the [EXECUTE DBT PROJECT](execute-dbt-project.md)
        command with `ARGS = --target`.

    `EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
    :   Specifies the external access integrations used to grant permissions to pull remote dependencies from dbt package hub or GitHub. When declared on an object, `dbt deps` will run automatically during deployment.
        For more information, see [Understand dependencies for dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake-dependencies.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the dbt project object.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the dbt project object to NULL or no value:

    * `DBT_VERSION`
    * `DEFAULT_TARGET`
    * `EXTERNAL_ACCESS_INTEGRATIONS`
    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | dbt project | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Set a dbt version

The following example sets a new dbt version to a dbt project object:

```sqlexample
ALTER DBT PROJECT finance_analytics SET dbt_version = '1.10.15';
```

### Add a new version

The following example updates a Git repository object in Snowflake to fetch the latest code from the Git repository and then updates the
contents of the dbt project object by adding a new version:

```sqlexample
-- Update the Git repository object to fetch the latest code

ALTER GIT REPOSITORY sales_db.integrations_schema.sales_dbt_git_stage FETCH;

-- Add a new version to the dbt project object based on the updated Git repository object

ALTER DBT PROJECT sales_db.dbt_projects_schema.sales_model
  ADD VERSION
  FROM '@sales_db.integrations_schema.sales_dbt_git_stage/branches/main/sales_dbt_project';
```

### Set a default target and new external access integration

The following example updates an existing dbt project object with the following changes:

* Sets a default target that Snowflake uses when executing EXECUTE DBT PROJECT without specifying a `--target` argument. For example, if
  `DEFAULT_TARGET = 'prod'`, then a command such as `EXECUTE DBT PROJECT sales_db.dbt_projects_schema.sales_model RUN;` would
  automatically run using the `prod` target unless overridden by `ARGS = --target`.
* Assigns an external access integration for the dbt project to use.

  You can provide a single integration or a list: `EXTERNAL_ACCESS_INTEGRATIONS = ('integration1', 'integration2')`.

```sqlexample
ALTER DBT PROJECT sales_db.dbt_projects_schema.sales_model SET
  DEFAULT_TARGET = 'prod',
  EXTERNAL_ACCESS_INTEGRATIONS = ('my_external_access_integration');
```

### Revert to the system default version

The following example reverts the dbt project to the system default version, which is currently 1.9.4.

```sqlexample
ALTER DBT PROJECT finance_analytics UNSET DBT_VERSION;
```

```output
Statement executed successfully.
```

---
title: ALTER DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dcm-project.md
section: SQL Commands
---

# ALTER DCM PROJECT

Modifies the properties of an existing [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md).

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [DESCRIBE DCM PROJECT](desc-dcm-project.md), [DROP DCM PROJECT](drop-dcm-project.md), [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md),
    [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
ALTER DCM PROJECT [ IF EXISTS ] <name> SET
  [ LOG_LEVEL = <log_level> ]
  [ COMMENT = '<string_literal>' ]

ALTER DCM PROJECT [ IF EXISTS ] <name> UNSET
  [ LOG_LEVEL ]
  [ COMMENT ]
```

## Required parameters

`name`
:   Specifies the identifier for the DCM project to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Sets one or more specified properties or parameters for the DCM project:

Optional parameters
:   `LOG_LEVEL = log_level`
    :   Sets the logging level for the DCM project.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

        The value can be one of the following:

        * `TRACE`
        * `DEBUG`
        * `INFO`
        * `WARN`
        * `ERROR`
        * `FATAL`
        * `OFF`

        Default: `OFF`

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the DCM project.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the DCM project, which resets the properties to their defaults:

    * `LOG_LEVEL`
    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | DCM project | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example sets the logging level for the DCM project named `my_project` to `DEBUG`:

```sqlexample
ALTER DCM PROJECT my_project SET LOG_LEVEL = DEBUG;
```

The following example adds a comment to the DCM project named `my_project`:

```sqlexample
ALTER DCM PROJECT my_project SET COMMENT = 'Updated project for Q4 data management';
```

---
title: ALTER DYNAMIC TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-dynamic-table.md
section: SQL Commands
---

# ALTER DYNAMIC TABLE

Modifies the properties of a [dynamic table](../../user-guide/dynamic-tables-about.md).

See also:
:   [CREATE DYNAMIC TABLE](create-dynamic-table.md), [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md), [DROP DYNAMIC TABLE](drop-dynamic-table.md), [SHOW DYNAMIC TABLES](show-dynamic-tables.md)

## Syntax

```sqlsyntax
ALTER DYNAMIC TABLE [ IF EXISTS ] <name> { SUSPEND | RESUME }

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> SWAP WITH <target_dynamic_table_name>

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> REFRESH [ COPY SESSION ]

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> { clusteringAction }

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> { tableColumnCommentAction }

ALTER DYNAMIC TABLE <name> { SET | UNSET } COMMENT = '<string_literal>'

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> dataGovnPolicyTagAction

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> searchOptimizationAction

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> SET
  [ TARGET_LAG = { '<num> { seconds | minutes | hours | days }'  | DOWNSTREAM } ],
  [ SCHEDULER = DISABLE | ENABLE ],
  [ WAREHOUSE = <warehouse_name> ],
  [ INITIALIZATION_WAREHOUSE = <warehouse_name> ],
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ],
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ],
  [ LOG_LEVEL = '<log_level>' ],
  [ CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ],
  [ IMMUTABLE WHERE ( <expr> ) ],
  [ EXECUTE AS USER <user_name>
    [ USE SECONDARY ROLES { ALL | NONE | <role> [ , ... ] } ]
  ]
  [ ROW_TIMESTAMP = { TRUE | FALSE } ]

ALTER DYNAMIC TABLE [ IF EXISTS ] <name> UNSET
  [ INITIALIZATION_WAREHOUSE ],
  [ DATA_RETENTION_TIME_IN_DAYS ],
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS ],
  [ DEFAULT_DDL_COLLATION ],
  [ LOG_LEVEL ],
  [ CONTACT <purpose> ],
  [ IMMUTABLE WHERE ],
  [ EXECUTE AS USER ],
  [ ROW_TIMESTAMP ],
  [ DCM PROJECT ]
```

Where:

> ```sqlsyntax
> clusteringAction ::=
>   {
>     CLUSTER BY ( <expr> [ , <expr> , ... ] )
>     | { SUSPEND | RESUME } RECLUSTER
>     | DROP CLUSTERING KEY
>   }
> ```
>
> For more information, see [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).
>
> ```sqlsyntax
> tableCommentAction ::=
>   {
>     ALTER | MODIFY [ ( ]
>                            [ COLUMN ] <col1_name> COMMENT '<string>'
>                          , [ COLUMN ] <col1_name> UNSET COMMENT
>                        [ , ... ]
>                    [ ) ]
>   }
> ```
>
> ```sqlsyntax
> dataGovnPolicyTagAction ::=
>   {
>       ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ROW ACCESS POLICY <policy_name>
>     | DROP ROW ACCESS POLICY <policy_name> ,
>         ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ALL ROW ACCESS POLICIES
>   }
>   |
>   {
>     SET AGGREGATION POLICY <policy_name>
>       [ ENTITY KEY ( <col_name> [, ... ] ) ]
>       [ FORCE ]
>   | UNSET AGGREGATION POLICY
>   }
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] [ FORCE ]
>       | UNSET MASKING POLICY
>   }
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> SET TAG
>       <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>       , [ COLUMN ] <col2_name> SET TAG
>           <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET PROJECTION POLICY <policy_name>
>           [ FORCE ]
>       | UNSET PROJECTION POLICY
> }
> |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
>                   , [ COLUMN ] <col2_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
>   |
>   {
>       SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>     | UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
> ```
>
> ```sqlsyntax
> searchOptimizationAction ::=
>   {
>     ADD SEARCH OPTIMIZATION [
>       ON <search_method_with_target> [ , <search_method_with_target> ... ]
>         [ EQUALITY ]
>       ]
>
>     | DROP SEARCH OPTIMIZATION [
>       ON { <search_method_with_target> | <column_name> | <expression_id> }
>         [ EQUALITY ]
>         [ , ... ]
>       ]
>
>     | SUSPEND SEARCH OPTIMIZATION [
>        ON { <search_method_with_target> | <column_name> | <expression_id> }
>           [ , ... ]
>      ]
>
>     | RESUME SEARCH OPTIMIZATION [
>        ON { <search_method_with_target> | <column_name> | <expression_id> }
>           [ , ... ]
>      ]
>   }
> ```
>
> For details, see [Search optimization actions (searchOptimizationAction)](alter-table.md).

## Parameters

`name`
:   Identifier for the dynamic table to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SUSPEND | RESUME`
:   Specifies the action to perform on the dynamic table:

    * `SUSPEND` suspends refreshes on the dynamic table. If the dynamic table is used
      by other dynamic tables, they are also suspended.
    * `RESUME` resumes refreshes on the dynamic table. Resume operations cascade
      downstream to all downstream dynamic tables not manually suspended.

    When `SCHEDULER = DISABLE`, the command isolates the dynamic table from pipeline scheduling. `TARGET_LAG`-based refresh is
    suspended, and a manual refresh on this dynamic table doesn’t cascade to upstream dynamic tables.

`RENAME TO new_name`
:   Renames the specified dynamic table with a new identifier that is not currently used by
    any other dynamic tables in the schema.

    Renaming a dynamic table requires the CREATE DYNAMIC TABLE privilege on the schema for
    the dynamic table.

    You can also move the dynamic table to a different database and/or schema while
    optionally renaming the dynamic table. To do so, specify a qualified `new_name`
    value that includes the new database and/or schema name in the form
    `db_name.schema_name.new_name` or `schema_name.new_name`,
    respectively.

    The following restrictions apply:

    * The destination database and/or schema must already exist. In addition, an object
      with the same name cannot already exist in the new location; otherwise, the
      statement returns an error.
    * You can’t move an object to a managed access schema unless the object owner
      (that is, the role that has the OWNERSHIP privilege on the object) also owns the
      target schema.
    * When an object (table, column, etc.) is renamed, other objects that reference it
      must be updated with the new name.

`SWAP WITH target_dynamic_table_name`
:   Swaps two dynamic tables in a single transaction. The role used to perform this
    operation must have OWNERSHIP privileges on both dynamic tables.

    The following restrictions apply:

    * You can only swap a dynamic table with another dynamic table.

`REFRESH [ COPY SESSION ]`
:   Specifies that the dynamic table should be manually refreshed.

    Both user-suspended and auto-suspended dynamic tables can be manually refreshed.
    Manually refreshed dynamic tables return MANUAL as the output for `refresh_trigger`
    in the DYNAMIC_TABLE_REFRESH_HISTORY function.

    When `SCHEDULER = DISABLE`, refreshing a dynamic table only refreshes that table and doesn’t cascade to any other
    dynamic tables.

    When `SCHEDULER = ENABLE`, refreshing a dynamic table also refreshes all upstream dynamic tables, but the cascade
    stops at any upstream dynamic table that has `SCHEDULER = DISABLE`.

    For information on dynamic table refresh status, see [DYNAMIC_TABLE_REFRESH_HISTORY](../functions/dynamic_table_refresh_history.md).

    `COPY SESSION`

    > Runs the refresh operation in a copy of the current session using the current user and
    > warehouse.
    >
    > This only applies to a single manual refresh; it does not permanently update the credentials for the dynamic table.
    > Use the [GRANT OWNERSHIP](grant-ownership.md) command to transfer the ownership for scheduled
    > refreshes. For more information, see [Transfer ownership](../../user-guide/dynamic-tables-privileges.md).
    >
    > The primary role is the role that owns the dynamic table and secondary roles will match
    > the DEFAULT_SECONDARY_ROLES property of the user.

`SET ...`
:   Specifies one or more properties/parameters to set for the table (separated by blank
    spaces, commas, or new lines):

    `TARGET_LAG = { num { seconds | minutes | hours | days } | DOWNSTREAM }`
    :   > Specifies the target lag for the dynamic table:

        `'num seconds | minutes | hours | days'`
        :   Specifies the maximum amount of time that the dynamic table’s content should lag
            behind updates to the base tables.

            For example:

            * If the data in the dynamic table should lag by no more than 5 minutes, specify `5 minutes`.
            * If the data in the dynamic table should lag by no more than 5 hours, specify `5 hours`.

            The minimum value is 1 minute. If a dynamic table A depends on another dynamic
            table B, the minimum lag for A must be greater than or equal to the lag for B.

        `DOWNSTREAM`
        :   Specifies that the dynamic table should be refreshed if any dynamic table
            downstream of it is refreshed.

    `SCHEDULER = { DISABLE | ENABLE }`
    :   Specifies whether the dynamic table is to be refreshed automatically by Snowflake’s dynamic table scheduler.

        `DISABLE`
        :   Excludes the dynamic table from automatic background refresh. The table isn’t refreshed on a schedule, either directly or
            through downstream dependencies.

        * Manual control: Refreshing must be triggered manually by using `ALTER DYNAMIC TABLE ... REFRESH`.
        * Isolation: A manual refresh of a disabled table doesn’t automatically refresh its upstream dependencies. This creates a “isolation
          boundary,” allowing external orchestrators, like dbt, to manage specific table refreshes in isolation without triggering the entire
          pipeline.
        * `TARGET_LAG` can’t be defined when `SCHEDULER = DISABLE`.

        `ENABLE`
        :   Enables the automated background scheduler for the dynamic table. The scheduler ensures that the table is refreshed alongside its
            dependencies to maintain snapshot consistency. In this mode, Snowflake automatically calculates the optimal refresh frequency based on
            the defined `TARGET_LAG`.

    `WAREHOUSE = warehouse_name`
    :   Specifies the name of the warehouse that provides the compute resources for
        refreshing the dynamic table.

        You must use a role that has the USAGE privilege on this warehouse. For more information, see [Privileges to create a dynamic table](../../user-guide/dynamic-tables-privileges.md).

        For guidance on choosing a warehouse for optimal refresh performance, see [Adjust your warehouse configuration](../../user-guide/dynamic-tables-performance-optimize.md).

    `INITIALIZATION_WAREHOUSE = warehouse_name`
    :   Specifies a warehouse to use for all dynamic table [initializations and reinitializations](../../user-guide/dynamic-tables-refresh.md).

        When this parameter is set, the specified warehouse is used for all initializations and reinitializations; otherwise, the dynamic
        table uses the warehouse that is specified by the required WAREHOUSE parameter for all refreshes.

        You must use a role that has the USAGE privilege on this warehouse. For more information, see [Privileges to create a dynamic table](../../user-guide/dynamic-tables-privileges.md).

    `DATA_RETENTION_TIME_IN_DAYS = integer`
    :   Object-level parameter that modifies the retention period for the dynamic table for
        Time Travel. For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md) and
        [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md).

        For a detailed description of this parameter and more information about object
        parameters, see [Parameters](../parameters.md).

        Values:

        > * Standard Edition: `0` or `1`
        > * Enterprise Edition:
        >
        >   + `0` to `90` for permanent dynamic tables
        >   + `0` or `1` for transient dynamic tables

        > **Note:**
        >
        > A value of `0` effectively disables Time Travel for the dynamic table.

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days Snowflake can extend the
        data retention period to prevent streams on the dynamic table from becoming stale.

        For a detailed description of this parameter, see
        [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `DEFAULT_DDL_COLLATION = 'collation_specification'`
    :   Specifies a default [collation specification](../collation.md)
        for any new columns added to the dynamic table.

        Setting this parameter does not change the collation specification for any
        existing columns.

        For more information, see [DEFAULT_DDL_COLLATION](../parameters.md).

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of [events for this dynamic table](../../user-guide/dynamic-tables-monitor-event-table-alerts.md) that are
        ingested and made available in the active event table. Events at the specified level (and at more severe levels) are
        ingested.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `IMMUTABLE WHERE`
    :   Specifies a condition that defines the immutable portion of the dynamic table. For more
        information, see [Understanding immutability constraints](../../user-guide/dynamic-tables-immutability-constraints.md). If the dynamic table has
        primary key or unique constraints with the RELY property, the columns in the predicate must be a subset of
        the columns in those RELY constraints. For details, see [Interaction with primary key and unique constraints (RELY)](../../user-guide/dynamic-tables-immutability-constraints.md).

    `EXECUTE AS USER user_name`
    :   Refreshes the dynamic table as the specified user, rather than as the SYSTEM user.

        To specify EXECUTE AS USER, you must use a role that has been granted the IMPERSONATE privilege on the `user_name` user. To grant this privilege,
        run the [GRANT <privileges> … TO ROLE](grant-privilege.md) command.

        `USE SECONDARY ROLES { ALL | NONE | role [ , ... ] }`
        :   Specifies the secondary roles to use on the dynamic table. Can be used to override the default secondary roles that are otherwise used in execution.

            Can only be used with the EXECUTE AS USER option.

        For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../user-guide/dynamic-tables-privileges.md).

    `ROW_TIMESTAMP = { TRUE | FALSE }`
    :   Adds or removes row timestamps on the table.

        * `TRUE` adds row timestamps on the table.
        * `FALSE` removes row timestamps on the table. This parameter setting permanently deletes all stored METADATA$ROW_LAST_COMMIT_TIME values.
          Reenabling it will not restore these values and Time Travel queries will return nothing.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the dynamic table, which
    resets them back to their defaults:

    * `INITIALIZATION_WAREHOUSE`
    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `DEFAULT_DDL_COLLATION`
    * `LOG_LEVEL`
    * `CONTACT purposes`
    * `IMMUTABLE WHERE`
    * `EXECUTE AS USER`
    * `ROW_TIMESTAMP`
    * `DCM PROJECT`

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the dynamic table from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the dynamic table and the DCM project without dropping the dynamic table. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Clustering actions (`clusteringAction`)

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies (or modifies) one or more table columns or column expressions as the
    clustering key for the dynamic table. These are the columns/expressions for which
    clustering is maintained by Automatic Clustering. Before you specify a clustering key
    for a dynamic table, you should understand micro-partitions. For more information, see
    [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

    Note the following when using clustering keys with dynamic tables:

    * Column definitions are required and must be explicitly specified in the statement.
    * Clustering keys are not intended or recommended for all tables; they
      typically benefit very large (for example, multi-terabyte) tables.

`SUSPEND | RESUME RECLUSTER`
:   Enables or disables [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) for the dynamic table.

`DROP CLUSTERING KEY`
:   Drops the clustering key for the dynamic table.

For more information about clustering keys and reclustering, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Table comment actions (`tableCommentAction`)

`ALTER | MODIFY [ ( ]` . `[ COLUMN ] <col1_name> COMMENT '<string>'` . `, [ COLUMN ] <col1_name> UNSET COMMENT` . `[ , ... ]` . `[ ) ]`
:   Alters a comment or overwrites the existing comment for a column in the dynamic table.

`SET | UNSET COMMENT = '<string_literal>'`
:   Adds a comment or overwrites the existing comment for the dynamic table.

## Data Governance policy and tag actions (`dataGovnPolicyTagAction`)

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`policy_name`
:   Identifier for the policy; must be unique for your schema.

    `ADD ROW ACCESS POLICY policy_name ON (col_name [ , ... ])`
    :   Adds a row access policy to the dynamic table.

        At least one column name must be specified. Additional columns can be specified
        with a comma separating each column name.

    `DROP ROW ACCESS POLICY policy_name`
    :   Drops a row access policy from the dynamic table.

    `DROP ROW ACCESS POLICY policy_name, ADD ROW ACCESS POLICY policy_name ON ( col_name [ , ... ] )`
    :   Drops the row access policy that is set on the dynamic table and adds a row access
        policy to the same dynamic table in a single SQL statement.

    `DROP ALL ROW ACCESS POLICIES`
    :   Drops all [row access policy](../../user-guide/security-row-using.md) associations from the dynamic table.

        You must also use this clause to access a dynamic table that you restore from a [backup](../../user-guide/backups.md), if a
        row access policy applied to the table when the backup was created and the policy was later dropped. After the dynamic table
        is restored, you can’t query it until you run an ALTER TABLE command with the DROP ALL ROW ACCESS POLICIES clause.

    `{ ALTER | MODIFY } [ COLUMN ] ...`
    :   `USING ( col_name , cond_col_1 ... )`
        :   Specifies the arguments to pass into the conditional masking policy.

            The first column in the list specifies the data to be masked or tokenized based on
            policy conditions and must match the column to which the masking policy
            is applied.

            The additional columns specify which data to evaluate for masking or tokenization
            in each row of the query result when selecting from the first column.

            If the USING clause is omitted, Snowflake treats the conditional masking policy as a
            normal [masking policy](../../user-guide/security-column-intro.md).

    `SET AGGREGATION POLICY {policy_name}`
    :   `[ ENTITY KEY ({col_name} [ , ... ]) ] [ FORCE ]`
        :   Assigns an [aggregation policy](../../user-guide/aggregation-policies.md) to the dynamic table.

            Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the dynamic table. For
            more information, see [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).

            Use the optional FORCE parameter to atomically replace an existing aggregation policy with the new aggregation policy.

    `UNSET AGGREGATION POLICY`
    :   Detaches an aggregation policy from the dynamic table.

    `FORCE`
    :   Replaces a masking or projection policy that is currently set on a column with a
        different policy in a single statement.

        Note that using the `FORCE` keyword with a masking policy requires the
        [data type](../../sql-reference-data-types.md) of the policy in the ALTER DYNAMIC
        TABLE statement (i.e. STRING) to match the data type of the masking policy currently
        set on the column (i.e. STRING).

        If a masking policy is not currently set on the column, specifying this keyword has
        no effect.

        For details, see: [Replace a masking policy on a column](../../user-guide/security-column-intro.md) or
        [Replace a projection policy](../../user-guide/projection-policies.md).

## Search optimization actions (`searchOptimizationAction`)

`ADD SEARCH OPTIMIZATION`
:   Adds [search optimization](../../user-guide/search-optimization-service.md) for the
    entire dynamic table or, if you specify the optional `ON` clause, for specific
    columns.

    Search optimization can be expensive to maintain, especially if the data in the table
    changes frequently. For more information, see
    [Search optimization cost estimation and management](../../user-guide/search-optimization/cost-estimation.md).

`ON search_method_with_target [, search_method_with_target ... ]`
:   Specifies that you want to configure search optimization for specific columns or
    VARIANT fields (rather than the entire dynamic table).

    For `search_method_with_target`, use an expression with the following syntax:

    ```sqlsyntax
    <search_method>(<target> [, ...])
    ```

    Where:

    * `search_method` specifies one of the following methods that optimizes
      queries for a particular type of predicate:

      + `GEO`: Predicates that use GEOGRAPHY types.
      + `SUBSTRING`: Predicates that match substrings and regular expressions (for
        example, [[ NOT ] LIKE](../functions/like.md), [[ NOT ] ILIKE](../functions/ilike.md),
        [[ NOT ] RLIKE](../functions/rlike.md), [REGEXP_LIKE](../functions/regexp_like.md),
        etc.)
      + `EQUALITY`: Equality and IN predicates.
    * `target` specifies the column, VARIANT field, or an asterisk (\*).

      Depending on the value of `search_method`, you can specify a column or
      VARIANT field of one of the following types:

      + `GEO`: Columns of the GEOGRAPHY data type.
      + `SUBSTRING`: Columns of string or VARIANT data types, including paths to
        fields in VARIANTs. Specify paths to fields as described under `EQUALITY`;
        searches on nested fields are improved in the same way.
      + `EQUALITY`: Columns of numeric, string, binary, and VARIANT data types,
        including paths to fields in VARIANT columns.

        To specify a VARIANT field, use
        [dot or bracket notation](../../user-guide/querying-semistructured.md). For
        example:

        - `my_column:my_field_name.my_nested_field_name`
        - `my_column['my_field_name']['my_nested_field_name']`

        You may also use a colon-delimited path to the field. For example:

        - `my_column:my_field_name:my_nested_field_name`

        When you specify a VARIANT field, the configuration applies to all nested fields
        under that field.

        For example, if you specify `ON EQUALITY(src:a.b)`:

        - This configuration can improve queries `on src:a.b` and on any nested fields
          (for example, `src:a.b.c`, `src:a.b.c.d`, etc.).
        - This configuration only affects queries that use the `src:a.b` prefix (for
          example, `src:a`, `src:z`, etc.).

    To specify all applicable columns in the table as targets, use an asterisk (`*`).

    Note that you can’t specify both an asterisk and specific column names for a
    given search method. However, you can specify an asterisk in different search methods.

    For example, you can specify the following expressions:

    ```sqlexample
    ON SUBSTRING(*)
    ON EQUALITY(*), SUBSTRING(*), GEO(*)
    ```

    You can’t specify the following expressions:

    ```sqlexample
    ON EQUALITY(*, c1)
    ON EQUALITY(c1, *)
    ON EQUALITY(v1:path, *)
    ON EQUALITY(c1), EQUALITY(*)
    ```

    To specify more than one search method on a target, use a comma to separate each
    subsequent method and target:

    ```sqlexample
    ALTER DYNAMIC TABLE my_dynamic_table ADD SEARCH OPTIMIZATION ON EQUALITY(c1), EQUALITY(c2, c3);
    ```

    If you run the ALTER DYNAMIC TABLE … ADD SEARCH OPTIMIZATION ON … command multiple
    times on the same table, each subsequent command adds to the existing configuration
    for the table. For instance, suppose that you run the following commands:

    ```sqlexample
    ALTER DYNAMIC TABLE my_dynamic_table ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2);
    ALTER DYNAMIC TABLE my_dynamic_table ADD SEARCH OPTIMIZATION ON EQUALITY(c3, c4);
    ```

    This adds equality predicates for the columns `c1`, `c2`, `c3`, and `c4` to
    the configuration for the table. This is equivalent to running the command:

    ```sqlexample
    ALTER DYNAMIC TABLE my_dynamic_table ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2, c3, c4);
    ```

    For examples, see [Enabling search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

`DROP SEARCH OPTIMIZATION`
:   Removes [search optimization](../../user-guide/search-optimization-service.md) for the
    entire dynamic table or, if you specify the optional `ON` clause, from specific
    columns.

    The following restrictions apply:

    * If a dynamic table has the search optimization property, then dropping the dynamic
      table and undropping it preserves the search optimization property.
    * Removing the search optimization property from a dynamic table and then adding it
      back incurs the same cost as adding it the first time.

`ON search_method_with_target | column_name | expression_id [, ... ]`
:   Specifies that you want to drop the search optimization configuration for specific
    columns or VARIANT fields (rather than dropping search optimization for the entire
    dynamic table).

    To identify the column configuration to drop, specify one of the following:

    * For `search_method_with_target`, specify a method for optimizing queries for
      one or more specific targets, which can be columns or VARIANT fields. Use the
      syntax described earlier.
    * For `column_name`, specify the name of the column configured for search
      optimization. Specifying the column name drops all expressions for that column,
      including expressions that use VARIANT fields in the column.
    * For `expression_id`, specify the ID for an expression listed in the output
      of the [DESCRIBE SEARCH OPTIMIZATION](../../user-guide/search-optimization/enabling.md)
      command.

    You can specify any combination of search methods with targets, column names, and
    expression IDs using a comma between items.

    For examples, see [Dropping search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or OPERATE | The dynamic table you want to alter. | Some actions are only supported with the OWNERSHIP privilege. For more information, see [Privileges to alter a dynamic table](../../user-guide/dynamic-tables-privileges.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To alter a dynamic table, you must be using a role that has OPERATE privilege on that
  dynamic table. For general information, see [Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md).
* Making changes to masking policies on a base table causes a [reinitialization](../../user-guide/dynamic-tables-refresh.md).
* If you want to update an existing dynamic table and need to see its current definition,
  call the [GET_DDL](../functions/get_ddl.md) function.
* You can use data metric functions with dynamic tables by executing an [ALTER TABLE](alter-table.md)
  command. For more information, see [Use SQL to set up data metric functions](../../user-guide/data-quality-working.md).
* You cannot use [IDENTIFIER()](../identifier-literal.md) to specify the
  name of the dynamic table to alter. For example, the following statement isn’t supported:

  ```sqlexample
  ALTER DYNAMIC TABLE IDENTIFIER(my_dynamic_table) SUSPEND;
  ```
* After a reinitialization or full refresh, search indexes on dynamic tables are rebuilt.
  This process involves dropping the existing indexes and rebuilding them from scratch,
  which might incur higher costs. For more information, see [Search optimization cost estimation and management](../../user-guide/search-optimization/cost-estimation.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Change the target lag time of a dynamic table named `my_dynamic_table` to 1 hour:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET
  TARGET_LAG = '1 hour';
```

Specify downstream target lag for `my_dynamic_table`:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET TARGET_LAG = DOWNSTREAM;
```

Suspend a dynamic table:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SUSPEND;
```

Resume a dynamic table:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table RESUME;
```

Rename `my_dynamic_table`:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table RENAME TO my_updated_dynamic_table;
```

Swap `my_dynamic_table` with `my_new_dynamic_table`:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SWAP WITH my_new_dynamic_table;
```

Change the clustering key for a dynamic table:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table CLUSTER BY (date);
```

Remove clustering from a dynamic table:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table DROP CLUSTERING KEY;
```

Perform a manual refresh of `my_dynamic_table` using the user, secondary roles, and warehouse settings
from the current session. This ensures that the refresh operation runs with the exact context
of the user session.

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table REFRESH COPY SESSION
```

To modify or remove an existing constraint, you can replace a predicate:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table SET IMMUTABLE WHERE ( <new_expr> );
```

Alternatively, remove an immutability predicate:

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table UNSET IMMUTABLE WHERE;
```

---
title: ALTER EXPERIMENT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-experiment.md
section: SQL Commands
---

# ALTER EXPERIMENT

Modifies the properties of an existing [experiment](../../developer-guide/snowflake-ml/experiments.md).

See also:
:   [CREATE EXPERIMENT](create-experiment.md) , [SHOW EXPERIMENTS](show-experiments.md), [DROP EXPERIMENT](drop-experiment.md) , [SHOW RUNS IN EXPERIMENT](show-runs-in-experiment.md) , [SHOW RUN … IN EXPERIMENT](show-run-in-experiment.md)

## Syntax

```sqlsyntax
ALTER EXPERIMENT <experiment_name> ADD RUN <run_name>

ALTER EXPERIMENT <experiment_name> COMMIT RUN <run_name>

ALTER EXPERIMENT <experiment_name> DROP RUN <run_name>
```

## Parameters

`experiment_name`
:   Specifies the identifier for the experiment to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ADD RUN run_name`
:   Adds a new run with the identifier `run_name`; must be unique for the runs in experiment `experiment_name`.

    For information on how to manually conduct an experiment run in SQL, see [Start an experiment run](../../developer-guide/snowflake-ml/experiments.md).

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`COMMIT RUN run_name`
:   Completes the run with the identifier `run_name` for experiment `experiment_name`. Committed runs can’t be altered.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    For information on how to retrieve the results and artifacts of an experiment run, see [Complete a run](../../developer-guide/snowflake-ml/experiments.md).

`DROP RUN run_name`
:   Deletes the run with the identifier `run_name`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY | Experiment |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example creates a new run named `run_1` in the experiment `my_experiment`:

```sqlexample
ALTER EXPERIMENT my_experiment ADD RUN run_1;
```

The following example completes and records the run named `run_1` in the experiment `my_experiment`:

```sqlexample
ALTER EXPERIMENT my_experiment COMMIT RUN run_1;
```

---
title: ALTER EXTERNAL ACCESS INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-external-access-integration.md
section: SQL Commands
---

# ALTER EXTERNAL ACCESS INTEGRATION

Modifies the properties of an existing [external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

See also:
:   [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md), [DROP INTEGRATION](drop-integration.md),
    [SHOW INTEGRATIONS](show-integrations.md), [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER EXTERNAL ACCESS INTEGRATION [ IF EXISTS ] <name> SET
  [ ALLOWED_NETWORK_RULES = (<rule_name> [ , <rule_name> ... ]) ]
  [ ALLOWED_API_AUTHENTICATION_INTEGRATIONS = { ( <integration_name_1> [, <integration_name_2>, ... ] ) | none } ]
  [ ALLOWED_AUTHENTICATION_SECRETS = { ( <secret_name> [ , <secret_name> ... ] ) | all | none } ]
  [ ENABLED = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ] ]

ALTER EXTERNAL ACCESS INTEGRATION [ IF EXISTS ] <name> UNSET {
  ALLOWED_NETWORK_RULES |
  ALLOWED_API_AUTHENTICATION_INTEGRATIONS |
  ALLOWED_AUTHENTICATION_SECRETS |
  COMMENT |
  TAG <tag_name> }
  [ , ... ]
```

## Parameters

`name`
:   Identifier for the external access integration to alter. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies the properties to set for the integration:

    `ALLOWED_NETWORK_RULES = (rule_name [ , rule_name ... ])`
    :   Specifies the allowed network rules. Only egress rules may be specified.

        For reference information about network rules, refer to [CREATE NETWORK RULE](create-network-rule.md).

    `ALLOWED_API_AUTHENTICATION_INTEGRATIONS = ( integration_name_1 [, integration_name_2, ... ] ) | none`
    :   Specifies the security integrations whose OAuth authorization server issued the secret used by the UDF or procedure. The security
        integration must be the type used for external API integration.

        For reference information about security integrations, refer to [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md).

    `ALLOWED_AUTHENTICATION_SECRETS = ( secret_name [ , secret_name ... ] ) | all | none`
    :   Specifies the secrets that a UDF or procedure can use when referring to this integration.

        For reference information about secrets, refer to [CREATE SECRET](create-secret.md).

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether this integration is enabled or disabled. If the integration is disabled, any handler code that relies
        on it will be unable to reach the external endpoint.

        The value is case-insensitive.

        The default is `TRUE`.

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the external access integration.

        Default: No value

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the property to unset for the integration, which resets it to the default:

    * `ALLOWED_NETWORK_RULES`
    * `ALLOWED_API_AUTHENTICATION_INTEGRATIONS`
    * `ALLOWED_AUTHENTICATION_SECRETS`
    * `COMMENT`
    * `TAG tag_name`

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by a
    comma. When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Set the allowed secrets to the `my_new_secret` secret:

```sqlexample
ALTER EXTERNAL ACCESS INTEGRATION IF EXISTS dev_integration
  SET ALLOWED_AUTHENTICATION_SECRETS = (my_new_secret);
```

Disable the integration `dev_integration_disabled`:

```sqlexample
ALTER EXTERNAL ACCESS INTEGRATION IF EXISTS dev_integration_disabled
  SET ENABLED = FALSE;

ALTER EXTERNAL ACCESS INTEGRATION IF EXISTS dev_integration_disabled
  SET COMMENT = 'Disabled until the end of the Q1.';
```

---
title: ALTER EXTERNAL TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-external-table.md
section: SQL Commands
---

# ALTER EXTERNAL TABLE

Modifies the properties, columns, or constraints for an existing external table.

See also:
:   [CREATE EXTERNAL TABLE](create-external-table.md) , [DROP EXTERNAL TABLE](drop-external-table.md) , [SHOW EXTERNAL TABLES](show-external-tables.md) , [DESCRIBE EXTERNAL TABLE](desc-external-table.md)

## Syntax

```sqlsyntax
ALTER EXTERNAL TABLE [ IF EXISTS ] <name> REFRESH [ '<relative-path>' ]

ALTER EXTERNAL TABLE [ IF EXISTS ] <name> ADD FILES ( '<path>/[<filename>]' [ , '<path>/[<filename>'] ] )

ALTER EXTERNAL TABLE [ IF EXISTS ] <name> REMOVE FILES ( '<path>/[<filename>]' [ , '<path>/[<filename>]' ] )

ALTER EXTERNAL TABLE [ IF EXISTS ] <name> SET
  [ AUTO_REFRESH = { TRUE | FALSE } ]
```

**Partitions added and removed manually**

> ```sqlsyntax
> ALTER EXTERNAL TABLE <name> [ IF EXISTS ] ADD PARTITION ( <part_col_name> = '<string>' [ , <part_col_name> = '<string>' ] ) LOCATION '<path>'
>
> ALTER EXTERNAL TABLE <name> [ IF EXISTS ] DROP PARTITION LOCATION '<path>'
> ```

## Parameters

`name`
:   Identifier for the external table to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case sensitive.

`REFRESH [ 'relative-path' ]`
:   Accesses the staged data files referenced in the external table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

    Using this parameter only needs to be done once, when the external table is created. This step synchronizes the metadata with the latest set
    of associated files in the stage and path in the external table definition. Also, this step ensures the external table can read the data files
    in the specified stage and path, and that no files were missed in the external table definition.

    > **Note:**
    >
    > * This parameter isn’t supported by partitioned external tables when partitions are added manually by the object owner; that is,
    >   when `PARTITION_TYPE = USER_SPECIFIED`.
    > * If `TABLE_FORMAT = DELTA` is set on the external table, `REFRESH` doesn’t support a relative path to refresh the
    >   metadata for a specific subset of the data files.

`ADD FILES`
:   Registers the specified comma-separated list of files with the external table metadata, and refreshes the table.
    For each file, list the path and filename relative to [ WITH ] LOCATION in the external table definition.
    For information, see [CREATE EXTERNAL TABLE](create-external-table.md).

    This parameter is not supported by partitioned external tables when partitions are added manually by the object owner; that is,
    when `PARTITION_TYPE = USER_SPECIFIED`.

`REMOVE FILES`
:   Deregisters the specified comma-separated list of files from the external table metadata, and refreshes the table.
    For each file, list the path and filename relative to [ WITH ] LOCATION in the external table definition.
    For more information, see [CREATE EXTERNAL TABLE](create-external-table.md).

    This parameter is not supported by partitioned external tables when partitions are added manually by the object owner; that is,
    when `PARTITION_TYPE = USER_SPECIFIED`.

`SET ...`
:   Specifies one or more properties/parameters to set for the external table that is separated by blank spaces, commas, or new lines:

    `AUTO_REFRESH = TRUE | FALSE`
    :   Specifies whether Snowflake should enable triggering automatic refreshes of the external table metadata when new or updated data files
        are available in the named external stage specified in the `[ WITH ] LOCATION =` setting.

        > **Note:**
        >
        > * You must configure an event notification for your storage location to notify Snowflake when new or updated data is available
        >   to read into the external table metadata. For more information, see the instructions for your cloud storage service:
        >
        >   + Amazon S3:
        >     :   [Refresh external tables automatically for Amazon S3](../../user-guide/tables-external-s3.md)
        >   + Google Cloud Storage:
        >     :   [Refresh external tables automatically for Google Cloud Storage](../../user-guide/tables-external-gcs.md)
        >   + Microsoft Azure:
        >     :   [Refresh external tables automatically for Azure Blob Storage](../../user-guide/tables-external-azure.md)
        > * This parameter isn’t supported by partitioned external tables when partitions are added manually by the object owner;
        >   that is, when `PARTITION_TYPE = USER_SPECIFIED`.
        > * Setting this parameter to TRUE isn’t supported for external tables that reference data files stored on an [S3-compatible external stage](../../user-guide/data-load-s3-compatible-storage.md).

        `TRUE`
        :   Snowflake enables the triggering of automatic refreshes of the external table metadata.

        `FALSE`
        :   Snowflake doesn’t enable the triggering of automatic refreshes of the external table metadata. You must manually refresh the external table metadata
            periodically by using ALTER EXTERNAL TABLE … REFRESH to synchronize the metadata with the current list of files in the stage path.

        Default: `TRUE`

### Partitions added and removed manually

Use the following parameters to add or remove partitions when the partition type for the external table is user-specified; that is,
`PARTITION_TYPE = USER_SPECIFIED`:

`ADD PARTITION ( <part_col_name> = '<string>' [ , <part_col_name> = '<string>' , ... ] ) LOCATION '<path>'`
:   Manually add a partition for one or more partition columns defined for the external table in a specified location; that is, path.

    > **Note:**
    >
    > The maximum length of user-specified partition column names is 32 characters.

    Adding a partition also adds any new or updated files in the location to the external table metadata.

`DROP PARTITION LOCATION '<path>'`
:   Manually drop all partitions in a specified location; that is, path.

    Dropping a partition also removes any files in the location from the external table metadata.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | External table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Stage | Required to manually refresh the external table metadata. |
| USAGE | File format | Required to manually refresh the external table metadata. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only the external table owner — the role with the OWNERSHIP privilege on the external table — or higher can run this command.
* The following commands can be used in explicit transactions (using [BEGIN](begin.md) … [COMMIT](commit.md)):

  + `ALTER EXTERNAL TABLE ... REFRESH`
  + `ALTER EXTERNAL TABLE ... ADD FILES`
  + `ALTER EXTERNAL TABLE ... REMOVE FILES`

  Explicit transactions could be used to ensure a consistent state when manually replacing updated files in external table metadata.
* Add or remove columns in an external table by using the following syntax:

  Add column:
  :   ```sqlsyntax
      ALTER TABLE <name> ADD COLUMN ( <col_name> <col_type> AS <expr> ) [, ...]
      ```

  Rename column:
  :   ```sqlsyntax
      ALTER TABLE <name> RENAME COLUMN <col_name> to <new_col_name>
      ```

  Drop column:
  :   ```sqlsyntax
      ALTER TABLE <name> DROP COLUMN <col_name>
      ```

      > **Note:**
      >
      > The default VALUE and METADATA$FILENAME columns cannot be dropped.

  For examples, see the [ALTER TABLE](alter-table.md) topic.
* To add and drop a row access policy on an external table, or to set or unset a tag, use the [ALTER TABLE](alter-table.md) command.

  However, you can create an external table with a row access policy and a tag on the table. For more information, see [CREATE EXTERNAL TABLE](create-external-table.md).
* You can use data metric functions with external tables by running an [ALTER TABLE](alter-table.md) command. For more information, see
  [Use SQL to set up data metric functions](../../user-guide/data-quality-working.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Refresh metadata manually

Manually refresh the entire set of external table metadata that is based on changes in the referenced data files:

```sqlexample
ALTER EXTERNAL TABLE exttable_json REFRESH;
```

Similar to the first example, but manually refresh only a path of the metadata for an external table:

```sqlexample
CREATE OR REPLACE STAGE mystage
  URL='<cloud_platform>://twitter_feed/logs/'
  .. ;

-- Create the external table
-- 'daily' path includes paths in </YYYY/MM/DD/> format
CREATE OR REPLACE EXTERNAL TABLE daily_tweets
  WITH LOCATION = @twitter_feed/daily/;

-- Refresh the metadata for a single day of data files by date
ALTER EXTERNAL TABLE exttable_part REFRESH '2018/08/05/';
```

### Add or remove files manually

Add an explicit list of files to the external table metadata:

```sqlexample
ALTER EXTERNAL TABLE exttable1 ADD FILES ('path1/sales4.json.gz', 'path1/sales5.json.gz');
```

Remove an explicit list of files from the external table metadata:

```sqlexample
ALTER EXTERNAL TABLE exttable1 REMOVE FILES ('path1/sales4.json.gz', 'path1/sales5.json.gz');
```

Replace an updated log file for December 2019 in the external table metadata in an explicit transaction:

```sqlexample
BEGIN;

ALTER EXTERNAL TABLE extable1 REMOVE FILES ('2019/12/log1.json.gz');

ALTER EXTERNAL TABLE extable1 ADD FILES ('2019/12/log1.json.gz');

COMMIT;
```

### Add or remove partitions manually

Manually add partitions in a specified location for the partition columns:

```sqlexample
ALTER EXTERNAL TABLE et2 ADD PARTITION(col1='2022-01-24', col2='a', col3='12') LOCATION '2022/01';
```

Snowflake adds the partitions to the metadata for the external table. The operation also adds any new data files in the specified
location to the metadata.

Manually remove partitions from a specified location:

```sqlexample
ALTER EXTERNAL TABLE et2 DROP PARTITION LOCATION '2022/01';
```

Snowflake removes the partitions from the metadata for the external table. The operation also removes any data files in the specified
location from the metadata.

---
title: ALTER EXTERNAL VOLUME
source: https://docs.snowflake.com/en/sql-reference/sql/alter-external-volume.md
section: SQL Commands
---

# ALTER EXTERNAL VOLUME

Modifies the properties for an existing [external volume](../../user-guide/tables-iceberg.md).

See also:
:   [CREATE EXTERNAL VOLUME](create-external-volume.md) , [DROP EXTERNAL VOLUME](drop-external-volume.md) , [SHOW EXTERNAL VOLUMES](show-external-volumes.md) , [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md)

## Syntax

```sqlsyntax
ALTER EXTERNAL VOLUME [ IF EXISTS ] <name> ADD STORAGE_LOCATION =
  (
    NAME = '<storage_location_name>'
    cloudProviderParams
  )

ALTER EXTERNAL VOLUME [ IF EXISTS ] <name> REMOVE STORAGE_LOCATION '<storage_location_name>'

ALTER EXTERNAL VOLUME [ IF EXISTS ] <name> UPDATE
  STORAGE_LOCATION = '<s3_compatible_storage_location_name>'
  CREDENTIALS = (
    AWS_KEY_ID = '<string>'
    AWS_SECRET_KEY = '<string>'
  )

ALTER EXTERNAL VOLUME [ IF EXISTS ] <name> SET ALLOW_WRITES = { TRUE | FALSE }

ALTER EXTERNAL VOLUME [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'
```

Where:

> ```sqlsyntax
> cloudProviderParams (for Amazon S3) ::=
>   STORAGE_PROVIDER = '{ S3 | S3GOV }'
>   STORAGE_AWS_ROLE_ARN = '<iam_role>'
>   STORAGE_BASE_URL = '<protocol>://<bucket>[/<path>/]'
>   [ STORAGE_AWS_ACCESS_POINT_ARN = '<string>' ]
>   [ ENCRYPTION = ( [ TYPE = 'AWS_SSE_S3' ] |
>               [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] |
>               [ TYPE = 'NONE' ] ) ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Google Cloud Storage) ::=
>   STORAGE_PROVIDER = 'GCS'
>   STORAGE_BASE_URL = 'gcs://<bucket>[/<path>/]'
>   [ ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = '<string>' ] |
>               [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Microsoft Azure) ::=
>   STORAGE_PROVIDER = 'AZURE'
>   AZURE_TENANT_ID = '<tenant_id>'
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
>   STORAGE_BASE_URL = 'azure://<account>.blob.core.windows.net/<container>[/<path>/]'
> ```

## Parameters

`name`
:   Specifies the identifier for the external volume to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ADD STORAGE_LOCATION = ( NAME = 'storage_location_name' cloudProviderParams )`
:   Adds a named storage location to the external volume definition.
    To add multiple storage locations, execute an ALTER EXTERNAL VOLUME
    statement for each storage location.

    > **Note:**
    >
    > Apache Iceberg™ tables write to and read from the first storage location in the set that is located
    > in the same region as your Snowflake account. To view the external volume definition and storage location regions,
    > execute [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md).

`REMOVE STORAGE_LOCATION 'storage_location_name'`
:   Removes the specified storage location from the external volume definition. To remove multiple storage locations,
    execute an ALTER EXTERNAL VOLUME statement for each storage location.

    > **Note:**
    >
    > The ALTER EXTERNAL VOLUME statement fails if you attempt to remove the active storage location used by Iceberg tables in your account.

`UPDATE STORAGE_LOCATION = 's3_compatible_storage_location_name'`
:   Updates the specified S3-compatible storage location from the external volume definition.

`CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' )`
:   Specifies updated security credentials for connecting to and accessing an S3-compatible storage location.

`SET ...`
:   Specifies one or more properties/parameters to set for the external volume (separated by blank spaces, commas, or new lines):

    `ALLOW_WRITES = { TRUE | FALSE }`
    :   Specifies whether write operations are allowed for the external volume.

        * `TRUE` specifies that write operations are allowed. This parameter must be set to `TRUE` for the following tables:

          + Iceberg tables that use Snowflake as the catalog.
          + Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them
            through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE.
        * `FALSE` specifies that write operations aren’t allowed. You can’t change the value of this parameter to `FALSE` if
          there are Snowflake-managed Iceberg tables associated with the external volume.

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the external volume.

## Cloud provider parameters (`cloudProviderParams`)

**Amazon S3**

> `STORAGE_PROVIDER = 'S3'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `STORAGE_AWS_ROLE_ARN = iam_role`
> :   Specifies the Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket
>     containing your data files. For more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
>
> `STORAGE_BASE_URL = 'protocol://bucket[/path/]'`
> :   Specifies the base URL for your cloud storage location, where:
>
>     * `protocol` is one of the following:
>
>       + `s3` refers to S3 storage in public AWS regions outside of China.
>       + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
>     * `bucket` is the name of an S3 bucket that stores your data files or the [bucket-style alias](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-alias.html)
>       for an S3 bucket access point. For an S3 access point, you must also specify a value for the
>       `STORAGE_AWS_ACCESS_POINT_ARN` parameter.
>     * `path` is an optional path that can be used to provide granular control over objects in the bucket.
>
> `STORAGE_AWS_ACCESS_POINT_ARN = 'string'`
> :   Specifies the Amazon resource name (ARN) for your S3 access point. Required only when you specify an S3 access point alias
>     for your storage `STORAGE_BASE_URL`.
>
> `ENCRYPTION = ( [ TYPE = 'AWS_SSE_S3' ] | [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = 'string' ] ] | [ TYPE = 'NONE' ] )`
> :   Specifies the properties needed to encrypt data on the external volume.
>
>     `TYPE = ...`
>     :   Specifies the encryption type used. Possible values are:
>
>         * `AWS_SSE_S3` : Server-side encryption using S3-managed encryption keys. For more information, see [Using server-side encryption with Amazon S3-managed encryption keys (SSE-S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingServerSideEncryption.html).
>         * `AWS_SSE_KMS` : Server-side encryption using keys stored in KMS. For more information, see [Using server-side encryption with AWS Key Management Service (SSE-KMS)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingKMSEncryption.html).
>         * `NONE`: No encryption.
>
>     `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
>     :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files written to the bucket. If no value is provided, your default KMS key is used to encrypt files for writing data.
>
>         Note that this value is ignored when reading data.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external volumes for Amazon Web Services](../../user-guide/tables-iceberg-configure-external-volume-s3-private.md).

**Google Cloud Storage**

> `STORAGE_PROVIDER = 'GCS'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `STORAGE_BASE_URL = 'gcs://bucket[/path/]'`
> :   Specifies the base URL for your cloud storage location, where:
>
>     * `bucket` is the name of a Cloud Storage bucket that stores your data files.
>     * `path` is an optional path that can be used to provide granular control over objects in the bucket.
>
> `ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = 'string' ] | [ TYPE = 'NONE' ] )`
> :   Specifies the properties needed to encrypt data on the external volume.
>
>     `TYPE = ...`
>     :   Specifies the encryption type used. Possible values are:
>
>         * `GCS_SSE_KMS`: Server-side encryption using keys stored in KMS. For more information, see [customer-managed encryption keys](https://cloud.google.com/storage/docs/encryption/customer-managed-keys).
>         * `NONE`: No encryption.
>
>     `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
>     :   Specifies the ID for the Cloud KMS-managed key that is used to encrypt files written to the bucket.
>
>         This value is ignored when reading data. The read operation should succeed if the service account has sufficient permissions to the data and any specified KMS keys.

**Microsoft Azure**

> `STORAGE_PROVIDER = 'AZURE'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `AZURE_TENANT_ID = 'tenant_id'`
> :   Specifies the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to.
>     An external volume can authenticate to only one tenant,
>     so the allowed and blocked storage locations must refer to storage accounts that all belong to this tenant.
>
>     To find your tenant ID, log into the Azure portal and click Azure Active Directory » Properties. The tenant ID is
>     displayed in the Tenant ID field.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external volumes for Microsoft Azure](../../user-guide/tables-iceberg-configure-external-volume-azure-private.md).
>
> `STORAGE_BASE_URL = 'azure://account.blob.core.windows.net/container[/path/]'`
> :   Specifies the base URL for your cloud storage location, where:
>
>     * `account` is the name of your Azure account; for example, `myaccount`.
>     * `container` is the name of an Azure container that stores your data files.
>     * `path` is an optional path that can be used to provide granular control over logical directories in the container.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | External volume | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For S3 external volumes that use an S3 access point:

  + You must configure the IAM policy for the external volume
    to grant permission to your S3 access point. For more information,
    see [Step 1: Create an IAM policy that grants access to your S3 location](../../user-guide/tables-iceberg-configure-external-volume-s3.md).
  + Multi-region access points aren’t supported.

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example removes the storage location named `my-us-east-1` from the `exvol1` external volume:

```sqlexample
ALTER EXTERNAL VOLUME exvol1 REMOVE STORAGE_LOCATION 'my-us-east-1';
```

The following examples add a storage location to an external volume:

**Amazon S3**

```sqlexample
ALTER EXTERNAL VOLUME exvol1
  ADD STORAGE_LOCATION =
    (
      NAME = 'my-s3-us-central-2'
      STORAGE_PROVIDER = 'S3'
      STORAGE_BASE_URL = 's3://my_bucket_us_central-1/'
      STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myrole'
    );
```

**Google Cloud Storage**

```sqlexample
ALTER EXTERNAL VOLUME exvol2
  ADD STORAGE_LOCATION =
    (
      NAME = 'my-gcs-europe-west4'
      STORAGE_PROVIDER = 'GCS'
      STORAGE_BASE_URL = 'gcs://my_bucket_europe-west4/'
    );
```

**Microsoft Azure**

```sqlexample
ALTER EXTERNAL VOLUME exvol3
  ADD STORAGE_LOCATION =
    (
      NAME = 'my-azure-japaneast'
      STORAGE_PROVIDER = 'AZURE'
      STORAGE_BASE_URL = 'azure://sfcdev1.blob.core.windows.net/my_container_japaneast/'
      AZURE_TENANT_ID = 'a9876545-4321-987b-b23c-2kz436789d0'
    );
```

**S3-compatible storage**

Update the credentials for an S3-compatible external volume:

```sqlexample
ALTER EXTERNAL VOLUME ext_vol_s3_compat UPDATE
  STORAGE_LOCATION = 'my_s3_compat_storage_location'
  CREDENTIALS = (
    AWS_KEY_ID = '4d5e6f...'
    AWS_SECRET_KEY = '7g8h9i...'
  );
```

---
title: ALTER FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/alter-failover-group.md
section: SQL Commands
---

# ALTER FAILOVER GROUP

Modifies the properties for an existing [failover group](../../user-guide/account-replication-intro.md).

From the source account, you can perform the following actions:

* Rename the failover group.
* Reset the list of specified object types enabled for replication and failover.
* Set or update the replication schedule for automatic refresh of secondary failover groups.
* Add or remove account objects of the following types to or from a failover group:

  + Databases
  + External volumes
  + Shares
  + Security integrations
  + API integrations
  + Storage integrations
  + External access integrations
  + Certain types of notification integrations (see [Integration replication](../../user-guide/account-replication-intro.md))
* Add or remove target accounts enabled for replication and failover.
* Move shares or databases to another failover group.

From the target account, you can perform the following actions:

* Refresh objects in the target account from the source account.
* Promote a secondary failover group to primary (that is, fail over the failover group of objects).
* Suspend scheduled replication.
* Resume scheduled replication.

See also:
:   [CREATE FAILOVER GROUP](create-failover-group.md) , [DROP FAILOVER GROUP](drop-failover-group.md) , [SHOW FAILOVER GROUPS](show-failover-groups.md),
    [SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH](../functions/system_schedule_async_replication_group_refresh.md)

## Syntax

**Source Account**

```sqlsyntax
ALTER FAILOVER GROUP [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  [ OBJECT_TYPES = <object_type> [ , <object_type> , ... ] ]
  [ ALLOWED_DATABASES = <db_name> [ , <db_name> , ... ] ]
  [ ALLOWED_EXTERNAL_VOLUMES = <external_volume_name> [ , <external_volume_name> , ... ] ]
  [ ALLOWED_SHARES = <share_name> [ , <share_name> , ... ] ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  OBJECT_TYPES = INTEGRATIONS [ , <object_type> , ... ]
  ALLOWED_INTEGRATION_TYPES = <integration_type_name> [ , <integration_type_name> ... ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  COMMENT = '<string_literal>'

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  REPLICATION_SCHEDULE = '{ <num> MINUTE | USING CRON <expr> <time_zone> }'

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  ERROR_INTEGRATION = <integration_name>

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SET
  TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name> UNSET
  { COMMENT | REPLICATION_SCHEDULE | ERROR_INTEGRATION } [ , ... ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name> UNSET
  TAG <tag_name> [ , <tag_name> ... ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  ADD <db_name> [ , <db_name> ,  ... ] TO ALLOWED_DATABASES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  MOVE DATABASES <db_name> [ , <db_name> ,  ... ] TO FAILOVER GROUP <move_to_fg_name>

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  REMOVE <db_name> [ , <db_name> ,  ... ] FROM ALLOWED_DATABASES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  ADD <external_volume_name> [ , <external_volume_name> ,  ... ] TO ALLOWED_EXTERNAL_VOLUMES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  MOVE EXTERNAL VOLUMES <external_volume_name> [ , <external_volume_name> ,  ... ] TO FAILOVER GROUP <move_to_fg_name>

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  REMOVE <external_volume_name> [ , <external_volume_name> ,  ... ] FROM ALLOWED_EXTERNAL_VOLUMES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  ADD <share_name> [ , <share_name> ,  ... ] TO ALLOWED_SHARES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  MOVE SHARES <share_name> [ , <share_name> ,  ... ] TO FAILOVER GROUP <move_to_fg_name>

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  REMOVE <share_name> [ , <share_name> ,  ... ] FROM ALLOWED_SHARES

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  ADD <org_name>.<target_account_name> [ , <org_name>.<target_account_name> ,  ... ] TO ALLOWED_ACCOUNTS
  [ IGNORE EDITION CHECK ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name>
  REMOVE <org_name>.<target_account_name> [ , <org_name>.<target_account_name> ,  ... ] FROM ALLOWED_ACCOUNTS
```

**Target Account**

```sqlsyntax
ALTER FAILOVER GROUP [ IF EXISTS ] <name> REFRESH

ALTER FAILOVER GROUP [ IF EXISTS ] <name> PRIMARY

ALTER FAILOVER GROUP [ IF EXISTS ] <name> SUSPEND [ IMMEDIATE ]

ALTER FAILOVER GROUP [ IF EXISTS ] <name> RESUME
```

## Parameters

**Source Account**

`name`
:   Specifies the identifier for the failover group.

`RENAME TO new_name`
:   `new_name`
    :   Specifies the new identifier for the failover group. The new identifier cannot be used if the identifier is already in place for a
        different replication or failover group.

        For more details, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies properties to set for the failover group (separated by blank spaces, commas, or new lines).

    `OBJECT_TYPES = object_type [ , object_type , ... ]`
    :   Reset the list of object types for which you are enabling replication and failover from the source account to target
        account(s).

        > **Note:**
        >
        > For database, external volume, and share objects:
        >
        > * If DATABASES, EXTERNAL VOLUMES, or SHARES are included in the OBJECT_TYPES list, and remain in the OBJECT_TYPES list after
        >   the list is reset, the respective allowed objects list (ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES) remains
        >   unchanged.
        > * If the OBJECT_TYPES list is reset to add or remove DATABASES, the ALLOWED_DATABASES list is set to NULL.
        > * If the OBJECT_TYPES list is reset to add or remove EXTERNAL VOLUMES, the ALLOWED_EXTERNAL_VOLUMES list is set to NULL.
        > * If the OBJECT_TYPES list is reset to add or remove SHARES, the ALLOWED_SHARES list is set to NULL.
        > * Use the ADD, MOVE, and REMOVE clauses to modify the list of allowed database, external volume, or share objects.

        The following object types are supported:

        > ACCOUNT PARAMETERS:
        > :   All account-level parameters. This includes [account parameters](../parameters.md) and parameters that can be
        >     [set for your account](../../user-guide/admin-account-management.md).
        >
        > DATABASES:
        > :   Add database objects to the list of object types. If database objects were already included in the list of specified object
        >     types, the `ALLOWED_DATABASES` list remains unchanged. To modify the list of databases, use the
        >     ADD, MOVE, or REMOVE clauses.
        >
        > EXTERNAL VOLUMES:
        > :   Add external volume objects to the list of object types. If external volume objects are included in the list of specified object types,
        >     the `ALLOWED_EXTERNAL_VOLUMES` parameter must be set. To modify the list of external volumes, use the ADD, MOVE, or REMOVE clauses.
        >
        > INTEGRATIONS:
        > :   Currently, only security, API, storage, external access, and certain types of notification integrations are supported.
        >     For details, see [Integration replication](../../user-guide/account-replication-intro.md).
        >
        >     If integration objects are included in the list of specified object types, the
        >     `ALLOWED_INTEGRATION_TYPES` parameter must be set.
        >
        > LISTINGS:
        > :   Add listings to the list of object types. When adding listings to a failover group, adding shares is optional. Snowflake automatically selects all of the eligible listings and their shares for replication and failover.
        >
        > NETWORK POLICIES:
        > :   All network policies in the source account.
        >
        > PROFILES:
        > :   All profiles in the source account. Review [profile replication constraints](../../collaboration/listings-bcdr.md) for information about current constraints.
        >
        > RESOURCE MONITORS:
        > :   All resource monitors in the source account.
        >
        > ROLES:
        > :   All roles in the source account. Replicating roles implicitly includes all grants for object types included in the failover group.
        >     For example, if `ROLES` is the only object type that is replicated, then only hierarchies of roles (that is, roles granted to
        >     other roles) are replicated to target accounts. If the `USERS` object type is also included, then role grants to users are
        >     also replicated.
        >
        > SHARES:
        > :   Add share objects to the list of object types. If share objects were already included in the list of specified object types, the
        >     `ALLOWED_SHARES` list remains unchanged. To modify the list of shares, use the ADD, MOVE, or REMOVE clauses.
        >
        > USERS:
        > :   All users in the source account.
        >
        > WAREHOUSES:
        > :   All warehouses in the source account.

        > **Note:**
        >
        > If you replicate users and roles, programmatic access tokens for users are replicated automatically.

    `ALLOWED_DATABASES = db_name [ , db_name , ... ]`
    :   Specifies the database or list of databases for which you are enabling replication and failover from the source account to the target
        account. In order for you to set this parameter, the `OBJECT_TYPES` list must include `DATABASES`.

        `db_name`
        :   Specifies the identifier for the database.

    `ALLOWED_EXTERNAL_VOLUMES = external_volume_name [ , external_volume_name , ... ]`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the external volume or list of external volumes for which you are enabling replication and failover from the source account
        to the target account. For you to set this parameter, the `OBJECT_TYPES` list must include `EXTERNAL VOLUMES`.

        `external_volume_name`
        :   Specifies the identifier for the external volume.

    `ALLOWED_SHARES = share_name [ , share_name , ... ]`
    :   Specifies the share or list of shares for which you are enabling replication and failover from the source account to the target account.
        For you to set this parameter, the `OBJECT_TYPES` list must include `SHARES`.

        `share_name`
        :   Specifies the identifier for the share.

    > **Note:**
    >
    > If the ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES lists are modified, any objects that were previously in the list and removed
    > will be dropped in any target account with a linked secondary failover group when the next refresh operation occurs.

    `ALLOWED_INTEGRATION_TYPES = integration_type_name [ , integration_type_name , ... ]`
    :   Type(s) of integrations for which you are enabling replication and failover from the source account to the target account.

        > This property requires that the `OBJECT_TYPES` list include `INTEGRATIONS` to set this parameter.
        >
        > The following integration types are supported:
        >
        > > SECURITY INTEGRATIONS:
        > > :   Specifies security integrations.
        > >
        > >     This property requires that the `OBJECT_TYPES` list include `ROLES`.
        > >
        > > API INTEGRATIONS:
        > > :   Specifies API integrations.
        > >
        > >     API integration replication requires additional set up after the API integration is replicated to the target account.
        > >     For more information, see [Updating the remote service for API integrations](../../user-guide/account-replication-config.md).
        > >
        > > STORAGE INTEGRATIONS:
        > > :   Specifies storage integrations.
        > >
        > > EXTERNAL ACCESS INTEGRATIONS:
        > > :   Specifies [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md).
        > >
        > >     For more information, see [Replication of stored procedures and user-defined functions (UDFs)](../../user-guide/account-replication-considerations.md).
        > >
        > > NOTIFICATION INTEGRATIONS:
        > > :   Specifies notification integrations.
        > >
        > >     Only some types of notification integrations are replicated. For details, see
        > >     [Integration replication](../../user-guide/account-replication-intro.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the failover group.

        Default:
        :   `NULL`

    `REPLICATION_SCHEDULE ...`
    :   Specifies the schedule for refreshing secondary failover groups.

        * `USING CRON expr time_zone`
          :   Specifies a cron expression and time zone for the secondary group refresh. Supports a subset of standard cron utility syntax.

              For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
              (in Wikipedia).

              The cron expression consists of the following fields:

              ```output
              # __________ minute (0-59)
              # | ________ hour (0-23)
              # | | ______ day of month (1-31, or L)
              # | | | ____ month (1-12, JAN-DEC)
              # | | | | __ day of week (0-6, SUN-SAT, or L)
              # | | | | |
              # | | | | |
                * * * * *
              ```

              The following special characters are supported:

              `*`
              :   Wildcard. Specifies any occurrence of the field.

              `L`
              :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a
                  given month. In the day-of-month field, it specifies the last day of the month.

              `/n`
              :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
                  specified in the month field, then the refresh is scheduled for April, July and October (i.e. every 3 months, starting with the 4th
                  month of the year). The same schedule is maintained in subsequent years. That is, the refresh is not scheduled to run in
                  January (3 months after the October run).

              > **Note:**
              > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
              >   for the account (or setting the value at the user or session level) does not change the time zone for the refresh.
              > + The cron expression defines all valid run times for the refresh. Snowflake attempts to refresh secondary groups based on
              >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
              > + When both a specific day of month and day of week are included in the cron expression, then the refresh is scheduled on days
              >   satisfying either the day of month or day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
              >   schedules a refresh at 0AM on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
        * `num MINUTE`
          :   Specifies an interval (in minutes) of wait time between refreshes. Accepts positive integers only.

              Also supports `num M` syntax.

              To avoid ambiguity, a *base interval time* is set:

              + When the object is created (using CREATE <object>) or
              + When a different interval is set (using ALTER <object> … SET REPLICATION_SCHEDULE)

              The base interval time starts the interval counter from the current clock time. For example, if an INTERVAL value of `10` is set and
              the scheduled refresh is enabled at 9:03 AM, then the refresh runs at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to
              ensure absolute precision, but only guarantee that refreshes do not execute before their set interval occurs (e.g. in the
              current example, the refresh could first run at 9:14 AM, but will definitely not run at 9:12 AM).

              > **Note:**
              >
              > The maximum supported value is `11520` (8 days). If the replication schedule has a greater `num MINUTE` value, the
              > refresh operation never runs.

        Default:
        :   `NULL`

    `ERROR_INTEGRATION = integration_name`
    :   Specifies the name of the notification integration to use to email/push notifications when refresh errors occur for the failover
        group. For more details, see [Error notifications for replication and failover groups](../../user-guide/account-replication-error-notifications.md).

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ADD db_name [ , db_name ,  ... ] TO ALLOWED_DATABASES`
:   Specifies a comma-separated list of additional databases to enable for replication and failover. To add databases,
    DATABASES must be included in the list of specified object types. If the list of object types does not already include DATABASES, you must
    add it.

    > `db_name`
    > :   Specifies the identifier for the database.

`MOVE DATABASES db_name [ , db_name ,  ... ] TO FAILOVER GROUP move_to_fg_name`
:   Specifies a comma-separated list of databases to move from one failover group to another failover group. The failover group the databases
    are being moved to must include DATABASES in the list of specified object types.

    > `db_name`
    > :   Specifies the identifier for the database.
    >
    > `move_to_fg_name`
    > :   Specifies the identifier for the failover group the databases are being moved to.

`REMOVE db_name [ , db_name ,  ... ] FROM ALLOWED_DATABASES`
:   Specifies a comma-separated list of databases to remove from the list of databases enabled for replication and failover.

    > **Note:**
    >
    > When you remove a database from a primary failover group, the database is dropped in any target account with a linked secondary
    > failover group when the next refresh operation occurs.
    >
    > To avoid dropping databases in the target account, you can drop the secondary failover group *before* the next time the modified
    > primary failover group is replicated to the target account. When you drop the secondary failover group, read-only secondary
    > databases that were included in the group become standalone read-write databases in the target account.

`ADD external_volume_name [ , external_volume_name ,  ... ] TO ALLOWED_EXTERNAL_VOLUMES`
:   Specifies a comma-separated list of additional external volumes to enable for replication and failover. To add external volumes,
    EXTERNAL VOLUMES must be included in the list of specified object types. If the list of object types does not already include
    EXTERNAL VOLUMES, you must add it.

    > `external_volume_name`
    > :   Specifies the identifier for the external volume.

`MOVE EXTERNAL VOLUMES external_volume_name [ , external_volume_name ,  ... ] TO FAILOVER GROUP move_to_fg_name`
:   Specifies a comma-separated list of external volumes to move from one failover group to another failover group. The failover group the external volumes
    are being moved to must include EXTERNAL VOLUMES in the list of specified object types.

    > `db_name`
    > :   Specifies the identifier for the external volume.
    >
    > `move_to_fg_name`
    > :   Specifies the identifier for the failover group the external volumes are being moved to.

`REMOVE external_volume_name [ , external_volume_name ,  ... ] FROM ALLOWED_EXTERNAL_VOLUMES`
:   Specifies a comma-separated list of external volumes to remove from the list of external volumes enabled for replication and failover.

    > **Note:**
    >
    > When you remove an external volume from a primary failover group, the external volume is dropped in any target account with a
    > linked secondary failover group when the next refresh operation occurs.
    >
    > To avoid dropping external volumes in the target account, you can drop the secondary failover group *before* the next time the modified
    > primary failover group is replicated to the target account. When you drop the secondary failover group, read-only secondary
    > external volumes that were included in the group become standalone read-write external volumes in the target account.

`ADD share_name [ , share_name ,  ... ] TO ALLOWED_SHARES`
:   Specifies a comma-separated list of additional shares to enable for replication and failover. To add shares, SHARES must be included in
    the list of specified object types. If the list of object types doesn’t already include SHARES, you must add it.

    > `share_name`
    > :   Specifies the identifier for the share.

`MOVE SHARES share_name [ , share_name ,  ... ] TO FAILOVER GROUP move_to_fg_name`
:   Specifies a comma-separated list of shares to move from one failover group to another failover group. The failover group the shares
    are being moved to must include SHARES in the list of specified object types.

    > `share_name`
    > :   Specifies the identifier for the share.
    >
    > `move_to_fg_name`
    > :   Specifies the identifier for the failover group the shares are being moved to.

`REMOVE share_name [ , share_name ,  ... ] FROM ALLOWED_SHARES`
:   Specifies a comma-separated list of shares to remove from the list of shares enabled for replication and failover.

    > **Note:**
    >
    > When you remove a share from a primary failover group, the share is dropped in any target account with a secondary
    > failover group when the next refresh operation occurs.

`ADD org_name.target_account_name [ , org_name.target_account_name ,  ... ] TO ALLOWED_ACCOUNTS`
:   Specifies a comma-separated list of target accounts to add to the primary failover group to enable replication and failover of
    specified objects in the source account to the target account. Secondary failover groups in the target accounts in this list
    can be promoted to serve as the primary failover group in case of failover.

    > `org_name`
    > :   Name of your Snowflake organization.
    >
    > `target_account_name`
    > :   Target account to which you are enabling replication of the specified objects.

`REMOVE org_name.target_account_name [ , org_name.target_account_name ,  ... ] FROM ALLOWED_ACCOUNTS`
:   Specifies a comma-separated list of target accounts to remove from the primary failover group to disable replication
    of specified objects in the source account to the target account.
    Removing a target account disables failover from the current account to this target account.

    > `org_name`
    > :   Name of your Snowflake organization.
    >
    > `target_account_name`
    > :   Target account to which you are disabling replication of the specified objects.

`IGNORE EDITION CHECK`
:   Allows replicating objects to accounts in the following scenario:

    > The primary failover group is in a Business Critical (or higher) account and a signed business associate agreement is in place to
    > store PHI data in the account per HIPAA and [HITRUST](../../user-guide/intro-cloud-platforms.md) regulations. However, no such agreement is in place
    > for one or more of the accounts approved for replication, regardless if they are Business Critical (or higher) accounts.

    This scenario is prohibited by default.

**Target Account**

`name`
:   Specifies the identifier for the failover group.

`REFRESH`
:   Refreshes the objects in the target (current) account from the source account.

`PRIMARY`
:   Promote a secondary failover group and its specified objects in the target (current) account to primary (in case of
    failover).

`SUSPEND [ IMMEDIATE ]`
:   Suspend the scheduled refresh of the secondary failover group (if the primary failover group has scheduled refreshes using the
    `REPLICATION_SCHEDULE` property).

    The optional `IMMEDIATE` keyword cancels a scheduled refresh operation that is currently in progress for the secondary failover group
    (if there is one). Note that there might be a slight delay between the time that the statement returns and the time that the cancellation
    of the refresh operation is finished.

`RESUME`
:   Resume scheduled refresh of the secondary failover group (if the primary failover group has scheduled refreshes using the
    `REPLICATION_SCHEDULE` property).

`UNSET ...`
:   Specifies one (or more) properties to unset for the failover group, which resets them to the defaults:

    * `COMMENT`
    * `REPLICATION_SCHEDULE`
    * `ERROR_INTEGRATION`
    * `TAG tag_name [ , tag_name ... ]`

    You can reset multiple properties with a single ALTER statement; however, each property must be separated by
    a comma. Also, when resetting a property, you only specify the name; no value is required.

## Usage notes

* The following minimal privileges are required:

  + To refresh a secondary failover group using ALTER FAILOVER GROUP … REFRESH, the active, primary role must have either the OWNERSHIP or
    REPLICATE privilege on the failover group.
  + To fail over a secondary failover group using ALTER FAILOVER GROUP … PRIMARY, a role must have either the OWNERSHIP or FAILOVER
    privilege on the failover group.
  + To make any other changes to the failover group, only a role with the OWNERSHIP privilege on the group can execute this SQL command.
  + To add a database to a failover group, the active role must have the MONITOR privilege on the database.
  + To add an external volume to a replication group, the active role must have the USAGE privilege on the external volume.
  + To add a share to a failover group, the active role must have the OWNERSHIP privilege on the share.
* Identifiers for failover groups and replication groups in an account must be unique.
* Objects other than databases, external volumes, and shares must be in the same failover group.
* A database can only be added to one failover group.
* An external volume can only be added to one failover group.
* [Inbound shares](../../user-guide/data-share-consumers.md) (shares from providers) *cannot* be added to a replication or failover group.
* Promoting a secondary failover group to primary (in case of failover) fails if a refresh is in progress.
* If a refresh is in progress when the replication schedule is updated, the refresh continues until completion and the next refresh will
  use the new schedule.
* On failover, scheduled refreshes on all secondary failover groups are suspended. `ALTER FAILOVER GROUP ... RESUME` must be executed
  on each secondary to resume automatic refreshes.
* To move databases, external volumes, or shares from one failover group (the move-from group) to another failover group (the move-to group):

  + Both groups must be of the same type: FAILOVER GROUP.
  + If the last database in the move-from group is moved to another group, the `allowed_databases` property for the move-from group
    is set to NULL. The same behavior applies to shares and external volumes.
  + If the move-to group doesn’t have the object type that is being moved (`databases`, `external volumes`, or `shares`) in the `object_types`
    list, it must be explicitly added to the move-to group before you move the objects.
* If database, external volume, or share objects are removed from a primary failover group (by using the REMOVE parameter or SET parameter to
  modify the ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES lists), those objects are dropped in any target account when the next
  refresh operation occurs.

  To avoid dropping these objects in the target account, you can drop the secondary failover group *before* the next time the modified
  primary failover group is replicated to the target account.
* To retrieve the list of accounts in your organization that are enabled for replication, use the
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md) command.
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).
* Automatically [scheduled refresh operations](../../user-guide/account-replication-intro.md) are executed using the role with the OWNERSHIP
  privilege on the group. If a scheduled refresh operation fails due to insufficient privileges, grant the required privileges
  to the role with the OWNERSHIP privilege on the group.
* The ALTER FAILOVER GROUP … SUSPEND IMMEDIATE command doesn’t cancel an in-progress refresh operation if it was manually triggered.
  For information, see [Cancel an in-progress refresh operation that wasn’t automatically scheduled](../../user-guide/account-replication-failover-failback.md).
* Canceling an in-progress refresh operation that is in the SECONDARY_DOWNLOADING_METADATA or SECONDARY_DOWNLOADING_DATA phase might
  result in an inconsistent state on the target account. For more information see [View the current phase of an in-progress refresh operation](../../user-guide/account-replication-failover-failback.md).

* If you create a replication or failover group with a tag or modify a replication or failover group by setting a tag on it,
  [tag inheritance](../../user-guide/object-tagging/inheritance.md) does not apply to any objects that you specify in the replication or failover group.

  Tag inheritance is only applicable to objects with a [parent-child relationship](../../user-guide/security-access-control-overview.md), such
  database, schema, and table. There are no child objects of replication or failover groups.
* You cannot set a tag or modify a tag on a secondary replication or failover group because these objects are read
  only.
* When you refresh a secondary replication or failover group, any tags that are set on the primary group are then set on
  the secondary group.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Executed from the source account

Add `myorg.myaccount3` to the list of target accounts to which replication of specified objects and failover from the source
account is enabled.

```sqlexample
ALTER FAILOVER GROUP myfg ADD myorg.myaccount3 TO ALLOWED_ACCOUNTS;
```

Reset the object types list for replication in the source account and add database `db1`:

```sqlexample
ALTER FAILOVER GROUP myfg SET
  OBJECT_TYPES = USERS, ROLES, WAREHOUSES, RESOURCE MONITORS, DATABASES
  ALLOWED_DATABASES = db1;
```

Add databases `db2` and `db3` to the list of databases:

```sqlexample
ALTER FAILOVER GROUP myfg
  ADD db2, db3 TO ALLOWED_DATABASES;
```

Move database `db3` to another failover group, `myfg2`:

```sqlexample
ALTER FAILOVER GROUP myfg
  MOVE DATABASES db3 TO FAILOVER GROUP myfg2;
```

Move database `db2` in `myfg` to another failover group, `myfg3`, that currently has no databases:

> 1. First add `databases` to `object_types`:
>
>    ```sqlexample
>    ALTER FAILOVER GROUP myfg3 SET
>      OBJECT_TYPES = DATABASES, SHARES;
>    ```
> 2. Move `db2` to `myfg3`:
>
>    ```sqlexample
>    ALTER FAILOVER GROUP myfg
>      MOVE DATABASES db2 TO FAILOVER GROUP myfg3;
>    ```

Remove all databases from the list of databases in the source account for replication and failover:

```sqlexample
ALTER FAILOVER GROUP myfg
  SET ALLOWED_DATABASES = NULL;
```

> **Note:**
>
> Executing the statement above removes all databases from the list of databases to be replicated, but does not remove
> database objects from the list of specified object types for replication and failover.
>
> To disable replication and failover of all databases and remove databases from the list of specified object types:
>
> ```sqlexample
> ALTER FAILOVER GROUP myfg
>   REMOVE databases FROM OBJECT_TYPES;
> ```

Add (or modify) the interval for automatically scheduled refreshes:

```sqlexample
ALTER FAILOVER GROUP myfg
  SET REPLICATION_SCHEDULE = '15 MINUTE';
```

### Executed from the target account

Refresh objects in the failover group `myfg` in the target account:

```sqlexample
ALTER FAILOVER GROUP myfg REFRESH;
```

Promote the secondary failover group in the current target account to primary:

```sqlexample
ALTER FAILOVER GROUP myfg PRIMARY;
```

Suspend automatic refreshes:

```sqlexample
ALTER FAILOVER GROUP myfg SUSPEND;
```

---
title: ALTER FEATURE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-feature-policy.md
section: SQL Commands
---

# ALTER FEATURE POLICY

Alters or renames a [feature policy](../../developer-guide/native-apps/ui-consumer-feature-policies.md).

See also:
:   [CREATE FEATURE POLICY](create-feature-policy.md) , [DESCRIBE FEATURE POLICY](desc-feature-policy.md), [DROP FEATURE POLICY](drop-feature-policy.md), [SHOW FEATURE POLICIES](show-feature-policies.md)

## Syntax

```sqlsyntax
ALTER FEATURE POLICY [ IF EXISTS ] <name> SET
  [ BLOCKED_OBJECT_TYPES_FOR_CREATION = ( [ <type> [ , <type>  ... ] ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER FEATURE POLICY [ IF EXISTS ] <name> UNSET
  [ BLOCKED_OBJECT_TYPES_FOR_CREATION ]
  [ COMMENT ]

ALTER FEATURE POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER FEATURE POLICY [ IF EXISTS ] <name> SET  TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FEATURE POLICY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , ... ]
```

## Parameters

`name`
:   Specifies the identifier for the feature policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET`
:   Specifies one (or more) properties to set for the feature policy.

    `BLOCKED_OBJECT_TYPES_FOR_CREATION = ( type [ , type ... ] )`
    :   Specifies the objects that an app is prohibit from creating.

        Possible values are:

        * COMPUTE_POOLS
        * DATABASES
        * TASKS
        * WAREHOUSES

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the feature policy.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY FEATURE POLICY | Account | This privilege is required to set a feature policy for the current account. |
| APPLY or OWNERSHIP | Feature policy | One of these privileges is required to modify a feature policy. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If a previous policy had been applied to the account or an app an error is return, unless the you
  specify the FORCE option to force the replacement of the existing policy.
* When a feature policy is unbound from an app, the account level policy takes effect, if it exists.

## Examples

The following example sets the BLOCKED_OBJECT_TYPES_FOR_CREATION property on the feature policy
to prohibit an app from creating databases or tasks:

```sqlexample
ALTER FEATURE POLICY block_create_db_policy SET
  BLOCKED_OBJECT_TYPES_FOR_CREATION = (DATABASES, TASKS);
```

The following example changes the name of a feature policy from `block_create_db_policy` to
`block_create_db_task_policy`:

```sqlexample
ALTER FEATURE POLICY block_create_db_policy RENAME TO block_create_db_task_policy;
```

---
title: ALTER FILE FORMAT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-file-format.md
section: SQL Commands
---

# ALTER FILE FORMAT

Modifies the properties for an existing file format object. Currently the only actions that are supported are renaming the file format, changing
the file format options (based on the type), and adding/changing a comment. To make any other changes, you must drop the file format and then
recreate it.

See also:
:   [CREATE FILE FORMAT](create-file-format.md) , [DROP FILE FORMAT](drop-file-format.md) , [SHOW FILE FORMATS](show-file-formats.md) , [DESCRIBE FILE FORMAT](desc-file-format.md)

## Syntax

```sqlsyntax
ALTER FILE FORMAT [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER FILE FORMAT [ IF EXISTS ] <name> SET { [ formatTypeOptions ] [ COMMENT = '<string_literal>' ] }
```

Where:

> ```sqlsyntax
> formatTypeOptions ::=
> -- If TYPE = CSV
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      MULTI_LINE = TRUE | FALSE
>      FILE_EXTENSION = '<string>'
>      PARSE_HEADER = TRUE | FALSE
>      SKIP_HEADER = <integer>
>      SKIP_BLANK_LINES = TRUE | FALSE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      ESCAPE = '<character>' | NONE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      TRIM_SPACE = TRUE | FALSE
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
>      ENCODING = '<string>' | UTF8
> -- If TYPE = JSON
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      TRIM_SPACE = TRUE | FALSE
>      MULTI_LINE = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      FILE_EXTENSION = '<string>'
>      ENABLE_OCTAL = TRUE | FALSE
>      ALLOW_DUPLICATE = TRUE | FALSE
>      STRIP_OUTER_ARRAY = TRUE | FALSE
>      STRIP_NULL_VALUES = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> -- If TYPE = AVRO
>      COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = ORC
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = PARQUET
>      COMPRESSION = AUTO | LZO | SNAPPY | NONE
>      SNAPPY_COMPRESSION = TRUE | FALSE
>      BINARY_AS_TEXT = TRUE | FALSE
>      USE_LOGICAL_TYPE = TRUE | FALSE
>      TRIM_SPACE = TRUE | FALSE
>      USE_VECTORIZED_SCANNER = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = XML
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      PRESERVE_SPACE = TRUE | FALSE
>      STRIP_OUTER_ELEMENT = TRUE | FALSE
>      DISABLE_AUTO_CONVERT = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> ```

## Parameters

`name`
:   Specifies the identifier for the file format to alter. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_name`
:   Specifies the new identifier for the file format; must be unique for the schema.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies the options/properties to set for the file format:

    `FILE_FORMAT = ( ... )`
    :   Modifies the format-specific options for the file format. For more details, see
        Format Type Options (in this topic).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the file format.

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`TYPE = ...`), you can include one or more of the following format-specific options (separated
by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified when loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate records in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   Data loading:
        :   New line character. Note that “new line” is logical such that `\r\n` will be understood as a new line for files on a Windows platform.

        Data unloading:
        :   New line character (`\n`).

`FIELD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate fields in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

            > > **Note:**
            > >
            > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.csv[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

    > **Note:**
    >
    > If the `SINGLE` copy option is `TRUE`, then the COPY command unloads a file without a file extension by default. To specify a file extension, provide a file name and extension in the
    > `internal_location` or `external_location` path (For example, `copy into @stage/data.csv`).

`PARSE_HEADER = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to use the first row headers in the data files to determine column names.

    This file format option is applied to the following actions only:

    > * Automatically detecting column definitions by using the INFER_SCHEMA function.
    > * Loading CSV data into separate columns by using the INFER_SCHEMA function and MATCH_BY_COLUMN_NAME copy option.

    If the option is set to TRUE, the first row headers will be used to determine column names. The default value FALSE will return column names as c\*, where \* is the position of the column.

    > **Note:**
    >
    > * This option isn’t supported for external tables.
    > * The SKIP_HEADER option isn’t supported if you set `PARSE_HEADER = TRUE`.

    Default:
    :   `FALSE`

`SKIP_HEADER = integer`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to load.

    Default:
    :   `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default:
    :   `FALSE`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of date values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) (data loading) or [DATE_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of time values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) (data loading) or [TIME_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of timestamp values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) (data loading) or [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the encoding format for binary input or output. The option can be used when loading data into or unloading data from binary columns in a table.

    Default:
    :   `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for enclosed or unenclosed field values. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for enclosed fields only. Specify the character used to enclose fields by setting `FIELD_OPTIONALLY_ENCLOSED_BY`.

        > **Note:**
        >
        > This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        > as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        > the option value.
        >
        > In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        > option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If this option is set, it overrides the escape character set for `ESCAPE_UNENCLOSED_FIELD`.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for unenclosed fields only.

        > **Note:**
        >
        > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
        >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, the load operation treats
        >   this row and the next row as a single row of data. To avoid this issue, set the value to `NONE`.
        > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        >   the option value.
        >
        >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If `ESCAPE` is set, the escape character set for that file format option overrides this option.

    Default:
    :   backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove white space from fields.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        As another example, if leading or trailing spaces surround quotes that enclose strings, you can remove the surrounding spaces using this option and the quote character using the
        `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any spaces within the quotes are preserved. For example, assuming `FIELD_DELIMITER = '|'` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

        ```sqlexample
        |"Hello world"|    /* loads as */  >Hello world<
        |" Hello world "|  /* loads as */  > Hello world <
        | "Hello world" |  /* loads as */  >Hello world<
        ```

        (the brackets in this example are not loaded; they are used to demarcate the beginning and end of the loaded strings)

    Default:
    :   `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex representation (`0x27`) or the double single-quoted escape (`''`).

        Data unloading only:
        :   When a field in the source table contains this character, Snowflake escapes it using the same character for unloading. For example, if the value is the double quote character and a field contains the string `A "B" C`, Snowflake escapes the double quotes for unloading as follows:

            `A ""B"" C`

    Default:
    :   `NONE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   String used to convert to and from SQL NULL:

        * When loading data, Snowflake replaces these values in the data load source with SQL NULL. To specify more than one string, enclose
          the list of strings in parentheses and use commas to separate each value.

          Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as
          a value, all instances of `2` as either a string or number are converted.

          For example:

          `NULL_IF = ('\N', 'NULL', 'NUL', '')`

          Note that this option can include empty strings.
        * When unloading data, Snowflake converts SQL NULL values to the first value in the list.

    Default:
    :   `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\`)

`ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to generate a parsing error if the number of delimited columns (i.e. fields) in an input file does not match the number of columns in the corresponding table.

        If set to `FALSE`, an error is not generated and the load continues. If the file is successfully loaded:

        * If the input file contains records with more fields than columns in the table, the matching fields are loaded in order of occurrence in the file and the remaining fields are not loaded.
        * If the input file contains records with fewer fields than columns in the table, the non-matching columns in the table are loaded with NULL values.

        This option assumes all the records within the input file are the same length (i.e. a file containing records of varying length return an error regardless of the value specified for this parameter).

    Default:
    :   `TRUE`

    > **Note:**
    >
    > When [transforming data during loading](../../user-guide/data-load-transform.md) (i.e. using a query as the source for the COPY command), this option is ignored. There is no requirement for your data files to have
    > the same number and ordering of columns as your target table.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`).

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies whether to insert SQL NULL for empty fields in an input file, which are represented by two successive delimiters (For example, `,,`).

          If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is inserted into columns of type STRING. For other column types, the COPY command produces an error.
        * When unloading data, this option is used in combination with `FIELD_OPTIONALLY_ENCLOSED_BY`. When `FIELD_OPTIONALLY_ENCLOSED_BY = NONE`, setting `EMPTY_FIELD_AS_NULL = FALSE` specifies to unload empty strings in tables to empty string values without quotes enclosing the field values.

          If set to `TRUE`, `FIELD_OPTIONALLY_ENCLOSED_BY` must specify a character to enclose strings.

    Default:
    :   `TRUE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

`ENCODING = 'string'`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String (constant) that specifies the character set of the source data when loading data into a table.

        | Character Set | `ENCODING` Value | Supported Languages | Notes |
        | --- | --- | --- | --- |
        | Big5 | `BIG5` | Traditional Chinese |  |
        | EUC-JP | `EUCJP` | Japanese |  |
        | EUC-KR | `EUCKR` | Korean |  |
        | GB18030 | `GB18030` | Chinese |  |
        | IBM420 | `IBM420` | Arabic |  |
        | IBM424 | `IBM424` | Hebrew |  |
        | IBM949 | `IBM949` | Korean |  |
        | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
        | ISO-2022-JP | `ISO2022JP` | Japanese |  |
        | ISO-2022-KR | `ISO2022KR` | Korean |  |
        | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
        | ISO-8859-5 | `ISO88595` | Russian |  |
        | ISO-8859-6 | `ISO88596` | Arabic |  |
        | ISO-8859-7 | `ISO88597` | Greek |  |
        | ISO-8859-8 | `ISO88598` | Hebrew |  |
        | ISO-8859-9 | `ISO88599` | Turkish |  |
        | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
        | KOI8-R | `KOI8R` | Russian |  |
        | Shift_JIS | `SHIFTJIS` | Japanese |  |
        | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
        | UTF-16 | `UTF16` | All languages |  |
        | UTF-16BE | `UTF16BE` | All languages |  |
        | UTF-16LE | `UTF16LE` | All languages |  |
        | UTF-32 | `UTF32` | All languages |  |
        | UTF-32BE | `UTF32BE` | All languages |  |
        | UTF-32LE | `UTF32LE` | All languages |  |
        | windows-874 | `WINDOWS874` | Thai |  |
        | windows-949 | `WINDOWS949` | Korean |  |
        | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
        | windows-1251 | `WINDOWS1251` | Russian |  |
        | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | windows-1253 | `WINDOWS1253` | Greek |  |
        | windows-1254 | `WINDOWS1254` | Turkish |  |
        | windows-1255 | `WINDOWS1255` | Hebrew |  |
        | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default:
    :   `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8 before it is loaded into Snowflake.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of date string values in the data files. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of time string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of timestamp string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the encoding format for binary string values in the data files. The option can be used when loading data into binary columns in a table.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `HEX`

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`MULTI_LINE = TRUE | FALSE`
:   Use: Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default:
    :   `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.json[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

`ENABLE_OCTAL = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that enables parsing of octal numbers.

    Default:
    :   `FALSE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to allow duplicate object field names (only the last one will be preserved).

    Default:
    :   `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove outer brackets (i.e. `[ ]`).

    Default:
    :   `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

        | Before | After |
        | --- | --- |
        | `[null]` | `[]` |
        | `[null,null,3]` | `[,,3]` |
        | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
        | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`.

> **Note:**
>
> We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | LZO | SNAPPY | NONE`
:   Use:
    :   Data unloading and external tables

    Definition:

    * When unloading data, specifies the compression algorith for columns in the Parquet files.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). . When unloading data, unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `LZO` | When unloading data, files are compressed using the Snappy algorithm by default. If unloading data to LZO-compressed files, specify this value. |
        | `SNAPPY` | When unloading data, files are compressed using the Snappy algorithm by default. You can optionally specify this value. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`SNAPPY_COMPRESSION = TRUE | FALSE`
:   Use:
    :   Data unloading only

        | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | Unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `SNAPPY` | May be specified if unloading Snappy-compressed files. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Definition:
    :   Boolean that specifies whether unloaded file(s) are compressed using the SNAPPY algorithm.

    > **Note:**
    >
    > Deprecated. Use `COMPRESSION = SNAPPY` instead.

    Limitations:
    :   Only supported for data unloading operations.

    Default:
    :   `TRUE`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`USE_LOGICAL_TYPE = TRUE | FALSE`
:   Use:
    :   Data loading, data querying in staged files, and schema detection.

    Definition:
    :   Boolean that specifies whether to use Parquet logical types. With this file format option, Snowflake can interpret Parquet logical types during data loading. For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). To enable Parquet logical types, set USE_LOGICAL_TYPE as TRUE when you create a new file format option.

    Limitations:
    :   Not supported for data unloading.

`USE_VECTORIZED_SCANNER = TRUE | FALSE`
:   Use:
    :   Data loading and data querying in staged files

    Definition:
    :   Boolean that specifies whether to use a vectorized scanner for loading Parquet files.

    Default:
    :   `FALSE`. In a future BCR, the default value will be `TRUE`.

    Using the vectorized scanner can significantly reduce the latency for loading Parquet files, because this scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

    If `USE_VECTORIZED_SCANNER` is set to `TRUE`, the vectorized scanner has the following behaviors:

    > * The `BINARY_AS_TEXT` option is always treated as `FALSE` and the `USE_LOGICAL_TYPE` option is always treated as `TRUE`, no matter what the actual value is being set to.
    > * The vectorized scanner supports Parquet map types. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >   {
    >   >    "k1": "v1",
    >   >    "k2": "v2"
    >   >   }
    >   > ```
    > * The vectorized scanner shows `NULL` values in the output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "nickname": null,
    >   >   "age": 34,
    >   >   "phone_numbers":
    >   >   [
    >   >     "1234567890",
    >   >     "0987654321",
    >   >     null,
    >   >     "6781234590"
    >   >   ]
    >   >   }
    >   > ```
    > * The vectorized scanner handles Time and Timestamp as follows:
    >
    >   > | Parquet | Snowflake vectorized scanner |
    >   > | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS/NANOS) | TIME |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_LTZ |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_NTZ |
    >   > | INT96 | TIMESTAMP_LTZ |

    If `USE_VECTORIZED_SCANNER` is set to `FALSE`, the scanner has the following behaviors:

    > * This option does not support Parquet maps. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >  {
    >   >   "key_value":
    >   >   [
    >   >    {
    >   >           "key": "k1",
    >   >           "value": "v1"
    >   >       },
    >   >       {
    >   >           "key": "k2",
    >   >           "value": "v2"
    >   >       }
    >   >     ]
    >   >   }
    >   > ```
    > * This option does not explicitly show `NULL` values in the scan output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "age": 34
    >   >   "phone_numbers":
    >   >   [
    >   >    "1234567890",
    >   >    "0987654321",
    >   >    "6781234590"
    >   >   ]
    >   >  }
    >   > ```
    > * This option handles Time and Timestamp as follows:
    >
    >   > | Parquet | When USE_LOGICAL_TYPE = TRUE | When USE_LOGICAL_TYPE = FALSE |
    >   > | --- | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS) | TIME | + TIME (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=NANOS) | TIME | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS) | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
    >   > | TimestampType(isAdjustedToUtc=True, unit=NANOS) | TIMESTAMP_LTZ | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS) | TIMESTAMP_NTZ | + TIMESTAMP_LTZ (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimestampType(isAdjustedToUtc=False, unit=NANOS) | TIMESTAMP_NTZ | INTEGER |
    >   > | INT96 | TIMESTAMP_NTZ | TIMESTAMP_NTZ |

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = XML

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`PRESERVE_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser preserves leading and trailing spaces in element content.

    Default:
    :   `FALSE`

`STRIP_OUTER_ELEMENT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser strips out the outer XML element, exposing 2nd level elements as separate documents.

    Default:
    :   `FALSE`

`DISABLE_AUTO_CONVERT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser disables automatic conversion of numeric and Boolean values from text to native representation.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

## Usage notes

* ALTER FILE FORMAT does not support the following actions:

  + Changing the type (CSV, JSON, etc.) for the file format.
  + Unsetting any format options (i.e. resetting the options to the defaults for the type).
  + Unsetting (i.e. removing) a comment.

  To make any of these changes, you must recreate the file format.

## Examples

Rename file format `my_format` to `my_new_format`:

> ```sqlexample
> ALTER FILE FORMAT IF EXISTS my_format RENAME TO my_new_format;
> ```

Specify comma (`,`) as the field delimiter for `my_format` (created in the [CREATE FILE FORMAT](create-file-format.md) examples):

> ```sqlexample
> ALTER FILE FORMAT my_format SET FIELD_DELIMITER=',';
> ```

---
title: ALTER FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-function.md
section: SQL Commands
---

# ALTER FUNCTION

Modifies the properties of an existing user-defined or external function.

To make any other changes to a UDF, you must drop the function (using [DROP FUNCTION](drop-function.md)) and then recreate it.

See also:
:   [Writing external functions](../external-functions.md), [User-defined functions overview](../../developer-guide/udf/udf-overview.md), [CREATE FUNCTION](create-function.md) , [DROP FUNCTION](drop-function.md) ,
    [SHOW USER FUNCTIONS](show-user-functions.md) , [DESCRIBE FUNCTION](desc-function.md), [CREATE EXTERNAL FUNCTION](create-external-function.md) ,
    [DESCRIBE FUNCTION](desc-function.md) , [DROP FUNCTION](drop-function.md) , [SHOW EXTERNAL FUNCTIONS](show-external-functions.md)

## Syntax

### User-defined and external functions

The syntax for ALTER FUNCTION varies depending on which language you’re using as the UDF handler.

#### Java handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET { SECURE | LOG_LEVEL | TRACE_LEVEL | COMMENT }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , <integration_name> ... ] ) ]
  [ SECRETS = ( '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]
```

#### JavaScript handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET { SECURE | LOG_LEVEL | TRACE_LEVEL | COMMENT }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ COMMENT = '<string_literal>' ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]
```

#### Python handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET { SECURE | LOG_LEVEL | TRACE_LEVEL | COMMENT }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , <integration_name> ... ] ) ]
  [ SECRETS = ( '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]
```

#### Scala handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET { SECURE | LOG_LEVEL | TRACE_LEVEL | COMMENT }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , <integration_name> ... ] ) ]
  [ SECRETS = ( '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]
```

#### SQL handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET { SECURE | LOG_LEVEL | TRACE_LEVEL | COMMENT }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ COMMENT = '<string_literal>' ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET DCM PROJECT
```

### External functions

#### Any language handler

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET API_INTEGRATION = <api_integration_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET HEADERS = ( [ '<header_1>' = '<value>' [ , '<header_2>' = '<value>' ... ] ] )

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET CONTEXT_HEADERS = ( [ <context_function_1> [ , <context_function_2> ...] ] )

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET MAX_BATCH_ROWS = <integer>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET COMPRESSION = <compression_type>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET { REQUEST_TRANSLATOR | RESPONSE_TRANSLATOR } = <udf_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET
              { COMMENT | HEADERS | CONTEXT_HEADERS | MAX_BATCH_ROWS | COMPRESSION | SECURE | REQUEST_TRANSLATOR | RESPONSE_TRANSLATOR }
```

## Parameters

### User-defined and external functions

`name`
:   Specifies the identifier for the UDF to alter. The identifier can contain the schema name and database name, as well as the function name.
    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are also case-sensitive.

`arg_data_type [ , ... ]`
:   Specifies the arguments/input data types for the external function.

    If the function accepts arguments, then the ALTER command must specify the argument types because functions support
    name overloading (i.e. two functions in the same schema can have the same name), and the argument types are used to
    identify the function.

`SET ...`
:   Specifies the properties to set for the function:

    `SECURE`
    :   Specifies whether a function is secure. For more details, see [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
        the specified level (and at more severe levels) are ingested.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `TRACE_LEVEL = 'trace_level'`
    :   Controls how trace events are ingested into the event table.

        For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting trace level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
    :   The names of [external access integrations](create-external-access-integration.md) needed in order for this
        function’s handler code to access external networks.

        An external access integration contains [network rules](create-network-rule.md) and
        [secrets](create-secret.md) that specify the external locations and credentials (if any) needed for handler code
        to make requests of an external network, such as an external REST API.

        For more information, refer to [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

    `SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
    :   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
        secrets in handler code.

        This parameter’s value is a list of assignment expressions with the following parts:

        * `secret_name` as the name of a secret specified in an
          [external access integration’s](create-external-access-integration.md) ALLOWED_AUTHENTICATION_SECRETS parameter
          value. That external access integration’s name must, in turn, be specified as a value of this CREATE FUNCTION call’s
          EXTERNAL_ACCESS_INTEGRATIONS parameter.

          You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
          EXTERNAL_ACCESS_INTEGRATIONS parameter.
        * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the function. The value you specify is displayed in the `DESCRIPTION`
        column in the [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md) output.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the properties to unset for the function, which resets them to the defaults.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the function from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the function and the DCM project without dropping the function. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

### User-defined functions

`RENAME TO new_name`
:   Specifies the new identifier for the UDF; the combination of the identifier and existing argument data types must be unique for the schema.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > When specifying the new name for the UDF, do not specify argument data types or parentheses; specify only the new name.

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

### External functions

`RENAME TO new_name`
:   Specifies the new identifier for the function.

    The identifier does not need to be unique for the schema in which the function is created because functions are
    identified and resolved by their name and argument types. However, the signature (name and parameter data types)
    must be unique within the schema.

    The `name` must follow the rules for Snowflake [identifiers](../identifiers.md).
    For more details, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > When specifying the new name for the external function, do not specify argument data types or parentheses;
    > the function will continue using the same arguments as before.

`api_integration_name`
:   This is the name of the API integration object that should be used to authenticate the call to the proxy service.

    More details about this parameter are in [CREATE EXTERNAL FUNCTION](create-external-function.md).

`HEADERS = ( 'header_1' = 'value' [ , 'header_2' = 'value' ... ] )`
:   This clause allows users to attach key-value metadata that is sent with every request.

    The value must be a constant string, not an expression.

    More details about this parameter are in [CREATE EXTERNAL FUNCTION](create-external-function.md).

`CONTEXT_HEADERS = ( [ context_function_1 [ , context_function_2 ... ] ] )`
:   This is similar to HEADERS, but instead of allowing only constant strings, it allows binding Snowflake
    context function results to HTTP headers.

    Each value must be the name of a context function. The names should not be quoted.

    More details about this parameter are in [CREATE EXTERNAL FUNCTION](create-external-function.md).

`COMPRESSION = compression_type`
:   If this clause is specified, the JSON payload is compressed using the specified format when sent from Snowflake to
    the proxy service, and when sent back from the proxy service to Snowflake.

    For more details about valid values of `compression_type`, see [CREATE EXTERNAL FUNCTION](create-external-function.md).

`{ REQUEST_TRANSLATOR | RESPONSE_TRANSLATOR } = udf_name`
:   Add a request translator or a response translator if the external function does not already have one or replace an existing request translator
    or response translator by specifying the name of a previously-created JavaScript UDF.
    For more information, see [Using request and response translators with data for a remote service](../external-functions-translators.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Function | Enables calling a UDF or external function. |
| APPLY | Tag | Enables setting a tag on the UDF or external function. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

### User-defined functions

* If using a UDF in a [masking policy](create-masking-policy.md), ensure the data type of the column, UDF, and masking policy match. For
  more information, see [User-defined functions in a masking policy](../../user-guide/security-column-intro.md).

### External functions

* There is no UNSET command for API_INTEGRATION. You can change the API_INTEGRATION, but you cannot unset it. For more, see
  [ALTER API INTEGRATION](alter-api-integration.md).

## Examples

Rename the function `function1` to `function2`:

```sqlexample
ALTER FUNCTION IF EXISTS function1(number) RENAME TO function2;
```

Convert a regular function `function2` to a secure function:

```sqlexample
ALTER FUNCTION function2(number) SET SECURE;
```

### External functions

Change the API Integration for an external function:

```sqlexample
ALTER FUNCTION function4(number) SET API_INTEGRATION = api_integration_2;
```

Set the maximum number of rows per batch for an external function:

```sqlexample
ALTER FUNCTION function5(number) SET MAX_BATCH_ROWS = 100;
```

---
title: ALTER FUNCTION (DMF)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-function-dmf.md
section: SQL Commands
---

# ALTER FUNCTION (DMF)

Modifies the properties of an existing data metric function (DMF).

To make any other changes to a DMF, you must drop the function using a [DROP FUNCTION](drop-function.md) command and
recreate the DMF.

## Syntax

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  SET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  UNSET SECURE

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  SET COMMENT = '<string_literal>'

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  UNSET COMMENT

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER FUNCTION [ IF EXISTS ] <name> ( TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ] )
  UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the DMF to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`TABLE( arg_data_type [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ]`
:   Specifies the data type of the column arguments for the DMF. The data types are necessary because DMFs support name
    overloading, where two DMFs in the same schema can have the same name. The data types of the arguments are used to identify the DMF you
    want to alter.

`RENAME TO new_name`
:   Specifies the new identifier for the DMF; the combination of the identifier and existing argument data types must be unique for the
    schema.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > When specifying the new name for the UDF, don’t specify argument data types or parentheses; specify only the new name.

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies the properties to set for the DMF:

    `SECURE`
    :   Specifies whether a function is secure. For more information, see [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the function. The value you specify is displayed in the `DESCRIPTION`
        column in the [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md) output.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the properties to unset for the function, which resets them to the defaults.

    * `SECURE`
    * `COMMENT`
    * `TAG tag_name [ , tag_name ... ]`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Data metric function |  |
| APPLY | Tag | Enables setting a tag on the DMF. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you want to update an existing data metric function and need to see the current definition of the function, run the
  [DESCRIBE FUNCTION (DMF)](desc-function-dmf.md) command or call the [GET_DDL](../functions/get_ddl.md) function.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Example

You can use the ALTER FUNCTION command to make a DMF secure. For more information about what it means for a function to be secure, see
[Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

```sqlexample
ALTER FUNCTION governance.dmfs.count_positive_numbers(
 TABLE(
   NUMBER,
   NUMBER,
   NUMBER
))
SET SECURE;
```

---
title: ALTER FUNCTION (Snowpark Container Services)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-function-spcs.md
section: SQL Commands
---

# ALTER FUNCTION (Snowpark Container Services)

Modifies the properties of an existing [service function](../../developer-guide/snowpark-container-services/working-with-services.md).

To make any other changes to a service function, you must drop the function (using [DROP FUNCTION (Snowpark Container Services)](drop-function-spcs.md)) and then recreate it.

See also:
:   [Service functions](../../developer-guide/snowpark-container-services/working-with-services.md), [CREATE FUNCTION](create-function-spcs.md), [DESC FUNCTION](desc-function-spcs.md), [DROP FUNCTION](drop-function-spcs.md)

## Syntax

```sqlsyntax
ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  RENAME TO <new_name>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET CONTEXT_HEADERS = ( <context_function_1> [ , <context_function_2> ...] )

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET MAX_BATCH_ROWS = <integer>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET MAX_BATCH_RETRIES = <integer>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET ON_BATCH_FAILURE = { ABORT | RETURN_NULL }

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET BATCH_TIMEOUT_SECS = <integer>

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET COMMENT = '<string_literal>'

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  SET SERVICE = '<service_name>' ENDPOINT = '<endpoint_name>'

ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
  UNSET { CONTEXT_HEADERS | MAX_BATCH_ROWS | MAX_BATCH_RETRIES | ON_BATCH_FAILURE | BATCH_TIMEOUT_SECS | COMMENT }
```

## Parameters

`name`
:   Specifies the identifier for the service function to alter. The identifier can contain the schema name and database name, as well as the function name.
    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are also case sensitive.

`arg_data_type [ , ... ]`
:   Specifies the arguments/input data types for the service function.

    If the function accepts arguments, then the ALTER command must specify the argument types because functions support
    name overloading (that is, two functions in the same schema can have the same name), and the argument types are used to
    identify the function.

`RENAME TO new_name`
:   Specifies the new identifier for the service function; the combination of the identifier and existing argument data types must be unique for the schema.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > When specifying the new name for the service function, do not specify argument data types or parentheses; specify only the new name.

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies the properties to set for the function:

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the function, which is displayed in the DESCRIPTION column in the [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md)
        output.

    `SERVICE = '<service_name>' ENDPOINT = '<endpoint_name>'`
    :   Specifies the service name and the endpoint name as defined in the service specification.

    `CONTEXT_HEADERS = ( context_function_1 [ , context_function_2 ... ] )`
    :   It allows binding Snowflake context function results to HTTP headers.

        Each value must be the name of a context function. Don’t include quote marks around the names.

        More details about this parameter are in [CREATE FUNCTION (Snowpark Container Services)](create-function-spcs.md).

    `MAX_BATCH_ROWS = integer`
    :   Specifies the [batch size](../../developer-guide/snowpark-container-services/working-with-services.md) when sending data to a service to increase concurrency

    `MAX_BATCH_RETRIES = integer`
    :   Specifies the number of times you want Snowflake to retry a failed batch.

    `ON_BATCH_FAILURE = { ABORT | RETURN_NULL }`
    :   Specifies the behavior of the function after Snowflake reaches the maximum number of retries processing the batch.

        * `ABORT`: Service function aborts execution. Any remaining batches of rows are not processed.
        * `RETURN_NULL`: Service function returns a NULL for each row in the failed batch and continues processing the remaining batches. If you choose this option, note the following caveats:

          + If these batches depend on each other and one batch fails, this could lead to unexpected results.
          + If your service can return a NULL as a valid response, then it’s not possible to differentiate NULL returned by Snowflake due to batch failure and NULL returned by your service.

    `BATCH_TIMEOUT_SECS = integer`
    :   Specifies the maximum duration for processing a single batch of rows, including retries (and polling for async function requests), after which Snowflake should terminate the batch request.

        Acceptable Values: greater than 0 and less than or equal to 604800 seconds (7 days).

`UNSET ...`
:   Specifies the properties to unset for the function, which resets them to the defaults. Note that you can’t unset the service endpoint.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Function |  |
| USAGE | Service endpoint | Usage on a service endpoint is granted to service roles defined in the service specification. You then grant the service role to the role altering the service function. This privilege is required if altering a service endpoint. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename a service function:

```sqlexample
ALTER FUNCTION my_echo_udf(VARCHAR) RENAME TO my_echo_udf_temp;
```

Set a comment for a service function:

```sqlexample
ALTER FUNCTION my_echo_udf(VARCHAR) SET COMMENT = 'some comment';
```

Set the maximum number of rows per batch for a service function:

```sqlexample
ALTER FUNCTION my_echo_udf(number) SET MAX_BATCH_ROWS = 100;
```

Set the CURRENT_USER context header for a service function:

```sqlexample
ALTER FUNCTION my_echo_udf(VARCHAR) SET CONTEXT_HEADER = (CURRENT_USER);
```

Unset MAX_BATCH_ROWS for a service function:

```sqlexample
ALTER FUNCTION my_echo_udf(VARCHAR) UNSET MAX_BATCH_ROWS;
```

---
title: ALTER GATEWAY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-gateway.md
section: SQL Commands
---

# ALTER GATEWAY

Modifies the configuration of an existing [gateway](../../developer-guide/snowpark-container-services/gateway.md).
Use this command to update the traffic split configuration for a gateway.

See also:
:   [CREATE GATEWAY](create-gateway.md) , [DESCRIBE GATEWAY](desc-gateway.md), [DROP GATEWAY](drop-gateway.md) , [SHOW GATEWAYS](show-gateways.md)

## Syntax

```sqlsyntax
ALTER GATEWAY [ IF EXISTS ] <name>
  FROM SPECIFICATION <specification_text>
```

## Parameters

`name`
:   Specifies the identifier for the gateway to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FROM SPECIFICATION`
:   Specifies the updated gateway specification inline. The specification defines the traffic split configuration.

    The specification uses the following format:

    ```yaml
    spec:
      type: traffic_split
      split_type: custom
      targets:
      - type: endpoint
        value: <db>.<schema>.<service>!<endpoint>
        weight: <weight>
      - type: endpoint
        value: <db>.<schema>.<service>!<endpoint>
        weight: <weight>
    ```

## Specification parameters

`type`
:   Fixed value. Must be set to `traffic_split`.

`split_type`
:   Fixed value. Must be set to `custom`.

`targets`
:   A list of target endpoints to route traffic to. Each target must specify:

    `type`
    :   Fixed value. Must be set to `endpoint`.

    `value`
    :   The fully qualified endpoint name in the format `db.schema.service!endpoint`. Each target endpoint must exist.

    `weight`
    :   The traffic weight for this endpoint, specified as an integer. All weights must add up to 100.

> **Note:**
>
> * Maximum number of endpoints per gateway is 5 by default.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY or OWNERSHIP | Gateway | Required to alter the gateway configuration. |
| BIND SERVICE ENDPOINT | Account | Required to bind service endpoints to the gateway. |
| USAGE | Database | Required on the database containing the gateway. |
| USAGE | Schema | Required on the schema containing the gateway. |
| USAGE | Service endpoints | Required on the target service endpoints. |

To grant the required privileges, use the following commands:

```sqlexample
-- Grant MODIFY or OWNERSHIP privilege on the gateway
GRANT MODIFY ON GATEWAY <gateway_name> TO ROLE <role_name>;
-- OR
GRANT OWNERSHIP ON GATEWAY <gateway_name> TO ROLE <role_name>;

-- Grant BIND SERVICE ENDPOINT privilege on the account
GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE <role_name>;
```

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Alter a gateway to update the traffic split configuration:

```sqlexample
ALTER GATEWAY split_gateway
  FROM SPECIFICATION $$
spec:
  type: traffic_split
  split_type: custom
  targets:
  - type: endpoint
    value: db.schema.s2!ep1
    weight: 60
  - type: endpoint
    value: db.schema.s1!ep1
    weight: 40
$$;
```

---
title: ALTER GIT REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-git-repository.md
section: SQL Commands
---

# ALTER GIT REPOSITORY

Modifies the properties of a Snowflake [Git repository clone](../../developer-guide/git/git-overview.md).

See also:
:   [CREATE GIT REPOSITORY](create-git-repository.md), [DESCRIBE GIT REPOSITORY](desc-git-repository.md), [DROP GIT REPOSITORY](drop-git-repository.md), [SHOW GIT BRANCHES](show-git-branches.md),
    [SHOW GIT REPOSITORIES](show-git-repositories.md), [SHOW GIT TAGS](show-git-tags.md)

## Syntax

```sqlsyntax
ALTER GIT REPOSITORY [ IF EXISTS ] <name> SET
  [ GIT_CREDENTIALS = <secret_name> ]
  [ API_INTEGRATION = <integration_name> ]
  [ COMMENT = '<string_literal>' ]

ALTER GIT REPOSITORY [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER GIT REPOSITORY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER GIT REPOSITORY [ IF EXISTS ] <name> UNSET {
  GIT_CREDENTIALS |
  COMMENT }
  [ , ... ]

ALTER GIT REPOSITORY [ IF EXISTS ] <name> FETCH
```

## Parameters

`name`
:   Specifies the identifier for the Git repository clone to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies the properties to set for the integration:

    `GIT_CREDENTIALS = secret_name`
    :   Specifies the secret object containing credentials for authenticating with the remote Git repository.

        The secret you specify here must be a secret specified by the ALLOWED_AUTHENTICATION_SECRETS parameter of the API integration specified
        for this Git repository.

        For reference information about secrets, see [CREATE SECRET](create-secret.md).

    `API_INTEGRATION = integration_name`
    :   Specifies the API integration containing details about how Snowflake should interact with the repository API.

        For reference information about API integrations, see [CREATE API INTEGRATION](create-api-integration.md).

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Specifies a comment.

        Default: No value

`UNSET ...`
:   Specifies the property to unset for the integration, which resets it to the default value:

    * `GIT_CREDENTIALS`
    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

`FETCH`
:   Fetches content from the remote Git repository to the Git repository clone.

    The content fetched is a full clone that fetches all branches, tags, and commits from the remote repository. The command also prunes
    branches and commits that were fetched earlier but no longer exist in the remote repository.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or WRITE | Git repository | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Examples

The following example refreshes the `snowflake_extensions` [Git repository clone](../../developer-guide/git/git-overview.md) with
data from its remote Git origin:

```sqlexample
ALTER GIT REPOSITORY snowflake_extensions FETCH;
```

---
title: ALTER ICEBERG TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-iceberg-table.md
section: SQL Commands
---

# ALTER ICEBERG TABLE

Modifies properties such as clustering options and tags for an existing [Apache Iceberg™ table](../../user-guide/tables-iceberg.md).

> **Note:**
>
> To replace the catalog integration for an externally managed Iceberg table in a standard Snowflake database with a different
> catalog integration, see [SYSTEM$SET_CATALOG_INTEGRATION](../functions/system_set_catalog_integration.md).

You can also use an ALTER ICEBERG TABLE statement to refresh a table, convert a table, or alter a structured type column. The syntax for those operations varies
considerably. To view the syntax, parameter descriptions, usage notes, and examples for refreshing or converting an Iceberg table,
see the following pages:

* [ALTER ICEBERG TABLE … REFRESH](alter-iceberg-table-refresh.md)
* [ALTER ICEBERG TABLE … CONVERT TO MANAGED](alter-iceberg-table-convert-to-managed.md)
* [ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)](alter-iceberg-table-alter-column-set-data-type.md)

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

```sqlsyntax
ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> { clusteringAction | tableColumnAction }

ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> SET
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ CATALOG_SYNC = '<snowflake_open_catalog_integration_name>']
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ TARGET_FILE_SIZE = { AUTO | 16MB | 32MB | 64MB | 128MB } ]
  [ CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  [ LOG_EVENT_LEVEL = { ERROR | WARN | DEBUG } ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]

ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> UNSET
  [ REPLACE_INVALID_CHARACTERS ]
  [ LOG_EVENT_LEVEL ]
  [ ERROR_LOGGING ]
  [ ENABLE_DATA_COMPACTION ]
  [ ENABLE_ICEBERG_MERGE_ON_READ ]

ALTER ICEBERG TABLE [ IF EXISTS ] dataGovnPolicyTagAction

ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> searchOptimizationAction
```

Where:

> ```sqlsyntax
> clusteringAction ::=
>   {
>      CLUSTER BY ( <expr> [ , <expr> , ... ] )
>      /* { SUSPEND | RESUME } RECLUSTER is valid action */
>    | { SUSPEND | RESUME } RECLUSTER
>    | DROP CLUSTERING KEY
>   }
> ```
>
> ```sqlsyntax
> tableColumnAction ::=
>   {
>      ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type> [ DEFAULT <col_default> ]
>         [ inlineConstraint ]
>         [ COLLATE '<collation_specification>' ]
>
>    | RENAME COLUMN <col_name> TO <new_col_name>
>
>    | ALTER | MODIFY [ ( ]
>                           , [ COLUMN ] <col1_name> { [ SET ] NOT NULL | DROP NOT NULL }
>                           , [ COLUMN ] <col1_name> [ [ SET DATA ] TYPE ] <type>
>                           , [ COLUMN ] <col1_name> COMMENT '<string>'
>                           , [ COLUMN ] <col1_name> UNSET COMMENT
>                           , [ COLUMN ] <col1_name> SET WRITE DEFAULT <col_write_default>
>                           , [ COLUMN ] <col1_name> DROP WRITE DEFAULT
>                         [ , [ COLUMN ] <col2_name> ... ]
>                         [ , ... ]
>                     [ ) ]
>
>    | DROP [ COLUMN ] [ IF EXISTS ] <col1_name> [, <col2_name> ... ]
>   }
>
>   inlineConstraint ::=
>     [ NOT NULL ]
>     [ CONSTRAINT <constraint_name> ]
>     {
>         UNIQUE
>       | PRIMARY KEY
>       | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>       | CHECK ( <expr> )
>     }
>     [ <constraint_properties> ]
> ```
>
> ```sqlsyntax
> dataGovnPolicyTagAction ::=
>   {
>       SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>     | UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
>   |
>   {
>       ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ROW ACCESS POLICY <policy_name>
>     | DROP ROW ACCESS POLICY <policy_name> ,
>         ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ALL ROW ACCESS POLICIES
>   }
>   |
>   {
>       SET AGGREGATION POLICY <policy_name>
>         [ ENTITY KEY ( <col_name> [, ... ] ) ]
>         [ FORCE ]
>     | UNSET AGGREGATION POLICY
>   }
>   |
>   {
>       SET JOIN POLICY <policy_name>
>         [ FORCE ]
>     | UNSET JOIN POLICY
>   }
>   |
>   ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type>
>     [ [ WITH ] MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] ]
>     [ [ WITH ] PROJECTION POLICY <policy_name> ]
>     [ [ WITH ] TAG ( <tag_name> = '<tag_value>'
>           [ , <tag_name> = '<tag_value>' , ... ] ) ]
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] [ FORCE ]
>       | UNSET MASKING POLICY
>   }
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET PROJECTION POLICY <policy_name>
>           [ FORCE ]
>       | UNSET PROJECTION POLICY
>   }
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> SET TAG
>       <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>       , [ COLUMN ] <col2_name> SET TAG
>           <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
>                    , [ COLUMN ] <col2_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
> ```
>
> ```sqlsyntax
> searchOptimizationAction ::=
>   {
>      ADD SEARCH OPTIMIZATION [
>        ON <search_method_with_target> [ , <search_method_with_target> ... ]
>      ]
>
>    | DROP SEARCH OPTIMIZATION [
>        ON { <search_method_with_target> | <column_name> | <expression_id> }
>           [ , ... ]
>      ]
>   }
> ```
>
> For details, see Search optimization actions (searchOptimizationAction).

## Parameters

`table_name`
:   Identifier for the table to modify.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the external table (separated by blank spaces, commas, or new lines):

    `REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
    :   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results.
        You can only set this parameter for tables that use an external Iceberg catalog.

        * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
        * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
          characters in a Parquet data file.

        If not specified, the Iceberg table defaults to the parameter value for the schema, database, or account.
        The schema takes precedence over the database, and the database takes precedence over the account.

        Default: `FALSE`

    `CATALOG_SYNC = 'snowflake_open_catalog_integration_name'`
    :   Specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview). Snowflake syncs
        the table with an external catalog in your Snowflake Open Catalog account. For more information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

        For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

    `DATA_RETENTION_TIME_IN_DAYS = integer`
    :   Specifies the retention period for a Snowflake-managed table so that Time Travel actions (SELECT, CLONE, UNDROP) can be performed on historical
        data in the table. For more information, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

        For a detailed description of this object-level parameter, as well as more information about object parameters, see
        [Parameters](../parameters.md).

        Values:

        > * Standard Edition: `0` or `1`
        > * Enterprise Edition: `0` to `90` for permanent tables

        Default:

        > * Standard Edition: `1`
        > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the schema, database, or account level)

        > **Note:**
        >
        > A value of `0` effectively disables Time Travel for the table.

    `AUTO_REFRESH = { TRUE | FALSE }`
    :   Specifies whether Snowflake should automatically poll the external Iceberg catalog that is associated with the table for metadata updates.

        If no value is specified for the `REFRESH_INTERVAL_SECONDS` parameter on the catalog integration, Snowflake uses a default
        refresh interval of 30 seconds.

        For more information, see [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

        Default: FALSE

        > > **Note:**
        > >
        > > Using AUTO_REFRESH with INFER_SCHEMA isn’t supported.

    `TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }'`
    :   Specifies a target Parquet file size for the table.

        * `'{ 16MB | 32MB | 64MB | 128MB }'` specifies a fixed target file size for the table.
        * `'AUTO'` works differently, depending on the table type:

          + Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics
            such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically
            adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake. Use this option to optimize table performance
            in Snowflake.
          + Externally managed tables: AUTO specifies that Snowflake should aggressively scale to the largest file size (128 MB).

        For more information, see [Set a target file size](../../user-guide/tables-iceberg-manage.md).

        Default: AUTO

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `LOG_EVENT_LEVEL = { ERROR | WARN | DEBUG }`
    :   Specifies the severity level of automated refresh log events that should be ingested and made available in the active event table.
        The [LOG_EVENT_LEVEL](../parameters.md) parameter determines which events to capture based on the following values:

        * `ERROR`: Events that signal a change requiring human intervention to resolve.
        * `WARN`: Events that signal an issue that can be resolved without human intervention.
        * `DEBUG`: High-volume events.

        > **Note:**
        >
        > There is no default severity level. To capture events, you must set the severity level at either the
        > Iceberg table level or account level.

        For more information, see [Monitor automated refresh events](../../user-guide/tables-iceberg-auto-refresh.md).

`ERROR_LOGGING = { TRUE | FALSE }`
:   Specifies whether to turn on DML error logging for the table.

    * `TRUE` turns on DML error logging for the table.
    * `FALSE` turns off DML error logging for the table.

    For more information, see [DML error logging](../../user-guide/data-load-overview.md).

    > **Note:**
    >
    > If the [OPT_OUT_ERROR_LOGGING](../parameters.md) parameter is set to `TRUE` for a session,
    > DML error logging isn’t turned on, regardless of whether it is turned on for specific tables.

`ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
:   Specifies whether Snowflake should enable data compaction on the table. You can only set this parameter for Snowflake-managed tables.

    > * `TRUE`: Snowflake performs data compaction on the table.
    > * `FALSE`: Snowflake doesn’t perform data compaction on the table.
    >
    > Default: `TRUE`
    >
    > For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

    `ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies whether the table uses merge-on-read behavior.

        If you don’t set this parameter, the Iceberg table defaults to the merge-on-read behavior that is specified for the schema, database,
        or account. The schema takes precedence over the database, and the database takes precedence over the account.

        Values:

        `TRUE`: The table uses merge-on-read behavior. Depending on whether the table conforms to v2 or v3 of the
        Apache Iceberg™ table specification, the behavior is as described in the following list:

        * If the table conforms with v2, use positional delete files.
        * If the table conforms with v3, use deletion vectors.

        `FALSE`: The table uses copy-on-write behavior.

        Default: `TRUE`

        For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md).

`UNSET`
:   Currently, you can only unset the following parameters with this command:

    > * `REPLACE_INVALID_CHARACTERS`
    > * `CATALOG_SYNC`
    > * `LOG_EVENT_LEVEL`
    > * `ENABLE_DATA_COMPACTION`
    > * `ENABLE_ICEBERG_MERGE_ON_READ`

## Clustering actions (`clusteringAction`)

> **Note:**
>
> Clustering is only supported for tables that use Snowflake as the Iceberg catalog.

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies (or modifies) one or more table columns or column expressions as the clustering key for the table. These are the
    columns/expressions for which clustering is maintained by [Automatic Clustering](../../user-guide/tables-auto-reclustering.md).

    To learn more about clustering, see [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

`SUSPEND | RESUME RECLUSTER`
:   Enables or disables [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) for the table.

`DROP CLUSTERING KEY`
:   Drops the clustering key for the table.

For more information about clustering keys and reclustering, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Table column actions (`tableColumnAction`)

`ADD [ COLUMN ] [ IF NOT EXISTS ] col_name col_data_type [ DEFAULT col_default ]` . `[ inlineConstraint ]` `[ COLLATE 'collation_specification' ] [ , ... ]`
:   > Adds a new column. You can specify a default value, an inline constraint, and a [collation specification](../collation.md).
    >
    > For additional details about table column actions, see:
    >
    > * [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md)
    > * [CREATE MASKING POLICY](create-masking-policy.md)
    > * [CREATE TAG](create-tag.md)
    >
    > You can perform ADD COLUMN operations on multiple columns in the same command.
    >
    > If you aren’t sure whether the column already exists, you can specify IF NOT EXISTS when adding the column. If the column already
    > exists, ADD COLUMN has no effect on the existing column and doesn’t result in an error.
    >
    > > **Note:**
    > >
    > > You can’t specify IF NOT EXISTS if you are also specifying any of the following for the new column:
    > >
    > > * AUTOINCREMENT, or IDENTITY
    > > * UNIQUE, PRIMARY KEY, or FOREIGN KEY

    `DEFAULT col_default`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        For Iceberg version 3 (v3) tables only, specifies the default value for the column. If the data type for the
        column is string, you must surround the default value with single quotes.

        The value you specify is used as both the initial default and write default for the column. To change the write default for the column,
        use ALTER ICEBERG TABLE … ALTER COLUMN … SET WRITE DEFAULT.

        > **Important:**
        >
        > When you specify a default value for a column, you must specify a static value; you can’t specify an expression or
        > function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default
        > and write default.

        For more information about using default values with Iceberg tables, see [Use default values with Iceberg tables](../../user-guide/tables-iceberg-manage.md).

`RENAME COLUMN col_name to new_col_name`
:   Renames the specified column to a new name that’s not currently used for any other columns in the table.

    You can’t rename a column that’s part of a clustering key.

    When you rename an object, such as a table or column, you must update other objects that reference it with the new name.

`{ ALTER | MODIFY } COLUMN col_name ...`
:   > Modifies the properties for a column.

    `SET WRITE DEFAULT col_default`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        For Iceberg version 3 (v3) tables only, specifies the write default value for the column. If the data type for the
        column is string, you must surround the default value with single quotes.

        If the column already has a write default, you can use this setting to change the write default.

        > **Important:**
        >
        > When you specify a default value for a column, you must specify a static value; you can’t specify an expression or
        > function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default
        > and write default.

        For more information about using default values with Iceberg tables, see [Use default values with Iceberg tables](../../user-guide/tables-iceberg-manage.md).

    `DROP WRITE DEFAULT`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        For Iceberg version 3 (v3) tables only, drops the write default value for the column.

        For more information about using default values with Iceberg tables, see [Use default values with Iceberg tables](../../user-guide/tables-iceberg-manage.md).

`DROP COLUMN [ IF EXISTS ] col_name [ CASCADE | RESTRICT ]`
:   Removes the specified column from the table.

    If you aren’t sure whether the column already exists, you can specify IF EXISTS when dropping the column. If the column doesn’t
    exist, DROP COLUMN has no effect and doesn’t result in an error.

    Dropping a column is a metadata-only operation. It doesn’t immediately re-write the micro-partitions and
    therefore doesn’t immediately free up the space used by the column. Typically, the space within an individual
    micro-partition is freed the next time that the micro-partition is re-written, which is typically when a write is
    done either due to DML (INSERT, UPDATE, DELETE) or re-clustering.

## Data Governance policy and tag actions (`dataGovnPolicyTagAction`)

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`policy_name`
:   Identifier for the policy; must be unique for your schema.

The following clauses apply to all table kinds that support row access policies, such as but not limited to tables, views, and event tables.
To simplify, the clauses just refer to “table.”

> `ADD ROW ACCESS POLICY policy_name ON (col_name [ , ... ])`
> :   Adds a row access policy to the table.
>
>     At least one column name must be specified. Additional columns can be specified with a comma separating each column name. Use this
>     expression to add a row access policy to both an event table and an external table.
>
> `DROP ROW ACCESS POLICY policy_name`
> :   Drops a row access policy from the table.
>
>     Use this clause to drop the policy from the table.
>
> `DROP ROW ACCESS POLICY policy_name, ADD ROW ACCESS POLICY policy_name ON ( col_name [ , ... ] )`
> :   Drops the row access policy that is set on the table and adds a row access policy to the same table in a single SQL statement.
>
> `DROP ALL ROW ACCESS POLICIES`
> :   Drops all [row access policy](../../user-guide/security-row-using.md) associations from the table.
>
>     This expression is helpful when a row access policy is dropped from a schema before dropping the policy from an event table. Use this expression to drop row access policy associations from the table.
>
>     Suppose that a row access policy applied to the table when the backup was created, and the policy was later dropped. After you
>     restore the table from a [backup](../../user-guide/backups.md), you can’t query it until you run an ALTER TABLE command with the
>     DROP ALL ROW ACCESS POLICIES clause.
>
> `SET AGGREGATION POLICY policy_name`
> :   `[ ENTITY KEY (col_name [ , ... ]) ] [ FORCE ]`
>     :   Assigns an [aggregation policy](../../user-guide/aggregation-policies.md) to the table.
>
>         Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
>         [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).
>
>         Use the optional FORCE parameter to atomically replace an existing aggregation policy with the new aggregation policy.
>
> `UNSET AGGREGATION POLICY`
> :   Detaches an aggregation policy from the table.
>
> `SET JOIN POLICY policy_name`
> :   `[ FORCE ]`
>     :   Assigns a [join policy](../../user-guide/join-policies.md) to the table.
>
>         Use the optional FORCE parameter to atomically replace an existing join policy with the new join policy.
>
> `UNSET JOIN POLICY`
> :   Detaches a join policy from the table.

`{ ALTER | MODIFY } [ COLUMN ] ...`
:   `USING ( col_name , cond_col_1 ... )`
    :   Specifies the arguments to pass into the conditional masking policy SQL expression.

        The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
        column to which the masking policy is set.

        The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query
        result when a query is made on the first column.

        If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
        [masking policy](../../user-guide/security-column-intro.md).

    `FORCE`
    :   Replaces a masking or projection policy that is currently set on a column with a different policy in a single statement.

        Note that using the `FORCE` keyword with a masking policy requires the [data type](../../sql-reference-data-types.md) of the policy
        in the ALTER TABLE statement (i.e. STRING) to match the data type of the masking policy currently set on the column (i.e. STRING).

        If a masking policy is not currently set on the column, specifying this keyword has no effect.

        For details, see: [Replace a masking policy on a column](../../user-guide/security-column-intro.md) or [Replace a projection policy](../../user-guide/projection-policies.md).

## Search optimization actions (`searchOptimizationAction`)

`ADD SEARCH OPTIMIZATION`
:   Adds [search optimization](../../user-guide/search-optimization-service.md) for the entire table or, if you specify the optional
    ON clause, for specific columns.

    > **Note:**
    >
    > Search optimization can be expensive to maintain, especially if the data in the table changes frequently.
    > For more information, see [Search optimization cost estimation and management](../../user-guide/search-optimization/cost-estimation.md).

`ON search_method_with_target [, search_method_with_target ... ]`
:   Specifies that you want to configure search optimization for specific columns (instead of the entire table).

    For `search_method_with_target`, use an expression with the following syntax:

    ```sqlsyntax
    <search_method>( <target> [ , <target> , ... ] [ , ANALYZER => '<analyzer_name>' ] )
    ```

    Where:

    * `search_method` specifies one of the following methods that optimizes queries for a particular type of predicate:

      | Search method | Description |
      | --- | --- |
      | `FULL_TEXT` | Predicates that use VARCHAR (text) types. |
      | `EQUALITY` | Equality and IN predicates. |
      | `SUBSTRING` | Predicates that match substrings and regular expressions (for example, [[ NOT ] LIKE](../functions/like.md), [[ NOT ] ILIKE](../functions/ilike.md), [[ NOT ] RLIKE](../functions/rlike.md), and [REGEXP_LIKE](../functions/regexp_like.md)). |
    * `target` specifies the column or an asterisk (\*).

      Depending on the value of `search_method`, you can specify a column of one of the following types:

      | Search method | Supported targets |
      | --- | --- |
      | `FULL_TEXT` | Columns of VARCHAR (text) data types. |
      | `EQUALITY` | Columns of numerical, string, and binary data types. |
      | `SUBSTRING` | Columns of VARCHAR (text) data types. |

      To specify all applicable columns in the table as targets, use an asterisk (`*`).

      Note that you can’t specify both an asterisk and specific column names for a given search method. However, you can
      specify an asterisk in different search methods.

      For example, you can specify the following expressions:

      ```sqlexample
      -- Allowed
      ON SUBSTRING(*)
      ON EQUALITY(*), SUBSTRING(*)
      ```

      You can’t specify the following expressions:

      ```sqlexample
      -- Not allowed
      ON EQUALITY(*, c1)
      ON EQUALITY(c1, *)
      ON EQUALITY(v1:path, *)
      ON EQUALITY(c1), EQUALITY(*)
      ```

    * `ANALYZER => 'analyzer_name'` specifies the name of the text analyzer, if `search_method`
      is `FULL_TEXT`.

      For more information about search optimization analyzers, see [ALTER TABLE](alter-table.md).

    To specify more than one search method on a target, use a comma to separate each subsequent method and target:

    ```sqlexample
    ALTER ICEBERG TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1), EQUALITY(c2, c3);
    ```

    If you run the ALTER ICEBERG TABLE … ADD SEARCH OPTIMIZATION ON … command multiple times on the same table, each subsequent command
    adds to the existing configuration for the table. For example, suppose that you run the following commands:

    ```sqlexample
    ALTER ICEBERG TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2);
    ALTER ICEBERG TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c3, c4);
    ```

    This adds equality predicates for the columns c1, c2, c3, and c4 to the configuration for the table. This is equivalent to
    running the command:

    ```sqlexample
    ALTER ICEBERG TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2, c3, c4);
    ```

    For examples, see [Enabling search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

`DROP SEARCH OPTIMIZATION`
:   Removes [search optimization](../../user-guide/search-optimization-service.md) for the entire table or, if you specify the
    optional ON clause, from specific columns.

    > **Note:**
    >
    > * If a table has the search optimization property, then dropping the table and undropping it preserves the
    >   search optimization property.
    > * Removing the search optimization property from a table and then adding it back incurs the same cost as adding it the first
    >   time.

`ON search_method_with_target | column_name | expression_id [ , ... ]`
:   Specifies that you want to drop the search optimization configuration for specific columns (instead of
    dropping search optimization for the entire table).

    To identify the column configuration to drop, specify one of the following:

    * For `search_method_with_target`, specify a method for optimizing queries for one or more specific columns. Use the
      syntax described earlier.
    * For `column_name`, specify the name of the column configured for search optimization. Specifying the column name drops
      all expressions for that column.
    * For `expression_id`, specify the ID for an expression listed in the output of the
      [DESCRIBE SEARCH OPTIMIZATION](../../user-guide/search-optimization/enabling.md) command.

    To specify more than one of these, use a comma between items.

    You can specify any combination of search methods with targets, column names, and expression IDs.

    For examples, see [Dropping search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Iceberg table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | External volume |  |
| USAGE | Catalog integration | Required if the table uses a catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only the table owner (that is, the role with the OWNERSHIP privilege on the table) or higher can execute this command.
* Clustering is only supported for tables that use Snowflake as the Iceberg catalog. To add clustering to an Iceberg table,
  you must also have the USAGE or OWNERSHIP privileges on the schema and database that contain the table.
* For tables in a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md), the following features aren’t supported:

  + Setting the NOT NULL, COMMENT, and DATA TYPE properties for an existing column.
  + Setting column constraints.
  + Clustering.
* You can use data metric functions with Iceberg tables by executing an [ALTER TABLE](alter-table.md) command. For more information, see
  [Use SQL to set up data metric functions](../../user-guide/data-quality-working.md).
* For more information about using search optimization with Iceberg tables, including limitations, see
  [Support for Apache Iceberg™ tables](../../user-guide/search-optimization/queries-that-benefit.md) in the search optimization documentation.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* To troubleshooting issues with altering the CATALOG_SYNC parameter, see [You can’t alter an Iceberg table when specifying the CATALOG_SYNC parameter](../../user-guide/tables-iceberg-open-catalog-troubleshooting.md)
* You can’t use this command to modify the PATH_LAYOUT property for an existing table.

## Examples

The following example sets a tag (`my_tag`) with a value of `customer` on an Iceberg table.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table SET TAG my_tag = 'customer';
```

The following example enables [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md) for an existing externally managed table:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table SET AUTO_REFRESH = TRUE;
```

The following examples add and drop search optimization for an Iceberg table:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ADD SEARCH OPTIMIZATION ON SUBSTRING(C6);

ALTER ICEBERG TABLE my_iceberg_table DROP SEARCH OPTIMIZATION ON EQUALITY(C7, C8);
```

---
title: ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-iceberg-table-alter-column-set-data-type.md
section: SQL Commands
---

# ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)

> **Note:**
>
> This variant of the syntax is not supported for Iceberg tables that use an external catalog.

Modifies (evolves) a [structured type](../data-types-structured.md)
column in a Snowflake-managed [Apache Iceberg™ table](../../user-guide/tables-iceberg.md).

With this command, you can modify structured types in an Iceberg table column. You can
either rename a key in a structured OBJECT or perform a combination of the following changes:

* Evolving the type of a field within a structured type.
* Reordering keys in a structured OBJECT.
* Adding keys to a structured OBJECT.
* Dropping keys from a structured OBJECT.

You can’t combine renaming a key with any other modifications.

For brevity, this topic refers to Iceberg tables as just “tables” except when making a distinction between
Iceberg tables and regular Snowflake tables.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

**Modify a structured type column**

```sqlsyntax
ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> ALTER COLUMN <structured_column>
  SET DATA TYPE <new_structured_type>
```

**Rename keys in a structured OBJECT**

```sqlsyntax
ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> ALTER COLUMN <structured_column>
  SET DATA TYPE <new_structured_type>
  RENAME FIELDS
```

## Parameters

`table_name`
:   Identifier for the table to modify.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ALTER COLUMN structured_column`
:   Specifies the structured type column to modify.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET DATA TYPE new_structured_type`
:   A full specification for the new structured type to use for the column. For example, to specify a structured ARRAY of NUMBER elements,
    use ARRAY(NUMBER).

    For more information, see [Specifying a structured type](../data-types-structured.md) and the examples on this page.

`RENAME FIELDS`
:   Specifies that the command should rename one or more keys in a structured OBJECT.
    The old and new keys must differ only in name, and must have exactly the same hierarchy and data types. Renaming keys doesn’t change
    the field IDs.

    Renaming keys can’t be combined with any other modifications to structured types in an Iceberg table.

    See the RENAME FIELDS example.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Iceberg table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | External volume |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This command doesn’t support the following actions:

  + Evolving a structured type into a non-structured type (or the other way around).
  + Setting a null constraint on a structured ARRAY element or on the key-value pairs of a structured MAP.
  + Using RENAME FIELDS to rename a key that is part of the clustering key for the table.
  + Altering the NULL constraint for a structured OBJECT.
  + Altering a table in a catalog-linked database.
* For tables that use data access policies,
  make sure the new data type for a column is compatible with the argument type of your data access policy. Otherwise, querying the table
  might fail. For example, if you add a key to a structured OBJECT column, you must alter your policy or
  create a new policy and apply it to your table.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Evolving types

You can evolve the type for a field in a structured type.
Evolving the type means widening it into a larger, related Iceberg data type.

Snowflake supports the following type evolutions, in accordance with the
[Apache Iceberg spec](https://iceberg.apache.org/spec/#schema-evolution):

* Changing a field of type `int` into type `long`.
* Changing a field of type `float` into type `double`.
* Changing a field of type `decimal(p,s)` into type `decimal(p',s)` where `p` is smaller than `p'`.

To evolve a field type, use the [Snowflake syntax for specifying a structured type](../data-types-structured.md).
You can use the Iceberg data type in your specification.
For example, the following statement changes the element type in a structured ARRAY column to (Iceberg) type `long`.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN col1
  SET DATA TYPE ARRAY(long);
```

For information about how Iceberg data types map to Snowflake data types, see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).

### Reordering keys

To rearrange the order of keys in a structured OBJECT, specify a new order in your
ALTER ICEBERG TABLE statement. Rearranging the key order does not affect the data in the OBJECT.

For example, consider the following CREATE ICEBERG TABLE statement.
The table has one column (`column_1`) of type OBJECT with two keys in a specified order:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (
  column_1 OBJECT(
      key_a int,
      key_b int
    )
  )
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = '';
```

The following command changes the order of the keys so that `key_b` comes before `key_a`:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN column_1
  SET DATA TYPE OBJECT(
    key_b int,
    key_a int
  );
```

### Adding keys

You can add keys to a structured OBJECT.
A new key can use any of the [data types supported for Iceberg tables](../../user-guide/tables-iceberg-data-types.md) for its value.

> **Note:**
>
> You can’t set a null constraint when you add a key, because Snowflake sets the value of the key to NULL for
> all existing rows in the table.

For example, consider the following CREATE ICEBERG TABLE statement.
The table has one column (`column_1`) of type OBJECT with one key (`key_1`):

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (
  column_1 OBJECT(
      key_1 int
    )
  )
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = '';
```

The following command adds a key named `key_2` to `column_1`:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN column_1
  SET DATA TYPE OBJECT(
    key_1 int,
    key_2 int
  );
```

### Dropping keys

> **Note:**
>
> Dropping a key whose value is a structured data type that belongs to a clustering key isn’t supported.

To drop a key from a structured OBJECT, use the ALTER ICEBERG TABLE … ALTER COLUMN command
to redefine the OBJECT.

Dropping the key removes the key and its value from all rows in the table.

For example, consider the following CREATE ICEBERG TABLE statement.
The table has one column (`column_1`) of type OBJECT with two keys:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (
  column_1 OBJECT(
      key_1 int,
      key_2 ARRAY(string)
    )
  )
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = '';
```

The following command drops the key named `key_2` by omitting it from the OBJECT specification:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN column_1
  SET DATA TYPE OBJECT(
    key_1 int
  );
```

### Renaming keys

To change the key names in a structured OBJECT, use the RENAME FIELDS keywords.

For example, consider the following CREATE ICEBERG TABLE statement:

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (
  column_1 OBJECT(
      key_1 int,
      key_2 int
    )
  )
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = '';
```

The following command uses RENAME FIELDS to rename the keys in `column_1`:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table ALTER COLUMN column_1
  SET DATA TYPE OBJECT(
    k_1 int,
    k_2 int
  )
  RENAME FIELDS;
```

---
title: ALTER ICEBERG TABLE … CONVERT TO MANAGED
source: https://docs.snowflake.com/en/sql-reference/sql/alter-iceberg-table-convert-to-managed.md
section: SQL Commands
---

# ALTER ICEBERG TABLE … CONVERT TO MANAGED

Converts an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) that uses
an external Iceberg catalog into a table that uses Snowflake as the catalog (a Snowflake-managed Iceberg table).

The converted table supports both read and write operations,
and Snowflake handles all life-cycle maintenance, such as compaction, for the table.
For more information, see [Before and after table conversion](../../user-guide/tables-iceberg-conversion.md).

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

```sqlsyntax
ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> CONVERT TO MANAGED
  [ BASE_LOCATION = '<directory_for_table_files>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
```

## Parameters

`table_name`
:   Identifier for the table to convert.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`[ BASE_LOCATION = 'directory_for_table_files' ]`
:   The path to a directory where Snowflake can write data and metadata files for the table.
    Specify a relative path from the table’s `EXTERNAL_VOLUME` location.
    For more information, see [Data and metadata directories](../../user-guide/tables-iceberg-storage.md).

    You must specify a value for this property if the original CREATE ICEBERG TABLE statement did not allow or include a
    `BASE_LOCATION`.

    This directory can’t be changed after you convert a table.

`STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
:   Specifies the storage serialization policy for the table.
    If not specified during conversion, the table inherits the value set at the schema, database, or account level. If the value isn’t
    specified at any level, the table uses the default value.

    You can’t change the value of this parameter after you convert a table.

    * `COMPATIBLE`: Snowflake performs encoding and compression that ensures interoperability with third-party compute engines.
    * `OPTIMIZED`: Snowflake performs encoding and compression that ensures the best table performance within Snowflake.

    Default: `OPTIMIZED`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Iceberg table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | External volume |  |
| USAGE | Catalog integration |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only the table owner (that is, the role with the OWNERSHIP privilege on the table) or higher can execute this command.
* Converting a table in a catalog-linked database isn’t supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example uses the ALTER ICEBERG TABLE … CONVERT TO MANAGED statement to
convert a table that Snowflake doesn’t manage into a table that uses Snowflake as the Iceberg catalog.

```sqlexample
ALTER ICEBERG TABLE myTable CONVERT TO MANAGED
  BASE_LOCATION = 'my/relative/path/from/external_volume';
```

---
title: ALTER ICEBERG TABLE … REFRESH
source: https://docs.snowflake.com/en/sql-reference/sql/alter-iceberg-table-refresh.md
section: SQL Commands
---

# ALTER ICEBERG TABLE … REFRESH

Refreshes the metadata for an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) that uses an external Iceberg catalog.
Refreshing an Iceberg table synchronizes the table metadata with the most recent table changes.

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

```sqlsyntax
ALTER ICEBERG TABLE [ IF EXISTS ] <table_name> REFRESH [ '<metadata_file_relative_path>' ]
```

## Parameters

`table_name`
:   Identifier for the table to refresh.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`'metadata_file_relative_path'`
:   Specifies a metadata file path for a table created from *Iceberg* files in object storage. The path must be relative to the
    [active storage location](../../user-guide/tables-iceberg-storage.md) of the external volume associated with the table.

    The following table shows what value to specify based on an example storage location:

    |  |  |
    | --- | --- |
    | **Active storage location for the table’s external volume** | `s3://mybucket_us_east_1` |
    | **Full path to the metadata file** | `s3://mybucket_us_east_1/metadata/v1.metadata.json` |
    | **Value to specify as the** `'metadata_file_relative_path'` | `metadata/v1.metadata.json` (without a leading forward slash) |

    > **Note:**
    >
    > * If the table uses AWS Glue as the catalog, or is created from Delta table files, don’t specify a metadata file path.
    > * Omit the leading forward slash (`/`) in the metadata file path.
    > * Before Snowflake version 7.34,
    >   a parameter named `BASE_LOCATION` (also called `FILE_PATH` in previous versions) was required to create a table
    >   from Iceberg files in object storage. The parameter specified a relative path from the `EXTERNAL_VOLUME`
    >   location.
    >
    >   To refresh a table that you created using the old syntax, specify a path relative to the `BASE_LOCATION`. For example,
    >   if the full path to your metadata file is `s3://mybucket_us_east_1/my_base_location/metadata/v1.metadata.json`,
    >   specify `metadata/v1.metadata.json` as the `metadata-file-relative-path`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Iceberg table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | External volume | Not required if using [vended credentials](../../user-guide/tables-iceberg-configure-catalog-integration-vended-credentials.md). |
| USAGE | Catalog integration |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only the table owner (that is, the role with the OWNERSHIP privilege on the table) or higher can execute this command.
* Using the ALTER ICEBERG TABLE … REFRESH command in transactions (implicit or explicit) is not supported.
* Snowflake processes a maximum of 1000 Delta commit files each time you refresh a table using CREATE/ALTER … REFRESH.
  If your table has over 1000 commit files, you can do additional manual refreshes.
  Each time, the refresh process continues from where the last one stopped.

  > **Note:**
  >
  > Snowflake uses Delta checkpoint files when creating an Iceberg table.
  > The 1,000 commit file limit only applies to commits after the latest checkpoint.
  >
  > When you refresh an existing table, Snowflake processes Delta commit files, but not checkpoint files. If table maintenance removes stale log and data files for the source
  > Delta table, you should refresh Delta-based
  > Iceberg tables in Snowflake more frequently than the retention period of Delta logs and data files.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Refresh a table

This example manually refreshes the metadata for a table for the following scenarios:

* The table uses AWS Glue for the Iceberg catalog.
* The table is based on Delta table files in object storage.

For these scenarios, you don’t specify a metadata file path in the refresh command.

```sqlexample
ALTER ICEBERG TABLE myIcebergTable REFRESH;
```

### Refresh a table created from Iceberg files in object storage

This example manually refreshes the table metadata based on changes in a new metadata file. In this example, the full path
to the metadata file is `<external-volume-storage-base-url>/path/to/metadata/v2.metadata.json`.

When specifying a metadata file, you don’t include a leading forward slash (`/`) in the metadata file path.

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table REFRESH 'path/to/metadata/v2.metadata.json';
```

> **Note:**
>
> Before Snowflake version 7.34,
> a parameter named `BASE_LOCATION` (also called `FILE_PATH` in previous versions) was required to create a table
> from Iceberg files in object storage. The parameter specified a relative path from the `EXTERNAL_VOLUME`
> location.
>
> To refresh a table that you created using the old syntax, specify a path relative to the `BASE_LOCATION`. For example,
> if the full path to your metadata file is `s3://mybucket_us_east_1/my_base_location/metadata/v1.metadata.json`,
> specify `metadata/v1.metadata.json` as the `metadata-file-relative-path`.

---
title: ALTER INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-integration.md
section: SQL Commands
---

# ALTER INTEGRATION

Modifies the properties for an existing integration.

See also:
:   [CREATE INTEGRATION](create-integration.md), [DROP INTEGRATION](drop-integration.md), [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER <integration_type> INTEGRATION <object_name> <actions>
```

Where `actions` are specific to the object type.

For specific syntax, usage notes, and examples, see:

* [ALTER API INTEGRATION](alter-api-integration.md)
* [ALTER CATALOG INTEGRATION](alter-catalog-integration.md)
* [ALTER EXTERNAL ACCESS INTEGRATION](alter-external-access-integration.md)
* [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md)
* [ALTER SECURITY INTEGRATION](alter-security-integration.md)
* [ALTER STORAGE INTEGRATION](alter-storage-integration.md)

---
title: ALTER JOIN POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-join-policy.md
section: SQL Commands
---

# ALTER JOIN POLICY

Replaces the existing rules or comment for a [join policy](../../user-guide/join-policies.md). Also allows you to rename a join policy.

See also:
:   [Join policy DDL reference](../../user-guide/join-policies.md)

## Syntax

```sqlsyntax
ALTER JOIN POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER JOIN POLICY [ IF EXISTS ] <name> SET BODY -> <expression>

ALTER JOIN POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER JOIN POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER JOIN POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER JOIN POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the join policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the join policy; must be unique for your schema. The new identifier cannot be used if the
    identifier is already in place for a different join policy.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the join policy:

    `BODY -> expression`
    :   SQL expression that determines the restrictions of a join policy.

        To define the body of the join policy, call the JOIN_CONSTRAINT function, which returns TRUE or FALSE.
        When the function returns TRUE, queries are required to use a join to return results.

        The syntax of the JOIN_CONSTRAINT function is:

        ```sqlsyntax
        JOIN_CONSTRAINT (
          { JOIN_REQUIRED => <boolean_expression> }
          )
        ```

        Where:

        `JOIN_REQUIRED => boolean_expression`
        :   Specifies whether a join is required in queries when data is selected from tables or views that have
            the join policy assigned to them.

        The body of a policy cannot reference user-defined functions, tables, or views.

        Allowed join columns are specified in the CREATE or ALTER statement for the table or view to which the
        policy is applied, not in the CREATE JOIN POLICY statement.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the join policy.

        Default: No value

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset, by resetting them to their defaults, for the join policy:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Join policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about join policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* If you want to update an existing join policy and need to see the current body of the policy, run the
  [DESCRIBE JOIN POLICY](desc-join-policy.md) command. You can also use the [GET_DDL](../functions/get_ddl.md) function to obtain the full definition of the join policy, including its body.
* Moving a join policy to a [managed access schema](../../user-guide/security-access-control-configure.md)
  (using the ALTER JOIN POLICY … RENAME TO syntax) is prohibited unless the join policy owner
  (that is, the role that has the OWNERSHIP privilege on the join policy) also owns the target schema.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Modify the SQL expression for a join policy:

```sqlexample
ALTER JOIN POLICY jp3 SET BODY -> JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE);
```

Rename a join policy:

```sqlexample
ALTER JOIN POLICY my_join_policy RENAME TO my_join_policy_2;
```

---
title: ALTER LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/alter-listing.md
section: SQL Commands
---

# ALTER LISTING

Modifies the properties of a [listings](../../collaboration/collaboration-listings-about.md) with an inline YAML manifest, or from a file located in a stage location.

> **Note:**
>
> We recommend running [DESCRIBE LISTING](desc-listing.md) to view the current properties of a listing before running `ALTER LISTING`.

See also:
:   [CREATE LISTING](create-listing.md), [DESCRIBE LISTING](desc-listing.md), [SHOW LISTINGS](show-listings.md), [SHOW VERSIONS IN LISTING](show-versions-in-listing.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
ALTER LISTING [ IF EXISTS ] <name> [ { PUBLISH | UNPUBLISH | REVIEW } ]

ALTER LISTING [ IF EXISTS ] <name> AS '<yaml_manifest_string>'
  [ PUBLISH = { TRUE | FALSE } ]
  [ REVIEW = { TRUE | FALSE } ]
  [ COMMENT = '<string>' ]

ALTER LISTING <name> ADD VERSION [ [ IF NOT EXISTS ] <version_name> ]
  FROM <yaml_manifest_stage_location>
  [ COMMENT = '<string>' ]

ALTER LISTING [ IF EXISTS ] <name> { ADD | REMOVE } TARGETS <manifest>

ALTER LISTING [ IF EXISTS ] <name> RENAME TO <new_name>;

ALTER LISTING [ IF EXISTS ] <name> SET COMMENT = '<string>'
```

## Parameters

`name`
:   Specifies the identifier (name) for the listing being altered.

`{ PUBLISH | UNPUBLISH | REVIEW }`
:   The action to perform on the listing:

    * `PUBLISH` Makes a previously undiscoverable listing discoverable.

      Specifying PUBLISH on a previously published listing has no effect.
    * `UNPUBLISH` Makes a previously discoverable listing undiscoverable for new consumers.
      Existing consumers can continue to access the data associated with an unpublished listing.

      Specifying UNPUBLISH on a previously unpublished listing has no effect.

    See also [Unpublish a listing](../../collaboration/provider-listings-modifying.md).

    * `REVIEW` Submits the listing for review.

`yaml_manifest_string`
:   The YAML manifest for the listing. For manifest parameters, see [Listing manifest reference](../../progaccess/listing-manifest-reference.md).

    Manifests are normally provided as dollar quoted strings.
    For more information, see [Dollar-quoted string constants](../data-types-text.md).

`ADD VERSION version_name`
:   Specifies the unique version identifier for the version being added.
    If the identifier contains spaces, special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case sensitive. For information about identifier syntax,
    see [Identifier Requirements](../identifiers-syntax.md).

`FROM 'yaml_manifest_stage_location'`
:   Specifies the path for the internal or Snowflake [Git repository clone](../../developer-guide/git/git-overview.md) manifest.yml file. If the changes require Marketplace Ops review, use the REVIEW and PUBLISH operations.

`{ ADD | REMOVE } TARGETS manifest`
:   Add targets to or remove targets from a listing using the manifest containing *only* the targets you want to add or remove. This partial manifest reuses the familiar structures `targets`, `external_targets`, and `organization_targets`, which are already defined in the listing manifest specification.

    > The table below lists unsupported listing-manifest / incoming-manifest combinations:

    > **Note:**
    >
    > V2 listings are still in preview. Upon feature enablement, all subsequent listings, whether public or private, will be created as v2 listings.

    | External listing targets version | Incoming manifest | Result | Workaround |
    | --- | --- | --- | --- |
    | V1 targets | V2 external targets | Returns an error. | Provide a version 1 incoming manifest. |
    | V2 targets | V1 targets | Returns an error. | Provide a version 2 incoming manifest. |
    | Any external listing | Organization-level target that specifies an organization without accounts. | Returns an error. | Organization-level targets aren’t supported at this time. |

    For organizational listings, the table below lists unsupported use cases for adding and removing targets:

    | External listing | Incoming manifest | Add or remove | Result | Action |
    | --- | --- | --- | --- | --- |
    | Any organization listing | Manifest has the `organization_user_group` field set. | Both | Returns an error. | Remove the `organization_user_group` field and try again. |
    | Account or account and role | Manifest has the `all_internal_accounts` field set to `TRUE`. | Remove | Returns an error. | Remove specific accounts and try again. |
    | The listing has the `all_internal_accounts` field set to `TRUE`. | The incoming manifest includes an account or an account and role. | Remove | Returns an errors. | Replace `all_internal_accounts` with specific accounts and try again. |
    | Account has no roles specified | Incoming manifest has an account with roles. | Remove | Returns an error. | Remove the account first and then add specific roles. |

`RENAME TO new_name`
:   Changes the name of the listing to `new_name`. Listing names must be unique. The new identifier cannot be used if the identifier is already in use for a different listing.

`SET ...`

> Specifies one (or more) properties to set for the listing (separated by blank spaces, commas, or new lines).
>
> `COMMENT = 'string_literal'`
> :   Adds a comment or overwrites the existing comment for an existing listing.

`PUBLISH = { TRUE | FALSE }`
:   Specifies how the listing should be published.

    If TRUE, listing is published immediately on listing to Marketplace Ops for review.

    Default: TRUE.

`REVIEW =  { TRUE | FALSE }`
:   Specifies whether the listing should or should not submitted to Marketplace Ops review.

    Default: TRUE.

Different combinations of values for the PUBLISH and REVIEW properties result in the following behaviors:

| PUBLISH | REVIEW | Behavior |
| --- | --- | --- |
| TRUE | TRUE | Request review then immediately publish after approval. |
| TRUE | FALSE | Results in an error. You cannot publish a listing on the Snowflake Marketplace without review. |
| FALSE | TRUE | Request a review without publishing automatically after review. |
| FALSE | FALSE | Save your listing as a draft without requesting review or publishing. |

## Usage notes

* Listings can be renamed only in DRAFT state.
* When setting the live version of the YAML format manifest for a listing, you must use `COMMIT` to apply the changes, or `ABORT` to discard the changes.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MODIFY | On the listing being modified. |  |

If you’re using the ALTER command to modify the manifest content for auto-fulfillment,
you must use a role with the delegated privileges necessary to configure cross-cloud auto-fulfillment.
See [Delegate privileges to set up auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Alters the listing `mylisting` to use an updated manifest file:

> ```sqlexample
> ALTER LISTING mylisting
> AS
> $$
> title: "MyListing"
> subtitle: "Subtitle for MyListing"
> description: "Description or MyListing"
> listing_terms:
>   type: "STANDARD"
> targets:
>   accounts: ["Org1.Account1"]
> usage_examples:
>   - title: "this is a test sql"
>     description: "Simple example"
>     query: "select *"
> $$;
> ```

Submits the `mylisting` listing for review:

> ```sqlexample
> ALTER LISTING mylisting REVIEW;
> ```

Alters the `mylisting` listing by publishing it:

> ```sqlexample
> ALTER LISTING mylisting PUBLISH;
> ```

Alters the `mylisting` listing by unpublishing it:

> ```sqlexample
> ALTER LISTING mylisting UNPUBLISH;
> ```

Alters the `mylisting` listing by setting a new comment:

> ```sqlexample
> ALTER LISTING mylisting SET COMMENT = 'My listing is ready!';
> ```

Adds a new version from the specified YAML manifest file stage location:

> ```sqlexample
> ALTER LISTING mylisting ADD VERSION V3 FROM @dbforstage.public.listingstage/listingmanifests;
> ```

Alters a listing so that targets will take the incoming manifest and merge it with the existing listing targets:

> ```sqlexample
> ALTER LISTING mylisting ADD TARGETS $$manifest$$;
> ```

Adds targets to an external V1 listing:

> ```sqlexample-yaml
> ALTER LISTING mylisting ADD TARGETS
> $$
> targets:
>   accounts: ["Org1.Account1", "Org2.Account2"]
> $$;
> ```

Adds targets to an external V2 listing:

> ```sqlexample-yaml
> ALTER LISTING mylisting ADD TARGETS
> $$
> external_targets:
>   access:
>     - organization: OrgName2
>       accounts: [acc1, acc2]
> $$;
> ```

When adding targets, this takes the incoming manifest and merges it with the existing `organization_targets`.

> ```sqlexample-yaml
> ALTER LISTING mylisting ADD TARGETS
> $$
> organization_targets:
>   access:
>     - account: account2
>       roles: [role1, role2]
> $$;
> ```

Removes a target:

> ```sqlexample
> ALTER LISTING mylisting REMOVE TARGETS $$manifest$$;
> ```

Removes targets from an external V1 listing:

> ```sqlexample-yaml
> ALTER LISTING mylisting REMOVE TARGETS
> $$
> targets:
>   accounts: ["Org1.Account1", "Org2.Account2"]
> $$;
> ```

Removes targets from an external V2 listing:

> ```sqlexample-yaml
> ALTER LISTING mylisting REMOVE TARGETS
> $$
> external_targets:
>   access:
>     - organization: OrgName2
>       accounts: [acc1, acc2]
> $$;
> ```

Removes targets from an organizational listing:

> ```sqlexample-yaml
> ALTER LISTING mylisting REMOVE TARGETS
> $$
> organization_targets:
>   access:
>     - account: account1
> $$;
> ```

---
title: ALTER MAINTENANCE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-maintenance-policy.md
section: SQL Commands
---

# ALTER MAINTENANCE POLICY

Modifies an existing [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md).

See also:
:   [CREATE MAINTENANCE POLICY](create-maintenance-policy.md), [DROP MAINTENANCE POLICY](drop-maintenance-policy.md), [SHOW MAINTENANCE POLICIES](show-maintenance-policies.md)

## Syntax

```sqlsyntax
 ALTER MAINTENANCE POLICY [ IF EXISTS ] <name> SET
   [ SCHEDULE = '<schedule>' ]
   [ COMMENT = '<comment>' ]

ALTER MAINTENANCE POLICY [ IF EXISTS ] <name> UNSET
   [ COMMENT ]

ALTER MAINTENANCE POLICY [ IF EXISTS ] <name> RENAME TO <new_name>
```

## Parameters

`name`
:   Specifies the identifier of the maintenance policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET`
:   Sets one or more specified properties for the maintenance policy.

    `SCHEDULE = 'schedule'`
    :   Specifies the schedule for the maintenance policy. This parameter uses the
        same syntax as the `SCHEDULE` parameter of the [CREATE TASK](create-task.md) command.

    `COMMENT = 'comment'`
    :   Specifies a comment for the maintenance policy.

`UNSET`
:   Unsets one or more properties for the maintenance policy.

    `COMMENT`
    :   Unsets the maintenance policy.

`RENAME TO new_name`
:   Renames the maintenance policy to a new identifier.
    The new identifier must be unique for the schema.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY or OWNERSHIP | Maintenance policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example changes the maintenance policy schedule to Sundays at 3 AM UTC:

```sqlexample
ALTER MAINTENANCE POLICY my_maintenance_policy SET
  SCHEDULE = 'USING CRON 0 3 * * SUN UTC';
```

---
title: ALTER MASKING POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-masking-policy.md
section: SQL Commands
---

# ALTER MASKING POLICY

Replaces the existing masking policy rules with new rules or a new comment and allows the renaming of a masking policy.

Any changes made to the policy rules go into effect when the next SQL query that uses the masking policy runs.

See also:
:   [Masking policy DDL](../../user-guide/security-column-intro.md)

## Syntax

```sqlsyntax
ALTER MASKING POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER MASKING POLICY [ IF EXISTS ] <name> SET BODY -> <expression_on_arg_name_to_mask>

ALTER MASKING POLICY [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER MASKING POLICY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER MASKING POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER MASKING POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Identifier for the masking policy; must be unique in the parent schema of the policy.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the masking policy; must be unique for your schema. The new identifier cannot be used if the identifier
    is already in place for a different masking policy.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the masking policy:

    `BODY -> expression_on_arg_name_to_mask`
    :   SQL expression that transforms the data in the column designated by `arg_name_mask`.

        The expression can include [Conditional expression functions](../expressions-conditional.md) to represent conditional logic, built-in functions, or UDFs to
        transform the data.

        If a UDF or external function is used inside the masking policy body, the policy owner must have the USAGE privilege on the UDF or
        external function. Users querying a column that has a masking policy applied to it do not need to have USAGE on the UDF or external
        function.

        If a UDF or external function is used inside the conditional masking policy body, the policy owner must have the OWNERSHIP privilege on
        the UDF or external function. Users querying a column that has a conditional masking policy applied to it do not need to have USAGE on
        the UDF or external function.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the masking policy.

        Default: No value

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset for the masking policy, which resets them to the defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Masking policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* If you want to update an existing masking policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE MASKING POLICY](desc-masking-policy.md) command.
* You cannot change the policy signature (i.e. argument name or input/output data type). If you need to change the signature, execute a
  [DROP MASKING POLICY](drop-masking-policy.md) statement on the policy and create a new one.
* Before executing an ALTER statement, you can execute a [DESCRIBE MASKING POLICY](desc-masking-policy.md) statement to determine the argument name to use for
  updating the policy.
* For masking policies that include a subquery in the masking policy body, use [EXISTS](../operators-subquery.md) in the
  WHEN clause. For a representative example, see the custom entitlement table example in the Examples section in
  [CREATE MASKING POLICY](create-masking-policy.md).
* If the policy `body` contains a mapping table lookup, create a centralized mapping table and store the mapping table
  in the same database as the protected table. This is particularly important if the `body` calls the
  [IS_DATABASE_ROLE_IN_SESSION](../functions/is_database_role_in_session.md) function. For details, see the function usage notes.
* Adding a masking policy to a column fails if the column is referenced by a row access policy. For more information, see
  [ALTER ROW ACCESS POLICY](alter-row-access-policy.md).
* If using a [UDF](../../developer-guide/udf/udf-overview.md) in a masking policy, ensure the data type of the column, UDF, and masking
  policy match. For more information, see [User-defined functions in a masking policy](../../user-guide/security-column-intro.md).
* Once you create a dynamic table, you can’t make changes to the masking policy.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the masking policy to use a SHA-512 hash. Users without the ANALYST role see the value as a SHA-512 hash,
while users with the ANALYST role see the plain-text value.

```sqlexample
DESCRIBE MASKING POLICY email_mask;
```

```output
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
| Row | name       | signature     | return_type       | body                                                                  |
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
| 1   | EMAIL_MASK | (VAL VARCHAR) | VARCHAR(16777216) | case when current_role() in ('ANALYST') then val else '*********' end |
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
```

```sqlexample
ALTER MASKING POLICY email_mask SET BODY ->
  CASE
    WHEN current_role() IN ('ANALYST') THEN VAL
    ELSE sha2(val, 512)
  END;
```

---
title: ALTER MATERIALIZED VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/alter-materialized-view.md
section: SQL Commands
---

# ALTER MATERIALIZED VIEW

Alters a materialized view in the current/specified schema. Supported actions include:

* Renaming the materialized view.
* Suspending and resuming use and maintenance of the materialized view.
* Clustering the materialized view.
* Suspending and resuming reclustering of the materialized view.
* Dropping clustering of the materialized view.

For more details, see [Working with Materialized Views](../../user-guide/views-materialized.md).

See also:
:   [CREATE MATERIALIZED VIEW](create-materialized-view.md) , [DROP MATERIALIZED VIEW](drop-materialized-view.md) , [SHOW MATERIALIZED VIEWS](show-materialized-views.md) , [DESCRIBE MATERIALIZED VIEW](desc-materialized-view.md)

## Syntax

```sqlsyntax
ALTER MATERIALIZED VIEW <name>
  {
  RENAME TO <new_name>                     |
  CLUSTER BY ( <expr1> [, <expr2> ... ] )  |
  DROP CLUSTERING KEY                      |
  SUSPEND RECLUSTER                        |
  RESUME RECLUSTER                         |
  SUSPEND                                  |
  RESUME                                   |
  SET {
    [ SECURE ]
    [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
    [ COMMENT = '<comment>' ]
    }                                      |
  UNSET {
    SECURE
    CONTACT <purpose>                                 |
    COMMENT
    }
  }

ALTER MATERIALIZED VIEW
  SET DATA_METRIC_SCHEDULE = {
      '<num> MINUTE'
    | 'USING CRON <expr> <time_zone>'
  }

ALTER MATERIALIZED VIEW UNSET DATA_METRIC_SCHEDULE
```

## Parameters

`name`
:   Specifies the identifier of the materialized view to alter.

`RENAME TO new_name`
:   This option allows you to rename a materialized view.

    The new identifier must be unique for the schema in which the view is created.
    The new identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier string
    is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.
    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    Note that renaming a materialized view does not update references to that view. For example, if
    you create a view named `V1` on top of a materialized view, and then you rename
    the materialized view, the definition of view `V1` becomes out of date.

`CLUSTER BY expr#`
:   This command clusters the materialized view. Clustering
    re-orders the rows in the materialized view to increase performance for queries
    that filter based on the clustering key expressions.

    The `expr#` specifies an expression on which to cluster the materialized view.
    Typically, each expression is the name of a column in the materialized view.

    For more information about clustering materialized views, see:
    [Materialized Views and Clustering](../../user-guide/views-materialized.md).
    For more information about clustering in general, see:
    [What is Data Clustering?](../../user-guide/tables-clustering-micropartitions.md).

`DROP CLUSTERING KEY`
:   This command drops the clustering of the materialized view.

`SUSPEND RECLUSTER`
:   The `SUSPEND RECLUSTER` option suspends re-clustering of the materialized
    view. For more information about clustering materialized views,
    see [Materialized Views and Clustering](../../user-guide/views-materialized.md).

`RESUME RECLUSTER`
:   The `RESUME RECLUSTER` option resumes reclustering of the materialized
    view.

`SUSPEND`
:   The `SUSPEND` option suspends the maintenance (updates) and use of the
    materialized view. While the view is suspended, updates to the base table are
    not propagated to the materialized view. The materialized view itself is
    also inaccessible; if you attempt to use it, you get an error message
    similar to:

    ```output
    Failure during expansion of view 'MV1':
      SQL compilation error: Materialized View MV1 is invalid.
      Invalidation reason: Marked Materialized View as invalid manually.
    ```

    If you suspend a clustered materialized view, suspending the view implicitly
    suspends reclustering of that view.

`RESUME`
:   The `RESUME` option allows you to resume using the materialized view.
    It also resumes maintenance of the materialized view.
    If the view is clustered, it also implicitly resumes reclustering of that view.

`SET ...`
:   Specifies the property to set for the materialized view:

    `SECURE`
    :   This option turns the view into a secure view. For more information about secure views, see
        [Working with Secure Views](../../user-guide/views-secure.md).

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `COMMENT = 'string_literal'`
    :   This option sets a comment for the materialized view. The comment has no effect on the behavior of the view,
        but can provide useful information to people who use or maintain the view.

    `DATA_METRIC_SCHEDULE ...`
    :   Specifies the schedule to run the data metric function periodically.

        `'num MINUTE'`
        :   Specifies an interval (in minutes) of wait time inserted between runs of the data metric function. Accepts positive integers only.

            Also supports `num M` syntax.

            For data metric functions, use one of the following values: `5`, `15`, `30`, `60`, `720`, or `1440`.

        `'USING CRON expr time_zone'`
        :   Specifies a cron expression and time zone for periodically running the data metric function. Supports a subset of standard cron
            utility syntax.

            For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones).

            The cron expression consists of the following fields, and the periodic interval must be at least 5 minutes:

            ```bash
            # __________ minute (0-59)
            # | ________ hour (0-23)
            # | | ______ day of month (1-31, or L)
            # | | | ____ month (1-12, JAN-DEC)
            # | | | | _ day of week (0-6, SUN-SAT, or L)
            # | | | | |
            # | | | | |
              * * * * *
            ```

            The following special characters are supported:

            `*`
            :   Wildcard. Specifies any occurrence of the field.

            `L`
            :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of
                a given month. In the day-of-month field, it specifies the last day of the month.

            `/{n}`
            :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
                specified in the month field, then the data metric function is scheduled for April, July and October (i.e. every 3 months, starting
                with the 4th month of the year). The same schedule is maintained in subsequent years. That is, the data metric function is
                not scheduled to run in January (3 months after the October run).

            > **Note:**
            >
            > * The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
            >   for the account (or setting the value at the user or session level) does not change the time zone for the data metric
            >   function.
            > * The cron expression defines all valid run times for the data metric function. Snowflake attempts to run a data metric
            >   function based on this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid
            >   run time starts.
            > * When both a specific day of month and day of week are included in the cron expression, then the data metric function is scheduled
            >   on days satisfying either the day of month or day of week. For example,
            >   `DATA_METRIC_SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'` schedules a data metric function at 0AM on any 10th to 20th
            >   day of the month and also on any Tuesday or Thursday outside of those dates.
            > * The shortest granularity of time in cron is minutes.

`UNSET ...`
:   Specifies the property to unset for the materialized view:

    * `SECURE`
    * `TAG tag_name [ , tag_name ... ]`
    * `CONTACT purpose`
    * `COMMENT`
    * `DATA_METRIC_SCHEDULE`

    You cannot unset the CONTACT property with other properties in the same statement.

## Usage notes

* Use the [ALTER VIEW](alter-view.md) command to set/unset a masking policy, row access policy, or tag on/from a materialized view.
* You can use data metric functions (DMFs) with materialized views as follows:

  + To set the [DATA_METRIC_SCHEDULE](../parameters.md) parameter on the materialized view, use the ALTER MATERIALIZED VIEW command. For more
    information, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md).
  + To add a DMF to a column or drop a DMF from a column in a materialized view, use the [ALTER VIEW](alter-view.md) command.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename a materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW table1_MV RENAME TO my_mv;
> ```

Cluster a materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv CLUSTER BY(i);
> ```

Suspend clustering of a materialized view, but not use of the view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv SUSPEND RECLUSTER;
> ```

Resume clustering of a materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv RESUME RECLUSTER;
> ```

Suspend all use and automatic maintenance of the specified materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv SUSPEND;
> ```

Resume all use and automatic maintenance of the specified materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv RESUME;
> ```

Stop clustering a materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW my_mv DROP CLUSTERING KEY;
> ```

Modify the view to be a secure view:

> ```sqlexample
> ALTER MATERIALIZED VIEW mv1 SET SECURE;
> ```

Add or replace the comment for a materialized view:

> ```sqlexample
> ALTER MATERIALIZED VIEW mv1 SET COMMENT = 'Sample view';
> ```

---
title: ALTER MODEL
source: https://docs.snowflake.com/en/sql-reference/sql/alter-model.md
section: SQL Commands
---

# ALTER MODEL

Modifies the properties for an existing model, including its name, tags, default version, or comment.

Three other variants of this command exist, namely:

* [ALTER MODEL … ADD VERSION](alter-model-add-version.md) adds a new version of a model.
* [ALTER MODEL … DROP VERSION](alter-model-drop-version.md) removes a version of a model.
* [ALTER MODEL … MODIFY VERSION](alter-model-modify-version.md) sets a model version’s comment or metadata.

## Syntax

```sqlsyntax
ALTER MODEL [ IF EXISTS ] <name> SET
  [ COMMENT = '<string_literal>' ]
  [ DEFAULT_VERSION = '<version_name>']

ALTER MODEL [ IF EXISTS ] <model_name> SET TAG <tag_name> = '<tag_value>'

ALTER MODEL [ IF EXISTS ] <model_name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER MODEL [ IF EXISTS ] <model_name> VERSION <version_name> SET ALIAS = '<alias_name>'

ALTER MODEL [ IF EXISTS ] <model_name> VERSION <version_or_alias_name> UNSET ALIAS

ALTER MODEL <model_name> RENAME TO <new_name>
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) of the model.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more model properties to be set.

    `COMMENT = 'string_literal'`
    :   Sets the comment of the model. This can also be done using the [COMMENT](comment.md) command.

    `DEFAULT_VERSION = 'version_name'`
    :   Sets the default version of the model (the version that methods are invoked on when calling a method directly on the
        model). The version name is an [identifier](../identifiers-syntax.md).

        The system alias DEFAULT refers to the default version.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `ALIAS = 'alias_name'`
    :   Sets `alias_name` as an alias of the version. An alias is an alternative name that can be easily reassigned.
        The alias can be used most places where the version name can be used. A version can have at most one alias.

        The alias name is an [identifier](../identifiers-syntax.md). It must be unique in the model and may
        not duplicate the system alias names, which are:

        * `DEFAULT` refers to the default version of the model.
        * `FIRST` refers to the oldest version of the model by creation time.
        * `LAST` refers to the newest version of the model by creation time.

`UNSET TAG tag_name [ , tag_name ... ]`
:   Specifies one or more tags to be unset on the model.

`UNSET ALIAS`
:   Removes the alias from this model version, if it has one. The system aliases DEFAULT, FIRST, and LAST cannot be removed.
    You may specify the version by its name or by alias.

`RENAME TO new_name`
:   Renames the specified model with a new identifier that is not currently used by any other models in the schema.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When a model is renamed, other objects that reference it must be updated with the new name.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Model | A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object that already exists in the schema. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

---
title: ALTER MODEL MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/alter-model-monitor.md
section: SQL Commands
---

# ALTER MODEL MONITOR

Modifies the properties of a [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md):

* Suspends or resumes the monitor.
* Sets the baseline table the monitor uses.
* Sets the refresh interval for dynamic table operations within the monitor.
* Sets the warehouse the monitor uses.
* Adds or removes segment columns for monitoring specific data segments.

See also:
:   [CREATE MODEL MONITOR](create-model-monitor.md),
    [SHOW MODEL MONITORS](show-model-monitors.md),
    [DESCRIBE MODEL MONITOR](desc-model-monitor.md),
    [DROP MODEL MONITOR](drop-model-monitor.md)

## Syntax

```sqlsyntax
ALTER MODEL MONITOR [ IF EXISTS ] <monitor_name> { SUSPEND | RESUME }

ALTER MODEL MONITOR [ IF EXISTS ] <monitor_name> SET
   [ BASELINE='<baseline_table_name>' ]
   [ REFRESH_INTERVAL='<refresh_interval>' ]
   [ WAREHOUSE=<warehouse_name> ]

ALTER MODEL MONITOR [ IF EXISTS ] <monitor_name> ADD segment_column = '<segment_column_name>'

ALTER MODEL MONITOR [ IF EXISTS ] <monitor_name> DROP segment_column = '<segment_column_name>'
```

## Parameters

`monitor_name`
:   Specifies the identifier (i.e. name) of the model monitor.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more model monitor properties to be set.

    `BASELINE='<baseline_table_name>'`
    :   Sets the baseline table that the monitor uses.

    `WAREHOUSE = warehouse_name`
    :   Sets the warehouse that the monitor uses.

    `REFRESH_INTERVAL = 'refresh_interval'`
    :   The interval at which the monitor refreshes its internal state. The value must be a string representing a time period,
        such as `'1 day'`. The minimum refresh interval is `'60 seconds'`. Supported units include seconds, minutes, hours, and days.
        You may use singular (“hour”) or plural (“hours”) for the interval name.

`ADD segment_column = '<segment_column_name>'`
:   Adds a segment column to the monitor. The specified column must exist in the source data and be of type STRING.
    You can add up to 5 segment columns per monitor. Each segment column should have fewer than 25 unique values for optimal performance.

`DROP segment_column = '<segment_column_name>'`
:   Removes a segment column from the monitor.

For more information about segments, see [ML Observability: Monitoring model behavior over time](../../developer-guide/snowflake-ml/model-registry/model-observability.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Modify | Model monitor |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

---
title: ALTER MODEL … ADD VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-model-add-version.md
section: SQL Commands
---

# ALTER MODEL … ADD VERSION

Adds a new version to an existing model from an existing model version. Versions are the actual model code that contains
methods that can be called to perform inference and other functions.

> **Note:**
>
> Use the [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md) Python API
> to create model versions from scratch. In SQL, you can only create model versions from existing model versions.

Some version properties can be modified (see [ALTER MODEL … MODIFY VERSION](alter-model-modify-version.md)), but the actual model implementation
contained in a version is immutable.

This command also supports the following variant:

* ALTER MODEL .. ADD VERSION … FROM internalStage (creates a model version from an internal stage)

See also:
:   [ALTER MODEL … MODIFY VERSION](alter-model-modify-version.md), [ALTER MODEL … DROP VERSION](alter-model-drop-version.md)

## Syntax

```sqlsyntax
ALTER MODEL [ IF EXISTS ] <name> ADD VERSION <version_name>
  FROM MODEL <source_model_name> [ VERSION <source_version_name> ]
```

## Variant Syntax

This variant is used by the [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md) Python API.
It is not possible to create model versions from scratch in SQL.

```sqlsyntax
ALTER MODEL [ IF EXISTS ] <name> ADD VERSION <version_name> FROM internalStage
```

Where:

```sqlsyntax
internalStage ::=
    @[<namespace>.]<int_stage_name>[/<path>]
| @[<namespace>.]%<table_name>[/<path>]
| @~[/<path>]
```

For additional internal stage details, see [Choosing an internal stage for local files](../../user-guide/data-load-local-file-system-create-stage.md).

## Parameters

`name`
:   Specifies the identifier of the model. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire identifier must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive. For information on identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

`ADD VERSION version_name`
:   Specifies the identifier of the version, which must be unique within the model. If the identifier contains
    spaces, special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive. For information on identifier syntax, see
    [Identifier requirements](../identifiers-syntax.md).

`FROM MODEL source_model_name [ VERSION source_version_or_alias_name ]`
:   Required if not using FROM internalStage variant
    :   Specifies the name of the model from which the version will be obtained.

        To obtain a specific version of that model, specify the `VERSION source_version_or_alias_name` clause. If
        you omit this clause, the command obtains the default version of the source model.

`FROM internalStage`
:   Required if using FROM internalStage variant
    :   Specifies the internal stage that contains the version’s files.

---
title: ALTER MODEL … DROP VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-model-drop-version.md
section: SQL Commands
---

# ALTER MODEL … DROP VERSION

Removes a version from the specified machine learning model.

See also:
:   [ALTER MODEL … ADD VERSION](alter-model-add-version.md), [ALTER MODEL … MODIFY VERSION](alter-model-modify-version.md)

## Syntax

```sqlsyntax
ALTER MODEL [ IF EXISTS ] <name> DROP VERSION <version_name>
```

## Parameters

`name`
:   Specifies the identifier of the model. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire identifier must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive. For information on identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`version_name`
:   Specifies the identifier of the version to be removed.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Usage notes

Aliases are alternative names for model versions. In addition to aliases you create, the following three system aliases are available.

* `DEFAULT` refers to the default version of the model.
* `FIRST` refers to the oldest version of the model by creation time.
* `LAST` refers to the newest version of the model by creation time.

When you drop the first or last model version, the corresponding system alias, `FIRST` or `LAST`, adjusts to point
to the new first or last alias.

You cannot drop the default version of a model. Change the default to a different version, if there is one, using
[ALTER MODEL … SET DEFAULT VERSION](alter-model.md), then drop the unneeded version. If there is no other version to
select as the default, because the model has only one version, [drop the entire model](drop-model.md).

---
title: ALTER MODEL … MODIFY VERSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-model-modify-version.md
section: SQL Commands
---

# ALTER MODEL … MODIFY VERSION

Modifies a version of a model, changing the version’s comment or metadata.

See also:
:   [ALTER MODEL … ADD VERSION](alter-model-add-version.md), [ALTER MODEL … DROP VERSION](alter-model-drop-version.md)

## Syntax

```sqlsyntax
ALTER MODEL [ IF EXISTS ] <name> MODIFY VERSION <version_or_alias_name> SET
  [ COMMENT = '<string_literal>' ]
  [ METADATA = '<json_metadata>']
```

## Parameters

`name`
:   Specifies the identifier of the model.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`version_or_alias_name`
:   Specifies the identifier of the version, either its version name or its alias. Version names that contain spaces or
    that are case sensitive must be enclosed in double quotes. For information on identifier syntax, see
    [Identifier requirements](../identifiers-syntax.md).

    Aliases must be valid identifiers without double quotes.

    See Usage Notes for more information on aliases.

`SET ...`
:   Specifies one or more model version properties to be set.

    `COMMENT = 'string_literal'`
    :   Sets the comment of the version.

    `METADATA = 'json_metadata'`
    :   Sets the metadata of the version. Metadata is a JSON object that stores key-value pairs of your choosing.

## Usage notes

Aliases are alternative names for model versions. In addition to aliases you create, the following three system aliases are available.

* `DEFAULT` refers to the default version of the model.
* `FIRST` refers to the oldest version of the model by creation time.
* `LAST` refers to the newest version of the model by creation time.

---
title: ALTER NETWORK POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-network-policy.md
section: SQL Commands
---

# ALTER NETWORK POLICY

Modifies the properties for an existing network policy.

> **Note:**
>
> Only the network policy owner (that is, the role with the OWNERSHIP privilege on the network policy) or higher can alter a network policy.

See also:
:   [CREATE NETWORK POLICY](create-network-policy.md) , [DESCRIBE NETWORK POLICY](desc-network-policy.md) , [DROP NETWORK POLICY](drop-network-policy.md) , [SHOW NETWORK POLICIES](show-network-policies.md)

## Syntax

```sqlsyntax
ALTER NETWORK POLICY [ IF EXISTS ] <name> SET {
    [ ALLOWED_NETWORK_RULE_LIST = ( '<network_rule>' [ , '<network_rule>' , ... ] ) ]
    [ BLOCKED_NETWORK_RULE_LIST = ( '<network_rule>' [ , '<network_rule>' , ... ] ) ]
    [ ALLOWED_IP_LIST = ( [ '<ip_address>' ] [ , '<ip_address>' ... ] ) ]
    [ BLOCKED_IP_LIST = ( [ '<ip_address>' ] [ , '<ip_address>' ... ] ) ]
    [ COMMENT = '<string_literal>' ] }

ALTER NETWORK POLICY [ IF EXISTS ] <name> UNSET COMMENT

ALTER NETWORK POLICY <name> ADD { ALLOWED_NETWORK_RULE_LIST = '<network_rule>' | BLOCKED_NETWORK_RULE_LIST = '<network_rule>' }

ALTER NETWORK POLICY <name> REMOVE { ALLOWED_NETWORK_RULE_LIST = '<network_rule>' | BLOCKED_NETWORK_RULE_LIST = '<network_rule>' }

ALTER NETWORK POLICY <name>  RENAME TO <new_name>

ALTER NETWORK POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER NETWORK POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the network policy to alter. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies the parameter to set for the network policy:

    > `ALLOWED_NETWORK_RULE_LIST = ( 'network_rule' [ , 'network_rule' , ... ] )`
    > :   Specifies a list of [network rules](../../user-guide/network-rules.md) that contain the network identifiers that are allowed access to
    >     Snowflake. There is no limit on the number of network rules in the list.
    >
    >     Replaces existing network rules in the allowed list. To add network rules without replacing existing ones, use the
    >     `ALTER NETWORK POLICY ... ADD` command.
    >
    > `BLOCKED_NETWORK_RULE_LIST = ( 'network_rule' [ , 'network_rule' , ... ] )`
    > :   Specifies a list of network rules that contain the network identifiers that are denied access to Snowflake. There is no limit on the
    >     number of network rules in the list.
    >
    >     Replaces existing network rules in the blocked list. To add network rules without replacing existing ones, use the
    >     `ALTER NETWORK POLICY ... ADD` command.
    >
    > `ALLOWED_IP_LIST = ( [ ip_address ] [ , ip_address , ... ] )`
    > :   Specifies a list of IPv4 addresses that are allowed access to your Snowflake account. This is referred to as the *allowed list*.
    >
    >     Snowflake recommends using network rules in conjunction with network policies rather than using this property. Use the
    >     `ALLOWED_NETWORK_RULE_LIST` property to specify network rules that contain IPv4 addresses.
    >
    >     If you are not yet using network rules, specify at least one IPv4 address or CIDR block range to allow access to your Snowflake
    >     account. Additionally, if you are not using network rules and this property is specified with an empty list, no IPv4 addresses are
    >     allowed to access your Snowflake account.
    >
    > `BLOCKED_IP_LIST = ( [ ip_address ] [ , ip_address , ... ] )`
    > :   Specifies a list of IPv4 addresses that are denied access to your Snowflake account. This is referred to as the *blocked list*.
    >     To unset this parameter, specify a different CIDR block range, a series of IPv4 addresses, or a single IPv4 address.
    >
    >     Snowflake recommends using network rules in conjunction with network policies rather than using this parameter. Use the
    >     `BLOCKED_NETWORK_RULE_LIST` property to specify network rules that contain IPv4 addresses.
    >
    >     To block public access, use a network rule and add the network rule to the `BLOCKED_NETWORK_RULE_LIST` property. The result is
    >     that only IP addresses that use private connectivity, such as AWS PrivateLink, can access your Snowflake account.
    >
    >     Default: No value; no IP addresses in `ALLOWED_IP_LIST` property are blocked.
    >
    > `COMMENT = 'string_literal'`
    > :   Adds a comment or overwrites an existing comment for the network policy.
    >
    > `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    > :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.
    >
    >     The tag value is always a string, and the maximum number of characters for the tag value is 256.
    >
    >     For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the properties to unset for the network policy, which resets them to the defaults:

    > * `COMMENT`, which removes the comment, if one exists, for the network policy.
    > * `TAG tag_name [ , tag_name ... ]`

`ADD { ALLOWED_NETWORK_RULE_LIST = 'network_rule' | BLOCKED_NETWORK_RULE_LIST = 'network_rule' }`
:   Adds a network rule to the allowed or blocked list of the network policy without removing existing ones.

`REMOVE { ALLOWED_NETWORK_RULE_LIST = 'network_rule' | BLOCKED_NETWORK_RULE_LIST = 'network_rule' }`
:   Removes a network rule from the allowed or blocked list of the network policy.

`RENAME TO ...`
:   Specifies a new name for the existing network policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Network policy | Modifying a network policy requires a role with the OWNERSHIP privilege on the network policy. |

For general information about roles and privilege grants for performing SQL actions on securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A user whose IP address is on the ALLOWED LIST and who attempts to alter a network policy by removing their entire IP CDIR range sees the
  following message:

  ```output
  Changes cannot be applied to the network policy RULE_BASED_POLICY because
  current IP address/token x.xx.xxx.xxx is not allowed in it.
  ```

  This design helps prevent the currently logged-in user from being accidentally blocked or locked out from their Snowflake account.
* Don’t modify a network policy to have empty `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST` properties. Use network rules in
  conjunction with the network policy to manage access to your Snowflake account.
* The `SET` action for the allowed/blocked lists is not additive (that is, it removes all IP addresses in the existing lists
  for the network policy and replaces them with the specified lists).

  As a result, to make additions to the existing lists, you must specify the new IP addresses and replicate the existing lists.
* Each `ip_address` can cover a range of addresses using Classless Inter-Domain Routing (CIDR) notation:

  > `ip_address[/optional_prefix_length]`

  For example:

  > `192.168.1.0/24`
* When a network policy includes values for both `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST`, Snowflake applies the blocked list
  first.
* Do not add `0.0.0.0/0` to `BLOCKED_IP_LIST`. Because Snowflake applies the blocked list first, this would block your own
  access. Additionally, in order to block all IP addresses except a select list, you only need to add IP addresses to `ALLOWED_IP_LIST`.
  Snowflake automatically blocks all IP addresses not included in the allowed list.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Example

For example, use ALTER NETWORK POLICY to change a network policy named `allow_access_policy` that allows network traffic defined by
`allow_access_rule` to also block network traffic defined by `block_access_rule`, consistent with the network rules defined in
[IP ranges](../../user-guide/network-policies.md). First, show the current policy:

```sqlexample
DESC NETWORK POLICY allow_access_policy;
```

```output
+---------------------------+-------------------+
| name                      | value             |
|---------------------------+-------------------|
| ALLOWED_NETWORK_RULE_LIST | allow_access_rule |
+---------------------------+-------------------+
```

Next, change `allow_access_policy` to also use `block_access_rule`, and then show the updated policy:

```sqlexample
ALTER NETWORK POLICY IF EXISTS allow_access_policy SET
  BLOCKED_NETWORK_RULE_LIST = ('block_access_rule');
DESC NETWORK POLICY allow_access_policy;
```

```output
+---------------------------+-------------------+
| name                      | value             |
|---------------------------+-------------------|
| ALLOWED_NETWORK_RULE_LIST | ALLOW_ACCESS_RULE |
| BLOCKED_NETWORK_RULE_LIST | BLOCK_ACCESS_RULE |
+---------------------------+-------------------+
```

Next, rename the updated policy to describe use of both rules:

```sqlexample
ALTER NETWORK POLICY allow_access_policy RENAME TO limit_access_policy;
```

Then, add a comment which describes that `limit_access_policy` is defined by network rules:

```sqlexample
ALTER NETWORK POLICY limit_access_policy SET COMMENT = 'No_Lists_See_Rules';
SHOW NETWORK POLICIES;
```

Output from SHOW NETWORK POLICIES includes the updated policy name, and comment included in the changed (altered) network policy.

```output
+-------------------------------+---------------------+--------------------+----------------------------+----------------------------+----------------------------------+----------------------------------+
| created on                    | name                | comment            | entries_in_allowed_ip_list | entries_in_blocked_ip_list | entries_in_allowed_network_rules | entries_in_blocked_network_rules |
|-------------------------------+------------------------------------------|----------------------------|----------------------------|----------------------------------|----------------------------------|
|...                            |                     |                    |                            |                            |                                  |                                  |
|-------------------------------+------------------------------------------|----------------------------|----------------------------|----------------------------------|----------------------------------|
| 2024-12-04 10:33:19.853 -0800 | LIMIT_ACCESS_POLICY | NO_LISTS_SEE_RULES |                           0|                           0|                                 1|                                 1|
|-------------------------------+------------------------------------------|----------------------------|----------------------------|----------------------------------|----------------------------------|
|...                            |                     |                    |                            |                            |                                  |                                  |
+-------------------------------+---------------------+--------------------+----------------------------+----------------------------+----------------------------------+----------------------------------+
```

---
title: ALTER NETWORK RULE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-network-rule.md
section: SQL Commands
---

# ALTER NETWORK RULE

Modifies an existing network rule.

See also:
:   [CREATE NETWORK RULE](create-network-rule.md) , [DROP NETWORK RULE](drop-network-rule.md) , [DESCRIBE NETWORK RULE](desc-network-rule.md) , [SHOW NETWORK RULES](show-network-rules.md)

## Syntax

```sqlsyntax
ALTER NETWORK RULE [ IF EXISTS ] <name> SET
  VALUE_LIST = ( '<value>'  [ , '<value>', ... ] )
  [ COMMENT = '<string_literal>' ]

ALTER NETWORK RULE [ IF EXISTS ] <name> UNSET { VALUE_LIST | COMMENT }
```

## Parameters

`name`
:   Specifies the identifier of the network rule.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are case-sensitive.

`SET ...`
:   Specifies the properties to set for the network rule:

    `VALUE_LIST = ( 'value'  [, 'value', ...] )`
    :   Replaces the current network identifiers with a new list of identifiers. Using this command is not additive; previously specified
        values are removed when you set a new value list.

        Valid values in the list are determined by the type of network rule:

        > * When `TYPE = IPV4`, each value must be a valid IPv4 address or range of addresses.
        > * When `TYPE = AWSVPCEID`, each value must be a valid VPCE ID. VPC IDs are not supported.
        > * When `TYPE = AZURELINKID`, each value must be a valid LinkID of an Azure [private endpoint](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview).
        >   Execute the [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](../functions/system_get_privatelink_authorized_endpoints.md) function to retrieve the LinkID associated
        >   with an account.
        > * When `TYPE = GCPPSCID`, each value must be a valid pscConnectionID of a [Google Cloud Private Service Connect (PSC) endpoint](https://docs.cloud.google.com/vpc/docs/private-service-connect#endpoints). Run the [gcloud compute forwarding-rules describe command](https://docs.cloud.google.com/memorystore/docs/cluster/multiple-vpcs-automatically-registered-psc-connection#get_the_connection_id_1) to get the pscConnectionID for each forwarding rule.
        > * When `TYPE = HOST_PORT`, each value must resolve to a valid domain. Optionally, it can also include a port or range of ports.
        >
        >   In most cases, the valid port range is 1-65535. If you do not specify a port, it defaults to 443. If an external network location supports dynamic ports, you need to specify all possible ports.
        >
        >   To allow access to all ports, define the port as 0; for example, `example.com:0`.
        >
        >   When the value resolves to a domain, you can use a single asterisk as a wildcard character. The asterisk matches only alphanumeric
        >   characters and hyphens (`-`).
        >
        >   Wildcards are supported only for a single level of subdomains, as in the following examples:
        >
        >   + `*.google.com`
        >   + `snowflake-*.google.com` and `snowflake*abc.google.com`
        >
        >   You can allow requests to all outbound endpoints by specifying `0.0.0.0` as the domain, as in the examples below.
        >   When you specify `0.0.0.0` as the domain, you may use only 443 and 80 as port values.
        >
        >   + Allow access to all endpoints at port 80
        >
        >     ```none
        >     value_list = ('0.0.0.0:80');
        >     ```
        >   + Allow access to all endpoints at port 443
        >
        >     ```none
        >     value_list = ('0.0.0.0:443');
        >     ```
        >
        >     ```none
        >     value_list = ('0.0.0.0');
        >     ```
        >   + Allow access to all endpoints at both port 80 and 443
        >
        >     ```none
        >     value_list = ('0.0.0.0:80', '0.0.0.0:443');
        >     ```
        > * When `TYPE = PRIVATE_HOST_PORT`, specify one valid domain.
        >
        >   In most cases, the valid port range is 1-65535. If you do not specify a port, it defaults to 443. If an external network location supports dynamic ports, you need to specify all possible ports.
        >
        >   To allow access to all ports, define the port as 0; for example, `example.com:0`.

    `COMMENT = 'string_literal'`
    :   Adds a comment for the first time or overwrites an existing comment.

`UNSET ...`
:   Clears properties of the network rule:

    `VALUE_LIST`
    :   Removes all network identifiers from the network rule.

    `COMMENT`
    :   Removes the comment that was associated with the network rule.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Network Rule | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When specifying IP addresses for a network rule, Snowflake supports ranges of IP addresses using [Classless Inter-Domain Routing (CIDR) notation](https://tools.ietf.org/html/rfc4632).

  For example, `192.168.1.0/24` represents all IPv4 addresses in the range of `192.168.1.0` to `192.168.1.255`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Example

Modify a network rule that is used to allow or block traffic from a range of IPv4 addresses. Assumes that `TYPE = IPV4` for the
network rule.

```sqlexample
ALTER NETWORK RULE cloud_network SET VALUE_LIST = ('47.88.25.32/27');
```

Modify a network rule that is used to allow or block traffic over AWS PrivateLink. Assumes that `TYPE = AWS_VPCEID` for the network
rule.

```sqlexample
ALTER NETWORK RULE corporate_network SET VALUE_LIST = ('vpce-123abc3420c1931');
```

Modify a network rule that is used to allow or block traffic over Google Cloud Private Service Connect. Assumes that `TYPE = GCPPSCID`
for the network rule.

```sqlexample
ALTER NETWORK RULE corporate_network SET VALUE_LIST = ('31618973889077266');
```

Modify a network rule that is used to allow traffic to an external destination. Assumes that `TYPE = HOST_PORT` for the network
rule.

```sqlexample
ALTER NETWORK RULE external_access_rule SET VALUE_LIST = ('example.com', 'example.com:443');
```

---
title: ALTER NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notebook.md
section: SQL Commands
---

# ALTER NOTEBOOK

Modifies the properties of an existing [notebook](../../user-guide/ui-snowsight/notebooks.md).

## Syntax

```sqlsyntax
ALTER NOTEBOOK [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER NOTEBOOK [ IF EXISTS ] <name> SET
  [ COMMENT = '<string_literal>' ]
  [ QUERY_WAREHOUSE = <warehouse_to_run_nb_and_sql_queries_in> ]
  [ IDLE_AUTO_SHUTDOWN_TIME_SECONDS = <number_of_seconds> ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name>) [ , ... ] ]
```

## Parameters

`name`
:   Specifies the identifier for the notebook to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the notebook to `new_name`. The new identifier must be unique for the schema.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Sets one or more specified properties or parameters for the notebook:

`QUERY_WAREHOUSE = warehouse_name`
:   Specifies the warehouse where SQL queries in the notebook are run.
    This parameter is optional. However, it is required to run the EXECUTE NOTEBOOK command.

`IDLE_AUTO_SHUTDOWN_TIME_SECONDS = number_of_seconds`
:   Number of seconds of idle time before the notebook is shut down automatically. This parameter is available for notebooks running
    on both Warehouse and Container Runtime. The value must be an integer between 60 and 259200 (72 hours).

    Default: 3600 seconds

`SECRETS = '(secret_variable_name' = secret_name [ , ... ])`
:   Sets secret variables for the notebook.

    * `secret_variable_name` - The variable that will be used in the notebook cell when retrieving information from the secret.
    * `secret_name` - The name of the Snowflake secret.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the notebook, which resets the properties to the defaults:

    * QUERY_WAREHOUSE
    * COMMENT

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | Notebook | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example renames the notebook named `my_notebook` to `notebook_v2`:

```sqlexample
ALTER NOTEBOOK my_notebook RENAME TO notebook_v2;
```

The following example unsets the QUERY_WAREHOUSE property:

```sqlexample
ALTER NOTEBOOK my_notebook UNSET QUERY_WAREHOUSE;
```

---
title: ALTER NOTIFICATION INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION

Modifies the properties for an existing notification integration.

The properties that you can set depend on the type of the messaging service and whether the message is inbound or outbound. The
following topics explain the syntax for altering notification integrations for different use cases:

* [ALTER NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](alter-notification-integration-queue-inbound-azure.md)
* [ALTER NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](alter-notification-integration-queue-inbound-gcp.md)
* [ALTER NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](alter-notification-integration-queue-outbound-aws.md)
* [ALTER NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](alter-notification-integration-queue-outbound-azure.md)
* [ALTER NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](alter-notification-integration-queue-outbound-gcp.md)
* [ALTER NOTIFICATION INTEGRATION (email)](alter-notification-integration-email.md)
* [ALTER NOTIFICATION INTEGRATION (webhooks)](alter-notification-integration-webhooks.md)

See also:
:   [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md) , [DESCRIBE INTEGRATION](desc-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

---
title: ALTER NOTIFICATION INTEGRATION (email)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-email.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (email)

Modifies the properties for an existing notification integration for
[sending email messages](../../user-guide/notifications/email-notifications.md).

See also:
:   [CREATE NOTIFICATION INTEGRATION (email)](create-notification-integration-email.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ ALLOWED_RECIPIENTS = ( '<email_address>' [ , ... '<email_address>' ] ) ]
  [ DEFAULT_RECIPIENTS = ( '<email_address>' [ , ... '<email_address>' ] ) ]
  [ DEFAULT_SUBJECT = '<subject_line>' ]
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET
  ENABLED            |
  ALLOWED_RECIPIENTS |
  DEFAULT_RECIPIENTS |
  DEFAULT_SUBJECT    |
  COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `ALLOWED_RECIPIENTS = ( 'email_address' [ , ... 'email_address' ] )`
    :   (For `TYPE = EMAIL`) A comma-separated list of quoted email addresses that can receive notification emails from this
        integration.

        You must specify email addresses of users in the current account. These users must
        [verify their email addresses](../../user-guide/notifications/email-notifications.md).

        The maximum number of email addresses that you can specify is 50.

        If you omit this parameter, you can send email notifications to any verified email address in the current account.

    `DEFAULT_RECIPIENTS = ( 'email_address' [ , ... 'email_address' ] )`
    :   Specifies the list of default recipients for messages sent with this integration. Use a comma-separated list of quoted email
        addresses to specify the default recipients.

        You must specify email addresses of users in the current account. These users must verify their email addresses.

        To override the default recipients for a given message, use the [EMAIL_INTEGRATION_CONFIG](../functions/email_integration_config.md) helper
        function when calling the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

    `DEFAULT_SUBJECT = 'subject_line'`
    :   Specifies the default subject line for messages sent with this integration.

        The subject cannot exceed 256 characters in length.

        Default: ‘Snowflake Email Notification’

        To override the default subject line for a given message, use the [EMAIL_INTEGRATION_CONFIG](../functions/email_integration_config.md)
        helper function when calling the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `ENABLED`
    * `ALLOWED_RECIPIENTS`
    * `DEFAULT_RECIPIENTS`
    * `DEFAULT_SUBJECT`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

---
title: ALTER NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-queue-inbound-gcp.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)

Modifies the properties for an existing notification integration for receiving messages from a Google Pub/Sub topic.

See also:
:   [CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](create-notification-integration-queue-inbound-gcp.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Disabling or dropping an integration might not take effect immediately because the integration might be cached. To expedite the
  removal process, remove the integration privilege from the cloud provider.

---
title: ALTER NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-queue-inbound-azure.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)

Modifies the properties for an existing notification integration for receiving messages from an Azure Event Grid topic.

See also:
:   [CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](create-notification-integration-queue-inbound-azure.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Disabling or dropping an integration might not take effect immediately because the integration might be cached. To expedite the
  removal process, remove the integration privilege from the cloud provider.

---
title: ALTER NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-queue-outbound-gcp.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)

Modifies the properties for an existing notification integration for
[sending a message to a Google Pub/Sub topic](../../user-guide/notifications/creating-notification-integration-google-pubsub.md).

See also:
:   [CREATE NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](create-notification-integration-queue-outbound-gcp.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  GCP_PUBSUB_SUBSCRIPTION_NAME = '<subscription_id>'
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `GCP_PUBSUB_TOPIC_NAME = 'topic_id'`
    :   Identification of the Pub/Sub topic to which Snowflake pushes notifications.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Disabling or dropping an integration might not take effect immediately because the integration might be cached. To expedite the
  removal process, remove the integration privilege from the cloud provider.

---
title: ALTER NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-queue-outbound-aws.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)

Modifies the properties for an existing notification integration for
[sending a message to an Amazon SNS topic](../../user-guide/notifications/creating-notification-integration-amazon-sns.md).

See also:
:   [CREATE NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](create-notification-integration-queue-outbound-aws.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  AWS_SNS_TOPIC_ARN = '<topic_arn>'
  AWS_SNS_ROLE_ARN = '<iam_role_arn>'
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `AWS_SNS_TOPIC_ARN = 'topic_arn'`
    :   Amazon Resource Name (ARN) of the Amazon SNS (SNS) topic to which notifications are pushed.

    `AWS_SNS_ROLE_ARN = 'iam_role_arn'`
    :   ARN of the IAM role that has permissions to publish messages to the SNS topic.

        > **Note:**
        >
        > The value of AWS_SNS_ROLE_ARN is case-sensitive. Use the exact value that is specified in your AWS account.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Disabling or dropping an integration might not take effect immediately because the integration might be cached. To expedite the
  removal process, remove the integration privilege from the cloud provider.

---
title: ALTER NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-queue-outbound-azure.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)

Modifies the properties for an existing notification integration for
[sending a message to an Azure Event Grid topic](../../user-guide/notifications/creating-notification-integration-azure-event-grid.md).

See also:
:   [CREATE NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](create-notification-integration-queue-outbound-azure.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  AZURE_STORAGE_QUEUE_PRIMARY_URI = '<queue_URL>'
  AZURE_TENANT_ID = '<directory_ID>';
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ NOTIFICATION ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `AZURE_EVENT_GRID_TOPIC_ENDPOINT = 'event_grid_topic_endpoint'`
    :   Event Grid topic endpoint to which Snowflake pushes notifications.

    `AZURE_TENANT_ID = 'ad_directory_id'`
    :   ID of the Azure Active Directory tenant used for identity management. This ID is needed to generate the consent URL that grants
        Snowflake access to the Event Grid topic.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the integration, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Disabling or dropping an integration might not take effect immediately because the integration might be cached. To expedite the
  removal process, remove the integration privilege from the cloud provider.

---
title: ALTER NOTIFICATION INTEGRATION (webhooks)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-notification-integration-webhooks.md
section: SQL Commands
---

# ALTER NOTIFICATION INTEGRATION (webhooks)

Modifies the properties for an existing notification integration for a
[webhook](../../user-guide/notifications/webhook-notifications.md).

See also:
:   [CREATE NOTIFICATION INTEGRATION (webhooks)](create-notification-integration-webhooks.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ WEBHOOK_URL = '<url>' ]
  [ WEBHOOK_SECRET = <secret_name> ]
  [ WEBHOOK_BODY_TEMPLATE = '<template_for_http_request_body>' ]
  [ WEBHOOK_HEADERS = ( '<header_1>'='<value_1>' [ , '<header_N>'='<value_N>', ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER [ NOTIFICATION ] INTEGRATION [ IF EXISTS ] <name> UNSET {
  ENABLED               |
  WEBHOOK_SECRET        |
  WEBHOOK_BODY_TEMPLATE |
  WEBHOOK_HEADERS       |
  COMMENT
}
```

## Parameters

`name`
:   Specifies the identifier for the integration to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Sets one or more properties for the integration:

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` enables the integration.
        * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
          work.

        The value is case-insensitive.

        The default is `TRUE`.

    `WEBHOOK_URL = 'url'`
    :   Specifies the URL for the webhook. The URL must use the `https://` protocol.

        You can only specify the following URLs:

        * URLs for Slack webhooks. These URLs must start with `https://hooks.slack.com/services/`.
        * URLs for Microsoft Teams webhooks. These URLs must use the following general format:

          + Up until November 30, 2025, Microsoft Teams supports URLs in the following format:

            ```none
            https://<hostname>.<region>.logic.azure.com:443/workflows/<secret>
            ```
          + [From November 30, 2025 onward](https://learn.microsoft.com/en-us/troubleshoot/power-platform/power-automate/flow-run-issues/triggers-troubleshoot?tabs=new-designer#changes-to-http-or-teams-webhook-trigger-flows),
            Microsoft Teams supports URLs in the following format:

            ```none
            https://default<hostname>.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/<secret>/triggers/manual/paths/invoke
            ```
          > **Note:**
          >
          > You must omit the port number (`:443`) from the URL in the WEBHOOK_URL parameter.
          > For information about the Microsoft API data format, see <https://adaptivecards.io/> .
        * URLs for PagerDuty webhooks. This URL must be `https://events.pagerduty.com/v2/enqueue`.

        If the URL includes a secret and you [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md),
        replace that secret in the URL with SNOWFLAKE_WEBHOOK_SECRET. For example, if you
        [created a secret object for the secret in a Slack webhook URL](../../user-guide/notifications/webhook-notifications.md), set
        WEBHOOK_URL to:

        ```sqlexample
        WEBHOOK_URL='https://hooks.slack.com/services/SNOWFLAKE_WEBHOOK_SECRET'
        ```

    `WEBHOOK_SECRET = secret_name`
    :   Specifies the [secret to use with this integration](../../user-guide/notifications/webhook-notifications.md).

        If you are using the SNOWFLAKE_WEBHOOK_SECRET placeholder in WEBHOOK_URL, WEBHOOK_BODY_TEMPLATE, or WEBHOOK_HEADERS, the
        placeholder is replaced by this secret when you send a notification.

        If the database and schema containing the secret object will not be active when you send a notification,
        [qualify the secret name with the schema name or the database and schema names](../name-resolution.md). For
        example:

        ```sqlexample
        WEBHOOK_SECRET = my_secrets_db.my_secrets_schema.my_slack_webhook_secret
        ```

        You must have the USAGE privilege on the secret (and the database and schema that contain it) to specify this parameter.

        Default: No value

    `WEBHOOK_BODY_TEMPLATE = 'template_for_http_request_body'`
    :   Specifies a template for the body of the HTTP request to send for the notification.

        If the webhook requires a specific format for the body of the HTTP request (for example, a specific JSON format), set this to
        a string that specifies the format. In this string:

        * If the message needs to include a secret and you
          [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md), use the SNOWFLAKE_WEBHOOK_SECRET
          placeholder where the secret should appear in the message.
        * Use the SNOWFLAKE_WEBHOOK_MESSAGE placeholder where the notification message needs to be included.

        For example:

        ```sqlexample
        WEBHOOK_BODY_TEMPLATE='{
          "routing_key": "SNOWFLAKE_WEBHOOK_SECRET",
          "event_action": "trigger",
          "payload":
            {
              "summary": "SNOWFLAKE_WEBHOOK_MESSAGE",
              "source": "Snowflake monitoring",
              "severity": "INFO",
            }
          }'
        ```

        If you set WEBHOOK_BODY_TEMPLATE, you must also set WEBHOOK_HEADERS to include the `Content-Type` header with the type
        of your message. For example, if you set WEBHOOK_BODY_TEMPLATE to a template in JSON format, set WEBHOOK_HEADERS to include
        the header `Content-Type: application/json`:

        ```sqlexample
        WEBHOOK_HEADERS=('Content-Type'='application/json')
        ```

        Default: No value

    `WEBHOOK_HEADERS = ( 'header'='value' [ , 'header'='value', ... ] )`
    :   Specifies a list of HTTP headers and values to include in the HTTP request for the webhook.

        If an HTTP header must include a secret (for example, the `Authorization` header) and you
        [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md), use the SNOWFLAKE_WEBHOOK_SECRET
        placeholder in the header value. For example:

        ```sqlexample
        WEBHOOK_HEADERS=('Authorization'='Basic SNOWFLAKE_WEBHOOK_SECRET')
        ```

        Default: No value

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

`UNSET ...`
:   Unsets one or more properties for the integration, which resets the properties to their default values:

    * `ENABLED`
    * `WEBHOOK_SECRET`
    * `WEBHOOK_BODY_TEMPLATE`
    * `WEBHOOK_HEADERS`
    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Secret | If you set the WEBHOOK_SECRET property to a secret object, you must have the USAGE privilege on that secret and on the database and schema containing that secret. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

---
title: ALTER ONLINE FEATURE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-online-feature-table.md
section: SQL Commands
---

# ALTER ONLINE FEATURE TABLE

Modifies the properties of an existing [online feature table](create-online-feature-table.md).

See also:
:   [CREATE ONLINE FEATURE TABLE](create-online-feature-table.md) , [DESCRIBE ONLINE FEATURE TABLE](desc-online-feature-table.md), [DROP ONLINE FEATURE TABLE](drop-online-feature-table.md) , [SHOW ONLINE FEATURE TABLES](show-online-feature-tables.md)

## Syntax

```sqlsyntax
ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> { SUSPEND | RESUME }

ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> REFRESH

ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> SET
  [ TARGET_LAG = '<num> { seconds | minutes | hours | days }' ]
  [ WAREHOUSE = <warehouse_name> ]

ALTER ONLINE FEATURE TABLE [ IF EXISTS ] <name> <tagAction>
```

## Parameters

`name`
:   Specifies the identifier for the online feature table to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the online feature table to `new_name`. The new identifier must be unique for the schema.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    When an object is renamed, other objects that reference it must be updated with the new name.

`SUSPEND | RESUME`
:   Specifies whether the periodic background refreshes of the data in the table are suspended or resumed.

    `SUSPEND`
    :   Suspends the periodic background refreshes of the online feature table.

    `RESUME`
    :   Resumes the periodic background refreshes of the online feature table.

`REFRESH`
:   Specifies that the online feature table must be manually refreshed.

`SET ...`
:   Sets one or more specified properties or parameters for the online feature table:

    `TARGET_LAG = 'num { seconds | minutes | hours | days }'`
    :   Specifies the new target lag to use to define the schedule of the background refreshes.

        Must be a value between 10 seconds and 8 days, inclusive.

    `WAREHOUSE = warehouse_name`
    :   Specifies the name of the new warehouse that provides the compute resources for refreshing the online feature table.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the online feature table.

`tagAction`
:   Sets or unsets the tag on the online feature table:

    ```sqlsyntax
    tagAction ::=
      {
          SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
        | UNSET TAG <tag_name> [ , <tag_name> ... ]
      }
    ```

    `SET TAG`
    :   Sets the specified tag and tag value on the online feature table.

    `UNSET TAG`
    :   Unsets the specified tag on the online feature table.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Online feature table | Role that has the OWNERSHIP privilege on the online feature table. |
| USAGE | Warehouse | Required when changing the warehouse |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example suspends the periodic background refreshes for the online feature table named `my_online_feature_table`:

```sqlexample
ALTER ONLINE FEATURE TABLE my_online_feature_table SUSPEND;
```

The following example manually refreshes the online feature table named `my_online_feature_table`:

```sqlexample
ALTER ONLINE FEATURE TABLE my_online_feature_table REFRESH;
```

The following example changes the target lag for the online feature table named `my_online_feature_table`:

```sqlexample
ALTER ONLINE FEATURE TABLE my_online_feature_table SET TARGET_LAG = '1 minute';
```

The following example changes the name of the online feature table `my_online_feature_table` to `my_new_online_feature_table`:

```sqlexample
ALTER ONLINE FEATURE TABLE my_online_feature_table RENAME TO my_new_online_feature_table;
```

---
title: ALTER OPENFLOW DATA PLANE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-oflow-data-plane.md
section: SQL Commands
---

# ALTER OPENFLOW DATA PLANE

Modifies an Openflow data plane integration.

See also:
:   [DESCRIBE OPENFLOW DATA PLANE INTEGRATION](desc-oflow-data-plane-integration.md), [SHOW OPENFLOW DATA PLANE INTEGRATIONS](show-oflow-data-plane-integration.md),

## Syntax

```sqlsyntax
ALTER OPENFLOW DATA PLANE INTEGRATION <name>
    SET EVENT_TABLE = '<database>.<schema>.<tablename>';
```

## Parameters

`name`
:   Specifies the identifier (name) for the Openflow data plane integration being altered.

`SET ...`

> Specifies the properties to set for the DATA PLANE INTEGRATION.
>
> `EVENT_TABLE = 'event table name'`
> :   Fully qualified name of the event table to associate with the Openflow data plane integration.

## Usage notes

* Openflow data plane integrations cannot be created directly, but rather are created when a deployment is created.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MODIFY | On the OPENFLOW DATA PLANE INTEGRATION being modified. |  |

## Examples

Alter an OPENFLOW DATA PLANE INTEGRATION to specify a specific event table.

```sqlexample
SHOW OPENFLOW DATA PLANE INTEGRATIONS;

ALTER OPENFLOW DATA PLANE INTEGRATION OPENFLOW_DATAPLANE_63600E17_5D91_4C56_BFC8_54FA0AXXXXXX
    SET EVENT_TABLE = 'openflow.openflow.openflow_events';
```

---
title: ALTER ORGANIZATION ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-organization-account.md
section: SQL Commands
---

# ALTER ORGANIZATION ACCOUNT

Modifies the properties of an existing [organization account](../../user-guide/organization-accounts.md).

See also:
:   [CREATE ORGANIZATION ACCOUNT](create-organization-account.md), [SHOW ORGANIZATION ACCOUNTS](show-organization-accounts.md)

## Syntax

```sqlsyntax
ALTER ORGANIZATION ACCOUNT SET { [ accountParams ] | [ objectParams ] | [ sessionParams ] }

ALTER ORGANIZATION ACCOUNT UNSET <param_name> [ , ... ]

ALTER ORGANIZATION ACCOUNT SET RESOURCE_MONITOR = <monitor_name>

ALTER ORGANIZATION ACCOUNT SET { PASSWORD | SESSION } POLICY <policy_name>

ALTER ORGANIZATION ACCOUNT UNSET { PASSWORD | SESSION } POLICY

ALTER ORGANIZATION ACCOUNT SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER ORGANIZATION ACCOUNT UNSET TAG <tag_name> [ , <tag_name> ... ]
```

> **Note:**
>
> The accountParams, objectParams, and sessionParams for the organization account are identical to the parameters for other accounts. See [ALTER ACCOUNT](alter-account.md) for their syntax.

## Parameters

`name`
:   Specifies the identifier for the organization account to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one (or more) account, session, and object parameters to set for your organization account (separated by blank spaces, commas, or new lines):

    * Account parameters cannot be changed by any other users.
    * Session and object parameters set at the account level serve only as defaults and can be changed by other users.

    For descriptions of the parameters you can set for your organization account, see [Parameters](../parameters.md).

`UNSET ...`
:   Specifies one (or more) account, session, and object parameters to unset for your account, which resets them to the system defaults.

    You can reset multiple properties with a single ALTER statement; however, each property must be separated by a comma. When resetting a
    property, specify only the name; specifying a value for the property will return an error.

`SET RESOURCE_MONITOR resource_monitor_name`
:   Special parameter that specifies the name of the resource monitor used to control all virtual warehouses created in the account.

    The organization account is not intended to be heavily used for analytics or other workloads.

`{ PASSWORD | SESSION } POLICY policy_name`
:   Specifies the [password policy](../../user-guide/password-authentication.md) or the [session policy](../../user-guide/session-policies.md) to set for the
    account.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access Control Requirements

Only a user with the GLOBALORGADMIN role can execute this command.

## Examples

Rename the organization account while allowing users to use either the new or the old account URL to access the account.

> ```sqlexample
> ALTER ORGANIZATION ACCOUNT original_acctname RENAME TO new_acctname;
> ```

---
title: ALTER ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-organization-profile.md
section: SQL Commands
---

# ALTER ORGANIZATION PROFILE

Modifies the properties of an [organization profile](../../user-guide/collaboration/organization-profiles/org-profiles-create-manage.md)
using an inline YAML manifest, or using a YAML manifest file located in a stage location.

See also:
:   [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
ALTER ORGANIZATION PROFILE [ IF EXISTS ] <name> AS '<yaml_manifest_string>'

ALTER ORGANIZATION PROFILE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER ORGANIZATION PROFILE [ IF EXISTS ] <name> PUBLISH

ALTER ORGANIZATION PROFILE <name> ADD VERSION [ [ IF NOT EXISTS ] <version_alias_name> ]
  FROM @<yaml_manifest_stage_location>

ALTER ORGANIZATION PROFILE <name> ADD LIVE VERSION [ [ IF NOT EXISTS ] <version_alias_name> ]
  FROM LAST

ALTER ORGANIZATION PROFILE <name> COMMIT

ALTER ORGANIZATION PROFILE <name> ABORT
```

## Parameters

`name`
:   Specifies the identifier (name) for the organization profile being altered. Organization profile names can only contain uppercase characters or numbers, and they must start with an uppercase character.

`RENAME TO new_name`
:   Changes the name of the organization profile to `new_name`. The new identifier must be unique within the current organization. The identifier must conform to Snowflake identifier requirements. See [Identifier requirements](../identifiers-syntax.md). Additionally, organization profile names can only contain uppercase characters or numbers, and they must start with an uppercase character.

    > **Note:**
    >
    > An organization profile with the same name cannot already exist in the organization;
    > otherwise, the statement returns an error.

`PUBLISH`
:   Makes a previously undiscoverable organization profile discoverable.

`ADD VERSION [ [ IF NOT EXISTS ] version_alias_name ]`
:   Specifies the unique version identifier for the version being added. If `version_alias_name` isn’t specified, an alias isn’t created. If the identifier contains spaces, special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. The FIRST, LAST, DEFAULT, and LIVE keywords are reserved as version shortcuts and can’t be used. The unique version identifier can’t start with “version$” and can’t contain slashes ( / ). For information about identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

`ADD LIVE VERSION [ [ IF NOT EXISTS ] version_alias_name ]`
:   > Adds a new live editable version with the specified name from the last committed version. `version_alias_name` is optional and if it isn’t specified, an alias isn’t created. If the identifier contains spaces, special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. The FIRST, LAST, DEFAULT, and LIVE keywords are reserved as version shortcuts and can’t be used. The unique version identifier can’t start with “version$” and can’t contain slashes ( / ). For information about identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

    Changes made to the files in a live version are not applied to the organization profile until the live version is committed. The properties of an organization profile remain unchanged until the live version is committed.

`AS yaml_manifest_string`
:   The YAML manifest for the organization profile. For organizational listing profile manifest fields,
    see [Organization profile manifest reference](../../user-guide/collaboration/organization-profiles/org-profile-manifest-reference.md).

    Inline manifests are normally provided as dollar-quoted strings.
    For more information, see [Dollar-quoted string constants](../data-types-text.md).

`FROM 'yaml_manifest_stage_location'`
:   Specifies the external stage, internal stage, or Snowflake [Git repository clone](../../developer-guide/git/git-overview.md) YAML format manifest stage location.

`COMMIT`
:   Commits the changes in the organization profile. The live version being committed must contain a valid organization profile manifest file.

`ABORT`
:   Discards the changes in the organization profile.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MODIFY | Organization profile |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Organization profiles can be renamed only when they are in draft state.
* When setting the live version of the YAML format manifest for an organization profile, you must use COMMIT to apply the changes, or ABORT to discard the changes. An organization profile can only have one live version at a time.

## Examples

Alter the organization profile MYORGPROFILE to use an updated manifest file:

```sqlexample
ALTER ORGANIZATION PROFILE MYORGPROFILE ADD VERSION V2 FROM @STAGE_PATH_WITH_UPDATED_MANIFEST;
```

Publish the organization profile MYORGPROFILE:

```sqlexample
ALTER ORGANIZATION PROFILE MYORGPROFILE PUBLISH;
```

---
title: ALTER ORGANIZATION USER
source: https://docs.snowflake.com/en/sql-reference/sql/alter-organization-user.md
section: SQL Commands
---

# ALTER ORGANIZATION USER

Modifies the properties of an existing [organization user](../../user-guide/organization-users.md).

See also:
:   [CREATE ORGANIZATION USER](create-organization-user.md) , [DROP ORGANIZATION USER](drop-organization-user.md) , [SHOW ORGANIZATION USERS](show-organization-users.md)

## Syntax

```sqlsyntax
ALTER ORGANIZATION USER [ IF EXISTS ] <name> SET [ objectProperties ]

ALTER ORGANIZATION USER <name> UNSET [ objectProperties ]
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   EMAIL = '<string>'
>   DISPLAY_NAME = '<string>'
>   FIRST_NAME = '<string>'
>   MIDDLE_NAME = '<string>'
>   LAST_NAME = '<string>'
>   COMMENT = '<string>'
> ```

## Parameters

`name`
:   Specifies the identifier for the organization user to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Set object properties. For a description of the object properties, see [CREATE ORGANIZATION USER](create-organization-user.md).

`UNSET ...`
:   Unset object properties. For a description of the object properties, see [CREATE ORGANIZATION USER](create-organization-user.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE ORGANIZATION USERS | Account | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

```sqlexample
ALTER ORGANIZATION USER alice
  SET LOGIN_NAME = 'asmith';
```

---
title: ALTER ORGANIZATION USER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/alter-organization-user-group.md
section: SQL Commands
---

# ALTER ORGANIZATION USER GROUP

Modifies the properties of an existing [organization user group](../../user-guide/organization-users.md).

See also:
:   [CREATE ORGANIZATION USER GROUP](create-organization-user-group.md) , [DROP ORGANIZATION USER GROUP](drop-organization-user-group.md) , [SHOW ORGANIZATION USER GROUPS](show-organization-user-groups.md)

## Syntax

```sqlsyntax
ALTER ORGANIZATION USER GROUP [ IF EXISTS ] <name> ADD ORGANIZATION USERS <org_user> [ , <org_user> ... ]

ALTER ORGANIZATION USER GROUP [ IF EXISTS ] <name> REMOVE ORGANIZATION USERS <org_user> [ , <org_user> ... ]

ALTER ORGANIZATION USER GROUP [ IF EXISTS ] <name> SET VISIBILITY =
  { ALL
  | ACCOUNTS <account> [ , <account> ... ]
  | REGION GROUPS '<region_group>' [ , '<region_group>' ... ]
  }

ALTER ORGANIZATION USER GROUP [ IF EXISTS ] <name> SET IS_GRANTABLE = { TRUE | FALSE }
```

## Parameters

`name`
:   Specifies the identifier for the organization user group to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ADD ORGANIZATION USERS org_user [ , org_user ]`
:   Specifies the organization users that you want to add to the organization user group. A comma-delimited list of organization user objects.

    Adding new organization users as members of an organization user group does not remove existing members of the group.

`REMOVE ORGANIZATION USERS org_user [ , org_user ]`
:   Specifies the organization users that you want to remove from the organization user group. A comma-delimited list of organization user objects.

`SET VISIBILITY = ALL` or . `SET VISIBILITY = ACCOUNTS account [ , account ... ]` or . `SET VISIBILITY = REGION GROUPS 'region_group' [ , 'region_group' ... ]`
:   Specifies which accounts can view and add the organization user group.

    > **Note:**
    >
    > An organization administrator cannot unilaterally hide an organization user group from an
    > account that previously had visibility. An administrator in the regular account must run the ALTER ACCOUNT REMOVE ORGANIZATION USER GROUP
    > command to remove the organization user group from the account before the organization administrator can change the visibility.

    `ACCOUNTS account [ , account ... ]`
    :   Only the specified accounts can view and add the organization user group.

        Specify the account name without the name of the organization. Do not use the account locator.

    `REGION GROUPS 'region_group' [ , 'region_group' ... ]`
    :   Only accounts in the specified [region groups](../../user-guide/admin-account-identifier.md) can view and add the organization user group.

`SET IS_GRANTABLE = { TRUE | FALSE }`
:   Specifies whether the role that is imported into a regular account from the organization user group can be granted to an
    account-specific role. If `TRUE`, the role that is created when the ACCOUNTADMIN imports the organization user group can be
    granted to another role.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE ORGANIZATION USER GROUPS | Account | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Add organization users `joe` and `mary` to an organization user group `marketing`.

> ```sqlexample
> ALTER ORGANIZATION USER GROUP marketing ADD ORGANIZATION USERS joe, mary;
> ```

Remove organization user `dave` from the organization user group `data_stewards`.

> ```sqlexample
> ALTER ORGANIZATION USER GROUP data_stewards REMOVE ORGANIZATION USERS dave;
> ```

Allow all accounts in the organization to add the organization user group:

> ```sqlexample
> ALTER ORGANIZATION USER GROUP data_stewards SET VISIBILITY = ALL;
> ```

Only allow the account `qa_env` to add the organization user group:

> ```sqlexample
> ALTER ORGANIZATION USER GROUP data_stewards SET VISIBILITY = ACCOUNTS qa_env;
> ```

---
title: ALTER PACKAGES POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-packages-policy.md
section: SQL Commands
---

# ALTER PACKAGES POLICY

Modifies the properties for an existing [packages policy](../../developer-guide/udf/python/packages-policy.md).

Any changes made to the packages policy properties go into effect when the next SQL query that uses the packages policy runs.

## Syntax

```sqlsyntax
ALTER PACKAGES POLICY [ IF EXISTS ] <name> SET
  [ ALLOWLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ BLOCKLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ ADDITIONAL_CREATION_BLOCKLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER PACKAGES POLICY [ IF EXISTS ] <name> UNSET
  [ ALLOWLIST ]
  [ BLOCKLIST ]
  [ ADDITIONAL_CREATION_BLOCKLIST ]
  [ COMMENT ]
```

## Parameters

`name`
:   Specifies the identifier for the packages policy to alter. If the identifier contains spaces or special characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties to set for the packages policy.

    `ALLOWLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
    :   Specifies a list of package specs that are allowed.

        Default: `('*')` (i.e. allow all packages).

    `BLOCKLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
    :   Specifies a list of package specs that are blocked. To unset this parameter, specify an empty list.

        Default: `()` (i.e. do not block any packages).

    `ADDITIONAL_CREATION_BLOCKLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
    :   Specifies a list of package specs that are blocked at creation time. To unset this parameter, specify an empty list.
        If the `ADDITIONAL_CREATION_BLOCKLIST` is set, it is appended to the basic BLOCKLIST at the creation time.
        For temporary UDFs and anonymous stored procedures, the `ADDITIONAL_CREATION_BLOCKLIST` is appended to the basic BLOCKLIST at both creation and execution time.

        Default: `()` (i.e. do not block any packages).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the packages policy.

`UNSET ...`
:   Specifies one or more properties to unset for the packages policy, which resets them to the defaults:

    > * `ALLOWLIST`
    > * `BLOCKLIST`
    > * `ADDITIONAL_CREATION_BLOCKLIST`
    > * `COMMENT`
    >
    > You can reset multiple properties with a single ALTER statement; however, each property must be separated by a comma. When resetting
    > a property, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Packages policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you want to update an existing packages policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run
  the [DESCRIBE PACKAGES POLICY](desc-packages-policy.md) command.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the packages policy.

> ```sqlexample
> ALTER PACKAGES POLICY packages_policy_prod_1 SET ALLOWLIST = ('pandas==1.2.3');
> ```

---
title: ALTER PASSWORD POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-password-policy.md
section: SQL Commands
---

# ALTER PASSWORD POLICY

Modifies the properties for an existing password policy.

Any changes made to the password policy properties go into effect when the next SQL query that uses the password policy runs.

See also:
:   [DDL commands](../../user-guide/password-authentication.md)

## Syntax

```sqlsyntax
ALTER PASSWORD POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER PASSWORD POLICY [ IF EXISTS ] <name> SET [ PASSWORD_MIN_LENGTH = <integer> ]
                                               [ PASSWORD_MAX_LENGTH = <integer> ]
                                               [ PASSWORD_MIN_UPPER_CASE_CHARS = <integer> ]
                                               [ PASSWORD_MIN_LOWER_CASE_CHARS = <integer> ]
                                               [ PASSWORD_MIN_NUMERIC_CHARS = <integer> ]
                                               [ PASSWORD_MIN_SPECIAL_CHARS = <integer> ]
                                               [ PASSWORD_MIN_AGE_DAYS = <integer> ]
                                               [ PASSWORD_MAX_AGE_DAYS = <integer> ]
                                               [ PASSWORD_MAX_RETRIES = <integer> ]
                                               [ PASSWORD_LOCKOUT_TIME_MINS = <integer> ]
                                               [ PASSWORD_HISTORY = <integer> ]
                                               [ COMMENT = '<string_literal>' ]

ALTER PASSWORD POLICY [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PASSWORD POLICY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PASSWORD POLICY [ IF EXISTS ] <name> UNSET [ PASSWORD_MIN_LENGTH ]
                                                 [ PASSWORD_MAX_LENGTH ]
                                                 [ PASSWORD_MIN_UPPER_CASE_CHARS ]
                                                 [ PASSWORD_MIN_LOWER_CASE_CHARS ]
                                                 [ PASSWORD_MIN_NUMERIC_CHARS ]
                                                 [ PASSWORD_MIN_SPECIAL_CHARS ]
                                                 [ PASSWORD_MIN_AGE_DAYS ]
                                                 [ PASSWORD_MAX_AGE_DAYS ]
                                                 [ PASSWORD_MAX_RETRIES ]
                                                 [ PASSWORD_LOCKOUT_TIME_MINS ]
                                                 [ PASSWORD_HISTORY ]
                                                 [ COMMENT ]
```

## Parameters

`name`
:   Identifier for the password policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the session policy; must be unique for your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one or more parameters to set for the password policy separated by blank spaces, commas, or new lines.

    `PASSWORD_MIN_LENGTH = integer`
    :   Specifies the minimum number of characters the password must contain.

        Supported range: 8 to 256, inclusive.

        Default: 14

    `PASSWORD_MAX_LENGTH = integer`
    :   Specifies the maximum number of characters the password must contain. This number must be greater than or equal to the sum of
        `PASSWORD_MIN_LENGTH`, `PASSWORD_MIN_UPPER_CASE_CHARS`, and `PASSWORD_MIN_LOWER_CASE_CHARS`.

        Supported range: 8 to 256, inclusive.

        Default: 256

    `PASSWORD_MIN_UPPER_CASE_CHARS = integer`
    :   Specifies the minimum number of uppercase characters the password must contain.

        Supported range: 0 to 256, inclusive.

        Default: 1

    `PASSWORD_MIN_LOWER_CASE_CHARS = integer`
    :   Specifies the minimum number of lowercase characters the password must contain.

        Supported range: 0 to 256, inclusive.

        Default: 1

    `PASSWORD_MIN_NUMERIC_CHARS = integer`
    :   Specifies the minimum number of numeric characters the password must contain.

        Supported range: 0 to 256, inclusive.

        Default: 1

    `PASSWORD_MIN_SPECIAL_CHARS = integer`
    :   Specifies the minimum number of special characters the password must contain.

        Supported range: 0 to 256, inclusive.

        Default: 0

    `PASSWORD_MIN_AGE_DAYS = integer`
    :   Specifies the number of days the user must wait before a recently changed password can be changed again.

        Supported range: 0 to 999, inclusive.

        Default: 0

    `PASSWORD_MAX_AGE_DAYS = integer`
    :   Specifies the maximum number of days before the password must be changed.

        Supported range: 0 to 999, inclusive.

        A value of zero (i.e. `0`) indicates that the password does not need to be changed. Snowflake does not recommend choosing this
        value for a default account-level password policy or for any user-level policy. Instead, choose a value that meets your internal
        security guidelines.

        Default: 90, which means the password must be changed every 90 days.

        > **Important:**
        >
        > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

    `PASSWORD_MAX_RETRIES = integer`
    :   Specifies the maximum number of attempts to enter a password before being locked out.

        Supported range: 1 to 10, inclusive.

        Default: 5

        > **Important:**
        >
        > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

    `PASSWORD_LOCKOUT_TIME_MINS = integer`
    :   Specifies the number of minutes the user account will be locked after exhausting the designated number of password retries
        (i.e. `PASSWORD_MAX_RETRIES`).

        Supported range: 1 to 999, inclusive.

        Default: 15

        > **Important:**
        >
        > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

    `PASSWORD_HISTORY = integer`
    :   Specifies the number of the most recent passwords that Snowflake stores. These stored passwords cannot be repeated when a user updates
        their password value.

        The current password value does not count towards the history.

        When you increase the history value, Snowflake saves the previous values.

        When you decrease the value, Snowflake saves the stored values up to that value that is set. For example, if the history value is 8 and
        you change the history value to 3, Snowflake stores the most recent 3 password values and deletes the 5 older password values from the
        history.

        Default: 5

        Max: 24

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the password policy.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more parameters to unset for the password policy, which resets them to the system defaults.

    You can reset multiple properties with a single ALTER statement. Each property must be separated by a comma. When
    resetting a property, specify only the name. Specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Password policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on password policy DDL and privileges, see [DDL commands](../../user-guide/password-authentication.md).

## Usage notes

* Before executing this command, run the [DESCRIBE PASSWORD POLICY](desc-password-policy.md) command to determine the attribute values of the policy.

  If you want to update an existing password policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE PASSWORD POLICY](desc-password-policy.md) command.
* Moving a password policy to a managed access schema is prohibited unless the password policy owner (i.e. the role that has the
  OWNERSHIP privilege on the password policy) also owns the target schema. For more information, see
  [Overview of Access Control Privileges](../../user-guide/security-access-control-overview.md).

## Examples

The following example describes the current password policy, and then updates the password policy to specify the number of allowed password
retries:

> ```sqlexample
> DESC PASSWORD POLICY password_policy_prod_1;
>
> ALTER PASSWORD POLICY password_policy_prod_1 SET PASSWORD_MAX_RETRIES = 3;
> ```

---
title: ALTER PIPE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-pipe.md
section: SQL Commands
---

# ALTER PIPE

Modifies a limited set of properties for an existing pipe object. Also supports the following operations:

* Pausing the pipe.
* Refreshing a pipe (i.e. copying the specified staged data files to the Snowpipe ingest queue for loading into the target table).
* Adding/overwriting/removing a comment for a pipe.
* Setting/unsetting a tag on a pipe.

See also:
:   [CREATE PIPE](create-pipe.md), [DROP PIPE](drop-pipe.md) , [SHOW PIPES](show-pipes.md) , [DESCRIBE PIPE](desc-pipe.md)

## Syntax

```sqlsyntax
ALTER PIPE [ IF EXISTS ] <name> SET { [ objectProperties ]
                                      [ COMMENT = '<string_literal>' ] }

ALTER PIPE <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PIPE <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PIPE [ IF EXISTS ] <name> UNSET { <property_name> | COMMENT } [ , ... ]

ALTER PIPE [ IF EXISTS ] <name> REFRESH { [ PREFIX = '<path>' ] [ MODIFIED_AFTER = <start_time> ] }
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   PIPE_EXECUTION_PAUSED = TRUE | FALSE
> ```

## Parameters

`name`
:   Specifies the identifier for the pipe to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one (or more) properties to set for the pipe (separated by blank spaces, commas, or new lines):

    `ERROR_INTEGRATION = 'integration_name'`
    :   Required only when configuring Snowpipe to send error notifications to a cloud messaging service. Specifies the name of the notification
        integration used to communicate with the messaging service. For more information, see [Snowpipe error notifications](../../user-guide/data-load-snowpipe-errors.md).

    `PIPE_EXECUTION_PAUSED = TRUE | FALSE`
    :   Specifies whether to pause a running pipe, typically in preparation for transferring ownership of the pipe:

        * `TRUE` pauses the pipe. The `executionState` reported by [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) is `PAUSED`.
          Note that the pipe owner can continue to submit files to a paused pipe; however, they won’t be processed until the pipe is resumed.
        * `FALSE` resumes the pipe. The `executionState` reported by [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) is `RUNNING`.

          > **Note:**
          >
          > Either of the following scenarios requires forcing a pipe to resume by calling the
          > [SYSTEM$PIPE_FORCE_RESUME](../functions/system_pipe_force_resume.md) function:
          >
          > + Transferring ownership of the pipe to another role. This requirement allows the new owner to evaluate the pipe status and
          >   determine how many files are waiting to be loaded by calling the [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) function.
          > + Allowing a pipe object that leverages cloud messaging to trigger data loads (i.e. where `AUTO_INGEST = TRUE` in the pipe
          >   definition) to become stale. A pipe is considered stale when it is paused for longer than the limited retention period for event
          >   messages received for the pipe (14 days by default).

        Default: `FALSE` (the pipe is running by default)

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string'`
    :   Adds a comment or overwrites an existing comment for the pipe.

`UNSET ...`
:   Specifies one (or more) properties to unset for the pipe, which resets them to the defaults:

    * `ERROR_INTEGRATION`
    * `PIPE_EXECUTION_PAUSED`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    You can reset multiple properties with a single ALTER statement; however, each property must be separated by a comma. When resetting
    a property, specify only the name; specifying a value for the property will return an error.

`REFRESH`
:   Copies a set of staged data files to the Snowpipe ingest queue for loading into the target table. This clause accepts an optional path and can
    further filter the list of files to load based on a specified start time.

    > **Note:**
    >
    > * This SQL command can only load data files that were staged within the last 7 days.
    > * This SQL command checks the load history for both the pipe and the target table. As a result, the command queues only those files
    >   that were not loaded already using either:
    >
    >   + The same pipe, provided the pipe owner did not recreate the pipe after the files were loaded.
    >   + A [COPY INTO <table>](copy-into-table.md) statement.

    > **Important:**
    >
    > The REFRESH functionality is intended for short term use to resolve specific issues when Snowpipe fails to load a subset of files and is not
    > intended for regular use.

    `PREFIX = 'path'`
    :   Path (or *prefix*) appended to the stage reference in the pipe definition. The path limits the set of files to load. Only files that start
        with the specified path are included in the data load.

        For example, suppose the pipe definition references `@mystage/path1/`. If the `path` value is `d1/`, the ALTER
        PIPE statement limits loads to files in the `@mystage` stage with the `/path1/d1/` path. See the examples for more
        information.

        Note that the path must be enclosed in single quotes.

    `MODIFIED_AFTER = 'start_time'`
    :   Timestamp (in ISO-8601 format) of the oldest data files to copy into the Snowpipe ingest queue based on the LAST_MODIFIED date (i.e. date
        when a file was staged).

        The default and maximum allowed value is 7 days.

## Usage notes

* Only the pipe owner (i.e. the role with the OWNERSHIP privilege on the pipe) can set or unset properties on a pipe.

  A non-owner role with the following minimum privileges can refresh a pipe (using ALTER PIPE … REFRESH …):

  | Privilege | Object | Notes |
  | --- | --- | --- |
  | OPERATE | Pipe |  |
  | USAGE | Stage in the pipe definition | External stages only |
  | READ | Stage in the pipe definition | Internal stages only |
  | SELECT, INSERT | Table in the pipe defintion |  |

  A non-owner role with the OPERATE privilege on the pipe can pause or resume a pipe (using ALTER PIPE … SET PIPE_EXECUTION_PAUSED = TRUE
  | FALSE).

  SQL operations on schema objects also require the USAGE privilege on the database and schema that contain the object.
* Currently, it is not possible to modify the following pipe properties using an ALTER PIPE statement:

  + [COPY INTO <table>](copy-into-table.md) statement
  + `AWS_SNS_TOPIC` parameter
  + `INTEGRATION` parameter

  Instead, recreate the pipe using a [CREATE OR REPLACE PIPE](create-pipe.md) statement.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Pause the `mypipe` pipe:

> ```sqlexample
> alter pipe mypipe SET PIPE_EXECUTION_PAUSED = true;
> ```

Add or modify the comment for pipe `mypipe`:

> ```sqlexample
> alter pipe mypipe SET COMMENT = "Pipe for North American sales data";
> ```

### Refreshing a pipe

Set up for examples:

> ```sqlexample
> CREATE PIPE mypipe AS COPY INTO mytable FROM @mystage/path1/;
> ```

Load data files from the `@mystage/path1/` stage and path into the `mytable` table, as defined in the `mypipe` pipe definition:

> ```sqlexample
> ALTER PIPE mypipe REFRESH;
> ```

Same as the previous example, but append `d1` to the path to further limit the list of files to load. In the current example, the statement
loads files from the `@mystage/path1/d1/` stage and path:

> ```sqlexample
> ALTER PIPE mypipe REFRESH PREFIX='d1/';
> ```

Same as the previous example, but only load files staged after a specified timestamp:

> ```sqlexample
> ALTER PIPE mypipe REFRESH PREFIX='d1/' MODIFIED_AFTER='2018-07-30T13:56:46-07:00';
> ```

---
title: ALTER POSTGRES INSTANCE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-postgres-instance.md
section: SQL Commands
---

# ALTER POSTGRES INSTANCE

Modifies the properties of an existing [Snowflake Postgres instance](../../user-guide/snowflake-postgres/about.md).

See also:
:   [CREATE POSTGRES INSTANCE](create-postgres-instance.md), [DESCRIBE POSTGRES INSTANCE](desc-postgres-instance.md), [DROP POSTGRES INSTANCE](drop-postgres-instance.md), [SHOW POSTGRES INSTANCES](show-postgres-instances.md)

## Syntax

```sqlsyntax
ALTER POSTGRES INSTANCE [ IF EXISTS ] <name>
  RENAME TO <new_name>

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> SET
  [ NETWORK_POLICY = '<network_policy>' ]
  [ AUTHENTICATION_AUTHORITY = { POSTGRES | POSTGRES_OR_SNOWFLAKE } ]
  [ COMMENT = '<string_literal>' ]
  [ HIGH_AVAILABILITY = { TRUE | FALSE } ]
  [ COMPUTE_FAMILY = '<compute_family>' ]
  [ STORAGE_SIZE_GB = <storage_gb> ]
  [ STORAGE_INTEGRATION = '<storage_integration_name>' ]
  [ POSTGRES_VERSION = { 16 | 17 | 18 } ]
  [ MAINTENANCE_WINDOW_START = <hour_of_day> ]
  [ POSTGRES_SETTINGS = '<json_string>' ]
  [ APPLY { IMMEDIATELY | ON '<timestamp>' } ]

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name>
  UNSET { COMMENT | POSTGRES_SETTINGS | NETWORK_POLICY
    | MAINTENANCE_WINDOW_START | STORAGE_INTEGRATION } [ , ... ]

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> SUSPEND

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> RESUME

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> RESET ACCESS
  FOR { 'snowflake_admin' | 'application' }

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> SET TAG <tag_name> =
  '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER POSTGRES INSTANCE [ IF EXISTS ] <name> UNSET TAG <tag_name>
  [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the Postgres instance to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the Postgres instance to the specified new name. The new identifier must be unique for the account.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

`RESET ACCESS FOR { 'snowflake_admin' | 'application' }`
:   Regenerates credentials for the `snowflake_admin` or `application` role. Returns one row with the following column:

    * `password`

    For more information, see [Snowflake Postgres Roles](../../user-guide/snowflake-postgres/postgres-roles.md).

`SET ...`
:   Sets one or more specified properties for the Postgres instance:

    `NETWORK_POLICY = 'network_policy'`
    :   Specifies the [network policy](../../user-guide/snowflake-postgres/postgres-network.md) to use for the instance.
        Changes to the policy may take up to 2 minutes to take effect.

        To specify this parameter, you must have been granted the USAGE privilege on the network policy object.

    `AUTHENTICATION_AUTHORITY = { POSTGRES | POSTGRES_OR_SNOWFLAKE }`
    :   Change the authentication method for the instance. POSTGRES indicates that only Postgres user passwords can be used.
        POSTGRES_OR_SNOWFLAKE also allows the use of short-lived access token passwords. See
        [Snowflake Token Authentication for Snowflake Postgres](../../user-guide/snowflake-postgres/postgres-token-auth.md) for more details.

    `COMMENT = 'string_literal'`
    :   Adds or overwrites an existing comment for the Postgres instance.

    `HIGH_AVAILABILITY = { TRUE | FALSE }`
    :   Enables or disables [high availability](../../user-guide/snowflake-postgres/high-availability.md) for the instance.
        Executes as an asynchronous operation. Run the DESCRIBE POSTGRES INSTANCE command and monitor the
        `operations` field to track progress.

        A high availability change can only be initiated if the instance is in the READY state and no other operation is running.

        > **Note:**
        >
        > Burstable instance sizes (BURST_XS, BURST_S, BURST_M) do not support high availability. To enable HA, you must
        > first change to a STANDARD or HIGHMEM compute family.

    `COMPUTE_FAMILY = 'compute_family'`
    :   Specifies the new [instance size](../../user-guide/snowflake-postgres/postgres-instance-sizes.md) for the Postgres instance.

    `STORAGE_SIZE_GB = storage_gb`
    :   Specifies the new storage size in GB. Both increases and decreases are supported.

        > **Note:**
        >
        > When you decrease the storage size, you can’t set it too close to current disk usage. The new size must be
        > at least 1.4x the disk space currently in use. That way, there’s still room to add more data without
        > triggering an automatic storage increase.

    `STORAGE_INTEGRATION = 'storage_integration_name'`
    :   Attaches a storage integration of type `POSTGRES_EXTERNAL_STORAGE` to the Postgres instance,
        enabling the pg_lake extension to access data in external object storage. For the complete setup
        procedure, see [Configuring S3 Storage for pg_lake](../../user-guide/snowflake-postgres/postgres-pg_lake.md).

        [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

    `POSTGRES_VERSION = { 16 | 17 | 18 }`
    :   Specifies the Postgres major version to upgrade to. You can only upgrade to a newer version; downgrading isn’t supported.

    `MAINTENANCE_WINDOW_START = hour_of_day`
    :   Specifies the hour of day (0-23, UTC) when a maintenance window can start. Maintenance windows are three hours long,
        starting from the specified hour.

    `POSTGRES_SETTINGS = 'json_string'`
    :   Specifies changes to the [Postgres server settings](../../user-guide/snowflake-postgres/postgres-server-settings.md)
        for the instance in JSON format:

        ```none
        '{"component:name" = "value", ...}'
        ```

        Some settings require an instance restart to take effect. These changes won’t be applied unless you specify
        `APPLY IMMEDIATELY`.

    `APPLY IMMEDIATELY`
    :   Overrides any defined maintenance window and applies the specified operations as soon as they’re ready.
        Applies to `COMPUTE_FAMILY`, `STORAGE_SIZE_GB`, `POSTGRES_VERSION`, and `POSTGRES_SETTINGS`.

    `APPLY ON 'timestamp'`
    :   Overrides any defined maintenance window and applies the specified operations at the given timestamp.
        The timestamp can’t be more than 72 hours in the future.

        Supported timestamp formats:

        * `yyyy-MM-dd`
        * `yyyy-MM-dd HH:mm`
        * `yyyy-MM-dd HH:mm:ss`
        * `yyyy-MM-dd HH:mm zzz`

`UNSET ...`
:   Unsets one or more specified properties for the Postgres instance, resetting them to their defaults:

    * `COMMENT`
    * `POSTGRES_SETTINGS`
    * `NETWORK_POLICY`
    * `MAINTENANCE_WINDOW_START` - Unsetting causes all ongoing operations to be applied as soon as they’re completed.
    * `STORAGE_INTEGRATION` - Removes the storage integration from the instance, disabling pg_lake access to external storage.

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

`SUSPEND`
:   Suspends the Postgres instance. The virtual machine is deactivated while the disk image is kept in storage.
    Normal billing is suspended, but storage costs continue to accrue. Existing backups are retained.

`RESUME`
:   Resumes a suspended Postgres instance. If there were operations pending restart, they’re applied when the instance resumes.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or OPERATE | Postgres instance | Required for modifying instance properties. |
| USAGE | Network policy | Required only if specifying a NETWORK_POLICY. |
| USAGE | Storage integration | Required only if specifying a STORAGE_INTEGRATION. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Changes to `COMPUTE_FAMILY`, `STORAGE_SIZE_GB`, and `POSTGRES_VERSION` are collectively referred to as
  “upgrade” operations and can be performed together. Run the DESCRIBE POSTGRES INSTANCE command and monitor the
  `operations` field to track progress.
* An upgrade operation can only be initiated if the instance is in the READY state and no other operation is running.
* If an instance has a defined maintenance window, changes won’t take effect until the maintenance window period starts,
  unless `APPLY IMMEDIATELY` is specified. Maintenance windows control *when* changes are applied, not whether
  the instance is running. For more details about maintenance operations, see
  [Snowflake Postgres instance management](../../user-guide/snowflake-postgres/managing-instances.md).
* **A brief service interruption is required to perform instance management operations.** Ensure that your applications
  can automatically reconnect to the database.
* SUSPEND and RESUME are immediate operations for stopping and starting instance billing. They are distinct from
  maintenance windows, which schedule when configuration changes (like upgrades or HA enablement) take effect.
* The connection string for an instance remains the same across instance management operations, unless you explicitly
  regenerate credentials.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Change the compute family and storage size for a Postgres instance:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres
  SET COMPUTE_FAMILY = 'STANDARD_M'
      STORAGE_SIZE_GB = 100;
```

Monitor the progress of the operation using DESCRIBE:

```sqlexample
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN
        ('name', 'state', 'operations', 'compute_family',
          'storage_size_gb');

-- Repeat until state shows 'READY'
```

Enable high availability for an instance:

```sqlexample
-- Check current HA status
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'high_availability',
        'state');

-- Enable HA (asynchronous operation)
ALTER POSTGRES INSTANCE my_postgres
  SET HIGH_AVAILABILITY = TRUE;

-- Monitor until operation completes
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'high_availability',
        'state');
```

Upgrade to Postgres 18:

```sqlexample
-- Check current Postgres version using flow operator
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "postgres_version", "state"
      FROM $1
      WHERE "name" = 'my_postgres';

-- Upgrade to version 18
ALTER POSTGRES INSTANCE my_postgres
  SET POSTGRES_VERSION = 18;
```

Apply changes immediately, overriding the maintenance window:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres
  SET COMPUTE_FAMILY = 'STANDARD_L'
  APPLY IMMEDIATELY;
```

Suspend a Postgres instance:

```sqlexample
-- Check state before suspending
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'state');

-- Suspend the instance
ALTER POSTGRES INSTANCE my_postgres SUSPEND;

-- Verify suspended state
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'state');
```

Resume a suspended instance:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres RESUME;
```

Rename a Postgres instance:

```sqlexample
ALTER POSTGRES INSTANCE my_postgres
  RENAME TO prod_postgres;
```

> **Note:**
>
> Renaming an instance changes its identifier in Snowflake but does *not* change the connection hostname. The
> hostname remains the same, so existing connections and applications continue to work without modification.

---
title: ALTER PRIVACY POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-privacy-policy.md
section: SQL Commands
---

# ALTER PRIVACY POLICY

Modifies the properties of an existing [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md).

> **Caution:**
>
> When changing `budget_limit`, `max_budget_per_aggregate`, or
> `budget_window`, any property not specified in your ALTER command will revert
> back to its default value. To obtain the current values of the parameters, execute the [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) command.

See also:
:   [CREATE PRIVACY POLICY](create-privacy-policy.md) , [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) , [DROP PRIVACY POLICY](drop-privacy-policy.md) , [SHOW PRIVACY POLICIES](show-privacy-policies.md)

## Syntax

```sqlsyntax
ALTER PRIVACY POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER PRIVACY POLICY [ IF EXISTS ] <name> SET BODY -> <expression>

ALTER PRIVACY POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PRIVACY POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PRIVACY POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER PRIVACY POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the privacy policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the privacy policy; must be unique for your schema. The new identifier cannot be used if the
    identifier is already in place for a different privacy policy.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the privacy policy:

    `BODY -> expression`
    :   Specifies a new body for the policy.

        The SQL expression of the body calls two functions to control the return value of the policy:
        NO_PRIVACY_POLICY and PRIVACY_BUDGET. When a query is executed against a table that has been assigned the
        policy, Snowflake evaluates the conditions of the body to call the appropriate function and return a value. This return value determines
        which privacy budget, if any, is associated with the query against the privacy-protected table.

        The expression can use context functions such as [CURRENT_ROLE](../functions/current_role.md) or [INVOKER_ROLE](../functions/invoker_role.md)
        to associate a user or group of users with a privacy budget.

        If you use a [CASE](../functions/case.md) block in the body’s expression, it must include an ELSE statement that
        calls either NO_PRIVACY_POLICY or PRIVACY_BUDGET. Every user must either be associated with a privacy budget or have unrestricted access to
        the privacy-protected table. If a user should not have any access to a privacy-protected table or view, revoke SELECT privileges rather than
        trying to define this in the privacy policy.

        `NO_PRIVACY_POLICY`
        :   Use the body’s expression to call the `NO_PRIVACY_POLICY` function when you want a query to have unrestricted access to the table or view to which the privacy policy is assigned.

        `PRIVACY_BUDGET`
        :   Use the body’s expression to call the `PRIVACY_BUDGET` function when you want to return a privacy budget from the policy. The
            expression can contain conditions that allow the policy to return different privacy budgets for different queries based on factors like
            the user who is executing the query.

            In cross-account collaboration, privacy budgets are automatically namespaced by the account identifier of the consumer account, which
            prevents two different consumer accounts from sharing the same privacy budget even if the name of the privacy budget is the same. Using
            the [CURRENT_ACCOUNT](../functions/current_account.md) function to concatenate the name of the account with the name of the privacy budget
            can help distinguish between privacy budgets. For example, you could call the function as follows:
            `PRIVACY_BUDGET(BUDGET_NAME => 'external_budget.' || CURRENT_ACCOUNT())`.

            The signature of the `PRIVACY_BUDGET` function is:

            ```sqlsyntax
            PRIVACY_BUDGET(
              BUDGET_NAME=> '<string>'
              [, BUDGET_LIMIT=> <decimal> ]
              [, MAX_BUDGET_PER_AGGREGATE=> <decimal> ]
              [, BUDGET_WINDOW=> <string> ]
            )
            ```

            **Privacy budget arguments:**

            `BUDGET_NAME => expression`
            :   Resolves to the name of a privacy budget. Snowflake creates the privacy budget automatically when its name is
                specified in the body of the privacy policy.

            `BUDGET_LIMIT => decimal`
            :   A decimal number > 0 that specifies the budget limit for this privacy policy.
                This controls the total amount of privacy loss allowed. Adjusting this value
                changes how many total differentially private aggregates can be calculated
                against tables protected by this privacy budget during the refresh period. When a query is run that would
                cause the cumulative privacy loss to exceed this number, the query will fail.
                As a rough estimate, a budget
                limit of 233 with `MAX_BUDGET_PER_AGGREGATE=1` permits about 1000 aggregates
                per refresh period.

                Default: 233.0

            `MAX_BUDGET_PER_AGGREGATE => decimal`
            :   Specifies how much privacy budget is used for each aggregate function in a
                query. Adjusting this value changes the amount of noise added to each aggregate
                query, as well as the number of aggregates that can be calculated before the budget limit is reached. As an example, the query
                `select count(*), avg(a) ...` has two aggregates: `count(*)` and `avg(a)`. Specify a decimal value > 0.

                Default: 0.5

            `BUDGET_WINDOW => string`
            :   How often the privacy budget is refreshed, that is, has its cumulative privacy loss reset to 0. Valid values:

                * `Daily`: Refreshed every day at 12:00 AM UTC
                * `Weekly`: Refreshed every Sunday at 12:00 AM UTC
                * `Monthly`: Refreshed on the first day of the calendar month at 12:00 AM UTC
                * `Yearly`: Refreshed on January 1 at 12:00 AM UTC
                * `Never`: Privacy budget is never refreshed.

                Default: Weekly

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the privacy policy.

        Default: No value

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset, by resetting them to their defaults, for the privacy policy:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Privacy policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you want to update an existing privacy policy and need to see the current definition of the policy, run the
  [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) command. You can also use the [GET_DDL](../functions/get_ddl.md) function to obtain the full definition
  of the privacy policy, including its body.
* Moving a privacy policy to a [managed access schema](../../user-guide/security-access-control-configure.md)
  (using the ALTER PRIVACY POLICY … RENAME TO syntax) is prohibited unless the privacy policy owner
  (that is, the role that has the OWNERSHIP privilege on the privacy policy) also owns the target schema.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Modify the body of a privacy policy `my_priv_policy` so it always returns a budget named `analysts`:

> ```sqlexample
> -- Modify the body of privacy policy "my_priv_policy" so it always returns a
> -- budget named "analysts"
> ALTER PRIVACY POLICY my_priv_policy SET BODY ->
>   PRIVACY_BUDGET(BUDGET_NAME => 'analysts');
>
> -- Set budget limit to 50 and max budget per aggregate to 0.1
> -- budget window is not mentioned so it is reset to its default value
> ALTER PRIVACY POLICY users_policy SET BODY ->
>   privacy_budget(budget_name=>'analysts', budget_limit=>50, max_budget_per_aggregate=>0.1);
> ```

---
title: ALTER PROCEDURE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-procedure.md
section: SQL Commands
---

# ALTER PROCEDURE

Modifies the properties for an existing stored procedure. If you need to make any changes not supported here, use [DROP PROCEDURE](drop-procedure.md)
instead and then recreate the stored procedure.

See also:
:   [CREATE PROCEDURE](create-procedure.md) , [DROP PROCEDURE](drop-procedure.md) , [SHOW PROCEDURES](show-procedures.md) , [DESCRIBE PROCEDURE](desc-procedure.md), [SHOW USER PROCEDURES](show-user-procedures.md)

## Syntax

The syntax for ALTER PROCEDURE varies depending on which language you’re using as the UDF handler.

### Java handler

```sqlsyntax
ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = '<integration_name>' [ , '<integration_name>' ... ] ]
  [ SECRETS = '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ]
  [ COMMENT = '<string_literal>' ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET COMMENT

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }
```

### JavaScript handler

```sqlsyntax
ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ COMMENT = '<string_literal>' ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET COMMENT

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }
```

### Python handler

```sqlsyntax
ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = '<integration_name>' [ , '<integration_name>' ... ] ]
  [ SECRETS = '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ]
  [ COMMENT = '<string_literal>' ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET COMMENT

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }
```

### Scala handler

```sqlsyntax
ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = '<integration_name>' [ , '<integration_name>' ... ] ]
  [ SECRETS = '<secret_variable_name>' = <secret_name> [ , '<secret_variable_name>' = <secret_name> ... ] ]
  [ COMMENT = '<string_literal>' ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET COMMENT

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }
```

### Snowflake Scripting handler

```sqlsyntax
ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) RENAME TO <new_name>

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET
  [ AUTO_EVENT_LOGGING = '<option>' ]
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ COMMENT = '<string_literal>' ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET COMMENT

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROCEDURE [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }
```

## Parameters

`name`
:   Specifies the identifier for the stored procedure to alter. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`arg_data_type [ , ... ]`
:   Specifies the data type of the argument(s) for the stored procedure, if it has arguments. The argument types are required because stored
    procedures support name overloading (i.e. two stored procedures in the same schema can have the same name) and the argument types are used to
    identify the procedure you wish to alter.

`RENAME TO new_name`
:   Specifies the new identifier for the stored procedure; the combination of the identifier and existing argument data types must be unique for
    the schema.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies the properties to set for the stored procedure.

    `AUTO_EVENT_LOGGING = 'option'`
    :   (For Snowflake Scripting stored procedures only) Controls whether additional Snowflake Scripting log messages and trace events are
        ingested automatically into the [event table](../../developer-guide/logging-tracing/event-table-setting-up.md).

        For information about the options, see [AUTO_EVENT_LOGGING](../parameters.md).

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
        the specified level (and at more severe levels) are ingested.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `TRACE_LEVEL = 'trace_level'`
    :   Controls how trace events are ingested into the event table.

        For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting trace level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
    :   The names of [external access integrations](create-external-access-integration.md) needed in order for this
        procedure’s handler code to access external networks.

        An external access integration contains [network rules](create-network-rule.md) and
        [secrets](create-secret.md) that specify the external locations and credentials (if any) needed for handler code
        to make requests of an external network, such as an external REST API.

        For more information, refer to [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

    `SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
    :   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
        secrets in handler code.

        This parameter’s value is a list of assignment expressions with the following parts:

        * `secret_name` as the name of a secret specified in an
          [external access integration’s](create-external-access-integration.md) ALLOWED_AUTHENTICATION_SECRETS parameter
          value. That external access integration’s name must, in turn, be specified as a value of this CREATE PROCEDURE call’s
          EXTERNAL_ACCESS_INTEGRATIONS parameter.

          You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
          EXTERNAL_ACCESS_INTEGRATIONS parameter.
        * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the stored procedure. The value you specify is displayed in the `DESCRIPTION`
        column in the output for [SHOW PROCEDURES](show-procedures.md).

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies the properties to unset for the stored procedure, which resets them to the defaults.

    Currently, the only properties you can unset are:

    * `COMMENT`, which removes the comment, if any, for the procedure.
    * `TAG tag_name [ , tag_name ... ]`

`EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Restricted caller’s rights (`EXECUTE AS RESTRICTED CALLER`) is a preview feature available to all accounts.

    Specifies whether the stored procedure executes with the privileges of the owner (an “owner’s rights” stored procedure) or with
    the privileges of the caller (a “caller’s rights” stored procedure):

    * If you execute ALTER PROCEDURE … EXECUTE AS OWNER, then in the future the procedure will execute as an owner’s rights procedure.
    * If you execute the statement ALTER PROCEDURE … EXECUTE AS CALLER, then in the future the procedure will execute as a
      caller’s rights procedure.
    * If you execute the statement ALTER PROCEDURE … EXECUTE AS RESTRICTED CALLER, then in the future the procedure will execute as a
      caller’s rights procedure, but might not be able to run with all of the caller’s privileges. For more information, see
      [Restricted caller’s rights](../../developer-guide/restricted-callers-rights.md).

    If `EXECUTE AS ...` isn’t specified, the procedure runs as an owner’s rights stored procedure. Owner’s rights stored
    procedures have less access to the caller’s environment (for example, the caller’s session variables), and Snowflake defaults to this
    higher level of privacy and security.

    For more information, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

    Default: `EXECUTE AS OWNER`

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename stored procedure `procedure1` to `procedure2`:

> ```sqlexample
> ALTER PROCEDURE IF EXISTS procedure1(FLOAT) RENAME TO procedure2;
> ```

---
title: ALTER PROJECTION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-projection-policy.md
section: SQL Commands
---

# ALTER PROJECTION POLICY

Replaces the existing [projection policy](../../user-guide/projection-policies.md) rules with new rules or a new comment and allows the
renaming of a projection policy.

Any changes made to the policy rules go into effect when the next SQL query that uses the projection policy runs.

See also:
:   [Projection policy DDL reference](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
ALTER PROJECTION POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER PROJECTION POLICY [ IF EXISTS ] <name> SET BODY -> <expression>

ALTER PROJECTION POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER PROJECTION POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER PROJECTION POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER PROJECTION POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the projection policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the projection policy; must be unique for your schema. The new identifier cannot be used if the
    identifier is already in place for a different projection policy.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the projection policy:

    `BODY -> expression`
    :   SQL expression that determines whether to project the column.

        The expression can contain CASE and other logic statements, but must call the PROJECTION_CONSTRAINT function:

        ```sqlsyntax
        PROJECTION_CONSTRAINT(ALLOW=>{TRUE|FALSE}, ENFORCEMENT=><enforcement_style>)
        ```

        * `ALLOW => { TRUE | FALSE }` - TRUE allows the column to be projected. FALSE prevents the column from being projected, with the behavior
          specified by ENFORCEMENT. FALSE affects only columns that appear in the final results table.
        * `ENFORCEMENT => 'enforcement_style'` - If ALLOW=FALSE, specifies what should happen if a query includes a protected column.
          Supported values:

          + FAIL - The query will fail if a protected column is included in the outermost query.
          + NULLIFY - All rows in the protected column return the value NULL.

          Default: FAIL

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the projection policy.

        Default: No value

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset, by resetting them to their defaults, for the projection policy:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

    When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Projection policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on projection policy DDL and privileges, see [Privileges and commands](../../user-guide/projection-policies.md).

## Usage notes

* If you want to update an existing projection policy and need to see the current definition of the policy, run the
  [DESCRIBE PROJECTION POLICY](desc-projection-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.
* Moving a projection policy to a [managed access schema](../../user-guide/security-access-control-configure.md)
  (using the ALTER PROJECTION POLICY … RENAME TO syntax) is prohibited unless the projection policy owner
  (i.e. the role that has the OWNERSHIP privilege on the projection policy) also owns the target schema.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename a projection policy:

> ```sqlexample
> ALTER PROJECTION POLICY mypolicy RENAME TO proj_policy_acctnumber;
> ```

---
title: ALTER REPLICATION GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/alter-replication-group.md
section: SQL Commands
---

# ALTER REPLICATION GROUP

Modifies the properties for an existing [replication group](../../user-guide/account-replication-intro.md).

From the source account, you can perform the following actions:

* Rename the replication group.
* Reset the list of specified object types enabled for replication.
* Set or update the replication schedule for automatic refresh of secondary replication groups.
* Add or remove account objects of the following types to or from a replication group:

  + Databases
  + External volumes
  + Shares
  + Security integrations
  + API integrations
  + Storage integrations
  + External access integrations
  + Certain types of notification integrations (see [Integration replication](../../user-guide/account-replication-intro.md))
* Add or remove target accounts enabled for replication.
* Move databases or shares from one replication group to another replication group.

From the target account, you can perform the following actions:

* Refresh objects in the target account from the source account.
* Suspend scheduled replication.
* Resume scheduled replication.

See also:
:   [CREATE REPLICATION GROUP](create-replication-group.md) , [DROP REPLICATION GROUP](drop-replication-group.md) , [SHOW REPLICATION GROUPS](show-replication-groups.md),
    [SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH](../functions/system_schedule_async_replication_group_refresh.md)

## Syntax

**Source Account**

```sqlsyntax
ALTER REPLICATION GROUP [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  [ OBJECT_TYPES = <object_type> [ , <object_type> , ... ] ]
  [ ALLOWED_DATABASES = <db_name> [ , <db_name> , ... ] ]
  [ ALLOWED_EXTERNAL_VOLUMES = <external_volume_name> [ , <external_volume_name> , ... ] ]
  [ ALLOWED_SHARES = <share_name> [ , <share_name> , ... ] ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  OBJECT_TYPES = INTEGRATIONS [ , <object_type> , ... ]
  ALLOWED_INTEGRATION_TYPES = <integration_type_name> [ , <integration_type_name> ... ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  COMMENT = '<string_literal>'

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  REPLICATION_SCHEDULE = '{ <num> MINUTE | USING CRON <expr> <time_zone> }'

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  ERROR_INTEGRATION = <integration_name>

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SET
  TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name> UNSET
  { COMMENT | REPLICATION_SCHEDULE | ERROR_INTEGRATION } [ , ... ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name> UNSET
  TAG <tag_name> [ , <tag_name> ... ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  ADD <db_name> [ , <db_name> ,  ... ] TO ALLOWED_DATABASES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  MOVE DATABASES <db_name> [ , <db_name> ,  ... ] TO REPLICATION GROUP <move_to_rg_name>

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  REMOVE <db_name> [ , <db_name> ,  ... ] FROM ALLOWED_DATABASES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  ADD <external_volume_name> [ , <external_volume_name> ,  ... ] TO ALLOWED_EXTERNAL_VOLUMES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  MOVE EXTERNAL VOLUMES <external_volume_name> [ , <external_volume_name> ,  ... ] TO REPLICATION GROUP <move_to_rg_name>

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  REMOVE <external_volume_name> [ , <external_volume_name> ,  ... ] FROM ALLOWED_EXTERNAL_VOLUMES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  ADD <share_name> [ , <share_name> ,  ... ] TO ALLOWED_SHARES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  MOVE SHARES <share_name> [ , <share_name> ,  ... ] TO REPLICATION GROUP <move_to_rg_name>

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  REMOVE <share_name> [ , <share_name> ,  ... ] FROM ALLOWED_SHARES

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  ADD <org_name>.<target_account_name> [ , <org_name>.<target_account_name> ,  ... ] TO ALLOWED_ACCOUNTS
  [ IGNORE EDITION CHECK ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name>
  REMOVE <org_name>.<target_account_name> [ , <org_name>.<target_account_name> ,  ... ] FROM ALLOWED_ACCOUNTS
```

**Target Account**

```sqlsyntax
ALTER REPLICATION GROUP [ IF EXISTS ] <name> REFRESH

ALTER REPLICATION GROUP [ IF EXISTS ] <name> SUSPEND [ IMMEDIATE ]

ALTER REPLICATION GROUP [ IF EXISTS ] <name> RESUME
```

## Parameters

**Source Account**

`name`
:   Specifies the identifier for the replication group.

`RENAME TO new_name`
:   `new_name`
    :   Specifies the new identifier for the replication group. The new identifier cannot be used if the identifier is already in place for a
        different replication or failover group.

        For more details, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies properties to set for the replication group (separated by blank spaces, commas, or new lines).

    `OBJECT_TYPES = object_type [ , object_type , ... ]`
    :   Reset the list of object types to replicate from the source account to target account(s).

        > **Note:**
        >
        > For database, external volume, and share objects:
        >
        > * If DATABASES, EXTERNAL VOLUMES, or SHARES are included in the OBJECT_TYPES list, and remain in the OBJECT_TYPES list after
        >   the list is reset, the respective allowed objects list (ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES) remains
        >   unchanged.
        > * If the OBJECT_TYPES list is reset to add or remove DATABASES, the ALLOWED_DATABASES list is set to NULL.
        > * If the OBJECT_TYPES list is reset to add or remove EXTERNAL VOLUMES, the ALLOWED_EXTERNAL_VOLUMES list is set to NULL.
        > * If the OBJECT_TYPES list is reset to add or remove SHARES, the ALLOWED_SHARES list is set to NULL.
        > * Use the ADD, MOVE, and REMOVE clauses to modify the list of allowed database, external volume, or share objects.

        The following object types are supported:

        > ACCOUNT PARAMETERS:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All account-level parameters. This includes [account parameters](../parameters.md) and parameters that can be
        >     [set for your account](../../user-guide/admin-account-management.md).
        >
        > DATABASES:
        > :   Add database objects to the list of object types. If database objects were already included in the list of specified object
        >     types, the `ALLOWED_DATABASES` list remains unchanged. To modify the list of databases, use the
        >     ADD, MOVE, or REMOVE clauses.
        >
        > EXTERNAL VOLUMES:
        > :   Add external volume objects to the list of object types. If external volume objects are included in the list of specified object types,
        >     the `ALLOWED_EXTERNAL_VOLUMES` parameter must be set. To modify the list of external volumes, use the ADD, MOVE, or REMOVE clauses.
        >
        > INTEGRATIONS:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     Currently, only security, API, storage, external access, and certain types of notification integrations are supported.
        >     For details, see [Integration replication](../../user-guide/account-replication-intro.md).
        >
        >     If integration objects are included in the list of specified object types, the
        >     `ALLOWED_INTEGRATION_TYPES` parameter must be set.
        >
        > NETWORK POLICIES:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All network policies in the source account.
        >
        > RESOURCE MONITORS:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All resource monitors in the source account.
        >
        > ROLES:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All roles in the source account. Replicating roles implicitly includes all grants for object types included in the replication group.
        >     For example, if `ROLES` is the only object type that is replicated, then only hierarchies of roles (that is, roles granted to
        >     other roles) are replicated to target accounts. If the `USERS` object type is also included, then role grants to users are
        >     also replicated.
        >
        > SHARES:
        > :   Add share objects to the list of object types. If share objects were already included in the list of specified object types, the
        >     `ALLOWED_SHARES` list remains unchanged. To modify the list of shares, use the ADD, MOVE, or REMOVE clauses.
        >
        > USERS:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All users in the source account.
        >
        > WAREHOUSES:
        > :   *Requires Business Critical Edition (or higher).*
        >
        >     All warehouses in the source account.

        > **Note:**
        >
        > If you replicate users and roles, programmatic access tokens for users are replicated automatically.

    `ALLOWED_DATABASES = db_name [ , db_name , ... ]`
    :   Specifies the database or list of databases for which you are enabling replication from the source account to the target
        account. In order for you to set this parameter, the `OBJECT_TYPES` list must include `DATABASES`.

        `db_name`
        :   Specifies the identifier for the database.

    `ALLOWED_EXTERNAL_VOLUMES = external_volume_name [ , external_volume_name , ... ]`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the external volume or list of external volumes for which you are enabling replication from the source account to the target
        account. For you to set this parameter, the `OBJECT_TYPES` list must include `EXTERNAL VOLUMES`.

        `external_volume_name`
        :   Specifies the identifier for the external volume.

    `ALLOWED_SHARES = share_name [ , share_name , ... ]`
    :   Specifies the share or list of shares for which you are enabling replication from the source account to the target account.
        For you to set this parameter, the `OBJECT_TYPES` list must include `SHARES`.

        `share_name`
        :   Specifies the identifier for the share.

    > **Note:**
    >
    > If the ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES lists are modified, any objects that were previously in the list and removed
    > will be dropped in any target account with a linked secondary replication group when the next refresh operation occurs.

    `ALLOWED_INTEGRATION_TYPES = integration_type_name [ , integration_type_name , ... ]`
    :   *Requires Business Critical Edition (or higher).*

        Type(s) of integrations for which you are enabling replication from the source account to the target account.

        > This property requires that the `OBJECT_TYPES` list include `INTEGRATIONS` to set this parameter.
        >
        > The following integration types are supported:
        >
        > > SECURITY INTEGRATIONS:
        > > :   Specifies security integrations.
        > >
        > >     This property requires that the `OBJECT_TYPES` list include `ROLES`.
        > >
        > > API INTEGRATIONS:
        > > :   Specifies API integrations.
        > >
        > >     API integration replication requires additional set up after the API integration is replicated to the target account.
        > >     For more information, see [Updating the remote service for API integrations](../../user-guide/account-replication-config.md).
        > >
        > > STORAGE INTEGRATIONS:
        > > :   Specifies storage integrations.
        > >
        > > EXTERNAL ACCESS INTEGRATIONS:
        > > :   Specifies [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md).
        > >
        > >     For more information, see [Replication of stored procedures and user-defined functions (UDFs)](../../user-guide/account-replication-considerations.md).
        > >
        > > NOTIFICATION INTEGRATIONS:
        > > :   Specifies notification integrations.
        > >
        > >     Only some types of notification integrations are replicated. For details, see
        > >     [Integration replication](../../user-guide/account-replication-intro.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the replication group.

        Default:
        :   `NULL`

    `REPLICATION_SCHEDULE ...`
    :   Specifies the schedule for refreshing secondary replication groups.

        * `USING CRON expr time_zone`
          :   Specifies a cron expression and time zone for the secondary group refresh. Supports a subset of standard cron utility syntax.

              For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
              (in Wikipedia).

              The cron expression consists of the following fields:

              ```output
              # __________ minute (0-59)
              # | ________ hour (0-23)
              # | | ______ day of month (1-31, or L)
              # | | | ____ month (1-12, JAN-DEC)
              # | | | | __ day of week (0-6, SUN-SAT, or L)
              # | | | | |
              # | | | | |
                * * * * *
              ```

              The following special characters are supported:

              `*`
              :   Wildcard. Specifies any occurrence of the field.

              `L`
              :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a
                  given month. In the day-of-month field, it specifies the last day of the month.

              `/n`
              :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
                  specified in the month field, then the refresh is scheduled for April, July and October (i.e. every 3 months, starting with the 4th
                  month of the year). The same schedule is maintained in subsequent years. That is, the refresh is not scheduled to run in
                  January (3 months after the October run).

              > **Note:**
              > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
              >   for the account (or setting the value at the user or session level) does not change the time zone for the refresh.
              > + The cron expression defines all valid run times for the refresh. Snowflake attempts to refresh secondary groups based on
              >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
              > + When both a specific day of month and day of week are included in the cron expression, then the refresh is scheduled on days
              >   satisfying either the day of month or day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
              >   schedules a refresh at 0AM on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
        * `num MINUTE`
          :   Specifies an interval (in minutes) of wait time between refreshes. Accepts positive integers only.

              Also supports `num M` syntax.

              To avoid ambiguity, a *base interval time* is set:

              + When the object is created (using CREATE <object>) or
              + When a different interval is set (using ALTER <object> … SET REPLICATION_SCHEDULE)

              The base interval time starts the interval counter from the current clock time. For example, if an INTERVAL value of `10` is set and
              the scheduled refresh is enabled at 9:03 AM, then the refresh runs at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to
              ensure absolute precision, but only guarantee that refreshes do not execute before their set interval occurs (e.g. in the
              current example, the refresh could first run at 9:14 AM, but will definitely not run at 9:12 AM).

              > **Note:**
              >
              > The maximum supported value is `11520` (8 days). If the replication schedule has a greater `num MINUTE` value, the
              > refresh operation never runs.

        Default:
        :   `NULL`

    `ERROR_INTEGRATION = integration_name`
    :   Specifies the name of the notification integration to use to email/push notifications when refresh errors occur for the replication
        group. For more details, see [Error notifications for replication and failover groups](../../user-guide/account-replication-error-notifications.md).

        Default:
        :   `NULL`

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ADD db_name [ , db_name ,  ... ] TO ALLOWED_DATABASES`
:   Specifies a comma-separated list of databases to add to the list of databases enabled for replication. To add a database, DATABASES must
    be included in the list of specified object types. If the list of object types does not already include DATABASES, you must add it.

    > `db_name`
    > :   Specifies the identifier for the database.

`MOVE DATABASES db_name [ , db_name ,  ... ] TO REPLICATION GROUP move_to_rg_name`
:   Specifies a comma-separated list of databases to move from one replication group to another replication group. The replication group the
    databases are being moved to must include DATABASES in the list of specified object types.

    > `db_name`
    > :   Specifies the identifier for the database.
    >
    > `move_to_rg_name`
    > :   Specifies the identifier for the replication group the databases are being moved to.

`REMOVE db_name [ , db_name ,  ... ] FROM ALLOWED_DATABASES`
:   Specifies a comma-separated list of database to remove from the list of databases enabled for replication.

    > **Note:**
    >
    > When you remove a database from a primary replication group, the database is dropped in any target account with a linked secondary
    > replication group when the next refresh operation occurs.
    >
    > To avoid dropping databases in the target account, you can drop the secondary replication group *before* the next time the modified
    > primary replication group is replicated to the target account. When you drop the secondary replication group, read-only secondary
    > databases that were included in the group become standalone read-write databases in the target account.

`ADD external_volume_name [ , external_volume_name ,  ... ] TO ALLOWED_EXTERNAL_VOLUMES`
:   Specifies a comma-separated list of external volumes to add to the list of external volumes enabled for replication. To add an external volume,
    EXTERNAL VOLUMES must be included in the list of specified object types. If the list of object types does not already include EXTERNAL VOLUMES,
    you must add it.

    > `external_volume_name`
    > :   Specifies the identifier for the external volume.

`MOVE EXTERNAL VOLUMES external_volume_name [ , external_volume_name ,  ... ] TO REPLICATION GROUP move_to_rg_name`
:   Specifies a comma-separated list of external volumes to move from one replication group to another replication group. The replication group the
    external volumes are being moved to must include EXTERNAL VOLUMES in the list of specified object types.

    > `db_name`
    > :   Specifies the identifier for the external volume.
    >
    > `move_to_rg_name`
    > :   Specifies the identifier for the replication group the external volumes are being moved to.

`REMOVE external_volume_name [ , external_volume_name ,  ... ] FROM ALLOWED_EXTERNAL_VOLUMES`
:   Specifies a comma-separated list of external volumes to remove from the list of external volumes enabled for replication.

    > **Note:**
    >
    > When you remove an external volume from a primary replication group, the external volume is dropped in any target account with a
    > linked secondary replication group when the next refresh operation occurs.
    >
    > To avoid dropping external volumes in the target account, you can drop the secondary replication group before the next time the
    > modified primary replication group is replicated to the target account. When you drop the secondary replication group, read-only
    > secondary external volumes that were included in the group become standalone read-write external volumes in the target account.

`ADD share_name [ , share_name ,  ... ] TO ALLOWED_SHARES`
:   Specifies a comma-separated list of shares to the list of shares for replication. To add a share, SHARES must be included in the list of
    specified object types. If the list of object types doesn’t already include SHARES, you must add it.

    > `share_name`
    > :   Specifies the identifier for the share.

`MOVE SHARES share_name [ , share_name ,  ... ] TO REPLICATION GROUP move_to_rg_name`
:   Specifies a comma-separated list of shares to move from one replication group to another replication group. The replication group the
    shares are being moved to must include SHARES in the list of specified object types.

    > `share_name`
    > :   Specifies the identifier for the share.
    >
    > `move_to_rg_name`
    > :   Specifies the identifier for the replication group the shares are being moved to.

`REMOVE share_name [ , share_name ,  ... ] FROM ALLOWED_SHARES`
:   Specifies a comma-separated list of shares to remove from the list of shares enabled for replication.

    > **Note:**
    >
    > When you remove a share from a primary replication group, the share is dropped in any target account with a linked secondary
    > replication group when the next refresh operation occurs.

`ADD org_name.target_account_name [ , org_name.target_account_name ,  ... ] TO ALLOWED_ACCOUNTS`
:   Specifies a comma-separated list of target accounts to add to the primary replication group to enable replication of specified objects in
    the source account to the target account.

    > `org_name`
    > :   Name of your Snowflake organization.
    >
    > `target_account_name`
    > :   Target account to which you are enabling replication of the specified objects.

`REMOVE org_name.target_account_name [ , org_name.target_account_name ,  ... ] FROM ALLOWED_ACCOUNTS`
:   Specifies a comma-separated list of target accounts to remove from the primary replication group to disable replication of specified
    objects in the source account to the target account.

    > `org_name`
    > :   Name of your Snowflake organization.
    >
    > `target_account_name`
    > :   Target account to which you are disabling replication of the specified objects.

`IGNORE EDITION CHECK`
:   Allows replicating objects to accounts on lower editions in either of the following scenarios:

    * A primary replication group with only database and/or share objects is in a Business Critical (or higher) account but
      one or more accounts approved for replication are on lower editions. Business Critical Edition is intended for Snowflake accounts
      with extremely sensitive data.
    * A primary replication group with any [object type](create-replication-group.md) is in a Business
      Critical (or higher) account and a signed business associate agreement is in place to store PHI data in the account per HIPAA and
      [HITRUST](../../user-guide/intro-cloud-platforms.md) regulations. However, no such agreement is in place for one or more of the accounts approved
      for replication, regardless if they are Business Critical (or higher) accounts.

    Both scenarios are prohibited by default in an effort to help prevent account administrators for Business Critical (or higher) accounts
    from inadvertently replicating sensitive data to accounts on lower editions.

**Target Account**

`name`
:   Specifies the identifier for the replication group.

`REFRESH`
:   Refreshes the objects in the target (current) account from the source account.

`SUSPEND [ IMMEDIATE ]`
:   Suspends the scheduled refresh of the secondary replication group (if the primary replication group has automatically scheduled refresh set
    using the `REPLICATION_SCHEDULE` property).

    The optional `IMMEDIATE` clause cancels a scheduled refresh operation that is currently in progress for the secondary replication group
    (if there is one). Note that there might be a slight delay between the time that the statement returns and the time that the cancellation
    of the refresh operation is finished.

`RESUME`
:   Resume scheduled refresh of the secondary replication group (if the primary replication group has automatically scheduled refresh set
    using the `REPLICATION_SCHEDULE` property).

`UNSET ...`
:   Specifies one (or more) properties to unset for the replication group, which resets them to the defaults:

    * `COMMENT`
    * `REPLICATION_SCHEDULE`
    * `ERROR_INTEGRATION`
    * `TAG tag_name [ , tag_name ... ]`

    You can reset multiple properties with a single ALTER statement; however, each property must be separated by
    a comma. Also, when resetting a property, you only specify the name; no value is required.

## Usage notes

* The following minimal privileges are required:

  + To refresh a secondary replication group using ALTER REPLICATION GROUP … REFRESH, the active, primary role must have either the
    OWNERSHIP or REPLICATE privilege on the replication group.
  + To make any other changes to the replication group, only a user with a role with the OWNERSHIP privilege on the group can execute
    this SQL command.
  + To add a database to a replication group, the active role must have the MONITOR privilege on the database.
  + To add an external volume to a replication group, the active role must have the USAGE privilege on the external volume.
  + To add a share to a replication group, the active role must have the OWNERSHIP privilege on the share.
* Identifiers for failover groups and replication groups in an account must be unique.
* Objects other than databases, external volumes, and shares must be in the same replication group.
* A database can only be added to one replication or failover group.
* An external volume can only be added to one replication or failover group.
* To move databases, external volumes, or shares from one replication group (the move-from group) to another replication group (the move-to group):

  + Both groups must be of the same type: REPLICATION GROUP.
  + If the last database in the move-from group is moved to another group, the `allowed_databases` property for the move-from group
    is set to NULL. The same behavior applies to shares and external volumes.
  + If the move-to group doesn’t have the object type that is being moved (`databases`, `external volumes`, or `shares`) in the `object_types`
    list, it must be explicitly added to the move-to group before you move the objects.
* If database, external volume, or share objects are removed from a primary replication group (by using the REMOVE parameter or SET parameter to
  modify the ALLOWED_DATABASES, ALLOWED_EXTERNAL_VOLUMES, or ALLOWED_SHARES lists), those objects are dropped in any target account with a
  linked secondary replication group when the next refresh operation occurs.

  To avoid dropping these objects in the target account, you can drop the secondary replication group *before* the next time the modified
  primary replication group is replicated to the target account.
* [Inbound shares](../../user-guide/data-share-consumers.md) (shares from providers) *cannot* be added to a replication or failover group.
* To retrieve the list of accounts in your organization that are enabled for replication, use the
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md) command.
* To retrieve the list of replication groups in your organization, use the [SHOW REPLICATION GROUPS](show-replication-groups.md) command. The
  `allowed_accounts` column lists all target accounts enabled for replication from a source account.
* Automatically [scheduled refresh operations](../../user-guide/account-replication-intro.md) are executed using the role with the OWNERSHIP
  privilege on the group. If a scheduled refresh operation fails due to insufficient privileges, grant the required privileges
  to the role with the OWNERSHIP privilege on the group.
* The ALTER REPLICATION GROUP … SUSPEND IMMEDIATE command doesn’t cancel an in-progress refresh operation if it was manually triggered.
  For more information, see [Cancel an in-progress refresh operation that wasn’t automatically scheduled](../../user-guide/account-replication-failover-failback.md).
* Canceling an in-progress refresh operation that is in the SECONDARY_DOWNLOADING_METADATA or SECONDARY_DOWNLOADING_DATA phase might
  result in an inconsistent state on the target account. For more information see [View the current phase of an in-progress refresh operation](../../user-guide/account-replication-failover-failback.md).

* If you create a replication or failover group with a tag or modify a replication or failover group by setting a tag on it,
  [tag inheritance](../../user-guide/object-tagging/inheritance.md) does not apply to any objects that you specify in the replication or failover group.

  Tag inheritance is only applicable to objects with a [parent-child relationship](../../user-guide/security-access-control-overview.md), such
  database, schema, and table. There are no child objects of replication or failover groups.
* You cannot set a tag or modify a tag on a secondary replication or failover group because these objects are read
  only.
* When you refresh a secondary replication or failover group, any tags that are set on the primary group are then set on
  the secondary group.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Executed from the source account

Add `myorg.myaccount3` to the list of target accounts to which replication of specified objects from the source account is enabled:

```sqlexample
ALTER REPLICATION GROUP myrg ADD myorg.myaccount3 TO ALLOWED_ACCOUNTS;
```

Reset the object types list for replication in the source account:

```sqlexample
ALTER REPLICATION GROUP myrg SET
  OBJECT_TYPES = DATABASES, SHARES;
```

Add database `db1` to the list of databases enabled for replication:

```sqlexample
ALTER REPLICATION GROUP myrg
  ADD db1 to ALLOWED_DATABASES;
```

Add share `s2` to the list of shares enabled for replication:

```sqlexample
ALTER REPLICATION GROUP myrg
  ADD s2 TO ALLOWED_SHARES;
```

Move database `db1` to another replication group, `myrg2`:

```sqlexample
ALTER REPLICATION GROUP myrg
  MOVE DATABASES db1 TO REPLICATION GROUP myrg2;
```

Set the scheduled refresh interval time to 15 minutes:

```sqlexample
ALTER REPLICATION GROUP myrg SET
  REPLICATION_SCHEDULE = '15 MINUTE';
```

### Executed from the target account

Refresh objects in the replication group `myrg` in the target account:

```sqlexample
ALTER REPLICATION GROUP myrg REFRESH;
```

Suspend automatic refreshes:

```sqlexample
ALTER REPLICATION GROUP myrg SUSPEND;
```

---
title: ALTER RESOURCE MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/alter-resource-monitor.md
section: SQL Commands
---

# ALTER RESOURCE MONITOR

Modifies the properties and triggers for an existing [resource monitor](../../user-guide/resource-monitors.md). Use this command to
increase or decrease the credit quota, change the
scheduling information, or change/replace the triggers for a resource monitor.

See also:
:   [CREATE RESOURCE MONITOR](create-resource-monitor.md) , [DROP RESOURCE MONITOR](drop-resource-monitor.md) , [SHOW RESOURCE MONITORS](show-resource-monitors.md) , [ALTER WAREHOUSE](alter-warehouse.md) , [ALTER ACCOUNT](alter-account.md)

## Syntax

```sqlsyntax
ALTER RESOURCE MONITOR [ IF EXISTS ] <name> [ SET { [ CREDIT_QUOTA = <num> ]
                                                    [ FREQUENCY = { MONTHLY | DAILY | WEEKLY | YEARLY | NEVER } ]
                                                    [ START_TIMESTAMP = { <timestamp> | IMMEDIATELY } ]
                                                    [ END_TIMESTAMP = <timestamp> ]
                                                    [ NOTIFY_USERS = ( <user_name> [ , <user_name> , ... ] ) ] } ]
                                            [ TRIGGERS triggerDefinition [ triggerDefinition ... ] ]
```

Where:

> ```sqlsyntax
> triggerDefinition ::=
>    ON <threshold> PERCENT DO { SUSPEND | SUSPEND_IMMEDIATE | NOTIFY }
> ```

## Parameters

`name`
:   Specifies the identifier for the resource monitor to alter. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   `CREDIT_QUOTA = num`
    :   Specifies the number of credits allocated to the resource monitor per frequency interval. When total usage for all warehouses assigned to
        the monitor reaches this number for the current frequency interval, the resource monitor is considered to be at 100% of quota.

        If a value is not specified for a resource monitor, the monitor has no quota and will never reach 100% usage within the specified interval.

    `FREQUENCY = MONTHLY | DAILY | WEEKLY | YEARLY | NEVER`
    :   The frequency interval at which the credit usage resets to `0`.

        If you specify `NEVER` for the frequency, the credit usage for the warehouse does not reset.

    `START_TIMESTAMP = timestamp | IMMEDIATELY`
    :   The date and time when the resource monitor starts monitoring credit usage for the assigned warehouses.

        If you specify `IMMEDIATELY` for the start timestamp, the current timestamp is used.

        If you specify a date without a time, the current time is used.

        If you set a time without specifying a time zone, UTC is used as the default time zone.

    `END_TIMESTAMP = timestamp`
    :   The date and time when the resource monitor suspends the assigned warehouses.

    `NOTIFY_USERS = ( user_name [ , user_name , ... ] )`
    :   Specifies the list of users to receive email notifications on resource monitors. If a user identifier includes spaces or special
        characters or is case-sensitive, then the identifier must be enclosed in double quotes (e.g. “Mary Smith”). See
        [Identifier requirements](../identifiers-syntax.md) for details.

        The user identifier, `user_name`, is the value of the `name` column from the output of
        [SHOW USERS](show-users.md).

        Each user listed must have a verified email address. For instructions on verifying email addresses in the web interface, see: [Verify your email address](../../user-guide/ui-support.md).

        Email notifications for non-administrator users do not supersede email notifications for administrators. Any account administrators that
        have [enabled email notifications](../../user-guide/resource-monitors.md) will continue to receive email notifications.

        > **Note:**
        >
        > * The following limitations apply for non-administrator users:
        >
        >   + Non-administrator users can only receive [notifications](../../user-guide/resource-monitors.md)
        >     for [warehouse monitors](../../user-guide/resource-monitors.md).
        >   + Non-administrator users are notified by email but can’t see notifications in Snowsight.
        >   + Non-administrator users can’t create resource monitors.
        >   + Non-administrator users can’t assign other users to be notified.

`TRIGGERS ...` (aka *actions*)
:   Specifies one or more triggers for the resource monitor. Each trigger definition consists of:

    * `ON threshold PERCENT` (usage percentage; values larger than `100` are supported)
    * `DO SUSPEND | SUSPEND_IMMEDIATE | NOTIFY` (action to perform when the threshold is reached).

    For more details, see [CREATE RESOURCE MONITOR](create-resource-monitor.md).

## Usage notes

* If a `SUSPEND` or `SUSPEND_IMMEDIATE` trigger is active for a resource monitor and the trigger threshold has been reached for
  the specified frequency interval, thereby preventing all assigned warehouses from being started/resumed, you can use this command to
  either increase the credit quota above the trigger threshold or replace the trigger with a new trigger with a higher threshold.

  Once the credit quota or trigger threshold for the resource monitor has been increased, assigned warehouses can be started or resumed.
* The `TRIGGERS` parameter is not additive; it removes all existing triggers for the resource monitor and replaces them
  with the specified triggers.

  As a result, to make additions to the existing triggers, you must specify the new triggers and replicate the existing triggers.

  Replicating an existing trigger re-evaluates whether consumption has reached the trigger percentage and sends another notification if it
  has. For example, suppose a notification was sent at 70%, and consumption is currently at 90%. If you run an ALTER command to specify a
  70% trigger, a new notification is sent immediately.
* If `frequency` and `start_timestamp` parameters are set on a resource monitor, the day for the credit usage reset is
  calculated based on those parameters. The time the credit usage resets to `0` is 12:00 AM UTC regardless of the time specified in
  `start_timestamp`.
* If you specify an `end_timestamp`, monitoring ends at that specified date and time and all assigned warehouses are suspended
  at that date and time even if the credit quota has not been reached.

  When this occurs, a notification is sent that states the resource monitor has reached a percentage of its quota and has triggered a
  suspend immediate action. The percentage of the quota reflects the number of credits used in the current interval up to the end date
  and might not be a threshold you specified.
* If there are non-administrator users in the notification list, the following notes apply:

  + If any user in the notification list does not have a [verified email](../../user-guide/notifications/email-notifications.md),
    the SQL statement fails.
  + If any user in the notification list changes their email address and does not verify the new email address, the
    notification silently fails.
  + The notification list is limited to a maximum number of 5 non-administrator users.
  + Account administrators can view the notification list of non-administrator users in the output of
    [SHOW RESOURCE MONITORS](show-resource-monitors.md) in the `notify_user` column.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Specify a new credit quota for the resource monitor `limiter` and replace the existing triggers for the monitor with a new set
of triggers:

> ```sqlexample
> ALTER RESOURCE MONITOR limiter
>   SET CREDIT_QUOTA=2000
>   TRIGGERS ON 80 PERCENT DO NOTIFY
>            ON 100 PERCENT DO SUSPEND_IMMEDIATE;
> ```

Alter a resource monitor to send notifications to three users when 80% of the credit quota is reached. In this example, the
`user_name` for two of the users includes a space and is therefore enclosed in double quotes:

> ```sqlexample
> ALTER RESOURCE MONITOR limiter
>   SET CREDIT_QUOTA = 2000
>       NOTIFY_USERS = (JDOE, "Jane Smith", "John Doe")
>   TRIGGERS ON 80 PERCENT DO NOTIFY
>            ON 100 PERCENT DO SUSPEND_IMMEDIATE
> ```

---
title: ALTER ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-role.md
section: SQL Commands
---

# ALTER ROLE

Modifies the properties for an existing [custom role](../../user-guide/security-access-control-overview.md). Currently, the only supported
operations are renaming a role or adding/overwriting/removing a comment for a role.

See also:
:   [CREATE ROLE](create-role.md) , [DROP ROLE](drop-role.md) , [SHOW ROLES](show-roles.md)

## Syntax

```sqlsyntax
ALTER ROLE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER ROLE [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER ROLE [ IF EXISTS ] <name> UNSET COMMENT

ALTER ROLE [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER ROLE [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER ROLE [ IF EXISTS ] <name> UNSET DCM PROJECT
```

## Parameters

`name`
:   Specifies the identifier for the role to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_name`
:   Specifies the new identifier for the role; must be unique for your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies the properties to set for the role:

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the role.

`UNSET ...`
:   Specifies the properties to unset for the role, which resets them to the defaults.

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the role from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the role and the DCM project without dropping the role. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Usage notes

* Only the role owner (i.e. the role with the OWNERSHIP privilege on the role), or a higher role, can execute this command.
* To rename a role (using the `RENAME TO new_name` parameter) the role that executes this command must also have the global CREATE ROLE
  privilege.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename role `role1` to `role2`:

> ```sqlexample
> ALTER ROLE role1 RENAME TO role2;
> ```

Add a comment for role `myrole`:

> ```sqlexample
> ALTER ROLE myrole SET COMMENT = 'New comment for role';
> ```

---
title: ALTER ROW ACCESS POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-row-access-policy.md
section: SQL Commands
---

# ALTER ROW ACCESS POLICY

Modifies the properties for an existing row access policy, including renaming the policy or replacing the policy rules.

Any changes made to the policy rules go into effect when the next SQL query that uses the row access policy runs.

See also:
:   [Row access policy DDL](../../user-guide/security-row-intro.md)

## Syntax

```sqlsyntax
ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> SET BODY -> <expression_on_arg_name>

ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER ROW ACCESS POLICY [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Identifier for the row access policy; must be unique in the parent schema of the policy.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the row access policy; must be unique for your schema. The new identifier cannot be used if the
    identifier is already in place for a different row access policy.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one (or more) properties to set for the row access policy:

    `BODY -> expression_on_arg_name`
    :   SQL expression that filters the data.

        The expression can include [Conditional expression functions](../expressions-conditional.md) to represent conditional logic, built-in functions, or UDFs to
        transform the data.

        If a UDF or external function is used inside the row access policy body, the policy owner must have OWNERSHIP on the UDF or external
        function. Users querying a database object that has a row access policy applied to it do not need to have USAGE on the UDF or external
        function.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the masking policy.

        Default: No value

    `UNSET ...`
    :   Specifies one or more properties and/or parameters to unset for the masking policy, which resets them to the defaults:

        * `TAG tag_name [ , tag_name ... ]`
        * `COMMENT`

        When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Row access policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on row access policy DDL and privileges, see [Manage row access policies](../../user-guide/security-row-intro.md).

## Usage notes

* If you want to update an existing row access policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE ROW ACCESS POLICY](desc-row-access-policy.md) command.
* You cannot change the policy signature (i.e. argument name or input/output data type). Similarly, using
  `CREATE OR REPLACE ROW ACCESS POLICY` is not supported if the policy is attached to a table or view. If you need to change the
  signature, execute a [DROP ROW ACCESS POLICY](drop-row-access-policy.md) statement on the policy and create a new row access policy.
* Before executing an ALTER statement, you can execute a [DESCRIBE ROW ACCESS POLICY](desc-row-access-policy.md) statement to determine the
  argument name to use for updating the policy.
* Including one or more [subqueries](../../user-guide/querying-subqueries.md) in the policy body may cause errors. When possible, limit the
  number of subqueries, limit the number of JOIN operations, and simplify WHERE clause conditions.
* If the policy `body` contains a mapping table lookup, create a centralized mapping table and store the mapping table
  in the same database as the protected table. This is particularly important if the `body` calls the
  [IS_DATABASE_ROLE_IN_SESSION](../functions/is_database_role_in_session.md) function. For details, see the function usage notes.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the row access policy.

> ```sqlexample
> DESC ROW ACCESS POLICY rap_table_employee_info;
> ```

```output
+-------------------------+-------------+-------------+------+
| name                    | signature   | return_type | body |
+-------------------------+-------------+-------------+------+
| rap_table_employee_info | (V VARCHAR) | BOOLEAN     | true |
+-------------------------+-------------+-------------+------+
```

```sqlexample
ALTER ROW ACCESS POLICY rap_table_employee_info SET BODY -> false;
```

---
title: ALTER SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/alter-schema.md
section: SQL Commands
---

# ALTER SCHEMA

Modifies the properties for an existing schema, including renaming the schema or swapping it with another schema, and changing the Time Travel
data retention period (if you are using Snowflake Enterprise Edition or higher).

See also:
:   [CREATE SCHEMA](create-schema.md) , [DESCRIBE SCHEMA](desc-schema.md) , [DROP SCHEMA](drop-schema.md) , [SHOW SCHEMAS](show-schemas.md) , [UNDROP SCHEMA](undrop-schema.md)

## Syntax

```sqlsyntax
ALTER SCHEMA [ IF EXISTS ] <name> RENAME TO <new_schema_name>

ALTER SCHEMA [ IF EXISTS ] <name> SWAP WITH <target_schema_name>

ALTER SCHEMA [ IF EXISTS ] <name> SET {
                                      [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
                                      [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
                                      [ EXTERNAL_VOLUME = <external_volume_name> ]
                                      [ CATALOG = <catalog_integration_name> ]
                                      [ ICEBERG_VERSION_DEFAULT = <integer> ]
                                      [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
                                      [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
                                      [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
                                      [ DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU = '<compute_pool_name>' ]
                                      [ DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU = '<compute_pool_name>' ]
                                      [ LOG_LEVEL = '<log_level>' ]
                                      [ TRACE_LEVEL = '<trace_level>' ]
                                      [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
                                      [ CLASSIFICATION_PROFILE = '<profile_name>' ]
                                      [ COMMENT = '<string_literal>' ]
                                      [ CATALOG_SYNC = '<snowflake_open_catalog_integration_name>' ]
                                      [ REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' } ]
                                      [ BASE_LOCATION_PREFIX = '<string>']
                                      [ DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE = '<warehouse_name>']
                                      [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
                                      [ OBJECT_VISIBILITY = PRIVILEGED } ]
                                      [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
                                      }

ALTER SCHEMA [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SCHEMA [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER SCHEMA [ IF EXISTS ] <name> UNSET {
                                        DATA_RETENTION_TIME_IN_DAYS         |
                                        MAX_DATA_EXTENSION_TIME_IN_DAYS     |
                                        EXTERNAL_VOLUME                     |
                                        CATALOG                             |
                                        ICEBERG_VERSION_DEFAULT             |
                                        ENABLE_ICEBERG_MERGE_ON_READ        |
                                        REPLACE_INVALID_CHARACTERS          |
                                        DEFAULT_DDL_COLLATION               |
                                        LOG_LEVEL                           |
                                        TRACE_LEVEL                         |
                                        STORAGE_SERIALIZATION_POLICY        |
                                        COMMENT                             |
                                        CATALOG_SYNC                        |
                                        REPLICABLE_WITH_FAILOVER_GROUPS     |
                                        BASE_LOCATION_PREFIX                |
                                        DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE|
                                        CONTACT <purpose>
                                        CLASSIFICATION_PROFILE
                                        OBJECT_VISIBILITY                   |
                                        CONTACT <purpose>                   |
                                        CLASSIFICATION_PROFILE              |
                                        ENABLE_DATA_COMPACTION              |
                                        DCM PROJECT
                                        }
                                        [ , ... ]

ALTER SCHEMA [ IF EXISTS ] <name> { ENABLE | DISABLE } MANAGED ACCESS
```

## Parameters

`name`
:   Specifies the identifier for the schema to alter. If the identifier contains spaces, special characters, or mixed-case characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_schema_name`
:   Specifies the new identifier for the schema; must be unique for the database.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database while optionally renaming the schema. To do so, specify a qualified
    `new_schema_name` value that includes the new database name in the form `db_name.new_schema_name`.

    > **Note:**
    >
    > The destination database must already exist. In addition, a schema with the same name cannot already exist in the new location;
    > otherwise, the statement returns an error.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SWAP WITH target_schema_name`
:   Swaps all objects (tables, views, etc.) and metadata, including identifiers, between the two specified schemas. Also swaps all access control
    privileges granted on the schemas and objects they contain. `SWAP WITH` essentially performs a rename of both schemas as a single operation.

`SET ...`
:   Specifies one (or more) properties to set for the schema (separated by blank spaces, commas, or new lines):

    `DATA_RETENTION_TIME_IN_DAYS = integer`
    :   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the schema, as well as specifying the
        default Time Travel retention time for all tables created in the schema.

        The value you can specify depends on the Snowflake Edition you are using:

        * Standard Edition: `0` or `1`
        * Enterprise Edition (or higher): `0` to `90`

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in the schema
        to prevent streams on the tables from becoming stale.

        For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `EXTERNAL_VOLUME = external_volume_name`
    :   Object parameter that specifies the default external volume to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

        For more information about this parameter, see [EXTERNAL_VOLUME](../parameters.md).

    `CATALOG = catalog_integration_name`
    :   Object parameter that specifies the default catalog integration to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

        For more information about this parameter, see [CATALOG](../parameters.md).

    `ICEBERG_VERSION_DEFAULT = integer`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies the version of the Apache Iceberg™ table specification that Iceberg tables conform to.

        Values:
        :   `2`: New tables conform with Iceberg version 2.

            `3`: New tables conform with Iceberg version 3.

        > **Caution:**
        >
        > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
        > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
        > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
        > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

        Default:
        :   `2`

        For more information about this parameter, see [ICEBERG_VERSION_DEFAULT](../parameters.md).

    `ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
    :   [Preview feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

        Values:
        :   `TRUE`: New tables use merge-on-read behavior.

            `FALSE`: New tables use copy-on-write behavior.

        Default:
        :   `TRUE`

        For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
        and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

    `REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
    :   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results for an
        [Iceberg table](create-iceberg-table.md).
        You can only set this parameter for tables that use an external Iceberg catalog.

        * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
        * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
          characters in a Parquet data file.

        Default: `FALSE`

    `DEFAULT_DDL_COLLATION = 'collation_specification'`
    :   Specifies a default [collation specification](../collation.md) for:

        * Any new columns added to existing tables in the schema.
        * All columns in new tables added to the schema.

        Setting the parameter does not change the collation specification for any existing columns.

        For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

    `DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU = compute_pool_name`
    :   CPU compute pool name that overrides the default CPU compute pool Snowflake provisioned in your account for running Notebooks. For more information, see [System compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    `DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU = compute_pool_name`
    :   GPU compute pool name that overrides the default GPU compute pool Snowflake provisioned in your account for running Notebooks. For more information, see [System compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
        the specified level (and at more severe levels) are ingested.

        For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting log level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `TRACE_LEVEL = 'trace_level'`
    :   Controls how trace events are ingested into the event table.

        For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting trace level, see
        [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
    :   Specifies the storage serialization policy for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) that use Snowflake as the catalog.

        * `COMPATIBLE`: Snowflake performs encoding and compression of data files that ensures interoperability with third-party compute engines.
        * `OPTIMIZED`: Snowflake performs encoding and compression of data files that ensures the best table performance within Snowflake.

        Default: `OPTIMIZED`

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `CLASSIFICATION_PROFILE = 'profile_name'`
    :   Associates the schema with a classification profile so that sensitive data in the schema is
        [automatically classified](../../user-guide/classify-auto.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the schema.

    `CATALOG_SYNC = 'snowflake_open_catalog_integration_name'`
    :   Specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).
        If specified, Snowflake syncs Snowflake-managed Apache Iceberg™ tables in the schema with an external catalog in your Snowflake Open Catalog account.
        For more information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

        For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

        Default: No value

    `REPLICABLE_WITH_FAILOVER_GROUPS = { 'YES' | 'NO' }`
    :   Specifies if this schema is eligible for replication.
        You can set this property to `NO` to prevent individual schemas
        within a database from being replicated.

        For more information about this parameter, see [Schema-level replication for failover groups](../../user-guide/account-replication-config.md).

        Default: `'YES'`

    `DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE = 'warehouse_name'`
    :   Specifies the default warehouse to use when you create a notebook using SQL.

    `BASE_LOCATION_PREFIX = 'string'`
    :   Specifies a prefix for Snowflake to use in the write path for Snowflake-managed Apache Iceberg™ tables.
        For more information,
        see [data and metadata directories for Iceberg tables](../../user-guide/tables-iceberg-storage.md) and
        [BASE_LOCATION_PREFIX](../parameters.md) in the Snowflake Parameters topic.

        Default: No value

`OBJECT_VISIBILITY = PRIVILEGED`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object. This is the default behavior in Snowflake.

    For examples, see [Make database objects discoverable in Universal Search](../../user-guide/ui-snowsight/object-visibility-universal-search.md).

`ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
:   Specifies whether Snowflake should enable data compaction on Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    * `TRUE`: Snowflake performs data compaction on the tables.
    * `FALSE`: Snowflake doesn’t perform data compaction on the tables.

    Default: `TRUE`

    For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the database, which resets them to the defaults:

    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `EXTERNAL_VOLUME`
    * `CATALOG`
    * `ICEBERG_VERSION_DEFAULT`
    * `ENABLE_ICEBERG_MERGE_ON_READ`
    * `REPLACE_INVALID_CHARACTERS`
    * `DEFAULT_DDL_COLLATION`
    * `TAG tag_name [ , tag_name ... ]`
    * `LOG_LEVEL`
    * `TRACE_LEVEL`
    * `STORAGE_SERIALIZATION_POLICY`
    * `COMMENT`
    * `CATALOG_SYNC`
    * `REPLICABLE_WITH_FAILOVER_GROUPS`
    * `BASE_LOCATION_PREFIX`
    * `DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE`
    * `CONTACT purpose`
    * `CLASSIFICATION_PROFILE`
    * `OBJECT_VISIBILITY`
    * `ENABLE_DATA_COMPACTION`

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by a
    comma. When resetting a property/parameter, specify only the name; specifying a value for the property will return an error.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the schema from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the schema and the DCM project without dropping the schema. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

`{ ENABLE | DISABLE } MANAGED ACCESS`
:   Enable managed access for a schema, or disable to convert a managed access schema to a regular schema. Managed access schemas centralize
    privilege management with the schema owner.

    In regular schemas, the owner of an object (i.e. the role that has the OWNERSHIP privilege on the object) can grant further privileges on
    their objects to other roles. In managed access schemas, the schema owner manages all privilege grants, including
    [future grants](../../user-guide/security-access-control-configure.md), on objects in the schema. Object owners retain the OWNERSHIP privileges
    on the objects; however, only the schema owner can manage privilege grants on the objects.

## Usage notes

* To rename a schema, the role used to perform the operation must have the CREATE SCHEMA privilege on the database for the schema and OWNERSHIP
  privileges on the schema.
* To swap two schemas, the role used to perform the operation must have OWNERSHIP privileges on both schemas.
* To convert a regular schema to a managed access schema:

  + The schema owner (i.e. the role that has the OWNERSHIP privileges on the schema) must also have the global MANAGE GRANTS privilege. The
    MANAGE GRANTS privilege is required because another role with this privilege could have defined future grants on objects of a specified
    type in the schema. After a regular schema becomes a managed access schema, the schema owner could revoke the future grants without
    understanding why a role with the MANAGE GRANTS privilege granted them.
  + All open future grants must be revoked using [REVOKE <privileges> … FROM ROLE](revoke-privilege.md) with the FUTURE keyword.

  After a regular schema is converted to a managed access schema, all privileges previously granted on individual objects are retained; however,
  the object owners cannot grant further privileges on those objects.
* To convert a managed access schema to a regular schema, the schema owner must also have the global MANAGE GRANTS privilege only if the
  current schema has future privilege grants defined.
* For schemas in a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md), this command only supports
  the following parameters:

  + SET/UNSET with the following options:

    - CLASSIFICATION_PROFILE
    - COMMENT
    - CONTACT
    - STORAGE_SERIALIZATION_POLICY
    - TAG
  + ENABLE MANAGED ACCESS and DISABLE MANAGED ACCESS.
* To specify the default version of the Apache Iceberg™ specification that Iceberg tables conform to, the role used to perform the operation
  must have the OWNERSHIP privilege on the schema.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename schema `schema1` to `schema2`:

> ```sqlexample
> ALTER SCHEMA IF EXISTS schema1 RENAME TO schema2;
> ```

Convert a regular schema to a managed access schema:

> ```sqlexample
> ALTER SCHEMA schema2 ENABLE MANAGED ACCESS;
> ```

---
title: ALTER SECRET
source: https://docs.snowflake.com/en/sql-reference/sql/alter-secret.md
section: SQL Commands
---

# ALTER SECRET

Modifies the properties of an existing secret.

See also:
:   [CREATE SECRET](create-secret.md) , [DESCRIBE SECRET](desc-secret.md) , [DROP SECRET](drop-secret.md) , [SHOW SECRETS](show-secrets.md)

## Syntax

**OAuth with client credentials flow:**

```sqlsyntax
ALTER SECRET [ IF EXISTS ] <name> SET [ OAUTH_SCOPES = ( '<scope_1>' [ , '<scope_2>' ... ] ) ]
                                      [ COMMENT = '<string_literal>' ]

ALTER SECRET [ IF EXISTS ] <name> UNSET COMMENT
```

**OAuth with authorization code grant flow:**

```sqlsyntax
ALTER SECRET [ IF EXISTS ] <name> SET [ OAUTH_REFRESH_TOKEN = '<token>' ]
                                      [ OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '<string_literal>' ]
                                      [ COMMENT = '<string_literal>' ]

ALTER SECRET [ IF EXISTS ] <name> UNSET COMMENT
```

**Cloud provider:**

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

```sqlsyntax
ALTER SECRET [ IF EXISTS ] <name> SET [ API_AUTHENTICATION = '<cloud_provider_security_integration>' ]
                                      [ COMMENT = '<string_literal>' ]

ALTER SECRET [ IF EXISTS ] <name> UNSET COMMENT
```

**Basic authentication:**

```sqlsyntax
ALTER SECRET [ IF EXISTS ] <name> SET [ USERNAME = '<username>' ]
                                      [ PASSWORD = '<password>' ]
                                      [ COMMENT = '<string_literal>' ]

ALTER SECRET [ IF EXISTS ] <name> UNSET COMMENT
```

**Generic string:**

```sqlsyntax
ALTER SECRET [ IF EXISTS ] <name> SET [ SECRET_STRING = '<string_literal>' ]
                                      [ COMMENT = '<string_literal>' ]

ALTER SECRET [ IF EXISTS ] <name> UNSET COMMENT
```

## OAuth with client credentials flow parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

`SET ...`
:   Specifies one (or more) parameters to set (separated by blank spaces, commas, or new lines).

    `OAUTH_SCOPES = ( 'scope_1' [ , 'scope_2' ... ] )`
    :   Specifies a comma-separated list of scopes to use when making a request from the OAuth server by a role with USAGE on the integration
        during the OAuth client credentials flow.

        This list must be a subset of the scopes defined in the `OAUTH_ALLOWED_SCOPES` property of the security integration. If the
        `OAUTH_SCOPES` property values are not specified, the secret inherits all of the scopes that are specified in the security
        integration.

## AWS IAM required parameters

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.

`SET ...`
:   Specifies one (or more) parameters to set (separated by blank spaces, commas, or new lines).

    `TYPE = CLOUD_PROVIDER_TOKEN`
    :   Specifies that this is secret for use with a cloud provider, such as Amazon Web Services (AWS).

    `API_AUTHENTICATION = 'cloud_provider_security_integration'`
    :   Specifies the `name` value of the Snowflake [security integration](create-security-integration-aws-iam.md)
        that connects Snowflake to a cloud provider.

## OAuth with authorization code grant flow parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

`SET ...`
:   Specifies one (or more) parameters to set (separated by blank spaces, commas, or new lines).

    `OAUTH_REFRESH_TOKEN = 'token'`
    :   Specifies the token as a string that is used to obtain a new access token from the OAuth authorization server when the access token
        expires.

    `OAUTH_REFRESH_TOKEN_EXPIRY_TIME = 'string_literal'`
    :   Specifies the timestamp as a string when the OAuth refresh token expires.

## Basic authentication parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

`SET ...`
:   > Specifies one (or more) parameters to set for the session (separated by blank spaces, commas, or new lines).

    `USERNAME = 'username'`
    :   Specifies the username value to store in the secret.

        Specify this property value when using a secret for basic authentication (i.e. the secret is `TYPE = PASSWORD`).

    `PASSWORD = 'password'`
    :   Specifies the password value to store in the secret.

        Specify this property value when using a secret for basic authentication (i.e. the secret is `TYPE = PASSWORD`).

## Generic string parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

`SET ...`
:   Specifies one (or more) parameters to set (separated by blank spaces, commas, or new lines).

    `SECRET_STRING = 'string_literal'`
    :   Specifies the string to store in the secret.

        The string can be an API token or a string of sensitive value that can be used in the handler code of a UDF or stored procedure. For
        details, see [Creating and using an external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

        You should not use this property to store any kind of OAuth token; use one of the other secret types for your OAuth use cases.

## Common parameters: all syntaxes

`SET ...`
:   > Specifies one (or more) parameters to set for the session (separated by blank spaces, commas, or new lines).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the secret.

        Default: No value

`UNSET ...`
:   Specifies one (or more) properties/parameters to unset for the secret, which resets them back to their defaults:

    * `COMMENT`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Secret | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Regarding metadata:

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Modify the comment for a secret:

> ```sqlexample
> ALTER SECRET service_now_creds_pw SET COMMENT = 'production secret for servicenow';
> ```

---
title: ALTER SECURITY INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION

Modifies the properties for an existing security integration.

See also:
:   [CREATE SECURITY INTEGRATION](create-security-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET <parameters>

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>  UNSET <parameter>

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

The syntax varies considerably among security environments (i.e. types of security integrations). For specific syntax, usage notes, and
examples, see:

* [ALTER SECURITY INTEGRATION (AWS IAM Authentication)](alter-security-integration-aws-iam.md)
* [ALTER SECURITY INTEGRATION (External API Authentication)](alter-security-integration-api-auth.md)
* [ALTER SECURITY INTEGRATION (External OAuth)](alter-security-integration-oauth-external.md)
* [ALTER SECURITY INTEGRATION (Snowflake OAuth)](alter-security-integration-oauth-snowflake.md)
* [ALTER SECURITY INTEGRATION (SAML2)](alter-security-integration-saml2.md)
* [ALTER SECURITY INTEGRATION (SCIM)](alter-security-integration-scim.md)

---
title: ALTER SECURITY INTEGRATION (AWS IAM Authentication)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-aws-iam.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (AWS IAM Authentication)

Modifies the properties of an existing security integration created for authenticating with AWS IAM.

For information about modifying other types of security integrations (such as Snowflake OAuth), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (AWS IAM Authentication)](create-security-integration-aws-iam.md) , [DESCRIBE INTEGRATION](desc-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
  [ TYPE = AWS_IAM ]
  [ AWS_ROLE_ARN = '<iam_role_arn>' ]
  [ ENABLED = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   String that specifies the identifier (such as the name) for the integration.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `TYPE = AWS_IAM`
    :   Specifies that the integration uses AWS IAM to authenticate to the external service.

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to enable or disable this security integration.

        `TRUE`
        :   Allows the integration to run based on the parameters specified in the integration definition.

        `FALSE`
        :   Suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

`AWS_ROLE_ARN = 'iam_role_arn'`
:   Specifies the Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges for AWS resources.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

> ```sqlexample
> ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
> ```

---
title: ALTER SECURITY INTEGRATION (External API Authentication)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-api-auth.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (External API Authentication)

Modifies the properties of an existing security integration created for External API Authentication.

For information about modifying other types of security integrations (e.g. Snowflake OAuth), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md) , [DESCRIBE INTEGRATION](desc-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

### OAuth: Client credentials

```sqlsyntax
ALTER SECURITY INTEGRATION <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'CLIENT_CREDENTIALS']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_ALLOWED_SCOPES = ( '<scope_1>' [ , '<scope_2>' ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> UNSET {
  ENABLED | [ , ... ]
}
```

### OAuth: Authorization code grant flow

```sqlsyntax
ALTER SECURITY INTEGRATION <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'AUTHORIZATION_CODE']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> UNSET {
  ENABLED | [ , ... ]
}
```

### OAuth: JWT bearer flow

```sqlsyntax
ALTER SECURITY INTEGRATION <name> SET
  [ ENABLED = { TRUE | FALSE } ]
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'JWT_BEARER']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> UNSET {
  ENABLED | [ , ... ]
}
```

## Parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether this security integration is enabled or disabled.

        `TRUE`
        :   Allows the integration to run based on the parameters specified in the integration definition.

        `FALSE`
        :   Suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `OAUTH_AUTHORIZATION_ENDPOINT = 'string_literal'`
    :   Specifies the URL for authenticating to the external service. For example, to connect to the ServiceNow instance, the URL should be in
        the following format:

        ```none
        https://<instance_name>.service-now.com/oauth_token.do
        ```

        Where `instance_name` is the name of your ServiceNow instance.

    `OAUTH_TOKEN_ENDPOINT = 'string_literal'`
    :   Specifies the token endpoint used by the client to obtain an access token by presenting its authorization grant or refresh token.
        The token endpoint is used with every authorization grant except for the implicit grant type (since an access token is issued directly).

`OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST }`
:   Controls how client credentials are sent to the external service.

    `CLIENT_SECRET_BASIC`
    :   Specifies that client credentials are sent using the HTTP Basic Authentication Scheme.

    `CLIENT_SECRET_POST`
    :   Specifies that client credentials are sent in the HTTP request body of a POST request.

    Default: `CLIENT_SECRET_BASIC`

    `OAUTH_CLIENT_ID = 'string_literal'`
    :   Specifies the client ID for the OAuth application in the external service.

    `OAUTH_CLIENT_SECRET = 'string_literal'`
    :   Specifies the client secret for the OAuth application in the ServiceNow instance. The connector uses this to request an access token
        from the ServiceNow instance.

    `OAUTH_GRANT = 'string_literal'`
    :   Specifies the type of OAuth flow. One of the following:

        * `'CLIENT_CREDENTIALS'` when the integration will use client credentials.
        * `'AUTHORIZATION_CODE'` when the integration will use an authorization code.
        * `'JWT_BEARER'` when the integration will use a JWT bearer token.

    `OAUTH_ACCESS_TOKEN_VALIDITY = integer`
    :   Specifies the default lifetime of the OAuth access token (in seconds) issued by an OAuth server.

        The value set in this property is used if the access token lifetime is not returned as part of OAuth token response. When both
        values are available, the smaller value will be used to refresh the access token.

    `OAUTH_REFRESH_TOKEN_VALIDITY = integer`
    :   Specifies the value to determine the validity of the refresh token obtained from the OAuth server.

    `OAUTH_ALLOWED_SCOPES = ( list )`
    :   Specifies a comma-separated list of scopes, with single quotes surrounding each scope, to use when making a request from the OAuth by a
        role with USAGE on the integration during the OAuth client credentials flow.

        This list must be a subset of the scopes defined in the `OAUTH_ALLOWED_SCOPES` property of the security integration. If the
        `OAUTH_SCOPES` property values are not specified, the secret inherits all of the scopes that are specified in the security
        integration.

        For the ServiceNow connector, the only possible scope value is `'useraccount'`.

        Default: Empty list (i.e. `[]`).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the integration.

        Default: No value

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

> ```sqlexample
> ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
> ```

---
title: ALTER SECURITY INTEGRATION (External OAuth)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-oauth-external.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (External OAuth)

> **Attention:**
>
> Mentions of Microsoft Azure Active Directory refer to Microsoft Entra ID.

Modifies the properties of an existing security integration created for External OAuth. For information about modifying other types of
security integrations (e.g. Snowflake OAuth), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (External OAuth)](create-security-integration-oauth-external.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
  [ TYPE = EXTERNAL_OAUTH ]
  [ ENABLED = { TRUE | FALSE } ]
  [ EXTERNAL_OAUTH_TYPE = { OKTA | AZURE | PING_FEDERATE | CUSTOM } ]
  [ EXTERNAL_OAUTH_ISSUER = '<string_literal>' ]
  [ EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = '<string_literal>' | ('<string_literal>', '<string_literal>' [ , ... ] ) ]
  [ EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'LOGIN_NAME | EMAIL_ADDRESS' ]
  [ EXTERNAL_OAUTH_JWS_KEYS_URL = '<string_literal>' ] -- For OKTA | PING_FEDERATE | CUSTOM
  [ EXTERNAL_OAUTH_JWS_KEYS_URL = '<string_literal>' | ('<string_literal>' [ , '<string_literal>' ... ] ) ] -- For Azure
  [ EXTERNAL_OAUTH_RSA_PUBLIC_KEY = <public_key1> ]
  [ EXTERNAL_OAUTH_RSA_PUBLIC_KEY_2 = <public_key2> ]
  [ EXTERNAL_OAUTH_BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ EXTERNAL_OAUTH_ALLOWED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ EXTERNAL_OAUTH_AUDIENCE_LIST = ('<string_literal>') ]
  [ EXTERNAL_OAUTH_ANY_ROLE_MODE = DISABLE | ENABLE | ENABLE_FOR_PRIVILEGE ]
  [ EXTERNAL_OAUTH_SCOPE_DELIMITER = '<string_literal>' ] -- Only for EXTERNAL_OAUTH_TYPE = CUSTOM
  [ NETWORK_POLICY = '<network_policy>' ]
  [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>  UNSET {
                                                            ENABLED                      |
                                                            EXTERNAL_OAUTH_AUDIENCE_LIST |
                                                            }
                                                            [ , ... ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `TYPE = EXTERNAL_OAUTH`
    :   Distinguishes the [External OAuth](../../user-guide/oauth-ext-overview.md) integration from a
        [Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md) integration.

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` allows the integration to run based on the parameters specified in the pipe definition.
        * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `EXTERNAL_OAUTH_TYPE = { OKTA | AZURE | PING_FEDERATE | CUSTOM }`
    :   Specifies the OAuth 2.0 authorization server to be Okta, Microsoft Entra ID, Ping Identity PingFederate, or a Custom OAuth 2.0
        authorization server.

    `EXTERNAL_OAUTH_ISSUER = 'string_literal'`
    :   Specifies the URL to define the OAuth 2.0 authorization server.

    `EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = { 'string_literal' | ('string_literal', 'string_literal' [ , ... ] ) }`
    :   Specifies the access token claim or claims that can be used to map the access token to a Snowflake user record.

        The data type of the claim must be a string or a list of strings.

    `EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = { 'LOGIN_NAME | EMAIL_ADDRESS' }`
    :   Indicates which Snowflake user record attribute should be used to map the access token to a Snowflake user record.

    `EXTERNAL_OAUTH_JWS_KEYS_URL = 'string_literal'`
    :   Specifies the endpoint from which to download public keys or certificates to validate an External OAuth access token.

        This syntax applies to security integrations where `EXTERNAL_OAUTH_TYPE = { OKTA | PING_FEDERATE | CUSTOM }`

    `EXTERNAL_OAUTH_JWS_KEYS_URL = { 'string_literal' | ('string_literal' [ , 'string_literal' ... ] ) }`
    :   Specifies the endpoint or a list of endpoints from which to download public keys or certificates to validate an External OAuth access
        token. The maximum number of URLs that can be specified in the list is 3.

        This syntax applies to security integrations where `EXTERNAL_OAUTH_TYPE = AZURE`

    `EXTERNAL_OAUTH_RSA_PUBLIC_KEY = public_key1`
    :   Specifies a Base64-encoded RSA public key, without the `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----` headers.

    `EXTERNAL_OAUTH_RSA_PUBLIC_KEY_2 = public_key2`
    :   Specifies a second RSA public key, without the `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----` headers. Used
        for key rotation.

    `EXTERNAL_OAUTH_BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
    :   Specifies the list of roles that a client cannot set as the [primary role](../../user-guide/security-access-control-overview.md). A role
        in this list cannot be used when creating a Snowflake session based on the access token from the External OAuth
        authorization server.

        By default, this list includes the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles. To remove these privileged roles from the list, use
        the [ALTER ACCOUNT](alter-account.md) command to set the [EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../parameters.md) account parameter to
        `FALSE`.

    `EXTERNAL_OAUTH_ALLOWED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
    :   Specifies the list of roles that the client can set as the primary role.

        A role in this list can be used when creating a Snowflake session based on the access token from the External OAuth authorization
        server.

        > **Caution:**
        >
        > This parameter supports the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN system roles.
        >
        > Exercise caution when creating a Snowflake session with these highly privileged roles set as the primary role.

    `EXTERNAL_OAUTH_AUDIENCE_LIST = ('string_literal')`
    :   Specifies additional values that can be used for the access token’s audience validation on top of using the Customer’s Snowflake
        Account URL (i.e. `<account_identifier>.snowflakecomputing.com`). For more information, see
        [Account identifiers](../../user-guide/admin-account-identifier.md).

        For details on this property when using Power BI SSO, refer to
        [Power BI SSO security integrations](../../user-guide/oauth-powerbi.md).

        Currently, multiple audience URLs can be specified for [External OAuth Custom Clients](../../user-guide/oauth-ext-custom.md) only. Each
        URL must be enclosed in single quotes, with a comma separating each URL. For example:

        > ```sqlexample
        > external_oauth_audience_list = ('https://example.com/api/v2/', 'https://example.com')
        > ```

    `EXTERNAL_OAUTH_ANY_ROLE_MODE = { DISABLE | ENABLE | ENABLE_FOR_PRIVILEGE }`
    :   Specifies whether the OAuth client or user can use a role that is not defined in the OAuth access token. Note that with a
        [Power BI to Snowflake integration](../../user-guide/oauth-powerbi.md), the PowerBI user cannot switch roles even when this parameter is enabled.

        * `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `USE ROLE role;`). Default.
        * `ENABLE` allows the OAuth client or user to switch roles.
        * `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the USE_ANY_ROLE
          privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

          ```sqlexample
          GRANT USE_ANY_ROLE ON INTEGRATION external_oauth_1 TO role1;
          ```

          ```sqlexample
          REVOKE USE_ANY_ROLE ON INTEGRATION external_oauth_1 FROM role1;
          ```

        Note that the value can be optionally enclosed in single quotes (e.g. either `DISABLE` or `'DISABLE'`).

    `EXTERNAL_OAUTH_SCOPE_DELIMITER = 'string_literal'`
    :   Specifies the scope delimiter in the authorization token.

        The delimiter can be any single character, such as comma (`','`) or space (`' '`).

        This security integration property is optional and can be used to override the default comma delimiter. Note that this property is only
        supported for custom External OAuth integrations, where:

        > `EXTERNAL_OAUTH_TYPE = CUSTOM`

        Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable this
        property in your Snowflake account.

    `NETWORK_POLICY = 'network_policy'`
    :   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic from the client
        to Snowflake.

        For more information, see [Restricting network traffic for External OAuth](../../user-guide/oauth-ext-overview.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the integration.

        Default: No value

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the security integration, which resets them back to their defaults:

    * `ENABLED`
    * `EXTERNAL_OAUTH_AUDIENCE_LIST`
    * `TAG tag_name [ , tag_name ... ]`

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

```sqlexample
ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
```

---
title: ALTER SECURITY INTEGRATION (SAML2)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-saml2.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (SAML2)

Modifies the properties of an existing SAML2 security integration. For information about modifying other types of
security integrations (e.g. SCIM), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (SAML2)](create-security-integration-saml2.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
    [ TYPE = SAML2 ]
    [ ENABLED = { TRUE | FALSE } ]
    [ METADATA_URL = '<string_literal>' ]
    [ SAML2_ISSUER = '<string_literal>' ]
    [ SAML2_SSO_URL = '<string_literal>' ]
    [ SAML2_PROVIDER = '<string_literal>' ]
    [ SAML2_X509_CERT = '<string_literal>' ]
    [ ALLOWED_USER_DOMAINS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
    [ ALLOWED_EMAIL_PATTERNS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
    [ SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = '<string_literal>' ]
    [ SAML2_ENABLE_SP_INITIATED = TRUE | FALSE ]
    [ SAML2_SNOWFLAKE_X509_CERT = '<string_literal>' ]
    [ SAML2_SIGN_REQUEST = TRUE | FALSE ]
    [ SAML2_REQUESTED_NAMEID_FORMAT = '<string_literal>' ]
    [ SAML2_POST_LOGOUT_REDIRECT_URL = '<string_literal>' ]
    [ SAML2_FORCE_AUTHN = TRUE | FALSE ]
    [ SAML2_SNOWFLAKE_ISSUER_URL = '<string_literal>' ]
    [ SAML2_SNOWFLAKE_ACS_URL = '<string_literal>' ]
    [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> UNSET {
    ENABLED |
    [ , ... ]
    }

ALTER [ SECURITY ] INTEGRATION <name> REFRESH
  [ SAML2_SNOWFLAKE_PRIVATE_KEY ]
  [ METADATA_URL ]

ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ALLOWED_USER_DOMAINS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Specifies a list of email domains that can authenticate with a SAML2 security integration. For example,
        `ALLOWED_USER_DOMAINS = ("example.com", "example2.com", ...)`.

        This parameter can be used to associate a user with an IdP for configurations that use multiple IdPs. For details, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

    `ALLOWED_EMAIL_PATTERNS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Specifies a list of regular expressions that email addresses are matched against to authenticate with a SAML2 security integration. For
        example,
        `ALLOWED_EMAIL_PATTERNS = ("^(.+dev)@example.com$", "^(.+dev)@example2.com$", ... )`.

        This parameter can be used to associate a user with an IdP for configurations that use multiple IdPs. For details, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

    `TYPE = SAML2`
    :   Specify the type of integration:

    * `SAML2`: Creates a security interface between Snowflake and the identity provider.

    `ENABLED = TRUE | FALSE`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` allows the integration to run based on the parameters specified in the integration definition.
        * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `METADATA_URL = 'string_literal'`
    :   Specifies the metadata URL of the IdP. A metadata URL is an endpoint that allows Snowflake to dynamically
        retrieve and synchronize IdP configuration settings, including certificate updates.

        This parameter is only supported for Okta and Microsoft Entra ID. For help obtaining the metadata URL, see the section in [Configuring an identity provider (IdP) for Snowflake](../../user-guide/admin-security-fed-auth-configure-idp.md) that corresponds to your IdP.

        If you specify a metadata URL, you can’t use the `SAML2_ISSUER`, `SAML2_SSO_URL`, `SAML2_PROVIDER`, and
        `SAML2_X509_CERT` parameters. The information specified with these parameters is obtained from the metadata URL.

    `SAML2_ISSUER = 'string_literal'`
    :   The string containing the EntityID / Issuer of the IdP.

    `SAML2_SSO_URL = 'string_literal'`
    :   The string containing the IdP SSO URL, where the user should be redirected by Snowflake (the Service Provider) with a SAML
        `AuthnRequest` message.

    `SAML2_PROVIDER = 'string_literal'`
    :   The string describing the IdP.

        One of the following: OKTA, ADFS, Custom.

    `SAML2_X509_CERT = 'string_literal'`
    :   The Base64 encoded IdP signing certificate on a single line without the leading `-----BEGIN CERTIFICATE-----` and ending
        `-----END CERTIFICATE-----` markers.

    `SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'string_literal'`
    :   The string containing the label to display after the Log In With button on the login page.

    `SAML2_ENABLE_SP_INITIATED = TRUE | FALSE`
    :   The Boolean indicating if the Log In With button will be shown on the login page.

        * `TRUE` displays the Log in With button on the login page.
        * `FALSE` does not display the Log in With button on the login page.

    `SAML2_SNOWFLAKE_X509_CERT = 'string_literal'`
    :   The Base64 encoded self-signed certificate generated by Snowflake for use with [Encrypt SAML assertions](../../user-guide/admin-security-fed-auth-security-integration.md) and
        [Send signed SAML requests](../../user-guide/admin-security-fed-auth-security-integration.md).

        You must have at least one of these features (encrypted SAML assertions or signed SAML responses) enabled in your Snowflake account to
        access the certificate value.

    `SAML2_SIGN_REQUEST = TRUE | FALSE`
    :   The Boolean indicating whether SAML requests are signed.

        * `TRUE` allows SAML requests to be signed.
        * `FALSE` does not allow SAML requests to be signed.

    `SAML2_REQUESTED_NAMEID_FORMAT = 'string_literal'`
    :   The SAML NameID format allows Snowflake to set an expectation of the identifying attribute of the user (i.e. SAML Subject) in the SAML
        assertion from the IdP to ensure a valid authentication to Snowflake. If a value is not specified, Snowflake sends the
        `urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress` value in the authentication request to the IdP.

        Optional.

        If you choose to specify the SAML `NameID` format, use one of the following values:

        * `urn:oasis:names:tc:SAML:1.1:nameid-format:unspecified`
        * `urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress`
        * `urn:oasis:names:tc:SAML:1.1:nameid-format:X509SubjectName`
        * `urn:oasis:names:tc:SAML:1.1:nameid-format:WindowsDomainQualifiedName`
        * `urn:oasis:names:tc:SAML:2.0:nameid-format:kerberos`
        * `urn:oasis:names:tc:SAML:2.0:nameid-format:persistent`
        * `urn:oasis:names:tc:SAML:2.0:nameid-format:transient`

    `SAML2_POST_LOGOUT_REDIRECT_URL = '<string_literal>'`
    :   The endpoint to which Snowflake redirects users after clicking the Log Out button in Snowsight.

        Snowflake terminates the Snowflake session upon redirecting to the specified endpoint.

    `SAML2_FORCE_AUTHN = TRUE | FALSE`
    :   The Boolean indicating whether users, during the initial authentication flow, are forced to authenticate again to access Snowflake.
        When set to `TRUE`, Snowflake sets the `ForceAuthn` SAML parameter to `TRUE` in the outgoing request from Snowflake
        to the identity provider.

        * `TRUE` forces users to authenticate again to access Snowflake, even if a valid session with the identity provider exists.
        * `FALSE` does not force users to authenticate again to access Snowflake.

        Default: `FALSE`.

    `SAML2_SNOWFLAKE_ISSUER_URL = '<string_literal>'`
    :   The string containing the `EntityID` / `Issuer` for the Snowflake service provider.

        If an incorrect value is specified, Snowflake returns an error message indicating the acceptable values to use.

        The value of this property must match the Snowflake account URL specified in the IdP. It defaults to the
        [legacy URL](../../user-guide/admin-account-identifier.md), so if you define a different [URL format](../../user-guide/organizations-connect.md) in the IdP, make
        sure to set this property appropriately.

    `SAML2_SNOWFLAKE_ACS_URL = '<string_literal>'`
    :   The string containing the Snowflake Assertion Consumer Service URL to which the IdP will send its SAML authentication response back to
        Snowflake. This property will be set in the SAML authentication request generated by Snowflake when initiating a SAML SSO operation with the IdP.

        If an incorrect value is specified, Snowflake returns an error message indicating the acceptable values to use.

        The value of this property must match the Snowflake account URL specified in the IdP. It defaults to the
        [legacy URL](../../user-guide/admin-account-identifier.md), so if you define a different [URL format](../../user-guide/organizations-connect.md) in the IdP,
        make sure to set this property appropriately.

        Default: `https://<account_locator>.<region>.snowflakecomputing.com/fed/login`

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the integration.

        Default: No value

`REFRESH ...`
:   Updates integration values.

    `SAML2_SNOWFLAKE_PRIVATE_KEY`
    :   Generates a new private key and self-signed certificate for a SAML2 security integration. The old private key and self-signed
        certificate are overwritten by the new ones. For more information about best practices for rotating keys, see
        [Manage Your SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

    `METADATA_URL`
    :   Updates the security integration with the current IdP configuration settings. Snowflake uses the metadata URL, which is specified with
        the integration’s `METADATA_URL` parameter, to update the settings. Use `REFRESH METADATA_URL` to keep Snowflake
        synchronized with modified IdP configuration settings, including certificate updates, without manually updating parameters.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the security integration, which resets them back to their defaults:

    * `ENABLED`
    * `TAG tag_name [ , tag_name ... ]`

## Usage notes

* Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* After rotating your `SAML2_SNOWFLAKE_PRIVATE_KEY` using the `REFRESH` command, you need to upload your new
  `SAML2_SNOWFLAKE_X509_CERT` value to your IdP, otherwise SAML authentication stops working. For more information about best
  practices around rotating keys, see [Manage Your SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

## Examples

* The following example initiates operation of a suspended integration:

  ```sqlexample
  ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
  ```
* The following example rotates the private key and generates a new self-signed certificate for a SAML2 security integration named
  `my_idp`:

  > **Caution:**
  >
  > After running the command below, SAML authentication stops working because your IdP still uses your old
  > `SAML2_SNOWFLAKE_X509_CERT` certificate. To minimize disruptions, you should run this command when users are not as active. For
  > more information, see [Manage Your SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

  ```sqlexample
  alter security integration my_idp refresh SAML2_SNOWFLAKE_PRIVATE_KEY;
  ```

---
title: ALTER SECURITY INTEGRATION (SCIM)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-scim.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (SCIM)

Modifies the properties of an existing SCIM security integration. For information about modifying other types of
security integrations (e.g. SAML2), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (SCIM)](create-security-integration-scim.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
    [ ENABLED = { TRUE | FALSE } ]
    [ NETWORK_POLICY = '<network_policy>' ]
    [ REJECT_TOKENS_ISSUED_BEFORE = '<datetime_string>' ]
    [ SYNC_PASSWORD = { TRUE | FALSE } ]
    [ COMMENT = '<string_literal>' ]

ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>  UNSET {
                                                            NETWORK_POLICY |
                                                            [ , ... ]
                                                            }
ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>'
    [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = TRUE | FALSE`
    :   Specifies whether the security integration is enabled. To disable the integration, set `ENABLED = FALSE`.

    `NETWORK_POLICY = 'network_policy'`
    :   Specifies an existing [network policy](../../user-guide/network-policies.md) that controls SCIM network traffic.

        If there are also network policies set for the account or user, see [Network policy precedence](../../user-guide/network-policies.md).

    `REJECT_TOKENS_ISSUED_BEFORE = 'datetime_string'`
    :   If this parameter is set, tokens issued before the date specified are rejected. This can mitigate security risks associated with long-lived or
        potentially compromised tokens. When this parameter is unset or not specified, tokens have no expiration date, and tokens that were previously rejected because of this
        mechanism will be considered valid. Tokens issued before this date that have already been approved are not invalidated, but new
        requests to validate tokens issued before this date will fail.

        This parameter cannot be assigned in the CREATE SECURITY INTEGRATION statement; it can be added only after the integration is created.

        The format is any [valid Snowflake timestamp format](../date-time-input-output.md), with an optional time zone. If the
        time zone is not provided, it is inferred from the current user settings. For example:

        * ‘Tue, 30 Sep 2025 12:30:00 -0700’
        * ‘Tue, 30 Sep 2025 12:30:00’
        * ‘2025-09-30 12:30:00’

        Default: No earliest issue date.

    `SYNC_PASSWORD = TRUE | FALSE`
    :   Specifies whether to enable or disable the synchronization of a user password from an Okta SCIM client as part of the API request to
        Snowflake.

        * `TRUE` enables password synchronization.
        * `FALSE` disables password synchronization.

        Default `FALSE`. If a security integration is created without setting this parameter, Snowflake sets this parameter to `FALSE`.

        If user passwords should not be synchronized from the client to Snowflake, ensure this property value is set to `FALSE` and
        disable password synchronization in the Okta client.

        Note that this property is only supported for Okta SCIM integrations. Microsoft Entra ID SCIM integrations are not supported because
        Microsoft Entra ID does not support password synchronization. To request support, please contact Microsoft.

        For details, see [Snowflake SCIM support](../../user-guide/scim-intro.md).

    `COMMENT`
    :   String (literal) that specifies a comment for the integration.

        Default: No value

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the security integration, which resets them back to their defaults:

    * `NETWORK_POLICY`
    * `REJECT_TOKENS_ISSUED_BEFORE`
    * `SYNC_PASSWORD`
    * `COMMENT`
    * `TAG tag_name [ , tag_name ... ]`

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

```sqlexample
ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
```

The following code adds a token age limit to the SCIM integration; tokens issued before noon on September 30, 2025, are considered invalid.

```sqlexample
ALTER SECURITY INTEGRATION 'example_integration'
  SET REJECT_TOKENS_ISSUED_BEFORE = '2025-09-30 12:00:00';
```

---
title: ALTER SECURITY INTEGRATION (Snowflake OAuth)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-security-integration-oauth-snowflake.md
section: SQL Commands
---

# ALTER SECURITY INTEGRATION (Snowflake OAuth)

Modifies the properties of an existing security integration created for a Snowflake OAuth client. For information about modifying other
types of security integrations (e.g. External OAuth), see [ALTER SECURITY INTEGRATION](alter-security-integration.md).

See also:
:   [CREATE SECURITY INTEGRATION (Snowflake OAuth)](create-security-integration-oauth-snowflake.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ SECURITY ] INTEGRATION <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ SECURITY ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

**Snowflake OAuth for partner applications**

> ```sqlsyntax
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
>   [ ENABLED = { TRUE | FALSE } ]
>   [ OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE } ]
>   [ OAUTH_REDIRECT_URI ] = '<uri>'
>   [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
>   [ OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = { TRUE | FALSE } ]
>   [ OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE } ]
>   [ BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
>   [ NETWORK_POLICY = '<network_policy>' ]
>   [ USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE } ]
>   [ COMMENT = '<string_literal>' ]
>
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>
>   REFRESH { OAUTH_CLIENT_SECRET | OAUTH_CLIENT_SECRET_2 }
>
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> UNSET {
>   ENABLED |
>   COMMENT
>   }
>   [ , ... ]
> ```

**Snowflake OAuth for custom clients**

> ```sqlsyntax
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name> SET
>   [ ENABLED = { TRUE | FALSE } ]
>   [ OAUTH_REDIRECT_URI = '<uri>' ]
>   [ OAUTH_ALLOW_NON_TLS_REDIRECT_URI = { TRUE | FALSE } ]
>   [ OAUTH_ENFORCE_PKCE = { TRUE | FALSE } ]
>   [ PRE_AUTHORIZED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
>   [ BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
>   [ OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE } ]
>   [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
>   [ OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = { TRUE | FALSE } ]
>   [ OAUTH_USE_SECONDARY_ROLES = IMPLICIT | NONE ]
>   [ NETWORK_POLICY = '<network_policy>' ]
>   [ OAUTH_CLIENT_RSA_PUBLIC_KEY = <public_key1> ]
>   [ OAUTH_CLIENT_RSA_PUBLIC_KEY_2 = <public_key2> ]
>   [ USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE } ]
>   [ COMMENT = '{string_literal}' ]
>
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>
>   REFRESH { OAUTH_CLIENT_SECRET | OAUTH_CLIENT_SECRET_2 }
>
> ALTER [ SECURITY ] INTEGRATION [ IF EXISTS ] <name>  UNSET {
>                                                            ENABLED                       |
>                                                            NETWORK_POLICY                |
>                                                            OAUTH_CLIENT_RSA_PUBLIC_KEY   |
>                                                            OAUTH_CLIENT_RSA_PUBLIC_KEY_2 |
>                                                            OAUTH_USE_SECONDARY_ROLES = IMPLICIT | NONE
>                                                            COMMENT
>                                                            }
>                                                            [ , ... ]
> ```

## Parameters

### Snowflake OAuth partner application parameters

Use these parameters when `OAUTH_CLIENT = <partner_application>` in the security integration. For example, these parameters are valid
for `OAUTH_CLIENT = TABLEAU_SERVER`.

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` allows the integration to run based on the parameters specified in the pipe definition.
        * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `OAUTH_REDIRECT_URI = 'uri'`
    :   Specifies the client URI. After a user is authenticated, the web browser is redirected to this URI.

        This parameter is required when `OAUTH_CLIENT = LOOKER`. For details, see the example in the
        [Looker documentation](https://docs.looker.com/setup-and-management/database-config/snowflake#oauth).

    `OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE }`
    :   Boolean that specifies whether to allow the client to exchange a refresh token for an access token when the current access token has
        expired. If set to `FALSE`, a refresh token is not issued. User consent is revoked, and the user must confirm authorization again.

        Default: `TRUE`

    `OAUTH_REFRESH_TOKEN_VALIDITY = integer`
    :   Integer that specifies how long refresh tokens should be valid (in seconds). This can be used to expire the refresh token periodically.

        Note that if your organization would like the minimum or maximum values lowered or raised, respectively, ask your account administrator
        to send a request to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Values: `86400` (1 day) to `7776000` (90 days)

        Default: `7776000`

    `OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED =  { TRUE | FALSE }`
    :   Specifies whether [single-use refresh tokens](../../user-guide/single-use-refresh-tokens.md) should be used.

        Default: `FALSE`

    `OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE }`
    :   * `IMPLICIT` - Default secondary roles set in the user properties are activated by default in the session being opened.
        * `NONE` - Default secondary roles are not supported in the session being opened.

        Default: `NONE`

    `BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
    :   Comma-separated list of Snowflake roles that a user cannot explicitly consent to using after authenticating
        (e.g. `'custom_role1', 'custom_role2'`).

        By default, Snowflake prevents the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles from authenticating. To allow these
        privileged roles to authenticate, use the [ALTER ACCOUNT](alter-account.md) command to set the [OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../parameters.md) account parameter to `FALSE`.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE }`
    :   When TRUE, the interaction between Snowflake as the authorization server and the user who is authenticating uses
        [private connectivity](../../user-guide/private-connectivity-inbound.md). Interactions between Snowflake and the client, including the
        initial request to the authorization endpoint, still happens over the public internet.

        Default: `FALSE`

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

    `REFRESH { OAUTH_CLIENT_SECRET | OAUTH_CLIENT_SECRET_2 }`
    :   Generates a new client secret for the client to use, which allows an administrator to rotate client secrets. Snowflake provides two client
        secrets (OAUTH_CLIENT_SECRET and OAUTH_CLIENT_SECRET_2) for uninterrupted rotation; you can generate a new secret for either of these
        client secrets.

    `NETWORK_POLICY = 'network_policy'`
    :   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic that is
        attempting to exchange an authorization code for an access or refresh token, use a refresh token to obtain a new
        access token, or obtain Snowflake resources with an access token.

        For more information, see [Restricting network traffic for Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md).

### Snowflake OAuth custom client parameters

Use these parameters when `OAUTH_CLIENT = CUSTOM` in the security integration.

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the integration (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether to initiate operation of the integration or suspend it.

        * `TRUE` allows the integration to run based on the parameters specified in the pipe definition.
        * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    `OAUTH_REDIRECT_URI = 'uri'`
    :   Specifies the client URI. After a user is authenticated, the web browser is redirected to this URI. The URI must be protected by TLS
        (Transport Layer Security) unless the optional `OAUTH_ALLOW_NON_TLS_REDIRECT_URI` parameter is set to `TRUE`.

        Do not include query parameters sent with the redirect URI in the request to the [authorization endpoint](../../user-guide/oauth-custom.md). For example, if the value of the `redirect_uri` query parameter in the request
        to the authorization endpoint is `https://www.example.com/connect?authType=snowflake`, make sure the OAUTH_REDIRECT_URI parameter is
        set to `https://www.example.com/connect`.

    `OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED =  { TRUE | FALSE }`
    :   Specifies whether [single-use refresh tokens](../../user-guide/single-use-refresh-tokens.md) should be used.

        Default: `FALSE`

    `OAUTH_ALLOW_NON_TLS_REDIRECT_URI = { TRUE | FALSE }`
    :   If `TRUE`, allows setting `OAUTH_REDIRECT_URI` to a URI not protected by TLS. We highly recommend use of TLS to
        prevent man-in-the-middle OAuth redirects for use in phishing attacks.

        Default: `FALSE`

    `OAUTH_ENFORCE_PKCE = { TRUE | FALSE }`
    :   Boolean that specifies whether Proof Key for Code Exchange (PKCE) should be required for the integration.

        Default: `FALSE`

    `OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE }`
    :   * `IMPLICIT` - Default secondary roles set in the user properties are activated by default in the session being opened.
        * `NONE` - Default secondary roles are not supported in the session being opened.

        Default: `NONE`

    `PRE_AUTHORIZED_ROLES_LIST = '( role_name' [ , 'role_name , ... ] ')`
    :   Comma-separated list of Snowflake roles that a user does not need to explicitly consent to using after authenticating, e.g.
        `'custom_role1', 'custom_role2'`. The ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles cannot be included in this list.

        > **Note:**
        >
        > This parameter is supported for confidential clients only.

    `BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
    :   Comma-separated list of Snowflake roles that a user cannot explicitly consent to using after authenticating
        (e.g. `'custom_role1', 'custom_role2'`).

        The ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles are included in this list by default; however, if these roles should be removed
        for your account, ask your account administrator to send a request to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    `OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE }`
    :   Boolean that specifies whether to allow the client to exchange a refresh token for an access token when the current access token has
        expired. If set to `FALSE`, a refresh token is not issued. User consent is revoked, and the user must confirm authorization again.

        Default: `TRUE`

    `OAUTH_REFRESH_TOKEN_VALIDITY = integer`
    :   Integer that specifies how long refresh tokens should be valid (in seconds). This can be used to expire the refresh token periodically.

        When a refresh token expires, the application will need to direct the user through the authorization flow again to obtain a new refresh
        token.

        The supported minimum, maximum, and default values are as follows:

        | Application | Minimum | Maximum | Default |
        | --- | --- | --- | --- |
        | Tableau Desktop | `60` (1 minute) | `36000` (10 hours) | `36000` (10 hours) |
        | Tableau Cloud | `60` (1 minute) | `7776000` (90 days) | `7776000` (90 days) |
        | Custom client | `86400` (1 day) | `7776000` (90 days) | `7776000` (90 days) |

        If you have a business need to lower the minimum value or raise the maximum value, ask your account administrator to send a request to
        [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    `OAUTH_CLIENT_RSA_PUBLIC_KEY = public_key1`
    :   Specifies an RSA public key.

    `OAUTH_CLIENT_RSA_PUBLIC_KEY_2 = public_key2`
    :   Specifies a second RSA public key. Used for key rotation.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE }`
    :   When TRUE, the interaction between Snowflake as the authorization server and the user who is authenticating uses
        [private connectivity](../../user-guide/private-connectivity-inbound.md). Interactions between Snowflake and the client, including the
        initial request to the authorization endpoint, still happens over the public internet.

        Default: `FALSE`

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

    `REFRESH { OAUTH_CLIENT_SECRET | OAUTH_CLIENT_SECRET_2 }`
    :   Generates a new client secret for the client to use, which allows an administrator to rotate client secrets. Snowflake provides two client
        secrets (OAUTH_CLIENT_SECRET and OAUTH_CLIENT_SECRET_2) for uninterrupted rotation; you can generate a new secret for either of these
        client secrets.

    `NETWORK_POLICY = 'network_policy'`
    :   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic that is
        attempting to exchange an authorization code for an access or refresh token, use a refresh token to obtain a new
        access token, or obtain Snowflake resources with an access token.

        For more information, see [Restricting network traffic for Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md).

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the security integration, which resets them back to their defaults:

    * `ENABLED`
    * `NETWORK_POLICY`
    * `OAUTH_CLIENT_RSA_PUBLIC_KEY`
    * `OAUTH_CLIENT_RSA_PUBLIC_KEY_2`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

```sqlexample
ALTER SECURITY INTEGRATION myint SET ENABLED = TRUE;
```

---
title: ALTER SEMANTIC VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/alter-semantic-view.md
section: SQL Commands
---

# ALTER SEMANTIC VIEW

Modifies the comment for an existing [semantic view](../../user-guide/views-semantic/overview.md) or renames a semantic view.

> **Note:**
>
> You can’t use the ALTER SEMANTIC VIEW command to change properties other than the comment. To change other properties of the
> semantic view, replace the semantic view. See [Replacing an existing semantic view](../../user-guide/views-semantic/sql.md).

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
ALTER SEMANTIC VIEW [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER SEMANTIC VIEW [ IF EXISTS ] <name> SET
  COMMENT = '<string_literal>'

ALTER SEMANTIC VIEW [ IF EXISTS ] <name> UNSET
  COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the semantic view to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Changes the name of the semantic view to `new_name`. The new identifier must be unique within the schema.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Sets one or more specified properties or parameters for the semantic view:

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the semantic view.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the semantic view, which resets the properties to their defaults:

    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Semantic view | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example renames a semantic view:

```sqlexample
ALTER SEMANTIC VIEW sv RENAME TO sv_new_name;
```

The following example sets the comment for a semantic view:

```sqlexample
ALTER SEMANTIC VIEW my_semantic_view SET COMMENT = 'my comment';
```

The following example unsets the existing comment for a semantic view:

```sqlexample
ALTER SEMANTIC VIEW my_semantic_view UNSET COMMENT;
```

---
title: ALTER SEQUENCE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-sequence.md
section: SQL Commands
---

# ALTER SEQUENCE

Modifies the properties for an existing sequence.

See also:
:   [CREATE SEQUENCE](create-sequence.md) , [DROP SEQUENCE](drop-sequence.md) , [SHOW SEQUENCES](show-sequences.md) , [DESCRIBE SEQUENCE](desc-sequence.md)

## Syntax

```sqlsyntax
ALTER SEQUENCE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER SEQUENCE [ IF EXISTS ] <name> [ SET ] [ INCREMENT [ BY ] [ = ] <sequence_interval> ]

ALTER SEQUENCE [ IF EXISTS ] <name> SET
  [ { ORDER | NOORDER } ]
  [ COMMENT = '<string_literal>' ]

ALTER SEQUENCE [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the sequence to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the sequence; must be unique for the schema.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET...`
:   Specifies the properties to set for the sequence:

    `[ INCREMENT [ BY ] sequence_interval ]`
    :   Specifies the step interval of the sequence:

        * For positive sequence interval `n`, the next `n-1` values are reserved by each sequence call.
        * For negative sequence interval `-n`, the next `n-1` lower values are reserved by each sequence call.

        Supported values are any non-zero value that can be represented by a 64-bit two’s complement integer.

    `{ ORDER | NOORDER }`
    :   Specifies whether or not the values are generated for the sequence in
        [increasing order](../../user-guide/querying-sequences.md).

        * ORDER specifies that the values generated for a sequence or auto-incremented column are in increasing order (or, if the interval
          is a negative value, in decreasing order).

          For example, if a sequence or auto-incremented column has `START 1 INCREMENT 2`, the generated values might be
          `1`, `3`, `5`, `7`, `9`, etc.
        * NOORDER specifies that the values are not guaranteed to be in increasing order.

          For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.

          NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
          clients are executing multiple INSERT statements).

        > **Note:**
        >
        > If a sequence is set to NOORDER, you cannot change the sequence to ORDER.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the sequence.

`UNSET ...`
:   Specifies the properties to unset for the sequence, which resets them to the defaults.

    Currently, the only property you can unset is COMMENT, which removes the comment, if one exists, for the sequence.

## Usage notes

* The first/initial value for a sequence cannot be changed after the sequence is created.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename sequence `myseq` to `newseq`:

```sqlexample
ALTER SEQUENCE myseq RENAME TO newseq;
```

More examples are available in [Using Sequences](../../user-guide/querying-sequences.md).

---
title: ALTER SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-service.md
section: SQL Commands
---

# ALTER SERVICE

Modifies [Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md)
configuration, upgrades the code for the service, and allows you to suspend or resume a service. You can:

* Apply modifications to a running service. For example, suspend or resume a service, and update the number of service instances
  running.
* Apply modifications that take effect only after service is restarted. For example, specify a default warehouse for queries.
* Apply modifications that cause Snowflake to shut down the service, and restart using new code. For example, you might want to
  deploy updated service code.
* Restart the specified instances of a service using the snapshot provided as the initial content for the specified volume. The service must be suspended before you execute ALTER SERVICE.

See also:
:   [CREATE SERVICE](create-service.md) , [DESCRIBE SERVICE](desc-service.md), [DROP SERVICE](drop-service.md) , [SHOW SERVICES](show-services.md)

## Syntax

```sqlsyntax
ALTER SERVICE [ IF EXISTS ] <name> { SUSPEND | RESUME }

ALTER SERVICE [ IF EXISTS ] <name>
  {
     fromSpecification
     | fromSpecificationTemplate
  }

ALTER SERVICE [IF EXISTS] <service_name> RESTORE VOLUME <volume_name>
                                                 INSTANCES <comma_separated_instance_ids>
                                                 FROM SNAPSHOT <snapshot_name>

ALTER SERVICE [ IF EXISTS ] <name> SET [ MIN_INSTANCES = <num> ]
                                       [ MAX_INSTANCES = <num> ]
                                       [ LOG_LEVEL = '<log_level>' ]
                                       [ AUTO_SUSPEND_SECS = <num> ]
                                       [ MIN_READY_INSTANCES = <num> ]
                                       [ QUERY_WAREHOUSE = <warehouse_name> ]
                                       [ AUTO_RESUME = { TRUE | FALSE } ]
                                       [ EXTERNAL_ACCESS_INTEGRATIONS = ( <EAI_name> [ , ... ] ) ]
                                       [ COMMENT = '<string_literal>' ]

ALTER SERVICE [ IF EXISTS ] <name> UNSET { MIN_INSTANCES                |
                                           AUTO_SUSPEND_SECS            |
                                           MAX_INSTANCES                |
                                           LOG_LEVEL                    |
                                           MIN_READY_INSTANCES          |
                                           QUERY_WAREHOUSE              |
                                           AUTO_RESUME                  |
                                           EXTERNAL_ACCESS_INTEGRATIONS |
                                           COMMENT
                                         }
                                         [ , ... ]

ALTER SERVICE [ IF EXISTS ] <name> SET [ TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]]
```

Where:

> ```sqlsyntax
> fromSpecification ::=
>   {
>     FROM SPECIFICATION_FILE = '<yaml_file_path>' -- for native app service.
>     | FROM @<stage> SPECIFICATION_FILE = '<yaml_file_path>' -- for non-native app service.
>     | FROM SPECIFICATION <specification_text>
>   }
> ```
>
> ```sqlsyntax
> fromSpecificationTemplate ::=
>   {
>     FROM SPECIFICATION_TEMPLATE_FILE = '<yaml_file_path>' -- for native app service.
>     | FROM @<stage> SPECIFICATION_TEMPLATE_FILE = '<yaml_file_path>' -- for non-native app service.
>     | FROM SPECIFICATION_TEMPLATE <specification_text>
>   }
>   USING ( <key> => <value> [ , <key> => <value> [ , ... ] ]  )
> ```

## Parameters

`name`
:   Specifies the identifier for the service to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`{ SUSPEND | RESUME }`
:   Specifies whether to suspend or resume the service.

    When you suspend a service, Snowflake shuts down and deletes the containers. If you later resume a suspended service,Snowflake
    recreates the containers. That is, Snowflake takes the image from your repository and starts the containers. Note that, Snowflake deploys the same image version; it is not a service update operation.

    When you invoke a suspended service using either a service function or invoking the public endpoint (ingress), Snowflake
    automatically resumes the service.

`FROM ...`
:   Identifies the [specification](../../developer-guide/snowpark-container-services/specification-reference.md) or
    the [template](../../developer-guide/snowpark-container-services/working-with-services.md) specification for the service.

    **Using a service specification**

    You can either define the specification either [inline or in a separate file](../../developer-guide/snowpark-container-services/working-with-services.md).

    `SPECIFICATION_FILE = 'yaml_file_path'` or . `@stage SPECIFICATION_FILE = 'yaml_file_path'` or . `SPECIFICATION specification_text`
    :   Specifies the file containing the service specification or the service specification inline. If your service specification is in a file, use SPECIFICATION_FILE. For services created in a Snowflake Native App, omit `@stage`, and specify a path relative to the app root directory. For services created in other contexts, specify the Snowflake internal stage and path to the service specification file.

    **Using a service specification template**

    You can either define the [template specification](../../developer-guide/snowpark-container-services/working-with-services.md) either [inline or in a separate file](../../developer-guide/snowpark-container-services/working-with-services.md).

    `SPECIFICATION_TEMPLATE_FILE = 'yaml_file_path'` or . `@stage SPECIFICATION_TEMPLATE_FILE = 'yaml_file_path'` or . `SPECIFICATION_TEMPLATE specification_text`
    :   Specifies the file containing the service specification template or the service specification template inline. If your service specification template is in a file, use SPECIFICATION_TEMPLATE_FILE. For services created in a Snowflake Native App, omit `@stage`, and specify a path relative to the app root directory. For services created in other contexts, specify the Snowflake internal stage and path to the service specification file. When using template specification, you should also include the `USING` parameter.

    `USING ( key => value [ , key => value [ , ... ] ]  )`
    :   Specifies the template variables and the values of those variables.

        * `key` is the name of the template variable. The template variable name can optionally be enclosed in double quotes
          (`"`).
        * `value` is the value to assign to the variable in the template. String values must be enclosed in `'` or
          `$$`. The value must either be alphanumeric or valid JSON.

        Use a comma between each key-value pair.

`RESTORE VOLUME volume_name INSTANCES comma_separated_instance_ids FROM SNAPSHOT snapshot_name`
:   Restores the snapshot `snapshot_name` on the existing block storage volume `volume_name` for the instances `comma_separated_instance_ids`.

    Snapshots can only be taken for block storage volumes (and not for local, memory, or stage volumes).

    Volume names are case-sensitive. Therefore, double quotes should always be used to match the corresponding name in the service specification.

`SET ...`
:   Sets one or more specified properties or parameters for the service:

    `MIN_INSTANCES = num`
    :   Specifies the minimum number of service instances.

    `MAX_INSTANCES = num`
    :   Specifies the maximum number of service instances.

    `LOG_LEVEL = 'log_level'`
    :   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
        the specified level (and at more severe levels) are ingested.
        Currently, LOG_LEVEL is supported only for [platform events](../../developer-guide/snowpark-container-services/monitoring-services.md), Changing LOG_LEVEL for [container logs](../../developer-guide/snowpark-container-services/monitoring-services.md) is not supported.

        > For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

    `AUTO_SUSPEND_SECS = num`
    :   Specifies the number of seconds of inactivity (service is idle) after which Snowflake automatically suspends the service. When AUTO_SUSPEND_SECS is 0 (default), Snowflake does not auto-suspend the service. You can configure this value to 300 seconds or more to enable auto-suspension. For more information, see [Suspending a service](../../developer-guide/snowpark-container-services/working-with-services.md).

        [Preview Feature](../../release-notes/preview-features.md) — Open

        Configuring the automatic suspension of a Snowpark Container Services service using the AUTO_SUSPEND_SECS property is a [preview feature](../../release-notes/preview-features.md).

    `MIN_READY_INSTANCES = num`
    :   Specifies the minimum service instances that must be ready for Snowflake to consider the service ready to process requests. For more information, see [Scaling services](../../developer-guide/snowpark-container-services/working-with-services.md).

    `QUERY_WAREHOUSE = warehouse_name`
    :   Warehouse to use if a service container connects to Snowflake to execute a query but does not explicitly specify a warehouse to use.

    `AUTO_RESUME = { TRUE | FALSE }`
    :   Specifies whether to automatically resume a service when user performs one of the following actions that depend on the service:

        * Executing a query is that uses a [service function](../../developer-guide/snowpark-container-services/working-with-services.md).
        * Sending a request to the public endpoint exposed by the service ([ingress](../../developer-guide/snowpark-container-services/working-with-services.md)).

        If AUTO_RESUME is FALSE, you need to explicitly resume the service (using ALTER SERVICE … RESUME).

        Default: TRUE.

    `EXTERNAL_ACCESS_INTEGRATIONS = ( EAI_name [ , ... ] )`
    :   Specifies the names of the [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md) that allow your service to access external sites.
        Snowflake replaces all the existing EAIs with those specified in this parameter.
        The names in this list are case-sensitive. For more information, see [Configure service egress](../../developer-guide/snowpark-container-services/service-network-communications.md).
        Note that this changes the allowed network access for all running instances of the service. You don’t need to explicitly suspend and resume the service.

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the service.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more properties and/or parameters to unset for the service, which resets them to the defaults (see
    [CREATE SERVICE](create-service.md)):

    * `MIN_INSTANCES`
    * `MAX_INSTANCES`
    * `AUTO_SUSPEND_SECS`

      [Preview Feature](../../release-notes/preview-features.md) — Open

      Configuring the automatic suspension of a Snowpark Container Services service using the AUTO_SUSPEND_SECS property is a [preview feature](../../release-notes/preview-features.md).
    * `MIN_READY_INSTANCES`
    * `QUERY_WAREHOUSE`
    * `AUTO_RESUME`
    * `EXTERNAL_ACCESS_INTEGRATIONS`
    * `COMMENT`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Snapshot | To restore a snapshot, the role requires this privilege on the snapshot. |
| OWNERSHIP | Service | To alter the service and set/unset properties and tags, the role requires this privilege. |
| OPERATE | Service | To alter the service, except set/unset properties, the role requires this privilege. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Suspend a service.

```sqlexample
ALTER SERVICE echo_service SUSPEND;
```

Modify the MIN_INSTANCES and MAX_INSTANCES properties of an existing service.

```sqlexample
ALTER SERVICE echo_service SET MIN_INSTANCES=3 MAX_INSTANCES=5;
```

Restore a snapshot on an existing block volume associated with instances 0 and 2 of the `example_service` service.

```sqlexample
ALTER SERVICE example_service
  RESTORE VOLUME "myvolume"
  INSTANCES 0,2
  FROM SNAPSHOT my_snapshot;
```

---
title: ALTER SESSION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-session.md
section: SQL Commands
---

# ALTER SESSION

Sets parameters that change the behavior for the current session.

See also:
:   [SHOW PARAMETERS](show-parameters.md)

## Syntax

```sqlsyntax
ALTER SESSION SET sessionParams

ALTER SESSION UNSET <param_name> [ , <param_name> , ... ]
```

Where:

> ```sqlsyntax
> sessionParams ::=
>   ABORT_DETACHED_QUERY = TRUE | FALSE
>   ACTIVE_PYTHON_PROFILER = 'LINE' | 'MEMORY'
>   AUTOCOMMIT = TRUE | FALSE
>   BINARY_INPUT_FORMAT = <string>
>   BINARY_OUTPUT_FORMAT = <string>
>   DATE_INPUT_FORMAT = <string>
>   DATE_OUTPUT_FORMAT = <string>
>   ERROR_ON_NONDETERMINISTIC_MERGE = TRUE | FALSE
>   ERROR_ON_NONDETERMINISTIC_UPDATE = TRUE | FALSE
>   GEOGRAPHY_OUTPUT_FORMAT = 'GeoJSON' | 'WKT' | 'WKB' | 'EWKT' | 'EWKB'
>   HYBRID_TABLE_LOCK_TIMEOUT = <num>
>   JSON_INDENT = <num>
>   LOG_LEVEL = <string>
>   LOCK_TIMEOUT = <num>
>   OPT_OUT_ERROR_LOGGING = TRUE | FALSE
>   PYTHON_PROFILER_TARGET_STAGE = <string>
>   PYTHON_PROFILER_MODULES = <string>
>   QUERY_TAG = <string>
>   ROWS_PER_RESULTSET = <num>
>   S3_STAGE_VPCE_DNS_NAME = <string>
>   SEARCH_PATH = <string>
>   SIMULATED_DATA_SHARING_CONSUMER = <string>
>   STATEMENT_TIMEOUT_IN_SECONDS = <num>
>   STRICT_JSON_OUTPUT = TRUE | FALSE
>   TIMESTAMP_DAY_IS_ALWAYS_24H = TRUE | FALSE
>   TIMESTAMP_INPUT_FORMAT = <string>
>   TIMESTAMP_LTZ_OUTPUT_FORMAT = <string>
>   TIMESTAMP_NTZ_OUTPUT_FORMAT = <string>
>   TIMESTAMP_OUTPUT_FORMAT = <string>
>   TIMESTAMP_TYPE_MAPPING = <string>
>   TIMESTAMP_TZ_OUTPUT_FORMAT = <string>
>   TIMEZONE = <string>
>   TIME_INPUT_FORMAT = <string>
>   TIME_OUTPUT_FORMAT = <string>
>   TRACE_LEVEL = <string>
>   TRANSACTION_DEFAULT_ISOLATION_LEVEL = <string>
>   TWO_DIGIT_CENTURY_START = <num>
>   UNSUPPORTED_DDL_ACTION = <string>
>   USE_CACHED_RESULT = TRUE | FALSE
>   WEEK_OF_YEAR_POLICY = <num>
>   WEEK_START = <num>
> ```

> **Note:**
>
> For readability, the complete list of session parameters that can be set is not included here. For a complete list of all session parameters,
> with their descriptions, as well as account and object parameters, see [Parameters](../parameters.md).

## Parameters

`SET ...`
:   Specifies one (or more) parameters to set for the session (separated by blank spaces, commas, or new lines).

    For descriptions of each of the parameters you can set for a session, see [Parameters](../parameters.md).

`UNSET ...`
:   Specifies one (or more) parameters to unset for the session, which resets them to the defaults.

    You can reset multiple parameters with a single ALTER statement; however, each property must be separated by a comma. When resetting
    a property, specify only the name; specifying a value for the property will return an error.

## Usage notes

* Parameters are typed. The supported types are BOOLEAN, NUMBER, and STRING.
* To see the current parameter values for the session, use [SHOW PARAMETERS](show-parameters.md).

## Examples

Set the lock timeout for statements executed in the session to 1 hour (3600 seconds):

> ```sqlexample
> ALTER SESSION SET LOCK_TIMEOUT = 3600;
> ```

Set the lock timeout for statements executed in the session back to the default:

> ```sqlexample
> ALTER SESSION UNSET LOCK_TIMEOUT;
> ```

---
title: ALTER SESSION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-session-policy.md
section: SQL Commands
---

# ALTER SESSION POLICY

Modifies the properties for an existing session policy.

Any changes made to the session policy properties go into effect when the next SQL query that uses the session policy runs.

See also:
:   [Session Policy DDL Reference](../../user-guide/session-policies-managing.md)

## Syntax

```sqlsyntax
ALTER SESSION POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER SESSION POLICY [ IF EXISTS ] <name> SET
  [ SESSION_IDLE_TIMEOUT_MINS = <integer> ]
  [ SESSION_UI_IDLE_TIMEOUT_MINS = <integer> ]
  [ ALLOWED_SECONDARY_ROLES = ( [ { 'ALL' | <role_name> [ , <role_name> ... ] } ] ) ]
  [ BLOCKED_SECONDARY_ROLES = ( [ { 'ALL' | <role_name> [ , <role_name> ... ] } ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER SESSION POLICY [ IF EXISTS ] <name> SET
  TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SESSION POLICY [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER SESSION POLICY [ IF EXISTS ] <name> UNSET
  [ SESSION_IDLE_TIMEOUT_MINS ]
  [ SESSION_UI_IDLE_TIMEOUT_MINS ]
  [ ALLOWED_SECONDARY_ROLES ]
  [ BLOCKED_SECONDARY_ROLES ]
  [ COMMENT ]
```

## Parameters

`name`
:   Identifier for the session policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the session policy; must be unique for your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies one or more parameters to set for the session policy separated by blank spaces, commas, or new lines.

    `SESSION_IDLE_TIMEOUT_MINS = integer`
    :   For Snowflake clients and programmatic clients, the number of minutes in which a session can be idle before users must authenticate to
        Snowflake again. If a value is not specified, Snowflake uses the default value.

        The number of minutes can be any integer between `5` and `1440`, inclusive.

        Default: `240` (4 hours)

    `SESSION_UI_IDLE_TIMEOUT_MINS = integer`
    :   For Snowsight, the number of minutes in which a session can be idle before a user must authenticate to Snowflake again. If a
        value is not specified, Snowflake uses the default value.

        The number of minutes can be any integer between `5` and `1440`, inclusive.

        Default: `240` (4 hours)

    `ALLOWED_SECONDARY_ROLES = ( [ { 'ALL' | role_name [ , role_name ... ] } ] )`
    :   Specifies the secondary roles for a session policy, if any.

        The possible values for the property are:

        `()`
        :   Disallows secondary roles.

        `('ALL')`
        :   Allows all secondary roles.

        `( role_name [ , role_name ... ] )`
        :   Allows the specified roles as secondary roles. The secondary roles can be user-defined account roles or system roles. Specify the
            role name as it is stored in Snowflake. For details, see [Identifier requirements](../identifiers-syntax.md).

        Default: `('ALL')`. If you unset this property, its value in the output of a [DESCRIBE SESSION POLICY](desc-session-policy.md) command is `'ALL'`.

    `BLOCKED_SECONDARY_ROLES = ( [ { 'ALL' | role_name [ , role_name ... ] } ] )`
    :   Specifies the blocked secondary roles for a session policy, if any. Blocked secondary roles take precedence over
        allowed secondary roles.

        The possible values for the property are:

        `()`
        :   Allows all secondary roles.

        `('ALL')`
        :   Disallows secondary roles.

        `( role_name [ , role_name ... ] )`
        :   Blocks the specified roles as secondary roles. The specified roles, and the roles granted to those roles, cannot be
            activated as secondary roles. These blocked roles can be user-defined account roles or system roles. Specify the
            role name as it is stored in Snowflake. For details, see [Identifier requirements](../identifiers-syntax.md).

        Default: `()`. If you unset this property, its value in the output of a [DESCRIBE SESSION POLICY](desc-session-policy.md) command is
        `'()'`.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the session policy.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one or more parameters to unset for the session policy, which resets them to the system defaults.

    You can reset multiple properties with a single ALTER statement. Each property must be separated by a comma. When
    resetting a property, specify only the name. Specifying a value for the property will return an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Session policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on session policy DDL and privileges, see [Managing session policies](../../user-guide/session-policies-managing.md).

## Usage notes

* If you want to update an existing session policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE SESSION POLICY](desc-session-policy.md) command.
* Before executing an ALTER statement, you can execute a DESCRIBE SESSION POLICY statement to determine the attribute values of the policy.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the session policy to have a Snowsight session timeout value of `15` minutes.

```sqlexample
DESC SESSION POLICY session_policy_prod_1;
```

```output
+---------------------------------+-----------------------+------------------------+--------------------------+--------------------------------------------------+
| createdOn                       | name                  | sessionIdleTimeoutMins | sessionUIIdleTimeoutMins | comment                                          |
+---------------------------------+-----------------------+------------------------+--------------------------+--------------------------------------------------+
| Mon, 11 Jan 2021 00:00:00 -0700 | session_policy_prod_1 | 30                     | 30                       | session policy for use in the prod_1 environment |
+---------------------------------+-----------------------+------------------------+--------------------------+--------------------------------------------------+
```

```sqlexample
ALTER SESSION POLICY session_policy_prod_1 SET SESSION_UI_IDLE_TIMEOUT_MINS = 15;
```

---
title: ALTER SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-share.md
section: SQL Commands
---

# ALTER SHARE

Modifies the properties for an existing [share](../../user-guide/data-sharing-intro.md):

* Adds or removes accounts from the list of accounts.
* Sets a new list of accounts with which the corresponding database for the share is shared.
* Modifies other properties. For parameter details, see [Parameters](../parameters.md).

See also:
:   [CREATE SHARE](create-share.md) , [DROP SHARE](drop-share.md) , [DESCRIBE SHARE](desc-share.md) , [SHOW SHARES](show-shares.md)

## Syntax

```sqlsyntax
ALTER SHARE [ IF EXISTS ] <name> { ADD | REMOVE } ACCOUNTS = <consumer_account> [ , <consumer_account> , ... ]
                                        [ SHARE_RESTRICTIONS = { TRUE | FALSE } ]

ALTER SHARE [ IF EXISTS ] <name> SET { [ ACCOUNTS = <consumer_account> [ , <consumer_account> ... ] ]
                                       [ COMMENT = '<string_literal>' ] }

ALTER SHARE [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SHARE <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER SHARE [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the share to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`ADD | REMOVE ACCOUNTS = consumer_account [ , consumer_account , ... ]`
:   Specifies the name of the account(s) to add or remove from the list of accounts for the share:

    * Adding an account to a share that was already in the list has no effect.
    * Removing an account that has already imported the shared database immediately revokes that account’s access to the database. If the account
      is later added back to the share, the account must re-create the database before they can use it again.
    * Removing an account from a share that was not already in the list of shared accounts has no effect.

    This parameter adds to (or removes from) the existing list of accounts for the share. If you want to replace the entire list of accounts, use
    `SET` instead.

    `SHARE_RESTRICTIONS = { TRUE | FALSE }`

    > `FALSE`: A Standard or Enterprise consumer account can be added to a share belonging to a Business Critical provider account.
    > A non-HIPAA consumer account can be added to a share belonging to a HIPAA-compliant provider account.
    >
    > `TRUE`: A Standard or Enterprise consumer account cannot be added to a share belonging to a Business Critical provider account.
    > A non-HIPAA consumer account cannot be added to a share belonging to a HIPAA-compliant provider account.
    >
    > Default:
    > :   `TRUE`
    >
    > > **Important:**
    > >
    > > You must set this parameter each time you add a new non-Business Critical consumer account to the share belonging to a Business Critical provider account,
    > > or each time you add a new non-HIPAA consumer account to the share belonging to a HIPAA-compliant provider account.
    > > For more information see, [Override share restrictions](../../user-guide/override_share_restrictions.md).

`SET...`

> `ACCOUNTS = consumer_account [ , consumer_account ... ]`
> :   Specifies the account(s) to replace all previous accounts with which the share was shared. To add/remove individual accounts from the
>     list, use `ADD | REMOVE` instead.
>
> `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
> :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.
>
>     The tag value is always a string, and the maximum number of characters for the tag value is 256.
>
>     For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).
>
> `COMMENT = 'string'`
> :   Adds a comment or overwrites an existing comment for the share.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the share, which resets them back to their defaults:

    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Usage notes

* One of the following privileges is required to alter a share:

  > + The OWNERSHIP privilege which is granted to the role that creates the share.
  > + The MANAGE SHARE TARGET privilege determines which roles can add or remove accounts from a share.
  >   Only roles granted MANAGE SHARE TARGET can add or remove share account access.
* Keywords `ACCOUNT` and `ACCOUNTS` are both supported and can be used interchangeably.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Add two accounts to the existing share named `sales_s`:

> ```sqlexample
> ALTER SHARE sales_s ADD ACCOUNTS=<orgname.accountname1>,<orgname.accountname2>;
>
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | Statement executed successfully. |
> +----------------------------------+
> ```

Remove account `<orgname.accountname>;` from `sales_s`:

> ```sqlexample
> ALTER SHARE sales_s REMOVE ACCOUNT=<orgname.accountname>;
>
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | Statement executed successfully. |
> +----------------------------------+
> ```

Grant MANAGE SHARE TARGET to a role, and use that role manage share targets:

```sqlexample
GRANT MANAGE SHARE TARGET ON ACCOUNT TO ROLE <role_name>;

GRANT ROLE <role_name> TO USER <user_name>;

USE ROLE <role_name>;

ALTER SHARE <data_share_name> ADD ACCOUNTS = <orgname.accountname1>,<orgname.accountname2>;
```

Set a new comment for `sales_s`:

> ```sqlexample
> ALTER SHARE sales_s SET COMMENT='This share contains sales data for 2017';
>
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | Statement executed successfully. |
> +----------------------------------+
> ```

---
title: ALTER SNAPSHOT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-snapshot.md
section: SQL Commands
---

# ALTER SNAPSHOT

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Modifies the properties of an existing [snapshot of a block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md).

See also:
:   [CREATE SNAPSHOT](create-snapshot.md) , [DESCRIBE SNAPSHOT](desc-snapshot.md), [DROP SNAPSHOT](drop-snapshot.md), [SHOW SNAPSHOTS](show-snapshots.md)

## Syntax

```sqlsyntax
ALTER SNAPSHOT [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'
```

## Parameters

`name`
:   Specifies the identifier for the snapshot to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Sets one or more specified properties or parameters for the snapshot:

    `COMMENT = string-literal`
    :   Specifies a comment for the snapshot.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Snapshot | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example sets a comment on the `example_snapshot` snapshot.

```sqlexample
ALTER SNAPSHOT example_snapshot SET COMMENT = 'sample comment.';
```

---
title: ALTER SNAPSHOT POLICY — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/alter-snapshot-policy.md
section: SQL Commands
---

# ALTER SNAPSHOT POLICY — *Deprecated*

Modifies the properties of a [snapshot](../../user-guide/backups.md) policy. The following changes are supported:

* Rename the policy.
* Add or update the comment for the policy.
* Change the schedule and expiration settings for the policy. The schedule determines how often Snowflake
  automatically makes a backup and adds the resulting snapshot to the snapshot set that’s governed by the policy.
  The expiration period determines how long each snapshot is retained before Snowflake automatically deletes it from the
  associated snapshot set.
* Unset properties of the policy, so that they revert back to their default values.

See also:
:   [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md),
    [DROP SNAPSHOT POLICY — Deprecated](drop-snapshot-policy.md),
    [SHOW SNAPSHOT POLICIES — Deprecated](show-snapshot-policies.md)

## Syntax

```sqlsyntax
ALTER SNAPSHOT POLICY <name> RENAME TO <new_name>

ALTER SNAPSHOT POLICY <name> SET
  [ COMMENT = '<string_literal>' ]
  [ SCHEDULE = '{ <num> MINUTE | <num> HOUR | USING CRON <expr> <time_zone> }' ]
  [ EXPIRE_AFTER_DAYS = <days_integer> ]

ALTER SNAPSHOT POLICY <name> UNSET { COMMENT | SCHEDULE | EXPIRE_AFTER_DAYS }

ALTER SNAPSHOT POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SNAPSHOT POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the snapshot policy.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies a new identifier for the snapshot policy; must be unique for your account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET...`
:   Specifies one or more properties to set for the snapshot policy (separated by blank spaces, commas, or new lines):

    `COMMENT = 'string_literal'`
    :   Specifies a comment for the snapshot policy.

    `SCHEDULE = '{ num MINUTE | num HOUR | USING CRON expr time_zone }'`
    :   Specifies the schedule for creating snapshots of an object.

        > **Note:**
        >
        > The minimum schedule for snapshots is 60 minutes or 1 hour.
        >
        > Every policy must include a SCHEDULE clause, an EXPIRE_AFTER_DAYS clause, or both.

        * `USING CRON expr time_zone`
          :   Specifies a cron expression and time zone for the point in time a snapshot of an object is created. Supports a subset of
              standard cron utility syntax.

              For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
              (in Wikipedia).

              The cron expression consists of the following fields:

              ```output
              # __________ minute (0-59)
              # | ________ hour (0-23)
              # | | ______ day of month (1-31, or L)
              # | | | ____ month (1-12, JAN-DEC)
              # | | | | __ day of week (0-6, SUN-SAT, or L)
              # | | | | |
              # | | | | |
                * * * * *
              ```

              The following special characters are supported:

              `*`
              :   Wildcard. Specifies any occurrence of the field.

              `L`
              :   Stands for “last”. When used in the day-of-week field, it lets you specify constructs such as “the last Friday” (“5L”) of a
                  given month. In the day-of-month field, it specifies the last day of the month.

              `/n`
              :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
                  specified in the month field, then the snapshot is scheduled for April, July and October (that is, every 3 months, starting with the 4th
                  month of the year). The same schedule is maintained in subsequent years. That is, the snapshot is not scheduled to run in
                  January (3 months after the October run).

              > **Note:**
              > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
              >   for the account (or setting the value at the user or session level) does not change the time zone for the snapshot.
              > + The cron expression defines all valid run times for the snapshot. Snowflake attempts to create a snapshot based on
              >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
              > + When both a specific day of month and day of week are included in the cron expression, then the snapshot is scheduled on days
              >   satisfying either the day of the month or the day of the week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
              >   schedules a snapshot at 0AM (midnight) on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
        * `num MINUTE`
          :   Specifies an interval (in minutes) of wait time between snapshots. Accepts positive integers only.

              Also supports `num M` syntax.
        * `num HOUR` or `num HOURS`
          :   Specifies an interval (in hours) of wait time between backups. Accepts positive integers only.

              Also supports `num H` syntax.

        To avoid ambiguity, a *base interval time* is set in the following circumstances:

        * When the object is created (using CREATE BACKUP SET … WITH BACKUP POLICY).
        * When a different interval is set (using ALTER BACKUP SET … APPLY BACKUP POLICY or
          ALTER BACKUP POLICY … SET SCHEDULE).

        The base interval time starts the interval counter from the current clock time. For example, if an
        INTERVAL value of `10 MINUTES` is set and the scheduled backup is enabled at 9:03 AM, then the next backup
        is created at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to ensure absolute
        precision, but only guarantee that a backup does not execute before the set interval occurs
        (that is, in the current example, the backup could first run at 9:14 AM, but will definitely not run
        at 9:12 AM).

    `EXPIRE_AFTER_DAYS = days_integer`
    :   > Specifies the number of days until the snapshot expires. Snowflake automatically deletes expired snapshots.
        > If this parameter isn’t specified, snapshots remain in the snapshot set until they are manually deleted from the set.
        >
        > * Minimum value: `1`
        > * Maximum value: `3653` (roughly 10 years) if you don’t specify the `SCHEDULE` clause.
        >
        > > **Note:**
        > >
        > > If the policy has a retention lock, you can increase the EXPIRE_AFTER_DAYS value, but you can’t decrease that value.
        > >
        > > Every policy must include a SCHEDULE clause, an EXPIRE_AFTER_DAYS clause, or both.

        `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
        :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

            The tag value is always a string, and the maximum number of characters for the tag value is 256.

            For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET...`
:   Unset one of the following properties for the snapshot policy. The property reverts to its default value.

    * COMMENT
    * `TAG tag_name [ , tag_name ... ]`
    * SCHEDULE
    * EXPIRE_AFTER_DAYS

    > **Note:**
    >
    > You can unset the SCHEDULE property, or the EXPIRE_AFTER_DAYS property, but not both.
    > For example, you might keep the EXPIRE_AFTER_DAYS property when you don’t intend to create new snapshots,
    > but you want existing snapshots to expire after a certain time.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| OWNERSHIP | The role used to modify a snapshot policy must have the OWNERSHIP privilege on the snapshot policy. |
| APPLY SNAPSHOT RETENTION LOCK | The role used to modify a snapshot policy with a retention lock must have this privilege on the account. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Regarding metadata:

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Add a comment to snapshot policy `hourly_snapshot_policy`:

```sqlexample
ALTER SNAPSHOT POLICY hourly_snapshot_policy
  SET COMMENT = 'hourly snapshot expires in 90 days';
```

Change schedule for snapshot policy `every_two_hours`:

```sqlexample
ALTER SNAPSHOT POLICY every_two_hours SET SCHEDULE = '120 MINUTE';
```

Revert the EXPIRE_AFTER_DAYS property back to its default value:

```sqlexample
ALTER SNAPSHOT POLICY sample_snapshot_policy UNSET EXPIRE_AFTER_DAYS;
```

---
title: ALTER SNAPSHOT SET — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/alter-snapshot-set.md
section: SQL Commands
---

# ALTER SNAPSHOT SET — *Deprecated*

Modifies the properties for a [snapshot](../../user-guide/backups.md) set.
This operation can be one of the following:

* Taking a new backup that becomes part of the snapshot set.
* Removing an old backup from the snapshot set.
* Suspending or resuming the scheduled backups and scheduled snapshot deletion
  that are specified by the snapshot policy.
* Applying a snapshot policy to a snapshot set that doesn’t already have a policy.
* Adding or removing a legal hold for a specific snapshot within the snapshot set.
* Specifying or removing a comment for the snapshot set.

See also:
:   [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md),
    [DROP SNAPSHOT SET — Deprecated](drop-snapshot-set.md),
    [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md)

## Syntax

```sqlsyntax
ALTER SNAPSHOT SET <name> ADD SNAPSHOT

ALTER SNAPSHOT SET <name> APPLY SNAPSHOT POLICY <policy_name> [ FORCE ]

ALTER SNAPSHOT SET <name> SUSPEND SNAPSHOT [ { CREATION | EXPIRATION } ] POLICY

ALTER SNAPSHOT SET <name> RESUME SNAPSHOT [ { CREATION | EXPIRATION } ] POLICY

ALTER SNAPSHOT SET <name> DELETE SNAPSHOT IDENTIFIER '<snapshot_id>'

ALTER SNAPSHOT SET <name> MODIFY SNAPSHOT IDENTIFIER '<snapshot_id>' { ADD | REMOVE } LEGAL HOLD

ALTER SNAPSHOT SET <name> SET COMMENT = '<string_literal>'

ALTER SNAPSHOT SET <name> UNSET COMMENT

ALTER SNAPSHOT SET <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SNAPSHOT SET <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the snapshot set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ADD SNAPSHOT`
:   Manually create a snapshot in the set. If the snapshot policy doesn’t include a schedule for
    taking new backups, this is how you make a new backup of the table, schema, or database that’s
    included in the snapshot set. You can also make new backups in the snapshot set at any time even
    when backups happen on a regular schedule.

`APPLY SNAPSHOT POLICY policy_name [ FORCE ]`
:   Specifies the snapshot policy to attach to the snapshot set.

    The FORCE option overwrites an existing policy on a snapshot set. You can only use this option if the old
    policy doesn’t have a retention lock.

    > **Important:**
    >
    > Applying a snapshot policy with a retention lock to a snapshot set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a snapshot set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a snapshot set with a long expiration period, to avoid unexpected storage charges
    > for undeletable snapshot sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all snapshots, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`SUSPEND SNAPSHOT [ { CREATION | EXPIRATION } ] POLICY`
:   Suspend a snapshot policy in the snapshot set.
    You can suspend the entire snapshot policy, or only creation or expiration operations.
    When you specify SUSPEND SNAPSHOT POLICY without the CREATION or EXPIRATION keywords, Snowflake
    suspends both the creation and expiration aspects of the policy.
    For more information, see [Suspend a backup policy on a backup set](../../user-guide/backups.md).

`RESUME SNAPSHOT [ { CREATION | EXPIRATION } ] POLICY`
:   Resume a suspended snapshot policy in the set.
    You can resume the entire snapshot policy, or only creation or expiration operations.
    When you specify RESUME SNAPSHOT POLICY without the CREATION or EXPIRATION keywords, Snowflake
    resumes both the creation and expiration aspects of the policy.
    For more information, see [Resume a backup policy on a backup set](../../user-guide/backups.md).

`DELETE SNAPSHOT IDENTIFIER 'snapshot_id'`
:   Delete a snapshot in the snapshot set by ID.
    The snapshot ID is a UUID value, in the format returned by
    the [UUID_STRING](../functions/uuid_string.md) function.
    Snowflake only allows deleting the oldest snapshot from the snapshot set.
    For more information, see [Delete a backup from a backup set](../../user-guide/backups.md).

`MODIFY SNAPSHOT IDENTIFIER 'snapshot_id' { ADD | REMOVE } LEGAL HOLD`
:   Adds or removes a legal hold from a specified snapshot within the snapshot set.
    For more information about legal holds for WORM snapshots, see [Legal hold](../../user-guide/backups.md).
    For examples of using this clause, see [Add and remove legal holds](../../user-guide/backups.md).

`SET COMMENT = 'string_literal'`
:   Associate a comment with the snapshot set.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the snapshot set, which resets them to the defaults:

    * `property_name`
    * `param_name`

      + `COMMENT`
      + `TAG tag_name [ , tag_name ... ]`

    You can reset multiple properties/parameters with a single ALTER statement; however, each
    property/parameter must be separated by a comma. Also, when resetting a
    property/parameter, you only specify the name; no value is required.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Description |
| --- | --- |
| OWNERSHIP | The role used to modify a snapshot set must have the OWNERSHIP privilege on the snapshot set. |
| APPLY SNAPSHOT RETENTION LOCK | If the snapshot policy applied to a snapshot set includes a retention lock, the role used to apply the policy must have this privilege on the account. |
| APPLY LEGAL HOLD | This account privilege grants the ability to add or remove a legal hold from a snapshot. This privilege is only needed for the ADD LEGAL HOLD and REMOVE LEGAL HOLD clauses. By default, the ACCOUNTADMIN role has this privilege. |
| APPLY | Only a user with this privilege on the snapshot policy can use the ALTER SNAPSHOT SET command with the APPLY SNAPSHOT POLICY clause to add the snapshot policy to a snapshot set that already exists. |

These privileges are required on the currently active primary role, not a secondary role.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the snapshot policy has a retention lock applied to it, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Examples

Manually add a snapshot to snapshot set `t1_snapshots`:

```sqlexample
ALTER SNAPSHOT SET t1_snapshots
  ADD SNAPSHOT;
```

Update the snapshot policy for snapshot set `t1_snapshots`:

```sqlexample
ALTER SNAPSHOT SET t1_snapshots
  APPLY SNAPSHOT POLICY daily_snapshot_policy;
```

Suspend a snapshot policy on the snapshot set `t1_snapshot`:

```sqlexample
ALTER SNAPSHOT SET t1_snapshots
  SUSPEND SNAPSHOT POLICY;
```

Resume a snapshot policy on the snapshot set `t1_snapshots`:

```sqlexample
ALTER SNAPSHOT SET t1_snapshots
  RESUME SNAPSHOT POLICY;
```

To find the snapshot identifier to use with the ADD LEGAL HOLD
and REMOVE LEGAL HOLD clauses, you typically use the SHOW SNAPSHOTS
command to list the eligible snapshots and their creation times.
The following example shows how you might list the appropriate
snapshots, add a legal hold to one specific snapshot, and later
remove that legal hold. Substitute your own role name, snapshot set name,
and snapshot identifier.

```sqlexample
USE ROLE my_legal_hold_role; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW SNAPSHOTS IN SNAPSHOT SET my_db_snapshot_set
  ->> SELECT "created_on", "snapshot_id" FROM $1 WHERE "is_under_legal_hold" = 'N';
ALTER SNAPSHOT SET my_db_snapshot_set
  MODIFY SNAPSHOT IDENTIFIER '790d1ee4-88b2-451f-9ccc-eacd1e93a134'
  ADD LEGAL HOLD;

USE ROLE my_legal_hold_role; -- use a role that has the APPLY LEGAL HOLD privilege
SHOW SNAPSHOTS IN SNAPSHOT SET my_db_snapshot_set
  ->> SELECT "created_on", "snapshot_id" FROM $1 WHERE "is_under_legal_hold" = 'Y';
ALTER SNAPSHOT SET my_db_snapshot_set
  MODIFY SNAPSHOT IDENTIFIER '790d1ee4-88b2-451f-9ccc-eacd1e93a134'
  REMOVE LEGAL HOLD;
```

---
title: ALTER STAGE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-stage.md
section: SQL Commands
---

# ALTER STAGE

Modifies the properties for an existing named internal or external stage.

See also:
:   [CREATE STAGE](create-stage.md) , [DROP STAGE](drop-stage.md) , [SHOW STAGES](show-stages.md) , [DESCRIBE STAGE](desc-stage.md)

## Syntax

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER STAGE [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER STAGE <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER STAGE [ IF EXISTS ] <name> UNSET DCM PROJECT

-- Internal stage
ALTER STAGE [ IF EXISTS ] <name> SET
  [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
  { [ COMMENT = '<string_literal>' ] }

-- External stage
ALTER STAGE [ IF EXISTS ] <name> SET {
    [ externalStageParams ]
    [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
    [ COMMENT = '<string_literal>' ]
    }
```

Where:

> ```sqlsyntax
> externalStageParams (for Amazon S3) ::=
>   URL = '<protocol>://<bucket>[/<path>/]'
>   [ AWS_ACCESS_POINT_ARN = '<string>' ]
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( {  { AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' [ AWS_TOKEN = '<string>' ] } | AWS_ROLE = '<string>'  } ) } ]
>   [ ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] MASTER_KEY = '<string>'
>                    | TYPE = 'AWS_SSE_S3'
>                    | TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ]
>                    | TYPE = 'NONE' ) ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> externalStageParams (for Google Cloud Storage) ::=
>   [ URL = 'gcs://<bucket>[/<path>/]' ]
>   [ STORAGE_INTEGRATION = <integration_name> } ]
>   [ ENCRYPTION = (   TYPE = 'GCS_SSE_KMS' [ KMS_KEY_ID = '<string>' ]
>                    | TYPE = 'NONE' ) ]
> ```
>
> ```sqlsyntax
> externalStageParams (for Microsoft Azure) ::=
>   [ URL = 'azure://<account>.blob.core.windows.net/<container>[/<path>/]' ]
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( [ AZURE_SAS_TOKEN = '<string>' ] ) } ]
>   [ ENCRYPTION = (   TYPE = 'AZURE_CSE' [ MASTER_KEY = '<string>' ]
>                    | TYPE = 'NONE' ) ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> formatTypeOptions ::=
> -- If TYPE = CSV
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      MULTI_LINE = TRUE | FALSE
>      FILE_EXTENSION = '<string>'
>      PARSE_HEADER = TRUE | FALSE
>      SKIP_HEADER = <integer>
>      SKIP_BLANK_LINES = TRUE | FALSE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      ESCAPE = '<character>' | NONE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      TRIM_SPACE = TRUE | FALSE
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
>      ENCODING = '<string>' | UTF8
> -- If TYPE = JSON
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      TRIM_SPACE = TRUE | FALSE
>      MULTI_LINE = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      FILE_EXTENSION = '<string>'
>      ENABLE_OCTAL = TRUE | FALSE
>      ALLOW_DUPLICATE = TRUE | FALSE
>      STRIP_OUTER_ARRAY = TRUE | FALSE
>      STRIP_NULL_VALUES = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> -- If TYPE = AVRO
>      COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = ORC
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = PARQUET
>      COMPRESSION = AUTO | LZO | SNAPPY | NONE
>      SNAPPY_COMPRESSION = TRUE | FALSE
>      BINARY_AS_TEXT = TRUE | FALSE
>      USE_LOGICAL_TYPE = TRUE | FALSE
>      TRIM_SPACE = TRUE | FALSE
>      USE_VECTORIZED_SCANNER = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = XML
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      PRESERVE_SPACE = TRUE | FALSE
>      STRIP_OUTER_ELEMENT = TRUE | FALSE
>      DISABLE_AUTO_CONVERT = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> ```

## Directory table syntax

```sqlsyntax
ALTER STAGE [ IF EXISTS ] <name> SET DIRECTORY = ( { ENABLE = TRUE | FALSE } )

ALTER STAGE [ IF EXISTS ] <name> REFRESH [ SUBPATH = '<relative-path>' ]
```

## Parameters

`name`
:   Specifies the identifier for the stage to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_name`
:   Specifies the new identifier for the stage; must be unique for the schema.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`SET ...`
:   Specifies the options/properties to set for the stage:

    `URL = ' ... '` , . `STORAGE_INTEGRATION = ...` , . `CREDENTIALS = ( ... )` , . `ENCRYPTION = ( ... )`
    :   Modifies the cloud-specific URL, storage integration or credentials, and/or encryption for the external stage. For more details, see
        External Stage Parameters (in this topic).

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the stage.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the stage from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the stage and the DCM project without dropping the stage. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

> `FILE_FORMAT = ( FORMAT_NAME = 'file_format_name' )` or . `FILE_FORMAT = ( TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM [ ... ] )`
> :   Modifies the file format for the stage, which can be either:
>
>     `FORMAT_NAME = file_format_name`
>     :   Specifies an existing file format object to use for the stage. The specified file format object determines the format type (CSV, JSON, etc.)
>         and other format options for data files.
>
>         Note that no additional format options are specified in the string. Instead, the named file format object defines the other file format
>         options used for loading/unloading data. For more information, see [CREATE FILE FORMAT](create-file-format.md).
>
>     `TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM [ ... ]`
>     :   Specifies the file format type for the stage:
>
>         > * Loading data from a stage (using [COPY INTO <table>](copy-into-table.md)) accommodates all of the supported file format types.
>         > * Unloading data into a stage (using [COPY INTO <location>](copy-into-location.md)) accommodates CSV, JSON, or PARQUET.
>
>         If a file format type is specified, additional format-specific options can be modified. For more details, see
>         Format Type Options (in this topic).
>
>         The `CUSTOM` format type specifies that the underlying stage holds unstructured data and can only be used with the `FILE_PROCESSOR` copy option.
>
>     > **Note:**
>     >
>     > `FORMAT_NAME` and `TYPE` are mutually exclusive; you can only specify one or the other for a stage.

> **Note:**
>
> Do not specify copy options using the CREATE STAGE, ALTER STAGE, CREATE TABLE, or ALTER TABLE commands. We recommend that you use the [COPY INTO <table>](copy-into-table.md) command to specify copy options.

## External stage parameters (`externalStageParams`)

`URL = 'cloud_specific_url'`
:   If a stage does not have a URL, it is an internal stage

    > **Warning:**
    >
    > Modifying the `URL` parameter of a stage can break the following functionality for objects that rely on the stage:
    >
    > * Pipe objects that leverage cloud messaging to trigger data loads (i.e. where `AUTO_INGEST = TRUE`).
    > * External tables that leverage cloud messaging to trigger metadata refreshes (i.e. where `AUTO_REFRESH = TRUE`).

    **Amazon S3**

    > `URL = 'protocol://bucket[/path/]'`
    > :   Modifies the URL for the external location (existing S3 bucket) used to store data files for loading/unloading, where:
    >
    >     * `protocol` is one of the following:
    >
    >       + `s3` refers to S3 storage in public AWS regions outside of China.
    >       + `s3china` refers to S3 storage in public AWS regions in China.
    >       + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    >
    >       Accessing cloud storage in a [government region](../../user-guide/intro-regions.md) using a storage integration is limited to Snowflake
    >       accounts hosted in the same government region.
    >
    >       Similarly, if you need to access cloud storage in a region in China, you can use a storage integration only from a Snowflake
    >       account hosted in the same region in China.
    >
    >       In these cases, use the CREDENTIALS parameter in the [CREATE STAGE](create-stage.md) command (rather than using a storage
    >       integration) to provide the credentials for authentication.
    >     * `bucket` is the name of the S3 bucket or the [bucket-style alias](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-alias.html)
    >       for an S3 bucket access point. For an S3 access point, you must also specify a value for the
    >       `AWS_ACCESS_POINT_ARN` parameter.
    >     * `path` is an optional case-sensitive path for files in the cloud storage location (files have names that begin with
    >       a common string) that limits the set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >       services.
    >
    > `AWS_ACCESS_POINT_ARN = 'string'`
    > :   Specifies the Amazon resource name (ARN) for your S3 access point. Required only when you specify an S3 access point alias
    >     for your storage `URL`.

    **Google Cloud Storage**

    > `URL = 'gcs://bucket[/path/]'`
    > :   Modifies the URL for the external location (existing GCS bucket) used to store data files for loading/unloading, where:
    >
    >     * `bucket` is the name of the GCS bucket.
    >     * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
    >       common string) that limits the set of files to load. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >       services.

    **Microsoft Azure**

    > `URL = 'azure://account.blob.core.windows.net/container[/path/]'`
    > :   Modifies the URL for the external location (existing Azure container) used to store data files for loading, where:
    >
    >     > * `account` is the name of the Azure account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint for all
    >     >   supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
    >     > * `container` is the name of the Azure container (e.g. `mycontainer`).
    >     > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
    >     >   common string) that limits the set of files to load. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >     >   services.

`STORAGE_INTEGRATION = integration_name` or . `CREDENTIALS = ( cloud_specific_credentials )`
:   Required only if the Amazon S3, Google Cloud Storage, or Microsoft Azure is private; not required for public buckets/containers

    **Amazon S3**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using the CREDENTIALS
    >     > parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' [ AWS_TOKEN = 'string' ] )` or . `CREDENTIALS = ( AWS_ROLE = 'string' )`
    > :   Modifies the security credentials for connecting to AWS and accessing the private S3 bucket where the files to load/unload are staged. For
    >     more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
    >
    >     The credentials you specify depend on whether you associated the Snowflake access permissions for the bucket with an AWS IAM
    >     (Identity & Access Management) user or role:
    >
    >     * **IAM user:** IAM credentials are required. Temporary (aka “scoped”) credentials are generated by AWS Security Token Service (STS) and
    >       consist of three components:
    >
    >       > + `AWS_KEY_ID`
    >       > + `AWS_SECRET_KEY`
    >       > + `AWS_TOKEN`
    >
    >       All three are required to access a private bucket. After a designated period of time, temporary credentials expire and can no
    >       longer be used. You must then generate a new set of valid temporary credentials.
    >
    >       > **Important:**
    >       >
    >       > The COPY command also allows permanent (aka “long-term”) credentials to be used; however, for security reasons, Snowflake does not
    >       > recommend using them. If you must use permanent credentials, Snowflake recommends periodically generating new permanent credentials for
    >       > external stages.
    >     * **IAM role:** Omit the security credentials and access keys and, instead, identify the role using `AWS_ROLE` and specify the AWS
    >       role ARN (Amazon Resource Name).
    >
    >       > **Important:**
    >       >
    >       > The ability to use an AWS IAM role to access a private S3 bucket to load or unload data is now deprecated (i.e. support will be removed
    >       > in a future release, TBD). We highly recommend modifying any existing S3 stages that use this feature to instead reference storage
    >       > integration objects. For instructions, see [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md).

    **Google Cloud Storage**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).

    **Microsoft Azure**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using the CREDENTIALS
    >     > parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AZURE_SAS_TOKEN = 'string' )`
    > :   Modifies the SAS (shared access signature) token for connecting to Azure and accessing the private container where the files containing
    >     loaded data are staged. Credentials are generated by Azure.

`ENCRYPTION = ( cloud_specific_encryption )`
:   Required only for loading from/unloading into encrypted files; not required if storage location and files are unencrypted

    Data loading:
    :   Modifies the encryption settings used to decrypt encrypted files in the storage location and extract data.

    Data unloading:
    :   Modifies the encryption settings used to encrypt files unloaded to the storage location.

    **Amazon S3**

    > `ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] MASTER_KEY = 'string' | TYPE = 'AWS_SSE_S3' | TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = 'string' ] | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AWS_CSE`: Client-side encryption (requires a `MASTER_KEY` value). Currently, the client-side
    > >       [master key](https://csrc.nist.gov/glossary/term/master_key) you provide can only be a symmetric key. Note that, when a
    > >       `MASTER_KEY` value is provided, Snowflake assumes `TYPE = AWS_CSE` (i.e. when a `MASTER_KEY` value is
    > >       provided, `TYPE` is not required).
    > >     * `AWS_SSE_S3`: Server-side encryption that requires no additional encryption settings.
    > >     * `AWS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >     For more information about the encryption types, see the AWS documentation for
    > >     [client-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html)
    > >     or [server-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html).
    > >
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to `AWS_CSE` encryption only)
    > > :   Specifies the client-side master key used to encrypt the files in the bucket. The master key must be a 128-bit or 256-bit key in
    > >     Base64-encoded form.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files unloaded into the bucket. If no value is provided,
    > >     your default KMS key ID is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading.
    > >
    > > Default: `NONE`

    **Google Cloud Storage**

    > `ENCRYPTION = ( TYPE = 'GCS_SSE_KMS' [ KMS_KEY_ID = 'string' ] | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `GCS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >       For more information, see the Google Cloud documentation:
    > >
    > >       + <https://cloud.google.com/storage/docs/encryption/customer-managed-keys>
    > >       + <https://cloud.google.com/storage/docs/encryption/using-customer-managed-keys>
    > >     * `NONE`: No encryption.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the Cloud KMS-managed key that is used to encrypt files unloaded into the bucket. If no value
    > >     is provided, your default KMS key ID set on the bucket is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading. The load operation should succeed if the service account has sufficient permissions
    > >     to decrypt data in the bucket.
    > >
    > > Default: `NONE`

    **Microsoft Azure**

    > `ENCRYPTION = ( TYPE = 'AZURE_CSE' MASTER_KEY = 'string' | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AZURE_CSE`: Client-side encryption (requires a MASTER_KEY value). For information, see the
    > >       [Client-side encryption information](https://docs.microsoft.com/en-us/azure/storage/common/storage-client-side-encryption) in
    > >       the Microsoft Azure documentation.
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to AZURE_CSE encryption only)
    > > :   Specifies the client-side master key used to encrypt or decrypt files. The master key must be a 128-bit or 256-bit key in Base64-encoded
    > >     form.
    > >
    > > Default: `NONE`

`USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
:   Specifies whether to use [private connectivity](../../user-guide/private-connectivity-outbound.md) for an external stage to harden your
    security posture.

    If the external stage uses a storage integration, and that integration is configured for private connectivity, set this parameter to
    FALSE.

    For information about using this parameter, see one of the following:

    * [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md).
    * [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

## Directory table parameters

`ENABLE = TRUE | FALSE`
:   Specifies whether to add a directory table to the stage. When the value is TRUE, a directory table is added to the stage.

    > **Note:**
    >
    > Setting this parameter to TRUE is not supported for [S3-compatible external stages](../../user-guide/data-load-s3-compatible-storage.md). The metadata for S3-compatible external stages cannot be refreshed automatically.

    Default: `FALSE`

`REFRESH`
:   Accesses the staged data files referenced in the directory table definition and updates the table metadata:

    * New files in the path are added to the table metadata.
    * Changes to files in the path are updated in the table metadata.
    * Files no longer in the path are removed from the table metadata.

    You can execute this command each time files are added to the stage, updated, or dropped. This step synchronizes
    the metadata with the latest set of associated files in the stage definition for the directory table.

`SUBPATH = 'relative-path'`
:   Optionally specify a relative path to refresh the metadata for a specific subset of the data files.

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`FILE_FORMAT = ( TYPE = ... )`), you can include one or more of the following format-specific options (separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified when loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate records in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   Data loading:
        :   New line character. Note that “new line” is logical such that `\r\n` will be understood as a new line for files on a Windows platform.

        Data unloading:
        :   New line character (`\n`).

`FIELD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate fields in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

            > > **Note:**
            > >
            > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.csv[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

    > **Note:**
    >
    > If the `SINGLE` copy option is `TRUE`, then the COPY command unloads a file without a file extension by default. To specify a file extension, provide a file name and extension in the
    > `internal_location` or `external_location` path (For example, `copy into @stage/data.csv`).

`PARSE_HEADER = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to use the first row headers in the data files to determine column names.

    This file format option is applied to the following actions only:

    > * Automatically detecting column definitions by using the INFER_SCHEMA function.
    > * Loading CSV data into separate columns by using the INFER_SCHEMA function and MATCH_BY_COLUMN_NAME copy option.

    If the option is set to TRUE, the first row headers will be used to determine column names. The default value FALSE will return column names as c\*, where \* is the position of the column.

    > **Note:**
    >
    > * This option isn’t supported for external tables.
    > * The SKIP_HEADER option isn’t supported if you set `PARSE_HEADER = TRUE`.

    Default:
    :   `FALSE`

`SKIP_HEADER = integer`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to load.

    Default:
    :   `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default:
    :   `FALSE`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of date values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) (data loading) or [DATE_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of time values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) (data loading) or [TIME_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of timestamp values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) (data loading) or [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the encoding format for binary input or output. The option can be used when loading data into or unloading data from binary columns in a table.

    Default:
    :   `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for enclosed or unenclosed field values. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for enclosed fields only. Specify the character used to enclose fields by setting `FIELD_OPTIONALLY_ENCLOSED_BY`.

        > **Note:**
        >
        > This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        > as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        > the option value.
        >
        > In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        > option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If this option is set, it overrides the escape character set for `ESCAPE_UNENCLOSED_FIELD`.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for unenclosed fields only.

        > **Note:**
        >
        > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
        >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, the load operation treats
        >   this row and the next row as a single row of data. To avoid this issue, set the value to `NONE`.
        > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        >   the option value.
        >
        >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If `ESCAPE` is set, the escape character set for that file format option overrides this option.

    Default:
    :   backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove white space from fields.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        As another example, if leading or trailing spaces surround quotes that enclose strings, you can remove the surrounding spaces using this option and the quote character using the
        `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any spaces within the quotes are preserved. For example, assuming `FIELD_DELIMITER = '|'` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

        ```sqlexample
        |"Hello world"|    /* loads as */  >Hello world<
        |" Hello world "|  /* loads as */  > Hello world <
        | "Hello world" |  /* loads as */  >Hello world<
        ```

        (the brackets in this example are not loaded; they are used to demarcate the beginning and end of the loaded strings)

    Default:
    :   `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex representation (`0x27`) or the double single-quoted escape (`''`).

        Data unloading only:
        :   When a field in the source table contains this character, Snowflake escapes it using the same character for unloading. For example, if the value is the double quote character and a field contains the string `A "B" C`, Snowflake escapes the double quotes for unloading as follows:

            `A ""B"" C`

    Default:
    :   `NONE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   String used to convert to and from SQL NULL:

        * When loading data, Snowflake replaces these values in the data load source with SQL NULL. To specify more than one string, enclose
          the list of strings in parentheses and use commas to separate each value.

          Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as
          a value, all instances of `2` as either a string or number are converted.

          For example:

          `NULL_IF = ('\N', 'NULL', 'NUL', '')`

          Note that this option can include empty strings.
        * When unloading data, Snowflake converts SQL NULL values to the first value in the list.

    Default:
    :   `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\`)

`ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to generate a parsing error if the number of delimited columns (i.e. fields) in an input file does not match the number of columns in the corresponding table.

        If set to `FALSE`, an error is not generated and the load continues. If the file is successfully loaded:

        * If the input file contains records with more fields than columns in the table, the matching fields are loaded in order of occurrence in the file and the remaining fields are not loaded.
        * If the input file contains records with fewer fields than columns in the table, the non-matching columns in the table are loaded with NULL values.

        This option assumes all the records within the input file are the same length (i.e. a file containing records of varying length return an error regardless of the value specified for this parameter).

    Default:
    :   `TRUE`

    > **Note:**
    >
    > When [transforming data during loading](../../user-guide/data-load-transform.md) (i.e. using a query as the source for the COPY command), this option is ignored. There is no requirement for your data files to have
    > the same number and ordering of columns as your target table.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`).

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies whether to insert SQL NULL for empty fields in an input file, which are represented by two successive delimiters (For example, `,,`).

          If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is inserted into columns of type STRING. For other column types, the COPY command produces an error.
        * When unloading data, this option is used in combination with `FIELD_OPTIONALLY_ENCLOSED_BY`. When `FIELD_OPTIONALLY_ENCLOSED_BY = NONE`, setting `EMPTY_FIELD_AS_NULL = FALSE` specifies to unload empty strings in tables to empty string values without quotes enclosing the field values.

          If set to `TRUE`, `FIELD_OPTIONALLY_ENCLOSED_BY` must specify a character to enclose strings.

    Default:
    :   `TRUE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

`ENCODING = 'string'`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String (constant) that specifies the character set of the source data when loading data into a table.

        | Character Set | `ENCODING` Value | Supported Languages | Notes |
        | --- | --- | --- | --- |
        | Big5 | `BIG5` | Traditional Chinese |  |
        | EUC-JP | `EUCJP` | Japanese |  |
        | EUC-KR | `EUCKR` | Korean |  |
        | GB18030 | `GB18030` | Chinese |  |
        | IBM420 | `IBM420` | Arabic |  |
        | IBM424 | `IBM424` | Hebrew |  |
        | IBM949 | `IBM949` | Korean |  |
        | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
        | ISO-2022-JP | `ISO2022JP` | Japanese |  |
        | ISO-2022-KR | `ISO2022KR` | Korean |  |
        | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
        | ISO-8859-5 | `ISO88595` | Russian |  |
        | ISO-8859-6 | `ISO88596` | Arabic |  |
        | ISO-8859-7 | `ISO88597` | Greek |  |
        | ISO-8859-8 | `ISO88598` | Hebrew |  |
        | ISO-8859-9 | `ISO88599` | Turkish |  |
        | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
        | KOI8-R | `KOI8R` | Russian |  |
        | Shift_JIS | `SHIFTJIS` | Japanese |  |
        | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
        | UTF-16 | `UTF16` | All languages |  |
        | UTF-16BE | `UTF16BE` | All languages |  |
        | UTF-16LE | `UTF16LE` | All languages |  |
        | UTF-32 | `UTF32` | All languages |  |
        | UTF-32BE | `UTF32BE` | All languages |  |
        | UTF-32LE | `UTF32LE` | All languages |  |
        | windows-874 | `WINDOWS874` | Thai |  |
        | windows-949 | `WINDOWS949` | Korean |  |
        | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
        | windows-1251 | `WINDOWS1251` | Russian |  |
        | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | windows-1253 | `WINDOWS1253` | Greek |  |
        | windows-1254 | `WINDOWS1254` | Turkish |  |
        | windows-1255 | `WINDOWS1255` | Hebrew |  |
        | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default:
    :   `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8 before it is loaded into Snowflake.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of date string values in the data files. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of time string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of timestamp string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the encoding format for binary string values in the data files. The option can be used when loading data into binary columns in a table.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `HEX`

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`MULTI_LINE = TRUE | FALSE`
:   Use: Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default:
    :   `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.json[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

`ENABLE_OCTAL = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that enables parsing of octal numbers.

    Default:
    :   `FALSE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to allow duplicate object field names (only the last one will be preserved).

    Default:
    :   `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove outer brackets (i.e. `[ ]`).

    Default:
    :   `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

        | Before | After |
        | --- | --- |
        | `[null]` | `[]` |
        | `[null,null,3]` | `[,,3]` |
        | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
        | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`.

> **Note:**
>
> We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | LZO | SNAPPY | NONE`
:   Use:
    :   Data unloading and external tables

    Definition:

    * When unloading data, specifies the compression algorith for columns in the Parquet files.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). . When unloading data, unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `LZO` | When unloading data, files are compressed using the Snappy algorithm by default. If unloading data to LZO-compressed files, specify this value. |
        | `SNAPPY` | When unloading data, files are compressed using the Snappy algorithm by default. You can optionally specify this value. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`SNAPPY_COMPRESSION = TRUE | FALSE`
:   Use:
    :   Data unloading only

        | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | Unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `SNAPPY` | May be specified if unloading Snappy-compressed files. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Definition:
    :   Boolean that specifies whether unloaded file(s) are compressed using the SNAPPY algorithm.

    > **Note:**
    >
    > Deprecated. Use `COMPRESSION = SNAPPY` instead.

    Limitations:
    :   Only supported for data unloading operations.

    Default:
    :   `TRUE`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`USE_LOGICAL_TYPE = TRUE | FALSE`
:   Use:
    :   Data loading, data querying in staged files, and schema detection.

    Definition:
    :   Boolean that specifies whether to use Parquet logical types. With this file format option, Snowflake can interpret Parquet logical types during data loading. For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). To enable Parquet logical types, set USE_LOGICAL_TYPE as TRUE when you create a new file format option.

    Limitations:
    :   Not supported for data unloading.

`USE_VECTORIZED_SCANNER = TRUE | FALSE`
:   Use:
    :   Data loading and data querying in staged files

    Definition:
    :   Boolean that specifies whether to use a vectorized scanner for loading Parquet files.

    Default:
    :   `FALSE`. In a future BCR, the default value will be `TRUE`.

    Using the vectorized scanner can significantly reduce the latency for loading Parquet files, because this scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

    If `USE_VECTORIZED_SCANNER` is set to `TRUE`, the vectorized scanner has the following behaviors:

    > * The `BINARY_AS_TEXT` option is always treated as `FALSE` and the `USE_LOGICAL_TYPE` option is always treated as `TRUE`, no matter what the actual value is being set to.
    > * The vectorized scanner supports Parquet map types. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >   {
    >   >    "k1": "v1",
    >   >    "k2": "v2"
    >   >   }
    >   > ```
    > * The vectorized scanner shows `NULL` values in the output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "nickname": null,
    >   >   "age": 34,
    >   >   "phone_numbers":
    >   >   [
    >   >     "1234567890",
    >   >     "0987654321",
    >   >     null,
    >   >     "6781234590"
    >   >   ]
    >   >   }
    >   > ```
    > * The vectorized scanner handles Time and Timestamp as follows:
    >
    >   > | Parquet | Snowflake vectorized scanner |
    >   > | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS/NANOS) | TIME |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_LTZ |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_NTZ |
    >   > | INT96 | TIMESTAMP_LTZ |

    If `USE_VECTORIZED_SCANNER` is set to `FALSE`, the scanner has the following behaviors:

    > * This option does not support Parquet maps. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >  {
    >   >   "key_value":
    >   >   [
    >   >    {
    >   >           "key": "k1",
    >   >           "value": "v1"
    >   >       },
    >   >       {
    >   >           "key": "k2",
    >   >           "value": "v2"
    >   >       }
    >   >     ]
    >   >   }
    >   > ```
    > * This option does not explicitly show `NULL` values in the scan output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "age": 34
    >   >   "phone_numbers":
    >   >   [
    >   >    "1234567890",
    >   >    "0987654321",
    >   >    "6781234590"
    >   >   ]
    >   >  }
    >   > ```
    > * This option handles Time and Timestamp as follows:
    >
    >   > | Parquet | When USE_LOGICAL_TYPE = TRUE | When USE_LOGICAL_TYPE = FALSE |
    >   > | --- | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS) | TIME | + TIME (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=NANOS) | TIME | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS) | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
    >   > | TimestampType(isAdjustedToUtc=True, unit=NANOS) | TIMESTAMP_LTZ | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS) | TIMESTAMP_NTZ | + TIMESTAMP_LTZ (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimestampType(isAdjustedToUtc=False, unit=NANOS) | TIMESTAMP_NTZ | INTEGER |
    >   > | INT96 | TIMESTAMP_NTZ | TIMESTAMP_NTZ |

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = XML

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`PRESERVE_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser preserves leading and trailing spaces in element content.

    Default:
    :   `FALSE`

`STRIP_OUTER_ELEMENT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser strips out the outer XML element, exposing 2nd level elements as separate documents.

    Default:
    :   `FALSE`

`DISABLE_AUTO_CONVERT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser disables automatic conversion of numeric and Boolean values from text to native representation.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Stage | Required to alter the stage properties and to enable or disable a directory table on the stage using ALTER STAGE … SET DIRECTORY.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| WRITE | Stage | Required to refresh the metadata using ALTER STAGE … REFRESH. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For external stages that use an S3 access point:

  + If you’re using a storage integration, you must configure the IAM policy for the integration
    to grant permission to your S3 access point. For more information, see [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md).
  + Multi-region access points aren’t supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename `my_int_stage` to `new_int_stage`:

> ```sqlexample
> ALTER STAGE my_int_stage RENAME TO new_int_stage;
> ```

Alter `my_ext_stage` (created in the [CREATE STAGE](create-stage.md) examples) to change the URL to reference a sub-folder named `new` in the
`files` folder. If a [COPY INTO <table>](copy-into-table.md) command that references this stage encounters a data error on any of the
records, it skips the file. All other copy options are set to the default values.

If the S3 bucket is in a region in China, use the `s3china://` protocol for the URL parameter.

> ```sqlexample
> ALTER STAGE my_ext_stage
> SET URL='s3://loading/files/new/'
> COPY_OPTIONS = (ON_ERROR='skip_file');
> ```

Alter `my_ext_stage` to replace the supplied credentials with a reference to a storage integration named `myint` :

> ```sqlexample
> ALTER STAGE my_ext_stage SET STORAGE_INTEGRATION = myint;
> ```

Alter `my_ext_stage` to specify a new access key ID and secret access key for the stage:

> ```sqlexample
> ALTER STAGE my_ext_stage SET CREDENTIALS=(AWS_KEY_ID='d4c3b2a1' AWS_SECRET_KEY='z9y8x7w6');
> ```
>
> (the credentials values used in the above example are for illustration purposes only)

Alter `my_ext_stage3` to change the encryption type to `AWS_SSE_S3` server-side encryption for the stage:

> ```sqlexample
> ALTER STAGE my_ext_stage3 SET ENCRYPTION=(TYPE='AWS_SSE_S3');
> ```

## Directory table examples

Add a directory table to an existing stage named `mystage`:

```sqlexample
ALTER STAGE mystage SET DIRECTORY = ( ENABLE = TRUE );
```

Manually refresh the directory table metadata in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH;

+-------------------------+----------------+-------------------------------+
| file                    | status         | description                   |
|-------------------------+----------------+-------------------------------|
| data/json/myfile.json   | REGISTERED_NEW | File registered successfully. |
+-------------------------+----------------+-------------------------------+
```

Manually refresh the directory table metadata for the files in the `data` path in a stage named `mystage`:

```sqlexample
ALTER STAGE mystage REFRESH SUBPATH = 'data';
```

---
title: ALTER STORAGE INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/alter-storage-integration.md
section: SQL Commands
---

# ALTER STORAGE INTEGRATION

Modifies the properties for an existing storage integration.

See also:
:   [CREATE STORAGE INTEGRATION](create-storage-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
ALTER [ STORAGE ] INTEGRATION [ IF EXISTS ] <name> SET
  [ cloudProviderParams ]
  [ ENABLED = { TRUE | FALSE } ]
  [ STORAGE_ALLOWED_LOCATIONS = ('<cloud>://<bucket>/<path>/' [ , '<cloud>://<bucket>/<path>/' ... ] ) ]
  [ STORAGE_BLOCKED_LOCATIONS = ('<cloud>://<bucket>/<path>/' [ , '<cloud>://<bucket>/<path>/' ... ] ) ]
  [ COMMENT = '<string_literal>' ]

ALTER [ STORAGE ] INTEGRATION [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER [ STORAGE ] INTEGRATION <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER [ STORAGE ] INTEGRATION [ IF EXISTS ] <name>  UNSET {
                                                          ENABLED                   |
                                                          STORAGE_BLOCKED_LOCATIONS |
                                                          COMMENT
                                                          }
                                                          [ , ... ]
```

Where:

> ```sqlsyntax
> cloudProviderParams (for Amazon S3) ::=
>   STORAGE_AWS_ROLE_ARN = '<iam_role>'
>   [ STORAGE_AWS_OBJECT_ACL = 'bucket-owner-full-control' ]
>   [ STORAGE_AWS_EXTERNAL_ID = '<external_id>' ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Microsoft Azure) ::=
>   AZURE_TENANT_ID = '<tenant_id>'
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```

## Parameters

`name`
:   Identifier for the integration to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies one or more properties/parameters to set for the table (separated by blank spaces, commas, or new lines):

    `ENABLED = { TRUE | FALSE }`
    :   Specifies whether this storage integration is available for usage in stages.

        > * `TRUE` allows users to create new stages that reference this integration. Existing stages that reference this integration function
        >   normally.
        > * `FALSE` prevents users from creating new stages that reference this integration. Existing stages that reference this integration
        >   cannot access the storage location in the stage definition.

    `STORAGE_ALLOWED_LOCATIONS = ( 'cloud_specific_url' )`
    :   Explicitly limits external stages that use the integration to reference one or more storage locations (Amazon S3, Google Cloud Storage, or
        Microsoft Azure). Supports a comma-separated list of URLs for existing buckets and, optionally, paths used to store data files for
        loading/unloading. Alternatively supports the `*` wildcard, meaning “allow access to all buckets and/or paths”.

        **Amazon S3**

        > `STORAGE_ALLOWED_LOCATIONS = ( 'protocol://bucket/path/' [ , 'protocol://bucket/path/' ... ]  )`
        >
        > > * `protocol` is one of the following:
        > >
        > >   + `s3` refers to S3 storage in public AWS regions outside of China.
        > >   + `s3china` refers to S3 storage in public AWS regions in China.
        > >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
        > > * `bucket` is the name of an S3 bucket that stores your data files (e.g. `mybucket`).
        > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
        > >   common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
        > >   services.

        **Google Cloud Storage**

        > `STORAGE_ALLOWED_LOCATIONS = ( 'gcs://bucket/path/' [ , 'gcs://bucket/path/' ... ] )`
        >
        > > * `bucket` is the name of a GCS bucket that stores your data files (e.g. `mybucket`).
        > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
        > >   common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
        > >   services.

        **Microsoft Azure**

        > `STORAGE_ALLOWED_LOCATIONS = ( 'azure://account.blob.core.windows.net/container/path/' [ , 'azure://account.blob.core.windows.net/container/path/' ... ] )`
        >
        > > * `account` is the name of the Azure account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint for all supported
        > >   types of Azure blob storage accounts, including Data Lake Storage Gen2.
        > > * `container` is the name of the Azure container that stores your data files (e.g. `mycontainer`).
        > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
        > >   common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
        > >   services.

    `STORAGE_BLOCKED_LOCATIONS = ( 'cloud_specific_url' )`
    :   Explicitly prohibits external stages that use the integration from referencing one or more storage locations (Amazon S3, Google Cloud Storage,
        Microsoft Azure). Supports a comma-separated list of URLs for existing storage locations and, optionally, paths used to store data files for
        loading/unloading. Commonly used when STORAGE_ALLOWED_LOCATIONS is set to the `*` wildcard, allowing access to all buckets in your account
        except for blocked storage locations and, optionally, paths.

        > **Note:**
        >
        > Make sure to enclose only individual cloud storage location URLs in quotes. If you enclose the entire
        > `STORAGE_BLOCKED_LOCATIONS` value in quotes, the value is invalid. As a result, the `STORAGE_BLOCKED_LOCATIONS` parameter
        > setting is ignored when users create stages that reference the storage integration.

        **Amazon S3**

        > `STORAGE_BLOCKED_LOCATIONS = ( 'protocol://bucket/path/' [ , 'protocol://bucket/path/' ... ]  )`
        >
        > > * `protocol` is one of the following:
        > >
        > >   + `s3` refers to S3 storage in public AWS regions outside of China.
        > >   + `s3china` refers to S3 storage in public AWS regions in China.
        > >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
        > > * `bucket` is the name of an S3 bucket that stores your data files (e.g. `mybucket`).
        > > * `path` is an optional path (or *directory*) in the bucket that further limits access to data files.

        **Google Cloud Storage**

        > `STORAGE_BLOCKED_LOCATIONS = ( 'gcs://bucket/path/' [ , 'gcs://bucket/path/' ... ] )`
        >
        > > * `bucket` is the name of a GCS bucket that stores your data files (e.g. `mybucket`).
        > > * `path` is an optional path (or *directory*) in the bucket that further limits access to data files.

        **Microsoft Azure**

        > `STORAGE_BLOCKED_LOCATIONS = ( 'azure://account.blob.core.windows.net/container/path/' [ , 'azure://account.blob.core.windows.net/container/path/' ... ] )`
        >
        > > * `account` is the name of the Azure account (e.g. `myaccount`).
        > > * `container` is the name of the Azure container that stores your data files (e.g. `mycontainer`).
        > > * `path` is an optional path (or *directory*) in the container that further limits access to data files.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string_literal'`
    :   String (literal) that specifies a comment for the integration.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the storage integration, which resets them back to their defaults:

    * `ENABLED`
    * `STORAGE_BLOCKED_LOCATIONS`
    * `TAG tag_name [ , tag_name ... ]`
    * `COMMENT`

## Cloud provider parameters (`cloudProviderParams`)

**Amazon S3**

> `STORAGE_AWS_ROLE_ARN = 'iam_role'`
> :   Specifies the Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket
>     containing your data files. For more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
>
> `STORAGE_AWS_OBJECT_ACL = 'bucket-owner-full-control'`
> :   Enables support for AWS access control lists (ACLs) to grant the S3 bucket owner full control. Files created in Amazon S3 buckets from
>     unloaded table data are owned by an AWS Identity and Access Management (IAM) role. ACLs support the use case where IAM roles in one AWS
>     account are configured to access S3 buckets in one or more other AWS accounts. Without ACL support, users in the bucket-owner accounts
>     could not access the data files unloaded to an external (S3) stage using a storage integration.
>
>     When users unload Snowflake table data to data files in an S3 stage using [COPY INTO <location>](copy-into-location.md), the unload operation
>     applies an ACL to the unloaded data files. The data files apply the `"s3:x-amz-acl":"bucket-owner-full-control"` privilege to the files,
>     granting the S3 bucket owner full control over them.
>
> `STORAGE_AWS_EXTERNAL_ID = 'external_id'`
> :   Specifies an external ID that Snowflake uses to establish a trust relationship with AWS.
>     You must specify the same external ID in the trust policy of the IAM role
>     that you configured for this storage integration. For more information,
>     see [How to use an external ID when granting access to your AWS resources to a third party](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html).
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md).

**Microsoft Azure**

> `AZURE_TENANT_ID = 'tenant_id'`
> :   Specifies the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can authenticate
>     to only one tenant, and so the allowed and blocked storage locations must refer to storage accounts that all belong this tenant.
>
>     To find your tenant ID, log into the Azure portal and click Azure Active Directory » Properties. The tenant ID is
>     displayed in the Tenant ID field.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter,
>     see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example initiates operation of a suspended integration:

```sqlexample
ALTER STORAGE INTEGRATION myint SET ENABLED = TRUE;
```

---
title: ALTER STORAGE LIFECYCLE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/alter-storage-lifecycle-policy.md
section: SQL Commands
---

# ALTER STORAGE LIFECYCLE POLICY

Modifies the properties of an existing [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md).

> **Attention:**
>
> Changes to a storage lifecycle policy can have significant impact on all associated tables.
> Use the QUERY_HISTORY view in the ACCOUNT_USAGE schema to audit policy changes regularly.
> For more information, see [QUERY_HISTORY view](../account-usage/query_history.md).

See also:
:   [CREATE STORAGE LIFECYCLE POLICY](create-storage-lifecycle-policy.md) , [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) , [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md) , [SHOW STORAGE LIFECYCLE POLICIES](show-storage-lifecycle-policies.md)

## Syntax

```sqlsyntax
ALTER STORAGE LIFECYCLE POLICY [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER STORAGE LIFECYCLE POLICY [ IF EXISTS ] <name> SET

  BODY -> <expression_on_arg_name>
  | ARCHIVE_TIER = { COOL | COLD }
  | ARCHIVE_FOR_DAYS = <number_of_days>
  | COMMENT = '<string_literal>'
  | TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER STORAGE LIFECYCLE POLICY [ IF EXISTS ] <name> UNSET
  ARCHIVE_FOR_DAYS
  | COMMENT
  | TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier for the policy to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RENAME TO new_name`
:   Specifies the new identifier for the policy; must be unique for your schema.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies one or more properties to set for the policy:

    `BODY -> expression_on_arg_name`
    :   SQL expression that determines the rows to expire.

        To transform the data, you can use built-in functions such as [Conditional expression functions](../expressions-conditional.md) or
        [user-defined functions](../../developer-guide/udf/udf-overview.md) (UDFs).

        > **Note:**
        >
        > Currently, only SQL and JavaScript UDFs are supported in the body of a storage lifecycle policy.

    `ARCHIVE_TIER = { COOL | COLD }`
    :   Specifies a storage tier to convert an expiration policy where ARCHIVE_FOR_DAYS is unset into an archival policy.

        * `COOL` requires that you set an archival period (ARCHIVE_FOR_DAYS) of 90 days or longer.
        * `COLD` requires that you set an archival period (ARCHIVE_FOR_DAYS) of 180 days or longer.

        For supported cloud providers, see [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md).

    `ARCHIVE_FOR_DAYS = number_of_days`
    :   Specifies the number of days to keep rows that match the policy expression in archive storage.
        If set, Snowflake moves the data into archive storage according
        to the value you select for ARCHIVE_TIER. If unset, Snowflake expires the rows from the table without archiving the data.

        Values:

        * ARCHIVE_TIER = COOL: `90` - `2147483647`
        * ARCHIVE_TIER = COLD: `180` - `2147483647`

        Default: Unset

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the policy.

        Default: No value

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies properties to unset for the policy, which resets the properties to their defaults:

    * `ARCHIVE_FOR_DAYS`
    * `COMMENT`
    * `TAG tag_name [ , tag_name ... ]`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Storage lifecycle policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you want to update an existing policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) command.
* You can’t change the policy signature with this command. To change the signature, use the [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md) command and then create a new policy.
* After you set the ARCHIVE_TIER for a policy, you can’t change it. For example, you can’t use this command to change the ARCHIVE_TIER for a policy from COOL to COLD.
* If you unset ARCHIVE_FOR_DAYS for a policy, the storage tier doesn’t change. If you later re-enable archival storage for the policy, you can’t modify the storage tier.
* Including one or more [subqueries](../../user-guide/querying-subqueries.md) in the policy body might cause errors. When possible, limit the
  number of subqueries, limit the number of JOIN operations, and simplify WHERE clause conditions.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example updates the storage lifecycle policy to expire closed accounts after 30 days.

```sqlexample
ALTER STORAGE LIFECYCLE POLICY expire_storage_for_closed_accounts
  SET BODY ->
    event_ts < DATEADD(DAY, -30, CURRENT_TIMESTAMP())
    AND EXISTS (
      SELECT 1 FROM closed_accounts
      WHERE id = account_id
    );
```

---
title: ALTER STREAM
source: https://docs.snowflake.com/en/sql-reference/sql/alter-stream.md
section: SQL Commands
---

# ALTER STREAM

Modifies the properties, columns, or constraints for an existing [stream](../../user-guide/streams-intro.md).

See also:
:   [CREATE STREAM](create-stream.md) , [DROP STREAM](drop-stream.md) , [SHOW STREAMS](show-streams.md) , [DESCRIBE STREAM](desc-stream.md)

## Syntax

```sqlsyntax
ALTER STREAM [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER STREAM [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER STREAM <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER STREAM [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Identifier for the stream to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies the properties to set for the stream:

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    `COMMENT = 'string'`
    :   Adds a comment or overwrites an existing comment for the stream.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the stream, which resets them back to their defaults:

    * `TAG tag_key [ , tag_key ... ]`
    * `COMMENT`

## Usage notes

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Add a comment for a stream:

> ```sqlexample
> ALTER STREAM mystream SET COMMENT = 'New comment for stream';
> ```

---
title: ALTER STREAMLIT
source: https://docs.snowflake.com/en/sql-reference/sql/alter-streamlit.md
section: SQL Commands
---

# ALTER STREAMLIT

Modifies the properties of an existing Streamlit object.

See also:
:   [CREATE STREAMLIT](create-streamlit.md), [SHOW STREAMLITS](show-streamlits.md), [DESCRIBE STREAMLIT](desc-streamlit.md), [DROP STREAMLIT](drop-streamlit.md)

## Syntax

```sqlsyntax
ALTER STREAMLIT [ IF EXISTS ] <name> SET
  [ MAIN_FILE = '<filename>']
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ RUNTIME_NAME = '<runtime_name>' ]
  [ COMPUTE_POOL = <compute_pool_name> ]
  [ COMMENT = '<string_literal>']
  [ TITLE = '<app_title>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]
  [ SECRETS = ( '<snowflake_secret_name>' = <snowflake_secret> [ , ... ] ) ]

ALTER STREAMLIT [ IF EXISTS ] <name> UNSET { SECRETS                      |
                                             EXTERNAL_ACCESS_INTEGRATIONS |
                                             QUERY_WAREHOUSE              |
                                             TITLE                        |
                                             COMMENT
                                           }
                                           [ , ... ]

ALTER STREAMLIT [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER STREAMLIT <name> COMMIT

ALTER STREAMLIT <name> PUSH [ TO <git_branch_uri> ]
  [
    {
      GIT_CREDENTIALS = <snowflake_secret>
      | USERNAME = <git_username> PASSWORD = <git_password>
    }
    NAME = <git_author_name>
    EMAIL = <git_author_email>
  ]
  [ COMMENT = <git_push_comment> ]

ALTER STREAMLIT <name> ABORT

ALTER STREAMLIT <name> PULL

ALTER STREAMLIT <name> ADD LIVE VERSION FROM LAST
```

**For Streamlit objects created with ROOT_LOCATION, only the following syntax is supported:**

> **Important:**
>
> * ROOT_LOCATION is a legacy parameter and may be deprecated in a future release.
> * For container runtimes, ROOT_LOCATION is not supported.
> * For Streamlit apps created using ROOT_LOCATION, multi-file editing and Git integration are not supported.

```sqlsyntax
ALTER STREAMLIT [ IF EXISTS ] <name> SET
  [ ROOT_LOCATION = '<stage_path_and_root_directory>' ]
  [ MAIN_FILE = '<path_to_main_file>']
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ COMMENT = '<string_literal>']
  [ TITLE = '<app_title>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]

ALTER STREAMLIT [ IF EXISTS ] <name> RENAME TO <new_name>
```

## Parameters

`name`
:   Identifier for the Streamlit object. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`SET ...`
:   Specifies the property to set for the Streamlit object:

    `MAIN_FILE = 'filename'`
    :   Specifies the Streamlit entrypoint file. The requirements depend on the runtime type:

    * **Warehouse runtimes**: The file must be in the root of the source directory specified in FROM.
      Only a filename is allowed, not a path.
    * **Container runtimes**: The file can be in the root or a subdirectory. You can specify a relative
      path from the root of the source directory, like `'subdir/my_app.py'`.

      If your app was created with ROOT_LOCATION instead of FROM, then MAIN_FILE can be a path relative to ROOT_LOCATION
      even though ROOT_LOCATION only supports warehouse runtimes.

    `QUERY_WAREHOUSE = warehouse_name`
    :   Specifies the warehouse used by the Streamlit app. The behavior depends on the runtime type:

        * **Warehouse runtimes**: Specifies the warehouse to run the app code and execute SQL queries.
          This is the code warehouse. It’s recommended to manually switch to a different warehouse within your app code for queries.
        * **Container runtimes**: Specifies the warehouse to execute SQL queries issued by the app.
          The app code runs on the compute pool specified by COMPUTE_POOL.

    `RUNTIME_NAME = 'runtime_name'`
    :   Specifies the runtime environment for the Streamlit app. Use this to change the runtime from
        warehouse to container or from container to warehouse.

        * **Warehouse runtime**: Run the app in a virtual warehouse. Each viewer gets a personal instance
          of the app. Use `SYSTEM$WAREHOUSE_RUNTIME`. The Python version is selected separately
          using the `environment.yml` file.
        * **Container runtimes**: Run the app in a Snowpark Container Services compute pool. All viewers
          share a single, long-running instance of the app. Container runtime names include the Python
          version. The following container runtimes are valid:

          + `SYSTEM$ST_CONTAINER_RUNTIME_PY3_11`

        > **Important:**
        >
        > When changing from a warehouse runtime to a container runtime, you must also
        > set the COMPUTE_POOL parameter as appropriate. Container runtimes require a compute pool.

    `COMPUTE_POOL = compute_pool_name`
    :   Specifies the compute pool where the Streamlit app runs. This parameter is required when using
        a container runtime and is ignored for warehouse runtimes.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the Streamlit object.

    `TITLE = 'app_title'`
    :   Adds a title for the Streamlit app to display in Snowsight.

    `IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [ , ... ] )`
    :   The location (stage), path, and name of the directory or file(s) to import. This only applies to warehouse runtimes and
        is ignored for container runtimes.

    `EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
    :   The names of [external access integrations](create-external-access-integration.md) needed in order for the
        Streamlit app code to access external networks.

        For container runtimes, external access integrations are required to install packages from external package indexes
        like PyPI. For all runtime types, external access integrations enable the app to make outbound network requests.

    `SECRETS = ( 'snowflake_secret_name' = snowflake_secret [ , ... ] )`
    :   Maps Snowflake secrets to secret names that can be referenced in the Streamlit app code. The secret name (left side)
        is how you reference the secret in your code, and the secret object (right side) is the identifier of the Snowflake secret.

        For example: `SECRETS = ('api_key' = my_database.my_schema.my_secret)`

        In warehouse runtimes, secrets are accessed through the `_snowflake` module. In container runtimes,
        secrets are accessible through `st.secrets` and are also mapped to environment variables.
        Secrets must be associated with an external access integration in EXTERNAL_ACCESS_INTEGRATIONS.
        For more information, see [Manage secrets and configure your Streamlit app](../../developer-guide/streamlit/app-development/secrets-and-configuration.md).

    `ROOT_LOCATION = 'stage_path_and_root_directory'`
    :   Specifies the root stage name and prefix containing the Streamlit Python files, media files, and `environment.yml`
        file. This parameter must point to a single directory inside a named internal stage.

`UNSET ...`
:   Specifies one or more properties to unset for the Streamlit object, which resets them to their defaults:

    * `SECRETS`
    * `EXTERNAL_ACCESS_INTEGRATIONS`
    * `QUERY_WAREHOUSE`
    * `TITLE`
    * `COMMENT`

`RENAME TO new_name`
:   Specifies the new identifier for the Streamlit object. The identifier must be unique for the schema where the object was created.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

`COMMIT`
:   Commits the pending edits in the LIVE version to a new LAST version. Immediately after the commit,
    the LIVE version is identical to the LAST version.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

`PUSH`
:   Pushes the latest committed changes to the Git repo, using the branch stored in the base version if `TO git_branch_uri` is not specified.

    If the base version is not based on a Git branch, this throws an error.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    `TO git_branch_uri`
    :   Pushes committed changes to the specified branch.

    `GIT_CREDENTIALS = snowflake_secret`
    :   Specifies the Snowflake secret containing the credentials to use for authenticating with the repository.

    `USERNAME = git_username`
    :   Specifies a Git username.

    `PASSWORD = git_password`
    :   Specifies a Git password.

    `NAME = git_author_name`
    :   Specifies the name of the git author to use.

    `EMAIL = git_author_email`
    :   Specifies a valid e-mail address to use as the git author’s name.

    `COMMENT = git_push_comment`
    :   Specifies a comment to include in the git push.

`ABORT`
:   Removes the current live version of the app, including all edits made in
    Snowsight that have not been committed.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

`PULL`
:   Pulls latest changes. You must abort the current live version before pulling.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

`ADD LIVE VERSION FROM LAST`
:   Creates a new live version of the app based on the last committed version.

    When the owner of a Streamlit app opens the app in Snowsight and a
    live version doesn’t exist, this command is executed automatically. If a
    different user visits the app and a live version doesn’t exist, an error is
    returned.

## Access control requirements

If your role does not own the objects in the following table, then your role
must have the listed
[privileges](../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Streamlit object that you alter |  |
| USAGE | Warehouse used by the Streamlit app | This privilege is only required if you set a new value for QUERY_WAREHOUSE. |
| USAGE | Compute pool used by the Streamlit app | This privilege is only required if you set a new value for COMPUTE_POOL. |
| USAGE | External access integrations used by the Streamlit app | This privilege is only required if you set a new value for EXTERNAL_ACCESS_INTEGRATIONS. |
| USAGE | Secrets used by the Streamlit app | This privilege is only required if you set a new value for SECRETS. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you remove the live version of the app, a user can’t visit the app until
  you do one of the following actions:

  + Execute ALTER STREAMLIT … ADD LIVE VERSION FROM LAST on the Streamlit
    object.
  + Visit the app in Snowsight with the role that owns the app.
* If you run the ALTER STREAMLIT command while viewing a Streamlit app in Snowsight,
  the app reflects the changes differently depending on the runtime type:

  + **Warehouse runtime**: The app doesn’t reflect the changes until you select Run.
  + **Container runtime**: The app reflects the changes immediately when you next interact with the app.

  If you want your changes reflected in the app, you must reload or reboot the app.
* When migrating from warehouse runtime to container runtime:

  + You must set both RUNTIME_NAME and COMPUTE_POOL.
  + Your app must use Python 3.11 and Streamlit 1.50 or later.
  + Ensure your app code is thread-safe and optimized for concurrent viewers.
  + Replace `get_active_session()` with `st.connection("snowflake")`.
  + Replace `_snowflake` module with native Python equivalents.

  For a complete migration checklist, see [Migrating between runtime environments](../../developer-guide/streamlit/migrations-and-upgrades/runtime-migration.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Change the query warehouse

To change the warehouse used by a Streamlit app, run the ALTER STREAMLIT command as shown in the following example:

```sqlexample
ALTER STREAMLIT my_app
  SET QUERY_WAREHOUSE = new_warehouse;
```

### Migrate from a warehouse runtime to a container runtime

To migrate a Streamlit app from warehouse runtime to container runtime, run the ALTER STREAMLIT command as shown in the following example:

```sqlexample
ALTER STREAMLIT my_app SET
  RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
  COMPUTE_POOL = my_compute_pool
  EXTERNAL_ACCESS_INTEGRATIONS = (pypi_access_integration);
```

Container runtimes require an external access integration to install packages from external package indexes like PyPI. Otherwise, they
can only use the default, pre-installed packages. For more information, see [Manage dependencies for your Streamlit app](../../developer-guide/streamlit/app-development/dependency-management.md).

### Add secrets to an existing warehouse-runtime app

To add secrets to an existing warehouse-runtime Streamlit app, run the ALTER STREAMLIT command as shown in the following example:

```sqlexample
ALTER STREAMLIT my_app SET
  EXTERNAL_ACCESS_INTEGRATIONS = (my_access_integration)
  SECRETS = ('api_key' = my_database.my_schema.my_api_secret);
```

Secrets must be associated with an external access integration. In warehouse runtimes, secrets are accessed
through the `_snowflake` module. In container runtimes, secrets are accessible through `st.secrets`
and as environment variables. For more information, see
[Manage secrets and configure your Streamlit app](../../developer-guide/streamlit/app-development/secrets-and-configuration.md).

### Remove secrets from an app

To remove all secrets from a Streamlit app, run the ALTER STREAMLIT command as shown in the following example:

```sqlexample
ALTER STREAMLIT my_app
  UNSET SECRETS;
```

This removes all secret associations from the app. The underlying secret objects remain in your Snowflake account.
To also remove the external access integrations, unset both properties:

```sqlexample
ALTER STREAMLIT my_app
  UNSET SECRETS, EXTERNAL_ACCESS_INTEGRATIONS;
```

For more information, see [Manage secrets and configure your Streamlit app](../../developer-guide/streamlit/app-development/secrets-and-configuration.md).

### Rename a Streamlit app

To rename a Streamlit app, run the ALTER STREAMLIT command as shown in the following example:

```sqlexample
ALTER STREAMLIT old_app_name
  RENAME TO new_app_name;
```

---
title: ALTER TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-table.md
section: SQL Commands
---

# ALTER TABLE

Modifies the properties, columns, or constraints for an existing table.

See also:
:   [ALTER TABLE … ALTER COLUMN](alter-table-column.md) , [CREATE TABLE](create-table.md) , [DROP TABLE](drop-table.md) , [SHOW TABLES](show-tables.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
 ALTER TABLE [ IF EXISTS ] <name> RENAME TO <new_table_name>

 ALTER TABLE [ IF EXISTS ] <name> SWAP WITH <target_table_name>

 ALTER TABLE [ IF EXISTS ] <name> { clusteringAction | tableColumnAction | constraintAction  }

 ALTER TABLE [ IF EXISTS ] <name> dataMetricFunctionAction

 ALTER TABLE [ IF EXISTS ] <name> dataGovnPolicyTagAction

 ALTER TABLE [ IF EXISTS ] <name> extTableColumnAction

 ALTER TABLE [ IF EXISTS ] <name> searchOptimizationAction

 ALTER TABLE [ IF EXISTS ] <name> ADD STORAGE LIFECYCLE POLICY <policy_name>
   ON ( <col_name> [ , <col_name> ... ] )

ALTER TABLE [ IF EXISTS ] <name> DROP STORAGE LIFECYCLE POLICY

 ALTER TABLE [ IF EXISTS ] <name> SET
   [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
   [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
   [ CHANGE_TRACKING = { TRUE | FALSE  } ]
   [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
   [ ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE } ]
   [ ERROR_LOGGING = { TRUE | FALSE } ]
   [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
   [ COMMENT = '<string_literal>' ]
   [ ROW_TIMESTAMP = { TRUE | FALSE } ]

 ALTER TABLE [ IF EXISTS ] <name> UNSET {
                                        DATA_RETENTION_TIME_IN_DAYS         |
                                        MAX_DATA_EXTENSION_TIME_IN_DAYS     |
                                        CHANGE_TRACKING                     |
                                        DEFAULT_DDL_COLLATION               |
                                        ENABLE_SCHEMA_EVOLUTION             |
                                        ERROR_LOGGING                       |
                                        CONTACT <purpose>                   |
                                        COMMENT                             |
                                        ROW_TIMESTAMP                       |
                                        DCM PROJECT
                                        }
                                        [ , ... ]
```

Where:

> ```sqlsyntax
> clusteringAction ::=
>   {
>      CLUSTER BY ( <expr> [ , <expr> , ... ] )
>      /* RECLUSTER is deprecated */
>    | RECLUSTER [ MAX_SIZE = <budget_in_bytes> ] [ WHERE <condition> ]
>      /* { SUSPEND | RESUME } RECLUSTER is valid action */
>    | { SUSPEND | RESUME } RECLUSTER
>    | DROP CLUSTERING KEY
>   }
> ```
>
> ```sqlsyntax
> tableColumnAction ::=
>   {
>      ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type>
>         [
>            {
>               DEFAULT <default_value>
>               | { AUTOINCREMENT | IDENTITY }
>                  /* AUTOINCREMENT (or IDENTITY) is supported only for           */
>                  /* columns with numeric data types (NUMBER, INT, FLOAT, etc.). */
>                  /* Also, if the table is not empty (that is, if the table contains */
>                  /* any rows), only DEFAULT can be altered.                     */
>                  [
>                     {
>                        ( <start_num> , <step_num> )
>                        | START <num> INCREMENT <num>
>                     }
>                  ]
>                  [  { ORDER | NOORDER } ]
>            }
>         ]
>         [ inlineConstraint ]
>         [ COLLATE '<collation_specification>' ]
>
>    | RENAME COLUMN <col_name> TO <new_col_name>
>
>    | ALTER | MODIFY [ ( ]
>                             [ COLUMN ] <col1_name> DROP DEFAULT
>                           , [ COLUMN ] <col1_name> SET DEFAULT <seq_name>.NEXTVAL
>                           , [ COLUMN ] <col1_name> { [ SET ] NOT NULL | DROP NOT NULL }
>                           , [ COLUMN ] <col1_name> [ [ SET DATA ] TYPE ] <type>
>                           , [ COLUMN ] <col1_name> COMMENT '<string>'
>                           , [ COLUMN ] <col1_name> UNSET COMMENT
>                         [ , [ COLUMN ] <col2_name> ... ]
>                         [ , ... ]
>                     [ ) ]
>
>    | DROP [ COLUMN ] [ IF EXISTS ] <col1_name> [, <col2_name> ... ]
>   }
>
>   inlineConstraint ::=
>     [ NOT NULL ]
>     [ CONSTRAINT <constraint_name> ]
>     {
>         UNIQUE
>       | PRIMARY KEY
>       | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>       | CHECK ( <expr> )
>     }
>     [ <constraint_properties> ]
> ```
>
> For detailed syntax and examples for altering columns, see [ALTER TABLE … ALTER COLUMN](alter-table-column.md). .
>
> For detailed syntax and examples for creating/altering inline constraints, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> dataMetricFunctionAction ::=
>
>     SET DATA_METRIC_SCHEDULE = {
>         '<num> MINUTE'
>       | 'USING CRON <expr> <time_zone>'
>       | 'TRIGGER_ON_CHANGES'
>     }
>
>   | UNSET DATA_METRIC_SCHEDULE
>
>   | { ADD | DROP } DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       [ EXPECTATION <expectation_name> ( <expression> )
>         [, <expectation_name> ( <expression> ) [ , ... ] ] ]
>       [ EXECUTE AS ROLE <role_name> ]
>       [ ANOMALY_DETECTION = { TRUE | FALSE } ]
>       [ SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' } ]
>       [ , <metric_name_2> ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] ) ]
>         [ EXPECTATION <expectation_name> ( <expression> )
>           [, <expectation_name> ( <expression> ) [ , ... ] ] ]
>         [ EXECUTE AS ROLE <role_name> ]
>         [ ANOMALY_DETECTION = { TRUE | FALSE } ]
>         [ SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' } ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>         { SUSPEND | RESUME }
>       [ , <metric_name_2> ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>         { SUSPEND | RESUME } ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       { ADD | MODIFY } EXPECTATION <expectation_name> ( <expression> )
>           [, <expectation_name> ( <expression> ) [ , ... ] ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       DROP EXPECTATION <expectation_name> [ , <expectation_name> [ , ... ] ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       SET <list_of_properties>
> ```
>
> ```sqlsyntax
> dataGovnPolicyTagAction ::=
>   {
>       SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>     | UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
>   |
>   {
>       ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ROW ACCESS POLICY <policy_name>
>     | DROP ROW ACCESS POLICY <policy_name> ,
>         ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ALL ROW ACCESS POLICIES
>   }
>   |
>   {
>       SET AGGREGATION POLICY <policy_name>
>         [ ENTITY KEY ( <col_name> [, ... ] ) ]
>         [ FORCE ]
>     | UNSET AGGREGATION POLICY
>   }
>   |
>   {
>       SET JOIN POLICY <policy_name>
>         [ FORCE ]
>     | UNSET JOIN POLICY
>   }
>   |
>   ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type>
>     [ [ WITH ] MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] ]
>     [ [ WITH ] PROJECTION POLICY <policy_name> ]
>     [ [ WITH ] TAG ( <tag_name> = '<tag_value>'
>           [ , <tag_name> = '<tag_value>' , ... ] ) ]
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] [ FORCE ]
>       | UNSET MASKING POLICY
>   }
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET PROJECTION POLICY <policy_name>
>           [ FORCE ]
>       | UNSET PROJECTION POLICY
>   }
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> SET TAG
>       <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>       , [ COLUMN ] <col2_name> SET TAG
>           <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
>                    , [ COLUMN ] <col2_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
> ```
>
> ```sqlsyntax
> extTableColumnAction ::=
>   {
>      ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type> AS ( <expr> )
>
>    | RENAME COLUMN <col_name> TO <new_col_name>
>
>    | DROP [ COLUMN ] [ IF EXISTS ] <col1_name> [, <col2_name> ... ]
>   }
> ```
>
> ```sqlsyntax
> constraintAction ::=
>   {
>      ADD outoflineConstraint
>    | RENAME CONSTRAINT <constraint_name> TO <new_constraint_name>
>    | { ALTER | MODIFY } {   CONSTRAINT <constraint_name>
>                           | PRIMARY KEY
>                           | UNIQUE
>                           | FOREIGN KEY } ( <col_name> [ , ... ] )
>                         }
>         [ [ NOT ] ENFORCED ] [ VALIDATE | NOVALIDATE ] [ RELY | NORELY ]
>    | DROP {   CONSTRAINT <constraint_name>
>             | PRIMARY KEY
>             | UNIQUE | FOREIGN KEY } ( <col_name> [ , ... ] )
>         [ CASCADE | RESTRICT ]
>   }
>
>   outoflineConstraint ::=
>     [ CONSTRAINT <constraint_name> ]
>     {
>          UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
>        | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
>        | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
>                           REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>        | CHECK ( <expr> )
>     }
>     [ <constraint_properties> ]
> ```
>
> For detailed syntax and examples for creating/altering out-of-line constraints, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> searchOptimizationAction ::=
>   {
>      ADD SEARCH OPTIMIZATION [
>        ON <search_method_with_target> [ , <search_method_with_target> ... ]
>      ]
>
>    | DROP SEARCH OPTIMIZATION [
>        ON { <search_method_with_target> | <column_name> | <expression_id> }
>           [ , ... ]
>      ]
>   }
> ```
>
> For details, see Search optimization actions (searchOptimizationAction).

## Parameters

`name`
:   Identifier for the table to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case sensitive.

`RENAME TO new_table_name`
:   Renames the specified table with a new identifier that is not currently used by any other tables in the schema.

    For more information about table identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object (table, column, etc.) is renamed, other objects that reference it must be updated with the new name.

`SWAP WITH target_table_name`
:   Swap renames two tables in a single transaction.

    Note that swapping a permanent or transient table with a temporary table, which persists only for the duration of the user session in which
    it was created, is not allowed. This restriction prevents a naming conflict that could occur when a temporary table is swapped with a permanent
    or transient table, and an existing permanent or transient table has the same name as the temporary table. To swap a permanent or transient
    table with a temporary table, use three `ALTER TABLE ... RENAME TO` statements: Rename table `a` to `c`, `b`
    to `a`, and then `c` to `b`.

> **Note:**
>
> To rename a table or swap two tables, the role used to perform the operation must have OWNERSHIP privileges on the table or tables. In addition,
> renaming a table requires the CREATE TABLE privilege on the schema for the table.

`ADD STORAGE LIFECYCLE POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Attaches a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md) to
    the table.

    For more information about creating and managing storage lifecycle policies, see
    [Create and manage storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-create-manage.md).

    > **Important:**
    >
    > If you attach an archival storage policy to a table, the table is permanently assigned to the specified archive tier for its lifetime. You can’t change the archive tier by applying a new policy. For example, you can’t specify a policy created with a COOL archive tier in ALTER TABLE…DROP STORAGE LIFECYCLE POLICY and then subsequently alter the table to add a policy created with a COLD archive tier. To alter the archive tier for a table, contact Snowflake Support to request deletion of the currently archived data. For additional considerations, see [Archival storage policies](../../user-guide/storage-management/storage-lifecycle-policies.md).

`DROP STORAGE LIFECYCLE POLICY`
:   Removes the storage lifecycle policy from the table.

    For more information, see [Remove a policy from a table](../../user-guide/storage-management/storage-lifecycle-policies-create-manage.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the table (separated by blank spaces, commas, or new lines):

    `DATA_RETENTION_TIME_IN_DAYS = integer`
    :   Object-level parameter that modifies the retention period for the table for Time Travel. For more information, see
        [Understanding & using Time Travel](../../user-guide/data-time-travel.md) and [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md).

        For a detailed description of this parameter, as well as more information about object parameters, see [Parameters](../parameters.md).

        Values:

        > * Standard Edition: `0` or `1`
        > * Enterprise Edition:
        >
        >   + `0` to `90` for permanent tables
        >   + `0` or `1` for temporary and transient tables

        > **Note:**
        >
        > A value of `0` effectively disables Time Travel for the table.

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the table to
        prevent streams on the table from becoming stale.

        For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `CHANGE_TRACKING = TRUE | FALSE`
    :   Specifies to enable or disable change tracking on the table.

        * `TRUE` enables change tracking on the table. This option adds several hidden columns to the source table and begins storing
          change tracking metadata in the columns. These columns consume a small amount of storage.

          The change tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for [SELECT](select.md)
          statements, or by creating and querying one or more streams on the table.
        * `FALSE` disables change tracking on the table. Associated hidden columns are dropped from the table.

    `DEFAULT_DDL_COLLATION = 'collation_specification'`
    :   Specifies a default [collation specification](../collation.md) for any new columns added to the table.

        Setting the parameter does not change the collation specification for any existing columns.

        For more information about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

    `ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE }`
    :   Enables or disables automatic changes to the table schema from data loaded into the table from source files, including:

        > * Added columns.
        >
        >   By default, schema evolution is limited to a maximum of 100 added columns per load operation. To request more than 100 added columns per load operation, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
        > * The NOT NULL constraint can be dropped from any number of columns missing in new data files.

        Setting it to `TRUE` enables automatic table schema evolution. The default `FALSE` disables automatic table schema evolution.

        > **Note:**
        >
        > Loading data from files evolves the table columns when all of the following are true:
        >
        > * The [COPY INTO <table>](copy-into-table.md) statement includes the `MATCH_BY_COLUMN_NAME` option.
        > * The role used to load the data has the EVOLVE SCHEMA or OWNERSHIP privilege on the table.
        >
        > Additionally, for schema evolution with CSV, when used with `MATCH_BY_COLUMN_NAME` and `PARSE_HEADER`, `ERROR_ON_COLUMN_COUNT_MISMATCH` must be set to false.

    `ERROR_LOGGING = { TRUE | FALSE }`
    :   Specifies whether to turn on DML error logging for the table.

        * `TRUE` turns on DML error logging for the table.
        * `FALSE` turns off DML error logging for the table.

        For more information, see [DML error logging](../../user-guide/data-load-overview.md).

        > **Note:**
        >
        > If the [OPT_OUT_ERROR_LOGGING](../parameters.md) parameter is set to `TRUE` for a session,
        > DML error logging isn’t turned on, regardless of whether it is turned on for specific tables.

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the table.

    `ROW_TIMESTAMP = { TRUE | FALSE }`
    :   Adds or removes row timestamps on the table.

        * `TRUE` adds row timestamps on the table.
        * `FALSE` removes row timestamps on the table. This parameter setting permanently deletes all stored METADATA$ROW_LAST_COMMIT_TIME values.
          Reenabling it will not restore these values and Time Travel queries will return nothing.

> **Note:**
>
> Do not specify copy options using the CREATE STAGE, ALTER STAGE, CREATE TABLE, or ALTER TABLE commands. We recommend that you use the [COPY INTO <table>](copy-into-table.md) command to specify copy options.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the table, which resets them back to their defaults:

    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `CHANGE_TRACKING`
    * `DEFAULT_DDL_COLLATION`
    * `ENABLE_SCHEMA_EVOLUTION`
    * `CONTACT purpose`
    * `COMMENT`
    * `ROW_TIMESTAMP`

    You cannot unset the CONTACT property with other properties in the same statement.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the table from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the table and the DCM project without dropping the table. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Clustering actions (`clusteringAction`)

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies (or modifies) one or more table columns or column expressions as the clustering key for the table. These are the
    columns/expressions for which clustering is maintained by Automatic Clustering.

    > **Important:**
    >
    > Clustering keys are not intended or recommended for all tables; they typically benefit very large (that is, multi-terabyte) tables.
    >
    > Before you specify a clustering key for a table, please see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

`RECLUSTER ...`
:   Deprecated

    Performs manual, incremental reclustering of a table that has a clustering key defined:

    > `MAX_SIZE = budget_in_bytes`
    > :   Deprecated — use a larger warehouse to achieve more effective manual reclustering
    >
    >     Specifies the upper-limit on the amount of data (in bytes) in the table to recluster.
    >
    > `WHERE condition`
    > :   Specifies a condition or range on which to recluster data in the table.

    > **Note:**
    >
    > Only roles with the OWNERSHIP or INSERT privilege on a table can recluster the table.

`SUSPEND | RESUME RECLUSTER`
:   Enables or disables [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) for the table.

`DROP CLUSTERING KEY`
:   Drops the clustering key for the table.

For more information about clustering keys and reclustering, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Table column actions (`tableColumnAction`)

`ADD [ COLUMN ] [ IF NOT EXISTS ] col_name col_data_type` . `[ DEFAULT default_value | AUTOINCREMENT ... ]` . `[ inlineConstraint ]` `[ COLLATE 'collation_specification' ]` . `[ [ WITH ] MASKING POLICY policy_name ]` . `[ [ WITH ] PROJECTION POLICY policy_name ]` . `[ [ WITH ] TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] ) ] [ , ...]`
:   Adds a new column. You can specify a default value, an inline constraint, a [collation specification](../collation.md),
    a masking policy, and/or one or more tags.

    A default value for a column that you are adding must be a literal value; it cannot be an expression or a value
    returned by a function. For example, the following command returns an expected error:

    ```sqlexample
    ALTER TABLE t1 ADD COLUMN c5 VARCHAR DEFAULT 12345::VARCHAR;
    ```

    ```output
    002263 (22000): SQL compilation error:
    Invalid column default expression [CAST(12345 AS VARCHAR(134217728))]
    ```

    When you first create a table, you can use expressions as default values, but not when you add columns.

    The default value for a column must match the data type of the column. An attempt to
    set a default value with a non-matching data type fails with an error. For example:

    ```sqlexample
    ALTER TABLE t1 ADD COLUMN c6 DATE DEFAULT '20230101';
    ```

    ```output
    002023 (22000): SQL compilation error:
    Expression type does not match column data type, expecting DATE but got VARCHAR(8) for column C6
    ```

    For additional details about table column actions, see:

    * [CREATE TABLE](create-table.md)
    * [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md)
    * [CREATE MASKING POLICY](create-masking-policy.md)
    * [CREATE TAG](create-tag.md)

    ADD COLUMN operations can be performed on multiple columns in the same command.

    If you are not sure if the column already exists, you can specify IF NOT EXISTS when adding the column. If the column already
    exists, ADD COLUMN has no effect on the existing column and does not result in an error.

    > **Note:**
    >
    > You cannot specify IF NOT EXISTS if you are also specifying any of the following for the new column:
    >
    > * DEFAULT, AUTOINCREMENT, or IDENTITY
    > * UNIQUE, PRIMARY KEY, or FOREIGN KEY

`RENAME COLUMN col_name to new_col_name`
:   Renames the specified column to a new name that is not currently used for any other columns in the table.

    You cannot rename a column that is part of a clustering key.

    When an object (table, column, etc.) is renamed, other objects that reference it must be updated with the new name.

`DROP COLUMN [ IF EXISTS ] col_name [ CASCADE | RESTRICT ]`
:   Removes the specified column from the table.

    If you are not sure if the column already exists, you can specify IF EXISTS when dropping the column. If the column does not
    exist, DROP COLUMN has no effect and does not result in an error.

    Dropping a column is a metadata-only operation. It does not immediately re-write the micro-partition(s) and
    therefore does not immediately free up the space used by the column. Typically, the space within an individual
    micro-partition is freed the next time that the micro-partition is re-written, which is typically when a write is
    done either due to DML (INSERT, UPDATE, DELETE) or re-clustering.

## Data metric function actions (`dataMetricFunctionAction`)

`DATA_METRIC_SCHEDULE ...`
:   Specifies the schedule to run the data metric function periodically.

    `'num MINUTE'`
    :   Specifies an interval (in minutes) of wait time inserted between runs of the data metric function. Accepts positive integers only.

        Also supports `num M` syntax.

        For data metric functions, use one of the following values: `5`, `15`, `30`, `60`, `720`, or `1440`.

        If you want to suspend all DMFs associated with the object, set the parameter to an empty string.

        Default: `60 MINUTE`

    `'USING CRON expr time_zone'`
    :   Specifies a cron expression and time zone for periodically running the data metric function. Supports a subset of standard cron utility
        syntax.

        For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones).

        The cron expression consists of the following fields, and the periodic interval must be at least 5 minutes:

        ```bash
        # __________ minute (0-59)
        # | ________ hour (0-23)
        # | | ______ day of month (1-31, or L)
        # | | | ____ month (1-12, JAN-DEC)
        # | | | | _ day of week (0-6, SUN-SAT, or L)
        # | | | | |
        # | | | | |
          * * * * *
        ```

        The following special characters are supported:

        `*`
        :   Wildcard. Specifies any occurrence of the field.

        `L`
        :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of
            a given month. In the day-of-month field, it specifies the last day of the month.

        `/{n}`
        :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
            specified in the month field, then the data metric function is scheduled for April, July and October (i.e. every 3 months, starting
            with the 4th month of the year). The same schedule is maintained in subsequent years. That is, the data metric function is
            not scheduled to run in January (3 months after the October run).

        > **Note:**
        >
        > * The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
        >   for the account (or setting the value at the user or session level) does not change the time zone for the data metric
        >   function.
        > * The cron expression defines all valid run times for the data metric function. Snowflake attempts to run a data metric
        >   function based on this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid
        >   run time starts.
        > * When both a specific day of month and day of week are included in the cron expression, then the data metric function is scheduled
        >   on days satisfying either the day of month or day of week. For example,
        >   `DATA_METRIC_SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'` schedules a data metric function at 0AM on any 10th to 20th day
        >   of the month and also on any Tuesday or Thursday outside of those dates.
        > * The shortest granularity of time in cron is minutes.
        >
        >   If a data metric function is resumed during the minute defined in its cron expression, the first scheduled run of the data metric
        >   function is the next occurrence of the instance of the cron expression. For example, if data metric function scheduled to run daily
        >   at midnight (`USING CRON 0 0 * * *`) is resumed at midnight plus 5 seconds (`00:00:05`), the first data metric function run
        >   is scheduled for the following midnight.

    `'TRIGGER_ON_CHANGES'`
    :   Specifies that the DMF runs when a [DML operation](../sql-dml.md) modifies the table, such as inserting a new row or
        deleting a row.

        You can specify `'TRIGGER_ON_CHANGES'` for the following objects:

        * Dynamic tables
        * External tables
        * Apache Iceberg™ tables
        * Regular tables
        * Temporary tables
        * Transient tables

        You cannot specify `'TRIGGER_ON_CHANGES'` for views.

        Changes to the table as a result of [reclustering](../../user-guide/tables-auto-reclustering.md) do not trigger the DMF to run.

`UNSET DATA_METRIC_SCHEDULE`
:   Resets the schedule for DMFs associated with the object to the default of `60 MINUTE`.

    If you want to suspend DMFs associated with the object, run a `SET DATA_METRIC_SCHEDULE = ''` statement instead.

`{ ADD | DROP } DATA METRIC FUNCTION metric_name`
:   Identifier of the data metric function to add to the table or view or drop from the table or view.

    `ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] )`
    :   The table or view columns on which to associate the data metric function. The data types of the columns must match the data types of
        the columns specified in the data metric function definition.

        If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its columns.

    `EXPECTATION expectation_name ( expression ) [, expectation_name ( expression ) [ , ... ] ]`
    :   Defines one or more [expectations](../../user-guide/data-quality-expectations.md) for the association between the column and the DMF.

    `[ , metric_name_2 ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] ) ]`
    :   Additional data metric functions to add to the table or view. Use a comma to separate each data metric function and its specified
        columns.

        If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its columns.

    `ANOMALY_DETECTION = { TRUE | FALSE }`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts that are Enterprise Edition (or higher).

        To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Specifies whether Snowflake uses the DMF to [automatically detect anomalies](../../user-guide/data-quality-anomaly.md) in the table.

        Default: `FALSE`

    `SENSITIVITY = { LOW | MEDIUM | HIGH }`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts that are Enterprise Edition (or higher).

        To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Specifies the sensitivity of the anomaly-detecting algorithm. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md).

        Default: `'MEDIUM'`

    `EXECUTE AS ROLE role_name`
    :   Specifies which role the DMF runs with. The role must have the SELECT privilege on the table or view.

        For more information, see [Required privilege on the table or view](../../user-guide/data-quality-access-control.md).

`MODIFY DATA METRIC FUNCTION metric_name`
:   Identifier of the data metric function to modify.

    `ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] )`
    :   Specifies the columns associated with the data metric function. If the data metric function accepts a second table as an argument,
        specify the fully qualified name of the table and its columns.

    `{ SUSPEND | RESUME }`
    :   Suspends or resumes the data metric function on the specified columns. When a data metric function is set for a table or view, the data
        metric function is automatically included in the schedule.

        * `SUSPEND` removes the data metric function from the schedule.
        * `RESUME` brings a suspended date metric function back into the schedule.

    `{ ADD | MODIFY } EXPECTATION expectation_name ( expression ) [, expectation_name ( expression ) [ , ... ] ]`
    :   Defines or modifies one or more [expectations](../../user-guide/data-quality-expectations.md) for the association between the column and
        the DMF.

    `DROP EXPECTATION expectation_name [ , expectation_name [ , ... ] ]`
    :   Removes the specified expectations from the association between the column and the DMF.

    `[ , metric_name_2 ON ( col_name [ , ... ] [ , TABLE(col_name [ , ... ] ) ] ) ]`
    :   Additional data metric functions to modify. Use a comma to separate each data metric function and its specified
        columns. If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its
        columns.

    `SET list_of_properties`
    :   Sets one or more properties of the association between the DMF and the object. You set more than one property by using a space-delimited list.

        `ANOMALY_DETECTION = { TRUE | FALSE }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Controls whether Snowflake uses the DMF to [automatically detect anomalies](../../user-guide/data-quality-anomaly.md) in the table.

        `SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Sets the sensitivity of the anomaly-detecting algorithm. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md).

        `DATA_QUALITY_NOTIFICATION = { TRUE | FALSE }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Controls whether notifications are sent when the value returned by the DMF is an expectation violation or an anomaly.

            Notifications are sent if the parameter is set to `TRUE` *and* notifications are turned on for the object’s database. Specify `FALSE` to turn off notifications for this object-DMF association even though notifications are sent for other associations in the database.

            For more information about configuring notifications, see [Sending notifications for data quality issues](../../user-guide/data-quality-notifications.md).

            Default: `TRUE`

## External table column actions (`extTableColumnAction`)

For all other external table modifications, see [ALTER EXTERNAL TABLE](alter-external-table.md).

`ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type> AS ( <expr> ) [, ...]`
:   Adds a new column to the external table.

    If you are not sure if the column already exists, you can specify IF NOT EXISTS when adding the column. If the column already
    exists, ADD COLUMN has no effect on the existing column and does not result in an error.

    This operation can be performed on multiple columns in the same command.

    `col_name`
    :   String that specifies the column identifier (that is, name). All the requirements for table identifiers also apply to column identifiers.

        For more information, see [Identifier requirements](../identifiers-syntax.md).

    `col_type`
    :   String (constant) that specifies the data type for the column. The data type must match the result of `expr` for the column.

        For details about the data types that can be specified for table columns, see [SQL data types reference](../../sql-reference-data-types.md).

    `expr`
    :   String that specifies the expression for the column. When queried, the column returns results derived from this expression.

        External table columns are virtual columns, which are defined using an explicit expression. Add virtual columns as expressions using the
        VALUE column and/or the METADATA$FILENAME pseudocolumn:

        VALUE:
        :   A VARIANT type column that represents a single row in the external file.

            CSV:
            :   The VALUE column structures each row as an object with elements identified by column position (that is,
                `{c1: <column_1_value>, c2: <column_2_value>, c3: <column_1_value> ...}`).

                For example, add a VARCHAR column named `mycol` that references the first column in the staged CSV files:

                ```sqlexample
                mycol varchar as (value:c1::varchar)
                ```

            Semi-structured data:
            :   Enclose element names and values in double-quotes. Traverse the path in the VALUE column using dot notation.

                For example, suppose the following represents a single row of semi-structured data in a staged file:

                ```bash
                { "a":"1", "b": { "c":"2", "d":"3" } }
                ```

                Add a VARCHAR column named `mycol` that references the nested repeating `c` element in the staged file:

                ```sqlexample
                mycol varchar as (value:"b"."c"::varchar)
                ```

        METADATA$FILENAME:
        :   A pseudocolumn that identifies the name of each staged data file included in the external table, including its path in the stage.

`RENAME COLUMN col_name to new_col_name`
:   Renames the specified column to a new name that is not currently used for any other columns in the external table.

`DROP COLUMN [ IF EXISTS ] col_name`
:   Removes the specified column from the external table.

    If you are not sure if the column already exists, you can specify IF EXISTS when dropping the column. If the column does not
    exist, DROP COLUMN has no effect and does not result in an error.

## Constraint actions (`constraintAction`)

`ADD CONSTRAINT`
:   Adds an out-of-line integrity constraint to one or more columns in the table. To add an inline constraint (for a column), see
    Column Actions (in this topic).

`RENAME CONSTRAINT constraint_name TO new_constraint_name`
:   Renames the specified constraint.

`{ ALTER | MODIFY } CONSTRAINT ...`
:   Alters the properties for the specified constraint.

    For CHECK constraints, the `constraint_name` is required.

`DROP CONSTRAINT constraint_name | PRIMARY KEY | UNIQUE | FOREIGN KEY ( col_name [ , ... ] ) [ CASCADE | RESTRICT ]`
:   Drops the specified constraint for the specified column or set of columns.

    For CHECK constraints, the `constraint_name` is required.

For detailed syntax and examples for adding or altering constraints, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).

## Data Governance policy and tag actions (`dataGovnPolicyTagAction`)

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`policy_name`
:   Identifier for the policy; must be unique for your schema.

The following clauses apply to all table kinds that support row access policies, such as but not limited to tables, views, and event tables.
To simplify, the clauses just refer to “table.”

> `ADD ROW ACCESS POLICY policy_name ON (col_name [ , ... ])`
> :   Adds a row access policy to the table.
>
>     At least one column name must be specified. Additional columns can be specified with a comma separating each column name. Use this
>     expression to add a row access policy to both an event table and an external table.
>
> `DROP ROW ACCESS POLICY policy_name`
> :   Drops a row access policy from the table.
>
>     Use this clause to drop the policy from the table.
>
> `DROP ROW ACCESS POLICY policy_name, ADD ROW ACCESS POLICY policy_name ON ( col_name [ , ... ] )`
> :   Drops the row access policy that is set on the table and adds a row access policy to the same table in a single SQL statement.
>
> `DROP ALL ROW ACCESS POLICIES`
> :   Drops all [row access policy](../../user-guide/security-row-using.md) associations from the table.
>
>     This expression is helpful when a row access policy is dropped from a schema before dropping the policy from an event table. Use this expression to drop row access policy associations from the table.
>
>     Suppose that a row access policy applied to the table when the backup was created, and the policy was later dropped. After you
>     restore the table from a [backup](../../user-guide/backups.md), you can’t query it until you run an ALTER TABLE command with the
>     DROP ALL ROW ACCESS POLICIES clause.
>
> `SET AGGREGATION POLICY policy_name`
> :   `[ ENTITY KEY (col_name [ , ... ]) ] [ FORCE ]`
>     :   Assigns an [aggregation policy](../../user-guide/aggregation-policies.md) to the table.
>
>         Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
>         [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).
>
>         Use the optional FORCE parameter to atomically replace an existing aggregation policy with the new aggregation policy.
>
> `UNSET AGGREGATION POLICY`
> :   Detaches an aggregation policy from the table.
>
> `SET JOIN POLICY policy_name`
> :   `[ FORCE ]`
>     :   Assigns a [join policy](../../user-guide/join-policies.md) to the table.
>
>         Use the optional FORCE parameter to atomically replace an existing join policy with the new join policy.
>
> `UNSET JOIN POLICY`
> :   Detaches a join policy from the table.

`{ ALTER | MODIFY } [ COLUMN ] ...`
:   `USING ( col_name , cond_col_1 ... )`
    :   Specifies the arguments to pass into the conditional masking policy SQL expression.

        The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
        column to which the masking policy is set.

        The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query
        result when a query is made on the first column.

        If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
        [masking policy](../../user-guide/security-column-intro.md).

    `FORCE`
    :   Replaces a masking or projection policy that is currently set on a column with a different policy in a single statement.

        Note that using the `FORCE` keyword with a masking policy requires the [data type](../../sql-reference-data-types.md) of the policy
        in the ALTER TABLE statement (i.e. STRING) to match the data type of the masking policy currently set on the column (i.e. STRING).

        If a masking policy is not currently set on the column, specifying this keyword has no effect.

        For details, see: [Replace a masking policy on a column](../../user-guide/security-column-intro.md) or [Replace a projection policy](../../user-guide/projection-policies.md).

## Search optimization actions (`searchOptimizationAction`)

`ADD SEARCH OPTIMIZATION`
:   Adds [search optimization](../../user-guide/search-optimization-service.md) for the entire table or, if you specify the optional
    ON clause, for specific columns.

    > **Note:**
    >
    > * Search optimization can be expensive to maintain, especially if the data in the table changes frequently.
    >   For more information, see [Search optimization cost estimation and management](../../user-guide/search-optimization/cost-estimation.md).
    > * If you try to add search optimization on a materialized view, Snowflake returns an error message.

`ON search_method_with_target [, search_method_with_target ... ]`
:   Specifies that you want to configure search optimization for specific columns or VARIANT fields (instead of the entire table).

    For `search_method_with_target`, use an expression with the following syntax:

    ```sqlsyntax
    <search_method>( <target> [ , <target> , ... ] [ , ANALYZER => '<analyzer_name>' ] )
    ```

    Where:

    * `search_method` specifies one of the following methods that optimizes queries for a particular type of predicate:

      | Search method | Description |
      | --- | --- |
      | `FULL_TEXT` | Predicates that use VARCHAR (text), VARIANT, ARRAY, and OBJECT types. |
      | `EQUALITY` | Equality and IN predicates. |
      | `SUBSTRING` | Predicates that match substrings and regular expressions (for example, [[ NOT ] LIKE](../functions/like.md), [[ NOT ] ILIKE](../functions/ilike.md), [[ NOT ] RLIKE](../functions/rlike.md), and [REGEXP_LIKE](../functions/regexp_like.md)). |
      | `GEO` | Predicates that use GEOGRAPHY types. |
    * `target` specifies the column, VARIANT field, or an asterisk (\*).

      Depending on the value of `search_method`, you can specify a column or VARIANT field of one of the following types:

      | Search method | Supported targets |
      | --- | --- |
      | `FULL_TEXT` | Columns of VARCHAR (text), VARIANT, ARRAY, and OBJECT data types, including paths to fields in VARIANTs. |
      | `EQUALITY` | Columns of numerical, string, binary, and VARIANT data types, including paths to fields in VARIANTs. |
      | `SUBSTRING` | Columns of string or VARIANT data types, including paths to fields in VARIANTs. Specify paths to fields as described above under `EQUALITY`; searches on nested fields are improved in the same way. |
      | `GEO` | Columns of the GEOGRAPHY data type. |

      To specify a VARIANT field, use [dot or bracket notation](../../user-guide/querying-semistructured.md) (for example,
      `my_column:my_field_name.my_nested_field_name` or `my_column['my_field_name']['my_nested_field_name']`).
      You can also use a colon-delimited path to the field (for example, `my_column:my_field_name:my_nested_field_name`).

      When you specify a VARIANT field, the configuration applies to all nested fields under that field.
      For example, if you specify `ON EQUALITY(src:a.b)`:

      + This configuration can improve queries `on src:a.b` and on any nested fields (for example, `src:a.b.c`, `src:a.b.c.d`,
        etc.).
      + This configuration does not affect queries that do not use the `src:a.b` prefix (for example, `src:a`, `src:z`, etc.).

      To specify all applicable columns in the table as targets, use an asterisk (`*`).

      Note that you cannot specify both an asterisk and specific column names for a given search method. However, you can
      specify an asterisk in different search methods.

      For example, you can specify the following expressions:

      ```sqlexample
      -- Allowed
      ON SUBSTRING(*)
      ON EQUALITY(*), SUBSTRING(*), GEO(*)
      ```

      You cannot specify the following expressions:

      ```sqlexample
      -- Not allowed
      ON EQUALITY(*, c1)
      ON EQUALITY(c1, *)
      ON EQUALITY(v1:path, *)
      ON EQUALITY(c1), EQUALITY(*)
      ```

    * `ANALYZER => 'analyzer_name'` specifies the name of the text analyzer, if `search_method`
      is `FULL_TEXT`.

      When the `FULL_TEXT` search method is used and queries are executed with the
      [SEARCH](../functions/search.md) or [SEARCH_IP](../functions/search_ip.md) function, the analyzer
      breaks the search terms (and the text from the column being searched) into tokens. A row matches if any of
      the tokens extracted from the search string matches a token extracted from any of the columns or fields
      being searched. The analyzer isn’t relevant when the `FULL_TEXT` search method isn’t used or for queries
      that don’t use the SEARCH or SEARCH_IP function.

      The analyzer tokenizes a string by breaking it where it finds certain delimiters. These delimiters are not
      included in the resulting tokens, and empty tokens are not extracted.

      This parameter accepts one of the following values:

      + DEFAULT_ANALYZER: Breaks text into tokens based on the following delimiters:

        | Character | Unicode code | Description |
        | --- | --- | --- |
        |  | `U+0020` | Space |
        | `[` | `U+005B` | Left square bracket |
        | `]` | `U+005D` | Right square bracket |
        | `;` | `U+003B` | Semicolon |
        | `<` | `U+003C` | Less-than sign |
        | `>` | `U+003E` | Greater-than sign |
        | `(` | `U+0028` | Left parenthesis |
        | `)` | `U+0029` | Right parenthesis |
        | `{` | `U+007B` | Left curly bracket |
        | `}` | `U+007D` | Right curly bracket |
        | `|` | `U+007C` | Vertical bar |
        | `!` | `U+0021` | Exclamation mark |
        | `,` | `U+002C` | Comma |
        | `'` | `U+0027` | Apostrophe |
        | `"` | `U+0022` | Quotation mark |
        | `*` | `U+002A` | Asterisk |
        | `&` | `U+0026` | Ampersand |
        | `?` | `U+003F` | Question mark |
        | `+` | `U+002B` | Plus sign |
        | `/` | `U+002F` | Slash |
        | `:` | `U+003A` | Colon |
        | `=` | `U+003D` | Equal sign |
        | `@` | `U+0040` | At sign |
        | `.` | `U+002E` | Period (full stop) |
        | `-` | `U+002D` | Hyphen |
        | `$` | `U+0024` | Dollar sign |
        | `%` | `U+0025` | Percent sign |
        | `\` | `U+005C` | Backslash |
        | `_` | `U+005F` | Underscore (low line) |
        | `\n` | `U+000A` | New line (line feed) |
        | `\r` | `U+000D` | Carriage return |
        | `\t` | `U+0009` | Horizontal tab |
      + UNICODE_ANALYZER: Tokenizes based on Unicode segmentation rules that treat spaces and certain
        punctuation characters as delimiters. These internal rules are designed for natural language searches (in
        many different languages). For example, the default analyzer treats periods in IP addresses and
        apostrophes in contractions as delimiters, but the Unicode analyzer does not.
        See [Using an analyzer to adjust search behavior](../functions/search.md).

        For more information about the Unicode Text Segmentation algorithm, see <https://unicode.org/reports/tr29/>.
      + NO_OP_ANALYZER: Tokenizes neither the data nor the query string. A search term must exactly match the full text
        in a column or field, including case sensitivity; otherwise, the SEARCH function returns FALSE. Even if the query
        string looks like it contains multiple tokens (for example, `'sky blue'`), the column or field must equal the
        entire query string exactly. In this case, only `'sky blue'` is a match; `'sky'` and `'blue'` are not matches.
      + ENTITY_ANALYZER: Tokenizes the data for IP address searches.

        This analyzer is used only for queries executed with the SEARCH_IP function.

    To specify more than one search method on a target, use a comma to separate each subsequent method and target:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1), EQUALITY(c2, c3);
    ```

    If you run the ALTER TABLE … ADD SEARCH OPTIMIZATION ON … command multiple times on the same table, each subsequent command
    adds to the existing configuration for the table. For example, suppose that you run the following commands:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2);
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c3, c4);
    ```

    This adds equality predicates for the columns c1, c2, c3, and c4 to the configuration for the table. This is equivalent to
    running the command:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2, c3, c4);
    ```

    For examples, see [Enabling search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

`DROP SEARCH OPTIMIZATION`
:   Removes [search optimization](../../user-guide/search-optimization-service.md) for the entire table or, if you specify the
    optional ON clause, from specific columns.

    > **Note:**
    >
    > * If a table has the search optimization property, then dropping the table and undropping it preserves the
    >   search optimization property.
    > * Removing the search optimization property from a table and then adding it back incurs the same cost as adding it the first
    >   time.

`ON search_method_with_target | column_name | expression_id [ , ... ]`
:   Specifies that you want to drop the search optimization configuration for specific columns or VARIANT fields (instead of
    dropping search optimization for the entire table).

    To identify the column configuration to drop, specify one of the following:

    * For `search_method_with_target`, specify a method for optimizing queries for one or more specific targets, which can
      be columns or VARIANT fields. Use the
      [syntax described earlier](alter-table-event-table.md).
    * For `column_name`, specify the name of the column configured for search optimization. Specifying the column name drops
      all expressions for that column, including expressions that use VARIANT fields in the column.
    * For `expression_id`, specify the ID for an expression listed in the output of the
      [DESCRIBE SEARCH OPTIMIZATION](../../user-guide/search-optimization/enabling.md) command.

    To specify more than one of these, use a comma between items.

    You can specify any combination of search methods with targets, column names, and expression IDs.

    For examples, see [Dropping search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

## Usage notes: General

* Changes to a table are not automatically propagated to views created on that table. For example, if you drop a
  column in a table, and a view is defined to include that column, the view becomes invalid; the view is not
  adjusted to remove the column.

* Dropping a column does not immediately free up the column’s storage space.

  + The space in each micro-partition is not reclaimed until that micro-partition is re-written. Write
    operations (insert, update, delete, etc.) on 1 or more rows in that micro-partition cause the micro-partition to
    be re-written. If you want to force space to be reclaimed, you can follow these steps:

    1. Use a [CREATE TABLE AS SELECT (CTAS)](create-table.md) statement to create a new table that contains
       only the columns of the old table you want to keep.
    2. Set the [DATA_RETENTION_TIME_IN_DAYS](../parameters.md) parameter to `0` for the old table (optional).
    3. Drop the old table.
  + If the table is protected by the Time Travel feature, the space used by the Time Travel storage is not reclaimed
    until the Time Travel retention period expires.
* If a new column with a default value is added to a table with existing rows, all of the existing rows are populated with the default value.
* Adding a new column with a default value containing a function is not currently supported. The following error is returned:

  > `Invalid column default expression (expr)`
* To alter a table, you must be using a role that has ownership privilege on the table.
* To add clustering to a table, you must also have USAGE or OWNERSHIP privileges on the schema and database that
  contain the table.

* For masking policies:

  + The `USING` clause and the `FORCE` keyword are both optional; neither are required to set a masking policy on a column. The
    `USING` clause and the `FORCE` keyword can be used separately or together. For details, see:

    - [Apply a conditional masking policy on a column](../../user-guide/security-column-intro.md)
    - [Replace a masking policy on a column](../../user-guide/security-column-intro.md)
  + A single masking policy that uses conditional columns can be applied to multiple tables provided that the column structure of the table
    matches the columns specified in the policy.
  + When modifying one or more table columns with a masking policy or the table itself with a row access policy, use the
    [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the
    table protected by a row access policy.

* For row access policies:

  + Snowflake supports adding and dropping row access policies in a single SQL statement.

    For example, to replace a row access policy that is already set on a table with a different policy, drop the row access policy first
    and then add the new row access policy.
  + For a given resource (i.e. table or view), to `ADD` or `DROP` a row access policy you must have either the
    [APPLY ROW ACCESS POLICY](../../user-guide/security-row-intro.md) privilege on the schema, or the
    [OWNERSHIP](../../user-guide/security-row-intro.md) privilege on the resource and the APPLY privilege on the row access policy resource.
  + A table or view can only be protected by one row access policy at a time. Adding a policy fails if the policy body refers to a table or
    view column that is protected by a row access policy or the column protected by a masking policy.

    Similarly, adding a masking policy to a table column fails if the masking policy body refers to a table that is protected by a row
    access policy or another masking policy.
  + Row access policies cannot be applied to system views or table functions.
  + Similar to other [DROP <object>](drop.md) operations, Snowflake returns an error if attempting to drop a row access policy from a
    resource that does not have a row access policy added to it.
  + If an object has both a row access policy and one or more masking policies, the row access policy is evaluated first.

* When you attach a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md) to a table by
  using the ADD STORAGE LIFECYCLE POLICY option:

  + You must have the necessary privileges to apply the policy. For information about required privileges, see
    [Storage lifecycle policy privileges](../../user-guide/security-access-control-privileges.md).
  + A table can have only one attached storage lifecycle policy.
  + The number of columns must match the argument count in the policy function signature, and the column data must be compatible with the argument types.
  + Associated policies aren’t affected if you rename table columns. Snowflake associates policies to tables by using the column IDs.
  + In order to evaluate and apply storage lifecycle policy expressions, Snowflake internally and temporarily bypasses any governance policies on a table.

* If you create a foreign key, the columns in the REFERENCES clause must be listed in the same order as they were
  listed for the primary key. For example:

  ```sqlexample
  CREATE TABLE parent ... CONSTRAINT primary_key_1 PRIMARY KEY (c_1, c_2) ...
  CREATE TABLE child  ... CONSTRAINT foreign_key_1 FOREIGN KEY (...) REFERENCES parent (c_1, c_2) ...
  ```

  In both cases, the order of the columns is `c_1, c_2`. If the order of the columns in the foreign key had been different
  (for example, `c_2, c_1`), the attempt to create the foreign key would have failed.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* ALTER TABLE … CHANGE_TRACKING = TRUE

  > + When a table is altered to enable change tracking, the table is locked for the duration of the operation.
  >   Locks can cause latency with some associated DDL/DML operations.
  >   For more information, refer to [Resource locking](../transactions.md).
* Indexes in hybrid tables:

  > + When you use the ALTER TABLE command to add or drop a UNIQUE or
  >   FOREIGN KEY constraint in a hybrid table, the corresponding index is
  >   also created or dropped. For more information about hybrid
  >   table indexes, see [CREATE INDEX](create-index.md).
  > + FOREIGN KEY constraints are supported only across hybrid tables that are
  >   stored in the same database. You cannot move a hybrid table from
  >   one database to another. The PRIMARY KEY, UNIQUE, and
  >   FOREIGN KEY constraints defined on hybrid tables have their RELY
  >   property marked as `TRUE`.
  > + A column that is used by an index cannot be dropped before the
  >   corresponding index is dropped.
* For [interactive tables](../../user-guide/interactive.md), ALTER TABLE supports the following operations:

  + Renaming the table.
  + Modifying columns to set or unset comments.
  + Setting or unsetting masking policies on columns.
  + Adding or unsetting a masking policy, join policy, aggregation policy, or row access policy on the table.
  + Adding a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md)
    to the table, or dropping a storage lifecycle policy from the table.
  + Setting or unsetting tags.

## Usage notes: Data metric functions

Add a DMF to a table:
:   Prior to adding a data metric function to a table, you must:

    * Set the schedule for the data metric function to run. For details, see
      [DATA_METRIC_SCHEDULE](../parameters.md).
    * Configure the event table to store the results of calling the data metric function. For details, see
      [View results of a data metric function](../../user-guide/data-quality-results.md).
    * Ensure that the table is view is not granted to a share because you cannot set a data metric function on a shared table or view.

    Additionally:

    * When you specify a column, Snowflake uses the ordinal position. If you rename a column after adding a data metric function to the table
      or view, the association of the data metric function to the column remains valid.
    * Only one data metric function of its kind can be added to a column. For example, a NULL_COUNT data metric function cannot be added to a
      single column twice.
    * If you drop a column after adding a data metric function that references the column, Snowflake cannot evaluate the data metric function.
    * Referencing a virtual column is not supported.

Schedule a DMF
:   It takes ten minutes for the schedule to become effective once the schedule is set.

    Similarly, it takes ten minutes once the DMF is unset for the scheduling changes to take effect. For more information, see
    [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md).

## Examples

The following sections provide examples of using the ALTER COLUMN command:

* Renaming a table
* Swapping tables
* Adding columns
* Renaming columns
* Dropping columns
* Adding, renaming, and dropping columns in an external table
* Changing the order of clustering keys
* Adding and dropping row access policies

### Renaming a table

The following creates a table named `t1`:

```sqlexample
CREATE OR REPLACE TABLE t1(a1 number);
```

```sqlexample
SHOW TABLES LIKE 't1';
```

```output
+-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------+
| created_on                    | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | change_tracking | is_external | enable_schema_evolution | owner_role_type | is_event | budget |
|-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------|
| 2023-10-19 10:37:04.858 -0700 | T1   | TESTDB        | MY_SCHEMA   | TABLE |         |            |    0 |     0 | PUBLIC | 1              | OFF             | N           | N                       | ROLE            | N        | NULL   |
+-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------+
```

The following statement changes the name of the table to `tt1`:

```sqlexample
ALTER TABLE t1 RENAME TO tt1;
```

```sqlexample
SHOW TABLES LIKE 'tt1';
```

```output
+-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------+
| created_on                    | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time | change_tracking | is_external | enable_schema_evolution | owner_role_type | is_event | budget |
|-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------|
| 2023-10-19 10:37:04.858 -0700 | TT1  | TESTDB        | MY_SCHEMA   | TABLE |         |            |    0 |     0 | PUBLIC | 1              | OFF             | N           | N                       | ROLE            | N        | NULL   |
+-------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+-----------------+-------------+-------------------------+-----------------+----------+--------+
```

### Swapping tables

The following statements create tables named `t1` and `t2`:

```sqlexample
CREATE OR REPLACE TABLE t1(a1 NUMBER, a2 VARCHAR, a3 DATE);
CREATE OR REPLACE TABLE t2(b1 VARCHAR);
```

```sqlexample
DESC TABLE t1;
```

```output
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| A1   | NUMBER(38,0)      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A2   | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | DATE              | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

```sqlexample
DESC TABLE t2;
```

```output
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| B1   | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

The following statement swaps table `t1` with table `t2`:

```sqlexample
ALTER TABLE t1 SWAP WITH t2;
```

```sqlexample
DESC TABLE t1;
```

```output
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| B1   | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

```sqlexample
DESC TABLE t2;
```

```output
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| A1   | NUMBER(38,0)      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A2   | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | DATE              | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

### Adding columns

The following creates a table named `t1`:

```sqlexample
CREATE OR REPLACE TABLE t1(a1 NUMBER);
```

```sqlexample
DESC TABLE t1;
```

```output
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| A1   | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

The following statement adds a column named `a2` to this table:

```sqlexample
ALTER TABLE t1 ADD COLUMN a2 NUMBER;
```

The following statement adds a column named `a3` with a NOT NULL constraint:

```sqlexample
ALTER TABLE t1 ADD COLUMN a3 NUMBER NOT NULL;
```

The following statement adds a column named `a4` with a default value and a NOT NULL constraint:

```sqlexample
ALTER TABLE t1 ADD COLUMN a4 NUMBER DEFAULT 0 NOT NULL;
```

The following statement adds a VARCHAR column named `a5` with a language-specific
[collation specification](../collation.md):

```sqlexample
ALTER TABLE t1 ADD COLUMN a5 VARCHAR COLLATE 'en_US';
```

```sqlexample
DESC TABLE t1;
```

```output
+------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type                              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| A1   | NUMBER(38,0)                      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A2   | NUMBER(38,0)                      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | NUMBER(38,0)                      | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A4   | NUMBER(38,0)                      | COLUMN | N     | 0       | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A5   | VARCHAR(16777216) COLLATE 'en_us' | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

The following statement uses the IF NOT EXISTS clause to add a column named `a2` only if the column does not exist. There is
an existing column named `a2`. Specifying the IF NOT EXISTS clause prevents the statement from failing with an error.

```sqlexample
ALTER TABLE t1 ADD COLUMN IF NOT EXISTS a2 NUMBER;
```

As shown in the output of the [DESCRIBE TABLE](desc-table.md) command, the statement above has no effect on the existing column named `a2`:

```sqlexample
DESC TABLE t1;
```

```output
+------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type                              | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| A1   | NUMBER(38,0)                      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A2   | NUMBER(38,0)                      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | NUMBER(38,0)                      | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A4   | NUMBER(38,0)                      | COLUMN | N     | 0       | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A5   | VARCHAR(16777216) COLLATE 'en_us' | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+-----------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

### Renaming columns

The following statement changes the name of the column `a1` to `b1`:

```sqlexample
ALTER TABLE t1 RENAME COLUMN a1 TO b1;
```

```sqlexample
DESC TABLE t1;
```

```output
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| B1   | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A2   | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | NUMBER(38,0) | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A4   | NUMBER(38,0) | COLUMN | N     | 0       | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

### Dropping columns

The following statement drops the column `a2`:

```sqlexample
ALTER TABLE t1 DROP COLUMN a2;
```

```sqlexample
DESC TABLE t1;
```

```output
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| B1   | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | NUMBER(38,0) | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A4   | NUMBER(38,0) | COLUMN | N     | 0       | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

The following statement uses the IF EXISTS clause to drop a column named `a2` only if the column exists. There is no existing
column named `a2`. Specifying the IF EXISTS clause prevents the statement from failing with an error.

```sqlexample
ALTER TABLE t1 DROP COLUMN IF EXISTS a2;
```

As shown in the output of the [DESCRIBE TABLE](desc-table.md) command, the statement above has no effect on the existing table:

```sqlexample
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| B1   | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A3   | NUMBER(38,0) | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| A4   | NUMBER(38,0) | COLUMN | N     | 0       | N           | N          | NULL  | NULL       | NULL    | NULL        |
+------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

### Adding, renaming, and dropping columns in an external table

The following statement creates an external table named `exttable1`:

```sqlexample
CREATE EXTERNAL TABLE exttable1
  LOCATION=@mystage/logs/
  AUTO_REFRESH = true
  FILE_FORMAT = (TYPE = PARQUET)
  ;
```

```sqlexample
DESC EXTERNAL TABLE exttable1;
```

```output
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
| name      | type              | kind      | null? | default | primary key | unique key | check | expression                                               | comment               |
|-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------|
| VALUE     | VARIANT           | COLUMN    | Y     | NULL    | N           | N          | NULL  | NULL                                                     | The value of this row |
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
```

The following statement adds a new column named `a1` to the external table:

```sqlexample
ALTER TABLE exttable1 ADD COLUMN a1 VARCHAR AS (value:a1::VARCHAR);
```

```sqlexample
DESC EXTERNAL TABLE exttable1;
```

```output
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
| name      | type              | kind      | null? | default | primary key | unique key | check | expression                                               | comment               |
|-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------|
| VALUE     | VARIANT           | COLUMN    | Y     | NULL    | N           | N          | NULL  | NULL                                                     | The value of this row |
| A1        | VARCHAR(16777216) | VIRTUAL   | Y     | NULL    | N           | N          | NULL  | TO_CHAR(GET(VALUE, 'a1'))                                | NULL                  |
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
```

The following statement changes the name of the `a1` column to `b1`:

```sqlexample
ALTER TABLE exttable1 RENAME COLUMN a1 TO b1;
```

```sqlexample
DESC EXTERNAL TABLE exttable1;
```

```output
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
| name      | type              | kind      | null? | default | primary key | unique key | check | expression                                               | comment               |
|-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------|
| VALUE     | VARIANT           | COLUMN    | Y     | NULL    | N           | N          | NULL  | NULL                                                     | The value of this row |
| B1        | VARCHAR(16777216) | VIRTUAL   | Y     | NULL    | N           | N          | NULL  | TO_CHAR(GET(VALUE, 'a1'))                                | NULL                  |
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
```

The following statement drops the column named `b1`:

```sqlexample
ALTER TABLE exttable1 DROP COLUMN b1;
```

```sqlexample
DESC EXTERNAL TABLE exttable1;
```

```output
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
| name      | type              | kind      | null? | default | primary key | unique key | check | expression                                               | comment               |
|-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------|
| VALUE     | VARIANT           | COLUMN    | Y     | NULL    | N           | N          | NULL  | NULL                                                     | The value of this row |
+-----------+-------------------+-----------+-------+---------+-------------+------------+-------+----------------------------------------------------------+-----------------------+
```

### Changing the order of clustering keys

The following statement creates a table named `t1` that clusters by the `id` and `date` columns:

```sqlexample
CREATE OR REPLACE TABLE T1 (id NUMBER, date TIMESTAMP_NTZ, name STRING) CLUSTER BY (id, date);
```

```sqlexample
SHOW TABLES LIKE 'T1';
```

```output
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
|           created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |    owner     | retention_time |
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
| Tue, 21 Jun 2016 15:42:12 -0700 | T1   | TESTDB        | TESTSCHEMA  | TABLE |         | (ID,DATE)  | 0    | 0     | ACCOUNTADMIN | 1              |
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
```

The following statement changes the order of the clustering key:

```sqlexample
ALTER TABLE t1 CLUSTER BY (date, id);
```

```sqlexample
SHOW TABLES LIKE 'T1';
```

```output
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
|           created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |    owner     | retention_time |
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
| Tue, 21 Jun 2016 15:42:12 -0700 | T1   | TESTDB        | TESTSCHEMA  | TABLE |         | (DATE,ID)  | 0    | 0     | ACCOUNTADMIN | 1              |
+---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
```

### Adding and dropping row access policies

The following example adds a row access policy on a table while specifying a single column. After setting the policy, you can verify by checking
the [information schema](../../user-guide/security-row-intro.md).

```sqlexample
ALTER TABLE t1 ADD ROW ACCESS POLICY rap_t1 ON (empl_id);
```

The following example adds a row access policy while specifying two columns in a single table.

```sqlexample
ALTER TABLE t1 ADD ROW ACCESS POLICY rap_test2 ON (cost, item);
```

The following example drops a row access policy from a table. Verify the policies were dropped by querying the
[information schema](../../user-guide/security-row-intro.md).

```sqlexample
ALTER TABLE t1 DROP ROW ACCESS POLICY rap_v1;
```

The following example shows how to combine adding and dropping row access policies in a single SQL statement for a table. Verify the
results by checking the [information schema](../../user-guide/security-row-intro.md).

> ```sqlexample
> alter table t1
>   drop row access policy rap_t1_version_1,
>   add row access policy rap_t1_version_2 on (empl_id);
> ```

### Schedule for a data metric function to run

Set the data metric function schedule to run every 5 minutes:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = '5 MINUTE';
> ```

Set the data metric function schedule to run at 8:00 AM daily:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 8 * * * UTC';
> ```

Set the data metric function schedule to run at 8:00 AM on weekdays only:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 8 * * MON,TUE,WED,THU,FRI UTC';
> ```

Set the data metric function schedule to run three times daily at 0600, 1200, and 1800 UTC:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'USING CRON 0 6,12,18 * * * UTC';
> ```

Set the data metric function to run when a general DML operation, such as inserting a new row, modifies the table:

> ```sqlexample
> ALTER TABLE hr.tables.empl_info SET
>   DATA_METRIC_SCHEDULE = 'TRIGGER_ON_CHANGES';
> ```

### Apply a join policy on a table

Alter a table to apply a [join policy](../../user-guide/join-policies.md) with an allowed joining column:

```sqlexample
ALTER TABLE join_table_2
  SET JOIN POLICY jp1 ALLOWED JOIN KEYS (col1);
```

---
title: ALTER TABLE (event tables)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-table-event-table.md
section: SQL Commands
---

# ALTER TABLE (event tables)

Modifies the properties, columns, or constraints for an existing [event table](../../developer-guide/logging-tracing/event-table-setting-up.md).

See also:
:   [CREATE EVENT TABLE](create-event-table.md) , [DROP TABLE](drop-table.md) , [SHOW EVENT TABLES](show-event-tables.md) , [DESCRIBE EVENT TABLE](desc-event-table.md)

## Syntax

```sqlsyntax
ALTER TABLE [ IF EXISTS ] <name> RENAME TO <new_table_name>

ALTER TABLE [ IF EXISTS ] <name> clusteringAction

ALTER TABLE [ IF EXISTS ] <name> dataGovnPolicyTagAction

ALTER TABLE [ IF EXISTS ] <name> searchOptimizationAction

ALTER TABLE [ IF EXISTS ] <name> SET
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE  } ]
  [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
  [ COMMENT = '<string_literal>' ]

ALTER TABLE [ IF EXISTS ] <name> UNSET {
                                       DATA_RETENTION_TIME_IN_DAYS         |
                                       MAX_DATA_EXTENSION_TIME_IN_DAYS     |
                                       CHANGE_TRACKING                     |
                                       CONTACT <purpose>                   |
                                       COMMENT                             |
                                       }
```

Where:

> ```sqlsyntax
> clusteringAction ::=
>   {
>      CLUSTER BY ( <expr> [ , <expr> , ... ] )
>    | { SUSPEND | RESUME } RECLUSTER
>    | DROP CLUSTERING KEY
>   }
> ```
>
> ```sqlsyntax
> dataGovnPolicyTagAction ::=
>   {
>       SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>     | UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
>   |
>   {
>       ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ROW ACCESS POLICY <policy_name>
>     | DROP ROW ACCESS POLICY <policy_name> ,
>         ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ALL ROW ACCESS POLICIES
>   }
> ```
>
> ```sqlsyntax
> searchOptimizationAction ::=
>   {
>      ADD SEARCH OPTIMIZATION [
>        ON <search_method_with_target> [ , <search_method_with_target> ... ]
>      ]
>
>    | DROP SEARCH OPTIMIZATION [
>        ON { <search_method_with_target> | <column_name> | <expression_id> }
>           [ , ... ]
>      ]
>
>   }
> ```
>
> For details, see Search optimization actions (searchOptimizationAction).

## Parameters

`name`
:   Identifier for the event table to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_table_name`
:   Renames the specified event table with a new identifier that is not currently used by any other event tables in the schema.

    > **Note:**
    >
    > Not supported on the default event table, SNOWFLAKE.TELEMETRY.EVENTS.

    For more details about event table identifiers, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object (table, column, etc.) is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies one or more properties/parameters to set for the event table (separated by blank spaces, commas, or new lines):

    `DATA_RETENTION_TIME_IN_DAYS = integer`
    :   Object-level parameter that modifies the retention period for the event table for Time Travel. For more details, see
        [Understanding & using Time Travel](../../user-guide/data-time-travel.md) and [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md).

        For a detailed description of this parameter, as well as more information about object parameters, see [Parameters](../parameters.md).

        Values:

        > * Standard Edition: `0` or `1`
        > * Enterprise Edition:
        >
        >   + `0` to `90` for permanent event tables
        >   + `0` or `1` for temporary and transient event tables

        > **Note:**
        >
        > A value of `0` effectively disables Time Travel for the event table.

    `MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
    :   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the event table to
        prevent streams on the event table from becoming stale.

        For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

    `CHANGE_TRACKING = TRUE | FALSE`
    :   Specifies to enable or disable change tracking on the event table.

        * `TRUE` enables change tracking on the event table. This option adds a pair of hidden columns to the source event table and begins storing
          change tracking metadata in the columns. These columns consume a small amount of storage.

          The change tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for [SELECT](select.md)
          statements, or by creating and querying one or more streams on the event table.
        * `FALSE` disables change tracking on the event table. The pair of hidden columns is dropped from the event table.

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites the existing comment for the event table.

`UNSET ...`
:   Specifies one or more properties/parameters to unset for the event table, which resets them back to their defaults:

    * `DATA_RETENTION_TIME_IN_DAYS`
    * `MAX_DATA_EXTENSION_TIME_IN_DAYS`
    * `CHANGE_TRACKING`
    * `CONTACT purpose`
    * `COMMENT`

    You cannot unset the CONTACT property with other properties in the same statement.

## Data Governance policy and tag actions (`dataGovnPolicyTagAction`)

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`policy_name`
:   Identifier for the policy; must be unique for your schema.

The following clauses apply to all table kinds that support row access policies, such as but not limited to tables, views, and event tables.
To simplify, the clauses just refer to “table.”

> `ADD ROW ACCESS POLICY policy_name ON (col_name [ , ... ])`
> :   Adds a row access policy to the table.
>
>     At least one column name must be specified. Additional columns can be specified with a comma separating each column name. Use this
>     expression to add a row access policy to both an event table and an external table.
>
> `DROP ROW ACCESS POLICY policy_name`
> :   Drops a row access policy from the table.
>
>     Use this clause to drop the policy from the table.
>
> `DROP ROW ACCESS POLICY policy_name, ADD ROW ACCESS POLICY policy_name ON ( col_name [ , ... ] )`
> :   Drops the row access policy that is set on the table and adds a row access policy to the same table in a single SQL statement.
>
> `DROP ALL ROW ACCESS POLICIES`
> :   Drops all [row access policy](../../user-guide/security-row-using.md) associations from the table.
>
>     This expression is helpful when a row access policy is dropped from a schema before dropping the policy from an event table. Use this expression to drop row access policy associations from the table.
>
>     Suppose that a row access policy applied to the table when the backup was created, and the policy was later dropped. After you
>     restore the table from a [backup](../../user-guide/backups.md), you can’t query it until you run an ALTER TABLE command with the
>     DROP ALL ROW ACCESS POLICIES clause.
>
> `SET AGGREGATION POLICY policy_name`
> :   `[ ENTITY KEY (col_name [ , ... ]) ] [ FORCE ]`
>     :   Assigns an [aggregation policy](../../user-guide/aggregation-policies.md) to the table.
>
>         Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
>         [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).
>
>         Use the optional FORCE parameter to atomically replace an existing aggregation policy with the new aggregation policy.
>
> `UNSET AGGREGATION POLICY`
> :   Detaches an aggregation policy from the table.
>
> `SET JOIN POLICY policy_name`
> :   `[ FORCE ]`
>     :   Assigns a [join policy](../../user-guide/join-policies.md) to the table.
>
>         Use the optional FORCE parameter to atomically replace an existing join policy with the new join policy.
>
> `UNSET JOIN POLICY`
> :   Detaches a join policy from the table.

## Clustering actions (`clusteringAction`)

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies (or modifies) one or more event table columns or column expressions as the clustering key for the event table. These are the
    columns/expressions for which clustering is maintained by Automatic Clustering.

    > **Important:**
    >
    > Clustering keys are not intended or recommended for all event tables; they typically benefit very large (i.e. multi-terabyte) event tables.
    >
    > Before you specify a clustering key for an event table, please see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

`SUSPEND | RESUME RECLUSTER`
:   Enables or disables [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) for the event table.

`DROP CLUSTERING KEY`
:   Drops the clustering key for the event table.

For more information about clustering keys and reclustering, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

## Search optimization actions (`searchOptimizationAction`)

`ADD SEARCH OPTIMIZATION`
:   Adds [search optimization](../../user-guide/search-optimization-service.md) for the entire event table or, if you specify the optional
    ON clause, for specific columns.

    Note:

    * Search optimization can be expensive to maintain, especially if the data in the event table changes frequently.
      For more information, see [Search optimization cost estimation and management](../../user-guide/search-optimization/cost-estimation.md).
    * If you try to add search optimization on a materialized view, Snowflake returns an error message.

`ON search_method_with_target [, search_method_with_target ... ]`
:   Specifies that you want to configure search optimization for specific columns or VARIANT fields (rather than the entire event table).

    For `search_method_with_target`, use an expression with the following syntax:

    ```sqlsyntax
    <search_method>(<target> [, ...])
    ```

    Where:

    * `search_method` specifies one of the following methods that optimizes queries for a particular type of predicate:

      | Search Method | Description |
      | --- | --- |
      | `EQUALITY` | Equality and IN predicates. |
      | `SUBSTRING` | Predicates that match substrings and regular expressions (e.g. [[ NOT ] LIKE](../functions/like.md), [[ NOT ] ILIKE](../functions/ilike.md), [[ NOT ] RLIKE](../functions/rlike.md), [REGEXP_LIKE](../functions/regexp_like.md), etc.) |
      | `GEO` | Predicates that use GEOGRAPHY types. |
    * `target` specifies the column, VARIANT field, or an asterisk (\*).

      Depending on the value of `search_method`, you can specify a column or VARIANT field of one of the following types:

      | Search Method | Supported Targets |
      | --- | --- |
      | `EQUALITY` | Columns of numerical, string, binary, and VARIANT data types, including paths to fields in VARIANTs.  To specify a VARIANT field, use a colon-delimited path to the field (e.g. `my_column:my_field_name:my_nested_field_name`), or use [dot or bracket notation](../../user-guide/querying-semistructured.md) (e.g. `my_column:my_field_name.my_nested_field_name` or `my_column['my_field_name']['my_nested_field_name']`).  When you specify a VARIANT field, the configuration applies to all nested fields under that field. For example, suppose that you specify `ON EQUALITY(src:a.b)`:  + This configuration can improve queries `on src:a.b` and on any nested fields (e.g. `src:a.b.c`, `src:a.b.c.d`,   etc.). + This configuration does not affect queries that do not use the `src:a.b` prefix (e.g. `src:a`, `src:z`, etc.). |
      | `SUBSTRING` | Columns of string data types. |
      | `GEO` | Columns of the GEOGRAPHY data type. |

      To specify all applicable columns in the event table as targets, use an asterisk (`*`).

      Note that you cannot specify both an asterisk and specific column names for a given search method. However, you can
      specify an asterisk in different search methods.

      For example, you can specify the following expressions:

      ```sqlexample
      -- Allowed
      ON SUBSTRING(*)
      ON EQUALITY(*), SUBSTRING(*), GEO(*)
      ```

      You cannot specify the following expressions:

      ```sqlexample
      -- Not allowed
      ON EQUALITY(*, c1)
      ON EQUALITY(c1, *)
      ON EQUALITY(v1:path, *)
      ON EQUALITY(c1), EQUALITY(*)
      ```

    To specify more than one search method on a target, use a comma to separate each subsequent method and target:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1), EQUALITY(c2, c3);
    ```

    If you run the ALTER TABLE … ADD SEARCH OPTIMIZATION ON … command multiple times on the same event table, each subsequent command
    adds to the existing configuration for the event table. For example, suppose that you run the following commands:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2);
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c3, c4);
    ```

    This adds equality predicates for the columns c1, c2, c3, and c4 to the configuration for the event table. This is equivalent to
    running the command:

    ```sqlexample
    ALTER TABLE t1 ADD SEARCH OPTIMIZATION ON EQUALITY(c1, c2, c3, c4);
    ```

    For examples, see [Enabling search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

`DROP SEARCH OPTIMIZATION`
:   Removes [search optimization](../../user-guide/search-optimization-service.md) for the entire event table or, if you specify the
    optional ON clause, from specific columns.

    Note:

    * If an event table has the search optimization property, then dropping the event table and undropping it preserves the
      search optimization property.
    * Removing the search optimization property from an event table and then adding it back incurs the same cost as adding it the first
      time.

`ON search_method_with_target | column_name | expression_id [, ... ]`
:   Specifies that you want to drop the search optimization configuration for specific columns or VARIANT fields (rather than
    dropping search optimization for the entire event table).

    To identify the column configuration to drop, specify one of the following:

    * For `search_method_with_target`, specify a method for optimizing queries for one or more specific targets, which can
      be columns or VARIANT fields. Use the
      syntax described earlier.
    * For `column_name`, specify the name of the column configured for search optimization. Specifying the column name drops
      all expressions for that column, including expressions that use VARIANT fields in the column.
    * For `expression_id`, specify the ID for an expression listed in the output of the
      [DESCRIBE SEARCH OPTIMIZATION](../../user-guide/search-optimization/enabling.md) command.

    To specify more than one of these, use a comma between items.

    You can specify any combination of search methods with targets, column names, and expression IDs.

    For examples, see [Dropping search optimization for specific columns](../../user-guide/search-optimization/enabling.md).

## Usage notes

* Changes to an event table are not automatically propagated to views created on that event table.
* To alter an event table, you must be using a role that has ownership privilege on the event table.
* To add clustering to an event table, you must also have USAGE or OWNERSHIP privileges on the schema and database that
  contain the event table.

* For row access policies:

  + Snowflake supports adding and dropping row access policies in a single SQL statement.

    For example, to replace a row access policy that is already set on a table with a different policy, drop the row access policy first
    and then add the new row access policy.
  + For a given resource (i.e. table or view), to `ADD` or `DROP` a row access policy you must have either the
    [APPLY ROW ACCESS POLICY](../../user-guide/security-row-intro.md) privilege on the schema, or the
    [OWNERSHIP](../../user-guide/security-row-intro.md) privilege on the resource and the APPLY privilege on the row access policy resource.
  + A table or view can only be protected by one row access policy at a time. Adding a policy fails if the policy body refers to a table or
    view column that is protected by a row access policy or the column protected by a masking policy.

    Similarly, adding a masking policy to a table column fails if the masking policy body refers to a table that is protected by a row
    access policy or another masking policy.
  + Row access policies cannot be applied to system views or table functions.
  + Similar to other [DROP <object>](drop.md) operations, Snowflake returns an error if attempting to drop a row access policy from a
    resource that does not have a row access policy added to it.
  + If an object has both a row access policy and one or more masking policies, the row access policy is evaluated first.

* If you create a foreign key, the columns in the REFERENCES clause must be listed in the same order as they were
  listed for the primary key. For example:

  ```sqlexample
  CREATE TABLE parent ... CONSTRAINT primary_key_1 PRIMARY KEY (c_1, c_2) ...
  CREATE TABLE child  ... CONSTRAINT foreign_key_1 FOREIGN KEY (...) REFERENCES parent (c_1, c_2) ...
  ```

  In both cases, the order of the columns is `c_1, c_2`. If the order of the columns in the foreign key had been different
  (for example, `c_2, c_1`), the attempt to create the foreign key would have failed.

* You can use data metric functions with event tables by executing an [ALTER TABLE](alter-table.md) command. For more information, see
  [Use SQL to set up data metric functions](../../user-guide/data-quality-working.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* ALTER TABLE … CHANGE_TRACKING = TRUE

  > + When an event table is altered to enable change tracking, the event table is locked for the duration of the operation.
  >   Locks can cause latency with some associated DDL/DML operations.
  >   For more information, refer to [Resource locking](../transactions.md).

## Examples

Rename event table `t1` to `a1`:

> ```sqlexample
> CREATE OR REPLACE TABLE t1(a1 number);
>
> SHOW TABLES LIKE 't1';
>
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
>            created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
>  Tue, 17 Mar 2015 16:52:33 -0700 | T1   | TESTDB        | MY_SCHEMA   | TABLE |         |            | 0    | 0     | PUBLIC | 1              |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
>
> ALTER TABLE t1 RENAME TO tt1;
>
> SHOW TABLES LIKE 'tt1';
>
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
>            created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner  | retention_time |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
>  Tue, 17 Mar 2015 16:52:33 -0700 | TT1  | TESTDB        | MY_SCHEMA   | TABLE |         |            | 0    | 0     | PUBLIC | 1              |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------+----------------+
> ```

Change the order of the clustering key for an event table:

> ```sqlexample
> CREATE OR REPLACE TABLE T1 (id NUMBER, date TIMESTAMP_NTZ, name STRING) CLUSTER BY (id, date);
>
> SHOW TABLES LIKE 'T1';
>
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>            created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |    owner     | retention_time |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>  Tue, 21 Jun 2016 15:42:12 -0700 | T1   | TESTDB        | TESTSCHEMA  | TABLE |         | (ID,DATE)  | 0    | 0     | ACCOUNTADMIN | 1              |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>
> -- Change the order of the clustering key
> ALTER TABLE t1 CLUSTER BY (date, id);
>
> SHOW TABLES LIKE 'T1';
>
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>            created_on            | name | database_name | schema_name | kind  | comment | cluster_by | rows | bytes |    owner     | retention_time |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>  Tue, 21 Jun 2016 15:42:12 -0700 | T1   | TESTDB        | TESTSCHEMA  | TABLE |         | (DATE,ID)  | 0    | 0     | ACCOUNTADMIN | 1              |
> ---------------------------------+------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
> ```

The following example adds a row access policy on an event table while specifying a single column. After setting the policy, you can verify by checking
the [information schema](../../user-guide/security-row-intro.md).

> ```sqlexample
> ALTER TABLE t1
>   ADD ROW ACCESS POLICY rap_t1 ON (empl_id);
> ```

The following example adds a row access policy while specifying two columns in a single event table.

> ```sqlexample
> ALTER TABLE t1
>   ADD ROW ACCESS POLICY rap_test2 ON (cost, item);
> ```

The following example drops a row access policy from an event table. Verify the policies were dropped by querying the
[information schema](../../user-guide/security-row-intro.md).

> ```sqlexample
> ALTER TABLE t1
>   DROP ROW ACCESS POLICY rap_v1;
> ```

The following example shows how to combine adding and dropping row access policies in a single SQL statement for a table. Verify the
results by checking the [information schema](../../user-guide/security-row-intro.md).

> ```sqlexample
> alter table t1
>   drop row access policy rap_t1_version_1,
>   add row access policy rap_t1_version_2 on (empl_id);
> ```

---
title: ALTER TABLE … ALTER COLUMN
source: https://docs.snowflake.com/en/sql-reference/sql/alter-table-column.md
section: SQL Commands
---

# ALTER TABLE … ALTER COLUMN

This topic describes how to modify one or more column properties for a table using an `ALTER COLUMN` clause in a
[ALTER TABLE](alter-table.md) statement.

The following table describes the supported/unsupported actions for modifying column properties:

| Action | Supported | Unsupported | Notes |
| --- | --- | --- | --- |
| **Default Values** |  |  |  |
| Drop the default for a column (i.e. `DROP DEFAULT`). | ✔ |  | Not allowed if the column and default were defined by an ALTER TABLE command. For details, see the Usage Notes below. |
| Change the default sequence for a column (i.e. `SET DEFAULT seq_name.NEXTVAL`). | ✔ |  | Use only for columns that have a sequence already. |
| Change the default for a column, unless the default is a sequence. |  | ✔ |  |
| Add a default for a column. |  | ✔ |  |
| **Nullability** |  |  |  |
| Change the nullability of a column (i.e. `SET NOT NULL` or `DROP NOT NULL`). | ✔ |  |  |
| **Data Types** |  |  |  |
| Change a column [data type](../../sql-reference-data-types.md) to a synonymous type (for example, `STRING` to `VARCHAR`). | ✔ |  |  |
| Change a column [data type](../../sql-reference-data-types.md) to a different type (for example, `STRING` to `NUMBER`). |  | ✔ |  |
| Increase the length of a [text string column](../data-types-text.md) (for example, `VARCHAR(50)` to `VARCHAR(100)`). | ✔ |  |  |
| Decrease the length of a [text string column](../data-types-text.md) (for example, `VARCHAR(50)` to `VARCHAR(25)`). |  | ✔ |  |
| Increase the length of a [binary string column](../data-types-text.md) (for example, `BINARY(50)` to `BINARY(100)`). |  | ✔ |  |
| Decrease the length of a [binary string column](../data-types-text.md) (for example, `BINARY(50)` to `BINARY(25)`). |  | ✔ |  |
| Increase the precision of a [number column](../data-types-numeric.md) (for example, `NUMBER(10,2)` to `NUMBER(20,2)`). | ✔ |  |  |
| Decrease the precision of a [number column](../data-types-numeric.md) (for example, `NUMBER(20,2)` to `NUMBER(10,2)`). | ✔ |  | Only allowed if the new precision is sufficient to hold all values currently in the column. In addition, decreasing the precision can impact Time Travel (see Usage Notes for details). |
| Change the scale of a [number column](../data-types-numeric.md) (for example, `NUMBER(10,2)` to `NUMBER(10,4)`). |  | ✔ |  |
| **Comments** |  |  |  |
| Set or unset the comment for a column. | ✔ |  |  |
| **Masking Policy** |  |  |  |
| Set or unset a [masking policy](../../user-guide/security-column-intro.md) on a column. | ✔ |  |  |
| **Projection Policy** |  |  |  |
| Set or unset a [projection policy](../../user-guide/projection-policies.md) on a column. | ✔ |  |  |
| **Object Tagging** |  |  |  |
| Set or unset a [tag](../../user-guide/object-tagging/introduction.md) on a column | ✔ |  | A column can support up to 20 tags, and the maximum number of characters for a tag string value is 256. |

See also:
:   [ALTER TABLE](alter-table.md) , [CREATE TABLE](create-table.md) , [DROP TABLE](drop-table.md) , [SHOW TABLES](show-tables.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
ALTER TABLE <name> { ALTER | MODIFY } [ ( ]
                                              [ COLUMN ] <col1_name> DROP DEFAULT
                                            , [ COLUMN ] <col1_name> SET DEFAULT <seq_name>.NEXTVAL
                                            , [ COLUMN ] <col1_name> { [ SET ] NOT NULL | DROP NOT NULL }
                                            , [ COLUMN ] <col1_name> [ [ SET DATA ] TYPE ] <type>
                                            , [ COLUMN ] <col1_name> COMMENT '<string>'
                                            , [ COLUMN ] <col1_name> UNSET COMMENT
                                          [ , [ COLUMN ] <col2_name> ... ]
                                          [ , ... ]
                                      [ ) ]

ALTER TABLE <name> { ALTER | MODIFY } [ COLUMN ] dataGovnPolicyTagAction
```

## Usage notes

* A single ALTER TABLE statement can be used to modify multiple columns in a table. Each change is specified as a clause consisting of the column
  and column property to modify, separated by commas:

  > + Use either the `ALTER` or `MODIFY` keyword to initiate the list of clauses (i.e. columns/properties to modify) in the statement.
  > + Parentheses can be used for grouping the clauses, but are not required.
  > + The `COLUMN` keyword can be specified in each clause, but is not required.
  > + The clauses can be specified in any order.
* When setting a column to `NOT NULL`, if the column contains NULL values, an error is returned and no changes are applied to the column.
* Columns that use semi-structured data types (ARRAY, OBJECT, and VARIANT) cannot be set to `NOT NULL`, except when the table is empty. Setting these columns to `NOT NULL` when the table contains rows is not supported and results in an error.
* To change the default sequence for a column, the column must already have a default sequence. You cannot use the command
  `ALTER TABLE ... SET DEFAULT <seq_name>` to add a sequence to a column that does not already have a sequence.
* If you alter a table to add a column with a `DEFAULT` value, then you cannot drop the default value for that column.
  For example, in the following sequence of statements, the last `ALTER TABLE ... ALTER COLUMN` statement causes an error:

  ```sqlexample
  CREATE TABLE t(x INT);
  INSERT INTO t VALUES (1), (2), (3);
  ALTER TABLE t ADD COLUMN y INT DEFAULT 100;
  INSERT INTO t(x) VALUES (4), (5), (6);

  ALTER TABLE t ALTER COLUMN y DROP DEFAULT;
  ```

  This restriction prevents inconsistency between values in rows inserted before the column was added and
  rows inserted after the column was added. If the default were dropped, then the column would contain:

  + A NULL value for rows inserted before the column was added.
  + The default value for rows inserted after the column was added.

  Dropping the default column value from any clone of the table is also prohibited.
* When setting the `TYPE` for a column, the specified type (i.e. `type`) must be
  [NUMBER](../data-types-numeric.md) or a [text data type](../data-types-text.md) (VARCHAR,
  STRING, TEXT, etc.).

  + For the NUMBER data type, `TYPE` can be used to:

    - Increase the precision of the specified number column.
    - Decrease the precision of the specified number column if the new precision is sufficient to hold
      all data values currently in the column.
  + For text data types, `TYPE` can be used only to increase the length of the column.
* If the precision of a column is decreased below the maximum precision of any column data retained in Time Travel, you cannot restore the
  table without first increasing the precision.
* For [interactive tables](../../user-guide/interactive.md), currently the only clauses that you can
  use with the ALTER TABLE MODIFY COLUMN command are COMMENT and UNSET COMMENT.

* For masking policies:

  + The `USING` clause and the `FORCE` keyword are both optional; neither are required to set a masking policy on a column. The
    `USING` clause and the `FORCE` keyword can be used separately or together. For details, see:

    - [Apply a conditional masking policy on a column](../../user-guide/security-column-intro.md)
    - [Replace a masking policy on a column](../../user-guide/security-column-intro.md)
  + A single masking policy that uses conditional columns can be applied to multiple tables provided that the column structure of the table
    matches the columns specified in the policy.
  + When modifying one or more table columns with a masking policy or the table itself with a row access policy, use the
    [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the
    table protected by a row access policy.

* Regarding metadata (for example, the `COMMENT` field):

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Example setup:

> ```sqlexample
> CREATE OR REPLACE TABLE t1 (
>    c1 NUMBER NOT NULL,
>    c2 NUMBER DEFAULT 3,
>    c3 NUMBER DEFAULT seq1.nextval,
>    c4 VARCHAR(20) DEFAULT 'abcde',
>    c5 STRING);
>
> DESC TABLE t1;
>
> +------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------+
> | name | type              | kind   | null? | default                 | primary key | unique key | check | expression | comment |
> |------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------|
> | C1   | NUMBER(38,0)      | COLUMN | N     | NULL                    | N           | N          | NULL  | NULL       | NULL    |
> | C2   | NUMBER(38,0)      | COLUMN | Y     | 3                       | N           | N          | NULL  | NULL       | NULL    |
> | C3   | NUMBER(38,0)      | COLUMN | Y     | DB1.PUBLIC.SEQ1.NEXTVAL | N           | N          | NULL  | NULL       | NULL    |
> | C4   | VARCHAR(20)       | COLUMN | Y     | 'abcde'                 | N           | N          | NULL  | NULL       | NULL    |
> | C5   | VARCHAR(16777216) | COLUMN | Y     | NULL                    | N           | N          | NULL  | NULL       | NULL    |
> +------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------+
> ```

Make the following changes to `t1`:

> * Change NOT NULL column `c1` to NULL.
> * Drop the default for column `c2` and change the default sequence for column `c3`.
> * Increase the length of column `c4` and drop the default for the column.
> * Add a comment for column `c5`.
>
> ```sqlexample
> ALTER TABLE t1 ALTER COLUMN c1 DROP NOT NULL;
>
> ALTER TABLE t1 MODIFY c2 DROP DEFAULT, c3 SET DEFAULT seq5.nextval ;
>
> ALTER TABLE t1 ALTER c4 SET DATA TYPE VARCHAR(50), COLUMN c4 DROP DEFAULT;
>
> ALTER TABLE t1 ALTER c5 COMMENT '50 character column';
>
> DESC TABLE t1;
>
> +------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------------------+
> | name | type              | kind   | null? | default                 | primary key | unique key | check | expression | comment             |
> |------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------------------|
> | C1   | NUMBER(38,0)      | COLUMN | Y     | NULL                    | N           | N          | NULL  | NULL       | NULL                |
> | C2   | NUMBER(38,0)      | COLUMN | Y     | NULL                    | N           | N          | NULL  | NULL       | NULL                |
> | C3   | NUMBER(38,0)      | COLUMN | Y     | DB1.PUBLIC.SEQ5.NEXTVAL | N           | N          | NULL  | NULL       | NULL                |
> | C4   | VARCHAR(50)       | COLUMN | Y     | NULL                    | N           | N          | NULL  | NULL       | NULL                |
> | C5   | VARCHAR(16777216) | COLUMN | Y     | NULL                    | N           | N          | NULL  | NULL       | 50 character column |
> +------+-------------------+--------+-------+-------------------------+-------------+------------+-------+------------+---------------------+
> ```

Same as previous example, but with the following changes to illustrate the versatility/flexibility of the command:

> * All actions executed in a single `ALTER COLUMN` clause.
> * The order of the columns within the clause is different.
> * `SET DATA TYPE` shortened to simply `TYPE`.
>
> ```sqlexample
> ALTER TABLE t1 ALTER (
>    c1 DROP NOT NULL,
>    c5 COMMENT '50 character column',
>    c4 TYPE VARCHAR(50),
>    c2 DROP DEFAULT,
>    COLUMN c4 DROP DEFAULT,
>    COLUMN c3 SET DEFAULT seq5.nextval
>   );
> ```
>
> This example produces the same results.

Apply a Column-level Security masking policy to a table column:

> ```sqlexample
> -- single column
>
> ALTER TABLE empl_info MODIFY COLUMN empl_id SET MASKING POLICY mask_empl_id;
>
> -- multiple columns
>
> ALTER TABLE empl_info MODIFY
>     COLUMN empl_id SET MASKING POLICY mask_empl_id
>   , COLUMN empl_dob SET MASKING POLICY mask_empl_dob
> ;
> ```

Unset a Column-level Security masking policy from a table column:

> ```sqlexample
> -- single column
>
> ALTER TABLE empl_info modify column empl_id unset masking policy;
>
> -- multiple columns
>
> ALTER TABLE empl_info MODIFY
>     COLUMN empl_id UNSET MASKING POLICY
>   , COLUMN empl_dob UNSET MASKING POLICY
> ;
> ```

---
title: ALTER TAG
source: https://docs.snowflake.com/en/sql-reference/sql/alter-tag.md
section: SQL Commands
---

# ALTER TAG

Modifies the properties for an existing tag, including renaming the tag and setting a masking policy on a tag.

Any changes made to the tag go into effect when the next SQL query that uses the tag runs.

See also:
:   [CREATE TAG](create-tag.md) , [DROP TAG](drop-tag.md) , [SHOW TAGS](show-tags.md) , [UNDROP TAG](undrop-tag.md)

## Syntax

```sqlsyntax
ALTER TAG [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER TAG [ IF EXISTS ] <name> { ADD | DROP } ALLOWED_VALUES '<val_1>' [ , '<val_2>' [ , ... ] ]

ALTER TAG [ IF EXISTS ] <name> SET
  [ ALLOWED_VALUES '<val_1>' [ , '<val_2>' [ , ... ] ] ]
  [ PROPAGATE = { ON_DEPENDENCY_AND_DATA_MOVEMENT | ON_DEPENDENCY | ON_DATA_MOVEMENT }
    [ ON_CONFLICT = { '<string>' | ALLOWED_VALUES_SEQUENCE } ] ]
  [ COMMENT = '<string_literal>' ]

ALTER TAG [ IF EXISTS ] <name> UNSET { ALLOWED_VALUES | PROPAGATE | ON_CONFLICT | COMMENT }

ALTER TAG [ IF EXISTS ] <name> SET MASKING POLICY
  <masking_policy_name> [ , MASKING POLICY <masking_policy_2_name> , ... ] [ FORCE ]

ALTER TAG [ IF EXISTS ] <name> UNSET MASKING POLICY <masking_policy_name> [ , MASKING POLICY <masking_policy_2_name> , ... ]

ALTER TAG [ IF EXISTS ] <name> UNSET DCM PROJECT
```

## Parameters

`name`
:   Identifier for the tag. Assign the tag string value on an [object](../../user-guide/object-tagging/introduction.md) using either a
    [CREATE <object>](create.md) statement or an [ALTER <object>](alter.md) statement.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md)

`RENAME TO new_name`
:   Specifies the new identifier for the tag; must be unique for your schema. The new identifier cannot be used if the identifier is already
    in place for a different tag.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

`ALLOWED_VALUES 'val_1' [ , 'val_2' [ , ... ] ]`
:   Specifies a comma-separated list of the possible string values that can be assigned to the tag when the tag is set on an
    [object](../../user-guide/object-tagging/introduction.md) using the corresponding [CREATE <object>](create.md) or
    [ALTER <object>](alter.md) command.

    The maximum number of tag values in this list is 5,000.

    If you use the SET ALLOWED_VALUES clause, the specified values *replace* previously specified values, which allows you to adjust the
    sequence of values atomically.

    Using the DROP ALLOWED_VALUES clause to remove all values prevents someone from setting the tag to a value. If your intention is to let
    users set the tag to any value, use UNSET ALLOWED_VALUES instead.

    If a tag is configured to automatically propagate to target objects, the order of values in the allowed list can affect how conflicts are
    resolved. For more information, see [Tag propagation conflicts](../../user-guide/object-tagging/propagation.md).

    Default: NULL (all string values are allowed, including an empty string value (that is, `' '`)).

`PROPAGATE = { ON_DEPENDENCY_AND_DATA_MOVEMENT | ON_DEPENDENCY | ON_DATA_MOVEMENT }`
:   [Enterprise Edition Feature](../../user-guide/intro-editions.md)

    This parameter requires Enterprise Edition or higher. To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    Specifies that the tag will be [automatically propagated](../../user-guide/object-tagging/propagation.md) from source objects to target
    objects. You can configure the tag to propagate when there is an [object dependency](../../user-guide/object-tagging/propagation.md),
    [data movement](../../user-guide/object-tagging/propagation.md), or both.

    Changes to this parameter do not automatically propagate to target objects. These changes have no effect on tags that were previously
    applied to target objects as part of tag propagation.

    Possible values are:

    `ON_DEPENDENCY_AND_DATA_MOVEMENT`
    :   Propagates the tag when there is an object dependency or data movement.

    `ON_DEPENDENCY`
    :   Propagates the tag for object dependencies, but not for data movement.

    `ON_DATA_MOVEMENT`
    :   Propagates the tag when there is data movement, but not for object dependencies.

`ON_CONFLICT = { 'string' | ALLOWED_VALUES_SEQUENCE }`
:   [Enterprise Edition Feature](../../user-guide/intro-editions.md)

    This parameter requires Enterprise Edition or higher. To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    Specifies what happens when there is a conflict between the values of [propagated tags](../../user-guide/object-tagging/propagation.md).

    If you don’t set this parameter and there is a conflict, the value of the tag is set to the string `CONFLICT`.

    Changes to this parameter do not automatically propagate to target objects. These changes have no effect on tags that were previously
    applied to target objects as part of tag propagation.

    Possible values are:

    `'string'`
    :   When there is a conflict, the value of the tag is set to the specified string.

    `ALLOWED_VALUES_SEQUENCE`
    :   The order of the values in the ALLOWED_VALUES property of the tag determines which value is used when there is a conflict.
        For example, suppose you created a tag with the following statement:

        ```sqlexample
        CREATE TAG my_tag ALLOWED_VALUES 'blue', 'red' PROPAGATE = ON_DEPENDENCY;
        ```

        If there is a conflict, then the value of `my_tag` will be `blue` because it comes before `red` in the allowed values list.

    Default: Set the value of the tag to `CONFLICT`.

`MASKING POLICY masking_policy_name [ , MASKING POLICY masking_policy_2_name , ... ]`
:   Specifies a comma-separated list of [masking policies](../../user-guide/security-column-intro.md) that can be assigned to the tag.

`FORCE`
:   Replaces a masking policy that is currently set on a tag with a different masking policy in a single statement.

    Note that using the FORCE keyword replaces the masking policy when a policy of the same [data type](../../sql-reference-data-types.md) is
    already set on the tag.

    If a masking policy is not currently set on the tag, specifying this keyword has no effect.

    For details, see [Replace a masking policy on a tag](../../user-guide/tag-based-masking-policies.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the tag.

    Default: No value

`UNSET`
:   Specifies one (or more) properties and/or parameters to unset for the tag, which resets them to the defaults:

    * `ALLOWED_VALUES`
    * `PROPAGATE`
    * `ON_CONFLICT`
    * `COMMENT`

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the tag from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the tag and the DCM project without dropping the tag. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Tag | This privilege is required to modify tag properties (e.g. comment, allowed values).  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| APPLY MASKING POLICY | Account | Assigning and replacing a masking policy on a tag requires the global APPLY MASKING POLICY privilege. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on tag DDL and privileges, see [Access control privileges](../../user-guide/object-tagging/work.md).

## Usage notes

* For more information on tag DDL authorization, see [required privileges](../../user-guide/object-tagging/work.md).
* Regarding assigning one or more masking policies to a tag:

  + A tag can have only one masking policy per data type.

    For example, a tag can have one policy for the STRING data type, one policy for the NUMBER data type, and so on.
  + If a masking policy already protects a column and the tag with a masking policy is set on the same column, the masking policy
    directly assigned to the column takes precedence over the masking policy assigned to the tag.
  + A tag cannot be [dropped](drop-tag.md) if a masking policy is assigned to the tag, nor can the masking policy be
    dropped if the masking policy is assigned to a tag.
* Regarding replication, particularly with tag-based masking policies, see
  [policy replication considerations](../../user-guide/database-replication-considerations.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename the `cost_center` tag to `cost_center_na`, where `na` specifies North America.

> ```sqlexample
> ALTER TAG cost_center RENAME TO cost_center_na;
> ```

---
title: ALTER TASK
source: https://docs.snowflake.com/en/sql-reference/sql/alter-task.md
section: SQL Commands
---

# ALTER TASK

Modifies the properties for an existing task.

For information about tasks, see [Introduction to tasks](../../user-guide/tasks-intro.md).

See also:
:   [CREATE TASK](create-task.md) , [DROP TASK](drop-task.md) , [SHOW TASKS](show-tasks.md) , [DESCRIBE TASK](desc-task.md)

## Syntax

```sqlsyntax
ALTER TASK [ IF EXISTS ] <name> RESUME | SUSPEND

ALTER TASK [ IF EXISTS ] <name> REMOVE AFTER <string> [ , <string> , ... ]
  | ADD AFTER <string> [ , <string> , ... ]

ALTER TASK [ IF EXISTS ] <name> SET
  [ { WAREHOUSE = <string> }
    | { USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = <string> } ]
  [ SCHEDULE = { '<num> { HOURS | MINUTES | SECONDS }'
               | 'USING CRON <expr> <time_zone>' } ]
  [ CONFIG = <configuration_string> ]
  [ OVERLAP_POLICY = { NO_OVERLAP | ALLOW_CHILD_OVERLAP | ALLOW_ALL_OVERLAP } ]
  [ USER_TASK_TIMEOUT_MS = <num> ]
  [ SUSPEND_TASK_AFTER_NUM_FAILURES = <num> ]
  [ ERROR_INTEGRATION = <integration_name> ]
  [ SUCCESS_INTEGRATION = <integration_name> ]
  [ LOG_LEVEL = '<log_level>' ]
  [ COMMENT = <string> ]
  [ <session_parameter> = <value>
    [ , <session_parameter> = <value> ... ] ]
  [ TASK_AUTO_RETRY_ATTEMPTS = <num> ]
  [ USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS = <num> ]
  [ TARGET_COMPLETION_INTERVAL = '<num> { HOURS | MINUTES | SECONDS }' ]
  [ SERVERLESS_TASK_MIN_STATEMENT_SIZE= 'XSMALL | SMALL
    | MEDIUM | LARGE | XLARGE | XXLARGE' ]
  [ SERVERLESS_TASK_MAX_STATEMENT_SIZE= 'XSMALL | SMALL
    | MEDIUM | LARGE | XLARGE | XXLARGE' ]
  [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
  [ EXECUTE AS USER <user_name> ]

ALTER TASK [ IF EXISTS ] <name> UNSET
  [ WAREHOUSE ]
  [ SCHEDULE ]
  [ CONFIG ]
  [ OVERLAP_POLICY ]
  [ USER_TASK_TIMEOUT_MS ]
  [ SUSPEND_TASK_AFTER_NUM_FAILURES ]
  [ LOG_LEVEL ]
  [ COMMENT ]
  [ <session_parameter> [ , <session_parameter> ... ] ]
  [ TARGET_COMPLETION_INTERVAL ]
  [ SERVERLESS_TASK_MIN_STATEMENT_SIZE ]
  [ SERVERLESS_TASK_MAX_STATEMENT_SIZE ]
  [ CONTACT <purpose> [ , ... ]]
  [ EXECUTE AS USER ]
  [ DCM PROJECT ]

ALTER TASK [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>'
  [ , <tag_name> = '<tag_value>' ... ]

ALTER TASK [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER TASK [ IF EXISTS ] <name> SET FINALIZE = <string>

ALTER TASK [ IF EXISTS ] <name> UNSET FINALIZE

ALTER TASK [ IF EXISTS ] <name> MODIFY AS <sql>

ALTER TASK [ IF EXISTS ] <name> MODIFY WHEN <boolean_expr>

ALTER TASK [ IF EXISTS ] <name> REMOVE WHEN
```

## Parameters

`name`
:   Identifier for the task to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RESUME | SUSPEND`
:   Specifies the action to perform on the task:

    * `RESUME` brings a suspended task to the ‘Started’ state. Note that accounts are currently limited to a maximum of 30000 started
      tasks.

      Before resuming the root task of your Task Graph, resume all child tasks. To recursively resume the root task’s child
      tasks, use [SYSTEM$TASK_DEPENDENTS_ENABLE](../functions/system_task_dependents_enable.md).
    * `SUSPEND` puts the task into a ‘Suspended’ state.

    If the task schedule is set to an interval of (`number { HOURS | MINUTES | SECONDS }`), the *base interval time* for the schedule is reset to the current time the task is resumed.

    The base interval time starts the interval counter from the current clock time. For example, if an INTERVAL value of `10 MINUTES` is set and
    the task is resumed at 9:03 AM, then the task runs at 9:13 AM, 9:23 AM, and so on. Note that we only guarantee that tasks don’t execute
    *before* their set interval occurs. In the current example, the task could first run at 9:14 AM, but won’t run at 9:12 AM.

`REMOVE AFTER string [ , string , ... ]`
:   Specifies the names of one or more current predecessor tasks for this child task in a [task graph](../../user-guide/tasks-graphs.md).

    When all predecessors for a child task are removed, then the former child task becomes either a standalone task or a root task, depending on
    whether other tasks identify this former child task as their predecessor. If the former child task becomes a root task, this task is suspended
    by default and must be resumed manually.

`ADD AFTER string [ , string , ... ]`
:   Specifies the names of one or more existing tasks to add as predecessors for this child task in a [task graph](../../user-guide/tasks-graphs.md).
    Each child task in a task graph runs when all predecessor tasks finish their runs successfully. For more information, see the description
    of the `AFTER` parameter in [CREATE TASK](create-task.md).

    Each child task is limited to 100 predecessor tasks.

`SET ...`
:   Specifies either or both of the following:

    * One or more properties to set for the task, which are separated by blank spaces, commas, or new lines. For more details about the properties you
      can set, see [CREATE TASK](create-task.md).

      When you set the configuration on a task, you specify the default configuration string for the task. You can override the default configuration
      for a single execution by using the [EXECUTE TASK](execute-task.md) command.
    * A comma-separated list of session parameters to set for the session when the task runs. A task supports all session parameters. For the
      complete list, see [Parameters](../parameters.md).
    * `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
      :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

          The tag value is always a string, and the maximum number of characters for the tag value is 256.

          For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).
    * `CONTACT purpose = contact [ , purpose = contact ... ]`
      :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

          You cannot set the CONTACT property with other properties in the same statement.

`UNSET ...`
:   Specifies one (or more) properties and/or session parameters to unset for the task, which resets them to the defaults.

    You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by a
    comma. When resetting a property/parameter, specify only the name; specifying a value for the property/parameter will return an error.

    To detach a contact from the task, specify `UNSET CONTACT purpose`. You cannot unset the CONTACT property with other properties in
    the same statement.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the task from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the task and the DCM project without dropping the task. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

`sql`
:   Specifies the SQL code to execute when the task runs:

    * Single SQL statement
    * Call to a stored procedure
    * Procedural logic using [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md)

      Note that currently, Snowsight does not support creating or modifying tasks to use Snowflake Scripting.
      Instead, use SnowSQL or another command-line client.

    > **Note:**
    >
    > Verify that the SQL code you reference in a task executes as expected before you create the task. Tasks are intended to
    > automate SQL code that has already been tested thoroughly.

`WHEN boolean_expr`
:   Specifies a Boolean SQL expression. When a task is triggered, it validates the conditions of the expression to determine whether to
    execute. If the conditions of the expression are not met, then the task skips the current run. Any tasks that identify this task
    as a predecessor also do not run.

    Validating the conditions of the WHEN expression does not require a virtual warehouse. The validation is instead processed in the cloud
    services layer. A nominal charge accrues each time a task evaluates its WHEN condition and does not run. The charges accumulate each time
    the task is triggered until it runs. At that time, the charge is converted to Snowflake credits and added to the compute resource usage
    for the task run.

    Generally the compute time to validate the condition is insignificant compared to task execution time. As a best practice, align
    scheduled and actual task runs as closely as possible. Avoid task schedules that are wildly out of synch with actual task runs. For
    example, if data is inserted into a table with a stream roughly every 24 hours, don’t schedule a task that checks for stream data every
    minute. The charge to validate the WHEN expression with each run is generally insignificant, but the charges are cumulative.

    Note that daily consumption of cloud services that falls below the
    [10% quota of the daily usage of the compute resources](../../user-guide/cost-understanding-compute.md) accumulates no cloud services charges.

    Currently, the following functions are supported for evaluation in the SQL expression:

    [SYSTEM$STREAM_HAS_DATA](../functions/system_stream_has_data.md)
    :   Indicates whether a specified stream contains change tracking data. Used to run a triggered task if no schedule is defined for the
        task. You can also use this to skip the current task run if the stream contains no change data.

        If the result is `FALSE`, then the task does not run.

    [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](../functions/system_get_predecessor_return_value.md)
    :   Retrieves the return value for the predecessor task in a task graph.
        Used to decide whether the task should run based on the returned result.

`REMOVE WHEN`
:   Remove the `WHEN` condition that you have specified.

`EXECUTE AS USER user_name`
:   Runs the task on behalf of a specified user account. The user who runs the command must have permissions granted by using the [GRANT IMPERSONATE ON USER TO ROLE](grant-privilege-user.md) command.

    For more information, see [Run tasks with user privileges](../../user-guide/tasks-intro.md).

## Rename a task

Renaming a task isn’t supported. Instead, you can clone the task, and then drop the old task; for example:

1. Suspend the task (`ALTER TASK task_old_name SUSPEND`).
2. Clone the task, giving it a new name ([CREATE TASK new_task_name CLONE old_task_name](create-task.md)).
3. For task graphs, find dependent tasks that refer to the old task name, and update them to use the new name:

> 1. Find immediately dependent tasks (that is, child tasks and finalizer tasks, but not children-of-children tasks) using the [TASK_DEPENDENTS … RECURSIVE](../functions/task_dependents.md) function; for example:
>
>    ```sqlexample
>    SELECT * FROM TABLE(INFORMATION_SCHEMA.TASK_DEPENDENTS('old_task_name', RECURSIVE => false));
>    ```
> 2. Update each dependent task to use the new task name (`ALTER TASK child_task_1 ADD AFTER new_task_name`).

1. Drop the old version of the task ([DROP TASK old_task_name](drop-task.md)).
2. Resume the new version of the task (`ALTER TASK new_task_name RESUME`).

## Usage notes

* Resuming or suspending a task (using ALTER TASK … RESUME or ALTER TASK … SUSPEND, respectively) requires either the OWNERSHIP or
  OPERATE privilege on the task.

  When a task is resumed, Snowflake verifies that the role with the OWNERSHIP privilege on the task also has the USAGE privilege on the
  warehouse assigned to the task, as well as the global EXECUTE TASK privilege; if not, an error is produced.

  Only account administrators (users with the ACCOUNTADMIN role) can grant the EXECUTE TASK privilege to a role. For ease of use, we recommend
  creating a custom role (e.g. TASKADMIN) and assigning the EXECUTE TASK privilege to this role. Any role that can grant privileges
  (e.g. SECURITYADMIN or any role with the MANAGE GRANTS privilege) can then grant this custom role to any task owner role to allow altering
  their own tasks. For instructions for creating custom roles and role hierarchies, see [Configuring access control](../../user-guide/security-access-control-configure.md).
* Only the task owner — that is, the role with the OWNERSHIP privilege on the task — can set or unset properties on a task.
* To alter the default CONFIG, you must supply the entire replacement JSON string. You can’t update individual key-value pairs.
  To override the default configuration for a single execution, use the [EXECUTE TASK](execute-task.md).
* A standalone task must be suspended before it can be modified.
* The root task in a [task graph](../../user-guide/tasks-graphs.md) must be suspended before any
  task in the task graph is modified, a child task is suspended or resumed, or a child task is added (using ALTER TASK … AFTER).
* A task graph is limited to a maximum of 1000 tasks total (including the root task) in either a resumed or suspended state.
* To recursively resume all dependent tasks tied to a root task in a task graph, query the [SYSTEM$TASK_DEPENDENTS_ENABLE](../functions/system_task_dependents_enable.md)
  function rather than enabling each task individually (using ALTER TASK … RESUME).
* By default, a DML statement executed without explicitly starting a transaction is automatically committed on success or rolled back on failure
  at the end of the statement. This behavior is called *autocommit* and is controlled with the [AUTOCOMMIT](../parameters.md) parameter. This parameter
  must be set to TRUE. If the AUTOCOMMIT parameter is set to FALSE at the account level, then set the parameter to TRUE for the
  individual task (using ALTER TASK … SET AUTOCOMMIT = TRUE).
* When a task is suspended, any current run of the task (i.e. a run with an EXECUTING state in the [TASK_HISTORY](../functions/task_history.md)
  output) is completed. To abort the run of the specified task, execute the [SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS](../functions/system_user_task_cancel_ongoing_executions.md)
  function.
* The compute resources for individual runs of a task are either managed by Snowflake (i.e. the serverless compute model) or a
  user-specified virtual warehouse. To convert a task that relies on a warehouse to the serverless compute model, unset the
  `WAREHOUSE`.
* If a task fails with an unexpected error, you can receive a notification about the error.
  For more information about configuring task error notifications, see [Set up error notifications for tasks](../../user-guide/tasks-errors.md).
* The `OVERLAP_POLICY` parameter replaces the deprecated `ALLOW_OVERLAPPING_EXECUTION` parameter. For backward compatibility,
  `ALLOW_OVERLAPPING_EXECUTION = TRUE` maps to `OVERLAP_POLICY = ALLOW_CHILD_OVERLAP`, and
  `ALLOW_OVERLAPPING_EXECUTION = FALSE` maps to `OVERLAP_POLICY = NO_OVERLAP`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* Regarding the finalizer task:

  + When you `SET FINALIZE = <root task name>`, this function configures a normal task to be a finalizer task associated with the
    given root task.
  + When you `UNSET FINALIZE`, a finalizer task changes to a normal standalone task with no schedule or predecessor.
  + `SET FINALIZE` conflicts with `SET SCHEDULE` and `ADD AFTER`. A task with an existing schedule or predecessor will
    also fail the `SET FINALIZE` query.
  + To alter the root task’s defined finalizer task, first use `UNSET FINALIZE` to unset the finalizer task and then use
    `SET FINALIZE = <root task name>` to update the root task’s finalizer task.
  + The root task must be suspended before the finalizer task is modified, set, or unset.

  For more information, see [Finalizer task](../../user-guide/tasks-graphs.md).

## Examples

The following example initiates operation of a task:

```sqlexample
ALTER TASK mytask RESUME;
```

The following example converts a task to the serverless compute model and sets `xsmall` as the amount of compute resources to provision
for the first serverless runs of the task:

```sqlexample
ALTER TASK mytask UNSET WAREHOUSE;

ALTER TASK mytask SET USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL';
```

The following example sets the TIMEZONE and CLIENT_TIMESTAMP_TYPE_MAPPING session parameters for the session in which the task runs:

```sqlexample
ALTER TASK mytask SET TIMEZONE = 'America/Los_Angeles', CLIENT_TIMESTAMP_TYPE_MAPPING = TIMESTAMP_LTZ;
```

The following example sets a different schedule for a task:

```sqlexample
ALTER TASK mytask SET SCHEDULE = 'USING CRON */3 * * * * UTC';
```

The following example removes the current predecessor tasks for the `mytask` child task (`pred_task1`, `pred_task2`) and replace them
with a different predecessor task (`pred_task3`):

```sqlexample
ALTER TASK mytask REMOVE AFTER pred_task1, pred_task2;

ALTER TASK mytask ADD AFTER pred_task3;
```

The following example changes the SQL statement associated with a task. The task now queries the CURRENT_VERSION function when it runs:

```sqlexample
ALTER TASK mytask MODIFY AS SELECT CURRENT_VERSION();
```

The following example modifies the WHEN condition associated with a task. When triggered (on a schedule or after the predecessor task runs
successfully), the task now runs only when the `mystream` stream contains data:

```sqlexample
ALTER TASK mytask MODIFY WHEN SYSTEM$STREAM_HAS_DATA('MYSTREAM');
```

Update an existing task with new or replacement default configuration:

```sqlexample
ALTER TASK task_with_config SET
      CONFIG=$${"output_directory": "/temp/prod_directory/", "environment": "prod"}$$;
```

Remove the default configuration from an existing task:

```sqlexample
ALTER TASK task_with_config UNSET CONFIG;
```

---
title: ALTER TYPE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-type.md
section: SQL Commands
---

# ALTER TYPE

Modifies the properties for an existing [user-defined type](../data-types-user-defined.md).

See also:
:   [CREATE TYPE](create-type.md) , [DESCRIBE TYPE](desc-type.md) , [SHOW TYPES](show-types.md) , [DROP TYPE](drop-type.md) , [UNDROP TYPE](undrop-type.md)

## Syntax

```sqlsyntax
ALTER TYPE [ IF EXISTS ] <name> SET
  COMMENT = '<string_literal>'

ALTER TYPE [ IF EXISTS ] <name> UNSET COMMENT
```

## Parameters

`name`
:   Specifies the identifier for the user-defined type to alter.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies the properties to set for the user-defined type:

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the user-defined type.

`UNSET ...`
:   Specifies the properties to unset for the user-defined type, which resets them to the defaults.

    Currently, the only property you can unset is COMMENT, which removes the comment, if one exists, for the user-defined type.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | User-defined type | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Add a comment to the `age` user-defined type:

```sqlexample
ALTER TYPE age SET COMMENT = 'User-defined type for storing age values';
```

Remove the comment from the `age` user-defined type:

```sqlexample
ALTER TYPE age UNSET COMMENT;
```

---
title: ALTER USER
source: https://docs.snowflake.com/en/sql-reference/sql/alter-user.md
section: SQL Commands
---

# ALTER USER

Modifies the properties and object/session parameters for an existing user in the system:

* Administrators can use this command to alter properties and parameter defaults for any users for which the administrators have the
  appropriate privileges.
* Individual users can use this command to alter specific properties and any session parameter defaults for themselves. For more details, see
  Usage Notes (in this topic).

Can also be used to abort all queries (and other SQL statements) submitted by the user.

See also:
:   [CREATE USER](create-user.md) , [DROP USER](drop-user.md), [SHOW PARAMETERS](show-parameters.md), [SHOW USERS](show-users.md) , [DESCRIBE USER](desc-user.md)

## Syntax

```sqlsyntax
ALTER USER [ IF EXISTS ] [ <name> ] RENAME TO <new_name>

ALTER USER [ IF EXISTS ] [ <name> ] RESET PASSWORD

ALTER USER [ IF EXISTS ] [ <name> ] ABORT ALL QUERIES

ALTER USER [ IF EXISTS ] [ <name> ] ADD DELEGATED AUTHORIZATION OF ROLE <role_name> TO SECURITY INTEGRATION <integration_name>

ALTER USER [ IF EXISTS ] [ <name> ] REMOVE DELEGATED { AUTHORIZATION OF ROLE <role_name> | AUTHORIZATIONS } FROM SECURITY INTEGRATION <integration_name>

ALTER USER [ IF EXISTS ] [ <name> ] mfaActions

ALTER USER [ IF EXISTS ] [ <name> ] SET { AUTHENTICATION | PASSWORD | SESSION } POLICY <policy_name> [ FORCE ]

ALTER USER [ IF EXISTS ] [ <name> ] UNSET { AUTHENTICATION | PASSWORD | SESSION } POLICY

ALTER USER [ IF EXISTS ] [ <name> ] SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER USER [ IF EXISTS ] [ <name> ] UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER USER [ IF EXISTS ] [ <name> ] SET { [ objectProperties ] [ objectParams ] [ sessionParams ] }

ALTER USER [ IF EXISTS ] [ <name> ] UNSET { <object_property_name> | <object_param_name> | <session_param_name> } [ , ... ]
```

Where:

> ```sqlsyntax
> mfaActions ::=
>   {
>     ENROLL MFA
>     SET DEFAULT_MFA_METHOD = { PASSKEY | TOTP | DUO }
>     REMOVE MFA METHOD <mfa_method>
>     MODIFY MFA METHOD <mfa_method> SET COMMENT = '<string>'
>     ADD MFA METHOD OTP [ COUNT = <number> ]
>   }
> ```
>
> ```sqlsyntax
> objectProperties ::=
>     PASSWORD = '<string>'
>     LOGIN_NAME = <string>
>     DISPLAY_NAME = <string>
>     FIRST_NAME = <string>
>     MIDDLE_NAME = <string>
>     LAST_NAME = <string>
>     EMAIL = <string>
>     MUST_CHANGE_PASSWORD = TRUE | FALSE
>     DISABLED = TRUE | FALSE
>     ALLOWED_INTERFACES = ( <list_of_interfaces> )
>     DAYS_TO_EXPIRY = <integer>
>     MINS_TO_UNLOCK = <integer>
>     DEFAULT_WAREHOUSE = <string>
>     DEFAULT_NAMESPACE = <string>
>     DEFAULT_ROLE = <string>
>     DEFAULT_SECONDARY_ROLES = ( 'ALL' )
>     MINS_TO_BYPASS_MFA = <integer>
>     DISABLE_MFA = TRUE | FALSE
>     RSA_PUBLIC_KEY = <string>
>     RSA_PUBLIC_KEY_FP = <string>
>     RSA_PUBLIC_KEY_2 = <string>
>     RSA_PUBLIC_KEY_2_FP = <string>
>     TYPE = PERSON | SERVICE | LEGACY_SERVICE
>     WORKLOAD_IDENTITY = ( <list_of_properties> )
>     COMMENT = '<string>'
> ```
>
> ```sqlsyntax
> objectParams ::=
>     ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR = TRUE | FALSE
>     ENABLE_UNREDACTED_SECURE_OBJECT_ERROR = TRUE | FALSE
>     NETWORK_POLICY = <string>
>     PREVENT_UNLOAD_TO_INLINE_URL = TRUE | FALSE
>     PREVENT_UNLOAD_TO_INTERNAL_STAGES = TRUE | FALSE
> ```
>
> ```sqlsyntax
> sessionParams ::=
>     ABORT_DETACHED_QUERY = TRUE | FALSE
>     AUTOCOMMIT = TRUE | FALSE
>     BINARY_INPUT_FORMAT = <string>
>     BINARY_OUTPUT_FORMAT = <string>
>     DATE_INPUT_FORMAT = <string>
>     DATE_OUTPUT_FORMAT = <string>
>     DEFAULT_NULL_ORDERING = <string>
>     ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS = TRUE | FALSE
>     ENABLE_NOTEBOOK_CREATION_IN_PERSONAL_DB = TRUE | FALSE
>     ERROR_ON_NONDETERMINISTIC_MERGE = TRUE | FALSE
>     ERROR_ON_NONDETERMINISTIC_UPDATE = TRUE | FALSE
>     JSON_INDENT = <num>
>     LOCK_TIMEOUT = <num>
>     OPT_OUT_ERROR_LOGGING = TRUE | FALSE
>     QUERY_TAG = <string>
>     ROWS_PER_RESULTSET = <num>
>     S3_STAGE_VPCE_DNS_NAME = <string>
>     SEARCH_PATH = <string>
>     SIMULATED_DATA_SHARING_CONSUMER = <string>
>     STATEMENT_TIMEOUT_IN_SECONDS = <num>
>     STRICT_JSON_OUTPUT = TRUE | FALSE
>     TIMESTAMP_DAY_IS_ALWAYS_24H = TRUE | FALSE
>     TIMESTAMP_INPUT_FORMAT = <string>
>     TIMESTAMP_LTZ_OUTPUT_FORMAT = <string>
>     TIMESTAMP_NTZ_OUTPUT_FORMAT = <string>
>     TIMESTAMP_OUTPUT_FORMAT = <string>
>     TIMESTAMP_TYPE_MAPPING = <string>
>     TIMESTAMP_TZ_OUTPUT_FORMAT = <string>
>     TIMEZONE = <string>
>     TIME_INPUT_FORMAT = <string>
>     TIME_OUTPUT_FORMAT = <string>
>     TRANSACTION_DEFAULT_ISOLATION_LEVEL = <string>
>     TWO_DIGIT_CENTURY_START = <num>
>     UNSUPPORTED_DDL_ACTION = <string>
>     USE_CACHED_RESULT = TRUE | FALSE
>     WEEK_OF_YEAR_POLICY = <num>
>     WEEK_START = <num>
> ```

> **Note:**
>
> For readability, the complete list of session parameters that can be set for a user is not included here. For a complete list of all session
> parameters, with their descriptions, as well as account and object parameters, see [Parameters](../parameters.md).

## Parameters

`name`
:   Specifies the identifier for the user to alter. If the identifier contains spaces or special characters, the entire string must be enclosed in
    double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is omitted, the statement modifies the active (i.e. logged in) user. The restrictions described in Usage Notes (in
    this topic) apply.

`RENAME TO new_name`
:   Specifies the new identifier for the user; must be unique for your account.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`RESET PASSWORD`
:   Generates a URL, which you can share with the user, that opens a web page from which the user can enter a new password. The generated URL is
    valid for a single use and expires after 4 hours.

    Note that specifying this parameter does not invalidate the user’s current password. The user can continue to use their current password
    until they reset it through the URL.

    If you wish to invalidate their current password, use `SET PASSWORD = 'string'` instead, which changes their password to a new value.

`ABORT ALL QUERIES`
:   Aborts all the queries and other SQL statements currently running or scheduled by the user, regardless of the warehouse on which the queries
    are running/scheduled.

    Note that the user can still log into Snowflake and initiate new queries.

    If you want to abort all running/scheduled queries and prevent the user from logging into Snowflake or initiating new queries, specify
    `SET DISABLED = TRUE` instead.

`ADD DELEGATED AUTHORIZATION OF ROLE role_name TO SECURITY INTEGRATION integration_name;`
:   Adds user consent to initiate a session using a specified role for a particular integration.

    For more details, see [Adding Delegated Authorizations for OAuth User Consent](../../user-guide/oauth-consent.md).

`REMOVE DELEGATED AUTHORIZATION OF ROLE role_name FROM SECURITY INTEGRATION integration_name` , . `REMOVE DELEGATED AUTHORIZATIONS FROM SECURITY INTEGRATION integration_name`
:   Revokes consent for the user:

    * The first syntax revokes consent for a specified security integration for a specified role. This has the effect of revoking any OAuth
      access token associated with the integration and specific role.
    * The second syntax revokes all consent from a specified security integration. This has the effect of revoking any OAuth access token
      associated with the integration.

    For more details, see:

    * [Configure Snowflake OAuth for partner applications](../../user-guide/oauth-partner.md)
    * [Configure Snowflake OAuth for custom clients](../../user-guide/oauth-custom.md)

`{ AUTHENTICATION | PASSWORD | SESSION } POLICY policy_name [ FORCE ]`
:   Specifies one of the following policies for the user:

    * [Authentication policy](../../user-guide/authentication-policies.md)
    * [Password policy](../../user-guide/password-authentication.md)
    * [Session policy](../../user-guide/session-policies.md)

    If you already set a policy on the user, then you can specify FORCE to set the new policy without needing to
    unset the existing policy first.

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Object properties (`objectProperties`)

`SET property_name = property_value [ ... ]` , . `UNSET property_name [ , ... ]`
:   Specifies one (or more) object properties to set or unset for the use. Unsetting an object property resets it back to the default.

    `TYPE = { PERSON | SERVICE | LEGACY_SERVICE | NULL }`
    :   Alters the type of user. You can set this property to differentiate between human, service, and legacy service users. For information
        about the characteristics of these types of users, see [Types of users](../../user-guide/admin-user-management.md).

        `PERSON`
        :   A user who is a human user that interacts with Snowflake.

        `SERVICE`
        :   A user that is a service or application that interacts with Snowflake without human intervention.

            If a user has their `TYPE` property set to `SERVICE` using the ALTER USER command, then incompatible properties remain
            stored, but are not returned by commands such as DESCRIBE USER. The incompatible properties cannot be set using
            the ALTER USER command.

            If a user, with their `TYPE` property set to `SERVICE`, is changed to a user with their `TYPE` property set to
            `PERSON`, the incompatible properties are restored and can be changed, including their `PASSWORD` property.

        `LEGACY_SERVICE`
        :   A user with their `TYPE` property set to `LEGACY_SERVICE` represents a non-interactive integration. It is similar to
            `SERVICE`, but allows password and SAML authentication.

            > **Note:**
            >
            > The LEGACY_SERVICE type is being deprecated. Use the SERVICE type for services and applications. For a timeline of the deprecation of
            > LEGACY_SERVICE, see [Planning for the deprecation of single-factor password sign-ins](../../user-guide/security-mfa-rollout.md).

        `NULL`
        :   Functions the same as `PERSON`. You can’t set the `TYPE` property as `NULL` for an existing user.

    `ALLOWED_INTERFACES = ( {  'ALL' | 'interface' [ , ... ] } )`
    :   Specifies which Snowflake interfaces the user can access.

        If you specify `('ALL')`, the user can access Snowsight and all other interfaces that can be
        specified for this property. If you specify one or more interfaces, the user can only access the interfaces
        specified and can’t interact with any Snowflake data outside of the interfaces specified.

        For `interface`, you can specify one or more of the following values in a comma-delimited list:

        > `SNOWFLAKE_INTELLIGENCE`
        > :   The user can access [Snowflake Intelligence](../../user-guide/snowflake-cortex/snowflake-intelligence.md).
        >
        > `STREAMLIT`
        > :   The user can access Streamlit apps through the app-viewer URLs.

    `DISABLE_MFA = { TRUE | FALSE }`
    :   The effect of this parameter depends on whether the user voluntarily enrolled in MFA or was required to enroll.

        * If the user is subject to an authentication policy that requires them to use MFA, setting this parameter to TRUE clears the MFA
          methods for the user. The next time the user signs in, they are prompted to add a new MFA method that they can use as a second factor
          of authentication.
        * If the user voluntarily enrolled in MFA, setting this parameter to TRUE allows the password user to authenticate without a second
          factor of authentication.

    `WORKLOAD_IDENTITY = ( list_of_properties )`
    :   Configures the user to authenticate by using [workload identity federation](../../user-guide/workload-identity-federation.md).

        The following list shows the properties:

        `TYPE = { AWS | AZURE | GCP | OIDC }`
        :   Specifies the provider that issues the attestation that is sent by the application or workload to Snowflake.

        `ARN = 'string'`
        :   Required for `TYPE=AWS`. Not valid for other types.

            Specifies the Amazon Resource Identifier (ARN) that uniquely identifies the AWS user or role that is associated with the instance
            authenticating to Snowflake. Snowflake accepts the following forms of [IAM identifiers](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_identifiers.html):

            * `arn:aws:iam::account:user/user_name_with_path`
            * `arn:aws:iam::account:role/role_name_with_path`
            * `arn:aws:sts::account:assumed_role/role_name/role_session_name`

            For help obtaining the ARN, see [Configure Snowflake](../../user-guide/workload-identity-federation.md).

        `ISSUER = 'string'`
        :   Required for `TYPE=AZURE` and `TYPE=OIDC`. Not valid for other types.

            * For `TYPE=AZURE`, specifies the Entra ID tenant’s Authority URL in the following form:

              `https://login.microsoftonline.com/tenant/v2.0`

              For help obtaining this URL, see [Configure Microsoft Azure](../../user-guide/workload-identity-federation.md).
            * For `TYPE=OIDC`, specifies the OpenID Connect (OIDC) issuer URL. An OIDC provider is identified by its issuer URL.

              For examples of how to obtain this issuer URL for different OIDC providers, [Use cases](../../user-guide/workload-identity-federation.md).

        `SUBJECT = 'string'`
        :   Required for `TYPE=AZURE`, `TYPE=GCP`, and `TYPE=OIDC`. Not valid for other types.

            * For `TYPE=AZURE`, specifies the case-sensitive Object ID (Principal ID) of the managed identity assigned to the Azure workload.
            * For `TYPE=GCP`, specifies the `uniqueId` property of the service account associated with the workload that is connecting to
              Snowflake.

              For help obtaining this identifier, see [Configure Snowflake](../../user-guide/workload-identity-federation.md).
            * For `TYPE=OIDC`, specifies the identifier of the workload that is connecting to Snowflake. The format of the value is specific to the
              OIDC provider that is issuing the attestation.

              For examples of how to construct the subject of an attestation issued by an OIDC provider, see [Use cases](../../user-guide/workload-identity-federation.md).

        `OIDC_AUDIENCE_LIST = ( 'string' [ , 'string' ... ] )`
        :   Optional for `TYPE=OIDC`. Not valid for other types.

            Specifies which values must be present in the `aud` claim of the ID token issued by the OIDC provider. Snowflake
            accepts the attestation if the `aud` claim contains at least one of the specified audiences.

            If omitted or empty, the audience is assumed to be `snowflakecomputing.com`.

For information about the other object properties you can set (for example, PASSWORD, LOGIN_NAME, DEFAULT_ROLE), see [CREATE USER](create-user.md).

Refer to Usage Notes (in this topic) for more general details about setting and unsetting properties.

## Object parameters (`objectParams`)

`SET ...`
:   Specifies one (or more) parameters to set for the user (separated by blank spaces, commas, or new
    lines):

    `ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR = { TRUE | FALSE }`
    :   Controls how queries that fail due to syntax or parsing errors show up in a query history. If FALSE, the contents of a
        failed query is redacted from the views, pages, and functions that provide a query history.

        This parameter controls behavior for the user viewing the query history, not the user who executed the query.

        Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR parameter.

    `ENABLE_UNREDACTED_SECURE_OBJECT_ERROR = { TRUE | FALSE }`
    :   Controls whether error messages related to secure objects are redacted in metadata. For more information about
        error message redaction for secure objects, see [Secure objects: Redaction of information in error messages](../../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

        Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_SECURE_OBJECT_ERROR parameter.

        When using the ALTER USER command to set the parameter to `TRUE` for a particular user, modify the user that you want to see the
        redacted error messages in metadata, not the user who caused the error.

    `NETWORK_POLICY = string`
    :   Specifies the [network policy](../../user-guide/network-policies.md) that is active for the user.

Also, see Usage Notes (in this topic) for more general details about setting and unsetting parameters.

`UNSET ...`
:   Specifies the properties to unset for the user, which resets them to the defaults.

    * `NETWORK_POLICY`
    * `AUTHENTICATION POLICY`
    * `PASSWORD POLICY`
    * `SESSION POLICY`
    * `TAG tag_name [ , tag_name ... ]`

## Session parameters (`sessionParams`)

`SET session_param_name = param_value [ ... ]` , . `UNSET session_param_name [ , ... ]`
:   Specifies one (or more) session parameters to set or unset for the user. Unsetting a session parameter resets it back to the default.

For more details about the session parameters you can set (ABORT_DETACHED_SESSION, AUTOCOMMIT, etc.), see [Parameters](../parameters.md).

Also, see Usage Notes (in this topic) for more general details about setting and unsetting parameters.

## Multi-factor authentication (MFA) actions (mfaActions)

`user ENROLL MFA`
:   Enrolls the specified user in multi-factor authentication (MFA) and prompts them to add a second factor of authentication.

    * If the user has a verified email, Snowflake sends an email prompting them to add an MFA authentication method.
    * If the user does not have a verified email, Snowflake returns the URL of a page that prompts the user to add an MFA authentication method.

`SET DEFAULT_MFA_METHOD = { PASSKEY | TOTP | DUO }`
:   If the current user has more than one MFA method, specifies which one will be used as the second factor of authentication.

`user REMOVE MFA METHOD mfa_method`
:   Removes an MFA method that the specified user previously set up. The user can no longer use the MFA method as a second factor of
    authentication.

    To obtain the identifier for `mfa_method`, execute the [SHOW MFA METHODS](show-mfa-methods.md) command and find the value in
    the `name` column.

`[ user ] MODIFY MFA METHOD mfa_method SET COMMENT = 'string'`
:   Sets a descriptive name for the specified MFA method.

    To obtain the identifier for `mfa_method`, execute the [SHOW MFA METHODS](show-mfa-methods.md) command and find the value in
    the `name` column.

    Users can omit `user` to set a descriptive name for their own MFA methods.

`ADD MFA METHOD OTP [ COUNT = number ]`
:   Generates one-time passcodes (OTPs) that highly privileged users can use to authenticate when other authentication methods are unavailable.

    `COUNT` controls how many OTPs are generated. If omitted, one OTP is generated. The maximum is 10 OTPs.

    For more information, see [Setting up administrators for break glass access](../../user-guide/security-mfa.md).

## Usage notes

* Only the role with the OWNERSHIP privilege on the user, or a higher role, can execute this command to modify most user properties.

  > **Tip:**
  >
  > When changing a user’s password using `SET PASSWORD = 'string'`, we recommend also specifying `MUST_CHANGE_PASSWORD = TRUE`
  > to force the user to log into the web interface and change their password before they can log into Snowflake through any other interface
  > (e.g. SnowSQL or another client application).
  >
  > Alternatively, use `RESET PASSWORD` to generate a URL to a web page that the user can access to change their password.
* Only users with the ACCOUNTADMIN role can set the following parameters:

  + `PREVENT_UNLOAD_TO_INLINE_URL`
  + `PREVENT_UNLOAD_TO_INTERNAL_STAGES`
* Individual users can execute the ALTER USER command on themselves (i.e. by specifying their user name/identifier in the command) and change
  the following:

  > + `DEFAULT_WAREHOUSE`
  > + `DEFAULT_NAMESPACE`
  > + `DEFAULT_ROLE`
  > + Any of their session parameter defaults

  Note that users can not use this command to change their password. For security reasons, Snowflake only allows users to change
  their passwords from within the web interface.

  However, an administrator with the appropriate privileges can use this command with `SET PASSWORD = 'string'` to change the password
  for a user.

  > **Tip:**
  >
  > When changing a user’s password, we recommend also specifying `MUST_CHANGE_PASSWORD = TRUE` to force the user to log into the web
  > interface and change their password before they can log into Snowflake through any other interface (e.g. SnowSQL or another client application).
  >
  > Alternatively, use `RESET PASSWORD` to generate a URL to a web page that the user can access to change their password.
* An ALTER USER statement does not verify that default objects (`DEFAULT_WAREHOUSE`, `DEFAULT_NAMESPACE`,
  and `DEFAULT_ROLE`) exist. Note that `DEFAULT_SECONDARY_ROLES` does not accept an object name as the value, but an ALTER
  USER statement does verify that a supported value is specified.
* You can set and unset multiple object properties and object/session parameters with a single ALTER statement:

  > + When setting multiple properties/parameters, separate them with blank spaces, commas, or new lines.
  > + When unsetting multiple properties/parameters, they must be separated by a comma. Also, when unsetting a property/parameter,
  >   specify only the name; specifying a value for the property/parameter will return an error.
* If there is a conflict between a local user object and an [organization user](../../user-guide/organization-users.md), a user that
  corresponds to the organization user is automatically created when you rename the local user.
* If you specify `SET DISABLED = TRUE` for a user:

  > + All queries and other SQL statements currently running or scheduled by the user are aborted and the user cannot initiate additional queries.
  > + The user is locked out of Snowflake and cannot log in again.

  If you only want to abort all running and scheduled queries/statements for a user, use `ABORT ALL QUERIES` instead.
* If the user’s `TYPE` property is `SERVICE`, the following commands cannot be used:

  + ALTER USER RESET PASSWORD
  + ALTER USER SET DISABLE_MFA = TRUE
* If you run an ALTER USER … UNSET TYPE command, the `TYPE` property is set to `PERSON`.
* If you run an ALTER USER … SET TYPE=NULL command, the `TYPE` property is set to `PERSON`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Rename `user1` to `user2`:

> ```sqlexample
> ALTER USER user1 RENAME TO user2;
> ```

Set the password for a user named `user1` to `H8MZRqa8gEe/kvHzvJ+Giq94DuCYoQXmfbb$Xnt` and require the user to change their password
by logging into the Snowflake web interface:

> ```sqlexample
> ALTER USER user1 SET PASSWORD = 'H8MZRqa8gEe/kvHzvJ+Giq94DuCYoQXmfbb$Xnt' MUST_CHANGE_PASSWORD = TRUE;
> ```

Change the [type of user](../../user-guide/admin-user-management.md) to an application that interacts with Snowflake programmatically:

> ```sqlexample
> ALTER USER user1 SET TYPE = SERVICE;
> ```

Remove an existing comment from a user:

> ```sqlexample
> ALTER USER user1 UNSET COMMENT;
> ```

Activate no secondary roles by default:

> ```sqlexample
> ALTER USER user1 SET DEFAULT_SECONDARY_ROLES = ();
> ```

Activate all secondary roles by default:

> ```sqlexample
> ALTER USER user1 UNSET DEFAULT_SECONDARY_ROLES;
> ```

OR

> ```sqlexample
> ALTER USER user1 SET DEFAULT_SECONDARY_ROLES = ('ALL');
> ```

---
title: ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-user-add-programmatic-access-token.md
section: SQL Commands
---

# ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)

Creates a [programmatic access token](../../user-guide/programmatic-access-tokens.md) for a user.

See also:
:   [ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-modify-programmatic-access-token.md) ,
    [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-rotate-programmatic-access-token.md) ,
    [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-remove-programmatic-access-token.md) ,
    [SHOW USER PROGRAMMATIC ACCESS TOKENS](show-user-programmatic-access-tokens.md)

## Syntax

```sqlsyntax
ALTER USER [ IF EXISTS ] [ <username> ] ADD { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name>
  [ ROLE_RESTRICTION = '<string_literal>' ]
  [ DAYS_TO_EXPIRY = <integer> ]
  [ MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT = <integer> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`ADD { PROGRAMMATIC ACCESS TOKEN | PAT } token_name`
:   Creates a programmatic access token with the specified name.

    You can use the keyword PAT as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKEN.

## Optional parameters

`username`
:   The name of the user that the token is associated with. A user cannot use another user’s programmatic access token to
    authenticate.

    To create programmatic access tokens on behalf of a user, administrators must specify the name of that user in the ALTER USER
    command.

    If `username` is omitted, the command generates a programmatic access token for the user who is currently logged in (the
    active user of this session).

`ROLE_RESTRICTION = 'string_literal'`
:   The name of the role used for privilege evaluation and object creation. This must be one of the roles that has already been
    granted to the user.

    > **Note:**
    >
    > This parameter is required if the user is a service user (if the USER object has TYPE=SERVICE).

    When you use this token for authentication, any objects that you create are owned by this role, and this role is used for
    privilege evaluation.

    > **Note:**
    >
    > Secondary roles are not used, even if [DEFAULT_SECONDARY_ROLES](create-user.md) is set to
    > (‘ALL’) for the user.

    If this role is revoked from the user associated with the programmatic access token, any attempts to use the token for
    authentication will fail.

    > **Note:**
    >
    > Specifying a role as the ROLE_RESTRICTION value does not grant the specified role to the programmatic access token. The user
    > must have already been granted this role.

    If you omit ROLE_RESTRICTION, any objects that you create owned by your primary role, and privileges are evaluated against
    your primary and secondary roles (as explained in [Authorization through primary role and secondary roles](../../user-guide/security-access-control-overview.md)).

`DAYS_TO_EXPIRY = integer`
:   The number of days that the programmatic access token can be used for authentication.

    You can specify a value ranging from `1` to the [maximum expiration time](../../user-guide/programmatic-access-tokens.md).

    Default: `15`

`MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT = integer`
:   The number of minutes during which a user can use this token to access Snowflake without being subject to an active
    [network policy](../../user-guide/network-policies.md).

    You can set this for a token for a person (if the USER object has TYPE=PERSON) if that person is not subject to a network policy
    but needs to use a programmatic access token for authentication. See [Network policy requirements](../../user-guide/programmatic-access-tokens.md).

    > **Note:**
    >
    > Setting MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT does not allow users to bypass the network policy itself.

    You can set this to a value in the range of `1` to `1440` (1 day).

    Default: `0`

`COMMENT = 'string_literal'`
:   Descriptive comment about the programmatic access token. This comment is displayed in the
    [list of programmatic access tokens](../../user-guide/programmatic-access-tokens.md) in Snowsight.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | User | Required only when generating a programmatic access token for a user other than yourself. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides information about the newly generated programmatic access token in the following columns:

| Column | Description |
| --- | --- |
| `token_name` | Name of the generated token. |
| `token_secret` | The token itself. Use this to authenticate to an endpoint.  **Note:** The token only appears in the output of the ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN command. No other SQL command or function prints out or returns the token.  If you need to access this token programmatically, you can use [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) to execute this command and retrieve the token from the [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md). |

## Usage notes

* Each user can have a maximum of 15 programmatic access tokens.

  + This number includes [tokens that have been disabled](../../user-guide/programmatic-access-tokens.md).
  + This number does not include tokens that have expired.

## Examples

Create a programmatic access token named `example_token` that is associated with the user `example_user`, and inherits all
privileges from the associated user:

```sqlexample
ALTER USER IF EXISTS example_user ADD PROGRAMMATIC ACCESS TOKEN example_token
  COMMENT = 'a reference example';
```

Create a programmatic access token named `example_token` that is associated with the user `example_user`, inherits all
privileges from the role `example_role`, and expires after 15 days:

```sqlexample
ALTER USER IF EXISTS example_user ADD PROGRAMMATIC ACCESS TOKEN example_token
  ROLE_RESTRICTION = 'example_role'
  DAYS_TO_EXPIRY = 15;
```

---
title: ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-user-modify-programmatic-access-token.md
section: SQL Commands
---

# ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)

Changes the name of a [programmatic access token](../../user-guide/programmatic-access-tokens.md) or a property of the token.

> **Note:**
>
> You cannot modify or rename a programmatic access token in a session where you used a programmatic access token for
> authentication.

See also:
:   [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-add-programmatic-access-token.md) ,
    [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-rotate-programmatic-access-token.md) ,
    [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-remove-programmatic-access-token.md) ,
    [SHOW USER PROGRAMMATIC ACCESS TOKENS](show-user-programmatic-access-tokens.md)

## Syntax

```sqlsyntax
ALTER USER [ IF EXISTS ] [ <username> ] MODIFY { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name>
  RENAME TO <new_token_name>

ALTER USER [ IF EXISTS ] [ <username> ] MODIFY { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name> SET
  [ DISABLED = { TRUE | FALSE } ]
  [ MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT = <integer> ]
  [ COMMENT = '<string_literal>' ]

ALTER USER [ IF EXISTS ] [ <username> ] MODIFY { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name> UNSET
  [ DISABLED ]
  [ MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT ]
  [ COMMENT ]
```

## Parameters

`username`
:   The name of the user that the token is associated with.

    If `username` is omitted, the command modifies the programmatic access token for the user who is currently logged in
    (the active user of this session).

`MODIFY { PROGRAMMATIC ACCESS TOKEN | PAT } token_name`
:   Modifies a programmatic access token with the specified name.

    You can use the keyword PAT as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKEN.

`RENAME TO new_token_name`
:   Specifies a new name for a programmatic access token.

`SET ...`
:   Specifies one (or more) properties to set for the programmatic access token (separated by blank spaces, commas, or new lines).

    `DISABLED = { TRUE | FALSE }`
    :   Disables or enables the programmatic access token.

        If a user is disabled or Snowflake locks a user, the programmatic tokens associated with that user are disabled automatically.
        If the user is subsequently enabled or Snowflake unlocks the user, the programmatic access tokens remain disabled. To enable
        the tokens again, set DISABLED to FALSE.

        For information, see [Re-enabling a disabled programmatic access token](../../user-guide/programmatic-access-tokens.md).

    `MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT = integer`
    :   The number of minutes during which a user can use this token to access Snowflake without being subject to an active
        [network policy](../../user-guide/network-policies.md).

        You can set this for a token for a person (if the USER object has TYPE=PERSON) if that person is not subject to a network policy
        but needs to use a programmatic access token for authentication. See [Network policy requirements](../../user-guide/programmatic-access-tokens.md).

        > **Note:**
        >
        > Setting MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT does not allow users to bypass the network policy itself.

        You can set this to a value in the range of `1` to `1440` (1 day).

        Default: `0`

    `COMMENT = 'string_literal'`
    :   Descriptive comment about the programmatic access token. This comment is displayed in the
        [list of programmatic access tokens](../../user-guide/programmatic-access-tokens.md) in Snowsight.

`UNSET ...`
:   Unsets one or more specified properties or parameters for the programmatic access token, which resets the properties to their
    defaults:

    * `DISABLED`
    * `MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT`
    * `COMMENT`

    To unset multiple properties or parameters with a single ALTER statement, separate each property or parameter with a comma.

    When unsetting a property or parameter, specify only the property or parameter name (unless the syntax above indicates that you
    should specify the value). Specifying the value returns an error.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | User | Required only when modifying a programmatic access token for a human user other than yourself or a service user. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

You cannot modify or rename a programmatic access token in a session where you used a programmatic access token for
authentication.

## Examples

Change the name of a programmatic access token associated with the user `example_user`:

```sqlexample
ALTER USER IF EXISTS example_user MODIFY PROGRAMMATIC ACCESS TOKEN old_token_name
  RENAME TO new_token_name;
```

Change the comment associated with a programmatic access token:

```sqlexample
ALTER USER IF EXISTS example_user MODIFY PROGRAMMATIC ACCESS TOKEN token_name
  SET COMMENT = 'my new comment';
```

---
title: ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-user-remove-programmatic-access-token.md
section: SQL Commands
---

# ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)

Revokes a [programmatic access token](../../user-guide/programmatic-access-tokens.md) for a user.

> **Note:**
>
> You cannot revoke a programmatic access token in a session where you used a programmatic access token for authentication.

See also:
:   [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-add-programmatic-access-token.md) ,
    [ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-modify-programmatic-access-token.md) ,
    [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-rotate-programmatic-access-token.md) ,
    [SHOW USER PROGRAMMATIC ACCESS TOKENS](show-user-programmatic-access-tokens.md)

## Syntax

```sqlsyntax
ALTER USER [ IF EXISTS ] [ <username> ] REMOVE { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name>
```

## Parameters

`username`
:   The name of the user that the token is associated with.

    If you omit this parameter, the command revokes the token for the user who is currently logged in (the active user in the
    current session).

`REMOVE { PROGRAMMATIC ACCESS TOKEN | PAT } token_name`
:   Revokes a programmatic access token with the specified name.

    You can use the keyword PAT as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKEN.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | User | Required only when revoking a programmatic access token for a human user other than yourself or a service user. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You cannot use revoked programmatic access tokens for authentication.
* You cannot recover programmatic access tokens. You must generate a new programmatic access token instead.
* You cannot revoke a programmatic access token in a session where you used a programmatic access token for authentication.

## Examples

Revoke a programmatic access token named `example_token` from the user `example_user`:

```sqlexample
ALTER USER IF EXISTS example_user REMOVE PROGRAMMATIC ACCESS TOKEN example_token;
```

---
title: ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)
source: https://docs.snowflake.com/en/sql-reference/sql/alter-user-rotate-programmatic-access-token.md
section: SQL Commands
---

# ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)

Rotates [programmatic access token](../../user-guide/programmatic-access-tokens.md), generating a new token secret with an
extended expiration time, and expiring the existing token secret. The new secret is generated using the same DAYS_TO_EXPIRY
property set when the token was first created.

> **Note:**
>
> You cannot rotate a programmatic access token in a session where you used a programmatic access token for the same user for
> authentication.

See also:
:   [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-add-programmatic-access-token.md) ,
    [ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-modify-programmatic-access-token.md) ,
    [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-remove-programmatic-access-token.md) ,
    [SHOW USER PROGRAMMATIC ACCESS TOKENS](show-user-programmatic-access-tokens.md)

## Syntax

```sqlsyntax
ALTER USER [ IF EXISTS ] [ <username> ] ROTATE { PROGRAMMATIC ACCESS TOKEN | PAT } <token_name>
  [ EXPIRE_ROTATED_TOKEN_AFTER_HOURS = <integer> ]
```

## Parameters

`username`
:   The name of the user that the token is associated with.

    If you omit this parameter, the command rotates the token for the user who is currently logged in (the active user in the
    current session).

`ROTATE { PROGRAMMATIC ACCESS TOKEN | PAT } token_name`
:   Rotates a programmatic access token with the specified name.

    You can use the keyword PAT as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKEN.

`EXPIRE_ROTATED_TOKEN_AFTER_HOURS = integer`
:   Sets the expiration time of the existing token secret to expire after the specified number of hours.

    You can set this to a value of `0` to expire the current token secret immediately.

    You can set this to a value in the range of `0` to the number of hours remaining before the current secret expires.

    Default: `24`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY PROGRAMMATIC AUTHENTICATION METHODS | User | Required only when rotating a programmatic access token for a human user other than yourself or a service user. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides information about the rotated programmatic access token in the following columns:

| Column | Description |
| --- | --- |
| `token_name` | Name of the rotated token. |
| `token_secret` | The token itself. Use this to authenticate to an endpoint.  **Note:** The token only appears in the output of the ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN command. No other SQL command or function prints out or returns the token.  If you need to access this token programmatically, you can use [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) to execute this command and retrieve the token from the [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md). |
| `rotated_token_name` | Name of the token that represents the prior secret.  You can use this token object to determine how long the prior secret remains valid. You can also expire the token, if needed. You can’t make any other types of changes to this token.  Note that this token object counts against the maximum number of tokens allowed per user. |

## Usage notes

* When you rotate a programmatic access token:

  + Snowflake does not verify that the [network policy](../../user-guide/programmatic-access-tokens.md) and
    [authentication policy](../../user-guide/programmatic-access-tokens.md) requirements are met.
  + If the programmatic access token is restricted to a role, Snowflake does not verify that the user associated with the token
    has been granted that role.

## Examples

Rotate a programmatic access token associated with the user `example_user`:

```sqlexample
ALTER USER IF EXISTS example_user ROTATE PROGRAMMATIC ACCESS TOKEN token_name;
```

Rotate a programmatic access token associated with the user `example_user` and expire the existing token secret
immediately:

```sqlexample
ALTER USER IF EXISTS example_user ROTATE PROGRAMMATIC ACCESS TOKEN token_name
  EXPIRE_ROTATED_TOKEN_AFTER_HOURS=0;
```

---
title: ALTER VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/alter-view.md
section: SQL Commands
---

# ALTER VIEW

Modifies the properties for an existing view. Currently the only supported operations are:

* Renaming a view.
* Converting to (or reverting from) a secure view.
* Adding, overwriting, removing a comment for a view.

Note that you cannot use this command to change the definition for a view. To change the view definition, you must drop the view and
then recreate it.

See also:
:   [CREATE VIEW](create-view.md) , [DROP VIEW](drop-view.md) , [SHOW VIEWS](show-views.md) , [DESCRIBE VIEW](desc-view.md)

## Syntax

```sqlsyntax
ALTER VIEW [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER VIEW [ IF EXISTS ] <name> SET
  [ SECURE ]
  [ CHANGE_TRACKING =  { TRUE | FALSE } ]
  [ CONTACT <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ]
  [ COMMENT = '<string_literal>' ]

ALTER VIEW [ IF EXISTS ] <name> UNSET
  [ SECURE ]
  [ CONTACT <purpose> ]
  [ COMMENT = '<string_literal>' ]
  [ DCM PROJECT ]

ALTER VIEW <name> dataMetricFunctionAction

ALTER VIEW [ IF EXISTS ] <name> dataGovnPolicyTagAction
```

Where:

> ```sqlsyntax
> dataMetricFunctionAction ::=
>
>     SET DATA_METRIC_SCHEDULE = {
>         '<num> MINUTE'
>       | 'USING CRON <expr> <time_zone>'
>       | 'TRIGGER_ON_CHANGES'
>     }
>
>   | UNSET DATA_METRIC_SCHEDULE
>
>   | { ADD | DROP } DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       [ EXPECTATION <expectation_name> ( <expression> )
>         [, <expectation_name> ( <expression> ) [ , ... ] ] ]
>       [ EXECUTE AS ROLE <role_name> ]
>       [ ANOMALY_DETECTION = { TRUE | FALSE } ]
>       [ SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' } ]
>       [ , <metric_name_2> ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] ) ]
>         [ EXPECTATION <expectation_name> ( <expression> )
>           [, <expectation_name> ( <expression> ) [ , ... ] ] ]
>         [ EXECUTE AS ROLE <role_name> ]
>         [ ANOMALY_DETECTION = { TRUE | FALSE } ]
>         [ SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' } ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>         { SUSPEND | RESUME }
>       [ , <metric_name_2> ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>         { SUSPEND | RESUME } ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       { ADD | MODIFY } EXPECTATION <expectation_name> ( <expression> )
>           [, <expectation_name> ( <expression> ) [ , ... ] ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       ON ( <col_name> [ , ... ] [ , TABLE <table_name>( <col_name> [ , ... ] ) ] )
>       DROP EXPECTATION <expectation_name> [ , <expectation_name> [ , ... ] ]
>
>   | MODIFY DATA METRIC FUNCTION <metric_name>
>       SET <list_of_properties>
> ```
>
> ```sqlsyntax
> dataGovnPolicyTagAction ::=
>   {
>       SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>     | UNSET TAG <tag_name> [ , <tag_name> ... ]
>   }
>   |
>   {
>       ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ROW ACCESS POLICY <policy_name>
>     | DROP ROW ACCESS POLICY <policy_name> ,
>         ADD ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , ... ] )
>     | DROP ALL ROW ACCESS POLICIES
>   }
>   |
>   {
>       SET AGGREGATION POLICY <policy_name>
>         [ ENTITY KEY ( <col_name> [, ... ] ) ]
>         [ FORCE ]
>     | UNSET AGGREGATION POLICY
>   }
>   |
>   {
>       SET JOIN POLICY <policy_name>
>         [ FORCE ]
>     | UNSET JOIN POLICY
>   }
>   |
>   ADD [ COLUMN ] [ IF NOT EXISTS ] <col_name> <col_type>
>     [ [ WITH ] MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] ]
>     [ [ WITH ] PROJECTION POLICY <policy_name> ]
>     [ [ WITH ] TAG ( <tag_name> = '<tag_value>'
>           [ , <tag_name> = '<tag_value>' , ... ] ) ]
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET MASKING POLICY <policy_name>
>           [ USING ( <col1_name> , <cond_col_1> , ... ) ] [ FORCE ]
>       | UNSET MASKING POLICY
>   }
>   |
>   {
>     { ALTER | MODIFY } [ COLUMN ] <col1_name>
>         SET PROJECTION POLICY <policy_name>
>           [ FORCE ]
>       | UNSET PROJECTION POLICY
>   }
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> SET TAG
>       <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>       , [ COLUMN ] <col2_name> SET TAG
>           <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
>   |
>   { ALTER | MODIFY } [ COLUMN ] <col1_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
>                    , [ COLUMN ] <col2_name> UNSET TAG <tag_name> [ , <tag_name> ... ]
> ```

## Parameters

`name`
:   Specifies the identifier for the view to alter. If the identifier contains spaces or special characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`RENAME TO new_name`
:   Specifies the new identifier for the view; must be unique for the schema.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    You can move the object to a different database and/or schema while optionally renaming the object. To do so, specify
    a qualified `new_name` value that includes the new database and/or schema name in the form
    `db_name.schema_name.object_name` or `schema_name.object_name`, respectively.

    > **Note:**
    >
    > * The destination database and/or schema must already exist. In addition, an object with the same name cannot already
    >   exist in the new location; otherwise, the statement returns an error.
    > * Moving an object to a managed access schema is prohibited unless the object owner (that is, the role that has
    >   the OWNERSHIP privilege on the object) also owns the target schema.

    When an object is renamed, other objects that reference it must be updated with the new name.

`SET ...`
:   Specifies the property to set for the view:

    `SECURE`
    :   Specifies a view as secure.

    `CHANGE_TRACKING = TRUE | FALSE`
    :   Specifies to enable or disable change tracking on the table.

        * `TRUE` enables change tracking on the view, and cascades the setting to all underlying tables.
        * `FALSE` disables change tracking on the view, and cascades the setting to all underlying tables.

    `CONTACT purpose = contact [ , purpose = contact ... ]`
    :   Associate the existing object with one or more [contacts](../../user-guide/contacts-using.md). For a list of valid purposes, see [Associate a contact with an object](../../user-guide/contacts-using.md).

        You cannot set the CONTACT property with other properties in the same statement.

    `COMMENT = 'string_literal'`
    :   Adds a comment or overwrites an existing comment for the view.

    > **Note:**
    >
    > You must set each view property individually.

`UNSET ...`
:   Specifies the property to unset for the view, which resets it to the default:

    * `SECURE`
    * `CONTACT purpose`
    * `COMMENT`

    When resetting a property, specify only the name; specifying a value for the property will return an error.

    > **Note:**
    >
    > You must reset each view property individually.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the view from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the view and the DCM project without dropping the view. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

## Data metric function actions (`dataMetricFunctionAction`)

`DATA_METRIC_SCHEDULE ...`
:   Specifies the schedule to run the data metric function periodically.

    `'num MINUTE'`
    :   Specifies an interval (in minutes) of wait time inserted between runs of the data metric function. Accepts positive integers only.

        Also supports `num M` syntax.

        For data metric functions, use one of the following values: `5`, `15`, `30`, `60`, `720`, or `1440`.

        If you want to suspend all DMFs associated with the object, set the parameter to an empty string.

        Default: `60 MINUTE`

    `'USING CRON expr time_zone'`
    :   Specifies a cron expression and time zone for periodically running the data metric function. Supports a subset of standard cron utility
        syntax.

        For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones).

        The cron expression consists of the following fields, and the periodic interval must be at least 5 minutes:

        ```bash
        # __________ minute (0-59)
        # | ________ hour (0-23)
        # | | ______ day of month (1-31, or L)
        # | | | ____ month (1-12, JAN-DEC)
        # | | | | _ day of week (0-6, SUN-SAT, or L)
        # | | | | |
        # | | | | |
          * * * * *
        ```

        The following special characters are supported:

        `*`
        :   Wildcard. Specifies any occurrence of the field.

        `L`
        :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of
            a given month. In the day-of-month field, it specifies the last day of the month.

        `/{n}`
        :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
            specified in the month field, then the data metric function is scheduled for April, July and October (i.e. every 3 months, starting
            with the 4th month of the year). The same schedule is maintained in subsequent years. That is, the data metric function is
            not scheduled to run in January (3 months after the October run).

        > **Note:**
        >
        > * The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
        >   for the account (or setting the value at the user or session level) does not change the time zone for the data metric
        >   function.
        > * The cron expression defines all valid run times for the data metric function. Snowflake attempts to run a data metric
        >   function based on this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid
        >   run time starts.
        > * When both a specific day of month and day of week are included in the cron expression, then the data metric function is scheduled
        >   on days satisfying either the day of month or day of week. For example,
        >   `DATA_METRIC_SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'` schedules a data metric function at 0AM on any 10th to 20th day
        >   of the month and also on any Tuesday or Thursday outside of those dates.
        > * The shortest granularity of time in cron is minutes.
        >
        >   If a data metric function is resumed during the minute defined in its cron expression, the first scheduled run of the data metric
        >   function is the next occurrence of the instance of the cron expression. For example, if data metric function scheduled to run daily
        >   at midnight (`USING CRON 0 0 * * *`) is resumed at midnight plus 5 seconds (`00:00:05`), the first data metric function run
        >   is scheduled for the following midnight.

    `'TRIGGER_ON_CHANGES'`
    :   Specifies that the DMF runs when a [DML operation](../sql-dml.md) modifies the table, such as inserting a new row or
        deleting a row.

        You can specify `'TRIGGER_ON_CHANGES'` for the following objects:

        * Dynamic tables
        * External tables
        * Apache Iceberg™ tables
        * Regular tables
        * Temporary tables
        * Transient tables

        You cannot specify `'TRIGGER_ON_CHANGES'` for views.

        Changes to the table as a result of [reclustering](../../user-guide/tables-auto-reclustering.md) do not trigger the DMF to run.

`UNSET DATA_METRIC_SCHEDULE`
:   Resets the schedule for DMFs associated with the object to the default of `60 MINUTE`.

    If you want to suspend DMFs associated with the object, run a `SET DATA_METRIC_SCHEDULE = ''` statement instead.

`{ ADD | DROP } DATA METRIC FUNCTION metric_name`
:   Identifier of the data metric function to add to the table or view or drop from the table or view.

    `ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] )`
    :   The table or view columns on which to associate the data metric function. The data types of the columns must match the data types of
        the columns specified in the data metric function definition.

        If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its columns.

    `EXPECTATION expectation_name ( expression ) [, expectation_name ( expression ) [ , ... ] ]`
    :   Defines one or more [expectations](../../user-guide/data-quality-expectations.md) for the association between the column and the DMF.

    `[ , metric_name_2 ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] ) ]`
    :   Additional data metric functions to add to the table or view. Use a comma to separate each data metric function and its specified
        columns.

        If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its columns.

    `ANOMALY_DETECTION = { TRUE | FALSE }`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts that are Enterprise Edition (or higher).

        To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Specifies whether Snowflake uses the DMF to [automatically detect anomalies](../../user-guide/data-quality-anomaly.md) in the table.

        Default: `FALSE`

    `SENSITIVITY = { LOW | MEDIUM | HIGH }`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts that are Enterprise Edition (or higher).

        To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

        Specifies the sensitivity of the anomaly-detecting algorithm. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md).

        Default: `'MEDIUM'`

    `EXECUTE AS ROLE role_name`
    :   Specifies which role the DMF runs with. The role must have the SELECT privilege on the table or view.

        For more information, see [Required privilege on the table or view](../../user-guide/data-quality-access-control.md).

`MODIFY DATA METRIC FUNCTION metric_name`
:   Identifier of the data metric function to modify.

    `ON ( col_name [ , ... ] [ , TABLE( table_name( col_name [ , ... ] ) ) ] )`
    :   Specifies the columns associated with the data metric function. If the data metric function accepts a second table as an argument,
        specify the fully qualified name of the table and its columns.

    `{ SUSPEND | RESUME }`
    :   Suspends or resumes the data metric function on the specified columns. When a data metric function is set for a table or view, the data
        metric function is automatically included in the schedule.

        * `SUSPEND` removes the data metric function from the schedule.
        * `RESUME` brings a suspended date metric function back into the schedule.

    `{ ADD | MODIFY } EXPECTATION expectation_name ( expression ) [, expectation_name ( expression ) [ , ... ] ]`
    :   Defines or modifies one or more [expectations](../../user-guide/data-quality-expectations.md) for the association between the column and
        the DMF.

    `DROP EXPECTATION expectation_name [ , expectation_name [ , ... ] ]`
    :   Removes the specified expectations from the association between the column and the DMF.

    `[ , metric_name_2 ON ( col_name [ , ... ] [ , TABLE(col_name [ , ... ] ) ] ) ]`
    :   Additional data metric functions to modify. Use a comma to separate each data metric function and its specified
        columns. If the data metric function accepts a second table as an argument, specify the fully qualified name of the table and its
        columns.

    `SET list_of_properties`
    :   Sets one or more properties of the association between the DMF and the object. You set more than one property by using a space-delimited list.

        `ANOMALY_DETECTION = { TRUE | FALSE }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Controls whether Snowflake uses the DMF to [automatically detect anomalies](../../user-guide/data-quality-anomaly.md) in the table.

        `SENSITIVITY = { 'LOW' | 'MEDIUM' | 'HIGH' }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Sets the sensitivity of the anomaly-detecting algorithm. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md).

        `DATA_QUALITY_NOTIFICATION = { TRUE | FALSE }`
        :   [Preview Feature](../../release-notes/preview-features.md) — Open

            Available to all accounts that are Enterprise Edition (or higher).

            To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

            Controls whether notifications are sent when the value returned by the DMF is an expectation violation or an anomaly.

            Notifications are sent if the parameter is set to `TRUE` *and* notifications are turned on for the object’s database. Specify `FALSE` to turn off notifications for this object-DMF association even though notifications are sent for other associations in the database.

            For more information about configuring notifications, see [Sending notifications for data quality issues](../../user-guide/data-quality-notifications.md).

            Default: `TRUE`

## Data Governance policy and tag actions (`dataGovnPolicyTagAction`)

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`policy_name`
:   Identifier for the policy; must be unique for your schema.

The following clauses apply to all table kinds that support row access policies, such as but not limited to tables, views, and event tables.
To simplify, the clauses just refer to “table.”

> `ADD ROW ACCESS POLICY policy_name ON (col_name [ , ... ])`
> :   Adds a row access policy to the table.
>
>     At least one column name must be specified. Additional columns can be specified with a comma separating each column name. Use this
>     expression to add a row access policy to both an event table and an external table.
>
> `DROP ROW ACCESS POLICY policy_name`
> :   Drops a row access policy from the table.
>
>     Use this clause to drop the policy from the table.
>
> `DROP ROW ACCESS POLICY policy_name, ADD ROW ACCESS POLICY policy_name ON ( col_name [ , ... ] )`
> :   Drops the row access policy that is set on the table and adds a row access policy to the same table in a single SQL statement.
>
> `DROP ALL ROW ACCESS POLICIES`
> :   Drops all [row access policy](../../user-guide/security-row-using.md) associations from the table.
>
>     This expression is helpful when a row access policy is dropped from a schema before dropping the policy from an event table. Use this expression to drop row access policy associations from the table.
>
>     Suppose that a row access policy applied to the table when the backup was created, and the policy was later dropped. After you
>     restore the table from a [backup](../../user-guide/backups.md), you can’t query it until you run an ALTER TABLE command with the
>     DROP ALL ROW ACCESS POLICIES clause.
>
> `SET AGGREGATION POLICY policy_name`
> :   `[ ENTITY KEY (col_name [ , ... ]) ] [ FORCE ]`
>     :   Assigns an [aggregation policy](../../user-guide/aggregation-policies.md) to the table.
>
>         Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
>         [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).
>
>         Use the optional FORCE parameter to atomically replace an existing aggregation policy with the new aggregation policy.
>
> `UNSET AGGREGATION POLICY`
> :   Detaches an aggregation policy from the table.
>
> `SET JOIN POLICY policy_name`
> :   `[ FORCE ]`
>     :   Assigns a [join policy](../../user-guide/join-policies.md) to the table.
>
>         Use the optional FORCE parameter to atomically replace an existing join policy with the new join policy.
>
> `UNSET JOIN POLICY`
> :   Detaches a join policy from the table.

`{ ALTER | MODIFY } [ COLUMN ] ...`
:   `USING ( col_name , cond_col_1 ... )`
    :   Specifies the arguments to pass into the conditional masking policy SQL expression.

        The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
        column to which the masking policy is set.

        The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query
        result when a query is made on the first column.

        If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
        [masking policy](../../user-guide/security-column-intro.md).

    `FORCE`
    :   Replaces a masking or projection policy that is currently set on a column with a different policy in a single statement.

        Note that using the `FORCE` keyword with a masking policy requires the [data type](../../sql-reference-data-types.md) of the policy
        in the ALTER TABLE statement (i.e. STRING) to match the data type of the masking policy currently set on the column (i.e. STRING).

        If a masking policy is not currently set on the column, specifying this keyword has no effect.

        For details, see: [Replace a masking policy on a column](../../user-guide/security-column-intro.md) or [Replace a projection policy](../../user-guide/projection-policies.md).

## Usage notes: General

* Moving a view to a managed access schema (using the ALTER VIEW … RENAME TO syntax) is prohibited unless the view owner (i.e.
  the role that has the OWNERSHIP privilege on the view) also owns the target schema.

* For masking policies:

  + The `USING` clause and the `FORCE` keyword are both optional; neither are required to set a masking policy on a column. The
    `USING` clause and the `FORCE` keyword can be used separately or together. For details, see:

    - [Apply a conditional masking policy on a column](../../user-guide/security-column-intro.md)
    - [Replace a masking policy on a column](../../user-guide/security-column-intro.md)
  + A single masking policy that uses conditional columns can be applied to multiple tables provided that the column structure of the table
    matches the columns specified in the policy.
  + When modifying one or more table columns with a masking policy or the table itself with a row access policy, use the
    [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the
    table protected by a row access policy.

* A single masking policy that uses conditional columns can be applied to multiple views provided that the column structure of the view
  matches the columns specified in the policy.

* For row access policies:

  + Snowflake supports adding and dropping row access policies in a single SQL statement.

    For example, to replace a row access policy that is already set on a table with a different policy, drop the row access policy first
    and then add the new row access policy.
  + For a given resource (i.e. table or view), to `ADD` or `DROP` a row access policy you must have either the
    [APPLY ROW ACCESS POLICY](../../user-guide/security-row-intro.md) privilege on the schema, or the
    [OWNERSHIP](../../user-guide/security-row-intro.md) privilege on the resource and the APPLY privilege on the row access policy resource.
  + A table or view can only be protected by one row access policy at a time. Adding a policy fails if the policy body refers to a table or
    view column that is protected by a row access policy or the column protected by a masking policy.

    Similarly, adding a masking policy to a table column fails if the masking policy body refers to a table that is protected by a row
    access policy or another masking policy.
  + Row access policies cannot be applied to system views or table functions.
  + Similar to other [DROP <object>](drop.md) operations, Snowflake returns an error if attempting to drop a row access policy from a
    resource that does not have a row access policy added to it.
  + If an object has both a row access policy and one or more masking policies, the row access policy is evaluated first.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Usage notes: Data metric functions

Add a DMF to a table:
:   Prior to adding a data metric function to a table, you must:

    * Set the schedule for the data metric function to run. For details, see
      [DATA_METRIC_SCHEDULE](../parameters.md).
    * Configure the event table to store the results of calling the data metric function. For details, see
      [View results of a data metric function](../../user-guide/data-quality-results.md).
    * Ensure that the table is view is not granted to a share because you cannot set a data metric function on a shared table or view.

    Additionally:

    * When you specify a column, Snowflake uses the ordinal position. If you rename a column after adding a data metric function to the table
      or view, the association of the data metric function to the column remains valid.
    * Only one data metric function of its kind can be added to a column. For example, a NULL_COUNT data metric function cannot be added to a
      single column twice.
    * If you drop a column after adding a data metric function that references the column, Snowflake cannot evaluate the data metric function.
    * Referencing a virtual column is not supported.

Schedule a DMF
:   It takes ten minutes for the schedule to become effective once the schedule is set.

    Similarly, it takes ten minutes once the DMF is unset for the scheduling changes to take effect. For more information, see
    [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md).

## Examples

Rename view `view1` to `view2`:

```sqlexample
ALTER VIEW view1 RENAME TO view2;
```

Convert a view to a secure view:

```sqlexample
ALTER VIEW view1 SET SECURE;
```

Revert a secure view to a regular view:

```sqlexample
ALTER VIEW view1 UNSET SECURE;
```

Apply a Column-level Security masking policy to a view column:

```sqlexample
-- single column

ALTER VIEW user_info_v MODIFY COLUMN ssn_number SET MASKING POLICY ssn_mask_v;

-- multiple columns

ALTER VIEW user_info_v MODIFY
  COLUMN ssn_number SET MASKING POLICY ssn_mask_v,
  COLUMN dob SET MASKING POLICY dob_mask_v
  ;
```

Unset a Column-level Security masking policy from a view column:

```sqlexample
-- single column

ALTER VIEW user_info_v MODIFY COLUMN ssn_number UNSET MASKING POLICY;

-- multiple columns

ALTER VIEW user_info_v MODIFY
  COLUMN ssn_number UNSET MASKING POLICY,
  COLUMN dob UNSET MASKING POLICY
  ;
```

The following example adds a row access policy on a view. After setting policies, you can verify their
referenced objects by checking the [information schema](../../user-guide/security-row-intro.md).

```sqlexample
ALTER VIEW v1
  ADD ROW ACCESS POLICY rap_v1 ON (empl_id);
```

The following example drops a row access policy from a view. Verify that the policies were dropped by querying the
[information schema](../../user-guide/security-row-intro.md).

```sqlexample
ALTER VIEW v1
  DROP ROW ACCESS POLICY rap_v1;
```

The following example shows how to combine adding and dropping row access policies in a single SQL statement for a view. Verify the
results by checking the [information schema](../../user-guide/security-row-intro.md).

```sqlexample
ALTER VIEW v1
  DROP ROW ACCESS POLICY rap_v1_version_1,
  ADD ROW ACCESS POLICY rap_v1_version_2 ON (empl_id);
```

The following example sets a [join policy](../../user-guide/join-policies.md) on a view:

```sqlexample
ALTER VIEW join_view
  SET JOIN POLICY jp1;
```

---
title: ALTER WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/alter-warehouse.md
section: SQL Commands
---

# ALTER WAREHOUSE

Suspends or resumes a [virtual warehouse](../../user-guide/warehouses-overview.md),
or aborts all queries (and other SQL statements) for a warehouse. Can also be used to rename or
set/unset the properties for a warehouse.

See also:
:   [CREATE WAREHOUSE](create-warehouse.md) , [DESCRIBE WAREHOUSE](desc-warehouse.md) , [DROP WAREHOUSE](drop-warehouse.md) , [SHOW WAREHOUSES](show-warehouses.md)

## Syntax

```sqlsyntax
ALTER WAREHOUSE [ IF EXISTS ] [ <name> ] { SUSPEND | RESUME [ IF SUSPENDED ] }

ALTER WAREHOUSE [ IF EXISTS ] [ <name> ] ABORT ALL QUERIES

ALTER WAREHOUSE [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER WAREHOUSE [ IF EXISTS ] <name> SET [ objectProperties ]
                                         [ objectParams ]

ALTER WAREHOUSE [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER WAREHOUSE [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]

ALTER WAREHOUSE [ IF EXISTS ] <name> UNSET { <property_name> | <param_name> } [ , ... ]

ALTER WAREHOUSE [ IF EXISTS ] <name> UNSET DCM PROJECT

ALTER WAREHOUSE [ IF EXISTS ] <name> ADD TABLES ( <table_name> [ , <table_name> ... ] )

ALTER WAREHOUSE [ IF EXISTS ] <name> DROP TABLES ( <table_name> [ , <table_name> ... ] )
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   WAREHOUSE_TYPE = { STANDARD | 'SNOWPARK-OPTIMIZED' }
>   WAREHOUSE_SIZE = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE | X5LARGE | X6LARGE }
>   GENERATION = { '1' | '2' }
>   RESOURCE_CONSTRAINT = { STANDARD_GEN_1 | STANDARD_GEN_2 | MEMORY_1X | MEMORY_1X_x86 | MEMORY_16X | MEMORY_16X_x86 | MEMORY_64X | MEMORY_64X_x86 }
>   WAIT_FOR_COMPLETION = { TRUE | FALSE }
>   MAX_CLUSTER_COUNT = <num>
>   MIN_CLUSTER_COUNT = <num>
>   SCALING_POLICY = { STANDARD | ECONOMY }
>   AUTO_SUSPEND = { <num> | NULL }
>   AUTO_RESUME = { TRUE | FALSE }
>   RESOURCE_MONITOR = <monitor_name>
>   COMMENT = '<string_literal>'
>   ENABLE_QUERY_ACCELERATION = { TRUE | FALSE }
>   QUERY_ACCELERATION_MAX_SCALE_FACTOR = <num>
> ```
>
> ```sqlsyntax
> objectParams ::=
>   MAX_CONCURRENCY_LEVEL = <num>
>   STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
>   STATEMENT_TIMEOUT_IN_SECONDS = <num>
> ```

## Properties/parameters

`name`
:   Specifies the identifier for the warehouse to alter. If the identifier contains spaces or special characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > A warehouse identifier is required or optional depending on the following:
    >
    > * When resuming/suspending a warehouse or aborting queries for a warehouse, if a warehouse is currently in use for the session, the identifier
    >   can be omitted.
    > * When renaming a warehouse or performing any other operations on a warehouse, the identifier must be specified.

`{ SUSPEND | RESUME [ IF SUSPENDED ] }`
:   Specifies the action to perform on the warehouse:

    * `SUSPEND` removes all compute nodes from a warehouse and puts the warehouse into a ‘Suspended’ state.
    * `RESUME [ IF SUSPENDED ]` brings a suspended warehouse to a usable ‘Running’ state by provisioning compute resources.

      The optional `IF SUSPENDED` clause specifies whether the ALTER WAREHOUSE command completes successfully when resuming a warehouse that
      is already running:

      + If omitted, the command fails and returns an error if the warehouse is already running.
      + If specified, the command completes successfully regardless of whether the warehouse is running.

`ABORT ALL QUERIES`
:   Aborts all the queries currently running or queued on the warehouse.

`RENAME TO new_name`
:   Specifies a new identifier for the warehouse; must be unique for your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`SET ...`
:   Specifies one or more properties/parameters to set for the warehouse (separated by blank spaces, commas, or new lines).

    > **Note:**
    >
    > Only some properties apply to an [interactive warehouse](../../user-guide/interactive.md).
    > For the list of properties that you can use, see [CREATE INTERACTIVE WAREHOUSE](create-interactive-warehouse.md).
    >
    > `WAREHOUSE_TYPE = { STANDARD | 'SNOWPARK-OPTIMIZED' }`
    > :   Specifies the warehouse type.
    >
    >     Valid values:
    >     :   * `STANDARD`, `'STANDARD'`
    >         * `'SNOWPARK-OPTIMIZED'`
    >
    >     Default:
    >     :   `STANDARD`
    >
    >     > **Note:**
    >     >
    >     > * To use a value that contains a hyphen (`'SNOWPARK-OPTIMIZED'`), you must enclose the value in single quotes, as shown.
    >
    > `WAREHOUSE_SIZE = string_constant`
    > :   Specifies the size of the virtual warehouse. The size determines the amount of compute resources in each cluster and, therefore,
    >     the number of credits consumed while the warehouse is running.
    >
    >     For more information see [Resizing a warehouse](../../user-guide/warehouses-tasks.md).
    >
    >     Valid values:
    >     :   * `XSMALL` , `'X-SMALL'`
    >         * `SMALL`
    >         * `MEDIUM`
    >         * `LARGE`
    >         * `XLARGE` , `'X-LARGE'`
    >         * `XXLARGE` , `X2LARGE` , `'2X-LARGE'`
    >         * `XXXLARGE` , `X3LARGE` , `'3X-LARGE'`
    >         * `X4LARGE` , `'4X-LARGE'`
    >         * `X5LARGE` , `'5X-LARGE'`
    >         * `X6LARGE` , `'6X-LARGE'`
    >
    >     Default:
    >     :   `XSMALL`
    >
    >     > **Note:**
    >     >
    >     > * The default size for Snowpark-optimized warehouses is MEDIUM.
    >     > * X5LARGE and X6LARGE sizes for Snowpark-optimized warehouses are only supported with the MEMORY_16X resource constraint.
    >     > * X5LARGE and X6LARGE sizes aren’t supported for standard warehouses that use the STANDARD_GEN_2 resource constraint.
    >     > * To use a value that contains a hyphen (for example, `'2X-LARGE'`), you must enclose the value in single quotes, as shown.
    >     > * To block the immediate return of the ALTER WAREHOUSE command until the resize is complete, add the
    >     >   WAIT_FOR_COMPLETION parameter.
    >     > * The upper limit for the MAX_CLUSTER_COUNT property depends on the warehouse size. When you change WAREHOUSE_SIZE
    >     >   to a value higher than `MEDIUM`, you might need to reduce MAX_CLUSTER_COUNT at the same time. For the upper limit
    >     >   on MAX_CLUSTER_COUNT for each warehouse size, see [Upper limit on number of clusters for a multi-cluster warehouse](../../user-guide/warehouses-multicluster.md).
    >     > * Larger warehouse sizes 5X-Large and 6X-Large are generally available in all Amazon Web Services (AWS) and Microsoft Azure regions.
    >     >
    >     >   Larger warehouse sizes are in preview in US Government regions (requires FIPS support on ARM).
    >
    > `GENERATION = { '1' | '2' }`
    > :   Specifies the warehouse generation for standard warehouses. This parameter provides a simplified way to set the warehouse generation,
    >     instead of using RESOURCE_CONSTRAINT = STANDARD_GEN_1 or STANDARD_GEN_2.
    >
    >     Valid values:
    >     :   * `'1'`: Uses generation 1 compute resources. Equivalent to
    >           `RESOURCE_CONSTRAINT = STANDARD_GEN_1`.
    >         * `'2'`: Uses generation 2 compute resources. Equivalent to
    >           `RESOURCE_CONSTRAINT = STANDARD_GEN_2`.
    >
    >     Default:
    >     :   `'1'` (generation 1 compute resources)
    >
    >     > **Note:**
    >     >
    >     > * Values must be enclosed in single quotes (for example, `'1'`, not `1`).
    >     > * For standard warehouses, the default depends on Gen2 support for your cloud service provider region and whether your organization was created after Gen2 support became available in that region. For more information, see [Default value for the RESOURCE_CONSTRAINT for standard warehouses](../../user-guide/warehouses-gen2.md).
    >     > * GENERATION applies only to standard warehouses (WAREHOUSE_TYPE = STANDARD).
    >     > * When both GENERATION and RESOURCE_CONSTRAINT are specified, any mismatch results in an error.
    >     > * You can’t use GENERATION with Snowpark-optimized warehouses or memory-based resource constraints.
    >
    > `RESOURCE_CONSTRAINT = { STANDARD_GEN_1 | STANDARD_GEN_2 | MEMORY_1X| MEMORY_1X_x86 | MEMORY_16X | MEMORY_16X_x86 | MEMORY_64X | MEMORY_64X_x86 }`
    > :   [Preview Feature](../../release-notes/preview-features.md) — Open
    >
    >     The 1 TB resource constraints (MEMORY_64X and MEMORY_64X_x86) are available as a preview feature.
    >     The 1 TB constraints are available only on the Amazon Web Services (AWS) cloud platform.
    >
    >     All other MEMORY_\* resource constraint sizes are generally available and are available for all cloud platforms.
    >
    >     Specifies the memory and CPU architecture for [Snowpark-optimized warehouses](../../user-guide/warehouses-snowpark-optimized.md),
    >     or generation 1 or [generation 2 capabilities for standard warehouses](../../user-guide/warehouses-gen2.md).
    >
    >     The following table includes the valid values for the property, available memory, CPU architecture, and the minimum warehouse
    >     size required for the `resource_constraint` setting.
    >     For more information about regions and cloud service providers where generation 2 standard warehouses
    >     are available, see [Snowflake generation 2 standard warehouses](../../user-guide/warehouses-gen2.md).
    >
    >     > Valid values:
    >     >
    >     > | Value | Memory (up to) | CPU architecture | Min warehouse size required | Max warehouse size |
    >     > | --- | --- | --- | --- | --- |
    >     > | `STANDARD_GEN_1` | 16 GB | Standard | XSMALL | X6LARGE |
    >     > | `STANDARD_GEN_2` | 16 GB | Standard (generation 2) | XSMALL | X4LARGE |
    >     > | `MEMORY_1X` | 16 GB | Standard | XSMALL | X4LARGE |
    >     > | `MEMORY_1X_x86` | 16 GB | x86 | XSMALL | X4LARGE |
    >     > | `MEMORY_16X` | 256 GB | Standard | MEDIUM | X6LARGE |
    >     > | `MEMORY_16X_x86` | 256 GB | x86 | MEDIUM | X4LARGE |
    >     > | `MEMORY_64X` | 1 TB | Standard | LARGE | X4LARGE |
    >     > | `MEMORY_64X_x86` | 1 TB | x86 | LARGE | X4LARGE |
    >     >
    >     > Default value:
    >     > :   `MEMORY_16X` for Snowpark-optimized warehouses. For standard warehouses, the default depends on
    >     >     Gen2 support for your cloud service provider region and whether your organization was created after
    >     >     Gen2 support became available in that region. For more information, see
    >     >     [Default value for the RESOURCE_CONSTRAINT for standard warehouses](../../user-guide/warehouses-gen2.md).
    >     >
    >     > > **Tip:**
    >     > >
    >     > > For standard warehouses, consider using the GENERATION parameter instead of STANDARD_GEN_1 and STANDARD_GEN_2 values.
    >     > > The GENERATION parameter provides a simpler way to specify the warehouse generation.
    >     > > Specify `GENERATION = '2'` or `GENERATION = '1'`. The quotes are required around the
    >     > > generation number.
    >
    > `WAIT_FOR_COMPLETION = { TRUE | FALSE }`
    > :   When resizing a warehouse, you can use this parameter to block the return of the ALTER WAREHOUSE command until the resize has finished
    >     provisioning all its compute resources. Blocking the return of the command when resizing to a larger warehouse serves to notify you
    >     that your compute resources have been fully provisioned and the warehouse is now ready to execute queries using all the new resources.
    >
    >     Valid values:
    >     :   * `TRUE`: The ALTER WAREHOUSE command will block until the warehouse resize completes.
    >         * `FALSE`: The ALTER WAREHOUSE command returns immediately, before the warehouse resize completes.
    >
    >     Default:
    >     :   FALSE
    >
    >     > **Note:**
    >     >
    >     > * The value of this parameter isn’t persisted and must be set to TRUE on every execution if you want the warehouse resizing to
    >     >   complete before this command returns.
    >     > * If set to `TRUE` and you abort the ALTER WAREHOUSE command, only the waiting is aborted and the warehouse resize will go
    >     >   through. To resize the warehouse back to its original size, you will need to execute another ALTER WAREHOUSE command.
    >     > * This parameter must be used with the WAREHOUSE_SIZE parameter, otherwise an exception will be thrown.
    >
    > `MAX_CLUSTER_COUNT = num`
    > :   Specifies the maximum number of clusters for a multi-cluster warehouse. For a single-cluster warehouse, this value is always `1`.
    >
    >     Valid values:
    >     :   `1` to an upper limit that varies depending on warehouse size.
    >
    >         Note that specifying a value greater than `1` indicates the warehouse is a multi-cluster warehouse; however, the value can
    >         only be set to a higher value in [Snowflake Enterprise Edition](../../user-guide/intro-editions.md) (or higher).
    >
    >         * The upper limit for the MAX_CLUSTER_COUNT property depends on the warehouse size. When you change WAREHOUSE_SIZE
    >           to a value higher than MEDIUM, you might need to reduce MAX_CLUSTER_COUNT at the same time. For the upper limit
    >           on MAX_CLUSTER_COUNT for each warehouse size, see [Upper limit on number of clusters for a multi-cluster warehouse](../../user-guide/warehouses-multicluster.md).
    >
    >         For more information about multi-cluster warehouses, see [Multi-cluster warehouses](../../user-guide/warehouses-multicluster.md).
    >
    >     Default:
    >     :   `1` (single-cluster warehouse)
    >
    >     > **Tip:**
    >     >
    >     > For Snowflake Enterprise Edition (or higher), we recommend always setting the value greater than `1` to help maintain
    >     > high-availability and optimal performance of the (multi-cluster) warehouse. This also helps ensure continuity in the unlikely event
    >     > that a cluster fails.
    >
    > `MIN_CLUSTER_COUNT = num`
    > :   Specifies the minimum number of clusters for a multi-cluster warehouse.
    >
    >     Valid values:
    >     :   `1` to the value of MAX_CLUSTER_COUNT. The upper limit for MAX_CLUSTER_COUNT varies depending on the warehouse size.
    >
    >         MIN_CLUSTER_COUNT must be equal to or less than MAX_CLUSTER_COUNT:
    >
    >         * If both parameters are equal, the warehouse runs in Maximized mode.
    >         * If MIN_CLUSTER_COUNT is less than MAX_CLUSTER_COUNT, the warehouse runs in Auto-scale mode.
    >
    >         For more information, including the upper limit for each warehouse size, see [Multi-cluster warehouses](../../user-guide/warehouses-multicluster.md).
    >
    >     Default:
    >     :   `1`
    >
    > `SCALING_POLICY = { STANDARD | ECONOMY }`
    > :   Object parameter that specifies the policy for automatically starting and shutting down clusters in a multi-cluster warehouse
    >     running in Auto-scale mode.
    >
    >     For a detailed description of this parameter, see [Setting the scaling policy for a multi-cluster warehouse](../../user-guide/warehouses-multicluster.md).
    >
    > `AUTO_SUSPEND = { num | NULL }`
    > :   Specifies the number of seconds of inactivity after which a warehouse is automatically suspended.
    >
    >     Valid values:
    >     :   Any integer `0` or greater, or `NULL`:
    >
    >         * The background process that suspends a warehouse runs approximately every 30 seconds and therefore, the setting for
    >           this property isn’t intended for enabling precise control over warehouse suspension.
    >         * Setting a value less than 30, or a value that isn’t a multiple of 30, is allowed but might not result in the expected
    >           behavior due to the 30 second poll interval for warehouse suspension.
    >         * Setting a `0` or `NULL` value means the warehouse never suspends.
    >
    >     Default:
    >     :   `600` (the warehouse suspends automatically after 10 minutes of inactivity)
    >
    >     > **Important:**
    >     >
    >     > Setting `AUTO_SUSPEND` to `0` or `NULL` is not recommended, unless your query workloads require a continually running
    >     > warehouse. Note that this can result in significant consumption of credits (and corresponding charges), particularly for larger warehouses.
    >     >
    >     > For more details, see [Warehouse considerations](../../user-guide/warehouses-considerations.md).
    >
    > `AUTO_RESUME = { TRUE | FALSE }`
    > :   Specifies whether to automatically resume a warehouse when a SQL statement (for example, query) is submitted to it. If `FALSE`, the warehouse
    >     only starts again when explicitly resumed using ALTER WAREHOUSE or through the Snowflake web interface.
    >
    >     Valid values:
    >     :   * `TRUE`: The warehouse resumes when a new query is submitted.
    >         * `FALSE`: The warehouse only resumes when explicitly resumed using ALTER WAREHOUSE or through the Snowflake web interface.
    >
    >     Default:
    >     :   `TRUE` (the warehouse resumes automatically when a SQL statement is submitted to it)
    >
    > `INITIALLY_SUSPENDED = { TRUE | FALSE }`
    > :   Not applicable when altering a warehouse
    >
    > `RESOURCE_MONITOR = monitor_name`
    > :   Specifies the identifier of a resource monitor that is explicitly assigned to the warehouse. When a resource monitor is explicitly assigned
    >     to a warehouse, the monitor controls the monthly credits used by the warehouse (and all other warehouses to which the monitor is assigned).
    >
    >     Valid values:
    >     :   Any existing resource monitor.
    >
    >         For more details, see [Working with resource monitors](../../user-guide/resource-monitors.md).
    >
    >     Default:
    >     :   No value (no resource monitor assigned to the warehouse)
    >
    >     > **Tip:**
    >     >
    >     > To view all resource monitors and their identifiers, use the [SHOW RESOURCE MONITORS](show-resource-monitors.md) command.
    >
    > `COMMENT = 'string_literal'`
    > :   Adds a comment or overwrites an existing comment for the warehouse.
    >
    > `MAX_CONCURRENCY_LEVEL = num`
    > :   Object parameter that specifies the concurrency level for SQL statements (i.e. queries and DML) executed by a warehouse cluster. When
    >     the level is reached:
    >
    >     > * For a single-cluster warehouse or a multi-cluster warehouse (in Maximized mode), additional statements are queued until resources
    >     >   are available.
    >     > * For a multi-cluster warehouse (in Auto-scale mode), additional clusters are started.
    >
    >     This parameter can be used in conjunction with `STATEMENT_QUEUED_TIMEOUT_IN_SECONDS` to ensure a warehouse is never backlogged.
    >
    >     For a detailed description of this parameter, see [MAX_CONCURRENCY_LEVEL](../parameters.md).
    >
    > `STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = num`
    > :   Object parameter that specifies the time, in seconds, a SQL statement (query, DDL, DML, etc.) can be queued on a warehouse before it is
    >     canceled by the system.
    >
    >     This parameter can be used in conjunction with `MAX_CONCURRENCY_LEVEL` to ensure a warehouse is never backlogged.
    >
    >     For a detailed description of this parameter, see [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../parameters.md).
    >
    > `STATEMENT_TIMEOUT_IN_SECONDS = num`
    > :   Object parameter that specifies the time, in seconds, after which a running SQL statement (query, DDL, DML, etc.) is canceled by the system.
    >
    >     For a detailed description of this parameter, see [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md).
    >
    > `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    > :   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.
    >
    >     The tag value is always a string, and the maximum number of characters for the tag value is 256.
    >
    >     For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).
    >
    > `ENABLE_QUERY_ACCELERATION = { TRUE | FALSE }`
    > :   Specifies whether to enable the [query acceleration service](../../user-guide/query-acceleration-service.md) for queries that rely on
    >     this warehouse for compute resources.
    >
    >     > [Enterprise Edition Feature](../../user-guide/intro-editions.md)
    >     >
    >     > Query acceleration service requires Enterprise Edition (or higher).
    >     > To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
    >     >
    >     > Valid values:
    >     > :   * `TRUE` Enables Query Acceleration
    >     >     * `FALSE` Disables Query Acceleration
    >     >
    >     > Default:
    >     > :   `FALSE`: Query Acceleration is disabled
    >
    > `QUERY_ACCELERATION_MAX_SCALE_FACTOR = num`
    > :   Specifies the maximum scale factor for leasing compute resources for query acceleration. The scale factor is used as a multiplier based
    >     on [warehouse size](../../user-guide/warehouses-overview.md).
    >
    >     Setting the QUERY_ACCELERATION_MAX_SCALE_FACTOR to 0 eliminates the limit and allows queries to lease as many resources as necessary and
    >     as available to service the query.
    >
    >     Regardless of the QUERY_ACCELERATION_MAX_SCALE_FACTOR value, the amount of available compute resources for query acceleration is bound by
    >     the available resources in the service and the number of other concurrent requests. For more details, refer to
    >     [Adjusting the scale factor](../../user-guide/query-acceleration-service.md).
    >
    >     Valid values:
    >     :   `0` to `100`
    >
    >     Default:
    >     :   `8`

`UNSET ...`
:   > Specifies one (or more) properties and/or parameters to unset for the database, which resets them to the defaults:
    >
    > * `property_name`
    > * `param_name`
    >
    >   + `TAG tag_name [ , tag_name ... ]`
    >
    > You can reset multiple properties/parameters with a single ALTER statement; however, each property/parameter must be separated by
    > a comma. Also, when resetting a property/parameter, you only specify the name; no value is required.

    > **Note:**
    >
    > `UNSET` can be used to unset all the properties and parameters for a warehouse, except `WAREHOUSE_SIZE`, which can only
    > be changed using `SET`.

`UNSET DCM PROJECT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Detaches the warehouse from the [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) that currently manages it.
    The command removes the association between the warehouse and the DCM project without dropping the warehouse. See [Detach objects from a DCM project](../../user-guide/dcm-projects/dcm-projects-use.md) for more information.

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.
Currently, this feature is only available on Amazon Web Services (AWS).

`ADD TABLES ( table_name[, ...] )`
:   Associates one or more [interactive tables with an interactive warehouse](../../user-guide/interactive.md). This action initiates the cache-warming process for the specified
    tables.

    > **Note:**
    >
    > * This clause only applies to interactive warehouses created with the INTERACTIVE keyword.
    > * If an interactive table is already associated with the warehouse, the command succeeds but has no effect.
    > * An interactive table can be associated with multiple interactive warehouses.
    > * Cache warming may take significant time depending on the size of the data.

    `table_name`
    :   Specifies the identifier for an interactive table to associate with the warehouse. You can
        specify multiple table names separated by commas.

[Preview Feature](../../release-notes/preview-features.md) — Open

Available to all accounts.
Currently, this feature is only available on Amazon Web Services (AWS).

`DROP TABLES ( table_name[, ...] )`
:   Removes the association between one or more [interactive tables and an interactive
    warehouse](../../user-guide/interactive.md). This action stops using the cache for the specified
    tables, but does not drop the tables themselves.

    > **Note:**
    >
    > * This clause only applies to interactive warehouses created with the INTERACTIVE keyword.
    > * The interactive tables continue to exist after this operation. This clause does not
    >   perform a DROP TABLE operation.

    `table_name`
    :   Specifies the identifier for an interactive table to disassociate from the warehouse. You can
        specify multiple table names separated by commas.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY | Tag | Enables setting a tag on a warehouse. |
| MODIFY | Warehouse | Enables altering any properties of a warehouse, including changing its size. Required to assign a warehouse to a resource monitor. Only the ACCOUNTADMIN role can assign warehouses to resource monitors. |
| MONITOR | Warehouse | Enables viewing current and past queries executed on a warehouse as well as usage statistics on that warehouse. |
| OPERATE | Warehouse | Enables changing the state of a warehouse (stop, start, suspend, resume), and enables viewing current and past queries executed on a warehouse, and aborting any executing queries. |
| USAGE | Warehouse | Enables using a virtual warehouse and, as a result, executing queries on the warehouse. If the warehouse is configured to auto-resume when a SQL statement (for example, query) is submitted to it, the warehouse resumes automatically and executes the statement. |

> **Tip:**
>
> The granting of the global MANAGE WAREHOUSES privilege is equivalent to granting the MODIFY, MONITOR, and OPERATE
> privileges on all warehouses in an account. You can grant this
> privilege to a role whose purpose includes managing a warehouse to simplify your Snowflake access control management.
>
> For details, refer to [Delegating warehouse management](../../user-guide/warehouses-tasks.md).

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A warehouse does not need to be suspended to set or change any of its properties. You can change the warehouse type
  and resource constraint properties while the warehouse is running.
* When the warehouse size is changed, the change doesn’t impact any statements, including queries, that are currently executing. Once the
  statements complete, and the compute resources are fully provisioned, the new size is used for all subsequent statements.
* Suspending a warehouse does not abort any queries being processed by the warehouse at the time it is suspended. Instead, the
  warehouse completes the queries, then shuts down the compute resources used to process the queries. During this time period, the warehouse
  is in *quiescing* mode. When all the compute resources are shut down, the warehouse’s status changes to Suspended.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* Resuming a Snowpark-optimized virtual warehouse may take longer than standard warehouses.
* Snowpark-optimized warehouses don’t support [Query Acceleration](../../user-guide/query-acceleration-service.md).
* Specifying the `IF EXISTS` clause requires the role in use or a role in the active role hierarchy to have the appropriate
  [warehouse privileges](../../user-guide/security-access-control-privileges.md) on the warehouse.
* The ADD TABLES and DROP TABLES clauses only apply to interactive warehouses created with the
  INTERACTIVE keyword. These clauses are not available for standard or Snowpark-optimized
  warehouses.

## Billing and pricing

For information on Snowpark-optimized warehouse credit consumption, see
`Table 1` in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

For information about billing and pricing considerations for interactive warehouses, see
[Cost and billing considerations](../../user-guide/interactive.md).

> **Tip:**
>
> For information about cost implications of changing the RESOURCE_CONSTRAINT property, see
> [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](../../user-guide/warehouses-gen2.md).

## Examples

Rename warehouse `wh1` to `wh2`:

```sqlexample
ALTER WAREHOUSE IF EXISTS wh1 RENAME TO wh2;
```

Resume a warehouse named `my_wh` and then change the size of the warehouse while it is running:

```sqlexample
ALTER WAREHOUSE my_wh RESUME;

ALTER WAREHOUSE my_wh SET warehouse_size=MEDIUM;
```

Modify the memory resources to 256 GB and set the CPU architecture to x86 for Snowpark-optimized warehouse `so_warehouse`:

```sqlexample
ALTER WAREHOUSE so_warehouse SET
  RESOURCE_CONSTRAINT = 'MEMORY_16X_x86';
```

Change a warehouse to use generation 2 compute resources:

```sqlexample
ALTER WAREHOUSE my_wh SET GENERATION = '2';
```

Associate interactive tables with an interactive warehouse:

```sqlexample
ALTER WAREHOUSE interactive_demo ADD TABLES (orders, customers);
```

Remove interactive tables from an interactive warehouse:

```sqlexample
ALTER WAREHOUSE interactive_demo DROP TABLES (orders, customers);
```

---
title: BEGIN
source: https://docs.snowflake.com/en/sql-reference/sql/begin.md
section: SQL Commands
---

# BEGIN

Begins a transaction in the current session.

START TRANSACTION is a synonym for BEGIN.

See also:
:   [COMMIT](commit.md) , [ROLLBACK](rollback.md) , [SHOW TRANSACTIONS](show-transactions.md) , [DESCRIBE TRANSACTION](desc-transaction.md)

## Syntax

```sqlsyntax
BEGIN [ { WORK | TRANSACTION } ] [ NAME <name> ]

START TRANSACTION [ NAME <name> ]
```

## Parameters

`WORK | TRANSACTION`
:   Optional keywords that provide compatibility with other database systems.

`NAME name`
:   Optional string that assigns a name to the transaction. A name helps identify a transaction, but is not required and does not need to be unique.

## Usage notes

* All transactions have a system-generated internal ID. The transaction ID is a signed 64-bit (long) integer. The range of values is
  -9,223,372,036,854,775,808 (-2 63) to 9,223,372,036,854,775,807 (2 63 - 1).
* If you specify a name for a transaction, the NAME keyword is required.
* If a name is not specified, a system-generated name is assigned to the transaction.
* To complete a transaction, a COMMIT or ROLLBACK command must be explicitly executed. Until one of these commands is executed, the transaction
  remains open.
* When a SQL statement queries a stream within an explicit transaction, the stream is queried at the stream advance point (i.e. the timestamp)
  when the transaction began rather than when the statement was run. This behavior pertains both to DML statements and
  CREATE TABLE … AS SELECT (CTAS) statements that populate a new table with rows from an existing stream.
* If two BEGIN statements in a row are executed (within the same [scope](../transactions.md)), the second one is ignored. For
  example, in the following code, the second and third BEGINs have no effect; the existing open transaction continues.

  ```sqlexample
  BEGIN;
  BEGIN;    -- Ignored!
  INSERT INTO table1 ...;
  BEGIN;    -- Ignored!
  INSERT INTO table2 ...;
  COMMIT;
  ```

  The rules can be more complex if you are using
  [autonomous scoped transactions and stored procedures](../transactions.md).

## Examples

> **Note:**
>
> These examples do not include the necessary commands for completing the transactions. For examples of complete transactions, see [COMMIT](commit.md) or [ROLLBACK](rollback.md).

Begin a transaction:

> ```sqlexample
> BEGIN;
>
> SHOW TRANSACTIONS;
>
> +---------------+--------+--------------+--------------------------------------+-------------------------------+---------+
> |            id | user   |      session | name                                 | started_on                    | state   |
> |---------------+--------+--------------+--------------------------------------+-------------------------------+---------|
> | 1530042321085 | USER1  | 223347060798 | 56cb9163-77a3-4223-b3e0-aa24a20540a3 | 2018-06-26 12:45:21.085 -0700 | running |
> +---------------+--------+--------------+--------------------------------------+-------------------------------+---------+
>
> SELECT CURRENT_TRANSACTION();
>
> +-----------------------+
> | CURRENT_TRANSACTION() |
> |-----------------------|
> | 1530042321085         |
> +-----------------------+
> ```
>
> Note the system-assigned name, `56cb9163-77a3-4223-b3e0-aa24a20540a3`, for the transaction.

Begin a transaction with a specified name:

> ```sqlexample
> BEGIN NAME T1;
>
> SHOW TRANSACTIONS;
>
> +---------------+--------+--------------+------+-------------------------------+---------+
> |            id | user   |      session | name | started_on                    | state   |
> |---------------+--------+--------------+------+-------------------------------+---------|
> | 1530042377426 | USER1  | 223347060798 | T1   | 2018-06-26 12:46:17.426 -0700 | running |
> +---------------+--------+--------------+------+-------------------------------+---------+
>
> SELECT CURRENT_TRANSACTION();
>
> +-----------------------+
> | CURRENT_TRANSACTION() |
> |-----------------------|
> | 1530042377426         |
> +-----------------------+
> ```

This example is the same as the previous example, but uses START TRANSACTION instead of BEGIN:

> ```sqlexample
> START TRANSACTION NAME T2;
>
> SHOW TRANSACTIONS;
>
> +---------------+--------+--------------+------+-------------------------------+---------+
> |            id | user   |      session | name | started_on                    | state   |
> |---------------+--------+--------------+------+-------------------------------+---------|
> | 1530042467963 | USER1  | 223347060798 | T2   | 2018-06-26 12:47:47.963 -0700 | running |
> +---------------+--------+--------------+------+-------------------------------+---------+
>
> SELECT CURRENT_TRANSACTION();
>
> +-----------------------+
> | CURRENT_TRANSACTION() |
> |-----------------------|
> | 1530042467963         |
> +-----------------------+
> ```

---
title: CALL
source: https://docs.snowflake.com/en/sql-reference/sql/call.md
section: SQL Commands
---

# CALL

Calls a [stored procedure](../../developer-guide/stored-procedure/stored-procedures-overview.md).

See also:
:   [CREATE PROCEDURE](create-procedure.md) , [SHOW PROCEDURES](show-procedures.md)

## Syntax

```sqlsyntax
CALL <procedure_name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

## Required parameters

`procedure_name ( [ [ arg_name => ] arg , ... ] )`
:   Specifies the identifier (`procedure_name`) for the procedure to call and any input arguments.

    You can either specify the input arguments by name (`arg_name => arg`) or by position (`arg`).

    > **Note:**
    >
    > * You must either specify all arguments by name or by position. You can’t specify some of the arguments by name and other
    >   arguments by position.
    > * When you specify an argument by name, you can’t use double quotes around the argument name.
    > * If two procedures have the same name but different argument types, you can use the argument names to specify
    >   which procedure to execute, if the argument names are different. For more information, see
    >   [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

## Optional parameters

`INTO :snowflake_scripting_variable`
:   Sets the specified [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md) to the return value of
    the stored procedure.

## Examples

For more extensive examples of creating and calling stored procedures, see [Working with stored procedures](../../developer-guide/stored-procedure/stored-procedures-usage.md).

```sqlexample
CALL stproc1(5.14::FLOAT);
```

Each argument to a stored procedure can be a general expression:

```sqlexample
CALL stproc1(2 * 5.14::FLOAT);
```

An argument can be a subquery:

```sqlexample
CALL stproc1(SELECT COUNT(*) FROM stproc_test_table1);
```

You can call only one stored procedure per CALL statement. For example, the following statement fails:

```sqlexample
CALL proc1(1), proc2(2);                          -- Not allowed
```

Also, you cannot use a stored procedure CALL as part of an expression. For example, all the following statements fail:

```sqlexample
CALL proc1(1) + proc1(2);                         -- Not allowed
CALL proc1(1) + 1;                                -- Not allowed
CALL proc1(proc2(x));                             -- Not allowed
SELECT * FROM (call proc1(1));                    -- Not allowed
```

However, inside a stored procedure, the stored procedure can call
another stored procedure, or call itself recursively.

> **Caution:**
>
> Nested calls can exceed the maximum allowed stack depth, so be careful when nesting calls,
> especially when using recursion.

The following example calls a stored procedure named `sv_proc1` and passes in a string literal and number as input arguments.
The example specifies the arguments by position:

```sqlexample
CALL sv_proc1('Manitoba', 127.4);
```

You can also specify the arguments by their names:

```sqlexample
CALL sv_proc1(province => 'Manitoba', amount => 127.4);
```

The following example demonstrates how to set and pass a [session variable](../session-variables.md) as an input
argument to a stored procedure:

```sqlexample
SET Variable1 = 49;
CALL sv_proc2($Variable1);
```

The following is an example of a Snowflake Scripting block that captures the return value of a stored procedure in a Snowflake
Scripting variable.

```sqlexample
DECLARE
  ret1 NUMBER;
BEGIN
  CALL sv_proc1('Manitoba', 127.4) into :ret1;
  RETURN ret1;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  ret1 NUMBER;
BEGIN
  CALL sv_proc1('Manitoba', 127.4) into :ret1;
  RETURN ret1;
END;
$$
;
```

---
title: CALL (with anonymous procedure)
source: https://docs.snowflake.com/en/sql-reference/sql/call-with.md
section: SQL Commands
---

# CALL (with anonymous procedure)

Creates and calls an anonymous procedure that is like a [stored procedure](../../developer-guide/stored-procedure/stored-procedures-overview.md) but is not
stored for later use.

With this command, you both create an anonymous procedure defined by parameters in the WITH clause and call that procedure.

You need not have a role with CREATE PROCEDURE schema privileges for this command.

The procedure runs with [caller’s rights](../../developer-guide/stored-procedure/stored-procedures-rights.md), which means that the procedure runs with
the privileges of the caller, uses the current session context, and has access to the caller’s session variables and parameters.

See also:
:   [CREATE PROCEDURE](create-procedure.md) , [CALL](call.md).

## Syntax

### Java

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE { JAVA }
  RUNTIME_VERSION = '<scala_or_java_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ AS '<procedure_definition>' ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

For Java procedures with [staged handlers](../../developer-guide/inline-or-staged.md), use the following syntax:

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE { JAVA }
  RUNTIME_VERSION = '<scala_or_java_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

### JavaScript

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS <result_data_type> [ [ NOT ] NULL ]
  LANGUAGE JAVASCRIPT
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  AS '<procedure_definition>'
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

### Python

For in-line procedures, use the following syntax:

```sqlsyntax
WITH <name> AS PROCEDURE ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE PYTHON
  RUNTIME_VERSION = '<python_version>'
  PACKAGES = ( 'snowflake-snowpark-python[==<version>]'[, '<package_name>[==<version>]' ... ])
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<function_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
  AS '<procedure_definition>'
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

For a procedure in which the code is in a file on a stage, use the following syntax:

```sqlsyntax
WITH <name> AS PROCEDURE ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE PYTHON
  RUNTIME_VERSION = '<python_version>'
  PACKAGES = ( 'snowflake-snowpark-python[==<version>]'[, '<package_name>[==<version>]' ... ])
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<module_file_name>.<function_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

### Scala

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE { SCALA }
  RUNTIME_VERSION = '<scala_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark_<scala_version>:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ AS '<procedure_definition>' ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

For Scala procedures with [staged handlers](../../developer-guide/inline-or-staged.md), use the following syntax:

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE { SCALA }
  RUNTIME_VERSION = '<scala_or_java_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark_<scala_version>:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

### Snowflake Scripting

```sqlsyntax
WITH <name> AS PROCEDURE ([ <arg_name> <arg_data_type> ]) [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE SQL
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  AS '<procedure_definition>'
  [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
CALL <name> ( [ [ <arg_name> => ] <arg> , ... ] )
  [ INTO :<snowflake_scripting_variable> ]
```

## Required parameters

### All languages

`WITH name AS PROCEDURE ( [ arg_name arg_data_type ] [ , ... ] )`
:   Specifies the identifier (`name`) and any input arguments for the procedure.

    * For the identifier:

      + The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
        identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also
        case-sensitive. See [Identifier requirements](../identifiers-syntax.md).
    * For the input arguments:

      + For `arg_name`, specify the name of the input argument.
      + For `arg_data_type`, use the Snowflake data type that corresponds to the handler language that you are using.

        - For [Java procedures](../../developer-guide/stored-procedure/java/procedure-java-overview.md), see [SQL-Java Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [JavaScript procedures](../../developer-guide/stored-procedure/stored-procedures-javascript.md), see
          [SQL and JavaScript data type mapping](../../developer-guide/stored-procedure/stored-procedures-javascript.md).
        - For [Python procedures](../../developer-guide/stored-procedure/python/procedure-python-overview.md), see
          [SQL-Python Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [Scala procedures](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md), see [SQL-Scala Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For Snowflake Scripting, a [SQL data type](../../sql-reference-data-types.md).
        > **Note:**
        >
        > For procedures you write in Java, Python, or Scala (which use Snowpark APIs), omit the argument for the Snowpark
        > `Session` object.
        >
        > The `Session` argument is not a formal parameter that you specify. When you execute this command, Snowflake automatically
        > creates a `Session` object and passes it to the handler function for your procedure.

`RETURNS result_data_type [ [ NOT ] NULL ]`
:   Specifies the type of the result returned by the procedure.

    Use NOT NULL to specify that the procedure must return only non-null values; the default is NULL, meaning that the procedure
    can return NULL.

    * For `result_data_type`, use the Snowflake data type that corresponds to the type of the language that you are using.

      + For [Java procedures](../../developer-guide/stored-procedure/java/procedure-java-overview.md), see [SQL-Java Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For [JavaScript procedures](../../developer-guide/stored-procedure/stored-procedures-javascript.md), see
        [SQL and JavaScript data type mapping](../../developer-guide/stored-procedure/stored-procedures-javascript.md).
      + For [Python procedures](../../developer-guide/stored-procedure/python/procedure-python-overview.md), see
        [SQL-Python Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For [Scala procedures](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md), see [SQL-Scala Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For Snowflake Scripting, a [SQL data type](../../sql-reference-data-types.md).
      > **Note:**
      >
      > Procedures you write in Java or Scala must have a return value. In Python, when a procedure returns no value, it is considered to be
      > returning `None`.
      >
      > Note that regardless of handler language, the WITH clause for this command must include a RETURNS clause that defines a return type,
      > even if the procedure does not explicitly return anything.
    * For `RETURNS TABLE ( [ col_name col_data_type [ , ... ] ] )`, if you know the
      [Snowflake data types](../../sql-reference-data-types.md) of the columns in the returned table, specify the column names and
      types:

      ```sqlexample
      WITH get_top_sales() AS PROCEDURE
        RETURNS TABLE (sales_date DATE, quantity NUMBER)
        ...
      CALL get_top_sales();
      ```

      Otherwise (e.g. if you are determining the column types during run time), you can omit the column names and types:

      ```sqlexample
      WITH get_top_sales() AS PROCEDURE
        ...
        RETURNS TABLE ()
      CALL get_top_sales();
      ```

      > **Note:**
      >
      > Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
      > applies whether you are creating a stored or anonymous procedure.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > CALL test_return_geography_table_1();
      > ```
      >
      > If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
      >
      > ```none
      > Stored procedure execution error: data type of returned table does not match expected returned table type
      > ```
      >
      > To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE()
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE()
      >   ...
      > CALL test_return_geography_table_1();
      > ```

      `RETURNS TABLE(...)` is supported only when the handler is written in the following languages:

      + [Java](../../developer-guide/stored-procedure/scala/procedure-scala-tabular-data.md)
      + [Python](../../developer-guide/stored-procedure/python/procedure-python-tabular-data.md)
      + [Scala](../../developer-guide/stored-procedure/scala/procedure-scala-tabular-data.md)
      + [Snowflake Scripting](../snowflake-scripting/return.md)

    As a practical matter, outside of a [Snowflake Scripting block](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md),
    [the returned value cannot be used because the call cannot be part of an expression](../../developer-guide/stored-procedures-vs-udfs.md).

`LANGUAGE language`
:   Specifies the language of the procedure’s handler code.

    Currently, the supported values for `language` include:

    * `JAVA` (for [Java](../../developer-guide/stored-procedure/java/procedure-java-overview.md))
    * `JAVASCRIPT` (for [JavaScript](../../developer-guide/stored-procedure/stored-procedures-javascript.md))
    * `PYTHON` (for [Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md))
    * `SCALA` (for [Scala](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md))
    * `SQL` (for [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md))

`AS procedure_definition`
:   Defines the code executed by the procedure. The definition can consist of any valid code.

    Note the following:

    * For procedures for which the code is not in-line, omit the AS clause. This includes procedures whose
      [handlers are on a stage](../../developer-guide/inline-or-staged.md).

      Instead, use the IMPORTS clause to specify the location of the file containing the code for the procedure. For
      details, see:

      + [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)
      + [Writing Java handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/java/procedure-java-overview.md)
      + [Writing Scala handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md)
    * You must use [string literal delimiters](../data-types-text.md) (`'` or `$$`) around
      `procedure definition`, even in Snowflake Scripting.
    * For procedures in JavaScript, if you are writing a string that contains newlines, you can use
      backquotes (also called “backticks”) around the string.

      The following example of a JavaScript procedure uses `$$` and backquotes because the body of the procedure
      contains single quotes and double quotes:

      ```sqlexample
      WITH proc3 AS PROCEDURE ()
        RETURNS VARCHAR
        LANGUAGE javascript
        AS
        $$
        var rs = snowflake.execute( { sqlText:
            `INSERT INTO table1 ("column 1")
                SELECT 'value 1' AS "column 1" ;`
            } );
        return 'Done.';
        $$
      CALL proc3();
      ```
    * Snowflake does not validate the handler code. However, invalid handler code will result in errors when you execute the command.

    For more details about stored procedures, see [Working with stored procedures](../../developer-guide/stored-procedure/stored-procedures-usage.md).

`CALL name ( [ [ arg_name => ] arg , ... ] )`
:   Specifies the identifier (`name`) for the procedure to call and any input arguments.

    You can either specify the input arguments by name (`arg_name => arg`) or by position (`arg`).

    > **Note:**
    >
    > * You must either specify all arguments by name or by position. You can’t specify some of the arguments by name and other
    >   arguments by position.
    > * When you specify an argument by name, you can’t use double quotes around the argument name.
    > * If two procedures have the same name but different argument types, you can use the argument names to specify
    >   which procedure to execute, if the argument names are different. For more information, see
    >   [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

### Java, Python, or Scala

`RUNTIME_VERSION = 'language_runtime_version'`
:   The language runtime version to use. Currently, the supported versions are:

    * Java: 11
    * Python:

      > Generally available versions:
      >
      > + 3.9 (deprecated)
      > + 3.10
      > + 3.11
      > + 3.12
      > + 3.13
    * Scala: 2.12

`PACKAGES = ( 'snowpark_package_name' [, 'package_name' ...] )`
:   A comma-separated list of the names of packages deployed in Snowflake that should be included in the handler code’s
    execution environment. The Snowpark package is required for procedures, so it must always be referenced in the PACKAGES clause.
    For more information about Snowpark, see [Snowpark API](../../developer-guide/snowpark/index.md).

    By default, the environment in which Snowflake runs procedures includes a selected set of packages for supported languages.
    When you reference these packages in the PACKAGES clause, it is not necessary to reference a file containing the package in the IMPORTS
    clause because the package is already available in Snowflake.

    For the list of supported packages and versions for a given language, query the
    [INFORMATION_SCHEMA.PACKAGES view](../info-schema/packages.md), specifying the language. For example:

    ```sqlexample
    SELECT * FROM information_schema.packages WHERE language = '<language>';
    ```

    where `language` is `java`, `python`, or `scala`.

    The syntax for referring to a package in the PACKAGES clause varies by the package’s language, as described below.

    * Java

      Specify the package name and version number using the following form:

      ```none
      domain:package_name:version
      ```

      To specify the latest version, specify `latest` for `version`.

      For example, to include a package from the latest Snowpark library in Snowflake, use the following:

      ```sqlexample
      PACKAGES = ('com.snowflake:snowpark:latest')
      ```

      When specifying a package from the Snowpark library, you must specify version 1.3.0 or later.
    * Python

      Snowflake includes a large number of packages available through Anaconda; for more information, see
      [Using third-party packages](../../developer-guide/udf/python/udf-python-packages.md).

      Specify the package name and version number using the following form:

      ```none
      package_name[==version]
      ```

      To specify the latest version, omit the version number.

      For example, to include the spacy package version 2.3.5 (along with the latest version of the required Snowpark package), use the
      following:

      ```sqlexample
      PACKAGES = ('snowflake-snowpark-python', 'spacy==2.3.5')
      ```

      When specifying a package from the Snowpark library, you must specify version 0.4.0 or later. Omit the version number to use the
      latest version available in Snowflake.
    * Scala

      Specify the package name and version number using the following form:

      ```none
      domain:package_name:version
      ```

      To specify the latest version, specify `latest` for `version`.

      For example, to include a package from the latest Snowpark library in Snowflake, use the following:

      ```sqlexample
      PACKAGES = ('com.snowflake:snowpark:latest')
      ```

      Snowflake supports using Snowpark version 0.9.0 or later in a Scala procedure. Note, however, that these versions have limitations.
      For example, versions prior to 1.1.0 do not support the use of transactions in a procedure.

`HANDLER = 'fully_qualified_method_name'`
:   * Python

      Use the name of the procedure’s function or method. This can differ depending on whether the code is in-line or
      referenced at a stage.

      + When the code is in-line, you can specify just the function name, as in the following example:

        ```sqlexample
        WITH myproc AS PROCEDURE()
          ...
          HANDLER = 'run'
          AS
          $$
          def run(session):
            ...
          $$
        CALL myproc();
        ```
      + When the code is imported from a stage, specify the fully-qualified handler function name as `<module_name>.<function_name>`.

        ```sqlexample
        WITH myproc AS PROCEDURE()
          ...
          IMPORTS = ('@mystage/my_py_file.py')
          HANDLER = 'my_py_file.run'
        CALL myproc();
        ```
    * Java and Scala

      Use the fully-qualified name of the method or function for the procedure. This is typically in the
      following form:

      ```none
      com.my_company.my_package.MyClass.myMethod
      ```

      where:

      ```none
      com.my_company.my_package
      ```

      corresponds to the package containing the object or class:

      ```none
      package com.my_company.my_package;
      ```

## Optional parameters

### All languages

`CALLED ON NULL INPUT` or . `RETURNS NULL ON NULL INPUT | STRICT`
:   Specifies the behavior of the procedure when called with null inputs. In contrast to system-defined functions, which
    always return null when any input is null, procedures can handle null inputs, returning non-null values even when an
    input is null:

    * `CALLED ON NULL INPUT` will always call the procedure with null inputs. It is up to the procedure to handle such
      values appropriately.
    * `RETURNS NULL ON NULL INPUT` (or its synonym `STRICT`) will not call the procedure if any input is null,
      so the statements inside the procedure will not be executed. Instead, a null value will always be returned. Note that
      the procedure might still return null for non-null inputs.

    Default: `CALLED ON NULL INPUT`

`INTO :snowflake_scripting_variable`
:   Sets the specified [Snowflake Scripting variable](../../developer-guide/snowflake-scripting/variables.md) to the return value of
    the stored procedure.

### Java, Python, or Scala

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [, 'stage_path_and_directory_or_file_name_to_read' ...] )`
:   The location (stage), path, and name of the directory or file(s) to import. You must set the `IMPORTS` clause to include any files that
    your procedure depends on:

    * If you are writing an in-line procedure, you can omit this clause, unless your code depends on classes defined outside
      the procedure or resource files.
    * Java or Scala: If you are writing a procedure whose handler will be compiled code, you must also include a path to the JAR file
      containing the procedure’s handler.
    * Python: If your procedure’s code will be on a stage, you must also include a path to the module file your code is in.

    Each file in the `IMPORTS` clause must have a unique name, even if the files are in different subdirectories or different
    stages.

## Usage notes

### General usage

* Procedures are not atomic; if one statement in a procedure fails, the other statements in the
  procedure are not necessarily rolled back. For information about procedures and transactions, see
  [Transaction management](../../developer-guide/stored-procedure/stored-procedures-usage.md).
* A procedure can return only a single value, such as a string (for example, a success/failure indicator)
  or a number (for example, an error code). If you need to return more extensive information, you can return a
  VARCHAR that contains values separated by a delimiter (such as a comma), or a semi-structured data type, such
  as [VARIANT](../data-types-semistructured.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

### Syntax

* Similar to when a [WITH](../constructs/with.md) clause is used with a SELECT statement, a WITH clause used with CALL supports
  specifying multiple CTEs separated by commas, in addition to the procedure definition. However, it is not possible to pass tabular
  values produced by a WITH clause to the CALL clause.

  It is, however, possible to specify a simple variable whose value is assigned in the WITH clause.
* The CALL clause must occur last in the syntax.

### Privileges

* Creating and calling a procedure with this command does not require a role with CREATE PROCEDURE schema privileges.
* The procedure’s handler code will be able to perform only actions permitted for the role assigned to the person who ran this command.

### Language-specific

* For Java procedures, see the [known limitations](../../developer-guide/stored-procedure/java/procedure-java-limitations.md).
* For Python procedures, see the [known limitations](../../developer-guide/stored-procedure/python/procedure-python-limitations.md).
* For Scala procedures, see the [known limitations](../../developer-guide/stored-procedure/scala/procedure-scala-limitations.md).

## Examples

The following example creates and calls a procedure, specifying the arguments by position:

Scala 2.12Scala 2.13 (Preview)

```sqlexample
WITH copy_to_table AS PROCEDURE (fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'DataCopy.copyBetweenTables'
  AS
  $$
    object DataCopy
    {
      def copyBetweenTables(session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =
      {
        session.table(fromTable).limit(count).write.saveAsTable(toTable)
        return "Success"
      }
    }
  $$
```

```sqlexample
WITH copy_to_table AS PROCEDURE (fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'DataCopy.copyBetweenTables'
  AS
  $$
    object DataCopy
    {
      def copyBetweenTables(session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =
      {
        session.table(fromTable).limit(count).write.saveAsTable(toTable)
        return "Success"
      }
    }
  $$
```

```sqlexample
CALL copy_to_table('table_a', 'table_b', 5);
```

The following example creates and calls a procedure, specifying the arguments by name:

Scala 2.12Scala 2.13 (Preview)

```sqlexample
WITH copy_to_table AS PROCEDURE (fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.12'
  PACKAGES = ('com.snowflake:snowpark_2.12:latest')
  HANDLER = 'DataCopy.copyBetweenTables'
  AS
  $$
    object DataCopy
    {
      def copyBetweenTables(session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =
      {
        session.table(fromTable).limit(count).write.saveAsTable(toTable)
        return "Success"
      }
    }
  $$
```

```sqlexample
WITH copy_to_table AS PROCEDURE (fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE SCALA
  RUNTIME_VERSION = '2.13'
  PACKAGES = ('com.snowflake:snowpark_2.13:latest')
  HANDLER = 'DataCopy.copyBetweenTables'
  AS
  $$
    object DataCopy
    {
      def copyBetweenTables(session: com.snowflake.snowpark.Session, fromTable: String, toTable: String, count: Int): String =
      {
        session.table(fromTable).limit(count).write.saveAsTable(toTable)
        return "Success"
      }
    }
  $$
```

Call the procedure:

```sqlexample
CALL copy_to_table(
  toTable => 'table_b',
  count => 5,
  fromTable => 'table_a');
```

For additional examples, refer to the following topics:

* For examples of Java procedures, see [Writing Java handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/java/procedure-java-overview.md).
* For examples of Python procedures, see [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md).
* For examples of Scala procedures, see [Writing Scala handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md).
* For examples of Snowflake Scripting stored procedures, see [Writing stored procedures in Snowflake Scripting](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

For procedure examples, see [Working with stored procedures](../../developer-guide/stored-procedure/stored-procedures-usage.md).

---
title: COMMENT
source: https://docs.snowflake.com/en/sql-reference/sql/comment.md
section: SQL Commands
---

# COMMENT

Adds a comment or overwrites an existing comment for an existing object.

Comments can be added to all objects (users, roles, warehouses, databases, tables, and so on). You can also use
this command to add comments to individual table columns, but not to constraints on columns.

## Syntax

```sqlsyntax
COMMENT [ IF EXISTS ] ON <object_type> <object_name> IS '<string_literal>';

COMMENT [ IF EXISTS ] ON COLUMN <table_name>.<column_name> IS '<string_literal>';
```

## Parameters

`ON object_type object_name`
:   Adds a comment to the object of the specified type (for example, `TABLE`, `SCHEMA`, `VIEW`, and so on)
    with the specified identifier.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ON COLUMN table_name.column_name`
:   Adds a comment to the specified table column.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`IS string_literal`
:   Specifies the comment to add.

    Default: `NULL`

## Usage notes

* You can also add or modify comments when you are creating or altering objects:

  + To add a comment, specify the `COMMENT` parameter in the [CREATE <object>](create.md) or [ALTER <object>](alter.md) command.
  + To modify an existing comment, specify the `COMMENT` parameter in the [ALTER <object>](alter.md) command.
* A slightly different syntax is used for adding or modifying comments on table columns:

  + To add a comment at creation, follow the column declaration with the `COMMENT` keyword (not property).
  + To modify a comment, use this command.
* To add a comment to a constraint, use the [CREATE TABLE](create-table.md) or [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md) commands.
* The DESCRIBE TABLE output doesn’t show comments for table constraints, such as multi-column primary keys. To see these comments,
  query the [TABLE_CONSTRAINTS view](../info-schema/table_constraints.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a schema with a comment, then overwrite the comment:

```sqlexample
CREATE SCHEMA my_schema COMMENT='this is comment1';

SHOW SCHEMAS LIKE 'my_schema';
```

```output
+-------------------------------+-----------+------------+------------+---------------+---------+------------------+---------+----------------+------+
| created_on                    | name      | is_default | is_current | database_name | owner   | comment          | options | retention_time | ...  |
|-------------------------------+-----------+------------+------------+---------------+---------+------------------+---------+----------------+------|
| 2025-02-26 12:08:52.363 -0800 | MY_SCHEMA | N          | Y          | MY_DB         | MY_ROLE | this is comment1 |         | 1              |  ... |
+-------------------------------+-----------+------------+------------+---------------+---------+------------------+---------+----------------+------+
```

```sqlexample
COMMENT ON SCHEMA my_schema IS 'now comment2';

SHOW SCHEMAS LIKE 'my_schema';
```

```output
+-------------------------------+-----------+------------+------------+---------------+---------+--------------+---------+----------------+-----+
| created_on                    | name      | is_default | is_current | database_name | owner   | comment      | options | retention_time | ... |
|-------------------------------+-----------+------------+------------+---------------+---------+--------------+---------+----------------+-----+
| 2025-02-26 12:08:52.363 -0800 | MY_SCHEMA | N          | Y          | MY_DB         | MY_ROLE | now comment2 |         | 1              | ... |
+-------------------------------+-----------+------------+------------+---------------+---------+--------------+---------+----------------+-----+
```

Create a table with a comment on a table column, then overwrite the comment:

```sqlexample
CREATE OR REPLACE TABLE test_comment_table_column(my_column STRING COMMENT 'this is comment3');

DESC TABLE test_comment_table_column;
```

```output
+-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+------------------+-------------+----------------+
| name      | type              | kind   | null? | default | primary key | unique key | check | expression | comment          | policy name | privacy domain |
|-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+------------------+-------------+----------------|
| MY_COLUMN | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | this is comment3 | NULL        | NULL           |
+-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+------------------+-------------+----------------+
```

```sqlexample
COMMENT ON COLUMN test_comment_table_column.my_column IS 'now comment4';

DESC TABLE test_comment_table_column;
```

```output
+-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+--------------+-------------+----------------+
| name      | type              | kind   | null? | default | primary key | unique key | check | expression | comment      | policy name | privacy domain |
|-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+--------------+-------------+----------------|
| MY_COLUMN | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | now comment4 | NULL        | NULL           |
+-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+--------------+-------------+----------------+
```

Create a view with a comment, then overwrite the comment:

```sqlexample
CREATE OR REPLACE VIEW test_comment_view COMMENT='this is comment5' AS (SELECT * FROM test_comment_table_column);

SHOW VIEWS LIKE 'test_comment_view';
```

```output
+-------------------------------+-------------------+----------+---------------+-------------+---------+------------------+-----+
| created_on                    | name              | reserved | database_name | schema_name | owner   | comment          | ... |
|-------------------------------+-------------------+----------+---------------+-------------+---------+------------------+-----+
| 2025-02-26 12:38:35.440 -0800 | TEST_COMMENT_VIEW |          | MY_DB         | MY_SCHEMA   | MY_ROLE | this is comment5 | ... |
+-------------------------------+-------------------+----------+---------------+-------------+---------+------------------+-----+
```

```sqlexample
COMMENT ON VIEW test_comment_view IS 'now comment6';

SHOW VIEWS LIKE 'test_comment_view';
```

```output
+-------------------------------+-------------------+----------+---------------+-------------+---------+--------------+-----+
| created_on                    | name              | reserved | database_name | schema_name | owner   | comment      | ... |
|-------------------------------+-------------------+----------+---------------+-------------+---------+--------------+-----+
| 2025-02-26 12:38:35.440 -0800 | TEST_COMMENT_VIEW |          | MY_DB         | MY_SCHEMA   | MY_ROLE | now comment6 | ... |
+-------------------------------+-------------------+----------+---------------+-------------+---------+--------------+-----+
```

---
title: COMMIT
source: https://docs.snowflake.com/en/sql-reference/sql/commit.md
section: SQL Commands
---

# COMMIT

Commits an open transaction in the current session.

See also:
:   [BEGIN](begin.md) , [ROLLBACK](rollback.md) , [SHOW TRANSACTIONS](show-transactions.md) , [DESCRIBE TRANSACTION](desc-transaction.md)

## Syntax

```sqlsyntax
COMMIT [ WORK ]
```

## Parameters

`WORK`
:   Optional keyword that provides compatibility with other database systems.

## Usage notes

* If two COMMIT statements in a row are executed (within the same [scope](../transactions.md)), the
  second one is ignored. For example, in the following code, the second COMMIT has no effect; there is no open
  transaction to commit.

  ```sqlexample
  BEGIN;
  INSERT INTO table1 ...;
  COMMIT;
  COMMIT;  -- Ignored!
  ```

  The rules can be more complex if you are using
  [autonomous scoped transactions and stored procedures](../transactions.md).

## Examples

Begin a transaction, insert some values into a table, then complete the transaction by committing it:

```sqlexample
SELECT COUNT(*) FROM A1;

+----------+
| COUNT(*) |
|----------+
|        0 |
+----------+

BEGIN NAME T3;

SELECT CURRENT_TRANSACTION();

+-----------------------+
| CURRENT_TRANSACTION() |
|-----------------------+
| 1432071497832         |
+-----------------------+

INSERT INTO A1 VALUES (1), (2);

+-------------------------+
| number of rows inserted |
|-------------------------+
|                       2 |
+-------------------------+

COMMIT;

SELECT CURRENT_TRANSACTION();

+-----------------------+
| CURRENT_TRANSACTION() |
|-----------------------+
| [NULL]                |
+-----------------------+

SELECT LAST_TRANSACTION();

+--------------------+
| LAST_TRANSACTION() |
|--------------------+
| 1432071497832      |
+--------------------+

SELECT COUNT(*) FROM A1;

+----------+
| COUNT(*) |
|----------+
|        2 |
+----------+
```

---
title: COPY FILES
source: https://docs.snowflake.com/en/sql-reference/sql/copy-files.md
section: SQL Commands
---

# COPY FILES

Copy files from a source location to an output stage. You can either use a stage or a query as the source of the files to copy.

* Use a stage as the source to copy files from one stage to another without renaming.
* Use a query as a source for the following tasks:

  + Copy from or to a set of files defined by a query ([SELECT](select.md) statement).
  + Copy from files written by a UDF (for example, [Writing files from Snowpark Python UDFs and UDTFs](../../developer-guide/snowpark/python/creating-udfs.md)).
  + Copy from scoped or stage URLs.

You can copy from and to existing named stages, as the following table illustrates:

| Source location | Target location |
| --- | --- |
| Internal named stage | Internal named stage |
| External stage | Internal named stage |
| Internal named stage | External stage |
| External stage | External stage |
| Snowflake [Git repository clone](../../developer-guide/git/git-overview.md) | Internal named stage |
| Snowflake [Git repository clone](../../developer-guide/git/git-overview.md) | External stage |

A target or source external stage can reference files in any of the following cloud storage services or on-premises locations:

* Amazon S3
* Google Cloud Storage
* Microsoft Azure Blob storage
* Microsoft Data Lake Storage Gen2
* Microsoft Azure General-purpose v2
* [Amazon S3-compatible storage](../../user-guide/data-load-s3-compatible-storage.md)

See also:
:   [External stages](../../user-guide/data-load-overview.md) , [Internal stages](../../user-guide/data-load-overview.md), [Snowflake Git repository clone](../../developer-guide/git/git-overview.md)

## Syntax

### Copy from a stage

```sqlsyntax
COPY FILES INTO @[<namespace>.]<stage_name>[/<path>/]
  FROM @[<namespace>.]<stage_name>[/<path>/]
  [ FILES = ( '<file_name>' [ , '<file_name>' ] [ , ... ] ) ]
  [ PATTERN = '<regex_pattern>' ]
  [ DETAILED_OUTPUT = { TRUE | FALSE } ]
```

### Copy from a query

```sqlsyntax
COPY FILES INTO @[<namespace>.]<stage_name>[/<path>/]
  FROM ( SELECT <existing_url> [ , <new_filename> ] FROM ... )
  [ DETAILED_OUTPUT = { TRUE | FALSE } ]
```

## Required parameters

`INTO @[namespace.]stage_name[/path/]`
:   > Specifies the target location for the copied files.

    * `namespace` is the database or schema in which the internal or external stage resides, in the form of `database_name.schema_name` or `schema_name`. The namespace is optional if a database and schema are currently in use within the user session; otherwise, it is required.
    * `path` is an optional, case-sensitive path in the cloud storage location that specifies a set of files to copy from the source stage or a specific location on the target stage. Your cloud storage service might call the path a *prefix* or a *folder*.

    > > **Note:**
    > >
    > > * If a target or source path name includes special characters or spaces, you must enclose the `INTO ...`
    > >   value in single quotes.
    > > * The values for `INTO ...` must be literal constants.
    > >   The values cannot be [SQL variables](../session-variables.md).

### Using a stage as a source

`FROM @[namespace.]stage_name[/path/]`
:   Specifies the source location where the files to copy are staged. The values provided to `FROM ...` follow the same specification and constraints as `INTO...` values.

### Using a query as a source

`FROM (SELECT existing_url [ , new_filename ] FROM ... )`
:   Specifies the source location and optional relative output location for the copied files. Each row that the [SELECT](select.md)
    query returns represents a file to copy.

    * `existing_url` is a scoped URL, stage name, or stage URL.
    * `new_filename` is an optional relative path from the output stage specified for the `INTO` clause.

    Snowflake copies the file to the following location:

    `@[<namespace>.]<stage_name>[/<path>]<new_filename>`

    If you don’t specify a value for `new_filename`, Snowflake uses the relative path of the `existing_url`.

## Optional parameters

`FILES = ( 'file_name' [ , 'file_name' ... ] )`
:   Specifies a list of one or more comma-separated file names to copy.
    The files must already be staged in the source location that you specify in the command.
    Snowflake skips any specified files that can’t be found.

    You can specify a maximum of 1000 file names.

    Copy files from query does not support this option. Instead, use the query to provide the filename list.

    > **Note:**
    >
    > To set the file path for external stages, Snowflake prepends the URL in the stage definition to each file name in the list.
    >
    > However, Snowflake does not insert a separator between the path and file name.
    > You must explicitly include a separator (`/`) at the end of the URL in the stage definition
    > or at the beginning of each file name in the `FILES` list.

`PATTERN = 'regex_pattern'`
:   Specifies a regular expression pattern for filtering the list of files to copy.
    This command applies the regular expression to the entire storage location in the `FROM` clause.

    Copy files from query does not support this option. Instead, use the query to match the pattern.

    > **Tip:**
    >
    > For best performance, avoid patterns that filter on a large number of files.

`DETAILED_OUTPUT = { TRUE | FALSE }`
:   Specifies whether the command output should summarize the results of the copy operation or list each file copied.

    Values:
    :   * If `TRUE`, the output includes a row for each file copied to the target location.
          A single column named `file` contains the target path (if applicable) and file name for each copied file.
        * If `FALSE`, the output is a single row with the number of files that were copied.

    Default:
    :   `TRUE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have the following
[privileges](../../user-guide/security-access-control-overview.md) (depending on the source and target locations) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | External stage | Required on a source or target external stage. |
| READ | Internal named stage | Required on a source internal stage. |
| WRITE | Internal named stage | Required on a target internal stage. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This command does not support the following:

  + Copying files to or from *table* stages.
  + When using a stage as a source, copying files to or from *user* stages.
  + Copying data in archival cloud storage classes that requires restoration before it can be retrieved.
    Archival storage classes include Amazon S3 Glacier Flexible Retrieval, Glacier Deep Archive,
    or Microsoft Azure Archive Storage.
  + Copying files that are larger than 5GB.
* Considerations for running this command:

  + COPY FILES statements overwrite any existing files with matching names in the target location. The command does
    not remove any existing files that don’t match the names of the copied files.
  + If a file copy operation fails, Snowflake does not perform any automatic cleanup.
  + **Copying files from Google Cloud Storage**: A COPY FILES statement might fail if the object list for an external stage includes
    one or more directory blobs. A *directory blob* is a path that ends in a forward slash character (`/`). In the following example output
    for `LIST @<stage>`, `my_gcs_stage/load/` is a directory blob.

    ```output
    +---------------------------------------+------+----------------------------------+-------------------------------+
    | name                                  | size | md5                              | last_modified                 |
    |---------------------------------------+------+----------------------------------+-------------------------------|
    | my_gcs_stage/load/                    |  12  | 12348f18bcb35e7b6b628ca12345678c | Mon, 11 Aug 2022 16:57:43 GMT |
    | my_gcs_stage/load/data_0_0_0.csv.gz   |  147 | 9765daba007a643bdff4eae10d43218y | Mon, 11 Aug 2022 18:13:07 GMT |
    +---------------------------------------+------+----------------------------------+-------------------------------+
    ```

    Google creates directory blobs when you use the Google Cloud console to create a directory.

    To avoid this issue and specify which files to copy, use the `PATTERN` option (for copy from stage) or `FROM` (for copy from query).

    For an example, see Copy files using pattern matching.
  + Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
    To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
    For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
    or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.

* The COPY FILES command incurs data transfer and compute costs:

  + **Data transfer**: Cloud providers might charge for data transferred out of their own network. To recover these expenses,
    Snowflake charges a per-byte fee when you copy files from an internal Snowflake stage
    into an external stage in a different [region](../../user-guide/intro-regions.md)
    or with a different cloud provider. Snowflake does not charge for data ingress
    (for example, when copying files from an external stage into an internal stage).

    For more information about data transfer billing, see [Understanding data transfer cost](../../user-guide/cost-understanding-data-transfer.md).
  + **Compute**: COPY FILES is a [serverless](../../user-guide/cost-understanding-compute.md) feature and doesn’t require a virtual warehouse.
    The line item for the COPY FILES command on your Snowflake bill does not include any cloud services charges.

    For more information about compute resource billing, see [Understanding compute cost](../../user-guide/cost-understanding-compute.md).
  > **Note:**
  >
  > Some Snowflake features, such as Native Apps and worksheets, incur COPY FILES charges. As a result, you might see
  > COPY FILES charges even if you haven’t executed the COPY FILES command. For more information about these charges,
  > contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* Snowflake does not maintain a file copy history for this command.

## Examples

### Copy files

Copy all of the files from an existing source stage (`src_stage`) to an existing target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage;
```

> **Note:**
>
> To copy files from or to an external stage with a protected storage location,
> make sure the stage definition includes credentials to access the cloud storage location.

Specify the names of files to copy from an existing source stage (`src_stage`) to an existing target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage
  FILES = ('file1.csv', 'file2.csv');
```

Copy files from a specific path on an existing stage (`src_stage/src_path/`)
to a specific path on an existing target stage (`trg_stage/trg_path/`):

```sqlexample
COPY FILES
  INTO @trg_stage/trg_path/
  FROM @src_stage/src_path/;
```

### Copy files using pattern matching

Use pattern matching to load only compressed CSV files in any path on an existing source stage (`src_stage`) to an existing target stage (`trg_stage`):

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage
  PATTERN='.*/.*/.*[.]csv[.]gz';
```

The `.*` component represents zero or more occurrences of any character.
The square brackets escape the period character (`.`) that precedes a file extension.

Copy only uncompressed CSV files whose names include the string `sales`:

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM @src_stage
  PATTERN='.*sales.*[.]csv';
```

### Copy files using a query

#### Copy a single file

The file name remains the same as in the source stage.

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM (SELECT '@src_stage/file.txt');
```

#### Copy and rename a single file

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM (SELECT '@src_stage/file.txt', 'new_filename.txt');
```

#### Copy all of the files from a table

To copy multiple files using a query, you can use a generic query.

```sqlexample
-- Create a table with URLs
CREATE TABLE urls(src_file STRING, trg_file STRING);
INSERT INTO urls VALUES ('@src_stage/file.txt', 'new_filename.txt');

-- Insert additional URLs here
COPY FILES
  INTO @trg_stage
  FROM (SELECT src_file, trg_file FROM urls);
```

#### Copy only some files

This example uses a filter to copy files that match a pattern.

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM (SELECT src_file, trg_file FROM urls WHERE src_file LIKE '%file%');
```

#### Copy files from a directory table

```sqlexample
COPY FILES
  INTO @trg_stage
  FROM (SELECT relative_path FROM directory(@src_stage) WHERE relative_path LIKE '%.txt');
```

#### Copy files with detailed output

* To produce command output with a list of files that are copied to the target location, use `DETAILED_OUTPUT = TRUE`.

  The output has a single column named `file` that contains the target path, if applicable, and the file name for each copied file.

  ```sqlexample
  COPY FILES
    INTO @trg_stage
    FROM @src_stage
    DETAILED_OUTPUT = TRUE;
  ```

  An example output:

  ```output
  +--------------------+
  | file               |
  |--------------------|
  | employees01.csv.gz |
  | employees02.csv.gz |
  | employees03.csv.gz |
  | employees04.csv.gz |
  | employees05.csv.gz |
  +--------------------+
  ```
* To produce command output that summarizes the results of the copy operation, use `DETAILED_OUTPUT =  FALSE`.

  The output is a single row with the number of files that were copied.

  ```sqlexample
  COPY FILES
    INTO @trg_stage
    FROM @src_stage
    DETAILED_OUTPUT = FALSE;
  ```

  An example output:

  ```output
  +-------------------+
  | numOfFilesCopied  |
  | 5                 |
  +-------------------+
  ```

---
title: COPY INTO <location>
source: https://docs.snowflake.com/en/sql-reference/sql/copy-into-location.md
section: SQL Commands
---

# COPY INTO *<location>*

Unloads data from a table (or query) into one or more files in one of the following locations:

* Named internal stage (or table/user stage). The files can then be downloaded from the stage/location using the [GET](get.md) command.
* Named external stage that references an external location (Amazon S3, Google Cloud Storage, or Microsoft Azure).
* External location (Amazon S3, Google Cloud Storage, or Microsoft Azure).

See also:
:   [COPY INTO <table>](copy-into-table.md)

## Syntax

```sqlsyntax
COPY INTO { internalStage | externalStage | externalLocation }
     FROM { [<namespace>.]<table_name> | ( <query> ) }
[ PARTITION BY <expr> ]
[ FILE_FORMAT = ( { FORMAT_NAME = '[<namespace>.]<file_format_name>' |
                    TYPE = { CSV | JSON | PARQUET } [ formatTypeOptions ] } ) ]
[ copyOptions ]
[ VALIDATION_MODE = RETURN_ROWS ]
[ HEADER ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```
>
> ```sqlsyntax
> externalStage ::=
>   @[<namespace>.]<ext_stage_name>[/<path>]
> ```
>
> ```sqlsyntax
> externalLocation (for Amazon S3) ::=
>   '<protocol>://<bucket>[/<path>]'
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( {  { AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' [ AWS_TOKEN = '<string>' ] } } ) } ]
>   [ ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] [ MASTER_KEY = '<string>' ] |
>                    [ TYPE = 'AWS_SSE_S3' ] |
>                    [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] |
>                    [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> externalLocation (for Google Cloud Storage) ::=
>   'gcs://<bucket>[/<path>]'
>   [ STORAGE_INTEGRATION = <integration_name> ]
>   [ ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = '<string>' ] | [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> externalLocation (for Microsoft Azure) ::=
>   'azure://<account>.blob.core.windows.net/<container>[/<path>]'
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( [ AZURE_SAS_TOKEN = '<string>' ] ) } ]
>   [ ENCRYPTION = ( [ TYPE = { 'AZURE_CSE' | 'NONE' } ] [ MASTER_KEY = '<string>' ] ) ]
> ```
>
> ```sqlsyntax
> formatTypeOptions ::=
> -- If FILE_FORMAT = ( TYPE = CSV ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      FILE_EXTENSION = '<string>'
>      ESCAPE = '<character>' | NONE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( '<string1>' [ , '<string2>' , ... ] )
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
> -- If FILE_FORMAT = ( TYPE = JSON ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      FILE_EXTENSION = '<string>'
> -- If FILE_FORMAT = ( TYPE = PARQUET ... )
>      COMPRESSION = AUTO | LZO | SNAPPY | NONE
>      SNAPPY_COMPRESSION = TRUE | FALSE
> ```
>
> ```sqlsyntax
> copyOptions ::=
>      OVERWRITE = TRUE | FALSE
>      SINGLE = TRUE | FALSE
>      MAX_FILE_SIZE = <num>
>      INCLUDE_QUERY_ID = TRUE | FALSE
>      DETAILED_OUTPUT = TRUE | FALSE
> ```

## Required parameters

`INTO ...`
:   Specifies the internal or external location where the data files are unloaded:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are unloaded to the specified named internal stage. |
    > | `@[namespace.]ext_stage_name[/path]` | Files are unloaded to the specified named external stage. |
    > | `@[namespace.]%table_name[/path]` | Files are unloaded to the stage for the specified table. |
    > | `@~[/path]` | Files are unloaded to the stage for the current user. |
    > | `'protocol://bucket[/path]'` | Files are unloaded to the specified external location (S3 bucket). Additional parameters could be required. For details, see Additional Cloud Provider Parameters (in this topic). |
    > | `'gcs://bucket[/path]'` | Files are unloaded to the specified external location (Google Cloud Storage bucket). Additional parameters could be required. For details, see Additional Cloud Provider Parameters (in this topic). |
    > | `'azure://account.blob.core.windows.net/container[/path]'` | Files are unloaded to the specified external location (Azure container). Additional parameters could be required. For details, see Additional Cloud Provider Parameters (in this topic). |

    Where:

    > * `namespace` is the database and/or schema in which the internal or external stage resides, in the form of
    >   `database_name.schema_name` or `schema_name`. It is optional if a database and schema are currently in use within
    >   the user session; otherwise, it is required.
    > * `protocol` is one of the following:
    >
    >   + `s3` refers to S3 storage in public AWS regions outside of China.
    >   + `s3china` refers to S3 storage in public AWS regions in China.
    >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    >
    >   Accessing cloud storage in a [government region](../../user-guide/intro-regions.md) using a storage integration is limited to Snowflake
    >   accounts hosted in the same government region.
    >
    >   Similarly, if you need to access cloud storage in a region in China, you can use a storage integration only from a Snowflake
    >   account hosted in the same region in China.
    >
    >   In these cases, use the CREDENTIALS parameter in the [CREATE STAGE](create-stage.md) command (rather than using a storage
    >   integration) to provide the credentials for authentication.
    > * `bucket` is the name of the bucket.
    >
    > * `account` is the name of the Azure account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint for all
    >   supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
    >
    >   Note that currently, accessing Azure blob storage in [government regions](../../user-guide/intro-regions.md) using a storage
    >   integration is limited to Snowflake accounts hosted on Azure in the same government region. Accessing your blob storage from an
    >   account hosted outside of the government region using direct credentials is supported.
    > * `container` is the name of the Azure container (e.g. `mycontainer`).
    >
    > * The optional `path` parameter specifies a folder and filename prefix for the file(s) containing unloaded data. If a filename
    >   prefix is not included in `path` or if the `PARTITION BY` parameter is specified, the filenames for
    >   the generated data files are prefixed with `data_`.
    >
    >   Relative path modifiers such as `/./` and `/../` are interpreted literally, because “paths” are literal prefixes for a name.
    >   For example:
    >
    >   > ```sqlexample
    >   > -- S3 bucket
    >   > COPY INTO 's3://mybucket/./../a.csv' FROM mytable;
    >   >
    >   > -- Google Cloud Storage bucket
    >   > COPY INTO 'gcs://mybucket/./../a.csv' FROM mytable;
    >   >
    >   > -- Azure container
    >   > COPY INTO 'azure://myaccount.blob.core.windows.net/mycontainer/./../a.csv' FROM mytable;
    >   > ```
    >
    >   In these COPY statements, Snowflake creates a file that is literally named `./../a.csv` in the storage location.

    > **Note:**
    >
    > * If the internal or external stage or path name includes special characters, including spaces, enclose the `INTO ...` string in
    >   single quotes.
    > * The `INTO ...` value must be a literal constant. The value cannot be a [SQL variable](../session-variables.md).
    > * When writing to an external stage within the Snowflake Native App Framework, you must use `STAGE_URL` to specify a URL instead of the external stage name and path.

`FROM ...`
:   Specifies the source of the data to be unloaded, which can either be a table or a query:

    > `[namespace.]table_name`
    > :   Specifies the name of the table from which data is unloaded.
    >
    >     Namespace optionally specifies the database and/or schema in which the table resides, in the form of `database_name.schema_name`
    >     or `schema_name`. It is optional if a database and schema are currently in use within the user session; otherwise, it is
    >     required.
    >
    > `( query )`
    > :   [SELECT](select.md) statement that returns data to be unloaded into files. You can limit the number of rows returned by specifying a
    >     [LIMIT / FETCH](../constructs/limit.md) clause in the query.
    >
    >     > **Note:**
    >     >
    >     > When casting column values to a data type using the [CAST , ::](../functions/cast.md) function, verify the data type supports
    >     > all of the column values. Values too long for the specified data type could be truncated.

### Additional cloud provider parameters

`STORAGE_INTEGRATION = integration_name` or . `CREDENTIALS = ( cloud_specific_credentials )`
:   Supported when the COPY statement specifies an external storage URI rather than an external stage name for the target cloud storage location. Specifies the security credentials for connecting to the cloud provider and accessing the private storage container where the unloaded files are staged.

    Required only for unloading into an external private cloud storage location; not required for public buckets/containers

    **Amazon S3**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > Snowflake recommends the use of storage integrations. This option avoids the need to supply cloud storage credentials using the CREDENTIALS
    >     > parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' [ AWS_TOKEN = 'string' ] )` or . `CREDENTIALS = ( AWS_ROLE = 'string' )`
    > :   Specifies the security credentials for connecting to AWS and accessing the private S3 bucket where the unloaded files are staged. For more
    >     information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
    >
    >     The credentials you specify depend on whether you associated the Snowflake access permissions for the bucket with an AWS IAM (Identity &
    >     Access Management) user or role:
    >
    >     * **IAM user:** Temporary IAM credentials are required. Temporary (aka “scoped”) credentials are generated by AWS Security Token Service
    >       (STS) and consist of three components:
    >
    >       > + `AWS_KEY_ID`
    >       > + `AWS_SECRET_KEY`
    >       > + `AWS_TOKEN`
    >
    >       All three are required to access a private bucket. After a designated period of time, temporary credentials expire and can no
    >       longer be used. You must then generate a new set of valid temporary credentials.
    >
    >       > **Important:**
    >       >
    >       > COPY commands contain complex syntax and sensitive information, such as credentials. In addition, they are executed frequently and are
    >       > often stored in scripts or worksheets, which could lead to sensitive information being inadvertently exposed. The COPY command allows
    >       > permanent (aka “long-term”) credentials to be used; however, for security reasons, do not use permanent credentials in COPY
    >       > commands. Instead, use temporary credentials.
    >       >
    >       > If you must use permanent credentials, use [external stages](create-stage.md), for which credentials are entered
    >       > once and securely stored, minimizing the potential for exposure.
    >     * **IAM role:** Omit the security credentials and access keys and, instead, identify the role using `AWS_ROLE` and specify the AWS
    >       role ARN (Amazon Resource Name).
    >
    >       > **Important:**
    >       >
    >       > The ability to use an AWS IAM role to access a private S3 bucket to load or unload data is now deprecated (i.e. support will be removed
    >       > in a future release, TBD). Snowflake recommends modifying any existing S3 stages that use this feature to instead reference storage
    >       > integration objects. For instructions, see [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md).

    **Google Cloud Storage**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).

    **Microsoft Azure**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > Snowflake recommends the use of storage integrations. This option avoids the need to supply cloud storage credentials using the
    >     > CREDENTIALS parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AZURE_SAS_TOKEN = 'string' )`
    > :   Specifies the SAS (shared access signature) token for connecting to Azure and accessing the private container where the files containing
    >     data are staged. Credentials are generated by Azure.

`ENCRYPTION = ( cloud_specific_encryption )`
:   For use in ad hoc COPY statements (statements that do not reference a named external stage). Required only for unloading data to files in encrypted storage locations

    **Amazon S3**

    > `ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] [ MASTER_KEY = '<string>' ] | [ TYPE = 'AWS_SSE_S3' ] | [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] | [ TYPE = 'NONE' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AWS_CSE`: Client-side encryption (requires a `MASTER_KEY` value). Currently, the client-side
    > >       [master key](https://csrc.nist.gov/glossary/term/master_key) you provide can only be a symmetric key. Note that, when a
    > >       `MASTER_KEY` value is provided, Snowflake assumes `TYPE = AWS_CSE` (i.e. when a `MASTER_KEY` value is
    > >       provided, `TYPE` is not required).
    > >     * `AWS_SSE_S3`: Server-side encryption that requires no additional encryption settings.
    > >     * `AWS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >     For more information about the encryption types, see the AWS documentation for
    > >     [client-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html)
    > >     or [server-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html).
    > >
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to `AWS_CSE` encryption only)
    > > :   Specifies the client-side master key used to encrypt the files in the bucket. The master key must be a 128-bit or 256-bit key in
    > >     Base64-encoded form.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files unloaded into the bucket. If no value is
    > >     provided, your default KMS key ID is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading.

    **Google Cloud Storage**

    > `ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' | 'NONE' ] [ KMS_KEY_ID = 'string' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `GCS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >       For more information, see the Google Cloud documentation:
    > >
    > >       + <https://cloud.google.com/storage/docs/encryption/customer-managed-keys>
    > >       + <https://cloud.google.com/storage/docs/encryption/using-customer-managed-keys>
    > >     * `NONE`: No encryption.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the Cloud KMS-managed key that is used to encrypt files unloaded into the bucket. If no value
    > >     is provided, your default KMS key ID set on the bucket is used to encrypt files on unload.
    > >
    > >     This value is ignored for data loading. The load operation should succeed if the service account has sufficient permissions
    > >     to decrypt data in the bucket.

    **Microsoft Azure**

    > `ENCRYPTION = ( [ TYPE = 'AZURE_CSE' | 'NONE' ] [ MASTER_KEY = 'string' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AZURE_CSE`: Client-side encryption (requires a MASTER_KEY value). For information, see the
    > >       [Client-side encryption information](https://docs.microsoft.com/en-us/azure/storage/common/storage-client-side-encryption) in
    > >       the Microsoft Azure documentation.
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to AZURE_CSE encryption only)
    > > :   Specifies the client-side master key used to encrypt files. The master key must be a 128-bit or 256-bit key in Base64-encoded form.

## Optional parameters

`PARTITION BY expr`
:   Specifies an expression used to partition the unloaded table rows into separate files. Supports any SQL expression that evaluates to a
    string.

    The unload operation splits the table rows based on the partition expression and determines the number of files to create based on the
    amount of data and number of parallel operations, distributed among the compute resources in the warehouse.

    Filenames are prefixed with `data_` and include the partition column values. Individual filenames in each partition are identified
    with a universally unique identifier (UUID). The UUID is the query ID of the COPY statement used to unload the data files.

    > **Caution:**
    >
    > COPY INTO *<location>* statements write partition column values to the unloaded file names. Snowflake recommends partitioning your
    > data on common data types such as dates or timestamps rather than potentially sensitive string or integer values.
    >
    > Note that file URLs are included in the internal logs that Snowflake maintains to aid in debugging issues when customers create Support
    > cases. As a result, data in columns referenced in a PARTITION BY expression is also indirectly stored in internal logs. These logs
    > might be processed outside of your deployment region. Hence, as a best practice, only include dates, timestamps, and Boolean data types
    > in PARTITION BY expressions.
    >
    > If you prefer to disable the PARTITION BY parameter in COPY INTO *<location>* statements for your account, please contact
    > [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
    >
    > Note that Snowflake provides a set of parameters to further restrict data unloading operations:
    >
    > * [PREVENT_UNLOAD_TO_INLINE_URL](../parameters.md) prevents ad hoc data unload operations to external cloud storage locations (i.e. COPY INTO
    >   *<location>* statements that specify the cloud storage URL and access settings directly in the statement).
    > * [PREVENT_UNLOAD_TO_INTERNAL_STAGES](../parameters.md) prevents data unload operations to any internal stage, including user stages,
    >   table stages, or named internal stages.

    For an example, see Partitioning Unloaded Rows to Parquet Files (in this topic).

    > **Note:**
    >
    > * The following copy option values are not supported in combination with PARTITION BY:
    >
    >   + `OVERWRITE = TRUE`
    >   + `SINGLE = TRUE`
    >   + `INCLUDE_QUERY_ID = FALSE`
    > * Including the ORDER BY clause in the SQL statement in combination with PARTITION BY does not guarantee that the specified order is
    >   preserved in the unloaded files.
    > * If the PARTITION BY expression evaluates to NULL, the partition path in the output filename is `_NULL_`
    >   (e.g. `mystage/_NULL_/data_01234567-0123-1234-0000-000000001234_01_0_0.snappy.parquet`).
    > * When unloading to files of type `PARQUET`:
    >
    >   + Small data files unloaded by parallel execution threads are merged automatically into a single file that matches the MAX_FILE_SIZE
    >     copy option value as closely as possible.
    >   + All row groups are 128 MB in size. A row group is a logical horizontal partitioning of the data into rows. There is no physical
    >     structure that is guaranteed for a row group. A row group consists of a column chunk for each column in the dataset.
    >   + The unload operation attempts to produce files as close in size to the `MAX_FILE_SIZE` copy option setting as possible. The
    >     default value for this copy option is 16 MB. Note that this behavior applies only when unloading data to Parquet files.
    >   + VARIANT columns are converted into simple JSON strings. Casting the values to an array (using the
    >     [TO_ARRAY](../functions/to_array.md) function) results in an array of JSON strings.
    > * There is no option to omit the columns in the partition expression from the unloaded data files.

`FILE_FORMAT = ( FORMAT_NAME = 'file_format_name' )` or . `FILE_FORMAT = ( TYPE = CSV | JSON | PARQUET [ ... ] )`
:   Specifies the format of the data files containing unloaded data:

    `FORMAT_NAME = 'file_format_name'`
    :   Specifies an existing named file format to use for unloading data from the table. The named file format determines the format type
        (CSV, JSON, PARQUET), as well as any other format options, for the data files. For more information, see [CREATE FILE FORMAT](create-file-format.md).

    `TYPE = CSV | JSON | PARQUET`
    :   Specifies the type of files unloaded from the table.

        If a format type is specified, you can specify additional format-specific options. For information, see
        Format Type Options (in this topic).

    > **Note:**
    >
    > * JSON can only be used to unload data from columns of type VARIANT (i.e. columns containing JSON data).
    > * Currently, nested data in VARIANT columns cannot be unloaded successfully in Parquet format.

`copyOptions`
:   Specifies one or more copy options for the unloaded data. For more details, see Copy Options
    (in this topic).

`VALIDATION_MODE = RETURN_ROWS`
:   String (constant) that instructs the COPY command to return the results of the query in the SQL statement instead of unloading
    the results to the specified cloud storage location. The only supported validation option is `RETURN_ROWS`. This option returns
    all rows produced by the query.

    When you have validated the query, you can remove the `VALIDATION_MODE` to perform the unload operation.

`HEADER = TRUE | FALSE`
:   Specifies whether to include the table column headings in the output files.

    * Set this option to `TRUE` to include the table column headings to the output files.

      Note that if the COPY operation unloads the data to multiple files, the column headings are included in every file.

      When unloading data in Parquet format, the table column names are retained in the output files.
    * Set this option to `FALSE` to specify the following behavior:

      CSV:
      :   Do not include table column headings in the output files.

      Parquet:
      :   Include generic column headings (e.g. `col1`, `col2`, etc.) in the output files.

    Default: `FALSE`

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`FILE_FORMAT = ( TYPE = ... )`), you can include one or more of the following
format-specific options (separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies to compresses the unloaded data files using the specified compression algorithm.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Unloaded files are automatically compressed using the default, which is gzip. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` | Must be specified when loading Brotli-compressed files. |
    | `ZSTD` | Zstandard v0.8 (and higher) supported. |
    | `DEFLATE` | Unloaded files are compressed using Deflate (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Unloaded files are compressed using Raw Deflate (without header, RFC1951). |
    | `NONE` | Unloaded files are not compressed. |

    Default: `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   One or more singlebyte or multibyte characters that separate records in an unloaded file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (e.g. `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

    The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

    Also accepts a value of `NONE`.

    Default: New line character. Note that “new line” is logical such that `\r\n` is understood as a new line for files on a Windows platform.

`FIELD_DELIMITER = 'string' | NONE`
:   One or more singlebyte or multibyte characters that separate fields in an unloaded file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (e.g. `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        > > **Note:**
        > >
        > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

> Also accepts a value of `NONE`.
>
> Default: comma (`,`)

`FILE_EXTENSION = 'string'`
:   String that specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a valid file extension that can be read by the desired software or
    service.

    > **Note:**
    >
    > If the `SINGLE` copy option is `TRUE`, then the COPY command unloads a file without a file extension by default. To specify a file extension, provide a file name and extension in the
    > `internal_location` or `external_location` path. For example:
    >
    > > `copy into @stage/data.csv ...`

    Default: null, meaning the file extension is determined by the format type, e.g. `.csv[compression]`, where `compression` is the extension added by the compression method, if
    `COMPRESSION` is set.

`DATE_FORMAT = 'string' | AUTO`
:   String that defines the format of date values in the unloaded data files. If a value is not specified or is set to `AUTO`, the value for the [DATE_OUTPUT_FORMAT](../parameters.md) parameter is used.

    Default: `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   String that defines the format of time values in the unloaded data files. If a value is not specified or is set to `AUTO`, the value for the [TIME_OUTPUT_FORMAT](../parameters.md) parameter is used.

    Default: `AUTO`

`TIMESTAMP_FORMAT = 'string' | AUTO`
:   String that defines the format of timestamp values in the unloaded data files. If a value is not specified or is set to `AUTO`, the value for the [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) parameter is used.

    Default: `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   String (constant) that defines the encoding format for binary output. The option can be used when unloading data from binary columns in a table.

    Default: `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for enclosed or unenclosed field values. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals. The escape character can also be used to escape instances of itself in the data.

    Accepts common escape sequences, octal values, or hex values.

    Specify the character used to enclose fields by setting `FIELD_OPTIONALLY_ENCLOSED_BY`.

    If this option is set, it overrides the escape character set for `ESCAPE_UNENCLOSED_FIELD`.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

    Accepts common escape sequences, octal values, or hex values.

    If `ESCAPE` is set, the escape character set for that file format option overrides this option.

    Default:
    :   backslash (`\\`)

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex
    representation (`0x27`) or the double single-quoted escape (`''`).

    When a field in the source table contains this character, Snowflake escapes it using the same character for unloading. For example, if the value is the double quote character and a field contains the string `A "B" C`, Snowflake escapes the double quotes for unloading as follows:

    `A ""B"" C`

    Default: `NONE`

`NULL_IF = ( 'string1' [ , 'string2' ... ] ) | ()`
:   String used to convert from SQL NULL. Snowflake converts SQL NULL values to the first value in the list.

    If `NULL_IF = ()` is specified, Snowflake converts NULL values to empty fields (`,,`).

    Default: `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\` (default))

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Used in combination with `FIELD_OPTIONALLY_ENCLOSED_BY`. When `FIELD_OPTIONALLY_ENCLOSED_BY = NONE`, setting `EMPTY_FIELD_AS_NULL = FALSE` specifies to unload empty strings in tables to empty string values without quotes enclosing the field values.

    If set to `TRUE`, `FIELD_OPTIONALLY_ENCLOSED_BY` must specify a character to enclose strings.

    Default: `TRUE`

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant). Compresses the data file using the specified compression algorithm.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Unloaded files are automatically compressed using the default, which is gzip. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Unloaded files are compressed using Deflate (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Unloaded files are compressed using Raw Deflate (without header, RFC1951). |
    | `NONE` | Unloaded files are not compressed. |

    Default: `AUTO`

`FILE_EXTENSION = 'string' | NONE`
:   String that specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a valid file extension that can be read by the desired software or
    service.

    Default: null, meaning the file extension is determined by the format type (e.g. `.csv[compression]`), where `compression` is the extension added by the compression method, if
    `COMPRESSION` is set.

### TYPE = PARQUET

`COMPRESSION = AUTO | LZO | SNAPPY | NONE`
:   String (constant). Compresses the data file using the specified compression algorithm.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Files are compressed using Snappy, the default compression algorithm. |
    | `LZO` | Files are compressed using the Snappy algorithm by default. If applying Lempel-Ziv-Oberhumer (LZO) compression instead, specify this value. |
    | `SNAPPY` | Files are compressed using the Snappy algorithm by default. You can optionally specify this value. |
    | `NONE` | Specifies that the unloaded files are not compressed. |

    Default: `AUTO`

`SNAPPY_COMPRESSION = TRUE | FALSE`
:   Boolean that specifies whether the unloaded file(s) are compressed using the SNAPPY algorithm.

    > **Note:**
    >
    > Deprecated. Use `COMPRESSION = SNAPPY` instead.

    Default: `TRUE`

## Copy options (`copyOptions`)

You can specify one or more of the following copy options (separated by blank spaces, commas, or new lines):

`OVERWRITE = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether the COPY command overwrites existing files with matching names, if any, in the location where files are stored. The option does not remove any existing files that do not match the names of the files that the COPY command unloads.

        In many cases, enabling this option helps prevent data duplication in the target stage when the same COPY INTO *<location>* statement is executed multiple times. However, when an unload operation writes multiple files to a stage, Snowflake appends a suffix that ensures each file name is unique across parallel execution threads (e.g. `data_0_1_0`). The number of parallel execution threads can vary between unload operations. If the files written by an unload operation do not have the same filenames as files written by a previous operation, SQL statements that include this copy option cannot replace the existing files, resulting in duplicate files.

        In addition, in the rare event of a machine or network failure, the unload job is retried. In that scenario, the unload operation writes additional files to the stage without first removing any files that were previously written by the first attempt.

        To avoid data duplication in the target stage, we recommend setting the `INCLUDE_QUERY_ID = TRUE` copy option instead of `OVERWRITE = TRUE` and removing all data files in the target stage and path (or using a different path for each unload operation) between each unload job.

    Default:
    :   `FALSE`

`SINGLE = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether to generate a single file or multiple files. If `FALSE`, a filename prefix must be included in `path`.

    > **Important:**
    >
    > If `SINGLE = TRUE`, then COPY ignores the `FILE_EXTENSION` file format option and outputs a file simply named **data**. To specify a file extension, provide a filename and extension in the internal or external location `path`. For example:
    >
    > > ```sqlexample
    > > COPY INTO @mystage/data.csv ...
    > > ```
    >
    > In addition, if the `COMPRESSION` file format option is also explicitly set to one of the supported compression algorithms (e.g. `GZIP`), then the specified internal or external location `path` must end in a filename with the corresponding file extension (e.g. `gz`) so that the file can be uncompressed using the appropriate tool. For example:
    >
    > > ```sqlexample
    > > COPY INTO @mystage/data.gz ...
    > >
    > > COPY INTO @mystage/data.csv.gz ...
    > > ```

    Default:
    :   `FALSE`

`MAX_FILE_SIZE = num`
:   Definition:
    :   Specifies the maximum size (in bytes) of each file to be generated in parallel per thread.
        Snowflake utilizes parallel execution to optimize performance. The number of threads can’t be modified.

        > **Note:**
        >
        > The actual unloaded file size and number of files unloaded depends on the total amount of data and number of nodes available for parallel processing. The unloaded file size depends on the available memory in the warehouse worker, which varies based on:
        >
        > * The warehouse size and available resources.
        > * The number of concurrent queries running on the warehouse.
        >
        > MAX_FILE_SIZE sets an upper limit but does not guarantee that files reach this size. Files might be smaller than
        > the specified MAX_FILE_SIZE when memory constraints require earlier file completion.
        >
        > The COPY command unloads one set of table rows at a time. If you set a very small `MAX_FILE_SIZE` value (for example, less than 1 MB), the amount of data in a set of rows could exceed the specified size.

    Default:
    :   16777216 (16 MB)

    Maximum:
    :   5368709120 (5 GB)

`INCLUDE_QUERY_ID = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether to uniquely identify unloaded files by including a universally unique identifier (UUID) in the filenames of unloaded data files. This option helps ensure that concurrent COPY statements do not overwrite unloaded files accidentally.

    Values:
    :   If `TRUE`, a UUID is added to the names of unloaded files. The UUID is the query ID of the COPY statement used to unload the data files. The UUID is a segment of the filename: `<path>/data_<uuid>_<name>.<extension>`. This option also prevents unloading duplicate data if an internal retry occurs. When an internal retry occurs, Snowflake deletes the partial set of unloaded files (identified by UUID), then restarts the copy operation.

        If `FALSE`, then a UUID is not added to the unloaded data files.

        > **Note:**
        >
        > * `INCLUDE_QUERY_ID = TRUE` is the default copy option value when you partition the unloaded table rows into separate files (by setting `PARTITION BY expr` in the COPY INTO *<location>* statement). This value cannot be changed to FALSE.
        > * `INCLUDE_QUERY_ID = TRUE` is not supported when either of the following copy options is set:
        >
        >   + `SINGLE = TRUE`
        >   + `OVERWRITE = TRUE`
        > * In the rare event of a machine or network failure, the unload job is retried. In that scenario, the unload operation removes any files that were written to the stage with the UUID of the current query ID and then attempts to unload the data again. Any new files written to the stage have the retried query ID as the UUID.

    Default:
    :   `FALSE`

`DETAILED_OUTPUT = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether the command output should describe the unload operation or the individual files unloaded as a result of the operation.

    Values:
    :   * If `TRUE`, the command output includes a row for each file unloaded to the specified stage. Columns show the path and name for each file, its size, and the number of rows that were unloaded to the file.
        * If `FALSE`, the command output consists of a single row that describes the entire unload operation. Columns show the total amount of data unloaded from tables, before and after compression (if applicable), and the total number of rows that were unloaded.

    Default:
    :   `FALSE`

## Usage notes

* `STORAGE_INTEGRATION` or `CREDENTIALS` only applies if you are unloading directly into a private storage location (Amazon S3,
  Google Cloud Storage, or Microsoft Azure). If you are unloading into a public bucket, secure access is not required, and if you are
  unloading into a named external stage, the stage provides all the credential information required for accessing the bucket.
* If referencing a file format in the current namespace, you can omit the single quotes around the format identifier.
* `JSON` can be specified for `TYPE` only when unloading data from VARIANT columns in tables.
* When unloading to files of type `CSV`, `JSON`, or `PARQUET`:

  By default, VARIANT columns are converted into simple JSON strings in the output file.

  + To unload the data as Parquet LIST values, explicitly cast the column values to arrays
    (using the [TO_ARRAY](../functions/to_array.md) function).
  + If a VARIANT column contains XML, Snowflake recommends explicitly casting the column values to
    XML in a `FROM ...` query. Casting the values using the
    [TO_XML](../functions/to_xml.md) function unloads XML-formatted strings
    instead of JSON strings.
* When unloading to files of type `PARQUET`:

  Unloading TIMESTAMP_TZ or TIMESTAMP_LTZ data produces an error.
* If the source table contains 0 rows, then the COPY operation does not unload a data file.
* This SQL command does not return a warning when unloading into a non-empty storage location. To avoid unexpected behaviors when files in
  a storage location are consumed by data pipelines, Snowflake recommends only writing to empty storage locations.
* Failed unload operations:

  + A failed unload operation can still result in unloaded data files (for example, if the statement is canceled or exceeds its timeout limit).
    For [Partitioned data unloading](../../user-guide/data-unload-overview.md) or if `INCLUDE_QUERY_ID = TRUE`,
    Snowflake attempts to clean up unloaded files and might retry the failed unload operation.
    If the cleanup operation times out and returns an error message, you can use the failed query ID to find and manually
    remove the files from your storage location. Alternatively, you can identify files to remove by the
    [file naming pattern](../../user-guide/data-unload-overview.md).

    To facilitate cleanup and avoid timeout, Snowflake recommends that you only unload into empty storage locations.
  + A failed unload operation to cloud storage in a different region results in data transfer costs.
* If a [masking policy](../../user-guide/security-column-intro.md) is set on a column, the masking policy is applied to the data resulting in
  unauthorized users seeing masked data in the column.
* To view the status and history of this command’s executions, use [QUERY_HISTORY view](../account-usage/query_history.md).
* For [outbound private connectivity](../../user-guide/private-connectivity-outbound.md), unloading directly to an external location (external
  storage URI) isn’t supported. Instead, use an external stage with a storage integration configured for outbound private connectivity.
  For more information, see the following topics:

  + [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md)
  + [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md)

To learn more, see [Explicitly convert numeric columns to Parquet data types](../../user-guide/data-unload-considerations.md).

## Examples

### Unloading data from a table to files in a table stage

Unload data from the `orderstiny` table into the table’s stage using a folder/filename prefix (`result/data_`), a named
file format (`myformat`), and gzip compression:

> ```sqlexample
> COPY INTO @%orderstiny/result/data_
>   FROM orderstiny FILE_FORMAT = (FORMAT_NAME ='myformat' COMPRESSION='GZIP');
> ```

### Unloading data from a query to files in a named internal stage

Unload the result of a query into a named internal stage (`my_stage`) using a folder/filename prefix (`result/data_`), a named
file format (`myformat`), and gzip compression:

> ```sqlexample
> COPY INTO @my_stage/result/data_ FROM (SELECT * FROM orderstiny)
>    file_format=(format_name='myformat' compression='gzip');
> ```
>
> Note that the above example is functionally equivalent to the first example, except the file containing the unloaded data is stored in
> the stage location for `my_stage` rather than the table location for `orderstiny`.

### Unloading data from a table directly to files in an external location

> **Note:**
>
> This option isn’t supported for [outbound private connectivity](../../user-guide/private-connectivity-outbound.md).
> Instead, use an external stage.

Unload all data in a table into a storage location using a named `my_csv_format` file format:

**Amazon S3**

> Access the referenced S3 bucket using a referenced storage integration named `myint`:
>
> ```sqlexample
> COPY INTO 's3://mybucket/unload/'
>   FROM mytable
>   STORAGE_INTEGRATION = myint
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```
>
> Access the referenced S3 bucket using supplied credentials:
>
> ```sqlexample
> COPY INTO 's3://mybucket/unload/'
>   FROM mytable
>   CREDENTIALS = (AWS_KEY_ID='xxxx' AWS_SECRET_KEY='xxxxx' AWS_TOKEN='xxxxxx')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

**Google Cloud Storage**

> Access the referenced GCS bucket using a referenced storage integration named `myint`:
>
> ```sqlexample
> COPY INTO 'gcs://mybucket/unload/'
>   FROM mytable
>   STORAGE_INTEGRATION = myint
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

**Microsoft Azure**

> Access the referenced container using a referenced storage integration named `myint`:
>
> ```sqlexample
> COPY INTO 'azure://myaccount.blob.core.windows.net/unload/'
>   FROM mytable
>   STORAGE_INTEGRATION = myint
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```
>
> Access the referenced container using supplied credentials:
>
> ```sqlexample
> COPY INTO 'azure://myaccount.blob.core.windows.net/mycontainer/unload/'
>   FROM mytable
>   CREDENTIALS=(AZURE_SAS_TOKEN='xxxx')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

### Partitioning unloaded rows to Parquet files

The following example partitions unloaded rows into Parquet files by the values in two columns: a date column and a time column. The
example specifies a maximum size for each unloaded file:

```sqlexample
CREATE or replace TABLE t1 (
  dt date,
  ts time
  )
AS
  SELECT TO_DATE($1)
        ,TO_TIME($2)
    FROM VALUES
           ('2020-01-28', '18:05')
          ,('2020-01-28', '22:57')
          ,('2020-01-28', NULL)
          ,('2020-01-29', '02:15')
;

SELECT * FROM t1;

+------------+----------+
| DT         | TS       |
|------------+----------|
| 2020-01-28 | 18:05:00 |
| 2020-01-28 | 22:57:00 |
| 2020-01-28 | 22:32:00 |
| 2020-01-29 | 02:15:00 |
+------------+----------+

-- Partition the unloaded data by date and hour. Set ``32000000`` (32 MB) as the upper size limit of each file to be generated in parallel per thread.
COPY INTO @%t1
  FROM t1
  PARTITION BY ('date=' || to_varchar(dt, 'YYYY-MM-DD') || '/hour=' || to_varchar(date_part(hour, ts))) -- Concatenate labels and column values to output meaningful filenames
  FILE_FORMAT = (TYPE=parquet)
  MAX_FILE_SIZE = 32000000
  HEADER=true;

LIST @%t1;

+------------------------------------------------------------------------------------------+------+----------------------------------+------------------------------+
| name                                                                                     | size | md5                              | last_modified                |
|------------------------------------------------------------------------------------------+------+----------------------------------+------------------------------|
| __NULL__/data_019c059d-0502-d90c-0000-438300ad6596_006_4_0.snappy.parquet                |  512 | 1c9cb460d59903005ee0758d42511669 | Wed, 5 Aug 2020 16:58:16 GMT |
| date=2020-01-28/hour=18/data_019c059d-0502-d90c-0000-438300ad6596_006_4_0.snappy.parquet |  592 | d3c6985ebb36df1f693b52c4a3241cc4 | Wed, 5 Aug 2020 16:58:16 GMT |
| date=2020-01-28/hour=22/data_019c059d-0502-d90c-0000-438300ad6596_006_6_0.snappy.parquet |  592 | a7ea4dc1a8d189aabf1768ed006f7fb4 | Wed, 5 Aug 2020 16:58:16 GMT |
| date=2020-01-29/hour=2/data_019c059d-0502-d90c-0000-438300ad6596_006_0_0.snappy.parquet  |  592 | 2d40ccbb0d8224991a16195e2e7e5a95 | Wed, 5 Aug 2020 16:58:16 GMT |
+------------------------------------------------------------------------------------------+------+----------------------------------+------------------------------+
```

### Retaining NULL/empty field data in unloaded files

Retain SQL NULL and empty fields in unloaded files:

> ```sqlexample
> -- View the table column values
> SELECT * FROM HOME_SALES;
>
> +------------+-------+-------+-------------+--------+------------+
> | CITY       | STATE | ZIP   | TYPE        | PRICE  | SALE_DATE  |
> |------------+-------+-------+-------------+--------+------------|
> | Lexington  | MA    | 95815 | Residential | 268880 | 2017-03-28 |
> | Belmont    | MA    | 95815 | Residential |        | 2017-02-21 |
> | Winchester | MA    | NULL  | Residential |        | 2017-01-31 |
> +------------+-------+-------+-------------+--------+------------+
>
> -- Unload the table data into the current user's personal stage. The file format options retain both the NULL value and the empty values in the output file
> COPY INTO @~ FROM HOME_SALES
>   FILE_FORMAT = (TYPE = csv NULL_IF = ('NULL', 'null') EMPTY_FIELD_AS_NULL = false);
>
> -- Contents of the output file
> Lexington,MA,95815,Residential,268880,2017-03-28
> Belmont,MA,95815,Residential,,2017-02-21
> Winchester,MA,NULL,Residential,,2017-01-31
> ```

### Unloading data to a single file

Unload all rows to a single data file using the SINGLE copy option:

> ```sqlexample
> copy into @~ from HOME_SALES
> single = true;
> ```

### Including the UUID in the unloaded filenames

Include the UUID in the names of unloaded files by setting the INCLUDE_QUERY_ID copy option to TRUE:

```sqlexample
-- Unload rows from the T1 table into the T1 table stage:
COPY INTO @%t1
  FROM t1
  FILE_FORMAT=(TYPE=parquet)
  INCLUDE_QUERY_ID=true;

-- Retrieve the query ID for the COPY INTO location statement.
-- This optional step enables you to see that the query ID for the COPY INTO location statement
-- is identical to the UUID in the unloaded files.
SELECT last_query_id();
+--------------------------------------+
| LAST_QUERY_ID()                      |
|--------------------------------------|
| 019260c2-00c0-f2f2-0000-4383001cf046 |
+--------------------------------------+

LS @%t1;
+----------------------------------------------------------------+------+----------------------------------+-------------------------------+
| name                                                           | size | md5                              | last_modified                 |
|----------------------------------------------------------------+------+----------------------------------+-------------------------------|
| data_019260c2-00c0-f2f2-0000-4383001cf046_0_0_0.snappy.parquet |  544 | eb2215ec3ccce61ffa3f5121918d602e | Thu, 20 Feb 2020 16:02:17 GMT |
+----------------------------------------------------------------+------+----------------------------------+-------------------------------+
```

### Validating data to be unloaded (from a query)

Execute COPY in validation mode to return the result of a query and view the data that will be unloaded from the `orderstiny` table if
COPY is executed in normal mode:

```sqlexample
COPY INTO @my_stage
  FROM (SELECT * FROM orderstiny LIMIT 5)
  VALIDATION_MODE='RETURN_ROWS';
```

Output:

```output
+-----+--------+----+-----------+------------+----------+-----------------+----+---------------------------------------------------------------------------+
|  C1 |   C2   | C3 |    C4     |     C5     |    C6    |       C7        | C8 |                                    C9                                     |
+-----+--------+----+-----------+------------+----------+-----------------+----+---------------------------------------------------------------------------+
|  1  | 36901  | O  | 173665.47 | 1996-01-02 | 5-LOW    | Clerk#000000951 | 0  | nstructions sleep furiously among                                         |
|  2  | 78002  | O  | 46929.18  | 1996-12-01 | 1-URGENT | Clerk#000000880 | 0  |  foxes. pending accounts at the pending\, silent asymptot                 |
|  3  | 123314 | F  | 193846.25 | 1993-10-14 | 5-LOW    | Clerk#000000955 | 0  | sly final accounts boost. carefully regular ideas cajole carefully. depos |
|  4  | 136777 | O  | 32151.78  | 1995-10-11 | 5-LOW    | Clerk#000000124 | 0  | sits. slyly regular warthogs cajole. regular\, regular theodolites acro   |
|  5  | 44485  | F  | 144659.20 | 1994-07-30 | 5-LOW    | Clerk#000000925 | 0  | quickly. bold deposits sleep slyly. packages use slyly                    |
+-----+--------+----+-----------+------------+----------+-----------------+----+---------------------------------------------------------------------------+
```

---
title: COPY INTO <table>
source: https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.md
section: SQL Commands
---

# COPY INTO *<table>*

Loads data from files to an existing table. The files must already be in one of the following locations:

* Named internal stage (or table/user stage). Files can be staged using the [PUT](put.md) command.
* Named external stage that references an external location (Amazon S3, Google Cloud Storage, or Microsoft Azure).

  You cannot access data held in archival cloud storage classes that requires restoration before it can be retrieved. These archival storage classes include, for example, the Amazon S3 Glacier Flexible Retrieval or Glacier Deep Archive storage class, or Microsoft Azure Archive Storage.
* External location (Amazon S3, Google Cloud Storage, or Microsoft Azure).

See also:
:   [COPY INTO <location>](copy-into-location.md)

## Syntax

```sqlsyntax
/* Standard data load */
COPY INTO [<namespace>.]<table_name>
     FROM { internalStage | externalStage | externalLocation }
[ FILES = ( '<file_name>' [ , '<file_name>' ] [ , ... ] ) ]
[ PATTERN = '<regex_pattern>' ]
[ FILE_FORMAT = ( { FORMAT_NAME = '[<namespace>.]<file_format_name>' |
                    TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
[ copyOptions ]
[ VALIDATION_MODE = RETURN_<n>_ROWS | RETURN_ERRORS | RETURN_ALL_ERRORS ]

/* Data load with transformation */
COPY INTO [<namespace>.]<table_name> [ ( <col_name> [ , <col_name> ... ] ) ]
     FROM ( SELECT [<alias>.]$<file_col_num>[.<element>] [ , [<alias>.]$<file_col_num>[.<element>] ... ]
            FROM { internalStage | externalStage } )
[ FILES = ( '<file_name>' [ , '<file_name>' ] [ , ... ] ) ]
[ PATTERN = '<regex_pattern>' ]
[ FILE_FORMAT = ( { FORMAT_NAME = '[<namespace>.]<file_format_name>' |
                    TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
[ copyOptions ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```
>
> ```sqlsyntax
> externalStage ::=
>   @[<namespace>.]<ext_stage_name>[/<path>]
> ```
>
> ```sqlsyntax
> externalLocation (for Amazon S3) ::=
>   '<protocol>://<bucket>[/<path>]'
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( {  { AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' [ AWS_TOKEN = '<string>' ] } } ) } ]
>   [ ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] [ MASTER_KEY = '<string>' ] |
>                    [ TYPE = 'AWS_SSE_S3' ] |
>                    [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] |
>                    [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> externalLocation (for Google Cloud Storage) ::=
>   'gcs://<bucket>[/<path>]'
>   [ STORAGE_INTEGRATION = <integration_name> ]
>   [ ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = '<string>' ] | [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> externalLocation (for Microsoft Azure) ::=
>   'azure://<account>.blob.core.windows.net/<container>[/<path>]'
>   [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( [ AZURE_SAS_TOKEN = '<string>' ] ) } ]
>   [ ENCRYPTION = ( [ TYPE = { 'AZURE_CSE' | 'NONE' } ] [ MASTER_KEY = '<string>' ] ) ]
> ```
>
> ```sqlsyntax
> formatTypeOptions ::=
> -- If FILE_FORMAT = ( TYPE = CSV ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      MULTI_LINE = TRUE | FALSE
>      PARSE_HEADER = TRUE | FALSE
>      SKIP_HEADER = <integer>
>      SKIP_BLANK_LINES = TRUE | FALSE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      ESCAPE = '<character>' | NONE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      TRIM_SPACE = TRUE | FALSE
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( [ '<string>' [ , '<string>' ... ] ] )
>      ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
>      ENCODING = '<string>' | UTF8
> -- If FILE_FORMAT = ( TYPE = JSON ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      TRIM_SPACE = TRUE | FALSE
>      MULTI_LINE = TRUE | FALSE
>      NULL_IF = ( [ '<string>' [ , '<string>' ... ] ] )
>      ENABLE_OCTAL = TRUE | FALSE
>      ALLOW_DUPLICATE = TRUE | FALSE
>      STRIP_OUTER_ARRAY = TRUE | FALSE
>      STRIP_NULL_VALUES = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> -- If FILE_FORMAT = ( TYPE = AVRO ... )
>      COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( [ '<string>' [ , '<string>' ... ] ] )
> -- If FILE_FORMAT = ( TYPE = ORC ... )
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( [ '<string>' [ , '<string>' ... ] ] )
> -- If FILE_FORMAT = ( TYPE = PARQUET ... )
>      COMPRESSION = AUTO | SNAPPY | NONE
>      BINARY_AS_TEXT = TRUE | FALSE
>      USE_LOGICAL_TYPE = TRUE | FALSE
>      TRIM_SPACE = TRUE | FALSE
>      USE_VECTORIZED_SCANNER = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( [ '<string>' [ , '<string>' ... ] ] )
> -- If FILE_FORMAT = ( TYPE = XML ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      PRESERVE_SPACE = TRUE | FALSE
>      STRIP_OUTER_ELEMENT = TRUE | FALSE
>      DISABLE_AUTO_CONVERT = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> ```
>
> ```sqlsyntax
> copyOptions ::=
>      CLUSTER_AT_INGEST_TIME = TRUE | FALSE
>      ENFORCE_LENGTH = TRUE | FALSE
>      FORCE = TRUE | FALSE
>      INCLUDE_METADATA = ( <column_name> = METADATA$<field> [ , <column_name> = METADATA${field} ... ] )
>      LOAD_MODE = { FULL_INGEST | ADD_FILES_COPY }
>      LOAD_UNCERTAIN_FILES = TRUE | FALSE
>      MATCH_BY_COLUMN_NAME = CASE_SENSITIVE | CASE_INSENSITIVE | NONE
>      ON_ERROR = { CONTINUE | SKIP_FILE | SKIP_FILE_<num> | 'SKIP_FILE_<num>%' | ABORT_STATEMENT }
>      PURGE = TRUE | FALSE
>      RETURN_FAILED_ONLY = TRUE | FALSE
>      SIZE_LIMIT = <num>
>      TRUNCATECOLUMNS = TRUE | FALSE
> ```

## Required parameters

`[namespace.]table_name`
:   Specifies the name of the table into which data is loaded.

    Namespace optionally specifies the database and/or schema for the table, in the form of `database_name.schema_name` or
    `schema_name`. It is optional if a database and schema are currently in use within the user session; otherwise, it is required.

`FROM ...`
:   Specifies the internal or external location where the files containing data to be loaded are staged:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are in the specified named internal stage. |
    > | `@[namespace.]ext_stage_name[/path]` | Files are in the specified named external stage. |
    > | `@[namespace.]%table_name[/path]` | Files are in the stage for the specified table. |
    > | `@~[/path]` | Files are in the stage for the current user. |
    > | `'protocol://bucket[/path]'` | Files are in the specified external location (S3 bucket). Additional parameters might be required. For details, see Additional Cloud Provider Parameters (in this topic). |
    > | `'gcs://bucket[/path]'` | Files are in the specified external location (Google Cloud Storage bucket). Additional parameters could be required. For details, see Additional Cloud Provider Parameters (in this topic). |
    > | `'azure://account.blob.core.windows.net/container[/path]'` | Files are in the specified external location (Azure container). Additional parameters might be required. For details, see Additional Cloud Provider Parameters (in this topic). |

    Where:

    > * `namespace` is the database and/or schema in which the internal or external stage resides, in the form of
    >   `database_name.schema_name` or `schema_name`. It is optional if a database and schema are currently in use
    >   within the user session; otherwise, it is required.
    > * `protocol` is one of the following:
    >
    >   + `s3` refers to S3 storage in public AWS regions outside of China.
    >   + `s3china` refers to S3 storage in public AWS regions in China.
    >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    >
    >   Accessing cloud storage in a [government region](../../user-guide/intro-regions.md) using a storage integration is limited to Snowflake
    >   accounts hosted in the same government region.
    >
    >   Similarly, if you need to access cloud storage in a region in China, you can use a storage integration only from a Snowflake
    >   account hosted in the same region in China.
    >
    >   In these cases, use the CREDENTIALS parameter in the [CREATE STAGE](create-stage.md) command (rather than using a storage
    >   integration) to provide the credentials for authentication.
    > * `bucket` is the name of the bucket.
    >
    > * `account` is the name of the Azure account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint for all
    >   supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
    >
    >   Note that currently, accessing Azure blob storage in [government regions](../../user-guide/intro-regions.md) using a storage
    >   integration is limited to Snowflake accounts hosted on Azure in the same government region. Accessing your blob storage from an
    >   account hosted outside of the government region using direct credentials is supported.
    > * `container` is the name of the Azure container (e.g. `mycontainer`).
    >
    > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
    >   common string) that limits the set of files to load. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >   services.
    >
    >   Relative path modifiers such as `/./` and `/../` are interpreted literally because “paths” are literal prefixes for a name.
    >   For example:
    >
    >   > ```sqlexample
    >   > -- S3 bucket
    >   > COPY INTO mytable FROM 's3://mybucket/./../a.csv';
    >   >
    >   > -- Google Cloud Storage bucket
    >   > COPY INTO mytable FROM 'gcs://mybucket/./../a.csv';
    >   >
    >   > -- Azure container
    >   > COPY INTO mytable FROM 'azure://myaccount.blob.core.windows.net/mycontainer/./../a.csv';
    >   > ```
    >
    >   When you specify a path in COPY INTO *<table>* statements, Snowflake treats the path value as a prefix and
    >   performs a prefix match against the file name in the external location.
    >   In the previous examples, Snowflake would load all files with a name that
    >   starts with `./../a.csv` in the external location.

    > **Note:**
    >
    > * If the internal or external stage or path name includes special characters, including spaces, enclose the `FROM ...` string in
    >   single quotes.
    > * The `FROM ...` value must be a literal constant. The value cannot be a [SQL variable](../session-variables.md).

### Additional cloud provider parameters

`STORAGE_INTEGRATION = integration_name` or . `CREDENTIALS = ( cloud_specific_credentials )`
:   Supported when the FROM value in the COPY statement is an external storage URI rather than an external stage name.

    Required only for loading from an external private/protected cloud storage location; not required for public buckets/containers

    Specifies the security credentials for connecting to the cloud provider and accessing the private/protected storage container where the
    data files are staged.

    **Amazon S3**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using the
    >     > CREDENTIALS parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' [ AWS_TOKEN = 'string' ] )`
    > :   Specifies the security credentials for connecting to AWS and accessing the private/protected S3 bucket where the files to load are staged.
    >     For more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
    >
    >     The credentials you specify depend on whether you associated the Snowflake access permissions for the bucket with an AWS IAM
    >     (Identity & Access Management) user or role:
    >
    >     * **IAM user:** Temporary IAM credentials are required. Temporary (aka “scoped”) credentials are generated by AWS Security Token Service
    >       (STS) and consist of three components:
    >
    >       > + `AWS_KEY_ID`
    >       > + `AWS_SECRET_KEY`
    >       > + `AWS_TOKEN`
    >
    >       All three are required to access a private/protected bucket. After a designated period of time, temporary credentials expire
    >       and can no longer be used. You must then generate a new set of valid temporary credentials.
    >
    >       > **Important:**
    >       >
    >       > COPY commands contain complex syntax and sensitive information, such as credentials. In addition, they are executed frequently and
    >       > are often stored in scripts or worksheets, which could lead to sensitive information being inadvertently exposed. The COPY command
    >       > allows permanent (aka “long-term”) credentials to be used; however, for security reasons, do not use permanent
    >       > credentials in COPY commands. Instead, use temporary credentials.
    >       >
    >       > If you must use permanent credentials, use [external stages](create-stage.md), for which credentials are
    >       > entered once and securely stored, minimizing the potential for exposure.
    >     * **IAM role:** Omit the security credentials and access keys and, instead, identify the role using `AWS_ROLE` and specify the
    >       AWS role ARN (Amazon Resource Name).

    **Google Cloud Storage**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).

    **Microsoft Azure**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a Snowflake
    >     identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using the
    >     > CREDENTIALS parameter when creating stages or loading data.
    >
    > `CREDENTIALS = ( AZURE_SAS_TOKEN = 'string' )`
    > :   Specifies the SAS (shared access signature) token for connecting to Azure and accessing the private/protected container where the files
    >     containing data are staged. Credentials are generated by Azure.

`ENCRYPTION = ( cloud_specific_encryption )`
:   For use in ad hoc COPY statements (statements that do not reference a named external stage). Required only for loading from encrypted files; not required if files are unencrypted. Specifies the encryption settings used to decrypt encrypted files in the storage location.

    **Amazon S3**

    > `ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] [ MASTER_KEY = '<string>' ] | [ TYPE = 'AWS_SSE_S3' ] | [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] | [ TYPE = 'NONE' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AWS_CSE`: Client-side encryption (requires a `MASTER_KEY` value). Currently, the client-side
    > >       [master key](https://csrc.nist.gov/glossary/term/master_key) you provide can only be a symmetric key. Note that, when a
    > >       `MASTER_KEY` value is provided, Snowflake assumes `TYPE = AWS_CSE` (i.e. when a `MASTER_KEY` value is
    > >       provided, `TYPE` is not required).
    > >     * `AWS_SSE_S3`: Server-side encryption that requires no additional encryption settings.
    > >     * `AWS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >     * `NONE`: No encryption.
    > >
    > >     For more information about the encryption types, see the AWS documentation for
    > >     [client-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html)
    > >     or [server-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html).
    > >
    > > `MASTER_KEY = 'string'` (applies to `AWS_CSE` encryption only)
    > > :   Specifies the client-side master key used to encrypt the files in the bucket. The master key must be a 128-bit or 256-bit key in
    > >     Base64-encoded form.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files unloaded into the bucket. If no value is
    > >     provided, your default KMS key ID is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading.

    **Google Cloud Storage**

    > `ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' | 'NONE' ] [ KMS_KEY_ID = 'string' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `GCS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >       For more information, see the Google Cloud documentation:
    > >
    > >       + <https://cloud.google.com/storage/docs/encryption/customer-managed-keys>
    > >       + <https://cloud.google.com/storage/docs/encryption/using-customer-managed-keys>
    > >     * `NONE`: No encryption.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the Cloud KMS-managed key that is used to encrypt files unloaded into the bucket. If no
    > >     value is provided, your default KMS key ID set on the bucket is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading. The load operation should succeed if the service account has sufficient permissions
    > >     to decrypt data in the bucket.

    **Microsoft Azure**

    > `ENCRYPTION = ( [ TYPE = 'AZURE_CSE' | 'NONE' ] [ MASTER_KEY = 'string' ] )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AZURE_CSE`: Client-side encryption (requires a MASTER_KEY value). For information, see the
    > >       [Client-side encryption information](https://docs.microsoft.com/en-us/azure/storage/common/storage-client-side-encryption) in
    > >       the Microsoft Azure documentation.
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to AZURE_CSE encryption only)
    > > :   Specifies the client-side master key used to decrypt files. The master key must be a 128-bit or 256-bit key in Base64-encoded form.

### Transformation parameters

`( SELECT [alias.]$file_col_num[.element] [ , [alias.]$file_col_num[.element] ... ] FROM ... [ alias ] )`
:   Required for transforming data during loading

    Specifies an explicit set of fields/columns (separated by commas) to load from the staged data files. The fields/columns are selected from
    the files using a standard SQL query (i.e. [SELECT](select.md) list), where:

    > |  |  |
    > | --- | --- |
    > | `alias` | Specifies an optional alias for the `FROM` value (e.g. `d` in `COPY INTO t1 (c1) FROM (SELECT d.$1 FROM @mystage/file1.csv.gz d);`). |
    > | `file_col_num` | Specifies the positional number of the field/column (in the file) that contains the data to be loaded (`1` for the first field, `2` for the second field, etc.) |
    > | `element` | Specifies the path and element name of a repeating value in the data file (applies only to semi-structured data files). |

    The SELECT list defines a numbered set of field/columns in the data files you are loading from. The list must match the sequence
    of columns in the target table. You can use the optional `( col_name [ , col_name ... ] )` parameter to map the list to specific
    columns in the target table.

    Note that the actual field/column order in the data files can be different from the column order in the target table. It is only important
    that the SELECT list maps fields/columns in the data files to the corresponding columns in the table.

    > **Note:**
    >
    > The SELECT statement used for transformations does not support all functions. For a complete list of the supported functions and more
    > details about data loading transformations, including examples, see the usage notes in [Transform data during a load](../../user-guide/data-load-transform.md).
    >
    > Also, data loading transformation only supports selecting data from user stages and named stages (internal or external).

`( col_name [ , col_name ... ] )`
:   Optionally specifies an explicit list of table columns (separated by commas) into which you want to insert data:

    * The first column consumes the values produced from the first field/column extracted from the loaded files.
    * The second column consumes the values produced from the second field/column extracted from the loaded files.
    * And so on, in the order specified.

    Columns cannot be repeated in this listing. Any columns excluded from this column list are populated by their default value (NULL, if not
    specified). However, excluded columns cannot have a sequence as their default value.

## Optional parameters

`FILES = ( 'file_name' [ , 'file_name' ... ] )`
:   Specifies a list of one or more files names (separated by commas) to be loaded. The files must already have been staged in either the
    Snowflake internal location or external location specified in the command. If any of the specified files cannot be found, the default
    behavior `ON_ERROR = ABORT_STATEMENT` aborts the load operation unless a different `ON_ERROR` option is explicitly set in
    the COPY statement.

    The maximum number of files names that can be specified is 1000.

    > **Note:**
    >
    > For external stages only (Amazon S3, Google Cloud Storage, or Microsoft Azure), the file path is set by concatenating the URL in the
    > stage definition and the list of resolved file names.
    >
    > However, Snowflake doesn’t insert a separator implicitly between the path and file names. You must explicitly include a separator (`/`)
    > either at the end of the URL in the stage definition or at the beginning of each file name specified in this parameter.

`PATTERN = 'regex_pattern'`
:   A regular expression pattern string, enclosed in single quotes, specifying the file names and/or paths to match.

    > **Tip:**
    >
    > For the best performance, try to avoid applying patterns that filter on a large number of files.

    Note that the regular expression is applied differently to bulk data loads versus Snowpipe data loads.

    * Snowpipe trims any path segments in the stage definition from the storage location and applies the regular expression to any remaining
      path segments and filenames. To view the stage definition, execute the [DESCRIBE STAGE](desc-stage.md) command for the stage.
      The URL property consists of the bucket or container name and zero or more path segments. For example, if the FROM location in a COPY
      INTO *<table>* statement is `@s/path1/path2/` and the URL value for stage `@s` is `s3://mybucket/path1/`, then Snowpipe trims
      `s3://mybucket/path1/path2/` from the storage location in the FROM clause and applies the regular expression to the remaining filenames in the path.
    * Bulk data load operations apply the regular expression to the entire storage location in the FROM clause.

    > **Note:**
    >
    > When the `FILES` and `PATTERN` options are used together, only the specified paths in the `FILES` option are loaded. It is recommended to not use these two options together.

`FILE_FORMAT = ( FORMAT_NAME = 'file_format_name' )` or . `FILE_FORMAT = ( TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML [ ... ] )`
:   Specifies the format of the data files to load:

    `FORMAT_NAME = 'file_format_name'`
    :   Specifies an existing named file format to use for loading data into the table. The named file format determines the format type
        (CSV, JSON, etc.), as well as any other format options, for the data files. For more information, see [CREATE FILE FORMAT](create-file-format.md).

    `TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML [ ... ]`
    :   Specifies the type of files to load into the table. If a format type is specified, then additional format-specific options can be
        specified. For more details, see Format Type Options (in this topic).

    > **Note:**
    >
    > `FORMAT_NAME` and `TYPE` are mutually exclusive; specifying both in the same COPY command might result in unexpected behavior.

`COPY_OPTIONS = ( ... )`
:   Specifies one or more copy options for the loaded data. For more details, see Copy Options
    (in this topic).

`VALIDATION_MODE = RETURN_n_ROWS | RETURN_ERRORS | RETURN_ALL_ERRORS`
:   String (constant) that instructs the COPY command to validate the data files instead of loading them into the specified table; i.e.
    the COPY command tests the files for errors but does not load them. The command validates the data to be loaded and returns results based
    on the validation option specified:

    | Supported Values | Notes |
    | --- | --- |
    | `RETURN_n_ROWS` (e.g. `RETURN_10_ROWS`) | Validates the specified number of rows, if no errors are encountered; otherwise, fails at the first error encountered in the rows. |
    | `RETURN_ERRORS` | Returns all errors (parsing, conversion, etc.) across all files specified in the COPY statement. |
    | `RETURN_ALL_ERRORS` | Returns all errors across all files specified in the COPY statement, including files with errors that were partially loaded during an earlier load because the `ON_ERROR` copy option was set to `CONTINUE` during the load. |

    > **Note:**
    >
    > * `VALIDATION_MODE` does not support COPY statements that transform data during a load. If the parameter is specified, the COPY
    >   statement returns an error.
    > * `VALIDATION_MODE` isn’t supported for Iceberg tables.
    > * Use the [VALIDATE](../functions/validate.md) table function to view all errors encountered during a previous load. Note that this
    >   function also does not support COPY statements that transform data during a load.

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`FILE_FORMAT = ( TYPE = ... )`), you can include one or more of the following
format-specific options (separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be loaded. Snowflake uses this option to detect how already-compressed data files were compressed
    so that the compressed data in the files can be extracted for loading.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If loading Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` | Must be specified when loading Brotli-compressed files. |
    | `ZSTD` | Zstandard v0.8 (and higher) supported. |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Data files to load have not been compressed. |

    Default:
    :   `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   One or more characters that separate records in an input file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

    The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

    Also accepts a value of `NONE`.

    Default:
    :   New line character. Note that “new line” is logical such that `\r\n` is understood as a new line for files on a Windows platform.

`FIELD_DELIMITER = 'string' | NONE`
:   One or more singlebyte or multibyte characters that separate fields in an input file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        > > **Note:**
        > >
        > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

    The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

    Also accepts a value of `NONE`.

    Default:
    :   comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Boolean that specifies whether multiple lines are allowed.

    If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

`PARSE_HEADER = TRUE | FALSE`
:   Boolean that specifies whether to use the first row headers in the data files to determine column names.

    This file format option is applied to the following actions only:

    > * Automatically detecting column definitions by using the INFER_SCHEMA function.
    > * Loading CSV data into separate columns by using the INFER_SCHEMA function and MATCH_BY_COLUMN_NAME copy option.

    If the option is set to TRUE, the first row headers will be used to determine column names. The default value FALSE will return column names as c\*, where \* is the position of the column.

    Note that the SKIP_HEADER option is not supported with PARSE_HEADER = TRUE.

    Default:
    :   `FALSE`

`SKIP_HEADER = integer`
:   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to load.

    Default:
    :   `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default:
    :   `FALSE`

`DATE_FORMAT = 'string' | AUTO`
:   String that defines the format of date values in the data files to be loaded. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) session parameter is used.

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   String that defines the format of time values in the data files to be loaded. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) session parameter is used.

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = 'string' | AUTO`
:   String that defines the format of timestamp values in the data files to be loaded. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) session parameter
    is used.

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   String (constant) that defines the encoding format for binary input or output. This option only applies when loading data into binary columns in a table.

    Default:
    :   `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character used as the escape character for enclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals.

    Accepts common escape sequences (For example, `\t` for tab, `\n` for newline, `\r` for carriage return, `\\` for backslash), octal values, or hex values.

    > **Note:**
    >
    > This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
    > as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
    > the option value.
    >
    > In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
    > option as the character encoding for your data files to ensure the character is interpreted correctly.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

    Accepts common escape sequences (For example, `\t` for tab, `\n` for newline, `\r` for carriage return, `\\` for backslash), octal values, or hex values.

    > **Note:**
    >
    > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
    >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, the load operation treats
    >   this row and the next row as a single row of data. To avoid this issue, set the value to `NONE`.
    > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
    >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
    >   the option value.
    >
    >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
    >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Default:
    :   backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove white space from fields.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string of field data). Use this option to remove undesirable spaces during the data load.

    As another example, if leading or trailing space surrounds quotes that enclose strings, you can remove the surrounding space using the `TRIM_SPACE` option and the quote character using the `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any space within the quotes is preserved.

    For example, assuming the field delimiter is `|` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

    ```sqlexample
    |"Hello world"|
    |" Hello world "|
    | "Hello world" |
    ```

    becomes:

    ```sqlexample
    +---------------+
    | C1            |
    |----+----------|
    | Hello world   |
    |  Hello world  |
    | Hello world   |
    +---------------+
    ```

    Default:
    :   `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex
    representation (`0x27`) or the double single-quoted escape (`''`).

    Default:
    :   `NONE`

`NULL_IF = ( [ 'string1' [ , 'string2' ... ] ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To specify more
    than one string, enclose the list of strings in parentheses and use commas to separate each value.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\` (default))

`ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE`
:   Boolean that specifies whether to generate a parsing error if the number of delimited columns (that is, fields) in an input data file does not match the number of columns in the corresponding table.

    If set to `FALSE`, an error is not generated and the load continues. If the file is successfully loaded:

    * If the input file contains records with more fields than columns in the table, the matching fields are loaded in order of occurrence in the file and the remaining fields are not loaded.
    * If the input file contains records with fewer fields than columns in the table, the non-matching columns in the table are loaded with NULL values.

    This option assumes all the records within the input file are the same length (that is, a file containing records of varying length return an error regardless of the value specified for this
    option).

    Default:
    :   `TRUE`

    > **Note:**
    >
    > When [transforming data during loading](../../user-guide/data-load-transform.md) (that is, using a query as the source for the COPY INTO <table> command), this option is ignored. There is no requirement for your data files
    > to have the same number and ordering of columns as your target table.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Boolean that specifies whether to insert SQL NULL for empty fields in an input file, which are represented by two successive delimiters (For example, `,,`).

    If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is inserted into columns of type STRING. For other column types, the
    COPY INTO *<table>* command produces an error.

    Default:
    :   `TRUE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

    If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

`ENCODING = 'string'`
:   String (constant) that specifies the character set of the source data.

    | Character Set | `ENCODING` Value | Supported Languages | Notes |
    | --- | --- | --- | --- |
    | Big5 | `BIG5` | Traditional Chinese |  |
    | EUC-JP | `EUCJP` | Japanese |  |
    | EUC-KR | `EUCKR` | Korean |  |
    | GB18030 | `GB18030` | Chinese |  |
    | IBM420 | `IBM420` | Arabic |  |
    | IBM424 | `IBM424` | Hebrew |  |
    | IBM949 | `IBM949` | Korean |  |
    | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
    | ISO-2022-JP | `ISO2022JP` | Japanese |  |
    | ISO-2022-KR | `ISO2022KR` | Korean |  |
    | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
    | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
    | ISO-8859-5 | `ISO88595` | Russian |  |
    | ISO-8859-6 | `ISO88596` | Arabic |  |
    | ISO-8859-7 | `ISO88597` | Greek |  |
    | ISO-8859-8 | `ISO88598` | Hebrew |  |
    | ISO-8859-9 | `ISO88599` | Turkish |  |
    | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
    | KOI8-R | `KOI8R` | Russian |  |
    | Shift_JIS | `SHIFTJIS` | Japanese |  |
    | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
    | UTF-16 | `UTF16` | All languages |  |
    | UTF-16BE | `UTF16BE` | All languages |  |
    | UTF-16LE | `UTF16LE` | All languages |  |
    | UTF-32 | `UTF32` | All languages |  |
    | UTF-32BE | `UTF32BE` | All languages |  |
    | UTF-32LE | `UTF32LE` | All languages |  |
    | windows-874 | `WINDOWS874` | Thai |  |
    | windows-949 | `WINDOWS949` | Korean |  |
    | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
    | windows-1251 | `WINDOWS1251` | Russian |  |
    | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
    | windows-1253 | `WINDOWS1253` | Greek |  |
    | windows-1254 | `WINDOWS1254` | Turkish |  |
    | windows-1255 | `WINDOWS1255` | Hebrew |  |
    | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default:
    :   `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8 before it is loaded into Snowflake.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be loaded. Snowflake uses this option to detect how already-compressed data files were compressed so that the
    compressed data in the files can be extracted for loading.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If loading Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Indicates the files for loading data have not been compressed. |

    Default:
    :   `AUTO`

`DATE_FORMAT = 'string' | AUTO`
:   Defines the format of date string values in the data files. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) parameter is used.

    This file format option is applied to the following actions only:

    * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
    * Loading JSON data into separate columns by specifying a query in the COPY statement (that is, COPY transformation).

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Defines the format of time string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) parameter is used.

    This file format option is applied to the following actions only:

    * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
    * Loading JSON data into separate columns by specifying a query in the COPY statement (that is, COPY transformation).

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Defines the format of timestamp string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter is used.

    This file format option is applied to the following actions only:

    * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
    * Loading JSON data into separate columns by specifying a query in the COPY statement (that is, COPY transformation).

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Defines the encoding format for binary string values in the data files. The option can be used when loading data into binary columns in a table.

    This file format option is applied to the following actions only:

    * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
    * Loading JSON data into separate columns by specifying a query in the COPY statement (that is, COPY transformation).

    Default:
    :   `HEX`

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove leading and trailing white space from strings.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

    This file format option is applied to the following actions only when loading JSON data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`MULTI_LINE = TRUE | FALSE`
:   Boolean that specifies whether multiple lines are allowed.

    If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default:
    :   `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To specify more than
    one string, enclose the list of strings in parentheses and use commas to separate each value.

    This file format option is applied to the following actions only when loading JSON data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default:
    :   `\\N` (that is, NULL)

`ENABLE_OCTAL = TRUE | FALSE`
:   Boolean that enables parsing of octal numbers.

    Default:
    :   `FALSE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Boolean that allows duplicate object field names (only the last one will be preserved).

    Default:
    :   `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Boolean that instructs the JSON parser to remove outer brackets `[ ]`.

    Default:
    :   `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

    | Before | After |
    | --- | --- |
    | `[null]` | `[]` |
    | `[null,null,3]` | `[,,3]` |
    | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
    | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (that is, “replacement character”).

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

    If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be loaded. Snowflake uses this option to detect how already-compressed data files were compressed so that the
    compressed data in the files can be extracted for loading.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If loading Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Data files to load have not been compressed. |

    Default:
    :   `AUTO`.

    > **Note:**
    >
    > We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove leading and trailing white space from strings.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

    This file format option is applied to the following actions only when loading Avro data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To specify more than
    one string, enclose the list of strings in parentheses and use commas to separate each value.

    This file format option is applied to the following actions only when loading Avro data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default:
    :   `\\N` (that is, NULL)

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove leading and trailing white space from strings.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

    This file format option is applied to the following actions only when loading Orc data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To specify more than
    one string, enclose the list of strings in parentheses and use commas to separate each value.

    This file format option is applied to the following actions only when loading Orc data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default:
    :   `\\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | SNAPPY | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be loaded. Snowflake uses this option to detect how already-compressed data files were compressed so that the
    compressed data in the files can be extracted for loading.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). |
    | `SNAPPY` |  |
    | `NONE` | Data files to load have not been compressed. |

    Default:
    :   `AUTO`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove leading and trailing white space from strings.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space
    rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string
    of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

    This file format option is applied to the following actions only when loading Parquet data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`USE_LOGICAL_TYPE = TRUE | FALSE`
:   Boolean that specifies whether to use Parquet logical types. With this file format option, Snowflake can interpret Parquet logical types during data loading. For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). To enable Parquet logical types, set USE_LOGICAL_TYPE as TRUE when you create a new file format option.

    Default:
    :   `FALSE`

`USE_VECTORIZED_SCANNER = TRUE | FALSE`
:   Boolean that specifies whether to use a vectorized scanner for loading Parquet files.

    The default value is `FALSE`. In a future BCR, the default value will be `TRUE`. We recommend that you set `USE_VECTORIZED_SCANNER = TRUE` for new workloads, and set it for existing workloads after testing.

    Using the vectorized scanner can significantly reduce the latency for loading Parquet files, because this scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

    If `USE_VECTORIZED_SCANNER` is set to `TRUE`, the vectorized scanner has the following behaviors:

    > * The `BINARY_AS_TEXT` option is always treated as `FALSE` and the `USE_LOGICAL_TYPE` option is always treated as `TRUE`, no matter what the actual value is being set to.
    > * The vectorized scanner supports Parquet map types. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >   {
    >   >    "k1": "v1",
    >   >    "k2": "v2"
    >   >   }
    >   > ```
    > * The vectorized scanner shows `NULL` values in the output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "nickname": null,
    >   >   "age": 34,
    >   >   "phone_numbers":
    >   >   [
    >   >     "1234567890",
    >   >     "0987654321",
    >   >     null,
    >   >     "6781234590"
    >   >   ]
    >   >   }
    >   > ```
    > * The vectorized scanner handles Time and Timestamp as follows:
    >
    >   > | Parquet | Snowflake vectorized scanner |
    >   > | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS/NANOS) | TIME |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_LTZ |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_NTZ |
    >   > | INT96 | TIMESTAMP_LTZ |

    If `USE_VECTORIZED_SCANNER` is set to `FALSE`, the scanner has the following behaviors:

    > * This option does not support Parquet maps. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >  {
    >   >   "key_value":
    >   >   [
    >   >    {
    >   >           "key": "k1",
    >   >           "value": "v1"
    >   >       },
    >   >       {
    >   >           "key": "k2",
    >   >           "value": "v2"
    >   >       }
    >   >     ]
    >   >   }
    >   > ```
    > * This option does not explicitly show `NULL` values in the scan output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "age": 34
    >   >   "phone_numbers":
    >   >   [
    >   >    "1234567890",
    >   >    "0987654321",
    >   >    "6781234590"
    >   >   ]
    >   >  }
    >   > ```
    > * This option handles Time and Timestamp as follows:
    >
    >   > | Parquet | When USE_LOGICAL_TYPE = TRUE | When USE_LOGICAL_TYPE = FALSE |
    >   > | --- | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS) | TIME | + TIME (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=NANOS) | TIME | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS) | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
    >   > | TimestampType(isAdjustedToUtc=True, unit=NANOS) | TIMESTAMP_LTZ | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS) | TIMESTAMP_NTZ | + TIMESTAMP_LTZ (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimestampType(isAdjustedToUtc=False, unit=NANOS) | TIMESTAMP_NTZ | INTEGER |
    >   > | INT96 | TIMESTAMP_NTZ | TIMESTAMP_NTZ |

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To specify more than
    one string, enclose the list of strings in parentheses and use commas to separate each value.

    This file format option is applied to the following actions only when loading Parquet data into separate columns using the
    MATCH_BY_COLUMN_NAME copy option.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = XML

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be loaded. Snowflake uses this option to detect how already-compressed data files were compressed so that the
    compressed data in the files can be extracted for loading.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If loading Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Data files to load have not been compressed. |

    Default:
    :   `AUTO`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (that is, “replacement character”).

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`PRESERVE_SPACE = TRUE | FALSE`
:   Boolean that specifies whether the XML parser preserves leading and trailing spaces in element content.

    Default:
    :   `FALSE`

`STRIP_OUTER_ELEMENT = TRUE | FALSE`
:   Boolean that specifies whether the XML parser strips out the outer XML element, exposing 2nd level elements as separate documents.

    Default:
    :   `FALSE`

`DISABLE_AUTO_CONVERT = TRUE | FALSE`
:   Boolean that specifies whether the XML parser disables automatic conversion of numeric and Boolean values from text to native representation.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). The copy
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

    If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

## Copy options (`copyOptions`)

You can specify one or more of the following copy options (separated by blank spaces, commas, or new lines):

`CLUSTER_AT_INGEST_TIME = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether to pre-cluster data directly during ingestion for tables that are configured with clustering keys.

        When set to `TRUE`, this option lets [Snowpipe Streaming (with high-performance architecture)](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) sort data that is based on the target table’s clustering keys before the data is committed. This significantly improves query performance on the target table by ensuring data is optimally organized upon ingestion.

        This feature is only available with [Snowpipe Streaming’s high-performance architecture](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md). The target table must be configured with clustering keys defined for this option to have an effect.

    Default:
    :   `FALSE`

    > **Important:**
    >
    > When using the pre-clustering feature, ensure that you do not disable the auto-clustering feature on the destination table. Disabling auto-clustering can lead to degraded query performance over time.

    Example:
    :   ```sqlexample
        CREATE OR REPLACE PIPE TEST_PRECLUSTERED_PIPE
        AS
            COPY INTO TEST_PRECLUSTERED_TABLE (num) FROM (
                    SELECT $1:num::number as num FROM TABLE(
                        DATA_SOURCE(
                            TYPE => 'STREAMING')
                    ))
              CLUSTER_AT_INGEST_TIME=TRUE;
        ```

`ENFORCE_LENGTH = TRUE | FALSE`
:   Definition:
    :   Alternative syntax for `TRUNCATECOLUMNS` with reverse logic (for compatibility with other systems)

        Boolean that specifies whether to truncate text strings that exceed the target column length:

        * If `TRUE`, the COPY statement produces an error if a loaded string exceeds the target column length.
        * If `FALSE`, the strings are automatically truncated to the target column length.

        This copy option supports CSV data and string values in semi-structured data when they are loaded into separate columns in relational tables.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > * If the length of the target string column is set to the maximum — for example, `VARCHAR (134217728)` — an incoming string can’t exceed this length; otherwise, the COPY command produces an error.
    > * This parameter is functionally equivalent to `TRUNCATECOLUMNS`, but has the opposite behavior. It is provided for compatibility with other databases. It is only necessary to include one of these two
    >   parameters in a COPY statement to produce the output that you want.

`FORCE = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies to load all files, regardless of whether they were loaded previously and haven’t changed after they were loaded. This option reloads files, potentially duplicating data in a table.

    Default:
    :   `FALSE`

`INCLUDE_METADATA = ( column_name = METADATA$field [ , column_name = METADATA$field ... ] )`
:   Definition:
    :   A user-defined mapping between a target table’s existing columns to its METADATA$ columns. This copy option can only be used with the MATCH_BY_COLUMN_NAME copy option. The following list shows the valid input for `METADATA$field`:

        * METADATA$FILENAME
        * METADATA$FILE_ROW_NUMBER
        * METADATA$FILE_CONTENT_KEY
        * METADATA$FILE_LAST_MODIFIED
        * METADATA$START_SCAN_TIME

        For more information about metadata columns, see [Query metadata for staged files](../../user-guide/querying-metadata.md).

        When a mapping is defined with this copy option, the column `column_name` is populated with the specified metadata value, as the following example shows:

        > ```sqlexample
        > COPY INTO table1 FROM @stage1
        > MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE
        > INCLUDE_METADATA = (
        >     ingestdate = METADATA$START_SCAN_TIME, filename = METADATA$FILENAME);
        > ```
        >
        > ```output
        > +-----+-----------------------+---------------------------------+-----+
        > | ... | FILENAME              | INGESTDATE                      | ... |
        > |---------------------------------------------------------------+-----|
        > | ... | example_file.json.gz  | Thu, 22 Feb 2024 19:14:55 +0000 | ... |
        > +-----+-----------------------+---------------------------------+-----+
        > ```

    Default:
    :   NULL

    > **Note:**
    >
    > * The `INCLUDE_METADATA` target column name must first exist in the table. The target column name is not automatically added if it doesn’t exist.
    > * Use a unique column name for the `INCLUDE_METADATA` columns. If the `INCLUDE_METADATA` target column has a name conflict with a column in the data file, the `METADATA$` value that is defined by `INCLUDE_METADATA` takes precedence.
    > * When you load a CSV file with `INCLUDE_METADATA`, set the file format option `ERROR_ON_COLUMN_COUNT_MISMATCH` to `FALSE`.

`LOAD_MODE = { FULL_INGEST | ADD_FILES_COPY }`
:   Definition:
    :   Specifies the mode to use when you load data from Parquet files into a Snowflake-managed [Iceberg table](../../user-guide/tables-iceberg.md).

        * `FULL_INGEST`: Snowflake scans the files and rewrites the Parquet data under the base location of the Iceberg table.
          Use this option if you need to transform or convert the data before you register the files to your Iceberg table.
        * `ADD_FILES_COPY`: Snowflake performs a server-side copy of the original Parquet files into the base location of the Iceberg table,
          then registers the files to the table. This action enables cross-region or cross-cloud ingestion of raw Parquet files into Iceberg tables.

          > **Note:**
          >
          > The `ADD_FILES_COPY` option is only supported when you load data from Iceberg-compatible raw Parquet files without transformation.
          > A raw Iceberg-compatible Parquet file isn’t registered with an Iceberg catalog, but contains Iceberg compatible data types.
          >
          > Use this option to avoid file-read overhead. To minimize storage costs, use `PURGE = TRUE` with this option.
          > Doing so tells Snowflake to automatically remove the data files from the original location after the data is loaded successfully.

    For additional usage notes, see the LOAD_MODE usage notes.
    For examples, see Loading Iceberg-compatible Parquet data into an Iceberg table.

    Default:
    :   `FULL_INGEST`

`LOAD_UNCERTAIN_FILES = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies to load files for which the load status is unknown. The COPY command skips these files by default.

        The load status is unknown if all of the following conditions are true:

        * The file’s LAST_MODIFIED date (that is, the date when the file was staged) is older than 64 days.
        * The initial set of data was loaded into the table more than 64 days earlier.
        * If the file was already loaded successfully into the table, this event occurred more than 64 days earlier.

        To force the COPY command to load all files regardless of whether the load status is known, use the `FORCE` option instead.

        For more information about load status uncertainty, see [Loading older files](../../user-guide/data-load-considerations-load.md).

    Default:
    :   `FALSE`

`MATCH_BY_COLUMN_NAME = CASE_SENSITIVE | CASE_INSENSITIVE | NONE`
:   Definition:
    :   String that specifies whether to load semi-structured data into columns in the target table that match corresponding columns represented in the data.

        > **Important:**
        >
        > Do not use the MATCH_BY_COLUMN_NAME copy option with a SELECT statement for transforming data during a load in all cases. These two options can still be used separately, but can’t be used together. Any attempt to do so will result in the following error: `SQL compilation error: match_by_column_name is not supported with copy transform.`.
        >
        > For example, the following syntax is not allowed:
        >
        > ```sqlexample
        > COPY INTO [<namespace>.]<table_name> [ ( <col_name> [ , <col_name> ... ] ) ]
        > FROM ( SELECT [<alias>.]$<file_col_num>[.<element>] [ , [<alias>.]$<file_col_num>[.<element>] ... ]
        >     FROM { internalStage | externalStage } )
        > [ FILES = ( '<file_name>' [ , '<file_name>' ] [ , ... ] ) ]
        > [ PATTERN = '<regex_pattern>' ]
        > [ FILE_FORMAT = ( { FORMAT_NAME = '[<namespace>.]<file_format_name>' |
        >             TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] } ) ]
        > MATCH_BY_COLUMN_NAME = CASE_SENSITIVE | CASE_INSENSITIVE | NONE
        > [ other copyOptions ]
        > ```
        >
        > For more information, see [Transforming Data During a Load](../../user-guide/data-load-transform.md).

        This copy option is supported for the following data formats:

        * JSON
        * Avro
        * ORC
        * Parquet
        * CSV

        For a column to match, the following criteria must be true:

        * The column represented in the data must have the exact same name as the column in the table. The copy option supports case sensitivity for column names. Column order does not matter.
        * The column in the table must have a data type that is compatible with the values in the column represented in the data. For example, string, number, and Boolean values can all be loaded into a variant column.

    Values:
    :   `CASE_SENSITIVE` | `CASE_INSENSITIVE`
        :   Load semi-structured data into columns in the target table that match corresponding columns represented in the data. Column names are either case-sensitive (`CASE_SENSITIVE`) or case-insensitive (`CASE_INSENSITIVE`).

            The COPY operation verifies that at least one column in the target table matches a column represented in the data files. If a match is found, the values in the data files are loaded into the column or columns. If no match is found, a set of NULL values for each record in the files is loaded into the table.

            > **Note:**
            >
            > * If additional non-matching columns are present in the data files, the values in these columns are not loaded.
            > * If additional non-matching columns are present in the target table, the COPY operation inserts NULL values into these columns. These columns must support NULL values.

        `NONE`
        :   The COPY operation loads the semi-structured data into a variant column or, if a query is included in the COPY statement, transforms the data.

    Default:
    :   `NONE`

    > **Note:**
    >
    > The following limitations currently apply:
    >
    > > * MATCH_BY_COLUMN_NAME can’t be used with the `VALIDATION_MODE` parameter in a COPY statement to validate the staged data rather than load it into the target table.
    > > * Parquet data only. When MATCH_BY_COLUMN_NAME is set to `CASE_SENSITIVE` or `CASE_INSENSITIVE`, an empty column value (for example, `"col1": ""`) produces an error.

`ON_ERROR = CONTINUE | SKIP_FILE | SKIP_FILE_num | 'SKIP_FILE_num%' | ABORT_STATEMENT`
:   Use:
    :   Data loading only

    Definition:
    :   String (constant) that specifies the error handling for the load operation.

        > **Important:**
        >
        > Carefully consider the ON_ERROR copy option value. The default value is appropriate in common scenarios, but isn’t always the best
        > option.

    Values:
    :   * `CONTINUE`

          > > Continue to load the file if errors are found. The COPY statement returns an error message for a maximum of one error found per data file.
          >
          > The difference between the ROWS_PARSED and ROWS_LOADED column values represents the number of rows that include detected errors. However, each of these rows could include multiple errors. To view all the errors in the data files, use the VALIDATION_MODE parameter or query the [VALIDATE](../functions/validate.md) function.
        * `SKIP_FILE`

          > > Skip a file when an error is found.
          >
          > The `SKIP_FILE` action buffers an entire file whether errors are found or not. For this reason, `SKIP_FILE` is slower than either `CONTINUE` or `ABORT_STATEMENT`. If you skip large files because of a small number of errors, this could result in delays and wasted credits. When you load large numbers of records from files that have no logical delineation — for example, the files were generated automatically at rough intervals — consider specifying `CONTINUE` instead.
          >
          > Additional patterns:
          >
          > `SKIP_FILE_num` (for example, `SKIP_FILE_10`)
          > :   Skip a file when the number of error rows found in the file is equal to or exceeds the specified number.
          >
          > `'SKIP_FILE_num%'` (for example, `'SKIP_FILE_10%'`)
          > :   Skip a file when the percentage of error rows found in the file exceeds the specified percentage.
        * `ABORT_STATEMENT`

          > > Stop the load operation if any error is found in a data file.
          >
          > The load operation is stopped only when the data files that were explicitly specified in the `FILES` parameter can’t be found. Otherwise, the load operation is not stopped if the data file can’t be found; for example, because it doesn’t exist or can’t be accessed.
          >
          > The terminated operations don’t show up in [COPY_HISTORY](../functions/copy_history.md) as the data files weren’t ingested. We recommend that you search for the failures in [QUERY_HISTORY](../functions/query_history.md).

    Default:
    :   Bulk loading using COPY:
        :   `ABORT_STATEMENT`

        Snowpipe:
        :   `SKIP_FILE`

`PURGE = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether to remove the data files from the stage automatically after the data is loaded successfully.

        If this option is set to `TRUE`, an attempt is made to remove successfully loaded data files. If the purge operation fails for any reason, no error is returned currently. We recommend that you list staged files periodically (using [LIST](list.md)) and manually remove successfully loaded files, if any exist.

    Default:
    :   `FALSE`

`RETURN_FAILED_ONLY = TRUE | FALSE`
:   Definition:
    :   Boolean that specifies whether to return only files that have failed to load in the statement result.

    Default:
    :   `FALSE`

`SIZE_LIMIT = num`
:   Definition:
    :   Number (> 0) that specifies the maximum size (in bytes) of data to be loaded for a given COPY statement. When the threshold is exceeded, the COPY operation discontinues loading files. This option is commonly used to load a common group of files by using multiple COPY statements. For each statement, the data load continues until the specified `SIZE_LIMIT` is exceeded, before moving on to the next statement.

        For example, suppose a set of files in a stage path were each 10 MB in size. If multiple COPY statements set SIZE_LIMIT to `25000000` (25 MB), each would load 3 files. That is, each COPY operation would discontinue after the `SIZE_LIMIT` threshold was exceeded.

        At least one file is loaded regardless of the value specified for `SIZE_LIMIT`, unless there is no file to be loaded.

    Default:
    :   null (no size limit)

`TRUNCATECOLUMNS = TRUE | FALSE`
:   Definition:
    :   Alternative syntax for `ENFORCE_LENGTH` with reverse logic (for compatibility with other systems)

        Boolean that specifies whether to truncate text strings that exceed the target column length:

        * If `TRUE`, strings are automatically truncated to the target column length.
        * If `FALSE`, the COPY statement produces an error if a loaded string exceeds the target column length.

        This copy option supports CSV data and string values in semi-structured data when they are loaded into separate columns in relational tables.

    Default:
    :   `FALSE`

    > **Note:**
    >
    > * If the length of the target string column is set to the maximum — for example, `VARCHAR (134217728)`— an incoming string can’t exceed this length; otherwise, the COPY command produces an error.
    > * This parameter is functionally equivalent to `ENFORCE_LENGTH`, but has the opposite behavior. It is provided for compatibility with other databases. It is only necessary to include one of these two
    >   parameters in a COPY statement to produce the output that you want.

## Usage notes

* Some use cases are not fully supported and can lead to inconsistent or unexpected ON_ERROR behavior, including the
  following use cases:

  + Specifying the DISTINCT keyword in SELECT statements.
  + Using COPY with clustered tables.
* For [partitioned Iceberg tables](../../user-guide/tables-iceberg-metadata.md):

  + A COPY job fails if Snowflake encounters an error on a partition transform, even if
    you’ve set `ON_ERROR = CONTINUE`.
  + LOAD_MODE = ADD_FILES_COPY is not supported.
* When you load CSV data, if [a stream](../../user-guide/streams-intro.md) is on the target table, the ON_ERROR copy option might not work as expected.
* Loading from Google Cloud Storage only: The list of objects returned for an external stage might include one or more “directory blobs”;
  essentially, paths that end in a forward slash character (`/`), e.g.:

  ```sqlexample
  LIST @my_gcs_stage;

  +---------------------------------------+------+----------------------------------+-------------------------------+
  | name                                  | size | md5                              | last_modified                 |
  |---------------------------------------+------+----------------------------------+-------------------------------|
  | my_gcs_stage/load/                    |  12  | 12348f18bcb35e7b6b628ca12345678c | Mon, 11 Sep 2019 16:57:43 GMT |
  | my_gcs_stage/load/data_0_0_0.csv.gz   |  147 | 9765daba007a643bdff4eae10d43218y | Mon, 11 Sep 2019 18:13:07 GMT |
  +---------------------------------------+------+----------------------------------+-------------------------------+
  ```

  These blobs are listed when directories are created in the Google Cloud console rather than using any other tool provided by Google.

  COPY statements that reference a stage can fail when the object list includes directory blobs. To avoid errors, we recommend using file
  pattern matching to identify the files for inclusion (i.e. the PATTERN clause) when the file list for a stage includes directory blobs. For
  an example, see Loading Using Pattern Matching (in this topic). Alternatively, set ON_ERROR = SKIP_FILE in the COPY statement.
* `STORAGE_INTEGRATION`, `CREDENTIALS`, and `ENCRYPTION` only apply if you are loading directly from a private/protected
  storage location:

  + If you are loading from a public bucket, secure access is not required.
  + If you are loading from a named external stage, the stage provides all the credential information required for accessing the bucket.
* If you encounter errors while running the COPY command, after the command completes, you can validate the files that produced the errors
  using the [VALIDATE](../functions/validate.md) table function.

  > **Note:**
  >
  > The VALIDATE function only returns output for COPY commands used to perform standard data loading; it does not support COPY commands that
  > perform transformations during data loading (e.g. loading a subset of data columns or reordering data columns).
* Unless you explicitly specify `FORCE = TRUE` as one of the copy options, the command ignores staged data files that were already
  loaded into the table. To reload the data, you must either specify `FORCE = TRUE` or modify the file and stage it again, which
  generates a new checksum.
* The COPY command does not validate data type conversions for Parquet files.
* For information about loading hybrid tables, see [Loading data](../../user-guide/tables-hybrid-create.md).

* `VALIDATION_MODE` isn’t supported for Iceberg tables.
* Loading from Iceberg-compatible Parquet files using `LOAD_MODE`:

  + You must fulfill the following prerequisites when using the `LOAD_MODE = ADD_FILES_COPY` option:

    - The target table must be a Snowflake-managed Iceberg table with column data types that are compatible with the source Parquet file data types.
      For more information, see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).
    - The source file format type must be Iceberg-compatible Parquet, and you must use a vectorized scanner: `FILE_FORMAT = ( TYPE = PARQUET USE_VECTORIZED_SCANNER = TRUE)`.
    - You must use case-sensitive column names with the `LOAD_MODE = ADD_FILES_COPY` option.

      * Create your Iceberg table with the case-sensitive column names, enclosed in double quotes, in your CREATE ICEBERG TABLE statement.
      * Set the `MATCH_BY_COLUMN_NAME` option to `CASE_SENSITIVE`.
  + The following options aren’t supported when you use `LOAD_MODE = ADD_FILES_COPY`:

    - Copying unstaged data by specifying a cloud storage location and a storage integration.
    - Any file format configuration *other than* `FILE_FORMAT = ( TYPE = PARQUET USE_VECTORIZED_SCANNER = TRUE)`.
    - `MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE | NONE`.
    - `ON_ERROR = CONTINUE | SKIP_FILE_N | SKIP_FILE_X%`.
    - Transforming or filtering the data before loading. To transform the data, use the FULL_INGEST option instead.
  + For `ADD_FILES_COPY`, using a larger warehouse does not significantly decrease the duration of the COPY query. The
    majority of the COPY operation relies on Cloud Services compute resources.
  + To load the row lineage metadata columns in Parquet files, which are
    `_row_id` and `_last_updated_sequence_number`, you must use the FULL_INGEST option. The other LOAD_MODE options aren’t supported.

    - Parquet files that contain row lineage are likely already part of an Iceberg v3 table. Registering Parquet files by using ADD_FILES_COPY is
      not recommended if those files are already part of another Iceberg table. The best practice for converting externally managed Iceberg
      tables to Snowflake-managed Iceberg tables without rewriting files is to use the [ALTER ICEBERG TABLE … CONVERT TO MANAGED](alter-iceberg-table-convert-to-managed.md)
      command.
* To run this command with an external stage that uses a storage integration,
  you must use a role that has or inherits the USAGE privilege on the storage integration.

  For more information, see [Stage privileges](../../user-guide/security-access-control-privileges.md).
* For [outbound private connectivity](../../user-guide/private-connectivity-outbound.md), loading directly from an external location (external
  storage URI) isn’t supported. Instead, use an external stage with a storage integration configured for outbound private connectivity.

## Output

The command returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE | TEXT | Name of source file and relative path to the file |
| STATUS | TEXT | Status: loaded, load failed or partially loaded |
| ROWS_PARSED | NUMBER | Number of rows parsed from the source file |
| ROWS_LOADED | NUMBER | Number of rows loaded from the source file |
| ERROR_LIMIT | NUMBER | If the number of errors reaches this limit, then abort |
| ERRORS_SEEN | NUMBER | Number of error rows in the source file |
| FIRST_ERROR | TEXT | First error of the source file |
| FIRST_ERROR_LINE | NUMBER | Line number of the first error |
| FIRST_ERROR_CHARACTER | NUMBER | Position of the first error character |
| FIRST_ERROR_COLUMN_NAME | TEXT | Column name of the first error |

## Examples

For examples of data loading transformations, see [Transform data during a load](../../user-guide/data-load-transform.md).

### Loading files from an internal stage

> **Note:**
>
> These examples assume the files were copied to the stage earlier using the [PUT](put.md) command.

Load files from a named internal stage into a table:

> ```sqlexample
> COPY INTO mytable
> FROM @my_int_stage;
> ```

Load files from a table’s stage into the table:

> ```sqlexample
> COPY INTO mytable
> FILE_FORMAT = (TYPE = CSV);
> ```
>
> > **Note:**
> >
> > When copying data from files in a table location, the FROM clause can be omitted because Snowflake automatically checks for files in the
> > table’s location.

Load files from the user’s personal stage into a table:

> ```sqlexample
> COPY INTO mytable from @~/staged
> FILE_FORMAT = (FORMAT_NAME = 'mycsv');
> ```

### Loading files from a named external stage

Load files from a named external stage that you created previously using the [CREATE STAGE](create-stage.md) command. The named
external stage references an external location (Amazon S3, Google Cloud Storage, or Microsoft Azure) and includes all the credentials and
other details required for accessing the location:

> ```sqlexample
> COPY INTO mycsvtable
>   FROM @my_ext_stage/tutorials/dataloading/contacts1.csv;
> ```

### Loading files using column matching

Load files from a named external stage into the table with the `MATCH_BY_COLUMN_NAME` copy option, by case-insensitive matching the column names in the files to the column names defined in the table. With this option, the column ordering of the file does not need to match the column ordering of the table.

> ```sqlexample
> COPY INTO mytable
>   FROM @my_ext_stage/tutorials/dataloading/sales.json.gz
>   FILE_FORMAT = (TYPE = 'JSON')
>   MATCH_BY_COLUMN_NAME='CASE_INSENSITIVE';
> ```

### Loading files directly from an external location

> **Note:**
>
> This option isn’t supported for [outbound private connectivity](../../user-guide/private-connectivity-outbound.md).
> Instead, use an external stage.

The following example loads all files prefixed with `data/files` from a storage location (Amazon S3, Google Cloud Storage, or
Microsoft Azure) using a named `my_csv_format` file format:

**Amazon S3**

> Access the referenced S3 bucket using a referenced storage integration named `myint`. Note that both examples truncate the
> `MASTER_KEY` value:
>
> ```sqlexample
> COPY INTO mytable
>   FROM s3://mybucket/data/files
>   STORAGE_INTEGRATION = myint
>   ENCRYPTION=(MASTER_KEY = 'eSx...')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```
>
> Access the referenced S3 bucket using supplied credentials:
>
> ```sqlexample
> COPY INTO mytable
>   FROM s3://mybucket/data/files
>   CREDENTIALS=(AWS_KEY_ID='$AWS_ACCESS_KEY_ID' AWS_SECRET_KEY='$AWS_SECRET_ACCESS_KEY')
>   ENCRYPTION=(MASTER_KEY = 'eSx...')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

**Google Cloud Storage**

> Access the referenced GCS bucket using a referenced storage integration named `myint`:
>
> ```sqlexample
> COPY INTO mytable
>   FROM 'gcs://mybucket/data/files'
>   STORAGE_INTEGRATION = myint
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

**Microsoft Azure**

> Access the referenced container using a referenced storage integration named `myint`. Note that both examples truncate the
> `MASTER_KEY` value:
>
> ```sqlexample
> COPY INTO mytable
>   FROM 'azure://myaccount.blob.core.windows.net/data/files'
>   STORAGE_INTEGRATION = myint
>   ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'kPx...')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```
>
> Access the referenced container using supplied credentials:
>
> ```sqlexample
> COPY INTO mytable
>   FROM 'azure://myaccount.blob.core.windows.net/mycontainer/data/files'
>   CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=bgqQwoXwxzuD2GJfagRg7VOS8hzNr3QLT7rhS8OFRLQ%3D')
>   ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'kPx...')
>   FILE_FORMAT = (FORMAT_NAME = my_csv_format);
> ```

### Loading using pattern matching

Load files from a table’s stage into the table, using pattern matching to only load data from compressed CSV files in any path:

> ```sqlexample
> COPY INTO mytable
>   FILE_FORMAT = (TYPE = 'CSV')
>   PATTERN='.*/.*/.*[.]csv[.]gz';
> ```

Where `.*` is interpreted as “zero or more occurrences of any character.” The square brackets escape the period character (`.`)
that precedes a file extension.

Load files from a table stage into the table using pattern matching to only load uncompressed CSV files whose names include the string
`sales`:

> ```sqlexample
> COPY INTO mytable
>   FILE_FORMAT = (FORMAT_NAME = myformat)
>   PATTERN='.*sales.*[.]csv';
> ```

### Loading JSON data into a VARIANT column

The following example loads JSON data into a table with a single column of type VARIANT.

The staged JSON array comprises three objects separated by new lines:

> > ```sqljson
> > [{
> >     "location": {
> >       "city": "Lexington",
> >       "zip": "40503",
> >       },
> >       "sq__ft": "1000",
> >       "sale_date": "4-25-16",
> >       "price": "75836"
> > },
> > {
> >     "location": {
> >       "city": "Belmont",
> >       "zip": "02478",
> >       },
> >       "sq__ft": "1103",
> >       "sale_date": "6-18-16",
> >       "price": "92567"
> > }
> > {
> >     "location": {
> >       "city": "Winchester",
> >       "zip": "01890",
> >       },
> >       "sq__ft": "1122",
> >       "sale_date": "1-31-16",
> >       "price": "89921"
> > }]
> > ```
>
> ```sqlexample
> /* Create a JSON file format that strips the outer array. */
>
> CREATE OR REPLACE FILE FORMAT json_format
>   TYPE = 'JSON'
>   STRIP_OUTER_ARRAY = TRUE;
>
> /* Create an internal stage that references the JSON file format. */
>
> CREATE OR REPLACE STAGE mystage
>   FILE_FORMAT = json_format;
>
> /* Stage the JSON file. */
>
> PUT file:///tmp/sales.json @mystage AUTO_COMPRESS=TRUE;
>
> /* Create a target table for the JSON data. */
>
> CREATE OR REPLACE TABLE house_sales (src VARIANT);
>
> /* Copy the JSON data into the target table. */
>
> COPY INTO house_sales
>    FROM @mystage/sales.json.gz;
>
> SELECT * FROM house_sales;
>
> +---------------------------+
> | SRC                       |
> |---------------------------|
> | {                         |
> |   "location": {           |
> |     "city": "Lexington",  |
> |     "zip": "40503"        |
> |   },                      |
> |   "price": "75836",       |
> |   "sale_date": "4-25-16", |
> |   "sq__ft": "1000",       |
> |   "type": "Residential"   |
> | }                         |
> | {                         |
> |   "location": {           |
> |     "city": "Belmont",    |
> |     "zip": "02478"        |
> |   },                      |
> |   "price": "92567",       |
> |   "sale_date": "6-18-16", |
> |   "sq__ft": "1103",       |
> |   "type": "Residential"   |
> | }                         |
> | {                         |
> |   "location": {           |
> |     "city": "Winchester", |
> |     "zip": "01890"        |
> |   },                      |
> |   "price": "89921",       |
> |   "sale_date": "1-31-16", |
> |   "sq__ft": "1122",       |
> |   "type": "Condo"         |
> | }                         |
> +---------------------------+
> ```

### Reloading files

Add `FORCE = TRUE` to a COPY command to reload (duplicate) data from a set of staged data files that have not changed (i.e. have
the same checksum as when they were first loaded).

In the following example, the first command loads the specified files and the second command forces the same files to be loaded again
(producing duplicate rows), even though the contents of the files have not changed:

> ```sqlexample
> COPY INTO load1 FROM @%load1/data1/
>     FILES=('test1.csv', 'test2.csv');
>
> COPY INTO load1 FROM @%load1/data1/
>     FILES=('test1.csv', 'test2.csv')
>     FORCE=TRUE;
> ```

### Purging files after loading

Load files from a table’s stage into the table and purge files after loading. By default, COPY does not purge loaded files from the
location. To purge the files after loading:

* Make sure your account has write access to the bucket or container where the files are stored.
* Set `PURGE=TRUE` for the table to specify that all files successfully loaded into the table are purged after loading:

  > ```sqlexample
  > ALTER TABLE mytable SET STAGE_COPY_OPTIONS = (PURGE = TRUE);
  >
  > COPY INTO mytable;
  > ```
* You can also override any of the copy options directly in the COPY command:

  > ```sqlexample
  > COPY INTO mytable PURGE = TRUE;
  > ```

After the files are loaded into the table, the files are deleted from the bucket or container from where they are stored. After the files have begun the deletion process, the query cannot be cancelled.

### Validating staged files

Validate files in a stage without loading:

* Run the COPY command in validation mode and see all errors:

  > ```sqlexample
  > COPY INTO mytable VALIDATION_MODE = 'RETURN_ERRORS';
  >
  > +-------------------------------------------------------------------------------------------------------------------------------+------------------------+------+-----------+-------------+----------+--------+-----------+----------------------+------------+----------------+
  > |                                                         ERROR                                                                 |            FILE        | LINE | CHARACTER | BYTE_OFFSET | CATEGORY |  CODE  | SQL_STATE |   COLUMN_NAME        | ROW_NUMBER | ROW_START_LINE |
  > +-------------------------------------------------------------------------------------------------------------------------------+------------------------+------+-----------+-------------+----------+--------+-----------+----------------------+------------+----------------+
  > | Field delimiter ',' found while expecting record delimiter '\n'                                                               | @MYTABLE/data1.csv.gz  | 3    | 21        | 76          | parsing  | 100016 | 22000     | "MYTABLE"["QUOTA":3] | 3          | 3              |
  > | NULL result in a non-nullable column. Use quotes if an empty field should be interpreted as an empty string instead of a null | @MYTABLE/data3.csv.gz  | 3    | 2         | 62          | parsing  | 100088 | 22000     | "MYTABLE"["NAME":1]  | 3          | 3              |
  > | End of record reached while expected to parse column '"MYTABLE"["QUOTA":3]'                                                   | @MYTABLE/data3.csv.gz  | 4    | 20        | 96          | parsing  | 100068 | 22000     | "MYTABLE"["QUOTA":3] | 4          | 4              |
  > +-------------------------------------------------------------------------------------------------------------------------------+------------------------+------+-----------+-------------+----------+--------+-----------+----------------------+------------+----------------+
  > ```
* Run the COPY command in validation mode for a specified number of rows. In this example, the first run encounters no errors in the
  specified number of rows and completes successfully, displaying the information as it will appear when loaded into the table. The
  second run encounters an error in the specified number of rows and fails with the error encountered:

  > ```sqlexample
  > COPY INTO mytable VALIDATION_MODE = 'RETURN_2_ROWS';
  >
  > +--------------------+----------+-------+
  > |        NAME        |    ID    | QUOTA |
  > +--------------------+----------+-------+
  > | Joe Smith          |  456111  | 0     |
  > | Tom Jones          |  111111  | 3400  |
  > +--------------------+----------+-------+
  >
  > COPY INTO mytable VALIDATION_MODE = 'RETURN_3_ROWS';
  >
  > FAILURE: NULL result in a non-nullable column. Use quotes if an empty field should be interpreted as an empty string instead of a null
  >   File '@MYTABLE/data3.csv.gz', line 3, character 2
  >   Row 3, column "MYTABLE"["NAME":1]
  > ```

### Loading Iceberg-compatible Parquet data into an Iceberg table

This example shows how to create an Iceberg table, and then load data into it from
Iceberg-compatible Parquet data files on an external stage.

> **Important:**
>
> Registering Parquet files by using ADD_FILES_COPY isn’t recommended if those files are already part of another Iceberg table. The best
> practice for converting externally managed Iceberg tables to Snowflake-managed Iceberg tables without rewriting files is to use the
> [ALTER ICEBERG TABLE … CONVERT TO MANAGED](alter-iceberg-table-convert-to-managed.md) command.

For demonstration purposes, this example uses the following resources:

* An external volume named `iceberg_ingest_vol`. To create
  an external volume, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
* An external stage named `my_parquet_stage` with Iceberg-compatible Parquet files on it. To create an external stage, see
  [CREATE STAGE](create-stage.md).

1. Create a file format object that describes the staged Parquet files, using the required configuration for copying
   Iceberg-compatible Parquet data (`TYPE = PARQUET USE_VECTORIZED_SCANNER = TRUE`):

   ```sqlexample
   CREATE OR REPLACE FILE FORMAT my_parquet_format
     TYPE = PARQUET
     USE_VECTORIZED_SCANNER = TRUE;
   ```
2. Create a Snowflake-managed Iceberg table, defining columns with data types that are compatible with the source Parquet file data types:

   This example uses case-sensitive column names. You must surround the column names in double quotes when you create the Iceberg table, and
   specify the column names exactly as they appear in your Parquet footer.

   ```sqlexample
   CREATE OR REPLACE ICEBERG TABLE customer_iceberg_ingest (
     "c_custkey" INTEGER,
     "c_name" STRING,
     "c_address" STRING,
     "c_nationkey" INTEGER,
     "c_phone" STRING,
     "c_acctbal" INTEGER,
     "c_mktsegment" STRING,
     "c_comment" STRING
   )
     CATALOG = 'SNOWFLAKE'
     EXTERNAL_VOLUME = 'iceberg_ingest_vol'
     BASE_LOCATION = 'customer_iceberg_ingest/';
   ```

   > **Note:**
   >
   > The example statement specifies Iceberg data types that map to Snowflake data types. For more information,
   > see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).
3. To load the data from the staged Parquet files, which are located directly under the stage URL path, into the Iceberg table, use a COPY INTO statement:

   In COPY INTO *<table>* statements with `LOAD_MODE = ADD_FILES_COPY`, only `MATCH_BY_COLUMN_NAME = CASE_SENSITIVE` is supported.

   ```sqlexample
   COPY INTO customer_iceberg_ingest
     FROM @my_parquet_stage
     FILE_FORMAT = 'my_parquet_format'
     LOAD_MODE = ADD_FILES_COPY
     PURGE = TRUE
     MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
   ```

   > **Note:**
   >
   > The example specifies `LOAD_MODE = ADD_FILES_COPY`, which tells Snowflake to copy the files into your external volume location,
   > and then register the files to the table.
   >
   > This option avoids file charges, because Snowflake doesn’t scan the source Parquet files and rewrite the data into new Parquet files.

   Output:

   ```output
   +---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   | file                                                          | status | rows_parsed | rows_loaded | error_limit | errors_seen | first_error | first_error_line | first_error_character | first_error_column_name |
   |---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------|
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_008.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_006.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_005.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_002.parquet | LOADED |           5 |           5 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   | my_parquet_stage/snow_af9mR2HShTY_AABspxOVwhc_0_1_010.parquet | LOADED |       15000 |       15000 |           0 |           0 | NULL        |             NULL |                  NULL | NULL                    |
   +---------------------------------------------------------------+--------+-------------+-------------+-------------+-------------+-------------+------------------+-----------------------+-------------------------+
   ```
4. Query the table:

   ```sqlexample
   SELECT
       c_custkey,
       c_name,
       c_mktsegment
     FROM customer_iceberg_ingest
     LIMIT 10;
   ```

   Output:

   ```output
   +-----------+--------------------+--------------+
   | C_CUSTKEY | C_NAME             | C_MKTSEGMENT |
   |-----------+--------------------+--------------|
   |     75001 | Customer#000075001 | FURNITURE    |
   |     75002 | Customer#000075002 | FURNITURE    |
   |     75003 | Customer#000075003 | MACHINERY    |
   |     75004 | Customer#000075004 | AUTOMOBILE   |
   |     75005 | Customer#000075005 | FURNITURE    |
   |         1 | Customer#000000001 | BUILDING     |
   |         2 | Customer#000000002 | AUTOMOBILE   |
   |         3 | Customer#000000003 | AUTOMOBILE   |
   |         4 | Customer#000000004 | MACHINERY    |
   |         5 | Customer#000000005 | HOUSEHOLD    |
   +-----------+--------------------+--------------+
   ```

### Loading files into v3 Apache Iceberg™ table

The following example loads files into an Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ table specification:

```sqlexample
COPY INTO my_v3_iceberg_table
  FROM @my_json_stage
  FILE_FORMAT = 'my_json_format'
  MATCH_BY_COLUMN_NAME = CASE_SENSITIVE;
```

---
title: CREATE <object>
source: https://docs.snowflake.com/en/sql-reference/sql/create.md
section: SQL Commands
---

# CREATE *<object>*

Creates a new object of the specified type.

See also:
:   [ALTER <object>](alter.md) , [DESCRIBE <object>](desc.md) , [SHOW <objects>](show.md)

## CREATE commands

For specific syntax, usage notes, and examples, see:

**Account Objects:**

> * [CREATE API INTEGRATION](create-api-integration.md)
> * [CREATE APPLICATION](create-application.md)
> * [CREATE APPLICATION PACKAGE](create-application-package.md)
> * [CREATE AUTHENTICATION POLICY](create-authentication-policy.md)
> * [CREATE CATALOG INTEGRATION](create-catalog-integration.md)
> * [CREATE COMPUTE POOL](create-compute-pool.md)
> * [CREATE CONNECTION](create-connection.md)
> * [CREATE DATABASE](create-database.md) , [CREATE DATABASE (catalog-linked)](create-database-catalog-linked.md) , [CREATE DATABASE … CLONE](create-clone.md)
> * [CREATE DATABASE ROLE](create-database-role.md)
> * [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)
> * [CREATE EXTERNAL VOLUME](create-external-volume.md)
> * [CREATE FAILOVER GROUP](create-failover-group.md)
> * [CREATE FEATURE POLICY](create-feature-policy.md)
> * [CREATE NETWORK POLICY](create-network-policy.md)
> * [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)
> * [CREATE ORGANIZATION PROFILE](create-organization-profile.md)
> * [CREATE POSTGRES INSTANCE](create-postgres-instance.md)
> * [CREATE PROVISIONED THROUGHPUT](create-provisioned-throughput.md)
> * [CREATE REPLICATION GROUP](create-replication-group.md)
> * [CREATE RESOURCE MONITOR](create-resource-monitor.md)
> * [CREATE ROLE](create-role.md)
> * [CREATE SECURITY INTEGRATION](create-security-integration.md)
> * [CREATE SHARE](create-share.md)
> * [CREATE STORAGE INTEGRATION](create-storage-integration.md)
> * [CREATE USER](create-user.md)
> * [CREATE WAREHOUSE](create-warehouse.md)

**Database Objects:**

> * [CREATE AGENT](create-agent.md)
> * [CREATE AGGREGATION POLICY](create-aggregation-policy.md)
> * [CREATE ALERT](create-alert.md)
> * [CREATE AUTHENTICATION POLICY](create-authentication-policy.md)
> * [CREATE BACKUP POLICY](create-backup-policy.md)
> * [CREATE BACKUP SET](create-backup-set.md)
> * [CREATE CONTACT](create-contact.md)
> * [CREATE CORTEX SEARCH SERVICE](create-cortex-search.md)
> * [CREATE DATA METRIC FUNCTION](create-data-metric-function.md)
> * [CREATE DATASET](create-dataset.md)
> * [CREATE DBT PROJECT](create-dbt-project.md)
> * [CREATE DCM PROJECT](create-dcm-project.md)
> * [CREATE DYNAMIC TABLE](create-dynamic-table.md)
> * [CREATE EVENT TABLE](create-event-table.md)
> * [CREATE EXPERIMENT](create-experiment.md)
> * [CREATE EXTERNAL FUNCTION](create-external-function.md)
> * [CREATE EXTERNAL TABLE](create-external-table.md)
> * [CREATE FILE FORMAT](create-file-format.md) , [CREATE FILE FORMAT … CLONE](create-clone.md)
> * [CREATE FUNCTION](create-function.md)
> * [CREATE GATEWAY](create-gateway.md)
> * [CREATE GIT REPOSITORY](create-git-repository.md)
> * [CREATE HYBRID TABLE](create-hybrid-table.md)
> * [CREATE ICEBERG TABLE](create-iceberg-table.md)
> * [CREATE INTERACTIVE TABLE](create-interactive-table.md)
> * [CREATE INTERACTIVE WAREHOUSE](create-interactive-warehouse.md)
> * [CREATE IMAGE REPOSITORY](create-image-repository.md)
> * [CREATE JOIN POLICY](create-join-policy.md)
> * [CREATE LISTING](create-listing.md)
> * [CREATE MAINTENANCE POLICY](create-maintenance-policy.md)
> * [CREATE MASKING POLICY](create-masking-policy.md)
> * [CREATE MATERIALIZED VIEW](create-materialized-view.md)
> * [CREATE MCP SERVER](create-mcp-server.md)
> * [CREATE MODEL](create-model.md)
> * [CREATE MODEL MONITOR](create-model-monitor.md)
> * [CREATE NETWORK RULE](create-network-rule.md)
> * [CREATE NOTEBOOK](create-notebook.md)
> * [CREATE NOTEBOOK PROJECT](create-notebook-project.md)
> * [CREATE ONLINE FEATURE TABLE](create-online-feature-table.md)
> * [CREATE ORGANIZATION LISTING](create-organization-listing.md)
> * [CREATE PACKAGES POLICY](create-packages-policy.md)
> * [CREATE PASSWORD POLICY](create-password-policy.md)
> * [CREATE PIPE](create-pipe.md)
> * [CREATE PRIVACY POLICY](create-privacy-policy.md)
> * [CREATE PROCEDURE](create-procedure.md)
> * [CREATE PROJECTION POLICY](create-projection-policy.md)
> * [CREATE ROW ACCESS POLICY](create-row-access-policy.md)
> * [CREATE SCHEMA](create-schema.md) , [CREATE SCHEMA … CLONE](create-clone.md)
> * [CREATE SECRET](create-secret.md)
> * [CREATE SEMANTIC VIEW](create-semantic-view.md)
> * [CREATE SEQUENCE](create-sequence.md) , [CREATE SEQUENCE … CLONE](create-clone.md)
> * [CREATE SERVICE](create-service.md)
> * [CREATE SESSION POLICY](create-session-policy.md)
> * [CREATE SNAPSHOT](create-snapshot.md)
> * [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md) (deprecated; prefer [CREATE BACKUP POLICY](create-backup-policy.md))
> * [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md) (deprecated; prefer [CREATE BACKUP SET](create-backup-set.md))
> * [CREATE STAGE](create-stage.md) , [CREATE STAGE … CLONE](create-clone.md)
> * [CREATE STORAGE LIFECYCLE POLICY](create-storage-lifecycle-policy.md)
> * [CREATE STREAM](create-stream.md) , [CREATE STREAM … CLONE](create-clone.md)
> * [CREATE STREAMLIT](create-streamlit.md),
> * [CREATE TABLE](create-table.md) , [CREATE TABLE … CLONE](create-clone.md)
> * [CREATE TAG](create-tag.md)
> * [CREATE TASK](create-task.md) , [CREATE TASK … CLONE](create-clone.md)
> * [CREATE TYPE](create-type.md)
> * [CREATE VIEW](create-view.md)

**Classes:**

> * [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../classes/anomaly-detection/commands/create-anomaly-detection.md)
> * [CREATE BUDGET](../classes/budget/commands/create-budget.md)
> * [CREATE SNOWFLAKE.ML.CLASSIFICATION](../classes/classification/commands/create-classification.md)
> * [CREATE CLASSIFICATION_PROFILE](../classes/classification_profile/commands/create-classification-profile.md)
> * [CREATE CUSTOM_CLASSIFIER](../classes/custom_classifier/commands/create-custom-classifier.md)
> * [CREATE SNOWFLAKE.ML.FORECAST](../classes/forecast/commands/create-forecast.md)

---
title: CREATE <object> … CLONE
source: https://docs.snowflake.com/en/sql-reference/sql/create-clone.md
section: SQL Commands
---

# CREATE *<object>* … CLONE

Creates a copy of an existing object in the system. This command is primarily used for creating
[zero-copy clones](../../user-guide/tables-storage-considerations.md) of databases, schemas, and tables.
You can also use this command to create clones of other schema objects, including
external stages, file formats, sequences, and database roles.

The command is a variation of the object-specific [CREATE <object>](create.md) commands with the addition of the `CLONE` keyword.

## Clone objects using Time Travel

For databases, schemas, and non-temporary tables, `CLONE` supports an additional `AT | BEFORE` clause for cloning using
[Time Travel](../../user-guide/data-time-travel.md).

For databases and schemas:

* `CLONE` supports the IGNORE TABLES WITH INSUFFICIENT DATA RETENTION parameter to skip any
  tables that have been purged from Time Travel (for example,
  transient tables with a one day data retention period).
* `CLONE` supports the IGNORE HYBRID TABLES parameter to skip hybrid tables, if required.

> **Note:**
>
> For information about cloning databases that contain hybrid tables, see [Clone databases that contain hybrid tables](../../user-guide/tables-hybrid-clone.md).

## Syntax

### Databases, schemas

```sqlsyntax
CREATE [ OR REPLACE ] { DATABASE | SCHEMA } [ IF NOT EXISTS ] <object_name>
  CLONE <source_object_name>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
    [ IGNORE TABLES WITH INSUFFICIENT DATA RETENTION ]
    [ IGNORE HYBRID TABLES ]
    [ INCLUDE INTERNAL STAGES ]
  ...
```

### Tables

```sqlsyntax
CREATE [ OR REPLACE ] TABLE [ IF NOT EXISTS ] <object_name>
  CLONE <source_object_name>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
  ...
```

### Dynamic tables

```sqlsyntax
CREATE [ OR REPLACE ] DYNAMIC TABLE <name>
  CLONE <source_dynamic_table>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
  [
    TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
    WAREHOUSE = <warehouse_name>
  ]
```

### Event tables

```sqlsyntax
CREATE [ OR REPLACE ] EVENT TABLE <name>
  CLONE <source_event_table>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
```

### Apache Iceberg™ tables

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <name>
  CLONE <source_iceberg_table>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
    [ COPY GRANTS ]
    ...
```

### Database roles

```sqlsyntax
CREATE [ OR REPLACE ] DATABASE ROLE [ IF NOT EXISTS ] <database_role_name>
  CLONE <source_database_role_name>
```

### Other schema objects

```sqlsyntax
CREATE [ OR REPLACE ] { ALERT | FILE FORMAT | SEQUENCE | STAGE | STREAM | TASK }
  [ IF NOT EXISTS ] <object_name>
  CLONE <source_object_name>
  ...
```

## Time Travel parameters

`{ AT | BEFORE } ( { TIMESTAMP => timestamp | OFFSET => time_difference | STATEMENT => id } )`
:   The [AT | BEFORE](../constructs/at-before.md) clause accepts one of the following parameters:

    `TIMESTAMP => timestamp`
    :   Specifies an exact date and time to use for Time Travel. The value must be explicitly cast to a TIMESTAMP,
        TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ data type.

        If no explicit cast is specified, the timestamp in the AT clause is treated as a timestamp with the UTC time zone (equivalent to
        TIMESTAMP_NTZ). Using the TIMESTAMP data type for an explicit cast may also result in the value being treated as a TIMESTAMP_NTZ
        value. For details, see [Date & time data types](../data-types-datetime.md).

    `OFFSET => time_difference`
    :   Specifies the difference in seconds from the current time to use for Time Travel, in the form `-N` where `N`
        can be an integer or arithmetic expression (e.g. `-120` is 120 seconds, `-30*60` is 1800 seconds or 30 minutes).

    `STATEMENT => id`
    :   Specifies the query ID of a statement to use as the reference point for Time Travel. This parameter supports any statement of one of the
        following types:

        * DML (e.g. INSERT, UPDATE, DELETE)
        * TCL (BEGIN, COMMIT transaction)
        * SELECT

        The query ID must reference a query that has been executed within the last 14 days. If the query ID references a query over 14 days old,
        the following error is returned:

        ```output
        Error: statement <query_id> not found
        ```

        To work around this limitation, use the timestamp for the referenced query.

`IGNORE TABLES WITH INSUFFICIENT DATA RETENTION`
:   Ignore tables that no longer have historical data available in Time Travel to clone. If the time in the past specified in the
    AT | BEFORE clause is beyond the data retention period for any child table in a database or schema, skip the cloning operation
    for the child table. For more information, see
    [Child Objects and Data Retention Time](../../user-guide/object-clone.md).

## Hybrid tables parameters

`IGNORE HYBRID TABLES`
:   Ignore hybrid tables when cloning a database or schema. The cloned database or schema includes other objects but skips hybrid tables.
    For more information, see [Clone databases that contain hybrid tables](../../user-guide/tables-hybrid-clone.md).

## Internal stage parameters

`INCLUDE INTERNAL STAGES`
:   Include named internal stages when cloning a database or schema.

    For more information, see the usage notes.

## Access control requirements

To create a clone, your current role must have the following privilege(s) on the source object:

> Databases:
> :   USAGE on the database.
>
> Database roles:
> :   OWNERSHIP on the database role and the CREATE DATABASE ROLE privilege on the target database.
>
> Schemas:
> :   If you specify the WITH MANAGED ACCESS clause, the required privileges depend on whether the source schema is a
>     managed or unmanaged schema. For details, see [CREATE SCHEMA privileges](create-schema.md).
>
> Tables:
> :   SELECT
>
> Alerts, Pipes, Streams, Tasks:
> :   OWNERSHIP
>
> Other objects:
> :   USAGE
>
> In addition, to clone a schema or an object within a schema, your current role must have required privileges on the container object(s)
> for both the source and the clone.
>
> For information about privilege inheritance for cloned objects, see [Cloning considerations](../../user-guide/object-clone.md).

## General usage notes

* A clone is writable and is independent of its source. Changes made to the source or clone aren’t reflected in the other object.
* Parameters that are explicitly set on a source database, schema, or table are retained in any clones created from the source container or
  child objects.
* For database roles:

  + A database role is cloned when you run the CREATE DATABASE … CLONE command to clone a database. However, if you clone other database
    objects, such as a schema or table, database roles in the database are not cloned with the schema or table.
  + If the database role is already cloned to the target database, the command fails. If this occurs, drop the database role from the
    target database and try the CLONE command again.
* For databases and schemas, cloning is recursive:

  + Cloning a database clones all the schemas and other objects in the database.
  + Cloning a schema clones all the contained objects in the schema.
  + Cloning includes only the objects on which the role that creates the clone has appropriate privileges.

  However, the following object types are not cloned:

  + External tables
  + Hybrid tables can be cloned for databases but not for schemas.
  + User tasks in a database or schema are not cloned when using CREATE SCHEMA … TIMESTAMP. In the following example, tasks in the source schema (S1) are not cloned to the schema with a timestamp (S2) but are cloned to the schema without a timestamp (S3).

    ```sqlexample
    CREATE SCHEMA S1;
    USE SCHEMA S1;
    CREATE TASK T1 AS SELECT 1;
    CREATE SCHEMA S2 CLONE S1 AT(TIMESTAMP => '2025-04-01 12:00:00');
      -- T1 is not cloned into S2
    CREATE SCHEMA S3 CLONE S1;
      -- T1 is cloned into S3
    ```
* For databases, schemas, and tables, a clone does not contribute to the overall data storage for the object until operations are
  performed on the clone that modify existing data or add new data, such as:

  + Adding, deleting, or modifying rows in a cloned table.
  + Creating a new, populated table in a cloned schema.
* Cloning a table replicates the structure, data, and certain other properties (for example, `STAGE FILE FORMAT`) of the source table.

  However:

  + A cloned table does not include the load history of the source table. One consequence of this is that data files that
    were loaded into a source table can be loaded again into its clones.
  + Although a cloned table replicates the source table’s clustering keys, the new table starts with Automatic Clustering
    suspended – even if Automatic Clustering is not suspended for the source table.
  + [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md) aren’t automatically applied to cloned tables.
    If the source table has a storage lifecycle policy
    attached, you must manually attach the policy to the clone by using the [ALTER TABLE](alter-table.md) command.
* The COPY GRANTS parameter affects a new table clone as follows:

  + If the COPY GRANTS parameter is used, then the new object inherits any explicit access privileges granted on the original table but does
    not inherit any future grants defined for the object type in the schema.
  + If the COPY GRANTS parameter is not used, then the new object clone does not inherit any explicit access privileges granted on
    the original table but does inherit any future grants defined for the object type in the schema (using the
    [GRANT <privileges> … TO ROLE](grant-privilege.md) … ON FUTURE syntax).
  > **Note:**
  >
  > If the statement is replacing an existing table of the same name, then the grants are copied from the table
  > being replaced. If there is no existing table of that name, then the grants are copied from the source table
  > being cloned.
* For [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md), cloning is currently supported for Snowflake-managed tables only. For more information, see
  [Cloning and Apache Iceberg™ tables](../../user-guide/object-clone.md).

* For named internal stages:

  + Cloning is supported only at the database or schema level.
  + For stages with a directory table enabled, Snowflake uses the directory table as the source of truth for files on the stage.
    We recommend refreshing the directory table before cloning.

    The cloned stage contains copies of any undeleted files registered in the source directory table at the time of cloning.
    If a file has been updated, but the directory table isn’t refreshed, the updated file isn’t copied.
    After cloning, the source
    stage and the clone aren’t linked. Changes to files on the source stage don’t affect the files on the cloned stage (and the other way around).
  + For stages without a directory table enabled, Snowflake creates empty clones (doesn’t make copies of files on the source stage).
  + Snowflake makes clones of internal stages in their current state, regardless of whether your CREATE CLONE statement uses
    Time Travel (AT | BEFORE). If you specify a point in time before a stage was created, the stage won’t be cloned.
  + Cloning for internal stages relies on the [COPY FILES](copy-files.md) service, which incurs compute and file transfer charges.
    To monitor credit usage and bytes copied, you can query the [COPY_FILES_HISTORY view](../account-usage/copy_files_history.md) view.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Additional rules that apply to cloning objects

Metadata:
:   An object clone inherits the name and structure of the source object current at the time the CREATE *<object>* CLONE
    statement is executed or at a specified time/point in the past using [Time Travel](../../user-guide/data-time-travel.md). An object clone
    inherits any other metadata, such as comments or table clustering keys, that is current in the source object at the time the statement
    is executed, regardless of whether Time Travel is used.

Child objects:
:   A database or schema clone includes all child objects active at the time the statement is executed or at the specified time/point
    in the past. A snapshot of the table data represents the state of the source data when the statement is executed or at the specified time/point
    in the past. Child objects inherit the name and structure of the source child objects at the time the statement is executed.

    Not cloned:
    :   Cloning a database or schema does not clone external tables in the database or schema.

        Hybrid tables can be cloned for databases but not for schemas.

    Pipes:
    :   A database or schema clone includes only pipe objects that reference external (Amazon S3, Google Cloud Storage, or Microsoft Azure)
        stages; internal (Snowflake) pipes are not cloned.

        The default state of a pipe clone is as follows:

        * When `AUTO_INGEST = FALSE`, a cloned pipe is paused by default.
        * When `AUTO_INGEST = TRUE`, a cloned pipe is set to the `STOPPED_CLONED` state. In this state, pipes do not accumulate event
          notifications as a result of newly staged files. When a pipe is explicitly resumed, it only processes data files triggered as a
          result of new event notifications.

        A pipe clone in either state can be resumed by executing an [ALTER PIPE](alter-pipe.md) … SET PIPE_EXECUTION_PAUSED = false statement.

    Tags:
    :   Cloning a database or schema affects [tags](../../user-guide/object-tagging/introduction.md) in that database or schema as follows:

        * Tag associations in the source object (e.g. table) are maintained in the cloned objects.
        * For a database or a schema:

          When a database or schema is cloned, tags that reside in that schema or database are also cloned.

          If a table or view exists in the source schema/database and has references to tags in the same schema or database, the cloned table or view is mapped to the corresponding cloned tag (in the target schema/database) instead of the tag in the source schema or database.

    Java UDF:
    :   A Java UDF can be cloned when the database or schema containing the Java UDF is cloned. To be cloned, the Java UDF must meet certain
        conditions. For more information, see [Limitations on cloning](../../developer-guide/udf/java/udf-java-limitations.md).

    Data metric functions:
    :   Cloning does not result in DMF assignments on the target object. If you clone a database or schema that contains DMFs, the DMFs are
        cloned to the target database or schema.

Table data:
:   When cloning a database, schema, or table, a snapshot of the data in each table is taken and made available to the clone. The snapshot
    represents the state of the source data either at the time the statement is executed or at the specified time/point in the past (using
    [Time Travel](../../user-guide/data-time-travel.md)).

Object references:
:   Objects such as views, streams, and tasks include object references in their definition. For example:

    * A view contains a stored query that includes table references.
    * A stream points to a source table.
    * A task or alert calls a stored procedure or executes a SQL statement that references other objects.

    When one of these objects is cloned, either in a cloned database or schema or as an individual object, for those object types that support
    cloning, the clone inherits references to other objects from the definition of the source object. For example, a clone of a view inherits
    the stored query from the source view, including the table references in the query.

    Pay close attention to whether any object names in the definition of a source object are fully or partially qualified. A fully-qualified
    name includes the database and schema names. Any clone of the source object includes these parts in its own definition.

    For example:

    ```sqlexample
    -- Create a schema to serve as the source for a cloned schema.
    CREATE SCHEMA source;

    -- Create a table.
    CREATE TABLE mytable (col1 string, col2 string);

    -- Create a view that references the table with a fully-qualified name.
    CREATE VIEW myview AS SELECT col1 FROM source.mytable;

    -- Retrieve the DDL for the source schema.
    SELECT GET_DDL ('schema', 'source', true);
    ```

    ```output
    +--------------------------------------------------------------------------+
    | GET_DDL('SCHEMA', 'SOURCE', TRUE)                                        |
    |--------------------------------------------------------------------------|
    | create or replace schema MPETERS_DB.SOURCE;                              |
    |                                                                          |
    | create or replace TABLE MPETERS_DB.SOURCE.MYTABLE (                      |
    |   COL1 VARCHAR(16777216),                                                |
    |   COL2 VARCHAR(16777216)                                                 |
    | );                                                                       |
    |                                                                          |
    | create view MPETERS_DB.SOURCE.MYVIEW as select col1 from SOURCE.MYTABLE; |
    |                                                                          |
    +--------------------------------------------------------------------------+
    ```

    ```sqlexample
    -- Clone the source schema.
    CREATE SCHEMA source_clone CLONE source;

    -- Retrieve the DDL for the clone of the source schema.
    -- The clone of the view references the source table with the same fully-qualified name
    -- as in the view in the source schema.
    SELECT GET_DDL ('schema', 'source_clone', true);
    ```

    ```output
    +--------------------------------------------------------------------------------+
    | GET_DDL('SCHEMA', 'SOURCE_CLONE', TRUE)                                        |
    |--------------------------------------------------------------------------------|
    | create or replace schema MPETERS_DB.SOURCE_CLONE;                              |
    |                                                                                |
    | create or replace TABLE MPETERS_DB.SOURCE_CLONE.MYTABLE (                      |
    |   COL1 VARCHAR(16777216),                                                      |
    |   COL2 VARCHAR(16777216)                                                       |
    | );                                                                             |
    |                                                                                |
    | create view MPETERS_DB.SOURCE_CLONE.MYVIEW as select col1 from SOURCE.MYTABLE; |
    |                                                                                |
    +--------------------------------------------------------------------------------+
    ```

    If you intend to point a view to tables with the same names in *other* databases or schemas, we suggest creating a new view rather
    than cloning an existing view. This guidance also pertains to other objects that reference objects in their definition.

> **Note:**
>
> * Certain limitations apply to cloning operations. For example, DDL statements that affect the source object during a cloning operation
>   can alter the outcome or cause errors.
> * Cloning is not instantaneous, particularly for large objects (databases, schemas, tables), and does not lock the object being cloned.
>   As such, a clone does not reflect any DML statements applied to table data, if applicable, while the cloning operation is still running.
>
> For more information about this and other use cases that might affect your cloning operations, see [Cloning considerations](../../user-guide/object-clone.md).

## Notes for cloning with Time Travel

* The [AT | BEFORE](../constructs/at-before.md) clause clones a database, schema, or table as of a specified time in the past or based on
  a specified SQL statement:

  > + The `AT` keyword specifies that the request is inclusive of any changes made by a statement or transaction with timestamp equal
  >   to the specified parameter.
  > + The `BEFORE` keyword specifies that the request refers to a point immediately preceding the specified parameter.
* Cloning using `STATEMENT` is equivalent to using `TIMESTAMP` with a value equal to the recorded execution time of the SQL
  statement (or its enclosing transaction), as identified by the specified statement ID.
* An error is returned if:

  > + The object being cloned did not exist at the point in the past specified in the [AT | BEFORE](../constructs/at-before.md) clause.
  > + The historical data required to clone the object or any of its child objects (for example, tables in cloned schemas or database) has been
  >   purged.
  >
  >   As a workaround for child objects that have been purged from Time Travel, use the
  >   IGNORE TABLES WITH INSUFFICIENT DATA RETENTION parameter of the
  >   CREATE <object> … CLONE command. For more information, see [Child objects and data retention time](../../user-guide/object-clone.md).
* If any child object in a cloned database or schema did not exist at the point in the past specified in the
  [AT | BEFORE](../constructs/at-before.md) clause, the child object is not cloned.

If you don’t specify a point in time, the clone defaults to the state of the object as of now
(the [CURRENT_TIMESTAMP](../functions/current_timestamp.md) value).

For more information, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

### Troubleshoot cloning objects using Time Travel

The following scenarios can help you troubleshoot issues that can occur when cloning an object using Time Travel.

|  |  |
| --- | --- |
| Error | ```output 000707 (02000): Time travel data is not available for <object_type> <object_name>. The requested time is either beyond the allowed time travel period or before the object creation time. ``` |

This error can be returned for the following reasons:

|  |  |
| --- | --- |
| Cause | The time in the past specified by the AT | BEFORE clause is beyond the data retention period for the object. |
| Solution | Verify the data retention period for the object using the appropriate [SHOW <objects>](show.md) command and the `retention_time` column. Update the CREATE *<object>* … CLONE statement to use a time in the past that is within the data retention period for the object. |

|  |  |
| --- | --- |
| Cause | The cloning operation for a database or schema fails if the historical data for any child object has moved out of Time Travel. |
| Solution | To skip child tables that no longer have historical data available in Time Travel, execute the cloning statement using the IGNORE TABLES WITH INSUFFICIENT DATA RETENTION parameter to skip these tables. |

|  |  |
| --- | --- |
| Cause | In some cases, this is caused by using a string where a timestamp is expected. |
| Solution | Cast the string to a timestamp.  ```sqlexample ... AT(TIMESTAMP => '2023-12-31 12:00:00')               -- fails ... AT(TIMESTAMP => '2023-12-31 12:00:00'::TIMESTAMP)    -- succeeds ``` |

## Examples

Clone a database and all objects within the database at its current state:

```sqlexample
CREATE DATABASE mytestdb_clone CLONE mytestdb;
```

Clone a schema and all objects within the schema at its current state:

```sqlexample
CREATE SCHEMA mytestschema_clone CLONE testschema;
```

Clone a table at its current state:

```sqlexample
CREATE TABLE orders_clone CLONE orders;
```

Clone a schema as it existed before the date and time in the specified timestamp:

```sqlexample
CREATE SCHEMA mytestschema_clone_restore CLONE testschema
  BEFORE (TIMESTAMP => TO_TIMESTAMP(40*365*86400));
```

Clone a table as it existed exactly at the date and time of the specified timestamp:

```sqlexample
CREATE TABLE orders_clone_restore CLONE orders
  AT (TIMESTAMP => TO_TIMESTAMP_TZ('04/05/2013 01:02:03', 'mm/dd/yyyy hh24:mi:ss'));
```

Clone a table as it existed immediately before the execution of the specified statement. Replace the query ID for the STATEMENT
parameter in the example and execute the following CREATE TABLE statement:

```sqlexample
CREATE TABLE orders_clone_restore CLONE orders BEFORE (STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726');
```

Clone a database and all its objects as they existed four days ago and skip any tables that have a data retention period of
less than four days:

```sqlexample
CREATE DATABASE restored_db CLONE my_db
  AT (TIMESTAMP => DATEADD(days, -4, current_timestamp)::timestamp_tz)
  IGNORE TABLES WITH INSUFFICIENT DATA RETENTION;
```

Clone a schema that contains a mixture of standard tables and hybrid tables:

```sqlexample
CREATE OR REPLACE SCHEMA clone_ht_schema CLONE ht_schema
  IGNORE HYBRID TABLES;
```

The new schema will only contain the standard tables from the original schema. If IGNORE HYBRID TABLES is not specified
in this example, the command fails with an error because schemas that contain hybrid tables can’t be cloned.

---
title: CREATE ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/create-account.md
section: SQL Commands
---

# CREATE ACCOUNT

Creates a new account in your organization.

See also:
:   [DROP ACCOUNT](drop-account.md), [SHOW ACCOUNTS](show-accounts.md), [UNDROP ACCOUNT](undrop-account.md)

## Syntax

```sqlsyntax
CREATE ACCOUNT <name>
      ADMIN_NAME = '<string_literal>'
    { ADMIN_PASSWORD = '<string_literal>' | ADMIN_RSA_PUBLIC_KEY = '<string_literal>' }
    [ ADMIN_USER_TYPE = { PERSON | SERVICE | LEGACY_SERVICE | NULL } ]
    [ FIRST_NAME = '<string_literal>' ]
    [ LAST_NAME = '<string_literal>' ]
      EMAIL = '<string_literal>'
    [ MUST_CHANGE_PASSWORD = { TRUE | FALSE } ]
      EDITION = { STANDARD | ENTERPRISE | BUSINESS_CRITICAL }
    [ REGION_GROUP = <region_group_id> ]
    [ REGION = <snowflake_region_id> ]
    [ COMMENT = '<string_literal>' ]
    [ POLARIS = { TRUE | FALSE } ]
```

## Required parameters

`name`
:   Specifies the `account_name` substring in an [account identifier](../../user-guide/admin-account-identifier.md).

    This name should conform with all the [requirements for account identifiers](../../user-guide/admin-account-identifier.md).

`ADMIN_NAME = 'string_literal'`
:   Login name of the initial administrative user of the account. A new user is created in the new account with this name and password and
    granted the ACCOUNTADMIN role in the account.

    A login name can be any string consisting of letters, numbers, and underscores. Login names are always case-insensitive.

`ADMIN_PASSWORD = 'string_literal'`
:   Password for the initial administrative user of the account. The password for the user must be enclosed in single or double quotes.

    Optional if the `ADMIN_RSA_PUBLIC_KEY` parameter is specified.

    For more information about passwords in Snowflake, see [Snowflake-provided password policy](../../user-guide/password-authentication.md).

`ADMIN_RSA_PUBLIC_KEY = 'string_literal'`
:   Assigns a public key to the initial administrative user of the account in order to implement
    [key pair authentication](../../user-guide/key-pair-auth.md) for the user.

    Optional if the `ADMIN_PASSWORD` parameter is specified.

`EMAIL = 'string_literal'`
:   Email address of the initial administrative user of the account. This email address is used to send any notifications about the account.

`EDITION = { STANDARD | ENTERPRISE | BUSINESS_CRITICAL }`
:   [Snowflake Edition](../../user-guide/intro-editions.md) of the account.

## Optional parameters

`ADMIN_USER_TYPE = { PERSON | SERVICE | LEGACY_SERVICE | NULL }`
:   Used for setting the [type](../../user-guide/admin-user-management.md) of the first user that is assigned the ACCOUNTADMIN role during account
    creation.

    > **Note:**
    >
    > The LEGACY_SERVICE type is being deprecated. Use the SERVICE type for services and applications. For a timeline of the deprecation of
    > LEGACY_SERVICE, see [Planning for the deprecation of single-factor password sign-ins](../../user-guide/security-mfa-rollout.md).

    Default: `NULL` (Same as `PERSON`).

`FIRST_NAME = string` , . `LAST_NAME = string`
:   First and last name of the initial administrative user of the account.

    Default: `NULL`

`MUST_CHANGE_PASSWORD = { TRUE | FALSE }`
:   Specifies whether the new user created to administer the account is forced to change their password upon first login into the account.

    Default: `FALSE`

`REGION_GROUP = region_group_id`
:   ID of the region group where the account is created. To retrieve the region group ID for existing accounts in your organization, execute
    the [SHOW REGIONS](show-regions.md) command. For information about when you might need to specify region group, see
    [Region groups](../../user-guide/admin-account-identifier.md).

    Default: Current region group.

`REGION = snowflake_region_id`
:   [Snowflake Region ID](../../user-guide/admin-account-identifier.md) of the region where the account is created. If no value is provided, Snowflake
    creates the account in the same Snowflake Region as the current account (i.e. the account in which the CREATE ACCOUNT statement is
    executed.)

    To obtain a list of the regions that are available for an organization, execute the [SHOW REGIONS](show-regions.md) command.

    Default: Current Snowflake Region.

`COMMENT = 'string_literal'`
:   Specifies a comment for the account.

    Default: No value

`POLARIS = { TRUE | FALSE }`
:   Specifies whether to create a Snowflake Open Catalog account.

    Default: FALSE

## Access control requirements

Only [organization administrators](../../user-guide/organization-administrators.md) can execute this SQL command.

## Usage notes

* An account can be associated with your organization in one of the following ways:

  + Create a new account using the SQL command described in the current topic.
  + Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to link an existing account to your organization.
* By default, the maximum number of accounts in an organization cannot exceed 25. To have this limit raised, contact Snowflake Support.
* It takes about 30 seconds for the DNS changes to propagate before you can access a newly created account. If the account is not accessible
  immediately, wait for approximately 30 seconds and try again.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a new Snowflake account in the `aws_us_west_2` Snowflake Region on Amazon Web Services (AWS). The user who executes the CREATE ACCOUNT
statement can be logged into an account in the same or a different Snowflake Region:

> ```sqlexample
> create account myaccount1
>   admin_name = admin
>   admin_password = 'TestPassword1'
>   first_name = Jane
>   last_name = Smith
>   email = 'myemail@myorg.org'
>   edition = enterprise
>   region = aws_us_west_2;
> ```

Create a new Snowflake account in the same region group and Snowflake Region in which the CREATE ACCOUNT statement is executed. The new account
administrator user must change their password upon first login:

> ```sqlexample
> create account myaccount2
>   admin_name = admin
>   admin_password = 'TestPassword1'
>   email = 'myemail@myorg.org'
>   edition = enterprise;
> ```

Create a new Open Catalog account in the aws_us_west_2 Snowflake Region on Amazon Web Services (AWS):

> ```sqlexample
> create account myaccount1
>   admin_name = admin
>   admin_password = 'TestPassword1'
>   first_name = Jane
>   last_name = Smith
>   email = 'myemail@myorg.org'
>   edition = enterprise
>   region = aws_us_west_2
>   polaris = true;
> ```

---
title: CREATE AGENT
source: https://docs.snowflake.com/en/sql-reference/sql/create-agent.md
section: SQL Commands
---

# CREATE AGENT

Creates a new [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md) object with the specified attributes and specification.

See also:
:   [ALTER AGENT](alter-agent.md), [DESCRIBE AGENT](desc-agent.md), [DROP AGENT](drop-agent.md), [SHOW AGENTS](show-agents.md), [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](../functions/data_agent_run-snowflake-cortex.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] AGENT [ IF NOT EXISTS ] <name>
  [ COMMENT = '<comment>' ]
  [ PROFILE = '<profile_object>' ]
  FROM SPECIFICATION
  $$
  <specification_object>
  $$;
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the agent; must be unique for the schema in which the agent is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = comment`
:   Description of the agent.

`PROFILE = profile_object`
:   Specifies the [OBJECT](../data-types-semistructured.md) value containing agent profile information, such as display name, avatar, and color. Serialize the `profile_object` into a string as follows:

    ```none
    '{"display_name": "<display_name>", "avatar": "<avatar>", "color": "<color>"}'
    ```

    The following table describes the key-value pairs in this object:

    | Key | Type | Description |
    | --- | --- | --- |
    | `display_name` | String | Display name for the agent. |
    | `avatar` | String | Avatar image file name or identifier. |
    | `color` | String | Color theme for the agent (such as “blue”, “green”, “red”) |

`FROM SPECIFICATION $$ specification_object $$`
:   Specifies the VARCHAR value containing the settings for an agent as a YAML object. The maximum length of the specification object is 100,000 bytes.

    The YAML object should have the following structure:

    ```YAML
    models:
      orchestration: <model_name>

    orchestration:
      budget:
          seconds: <number_of_seconds>
          tokens: <number_of_tokens>

    instructions:
      response: '<response_instructions>'
      orchestration: '<orchestration_instructions>'
      system: '<system_instructions>'
      sample_questions:
          - question: '<sample_question>'
            answer: '<sample_answer>'
          ...

    tools:
      - tool_spec:
          type: '<tool_type>'
          name: '<tool_name>'
          description: '<tool_description>'
          input_schema:
              type: 'object'
              properties:
                <property_name>:
                  type: '<property_type>'
                  description: '<property_description>'
              required: <required_property_names>
      ...

    tool_resources:
      <tool_name>:
        <resource_key>: '<resource_value>'
        ...
      ...
    ```

    The following table describes the key-value pairs in this object:

    | Key | Type | Description |
    | --- | --- | --- |
    | `models` | [ModelConfig](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional model configuration for the agent. Includes the orchestration model (e.g., claude-4-sonnet). If not provided, a model is automatically selected. Currently only available for the `orchestration` step. |
    | `orchestration` | [OrchestrationConfig](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional orchestration configuration, including budget constraints (e.g., seconds, tokens). |
    | `instructions` | [AgentInstructions](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | Optional instructions for the agent’s behavior, including response, orchestration, system, and sample questions. |
    | `tools` | array of [Tool](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional list of tools available for the agent to use. Each tool includes a `tool_spec` with type, name, description, and input schema. Tools may have a corresponding configuration in `tool_resources`. |
    | `tool_resources` | map of [ToolResource](../../user-guide/snowflake-cortex/cortex-agents-rest-api.md) | An optional configuration for each tool referenced in the tools array. Keys must match the name of the respective tool. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE AGENT | Schema | Required to create the Cortex Agent. |
| USAGE | Cortex Search service | Required to run the Cortex Search services in the Cortex Agents request. |
| USAGE | Database, schema, table | Required to access the objects referenced in the Cortex Agents semantic model. |

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

```sqlexample
CREATE OR REPLACE AGENT my_agent1
  COMMENT = 'agent level comment'
  PROFILE = '{"display_name": "My Business Assistant", "avatar":  "business-icon.png", "color": "blue"}'
  FROM SPECIFICATION
  $$
  models:
    orchestration: claude-4-sonnet

  orchestration:
    budget:
      seconds: 30
      tokens: 16000

  instructions:
    response: "You will respond in a friendly but concise manner"
    orchestration: "For any revenue question use Analyst; for policy use Search"
    system: "You are a friendly agent that helps with business questions"
    sample_questions:
      - question: "What was our revenue last quarter?"
        answer: "I'll analyze the revenue data using our financial database."

  tools:
    - tool_spec:
        type: "cortex_analyst_text_to_sql"
        name: "Analyst1"
        description: "Converts natural language to SQL queries for financial analysis"
    - tool_spec:
        type: "cortex_search"
        name: "Search1"
        description: "Searches company policy and documentation"

  tool_resources:
    Analyst1:
      semantic_view: "db.schema.semantic_view"
    Search1:
      name: "db.schema.service_name"
      max_results: "5"
      filter:
        "@eq":
          region: "North America"
      title_column: "<title_name>"
      id_column: "<column_name>"
  $$;
```

---
title: CREATE AGGREGATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-aggregation-policy.md
section: SQL Commands
---

# CREATE AGGREGATION POLICY

Creates a new [aggregation policy](../../user-guide/aggregation-policies.md) in the current/specified schema or replaces an existing
aggregation policy.

After creating an aggregation policy, assign the aggregation policy to a table using an [ALTER TABLE](alter-table.md) command or a view using an [ALTER VIEW](alter-view.md) command.

See also:
:   [Aggregation policy DDL reference](../../user-guide/aggregation-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] AGGREGATION POLICY [ IF NOT EXISTS ] <name>
  AS () RETURNS AGGREGATION_CONSTRAINT -> <body>
  [ COMMENT = '<string_literal>' ]
```

## Parameters

`name`
:   Identifier for the aggregation policy; must be unique for your schema.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`body`
:   SQL expression that determines the restrictions of an aggregation policy.

    To define the constraints of the aggregation policy, use the SQL expression to call one or more of the following functions:

    NO_AGGREGATION_CONSTRAINT
    :   When the policy body returns a value from this function, queries can return data from an aggregation-constrained table or view
        without restriction. For example, the body of the policy could call this function when an administrator needs to obtain unaggregated
        results from the aggregation-constrained table or view.

        Call NO_AGGREGATION_CONSTRAINT without an argument.

    AGGREGATION_CONSTRAINT
    :   When the policy body returns a value from this function, queries must aggregate data in order to return results. Use the
        MIN_GROUP_SIZE argument to specify how many records must be included in each aggregation group.

        The syntax of the AGGREGATION_CONSTRAINT function is:

        ```sqlsyntax
        AGGREGATION_CONSTRAINT ( MIN_GROUP_SIZE => <integer_expression> )
        ```

        Where:

        `MIN_GROUP_SIZE => integer_expression`
        :   Specifies how many rows or [entities](../../user-guide/aggregation-policies-entity-privacy.md) must be included in the groups returned by
            a query against the aggregation-constrained table or view.

            There is a difference between passing a `1` and a `0` as the argument to the function. Both require results to be aggregated.

            * Passing a `1` also requires that each aggregation group contain at least one record from the aggregation-constrained table. So for
              outer joins, at least one record from the aggregation-constrained table must match a record from an unprotected table.
            * Passing a `0` allows the query to return groups that consist entirely of records from another table. So for outer joins between an
              aggregation-constrained table and an unprotected table, a group could consist of records from the unprotected table that do not match
              any records in the aggregation-constrained table.

    The body of a policy cannot reference user-defined functions, tables, or views.

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the aggregation policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE AGGREGATION POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on aggregation policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* If you want to update an existing aggregation policy and need to see the current body of the policy, run the
  [DESCRIBE AGGREGATION POLICY](desc-aggregation-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create an aggregation policy that requires queries to return groups of five or more rows:

> ```sqlexample
> CREATE AGGREGATION POLICY my_policy AS ()
>   RETURNS AGGREGATION_CONSTRAINT ->
>   AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5);
> ```

Create an aggregation policy that allows a user with role `admin` to return unaggregated results while requiring all other queries
to return groups of five or more rows:

> ```sqlexample
> CREATE AGGREGATION POLICY my_policy AS ()
>   RETURNS AGGREGATION_CONSTRAINT ->
>     CASE
>       WHEN CURRENT_ROLE() = 'ADMIN'
>         THEN NO_AGGREGATION_CONSTRAINT()
>       ELSE AGGREGATION_CONSTRAINT(MIN_GROUP_SIZE => 5)
>     END;
> ```

---
title: CREATE ALERT
source: https://docs.snowflake.com/en/sql-reference/sql/create-alert.md
section: SQL Commands
---

# CREATE ALERT

Creates a new [alert](../../user-guide/alerts.md) in the current schema.

This command also supports the following variant:

* CREATE ALERT … CLONE (creates a clone of an existing alert)

See also:
:   [ALTER ALERT](alter-alert.md) , [DESCRIBE ALERT](desc-alert.md), [DROP ALERT](drop-alert.md) , [SHOW ALERTS](show-alerts.md) , [EXECUTE ALERT](execute-alert.md)

> **Important:**
>
> Newly created or cloned alerts are suspended upon creation. For information on resuming suspended alerts, see
> [Suspending and resuming an alert](../../user-guide/alerts.md).

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ALERT [ IF NOT EXISTS ] <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ SCHEDULE = '{ <num> MINUTE | USING CRON <expr> <time_zone> }' ]
  [ WAREHOUSE = <warehouse_name> ]
  [ COMMENT = '<string_literal>' ]
  IF( EXISTS(
    <condition>
  ))
  THEN
    <action>
```

## Variant syntax

**CREATE ALERT … CLONE**

Creates a new alert with the same parameter values:

> ```sqlsyntax
> CREATE [ OR REPLACE ] ALERT <name> CLONE <source_alert>
>   [ ... ]
> ```

For more details, see [CREATE <object> … CLONE](create-clone.md).

> **Note:**
>
> When you clone an alert by using CREATE ALERT <name> CLONE or by cloning a schema or database containing the alert, the new
> alert has all of the properties of the original alert except for any properties that you explicitly override.

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the alert; must be unique for the schema in which the alert is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`IF( EXISTS( condition ))`
:   The SQL statement that represents the condition for the alert. You can use the following commands:

    * [SELECT](select.md)
    * [SHOW <objects>](show.md)
    * [CALL](call.md)

    If the statement returns one or more rows, the action for the alert is executed.

`THEN action`
:   The SQL statement that should be executed if the condition returns one or more rows.

    To send a notification, you can
    [call the SYSTEM$SEND_EMAIL or SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure](../../user-guide/notifications/about-notifications.md).

## Optional parameters

`WAREHOUSE = warehouse_name`
:   Specifies the [virtual warehouse](../../user-guide/warehouses.md) that provides compute resources for executing this alert.

    > **Note:**
    >
    > For [serverless alerts](../../user-guide/alerts.md), do not set this property.

`SCHEDULE ...`
:   Specifies the schedule for periodically evaluating the condition for the alert on a schedule.

    When you create an alert, omitting this parameter or setting it to NULL creates an
    [alert on new data](../../user-guide/alerts.md).

    For alerts on a schedule, you can specify the schedule in one of the following ways:

    * `USING CRON expr time_zone`

      Specifies a cron expression and time zone for periodically evaluating the condition for the alert. Supports a subset of
      standard cron utility syntax.

      The cron expression consists of the following fields:

      ```bash
      # __________ minute (0-59)
      # | ________ hour (0-23)
      # | | ______ day of month (1-31, or L)
      # | | | ____ month (1-12, JAN-DEC)
      # | | | | _ day of week (0-6, SUN-SAT, or L)
      # | | | | |
      # | | | | |
        * * * * *
      ```

      The following special characters are supported:

      | Special Character | Description |
      | --- | --- |
      | `*` | Wildcard. When specified for a given field, the alert runs at every unit of time for that field.  For example, `*` in the month field specifies that the alert runs every month. |
      | `L` | Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a given month. In the day-of-month field, it specifies the last day of the month. |
      | `/n` | Indicates the `n`th instance of a given unit of time. Each quanta of time is computed independently.  For example, if `4/3` is specified in the month field, then the evaluation of the condition is scheduled for April, July and October (i.e. every 3 months, starting with the 4th month of the year).  The same schedule is maintained in subsequent years. That is, the condition is not scheduled to be evaluated in January (3 months after the October run). |

      > **Note:**
      > + The cron expression currently evaluates against the specified time zone only. Altering the
      >   [TIMEZONE](../parameters.md) parameter value for the account (or setting the value at the user or session level) does not
      >   change the time zone for the alert.
      > + The cron expression defines all valid times for the evaluation of the condition for the alert. Snowflake attempts
      >   to evaluate the condition based on this schedule; however, any valid run time is skipped if a previous run has not
      >   completed before the next valid run time starts.
      > + When both a specific day of month and day of week are included in the cron expression, then the evaluation of the
      >   condition is scheduled on days satisfying either the day of month or day of week. For example,
      >   `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'` schedules an evaluation at 0AM on any 10th to 20th day of the month
      >   and also on any Tuesday or Thursday outside of those dates.
    * `num MINUTE`

      Specifies an interval (in minutes) of wait time inserted between evaluations of the alert. Accepts positive integers only.

      Also supports `num M` syntax.

      To avoid ambiguity, a *base interval time* is set when the alert is resumed (using
      [ALTER ALERT … RESUME](alter-alert.md)).

      The base interval time starts the interval counter from the current clock time. For example, if an alert is created with
      `10 MINUTE` and the alert is resumed at 9:03 AM, then the condition for the alert is evaluated at 9:13 AM, 9:23 AM, and so
      on. Note that we make a best effort to ensure absolute precision, but only guarantee that conditions are not evaluated
      before their set interval occurs (e.g. in the current example, the condition could be evaluated first at 9:14 AM but
      definitely not at 9:12 AM).

      > **Note:**
      >
      > The maximum supported value is `11520` (8 days). Alerts that have a greater `num MINUTE` value never have their
      > conditions evaluated.

`COMMENT = 'string_literal'`
:   Specifies a comment for the alert.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| EXECUTE MANAGED ALERT | Account | Required only for [serverless alerts](../../user-guide/alerts.md). |
| EXECUTE ALERT | Account |  |
| CREATE ALERT | Schema |  |
| USAGE | Warehouse | Required only for [alerts that specify a warehouse to use](../../user-guide/alerts.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Alerts are executed using the privileges granted to the alert owner (i.e. the role that has the OWNERSHIP privilege on the
  alert). For the list of minimum required privileges to execute alerts, see [Granting the privileges to create alerts](../../user-guide/alerts.md).

  To verify that the alert owner role has the required privileges to execute SQL statements for the condition and action, we
  recommend that you execute these statements using the alert owner role before specifying them in CREATE ALERT.
* When you create an alert, the alert is suspended by default.

  To make the alert active, you must execute [ALTER ALERT … RESUME](alter-alert.md).

* When you execute CREATE ALERT or ALTER ALERT, some validation checks are not performed on the statements in the condition and
  action, including:

  + The resolution of the identifiers for objects.
  + The resolution of the data types of expressions.
  + The verification of the number and types of arguments in a function call.

  The CREATE ALERT and ALTER ALERT commands do not fail if the SQL statement for a condition or action specifies an invalid
  identifier, incorrect data type, incorrect number and types of function arguments, etc. Instead, the failure occurs when the
  alert executes.

  To check for failures in an existing alert, use the [ALERT_HISTORY](../functions/alert_history.md) table function.

  To avoid these types of failures, before you specify the conditions and actions for alerts, verify the SQL expressions and
  statements for those conditions and actions.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

See [Creating an alert](../../user-guide/alerts.md).

---
title: CREATE API INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-api-integration.md
section: SQL Commands
---

# CREATE API INTEGRATION

Creates a new API integration object in the account or replaces an existing API integration.

An API integration object stores information about a service reached via HTTPS API, including information about some of the following:

> * A cloud platform provider (such as Amazon AWS).
> * A Git repository API.
> * The type of service (such as when a cloud platform provider offers more than one type of proxy service).
> * The identifier and access credentials for the external service that has sufficient privileges to use the
>   service. For example, on AWS, the role’s ARN (Amazon resource name) serves as the identifier and access
>   credentials.
>
>   When this user is granted appropriate privileges, Snowflake can use this user to access resources. For example, this might be an instance
>   of the cloud platform’s native HTTPS proxy service, for example, an instance of an Amazon API Gateway.
> * An API integration object also specifies allowed (and optionally blocked) endpoints and resources on those services.

See also:
:   [ALTER API INTEGRATION](alter-api-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW INTEGRATIONS](show-integrations.md) , [Writing external functions](../external-functions.md) ,
    [CREATE EXTERNAL FUNCTION](create-external-function.md)

## Syntax

The syntax is different for each external API.

### For Amazon API Gateway

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = { aws_api_gateway | aws_private_api_gateway | aws_gov_api_gateway | aws_gov_private_api_gateway }
  API_AWS_ROLE_ARN = '<iam_role>'
  [ API_KEY = '<api_key>' ]
  API_ALLOWED_PREFIXES = ('<...>')
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

Note that `aws_api_gateway` or `aws_private_api_gateway` or `aws_gov_api_gateway` or `aws_gov_private_api_gateway` should
not be in quotation marks.

### For Azure API Management

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = azure_api_management
  AZURE_TENANT_ID = '<tenant_id>'
  AZURE_AD_APPLICATION_ID = '<azure_application_id>'
  [ API_KEY = '<api_key>' ]
  API_ALLOWED_PREFIXES = ( '<...>' )
  [ API_BLOCKED_PREFIXES = ( '<...>' ) ]
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

Note that `azure_api_management` should not be in quotation marks.

### For Google Cloud API Gateway

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = google_api_gateway
  GOOGLE_AUDIENCE = '<google_audience_claim>'
  API_ALLOWED_PREFIXES = ( '<...>' )
  [ API_BLOCKED_PREFIXES = ( '<...>' ) ]
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

Note that `google_api_gateway` should not be in quotation marks.

### For Git repository

When integrating with a Git repository, you can use a personal access token or OAuth.

[Preview Feature](../../release-notes/preview-features.md) — Open

OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).

OAuth support is in preview for repository providers other than github.com.

TokenGitHub appOAuth2 parametersPrivate Link

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = git_https_api
  API_ALLOWED_PREFIXES = ('<...>')
  [ API_BLOCKED_PREFIXES = ('<...>') ]
  [ ALLOWED_AUTHENTICATION_SECRETS = ( { <secret_name> [, <secret_name>, ... ] } ) | all | none ]
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = git_https_api
  API_ALLOWED_PREFIXES = ('https://github.com/<...>')
  [ API_BLOCKED_PREFIXES = ('<...>') ]
  API_USER_AUTHENTICATION = (
    TYPE = SNOWFLAKE_GITHUB_APP
  )
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = git_https_api
  API_ALLOWED_PREFIXES = ('https://example.com/<...>')
  [ API_BLOCKED_PREFIXES = ('<...>') ]
  API_USER_AUTHENTICATION = (
    TYPE = OAUTH2
    {oauth_parameters}
  )
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

```sqlsyntax
CREATE [ OR REPLACE ] API INTEGRATION [ IF NOT EXISTS ] <integration_name>
  API_PROVIDER = git_https_api
  API_ALLOWED_PREFIXES = ('<...>')
  [ API_BLOCKED_PREFIXES = ('<...>') ]
  [ ALLOWED_AUTHENTICATION_SECRETS = ( { <secret_name> [, <secret_name>, ... ] } ) | all | none ]
  USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }
  [ TLS_TRUSTED_CERTIFICATES = ( { <secret_name> [, <secret_name>, ... ] } ) ]
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
  ;
```

Note that `git_https_api` should not be in quotation marks.

You can combine Private Link routing with OAuth2 user authentication.

## Required parameters

### For Amazon API Gateway

`integration_name`
:   Specifies the name of the API integration. This name follows the rules for [Object identifiers](../identifiers.md).
    The name must be unique among API integrations in your account.

`API_PROVIDER = { aws_api_gateway | aws_private_api_gateway | aws_gov_api_gateway | aws_gov_private_api_gateway }`
:   Specifies the HTTPS proxy service type. Valid values are:

    > * `aws_api_gateway`: for Amazon API Gateway using regional endpoints.
    > * `aws_private_api_gateway`: for Amazon API Gateway using private endpoints.
    > * `aws_gov_api_gateway`: for Amazon API Gateway using U.S. government GovCloud endpoints.
    > * `aws_gov_private_api_gateway`: for Amazon API Gateway using U.S. government GovCloud endpoints that are also private
    >   endpoints.

`API_AWS_ROLE_ARN = iam_role`
:   > For Amazon AWS, this is the ARN (Amazon resource name) of a cloud platform role.

`API_ALLOWED_PREFIXES = (...)`
:   Explicitly limits external functions that use the integration to reference one or more HTTPS proxy
    service endpoints (such as Amazon API Gateway) and resources within those proxies. Supports a comma-separated
    list of URLs, which are treated as prefixes (for details, see below).

    Each URL in `API_ALLOWED_PREFIXES = (...)` is treated as a prefix. For example, if you specify:

    `https://xyz.amazonaws.com/production/`

    that means all resources under

    `https://xyz.amazonaws.com/production/`

    are allowed. For example the following is allowed:

    `https://xyz.amazonaws.com/production/ml1`

    To maximize security, you should restrict allowed locations as narrowly as practical.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this API integration is enabled or disabled. If the API integration is disabled, any external function that relies
    on it will not work.

    The value is case-insensitive.

    The default is `TRUE`.

### For Azure API Management Service

`integration_name`
:   Specifies the name of the API integration. This name follows the rules for [Object identifiers](../identifiers.md).
    The name should be unique among API integrations in your account.

`API_PROVIDER = azure_api_management`
:   Specifies that this integration is used with Azure API Management services. Do not use quotation marks around `azure_api_management`.

`AZURE_TENANT_ID = tenant_id`
:   Specifies the ID for your Office 365 tenant that all Azure API Management instances belong to. An API integration
    can authenticate to only one tenant, and so the allowed and blocked locations must refer to API Management
    instances that all belong to this tenant.

    To find your tenant ID, sign in to the Azure portal and select Azure Active Directory » Properties.
    The tenant ID is displayed in the Tenant ID field.

`AZURE_AD_APPLICATION_ID = azure_application_id`
:   The “Application (client) id” of the Azure AD (Active Directory) app for your remote service.
    If you followed the instructions in [Creating external functions on Microsoft Azure](../external-functions-creating-azure.md),
    then this is the `Azure Function App AD Application ID` that you recorded in the worksheet in those instructions.

`API_ALLOWED_PREFIXES = (...)`
:   Explicitly limits external functions that use the integration to reference one or more HTTPS proxy
    service endpoints (such as Azure API Management services) and resources within those proxies. Supports a comma-separated
    list of URLs, which are treated as prefixes (for details, see below).

    Each URL in `API_ALLOWED_PREFIXES = (...)` is treated as a prefix. For example, if you specify:

    `https://my-external-function-demo.azure-api.net/my-function-app-name`

    that means all resources under

    `https://my-external-function-demo.azure-api.net/my-function-app-name`

    are allowed. For example the following is allowed:

    `https://my-external-function-demo.azure-api.net/my-function-app-name/my-http-trigger-function`

    To maximize security, you should restrict allowed locations as narrowly as practical.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this API integration is enabled or disabled. If the API integration is disabled, any external function that relies
    on it will not work.

    The value is case-insensitive.

    The default is `TRUE`.

### For Google Cloud API Gateway

`integration_name`
:   Specifies the name of the API integration. This name follows the rules for [Object identifiers](../identifiers.md).
    The name should be unique among API integrations in your account.

`API_PROVIDER = google_api_gateway`
:   Specifies that this integration is used with Google Cloud. The only valid value for this purpose is `google_api_gateway`.
    The value must not be in quotation marks.

`GOOGLE_AUDIENCE = google_audience`
:   This is used as the audience claim when generating the JWT (JSON Web Token) to authenticate to the Google API Gateway.
    For more information about authenticating with Google, please see the Google service account
    [authentication documentation.](https://cloud.google.com/api-gateway/docs/authenticate-service-account#configure_auth)

`API_ALLOWED_PREFIXES = (...)`
:   Explicitly limits external functions that use the integration to reference one or more HTTPS proxy
    service endpoints (such as Google Cloud API Gateways) and resources within those proxies. Supports a comma-separated
    list of URLs, which are treated as prefixes (for details, see below).

    Each URL in `API_ALLOWED_PREFIXES = (...)` is treated as a prefix. For example, if you specify:

    `https://my-external-function-demo.uc.gateway.dev/x`

    that means all resources under

    `https://my-external-function-demo.uc.gateway.dev/x`

    are allowed. For example the following is allowed:

    `https://my-external-function-demo.uc.gateway.dev/x/y`

    To maximize security, you should restrict allowed locations as narrowly as practical.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this API integration is enabled or disabled. If the API integration is disabled, any external function that relies
    on it will not work.

    The value is case-insensitive.

    The default is `TRUE`.

### For Git repository

[Preview Feature](../../release-notes/preview-features.md) — Open

OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).

OAuth support is in preview for repository providers other than github.com.

For an example, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

`integration_name`
:   Specifies the name of the API integration. This name follows the rules for [Object identifiers](../identifiers.md).
    The name must be unique among API integrations in your account.

`API_PROVIDER = git_https_api`
:   Specifies that this integration is used with [CREATE GIT REPOSITORY](create-git-repository.md) to create an
    [integration with a remote Git repository](../../developer-guide/git/git-overview.md). The only valid value for this purpose is
    `git_https_api`. The value must not be in quotation marks.

`API_ALLOWED_PREFIXES = (...)`
:   Explicitly limits requests that use the integration to reference one or more HTTPS endpoints and resources beneath those
    endpoints. Supports a comma-separated list of URLs, which are treated as prefixes.

    In most cases, Snowflake supports any HTTPS Git repository URL. For example, you can specify a custom URL to a corporate Git server
    within your own domain.

    `https://example.com/my-repo`

    Each URL in `API_ALLOWED_PREFIXES = (...)` is treated as a prefix. For example, you can specify the following:

    `https://example.com/my-account`

    With this prefix, all resources under that URL are allowed. For example, the following is allowed:

    `https://example.com/my-account/myproject`

    To maximize security, you should restrict allowed locations as narrowly as practical.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this API integration is enabled or disabled. If the API integration is disabled, the Git repository will not be accessible.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

### For all integrations

`API_KEY = api_key`
:   The [API key](../external-functions-security.md) (also called a “subscription key”).

`API_BLOCKED_PREFIXES = (...)`
:   Lists the endpoints and resources in the HTTPS proxy service that are not allowed to be called from Snowflake.

    The possible values for locations follow the same rules as for API_ALLOWED_PREFIXES above.

    API_BLOCKED_PREFIXES takes precedence over API_ALLOWED_PREFIXES. If a prefix matches both, then it is blocked.
    In other words, Snowflake allows all values that match API_ALLOWED_PREFIXES except values that also
    match API_BLOCKED_PREFIXES.

    If a value is outside API_ALLOWED_PREFIXES, you do not need to explicitly block it.

`COMMENT = 'string_literal'`
:   A description of the integration.

### For Git repository

> [Preview Feature](../../release-notes/preview-features.md) — Open
>
> OAuth support is generally available only when the repository is hosted at [github.com](https://github.com/).
>
> OAuth support is in preview for repository providers other than github.com.

In addition to parameters for all integrations, use the following parameters when you’re using the integration
to connect to a remote Git repository by setting the integration’s API_PROVIDER parameter to `git_https_api`.

`ALLOWED_AUTHENTICATION_SECRETS = ( secret_name [, secret_name ... ] | all | none )`
:   Specifies the secrets that UDF or procedure handler code can use when accessing the Git repository at the API_ALLOWED_PREFIXES value. You
    specify a secret from this list when specifying Git credentials with the [GIT_CREDENTIALS parameter](create-git-repository.md).

    This parameter’s value must be one of the following:

    * One or more fully-qualified Snowflake secret names to allow any of the listed secrets.
    * (Default) `all` to allow any secret.
    * `none` to allow no secrets.

    The ALLOWED_API_AUTHENTICATION_INTEGRATIONS parameter can also specify allowed secrets. For more information, see
    [Usage notes](create-external-access-integration.md).

    For reference information about secrets, refer to [CREATE SECRET](create-secret.md).

`API_USER_AUTHENTICATION = ( TYPE = snowflake_github_app | TYPE = OAUTH2 oauth_parameters )`
:   Specifies security integration settings for an OAuth 2.0 flow.

    How you set this parameter differs depending on the repository provider. For more information, see [Configure for authenticating with OAuth](../../developer-guide/git/git-setting-up.md).

    * `TYPE = snowflake_github_app`: Authenticate with GitHub using the Snowflake GitHub App, as described in
      [Configure for authenticating with OAuth](../../developer-guide/git/git-setting-up.md). No other values are required for API_USER_AUTHENTICATION in this case.
    * `TYPE = OAUTH2`: Authenticate using OAuth2 parameters, as described in [Configure for authenticating with OAuth](../../developer-guide/git/git-setting-up.md).

      When you specify this value, you must also specify the parameters, as required, under `oauth_parameters` (next).
    * `oauth_parameters`: Authenticate using the specified OAuth 2.0 parameters, including the following parameters:

      + `OAUTH_AUTHORIZATION_ENDPOINT = 'endpoint_url'`

        Specifies the URL for authenticating to the repository.
      + `OAUTH_TOKEN_ENDPOINT = 'token_endpoint_url'`

        Specifies the token endpoint used by the client to obtain an access token by presenting its authorization grant or refresh token.
        The client uses the token endpoint with every authorization grant except for the implicit grant type (because an access token is
        issued directly).
      + `OAUTH_CLIENT_ID = 'client_id'`

        Specifies the client ID for the OAuth application in the repository provider. The value for this parameter is specific to your
        organization.
      + `OAUTH_CLIENT_SECRET = 'client_secret'`

        Specifies the client secret for the OAuth application in the repository provider. The value for this parameter is specific to your
        organization.
      + `OAUTH_ACCESS_TOKEN_VALIDITY = integer`

        Specifies the default lifetime, in seconds, of the OAuth access token issued by an OAuth server.

        The value set in this property is used if the access token lifetime is not returned as part of OAuth token response. When both
        values are available, the smaller of the two values is used to refresh the access token.
      + `OAUTH_REFRESH_TOKEN_VALIDITY = integer`

        Specifies the value, in seconds, to determine the validity of the refresh token obtained from the OAuth server.
      + `OAUTH_ALLOWED_SCOPES = ( { 'read_api' | 'read_repository' | 'write_repository' } [ , ... ] )`
        Specifies the scope to use when making a request from the provider. Specify the following values:

        - `'read_api'`: Read from the repository provider’s API.
        - `'read_repository'`: Read from the repository.
        - `'write_repository'`: Write to the repository.
      + `OAUTH_USERNAME = 'string_literal'`

        Optional. The Git repository username. Set this value based on the repository provider’s requirements. For example, for Bitbucket,
        set this `x-token-auth`.

`TLS_TRUSTED_CERTIFICATES = ( {secret_name} [, {secret_name} ... ] )`
:   Specifies secrets containing self-signed certificates to be used when
    [authenticating with a Git repository server](../../developer-guide/git/git-setting-up.md) over private link. This parameter is
    needed only when the certificate is self-signed, rather than signed by a certificate authority.

    This parameter’s value must be one or more fully qualified Snowflake secret names. The secrets must be of type
    [generic string](create-secret.md) whose SECRET_STRING value is Base64-encoded certificate data.

`USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
:   Specifies whether this API integration will be used only to
    [configure access to a remote Git repository over an outbound private link connection](../../developer-guide/git/git-setting-up.md) through
    [private connectivity](../../user-guide/private-connectivity-outbound.md).

    This parameter must be set to `FALSE` (the default) for public Git servers.

    The default is `FALSE`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only Snowflake roles with OWNERSHIP or USAGE privileges on the API integration can use the API integration directly
  (for example, by creating an external function that specifies that API integration).
* An API integration object is tied to a specific cloud platform account and role within that account, but not
  to a specific HTTPS proxy URL. You can create more than one instance of an HTTPS proxy service in a cloud provider
  account, and you can use the same API integration to authenticate to multiple proxy services in that account.
* Your Snowflake account can have multiple API integration objects, for example, for different cloud platform accounts.
* Multiple external functions can use the same API integration object, and thus the same HTTPS proxy service.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Amazon API Gateway

The following example shows creation of an API integration and use of that API integration in a subsequent
CREATE EXTERNAL FUNCTION statement:

```sqlexample
CREATE OR REPLACE API INTEGRATION demonstration_external_api_integration_01
  API_PROVIDER = aws_api_gateway
  API_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/my_cloud_account_role'
  API_ALLOWED_PREFIXES = ('https://xyz.execute-api.us-west-2.amazonaws.com/production')
  ENABLED = TRUE;

CREATE OR REPLACE EXTERNAL FUNCTION local_echo(string_col VARCHAR)
  RETURNS VARIANT
  API_INTEGRATION = demonstration_external_api_integration_01
  AS 'https://xyz.execute-api.us-west-2.amazonaws.com/production/remote_echo';
```

### Git repository

For an example of an API integration used to integrate a Git repository, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

---
title: CREATE APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-application.md
section: SQL Commands
---

# CREATE APPLICATION

Creates a Snowflake Native App based on an application package or listing. Providers use this
command to install an app in their development account.

When this command runs, it runs the
[setup script](../../developer-guide/native-apps/creating-setup-script.md) to create the app.

See also:
:   [ALTER APPLICATION](alter-application.md), [DESCRIBE APPLICATION](desc-application.md), [DROP APPLICATION](drop-application.md), [SHOW APPLICATIONS](show-applications.md)

## Syntax

```sqlsyntax
CREATE APPLICATION <name> FROM APPLICATION PACKAGE <package_name>
   [ USING RELEASE CHANNEL { QA | ALPHA | DEFAULT } ]
   [ COMMENT = '<string_literal>' ]
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
   [ AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE } ]
   [ WITH FEATURE POLICY = <policy_name> ]

CREATE APPLICATION <name> FROM APPLICATION PACKAGE <package_name>
  USING <path_to_version_directory>
  [ DEBUG_MODE = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [, ...] ) ]
  [ AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE } ]
  [ WITH FEATURE POLICY = <policy_name> ]

CREATE APPLICATION <name> FROM APPLICATION PACKAGE <package_name>
  USING VERSION  <version_identifier> [ PATCH <patch_num> ]
  [ DEBUG_MODE = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
  [ AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE } ]
  [ WITH FEATURE POLICY = <policy_name> ]

CREATE APPLICATION <name> FROM LISTING <listing_name>
   [ USING RELEASE CHANNEL { QA | ALPHA | DEFAULT } ]
   [ COMMENT = '<string_literal>' ]
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
   [ BACKGROUND_INSTALL = { TRUE | FALSE } ]
   [ AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE } ]
   [ WITH FEATURE POLICY = <policy_name> ]
```

## Required parameters

`name`
:   Specifies the identifier for the app. Must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces
    or special characters unless the entire identifier string is enclosed in double quotes
    (for example, `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, refer to [Identifier requirements](../identifiers-syntax.md).

`FROM APPLICATION PACKAGE package_name`
:   Specifies the name of the application package used to create the app. To use this
    clause to create an app from an application package without specifying a stage or a
    version/patch, the application package must have a default release directive defined.

    This clause can only be used to create an app in the same account as the application
    package. This clause cannot be used to create an app in development mode.

`FROM LISTING listing_name`
:   Specifies the name of the listing that contains the application package used to create the app.

`USING RELEASE CHANNEL QA | ALPHA | DEFAULT`
:   Specifies the release channel defined in the application package or listing used to create
    the app. If you do not specify this clause, the default release channel is used.

    * `QA` specifies the quality assurance release channel.
    * `ALPHA` specifies the alpha release channel.
    * `DEFAULT` specifies the default release channel.

    This clause can be used only when creating an app from an application package that has a
    release directive defined or when creating an app from a listing.

`USING path_to_version_directory`
:   Specifies the path to the stage that contains the files required by the app.

`USING version [ PATCH patch_num ]`
:   Specifies the version, and optionally the patch, defined in the application package
    used to create the app.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the app.

    Default: No value

`DEBUG_MODE = { TRUE | FALSE }`
:   Enables or disables [debug mode](../../developer-guide/native-apps/installing-testing-application.md) for the app
    being created. Debug mode allows a provider to see the contents of the app.

    * `TRUE` enables debug mode for the installed app.
    * `FAlSE` disables debug mode for the installed app.

    > **Note:**
    >
    > You can only enable debug mode under the following conditions:
    >
    > * The app is in the same account as the application package.
    > * The app is being created based on a specific version or from files on a named stage.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`BACKGROUND_INSTALL = { TRUE | FALSE }`
:   Creates the app from a listing in the background. If you specify this clause, the
    command returns you to the prompt immediately, and the installation process continues in the
    background. To monitor that status of the installation, use the
    [DESCRIBE APPLICATION](desc-application.md) command.

    > **Note:**
    >
    > When this clause is used, the app is created even if the command fails. In this
    > situation, use the [DROP APPLICATION](drop-application.md) command to delete the object before
    > running the CREATE APPLICATION command again.

    This clause is primarily used by Snowsight to install a Snowflake Native App in the background. Background
    installation allows the consumer to navigate away from the listing in Snowsight during
    installation. A provider might use this clause when testing the installation of a Snowflake Native App
    from a listing before publishing the listing.

`AUTHORIZE_TELEMETRY_EVENT_SHARING = { TRUE | FALSE }`
:   Enables [logging and event sharing](../../developer-guide/native-apps/event-about.md) in the app.

`WITH FEATURE POLICY = policy_name`
:   Create the app with the specified feature policy. If the app attempts to create an object that the feature policy prohibits (such as a database), the command fails.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE APPLICATION | Account |  |
| DEVELOP | Application package |  |
| INSTALL | Application package |  |
| IMPORT SHARE  CREATE APPLICATION | Account | These privileges are required to create an app in an account different than the account that contains the application package. |
| APPLY FEATURE POLICY  APPLY or OWNERSHIP | Account  Feature policy | These privileges are required to apply a feature policy when creating the app using the WITH FEATURE POLICY clause. |

## Usage notes

* To create an app directly from an application package, you must specify a default release directive in the application package.
* The app differs from a database in the following ways:

  + An app may not be transient.
  + The role with the OWNERSHIP privilege on the app has the following abilities and limitations:

    - Can drop the database or modify the COMMENT property and any properties that are specific to the app.
    - Cannot see or modify the contents of the app except via the privileges granted the application roles.
    - Cannot create a database-level object, such as a schema or a database role.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

```sqlexample
CREATE APPLICATION hello_snowflake_app
  FROM APPLICATION PACKAGE hello_snowflake_package
  USING VERSION v1;
```

```output
+---------------------------------------------------------+
| status                                                  |
|---------------------------------------------------------|
| Application 'hello_snowflake_app' created successfully. |
+---------------------------------------------------------+
```

```sqlexample
CREATE APPLICATION hello_snowflake_app
  FROM APPLICATION PACKAGE hello_snowflake_package
  USING '@hello_snowflake_code.core.hello_snowflake_stage';
```

```output
+---------------------------------------------------------+
| status                                                  |
|---------------------------------------------------------|
| Application 'hello_snowflake_app' created successfully. |
+---------------------------------------------------------+
```

---
title: CREATE APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/create-application-package.md
section: SQL Commands
---

# CREATE APPLICATION PACKAGE

Creates a new application package that contains the data content and application logic of
Snowflake Native App. An application package contains the following information about an app:

* The version and patch number of the app.
* The data content that is available to the application.
* The setup script of the app.
* The manifest file of the app.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md), [DROP APPLICATION PACKAGE](drop-application-package.md), [SHOW APPLICATION PACKAGES](show-application-packages.md)

## Syntax

```sqlsyntax
CREATE APPLICATION PACKAGE [ IF NOT EXISTS ] <name>
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
  [ DISTRIBUTION = { INTERNAL | EXTERNAL } ]
  [ LISTING_AUTO_REFRESH = { TRUE | FALSE } ]
  [ MULTIPLE_INSTANCES = TRUE ]
  [ ENABLE_RELEASE_CHANNELS = TRUE ]
```

## Required parameters

`name`
:   Specifies the identifier for the application package; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the application
    package, as well as specifying the default Time Travel retention time for all schemas created in the database.

    For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see [Parameters](../parameters.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent databases

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the account level)

    > **Note:**
    >
    > A value of `0` disables Time Travel for the database.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in the application package to prevent streams on the tables from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for all schemas and tables added to the application package. The default
    can be overridden at the schema and individual table level.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the application package.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`DISTRIBUTION = { INTERNAL | EXTERNAL }`
:   Specifies the type of listing a provider can create when using the application package as the data product of a listing.

    * `INTERNAL` indicates that a provider can only create a private listing within the same organization
      where the application package was created. The automated security scan is not performed
      when the DISTRIBUTION property is set to INTERNAL.
    * `EXTERNAL` indicates that a provider can create listings outside the same organization where
      the application package was created.

    See [Run the automated security scan](../../developer-guide/native-apps/security-run-scan.md) for information on setting the DISTRIBUTION property and
    the automated security scan.

    > **Note:**
    >
    > Setting the `DISTRIBUTION` parameter to `EXTERNAL` triggers an automated security review for each
    > active version and patch defined in the application package.
    >
    > The following restrictions apply until the automated security review has a status of `APPROVED`:
    >
    > * You cannot set a release directive for a version or patch.
    > * You cannot publish a listing for the application package.

`LISTING_AUTO_REFRESH = TRUE | FALSE`
:   When set to TRUE, initiates replication to all remote regions when there is a change to the release directive of the application package. When a release directive changes, the application package does not wait for the Cross-Cloud Auto-Fulfillment schedule.

`MULTIPLE_INSTANCES = TRUE`
:   Enables the consumer to install multiple instances of an app from the application package. This property cannot be
    set for applications packages that are included in a trial or paid listing.

    When multiple instances are allowed, consumers can install a maximum of 10 instances of an app in their account.

    > **Caution:**
    >
    > After this property is set to `TRUE`, it cannot be set to `FALSE` or unset later.

`ENABLE_RELEASE_CHANNELS = TRUE | FALSE`
:   Enables [release channels](../../developer-guide/native-apps/release-channels.md) for the application package.

    > **Caution:**
    >
    > After setting this property to `TRUE`, it cannot be set to `FALSE` or unset later.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE APPLICATION PACKAGE | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to other roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To create an application package, the caller must have the CREATE APPLICATION PACKAGE privilege on the account.
* There are no restrictions on the types of objects that may reside in the application package or what roles (database or account level)
  that may own those objects.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

```sqlexample
CREATE APPLICATION PACKAGE hello_snowflake_package;
```

```output
+-----------------------------------------------------------------------+
| status                                                                |
|-----------------------------------------------------------------------|
| Application Package 'hello_snowflake_package' created successfully.   |
+-----------------------------------------------------------------------+
```

---
title: CREATE APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-application-role.md
section: SQL Commands
---

# CREATE APPLICATION ROLE

Creates a new application role or replaces an existing application role. Use application roles to
enable access control security for objects within an application object.

See [About application roles](../../developer-guide/native-apps/creating-setup-script.md) for more information.

> **Note:**
>
> Application roles are only valid within the context of an application object.

When creating an application role, you can grant privileges on objects to the application role.
Within the setup script, you can then grant the application role to other application roles.

After installing a Snowflake Native App, consumers can grant application roles to account roles to
enable access to the app.

With application roles, you can grant privileges on other objects within the application or
objects owned by the application in the consumer account.

Application roles are implicitly granted to the application owner WITH GRANT OPTION. The
application owner may grant these roles to account level roles, providing access to the
objects that are owned by the application.

Additionally, this command supports the following variants:

* CREATE OR ALTER APPLICATION ROLE: Creates a new application role if it doesn’t exist or
  alters an existing application role.

See also:
:   [ALTER APPLICATION ROLE](alter-application-role.md), [GRANT APPLICATION ROLE](grant-application-role.md),
    [REVOKE APPLICATION ROLE](revoke-application-role.md), [SHOW APPLICATION ROLES](show-application-roles.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] APPLICATION ROLE [ IF NOT EXISTS ] <name>
  [ COMMENT = '<string_literal>' ]
```

## Variant syntax

### CREATE OR ALTER APPLICATION ROLE

Creates a new application role if it doesn’t already exist, or transforms an existing application role
into the role defined in the statement. A CREATE OR ALTER APPLICATION ROLE statement follows the syntax rules of a
CREATE APPLICATION ROLE statement and has the same limitations as an [ALTER APPLICATION ROLE](alter-application-role.md)
statement.

```sqlsyntax
CREATE OR ALTER APPLICATION ROLE <name>
  [ COMMENT = '<string_literal>' ]
```

For more information, see CREATE OR ALTER APPLICATION ROLE usage notes.

## Required parameters

`name`
:   Specifies the identifier for the application role. This value must be unique within the application object
    in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified, in the form of `application_name.application_role_name`, the command creates the
    application role in the current application for the session.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the application role.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Application role | Required to execute a CREATE OR ALTER APPLICATION ROLE statement for an *existing* application role.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* The maximum number of application roles that can be created in an application object is 1000.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER APPLICATION ROLE usage notes

* All limitations of the [ALTER APPLICATION ROLE](alter-application-role.md) command apply.
* Setting or unsetting a tag is not supported; however, existing tags are not altered by a CREATE
  OR ALTER APPLICATION ROLE statement and remain unchanged.

## Examples

```sqlexample
CREATE APPLICATION ROLE app_role
  COMMENT = 'Application role for the Hello Snowflake application.';
```

---
title: CREATE AUTHENTICATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-authentication-policy.md
section: SQL Commands
---

# CREATE AUTHENTICATION POLICY

Creates a new [authentication policy](../../user-guide/authentication-policies.md) in the current or specified schema or replaces
an existing authentication policy. You can use authentication policies to define authentication controls and security requirements
for accounts or users.

This command supports the following variants:

* CREATE OR ALTER AUTHENTICATION POLICY: Creates an authentication policy if it doesn’t exist, or alters an existing authentication policy.

See also:
:   [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md), [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md), [DROP AUTHENTICATION POLICY](drop-authentication-policy.md), [SHOW AUTHENTICATION POLICIES](show-authentication-policies.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] AUTHENTICATION POLICY [ IF NOT EXISTS ] <name>
  [ AUTHENTICATION_METHODS = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_TYPES = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_POLICY = ( <client_type> = ( MINIMUM_VERSION = '<version>' ) [ , ... ] ) ]
  [ SECURITY_INTEGRATIONS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
  [ MFA_ENROLLMENT = { 'REQUIRED' | 'REQUIRED_PASSWORD_ONLY' } ]
  [ MFA_POLICY= ( <list_of_properties> ) ]
  [ PAT_POLICY = ( <list_of_properties> ) ]
  [ WORKLOAD_IDENTITY_POLICY = ( <list_of_properties> ) ]
  [ COMMENT = '<string_literal>' ]
```

## Variant syntax

### CREATE OR ALTER AUTHENTICATION POLICY

Creates a new authentication policy if it doesn’t already exist, or alters an existing authentication policy into the one defined in the statement.
A CREATE OR ALTER AUTHENTICATION POLICY statement follows the syntax rules of a CREATE AUTHENTICATION POLICY statement and has the same limitations as an
[ALTER AUTHENTICATION POLICY](alter-authentication-policy.md) statement.

```sqlsyntax
CREATE OR ALTER AUTHENTICATION POLICY <name>
  [ AUTHENTICATION_METHODS = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_TYPES = ( '<string_literal>' [ , '<string_literal>' , ...  ] ) ]
  [ CLIENT_POLICY = ( <client_type> = ( MINIMUM_VERSION = '<version>' ) [ , ... ] ) ]
  [ SECURITY_INTEGRATIONS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
  [ MFA_ENROLLMENT = { 'REQUIRED' | 'REQUIRED_PASSWORD_ONLY' } ]
  [ MFA_POLICY= ( <list_of_properties> ) ]
  [ PAT_POLICY = ( <list_of_properties> ) ]
  [ WORKLOAD_IDENTITY_POLICY = ( <list_of_properties> ) ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the [identifier](../identifiers.md) for the authentication policy.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`AUTHENTICATION_METHODS = ( 'string_literal' [ , 'string_literal' , ... ] )`
:   > **Caution:**
    >
    > Restricting by authentication method can have unintended consequences, such as blocking driver connections or third-party integrations.

    A list of authentication methods that are allowed during login. This parameter accepts one or more of the following values:

    `ALL`
    :   Allow all authentication methods.

    `SAML`
    :   Allows [SAML2 security integrations](../../user-guide/admin-security-fed-auth-security-integration.md). If `SAML` is
        present, an SSO login option appears. If `SAML` is not present, an SSO login option does not appear.

    `PASSWORD`
    :   Allows users to authenticate using username and password.

    `OAUTH`
    :   Allows [External OAuth](../../user-guide/oauth-ext-overview.md).

    `KEYPAIR`
    :   Allows [Key pair authentication](../../user-guide/key-pair-auth.md).

    `PROGRAMMATIC_ACCESS_TOKEN`
    :   Allows users to authenticate with a [programmatic access token](../../user-guide/programmatic-access-tokens.md).

    `WORKLOAD_IDENTITY`
    :   Allows users to authenticate through [workload identity federation](../../user-guide/workload-identity-federation.md).

    Default: `ALL`.

`CLIENT_TYPES = ( 'string_literal' [ , 'string_literal' , ... ] )`
:   A list of clients that can authenticate with Snowflake.

    If a client tries to connect, and the client is not one of the valid `CLIENT_TYPES` values listed below, then the login attempt fails.

    If you set `MFA_ENROLLMENT` to `REQUIRED`, then you must include `SNOWFLAKE_UI` in the `CLIENT_TYPES` list to allow
    users to enroll in MFA.

    If you want to exclude `SNOWFLAKE_UI` from the `CLIENT_TYPES` list, then you must set `MFA_ENROLLMENT` to
    `OPTIONAL`.

    The `CLIENT_TYPES` property of an authentication policy is a best-effort method to block user logins based on specific clients. It should not be used as the sole control to establish a security boundary. Notably, it does not restrict access to the Snowflake REST APIs.

    This parameter accepts one or more of the following values:

    `ALL`
    :   Allow all clients to authenticate.

    `SNOWFLAKE_UI`
    :   [Snowsight](../../user-guide/ui-snowsight-gs.md), the Snowflake web interface.

        > **Caution:**
        >
        > If `SNOWFLAKE_UI` is not included in the `CLIENT_TYPES` list while `MFA_ENROLLMENT` is set to `REQUIRED`, or `MFA_ENROLLMENT` is unspecified, MFA enrollment doesn’t work.

    `DRIVERS`
    :   Drivers allow access to Snowflake from applications written in
        [supported languages](../../developer-guide/drivers.md). For example, the [Go](../../developer-guide/golang/go-driver.md),
        [JDBC](../../developer-guide/jdbc/jdbc.md), [.NET](../../developer-guide/dotnet/dotnet-driver.md) drivers, and
        [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md).

        > **Caution:**
        >
        > If `DRIVERS` is not included in the `CLIENT_TYPES` list, automated ingestion may stop working.

    `SNOWFLAKE_CLI`
    :   A [command-line client](../../developer-guide/snowflake-cli/index.md) for connecting to Snowflake and for managing developer-centric workloads and SQL operations.

    `SNOWSQL`
    :   A [command-line client](../../user-guide/snowsql.md) for connecting to Snowflake.

    Default: `ALL`.

`CLIENT_POLICY = client_type = ( MINIMUM_VERSION = 'version' )`
:   Specifies a policy within the authentication policy that sets the minimum version allowed for each specified client type.

    If CLIENT_TYPES is empty, contains `ALL`, or contains `DRIVERS`, the CLIENT_POLICY parameter accepts one or more of the following driver clients (and a specific version string). For any driver client that is not specified, the policy implicitly allows any
    version of that client.

    If CLIENT_TYPES contains another value, such as `SNOWFLAKE_CLI`, and does not also contain `DRIVERS`, specifying any of the following client types results in an error. You can’t create (or alter) an authentication policy such that the CLIENT_TYPES and CLIENT_POLICY parameters aren’t compatible.

    `client_type`
    :   One or more valid client type values. This is a different set of values from those that the CLIENT_TYPES parameter accepts. Do not use single quotes for these values.

        * `JDBC_DRIVER` (Snowflake JDBC Driver)
        * `ODBC_DRIVER` (Snowflake ODBC Driver)
        * `PYTHON_DRIVER` (Snowflake Python Driver)
        * `JAVASCRIPT_DRIVER` (Snowflake Javascript Driver)
        * `C_DRIVER` (Libsnowflakeclient C Driver)
        * `GO_DRIVER` (Snowflake Go Driver)
        * `PHP_DRIVER` (Snowflake PHP PDO Driver)
        * `DOTNET_DRIVER` (Snowflake .NET Driver)
        * `SQL_API` (SQL API)
        * `SNOWPIPE_STREAMING_CLIENT_SDK` (Snowpipe Streaming Client SDK)
        * `PY_CORE` (Snowflake Python Core Driver)
        * `SPROC_PYTHON` (Snowflake Python Stored Procedure)
        * `PYTHON_SNOWPARK` (Snowflake Python Snowpark Driver)
        * `SQL_ALCHEMY` (Snowflake SQLAlchemy)
        * `SNOWPARK` (Snowpark)
        * `SNOWFLAKE_CLIENT` (Snowflake Client SDK)

    `'version'`
    :   The minimum accepted version for each specified client type: a sequence of three digits delimited by periods and enclosed by single quotation marks.
        For example: `'1.0.0'` or `'3.14.1'`. Authentication attempts with lower client versions are blocked when this policy is in effect for an account or a user.

    The CLIENT_POLICY property of an authentication policy is a best-effort method to block user logins based on specific client versions. It should not be used as the sole control to establish a security boundary.

`SECURITY_INTEGRATIONS = ( 'string_literal' [ , 'string_literal' , ... ] )`
:   A list of security integrations the authentication policy is associated with. This parameter has no effect when `SAML` or
    `OAUTH` are not in the `AUTHENTICATION_METHODS` list.

    All values in the `SECURITY_INTEGRATIONS` list must be compatible with the values in the `AUTHENTICATION_METHODS` list. For
    example, if `SECURITY_INTEGRATIONS` contains a SAML security integration, and `AUTHENTICATION_METHODS` contains
    `OAUTH`, then you cannot create the authentication policy.

    `ALL`
    :   Allow all security integrations.

    Default: `ALL`.

`MFA_ENROLLMENT = { 'REQUIRED' | 'REQUIRED_PASSWORD_ONLY' | 'OPTIONAL' }`
:   Determines whether a user must enroll in multi-factor authentication. If this value is used, then
    the `CLIENT_TYPES` parameter must include `SNOWFLAKE_UI`, because Snowsight is the only place users can
    [enroll in multi-factor authentication (MFA)](../../user-guide/ui-snowsight-profile.md).

    It’s possible for the value of the `MFA_ENROLLMENT` parameter to be `REQUIRED_SNOWFLAKE_UI_PASSWORD_ONLY`. This value is part
    of Snowflake’s gradual deprecation of single-factor passwords, and cannot be set directly. If you run a
    [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md) command and `MFA_ENROLLMENT = 'REQUIRED_SNOWFLAKE_UI_PASSWORD_ONLY`, then password
    users must enroll in MFA if they are using Snowsight.

    `REQUIRED`
    :   Human users who are using password or single-sign on (SSO) authentication must enroll in MFA.

    `REQUIRED_PASSWORD_ONLY`
    :   All human users who are using password authentication must enroll in MFA, regardless of the client they are using. Users using SSO
        authentication are not required to enroll.

    `OPTIONAL`
    :   Retained for backwards compatibility only.

    Default: `OPTIONAL`. For backwards compatibility, you can create an authentication policy without specifying an
    `MFA_ENROLLMENT` value, but the actual value that is enforced won’t be `OPTIONAL` because Snowflake is moving toward requiring
    MFA for all human users. To determine which value is being enforced for an existing authentication policy, run the
    [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md) command.

`MFA_POLICY= ( list_of_properties )`
:   Specifies the policies that affect how multi-factor authentication (MFA) is enforced. Set this to a space-delimited list of one or more
    of the following properties and values:

    `ALLOWED_METHODS = ( { 'ALL' | 'PASSKEY' | 'TOTP' | 'OTP' | 'DUO' } [ , { 'PASSKEY' | 'TOTP' | 'OTP' | 'DUO' } ... ] )`
    :   Specifies the multi-factor authentication (MFA) methods that users can use as a second factor of authentication. You can specify more than one method as a comma-delimited list.

        `ALL`
        :   Users can use a passkey, an authenticator app, or Duo as their second factor of authentication.

        `PASSKEY`
        :   Users can use a passkey as their second factor of authentication.

        `TOTP`
        :   Users can use an authenticator app as their second factor of authentication.

        `OTP`
        :   User can use a one-time passcode as their second factor of authentication. For more information, see [Setting up administrators for break glass access](../../user-guide/security-mfa.md).

        `DUO`
        :   Users can use Duo as their second factor of authentication.

        Default: `ALL`.

    `ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION = { 'ALL' | 'NONE' }`
    :   Specifies whether multi-factor authentication (MFA) is required when users authenticate with single sign-on (SSO). To require MFA, specify
        `ALL`.

        Default: `NONE`

`PAT_POLICY = ( list_of_properties )`
:   Specifies the policies for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md). Set this to a
    space-delimited list of one or more of the following properties and values:

    `DEFAULT_EXPIRY_IN_DAYS = number_of_days`
    :   Specifies the default expiration time (in days) for a programmatic access token. You can specify a value from 1 to the
        maximum expiration time (which you can specify by setting MAX_EXPIRY_IN_DAYS).

        The default expiration time is 15 days.

        For more information, see [Setting the default expiration time](../../user-guide/programmatic-access-tokens.md).

    `MAX_EXPIRY_IN_DAYS = number_of_days`
    :   Specifies the maximum number of days that can be set for the expiration time for a programmatic access token. You can specify
        a value from the default expiration time (which you can specify by setting DEFAULT_EXPIRY_IN_DAYS) to 365.

        The default maximum expiration time is 365 days.

        > **Note:**
        >
        > If there are existing programmatic access tokens with expiration times that exceed the new maximum expiration time, attempts to
        > authenticate with those tokens will fail.
        >
        > For example, suppose that you generate a programmatic access token named `my_token` with the expiration time of 7 days. If you
        > later change the maximum expiration time for all tokens to 2 days, authenticating with `my_token` will fail because the
        > expiration time of the token exceeds the new maximum expiration time.

        For more information, see [Setting the maximum expiration time](../../user-guide/programmatic-access-tokens.md).

    `NETWORK_POLICY_EVALUATION = { ENFORCED_REQUIRED | ENFORCED_NOT_REQUIRED | NOT_ENFORCED }`
    :   Specifies how network policy requirements are handled for programmatic access tokens.

        By default, a user must be subject to a [network policy](../../user-guide/network-policies.md) with one or more
        [network rules](../../user-guide/network-rules.md) to generate or use programmatic access tokens:

        * Service users (with TYPE=SERVICE) must be subject to a network policy to generate and use programmatic access tokens.
        * Human users (with TYPE=PERSON) must be subject to a network policy to use programmatic access tokens.

        To override this behavior, set this property to one of the following values:

        `ENFORCED_REQUIRED` (default behavior)
        :   The user must be subject to a network policy to generate and use programmatic access tokens.

            If the user is subject to a network policy, the network policy is enforced during authentication.

        `ENFORCED_NOT_REQUIRED`
        :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

            If the user is subject to a network policy, the network policy is enforced during authentication.

        `NOT_ENFORCED`
        :   The user does not need to be subject to a network policy to generate and use programmatic access tokens.

            If the user is subject to a network policy, the network policy is not enforced during authentication.

    `REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = { TRUE | FALSE }`
    :   If TRUE, when you generate a programmatic access token for a service user, you must restrict the use of that token to a
        specific role.

        If you set this parameter to FALSE, you can generate a programmatic access token for a service user without restricting that
        token to a specific role.

        Changing REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS from FALSE back to TRUE invalidates any programmatic access tokens for
        service users that were generated without the role restriction.

        Default value: TRUE

    The following example of the PAT_POLICY clause specifies the following policy:

    * By default, programmatic access tokens expire in 30 days.
    * Programmatic access tokens have a maximum expiration time of 365 days.
    * You can generate a programmatic access token for a user if the user is not subject to a network policy requirement. Any
      network policy that the user is subject to is still enforced.
    * When you generate a programmatic access token for a service user, you do not need to restrict to token to use a specific role.

    ```sqlexample
    PAT_POLICY=(
      DEFAULT_EXPIRY_IN_DAYS=30
      MAX_EXPIRY_IN_DAYS=365
      NETWORK_POLICY_EVALUATION = ENFORCED_NOT_REQUIRED
      REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
    );
    ```

`WORKLOAD_IDENTITY_POLICY = ( list_of_properties )`
:   Specifies the policies for [workload identity federation](../../user-guide/workload-identity-federation.md). Set this to a
    space-delimited list that contains one or more of the following properties and values:

    `ALLOWED_PROVIDERS = ( { ALL | AWS | AZURE | GCP | OIDC } [ , { AWS | AZURE | GCP | OIDC } ... ] )`
    :   Specifies the workload identity providers allowed by the authentication policy during workload identity authentication.
        If this parameter is omitted, all workload identity providers are allowed.

        `ALL`
        :   Users can authenticate with any supported and configured workload identity provider.

        `AWS`
        :   Users can authenticate with an AWS IAM role or user.

        `AZURE`
        :   Users can authenticate with an Azure Entra ID access token.

        `GCP`
        :   Users can authenticate with a Google-signed ID token.

        `OIDC`
        :   Users can authenticate with an ID token from a configured OIDC provider.

    `ALLOWED_AWS_ACCOUNTS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Specifies the list of AWS account IDs allowed by the authentication policy during workload identity authentication of type `AWS`.

        By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `AWS`, then the ARN can reference any AWS account.
        If this parameter is set, then only ARNs from the specified AWS account IDs are allowed to authenticate.

        Each element must be a 12-digit string representing the AWS account ID.

        For more information, see [View AWS account identifiers](https://docs.aws.amazon.com/accounts/latest/reference/manage-acct-identifiers.html).

    `ALLOWED_AZURE_ISSUERS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Specifies the list of Azure Entra ID issuers allowed by the authentication policy during workload identity authentication of type `AZURE`.

        By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `AZURE`, then the issuer can be any Entra ID tenant.
        If this parameter is set, then only Azure tokens from the specified issuers are allowed to authenticate.

        Each element must be a valid Authority URL with following format:

        * `https://login.microsoftonline.com/tenantId/v2.0`

    `ALLOWED_OIDC_ISSUERS = ( 'string_literal' [ , 'string_literal' , ... ] )`
    :   Specifies the list of OIDC issuers allowed by the authentication policy during workload identity authentication of type `OIDC`.

        By default, when a Snowflake service user has a `WORKLOAD_IDENTITY` of type `OIDC`, then the issuer can be any valid OIDC issuer.
        If this parameter is set, then only tokens from the specified OIDC issuers are allowed to authenticate.

        Each element must be a valid HTTPS URL that contains scheme, host, and optionally, port number and path components but no query or fragment
        components. The URL must not contain spaces, and it must not exceed 2048 characters in length.

    For example:

    ```sqlexample
    WORKLOAD_IDENTITY_POLICY=(
      ALLOWED_PROVIDERS = (AWS, AZURE, GCP, OIDC)
      ALLOWED_AWS_ACCOUNTS = ('123456789012', '210987654321')
      ALLOWED_AZURE_ISSUERS = ('https://login.microsoftonline.com/8c7832f5-de56-4d9f-ba94-3b2c361abe6b/v2.0',
        'https://login.microsoftonline.com/9ebd1ec9-9a78-4429-8f53-5cf870a812d1/v2.0')
      ALLOWED_OIDC_ISSUERS = ('https://my.custom.oidc.issuer/', 'https://another.custom/oidc/issuer')
    );
    ```

`COMMENT = 'string_literal'`
:   Specifies a description of the policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE AUTHENTICATION POLICY | Schema |  |
| OWNERSHIP | Authentication Policy | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER AUTHENTICATION POLICY statement for an *existing* authentication policy. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* After creating an authentication policy, you must use the [ALTER ACCOUNT](alter-account.md) or
  [ALTER USER](alter-user.md) command to set it on an account or user before Snowflake enforces the policy.
* If you want to update an existing authentication policy and need to see the definition of the policy, run the
  [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create an authentication policy named `restrict_client_types_policy` that only allows access through Snowsight:

```sqlexample
CREATE AUTHENTICATION POLICY restrict_client_types_policy
  CLIENT_TYPES = ('SNOWFLAKE_UI')
  COMMENT = 'Auth policy that only allows access through the web interface';
```

Set multi-factor authentication and update the list of clients:

```sqlexample
CREATE OR ALTER AUTHENTICATION POLICY restrict_client_types_policy
  MFA_ENROLLMENT = REQUIRED
  CLIENT_TYPES = ('SNOWFLAKE_UI', 'SNOWFLAKE_CLI');
```

Create an authentication policy that includes a client policy. The client policy sets the minimum version for two specific driver
clients:

```sqlexample
CREATE AUTHENTICATION POLICY two_driver_policy
  CLIENT_TYPES = ('DRIVERS')
  CLIENT_POLICY = (
    GO_DRIVER = (MINIMUM_VERSION = '1.14.1'),
    JDBC_DRIVER = (MINIMUM_VERSION = '3.25.0')
    )
  COMMENT = 'JDBC and Go Driver minimum versions';
```

The following attempt to create an authentication policy fails because the CLIENT_POLICY parameter specifies drivers
that are not permitted by the CLIENT_TYPES parameter:

```sqlexample
CREATE AUTHENTICATION POLICY go_driver_policy_test
  CLIENT_TYPES = ('SNOWFLAKE_UI', 'SNOWFLAKE_CLI')
  CLIENT_POLICY = (GO_DRIVER = (MINIMUM_VERSION = '1.14.1'));
```

```output
004800 (22023): Authentication policy can not contain CLIENT_POLICY of 'GO_DRIVER' without including 'DRIVERS' in CLIENT_TYPES.
```

For more examples, see [Authentication policies](../../user-guide/authentication-policies.md).

---
title: CREATE BACKUP POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-backup-policy.md
section: SQL Commands
---

# CREATE BACKUP POLICY

Creates a [backup](../../user-guide/backups.md) policy.
You associate the policy with one or more backup sets.
The settings in the policy define the schedule and expiration periods for each backup sets
that uses the policy.

The schedule determines how often Snowflake automatically makes a backup and adds the resulting backup
to the backup set that’s governed by the policy.
The expiration period determines how long each backup is retained before Snowflake automatically deletes it from the
associated backup set.

> **Tip:**
>
> The backup policy is optional for a backup set. If you don’t need scheduled backups, a retention lock,
> or an expiration period, you can create a backup set without a backup policy. You can also use
> ALTER BACKUP SET to apply a backup policy later to an existing backup set, or to suspend and resume
> the scheduled backups specified in the backup policy.

See also:
:   [ALTER BACKUP POLICY](alter-backup-policy.md),
    [DROP BACKUP POLICY](drop-backup-policy.md),
    [SHOW BACKUP POLICIES](show-backup-policies.md),
    [CREATE BACKUP SET](create-backup-set.md)
    [ALTER BACKUP SET](alter-backup-set.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] BACKUP POLICY [ IF NOT EXISTS ] <name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH RETENTION LOCK ]
   [ SCHEDULE = '{ <num> MINUTE | <num> HOUR | USING CRON <expr> <time_zone> }' ]
   [ EXPIRE_AFTER_DAYS = <days_integer> ]
   [ COMMENT = <string> ]
```

## Required parameters

`name`
:   Identifier for the backup policy; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`OR REPLACE`
:   If a backup policy with this name already exists, delete it and create a new one.
    This clause is mutually exclusive with `IF NOT EXISTS`.

`IF NOT EXISTS`
:   Creates the backup policy only if there isn’t a backup policy with the same name.
    If a backup policy already exists, the command returns a success message even though it has no effect.
    This clause is mutually exclusive with `OR REPLACE`.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH RETENTION LOCK`
:   Specifies the mandatory retention period for backups. Backups with retention locks
    can’t be deleted, even by a privileged user.
    For more information, see the [restrictions for a backup with a retention lock](../../user-guide/backups.md).

    > **Note:**
    >
    > Only a user with the APPLY BACKUP RETENTION LOCK privilege can create a backup policy with retention lock.

    > **Important:**
    >
    > Applying a backup policy with a retention lock to a backup set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a backup set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a backup set with a long expiration period, to avoid unexpected storage charges
    > for undeletable backup sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all backups, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`SCHEDULE = '{ num MINUTE | num HOUR | USING CRON expr time_zone }'`
:   Specifies the schedule for creating backups of an object.

    > **Note:**
    >
    > The minimum schedule for backups must be 60 minutes or 1 hour.
    >
    > Each backup policy must have one or both of the schedule and expiration period properties.
    > For more information, see [Backup policy](../../user-guide/backups.md).

    * `USING CRON expr time_zone`
      :   Specifies a cron expression and time zone for the point in time a backup of an object is created. Supports a subset of
          standard cron utility syntax.

          For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
          (in Wikipedia).

          The cron expression consists of the following fields:

          ```output
          # __________ minute (0-59)
          # | ________ hour (0-23)
          # | | ______ day of month (1-31, or L)
          # | | | ____ month (1-12, JAN-DEC)
          # | | | | __ day of week (0-6, SUN-SAT, or L)
          # | | | | |
          # | | | | |
            * * * * *
          ```

          The following special characters are supported:

          `*`
          :   Wildcard. Specifies any occurrence of the field.

          `L`
          :   Stands for “last”. When used in the day-of-week field, it lets you specify constructs such as “the last Friday” (“5L”) of a
              given month. In the day-of-month field, it specifies the last day of the month.

          `/n`
          :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
              specified in the month field, then the backup is scheduled for April, July and October (that is, every 3 months, starting with the 4th
              month of the year). The same schedule is maintained in subsequent years. That is, the backup is not scheduled to run in
              January (3 months after the October run).

          > **Note:**
          > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
          >   for the account (or setting the value at the user or session level) does not change the time zone for the backup.
          > + The cron expression defines all valid run times for the backup. Snowflake attempts to create a backup based on
          >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
          > + When both a specific day of month and day of week are included in the cron expression, then the backup is scheduled on days
          >   satisfying either the day of the month or the day of the week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
          >   schedules a backup at 0AM (midnight) on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
    * `num MINUTE` or `num MINUTES`
      :   Specifies an interval (in minutes) of wait time between backups. Accepts positive integers only.

          Also supports `num M` syntax.
    * `num HOUR` or `num HOURS`
      :   Specifies an interval (in hours) of wait time between backups. Accepts positive integers only.

          Also supports `num H` syntax.

    To avoid ambiguity, a *base interval time* is set in the following circumstances:

    * When the object is created (using CREATE BACKUP SET … WITH BACKUP POLICY).
    * When a different interval is set (using ALTER BACKUP SET … APPLY BACKUP POLICY or
      ALTER BACKUP POLICY … SET SCHEDULE).

    The base interval time starts the interval counter from the current clock time. For example, if an
    INTERVAL value of `10 MINUTES` is set and the scheduled backup is enabled at 9:03 AM, then the next backup
    is created at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to ensure absolute
    precision, but only guarantee that a backup does not execute before the set interval occurs
    (that is, in the current example, the backup could first run at 9:14 AM, but will definitely not run
    at 9:12 AM).

`EXPIRE_AFTER_DAYS = days_integer`
:   Specifies the number of days until the backup expires. Snowflake automatically deletes expired backups.
    If this parameter is not specified, backups remain in the backup set until they are manually deleted from the set.

    * Minimum value: `1`.
    * Maximum value: `3653` (roughly 10 years) if you don’t specify the `SCHEDULE` clause.

    > **Note:**
    >
    > Each backup policy must have one or both of the schedule and expiration period properties.
    > For more information, see [Backup policy](../../user-guide/backups.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the backup policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| CREATE BACKUP POLICY | The role used to create a backup policy must have this privilege on the schema in which the policy is created. |
| APPLY BACKUP RETENTION LOCK | Only a user with this privilege on the account can create a backup policy with retention lock. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* [Time Travel and Failsafe](../../user-guide/data-time-travel.md) retention do not apply to backups. A backup can’t be
  recovered after it expires.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the backup policy has a retention lock applied to it, and there are any
> unexpired backups in the backup set, then you can’t delete the backup set.
> In that case, you must wait for all the backups in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a backup policy.

## Examples

Create a backup policy that creates a backup every hour and expires after 90 days:

```sqlexample
CREATE BACKUP POLICY hourly_backup_policy
  SCHEDULE = '60 MINUTE'
  EXPIRE_AFTER_DAYS = 90
  COMMENT = 'Hourly backups that expire after 90 days';
```

Create a backup policy with a retention lock that creates a backup every 24 hours and expires after 90 days. The backups
created using this backup policy can’t be modified or deleted before the expiration period ends:

```sqlexample
CREATE BACKUP POLICY daily_backup_policy_with_lock
  WITH RETENTION LOCK
  SCHEDULE = '1440 MINUTE'
  EXPIRE_AFTER_DAYS = 90
  COMMENT = 'regulatory backups expire after 90 days with retention lock';
```

Create a backup policy using a cron expression for the schedule. The following statement creates a policy that creates backups
every Tuesday and Friday of the week at 11PM:

```sqlexample
CREATE BACKUP POLICY twice_weekly_backup_policy
  SCHEDULE = 'USING CRON 0 23 * * 2,5 UTC'
  EXPIRE_AFTER_DAYS = 7
  COMMENT = 'Twice-weekly backups that expire after 7 days';
```

---
title: CREATE BACKUP SET
source: https://docs.snowflake.com/en/sql-reference/sql/create-backup-set.md
section: SQL Commands
---

# CREATE BACKUP SET

Creates a [backup](../../user-guide/backups.md) set for a table, a schema, or a database. Once the backup set exists, you can
add a new backup (backup) to the backup set at any time by running an ALTER BACKUP SET command. Snowflake also adds backups
to the backup set automatically, if you defined a schedule in a [backup policy](../../user-guide/backups.md)
and associated that backup policy with the backup set.

Each backup set represents a set of backups for a specific table, or the objects in a
specific schema, or the objects in a specific database. That way, you can make your backups
very granular or very comprehensive. And the backups for each table, schema, or database can
have their own independent schedules.

For the kinds of objects that are included in schema backups and database backups, see
[Backup objects](../../user-guide/backups.md).

See also:
:   [ALTER BACKUP SET](alter-backup-set.md),
    [DROP BACKUP SET](drop-backup-set.md),
    [SHOW BACKUP SETS](show-backup-sets.md),
    [CREATE BACKUP POLICY](create-backup-policy.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] BACKUP SET [ IF NOT EXISTS ] <name>
   FOR [ DYNAMIC ] TABLE <table_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH BACKUP POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

```sqlsyntax
CREATE [ OR REPLACE ] BACKUP SET [ IF NOT EXISTS ] <name>
  FOR SCHEMA <schema_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH BACKUP POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

```sqlsyntax
CREATE [ OR REPLACE ] BACKUP SET [ IF NOT EXISTS ] <name>
  FOR DATABASE <database_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH BACKUP POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

## Required parameters

`name`
:   Identifier for the backup set; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FOR [ DYNAMIC ] TABLE table_name`
:   Specifies the name of the table or dynamic table. In that case, the backup set represents backups
    of a single table.

`FOR SCHEMA schema_name`
:   Specifies the name of the schema. In that case, the backup set represents backups
    of all the tables and other objects in a specific schema.

`FOR DATABASE database_name`
:   Specifies the name of the database. In that case, the backup set represents backups
    of all the tables, schemas, and other objects in a specific database.

## Optional parameters

`OR REPLACE`
:   If a backup set with this name already exists, delete it and create a new one.
    If the backup set can’t be deleted because of backup policy rules for retention locks,
    legal holds, and expiry times, the command fails.
    This clause is mutually exclusive with `IF NOT EXISTS`.

`IF NOT EXISTS`
:   Creates the backup set only if there isn’t a backup set with the same name.
    If a backup set already exists, the command returns a success message even though it has no effect.
    This clause is mutually exclusive with `OR REPLACE`.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH BACKUP POLICY policy_name`
:   Specifies the name of the backup policy for the set.
    The backup policy defines properties of the backup set such as the schedule for backups,
    the retention period for each backup, and whether to prevent backups from being
    removed before the end of the retention period.

    If you omit this parameter from the CREATE BACKUP SET command, you can apply a
    policy later with the ALTER BACKUP SET command.

    > **Important:**
    >
    > Applying a backup policy with a retention lock to a backup set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a backup set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a backup set with a long expiration period, to avoid unexpected storage charges
    > for undeletable backup sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all backups, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`COMMENT = 'string_literal'`
:   Specifies a comment for the backup set.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| CREATE BACKUP SET | The role used to create a backup set must have this privilege granted on the schema in which the backup set is created. To actually create the backup set also requires the appropriate privilege on the object that’s the subject of the backup set: SELECT for a table backup, or USAGE for a schema backup or database backup. |
| SELECT | The role used to create a backup set for a table must have the SELECT privilege on that table. |
| USAGE | The role used to create a backup set for a schema or database must have the USAGE privilege on that schema or database. |
| APPLY | The role used to apply a backup policy on a backup set must have this privilege on the backup policy. |
| APPLY BACKUP RETENTION LOCK | The role used to apply a backup policy with retention lock on a backup set must have this privilege on the account. |

These privileges are required on the currently active primary role, not a secondary role.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Regarding metadata:

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the backup policy has a retention lock applied to it, and there are any
> unexpired backups in the backup set, then you can’t delete the backup set.
> In that case, you must wait for all the backups in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a backup policy.

## Examples

Create a backup set named `t1_backups` for table `t1`:

```sqlexample
CREATE BACKUP SET t1_backups
  FOR TABLE t1;
```

Create a backup set `t1_backups` for table `t1` with a backup policy:

```sqlexample
CREATE BACKUP SET t1_backups
  FOR TABLE t1
  WITH BACKUP POLICY hourly_backup_policy;
```

Create a backup set `s1_backups` for schema `s1` with a backup policy:

```sqlexample
CREATE BACKUP SET s1_backups
  FOR SCHEMA s1
  WITH BACKUP POLICY hourly_backup_policy;
```

Create a backup set `d1_backups` for database `d1` with a backup policy:

```sqlexample
CREATE BACKUP SET d1_backups
  FOR DATABASE d1
  WITH BACKUP POLICY hourly_backup_policy;
```

---
title: CREATE CATALOG INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION

Creates a new [catalog integration](../../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)
in the account or replaces an existing catalog integration.

The syntax of the command depends on the type of external Iceberg catalog that you use. The following topics explain the syntax for
creating catalog integrations for different use cases:

* [CREATE CATALOG INTEGRATION (AWS Glue)](create-catalog-integration-glue.md)
* [CREATE CATALOG INTEGRATION (Object storage)](create-catalog-integration-object-storage.md)
* [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](create-catalog-integration-open-catalog.md)
* [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](create-catalog-integration-rest.md)
* [CREATE CATALOG INTEGRATION (SAP® Business Data Cloud)](create-catalog-integration-sap.md)

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

---
title: CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-rest.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)

Creates a new [catalog integration](../../user-guide/tables-iceberg.md) in the account or replaces an existing catalog integration
for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) managed in a remote catalog that complies with the
open source [Apache Iceberg™ REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml).

> **Note:**
>
> To create an integration for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview), see [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](create-catalog-integration-open-catalog.md) instead.

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [ IF NOT EXISTS ] <name>
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  [ CATALOG_NAMESPACE = '<namespace>' ]
  REST_CONFIG = (
    restConfigParams
  )
  REST_AUTHENTICATION = (
    restAuthenticationParams
  )
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

Where:

```sqlsyntax
restConfigParams ::=

  CATALOG_URI = '<rest_api_endpoint_url>'
  [ PREFIX = '<prefix>' ]
  [ CATALOG_NAME = '<catalog_name>' ]
  [ CATALOG_API_TYPE = { PUBLIC | PRIVATE | AWS_API_GATEWAY | AWS_PRIVATE_API_GATEWAY | AWS_GLUE | AWS_PRIVATE_GLUE} ]
  [ ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS } ]
```

The `restAuthenticationParams` are as follows, depending on your authentication method:

**OAuth**

```sqlsyntax
restAuthenticationParams (for OAuth) ::=

  TYPE = OAUTH
  [ OAUTH_TOKEN_URI = 'https://<token_server_uri>' ]
  OAUTH_CLIENT_ID = '<oauth_client_id>'
  OAUTH_CLIENT_SECRET = '<oauth_client_secret>'
  OAUTH_ALLOWED_SCOPES = ('<scope_1>', '<scope_2>')
```

**Bearer token**

```sqlsyntax
restAuthenticationParams (for Bearer token) ::=

  TYPE = BEARER
  BEARER_TOKEN = '<bearer_token>'
```

**SigV4**

```sqlsyntax
restAuthenticationParams (for SigV4) ::=

  TYPE = SIGV4
  SIGV4_IAM_ROLE = '<iam_role_arn>'
  [ SIGV4_SIGNING_REGION = '<region>' ]
  [ SIGV4_EXTERNAL_ID = '<external_id>' ]
```

## Parameters

`name`
:   String that specifies the identifier (name) for the catalog integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_SOURCE = ICEBERG_REST`
:   Specifies that the catalog source is a REST catalog that’s compliant with the Apache Iceberg REST specification.

`TABLE_FORMAT = ICEBERG`
:   Specifies ICEBERG as the table format supplied by the catalog.

`CATALOG_NAMESPACE = 'namespace'`
:   Optionally specifies the namespace in the external catalog. Snowflake uses this namespace for all Iceberg tables that you associate with
    this catalog integration.

    If specified, you can override this value by specifying a namespace at the table level using the
    [CATALOG_NAMESPACE](create-iceberg-table-rest.md) parameter for [CREATE ICEBERG TABLE (Iceberg REST catalog)](create-iceberg-table-rest.md).
    If not specified, you must set it at the table level by using the CATALOG_NAMESPACE parameter for
    CREATE ICEBERG TABLE (Iceberg REST catalog).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether the catalog integration is available to use for Iceberg tables.

    > * `TRUE` allows users to create new Iceberg tables that reference this integration.
    > * `FALSE` prevents users from creating new Iceberg tables that reference this integration.

    The value is case-insensitive.

    The default is `TRUE`.

`REFRESH_INTERVAL_SECONDS = value`
:   Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates
    for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    For Delta-based tables, specifies the number of seconds that Snowflake waits between attempts to poll your external cloud storage for
    new metadata.

    Values: 30 to 86400, inclusive

    Default: 30 seconds

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

### REST configuration parameters (restConfigParams)

`CATALOG_URI = 'rest_api_endpoint_url'`
:   The endpoint URL for your catalog REST API. For AWS Glue REST, specify the
    [service endpoint for the AWS Glue Iceberg REST catalog](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html).

`PREFIX`
:   Optionally specifies a prefix to append to all API routes.

`CATALOG_API_TYPE = { PUBLIC | PRIVATE | AWS_API_GATEWAY | AWS_PRIVATE_API_GATEWAY | AWS_GLUE | AWS_PRIVATE_GLUE }`
:   Specifies the connection type for the catalog API. Required for SigV4 authentication; otherwise, this parameter is optional.

    * `PUBLIC` specifies an API that is publicly accessible and isn’t managed using Amazon API Gateway; used for non-SigV4 APIs.
    * `PRIVATE` specifies that the catalog, such as Databricks Unity Catalog or a generic Iceberg REST catalog, is accessible
      through a private endpoint. For more information, see [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](../../user-guide/tables-iceberg-configure-catalog-integration-rest-private.md).
    * `AWS_API_GATEWAY` specifies a public API managed using Amazon API Gateway.
    * `AWS_PRIVATE_API_GATEWAY` specifies a private API managed using Amazon API Gateway.
    * `AWS_GLUE` specifies that the AWS Glue REST catalog is publicly accessible. With this option, you must also specify a value
      for `CATALOG_NAME`.
    * `AWS_PRIVATE_GLUE` specifies that the AWS Glue REST catalog is accessible through a private endpoint. With this option, you must
      also specify a value for `CATALOG_NAME`. For more information, see [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](../../user-guide/tables-iceberg-configure-catalog-integration-rest-private.md).

    Default: `PUBLIC`

`CATALOG_NAME`
:   Specifies the catalog or identifier to request from your remote catalog service.

    When you use `CATALOG_API_TYPE = AWS_GLUE`, specify the ID of your AWS account for this parameter.

    This parameter is required by some
    third-party catalog services. Check with your catalog provider to determine whether you must specify a catalog name.

    > **Note:**
    >
    > Before Snowflake version 9.6, this parameter was called `WAREHOUSE`. Snowflake still recognizes `WAREHOUSE` if specified,
    > but we recommend that you use `CATALOG_NAME`.

`ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS }`
:   Specifies the access delegation mode to use for accessing Iceberg table files in your external cloud storage.

    * `VENDED_CREDENTIALS` specifies that Snowflake should use vended credentials.
    * `EXTERNAL_VOLUME_CREDENTIALS` specifies that Snowflake should use an external volume.

    Default: `EXTERNAL_VOLUME_CREDENTIALS`

### REST authentication parameters (restAuthenticationParams)

**OAuth**

> `TYPE = OAUTH`
> :   Specifies OAuth as the authentication type for Snowflake to use to connect to your Iceberg REST catalog.
>
> `OAUTH_TOKEN_URI = token_server_uri`
> :   Optional URL for your third-party identity provider. If not specified, Snowflake assumes that the remote catalog provider is the OAuth identity provider.
>
> `OAUTH_CLIENT_ID = oauth_client_id`
> :   Your OAuth2 client ID.
>
> `OAUTH_CLIENT_SECRET = oauth_client_secret`
> :   Your OAuth2 client secret.
>
> `OAUTH_ALLOWED_SCOPES = ( 'scope_1', 'scope_2' )`
> :   The scope of the OAuth token. The Iceberg REST API specification includes only one scope,
>     but catalogs can support more than one scope in their implementation.

**Bearer token**

> `TYPE = BEARER`
> :   Specifies a bearer token as the authentication type for Snowflake to use to connect to your Iceberg REST catalog.
>
> `BEARER_TOKEN = bearer_token`
> :   The bearer token for your identity provider. You can alternatively specify a personal access token (PAT).

**SigV4**

> `TYPE = SIGV4`
> :   Specifies Signature Version 4 as the authentication type for Snowflake to use to connect to your Iceberg REST catalog.
>
> `SIGV4_IAM_ROLE = 'iam_role_arn'`
> :   Specifies the Amazon Resource Name (ARN) for an IAM role that has permission to access your REST API in API Gateway.
>
> `SIGV4_SIGNING_REGION = 'region'`
> :   Optionally specifies the AWS Region associated with your API in API Gateway. If you don’t specify this parameter, Snowflake uses the region
>     in which your Snowflake account is deployed.

`SIGV4_EXTERNAL_ID = 'external_id'`
:   Optionally specifies an external ID that Snowflake uses to establish a trust relationship with AWS.
    You must specify the same external ID in the trust policy of the IAM role
    that you configured for this catalog integration.

    If you don’t specify a value for this parameter, Snowflake automatically generates a unique external ID when you create (or replace) a catalog integration.

    For more information about external IDs,
    see
    [How to use an external ID when granting access to your AWS resources to a third party](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Catalog integrations provide read-only access to external Iceberg catalogs.
* You can’t modify an existing catalog integration; use a CREATE OR REPLACE CATALOG INTEGRATION statement instead.
* You can’t drop or replace a catalog integration if one or more Apache Iceberg™ tables
  are associated with the catalog integration.

  To view the tables that depend on a catalog integration,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `catalog_name` column.

  > **Note:**
  >
  > The column identifier (`catalog_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "catalog_name" = 'my_catalog_integration_1';
  ```
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following example creates a REST catalog integration that uses OAuth to connect
to Tabular. It sets a default namespace using the `CATALOG_NAMESPACE` parameter.

To override the default namespace at the table level,
use the [CATALOG_NAMESPACE](create-iceberg-table-rest.md) parameter for CREATE ICEBERG TABLE.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION tabular_catalog_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'default'
  REST_CONFIG = (
    CATALOG_URI = 'https://api.tabular.io/ws'
    CATALOG_NAME = '<tabular_warehouse_name>'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_TOKEN_URI = 'https://api.tabular.io/ws/v1/oauth/tokens'
    OAUTH_CLIENT_ID = '<oauth_client_id>'
    OAUTH_CLIENT_SECRET = '<oauth_client_secret>'
    OAUTH_ALLOWED_SCOPES = ('catalog')
  )
  ENABLED = TRUE;
```

Create a catalog integration for AWS Glue REST with SigV4 authentication:

```sqlexample
CREATE CATALOG INTEGRATION glue_rest_catalog_int
  CATALOG_SOURCE = ICEBERG_REST
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'rest_catalog_integration'
  REST_CONFIG = (
    CATALOG_URI = 'https://glue.us-west-2.amazonaws.com/iceberg'
    CATALOG_API_TYPE = AWS_GLUE
    CATALOG_NAME = '123456789012'
  )
  REST_AUTHENTICATION = (
    TYPE = SIGV4
    SIGV4_IAM_ROLE = 'arn:aws:iam::123456789012:role/my-role'
    SIGV4_SIGNING_REGION = 'us-west-2'
  )
  ENABLED = TRUE;
```

For examples that cover the other authentication options, see [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md).

---
title: CREATE CATALOG INTEGRATION (AWS Glue)
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-glue.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION (AWS Glue)

Creates a new [catalog integration](../../user-guide/tables-iceberg.md)
in the account or replaces an existing catalog integration for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)
that use AWS Glue as the catalog.

> **Important:**
>
> To integrate with AWS Glue, we recommend that you instead create a catalog integration for the
> [AWS Glue Iceberg REST endpoint](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html),
> which supports additional Iceberg table features such as catalog-vended credentials.
>
> For instructions, see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](create-catalog-integration-rest.md).

> **Note:**
>
> When you create a catalog integration for AWS Glue, you must complete additional steps to establish a
> trust relationship between Snowflake and the Glue Data Catalog. For information, see [Configure a catalog integration for AWS Glue](../../user-guide/tables-iceberg-configure-catalog-integration-glue.md).

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [IF NOT EXISTS]
  <name>
  CATALOG_SOURCE = GLUE
  TABLE_FORMAT = ICEBERG
  GLUE_AWS_ROLE_ARN = '<arn-for-AWS-role-to-assume>'
  GLUE_CATALOG_ID = '<glue-catalog-id>'
  [ GLUE_REGION = '<AWS-region-of-the-glue-catalog>' ]
  [ CATALOG_NAMESPACE = '<catalog-namespace>' ]
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (name) for the catalog integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_SOURCE = GLUE`
:   Specifies that the integration is for AWS Glue.

`TABLE_FORMAT = ICEBERG`
:   Specifies Glue Iceberg tables.

`GLUE_AWS_ROLE_ARN = 'arn-for-AWS-role-to-assume'`
:   Specifies the Amazon Resource Name (ARN) of the AWS Identity and Access Management (IAM) role to assume.

`GLUE_CATALOG_ID = 'glue-catalog-id'`
:   Specifies the ID of your AWS account.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether the catalog integration is available to use for Iceberg tables.

    > * `TRUE` lets users create new Iceberg tables that reference this integration. Existing Iceberg tables that reference
    >   this integration function normally.
    > * `FALSE` prevents users from creating new Iceberg tables that reference this integration. Existing Iceberg tables that
    >   reference this integration cannot access the catalog in the table definition.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

`[ GLUE_REGION = 'AWS-region-of-the-glue-catalog' ]`
:   Specifies the AWS Region of your AWS Glue Data Catalog. You must specify a value for this parameter if
    your Snowflake account is not hosted on AWS. Otherwise, the default region is the Snowflake deployment region for the account.

`CATALOG_NAMESPACE = 'catalog-namespace'`
:   Specifies your AWS Glue Data Catalog namespace (for example, `my_glue_database`). This is the default namespace for all Iceberg tables
    that you associate with this catalog integration.

    > * If specified, you can override this value by specifying the namespace at the table level when you create a table.
    > * If not specified, you must specify the namespace at the table level when you create a table.

`REFRESH_INTERVAL_SECONDS = value`
:   Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates
    for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    For Delta-based tables, specifies the number of seconds that Snowflake waits between attempts to poll your external cloud storage for
    new metadata.

    Values: 30 to 86400, inclusive

    Default: 30 seconds

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You cannot modify an existing catalog integration; use a CREATE OR REPLACE CATALOG INTEGRATION statement instead.
* You can’t drop or replace a catalog integration if one or more Apache Iceberg™ tables
  are associated with the catalog integration.

  To view the tables that depend on a catalog integration,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `catalog_name` column.

  > **Note:**
  >
  > The column identifier (`catalog_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "catalog_name" = 'my_catalog_integration_1';
  ```
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following example creates a catalog integration that uses an AWS Glue catalog source.
When you create a catalog integration for Glue, you must complete additional steps to establish a
trust relationship between Snowflake and the Glue Data Catalog. For information, see [Configure a catalog integration for AWS Glue](../../user-guide/tables-iceberg-configure-catalog-integration-glue.md).

> ```sqlexample
> CREATE CATALOG INTEGRATION glueCatalogInt
>   CATALOG_SOURCE = GLUE
>   CATALOG_NAMESPACE = 'myNamespace'
>   TABLE_FORMAT = ICEBERG
>   GLUE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myGlueRole'
>   GLUE_CATALOG_ID = '123456789012'
>   GLUE_REGION = 'us-east-2'
>   ENABLED = TRUE;
> ```

---
title: CREATE CATALOG INTEGRATION (Object storage)
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-object-storage.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION (Object storage)

Creates a new [catalog integration](../../user-guide/tables-iceberg.md)
in the account or replaces an existing catalog integration for the following sources:

* Apache Iceberg™ metadata files
* Delta table files

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [IF NOT EXISTS]
  <name>
  CATALOG_SOURCE = OBJECT_STORE
  TABLE_FORMAT = { ICEBERG | DELTA }
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (name) for the catalog integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_SOURCE = OBJECT_STORE`
:   Specifies external Iceberg metadata files or Delta files in object storage as the source.

`TABLE_FORMAT = { ICEBERG | DELTA }`
:   Specifies the table format.

    * `ICEBERG`: Specifies Glue Iceberg tables or Iceberg tables from metadata in an external cloud storage location.
    * `DELTA`: Specifies Delta tables.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether the catalog integration is available to use for Iceberg tables.

    > * `TRUE` allows users to create new Iceberg tables that reference this integration. Existing Iceberg tables that reference
    >   this integration function normally.
    > * `FALSE` prevents users from creating new Iceberg tables that reference this integration. Existing Iceberg tables that
    >   reference this integration cannot access the catalog in the table definition.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

`REFRESH_INTERVAL_SECONDS = value`
:   Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates
    for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    For Delta-based tables, specifies the number of seconds that Snowflake waits between attempts to poll your external cloud storage for
    new metadata.

    Values: 30 to 86400, inclusive

    Default: 30 seconds

> **Note:**
>
> The REFRESH_INTERVAL_SECONDS parameter is only supported when `TABLE_FORMAT = DELTA` for this type of catalog integration.

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You cannot modify an existing catalog integration; use a CREATE OR REPLACE CATALOG INTEGRATION statement instead.
* You can’t drop or replace a catalog integration if one or more Apache Iceberg™ tables
  are associated with the catalog integration.

  To view the tables that depend on a catalog integration,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `catalog_name` column.

  > **Note:**
  >
  > The column identifier (`catalog_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "catalog_name" = 'my_catalog_integration_1';
  ```
* [Automatically refresh Apache Iceberg™ tables](../../user-guide/tables-iceberg-auto-refresh.md) is only supported for this type of catalog integration when `TABLE_FORMAT = DELTA`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following example creates an integration that uses Iceberg metadata in external cloud storage. OBJECT_STORE corresponds to
the object storage that you associate with an [external volume](create-external-volume.md).

> ```sqlexample
> CREATE CATALOG INTEGRATION myCatalogInt
>   CATALOG_SOURCE = OBJECT_STORE
>   TABLE_FORMAT = ICEBERG
>   ENABLED = TRUE;
> ```

---
title: CREATE CATALOG INTEGRATION (SAP® Business Data Cloud)
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-sap.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION (SAP® Business Data Cloud)

Creates a new catalog integration in the account or replaces an existing catalog integration for
SAP® Business Data Cloud to interact with SAP® Data Products managed in the SAP® Business Data Cloud object store.

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [ IF NOT EXISTS ] <name>
  CATALOG_SOURCE = SAP_BDC
  TABLE_FORMAT = DELTA
  REST_CONFIG = (
    restConfigParams
  )
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

Where:

```sqlsyntax
restConfigParams ::=

SAP_BDC_INVITATION_LINK = '<Invitation Link from SAP BDC>'
[ ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS } ]
```

## Parameters

`name`
:   String that specifies the identifier (name) for the catalog integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_SOURCE = SAP_BDC`
:   Specifies that the catalog source is SAP® Business Data Cloud.

`TABLE_FORMAT = DELTA`
:   Specifies DELTA as the table format supplied by the catalog.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether the catalog integration is available to use for Iceberg tables.

    > * `TRUE` allows users to create new Iceberg tables that reference this integration.
    > * `FALSE` prevents users from creating new Iceberg tables that reference this integration.

    The value is case-insensitive.

    The default is `TRUE`.

`REFRESH_INTERVAL_SECONDS = <value>`
:   Specifies the number of seconds that Snowflake waits between attempts to poll SAP®
    Business Data Cloud catalog for metadata updates for automated refresh.

    Values: 30 to 86400, inclusive

    Default: 30 seconds

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

### REST configuration parameters (restConfigParams)

`ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS`
:   Specifies the access delegation mode to use for accessing table files from SAP® Business Data Cloud.
    The only option supported is VENDED_CREDENTIALS.

`SAP_BDC_INVITATION_LINK = VENDED_CREDENTIALS`
:   Specifies the Invitation Link obtained from [SAP 4 Me](https://me.sap.com/) as documented in
    [Provisioning SAP Business Data Cloud Connect](https://help.sap.com/docs/business-data-cloud/administering-sap-business-data-cloud/provision-sap-business-data-cloud-connector-for-supported-external-systems)

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example creates a catalog integration and enrolls it with SAP® Business Data Cloud.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION MY_SAP_BDC_CATALOG_INT
  CATALOG_SOURCE = SAP_BDC
  TABLE_FORMAT = DELTA
  REST_CONFIG = (
    SAP_BDC_INVITATION_LINK = '<Invitation URL from SAP BDC>'
    ACCESS_DELEGATION_MODE = VENDED_CREDENTIALS
  )
  ENABLED = TRUE
  COMMENT = 'My SAP BDC catalog integration'
  ;
```

---
title: CREATE CATALOG INTEGRATION (Snowflake Open Catalog)
source: https://docs.snowflake.com/en/sql-reference/sql/create-catalog-integration-open-catalog.md
section: SQL Commands
---

# CREATE CATALOG INTEGRATION (Snowflake Open Catalog)

Creates a new [catalog integration](../../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)
that integrate with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) in the account or replaces an existing catalog integration.

You can also use this command to create a catalog integration for Iceberg tables in [Apache Polaris™](https://polaris.apache.org/).

See also:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md), [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

### CATALOG_API_TYPE: PUBLIC

Use this catalog integration to connect Snowflake to Open Catalog through the public internet. The default for the CATALOG_API_TYPE
parameter is PUBLIC, so you don’t have to specify this parameter.

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [ IF NOT EXISTS ]
  <name>
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  [ CATALOG_NAMESPACE = '<open_catalog_namespace>' ]
  REST_CONFIG = (
    CATALOG_URI = '<open_catalog_account_url>'
    [ CATALOG_API_TYPE = PUBLIC ]
    CATALOG_NAME = '<open_catalog_catalog_name>'
    [ ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS } ]
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    [ OAUTH_TOKEN_URI = 'https://<token_server_uri>' ]
    OAUTH_CLIENT_ID = '<oauth_client_id>'
    OAUTH_CLIENT_SECRET = '<oauth_secret>'
    OAUTH_ALLOWED_SCOPES = ('<scope 1>', '<scope 2>')
  )
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

### CATALOG_API_TYPE: PRIVATE

If you use [private connectivity for inbound network traffic in Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-inbound),
use this catalog integration to connect Snowflake to Open Catalog through a private IP address.

```sqlsyntax
CREATE [ OR REPLACE ] CATALOG INTEGRATION [ IF NOT EXISTS ]
  <name>
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  [ CATALOG_NAMESPACE = '<open_catalog_namespace>' ]
  REST_CONFIG = (
    CATALOG_URI = '<open_catalog_account_url>'
    CATALOG_API_TYPE = PRIVATE
    CATALOG_NAME = '<open_catalog_catalog_name>'
    [ ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS } ]
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = '<oauth_client_id>'
    OAUTH_CLIENT_SECRET = '<oauth_secret>'
    OAUTH_ALLOWED_SCOPES = ('<scope 1>', '<scope 2>')
  )
  ENABLED = { TRUE | FALSE }
  [ REFRESH_INTERVAL_SECONDS = <value> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (name) for the catalog integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_SOURCE = POLARIS`
:   Specifies Snowflake Open Catalog as the catalog source.

`TABLE_FORMAT = ICEBERG`
:   Specifies Apache Iceberg™ as the table format supplied by the catalog.

`REST_CONFIG = ( ... )`
:   Specifies information about your Open Catalog account and catalog name.

> `CATALOG_URI = 'https://open_catalog_account_url'`
> :   Your Open Catalog account URL. Supported values are:
>
>     * `https://<open_catalog_account_identifier>.snowflakecomputing.com/polaris/api/catalog`: When `CATALOG_API_TYPE = PUBLIC`. Examples values:
>
>       + `https://<orgname>-<my-snowflake-open-catalog-account-name>.snowflakecomputing.com/polaris/api/catalog`
>       + `https://<account_locator>.<cloud_region_id>.<cloud>.snowflakecomputing.com/polaris/api/catalog`
>       > **Note:**
>       > + To find your Snowflake organization name (`<orgname>`), follow the steps in [Finding the organization and account name for an account](../../user-guide/admin-account-identifier.md).
>       > + To find `<my-snowflake-open-catalog-account-name`,
>       >   see [Find the account name for a Snowflake Open Catalog account](https://other-docs.snowflake.com/en/opencatalog/find-account-name) in
>       >   the Snowflake Open Catalog documentation.
>       > + To find your `<account_locator>`, `<cloud_region_id>`, and `<cloud>`, see [Format 2: Account locator in a region](../../user-guide/admin-account-identifier.md).
>     * `https://<open_catalog_privatelink_account_url>/polaris/api/catalog`: When `CATALOG_API_TYPE = PRIVATE`.
>
>       > **Note:**
>       >
>       > For `<open_catalog_privatelink_account_url>`, enter one of the following values:
>       >
>       > + **PrivateLink Account URL**
>       > + **Regionless PrivateLink Account URL**
>       >
>       > To obtain these values, retrieve your Open Catalog account settings for private connectivity. For details, see the instructions for the
>       > cloud platform where your Open Catalog account is hosted:
>       >
>       > + [AWS](http://docs.snowflake.com/user-guide/opencatalog/private-connectivity-inbound-configure-aws#step-3-retrieve-your-open-catalog-account-settings)
>       > + [Azure](http://docs.snowflake.com/user-guide/opencatalog/private-connectivity-inbound-configure-azure#step-1-retrieve-your-open-catalog-account-settings)
>
> `CATALOG_API_TYPE = { PRIVATE | PUBLIC }`
> :   Specifies the catalog API type. If your connection between Snowflake and Open Catalog should be routed through the public internet, this
>     parameter is optional.
>
>     > * `PRIVATE`: If you’re using [private connectivity for inbound network traffic in Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/private-connectivity-inbound),
>     >   connects Snowflake to Open Catalog through a private IP address.
>     > * `PUBLIC`: Connects Snowflake to Open Catalog through the public internet.
>
>     Default: `PUBLIC`
>
> `CATALOG_NAME = 'open_catalog_name'`
> :   Specifies the name of the catalog to use in Open Catalog.
>
> `ACCESS_DELEGATION_MODE = { VENDED_CREDENTIALS | EXTERNAL_VOLUME_CREDENTIALS }`
> :   Specifies the access delegation mode to use for accessing Iceberg table files in your external cloud storage.
>
>     * `VENDED_CREDENTIALS` specifies that Snowflake should use vended credentials.
>     * `EXTERNAL_VOLUME_CREDENTIALS` specifies that Snowflake should use an external volume.
>
>     Default: `EXTERNAL_VOLUME_CREDENTIALS`

`REST_AUTHENTICATION = ( ... )`
:   Specifies authentication details that Snowflake uses to connect to Open Catalog.

    `TYPE = OAUTH`
    :   Specifies OAuth as the authentication type to use.

    `OAUTH_TOKEN_URI = token_server_uri`
    :   Optional URL for your third-party identity provider. To configure a third-party identity provider, see [External OAuth](https://other-docs.snowflake.com/en/opencatalog/oauth-ext-overview)
        in the Snowflake Open Catalog documentation. If the OAuth identity provider is not specified, Snowflake assumes that it is the remote catalog provider.

        > **Important:**
        >
        > If you’re using External OAuth with private connectivity (CATALOG_API_TYPE=PRIVATE), Snowflake routes the token requests for External
        > OAuth over the public internet.

    `OAUTH_CLIENT_ID = 'oauth_client_id'`
    :   The client ID of the OAuth2 credential associated with your Open Catalog service connection.

    `OAUTH_CLIENT_SECRET = 'oauth_secret'`
    :   The secret of the OAuth2 credential associated with your Open Catalog service connection.

    `OAUTH_ALLOWED_SCOPES = ( 'scope_1', 'scope_2')`
    :   One or more scopes for the OAuth token.

`ENABLED = {TRUE | FALSE}`
:   Specifies whether the catalog integration is available to use for Iceberg tables.

    > * `TRUE` allows users to create new Iceberg tables that reference this integration. Existing Iceberg tables that reference
    >   this integration function normally.
    > * `FALSE` prevents users from creating new Iceberg tables that reference this integration. Existing Iceberg tables that
    >   reference this integration cannot access the catalog in the table definition.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

`CATALOG_NAMESPACE = 'open_catalog_namespace'`
:   * If you’re creating the catalog integration to [query a table in Snowflake Open Catalog using Snowflake](../../user-guide/tables-iceberg-open-catalog-query.md),
      you can optionally specify the namespace from Open Catalog. Snowflake uses this namespace for all Iceberg tables that you associate with
      this catalog integration.

      If specified, you can override this value at the table level when you create a table. If not specified, you
      must set a namespace at the table level when you create a table.
    * If you’re creating the catalog integration to [sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md),
      this parameter has no effect on how you sync the table with Open Catalog. Snowflake syncs the table to the external catalog in Open Catalog
      that you specify in the catalog integration by using a predefined rule.

      For example, if you have a `db1.public.table1`
      Iceberg table registered in Snowflake and you specify `catalog1` in the catalog integration, Snowflake syncs the table with Open Catalog
      with the following fully qualified name: `catalog1.db1.public.table1`.

`REFRESH_INTERVAL_SECONDS = value`
:   Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates
    for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    For Delta-based tables, specifies the number of seconds that Snowflake waits between attempts to poll your external cloud storage for
    new metadata.

    Values: 30 to 86400, inclusive

    Default: 30 seconds

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You can’t modify an existing catalog integration; use a CREATE OR REPLACE CATALOG INTEGRATION statement instead.
* You can’t drop or replace a catalog integration if one or more Apache Iceberg™ tables
  are associated with the catalog integration.

  To view the tables that depend on a catalog integration,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `catalog_name` column.

  > **Note:**
  >
  > The column identifier (`catalog_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "catalog_name" = 'my_catalog_integration_1';
  ```
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* To troubleshoot issues with creating a catalog integration, see [You can’t create a catalog integration for Open Catalog](../../user-guide/tables-iceberg-open-catalog-troubleshooting.md).

## Examples

The following example creates a catalog integration for Open Catalog for a particular namespace in an internal catalog
in Open Catalog to query tables grouped under this namespace in Snowflake. For more information about internal catalogs in Open Catalog, see
[Catalog types](https://other-docs.snowflake.com/en/opencatalog/overview#catalog-types) in the Open Catalog documentation.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION open_catalog_int
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  CATALOG_NAMESPACE = 'my_catalog_namespace'
  REST_CONFIG = (
    CATALOG_URI = 'https://my_org_name-my_snowflake_open_catalog_account_name.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = 'my_catalog_name'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'my_client_id'
    OAUTH_CLIENT_SECRET = 'my_client_secret'
    OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:ALL')
  )
  ENABLED = TRUE;
```

The following example creates a catalog integration for Open Catalog to sync Snowflake-managed tables to the `customers` catalog in
Open Catalog, which is an external catalog. For more information about external catalogs in Open Catalog, see
[Catalog types](https://other-docs.snowflake.com/en/opencatalog/overview#catalog-types) in the Open Catalog documentation.

```sqlexample
CREATE OR REPLACE CATALOG INTEGRATION open_catalog_int2
  CATALOG_SOURCE = POLARIS
  TABLE_FORMAT = ICEBERG
  REST_CONFIG = (
    CATALOG_URI = 'https://my_org_name-my_snowflake_open_catalog_account_name.snowflakecomputing.com/polaris/api/catalog'
    CATALOG_NAME = 'customers'
  )
  REST_AUTHENTICATION = (
    TYPE = OAUTH
    OAUTH_CLIENT_ID = 'my_client_id'
    OAUTH_CLIENT_SECRET = 'my_client_secret'
    OAUTH_ALLOWED_SCOPES = ('PRINCIPAL_ROLE:my-principal-role', 'PRINCIPAL_ROLE:my-principal-role2', 'PRINCIPAL_ROLE:my-principal-role3')
  )
  ENABLED = TRUE;
```

---
title: CREATE COMPUTE POOL
source: https://docs.snowflake.com/en/sql-reference/sql/create-compute-pool.md
section: SQL Commands
---

# CREATE COMPUTE POOL

Creates a new [compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) in the current account.

See also:
:   [ALTER COMPUTE POOL](alter-compute-pool.md) , [DESCRIBE COMPUTE POOL](desc-compute-pool.md), [DROP COMPUTE POOL](drop-compute-pool.md) , [SHOW COMPUTE POOLS](show-compute-pools.md)

## Syntax

```sqlsyntax
CREATE COMPUTE POOL [ IF NOT EXISTS ] <name>
  [ FOR APPLICATION <app-name> ]
  MIN_NODES = <num>
  MAX_NODES = <num>
  INSTANCE_FAMILY = <instance_family_name>
  [ AUTO_RESUME = { TRUE | FALSE } ]
  [ INITIALLY_SUSPENDED = { TRUE | FALSE } ]
  [ AUTO_SUSPEND_SECS = <num>  ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COMMENT = '<string_literal>' ]
  [ PLACEMENT_GROUP = '<placement_group_name>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (that is, the name) for the compute pool; it must be unique for your account. Quoted names for special characters or case-sensitive names are not supported.

`MIN_NODES = num`
:   Specifies the minimum number of nodes for the compute pool. This value must be greater than 0. For more information, see
    [Creating a compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

`MAX_NODES = num`
:   Specifies the maximum number of nodes for the compute pool.

`INSTANCE_FAMILY = instance_family_name`
:   Identifies the type of machine you want to provision for the nodes in the compute pool. The machine type determines the amount
    of compute resources in the compute pool and, therefore, the number of credits consumed while the compute pool is running.

    The INSTANCE_FAMILY values in the following table can be grouped into 3 categories:

    * **Generic instance types:** Provide a balance of CPU, memory and disk. This does not include GPU. These instance family names
      start with “CPU”.
    * **High memory instance types:** Similar to generic instance types, but these provide more memory. These instance family
      names start with “HighMemory”.
    * **Instance types with GPU attached:** These instance family names start with “GPU”.

    You can also use the [SHOW COMPUTE POOL INSTANCE FAMILIES](show-compute-pool-instance-families.md) command to get this list of available instance families.

    > | INSTANCE_FAMILY, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) | vCPU | Memory (GiB) | Storage (GB) | Bandwidth limit (Gbps) | GPU | GPU Memory per GPU (GB) | Node limit | Description |
    > | --- | --- | --- | --- | --- | --- | --- | --- | --- |
    > | CPU_X64_XS | 1 | 6 | 100 | Up to 12.5 | n/a | n/a | 150 | Smallest instance available for Snowpark Containers. Ideal for cost-savings and getting started. |
    > | CPU_X64_S | 3 | 13 | 100 | Up to 12.5 | n/a | n/a | 150 | Ideal for hosting multiple services/jobs while saving cost. |
    > | CPU_X64_M | 6 | 28 | 100 | Up to 12.5 | n/a | n/a | 150 | Ideal for having a full stack application or multiple services |
    > | CPU_X64_SL (except China) | 14 | 54 | 100 | Up to 12.5 | n/a | n/a | 150 | For applications which need a large number of CPUs, memory and Storage. |
    > | CPU_X64_L | 28 | 116 | 100 | 12.5 | n/a | n/a | 150 | For applications which need an unusually large number of CPUs, memory and Storage. |
    > | HIGHMEM_X64_S | 6 | 58 | 100 | AWS and GCP: Up to 12.5, Azure: 8 | n/a | n/a | 150 | For memory intensive applications. |
    > | HIGHMEM_X64_M | 28 | AWS: 240, Azure and GCP: 244 | 100 | AWS: 12.5, Azure and GCP: 16 | n/a | n/a | 150 | For hosting multiple memory intensive applications on a single machine. |
    > | HIGHMEM_X64_SL (Azure and GCP, except GCP Dammam region) | 92 | 654 | 100 | 32 | n/a | n/a | 20 | Largest Azure or GCP high-memory machine available for processing large in-memory data. |
    > | HIGHMEM_X64_L (AWS only) | 124 | 984 | 100 | 50 | n/a | n/a | 150 | Largest AWS high-memory machine available for processing large in-memory data. |
    > | GPU_NV_S (AWS only, except Singapore, Switzerland North, Paris, and Osaka regions) | 6 | 27 | 300 (NVMe) | Up to 10 | 1 NVIDIA A10G | 24 | 150 | Our smallest NVIDIA GPU size available for Snowpark Containers to get started. |
    > | GPU_NV_M (AWS only, except gov regions, Singapore, Switzerland North, Paris, and Osaka regions) | 44 | 178 | 3.4 TB (NVMe) | 40 | 4 NVIDIA A10G | 24 | 10 | Optimized for intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
    > | GPU_NV_L (AWS only, available only in AWS US West and US East non-gov regions by request; limited availability might be possible in other regions upon request) | 92 | 1112 | 6.8 TB (NVMe) | 400 | 8 NVIDIA A100 | 40 | On request | Largest GPU instance for specialized and advanced GPU cases like LLMs and Clustering, etc. |
    > | GPU_NV_XS (Azure only, except Switzerland North, UAE North, Central US, and UK South regions) | 3 | 26 | 100 | 8 | 1 NVIDIA T4 | 16 | 10 | Our smallest Azure NVIDIA GPU size available for Snowpark Containers to get started. |
    > | GPU_NV_SM (Azure only, except Central US region) | 32 | 424 | 100 | 40 | 1 NVIDIA A10 | 24 | 10 | A smaller Azure NVIDIA GPU size available for Snowpark Containers to get started. |
    > | GPU_NV_2M (Azure only, except Central US region) | 68 | 858 | 100 | 80 | 2 NVIDIA A10 | 24 | 5 | Optimized for intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
    > | GPU_NV_3M (Azure only, except Central US, North Europe, and UAE North regions) | 44 | 424 | 100 | 40 | 2 NVIDIA A100 | 80 | On request | Optimized for memory-intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |
    > | GPU_NV_SL (Azure only, except Central US, North Europe, and UAE North regions) | 92 | 858 | 100 | 80 | 4 NVIDIA A100 | 80 | On request | Largest GPU instance for specialized and advanced GPU cases like LLMs and Clustering, etc. |
    > | GPU_GCP_NV_L4_1_24G (Google Cloud only) | 6 | 28 | 300 | Up to 16 | 1 NVIDIA L4 | 24 | 10 | Our smallest NVIDIA GPU size available for Snowpark Containers to get started. |
    > | GPU_GCP_NV_L4_4_24G (Google Cloud only) | 44 | 178 | 1200 | Up to 50 | 4 NVIDIA L4 | 24 | 10 | GPU usage scenarios like Computer Vision or LLMs. |
    > | GPU_GCP_NV_A100_8_40G (Google Cloud only, available only in GCP US Central1 and Europe West4 regions by request) | 92 | 654 | 2500 | Up to 100 | 8 NVIDIA A100 | 40 | On request | Optimized for memory-intensive GPU usage scenarios like Computer Vision or LLMs/VLMs. |

    Note the following:

    * The consumption table link in the first column heading provides information about the credit consumption rate for the specific `INSTANCE_FAMILY`.
    * The Node limit column indicates the maximum number of nodes a Snowflake account can provision for the specific `INSTANCE_FAMILY` type. Contact your account representative to increase the limit.

## Optional parameters

`FOR APPLICATION app_name`
:   Specifies the Snowflake Native App name. If specified, the compute pool can only be used by the native app. The [SHOW COMPUTE POOLS](show-compute-pools.md) command output includes the `is_exclusive` and `application` columns to indicate whether the compute pool is created exclusively for an app and provides the app name.

`AUTO_RESUME = { TRUE | FALSE }`
:   Specifies whether to automatically resume a compute pool when a service or job is submitted to it.

    * If AUTO_RESUME is FALSE, you need to explicitly resume the compute pool (using ALTER COMPUTE POOL RESUME) before you can
      start a service or job on the compute pool.
    * If AUTO_RESUME is TRUE, if you start a new service on a suspended compute pool, Snowflake starts the compute pool. Similarly,
      when you use a service either by invoking a service function or accessing ingress (see
      [Using a service](../../developer-guide/snowpark-container-services/working-with-services.md)), Snowflake starts the previously suspended compute pool and resumes
      the service.

    Default: TRUE

`INITIALLY_SUSPENDED = { TRUE | FALSE }`
:   Specifies whether the compute pool is created initially in the suspended state. If you create a compute pool with
    INITIALLY_SUSPENDED set to TRUE, Snowflake will not provision any nodes requested for the compute pool at the compute pool
    creation time. You can start the suspended compute pool using [ALTER COMPUTE POOL … RESUME](alter-compute-pool.md).

    Default: FALSE

`AUTO_SUSPEND_SECS = num`
:   Number of seconds of inactivity after which you want Snowflake to automatically suspend the compute pool. An inactive compute
    pool is one in which no services or jobs are currently active on any node in the pool. If `auto_suspend_secs` is set to 0,
    Snowflake does not suspend the compute pool automatically.

    Default: 3600 seconds

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the compute pool.

    Default: No value

`PLACEMENT_GROUP = placement_group_name`
:   Identifies the placement group of the compute pool. Use the [SHOW COMPUTE POOLS](show-compute-pools.md)
    and [DESCRIBE COMPUTE POOL](desc-compute-pool.md)
    commands to review the assignment of the compute pool into placement groups.

    You can also set `placement_group` to `DISTRIBUTED`. In this case, Snowflake attempts to distribute compute pool nodes across all available placement groups to maintain an even distribution across multiple placement groups so that the groups are more fault tolerant. For more information, see [Compute pool placement](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE COMPUTE POOL | Account |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a 1-node compute pool. This example command specifies the minimum required parameters:

```sqlexample
CREATE COMPUTE POOL tutorial_compute_pool
  MIN_NODES = 1
  MAX_NODES = 1
  INSTANCE_FAMILY = CPU_X64_XS;
```

The following command specifies the optional AUTO_RESUME parameter:

```sqlexample
CREATE COMPUTE POOL tutorial_compute_pool
  MIN_NODES = 1
  MAX_NODES = 1
  INSTANCE_FAMILY = CPU_X64_XS
  AUTO_RESUME = FALSE;
```

---
title: CREATE CONNECTION
source: https://docs.snowflake.com/en/sql-reference/sql/create-connection.md
section: SQL Commands
---

# CREATE CONNECTION

Creates a new [connection](../../user-guide/client-redirect.md) in the account.

See also:
:   [ALTER CONNECTION](alter-connection.md) , [DROP CONNECTION](drop-connection.md) , [SHOW CONNECTIONS](show-connections.md)

## Syntax

**Primary Connection**

```sqlsyntax
CREATE CONNECTION [ IF NOT EXISTS ] <name>
  [ COMMENT = '<string_literal>' ]
```

**Secondary Connection**

```sqlsyntax
CREATE CONNECTION [ IF NOT EXISTS ] <name>
  AS REPLICA OF <organization_name>.<account_name>.<name>
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the connection. It must conform to the following:

    * Must start with an alphabetic character and may only
      contain letters, decimal digits (0-9), and underscores (_).
    * For a primary connection, the name must be unique across connection names and account names in the organization.
    * For a secondary connection, the name must match the name of its primary connection.

### Secondary connection parameters

`AS REPLICA OF organization_name.account_name.name`
:   Specifies the identifier for a primary connection from which to create a replica (i.e. a secondary connection).

    `organization_name`
    :   Specifies the identifier for the organization.

    `account_name`
    :   Specifies the identifier for the account.

    `name`
    :   Specifies the identifier for the primary connection.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the connection.

    Default: No value

## Access control requirements

Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.

## Usage notes

* If private connectivity to the Snowflake service is enabled for your Snowflake account, your network manager must create and manage
  a DNS CNAME record. For more details, see [Configuring the DNS settings for private connectivity to the Snowflake service](../../user-guide/client-redirect.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a connection. For this example, suppose that you are connected to the account `myaccount1`
in the organization `myorg`.

```sqlexample
CREATE CONNECTION IF NOT EXISTS myconnection;
```

Create a secondary connection as a replica of its primary connection. Substitute your own account and
organization values in the fully qualified name used in the parameter.
You can get the fully qualified value to use from the `primary` column in the output of [SHOW CONNECTIONS](show-connections.md).

```sqlexample
CREATE CONNECTION myconnection AS REPLICA OF myorg.myaccount1.myconnection;
```

---
title: CREATE CONTACT
source: https://docs.snowflake.com/en/sql-reference/sql/create-contact.md
section: SQL Commands
---

# CREATE CONTACT

Creates a new [contact](../../user-guide/contacts-using.md) or replaces an existing contact.

See also:
:   [ALTER CONTACT](alter-contact.md) , [DROP CONTACT](drop-contact.md) , [SHOW CONTACTS](show-contacts.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CONTACT [ IF NOT EXISTS ] <name>
  [ {
    USERS = ( '<user_name>' [ , '<user_name>' ... ] )
    | EMAIL_DISTRIBUTION_LIST = '<email>'
    | URL = '<url>'
    } ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the name of the new contact.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`USERS = ( 'user_name' [ , 'user_name' ... ] )`
:   Comma-delimited list of Snowflake users who can be contacted, specified by the name of their user objects.

    If the user name is case-sensitive or includes any special characters or spaces, double quotes are required. The double quotes must be
    enclosed within the single quotes. For example, if the user is `joe@example.com`, you must specify `'"joe@example.com"'`.

`EMAIL_DISTRIBUTION_LIST = 'email'`
:   A valid email address, which can be a distribution list.

`URL = 'url'`
:   A URL that can be used to contact people about an object.

`COMMENT`
:   A user-defined string. Specifies a comment for the contact.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE CONTACT | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

```sqlexample
CREATE CONTACT my_contact
  EMAIL_DISTRIBUTION_LIST = 'support@example.com';
```

---
title: CREATE CORTEX SEARCH SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/create-cortex-search.md
section: SQL Commands
---

# CREATE CORTEX SEARCH SERVICE

Creates a new [Cortex Search service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) or replaces an existing one.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] CORTEX SEARCH SERVICE [ IF NOT EXISTS ] <name>
  ON <search_column>
  [ PRIMARY KEY ( <col_name> [, ... ] ) ]
  ATTRIBUTES <col_name> [ , ... ]
  WAREHOUSE = <warehouse_name>
  TARGET_LAG = '<num> { seconds | minutes | hours | days }'
  [ EMBEDDING_MODEL = <embedding_model_name> ]
  [ REFRESH_MODE = { FULL | INCREMENTAL } ]
  [ INITIALIZE = { ON_CREATE | ON_SCHEDULE } ]
  [ FULL_INDEX_BUILD_INTERVAL_DAYS = <num> ]
  [ REQUEST_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<comment>' ]
AS <query>;

CREATE [ OR REPLACE ] CORTEX SEARCH SERVICE <name>
  TEXT INDEXES <text_column_name> [ , ... ]
  VECTOR INDEXES <column_specification> [ , ... ]
  [ PRIMARY KEY ( <col_name> [, ... ] ) ]
  ATTRIBUTES <col_name> [ , ... ]
  WAREHOUSE = <warehouse_name>
  TARGET_LAG = '<num> { seconds | minutes | hours | days }'
  [ REFRESH_MODE = { FULL | INCREMENTAL } ]
  [ INITIALIZE = { ON_CREATE | ON_SCHEDULE } ]
  [ FULL_INDEX_BUILD_INTERVAL_DAYS = <num> ]
  [ REQUEST_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<comment>' ]
AS <query>;
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the Cortex Search service; must be unique for the schema in which the service is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ON search_column`
:   Specifies the text column in the base table that you wish to search on, for single-index Cortex Search. This column must be a text value.

`TEXT INDEXES text_column_name [, ... ]`
:   Specifies comma-separated text columns in the base table to search on, for multi-index Cortex Search. Columns must be text values.

`VECTOR INDEXES column_specification [ , ... ]`
:   Specifies columns for vector similarity searches. Column specifications include:

    * *Managed vector embeddings*: `text_column_name (model='embedding_model')`: Specifies a text column and the embedding model used for vector generation.
      Must use one of the [supported embedding models](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). If no model is specified,
      the default model `snowflake-arctic-embed-m-v1.5` is used.
    * *User-provided vector embeddings*: `vector_column_name`: Specifies a user-provided vector embedding column.
    * *User-provided vector embeddings with managed query embeddings*: `vector_column_name(query_model='embedding_model')`:
      Specifies a user-provided vector embedding column and the embedding model used for embedding text at query time.
      The `query_model` must be one of the [Snowflake-managed embedding models supported in Cortex Search](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).
      If no `query_model` is specified, then the user-provided vector column can only be used with a vector embedding query.

    For information on the behavior of vector embeddings, see Usage Notes.

`ATTRIBUTES col_name [ , ... ]`
:   Specifies comma-separated list of columns in the base table that you wish to filter on when issuing queries to the service.
    Attribute columns must be included in the source query, either via explicit enumeration or wildcard, ( `*` ).

`WAREHOUSE = warehouse_name`
:   Specifies the warehouse to use for running the source query, building the search index, and keeping it refreshed per the TARGET_LAG target.

`TARGET_LAG = 'num { seconds | minutes | hours | days }'`
:   Specifies the maximum amount of time that the Cortex Search service content should lag behind updates to the base tables specified in the source query.

    > **Note:**
    >
    > Ensure that the target lag is shorter than the data retention period of the source tables. If the target lag exceeds the data retention period, the service may be unable to detect changes in the source data and could require recreation. For more information, see [DATA_RETENTION_TIME_IN_DAYS](../parameters.md).

## Optional parameters

`PRIMARY KEY ( col_name [, ... ] )`
:   Specifies a set of columns that uniquely identify each row in the source query. The combination of values in the
    designated columns must be unique for each row; rows with duplicate primary key values are ignored in the resulting
    search index. Primary key columns must be of the [TEXT](../data-types-text.md) data type. Services
    with primary keys can make use of an optimized refresh path when the underlying data changes, resulting in significant
    reductions to the cost and latency of a refresh. For more information, see [Primary keys](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

`EMBEDDING_MODEL = <embedding_model_name>`
:   Optional parameter that specifies the embedding model to use in the Cortex Search Service. This property cannot be altered after you create the Cortex
    Search Service. To modify the property, recreate the Cortex Search Service with a CREATE OR REPLACE CORTEX SEARCH SERVICE command.

    Some embedding models are only available in certain cloud regions for Cortex Search.
    For an availability list by model by region, see
    [Cortex Search Regional Availability](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

    Each model may incur a different cost per million input tokens processed.
    Refer to the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) for each function’s cost in credits per million tokens.

    If the `EMBEDDING_MODEL` is not specified, the default model is used. The default model is `snowflake-arctic-embed-m-v1.5`.

`REFRESH_MODE = { FULL | INCREMENTAL }`
:   Specifies the refresh mode for the Cortex Search Service.

    This property cannot be altered after you create the Cortex Search Service. To modify the property, recreate the Cortex Search
    Service with a CREATE OR REPLACE CORTEX SEARCH SERVICE command.

    > `FULL`
    > :   Enforces a full refresh of the Cortex Search Service. A full refresh recomputes all embeddings and rebuilds the index on every
    >     change to the underlying source data. Given the cost of recomputing embeddings, full refresh should only be considered if incremental
    >     refresh is not supported for your workload.
    >
    > `INCREMENTAL`
    > :   Enforces an incremental refresh of the Cortex Search Service. An incremental refresh applies only the changes since the
    >     last refresh, making it more efficient for large datasets with small updates. If the Cortex Search Service cannot perform
    >     an incremental refresh, service creation fails and displays an error message.
    >
    >     Incremental refresh requires change tracking to be enabled on all underlying objects. For more information, see
    >     Change Tracking Requirements.
    >
    > Default: `INCREMENTAL`

`INITIALIZE`
:   Specifies the behavior of the initial [refresh](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) of the Cortex Search Service. This property cannot be
    altered after you create the service. To modify the property, replace the cortex search service with a CREATE OR REPLACE CORTEX SEARCH SERVICE command.

    > `ON_CREATE`
    > :   Refreshes the Cortex Search Service synchronously at creation. If this refresh fails, service creation fails and displays an error message.
    >
    > `ON_SCHEDULE`
    > :   Refreshes the Cortex Search Service at the next scheduled refresh.
    >
    >     The Cortex Search Service is populated when the refresh schedule process runs. No data is populated when the Cortex Search Service is created.
    >     If you try to query the service, you might see the following error because the first scheduled refresh has not yet occurred.
    >
    >     ```output
    >     Your service has not yet been loaded into our serving system. Please retry your request in a few minutes.
    >     ```
    >
    > Default: `ON_CREATE`

`FULL_INDEX_BUILD_INTERVAL_DAYS = num`
:   Specifies the target interval, in days, between full index rebuilds for a Cortex Search service with
    [primary keys](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) defined. This property is only applicable to services that have primary keys set.

    This value is a soft target. Full index rebuilds may occur more frequently than the specified interval to optimize
    serving performance based on factors such as service target lag, change rate in the service source data, and overall
    service size.

    Default: 0

`REQUEST_LOGGING = { TRUE | FALSE }`
:   Enables or disables request logging for the Cortex Search Service. When enabled, the service records
    information about search requests, which you can query for monitoring and analysis purposes.
    For more information, see [Monitor Cortex Search requests](../../user-guide/snowflake-cortex/cortex-search/cortex-search-monitor.md).

    Default: `FALSE`

`COMMENT = 'comment'`
:   Specifies a comment for the service.

`AS query`
:   Specifies a query defining the base table from which the service is created.

## Access Control Requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| CREATE CORTEX SEARCH SERVICE | Schema in which you are creating the search service. |
| SELECT | Tables and views that the service queries. |
| USAGE | Warehouse that refreshes the service. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

> **Attention:**
>
> To create a Cortex Search Service, your role must have the required privileges to use the Cortex embedding functions.
> This requires granting the [SNOWFLAKE.CORTEX_USER](../snowflake-db-roles.md) database role
> or the [SNOWFLAKE.CORTEX_EMBED_USER](../snowflake-db-roles.md) database
> role to the service creator role.

## Usage Notes

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The size of the Warehouse used to run the Cortex Search service source query does impact the speed and cost of each refresh. A
  larger warehouse decreases build and refresh time. However, during this preview, Snowflake recommends using a warehouse size no larger
  than MEDIUM for Cortex Search services.
* Snowflake recommends using a dedicated warehouse for each Cortex Search service so as to not interfere with other workloads.
* The search index is built as part of the create statement, which means the CREATE CORTEX SEARCH SERVICE statement may take longer to
  complete for larger datasets.
* When creating a multi-index search service, at least one column must be specified in the VECTOR INDEXES clause in order to ensure the highest quality of search results. Attempting to create a service with no vector indexes returns an error.
* A column can be specified in the TEXT INDEXES clause, the VECTOR INDEXES clause, or both:

  + Columns specified as text indexes can be used for keyword (lexical) search. When querying a text index,
    results are scored based on the degree of lexical similarity.
  + Columns specified as vector indexes can be used for vector (semantic) search. When querying a vector index,
    results are scored based on the degree of semantic similarity.
* Columns specified as both text and vector indexes are used for both types of search.
* Each vector index column employs one of three methods for managing embeddings:

  + **Managed vector embeddings**: Snowflake calculates the vector embeddings when a text column is specified
    either in the ON or VECTOR INDEXES clauses. Must use one of the [supported embedding models](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).
  + **User-provided vector embeddings**: You are responsible for computing the vector embeddings
    with a [Snowflake-provided vector embedding model](../../user-guide/snowflake-cortex/vector-embeddings.md) or an externally-hosted embedding model
    prior to ingestion by the Cortex Search Service, as well for text inputs at query time.
  + **User-provided vector embeddings with managed query embeddings**: You are responsible for computing the vector embeddings
    with one of the [Snowflake-managed embedding models supported in Cortex Search](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md)
    prior to ingestion by the Cortex Search Service. At query time, Cortex Search will embed text queries using the specified
    `query_model`.

## Change Tracking Requirements

When creating a Cortex Search Service, if change tracking is not already enabled on the tables that it queries, Snowflake
automatically attempts to enable change tracking on them. In order to support incremental refreshes, change tracking must be enabled with
[non-zero time travel retention](../parameters.md) on all underlying objects used by a Cortex Search Service.

As base objects change, so does the Cortex Search Service. If you recreate a base object, you must re-enable change tracking.

For more information about enabling change tracking, see [Enable change tracking](../../user-guide/dynamic-tables-create.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a Cortex Search service named `mysvc`
using the `snowflake-arctic-embed-l-v2.0` embedding model:

```sqlsyntax
CREATE OR REPLACE CORTEX SEARCH SERVICE mysvc
  ON transcript_text
  ATTRIBUTES region,agent_id
  WAREHOUSE = mywh
  TARGET_LAG = '1 hour'
  EMBEDDING_MODEL = 'snowflake-arctic-embed-l-v2.0'
AS (
  SELECT
      transcript_text,
      date,
      region,
      agent_id
  FROM support_db.public.transcripts_etl
);
```

Create a Cortex Search service named `mysvc`, with the first refresh
scheduled to run after one `TARGET_LAG` period (1 hour) has passed.

```sqlsyntax
CREATE OR REPLACE CORTEX SEARCH SERVICE mysvc
  ON transcript_text
  ATTRIBUTES region
  WAREHOUSE = mywh
  TARGET_LAG = '1 hour'
  INITIALIZE = ON_SCHEDULE
AS SELECT * FROM support_db.public.transcripts_etl;
```

Create a multi-index search service named `business_search_service` that searches the table `business_directory`, where:

* `name` and `address` are specified as text indexes, so they are searchable with keyword search only.
* `description` is specified as a vector index, so it is eligible for vector (semantic) search using
  managed vector embeddings and the `snowflake-arctic-embed-m-v1.5` model.

```sqlexample
-- Generate sample data
CREATE OR REPLACE TABLE business_directory (name TEXT, address TEXT, description TEXT);
INSERT INTO business_directory VALUES
    ('Joe''s Coffee', '123 Bean St, Brewtown','A cozy café known for artisan espresso and baked goods.'),
    ('Sparkle Wash', '456 Clean Ave, Sudsville', 'Eco-friendly car wash with free vacuum service.'),
    ('Tech Haven', '789 Circuit Blvd, Siliconia', 'Computer store offering the latest gadgets and tech repair services.'),
    ('Joe''s Wash n'' Fold', '456 Apple Ct, Sudsville', 'Laundromat offering coin laundry and premium wash and fold services.'),
    ('Circuit Town', '459 Electron Dr, Sudsville', 'Technology store selling used computer parts at discounted prices.')
;

-- Create the Cortex Search Service
CREATE OR REPLACE CORTEX SEARCH SERVICE business_search_service
    TEXT INDEXES name, address
    VECTOR INDEXES description (model='snowflake-arctic-embed-m-v1.5')
    WAREHOUSE = mywh
    TARGET_LAG = '1 hour'
    AS ( SELECT * FROM business_directory );
```

Create a multi-index Cortex Search service with custom vector embeddings called `custom_vector_search_service`. This service searches a table with a text column (`document_contents`) and a separate user-provided vector embedding column (`document_embedding`) that contains embeddings corresponding to the text column.

> **Note:**
>
> This example uses mock embeddings for simplicity. In a production use-case, vectors should be generated through a [Snowflake vector embedding model](../../user-guide/snowflake-cortex/vector-embeddings.md) or an externally-hosted embedding model.

```sqlexample
-- Generate sample data
CREATE OR REPLACE TABLE business_documents (
  document_contents VARCHAR,
  document_embedding VECTOR(FLOAT, 3)
);
INSERT INTO business_documents VALUES
  ('Quarterly financial report for Q1 2024: Revenue increased by 15%, with expenses stable. Highlights include strategic investments in marketing and technology.', [1, 1, 1]::VECTOR(float, 3)),
  ('IT manual for employees: Instructions for usage of internal technologies, including hardware and software guides and commonly asked tech questions.', [2, 2, 2]::VECTOR(float, 3)),
  ('Employee handbook 2024: Updated policies on remote work, health benefits, and company culture initiatives.', [2, 3, 2]::VECTOR(float, 3)),
  ('Marketing strategy document: Target audience segmentation for upcoming product launch.', [1, -1, -1]::VECTOR(float, 3))
;

-- Create the Cortex Search Service
CREATE OR REPLACE CORTEX SEARCH SERVICE custom_vector_search_service
  TEXT INDEXES (document_contents)
  VECTOR INDEXES (document_embedding)
  WAREHOUSE = mywh
  TARGET_LAG = '1 minute'
  AS SELECT * FROM business_documents;
```

Create a service `managed_vector_search_service` with user-managed vector embeddings and managed query embeddings:

```sqlexample
-- Generate sample data
CREATE OR REPLACE TABLE business_documents (
  document_contents VARCHAR
);

INSERT INTO business_documents VALUES
  ('Quarterly financial report for Q1 2024: Revenue increased by 15%, with expenses stable. Highlights include strategic investments in marketing and technology.'),
  ('IT manual for employees: Instructions for usage of internal technologies, including hardware and software guides and commonly asked tech questions.'),
  ('Employee handbook 2024: Updated policies on remote work, health benefits, and company culture initiatives.'),
  ('Marketing strategy document: Target audience segmentation for upcoming product launch.');

-- Add managed vector embeddings
ALTER TABLE business_documents ADD COLUMN document_embeddings VECTOR(FLOAT, 768);
UPDATE business_documents SET document_embeddings = AI_EMBED('snowflake-arctic-embed-m-v1.5', document_contents);

-- Create the Cortex Search Service
CREATE OR REPLACE CORTEX SEARCH SERVICE managed_vector_search_service
  TEXT INDEXES document_contents
  VECTOR INDEXES document_embedding(query_model='snowflake-arctic-embed-m-v1.5')
  WAREHOUSE = mywh
  TARGET_LAG = '1 minute'
  AS SELECT * FROM business_documents;
```

---
title: CREATE DATA METRIC FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/create-data-metric-function.md
section: SQL Commands
---

# CREATE DATA METRIC FUNCTION

Creates a new data metric function (DMF) in the current or specified schema, or replaces an existing data metric function.

After creating a DMF, apply it to a table column using an
[ALTER TABLE … ALTER COLUMN](alter-table-column.md) command or a view column using the [ALTER VIEW](alter-view.md) command.

This command supports the following variants:

* CREATE OR ALTER DATA METRIC FUNCTION: Creates a new data metric function if it doesn’t exist or alters an existing data metric function.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] DATA METRIC FUNCTION [ IF NOT EXISTS ] <name>
  ( <table_arg> TABLE( <col_arg> <data_type> [ , ... ] )
    [ , <table_arg> TABLE( <col_arg> <data_type> [ , ... ] ) ] )
  RETURNS NUMBER [ [ NOT ] NULL ]
  [ LANGUAGE SQL ]
  [ COMMENT = '<string_literal>' ]
  AS
  '<expression>'
```

## Variant syntax

### CREATE OR ALTER DATA METRIC FUNCTION

Creates a new data metric function if it doesn’t already exist, or transforms an existing data metric function into
the function defined in the statement. A CREATE OR ALTER DATA METRIC FUNCTION statement follows the syntax rules of
a CREATE DATA METRIC FUNCTION statement and has the same limitations as an [ALTER FUNCTION (DMF)](alter-function-dmf.md)
statement.

Unlike a CREATE OR REPLACE DATA METRIC FUNCTION command, a CREATE OR ALTER command updates the object without
deleting and recreating it.

Supported function alterations include changes to the COMMENT property.

For more information, see CREATE OR ALTER DATA METRIC FUNCTION usage notes.

```sqlsyntax
CREATE [ OR ALTER ] DATA METRIC FUNCTION ...
```

## Required parameters

`name`
:   Identifier for the DMF; must be unique for your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`( table_arg TABLE( col_arg data_type [ , ... ] ) [ , table_arg TABLE( col_arg data_type [ , ... ] ) ] )`
:   The signature for the DMF, which is used as input for the expression.

    You must specify:

    * An argument name for each table (`table_arg`).
    * For each table, an argument name for at least one column, along with its data type (`col_arg data_type`).

      You can optionally specify arguments for additional columns and their data types. The columns must be in the same table and cannot
      reference a different table.

`RETURNS NUMBER`
:   The data type of the output of the function.

    The data type can only be NUMBER.

`AS expression`
:   SQL expression that determines the output of the function. The expression must be deterministic and return a scalar value. The expression
    can reference other table objects, such as by using a [WITH](../constructs/with.md) clause or a
    [WHERE](../constructs/where.md) clause.

    The delimiters around the `expression` can be either single quotes or a pair of dollar signs. Using `$$` as the delimiter makes
    it easier to write expressions that contain single quotes.

    If the delimiter for the `expression` is the single quote character, then any single quotes within `expression`
    (for example, string literals) must be escaped by single quotes.

    The `expression` does not support the following:

    * Using nondeterministic functions (for example, [CURRENT_TIME](../functions/current_time.md)).
    * Referencing an object that depends on a UDF or UDTF.
    * Returning a nonscalar output.

## Optional parameters

`SECURE`
:   Specifies that the data metric function is secure. For more information, see [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

`LANGUAGE SQL`
:   Specifies the language used to write the expression.

    SQL is the only supported language.

`COMMENT = 'string_literal'`
:   A comment for the DMF.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DATA METRIC FUNCTION | Schema | The privilege only enables the creation of data metric functions in the schema.  If you want to enable the creation of user-defined functions, such as SQL or Java UDFs, the role must have the CREATE FUNCTION privilege. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* If you want to update an existing data metric function and need to see the current definition of the function, run the
  [DESCRIBE FUNCTION (DMF)](desc-function-dmf.md) command or call the [GET_DDL](../functions/get_ddl.md) function.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER DATA METRIC FUNCTION usage notes

You can’t modify the DMF’s arguments. Specifying new arguments creates a new overloaded DMF.

## Example: Single table argument

Create a DMF that calls the [COUNT](../functions/count.md) function to return the total number of rows that
have positive numbers in three columns of the table:

```sqlexample
CREATE OR REPLACE DATA METRIC FUNCTION governance.dmfs.count_positive_numbers(
  arg_t TABLE(
    arg_c1 NUMBER,
    arg_c2 NUMBER,
    arg_c3 NUMBER
  )
)
RETURNS NUMBER
AS
$$
  SELECT
    COUNT(*)
  FROM arg_t
  WHERE
    arg_c1>0
    AND arg_c2>0
    AND arg_c3>0
$$;
```

## Example: Multiple table arguments

Returns the number of records where the value of a column in one table does not have a corresponding value in the column of another table:

```sqlexample
CREATE OR REPLACE DATA METRIC FUNCTION governance.dmfs.referential_check(
  arg_t1 TABLE (arg_c1 INT), arg_t2 TABLE (arg_c2 INT))
RETURNS NUMBER
AS
$$
  SELECT
    COUNT(*)
    FROM arg_t1
  WHERE
    arg_c1 NOT IN (SELECT arg_c2 FROM arg_t2)
$$;
```

For an example that uses this DMF to validate referential integrity, see [Example: Using multiple table arguments to perform referential checks](../../user-guide/data-quality-custom-dmfs.md).

## Example: Alter a data metric function using the CREATE OR ALTER DATA METRIC FUNCTION command

Alters the single-table data metric function created in the example above to set security and comment.

```sqlexample
CREATE OR ALTER SECURE DATA METRIC FUNCTION governance.dmfs.count_positive_numbers(
  arg_t TABLE(
    arg_c1 NUMBER,
    arg_c2 NUMBER,
    arg_c3 NUMBER
  )
)
RETURNS NUMBER
COMMENT = "count positive numbers"
AS
$$
  SELECT
    COUNT(*)
  FROM arg_t
  WHERE
    arg_c1>0
    AND arg_c2>0
    AND arg_c3>0
$$;
```

---
title: CREATE DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/create-database.md
section: SQL Commands
---

# CREATE DATABASE

Creates a new database in the system.

This command supports the following variants:

* CREATE OR ALTER DATABASE: Creates a database if it doesn’t exist or alters an existing database.
* CREATE DATABASE … CLONE: Creates a clone of an existing database, either at its current state or at a specific time/point in the past
  (using Time Travel). For more information about cloning a database, see [Cloning considerations](../../user-guide/object-clone.md).
* CREATE DATABASE … FROM BACKUP SET (restores a database from a backup under a new name)

In addition, this command can be used to:

* Create a database from a specified listing. See [About sharing with listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about).
* Create a database from a share provided by another Snowflake account. For more information about shares, see
  [About Secure Data Sharing](../../user-guide/data-sharing-intro.md).
* Create a replica of an existing primary database (for example, a secondary database). For more information about database replication, see
  [Introduction to database replication across multiple accounts](../../user-guide/db-replication-intro.md).

See also:
:   [ALTER DATABASE](alter-database.md) , [DESCRIBE DATABASE](desc-database.md) , [DROP DATABASE](drop-database.md) , [SHOW DATABASES](show-databases.md) , [UNDROP DATABASE](undrop-database.md)

    [DESCRIBE SHARE](desc-share.md) , [SHOW SHARES](show-shares.md), [CREATE LISTING](create-listing.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

**Standard Database**

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] DATABASE [ IF NOT EXISTS ] <name>
    [ CLONE <source_schema>
        [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
        [ IGNORE TABLES WITH INSUFFICIENT DATA RETENTION ]
        [ IGNORE HYBRID TABLES ] ]
    [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
    [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
    [ EXTERNAL_VOLUME = <external_volume_name> ]
    [ CATALOG = <catalog_integration_name> ]
    [ ICEBERG_VERSION_DEFAULT = <integer> ]
    [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
    [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
    [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
    [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
    [ COMMENT = '<string_literal>' ]
    [ CATALOG_SYNC = '<snowflake_open_catalog_integration_name>' ]
    [ CATALOG_SYNC_NAMESPACE_MODE = { NEST | FLATTEN } ]
    [ CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER = '<string_literal>' ]
    [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
    [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
    [ OBJECT_VISIBILITY = { <object_visibility_spec> | PRIVILEGED } ]
    [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
```

**Restored database (from a backup)**

```sqlsyntax
CREATE DATABASE <name> FROM BACKUP SET <backup_set> IDENTIFIER '<backup_id>'
```

**Standard Database (from a listing)**

```sqlsyntax
CREATE DATABASE <name> FROM LISTING '<listing_global_name>'
```

**Shared Database (from a Share)**

```sqlsyntax
CREATE DATABASE <name> FROM SHARE <provider_account>.<share_name>
```

**Secondary Database (Database Replication)**

```sqlsyntax
CREATE DATABASE <name>
    AS REPLICA OF <account_identifier>.<primary_db_name>
    [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
```

## Variant syntax

### CREATE OR ALTER DATABASE

Creates a new database if it doesn’t already exist, or transforms an existing database into the database defined in the statement.
A CREATE OR ALTER DATABASE statement follows the syntax rules of a CREATE DATABASE statement and has the same limitations as an
[ALTER DATABASE](alter-database.md) statement.

The following modifications are supported:

* Changing the following database properties and parameters:

  + [DATA_RETENTION_TIME_IN_DAYS](../parameters.md)
  + [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md)
  + [EXTERNAL_VOLUME](../parameters.md)
  + [CATALOG](../parameters.md)
  + [ICEBERG_VERSION_DEFAULT](../parameters.md)
  + [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md)
  + [REPLACE_INVALID_CHARACTERS](../parameters.md)
  + [DEFAULT_DDL_COLLATION](../parameters.md)
  + [STORAGE_SERIALIZATION_POLICY](../parameters.md)
  + [COMMENT](comment.md)

For more information, see CREATE OR ALTER DATABASE usage notes.

```sqlsyntax
CREATE OR ALTER [ TRANSIENT ] DATABASE <name>
    [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
    [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
    [ EXTERNAL_VOLUME = <external_volume_name> ]
    [ CATALOG = <catalog_integration_name> ]
    [ ICEBERG_VERSION_DEFAULT = <integer> ]
    [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
    [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
    [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
    [ LOG_LEVEL = '<log_level>' ]
    [ METRIC_LEVEL = '<metric_level>' ]
    [ TRACE_LEVEL = '<trace_level>' ]
    [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
    [ COMMENT = '<string_literal>' ]
    [ OBJECT_VISIBILITY = { <object_visibility_spec> | PRIVILEGED } ]
```

### CREATE DATABASE … CLONE

Creates a new database with the same parameter values:

> ```sqlsyntax
> CREATE [ OR REPLACE ] DATABASE [ IF NOT EXISTS ] <name> CLONE <source_database>
>   [ ... ]
> ```

For more details, see [CREATE <object> … CLONE](create-clone.md).

## Required parameters

`name`
:   Specifies the identifier for the database; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    > **Important:**
    >
    > As a best practice for [Database Replication and Failover](../../user-guide/db-replication-intro.md), we recommend giving each
    > secondary database the same name as its primary database. This practice supports referencing fully-qualified objects
    > (i.e. `'<db>.<schema>.<object>'`) by other objects in the same database, such as querying a fully-qualified table name in a view.
    >
    > If a secondary database has a different name from the primary database, then these object references would break in the secondary database.

### Secure Data Sharing parameters

`provider_account.share_name`
:   Specifies the identifier of the [share](../../user-guide/data-sharing-intro.md) from which to create the database. As documented, the name of the
    share must be fully-qualified with the name of the account providing the share.

### Database replication parameters

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

`AS REPLICA OF account_identifier.primary_db_name`
:   Specifies the identifier for a primary database from which to create a replica (i.e. a secondary database). If the identifier contains spaces,
    special characters, or mixed-case characters, the entire string must be enclosed in double quotes.

    Requires the account identifier and name of the primary database.

    `account_identifier`
    :   Unique identifier of the account that stores the primary database. The preferred identifier is `organization_name.account_name`.
        To view the list of accounts enabled for replication in your organization, query SHOW REPLICATION ACCOUNTS.

        Though the legacy account locator can also be used as the account identifier, its use is discouraged as it may not work in the future.
        For more details about using the account locator as an account identifier, see Database Replication Usage Notes.

    `primary_db_name`
    :   Name of the primary database. As a best practice, we recommend giving each secondary database the same name as its primary database.

    > **Note:**
    >
    > As a best practice for Database Replication and Failover, we recommend setting the optional parameter
    > DATA_RETENTION_TIME_IN_DAYS to the same value on the secondary database as on the
    > primary database.

### Backup parameters

The FROM BACKUP SET clause restores a database from a backup. You don’t specify other database
properties because they’re all the same as in the backed-up database.

> **Note:**
>
> The FROM SNAPSHOT SET clause is deprecated. Use FROM BACKUP SET instead.

This form doesn’t have a CREATE OR REPLACE clause. You typically either restore the
database under a new name and recover any data or other objects from this new database,
or rename the original database and then restore the database under the original name.

> **Note:**
>
> The restored database is independent of the original database from the backup.
> There isn’t any cloning relationship between the restored and original databases.
> Therefore, all the micro-partitions for tables in the restored database are owned
> by that database.
>
> If you want to make backups of the newly restored database, create a new backup set for it.

For more information about backups, see [Backups for disaster recovery and immutable storage](../../user-guide/backups.md).

`backup_set`
:   Specifies the name of a backup set created for a specific database.
    You can use the SHOW BACKUP SETS command to locate the right backup set.

`backup_id`
:   Specifies the identifier of a specific backup within that backup set.
    You can use the SHOW BACKUPS IN BACKUP SET command to locate the right identifier within the backup
    set, based on the creation date and time for the backup.

### Listing parameters

`'listing_global_name'`
:   Specifies the global name of the listing from which to create the database, which must meet the following requirements:

    * Can’t be a paid listing.
    * Listing terms, if not of type `OFFLINE`, must have been accepted using Snowsight.
    * Listing data products must be available locally in the current region.

      Whether a listing is available in the local region can be determined by viewing the `is_ready_for_import` column
      of [DESCRIBE AVAILABLE LISTING](desc-available-listing.md).

You must have the IMPORT LISTING privilege to create a database from a listing.
You must have the IMPORT SHARE privilege to create a database from a share.

## Optional parameters

`TRANSIENT`
:   Specifies a database as transient. Transient databases do not have a Fail-safe period so they do not incur additional storage costs once
    they leave Time Travel; however, this means they are also not protected by Fail-safe in the event of a data loss. For more information, see
    [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md).

    In addition, by definition, all schemas (and consequently all tables) created in a transient database are transient. For more information about
    transient tables, see [CREATE TABLE](create-table.md).

    Default: No value (i.e. database is permanent)

`CLONE source_db`
:   Specifies to create a clone of the specified source database. For more details about cloning a database, see [CREATE <object> … CLONE](create-clone.md).

`AT | BEFORE ( TIMESTAMP => timestamp | OFFSET => time_difference | STATEMENT => id )`
:   When cloning a database, the [AT | BEFORE](../constructs/at-before.md) clause specifies to use Time Travel to clone the database at or
    before a specific point in the past. If the specified Time Travel time is at or before the point in time when the database was created,
    the cloning operation fails with an error.

`IGNORE TABLES WITH INSUFFICIENT DATA RETENTION`
:   Ignore tables that no longer have historical data available in Time Travel to clone. If the time in the past specified in the
    AT | BEFORE clause is beyond the data retention period for any child table in a database or schema, skip the cloning operation
    for the child table. For more information, see
    [Child Objects and Data Retention Time](../../user-guide/object-clone.md).

`IGNORE HYBRID TABLES`
:   Ignore hybrid tables, which will not be cloned. Use this option to clone a database that contains hybrid tables.
    The cloned database includes other objects but skips hybrid tables.

    If you don’t use this option and your database contains one or more hybrid tables, the command ignores hybrid tables silently. However, the error handling for databases that contain hybrid tables will change in an upcoming release; therefore, you may want to add this parameter to your commands preemptively.

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the database, as well as specifying the
    default Time Travel retention time for all schemas created in the database. For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent databases
    >   + `0` or `1` for transient databases

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the database.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in the
    database to prevent streams on the tables from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`EXTERNAL_VOLUME = external_volume_name`
:   Object parameter that specifies the default external volume to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    For more information about this parameter, see [EXTERNAL_VOLUME](../parameters.md).

`CATALOG = catalog_integration_name`
:   Object parameter that specifies the default catalog integration to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    For more information about this parameter, see [CATALOG](../parameters.md).

`ICEBERG_VERSION_DEFAULT = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ table specification that Iceberg tables conform to.

    Values:
    :   `2`: New tables conform with Iceberg version 2.

        `3`: New tables conform with Iceberg version 3.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    Default:
    :   `2`

    For more information about this parameter, see [ICEBERG_VERSION_DEFAULT](../parameters.md).

`ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

    Values:
    :   `TRUE`: New tables use merge-on-read behavior.

        `FALSE`: New tables use copy-on-write behavior.

    Default:
    :   `TRUE`

    For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
    and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results for an
    [Iceberg table](create-iceberg-table.md).
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    Default: `FALSE`

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for all schemas and tables added to the database. The
    default can be overridden at the schema and individual table level.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`LOG_LEVEL = 'log_level'`
:   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
    the specified level (and at more severe levels) are ingested.

    For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`METRIC_LEVEL = 'metric_level'`
:   Specifies whether metrics data should be ingested and made available in the active event table.

    For more information, see [METRIC_LEVEL](../parameters.md) and [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`TRACE_LEVEL = 'trace_level'`
:   Controls how trace events are ingested into the event table.

    For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting trace level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
:   Specifies the storage serialization policy for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) that use Snowflake as the catalog.

    * `COMPATIBLE`: Snowflake performs encoding and compression of data files that ensures interoperability with third-party compute engines.
    * `OPTIMIZED`: Snowflake performs encoding and compression of data files that ensures the best table performance within Snowflake.

    Default: `OPTIMIZED`

`COMMENT = 'string_literal'`
:   Specifies a comment for the database.

    Default: No value

`CATALOG_SYNC = 'snowflake_open_catalog_integration_name'`
:   Specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).
    If specified, Snowflake syncs Snowflake-managed Apache Iceberg™ tables in the database with an external catalog in your Snowflake Open Catalog
    account. For more information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

    For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

    Default: No value

`CATALOG_SYNC_NAMESPACE_MODE = { NEST | FLATTEN }`
:   Specifies the catalog sync namespace mode for Snowflake-managed Iceberg tables in the database that you sync with
    Snowflake Open Catalog. This property specifies whether Snowflake syncs the table to Open Catalog with one or two parent namespaces. It
    only applies if you’re setting the `CATALOG_SYNC` parameter. After you create the database, you can’t alter this property.

    * `NEST`: Snowflake syncs two parent namespaces with the table.

      For example, suppose you have a `db2.public.table1` Iceberg table registered in Snowflake. You want to sync this table, along with its
      two parent namespaces, to the `catalog2` external catalog in Open Catalog. To sync the table with its two parent namespaces, use the
      default for `CATALOG_SYNC_NAMESPACE_MODE` (`NEST`). If you don’t specify the `CATALOG_SYNC_NAMESPACE_MODE` property, the default for
      this property is applied, which is `NEST`. Because you’re using the default for `CATALOG_SYNC_NAMESPACE_MODE`, you don’t need to specify
      `CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER`. As a result, Snowflake syncs the table to Open Catalog with the following fully qualified
      name: `catalog2.db2.public.table1`.
    * `FLATTEN`: Snowflake syncs one parent namespace with the table, which contains the delimiter you set by using the
      `CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER` property.

      > **Important:**
      >
      > If your third-party query engine can only query tables located up to the second namespace level in a catalog, you must set the
      > `CATALOG_SYNC_NAMESPACE_MODE` property to `FLATTEN`. Otherwise, Snowflake will sync Snowflake-managed Iceberg tables to the
      > third namespace level in Open Catalog and you can’t query the table.

      For example, suppose that you have a `db1.public.table1` Iceberg table registered in Snowflake. You want to sync this table and one parent
      namespace named `db1-public` with the `catalog1` external catalog in Open Catalog, so that the table is located at the second namespace level in Open Catalog.

      To sync the table with the `db1-public` parent namespace, set `CATALOG_SYNC_NAMESPACE_MODE` to `FLATTEN` and specify a hyphen (`-`) as the value
      for `CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER`. As a result, Snowflake syncs this table to Open Catalog with the following
      fully-qualified name: `catalog1.db1-public.table1`.

    Default: `NEST`

`CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER = 'string_literal'`
:   Specifies a delimiter, which Snowflake inserts in the flattened namespace that results when Snowflake syncs a Snowflake-managed Iceberg
    table to Snowflake Open Catalog with one parent namespace. This delimiter property only applies when you set the CATALOG_SYNC_NAMESPACE_MODE
    property to `FLATTEN`. Snowflake inserts this delimiter to avoid conflicts that could
    arise from flattening parent namespaces for different tables. After you create the database, you can’t alter this property.

    For example, suppose you want to sync the `customer.data.table1` and `custom.erdata.table1` Snowflake-managed Iceberg tables to the `catalog1`
    external catalog in Open Catalog. By setting the CATALOG_SYNC_NAMESPACE_MODE property set to `FLATTEN` and specifying a hyphen (`-`) for the
    delimiter, Snowflake syncs these tables with Open Catalog with the following fully qualified names:

    > * `catalog1.customer-data.table1`
    > * `catalog1.custom-erdata.table1`

    If you set the `CATALOG_SYNC_NAMESPACE_MODE` property to `FLATTEN`, a non-empty delimiter value is required. However, if you set the
    `CATALOG_SYNC_NAMESPACE_MODE` property to `NEST`, this delimiter property doesn’t apply and the configured value will be ignored.

    Valid characters: `0-9`, `A-Z`, `a-z`, `_`, `$`, `-`

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

`OBJECT_VISIBILITY = { object_visibility_spec | PRIVILEGED }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    * A YAML specification describing the visibility in one of the following formats:

      ```sqlexample-yaml
      $$
      organization_targets:
        - all_accounts_including_external
      $$
      ```

      Or

      ```sqlexample-yaml
      $$
      organization_targets:
        - account: <account_name_1>
        - account: <account_name_2>
        - ...
        - organization_user_group: <org_user_group_1>
        - organization_user_group: <org_user_group_2>
      $$
      ```

      In the syntax above:

      + `all_accounts_including_external`: Specifies that all users in all accounts in the organization can see the object. This includes
        all accounts within the organization, even those to which external parties may have been given access, such as
        [reader accounts](../../user-guide/data-sharing-reader-create.md).
      + `account: account_name`: Specifies that all users in the specified account can see the object. You can specify multiple accounts.
        Note that `account` is the account name, not the account locator. You must specify only the account name, excluding the organization name.09-22
      + `organization_user_group: org_user_group`: Specifies that the specified [organization user group](../../user-guide/organization-users.md) can
        see the object in all accounts in the organization where the [organization user group has been imported](../../user-guide/organization-users.md).
    * `PRIVILEGED`: Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object.
      This is the default behavior in Snowflake.

    For examples, see [Make database objects discoverable in Universal Search](../../user-guide/ui-snowsight/object-visibility-universal-search.md).

`ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
:   Specifies whether Snowflake should enable data compaction on Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    * `TRUE`: Snowflake performs data compaction on the tables.
    * `FALSE`: Snowflake doesn’t perform data compaction on the tables.

    Default: `TRUE`

    For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DATABASE | Account | Required to create a new database.  Only the SYSADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| USAGE | External volume, catalog integration | Required if setting the `EXTERNAL_VOLUME` or `CATALOG` object parameters, respectively. |
| IMPORT LISTING | Account | Required to create a database from a listing. |
| IMPORT SHARE | Account | Required to create a database from a share. |
| MANAGE VISIBILITY | Account | Required to set the OBJECT_VISIBILITY property. Only the SECURITYADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| MODIFY LOG LEVEL | Account | Required to set the LOG_LEVEL for a database. |
| MODIFY TRACE LEVEL | Account | Required to set the TRACE_LEVEL for a database. |
| OWNERSHIP | Database | Required when executing an [ALTER DATABASE](alter-database.md) or [ALTER SCHEMA](alter-schema.md) statement to set object visibility, or when executing a CREATE OR ALTER DATABASE statement for an existing database.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* Creating a database automatically sets it as the active/current database for the current session (equivalent to using the [USE DATABASE](use-database.md)
  command for the database).
* If a database with the same name already exists, an error is returned and the database is not created, unless the optional `OR REPLACE`
  keyword is specified in the command.

  > **Important:**
  >
  > Using `OR REPLACE` is the equivalent of using [DROP DATABASE](drop-database.md) on the existing database and then creating a new database with
  > the same name; however, the dropped database is not permanently removed from the system. Instead, it is retained in Time Travel.
  > This is important because dropped databases in Time Travel contribute to data storage for your account. For more information, see
  > [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Creating a new database automatically creates two schemas in the database:

  + PUBLIC: Default schema for the database.
  + INFORMATION_SCHEMA: Schema which contains views and table functions that can be used for querying metadata about the objects in the
    database, as well as across all objects in the account.
* Databases created from shares differ from standard databases in the following ways:

  + They do not have the PUBLIC or INFORMATION_SCHEMA schemas unless these schemas were explicitly granted to the share.
  + They cannot be cloned.
  + Properties, such as `TRANSIENT` and `DATA_RETENTION_TIME_IN_DAYS`, do not apply.
* When a database is active/current, the PUBLIC schema is also active/current by default unless a different schema is used or the PUBLIC
  schema has been dropped.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER DATABASE usage notes

* All limitations of the [ALTER DATABASE](alter-database.md) command apply.
* This command supports the properties and syntax that overlap between the CREATE DATABASE and ALTER DATABASE commands. For this
  reason, the following are *not* supported:

  + Swapping databases using the SWAP WITH parameter.
  + Renaming a database using the RENAME TO parameter.
  + Creating a clone of a database using the CLONE parameter.
  + Adding or changing tags and policies. Any existing tags and policies are preserved.
  + Converting a TRANSIENT database into a non-TRANSIENT database, or vice versa.
  + Creating a database from a share using CREATE OR ALTER DATABASE … FROM SHARE.
  + Creating a secondary (replica) database using CREATE OR ALTER DATABASE … AS REPLICA OF.

## Database replication usage notes

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

* Database replication uses Snowflake-provided compute resources instead of your own virtual warehouse to copy objects and data. However, the
  [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md) session/object parameter still controls how long a statement runs before it is canceled. The
  default value is `172800` (2 days). Because the initial replication of a primary database can take longer than 2 days to complete
  (depending on the amount of metadata in the database as well as the amount of data in database objects), we recommend increasing the
  STATEMENT_TIMEOUT_IN_SECONDS value to `604800` (7 days, the maximum value) for the session in which you run the replication operation.

  Run the following [ALTER SESSION](alter-session.md) statement prior to executing the `ALTER DATABASE secondary_db_name REFRESH`
  statement in the same session:

  ```sqlexample
  ALTER SESSION SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
  ```

  Note that the STATEMENT_TIMEOUT_IN_SECONDS parameter also applies to the active warehouse in a session. The parameter honors the *lower*
  value set at the session or warehouse level. If you have an active warehouse in the current session, also set STATEMENT_TIMEOUT_IN_SECONDS
  to `604800` for this warehouse (using [ALTER WAREHOUSE](alter-warehouse.md)).

  For example:

  ```sqlexample
  -- determine the active warehouse in the current session (if any)
  SELECT CURRENT_WAREHOUSE();

  +---------------------+
  | CURRENT_WAREHOUSE() |
  |---------------------|
  | MY_WH               |
  +---------------------+

  -- change the STATEMENT_TIMEOUT_IN_SECONDS value for the active warehouse

  ALTER WAREHOUSE my_wh SET STATEMENT_TIMEOUT_IN_SECONDS = 604800;
  ```

  You can reset the parameter value to the default after the replication operation is completed:

  ```sqlexample
  ALTER WAREHOUSE my_wh UNSET STATEMENT_TIMEOUT_IN_SECONDS;
  ```
* The preferred method of identifying the account that stores the primary database uses the organization name and account name as the
  account identifier. If you decide to use the legacy account locator instead, see [Account identifiers for replication and failover](../../user-guide/admin-account-identifier.md).
* The CREATE DATABASE … AS REPLICA command does not support the WITH TAG clause.

  This clause is not supported because the secondary database is read only. If your primary database specifies the WITH TAG clause, remove
  the clause prior to creating the secondary database. To verify whether your database has the WITH TAG clause, call the
  [GET_DDL](../functions/get_ddl.md) function in your Snowflake account and specify the primary database in the function argument. If
  a tag is set on the database, the function output will include an ALTER DATABASE … SET TAG statement.

  For more information, see [Replication and tags](../../user-guide/account-replication-considerations.md).

## Examples

Create two permanent databases, one with a data retention period of 10 days:

```sqlexample
CREATE DATABASE mytestdb;

CREATE DATABASE mytestdb2 DATA_RETENTION_TIME_IN_DAYS = 10;

SHOW DATABASES LIKE 'my%';

+---------------------------------+------------+------------+------------+--------+----------+---------+---------+----------------+
| created_on                      | name       | is_default | is_current | origin | owner    | comment | options | retention_time |
|---------------------------------+------------+------------+------------+--------+----------+---------+---------+----------------|
| Tue, 17 Mar 2016 16:57:04 -0700 | MYTESTDB   | N          | N          |        | PUBLIC   |         |         | 1              |
| Tue, 17 Mar 2016 17:06:32 -0700 | MYTESTDB2  | N          | N          |        | PUBLIC   |         |         | 10             |
+---------------------------------+------------+------------+------------+--------+----------+---------+---------+----------------+
```

Create a transient database:

```sqlexample
CREATE TRANSIENT DATABASE mytransientdb;

SHOW DATABASES LIKE 'my%';

+---------------------------------+---------------+------------+------------+--------+----------+---------+-----------+----------------+
| created_on                      | name          | is_default | is_current | origin | owner    | comment | options   | retention_time |
|---------------------------------+---------------+------------+------------+--------+----------+---------+-----------+----------------|
| Tue, 17 Mar 2016 16:57:04 -0700 | MYTESTDB      | N          | N          |        | PUBLIC   |         |           | 1              |
| Tue, 17 Mar 2016 17:06:32 -0700 | MYTESTDB2     | N          | N          |        | PUBLIC   |         |           | 10             |
| Tue, 17 Mar 2015 17:07:51 -0700 | MYTRANSIENTDB | N          | N          |        | PUBLIC   |         | TRANSIENT | 1              |
+---------------------------------+---------------+------------+------------+--------+----------+---------+-----------+----------------+
```

Create a database from a share provided by account `ab67890`:

```sqlexample
CREATE DATABASE snow_sales FROM SHARE ab67890.sales_s;
```

For more detailed examples of creating a database from a share, see [Consume imported data](../../user-guide/data-share-consumers.md).

## Database replication examples

> **Important:**
>
> This section describes a limited database replication feature that is different from the
> [account replication feature](../../user-guide/account-replication-intro.md). Snowflake strongly
> recommends using the account replication feature to replicate and failover databases.

For an example of creating a replication group to replicate a single database to a target account, see
[Replicate a single database](create-replication-group.md).

## CREATE OR ALTER DATABASE examples

### Create a simple database

Create a database named `db1`:

```sqlexample
CREATE OR ALTER DATABASE db1;
```

Alter database `db1` to set the DATA_RETENTION_TIME_IN_DAYS and DEFAULT_DDL_COLLATION parameters:

```sqlexample
CREATE OR ALTER DATABASE db1
  DATA_RETENTION_TIME_IN_DAYS = 5
  DEFAULT_DDL_COLLATION = 'de';
```

### Unset a parameter previously set on database

The [absence of a previously set parameter](create-or-alter.md) in the modified database definition results
in unsetting it. In the following example, unset the DATA_RETENTION_TIME_IN_DAYS parameter for the database `db1` created
in the previous example:

```sqlexample
CREATE OR ALTER DATABASE db1
  DEFAULT_DDL_COLLATION = 'de';
```

---
title: CREATE DATABASE (catalog-linked)
source: https://docs.snowflake.com/en/sql-reference/sql/create-database-catalog-linked.md
section: SQL Commands
---

# CREATE DATABASE (catalog-linked)

Creates a new [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md) for Apache Iceberg™ tables that use
an external Iceberg REST catalog.

## Syntax

```sqlsyntax
CREATE DATABASE <name>
  LINKED_CATALOG = ( catalogParams ),
  [ EXTERNAL_VOLUME = '<external_vol>' ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ CATALOG_CASE_SENSITIVITY = { CASE_SENSITIVE | CASE_INSENSITIVE } ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

Where:

> ```sqlsyntax
> catalogParams ::=
>   CATALOG = '<catalog_int>',
>   [ ALLOWED_NAMESPACES = ('<namespace1>', '<namespace2>', ... ) ]
>   [ BLOCKED_NAMESPACES = ('<namespace1>', '<namespace2>', ... ) ]
>   [ ALLOWED_WRITE_OPERATIONS = { NONE | ALL } ]
>   [ NAMESPACE_MODE = { IGNORE_NESTED_NAMESPACE | FLATTEN_NESTED_NAMESPACE } ]
>   [ NAMESPACE_FLATTEN_DELIMITER = '<string_literal>' ]
>   [ SYNC_INTERVAL_SECONDS = <value> ]
> ```

## Required parameters

`name`
:   Specifies the identifier for the catalog-linked database; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`EXTERNAL_VOLUME = my_external_vol`
:   Specifies an [external volume](create-external-volume.md)
    that provides access to the data and metadata for your remote Iceberg tables.

    Not required if using [vended credentials](../../user-guide/tables-iceberg-configure-catalog-integration-vended-credentials.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the database.

    Default: No value

`CATALOG_CASE_SENSITIVITY = { CASE_SENSITIVE | CASE_INSENSITIVE }`
:   Specifies the case sensitivity that your external Iceberg catalog uses for identifiers.

    * `CASE_SENSITIVE`: The external Iceberg catalog uses case-sensitive identifiers. For example, Snowflake Open Catalog is a
      case-sensitive catalog.

      + Snowflake matches identifiers exactly as they appear, including case. Snowflake automatically converts unquoted identifiers to
        uppercase, but quoted identifiers must match exactly the case in your external catalog.
      + However, if the external Iceberg catalog is actually case insensitive, and normalizes to lowercase, you must surround identifiers in
        double quotes.

      These requirements only apply to identifying existing schemas, tables, and table columns.
    * `CASE_INSENSITIVE`: The external Iceberg catalog uses case-insensitive identifiers. For example, Unity Catalog and AWS Glue are
      case-insensitive catalogs.

      + If the external Iceberg catalog is case insensitive and you run one of the following commands, you must surround identifiers in
        double quotes:

        - CREATE ICEBERG TABLE
        - CREATE SCHEMA
        - ALTER ICEBERG TABLE ADD COLUMN
        - ALTER ICEBERG TABLE RENAME COLUMN
      + However, if the external Iceberg catalog is actually case sensitive, Snowflake treats unquoted identifiers as case-insensitive and
        automatically converts unquoted identifiers to uppercase. When you create or query objects, Snowflake matches identifiers regardless
        of case, as long as they are unquoted.

        Using this pattern is discouraged because Snowflake can’t resolve two different identifiers that differ in casing. This pattern only
        works when no two identifiers are different in casing only.

      Except where otherwise noted, these requirements only apply to identifying existing schemas, tables, and table columns.

    Default: `CASE_INSENSITIVE`

    For more information on the requirements for identifier resolution, including examples, see [Requirements for identifier resolution in a catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Catalog parameters (catalogParams)

`CATALOG = catalog_int`
:   Specifies the name of your catalog integration.

`ALLOWED_NAMESPACES = ('namespace1', 'namespace2', ... )`
:   Optionally specifies one or more namespaces in your remote catalog to limit the scope of automatic table discovery.
    Snowflake syncs the specified namespaces and all namespaces and tables that are nested under them.
    If a nested namespace is in the ALLOWED_NAMESPACES list but you set the NAMESPACE_MODE parameter to IGNORE_NESTED_NAMESPACE,
    Snowflake does not sync the nested namespace or any schemas and tables under it.

`BLOCKED_NAMESPACES = ('namespace1', 'namespace2', ... )`
:   Optionally specifies one or more namespaces in your remote catalog to block for automatic table discovery.

    Snowflake blocks the specified namespaces and all namespaces and tables that are nested under them.

    If you specify both ALLOWED_NAMESPACES and BLOCKED_NAMESPACES, the BLOCKED_NAMESPACES list takes precedence.
    For example, if `ns1.ns2` is allowed, but `ns1` is blocked, then Snowflake won’t sync `ns1.ns2`.

`ALLOWED_WRITE_OPERATIONS = { NONE | ALL }`
:   Specifies whether your catalog-linked database is read-only or writable.

    * `NONE`: Your catalog-linked database is read-only.

      When your catalog-linked database is read only, any operation that you run that requires committing to the catalog fails. For
      example, DROP ICEBERG TABLE.
    * `ALL`: Your catalog-linked database is writable.

      > **Warning:**
      >
      > When your catalog-linked database has write permissions enabled, Snowflake propagates table drops to the remote catalog, which removes
      > the table and data from both systems.

    Default: `ALL`

`NAMESPACE_MODE = { IGNORE_NESTED_NAMESPACE | FLATTEN_NESTED_NAMESPACE }`
:   Specifies how Snowflake handles namespaces for Iceberg tables in the catalog-linked database.

    * `IGNORE_NESTED_NAMESPACE`: Snowflake links only tables in the first namespace level for your catalog.
    * `FLATTEN_NESTED_NAMESPACE`: Snowflake links tables in all namespace levels for your catalog. For a table in a nested namespace, Snowflake
      uses the NAMESPACE_FLATTEN_DELIMITER parameter to construct a flattened namespace. With this option, you must set
      the NAMESPACE_FLATTEN_DELIMITER parameter.

      For example, consider a table named `iceberg_table_5` in the `namespace3aa` namespace:

      ```none
      my_catalog_linked_db
      |-- namespace3
      |   |-- namespace3a
      |       |-- namespace3aa
      |           |-- iceberg_table_5
      ```

      If you set `NAMESPACE_FLATTEN_DELIMITER = "/"`, you can specify
      `"my_catalog_linked_db"."namespace3/namespace3a/namespace3aa"."iceberg_table_5"` to reference the table.

    Default: `IGNORE_NESTED_NAMESPACE`

`NAMESPACE_FLATTEN_DELIMITER = 'string_literal'`
:   Required if you set NAMESPACE_MODE = FLATTEN_NESTED_NAMESPACE.
    Specifies a delimiter, which Snowflake uses to construct flattened namespaces for tables in your catalog.

    > **Important:**
    >
    > The character that you choose for a delimiter can’t appear in your remote namespaces. During the autodiscovery process,
    > Snowflake skips any namespace that contains the delimiter and does not create a corresponding schema in your catalog-linked database.

    Valid characters: Any characters allowed in [Snowflake identifiers](../identifiers-syntax.md).

`SYNC_INTERVAL_SECONDS = value`
:   Specifies the time interval in seconds that Snowflake should use for automatically discovering schemas and tables in your remote catalog.

    Values: 30 to 86400 (1 day), inclusive

    Default: 30 seconds

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DATABASE | Account | Required to create a new database.  Only the SYSADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| USAGE | External Volume | Required to reference an existing external volume. |
| USAGE | Catalog integration | Required to reference an existing catalog integration. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Supported only when you use a catalog integration for Iceberg REST (for example, Snowflake Open Catalog).
* To limit automatic table discovery to a specific set of namespaces, use the ALLOWED_NAMESPACES parameter. You can also use the
  BLOCKED_NAMESPACES parameter to block a set of namespaces.
* Snowflake doesn’t sync remote catalog access control for users or roles.
* You can create schemas, externally managed Iceberg tables, or database roles in a catalog-linked database. Creating other Snowflake objects
  isn’t currently supported.
* When you create a catalog-linked database, you can’t specify the default Iceberg version or merge-on-read behavior to use for
  Iceberg tables.

  However, you can modify these properties for an existing database by using the [ALTER DATABASE (catalog-linked)](alter-database-catalog-linked.md)
  command to set the following parameters:

  + ICEBERG_VERSION_DEFAULT
  + ENABLE_ICEBERG_MERGE_ON_READ
* For Iceberg tables in a catalog-linked database:

  + Snowflake doesn’t copy remote catalog table properties, such as retention policies or buffers, and doesn’t currently support altering table properties.
  + [Automated refresh](../../user-guide/tables-iceberg-auto-refresh.md) is enabled by default. If the `table-uuid` of an external table
    and the catalog-linked database table don’t match, refresh fails and Snowflake drops the table from the catalog-linked database; Snowflake doesn’t change the remote table.
  + If you drop a table from the remote catalog, Snowflake drops the table from the catalog-linked database.
    This action is asynchronous, so you might not see the change in the remote catalog right away.
  + If you rename a table in the remote catalog, Snowflake drops the existing table from the catalog-linked database and creates a table with the new name.
  + Masking policies and tags are supported. Other Snowflake-specific features, including replication and cloning, aren’t supported.
  + The character that you choose for the NAMESPACE_FLATTEN_DELIMITER parameter can’t appear in your remote namespaces. During the auto discovery process,
    Snowflake skips any namespace that contains the delimiter, and doesn’t create a corresponding schema in your catalog-linked database.
  + If you specify anything other than `_`, `$`, or numbers for the NAMESPACE_FLATTEN_DELIMITER parameter,
    you must put the schema name in quotes when you query the table.
  + For databases linked to AWS Glue, you must use lowercase letters and surround the schema, table, and column names in double quotes.
    This is also required for other Iceberg REST catalogs that only support lowercase identifiers.

    The following example shows a valid query:

    ```sqlexample
    CREATE SCHEMA "s1";
    ```

    The following statements aren’t valid, because they use uppercase letters or omit the double quotes:

    ```sqlexample
    CREATE SCHEMA s1;
    CREATE SCHEMA "Schema1";
    ```
  + Using UNDROP ICEBERG TABLE isn’t supported.
  + Sharing:

    - Sharing with a listing isn’t currently supported
    - Direct sharing is supported
* For writing to tables in a catalog-linked database:

  + Creating tables in nested namespaces isn’t currently supported.
  + Writing to tables in nested namespaces isn’t currently supported.
  + Position [row-level deletes](https://iceberg.apache.org/spec/#row-level-deletes) are supported for tables stored
    on Amazon S3, Azure, or Google Cloud. Row-level deletes with equality delete files aren’t supported. For more information about row-level deletes,
    see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md). To turn off position deletes, which enable
    running the Data Manipulation Language (DML) operations in copy-on-write mode, set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to FALSE at the table, schema, or
    database level.

* For ALLOWED_NAMESPACES and BLOCKED_NAMESPACES, Snowflake doesn’t store nested namespaces if the set already contains the parent namespace.
  For example, if you create a database and specify `ALLOWED_NAMESPACES = ('ns1', 'ns1.ns2', 'ns1.ns3')`, Snowflake only stores `ns1` since the other two are automatically included.
  If you use [GET_DDL](../functions/get_ddl.md) on the example database, Snowflake returns `ALLOWED_NAMESPACES = ('ns1')`. The same applies for BLOCKED_NAMESPACES.
* You can create [database roles](create-database-role.md) in a catalog-linked database to manage
  access control for objects in the database. For example, you can grant privileges on schemas and tables in a catalog-linked database
  to a database role, and then grant the database role to account roles.
* For querying tables in a catalog-linked database:

  + Snowflake automatically converts unquoted identifiers (table and column names) to uppercase.
    If your external Iceberg catalog uses case-sensitive identifiers, you must surround table and column names in double quotes.

    For more information about object identifiers, see [Identifier requirements](../identifiers-syntax.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a catalog-linked database with flattened, nested namespaces that uses an external volume.

```sqlexample
CREATE DATABASE my_linked_db
  LINKED_CATALOG = (
    CATALOG = 'my_catalog_int',
    NAMESPACE_MODE = FLATTEN_NESTED_NAMESPACE,
    NAMESPACE_FLATTEN_DELIMITER = '-'
  )
  EXTERNAL_VOLUME = 'my_external_vol';
```

Create a catalog-linked database that uses vended credentials and specifies one allowed namespace:

```sqlexample
CREATE DATABASE my_linked_db
  LINKED_CATALOG = (
    CATALOG = 'my_catalog_int_vended_creds',
    ALLOWED_NAMESPACES = ('my_namespace')
  );
```

Create a database role in a catalog-linked database and grant privileges:

```sqlexample
CREATE DATABASE ROLE my_linked_db.analyst;

GRANT USAGE ON SCHEMA my_linked_db.my_namespace TO DATABASE ROLE my_linked_db.analyst;

GRANT SELECT ON ALL ICEBERG TABLES IN SCHEMA my_linked_db.my_namespace TO DATABASE ROLE my_linked_db.analyst;

GRANT DATABASE ROLE my_linked_db.analyst TO ROLE data_consumer;
```

---
title: CREATE DATABASE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-database-role.md
section: SQL Commands
---

# CREATE DATABASE ROLE

Create a new [database role](../../user-guide/security-access-control-considerations.md) or replace an existing database role in the system.

After creating database roles, you can grant object privileges to the database role and then grant the database role to other database
roles or account roles to enable access control security for objects in the system.

This command supports the following variants:

* CREATE OR ALTER DATABASE ROLE: Creates a new database role if it doesn’t exist or alters an existing database role.

See also:
:   [GRANT <privileges> … TO ROLE](grant-privilege.md), [GRANT DATABASE ROLE](grant-database-role.md) , [GRANT OWNERSHIP](grant-ownership.md) , [DROP DATABASE ROLE](drop-database-role.md) , [ALTER DATABASE ROLE](alter-database-role.md) ,
    [SHOW DATABASE ROLES](show-database-roles.md), [CREATE <object> … CLONE](create-clone.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] DATABASE ROLE [ IF NOT EXISTS ] <name>
  [ COMMENT = '<string_literal>' ]
```

## Variant syntax

### CREATE OR ALTER DATABASE ROLE

Creates a new database role if it doesn’t already exist, or transforms an existing database role into the role defined in the statement.
A CREATE OR ALTER DATABASE ROLE statement follows the syntax rules of a CREATE DATABASE ROLE statement and has the same limitations as an
[ALTER DATABASE ROLE](alter-database-role.md) statement.

```sqlsyntax
CREATE OR ALTER DATABASE ROLE <name>
  [ COMMENT = '<string_literal>' ]
```

For more information, see CREATE OR ALTER DATABASE ROLE usage notes.

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the database role; must be unique in the database in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified in the form of `db_name.database_role_name`, the command creates the database role
    in the current database for the session.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the database role.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DATABASE ROLE | Database | A role with the OWNERSHIP privilege on the database can grant the CREATE DATABASE ROLE privilege to another account role. |
| OWNERSHIP | Database role | Required to execute a CREATE OR ALTER DATABASE ROLE statement for an *existing* database role.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* You can create database roles in a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).
* When you create a database role, the USAGE privilege on the database that contains the database role is granted to the database role
  automatically.

> **Caution:**
>
> Avoid recreating a database role (using the OR REPLACE keywords). Behind the scenes, recreating an object (using CREATE OR REPLACE
> *<object>*) first drops and then creates the object. Recreating a database role drops the database role from any shares that it is
> granted to. You must grant the database role to these shares again.
>
> If you must recreate a database role, notify any data consumers of a share that includes the database role. They must grant the database
> role to their own account roles again.

Regarding metadata:

> > **Attention:**
> >
> > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER DATABASE ROLE usage notes

* All limitations of the [ALTER DATABASE ROLE](alter-database-role.md) command apply.
* Setting or unsetting a tag is not supported; however, existing tags are not altered by a CREATE OR ALTER DATABASE ROLE statement and remain
  unchanged.

## Examples

Create database role `dr1` in database `d1`:

> ```sqlexample
> CREATE DATABASE ROLE d1.dr1;
> ```

Create a database role in a catalog-linked database:

> ```sqlexample
> CREATE DATABASE ROLE my_linked_db.reader_role
>   COMMENT = 'Read-only role for catalog-linked database';
> ```

---
title: CREATE DATASET
source: https://docs.snowflake.com/en/sql-reference/sql/create-dataset.md
section: SQL Commands
---

# CREATE DATASET

Creates a new [machine learning dataset](../../developer-guide/snowflake-ml/dataset.md) in the current schema or the schema that you specify.

See also:
:   [ALTER DATASET](alter-dataset.md) , [ALTER DATASET … ADD VERSION](alter-dataset-add-version.md) , [ALTER DATASET … DROP VERSION](alter-dataset-drop-version.md), [SHOW DATASETS](show-datasets.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ IF NOT EXISTS ] DATASET <name>
```

## Required parameters

`name`
:   The name of the dataset that you’re creating within the current schema or a schema that you specify.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DATASET | Schema | Only provides the privilege to create a dataset. You must also have the USAGE privilege on the schema. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example creates a dataset called `my_dataset`:

```sqlexample
CREATE DATASET my_dataset;
```

The following example creates or replaces a dataset called `my_dataset`:

```sqlexample
CREATE OR REPLACE DATASET my_dataset;
```

---
title: CREATE DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/create-dbt-project.md
section: SQL Commands
---

# CREATE DBT PROJECT

Creates a new [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md) or replaces an existing dbt project. Running CREATE DBT PROJECT with the OR REPLACE option resets the version identifier to `version$1` and removes all version name aliases. For more information, see [Versions for dbt project objects and files](../../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).

See also:
:   [ALTER DBT PROJECT](alter-dbt-project.md), [DESCRIBE DBT PROJECT](desc-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md), [DROP DBT PROJECT](drop-dbt-project.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] DBT PROJECT [ IF NOT EXISTS ] <name>
  [ FROM '<source_location>' ]
  [ COMMENT = '<string_literal>' ]
  [ DBT_VERSION = <version_number> ]
  [ DEFAULT_TARGET = <default_target> ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]
```

## Parameters

`name`
:   String that specifies the identifier (that is, the name) for the dbt project object within Snowflake; must be unique for the schema in which the dbt project is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FROM 'source_location'`
:   A string that specifies the location in Snowflake of the source files for the dbt project object. This can be a parent directory that contains multiple dbt projects, or a specific subdirectory that contains a dbt project and `dbt_project.yml` file.

    If the specified location doesn’t contain a `dbt_project.yml` file, the [EXECUTE DBT PROJECT](execute-dbt-project.md) command must use the PROJECT_ROOT parameter to specify the subdirectory path to a `dbt_project.yml` file.

    If no value is specified, Snowflake creates an empty dbt project.

    The dbt project source files can be in any one of the following locations:

    > * **A Git repository stage**, for example:
    >
    >   `'@my_db.my_schema.my_git_repository_stage/branches/my_branch/path/to/dbt_project_or_projects_parent'`
    >
    >   For more information about creating a Git repository object in Snowflake that connects a Git repository to a workspace for dbt Projects on Snowflake, see [Create a workspace connected to your Git repository](../../user-guide/tutorials/dbt-projects-on-snowflake-getting-started-tutorial.md). For more information about creating and managing a Git repository object and stage without using a workspace, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md) and [CREATE GIT REPOSITORY](create-git-repository.md).
    > * **An existing dbt project stage**, for example:
    >
    >   `'snow://dbt/my_db.my_schema.my_existing_dbt_project_object/versions/last'`
    >
    >   The version specifier is required and can be `last` (as shown in the previous example), `first`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](../../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).
    > * **An internal named stage**, for example:
    >
    >   `'@my_db.my_schema.my_internal_named_stage/path/to/dbt_projects_or_projects_parent'`
    >
    >   Internal user stages and table stages aren’t supported.
    > * **A workspace for dbt on Snowflake**, for example:
    >
    >   `'snow://workspace/user$.public."my_workspace_name"/versions/live/path/to/dbt_projects_or_projects_parent'`
    >
    >   We recommend enclosing the workspace name in double quotes because workspace names are case-sensitive and can contain special characters.
    >
    >   The version specifier is required and can be `last`, `first`, `live`, or the specifier for any existing version in the form `version$<num>`. For more information, see [Versions for dbt project objects and files](../../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).

    Default: No value

`COMMENT = 'string_literal'`
:   Specifies a comment for the dbt project object.

    Default: No value

`DBT_VERSION = version_number`
:   Specifies a version for the dbt Project.

    Default: `1.9.4`, unless an administrator set a version using the [DEFAULT_DBT_VERSION](../parameters.md) account parameter.

`DEFAULT_TARGET = default_target`
:   Specifies the profile used for compilation and subsequent runs (for example, `prod`) of the dbt project object. You can override this parameter by using the [EXECUTE DBT PROJECT](execute-dbt-project.md)
    command with `ARGS = --target`.

    Default: No value

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   Specifies the external access integration used to grant permissions to pull remote dependencies from dbt package hub or GitHub. When declared on an object, `dbt deps` will run automatically during deployment.
    For more information, see [Understand dependencies for dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake-dependencies.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| CREATE DBT PROJECT | Schema |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

* Create a dbt project object from a Git repository stage in Snowflake
* Create a dbt project object from a subdirectory within a Git repository stage in Snowflake
* Create a dbt project object from a specific version of an existing dbt project object
* Create a dbt project object from a workspace that contains multiple dbt projects

### Create a dbt project object from a Git repository stage in Snowflake

Create a dbt project object named `sales_dbt_model` from dbt project files in a Git repository stage. This example references the `main`
branch of a Git repository stage named `sales_dbt_git_stage` in Snowflake, where the project’s `dbt_project.yml` file is saved in the
repository root. The command also sets the default target used when executing dbt commands and specifies the external access integrations required
by the project.

```sqlexample
CREATE DBT PROJECT sales_db.dbt_projects_schema.sales_model
  FROM '@sales_db.integrations_schema.sales_dbt_git_stage/branches/main'
  DEFAULT_TARGET = 'prod'
  EXTERNAL_ACCESS_INTEGRATIONS = 'my_external_access_integration'
  COMMENT = 'Generates sales data models.';
```

### Create a dbt project object from a subdirectory within a Git repository stage in Snowflake

Create a dbt project object named `sw_region_sales_model` from a subdirectory inside a Git repository stage that contains multiple dbt projects.
The example references the `main` branch of a Git repository stage named `sales_dbt_git_stage` in Snowflake, where the project’s
`dbt_project.yml` file is saved in the `sw_region_dbt_project` subdirectory of the `sales_dbt_projects_parent` directory.

This example also sets the following properties:

* dbt version
* Default execution target (for example, `prod` or `dev`) used by dbt commands executed through Snowflake.
* External access integrations the dbt Project is permitted to use to pull remote dependencies from dbt package hub or Github.

```sqlexample
CREATE DBT PROJECT sales_db.dbt_projects_schema.sw_region_sales_model
  FROM '@sales_db.integrations_schema.sales_dbt_git_stage/branches/main/sales_dbt_projects_parent/sw_region_dbt_project'
  DBT_VERSION = '1.10.15'
  DEFAULT_TARGET = 'prod'
  EXTERNAL_ACCESS_INTEGRATIONS = 'my_external_access_integration'
  COMMENT = 'Generates data models for SW sales region.';
```

### Create a dbt project object from a specific version of an existing dbt project object

Create a new dbt project object named `sales_model_nw_region` from `version$2` of the existing `sales_model` dbt project.

This example also sets a default execution target using DEFAULT_TARGET, and specifies allowed external access integrations using EXTERNAL_ACCESS_INTEGRATIONS.

```sqlexample
CREATE DBT PROJECT sales_db.dbt_projects_schema.sales_model_nw_region
  FROM 'snow://dbt/sales_db.dbt_projects_schema.sales_model/versions/version$2'
  DEFAULT_TARGET = 'prod'
  EXTERNAL_ACCESS_INTEGRATIONS = (my_ext_integration_1, my_ext_integration_2)
  COMMENT = 'Generates data models for the NW sales region.';
```

### Create a dbt project object from a workspace that contains multiple dbt projects

Create a new dbt project object named `sales_model_from_workspace` from the live version of a workspace containing multiple dbt project directories. “My dbt
Project Workspace” inside the user’s personal database. This is useful when the workspace has several subprojects and you want to create a dbt project object
from a specific subdirectory. Workspaces are case-sensitive and can include special characters, so we recommend enclosing the workspace name in double quotes.

```sqlexample
CREATE DBT PROJECT sales_db.dbt_projects_schema.sales_model_from_workspace
  FROM 'snow://workspace/user$.public."My dbt Project Workspace"/versions/live/project2'

EXECUTE DBT PROJECT sales_db.dbt_projects_schema.sales_model_from_workspace
  ARGS = 'run --target prod';
```

---
title: CREATE DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/create-dcm-project.md
section: SQL Commands
---

# CREATE DCM PROJECT

Creates a new [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) or replaces an existing DCM project.

See also:
:   [ALTER DCM PROJECT](alter-dcm-project.md) , [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md) , [EXECUTE DCM PROJECT](execute-dcm-project.md) , [SHOW DCM PROJECTS](show-dcm-projects.md) , [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] DCM PROJECT [ IF NOT EXISTS ] <name>
  [LOG_LEVEL = { DEBUG | INFO | WARN | ERROR }]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (name) for the DCM project; must be unique for the schema in which the DCM project is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`LOG_LEVEL = { DEBUG | INFO | WARN | ERROR }`
:   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at the specified
    level (and at more severe levels) are ingested.

    For more information, see [LOG_LEVEL](../parameters.md) and [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the DCM project.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| CREATE DCM PROJECT | Schema |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

* Create a DCM project:

  ```sqlexample
  CREATE DCM PROJECT MY_PROJECT;
  ```
* Create a DCM project with a comment:

  ```sqlexample
  CREATE DCM PROJECT MY_PROJECT
    COMMENT = 'My DCM project for data management';
  ```

---
title: CREATE DYNAMIC TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table.md
section: SQL Commands
---

# CREATE DYNAMIC TABLE

Creates a [dynamic table](../../user-guide/dynamic-tables-about.md), based on a specified query.

This command supports the following variants:

* CREATE OR ALTER DYNAMIC TABLE: Creates a dynamic table if it doesn’t exist or alters an existing dynamic table.
* CREATE DYNAMIC TABLE FROM BACKUP SET: Restores a dynamic table from a back up.
* CREATE DYNAMIC TABLE … CLONE: Creates a clone of an existing dynamic table.
* CREATE DYNAMIC ICEBERG TABLE: Creates a dynamic Apache Iceberg™ table.

See also:
:   [ALTER DYNAMIC TABLE](alter-dynamic-table.md), [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md), [DROP DYNAMIC TABLE](drop-dynamic-table.md) , [SHOW DYNAMIC TABLES](show-dynamic-tables.md),
    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] DYNAMIC TABLE [ IF NOT EXISTS ] <name> (
    -- Column definition
    <col_name> <col_type>
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] PROJECTION POLICY <policy_name> ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]
      [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ ... ] ]

  )
  TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
  [ SCHEDULER = DISABLE | ENABLE ]
  WAREHOUSE = <warehouse_name>
  [ INITIALIZATION_WAREHOUSE = <warehouse_name> ]
  [ REFRESH_MODE = { AUTO | FULL | INCREMENTAL } ]
  [ INITIALIZE = { ON_CREATE | ON_SCHEDULE } ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ COMMENT = '<string_literal>' ]
  [ COPY GRANTS ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> [ ENTITY KEY ( <col_name> [ , <col_name> ... ] ) ] ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ REQUIRE USER ]
  [ IMMUTABLE WHERE ( <expr> ) ]
  [ BACKFILL FROM ]
  [ EXECUTE AS USER <user_name>
    [ USE SECONDARY ROLES { ALL | NONE | <role> [ , ... ] } ]
  ]
  [ ROW_TIMESTAMP = { TRUE | FALSE } ]
  AS <query>
```

## Variant syntax

### CREATE OR ALTER DYNAMIC TABLE

```sqlsyntax
CREATE OR ALTER DYNAMIC TABLE <name> (
  -- Column definition
  <col_name> <col_type>
    [ COLLATE '<collation_specification>' ]
    [ COMMENT '<string_literal>' ]

  -- Additional column definitions
  [ , <col_name> <col_type> [ ... ] ]
  )
  TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
  [ SCHEDULER = DISABLE | ENABLE ]
  WAREHOUSE = <warehouse_name>
  [ REFRESH_MODE = { AUTO | FULL | INCREMENTAL } ]
  [ IMMUTABLE WHERE ( <expr> ) ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ COMMENT = '<string_literal>' ]
  [ ROW_TIMESTAMP = { TRUE | FALSE } ]
  AS <query>
```

Creates a dynamic table if it doesn’t exist, or alters it according to the dynamic table
definition. The CREATE OR ALTER DYNAMIC TABLE syntax follows the rules of a
CREATE DYNAMIC TABLE statement and has the same limitations as
an [ALTER DYNAMIC TABLE](alter-dynamic-table.md) statement.

For more information, see [CREATE OR ALTER <object>](create-or-alter.md).

Changes to the following dynamic table properties and parameters preserve data:

* TARGET_LAG
* WAREHOUSE
* CLUSTER BY
* DATA_RETENTION_TIME_IN_DAYS
* MAX_DATA_EXTENSION_TIME_IN_DAYS
* COMMENT
* IMMUTABLE WHERE

  + When specified, only the mutable region is reinitialized and data in the immutable region is preserved. For more information, see [Understanding immutability constraints](../../user-guide/dynamic-tables-immutability-constraints.md).

Changes to the following dynamic table properties and parameters trigger a [reinitialization](../../user-guide/dynamic-tables-refresh.md):

* REFRESH_MODE
* Changes to the query or column list:

  + Dropping existing columns is supported.
  + Adding new columns is supported, but they can only be added at the end of existing columns.
  + Dropping columns that are used in an IMMUTABLE WHERE predicate or as clustering keys isn’t supported.

For more information, see [CREATE OR ALTER TABLE usage notes](create-table.md).

### CREATE DYNAMIC TABLE FROM BACKUP SET

```sqlsyntax
CREATE DYNAMIC TABLE <name> FROM BACKUP SET <backup_set> IDENTIFIER '<backup_id>'
```

The FROM BACKUP SET clause restores a dynamic table from a backup. You don’t specify other table
properties because they’re all the same as in the backed-up table.

This form doesn’t have a CREATE OR REPLACE clause. You typically either restore the
dynamic table under a new name and recover any data or other objects from this new table,
or rename the original table and then restore the table under the original name.

> **Note:**
>
> The backup set is associated with the internal table ID of the original table.
> Any more backups you add to the backup set use the original table, even if you
> changed its name. If you want to make backups of the newly restored table, create a
> new backup set for it.
>
> When you restore a dynamic table from a backup, Snowflake
> [automatically initializes](../../user-guide/dynamic-tables-refresh.md)
> the new table during its first refresh.

For more information about backups, see [Backups for disaster recovery and immutable storage](../../user-guide/backups.md).

`backup_set`
:   Specifies the name of a backup set created for a specific dynamic table.
    You can use the SHOW BACKUP SETS command to locate the right backup set.

`backup_id`
:   Specifies the identifier of a specific backup within that backup set.
    You can use the SHOW BACKUPS IN BACKUP SET command to locate the right identifier within the backup
    set, based on the creation date and time for the backup.

### CREATE DYNAMIC TABLE … CLONE

Creates a new dynamic table with the same column definitions and containing all the
existing data from the source dynamic table, without actually copying the data.

Cloned dynamic tables, whether cloned directly or as part of a cloned database or schema, are suspended by default. In [DYNAMIC_TABLE_GRAPH_HISTORY](../functions/dynamic_table_graph_history.md),
this appears as CLONED_AUTO_SUSPENDED in the SCHEDULING_STATE column. Any downstream dynamic tables are also suspended, shown as UPSTREAM_CLONED_AUTO_SUSPENDED.
For more information, see [Automatic dynamic table suspension](../../user-guide/dynamic-tables-suspend-resume.md).

You can also clone a dynamic table as it existed at a specific point in the past. For
more information, see [Cloning considerations](../../user-guide/object-clone.md).

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] DYNAMIC TABLE <name>
  CLONE <source_dynamic_table>
        [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
  [
    COPY GRANTS
    TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
    WAREHOUSE = <warehouse_name>
    EXECUTE AS USER <user_name>
      USE SECONDARY ROLES { ALL | NONE | <role> [ , ... ] }
  ]
```

If the source dynamic table has clustering keys, then the cloned dynamic table has
clustering keys. By default, Automatic Clustering is suspended for the new table, even
if Automatic Clustering was not suspended for the source table.

For more details about cloning, see [CREATE <object> … CLONE](create-clone.md).

### CREATE DYNAMIC ICEBERG TABLE

Creates a new dynamic Apache Iceberg™ table. For information about Iceberg tables, see
[Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) and [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](create-iceberg-table-snowflake.md).

```sqlsyntax
CREATE [ OR REPLACE ] DYNAMIC ICEBERG TABLE <name> (
  -- Column definition
  <col_name> <col_type>
    [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
    [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
    [ COMMENT '<string_literal>' ]

  -- Additional column definitions
  [ , <col_name> <col_type> [ ... ] ]

)
TARGET_LAG = { '<num> { seconds | minutes | hours | days }' | DOWNSTREAM }
WAREHOUSE = <warehouse_name>
[ EXTERNAL_VOLUME = '<external_volume_name>' ]
[ CATALOG = 'SNOWFLAKE' ]
[ BASE_LOCATION = '<optional_directory_for_table_files>' ]
[ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
[ PARTITION BY ( partitionExpression [, partitionExpression , ...] ) ]
[ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
[ ICEBERG_VERSION = <integer> ]
[ REFRESH_MODE = { AUTO | FULL | INCREMENTAL } ]
[ IMMUTABLE WHERE ( <expr> ) ]
[ INITIALIZE = { ON_CREATE | ON_SCHEDULE } ]
[ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
[ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
[ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
[ COMMENT = '<string_literal>' ]
[ COPY GRANTS ]
[ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
[ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
[ REQUIRE USER ]
[ EXECUTE AS USER <user_name>
  [ USE SECONDARY ROLES { ALL | NONE | <role> [ , ... ] } ]
]
AS <query>
```

Where:

```sqlsyntax
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )
```

For more information about usage and limitations, see
[Create dynamic Apache Iceberg™ tables](../../user-guide/dynamic-tables-create-iceberg.md).

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the dynamic table; must be unique for the schema in which the dynamic table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TARGET_LAG = { num { seconds | minutes | hours | days } | DOWNSTREAM }`
:   Specifies the lag for the dynamic table:

    > `'num seconds | minutes | hours | days'`
    > :   Specifies the maximum amount of time that the dynamic table’s content should lag behind updates to the source tables.
    >
    >     For example:
    >
    >     * If the data in the dynamic table should lag by no more than 5 minutes, specify `5 minutes`.
    >     * If the data in the dynamic table should lag by no more than 5 hours, specify `5 hours`.
    >
    >     Must be a minimum of 60 seconds. If the dynamic table depends on another dynamic table, the minimum target lag must
    >     be greater than or equal to the target lag of the dynamic table it depends on.
    >
    > `DOWNSTREAM`
    > :   Specifies that the dynamic table should be refreshed only when dynamic tables that depend on it are refreshed.

    Required when `SCHEDULER = ENABLE`.

    For information on how target lag affects refresh frequency and costs, see [Identify the right target lag](../../user-guide/dynamic-tables-performance-optimize.md).

`WAREHOUSE = warehouse_name`
:   Specifies the name of the warehouse that provides the compute resources for refreshing the dynamic table.

    You must use a role that has the USAGE privilege on this warehouse in order to create the dynamic table. For limitations and more
    information, see [Privileges to create a dynamic table](../../user-guide/dynamic-tables-privileges.md).

    For guidance on choosing a warehouse for optimal refresh performance, see [Adjust your warehouse configuration](../../user-guide/dynamic-tables-performance-optimize.md).

`AS query`
:   Specifies the query whose results the dynamic table should contain.

## Optional parameters

`SCHEDULER = { DISABLE | ENABLE }`
:   Specifies whether the dynamic table is to be refreshed automatically by Snowflake’s dynamic table scheduler.

    `DISABLE`
    :   Excludes the dynamic table from automatic background refresh. The table isn’t refreshed on a schedule, either directly or
        through downstream dependencies.

        * Manual control: Refreshing must be triggered manually by using `ALTER DYNAMIC TABLE ... REFRESH`.
        * Isolation: A manual refresh of a disabled table doesn’t automatically refresh its upstream dependencies. This creates a “isolation
          boundary,” allowing external orchestrators, like dbt, to manage specific table refreshes in isolation without triggering the entire
          pipeline.
        * `TARGET_LAG` can’t be defined when `SCHEDULER = DISABLE`.

    `ENABLE`
    :   Enables the automated background scheduler for the dynamic table. The scheduler ensures that the table is refreshed alongside its
        dependencies to maintain snapshot consistency. In this mode, Snowflake automatically calculates the optimal refresh frequency based on
        the defined `TARGET_LAG`. With this setting, `TARGET_LAG` must be set.

    If not specified, the dynamic table is scheduler-managed by default. [SHOW DYNAMIC TABLES](show-dynamic-tables.md) displays `NULL` for the `SCHEDULER` column when the attribute isn’t explicitly set.

`INITIALIZATION_WAREHOUSE = warehouse_name`
:   Specifies a warehouse to use for all dynamic table [initializations and reinitializations](../../user-guide/dynamic-tables-refresh.md).

    If this parameter isn’t included in the CREATE DYNAMIC TABLE statement, the dynamic table uses the warehouse that is specified by the
    required WAREHOUSE parameter for all refreshes.

    You must use a role that has the USAGE privilege on this warehouse for you to create the dynamic table. For limitations and more
    information, see [Privileges to create a dynamic table](../../user-guide/dynamic-tables-privileges.md).

`TRANSIENT`
:   Specifies that the table is transient.

    Like permanent dynamic tables, [transient](../../user-guide/tables-temp-transient.md) dynamic tables exist until
    they’re explicitly dropped, and are available to any user with the appropriate privileges. Transient dynamic
    tables don’t retain data in fail-safe storage, which helps reduce storage costs, especially for tables that
    refresh frequently. Due to this reduced level of durability, transient dynamic tables are best used for
    transitory data that doesn’t need the same level of data protection and recovery provided by permanent tables.

    Default: No value. If a dynamic table is not declared as `TRANSIENT`, it is permanent.

`REFRESH_MODE = { AUTO | FULL | INCREMENTAL }`
:   Specifies the [refresh mode](../../user-guide/dynamic-tables-refresh.md) for the dynamic table.

    This property cannot be altered after you create the dynamic table. To modify the property, recreate the dynamic table with a CREATE OR
    REPLACE DYNAMIC TABLE command.

    > `AUTO`
    > :   When refresh mode is `AUTO`, the system attempts to apply an incremental refresh by default. However, when incremental refresh isn’t
    >     supported or expected to perform well, the dynamic table automatically selects full refresh instead. For more information, see
    >     [Dynamic table refresh modes](../../user-guide/dynamic-tables-refresh.md) and [Choose a refresh mode](../../user-guide/dynamic-tables-performance-optimize.md).
    >
    >     To determine the best mode for your use case, experiment with refresh modes and automatic recommendations. For consistent behavior across
    >     Snowflake releases, explicitly set the refresh mode on all dynamic tables.
    >
    >     To verify the refresh mode for your dynamic tables, see [Refresh mode](../../user-guide/dynamic-tables-performance-monitor.md).
    >
    > `FULL`
    > :   Enforces a full refresh of the dynamic table, even if the dynamic table can be incrementally refreshed.
    >
    > `INCREMENTAL`
    > :   Enforces an incremental refresh of the dynamic table. If the query that underlies the dynamic table can’t perform an incremental refresh,
    >     dynamic table creation fails and displays an error message.
    >
    >     For information about how operators affect incremental refresh, see [Optimize queries for incremental refresh](../../user-guide/dynamic-tables-performance-optimize-query.md).
    >
    > Default: `AUTO`

`INITIALIZE`
:   Specifies the behavior of the [initial refresh](../../user-guide/dynamic-tables-refresh.md) of the dynamic table. This property cannot be
    altered after you create the dynamic table. To modify the property, replace the dynamic table with a CREATE OR REPLACE DYNAMIC TABLE command.

    > `ON_CREATE`
    > :   Refreshes the dynamic table synchronously at creation. If this refresh fails, dynamic table creation fails and displays an error message.
    >
    > `ON_SCHEDULE`
    > :   Refreshes the dynamic table at the next scheduled refresh.
    >
    >     The dynamic table is populated when the refresh schedule process runs. No data is populated when the dynamic table is created. If you try to
    >     query the table using `SELECT * FROM DYNAMIC TABLE`, you might see the following error because the first scheduled refresh has not yet
    >     occurred.
    >
    >     ```output
    >     Dynamic Table is not initialized. Please run a manual refresh or wait for a scheduled refresh before querying.
    >     ```
    >
    > Default: `ON_CREATE`

`COMMENT 'string_literal'`
:   Specifies a comment for the column.

    (Note that comments can be specified at the column level or the table level. The syntax for each is slightly different.)

`MASKING POLICY = policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

`PROJECTION POLICY policy_name`
:   Specifies the [projection policy](../../user-guide/projection-policies.md) to set on a column.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`column_list`
:   If you want to change the name of a column or add a comment to a column in the dynamic table,
    include a column list that specifies the column names and, if needed, comments about
    the columns. You do not need to specify the data types of the columns.

    If any of the columns in the dynamic table are based on expressions - for example, not simple column names -
    then you must supply a column name for each column in the dynamic table. For instance, the column names are
    required in the following case:

    ```sqlexample
    CREATE DYNAMIC TABLE my_dynamic_table (pre_tax_profit, taxes, after_tax_profit)
      TARGET_LAG = '20 minutes'
        WAREHOUSE = mywh
        AS
          SELECT revenue - cost, (revenue - cost) * tax_rate, (revenue - cost) * (1.0 - tax_rate)
          FROM staging_table;
    ```

    You can specify an optional comment for each column. For example:

    ```sqlexample
    CREATE DYNAMIC TABLE my_dynamic_table (pre_tax_profit COMMENT 'revenue minus cost',
                    taxes COMMENT 'assumes taxes are a fixed percentage of profit',
                    after_tax_profit)
      TARGET_LAG = '20 minutes'
        WAREHOUSE = mywh
        AS
          SELECT revenue - cost, (revenue - cost) * tax_rate, (revenue - cost) * (1.0 - tax_rate)
          FROM staging_table;
    ```

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

`ICEBERG_VERSION = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ specification that the table conforms to.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    If you don’t set this parameter, the Iceberg table defaults to the Iceberg version for the schema, database, or account. The schema
    takes precedence over the database, and the database takes precedence over the account.

    > * `2`: The table conforms with Iceberg version 2.
    > * `3`: The table conforms with Iceberg version 3.
    >
    > Default: `2`
    >
    > For more information about this parameter, see [ICEBERG_VERSION](../parameters.md).

`TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }'`
:   Specifies a target Parquet file size for the table.

    * `'{ 16MB | 32MB | 64MB | 128MB }'` specifies a fixed target file size for the table.
    * `'AUTO'` works differently, depending on the table type:

      + Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics
        such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically
        adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake. Use this option to optimize table performance
        in Snowflake.
      + Externally managed tables: AUTO specifies that Snowflake should aggressively scale to the largest file size (128 MB).

    For more information, see [Set a target file size](../../user-guide/tables-iceberg-manage.md).

    Default: AUTO

`PARTITION BY ( partitionExpression [ , partitionExpression , ... ] )`
:   Specifies one or more [partition expressions](create-iceberg-table-snowflake.md)
    for the dynamic Iceberg table. For parameter details, see
    [Partition expression parameters (partitionExpression)](create-iceberg-table-snowflake.md) in
    [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](create-iceberg-table-snowflake.md).

`PATH_LAYOUT = { FLAT | HIERARCHICAL }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the path layout that Snowflake uses when writing Parquet data files to the table:

    * `FLAT`: Snowflake writes all Parquet data files under the `data/` directory for the table.
    * `HIERARCHICAL`: Snowflake writes partitioned data under the `data/` directory for the table by using a hierarchical
      path layout. With this layout, each partition column is represented
      as a directory level in the path. To define these partition
      columns, use the PARTITION BY parameter. This layout is also called “Hive-style” partitioning.

      If you specify PATH_LAYOUT = HIERARCHICAL without a PARTITION BY clause,
      Snowflake stores the Parquet data files by using a flat layout path. You can’t
      modify the path layout for an existing table, so you might set this
      parameter to HIERARCHICAL without specifying a PARTITION BY clause if you don’t want to use partitioning with
      hierarchical paths now but you might in the future.

    > **Note:**
    >
    > For externally managed tables that you create in a standard Snowflake database, Snowflake infers and honors the partitioning scheme
    > that is specified by the remote catalog.

    Default: `FLAT`

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies one or more columns or column expressions in the dynamic table as the clustering key. Before you specify a clustering
    key for a dynamic table, you should understand micro-partitions. For more information, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

    Note the following when using clustering keys with dynamic tables:

    * Column definitions are required and must be explicitly specified in the statement.
    * By default, Automatic Clustering is not suspended for the new dynamic table, even if Automatic Clustering is suspended for the
      source table.
    * Clustering keys are not intended or recommended for all tables; they typically benefit very large (for example
      multi-terabyte) tables.
    * Specifying CLUSTER BY doesn’t cluster the data at creation time; instead, CLUSTER BY relies on
      Automatic Clustering to recluster the data over time.

    For more information, see [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

    Default: No value (no clustering key is defined for the table)

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the retention period for the dynamic table so that Time Travel actions (SELECT, CLONE) can be performed on historical
    data in the dynamic table. Time Travel behaves the same way for dynamic tables as it behaves for traditional tables. For more
    information, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md).

    Values:

    * Standard Edition: `0` or `1`
    * Enterprise Edition:

      + `0` to `90` for permanent tables
      + `0` or `1` for temporary and transient tables

    Default:

    * Standard Edition: `1`
    * Enterprise Edition (or higher): `1` (unless a different default value was specified at the schema, database, or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the table.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   An object parameter that sets the maximum number of days Snowflake can extend the data retention period to prevent streams on the dynamic
    table from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the dynamic table.

    (Note that comments can be specified at the column level or the table level. The syntax for each is slightly different.)

    Default: No value.

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when a new dynamic table is created using any of the following CREATE DYNAMIC TABLE variants:

    * CREATE OR REPLACE DYNAMIC TABLE
    * CREATE OR REPLACE DYNAMIC ICEBERG TABLE
    * CREATE OR REPLACE DYNAMIC TABLE … CLONE

    This parameter copies all privileges except OWNERSHIP from the existing dynamic table to the new dynamic table. The new dynamic
    table does not inherit any future grants defined for the object type in the schema. By default, the role that executes the
    CREATE DYNAMIC TABLE statement owns the new dynamic table.

    If this parameter is not included in the CREATE DYNAMIC TABLE statement, then the new table does not inherit any explicit access
    privileges granted on the original dynamic table, but does inherit any future grants defined for the object type in the schema.

    If the statement is replacing an existing table of the same name, then the grants are copied from the table being replaced. If there is
    no existing table of that name, then the grants are copied.

    For example, the following statement creates a dynamic table `dt1` cloned from `dt0` with all grants copied from `dt0`. The first
    time you run the command, `dt1` copies all grants from `dt0`. If you run the same command again, `dt1` will copy all grants from
    `dt1` and not `dt0`.

    ```sqlexample
    CREATE OR REPLACE DYNAMIC TABLE dt1 CLONE dt0
      COPY GRANTS;
    ```

    Note the following:

    * With [data sharing](../../guides-overview-sharing.md):

      + If the existing dynamic table was shared to another account, the replacement dynamic table is also shared.
      + If the existing dynamic table was shared with your account as a data consumer, and access was further granted to other roles in
        the account (using `GRANT IMPORTED PRIVILEGES` on the parent database), access is also granted to the replacement dynamic
        table.
    * The [SHOW GRANTS](show-grants.md) output for the replacement dynamic table lists the grantee for the copied privileges as the
      role that executed the CREATE TABLE statement, with the current timestamp when the statement was executed.
    * The operation to copy grants occurs atomically in the CREATE DYNAMIC TABLE command (i.e. within the same transaction).

    > **Important:**
    >
    > The COPY GRANTS parameter can be placed anywhere in a CREATE [ OR REPLACE ] DYNAMIC TABLE command, except after the query
    > definition.
    >
    > For example, the following dynamic table will fail to create:
    >
    > ```sqlexample
    > CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
    >   TARGET_LAG = DOWNSTREAM
    >   WAREHOUSE = mywh
    >   AS
    >     SELECT * FROM staging_table
    >     COPY GRANTS;
    > ```

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a dynamic table.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`AGGREGATION POLICY policy_name [ ENTITY KEY ( col_name [ , col_name ... ] ) ]`
:   Specifies an [aggregation policy](../../user-guide/aggregation-policies.md) to set on a dynamic table. You can apply one or more aggregation
    policies on a table.

    Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the dynamic table. For more information,
    see [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md). You can specify one or more entity keys for an aggregation policy.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`REQUIRE USER`
:   When specified, the dynamic table cannot run unless a user is specified. The dynamic table is not able to refresh unless a user is set
    in a manual refresh with the [COPY SESSION](alter-dynamic-table.md) parameter specified.

    If this option is enabled, the dynamic table must be created with the ON_SCHEDULE parameter for
    `INITIALIZE`.

`IMMUTABLE WHERE`
:   Specifies a condition that defines the immutable portion of the dynamic table. For more information, see [Understanding immutability constraints](../../user-guide/dynamic-tables-immutability-constraints.md).

`BACKFILL FROM <name>`
:   Specifies the table to backfill data from.

    Only data defined by the [IMMUTABLE WHERE immutability constraint](../../user-guide/dynamic-tables-immutability-constraints.md) can be backfilled because
    the backfill data must remain unchanged, even if it differs from the upstream source.

    For more information, see [Backfill examples](../../user-guide/dynamic-tables-performance-optimize-immutability.md).

`EXECUTE AS USER user_name`
:   Refreshes the dynamic table as the specified user.

    To specify EXECUTE AS USER, you must use a role that has been granted the IMPERSONATE privilege on the `user_name` user. To grant this privilege,
    run the [GRANT <privileges> … TO ROLE](grant-privilege.md) command.

    `USE SECONDARY ROLES { ALL | NONE | <role> [ , ... ] }`
    :   Specifies the secondary roles to use on the dynamic table. Can be used to override the default secondary roles that are otherwise used in execution.

        Can only be used with the EXECUTE AS USER option.

    For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../user-guide/dynamic-tables-privileges.md).

`ROW_TIMESTAMP = { TRUE | FALSE }`
:   Specifies whether to enable row timestamps on the table. You must use a role with the OWNERSHIP privilege.

    For more information, see [Use row timestamps to measure latency in your pipelines](../../user-guide/data-engineering/row-timestamps.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE DYNAMIC TABLE | Schema in which you plan to create the dynamic table. |  |
| SELECT | Tables, views, and dynamic tables that you plan to query for the new dynamic table. |  |
| USAGE | Warehouse that you plan to use to refresh the table. |  |
| IMPERSONATE | User specified in EXECUTE AS USER | To refresh the dynamic table as a user, you must use a role that has been granted the IMPERSONATE privilege on that user. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When you execute the CREATE DYNAMIC TABLE command, the current role in use becomes
  the owner of the dynamic table. This role is used to perform refreshes of the dynamic
  table in the background.
* You cannot make changes to the schema after you create a dynamic table.
* Dynamic tables are updated as underlying database objects change. Change tracking must
  be enabled on all underlying objects used by a dynamic table. See
  [Enable change tracking](../../user-guide/dynamic-tables-create.md).
* If you want to replace an existing dynamic table and need to see its current definition,
  call the [GET_DDL](../functions/get_ddl.md) function.
* Using [ORDER BY](../constructs/order-by.md) in the definition of a dynamic table
  might produce results sorted in an unexpected order. You can use ORDER BY when querying
  your dynamic table to ensure that rows selected return in a specific order.
* Snowflake doesn’t support using ORDER BY to create a view that selects from a dynamic
  table.
* To influence the order in which rows are stored in a dynamic table, consider enabling clustering.
* Some expressions, clauses, and functions are not currently supported in dynamic tables.
  For a complete list, see [Dynamic table limitations](../../user-guide/dynamic-tables-limitations.md).
* You can use `DYNAMIC_TABLE_REFRESH_BOUNDARY()` in the definition query to prevent an upstream dynamic table from being refreshed together
  with this dynamic table. The upstream dynamic table is treated as belonging to a separate pipeline, which means cascading refreshes and
  snapshot isolation do not apply across the boundary. For more information, see [Dynamic table refresh boundary](../../user-guide/dynamic-tables-refresh-boundary.md).
* Using `OR REPLACE` is the equivalent to using DROP DYNAMIC TABLE on the existing dynamic table and then creating a new
  dynamic table with the same name. However, Snowflake doesn’t drop the old dynamic table until it has created the new dynamic table,
  including the initial refresh if `INITIALIZE = ON_CREATE` is specified. Instead, the new dynamic table is created as a
  hidden table, the refresh is run, then Snowflake atomically swaps it in for the existing dynamic table.
* Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER DYNAMIC TABLE usage notes

* All limitations of the [ALTER DYNAMIC TABLE](alter-dynamic-table.md) command apply.

### Limitations

The following actions *aren’t* supported:

> * Swapping dynamic tables by using the SWAP WITH parameter.
> * Renaming a dynamic table by using the RENAME TO parameter.
> * Creating a clone of a dynamic table by using the CLONE parameter.
> * Suspending or resuming by using the SUSPEND and RESUME parameters.
> * Converting a TRANSIENT dynamic table into a non-TRANSIENT dynamic table, or vice versa.
> * Adding or changing tags and policies. Any existing tags and policies are preserved,
>   and other statements might still add or remove tags and policies.
> * Creating or altering dynamic Apache Iceberg™ tables.
> * Time Travel clone for times that are before the latest definition or refresh mode change.

Additionally, modifying the values for the REFRESH_MODE and INITIALIZE properties after
the dynamic table has been created isn’t supported. You can switch between the `AUTO`
refresh mode and the specific `INCREMENTAL` and `FULL` refresh modes, but doing so
doesn’t change the actual physical refresh mode of the dynamic table.

For example:

* If you create a dynamic table with `AUTO` refresh mode, the system immediately assigns
  a concrete mode (`INCREMENTAL` or `FULL`). When you run a subsequent CREATE OR ALTER
  DYNAMIC TABLE statement, you can specify `AUTO` or the concrete refresh mode that is chosen by
  the engine at creation. However, this doesn’t alter the assigned refresh mode; it
  remains the same.
* If you create a dynamic table with a specific refresh mode (`INCREMENTAL` or `FULL`),
  you can later specify `AUTO` in a CREATE OR ALTER DYNAMIC TABLE statement to enable
  forward compatibility. For example, if your dynamic table was created with `FULL`
  mode and is version-controlled, specifying `AUTO` in a CREATE OR ALTER DYNAMIC TABLE
  statement enables new tables to use `AUTO`, while existing tables remain in `FULL`
  mode without breaking compatibility.

### No implicit refreshes

If you change an existing dynamic table by using the CREATE OR ALTER DYNAMIC TABLE
command, the command doesn’t trigger a refresh of the dynamic table. The dynamic table is
refreshes according to its normal schedule.

However, if you create a new dynamic table by using the CREATE OR ALTER DYNAMIC TABLE
command and you specify `INITIALIZE = ON_CREATE`, the command triggers a refresh of the
dynamic table.

### Atomicity

The CREATE OR ALTER DYNAMIC TABLE command doesn’t guarantee *atomicity*. This means that if
a CREATE OR ALTER DYNAMIC TABLE statement fails during execution, it’s possible that a
subset of changes might have been applied to the table. If there’s a possibility of
partial changes, in most cases, the error message includes the following text:

```output
CREATE OR ALTER execution failed. Partial updates may have been applied.
```

For example, suppose that you wanted to change the `TARGET_LAG` property and add a
clustering key for a dynamic table, but you change your mind and terminate the statement. In
this case, the `TARGET_LAG` property might still change while the clustering key isn’t
applied.

When changes are partially applied, the resulting table is in a valid state. In the
previous example, you can use additional ALTER DYNAMIC TABLE statements to complete the
original set of changes.

To recover from partial updates, try the following recovery methods:

* **Fix forward**: Re-execute the CREATE OR ALTER DYNAMIC TABLE statement. If the
  statement succeeds on the second attempt, the target state is achieved.

  If the statement doesn’t succeed, investigate the error message. If possible, fix the
  error and re-execute the CREATE OR ALTER DYNAMIC TABLE statement.
* **Roll back**: If it isn’t possible to fix forward, manually roll back the partial
  changes:

  + Investigate the state of the table by using the [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md)
    and [SHOW DYNAMIC TABLES](show-dynamic-tables.md) commands. Determine which partial
    changes were applied, if any.

    If partial changes were applied, execute the appropriate ALTER DYNAMIC TABLE
    statements to transform the dynamic table back to its original statement.

For additional help, contact [Snowflake Support](../../user-guide/contacting-support.md).

## IMMUTABLE WHERE usage notes

* You can set only one IMMUTABLE WHERE predicate per dynamic table. Setting another predicate
  replaces the existing one.
* IMMUTABLE WHERE constraints can’t contain the following items:

  + Subqueries
  + Nondeterministic functions (except timestamp functions like CURRENT_TIMESTAMP or
    CURRENT_DATE)
  + User-defined or external functions
  + Metadata columns (those starting with `METADATA$`)
  + Columns that result from aggregates, window functions, or nondeterministic functions
  + Columns that are passed through a window function operator and not a `PARTITION BY` column. Example: `col2` in `SUM(col1) OVER
    (PARTITION BY col1 ORDER BY col2)`
* When you use timestamp functions, the immutable region can’t shrink over time. For example,
  `TIMESTAMP_COL < CURRENT_TIMESTAMP()` is allowed, but
  `TIMESTAMP_COL > CURRENT_TIMESTAMP()` is not.
* Columns referenced in the IMMUTABLE WHERE condition must be columns in the dynamic table,
  not columns from the base table.
* When the dynamic table has both an IMMUTABLE WHERE predicate and at least one primary key or unique constraint
  with the [RELY property](../../user-guide/join-elimination.md), the columns referenced in the IMMUTABLE WHERE
  predicate must be a subset of the columns referenced in the set of all RELY primary key and RELY unique constraints.
  Only RELY constraints are considered. For details, see [Interaction with primary key and unique constraints (RELY)](../../user-guide/dynamic-tables-immutability-constraints.md).
* The following limitations apply when you work with [immutability constraints](../../user-guide/dynamic-tables-immutability-constraints.md)
  and backfilled data:

  + Currently, only regular and dynamic tables can be used for backfilling.
  + You can’t specify policies or tags in the new dynamic table because they are copied from the backfill table.
  + Clustering keys in the new dynamic table and backfill table must be the same.

## Examples

Create a dynamic table named `my_dynamic_table`:

```sqlexample
CREATE OR REPLACE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = '20 minutes'
  WAREHOUSE = mywh
  AS
    SELECT product_id, product_name FROM staging_table;
```

In the example above:

* The dynamic table materializes the results of a query of the `product_id` and `product_name` columns of the
  `staging_table` table.
* The target lag time is 20 minutes, which means that the data in the dynamic table should ideally be no more than 20 minutes
  older than the data in `staging_table`.
* The automated refresh process uses the compute resources in warehouse `mywh` to refresh the data in the dynamic table.

Create a dynamic Iceberg table named `my_dynamic_table` that reads from `my_iceberg_table`:

```sqlexample
CREATE DYNAMIC ICEBERG TABLE my_dynamic_table (date TIMESTAMP_NTZ, id NUMBER, content STRING)
  TARGET_LAG = '20 minutes'
  WAREHOUSE = mywh
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_iceberg_table'
  AS
    SELECT product_id, product_name FROM staging_table;
```

Create a dynamic table with a multi-column clustering key:

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table (date TIMESTAMP_NTZ, id NUMBER, content VARIANT)
  TARGET_LAG = '20 minutes'
  WAREHOUSE = mywh
  CLUSTER BY (date, id)
  AS
    SELECT product_id, product_name FROM staging_table;
```

Clone a dynamic table as it existed exactly at the date and time of the specified timestamp:

```sqlexample
CREATE DYNAMIC TABLE my_cloned_dynamic_table CLONE my_dynamic_table AT (TIMESTAMP => TO_TIMESTAMP_TZ('04/05/2013 01:02:03', 'mm/dd/yyyy hh24:mi:ss'));
```

Configure a dynamic table to require a user for refreshes and then refresh the dynamic table:

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = 'DOWNSTREAM'
  WAREHOUSE = mywh
  INITIALIZE = on_schedule
  REQUIRE USER
  AS
    SELECT product_id, product_name FROM staging_table;
```

```sqlexample
ALTER DYNAMIC TABLE my_dynamic_table REFRESH COPY SESSION;
```

Create a dynamic table with an immutability constraint:

```sqlexample
CREATE DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = '1 hour'
  WAREHOUSE = my_warehouse
  IMMUTABLE WHERE (ts < CURRENT_TIMESTAMP() - INTERVAL '1 day')
AS
  SELECT * FROM source_table;
```

Create a dynamic table by using the CREATE OR ALTER DYNAMIC TABLE command:

```sqlexample
CREATE OR ALTER DYNAMIC TABLE my_dynamic_table
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = mywh
  AS
    SELECT a, b FROM t;
```

> **Note:**
>
> CREATE OR ALTER TABLE statements for existing tables can only be executed by a role with the OWNERSHIP privilege on `my_dynamic_table`.

Alter a dynamic table to set the DATA_RETENTION_TIME_IN_DAYS parameter and add a clustering key:

```sqlexample
CREATE OR ALTER DYNAMIC TABLE my_dynamic_table
 TARGET_LAG = DOWNSTREAM
 WAREHOUSE = mywh
 DATA_RETENTION_TIME_IN_DAYS = 2
 CLUSTER BY (a)
 AS
   SELECT a, b FROM t;
```

Modify the target lag and change the warehouse:

```sqlexample
CREATE OR ALTER DYNAMIC TABLE my_dynamic_table
 TARGET_LAG = '5 minutes'
 WAREHOUSE = my_other_wh
 DATA_RETENTION_TIME_IN_DAYS = 2
 CLUSTER BY (a)
 AS
   SELECT a, b FROM t;
```

Unset the DATA_RETENTION_TIME_IN_DAYS parameter. The absence of a parameter in the
modified CREATE OR ALTER DYNAMIC TABLE statement results in unsetting it. In this case,
unsetting the DATA_RETENTION_TIME_IN_DAYS parameter for the dynamic table resets it to
the default value of 1:

```sqlexample
CREATE OR ALTER DYNAMIC TABLE my_dynamic_table
 TARGET_LAG = '5 minutes'
 WAREHOUSE = my_other_wh
 CLUSTER BY (a)
 AS
   SELECT a, b FROM t;
```

**Write a v3 Snowflake-managed Iceberg table**

The following example writes a v3 Snowflake-managed Iceberg table as the output of a dynamic table:

```sqlexample
CREATE DYNAMIC ICEBERG TABLE my_dynamic_iceberg_v3_table (
    num_orders NUMBER(10,0),
    order_day
  )
  TARGET_LAG = '20 minutes'
  WAREHOUSE = my_warehouse
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my_dynamic_iceberg_v3_table'
  ICEBERG_VERSION = 3
  AS
    SELECT
        COUNT(DISTINCT order_id)
        DATE_TRUNC('DAY', order_timestamp_ns) AS order_day
      FROM staging_v3_iceberg_table;
```

> **Note:**
>
> Writing either a v2 or v3 externally managed Iceberg table as the target of a dynamic table isn’t supported. The output of a dynamic
> Iceberg table can only be Snowflake-managed.

---
title: CREATE EVENT TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-event-table.md
section: SQL Commands
---

# CREATE EVENT TABLE

Creates an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) that captures events, including logged messages
from functions and procedures.

See also:
:   [ALTER TABLE (event tables)](alter-table-event-table.md) , [DESCRIBE EVENT TABLE](desc-event-table.md), [DROP TABLE](drop-table.md),
    [SHOW EVENT TABLES](show-event-tables.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] EVENT TABLE [ IF NOT EXISTS ] <name>
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COPY GRANTS ]
  [ [ WITH ] COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

## Variant syntax

### CREATE EVENT TABLE … CLONE

Creates a new event table with the same [predefined column definitions](../../developer-guide/logging-tracing/event-table-columns.md) and
containing all the existing data from the source table without actually copying the data. You can also use this variant to clone an
event table at a specific time/point in the past (using [Time Travel](../../user-guide/data-time-travel.md)):

```sqlsyntax
CREATE [ OR REPLACE ] EVENT TABLE [ IF NOT EXISTS ] <name>
  CLONE <source_table>
    [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
    [ COPY GRANTS ]
    [ ... ]
```

> **Note:**
>
> If the statement replaces an event table of the same name, the grants are copied from the event table
> being replaced. Otherwise, the grants are copied from the source event table being cloned.

For more details about COPY GRANTS, see COPY GRANTS in this document.

For more details about cloning, see [CREATE <object> … CLONE](create-clone.md).

## Required parameters

`name`
:   Specifies the identifier (the name) for the event table; must be unique for the schema in which the event table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`source_table`
:   Required for CLONE.

    Specifies the event table to use as the source for the clone.

## Optional parameters

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies one or more columns or column expressions in the table as the clustering key. For more details, see
    [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

    Default: No value (no clustering key is defined for the table)

    > **Important:**
    >
    > Clustering keys are not intended or recommended for all tables; they typically benefit very large (i.e. multi-terabyte)
    > tables.
    >
    > Before you specify a clustering key for a table, please read [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the retention period for the table so that Time Travel actions (SELECT, CLONE, UNDROP) can be performed on historical
    data in the table. For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent tables

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the schema, database, or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the table.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the table to
    prevent streams on the table from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`CHANGE_TRACKING = TRUE | FALSE`
:   Specifies whether to enable change tracking on the table.

    * `TRUE` enables change tracking on the table. This setting adds a pair of hidden columns to the source table and begins
      storing change tracking metadata in the columns. These columns consume a small amount of storage.

      The change tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for
      [SELECT](select.md) statements, or by creating and querying one or more streams on the table.
    * `FALSE` does not enable change tracking on the table.

    Default: FALSE

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for the columns in the table.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when a new table is created using any of the following
    CREATE TABLE variants:

    * CREATE OR REPLACE EVENT TABLE
    * CREATE EVENT TABLE … CLONE

    The parameter copies all privileges, except OWNERSHIP, from the existing table to the new table. The new table does not
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE EVENT TABLE statement
    owns the new table.

    If the parameter is not included in the CREATE EVENT TABLE statement, then the new table does not inherit any explicit access
    privileges granted on the original table, but does inherit any future grants defined for the object type in the schema.

    Note:

    > * The [SHOW GRANTS](show-grants.md) output for the replacement table lists the grantee for the copied privileges as the
    >   role that executed the CREATE EVENT TABLE statement, with the current timestamp when the statement was executed.
    > * The operation to copy grants occurs atomically in the CREATE EVENT TABLE command (in other words, within the same transaction).

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a table.

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE EVENT TABLE | Schema in which you plan to create the event table. |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A schema cannot contain event tables, tables, and/or views with the same name. When creating an event table:

  + If a table or view with the same name already exists in the schema, an error is returned and the event table is not created.
  + If an event table with the same name already exists in the schema, an error is returned and the event table is not created,
    unless the optional `OR REPLACE` keyword is included in the command.
  > **Important:**
  >
  > Using `OR REPLACE` is the equivalent of using [DROP TABLE](drop-table.md) on the existing event table and then
  > creating a new event table with the same name; however, the dropped table is not permanently removed from the system.
  > Instead, it is retained in Time Travel. This is important to note because dropped tables in Time Travel can be recovered, but
  > they also contribute to data storage for your account. For more information, see [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md).
  >
  > CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  > This means that any queries concurrent with the CREATE OR REPLACE EVENT TABLE operation use either the old or new table version.
* Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale.
  A stale stream is unreadable.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE EVENT TABLE … CLONE:

  If the source event table has clustering keys, the new event table has clustering keys. By default, Automatic Clustering is suspended
  for the new event table—even if Automatic Clustering was not suspended for the source table.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.

## Examples

Create an event table named `my_events`:

> ```sqlexample
> CREATE EVENT TABLE my_events;
> ```

---
title: CREATE EXPERIMENT
source: https://docs.snowflake.com/en/sql-reference/sql/create-experiment.md
section: SQL Commands
---

# CREATE EXPERIMENT

Creates a new [experiment](../../developer-guide/snowflake-ml/experiments.md) or replaces an existing experiment.

See also:
:   [ALTER EXPERIMENT](alter-experiment.md) , [SHOW EXPERIMENTS](show-experiments.md) , [DROP EXPERIMENT](drop-experiment.md) , [SHOW RUNS IN EXPERIMENT](show-runs-in-experiment.md) , [SHOW RUN … IN EXPERIMENT](show-run-in-experiment.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] EXPERIMENT [ IF NOT EXISTS ] <name>
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the experiment; must be unique for the schema in which the experiment is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE EXPERIMENT | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

---
title: CREATE EXTERNAL ACCESS INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-external-access-integration.md
section: SQL Commands
---

# CREATE EXTERNAL ACCESS INTEGRATION

Creates an [external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md) for access
to external network locations from a UDF or procedure handler.

See also:
:   [ALTER EXTERNAL ACCESS INTEGRATION](alter-external-access-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] EXTERNAL ACCESS INTEGRATION <name>
  ALLOWED_NETWORK_RULES = ( <rule_name_1> [, <rule_name_2>, ... ] )
  [ ALLOWED_API_AUTHENTICATION_INTEGRATIONS = { ( <integration_name_1> [, <integration_name_2>, ... ] ) | none } ]
  [ ALLOWED_AUTHENTICATION_SECRETS = { ( <secret_name_1> [, <secret_name_2>, ... ] ) | all | none } ]
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the external access integration.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`ALLOWED_NETWORK_RULES = (rule_name [ , rule_name ... ])`
:   Specifies the allowed [network rules](create-network-rule.md). Only egress rules may be specified.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this integration is enabled or disabled. If the integration is disabled, any handler code that relies
    on it will be unable to reach the external network location.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

`ALLOWED_API_AUTHENTICATION_INTEGRATIONS = ( integration_name_1 [, integration_name_2, ... ] ) | none`
:   Specifies the security integrations whose OAuth authorization server issued the secret used by the UDF or procedure. The security
    integration must be [the type used for external API integration](create-security-integration-api-auth.md).

    This parameter’s value must be one of the following:

    * One or more Snowflake security integration names to allow any of the listed integrations.
    * `none` to allow no integrations.

    Security integrations specified by this parameter – as well as secrets specified by the ALLOWED_AUTHENTICATION_SECRETS parameter – are
    ways to allow secrets for use in a UDF or procedure that uses this external access integration. For more information, see
    Usage notes.

    For reference information about security integrations, refer to [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md).

`ALLOWED_AUTHENTICATION_SECRETS = ( secret_name [, secret_name ... ] ) | all | none`
:   Specifies the secrets that UDF or procedure handler code can use when accessing the external network locations referenced in allowed
    network rules.

    This parameter’s value must be one of the following:

    * One or more Snowflake secret names to allow any of the listed secrets.
    * `all` to allow any secret.
    * `none` to allow no secrets.

    The ALLOWED_API_AUTHENTICATION_INTEGRATIONS parameter can also specify allowed secrets. For more information, see
    Usage notes.

    For reference information about secrets, refer to [CREATE SECRET](create-secret.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the external access integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| USAGE | Secret | Required for all secrets referenced by the integration. |
| USAGE | Schema | Required for all schemas containing any secrets referenced by the integration. |
| CREATE EXTERNAL ACCESS INTEGRATION | Account | Grants the ability to create external access integrations. This privilege does not grant the ability to create other types of integrations. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You can allow secrets for use by a UDF or procedure by using two external access integration parameters, as described below.

  + With the ALLOWED_AUTHENTICATION_SECRETS parameter. You can specify secrets as parameter values or set the parameter’s value to
    `all`, allowing handler code to use any secret.
  + With the ALLOWED_API_AUTHENTICATION_INTEGRATIONS parameter. A [secret](create-secret.md) is allowed for use when
    the secret itself specifies a security integration whose name is also specified by this parameter. The secret specifies the security
    integration with its API_AUTHENTICATION parameter. In other words, when both the secret and the external access integration specify
    the security integration, the secret is allowed for use in functions and procedures that specify the external access integration.

  Note that these two alternatives function independently of one another. A secret is allowed if either (or both) of the parameters allows
  it, regardless of the value specified for the other parameter. For example, setting one of the parameters to `none` does not
  prevent a secret specified by the other parameter from being used in handler code.
* While you can specify network rules using a hostname, Snowflake enforces the rules at the IP level of granularity. Snowflake will not
  inspect your application’s traffic, so it is your responsibility to ensure that the external location’s host has the authentic
  service and that it is not possible to connect to other services on the same host. Whenever possible, you should use secure protocols
  such as HTTPS and TLS when communicating with internet endpoints.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create an external access integration that provides access to the Google Translation API.

For a more complete example, refer to [Creating and using an external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

1. Create a secret representing credentials.

   To create a secret, you must have been assigned a role with the CREATE SECRET privilege on the current schema. For other kinds of
   secret supported by this command, refer to [CREATE SECRET](create-secret.md). In this example, `google_translate_oauth`
   refers to a security integration. For more information, refer to [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md).

   ```sqlexample
   CREATE OR REPLACE SECRET oauth_token
     TYPE = OAUTH2
     API_AUTHENTICATION = google_translate_oauth
     OAUTH_REFRESH_TOKEN = 'my-refresh-token';
   ```
2. Grant READ privileges on the secret to the `developer` role so that UDF developers can use it.

   Create the role that will be required for developers needing to use the secret.

   ```sqlexample
   USE ROLE USERADMIN;
   CREATE OR REPLACE ROLE developer;
   ```

   Grant the READ privilege to the `developer` role.

   ```sqlexample
   USE ROLE SECURITYADMIN;
   GRANT READ ON SECRET oauth_token TO ROLE developer;
   ```
3. Create a network rule representing the external network location. Use a role with the privileges described in
   [CREATE NETWORK RULE](create-network-rule.md).

   ```sqlexample
   USE ROLE SYSADMIN;
   CREATE OR REPLACE NETWORK RULE google_apis_network_rule
     MODE = EGRESS
     TYPE = HOST_PORT
     VALUE_LIST = ('translation.googleapis.com');
   ```
4. Create an external access integration using the secret and network rule.

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION google_apis_access_integration
     ALLOWED_NETWORK_RULES = (google_apis_network_rule)
     ALLOWED_AUTHENTICATION_SECRETS = (oauth_token)
     ENABLED = true;
   ```
5. Grant USAGE privileges on the integration to the `developer` role so that UDF developers can use it.

   ```sqlexample
   GRANT USAGE ON INTEGRATION google_apis_access_integration TO ROLE developer;
   ```
6. Create a UDF `google_translate_python` that translates the specified text into a phrase in the specified language. For more
   information, refer to [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md).

   ```sqlexample
   USE ROLE developer;

   CREATE OR REPLACE FUNCTION google_translate_python(sentence STRING, language STRING)
   RETURNS STRING
   LANGUAGE PYTHON
   RUNTIME_VERSION = 3.10
   HANDLER = 'get_translation'
   EXTERNAL_ACCESS_INTEGRATIONS = (google_apis_access_integration)
   PACKAGES = ('snowflake-snowpark-python','requests')
   SECRETS = ('cred' = oauth_token )
   AS
   $$
   import _snowflake
   import requests
   import json
   session = requests.Session()
   def get_translation(sentence, language):
     token = _snowflake.get_oauth_access_token('cred')
     url = "https://translation.googleapis.com/language/translate/v2"
     data = {'q': sentence,'target': language}
     response = session.post(url, json = data, headers = {"Authorization": "Bearer " + token})
     return response.json()['data']['translations'][0]['translatedText']
   $$;
   ```
7. Grant the USAGE privilege on the `google_translate_python` function so that those with the user role can call it.

   ```sqlexample
   GRANT USAGE ON FUNCTION google_translate_python(string, string) TO ROLE user;
   ```
8. Execute the `google_translate_python` function to translate a phrase.

   ```sqlexample
   USE ROLE user;
   SELECT google_translate_python('Happy Thursday!', 'zh-CN');
   ```

   This generates the following output.

   ```output
   -------------------------------------------------------
   | GOOGLE_TRANSLATE_PYTHON('HAPPY THURSDAY!', 'ZH-CN') |
   -------------------------------------------------------
   | 快乐星期四！                                          |
   -------------------------------------------------------
   ```

---
title: CREATE EXTERNAL FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/create-external-function.md
section: SQL Commands
---

# CREATE EXTERNAL FUNCTION

Creates a new [external function](../external-functions.md).

This command supports the following variants:

* CREATE OR ALTER EXTERNAL FUNCTION: Creates an external function if it doesn’t exist or alters an existing external function.

See also:
:   [ALTER FUNCTION](alter-function.md) , [SHOW EXTERNAL FUNCTIONS](show-external-functions.md) ,
    [DROP FUNCTION](drop-function.md) , [DESCRIBE FUNCTION](desc-function.md) ,
    [CREATE API INTEGRATION](create-api-integration.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] EXTERNAL FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS <result_data_type>
  [ [ NOT ] NULL ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ COMMENT = '<string_literal>' ]
  API_INTEGRATION = <api_integration_name>
  [ HEADERS = ( '<header_1>' = '<value_1>' [ , '<header_2>' = '<value_2>' ... ] ) ]
  [ CONTEXT_HEADERS = ( <context_function_1> [ , <context_function_2> ...] ) ]
  [ MAX_BATCH_ROWS = <integer> ]
  [ COMPRESSION = <compression_type> ]
  [ REQUEST_TRANSLATOR = <request_translator_udf_name> ]
  [ RESPONSE_TRANSLATOR = <response_translator_udf_name> ]
  AS '<url_of_proxy_and_resource>';
```

## Variant syntax

### CREATE OR ALTER EXTERNAL FUNCTION

Creates a new external function if it doesn’t already exist, or transforms an existing external function into the
function defined in the statement. A CREATE OR ALTER EXTERNAL FUNCTION statement follows the syntax rules of a
CREATE EXTERNAL FUNCTION statement and has the same limitations as an [ALTER FUNCTION](alter-function.md)
statement.

Supported function alterations include changes to the following:

* API_INTEGRATION
* COMMENTS
* COMPRESSION
* CONTEXT_HEADERS
* HEADERS
* MAX_BATCH_ROWS
* RESPONSE_TRANSLATOR
* REQUEST_TRANSLATOR
* SECURE

For more information, see CREATE OR ALTER EXTERNAL FUNCTION usage notes.

```sqlsyntax
CREATE [ OR ALTER ] EXTERNAL FUNCTION ...
```

## Required parameters

`name`:
:   Specifies the identifier for the function.

    The identifier can contain the schema name and database name, as well as the function name.

    The identifier does not need to be unique for the schema in which the function is created because functions are
    identified and resolved by their name and argument types. However, the signature (name and argument data types)
    must be unique within the schema.

    The `name` must follow the rules for Snowflake [identifiers](../identifiers.md).
    For more details, see [Identifier requirements](../identifiers-syntax.md).

    Setting `name` the same as the remote service name can make the relationship more clear.
    However, this is not required.

`( [ arg_name arg_data_type ] [ , ... ] )`
:   Specifies the arguments/inputs for the external function. These should correspond to the arguments that the remote
    service expects.

    If there are no arguments, then include the parentheses without any argument name(s) and data type(s).

`RETURNS result_data_type`
:   Specifies the data type returned by the function.

`API_INTEGRATION = api_integration_name`
:   This is the name of the API integration object that should be used to authenticate the call to the proxy service.

`AS 'url_of_proxy_and_resource'`
:   This is the invocation URL of the proxy service (e.g. API Gateway or API Management service) and resource through
    which Snowflake calls the remote service.

## Optional parameters

`SECURE`
:   Specifies that the function is secure. If a function is secure, the URL, the HTTP headers, and the context headers
    are hidden from all users who are not owners of the function.

`[ [ NOT ] NULL ]`
:   This clause indicates whether the function can return NULL values or must return only NON-NULL values.
    If `NOT NULL` is specified, the function must return only non-NULL values. If `NULL` is specified, the
    function can return NULL values.

    Default: The default is NULL (i.e. the function can return NULL values).

`CALLED ON NULL INPUT` or . `{ RETURNS NULL ON NULL INPUT | STRICT }`
:   Specifies the behavior of the function when called with null inputs. In contrast to system-defined functions,
    which always return null when any input is null, external functions can handle null inputs,
    returning non-null values even when an input is null:

    > * `CALLED ON NULL INPUT` will always call the function with null inputs. It is up to the function to
    >   handle such values appropriately.
    > * `RETURNS NULL ON NULL INPUT` (or its synonym `STRICT`) will not call the function if any input
    >   is null. Instead, a null value will always be returned for that row. Note that the function might
    >   still return null for non-null inputs.

    Default: `CALLED ON NULL INPUT`

`{ VOLATILE | IMMUTABLE }`
:   Specifies the behavior of the function when returning results:

    > * `VOLATILE`: The function can return different values for different rows, even for the same input (e.g.
    >   due to non-determinism and statefulness).
    > * `IMMUTABLE`: The function always returns the same result when called with the same input.
    >   Snowflake does not check or guarantee this; the remote service must be designed to behave this way.
    >   Specifying `IMMUTABLE` for a function that actually returns different values for the same input will
    >   result in undefined behavior.

    Default: `VOLATILE`

    Snowflake recommends that you set this explicitly rather than accept the default. Setting this
    explicitly reduces the chance of error, and tells users how the function behaves.
    (The [SHOW EXTERNAL FUNCTIONS](show-external-functions.md) command shows whether a function is volatile or immutable.)

    For important additional information about VOLATILE vs. IMMUTABLE external functions, see
    [Categorize your function as volatile or immutable](../external-functions-best-practices.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the function, which is displayed in the DESCRIPTION column in the
    [SHOW FUNCTIONS](show-functions.md) and [SHOW EXTERNAL FUNCTIONS](show-external-functions.md) output.

    Default: `user-defined function`

`HEADERS = ( 'header_1' = 'value_1' [ , 'header_2' = 'value_2' ... ] )`
:   This clause allows users to specify key-value metadata that is sent with every request.
    The creator of the external function decides what goes into the headers, and the caller does not have any control
    over it. Snowflake prepends all of the specified header names with the prefix “sf-custom-”, and sends them as HTTP
    headers.

    The value must be a constant string, not an expression.

    Here’s an example:

    ```sqlexample
    HEADERS = (
        'volume-measure' = 'liters',
        'distance-measure' = 'kilometers'
    )
    ```

    This causes Snowflake to add 2 HTTP headers into every HTTPS request:
    `sf-custom-volume-measure` and `sf-custom-distance-measure`, with their corresponding values.

    The rules for header names are different from the rules for Snowflake database identifiers. Header names can be
    composed of most visible standard ASCII characters (decimal 32 - 126) except the following:

    * the space character
    * `(`
    * `)`
    * `,`
    * `/`
    * `:`
    * `;`
    * `<`
    * `>`
    * `=`
    * `"`
    * `?`
    * `@`
    * `[`
    * `]`
    * `\`
    * `{`
    * `}`
    * `_`

    Note specifically that the underscore character is not allowed in header names.

    The header name and value are delimited by single quotes, so any single quotes inside the header name or value
    must be escaped with the backslash character.

    If the backslash character is used as a literal character inside a header value, it must be escaped.

    In header values, both spaces and tabs are allowed, but header values should not contain more than one whitespace
    character in a row. This restriction applies to combinations of whitespace characters (e.g. a space followed by a
    tab) as well as individual whitespace characters (e.g. two spaces in a row).

    If the function author marks the function as secure (with `CREATE SECURE EXTERNAL FUNCTION...`), then the
    headers, the context headers, the binary context headers, and the URL are not visible to function users.

    The sum of the sizes of the header names and header values for an external function must be less than or equal
    to 8 KB.

`CONTEXT_HEADERS = ( context_function_1 [ , context_function_2 ...] )`
:   This is similar to HEADERS, but instead of using constant strings, it binds Snowflake context function results to HTTP headers.
    (For more information about Snowflake context functions, see: [Context functions](../functions-context.md).)

    Not all context functions are supported in context headers. The following are supported:

    * CURRENT_ACCOUNT()
    * CURRENT_CLIENT()
    * CURRENT_DATABASE()
    * CURRENT_DATE()
    * CURRENT_IP_ADDRESS()
    * CURRENT_REGION()
    * CURRENT_ROLE()
    * CURRENT_SCHEMA()
    * CURRENT_SCHEMAS()
    * CURRENT_SESSION()
    * CURRENT_STATEMENT()
    * CURRENT_TIME()
    * CURRENT_TIMESTAMP()
    * CURRENT_TRANSACTION()
    * CURRENT_USER()
    * CURRENT_VERSION()
    * CURRENT_WAREHOUSE()
    * LAST_QUERY_ID()
    * LAST_TRANSACTION()
    * LOCALTIME()
    * LOCALTIMESTAMP()

    When function names are listed in the CONTEXT_HEADERS clause, the function names should not be quoted.

    Snowflake prepends `sf-context` to the header before it is written to the HTTP request.

    Example:

    ```sqlexample
    CONTEXT_HEADERS = (current_timestamp)
    ```

    In this example, Snowflake writes the header `sf-context-current-timestamp` into the HTTP request.

    The characters allowed in context header names and values are the same as the characters allowed in
    custom header names and values.

    Context functions can generate characters that are illegal in HTTP header values, including (but not limited to):

    * newline
    * `Ä`
    * `Î`
    * `ß`
    * `ë`
    * `¬`
    * `±`
    * `©`
    * `®`

    Snowflake replaces each sequence of one or more illegal characters with one space character. (The replacement
    is per sequence, not per character.)

    For example, suppose that the context function CURRENT_STATEMENT() returns:

    ```sqlexample
    select
      /*ÄÎßë¬±©®*/
      my_external_function(1);
    ```

    The value sent in `sf-context-current-statement` is:

    ```sqlexample
    select /* */ my_external_function(1);
    ```

    To ensure that remote services can access the original result (with illegal characters) from the context function
    even if illegal characters have been replaced, Snowflake also sends a binary context header that contains the
    context function result encoded in [base64](../binary-input-output.md).

    In the example above, the value sent in the base64 header is the result of calling:

    ```sqlexample
    base64_encode('select\n/ÄÎßë¬±©®/\nmy_external_function(1)')
    ```

    The remote service is responsible for decoding the base64 value if needed.

    Each such base64 header is named according to the following convention:

    ```sqlsyntax
    sf-context-<context-function>-base64
    ```

    In the example above, the name of the header would be

    ```none
    sf-context-current-statement-base64
    ```

    If no context headers are sent, then no base64 context headers are sent.

    If the rows sent to an external function are split across multiple batches, then all batches contain the same
    context headers and the same binary context headers.

`MAX_BATCH_ROWS = integer`
:   This specifies the maximum number of rows in each batch sent to the proxy service.

    The purpose of this parameter is to limit batch sizes for remote services that have memory constraints or other
    limitations. This parameter is not a performance tuning parameter. This parameter specifies a maximum
    size, not a recommended size.

    If you do not specify MAX_BATCH_ROWS, Snowflake estimates the optimal batch size and uses that.

    Snowflake recommends leaving this parameter unset unless the remote service requires a limit.

`COMPRESSION = compression_type`
:   If this clause is specified, the JSON payload is compressed when sent from Snowflake to the proxy service, and when
    sent back from the proxy service to Snowflake.

    Valid values are:

    * `NONE`.
    * `GZIP`.
    * `DEFLATE`.
    * `AUTO`.

      + On AWS, `AUTO` is equivalent to `GZIP`.
      + On Azure, `AUTO` is equivalent to `NONE`.
      + On GCP, `AUTO` is equivalent to `NONE`.

    The Amazon API Gateway automatically compresses/decompresses requests. For more information about
    Amazon API Gateway compression and decompression, see:
    <https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-gzip-compression-decompression.html>

    For information about compression and decompression for other cloud platform proxy services, see the documentation
    for those cloud platforms.

    Default: The default is `AUTO`.

`REQUEST_TRANSLATOR = request_translator_udf_name`
:   This specifies the name of the request translator function. For more information, see [Using request and response translators with data for a remote service](../external-functions-translators.md).

`RESPONSE_TRANSLATOR = response_translator_udf_name`
:   This specifies the name of the response translator function. For more information, see [Using request and response translators with data for a remote service](../external-functions-translators.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FUNCTION | Schema | Operating on functions also requires the USAGE privilege on the parent database and schema. |
| Either OWNERSHIP or USAGE | API integration | Required to create external functions that reference an API integration. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* When compression is used, Snowflake sets the HTTP headers “Content-Encoding” and “Accept-Encoding”.
* The argument type(s) and the return type cannot be GEOGRAPHY.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER EXTERNAL FUNCTION usage notes

* Alterations to the function definition and its return type are not supported.

## Examples

### Create an external function through an Amazon API Gateway proxy service

The following example shows a CREATE EXTERNAL FUNCTION statement that is called through an Amazon API Gateway
proxy service:

```sqlexample
CREATE OR REPLACE EXTERNAL FUNCTION local_echo(string_col VARCHAR)
  RETURNS VARIANT
  API_INTEGRATION = demonstration_external_api_integration_01
  AS 'https://xyz.execute-api.us-west-2.amazonaws.com/prod/remote_echo';
```

In this example:

* `local_echo` is the name called from a SQL statement (for example, you can execute
  `SELECT local_echo(varchar_column) ...;`).
* `string_col VARCHAR` contains the name and data type of the input parameter(s). An external function can have 0 or more
  input parameters.
* `variant` is the data type of the value returned by the external function.
* The name `demonstration_external_api_integration_01` is the name of the API integration created earlier in a
  [CREATE API INTEGRATION](create-api-integration.md) statement.
* The URL `https://xyz.execute-api.us-west-2.amazonaws.com/prod/remote_echo` is the string that identifies the proxy service and
  resource. An HTTP POST command is sent to this URL.

### Alter an external function using the CREATE OR ALTER EXTERNAL FUNCTION command

Alter the external function `local_echo` created above to set the maximum number of batch rows to 100, compression
to GZIP, and add heads and a context header:

```sqlexample
CREATE OR ALTER SECURE EXTERNAL FUNCTION local_echo(string_col VARCHAR)
  RETURNS VARIANT
  API_INTEGRATION = demonstration_external_api_integration_01
  HEADERS = ('header_variable1'='header_value', 'header_variable2'='header_value2')
  CONTEXT_HEADERS = (current_account)
  MAX_BATCH_ROWS = 100
  COMPRESSION = "GZIP"
  AS 'https://xyz.execute-api.us-west-2.amazonaws.com/prod/remote_echo';
```

---
title: CREATE EXTERNAL TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-external-table.md
section: SQL Commands
---

# CREATE EXTERNAL TABLE

Creates a new [external table](../../user-guide/tables-external-intro.md) in the current or specified schema
or replaces an existing external table. When queried, an external table reads
data from a set of one or more files in a specified external stage, and then outputs the data in a single VARIANT column.

Additional columns can be defined, with each column definition consisting of a name, data type, and optionally whether the column requires
a value (NOT NULL) or has any referential integrity constraints (such as primary key, foreign key). For more information, see the usage notes.

See also:
:   [ALTER EXTERNAL TABLE](alter-external-table.md) , [DROP EXTERNAL TABLE](drop-external-table.md) , [SHOW EXTERNAL TABLES](show-external-tables.md) , [DESCRIBE EXTERNAL TABLE](desc-external-table.md)

## Syntax

```sqlsyntax
-- Partitions computed from expressions
CREATE [ OR REPLACE ] EXTERNAL TABLE [IF NOT EXISTS]
  <table_name>
    ( [ <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ]
      [ inlineConstraint ]
      [ , <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ... ]
      [ , ... ] )
  cloudProviderParams
  [ PARTITION BY ( <part_col_name> [, <part_col_name> ... ] ) ]
  [ WITH ] LOCATION = externalStage
  [ REFRESH_ON_CREATE =  { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ PATTERN = '<regex_pattern>' ]
  FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET } [ formatTypeOptions ] } )
  [ AWS_SNS_TOPIC = '<string>' ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON (VALUE) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]

-- Partitions added and removed manually
CREATE [ OR REPLACE ] EXTERNAL TABLE [IF NOT EXISTS]
  <table_name>
    ( [ <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ]
      [ inlineConstraint ]
      [ , <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ... ]
      [ , ... ] )
  cloudProviderParams
  [ PARTITION BY ( <part_col_name> [, <part_col_name> ... ] ) ]
  [ WITH ] LOCATION = externalStage
  PARTITION_TYPE = USER_SPECIFIED
  FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET } [ formatTypeOptions ] } )
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON (VALUE) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]

-- Delta Lake
CREATE [ OR REPLACE ] EXTERNAL TABLE [IF NOT EXISTS]
  <table_name>
    ( [ <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ]
      [ inlineConstraint ]
      [ , <col_name> <col_type> AS <expr> | <part_col_name> <col_type> AS <part_expr> ... ]
      [ , ... ] )
  cloudProviderParams
  [ PARTITION BY ( <part_col_name> [, <part_col_name> ... ] ) ]
  [ WITH ] LOCATION = externalStage
  PARTITION_TYPE = USER_SPECIFIED
  FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET } [ formatTypeOptions ] } )
  [ TABLE_FORMAT = DELTA ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON (VALUE) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

Where:

> ```sqlsyntax
> inlineConstraint ::=
>   [ NOT NULL ]
>   [ CONSTRAINT <constraint_name> ]
>   { UNIQUE | PRIMARY KEY | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> ] ) ] }
>   [ <constraint_properties> ]
> ```
>
> For additional inline constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> cloudProviderParams (for Google Cloud Storage) ::=
>   [ INTEGRATION = '<integration_name>' ]
>
> cloudProviderParams (for Microsoft Azure) ::=
>   [ INTEGRATION = '<integration_name>' ]
> ```
>
> ```sqlsyntax
> externalStage ::=
>   @[<namespace>.]<ext_stage_name>[/<path>]
> ```
>
> ```sqlsyntax
> formatTypeOptions ::=
> -- If FILE_FORMAT = ( TYPE = CSV ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      MULTI_LINE = TRUE | FALSE
>      SKIP_HEADER = <integer>
>      SKIP_BLANK_LINES = TRUE | FALSE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      TRIM_SPACE = TRUE | FALSE
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( '<string1>' [ , '<string2>' , ... ] )
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
>      ENCODING = '<string>'
> -- If FILE_FORMAT = ( TYPE = JSON ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      MULTI_LINE = TRUE | FALSE
>      ALLOW_DUPLICATE = TRUE | FALSE
>      STRIP_OUTER_ARRAY = TRUE | FALSE
>      STRIP_NULL_VALUES = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
> -- If FILE_FORMAT = ( TYPE = AVRO ... )
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
> -- If FILE_FORMAT = ( TYPE = ORC ... )
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ]
> -- If FILE_FORMAT = ( TYPE = PARQUET ... )
>      COMPRESSION = AUTO | SNAPPY | NONE
>      BINARY_AS_TEXT = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
> ```

## Variant syntax

### CREATE EXTERNAL TABLE … USING TEMPLATE

Creates a new external table with the column definitions derived from a set of staged files that contain semi-structured data. This feature supports Apache Parquet, Apache Avro, ORC, JSON, and CSV files. The support for CSV and JSON files is currently in preview.

> ```sqlsyntax
> CREATE [ OR REPLACE ] EXTERNAL TABLE <table_name>
>   USING TEMPLATE <query>
>   [ ... ]
>   [ COPY GRANTS ]
> ```

> **Note:**
>
> If the statement is replacing an existing table of the same name, then the grants are copied from the table
> being replaced. If there is no existing table of that name, then the grants are copied from the source table
> being cloned.

For more information about COPY GRANTS, see [COPY GRANTS](create-table.md) in this document.

## Required parameters

`table_name`
:   String that specifies the identifier (that is, name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and can’t contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`[ WITH ] LOCATION =`
:   Specifies the external stage and optional path where the files containing data to be read are staged:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]ext_stage_name[/path]` | Files are in the specified named external stage. |

    Neither string literals nor SQL variables are supported.

    Where:

    > * `namespace` is the database or schema in which the external stage resides, in the form of `database_name.schema_name`
    >   or `schema_name`. It is optional if a database and schema are currently in use within the user session; otherwise, it
    >   is required.
    > * `path` is an optional case-sensitive directory path for files in the cloud storage location that limits the set of files to load.
    >   Paths are alternatively called *prefixes* or *folders* by different cloud storage services.
    >
    >   The external table appends this directory path to any path specified in the stage definition. To view the stage definition,
    >   run `DESC STAGE stage_name` and check the `url` property value. For example, if the stage URL includes
    >   path `a` and the external table location includes path `b`, then the external table reads files staged in
    >   `stage/a/b`.
    >
    >   > **Note:**
    >   > + Specify a full *directory* path, and not a partial path (shared prefix) for files in your storage location (for example, use a path like `@my_ext_stage/2025/`
    >   >   instead of `@my_ext_stage/2025-*`). To filter for files that share a common prefix, use partition columns instead.
    >   > + The `[ WITH ] LOCATION` value cannot reference specific file names. To point an external table to individual
    >   >   staged files, use the `PATTERN` parameter.

`FILE_FORMAT = ( FORMAT_NAME = 'file_format_name' )` or . `FILE_FORMAT = ( TYPE = CSV | JSON | AVRO | ORC | PARQUET [ ... ] )`
:   String (constant) that specifies the file format:

    > `FORMAT_NAME = file_format_name`
    > :   Specifies an existing named file format that describes the staged data files to scan. The named file format determines the format
    >     type (such as, CSV, JSON), and any other format options, for data files.
    >
    > `TYPE = CSV | JSON | AVRO | ORC | PARQUET [ ... ]`
    > :   Specifies the format type of the staged data files to scan when querying the external table.
    >
    >     If a file format type is specified, additional format-specific options can be specified. For more information, see
    >     Format Type Options in this topic.

    Default: `TYPE = CSV`.

    > **Important:**
    >
    > An external table doesn’t inherit FILE_FORMAT options specified in a stage definition when that stage is used for loading data into the table. To specify FILE_FORMAT options, you must explicitly do so in the external table definition. Snowflake uses defaults for any FILE_FORMAT parameters omitted from the external table definition.

    > **Note:**
    >
    > `FORMAT_NAME` and `TYPE` are mutually exclusive; to avoid unintended behavior, only specify one or the other
    > when you create an external table.

## Optional parameters

`col_name`
:   String that specifies the column identifier (that is, name). All the requirements for table identifiers also apply to column identifiers.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`col_type`
:   String (constant) that specifies the data type for the column. The data type must match the result of `expr` for the column.

    For information about the data types that can be specified for table columns, see [SQL data types reference](../../sql-reference-data-types.md).

`expr`
:   String that specifies the expression for the column. When queried, the column returns results derived from this expression.

    External table columns are virtual columns, which are defined by using an explicit expression. Add virtual columns as expressions by using the
    VALUE column or the METADATA$FILENAME pseudocolumn:

    VALUE:
    :   A VARIANT type column that represents a single row in the external file.

        CSV:
        :   The VALUE column structures each row as an object with elements identified by column position (that is,
            `{c1: <column_1_value>, c2: <column_2_value>, c3: <column_1_value> ...}`).

            For example, add a VARCHAR column named `mycol` that references the first column in the staged CSV files:

            ```sqlexample
            mycol varchar as (value:c1::varchar)
            ```

        Semi-structured data:
        :   Enclose element names and values in double-quotes. Traverse the path in the VALUE column by using dot notation.

            Suppose the following example represents a single row of semi-structured data in a staged file:

            ```bash
            { "a":"1", "b": { "c":"2", "d":"3" } }
            ```

            Add a VARCHAR column named `mycol` that references the nested repeating `c` element in the staged file:

            ```sqlexample
            mycol varchar as (value:"b"."c"::varchar)
            ```

    METADATA$FILENAME:
    :   A pseudocolumn that identifies the name of each staged data file that is included in the external table, including its path in the stage. For
        an example, see Partitions Added Automatically From Partition Column Expressions in this topic.

`CONSTRAINT ...`
:   String that defines an inline or out-of-line constraint for the specified columns in the table.

    For syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md). For more information about constraints, see
    [Constraints](../constraints.md).

`REFRESH_ON_CREATE = TRUE | FALSE`
:   Specifies whether to automatically refresh the external table metadata one time, immediately after the external table is created. Refreshing
    the external table metadata synchronizes the metadata with the current list of data files in the specified stage path. This action is
    required for the metadata to register any existing data files in the named external stage specified in the
    `[ WITH ] LOCATION =` setting.

    `TRUE`
    :   Snowflake automatically refreshes the external table metadata one time after creation.

        > **Note:**
        >
        > If the specified location contains close to 1 million files or more, we recommend that you
        > set `REFRESH_ON_CREATE = FALSE`. After you create the external table, refresh the metadata
        > incrementally by running ALTER EXTERNAL TABLE … REFRESH statements that specify subpaths in
        > the location (that is, subsets of files to include in the refresh) until the metadata includes
        > all of the files in the location.

    `FALSE`
    :   Snowflake doesn’t automatically refresh the external table metadata. To register any existing data files in the stage, you must
        manually refresh the external table metadata one time by using [ALTER EXTERNAL TABLE](alter-external-table.md) … REFRESH.

    Default: `TRUE`

`AUTO_REFRESH = TRUE | FALSE`
:   Specifies whether Snowflake should enable triggering automatic refreshes of the external table metadata when new or updated data
    files are available in the named external stage specified in the `[ WITH ] LOCATION =` setting.

    > **Note:**
    >
    > * Setting this parameter to TRUE isn’t supported by partitioned external tables when partitions are added manually by the
    >   object owner (that is, when `PARTITION_TYPE = USER_SPECIFIED`).
    > * Setting this parameter to TRUE isn’t supported for external tables that reference data files
    >   in S3-compatible storage (a storage application or device
    >   that provides an API compliant with the S3 REST API). For more information, see [Work with Amazon S3-compatible storage](../../user-guide/data-load-s3-compatible-storage.md).
    >
    >   You must manually refresh the metadata by running an [ALTER EXTERNAL TABLE … REFRESH](alter-external-table.md) command.
    > * You must configure an event notification for your storage location to notify Snowflake when new or updated data is available
    >   to read into the external table metadata. For more information, see the instructions for your cloud storage service:
    >
    >   + Amazon S3:
    >     :   [Refresh external tables automatically for Amazon S3](../../user-guide/tables-external-s3.md)
    >   + Google Cloud Storage:
    >     :   [Refresh external tables automatically for Google Cloud Storage](../../user-guide/tables-external-gcs.md)
    >   + Microsoft Azure:
    >     :   [Refresh external tables automatically for Azure Blob Storage](../../user-guide/tables-external-azure.md)
    > * When an external table is created, its metadata is refreshed automatically one time unless `REFRESH_ON_CREATE = FALSE`.

    `TRUE`
    :   Snowflake enables triggering automatic refreshes of the external table metadata.

    `FALSE`
    :   Snowflake doesn’t enable triggering automatic refreshes of the external table metadata. You must manually refresh the external table
        metadata periodically by using [ALTER EXTERNAL TABLE](alter-external-table.md) … REFRESH to synchronize the metadata with the current list of files in the
        stage path.

    Default: `TRUE`

`PATTERN = 'regex_pattern'`
:   A regular expression pattern string, enclosed in single quotes, specifying the filenames and paths on the external stage to match.

    > **Tip:**
    >
    > For the best performance, don’t apply patterns that filter on a large number of files.

`AWS_SNS_TOPIC = 'string'`
:   Required only when configuring AUTO_REFRESH for Amazon S3 stages using Amazon Simple Notification Service (SNS). Specifies the
    Amazon Resource Name (ARN) for the SNS topic for your S3 bucket. The CREATE EXTERNAL TABLE statement subscribes the Amazon Simple Queue
    Service (SQS) queue to the specified SNS topic. Event notifications through the SNS topic trigger metadata refreshes. For more information,
    see [Refresh external tables automatically for Amazon S3](../../user-guide/tables-external-s3.md).

`TABLE_FORMAT = DELTA`
:   > **Note:**
    >
    > This feature is still supported but will be deprecated in a future release.
    >
    > Consider using an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) instead. Iceberg tables
    > use an [external volume](../../user-guide/tables-iceberg.md)
    > to connect to Delta table files in your cloud storage.
    >
    > For more information, see [Iceberg tables](../../user-guide/tables-iceberg.md) and [CREATE ICEBERG TABLE (Delta files in object storage)](create-iceberg-table-delta.md).
    > You can also [Migrate a Delta external table to Apache Iceberg™](../../user-guide/tables-external-intro.md).

    Identifies the external table as referencing a Delta Lake on the cloud storage location. A Delta Lake on Amazon S3, Google Cloud Storage,
    or Microsoft Azure cloud storage is supported.

    > **Note:**
    >
    > This [preview feature](../../release-notes/preview-features.md) is available to all accounts.

    When this parameter is set, the external table scans for Delta Lake transaction log files in the `[ WITH ] LOCATION` location.
    Delta log files have names like `_delta_log/00000000000000000000.json` and
    `_delta_log/00000000000000000010.checkpoint.parquet`.

    When the metadata for an external table is refreshed, Snowflake parses the Delta Lake transaction logs and determines which Parquet
    files are current. In the background, the refresh performs add and remove file operations to keep the external table metadata in sync.

    > **Note:**
    >
    > * The external stage and optional path specified in `[ WITH ] LOCATION =` must contain the data files and metadata for a
    >   single Delta Lake table only. That is, the specified storage location can only contain one `__delta_log`
    >   directory.
    > * The ordering of event notifications triggered by DDL operations in cloud storage isn’t guaranteed. Therefore, the ability to
    >   automatically refresh isn’t available for external tables that reference Delta Lake files. Both `REFRESH_ON_CREATE` and
    >   `AUTO_REFRESH` must be set to FALSE.
    >
    >   Periodically run an [ALTER EXTERNAL TABLE … REFRESH](alter-external-table.md) statement to register any
    >   added or removed files.
    > * The `FILE_FORMAT` value must specify Parquet as the file type.
    > * For optimal performance, we recommend defining partition columns for the external table.
    > * The following parameters aren’t supported when referencing a Delta Lake:
    >
    >   + `AWS_SNS_TOPIC = 'string'`
    >   + `PATTERN = 'regex_pattern'`

`COPY GRANTS`
:   Specifies retaining the access permissions from the original table when an external table is recreated using the CREATE OR REPLACE TABLE
    variant. The parameter copies all permissions, except OWNERSHIP, from the existing table to the new table. By default, the role
    that runs the CREATE EXTERNAL TABLE command owns the new external table.

    > **Note:**
    >
    > The operation to copy grants occurs atomically in the CREATE EXTERNAL TABLE command (that is, within the same transaction).

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the external table.

    Default: No value

`ROW ACCESS POLICY <policy_name> ON (VALUE)`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on the table.

    Specify the VALUE column when applying a row access policy to an external table.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

### Partitioning parameters

Use these parameters to partition your external table.

`part_col_name col_type AS part_expr`
:   Defines one or more partition columns in the external table.

    The format of a partition column definition differs depending on whether partitions are computed and added automatically from an
    expression in each partition column or the partitions are added manually.

    Added from an expression:
    :   A partition column must evaluate as an expression that parses the path or filename information in the METADATA$FILENAME
        pseudocolumn. Partition columns optimize query performance by pruning out the data files that don’t need to be scanned (that is,
        partitioning the external table). A partition consists of all data files that match the path or filename in the expression for
        the partition column.

        |  |  |
        | --- | --- |
        | `part_col_name` | String that specifies the partition column identifier (that is, name). All the requirements for table identifiers also apply to column identifiers. |
        | `col_type` | String (constant) that specifies the data type for the column. The data type must match the result of `part_expr` for the column. |
        | `part_expr` | String that specifies the expression for the column. The expression must include the METADATA$FILENAME pseudocolumn. |

        External tables currently support the following subset of functions in partition expressions:

        * `=`, `<>`, `>`, `>=`, `<`, `<=`
        * `||`
        * `+`, `-`
        * `-` (negate)
        * `*`
        * `AND`, `OR`
        * [ARRAY_CONSTRUCT](../functions/array_construct.md)
        * [CASE](../functions/case.md)
        * [CAST , ::](../functions/cast.md)
        * [CONCAT , ||](../functions/concat.md)
        * [ENDSWITH](../functions/endswith.md)
        * [IS [ NOT ] NULL](../functions/is-null.md)
        * [IFF](../functions/iff.md)
        * [IFNULL](../functions/ifnull.md)
        * [[ NOT ] IN](../functions/in.md)
        * [LOWER](../functions/lower.md)
        * `NOT`
        * [NULLIF](../functions/nullif.md)
        * [NVL2](../functions/nvl2.md)
        * [SPLIT_PART](../functions/split_part.md)
        * [STARTSWITH](../functions/startswith.md)
        * [SUBSTR , SUBSTRING](../functions/substr.md)
        * [UPPER](../functions/upper.md)
        * [ZEROIFNULL](../functions/zeroifnull.md)

    Added manually:
    :   Required: Also set the `PARTITION_TYPE` parameter value to `USER_SPECIFIED`.

        A partition column definition is an expression that parses the column metadata in the internal (hidden)
        METADATA$EXTERNAL_TABLE_PARTITION column. Essentially, the definition only defines the data type for the column. The following example shows the format of the
        partition column definition:

        `part_col_name col_type AS ( PARSE_JSON (METADATA$EXTERNALTABLE_PARTITION):part_col_name::data_type )`

        For example, suppose columns `col1`, `col2`, and `col3` contain varchar, number, and timestamp (time zone) data, respectively:

        ```sqlexample
        col1 varchar as (parse_json(metadata$external_table_partition):col1::varchar),
        col2 number as (parse_json(metadata$external_table_partition):col2::number),
        col3 timestamp_tz as (parse_json(metadata$external_table_partition):col3::timestamp_tz)
        ```

    After defining any partition columns for the table, identify these columns by using the PARTITION BY clause.

    > **Note:**
    >
    > The maximum length of user-specified partition column names is 32 characters.

`PARTITION_TYPE = USER_SPECIFIED`
:   Defines the partition type for the external table as *user-defined*. The owner of the external table (that is, the role that has the
    OWNERSHIP privilege on the external table) must add partitions to the external metadata manually by running ALTER EXTERNAL
    TABLE … ADD PARTITION statements.

    Don’t set this parameter if partitions are added to the external table metadata automatically upon evaluation of expressions
    in the partition columns.

`[ PARTITION BY ( part_col_name [, part_col_name ... ] ) ]`
:   Specifies any partition columns to evaluate for the external table.

    Usage:
    :   When you query an external table, include one or more partition columns in a WHERE clause; for example:

        `... WHERE part_col_name = 'filter_value'`

        Snowflake filters on the partition columns to restrict the set of data files to scan. All rows in these files are scanned.
        If a WHERE clause includes non-partition columns, those filters are evaluated after the data files are filtered.

        A common practice is to partition the data files based on increments of time; or, if the data files are staged from multiple sources,
        to partition by a data source identifier and date or timestamp.

## Cloud provider parameters (`cloudProviderParams`)

> **Google Cloud Storage**
>
> > `INTEGRATION = integration_name`
> > :   Specifies the name of the notification integration used to automatically refresh the external table metadata using Google Pub/Sub
> >     event notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party
> >     cloud message queuing services.
> >
> >     This parameter is required to enable auto-refresh operations for the external table. For instructions about how to configure the
> >     auto-refresh capability, see [Refresh external tables automatically for Google Cloud Storage](../../user-guide/tables-external-gcs.md).
>
> **Microsoft Azure**
>
> > `INTEGRATION = integration_name`
> > :   Specifies the name of the notification integration used to automatically refresh the external table metadata using Azure Event Grid
> >     notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud
> >     message queuing services.
> >
> >     This parameter is required to enable auto-refresh operations for the external table. For instructions about how to configure the auto-refresh
> >     capability, see [Refresh external tables automatically for Azure Blob Storage](../../user-guide/tables-external-azure.md).

## Format type options (`formatTypeOptions`)

Format type options are used for [loading data into](../../guides-overview-loading-data.md) and [unloading data out of](../../user-guide/data-unload-overview.md)
tables.

Depending on the file format type specified (`FILE_FORMAT = ( TYPE = ... )`), you can include one or more of the following
format-specific options (separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be queried. Snowflake uses this option to detect
    how already-compressed data files were compressed so that the compressed data in the files can be extracted for querying.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If querying Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` | Must be specified when querying Brotli-compressed files. |
    | `ZSTD` | Zstandard v0.8 (and higher) supported. |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Data files have not been compressed. |

`RECORD_DELIMITER = 'string' | NONE`
:   One or more characters that separate records in an input file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (e.g. `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

    The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

    Also accepts a value of `NONE`.

    Default: New line character. Note that “new line” is logical such that `\r\n` is understood as a new line for files on a Windows platform.

`FIELD_DELIMITER = 'string' | NONE`
:   One or more singlebyte or multibyte characters that separate fields in an input file. Accepts common escape sequences or the following singlebyte or multibyte characters:

    Singlebyte characters:
    :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

    Multibyte characters:
    :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

        The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (e.g. `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        > > **Note:**
        > >
        > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

    The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

    Also accepts a value of `NONE`.

    Default: comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Boolean that specifies whether multiple lines are allowed.

    If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default: `TRUE`

`SKIP_HEADER = integer`
:   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to query.

    Default: `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data querying only

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default: `FALSE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

    Accepts common escape sequences, octal values, or hex values.

    Specifies the escape character for unenclosed fields only.

    > **Note:**
    >
    > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
    >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, this row and the next row are
    >   handled as a single row of data. To avoid this issue, set the value to `NONE`.
    > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
    >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
    >   the option value.
    >
    >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
    >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Default: backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove white space from fields.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
    field (that is, the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces when querying data.

    As another example, if leading or trailing spaces surround quotes that enclose strings, you can remove the surrounding spaces using this option and the quote character using the
    `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any spaces within the quotes are preserved. For example, assuming `FIELD_DELIMITER = '|'` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

    ```sqlexample
    |"Hello world"|    /* returned as */  >Hello world<
    |" Hello world "|  /* returned as */  > Hello world <
    | "Hello world" |  /* returned as */  >Hello world<
    ```

    Note that the brackets in this example are not returned; they are used to demarcate the beginning and end of the returned strings.

    Default: `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex representation (`0x27`) or the double single-quoted escape (`''`).

    Default: `NONE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL:

    When querying data, Snowflake replaces these values in the returned data with SQL NULL. To specify more than one string, enclose
    the list of strings in parentheses and use commas to separate each value.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as
    a value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    Default: `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\`)

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Specifies whether to return SQL NULL for empty fields in an input file, which are represented by two successive delimiters (e.g. `,,`).

    If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is returned for columns of type STRING. For other column types, the query returns an error.

    Default: `TRUE`

`ENCODING = 'string'`
:   String (constant) that specifies the character set of the source data when querying data.

    > | Character Set | `ENCODING` Value | Supported Languages | Notes |
    > | --- | --- | --- | --- |
    > | Big5 | `BIG5` | Traditional Chinese |  |
    > | EUC-JP | `EUCJP` | Japanese |  |
    > | EUC-KR | `EUCKR` | Korean |  |
    > | GB18030 | `GB18030` | Chinese |  |
    > | IBM420 | `IBM420` | Arabic |  |
    > | IBM424 | `IBM424` | Hebrew |  |
    > | IBM949 | `IBM949` | Korean |  |
    > | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
    > | ISO-2022-JP | `ISO2022JP` | Japanese |  |
    > | ISO-2022-KR | `ISO2022KR` | Korean |  |
    > | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
    > | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
    > | ISO-8859-5 | `ISO88595` | Russian |  |
    > | ISO-8859-6 | `ISO88596` | Arabic |  |
    > | ISO-8859-7 | `ISO88597` | Greek |  |
    > | ISO-8859-8 | `ISO88598` | Hebrew |  |
    > | ISO-8859-9 | `ISO88599` | Turkish |  |
    > | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
    > | KOI8-R | `KOI8R` | Russian |  |
    > | Shift_JIS | `SHIFTJIS` | Japanese |  |
    > | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
    > | UTF-16 | `UTF16` | All languages |  |
    > | UTF-16BE | `UTF16BE` | All languages |  |
    > | UTF-16LE | `UTF16LE` | All languages |  |
    > | UTF-32 | `UTF32` | All languages |  |
    > | UTF-32BE | `UTF32BE` | All languages |  |
    > | UTF-32LE | `UTF32LE` | All languages |  |
    > | windows-874 | `WINDOWS874` | Thai |  |
    > | windows-949 | `WINDOWS949` | Korean |  |
    > | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
    > | windows-1251 | `WINDOWS1251` | Russian |  |
    > | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
    > | windows-1253 | `WINDOWS1253` | Greek |  |
    > | windows-1254 | `WINDOWS1254` | Turkish |  |
    > | windows-1255 | `WINDOWS1255` | Hebrew |  |
    > | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default: `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be returned. Snowflake uses this option to
    detect how already-compressed data files were compressed so that the compressed data in the files can be extracted for querying.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If querying Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Indicates the files have not been compressed. |

    Default: `AUTO`

`MULTI_LINE = TRUE | FALSE`
:   Boolean that specifies whether multiple lines are allowed.

    If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default: `TRUE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Boolean that specifies to allow duplicate object field names (only the last one will be preserved).

    Default: `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Boolean that instructs the JSON parser to remove outer brackets (that is, `[ ]`).

    Default: `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

    > | Before | After |
    > | --- | --- |
    > | `[null]` | `[]` |
    > | `[null,null,3]` | `[,,3]` |
    > | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
    > | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default: `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default: `FALSE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   String (constant) that specifies the current compression algorithm for the data files to be queried. Snowflake uses this option to
    detect how already-compressed data files were compressed so that the compressed data in the files can be extracted for querying.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If querying Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO`. |
    | `GZIP` |  |
    | `BZ2` |  |
    | `BROTLI` |  |
    | `ZSTD` |  |
    | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    | `NONE` | Data files to query have not been compressed. |

    Default: `AUTO`.

> **Note:**
>
> We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default: `FALSE`

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Boolean that specifies whether to remove leading and trailing white space from strings.

    For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the field (that is, the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces.

    This file format option is applied to the following actions only:

    * Querying object values in staged ORC data files.
    * Querying ORC data in separate columns using the MATCH_BY_COLUMN_NAME copy option.
    * Querying ORC data in separate columns by specifying a query in the COPY statement (that is, COPY transformation).

    Default: `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default: `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data source with SQL NULL. To specify more than
    one string, enclose the list of strings in parentheses and use commas to separate each value.

    Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
    value, all instances of `2` as either a string or number are converted.

    For example:

    `NULL_IF = ('\N', 'NULL', 'NUL', '')`

    Note that this option can include empty strings.

    This file format option is applied when querying object values in staged ORC data files.

    Default: `\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | SNAPPY | NONE`
:   String (constant) that specifies the current compression algorithm for columns in the Parquet files.

    | Supported Values | Notes |
    | --- | --- |
    | `AUTO` | Compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). |
    | `SNAPPY` |  |
    | `NONE` | Data files have not been compressed. |

    Default: `AUTO`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default: `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
    option performs a one-to-one character replacement.

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default: `FALSE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE EXTERNAL TABLE | Schema |  |
| CREATE STAGE | Schema | Required if creating a new stage. |
| USAGE | Stage | Required if referencing an existing stage. |
| USAGE | File format |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* External tables support external (S3, Azure, or GCS) stages only; internal (Snowflake) stages aren’t supported.

  External tables don’t support storage versioning (S3 versioning, Object Versioning in Google Cloud Storage, or versioning for Azure Storage).

  You cannot access data held in archival cloud storage classes that requires restoration before it can be retrieved. These archival storage classes include, for example, the Amazon S3 Glacier Flexible Retrieval or Glacier Deep Archive storage class, or Microsoft Azure Archive Storage.
* Snowflake doesn’t enforce integrity constraints on external tables. In particular, unlike normal tables, Snowflake doesn’t enforce
  NOT NULL constraints.
* External tables include the following metadata column:

  + METADATA$FILENAME: Name of each staged data file that is included in the external table. Includes the path to the data file in the stage.
  + METADATA$FILE_ROW_NUMBER: Row number for each record in the staged data file.

* The following items aren’t supported for external tables:

  + Clustering keys
  + Cloning
  + Data in XML format
  + Time Travel
* For information about using an external table with a policy, see the following topics:

  + [Masking policies and external tables](../../user-guide/security-column-intro.md).
  + [Row access policies and external tables](../../user-guide/security-row-intro.md).
* Using `OR REPLACE` is the equivalent of using [DROP EXTERNAL TABLE](drop-external-table.md) on the existing external table, and then creating a new
  external table with the same name.

  CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

  This means that any queries concurrent with the CREATE OR REPLACE EXTERNAL TABLE operation use either the old or new external table version.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* When you create an external table with a row access policy added to the external table, use the
  [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the external table that is protected by a row access policy.
* [SELECT](select.md) `*` always returns the VALUE column, in which all regular or semi-structured data is cast to variant rows.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.

## Examples

### Partitions added automatically from partition column expressions

Create an external table with partitions computed from expressions in the partition column definitions.

In step 2 of the following example, the data files are organized in cloud storage with the following structure: `logs/YYYY/MM/DD/HH24`.
For example:

* `logs/2018/08/05/0524/`
* `logs/2018/08/27/1408/`

1. Create an external stage named `s1` for the storage location where the data files are stored. For more information, see
   [CREATE STAGE](create-stage.md).

   The stage definition includes the path `/files/logs/`:

   **Amazon S3**

   > ```sqlexample
   > CREATE STAGE s1
   >   URL='s3://mybucket/files/logs/'
   >   ...
   >   ;
   > ```

   **Google Cloud Storage**

   > ```sqlexample
   > CREATE STAGE s1
   >   URL='gcs://mybucket/files/logs/'
   >   ...
   >   ;
   > ```

   **Microsoft Azure**

   > ```sqlexample
   > CREATE STAGE s1
   >   URL='azure://mycontainer/files/logs/'
   >   ...
   >   ;
   > ```
2. Query the METADATA$FILENAME pseudocolumn in the staged data, and then use the results to develop your partition columns:

   ```sqlexample
   SELECT metadata$filename FROM @s1/;

   +----------------------------------------+
   | METADATA$FILENAME                      |
   |----------------------------------------|
   | files/logs/2018/08/05/0524/log.parquet |
   | files/logs/2018/08/27/1408/log.parquet |
   +----------------------------------------+
   ```
3. Create the partitioned external table.

   The partition column `date_part` casts `YYYY/MM/DD` in the METADATA$FILENAME pseudocolumn as a date by using
   [TO_DATE , DATE](../functions/to_date.md). The SQL command also specifies Parquet as the file format type.

   The external tables for Amazon S3 and Microsoft Azure cloud storage include the parameter that is required to refresh the metadata
   automatically when triggered by event notifications from the respective cloud messaging service:

   **Amazon S3**

   > ```sqlexample
   > CREATE EXTERNAL TABLE et1(
   >  date_part date AS TO_DATE(SPLIT_PART(metadata$filename, '/', 3)
   >    || '/' || SPLIT_PART(metadata$filename, '/', 4)
   >    || '/' || SPLIT_PART(metadata$filename, '/', 5), 'YYYY/MM/DD'),
   >  timestamp bigint AS (value:timestamp::bigint),
   >  col2 varchar AS (value:col2::varchar))
   >  PARTITION BY (date_part)
   >  LOCATION=@s1/logs/
   >  AUTO_REFRESH = true
   >  FILE_FORMAT = (TYPE = PARQUET)
   >  AWS_SNS_TOPIC = 'arn:aws:sns:us-west-2:001234567890:s3_mybucket';
   > ```

   **Google Cloud Storage**

   > ```sqlexample
   > CREATE EXTERNAL TABLE et1(
   >   date_part date AS TO_DATE(SPLIT_PART(metadata$filename, '/', 3)
   >     || '/' || SPLIT_PART(metadata$filename, '/', 4)
   >     || '/' || SPLIT_PART(metadata$filename, '/', 5), 'YYYY/MM/DD'),
   >   timestamp bigint AS (value:timestamp::bigint),
   >   col2 varchar AS (value:col2::varchar))
   >   PARTITION BY (date_part)
   >   LOCATION=@s1/logs/
   >   AUTO_REFRESH = true
   >   FILE_FORMAT = (TYPE = PARQUET);
   > ```

   **Microsoft Azure**

   > ```sqlexample
   > CREATE EXTERNAL TABLE et1(
   >   date_part date AS TO_DATE(SPLIT_PART(metadata$filename, '/', 3)
   >     || '/' || SPLIT_PART(metadata$filename, '/', 4)
   >     || '/' || SPLIT_PART(metadata$filename, '/', 5), 'YYYY/MM/DD'),
   >   timestamp bigint AS (value:timestamp::bigint),
   >   col2 varchar AS (value:col2::varchar))
   >   PARTITION BY (date_part)
   >   INTEGRATION = 'MY_INT'
   >   LOCATION=@s1/logs/
   >   AUTO_REFRESH = true
   >   FILE_FORMAT = (TYPE = PARQUET);
   > ```
4. Refresh the external table metadata:

   ```sqlexample
   ALTER EXTERNAL TABLE et1 REFRESH;
   ```

When you query the external table, filter the data by the partition column by using a WHERE clause. Snowflake only scans the files in the
specified partitions that match the filter conditions:

```sqlexample
SELECT timestamp, col2 FROM et1 WHERE date_part = to_date('08/05/2018');
```

### Partitions added manually

Create an external table with user-defined partitions (that is, the partitions are added manually by the external table owner).

1. Create an external stage named `s2` for the storage location where the data files are stored:

   The stage definition includes the path `/files/logs/`:

   **Amazon S3**

   > ```sqlexample
   > CREATE STAGE s2
   >   URL='s3://mybucket/files/logs/'
   >   ...
   >   ;
   > ```

   **Google Cloud Storage**

   > ```sqlexample
   > CREATE STAGE s2
   >   URL='gcs://mybucket/files/logs/'
   >   ...
   >   ;
   > ```

   **Microsoft Azure**

   > ```sqlexample
   > CREATE STAGE s2
   >   URL='azure://mycontainer/files/logs/'
   >   ...
   >   ;
   > ```
2. Create the partitioned external table. The external table includes three partition columns with different data types.

   The following rules apply:

   * The column names in the partition expressions are case-sensitive.
   * A partition column name must be in uppercase, unless the column name is enclosed in double quotes. Alternatively,
     use [GET_IGNORE_CASE](../functions/get_ignore_case.md) instead of the case-sensitive `:` character in the SQL
     expression.
   * If a column name is enclosed in double quotes (for example, “Column1”), the partition column name must also be enclosed in
     double quotes and match the column name exactly.

   The syntax for each of the three cloud storage services (Amazon S3, Google Cloud Storage, and Microsoft Azure) is identical
   because the external table metadata isn’t refreshed:

   ```sqlexample
   create external table et2(
     col1 date as (parse_json(metadata$external_table_partition):COL1::date),
     col2 varchar as (parse_json(metadata$external_table_partition):COL2::varchar),
     col3 number as (parse_json(metadata$external_table_partition):COL3::number))
     partition by (col1,col2,col3)
     location=@s2/logs/
     partition_type = user_specified
     file_format = (type = parquet);
   ```
3. Add partitions for the partition columns:

   ```sqlexample
   ALTER EXTERNAL TABLE et2 ADD PARTITION(col1='2022-01-24', col2='a', col3='12') LOCATION '2022/01';
   ```

   Snowflake adds the partitions to the metadata for the external table. The operation also adds any new data files in the specified
   location to the metadata:

   ```sqlexample
   +---------------------------------------+----------------+-------------------------------+
   |                       file            |     status     |          description          |
   +---------------------------------------+----------------+-------------------------------+
   | mycontainer/files/logs/2022/01/24.csv | REGISTERED_NEW | File registered successfully. |
   | mycontainer/files/logs/2022/01/25.csv | REGISTERED_NEW | File registered successfully. |
   +---------------------------------------+----------------+-------------------------------+
   ```

When you query the external table, filter the data by the partition columns by using a WHERE clause. This example returns the records in the
order in which they are stored in the staged data files:

```sqlexample
SELECT col1, col2, col3 FROM et1 WHERE col1 = TO_DATE('2022-01-24') AND col2 = 'a' ORDER BY METADATA$FILE_ROW_NUMBER;
```

### Materialized view on an external table

Create a materialized view that is based on a subquery of the columns in the external table created in the
Partitions Added Automatically From Partition Column Expressions example:

```sqlexample
CREATE MATERIALIZED VIEW et1_mv
  AS
  SELECT col2 FROM et1;
```

For general syntax, usage notes, and further examples for this SQL command, see [CREATE MATERIALIZED VIEW](create-materialized-view.md).

### External table created with detected column definitions

Create an external table where the column definitions are derived from a set of staged files that contain Avro, Parquet, or ORC data.

> **Note:**
>
> The `mystage` stage and `my_parquet_format` file format referenced in the statement must already exist. A set of files must
> already be staged in the cloud storage location referenced in the stage definition.

The following example builds on an example in the [INFER_SCHEMA](../functions/infer_schema.md) topic:

> ```sqlexample
> CREATE EXTERNAL TABLE mytable
>   USING TEMPLATE (
>     SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
>     FROM TABLE(
>       INFER_SCHEMA(
>         LOCATION=>'@mystage',
>         FILE_FORMAT=>'my_parquet_format'
>       )
>     )
>   )
>   LOCATION=@mystage
>   FILE_FORMAT=my_parquet_format
>   AUTO_REFRESH=false;
> ```

Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger than 16 MB. Avoid using `*` for larger result sets, and only use the required columns, `COLUMN NAME`, `TYPE`, and `NULLABLE`, for the query, as the following example demonstrates. Optional column `ORDER_ID` can be included when using `WITHIN GROUP (ORDER BY order_id)`:

> ```sqlexample
> CREATE EXTERNAL TABLE mytable
>   USING TEMPLATE (
>     SELECT ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME',COLUMN_NAME, 'TYPE',TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION',EXPRESSION))
>     FROM TABLE(
>       INFER_SCHEMA(
>         LOCATION=>'@mystage',
>         FILE_FORMAT=>'my_parquet_format'
>       )
>     )
>   )
>   LOCATION=@mystage
>   FILE_FORMAT=my_parquet_format
>   AUTO_REFRESH=false;
> ```

---
title: CREATE EXTERNAL VOLUME
source: https://docs.snowflake.com/en/sql-reference/sql/create-external-volume.md
section: SQL Commands
---

# CREATE EXTERNAL VOLUME

Creates a new [external volume](../../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)
in the account or replaces an existing external volume.

See also:
:   [ALTER EXTERNAL VOLUME](alter-external-volume.md) , [DROP EXTERNAL VOLUME](drop-external-volume.md) , [SHOW EXTERNAL VOLUMES](show-external-volumes.md), [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] EXTERNAL VOLUME [IF NOT EXISTS]
  <name>
  STORAGE_LOCATIONS =
    (
      (
        NAME = '<storage_location_name>'
        { cloudProviderParams | s3CompatibleStorageParams }
      )
      [, (...), ...]
    )
  [ ALLOW_WRITES = { TRUE | FALSE }]
  [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> cloudProviderParams (for Amazon S3) ::=
>   STORAGE_PROVIDER = '{ S3 | S3GOV }'
>   STORAGE_AWS_ROLE_ARN = '<iam_role>'
>   STORAGE_BASE_URL = '<protocol>://<bucket>[/<path>/]'
>   [ STORAGE_AWS_ACCESS_POINT_ARN = '<string>' ]
>   [ STORAGE_AWS_EXTERNAL_ID = '<external_id>' ]
>   [ ENCRYPTION = ( [ TYPE = 'AWS_SSE_S3' ] |
>               [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ] ] |
>               [ TYPE = 'NONE' ] ) ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Google Cloud Storage) ::=
>   STORAGE_PROVIDER = 'GCS'
>   STORAGE_BASE_URL = 'gcs://<bucket>[/<path>/]'
>   [ ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = '<string>' ] |
>               [ TYPE = 'NONE' ] ) ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Microsoft Azure) ::=
>   STORAGE_PROVIDER = 'AZURE'
>   AZURE_TENANT_ID = '<tenant_id>'
>   STORAGE_BASE_URL = 'azure://...'
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```

> ```sqlsyntax
> s3CompatibleStorageParams ::=
>   STORAGE_PROVIDER = 'S3COMPAT'
>   STORAGE_BASE_URL = 's3compat://<bucket>[/<path>/]'
>   CREDENTIALS = ( AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' )
>   STORAGE_ENDPOINT = '<s3_api_compatible_endpoint>'
> ```

## Required parameters

`name`
:   String that specifies the identifier (the name) for the external volume; must be unique in your account.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`STORAGE_LOCATIONS = ( ( NAME = 'storage_location_name' { cloudProviderParams | s3CompatibleStorageParams } ) [, (...), ...] )`
:   Set of named cloud storage locations in different regions and, optionally, cloud platforms.

    > **Note:**
    >
    > * Each external volume that you create supports a single
    >   [active storage location](../../user-guide/tables-iceberg-storage.md).

## Optional parameters

`ALLOW_WRITES = '{ TRUE | FALSE }'`
:   Specifies whether write operations are allowed for the external volume; must be set to TRUE for the following tables:

    * Iceberg tables that use Snowflake as the catalog.
    * Iceberg tables that use an external catalog and are writable. Externally managed Iceberg tables are writable when you access them
      through a catalog-linked database that has the ALLOWED_WRITE_OPERATIONS parameter set to TRUE.

    For Iceberg tables created from Delta table files, setting this parameter to TRUE enables Snowflake to write Iceberg
    metadata to your external storage. For more information, see [Delta-based tables](../../user-guide/tables-iceberg-metadata.md).

    The value of this parameter must also match the permissions that you
    set on the cloud storage account for each specified storage location.

    > **Note:**
    >
    > If you plan to use the external volume for reading externally managed Iceberg tables, you can set this parameter to FALSE.
    > Snowflake doesn’t write data or Iceberg metadata files to your cloud storage when you read tables in an external Iceberg catalog.

    Default: TRUE

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the external volume.

    Default: No value

## Cloud provider parameters (`cloudProviderParams`)

> **Note:**
>
> The KMS keys are managed by the storage owner in Amazon S3 or Google Cloud Storage instances. The service principals (IAM role and
> GCS service account) must be granted privileges to use KMS keys.
> For more information, see [Encrypting table files](../../user-guide/tables-iceberg-storage.md).

**Amazon S3**

> `STORAGE_PROVIDER = '{ S3 | S3GOV }'`
> :   Specifies the cloud storage provider that stores your data files.
>
>     * `'S3'`: S3 storage in public AWS regions outside of China.
>     * `'S3GOV'`: S3 storage in AWS [government regions](../../user-guide/intro-regions.md).
>
> `STORAGE_AWS_ROLE_ARN = 'iam_role'`
> :   Specifies the case-sensitive Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket
>     containing your data files. For more information, see [Configure an external volume for Amazon S3](../../user-guide/tables-iceberg-configure-external-volume-s3.md).
>
> `STORAGE_BASE_URL = 'protocol://bucket[/path/]'`
> :   Specifies the base URL for your cloud storage location, where:
>
>     * `protocol` is one of the following:
>
>       + `s3` refers to S3 storage in public AWS regions outside of China.
>       + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
>     * `bucket` is the name of an S3 bucket that stores your data files or the [bucket-style alias](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-alias.html)
>       for an S3 bucket access point. For an S3 access point, you must also specify a value for the `STORAGE_AWS_ACCESS_POINT_ARN` parameter.
>     * `path` is an optional path that can be used to provide granular control over objects in the bucket.
>
>     > **Note:**
>     >
>     > Snowflake can’t support external volumes with S3 bucket names that contain dots (for example, `my.s3.bucket`).
>     > S3 doesn’t support SSL for virtual-hosted-style buckets with dots in the name, and
>     > Snowflake uses virtual-host-style paths and HTTPS to access data in S3.
>
>     > **Important:**
>     >
>     > To create an Iceberg table that uses an external catalog, your Parquet data files
>     > and Iceberg metadata files must be within the `STORAGE_BASE_URL` location.
>
> `STORAGE_AWS_ACCESS_POINT_ARN = 'string'`
> :   Specifies the Amazon resource name (ARN) for your S3 access point. Required only when you specify an S3 access point alias
>     for your storage `STORAGE_BASE_URL`.

> `STORAGE_AWS_EXTERNAL_ID = 'external_id'`
> :   Optionally specifies an external ID that Snowflake uses to establish a trust relationship with AWS.
>     You must specify the same external ID in the trust policy of the IAM role
>     that you configured for this external volume. For more information,
>     see
>     [How to use an external ID when granting access to your AWS resources to a third party](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html).
>
>     If you don’t specify a value for this parameter, Snowflake automatically generates an external ID when you create the external volume.
>
> `ENCRYPTION = ( [ TYPE = 'AWS_SSE_S3' ] | [ TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = 'string' ] ] | [ TYPE = 'NONE' ] )`
> :   Specifies the properties needed to encrypt data on the external volume.
>
>     `TYPE = ...`
>     :   Specifies the encryption type used. Possible values are:
>
>         * `'AWS_SSE_S3'` : Server-side encryption using S3-managed encryption keys. For more information, see [Using server-side encryption with Amazon S3-managed encryption keys (SSE-S3)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingServerSideEncryption.html).
>         * `'AWS_SSE_KMS'` : Server-side encryption using keys stored in KMS. For more information, see [Using server-side encryption with AWS Key Management Service (SSE-KMS)](https://docs.aws.amazon.com/AmazonS3/latest/userguide/UsingKMSEncryption.html).
>         * `'NONE'`: No encryption.
>
>     `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
>     :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files written to the bucket. If no value is provided, your default KMS key is used to encrypt files for writing data.
>
>         Note that this value is ignored when reading data.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external volumes for Amazon Web Services](../../user-guide/tables-iceberg-configure-external-volume-s3-private.md).

**Google Cloud Storage**

> `STORAGE_PROVIDER = 'GCS'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `STORAGE_BASE_URL = 'gcs://bucket[/path/]'`
> :   Specifies the base URL for your cloud storage location, where:
>
>     * `bucket` is the name of a Cloud Storage bucket that stores your data files.
>     * `path` is an optional path that can be used to provide granular control over objects in the bucket.
>
>     > **Important:**
>     >
>     > To create an Iceberg table that uses an external catalog, your Parquet data files
>     > and Iceberg metadata files must be within the `STORAGE_BASE_URL` location.
>
> `ENCRYPTION = ( [ TYPE = 'GCS_SSE_KMS' ] [ KMS_KEY_ID = 'string' ] | [ TYPE = 'NONE' ] )`
> :   Specifies the properties needed to encrypt data on the external volume.
>
>     `TYPE = ...`
>     :   Specifies the encryption type used. Possible values are:
>
>         * `'GCS_SSE_KMS'`: Server-side encryption using keys stored in KMS. For more information, see [customer-managed encryption keys](https://cloud.google.com/storage/docs/encryption/customer-managed-keys).
>         * `'NONE'`: No encryption.
>
>     `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
>     :   Specifies the ID for the Cloud KMS-managed key used to encrypt files written to the bucket.
>
>         Note that this value is ignored when reading data. The read operation should succeed if the service account has sufficient
>         permissions to the data and any specified KMS keys.

**Microsoft Azure**

> `STORAGE_PROVIDER = 'AZURE'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `AZURE_TENANT_ID = 'tenant_id'`
> :   Specifies the ID for your Office 365 tenant that the storage location belongs to. An external volume can
>     authenticate to only one tenant, so the storage location must refer to a storage account
>     that belongs to this tenant.
>
>     To find your tenant ID, log into the Azure portal and select Azure Active Directory » Properties. The tenant ID is
>     displayed in the Tenant ID field.

> `STORAGE_BASE_URL = 'azure://...'`
> :   Specifies the base URL for your cloud storage location (case-sensitive).
>
>     * For Azure Blob Storage, specify `azure://account.blob.core.windows.net/container[/path/]`, where:
>
>       + `account` is the name of your Azure account; for example, `myaccount`.
>       + `container` is the name of an Azure container that stores your data files.
>       + `path` is an optional path that can be used to provide granular control over logical directories in the container.
>
>     > [Preview feature](../../release-notes/preview-features.md) — Open
>     >
>     > Available to all accounts. Configuring an external volume that is connected to Data Lake Storage is in public preview.
>
>     * For Data Lake Storage, specify `azure://account.dfs.core.windows.net/container[/path/]`, where:
>
>       + `account` is the name of your Azure account; for example, `myaccount`.
>       + `container` is the name of an Azure container that stores your data files.
>       + `path` is an optional path that can be used to provide granular control over logical directories in the container.
>     * For Fabric OneLake, specify `azure://[region-]onelake.dfs | blob.fabric.microsoft.com/workspace/lakehouse/path/`, where:
>
>       + `region` optionally specifies the endpoint region; for example, `westus`. If specified, this must be the same region used
>         by your Microsoft Fabric capacity, and the same region in which your Snowflake account is hosted.
>       + `dfs | blob` specifies the endpoint type.
>       + `workspace` is either your Fabric workspace ID or workspace name; for example, `cfafbeb1-8037-4d0c-896e-a46fb27ff227` or `my_workspace`.
>         You must use the same type of identifier (ID or name) for both your workspace and Lakehouse.
>       + `lakehouse` is either your Lakehouse ID or Lakehouse name. You must use the same type of identifier (ID or name)
>         for both your workspace and Lakehouse; for example, `5b218778-e7a5-4d73-8187-f10824047715` or `my_lakehouse.Lakehouse`.
>       + `path` is a path to your storage location in the specified Lakehouse and Workspace.
>
>       Preview Feature — Open
>
>       Available to all accounts
>
>       This feature is not available in the People’s Republic of China.
>
>     > **Note:**
>     >
>     > Use the `azure://` prefix and not `https://`.
>
>     > **Important:**
>     >
>     > To create an Iceberg table that uses an external catalog, your Parquet data files
>     > and Iceberg metadata files must be within the `STORAGE_BASE_URL` location.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external volumes for Microsoft Azure](../../user-guide/tables-iceberg-configure-external-volume-azure-private.md).

## S3-compatible storage parameters (`s3CompatibleStorageParams`)

`STORAGE_PROVIDER = 'S3COMPAT'`
:   Specifies S3-compatible storage as your storage provider.

`STORAGE_BASE_URL = 's3compat://bucket[/path/]'`
:   Specifies the URL for the external location used to store data files (an existing bucket accessed using an S3-compatible API endpoint), where:

    * `bucket` is the name of the bucket.
    * `path` is an optional case-sensitive path (or *prefix* in S3 terminology) for files in the cloud storage location
      (files with names that begin with a common string).

`CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' )`
:   Specifies the security credentials for connecting to and accessing your S3-compatible storage location.

`STORAGE_ENDPOINT = 's3_api_compatible_endpoint'`
:   Specifies a fully qualified domain that points to your S3-compatible API endpoint.

    > **Note:**
    >
    > The storage endpoint should not include a bucket name; for example, specify `example.com` instead of `my_bucket.example.com`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE EXTERNAL VOLUME | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

> **Important:**
>
> **External volumes in Amazon S3 storage only:** If you recreate an external volume (using the CREATE OR REPLACE EXTERNAL VOLUME syntax)
> without specifying an external ID, you must repeat the steps to grant the AWS identity and access management (IAM) user
> for your Snowflake account the access permissions required on the S3 storage location.
> For more information, see the instructions for retrieving the AWS IAM user for your Snowflake
> account in [Configure an external volume for Amazon S3](../../user-guide/tables-iceberg-configure-external-volume-s3.md).

* You can’t drop or replace an external volume if one or more Iceberg tables
  are associated with the external volume.

  To view the tables that depend on an external volume,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `external_volume_name` column.

  > **Note:**
  >
  > The column identifier (`external_volume_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "external_volume_name" = 'my_external_volume_1';
  ```
* If you use a regional endpoint for a Microsoft Fabric OneLake storage location,
  use the same region as your Microsoft Fabric capacity. This must also be the same region that hosts your Snowflake account.
* For S3 external volumes that use an S3 access point:

  + You must configure the IAM policy for the external volume
    to grant permission to your S3 access point. For more information,
    see [Step 1: Create an IAM policy that grants access to your S3 location](../../user-guide/tables-iceberg-configure-external-volume-s3.md).
  + Multi-region access points aren’t supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following examples create external volumes that define writable storage locations with different cloud providers:

**Amazon S3**

The following example creates an external volume that defines an Amazon S3 storage location with encryption:

```sqlexample
CREATE OR REPLACE EXTERNAL VOLUME exvol
  STORAGE_LOCATIONS =
      (
        (
            NAME = 'my-s3-us-west-2'
            STORAGE_PROVIDER = 'S3'
            STORAGE_BASE_URL = 's3://my-example-bucket/'
            STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::123456789012:role/myrole'
            ENCRYPTION = ( TYPE = 'AWS_SSE_KMS' KMS_KEY_ID = '1234abcd-12ab-34cd-56ef-1234567890ab' )
        )
      )
  ALLOW_WRITES = TRUE;
```

**Google Cloud Storage**

The following example creates an external volume that defines a GCS storage location with encryption:

```sqlexample
CREATE EXTERNAL VOLUME exvol
  STORAGE_LOCATIONS =
    (
      (
        NAME = 'my-us-east-1'
        STORAGE_PROVIDER = 'GCS'
        STORAGE_BASE_URL = 'gcs://mybucket1/path1/'
        ENCRYPTION=(TYPE='GCS_SSE_KMS' KMS_KEY_ID = '1234abcd-12ab-34cd-56ef-1234567890ab')
      )
    )
  ALLOW_WRITES = TRUE;
```

**Microsoft Azure**

The following example creates an external volume that defines an Azure storage location with encryption:

```sqlexample
CREATE EXTERNAL VOLUME exvol
  STORAGE_LOCATIONS =
    (
      (
        NAME = 'my-azure-northeurope'
        STORAGE_PROVIDER = 'AZURE'
        STORAGE_BASE_URL = 'azure://exampleacct.blob.core.windows.net/my_container_northeurope/'
        AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
      )
    )
  ALLOW_WRITES = TRUE;
```

**S3-compatible storage**

Create an external volume that defines an S3-compatible storage location. For more information, see
[Configure an external volume for S3-compatible storage](../../user-guide/tables-iceberg-s3-compatible.md).

```sqlexample
CREATE OR REPLACE EXTERNAL VOLUME ext_vol_s3_compat
  STORAGE_LOCATIONS = (
    (
      NAME = 'my_s3_compat_storage_location'
      STORAGE_PROVIDER = 'S3COMPAT'
      STORAGE_BASE_URL = 's3compat://mybucket/unload/mys3compatdata'
      CREDENTIALS = (
        AWS_KEY_ID = '1a2b3c...'
        AWS_SECRET_KEY = '4x5y6z...'
      )
      STORAGE_ENDPOINT = 'example.com'
    )
  );
```

---
title: CREATE FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/create-failover-group.md
section: SQL Commands
---

# CREATE FAILOVER GROUP

Creates a new [failover group](../../user-guide/account-replication-intro.md) of specified objects in the system.

For more information about using failover groups, see [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md).

This command can be used to:

* Create a failover group in the source account to enable replication and failover of specified objects to a target account in
  the same organization.
* Create a secondary failover group in a target account as a replica of the primary failover group in the source account in the same
  organization.

See also:
:   [ALTER FAILOVER GROUP](alter-failover-group.md) , [DROP FAILOVER GROUP](drop-failover-group.md) , [SHOW FAILOVER GROUPS](show-failover-groups.md)

## Syntax

```sqlsyntax
CREATE FAILOVER GROUP [ IF NOT EXISTS ] <name>
    OBJECT_TYPES = <object_type> [ , <object_type> , ... ]
    [ ALLOWED_DATABASES = <db_name> [ , <db_name> , ... ] ]
    [ ALLOWED_EXTERNAL_VOLUMES = <external_volume_name> [ , <external_volume_name> , ... ] ]
    [ ALLOWED_SHARES = <share_name> [ , <share_name> , ... ] ]
    [ ALLOWED_INTEGRATION_TYPES = <integration_type_name> [ , <integration_type_name> , ... ] ]
    ALLOWED_ACCOUNTS = <org_name>.<target_account_name> [ , <org_name>.<target_account_name> ,  ... ]
    [ IGNORE EDITION CHECK ]
    [ REPLICATION_SCHEDULE = '{ <num> MINUTE | USING CRON <expr> <time_zone> }' ]
    [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
    [ ERROR_INTEGRATION = <integration_name> ]
```

**Secondary Failover Group**

```sqlsyntax
CREATE FAILOVER GROUP [ IF NOT EXISTS ] <secondary_name>
    AS REPLICA OF <org_name>.<source_account_name>.<name>
```

## Parameters

`name`
:   Specifies the identifier for the failover group. The identifier must start with an alphabetic character and cannot contain spaces or
    special characters unless the identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double
    quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`OBJECT_TYPES = object_type [ , object_type , ... ]`
:   Type(s) of objects for which you are enabling replication and failover from the source account to the target account.

    The following object types are supported:

    > ACCOUNT PARAMETERS:
    > :   All account-level parameters. This includes [account parameters](../parameters.md) and parameters that can be [set for
    >     your account](../../user-guide/admin-account-management.md).
    >
    > DATABASES:
    > :   Add database objects to the list of object types. If database objects are included in the list of specified object types, the
    >     `ALLOWED_DATABASES` parameter must be set.
    >
    > EXTERNAL VOLUMES:
    > :   Add external volume objects to the list of object types. If external volume objects are included in the list of specified object types,
    >     the `ALLOWED_EXTERNAL_VOLUMES` parameter must be set.
    >
    > INTEGRATIONS:
    > :   Currently, only security, API, storage, external access, and certain types of notification integrations are supported.
    >     For details, see [Integration replication](../../user-guide/account-replication-intro.md).
    >
    >     If integration objects are included in the list of specified object types, the
    >     `ALLOWED_INTEGRATION_TYPES` parameter must be set.
    >
    > LISTINGS:
    > :   Add listings to the list of object types. When adding listings to a failover group, adding shares is optional. Snowflake automatically
    >     selects all of the eligible listings and their shares for replication and failover. For more information, see [Listing support in Business Continuity and Disaster Recovery](../../collaboration/listings-bcdr.md).
    >
    > NETWORK POLICIES:
    > :   All network policies in the source account.
    >
    > PROFILES:
    > :   All profiles in the source account. Review [profile replication constraints](../../collaboration/listings-bcdr.md) for information about current constraints.
    >
    > RESOURCE MONITORS:
    > :   All resource monitors in the source account.
    >
    > ROLES:
    > :   All roles in the source account. Replicating roles implicitly includes all grants for object types included in the replication group.
    >     For example, if `ROLES` is the only object type that is replicated, then only hierarchies of roles (that is, roles granted to
    >     other roles) are replicated to target accounts. If the `USERS` object type is also included, then role grants to users are
    >     also replicated.
    >
    > SHARES:
    > :   Add share objects to the list of object types. If share objects are included in the list of specified object types, the
    >     `ALLOWED_SHARES` parameter must be set.
    >
    > USERS:
    > :   All users in the source account.
    >
    > WAREHOUSES:
    > :   All warehouses in the source account.

    > **Note:**
    >
    > If you replicate users and roles, programmatic access tokens for users are replicated automatically.

    To modify the list of replicated object types to a specified target account, use [ALTER FAILOVER GROUP](alter-failover-group.md) to reset the list of
    object types.

`ALLOWED_DATABASES = db_name [ , db_name , ... ]`
:   Specifies the database or list of databases for which you are enabling replication and failover from the source account to the target
    account. In order for you to set this parameter, the `OBJECT_TYPES` list must include `DATABASES`.

    `db_name`
    :   Specifies the identifier for the database.

`ALLOWED_EXTERNAL_VOLUMES = external_volume_name [ , external_volume_name , ... ]`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the external volume or list of external volumes for which you are enabling replication and failover from the source account to the target
    account. For you to set this parameter, the `OBJECT_TYPES` list must include `EXTERNAL VOLUMES`.

    `external_volume_name`
    :   Specifies the identifier for the external volume.

`ALLOWED_SHARES = share_name [ , share_name , ... ]`
:   Specifies the share or list of shares for which you are enabling replication and failover from the source account to the target account.
    For you to set this parameter, the `OBJECT_TYPES` list must include `SHARES`.

    `share_name`
    :   Specifies the identifier for the share.

`ALLOWED_INTEGRATION_TYPES = integration_type_name [ , integration_type_name , ... ]`
:   Type(s) of integrations for which you are enabling replication and failover from the source account to the target account.

    > This property requires that the `OBJECT_TYPES` list include `INTEGRATIONS` to set this parameter.
    >
    > The following integration types are supported:
    >
    > > SECURITY INTEGRATIONS:
    > > :   Specifies security integrations.
    > >
    > >     This property requires that the `OBJECT_TYPES` list include `ROLES`.
    > >
    > > API INTEGRATIONS:
    > > :   Specifies API integrations.
    > >
    > >     API integration replication requires additional set up after the API integration is replicated to the target account.
    > >     For more information, see [Updating the remote service for API integrations](../../user-guide/account-replication-config.md).
    > >
    > > STORAGE INTEGRATIONS:
    > > :   Specifies storage integrations.
    > >
    > > EXTERNAL ACCESS INTEGRATIONS:
    > > :   Specifies [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md).
    > >
    > >     For more information, see [Replication of stored procedures and user-defined functions (UDFs)](../../user-guide/account-replication-considerations.md).
    > >
    > > NOTIFICATION INTEGRATIONS:
    > > :   Specifies notification integrations.
    > >
    > >     Only some types of notification integrations are replicated. For details, see
    > >     [Integration replication](../../user-guide/account-replication-intro.md).

`ALLOWED_ACCOUNTS = org_name.target_account_name [ , org_name.target_account_name , ... ]`
:   Specifies the target account or list of target accounts to which replication and failover of specified objects from the source account is
    enabled. Secondary failover groups in the target accounts in this list can be promoted to serve as the primary failover group in
    case of failover.

    `org_name`
    :   Name of your Snowflake organization.

    `target_account_name`
    :   Target account to which you are enabling replication of the specified objects.

`IGNORE EDITION CHECK`
:   Allows replicating objects to accounts in the following scenario:

    > The primary failover group is in a Business Critical (or higher) account and a signed business associate agreement is in place to
    > store PHI data in the account per HIPAA and [HITRUST](../../user-guide/intro-cloud-platforms.md) regulations. However, no such agreement is in place
    > for one or more of the accounts approved for replication, regardless if they are Business Critical (or higher) accounts.

    This scenario is prohibited by default.

`REPLICATION_SCHEDULE ...`
:   Specifies the schedule for refreshing secondary failover groups.

    * `USING CRON expr time_zone`
      :   Specifies a cron expression and time zone for the secondary group refresh. Supports a subset of standard cron utility syntax.

          For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
          (in Wikipedia).

          The cron expression consists of the following fields:

          ```output
          # __________ minute (0-59)
          # | ________ hour (0-23)
          # | | ______ day of month (1-31, or L)
          # | | | ____ month (1-12, JAN-DEC)
          # | | | | __ day of week (0-6, SUN-SAT, or L)
          # | | | | |
          # | | | | |
            * * * * *
          ```

          The following special characters are supported:

          `*`
          :   Wildcard. Specifies any occurrence of the field.

          `L`
          :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a
              given month. In the day-of-month field, it specifies the last day of the month.

          `/n`
          :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
              specified in the month field, then the refresh is scheduled for April, July and October (i.e. every 3 months, starting with the 4th
              month of the year). The same schedule is maintained in subsequent years. That is, the refresh is not scheduled to run in
              January (3 months after the October run).

          > **Note:**
          > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
          >   for the account (or setting the value at the user or session level) does not change the time zone for the refresh.
          > + The cron expression defines all valid run times for the refresh. Snowflake attempts to refresh secondary groups based on
          >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
          > + When both a specific day of month and day of week are included in the cron expression, then the refresh is scheduled on days
          >   satisfying either the day of month or day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
          >   schedules a refresh at 0AM on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
    * `num MINUTE`
      :   Specifies an interval (in minutes) of wait time between refreshes. Accepts positive integers only.

          Also supports `num M` syntax.

          To avoid ambiguity, a *base interval time* is set:

          + When the object is created (using CREATE <object>) or
          + When a different interval is set (using ALTER <object> … SET REPLICATION_SCHEDULE)

          The base interval time starts the interval counter from the current clock time. For example, if an INTERVAL value of `10` is set and
          the scheduled refresh is enabled at 9:03 AM, then the refresh runs at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to
          ensure absolute precision, but only guarantee that refreshes do not execute before their set interval occurs (e.g. in the
          current example, the refresh could first run at 9:14 AM, but will definitely not run at 9:12 AM).

          > **Note:**
          >
          > The maximum supported value is `11520` (8 days). If the replication schedule has a greater `num MINUTE` value, the
          > refresh operation never runs.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ERROR_INTEGRATION = integration_name`
:   Specifies the name of the notification integration to use to send notifications when refresh errors occur for the failover
    group. For more details, see [Error notifications for replication and failover groups](../../user-guide/account-replication-error-notifications.md).

**Secondary Failover Group Parameters**

`secondary_name`
:   Specifies the identifier for the secondary failover group. The identifier must start with an alphabetic character and cannot contain
    spaces or special characters unless the identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in
    double quotes are also case-sensitive. For more details, see [Identifier requirements](../identifiers-syntax.md).

    The identifiers for the secondary failover group (`secondary_name`) and primary failover group (`name`) can be, but
    are not required to be, identical.

`AS REPLICA OF org_name.source_account_name.name`
:   Specifies the identifier of the primary failover group from which to create a secondary failover group.

    `org_name`
    :   Name of your Snowflake organization.

    `source_account_name`
    :   Source account from which you are enabling replication and failover of the specified objects.

    `name`
    :   Identifier for the primary failover group in the source account.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FAILOVER GROUP | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| MONITOR | Database | To add a database to a failover group, the active role must have the MONITOR privilege on the database. |
| USAGE | External volume | To add an external volume to a failover group, the active role must have the USAGE privilege on the external volume. |
| OWNERSHIP | Share | To add a share to a failover group, the active role must have the OWNERSHIP privilege on the share. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Identifiers for failover groups and replication groups in an account must be unique.
* Objects other than databases, external volumes, and shares must be in the same failover group.
* A database can only be added to one failover group.
* An external volume can only be added to one replication or failover group.
* [Inbound shares](../../user-guide/data-share-consumers.md) (shares from providers) *cannot* be added to a replication or failover group.
* To retrieve the set of accounts in your organization that are enabled for replication, use
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).
* If there are account objects (for example, users or roles) in a target account that you do not want to drop during replication,
  use the [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](../functions/system_link_account_objects_by_name.md) system function to apply a global identifier to objects
  created by means other than replication. For more information, see
  [Apply Global IDs to Objects Created by Scripts in Target Accounts](../../user-guide/account-replication-config.md) before
  you create a failover group.
* Automatically [scheduled refresh operations](../../user-guide/account-replication-intro.md) are executed using the role with the OWNERSHIP
  privilege on the group. If a scheduled refresh operation fails due to insufficient privileges, grant the required privileges
  to the role with the OWNERSHIP privilege on the group.

* If you create a replication or failover group with a tag or modify a replication or failover group by setting a tag on it,
  [tag inheritance](../../user-guide/object-tagging/inheritance.md) does not apply to any objects that you specify in the replication or failover group.

  Tag inheritance is only applicable to objects with a [parent-child relationship](../../user-guide/security-access-control-overview.md), such
  database, schema, and table. There are no child objects of replication or failover groups.
* You cannot set a tag or modify a tag on a secondary replication or failover group because these objects are read
  only.
* When you refresh a secondary replication or failover group, any tags that are set on the primary group are then set on
  the secondary group.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* For an account that is newly upgraded to Business Critical Edition (or higher), it might take up to 12 hours for failover capabilities to
  become available.

## Examples

### Create a failover group to enable replication and failover for a database

**Executed on source account**

Create a failover group named `myfg` to enable replication and failover of database `db1` from the source account to the
target account `myaccount2`. Set the replication schedule for `myfg` to refresh the database every 10 minutes:

```sqlexample
CREATE FAILOVER GROUP myfg
    OBJECT_TYPES = DATABASES
    ALLOWED_DATABASES = db1
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

**Executed on target account**

Create a failover group in the target account as a replica of the failover group `myfg` in the source account:

```sqlexample
CREATE FAILOVER GROUP myfg
    AS REPLICA OF myorg.myaccount1.myfg;
```

### Create a failover group to enable replication and failover for multiple databases

**Executed on source account**

Create a failover group named `myfg` in the source account to enable replication and failover of databases
`db1`, `db2`, `db3` from the source to the `myaccount2` account. Set the replication schedule for `myfg`
to refresh the databases every 10 minutes:

```sqlexample
CREATE FAILOVER GROUP myfg
    OBJECT_TYPES = DATABASES
    ALLOWED_DATABASES = db1, db2, db3
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

**Executed on target account**

Create a failover group in the target account as a replica of the failover group `myfg` in the source account:

```sqlexample
CREATE FAILOVER GROUP myfg
    AS REPLICA OF myorg.myaccount1.myfg;
```

### Create a failover group to enable replication and failover for account objects

**Executed on source account**

Create a failover group named `myfg` in the source account to enable replication and failover of users, roles, warehouses, resource
monitors, storage integrations, and notification integrations from the source account to the `myaccount2` account:

```sqlexample
CREATE FAILOVER GROUP myfg
    OBJECT_TYPES = USERS, ROLES, WAREHOUSES, RESOURCE MONITORS, INTEGRATIONS
    ALLOWED_INTEGRATION_TYPES = STORAGE INTEGRATIONS, NOTIFICATION INTEGRATIONS
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

**Executed on target account**

Create a failover group in the target account as a replica of the failover group `myfg` in the source account:

```sqlexample
CREATE FAILOVER GROUP myfg
    AS REPLICA OF myorg.myaccount1.myfg;
```

### Create a failover group to enable replication and failover for profiles

**Executed on source account**

Create a failover group named `myfg` in the source account to enable replication and failover of users, roles, and profiles from the source account to the `myaccount2` account:

```sqlexample
CREATE FAILOVER GROUP myfg
    OBJECT_TYPES = USERS, ROLES, PROFILES
    ALLOWED_ACCOUNTS = myorg.myaccount2;
```

**Executed on target account**

Create a failover group in the target account as a replica of the failover group `myfg` in the source account:

```sqlexample
CREATE FAILOVER GROUP myfg
    AS REPLICA OF myorg.myaccount1.myfg;
ALTER FAILOVER GROUP myfg REFRESH;
```

**Confirm replication of profiles**

To confirm that profiles are replicated on the target account, follow these steps:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) on the target account.
2. Navigate to Marketplace » Provider Studio » Profiles to view the profiles that were replicated from the source account.

### Create a failover group to enable replication and failover for security integrations and network policies

For more information and examples for replicating security integrations and network policies,
see [Replication of security integrations & network policies across multiple accounts](../../user-guide/account-replication-security-integrations.md).

---
title: CREATE FEATURE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-feature-policy.md
section: SQL Commands
---

# CREATE FEATURE POLICY

Creates a new [feature policy](../../developer-guide/native-apps/ui-consumer-feature-policies.md).

See also:
:   [ALTER FEATURE POLICY](alter-feature-policy.md) , [DESCRIBE FEATURE POLICY](desc-feature-policy.md), [DROP FEATURE POLICY](drop-feature-policy.md), [SHOW FEATURE POLICIES](show-feature-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] FEATURE POLICY [ IF NOT EXISTS ] <name>
  BLOCKED_OBJECT_TYPES_FOR_CREATION = ( <type> [ , ... ] )
  [ COMMENT = '<string-literal>' ]
```

## Parameters

`name`
:   Specifies the identifier for the feature policy.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`BLOCKED_OBJECT_TYPES_FOR_CREATION = ( type [ , ... ] )`
:   Specifies a list of objects that an app can’t create in the consumer account. The
    following objects can be blocked:

    * COMPUTE POOLS
    * WAREHOUSES
    * TASKS
    * DATABASES

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the feature policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FEATURE POLICY | SCHEMA | Grants the ability to create feature policies. You must have this privilege set on the schema containing the policy to be created. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If a policy is bound to an object, for example an account or an app, the policy cannot be replaced.
  Use the [ALTER FEATURE POLICY](alter-feature-policy.md) to update or rename the feature policy.
* This command does not support using the CLONE clause to create a copy of a feature policy.

## Examples

The following example creates a new feature policy that prohibits an app from creating a database:

```sqlexample
CREATE FEATURE POLICY block_create_db_policy
  BLOCKED_OBJECT_TYPES_FOR_CREATION = (DATABASES);
```

The following example creates a new feature policy, but doesn’t specify any objects to prohibit.

```sqlexample
CREATE FEATURE POLICY block_nothing_policy
  BLOCKED_OBJECT_TYPES_FOR_CREATION = ();
```

> **Note:**
>
> This syntax would typically be applied to an app to lift any restrictions that were applied at
> the account level.

---
title: CREATE FILE FORMAT
source: https://docs.snowflake.com/en/sql-reference/sql/create-file-format.md
section: SQL Commands
---

# CREATE FILE FORMAT

Creates a named file format that describes a set of staged data to access or load into Snowflake tables.

This command supports the following variants:

* CREATE OR ALTER FILE FORMAT: Creates a named file format if it doesn’t exist or alters an existing file format.

See also:
:   [ALTER FILE FORMAT](alter-file-format.md) , [DROP FILE FORMAT](drop-file-format.md) , [SHOW FILE FORMATS](show-file-formats.md) , [DESCRIBE FILE FORMAT](desc-file-format.md)

    [COPY INTO <location>](copy-into-location.md) , [COPY INTO <table>](copy-into-table.md) , [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY | VOLATILE } ] FILE FORMAT [ IF NOT EXISTS ] <name>
  [ TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] ]
  [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> formatTypeOptions ::=
> -- If TYPE = CSV
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      RECORD_DELIMITER = '<string>' | NONE
>      FIELD_DELIMITER = '<string>' | NONE
>      MULTI_LINE = TRUE | FALSE
>      FILE_EXTENSION = '<string>'
>      PARSE_HEADER = TRUE | FALSE
>      SKIP_HEADER = <integer>
>      SKIP_BLANK_LINES = TRUE | FALSE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      ESCAPE = '<character>' | NONE
>      ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
>      TRIM_SPACE = TRUE | FALSE
>      FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      EMPTY_FIELD_AS_NULL = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
>      ENCODING = '<string>' | UTF8
> -- If TYPE = JSON
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      DATE_FORMAT = '<string>' | AUTO
>      TIME_FORMAT = '<string>' | AUTO
>      TIMESTAMP_FORMAT = '<string>' | AUTO
>      BINARY_FORMAT = HEX | BASE64 | UTF8
>      TRIM_SPACE = TRUE | FALSE
>      MULTI_LINE = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
>      FILE_EXTENSION = '<string>'
>      ENABLE_OCTAL = TRUE | FALSE
>      ALLOW_DUPLICATE = TRUE | FALSE
>      STRIP_OUTER_ARRAY = TRUE | FALSE
>      STRIP_NULL_VALUES = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> -- If TYPE = AVRO
>      COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = ORC
>      TRIM_SPACE = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = PARQUET
>      COMPRESSION = AUTO | LZO | SNAPPY | NONE
>      SNAPPY_COMPRESSION = TRUE | FALSE
>      BINARY_AS_TEXT = TRUE | FALSE
>      USE_LOGICAL_TYPE = TRUE | FALSE
>      TRIM_SPACE = TRUE | FALSE
>      USE_VECTORIZED_SCANNER = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      NULL_IF = ( '<string>' [ , '<string>' ... ] )
> -- If TYPE = XML
>      COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
>      IGNORE_UTF8_ERRORS = TRUE | FALSE
>      PRESERVE_SPACE = TRUE | FALSE
>      STRIP_OUTER_ELEMENT = TRUE | FALSE
>      DISABLE_AUTO_CONVERT = TRUE | FALSE
>      REPLACE_INVALID_CHARACTERS = TRUE | FALSE
>      SKIP_BYTE_ORDER_MARK = TRUE | FALSE
> ```

## Variant syntax

### CREATE OR ALTER FILE FORMAT

Creates a new named file format if it doesn’t already exist, or transforms an existing file format into the one defined in the statement.
A CREATE OR ALTER FILE FORMAT statement follows the syntax rules of a CREATE FILE FORMAT statement and has the same limitations as an
[ALTER FILE FORMAT](alter-file-format.md) statement.

Supported alterations include changes to the formatTypeOptions and COMMENT properties.
You can’t alter the TYPE property.

For more information, see CREATE OR ALTER FILE FORMAT usage notes.

```sqlsyntax
CREATE OR ALTER [ { TEMP | TEMPORARY | VOLATILE } ] FILE FORMAT <name>
  [ TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ] ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier for the file format; must be unique for the schema in which the file format is created.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`), Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`{ TEMP | TEMPORARY | VOLATILE }`
:   Specifies that the file format persists only for the duration of the [session](../../user-guide/session-policies.md) that you created it in.
    A temporary file format is dropped at the end of the session.

    Default: No value. If a file format is not declared as `TEMPORARY`, the file format is permanent.

    If you want to avoid unexpected conflicts, avoid naming temporary file formats after file formats that already exist in the schema.

    If you created a temporary file format with the same name as another file format in the schema, all queries and operations used on the
    file format only affect the temporary file format in the session, until you drop the temporary file format. If you drop the file format
    using a DROP FILE FORMAT command, you drop the temporary file format, and not the file format that already exists in the schema.

`TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML [ ... ]`
:   Specifies the format of the input files (for data loading) or output files (for data unloading). Depending on the format type, you can
    specify additional format-specific options. For more information, see Format Type Options
    (in this topic).

    Valid values depend on whether the file format is for loading or unloading data:

    > `CSV` (for loading or unloading)
    > :   Any flat, delimited plain text file that uses specific characters such as the following:
    >
    >     * Separators for fields within records (for example, commas).
    >     * Separators for records (for example, new line characters).
    >
    >     Although the name (CSV) suggests comma-separated values, you can use any valid character as a field separator.
    >
    > `JSON` (for loading or unloading)
    > :   Any plain text file containing one or more JSON documents (such as objects or arrays). JSON is a semi-structured file format. The
    >     documents can be comma-separated and optionally enclosed in a big array. A single JSON document can span multiple lines.
    >
    >     > **Note:**
    >     >
    >     > * When you load data from files into tables, Snowflake supports either [NDJSON](https://github.com/ndjson/ndjson-spec) (newline delimited JSON)
    >     >   standard format or comma-separated JSON format.
    >     > * When you unload table data to files, Snowflake outputs *only* to NDJSON format.
    >
    > `AVRO` (for loading only; you can’t unload data to AVRO format)
    > :   Binary file in AVRO format.
    >
    > `ORC` (for loading only; you can’t unload data to ORC format)
    > :   Binary file in ORC format.
    >
    > `PARQUET` (for loading or unloading)
    > :   Binary file in PARQUET format.
    >
    > `XML` (for loading only; you can’t unload data to XML format)
    > :   Plain text file containing XML elements.

    For more information about CSV, see Usage Notes in this topic. For more information about JSON and the other semi-structured file formats,
    see [Introduction to loading semi-structured data](../../user-guide/semistructured-intro.md).

    Default: `CSV`

`COMMENT = 'string_literal'`
:   Specifies a comment for the file format.

    Default: No value

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`TYPE = ...`), you can include one or more of the following format-specific options
(separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified when loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate records in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   Data loading:
        :   New line character. Note that “new line” is logical such that `\r\n` will be understood as a new line for files on a Windows platform.

        Data unloading:
        :   New line character (`\n`).

`FIELD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate fields in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

            > > **Note:**
            > >
            > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.csv[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

    > **Note:**
    >
    > If the `SINGLE` copy option is `TRUE`, then the COPY command unloads a file without a file extension by default. To specify a file extension, provide a file name and extension in the
    > `internal_location` or `external_location` path (For example, `copy into @stage/data.csv`).

`PARSE_HEADER = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to use the first row headers in the data files to determine column names.

    This file format option is applied to the following actions only:

    > * Automatically detecting column definitions by using the INFER_SCHEMA function.
    > * Loading CSV data into separate columns by using the INFER_SCHEMA function and MATCH_BY_COLUMN_NAME copy option.

    If the option is set to TRUE, the first row headers will be used to determine column names. The default value FALSE will return column names as c\*, where \* is the position of the column.

    > **Note:**
    >
    > * This option isn’t supported for external tables.
    > * The SKIP_HEADER option isn’t supported if you set `PARSE_HEADER = TRUE`.

    Default:
    :   `FALSE`

`SKIP_HEADER = integer`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to load.

    Default:
    :   `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default:
    :   `FALSE`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of date values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) (data loading) or [DATE_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of time values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) (data loading) or [TIME_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of timestamp values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) (data loading) or [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the encoding format for binary input or output. The option can be used when loading data into or unloading data from binary columns in a table.

    Default:
    :   `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for enclosed or unenclosed field values. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for enclosed fields only. Specify the character used to enclose fields by setting `FIELD_OPTIONALLY_ENCLOSED_BY`.

        > **Note:**
        >
        > This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        > as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        > the option value.
        >
        > In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        > option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If this option is set, it overrides the escape character set for `ESCAPE_UNENCLOSED_FIELD`.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for unenclosed fields only.

        > **Note:**
        >
        > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
        >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, the load operation treats
        >   this row and the next row as a single row of data. To avoid this issue, set the value to `NONE`.
        > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        >   the option value.
        >
        >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If `ESCAPE` is set, the escape character set for that file format option overrides this option.

    Default:
    :   backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove white space from fields.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        As another example, if leading or trailing spaces surround quotes that enclose strings, you can remove the surrounding spaces using this option and the quote character using the
        `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any spaces within the quotes are preserved. For example, assuming `FIELD_DELIMITER = '|'` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

        ```sqlexample
        |"Hello world"|    /* loads as */  >Hello world<
        |" Hello world "|  /* loads as */  > Hello world <
        | "Hello world" |  /* loads as */  >Hello world<
        ```

        (the brackets in this example are not loaded; they are used to demarcate the beginning and end of the loaded strings)

    Default:
    :   `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex representation (`0x27`) or the double single-quoted escape (`''`).

        Data unloading only:
        :   When a field in the source table contains this character, Snowflake escapes it using the same character for unloading. For example, if the value is the double quote character and a field contains the string `A "B" C`, Snowflake escapes the double quotes for unloading as follows:

            `A ""B"" C`

    Default:
    :   `NONE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   String used to convert to and from SQL NULL:

        * When loading data, Snowflake replaces these values in the data load source with SQL NULL. To specify more than one string, enclose
          the list of strings in parentheses and use commas to separate each value.

          Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as
          a value, all instances of `2` as either a string or number are converted.

          For example:

          `NULL_IF = ('\N', 'NULL', 'NUL', '')`

          Note that this option can include empty strings.
        * When unloading data, Snowflake converts SQL NULL values to the first value in the list.

    Default:
    :   `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\`)

`ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to generate a parsing error if the number of delimited columns (i.e. fields) in an input file does not match the number of columns in the corresponding table.

        If set to `FALSE`, an error is not generated and the load continues. If the file is successfully loaded:

        * If the input file contains records with more fields than columns in the table, the matching fields are loaded in order of occurrence in the file and the remaining fields are not loaded.
        * If the input file contains records with fewer fields than columns in the table, the non-matching columns in the table are loaded with NULL values.

        This option assumes all the records within the input file are the same length (i.e. a file containing records of varying length return an error regardless of the value specified for this parameter).

    Default:
    :   `TRUE`

    > **Note:**
    >
    > When [transforming data during loading](../../user-guide/data-load-transform.md) (i.e. using a query as the source for the COPY command), this option is ignored. There is no requirement for your data files to have
    > the same number and ordering of columns as your target table.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`).

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies whether to insert SQL NULL for empty fields in an input file, which are represented by two successive delimiters (For example, `,,`).

          If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is inserted into columns of type STRING. For other column types, the COPY command produces an error.
        * When unloading data, this option is used in combination with `FIELD_OPTIONALLY_ENCLOSED_BY`. When `FIELD_OPTIONALLY_ENCLOSED_BY = NONE`, setting `EMPTY_FIELD_AS_NULL = FALSE` specifies to unload empty strings in tables to empty string values without quotes enclosing the field values.

          If set to `TRUE`, `FIELD_OPTIONALLY_ENCLOSED_BY` must specify a character to enclose strings.

    Default:
    :   `TRUE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

`ENCODING = 'string'`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String (constant) that specifies the character set of the source data when loading data into a table.

        | Character Set | `ENCODING` Value | Supported Languages | Notes |
        | --- | --- | --- | --- |
        | Big5 | `BIG5` | Traditional Chinese |  |
        | EUC-JP | `EUCJP` | Japanese |  |
        | EUC-KR | `EUCKR` | Korean |  |
        | GB18030 | `GB18030` | Chinese |  |
        | IBM420 | `IBM420` | Arabic |  |
        | IBM424 | `IBM424` | Hebrew |  |
        | IBM949 | `IBM949` | Korean |  |
        | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
        | ISO-2022-JP | `ISO2022JP` | Japanese |  |
        | ISO-2022-KR | `ISO2022KR` | Korean |  |
        | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
        | ISO-8859-5 | `ISO88595` | Russian |  |
        | ISO-8859-6 | `ISO88596` | Arabic |  |
        | ISO-8859-7 | `ISO88597` | Greek |  |
        | ISO-8859-8 | `ISO88598` | Hebrew |  |
        | ISO-8859-9 | `ISO88599` | Turkish |  |
        | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
        | KOI8-R | `KOI8R` | Russian |  |
        | Shift_JIS | `SHIFTJIS` | Japanese |  |
        | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
        | UTF-16 | `UTF16` | All languages |  |
        | UTF-16BE | `UTF16BE` | All languages |  |
        | UTF-16LE | `UTF16LE` | All languages |  |
        | UTF-32 | `UTF32` | All languages |  |
        | UTF-32BE | `UTF32BE` | All languages |  |
        | UTF-32LE | `UTF32LE` | All languages |  |
        | windows-874 | `WINDOWS874` | Thai |  |
        | windows-949 | `WINDOWS949` | Korean |  |
        | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
        | windows-1251 | `WINDOWS1251` | Russian |  |
        | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | windows-1253 | `WINDOWS1253` | Greek |  |
        | windows-1254 | `WINDOWS1254` | Turkish |  |
        | windows-1255 | `WINDOWS1255` | Hebrew |  |
        | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default:
    :   `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8 before it is loaded into Snowflake.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of date string values in the data files. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of time string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of timestamp string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the encoding format for binary string values in the data files. The option can be used when loading data into binary columns in a table.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `HEX`

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`MULTI_LINE = TRUE | FALSE`
:   Use: Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default:
    :   `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.json[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

`ENABLE_OCTAL = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that enables parsing of octal numbers.

    Default:
    :   `FALSE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to allow duplicate object field names (only the last one will be preserved).

    Default:
    :   `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove outer brackets (i.e. `[ ]`).

    Default:
    :   `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

        | Before | After |
        | --- | --- |
        | `[null]` | `[]` |
        | `[null,null,3]` | `[,,3]` |
        | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
        | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`.

> **Note:**
>
> We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | LZO | SNAPPY | NONE`
:   Use:
    :   Data unloading and external tables

    Definition:

    * When unloading data, specifies the compression algorith for columns in the Parquet files.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). . When unloading data, unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `LZO` | When unloading data, files are compressed using the Snappy algorithm by default. If unloading data to LZO-compressed files, specify this value. |
        | `SNAPPY` | When unloading data, files are compressed using the Snappy algorithm by default. You can optionally specify this value. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`SNAPPY_COMPRESSION = TRUE | FALSE`
:   Use:
    :   Data unloading only

        | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | Unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `SNAPPY` | May be specified if unloading Snappy-compressed files. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Definition:
    :   Boolean that specifies whether unloaded file(s) are compressed using the SNAPPY algorithm.

    > **Note:**
    >
    > Deprecated. Use `COMPRESSION = SNAPPY` instead.

    Limitations:
    :   Only supported for data unloading operations.

    Default:
    :   `TRUE`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`USE_LOGICAL_TYPE = TRUE | FALSE`
:   Use:
    :   Data loading, data querying in staged files, and schema detection.

    Definition:
    :   Boolean that specifies whether to use Parquet logical types. With this file format option, Snowflake can interpret Parquet logical types during data loading. For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). To enable Parquet logical types, set USE_LOGICAL_TYPE as TRUE when you create a new file format option.

    Limitations:
    :   Not supported for data unloading.

`USE_VECTORIZED_SCANNER = TRUE | FALSE`
:   Use:
    :   Data loading and data querying in staged files

    Definition:
    :   Boolean that specifies whether to use a vectorized scanner for loading Parquet files.

    Default:
    :   `FALSE`. In a future BCR, the default value will be `TRUE`.

    Using the vectorized scanner can significantly reduce the latency for loading Parquet files, because this scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

    If `USE_VECTORIZED_SCANNER` is set to `TRUE`, the vectorized scanner has the following behaviors:

    > * The `BINARY_AS_TEXT` option is always treated as `FALSE` and the `USE_LOGICAL_TYPE` option is always treated as `TRUE`, no matter what the actual value is being set to.
    > * The vectorized scanner supports Parquet map types. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >   {
    >   >    "k1": "v1",
    >   >    "k2": "v2"
    >   >   }
    >   > ```
    > * The vectorized scanner shows `NULL` values in the output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "nickname": null,
    >   >   "age": 34,
    >   >   "phone_numbers":
    >   >   [
    >   >     "1234567890",
    >   >     "0987654321",
    >   >     null,
    >   >     "6781234590"
    >   >   ]
    >   >   }
    >   > ```
    > * The vectorized scanner handles Time and Timestamp as follows:
    >
    >   > | Parquet | Snowflake vectorized scanner |
    >   > | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS/NANOS) | TIME |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_LTZ |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_NTZ |
    >   > | INT96 | TIMESTAMP_LTZ |

    If `USE_VECTORIZED_SCANNER` is set to `FALSE`, the scanner has the following behaviors:

    > * This option does not support Parquet maps. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >  {
    >   >   "key_value":
    >   >   [
    >   >    {
    >   >           "key": "k1",
    >   >           "value": "v1"
    >   >       },
    >   >       {
    >   >           "key": "k2",
    >   >           "value": "v2"
    >   >       }
    >   >     ]
    >   >   }
    >   > ```
    > * This option does not explicitly show `NULL` values in the scan output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "age": 34
    >   >   "phone_numbers":
    >   >   [
    >   >    "1234567890",
    >   >    "0987654321",
    >   >    "6781234590"
    >   >   ]
    >   >  }
    >   > ```
    > * This option handles Time and Timestamp as follows:
    >
    >   > | Parquet | When USE_LOGICAL_TYPE = TRUE | When USE_LOGICAL_TYPE = FALSE |
    >   > | --- | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS) | TIME | + TIME (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=NANOS) | TIME | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS) | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
    >   > | TimestampType(isAdjustedToUtc=True, unit=NANOS) | TIMESTAMP_LTZ | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS) | TIMESTAMP_NTZ | + TIMESTAMP_LTZ (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimestampType(isAdjustedToUtc=False, unit=NANOS) | TIMESTAMP_NTZ | INTEGER |
    >   > | INT96 | TIMESTAMP_NTZ | TIMESTAMP_NTZ |

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = XML

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`PRESERVE_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser preserves leading and trailing spaces in element content.

    Default:
    :   `FALSE`

`STRIP_OUTER_ELEMENT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser strips out the outer XML element, exposing 2nd level elements as separate documents.

    Default:
    :   `FALSE`

`DISABLE_AUTO_CONVERT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser disables automatic conversion of numeric and Boolean values from text to native representation.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FILE FORMAT | Schema |  |
| OWNERSHIP | File format | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER FILE FORMAT statement for an *existing* file format.   Note that in a [managed access schema](../../user-guide/security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## CREATE OR ALTER FILE FORMAT usage notes

* All limitations of the [ALTER FILE FORMAT](alter-file-format.md) command apply.
* You can’t turn a TEMP FILE FORMAT into a regular FILE FORMAT and vice versa.
* You can’t alter the TYPE property.

## Usage notes

> **Caution:**
>
> Recreating a file format (using CREATE OR REPLACE FILE FORMAT) breaks the association between the file format and any external table that
> references it. This is because an external table links to a file format using a hidden ID rather than the name of the file format.
> Behind the scenes, the CREATE OR REPLACE syntax drops an object and recreates it with a different hidden ID.
>
> If you must recreate a file format after it has been linked to one or more external tables, you must recreate each of the external tables
> (using CREATE OR REPLACE EXTERNAL TABLE) to reestablish the association. Call the [GET_DDL](../functions/get_ddl.md) function to
> retrieve a DDL statement to recreate each of the external tables.

* Conflicting file format values in a SQL statement produce an error. A conflict occurs when the same option is specified multiple times
  with different values (e.g. `...TYPE = 'CSV' ... TYPE = 'JSON'...`).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a CSV file format named `my_csv_format` that uses all the default CSV format options:

```sqlexample
CREATE OR REPLACE FILE FORMAT my_csv_format
  TYPE = CSV
  COMMENT = 'my_file_format';
```

Alter `my_csv_format` so that it defines the following rules for data files and unsets the comment:

* Fields are delimited using the pipe character (`|`).
* Files include a single header line that will be skipped.
* The strings `NULL` and `null` will be replaced with NULL values.
* Empty strings will be interpreted as NULL values.
* Files will be compressed/decompressed using GZIP compression.

```sqlexample
CREATE OR ALTER FILE FORMAT my_csv_format
  TYPE = CSV
  FIELD_DELIMITER = '|'
  SKIP_HEADER = 1
  NULL_IF = ('NULL', 'null')
  EMPTY_FIELD_AS_NULL = true
  COMPRESSION = gzip;
```

Create a JSON file format named `my_json_format` that uses all the default JSON format options:

```sqlexample
CREATE OR REPLACE FILE FORMAT my_json_format
  TYPE = JSON;
```

Create a PARQUET file format named `my_parquet_format` that uses PARQUET logical types, instead of physical types or the legacy converted types.

```sqlexample
CREATE OR REPLACE FILE FORMAT my_parquet_format
  TYPE = PARQUET
  USE_VECTORIZED_SCANNER = TRUE
  USE_LOGICAL_TYPE = TRUE;
```

---
title: CREATE FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/create-function.md
section: SQL Commands
---

# CREATE FUNCTION

Creates a new [UDF (user-defined function)](../../developer-guide/udf/udf-overview.md). Depending on how you configure it, the function can
return either scalar results or tabular results.

When you create a UDF, you specify a handler whose code is written in one of the supported languages. Depending on the handler’s language,
you can either include the handler source code in-line with the CREATE FUNCTION statement or reference the handler’s location from
CREATE FUNCTION, where the handler is precompiled or source code on a stage.

The following table lists each of the supported languages and whether its code may be kept in-line with CREATE FUNCTION or kept on a stage.
For more information, see [Keeping handler code in-line or on a stage](../../developer-guide/inline-or-staged.md).

| Language | Handler Location |
| --- | --- |
| [Java](../../developer-guide/udf/java/udf-java-introduction.md) | In-line or staged |
| [JavaScript](../../developer-guide/udf/javascript/udf-javascript-introduction.md) | In-line |
| [Python](../../developer-guide/udf/python/udf-python-introduction.md) | In-line or staged |
| [Scala](../../developer-guide/udf/scala/udf-scala-introduction.md) | In-line or staged |
| [SQL](../../developer-guide/udf/sql/udf-sql-introduction.md) | In-line |

This command supports the following variants:

* CREATE OR ALTER FUNCTION: Creates a function if it doesn’t exist or alters an existing function.

See also:
:   [ALTER FUNCTION](alter-function.md), [DROP FUNCTION](drop-function.md), [SHOW USER FUNCTIONS](show-user-functions.md) , [DESCRIBE FUNCTION](desc-function.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

The syntax for CREATE FUNCTION varies depending on which language you’re using as the UDF handler.

### Java handler

Use the syntax below if the source code is in-line:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE JAVA
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ RUNTIME_VERSION = <java_jdk_version> ]
  [ COMMENT = '<string_literal>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ PACKAGES = ( '<package_name_and_version>' [ , ... ] ) ]
  HANDLER = '<path_to_method>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  AS '<function_definition>'
```

Use the following syntax if the handler code will be referenced on a stage (such as in a JAR):

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE JAVA
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ RUNTIME_VERSION = <java_jdk_version> ]
  [ COMMENT = '<string_literal>' ]
  IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] )
  HANDLER = '<path_to_method>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
```

### JavaScript handler

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE JAVASCRIPT
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

### Python handler

Use the syntax below if the source code is in-line:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] [ AGGREGATE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE PYTHON
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  RUNTIME_VERSION = <python_version>
  [ COMMENT = '<string_literal>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ PACKAGES = ( '<package_name>[==<version>]' [ , ... ] ) ]
  [ ARTIFACT_REPOSITORY = '<repository_name>' ]
  HANDLER = '<function_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  AS '<function_definition>'
```

Use the following syntax if the handler code will be referenced on a stage (such as in a module):

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] [ AGGREGATE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE PYTHON
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  RUNTIME_VERSION = <python_version>
  [ COMMENT = '<string_literal>' ]
  [ ARTIFACT_REPOSITORY = '<repository_name>' ]
  IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] )
  [ PACKAGES = ( '<package_name>[==<version>]' [ , ... ] ) ]
  HANDLER = '<module_file_name>.<function_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
```

### Scala handler

Use the syntax below if the source code is in-line:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS <result_data_type>
  [ [ NOT ] NULL ]
  LANGUAGE SCALA
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ RUNTIME_VERSION = <scala_version> ]
  [ COMMENT = '<string_literal>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ PACKAGES = ( '<package_name_and_version>' [ , ... ] ) ]
  HANDLER = '<path_to_method>'
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  AS '<function_definition>'
```

Use the following syntax if the handler code will be referenced on a stage (such as in a JAR):

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION [ IF NOT EXISTS ] <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS <result_data_type>
  [ [ NOT ] NULL ]
  LANGUAGE SCALA
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  [ RUNTIME_VERSION = <scala_version> ]
  [ COMMENT = '<string_literal>' ]
  IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] )
  HANDLER = '<path_to_method>'
```

### SQL handler

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] FUNCTION <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  [ { VOLATILE | IMMUTABLE } ]
  [ MEMOIZABLE ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

## Variant syntax

### CREATE OR ALTER FUNCTION

Creates a new function if it doesn’t already exist, or transforms an existing function into the function defined in the statement.
A CREATE OR ALTER FUNCTION statement follows the syntax rules of a CREATE FUNCTION statement and has the same limitations as an
[ALTER FUNCTION](alter-function.md) statement.

Supported function alterations include:

* Change function properties and parameters. For example, SECURE, MAX_BATCH_ROWS, LOG_LEVEL, or COMMENT.
* Change function definition. For example, RUNTIME_VERSION, ARTIFACT_REPOSITORY (Python), PACKAGES, IMPORTS, return type, and function body.

For more information, see CREATE OR ALTER FUNCTION usage notes.

```sqlsyntax
CREATE [ OR ALTER ] FUNCTION ...
```

> **Note:**
>
> The COPY GRANTS parameter is not supported with this variant syntax.

## Required parameters

### All languages

`name ( [ arg_name arg_data_type [ DEFAULT default_value ] ] [ , ... ] )`
:   Specifies the identifier (`name`), any input arguments, and the default values for any optional arguments for the UDF.

    * For the identifier:

      + The identifier does not need to be unique for the schema in which the function is created because UDFs are
        [identified and resolved by the combination of the name and argument types](../../developer-guide/udf-stored-procedure-naming-conventions.md).
      + The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
        identifier string is enclosed in double quotes (for example, “My object”). Identifiers enclosed in double quotes are also
        case-sensitive. See [Identifier requirements](../identifiers-syntax.md).
    * For the input arguments:

      + For `arg_name`, specify the name of the input argument.
      + For `arg_data_type`, use the Snowflake data type that corresponds to the handler language that you are using.

        - For [Java handlers](../../developer-guide/udf/java/udf-java-introduction.md), see [SQL-Java Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [JavaScript handlers](../../developer-guide/udf/javascript/udf-javascript-introduction.md), see [SQL and JavaScript data type mapping](../../developer-guide/stored-procedure/stored-procedures-javascript.md).
        - For [Python handlers](../../developer-guide/udf/python/udf-python-introduction.md), see [SQL-Python Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [Scala handlers](../../developer-guide/udf/scala/udf-scala-introduction.md), see [SQL-Scala Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + To indicate that an argument is optional, use `DEFAULT default_value` to specify the default value of the argument.
        For the default value, you can use a literal or an expression.

        If you specify any optional arguments, you must place these after the required arguments.

        If a function has optional arguments, you cannot define additional functions with the same name and different signatures.

        For details, see [Specify optional arguments](../../developer-guide/udf-stored-procedure-arguments.md).

`RETURNS ...`
:   Specifies the results returned by the UDF, which determines the UDF type:

    > * `result_data_type`: Creates a scalar UDF that returns a single value with the specified data type.
    >
    >   > **Note:**
    >   >
    >   > For UDF handlers written in Java, Python, or Scala, the `result_data_type` must be in the `SQL Data Type` column of the
    >   > following table corresponding to the handler language:
    >   >
    >   > + [SQL-Java Type Mappings table](../../developer-guide/udf-stored-procedure-data-type-mapping.md)
    >   > + [SQL-Python Type Mappings table](../../developer-guide/udf-stored-procedure-data-type-mapping.md)
    >   > + [SQL-Scala Type Mappings table](../../developer-guide/udf-stored-procedure-data-type-mapping.md)
    > * `TABLE ( col_name col_data_type , ... )`: Creates a table UDF that returns tabular results with the specified table column(s)
    >   and column type(s).
    >
    >   > **Note:**
    >   >
    >   > For Scala UDFs, the TABLE return type is not supported.

`AS function_definition`
:   Defines the handler code executed when the UDF is called. The `function_definition` value must be source code in one of the
    languages supported for handlers. The code may be:

    * Java. For more information, see [Introduction to Java UDFs](../../developer-guide/udf/java/udf-java-introduction.md).
    * JavaScript. For more information, see [Introduction to JavaScript UDFs](../../developer-guide/udf/javascript/udf-javascript-introduction.md).
    * Python. For more information, see [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md).
    * Scala. For more information, see [Introduction to Scala UDFs](../../developer-guide/udf/scala/udf-scala-introduction.md).
    * A SQL expression or Snowflake Scripting block. For more information, see
      [Introduction to SQL UDFs](../../developer-guide/udf/sql/udf-sql-introduction.md).

    For more information, see General usage notes in this topic.

    > **Note:**
    >
    > The AS clause is not required when the UDF handler code is referenced on a stage with the IMPORTS clause.

### Java

`LANGUAGE JAVA`
:   Specifies that the code is in the Java language.

`RUNTIME_VERSION = java_jdk_version`
:   Specifies the Java JDK runtime version to use. The supported versions of Java are:

    * 11.x
    * 17.x

    If RUNTIME_VERSION is not set, Java JDK 11 is used.

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [ , ... ] )`
:   The location (stage), path, and name of the directory or file(s) to import.

    A file can be a JAR file or another type of file.

    If the file is a JAR file, it can contain one or more .class files and zero or more resource files.

    Java UDFs can also read non-JAR files. For an example, see [Reading a file specified statically in IMPORTS](../../developer-guide/udf/java/udf-java-cookbook.md).

    If you plan to copy a file (JAR file or other file) to a stage, then Snowflake recommends using a named internal stage because the
    PUT command supports copying files to named internal stages, and the PUT command is usually the easiest way to move a JAR file
    to a stage.

    External stages are allowed, but are not supported by PUT.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different subdirectories or different stages.

    If both the IMPORTS and TARGET_PATH clauses are present, the file name in the TARGET_PATH clause must be different
    from each file name in the IMPORTS clause, even if the files are in different subdirectories or different stages.

    Snowflake returns an error if the TARGET_PATH matches an existing file; you cannot use TARGET_PATH to overwrite an existing file.

    For a UDF whose handler is on a stage, the IMPORTS clause is required because it specifies the location of the JAR file that
    contains the UDF.

    For UDF whose handler code is in-line, the IMPORTS clause is needed only if the in-line UDF needs to access other files, such as
    libraries or text files.

    For Snowflake system packages, such as the [Snowpark package](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html),
    you can specify the package with the PACKAGES clause rather than specifying its JAR file with IMPORTS. When you do, the package
    JAR file need not be included in an IMPORTS value.

    **In-line Java**

    `AS function_definition`
    :   In-line Java UDFs require a function definition.

`HANDLER = handler_name`
:   The name of the handler method or class.

    * If the handler is for a scalar UDF, returning a non-tabular value, the HANDLER value should be a method name, as in the following
      form: `MyClass.myMethod`.
    * If the handler is for a tabular UDF, the HANDLER value should be the name of a handler class.

### JavaScript

`LANGUAGE JAVASCRIPT`
:   Specifies that the code is in the JavaScript language.

### Python

`LANGUAGE PYTHON`
:   Specifies that the code is in the Python language.

`RUNTIME_VERSION = python_version`
:   Specifies the Python version to use. The supported versions of Python are:

    Generally available versions:

    * 3.9 (deprecated)
    * 3.10
    * 3.11
    * 3.12
    * 3.13

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [ , ... ] )`
:   The location (stage), path, and name of the directory or file(s) to import.

    A file can be a `.py` file or another type of file.

    Python UDFs can also read non-Python files, such as text files. For an example, see [Reading files and assets](../../developer-guide/udf/python/udf-python-examples.md).

    If you plan to copy a file to a stage, then Snowflake recommends using a named internal stage because the
    PUT command supports copying files to named internal stages, and the PUT command is usually the easiest way to move a file
    to a stage.

    External stages are allowed, but are not supported by PUT.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different subdirectories or different stages.

    When the handler code is stored in a stage, you must use the IMPORTS clause to specify the handler code’s location.

    For an in-line Python UDF, the IMPORTS clause is needed only if the UDF handler needs to access other files, such as
    packages or text files.

    For packages included on the Snowflake system, such [numpy](https://numpy.org/doc/stable/),
    you can specify the package with the PACKAGES clause alone, omitting the package’s source as an IMPORTS value.

`HANDLER = handler_name`
:   The name of the handler function or class.

    * If the handler is for a scalar UDF, returning a non-tabular value, the HANDLER value should be a function name. If the handler code
      is in-line with the CREATE FUNCTION statement, you can use the function name alone. When the handler code is referenced at a stage, this
      value should be qualified with the module name, as in the following form: `my_module.my_function`.
    * If the handler is for a tabular UDF, the HANDLER value should be the name of a handler class.

### Scala

`LANGUAGE SCALA`
:   Specifies that the code is in the Scala language.

`RUNTIME_VERSION = scala_version`
:   Specifies the Scala runtime version to use. The supported versions of Scala are:

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Support for version 2.13 is in preview. Available to all accounts.

    * 2.13
    * 2.12

    For more information, see [Writing code to support different Scala versions](../../developer-guide/scala-version-differences.md).

    If RUNTIME_VERSION is not set, Scala 2.12 is used.

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [ , ... ] )`
:   The location (stage), path, and name of the directory or file(s) to import, such as a JAR or other kind of file.

    * The JAR file might contain handler dependency libraries. It can contain one or more .class files and zero or more resource files.
    * A non-JAR file might be a file read by handler code. For an example, see [Reading a file specified statically in IMPORTS](../../developer-guide/udf/java/udf-java-cookbook.md).

    If you plan to copy a file to a stage, then Snowflake recommends using a named internal stage because the PUT command supports
    copying files to named internal stages, and the PUT command is usually the easiest way to move a JAR file to a stage. External
    stages are allowed, but are not supported by PUT.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different stage subdirectories or different stages.

    If both the IMPORTS and TARGET_PATH clauses are present, the file name in the TARGET_PATH clause must be different
    from that of any file listed in the IMPORTS clause, even if the files are in different stage subdirectories or different stages.

    For a UDF whose handler is on a stage, the IMPORTS clause is required because it specifies the location of the JAR file that
    contains the UDF.

    For UDF whose handler code is in-line, the IMPORTS clause is needed only if the in-line UDF needs to access other files, such as
    libraries or text files.

    For Snowflake system packages, such as the [Snowpark package](https://docs.snowflake.com/en/developer-guide/snowpark/reference/java/index.html),
    you can specify the package with the PACKAGES clause rather than specifying its JAR file with IMPORTS. When you do, the package
    JAR file need not be included in an IMPORTS value.

    **In-line Scala**

    `AS function_definition`
    :   UDFs with in-line Scala handler code require a function definition.

`HANDLER = handler_name`
:   The name of the handler method or class.

    * If the handler is for a scalar UDF, returning a non-tabular value, the HANDLER value should be a method name, as in the following
      form: `MyClass.myMethod`.

## Optional parameters

### All languages

`SECURE`
:   Specifies that the function is secure. For more information about secure functions, see [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

`{ TEMP | TEMPORARY }`
:   Specifies that the function persists only for the duration of the [session](../../user-guide/session-policies.md) that you created it in. A
    temporary function is dropped at the end of the session.

    Default: No value. If a function is not declared as `TEMPORARY`, the function is permanent.

    You cannot create temporary [user-defined functions](../../developer-guide/udf/udf-overview.md) that have the same name as a function that already
    exists in the schema.

`[ [ NOT ] NULL ]`
:   Specifies whether the function can return NULL values or must return only NON-NULL values. The default is NULL (i.e. the function can
    return NULL).

    > **Note:**
    >
    > Currently, the `NOT NULL` clause is not enforced for SQL UDFs.
    > SQL UDFs declared as `NOT NULL` can return NULL values. Snowflake recommends avoiding `NOT NULL`
    > for SQL UDFs unless the code in the function is written to ensure that NULL values are never returned.

`CALLED ON NULL INPUT` or . `{ RETURNS NULL ON NULL INPUT | STRICT }`
:   Specifies the behavior of the UDF when called with null inputs. In contrast to system-defined functions, which always return null when any
    input is null, UDFs can handle null inputs, returning non-null values even when an input is null:

    * `CALLED ON NULL INPUT` will always call the UDF with null inputs. It is up to the UDF to handle such values appropriately.
    * `RETURNS NULL ON NULL INPUT` (or its synonym `STRICT`) will not call the UDF if any input is null. Instead, a null value
      will always be returned for that row. Note that the UDF might still return null for non-null inputs.

    > **Note:**
    >
    > `RETURNS NULL ON NULL INPUT` (`STRICT`) is not supported for SQL UDFs. SQL UDFs effectively use
    > `CALLED ON NULL INPUT`. In your SQL UDFs, you must handle null input values.

    Default: `CALLED ON NULL INPUT`

`{ VOLATILE | IMMUTABLE }`
:   Specifies the behavior of the UDF when returning results:

    > * `VOLATILE`: UDF might return different values for different rows, even for the same input (e.g. due to non-determinism and
    >   statefulness).
    > * `IMMUTABLE`: UDF assumes that the function, when called with the same inputs, will always return the same result. This guarantee
    >   is not checked. Specifying `IMMUTABLE` for a UDF that returns different values for the same input will result in undefined
    >   behavior.

    Default: `VOLATILE`

    > **Note:**
    >
    > IMMUTABLE is not supported on an aggregate function (when you use the AGGREGATE parameter). Therefore, all aggregate functions are
    > VOLATILE by default.

`COMMENT = 'string_literal'`
:   Specifies a comment for the UDF, which is displayed in the DESCRIPTION column in the [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md)
    output.

    Default: `user-defined function`

`COPY GRANTS`
:   Specifies to retain the access privileges from the original function when a new function is created using CREATE OR REPLACE FUNCTION.

    The parameter copies all privileges, except OWNERSHIP, from the existing function to the new function. The new function will
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE FUNCTION
    statement owns the new function.

    Note:

    * With [data sharing](../../user-guide/data-sharing-gs.md), if the existing function was shared to another account, the replacement function is
      also shared.
    * The [SHOW GRANTS](show-grants.md) output for the replacement function lists the grantee for the copied privileges as the
      role that executed the CREATE FUNCTION statement, with the current timestamp when the statement was executed.
    * The operation to copy grants occurs atomically in the CREATE FUNCTION command (i.e. within the same transaction).

### Java

`PACKAGES = ( 'package_name_and_version' [ , ... ] )`
:   The name and version number of Snowflake system packages required as dependencies. The value should be of the form
    `package_name:version_number`, where `package_name` is `snowflake_domain:package`. Note that you can
    specify `latest` as the version number in order to have Snowflake use the latest version available on the system.

    For example:

    ```sqlexample
    -- Use version 1.2.0 of the Snowpark package.
    PACKAGES=('com.snowflake:snowpark:1.2.0')

    -- Use the latest version of the Snowpark package.
    PACKAGES=('com.snowflake:snowpark:latest')
    ```

    You can discover the list of supported system packages by executing the following SQL in Snowflake:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'java';
    ```

    For a dependency you specify with PACKAGES, you do not need to also specify its JAR file in an IMPORTS clause.

    **In-line Java**

    `TARGET_PATH = stage_path_and_file_name_to_write`
    :   Specifies the location to which Snowflake should write the JAR file containing the result of compiling the handler source code specified
        in the `function_definition`.

        If this clause is included, Snowflake writes the resulting JAR file to the stage location specified by the clause’s value. If this
        clause is omitted, Snowflake re-compiles the source code each time the code is needed. In that case, the JAR file is not stored
        permanently, and the user does not need to clean up the JAR file.

        Snowflake returns an error if the TARGET_PATH matches an existing file; you cannot use TARGET_PATH to overwrite an
        existing file.

        The generated JAR file remains until you explicitly delete it, even if you drop the function. When you drop the UDF you should
        separately remove the JAR file because the JAR is no longer needed to support the UDF.

        For example, the following TARGET_PATH example would result in a `myhandler.jar` file generated and copied to the
        `handlers` stage.

        ```sqlexample
        TARGET_PATH = '@handlers/myhandler.jar'
        ```

        When you drop this UDF to remove it, you’ll also need to remove its handler JAR file, such as by executing the
        [REMOVE command](remove.md).

        ```sqlexample
        REMOVE @handlers/myhandler.jar;
        ```

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   The names of [external access integrations](create-external-access-integration.md) needed in order for this
    function’s handler code to access external networks.

    An external access integration specifies [network rules](create-network-rule.md) and
    [secrets](create-secret.md) that specify external locations and credentials (if any) allowed for use by handler code
    when making requests of an external network, such as an external REST API.

`SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
:   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
    secrets in handler code.

    Secrets you specify here must be allowed by the [external access integration](create-external-access-integration.md)
    specified as a value of this CREATE FUNCTION command’s EXTERNAL_ACCESS_INTEGRATIONS parameter

    This parameter’s value is a comma-separated list of assignment expressions with the following parts:

    * `secret_name` as the name of the allowed secret.

      You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
      EXTERNAL_ACCESS_INTEGRATIONS parameter.
    * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    For more information, including an example, refer to [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md).

### Python

`AGGREGATE`
:   Specifies that the function is an aggregate function. For more information about user-defined aggregate functions, see
    [Python user-defined aggregate functions](../../developer-guide/udf/python/udf-python-aggregate-functions.md).

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Using Python to write a handler for a user-defined aggregate function (UDAF) is a preview feature that is available to all accounts.

    > **Note:**
    >
    > IMMUTABLE is not supported on an aggregate function (when you use the AGGREGATE parameter). Therefore, all aggregate functions are
    > VOLATILE by default.

`ARTIFACT_REPOSITORY = repository_name`
:   Specifies the name of the repository to use for installing PyPI packages for use by your function.

    Snowflake installs these packages from the artifact repository.

    Specify a list of the names of the packages that you want to install and use in your function.

    Snowflake installs these packages from the artifact repository.

`PACKAGES = ( 'package_name_and_version' [ , ... ] )`
:   The name and version number of packages required as dependencies. The value should be of the form
    `package_name==version_number`. If you omit the version number, Snowflake will use the latest package available on the
    system.

    For example:

    ```sqlexample
    -- Use version 1.2.2 of the NumPy package.
    PACKAGES=('numpy==1.2.2')

    -- Use the latest version of the NumPy package.
    PACKAGES=('numpy')
    ```

    You can discover the list of supported system packages by executing the following SQL in Snowflake:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'python';
    ```

    For more information about included packages, see [Using third-party packages](../../developer-guide/udf/python/udf-python-packages.md).

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Specifying a range of Python package versions is available as a preview feature to all accounts.

    You can specify package versions by using these version
    specifiers: `==`, `<=`, `>=`, `<`,or `>`.

    For example:

    ```sqlexample
    -- Use version 1.2.3 or higher of the NumPy package.
    PACKAGES=('numpy>=1.2.3')
    ```

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   The names of [external access integrations](create-external-access-integration.md) needed in order for this
    function’s handler code to access external networks.

    An external access integration specifies [network rules](create-network-rule.md) and
    [secrets](create-secret.md) that specify external locations and credentials (if any) allowed for use by handler code
    when making requests of an external network, such as an external REST API.

`SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
:   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
    secrets in handler code.

    Secrets you specify here must be allowed by the [external access integration](create-external-access-integration.md)
    specified as a value of this CREATE FUNCTION command’s EXTERNAL_ACCESS_INTEGRATIONS parameter

    This parameter’s value is a comma-separated list of assignment expressions with the following parts:

    * `secret_name` as the name of the allowed secret.

      You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
      EXTERNAL_ACCESS_INTEGRATIONS parameter.
    * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    For more information, including an example, refer to [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md).

### SQL

`MEMOIZABLE`
:   Specifies that the function is memoizable.

    For more information, see [Memoizable UDFs](../../developer-guide/udf/sql/udf-sql-scalar-functions.md).

### Scala

`PACKAGES = ( 'package_name_and_version' [ , ... ] )`
:   The name and version number of Snowflake system packages required as dependencies. The value should be of the form
    `package_name:version_number`, where `package_name` is `snowflake_domain:package`. Note that you can
    specify `latest` as the version number in order to have Snowflake use the latest version available on the system.

    For example:

    ```sqlexample
    -- Use version 1.7.0 of the Snowpark package.
    PACKAGES=('com.snowflake:snowpark:1.7.0')

    -- Use the latest version of the Snowpark package.
    PACKAGES=('com.snowflake:snowpark:latest')
    ```

    You can discover the list of supported system packages by executing the following SQL in Snowflake:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'scala';
    ```

    For a dependency you specify with PACKAGES, you do not need to also specify its JAR file in an IMPORTS clause.

    `TARGET_PATH = stage_path_and_file_name_to_write`
    :   Specifies the location to which Snowflake should write the JAR file containing the result of compiling the handler source code specified
        in the `function_definition`.

        If this clause is included, Snowflake writes the resulting JAR file to the stage location specified by the clause’s value. If this
        clause is omitted, Snowflake re-compiles the source code each time the code is needed. In that case, the JAR file is not stored
        permanently, and the user does not need to clean up the JAR file.

        Snowflake returns an error if the TARGET_PATH matches an existing file; you cannot use TARGET_PATH to overwrite an
        existing file.

        The generated JAR file remains until you explicitly delete it, even if you drop the function. When you drop the UDF you should
        separately remove the JAR file because the JAR is no longer needed to support the UDF.

        For example, the following TARGET_PATH example would result in a `myhandler.jar` file generated and copied to the
        `handlers` stage.

        ```sqlexample
        TARGET_PATH = '@handlers/myhandler.jar'
        ```

        When you drop this UDF to remove it, you’ll also need to remove its handler JAR file, such as by executing the
        [REMOVE command](remove.md).

        ```sqlexample
        REMOVE @handlers/myhandler.jar;
        ```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FUNCTION | Schema | The privilege only enables the creation of user-defined functions in the schema.  If you want to enable the creation of data metric functions, the role must have the CREATE DATA METRIC FUNCTION privilege. |
| USAGE | Function | Granting the USAGE privilege on the newly created function to a role allows users with that role to call the function elsewhere in Snowflake (such as masking policy owner role for External Tokenization). |
| USAGE | External access integration | Required on integrations, if any, specified by the EXTERNAL_ACCESS_INTEGRATIONS parameter. For more information, see [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md). |
| READ | Secret | Required on secrets, if any, specified by the SECRETS parameter. For more information, see [Creating a secret to represent credentials](../../developer-guide/external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md). |
| USAGE | Schema | Required on schemas containing secrets, if any, specified by the SECRETS parameter. For more information, see [Creating a secret to represent credentials](../../developer-guide/external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

### All languages

* `function_definition` has size restrictions. The maximum allowable size is subject to change.
* The delimiters around the `function_definition` can be either single quotes or a pair of dollar signs.

  Using `$$` as the delimiter makes it easier to write functions that contain single quotes.

  If the delimiter for the body of the function is the single quote character,
  then any single quotes within `function_definition` (such as string
  literals) must be escaped by single quotes.
* If using a UDF in a [masking policy](create-masking-policy.md), ensure the data type of the column, UDF, and masking policy match. For
  more information, see [User-defined functions in a masking policy](../../user-guide/security-column-intro.md).
* If you specify the [CURRENT_DATABASE](../functions/current_database.md) or [CURRENT_SCHEMA](../functions/current_schema.md) function in the
  handler code of the UDF, the function returns the database or schema that contains the UDF, not the database or schema in use for
  the session.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Setting LOG_LEVEL or TRACE_LEVEL as properties in a CREATE FUNCTION statement is not supported. To set these properties
  on a function, use [ALTER FUNCTION](alter-function.md) after creating the function, or use
  CREATE OR ALTER FUNCTION.

### Java

* In Java, primitive data types don’t allow NULL values, so passing a NULL for an argument of such a type results in
  an error.
* In the HANDLER clause, the method name is case-sensitive.
* In the IMPORTS and TARGET_PATH clauses:

  + Package, class, and file name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for Snowflake system-defined dependencies, such as those
  from Snowpark. For other dependencies, specify dependency JAR files with the IMPORTS clause.
* Snowflake validates that:

  + The JAR file specified in the CREATE FUNCTION statement’s HANDLER exists and contains the specified
    class and method.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the Java method.

  Validation can be done at creation time or execution time, depending on whether you are connected to an active Snowflake warehouse.

  + Creation time — If you are connected to an active Snowflake warehouse at the time the CREATE FUNCTION statement is
    executed, the UDF is validated at creation time.
  + Execution time — If you are not connected to an active Snowflake warehouse, the UDF is created, but is not validated
    immediately, and Snowflake returns the following message:

    `Function <name> created successfully, but could not be validated since there is no active warehouse`.

### JavaScript

* Snowflake does not validate JavaScript code at UDF creation time. In other words, creation of the UDF succeeds regardless of whether
  the code is valid. If the code is not valid, Snowflake returns errors when the UDF is called at query time.

### Python

* In the HANDLER clause, the handler function name is case-sensitive.
* In the IMPORTS clause:

  + File name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for dependencies, such as those
  from Snowpark. For other dependencies, specify dependency files with the IMPORTS clause.
* Snowflake validates that:

  + The function or class specified in the CREATE FUNCTION statement’s HANDLER exists.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the handler.

### Scala

* In the HANDLER clause, the method name is case-sensitive.
* In the IMPORTS and TARGET_PATH clauses:

  + Package, class, and file name(s) are case-sensitive.
  + Stage name(s) are case-insensitive.
* You can use the PACKAGES clause to specify package names and version numbers for Snowflake system-defined dependencies, such as those
  from Snowpark. For other dependencies, specify dependency JAR files with the IMPORTS clause.
* Snowflake validates that:

  + The JAR file specified in the CREATE FUNCTION statement’s HANDLER exists and contains the specified
    class and method.
  + The input and output types specified in the UDF declaration are compatible with the input and output types
    of the Scala method.

  Validation can be done at creation time or execution time, depending on whether you are connected to an active Snowflake warehouse.

  + Creation time — If you are connected to an active Snowflake warehouse at the time the CREATE FUNCTION statement is
    executed, the UDF is validated at creation time.
  + Execution time — If you are not connected to an active Snowflake warehouse, the UDF is created, but is not validated
    immediately, and Snowflake returns the following message:

    `Function <name> created successfully, but could not be validated since there is no active warehouse`.

### SQL

* Currently, the NOT NULL clause is not enforced for SQL UDFs.

## CREATE OR ALTER FUNCTION usage notes

* All limitations of the [ALTER FUNCTION](alter-function.md) command apply.
* You cannot replace or transform a FUNCTION with a PROCEDURE or a PROCEDURE with a FUNCTION.
* You cannot replace or transform a temporary FUNCTION with a non-temporary FUNCTION, or a non-temporary FUNCTION with a temporary FUNCTION.
* You cannot replace or transform a regular FUNCTION with an EXTERNAL FUNCTION, or an EXTERNAL FUNCTION with a regular FUNCTION.
* Changing the LANGUAGE, HANDLER, VOLATILITY, NULL_HANDLING, TARGET_PATH properties, and the function input arguments is not supported.
* Setting or unsetting a tag is not supported. Existing tags are not altered by a CREATE OR ALTER FUNCTION statement and remain unchanged.

## Examples

### Java

Here is a basic example of CREATE FUNCTION with an in-line handler:

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVA
  CALLED ON NULL INPUT
  HANDLER = 'TestFunc.echoVarchar'
  TARGET_PATH = '@~/testfunc.jar'
  AS
  'class TestFunc {
    public static String echoVarchar(String x) {
      return x;
    }
  }';
```

Here is a basic example of CREATE FUNCTION with a reference to a staged handler:

```sqlexample
create function my_decrement_udf(i numeric(9, 0))
    returns numeric
    language java
    imports = ('@~/my_decrement_udf_package_dir/my_decrement_udf_jar.jar')
    handler = 'my_decrement_udf_package.my_decrement_udf_class.my_decrement_udf_method'
    ;
```

For more examples of Java UDFs, see [examples](../../developer-guide/udf/java/udf-java-cookbook.md).

### JavaScript

Create a JavaScript UDF named `js_factorial`:

```sqlexample
CREATE OR REPLACE FUNCTION js_factorial(d double)
  RETURNS double
  LANGUAGE JAVASCRIPT
  STRICT
  AS '
  if (D <= 0) {
    return 1;
  } else {
    var result = 1;
    for (var i = 2; i <= D; i++) {
      result = result * i;
    }
    return result;
  }
  ';
```

### Python

Code in the following example creates a `py_udf` function whose handler code is in-line as `udf`.

```sqlexample
CREATE OR REPLACE FUNCTION py_udf()
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.10'
  PACKAGES = ('numpy','pandas','xgboost==1.5.0')
  HANDLER = 'udf'
AS $$
import numpy as np
import pandas as pd
import xgboost as xgb
def udf():
    return [np.__version__, pd.__version__, xgb.__version__]
$$;
```

Code in the following example creates a `dream` function whose handler is in a `sleepy.py` file located on the
`@my_stage` stage.

```sqlexample
CREATE OR REPLACE FUNCTION dream(i int)
  RETURNS VARIANT
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.10'
  HANDLER = 'sleepy.snore'
  IMPORTS = ('@my_stage/sleepy.py')
```

### Scala

Here is a basic example of CREATE FUNCTION with an in-line handler:

Scala 2.12Scala 2.13 (Preview)

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  HANDLER='Echo.echoVarchar'
  AS
  $$
  class Echo {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  HANDLER='Echo.echoVarchar'
  AS
  $$
  class Echo {
    def echoVarchar(x : String): String = {
      return x
    }
  }
  $$;
```

Here is a basic example of CREATE FUNCTION with a reference to a staged handler:

Scala 2.12Scala 2.13 (Preview)

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.12
  IMPORTS = ('@udf_libs/echohandler.jar')
  HANDLER='Echo.echoVarchar';
```

```sqlexample
CREATE OR REPLACE FUNCTION echo_varchar(x VARCHAR)
  RETURNS VARCHAR
  LANGUAGE SCALA
  RUNTIME_VERSION = 2.13
  IMPORTS = ('@udf_libs/echohandler.jar')
  HANDLER='Echo.echoVarchar';
```

For more examples of Scala UDFs, see [Scala UDF handler examples](../../developer-guide/udf/scala/udf-scala-examples.md).

### SQL

Create a simple SQL scalar UDF that returns a hard-coded approximation of the
mathematical constant pi:

```sqlexample
CREATE FUNCTION pi_udf()
  RETURNS FLOAT
  AS '3.141592654::FLOAT'
  ;
```

Create a simple SQL table UDF that returns hard-coded values:

```sqlexample
CREATE FUNCTION simple_table_function ()
  RETURNS TABLE (x INTEGER, y INTEGER)
  AS
  $$
    SELECT 1, 2
    UNION ALL
    SELECT 3, 4
  $$
  ;
```

```sqlexample
SELECT * FROM TABLE(simple_table_function());
```

Output:

```sqlexample
SELECT * FROM TABLE(simple_table_function());
+---+---+
| X | Y |
|---+---|
| 1 | 2 |
| 3 | 4 |
+---+---+
```

Create a UDF that accepts multiple parameters:

```sqlexample
CREATE FUNCTION multiply1 (a number, b number)
  RETURNS number
  COMMENT='multiply two numbers'
  AS 'a * b';
```

Create a SQL table UDF named `get_countries_for_user` that returns the results of a query:

```sqlexample
CREATE OR REPLACE FUNCTION get_countries_for_user ( id NUMBER )
  RETURNS TABLE (country_code CHAR, country_name VARCHAR)
  AS 'SELECT DISTINCT c.country_code, c.country_name
      FROM user_addresses a, countries c
      WHERE a.user_id = id
      AND c.country_code = a.country_code';
```

### Create and alter a simple function using the CREATE OR ALTER FUNCTION command

Create a function `multiply` that accepts two numbers:

```sqlexample
CREATE OR ALTER FUNCTION multiply(a NUMBER, b NUMBER)
  RETURNS NUMBER
  AS 'a * b';
```

Alter `multiply` to add a comment and make the function secure:

```sqlexample
CREATE OR ALTER SECURE FUNCTION multiply(a NUMBER, b NUMBER)
  RETURNS NUMBER
  COMMENT = 'Multiply two numbers.'
  AS 'a * b';
```

---
title: CREATE FUNCTION (Snowpark Container Services)
source: https://docs.snowflake.com/en/sql-reference/sql/create-function-spcs.md
section: SQL Commands
---

# CREATE FUNCTION (Snowpark Container Services)

Creates a [service function](../../developer-guide/snowpark-container-services/working-with-services.md).

This command supports the following variants:

* CREATE OR ALTER FUNCTION (Snowpark Container Services): Creates a service function if it doesn’t exist or alters an existing service function.

See also:
:   [Service functions](../../developer-guide/snowpark-container-services/working-with-services.md), [CREATE EXTERNAL FUNCTION](create-external-function.md),
    [DESC FUNCTION](desc-function-spcs.md), [DROP FUNCTION](drop-function-spcs.md), [ALTER FUNCTION](alter-function-spcs.md),
    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS <result_data_type>
  [ [ NOT ] NULL ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ]
  SERVICE = <service_name>
  ENDPOINT = <endpoint_name>
  [ COMMENT = '<string_literal>' ]
  [ CONTEXT_HEADERS = ( <context_function_1> [ , <context_function_2> ...] ) ]
  [ MAX_BATCH_ROWS = <integer> ]
  [ MAX_BATCH_RETRIES = <integer> ]
  [ ON_BATCH_FAILURE = { ABORT | RETURN_NULL } ]
  [ BATCH_TIMEOUT_SECS = <integer> ]
  AS '<http_path_to_request_handler>'
```

## Variant syntax

### CREATE OR ALTER FUNCTION (Snowpark Container Services)

Creates a new service function if it doesn’t already exist, or transforms an existing service function into the service function
defined in the statement. A CREATE OR ALTER FUNCTION (Snowpark Container Services) statement follows the syntax rules of a CREATE
FUNCTION (Snowpark Container Services) statement and has the same limitations as an [ALTER FUNCTION (Snowpark Container Services)](alter-function-spcs.md)
statement.

Supported function alterations include changes to the following:

* CONTEXT_HEADERS
* SERVICE
* ENDPOINT
* MAX_BATCH_ROWS
* MAX_BATCH_RETRIES
* ON_BATCH_FAILURE
* BATCH_TIMEOUT_SECS

For more information, see CREATE OR ALTER FUNCTION (Snowpark Container Services) usage notes.

```sqlsyntax
CREATE [ OR ALTER ] FUNCTION ...
```

## Required parameters

`name`
:   Specifies the identifier (`name`) and any input arguments for the function.

    * The identifier does not need to be unique for the schema in which the function is created because functions are
      [identified and resolved by the combination of the name and argument types](../../developer-guide/udf-stored-procedure-naming-conventions.md).
    * The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
      identifier string is enclosed in double quotes (for example, “My object”). Identifiers enclosed in double quotes are also
      case-sensitive. See [Identifier requirements](../identifiers-syntax.md).

`( [ arg_name arg_data_type ] [ , ... ] )`
:   Specifies the arguments/inputs for the service function. These should correspond to the arguments that the
    service expects.

    If there are no arguments, then include the parentheses without any argument name(s) and data type(s).

`RETURNS result_data_type`
:   Specifies the data type of the result returned by the function.

`SERVICE = service_name`
:   Specifies the name of the Snowpark Container Services service.

`ENDPOINT = endpoint_name`
:   Specifies the name of the endpoint as defined in the service specification.

`AS http_path_to_request_handler`
:   Specifies the HTTP path to the service code that is executed when the function is called.

## Optional parameters

`[ [ NOT ] NULL ]`
:   Specifies whether the function can return NULL values or must return only NON-NULL values. The default is NULL (that is, the function can
    return NULL).

`CALLED ON NULL INPUT` or . `{ RETURNS NULL ON NULL INPUT | STRICT }`
:   Specifies the behavior of the function when called with null inputs. In contrast to system-defined functions, which always return null when any
    input is null, functions can handle null inputs, returning non-null values even when an input is null:

    * `CALLED ON NULL INPUT` will always call the function with null inputs. It’s up to the function to handle such values appropriately.
    * `RETURNS NULL ON NULL INPUT` (or its synonym `STRICT`) will not call the function if any input is null. Instead, a null value
      will always be returned for that row. Note that the function might still return null for non-null inputs.

    Default: `CALLED ON NULL INPUT`

`{ VOLATILE | IMMUTABLE }`
:   Specifies the behavior of the function when returning results:

    > * `VOLATILE`: function might return different values for different rows, even for the same input (for example, due to non-determinism and
    >   statefulness).
    > * `IMMUTABLE`: function assumes that the function, when called with the same inputs, will always return the same result. This guarantee
    >   is not checked. Specifying `IMMUTABLE` for a function that returns different values for the same input will result in undefined
    >   behavior.

    Default: `VOLATILE`

`MAX_BATCH_ROWS = integer`
:   Specifies the [batch size](../../developer-guide/snowpark-container-services/working-with-services.md) when sending data to a service to increase concurrency

`MAX_BATCH_RETRIES = integer`
:   Specifies the number of times you want Snowflake to retry a failed batch.

    Default: 3

`ON_BATCH_FAILURE = { ABORT | RETURN_NULL }`
:   Specifies the behavior of the function after Snowflake reaches the maximum number of retries processing the batch.

    * `ABORT`: Service function aborts execution. Any remaining batches of rows are not processed.
    * `RETURN_NULL`: Service function returns a NULL for each row in the failed batch and continues processing the remaining batches. If you choose this option, note the following caveats:

      + If these batches depend on each other and one batch fails, this could lead to unexpected results.
      + If your service can return a NULL as a valid response, then it is not possible to differentiate NULL returned by Snowflake due to batch failure and NULL returned by your service.

    Default: `ABORT`

`BATCH_TIMEOUT_SECS = integer`
:   Specifies the maximum duration for processing a single batch of rows, including retries (and polling for async function requests), after which Snowflake should terminate the batch request.

    Acceptable Values: greater than 0 and less than or equal to 604800 seconds (7 days).

    Default: 3600 seconds (1 hour)

`COMMENT = 'string_literal'`
:   Specifies a comment for the function, which is displayed in the DESCRIPTION column in the [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md)
    output.

    Default: `user-defined function`

`CONTEXT_HEADERS = ( context_function_1 [ , context_function_2 ...] )`
:   This binds Snowflake context function results to HTTP headers.
    (For more information about Snowflake context functions, see: [Context functions](../functions-context.md).)

    Not all context functions are supported in context headers. The following are supported:

    * CURRENT_ACCOUNT()
    * CURRENT_CLIENT()
    * CURRENT_DATABASE()
    * CURRENT_DATE()
    * CURRENT_IP_ADDRESS()
    * CURRENT_REGION()
    * CURRENT_ROLE()
    * CURRENT_SCHEMA()
    * CURRENT_SCHEMAS()
    * CURRENT_SESSION()
    * CURRENT_STATEMENT()
    * CURRENT_TIME()
    * CURRENT_TIMESTAMP()
    * CURRENT_TRANSACTION()
    * CURRENT_USER()
    * CURRENT_VERSION()
    * CURRENT_WAREHOUSE()
    * LAST_QUERY_ID()
    * LAST_TRANSACTION()
    * LOCALTIME()
    * LOCALTIMESTAMP()

    When function names are listed in the CONTEXT_HEADERS clause, the function names should not be quoted.

    Snowflake prepends `sf-context` to the header before it’s written to the HTTP request.

    Example:

    ```sqlexample
    CONTEXT_HEADERS = (current_timestamp)
    ```

    In this example, Snowflake writes the header `sf-context-current-timestamp` into the HTTP request.

    Context functions can generate characters that are illegal in HTTP header values, including (but not limited to) the following:

    * newline
    * `Ä`
    * `Î`
    * `ß`
    * `ë`
    * `¬`
    * `±`
    * `©`
    * `®`

    Snowflake replaces each sequence of one or more illegal characters with one space character. (The replacement
    is per sequence, not per character.)

    For example, suppose that the context function CURRENT_STATEMENT() returns the following:

    ```sqlexample
    select
      /*ÄÎßë¬±©®*/
      my_service_function(1);
    ```

    The value sent in `sf-context-current-statement` is the following:

    ```sqlexample
    select /* */ my_service_function(1);
    ```

    To ensure that your service code can access the original result (with illegal characters) from the context function
    even if illegal characters have been replaced, Snowflake also sends a binary context header that contains the
    context function result encoded in [base64](../binary-input-output.md).

    In the example above, the value sent in the base64-encoded header is the result of the following call:

    ```sqlexample
    base64_encode('select\n/ÄÎßë¬±©®/\nmy_service_function(1)')
    ```

    The remote service is responsible for decoding the base64 value if needed.

    Each such base64 header is named according to the following convention:

    ```sqlsyntax
    sf-context-<context-function>-base64
    ```

    In the example above, the name of the header would be the following:

    ```none
    sf-context-current-statement-base64
    ```

    If no context headers are sent, then no base64 context headers are sent.

    If the rows sent to a service function are split across multiple batches, then all batches contain the same
    context headers and the same binary context headers.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE FUNCTION | Schema |  |
| USAGE | Service Endpoint | Usage on a service endpoint is granted to service roles defined in the service specification. You then grant the service role to the role creating the service function. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER FUNCTION (Snowpark Container Services) usage notes

The following alterations are not supported:

* RETURNS
* Volatility (VOLATILE/IMMUTABLE)
* Null handling (CALLED ON NULL INPUT / RETURNS NULL ON NULL)

## Examples

### Create a simple service function

In [Tutorial-1](../../developer-guide/snowpark-container-services/tutorials/tutorial-1.md), you create the following service function:

```sqlexample
CREATE FUNCTION my_echo_udf (InputText VARCHAR)
  RETURNS VARCHAR
  SERVICE=echo_service
  ENDPOINT=echoendpoint
  AS '/echo';
```

This function connects with the specific ENDPOINT of the specified SERVICE. When you invoke this function, Snowflake sends a
request to the `/echo` path inside the service container.

Note the following:

* The `my_echo_udf` function takes a string as input and returns a string.
* The SERVICE property identifies the service (`echo_service`), and the ENDPOINT property identifies the user-friendly
  endpoint name (`echoendpoint`).
* The `AS '/echo'` specifies the path for the service. In `echo_service.py` (see service code), the `@app.post` decorator associates this
  path with the `echo` function.

### Alter a service function using the CREATE OR ALTER FUNCTION (Snowpark Container Services) command

Alter a function `my_echo_udf` to set the maximum number of batch rows to 100, and add a context header and endpoint:

```sqlexample
CREATE OR ALTER FUNCTION my_echo_udf (InputText VARCHAR)
  RETURNS VARCHAR
  SERVICE = echo_service
  ENDPOINT = reverse_echoendpoint
  CONTEXT_HEADERS = (current_account)
  MAX_BATCH_ROWS = 100
  AS '/echo';
```

---
title: CREATE GATEWAY
source: https://docs.snowflake.com/en/sql-reference/sql/create-gateway.md
section: SQL Commands
---

# CREATE GATEWAY

Creates a new [gateway](../../developer-guide/snowpark-container-services/gateway.md)
in the current schema. A gateway enables traffic splitting across multiple service endpoints.

See also:
:   [ALTER GATEWAY](alter-gateway.md) , [DESCRIBE GATEWAY](desc-gateway.md), [DROP GATEWAY](drop-gateway.md) , [SHOW GATEWAYS](show-gateways.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] GATEWAY [ IF NOT EXISTS ] <name>
  FROM SPECIFICATION <specification_text>
```

## Required parameters

`name`
:   String that specifies the identifier for the gateway; it must be unique for the schema in which the gateway is created.

`FROM SPECIFICATION`
:   Specifies the gateway specification inline. The specification defines the traffic split configuration.

    The specification uses the following format:

    ```yaml
    spec:
      type: traffic_split
      split_type: custom
      targets:
      - type: endpoint
        value: <db>.<schema>.<service>!<endpoint>
        weight: <weight>
      - type: endpoint
        value: <db>.<schema>.<service>!<endpoint>
        weight: <weight>
    ```

## Specification parameters

`type`
:   Fixed value. Must be set to `traffic_split`.

`split_type`
:   Fixed value. Must be set to `custom`.

`targets`
:   A list of target endpoints to route traffic to. Each target must specify:

    `type`
    :   Fixed value. Must be set to `endpoint`.

    `value`
    :   The fully qualified endpoint name in the format `db.schema.service!endpoint`. Each target endpoint must exist.

    `weight`
    :   The traffic weight for this endpoint, specified as an integer. All weights must add up to 100.

> **Note:**
>
> * Maximum number of endpoints per gateway is 5 by default.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE GATEWAY | Schema | Required to create a gateway in the schema. |
| BIND SERVICE ENDPOINT | Account | Required to bind service endpoints to the gateway. |
| USAGE | Database | Required on the database containing the gateway. |
| USAGE | Schema | Required on the schema containing the gateway. |
| USAGE | Service endpoints | Required on the target service endpoints. Grant the service role `ALL_ENDPOINTS_USAGE` to provide access. |

To grant the required privileges, use the following commands:

```sqlexample
-- Grant CREATE GATEWAY privilege in the schema
GRANT CREATE GATEWAY ON SCHEMA <schema_name> TO ROLE <role_name>;

-- Grant BIND SERVICE ENDPOINT privilege on the account
GRANT BIND SERVICE ENDPOINT ON ACCOUNT TO ROLE <role_name>;

-- Grant USAGE on target endpoints via service role
GRANT SERVICE ROLE <db_name>.<schema_name>.<service_name>!ALL_ENDPOINTS_USAGE TO ROLE <role_name>;
```

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a gateway that splits traffic between two service endpoints:

```sqlexample
CREATE GATEWAY split_gateway
  FROM SPECIFICATION $$
spec:
  type: traffic_split
  split_type: custom
  targets:
  - type: endpoint
    value: db.schema.s2!ep1
    weight: 60
  - type: endpoint
    value: db.schema.s1!ep1
    weight: 40
$$;
```

Create or replace a gateway with a new traffic split configuration:

```sqlexample
CREATE OR REPLACE GATEWAY split_gateway
  FROM SPECIFICATION $$
spec:
  type: traffic_split
  split_type: custom
  targets:
  - type: endpoint
    value: db.schema.service1!endpoint1
    weight: 70
  - type: endpoint
    value: db.schema.service2!endpoint1
    weight: 30
$$;
```

---
title: CREATE GIT REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/create-git-repository.md
section: SQL Commands
---

# CREATE GIT REPOSITORY

Creates a Snowflake Git repository clone in the schema or replaces an existing Git repository clone.

For an overview, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md).

See also:
:   [ALTER GIT REPOSITORY](alter-git-repository.md), [DESCRIBE GIT REPOSITORY](desc-git-repository.md), [DROP GIT REPOSITORY](drop-git-repository.md),
    [SHOW GIT BRANCHES](show-git-branches.md), [SHOW GIT REPOSITORIES](show-git-repositories.md), [SHOW GIT TAGS](show-git-tags.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] GIT REPOSITORY [ IF NOT EXISTS ] <name>
  ORIGIN = '<repository_url>'
  API_INTEGRATION = <integration_name>
  [ GIT_CREDENTIALS = <secret_name> ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

## Required parameters

`name`
:   Specifies the identifier for the Git repository clone to create.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ORIGIN = 'repository_url'`
:   Specifies the origin URL of the remote Git repository that this Git repository clone represents. The URL must use HTTPS.

    Snowflake supports any HTTPS Git repository URL. For example, you can specify a custom URL to a corporate Git server within your own
    domain.

    From the command line, you can use the `git config` command from within your local repository to get the value to use for the
    ORIGIN parameter, as shown in the following example:

    ```none
    $ git config --get remote.origin.url
    https://github.com/mycompany/My-Repo.git
    ```

`API_INTEGRATION = integration_name`
:   Specifies the [API INTEGRATION](create-api-integration.md) that contains information about the remote Git
    repository such as allowed credentials and prefixes for target URLs.

    The API integration you specify here must have an API_PROVIDER parameter whose value is set to `git_https_api`.

    For reference information about API integrations, see [CREATE API INTEGRATION](create-api-integration.md).

## Optional parameters

`GIT_CREDENTIALS = secret_name`
:   Specifies the Snowflake [secret](create-secret.md) containing the credentials to use for authenticating with the
    remote Git repository. Omit this parameter to use the default secret specified by the API integration or if this integration does not require
    authentication.

    As a best practice, use a personal access token for the secret’s PASSWORD value. For information about creating a personal access token
    in GitHub, see [Managing your personal access tokens](https://docs.github.com/en/authentication/keeping-your-account-and-data-secure/managing-your-personal-access-tokens)
    in the GitHub documentation.

    The secret you specify here must be a secret specified by the ALLOWED_AUTHENTICATION_SECRETS parameter of the API integration you specify
    with this command’s API_INTEGRATION parameter.

    Default: No value

    For reference information about secrets, see [CREATE SECRET](create-secret.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the Git repository clone.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE GIT REPOSITORY | Schema |  |
| USAGE | API integration | The integration specified by this command’s API INTEGRATION parameter |
| USAGE | Secret | The secret specified by this command’s GIT_CREDENTIALS parameter |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Code in the following example creates a Git repository clone called `snowflake_extensions`, where the remote repository’s origin URL
is `https://github.com/my-account/snowflake-extensions.git`. The example uses an API integration called `git_api_integration`.
It also uses a secret called `git_secret` to store credentials for authenticating with the remote repository.

For details about setting up integration with a remote Git repository, see [Setting up Snowflake to use Git](../../developer-guide/git/git-setting-up.md).

```sqlexample
CREATE OR REPLACE GIT REPOSITORY snowflake_extensions
  API_INTEGRATION = git_api_integration
  GIT_CREDENTIALS = git_secret
  ORIGIN = 'https://github.com/my-account/snowflake-extensions.git';
```

---
title: CREATE HYBRID TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-hybrid-table.md
section: SQL Commands
---

# CREATE HYBRID TABLE

Creates a new hybrid table in the current/specified schema or replaces an existing table. A table can have multiple columns,
with each column definition consisting of a name, data type, and optionally whether the column:

* Requires a NOT NULL value.
* Has a default value or is an identity column.
* Has any inline constraints.

> **Note:**
>
> When you create a hybrid table, you must define a PRIMARY KEY constraint on one or more columns.

You can also use the following CREATE TABLE variants to create hybrid tables:

* CREATE HYBRID TABLE … AS SELECT (CTAS) (creates a populated table; also referred to as CTAS)
* CREATE HYBRID TABLE … LIKE (creates an empty copy of an existing hybrid table)

For the full CREATE TABLE syntax used for standard Snowflake tables, see [CREATE TABLE](create-table.md).

> **Tip:**
>
> Before creating and using hybrid tables, you should become familiar with some
> [unsupported features and limitations](../../user-guide/tables-hybrid-limitations.md).

See also:
:   [CREATE INDEX](create-index.md) [DROP INDEX](drop-index.md), [SHOW INDEXES](show-indexes.md), [ALTER TABLE](alter-table.md) , [DROP TABLE](drop-table.md) , [SHOW TABLES](show-tables.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] HYBRID TABLE [ IF NOT EXISTS ] <table_name>
  ( <col_name> <col_type>
    [
      {
        DEFAULT <expr>
        | { AUTOINCREMENT | IDENTITY }
          [
            {
              ( <start_num> , <step_num> )
              | START <num> INCREMENT <num>
            }
          ]
          [ { ORDER | NOORDER } ]
      }
    ]
    [ NOT NULL ]
    [ inlineConstraint ]
    [ COLLATE '<collation_specification>' ]
    [ COMMENT '<string_literal>' ]
    [ , <col_name> <col_type> [ ... ] ]
    [ , outoflineConstraint ]
    [ , outoflineIndex ]
    [ , ... ]
  )
  [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> inlineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   { UNIQUE | PRIMARY KEY | { [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ] } }
>   [ <constraint_properties> ]
>
> outoflineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   { UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
>       REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>   }
>   [ <constraint_properties> ]
>   [ COMMENT '<string_literal>' ]
>
> outoflineIndex ::=
>   INDEX <index_name> ( <col_name> [ , <col_name> , ... ] )
>     [ INCLUDE ( <col_name> [ , <col_name> , ... ] ) ]
> ```
>
> For inline and out-of-line constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`col_name`
:   Specifies the column identifier (i.e. name). All the requirements for table identifiers also apply to column identifiers.

    For more details, see [Identifier requirements](../identifiers-syntax.md) and [Reserved & limited keywords](../reserved-keywords.md).

    > **Note:**
    >
    > In addition to the standard reserved keywords, the following keywords cannot be used as column identifiers because they are reserved for ANSI-standard context functions:
    >
    > * `CURRENT_DATE`
    > * `CURRENT_ROLE`
    > * `CURRENT_TIME`
    > * `CURRENT_TIMESTAMP`
    > * `CURRENT_USER`
    >
    > For the list of reserved keywords, see [Reserved & limited keywords](../reserved-keywords.md).

`col_type`
:   Specifies the data type for the column.

    For details about the data types that can be specified for table columns, see [SQL data types reference](../../sql-reference-data-types.md).

`PRIMARY KEY ( col_name [ , col_name , ... ] )`
:   Specifies the required primary key constraint for the table, either within a column definition (inline) or separately (out-of-line).
    See also Constraints for hybrid tables.

    For complete syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md). For general information about constraints, see
    [Constraints](../constraints.md).

## Optional parameters

`DEFAULT ...` or . `AUTOINCREMENT ...`
:   Specifies whether a default value is automatically inserted in the column if a value is not explicitly specified via an INSERT or
    CREATE HYBRID TABLE AS SELECT statement:

    > `DEFAULT expr`
    > :   Column default value is defined by the specified expression which can be any of the following:
    >
    >     * Constant value.
    >     * Simple expression.
    >     * Sequence reference (`seq_name.NEXTVAL`).
    >
    >     A simple expression is an expression that returns a scalar value; however, the expression cannot contain
    >     references to:
    >
    >     * Subqueries.
    >     * Aggregates.
    >     * Window functions.
    >     * External functions.
    >
    > `{ AUTOINCREMENT | IDENTITY }` . `[ { ( start_num , step_num ) | START num INCREMENT num } ]` . `[ { ORDER | NOORDER } ]`
    > :   When `AUTOINCREMENT` is used, the default value for the column starts with a specified number and each successive
    >     value is automatically generated. Values generated by an `AUTOINCREMENT` column are guaranteed to be unique. The
    >     difference between any pair of the generated values is guaranteed to be a multiple of the increment amount.
    >
    >     The optional `ORDER` and `NOORDER` parameters specify whether or not the generated values provide ordering
    >     guarantees as specified in [Sequence Semantics](../../user-guide/querying-sequences.md). `NOORDER` is the default option for `AUTOINCREMENT`
    >     columns on hybrid tables. `NOORDER` typically provides significantly better performance for point writes.
    >
    >     These parameters can only be used for columns with numeric data types (NUMBER, INT, FLOAT, etc.)
    >
    >     `AUTOINCREMENT` and `IDENTITY` are synonymous. If either is specified for a column, Snowflake utilizes a
    >     sequence to generate the values for the column. For more information about sequences, see
    >     [Using Sequences](../../user-guide/querying-sequences.md).
    >
    >     The default value for both start and step/increment is `1`.

    Default: No value (the column has no default value)

    > **Note:**
    >
    > * `DEFAULT` and `AUTOINCREMENT` are mutually exclusive; only one can be specified for a column.
    > * For performance-sensitive workloads, `NOORDER` is the recommended option for `AUTOINCREMENT` columns.

`CONSTRAINT ...`
:   Defines an inline or out-of-line constraint for the specified column(s) in the table. UNIQUE and FOREIGN KEY constraints
    are optional for hybrid table columns. See also Constraints for hybrid tables.

    For complete syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md). For general information about constraints, see
    [Constraints](../constraints.md).

`COLLATE 'collation_specification'`
:   Specifies the collation to use for column operations such as string comparisons. This parameter applies only to
    [text columns](../data-types-text.md) that are not indexed. For more information,
    see Collations on hybrid table columns and [Collation specifications](../collation.md).

`INDEX index_name ( col_name [ , col_name , ... ]`
:   Specifies a secondary index on one or more columns in the table. (When you define constraints on hybrid table columns,
    indexes are automatically created on those columns.)

    Indexes cannot be defined on the following columns:

    * [Semi-structured columns](../data-types-semistructured.md) (VARIANT, OBJECT, ARRAY)
      because of space constraints associated with the underlying storage engines for the key of each record.
    * [Geospatial columns](../data-types-geospatial.md) (GEOGRAPHY, GEOMETRY) or
      [VECTOR columns](../data-types-vector.md).
    * [TIMESTAMP_TZ](../data-types-datetime.md) columns (or [TIMESTAMP](../data-types-datetime.md)
      columns that resolve to TIMESTAMP_TZ). TIMESTAMP_NTZ columns are supported.

    Indexes can be defined when the table is created, or with the CREATE INDEX command. For more information about creating indexes for
    hybrid tables, see [Index hybrid tables](../../user-guide/tables-hybrid-index.md) and [CREATE INDEX](create-index.md).

`INCLUDE ( col_name [ , col_name , ... ] )`
:   Specifies one or more included columns for a secondary index. Using included columns with a secondary index is
    particularly useful when queries frequently contain a set of columns in the SELECT list but not in
    the list of WHERE predicates. See [INCLUDE columns](../../user-guide/tables-hybrid-index.md).

    INCLUDE columns cannot be semi-structured columns (VARIANT, OBJECT, ARRAY) or geospatial columns (GEOGRAPHY, GEOMETRY).

`COMMENT = 'string_literal'`
:   Specifies a comment at the column, constraint, or table level. For details, see [Comments on constraints](create-table-constraint.md).

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE TABLE | Schema | Note that there is no CREATE HYBRID TABLE privilege. |
| SELECT | Table, external table, view | Required on queried tables and/or views only when cloning a table or executing CTAS statements. |
| APPLY | Masking policy, row access policy, tag | Required only when applying a masking policy, row access policy, object tags, or any combination of these [governance](../../guides-overview-govern.md) features when creating tables. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

## Usage notes

* To recreate or replace a hybrid table, call the [GET_DDL](../functions/get_ddl.md) function to see the definition of the
  hybrid table before running a CREATE OR REPLACE HYBRID TABLE command.
* You cannot create hybrid tables that are [temporary or transient](../../user-guide/tables-temp-transient.md). In turn, you cannot
  create hybrid tables within transient schemas or databases.
* A schema cannot contain tables and/or views with the same name. When creating a table:

  + If a view with the same name already exists in the schema, an error is returned and the table is not created.
  + If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the
    optional `OR REPLACE` keyword is included in the command.
  > **Important:**
  >
  > Using `OR REPLACE` is the equivalent of using [DROP TABLE](drop-table.md) on the existing table and then
  > creating a new table with the same name.
  >
  > Note that the drop and create actions occur in a single atomic operation. This means that any queries concurrent with the
  > CREATE OR REPLACE TABLE operation use either the old or new table version.
  >
  > Recreating or swapping a table drops its change data.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* For information about cloning hybrid tables, see [Clone databases that contain hybrid tables](../../user-guide/tables-hybrid-clone.md).
* Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as
  column names.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Constraints for hybrid tables

The following rules apply to constraints that are defined on hybrid tables.

* A hybrid table must be created with a PRIMARY KEY constraint.

  Multi-column (or composite) primary keys are supported. To define a multi-column primary key, use the
  syntax shown in the following example, where the constraint is defined “out of line” and refers to
  multiple columns that were previously defined for the table:

  ```sqlexample
  CREATE OR REPLACE HYBRID TABLE ht2pk (
    col1 INTEGER NOT NULL,
    col2 INTEGER NOT NULL,
    col3 VARCHAR,
    CONSTRAINT pkey_1 PRIMARY KEY (col1, col2)
    );
  ```
* PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints are all enforced on hybrid tables, and you cannot set the NOT ENFORCED
  property on these constraints.
* PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints build their own underlying indexes. The creation of indexes results in
  additional data being stored. Secondary (or covering) indexes can also be defined explicitly when the table is created,
  using the `outoflineIndex` syntax.
* Constraints are enforced at the row level, not at the statement or transaction level (that is, deferred constraints).
* Constraints can only be defined at table creation.
* You cannot alter a column to be UNIQUE.

The following rules apply specifically to FOREIGN KEY constraints:

* A foreign key in a hybrid table that references a primary key cannot be NULL. If you attempt to
  load a NULL value into a column that has a FOREIGN KEY constraint, the load operation fails with a constraint error.
  See Create two hybrid tables with a primary-key/foreign-key relationship.
* FOREIGN KEY constraints are supported only among hybrid tables that belong to the same database.
* The referenced table from a FOREIGN KEY constraint cannot be truncated as long as the FOREIGN KEY relationship exists.
* FOREIGN KEY constraints do not support partial matching.
* FOREIGN KEY constraints do not support deferrable behavior.
* FOREIGN KEY constraints only support [RESTRICT and NO ACTION properties](create-table-constraint.md)
  for DELETE and UPDATE operations.

## Collations on hybrid table columns

Collations are not supported on PRIMARY KEY columns and other indexed columns in hybrid tables. However, if you do not intend to
index a column, and the column has a [character data type](../data-types-text.md), you can specify a COLLATE clause for
that column.

For example:

```sqlexample
CREATE OR REPLACE HYBRID TABLE ht1 (c1 INT PRIMARY KEY, c2 VARCHAR(10) COLLATE 'de');

DESCRIBE TABLE ht1;
```

```output
+------+--------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type                     | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+--------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| C1   | NUMBER(38,0)             | COLUMN | N     | NULL    | Y           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| C2   | VARCHAR(10) COLLATE 'de' | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+--------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

In some cases, you might need to disable collation for hybrid table columns by using the `DEFAULT_DDL_COLLATION = ''` syntax,
which applies to all columns in the table. You might need to do this when a default collation is set at the account level or for all columns
in all tables in a schema or database.

For example:

```sqlexample
ALTER SCHEMA ht SET DEFAULT_DDL_COLLATION = 'de';

CREATE OR REPLACE HYBRID TABLE ht2 (c1 INT PRIMARY KEY, c2 VARCHAR(10),
  INDEX idx_c2 (c2));
```

```output
391464 (0A000): SQL compilation error: Collations are not supported on primary keys or indexed columns.
```

```sqlexample
CREATE OR REPLACE HYBRID TABLE ht2 (c1 INT PRIMARY KEY, c2 VARCHAR(10),
  INDEX idx_c2 (c2))
  DEFAULT_DDL_COLLATION = '';
```

```output
+---------------------------------+
| status                          |
|---------------------------------|
| Table HT2 successfully created. |
+---------------------------------+
```

Table `ht2` is defined without a collation setting on the indexed column `c2`, despite the fact that
the DEFAULT_DDL_COLLATION parameter is set to `'de'` at the schema level.

For general information about collations, see [Collation control](../collation.md).

## CREATE HYBRID TABLE … AS SELECT (CTAS)

Creates a new hybrid table that contains the results of a query:

> ```sqlsyntax
> CREATE [ OR REPLACE ] HYBRID TABLE <table_name> [ ( <col_name> [ <col_type> ] , <col_name> [ <col_type> ] , ... ) ]
>   [ ... ]
>   AS <query>
> ```
>
> > **Note:**
> >
> > When you use a CTAS statement to create a hybrid table, you must define the table schema explicitly. You must specify the
> > following table properties in the syntax before the definition of the query:
> >
> > * Column definitions
> > * A PRIMARY KEY constraint
> > * Other constraints, as needed (UNIQUE, NOT NULL, FOREIGN KEY)
> > * Secondary indexes (and any INCLUDE columns)
> >
> > The schema of the new hybrid table can’t be inferred from a SELECT statement.

The number of column names specified must match the number of [SELECT](select.md) list items in the query.

To create the table with rows in a specific order, use an ORDER BY clause at the end of the query.

For information about loading hybrid tables, see [Loading data](../../user-guide/tables-hybrid-create.md).

## CREATE HYBRID TABLE … LIKE

Creates a new hybrid table with the same column definitions as an existing hybrid table, but without copying data from the
existing table.

Column names, types, defaults, constraints, and indexes are copied to the new table:

> ```sqlsyntax
> CREATE [ OR REPLACE ] HYBRID TABLE <table_name> LIKE <source_hybrid_table>
>   [ ... ]
> ```
>
> > **Note:**
> >
> > CREATE HYBRID TABLE … LIKE only supports another hybrid table as the source table type.
> >
> > CREATE HYBRID TABLE … LIKE for a table with an auto-increment sequence accessed through a data share is
> > not supported.

## Examples

Create a hybrid table in the current database with `customer_id` as the primary key, a unique constraint on `email`,
and a secondary index on `full_name`:

```sqlexample
CREATE HYBRID TABLE mytable (
  customer_id INT AUTOINCREMENT PRIMARY KEY,
  full_name VARCHAR(255),
  email VARCHAR(255) UNIQUE,
  extended_customer_info VARIANT,
  INDEX index_full_name (full_name)
);
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| Table MYTABLE successfully created. |
+-------------------------------------+
```

Insert a row into this table:

```sqlexample
INSERT INTO mytable (customer_id, full_name, email, extended_customer_info)
  SELECT 100, 'Jane Doe', 'jdoe@example.com',
    parse_json('{"address": "1234 Main St", "city": "San Francisco", "state": "CA", "zip":"94110"}');
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       1 |
+-------------------------+
```

The primary key must be unique. For example, if you try to insert the same primary key from the previous example a second time,
the command fails with the following error:

```output
200001 (22000): Primary key already exists
```

The email address must also follow the inline UNIQUE constraint. For example, if you attempt to insert two records with the
same email address, the statement fails with the following error:

```output
Duplicate key value violates unique constraint "SYS_INDEX_MYTABLE_UNIQUE_EMAIL"
```

View table properties and metadata. Note the value of the `is_hybrid` column:

```sqlexample
SHOW TABLES LIKE 'mytable';
```

```output
+-------------------------------+---------+---------------+-------------+-------+-----------+---------+------------+------+-------+--------+----------------+----------------------+-----------------+---------------------+------------------------------+---------------------------+-------------+
| created_on                    | name    | database_name | schema_name | kind  | is_hybrid | comment | cluster_by | rows | bytes | owner  | retention_time | automatic_clustering | change_tracking | search_optimization | search_optimization_progress | search_optimization_bytes | is_external |
|-------------------------------+---------+---------------+-------------+-------+-----------+---------+------------+------+-------+--------+----------------+----------------------+-----------------+---------------------+------------------------------+---------------------------+-------------|
| 2022-02-23 23:53:19.707 +0000 | MYTABLE | MYDB          | PUBLIC      | TABLE | Y         |         |            | NULL |  NULL | MYROLE | 10             | OFF                  | OFF             | OFF                 |                         NULL |                      NULL | N           |
+-------------------------------+---------+---------------+-------------+-------+-----------+---------+------------+------+-------+--------+----------------+----------------------+-----------------+---------------------+------------------------------+---------------------------+-------------+
```

View details for all hybrid tables:

```sqlexample
SHOW HYBRID TABLES;
```

```output
+-------------------------------+---------------------------+---------------+-------------+--------------+--------------+------+-------+---------+
| created_on                    | name                      | database_name | schema_name | owner        | datastore_id | rows | bytes | comment |
|-------------------------------+---------------------------+---------------+-------------+--------------+--------------+------+-------+---------|
| 2022-02-24 02:07:31.877 +0000 | MYTABLE                   | DEMO_DB       | PUBLIC      | ACCOUNTADMIN |         2002 | NULL |  NULL |         |
+-------------------------------+---------------------------+---------------+-------------+--------------+--------------+------+-------+---------+
```

Display information about the columns in the table:

```sqlexample
DESCRIBE TABLE mytable;
```

```output
+-------------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
| name              | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |
|-------------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------|
| CUSTOMER_ID       | NUMBER(38,0) | COLUMN | N     | NULL    | Y           | N          | NULL  | NULL       | NULL    | NULL        |
| FULL_NAME         | VARCHAR(256) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
| APPLICATION_STATE | VARIANT      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        |
+-------------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+
```

Select data from the table:

```sqlexample
SELECT customer_id, full_name, email, extended_customer_info
  FROM mytable
  WHERE extended_customer_info['state'] = 'CA';
```

```output
+-------------+-----------+------------------+------------------------------+
| CUSTOMER_ID | FULL_NAME | EMAIL            | EXTENDED_CUSTOMER_INFO       |
|-------------+-----------+------------------+------------------------------|
|         100 | Jane Doe  | jdoe@example.com | {                            |
|             |           |                  |   "address": "1234 Main St", |
|             |           |                  |   "city": "San Francisco",   |
|             |           |                  |   "state": "CA",             |
|             |           |                  |   "zip": "94110"             |
|             |           |                  | }                            |
+-------------+-----------+------------------+------------------------------+
```

### Create two hybrid tables with a primary-key/foreign-key relationship

This example shows the creation of two hybrid tables that reference each other. The first table, `team`, has a
PRIMARY KEY constraint on its `team_id` column. The second table, `player`, has a FOREIGN KEY constraint on
its `team_id` column, which references the `team_id` column in the `team` table.

```sqlexample
CREATE OR REPLACE HYBRID TABLE team
  (team_id INT PRIMARY KEY,
  team_name VARCHAR(40),
  stadium VARCHAR(40));

CREATE OR REPLACE HYBRID TABLE player
  (player_id INT PRIMARY KEY,
  first_name VARCHAR(40),
  last_name VARCHAR(40),
  team_id INT,
  FOREIGN KEY (team_id) REFERENCES team(team_id));
```

You can verify that referential integrity is enforced by inserting some rows into both tables. You can also
confirm that NULL values are not allowed in columns defined as foreign keys.

The first insert into the `player` table succeeds as expected. The second insert fails because `3`
does not exist as an ID in the `team` table. The third insert fails because NULL is not allowed as a foreign key.

```sqlexample
INSERT INTO team VALUES (1, 'Bayern Munich', 'Allianz Arena');
INSERT INTO player VALUES (100, 'Harry', 'Kane', 1);
INSERT INTO player VALUES (301, 'Gareth', 'Bale', 3);
```

```output
200009 (22000): Foreign key constraint "SYS_INDEX_PLAYER_FOREIGN_KEY_TEAM_ID_TEAM_TEAM_ID" was violated.
```

```sqlexample
INSERT INTO player VALUES (200, 'Tommy', 'Atkins', NULL);
```

```output
200009 (22000): Foreign key constraint "SYS_INDEX_PLAYER_FOREIGN_KEY_TEAM_ID_TEAM_TEAM_ID" was violated.
```

```sqlexample
SELECT * FROM team t, player p WHERE t.team_id=p.team_id;
```

```output
+---------+---------------+---------------+-----------+------------+-----------+---------+
| TEAM_ID | TEAM_NAME     | STADIUM       | PLAYER_ID | FIRST_NAME | LAST_NAME | TEAM_ID |
|---------+---------------+---------------+-----------+------------+-----------+---------|
|       1 | Bayern Munich | Allianz Arena |       100 | Harry      | Kane      |       1 |
+---------+---------------+---------------+-----------+------------+-----------+---------+
```

A possible workaround for the rejection of NULL in this case is to insert a “dummy” row into the
`team` table with a team ID of `0`. Then you can insert rows into the `player` table that use a
matching placeholder value of `0` instead of NULL. For example:

```sqlexample
INSERT INTO team VALUES (0, 'Unknown', 'Unknown');
INSERT INTO player VALUES (200, 'Tommy', 'Atkins', 0);

SELECT * FROM team t, player p WHERE t.team_id=p.team_id;
```

```output
+---------+---------------+---------------+-----------+------------+-----------+---------+
| TEAM_ID | TEAM_NAME     | STADIUM       | PLAYER_ID | FIRST_NAME | LAST_NAME | TEAM_ID |
|---------+---------------+---------------+-----------+------------+-----------+---------|
|       1 | Bayern Munich | Allianz Arena |       100 | Harry      | Kane      |       1 |
|       0 | Unknown       | Unknown       |       200 | Tommy      | Atkins    |       0 |
+---------+---------------+---------------+-----------+------------+-----------+---------+
```

### Create a hybrid table with a comment on the primary key column

Create a hybrid table that includes a comment within the column definition for the primary key.

```sqlexample
CREATE OR REPLACE HYBRID TABLE ht1pk
  (COL1 NUMBER(38,0) NOT NULL COMMENT 'Primary key',
  COL2 NUMBER(38,0) NOT NULL,
  COL3 VARCHAR(16777216),
  CONSTRAINT PKEY_1 PRIMARY KEY (COL1));

DESCRIBE TABLE ht1pk;
```

```output
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+-------------+-------------+----------------+
| name | type              | kind   | null? | default | primary key | unique key | check | expression | comment     | policy name | privacy domain |
|------+-------------------+--------+-------+---------+-------------+------------+-------+------------+-------------+-------------+----------------|
| COL1 | NUMBER(38,0)      | COLUMN | N     | NULL    | Y           | N          | NULL  | NULL       | Primary key | NULL        | NULL           |
| COL2 | NUMBER(38,0)      | COLUMN | N     | NULL    | N           | N          | NULL  | NULL       | NULL        | NULL        | NULL           |
| COL3 | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL        | NULL        | NULL           |
+------+-------------------+--------+-------+---------+-------------+------------+-------+------------+-------------+-------------+----------------+
```

Note that if you put this comment in the CONSTRAINT clause, the comment will not be visible in the DESCRIBE TABLE output. You can query
the [TABLE_CONSTRAINTS view](../info-schema/table_constraints.md) to see complete information about constraints.

---
title: CREATE ICEBERG TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table.md
section: SQL Commands
---

# CREATE ICEBERG TABLE

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) in the current/specified schema.

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md), [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

This section provides an overview of the syntax for *all* types of Iceberg tables.
The syntax for creating an Iceberg table varies considerably depending on whether you use Snowflake as the Iceberg catalog
or an external Iceberg catalog.

To view the syntax, parameter descriptions, usage notes, and examples for specific use cases, see the following pages:

* **Snowflake as the Iceberg catalog**

  + [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](create-iceberg-table-snowflake.md)
* **External Iceberg catalog**

  + [CREATE ICEBERG TABLE (REST or Snowflake Open Catalog)](create-iceberg-table-rest.md)

    > **Tip:**
    >
    > To automatically bring the tables in your remote REST catalog into Snowflake, you can [create a catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md). With a
    > catalog-linked database, you don’t need to create individual externally managed
    > Iceberg tables to access the existing tables in your remote catalog from Snowflake. In addition, you can use the [CREATE ICEBERG TABLE (catalog-linked database)](create-iceberg-table-rest.md)
    > or [CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT](create-iceberg-table-rest.md) variant syntax with your catalog-linked database to create
    > new remote Iceberg tables from Snowflake.
  + [CREATE ICEBERG TABLE (Delta files in object storage)](create-iceberg-table-delta.md)
  + [CREATE ICEBERG TABLE (Iceberg files in object storage)](create-iceberg-table-iceberg-files.md)

### Snowflake as the Iceberg catalog

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name> (
    -- Column definition
    <col_name> <col_type> [ DEFAULT <col_default> ]
      [ inlineConstraint ]
      [ NOT NULL ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] PROJECTION POLICY <policy_name> ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ DEFAULT <col_default> ] [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )
  [ PARTITION BY ( partitionExpression [, partitionExpression , ...] ) ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = 'SNOWFLAKE' ]
  [ BASE_LOCATION = '<directory_for_table_files>' ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ CATALOG_SYNC = '<open_catalog_integration_name>']
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
```

Where:

> ```sqlsyntax
> inlineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE
>     | PRIMARY KEY
>     | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
> ```
>
> For additional inline constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> outoflineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
>       REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
> ```
>
> > **Note:**
> >
> > * Snowflake represents columns defined as PRIMARY KEY as identifier fields in the Iceberg metadata. The IDs for these columns are populated
> >   in the metadata as [identifier field IDs](https://iceberg.apache.org/spec/#identifier-field-ids).
> > * Snowflake doesn’t enforce NOT NULL and UNIQUE constraints on PRIMARY KEY columns for Iceberg tables.
>
> For additional out-of-line constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> partitionExpression ::=
>   <col_name> -- identity transform
>   | BUCKET ( <num_buckets> , <col_name> )
>   | TRUNCATE ( <width> , <col_name> )
>   | YEAR ( <col_name> )
>   | MONTH ( <col_name> )
>   | DAY ( <col_name> )
>   | HOUR ( <col_name> )
> ```

For more information, see [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](create-iceberg-table-snowflake.md).

#### CREATE ICEBERG TABLE … AS SELECT (also referred to as CTAS)

> ```sqlsyntax
> CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> [ ( <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , ... ) ]
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ EXTERNAL_VOLUME = '<external_volume_name>' ]
>   [ CATALOG = 'SNOWFLAKE' ]
>   [ BASE_LOCATION = '<relative_path_from_external_volume>' ]
>   [ COPY GRANTS ]
>   [ ICEBERG_VERSION = <integer> ]
>   [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
>   [ ... ]
>   AS SELECT <query>
> ```

For more information, see [CREATE ICEBERG TABLE … AS SELECT](create-iceberg-table-snowflake.md).

#### CREATE ICEBERG TABLE … LIKE

> ```sqlsyntax
> CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> LIKE <source_table>
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ COPY GRANTS ]
>   [ ... ]
> ```

For more information, see [CREATE ICEBERG TABLE … LIKE](create-iceberg-table-snowflake.md).

### External Iceberg catalog

#### Iceberg REST (including Snowflake Open Catalog)

> **Tip:**
>
> To automatically bring the tables in your remote REST catalog into Snowflake, [create a catalog linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).
> With a catalog-linked database, you don’t have to create individual externally managed Iceberg tables to bring your remote tables into
> Snowflake.

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  CATALOG_TABLE_NAME = '<rest_catalog_table_name>'
  [ CATALOG_NAMESPACE = '<catalog_namespace>' ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

Where:

```sqlsyntax
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )
```

For more information, see [CREATE ICEBERG TABLE (Iceberg REST catalog)](create-iceberg-table-rest.md).

#### Iceberg REST in a catalog-linked database

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [
    --Column definition
    <col_name> <col_type> [ DEFAULT <col_default> ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ DEFAULT <col_default> ] [ ... ] ]
  ]
  [ PARTITION BY ( partitionExpression [ , partitionExpression , ... ] ) ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ BASE_LOCATION = '<path_to_directory_for_table_files>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
```

Where:

```sqlsyntax
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )
```

For more information, see [CREATE ICEBERG TABLE (Iceberg REST catalog)](create-iceberg-table-rest.md).

#### Delta files

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  BASE_LOCATION = '<relative_path_from_external_volume>'
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

For more information, see [CREATE ICEBERG TABLE (Delta files in object storage)](create-iceberg-table-delta.md).

#### Iceberg files in object storage

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  METADATA_FILE_PATH = '<metadata_file_path>'
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

For more information, see [CREATE ICEBERG TABLE (Iceberg files in object storage)](create-iceberg-table-iceberg-files.md).

---
title: CREATE ICEBERG TABLE (AWS Glue as the Iceberg catalog)
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-aws-glue.md
section: SQL Commands
---

# CREATE ICEBERG TABLE (AWS Glue as the Iceberg catalog)

> **Important:**
>
> To integrate with AWS Glue, we recommend that you instead use
> [AWS Glue Iceberg REST](https://docs.aws.amazon.com/glue/latest/dg/connect-glu-iceberg-rest.html),
> which supports additional Iceberg table features such as catalog-vended credentials.
>
> For instructions, see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](create-catalog-integration-rest.md) and [CREATE ICEBERG TABLE (Iceberg REST catalog)](create-iceberg-table-rest.md).

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) in the current/specified schema using
an Iceberg table that is registered in the AWS Glue Data Catalog.
This type of Iceberg table requires a [catalog integration](../../user-guide/tables-iceberg.md)
to connect Snowflake to AWS Glue.

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

> **Note:**
>
> Before creating a table, you must create the [external volume](create-external-volume.md) where the Iceberg metadata
> and data files are stored.
> For instructions, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
>
> You also need a catalog integration for the table.
> To learn more, see [Configure a catalog integration for AWS Glue](../../user-guide/tables-iceberg-configure-catalog-integration-glue.md).

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  CATALOG_TABLE_NAME = '<catalog_table_name>'
  [ CATALOG_NAMESPACE = '<catalog_namespace>' ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

## Required parameters

`table_name`
:   Specifies the identifier (name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CATALOG_TABLE_NAME = 'catalog_table_name'`
:   Specifies the table name as recognized by the AWS Glue Data Catalog. For an example of using
    `CATALOG_TABLE_NAME` when you create an Iceberg table,
    see Examples (in this topic).

    This parameter cannot be changed after you create the table.

## Optional parameters

`EXTERNAL_VOLUME = 'external_volume_name'`
:   Specifies the identifier (name) for the external volume where the Iceberg table stores its metadata files and data in Parquet
    format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

    If you don’t specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG = 'catalog_integration_name'`
:   Specifies the identifier (name) of the catalog integration for this table.
    You must specify a catalog integration that you have configured for AWS Glue. For information,
    see [Configure a catalog integration for AWS Glue](../../user-guide/tables-iceberg-configure-catalog-integration-glue.md).

    If not specified, the Iceberg table defaults to the catalog integration for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG_NAMESPACE = 'catalog_namespace'`
:   Optionally specifies the namespace (for example, `my_glue_database`)
    for the AWS Glue Data Catalog source. By specifying a namespace with the catalog integration and then at the table level, you can use a single catalog integration for AWS Glue to create Iceberg tables across different databases. If you don’t specify a namespace with the table, the table uses the default catalog namespace associated with the catalog integration

    If a default namespace isn’t specified with the catalog integration, you must specify a namespace for the AWS Glue Data Catalog to set a
    catalog namespace for the table.

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results.
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    If not specified, the Iceberg table defaults to the parameter value for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

    Default: `FALSE`

`AUTO_REFRESH = { TRUE | FALSE }`
:   Specifies whether Snowflake should automatically poll the external Iceberg catalog that is associated with the table for metadata updates.

    If no value is specified for the `REFRESH_INTERVAL_SECONDS` parameter on the catalog integration, Snowflake uses a default
    refresh interval of 30 seconds.

    For more information, see [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    Default: FALSE

    > > **Note:**
    > >
    > > Using AUTO_REFRESH with INFER_SCHEMA isn’t supported.

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ICEBERG TABLE | Schema |  |
| CREATE EXTERNAL VOLUME | Account | Required to create a new external volume. |
| USAGE | External Volume | Required to reference an existing external volume. |
| CREATE INTEGRATION | Account | Required to create a new catalog integration. |
| USAGE | Catalog integration | Required to reference an existing catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Considerations for running this command:

  + If you created your external volume or catalog integration using a double-quoted identifier,
    you must specify the identifier exactly as created (including the double quotes) in your CREATE ICEBERG TABLE statement.
    Failure to include the quotes might result in an `Object does not exist` error (or
    similar type of error).

    To view an example, see the Examples (in this topic) section.
* Considerations for creating tables:

  > + A schema cannot contain tables and/or views with the same name. When creating a table:
  >
  >   > - If a view with the same name already exists in the schema, an error is returned and the table is not created.
  >   > - If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the optional
  >   >   `OR REPLACE` keyword is included in the command.
  > + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  >   This means that any queries concurrent with the CREATE OR REPLACE ICEBERG TABLE operation use either the old or new table version.
  > + The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
  > + Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  >   ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column names.
  > + Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale. A stale
  >   stream is unreadable.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Create an Iceberg table with AWS Glue as the catalog

This example creates an Iceberg table that uses the AWS Glue Data Catalog. To override the default catalog namespace and set a
catalog namespace for the table, the statement uses the optional `CATALOG_NAMESPACE` parameter.

```sqlexample
CREATE ICEBERG TABLE glue_iceberg_table
  EXTERNAL_VOLUME='glue_catalog_volume'
  CATALOG='glue_catalog_integration'
  CATALOG_TABLE_NAME='my_glue_catalog_table'
  CATALOG_NAMESPACE='icebergcatalogdb2'
  AUTO_REFRESH = TRUE;
```

### Specify an external volume or catalog integration with a double-quoted identifier

This example creates an Iceberg table with an external volume and catalog integration
whose identifiers contain double quotes. Identifiers enclosed in double quotes are case-sensitive and often contain special characters.

The identifiers `"glue_volume_1"` and `"glue_catalog_integration_1"` are specified exactly as created (including the double quotes).
Failure to include the quotes might result in an `Object does not exist` error (or similar type of error).

To learn more, see [Double-quoted identifiers](../identifiers-syntax.md).

```sqlexample
CREATE OR REPLACE ICEBERG TABLE itable_with_quoted_catalog
  EXTERNAL_VOLUME = '"glue_volume_1"'
  CATALOG = '"glue_catalog_integration_1"'
  CATALOG_TABLE_NAME='my_glue_catalog_table'
  CATALOG_NAMESPACE='icebergcatalogdb2'
  AUTO_REFRESH = TRUE;
```

---
title: CREATE ICEBERG TABLE (Delta files in object storage)
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-delta.md
section: SQL Commands
---

# CREATE ICEBERG TABLE (Delta files in object storage)

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) in the current/specified schema
using Delta table files in object storage (external cloud storage).
This type of Iceberg table requires a [catalog integration](../../user-guide/tables-iceberg.md).

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

> **Note:**
>
> Before creating a table, you must create the [external volume](create-external-volume.md) where the Iceberg metadata
> and data files are stored.
> For instructions, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
>
> You also need a catalog integration for the table.
> For more information, see [Configure a catalog integration for files in object storage](../../user-guide/tables-iceberg-configure-catalog-integration-object-storage.md).

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  BASE_LOCATION = '<relative_path_from_external_volume>'
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

## Required parameters

`table_name`
:   Specifies the identifier (name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`BASE_LOCATION = 'relative_path_from_external_volume'`
:   Specifies a relative path from the table’s `EXTERNAL_VOLUME` location to a directory where Snowflake can access your Delta
    table files.

    The base location must point to a directory and cannot point to a single file.
    It must contain the Delta transaction log subfolder (for example, `my/base/location/_delta_log/`).

## Optional parameters

`EXTERNAL_VOLUME = 'external_volume_name'`
:   Specifies the identifier (name) for the external volume where the Iceberg table stores its metadata files and data in Parquet
    format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

    If you don’t specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG = 'catalog_integration_name'`
:   Specifies the identifier (name) of the catalog integration for this table.

    If not specified, the Iceberg table defaults to the catalog integration for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results.
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    If not specified, the Iceberg table defaults to the parameter value for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

    Default: `FALSE`

`AUTO_REFRESH = { TRUE | FALSE }`
:   Specifies whether Snowflake should automatically poll your external cloud storage for updates.

    If no value is specified for the `REFRESH_INTERVAL_SECONDS` parameter on the catalog integration, Snowflake uses a default
    refresh interval of 30 seconds.

    For more information, see [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    Default: FALSE

    > > **Note:**
    > >
    > > Using AUTO_REFRESH with INFER_SCHEMA isn’t supported.

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ICEBERG TABLE | Schema |  |
| CREATE EXTERNAL VOLUME | Account | Required to create a new external volume. |
| USAGE | External Volume | Required to reference an existing external volume. |
| CREATE INTEGRATION | Account | Required to create a new catalog integration. |
| USAGE | Catalog integration | Required to reference an existing catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Considerations for running this command:

  + If you created your external volume or catalog integration using a double-quoted identifier,
    you must specify the identifier exactly as created (including the double quotes) in your CREATE ICEBERG TABLE statement.
    Failure to include the quotes might result in an `Object does not exist` error (or
    similar type of error).
* Considerations for Iceberg tables created from Delta table files:

  + You can use [Time Travel](../../user-guide/data-time-travel.md) to query Iceberg tables created from Delta table files.
    The table versions correspond to the individual Delta log commit files.
  + Snowflake supports minReaderVersion 3 and can read all tables written by engines that use the latest version of Delta Lake,
    which is 4.0.0. Delta Lake version 4.0.0 includes support for deletion vectors and liquid clustering.
  + Snowflake streams aren’t supported for Iceberg tables created from Delta table files with partition columns.
    However, insert-only streams for tables created from Delta files *without* partition columns are supported.
  + Iceberg tables created from Delta files that were created before the [2024_04](../../release-notes/bcr-bundles/2025_04_bundle.md) release bundle are not supported in dynamic tables.
  + Snowflake doesn’t support creating Iceberg tables from Delta table definitions in the AWS Glue Data Catalog.
  + Parquet files (data files for Delta tables) that use any of the following features or data types aren’t supported:

    - Field IDs.
    - The INTERVAL data type.
    - The DECIMAL data type with precision higher than 38.
    - LIST or MAP types with one-level or two-level representation.
    - Unsigned integer types (INT(signed = false)).
    - The FLOAT16 data type.
  + You can use the Parquet physical type `int96` for TIMESTAMP, but Snowflake doesn’t support `int96` for TIMESTAMP_NTZ.
  + For more information about Delta data types and Iceberg tables, see [Delta data types](../../user-guide/tables-iceberg-data-types.md).
  + Snowflake processes a maximum of 1000 Delta commit files each time you refresh a table using CREATE/ALTER … REFRESH.
    If your table has over 1000 commit files, you can do additional manual refreshes.
    Each time, the refresh process continues from where the last one stopped.

    > **Note:**
    >
    > Snowflake uses Delta checkpoint files when creating an Iceberg table.
    > The 1,000 commit file limit only applies to commits after the latest checkpoint.
    >
    > When you refresh an existing table, Snowflake processes Delta commit files, but not checkpoint files. If table maintenance removes stale log and data files for the source
    > Delta table, you should refresh Delta-based
    > Iceberg tables in Snowflake more frequently than the retention period of Delta logs and data files.
  + The following Delta Lake features aren’t currently supported: Row Tracking, change data files, change metadata,
    DataChange, CDC, protocol evolution.
* Considerations for creating tables:

  > + A schema cannot contain tables and/or views with the same name. When creating a table:
  >
  >   > - If a view with the same name already exists in the schema, an error is returned and the table is not created.
  >   > - If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the optional
  >   >   `OR REPLACE` keyword is included in the command.
  > + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  >   This means that any queries concurrent with the CREATE OR REPLACE ICEBERG TABLE operation use either the old or new table version.
  > + The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
  > + Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  >   ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column names.
  > + Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale. A stale
  >   stream is unreadable.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example command creates an Iceberg table from Delta table files in object storage with
[automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

The example specifies an external volume associated with the cloud location of the Delta table files,
a [catalog integration configured for Delta](../../user-guide/tables-iceberg-configure-catalog-integration-object-storage.md),
and a value for the required `BASE_LOCATION` parameter.

```sqlexample
CREATE ICEBERG TABLE my_delta_iceberg_table
  CATALOG = delta_catalog_integration
  EXTERNAL_VOLUME = delta_external_volume
  BASE_LOCATION = 'relative/path/from/ext/vol/'
  AUTO_REFRESH = TRUE;
```

If the Delta table uses a partitioning scheme, Snowflake automatically interprets the scheme from the Delta log.

---
title: CREATE ICEBERG TABLE (Iceberg files in object storage)
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-iceberg-files.md
section: SQL Commands
---

# CREATE ICEBERG TABLE (Iceberg files in object storage)

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) in the current/specified schema
using Iceberg files in object storage (external cloud storage).
This type of Iceberg table requires a [catalog integration](../../user-guide/tables-iceberg.md).

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

> **Note:**
>
> Before creating a table, you must create the [external volume](create-external-volume.md) where the Iceberg metadata
> and data files are stored.
> For instructions, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
>
> You also need a catalog integration for the table.
> To learn more, see [Configure a catalog integration for files in object storage](../../user-guide/tables-iceberg-configure-catalog-integration-object-storage.md).

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  METADATA_FILE_PATH = '<metadata_file_path>'
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

## Required parameters

`table_name`
:   Specifies the identifier (name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`METADATA_FILE_PATH = 'metadata_file_path'`
:   Specifies the relative path of the Iceberg metadata file to use for column definitions.

    For example, if
    `s3://mybucket_us_east_1/metadata/v1.metadata.json` is the full path to your metadata file,
    and the external volume storage location is `s3://mybucket_us_east_1/`,
    specify `metadata/v1.metadata.json` as the value for `METADATA_FILE_PATH`.

    Before Snowflake version 7.34, this parameter was called `METADATA_FILE_NAME`.

> **Note:**
>
> * Don’t include a leading forward slash in the file path.
> * With Snowflake versions 7.34 and later, you do not specify a `BASE_LOCATION` to create a table from Iceberg files
>   in object storage.
>
>   Before version 7.34,
>   a parameter named `BASE_LOCATION` (also called `FILE_PATH` in previous versions) was required to create a table
>   from Iceberg files in object storage. The parameter specified a relative path from the `EXTERNAL_VOLUME`
>   location.
>
>   You can continue to execute a script or statement that uses the old syntax.
>   If you do, the following notes apply:
>
>   + The Parquet data files and Iceberg metadata files for the table must be within the `BASE_LOCATION`.
>   + To refresh the table, you must specify a path *relative* to the `BASE_LOCATION`. For example,
>     if the full path to your metadata file is `s3://mybucket_us_east_1/my_base_location/metadata/v1.metadata.json`,
>     specify `metadata/v1.metadata.json` as the `metadata-file-relative-path`.
>
>     For more information, see [ALTER ICEBERG TABLE … REFRESH](alter-iceberg-table-refresh.md).

## Optional parameters

`EXTERNAL_VOLUME = 'external_volume_name'`
:   Specifies the identifier (name) for the external volume where the Iceberg table stores its metadata files and data in Parquet
    format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

    If you don’t specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG = 'catalog_integration_name'`
:   Specifies the identifier (name) of the catalog integration for this table.

    If not specified, the Iceberg table defaults to the catalog integration for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results.
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    If not specified, the Iceberg table defaults to the parameter value for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

    Default: `FALSE`

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ICEBERG TABLE | Schema |  |
| CREATE EXTERNAL VOLUME | Account | Required to create a new external volume. |
| USAGE | External Volume | Required to reference an existing external volume. |
| CREATE INTEGRATION | Account | Required to create a new catalog integration. |
| USAGE | Catalog integration | Required to reference an existing catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Considerations for running this command:

  + If you created your external volume or catalog integration using a double-quoted identifier,
    you must specify the identifier exactly as created (including the double quotes) in your CREATE ICEBERG TABLE statement.
    Failure to include the quotes might result in an `Object does not exist` error (or
    similar type of error).

    To view an example, see the Examples (in this topic) section.
  + With Snowflake versions 7.34 and later, you do not specify a `BASE_LOCATION` to create a table from Iceberg files
    in object storage.

    Before version 7.34,
    a parameter named `BASE_LOCATION` (also called `FILE_PATH` in previous versions) was required to create a table
    from Iceberg files in object storage. The parameter specified a relative path from the `EXTERNAL_VOLUME`
    location.

    You can continue to execute a script or statement that uses the old syntax.
    If you do, the following notes apply:

    - The Parquet data files and Iceberg metadata files for the table must be within the `BASE_LOCATION`.
    - To refresh the table, you must specify a path *relative* to the `BASE_LOCATION`. For example,
      if the full path to your metadata file is `s3://mybucket_us_east_1/my_base_location/metadata/v1.metadata.json`,
      specify `metadata/v1.metadata.json` as the `metadata-file-relative-path`.

      For more information, see [ALTER ICEBERG TABLE … REFRESH](alter-iceberg-table-refresh.md).
* Considerations for creating tables:

  > + A schema cannot contain tables and/or views with the same name. When creating a table:
  >
  >   > - If a view with the same name already exists in the schema, an error is returned and the table is not created.
  >   > - If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the optional
  >   >   `OR REPLACE` keyword is included in the command.
  > + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  >   This means that any queries concurrent with the CREATE OR REPLACE ICEBERG TABLE operation use either the old or new table version.
  > + The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
  > + Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  >   ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column names.
  > + Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale. A stale
  >   stream is unreadable.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Create an Iceberg table from Iceberg metadata in object storage

This example creates an Iceberg table from Iceberg metadata files in object storage by
specifying a relative path (*without* a leading forward slash `/`) to the table metadata on the external volume.

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table
  EXTERNAL_VOLUME='my_external_volume'
  CATALOG='my_catalog_integration'
  METADATA_FILE_PATH='path/to/metadata/v1.metadata.json';
```

### Specify an external volume or catalog integration with a double-quoted identifier

This example creates an Iceberg table with an external volume and catalog integration
whose identifiers contain double quotes. Identifiers enclosed in double quotes are case-sensitive and often contain special characters.

The identifiers `"external_volume_1"` and `"catalog_integration_1"` are specified exactly as created (including the double quotes).
Failure to include the quotes might result in an `Object does not exist` error (or similar type of error).

To learn more, see [Double-quoted identifiers](../identifiers-syntax.md).

```sqlexample
CREATE OR REPLACE ICEBERG TABLE itable_with_quoted_catalog
  EXTERNAL_VOLUME = '"external_volume_1"'
  CATALOG = '"catalog_integration_1"'
  METADATA_FILE_PATH='path/to/metadata/v1.metadata.json';
```

---
title: CREATE ICEBERG TABLE (Iceberg REST catalog)
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-rest.md
section: SQL Commands
---

# CREATE ICEBERG TABLE (Iceberg REST catalog)

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) in the current/specified schema for an Iceberg REST catalog.

Use this command for the following scenarios:

* You want to use a remote Iceberg catalog that complies with the open source
  [Apache Iceberg REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml).
* You want to query a table in [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) or Apache Polaris™. For more information, see [Query a table in Snowflake Open Catalog using Snowflake](../../user-guide/tables-iceberg-open-catalog-query.md).
* You want to create an externally managed table in a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md).
  See CREATE ICEBERG TABLE (catalog-linked database).

> **Note:**
>
> Before creating a table, you must create the [external volume](create-external-volume.md) where the Iceberg metadata
> and data files are stored.
> For instructions, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
>
> You also need a catalog integration for the table.
> For more information, see [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md)
> or [Configure a catalog integration for Snowflake Open Catalog](../../user-guide/tables-iceberg-configure-catalog-integration-open-catalog.md).

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = '<catalog_integration_name>' ]
  CATALOG_TABLE_NAME = '<rest_catalog_table_name>'
  [ CATALOG_NAMESPACE = '<catalog_namespace>' ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

Where:

```sqlsyntax
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )
```

## Variant syntax

### CREATE ICEBERG TABLE (catalog-linked database)

```sqlsyntax
CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name>
  [
    --Column definition
    <col_name> <col_type> [ DEFAULT <col_default> ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ DEFAULT <col_default> ] [ ... ] ]
  ]
  [ PARTITION BY ( partitionExpression [ , partitionExpression , ... ] ) ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ AUTO_REFRESH = { TRUE | FALSE } ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ BASE_LOCATION = '<path_to_directory_for_table_files>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
```

Where:

```sqlsyntax
partitionExpression ::=
  <col_name> -- identity transform
  | BUCKET ( <num_buckets> , <col_name> )
  | TRUNCATE ( <width> , <col_name> )
  | YEAR ( <col_name> )
  | MONTH ( <col_name> )
  | DAY ( <col_name> )
  | HOUR ( <col_name> )
```

### CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT

> ```sqlsyntax
> CREATE [ OR REPLACE ] ICEBERG TABLE <table_name> [ ( <col_name> [ <col_type> ] , <col_name> [ <col_type> ] , ... ) ]
>   [ ... ]
>   AS SELECT <query>
> ```

You can apply a masking policy to a column in a CTAS statement. Specify the masking policy after the column data type. For example:

> ```sqlsyntax
> CREATE [ OR REPLACE ] ICEBERG TABLE <table_name> ( <col1> <data_type> [ WITH ] MASKING POLICY <policy_name> [ , ... ] )
>   [ ... ]
>   AS SELECT <query>
> ```

## Required parameters

`table_name`
:   Specifies the identifier (name) for the table in Snowflake; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    > **Note:**
    >
    > To retrieve a list of tables or namespaces in your remote catalog, you can use the following functions:
    >
    > * [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](../functions/system_list_iceberg_tables_from_catalog.md)
    > * [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](../functions/system_list_namespaces_from_catalog.md)

`CATALOG_TABLE_NAME = 'rest_catalog_table_name'`
:   Specifies the table name as recognized by your external catalog. This parameter can’t be changed after
    you create the table.

    > **Note:**
    >
    > Don’t specify a namespace with the table name (`mynamespace.mytable`). To specify a namespace for this table, and override the default
    > namespace set for the catalog integration, use the CATALOG_NAMESPACE parameter.

`col_name`
:   For creating a table in a [catalog-linked database (preview)](../../user-guide/tables-iceberg-catalog-linked-database.md).

    Specifies a column identifier (name). All the requirements for table identifiers also apply to column identifiers.

    For more information, see [Identifier requirements](../identifiers-syntax.md) and [Reserved & limited keywords](../reserved-keywords.md).

    > **Note:**
    >
    > In addition to the standard reserved keywords, the following keywords can’t be used as column identifiers because they are
    > reserved for ANSI-standard context functions:
    >
    > * `CURRENT_DATE`
    > * `CURRENT_ROLE`
    > * `CURRENT_TIME`
    > * `CURRENT_TIMESTAMP`
    > * `CURRENT_USER`
    >
    > For the list of reserved keywords, see [Reserved & limited keywords](../reserved-keywords.md).

`col_type`
:   For creating a table in a [catalog-linked database (preview)](../../user-guide/tables-iceberg-catalog-linked-database.md).

    Specifies the data type for the column.

    For information about the data types that can be specified for table columns, see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).

## Optional parameters

`col_name col_type DEFAULT col_default`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    For a table that conforms to Iceberg v3, specifies both the initial default and write default for the specified column. If the data type for the
    column is string, you must surround the default value with single quotes.

    > **Important:**
    >
    > When you specify a default value for a column, you must specify a static value; you can’t specify an expression or
    > function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default
    > and write default.

    Default values is an Iceberg v3 feature, so you can’t specify a default value for a table that conforms to Iceberg v2. For more
    information about using default values with Iceberg tables, see [Use default values with Iceberg tables](../../user-guide/tables-iceberg-manage.md).

    > **Note:**
    >
    > To change the write default for the column after you create the table, run [ALTER ICEBERG TABLE … ALTER COLUMN … SET WRITE DEFAULT](alter-iceberg-table.md).

`PARTITION BY = ( partitionExpression [ , partitionExpression , ... ] )`
:   Specifies one or more partition expressions.

`PATH_LAYOUT = { FLAT | HIERARCHICAL }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the path layout that Snowflake uses when writing Parquet data files to the table:

    * `FLAT`: Snowflake writes all Parquet data files under the `data/` directory for the table.
    * `HIERARCHICAL`: Snowflake writes partitioned data under the `data/` directory for the table by using a hierarchical
      path layout. With this layout, each partition column is represented
      as a directory level in the path. To define these partition
      columns, use the PARTITION BY parameter. This layout is also called “Hive-style” partitioning.

      If you specify PATH_LAYOUT = HIERARCHICAL without a PARTITION BY clause,
      Snowflake stores the Parquet data files by using a flat layout path. You can’t
      modify the path layout for an existing table, so you might set this
      parameter to HIERARCHICAL without specifying a PARTITION BY clause if you don’t want to use partitioning with
      hierarchical paths now but you might in the future.

    > **Note:**
    >
    > For externally managed tables that you create in a standard Snowflake database, Snowflake infers and honors the partitioning scheme
    > that is specified by the remote catalog.

    Default: `FLAT`

`MASKING POLICY = policy_name`
:   For creating a table in a [catalog-linked database (preview)](../../user-guide/tables-iceberg-catalog-linked-database.md).

    Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.
    The masking policy must belong to a standard Snowflake database (not the catalog-linked database).

`EXTERNAL_VOLUME = 'external_volume_name'`
:   Specifies the identifier (name) for the external volume where the Iceberg table stores its metadata files and data in Parquet
    format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

    If you don’t specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG = 'catalog_integration_name'`
:   Specifies the identifier (name) of the catalog integration for this table.

    If you don’t specify this parameter, the Iceberg table defaults to the catalog integration for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

`CATALOG_NAMESPACE = 'catalog_namespace'`
:   * Optionally specifies the namespace (for example, `my_database`) for the
      REST catalog source. By specifying a namespace with the
      catalog integration and then at the table level, you can use a single REST catalog integration to create Iceberg tables across different
      databases. If you don’t specify a namespace with the table, the table uses the default catalog namespace associated with the catalog
      integration.
    * If a default namespace isn’t specified with the catalog integration, you must specify the namespace for the REST catalog source to set
      a catalog namespace for the table.

    > **Note:**
    >
    > To retrieve a list of tables or namespaces in your remote catalog, you can use the following functions:
    >
    > * [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](../functions/system_list_iceberg_tables_from_catalog.md)
    > * [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](../functions/system_list_namespaces_from_catalog.md)

`TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }'`
:   Specifies a target Parquet file size for the table.

    * `'{ 16MB | 32MB | 64MB | 128MB }'` specifies a fixed target file size for the table.
    * `'AUTO'` works differently, depending on the table type:

      + Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics
        such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically
        adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake. Use this option to optimize table performance
        in Snowflake.
      + Externally managed tables: AUTO specifies that Snowflake should aggressively scale to the largest file size (128 MB).

    For more information, see [Set a target file size](../../user-guide/tables-iceberg-manage.md).

    Default: AUTO

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the table to
    prevent streams on the table from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results.
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    If not specified, the Iceberg table defaults to the parameter value for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.

    Default: `FALSE`

`AUTO_REFRESH = { TRUE | FALSE }`
:   Specifies whether Snowflake should automatically poll the external Iceberg catalog that is associated with the table for metadata updates.

    If no value is specified for the `REFRESH_INTERVAL_SECONDS` parameter on the catalog integration, Snowflake uses a default
    refresh interval of 30 seconds.

    For more information, see [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

    Default: FALSE

    > > **Note:**
    > >
    > > Using AUTO_REFRESH with INFER_SCHEMA isn’t supported.

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when a new table is created using any of the following
    CREATE TABLE variants:

    * CREATE OR REPLACE TABLE

    The parameter copies all privileges, except OWNERSHIP, from the existing table to the new table. The new table does not
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE TABLE statement
    owns the new table.

    If the parameter is not included in the CREATE ICEBERG TABLE statement, then the new table does not inherit any explicit access
    privileges granted on the original table, but does inherit any future grants defined for the object type in the schema.

    Note:

    * With [data sharing](../../guides-overview-sharing.md):

      + If the existing table was shared to another account, the replacement table is also shared.
      + If the existing table was shared with your account as a data consumer, and access was further granted to other roles in
        the account (using `GRANT IMPORTED PRIVILEGES` on the parent database), access is also granted to the replacement
        table.
    * The [SHOW GRANTS](show-grants.md) output for the replacement table lists the grantee for the copied privileges as the
      role that executed the CREATE ICEBERG TABLE statement, with the current timestamp when the statement was executed.
    * The operation to copy grants occurs atomically in the CREATE ICEBERG TABLE command (that is, within the same transaction).

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`BASE_LOCATION = 'path_to_directory_for_table_files'`
:   The path to a directory, which Snowflake uses to construct write paths for the table’s data and metadata files.

    If you use an `EXTERNAL_VOLUME`, this path must be included with the storage paths that are specified for the external volume
    and you have the option to specify a relative path. If you specify a relative path, it is relative to the `STORAGE_BASE_URL`
    for the external volume.
    If not specified, Snowflake constructs a write path
    by using attributes such as the value of the [BASE_LOCATION_PREFIX](../parameters.md) parameter and
    the table name.

    If you’re using vended credentials, you must also specify an absolute path.

    > **Note:**
    >
    > This directory can’t be changed after you create a table.

`ICEBERG_VERSION = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ specification that the table conforms to.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    If you don’t set this parameter, the Iceberg table defaults to the Iceberg version for the schema, database, or account. The schema
    takes precedence over the database, and the database takes precedence over the account.

    > * `2`: The table conforms with Iceberg version 2.
    > * `3`: The table conforms with Iceberg version 3.
    >
    > Default: `2`
    >
    > For more information about this parameter, see [ICEBERG_VERSION](../parameters.md).

`ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether the table uses merge-on-read behavior.

    If you don’t set this parameter, the Iceberg table defaults to the merge-on-read behavior that is specified for the schema, database,
    or account. The schema takes precedence over the database, and the database takes precedence over the account.

    Values:

    `TRUE`: The table uses merge-on-read behavior. Depending on whether the table conforms to v2 or v3 of the
    Apache Iceberg™ table specification, the behavior is as described in the following list:

    * If the table conforms with v2, use positional delete files.
    * If the table conforms with v3, use deletion vectors.

    `FALSE`: The table uses copy-on-write behavior.

    Default: `TRUE`

    For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Partition expression parameters (`partitionExpression`)

Snowflake supports all partition transforms in version 2 of the Apache Iceberg specification. For more information, see
[Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

For more information about partitioning Iceberg tables, see [Iceberg partitioning](../../user-guide/tables-iceberg-metadata.md).

`col_name`
:   Specifies the identifier (name) for the source column to partition.

    When used alone, without a transform such as YEAR, specifies an identity transform on the source column.
    For more information, see [identity](https://iceberg.apache.org/spec/#partition-transforms).

`BUCKET`
:   Specifies a bucket transform. For more information, see [Bucket Transform Details](https://iceberg.apache.org/spec/#bucket-transform-details).

    `num_buckets` is the number of buckets to group the data into.

`TRUNCATE`
:   Specifies a truncate transform, which partitions the data based on the truncated values of the specified source column.
    For more information, see [Truncate Transform Details](https://iceberg.apache.org/spec/#truncate-transform-details).

`YEAR`
:   Specifies a year transform, which extracts the year from a date or timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`MONTH`
:   Specifies a month transform.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`DAY`
:   Specifies a day transform, which extracts the day from a date or timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`HOUR`
:   Specifies an hour transform, which extracts the hour from a timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ICEBERG TABLE | Schema |  |
| CREATE EXTERNAL VOLUME | Account | Required to create a new external volume. |
| USAGE | External Volume | Required to reference an existing external volume. |
| CREATE INTEGRATION | Account | Required to create a new catalog integration. |
| USAGE | Catalog integration | Required to reference an existing catalog integration. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If you created your external volume or catalog integration using a double-quoted identifier,
  you must specify the identifier exactly as created (including the double quotes) in your CREATE ICEBERG TABLE statement.
  Failure to include the quotes might result in an `Object does not exist` error or
  a similar type of error.
* The OR REPLACE option performs a non-atomic operation, which in this case is a DROP operation followed by CREATE,
  in your external Iceberg catalog.
* For creating an [Iceberg table with write support](../../user-guide/tables-iceberg-externally-managed-writes.md):

  + If you use a standard Snowflake database, you must first create an Iceberg table in your remote catalog. For example, you might use Spark to write
    an Iceberg table to Open Catalog. Don’t specify column definitions in your CREATE ICEBERG TABLE statement.
  + If you use a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md), you must specify column definitions when you create the table.
    Alternatively, you can write to Iceberg tables that Snowflake automatically discovers in your remote catalog.
* The TARGET_FILE_SIZE property is only supported for tables with [write support (preview)](../../user-guide/tables-iceberg-externally-managed-writes.md).
* Considerations for creating tables:

  > + A schema cannot contain tables and/or views with the same name. When creating a table:
  >
  >   > - If a view with the same name already exists in the schema, an error is returned and the table is not created.
  >   > - If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the optional
  >   >   `OR REPLACE` keyword is included in the command.
  > + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  >   This means that any queries concurrent with the CREATE OR REPLACE ICEBERG TABLE operation use either the old or new table version.
  > + The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
  > + Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  >   ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column names.
  > + Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale. A stale
  >   stream is unreadable.
* Using variant syntax:

  + CREATE ICEBERG TABLE … LIKE:

    - For [partitioned Iceberg tables](../../user-guide/tables-iceberg-metadata.md), the partitioning of the source table is ignored. To
      override this behavior, specify the PARTITION BY clause with the command.
  + CREATE ICEBERG TABLE … CLONE:

    - For [partitioned Iceberg tables](../../user-guide/tables-iceberg-metadata.md), the cloned table retains the partitioning information of
      the source table.
  + CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT:

    - Currently not supported if you use AWS Glue as your remote catalog.

      Alternatively, you can use the CREATE ICEBERG TABLE (Iceberg REST catalog) syntax to create an empty Iceberg table and then use
      an [INSERT INTO … SELECT](insert.md) statement to insert data into the empty table. However, this alternative
      uses two separate transactions, so it doesn’t guarantee atomicity.
* Using default values:

  + You can’t use expressions or functions, such as CURRENT_TIMESTAMP(), for default values on v3 Iceberg tables. Only constant values are
    permitted in the Apache Iceberg v3 table specification.

    - For v2 Iceberg tables, you can use expressions such as CURRENT_TIMESTAMP() with Snowflake. However, this property isn’t persisted into
      Iceberg metadata because the default values specification was introduced in version 3. Columns in v2 Iceberg tables with default values as
      expressions are only used with Snowflake, but the table remains interoperable with other engines and compliant with the
      version 2 specification.
  + Using default values with CREATE ICEBERG TABLE (catalog-linked database) … *is* supported.
  + Using default values with CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT *isn’t* supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Create an Iceberg table that uses a remote Iceberg REST catalog

```sqlexample
CREATE OR REPLACE ICEBERG TABLE my_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'my_rest_catalog_integration'
  CATALOG_TABLE_NAME = 'my_remote_table'
  AUTO_REFRESH = TRUE;
```

### Create an Iceberg table to query a table in Snowflake Open Catalog

This example creates an Iceberg table that you can use to
[Query a table in Snowflake Open Catalog using Snowflake](../../user-guide/tables-iceberg-open-catalog-query.md).

```sqlexample
CREATE ICEBERG TABLE open_catalog_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'open_catalog_int'
  CATALOG_TABLE_NAME = 'my_open_catalog_table'
  AUTO_REFRESH = TRUE;
```

### Create an Iceberg table in a catalog-linked database

The following example creates a writable Iceberg table in a
[catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md)
with column definitions:

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA 'my_namespace';

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
  first_name string,
  last_name string,
  amount int,
  create_date date
);
```

### Create a partitioned table in a catalog-linked database

The following example creates an [externally managed Iceberg table](../../user-guide/tables-iceberg-externally-managed-writes.md)
by using the value of a timestamp column named `start_date` to
partition the table by day:

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA 'my_namespace';

CREATE OR REPLACE ICEBERG TABLE iceberg_partitioned_date_time (start_date timestamp)
  PARTITION BY (DAY(start_date));
```

You can insert data into the table by using supported table-loading features. For example, use an INSERT INTO statement to
insert the following data into the empty `iceberg_partitioned_date_time` table created previously:

```sqlexample
INSERT INTO iceberg_partitioned_date_time (start_date)
  VALUES
    (to_timestamp_ntz('2023-01-02 00:00:00')),
    (to_timestamp_ntz('2023-02-03 00:00:00')),
    (to_timestamp_ntz('2023-01-02 01:00:00')),
    (to_timestamp_ntz('2023-02-03 02:00:00'));
```

For more information, see [Iceberg partitioning](../../user-guide/tables-iceberg-metadata.md).

### Create an externally managed Iceberg v3 table

The following example creates an Apache Iceberg™ table that uses a remote Iceberg REST catalog and conforms to v3 of the Apache Iceberg™ specification:

> **Note:**
>
> You don’t need to specify `ICEBERG_VERSION = 3` with the command because the format version is already defined in the
> external catalog’s metadata, so Snowflake retrieves this version from the metadata.

```sqlexample
CREATE ICEBERG TABLE my_v3_iceberg_table
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'my_rest_catalog_integration'
  CATALOG_TABLE_NAME = 'my_remote_table'
  AUTO_REFRESH = TRUE;
```

### Create an Iceberg v3 table in a catalog-linked database

The following example creates a writable Iceberg table in a
[catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md)
with column definitions and conforms to v3 of the Apache Iceberg™ specification:

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA 'my_namespace';

CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
  first_name string,
  last_name string,
  amount int,
  create_date date
)
  ICEBERG_VERSION = 3;
```

### Create a partitioned table in a catalog-linked database with hierarchical path layout

The following example creates an [externally managed Iceberg table](../../user-guide/tables-iceberg-externally-managed-writes.md)
by using the value of a timestamp column named `start_date` to
partition the table by day. Because PATH_LAYOUT = HIERARCHICAL, Snowflake writes data to the partitioned Iceberg table by using a hierarchical
path layout for files where partitioning information is included in the file paths:

```sqlexample
USE DATABASE my_catalog_linked_db;

USE SCHEMA 'my_namespace';

CREATE OR REPLACE ICEBERG TABLE iceberg_partitioned_date_time (start_date timestamp)
  PARTITION BY (DAY(start_date))
  PATH_LAYOUT = HIERARCHICAL;
```

You can insert data into the table by using supported table-loading features. For example, use an INSERT INTO statement to
insert the following data into the empty `iceberg_partitioned_date_time` table created previously:

```sqlexample
INSERT INTO iceberg_partitioned_date_time (start_date)
  VALUES
    (to_timestamp_ntz('2023-01-02 00:00:00')),
    (to_timestamp_ntz('2023-02-03 00:00:00')),
    (to_timestamp_ntz('2023-01-02 01:00:00')),
    (to_timestamp_ntz('2023-02-03 02:00:00'));
```

For more information, see [Partitioning with hierarchical paths](../../user-guide/tables-iceberg-metadata.md).

---
title: CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)
source: https://docs.snowflake.com/en/sql-reference/sql/create-iceberg-table-snowflake.md
section: SQL Commands
---

# CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)

Creates or replaces an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) that uses
[Snowflake as the Iceberg catalog](../../user-guide/tables-iceberg.md)
in the current/specified schema.

This command supports the following variants:

* CREATE ICEBERG TABLE … AS SELECT (creates a populated table; also referred to as CTAS)
* CREATE ICEBERG TABLE … LIKE (creates an empty copy of an existing table)

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

> **Note:**
>
> To store Iceberg data and metadata in **your** cloud storage, create an [external volume](create-external-volume.md) and reference it from the table.
> For instructions, see [Configure an external volume](../../user-guide/tables-iceberg-configure-external-volume.md).
>
> To use **Snowflake Storage** instead, set `EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'` (or rely on defaults when the catalog is Snowflake). You don’t need to create a separate external volume object in this case.
> For more information, see [Snowflake storage for Apache Iceberg™ tables](../../user-guide/tables-iceberg-internal-storage.md).

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE [ IF NOT EXISTS ] <table_name> (
    -- Column definition
    <col_name> <col_type> [ DEFAULT <col_default> ]
      [ inlineConstraint ]
      [ NOT NULL ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] PROJECTION POLICY <policy_name> ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ DEFAULT <col_default> ] [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )
  [ PARTITION BY ( partitionExpression [, partitionExpression , ...] ) ]
  [ PATH_LAYOUT = { FLAT | HIERARCHICAL } ]
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ EXTERNAL_VOLUME = '<external_volume_name>' ]
  [ CATALOG = 'SNOWFLAKE' ]
  [ BASE_LOCATION = '<directory_for_table_files>' ]
  [ TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }' ]
  [ CATALOG_SYNC = '<open_catalog_integration_name>']
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ ICEBERG_VERSION = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
```

Where:

> ```sqlsyntax
> inlineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE
>     | PRIMARY KEY
>     | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
> ```
>
> For additional inline constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> outoflineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
>       REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
> ```
>
> > **Note:**
> >
> > * Snowflake represents columns defined as PRIMARY KEY as identifier fields in the Iceberg metadata. The IDs for these columns are populated
> >   in the metadata as [identifier field IDs](https://iceberg.apache.org/spec/#identifier-field-ids).
> > * Snowflake doesn’t enforce NOT NULL and UNIQUE constraints on PRIMARY KEY columns for Iceberg tables.
>
> For additional out-of-line constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> partitionExpression ::=
>   <col_name> -- identity transform
>   | BUCKET ( <num_buckets> , <col_name> )
>   | TRUNCATE ( <width> , <col_name> )
>   | YEAR ( <col_name> )
>   | MONTH ( <col_name> )
>   | DAY ( <col_name> )
>   | HOUR ( <col_name> )
> ```

## Variant syntax

### CREATE ICEBERG TABLE … AS SELECT (also referred to as CTAS)

Creates a new table populated with the data returned by a query. Place the AS SELECT clause at the end of the statement.

> ```sqlsyntax
> CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> [ ( <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , <col_name> [ <col_type> ] [ DEFAULT <col_default> ] , ... ) ]
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ EXTERNAL_VOLUME = '<external_volume_name>' ]
>   [ CATALOG = 'SNOWFLAKE' ]
>   [ BASE_LOCATION = '<relative_path_from_external_volume>' ]
>   [ COPY GRANTS ]
>   [ ICEBERG_VERSION = <integer> ]
>   [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
>   [ ... ]
>   AS SELECT <query>
> ```

A masking policy can be applied to a column in a CTAS statement. Specify the masking policy after the column data type. Similarly, a
row access policy can be applied to the table. For example:

> ```sqlsyntax
> CREATE ICEBERG TABLE <table_name> ( <col1> <data_type> [ DEFAULT <col_default> ] [ WITH ] MASKING POLICY <policy_name> [ , ... ] )
>   [ EXTERNAL_VOLUME = '<external_volume_name>' ]
>   [ CATALOG = 'SNOWFLAKE' ]
>   [ BASE_LOCATION = '<directory_for_table_files>' ]
>   [ ICEBERG_VERSION = <integer> ]
>   [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
>   [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col1> [ , ... ] )
>   [ ... ]
>   AS SELECT <query>
> ```

> **Note:**
>
> In a CTAS, the COPY GRANTS parameter is valid only when combined with the OR REPLACE clause. COPY GRANTS copies
> privileges from the table being replaced with CREATE OR REPLACE (if it already exists), not from the source
> table(s) being queried in the SELECT statement. CTAS with COPY GRANTS lets you overwrite a table with a new
> set of data while keeping existing grants on that table.
>
> For more information about the COPY GRANTS parameter, see COPY GRANTS in this document.

For more information about this variant syntax, see the usage notes.

### CREATE ICEBERG TABLE … LIKE

Creates a new table with the same column definitions as an existing table, but without copying data from the existing table. Column
names, types, defaults, and constraints are copied to the new table:

> ```sqlsyntax
> CREATE [ OR REPLACE ] [ TRANSIENT ] ICEBERG TABLE <table_name> LIKE <source_table>
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ COPY GRANTS ]
>   [ ... ]
> ```

For more information about the COPY GRANTS parameter, see COPY GRANTS in this document.

> > **Note:**
> >
> > CREATE TABLE … LIKE isn’t supported for tables with an auto-increment sequence accessed through a data share.

For more information about this variant syntax, see the usage notes.

### CREATE ICEBERG TABLE … CLONE

Creates a new Iceberg table with the same column definitions and containing all the existing data from the source table, without actually
copying the data. You can also use this variant to clone a table at a specific time or point in the past (using
[Time Travel](../../user-guide/data-time-travel.md)):

> ```sqlsyntax
> CREATE [ OR REPLACE ] ICEBERG TABLE [ IF NOT EXISTS ] <name>
>   CLONE <source_iceberg_table>
>     [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
>     [COPY GRANTS]
>     ...
> ```

> **Note:**
>
> If the statement replaces an existing Iceberg table of the same name, Snowflake copies the grants from the table
> being replaced. If there is no existing table of that name, Snowflake copies the grants from the source table
> being cloned.

For more information about the COPY GRANTS parameter, see COPY GRANTS in this document.

For more information about cloning, see [CREATE <object> … CLONE](create-clone.md) and [Cloning and Apache Iceberg™ tables](../../user-guide/object-clone.md).

## Required parameters

`table_name`
:   Specifies the identifier (name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`col_name`
:   Specifies the column identifier (name). All the requirements for table identifiers also apply to column identifiers.

    For more information, see [Identifier requirements](../identifiers-syntax.md) and [Reserved & limited keywords](../reserved-keywords.md).

    > **Note:**
    >
    > In addition to the standard reserved keywords, the following keywords cannot be used as column identifiers because they are
    > reserved for ANSI-standard context functions:
    >
    > * `CURRENT_DATE`
    > * `CURRENT_ROLE`
    > * `CURRENT_TIME`
    > * `CURRENT_TIMESTAMP`
    > * `CURRENT_USER`
    >
    > For the list of reserved keywords, see [Reserved & limited keywords](../reserved-keywords.md).

`col_type`
:   Specifies the data type for the column.

    For information about the data types that can be specified for table columns, see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).

    > **Note:**
    >
    > You can’t use `float` or `double` as primary keys (in accordance with the
    > [Apache Iceberg spec](https://iceberg.apache.org/spec/#identifier-field-ids)).

## Optional parameters

`TRANSIENT`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Creates a transient Iceberg table. Transient tables don’t have a [Fail-safe](../../user-guide/data-failsafe.md) period, so they don’t incur
    Fail-safe storage costs.

    For Iceberg tables that use Snowflake-provided storage (`EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'`), the TRANSIENT keyword determines whether
    the table data is protected by Fail-safe. For more information, see [Snowflake storage for Apache Iceberg™ tables](../../user-guide/tables-iceberg-internal-storage.md).

    > **Note:**
    >
    > Transient Iceberg tables are only supported with Snowflake-provided storage (`EXTERNAL_VOLUME = 'SNOWFLAKE_MANAGED'`). You cannot
    > create a transient Iceberg table with any other external volume.

`col_name col_type DEFAULT col_default`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    For a table that conforms to Iceberg v3, specifies both the initial default and write default for the specified column. If the data type for the
    column is string, you must surround the default value with single quotes.

    > **Important:**
    >
    > When you specify a default value for a column, you must specify a static value; you can’t specify an expression or
    > function for the value. This requirement is in accordance with the Iceberg v3 specification and applies to both the initial default
    > and write default.

    Default values is an Iceberg v3 feature, so you can’t specify a default value for a table that conforms to Iceberg v2. For more
    information about using default values with Iceberg tables, see [Use default values with Iceberg tables](../../user-guide/tables-iceberg-manage.md).

    > **Note:**
    >
    > To change the write default for the column after you create the table, run [ALTER ICEBERG TABLE … ALTER COLUMN … SET WRITE DEFAULT](alter-iceberg-table.md).

`BASE_LOCATION = 'directory_for_table_files'`
:   The path to a directory, which Snowflake uses to construct write paths for the table’s data and metadata files.
    Specify a relative path from the table’s `EXTERNAL_VOLUME` location.

    If not specified, Snowflake constructs a write path
    using attributes such as the value of the [BASE_LOCATION_PREFIX](../parameters.md) parameter and
    the table name.

    For more information, see [Data and metadata directories](../../user-guide/tables-iceberg-storage.md).

    This directory can’t be changed after you create a table.

`TARGET_FILE_SIZE = '{ AUTO | 16MB | 32MB | 64MB | 128MB }'`
:   Specifies a target Parquet file size for the table.

    * `'{ 16MB | 32MB | 64MB | 128MB }'` specifies a fixed target file size for the table.
    * `'AUTO'` works differently, depending on the table type:

      + Snowflake-managed tables: AUTO specifies that Snowflake should choose the file size for the table based on table characteristics
        such as size, DML patterns, ingestion workload, and clustering configuration. Snowflake automatically
        adjusts the file size, starting at 16 MB, for better read and write performance in Snowflake. Use this option to optimize table performance
        in Snowflake.
      + Externally managed tables: AUTO specifies that Snowflake should aggressively scale to the largest file size (128 MB).

    For more information, see [Set a target file size](../../user-guide/tables-iceberg-manage.md).

    Default: AUTO

`CONSTRAINT ...`
:   Defines an inline or out-of-line constraint for the specified column(s) in the table.

    For syntax information, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md). For more information about constraints, see [Constraints](../constraints.md).

`MASKING POLICY = policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

`PROJECTION POLICY policy_name`
:   Specifies the [projection policy](../../user-guide/projection-policies.md) to set on a column.

`COMMENT 'string_literal'`
:   Specifies a comment for the column.

    (Note that comments can be specified at the column level or the table level. The syntax for each is slightly different.)

`USING ( col_name , cond_col_1 ... )`
:   Specifies the arguments to pass into the SQL expression for the conditional masking policy.

    The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
    column to which the masking policy is set.

    The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query result
    when a query selects from the first column.

    If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
    [masking policy](../../user-guide/security-column-intro.md).

`PARTITION BY = ( partitionExpression [ , partitionExpression , ... ] )`
:   Specifies one or more partition expressions.

`PATH_LAYOUT = { FLAT | HIERARCHICAL }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the path layout that Snowflake uses when writing Parquet data files to the table:

    * `FLAT`: Snowflake writes all Parquet data files under the `data/` directory for the table.
    * `HIERARCHICAL`: Snowflake writes partitioned data under the `data/` directory for the table by using a hierarchical
      path layout. With this layout, each partition column is represented
      as a directory level in the path. To define these partition
      columns, use the PARTITION BY parameter. This layout is also called “Hive-style” partitioning.

      If you specify PATH_LAYOUT = HIERARCHICAL without a PARTITION BY clause,
      Snowflake stores the Parquet data files by using a flat layout path. You can’t
      modify the path layout for an existing table, so you might set this
      parameter to HIERARCHICAL without specifying a PARTITION BY clause if you don’t want to use partitioning with
      hierarchical paths now but you might in the future.

    > **Note:**
    >
    > For externally managed tables that you create in a standard Snowflake database, Snowflake infers and honors the partitioning scheme
    > that is specified by the remote catalog.

    Default: `FLAT`

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies one or more columns or column expressions in the table as the clustering key. For more information, see
    [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

    When using variant syntax (LIKE, AS SELECT), see the variant syntax usage notes.

    Default: No value (no clustering key is defined for the table)

    > **Important:**
    >
    > Clustering keys are not intended or recommended for all tables; they typically benefit very large (that is, multi-terabyte)
    > tables.
    >
    > Before you specify a clustering key for a table, you should understand micro-partitions.
    > For more information, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

`EXTERNAL_VOLUME = 'external_volume_name'`
:   Specifies where the Iceberg table stores its metadata files and data in Parquet format. Iceberg metadata and manifest files store the table schema, partitions, snapshots, and other metadata.

    Use one of the following:

    * The identifier for an [external volume](create-external-volume.md) that you created in your account. Iceberg data and metadata are stored in your cloud storage according to that volume’s storage locations.
    * The reserved value `SNOWFLAKE_MANAGED` to use Snowflake-provided storage. `SNOWFLAKE_MANAGED` is not a user-created external volume object; you don’t run `CREATE EXTERNAL VOLUME` for it. For more information, see [Snowflake storage for Apache Iceberg™ tables](../../user-guide/tables-iceberg-internal-storage.md).

    If you don’t specify this parameter, the Iceberg table defaults to the external volume for the schema, database, or account.
    The schema takes precedence over the database, and the database takes precedence over the account.
    When the effective catalog is Snowflake (`CATALOG = 'SNOWFLAKE'`), the default external volume is `SNOWFLAKE_MANAGED` unless a different default is set at the schema, database, or account level.

`CATALOG = 'SNOWFLAKE'`
:   Specifies Snowflake as the Iceberg catalog. Snowflake handles all life-cycle maintenance, such as compaction, for the table.

`CATALOG_SYNC = 'open_catalog_integration_name'`
:   Optionally specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview). If specified, Snowflake syncs
    the table with an external catalog in your Snowflake Open Catalog account. For more information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

    For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

`ICEBERG_VERSION = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ specification that the table conforms to.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    If you don’t set this parameter, the Iceberg table defaults to the Iceberg version for the schema, database, or account. The schema
    takes precedence over the database, and the database takes precedence over the account.

    > * `2`: The table conforms with Iceberg version 2.
    > * `3`: The table conforms with Iceberg version 3.
    >
    > Default: `2`
    >
    > For more information about this parameter, see [ICEBERG_VERSION](../parameters.md).

`ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether the table uses merge-on-read behavior.

    If you don’t set this parameter, the Iceberg table defaults to the merge-on-read behavior that is specified for the schema, database,
    or account. The schema takes precedence over the database, and the database takes precedence over the account.

    Values:

    `TRUE`: The table uses merge-on-read behavior. Depending on whether the table conforms to v2 or v3 of the
    Apache Iceberg™ table specification, the behavior is as described in the following list:

    * If the table conforms with v2, use positional delete files.
    * If the table conforms with v3, use deletion vectors.

    `FALSE`: The table uses copy-on-write behavior.

    Default: `TRUE`

    For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md).

`STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
:   Specifies the storage serialization policy for the table.
    If not specified at table creation, the table inherits the value set at the schema, database, or account level. If the value isn’t
    specified at any level, the table uses the default value.

    You can’t change the value of this parameter after table creation.

    * `COMPATIBLE`: Snowflake performs encoding and compression that ensures interoperability with third-party compute engines.
    * `OPTIMIZED`: Snowflake performs encoding and compression that ensures the best table performance within Snowflake.

    Default: `OPTIMIZED`

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the retention period for a Snowflake-managed table so that Time Travel actions (SELECT, CLONE, UNDROP) can be performed on historical
    data in the table. For more information, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition: `0` to `90` for permanent tables

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the schema, database, or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the table.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the table to
    prevent streams on the table from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`CHANGE_TRACKING = { TRUE | FALSE }`
:   Specifies whether to enable change tracking on the table.

    * `TRUE` enables change tracking on the table. This setting adds a pair of hidden columns to the source table and begins
      storing change tracking metadata in the columns. These columns consume a small amount of storage.

      The change tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for
      [SELECT](select.md) statements, or by creating and querying one or more streams on the table.
    * `FALSE` does not enable change tracking on the table.

    Default: FALSE

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when a new table is created using any of the following
    CREATE TABLE variants:

    > * CREATE OR REPLACE TABLE
    > * CREATE TABLE … LIKE
    > * CREATE TABLE … CLONE

    The parameter copies all privileges, except OWNERSHIP, from the existing table to the new table. The new table does not
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE TABLE statement
    owns the new table.

    If the parameter is not included in the CREATE ICEBERG TABLE statement, then the new table does not inherit any explicit access
    privileges granted on the original table, but does inherit any future grants defined for the object type in the schema.

    Note:

    > * With [data sharing](../../guides-overview-sharing.md):
    >
    >   > + If the existing table was shared to another account, the replacement table is also shared.
    >   > + If the existing table was shared with your account as a data consumer, and access was further granted to other roles in
    >   >   the account (using `GRANT IMPORTED PRIVILEGES` on the parent database), access is also granted to the replacement
    >   >   table.
    > * The [SHOW GRANTS](show-grants.md) output for the replacement table lists the grantee for the copied privileges as the
    >   role that executed the CREATE ICEBERG TABLE statement, with the current timestamp when the statement was executed.
    > * The operation to copy grants occurs atomically in the CREATE ICEBERG TABLE command (that is, within the same transaction).

`ERROR_LOGGING = { TRUE | FALSE }`
:   Specifies whether to turn on DML error logging for the table.

    * `TRUE` turns on DML error logging for the table.
    * `FALSE` turns off DML error logging for the table.

    For more information, see [DML error logging](../../user-guide/data-load-overview.md).

    > **Note:**
    >
    > If the [OPT_OUT_ERROR_LOGGING](../parameters.md) parameter is set to `TRUE` for a session,
    > DML error logging isn’t turned on, regardless of whether it is turned on for specific tables.

`COMMENT = 'string_literal'`
:   Specifies a comment. You can specify a comment at the column level or the table level.
    The syntax for each is slightly different.

    Default: No value

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a table.

`AGGREGATION POLICY policy_name`
:   Specifies the [aggregation policy](../../user-guide/aggregation-policies.md) to set on a table.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
:   Specifies whether Snowflake should enable data compaction on the table.

    * `TRUE`: Snowflake performs data compaction on the table.
    * `FALSE`: Snowflake doesn’t perform data compaction on the table.

    Default: `TRUE`

    For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

`ICEBERG_VERSION = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ specification that the table conforms to.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    If you don’t set this parameter, the Iceberg table defaults to the Iceberg version for the schema, database, or account. The schema
    takes precedence over the database, and the database takes precedence over the account.

    > * `2`: The table conforms with Iceberg version 2.
    > * `3`: The table conforms with Iceberg version 3.
    >
    > Default: `2`
    >
    > For more information about this parameter, see [ICEBERG_VERSION](../parameters.md).

`ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

    Values:
    :   `TRUE`: New tables use merge-on-read behavior.

        `FALSE`: New tables use copy-on-write behavior.

    Default:
    :   `TRUE`

    For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
    and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

## Partition expression parameters (`partitionExpression`)

Snowflake supports all partition transforms in version 2 of the Apache Iceberg specification. For more information, see
[Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms) in the Apache Iceberg specification.

For more information about partitioning Iceberg tables, see [Iceberg partitioning](../../user-guide/tables-iceberg-metadata.md).

`col_name`
:   Specifies the identifier (name) for the source column to partition.

    When used alone, without a transform such as YEAR, specifies an identity transform on the source column.
    For more information, see [identity](https://iceberg.apache.org/spec/#partition-transforms).

`BUCKET`
:   Specifies a bucket transform. For more information, see [Bucket Transform Details](https://iceberg.apache.org/spec/#bucket-transform-details).

    `num_buckets` is the number of buckets to group the data into.

`TRUNCATE`
:   Specifies a truncate transform, which partitions the data based on the truncated values of the specified source column.
    For more information, see [Truncate Transform Details](https://iceberg.apache.org/spec/#truncate-transform-details).

`YEAR`
:   Specifies a year transform, which extracts the year from a date or timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`MONTH`
:   Specifies a month transform.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`DAY`
:   Specifies a day transform, which extracts the day from a date or timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

`HOUR`
:   Specifies an hour transform, which extracts the hour from a timestamp source-column value.
    For more information, see [Partition Transforms](https://iceberg.apache.org/spec/#partition-transforms).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ICEBERG TABLE | Schema |  |
| CREATE EXTERNAL VOLUME | Account | Required to create a new external volume. |
| USAGE | External Volume | Required to reference an existing external volume. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Considerations for running this command:

  + Cross-cloud and cross-region Iceberg tables are not currently supported when you use Snowflake as the Iceberg catalog.
    If CREATE ICEBERG TABLE returns an error message like `"External volume <volume_name> must have a STORAGE_LOCATION
    defined in the local region ..."`, make sure that your external volume uses an active storage location
    in the same region as your Snowflake account.
  + If you created your external volume using a double-quoted identifier,
    you must specify the identifier exactly as created (including the double quotes) in your CREATE ICEBERG TABLE statement.
    Failure to include the quotes might result in an `Object does not exist` error (or
    similar type of error).

    To view an example, see the Examples (in this topic) section.
  + To create an [Iceberg table](../../user-guide/tables-iceberg.md) with the USING TEMPLATE clause (and column definitions derived from
    INFER_SCHEMA output), you must specify `KIND => 'ICEBERG'` for the [INFER_SCHEMA](../functions/infer_schema.md) function.
* Considerations for creating tables:

  > + A schema cannot contain tables and/or views with the same name. When creating a table:
  >
  >   > - If a view with the same name already exists in the schema, an error is returned and the table is not created.
  >   > - If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the optional
  >   >   `OR REPLACE` keyword is included in the command.
  > + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >
  >   This means that any queries concurrent with the CREATE OR REPLACE ICEBERG TABLE operation use either the old or new table version.
  > + The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
  > + Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  >   ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column names.
  > + Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale. A stale
  >   stream is unreadable.

* Using variant syntax:

  + CREATE ICEBERG TABLE … LIKE:

    > - If you don’t specify a clustering key, the table inherits the clustering key of the source table (if one exists).
    > - By default, [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) is not suspended for the new table even if Automatic Clustering is
    >   suspended for the source table.
    > - For [partitioned Iceberg tables](../../user-guide/tables-iceberg-metadata.md), the partitioning of the source table is ignored. To override this behavior, specify the PARTITION BY clause with the command.
  + CREATE ICEBERG TABLE … AS SELECT (CTAS):

    When clustering keys are specified in a CTAS statement:

    - Column definitions are required and must be explicitly specified in the statement.
    - By default, [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) is enabled for the new table even if Automatic Clustering is
      suspended for the source table.
    - The data is clustered when the new table is created. A clustered table generates a query plan
      that includes a sort operation and takes longer to create than an equivalent table that is not clustered.

      Alternatively, you can create a table with rows in sorted order by using an ORDER BY clause in the CTAS query.
  + CREATE ICEBERG TABLE … CLONE:

    - For [partitioned Iceberg tables](../../user-guide/tables-iceberg-metadata.md), the cloned table retains the partitioning information of
      the source table.
* Using default values:

  + You can’t use expressions or functions, such as CURRENT_TIMESTAMP(), for default values on v3 Iceberg tables. Only constant values are
    permitted in the Apache Iceberg v3 table specification.

    - For v2 Iceberg tables, you can use expressions such as CURRENT_TIMESTAMP() with Snowflake. However, this property isn’t persisted into
      Iceberg metadata because the default values specification was introduced in version 3. Columns in v2 Iceberg tables with default values as
      expressions are only used with Snowflake, but the table remains interoperable with other engines and compliant with the
      version 2 specification.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* If you’re creating a table that you will sync with Snowflake Open Catalog, keep the following in mind:

  > **Important:**
  >
  > To ensure that access privileges in Open Catalog are enforced correctly on the table, make sure the table meets certain conditions
  > before creating it. These conditions relate to the directory structure hierarchy for the catalog. For these conditions and instructions on
  > how to meet them, see the note in
  > [Organize catalog content](https://other-docs.snowflake.com/en/opencatalog/organize-catalog-content#conditions-correct-access-privileges)
  > in the Snowflake Open Catalog documentation.

To troubleshoot issues with creating a Snowflake-managed table, see [You can’t create a Snowflake-managed table](../../user-guide/tables-iceberg-open-catalog-troubleshooting.md).

## Examples

### Create an Iceberg table with Snowflake as the catalog

This example creates an Iceberg table with Snowflake as the Iceberg catalog.
The resulting table is managed by Snowflake and supports read and write access.

The example sets the table name (`my_iceberg_table`) as the `BASE_LOCATION`. This way,
Snowflake writes data and metadata to a directory that uses the name of the table in your external volume
location.

```sqlexample
CREATE ICEBERG TABLE my_iceberg_table (amount int)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table';
```

### Create a partitioned Iceberg table

The following example creates a Snowflake-managed Iceberg table by using the value of a column named `c_nationkey` to partition the table:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE customer_iceberg_partitioned (
  c_custkey INTEGER,
  c_name STRING,
  c_address STRING,
  c_nationkey INTEGER,
  c_phone STRING,
  c_acctbal INTEGER,
  c_mktsegment STRING,
  c_comment STRING
)
  PARTITION BY (c_nationkey)
  EXTERNAL_VOLUME = 'my_ext_vol'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'customer_iceberg_partitioned';
```

For more information, see [Iceberg partitioning](../../user-guide/tables-iceberg-metadata.md).

### Create a partitioned Iceberg table with hierarchical layout

The following example creates a Snowflake-managed Iceberg table by using the value of a column named `c_nationkey` to
partition the table. Because PATH_LAYOUT = HIERARCHICAL, Snowflake writes data to the partitioned Iceberg table by using a hierarchical
path layout for files where partitioning information is included in the file paths:

```sqlexample
CREATE OR REPLACE ICEBERG TABLE customer_iceberg_partitioned (
  c_custkey INTEGER,
  c_name STRING,
  c_address STRING,
  c_nationkey INTEGER,
  c_phone STRING,
  c_acctbal INTEGER,
  c_mktsegment STRING,
  c_comment STRING
)
  PARTITION BY (c_nationkey)
  PATH_LAYOUT = HIERARCHICAL
  EXTERNAL_VOLUME = 'my_ext_vol'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'customer_iceberg_partitioned';
```

For more information, see [Partitioning with hierarchical paths](../../user-guide/tables-iceberg-metadata.md).

### Create an Iceberg table by using the CTAS variant syntax

This example use the CREATE ICEBERG TABLE … AS SELECT variant syntax to create a *new* Iceberg table from a table named
`base_iceberg_table`. The AS SELECT clause must be at the end of the statement.

```sqlexample
CREATE OR REPLACE ICEBERG TABLE iceberg_table_copy (column1 int)
  EXTERNAL_VOLUME = 'my_external_volume'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'iceberg_table_copy'
  AS SELECT * FROM base_iceberg_table;
```

### Specify an external volume with a double-quoted identifier

This example creates an Iceberg table with an external volume whose identifier contains double quotes.
Identifiers enclosed in double quotes are case-sensitive and often contain special characters.

The identifier `"external_volume_1"` is specified exactly as created (including the double quotes).
Failure to include the quotes might result in an `Object does not exist` error (or similar type of error).

To learn more, see [Double-quoted identifiers](../identifiers-syntax.md).

```sqlexample
CREATE OR REPLACE ICEBERG TABLE table_with_quoted_external_volume
  EXTERNAL_VOLUME = '"external_volume_1"'
  CATALOG = 'SNOWFLAKE'
  BASE_LOCATION = 'my/relative/path/from/external_volume';
```

### Create a v3 Iceberg table

The following example creates a Snowflake-managed Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ specification:

```sqlexample
CREATE ICEBERG TABLE my_v3_iceberg_table (
  record VARIANT,
  event_timestamp TIMESTAMP_LTZ(6)
)
  CATALOG = 'SNOWFLAKE'
  EXTERNAL_VOLUME = 'my_external_volume'
  BASE_LOCATION = 'my_iceberg_table'
  ICEBERG_VERSION = 3;
```

---
title: CREATE IMAGE REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/create-image-repository.md
section: SQL Commands
---

# CREATE IMAGE REPOSITORY

Creates a new [image repository](../../developer-guide/snowpark-container-services/working-with-registry-repository.md) in the
current schema.

See also:
:   [DROP IMAGE REPOSITORY](drop-image-repository.md) , [SHOW IMAGE REPOSITORIES](show-image-repositories.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] IMAGE REPOSITORY [ IF NOT EXISTS ] <name>
  [ ENCRYPTION = ( TYPE = 'SNOWFLAKE_FULL' | TYPE = 'SNOWFLAKE_SSE' ) ]
```

## Required parameters

`name`
:   Specifies the identifier (that is, the name) for the image repository; it must be unique for the schema in which the repository is created.

    Quoted names for special characters or case-sensitive names are not supported. The same constraint also applies to database and
    schema names where you create an image repository. That is, database and schema names without quotes are valid when creating an
    image repository.

## Optional parameters

`ENCRYPTION = ( TYPE = 'SNOWFLAKE_FULL' | TYPE = 'SNOWFLAKE_SSE' )`
:   Specifies the type of encryption to use for binaries stored in the image repository. You cannot change the encryption type after you create the image repository.

    `TYPE = ...`
    :   Specifies the encryption type to use.

    > **Important:**
    >
    > If you require Tri-Secret Secure for security compliance, use the `SNOWFLAKE_FULL` encryption type for internal stages.
    > `SNOWFLAKE_SSE` does not support Tri-Secret Secure.

    Possible values are the following:

    * `SNOWFLAKE_FULL`: On-host (image registry host) and server-side encryption. Data is first encrypted by Snowflake’s image registry service before sending the data to cloud service provider storage (for example, Amazon S3) where your Snowflake account is hosted.

      Snowflake uses AES-GCM with a 128-bit encryption key by default.
      You can configure a 256-bit key by setting the [CLIENT_ENCRYPTION_KEY_SIZE](../parameters.md) parameter. All binaries are also automatically encrypted using AES-256 strong encryption on the server side.

      > **Note:**
      >
      > With SNOWFLAKE_FULL encryption, Snowflake might throttle requests against the public image repository API. This throttling is triggered only when Snowflake detects an unusually large number of parallel requests against the repository. Note that, the service creation will never be impacted.
    * `SNOWFLAKE_SSE`: Server-side encryption only. The binaries are encrypted by the cloud service provider (for example, Amazon S3) where your Snowflake account is hosted when they arrive on the image repository storage area.

    Default: `SNOWFLAKE_FULL`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE IMAGE REPOSITORY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create an image repository:

```sqlexample
CREATE OR REPLACE IMAGE REPOSITORY tutorial_repository;
```

Create an image repository with SNOWFLAKE_FULL encryption:

```sqlexample
CREATE OR REPLACE IMAGE REPOSITORY tutorial_repository
ENCRYPTION = (type = 'SNOWFLAKE_SSE');
```

---
title: CREATE INDEX
source: https://docs.snowflake.com/en/sql-reference/sql/create-index.md
section: SQL Commands
---

# CREATE INDEX

Creates a new secondary index in an existing [hybrid table](../../user-guide/tables-hybrid.md) and populates the index with data.

The creation of an index is an online (non-blocking) operation. The hybrid table remains available for SELECT and DML
statements while the index is being built. However, if the hybrid table isn’t in active use and downtime isn’t an issue,
Snowflake recommends that you recreate the hybrid table with the indexes defined. See also [Create hybrid tables](../../user-guide/tables-hybrid-create.md)
and [Index hybrid tables](../../user-guide/tables-hybrid-index.md).

See also:
:   [DROP INDEX](drop-index.md) , [SHOW INDEXES](show-indexes.md) , [CREATE HYBRID TABLE](create-hybrid-table.md) , [DROP TABLE](drop-table.md) , [DESCRIBE TABLE](desc-table.md) , [SHOW HYBRID TABLES](show-hybrid-tables.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] INDEX [ IF NOT EXISTS ] <index_name>
  ON <table_name>
    ( <col_name> [ , <col_name> , ... ] )
    [ INCLUDE ( <col_name> [ , <col_name> , ... ] ) ]
```

## Parameters

`index_name`
:   Specifies the identifier for the new index. You must specify a unique name for each new index on a given hybrid table.
    No other secondary index with either the same name or the same ordered set of columns can exist on the hybrid table.

`table_name`
:   Specifies the name of an existing hybrid table that will hold the new index.

`col_name`
:   Specifies the name of an existing column in the hybrid table. All the requirements for index columns defined at table creation
    apply to column identifiers.

    A hybrid table cannot contain two secondary indexes defined on the same ordered set of columns.

    Columns with [geospatial data types](../data-types-geospatial.md)
    (GEOGRAPHY and GEOMETRY), [semi-structured data types](../data-types-semistructured.md)
    (ARRAY, OBJECT, VARIANT), and [vector data types](../data-types-vector.md) (VECTOR) are not supported in secondary indexes.

## Optional parameters

`INCLUDE ( col_name [ , col_name , ... ] )`
:   Specifies one or more included columns for a secondary index. Using included columns with a secondary index is
    particularly useful when queries frequently contain a set of columns in the SELECT list but not in
    the list of WHERE predicates. For more information, see [INCLUDE columns](../../user-guide/tables-hybrid-index.md).

    INCLUDE columns can’t be semi-structured columns (VARIANT, OBJECT, ARRAY) or geospatial columns (GEOGRAPHY, GEOMETRY).

## Access control requirements

To create an index, you must use a role that has OWNERSHIP privilege on the hybrid table.

## Usage notes

* The CREATE INDEX command cannot be used to add a foreign, primary, or unique key constraint.
* The creation of a new index does not concurrently block other workloads. The hybrid table is available for concurrent SELECT
  and DML statements.
* Only one active index build operation per hybrid table can run at any time.
* You can track the progress of an index build by using [SHOW INDEXES](show-indexes.md). The STATUS column can take the following values:

  + `ACTIVE`: Index is complete and can be used to retrieve data.
  + `SUSPENDED`: Index is only updated and is not used to retrieve data.
  + `BUILD FAILURE`: An error has occurred with the index build process. You need to drop and recreate the index.
  + `BUILD IN PROGRESS`: Index is being built and is not used to retrieve data.
* You can rebuild a non-active index, where the status is `SUSPENDED`, `BUILD FAILURE`, or `BUILD IN PROGRESS`, by using DROP INDEX
  and CREATE INDEX.
* If you want to drop a column that is part of an index that is being built, first stop the index build by dropping the index, then
  drop the column. If you try to drop the column before dropping the index, you will receive this error message:

  ```output
  Column '<col_name>' cannot be dropped because it is used by index '<index-name>'.
  ```
* Online index builds do not make progress until all the active transactions with DMLs on the same table at the time when the
  CREATE INDEX statement was issued are completed. If any of those transactions remain idle for more than 5 minutes, they will
  abort by default. See [Transactions](../transactions.md).
* During the index build process, any DML performs its writes to the new index, but does not use the index to retrieve data.
* A small number of concurrent DMLs, which began executing after the CREATE INDEX command was complete, may fail and return
  this error:

  ```output
  DML was unaware of concurrent DDL. Please retry this query.
  ```

  If the aborted DML statements belong to a multi-statement transaction, the transaction will roll back only if the
  [TRANSACTION_ABORT_ON_ERROR](../parameters.md) parameter is set to TRUE.
* A newly created index will be used for retrieving data only when the index build process concludes successfully and the
  status of the index is `ACTIVE`.
* Indexed columns do not support collations. For more information, see [Collations on hybrid table columns](create-hybrid-table.md) and
  [Collation control](../collation.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

To run the following CREATE INDEX example, first create and load the hybrid table.

```sqlexample
CREATE OR REPLACE HYBRID TABLE mytable (
  pk INT PRIMARY KEY,
  val INT,
  val2 INT
);

INSERT INTO mytable SELECT seq, seq+100, seq+200
  FROM (SELECT seq8() seq FROM TABLE(GENERATOR(rowcount => 100)) v);
```

Now you can create an index on the table.

```sqlexample
CREATE OR REPLACE INDEX vidx ON mytable (val);
```

```output
+----------------------------------+
| status                           |
|----------------------------------|
| Statement executed successfully. |
+----------------------------------+
```

If a failure occurs while the index is being built, the SHOW INDEXES command reports the following status:

```output
BUILD FAILURE Index build failed. Please drop the index and re-create it.
```

If you decide to stop the index build, use a [DROP INDEX](drop-index.md) command:

```sqlexample
DROP INDEX mytable.vidx;
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| Statement executed successfully.    |
+-------------------------------------+
```

---
title: CREATE INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-integration.md
section: SQL Commands
---

# CREATE INTEGRATION

Creates a new integration in the system or replaces an existing integration. An integration is a Snowflake object that provides an
interface between Snowflake and third-party services.

See also:
:   [ALTER INTEGRATION](alter-integration.md), [DROP INTEGRATION](drop-integration.md), [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] <integration_type> INTEGRATION [ IF NOT EXISTS ] <object_name>
  [ <integration_type_params> ]
  [ COMMENT = '<string_literal>' ]
```

Where `integration_type_params` are specific to the integration type.

For specific syntax, usage notes, and examples, see:

* [CREATE API INTEGRATION](create-api-integration.md)
* [CREATE CATALOG INTEGRATION](create-catalog-integration.md)
* [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)
* [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)
* [CREATE SECURITY INTEGRATION](create-security-integration.md)
* [CREATE STORAGE INTEGRATION](create-storage-integration.md)

## General usage notes

* `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive; they cannot both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

---
title: CREATE INTERACTIVE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-interactive-table.md
section: SQL Commands
---

# CREATE INTERACTIVE TABLE

Creates a new [interactive table](../../user-guide/interactive.md) in the current/specified schema or
replaces an existing table. Interactive tables are optimized for low-latency, interactive queries
and provide the best performance when queried using interactive warehouses.

Interactive tables support a more limited set of SQL operations than standard tables and are
designed for high-concurrency, real-time query workloads such as dashboards and data-powered APIs.

> **Note:**
>
> When you create an interactive table, you must define a CLUSTER BY clause on one or more columns
> that are used in the WHERE clauses for your most time-critical queries.

You can also use the following CREATE INTERACTIVE TABLE variants:

* Variant syntax: Static interactive table (creates a static interactive table populated from a query)
* Variant syntax: Dynamic interactive table (creates a dynamic interactive table with automatic refresh)

For the full CREATE TABLE syntax used for standard Snowflake tables, see [CREATE TABLE](create-table.md).

> **Tip:**
>
> Before creating and using interactive tables, you should become familiar with the
> [limitations and use cases](../../user-guide/interactive.md). Interactive tables work best with simple SELECT statements with selective WHERE clauses.

See also:
:   [CREATE WAREHOUSE](create-warehouse.md), [ALTER WAREHOUSE](alter-warehouse.md), [SHOW TABLES](show-tables.md), [SHOW WAREHOUSES](show-warehouses.md), [DROP TABLE](drop-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] INTERACTIVE TABLE [ IF NOT EXISTS ] <table_name>
  (
    <col_name> <col_type>
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ , <col_name> <col_type> [ ... ] ]
  )
  CLUSTER BY ( <expr> [ , <expr> , ... ] )
  [ TARGET_LAG = '<num> { seconds | minutes | hours | days }' ]
  [ WAREHOUSE = <warehouse_name> ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> [ ENTITY KEY ( <col_name> [ , <col_name> ... ] ) ] ]
  [ [ WITH ] JOIN POLICY <policy_name> [ ALLOWED JOIN KEYS ( <col_name> [ , ... ] ) ] ]
  [ [ WITH ] STORAGE LIFECYCLE POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  AS <query>
```

## Required parameters

`table_name`
:   Specifies the identifier (i.e. name) for the interactive table; must be unique for the schema in
    which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or
    special characters unless the entire identifier string is enclosed in double quotes (e.g.
    `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies one or more columns or column expressions in the table as the clustering key. Choose
    clustering columns that are used in the WHERE clauses of your most time-critical queries, as this
    significantly affects query performance.

    For more details about choosing effective clustering keys, see [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

`AS query`
:   Specifies the [SELECT statement](../constructs.md) that populates the interactive
    table. This query must be specified last in the CREATE INTERACTIVE TABLE statement, regardless of
    other parameters included.

    The query follows CREATE TABLE AS SELECT (CTAS) patterns and defines the data and schema for the
    interactive table.

`col_name`
:   Specifies the column identifier (i.e. name). Column identifiers must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier string is enclosed in double quotes.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`col_type`
:   Specifies the data type for the column.

    For details about the data types that can be specified for table columns, see [SQL data types reference](../../sql-reference-data-types.md).

## Optional parameters

`MASKING POLICY policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

`USING ( col_name , cond_col_1 ... )`
:   Specifies the arguments to pass into the conditional masking policy SQL expression.

    The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
    column to which the masking policy is set.

    The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query result
    when a query is made on the first column.

    If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
    [masking policy](../../user-guide/security-column-intro.md).

`OR REPLACE`
:   Specifies to replace the interactive table if it already exists in the schema. This is equivalent
    to using [DROP TABLE](drop-table.md) on the existing table and then creating a new table with the same name.

`IF NOT EXISTS`
:   Specifies to create the interactive table only if it does not already exist in the schema. If a
    table with the same name already exists, the statement succeeds without creating a new table.

    > **Note:**
    >
    > The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive and cannot both be used in the
    > same statement.

`TARGET_LAG = 'num { seconds | minutes | hours | days }'`
:   Specifies the maximum lag time for automatic refresh of the interactive table. When specified, the
    interactive table becomes a dynamic interactive table that automatically refreshes to stay within
    the specified lag time of the source data.

    * The minimum value is 60 seconds (1 minute).
    * If no unit is specified, the number represents seconds.
    * If TARGET_LAG is not specified, the table is created as a static interactive table.

    When TARGET_LAG is specified, the WAREHOUSE parameter is also required.

`WAREHOUSE = warehouse_name`
:   **Required when TARGET_LAG is specified.** Specifies the standard warehouse used for refresh operations when TARGET_LAG is set. This must be a standard warehouse, not an interactive warehouse.

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when replacing an interactive table using CREATE OR REPLACE INTERACTIVE TABLE.

    The parameter copies all privileges, except OWNERSHIP, from the existing table to the new table. By default, the role that executes the CREATE INTERACTIVE TABLE statement owns the new table.

`COMMENT = 'string_literal'`
:   Specifies a comment for the interactive table.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a table.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`AGGREGATION POLICY policy_name [ ENTITY KEY ( col_name [ , col_name ... ] ) ]`
:   Specifies an [aggregation policy](../../user-guide/aggregation-policies.md) to set on a table. You can apply one or more aggregation
    policies on a table.

    Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
    [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md). You can specify one or more entity keys for an aggregation policy.

`JOIN POLICY policy_name [ ALLOWED JOIN KEYS ( col_name [ , ... ] ) ]`
:   Specifies the [join policy](../../user-guide/join-policies.md) to set on a table.

    Use the optional ALLOWED JOIN KEYS parameter to define which columns are allowed to be used as joining columns when
    this policy is in effect. For more information, see [Join policies](../../user-guide/join-policies.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`STORAGE LIFECYCLE POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md)
    to attach to the table.

    The columns specified in the ON clause must match the argument count and data types defined in the
    policy function signature. Snowflake uses these columns to evaluate the policy expression and
    determine which rows to archive or expire.

    > **Important:**
    >
    > If you attach an archival storage policy to a table, the table is permanently assigned to the specified archive tier for its lifetime. You can’t change the archive tier by applying a new policy. For example, you can’t specify a policy created with a COOL archive tier in ALTER TABLE…DROP STORAGE LIFECYCLE POLICY and then subsequently alter the table to add a policy created with a COLD archive tier. To alter the archive tier for a table, contact Snowflake Support to request deletion of the currently archived data. For additional considerations, see [Archival storage policies](../../user-guide/storage-management/storage-lifecycle-policies.md).

    For more information about creating and managing storage lifecycle policies, see
    [Create and manage storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-create-manage.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTERACTIVE TABLE | Schema | Required to create an interactive table in the schema. |
| SELECT | Table, external table, view | Required on queried tables and/or views in the AS SELECT clause. |
| APPLY | Masking policy, row access policy, tag, storage lifecycle policy | Required only when applying a masking policy, row access policy, object tags, storage lifecycle policy, or any combination of these [governance](../../guides-overview-govern.md) features when creating tables. |
| USAGE | Database, Schema | Required on the database and schema containing the interactive table. |
| USAGE | Warehouse | Required on the warehouse specified in the WAREHOUSE parameter (when TARGET_LAG is used). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Interactive tables must be created using a standard warehouse, not an interactive warehouse.
* The CLUSTER BY clause is required for all interactive tables and significantly affects query performance. Choose clustering columns carefully based on your most common WHERE clause patterns.
* Interactive tables provide the best performance when queried through interactive warehouses. To get optimal performance for an interactive table:

  1. Create an interactive warehouse
  2. Associate the interactive table with the interactive warehouse using ALTER WAREHOUSE … ADD TABLES
  3. Resume the interactive warehouse
  4. Use the interactive warehouse to query the interactive table
* Interactive tables support a limited set of SQL operations compared to standard tables:

  + SELECT statements with WHERE clauses are optimized.
  + Simple GROUP BY operations are supported.
  + DML operations (INSERT, UPDATE, DELETE) are not supported. The only allowed DML operation is INSERT OVERWRITE.
  + Complex query operations may have limited performance benefits.
* Dynamic interactive tables (with TARGET_LAG) automatically refresh using the specified standard warehouse. The lag time balances data freshness with compute costs.
* Static interactive tables don’t automatically refresh. They require manual updates to reflect
  changes in source data. To do so, run a CREATE OR REPLACE command or an INSERT OVERWRITE command
  on the interactive table.
* A single masking policy that uses conditional columns can be applied to multiple tables provided that the column structure of the table
  matches the columns specified in the policy.
* When creating a table with a masking policy on one or more table columns, or a row access policy added to the table, use the
  [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the table
  protected by a row access policy.
* Interactive tables store additional metadata and index information to accelerate queries, but this is compressed and has minimal impact on storage size.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* For creating a table with the WITH STORAGE LIFECYCLE POLICY clause:

  + You must have the necessary privileges to apply the policy. For information about required privileges, see
    [Storage lifecycle policy privileges](../../user-guide/security-access-control-privileges.md).
  + A table can have only one attached storage lifecycle policy.
  + The number of columns must match the argument count in the policy function signature, and the column data must be compatible with the argument types.
  + Associated policies aren’t affected if you rename table columns. Snowflake associates policies to tables by using the column IDs.
  + In order to evaluate and apply storage lifecycle policy expressions, Snowflake internally and temporarily bypasses any governance policies on a table.

## Variant syntax: Static interactive table

Creates a static interactive table that is populated once from the source query:

```sqlsyntax
CREATE [ OR REPLACE ] INTERACTIVE TABLE <table_name>
  CLUSTER BY ( <expr> [ , <expr> , ... ] )
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  AS <query>
```

Static interactive tables don’t automatically refresh. They require manual updates to reflect
changes in source data. To do so, run a CREATE OR REPLACE command or an INSERT OVERWRITE command on
the interactive table.

## Variant syntax: Dynamic interactive table

Creates a dynamic interactive table that automatically refreshes based on the specified lag time:

```sqlsyntax
CREATE [ OR REPLACE ] INTERACTIVE TABLE <table_name>
  CLUSTER BY ( <expr> [ , <expr> , ... ] )
  TARGET_LAG = '<num> { seconds | minutes | hours | days }'
  WAREHOUSE = <warehouse_name>
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  AS <query>
```

Dynamic interactive tables automatically refresh to stay within the specified TARGET_LAG of the source data, using the specified standard warehouse for refresh operations.

## Examples

The following examples show different ways that you can create interactive tables,
along with specifying the source of their data and how to refresh the data.

### Basic static interactive table

Create a static interactive table from existing order data, clustered by customer and date for optimal query performance:

```sqlexample
CREATE INTERACTIVE TABLE orders_interactive
  CLUSTER BY (customer_id, order_date)
  COMMENT = 'Interactive table for real-time order analytics'
AS
  SELECT customer_id, order_date, product_id, quantity, total_amount
  FROM orders_staging
  WHERE order_date >= '2024-01-01';
```

### Dynamic interactive table with auto-refresh

Create a dynamic interactive table that refreshes every 5 minutes to provide near real-time sales summaries:

```sqlexample
CREATE INTERACTIVE TABLE sales_summary_interactive
  CLUSTER BY (region, product_category)
  TARGET_LAG = '5 minutes'
  WAREHOUSE = refresh_warehouse
  COMMENT = 'Real-time sales dashboard data'
AS
  SELECT
    region,
    product_category,
    SUM(sales_amount) as total_sales,
    COUNT(*) as transaction_count,
    AVG(sales_amount) as avg_sale
  FROM sales_data
  GROUP BY region, product_category;
```

### Multi-column clustering for complex queries

Create an interactive table with multi-column clustering optimized for various query patterns:

```sqlexample
CREATE INTERACTIVE TABLE customer_analytics_interactive
  CLUSTER BY (customer_tier, region, signup_date)
  TARGET_LAG = '10 minutes'
  WAREHOUSE = analytics_warehouse
AS
  SELECT
    customer_id,
    customer_tier,
    region,
    signup_date,
    total_orders,
    lifetime_value,
    last_order_date
  FROM customer_metrics
  WHERE customer_tier IN ('GOLD', 'PLATINUM', 'DIAMOND');
```

### Replace existing interactive table

Replace an existing interactive table with updated clustering and refresh settings:

```sqlexample
CREATE OR REPLACE INTERACTIVE TABLE product_performance_interactive
  CLUSTER BY (category, brand, launch_date)
  TARGET_LAG = '2 minutes'
  WAREHOUSE = fast_refresh_warehouse
  COPY GRANTS
AS
  SELECT
    product_id,
    category,
    brand,
    launch_date,
    units_sold,
    revenue,
    customer_rating
  FROM product_sales_view
  WHERE launch_date >= DATEADD('month', -6, CURRENT_DATE());
```

---
title: CREATE INTERACTIVE WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/create-interactive-warehouse.md
section: SQL Commands
---

# CREATE INTERACTIVE WAREHOUSE

Creates a new interactive [virtual warehouse](../../user-guide/warehouses-overview.md) optimized for low-latency, high-concurrency workloads with interactive tables.

Interactive warehouses are designed to deliver optimal query performance when working with interactive tables, which provide
fast query responses for frequently accessed data through intelligent caching and optimization.

See also:
:   [CREATE WAREHOUSE](create-warehouse.md), [ALTER WAREHOUSE](alter-warehouse.md), [DESCRIBE WAREHOUSE](desc-warehouse.md), [DROP WAREHOUSE](drop-warehouse.md), [SHOW WAREHOUSES](show-warehouses.md), [CREATE INTERACTIVE TABLE](create-interactive-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] INTERACTIVE WAREHOUSE [ IF NOT EXISTS ] <name>
       [ TABLES ( <table_name> [ , <table_name> ... ] ) ]
       [ [ WITH ] objectProperties ]
       [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
       [ objectParams ]
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   WAREHOUSE_SIZE = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE | X5LARGE | X6LARGE }
>   MAX_CLUSTER_COUNT = <num>
>   MIN_CLUSTER_COUNT = <num>
>   AUTO_SUSPEND = { <num> | NULL }
>   AUTO_RESUME = { TRUE | FALSE }
>   INITIALLY_SUSPENDED = { TRUE | FALSE }
>   RESOURCE_MONITOR = <monitor_name>
>   COMMENT = '<string_literal>'
> ```
>
> ```sqlsyntax
> objectParams ::=
>   MAX_CONCURRENCY_LEVEL = <num>
>   STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
>   STATEMENT_TIMEOUT_IN_SECONDS = <num>
> ```

## Parameters

`name`
:   Specifies the identifier for the interactive warehouse. The identifier must be unique within your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TABLES ( ... )`
:   Optionally specifies a comma-separated list of interactive table names to immediately associate with the interactive warehouse.
    Using this clause starts the cache-warming process for the specified tables when the warehouse is created.

    `table_name`
    :   Specifies the identifier for an interactive table to associate with the warehouse. You can specify multiple table names separated by commas.

        > **Note:**
        >
        > * All specified tables must be interactive tables created with the `INTERACTIVE` keyword.
        > * If this clause is omitted, you can associate interactive tables later using [ALTER WAREHOUSE](alter-warehouse.md) with the `ADD TABLES` clause.
        > * Cache warming may take significant time depending on the size of the data.

`WAREHOUSE_SIZE = string_constant`
:   Specifies the size of the interactive warehouse. Interactive warehouses support specific sizes optimized for interactive workloads.

    Valid values:
    :   * `XSMALL` , `'X-SMALL'`
        * `SMALL`
        * `MEDIUM`
        * `LARGE`
        * `XLARGE` , `'X-LARGE'`
        * `XXLARGE` , `X2LARGE` , `'2X-LARGE'`
        * `XXXLARGE` , `X3LARGE` , `'3X-LARGE'`

    Default:
    :   `XSMALL`

    > **Note:**
    >
    > * To use a value that contains a hyphen (for example, `'2X-LARGE'`), you must enclose the value in single quotes, as shown.
    > * Choose a warehouse size to match your workload requirements. You can adjust the
    >   `MIN_CLUSTER_COUNT` and `MAX_CLUSTER_COUNT` properties to optimize for concurrency.

`MAX_CLUSTER_COUNT = num`
:   Specifies the maximum number of clusters for a multi-cluster interactive warehouse.

    Valid values:
    :   `1` to `10` (depending on warehouse size)

    Default:
    :   `1` (single-cluster warehouse)

`MIN_CLUSTER_COUNT = num`
:   Specifies the minimum number of clusters for a multi-cluster interactive warehouse.

    Valid values:
    :   `1` to the value of MAX_CLUSTER_COUNT

    Default:
    :   `1`

`AUTO_SUSPEND = { num | NULL }`
:   Specifies the number of seconds of inactivity after which the interactive warehouse is automatically suspended.

    The minimum value for interactive warehouses is `86400` (24 hours). If you specify a value less
    than 86400, Snowflake uses 86400. Setting the value to `NULL` disables auto-suspend.

    Default:
    :   `NULL` (auto-suspend is disabled)

`AUTO_RESUME = { TRUE | FALSE }`
:   Specifies whether to automatically resume the interactive warehouse when a SQL statement is
    submitted to it.

    Default:
    :   `FALSE`

`INITIALLY_SUSPENDED = { TRUE | FALSE }`
:   Specifies whether the interactive warehouse is created in a suspended state.

    Default:
    :   `TRUE` (interactive warehouses are created suspended)

`RESOURCE_MONITOR = monitor_name`
:   Specifies the identifier of a resource monitor to assign to the interactive warehouse for credit usage control.

    Valid values:
    :   Any existing resource monitor

    Default:
    :   No value (no resource monitor assigned)

`COMMENT = 'string_literal'`
:   Specifies a comment for the interactive warehouse.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`MAX_CONCURRENCY_LEVEL = num`
:   Specifies the concurrency level for SQL statements executed by the interactive warehouse cluster.

`STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = num`
:   Specifies the time, in seconds, a SQL statement can be queued before being canceled.

`STATEMENT_TIMEOUT_IN_SECONDS = num`
:   Specifies the time, in seconds, after which a running SQL statement is canceled.
    Interactive warehouses have a maximum timeout interval of five seconds.
    Any larger values are ignored.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE WAREHOUSE | Account | Required to create any warehouse, including interactive warehouses. |
| USAGE | Interactive Table | Required on each interactive table specified in the `TABLES` clause, if used. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Interactive warehouses are created in a `SUSPENDED` state by default. Use [ALTER WAREHOUSE](alter-warehouse.md)
  with the RESUME clause to start the warehouse.
* When you specify the TABLES clause, cache warming begins immediately for the specified
  interactive tables. This process may take significant time depending on data size.
* Interactive warehouses can only query interactive tables. To query standard tables, use a standard
  warehouse created with [CREATE WAREHOUSE](create-warehouse.md).
* Interactive warehouses support auto-suspend and auto-resume. The minimum AUTO_SUSPEND value
  is 86400 seconds (24 hours). For more information, see [Resuming and suspending an interactive warehouse](../../user-guide/interactive.md).
* Interactive warehouses support multi-cluster configuration for handling high-concurrency workloads.
* If you don’t specify the `TABLES` clause during creation, you can associate interactive tables
  later using [ALTER WAREHOUSE](alter-warehouse.md) with the ADD TABLES clause.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Billing and pricing

For information about billing and pricing considerations for interactive warehouses, see
[Cost and billing considerations](../../user-guide/interactive.md).

## Examples

Create an interactive warehouse associated with specific interactive tables:

```sqlexample
CREATE OR REPLACE INTERACTIVE WAREHOUSE sales_interactive_wh
  TABLES (orders, customers, products)
  WAREHOUSE_SIZE = 'MEDIUM'
  COMMENT = 'Interactive warehouse for sales team analytics';
```

Create an interactive warehouse without associated tables (to be added later):

```sqlexample
CREATE INTERACTIVE WAREHOUSE analytics_interactive_wh
  WAREHOUSE_SIZE = 'LARGE'
  MAX_CLUSTER_COUNT = 3
  MIN_CLUSTER_COUNT = 3;
```

Create an interactive warehouse with resource monitoring:

```sqlexample
CREATE INTERACTIVE WAREHOUSE dev_interactive_wh
  WAREHOUSE_SIZE = 'XSMALL'
  RESOURCE_MONITOR = dev_resource_monitor
  COMMENT = 'Development interactive warehouse';
```

Resume an interactive warehouse and associate tables with it:

```sqlexample
-- Resume the warehouse
ALTER WAREHOUSE sales_interactive_wh RESUME;

-- Add additional tables if needed
ALTER WAREHOUSE sales_interactive_wh ADD TABLES (inventory);
```

---
title: CREATE JOIN POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-join-policy.md
section: SQL Commands
---

# CREATE JOIN POLICY

Creates a new [join policy](../../user-guide/join-policies.md) in the current/specified schema or replaces an existing
join policy.

After creating a join policy, assign the policy to a table using an [ALTER TABLE](alter-table.md) command or a view using an [ALTER VIEW](alter-view.md) command. Alternatively, you can assign a join policy to a table when you create it.

See also:
:   [Join policy DDL reference](../../user-guide/join-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] JOIN POLICY [ IF NOT EXISTS ] <name>
  AS () RETURNS JOIN_CONSTRAINT -> <body>
  [ COMMENT = '<string_literal>' ]
```

## Parameters

`name`
:   Identifier for the join policy; must be unique for your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`AS () RETURNS JOIN_CONSTRAINT`
:   Signature and return type of the policy. The signature does not accept any arguments, and the return type is JOIN_CONSTRAINT, which is an internal data type. All join policies have the same signature and return
    type.

`body`
:   SQL expression that determines the restrictions of a join policy.

    To define the body of the join policy, call the JOIN_CONSTRAINT function, which returns TRUE or FALSE.
    When the function returns TRUE, queries are required to use a join to return results.

    The syntax of the JOIN_CONSTRAINT function is:

    ```sqlsyntax
    JOIN_CONSTRAINT (
      { JOIN_REQUIRED => <boolean_expression> }
      )
    ```

    Where:

    `JOIN_REQUIRED => boolean_expression`
    :   Specifies whether a join is required in queries when data is selected from tables or views that have
        the join policy assigned to them.

    The body of a policy cannot reference user-defined functions, tables, or views.

    Allowed join columns are specified in the CREATE or ALTER statement for the table or view to which the
    policy is applied, not in the CREATE JOIN POLICY statement.

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the join policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE JOIN POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about join policy DDL and privileges, see [Managing join policies](../../user-guide/join-policies.md).

## Usage notes

* If you want to update an existing join policy and need to see the current body of the policy, run the
  [DESCRIBE JOIN POLICY](desc-join-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a join policy that requires queries to include a join (when the policy is
applied to tables and views that appear in those queries):

> ```sqlexample
> CREATE JOIN POLICY jp1 AS ()
>   RETURNS JOIN_CONSTRAINT -> JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE);
> ```

Create a join policy that allows a user with the ACCOUNTADMIN role to run queries without joins;
other users must run join queries:

> ```sqlexample
> CREATE JOIN POLICY jp2 AS ()
>   RETURNS JOIN_CONSTRAINT ->
>     CASE
>       WHEN CURRENT_ROLE() = 'ACCOUNTADMIN'
>         THEN JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE)
>       ELSE JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE)
>     END;
> ```

---
title: CREATE LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/create-listing.md
section: SQL Commands
---

# CREATE LISTING

Create a free listing to share directly with specific consumers, with an inline YAML manifest, or from a file located in a stage location.

See also:
:   [ALTER LISTING](alter-listing.md), [DESCRIBE LISTING](desc-listing.md), [SHOW LISTINGS](show-listings.md), [SHOW VERSIONS IN LISTING](show-versions-in-listing.md), [DROP LISTING](drop-listing.md), [Listing manifest reference](../../progaccess/listing-manifest-reference.md)

## Syntax

```sqlsyntax
CREATE EXTERNAL LISTING [ IF NOT EXISTS ] <name>
  [ { SHARE <share_name>  |  APPLICATION PACKAGE <package_name> } ]
  AS '<yaml_manifest_string>'
  [ PUBLISH = { TRUE | FALSE } ]
  [ REVIEW = { TRUE | FALSE } ]
  [ COMMENT = '<string>' ]

CREATE EXTERNAL LISTING [ IF NOT EXISTS ] <name>
  [ { SHARE <share_name>  |  APPLICATION PACKAGE <package_name> } ]
  FROM '<yaml_manifest_stage_location>'
  [ PUBLISH = { TRUE | FALSE } ]
  [ REVIEW = { TRUE | FALSE } ]
```

## Parameters

`name`
:   Specifies the listing identifier (name). It must conform to the following:

    * Must be unique within an organization, regardless of which Snowflake Region the account is located in.
    * Must start with an alphabetic character and cannot contain spaces or special characters except for
      underscores (`_`).

`SHARE share_name`
:   Specifies the identifier for the share to attach to the listing.

`APPLICATION PACKAGE package_name`
:   Specifies the application package attached to the listing.

    See also [SHOW APPLICATION PACKAGES](show-application-packages.md).

`AS 'yaml_manifest_string'`
:   Specifies the YAML manifest for the listing. For manifest parameters, see [Listing manifest reference](../../progaccess/listing-manifest-reference.md).

    Manifests are normally provided as dollar quoted strings.
    For more information, see [Dollar-quoted string constants](../data-types-text.md).

`FROM 'yaml_manifest_stage_location'`
:   Specifies the path for the internal stage or Git repository clone manifest.yml file.

`PUBLISH = { TRUE | FALSE }`
:   Specifies how the listing should be published.

    If TRUE, listing is published immediately on listing to Marketplace Ops for review.

    Default: TRUE.

`REVIEW =  { TRUE | FALSE }`
:   Specifies whether the listing should or should not submitted to Marketplace Ops review.

    Default: TRUE.

Different combinations of values for the PUBLISH and REVIEW properties result in the following behaviors:

| PUBLISH | REVIEW | Behavior |
| --- | --- | --- |
| TRUE | TRUE | Request review then immediately publish after approval. |
| TRUE | FALSE | Results in an error. You cannot publish a listing on the Snowflake Marketplace without review. |
| FALSE | TRUE | Request a review without publishing automatically after review. |
| FALSE | FALSE | Save your listing as a draft without requesting review or publishing. |

`COMMENT = 'string_literal'`
:   A comment for the listing.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE LISTING | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| Delegated privileges to configure cross-cloud auto-fulfillment. | If the ALTER command is modifying the manifest content for auto-fulfillment. | See [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Listings created using CREATE LISTING … are automatically published. For information about unpublish and publish operations, see [ALTER LISTING](alter-listing.md).

## Examples

Creates a listing named ‘MYLISTING’ with a specific YAML format manifest, and submits it for review and subsequent publication.

For additional examples and use-cases associated with managing listings using SQL, see [Manage listings with SQL as a provider - examples](../../progaccess/listing-progaccess-examples.md).

> **Note:**
>
> This example uses the default values for PUBLISH and REVIEW.

```sqlexample
CREATE EXTERNAL LISTING MYLISTING
SHARE MySHARE AS
$$
title: "MyListing"
subtitle: "Subtitle for MyListing"
description: "Description for MyListing"
listing_terms:
   type: "STANDARD"
targets:
    accounts: ["Org1.Account1"]
usage_examples:
    - title: "this is a test sql"
      description: "Simple example"
      query: "select *"
$$
;
```

Creates a draft listing named ‘MYLISTING’ with a specific YAML format manifest:

```sqlexample
CREATE EXTERNAL LISTING MYLISTING
SHARE MySHARE AS
$$
title: "MyListing"
subtitle: "Subtitle for MyListing"
description: "Description for MyListing"
listing_terms:
  type: "OFFLINE"
targets:
   regions: ["PUBLIC.AWS_US_EAST_1", "PUBLIC.AZURE_WESTUS2"]
usage_examples:
   - title: "this is a test sql"
     description: "Simple example"
     query: "select *"
$$ PUBLISH=FALSE REVIEW=FALSE;
```

Creates a draft listing named ‘MYLISTING’ from a specific stage location. In the following example, the `manifest.yml` file is located in the `listingmanifests` folder in the stage named `listingstage`.

```sqlexample
CREATE EXTERNAL LISTING MYLISTING
SHARE MySHARE FROM @dbforstage.public.listingstage/listingmanifests;
```

---
title: CREATE MAINTENANCE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-maintenance-policy.md
section: SQL Commands
---

# CREATE MAINTENANCE POLICY

Creates a new [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md) in the current or specified schema.

See also:
:   [ALTER MAINTENANCE POLICY](alter-maintenance-policy.md), [DROP MAINTENANCE POLICY](drop-maintenance-policy.md), [SHOW MAINTENANCE POLICIES](show-maintenance-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] MAINTENANCE POLICY [ IF NOT EXISTS ] <name>
  SCHEDULE = 'USING CRON <cron_spec> <timezone>'
  [ COMMENT = '<comment>' ]
```

## Required parameters

`name`
:   Specifies the identifier of the maintenance policy. The identifier must be
    unique within the schema.

`SCHEDULE = 'USING CRON cron_spec timezone`
:   Specifies the schedule for the maintenance policy. This parameter uses the
    same syntax as the `SCHEDULE` parameter of the [CREATE TASK](create-task.md) command.

## Optional parameters

`COMMENT = 'comment'`
:   Specifies an optional comment for the maintenance policy.

## Usage notes

* Each app or account can have only one maintenance policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE MAINTENANCE POLICY | Schema |  |

## Examples

The following example creates a maintenance policy that schedules
upgrades for Saturdays at 2 AM UTC:

```sqlexample
CREATE MAINTENANCE POLICY my_maintenance_policy
  SCHEDULE = 'USING CRON 0 2 * * SAT UTC'
  COMMENT = 'Weekly Saturday maintenance window';
```

---
title: CREATE MANAGED ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/create-managed-account.md
section: SQL Commands
---

# CREATE MANAGED ACCOUNT

Creates a new managed account. Currently used by data providers to create reader accounts for their consumers. For more details, see
[Manage reader accounts](../../user-guide/data-sharing-reader-create.md).

See also:
:   [DROP MANAGED ACCOUNT](drop-managed-account.md) , [SHOW MANAGED ACCOUNTS](show-managed-accounts.md)

## Syntax

```sqlsyntax
CREATE MANAGED ACCOUNT <name>
    ADMIN_NAME = <username> , ADMIN_PASSWORD = <user_password> ,
    TYPE = READER ,
    [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the managed account; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`).

    For more details, see [Identifier requirements](../identifiers-syntax.md).

    > **Important:**
    >
    > The identifier for the managed account is not the same as the account name, which is required to access the account. The account name,
    > also known as the *locator*, is assigned by Snowflake.

`ADMIN_NAME = username`
:   Identifier, as well as login name, for the initial user in the managed account. This user serves as the account administrator for the
    account (i.e. this user is automatically created when the account is created and is assigned the ACCOUNTADMIN role).

    Once the account is created, you will log into the account as this user to configure (i.e. “bootstrap”) the account.

`ADMIN_PASSWORD = user_password`
:   Password for the initial user in the managed account. The password is a string literal that must be enclosed in single or double quotes
    and must conform to the [Snowflake-provided password policy](../../user-guide/password-authentication.md).

`TYPE = READER`
:   Specifies the type of managed account. Currently, the only type supported is `READER` (i.e. reader accounts used for data sharing).

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the managed account.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ACCOUNT | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* By default, the total number of reader accounts a provider can create is 20. If you reach the limit and require creating additional
  accounts, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

  If you dropped a reader account in order to create a new account without exceeding this limit, you cannot create the new reader account for
  7 days, which is the retention period for deleted reader accounts.
* If the command completes successfully, it returns a JSON object containing the account name/locator and the URL for accessing the account.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

```sqlexample
CREATE MANAGED ACCOUNT reader_acct1
    ADMIN_NAME = user1 , ADMIN_PASSWORD = 'Sdfed43da!44' ,
    TYPE = READER;
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| status                                                                                                                                                                            |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| {"accountName":"READER_ACCT1","accountLocator":"IIB88126","url":"https://myorg-reader_acct1.snowflakecomputing.com","accountLocatorUrl":"https://iib88126.snowflakecomputing.com"}|
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: CREATE MASKING POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-masking-policy.md
section: SQL Commands
---

# CREATE MASKING POLICY

Creates a new masking policy in the current/specified schema or replaces an existing masking policy.

After creating a masking policy, apply the masking policy to a column in a table using an [ALTER TABLE … ALTER COLUMN](alter-table-column.md) command or a view using
an [ALTER VIEW](alter-view.md) command.

See also:
:   [Choosing a centralized, hybrid, or decentralized approach](../../user-guide/security-column-intro.md), [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md)

    [Masking policy DDL](../../user-guide/security-column-intro.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] MASKING POLICY [ IF NOT EXISTS ] <name> AS
( <arg_name_to_mask> <arg_type_to_mask> [ , <arg_1> <arg_type_1> ... ] )
RETURNS <arg_type_to_mask> -> <body>
[ COMMENT = '<string_literal>' ]
[ EXEMPT_OTHER_POLICIES = { TRUE | FALSE } ]
```

## Required parameters

`name`
:   Identifier for the masking policy; must be unique for your schema.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`AS ( arg_name_to_mask arg_type_to_mask [ , arg_1 arg_type_1 ... ] )`
:   The signature for the masking policy; specifies the input columns and data types to evaluate at query runtime.

    For more details, see [SQL data types reference](../../sql-reference-data-types.md).

    `arg_name_to_mask arg_type_to_mask`
    :   The first column and its data type always indicate the column data type values to mask or tokenize in the subsequent
        policy conditions.

        Note that you can not specify a virtual column as the first column argument in a conditional masking policy.

    `[ , arg_1 arg_type_1 ... ]`
    :   Specifies the conditional columns and their data types to evaluate to determine whether the policy conditions should mask or tokenize
        the data in the first column in each row of the query result.

        If these additional columns and data types are not specified, Snowflake evaluates the policy as a normal masking policy.

`RETURNS arg_type_to_mask`
:   The return data type must match the input data type of the first column that is specified as an input column.

`body`
:   SQL expression that transforms the data in the column designated by `arg_name_to_mask`.

    The expression can include [Conditional expression functions](../expressions-conditional.md) to represent conditional logic, built-in functions, or UDFs to
    transform the data.

    If a UDF or external function is used inside the masking policy body, the policy owner must have USAGE on the UDF or external function.
    The USAGE privilege on the UDF or external function is not required for the role used to query a column that has a masking policy applied
    to it.

    If a UDF or external function is used inside the conditional masking policy body, the policy owner must have OWNERSHIP on the UDF or
    external function. Users querying a column that has a conditional masking policy applied to it do not need to have USAGE on the UDF or
    external function.

## Optional parameters

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the masking policy.

`EXEMPT_OTHER_POLICIES = TRUE | FALSE`
:   One of the following depending on the usage:

    * Specifies whether a row access policy or conditional masking policy can reference a column that is already protected by this masking
      policy.
    * Specifies whether a masking policy assigned to a virtual column overrides the masking policy that the virtual column inherits from the
      VALUE column. When working with external tables, specify this property in the masking policy that protects the VALUE column.

    `TRUE`
    :   Allows a different policy to reference the masked column or allows the masking policy set on a virtual column to override the masking
        policy the virtual column inherits from the VALUE column in an external table.

    `FALSE`
    :   Does not allow a different policy to reference the masked column or allow the masking policy and does not allow the masking policy the virtual column inherits from the VALUE column in an external table.

    Note the following:

    * The value of this property in the masking policy cannot change after setting the masking policy on a table or view. To update
      the value of this property setting, execute a CREATE OR REPLACE MASKING POLICY statement on the masking policy.
    * When the property is set to true it is included in the output of calling the [GET_DDL](../functions/get_ddl.md) function on the
      policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE MASKING POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

When specifying the `EXEMPT_OTHER_POLICIES` property in a masking policy, the role that owns the masking policy
(i.e. the role with OWNERSHIP privilege on the policy) must be in the role hierarchy of the role that owns the row access
policy or the conditional masking policy.

For example, the policy administrator custom roles can form a [role hierarchy](../../user-guide/security-access-control-overview.md) as
follows:

> `masking_admin` » `rap_admin` » SYSADMIN
>
> `masking_admin` » `cond_masking_admin` » SYSADMIN

Where:

`masking_admin`
:   Specifies the custom role that owns the masking policy that is set on the column that will be specified in the signature of a row access
    policy or a conditional masking policy.

`rap_admin`
:   Specifies the custom role that owns the row access policy.

`cond_masking_admin`
:   Specifies the custom role that owns the conditional masking policy.

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* If you want to replace an existing masking policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE MASKING POLICY](desc-masking-policy.md) command.
* For masking policies that include a subquery in the masking policy body, use [EXISTS](../operators-subquery.md) in the
  WHEN branch of the CASE function. For a representative example, refer to the custom entitlement table example in the
  Normal Masking Policy section (in this topic).
* If the policy `body` contains a mapping table lookup, create a centralized mapping table and store the mapping table
  in the same database as the protected table. This is particularly important if the `body` calls the
  [IS_DATABASE_ROLE_IN_SESSION](../functions/is_database_role_in_session.md) function. For details, see the function usage notes.
* A given table or view column can be specified in either a masking policy signature or a row access policy signature. In other words, the
  same column cannot be specified in both a masking policy signature and a row access policy signature at the same time.

  For more information, see [CREATE ROW ACCESS POLICY](create-row-access-policy.md).
* A data sharing provider cannot create a masking policy in a [reader account](../../user-guide/data-sharing-reader-create.md).
* If using a [UDF](../../developer-guide/udf/udf-overview.md) in a masking policy, ensure the data type of the column, UDF, and masking
  policy match. For more information, see [User-defined functions in a masking policy](../../user-guide/security-column-intro.md).
* If you specify the [CURRENT_DATABASE](../functions/current_database.md) or [CURRENT_SCHEMA](../functions/current_schema.md) function in the
  body of a masking or row access policy, the function returns the database or schema that contains the protected table, not the database or
  schema in use for the session.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Example: Normal masking policy

You can use [Conditional expression functions](../expressions-conditional.md), [Context functions](../functions-context.md), and UDFs to write the SQL expression.

The following are representative examples of the policy body to show how to create masking policy conditions using different SQL
expressions, functions, and data types.

These examples mostly use the [CURRENT_ROLE](../functions/current_role.md) context function. If role activation and role hierarchy is
necessary in the policy conditions, use [IS_ROLE_IN_SESSION](../functions/is_role_in_session.md).

Full mask:

> The `analyst` custom role can see the plain-text value. Users without the `analyst` custom role see a full mask.
>
> ```sqlexample
> CREATE OR REPLACE MASKING POLICY email_mask AS (val string) returns string ->
>   CASE
>     WHEN current_role() IN ('ANALYST') THEN VAL
>     ELSE '*********'
>   END;
> ```

Allow a production [account](../../user-guide/admin-account-identifier.md) to see unmasked values and all other accounts
(e.g. development, test) to see masked values.

> ```sqlexample
> case
>   when current_account() in ('<prod_account_identifier>') then val
>   else '*********'
> end;
> ```

Return NULL for unauthorized users:

> ```sqlexample
> case
>   when current_role() IN ('ANALYST') then val
>   else NULL
> end;
> ```

Return a static masked value for unauthorized users:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE '********'
> END;
> ```

Return a hash value using [SHA2 , SHA2_HEX](../functions/sha2.md) for unauthorized users. Using a hashing function in a masking policy may result in collisions; therefore, exercise caution with this approach. For more information, see [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md).

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE sha2(val) -- return hash of the column value
> END;
> ```

Apply a partial mask or full mask:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   WHEN current_role() IN ('SUPPORT') THEN regexp_replace(val,'.+\@','*****@') -- leave email domain unmasked
>   ELSE '********'
> END;
> ```

Using timestamps.

> ```sqlexample
> case
>   WHEN current_role() in ('SUPPORT') THEN val
>   else date_from_parts(0001, 01, 01)::timestamp_ntz -- returns 0001-01-01 00:00:00.000
> end;
> ```
>
> > **Important:**
> >
> > Currently, Snowflake does not support different input and output data types in a masking policy, such as defining the masking policy to target a timestamp and return a string (e.g. `***MASKED***`); the input and output data types must match.
> >
> > A workaround is to cast the actual timestamp value with a fabricated timestamp value. For more information, see [DATE_FROM_PARTS](../functions/date_from_parts.md) and [CAST , ::](../functions/cast.md).

Using a UDF:

> ```sqlexample
> CASE
>   WHEN current_role() IN ('ANALYST') THEN val
>   ELSE mask_udf(val) -- custom masking function
> END;
> ```

On variant data:

> ```sqlexample
> CASE
>    WHEN current_role() IN ('ANALYST') THEN val
>    ELSE OBJECT_INSERT(val, 'USER_IPADDRESS', '****', true)
> END;
> ```

Using a custom entitlement table. Note the use of [EXISTS](../operators-subquery.md) in the WHEN clause. Always use EXISTS when including a subquery in the masking policy body. For more information on subqueries that Snowflake supports, see [Working with Subqueries](../../user-guide/querying-subqueries.md).

> ```sqlexample
> CASE
>   WHEN EXISTS
>     (SELECT role FROM <db>.<schema>.entitlement WHERE mask_method='unmask' AND role = current_role()) THEN val
>   ELSE '********'
> END;
> ```

Using [DECRYPT](../functions/decrypt.md) on previously encrypted data with either [ENCRYPT](../functions/encrypt.md) or [ENCRYPT_RAW](../functions/encrypt_raw.md), with a passphrase on the encrypted data:

> ```sqlexample
> case
>   when current_role() in ('ANALYST') then DECRYPT(val, $passphrase)
>   else val -- shows encrypted value
> end;
> ```

Using a [<JavaScript UDF](../../developer-guide/udf/javascript/udf-javascript-introduction.md) on JSON (VARIANT):

> In this example, a JavaScript UDF masks location data in a JSON string. It is important to set the data type as VARIANT in the UDF and
> the masking policy. If the data type in the table column, UDF, and masking policy signature do not match, Snowflake returns an error
> message because it cannot resolve the SQL.
>
> ```sqlexample
> -- Flatten the JSON data
>
> create or replace table <table_name> (v variant) as
> select value::variant
> from @<table_name>,
>   table(flatten(input => parse_json($1):stationLocation));
>
> -- JavaScript UDF to mask latitude, longitude, and location data
>
> CREATE OR REPLACE FUNCTION full_location_masking(v variant)
>   RETURNS variant
>   LANGUAGE JAVASCRIPT
>   AS
>   $$
>     if ("latitude" in V) {
>       V["latitude"] = "**latitudeMask**";
>     }
>     if ("longitude" in V) {
>       V["longitude"] = "**longitudeMask**";
>     }
>     if ("location" in V) {
>       V["location"] = "**locationMask**";
>     }
>
>     return V;
>   $$;
>
>   -- Grant UDF usage to ACCOUNTADMIN
>
>   grant ownership on function FULL_LOCATION_MASKING(variant) to role accountadmin;
>
>   -- Create a masking policy using JavaScript UDF
>
>   create or replace masking policy json_location_mask as (val variant) returns variant ->
>     CASE
>       WHEN current_role() IN ('ANALYST') THEN val
>       else full_location_masking(val)
>       -- else object_insert(val, 'latitude', '**locationMask**', true) -- limited to one value at a time
>     END;
> ```

Using the [GEOGRAPHY](../data-types-geospatial.md) data type:

> In this example, a masking policy uses the [TO_GEOGRAPHY](../functions/to_geography.md) function to convert all GEOGRAPHY data in a
> column to a fixed point, the longitude and latitude for Snowflake in San Mateo, California, for users whose CURRENT_ROLE is not
> `ANALYST`.
>
> > ```sqlexample
> > create masking policy mask_geo_point as (val geography) returns geography ->
> >   case
> >     when current_role() IN ('ANALYST') then val
> >     else to_geography('POINT(-122.35 37.55)')
> >   end;
> > ```
>
> Set the masking policy on a column with the GEOGRAPHY data type and set the [GEOGRAPHY_OUTPUT_FORMAT](../parameters.md) value for the session to
> `GeoJSON`:
>
> > ```sqlexample
> > alter table mydb.myschema.geography modify column b set masking policy mask_geo_point;
> > alter session set geography_output_format = 'GeoJSON';
> > use role public;
> > select * from mydb.myschema.geography;
> > ```
>
> Snowflake returns the following:
>
> > ```sqlexample
> > ---+--------------------+
> >  A |         B          |
> > ---+--------------------+
> >  1 | {                  |
> >    |   "coordinates": [ |
> >    |     -122.35,       |
> >    |     37.55          |
> >    |   ],               |
> >    |   "type": "Point"  |
> >    | }                  |
> >  2 | {                  |
> >    |   "coordinates": [ |
> >    |     -122.35,       |
> >    |     37.55          |
> >    |   ],               |
> >    |   "type": "Point"  |
> >    | }                  |
> > ---+--------------------+
> > ```
>
> The query result values in column B depend on the GEOGRAPHY_OUTPUT_FORMAT parameter value for the session. For example, if the parameter
> value is set to `WKT`, Snowflake returns the following:
>
> > ```sqlexample
> > alter session set geography_output_format = 'WKT';
> > select * from mydb.myschema.geography;
> >
> > ---+----------------------+
> >  A |         B            |
> > ---+----------------------+
> >  1 | POINT(-122.35 37.55) |
> >  2 | POINT(-122.35 37.55) |
> > ---+----------------------+
> > ```

For examples using other context functions and role hierarchy, see [Advanced Column-level Security topics](../../user-guide/security-column-advanced.md).

## Example: Conditional masking policy

The following example returns unmasked data for users whose [CURRENT_ROLE](../functions/current_role.md) is the `admin` custom role, or
whose value in the visibility column is `Public`. All other conditions result in a fixed masked value.

> ```sqlexample
> -- Conditional Masking
>
> create masking policy email_visibility as
> (email varchar, visibility string) returns varchar ->
>   case
>     when current_role() = 'ADMIN' then email
>     when visibility = 'Public' then email
>     else '***MASKED***'
>   end;
> ```

The following example returns detokenized data for users whose [CURRENT_ROLE](../functions/current_role.md) is the `admin` custom role,
and whose value in a different column is `Public`. All other conditions result in a tokenized value.

> ```sqlexample
> -- Conditional Tokenization
>
> create masking policy de_email_visibility as
>  (email varchar, visibility string) returns varchar ->
>    case
>      when current_role() = 'ADMIN' and visibility = 'Public' then de_email(email)
>      else email -- sees tokenized data
>    end;
> ```

## Example: Allow a masked column in a row access policy or conditional masking policy

Replace a masking policy that either allows viewing the email address, viewing only the email address domain, or a viewing fixed masked
value:

> ```sqlexample
> create or replace masking policy governance.policies.email_mask
> as (val string) returns string ->
> case
>   when current_role() in ('ANALYST') then val
>   when current_role() in ('SUPPORT') then regexp_replace(val,'.+\@','*****@')
>   else '********'
> end
> comment = 'specify in row access policy'
> exempt_other_policies = true
> ;
> ```

This policy can now be set on a column and a row access policy or a conditional masking policy can reference the column protected by this
masking policy as needed.

---
title: CREATE MATERIALIZED VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/create-materialized-view.md
section: SQL Commands
---

# CREATE MATERIALIZED VIEW

Creates a new materialized view in the current/specified schema, based on a query of an existing table, and populates the view with data.

For more details, see [Working with Materialized Views](../../user-guide/views-materialized.md).

See also:
:   [ALTER MATERIALIZED VIEW](alter-materialized-view.md) , [DROP MATERIALIZED VIEW](drop-materialized-view.md) , [SHOW MATERIALIZED VIEWS](show-materialized-views.md) , [DESCRIBE MATERIALIZED VIEW](desc-materialized-view.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] [ INTERACTIVE ] MATERIALIZED VIEW [ IF NOT EXISTS ] <name>
  [ COPY GRANTS ]
  ( <column_list> )
  [ <col1> [ WITH ] MASKING POLICY <policy_name> [ USING ( <col1> , <cond_col1> , ... ) ]
           [ WITH ] PROJECTION POLICY <policy_name>
           [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ , <col2> [ ... ] ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> [ ENTITY KEY ( <col_name> [ , <col_name> ... ] ) ] ]
  [ CLUSTER BY ( <expr1> [, <expr2> ... ] ) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  AS <select_statement>
```

## Required parameters

`name`
:   Specifies the identifier for the view; must be unique for the schema in which the view is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`select_statement`
:   Specifies the query used to create the view. This query serves as the text/definition for the view. This query is displayed in the output
    of [SHOW VIEWS](show-views.md) and [SHOW MATERIALIZED VIEWS](show-materialized-views.md).

    There are limitations on the `select_statement`. For details, see:

    * Usage notes.
    * [Limitations on Creating Materialized Views](../../user-guide/views-materialized.md).

## Optional parameters

`column_list`
:   If you do not want the column names in the view to be the same as the column names of the underlying table, you may include a column list in
    which you specify the column names. (You do not need to specify the data types of the columns.)

    If you include a CLUSTER BY clause for the materialized view, then you
    must include the column name list.

`MASKING POLICY = policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

`USING ( col_name , cond_col_1 ... )`
:   Specifies the arguments to pass into the conditional masking policy SQL expression.

    The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
    column to which the masking policy is set.

    The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query result
    when a query is made on the first column.

    If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
    [masking policy](../../user-guide/security-column-intro.md).

`PROJECTION POLICY policy_name`
:   Specifies the [projection policy](../../user-guide/projection-policies.md) to set on a column.

`string_literal`
:   Specifies a comment for the view. The string literal should be in single quotes. (The string literal should not contain single
    quotes unless they are escaped.)

    Default: No value.

`INTERACTIVE`
:   Creates an interactive materialized view, which is optimized for low-latency queries on [interactive tables](../../user-guide/interactive.md).
    An interactive materialized view must be based on a single interactive table. After you create the interactive materialized view,
    you must add both the materialized view and its underlying base table to the interactive warehouse.

    For more information, see [Materialized view support for interactive tables](../../user-guide/interactive.md).

    Default: No value (creates a standard materialized view)

`SECURE`
:   Specifies that the view is secure. For more information about secure views, see [Working with Secure Views](../../user-guide/views-secure.md).

    Default: No value (view is not secure)

`COPY GRANTS`
:   If you are replacing an existing view by using the `OR REPLACE` clause, then the replacement view retains the access permissions
    from the original view. This parameter copies all privileges, except OWNERSHIP, from the existing view to the new view. The
    new view does not inherit any future grants defined for the object type in the schema. By default, the role that executes
    the CREATE MATERIALIZED VIEW statement owns the new view.

    If the parameter is not included in the CREATE VIEW statement, then the new view does not inherit any explicit access privileges
    granted on the original view but does inherit any future grants defined for the object type in the schema.

    Note that the operation to copy grants occurs atomically with the CREATE VIEW statement (i.e. within the same transaction).

    Default: No value (grants are not copied).

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on the materialized view.

`AGGREGATION POLICY policy_name`
:   Specifies the [aggregation policy](../../user-guide/aggregation-policies.md) to set on the materialized view.

`expr#`
:   Specifies an expression on which to cluster the materialized view. Typically, each expression is the name of a column in the
    materialized view.

    For more information about clustering materialized views, see [Materialized Views and Clustering](../../user-guide/views-materialized.md). For more
    information about clustering in general, see [What is Data Clustering?](../../user-guide/tables-clustering-micropartitions.md).

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Usage notes

* Creating a materialized view requires CREATE MATERIALIZED VIEW privilege on the schema, and SELECT privilege on
  the base table. For more information about privileges and materialized views, see [Privileges on a Materialized View’s Schema](../../user-guide/views-materialized.md).
* If you specify the [CURRENT_DATABASE](../functions/current_database.md) or [CURRENT_SCHEMA](../functions/current_schema.md) function in the
  definition of the view, the function returns the database or schema that contains the view, not the database or schema in
  use for the session.
* When you choose a name for the materialized view, note that a schema cannot contain a table and view with the same name. CREATE
  [ MATERIALIZED ] VIEW produces an error if a table with the same name already exists in the schema.
* When specifying the `select_statement`, note the following:

  + You cannot specify a HAVING clause or an ORDER BY clause.
  + If you include a CLUSTER BY clause for the materialized view, you must include the `column_list` clause.
  + If you refer to the base table more than once in the `select_statement`, use the same
    [qualifier](../identifiers.md) for all references for the base table.

    For example, don’t use a mix of `base_table`, `schema.base_table`, and `database.schema.base_table` in the
    same `select_statement`. Instead, choose one of these forms (e.g. `database.schema.base_table`), and use this
    consistently throughout the `select_statement`.
  + Do not query stream objects in the SELECT statement. Streams are not designed to serve as source objects for views or materialized
    views.
* Some column names are not allowed in materialized views. If a column name is not allowed, you can define an alias for the
  column. For details, see [Handling Column Names That Are Not Allowed in Materialized Views](../../user-guide/views-materialized.md).
* If the materialized view queries external tables, you must refresh the file-level metadata
  for the external tables to reflect changes in the referenced cloud storage location, including
  new, updated, and removed files.

  You can refresh the metadata for an external table
  [automatically](../../user-guide/tables-external-auto.md) using the event notification service
  for your cloud storage service or manually using
  [ALTER EXTERNAL TABLE … REFRESH](alter-external-table.md) statements.
* Materialized views have a number of other restrictions. For details, see
  [Limitations on Creating Materialized Views](../../user-guide/views-materialized.md) and [Limitations on Working With Materialized Views](../../user-guide/views-materialized.md).
* When you create an interactive materialized view (using the INTERACTIVE keyword):

  + The materialized view must be based on a single [interactive table](../../user-guide/interactive.md).
    You can’t create an interactive materialized view based on a standard table.
  + Joins aren’t supported in interactive materialized views, just like standard materialized views.
  + After creating the interactive materialized view, you must add both the materialized view
    **and** its underlying base table to the interactive warehouse using
    [ALTER WAREHOUSE … ADD TABLES](alter-warehouse.md).
  + You can’t use an interactive materialized view as the source for another materialized view
    or a dynamic table.

  For more information, see [Materialized view support for interactive tables](../../user-guide/interactive.md).
* View definitions are not updated if the schema of the underlying source table is changed so that the view definition becomes
  invalid. For example:

  + A view is created from a base table, and a column is subsequently dropped from that base table.
  + The base table for the materialized view is dropped.

  In these scenarios, querying the view returns an error that includes the reason why the view was invalidated. For example:

  ```output
  Failure during expansion of view 'MV1':
    SQL compilation error: Materialized View MV1 is invalid.
    Invalidation reason: DDL Statement was executed on the base table 'MY_INVENTORY'.
    Marked Materialized View as invalid.
  ```

  When this occurs, you can do the following:

  + If the base table has been dropped and this is within the
    [data retention period for Time Travel](../../user-guide/data-time-travel.md), you can
    [undrop the base table](undrop-table.md) to make the materialized view valid again.
  + Use the CREATE OR REPLACE MATERIALIZED VIEW command to recreate the view.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* Using `OR REPLACE` is the equivalent of using [DROP MATERIALIZED VIEW](drop-materialized-view.md) on the existing materialized view and then creating a
  new view with the same name.

  CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

  This means that any queries concurrent with the CREATE OR REPLACE MATERIALIZED VIEW operation use either the old or new materialized
  view version.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* When creating a materialized view with a masking policy on one or more materialized view columns, or a row access policy added to the
  materialized view, use the [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a
  masking policy and the materialized view protected by a row access policy.

## Examples

Create a materialized view in the current schema, with a comment, that selects all the rows from a table:

> ```sqlexample
> CREATE MATERIALIZED VIEW mymv
>     COMMENT='Test view'
>     AS
>     SELECT col1, col2 FROM mytable;
> ```

Create an interactive materialized view based on an interactive table, then add both the materialized view and its
base table to an interactive warehouse:

> ```sqlexample
> CREATE INTERACTIVE MATERIALIZED VIEW IF NOT EXISTS mv_summary
>     AS
>     SELECT SUM(quantity) AS total_quantity, SUM(net_paid) AS total_net_paid
>     FROM my_interactive_table
>     WHERE call_center_id = 52;
>
> ALTER WAREHOUSE interactive_wh ADD TABLES (mv_summary, my_interactive_table);
> ```

For more examples, see the examples in [Working with Materialized Views](../../user-guide/views-materialized.md).

---
title: CREATE MCP SERVER
source: https://docs.snowflake.com/en/sql-reference/sql/create-mcp-server.md
section: SQL Commands
---

# CREATE MCP SERVER

Creates a new MCP (Model Context Protocol) server or replaces an existing MCP server.

See also:
:   [DESCRIBE MCP SERVER](desc-mcp-server.md), [DROP MCP SERVER](drop-mcp-server.md) , [SHOW MCP SERVERS](show-mcp-servers.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] MCP SERVER [ IF NOT EXISTS ] <name>
  FROM SPECIFICATION $$<specification_yaml>$$
```

## Parameters

`name`
:   String that specifies the identifier for the MCP server; must be unique for the schema in which the MCP server is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FROM SPECIFICATION $$specification_yaml$$`
:   Specifies the YAML specification defining the tools and configuration for the MCP server.

    The specification must include a `tools` array with one or more tool definitions. Each tool must specify:

    * `name`: Unique identifier for the tool
    * `type`: Tool type (see supported tool types)
    * `title`: Human-readable title for the tool
    * `description`: Description of what the tool does

    **Supported tool types:**

    * `CORTEX_SEARCH_SERVICE_QUERY`: Cortex Search Service tool
    * `CORTEX_ANALYST_MESSAGE`: Cortex Analyst tool
    * `SYSTEM_EXECUTE_SQL`: SQL execution tool
    * `CORTEX_AGENT_RUN`: Cortex Agent tool
    * `GENERIC`: Custom tool for UDFs and stored procedures

    **Tool-specific requirements:**

    For `CORTEX_SEARCH_SERVICE_QUERY`, `CORTEX_ANALYST_MESSAGE`, and `CORTEX_AGENT_RUN` tools:

    * `identifier`: Fully qualified name of the underlying object (for example, `database.schema.object_name`)

    For `GENERIC` tools:

    * `identifier`: Fully qualified name of the UDF or stored procedure
    * `config`: Configuration object specifying:

      + `type`: Either `function` (for UDF) or `procedure` (for stored procedure)
      + `warehouse`: Warehouse to use for execution
      + `input_schema`: JSON schema defining the function/procedure parameters

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| CREATE MCP SERVER | Schema |
| USAGE | Schema |

For tools that reference other objects, additional privileges are required:

| Privilege | Object |
| --- | --- |
| USAGE | Cortex Search Service (for CORTEX_SEARCH_SERVICE_QUERY tools) |
| SELECT | Semantic View (for CORTEX_ANALYST_MESSAGE tools) |
| USAGE | Cortex Agent (for CORTEX_AGENT_RUN tools) |
| USAGE | User-defined function or stored procedure (for GENERIC tools) |
| USAGE | Warehouse (for GENERIC tools) |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
* When configuring hostnames for MCP server connections, use hyphens (`-`) instead of underscores (`_`). MCP servers have connection issues with hostnames containing underscores.
* The MCP server specification is stored as metadata and can be viewed using [DESCRIBE MCP SERVER](desc-mcp-server.md).
* Multiple tools can be defined in a single MCP server specification.
* Tool names must be unique within a single MCP server.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

**Example 1: Create MCP server with Cortex Search and Analyst tools**

```sqlexample-yaml
CREATE MCP SERVER my_mcp_server
  FROM SPECIFICATION $$
    tools:
      - name: "product-search"
        type: "CORTEX_SEARCH_SERVICE_QUERY"
        identifier: "database1.schema1.cortex_search_service1"
        description: "Cortex search service for all products"
        title: "Product Search"

      - name: "revenue-semantic-view"
        type: "CORTEX_ANALYST_MESSAGE"
        identifier: "database1.schema1.semantic_view_1"
        description: "Semantic view for all revenue tables"
        title: "Semantic view for revenue"
  $$;
```

**Example 2: Create MCP server with SQL execution tool**

```sqlexample-yaml
CREATE MCP SERVER sql_exec_server
  FROM SPECIFICATION $$
    tools:
      - title: "SQL Execution Tool"
        name: "sql_exec_tool"
        type: "SYSTEM_EXECUTE_SQL"
        description: "A tool to execute SQL queries against the connected Snowflake database."
  $$;
```

**Example 3: Create MCP server with custom UDF tool**

```sqlexample-yaml
CREATE MCP SERVER custom_tools_server
  FROM SPECIFICATION $$
    tools:
      - title: "Multiply by Ten"
        identifier: "example_database.agents.multiply_by_ten"
        name: "multiply_by_ten"
        type: "GENERIC"
        description: "Multiplies input value by ten and returns the result."
        config:
          type: "function"
          warehouse: "compute_service_warehouse"
          input_schema:
            type: "object"
            properties:
              x:
                description: "A number to be multiplied by ten"
                type: "number"
  $$;
```

**Example 4: Create MCP server with custom stored procedure tool**

```sqlexample-yaml
CREATE MCP SERVER procedure_tools_server
  FROM SPECIFICATION $$
    tools:
      - title: "Calculate Values"
        identifier: "example_database.agents.calculate_values_sp"
        name: "calculate_values_sp"
        type: "GENERIC"
        description: "Calculates the product and sum of two numbers and returns them in a JSON object."
        config:
          type: "procedure"
          warehouse: "compute_service_warehouse"
          input_schema:
            type: "object"
            properties:
              x:
                description: "First number"
                type: "number"
              y:
                description: "Second number"
                type: "number"
  $$;
```

**Example 5: Create MCP server with Agent tool**

```sqlexample-yaml
CREATE MCP SERVER agent_server
  FROM SPECIFICATION $$
    tools:
      - title: "Customer Service Agent"
        name: "customer_agent"
        type: "CORTEX_AGENT_RUN"
        identifier: "support_db.agents_schema.customer_service_agent"
        description: "Agent that handles customer service inquiries"
  $$;
```

---
title: CREATE MODEL
source: https://docs.snowflake.com/en/sql-reference/sql/create-model.md
section: SQL Commands
---

# CREATE MODEL

Creates a new machine learning model in the current/specified schema or replaces an existing model.

> **Note:**
>
> Use the [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md) Python API
> to create models from scratch. In SQL, you can only create models from other models.

Models are versioned. All models must have at least one version, and one version must be designated as the default. To add
a version to a model, use [ALTER MODEL … ADD VERSION](alter-model-add-version.md).

Some properties of a model can be modified (see [ALTER MODEL](alter-model.md)), and multiple versions can be added.

This command also supports the following variant:

* CREATE MODEL … FROM internalStage (creates a model from files in an external stage)

See also:
:   [ALTER MODEL](alter-model.md) , [ALTER MODEL … ADD VERSION](alter-model-add-version.md) , [DROP MODEL](drop-model.md) , [SHOW MODELS](show-models.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] MODEL [ IF NOT EXISTS ] <name> [ WITH VERSION <version_name> ]
    FROM MODEL <source_model_name> [ VERSION <source_version_or_alias_name> ]
```

## Variant Syntax

This variant is used by the [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md) Python API.
It is not possible to create models from scratch in SQL.

```sqlsyntax
CREATE [ OR REPLACE ] MODEL [ IF NOT EXISTS ] <name> [ WITH VERSION <version_name> ]
  FROM internalStage
```

Where:

```sqlsyntax
internalStage ::=
    @[<namespace>.]<int_stage_name>[/<path>]
  | @[<namespace>.]%<table_name>[/<path>]
  | @~[/<path>]
```

For additional internal stage details, see [Choosing an internal stage for local files](../../user-guide/data-load-local-file-system-create-stage.md).

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the new model; must be unique for the schema in which the model
    is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FROM MODEL source_model_name`
:   Required if not using FROM internalStage variant
    :   Specifies the name of the model from which to create the new model.

`FROM internalStage`
:   Required if using FROM internalStage variant
    :   Specifies the internal stage that contains the model’s files. The required layout of these files is not currently
        documented.

## Optional parameters

`WITH VERSION version_name`
:   For use with FROM MODEL variant
    :   Specifies the name of the version to create in the new model.

`VERSION source_version_or_alias_name`
:   For use with FROM MODEL variant
    :   Specifies the name or alias of the version to be copied from the source model. If not specified, uses the default version
        from the source model.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE MODEL | Schema | Implied by OWNERSHIP on schema |
| OWNERSHIP | Model | A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object that already exists in the schema. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

---
title: CREATE MODEL MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/create-model-monitor.md
section: SQL Commands
---

# CREATE MODEL MONITOR

Create or replace a [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md) in the current or specified schema.

See also:
:   [ALTER MODEL MONITOR](alter-model-monitor.md),
    [SHOW MODEL MONITORS](show-model-monitors.md),
    [DESCRIBE MODEL MONITOR](desc-model-monitor.md),
    [DROP MODEL MONITOR](drop-model-monitor.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] MODEL MONITOR [ IF NOT EXISTS ] <monitor_name> WITH
    MODEL = <model_name>
    VERSION = '<version_name>'
    FUNCTION = '<function_name>'
    SOURCE = <source_name>
    WAREHOUSE = <warehouse_name>
    REFRESH_INTERVAL = '<num> { seconds | minutes | hours | days }'
    AGGREGATION_WINDOW = '<num> days'
    TIMESTAMP_COLUMN = <timestamp_name>
    [ BASELINE = <baseline_name> ]
    [ ID_COLUMNS = <id_column_name_array> ]
    [ PREDICTION_CLASS_COLUMNS = <prediction_class_column_name_array> ]
    [ PREDICTION_SCORE_COLUMNS = <prediction_column-name_array> ]
    [ ACTUAL_CLASS_COLUMNS = <actual_class_column_name_array> ]
    [ ACTUAL_SCORE_COLUMNS = <actual_column_name_array> ]
    [ SEGMENT_COLUMNS = <segment_column_name_array> ]
    [ CUSTOM_METRIC_COLUMNS = <custom_metric_column_name_array> ]
```

## Required parameters

`monitor_name`
:   Specifies the identifier for the model monitor; must be unique in the schema where the monitor is created,
    and must be in the same schema as the model being monitored.

    If the monitor identifier is not fully qualified (in the form of `db_name.schema_name.name` or
    `schema_name.name`), the command creates the model in the current schema for the session.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`MODEL = model_name`
:   The name of the model to be monitored. Must be in the same schema where the monitor is created.

`VERSION = 'version_name'`
:   Name of the model version to be monitored.

`FUNCTION = function_name`
:   Name of the specific function in the model version to be monitored.

`SOURCE = source_name`
:   Name of the source table or view that contains the feature, inferences and ground truth labels.

`WAREHOUSE = warehouse_name`
:   The name of the Snowflake warehouse to use for the monitor’s internal compute operations.

`REFRESH_INTERVAL = 'num { seconds | minutes | hours | days }'`
:   The interval at which the monitor refreshes its internal state. The value must be a string representing a time period,
    such as `'1 day'`. The minimum refresh interval is `'60 seconds'`. Supported units include seconds, minutes, hours, and days.
    You may use singular (“hour”) or plural (“hours”) for the interval name.

`AGGREGATION_WINDOW = 'num days'`
:   The window over which the monitor aggregates data. The value must be a string representing a time period, such as `'1
    day'`. Only days are supported. You may use singular (“day”) or plural (“days”) for the interval name.

`TIMESTAMP_COLUMN = timestamp_name`
:   Name of the column in the source data that contains the timestamps. Must be of type TIMESTAMP_NTZ.

## Optional parameters

`BASELINE = baseline_name`
:   Name of the baseline table that contains a snapshot of data similar to SOURCE, which is used to compute drift.
    A snapshot of this data is embedded within the monitor object. Although this parameter is optional, if is not set, the
    monitor cannot detect drift.

`ID_COLUMNS = id_column_name_array`
:   An array of string column names that, together, uniquely identify each row in the source data. See [ARRAY constants](../data-types-semistructured.md).

> **Note:**
>
> At least one prediction column (either a prediction score or a prediction class) is mandatory.
>
> * For binary classification models: Predictions can be either scores or classes; actuals must be classes.
> * For multi-class classification models: Predictions and actuals must be classes.
> * For regression models: Both predictions and actuals must be numbers.

`PREDICTION_CLASS_COLUMNS = prediction_class_column_name_array`
:   An array of strings naming all prediction class columns in the data source. See [ARRAY constants](../data-types-semistructured.md).
    If the model task is `TABULAR_BINARY_CLASSIFICATION` or `TABULAR_REGRESSION`, the columns must be of type NUMBER.
    If the model task is `TABULAR_MULTI_CLASSIFICATION`, the columns must be of type STRING.

`PREDICTION_SCORE_COLUMNS = prediction_column_name_array`
:   An array of strings naming all prediction score columns in the data source. See [ARRAY constants](../data-types-semistructured.md).
    Columns must be of type NUMBER.

`ACTUAL_CLASS_COLUMNS = actual_class_column_name_array`
:   An array of strings naming all actual class columns in the data source. See [ARRAY constants](../data-types-semistructured.md).
    If the model task is `TABULAR_BINARY_CLASSIFICATION` or `TABULAR_REGRESSION`, the columns must be of type NUMBER.
    If the model task is `TABULAR_MULTI_CLASSIFICATION`, the columns must be of type STRING.

`ACTUAL_SCORE_COLUMNS = actual_column_name_array`
:   An array of strings naming all actual score columns in the data source. See [ARRAY constants](../data-types-semistructured.md).
    Columns must be of type NUMBER.

`SEGMENT_COLUMNS = segment_column_name_array`
:   An array of strings naming all segment columns in the data source. See [ARRAY constants](../data-types-semistructured.md).
    Segment columns must be of type STRING in source data.
    You can specify up to 5 segment columns per monitor. Each segment column should have fewer than 25 unique values for optimal performance.
    For more information about segments, see [ML Observability: Monitoring model behavior over time](../../developer-guide/snowflake-ml/model-registry/model-observability.md).

`CUSTOM_METRIC_COLUMNS = custom_metric_column_name_array`
:   An array of strings naming columns in the source data that are used for custom metrics. These columns are not treated as features. See [ARRAY constants](../data-types-semistructured.md).
    Columns must be of type NUMBER.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Model monitor | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| CREATE MODEL MONITOR | Schema |  |
| SELECT | Table or view specified by the SOURCE parameter |  |
| USAGE | Warehouse specified by the WAREHOUSE parameter |  |
| USAGE | Model specified by the MODEL parameter |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The following requirements apply to the parameters:

  + Model task must be `tabular_binary_classification` or `tabular_regression`.
  + Multiple-output models are not currently supported. Although the prediction and actual columns are arrays, the arrays
    must have only one element.
  + At least one of the prediction columns must be specified.
  + Actual columns are optional, but accuracy metrics are not computed if they are not specified.
  + A column may be specified once across all parameters (for example, an ID column cannot also be a prediction column).
* The number of monitored features is limited to 500.
* Segment column requirements:

  + Segment columns must be of type STRING.
  + A maximum of 5 segment columns per monitor (hard limit).
  + Each segment column should have fewer than 25 unique values (recommended limit).
  + Segment values are case sensitive and special characters are not supported for segment queries.
* The basic configuration of MODEL MONITOR instances, including the model it monitors and data sources it uses, cannot be
  changed after the monitor is created. You can modify only a few options using
  [ALTER MODEL MONITOR](alter-model-monitor.md). To change a monitor’s configuration, drop the instance and create a new
  one.
* [Replication](../../user-guide/account-replication-intro.md) is supported only for instances
  of the [CUSTOM_CLASSIFIER](../classes/custom_classifier.md) class.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

**Basic example**

Create a model monitor that refreshes daily and uses single prediction and actual score columns.

```sqlexample
CREATE MODEL MONITOR my_monitor WITH
    MODEL = my_model
    VERSION = 'v1'
    FUNCTION = 'predict'
    SOURCE = mydb.myschema.scoring_data
    WAREHOUSE = compute_wh
    REFRESH_INTERVAL = '1 day'
    AGGREGATION_WINDOW = '1 day'
    TIMESTAMP_COLUMN = event_time
    PREDICTION_SCORE_COLUMNS = ( 'prediction_score' )
    ACTUAL_SCORE_COLUMNS = ( 'actual_score' );
```

**Example with CUSTOM_METRIC_COLUMNS**

Specify custom numeric columns to compute additional bespoke metrics.

```sqlexample
CREATE MODEL MONITOR my_monitor_custom WITH
    MODEL = my_model
    VERSION = 'v1'
    FUNCTION = 'predict'
    SOURCE = mydb.myschema.scoring_data
    WAREHOUSE = compute_wh
    REFRESH_INTERVAL = '1 day'
    AGGREGATION_WINDOW = '1 day'
    TIMESTAMP_COLUMN = event_time
    PREDICTION_SCORE_COLUMNS = ( 'prediction_score' )
    ACTUAL_SCORE_COLUMNS = ( 'actual_score' )
    CUSTOM_METRIC_COLUMNS = ( 'latency_ms', 'num_impressions' );
```

In this example, we include two custom metrics: `latency_ms` and `num_impressions`.
These are columns in the source data that are not features to the model, but are useful to track next to the model’s performance.

---
title: CREATE NETWORK POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-network-policy.md
section: SQL Commands
---

# CREATE NETWORK POLICY

Creates a network policy or replaces an existing network policy.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) or higher or a role with the global CREATE NETWORK POLICY
> privilege can create network policies.

See also:
:   [ALTER NETWORK POLICY](alter-network-policy.md) , [DROP NETWORK POLICY](drop-network-policy.md) , [SHOW NETWORK POLICIES](show-network-policies.md) , [DESCRIBE NETWORK POLICY](desc-network-policy.md)

    [ALTER ACCOUNT](alter-account.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NETWORK POLICY [ IF NOT EXISTS ] <name>
  [ ALLOWED_NETWORK_RULE_LIST = ( '<network_rule>' [ , '<network_rule>' , ... ] ) ]
  [ BLOCKED_NETWORK_RULE_LIST = ( '<network_rule>' [ , '<network_rule>' , ... ] ) ]
  [ ALLOWED_IP_LIST = ( [ '<ip_address>' ] [ , '<ip_address>' , ... ] ) ]
  [ BLOCKED_IP_LIST = ( [ '<ip_address>' ] [ , '<ip_address>' , ... ] ) ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the network policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`ALLOWED_NETWORK_RULE_LIST = ( 'network_rule' [ , 'network_rule' , ... ] )`
:   Specifies a list of [network rules](../../user-guide/network-rules.md) that contain the network identifiers that are allowed access to
    Snowflake. There is no limit on the number of network rules in the list.

`BLOCKED_NETWORK_RULE_LIST = ( 'network_rule' [ , 'network_rule' , ... ] )`
:   Specifies a list of network rules that contain the network identifiers that are denied access to Snowflake. There is no limit on the
    number of network rules in the list.

`ALLOWED_IP_LIST = ( [ ip_address ] [ , ip_address , ... ] )`
:   Specifies a list of IPv4 addresses that are allowed access to your Snowflake account. This is referred to as the *allowed list*.

    Snowflake recommends using network rules in conjunction with network policies rather than using this property. Use the
    `ALLOWED_NETWORK_RULE_LIST` property to specify network rules that contain IPv4 addresses.

    If you are not yet using network rules, specify at least one IPv4 address or CIDR block range to allow access to your Snowflake
    account. Additionally, if you are not using network rules and this property is specified with an empty list, no IPv4 addresses are
    allowed to access your Snowflake account.

`BLOCKED_IP_LIST = ( [ ip_address ] [ , ip_address , ... ] )`
:   Specifies a list of IPv4 addresses that are denied access to your Snowflake account. This is referred to as the *blocked list*.
    To unset this parameter, specify a different CIDR block range, a series of IPv4 addresses, or a single IPv4 address.

    Snowflake recommends using network rules in conjunction with network policies rather than using this parameter. Use the
    `BLOCKED_NETWORK_RULE_LIST` property to specify network rules that contain IPv4 addresses.

    To block public access, use a network rule and add the network rule to the `BLOCKED_NETWORK_RULE_LIST` property. The result is
    that only IP addresses that use private connectivity, such as AWS PrivateLink, can access your Snowflake account.

    Default: No value; no IP addresses in `ALLOWED_IP_LIST` property are blocked.

`COMMENT = 'string_literal'`
:   Specifies a comment for the network policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE NETWORK POLICY | Account | Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Use network rules in conjunction with the network policy to manage access to your Snowflake account.
* You cannot execute a CREATE OR REPLACE NETWORK POLICY command to replace an existing network policy if that policy is currently assigned
  to an account, security integration, or user.
* Each `ip_address` can cover a range of addresses using Classless Inter-Domain Routing (CIDR) notation:

  > `ip_address[/optional_prefix_length]`

  For example:

  > `192.168.1.0/24`
* When a network policy includes values for both `ALLOWED_IP_LIST` and `BLOCKED_IP_LIST`, Snowflake applies the
  blocked list first.
* The maximum number of characters for the `ALLOWED_IP_LIST` list is 100,000. Snowflake returns an error message when this
  character limit is exceeded.
* After creating a network policy, you must associate it with your account before Snowflake enforces the policy. You can associate a
  policy with your account through the [ALTER ACCOUNT](alter-account.md) command, which must be run by a user with the SECURITYADMIN
  role (or higher).

  For example:

  > ```sqlexample
  > USE ROLE SECURITYADMIN;
  >
  > ALTER ACCOUNT SET NETWORK_POLICY = <policy_name>;
  > ```

  For more details, see [Parameter management](../../user-guide/admin-account-management.md). Note that [NETWORK_POLICY](../parameters.md) is currently the only account
  parameter that can be set by users with the SECURITYADMIN role.
* Before associating a network policy with your account, your current IP address must be included in `ALLOWED_IP_LIST`; otherwise,
  the ALTER ACCOUNT command returns an error. In addition, your current IP address cannot be included in `BLOCKED_IP_LIST`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Example

Create a network policy named `allow_vpceid_block_public_policy` based on two network rules, one that allows a VPCE ID and one that
blocks public network traffic, as described in [Interaction between allowed lists and blocked lists](../../user-guide/network-policies.md).

```sqlexample
CREATE NETWORK POLICY allow_vpceid_block_public_policy
  ALLOWED_NETWORK_RULE_LIST = ('allow_vpceid_access')
  BLOCKED_NETWORK_RULE_LIST = ('block_public_access');

DESC NETWORK POLICY rule_based_policy;
```

```output
+---------------------------+---------------------+
| name                      | value               |
|---------------------------+---------------------|
| ALLOWED_NETWORK_RULE_LIST | ALLOW_VPCEID_ACCESS |
+---------------------------+---------------------+
| BLOCKED_NETWORK_RULE_LIST | BLOCK_PUBLIC_ACCESS |
+---------------------------+---------------------+
```

---
title: CREATE NETWORK RULE
source: https://docs.snowflake.com/en/sql-reference/sql/create-network-rule.md
section: SQL Commands
---

# CREATE NETWORK RULE

Creates a network rule or replaces an existing network rule.

See also:
:   [ALTER NETWORK RULE](alter-network-rule.md) , [DROP NETWORK RULE](drop-network-rule.md) , [SHOW NETWORK RULES](show-network-rules.md) ,
    [DESCRIBE NETWORK RULE](desc-network-rule.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NETWORK RULE <name>
   TYPE = { IPV4 | AWSVPCEID | AZURELINKID | GCPPSCID | HOST_PORT | PRIVATE_HOST_PORT }
   VALUE_LIST = ( '<value>' [, '<value>', ... ] )
   MODE = { INGRESS | INTERNAL_STAGE | SNOWFLAKE_MANAGED_STORAGE_VOLUME | EGRESS }
   [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the network rule.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = { IPV4 | AWSVPCEID | AZURELINKID | GCPPSCID | HOST_PORT | PRIVATE_HOST_PORT }`
:   Specifies the type of network identifiers being allowed or blocked. A network rule can have only one type.

    * `IPV4` indicates that the network rule will allow or block network traffic based on the IPv4 address of the request origin.
    * `AWSVPCEID` indicates that the network rule will allow or block network traffic over
      [AWS PrivateLink](https://docs.aws.amazon.com/vpc/latest/privatelink/what-is-privatelink.html).
    * `AZURELINKID` indicates that the network rule will allow or block network traffic over
      [Azure Private Link](https://learn.microsoft.com/en-us/azure/private-link/private-link-overview).
    * `GCPPSCID` indicates that the network rule will allow or block network traffic over
      [Google Cloud Private Service Connect](https://docs.cloud.google.com/vpc/docs/private-service-connect#endpoints).
    * `HOST_PORT` indicates that the network rule will allow outgoing network traffic based on the domain of the request destination.

      When `TYPE = HOST_PORT`, the `MODE` parameter should be set to `EGRESS`.
    * `PRIVATE_HOST_PORT` indicates that the network rule allows outgoing network traffic to use
      [private connectivity](../../user-guide/private-connectivity-outbound.md) to an external network location.

      When `TYPE = PRIVATE_HOST_PORT`, the `MODE` parameter must be set to `EGRESS`.

`VALUE_LIST = ( 'value' [, 'value', ... ] )`
:   Specifies the network identifiers that will be allowed or blocked.

    Valid values in the list are determined by the type of network rule:

    > * When `TYPE = IPV4`, each value must be a valid IPv4 address or [range of addresses](alter-network-rule.md).
    > * When `TYPE = AWSVPCEID`, each value must be a valid VPCE ID. VPC IDs are not supported.
    > * When `TYPE = AZURELINKID`, each value must be a valid LinkID of an Azure [private endpoint](https://learn.microsoft.com/en-us/azure/private-link/private-endpoint-overview).
    >   Execute the [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](../functions/system_get_privatelink_authorized_endpoints.md) function to retrieve the LinkID associated
    >   with an account.
    > * When `TYPE = GCPPSCID`, each value must be a valid pscConnectionID of a [Google Cloud Private Service Connect (PSC) endpoint](https://docs.cloud.google.com/vpc/docs/private-service-connect#endpoints). Run the [gcloud compute forwarding-rules describe command](https://docs.cloud.google.com/memorystore/docs/cluster/multiple-vpcs-automatically-registered-psc-connection#get_the_connection_id_1) to get the pscConnectionID for each forwarding rule.
    > * When `TYPE = HOST_PORT`, each value must resolve to a valid domain. Optionally, it can also include a port or range of ports.
    >
    >   In most cases, the valid port range is 1-65535. If you do not specify a port, it defaults to 443. If an external network location supports dynamic ports, you need to specify all possible ports.
    >
    >   To allow access to all ports, define the port as 0; for example, `example.com:0`.
    >
    >   When the value resolves to a domain, you can use a single asterisk as a wildcard character. The asterisk matches only alphanumeric
    >   characters and hyphens (`-`).
    >
    >   Wildcards are supported only for a single level of subdomains, as in the following examples:
    >
    >   + `*.google.com`
    >   + `snowflake-*.google.com` and `snowflake*abc.google.com`
    >
    >   You can allow requests to all outbound endpoints by specifying `0.0.0.0` as the domain, as in the examples below.
    >   When you specify `0.0.0.0` as the domain, you may use only 443 and 80 as port values.
    >
    >   + Allow access to all endpoints at port 80
    >
    >     ```none
    >     value_list = ('0.0.0.0:80');
    >     ```
    >   + Allow access to all endpoints at port 443
    >
    >     ```none
    >     value_list = ('0.0.0.0:443');
    >     ```
    >
    >     ```none
    >     value_list = ('0.0.0.0');
    >     ```
    >   + Allow access to all endpoints at both port 80 and 443
    >
    >     ```none
    >     value_list = ('0.0.0.0:80', '0.0.0.0:443');
    >     ```
    > * When `TYPE = PRIVATE_HOST_PORT`, specify one valid domain.
    >
    >   In most cases, the valid port range is 1-65535. If you do not specify a port, it defaults to 443. If an external network location supports dynamic ports, you need to specify all possible ports.
    >
    >   To allow access to all ports, define the port as 0; for example, `example.com:0`.

`MODE = { INGRESS | INTERNAL_STAGE | SNOWFLAKE_MANAGED_STORAGE_VOLUME | EGRESS }`
:   Specifies what is restricted by the network rule.

    `INGRESS`
    :   The behavior of the `INGRESS` mode depends on the value of the network rule’s `TYPE` property.

        * If `TYPE=IPV4`, by default the network rule controls access to the Snowflake service only.

          If the account administrator enables the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../parameters.md) parameter, then `MODE=INGRESS` and
          `TYPE=IPV4` also protects an AWS internal stage.
        * If `TYPE=AWSVPCEID`, `TYPE=AZURELINKID`, or `TYPE=GCPPSCID`,then the network rule controls access to the Snowflake service only.

    `INTERNAL_STAGE`
    :   Allows or blocks requests to an AWS internal stage without restricting access to the Snowflake service. Using this mode requires the
        following:

        * The account administrator must enable the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../parameters.md) parameter.
        * The `TYPE` property of the network rule must be `AWSVPCEID`.

    `SNOWFLAKE_MANAGED_STORAGE_VOLUME`
    :   Allows or blocks requests to an AWS Snowflake-managed storage volume without restricting access to the Snowflake service. Using
        this mode requires the following:

        * The account administrator must enable the
          [ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME](../parameters.md) parameter.
        * The `TYPE` property of the network rule must be `AWSVPCEID`.

    `EGRESS`
    :   Allows Snowflake to send requests to an external destination.

    Default: `INGRESS`

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the network rule.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE NETWORK RULE | Schema | Only the ACCOUNTADMIN and SECURITYADMIN roles, along with the schema owner, have this privilege by default. It can be granted to additional roles as needed. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When specifying IP addresses for a network rule, Snowflake supports ranges of IP addresses using [Classless Inter-Domain Routing (CIDR) notation](https://tools.ietf.org/html/rfc4632).

  For example, `192.168.1.0/24` represents all IPv4 addresses in the range of `192.168.1.0` to `192.168.1.255`.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a network rule that is used to allow or block traffic from an AWS S3 endpoint to the internal stage:

```sqlexample
CREATE NETWORK RULE corporate_network
  TYPE = AWSVPCEID
  VALUE_LIST = ('vpce-123abc3420c1931')
  MODE = INTERNAL_STAGE
  COMMENT = 'corporate privatelink endpoint';
```

Create a network rule that is used to allow or block traffic from an AWS S3 endpoint to a Snowflake-managed storage volume:

```sqlexample
CREATE NETWORK RULE managed_volume_network
  TYPE = AWSVPCEID
  VALUE_LIST = ('vpce-123abc3420c1931')
  MODE = SNOWFLAKE_MANAGED_STORAGE_VOLUME
  COMMENT = 'Snowflake-managed storage volume privatelink endpoint';
```

Create a network rule that is used to allow or block traffic from a range of IP addresses to the Snowflake service and internal stage:

```sqlexample
CREATE NETWORK RULE cloud_network
  TYPE = IPV4
  VALUE_LIST = ('47.88.25.32/27')
  COMMENT ='cloud egress ip range';
```

Create a network rule that is used to allow or block traffic from a Google Cloud pscConnectionId to the Snowflake service:

```sqlexample
CREATE NETWORK RULE gcp_rule
  TYPE = GCPPSCID
  MODE = INGRESS
  VALUE_LIST = ('31618973889077266');
```

Create a network rule that is used to allow a domain and domain/port combination when Snowflake is sending requests to external destinations:

```sqlexample
CREATE NETWORK RULE external_access_rule
  TYPE = HOST_PORT
  MODE = EGRESS
  VALUE_LIST = ('example.com', 'example.com:443');
```

Create a network rule to enable outbound private connectivity for
[external network access](../../developer-guide/external-network-access/external-network-access-overview.md):

```sqlexample
CREATE OR REPLACE NETWORK RULE ext_network_access_db.network_rules.azure_sql_private_rule
  MODE = EGRESS
  TYPE = PRIVATE_HOST_PORT
  VALUE_LIST = ('externalaccessdemo.database.windows.net');
```

---
title: CREATE NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/create-notebook.md
section: SQL Commands
---

# CREATE NOTEBOOK

Creates a new [Snowflake notebook](../../user-guide/ui-snowsight/notebooks.md) or replaces an existing notebook.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTEBOOK [ IF NOT EXISTS ] <name>
  [ FROM '<source_location>' ]
  [ MAIN_FILE = '<main_file_name>' ]
  [ COMMENT = '<string_literal>' ]
  [ QUERY_WAREHOUSE = <warehouse_to_run_nb_and_sql_queries_in> ]
  [ IDLE_AUTO_SHUTDOWN_TIME_SECONDS = <number_of_seconds> ]
  [ RUNTIME_NAME = '<runtime_name>' ]
  [ COMPUTE_POOL = '<compute_pool_name>' ]
  [ WAREHOUSE = <warehouse_to_run_notebook_python_runtime> ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the notebook; must be unique for the schema in which the notebook is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`FROM 'source_location'`
:   Specifies that the notebook should be created from an `.ipynb` file in the specified stage location. To create the notebook from a file
    on a stage, set `source_location` to the stage location of the file, and set the MAIN_FILE parameter to the name of the file.

    If this parameter is not specified, the notebook object is created from a template notebook.

`MAIN_FILE = 'main_file_name'`
:   User-specified identifier for the notebook file name. This is separate from the notebook object name, which is specified in the
    `name` parameter. This file must be an `ipynb` file.

`QUERY_WAREHOUSE = warehouse_name`
:   Specifies the warehouse where SQL queries in the notebook are run.
    This parameter is optional. However, it is required to run the EXECUTE NOTEBOOK command.

`IDLE_AUTO_SHUTDOWN_TIME_SECONDS = number_of_seconds`
:   Number of seconds of idle time before the notebook is shut down automatically. This parameter is only available for notebooks running
    on the Container Runtime. The value must be an integer between 60 and 259200 (72 hours).

    Default: 3600 seconds

`RUNTIME_NAME = runtime_name`
:   * `'SYSTEM$WAREHOUSE_RUNTIME'` (default): Runs the notebook in a Snowflake warehouse (Warehouse Runtime only).
    * `'SYSTEM$BASIC_RUNTIME'`: Runs the notebook in a Snowpark Container Services (SPCS) container using a CPU runtime (Container Runtime only).
    * `'SYSTEM$GPU_RUNTIME'`: Runs the notebook in a Snowpark Container Services (SPCS) container using a GPU runtime (Container Runtime only).

    When specifying a Container Runtime (`SYSTEM$BASIC_RUNTIME` or `SYSTEM$GPU_RUNTIME`), you must also include the `COMPUTE_POOL` parameter. `SYSTEM$WAREHOUSE_RUNTIME` is for Warehouse Runtime only.

`COMPUTE_POOL = compute_pool_name`
:   (Container Runtime only) Specifies the compute pool that hosts the notebook when using a Container Runtime. This parameter is required when `RUNTIME_NAME` is set
    to `SYSTEM$BASIC_RUNTIME` or `SYSTEM$GPU_RUNTIME`.

    For more information about compute pools, see [Snowpark Container Services: Working with compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

`WAREHOUSE = warehouse_name`
:   The warehouse is used to run:

    > * For Warehouse Runtime: Both the notebook kernel and SQL queries (including Snowpark pushdown compute).
    > * For Container Runtime: Only SQL queries (including Snowpark pushdown compute). The notebook kernel runs on the compute pool.

    If you don’t specify a warehouse when you create a notebook, Snowflake uses the default warehouse defined by the schema lineage
    parameter `DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE`. You can set this parameter at the schema, database, or account lineage level to define a
    preferred warehouse.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE | Database |
| USAGE or OWNERSHIP | Schema |
| CREATE NOTEBOOK | Schema |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When creating a notebook that uses a Container Runtime, the notebook runs inside a Snowpark Container Services environment. Container runtime notebooks must
  specify both the `RUNTIME_NAME` and `COMPUTE_POOL` parameters.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following creates a notebook named `mynotebook`:

```sqlexample
CREATE NOTEBOOK mynotebook;
```

Although the QUERY_WAREHOUSE parameter is optional, specifying it is recommended when creating a new notebook so
that EXECUTE NOTEBOOK can be run on the warehouse.

```sqlexample
CREATE NOTEBOOK mynotebook
 QUERY_WAREHOUSE = my_warehouse;
```

The following example creates a notebook from an `ipynb` file on a stage:

```sqlexample
CREATE NOTEBOOK mynotebook
 FROM '@my_db.my_schema.my_stage'
 MAIN_FILE = 'my_notebook_file.ipynb'
 QUERY_WAREHOUSE = my_warehouse;
```

The following example creates a notebook using Container Runtime (CPU):

```sqlexample
CREATE NOTEBOOK my_cpu_notebook
  RUNTIME_NAME = 'SYSTEM$BASIC_RUNTIME'
  COMPUTE_POOL = 'my_compute_pool';
```

The following example creates a notebook using Container Runtime (GPU):

```sqlexample
CREATE NOTEBOOK my_gpu_notebook
  RUNTIME_NAME = 'SYSTEM$GPU_RUNTIME'
  COMPUTE_POOL = 'gpu_pool_1';
```

---
title: CREATE NOTEBOOK PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/create-notebook-project.md
section: SQL Commands
---

# CREATE NOTEBOOK PROJECT

Creates a notebook project object. A [notebook project object (NPO)](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-schedule.md) links a Snowsight workspace
to a database and schema. When the NPO is created, all files from the workspace are copied into the project in the specified database and schema.
The notebook project can then be executed using [EXECUTE NOTEBOOK PROJECT](execute-notebook-project.md).
You can create a notebook project object from a stage or a private workspace.

> **Note:**
>
> Creating notebook project objects from shared workspaces is not currently supported.

See also:
:   [EXECUTE NOTEBOOK PROJECT](execute-notebook-project.md), [SHOW NOTEBOOK PROJECTS](show-notebook-projects.md), [CREATE NOTEBOOK](create-notebook.md), [EXECUTE NOTEBOOK](execute-notebook.md)

## Syntax

**Create a notebook project object from a private workspace:**

```sqlsyntax
CREATE NOTEBOOK PROJECT <database_name>.<schema_name>.<project_name>
  FROM 'snow://workspace/<workspace_path>'
  [ COMMENT = '<string_literal>' ];
```

**Create a notebook project object from a stage:**

```sqlsyntax
CREATE NOTEBOOK PROJECT [ IF NOT EXISTS ] <database_name>.<schema_name>.<project_name>
  FROM '@<database_name>.<schema_name>.<stage_name>'
  [ COMMENT = '<string_literal>' ];
```

## Required parameters

`database_name.schema_name.project_name`
:   Fully qualified identifier for the notebook project.

    The project name must be unique within the schema.

    Identifiers must start with an alphabetic character and cannot contain spaces or special characters unless the identifier is enclosed in double
    quotes (for example, `"My Project"`).

    Identifiers in double quotes are case-sensitive.

`FROM 'snow://workspace/{workspace_path' | '@database_name.schema_name.stage_name' }`
:   Specifies the source that backs this notebook project.

    * Use a `snow://workspace/...` URL to create the notebook project from a workspace version in Snowsight.
    * Use a stage reference (for example, `'@my_db.my_schema.my_stage'`) to create the notebook project from notebook files that you have
      deployed to an internal or temporary stage.

    When creating from a workspace, the value must be a `snow://workspace/...` URL pointing to a workspace version.

    The path typically includes:

    * USER$ or another owner.
    * Schema.
    * Workspace name.
    * Version (for example, `versions/last`).

    For example:

    * `snow://workspace/USER$.MY_SCHEMA."my_notebook_workspace"/versions/last`

To locate the workspace path, run the following command:

```sqlexample
LIST 'snow://workspace/USER$.PUBLIC.DEFAULT$/versions/last/';
```

## Optional parameters

`COMMENT = 'string_literal'`
:   Adds a comment or description to the notebook project object.

    Use comments to describe the purpose or workflow (for example, `COMMENT = 'Notebook project for this workflow'`).

    Comments are stored as object metadata; avoid including sensitive data in comments.

## Access control requirements

To execute `CREATE NOTEBOOK PROJECT`, a role must have sufficient privileges to create objects in the target database and schema. Required
privileges include:

* USAGE or OWNERSHIP on the database.
* USAGE or OWNERSHIP on the schema.
* CREATE NOTEBOOK PROJECT on the schema that allows creating objects within that schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* A notebook project points to the specified workspace version indicated in the FROM clause. Using `versions/last` always references the latest
  workspace version; using a fixed path references a static version.
* If you create the notebook project from a stage, you can update it by adding versions from the stage. For details,
  see [Run and schedule Notebooks in Workspaces](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-schedule.md).
* Use descriptive project names to simplify workflow orchestration.
* Replacing a project updates the stored workspace path and metadata.
* To run the `CREATE NOTEBOOK PROJECT` command, you must execute it from a SQL file or SQL worksheet in Workspaces, not from within a notebook cell.

## Examples

Create a notebook project for a workspace:

```sqlexample
CREATE NOTEBOOK PROJECT analytics_db.workflow_schema.workflow_proj
  FROM 'snow://workspace/USER$.workflow_schema."etl_workflow"/versions/last'
  COMMENT = 'Notebook project for nightly ETL workflow';
```

Create a notebook project from a stage:

```sqlexample
CREATE NOTEBOOK PROJECT analytics_db.workflow_schema.workflow_proj
  FROM '@NOTEBOOK_PROJECT_STAGE'
  COMMENT = 'Notebook project created from an internal or temporary stage';
```

Create a notebook project from a stage using IF NOT EXISTS:

```sqlexample
CREATE NOTEBOOK PROJECT IF NOT EXISTS ML_TRAIN_NOTEBOOK3
  FROM '@NOTEBOOK_PROJECT_STAGE1'
  COMMENT = 'Notebook project created from an internal or temporary stage';
```

---
title: CREATE NOTIFICATION INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION

Creates a new notification integration in the account or replaces an existing integration. A notification integration is a
Snowflake object that provides an interface between Snowflake and third-party messaging services (third-party cloud message
queuing services, email services, webhooks, etc.).

The syntax of the command depends on the type of the messaging service and whether the message is inbound or outbound. The
following topics explain the syntax for creating notification integrations for different use cases:

* [CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](create-notification-integration-queue-inbound-azure.md)
* [CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](create-notification-integration-queue-inbound-gcp.md)
* [CREATE NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](create-notification-integration-queue-outbound-aws.md)
* [CREATE NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](create-notification-integration-queue-outbound-azure.md)
* [CREATE NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](create-notification-integration-queue-outbound-gcp.md)
* [CREATE NOTIFICATION INTEGRATION (email)](create-notification-integration-email.md)
* [CREATE NOTIFICATION INTEGRATION (webhooks)](create-notification-integration-webhooks.md)

See also:
:   [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md) , [DESCRIBE INTEGRATION](desc-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

---
title: CREATE NOTIFICATION INTEGRATION (email)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-email.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (email)

Creates a new notification integration in the account or replaces an existing integration for
[sending email messages](../../user-guide/notifications/email-notifications.md).

See also:
:   [ALTER NOTIFICATION INTEGRATION (email)](alter-notification-integration-email.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  TYPE = EMAIL
  ENABLED = { TRUE | FALSE }
  [ ALLOWED_RECIPIENTS = ( '<email_address>' [ , ... '<email_address>' ] ) ]
  [ DEFAULT_RECIPIENTS = ( '<email_address>' [ , ... '<email_address>' ] ) ]
  [ DEFAULT_SUBJECT = '<subject_line>' ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = EMAIL`
:   Specifies that the integration creates an interface between Snowflake and a third-party email service.

## Optional parameters

`ALLOWED_RECIPIENTS = ( 'email_address' [ , ... 'email_address' ] )`
:   (For `TYPE = EMAIL`) A comma-separated list of quoted email addresses that can receive notification emails from this
    integration.

    You must specify email addresses of users in the current account. These users must
    [verify their email addresses](../../user-guide/notifications/email-notifications.md).

    The maximum number of email addresses that you can specify is 50.

    If you omit this parameter, you can send email notifications to any verified email address in the current account.

`DEFAULT_RECIPIENTS = ( 'email_address' [ , ... 'email_address' ] )`
:   Specifies the list of default recipients for messages sent with this integration. Use a comma-separated list of quoted email
    addresses to specify the default recipients.

    You must specify email addresses of users in the current account. These users must verify their email addresses.

    To override the default recipients for a given message, use the [EMAIL_INTEGRATION_CONFIG](../functions/email_integration_config.md) helper
    function when calling the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

`DEFAULT_SUBJECT = 'subject_line'`
:   Specifies the default subject line for messages sent with this integration.

    The subject cannot exceed 256 characters in length.

    Default: ‘Snowflake Email Notification’

    To override the default subject line for a given message, use the [EMAIL_INTEGRATION_CONFIG](../functions/email_integration_config.md)
    helper function when calling the [SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](../stored-procedures/system_send_snowflake_notification.md) stored procedure.

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

See [Sending email notifications](../../user-guide/notifications/email-notifications.md).

---
title: CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-queue-inbound-gcp.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)

Creates a new notification integration in the account or replaces an existing integration for receiving messages from a Google
Pub/Sub topic.

See also:
:   [ALTER NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](alter-notification-integration-queue-inbound-gcp.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ENABLED = { TRUE | FALSE }
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  GCP_PUBSUB_SUBSCRIPTION_NAME = '<subscription_id>'
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = QUEUE`
:   Specifies that this is an integration between Snowflake and a third-party cloud message-queuing service.

`NOTIFICATION_PROVIDER = GCP_PUBSUB`
:   Specifies Google Cloud Pub/Sub as the third-party cloud message queuing service.

`GCP_PUBSUB_SUBSCRIPTION_NAME = 'subscription_id'`
:   Pub/Sub topic subscription ID used to allow Snowflake access to event messages.

    > **Note:**
    >
    > A single notification integration supports a single Google Cloud Pub/Sub subscription. Referencing the same Pub/Sub
    > subscription in multiple notification integrations can result in missing data in target tables because event notifications
    > are split between notification integrations.

## Optional parameters

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Creating a single notification integration for multiple Google Cloud Pub/Sub subscriptions is not supported.

  When you create a new pipe using a notification integration with the same queue URL as another notification integration, the
  pipe creation fails with an error:

  ```output
  Notification queue already in use with another integration.
  ```
* Using the same Google Cloud Pub/Sub subscription for multiple inbound notification integrations is not supported for automated
  data loads or metadata refreshes.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* The government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.

## Examples

See the following topics:

* [Automating Snowpipe for Google Cloud Storage](../../user-guide/data-load-snowpipe-auto-gcs.md)
* [Refresh directory tables automatically for Google Cloud Storage](../../user-guide/data-load-dirtables-auto-gcs.md)
* [Refresh external tables automatically for Google Cloud Storage](../../user-guide/tables-external-gcs.md)

---
title: CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-queue-inbound-azure.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)

Creates a new notification integration in the account or replaces an existing integration for receiving messages from an Azure
Event Grid topic.

See also:
:   [ALTER NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](alter-notification-integration-queue-inbound-azure.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ENABLED = { TRUE | FALSE }
  TYPE = QUEUE
  NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
  AZURE_STORAGE_QUEUE_PRIMARY_URI = '<queue_url>'
  AZURE_TENANT_ID = '<ad_directory_id>';
  [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = QUEUE`
:   Specifies that this is an integration between Snowflake and a third-party cloud message-queuing service.

`NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE`
:   Specifies Microsoft Azure Event Grid as the third-party cloud message queuing service.

`AZURE_STORAGE_QUEUE_PRIMARY_URI = 'queue_url`

> Specifies the queue URL for the Azure Queue Storage queue created for Event Grid notifications. Use a URL in the following
> format:
>
> `https://storage_queue_account.queue.core.windows.net/storage_queue_name`
>
> > **Note:**
> >
> > A single notification integration supports a single Azure Storage queue. Referencing the same storage queue in multiple
> > notification integrations can result in missing data in target tables because event notifications are split between
> > notification integrations.

`AZURE_TENANT_ID = 'ad_directory_id'`
:   Specifies the ID of the Azure Active Directory tenant used for identity management. This ID is needed to generate the consent URL
    that grants Snowflake access to the Event Grid notification subscription.

## Optional parameters

`USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
:   Specifies whether to use private connectivity. For information about using this parameter, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Creating a single notification integration for multiple Microsoft Azure Storage queues is not supported.

  When you create a new pipe using a notification integration with the same queue URL as another notification integration, the
  pipe creation fails with an error:

  ```output
  Notification queue already in use with another integration.
  ```
* Using the same Microsoft Azure Storage queue for multiple inbound notification integrations is not supported for automated
  data loads or metadata refreshes.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* The government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.
  For more information, see [Azure Government](https://learn.microsoft.com/en-us/azure/azure-government/).

## Examples

See the following topics:

* [Automating Snowpipe for Microsoft Azure Blob Storage](../../user-guide/data-load-snowpipe-auto-azure.md)
* [Refresh directory tables automatically for Azure Blob Storage](../../user-guide/data-load-dirtables-auto-azure.md)
* [Refresh external tables automatically for Azure Blob Storage](../../user-guide/tables-external-azure.md)

---
title: CREATE NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-queue-outbound-gcp.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)

Creates a new notification integration in the account or replaces an existing integration for
[sending a message to a Google Pub/Sub topic](../../user-guide/notifications/creating-notification-integration-google-pubsub.md).

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on Google Cloud (GC).

See also:
:   [ALTER NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](alter-notification-integration-queue-outbound-gcp.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ENABLED = { TRUE | FALSE }
  TYPE = QUEUE
  DIRECTION = OUTBOUND
  NOTIFICATION_PROVIDER = GCP_PUBSUB
  GCP_PUBSUB_TOPIC_NAME = '<topic_id>'
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = QUEUE`
:   Specifies that this is an integration between Snowflake and a third-party cloud message-queuing service.

`DIRECTION = OUTBOUND`
:   Specifies that Snowflake produces the notification sent to the cloud messaging service.

`NOTIFICATION_PROVIDER = GCP_PUBSUB`
:   Specifies Google Cloud Pub/Sub as the third-party cloud message queuing service.

`GCP_PUBSUB_TOPIC_NAME = 'topic_id'`
:   Identification of the Pub/Sub topic to which Snowflake pushes notifications.

## Optional parameters

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Using the same outbound notification integration for multiple pipes is supported for push notifications.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* The government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.

## Examples

See the following topics:

* [Enabling Snowpipe error notifications for Google Pub/Sub](../../user-guide/data-load-snowpipe-errors-gcs.md)
* [Creating a notification integration to send notifications to a Google Cloud Pub/Sub topic](../../user-guide/notifications/creating-notification-integration-google-pubsub.md)

---
title: CREATE NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-queue-outbound-aws.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)

Creates a new notification integration in the account or replaces an existing integration for
[sending a message to an Amazon SNS topic](../../user-guide/notifications/creating-notification-integration-amazon-sns.md).

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on AWS.

See also:
:   [ALTER NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](alter-notification-integration-queue-outbound-aws.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ENABLED = { TRUE | FALSE }
  TYPE = QUEUE
  DIRECTION = OUTBOUND
  NOTIFICATION_PROVIDER = AWS_SNS
  AWS_SNS_TOPIC_ARN = '<topic_arn>'
  AWS_SNS_ROLE_ARN = '<iam_role_arn>'
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = QUEUE`
:   Specifies that this is an integration between Snowflake and a third-party cloud message-queuing service.

`DIRECTION = OUTBOUND`
:   Specifies that Snowflake produces the notification sent to the cloud messaging service.

`NOTIFICATION_PROVIDER = AWS_SNS`
:   Specifies Amazon Simple Notification Service (SNS) as the third-party cloud message queuing service.

`AWS_SNS_TOPIC_ARN = 'topic_arn'`
:   Amazon Resource Name (ARN) of the Amazon SNS (SNS) topic to which notifications are pushed.

`AWS_SNS_ROLE_ARN = 'iam_role_arn'`
:   ARN of the IAM role that has permissions to publish messages to the SNS topic.

    > **Note:**
    >
    > The value of AWS_SNS_ROLE_ARN is case-sensitive. Use the exact value that is specified in your AWS account.

## Optional parameters

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Using the same outbound notification integration for multiple pipes is supported for push notifications.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* The government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.
  For more information, see [AWS GovCloud (US)](https://docs.aws.amazon.com/govcloud-us/latest/UserGuide/govcloud-s3.html).

## Examples

See the following topics:

* [Enabling Snowpipe error notifications for Amazon SNS](../../user-guide/data-load-snowpipe-errors-sns.md)
* [Creating a notification integration to send notifications to an Amazon SNS topic](../../user-guide/notifications/creating-notification-integration-amazon-sns.md)

---
title: CREATE NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-queue-outbound-azure.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)

Creates a new notification integration in the account or replaces an existing integration for
[sending a message to an Azure Event Grid topic](../../user-guide/notifications/creating-notification-integration-azure-event-grid.md).

> **Note:**
>
> Currently, this feature is limited to Snowflake accounts hosted on Microsoft Azure.

See also:
:   [ALTER NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](alter-notification-integration-queue-outbound-azure.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  ENABLED = { TRUE | FALSE }
  TYPE = QUEUE
  DIRECTION = OUTBOUND
  NOTIFICATION_PROVIDER = AZURE_EVENT_GRID
  AZURE_EVENT_GRID_TOPIC_ENDPOINT = '<event_grid_topic_endpoint>'
  AZURE_TENANT_ID = '<ad_directory_id>';
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = QUEUE`
:   Specifies that this is an integration between Snowflake and a third-party cloud message-queuing service.

`DIRECTION = OUTBOUND`
:   Specifies that Snowflake produces the notification sent to the cloud messaging service.

`NOTIFICATION_PROVIDER = AZURE_EVENT_GRID`
:   Specifies Microsoft Azure Event Grid as the third-party cloud message queuing service.

`AZURE_EVENT_GRID_TOPIC_ENDPOINT = 'event_grid_topic_endpoint'`
:   Event Grid topic endpoint to which Snowflake pushes notifications.

`AZURE_TENANT_ID = 'ad_directory_id'`
:   ID of the Azure Active Directory tenant used for identity management. This ID is needed to generate the consent URL that grants
    Snowflake access to the Event Grid topic.

## Optional parameters

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Using the same outbound notification integration for multiple pipes is supported for push notifications.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* The government regions of the cloud providers do not allow event notifications to be sent to or from other commercial regions.
  For more information, see [Azure Government](https://learn.microsoft.com/en-us/azure/azure-government/).

## Examples

See the following topics:

* [Enabling Snowpipe error notifications for Microsoft Azure Event Grid](../../user-guide/data-load-snowpipe-errors-azure.md)
* [Creating a notification integration to send notifications to a Microsoft Azure Event Grid topic](../../user-guide/notifications/creating-notification-integration-azure-event-grid.md)

---
title: CREATE NOTIFICATION INTEGRATION (webhooks)
source: https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration-webhooks.md
section: SQL Commands
---

# CREATE NOTIFICATION INTEGRATION (webhooks)

Creates a new notification integration or replaces an existing integration for a
[webhook](../../user-guide/notifications/webhook-notifications.md).

See also:
:   [ALTER NOTIFICATION INTEGRATION (webhooks)](alter-notification-integration-webhooks.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md) , [DROP INTEGRATION](drop-integration.md) ,
    [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] NOTIFICATION INTEGRATION [ IF NOT EXISTS ] <name>
  TYPE = WEBHOOK
  ENABLED = { TRUE | FALSE }
  WEBHOOK_URL = '<url>'
  [ WEBHOOK_SECRET = <secret_name> ]
  [ WEBHOOK_BODY_TEMPLATE = '<template_for_http_request_body>' ]
  [ WEBHOOK_HEADERS = ( '<header_1>'='<value_1>' [ , '<header_N>'='<value_N>', ... ] ) ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to
      work.

    The value is case-insensitive.

    The default is `TRUE`.

`TYPE = WEBHOOK`
:   Specifies that this is a notification integration for a webhook.

`WEBHOOK_URL = 'url'`
:   Specifies the URL for the webhook. The URL must use the `https://` protocol.

    You can only specify the following URLs:

    * URLs for Slack webhooks. These URLs must start with `https://hooks.slack.com/services/`.
    * URLs for Microsoft Teams webhooks. These URLs must use the following general format:

      + Up until November 30, 2025, Microsoft Teams supports URLs in the following format:

        ```none
        https://<hostname>.<region>.logic.azure.com:443/workflows/<secret>
        ```
      + [From November 30, 2025 onward](https://learn.microsoft.com/en-us/troubleshoot/power-platform/power-automate/flow-run-issues/triggers-troubleshoot?tabs=new-designer#changes-to-http-or-teams-webhook-trigger-flows),
        Microsoft Teams supports URLs in the following format:

        ```none
        https://default<hostname>.environment.api.powerplatform.com/powerautomate/automations/direct/workflows/<secret>/triggers/manual/paths/invoke
        ```
      > **Note:**
      >
      > You must omit the port number (`:443`) from the URL in the WEBHOOK_URL parameter.
      > For information about the Microsoft API data format, see <https://adaptivecards.io/> .
    * URLs for PagerDuty webhooks. This URL must be `https://events.pagerduty.com/v2/enqueue`.

    If the URL includes a secret and you [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md),
    replace that secret in the URL with SNOWFLAKE_WEBHOOK_SECRET. For example, if you
    [created a secret object for the secret in a Slack webhook URL](../../user-guide/notifications/webhook-notifications.md), set
    WEBHOOK_URL to:

    ```sqlexample
    WEBHOOK_URL='https://hooks.slack.com/services/SNOWFLAKE_WEBHOOK_SECRET'
    ```

## Optional parameters

`WEBHOOK_SECRET = secret_name`
:   Specifies the [secret to use with this integration](../../user-guide/notifications/webhook-notifications.md).

    If you are using the SNOWFLAKE_WEBHOOK_SECRET placeholder in WEBHOOK_URL, WEBHOOK_BODY_TEMPLATE, or WEBHOOK_HEADERS, the
    placeholder is replaced by this secret when you send a notification.

    If the database and schema containing the secret object will not be active when you send a notification,
    [qualify the secret name with the schema name or the database and schema names](../name-resolution.md). For
    example:

    ```sqlexample
    WEBHOOK_SECRET = my_secrets_db.my_secrets_schema.my_slack_webhook_secret
    ```

    You must have the USAGE privilege on the secret (and the database and schema that contain it) to specify this parameter.

    Default: No value

`WEBHOOK_BODY_TEMPLATE = 'template_for_http_request_body'`
:   Specifies a template for the body of the HTTP request to send for the notification.

    If the webhook requires a specific format for the body of the HTTP request (for example, a specific JSON format), set this to
    a string that specifies the format. In this string:

    * If the message needs to include a secret and you
      [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md), use the SNOWFLAKE_WEBHOOK_SECRET
      placeholder where the secret should appear in the message.
    * Use the SNOWFLAKE_WEBHOOK_MESSAGE placeholder where the notification message needs to be included.

    For example:

    ```sqlexample
    WEBHOOK_BODY_TEMPLATE='{
      "routing_key": "SNOWFLAKE_WEBHOOK_SECRET",
      "event_action": "trigger",
      "payload":
        {
          "summary": "SNOWFLAKE_WEBHOOK_MESSAGE",
          "source": "Snowflake monitoring",
          "severity": "INFO",
        }
      }'
    ```

    If you set WEBHOOK_BODY_TEMPLATE, you must also set WEBHOOK_HEADERS to include the `Content-Type` header with the type
    of your message. For example, if you set WEBHOOK_BODY_TEMPLATE to a template in JSON format, set WEBHOOK_HEADERS to include
    the header `Content-Type: application/json`:

    ```sqlexample
    WEBHOOK_HEADERS=('Content-Type'='application/json')
    ```

    Default: No value

`WEBHOOK_HEADERS = ( 'header'='value' [ , 'header'='value', ... ] )`
:   Specifies a list of HTTP headers and values to include in the HTTP request for the webhook.

    If an HTTP header must include a secret (for example, the `Authorization` header) and you
    [created a secret object for that secret](../../user-guide/notifications/webhook-notifications.md), use the SNOWFLAKE_WEBHOOK_SECRET
    placeholder in the header value. For example:

    ```sqlexample
    WEBHOOK_HEADERS=('Authorization'='Basic SNOWFLAKE_WEBHOOK_SECRET')
    ```

    Default: No value

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| USAGE | Secret | If you set the WEBHOOK_SECRET property to a secret object, you must have the USAGE privilege on that secret and on the database and schema containing that secret. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

See [Creating a webhook notification integration](../../user-guide/notifications/webhook-notifications.md).

---
title: CREATE ONLINE FEATURE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-online-feature-table.md
section: SQL Commands
---

# CREATE ONLINE FEATURE TABLE

Creates a new online feature table in the current/specified schema or replaces an existing table.

See also:
:   [ALTER ONLINE FEATURE TABLE](alter-online-feature-table.md) , [DESCRIBE ONLINE FEATURE TABLE](desc-online-feature-table.md), [DROP ONLINE FEATURE TABLE](drop-online-feature-table.md) , [SHOW ONLINE FEATURE TABLES](show-online-feature-tables.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ONLINE FEATURE TABLE <name>
  PRIMARY KEY ( <col_name> [ , <col_name> , ... ] )
  TARGET_LAG = '<num> { seconds | minutes | hours | days }'
  WAREHOUSE = <warehouse_name>
  [ REFRESH_MODE = { AUTO | FULL | INCREMENTAL } ]
  [ TIMESTAMP_COLUMN = <col_name> ]
  [ [ WITH ] COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
FROM <source>
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the online feature table; must be unique for the schema in which the table is created.

`PRIMARY KEY ( col_name [ , col_name , ... ] )`
:   Specifies the required primary key constraint of the table. Primary key serves as a unique identifier of every row in the table and serves as a lookup key in the fast SELECT queries.

`TARGET_LAG = 'num { seconds | minutes | hours | days }'`
:   Specifies the maximum amount of time that the online feature table’s content should lag behind updates to the source.

    Must be between 10 seconds and 8 days, inclusive.

`WAREHOUSE = warehouse_name`
:   Specifies the name of the warehouse that provides the compute resources for refreshing the online feature table.

    You must use a role that has the USAGE privilege on this warehouse in order to create the online feature table.

`FROM source`
:   Specifies the data source of the online feature table. Must be either a view or a dynamic table.

## Optional parameters

`REFRESH_MODE = { AUTO | FULL | INCREMENTAL }`
:   Specifies the refresh mode for the online feature table.

    > **Note:**
    >
    > This property cannot be altered after you create the online feature table. To modify the property, recreate the online feature table.

    `AUTO`
    :   When refresh mode is AUTO, the system attempts to apply an incremental refresh by default. However, when incremental refresh isn’t supported or expected to perform well, the online feature table automatically selects full refresh instead.

        To determine the best mode for your use case, experiment with refresh modes and automatic recommendations. For consistent behavior across Snowflake releases, explicitly set the refresh mode on all online feature tables.

        To verify the refresh mode for your online feature tables, view online feature table refresh mode using SHOW ONLINE FEATURE TABLES command.

    `FULL`
    :   Enforces a full refresh of the online feature table, even if the online feature table can be incrementally refreshed.

    `INCREMENTAL`
    :   Enforces an incremental refresh of the online feature table. If the query that underlies the online feature table can’t perform an incremental refresh, online feature table creation fails and displays an error message.

    Default: AUTO

`TIMESTAMP_COLUMN = col_name`
:   Specifies the column in the source treated as the timestamp column.

    Default: No value

`COMMENT = 'string_literal'`
:   Specifies a comment for the online feature table.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the tag name and the tag string value. The maximum number of characters for the tag value is 256.

    Default: No value

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ONLINE FEATURE TABLE | Schema | Role that has the CREATE ONLINE FEATURE TABLE privilege on the schema. |
| USAGE | Warehouse | Required on the warehouse specified in the WAREHOUSE parameter |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following example creates an online feature table named `my_online_feature_table` with a primary key on the `ID` column:

```sqlexample
CREATE ONLINE FEATURE TABLE my_online_feature_table
  PRIMARY KEY (ID)
  TIMESTAMP_COLUMN = 'TS'
  TARGET_LAG = '30 seconds'
  WAREHOUSE = MY_WAREHOUSE
FROM MY_SOURCE_DYNAMIC_TABLE;
```

In this example, `ID` and `TS` refer to the respective columns in the existing dynamic table `MY_SOURCE_DYNAMIC_TABLE`.

---
title: CREATE OR ALTER <object>
source: https://docs.snowflake.com/en/sql-reference/sql/create-or-alter.md
section: SQL Commands
---

# CREATE OR ALTER *<object>*

CREATE OR ALTER commands are DDL commands that combine the functionality of the CREATE command and the ALTER command, enabling you to define
an object using the syntax supported by the CREATE <object> command with the limitations of the ALTER <object> command.

The commands maintain data and associations, meaning that data and other states, tag associations and attached policies, and privilege grants
on the object are preserved. However, some object transformations can result
in dropped data. For example, if a CREATE OR ALTER TABLE statement results in a dropped column, any data contained in the column is lost (but
can still be recovered with Time Travel).

CREATE OR ALTER commands enable you to apply incremental updates to objects using a declarative, idempotent method. When executed, a CREATE OR
ALTER statement results in one of these outcomes:

* If the object doesn’t exist, it’s created according to the definition.
* If the object exists, it’s altered into the object defined in the statement.
* If the object already matches the definition, it remains unchanged.

See also:
:   [CREATE <object>](create.md) , [ALTER <object>](alter.md)

## CREATE OR ALTER commands

For specific syntax, usage notes, and examples, see:

**Account Objects:**

> * [CREATE OR ALTER AUTHENTICATION POLICY](create-authentication-policy.md)
> * [CREATE OR ALTER DATABASE](create-database.md)
> * [CREATE OR ALTER ROLE](create-role.md)
> * [CREATE OR ALTER WAREHOUSE](create-warehouse.md)

**Database Objects:**

> * [CREATE OR ALTER APPLICATION ROLE](create-application-role.md)
> * [CREATE OR ALTER DATABASE ROLE](create-database-role.md)
> * [CREATE OR ALTER DATA METRIC FUNCTION](create-data-metric-function.md)
> * [CREATE OR ALTER DYNAMIC TABLE](create-dynamic-table.md)
> * [CREATE OR ALTER EXTERNAL FUNCTION](create-external-function.md)
> * [CREATE OR ALTER FILE FORMAT](create-file-format.md)
> * [CREATE OR ALTER FUNCTION](create-function.md)
> * [CREATE OR ALTER FUNCTION (Snowpark Container Services)](create-function-spcs.md)
> * [CREATE OR ALTER PROCEDURE](create-procedure.md)
> * [CREATE OR ALTER SCHEMA](create-schema.md)
> * [CREATE OR ALTER STAGE](create-stage.md)
> * [CREATE OR ALTER TABLE](create-table.md)
> * [CREATE OR ALTER TASK](create-task.md)
> * [CREATE OR ALTER VIEW](create-view.md)
> * [CREATE OR ALTER TAG](create-tag.md)

## General usage notes

* **Data governance**: The CREATE OR ALTER commands don’t support data governance changes. Existing tags or policies are unaffected by CREATE
  OR ALTER statements and remain unchanged.
* **Unsetting object properties and parameters**: If a previously set property or parameter is absent in the modified object definition, it
  unsets it.

  If you unset an explicit [parameter](../parameters.md) value, the parameter is reset to the default value. If the parameter
  is set on an object that contains the target object, the target object inherits the value set on the object that contains it. Otherwise,
  the parameter value for the object is reset to the default value.

  Unlike other properies, the CHANGE_TRACKING property will not be unset if not specified in a CREATE OR ALTER command.
* **Atomicity**: The CREATE OR ALTER TABLE command currently does not guarantee atomicity. This means that if a CREATE OR ALTER TABLE
  statement fails during execution, it is possible that a subset of changes might have been applied to the table. If there is a possibility
  of partial changes, the error message, in most cases, includes the following text:

  ```output
  CREATE OR ALTER execution failed. Partial updates may have been applied.
  ```

  For example, if the statement is attempting to drop column `A` and add a new column `B` to a table, and the
  statement is aborted, it is possible that column `A` was dropped but column `B` was not added.

  > **Note:**
  >
  > If changes are partially applied, the resulting table is still in a valid state, and you can use additional ALTER TABLE
  > statements to complete the original set of changes.

  To recover from partial updates, Snowflake recommends the following recovery mechanisms:

  + Fix forward

    - Re-execute the CREATE OR ALTER TABLE statement. If the statements succeeds on the second attempt, the target
      state is achieved.
    - Investigate the error message. If possible, fix the error and re-execute the CREATE OR ALTER TABLE statement.
  + Roll back

    If it is not possible to fix forward, Snowflake recommends manually rolling back partial changes:

    - Investigate the state of the table using the [DESCRIBE TABLE](desc-table.md) and [SHOW TABLES](show-tables.md) commands. Determine which partial
      changes were applied, if any.
    - If any partial changes were applied, execute the appropriate ALTER TABLE statements to transform the table back to its
      original state.

      > **Note:**
      >
      > In some cases, you might not be able to undo partial changes. For more information, see the supported and unsupported
      > actions for modifying column properties in the [ALTER TABLE … ALTER COLUMN](alter-table-column.md) topic.
  + If you need help recovering from a partial update, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Limitations

The specific limitations of the CREATE OR ALTER <object> command depend on the object. Some examples of limitations are as follows:

* CREATE OR ALTER TABLE commands don’t support search optimization because search optimization is not part of the CREATE TABLE syntax.
* You can’t change the data type of a column in a table to an incompatible data type.
* You can’t change the definition of an existing view.
* You must suspend a task before you can alter it.
* The variant syntax for creating objects (for example, CREATE OR ALTER TABLE … AS SELECT) is currently not supported.

For the limitations for a specific object, see the reference topic for the object.

## Example use case

If you have SQL scripts that set up Snowflake objects for an application, you can use CREATE OR ALTER <object> statements in your scripts to
make it easier to deploy changes across development, testing, and production environments. As the application evolves, you can make
modifications to the script.

By using a CREATE OR ALTER <object> statement, you can run the script in a new environment, while also re-running the script in an existing
environment. This lets you write the object definition that you want once, and then apply it across environments.

---
title: CREATE OR ALTER VERSIONED SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/create-versioned-schema.md
section: SQL Commands
---

# CREATE OR ALTER VERSIONED SCHEMA

Creates a new versioned schema or modifies an existing versioned schema. This command is only supported for application instances in the
Native Apps Framework.

See also:
:   [CREATE APPLICATION](create-application.md), [CREATE APPLICATION PACKAGE](create-application-package.md)

## Syntax

```sqlsyntax
CREATE OR ALTER VERSIONED SCHEMA <name>
  [ WITH MANAGED ACCESS ]
  [ DATA_RETENTION_TIME_IN_DAYS = ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier for the schema; must be unique for the application instance in which the schema is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`WITH MANAGED ACCESS`
:   Specifies a managed versioned schema. Managed access versioned schemas centralize privilege
    management with the schema owner.

    In regular versioned schemas, the owner of an object (i.e. the role that has the OWNERSHIP
    privilege on the object) can grant further privileges on their objects to other roles.

    In managed schemas, the schema owner manages all privilege grants, including
    [future grants](../../user-guide/security-access-control-configure.md), on objects in the schema.
    Object owners retain the OWNERSHIP privileges on the objects, however, only the schema owner can
    manage privilege grants on the objects.

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the schema, as well as specifying the
    default Time Travel retention time for all tables created in the schema. For more details, refer
    [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, refer to
    [Parameters](../parameters.md). For more information about table-level retention time, refer to
    [CREATE TABLE](create-table.md) and [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent schemas
    >   + `0` or `1` for transient schemas

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the database or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the schema.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in
    the schema to prevent streams on the tables from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for all tables added to the schema. The default
    can be overridden at the individual table level.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the schema.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SCHEMA | Application | If the schema already exists and you want to modify the schema, the OWNERSHIP privilege on the application is required. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

> **Note:**
>
> While you typically create a versioned schema in the set up script, a versioned schema can be created:
>
> * From an owner’s rights stored procedure.
> * In the consumer account using an application role that has the CREATE SCHEMA privilege on the application.

## Usage notes

* If the schema does not exist, Snowflake creates a versioned schema.
* If the schema exists and already matches command, Snowflake views this as a no-operation.
* If the schema exists and does not match the command, Snowflake modifies the versioned schema to match the command.

---
title: CREATE ORGANIZATION ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/create-organization-account.md
section: SQL Commands
---

# CREATE ORGANIZATION ACCOUNT

Creates a new [organization account](../../user-guide/organization-accounts.md).

See also:
:   [ALTER ORGANIZATION ACCOUNT](alter-organization-account.md), [SHOW ORGANIZATION ACCOUNTS](show-organization-accounts.md)

## Syntax

```sqlsyntax
CREATE ORGANIZATION ACCOUNT <name>
    ADMIN_NAME = <string>
    { ADMIN_PASSWORD = '<string_literal>' | ADMIN_RSA_PUBLIC_KEY = <string> }
    [ FIRST_NAME = <string> ]
    [ LAST_NAME = <string> ]
    EMAIL = '<string>'
    [ MUST_CHANGE_PASSWORD = { TRUE | FALSE } ]
    EDITION = { ENTERPRISE | BUSINESS_CRITICAL }
    [ REGION_GROUP = <region_group_id> ]
    [ REGION = <snowflake_region_id> ]
    [ COMMENT = '<string_literal>' ]
```

## Required Parameters

`name`
:   Specifies the identifier (that is, name) for the organization account. It must conform to the following:

    * Must be unique within an organization, regardless of which Snowflake Region the
      organization account is in.
    * Must start with an alphabetic character and cannot contain spaces or special characters except for
      underscores (`_`).

`ADMIN_NAME = string`
:   Login name of the initial administrative user of the organization account. A new user is created in the new organization account with this
    name and password and granted the GLOBALORGADMIN role in the organization account.

    A login name can be any string consisting of letters, numbers, and underscores. Login names are always case-insensitive.

`ADMIN_PASSWORD = 'string_literal'`
:   Password for the initial administrative user of the organization account. The password for the user must be enclosed in single or double quotes.

    Optional if the `ADMIN_RSA_PUBLIC_KEY` parameter is specified.

    For more information about passwords in Snowflake, see [Snowflake-provided password policy](../../user-guide/password-authentication.md).

`ADMIN_RSA_PUBLIC_KEY = string`
:   Assigns a public key to the initial administrative user of the organization account in order to implement
    [key pair authentication](../../user-guide/key-pair-auth.md) for the user.

    Optional if the `ADMIN_PASSWORD` parameter is specified.

`EMAIL = 'string_literal'`
:   Email address of the initial administrative user of the organization account. This email address is used to send any notifications about the
    organization account.

`EDITION = ENTERPRISE | BUSINESS_CRITICAL`
:   [Snowflake Edition](../../user-guide/intro-editions.md) of the organization account.

## Optional Parameters

`FIRST_NAME = string` , . `LAST_NAME = string`
:   First and last name of the initial administrative user of the organization account.

    Default: `NULL`

`MUST_CHANGE_PASSWORD = TRUE | FALSE`
:   Specifies whether the new user created to administer the organization is forced to change their password upon first login into the
    organization account.

    Default: `FALSE`

`REGION_GROUP = region_group_id`
:   ID of the region group where the organization account is created. To retrieve the region group ID for existing accounts in your
    organization, execute the [SHOW REGIONS](show-regions.md) command. For information about when you might need to specify region
    group, see [Region groups](../../user-guide/admin-account-identifier.md).

    Default: Current region group.

`REGION = snowflake_region_id`
:   [Snowflake Region ID](../../user-guide/admin-account-identifier.md) of the region where the organization account is created. If no value is provided,
    Snowflake creates the organization account in the same Snowflake Region as the current account (that is, the account in which the CREATE
    ORGANIZATION ACCOUNT statement is executed.)

    To obtain a list of the regions that are available for an organization, execute the [SHOW REGIONS](show-regions.md) command.

    Default: Current Snowflake Region.

`COMMENT = 'string_literal'`
:   Specifies a comment for the organization account.

    Default: No value

## Access Control Requirements

Only users with the ORGADMIN role can execute the command.

## Examples

Create an organization account in the same region group and Snowflake Region in which the CREATE ORGANIZATION ACCOUNT statement is executed.
The new organization administrator must change their password upon first login:

> ```sqlexample
> CREATE ORGANIZATION ACCOUNT myorgaccount
>   ADMIN_NAME = admin
>   ADMIN_PASSWORD = 'TestPassword1'
>   EMAIL = 'myemail@myorg.org'
>   MUST_CHANGE_PASSWORD = true
>   EDITION = enterprise;
> ```

---
title: CREATE ORGANIZATION LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/create-organization-listing.md
section: SQL Commands
---

# CREATE ORGANIZATION LISTING

Create an organization listing to share data products securely within your organization.

## Syntax

```sqlsyntax
CREATE ORGANIZATION LISTING [ IF NOT EXISTS ] <name>
  [ { SHARE <share_name>  |  APPLICATION PACKAGE <package_name> } ]
  AS '<yaml_manifest_string>'
  [ PUBLISH = { TRUE | FALSE } ]

CREATE ORGANIZATION LISTING [ IF NOT EXISTS ] <name>
  [ { SHARE <share_name>  |  APPLICATION PACKAGE <package_name> } ]
  FROM '<yaml_manifest_stage_location>'
  [ PUBLISH = { TRUE | FALSE } ]
```

## Parameters

`name`
:   Specifies the identifier (name) for the listing. It must conform to the following:

    * Must be unique within an account, regardless of which Snowflake Region the account is located in. The Uniform Listing Locator (ULL) must be unique within an organization.
    * Cannot contain embedded dollar signs.
    * Must conform to Snowflake identifier requirements. See [Identifier requirements](../identifiers-syntax.md).

`FROM 'yaml_manifest_stage_location'`
:   Specifies the path for the internal stage or Git repository clone manifest.yml file.

`SHARE share_name`
:   Specifies the identifier for the share to attach to the listing.

`APPLICATION PACKAGE package_name`
:   Specifies the application package attached to the listing.

    See also [SHOW APPLICATION PACKAGES](show-application-packages.md).

`AS 'yaml_manifest_string'`
:   The YAML manifest for the organization profile. For manifest field details and examples,
    see [Organization listing manifest reference](../../user-guide/collaboration/listings/organizational/org-listing-manifest-reference.md).

    Manifests are normally provided as dollar-quoted strings. For more information, see
    [Dollar-quoted string constants](../data-types-text.md).

`PUBLISH = { TRUE | FALSE }`
:   Specifies how to publish the listing.

    If TRUE, the listing is published to the Internal Marketplace immediately.

    Default: TRUE.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION LISTING or CREATE LISTING | Account | To create and alter organization listings. |

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION LISTING | ACCOUNT | To attach the specified share or the specified Snowflake Native App Framework to a listing. When specifying a Snowflake Native App Framework, OWNERSHIP or ATTACH LISTING are also required. |
| IMPORT ORGANIZATION LISTING | ACCOUNT | To mount a listing or to execute a query that uses a Uniform Listing Locator (ULL) to reference an organizational listing. |

## Usage notes

* Listings created using CREATE ORGANIZATION LISTING … are automatically published.

## Examples

This example creates a listing named MYORGLISTING using the settings specified in the manifest YAML. It targets one role in one account in one region and includes support and approver contacts.

> **Note:**
>
> `support_contact` is required.
> `approver_contact` is required if a `discovery` target is provided.

```sqlexample-yaml
USE ROLE <organization_listing_role>;

CREATE ORGANIZATION LISTING MYORGLISTING
SHARE <share_name> AS
$$
title: "My title"
description: "One region, all accounts"
organization_profile: "INTERNAL"
organization_targets:
  discovery:
    - account: "<account_name>"
      roles:
        - "<role>"
  access:
    - account: "<account_name>"
      roles:
        - "<role>"
support_contact: "support@somedomain.com"
approver_contact: "approver@somedomain.com"
locations:
   access_regions:
   - name: "PUBLIC.<snowflake_region>"
$$
```

Creates a draft listing named ‘MYLISTING’ from a specific stage location. In the following example, the `manifest.yml` file is located in the `listingmanifests` folder in the stage named `listingstage`.

```sqlexample
CREATE ORGANIZATION LISTING MYLISTING
SHARE MySHARE FROM @dbforstage.public.listingstage/listingmanifests;
```

---
title: CREATE ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/create-organization-profile.md
section: SQL Commands
---

# CREATE ORGANIZATION PROFILE

Create the organization profile that forms part of the Uniform Listing Locator (ULL)
used to publish organizational listings or query organizational listing information
without mounting the listing. To create an organization profile, you modify the
listing manifest and then move it to a stage where you can publish or unpublish it.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md), [Organization profile manifest reference](../../user-guide/collaboration/organization-profiles/org-profile-manifest-reference.md).

## Syntax

```sqlsyntax
CREATE ORGANIZATION PROFILE [ IF NOT EXISTS ] <name>

CREATE ORGANIZATION PROFILE [ IF NOT EXISTS ] <name>
  AS '<yaml_manifest_string>'
  [ VERSION <version_alias_name> ]
  [ PUBLISH = { TRUE | FALSE } ]

CREATE ORGANIZATION PROFILE [ IF NOT EXISTS ] <name>
  FROM @<yaml_manifest_stage_location>
  [ VERSION <version_alias_name> ]
  [ PUBLISH = { TRUE | FALSE } ]
```

## Required parameters

`name`
:   String that specifies the identifier (name) for the organization profile. It must be unique within the current organization. The identifier must conform to Snowflake identifier requirements. See [Identifier requirements](../identifiers-syntax.md). Additionally, organization profile names can only contain uppercase characters or numbers, they must start with an uppercase character, and the name length cannot exceed 128 characters.

`AS 'yaml_manifest_string'`
:   Specifies the YAML manifest for the organization profile.
    For organizational listing profile manifest fields,
    see [Organization profile manifest reference](../../user-guide/collaboration/organization-profiles/org-profile-manifest-reference.md).

    Inline manifests are normally provided as dollar-quoted strings.
    For more information, see [Dollar-quoted string constants](../data-types-text.md).

`FROM @yaml_manifest_stage_location`
:   Specifies the external stage, internal stage, or Git repository clone YAML format manifest stage location.

## Optional parameters

`VERSION version_alias_name`
:   Optional. Specifies the unique version identifier for the version being added. If `VERSION version_name` isn’t specified, an alias isn’t created. If the identifier contains spaces, special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. The FIRST, LAST, DEFAULT or LIVE keywords are reserved as version shortcuts and can’t be used. The unique version identifier can’t start with “version$” and can’t contain slashes ( / ). For information about identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

`PUBLISH = { TRUE | FALSE }`
:   Optional. Specifies how the organization profile should be published.

    If TRUE, the organization profile is published immediately.

    Default: FALSE.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION PROFILE | Account | Organization profiles can only be created from the organization account in an organization. The GLOBALORGADMIN role has been granted the CREATE ORGANIZATION PROFILE privilege. |

## Usage notes

* Organization profiles created using CREATE ORGANIZATION PROFILE are DRAFT until they are published.
* For usage examples of organization profile manifests, see [Manage organizational listings](../../user-guide/collaboration/listings/organizational/org-listing-manage.md).

## Examples

This example creates a database named OrgProfileDB, a stage named my_test_state_org_profile, and an organization profile with a title of MY_ORG_PROFILE. The `title` field represents the provider domain, and it’s shown under the Organization Listing and as a filter option under Providers in an Internal Marketplace.

```sqlexample-yaml
CREATE DATABASE OrgProfileDB;
CREATE STAGE my_test_stage_org_profile;
COPY INTO @my_test_stage_org_profile/manifest.yml
  FROM (
    SELECT $$
      title: "MY_ORG_PROFILE"
      description: "Profile for SE Business Unit"
      contact: "contact_name@myemail.com"
      approver_contact: "approver_name@email.com"
      allowed_publishers:
        access:
          - all_internal_accounts: "true"
      logo: "urn:icon:shieldlock:blue"
    $$
  )
  SINGLE = TRUE
  OVERWRITE = TRUE
  FILE_FORMAT = (
    COMPRESSION = NONE
    ESCAPE_UNENCLOSED_FIELD = NONE
  );
```

This example publishes an organization profile named MYPROFILENAME from the `my_test_stage_org_profile` stage.

```sqlexample
CREATE ORGANIZATION PROFILE MYPROFILENAME
 FROM @my_test_stage_org_profile
 PUBLISH=TRUE;
```

---
title: CREATE ORGANIZATION USER
source: https://docs.snowflake.com/en/sql-reference/sql/create-organization-user.md
section: SQL Commands
---

# CREATE ORGANIZATION USER

Creates a new [organization user](../../user-guide/organization-users.md).

See also:
:   [ALTER ORGANIZATION USER](alter-organization-user.md) , [DROP ORGANIZATION USER](drop-organization-user.md) , [SHOW ORGANIZATION USERS](show-organization-users.md)

## Syntax

```sqlsyntax
CREATE ORGANIZATION USER [ IF NOT EXISTS ] <name>
  [ objectProperties ]
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   EMAIL = '<string>'
>   LOGIN_NAME = '<string>'
>   DISPLAY_NAME = '<string>'
>   FIRST_NAME = '<string>'
>   MIDDLE_NAME = '<string>'
>   LAST_NAME = '<string>'
>   COMMENT = '<string>'
> ```

## Required parameters

`name`
:   Identifier for the organization user; must be unique for your organization.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`EMAIL = 'string'`
:   Email address of the user.

## Optional parameters

`LOGIN_NAME = 'string'`
:   Name that the user enters to log into the system. Login names for users must be unique across your entire organization. It cannot match the login name in a regular account that tries to import the organization user.

    A login name can be any string, including spaces and non-alphanumeric characters, such as exclamation points (`!`), percent signs
    (`%`), and asterisks (`*`); however, if the string contains spaces or non-alphanumeric characters, it must be enclosed in single
    or double quotes. Login names are always case insensitive.

    Snowflake allows specifying different user and login names to enable using common identifiers (for example, email addresses) for login.

    Default: User’s name/identifier (that is, if no value is specified, the value specified for `name` is used as the login name)

`DISPLAY_NAME = 'string'`
:   Name displayed for the user in the Snowflake web interface.

    Default: User’s name/identifier (that is, if no value is specified, the value specified for `name` is used as the display name)

`FIRST_NAME = 'string'` , . `MIDDLE_NAME = string` , . `LAST_NAME = 'string'`
:   First, middle, and last name of the user.

    Default: `NULL`

`COMMENT = 'string'`
:   Description of the user.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION USER | ACCOUNT | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Create an organization user and set the EMAIL property:

```sqlexample
CREATE ORGANIZATION USER joe EMAIL = 'joe.davis@example.com';
```

---
title: CREATE ORGANIZATION USER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/create-organization-user-group.md
section: SQL Commands
---

# CREATE ORGANIZATION USER GROUP

Creates a new [organization user group](../../user-guide/organization-users.md).

See also:
:   [ALTER ORGANIZATION USER GROUP](alter-organization-user-group.md) , [DROP ORGANIZATION USER GROUP](drop-organization-user-group.md) , [SHOW ORGANIZATION USER GROUPS](show-organization-user-groups.md)

## Syntax

```sqlsyntax
CREATE ORGANIZATION USER GROUP [ IF NOT EXISTS ] <name>
  [ IS_GRANTABLE = { TRUE | FALSE } ]
```

## Required parameters

`name`
:   Identifier for the organization user group; must be unique for your organization.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`IS_GRANTABLE = { TRUE | FALSE }`
:   Specifies whether the role that is imported into a regular account from the organization user group can be granted to an account-specific role. If `TRUE`, the role that is created when the ACCOUNTADMIN imports the organization user group can be granted to another role.

    Default: `FALSE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ORGANIZATION USER GROUP | Account | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Create an organization user group named `data_stewards`:

```sqlexample
CREATE ORGANIZATION USER GROUP data_stewards;
```

---
title: CREATE PACKAGES POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-packages-policy.md
section: SQL Commands
---

# CREATE PACKAGES POLICY

Creates a new [packages policy](../../developer-guide/udf/python/packages-policy.md) or replaces an
existing packages policy.

After creating a packages policy, apply the packages policy to your Snowflake account using
an ALTER ACCOUNT statement.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PACKAGES POLICY [ IF NOT EXISTS ] <name>
  LANGUAGE PYTHON
  [ ALLOWLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ BLOCKLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ ADDITIONAL_CREATION_BLOCKLIST = ( [ '<packageSpec>' ] [ , '<packageSpec>' ... ] ) ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the packages policy; must be unique for the schema in which the packages policy is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`LANGUAGE PYTHON`
:   Specifies the language that this packages policy will apply to.

## Optional parameters

`ALLOWLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
:   Specifies a list of package specs that are allowed.

    Default: `('*')` (i.e. allow all packages).

`BLOCKLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
:   Specifies a list of package specs that are blocked. To unset this parameter, specify an empty list.

    Default: `()` (i.e. do not block any packages).

`ADDITIONAL_CREATION_BLOCKLIST = ( [ 'packageSpec' ] [ , 'packageSpec' ... ] )`
:   Specifies a list of package specs that are blocked at creation time. To unset this parameter, specify an empty list.
    If the `ADDITIONAL_CREATION_BLOCKLIST` is set, it is appended to the basic BLOCKLIST at the creation time.
    For temporary UDFs and anonymous stored procedures, the `ADDITIONAL_CREATION_BLOCKLIST` is appended to the basic BLOCKLIST at both creation and execution time.

    Default: `()` (i.e. do not block any packages).

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the packages policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PACKAGES POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a packages policy for your current account:

> ```sqlexample
> CREATE PACKAGES POLICY yourdb.yourschema.packages_policy_prod_1
>   LANGUAGE PYTHON
>   ALLOWLIST = ('numpy', 'pandas==1.2.3', ...)
>   BLOCKLIST = ('numpy==1.2.3', 'bad_package', ...)
>   COMMENT = 'Packages policy for the prod_1 environment';
> ```

---
title: CREATE PASSWORD POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-password-policy.md
section: SQL Commands
---

# CREATE PASSWORD POLICY

Creates a new password policy or replaces an existing password policy.

After creating a password policy, apply the password policy to an account using an [ALTER ACCOUNT](alter-account.md) statement or
a user using an [ALTER USER](alter-user.md) statement.

See also:
:   [Using password policies](../../user-guide/password-authentication.md) , [DDL commands](../../user-guide/password-authentication.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PASSWORD POLICY [ IF NOT EXISTS ] <name>
  [ PASSWORD_MIN_LENGTH = <integer> ]
  [ PASSWORD_MAX_LENGTH = <integer> ]
  [ PASSWORD_MIN_UPPER_CASE_CHARS = <integer> ]
  [ PASSWORD_MIN_LOWER_CASE_CHARS = <integer> ]
  [ PASSWORD_MIN_NUMERIC_CHARS = <integer> ]
  [ PASSWORD_MIN_SPECIAL_CHARS = <integer> ]
  [ PASSWORD_MIN_AGE_DAYS = <integer> ]
  [ PASSWORD_MAX_AGE_DAYS = <integer> ]
  [ PASSWORD_MAX_RETRIES = <integer> ]
  [ PASSWORD_LOCKOUT_TIME_MINS = <integer> ]
  [ PASSWORD_HISTORY = <integer> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the password policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`PASSWORD_MIN_LENGTH = integer`
:   Specifies the minimum number of characters the password must contain.

    Supported range: 8 to 256, inclusive.

    Default: 14

`PASSWORD_MAX_LENGTH = integer`
:   Specifies the maximum number of characters the password must contain. This number must be greater than or equal to the sum of
    `PASSWORD_MIN_LENGTH`, `PASSWORD_MIN_UPPER_CASE_CHARS`, and `PASSWORD_MIN_LOWER_CASE_CHARS`.

    Supported range: 8 to 256, inclusive.

    Default: 256

`PASSWORD_MIN_UPPER_CASE_CHARS = integer`
:   Specifies the minimum number of uppercase characters the password must contain.

    Supported range: 0 to 256, inclusive.

    Default: 1

`PASSWORD_MIN_LOWER_CASE_CHARS = integer`
:   Specifies the minimum number of lowercase characters the password must contain.

    Supported range: 0 to 256, inclusive.

    Default: 1

`PASSWORD_MIN_NUMERIC_CHARS = integer`
:   Specifies the minimum number of numeric characters the password must contain.

    Supported range: 0 to 256, inclusive.

    Default: 1

`PASSWORD_MIN_SPECIAL_CHARS = integer`
:   Specifies the minimum number of special characters the password must contain.

    Supported range: 0 to 256, inclusive.

    Default: 0

`PASSWORD_MIN_AGE_DAYS = integer`
:   Specifies the number of days the user must wait before a recently changed password can be changed again.

    Supported range: 0 to 999, inclusive.

    Default: 0

`PASSWORD_MAX_AGE_DAYS = integer`
:   Specifies the maximum number of days before the password must be changed.

    Supported range: 0 to 999, inclusive.

    A value of zero (i.e. `0`) indicates that the password does not need to be changed. Snowflake does not recommend choosing this
    value for a default account-level password policy or for any user-level policy. Instead, choose a value that meets your internal
    security guidelines.

    Default: 90, which means the password must be changed every 90 days.

    > **Important:**
    >
    > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

`PASSWORD_MAX_RETRIES = integer`
:   Specifies the maximum number of attempts to enter a password before being locked out.

    Supported range: 1 to 10, inclusive.

    Default: 5

    > **Important:**
    >
    > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

`PASSWORD_LOCKOUT_TIME_MINS = integer`
:   Specifies the number of minutes the user account will be locked after exhausting the designated number of password retries
    (i.e. `PASSWORD_MAX_RETRIES`).

    Supported range: 1 to 999, inclusive.

    Default: 15

    > **Important:**
    >
    > This parameter is stateful. For details, see the note in [Custom password policy for the account and users](../../user-guide/password-authentication.md).

`PASSWORD_HISTORY = integer`
:   Specifies the number of the most recent passwords that Snowflake stores. These stored passwords cannot be repeated when a user updates
    their password value.

    The current password value does not count towards the history.

    When you increase the history value, Snowflake saves the previous values.

    When you decrease the value, Snowflake saves the stored values up to that value that is set. For example, if the history value is 8 and
    you change the history value to 3, Snowflake stores the most recent 3 passwords and deletes the 5 older password values from the history.

    Default: 5

    Max: 24

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the password policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PASSWORD POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on password policy DDL and privileges, see [DDL commands](../../user-guide/password-authentication.md).

## Usage notes

* If you want to replace an existing password policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE PASSWORD POLICY](desc-password-policy.md) command.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a password policy named `password_policy_prod_1` for your current account:

> ```sqlexample
> CREATE PASSWORD POLICY PASSWORD_POLICY_PROD_1
>     PASSWORD_MIN_LENGTH = 14
>     PASSWORD_MAX_LENGTH = 24
>     PASSWORD_MIN_UPPER_CASE_CHARS = 2
>     PASSWORD_MIN_LOWER_CASE_CHARS = 2
>     PASSWORD_MIN_NUMERIC_CHARS = 2
>     PASSWORD_MIN_SPECIAL_CHARS = 2
>     PASSWORD_MAX_AGE_DAYS = 30
>     PASSWORD_MAX_RETRIES = 3
>     PASSWORD_LOCKOUT_TIME_MINS = 30
>     PASSWORD_HISTORY = 5
>     COMMENT = 'production account password policy';
> ```

---
title: CREATE PIPE
source: https://docs.snowflake.com/en/sql-reference/sql/create-pipe.md
section: SQL Commands
---

# CREATE PIPE

Creates a new pipe in the system for defining the [COPY INTO <table>](copy-into-table.md) statement used by
[Snowpipe](../../user-guide/data-load-snowpipe-intro.md) to load data from an ingestion queue, or by [Snowpipe Streaming with high-performance architecture](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) to load data from a streaming source directly into tables.

See also:
:   [ALTER PIPE](alter-pipe.md), [DROP PIPE](drop-pipe.md) , [SHOW PIPES](show-pipes.md) , [DESCRIBE PIPE](desc-pipe.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PIPE [ IF NOT EXISTS ] <name>
  [ AUTO_INGEST = [ TRUE | FALSE ] ]
  [ ERROR_INTEGRATION = <integration_name> ]
  [ AWS_SNS_TOPIC = '<string>' ]
  [ INTEGRATION = '<string>' ]
  [ COMMENT = '<string_literal>' ]
  AS <copy_statement>
```

> **Note:**
>
> You can use the `<copy_statement>` with two different types of data sources:
>
> * A staged location: `COPY INTO mytable FROM @mystage ...`
> * A streaming source: `COPY INTO mytable FROM (SELECT ... FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING')))`

## Required parameters

`name`
:   Identifier for the pipe; must be unique for the schema in which the pipe is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`copy_statement`
:   [COPY INTO <table>](copy-into-table.md) statement used to load data from queued files into a Snowflake table. This statement serves
    as the text/definition for the pipe and is displayed in the [SHOW PIPES](show-pipes.md) output.

> **Note:**
>
> We currently do not recommend using the following functions in the `copy_statement` for Snowpipe:
>
> * CURRENT_DATE
> * CURRENT_TIME
> * CURRENT_TIMESTAMP
> * GETDATE
> * LOCALTIME
> * LOCALTIMESTAMP
> * SYSDATE
> * SYSTIMESTAMP
>
> It is a known issue that the time values inserted using these functions can be a few hours earlier than the LOAD_TIME values returned
> by the [COPY_HISTORY function](../functions/copy_history.md) or the
> [COPY_HISTORY view](../account-usage/copy_history.md).
>
> It is recommended to query [METADATA$START_SCAN_TIME](../../user-guide/querying-metadata.md) instead, which provides a more accurate representation of record loading.

## Optional parameters

`AUTO_INGEST = TRUE | FALSE`
:   Specifies whether to automatically load data files from the internal or external stage:

    > * `TRUE` enables automatic data loading.
    >
    >   Snowpipe supports loading from external stages (Amazon S3, Google Cloud Storage, or Microsoft Azure).
    > * `FALSE` disables automatic data loading. You must make calls to the Snowpipe REST API endpoints to load data files.
    >
    >   Snowpipe supports loading from internal stages (i.e. Snowflake named stages or table stages, but not user stages) or
    >   external stage (Amazon S3, Google Cloud Storage, or Microsoft Azure).

`ERROR_INTEGRATION = 'integration_name'`
:   Required only when configuring Snowpipe to send error notifications to a cloud messaging service.

    Specifies the name of the notification integration used to communicate with the messaging service. For more information, see
    [Snowpipe error notifications](../../user-guide/data-load-snowpipe-errors.md).

`AWS_SNS_TOPIC = 'string'`
:   Required only when configuring AUTO_INGEST for Amazon S3 external stages using SNS.

    Specifies the Amazon Resource Name (ARN) for the SNS topic for your S3 bucket. The CREATE PIPE statement subscribes the
    Amazon Simple Queue Service (SQS) queue to the specified SNS topic. The pipe copies files to the ingest queue triggered by event
    notifications via the SNS topic. For more information, see [Automating Snowpipe for Amazon S3](../../user-guide/data-load-snowpipe-auto-s3.md).

`INTEGRATION = 'string'`
:   Required only when configuring AUTO_INGEST for Google Cloud Storage or Microsoft Azure external stages.

    Specifies the existing notification integration used to access the storage queue. For more information, see:

    * [Automating Snowpipe for Google Cloud Storage](../../user-guide/data-load-snowpipe-auto-gcs.md)
    * [Automating Snowpipe for Microsoft Azure Blob Storage](../../user-guide/data-load-snowpipe-auto-azure.md)

    The integration name must be typed in all uppercase.

`COMMENT = 'string_literal'`
:   Specifies a comment for the pipe.

    Default: No value

## Pipes for Snowpipe Streaming with high-performance architecture

You can define a pipe for Snowpipe Streaming to load data directly from the Snowpipe Streaming API, without requiring a staged file location. This method is designed for low-latency, row-based ingestion.

The COPY INTO statement for a streaming pipe must use the DATA_SOURCE table function in the FROM clause, with the `TYPE => 'STREAMING'` argument.

> **Note:**
>
> * Pipes created for streaming don’t require an `AUTO_INGEST` parameter or a `FROM @stage` clause.
> * The `copy_statement` within a streaming pipe’s definition is used to transform and load the data received from the API.
> * Snowpipe Streaming also provides a [default pipe](../../user-guide/snowpipe-streaming/snowpipe-streaming-pipe-object.md) for each table, which is automatically created on demand. You only need to create a custom pipe if you require features like in-flight transformations or pre-clustering.

For examples, see Examples.

## Usage notes

* This SQL command requires the following minimum permissions:

  | Privilege | Object | Notes |
  | --- | --- | --- |
  | CREATE PIPE | Schema |  |
  | USAGE | Stage in the pipe definition | External stages only |
  | USAGE | Integration | Required for receiving Snowpipe error notifications |
  | READ | Stage in the pipe definition | Internal stages only |
  | SELECT, INSERT | Table in the pipe definition |  |

  SQL operations on schema objects also require the USAGE privilege on the database and schema that contain the object.
* All [COPY INTO <table>](copy-into-table.md) copy options are supported except for the following:

  + `FILES = ( 'file_name1' [ , 'file_name2', ... ] )`
  + `ON_ERROR = ABORT_STATEMENT`
  + `SIZE_LIMIT = num`
  + `PURGE = TRUE | FALSE` (i.e. automatic purging while loading)
  + `FORCE = TRUE | FALSE`

    Note that you can manually remove files from an internal (i.e. Snowflake) stage (after they’ve been loaded) using the
    [REMOVE](remove.md) command.
  + `RETURN_FAILED_ONLY = TRUE | FALSE`
  + `VALIDATION_MODE = RETURN_n_ROWS | RETURN_ERRORS | RETURN_ALL_ERRORS`
* The `PATTERN = 'regex_pattern'` copy option filters the set of files to load using a regular expression. Pattern matching
  behaves as follows depending on the AUTO_INGEST parameter value:

  > + `AUTO_INGEST = TRUE`: The regular expression filters the list of files in the stage and optional path (i.e. cloud storage location)
  >   in the COPY INTO *<table>* statement.
  > + `:AUTO_INGEST = FALSE`: The regular expression filters the list of files submitted in calls to the Snowpipe REST API
  >   `insertFiles` endpoint.
  >
  > Snowpipe trims any path segments in the stage definition from the storage location and applies the regular expression to any
  > remaining path segments and filenames. To view the stage definition, execute the [DESCRIBE STAGE](desc-stage.md) command for the
  > stage. The URL property consists of the bucket or container name and zero or more path segments. For example, if the FROM location in a COPY INTO *<table>* statement is `@s/path1/path2/` and the URL value for stage `@s` is `s3://mybucket/path1/`, then Snowpipe trims `s3://mybucket/path1/path2/` from the storage location in the FROM clause and applies the regular expression to the remaining filenames in the path.
  >
  > > **Important:**
  > >
  > > Snowflake recommends that you enable cloud event filtering for Snowpipe to reduce costs, event noise, and latency. Only use
  > > the PATTERN option when your cloud provider’s event filtering feature is not sufficient. For more information about configuring
  > > event filtering for each cloud provider, see the following pages:
  > >
  > > + **Amazon S3:** [Configuring event notifications using object key name filtering](https://docs.aws.amazon.com/AmazonS3/latest/userguide/notification-how-to-filtering.html)
  > > + **Microsoft Azure Event Grid:** [Understand event filtering for Event Grid subscriptions](https://docs.microsoft.com/en-us/azure/event-grid/event-filtering)
  > > + **Google Cloud Pub/Sub:** [Filtering messages](https://cloud.google.com/pubsub/docs/filtering)
* Using a query as the source for the COPY statement for column reordering, column omission, and casts (i.e. transforming data during
  a load) is supported. For usage examples, see [Transform data during a load](../../user-guide/data-load-transform.md). Note that only simple SELECT statements are
  supported. Filtering using a WHERE clause is not supported.
* Pipe definitions are not dynamic (i.e. a pipe is not automatically updated if the underlying stage or table changes, such as renaming
  or dropping the stage/table). Instead, you must create a new pipe and submit this pipe name in future Snowpipe REST API calls.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

> **Important:**
>
> If you recreate a pipe (using the CREATE OR REPLACE PIPE syntax), see [Recreating pipes](../../user-guide/data-load-snowpipe-manage.md) for related
> considerations and best practices.

## Examples

Create a pipe in the current schema that loads all the data from files staged in the `mystage` stage into `mytable`:

```sqlexample
CREATE PIPE mypipe
  AS
  COPY INTO mytable
  FROM @mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

Same as the previous example, but with a data transformation. Only load data from the 4th and 5th columns in the staged files, in
reverse order:

```sqlexample
CREATE PIPE mypipe2
  AS
  COPY INTO mytable(C1, C2)
  FROM (SELECT $5, $4 FROM @mystage)
  FILE_FORMAT = (TYPE = 'JSON');
```

Create a pipe that loads all the data into columns in the target table that match corresponding columns represented in the data. Column names are case-insensitive.

In addition, load metadata from the METADATA$START_SCAN_TIME and METADATA$FILENAME [metadata columns](../../user-guide/querying-metadata.md) to the columns named `c1` and `c2`.

```sqlexample
CREATE PIPE mypipe3
  AS
  (COPY INTO mytable
    FROM @mystage
    MATCH_BY_COLUMN_NAME=CASE_INSENSITIVE
    INCLUDE_METADATA = (c1= METADATA$START_SCAN_TIME, c2=METADATA$FILENAME)
    FILE_FORMAT = (TYPE = 'JSON'));
```

Create a pipe in the current schema for automatic loading of data using event notifications received from a messaging service:

**Amazon S3**

```sqlexample
CREATE PIPE mypipe_s3
  AUTO_INGEST = TRUE
  AWS_SNS_TOPIC = 'arn:aws:sns:us-west-2:001234567890:s3_mybucket'
  AS
  COPY INTO snowpipe_db.public.mytable
  FROM @snowpipe_db.public.mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

**Google Cloud Storage**

```sqlexample
CREATE PIPE mypipe_gcs
  AUTO_INGEST = TRUE
  INTEGRATION = 'MYINT'
  AS
  COPY INTO snowpipe_db.public.mytable
  FROM @snowpipe_db.public.mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

**Microsoft Azure**

```sqlexample
CREATE PIPE mypipe_azure
  AUTO_INGEST = TRUE
  INTEGRATION = 'MYINT'
  AS
  COPY INTO snowpipe_db.public.mytable
  FROM @snowpipe_db.public.mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

**Internal named stage**

Create a pipe in the current schema that automatically loads all the data files on the internal named stage named `mystage`.

> ```sqlexample
> CREATE PIPE mypipe_aws
>   AUTO_INGEST = TRUE
>   AS
>   COPY INTO snowpipe_db.public.mytable
>   FROM @snowpipe_db.public.mystage
>   FILE_FORMAT = (TYPE = 'JSON');
> ```

**Snowpipe Streaming with high-performance architecture**

Create a basic streaming pipe:

```sqlexample
CREATE OR REPLACE PIPE my_streaming_pipe
AS COPY INTO my_table
  FROM (SELECT $1, $1:c1, $1:ts FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING')));
```

Create a streaming pipe with in-flight transformations by specifying column expressions in the SELECT clause:

```sqlexample
CREATE OR REPLACE PIPE my_pipe_with_transforms
AS COPY INTO my_table (col1, col2, col3)
  FROM (
    SELECT
      $1:field1::STRING AS col1,
      $1:field2::NUMBER AS col2,
      CURRENT_TIMESTAMP() AS col3
    FROM TABLE (DATA_SOURCE(TYPE => 'STREAMING'))
  );
```

Create a streaming pipe with pre-clustering enabled for improved query performance. The target table must have clustering keys defined:

```sqlexample
CREATE OR REPLACE PIPE my_pipe_with_clustering
AS COPY INTO my_table
  FROM (SELECT $1, $1:c1, $1:ts FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING')))
  CLUSTER_AT_INGEST_TIME = TRUE;
```

**Apache Iceberg v3 support**

The following example loads data from files for Iceberg v3 tables, for both Snowflake-managed and externally managed tables:

```sqlexample
CREATE PIPE mypipe
  AUTO_INGEST = TRUE
  INTEGRATION = 'MYINT'
  AS
  COPY INTO snowpipe_db.public.my_v3_iceberg_table
  FROM @snowpipe_db.public.mystage
  FILE_FORMAT = (TYPE = 'JSON');
```

---
title: CREATE POSTGRES INSTANCE
source: https://docs.snowflake.com/en/sql-reference/sql/create-postgres-instance.md
section: SQL Commands
---

# CREATE POSTGRES INSTANCE

Creates a new [Snowflake Postgres instance](../../user-guide/snowflake-postgres/about.md) or creates a
fork of an existing instance.

Forking creates a **full, independent copy** of an instance at a specific point in time using
[point-in-time recovery (PITR)](../../user-guide/snowflake-postgres/postgres-point-in-time-recovery.md).
This is useful for recovery, testing, or creating development environments from production data.

See also:
:   [ALTER POSTGRES INSTANCE](alter-postgres-instance.md), [DESCRIBE POSTGRES INSTANCE](desc-postgres-instance.md), [DROP POSTGRES INSTANCE](drop-postgres-instance.md), [SHOW POSTGRES INSTANCES](show-postgres-instances.md)

## Syntax

```sqlsyntax
CREATE POSTGRES INSTANCE <name>
  COMPUTE_FAMILY = '<compute_family>'
  STORAGE_SIZE_GB = <storage_gb>
  AUTHENTICATION_AUTHORITY = { POSTGRES | POSTGRES_OR_SNOWFLAKE }
  [ POSTGRES_VERSION = { 16 | 17 | 18 } ]
  [ NETWORK_POLICY = '<network_policy>' ]
  [ HIGH_AVAILABILITY = { TRUE | FALSE } ]
  [ STORAGE_INTEGRATION = '<storage_integration_name>' ]
  [ POSTGRES_SETTINGS = '<json_string>' ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
```

The following syntax creates a fork of an existing instance at a point in time. The FORK clause uses
[point-in-time recovery](../../user-guide/snowflake-postgres/postgres-point-in-time-recovery.md)
with the same AT | BEFORE syntax as [Time Travel](../../user-guide/data-time-travel.md), but creates a
full physical copy of the Postgres instance:

```sqlsyntax
CREATE POSTGRES INSTANCE <name>
  FORK <source_instance>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> } ) ]
  [ COMPUTE_FAMILY = '<compute_family>' ]
  [ STORAGE_SIZE_GB = <storage_gb> ]
  [ HIGH_AVAILABILITY = { TRUE | FALSE } ]
  [ POSTGRES_SETTINGS = '<json_string>' ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
```

## Required parameters

`name`
:   Specifies the identifier (name) for the Postgres instance; must be unique for the account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`COMPUTE_FAMILY = 'compute_family'`
:   Specifies the [instance size](../../user-guide/snowflake-postgres/postgres-instance-sizes.md) for the Postgres instance.

    Snowflake Postgres offers three tiers:

    * **Burstable** (BURST_XS, BURST_S, BURST_M): Cost-effective for development and intermittent workloads. Limited to
      100GB storage and does not support high availability.
    * **Standard** (STANDARD_M through STANDARD_24XL): Balanced CPU and memory for general-purpose workloads. Supports
      all features including high availability.
    * **Memory-optimized** (HIGHMEM_L through HIGHMEM_48XL): Higher memory-to-CPU ratio for memory-intensive queries
      and large indexes. Supports all features including high availability.

    > **Note:**
    >
    > Some features require specific compute families. For example, high availability (`HIGH_AVAILABILITY = TRUE`)
    > is only available on STANDARD and HIGHMEM instances, not on BURST instances.

`STORAGE_SIZE_GB = storage_gb`
:   Specifies the storage size in GB. Must be between 10 and 65,535.

    Storage is billed separately from compute based on the allocated amount. You can increase or decrease storage size later
    using ALTER POSTGRES INSTANCE. For more information about costs, see [Snowflake Postgres Cost Evaluation](../../user-guide/snowflake-postgres/postgres-cost.md).

    > **Note:**
    >
    > When you decrease the storage size, you can’t set it too close to current disk usage. The new size must be
    > at least 1.4x the disk space currently in use. That way, there’s still room to add more data without
    > triggering an automatic storage increase.

`AUTHENTICATION_AUTHORITY = { POSTGRES | POSTGRES_OR_SNOWFLAKE }`
:   Specifies the authentication method for the instance. POSTGRES indicates that only Postgres user passwords can be used.
    POSTGRES_OR_SNOWFLAKE also allows the use of short-lived access token passwords. See
    [Snowflake Token Authentication for Snowflake Postgres](../../user-guide/snowflake-postgres/postgres-token-auth.md) for more details.

## Optional parameters

`POSTGRES_VERSION = { 16 | 17 | 18 }`
:   Specifies the major version of Postgres to use.

    While the latest version includes new features and improvements, you might choose an earlier version for application
    compatibility or to match existing instances. You can upgrade to a newer version later using ALTER POSTGRES INSTANCE.

    Default: The latest Postgres version.

`NETWORK_POLICY = 'network_policy'`
:   Specifies the [network policy](../../user-guide/snowflake-postgres/postgres-network.md) to use for the instance.
    To specify this parameter, you must have been granted the USAGE privilege on the network policy object.

    Default: No network policy is applied.

    > **Important:**
    >
    > Without a network policy, the instance can’t accept incoming connections. You can still view the instance
    > using SHOW and DESCRIBE commands, but can’t connect to the Postgres database until you attach a network
    > policy using ALTER POSTGRES INSTANCE.

`STORAGE_INTEGRATION = 'storage_integration_name'`
:   Attaches a storage integration of type `POSTGRES_EXTERNAL_STORAGE` to the Postgres instance,
    enabling the pg_lake extension to access data in external object storage. For the complete setup
    procedure, see [Configuring S3 Storage for pg_lake](../../user-guide/snowflake-postgres/postgres-pg_lake.md).

    You can also attach or remove a storage integration later using [ALTER POSTGRES INSTANCE](alter-postgres-instance.md).

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Default: No storage integration is attached.

`HIGH_AVAILABILITY = { TRUE | FALSE }`
:   Specifies whether to enable [high availability](../../user-guide/snowflake-postgres/high-availability.md) for the instance.

    High availability provisions a standby instance in a separate availability zone for automatic failover. This minimizes
    downtime if the primary becomes unavailable. Without HA, recovery requires restoring from backup, which can take hours
    for large or active instances. Note that enabling or disabling HA later using ALTER POSTGRES INSTANCE requires a
    [maintenance operation](../../user-guide/snowflake-postgres/managing-instances.md).

    > **Important:**
    >
    > Burstable instance sizes (BURST_XS, BURST_S, BURST_M) do not support high availability.

    Default: `FALSE`

`POSTGRES_SETTINGS = 'json_string'`
:   Specifies custom [Postgres server settings](../../user-guide/snowflake-postgres/postgres-server-settings.md) for the instance
    in JSON format:

    ```none
    '{"component:name" = "value", ...}'
    ```

    The format uses `component:name` where `component` is either `postgres` (for PostgreSQL server settings) or
    `pgbouncer` (for connection pooler settings). For example:

    ```none
    '{"postgres:work_mem" = "128MB", "pgbouncer:default_pool_size" = "200"}'
    ```

    See [Snowflake Postgres Server Settings](../../user-guide/snowflake-postgres/postgres-server-settings.md) for available settings.

    Default: No custom Postgres configuration parameters are set.

`COMMENT = 'string_literal'`
:   Specifies a comment for the Postgres instance.

    Comments are useful for documenting the purpose or ownership of an instance, such as “Production instance for billing
    service” or “QA environment for team X”. Unlike tags, comments are free-form text and not used for organization or
    cost tracking.

    Default: No value.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Fork parameters

Forking a Snowflake Postgres instance creates an identical copy with all the same
schema objects and table data. You can also specify a point in time so that the forked
instance reflects a previous state of the instance. That way, you can recover from data
integrity issues such as accidentally dropping objects. You can also explore scenarios
in a development and test environment, such as trying different instance configurations
with identical data. For more information, see
[Snowflake Postgres point-in-time recovery](../../user-guide/snowflake-postgres/postgres-point-in-time-recovery.md).

`FORK source_instance`
:   Creates a new instance as a fork (copy) of the specified source instance.

`{ AT | BEFORE } ( { TIMESTAMP => timestamp | OFFSET => time_difference } )`
:   Specifies the point in time to fork from. You can’t fork from a point in time more than 10 days
    in the past. The timestamp or offset must fall within the 10-day Postgres data retention period.

    The [AT | BEFORE](../constructs/at-before.md) clause accepts one of the following parameters:

    `TIMESTAMP => timestamp`
    :   Specifies an exact date and time to use for Time Travel. The value must be explicitly cast to a
        TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ data type.

    `OFFSET => time_difference`
    :   Specifies the difference in seconds from the current time, in the form `-N` where `N`
        can be an integer or arithmetic expression (for example, `-120` is 120 seconds, `-30*60` is 30 minutes).

    Default: Uses the current time.

When creating a fork, the following parameters are optional and default to the values from the source instance:

* `COMPUTE_FAMILY`
* `STORAGE_SIZE_GB`
* `HIGH_AVAILABILITY`
* `POSTGRES_SETTINGS`

## Output

When you create a new instance, the command returns one row with the following columns:

| Column | Description |
| --- | --- |
| `status` | Status of the create operation. |
| `host` | Hostname for connecting to the instance. |
| `access_roles` | User names and passwords for the `snowflake_admin` and `application` roles. |
| `default_database` | Default database for the instance. |

> **Important:**
>
> The `access_roles` column contains credentials that you can’t retrieve later. Save these details in a secure location.

When you create a fork, the command returns one row with only `status` and `host` columns. The fork uses
the same credentials that the source instance had, at the point in time that the fork corresponds to.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE POSTGRES INSTANCE | Account | By default, only the ACCOUNTADMIN role has this privilege. |
| USAGE | Network policy | Required only if specifying a NETWORK_POLICY. |
| USAGE | Storage integration | Required only if specifying a STORAGE_INTEGRATION. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Creating a new instance takes some time to complete. The instance displays its current
  [state](../../user-guide/snowflake-postgres/managing-instances.md) while it’s being built. You can use the DESC POSTGRES INSTANCE
  command to track the status during the instance setup.
* When you create a fork, you don’t specify or see the credentials. That’s because the fork uses
  the same credentials that the source instance had, at the point in time that the fork corresponds
  to. You can regenerate credentials for the forked instance later, if you need to provide access to
  a different set of users than on the original instance.
* The time needed to create a fork depends on the amount of data in the source instance. Larger databases with more
  data take longer to fork. The compute family (instance size) of the source doesn’t significantly affect fork duration.
* Forking performs a complete data copy using backup and write-ahead log (WAL) replay, which means
  that the forked instance is entirely separate: dropping the source instance does not affect any
  forks that you created from it.

  > **Note:**
  >
  > Postgres forking isn’t part of the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature, which uses
  > zero-copy technology for tables. However, forking uses the same AT | BEFORE syntax to specify a point in time.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a basic Postgres instance:

```sqlexample
CREATE POSTGRES INSTANCE my_postgres
  COMPUTE_FAMILY = 'STANDARD_S'
  STORAGE_SIZE_GB = 50
  AUTHENTICATION_AUTHORITY = POSTGRES;
```

Create a Postgres instance with high availability and a network policy:

```sqlexample
CREATE POSTGRES INSTANCE prod_postgres
  COMPUTE_FAMILY = 'STANDARD_M'
  STORAGE_SIZE_GB = 500
  AUTHENTICATION_AUTHORITY = POSTGRES
  POSTGRES_VERSION = 17
  HIGH_AVAILABILITY = TRUE
  NETWORK_POLICY = 'my_network_policy'
  COMMENT = 'Production Postgres instance';
```

Create an instance and configure a network policy later:

```sqlexample
-- Step 1: Create instance without network policy
CREATE POSTGRES INSTANCE my_postgres
  COMPUTE_FAMILY = 'STANDARD_S'
  STORAGE_SIZE_GB = 50
  AUTHENTICATION_AUTHORITY = POSTGRES;

-- Step 2: Monitor instance creation
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'state', 'host');

-- Step 3: Once READY, attach network policy to enable connections
ALTER POSTGRES INSTANCE my_postgres
  SET NETWORK_POLICY = 'my_network_policy';

-- Step 4: Now you can connect to the Postgres database using the host and credentials
-- from the CREATE output
```

Create a fork of an existing instance:

```sqlexample
CREATE POSTGRES INSTANCE my_fork
  FORK my_source_instance;
```

Create a fork at a specific point in time:

```sqlexample
CREATE POSTGRES INSTANCE my_fork
  FORK my_source_instance
  AT (TIMESTAMP => '2025-01-15 12:00:00'::TIMESTAMP_NTZ);
```

Create a fork from 2 hours ago with a different instance size:

```sqlexample
CREATE POSTGRES INSTANCE my_fork
  FORK my_source_instance
  AT (OFFSET => -7200)
  COMPUTE_FAMILY = 'STANDARD_L';
```

Create a fork for reporting with a larger instance size and different storage:

```sqlexample
-- Fork production instance for reporting workload
CREATE POSTGRES INSTANCE reporting_instance
  FORK prod_instance
  COMPUTE_FAMILY = 'HIGHMEM_XL'
  STORAGE_SIZE_GB = 500
  COMMENT = 'Dedicated reporting instance to offload analytics queries';
```

Create a fork at midnight UTC for daily testing:

```sqlexample
-- Fork at start of day (midnight UTC)
CREATE POSTGRES INSTANCE daily_test_instance
  FORK prod_instance
  AT (TIMESTAMP => '2026-02-05 00:00:00'::TIMESTAMP_NTZ);
```

Create a development fork with HA disabled to reduce costs:

```sqlexample
CREATE POSTGRES INSTANCE dev_instance
  FORK prod_instance
  COMPUTE_FAMILY = 'STANDARD_S'
  STORAGE_SIZE_GB = 100
  HIGH_AVAILABILITY = FALSE
  COMMENT = 'Development environment from prod data';
```

Recover from accidental data deletion using a fork from before the incident:

```sqlexample
-- Recover by forking from 30 minutes ago
CREATE POSTGRES INSTANCE recovered_instance
  FORK damaged_instance
  AT (OFFSET => -1800)
  COMMENT = 'Recovery fork from before data deletion';
```

---
title: CREATE PRIVACY POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-privacy-policy.md
section: SQL Commands
---

# CREATE PRIVACY POLICY

Creates a new [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) or replaces an existing privacy policy.

See also:
:   [ALTER PRIVACY POLICY](alter-privacy-policy.md) , [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) , [DROP PRIVACY POLICY](drop-privacy-policy.md) , [SHOW PRIVACY POLICIES](show-privacy-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PRIVACY POLICY [ IF NOT EXISTS ] <name>
  AS () RETURNS PRIVACY_BUDGET -> <body>
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (that is, name) for the privacy policy; must be unique for the schema in which the privacy policy is
    created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`body`
:   The SQL expression of the body calls two functions to control the return value of the policy:
    NO_PRIVACY_POLICY and PRIVACY_BUDGET. When a query is executed against a table that has been assigned the
    policy, Snowflake evaluates the conditions of the body to call the appropriate function and return a value. This return value determines
    which privacy budget, if any, is associated with the query against the privacy-protected table.

    The expression can use context functions such as [CURRENT_ROLE](../functions/current_role.md) or [INVOKER_ROLE](../functions/invoker_role.md)
    to associate a user or group of users with a privacy budget.

    If you use a [CASE](../functions/case.md) block in the body’s expression, it must include an ELSE statement that
    calls either NO_PRIVACY_POLICY or PRIVACY_BUDGET. Every user must either be associated with a privacy budget or have unrestricted access to
    the privacy-protected table. If a user should not have any access to a privacy-protected table or view, revoke SELECT privileges rather than
    trying to define this in the privacy policy.

    `NO_PRIVACY_POLICY`
    :   Use the body’s expression to call the `NO_PRIVACY_POLICY` function when you want a query to have unrestricted access to the table or view to which the privacy policy is assigned.

    `PRIVACY_BUDGET`
    :   Use the body’s expression to call the `PRIVACY_BUDGET` function when you want to return a privacy budget from the policy. The
        expression can contain conditions that allow the policy to return different privacy budgets for different queries based on factors like
        the user who is executing the query.

        In cross-account collaboration, privacy budgets are automatically namespaced by the account identifier of the consumer account, which
        prevents two different consumer accounts from sharing the same privacy budget even if the name of the privacy budget is the same. Using
        the [CURRENT_ACCOUNT](../functions/current_account.md) function to concatenate the name of the account with the name of the privacy budget
        can help distinguish between privacy budgets. For example, you could call the function as follows:
        `PRIVACY_BUDGET(BUDGET_NAME => 'external_budget.' || CURRENT_ACCOUNT())`.

        The signature of the `PRIVACY_BUDGET` function is:

        ```sqlsyntax
        PRIVACY_BUDGET(
          BUDGET_NAME=> '<string>'
          [, BUDGET_LIMIT=> <decimal> ]
          [, MAX_BUDGET_PER_AGGREGATE=> <decimal> ]
          [, BUDGET_WINDOW=> <string> ]
        )
        ```

        **Privacy budget arguments:**

        `BUDGET_NAME => expression`
        :   Resolves to the name of a privacy budget. Snowflake creates the privacy budget automatically when its name is
            specified in the body of the privacy policy.

        `BUDGET_LIMIT => decimal`
        :   A decimal number > 0 that specifies the budget limit for this privacy policy.
            This controls the total amount of privacy loss allowed. Adjusting this value
            changes how many total differentially private aggregates can be calculated
            against tables protected by this privacy budget during the refresh period. When a query is run that would
            cause the cumulative privacy loss to exceed this number, the query will fail.
            As a rough estimate, a budget
            limit of 233 with `MAX_BUDGET_PER_AGGREGATE=1` permits about 1000 aggregates
            per refresh period.

            Default: 233.0

        `MAX_BUDGET_PER_AGGREGATE => decimal`
        :   Specifies how much privacy budget is used for each aggregate function in a
            query. Adjusting this value changes the amount of noise added to each aggregate
            query, as well as the number of aggregates that can be calculated before the budget limit is reached. As an example, the query
            `select count(*), avg(a) ...` has two aggregates: `count(*)` and `avg(a)`. Specify a decimal value > 0.

            Default: 0.5

        `BUDGET_WINDOW => string`
        :   How often the privacy budget is refreshed, that is, has its cumulative privacy loss reset to 0. Valid values:

            * `Daily`: Refreshed every day at 12:00 AM UTC
            * `Weekly`: Refreshed every Sunday at 12:00 AM UTC
            * `Monthly`: Refreshed on the first day of the calendar month at 12:00 AM UTC
            * `Yearly`: Refreshed on January 1 at 12:00 AM UTC
            * `Never`: Privacy budget is never refreshed.

            Default: Weekly

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the privacy policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PRIVACY POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a privacy policy that always returns a budget named `analysts`:

> ```sqlexample
> CREATE PRIVACY POLICY my_priv_policy
>   AS ( ) RETURNS PRIVACY_BUDGET ->
>   PRIVACY_BUDGET(BUDGET_NAME=> 'analysts');
> ```

Create a privacy policy that will give `admin` unrestricted access to the privacy-protected table while associating all other users with
the privacy budget `analysts`:

> ```sqlexample
> CREATE PRIVACY POLICY my_priv_policy
>   AS () RETURNS PRIVACY_BUDGET ->
>     CASE
>       WHEN CURRENT_USER() = 'ADMIN'
>         THEN NO_PRIVACY_POLICY()
>       ELSE PRIVACY_BUDGET(BUDGET_NAME => 'analysts')
>     END;
> ```

---
title: CREATE PROCEDURE
source: https://docs.snowflake.com/en/sql-reference/sql/create-procedure.md
section: SQL Commands
---

# CREATE PROCEDURE

Creates a new [stored procedure](../../developer-guide/stored-procedure/stored-procedures-usage.md).

A procedure can be written in one of the following languages:

* [Java (using Snowpark)](../../developer-guide/stored-procedure/java/procedure-java-overview.md)
* [JavaScript](../../developer-guide/stored-procedure/stored-procedures-javascript.md)
* [Python (using Snowpark)](../../developer-guide/stored-procedure/python/procedure-python-overview.md)
* [Scala (using Snowpark)](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md)
* [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md)

> **Note:**
>
> When you want to create and call a procedure that is anonymous (rather than stored), use [CALL (with anonymous procedure)](call-with.md).
> Creating an anonymous procedure does not require a role with CREATE PROCEDURE schema privileges.

This command supports the following variants:

* CREATE OR ALTER PROCEDURE: Creates a new procedure if it doesn’t exist or alters an existing procedure.

See also:
:   [ALTER PROCEDURE](alter-procedure.md), [DROP PROCEDURE](drop-procedure.md) , [SHOW PROCEDURES](show-procedures.md) , [DESCRIBE PROCEDURE](desc-procedure.md), [CALL](call.md),
    [SHOW USER PROCEDURES](show-user-procedures.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

### Java handler

You can create a stored procedure that either includes its handler code in-line, or refers to its handler code in a JAR file. For more
information, see [Keeping handler code in-line or on a stage](../../developer-guide/inline-or-staged.md).

For examples of Java stored procedures, see [Writing Java handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/java/procedure-java-overview.md).

For in-line stored procedures, use the following syntax:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE JAVA
  RUNTIME_VERSION = '<java_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
  AS '<procedure_definition>'
```

For a stored procedure that uses a precompiled handler, use the following syntax.

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE JAVA
  RUNTIME_VERSION = '<java_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
```

### JavaScript handler

For examples of JavaScript stored procedures, see [Writing stored procedures in JavaScript](../../developer-guide/stored-procedure/stored-procedures-javascript.md).

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS <result_data_type> [ NOT NULL ]
  LANGUAGE JAVASCRIPT
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
  AS '<procedure_definition>'
```

> **Important:**
>
> JavaScript is case-sensitive, whereas SQL is not. See [Case-sensitivity in JavaScript arguments](../../developer-guide/stored-procedure/stored-procedures-javascript.md) for
> important information about using stored procedure argument names in the JavaScript code.

### Python handler

For examples of Python stored procedures, see [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md).

For in-line stored procedures, use the following syntax:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE PYTHON
  RUNTIME_VERSION = '<python_version>'
  [ ARTIFACT_REPOSITORY = `<repository_name>` ]
  [ PACKAGES = ( '<package_name>' [ , ... ] ) ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<function_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER }]
  AS '<procedure_definition>'
```

For a stored procedure in which the code is in a file on a stage, use the following syntax:

```sqlsyntax
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE PYTHON
  RUNTIME_VERSION = '<python_version>'
  [ ARTIFACT_REPOSITORY = `<repository_name>` ]
  [ PACKAGES = ( '<package_name>' [ , ... ] ) ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<module_file_name>.<function_name>'
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <name_of_integration> [ , ... ] ) ]
  [ SECRETS = ('<secret_variable_name>' = <secret_name> [ , ... ] ) ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
```

### Scala handler

You can create a stored procedure that either includes its handler code in-line, or refers to its handler code in a JAR file. For more
information, see [Keeping handler code in-line or on a stage](../../developer-guide/inline-or-staged.md).

For examples of Scala stored procedures, see [Writing Scala handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md).

For in-line stored procedures, use the following syntax:

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE SCALA
  RUNTIME_VERSION = '<scala_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark_<scala_version>:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ TARGET_PATH = '<stage_path_and_file_name_to_write>' ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
  AS '<procedure_definition>'
```

For a stored procedure that uses a precompiled handler, use the following syntax.

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] PROCEDURE <name> (
    [ <arg_name> <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> [ [ NOT ] NULL ] | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  LANGUAGE SCALA
  RUNTIME_VERSION = '<scala_runtime_version>'
  PACKAGES = ( 'com.snowflake:snowpark_<scala_version>:<version>' [, '<package_name_and_version>' ...] )
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [, '<stage_path_and_directory_or_file_name_to_read>' ...] ) ]
  HANDLER = '<fully_qualified_method_name>'
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
```

### Snowflake Scripting handler

For examples of Snowflake Scripting stored procedures, see [Writing stored procedures in Snowflake Scripting](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

```sqlsyntax
CREATE [ OR REPLACE ] PROCEDURE <name> (
    [ <arg_name> [ { IN | INPUT | OUT | OUTPUT } ] <arg_data_type> [ DEFAULT <default_value> ] ] [ , ... ] )
  [ COPY GRANTS ]
  RETURNS { <result_data_type> | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  [ NOT NULL ]
  LANGUAGE SQL
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ { VOLATILE | IMMUTABLE } ] -- Note: VOLATILE and IMMUTABLE are deprecated.
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { OWNER | CALLER | RESTRICTED CALLER } ]
  AS <procedure_definition>
```

> **Note:**
>
> If you are creating a Snowflake Scripting procedure in SnowSQL or Snowsight, you must
> use [string literal delimiters](../data-types-text.md) (`'` or `$$`) around
> `procedure definition`. See [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md).

## Variant syntax

### CREATE OR ALTER PROCEDURE

Creates a new procedure if it doesn’t already exist, or transforms an existing procedure into the procedure defined in the
statement. A CREATE OR ALTER PROCEDURE statement follows the syntax rules of a CREATE PROCEDURE statement and has the same
limitations as an [ALTER PROCEDURE](alter-procedure.md) statement.

Alterations to the following are supported:

* LOG_LEVEL, TRACE_LEVEL, COMMENT, SECURE, return type, and the procedure body.
* SECRETS, EXTERNAL_ACCESS_INTEGRATIONS, RUNTIME_VERSION, IMPORTS, and PACKAGES for Python, Scala, and Java stored procedures; also ARTIFACT_REPOSITORY for Python stored procedures.
* Execution privileges (EXECUTE AS CALLER or EXECUTE AS OWNER)

For more information, see CREATE OR ALTER PROCEDURE usage notes.

```sqlsyntax
CREATE [ OR ALTER ] PROCEDURE ...
```

## Required parameters

### All languages

`name ( [ arg_name [ { IN | INPUT | OUT | OUTPUT } ] arg_data_type` . `[ DEFAULT {default_value} ] ] [ , ... ] )`
:   Specifies the identifier (`name`), any arguments, and the default values for any optional arguments for the
    stored procedure.

    * For the identifier:

      + The identifier does not need to be unique for the schema in which the procedure is created because stored procedures are
        [identified and resolved by the combination of the name and argument types](../../developer-guide/udf-stored-procedure-naming-conventions.md).
      + The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
        identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also
        case-sensitive. See [Identifier requirements](../identifiers-syntax.md).
    * For the arguments:

      + For `arg_name`, specify the name of the argument.
      + For `{ IN | INPUT | OUT | OUTPUT }`, specify the type of the argument (input or output). The type specification is only valid
        for a Snowflake Scripting stored procedure. For more information, see [Using arguments passed to a stored procedure](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).
      + For `arg_data_type`, use the Snowflake data type that corresponds to the language that you are using.

        - For [Java stored procedures](../../developer-guide/stored-procedure/java/procedure-java-overview.md), see [SQL-Java Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [JavaScript stored procedures](../../developer-guide/stored-procedure/stored-procedures-javascript.md), see
          [SQL and JavaScript data type mapping](../../developer-guide/stored-procedure/stored-procedures-javascript.md).
        - For [Python stored procedures](../../developer-guide/stored-procedure/python/procedure-python-overview.md), see
          [SQL-Python Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For [Scala stored procedures](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md), see [SQL-Scala Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
        - For Snowflake Scripting, a [SQL data type](../../sql-reference-data-types.md).
        > **Note:**
        >
        > For stored procedures you write in Java, Python, or Scala (which use Snowpark APIs), omit the argument for the Snowpark
        > `Session` object.
        >
        > The `Session` argument is not a formal parameter that you specify in CREATE PROCEDURE or CALL. When you call your
        > stored procedure, Snowflake automatically creates a `Session` object and passes it to the handler function for your
        > stored procedure.
      + To indicate that an argument is optional, use `DEFAULT default_value` to specify the default value of the argument.
        For the default value, you can use a literal or an expression.

        If you specify any optional arguments, you must place these after the required arguments.

        If a procedure has optional arguments, you cannot define additional procedures with the same name and different signatures.

        For details, see [Specify optional arguments](../../developer-guide/udf-stored-procedure-arguments.md).

`RETURNS { result_data_type [ [ NOT ] NULL ] | TABLE ( [ col_name col_data_type [ , ... ] ] ) }`
:   Specifies the type of the result returned by the stored procedure.

    * For `result_data_type`, use the Snowflake data type that corresponds to the type of the language that you are using.

      + For [Java stored procedures](../../developer-guide/stored-procedure/java/procedure-java-overview.md), see [SQL-Java Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For [JavaScript stored procedures](../../developer-guide/stored-procedure/stored-procedures-javascript.md), see
        [SQL and JavaScript data type mapping](../../developer-guide/stored-procedure/stored-procedures-javascript.md).
      + For [Python stored procedures](../../developer-guide/stored-procedure/python/procedure-python-overview.md), see
        [SQL-Python Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For [Scala stored procedures](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md), see [SQL-Scala Data Type Mappings](../../developer-guide/udf-stored-procedure-data-type-mapping.md).
      + For Snowflake Scripting, a [SQL data type](../../sql-reference-data-types.md).
      > **Note:**
      >
      > Stored procedures you write in Snowpark (Java or Scala) must have a return value. In Snowpark (Python), when a stored procedure
      > returns no value, it is considered to be returning `None`. Note that every CREATE PROCEDURE statement must include a RETURNS
      > clause that defines a return type, even if the procedure does not explicitly return anything.
    * For `RETURNS TABLE ( [ col_name col_data_type [ , ... ] ] )`, if you know the
      [Snowflake data types](../../sql-reference-data-types.md) of the columns in the returned table, specify the column names and
      types:

      ```sqlexample
      CREATE OR REPLACE PROCEDURE get_top_sales()
        RETURNS TABLE (sales_date DATE, quantity NUMBER)
      ...
      ```

      Otherwise (for example, if you are determining the column types during run time), you can omit the column names and types:

      ```sqlexample
      CREATE OR REPLACE PROCEDURE get_top_sales()
        RETURNS TABLE ()
      ```

      > **Note:**
      >
      > Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
      > applies whether you are creating a stored or anonymous procedure.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > CALL test_return_geography_table_1();
      > ```
      >
      > If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
      >
      > ```none
      > Stored procedure execution error: data type of returned table does not match expected returned table type
      > ```
      >
      > To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE()
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE()
      >   ...
      > CALL test_return_geography_table_1();
      > ```

      RETURNS TABLE(…) is supported only when the handler is written in the following languages:

      + [Java](../../developer-guide/stored-procedure/java/procedure-java-tabular-data.md)
      + [Python](../../developer-guide/stored-procedure/python/procedure-python-tabular-data.md)
      + [Scala](../../developer-guide/stored-procedure/scala/procedure-scala-tabular-data.md)
      + [Snowflake Scripting](../snowflake-scripting/return.md)

    As a practical matter, outside of a [Snowflake Scripting block](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md),
    [the returned value cannot be used because the call cannot be part of an expression](../../developer-guide/stored-procedures-vs-udfs.md).

`LANGUAGE language`
:   Specifies the language of the stored procedure code. Note that this is optional for stored procedures written with
    [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md).

    Currently, the supported values for `language` include:

    * `JAVA` (for [Java](../../developer-guide/stored-procedure/java/procedure-java-overview.md))
    * `JAVASCRIPT` (for [JavaScript](../../developer-guide/stored-procedure/stored-procedures-javascript.md))
    * `PYTHON` (for [Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md))
    * `SCALA` (for [Scala](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md))
    * `SQL` (for [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md))

    Default: `SQL`.

`AS procedure_definition`
:   Defines the code executed by the stored procedure. The definition can consist of any valid code.

    Note the following:

    * For stored procedures for which the code is not in-line, omit the AS clause. This includes stored procedures with staged handlers.

      Instead, use the IMPORTS clause to specify the location of the file containing the code for the stored procedure. For
      details, see:

      + [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)
      + [Writing Java handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/java/procedure-java-overview.md)
      + [Writing Scala handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md)

      For more information on in-line and staged handlers, see [Keeping handler code in-line or on a stage](../../developer-guide/inline-or-staged.md).
    * You must use [string literal delimiters](../data-types-text.md) (`'` or `$$`) around
      `procedure definition` if:

      + You are using a language other than Snowflake Scripting.
      + You are creating a Snowflake Scripting procedure in SnowSQL or Snowsight. See
        [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md).
    * For stored procedures in JavaScript, if you are writing a string that contains newlines, you can use
      backquotes (also called “backticks”) around the string.

      The following example of a JavaScript stored procedure uses `$$` and backquotes because the body of the stored procedure
      contains single quotes and double quotes:

      > ```javascript
      > CREATE OR REPLACE TABLE table1 ("column 1" VARCHAR);
      > ```
      >
      > ```javascript
      > CREATE or replace PROCEDURE proc3()
      >   RETURNS VARCHAR
      >   LANGUAGE javascript
      >   AS
      >   $$
      >   var rs = snowflake.execute( { sqlText:
      >       `INSERT INTO table1 ("column 1")
      >            SELECT 'value 1' AS "column 1" ;`
      >        } );
      >   return 'Done.';
      >   $$;
      > ```
    * Snowflake does not completely validate the code when you execute the CREATE PROCEDURE command.

      For example, for Snowpark (Scala) stored procedures, the number and types of arguments are validated, but the body of
      the function is not validated. If the number or types do not match (e.g. if the Snowflake data type NUMBER is used when the
      argument is a non-numeric type), executing the CREATE PROCEDURE command causes an error.

      If the code is not valid, the CREATE PROCEDURE command will succeed, and errors will be returned when the stored procedure is
      called.

    For more details about stored procedures, see [Working with stored procedures](../../developer-guide/stored-procedure/stored-procedures-usage.md).

### Java

`RUNTIME_VERSION = 'language_runtime_version'`
:   The language runtime version to use. Currently, the supported versions are:

    * 11

`PACKAGES = ( 'snowpark_package_name' [, 'package_name' ...] )`
:   A comma-separated list of the names of packages deployed in Snowflake that should be included in the handler code’s
    execution environment. The Snowpark package is required for stored procedures, so it must always be referenced in the PACKAGES clause.
    For more information about Snowpark, see [Snowpark API](../../developer-guide/snowpark/index.md).

    By default, the environment in which Snowflake runs stored procedures includes a selected set of packages for supported languages.
    When you reference these packages in the PACKAGES clause, it is not necessary to reference a file containing the package in the IMPORTS
    clause because the package is already available in Snowflake. You can also specify the package version.

    For the list of supported packages and versions for Java, query the
    [INFORMATION_SCHEMA.PACKAGES view](../info-schema/packages.md) for rows, specifying the language. For example:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'java';
    ```

    To specify the package name and version number use the following form:

    ```none
    domain:package_name:version
    ```

    To specify the latest version, specify `latest` for `version`.

    For example, to include a package from the latest Snowpark library in Snowflake, use the following:

    ```sqlexample
    PACKAGES = ('com.snowflake:snowpark:latest')
    ```

    When specifying a package from the Snowpark library, you must specify version 1.3.0 or later.

`HANDLER = 'fully_qualified_method_name'`
:   Use the fully qualified name of the method or function for the stored procedure. This is typically in the
    following form:

    ```none
    com.my_company.my_package.MyClass.myMethod
    ```

    where:

    ```none
    com.my_company.my_package
    ```

    corresponds to the package containing the object or class:

    ```none
    package com.my_company.my_package;
    ```

### Python

`RUNTIME_VERSION = 'language_runtime_version'`
:   The language runtime version to use. Currently, the supported versions are:

    > Generally available versions:
    >
    > * 3.9 (deprecated)
    > * 3.10
    > * 3.11
    > * 3.12
    > * 3.13

`PACKAGES = ( 'snowpark_package_name' [, 'package_name' ...] )`
:   A comma-separated list of the names of packages deployed in Snowflake that should be included in the handler code’s
    execution environment. The Snowpark package is required for stored procedures, so it must always be referenced in the PACKAGES clause.
    For more information about Snowpark, see [Snowpark API](../../developer-guide/snowpark/index.md).

    By default, the environment in which Snowflake runs stored procedures includes a selected set of packages for supported languages.
    When you reference these packages in the PACKAGES clause, it is not necessary to reference a file containing the package in the IMPORTS
    clause because the package is already available in Snowflake. You can also specify the package version.

    For the list of supported packages and versions for Python, query the
    [INFORMATION_SCHEMA.PACKAGES view](../info-schema/packages.md) for rows, specifying the language. For example:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'python';
    ```

    Snowflake includes a large number of packages available through Anaconda; for more information, see
    [Using third-party packages](../../developer-guide/udf/python/udf-python-packages.md).

    To specify the package name and version number use the following form:

    ```none
    package_name[==version]
    ```

    To specify the latest version, omit the version number.

    For example, to include the spacy package version 2.3.5 (along with the latest version of the required Snowpark package), use the
    following:

    ```sqlexample
    PACKAGES = ('snowflake-snowpark-python', 'spacy==2.3.5')
    ```

    When specifying a package from the Snowpark library, you must specify version 0.4.0 or later. Omit the version number to use the
    latest version available in Snowflake.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Specifying a range of Python package versions is available as a preview feature to all accounts.

    You can specify package versions by using these version
    specifiers: `==`, `<=`, `>=`, `<`,or `>`.

    For example:

    ```sqlexample
    -- Use version 1.2.3 or higher of the NumPy package.
    PACKAGES=('numpy>=1.2.3')
    ```

`HANDLER = 'fully_qualified_method_name'`
:   Use the name of the stored procedure’s function or method. This can differ depending on whether the code is in-line or
    referenced at a stage.

    * When the code is in-line, you can specify just the function name, as in the following example:

      ```sqlexample
      CREATE OR REPLACE PROCEDURE MYPROC(from_table STRING, to_table STRING, count INT)
        ...
        HANDLER = 'run'
      AS
      $$
      def run(session, from_table, to_table, count):
        ...
      $$;
      ```
    * When the code is imported from a stage, specify the fully-qualified handler function name as `<module_name>.<function_name>`.

      ```sqlexample
      CREATE OR REPLACE PROCEDURE MYPROC(from_table STRING, to_table STRING, count INT)
        ...
        IMPORTS = ('@mystage/my_py_file.py')
        HANDLER = 'my_py_file.run';
      ```

### Scala

`RUNTIME_VERSION = 'language_runtime_version'`

> Specifies the Scala runtime version to use. The supported versions of Scala are:
>
> [Preview Feature](../../release-notes/preview-features.md) — Open
>
> Support for version 2.13 is in preview. Available to all accounts.
>
> * 2.13
> * 2.12
>
> For more information, see [Writing code to support different Scala versions](../../developer-guide/scala-version-differences.md).

`PACKAGES = ( 'snowpark_package_name' [, 'package_name' ...] )`
:   A comma-separated list of the names of packages deployed in Snowflake that should be included in the handler code’s
    execution environment. The Snowpark package is required for stored procedures, so it must always be referenced in the PACKAGES clause.
    For more information about Snowpark, see [Snowpark API](../../developer-guide/snowpark/index.md).

    By default, the environment in which Snowflake runs stored procedures includes a selected set of packages for supported languages.
    When you reference these packages in the PACKAGES clause, it is not necessary to reference a file containing the package in the IMPORTS
    clause because the package is already available in Snowflake. You can also specify the package version.

    For the list of supported packages and versions for Scala, query the
    [INFORMATION_SCHEMA.PACKAGES view](../info-schema/packages.md) for rows, specifying the language. For example:

    ```sqlexample
    SELECT * FROM INFORMATION_SCHEMA.PACKAGES WHERE LANGUAGE = 'scala';
    ```

    To specify the package name and version number use the following form:

    ```none
    domain:package_name:version
    ```

    To specify the latest version, specify `latest` for `version`.

    For example, to include a package from the latest Snowpark library in Snowflake, use the following:

    ```sqlexample
    PACKAGES = ('com.snowflake:snowpark:latest')
    ```

    Snowflake supports using Snowpark version 0.9.0 or later in a Scala stored procedure. Note, however, that these versions have
    limitations. For example, versions prior to 1.1.0 do not support the use of transactions in a stored procedure.

`HANDLER = 'fully_qualified_method_name'`
:   Use the fully qualified name of the method or function for the stored procedure. This is typically in the following form:

    ```none
    com.my_company.my_package.MyClass.myMethod
    ```

    where:

    ```none
    com.my_company.my_package
    ```

    corresponds to the package containing the object or class:

    ```none
    package com.my_company.my_package;
    ```

## Optional parameters

### All languages

`SECURE`
:   Specifies that the procedure is secure. For more information about secure procedures, see [Protecting Sensitive Information with Secure UDFs and Stored Procedures](../../developer-guide/secure-udf-procedure.md).

`{ TEMP | TEMPORARY }`
:   Specifies that the procedure persists for only the duration of the [session](../../user-guide/session-policies.md) in which you created it.
    A temporary procedure is dropped at the end of the session.

    Default: No value. If a procedure is not declared as `TEMPORARY`, it is permanent.

    You cannot create temporary [procedures](../../developer-guide/stored-procedure/stored-procedures-overview.md) that have the same name as
    a procedure that already exists in the schema.

    Note that creating a temporary procedure does not require the CREATE PROCEDURE privilege on the schema in which the object is created.

    For more information about creating temporary procedures, see [Temporary procedures](../../developer-guide/stored-procedure/stored-procedures-overview.md).

`[ [ NOT ] NULL ]`
:   Specifies whether the stored procedure can return NULL values or must return only NON-NULL values.

    The default is NULL (i.e. the stored procedure can return NULL).

`CALLED ON NULL INPUT` or . `{ RETURNS NULL ON NULL INPUT | STRICT }`
:   Specifies the behavior of the stored procedure when called with null inputs. In contrast to system-defined functions, which
    always return null when any input is null, stored procedures can handle null inputs, returning non-null values even when an
    input is null:

    * `CALLED ON NULL INPUT` will always call the stored procedure with null inputs. It is up to the procedure to handle such
      values appropriately.
    * `RETURNS NULL ON NULL INPUT` (or its synonym `STRICT`) will not call the stored procedure if any input is null,
      so the statements inside the stored procedure will not be executed. Instead, a null value will always be returned. Note that
      the procedure might still return null for non-null inputs.

    Default: `CALLED ON NULL INPUT`

`VOLATILE | IMMUTABLE`
:   Deprecated

    > **Attention:**
    >
    > These keywords are deprecated for stored procedures. These keywords are not intended to apply to stored procedures. In a
    > future release, these keywords will be removed from the documentation.

`COMMENT = 'string_literal'`
:   Specifies a comment for the stored procedure, which is displayed in the DESCRIPTION column in the [SHOW PROCEDURES](show-procedures.md) output.

    Default: `stored procedure`

`EXECUTE AS OWNER` or . `EXECUTE AS CALLER` or . `EXECUTE AS RESTRICTED CALLER`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Restricted caller’s rights (`EXECUTE AS RESTRICTED CALLER`) is a preview feature available to all accounts.

    Specifies whether the stored procedure executes with the privileges of the owner (an “owner’s rights” stored procedure) or with
    the privileges of the caller (a “caller’s rights” stored procedure):

    * If you execute CREATE PROCEDURE … EXECUTE AS OWNER, then the procedure will execute as an owner’s rights procedure.
    * If you execute the statement CREATE PROCEDURE … EXECUTE AS CALLER, then in the future the procedure will execute as a
      caller’s rights procedure.
    * If you execute the statement CREATE PROCEDURE … EXECUTE AS RESTRICTED CALLER, then in the future the procedure will execute as a
      caller’s rights procedure, but might not be able to run with all of the caller’s privileges. For more information, see
      [Restricted caller’s rights](../../developer-guide/restricted-callers-rights.md).

    If `EXECUTE AS ...` isn’t specified, the procedure runs as an owner’s rights stored procedure. Owner’s rights stored
    procedures have less access to the caller’s environment (for example, the caller’s session variables), and Snowflake defaults to this
    higher level of privacy and security.

    For more information, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

    Default: `OWNER`

`COPY GRANTS`
:   Specifies to retain the access privileges from the original procedure when a new procedure is created using CREATE OR REPLACE PROCEDURE.

    The parameter copies all privileges, except OWNERSHIP, from the existing procedure to the new procedure. The new procedure will
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE PROCEDURE
    statement owns the new procedure.

    Note:

    * The [SHOW GRANTS](show-grants.md) output for the replacement procedure lists the grantee for the copied privileges as the
      role that executed the CREATE PROCEDURE statement, with the current timestamp when the statement was executed.
    * The operation to copy grants occurs atomically in the CREATE PROCEDURE command (i.e. within the same transaction).

### Java

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [, 'stage_path_and_directory_or_file_name_to_read' ...] )`
:   The location (stage), path, and name of the directory or file(s) to import. You must set the IMPORTS clause to include any files that
    your stored procedure depends on:

    * If you are writing an in-line stored procedure, you can omit this clause, unless your code depends on classes defined outside
      the stored procedure or resource files.
    * If you are writing a stored procedure with a staged handler, you must also include a path to the JAR file containing the
      stored procedure’s handler code.
    * The IMPORTS definition cannot reference variables from arguments that are passed into the stored procedure.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different subdirectories or different stages.

`TARGET_PATH = stage_path_and_file_name_to_write`
:   Specifies the location to which Snowflake should write the JAR file containing the result of compiling the handler source code specified
    in the `procedure_definition`.

    If this clause is included, Snowflake writes the resulting JAR file to the stage location specified by the clause’s value. If this
    clause is omitted, Snowflake re-compiles the source code each time the code is needed. In that case, the JAR file is not stored
    permanently, and the user does not need to clean up the JAR file.

    Snowflake returns an error if the TARGET_PATH matches an existing file; you cannot use TARGET_PATH to overwrite an
    existing file.

    If you specify both the IMPORTS and TARGET_PATH clauses, the file name in the TARGET_PATH clause must
    be different from each file name in the IMPORTS clause, even if the files are in different subdirectories or different
    stages.

    The generated JAR file remains until you explicitly delete it, even if you drop the procedure. When you drop the procedure you should
    separately remove the JAR file because the JAR is no longer needed to support the procedure.

    For example, the following TARGET_PATH example would result in a `myhandler.jar` file generated and copied to the
    `handlers` stage.

    ```sqlexample
    TARGET_PATH = '@handlers/myhandler.jar'
    ```

    When you drop this procedure to remove it, you’ll also need to remove its handler JAR file, such as by executing the
    [REMOVE command](remove.md).

    ```sqlexample
    REMOVE @handlers/myhandler.jar;
    ```

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   The names of [external access integrations](create-external-access-integration.md) needed in order for this
    procedure’s handler code to access external networks.

    An external access integration specifies [network rules](create-network-rule.md) and
    [secrets](create-secret.md) that specify external locations and credentials (if any) allowed for use by handler code
    when making requests of an external network, such as an external REST API.

`SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
:   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
    secrets in handler code.

    Secrets you specify here must be allowed by the [external access integration](create-external-access-integration.md)
    specified as a value of this CREATE PROCEDURE command’s EXTERNAL_ACCESS_INTEGRATIONS parameter.

    This parameter’s value is a comma-separated list of assignment expressions with the following parts:

    * `secret_name` as the name of the allowed secret.

      You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
      EXTERNAL_ACCESS_INTEGRATIONS parameter.
    * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    For more information, including an example, refer to [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md).

### Python

`ARTIFACT_REPOSITORY = artifact_repository`

Specifies the name of the repository to use for installing PyPI packages for use by your procedure.

Set this to `snowflake.snowpark.pypi_shared_repository`, which is the default artifact repository provided by Snowflake.

`PACKAGES = ( 'package_name' [ , ... ] )`

Specify a list of the names of the packages that you want to install and use in your procedure. Snowflake installs these packages from the artifact repository.

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [, 'stage_path_and_directory_or_file_name_to_read' ...] )`
:   The location (stage), path, and name of the directory or file(s) to import. You must set the IMPORTS clause to include any files that
    your stored procedure depends on:

    * If you are writing an in-line stored procedure, you can omit this clause, unless your code depends on classes defined outside
      the stored procedure or resource files.
    * If your stored procedure’s code will be on a stage, you must also include a path to the module file your code is in.
    * The IMPORTS definition cannot reference variables from arguments that are passed into the stored procedure.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different subdirectories or different stages.

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   The names of [external access integrations](create-external-access-integration.md) needed in order for this
    procedure’s handler code to access external networks.

    An external access integration specifies [network rules](create-network-rule.md) and
    [secrets](create-secret.md) that specify external locations and credentials (if any) allowed for use by handler code
    when making requests of an external network, such as an external REST API.

`SECRETS = ( 'secret_variable_name' = secret_name [ , ...  ] )`
:   Assigns the names of secrets to variables so that you can use the variables to reference the secrets when retrieving information from
    secrets in handler code.

    Secrets you specify here must be allowed by the [external access integration](create-external-access-integration.md)
    specified as a value of this CREATE PROCEDURE command’s EXTERNAL_ACCESS_INTEGRATIONS parameter.

    This parameter’s value is a comma-separated list of assignment expressions with the following parts:

    * `secret_name` as the name of the allowed secret.

      You will receive an error if you specify a SECRETS value whose secret isn’t also included in an integration specified by the
      EXTERNAL_ACCESS_INTEGRATIONS parameter.
    * `'secret_variable_name'` as the variable that will be used in handler code when retrieving information from the secret.

    For more information, including an example, refer to [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md).

### Scala

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [, 'stage_path_and_directory_or_file_name_to_read' ...] )`
:   The location (stage), path, and name of the directory or file(s) to import. You must set the IMPORTS clause to include any files that
    your stored procedure depends on:

    * If you are writing an in-line stored procedure, you can omit this clause, unless your code depends on classes defined outside
      the stored procedure or resource files.
    * If you are writing a stored procedure with a staged handler, you must also include a path to the JAR file containing the
      stored procedure’s handler code.
    * The IMPORTS definition cannot reference variables from arguments that are passed into the stored procedure.

    Each file in the IMPORTS clause must have a unique name, even if the files are in different subdirectories or different stages.

`TARGET_PATH = stage_path_and_file_name_to_write`
:   Specifies the location to which Snowflake should write the JAR file containing the result of compiling the handler source code specified
    in the `procedure_definition`.

    If this clause is included, Snowflake writes the resulting JAR file to the stage location specified by the clause’s value. If this
    clause is omitted, Snowflake re-compiles the source code each time the code is needed. In that case, the JAR file is not stored
    permanently, and the user does not need to clean up the JAR file.

    Snowflake returns an error if the TARGET_PATH matches an existing file; you cannot use TARGET_PATH to overwrite an
    existing file.

    If you specify both the IMPORTS and TARGET_PATH clauses, the file name in the TARGET_PATH clause must
    be different from each file name in the IMPORTS clause, even if the files are in different subdirectories or different
    stages.

    The generated JAR file remains until you explicitly delete it, even if you drop the procedure. When you drop the procedure you should
    separately remove the JAR file because the JAR is no longer needed to support the procedure.

    For example, the following TARGET_PATH example would result in a `myhandler.jar` file generated and copied to the
    `handlers` stage.

    ```sqlexample
    TARGET_PATH = '@handlers/myhandler.jar'
    ```

    When you drop this procedure to remove it, you’ll also need to remove its handler JAR file, such as by executing the
    [REMOVE command](remove.md).

    ```sqlexample
    REMOVE @handlers/myhandler.jar;
    ```

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PROCEDURE | Schema | Required to create a permanent stored procedure. Not required when creating a temporary procedure that persists for only the duration of the session in which the procedure was created. |
| USAGE | Procedure | Granting the USAGE privilege on the newly created procedure to a role allows users with that role to call the procedure elsewhere in Snowflake. |
| USAGE | External access integration | Required on integrations, if any, specified when creating the procedure. For more information, see [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md). |
| READ | Secret | Required on secrets, if any, specified when creating the procedure. For more information, see [Creating a secret to represent credentials](../../developer-guide/external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md). |
| USAGE | Schema | Required on schemas containing secrets, if any, specified when creating the procedure. For more information, see [Creating a secret to represent credentials](../../developer-guide/external-network-access/creating-using-external-network-access.md) and [Using the external access integration in a function or procedure](../../developer-guide/external-network-access/creating-using-external-network-access.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

For additional usage notes, see the following.

### All handler languages

* Stored procedures support [overloading](../../developer-guide/udf-stored-procedure-naming-conventions.md). Two procedures can have the same
  name if they have a different number of parameters or different data types for their parameters.
* Stored procedures are not atomic; if one statement in a stored procedure fails, the other statements in the stored
  procedure are not necessarily rolled back. For information about stored procedures and transactions, see
  [Transaction management](../../developer-guide/stored-procedure/stored-procedures-usage.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Tip:**
>
> If your organization uses a mix of caller’s rights and owner’s rights stored procedures, you might want to use a
> naming convention for your stored procedures to indicate whether an individual stored procedure is a caller’s
> rights stored procedure or an owner’s rights stored procedure.

* Setting LOG_LEVEL or TRACE_LEVEL as properties in a CREATE PROCEDURE statement is not supported. To set these properties
  on a procedure, use [ALTER PROCEDURE](alter-procedure.md) after creating the procedure, or use
  CREATE OR ALTER PROCEDURE.

### Java

See the [known limitations](../../developer-guide/stored-procedure/java/procedure-java-limitations.md).

### Javascript

A JavaScript stored procedure can return only a single value, such as a string (for example, a success/failure indicator)
or a number (for example, an error code). If you need to return more extensive information, you can return a
VARCHAR that contains values separated by a delimiter (such as a comma), or a semi-structured data type, such
as [VARIANT](../data-types-semistructured.md).

### Python

See the [known limitations](../../developer-guide/stored-procedure/python/procedure-python-limitations.md).

### Scala

See the [known limitations](../../developer-guide/stored-procedure/scala/procedure-scala-limitations.md).

## CREATE OR ALTER PROCEDURE usage notes

* All limitations of the [ALTER PROCEDURE](alter-procedure.md) command apply.
* All limitations described in [CREATE OR ALTER FUNCTION usage notes](create-function.md) apply.

## Examples

This creates a trivial stored procedure that returns a hard-coded value. This is unrealistic, but shows the basic
SQL syntax with minimal JavaScript code:

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE sp_pi()
    RETURNS FLOAT NOT NULL
    LANGUAGE JAVASCRIPT
    AS
    $$
    return 3.1415926;
    $$
    ;
```

This shows a more realistic example that includes a call to the JavaScript API. A more extensive version of this
procedure could allow a user to insert data into a table that the user didn’t have privileges to insert into directly.
JavaScript statements could check the input parameters and execute the SQL `INSERT` only if certain requirements
were met.

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE stproc1(FLOAT_PARAM1 FLOAT)
    RETURNS STRING
    LANGUAGE JAVASCRIPT
    STRICT
    EXECUTE AS OWNER
    AS
    $$
    var sql_command =
     "INSERT INTO stproc_test_table1 (num_col1) VALUES (" + FLOAT_PARAM1 + ")";
    try {
        snowflake.execute (
            {sqlText: sql_command}
            );
        return "Succeeded.";   // Return a success/error indicator.
        }
    catch (err)  {
        return "Failed: " + err;   // Return a success/error indicator.
        }
    $$
    ;
```

For more examples, see [Working with stored procedures](../../developer-guide/stored-procedure/stored-procedures-usage.md).

### In-line handler

Code in the following example creates a procedure called `my_proc` with an in-line Python handler function `run`. Through
the PACKAGES clause, the code references the included Snowpark library for Python, whose `Session` is required when Python
is the procedure handler language.

```sqlexample-python
CREATE OR REPLACE PROCEDURE my_proc(from_table STRING, to_table STRING, count INT)
  RETURNS STRING
  LANGUAGE PYTHON
  RUNTIME_VERSION = '3.9'
  PACKAGES = ('snowflake-snowpark-python')
  HANDLER = 'run'
AS
$$
def run(session, from_table, to_table, count):
  session.table(from_table).limit(count).write.save_as_table(to_table)
  return "SUCCESS"
$$;
```

### Staged handler

Code in the following example creates a procedure called `my_proc` with a staged Java handler method `MyClass.myMethod`.
Through the PACKAGES clause, the code references the included Snowpark library for Java, whose `Session` is required when Java
is the procedure handler language. With the IMPORTS clause, the code references the staged JAR file containing the handler code.

```sqlexample-java
CREATE OR REPLACE PROCEDURE my_proc(fromTable STRING, toTable STRING, count INT)
  RETURNS STRING
  LANGUAGE JAVA
  RUNTIME_VERSION = '11'
  PACKAGES = ('com.snowflake:snowpark:latest')
  IMPORTS = ('@mystage/myjar.jar')
  HANDLER = 'MyClass.myMethod';
```

## Create and alter a procedure using the CREATE OR ALTER PROCEDURE command

Create an owner’s rights Python stored procedure with external access integrations and default OWNER privileges.

```sqlexample
CREATE OR ALTER PROCEDURE python_add1(A NUMBER)
  RETURNS NUMBER
  LANGUAGE PYTHON
  HANDLER='main'
  RUNTIME_VERSION=3.10
  EXTERNAL_ACCESS_INTEGRATIONS=(example_integration)
  PACKAGES = ('snowflake-snowpark-python')
  EXECUTE AS OWNER
  AS
$$
def main(session, a):
    return a+1
$$;
```

Alter the stored procedure’s secrets and change the stored procedure to a caller’s rights procedure:

```sqlexample
CREATE OR ALTER PROCEDURE python_add1(A NUMBER)
  RETURNS NUMBER
  LANGUAGE PYTHON
  HANDLER='main'
  RUNTIME_VERSION=3.10
  EXTERNAL_ACCESS_INTEGRATIONS=(example_integration)
  secrets=('secret_variable_name'=secret_name)
  PACKAGES = ('snowflake-snowpark-python')
  EXECUTE AS CALLER
  AS
$$
def main(session, a):
    return a+1
$$;
```

---
title: CREATE PROJECTION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-projection-policy.md
section: SQL Commands
---

# CREATE PROJECTION POLICY

Creates a new [projection policy](../../user-guide/projection-policies.md) in the current/specified schema or replaces an existing
projection policy.

After creating a projection policy, apply the projection policy to a table column using an
[ALTER TABLE … ALTER COLUMN](alter-table-column.md) command or a view column using the [ALTER VIEW](alter-view.md) command.

See also:
:   [Projection policy DDL reference](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PROJECTION POLICY [ IF NOT EXISTS ] <name>
  AS () RETURNS PROJECTION_CONSTRAINT -> <body>
  [ COMMENT = '<string_literal>' ]
```

## Parameters

`name`
:   Identifier for the projection policy; must be unique for your schema.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`body`
:   SQL expression that determines whether to project a column.

    The expression can contain CASE and other logic statements, but must call the PROJECTION_CONSTRAINT function:

    ```sqlsyntax
    PROJECTION_CONSTRAINT(ALLOW=>{TRUE|FALSE}, ENFORCEMENT=><enforcement_style>)
    ```

    * `ALLOW` (*boolean*) - TRUE allows the column to be projected. FALSE prevents the column from being projected, with the behavior
      specified by ENFORCEMENT. FALSE affects only columns that appear in the final results table.
    * `ENFORCEMENT` (*string, optional*) - If ALLOW=FALSE, specifies what should happen if a query includes a protected column.
      Supported values:

      + FAIL - The query will fail if a protected column is included in the outermost query.
      + NULLIFY - All rows in the protected column return the value NULL.

      Default: FAIL

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the projection policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE PROJECTION POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on projection policy DDL and privileges, see [Privileges and commands](../../user-guide/projection-policies.md).

## Usage notes

* If you want to update an existing projection policy and need to see the current definition of the policy, run the
  [DESCRIBE PROJECTION POLICY](desc-projection-policy.md) command or [GET_DDL](../functions/get_ddl.md) function.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Do not allow projecting a column:

> ```sqlexample
> CREATE OR REPLACE PROJECTION POLICY do_not_project AS ()
>   RETURNS PROJECTION_CONSTRAINT ->
>   PROJECTION_CONSTRAINT(ALLOW => false);
> ```

Project a column for the `analyst` custom role, otherwise allow the query, but replace all protected column values with NULL:

> ```sqlexample
> CREATE OR REPLACE PROJECTION POLICY project_analyst_only AS ()
>   RETURNS PROJECTION_CONSTRAINT ->
>     CASE
>       WHEN CURRENT_ROLE() = 'ANALYST'
>         THEN PROJECTION_CONSTRAINT(ALLOW => true)
>       ELSE PROJECTION_CONSTRAINT(ALLOW => false, ENFORCEMENT => 'NULLIFY')
>     END;
> ```

---
title: CREATE PROVISIONED THROUGHPUT
source: https://docs.snowflake.com/en/sql-reference/sql/create-provisioned-throughput.md
section: SQL Commands
---

# CREATE PROVISIONED THROUGHPUT

Creates a new [Provisioned Throughput resource](../../user-guide/snowflake-cortex/provisioned-throughput.md) or replaces an existing one.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] PROVISIONED THROUGHPUT <name>
    CLOUD_PROVIDER = '<cloud_provider>'
    MODEL = '<model_name>'
    PTUS = <num_ptus>
    TERM_START = '<start_date>'
    TERM_END = '<end_date>';
```

## Required parameters

`name`
:   String that specifies the identifier (i.e., name) for the provisioned throughput resource; must be unique for the schema in which the resource is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`CLOUD_PROVIDER = 'cloud_provider'`
:   Specifies the cloud provider where the provisioned throughput will be allocated. Supported values are `aws` and `azure`.

`MODEL = 'model_name'`
:   Specifies the model for which the provisioned throughput is being reserved. Supported models include:

    * Mistral Large 2
    * Llama 3.1-405B
    * Llama 3.1-70B
    * Llama 3.1-8B
    * Snowflake-Llama3.3-70B
    * Snowflake-Llama3.3-405B

`PTUS = num_ptus`
:   Specifies the number of provisioned throughput units (PTUs) to allocate. The value must meet the minimum and incremental PTU requirements for the specified model.

`TERM_START = 'start_date'`
:   Specifies the start date of the provisioned throughput term in the format `YYYY-MM-DD`.

`TERM_END = 'end_date'`
:   Specifies the end date of the provisioned throughput term in the format `YYYY-MM-DD`.

## Access Control Requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| CREATE PROVISIONED THROUGHPUT | Account level. |
| USAGE | Schema in which you plan to create the provisioned throughput. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

> **Attention:**
>
> To create a Provisioned Throughput resource, your role must have the CREATE PROVISIONED THROUGHPUT privilege at the account level.

## Usage Notes

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* Provisioned Throughput is subject to minimum and incremental PTU requirements. Ensure that your PTU request meets these requirements for the specified model.
* The term for provisioned throughput starts and ends at 8:00 a.m. PT on the specified dates.
* Provisioned Throughput does not renew automatically. To reserve throughput for another term, create a new provisioned throughput resource.

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Example

Create a provisioned throughput resource named `my_pt` for the `Llama 3.1-8B` model on AWS, allocating 64 PTUs for a term from April 15, 2025, to May 15, 2025:

```sqlexample
CREATE PROVISIONED THROUGHPUT my_pt
    CLOUD_PROVIDER = 'aws'
    MODEL = 'llama3.1-8B'
    PTUS = 64
    TERM_START = '2025-04-15'
    TERM_END = '2025-05-15';
```

Replace an existing provisioned throughput resource named `my_pt` with updated PTUs and term dates:

```sqlexample
CREATE OR REPLACE PROVISIONED THROUGHPUT my_pt
    CLOUD_PROVIDER = 'aws'
    MODEL = 'llama3.1-8B'
    PTUS = 128
    TERM_START = '2025-06-01'
    TERM_END = '2025-07-01';
```

---
title: CREATE REPLICATION GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/create-replication-group.md
section: SQL Commands
---

# CREATE REPLICATION GROUP

Creates a new [replication group](../../user-guide/account-replication-intro.md) of specified objects in the system.

For more information about using replication groups, see [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md).

This command can be used to:

* Create a replication group in the source account to enable replication of specified objects to a target account in the same
  organization.
* Create a secondary replication group in a target account as a replica of the primary replication group in the source account in the
  same organization.

See also:
:   [ALTER REPLICATION GROUP](alter-replication-group.md) , [DROP REPLICATION GROUP](drop-replication-group.md) , [SHOW REPLICATION GROUPS](show-replication-groups.md)

## Syntax

```sqlsyntax
CREATE REPLICATION GROUP [ IF NOT EXISTS ] <name>
    OBJECT_TYPES = <object_type> [ , <object_type> , ... ]
    [ ALLOWED_DATABASES = <db_name> [ , <db_name> , ... ] ]
    [ ALLOWED_EXTERNAL_VOLUMES = <external_volume_name> [ , <external_volume_name> , ... ] ]
    [ ALLOWED_SHARES = <share_name> [ , <share_name> , ... ] ]
    [ ALLOWED_INTEGRATION_TYPES = <integration_type_name> [ , <integration_type_name> , ... ] ]
    ALLOWED_ACCOUNTS = <org_name>.<target_account_name> [ , <org_name>.<target_account_name> , ... ]
    [ IGNORE EDITION CHECK ]
    [ REPLICATION_SCHEDULE = '{ <num> MINUTE | USING CRON <expr> <time_zone> }' ]
    [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
    [ ERROR_INTEGRATION = <integration_name> ]
```

**Secondary Replication Group**

```sqlsyntax
CREATE REPLICATION GROUP [ IF NOT EXISTS ] <secondary_name>
    AS REPLICA OF <org_name>.<source_account_name>.<name>
```

## Parameters

`name`
:   Specifies the identifier for the replication group. The identifier must start with an alphabetic character and cannot contain spaces or
    special characters unless the identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double
    quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`OBJECT_TYPES = object_type [ , object_type , ... ]`
:   Type(s) of objects for which you are enabling replication from the source account to the target account.

    The following object types are supported:

    > ACCOUNT PARAMETERS:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All account-level parameters. This includes [account parameters](../parameters.md) and parameters that can be
    >     [set for your account](../../user-guide/admin-account-management.md).
    >
    > DATABASES:
    > :   Add database objects to the list of object types. If database objects are included in the list of specified object types, the
    >     `ALLOWED_DATABASES` parameter must be set.
    >
    > EXTERNAL VOLUMES:
    > :   Add external volume objects to the list of object types. If external volume objects are included in the list of specified object types,
    >     the `ALLOWED_EXTERNAL_VOLUMES` parameter must be set.
    >
    > INTEGRATIONS:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     Currently, only security, API, storage, external access, and certain types of notification integrations are supported.
    >     For details, see
    >     [Integration replication](../../user-guide/account-replication-intro.md).
    >
    >     If integration objects are included in the list of specified object types, the
    >     `ALLOWED_INTEGRATION_TYPES` parameter must be set.
    >
    > NETWORK POLICIES:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All network policies in the source account.
    >
    > RESOURCE MONITORS:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All resource monitors in the source account.
    >
    > ROLES:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All roles in the source account. Replicating roles implicitly includes all grants for object types included in the replication group.
    >     For example, if `ROLES` is the only object type that is replicated, then only hierarchies of roles (that is, roles granted to
    >     other roles) are replicated to target accounts. If the `USERS` object type is also included, then role grants to users are
    >     also replicated.
    >
    > SHARES:
    > :   Add share objects to the list of object types. If share objects are included in the list of specified object types, the
    >     `ALLOWED_SHARES` parameter must be set.
    >
    > USERS:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All users in the source account.
    >
    > WAREHOUSES:
    > :   *Requires Business Critical Edition (or higher).*
    >
    >     All warehouses in the source account.

    > **Note:**
    >
    > If you replicate users and roles, programmatic access tokens for users are replicated automatically.

    To modify the list of replicated object types to a specified target account, use [ALTER REPLICATION GROUP](alter-replication-group.md)
    to reset the list of object types.

`ALLOWED_DATABASES = db_name [ , db_name , ... ]`
:   Specifies the database or list of databases for which you are enabling replication from the source account to the target account.
    For you to set this parameter, the `OBJECT_TYPES` list must include `DATABASES`.

`ALLOWED_EXTERNAL_VOLUMES = external_volume_name [ , external_volume_name , ... ]`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the external volume or list of external volumes for which you are enabling replication from the source account to the target
    account. For you to set this parameter, the `OBJECT_TYPES` list must include `EXTERNAL VOLUMES`.

`ALLOWED_SHARES = share_name [ , share_name , ... ]`
:   Specifies the share or list of shares for which you are enabling replication from the source account to the target account.
    For you to set this parameter, the `OBJECT_TYPES` list must include `SHARES`.

`ALLOWED_INTEGRATION_TYPES = integration_type_name [ , integration_type_name , ... ]`
:   *Requires Business Critical Edition (or higher).*

    Type(s) of integrations for which you are enabling replication from the source account to the target account.

    > This property requires that the `OBJECT_TYPES` list include `INTEGRATIONS` to set this parameter.
    >
    > The following integration types are supported:
    >
    > > SECURITY INTEGRATIONS:
    > > :   Specifies security integrations.
    > >
    > >     This property requires that the `OBJECT_TYPES` list include `ROLES`.
    > >
    > > API INTEGRATIONS:
    > > :   Specifies API integrations.
    > >
    > >     API integration replication requires additional set up after the API integration is replicated to the target account.
    > >     For more information, see [Updating the remote service for API integrations](../../user-guide/account-replication-config.md).
    > >
    > > STORAGE INTEGRATIONS:
    > > :   Specifies storage integrations.
    > >
    > > EXTERNAL ACCESS INTEGRATIONS:
    > > :   Specifies [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md).
    > >
    > >     For more information, see [Replication of stored procedures and user-defined functions (UDFs)](../../user-guide/account-replication-considerations.md).
    > >
    > > NOTIFICATION INTEGRATIONS:
    > > :   Specifies notification integrations.
    > >
    > >     Only some types of notification integrations are replicated. For details, see
    > >     [Integration replication](../../user-guide/account-replication-intro.md).

`ALLOWED_ACCOUNTS = org_name.target_account_name1 [ , org_name.target_account_name2 , ... ]`
:   Specifies the target account or list of target accounts to which replication of specified objects from the source account is
    enabled.

    `org_name`
    :   Name of your Snowflake organization.

    `target_account_name`
    :   Target account to which you are enabling replication of the specified objects.

`IGNORE EDITION CHECK`
:   Allows replicating objects to accounts on lower editions in either of the following scenarios:

    * A primary replication group with only database and/or share objects is in a Business Critical (or higher) account but
      one or more accounts approved for replication are on lower editions. Business Critical Edition is intended for Snowflake accounts
      with extremely sensitive data.
    * A primary replication group with any object type is in a Business
      Critical (or higher) account and a signed business associate agreement is in place to store PHI data in the account per HIPAA and
      [HITRUST](../../user-guide/intro-cloud-platforms.md) regulations. However, no such agreement is in place for one or more of the accounts approved
      for replication, regardless if they are Business Critical (or higher) accounts.

    Both scenarios are prohibited by default in an effort to help prevent account administrators for Business Critical (or higher) accounts
    from inadvertently replicating sensitive data to accounts on lower editions.

`REPLICATION_SCHEDULE ...`
:   Specifies the schedule for refreshing secondary replication groups.

    * `USING CRON expr time_zone`
      :   Specifies a cron expression and time zone for the secondary group refresh. Supports a subset of standard cron utility syntax.

          For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
          (in Wikipedia).

          The cron expression consists of the following fields:

          ```output
          # __________ minute (0-59)
          # | ________ hour (0-23)
          # | | ______ day of month (1-31, or L)
          # | | | ____ month (1-12, JAN-DEC)
          # | | | | __ day of week (0-6, SUN-SAT, or L)
          # | | | | |
          # | | | | |
            * * * * *
          ```

          The following special characters are supported:

          `*`
          :   Wildcard. Specifies any occurrence of the field.

          `L`
          :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a
              given month. In the day-of-month field, it specifies the last day of the month.

          `/n`
          :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
              specified in the month field, then the refresh is scheduled for April, July and October (i.e. every 3 months, starting with the 4th
              month of the year). The same schedule is maintained in subsequent years. That is, the refresh is not scheduled to run in
              January (3 months after the October run).

          > **Note:**
          > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
          >   for the account (or setting the value at the user or session level) does not change the time zone for the refresh.
          > + The cron expression defines all valid run times for the refresh. Snowflake attempts to refresh secondary groups based on
          >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
          > + When both a specific day of month and day of week are included in the cron expression, then the refresh is scheduled on days
          >   satisfying either the day of month or day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
          >   schedules a refresh at 0AM on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
    * `num MINUTE`
      :   Specifies an interval (in minutes) of wait time between refreshes. Accepts positive integers only.

          Also supports `num M` syntax.

          To avoid ambiguity, a *base interval time* is set:

          + When the object is created (using CREATE <object>) or
          + When a different interval is set (using ALTER <object> … SET REPLICATION_SCHEDULE)

          The base interval time starts the interval counter from the current clock time. For example, if an INTERVAL value of `10` is set and
          the scheduled refresh is enabled at 9:03 AM, then the refresh runs at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to
          ensure absolute precision, but only guarantee that refreshes do not execute before their set interval occurs (e.g. in the
          current example, the refresh could first run at 9:14 AM, but will definitely not run at 9:12 AM).

          > **Note:**
          >
          > The maximum supported value is `11520` (8 days). If the replication schedule has a greater `num MINUTE` value, the
          > refresh operation never runs.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`ERROR_INTEGRATION = integration_name`
:   Specifies the name of the notification integration to use to email/push notifications when refresh errors occur for the replication
    group. For more details, see [Error notifications for replication and failover groups](../../user-guide/account-replication-error-notifications.md).

**Secondary Replication Group Parameters**

`secondary_name`
:   Specifies the identifier for the secondary replication group. The identifier must start with an alphabetic character and cannot contain
    spaces or special characters unless the identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in
    double quotes are also case-sensitive. For more details, see [Identifier requirements](../identifiers-syntax.md).

    The identifiers for the secondary replication group (`secondary_name`) and primary replication group (`name`) can be, but
    are not required to be, identical.

`AS REPLICA OF org_name.source_account_name.name`
:   Specifies the identifier of the primary replication group from which to create a secondary replication group.

    `org_name`
    :   Name of your Snowflake organization.

    `source_account_name`
    :   Source account from which you are enabling replication of the specified objects.

    `name`
    :   Identifier for the primary replication group in the source account.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE REPLICATION GROUP | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| MONITOR | Database | To add a database to a replication group, the active role must have the MONITOR privilege on the database. |
| USAGE | External volume | To add an external volume to a replication group, the active role must have the USAGE privilege on the external volume. |
| OWNERSHIP | Share | To add a share to a replication group, the active role must have the OWNERSHIP privilege on the share. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Identifiers for failover groups and replication groups in an account must be unique.
* A database can only be added to one replication or failover group.
* An external volume can only be added to one replication or failover group.
* [Inbound shares](../../user-guide/data-share-consumers.md) (shares from providers) *cannot* be added to a replication or failover group.
* To retrieve the set of accounts in your organization that are enabled for replication, use
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).
* To retrieve the list of replication and failover groups in your organization, use [SHOW REPLICATION GROUPS](show-replication-groups.md).
  The `allowed_accounts` column lists all target accounts enabled for object replication from a source account.
* If there are account objects (for example, users or roles) in a target account that you do not want to drop during replication,
  use the [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](../functions/system_link_account_objects_by_name.md) system function to apply a global identifier to objects
  created by means other than replication. For more information, see
  [Apply Global IDs to Objects Created by Scripts in Target Accounts](../../user-guide/account-replication-config.md) before
  you create a replication group.
* Automatically [scheduled refresh operations](../../user-guide/account-replication-intro.md) are executed using the role with the OWNERSHIP
  privilege on the group. If a scheduled refresh operation fails due to insufficient privileges, grant the required privileges
  to the role with the OWNERSHIP privilege on the group.

* If you create a replication or failover group with a tag or modify a replication or failover group by setting a tag on it,
  [tag inheritance](../../user-guide/object-tagging/inheritance.md) does not apply to any objects that you specify in the replication or failover group.

  Tag inheritance is only applicable to objects with a [parent-child relationship](../../user-guide/security-access-control-overview.md), such
  database, schema, and table. There are no child objects of replication or failover groups.
* You cannot set a tag or modify a tag on a secondary replication or failover group because these objects are read
  only.
* When you refresh a secondary replication or failover group, any tags that are set on the primary group are then set on
  the secondary group.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Replicate a single database

**Executed on the source account**

Create a replication group named `myrg` in the source account to enable replication of database `db1` from the source
account to the `myaccount2` account. Set the replication schedule to refresh the database every 10 minutes:

```sqlexample
CREATE REPLICATION GROUP myrg
    OBJECT_TYPES = DATABASES
    ALLOWED_DATABASES = db1
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

**Executed on target account**

Create a replication group in the target account as a replica of the replication group `myrg` in the source account:

```sqlexample
CREATE REPLICATION GROUP myrg
    AS REPLICA OF myorg.myaccount1.myrg;
```

### Replicate a database and share objects

**Executed on source account**

Create a replication group named `myrg` in the source account to enable replication of database `db1`, and share `s1` from the source
account to the `myaccount2` account. Set the replication schedule to refresh automatically every 10 minutes:

```sqlexample
CREATE REPLICATION GROUP myrg
    OBJECT_TYPES = DATABASES, SHARES
    ALLOWED_DATABASES = db1
    ALLOWED_SHARES = s1
    ALLOWED_ACCOUNTS = myorg.myaccount2
    REPLICATION_SCHEDULE = '10 MINUTE';
```

**Executed on target account**

Create a replication group in the target account as a replica of the replication group `myrg` in the source account:

```sqlexample
CREATE REPLICATION GROUP myrg
    AS REPLICA OF myorg.myaccount1.myrg;
```

### Replicate account objects

For examples of multiple database and account object replication, see the
[examples for CREATE FAILOVER GROUP](create-failover-group.md).

---
title: CREATE RESOURCE MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/create-resource-monitor.md
section: SQL Commands
---

# CREATE RESOURCE MONITOR

Creates a new [resource monitor](../../user-guide/resource-monitors.md). This command can only be executed by account administrators.

See also:
:   [ALTER RESOURCE MONITOR](alter-resource-monitor.md) , [DROP RESOURCE MONITOR](drop-resource-monitor.md) , [SHOW RESOURCE MONITORS](show-resource-monitors.md) , [ALTER WAREHOUSE](alter-warehouse.md) , [ALTER ACCOUNT](alter-account.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] RESOURCE MONITOR [ IF NOT EXISTS ] <name> WITH
                      [ CREDIT_QUOTA = <number> ]
                      [ FREQUENCY = { MONTHLY | DAILY | WEEKLY | YEARLY | NEVER } ]
                      [ START_TIMESTAMP = { <timestamp> | IMMEDIATELY } ]
                      [ END_TIMESTAMP = <timestamp> ]
                      [ NOTIFY_USERS = ( <user_name> [ , <user_name> , ... ] ) ]
                      [ TRIGGERS triggerDefinition [ triggerDefinition ... ] ]
```

Where:

> ```sqlsyntax
> triggerDefinition ::=
>     ON <threshold> PERCENT DO { SUSPEND | SUSPEND_IMMEDIATE | NOTIFY }
> ```

## Required parameters

`name`
:   Identifier for the resource monitor; must be unique for your account.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier string
    is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`CREDIT_QUOTA = num`
:   The number of credits allocated to the resource monitor per frequency interval. When total usage for all warehouses assigned to the
    monitor reaches this number for the current frequency interval, the resource monitor is considered to be at 100% of quota.

    If a value is not specified for a resource monitor, the monitor has no quota and will never reach 100% usage within the specified interval.

    Default: No value (i.e. no credit quota)

`FREQUENCY = MONTHLY | DAILY | WEEKLY | YEARLY | NEVER`
:   The frequency interval at which the credit usage resets to `0`.

    If you set a frequency for a resource monitor, you must also set `START_TIMESTAMP`.

    If you specify `NEVER` for the frequency, the credit usage for the warehouse does not reset.

    Default: No value (i.e. legacy behavior, whereby the credit quota resets at the beginning of each calendar month)

`START_TIMESTAMP = timestamp | IMMEDIATELY`
:   The date and time when the resource monitor starts monitoring credit usage for the assigned warehouses.

    If you set a timestamp for a resource monitor, you must also set `FREQUENCY`.

    If you specify `IMMEDIATELY` for the start timestamp, the current timestamp is used.

    If you specify a date without a time, the current time is used.

    If you set a time without specifying a time zone, UTC is used as the default time zone.

    Default: No value (i.e. legacy behavior, whereby the resource monitor starts monitoring warehouses immediately)

`END_TIMESTAMP = timestamp`
:   The date and time when the resource monitor suspends the assigned warehouses.

    Default: No value (i.e. no warehouse suspension date)

`NOTIFY_USERS = ( user_name [ , user_name , ... ] )`
:   Specifies the list of users to receive email notifications on resource monitors. If a user identifier includes spaces or special
    characters or is case-sensitive, then the identifier must be enclosed in double quotes (e.g. “Mary Smith”). See
    [Identifier requirements](../identifiers-syntax.md) for details.

    The user identifier, `user_name`, is the value of the `name` column from the output of
    [SHOW USERS](show-users.md).

    Each user listed must have a verified email address. For instructions on verifying email addresses in the web interface, see [Verify your email address](../../user-guide/ui-support.md).

    Email notifications for non-administrator users do not supersede email notifications for administrators. Any account administrators that
    have [enabled email notifications](../../user-guide/resource-monitors.md) will continue to receive email notifications.

    > **Note:**
    >
    > * The following limitations apply for non-administrator users:
    >
    >   + Non-administrator users can only receive [notifications](../../user-guide/resource-monitors.md)
    >     for [warehouse monitors](../../user-guide/resource-monitors.md).
    >   + Non-administrator users are notified by email but can’t see notifications in Snowsight.
    >   + Non-administrator users can’t create resource monitors.
    >   + Non-administrator users can’t assign other users to be notified.

`TRIGGERS ...` (aka *actions*)
:   Specifies one or more triggers for the resource monitor. Each trigger definition consists of the following:

    > `ON threshold PERCENT`
    > :   A numeric value specified as a percentage of the credit quota for the resource monitor; values larger than `100` are supported.
    >     Once usage reaches this threshold for the current frequency interval, the trigger fires.
    >
    > `DO SUSPEND | SUSPEND_IMMEDIATE | NOTIFY`
    > :   Specifies the action performed by the trigger when the threshold is reached:
    >
    >     * `SUSPEND`: Suspend all assigned warehouses while allowing currently running queries to complete. No new queries can be executed
    >       by the warehouses until the credit quota for the resource monitor is increased. In addition, this action sends a notification to all
    >       users who have enabled notifications for themselves.
    >     * `SUSPEND_IMMEDIATE`: Suspend all assigned warehouses immediately and cancel any currently running queries or statements using
    >       the warehouses. In addition, this action sends a notification to all users who have enabled notifications for themselves.
    >     * `NOTIFY`: Send a notification (to all account administrators with notifications enabled), but do not take any other action.

    Default: No value (i.e. resource monitor performs no actions)

## Usage notes

* Triggers are optional; however, at least one trigger must be added to a resource monitor before it can perform any actions.
* Each resource monitor supports up to a maximum of 5 `NOTIFY` action triggers.
* After a resource monitor is created, it must be assigned to a warehouse or account before it can perform any monitoring actions:

  + To assign a warehouse to a resource monitor, use [ALTER WAREHOUSE](alter-warehouse.md) (or [CREATE WAREHOUSE](create-warehouse.md) if you are creating the warehouse).
  + To assign a resource monitor at the account level, use [ALTER ACCOUNT](alter-account.md). The NOTIFY_USERS parameter must be null.
* To view all resource monitors created in your account and their assignment, use the [SHOW RESOURCE MONITORS](show-resource-monitors.md) command. The command
  output displays `NULL` in the `level` column for resource monitors that are not assigned to the account or any warehouses
  and, therefore, are not monitoring any credit usage.
* If `frequency` and `start_timestamp` parameters are set on a resource monitor, the day for the credit usage reset is
  calculated based on those parameters. The time the credit usage resets to `0` is 12:00 AM UTC regardless of the time specified in
  `start_timestamp`.
* If you specify an `end_timestamp`, monitoring ends at that specified date and time and all assigned warehouses are suspended
  at that date and time even if the credit quota has not been reached.

  When this occurs, a notification is sent that states the resource monitor has reached a percentage of its quota and has triggered a
  suspend immediate action. The percentage of the quota reflects the number of credits used in the current interval up to the end date
  and might not be a threshold you specified.
* If there are non-administrator users in the notification list, the following notes apply:

  + If any user in the notification list does not have a [verified email](../../user-guide/notifications/email-notifications.md),
    the SQL statement fails.
  + If any user in the notification list changes their email address and does not verify the new email address, the
    notification silently fails.
  + The notification list is limited to a maximum number of 5 non-administrator users.
  + Account administrators can view the notification list of non-administrator users in the output of
    [SHOW RESOURCE MONITORS](show-resource-monitors.md) in the `notify_user` column.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

> **Important:**
>
> To receive notifications generated by resource monitors, account administrators and non-administrator users in the notification
> list must explicitly enable notifications in their preferences. In addition, to receive email notifications, users must have a
> verified email in their preferences. Preferences can only
> be set in the Snowflake web interface. For more information, see [Enabling receipt of notifications](../../user-guide/resource-monitors.md).

## Examples

Create a resource monitor named `limiter` with 3 triggers:

> ```sqlexample
> CREATE OR REPLACE RESOURCE MONITOR limiter
>   WITH CREDIT_QUOTA = 5000
>   TRIGGERS ON 75 PERCENT DO NOTIFY
>            ON 100 PERCENT DO SUSPEND
>            ON 110 PERCENT DO SUSPEND_IMMEDIATE;
> ```

Create a resource monitor to send notifications to three users when 75% of the credit quota is reached. In this example, the
`user_name` for two of the users includes a space and must be enclosed in double quotes:

> ```sqlexample
> CREATE OR REPLACE RESOURCE MONITOR limiter
>   WITH CREDIT_QUOTA = 5000
>        NOTIFY_USERS = (JDOE, "Jane Smith", "John Doe")
>   TRIGGERS ON 75 PERCENT DO NOTIFY
>            ON 100 PERCENT DO SUSPEND
>            ON 110 PERCENT DO SUSPEND_IMMEDIATE;
> ```

---
title: CREATE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-role.md
section: SQL Commands
---

# CREATE ROLE

Create a new role or replace an existing role in the system.

After creating roles, you can grant object privileges to the role and then grant the role to other roles or individual users to enable
access control security for objects in the system.

This command supports the following variants:

* CREATE OR ALTER ROLE: Creates a role if it doesn’t exist or alters an existing role.

See also:
:   [GRANT <privileges> … TO ROLE](grant-privilege.md), [GRANT ROLE](grant-role.md) , [GRANT OWNERSHIP](grant-ownership.md) , [DROP ROLE](drop-role.md) , [ALTER ROLE](alter-role.md) , [SHOW ROLES](show-roles.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] ROLE [ IF NOT EXISTS ] <name>
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

## Variant syntax

### CREATE OR ALTER ROLE

Creates a new role if it doesn’t already exist, or transforms an existing role into the role defined in the statement.
A CREATE OR ALTER ROLE statement follows the syntax rules of a CREATE ROLE statement and has the same limitations as an
[ALTER ROLE](alter-role.md) statement.

```sqlsyntax
CREATE OR ALTER ROLE <name>
  [ COMMENT = '<string_literal>' ]
```

For more information, see CREATE OR ALTER ROLE usage notes.

## Required parameters

`name`
:   Identifier for the role; must be unique for your account.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the role.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ROLE | Account | Only the USERADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| OWNERSHIP | Database role | Required to execute a CREATE OR ALTER ROLE statement for an *existing* role.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER ROLE usage notes

* All limitations of the [ALTER ROLE](alter-role.md) command apply.
* Setting or unsetting a tag is not supported; however, existing tags are not altered by a CREATE OR ALTER ROLE statement and remain
  unchanged.

## Examples

```sqlexample
CREATE ROLE myrole;
```

---
title: CREATE ROW ACCESS POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-row-access-policy.md
section: SQL Commands
---

# CREATE ROW ACCESS POLICY

Creates a new row access policy in the current/specified schema or replaces an existing row access policy.

After creating a row access policy, add the policy to a table using an [ALTER TABLE](alter-table.md) command or a view using an [ALTER VIEW](alter-view.md)
command.

See also:
:   [Row access policy DDL](../../user-guide/security-row-intro.md)

## Syntax

Snowflake supports the following syntax to create a row access policy.

```sqlsyntax
CREATE [ OR REPLACE ] ROW ACCESS POLICY [ IF NOT EXISTS ] <name> AS
( <arg_name> <arg_type> [ , ... ] ) RETURNS BOOLEAN -> <body>
[ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the row access policy; must be unique for your schema.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md)

`AS ( <arg_name> <arg_type> [ , ... ] )`
:   The signature for the row access policy.

    A signature specifies a set of attributes that must be considered to determine whether the row is accessible. The attribute values come
    from the database object (e.g. table or view) to be protected by the row access policy.

`RETURNS BOOLEAN`
:   A row access policy must evaluate to true or false. A user that queries a table protected by a row access policy sees rows in the output
    based on how the `body` is written.

`body`
:   SQL expression that operates on the argument values in the signature to determine which rows to return for a query on a table that is
    protected by a row access policy.

    The `body` can be any boolean-valued SQL expression. Snowflake supports expressions that invoke
    [User-defined functions overview](../../developer-guide/udf/udf-overview.md), [Writing external functions](../external-functions.md), and expressions that use sub-queries.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the row access policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE ROW ACCESS POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* Including one or more [subqueries](../../user-guide/querying-subqueries.md) in the policy body may cause errors. When possible, limit the
  number of subqueries, limit the number of JOIN operations, and simplify WHERE clause conditions.
* If a database object has both a row access policy and one or more [masking policy](../../user-guide/security-column-intro.md), the row access
  policy is evaluated first.

  For more information on row access policies during query runtime, see [Understanding row access policies](../../user-guide/security-row-intro.md).
* A given table or view column can be specified in either a masking policy signature or a row access policy signature. In other words, the
  same column cannot be specified in both a masking policy signature and a row access policy signature at the same time.

  For more information, see [CREATE MASKING POLICY](create-masking-policy.md).
* You cannot change the policy signature (i.e. argument name or input/output data type) using
  CREATE OR REPLACE ROW ACCESS POLICY if the policy is attached to a table or view, or using
  [ALTER ROW ACCESS POLICY](alter-row-access-policy.md). If you need to change the signature, execute a
  [DROP ROW ACCESS POLICY](drop-row-access-policy.md) statement on the policy and create a new row access policy.
* If the policy `body` contains a mapping table lookup, create a centralized mapping table and store the mapping table
  in the same database as the protected table. This is particularly important if the `body` calls the
  [IS_DATABASE_ROLE_IN_SESSION](../functions/is_database_role_in_session.md) function. For details, see the function usage notes.
* A data sharing provider cannot create a row access policy in a [reader account](../../user-guide/data-sharing-reader-create.md).
* If you specify the [CURRENT_DATABASE](../functions/current_database.md) or [CURRENT_SCHEMA](../functions/current_schema.md) function in the
  body of a masking or row access policy, the function returns the database or schema that contains the protected table, not the database or
  schema in use for the session.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

These examples use the [CURRENT_ROLE](../functions/current_role.md) context function. If role activation and role hierarchy is
necessary in the policy conditions, use [IS_ROLE_IN_SESSION](../functions/is_role_in_session.md).

The following row access policy allows users whose CURRENT_ROLE is the `it_admin` custom role to see rows that contain the
employee ID number (i.e. `empl_id`) in the query result.

> ```sqlexample
> create or replace row access policy rap_it as (empl_id varchar) returns boolean ->
>   case
>       when 'it_admin' = current_role() then true
>       else false
>   end
> ;
> ```

The following row access policy allows users to view rows in the query result if either of the following two conditions are true:

1. The current role is the `sales_executive_role` custom role. Call the [CURRENT_ROLE](../functions/current_role.md) function to
   determine the current role.
2. The current role is the `sales_manager` custom role and the query specifies a `sales_region` that corresponds to the
   `salesmanageregions` mapping table.

> ```sqlexample
> use role securityadmin;
>
> create or replace row access policy rap_sales_manager_regions_1 as (sales_region varchar) returns boolean ->
>   'sales_executive_role' = current_role()
>       or exists (
>             select 1 from salesmanagerregions
>               where sales_manager = current_role()
>                 and region = sales_region
>           )
> ;
> ```
>
> Where:
>
> > `rap_sales_manager_regions_1`
> > :   The name of the row access policy.
> >
> > `as (sales_region varchar)`
> > :   The signature for the row access policy.
> >
> >     A signature specifies a set of attributes that must be considered to determine whether the row is accessible. The attribute values
> >     come from the table to be protected by the row access policy.
> >
> > `returns boolean ->`
> > :   Specifies the application of the row access policy.
> >
> >     Note that the `<expression>` of the row access policy immediately follows the right-arrow (i.e. `->`).
> >
> >     The expression can be any boolean-valued SQL expression. Snowflake supports expressions that invoke UDFs, External Functions, and
> >     expressions that use subqueries.
> >
> > `'sales_executive_role' = current_role()`
> > :   The first condition of the row access policy expression that allows users with the sales_executive_role custom role to view data.
> >
> > `or exists (select 1 from salesmanagerregions where sales_manager = current_role() and region = sales_region)`
> > :   The second condition of the row access policy expression that uses a subquery.
> >
> >     The subquery requires the [CURRENT_ROLE](../functions/current_role.md) to be the sales_manager custom role with the executed query on
> >     the data to specify a region listed in the `salesmanagerregions` mapping table.

The following row access policy specifies two attributes in the policy signature:

> ```sqlexample
> create or replace row access policy rap_test2 as (n number, v varchar)
>   returns boolean -> true;
> ```
>
> Where:
>
> > `rap_test2`
> > :   The name of the row access policy.
> >
> > `(n number, v varchar)`
> > :   The signature for the row access policy.
> >
> >     A signature specifies a set of attributes that must be considered to determine whether the row is accessible. The attribute values
> >     come from the table to be protected by the row access policy.
> >
> > `returns boolean -> true`
> > :   Determines the application of the row access policy.
> >
> >     The returned value determines whether the user has access to a given row on the database object to which the row access policy is
> >     added.

For additional examples, see [Use row access policies](../../user-guide/security-row-using.md).

---
title: CREATE SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/create-schema.md
section: SQL Commands
---

# CREATE SCHEMA

Creates a new schema in the current database.

This command supports the following variants:

* CREATE OR ALTER SCHEMA: Creates a schema if it doesn’t exist or alters an existing schema.
* CREATE SCHEMA … CLONE: Creates a clone of an existing schema, either at its current state or at a specific
  time/point in the past (using Time Travel). For more information about cloning a schema, see [Cloning considerations](../../user-guide/object-clone.md).
* CREATE SCHEMA … FROM BACKUP SET (restores a schema from a backup under a new name)

See also:
:   [ALTER SCHEMA](alter-schema.md) , [DESCRIBE SCHEMA](desc-schema.md) , [DROP SCHEMA](drop-schema.md) , [SHOW SCHEMAS](show-schemas.md) , [UNDROP SCHEMA](undrop-schema.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ TRANSIENT ] SCHEMA [ IF NOT EXISTS ] <name>
  [ CLONE <source_schema>
      [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
      [ IGNORE TABLES WITH INSUFFICIENT DATA RETENTION ]
      [ IGNORE HYBRID TABLES ] ]
  [ WITH MANAGED ACCESS ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ EXTERNAL_VOLUME = <external_volume_name> ]
  [ CATALOG = <catalog_integration_name> ]
  [ ICEBERG_VERSION_DEFAULT = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ CLASSIFICATION_PROFILE = '<classification_profile>' ]
  [ COMMENT = '<string_literal>' ]
  [ CATALOG_SYNC = '<snowflake_open_catalog_integration_name>' ]
  [ OBJECT_VISIBILITY = PRIVILEGED ]
  [ ENABLE_DATA_COMPACTION = { TRUE | FALSE } ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
```

**Restored schema (from a backup)**

```sqlsyntax
CREATE SCHEMA <name> FROM BACKUP SET <backup_set> IDENTIFIER '<backup_id>'
```

## Variant syntax

### CREATE OR ALTER SCHEMA

Creates a new schema if it doesn’t already exist, or transforms an existing schema into the schema defined in the statement.
A CREATE OR ALTER SCHEMA statement follows the syntax rules of a CREATE SCHEMA statement and has the same limitations as an
[ALTER SCHEMA](alter-schema.md) statement.

For more information, see CREATE OR ALTER SCHEMA usage notes.

```sqlsyntax
CREATE OR ALTER [ TRANSIENT ] SCHEMA <name>
  [ WITH MANAGED ACCESS ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ EXTERNAL_VOLUME = <external_volume_name> ]
  [ CATALOG = <catalog_integration_name> ]
  [ ICEBERG_VERSION_DEFAULT = <integer> ]
  [ ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE } ]
  [ REPLACE_INVALID_CHARACTERS = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ LOG_LEVEL = '<log_level>' ]
  [ TRACE_LEVEL = '<trace_level>' ]
  [ STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED } ]
  [ COMMENT = '<string_literal>' ]
  [ OBJECT_VISIBILITY = PRIVILEGED ]
```

### CREATE SCHEMA … CLONE

Creates a new schema with the same parameter values:

> ```sqlsyntax
> CREATE [ OR REPLACE ] SCHEMA [ IF NOT EXISTS ] <name> CLONE <source_schema>
>   [ ... ]
> ```

For more details, see [CREATE <object> … CLONE](create-clone.md).

## Required parameters

`name`
:   Specifies the identifier for the schema; must be unique for the database in which the schema is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`TRANSIENT`
:   Specifies a schema as transient. Transient schemas do not have a Fail-safe period so they do not incur additional storage costs once
    they leave Time Travel; however, this means they are also not protected by Fail-safe in the event of a data loss. For more information,
    see [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md).

    In addition, by definition, all tables created in a transient schema are transient. For more information about transient tables, see
    [CREATE TABLE](create-table.md).

    Default: No value (i.e. schema is permanent)

`CLONE source_schema`
:   Specifies to create a clone of the specified source schema. For more details about cloning a schema, see [CREATE <object> … CLONE](create-clone.md).

`AT | BEFORE ( TIMESTAMP => timestamp | OFFSET => time_difference | STATEMENT => id )`
:   When cloning a schema, the [AT | BEFORE](../constructs/at-before.md) clause specifies to use Time Travel to clone the schema at or
    before a specific point in the past.

`IGNORE TABLES WITH INSUFFICIENT DATA RETENTION`
:   Ignore tables that no longer have historical data available in Time Travel to clone. If the time in the past specified in the
    AT | BEFORE clause is beyond the data retention period for any child table in a database or schema, skip the cloning operation
    for the child table. For more information, see
    [Child Objects and Data Retention Time](../../user-guide/object-clone.md).

`IGNORE HYBRID TABLES`
:   Ignore hybrid tables, which will not be cloned. Use this option to clone a schema that contains hybrid tables.
    The cloned schema includes other objects but skips hybrid tables.

    If you don’t use this option and your schema contains one or more hybrid tables, the command ignores hybrid tables silently. However, the error handling for schemas that contain hybrid tables will change in an upcoming release; therefore, you may want to add this parameter to your commands preemptively.

`WITH MANAGED ACCESS`
:   Specifies a managed schema. Managed access schemas centralize privilege management with the schema owner.

    In regular schemas, the owner of an object (i.e. the role that has the OWNERSHIP privilege on the object) can grant further privileges
    on their objects to other roles. In managed schemas, the schema owner manages all privilege grants, including
    [future grants](../../user-guide/security-access-control-configure.md), on objects in the schema. Object owners retain the OWNERSHIP
    privileges on the objects; however, only the schema owner can manage privilege grants on the objects.

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the number of days for which Time Travel actions (CLONE and UNDROP) can be performed on the schema, as well as specifying the
    default Time Travel retention time for all tables created in the schema. For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md). For more information about table-level retention time, see
    [CREATE TABLE](create-table.md) and [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent schemas
    >   + `0` or `1` for transient schemas

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the database or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the schema.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for tables in
    the schema to prevent streams on the tables from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`EXTERNAL_VOLUME = external_volume_name`
:   Object parameter that specifies the default external volume to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    For more information about this parameter, see [EXTERNAL_VOLUME](../parameters.md).

`CATALOG = catalog_integration_name`
:   Object parameter that specifies the default catalog integration to use for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    For more information about this parameter, see [CATALOG](../parameters.md).

`ICEBERG_VERSION_DEFAULT = integer`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the version of the Apache Iceberg™ table specification that Iceberg tables conform to.

    Values:
    :   `2`: New tables conform with Iceberg version 2.

        `3`: New tables conform with Iceberg version 3.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    Default:
    :   `2`

    For more information about this parameter, see [ICEBERG_VERSION_DEFAULT](../parameters.md).

`ENABLE_ICEBERG_MERGE_ON_READ = { TRUE | FALSE }`
:   [Preview feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether to enable merge-on-read behavior for Apache Iceberg™ tables.

    Values:
    :   `TRUE`: New tables use merge-on-read behavior.

        `FALSE`: New tables use copy-on-write behavior.

    Default:
    :   `TRUE`

    For a detailed description of this parameter, see [ENABLE_ICEBERG_MERGE_ON_READ](../parameters.md). For more information about merge-on-read
    and copy-on-write behavior in Snowflake, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

`REPLACE_INVALID_CHARACTERS = { TRUE | FALSE }`
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results for an
    [Iceberg table](create-iceberg-table.md).
    You can only set this parameter for tables that use an external Iceberg catalog.

    * `TRUE` replaces invalid UTF-8 characters with the Unicode replacement character.
    * `FALSE` leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message when it encounters invalid UTF-8
      characters in a Parquet data file.

    Default: `FALSE`

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for all tables added to the schema. The default
    can be overridden at the individual table level.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`LOG_LEVEL = 'log_level'`
:   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
    the specified level (and at more severe levels) are ingested.

    For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting log level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`TRACE_LEVEL = 'trace_level'`
:   Controls how trace events are ingested into the event table.

    For information about levels, see [TRACE_LEVEL](../parameters.md). For information about setting trace level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`STORAGE_SERIALIZATION_POLICY = { COMPATIBLE | OPTIMIZED }`
:   Specifies the storage serialization policy for [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) that use Snowflake as the catalog.

    * `COMPATIBLE`: Snowflake performs encoding and compression of data files that ensures interoperability with third-party compute engines.
    * `OPTIMIZED`: Snowflake performs encoding and compression of data files that ensures the best table performance within Snowflake.

    Default: `OPTIMIZED`

`CLASSIFICATION_PROFILE = 'classification_profile'`
:   Associates the schema with a classification profile so that sensitive data in the schema is
    [automatically classified](../../user-guide/classify-auto.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the schema.

    Default: No value

`CATALOG_SYNC = 'snowflake_open_catalog_integration_name'`
:   Specifies the name of a catalog integration configured for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).
    If specified, Snowflake syncs Snowflake-managed Apache Iceberg™ tables in the schema with an external catalog in your Snowflake Open Catalog account.
    For more information about syncing Snowflake-managed Iceberg tables with Open Catalog, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

    For more information about this parameter, see [CATALOG_SYNC](../parameters.md).

    Default: No value

`ENABLE_DATA_COMPACTION = { TRUE | FALSE }`
:   Specifies whether Snowflake should enable data compaction on Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

    * `TRUE`: Snowflake performs data compaction on the tables.
    * `FALSE`: Snowflake doesn’t perform data compaction on the tables.

    Default: `TRUE`

    For more information, see [ENABLE_DATA_COMPACTION](../parameters.md) and [Set data compaction](../../user-guide/tables-iceberg-manage.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`OBJECT_VISIBILITY = PRIVILEGED`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies the visibility of objects in the account, which controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md)
    and enables users without explicit access privileges to find objects and request access. For examples, see [Examples](../../user-guide/ui-snowsight/object-visibility-universal-search.md).

    * `PRIVILEGED`: Specifies that only roles within the current account that are granted an explicit privilege on the object can see the object.
      This is the default behavior in Snowflake.

    For examples, see [Make database objects discoverable in Universal Search](../../user-guide/ui-snowsight/object-visibility-universal-search.md).

## Backup parameters

The FROM BACKUP SET clause restores a schema from a backup. You don’t specify other schema
properties because they’re all the same as in the backed-up schema.

> **Note:**
>
> The FROM SNAPSHOT SET clause is deprecated. Use FROM BACKUP SET instead.

This form doesn’t have a CREATE OR REPLACE clause. You typically either restore the
schema under a new name and recover any data or other objects from this new schema,
or rename the original schema and then restore the schema under the original name.

> **Note:**
>
> The restored schema is independent of the original schema from the backup.
> There isn’t any cloning relationship between the restored and original schemas.
> Therefore, all the micro-partitions in the restored schema are owned by that schema.
>
> If you want to make backups of the newly restored schema, create a new backup set for it.

For more information about backups, see [Backups for disaster recovery and immutable storage](../../user-guide/backups.md).

`backup_set`
:   Specifies the name of a backup set created for a specific schema.
    You can use the SHOW BACKUP SETS command to locate the right backup set.

`backup_id`
:   Specifies the identifier of a specific backup within that backup set.
    You can use the SHOW BACKUPS IN BACKUP SET command to locate the right identifier within the backup
    set, based on the creation date and time for the backup.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SCHEMA | Database | Can create both regular and [managed access](../../user-guide/security-access-control-configure.md) schemas. |
| CREATE SCHEMA … CLONE … WITH MANAGED ACCESS | Options | The required privileges depends on whether the source schema is managed or unmanaged:   * Managed: OWNERSHIP on the source schema. * Unmanaged: MANAGE GRANTS ON ACCOUNT and USAGE on the source schema. |
| USAGE | External volume, catalog integration | Required if setting the `EXTERNAL_VOLUME` or `CATALOG` object parameters, respectively. |
| MANAGE VISIBILITY | Account | Required to set the OBJECT_VISIBILITY property. Only the SECURITYADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| MODIFY LOG LEVEL | Account | Required to set the LOG_LEVEL for a schema. |
| MODIFY TRACE LEVEL | Account | Required to set the TRACE_LEVEL for a schema. |
| OWNERSHIP | Schema | Required only when executing a CREATE OR ALTER SCHEMA statement for an existing schema.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* Creating a schema automatically sets it as the active/current schema for the current session (equivalent to using the
  [USE SCHEMA](use-schema.md) command for the schema).
* If a schema with the same name already exists in the database, an error is returned and the schema is not created, unless the optional
  `OR REPLACE` keyword is specified in the command.

  > **Important:**
  >
  > Using `OR REPLACE` is the equivalent of using [DROP SCHEMA](drop-schema.md) on the existing schema and then creating a new schema with
  > the same name; however, the dropped schema is not permanently removed from the system. Instead, it is retained in Time Travel.
  > This is important because dropped schemas in Time Travel contribute to data storage for your account. For more information, see
  > [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* In a managed access schema, the schema owner manages grants on the contained objects (e.g. tables or views) but has no other
  privileges (USAGE, SELECT, DROP, etc.) on the objects.
* In a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md), this command
  creates a namespace in your linked Iceberg REST catalog and a corresponding schema in your Snowflake database. For this use case, Snowflake
  supports only the following options:

  + CLASSIFICATION_PROFILE
  + COMMENT
  + STORAGE_SERIALIZATION_POLICY
  + TAG
  + WITH CONTACT
  + WITH MANAGED ACCESS

  The CREATE OR ALTER and CLONE variants aren’t supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER SCHEMA usage notes

* All limitations of the [ALTER SCHEMA](alter-schema.md) command apply.
* This command does *not* support the following:

  + Swapping schemas using the SWAP WITH parameter.
  + Renaming a schema using the RENAME TO parameter.
  + Creating a clone of a schema using the CLONE parameter.
  + Adding or changing tags and policies. Any existing tags and policies are preserved.
  + Converting a TRANSIENT schema to a non-TRANSIENT schema, or vice versa.

## Examples

Create a permanent schema:

> ```sqlexample
> CREATE SCHEMA myschema;
>
> SHOW SCHEMAS;
>
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+---------+----------------+
> | created_on                    | name               | is_default | is_current | database_name | owner        | comment                                                   | options | retention_time |
> |-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+---------+----------------|
> | 2018-12-10 09:34:02.127 -0800 | INFORMATION_SCHEMA | N          | N          | MYDB          |              | Views describing the contents of schemas in this database |         | 1              |
> | 2018-12-10 09:33:56.793 -0800 | MYSCHEMA           | N          | Y          | MYDB          | PUBLIC       |                                                           |         | 1              |
> | 2018-11-26 06:08:24.263 -0800 | PUBLIC             | N          | N          | MYDB          | PUBLIC       |                                                           |         | 1              |
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+---------+----------------+
> ```

Create a transient schema:

> ```sqlexample
> CREATE TRANSIENT SCHEMA tschema;
>
> SHOW SCHEMAS;
>
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+-----------+----------------+
> | created_on                    | name               | is_default | is_current | database_name | owner        | comment                                                   | options   | retention_time |
> |-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+-----------+----------------|
> | 2018-12-10 09:34:02.127 -0800 | INFORMATION_SCHEMA | N          | N          | MYDB          |              | Views describing the contents of schemas in this database |           | 1              |
> | 2018-12-10 09:33:56.793 -0800 | MYSCHEMA           | N          | Y          | MYDB          | PUBLIC       |                                                           |           | 1              |
> | 2018-11-26 06:08:24.263 -0800 | PUBLIC             | N          | N          | MYDB          | PUBLIC       |                                                           |           | 1              |
> | 2018-12-10 09:35:32.326 -0800 | TSCHEMA            | N          | Y          | MYDB          | PUBLIC       |                                                           | TRANSIENT | 1              |
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+-----------+----------------+
> ```

Create a managed access schema:

> ```sqlexample
> CREATE SCHEMA mschema WITH MANAGED ACCESS;
>
> SHOW SCHEMAS;
>
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+----------------+----------------+
> | created_on                    | name               | is_default | is_current | database_name | owner        | comment                                                   | options        | retention_time |
> |-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+----------------+----------------|
> | 2018-12-10 09:34:02.127 -0800 | INFORMATION_SCHEMA | N          | N          | MYDB          |              | Views describing the contents of schemas in this database |                | 1              |
> | 2018-12-10 09:36:47.738 -0800 | MSCHEMA            | N          | Y          | MYDB          | ROLE1        |                                                           | MANAGED ACCESS | 1              |
> | 2018-12-10 09:33:56.793 -0800 | MYSCHEMA           | N          | Y          | MYDB          | PUBLIC       |                                                           |                | 1              |
> | 2018-11-26 06:08:24.263 -0800 | PUBLIC             | N          | N          | MYDB          | PUBLIC       |                                                           |                | 1              |
> | 2018-12-10 09:35:32.326 -0800 | TSCHEMA            | N          | Y          | MYDB          | PUBLIC       |                                                           | TRANSIENT      | 1              |
> +-------------------------------+--------------------+------------+------------+---------------+--------------+-----------------------------------------------------------+----------------+----------------+
> ```

## CREATE OR ALTER SCHEMA examples

### Create a simple schema

Create a schema named `s1`:

```sqlexample
CREATE OR ALTER SCHEMA s1;
```

Create or alter schema `s1` and set properties and parameters:

```sqlexample
CREATE OR ALTER SCHEMA s1
  WITH MANAGED ACCESS
  DATA_RETENTION_TIME_IN_DAYS = 5
  DEFAULT_DDL_COLLATION = 'de';
```

### Unset a parameter previously set on schema

The [absence of a previously set parameter](create-or-alter.md) in the modified schema definition results
in unsetting it. In the following example, turn off managed access for the schema `s1` created
in the previous example:

```sqlexample
CREATE OR ALTER SCHEMA s1
  DATA_RETENTION_TIME_IN_DAYS = 5
  DEFAULT_DDL_COLLATION = 'de';
```

---
title: CREATE SECRET
source: https://docs.snowflake.com/en/sql-reference/sql/create-secret.md
section: SQL Commands
---

# CREATE SECRET

Creates a new secret in the current or specified schema or replaces an existing secret.

See also:
:   [ALTER SECRET](alter-secret.md) , [DESCRIBE SECRET](desc-secret.md) , [DROP SECRET](drop-secret.md) ,
    [SHOW SECRETS](show-secrets.md)

## Syntax

**OAuth with client credentials flow:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
  TYPE = OAUTH2
  API_AUTHENTICATION = <security_integration_name>
  OAUTH_SCOPES = ( '<scope_1>' [ , '<scope_2>' ... ] )
  [ COMMENT = '<string_literal>' ]
```

**OAuth with authorization code grant flow:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
  TYPE = OAUTH2
  OAUTH_REFRESH_TOKEN = '<string_literal>'
  OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '<string_literal>'
  API_AUTHENTICATION = <security_integration_name>;
  [ COMMENT = '<string_literal>' ]
```

**Cloud provider:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
  TYPE = CLOUD_PROVIDER_TOKEN
  API_AUTHENTICATION = '<cloud_provider_security_integration>'
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
```

**Basic authentication:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
  TYPE = PASSWORD
  USERNAME = '<username>'
  PASSWORD = '<password>'
  [ COMMENT = '<string_literal>' ]
```

**Generic string:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
  TYPE = GENERIC_STRING
  SECRET_STRING = '<string_literal>'
  [ COMMENT = '<string_literal>' ]
```

**Symmetric key:**

```sqlsyntax
CREATE [ OR REPLACE ] SECRET [ IF NOT EXISTS ] <name>
TYPE = SYMMETRIC_KEY
ALGORITHM = GENERIC
```

## OAuth with client credentials flow required parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = OAUTH2`
:   Specifies a secret to use with an OAuth grant flow.

`API_AUTHENTICATION = security_integration_name`
:   Specifies the `name` value of the Snowflake security integration that connects Snowflake to an external service.

`OAUTH_SCOPES = ( 'scope_1' [ , 'scope_2' ... ] )`
:   Specifies a comma-separated list of scopes to use when making a request from the OAuth server by a role with USAGE on the integration
    during the OAuth client credentials flow.

    This list must be a subset of the scopes defined in the `OAUTH_ALLOWED_SCOPES` property of the security integration. If the
    `OAUTH_SCOPES` property values are not specified, the secret inherits all of the scopes that are specified in the security
    integration.

    For the ServiceNow connector, the only possible scope value is `'useraccount'`.

## OAuth with authorization code grant flow required parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = OAUTH2`
:   Specifies a secret to use with the OAuth grant flow.

`OAUTH_REFRESH_TOKEN = 'string_literal'`
:   Specifies the token as a string that is used to obtain a new access token from the OAuth authorization server when the access token
    expires.

`OAUTH_REFRESH_TOKEN_EXPIRY_TIME = 'string_literal'`
:   Specifies the timestamp as a string when the OAuth refresh token expires.

`API_AUTHENTICATION = security_integration_name`
:   Specifies the `name` value of the Snowflake security integration that connects Snowflake to an external service.

## AWS IAM required parameters

`TYPE = CLOUD_PROVIDER_TOKEN`
:   Specifies that this is secret for use with a cloud provider, such as Amazon Web Services (AWS).

`API_AUTHENTICATION = 'cloud_provider_security_integration'`
:   Specifies the `name` value of the Snowflake [security integration](create-security-integration-aws-iam.md)
    that connects Snowflake to a cloud provider.

## Basic authentication required parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = PASSWORD`
:   Specifies a secret to use with basic authentication.

    When specifying this type you must specify values for the username and password properties.

`USERNAME = 'username'`
:   Specifies the username value to store in the secret.

    Specify this value when setting the `TYPE` value to `PASSWORD` for use with basic authentication.

`PASSWORD = 'password'`
:   Specifies the password value to store in the secret.

    Specify this value when setting the `TYPE` value to `PASSWORD` for use with basic authentication.

## Generic string parameters

`name`
:   String that specifies the identifier (i.e. name) for the secret, must be unique in your schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = GENERIC_STRING`
:   Specifies a secret to store a sensitive string value.

`SECRET_STRING = 'string_literal'`
:   Specifies the string to store in the secret.

    The string can be an API token or a string of sensitive value that can be used in the handler code of a UDF or stored procedure. For
    details, see [Creating and using an external access integration](../../developer-guide/external-network-access/creating-using-external-network-access.md).

    You should not use this property to store any kind of OAuth token; use one of the other secret types for your OAuth use cases.

## Symmetric key parameters

Symmetric key secrets generate a cryptographic key that can be used for cryptographic operations. Currently only used to generate
[synthetic data](../../user-guide/synthetic-data.md).

`ALGORITHM`
:   Specifies which algorithm to use to generate the symmetric key. The only value supported is `GENERIC`, which generates a 256-bit key.

## Optional parameters

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the secret.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SECRET | Schema |  |
| USAGE | Database or schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### OAuth with client credentials

Create a secret for use with the OAuth client credentials flow:

```sqlexample
CREATE OR REPLACE SECRET mysecret
  TYPE = OAUTH2
  API_AUTHENTICATION = mysecurityintegration
  OAUTH_SCOPES = ('useraccount')
  COMMENT = 'secret for the service now connector'
```

### OAuth with authorization code grant

Create a secret for use with the OAuth code grant flow:

```sqlexample
CREATE SECRET service_now_creds_oauth_code
  TYPE = OAUTH2
  OAUTH_REFRESH_TOKEN = '34n;vods4nQsdg09wee4qnfvadH'
  OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '2022-01-06 20:00:00'
  API_AUTHENTICATION = sn_oauth;
```

### AWS IAM

Create a secret for use with Amazon Web Services (AWS) by including the AWS IAM ARN for authentication:

```sqlexample
CREATE SECRET aws_secret
  TYPE = CLOUD_PROVIDER_TOKEN
  API_AUTHENTICATION = myawsiamintegration
  ENABLED = TRUE;
```

### Basic authentication

Create a secret that specifies a username and password to access ServiceNow:

```sqlexample
CREATE SECRET service_now_creds_pw
  TYPE = password
  USERNAME = 'jsmith1'
  PASSWORD = 'W3dr@fg*7B1c4j';
```

---
title: CREATE SECURITY INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION

Creates a new security integration in the account or replaces an existing integration. An integration is a Snowflake object that provides
an interface between Snowflake and a third-party service.

See also:
:   [ALTER SECURITY INTEGRATION](alter-security-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [ IF NOT EXISTS ]
  <name>
  TYPE = { API_AUTHENTICATION | EXTERNAL_OAUTH | OAUTH | SAML2 | SCIM }
  ...
```

The syntax varies considerably among security environments (i.e. types of security integrations). For specific syntax, usage notes, and
examples, see:

* [CREATE SECURITY INTEGRATION (AWS IAM Authentication)](create-security-integration-aws-iam.md)
* [CREATE SECURITY INTEGRATION (External API Authentication)](create-security-integration-api-auth.md)
* [CREATE SECURITY INTEGRATION (External OAuth)](create-security-integration-oauth-external.md)
* [CREATE SECURITY INTEGRATION (Snowflake OAuth)](create-security-integration-oauth-snowflake.md)
* [CREATE SECURITY INTEGRATION (SAML2)](create-security-integration-saml2.md)
* [CREATE SECURITY INTEGRATION (SCIM)](create-security-integration-scim.md)

---
title: CREATE SECURITY INTEGRATION (AWS IAM Authentication)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-aws-iam.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (AWS IAM Authentication)

Creates a new security integration for external authentication using Amazon Web Services (AWS) Identity and Access Management (IAM).

For information about creating other types of security integrations (e.g. External OAuth), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (AWS IAM Authentication)](alter-security-integration-aws-iam.md) , [DESCRIBE INTEGRATION](desc-integration.md) ,
    [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
  TYPE = API_AUTHENTICATION
  AUTH_TYPE = AWS_IAM
  AWS_ROLE_ARN = '<iam_role_arn>'
  ENABLED = { TRUE | FALSE }
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the integration. This value must be unique in your account.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = API_AUTHENTICATION`
:   Specifies that the security integration is an interface between Snowflake and one or more AWS services that use OAuth 2.0 or AWS IAM
    credentials.

`AUTH_TYPE = AWS_IAM`
:   Specifies that the integration uses AWS IAM to authenticate to one or more AWS services.

`AWS_ROLE_ARN = 'iam_role_arn'`
:   Specifies the Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges for AWS resources.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this security integration is enabled or disabled.

    `TRUE`
    :   Allows the integration to run based on the parameters specified in the integration definition.

    `FALSE`
    :   Suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a security integration to connect Snowflake to AWS as the role named in AWS as `arn:aws:iam::001234567890:role/myrole`.

> ```sqlexample
> CREATE SECURITY INTEGRATION aws_iam
>   TYPE = API_AUTHENTICATION
>   AUTH_TYPE = AWS_IAM
>   AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
>   ENABLED = true;
> ```

---
title: CREATE SECURITY INTEGRATION (External API Authentication)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-api-auth.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (External API Authentication)

Creates a new security integration for external API Authentication in the account or replaces an existing integration.

For information about creating other types of security integrations (e.g. External OAuth), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (External API Authentication)](alter-security-integration-api-auth.md) , [DESCRIBE INTEGRATION](desc-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

### OAuth: Client credentials

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
  TYPE = API_AUTHENTICATION
  AUTH_TYPE = OAUTH2
  ENABLED = { TRUE | FALSE }
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'CLIENT_CREDENTIALS']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_ALLOWED_SCOPES = ( '<scope_1>' [ , '<scope_2>' ... ] ) ]
  [ COMMENT = '<string_literal>' ]
```

### OAuth: Authorization code grant flow

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
  TYPE = API_AUTHENTICATION
  AUTH_TYPE = OAUTH2
  ENABLED = { TRUE | FALSE }
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'AUTHORIZATION_CODE']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ COMMENT = '<string_literal>' ]
```

### OAuth: JWT bearer flow

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
  TYPE = API_AUTHENTICATION
  AUTH_TYPE = OAUTH2
  ENABLED = { TRUE | FALSE }
  [ OAUTH_AUTHORIZATION_ENDPOINT = '<string_literal>' ]
  [ OAUTH_TOKEN_ENDPOINT = '<string_literal>' ]
  [ OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST } ]
  [ OAUTH_CLIENT_ID = '<string_literal>' ]
  [ OAUTH_CLIENT_SECRET = '<string_literal>' ]
  [ OAUTH_GRANT = 'JWT_BEARER']
  [ OAUTH_ACCESS_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the integration. This value must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless
    the entire identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = API_AUTHENTICATION`
:   Specifies that you are creating a security interface between Snowflake and an external service that uses OAuth 2.0 with
    External API Authentication.

`AUTH_TYPE = OAUTH2`
:   Specifies that the integration uses OAuth 2.0 to authenticate to the external service.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this security integration is enabled or disabled.

    `TRUE`
    :   Allows the integration to run based on the parameters specified in the integration definition.

    `FALSE`
    :   Suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    The value is case-insensitive.

    The default is `TRUE`.

## Optional parameters

Note that this is an exhaustive list of parameters that you can configure. Configure the parameters in the integration to match the
parameters that you configure when [creating a secret](create-secret.md) based on the OAuth flow that you choose.

`OAUTH_AUTHORIZATION_ENDPOINT = 'string_literal'`
:   Specifies the URL for authenticating to the external service. For example, to connect to the ServiceNow instance the URL should be in the
    following format:

    ```none
    https://<instance_name>.service-now.com/oauth_token
    ```

    Where `instance_name` is the name of your ServiceNow instance.

`OAUTH_TOKEN_ENDPOINT = 'string_literal'`
:   Specifies the token endpoint used by the client to obtain an access token by presenting its authorization grant or refresh token.
    The token endpoint is used with every authorization grant except for the implicit grant type (since an access token is issued directly).

`OAUTH_CLIENT_AUTH_METHOD = { CLIENT_SECRET_BASIC | CLIENT_SECRET_POST }`
:   Controls how client credentials are sent to the external service.

    `CLIENT_SECRET_BASIC`
    :   Specifies that client credentials are sent using the HTTP Basic Authentication Scheme.

    `CLIENT_SECRET_POST`
    :   Specifies that client credentials are sent in the HTTP request body of a POST request.

    Default: `CLIENT_SECRET_BASIC`

`OAUTH_CLIENT_ID = 'string_literal'`
:   Specifies the client ID for the OAuth application in the external service.

`OAUTH_CLIENT_SECRET = 'string_literal'`
:   Specifies the client secret for the OAuth application in the external service.

`OAUTH_GRANT = 'string_literal'`
:   Specifies the type of OAuth flow. One of the following:

    * `'CLIENT_CREDENTIALS'` when the integration will use client credentials.
    * `'AUTHORIZATION_CODE'` when the integration will use an authorization code.
    * `'JWT_BEARER'` when the integration will use a JWT bearer token.

`OAUTH_ACCESS_TOKEN_VALIDITY = integer`
:   Specifies the default lifetime of the OAuth access token (in seconds) issued by an OAuth server.

    The value set in this property is used if the access token lifetime is not returned as part of OAuth token response. When both
    values are available, the smaller of the two values will be used to refresh the access token.

`OAUTH_REFRESH_TOKEN_VALIDITY = integer`
:   Specifies the value to determine the validity of the refresh token obtained from the OAuth server.

`OAUTH_ALLOWED_SCOPES = ( 'scope_1' [ , 'scope_2' ... ] )`
:   Specifies a comma-separated list of scopes, with single quotes surrounding each scope, to use when making a request from the OAuth by a
    role with USAGE on the integration during the OAuth client credentials flow.

    This list must be a subset of the scopes defined in the `OAUTH_ALLOWED_SCOPES` property of the security integration. If the
    `OAUTH_SCOPES` property values are not specified, the secret inherits all of the scopes that are specified in the security
    integration.

    For the ServiceNow connector, the only possible scope value is `'useraccount'`.

    Default: Empty list (i.e. `[]`).

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |
| CREATE SECURITY INTEGRATION | Account | Grants the ability to create external security integrations of type API_AUTHENTICATION. This privilege does not grant the ability to create other types of security integrations. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a security integration named `servicenow_oauth` to connect Snowflake to the ServiceNow instance named `myinstance` using
OAuth with the code grant flow:

> ```sqlexample
> CREATE SECURITY INTEGRATION servicenow_oauth
>   TYPE = API_AUTHENTICATION
>   AUTH_TYPE = OAUTH2
>   OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
>   OAUTH_CLIENT_ID = 'sn-oauth-134o9erqfedlc'
>   OAUTH_CLIENT_SECRET = 'eb9vaXsrcEvrFdfcvCaoijhilj4fc'
>   OAUTH_TOKEN_ENDPOINT = 'https://myinstance.service-now.com/oauth_token.do'
>   ENABLED = TRUE;
> ```

Create a security integration named `sharepoint_security_integration` to connect Snowflake to Microsoft Sharepoint
using OAuth with client credentials:

> ```sqlexample
> CREATE SECURITY INTEGRATION sharepoint_security_integration
>   TYPE = API_AUTHENTICATION
>   AUTH_TYPE = OAUTH2
>   OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
>   OAUTH_CLIENT_ID = 'YOUR_CLIENT_ID'
>   OAUTH_CLIENT_SECRET = 'YOUR_CLIENT_SECRET'
>   OAUTH_GRANT = 'CLIENT_CREDENTIALS'
>   OAUTH_TOKEN_ENDPOINT = 'https://login.microsoftonline.com/YOUR_TENANT_ID/oauth2/v2.0/token'
>   OAUTH_ALLOWED_SCOPES = ('https://graph.microsoft.com/.default')
>   ENABLED = TRUE;
> ```

---
title: CREATE SECURITY INTEGRATION (External OAuth)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-oauth-external.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (External OAuth)

> **Attention:**
>
> Mentions of Microsoft Azure Active Directory refer to Microsoft Entra ID.

Creates a new External OAuth security integration in the account or replaces an existing integration. An External OAuth security
integration allows a client to use a third-party authorization server to obtain the access tokens needed to interact with Snowflake.

For information about creating other types of security integrations (e.g. Snowflake OAuth), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (External OAuth)](alter-security-integration-oauth-external.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [IF NOT EXISTS]
  <name>
  TYPE = EXTERNAL_OAUTH
  ENABLED = { TRUE | FALSE }
  EXTERNAL_OAUTH_TYPE = { OKTA | AZURE | PING_FEDERATE | CUSTOM }
  EXTERNAL_OAUTH_ISSUER = '<string_literal>'
  EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = { '<string_literal>' | ('<string_literal>' [ , '<string_literal>' , ... ] ) }
  EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = { 'LOGIN_NAME' | 'EMAIL_ADDRESS' }
  [ EXTERNAL_OAUTH_JWS_KEYS_URL = { '<string_literal>' | ('<string_literal>' [ , '<string_literal>' , ... ] ) } ]
  [ EXTERNAL_OAUTH_BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ EXTERNAL_OAUTH_ALLOWED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ EXTERNAL_OAUTH_RSA_PUBLIC_KEY = <public_key1> ]
  [ EXTERNAL_OAUTH_RSA_PUBLIC_KEY_2 = <public_key2> ]
  [ EXTERNAL_OAUTH_AUDIENCE_LIST = { '<string_literal>' | ('<string_literal>' [ , '<string_literal>' , ... ] ) } ]
  [ EXTERNAL_OAUTH_ANY_ROLE_MODE = { DISABLE | ENABLE | ENABLE_FOR_PRIVILEGE } ]
  [ EXTERNAL_OAUTH_SCOPE_DELIMITER = '<string_literal>' ]
  [ EXTERNAL_OAUTH_SCOPE_MAPPING_ATTRIBUTE = '<string_literal>' ]
  [ NETWORK_POLICY = '<network_policy>' ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = EXTERNAL_OAUTH`
:   Distinguishes the [External OAuth](../../user-guide/oauth-ext-overview.md) integration from a
    [Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md) integration.

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` allows the integration to run based on the parameters specified in the pipe definition.
    * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails
      to work.

    The value is case-insensitive.

    The default is `TRUE`.

`EXTERNAL_OAUTH_TYPE = { OKTA | AZURE | PING_FEDERATE | CUSTOM }`
:   Specifies the OAuth 2.0 authorization server to be Okta, Microsoft Entra ID, Ping Identity PingFederate, or a Custom OAuth 2.0 authorization
    server.

`EXTERNAL_OAUTH_ISSUER = 'string_literal'`
:   Specifies the URL to define the OAuth 2.0 authorization server.

`EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = { 'string_literal' | ('string_literal' [ , 'string_literal' , ... ] ) }`
:   Specifies the access token claim or claims to map the access token to a user record.

    The data type of the claim must be a string or a list of strings.

`EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = { 'LOGIN_NAME' | 'EMAIL_ADDRESS' }`
:   Indicates which Snowflake user record attribute should be used to map the access token to a user record.

## Optional parameters

`EXTERNAL_OAUTH_JWS_KEYS_URL = { 'string_literal' | ('string_literal' [ , 'string_literal' , ... ] ) }`
:   Specifies the HTTPS URL or a list of HTTPS URLs from where you can download public keys or certificates to validate an External OAuth access
    token.

    If you set the `EXTERNAL_OAUTH_TYPE` parameter to `AZURE`, then you can specify a maximum of three URLs. For example, to
    specify two URLs, use the following syntax:

    > ```sqlexample
    > EXTERNAL_OAUTH_JWS_KEYS_URL = ('https://example.ca', 'https://example.co.uk')
    > ```

    If you set the `EXTERNAL_OAUTH_TYPE` parameter to `OKTA`, `PING_FEDERATE`, or `CUSTOM`, then you can specify only
    one URL. For example:

    > ```sqlexample
    > EXTERNAL_OAUTH_JWS_KEYS_URL = 'https://example.ca'
    > ```

`EXTERNAL_OAUTH_RSA_PUBLIC_KEY = public_key1`
:   Specifies a Base64-encoded RSA public key, without the `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----`
    headers.

    Snowflake supports cryptographic keys generated using the following algorithms:

    * RSA digital signature algorithms RS256, RS384, and RS512.
    * Elliptic Curve Digital Signature Algorithms (ECDSA) algorithms ES256(P-256), ES384 (P-384), and ES512 (P-512).

    These signatures use the SHA-256, SHA-384, and SHA-512 hash algorithms, respectively.

`EXTERNAL_OAUTH_RSA_PUBLIC_KEY_2 = public_key2`
:   Specifies a second RSA public key, without the `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----` headers. Used
    for key rotation.

`EXTERNAL_OAUTH_BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
:   Specifies the list of roles that a client cannot set as the [primary role](../../user-guide/security-access-control-overview.md).
    A role in this list cannot be used when creating a Snowflake session based on the access token from the External OAuth
    authorization server.

    By default, this list includes the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles. To remove these privileged roles from the list, use
    the [ALTER ACCOUNT](alter-account.md) command to set the [EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../parameters.md) account parameter to
    `FALSE`.

`EXTERNAL_OAUTH_ALLOWED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
:   Specifies the list of roles that the client can set as the primary role.

    A role in this list can be used when creating a Snowflake session based on the access token from the External OAuth authorization
    server.

    > **Caution:**
    >
    > This parameter supports the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN system roles.
    >
    > Exercise caution when creating a Snowflake session with these highly privileged roles set as the primary role.

`EXTERNAL_OAUTH_AUDIENCE_LIST = { 'string_literal' | ('string_literal' [ , 'string_literal' , ... ] ) }`
:   Specifies additional values for the access token’s audience validation on top of using the Customer’s Snowflake
    Account URL (i.e. `<account_identifier>.snowflakecomputing.com`). For more information, see
    [Account identifiers](../../user-guide/admin-account-identifier.md).

    For details on this parameter when using Power BI SSO, refer to
    [Power BI SSO security integrations](../../user-guide/oauth-powerbi.md).

    Currently, multiple audience URLs can be specified for [External OAuth Custom Clients](../../user-guide/oauth-ext-custom.md) only. Each URL
    must be enclosed in single quotes, with a comma separating each URL. For example:

    > ```sqlexample
    > EXTERNAL_OAUTH_AUDIENCE_LIST = ('https://example.com/api/v2/', 'https://example.com')
    > ```

`EXTERNAL_OAUTH_ANY_ROLE_MODE = { DISABLE | ENABLE | ENABLE_FOR_PRIVILEGE }`
:   Specifies whether the OAuth client or user can use a role that is not defined in the OAuth access token. Note that with a
    [Power BI to Snowflake integration](../../user-guide/oauth-powerbi.md), the PowerBI user cannot switch roles even when this parameter is
    enabled.

    * `DISABLE` does not allow the OAuth client or user to switch roles (i.e. `USE ROLE role;`). Default.
    * `ENABLE` allows the OAuth client or user to switch roles.
    * `ENABLE_FOR_PRIVILEGE` allows the OAuth client or user to switch roles only for a client or user with the `USE_ANY_ROLE`
      privilege. This privilege can be granted and revoked to one or more roles available to the user. For example:

      ```sqlexample
      GRANT USE_ANY_ROLE ON INTEGRATION external_oauth_1 TO role1;
      ```

      ```sqlexample
      REVOKE USE_ANY_ROLE ON INTEGRATION external_oauth_1 FROM role1;
      ```

    Note that the value can be optionally enclosed in single quotes (e.g. either `DISABLE` or `'DISABLE'`).

`EXTERNAL_OAUTH_SCOPE_DELIMITER = 'string_literal'`
:   Specifies the scope delimiter in the authorization token, overriding the default delimiter, `','`. The delimiter can be any single
    character, such as comma (`','`) or space (`' '`).

    You can only use this property if you set the `EXTERNAL_OAUTH_TYPE` parameter to `CUSTOM`.

`EXTERNAL_OAUTH_SCOPE_MAPPING_ATTRIBUTE = 'string_literal'`
:   Specifies the access token claim to map the access token to an account role.

    You can only set this parameter to `scp` or `scope`.

    You can only use this parameter if you set the `EXTERNAL_OAUTH_TYPE` parameter to `CUSTOM`.

`NETWORK_POLICY = 'network_policy'`
:   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic from the client
    to Snowflake.

    For more information, see [Restricting network traffic for External OAuth](../../user-guide/oauth-ext-overview.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Microsoft Entra ID example

The following example creates an External OAuth security integration for a Microsoft Entra ID OAuth 2.0 authorization server.

> ```sqlexample
> CREATE SECURITY INTEGRATION external_oauth_azure_1
>     TYPE = external_oauth
>     ENABLED = true
>     EXTERNAL_OAUTH_TYPE = azure
>     EXTERNAL_OAUTH_ISSUER = '<AZURE_AD_ISSUER>'
>     EXTERNAL_OAUTH_JWS_KEYS_URL = '<AZURE_AD_JWS_KEY_ENDPOINT>'
>     EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = 'upn'
>     EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'login_name';
> ```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

```sqlexample
DESC SECURITY INTEGRATION external_oauth_azure_1;
```

### Okta example

The following example creates an External OAuth security integration for an Okta OAuth 2.0 authorization server.

> ```sqlexample
> CREATE SECURITY INTEGRATION external_oauth_okta_1
>     TYPE = external_oauth
>     ENABLED = true
>     EXTERNAL_OAUTH_TYPE = okta
>     EXTERNAL_OAUTH_ISSUER = '<OKTA_ISSUER>'
>     EXTERNAL_OAUTH_JWS_KEYS_URL = '<OKTA_JWS_KEY_ENDPOINT>'
>     EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = 'sub'
>     EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'login_name';
> ```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

```sqlexample
DESC SECURITY INTEGRATION external_oauth_okta_1;
```

### Microsoft Power BI SSO examples

For examples, see:

* [Creating a Power BI security integration](../../user-guide/oauth-powerbi.md)
* [Using Power BI SSO with B2B guest users](../../user-guide/oauth-powerbi.md)

---
title: CREATE SECURITY INTEGRATION (SAML2)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-saml2.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (SAML2)

Creates a new SAML2 security integration in the account or replaces an existing integration. A SAML2 security integration provides
single sign-on (SSO) workflows by creating an interface between Snowflake and a third-party Identity Provider (IdP).

For information about creating other types of security integrations (e.g. SCIM), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (SAML2)](alter-security-integration-saml2.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [ IF NOT EXISTS ]
    <name>
    TYPE = SAML2
    ENABLED = { TRUE | FALSE }
    { METADATA_URL = '<string_literal>' | <idp_parameters> }
    [ ALLOWED_USER_DOMAINS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
    [ ALLOWED_EMAIL_PATTERNS = ( '<string_literal>' [ , '<string_literal>' , ... ] ) ]
    [ SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = '<string_literal>' ]
    [ SAML2_ENABLE_SP_INITIATED = TRUE | FALSE ]
    [ SAML2_SNOWFLAKE_X509_CERT = '<string_literal>' ]
    [ SAML2_SIGN_REQUEST = TRUE | FALSE ]
    [ SAML2_REQUESTED_NAMEID_FORMAT = '<string_literal>' ]
    [ SAML2_POST_LOGOUT_REDIRECT_URL = '<string_literal>' ]
    [ SAML2_FORCE_AUTHN = TRUE | FALSE ]
    [ SAML2_SNOWFLAKE_ISSUER_URL = '<string_literal>' ]
    [ SAML2_SNOWFLAKE_ACS_URL = '<string_literal>' ]
    [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = SAML2`
:   Specify the type of integration:

    * `SAML2`: Creates a security interface between Snowflake and the identity provider (IdP).

`ENABLED = { TRUE | FALSE }`
:   The Boolean that specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` allows the integration to run based on the parameters specified in the pipe definition.
    * `FALSE` suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    The value is case-insensitive.

    The default is `TRUE`.

`{ METADATA_URL = 'string_literal' | idp_parameters }`
:   Specifies information about the IdP to establish the relationship between the IdP and Snowflake as the service provider.

    You must use the METADATA_URL parameter or the other required IdP parameters, but can’t specify both. Snowflake uses a metadata
    URL to obtain all of the information specified by the other parameters. The metadata URL is preferred because it’s less error prone and
    allows Snowflake to dynamically update IdP configuration settings.

    `METADATA_URL = 'string_literal'`
    :   Specifies the metadata URL of the IdP. A metadata URL is an endpoint that allows Snowflake to dynamically
        retrieve and synchronize IdP configuration settings, including certificate updates.

        This parameter is only supported for Okta and Microsoft Entra ID. For help obtaining the metadata URL, see the section in
        [Configuring an identity provider (IdP) for Snowflake](../../user-guide/admin-security-fed-auth-configure-idp.md) that corresponds to your IdP.

        After defining the metadata URL, you can keep Snowflake up-to-date with modified IdP configuration settings by running an ALTER
        SECURITY INTEGRATION REFRESH METADATA_URL command.

    `idp_parameters`
    :   Parameters that specify information about the IdP, including its issuer identifier and certificate. The parameters can’t be set if you
        specified a `METADATA_URL`.

        `SAML2_ISSUER = 'string_literal'`
        :   The string containing the EntityID / Issuer of the IdP.

        `SAML2_SSO_URL = 'string_literal'`
        :   The string containing the IdP SSO URL, where the user should be redirected by Snowflake (the Service Provider) with a SAML
            `AuthnRequest` message.

        `SAML2_PROVIDER = 'string_literal'`
        :   The string describing the IdP.

            One of the following: OKTA, ADFS, `Custom`.

        `SAML2_X509_CERT = 'string_literal'`
        :   The Base64 encoded IdP signing certificate on a single line without the leading `-----BEGIN CERTIFICATE-----` and ending
            `-----END CERTIFICATE-----` markers.

## Optional parameters

`ALLOWED_USER_DOMAINS = ( 'string_literal' [ , 'string_literal' , ... ] )`
:   A list of email domains that can authenticate with a SAML2 security integration. For example,
    `ALLOWED_USER_DOMAINS = ("example.com", "example2.com", ...)`.

    This parameter can be used to associate a user with an IdP for configurations that use multiple IdPs. For details, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

`ALLOWED_EMAIL_PATTERNS = ( 'string_literal' [ , 'string_literal' , ... ] )`
:   A list of regular expressions that email addresses are matched against to authenticate with a SAML2 security integration. For example,
    `ALLOWED_EMAIL_PATTERNS = ("^(.+dev)@example.com$", "^(.+dev)@example2.com$", ... )`.

    This parameter can be used to associate a user with an IdP for configurations that use multiple IdPs. For details, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

`SAML2_SP_INITIATED_LOGIN_PAGE_LABEL = 'string_literal'`
:   The string containing the label to display after the Log In With button on the login page.

`SAML2_ENABLE_SP_INITIATED = { TRUE | FALSE }`
:   The Boolean indicating if the Log In With button will be shown on the login page.

    * `TRUE` displays the Log in With button on the login page.
    * `FALSE` does not display the Log in With button on the login page.

`SAML2_SNOWFLAKE_X509_CERT = 'string_literal'`
:   The Base64 encoded self-signed certificate generated by Snowflake used for [encrypting SAML assertions](../../user-guide/admin-security-fed-auth-security-integration.md) and [sending signed SAML requests](../../user-guide/admin-security-fed-auth-security-integration.md).

    You must have at least one of these features (encrypted SAML assertions or signed SAML responses) enabled in your Snowflake account to
    access the certificate value.

`SAML2_SIGN_REQUEST = { TRUE | FALSE }`
:   The Boolean indicating whether SAML requests are signed.

    * `TRUE` allows SAML requests to be signed.
    * `FALSE` does not allow SAML requests to be signed.

`SAML2_REQUESTED_NAMEID_FORMAT = 'string_literal'`
:   The SAML NameID format allows Snowflake to set an expectation of the identifying attribute of the user (i.e. SAML Subject) in the SAML
    assertion from the IdP to ensure a valid authentication to Snowflake. If a value is not specified, Snowflake sends the
    `urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress` value in the authentication request to the IdP.

    Optional.

    If you choose to specify the SAML `NameID` format, use one of the following values:

    * `urn:oasis:names:tc:SAML:1.1:nameid-format:unspecified`
    * `urn:oasis:names:tc:SAML:1.1:nameid-format:emailAddress`
    * `urn:oasis:names:tc:SAML:1.1:nameid-format:X509SubjectName`
    * `urn:oasis:names:tc:SAML:1.1:nameid-format:WindowsDomainQualifiedName`
    * `urn:oasis:names:tc:SAML:2.0:nameid-format:kerberos`
    * `urn:oasis:names:tc:SAML:2.0:nameid-format:persistent`
    * `urn:oasis:names:tc:SAML:2.0:nameid-format:transient`

`SAML2_POST_LOGOUT_REDIRECT_URL = 'string_literal'`
:   The endpoint to which Snowflake redirects users after clicking the Log Out button in Snowsight.

    Snowflake terminates the Snowflake session upon redirecting to the specified endpoint.

`SAML2_FORCE_AUTHN = { TRUE | FALSE }`
:   The Boolean indicating whether users, during the initial authentication flow, are forced to authenticate again to access Snowflake. When
    set to `TRUE`, Snowflake sets the `ForceAuthn` SAML parameter to `TRUE` in the outgoing request from Snowflake to the
    identity provider.

    * `TRUE` forces users to authenticate again to access Snowflake, even if a valid session with the identity provider exists.
    * `FALSE` does not force users to authenticate again to access Snowflake.

    Default: `FALSE`.

`SAML2_SNOWFLAKE_ISSUER_URL = 'string_literal'`
:   The string containing the `EntityID` / `Issuer` for the Snowflake service provider.

    If an incorrect value is specified, Snowflake returns an error message indicating the acceptable values to use.

    The value of this property must match the Snowflake account URL specified in the IdP. It defaults to the
    [legacy URL](../../user-guide/admin-account-identifier.md), so if you define a different [URL format](../../user-guide/organizations-connect.md) in the IdP, make
    sure to set this property appropriately when creating the security integration. For details, see [Create a SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

`SAML2_SNOWFLAKE_ACS_URL = 'string_literal'`
:   The string containing the Snowflake Assertion Consumer Service URL to which the IdP will send its SAML authentication response back to
    Snowflake. This property will be set in the SAML authentication request generated by Snowflake when initiating a SAML SSO operation with
    the IdP.

    If an incorrect value is specified, Snowflake returns an error message indicating the acceptable values to use.

    The value of this property must match the Snowflake account URL specified in the IdP. It defaults to the
    [legacy URL](../../user-guide/admin-account-identifier.md), so if you define a different [URL format](../../user-guide/organizations-connect.md) in the IdP, make
    sure to set this property appropriately when creating the security integration. For details, see [Create a SAML2 security integration](../../user-guide/admin-security-fed-auth-security-integration.md).

    Default: `https://<account_locator>.<region>.snowflakecomputing.com/fed/login`

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Example

The following example creates a Microsoft Active Directory Federation Services (AD FS) security integration with the two optional settings:

```sqlexample
CREATE SECURITY INTEGRATION my_idp
  TYPE = saml2
  ENABLED = true
  METADATA_URL = 'https://integrator-26580.okta.com/app/ex2kbcS30N697/sso/saml/metadata'
  SAML2_SNOWFLAKE_ISSUER_URL = 'https://myorg-acct1.privatelink.snowflakecomputing.com'
  SAML2_SNOWFLAKE_ACS_URL = 'https://myorg-acct1.privatelink.snowflakecomputing.com/fed/login';
```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

> ```sqlexample
> DESC SECURITY INTEGRATION my_idp;
> ```

---
title: CREATE SECURITY INTEGRATION (SCIM)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-scim.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (SCIM)

> **Attention:**
>
> Mentions of Microsoft Azure Active Directory refer to Microsoft Entra ID.

Creates a new SCIM security integration in the account or replaces an existing integration. A SCIM security integration allows the
automated management of user identities and groups (i.e. roles) by creating an interface between Snowflake and a third-party Identity
Provider (IdP).

For information about creating other types of security integrations (e.g. SAML2), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (SCIM)](alter-security-integration-scim.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [ IF NOT EXISTS ]
    <name>
    TYPE = SCIM
    ENABLED = { TRUE | FALSE }
    SCIM_CLIENT = { 'OKTA' | 'AZURE' | 'GENERIC' }
    RUN_AS_ROLE = { 'OKTA_PROVISIONER' | 'AAD_PROVISIONER' | 'GENERIC_SCIM_PROVISIONER' | '<custom_role>' }
    [ NETWORK_POLICY = '<network_policy>' ]
    [ SYNC_PASSWORD = { TRUE | FALSE } ]
    [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = SCIM`
:   Specify the type of integration:

    * `SCIM`: Creates a security interface between Snowflake and a client that supports SCIM.

`ENABLED = { TRUE | FALSE }`
:   Specify whether the security integration is enabled. To create a security integration that is disabled, set `ENABLED = FALSE`.

    The value is case-insensitive.

    Default: `TRUE`

`SCIM_CLIENT = { 'OKTA' | 'AZURE' | 'GENERIC' }`
:   Specify the SCIM client.

`RUN_AS_ROLE = { 'OKTA_PROVISIONER' | 'AAD_PROVISIONER' | 'GENERIC_SCIM_PROVISIONER' | 'custom_role' }`
:   Specify the SCIM role in Snowflake that owns any users and roles that are imported from the identity provider into Snowflake using SCIM.

    The values `OKTA_PROVISIONER`, `AAD_PROVISIONER`, and `GENERIC_SCIM_PROVISIONER` are case-sensitive and must
    always be capitalized. You can also specify a custom role.

## Optional parameters

`NETWORK_POLICY = 'network_policy'`
:   Specifies an existing [network policy](../../user-guide/network-policies.md) that controls SCIM network traffic.

    If there are also network policies set for the account or user, see [Network policy precedence](../../user-guide/network-policies.md).

`SYNC_PASSWORD = { TRUE | FALSE }`
:   Specifies whether to enable or disable the synchronization of a user password from an Okta SCIM client as part of the API request to
    Snowflake.

    * `TRUE` enables password synchronization.
    * `FALSE` disables password synchronization.

    Default `FALSE`. If a security integration is created without setting this parameter, Snowflake sets this parameter to `FALSE`.

    If user passwords should not be synchronized from the client to Snowflake, ensure this property value is set to `FALSE` and
    disable password synchronization in the client.

    Note that this property is supported for Okta and Custom SCIM integrations. Microsoft Entra ID SCIM integrations are not supported because
    Microsoft Entra ID does not support password synchronization. To request support, please contact Microsoft.

    For more information, see [Snowflake SCIM support](../../user-guide/scim-intro.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Microsoft Entra ID example

The following example creates a Microsoft Entra ID SCIM integration with the default settings:

```sqlexample
CREATE OR REPLACE SECURITY INTEGRATION aad_provisioning
    TYPE = scim
    SCIM_CLIENT = 'AZURE'
    RUN_AS_ROLE = 'AAD_PROVISIONER';
```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

```sqlexample
DESC SECURITY INTEGRATION aad_provisioning;
```

### Okta example

The following example creates an Okta SCIM integration with the default settings:

```sqlexample
CREATE OR REPLACE SECURITY INTEGRATION okta_provisioning
    TYPE = scim
    SCIM_CLIENT = 'OKTA'
    RUN_AS_ROLE = 'OKTA_PROVISIONER';
```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

> ```sqlexample
> DESC SECURITY INTEGRATION okta_provisioning;
> ```

---
title: CREATE SECURITY INTEGRATION (Snowflake OAuth)
source: https://docs.snowflake.com/en/sql-reference/sql/create-security-integration-oauth-snowflake.md
section: SQL Commands
---

# CREATE SECURITY INTEGRATION (Snowflake OAuth)

Creates a new Snowflake OAuth security integration in the account or replaces an existing integration. A Snowflake OAuth security
integration enables clients that support OAuth to redirect users to an authorization page and generate access tokens (and optionally,
refresh tokens) for access to Snowflake.

For information about creating other types of security integrations (e.g. External OAuth), see [CREATE SECURITY INTEGRATION](create-security-integration.md).

See also:
:   [ALTER SECURITY INTEGRATION (Snowflake OAuth)](alter-security-integration-oauth-snowflake.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

**Snowflake OAuth for partner applications**

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [IF NOT EXISTS]
  <name>
  TYPE = OAUTH
  OAUTH_CLIENT = <partner_application>
  OAUTH_REDIRECT_URI = '<uri>'  -- Required when OAUTH_CLIENT=LOOKER
  [ ENABLED = { TRUE | FALSE } ]
  [ OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE } ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = { TRUE | FALSE } ]
  [ OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE } ]
  [ NETWORK_POLICY = '<network_policy>' ]
  [ BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
```

**Snowflake OAuth for custom clients**

```sqlsyntax
CREATE [ OR REPLACE ] SECURITY INTEGRATION [IF NOT EXISTS]
  <name>
  TYPE = OAUTH
  OAUTH_CLIENT = CUSTOM
  OAUTH_CLIENT_TYPE = 'CONFIDENTIAL' | 'PUBLIC'
  OAUTH_REDIRECT_URI = '<uri>'
  [ ENABLED = { TRUE | FALSE } ]
  [ OAUTH_ALLOW_NON_TLS_REDIRECT_URI = { TRUE | FALSE } ]
  [ OAUTH_ENFORCE_PKCE = { TRUE | FALSE } ]
  [ OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED = { TRUE | FALSE } ]
  [ OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE } ]
  [ PRE_AUTHORIZED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ BLOCKED_ROLES_LIST = ( '<role_name>' [ , '<role_name>' , ... ] ) ]
  [ OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE } ]
  [ OAUTH_REFRESH_TOKEN_VALIDITY = <integer> ]
  [ NETWORK_POLICY = '<network_policy>' ]
  [ OAUTH_CLIENT_RSA_PUBLIC_KEY = <public_key1> ]
  [ OAUTH_CLIENT_RSA_PUBLIC_KEY_2 = <public_key2> ]
  [ USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters (all OAuth clients)

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes
    (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = OAUTH`
:   Specify the type of integration:

    * `OAUTH`: Creates a security interface between Snowflake and a client that supports OAuth.

`OAUTH_CLIENT = { CUSTOM | partner_application }`
:   Specify the client type:

    * `CUSTOM`: Creates an OAuth interface between Snowflake and a custom client.
    * `partner_application`: Creates an OAuth interface between Snowflake and a partner application. Supported values are:

      + `TABLEAU_DESKTOP`: Tableau Desktop version 2019.1 or higher.
      + `TABLEAU_SERVER`: Tableau Cloud. If Tableau Cloud is connecting to Snowflake using private connectivity to
        the Snowflake service, be sure to specify `OAUTH_CLIENT = CUSTOM` instead.
      + `LOOKER`: The Looker business intelligence tool.

`OAUTH_REDIRECT_URI = 'uri'`
:   Specifies the client URI. After a user is authenticated, the web browser is redirected to this URI.

    This parameter is required when `OAUTH_CLIENT = LOOKER`. For details, see the example in the
    [Looker documentation](https://docs.looker.com/setup-and-management/database-config/snowflake#oauth).

## Additional required parameters (custom clients)

Required only when OAUTH_CLIENT = CUSTOM (i.e. when creating an integration for a custom client)

`OAUTH_CLIENT_TYPE = { 'CONFIDENTIAL' | 'PUBLIC' }`
:   Specifies the type of client being registered. Snowflake supports both confidential and public clients. Confidential clients can store a
    secret. They run in a protected area where end users cannot access them. For example, a secured service deployed on the cloud could be a
    confidential client; whereas, a client running on a desktop or distributed through an app store could be a public client.

`OAUTH_REDIRECT_URI = 'uri'`
:   Specifies the client URI. After a user is authenticated, the web browser is redirected to this URI. The URI must be protected by TLS
    (Transport Layer Security) unless the optional `OAUTH_ALLOW_NON_TLS_REDIRECT_URI` parameter is set to `TRUE`.

    Do not include query parameters sent with the redirect URI in the request to the
    [authorization endpoint](../../user-guide/oauth-custom.md). For example, if the value of the `redirect_uri` query parameter
    in the request to the authorization endpoint is `https://www.example.com/connect?authType=snowflake`, make sure the OAUTH_REDIRECT_URI
    parameter is set to `https://www.example.com/connect`.

## Optional parameters (all OAuth clients)

`ENABLED = { TRUE | FALSE }`
:   Specifies whether to initiate operation of the integration or suspend it.

    * `TRUE` enables the integration.
    * `FALSE` disables the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

    The value is case-insensitive.

    The default is `TRUE`.

`OAUTH_SINGLE_USE_REFRESH_TOKENS_REQUIRED =  { TRUE | FALSE }`
:   Specifies whether [single-use refresh tokens](../../user-guide/single-use-refresh-tokens.md) should be used.

    Default: `FALSE`

`USE_PRIVATELINK_FOR_AUTHORIZATION_ENDPOINT = { TRUE | FALSE }`
:   When TRUE, the interaction between Snowflake as the authorization server and the user who is authenticating uses
    [private connectivity](../../user-guide/private-connectivity-inbound.md). Interactions between Snowflake and the client, including the
    initial request to the authorization endpoint, still happens over the public internet.

    Default: FALSE

`NETWORK_POLICY = 'network_policy'`
:   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic that is
    attempting to exchange an authorization code for an access or refresh token, use a refresh token to obtain a new
    access token, or obtain Snowflake resources with an access token.

    For more information, see [Restricting network traffic for Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md).

## Additional optional parameters (partner applications)

Valid when OAUTH_CLIENT = <partner_application> (i.e. when creating an integration for a partner application)

`OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE }`
:   Boolean that specifies whether to allow the client to exchange a refresh token for an access token when the current access token has
    expired. If set to `FALSE`, a refresh token is not issued regardless of the integer value set in
    `OAUTH_REFRESH_TOKEN_VALIDITY`. User consent is revoked, and the user must confirm authorization again.

    Default: `TRUE`

    > **Note:**
    >
    > If this parameter is set to `FALSE` and the security integration also has `ENABLED = TRUE`, the Snowflake OAuth flow
    > repeats, a non-configurable access token is issued, and the access token is valid for 600 seconds (10 minutes). After this access token
    > expires, the user must authenticate again.
    >
    > Setting this parameter to `FALSE` and `ENABLED = FALSE` results in no tokens being issued and the integration is disabled.

`OAUTH_REFRESH_TOKEN_VALIDITY = integer`
:   Integer that specifies how long refresh tokens should be valid (in seconds). This can be used to expire the refresh token periodically.
    Note that OAUTH_ISSUE_REFRESH_TOKENS must be set to `TRUE`.

    When a refresh token expires, the application will need to direct the user through the authorization flow again to obtain a new refresh
    token.

    The supported minimum, maximum, and default values are as follows:

    * Minimum: `3600` (1 hour)
    * Maximum: `7776000` (90 days)
    * Default: `7776000` (90 days)

    If you have a business need to lower the minimum value or raise the maximum value, ask your account administrator to send a request to
    [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

`OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE }`
:   * `IMPLICIT`: Default secondary roles set in the user properties are activated by default in the session being opened.
    * `NONE`: Default secondary roles are not supported in the session being opened.

    Default: `NONE`

`BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
:   Comma-separated list of Snowflake roles that a user cannot explicitly consent to using after authenticating
    (e.g. `'BLOCKED_ROLES_LIST = ('custom_role1', 'custom_role2')`).

    By default, Snowflake prevents the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles from authenticating. To allow these
    privileged roles to authenticate, use the [ALTER ACCOUNT](alter-account.md) command to set the [OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST](../parameters.md) account parameter to `FALSE`.

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Additional optional parameters (custom clients)

Valid when OAUTH_CLIENT = CUSTOM (i.e. when creating an integration for a custom client)

`OAUTH_ALLOW_NON_TLS_REDIRECT_URI = { TRUE | FALSE }`
:   If `TRUE`, allows setting `OAUTH_REDIRECT_URI` to a URI not protected by TLS. We highly recommend use of TLS to
    prevent man-in-the-middle OAuth redirects for use in phishing attacks.

    Default: `FALSE`

`OAUTH_ENFORCE_PKCE = { TRUE | FALSE }`
:   Boolean that specifies whether Proof Key for Code Exchange (PKCE) should be required for the integration.

    By default, PKCE is optional and is enforced only if the `code_challenge` and `code_challenge_method` parameters are both
    included in the authorization endpoint URL. However, we highly recommend that your client require PKCE for all authorizations
    to make the OAuth flow more secure. For more information, see [Configure Snowflake OAuth for custom clients](../../user-guide/oauth-custom.md).

    Default: `FALSE`

`OAUTH_USE_SECONDARY_ROLES = { IMPLICIT | NONE }`
:   * `IMPLICIT`: Default secondary roles set in the user properties are activated by default in the session being opened.
    * `NONE`: Default secondary roles are not supported in the session being opened.

    Default: `NONE`

`PRE_AUTHORIZED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
:   Comma-separated list of Snowflake roles that a user does not need to explicitly consent to using after authenticating (e.g.
    `PRE_AUTHORIZED_ROLES_LIST = ('custom_role1', 'custom_role2')`). The ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and
    SECURITYADMIN roles cannot be included in this list.

    > **Note:**
    >
    > This parameter is supported for confidential clients only.

`BLOCKED_ROLES_LIST = ( 'role_name' [ , 'role_name' , ... ] )`
:   Comma-separated list of Snowflake roles that a user cannot explicitly consent to using after authenticating. For example,
    `BLOCKED_ROLES_LIST = ('custom_role1', 'custom_role2')`.

    The ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles are included in this list by default; however, if these roles should
    be removed for your account, ask your account administrator to send a request to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

`OAUTH_ISSUE_REFRESH_TOKENS = { TRUE | FALSE }`
:   Boolean that specifies whether to allow the client to exchange a refresh token for an access token when the current access token has
    expired. If set to `FALSE`, a refresh token is not issued. User consent is revoked, and the user must confirm authorization again.

    Default: `TRUE`

`OAUTH_REFRESH_TOKEN_VALIDITY = integer`
:   Integer that specifies how long refresh tokens should be valid (in seconds). This can be used to expire the refresh token periodically.
    Note that OAUTH_ISSUE_REFRESH_TOKENS must be set to `TRUE`.

    Note that if your organization would like the minimum or maximum values lowered or raised, respectively, ask your account administrator
    to send a request to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    Values:
    :   `86400` (1 day) to `7776000` (90 days)

    Default:
    :   `7776000`

`NETWORK_POLICY = 'network_policy'`
:   Specifies an existing [network policy](../../user-guide/network-policies.md). This network policy controls network traffic that is
    attempting to exchange an authorization code for an access or refresh token, use a refresh token to obtain a new
    access token, or obtain Snowflake resources with an access token.

    For more information, see [Restricting network traffic for Snowflake OAuth](../../user-guide/oauth-snowflake-overview.md).

    `network_policy` is a string literal that you must enclose in single quotes. If the network policy name is
    [case-sensitive or includes any special characters or spaces](../identifiers-syntax.md), then you must enclose the
    name in double quotes, and then enclose the double-quoted name in single quotes. For example,
    `NETWORK_POLICY = '"Case-Sensitive Name"'`.

`OAUTH_CLIENT_RSA_PUBLIC_KEY = public_key1`
:   Specifies an RSA public key.

`OAUTH_CLIENT_RSA_PUBLIC_KEY_2 = public_key2`
:   Specifies a second RSA public key. Used for key rotation.

`COMMENT = 'string_literal'`
:   Specifies a comment for the integration.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Tableau Desktop example

The following example creates an OAuth integration with the default settings:

> ```sqlexample
> CREATE SECURITY INTEGRATION td_oauth_int1
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = tableau_desktop;
> ```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

```sqlexample
DESC SECURITY INTEGRATION td_oauth_int1;
```

The following example creates an OAuth integration with refresh tokens that expire after 10 hours (36000 seconds). The integration blocks
users from starting a session with SYSADMIN as the active role:

> ```sqlexample
> CREATE SECURITY INTEGRATION td_oauth_int2
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = tableau_desktop
>   OAUTH_REFRESH_TOKEN_VALIDITY = 36000
>   BLOCKED_ROLES_LIST = ('SYSADMIN');
> ```

### Tableau Cloud example

The following example creates an OAuth integration with the default settings:

> ```sqlexample
> CREATE SECURITY INTEGRATION ts_oauth_int1
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = tableau_server;
> ```

View the integration settings using [DESCRIBE INTEGRATION](desc-integration.md):

```sqlexample
DESC SECURITY INTEGRATION ts_oauth_int1;
```

The following example creates an OAuth integration with refresh tokens that expire after 1 day (86400 seconds). The integration blocks
users from starting a session with SYSADMIN as the active role:

> ```sqlexample
> CREATE SECURITY INTEGRATION ts_oauth_int2
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = tableau_server
>   OAUTH_REFRESH_TOKEN_VALIDITY = 86400
>   BLOCKED_ROLES_LIST = ('SYSADMIN');
> ```

### Custom client example

The following example creates an OAuth integration that uses key pair authentication. The integration allows refresh tokens, which expire
after 1 day (86400 seconds). The integration blocks users from starting a session with SYSADMIN as the active role:

> ```sqlexample
> CREATE SECURITY INTEGRATION oauth_kp_int
>   TYPE = oauth
>   ENABLED = true
>   OAUTH_CLIENT = custom
>   OAUTH_CLIENT_TYPE = 'CONFIDENTIAL'
>   OAUTH_REDIRECT_URI = 'https://localhost.com'
>   OAUTH_ISSUE_REFRESH_TOKENS = TRUE
>   OAUTH_REFRESH_TOKEN_VALIDITY = 86400
>   PRE_AUTHORIZED_ROLES_LIST = ('MYROLE')
>   BLOCKED_ROLES_LIST = ('SYSADMIN');
> ```

---
title: CREATE SEMANTIC VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/create-semantic-view.md
section: SQL Commands
---

# CREATE SEMANTIC VIEW

Creates a new [semantic view](../../user-guide/views-semantic/overview.md) in the current/specified schema.

The semantic view must comply with [these validation rules](../../user-guide/views-semantic/validation-rules.md).

See also:
:   [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md) , [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../stored-procedures/system_create_semantic_view_from_yaml.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SEMANTIC VIEW [ IF NOT EXISTS ] <name>
  TABLES ( logicalTable [ , ... ] )
  [ RELATIONSHIPS ( relationshipDef [ , ... ] ) ]
  [ FACTS ( factExpression [ , ... ] ) ]
  [ DIMENSIONS ( dimensionExpression [ , ... ] ) ]
  [ METRICS ( { metricExpression | windowFunctionMetricExpression } [ , ... ] ) ]
  [ COMMENT = '<comment_about_semantic_view>' ]
  [ AI_SQL_GENERATION '<instructions_for_sql_generation>' ]
  [ AI_QUESTION_CATEGORIZATION '<instructions_for_question_categorization>' ]
  [ COPY GRANTS ]
```

where:

* The parameters for logical tables are:

  ```sqlsyntax
  logicalTable ::=
    [ <table_alias> AS ] <table_name>
    [ PRIMARY KEY ( <primary_key_column_name> [ , ... ] ) ]
    [
      UNIQUE ( <unique_column_name> [ , ... ] )
      [ ... ]
    ]
    [
      CONSTRAINT [ <constraint_name> ]
        DISTINCT RANGE BETWEEN <start_column> AND <end_column> EXCLUSIVE
    ]
    [ WITH SYNONYMS [ = ] ( '<synonym>' [ , ... ] ) ]
    [ COMMENT = '<comment_about_table>' ]
  ```
* The parameters for relationships are:

  ```sqlsyntax
  relationshipDef ::=
    [ <relationship_identifier> AS ]
    <table_alias> ( <column_name> [ , ... ] )
    REFERENCES
    <ref_table_alias> [ (
      [ ASOF ] <ref_column_name> [ , ... ] |
      BETWEEN <start_column> AND <end_column> EXCLUSIVE
    ) ]
  ```
* The parameters for expressions in the definitions of facts are:

  ```sqlsyntax
  factExpression ::=
    [ { PRIVATE | PUBLIC } ] <table_alias>.<fact> AS <sql_expr>
    [ WITH SYNONYMS [ = ] ( '<synonym>' [ , ... ] ) ]
    [ COMMENT = '<comment_about_the_fact>' ]
  ```
* The parameters for expressions in the definitions of dimensions are:

  ```sqlsyntax
  dimensionExpression ::=
    [ PUBLIC ] <table_alias>.<dimension> AS <sql_expr>
    [ WITH SYNONYMS [ = ] ( '<synonym>' [ , ... ] ) ]
    [ COMMENT = '<comment_about_the_dimension>' ]
    [ WITH CORTEX SEARCH SERVICE <search_service_name> [ USING <search_service_column_name> ] ]
  ```
* The parameters for expressions in the definitions of metrics are:

  ```sqlsyntax
  metricExpression ::=
    [ { PRIVATE | PUBLIC } ] <table_alias>.<metric>
      [ USING ( <relationship_name> [ , ... ] ) ]
      [
        NON ADDITIVE BY (
          <dimension> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ]
          [ , ... ]
        )
      ]
      AS <sql_expr>
    [ WITH SYNONYMS [ = ] ( '<synonym>' [ , ... ] ) ]
    [ COMMENT = '<comment_about_the_metric>' ]
  ```

* You can define a metric that uses a window function (a *window function metric*) by using the following syntax:

  ```sqlsyntax
  windowFunctionMetricExpression ::=
    [ { PRIVATE | PUBLIC } ] <table_alias>.<metric> AS
      <window_function>( <metric> ) OVER (
        [ PARTITION BY { <exprs_using_dimensions_or_metrics> | EXCLUDING <dimensions> } ]
        [ ORDER BY <exprs_using_dimensions_or_metrics> [ ASC | DESC ] [ NULLS { FIRST | LAST } ] [, ...] ]
        [ <windowFrameClause> ]
      )
  ```

  For information about this syntax, see Parameters for window function metrics.

> **Note:**
>
> The order of the clauses is important. For example, you must specify the FACTS clause before the DIMENSIONS clause.
>
> You can refer to semantic expressions that are defined in later clauses. For example, even if `fact_2` is defined after
> `fact_1`, you can still use `fact_2` in the definition of `fact_1`.

## Required parameters

`name`
:   Specifies the name of the semantic view; the name must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'comment_about_semantic_view'`
:   Specifies a comment about the semantic view.

`AI_SQL_GENERATION 'instructions_for_sql_generation'`
:   Specifies [instructions for Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst/custom-instructions.md) that explain
    how to generate the SQL statement.

    For more information, see [Providing custom instructions for Cortex Analyst](../../user-guide/views-semantic/sql.md).

`AI_QUESTION_CATEGORIZATION 'instructions_for_question_categorization'`
:   Specifies [instructions for Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst/custom-instructions.md) that explain
    how to classify questions.

    For more information, see [Providing custom instructions for Cortex Analyst](../../user-guide/views-semantic/sql.md).

`COPY GRANTS`
:   When you specify OR REPLACE to replace an existing semantic view with a new semantic view, you can set this parameter to copy
    any privileges granted on the existing semantic view to the new semantic view.

    The command copies all privilege grants except OWNERSHIP from the existing semantic view to the new semantic view. The
    role that executes the CREATE SEMANTIC VIEW statement owns the new view.

    The new semantic view does not inherit any future grants defined for the object type in the schema.

    The operation to copy grants occurs atomically with the CREATE SEMANTIC VIEW statement (in other words, within the same
    transaction).

    If you omit COPY GRANTS, the new semantic view does not inherit any explicit access privileges granted on the existing
    semantic view but does inherit any future grants defined for the object type in the schema.

## Parameters for logical tables

These parameters are part of the syntax for logical tables:

`table_alias AS`
:   Specifies an optional alias for the logical table.

    * If you specify an alias, you must use this alias when referring to the logical table in relationships, facts, dimensions,
      and metrics.
    * If you do not specify an alias, you use the unqualified logical table name to refer to the table.

`table_name`
:   Specifies the name of the logical table.

`PRIMARY KEY ( primary_key_column_name [ , ... ] )`
:   Specifies the names of one or more columns in the logical table that serve as the primary key of the table.

`UNIQUE ( unique_column_name [ , ... ] )`
:   Specifies the name of a column containing a unique value or the names of columns that contain unique combinations of values.

    For example, if the column `service_id` contains unique values, specify:

    ```sqlexample
    TABLES(
      ...
      product_table UNIQUE (service_id)
    ```

    If the combination of values in the `product_area_id` and `product_id` columns is unique, specify:

    ```sqlexample
    TABLES(
      ...
      product_table UNIQUE (product_area_id, product_id)
      ...
    ```

    You can identify multiple columns and multiple combinations of columns as unique in a given logical table:

    ```sqlexample
    TABLES(
      ...
      product_table UNIQUE (product_area_id, product_id) UNIQUE (service_id)
      ...
    ```

    > **Note:**
    >
    > If you already identified a column as a primary key column (by using PRIMARY KEY), do not add the UNIQUE clause for that
    > column.

`CONSTRAINT [ constraint_name ]` . `DISTINCT RANGE BETWEEN start_column AND end_column EXCLUSIVE`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies a constraint for a [range join](../../user-guide/views-semantic/sql.md).

    `constraint_name`
    :   Specifies an optional name for the constraint.

        If you omit this name, the command uses a system-generated name for the constraint.

    `DISTINCT RANGE BETWEEN start_column AND end_column EXCLUSIVE` specifies that in each row, the range
    between `start_column` and `end_column` is a distinct range:

    * The range is a [half-open interval](https://en.wikipedia.org/wiki/Interval_(mathematics)#Definitions_and_terminology),
      where the range is closed on the left side (`start_column`) and open on the right
      (`end_column`).

      In other words, the time on the left is included in the range, but the time on the right is excluded from the range.

      For example, for a row in this table, if the value in `start_column` is `2024-01-15 00:00:00.000` and the value
      in `end_column` is `2024-02-01 00:00:00.000`, the range is:

      `2024-01-15 00:00:00.000 <= timestamp_from_other_table < 2024-02-01 00:00:00.000`

      The timestamp `2024-01-15 00:00:00.000` is included in this range, but the timestamp `2024-02-01 00:00:00.000` is not.
    * `start_column` and `end_column` must be physical columns from the same table or facts or dimensions from
      the same table.

`WITH SYNONYMS [ = ] ( 'synonym' [ , ... ] )`
:   Specifies one or more synonyms for the logical table. Unlike aliases, synonyms are used for informational purposes only. You do
    not use synonyms to refer to the logical table in relationships, dimensions, metrics, and facts.

`COMMENT = 'comment_about_table'`
:   Specifies a comment about the logical table.

## Parameters for relationships

These parameters are part of the syntax for relationships:

`relationship_identifier AS`
:   Specifies an optional identifier for the relationship.

`table_alias ( column_name [ , ... ] )`
:   Specifies one of the logical tables and one or more of its columns that refers to columns in another logical table.

`ref_table_alias [ ( ... ) ]`
:   Specifies the other logical table referred to by the first logical table.

    You can specify one of the following in parentheses, depending on how you want to join the tables:

    `ref_column_name [ , ... ]`
    :   Specifies a column identified with the PRIMARY KEY or UNIQUE constraint in the
        logical table definition.

    `ASOF ref_column_name [ , ... ] )`
    :   For an [ASOF join](../../user-guide/views-semantic/sql.md), specifies a column of one of
        [the supported types](../constructs/asof-join.md).

        > **Note:**
        >
        > You can specify at most one ASOF keyword in the definition of a given relationship. You can specify this keyword before any
        > column in the list.

    `BETWEEN start_column AND end_column EXCLUSIVE`
    :   [Preview Feature](../../release-notes/preview-features.md) — Open

        Available to all accounts.

        For a [range join](../../user-guide/views-semantic/sql.md), specifies the range of possible values in the first table.

        `start_column` . `end_column`
        :   Specifies the columns that define the start and end of the range.

            * You must define a constraint for these columns.
            * You cannot use the same column for both `start_column` and `end_column`.

              If you want to use the same column, use an [ASOF relationship](../../user-guide/views-semantic/sql.md).

            > **Note:**
            >
            > `column_name` must have a data type that can be [coerced](../data-type-conversion.md) to the
            > data types for `start_column` and `end_column`.

## Parameters for facts, dimensions, and metrics

In a semantic view, you must define at least one dimension or metric, which means that you must specify at least the DIMENSIONS
clause or the METRICS clause.

These parameters are part of the syntax for defining a fact,
dimension, or
metric:

`{ PRIVATE | PUBLIC }`
:   Specifies whether a fact or metric is [private](../../user-guide/views-semantic/sql.md) or public. Facts and metrics that are
    marked as private cannot be queried or used in a query condition.

    > **Note:**
    >
    > You cannot mark a dimension as private. Dimensions are always public. For a dimension, the effect is the same whether you
    > specify or omit PUBLIC.

    If you omit PRIVATE and PUBLIC, the dimension, fact, or metric is public by default.

`table_alias.semantic_expression_name`
:   Specifies a name for a dimension, fact, or metric.

`USING relationship_name [ , ... ]`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    For metric definitions, specifies the relationship that should be used to join the tables and calculate the metric, when
    [multiple relationship paths exist between two logical tables](../../user-guide/views-semantic/sql.md).

    To define a [derived metric](../../user-guide/views-semantic/sql.md) (a metric that combines multiple metrics from
    different logical tables), omit `table_alias.` from the name.

    See [How Snowflake validates semantic views](../../user-guide/views-semantic/validation-rules.md) for the rules for defining a valid semantic view.

`NON ADDITIVE BY ( dimension [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [ , ... ] )`
:   Specifies a list of dimensions that should not be used when summing the metric.

    Instead, during query processing, the rows are sorted by the non-additive dimensions, and the values from the last rows (the
    *latest snapshots of values*) are aggregated to compute the metric.

    `{ ASC | DESC }`
    :   Optionally sorts the values of the non-additive dimensions in ascending (lowest to highest) or descending (highest to lowest)
        order, which determines what the last snapshot is.

        Default: ASC

    `NULLS { FIRST | LAST }`
    :   Optionally specifies whether NULL values are sorted before/after non-NULL values, based on the sort order (ASC or DESC). The
        sort order determines what the last snapshot is.

        Default: Depends on the sort order (ASC or DESC); see
        [the usage notes in the ORDER BY documentation](../constructs/order-by.md).

    Specifying the NON ADDITIVE BY clause makes the metric a *semi-additive* metric.

    For information, see [Identifying the dimensions that should be non-additive for a metric](../../user-guide/views-semantic/sql.md).

`AS sql_expr`
:   Specifies the SQL expression for computing the dimension, fact, or metric.

    See [Defining facts, dimensions, and metrics](../../user-guide/views-semantic/sql.md). For the validation rules for these expressions, see
    [How Snowflake validates semantic views](../../user-guide/views-semantic/validation-rules.md).

`WITH SYNONYMS [ = ] ( 'synonym' [ , ... ] )`
:   Specifies one or more optional synonyms for the dimension, fact, or metric. Note that synonyms are used for informational
    purposes only. You cannot use a synonym to refer to a dimension, fact, or metric in another dimension, fact, or metric.

`COMMENT = 'comment_about_dim_fact_or_metric'`
:   Specifies an optional comment about the dimension, fact, or metric.

`WITH CORTEX SEARCH SERVICE search_service_name [ USING search_service_column_name ]`
:   Specifies the
    [Cortex Search Service to use for this dimension](../../user-guide/views-semantic/sql.md).

    You can only specify this parameter for dimensions (and not for facts or metrics).

    If the Cortex Search Service is in a different database or schema,
    [qualify the name of the service](../name-resolution.md) (for example, `my_db.my_schema.my_service`).

    You can set the optional USING clause to the name of the column in the Cortex Search Service.

## Parameters for window function metrics

These parameters are part of the
syntax for defining window function metrics:

`metric`
:   Specifies a metric expression for this window function. You can specify a metric or any valid metric expression that you can use
    to define a metric in this entity.

`PARTITION BY ...`
:   Groups rows into partitions. You can either partition by a specified set of expressions or by all dimensions (except selected
    dimensions) specified in the query:

    `PARTITION BY exprs_using_dimensions_or_metrics`
    :   Groups rows into partitions by SQL expressions. In the SQL expression:

        * Any dimensions in the expression must be accessible from the same entity that defines the window function metric.
        * Any metrics must belong to the same table where this metric is being defined.
        * You cannot specify aggregates, window functions, or subqueries.

    `PARTITION BY EXCLUDING dimensions`
    :   Groups rows into partitions by all of the dimensions specified in the [SEMANTIC_VIEW](../constructs/semantic_view.md) clause of
        the query, except for the dimensions specified by `dimensions`.

        `dimensions` must only refer to dimensions that are accessible from the entity that defines the window function
        metric.

        For example, suppose that you exclude the dimension `table_1.dimension_1` from partitioning:

        ```sqlexample
        CREATE SEMANTIC VIEW sv
          ...
          METRICS (
            table_1.metric_2 AS SUM(table_1.metric_1) OVER
              (PARTITION BY EXCLUDING table_l.dimension_1 ORDER BY table_1.dimension_2)
          )
          ...
        ```

        Suppose that you run a query that specifies the dimension `table_1.dimension_1`:

        ```sqlexample
        SELECT * FROM SEMANTIC VIEW(
          sv
          METRICS (
            table_1.metric_2
          )
          DIMENSIONS (
            table_1.dimension_1,
            table_1.dimension_2,
            table_1.dimension_3
          );
        ```

        In the query, the metric `table_1.metric_2` is evaluated as:

        ```sqlexample
        SUM(table_1.metric_1) OVER (
          PARTITION BY table_1.dimension_2, table_1.dimension_3
          ORDER BY table_1.dimension_2
        )
        ```

        Note how `table_1.dimension_1` is excluded from the PARTITION BY clause.

        > **Note:**
        >
        > You cannot use EXCLUDING outside of metric definitions in semantic views. EXCLUDING is not supported in window function
        > calls in any other context.

`ORDER BY exprs_using_dimensions_or_metrics  [ ASC | DESC ] [ NULLS FIRST | LAST ] [, ... ]`
:   Orders rows within each partition. In the SQL expression:

    * Any dimensions in the expression must be accessible from the same entity that defines the window function metric.
    * Any metrics must belong to the same table where this metric is being defined.
    * You cannot specify aggregates, window functions, or subqueries.

`windowFrameClause`
:   See [Window function syntax and usage](../functions-window-syntax.md).

For additional information about the parameters for window functions and examples, see
[Defining and querying window function metrics](../../user-guide/views-semantic/querying.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SEMANTIC VIEW | Schema | Required to create a new semantic view. |
| SELECT | Table, view | Required on any tables and/or views used in the semantic view definition. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The semantic view must be valid and must follow the rules described in
  [How Snowflake validates semantic views](../../user-guide/views-semantic/validation-rules.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

See [Creating a semantic view by using the CREATE SEMANTIC VIEW command](../../user-guide/views-semantic/sql.md).

---
title: CREATE SEQUENCE
source: https://docs.snowflake.com/en/sql-reference/sql/create-sequence.md
section: SQL Commands
---

# CREATE SEQUENCE

Creates a new sequence, which can be used for generating sequential, unique numbers.

> **Important:**
>
> Snowflake does not guarantee generating sequence numbers without gaps. The generated numbers are not necessarily contiguous.

For more details, see [Using Sequences](../../user-guide/querying-sequences.md).

See also:
:   [DROP SEQUENCE](drop-sequence.md) , [ALTER SEQUENCE](alter-sequence.md) , [SHOW SEQUENCES](show-sequences.md) , [DESCRIBE SEQUENCE](desc-sequence.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SEQUENCE [ IF NOT EXISTS ] <name>
  [ WITH ]
  [ START [ WITH ] [ = ] <initial_value> ]
  [ INCREMENT [ BY ] [ = ] <sequence_interval> ]
  [ { ORDER | NOORDER } ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier for the sequence; must be unique for the schema in which the sequence is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details about identifiers, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`START [ WITH ] [ = ] initial_value`
:   Specifies the first value returned by the sequence. Supported values are any value that can be represented by a 64-bit two’s
    complement integer (from `-2^63` to `2^63 - 1`).

    Default: `1`

`INCREMENT [ BY ] [ = ] sequence_interval`
:   Specifies the step interval of the sequence:

    > * For positive sequence interval `n`, the next `n-1` values are reserved by each sequence call.
    > * For negative sequence interval `-n`, the next `n-1` lower values are reserved by each sequence call.

    Supported values are any non-zero value that can be represented by a 64-bit two’s complement integer.

    Default: `1`

`{ ORDER | NOORDER }`
:   Specifies whether or not the values are generated for the sequence in
    [increasing or decreasing order](../../user-guide/querying-sequences.md).

    * ORDER specifies that the values generated for a sequence or auto-incremented column are in increasing order (or, if the interval
      is a negative value, in decreasing order).

      For example, if a sequence or auto-incremented column has `START 1 INCREMENT 2`, the generated values might be
      `1`, `3`, `5`, `7`, `9`, etc.
    * NOORDER specifies that the values are not guaranteed to be in increasing order.

      For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.

      NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
      clients are executing multiple INSERT statements).

    Default: The [NOORDER_SEQUENCE_AS_DEFAULT](../parameters.md) parameter determines which property is set by default.

`COMMENT = 'string_literal'`
:   Specifies a comment for the sequence.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SEQUENCE | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The first/initial value for a sequence cannot be changed after the sequence is created.
* A sequence does not necessarily produce a gap-free sequence. Values increase (until the limit is reached) and are unique,
  but are not necessarily contiguous. For more information, including the upper and lower limits, see [Sequence Semantics](../../user-guide/querying-sequences.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Here is a simple example of using sequences:

> > ```sqlexample
> > CREATE OR REPLACE SEQUENCE seq_01 START = 1 INCREMENT = 1;
> > CREATE OR REPLACE TABLE sequence_test_table (i INTEGER);
> > ```
> >
> > ```sqlexample
> > SELECT seq_01.nextval;
> > +---------+
> > | NEXTVAL |
> > |---------|
> > |       1 |
> > +---------+
> > ```
>
> Run the same query again; note how the sequence numbers change:
>
> > ```sqlexample
> > SELECT seq_01.nextval;
> > +---------+
> > | NEXTVAL |
> > |---------|
> > |       2 |
> > +---------+
> > ```
>
> Now use the sequence while inserting into a table:
>
> > ```sqlexample
> > INSERT INTO sequence_test_table (i) VALUES (seq_01.nextval);
> > ```
> >
> > ```sqlexample
> > SELECT i FROM sequence_test_table;
> > +---+
> > | I |
> > |---|
> > | 3 |
> > +---+
> > ```

Create a sequence that increments by 5 rather than by 1:

> > ```sqlexample
> > CREATE OR REPLACE SEQUENCE seq_5 START = 1 INCREMENT = 5;
> > ```
> >
> > ```sqlexample
> > SELECT seq_5.nextval a, seq_5.nextval b, seq_5.nextval c, seq_5.nextval d;
> > +---+---+----+----+
> > | A | B |  C |  D |
> > |---+---+----+----|
> > | 1 | 6 | 11 | 16 |
> > +---+---+----+----+
> > ```
>
> Run the same query again; note how the sequence numbers change. You might expect that the next set of sequence numbers would start 5
> higher than the previous statement left off. However, the next sequence number starts 20 higher (5 \* 4, where 5 is the size of the
> increment and 4 is the number of `NEXTVAL` operations in the statement):
>
> > ```sqlexample
> > SELECT seq_5.nextval a, seq_5.nextval b, seq_5.nextval c, seq_5.nextval d;
> > +----+----+----+----+
> > |  A |  B |  C |  D |
> > |----+----+----+----|
> > | 36 | 41 | 46 | 51 |
> > +----+----+----+----+
> > ```

This example demonstrates that you can use a sequence as a default value for a column to provide unique identifiers for each row in
a table:

> ```sqlexample
> CREATE OR REPLACE SEQUENCE seq90;
> CREATE OR REPLACE TABLE sequence_demo (i INTEGER DEFAULT seq90.nextval, dummy SMALLINT);
> INSERT INTO sequence_demo (dummy) VALUES (0);
>
> -- Keep doubling the number of rows:
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> INSERT INTO sequence_demo (dummy) SELECT dummy FROM sequence_demo;
> ```
>
> ```sqlexample
> SELECT i FROM sequence_demo ORDER BY i LIMIT 10;
> +----+
> |  I |
> |----|
> |  1 |
> |  2 |
> |  3 |
> |  4 |
> |  5 |
> |  6 |
> |  7 |
> |  8 |
> |  9 |
> | 10 |
> +----+
> ```
>
> This query shows that each row in the table has a distinct value:
>
> ```sqlexample
> SELECT COUNT(i), COUNT(DISTINCT i) FROM sequence_demo;
> +----------+-------------------+
> | COUNT(I) | COUNT(DISTINCT I) |
> |----------+-------------------|
> |     1024 |              1024 |
> +----------+-------------------+
> ```

More examples are available in [Using Sequences](../../user-guide/querying-sequences.md).

---
title: CREATE SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/create-service.md
section: SQL Commands
---

# CREATE SERVICE

Creates a new [Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md)
in the current schema. If a service with that name already exists, use the [DROP SERVICE](drop-service.md) command to delete the previously
created service.

You can run more than one instance of your service. Each service instance is a collection of containers, as defined in the
service specification file, that run together on a node in your compute pool. If you run multiple instances of a service, a load
balancer manages incoming traffic.

Note that the command parameters must be specified in specific order. For more information, see the Usage Notes section.

See also:
:   [ALTER SERVICE](alter-service.md) , [DESCRIBE SERVICE](desc-service.md), [DROP SERVICE](drop-service.md) , [SHOW SERVICES](show-services.md)

## Syntax

```sqlsyntax
CREATE SERVICE [ IF NOT EXISTS ] <name>
  IN COMPUTE POOL <compute_pool_name>
  {
     fromSpecification
     | fromSpecificationTemplate
  }
  [ AUTO_SUSPEND_SECS = <num> ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <EAI_name> [ , ... ] ) ]
  [ AUTO_RESUME = { TRUE | FALSE } ]
  [ MIN_INSTANCES = <num> ]
  [ MIN_READY_INSTANCES = <num> ]
  [ MAX_INSTANCES = <num> ]
  [ LOG_LEVEL = '<log_level>' ]
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COMMENT = '{string_literal}']
```

Where:

> ```sqlsyntax
> fromSpecification ::=
>   {
>     FROM SPECIFICATION_FILE = '<yaml_file_path>' -- for native app service.
>     | FROM @<stage> SPECIFICATION_FILE = '<yaml_file_path>' -- for non-native app service.
>     | FROM SPECIFICATION <specification_text>
>   }
> ```
>
> ```sqlsyntax
> fromSpecificationTemplate ::=
>   {
>     FROM SPECIFICATION_TEMPLATE_FILE = '<yaml_file_stage_path>' -- for native app service.
>     | FROM @<stage> SPECIFICATION_TEMPLATE_FILE = '<yaml_file_stage_path>' -- for non-native app service.
>     | FROM SPECIFICATION_TEMPLATE <specification_text>
>   }
>   USING ( <key> => <value> [ , <key> => <value> [ , ... ] ]  )
> ```

## Required parameters

`name`
:   String that specifies the identifier (that is, the name) for the service; it must be unique for the schema in which the service
    is created.

    Quoted names for special characters or case-sensitive names are not supported. The same constraint also applies to database
    and schema names where you create a service. That is, database and schema names without quotes are valid when creating a
    service.

`IN COMPUTE POOL compute_pool_name`
:   Specifies the name of the compute pool in your account on which to run the service.

`FROM ...`
:   Identifies the [specification](../../developer-guide/snowpark-container-services/specification-reference.md) or
    the [template](../../developer-guide/snowpark-container-services/working-with-services.md) specification for the service.

    **Using a service specification**

    You can either define the specification either [inline or in a separate file](../../developer-guide/snowpark-container-services/working-with-services.md).

    `SPECIFICATION_FILE = 'yaml_file_path'` or . `@stage SPECIFICATION_FILE = 'yaml_file_path'` or . `SPECIFICATION specification_text`
    :   Specifies the file containing the service specification or the service specification inline. If your service specification is in a file, use SPECIFICATION_FILE. For services created in a Snowflake Native App, omit `@stage`, and specify a path relative to the app root directory. For services created in other contexts, specify the Snowflake internal stage and path to the service specification file.

    **Using a service specification template**

    You can either define the [template specification](../../developer-guide/snowpark-container-services/working-with-services.md) either [inline or in a separate file](../../developer-guide/snowpark-container-services/working-with-services.md).

    `SPECIFICATION_TEMPLATE_FILE = 'yaml_file_path'` or . `@stage SPECIFICATION_TEMPLATE_FILE = 'yaml_file_path'` or . `SPECIFICATION_TEMPLATE specification_text`
    :   Specifies the file containing the service specification template or the service specification template inline. If your service specification template is in a file, use SPECIFICATION_TEMPLATE_FILE. For services created in a Snowflake Native App, omit `@stage`, and specify a path relative to the app root directory. For services created in other contexts, specify the Snowflake internal stage and path to the service specification file. When using template specification, you should also include the `USING` parameter.

    `USING ( key => value [ , key => value [ , ... ] ]  )`
    :   Specifies the template variables and the values of those variables.

        * `key` is the name of the template variable. The template variable name can optionally be enclosed in double quotes
          (`"`).
        * `value` is the value to assign to the variable in the template. String values must be enclosed in `'` or
          `$$`. The value must either be alphanumeric or valid JSON.

        Use a comma between each key-value pair.

## Optional parameters

`AUTO_SUSPEND_SECS = num`
:   Specifies the number of seconds of inactivity (service is idle) after which Snowflake automatically suspends the service. Inactivity means no queries (that invoke a service function) executed for the time period specified by AUTO_SUSPEND_SECS. You can configure this value to 300 seconds or more to enable auto-suspension. For more information, see [Suspending a service](../../developer-guide/snowpark-container-services/working-with-services.md).

    Default: 0 seconds, which indicates Snowflake does not suspend the service automatically.

    [Preview Feature](../../release-notes/preview-features.md) — Open

    Configuring the automatic suspension of a Snowpark Container Services service using the AUTO_SUSPEND_SECS property is a [preview feature](../../release-notes/preview-features.md).

`EXTERNAL_ACCESS_INTEGRATIONS = ( EAI_name [ , ... ] )`
:   Specifies the names of the [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md) that allow your service to access external sites.
    The names in this list are case-sensitive. By default, application containers don’t have
    permission to access the internet. If you want to allow your service to access an external site, create an External Access Integration
    (EAI), and configure your service to use that integration. For more
    information, see [Configure service egress](../../developer-guide/snowpark-container-services/service-network-communications.md).

`AUTO_RESUME = { TRUE | FALSE }`
:   Specifies whether to automatically resume a service when user performs one of the following actions that depend on the service:

    * Executing a query is that uses a [service function](../../developer-guide/snowpark-container-services/working-with-services.md).
    * Sending a request to the public endpoint exposed by the service ([ingress](../../developer-guide/snowpark-container-services/working-with-services.md)).

    If AUTO_RESUME is FALSE, you need to explicitly resume the service (using [ALTER SERVICE … RESUME](alter-service.md)).

    Default: TRUE.

`MIN_INSTANCES = num`
:   Specifies the minimum number of service instances to run.

    Default: 1.

`MIN_READY_INSTANCES = num`
:   Indicates the minimum service instances that must be ready for Snowflake to consider the service is ready to process requests.
    MIN_READY_INSTANCES must be equal to or less than MIN_INSTANCES. For more information, see [Scaling services](../../developer-guide/snowpark-container-services/working-with-services.md).

    Default: The value of the MIN_INSTANCES property.

`MAX_INSTANCES = num`
:   Specifies the maximum number of service instances to run.

    Default: The value of the MIN_INSTANCES property.

`LOG_LEVEL = 'log_level'`
:   Specifies the severity level of messages that should be ingested and made available in the active event table. Messages at
    the specified level (and at more severe levels) are ingested.
    Currently, LOG_LEVEL is supported only for [platform events](../../developer-guide/snowpark-container-services/monitoring-services.md), Changing LOG_LEVEL for [container logs](../../developer-guide/snowpark-container-services/monitoring-services.md) is not supported.

    For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`QUERY_WAREHOUSE = warehouse_name`
:   Warehouse to use if a service container connects to Snowflake to execute a query but does not explicitly specify a warehouse
    to use. This parameter also supports object references in Native Apps. For more information, see [Request references and object-level privileges from consumers](../../developer-guide/native-apps/requesting-refs.md).

    Default: none.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the service.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SERVICE | Schema |  |
| USAGE | Compute pool |  |
| READ | Stage | This is the stage where the specification is stored. |
| READ | Image repository | Repository of images referenced by the specification. |
| BIND SERVICE ENDPOINT | Account | A role must have this privilege to create a service with public endpoints. This allows the service access through the public endpoints. If the service’s owner role loses this privilege, the public endpoints will not be accessible. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When calling CREATE SERVICE, the parameters should be provided in this order: specify compute pool, followed by the service specification (either provider specification file on stage or inline specification), and then other properties.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Create a service with two service instances running:

```sqlexample
CREATE SERVICE echo_service
  IN COMPUTE POOL tutorial_compute_pool
  FROM @tutorial_stage
  SPECIFICATION_FILE='echo_spec.yaml'
  MIN_INSTANCES=2
  MAX_INSTANCES=2
```

---
title: CREATE SESSION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-session-policy.md
section: SQL Commands
---

# CREATE SESSION POLICY

Creates a new session policy or replaces an existing session policy.

A session policy defines the idle session timeout period in minutes. Administrators can optionally set different timeout values for the
Snowflake web interface and other Snowflake clients.

After creating a session policy, apply the session policy to your Snowflake account using an [ALTER ACCOUNT](alter-account.md)
statement or a user using an [ALTER USER](alter-user.md) statement.

See also:
:   [Session Policy DDL Reference](../../user-guide/session-policies-managing.md)

## Syntax

```sqlsyntax
CREATE [OR REPLACE] SESSION POLICY [IF NOT EXISTS] <name>
  [ SESSION_IDLE_TIMEOUT_MINS = <integer> ]
  [ SESSION_UI_IDLE_TIMEOUT_MINS = <integer> ]
  [ ALLOWED_SECONDARY_ROLES = ( [ { 'ALL' | <role_name> [ , <role_name> ... ] } ] ) ]
  [ BLOCKED_SECONDARY_ROLES = ( [ { 'ALL' | <role_name> [ , <role_name> ... ] } ] ) ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the session policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`SESSION_IDLE_TIMEOUT_MINS = integer`
:   For Snowflake clients and programmatic clients, the number of minutes in which a session can be idle before users must authenticate to
    Snowflake again. If a value is not specified, Snowflake uses the default value.

    The number of minutes can be any integer between `5` and `1440`, inclusive.

    Default: `240` (4 hours)

`SESSION_UI_IDLE_TIMEOUT_MINS = integer`
:   For Snowsight, the number of minutes in which a session can be idle before a user must authenticate to Snowflake again. If a
    value is not specified, Snowflake uses the default value.

    The number of minutes can be any integer between `5` and `1440`, inclusive.

    Default: `240` (4 hours)

`ALLOWED_SECONDARY_ROLES = ( [ { 'ALL' | role_name [ , role_name ... ] } ] )`
:   Specifies the allowed secondary roles for a session policy, if any.

    The possible values for the property are:

    `()`
    :   Disallows secondary roles.

    `('ALL')`
    :   Allows all secondary roles.

    `( role_name [ , role_name ... ] )`
    :   Allows the specified roles as secondary roles. The secondary roles can be user-defined account roles or system roles. Specify the
        role name as it is stored in Snowflake. For details, see [Identifier requirements](../identifiers-syntax.md).

    Default: `('ALL')`. If you do not set the property when you create a new session policy, all secondary roles are allowed.

`BLOCKED_SECONDARY_ROLES = ( [ { 'ALL' | role_name [ , role_name ... ] } ] )`
:   Specifies the blocked secondary roles for a session policy, if any. Blocked secondary roles take precedence over
    allowed secondary roles.

    The possible values for the property are:

    `()`
    :   Allows all secondary roles.

    `('ALL')`
    :   Disallows secondary roles.

    `( role_name [ , role_name ... ] )`
    :   Blocks the specified roles as secondary roles. The specified roles, and the roles granted to those roles, cannot be
        activated as secondary roles. These blocked roles can be user-defined account roles or system roles. Specify the
        role name as it is stored in Snowflake. For details, see [Identifier requirements](../identifiers-syntax.md).

    Default: `()`. If you do not set the property when you create a new session policy, all secondary roles are allowed.

`COMMENT = 'string_literal'`
:   Adds a comment or overwrites an existing comment for the session policy.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SESSION POLICY | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on session policy DDL and privileges, see [Managing session policies](../../user-guide/session-policies-managing.md).

## Usage notes

* If you want to replace an existing session policy and need to see the current definition of the policy, call the
  [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE SESSION POLICY](desc-session-policy.md) command.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a session policy for your current account:

> ```sqlexample
> CREATE SESSION POLICY session_policy_prod_1
>   SESSION_IDLE_TIMEOUT_MINS = 30
>   SESSION_UI_IDLE_TIMEOUT_MINS = 30
>   COMMENT = 'session policy for use in the prod_1 environment'
> ;
> ```

---
title: CREATE SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/create-share.md
section: SQL Commands
---

# CREATE SHARE

Creates a new, empty [share](../../user-guide/data-sharing-intro.md). Once the share is created, you can include a database and
objects from the database (schemas, tables, and views) in the share using the [GRANT <privilege> … TO SHARE](grant-privilege-share.md) command. You can then use
[ALTER SHARE](alter-share.md) to add one or more accounts to the share.

See also:
:   [DROP SHARE](drop-share.md) , [ALTER SHARE](alter-share.md) , [SHOW SHARES](show-shares.md) , [DESCRIBE SHARE](desc-share.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SHARE [ IF NOT EXISTS ] <name>
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier for the share; must be unique for the account in which the share is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the share.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SHARE | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about access control requirements for Snowflake Secure Data Sharing specifically, see
[Enable non-ACCOUNTADMIN roles to perform data sharing tasks](../../user-guide/security-access-privileges-shares.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create an empty share named `sales_s`:

> ```sqlexample
> CREATE SHARE sales_s;
> ```
>
> ```output
> +-----------------------------------------+
> | status                                  |
> |-----------------------------------------|
> | Share SALES_S successfully created.     |
> +-----------------------------------------+
> ```

After you create the share, complete it by running the following commands:

> 1. Run the [GRANT <privilege> … TO SHARE](grant-privilege-share.md) command to add a database (and objects in the database) to the share.
> 2. Run the [ALTER SHARE](alter-share.md) command to add accounts to the share.

---
title: CREATE SNAPSHOT
source: https://docs.snowflake.com/en/sql-reference/sql/create-snapshot.md
section: SQL Commands
---

# CREATE SNAPSHOT

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Creates or replaces a [snapshot of a block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md) for a specified volume and service instance. The snapshot is created in the current schema.

See also:
:   [ALTER SNAPSHOT](alter-snapshot.md), [DESCRIBE SNAPSHOT](desc-snapshot.md), [DROP SNAPSHOT](drop-snapshot.md) , [SHOW SNAPSHOTS](show-snapshots.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNAPSHOT [ IF NOT EXISTS ] <name>
  FROM SERVICE <service_name>
  VOLUME "<volume_name>"
  INSTANCE <instance_id>
  [ COMMENT = '<string_literal>']
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , ... ] ) ]
```

## Required parameters

`name`
:   String that specifies the identifier (that is, name) for the snapshot; must be unique for the schema in which the snapshot is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FROM SERVICE service_name`
:   Specifies the name of the service.

`VOLUME "volume_name"`
:   Specifies the name of the volume associated with the service. Snapshots can only be taken for block storage volumes (and not for local, memory, or stage volumes).

    Volume names are case-sensitive. Therefore, double quotes should always be used to match the corresponding name in the service specification.

`INSTANCE instance_id`
:   Index of the service instance. The service instance index starts at 0 and the range is `[0, ...,  MAX_INSTANCES - 1]`. You can call the [SYSTEM$GET_SERVICE_STATUS — Deprecated](../functions/system_get_service_status.md) function to get the relevant information.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the service.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SNAPSHOT | Schema |  |
| OPERATE | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Snowflake deletes job services approximately 10 minutes after its execution completes. To preserve the content of a block storage volume used by the job service, you must create a snapshot before Snowflake deletes the job. For example, you might use a stored procedure to first execute a job service and create a snapshot immediately following it.
* A schema cannot contain snapshots with the same name. When creating a snapshot, if a snapshot with the same name already exists in the schema, an error is returned and the snapshot is not created, unless the optional `OR REPLACE` keyword is included in the command, in which case Snowflake deletes the existing snapshot and creates a new snapshot.

  > **Important:**
  >
  > A snapshot deleted using the DROP SNAPSHOT or the CREATE OR REPLACE SNAPSHOT command cannot be restored. For volumes containing critical data, create new snapshots with unique names, such as using timestamps in the snapshot name.

## Examples

If you create a service with two instances (the number of containers isn’t relevant) with a volume named “data”, you would create a snapshot of the volume associated with the first instance using the following SQL:

```sqlexample
CREATE SNAPSHOT snapshot_0
  FROM SERVICE example_service
  VOLUME "data"
  INSTANCE 0
  COMMENT='new snapshot';
```

To create a snapshot of the volume associated with the second service instance, you specify `INSTANCE 1` in the preceding SQL.

---
title: CREATE SNAPSHOT POLICY — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/create-snapshot-policy.md
section: SQL Commands
---

# CREATE SNAPSHOT POLICY — *Deprecated*

Creates a [snapshot](../../user-guide/backups.md) policy.
You associate the policy with one or more snapshot sets.
The settings in the policy define the schedule and expiration periods for each snapshot sets
that uses the policy.

The schedule determines how often Snowflake automatically makes a backup and adds the resulting snapshot
to the snapshot set that’s governed by the policy.
The expiration period determines how long each snapshot is retained before Snowflake automatically deletes it from the
associated snapshot set.

> **Tip:**
>
> The snapshot policy is optional for a snapshot set. If you don’t need scheduled backups, a retention lock,
> or an expiration period, you can create a snapshot set without a snapshot policy. You can also use
> ALTER SNAPSHOT SET to apply a snapshot policy later to an existing snapshot set, or to suspend and resume
> the scheduled backups specified in the snapshot policy.

See also:
:   [ALTER SNAPSHOT POLICY — Deprecated](alter-snapshot-policy.md),
    [DROP SNAPSHOT POLICY — Deprecated](drop-snapshot-policy.md),
    [SHOW SNAPSHOT POLICIES — Deprecated](show-snapshot-policies.md),
    [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md)
    [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNAPSHOT POLICY [ IF NOT EXISTS ] <name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH RETENTION LOCK ]
   [ SCHEDULE = '{ <num> MINUTE | <num> HOUR | USING CRON <expr> <time_zone> }' ]
   [ EXPIRE_AFTER_DAYS = <days_integer> ]
   [ COMMENT = <string> ]
```

## Required parameters

`name`
:   Identifier for the snapshot policy; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`OR REPLACE`
:   If a snapshot policy with this name already exists, delete it and create a new one.
    This clause is mutually exclusive with `IF NOT EXISTS`.

`IF NOT EXISTS`
:   Creates the snapshot policy only if there isn’t a snapshot policy with the same name.
    If a snapshot policy already exists, the command returns a success message even though it has no effect.
    This clause is mutually exclusive with `OR REPLACE`.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH RETENTION LOCK`
:   Specifies the mandatory retention period for snapshots. Snapshots with retention locks
    can’t be deleted, even by a privileged user.
    For more information, see the [restrictions for a snapshot with a retention lock](../../user-guide/backups.md).

    > **Note:**
    >
    > Only a user with the APPLY SNAPSHOT RETENTION LOCK privilege can create a snapshot policy with retention lock.

    > **Important:**
    >
    > Applying a snapshot policy with a retention lock to a snapshot set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a snapshot set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a snapshot set with a long expiration period, to avoid unexpected storage charges
    > for undeletable snapshot sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all snapshots, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`SCHEDULE = '{ num MINUTE | num HOUR | USING CRON expr time_zone }'`
:   Specifies the schedule for creating snapshots of an object.

    > **Note:**
    >
    > The minimum schedule for snapshots must be 60 minutes or 1 hour.
    >
    > Each snapshot policy must have one or both of the schedule and expiration period properties.
    > For more information, see [Backup policy](../../user-guide/backups.md).

    * `USING CRON expr time_zone`
      :   Specifies a cron expression and time zone for the point in time a snapshot of an object is created. Supports a subset of
          standard cron utility syntax.

          For a list of time zones, see the [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones)
          (in Wikipedia).

          The cron expression consists of the following fields:

          ```output
          # __________ minute (0-59)
          # | ________ hour (0-23)
          # | | ______ day of month (1-31, or L)
          # | | | ____ month (1-12, JAN-DEC)
          # | | | | __ day of week (0-6, SUN-SAT, or L)
          # | | | | |
          # | | | | |
            * * * * *
          ```

          The following special characters are supported:

          `*`
          :   Wildcard. Specifies any occurrence of the field.

          `L`
          :   Stands for “last”. When used in the day-of-week field, it lets you specify constructs such as “the last Friday” (“5L”) of a
              given month. In the day-of-month field, it specifies the last day of the month.

          `/n`
          :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
              specified in the month field, then the snapshot is scheduled for April, July and October (that is, every 3 months, starting with the 4th
              month of the year). The same schedule is maintained in subsequent years. That is, the snapshot is not scheduled to run in
              January (3 months after the October run).

          > **Note:**
          > + The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value
          >   for the account (or setting the value at the user or session level) does not change the time zone for the snapshot.
          > + The cron expression defines all valid run times for the snapshot. Snowflake attempts to create a snapshot based on
          >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
          > + When both a specific day of month and day of week are included in the cron expression, then the snapshot is scheduled on days
          >   satisfying either the day of the month or the day of the week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
          >   schedules a snapshot at 0AM (midnight) on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
    * `num MINUTE`
      :   Specifies an interval (in minutes) of wait time between snapshots. Accepts positive integers only.

          Also supports `num M` syntax.
    * `num HOUR` or `num HOURS`
      :   Specifies an interval (in hours) of wait time between backups. Accepts positive integers only.

          Also supports `num H` syntax.

    To avoid ambiguity, a *base interval time* is set in the following circumstances:

    * When the object is created (using CREATE BACKUP SET … WITH BACKUP POLICY).
    * When a different interval is set (using ALTER BACKUP SET … APPLY BACKUP POLICY or
      ALTER BACKUP POLICY … SET SCHEDULE).

    The base interval time starts the interval counter from the current clock time. For example, if an
    INTERVAL value of `10 MINUTES` is set and the scheduled backup is enabled at 9:03 AM, then the next backup
    is created at 9:13 AM, 9:23 AM, and so on. Note that we make a best effort to ensure absolute
    precision, but only guarantee that a backup does not execute before the set interval occurs
    (that is, in the current example, the backup could first run at 9:14 AM, but will definitely not run
    at 9:12 AM).

`EXPIRE_AFTER_DAYS = days_integer`
:   Specifies the number of days until the snapshot expires. Snowflake automatically deletes expired snapshots.
    If this parameter is not specified, snapshots remain in the snapshot set until they are manually deleted from the set.

    * Minimum value: `1`.
    * Maximum value: `3653` (roughly 10 years) if you don’t specify the `SCHEDULE` clause.

    > **Note:**
    >
    > Each snapshot policy must have one or both of the schedule and expiration period properties.
    > For more information, see [Backup policy](../../user-guide/backups.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the snapshot policy.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| CREATE SNAPSHOT POLICY | The role used to create a snapshot policy must have this privilege on the schema in which the policy is created. |
| APPLY SNAPSHOT RETENTION LOCK | Only a user with this privilege on the account can create a snapshot policy with retention lock. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* [Time Travel and Failsafe](../../user-guide/data-time-travel.md) retention do not apply to snapshots. A snapshot can’t be
  recovered after it expires.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the snapshot policy has a retention lock applied to it, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Examples

Create a snapshot policy that creates a snapshot every hour and expires after 90 days:

```sqlexample
CREATE SNAPSHOT POLICY hourly_snapshot_policy
  SCHEDULE = '60 MINUTE'
  EXPIRE_AFTER_DAYS = 90
  COMMENT = 'Hourly snapshots that expire after 90 days';
```

Create a snapshot policy with a retention lock that creates a snapshot every 24 hours and expires after 90 days. The snapshots
created using this snapshot policy can’t be modified or deleted before the expiration period ends:

```sqlexample
CREATE SNAPSHOT POLICY daily_snapshot_policy_with_lock
  WITH RETENTION LOCK
  SCHEDULE = '1440 MINUTE'
  EXPIRE_AFTER_DAYS = 90
  COMMENT = 'regulatory backups expire after 90 days with retention lock';
```

Create a snapshot policy using a cron expression for the schedule. The following statement creates a policy that creates snapshots
every Tuesday and Friday of the week at 11PM:

```sqlexample
CREATE SNAPSHOT POLICY twice_weekly_snapshot_policy
  SCHEDULE = 'USING CRON 0 23 * * 2,5 UTC'
  EXPIRE_AFTER_DAYS = 7
  COMMENT = 'Twice-weekly snapshots that expire after 7 days';
```

---
title: CREATE SNAPSHOT SET — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/create-snapshot-set.md
section: SQL Commands
---

# CREATE SNAPSHOT SET — *Deprecated*

Creates a [snapshot](../../user-guide/backups.md) set for a table, a schema, or a database. Once the snapshot set exists, you can
add a new backup (snapshot) to the snapshot set at any time by running an ALTER SNAPSHOT SET command. Snowflake also adds snapshots
to the snapshot set automatically, if you defined a schedule in a [snapshot policy](../../user-guide/backups.md)
and associated that snapshot policy with the snapshot set.

Each snapshot set represents a set of backups for a specific table, or the objects in a
specific schema, or the objects in a specific database. That way, you can make your backups
very granular or very comprehensive. And the backups for each table, schema, or database can
have their own independent schedules.

For the kinds of objects that are included in schema snapshots and database snapshots, see
[Backup objects](../../user-guide/backups.md).

See also:
:   [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md),
    [DROP SNAPSHOT SET — Deprecated](drop-snapshot-set.md),
    [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md),
    [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNAPSHOT SET [ IF NOT EXISTS ] <name>
   FOR [ DYNAMIC ] TABLE <table_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH SNAPSHOT POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

```sqlsyntax
CREATE [ OR REPLACE ] SNAPSHOT SET [ IF NOT EXISTS ] <name>
  FOR SCHEMA <schema_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH SNAPSHOT POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

```sqlsyntax
CREATE [ OR REPLACE ] SNAPSHOT SET [ IF NOT EXISTS ] <name>
  FOR DATABASE <database_name>
   [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
   [ WITH SNAPSHOT POLICY <policy_name> ]
   [ COMMENT = <string> ]
```

## Required parameters

`name`
:   Identifier for the snapshot set; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`FOR [ DYNAMIC ] TABLE table_name`
:   Specifies the name of the table or dynamic table. In that case, the snapshot set represents backups
    of a single table.

`FOR SCHEMA schema_name`
:   Specifies the name of the schema. In that case, the snapshot set represents backups
    of all the tables and other objects in a specific schema.

`FOR DATABASE database_name`
:   Specifies the name of the database. In that case, the snapshot set represents backups
    of all the tables, schemas, and other objects in a specific database.

## Optional parameters

`OR REPLACE`
:   If a snapshot set with this name already exists, delete it and create a new one.
    If the snapshot set can’t be deleted because of snapshot policy rules for retention locks,
    legal holds, and expiry times, the command fails.
    This clause is mutually exclusive with `IF NOT EXISTS`.

`IF NOT EXISTS`
:   Creates the snapshot set only if there isn’t a snapshot set with the same name.
    If a snapshot set already exists, the command returns a success message even though it has no effect.
    This clause is mutually exclusive with `OR REPLACE`.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH SNAPSHOT POLICY policy_name`
:   Specifies the name of the snapshot policy for the set.
    The snapshot policy defines properties of the snapshot set such as the schedule for backups,
    the retention period for each snapshot, and whether to prevent snapshots from being
    removed before the end of the retention period.

    If you omit this parameter from the CREATE SNAPSHOT SET command, you can apply a
    policy later with the ALTER SNAPSHOT SET command.

    > **Important:**
    >
    > Applying a snapshot policy with a retention lock to a snapshot set is *irreversible*.
    > Due to the strong guarantees that are needed for regulatory compliance, after you put a retention lock on a snapshot set,
    > you can’t revoke the lock. Snowflake support also can’t revoke such a retention lock. Plan carefully before
    > you set a retention lock on a snapshot set with a long expiration period, to avoid unexpected storage charges
    > for undeletable snapshot sets, and the schemas and databases that contain them.
    >
    > If a Snowflake organization is deleted, the organization is no longer a Snowflake customer. In this case,
    > Snowflake deletes all snapshots, including those with retention locks. Deleting a Snowflake organization
    > requires the involvement of Snowflake support. It isn’t something that an administrator can do by accident.

`COMMENT = 'string_literal'`
:   Specifies a comment for the snapshot set.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| CREATE SNAPSHOT SET | The role used to create a snapshot set must have this privilege granted on the schema in which the snapshot set is created. To actually create the snapshot set also requires the appropriate privilege on the object that’s the subject of the snapshot set: SELECT for a table snapshot, or USAGE for a schema snapshot or database snapshot. |
| SELECT | The role used to create a snapshot set for a table must have the SELECT privilege on that table. |
| USAGE | The role used to create a snapshot set for a schema or database must have the USAGE privilege on that schema or database. |
| APPLY | The role used to apply a snapshot policy on a snapshot set must have this privilege on the snapshot policy. |
| APPLY SNAPSHOT RETENTION LOCK | The role used to apply a snapshot policy with retention lock on a snapshot set must have this privilege on the account. |

These privileges are required on the currently active primary role, not a secondary role.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Regarding metadata:

> **Attention:**
>
> Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

> **Important:**
>
> If the snapshot policy has a retention lock applied to it, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Examples

Create a snapshot set named `t1_snapshots` for table `t1`:

```sqlexample
CREATE SNAPSHOT SET t1_snapshots
  FOR TABLE t1;
```

Create a snapshot set `t1_snapshots` for table `t1` with a snapshot policy:

```sqlexample
CREATE SNAPSHOT SET t1_snapshots
  FOR TABLE t1
  WITH SNAPSHOT POLICY hourly_snapshot_policy;
```

Create a snapshot set `s1_snapshots` for schema `s1` with a snapshot policy:

```sqlexample
CREATE SNAPSHOT SET s1_snapshots
  FOR SCHEMA s1
  WITH SNAPSHOT POLICY hourly_snapshot_policy;
```

Create a snapshot set `d1_snapshots` for database `d1` with a snapshot policy:

```sqlexample
CREATE SNAPSHOT SET d1_snapshots
  FOR DATABASE d1
  WITH SNAPSHOT POLICY hourly_snapshot_policy;
```

---
title: CREATE STAGE
source: https://docs.snowflake.com/en/sql-reference/sql/create-stage.md
section: SQL Commands
---

# CREATE STAGE

Creates a new named *internal* or *external* stage to use for loading data from files into Snowflake tables and unloading data from
tables into files:

Internal stage:
:   Stores data files internally within Snowflake. For more details,
    see [Choosing an internal stage for local files](../../user-guide/data-load-local-file-system-create-stage.md).

External stage:
:   References data files stored in a location outside of Snowflake. Currently, the following cloud storage services are
    supported:

    * Amazon S3 buckets
    * Google Cloud Storage buckets
    * Microsoft Azure containers

    The storage location can be either private/protected or public.

    You cannot access data held in archival cloud storage classes that requires restoration before it can be retrieved. These archival storage classes include, for example, the Amazon S3 Glacier Flexible Retrieval or Glacier Deep Archive storage class, or Microsoft Azure Archive Storage.

An internal or external stage can include a *directory table*. [Directory tables](../../user-guide/data-load-dirtables.md) store a catalog
of staged files in cloud storage.

Additionally, this command supports the following variants:

* CREATE OR ALTER STAGE: Creates a new stage if it doesn’t exist or alters an existing stage.
* CREATE STAGE … CLONE: Creates a clone of an existing stage. For more information, see [Cloning considerations](../../user-guide/object-clone.md).

See also:
:   [DROP STAGE](drop-stage.md) , [ALTER STAGE](alter-stage.md) , [SHOW STAGES](show-stages.md) , [DESCRIBE STAGE](desc-stage.md)

    [PUT](put.md) , [COPY INTO <table>](copy-into-table.md)

    [COPY INTO <location>](copy-into-location.md) , [GET](get.md), [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
-- Internal stage
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] STAGE [ IF NOT EXISTS ] <internal_stage_name>
    internalStageParams
    directoryTableParams
  [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]

-- External stage
CREATE [ OR REPLACE ] [ { TEMP | TEMPORARY } ] STAGE [ IF NOT EXISTS ] <external_stage_name>
    externalStageParams
    directoryTableParams
  [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

Where:

```sqlsyntax
internalStageParams ::=
  [ ENCRYPTION = (   TYPE = 'SNOWFLAKE_FULL'
                   | TYPE = 'SNOWFLAKE_SSE' ) ]
```

```sqlsyntax
externalStageParams (for Amazon S3) ::=
  URL = '<protocol>://<bucket>[/<path>/]'
  [ AWS_ACCESS_POINT_ARN = '<string>' ]
  [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( {  { AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' [ AWS_TOKEN = '<string>' ] } | AWS_ROLE = '<string>'  } ) } ]
  [ ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] MASTER_KEY = '<string>'
                   | TYPE = 'AWS_SSE_S3'
                   | TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = '<string>' ]
                   | TYPE = 'NONE' ) ]
  [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
```

```sqlsyntax
externalStageParams (for Google Cloud Storage) ::=
  URL = 'gcs://<bucket>[/<path>/]'
  [ STORAGE_INTEGRATION = <integration_name> ]
  [ ENCRYPTION = (   TYPE = 'GCS_SSE_KMS' [ KMS_KEY_ID = '<string>' ]
                   | TYPE = 'NONE' ) ]
```

```sqlsyntax
externalStageParams (for Microsoft Azure) ::=
  URL = 'azure://<account>.blob.core.windows.net/<container>[/<path>/]'
  [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( [ AZURE_SAS_TOKEN = '<string>' ] ) } ]
  [ ENCRYPTION = (   TYPE = 'AZURE_CSE' MASTER_KEY = '<string>'
                   | TYPE = 'NONE' ) ]
  [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
```

```sqlsyntax
externalStageParams (for Microsoft Fabric OneLake) ::=
  URL = 'azure://onelake.blob.fabric.microsoft.com/<workspace_id>/<item_id>/Files[/<path>/]'
  [ { STORAGE_INTEGRATION = <integration_name> } | { CREDENTIALS = ( [ AZURE_SAS_TOKEN = '<string>' ] ) } ]
  [ ENCRYPTION = (   TYPE = 'AZURE_CSE' MASTER_KEY = '<string>'
                   | TYPE = 'NONE' ) ]
```

```sqlsyntax
externalStageParams (for Amazon S3-compatible Storage) ::=
  URL = 's3compat://{bucket}[/{path}/]'
  ENDPOINT = '<s3_api_compatible_endpoint>'
  [ { CREDENTIALS = ( AWS_KEY_ID = '<string>' AWS_SECRET_KEY = '<string>' ) } ]
```

```sqlsyntax
directoryTableParams (for internal stages) ::=
  [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
                  [ AUTO_REFRESH = { TRUE | FALSE } ] ) ]
```

```sqlsyntax
directoryTableParams (for Amazon S3) ::=
  [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
                  [ REFRESH_ON_CREATE =  { TRUE | FALSE } ]
                  [ AUTO_REFRESH = { TRUE | FALSE } ] ) ]
```

```sqlsyntax
directoryTableParams (for Google Cloud Storage) ::=
  [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
                  [ AUTO_REFRESH = { TRUE | FALSE } ]
                  [ REFRESH_ON_CREATE =  { TRUE | FALSE } ]
                  [ NOTIFICATION_INTEGRATION = '<notification_integration_name>' ] ) ]
```

```sqlsyntax
directoryTableParams (for Microsoft Azure) ::=
  [ DIRECTORY = ( ENABLE = { TRUE | FALSE }
                  [ REFRESH_ON_CREATE =  { TRUE | FALSE } ]
                  [ AUTO_REFRESH = { TRUE | FALSE } ]
                  [ NOTIFICATION_INTEGRATION = '<notification_integration_name>' ] ) ]
```

```sqlsyntax
formatTypeOptions ::=
-- If TYPE = CSV
     COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
     RECORD_DELIMITER = '<string>' | NONE
     FIELD_DELIMITER = '<string>' | NONE
     MULTI_LINE = TRUE | FALSE
     FILE_EXTENSION = '<string>'
     PARSE_HEADER = TRUE | FALSE
     SKIP_HEADER = <integer>
     SKIP_BLANK_LINES = TRUE | FALSE
     DATE_FORMAT = '<string>' | AUTO
     TIME_FORMAT = '<string>' | AUTO
     TIMESTAMP_FORMAT = '<string>' | AUTO
     BINARY_FORMAT = HEX | BASE64 | UTF8
     ESCAPE = '<character>' | NONE
     ESCAPE_UNENCLOSED_FIELD = '<character>' | NONE
     TRIM_SPACE = TRUE | FALSE
     FIELD_OPTIONALLY_ENCLOSED_BY = '<character>' | NONE
     NULL_IF = ( '<string>' [ , '<string>' ... ] )
     ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     EMPTY_FIELD_AS_NULL = TRUE | FALSE
     SKIP_BYTE_ORDER_MARK = TRUE | FALSE
     ENCODING = '<string>' | UTF8
-- If TYPE = JSON
     COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
     DATE_FORMAT = '<string>' | AUTO
     TIME_FORMAT = '<string>' | AUTO
     TIMESTAMP_FORMAT = '<string>' | AUTO
     BINARY_FORMAT = HEX | BASE64 | UTF8
     TRIM_SPACE = TRUE | FALSE
     MULTI_LINE = TRUE | FALSE
     NULL_IF = ( '<string>' [ , '<string>' ... ] )
     FILE_EXTENSION = '<string>'
     ENABLE_OCTAL = TRUE | FALSE
     ALLOW_DUPLICATE = TRUE | FALSE
     STRIP_OUTER_ARRAY = TRUE | FALSE
     STRIP_NULL_VALUES = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     IGNORE_UTF8_ERRORS = TRUE | FALSE
     SKIP_BYTE_ORDER_MARK = TRUE | FALSE
-- If TYPE = AVRO
     COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
     TRIM_SPACE = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     NULL_IF = ( '<string>' [ , '<string>' ... ] )
-- If TYPE = ORC
     TRIM_SPACE = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     NULL_IF = ( '<string>' [ , '<string>' ... ] )
-- If TYPE = PARQUET
     COMPRESSION = AUTO | LZO | SNAPPY | NONE
     SNAPPY_COMPRESSION = TRUE | FALSE
     BINARY_AS_TEXT = TRUE | FALSE
     USE_LOGICAL_TYPE = TRUE | FALSE
     TRIM_SPACE = TRUE | FALSE
     USE_VECTORIZED_SCANNER = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     NULL_IF = ( '<string>' [ , '<string>' ... ] )
-- If TYPE = XML
     COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE
     IGNORE_UTF8_ERRORS = TRUE | FALSE
     PRESERVE_SPACE = TRUE | FALSE
     STRIP_OUTER_ELEMENT = TRUE | FALSE
     DISABLE_AUTO_CONVERT = TRUE | FALSE
     REPLACE_INVALID_CHARACTERS = TRUE | FALSE
     SKIP_BYTE_ORDER_MARK = TRUE | FALSE
```

> **Note:**
>
> Do not specify copy options using the CREATE STAGE, ALTER STAGE, CREATE TABLE, or ALTER TABLE commands. We recommend that you use the [COPY INTO <table>](copy-into-table.md) command to specify copy options.

## Variant Syntax

### CREATE OR ALTER STAGE

Creates a new stage if it doesn’t already exist, or transforms an existing stage into the stage defined in the statement.
A CREATE OR ALTER STAGE statement follows the syntax rules of a CREATE STAGE statement and has the same limitations as an
[ALTER STAGE](alter-stage.md) statement.

For more information, see CREATE OR ALTER STAGE usage notes.

```sqlsyntax
-- Internal stage
CREATE OR ALTER [ { TEMP | TEMPORARY } ] STAGE <internal_stage_name>
    internalStageParams
    directoryTableParams
  [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
  [ COMMENT = '<string_literal>' ]

-- External stage
CREATE OR ALTER [ { TEMP | TEMPORARY } ] STAGE <external_stage_name>
    externalStageParams
    directoryTableParams
  [ FILE_FORMAT = ( { FORMAT_NAME = '<file_format_name>' | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM } [ formatTypeOptions ] } ) ]
  [ COMMENT = '<string_literal>' ]
```

### CREATE STAGE … CLONE

Creates a new stage with the same parameter values:

```sqlsyntax
CREATE [ OR REPLACE ] STAGE [ IF NOT EXISTS ] <name> CLONE <source_stage>
  [ ... ]
```

For more details, see [CREATE <object> … CLONE](create-clone.md).

## Required parameters

`internal_stage_name` or . `external_stage_name`
:   Specifies the identifier for the stage; must be unique for the schema in which the stage is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

> **Note:**
>
> When creating an external stage, a URL is also required. For more details, see
> External Stage Parameters (in this topic).
>
> If a URL is not specified, Snowflake creates an internal stage by default.

## Optional parameters

`{ TEMP | TEMPORARY }`
:   Specifies that the stage created is temporary and will be dropped at the end of the session in which it was created. Note:

    * When a temporary *external* stage is dropped, only the stage itself is dropped; the data files are not removed.
    * When a temporary *internal* stage is dropped, all of the files in the stage are purged from Snowflake, regardless of their load status.
      This prevents files in temporary internal stages from using data storage and, consequently, accruing storage charges. However, this also
      means that the staged files cannot be recovered through Snowflake once the stage is dropped.

      > **Tip:**
      >
      > If you plan to create and use temporary internal stages, you should maintain copies of your data files outside of Snowflake.

`FILE_FORMAT = ( FORMAT_NAME = 'file_format_name' )` or . `FILE_FORMAT = ( TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM [ ... ] )`
:   Specifies the file format for the stage, which can be either:

    > `FORMAT_NAME = 'file_format_name'`
    > :   Specifies an existing named file format to use for the stage. The named file format determines the format type (CSV, JSON, etc.), as
    >     well as any other format options, for the data files loaded using this stage. For more details, see [CREATE FILE FORMAT](create-file-format.md).
    >
    > `TYPE = CSV | JSON | AVRO | ORC | PARQUET | XML | CUSTOM [ ... ]`
    > :   Specifies the type of files for the stage:
    >
    >     > * Loading data from a stage (using [COPY INTO <table>](copy-into-table.md)) accommodates all of the supported format types.
    >     > * Unloading data into a stage (using [COPY INTO <location>](copy-into-location.md)) accommodates `CSV`, `JSON`, or
    >     >   `PARQUET`.
    >
    >     If a file format type is specified, additional format-specific options can be specified. For more details, see
    >     Format type options (formatTypeOptions) (in this topic).
    >
    >     The `CUSTOM` format type specifies that the underlying stage holds unstructured data and can only be used with the `FILE_PROCESSOR` copy option.
    >
    > Default: `TYPE = CSV`

> **Note:**
>
> `FORMAT_NAME` and `TYPE` are mutually exclusive; you can only specify one or the other for a stage.

`COMMENT = 'string_literal'`
:   Specifies a comment for the stage.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Internal stage parameters (`internalStageParams`)

`[ ENCRYPTION = ( TYPE = 'SNOWFLAKE_FULL' | TYPE = 'SNOWFLAKE_SSE' ) ]`
:   Specifies the type of encryption supported for all files stored on the stage. You cannot change the encryption type after you create the stage.

    `TYPE = ...`
    :   Specifies the encryption type used.

        > **Important:**
        >
        > If you require Tri-Secret Secure for security compliance, use the `SNOWFLAKE_FULL` encryption type for internal stages.
        > `SNOWFLAKE_SSE` does not support Tri-Secret Secure.

        Possible values are:

        * `SNOWFLAKE_FULL`: Client-side and server-side encryption. The files are encrypted by a client when it uploads them to the internal stage
          using [PUT](put.md). Snowflake uses a 128-bit encryption key by default. You can configure a 256-bit key by setting the [CLIENT_ENCRYPTION_KEY_SIZE](../parameters.md) parameter.

          All files are also automatically encrypted using AES-256 strong encryption on the server side.
        * `SNOWFLAKE_SSE`: Server-side encryption only. The files are encrypted when they arrive on the stage by the cloud service
          where your Snowflake account is hosted.

          Specify server-side encryption if you plan to query pre-signed URLs for your staged files. For more information, see
          [Types of URLs available to access files](../../user-guide/unstructured-intro.md).

    Default: `SNOWFLAKE_FULL`

## External stage parameters (`externalStageParams`)

`URL = 'cloud_specific_url'`
:   If this parameter is omitted, Snowflake creates an internal stage

    > **Important:**
    >
    > * Enclose the URL in single quotes (`''`) in order for Snowflake to identify the string. If the quotes are omitted, any credentials
    >   you supply may be displayed in plain text in the history. We strongly recommend verifying the syntax of the CREATE STAGE statement
    >   before you execute it.
    >
    >   When you create a stage in the Snowflake web interface, the interface automatically encloses field values in quotation characters,
    >   as needed.
    > * Append a forward slash (`/`) to the URL to filter to the specified folder path. If the forward slash is omitted, all files and
    >   folders starting with the prefix for the specified path are included.
    >
    >   Note that the forward slash is required to access and retrieve unstructured data files in the stage.

    **Amazon S3**

    > `URL = 'protocol://bucket[/path/]'`
    > :   Specifies the URL for the external location (existing S3 bucket) used to store data files for loading/unloading, where:
    >
    >     * `protocol` is one of the following:
    >
    >       + `s3` refers to S3 storage in public AWS regions outside of China.
    >       + `s3china` refers to S3 storage in public AWS regions in China.
    >       + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    >
    >       Accessing cloud storage in a [government region](../../user-guide/intro-regions.md) using a storage integration is limited to Snowflake
    >       accounts hosted in the same government region.
    >
    >       Similarly, if you need to access cloud storage in a region in China, you can use a storage integration only from a Snowflake
    >       account hosted in the same region in China.
    >
    >       In these cases, use the CREDENTIALS parameter in the CREATE STAGE command (rather than using a storage
    >       integration) to provide the credentials for authentication.
    >     * `bucket` is the name of the S3 bucket or the [bucket-style alias](https://docs.aws.amazon.com/AmazonS3/latest/userguide/access-points-alias.html)
    >       for an S3 bucket access point. For an S3 access point, you must also specify a value for the
    >       `AWS_ACCESS_POINT_ARN` parameter.
    >     * `path` is an optional case-sensitive path for files in the cloud storage location (files have names that begin with
    >       a common string) that limits the set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >       services.
    >
    > `AWS_ACCESS_POINT_ARN = 'string'`
    > :   Specifies the Amazon resource name (ARN) for your S3 access point. Required only when you specify an S3 access point alias
    >     for your storage `URL`.

    **Google Cloud Storage**

    > `URL = 'gcs://bucket[/path/]'`
    > :   Specifies the URL for the external location (existing GCS bucket) used to store data files for loading/unloading, where:
    >
    >     * `bucket` is the name of the GCS bucket.
    >     * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
    >       common string) that limits the set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >       services.

    **Microsoft Azure**

    > `URL = 'azure://account.blob.core.windows.net/container[/path/]'`
    > :   Specifies the URL for the external location (existing Azure container) used to store data files for loading, where:
    >
    >     * `account` is the name of the Azure account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint for all
    >       supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
    >
    >       Note that currently, accessing Azure blob storage in [government regions](../../user-guide/intro-regions.md) using a storage
    >       integration is limited to Snowflake accounts hosted on Azure in the same government region. Accessing your blob storage from an
    >       account hosted outside of the government region using direct credentials is supported.
    >     * `container` is the name of the Azure container (e.g. `mycontainer`).
    >
    >     * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with
    >       a common string) that limits the set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
    >       services.

    **Microsoft Fabric OneLake**

    > `URL = 'azure://onelake.blob.fabric.microsoft.com/workspace_id/item_id/Files[/path/]'`
    > :   Specifies the URL for the external location in Microsoft Fabric OneLake that you use to store data files for loading or unloading, where:
    >
    >     * `onelake.blob.fabric.microsoft.com` is the global service root for OneLake. This single endpoint automatically routes
    >       requests to the correct geographical region where your data resides.
    >     * `workspace_id` is the unique 128-bit GUID of the Fabric Workspace; for example, `aab1c234-567d-8901-234e-fgh56789ij`.
    >     * `item_id` is the unique GUID of the specific Fabric item, such as a Lakehouse or Warehouse.
    >     * `Files` is the mandatory path segment for Lakehouse items. This segment points to the unmanaged section of the lake where
    >       you store raw data such as CSV, Parquet, or JSON.
    >     * `path` is an optional case-sensitive path to a specific folder or file prefix. Although optional, providing a path is
    >       recommended when loading specific datasets to improve performance and prevent accidental processing of unrelated files.

    Default: No value (an internal stage is created)

`STORAGE_INTEGRATION = integration_name` or . `CREDENTIALS = ( cloud_specific_credentials )`
:   Required only if the storage location is private/protected; not required for public buckets/containers

    **Amazon S3**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a
    >     Snowflake identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > * We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using
    >     >   the CREDENTIALS parameter when creating stages or loading data.
    >     > * Accessing S3 storage in government regions using a storage integration is limited to Snowflake accounts hosted on AWS in
    >     >   the same government region. Accessing your S3 storage from an account hosted outside of the government region using direct
    >     >   credentials is supported.
    >
    > `CREDENTIALS = ( AWS_KEY_ID = 'string' AWS_SECRET_KEY = 'string' [ AWS_TOKEN = 'string' ] )` or . `CREDENTIALS = ( AWS_ROLE = 'string' )`
    > :   Specifies the security credentials for connecting to AWS and accessing the private/protected S3 bucket where the files to
    >     load/unload are staged. For more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).
    >
    >     The credentials you specify depend on whether you associated the Snowflake access permissions for the bucket with an AWS IAM
    >     (Identity & Access Management) user or role:
    >
    >     * **IAM user:** IAM credentials are required. Temporary (aka “scoped”) credentials are generated by AWS Security Token Service
    >       (STS) and consist of three components:
    >
    >       + `AWS_KEY_ID`
    >       + `AWS_SECRET_KEY`
    >       + `AWS_TOKEN`
    >
    >       All three are required to access a private/protected bucket. After a designated period of time, temporary credentials
    >       expire and can no longer be used. You must then generate a new set of valid temporary credentials.
    >
    >       > **Important:**
    >       >
    >       > The COPY command also allows permanent (aka “long-term”) credentials to be used; however, for security reasons, Snowflake does
    >       > not recommend using them. If you must use permanent credentials, Snowflake recommends periodically generating new
    >       > permanent credentials for external stages.
    >     * **IAM role:** Omit the security credentials and access keys and, instead, identify the role using `AWS_ROLE` and specify
    >       the AWS role ARN (Amazon Resource Name).

    **Google Cloud Storage**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a
    >     Snowflake identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).

    **Microsoft Azure**

    > `STORAGE_INTEGRATION = integration_name`
    > :   Specifies the name of the storage integration used to delegate authentication responsibility for external cloud storage to a
    >     Snowflake identity and access management (IAM) entity. For more details, see [CREATE STORAGE INTEGRATION](create-storage-integration.md).
    >
    >     > **Note:**
    >     >
    >     > * We highly recommend the use of storage integrations. This option avoids the need to supply cloud storage credentials using
    >     >   the CREDENTIALS parameter when creating stages or loading data.
    >     > * Accessing Azure blob storage in [government regions](../../user-guide/intro-regions.md)
    >     >   using a storage integration is limited to Snowflake accounts hosted on Azure in the
    >     >   same government region. Accessing your blob storage from an account hosted outside
    >     >   of the government region using direct credentials is supported.
    >
    > `CREDENTIALS = ( AZURE_SAS_TOKEN = 'string' )`
    > :   Specifies the SAS (shared access signature) token for connecting to Azure and accessing the private/protected container
    >     where the files containing loaded data are staged. Credentials are generated by Azure.

    Default: No value (no credentials are provided for the external stage)

`ENCRYPTION = ( cloud_specific_encryption )`
:   Required when loading from encrypted files or unloading into encrypted files. Not required if storage location and files are unencrypted.

    Data loading:
    :   Modifies the encryption settings used to decrypt encrypted files in the storage location and extract data.

    Data unloading:
    :   Modifies the encryption settings used to encrypt files unloaded to the storage location.

    **Amazon S3**

    > `ENCRYPTION = ( [ TYPE = 'AWS_CSE' ] MASTER_KEY = 'string' | TYPE = 'AWS_SSE_S3' | TYPE = 'AWS_SSE_KMS' [ KMS_KEY_ID = 'string' ] | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AWS_CSE`: Client-side encryption (requires a `MASTER_KEY` value). Currently, the client-side
    > >       [master key](https://csrc.nist.gov/glossary/term/master_key) you provide can only be a symmetric key. When a
    > >       `MASTER_KEY` value is provided, Snowflake assumes `TYPE = AWS_CSE` (when a `MASTER_KEY` value is
    > >       provided, `TYPE` is not required).
    > >     * `AWS_SSE_S3`: Server-side encryption that requires no additional encryption settings.
    > >     * `AWS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >     For more information about the encryption types, see the AWS documentation for
    > >     [client-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/UsingClientSideEncryption.html)
    > >     or [server-side encryption](http://docs.aws.amazon.com/AmazonS3/latest/dev/serv-side-encryption.html).
    > >
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to `AWS_CSE` encryption only)
    > > :   Specifies the client-side master key used to encrypt the files in the bucket. The master key must be a 128-bit or 256-bit key
    > >     in Base64-encoded form.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `AWS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the AWS KMS-managed key used to encrypt files unloaded into the bucket. If no value
    > >     is provided, your default KMS key ID is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading.
    > >
    > > Default: `NONE`

    **Google Cloud Storage**

    > `ENCRYPTION = ( TYPE = 'GCS_SSE_KMS' [ KMS_KEY_ID = 'string' ] | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `GCS_SSE_KMS`: Server-side encryption that accepts an optional `KMS_KEY_ID` value.
    > >
    > >       For more information, see the Google Cloud documentation:
    > >
    > >       + <https://cloud.google.com/storage/docs/encryption/customer-managed-keys>
    > >       + <https://cloud.google.com/storage/docs/encryption/using-customer-managed-keys>
    > >     * `NONE`: No encryption.
    > >
    > > `KMS_KEY_ID = 'string'` (applies to `GCS_SSE_KMS` encryption only)
    > > :   Optionally specifies the ID for the Cloud KMS-managed key that is used to encrypt files unloaded into the bucket. If
    > >     no value is provided, your default KMS key ID set on the bucket is used to encrypt files on unload.
    > >
    > >     Note that this value is ignored for data loading. The load operation should succeed if the service account has sufficient
    > >     permissions to decrypt data in the bucket.
    > >
    > > Default: `NONE`

    **Microsoft Azure**

    > `ENCRYPTION = ( TYPE = 'AZURE_CSE' MASTER_KEY = 'string' | TYPE = 'NONE' )`
    >
    > > `TYPE = ...`
    > > :   Specifies the encryption type used. Possible values are:
    > >
    > >     * `AZURE_CSE`: Client-side encryption (requires a MASTER_KEY value). For information, see the
    > >       [Client-side encryption information](https://docs.microsoft.com/en-us/azure/storage/common/storage-client-side-encryption)
    > >       in the Microsoft Azure documentation.
    > >     * `NONE`: No encryption.
    > >
    > > `MASTER_KEY = 'string'` (applies to AZURE_CSE encryption only)
    > > :   Specifies the client-side master key used to encrypt or decrypt files. The master key must be a 128-bit or 256-bit key in
    > >     Base64-encoded form.
    > >
    > > Default: `NONE`

`USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
:   Specifies whether to use [private connectivity](../../user-guide/private-connectivity-outbound.md) for an external stage to harden your
    security posture.

    If the external stage uses a storage integration, and that integration is configured for private connectivity, set this parameter to
    FALSE.

    For information about using this parameter, see one of the following:

    * [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md).
    * [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

## External stage parameters for Amazon S3-compatible storage (`externalStageParams`)

> `URL = 's3compat://bucket[/path/]'`
> :   Specifies the URL for the external location (existing bucket accessed using an S3-compatible API endpoint) used to store data files, where:
>
>     * `bucket` is the name of the bucket.
>     * `path` is an optional case-sensitive path (or *prefix* in S3 terminology) for files in the cloud storage location (i.e. files with names that begin with a common string).
>
> `ENDPOINT = 's3_api_compatible_endpoint'`
> :   Fully-qualified domain that points to the S3-compatible API endpoint.

## Directory table parameters (`directoryTableParams`)

### Internal named stages

`ENABLE = { TRUE | FALSE }`
:   Specifies whether to enable a [directory table](../../user-guide/data-load-dirtables.md) on the internal named stage.

    Default: `FALSE`

`AUTO_REFRESH = { TRUE | FALSE }`
:   Specifies whether Snowflake should automatically refresh the directory table metadata when new or updated
    data files are available on the [internal named stage](../../user-guide/data-load-local-file-system-create-stage.md).

    `TRUE`
    :   Snowflake automatically refreshes the directory table metadata.

    `FALSE`
    :   Snowflake does not automatically refresh the directory table metadata. You must manually refresh the directory
        table metadata periodically using [ALTER STAGE](alter-stage.md) … REFRESH to synchronize the metadata with the current
        list of files in the stage path.

    Default: `FALSE`

### External stages

> **Amazon S3**
>
> > `ENABLE = { TRUE | FALSE }`
> > :   Specifies whether to add a [directory table](../../user-guide/data-load-dirtables.md) to the stage. When the value is TRUE, a directory table is created with the stage.
> >
> >     Default: `FALSE`
> >
> > `REFRESH_ON_CREATE = { TRUE | FALSE }`
> > :   Specifies whether to automatically refresh the directory table metadata once, immediately after the stage is
> >     created. Refreshing the directory table metadata synchronizes the metadata with the current list of data files
> >     in the specified stage path. This action is required for the metadata to register any existing data
> >     files in the named stage specified in the `URL =` setting.
> >
> >     `TRUE`
> >     :   Snowflake automatically refreshes the directory table metadata once after the stage creation.
> >
> >         > **Note:**
> >         >
> >         > If the specified cloud storage URL contains close to 1 million files or more, we recommend that you
> >         > set `REFRESH_ON_CREATE = FALSE`. After creating the stage, refresh the directory table metadata
> >         > incrementally by executing ALTER STAGE … REFRESH statements that specify subpaths in
> >         > the storage location (i.e. subsets of files to include in the refresh) until the metadata includes
> >         > all of the files in the location.
> >
> >     `FALSE`
> >     :   Snowflake does not automatically refresh the directory table metadata. To register any data files that
> >         exist in the stage, you must manually refresh the directory table metadata once using [ALTER STAGE](alter-stage.md) … REFRESH.
> >
> >     Default: `TRUE`
> >
> > `AUTO_REFRESH = { TRUE | FALSE }`
> > :   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated
> >     data files are available in the named external stage specified in the URL value.
> >
> >     `TRUE`
> >     :   Snowflake enables triggering automatic refreshes of the directory table metadata.
> >
> >     `FALSE`
> >     :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory
> >         table metadata periodically using [ALTER STAGE](alter-stage.md) … REFRESH to synchronize the metadata with the current
> >         list of files in the stage path.
> >
> >     Default: `FALSE`
>
> **Google Cloud Storage**
>
> > `ENABLE = { TRUE | FALSE }`
> > :   Specifies whether to add a [directory table](../../user-guide/data-load-dirtables.md) to the stage. When the value is TRUE, a directory table is created with the stage.
> >
> >     Default: `FALSE`
> >
> > `REFRESH_ON_CREATE = { TRUE | FALSE }`
> > :   Specifies whether to automatically refresh the directory table metadata once, immediately after the stage is
> >     created. Refreshing the directory table metadata synchronizes the metadata with the current list of data files
> >     in the specified stage path. This action is required for the metadata to register any existing data
> >     files in the named stage specified in the `URL =` setting.
> >
> >     `TRUE`
> >     :   Snowflake automatically refreshes the directory table metadata once after the stage creation.
> >
> >         > **Note:**
> >         >
> >         > If the specified cloud storage URL contains close to 1 million files or more, we recommend that you
> >         > set `REFRESH_ON_CREATE = FALSE`. After creating the stage, refresh the directory table metadata
> >         > incrementally by executing ALTER STAGE … REFRESH statements that specify subpaths in
> >         > the storage location (i.e. subsets of files to include in the refresh) until the metadata includes
> >         > all of the files in the location.
> >
> >     `FALSE`
> >     :   Snowflake does not automatically refresh the directory table metadata. To register any data files that
> >         exist in the stage, you must manually refresh the directory table metadata once using [ALTER STAGE](alter-stage.md) … REFRESH.
> >
> >     Default: `TRUE`
> >
> > `AUTO_REFRESH = { TRUE | FALSE }`
> > :   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated
> >     data files are available in the named external stage specified in the `[ WITH ] LOCATION =` setting.
> >
> >     `TRUE`
> >     :   Snowflake enables triggering automatic refreshes of the directory table metadata.
> >
> >     `FALSE`
> >     :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory
> >         table metadata periodically using [ALTER STAGE](alter-stage.md) … REFRESH to synchronize the metadata with the current
> >         list of files in the stage path.
> >
> > `NOTIFICATION_INTEGRATION = 'notification_integration_name'`
> > :   Specifies the name of the notification integration used to automatically refresh the directory table metadata using GCS Pub/Sub
> >     notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud
> >     message queuing services.
>
> **Microsoft Azure and OneLake**
>
> > `ENABLE = { TRUE | FALSE }`
> > :   Specifies whether to add a [directory table](../../user-guide/data-load-dirtables.md) to the stage. When the value is TRUE, a directory table is created with the stage.
> >
> >     Default: `FALSE`
> >
> > `REFRESH_ON_CREATE = { TRUE | FALSE }`
> > :   Specifies whether to automatically refresh the directory table metadata once, immediately after the stage is
> >     created. Refreshing the directory table metadata synchronizes the metadata with the current list of data files
> >     in the specified stage path. This action is required for the metadata to register any existing data
> >     files in the named stage specified in the `URL =` setting.
> >
> >     `TRUE`
> >     :   Snowflake automatically refreshes the directory table metadata once after the stage creation.
> >
> >         > **Note:**
> >         >
> >         > If the specified cloud storage URL contains close to 1 million files or more, we recommend that you
> >         > set `REFRESH_ON_CREATE = FALSE`. After creating the stage, refresh the directory table metadata
> >         > incrementally by executing ALTER STAGE … REFRESH statements that specify subpaths in
> >         > the storage location (i.e. subsets of files to include in the refresh) until the metadata includes
> >         > all of the files in the location.
> >
> >     `FALSE`
> >     :   Snowflake does not automatically refresh the directory table metadata. To register any data files that
> >         exist in the stage, you must manually refresh the directory table metadata once using [ALTER STAGE](alter-stage.md) … REFRESH.
> >
> >     Default: `TRUE`
> >
> > `AUTO_REFRESH = { TRUE | FALSE }`
> > :   Specifies whether Snowflake should enable triggering automatic refreshes of the directory table metadata when new or updated
> >     data files are available in the named external stage specified in the `[ WITH ] LOCATION =` setting.
> >
> >     `TRUE`
> >     :   Snowflake enables triggering automatic refreshes of the directory table metadata.
> >
> >     `FALSE`
> >     :   Snowflake does not enable triggering automatic refreshes of the directory table metadata. You must manually refresh the directory
> >         table metadata periodically using [ALTER STAGE](alter-stage.md) … REFRESH to synchronize the metadata with the current
> >         list of files in the stage path.
> >
> >     > **Note:**
> >     >
> >     > Automatic refresh isn’t supported for Microsoft Fabric OneLake stages.
> >
> >     Default: `FALSE`
> >
> > `NOTIFICATION_INTEGRATION = 'notification_integration_name'`
> > :   Specifies the name of the notification integration used to automatically refresh the directory table metadata using Azure Event Grid
> >     notifications. A notification integration is a Snowflake object that provides an interface between Snowflake and third-party cloud
> >     message queuing services.

## Format type options (`formatTypeOptions`)

Depending on the file format type specified (`FILE_FORMAT = ( TYPE = ... )`), you can include one or more of the following format-specific options (separated by blank spaces, commas, or new lines):

### TYPE = CSV

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified when loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`RECORD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate records in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   Data loading:
        :   New line character. Note that “new line” is logical such that `\r\n` will be understood as a new line for files on a Windows platform.

        Data unloading:
        :   New line character (`\n`).

`FIELD_DELIMITER = 'string' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   One or more singlebyte or multibyte characters that separate fields in an input file (data loading) or unloaded file (data unloading). Accepts common escape sequences or the following singlebyte or multibyte characters:

        Singlebyte characters:
        :   Octal values (prefixed by `\\`) or hex values (prefixed by `0x` or `\x`). For example, for records delimited by the circumflex accent (`^`) character, specify the octal (`\\136`) or hex (`0x5e`) value.

        Multibyte characters:
        :   Hex values (prefixed by `\x`). For example, for records delimited by the cent (`¢`) character, specify the hex (`\xC2\xA2`) value.

            The delimiter for RECORD_DELIMITER or FIELD_DELIMITER cannot be a substring of the delimiter for the other file format option (For example, `FIELD_DELIMITER = 'aa' RECORD_DELIMITER = 'aabb'`).

            > > **Note:**
            > >
            > > For non-ASCII characters, you must use the hex byte sequence value to get a deterministic behavior.

        The specified delimiter must be a valid UTF-8 character and not a random sequence of bytes. Also note that the delimiter is limited to a maximum of 20 characters.

        Also accepts a value of `NONE`.

    Default:
    :   comma (`,`)

`MULTI_LINE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and the specified record delimiter is present within a CSV field, the record containing the field will be interpreted as an error.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > If you are loading large uncompressed CSV files (greater than 128MB) that follow the RFC4180 specification, Snowflake supports parallel scanning of these CSV files when MULTI_LINE is set to `FALSE`, COMPRESSION is set to `NONE`, and ON_ERROR is set to `ABORT_STATEMENT` or `CONTINUE`.

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.csv[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

    > **Note:**
    >
    > If the `SINGLE` copy option is `TRUE`, then the COPY command unloads a file without a file extension by default. To specify a file extension, provide a file name and extension in the
    > `internal_location` or `external_location` path (For example, `copy into @stage/data.csv`).

`PARSE_HEADER = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to use the first row headers in the data files to determine column names.

    This file format option is applied to the following actions only:

    > * Automatically detecting column definitions by using the INFER_SCHEMA function.
    > * Loading CSV data into separate columns by using the INFER_SCHEMA function and MATCH_BY_COLUMN_NAME copy option.

    If the option is set to TRUE, the first row headers will be used to determine column names. The default value FALSE will return column names as c\*, where \* is the position of the column.

    > **Note:**
    >
    > * This option isn’t supported for external tables.
    > * The SKIP_HEADER option isn’t supported if you set `PARSE_HEADER = TRUE`.

    Default:
    :   `FALSE`

`SKIP_HEADER = integer`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Number of lines at the start of the file to skip.

    Note that SKIP_HEADER does not use the RECORD_DELIMITER or FIELD_DELIMITER values to determine what a header line is; rather, it simply skips the specified number of CRLF (Carriage Return, Line Feed)-delimited lines in the file. RECORD_DELIMITER and FIELD_DELIMITER are then used to determine the rows of data to load.

    Default:
    :   `0`

`SKIP_BLANK_LINES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to skip any blank lines encountered in the data files; otherwise, blank lines produce an end-of-record error (default behavior).

    Default:
    :   `FALSE`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of date values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) (data loading) or [DATE_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of time values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) (data loading) or [TIME_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the format of timestamp values in the data files (data loading) or table (data unloading). If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) (data loading) or [TIMESTAMP_OUTPUT_FORMAT](../parameters.md) (data unloading) parameter is used.

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading and unloading

    Definition:
    :   Defines the encoding format for binary input or output. The option can be used when loading data into or unloading data from binary columns in a table.

    Default:
    :   `HEX`

`ESCAPE = 'character' | NONE`
:   Use:
    :   Data loading and unloading

    Definition:
    :   A singlebyte character string used as the escape character for enclosed or unenclosed field values. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_OPTIONALLY_ENCLOSED_BY` character in the data as literals.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for enclosed fields only. Specify the character used to enclose fields by setting `FIELD_OPTIONALLY_ENCLOSED_BY`.

        > **Note:**
        >
        > This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        > as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        > the option value.
        >
        > In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        > option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If this option is set, it overrides the escape character set for `ESCAPE_UNENCLOSED_FIELD`.

    Default:
    :   `NONE`

`ESCAPE_UNENCLOSED_FIELD = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   A singlebyte character string used as the escape character for unenclosed field values only. An escape character invokes an alternative interpretation on subsequent characters in a character sequence. You can use the ESCAPE character to interpret instances of the `FIELD_DELIMITER` or `RECORD_DELIMITER` characters in the data as literals. The escape character can also be used to escape instances of itself in the data.

        Accepts common escape sequences, octal values, or hex values.

    Loading data:
    :   Specifies the escape character for unenclosed fields only.

        > **Note:**
        >
        > * The default value is `\\`. If a row in a data file ends in the backslash (`\`) character, this character escapes the newline or
        >   carriage return character specified for the `RECORD_DELIMITER` file format option. As a result, the load operation treats
        >   this row and the next row as a single row of data. To avoid this issue, set the value to `NONE`.
        > * This file format option supports singlebyte characters only. Note that UTF-8 character encoding represents high-order ASCII characters
        >   as multibyte characters. If your data file is encoded with the UTF-8 character set, you cannot specify a high-order ASCII character as
        >   the option value.
        >
        >   In addition, if you specify a high-order ASCII character, we recommend that you set the `ENCODING = 'string'` file format
        >   option as the character encoding for your data files to ensure the character is interpreted correctly.

    Unloading data:
    :   If `ESCAPE` is set, the escape character set for that file format option overrides this option.

    Default:
    :   backslash (`\\`)

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove white space from fields.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        As another example, if leading or trailing spaces surround quotes that enclose strings, you can remove the surrounding spaces using this option and the quote character using the
        `FIELD_OPTIONALLY_ENCLOSED_BY` option. Note that any spaces within the quotes are preserved. For example, assuming `FIELD_DELIMITER = '|'` and `FIELD_OPTIONALLY_ENCLOSED_BY = '"'`:

        ```sqlexample
        |"Hello world"|    /* loads as */  >Hello world<
        |" Hello world "|  /* loads as */  > Hello world <
        | "Hello world" |  /* loads as */  >Hello world<
        ```

        (the brackets in this example are not loaded; they are used to demarcate the beginning and end of the loaded strings)

    Default:
    :   `FALSE`

`FIELD_OPTIONALLY_ENCLOSED_BY = 'character' | NONE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   Character used to enclose strings. Value can be `NONE`, single quote character (`'`), or double quote character (`"`). To use the single quote character, use the octal or hex representation (`0x27`) or the double single-quoted escape (`''`).

        Data unloading only:
        :   When a field in the source table contains this character, Snowflake escapes it using the same character for unloading. For example, if the value is the double quote character and a field contains the string `A "B" C`, Snowflake escapes the double quotes for unloading as follows:

            `A ""B"" C`

    Default:
    :   `NONE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   String used to convert to and from SQL NULL:

        * When loading data, Snowflake replaces these values in the data load source with SQL NULL. To specify more than one string, enclose
          the list of strings in parentheses and use commas to separate each value.

          Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as
          a value, all instances of `2` as either a string or number are converted.

          For example:

          `NULL_IF = ('\N', 'NULL', 'NUL', '')`

          Note that this option can include empty strings.
        * When unloading data, Snowflake converts SQL NULL values to the first value in the list.

    Default:
    :   `\N` (that is, NULL, which assumes the `ESCAPE_UNENCLOSED_FIELD` value is `\\`)

`ERROR_ON_COLUMN_COUNT_MISMATCH = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to generate a parsing error if the number of delimited columns (i.e. fields) in an input file does not match the number of columns in the corresponding table.

        If set to `FALSE`, an error is not generated and the load continues. If the file is successfully loaded:

        * If the input file contains records with more fields than columns in the table, the matching fields are loaded in order of occurrence in the file and the remaining fields are not loaded.
        * If the input file contains records with fewer fields than columns in the table, the non-matching columns in the table are loaded with NULL values.

        This option assumes all the records within the input file are the same length (i.e. a file containing records of varying length return an error regardless of the value specified for this parameter).

    Default:
    :   `TRUE`

    > **Note:**
    >
    > When [transforming data during loading](../../user-guide/data-load-transform.md) (i.e. using a query as the source for the COPY command), this option is ignored. There is no requirement for your data files to have
    > the same number and ordering of columns as your target table.

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`).

    If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`EMPTY_FIELD_AS_NULL = TRUE | FALSE`
:   Use:
    :   Data loading, data unloading, and external tables

    Definition:
    :   * When loading data, specifies whether to insert SQL NULL for empty fields in an input file, which are represented by two successive delimiters (For example, `,,`).

          If set to `FALSE`, Snowflake attempts to cast an empty field to the corresponding column type. An empty string is inserted into columns of type STRING. For other column types, the COPY command produces an error.
        * When unloading data, this option is used in combination with `FIELD_OPTIONALLY_ENCLOSED_BY`. When `FIELD_OPTIONALLY_ENCLOSED_BY = NONE`, setting `EMPTY_FIELD_AS_NULL = FALSE` specifies to unload empty strings in tables to empty string values without quotes enclosing the field values.

          If set to `TRUE`, `FIELD_OPTIONALLY_ENCLOSED_BY` must specify a character to enclose strings.

    Default:
    :   `TRUE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

`ENCODING = 'string'`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String (constant) that specifies the character set of the source data when loading data into a table.

        | Character Set | `ENCODING` Value | Supported Languages | Notes |
        | --- | --- | --- | --- |
        | Big5 | `BIG5` | Traditional Chinese |  |
        | EUC-JP | `EUCJP` | Japanese |  |
        | EUC-KR | `EUCKR` | Korean |  |
        | GB18030 | `GB18030` | Chinese |  |
        | IBM420 | `IBM420` | Arabic |  |
        | IBM424 | `IBM424` | Hebrew |  |
        | IBM949 | `IBM949` | Korean |  |
        | ISO-2022-CN | `ISO2022CN` | Simplified Chinese |  |
        | ISO-2022-JP | `ISO2022JP` | Japanese |  |
        | ISO-2022-KR | `ISO2022KR` | Korean |  |
        | ISO-8859-1 | `ISO88591` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | ISO-8859-2 | `ISO88592` | Czech, Hungarian, Polish, Romanian |  |
        | ISO-8859-5 | `ISO88595` | Russian |  |
        | ISO-8859-6 | `ISO88596` | Arabic |  |
        | ISO-8859-7 | `ISO88597` | Greek |  |
        | ISO-8859-8 | `ISO88598` | Hebrew |  |
        | ISO-8859-9 | `ISO88599` | Turkish |  |
        | ISO-8859-15 | `ISO885915` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish | Identical to ISO-8859-1 except for 8 characters, including the Euro currency symbol. |
        | KOI8-R | `KOI8R` | Russian |  |
        | Shift_JIS | `SHIFTJIS` | Japanese |  |
        | UTF-8 | `UTF8` | All languages | For loading data from delimited files (CSV, TSV, etc.), UTF-8 is the default. . . For loading data from all other supported file formats (JSON, Avro, etc.), as well as unloading data, UTF-8 is the only supported character set. |
        | UTF-16 | `UTF16` | All languages |  |
        | UTF-16BE | `UTF16BE` | All languages |  |
        | UTF-16LE | `UTF16LE` | All languages |  |
        | UTF-32 | `UTF32` | All languages |  |
        | UTF-32BE | `UTF32BE` | All languages |  |
        | UTF-32LE | `UTF32LE` | All languages |  |
        | windows-874 | `WINDOWS874` | Thai |  |
        | windows-949 | `WINDOWS949` | Korean |  |
        | windows-1250 | `WINDOWS1250` | Czech, Hungarian, Polish, Romanian |  |
        | windows-1251 | `WINDOWS1251` | Russian |  |
        | windows-1252 | `WINDOWS1252` | Danish, Dutch, English, French, German, Italian, Norwegian, Portuguese, Swedish |  |
        | windows-1253 | `WINDOWS1253` | Greek |  |
        | windows-1254 | `WINDOWS1254` | Turkish |  |
        | windows-1255 | `WINDOWS1255` | Hebrew |  |
        | windows-1256 | `WINDOWS1256` | Arabic |  |

    Default:
    :   `UTF8`

    > **Note:**
    >
    > Snowflake stores all data internally in the UTF-8 character set. The data is converted into UTF-8 before it is loaded into Snowflake.

### TYPE = JSON

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`DATE_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of date string values in the data files. If a value is not specified or is `AUTO`, the value for the [DATE_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIME_FORMAT = 'string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of time string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIME_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`TIMESTAMP_FORMAT = string' | AUTO`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the format of timestamp string values in the data files. If a value is not specified or is `AUTO`, the value for the [TIMESTAMP_INPUT_FORMAT](../parameters.md) parameter is used.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `AUTO`

`BINARY_FORMAT = HEX | BASE64 | UTF8`
:   Use:
    :   Data loading only

    Definition:
    :   Defines the encoding format for binary string values in the data files. The option can be used when loading data into binary columns in a table.

        This file format option is applied to the following actions only:

        * Loading JSON data into separate columns using the MATCH_BY_COLUMN_NAME copy option.
        * Loading JSON data into separate columns by specifying a query in the COPY statement (i.e. COPY transformation).

    Default:
    :   `HEX`

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`MULTI_LINE = TRUE | FALSE`
:   Use: Data loading and external tables

    Definition:
    :   Boolean that specifies whether multiple lines are allowed. If MULTI_LINE is set to `FALSE` and a new line is present within a JSON record, the record containing the new line will be interpreted as an error.

    Default:
    :   `TRUE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading JSON data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

`FILE_EXTENSION = 'string' | NONE`
:   Use:
    :   Data unloading only

    Definition:
    :   Specifies the extension for files unloaded to a stage. Accepts any extension. The user is responsible for specifying a file extension that can be read by any desired software or services.

    Default:
    :   null, meaning the file extension is determined by the format type: `.json[compression]`, where `compression` is the extension added by the compression method, if `COMPRESSION` is set.

`ENABLE_OCTAL = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that enables parsing of octal numbers.

    Default:
    :   `FALSE`

`ALLOW_DUPLICATE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies to allow duplicate object field names (only the last one will be preserved).

    Default:
    :   `FALSE`

`STRIP_OUTER_ARRAY = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove outer brackets (i.e. `[ ]`).

    Default:
    :   `FALSE`

`STRIP_NULL_VALUES = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that instructs the JSON parser to remove object fields or array elements containing `null` values. For example, when set to `TRUE`:

        | Before | After |
        | --- | --- |
        | `[null]` | `[]` |
        | `[null,null,3]` | `[,,3]` |
        | `{"a":null,"b":null,"c":123}` | `{"c":123}` |
        | `{"a":[1,null,2],"b":{"x":null,"y":88}}` | `{"a":[1,,2],"b":{"y":88}}` |

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip the BOM (byte order mark), if present in a data file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

### TYPE = AVRO

`COMPRESSION = AUTO | GZIP | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`.

> **Note:**
>
> We recommend that you use the default `AUTO` option because it will determine both the file and codec compression. Specifying a compression option refers to the compression of files, not the compression of blocks (codecs).

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Avro data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = ORC

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading and external tables

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Orc data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = PARQUET

`COMPRESSION = AUTO | LZO | SNAPPY | NONE`
:   Use:
    :   Data unloading and external tables

    Definition:

    * When unloading data, specifies the compression algorith for columns in the Parquet files.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically. Supports the following compression algorithms: Brotli, gzip, Lempel-Ziv-Oberhumer (LZO), LZ4, Snappy, or Zstandard v0.8 (and higher). . When unloading data, unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `LZO` | When unloading data, files are compressed using the Snappy algorithm by default. If unloading data to LZO-compressed files, specify this value. |
        | `SNAPPY` | When unloading data, files are compressed using the Snappy algorithm by default. You can optionally specify this value. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`SNAPPY_COMPRESSION = TRUE | FALSE`
:   Use:
    :   Data unloading only

        | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | Unloaded files are compressed using the [Snappy](https://google.github.io/snappy/) compression algorithm by default. |
        | `SNAPPY` | May be specified if unloading Snappy-compressed files. |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Definition:
    :   Boolean that specifies whether unloaded file(s) are compressed using the SNAPPY algorithm.

    > **Note:**
    >
    > Deprecated. Use `COMPRESSION = SNAPPY` instead.

    Limitations:
    :   Only supported for data unloading operations.

    Default:
    :   `TRUE`

`BINARY_AS_TEXT = TRUE | FALSE`
:   Use:
    :   Data loading and external tables

    Definition:
    :   Boolean that specifies whether to interpret columns with no defined logical data type as UTF-8 text. When set to `FALSE`, Snowflake interprets these columns as binary data.

    Default:
    :   `TRUE`

    > **Note:**
    >
    > Snowflake recommends that you set BINARY_AS_TEXT to FALSE to avoid any potential conversion issues.

`TRIM_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to remove leading and trailing white space from strings.

        For example, if your external database software encloses fields in quotes, but inserts a leading space, Snowflake reads the leading space rather than the opening quotation character as the beginning of the
        field (i.e. the quotation marks are interpreted as part of the string of field data). Set this option to `TRUE` to remove undesirable spaces during the data load.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

    Default:
    :   `FALSE`

`USE_LOGICAL_TYPE = TRUE | FALSE`
:   Use:
    :   Data loading, data querying in staged files, and schema detection.

    Definition:
    :   Boolean that specifies whether to use Parquet logical types. With this file format option, Snowflake can interpret Parquet logical types during data loading. For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md). To enable Parquet logical types, set USE_LOGICAL_TYPE as TRUE when you create a new file format option.

    Limitations:
    :   Not supported for data unloading.

`USE_VECTORIZED_SCANNER = TRUE | FALSE`
:   Use:
    :   Data loading and data querying in staged files

    Definition:
    :   Boolean that specifies whether to use a vectorized scanner for loading Parquet files.

    Default:
    :   `FALSE`. In a future BCR, the default value will be `TRUE`.

    Using the vectorized scanner can significantly reduce the latency for loading Parquet files, because this scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

    If `USE_VECTORIZED_SCANNER` is set to `TRUE`, the vectorized scanner has the following behaviors:

    > * The `BINARY_AS_TEXT` option is always treated as `FALSE` and the `USE_LOGICAL_TYPE` option is always treated as `TRUE`, no matter what the actual value is being set to.
    > * The vectorized scanner supports Parquet map types. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >   {
    >   >    "k1": "v1",
    >   >    "k2": "v2"
    >   >   }
    >   > ```
    > * The vectorized scanner shows `NULL` values in the output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "nickname": null,
    >   >   "age": 34,
    >   >   "phone_numbers":
    >   >   [
    >   >     "1234567890",
    >   >     "0987654321",
    >   >     null,
    >   >     "6781234590"
    >   >   ]
    >   >   }
    >   > ```
    > * The vectorized scanner handles Time and Timestamp as follows:
    >
    >   > | Parquet | Snowflake vectorized scanner |
    >   > | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS/NANOS) | TIME |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_LTZ |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS/NANOS) | TIMESTAMP_NTZ |
    >   > | INT96 | TIMESTAMP_LTZ |

    If `USE_VECTORIZED_SCANNER` is set to `FALSE`, the scanner has the following behaviors:

    > * This option does not support Parquet maps. The output of scanning a map type is as follows:
    >
    >   > ```sqlexample
    >   > "my_map":
    >   >  {
    >   >   "key_value":
    >   >   [
    >   >    {
    >   >           "key": "k1",
    >   >           "value": "v1"
    >   >       },
    >   >       {
    >   >           "key": "k2",
    >   >           "value": "v2"
    >   >       }
    >   >     ]
    >   >   }
    >   > ```
    > * This option does not explicitly show `NULL` values in the scan output, as the following example demonstrates:
    >
    >   > ```sqlexample
    >   > "person":
    >   >  {
    >   >   "name": "Adam",
    >   >   "age": 34
    >   >   "phone_numbers":
    >   >   [
    >   >    "1234567890",
    >   >    "0987654321",
    >   >    "6781234590"
    >   >   ]
    >   >  }
    >   > ```
    > * This option handles Time and Timestamp as follows:
    >
    >   > | Parquet | When USE_LOGICAL_TYPE = TRUE | When USE_LOGICAL_TYPE = FALSE |
    >   > | --- | --- | --- |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=MILLIS/MICROS) | TIME | + TIME (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimeType(isAdjustedToUtc=True/False, unit=NANOS) | TIME | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=True, unit=MILLIS/MICROS) | TIMESTAMP_LTZ | TIMESTAMP_NTZ |
    >   > | TimestampType(isAdjustedToUtc=True, unit=NANOS) | TIMESTAMP_LTZ | INTEGER |
    >   > | TimestampType(isAdjustedToUtc=False, unit=MILLIS/MICROS) | TIMESTAMP_NTZ | + TIMESTAMP_LTZ (If ConvertedType present) + INTEGER (If ConvertedType not present) |
    >   > | TimestampType(isAdjustedToUtc=False, unit=NANOS) | TIMESTAMP_NTZ | INTEGER |
    >   > | INT96 | TIMESTAMP_NTZ | TIMESTAMP_NTZ |

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`NULL_IF = ( 'string1' [ , 'string2' , ... ] )`
:   Use:
    :   Data loading only

    Definition:
    :   String used to convert to and from SQL NULL. Snowflake replaces these strings in the data load source with SQL NULL. To
        specify more than one string, enclose the list of strings in parentheses and use commas to separate each value.

        This file format option is applied to the following actions only when loading Parquet data into separate columns using the
        MATCH_BY_COLUMN_NAME copy option.

        Note that Snowflake converts all instances of the value to NULL, regardless of the data type. For example, if `2` is specified as a
        value, all instances of `2` as either a string or number are converted.

        For example:

        `NULL_IF = ('\N', 'NULL', 'NUL', '')`

        Note that this option can include empty strings.

    Default:
    :   `\N` (that is, NULL)

### TYPE = XML

`COMPRESSION = AUTO | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Use:
    :   Data loading only

    Definition:
    :   * When loading data, specifies the current compression algorithm for the data file. Snowflake uses this option to detect how an already-compressed data file was compressed so that the compressed data in the file can be extracted for loading.
        * When unloading data, compresses the data file using the specified compression algorithm.

    Values:
    :   | Supported Values | Notes |
        | --- | --- |
        | `AUTO` | When loading data, compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. When unloading data, files are automatically compressed using the default, which is gzip. |
        | `GZIP` |  |
        | `BZ2` |  |
        | `BROTLI` | Must be specified if loading/unloading Brotli-compressed files. |
        | `ZSTD` | Zstandard v0.8 (and higher) is supported. |
        | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
        | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
        | `NONE` | When loading data, indicates that the files have not been compressed. When unloading data, specifies that the unloaded files are not compressed. |

    Default:
    :   `AUTO`

`IGNORE_UTF8_ERRORS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether UTF-8 encoding errors produce error conditions. It is an alternative syntax for `REPLACE_INVALID_CHARACTERS`.

    Values:
    :   If set to `TRUE`, any invalid UTF-8 sequences are silently replaced with the Unicode character `U+FFFD` (i.e. “replacement character”).

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`PRESERVE_SPACE = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser preserves leading and trailing spaces in element content.

    Default:
    :   `FALSE`

`STRIP_OUTER_ELEMENT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser strips out the outer XML element, exposing 2nd level elements as separate documents.

    Default:
    :   `FALSE`

`DISABLE_AUTO_CONVERT = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether the XML parser disables automatic conversion of numeric and Boolean values from text to native representation.

    Default:
    :   `FALSE`

`REPLACE_INVALID_CHARACTERS = TRUE | FALSE`
:   Use:
    :   Data loading and external table

    Definition:
    :   Boolean that specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (`�`). This
        option performs a one-to-one character replacement.

    Values:
    :   If set to `TRUE`, Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

        If set to `FALSE`, the load operation produces an error when invalid UTF-8 character encoding is detected.

    Default:
    :   `FALSE`

`SKIP_BYTE_ORDER_MARK = TRUE | FALSE`
:   Use:
    :   Data loading only

    Definition:
    :   Boolean that specifies whether to skip any BOM (byte order mark) present in an input file. A BOM is a character code at the beginning of a data file that defines the byte order and encoding form.

        If set to `FALSE`, Snowflake recognizes any BOM in data files, which could result in the BOM either causing an error or being merged into the first column in the table.

    Default:
    :   `TRUE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Storage integration | Required only if accessing a cloud storage service using a [storage integration](create-storage-integration.md). |
| CREATE STAGE | Schema | Required only if creating a permanent stage. |
| OWNERSHIP | Stage | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER STAGE statement for an *existing* stage.   OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege).  Note that in a [managed access schema](../../user-guide/security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

> **Important:**
>
> If you require Tri-Secret Secure for security compliance, use the `SNOWFLAKE_FULL` encryption type for internal stages.
> `SNOWFLAKE_SSE` does not support Tri-Secret Secure.

> **Caution:**
>
> Recreating a stage (using CREATE OR REPLACE STAGE) has the following additional, potentially undesirable, outcomes:
>
> * The existing directory table for the stage, if any, is dropped. If the stage is recreated with a directory table, the directory is
>   empty by default.
> * The association breaks between the stage and any external table that references it.
>
>   This is because an external table links to a stage using a hidden ID rather than the name of the stage. Behind the scenes, the CREATE OR
>   REPLACE syntax drops an object and recreates it with a different hidden ID.
>
>   If you must recreate a stage after it has been linked to one or more external tables, you must recreate each of the external tables
>   (using CREATE OR REPLACE EXTERNAL TABLE) to reestablish the association. Call the [GET_DDL](../functions/get_ddl.md) function to
>   retrieve a DDL statement to recreate each of the external tables.
> * Any pipes that reference the stage stop loading data. The execution status of the pipes changes to `STOPPED_STAGE_DROPPED`. To
>   resume loading data, these pipe objects must be recreated (using the CREATE OR REPLACE PIPE syntax).

* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE STAGE doesn’t check whether the specified URL or credentials are valid. If the credentials aren’t valid, when you attempt to
  use the stage, the system returns an error.
* Snowflake uses multipart uploads when uploading to Amazon S3 and Google Cloud Storage.
  This process might leave incomplete uploads in the storage location for your external stage.

  To prevent incomplete uploads from accumulating, we recommend that you set a lifecycle rule.
  For instructions, see the [Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/mpu-abort-incomplete-mpu-lifecycle-config.html)
  or [Google Cloud Storage](https://cloud.google.com/storage/docs/lifecycle#abort-mpu) documentation.
* For external stages that use an S3 access point:

  + If you’re using a storage integration, you must configure the IAM policy for the integration
    to grant permission to your S3 access point. For more information, see [Option 1: Configure a Snowflake storage integration to access Amazon S3](../../user-guide/data-load-s3-config-storage-integration.md).
  + Multi-region access points aren’t supported.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER STAGE usage notes

**Limitations**

* All limitations of the [ALTER STAGE](alter-stage.md) command apply.
* The CREATE OR ALTER STAGE command only accepts and handles properties that are compatible with the current type of Stage (internal or
  external). Properties incompatible with internal Stages cannot be used in a CREATE OR ALTER STAGE command on an internal Stage.
* The CREATE OR ALTER STAGE command cannot change the storage provider type of an external Stage.
* Setting or unsetting a tag is not supported; however existing tags are not altered by a CREATE OR ALTER STAGE statement and remain unchanged.

**Properties**

* The absence of a property that was previously set in the Stage definition results in resetting it to the default value.

**Directory table options**

* The CREATE OR ALTER STAGE command does not support the REFRESH_ON_CREATE option.
* The CREATE OR ALTER STAGE command does not support refreshing directory tables.

  + Newly created directory tables will not be refreshed.
  + To refresh a directory table use [ALTER REFRESH](alter-stage.md).

## Examples

### Basic examples

#### Internal stages

Create an internal stage and specify server-side encryption for the stage:

```sqlexample
CREATE STAGE my_int_stage
  ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
```

Create a temporary internal stage with all the same properties as the previous example:

```sqlexample
CREATE TEMPORARY STAGE my_temp_int_stage;
```

Create a temporary internal stage that references a file format named `my_csv_format` (created using [CREATE FILE FORMAT](create-file-format.md)):

```sqlexample
CREATE TEMPORARY STAGE my_int_stage
  FILE_FORMAT = my_csv_format;
```

When you reference the stage in a [COPY INTO <table>](copy-into-table.md) statement, the file format options are automatically set.

Create an internal stage that includes a [directory table](../../user-guide/data-load-dirtables.md). The stage references a file format named `myformat`:

```sqlexample
CREATE STAGE mystage
  DIRECTORY = (ENABLE = TRUE)
  FILE_FORMAT = myformat;
```

#### External stages

**Amazon S3**

> In the examples below, if the S3 bucket is in a region in China, use the `s3china://` protocol for the URL parameter.
>
> Create an external stage using a private/protected S3 bucket named `load` with a folder path named `files`. Secure
> access to the S3 bucket is provided via the `myint` storage integration:
>
> ```sqlexample
> CREATE STAGE my_ext_stage
>   URL='s3://load/files/'
>   STORAGE_INTEGRATION = myint;
> ```
>
> Create an external stage using a private/protected S3 bucket named `load` with a folder path named `files`. The
> Snowflake access permissions for the S3 bucket are associated with an IAM user; therefore, IAM credentials are required:
>
> ```sqlexample
> CREATE STAGE my_ext_stage1
>   URL='s3://load/files/'
>   CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z');
> ```
>
> Note that the AWS_KEY_ID and AWS_SECRET_KEY values used in this example are for illustration purposes only.
>
> Create an external stage using an S3 bucket named `load` with a folder path named `encrypted_files` and client-side
> encryption (default encryption type) with the master key to decrypt/encrypt files stored in the bucket:
>
> ```sqlexample
> CREATE STAGE my_ext_stage2
>   URL='s3://load/encrypted_files/'
>   CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z')
>   ENCRYPTION=(MASTER_KEY = 'eSx...');
> ```
>
> Create an external stage using an S3 bucket named `load` with a folder path named `encrypted_files` and AWS_SSE_KMS
> server-side encryption with the ID for the master key to decrypt/encrypt files stored in the bucket:
>
> ```sqlexample
> CREATE STAGE my_ext_stage3
>   URL='s3://load/encrypted_files/'
>   CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z')
>   ENCRYPTION=(TYPE='AWS_SSE_KMS' KMS_KEY_ID = 'aws/key');
> ```
>
> Same example as the immediately preceding example, except that the Snowflake access permissions for the S3 bucket as associated
> with an IAM role instead of an IAM user. Note that credentials are handled separately from other stage parameters such as
> `ENCRYPTION`. Support for these other parameters is the same regardless of the credentials used to access your external
> S3 bucket:
>
> ```sqlexample
> CREATE STAGE my_ext_stage3
>   URL='s3://load/encrypted_files/'
>   CREDENTIALS=(AWS_ROLE='arn:aws:iam::001234567890:role/mysnowflakerole')
>   ENCRYPTION=(TYPE='AWS_SSE_KMS' KMS_KEY_ID = 'aws/key');
> ```
>
> Create a stage with a directory table in the active schema for the user session. The cloud storage URL includes the path `files`.
> The stage references a storage integration named `my_storage_int`:
>
> ```sqlexample
> CREATE STAGE mystage
>   URL='s3://load/files/'
>   STORAGE_INTEGRATION = my_storage_int
>   DIRECTORY = (
>     ENABLE = true
>     AUTO_REFRESH = true
>   );
> ```

**Google Cloud Storage**

> Create an external stage using a private/protected GCS bucket named `load` with a folder path named `files`. Secure
> access to the GCS bucket is provided via the `myint` storage integration:
>
> ```sqlexample
> CREATE STAGE my_ext_stage
>   URL='gcs://load/files/'
>   STORAGE_INTEGRATION = myint;
> ```
>
> Create a stage named `mystage` with a directory table in the active schema for the user session. The cloud storage URL
> includes the path `files`. The stage references a storage integration named `my_storage_int`:
>
> ```sqlexample
> CREATE STAGE mystage
>   URL='gcs://load/files/'
>   STORAGE_INTEGRATION = my_storage_int
>   DIRECTORY = (
>     ENABLE = true
>     AUTO_REFRESH = true
>     NOTIFICATION_INTEGRATION = 'MY_NOTIFICATION_INT'
>   );
> ```
>
> Create an external stage using an S3 bucket named `load` with a folder path named `encrypted_files` and client-side
> encryption (default encryption type) with the master key to decrypt/encrypt files stored in the bucket:
>
> ```sqlexample
> CREATE STAGE my_ext_stage2
>   URL='gcs://load/encrypted_files/'
>   STORAGE_INTEGRATION = my_storage_int
>   ENCRYPTION=(TYPE = 'GCS_SSE_KMS' KMS_KEY_ID = '{a1b2c3});
> ```

**Microsoft Azure**

> Create an external stage using a private/protected Azure container named `load` with a folder path named `files`.
> Secure access to the container is provided via the `myint` storage integration:
>
> ```sqlexample
> CREATE STAGE my_ext_stage
>   URL='azure://myaccount.blob.core.windows.net/load/files/'
>   STORAGE_INTEGRATION = myint;
> ```
>
> Create an external stage using an Azure storage account named `myaccount` and a container named `mycontainer` with
> a folder path named `files` and client-side encryption enabled:
>
> ```sqlexample
> CREATE STAGE mystage
>   URL='azure://myaccount.blob.core.windows.net/mycontainer/files/'
>   CREDENTIALS=(AZURE_SAS_TOKEN='?sv=2016-05-31&ss=b&srt=sco&sp=rwdl&se=2018-06-27T10:05:50Z&st=2017-06-27T02:05:50Z&spr=https,http&sig=bgqQwoXwxzuD2GJfagRg7VOS8hzNr3QLT7rhS8OFRLQ%3D')
>   ENCRYPTION=(TYPE='AZURE_CSE' MASTER_KEY = 'kPx...');
> ```
>
> (The `AZURE_SAS_TOKEN` and `MASTER_KEY` values used in this example are not actual values; they are provided for
> illustration purposes only.)
>
> Create a stage with a directory table in the active schema for the user session. The cloud storage URL includes the path `files`.
> The stage references a storage integration named `my_storage_int`:
>
> ```sqlexample
> CREATE STAGE mystage
>   URL='azure://myaccount.blob.core.windows.net/load/files/'
>   STORAGE_INTEGRATION = my_storage_int
>   DIRECTORY = (
>     ENABLE = true
>     AUTO_REFRESH = true
>     NOTIFICATION_INTEGRATION = 'MY_NOTIFICATION_INT'
>   );
> ```

### CREATE OR ALTER STAGE examples

#### Internal stage

Create an internal stage with a comment:

```sqlexample
CREATE OR ALTER STAGE my_int_stage
  COMMENT='my_comment'
  ;
```

Alter the internal stage to create a directory table and remove the comment:

```sqlexample
CREATE OR ALTER STAGE my_int_stage
  DIRECTORY=(ENABLE=true);
```

#### External stage

Create an external stage using an s3 bucket with credentials:

```sqlexample
CREATE OR ALTER STAGE my_ext_stage
  URL='s3://load/files/'
  CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z');
```

Alter the external stage to create a directory table:

```sqlexample
CREATE OR ALTER STAGE my_ext_stage
  URL='s3://load/files/'
  CREDENTIALS=(AWS_KEY_ID='1a2b3c' AWS_SECRET_KEY='4x5y6z')
  DIRECTORY=(ENABLE=true);
```

---
title: CREATE STORAGE INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/create-storage-integration.md
section: SQL Commands
---

# CREATE STORAGE INTEGRATION

Creates a new storage integration in the account or replaces an existing integration.

A storage integration is a Snowflake object that stores a generated identity and access management (IAM) entity for your external
cloud storage, along with an optional set of allowed or blocked storage locations (Amazon S3, Google Cloud Storage, or Microsoft Azure).
Cloud provider administrators in your organization grant permissions on the storage locations to the generated entity. This option
allows users to avoid supplying credentials when creating stages or when loading or unloading data.

A single storage integration can support multiple external stages. The URL in the stage definition must align with the storage location
specified for the STORAGE_ALLOWED_LOCATIONS parameter.

> **Note:**
>
> * If your cloud storage is located on a different cloud platform from your Snowflake
>   account, the storage location must be in the public cloud and not a virtual private environment.
>
>   Snowflake charges a per-byte fee when you unload data from Snowflake into an external stage in a different
>   [region](../../user-guide/intro-regions.md) or different cloud provider. For details, see the
>   [pricing page](https://www.snowflake.com/pricing/).
> * Accessing cloud storage in a [government region](../../user-guide/intro-regions.md) using a storage integration is limited to Snowflake
>   accounts hosted in the same government region.
>
>   Similarly, if you need to access cloud storage in a region in China, you can use a storage integration only from a Snowflake
>   account hosted in the same region in China.
>
>   In these cases, use the CREDENTIALS parameter in the [CREATE STAGE](create-stage.md) command (rather than using a storage
>   integration) to provide the credentials for authentication.

See also:
:   [ALTER STORAGE INTEGRATION](alter-storage-integration.md) , [DROP INTEGRATION](drop-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] STORAGE INTEGRATION [IF NOT EXISTS]
  <name>
  TYPE = { EXTERNAL_STAGE | POSTGRES_EXTERNAL_STORAGE }
  cloudProviderParams
  ENABLED = { TRUE | FALSE }
  STORAGE_ALLOWED_LOCATIONS = ('<cloud>://<bucket>/<path>/' [ , '<cloud>://<bucket>/<path>/' ... ] )
  [ STORAGE_BLOCKED_LOCATIONS = ('<cloud>://<bucket>/<path>/' [ , '<cloud>://<bucket>/<path>/' ... ] ) ]
  [ COMMENT = '<string_literal>' ]
```

Where:

> ```sqlsyntax
> cloudProviderParams (for Amazon S3) ::=
>   STORAGE_PROVIDER = 'S3'
>   STORAGE_AWS_ROLE_ARN = '<iam_role>'
>   [ STORAGE_AWS_EXTERNAL_ID = '<external_id>' ]
>   [ STORAGE_AWS_OBJECT_ACL = 'bucket-owner-full-control' ]
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Google Cloud Storage) ::=
>   STORAGE_PROVIDER = 'GCS'
> ```
>
> ```sqlsyntax
> cloudProviderParams (for Microsoft Azure) ::=
>   STORAGE_PROVIDER = 'AZURE'
>   AZURE_TENANT_ID = '<tenant_id>'
>   [ USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE } ]
> ```

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the integration; must be unique in your account.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`TYPE = { EXTERNAL_STAGE | POSTGRES_EXTERNAL_STORAGE }`
:   Specify the type of integration:

    * `EXTERNAL_STAGE`: Creates an interface between Snowflake and an external cloud storage location.
    * `POSTGRES_EXTERNAL_STORAGE`: Creates a storage integration for use with
      [Snowflake Postgres](../../user-guide/snowflake-postgres/postgres-pg_lake.md).
      Only one storage location is allowed for this type of integration.

      [Preview Feature](../../release-notes/preview-features.md) — Open

      Available to all accounts.
      Currently, this feature is only available on Amazon Web Services (AWS).

`ENABLED = { TRUE | FALSE }`
:   Specifies whether this storage integration is available for usage in stages.

    > * `TRUE` allows users to create new stages that reference this integration. Existing stages that reference this integration
    >   function normally.
    > * `FALSE` prevents users from creating new stages that reference this integration. Existing stages that reference this integration
    >   cannot access the storage location in the stage definition.

    The value is case-insensitive.

    The default is `TRUE`.

`STORAGE_ALLOWED_LOCATIONS = ( 'cloud_specific_url' )`
:   Explicitly limits external stages that use the integration to reference one or more storage locations (i.e. S3 bucket, GCS bucket, or
    Azure container). Supports a comma-separated list of URLs for existing buckets and, optionally, paths used to store data files for
    loading/unloading. Alternatively supports the `*` wildcard, meaning “allow access to all buckets and/or paths”.

    **Amazon S3**

    > `STORAGE_ALLOWED_LOCATIONS = ( 'protocol://bucket/path/' [ , 'protocol://bucket/path/' ... ]  )`
    >
    > > * `protocol` is one of the following:
    > >
    > >   + `s3` refers to S3 storage in public AWS regions outside of China.
    > >   + `s3china` refers to S3 storage in public AWS regions in China.
    > >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    > > * `bucket` is the name of an S3 bucket that stores your data files (e.g. `mybucket`).
    > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with
    > >   a common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud
    > >   storage services.

    **Google Cloud Storage**

    > `STORAGE_ALLOWED_LOCATIONS = ( 'gcs://bucket/path/' [ , 'gcs://bucket/path/' ... ] )`
    >
    > > * `bucket` is the name of a GCS bucket that stores your data files (e.g. `mybucket`).
    > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with
    > >   a common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud
    > >   storage services.

    **Microsoft Azure**

    > `STORAGE_ALLOWED_LOCATIONS = ( 'azure://account.blob.core.windows.net/container/path/' [ , 'azure://account.blob.core.windows.net/container/path/' ... ] )`
    >
    > > * `account` is the name of the Azure storage account (e.g. `myaccount`). Use the `blob.core.windows.net` endpoint
    > >   for all supported types of Azure blob storage accounts, including Data Lake Storage Gen2.
    > > * `container` is the name of a Azure blob storage container that stores your data files (e.g. `mycontainer`).
    > > * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with
    > >   a common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud
    > >   storage services.

    **Microsoft Fabric OneLake**

    > `STORAGE_ALLOWED_LOCATIONS = ( 'azure://onelake.blob.fabric.microsoft.com/workspace_id/item_id/Files/path/' [ , ... ] )`
    >
    > > * `onelake.blob.fabric.microsoft.com` is the global service root for OneLake. This single endpoint automatically routes
    > >   requests to the correct geographical region where your data resides.
    > > * `workspace_id` is the unique 128-bit GUID of the Fabric Workspace; for example, `aab1c234-567d-8901-234e-fgh56789ij`.
    > > * `item_id` is the unique GUID of the specific Fabric item, such as a Lakehouse or Warehouse.
    > > * `Files` is the mandatory path segment for Lakehouse items. This segment points to the unmanaged section of the lake where
    > >   you store raw data such as CSV, Parquet, or JSON.
    > > * `path` is an optional case-sensitive path to a specific folder or file prefix. Although optional, providing a path is
    > >   recommended when loading specific datasets to improve performance and prevent accidental processing of unrelated files.

## Optional parameters

`STORAGE_BLOCKED_LOCATIONS = ( 'cloud_specific_url' )`
:   Explicitly prohibits external stages that use the integration from referencing one or more storage locations (i.e. S3 buckets or
    GCS buckets). Supports a comma-separated list of URLs for existing storage locations and, optionally, paths used to store data files
    for loading/unloading. Commonly used when STORAGE_ALLOWED_LOCATIONS is set to the `*` wildcard, allowing access to all buckets
    in your account except for blocked storage locations and, optionally, paths.

    > **Note:**
    >
    > Make sure to enclose only individual cloud storage location URLs in quotes. If you enclose the entire
    > `STORAGE_BLOCKED_LOCATIONS` value in quotes, the value is invalid. As a result, the `STORAGE_BLOCKED_LOCATIONS`
    > parameter setting is ignored when users create stages that reference the storage integration.

    **Amazon S3**

    > `STORAGE_BLOCKED_LOCATIONS = ( 'protocol://bucket/path/' [ , 'protocol://bucket/path/' ... ]  )`
    >
    > > * `protocol` is one of the following:
    > >
    > >   + `s3` refers to S3 storage in public AWS regions outside of China.
    > >   + `s3china` refers to S3 storage in public AWS regions in China.
    > >   + `s3gov` refers to S3 storage in [government regions](../../user-guide/intro-regions.md).
    > > * `bucket` is the name of an S3 bucket that stores your data files (e.g. `mybucket`).
    > > * `path` is an optional path (or *directory*) in the bucket that further limits access to the data files.

    **Google Cloud Storage**

    > `STORAGE_BLOCKED_LOCATIONS = ( 'gcs://bucket/path/' [ , 'gcs://bucket/path/' ... ] )`
    >
    > > * `bucket` is the name of a GCS bucket that stores your data files (e.g. `mybucket`).
    > > * `path` is an optional path (or *directory*) in the bucket that further limits access to the data files.

    **Microsoft Azure**

    > `STORAGE_BLOCKED_LOCATIONS = ( 'azure://account.blob.core.windows.net/container/path/' [ , 'azure://account.blob.core.windows.net/container/path/' ... ] )`
    >
    > > * `account` is the name of the Azure storage account (e.g. `myaccount`).
    > > * `container` is the name of a Azure blob storage container that stores your data files (e.g. `mycontainer`).
    > > * `path` is an optional path (or *directory*) in the bucket that further limits access to the data files.

    **Microsoft Fabric OneLake**

    > `STORAGE_BLOCKED_LOCATIONS = ( 'azure://onelake.blob.fabric.microsoft.com/workspace_id/item_id/Files/path/' [ , ... ] )`
    >
    > > * `workspace_id` is the unique 128-bit GUID of the Fabric Workspace.
    > > * `item_id` is the unique GUID of the specific Fabric item.
    > > * `Files` is the path segment for Lakehouse items.
    > > * `path` is an optional path that further limits the blocked location.

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the integration.

    Default: No value

## Cloud provider parameters (`cloudProviderParams`)

**Amazon S3**

> `STORAGE_PROVIDER = '{ S3 | S3CHINA | S3GOV }'`
> :   Specifies the cloud storage provider that stores your data files:
>
>     * `'S3'`: S3 storage in public AWS regions outside of China.
>     * `'S3CHINA'`: S3 storage in public AWS regions in China.
>     * `'S3GOV'`: S3 storage in AWS government regions.
>
> `STORAGE_AWS_ROLE_ARN = 'iam_role'`
> :   Specifies the Amazon Resource Name (ARN) of the AWS identity and access management (IAM) role that grants privileges on the S3 bucket
>     containing your data files. For more information, see [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md).

> `STORAGE_AWS_EXTERNAL_ID = 'external_id'`
> :   Optionally specifies an external ID that Snowflake uses to establish a trust relationship with AWS.
>     You must specify the same external ID in the trust policy of the IAM role
>     that you configured for this storage integration. For more information,
>     see [How to use an external ID when granting access to your AWS resources to a third party](https://docs.aws.amazon.com/IAM/latest/UserGuide/id_roles_create_for-user_externalid.html).
>
>     If you don’t specify a value for this parameter,
>     Snowflake automatically generates an external ID when you create the storage integration.
>
> `STORAGE_AWS_OBJECT_ACL = 'bucket-owner-full-control'`
> :   Enables support for AWS access control lists (ACLs) to grant the bucket owner full control. Files created in Amazon S3 buckets from
>     unloaded table data are owned by an AWS Identity and Access Management (IAM) role. ACLs support the use case where IAM roles in one
>     AWS account are configured to access S3 buckets in one or more other AWS accounts. Without ACL support, users in the bucket-owner
>     accounts could not access the data files unloaded to an external (S3) stage using a storage integration.
>
>     When users unload Snowflake table data to data files in an S3 stage using [COPY INTO <location>](copy-into-location.md), the unload
>     operation applies an ACL to the unloaded data files. The data files apply the `"s3:x-amz-acl":"bucket-owner-full-control"`
>     privilege to the files, granting the S3 bucket owner full control over them.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter, see
>     [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md).

**Google Cloud Storage**

> `STORAGE_PROVIDER = 'GCS'`
> :   Specifies the cloud storage provider that stores your data files.

**Microsoft Azure**

> `STORAGE_PROVIDER = 'AZURE'`
> :   Specifies the cloud storage provider that stores your data files.
>
> `AZURE_TENANT_ID = 'tenant_id'`
> :   Specifies the ID for your Office 365 tenant that the allowed and blocked storage accounts belong to. A storage integration can
>     authenticate to only one tenant, and so the allowed and blocked storage locations must refer to storage accounts that all belong
>     this tenant.
>
>     To find your tenant ID, log into the Azure portal and click Azure Active Directory » Properties. The tenant ID
>     is displayed in the Tenant ID field.
>
> `USE_PRIVATELINK_ENDPOINT = { TRUE | FALSE }`
> :   Specifies whether to use outbound private connectivity to harden your security posture. For information about using this parameter,
>     see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

**Microsoft Fabric OneLake**

> `STORAGE_PROVIDER = 'AZURE'`
> :   Specifies the cloud storage provider. Use `'AZURE'` for Microsoft Fabric OneLake storage.
>
> `AZURE_TENANT_ID = 'tenant_id'`
> :   Specifies the ID for your Microsoft Entra ID (formerly Azure Active Directory) tenant that the Fabric Workspace belongs to.
>
> > **Note:**
> >
> > Private connectivity endpoints (USE_PRIVATELINK_ENDPOINT) aren’t supported for Microsoft Fabric OneLake storage locations.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE INTEGRATION | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

> **Caution:**
>
> Recreating a storage integration (using CREATE OR REPLACE STORAGE INTEGRATION) breaks the association between the storage integration
> and any stage that references it. This is because a stage links to a storage integration using a hidden ID rather than the name of the
> storage integration. Behind the scenes, the CREATE OR REPLACE syntax drops the object and recreates it with a different hidden ID.
>
> If you must recreate a storage integration after it has been linked to one or more stages, you must reestablish the association between
> each stage and the storage integration by executing [ALTER STAGE](alter-stage.md) `stage_name` SET STORAGE_INTEGRATION =
> `storage_integration_name`, where:
>
> * `stage_name` is the name of the stage.
> * `storage_integration_name` is the name of the storage integration.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

The following example creates an integration that explicitly limits external stages that use the integration to reference either of
two buckets and paths:

**Amazon S3**

> ```sqlexample
> CREATE STORAGE INTEGRATION s3_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'S3'
>   STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('s3://mybucket1/path1/', 's3://mybucket2/path2/');
> ```
>
> If the S3 storage is in a public AWS region in China, use `'S3CHINA'` for the STORAGE_PROVIDER parameter and
> `s3china://` protocol in STORAGE_ALLOWED_LOCATIONS.

**Google Cloud Storage**

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('gcs://mybucket1/path1/', 'gcs://mybucket2/path2/');
> ```

**Microsoft Azure**

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = '<tenant_id>'
>   STORAGE_ALLOWED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer/path1/', 'azure://myaccount.blob.core.windows.net/mycontainer/path2/');
> ```

The following example creates an integration that allows external stages that use the integration to reference any bucket and
path in your account except for those that are explicitly blocked:

**Amazon S3**

> ```sqlexample
> CREATE STORAGE INTEGRATION s3_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'S3'
>   STORAGE_AWS_ROLE_ARN = 'arn:aws:iam::001234567890:role/myrole'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('*')
>   STORAGE_BLOCKED_LOCATIONS = ('s3://mybucket3/path3/', 's3://mybucket4/path4/');
> ```
>
> If the S3 storage is in a public AWS region in China, use `'S3CHINA'` for the STORAGE_PROVIDER parameter and
> `s3china://` protocol in STORAGE_BLOCKED_LOCATIONS.

**Google Cloud Storage**

> ```sqlexample
> CREATE STORAGE INTEGRATION gcs_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'GCS'
>   ENABLED = TRUE
>   STORAGE_ALLOWED_LOCATIONS = ('*')
>   STORAGE_BLOCKED_LOCATIONS = ('gcs://mybucket3/path3/', 'gcs://mybucket4/path4/');
> ```

**Microsoft Azure**

> ```sqlexample
> CREATE STORAGE INTEGRATION azure_int
>   TYPE = EXTERNAL_STAGE
>   STORAGE_PROVIDER = 'AZURE'
>   ENABLED = TRUE
>   AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
>   STORAGE_ALLOWED_LOCATIONS = ('*')
>   STORAGE_BLOCKED_LOCATIONS = ('azure://myaccount.blob.core.windows.net/mycontainer/path3/', 'azure://myaccount.blob.core.windows.net/mycontainer/path4/');
> ```

---
title: CREATE STORAGE LIFECYCLE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/create-storage-lifecycle-policy.md
section: SQL Commands
---

# CREATE STORAGE LIFECYCLE POLICY

Creates a new [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md) in the current or specified schema, or replaces
an existing policy.
The policy runs an expression on arguments that you specify to determine which rows to expire in the table that the policy is attached to.
The arguments in a policy refer to columns in your tables.

After you create a policy, use the [ALTER TABLE](alter-table.md) command to add the policy to a table.

See also:
:   [ALTER STORAGE LIFECYCLE POLICY](alter-storage-lifecycle-policy.md) , [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) , [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md) , [SHOW STORAGE LIFECYCLE POLICIES](show-storage-lifecycle-policies.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] STORAGE LIFECYCLE POLICY [ IF NOT EXISTS ] <name>
  AS ( <arg_name> <arg_type> [ , ... ] )
  RETURNS BOOLEAN -> <body>
  [ ARCHIVE_TIER = { COOL | COLD } ]
  [ ARCHIVE_FOR_DAYS = <number_of_days> ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

## Required parameters

`name`
:   String that specifies the identifier for the storage lifecycle policy. This must be unique for the schema.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`AS ( arg_name arg_type [ , ... ] )`
:   The signature for the policy. You must include at least one argument in the signature.

    A signature specifies a set of attributes that must be considered to determine whether the row is ready for expiration. The attribute
    values come from the database object (table).

`RETURNS BOOLEAN -> body`
:   A storage lifecycle policy must evaluate to true or false. A user that queries a table protected by a storage lifecycle policy sees rows in the output
    based on how the `body` is written.

    `body`
    :   SQL expression that Snowflake uses to determine which rows to expire.

        To transform the data, you can use built-in functions such as [Conditional expression functions](../expressions-conditional.md) or
        [user-defined functions](../../developer-guide/udf/udf-overview.md) (UDFs).

        > **Note:**
        >
        > Currently, only SQL and JavaScript UDFs are supported in the body of a storage lifecycle policy.

## Optional parameters

`ARCHIVE_TIER = { COOL | COLD }`
:   Specifies the type of storage tier to use for archiving rows. After you set the ARCHIVE_TIER for a policy, you can’t modify it.
    For more information, see [Archive storage tiers](../../user-guide/storage-management/storage-lifecycle-policies.md).

    If you don’t specify this parameter, the policy is an expiration policy that deletes rows without archiving them.

    * `COOL` requires that you set an archival period (ARCHIVE_FOR_DAYS) of 90 days or longer to enable archiving.
    * `COLD` requires that you set an archival period (ARCHIVE_FOR_DAYS) of 180 days or longer to enable archiving.

    For supported cloud providers, see [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md).

    Default: No value

`ARCHIVE_FOR_DAYS = number_of_days`
:   Specifies the number of days to keep rows that match the policy expression in archive storage.
    If set, Snowflake moves the data into archive storage according
    to the value you select for ARCHIVE_TIER. If unset, Snowflake expires the rows from the table without archiving the data.

    Values:

    * ARCHIVE_TIER = COOL: `90` - `2147483647`
    * ARCHIVE_TIER = COLD: `180` - `2147483647`

    Default: Unset

`COMMENT = 'string_literal'`
:   Specifies a comment for the storage lifecycle policy.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| [CREATE STORAGE LIFECYCLE POLICY](../../user-guide/security-access-control-privileges.md) | Schema | None |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

  + If you specify OR REPLACE and the policy is attached to any objects, the command fails.
  + You can’t use `OR REPLACE` and `IF NOT EXISTS` together for this command.
  + If you want to replace an existing storage lifecycle policy and need to see the current definition of the policy, call the
    [GET_DDL](../functions/get_ddl.md) function or run the [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) command.
* Including one or more [subqueries](../../user-guide/querying-subqueries.md) in the policy body might cause errors. When possible, limit the
  number of subqueries, limit the number of JOIN operations, and simplify WHERE clause conditions.
* You cannot change the policy signature if the policy is attached to a table. If you need to change the signature, use the
  [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md) command and create a new policy.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

The following lifecycle policy moves data from rows that correspond to closed accounts and are more than 60 days old into archive
storage (COOL tier).

```sqlexample
CREATE STORAGE LIFECYCLE POLICY example_policy
  AS (event_ts TIMESTAMP, account_id NUMBER)
  RETURNS BOOLEAN ->
    event_ts < DATEADD(DAY, -60, CURRENT_TIMESTAMP())
    AND EXISTS (
      SELECT 1 FROM closed_accounts
      WHERE id = account_id
    )
  ARCHIVE_TIER = COOL
  ARCHIVE_FOR_DAYS = 180;
```

---
title: CREATE STREAM
source: https://docs.snowflake.com/en/sql-reference/sql/create-stream.md
section: SQL Commands
---

# CREATE STREAM

Creates a new stream in the current/specified schema or replaces an existing [stream](../../user-guide/streams-intro.md). A stream records data
manipulation language (DML) changes made to a table, directory table, dynamic table, external table, or the underlying tables in a view (including
secure views). The object for which changes are recorded is called the *source object*.

In addition, this command supports the following variant:

* CREATE STREAM … CLONE (creates a clone of an existing stream)

See also:
:   [ALTER STREAM](alter-stream.md) , [DROP STREAM](drop-stream.md) , [SHOW STREAMS](show-streams.md) , [DESCRIBE STREAM](desc-stream.md)

## Syntax

The command syntax differs depending on the object on which the stream is created:

```sqlsyntax
-- table
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON TABLE <table_name>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> | STREAM => '<name>' } ) ]
  [ APPEND_ONLY = TRUE | FALSE ]
  [ SHOW_INITIAL_ROWS = TRUE | FALSE ]
  [ COMMENT = '<string_literal>' ]

-- Event table
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON EVENT TABLE <table_name>
  [ COMMENT = '<string_literal>' ]

-- External table
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON EXTERNAL TABLE <external_table_name>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> | STREAM => '<name>' } ) ]
  [ INSERT_ONLY = TRUE ]
  [ COMMENT = '<string_literal>' ]

-- Directory table
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON STAGE <stage_name>
  [ COMMENT = '<string_literal>' ]

-- Dynamic table
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON DYNAMIC TABLE <table_name>
  [ COMMENT = '<string_literal>' ]

-- View
CREATE [ OR REPLACE ] STREAM [IF NOT EXISTS]
  <name>
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ COPY GRANTS ]
  ON VIEW <view_name>
  [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> | STREAM => '<name>' } ) ]
  [ APPEND_ONLY = TRUE | FALSE ]
  [ SHOW_INITIAL_ROWS = TRUE | FALSE ]
  [ COMMENT = '<string_literal>' ]
```

## Variant syntax

**CREATE STREAM … CLONE**

Creates a new stream with the same definition as the source stream. The clone inherits the current *offset* (i.e. the current
transactional [table version](../../user-guide/streams-intro.md)) from the source stream.

> ```sqlsyntax
> CREATE [ OR REPLACE ] STREAM <name> CLONE <source_stream>
>   [ COPY GRANTS ]
>   [ ... ]
> ```

For more information about cloning, see [CREATE <object> … CLONE](create-clone.md).

## Required parameters

`name`
:   String that specifies the identifier (i.e. name) for the stream; must be unique for the schema in which the stream is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`table_name`
:   String that specifies the identifier (i.e. name) for the table whose changes are tracked by the stream (i.e. the source table).

    Access control:
    :   To query a stream, a role must have the SELECT privilege on the underlying table.

`external_table_name`
:   String that specifies the identifier (i.e. name) for the external table whose changes are tracked by the stream (i.e. the source
    external table).

    Access control:
    :   To query a stream, a role must have the SELECT privilege on the underlying external table.

`stage_name`
:   String that specifies the identifier (i.e. name) for the stage whose directory table changes are tracked by the stream (i.e. the
    source directory table).

    Access control:
    :   To query a stream, a role must have the USAGE (external stage) or READ (internal stage) privilege on the underlying
        stage.

`view_name`
:   String that specifies the identifier (i.e. name) for the source view. The stream tracks DML changes to the underlying tables in
    the view.

    For more information about streams on views, see [Streams on views](../../user-guide/streams-intro.md).

    Access control:
    :   To query a stream, a role must have the SELECT privilege on the view.

## Optional parameters

`TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`COPY GRANTS`
:   Specifies to retain the access permissions from the original stream when a new stream is created using any of the following
    CREATE STREAM variants:

    > * CREATE OR REPLACE STREAM
    > * CREATE STREAM … CLONE

    The parameter copies all permissions, except OWNERSHIP, from the existing stream to the new stream. By default, the role
    that executes the CREATE STREAM command owns the new stream.

    > **Note:**
    >
    > * If the CREATE STREAM statement references more than one stream (e.g. `create or replace stream t1 clone t2;`), the
    >   `COPY GRANTS` clause gives precedence to the stream being replaced.
    > * The [SHOW GRANTS](show-grants.md) output for the replacement stream lists the grantee for the copied privileges as the
    >   role that executed the CREATE STREAM statement, with the current timestamp when the statement was executed.
    > * The operation to copy grants occurs atomically in the CREATE STREAM command (i.e. within the same transaction).

    > **Note:**
    >
    > This parameter is not supported currently.

`{ AT ( { TIMESTAMP => timestamp | OFFSET => time_difference | STATEMENT => id | STREAM => 'name' } ) | BEFORE ( { TIMESTAMP => timestamp | OFFSET => time_difference | STATEMENT => id } ) }`
:   Creates a stream at a specific time/point in the past (using [Time Travel](../../user-guide/data-time-travel.md)). The
    [AT | BEFORE](../constructs/at-before.md) clause determines the point in the past from which historical data is requested:

    > * The `AT` keyword specifies that the request is inclusive of any changes made by a statement or transaction with a timestamp
    >   equal to the specified parameter.
    >
    >   The `STREAM => '<name>'` value is special. When provided, the CREATE STREAM statement creates the new stream at the same
    >   offset as the specified stream. You can also provide this value when recreating an existing stream (using the `OR REPLACE`
    >   keywords) to retain the current offset of the stream after it is recreated. `'<name>'` is the identifier (i.e. name) for
    >   the existing stream whose offset is copied to the new or recreated stream.
    >
    >   The new or recreated stream advances the offset, as usual, when the stream is used in a DML transaction.
    > * The `BEFORE` keyword specifies that the request refers to a point immediately preceding the specified parameter.
    >
    > > **Note:**
    > >
    > > If no change tracking data is available on the source object at the point in the past specified in the AT | BEFORE clause, the
    > > CREATE STREAM statement fails. No stream can be created at a time in the past before change tracking was recorded.

`APPEND_ONLY = TRUE | FALSE`
:   Only supported for streams on standard tables or streams on views that query standard tables.

    Specifies whether this is an append-only stream. Append-only streams track row inserts only. Update and delete operations (including
    table truncates) are not recorded. For example, if 10 rows are inserted into a table and then 5 of those rows are deleted before the
    offset for an append-only stream is advanced, the stream records 10 rows.

    This type of stream improves query performance over standard streams and is very useful for extract, load, transform (ELT) and similar
    scenarios that depend exclusively on row inserts.

    A standard stream joins the deleted and inserted rows in the change set to determine which rows were deleted and which were updated.
    An append-only stream returns the appended rows only and therefore can be much more performant than a standard stream. For example,
    the source table can be truncated immediately after the rows in an append-only stream are consumed, and the record deletions do not
    contribute to the overhead the next time the stream is queried or consumed.

    Default:
    :   `FALSE`

`INSERT_ONLY = TRUE | FALSE`
:   Required for streams on external tables and externally managed Iceberg tables. Not supported by streams on other objects.

    Specifies whether this is an insert-only stream. Insert-only streams track row inserts only; they do not record delete operations
    that remove rows from an inserted set (i.e. no-ops). For example, in-between any two offsets, if `File1` is removed from the
    cloud storage location referenced by the external table, and `File2` is added, the stream returns records for the rows in
    `File2` only, regardless of whether `File1` was added before or within the requested change interval. Unlike
    when tracking change data capture (CDC) data for standard tables, access to the historical records for files in cloud storage is
    not governed by or guaranteed to Snowflake.

    Overwritten or appended files are essentially handled as new files: The old version of the file is removed from cloud storage, but the
    insert-only stream does not record the delete operation. The new version of the file is added to cloud storage, and the insert-only
    stream records the rows as inserts. The stream does not record the diff of the old and new file versions. Note that appends may not
    trigger an automatic refresh of the external table metadata, such as when using
    [Azure AppendBlobs](../../user-guide/tables-external-azure.md).

    Default:
    :   `FALSE`

`SHOW_INITIAL_ROWS = TRUE | FALSE`
:   Specifies the records to return the first time the stream is consumed.

    `TRUE`
    :   The stream returns only the rows that existed in the source object at the moment when the stream was created. The
        METADATA$ISUPDATE column shows a FALSE value in these rows. Subsequently, the stream returns any DML changes to the source object
        since the most recent offset; that is, the normal stream behavior.

        This parameter enables initializing any downstream process with the contents of the source object for the stream.

    `FALSE`

    > The stream returns any DML changes to the source object since the most recent offset.

    Default:
    :   `FALSE`

`COMMENT = 'string_literal'`
:   String (literal) that specifies a comment for the stream.

    Default: No value

## Output

The output for a stream includes the same columns as the source object along with the following additional columns:

* METADATA$ACTION: Specifies the action (INSERT or DELETE).
* METADATA$ISUPDATE: Specifies whether the action recorded (INSERT or DELETE) is part of an UPDATE applied to the rows in the source table
  or view.

  Note that streams record the differences between two offsets. If a row is added and then updated in the current offset, the delta change
  is a new row. The METADATA$ISUPDATE row records a FALSE value.
* METADATA$ROW_ID: Specifies the unique and immutable ID for the row, which can be used to track changes to specific rows over time.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

Streams on standard tables:

> | Object | Privilege | Notes |
> | --- | --- | --- |
> | Schema | CREATE STREAM |  |
> | Table | SELECT | If change tracking has not been enabled on the source table (using [ALTER TABLE … SET CHANGE_TRACKING = TRUE](alter-table.md)), then only the table owner (i.e. the role that has the OWNERSHIP privilege on the table) can create the initial stream on the table. Creating the initial stream automatically enables change tracking on the table. |
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Streams on views:

> | Object | Privilege | Notes |
> | --- | --- | --- |
> | Schema | CREATE STREAM |  |
> | View | SELECT | If change tracking has not been enabled on the source view and its underlying tables, then only a role that has the OWNERSHIP privilege on the view and its underlying tables owner can create the initial stream on the view. Creating the initial stream automatically enables change tracking on the table. For instructions on enabling change tracking on a view and its underlying tables, refer to [Enabling change tracking on views and underlying tables](../../user-guide/streams-manage.md). Note that enabling change tracking locks the underlying tables while change tracking is being enabled. Locks on the underlying objects may cause latency in DDL/DML operations with these objects. For more information, refer to [Resource locking](../transactions.md). |
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Streams on directory tables:

> | Object | Privilege | Notes |
> | --- | --- | --- |
> | Schema | CREATE STREAM |  |
> | Stage | USAGE (external stage) or READ (internal stage) |  |
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Streams on external tables:

> | Object | Privilege | Notes |
> | --- | --- | --- |
> | Schema | CREATE STREAM |  |
> | External table | SELECT |  |
>
> Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A stream can be queried multiple times to update multiple objects in the same transaction and it will return the same data.
* The stream position (i.e. *offset*) is advanced when the stream is used in a DML statement. The position is updated at the end of the
  transaction to the beginning timestamp of the transaction. The stream describes change records starting from the current position of the
  stream and ending at the current transactional timestamp.

  To ensure multiple statements access the same change records in the stream, surround them with an explicit transaction statement
  ([BEGIN](begin.md) .. [COMMIT](commit.md)). An explicit transaction locks the stream, so that DML updates to
  the source object are not reported to the stream until the transaction is committed.
* Streams have no Fail-safe period or Time Travel retention period. The metadata in these objects cannot be recovered if a stream is dropped.
* Streams on shared tables:

  + The retention period for a source table is not extended automatically to prevent any streams on the table from becoming stale.
* Standard streams cannot retrieve change data for geospatial data. We recommend creating append-only streams on objects that contain
  geospatial data.
* Streams on views:

  + Creating the first stream on a view using the view owner role (i.e. the role with the OWNERSHIP privilege on the view) enables change
    tracking on the view. If the same role also owns the underlying tables, change tracking is also enabled on the tables. If the role was
    not granted the OWNERSHIP privilege on both the view and its underlying tables, then change tracking must be enabled manually on the
    applicable objects. For instructions, see [Enabling change tracking on views and underlying tables](../../user-guide/streams-manage.md).
  + Depending on the number of joins in a view, a single change in the underlying tables could result in a large number of changes in the
    stream output.
  + Any stream on a given view breaks if the source view or underlying tables are dropped or recreated (using CREATE OR REPLACE VIEW).
  + Any streams on a secure view adhere to the secure view constraints.

    If the owner of a non-secure view (i.e. the role with the OWNERSHIP privilege on the view) changes it to a secure view (using ALTER
    VIEW … SET SECURE), any stream on the view automatically enforces secure view constraints.

    In addition, the retention period for the underlying tables is not extended automatically to prevent any streams on the secure
    view from becoming stale.
  + Streams based on views where the view uses non-deterministic functions can return non-deterministic results.

    For example, the results of [context functions](../functions-context.md) such as [CURRENT_DATE](../functions/current_date.md),
    and [CURRENT_USER](../functions/current_user.md) are non-deterministic. The results of [data generation functions](../functions-data-generation.md)
    such as [RANDOM](../functions/random.md) are also non-deterministic.
    If a view contains a non-deterministic function, then any stream on that view will not be a constant snapshot of the
    function’s output. Instead the value in the stream may change when queried.

    We recommend that you ensure that the non-determinism in the results of a view does not
    affect the correctness of the stream query results.

    For an example, see [Stream on a view that calls a non-deterministic SQL function](../../user-guide/streams-examples.md).
* Streams on directory tables: The METADATA$ROW_ID column values in the stream output are empty.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Creating a table stream

Create a stream on the `mytable` table:

```sqlexample
CREATE STREAM mystream ON TABLE mytable;
```

### Using Time Travel with the source table

Create a stream on the `mytable` table as it existed before the date and time in the specified timestamp:

```sqlexample
CREATE STREAM mystream ON TABLE mytable BEFORE (TIMESTAMP => TO_TIMESTAMP(40*365*86400));
```

Create a stream on the `mytable` table as it existed exactly at the date and time of the specified timestamp:

```sqlexample
CREATE STREAM mystream ON TABLE mytable AT (TIMESTAMP => TO_TIMESTAMP_TZ('02/02/2019 01:02:03', 'mm/dd/yyyy hh24:mi:ss'));
```

Create a stream on the `mytable` table as it existed 5 minutes ago:

```sqlexample
CREATE STREAM mystream ON TABLE mytable AT(OFFSET => -60*5);
```

Create a stream on the `mytable` table with the same offset as existing stream `oldstream` on the same source table:

```sqlexample
CREATE STREAM mystream ON TABLE mytable AT(STREAM => 'oldstream');
```

Recreate the existing `mystream` stream but retain its current offset:

```sqlexample
CREATE OR REPLACE STREAM mystream ON TABLE mytable AT(STREAM => 'mystream');
```

Create a stream on the `mytable` table including transactions up to, but not including any changes made by the specified transaction:

```sqlexample
CREATE STREAM mystream ON TABLE mytable BEFORE(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726');
```

### Creating a stream on a single-table view

Create a stream on the `myview` view:

```sqlexample
CREATE STREAM mystream ON VIEW myview;
```

For additional examples, see [Stream examples](../../user-guide/streams-examples.md).

### Creating an insert-only stream on an external table

Create an external table stream and query the change data capture records in the stream, which track the records added to the external
table metadata:

```sqlexample
-- Create an external table that points to the MY_EXT_STAGE stage.
-- The external table is partitioned by the date (in YYYY/MM/DD format) in the file path.
CREATE EXTERNAL TABLE my_ext_table (
  date_part date as to_date(substr(metadata$filename, 1, 10), 'YYYY/MM/DD'),
  ts timestamp AS (value:time::timestamp),
  user_id varchar AS (value:userId::varchar),
  color varchar AS (value:color::varchar)
) PARTITION BY (date_part)
  LOCATION=@my_ext_stage
  AUTO_REFRESH = false
  FILE_FORMAT=(TYPE=JSON);

-- Create a stream on the external table
CREATE STREAM my_ext_table_stream ON EXTERNAL TABLE my_ext_table INSERT_ONLY = TRUE;

-- Execute SHOW streams
-- The MODE column indicates that the new stream is an INSERT_ONLY stream
SHOW STREAMS;
+-------------------------------+------------------------+---------------+-------------+--------------+-----------+------------------------------------+-------+-------+-------------+
| created_on                    | name                   | database_name | schema_name | owner        | comment   | table_name                         | type  | stale | mode        |
|-------------------------------+------------------------+---------------+-------------+--------------+-----------+------------------------------------+-------+-------+-------------|
| 2020-08-02 05:13:20.174 -0800 | MY_EXT_TABLE_STREAM    | MYDB          | PUBLIC      | MYROLE       |           | MYDB.PUBLIC.EXTTABLE_S3_PART       | DELTA | false | INSERT_ONLY |
+-------------------------------+------------------------+---------------+-------------+--------------+-----------+------------------------------------+-------+-------+-------------+

-- Add a file named '2020/08/05/1408/log-08051409.json' to the stage using the appropriate tool for the cloud storage service.

-- Manually refresh the external table metadata.
ALTER EXTERNAL TABLE my_ext_table REFRESH;

-- Query the external table stream.
-- The stream indicates that the rows in the added JSON file were recorded in the external table metadata.
SELECT * FROM my_ext_table_stream;
+----------------------------------------+------------+-------------------------+---------+-------+-----------------+-------------------+-----------------+---------------------------------------------+
| VALUE                                  | DATE_PART  | TS                      | USER_ID | COLOR | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID | METADATA$FILENAME                           |
|----------------------------------------+------------+-------------------------+---------+-------+-----------------+-------------------+-----------------+---------------------------------------------|
| {                                      | 2020-08-05 | 2020-08-05 15:57:01.000 | user25  | green | INSERT          | False             |                 | test/logs/2020/08/05/1408/log-08051409.json |
|   "color": "green",                    |            |                         |         |       |                 |                   |                 |                                             |
|   "time": "2020-08-05 15:57:01-07:00", |            |                         |         |       |                 |                   |                 |                                             |
|   "userId": "user25"                   |            |                         |         |       |                 |                   |                 |                                             |
| }                                      |            |                         |         |       |                 |                   |                 |                                             |
| {                                      | 2020-08-05 | 2020-08-05 15:58:02.000 | user56  | brown | INSERT          | False             |                 | test/logs/2020/08/05/1408/log-08051409.json |
|   "color": "brown",                    |            |                         |         |       |                 |                   |                 |                                             |
|   "time": "2020-08-05 15:58:02-07:00", |            |                         |         |       |                 |                   |                 |                                             |
|   "userId": "user56"                   |            |                         |         |       |                 |                   |                 |                                             |
| }                                      |            |                         |         |       |                 |                   |                 |                                             |
+----------------------------------------+------------+-------------------------+---------+-------+-----------------+-------------------+-----------------+---------------------------------------------+
```

### Creating a standard stream on a directory table

Create a stream on the directory table for a stage named `mystage`:

```sqlexample
CREATE STREAM dirtable_mystage_s ON STAGE mystage;
```

Manually refresh the directory table metadata to populate the stream:

```sqlexample
ALTER STAGE mystage REFRESH;
```

Query the stream after one or more files were added to the stage after the most recent offset for the stream:

```sqlexample
SELECT * FROM dirtable_mystage_s;

+-------------------+--------+-------------------------------+----------------------------------+----------------------------------+-------------------------------------------------------------------------------------------+-----------------+-------------------+-----------------+
| RELATIVE_PATH     | SIZE   | LAST_MODIFIED                 | MD5                              | ETAG                             | FILE_URL                                                                                  | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID |
|-------------------+--------+-------------------------------+----------------------------------+----------------------------------+-------------------------------------------------------------------------------------------+-----------------+-------------------+-----------------|
| file1.csv.gz      |   1048 | 2021-05-14 06:09:08.000 -0700 | c98f600c492c39bef249e2fcc7a4b6fe | c98f600c492c39bef249e2fcc7a4b6fe | https://myaccount.snowflakecomputing.com/api/files/MYDB/MYSCHEMA/MYSTAGE/file1%2ecsv%2egz | INSERT          | False             |                 |
| file2.csv.gz      |   3495 | 2021-05-14 06:09:09.000 -0700 | 7f1a4f98ef4c7c42a2974504d11b0e20 | 7f1a4f98ef4c7c42a2974504d11b0e20 | https://myaccount.snowflakecomputing.com/api/files/MYDB/MYSCHEMA/MYSTAGE/file2%2ecsv%2egz | INSERT          | False             |                 |
+-------------------+--------+-------------------------------+----------------------------------+----------------------------------+-------------------------------------------------------------------------------------------+-----------------+-------------------+-----------------+
```

---
title: CREATE STREAMLIT
source: https://docs.snowflake.com/en/sql-reference/sql/create-streamlit.md
section: SQL Commands
---

# CREATE STREAMLIT

Creates a new Streamlit object in Snowflake or replaces an existing Streamlit
object in the same schema.

See also:
:   [SHOW STREAMLITS](show-streamlits.md), [DESCRIBE STREAMLIT](desc-streamlit.md), [ALTER STREAMLIT](alter-streamlit.md),
    [DROP STREAMLIT](drop-streamlit.md), [UNDROP STREAMLIT](undrop-streamlit.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] STREAMLIT [ IF NOT EXISTS ] <name>
  [ FROM <source_location> ]
  [ MAIN_FILE = '<filename>' ]
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ RUNTIME_NAME = '<runtime_name>' ]
  [ COMPUTE_POOL = <compute_pool_name> ]
  [ COMMENT = '<string_literal>' ]
  [ TITLE = '<app_title>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]
  [ SECRETS = ( '<snowflake_secret_name>' = <snowflake_secret> [ , ... ] ) ]
```

**The following syntax is legacy:**

> **Important:**
>
> * ROOT_LOCATION is a legacy parameter and may be deprecated in a future release.
> * For container runtimes, ROOT_LOCATION is not supported.
> * For Streamlit apps created using ROOT_LOCATION, multi-file editing and Git integration are not supported.

```sqlsyntax
CREATE [ OR REPLACE ] STREAMLIT [ IF NOT EXISTS ] <name>
  ROOT_LOCATION = '<stage_path_and_root_directory>'
  MAIN_FILE = '<path_to_main_file_in_root_directory>'
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ COMMENT = '<string_literal>' ]
  [ TITLE = '<app_title>' ]
  [ IMPORTS = ( '<stage_path_and_directory_or_file_name_to_read>' [ , ... ] ) ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]
```

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the Streamlit object. This identifier
    must be unique for the schema where the object is created.

    In addition, the identifier must start with an alphabetic character and can’t
    contain spaces or special characters unless the entire identifier string is
    enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in
    double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`FROM source_location`
:   Copies the app source files from the specified location. The location must be
    within an internal named stage. The path can be relative or fully qualified.
    For example, if the stage is named
    `@streamlit_db.streamlit_schema.streamlit_stage`, valid source locations can
    include:

    * A fully qualified path to the root of the stage:
      `FROM '@streamlit_db.streamlit_schema.streamlit_stage'`
    * A relative path to the root of the stage:
      `FROM '@streamlit_stage'`
    * A fully qualified or relative path to a subdirectory within the stage:
      `FROM '@streamlit_db.streamlit_schema.streamlit_stage/subdir'`

    Files are copied only one time when the CREATE command is executed; future
    changes to the source location don’t automatically update the Streamlit app.

    If this parameter isn’t specified, Snowflake copies the source files for a
    default app with a `streamlit_app.py` entrypoint file.

`MAIN_FILE = 'filename'`
:   Specifies the Streamlit entrypoint file. The requirements depend on the runtime type:

    * **Warehouse runtimes**: The file must be in the root of the source directory specified in FROM.
      Only a filename is allowed, not a path.
    * **Container runtimes**: The file can be in the root or a subdirectory. You can specify a relative
      path from the root of the source directory, like `'subdir/my_app.py'`.

    If you are using ROOT_LOCATION instead of FROM, then MAIN_FILE can be a path relative to ROOT_LOCATION
    even though ROOT_LOCATION only supports warehouse runtimes.

    DEFAULT: `'streamlit_app.py'`

`QUERY_WAREHOUSE = warehouse_name`
:   Specifies the warehouse used by the Streamlit app. The behavior depends on the runtime type:

    * **Warehouse runtimes**: Specifies the warehouse to run the app code and execute SQL queries.
      This is the code warehouse. It’s recommended to manually switch to a different warehouse within your app code for queries.
    * **Container runtimes**: Specifies the warehouse to execute SQL queries issued by the app.
      The app code runs on the compute pool specified by COMPUTE_POOL.

    DEFAULT: No value

    > **Note:**
    >
    > Although you can create a Streamlit object without this parameter, the app
    > won’t run until you specify a query warehouse.

`RUNTIME_NAME = 'runtime_name'`
:   Specifies the runtime environment for the Streamlit app. The runtime determines where and how
    the app executes.

    * **Warehouse runtime**: Run the app in a virtual warehouse. Each viewer gets a personal instance
      of the app. Use `SYSTEM$WAREHOUSE_RUNTIME`. The Python version is selected separately
      using the `environment.yml` file.
    * **Container runtimes**: Run the app in a Snowpark Container Services compute pool. All viewers
      share a single, long-running instance of the app. Container runtime names include the Python
      version. The following container runtimes are valid:

      + `SYSTEM$ST_CONTAINER_RUNTIME_PY3_11`

    The runtime defaults to the warehouse runtime.

    DEFAULT: `SYSTEM$WAREHOUSE_RUNTIME`

`COMPUTE_POOL = compute_pool_name`
:   Specifies the compute pool where the Streamlit app runs. This parameter is used only with
    container runtimes and is ignored for warehouse runtimes.

    If you omit this parameter when using a container runtime, Snowflake uses the compute pool specified by the
    [DEFAULT_STREAMLIT_COMPUTE_POOL](../parameters.md) parameter. If the DEFAULT_STREAMLIT_COMPUTE_POOL parameter is
    updated after the Streamlit app is created, it won’t affect the compute pool used by the app.

    DEFAULT: The compute pool specified by the DEFAULT_STREAMLIT_COMPUTE_POOL account parameter.

`COMMENT = 'string_literal'`
:   Specifies a comment for the Streamlit object.

    DEFAULT: No value

`TITLE = 'app_title'`
:   Specifies a title for the Streamlit object to display in Snowsight.

    DEFAULT: The name of the Streamlit object passed to CREATE STREAMLIT.

`IMPORTS = ( 'stage_path_and_directory_or_file_name_to_read' [ , ... ] )`
:   The location (stage), path, and name of the directory or file(s) to import. This only applies to warehouse runtimes and
    is ignored for container runtimes.

    DEFAULT: No value

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   The names of [external access integrations](create-external-access-integration.md) needed in order for the
    Streamlit app code to access external networks.

    For container runtimes, external access integrations are required to install packages from external package indexes
    like PyPI. For all runtime types, external access integrations enable the app to make outbound network requests.

    DEFAULT: No value

`SECRETS = ( 'snowflake_secret_name' = snowflake_secret [ , ... ] )`
:   Maps Snowflake secrets to secret names that can be referenced in the Streamlit app code. The secret name (left side)
    is how you reference the secret in your code, and the secret object (right side) is the identifier of the Snowflake secret.

    For example: `SECRETS = ('api_key' = my_database.my_schema.my_secret)`

    In warehouse runtimes, secrets are accessed through the `_snowflake` module. In container runtimes,
    secrets are accessible through `st.secrets` and are also mapped to environment variables.
    Secrets must be associated with an external access integration in EXTERNAL_ACCESS_INTEGRATIONS.
    For more information, see [Manage secrets and configure your Streamlit app](../../developer-guide/streamlit/app-development/secrets-and-configuration.md).

    DEFAULT: No value

`ROOT_LOCATION = 'stage_path_and_root_directory'`
:   Specifies the path to the named stage containing the Streamlit Python files, media files, and the
    `environment.yml` file, for example:

    ```sqlexample
    ROOT_LOCATION = '@streamlit_db.streamlit_schema.streamlit_stage'
    ```

    In this example, the Streamlit files are located on a named stage named `streamlit_stage` within a database named
    `streamlit_db` and schema named `streamlit_schema`.

    > **Note:**
    >
    > * This parameter must point to a single directory inside a named internal stage.
    > * External stages are not supported for Streamlit in Snowflake.
    > * If you’re creating or replacing a Streamlit application object within the Snowflake Native App Framework, use `FROM 'relative_path_from_stage_root_directory'` and not `ROOT_LOCATION = 'stage_path_and_root_directory'`.

## Access control requirements

If your role does not own the objects in the following table, then your role
must have the listed
[privileges](../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE STREAMLIT | Schema where you create the Streamlit object |  |
| READ | Stage from which you copy the Streamlit app source files |  |
| USAGE | Warehouse used by the Streamlit app |  |
| USAGE | Compute pool used by the Streamlit app | This privilege is only required if your app uses a container runtime. |
| USAGE | External access integrations used by the Streamlit app | This privilege is only required if your app uses external access integrations. For container runtimes, this privilege is required to install packages from external package indexes like PyPI. |
| USAGE | Secrets used by the Streamlit app | This privilege is only required if your app uses secrets and only applies to warehouse runtimes. |
| CREATE STAGE | Schema where you create the Streamlit object | This privilege is only required to create Streamlit objects with the ROOT_LOCATION parameter. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You must initialize the app after creating it.

  > **Important:**
  >
  > After you use CREATE STREAMLIT, the Streamlit app isn’t live until you do one of the
  > following actions:
  >
  > + Execute ALTER STREAMLIT … ADD LIVE VERSION FROM LAST on the new
  >   Streamlit object.
  > + Visit the app in Snowsight with the role that owns the app.
* When you clone a schema or database containing a Streamlit object, the Streamlit object is not cloned.
* To specify the packages used by the Streamlit application, include a dependency file in the source files.
  The format of the dependency file depends on the runtime type:

  + **Warehouse runtime**: Use an `environment.yml` file.
  + **Container runtime**: Use a `pyproject.toml` or `requirements.txt` file.

  For more information, see [Manage dependencies for your Streamlit app](../../developer-guide/streamlit/app-development/dependency-management.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

### Create a Streamlit app with default source files

To create a container-runtime Streamlit app from built-in default files, run the CREATE STREAMLIT
command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
  COMPUTE_POOL = my_compute_pool
  QUERY_WAREHOUSE = my_warehouse;
```

By default, apps use the latest warehouse runtime if RUNTIME_NAME isn’t specified. To create a warehouse-runtime
Streamlit app from built-in default files, run the CREATE STREAMLIT command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  QUERY_WAREHOUSE = my_warehouse;
```

### Create a Streamlit app from a custom source files

To create a container-runtime Streamlit app from custom source files, run the CREATE STREAMLIT
command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  FROM @streamlit_db.streamlit_schema.streamlit_stage
  MAIN_FILE = 'streamlit_main.py'
  QUERY_WAREHOUSE = my_warehouse
  RUNTIME_NAME = 'SYSTEM$ST_CONTAINER_RUNTIME_PY3_11'
  COMPUTE_POOL = my_compute_pool;
```

To create a warehouse-runtime Streamlit app from custom source files, run the CREATE STREAMLIT
command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  FROM @streamlit_db.streamlit_schema.streamlit_stage
  MAIN_FILE = 'streamlit_main.py'
  QUERY_WAREHOUSE = my_warehouse;
```

### Create a warehouse-runtime Streamlit app with secrets

To create a warehouse-runtime Streamlit app with secrets, run the CREATE STREAMLIT command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  FROM @streamlit_db.streamlit_schema.streamlit_stage
  MAIN_FILE = 'streamlit_main.py'
  QUERY_WAREHOUSE = my_warehouse
  SECRETS = ('api_key' = streamlit_db.streamlit_schema.my_api_secret);
```

For container-runtime apps, secrets are accessible through `st.secrets` and as environment variables.
For more information, see [Manage secrets and configure your Streamlit app](../../developer-guide/streamlit/app-development/secrets-and-configuration.md).

### Create a Streamlit app from a Git repository

To create a Streamlit app from a Git repository, run the CREATE STREAMLIT command as shown in the following example:

```sqlexample
CREATE STREAMLIT hello_streamlit
  FROM @streamlit_db.streamlit_schema.streamlit_repo/branches/streamlit_branch/
  MAIN_FILE = 'streamlit_main.py'
  QUERY_WAREHOUSE = my_warehouse;
```

---
title: CREATE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/create-table.md
section: SQL Commands
---

# CREATE TABLE

Creates a new table in the current/specified schema, replaces an existing table, or alters an existing table. A table can have multiple
columns, with each column definition consisting of a name, data type, and optionally whether the column:

* Requires a value (NOT NULL).
* Has a default value.
* Has any referential integrity constraints (primary key, foreign key, and so on).

In addition, this command supports the following variants:

* CREATE OR ALTER TABLE (creates a table if it doesn’t exist, or alters it according to the table definition)
* CREATE TABLE … AS SELECT (creates a populated table; also referred to as CTAS)
* CREATE TABLE … USING TEMPLATE (creates a table with the column definitions derived from a set of staged files)
* CREATE TABLE … LIKE (creates an empty copy of an existing table)
* CREATE TABLE … CLONE (creates a clone of an existing table)
* CREATE TABLE … FROM ARCHIVE OF (creates a table from archived data)
* CREATE TABLE … FROM BACKUP SET (restores a table from a backup under a new name)

See also:
:   [ALTER TABLE](alter-table.md) , [DROP TABLE](drop-table.md) , [SHOW TABLES](show-tables.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ]
    [ { [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | VOLATILE | TRANSIENT } ]
  TABLE [ IF NOT EXISTS ] <table_name>

  (
    -- Column definition
    <col_name> <col_type>
      [ inlineConstraint ]
      [ NOT NULL ]
      [ COLLATE '<collation_specification>' ]
      [
        {
          DEFAULT <expr>
          | { AUTOINCREMENT | IDENTITY }
            [
              {
                ( <start_num> , <step_num> )
                | START <num> INCREMENT <num>
              }
            ]
            [ { ORDER | NOORDER } ]
        }
      ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] PROJECTION POLICY <policy_name> ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )

  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE } ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COPY GRANTS ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ COPY TAGS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> [ ENTITY KEY ( <col_name> [ , <col_name> ... ] ) ] ]
  [ [ WITH ] JOIN POLICY <policy_name> [ ALLOWED JOIN KEYS ( <col_name> [ , ... ] ) ] ]
  [ [ WITH ] STORAGE LIFECYCLE POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  [ ROW_TIMESTAMP = { TRUE | FALSE } ]
```

Where:

> `col_name` is an [object identifier](../identifiers.md). It must follow the [requirements for Snowflake identifiers](../identifiers-syntax.md).
>
> `col_type` is one of the [Snowflake data types](../../sql-reference-data-types.md), such as
> [NUMBER](../data-types-numeric.md) or [VARCHAR](../data-types-text.md).
>
> ```sqlsyntax
> inlineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE
>     | PRIMARY KEY
>     | [ FOREIGN KEY ] REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
> ```
>
> For additional inline constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).
>
> ```sqlsyntax
> outoflineConstraint ::=
>   [ CONSTRAINT <constraint_name> ]
>   {   UNIQUE [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | PRIMARY KEY [ ( <col_name> [ , <col_name> , ... ] ) ]
>     | [ FOREIGN KEY ] [ ( <col_name> [ , <col_name> , ... ] ) ]
>       REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>     | CHECK ( <expr> )
>   }
>   [ <constraint_properties> ]
>   [ COMMENT '<string_literal>' ]
> ```
>
> For additional out-of-line constraint details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md).

> **Note:**
>
> Do not specify copy options using the CREATE STAGE, ALTER STAGE, CREATE TABLE, or ALTER TABLE commands. We recommend that you use the [COPY INTO <table>](copy-into-table.md) command to specify copy options.

**Restored table (from a backup)**

```sqlsyntax
CREATE TABLE <name> FROM BACKUP SET <backup_set> IDENTIFIER '<backup_id>'
```

## Variant syntax

### CREATE OR ALTER TABLE

Creates a table if it doesn’t exist, or alters it according to the table definition. The CREATE OR ALTER TABLE syntax follows the
rules of a CREATE TABLE statement and has the same limitations as an ALTER TABLE statement. If the table is transformed, existing
data in the table is preserved when possible. If a column must be dropped, data loss might occur.

The following changes are supported when altering a table:

* Change table properties and parameters. For example, ENABLE_SCHEMA_EVOLUTION, DATA_RETENTION_TIME_IN_DAYS, or CLUSTER BY.
* Change column data type, default value, nullability, comment, or autoincrement.
* Add new columns to the end of the column list.
* Drop columns.
* Add, drop, or modify inline or out-of-line constraints.
* Add, drop, or modify clustering keys.

For more information, see CREATE OR ALTER TABLE usage notes.

```sqlsyntax
CREATE OR ALTER
    [ { [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | TRANSIENT } ]
  TABLE <table_name> (
    -- Column definition
    <col_name> <col_type>
      [ inlineConstraint ]
      [ NOT NULL ]
      [ COLLATE '<collation_specification>' ]
      [
        {
          DEFAULT <expr>
          | { AUTOINCREMENT | IDENTITY }
            [
              {
                ( <start_num> , <step_num> )
                | START <num> INCREMENT <num>
              }
            ]
            [ { ORDER | NOORDER } ]
        }
      ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE } ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ ERROR_LOGGING = { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  [ ROW_TIMESTAMP = { TRUE | FALSE } ]
```

### CREATE TABLE … AS SELECT (also referred to as CTAS)

Creates a new table populated with the data returned by a query:

> ```sqlsyntax
> CREATE [ OR REPLACE ] TABLE <table_name> [ ( <col_name> [ <col_type> ] , <col_name> [ <col_type> ] , ... ) ]
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ COPY GRANTS ]
>   [ COPY TAGS ]
>   [ ... ]
>   AS <query>
> ```

A masking policy can be applied to a column in a CTAS statement. Specify the masking policy after the column data type. Similarly, a
row access policy can be applied to the table. For example:

> ```sqlsyntax
> CREATE TABLE <table_name> ( <col1> <data_type> [ WITH ] MASKING POLICY <policy_name> [ , ... ] )
>   ...
>   [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col1> [ , ... ] )
>   [ ... ]
>   AS <query>
> ```

> **Note:**
>
> In a CTAS statement, the COPY GRANTS clause is valid only when combined with the OR REPLACE clause. COPY GRANTS copies
> permissions from the table being replaced with CREATE OR REPLACE (if it already exists), not from the source
> table(s) being queried in the SELECT statement. CTAS with COPY GRANTS allows you to overwrite a table with a new
> set of data while keeping existing grants on that table.
>
> For more details about COPY GRANTS, see COPY GRANTS in this document.

### CREATE TABLE … USING TEMPLATE

Creates a new table with the column definitions derived from a set of staged files using the [INFER_SCHEMA](../functions/infer_schema.md) function.
This feature supports Apache Parquet, Apache Avro, ORC, JSON, and CSV files.

> ```sqlsyntax
> CREATE [ OR REPLACE ] TABLE <table_name>
>   [ COPY GRANTS ]
>   USING TEMPLATE <query>
>   [ ... ]
> ```

> **Note:**
>
> If the statement is replacing an existing table of the same name, then the grants are copied from the table
> being replaced. If there is no existing table of that name, then the grants are copied from the source table
> being cloned.

For more details about COPY GRANTS, see COPY GRANTS in this document.

### CREATE TABLE … LIKE

Creates a new table with the same column definitions as an existing table, but without copying data from the existing table. Column
names, types, defaults, and constraints are copied to the new table:

> ```sqlsyntax
> CREATE [ OR REPLACE ] TABLE <table_name> LIKE <source_table>
>   [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
>   [ COPY GRANTS ]
>   [ COPY TAGS ]
>   [ ... ]
> ```

For more details about COPY GRANTS, see COPY GRANTS in this document.

> > **Note:**
> >
> > CREATE TABLE … LIKE for a table with an auto-increment sequence accessed through a data share is currently not
> > supported.

### CREATE TABLE … CLONE

Creates a new table with the same column definitions and containing all the existing data from the source table, without actually
copying the data. This variant can also be used to clone a table at a specific time/point in the past (using
[Time Travel](../../user-guide/data-time-travel.md)):

> ```sqlsyntax
> CREATE [ OR REPLACE ]
>     [ {
>           [ { LOCAL | GLOBAL } ] TEMP [ READ ONLY ] |
>           TEMPORARY [ READ ONLY ] |
>           VOLATILE |
>           TRANSIENT
>     } ]
>   TABLE <name> CLONE <source_table>
>     [ { AT | BEFORE } ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
>     [ COPY GRANTS ]
>     [ COPY TAGS ]
>     [ ... ]
> ```

> **Note:**
>
> * If the statement is replacing an existing table of the same name,
>   then the grants are copied from the table being replaced.
>   If there is no existing table of that name, then the grants are
>   copied from the source table being cloned.
> * If you directly clone a table, any streams on that table are not cloned.
> * If you clone a schema including tables with streams, then the streams are also cloned.

For more details about COPY GRANTS, see COPY GRANTS in this document.

For more details about cloning, see [CREATE <object> … CLONE](create-clone.md).

For details about cloning dynamic tables to tables, see [Clone a dynamic table to a new table](../../user-guide/dynamic-tables-clone.md).

### CREATE TABLE … FROM ARCHIVE OF

Creates a new table containing rows that were archived by a
[storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md).
You can specify filter conditions to retrieve specific archived data.

```sqlsyntax
CREATE [ TRANSIENT ] TABLE [ IF NOT EXISTS ] <name>
  FROM ARCHIVE OF <source_table> [ [ AS ] <alias_name> ]
  WHERE <expression>
```

For more information, see FROM ARCHIVE OF parameters and
the Usage notes.

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the table; must be unique for the schema in which the table is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`col_name`
:   Specifies the column identifier (i.e. name). All the requirements for table identifiers also apply to column identifiers.

    For more details, see [Identifier requirements](../identifiers-syntax.md) and [Reserved & limited keywords](../reserved-keywords.md).

    > **Note:**
    >
    > In addition to the standard reserved keywords, the following keywords cannot be used as column identifiers because they are
    > reserved for ANSI-standard context functions:
    >
    > * `CURRENT_DATE`
    > * `CURRENT_ROLE`
    > * `CURRENT_TIME`
    > * `CURRENT_TIMESTAMP`
    > * `CURRENT_USER`
    >
    > For the list of reserved keywords, see [Reserved & limited keywords](../reserved-keywords.md).

`col_type`
:   Specifies the data type for the column.

    For details about the data types that can be specified for table columns, see [SQL data types reference](../../sql-reference-data-types.md).

`query`
:   Required for CTAS and USING TEMPLATE.

    * For CTAS, specifies the [SELECT statement](../constructs.md) that populates the table. This query must be
      specified last in the CTAS statement, regardless of the other parameters that you include.
    * For CREATE TABLE … USING TEMPLATE, specifies the subquery that calls the [INFER_SCHEMA](../functions/infer_schema.md) function and
      formats the output as an array. Alternatively, `USING TEMPLATE` accepts the INFER_SCHEMA output as a string
      literal or variable.

`source_table`
:   Required for LIKE, CLONE, and FROM ARCHIVE OF.

    * For CREATE TABLE … LIKE, specifies the table from which properties and column definitions are copied.
    * For CREATE TABLE … CLONE, specifies the table to use as the source for the clone.
    * For CREATE TABLE … FROM ARCHIVE OF, see FROM ARCHIVE OF parameters.

## Backup parameters

The FROM BACKUP SET clause restores a table from a backup. You don’t specify other table
properties because they’re all the same as in the backed-up table.

> **Note:**
>
> The FROM SNAPSHOT SET clause is deprecated. Use FROM BACKUP SET instead.

This form doesn’t have a CREATE OR REPLACE clause. You typically either restore the
table under a new name and recover any data or other objects from this new table,
or rename the original table and then restore the table under the original name.

> **Note:**
>
> The restored table is independent of the original table from the backup.
> There isn’t any cloning relationship between the restored and original tables.
> Therefore, all the micro-partitions in the restored table are owned by that table.
>
> If you want to make backups of the newly restored table, create a new backup set for it.

For more information about backups, see [Backups for disaster recovery and immutable storage](../../user-guide/backups.md).

`backup_set`
:   Specifies the name of a backup set created for a specific table.
    You can use the SHOW BACKUP SETS command to locate the right backup set.

`backup_id`
:   Specifies the identifier of a specific backup within that backup set.
    You can use the SHOW BACKUPS IN BACKUP SET command to locate the right identifier within the backup
    set, based on the creation date and time for the backup.

## FROM ARCHIVE OF parameters

`source_table`
:   Specifies the table whose rows have been archived by a
    [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md). This is the table from which
    archived data is retrieved.

`[ AS ] alias_name`
:   Specifies an optional alias name for the source table reference. Use this alias when referencing columns
    in the WHERE clause.

    Alias names must follow the rules for [Object identifiers](../identifiers.md).

`WHERE expression`
:   Specifies a required condition that filters the archived rows to retrieve. The expression can reference columns
    from the source table (using the alias if specified).

    For more information about WHERE expressions, see [WHERE](../constructs/where.md).

## Optional parameters

`{ [ { LOCAL | GLOBAL } ] TEMP [ READ ONLY] |` . `TEMPORARY [ READ ONLY] |` . `VOLATILE |` . `TRANSIENT }`
:   Specifies that the table persists only for the duration of the [session](../../user-guide/session-policies.md) that you created it in. A
    temporary table and all its contents are dropped at the end of the session.

    The synonyms and abbreviations for `TEMPORARY` (e.g. `GLOBAL TEMPORARY`) are provided for compatibility with other databases
    (e.g. to prevent errors when migrating CREATE TABLE statements). Tables created with any of these keywords appear and behave identically
    to a table created with the `TEMPORARY` keyword.

    Default: No value. If a table is not declared as `TEMPORARY` or `TRANSIENT`, the table is permanent.

    If you want to avoid unexpected conflicts, avoid naming temporary tables after tables that already exist in the schema.

    If you created a temporary table with the same name as another table in the schema, all queries and operations used on the table only
    affect the temporary table in the session, until you drop the temporary table. If you drop the table, you drop the temporary table, and
    not the table that already exists in the schema.

    For information about temporary or transient tables, and how they can affect storage and cost, refer to the following resources:

    * [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md)
    * [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md)

    `READ ONLY`
    :   Specifies that the table is read-only. READ ONLY is valid only for a temporary table that is being created with the
        CREATE TABLE … CLONE variant of the CREATE TABLE command.

        A read-only table does not allow DML operations and only allows the following subset of DDL operations:

        * ALTER TABLE … { ALTER | MODIFY } COLUMN … { SET | UNSET } COMMENT
        * ALTER TABLE … { ALTER | MODIFY } COLUMN … { SET | UNSET } MASKING POLICY
        * ALTER TABLE … { ALTER | MODIFY } COLUMN … { SET | UNSET } TAG
        * ALTER TABLE … RENAME COLUMN … TO
        * ALTER TABLE … RENAME TO
        * ALTER TABLE … { SET | UNSET } COMMENT
        * ALTER TABLE … { SET | UNSET } TAG
        * COMMENT
        * DESCRIBE
        * DROP
        * SHOW
        * UNDROP

        Read-only tables have a `METADATA$ROW_POSITION` column. This metadata column assigns a row number to each row in
        the table that is continuous and starts from 0. The row number assigned to each row remains unchanged until the
        read-only table is dropped.

`TRANSIENT`
:   Specifies that the table is transient.

    Like a permanent table, a transient table exists until explicitly dropped and is visible to any
    user with the appropriate privileges. However, transient tables have a lower level of data protection than permanent tables, meaning
    that data in a transient table might be lost in the event of a system failure. As such, transient tables should only be used for data
    that can be recreated externally to Snowflake.

    Default: No value. If a table is not declared as `TRANSIENT` or `TEMPORARY`, the table is permanent.

    > **Note:**
    >
    > Transient tables have some storage considerations.
    >
    > For more information about these and other considerations when deciding whether to create temporary or transient tables, see
    > [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md) and [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md).

`CONSTRAINT ...`
:   Defines an inline or out-of-line constraint for the specified column(s) in the table.

    For syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](create-table-constraint.md). For more information about constraints, see [Constraints](../constraints.md).

`COLLATE 'collation_specification'`
:   Specifies the collation to use for column operations such as string comparisons. This parameter applies only to
    [text columns](../data-types-text.md): VARCHAR, STRING, TEXT, and so on. For more information,
    see [Collation specifications](../collation.md).

`DEFAULT ...` or . `AUTOINCREMENT ...`
:   Specifies whether a default value is automatically inserted in the column if a value is not explicitly specified via an INSERT
    or CREATE TABLE AS SELECT statement:

    > `DEFAULT expr`
    > :   Column default value is defined by the specified expression which can be any of the following:
    >
    >     * Constant value.
    >     * [Sequence reference](../../user-guide/querying-sequences.md) (`seq_name.NEXTVAL`).
    >     * Simple expression that returns a scalar value.
    >
    >       The simple expression can include a SQL UDF (user-defined function) if the UDF is not a
    >       [secure UDF](../../developer-guide/secure-udf-procedure.md).
    >
    >       > **Note:**
    >       >
    >       > If a default expression refers to a SQL UDF, then the function is replaced by its
    >       > definition at table creation time. If the user-defined function is redefined in the future, this does not
    >       > update the column’s default expression.
    >
    >       The simple expression cannot contain references to:
    >
    >       > + Subqueries.
    >       > + Aggregates.
    >       > + Window functions.
    >       > + Secure UDFs.
    >       > + UDFs written in languages other than SQL (e.g. Java, JavaScript).
    >       > + External functions.
    >
    > `{ AUTOINCREMENT | IDENTITY }` . `[ { ( start_num , step_num ) | START num INCREMENT num } ]` . `[ { ORDER | NOORDER } ]`
    > :   When you specify AUTOINCREMENT or IDENTITY, the default value for the column starts with a specified number and each
    >     successive value automatically increments by the specified amount.
    >
    >     AUTOINCREMENT and IDENTITY are synonymous and can be used only for columns with numeric data types, such as NUMBER, INT,
    >     FLOAT.
    >
    >     > **Caution:**
    >     >
    >     > Snowflake uses a sequence to generate the values for an auto-incremented column. Sequences have limitations;
    >     > see [Sequence Semantics](../../user-guide/querying-sequences.md).
    >
    >     The default value for both the start value and the step/increment value is `1`.
    >
    >     > **Note:**
    >     >
    >     > Manually inserting values into an AUTOINCREMENT or IDENTITY column can result in duplicate values. If you manually insert the
    >     > value `5` into an AUTOINCREMENT or IDENTITY column, a subsequently inserted row might use the same value `5` as the
    >     > default value for the column.
    >
    >     Use ORDER or NOORDER to specify whether or not the values are generated for the auto-incremented column in
    >     [increasing or decreasing order](../../user-guide/querying-sequences.md).
    >
    >     * ORDER specifies that the values generated for a sequence or auto-incremented column are in increasing order (or, if the interval
    >       is a negative value, in decreasing order).
    >
    >       For example, if a sequence or auto-incremented column has `START 1 INCREMENT 2`, the generated values might be
    >       `1`, `3`, `5`, `7`, `9`, etc.
    >     * NOORDER specifies that the values are not guaranteed to be in increasing order.
    >
    >       For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.
    >
    >       NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
    >       clients are executing multiple INSERT statements).
    >
    >     If you do not specify ORDER or NOORDER, the [NOORDER_SEQUENCE_AS_DEFAULT](../parameters.md) parameter determines which property is
    >     set.

    > **Note:**
    >
    > DEFAULT and AUTOINCREMENT are mutually exclusive; only one can be specified for a column.

`MASKING POLICY = policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`PROJECTION POLICY policy_name`
:   Specifies the [projection policy](../../user-guide/projection-policies.md) to set on a column.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`COMMENT 'string_literal'`
:   Specifies a comment for the column.

    (Note that comments can be specified at the column level or the table level. The syntax for each is slightly different.)

`USING ( col_name , cond_col_1 ... )`
:   Specifies the arguments to pass into the conditional masking policy SQL expression.

    The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
    column to which the masking policy is set.

    The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query result
    when a query is made on the first column.

    If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
    [masking policy](../../user-guide/security-column-intro.md).

`CLUSTER BY ( expr [ , expr , ... ] )`
:   Specifies one or more columns or column expressions in the table as the clustering key. For more details, see
    [Clustering Keys & Clustered Tables](../../user-guide/tables-clustering-keys.md).

    Default: No value (no clustering key is defined for the table)

    > **Important:**
    >
    > Clustering keys are not intended or recommended for all tables; they typically benefit very large (i.e. multi-terabyte)
    > tables.
    >
    > Before you specify a clustering key for a table, you should understand micro-partitions. For more information, see [Understanding Snowflake Table Structures](../../user-guide/tables-micro-partitions.md).

`ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE }`
:   Enables or disables automatic changes to the table schema from data loaded into the table from source files, including:

    > * Added columns.
    >
    >   By default, schema evolution is limited to a maximum of 100 added columns per load operation. To request more than 100 added columns per load operation, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
    > * The NOT NULL constraint can be dropped from any number of columns missing in new data files.

    Setting it to `TRUE` enables automatic table schema evolution. The default `FALSE` disables automatic table schema evolution.

    > **Note:**
    >
    > Loading data from files evolves the table columns when all of the following are true:
    >
    > * The [COPY INTO <table>](copy-into-table.md) statement includes the `MATCH_BY_COLUMN_NAME` option.
    > * The role used to load the data has the EVOLVE SCHEMA or OWNERSHIP privilege on the table.
    >
    > Additionally, for schema evolution with CSV, when used with `MATCH_BY_COLUMN_NAME` and `PARSE_HEADER`, `ERROR_ON_COLUMN_COUNT_MISMATCH` must be set to false.

`DATA_RETENTION_TIME_IN_DAYS = integer`
:   Specifies the retention period for the table so that Time Travel actions (SELECT, CLONE, UNDROP) can be performed on historical
    data in the table. For more details, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md) and [Working with Temporary and Transient Tables](../../user-guide/tables-temp-transient.md).

    For a detailed description of this object-level parameter, as well as more information about object parameters, see
    [Parameters](../parameters.md).

    Values:

    > * Standard Edition: `0` or `1`
    > * Enterprise Edition:
    >
    >   + `0` to `90` for permanent tables
    >   + `0` or `1` for temporary and transient tables

    Default:

    > * Standard Edition: `1`
    > * Enterprise Edition (or higher): `1` (unless a different default value was specified at the schema, database, or account level)

    > **Note:**
    >
    > A value of `0` effectively disables Time Travel for the table.

`MAX_DATA_EXTENSION_TIME_IN_DAYS = integer`
:   Object parameter that specifies the maximum number of days for which Snowflake can extend the data retention period for the table to
    prevent streams on the table from becoming stale.

    For a detailed description of this parameter, see [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md).

`CHANGE_TRACKING = { TRUE | FALSE }`
:   Specifies whether to enable change tracking on the table.

    * `TRUE` enables change tracking on the table. This setting adds a pair of hidden columns to the source table and begins
      storing change-tracking metadata in the columns. These columns consume a small amount of storage.

      The change-tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for
      [SELECT](select.md) statements, or by creating and querying one or more streams on the table.
    * `FALSE` does not enable change tracking on the table.

    Default: FALSE

`DEFAULT_DDL_COLLATION = 'collation_specification'`
:   Specifies a default [collation specification](../collation.md) for the columns in the table, including columns
    added to the table in the future.

    For more details about the parameter, see [DEFAULT_DDL_COLLATION](../parameters.md).

`COPY GRANTS`
:   Specifies to retain the access privileges from the original table when a new table is created using any of the following
    CREATE TABLE variants:

    > * CREATE OR REPLACE TABLE
    > * CREATE TABLE … LIKE
    > * CREATE TABLE … CLONE

    The parameter copies all privileges, except OWNERSHIP, from the existing table to the new table. The new table does not
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE TABLE statement
    owns the new table.

    If the parameter is not included in the CREATE TABLE statement, then the new table does not inherit any explicit access
    privileges granted on the original table, but does inherit any future grants defined for the object type in the schema.

    Note:

    > * With [data sharing](../../guides-overview-sharing.md):
    >
    >   > + If the existing table was shared to another account, the replacement table is also shared.
    >   > + If the existing table was shared with your account as a data consumer, and access was further granted to other roles in
    >   >   the account (using `GRANT IMPORTED PRIVILEGES` on the parent database), access is also granted to the replacement
    >   >   table.
    > * The [SHOW GRANTS](show-grants.md) output for the replacement table lists the grantee for the copied privileges as the
    >   role that executed the CREATE TABLE statement, with the current timestamp when the statement was executed.
    > * The operation to copy grants occurs atomically in the CREATE TABLE command (i.e. within the same transaction).
    > * This parameter is not supported by the CREATE OR ALTER variant syntax.

`ERROR_LOGGING = { TRUE | FALSE }`
:   Specifies whether to turn on DML error logging for the table.

    * `TRUE` turns on DML error logging for the table.
    * `FALSE` turns off DML error logging for the table.

    For more information, see [DML error logging](../../user-guide/data-load-overview.md).

    > **Note:**
    >
    > If the [OPT_OUT_ERROR_LOGGING](../parameters.md) parameter is set to `TRUE` for a session,
    > DML error logging isn’t turned on, regardless of whether it is turned on for specific tables.

`COPY TAGS`
:   Applies [tags](../../user-guide/object-tagging/introduction.md) when you use any of these CREATE TABLE forms:

    > * CREATE OR REPLACE TABLE
    > * CREATE OR REPLACE TABLE … LIKE
    > * CREATE OR REPLACE TABLE … CLONE

    If the statement uses CREATE OR REPLACE TABLE … COPY TAGS without LIKE, CLONE, or a WITH TAG clause, tags from the replaced table
    and its columns are retained on the new table.

    If the statement uses LIKE, CLONE, or WITH TAG together with COPY TAGS, Snowflake combines tags from the applicable sources. If both
    sources set the same tag, the value from the replaced table (carried over by COPY TAGS) takes precedence.

    For more information, including the effect when you alter columns in the CREATE OR REPLACE statement, see
    the usage notes for COPY TAGS.

`COMMENT = 'string_literal'`
:   Specifies a comment for the table.

    Default: No value

    (Note that comments can be specified at the column level, constraint level, or table level. The syntax for each is slightly different.)

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a table.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`AGGREGATION POLICY policy_name [ ENTITY KEY ( col_name [ , col_name ... ] ) ]`
:   Specifies an [aggregation policy](../../user-guide/aggregation-policies.md) to set on a table. You can apply one or more aggregation
    policies on a table.

    Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the table. For more information, see
    [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md). You can specify one or more entity keys for an aggregation policy.

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`JOIN POLICY policy_name [ ALLOWED JOIN KEYS ( col_name [ , ... ] ) ]`
:   Specifies the [join policy](../../user-guide/join-policies.md) to set on a table.

    Use the optional ALLOWED JOIN KEYS parameter to define which columns are allowed to be used as joining columns when
    this policy is in effect. For more information, see [Join policies](../../user-guide/join-policies.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`STORAGE LIFECYCLE POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md) to attach to the table.

    The columns specified in the ON clause must match the argument count and data types defined in the policy function signature.
    Snowflake uses these columns to evaluate the policy expression and determine which rows to archive or expire.

    > **Important:**
    >
    > If you attach an archival storage policy to a table, the table is permanently assigned to the specified archive tier for its lifetime. You can’t change the archive tier by applying a new policy. For example, you can’t specify a policy created with a COOL archive tier in ALTER TABLE…DROP STORAGE LIFECYCLE POLICY and then subsequently alter the table to add a policy created with a COLD archive tier. To alter the archive tier for a table, contact Snowflake Support to request deletion of the currently archived data. For additional considerations, see [Archival storage policies](../../user-guide/storage-management/storage-lifecycle-policies.md).

    For more information about creating and managing storage lifecycle policies, see
    [Create and manage storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-create-manage.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`ROW_TIMESTAMP = { TRUE | FALSE }`
:   Specifies whether to enable row timestamps on the table. You must use a role with the OWNERSHIP privilege.

    For more information, see [Use row timestamps to measure latency in your pipelines](../../user-guide/data-engineering/row-timestamps.md).

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE TABLE | Schema | Note that creating a temporary table does not require the CREATE TABLE privilege. |
| SELECT | Table, external table, view | Required on queried tables and/or views only when cloning a table or executing CTAS statements. |
| APPLY | Masking policy, row access policy, tag, storage lifecycle policy | Required only when applying a masking policy, row access policy, object tags, storage lifecycle policy, or any combination of these [governance](../../guides-overview-govern.md) features when creating tables. |
| USAGE (external stage) or READ (internal stage) | Stage | Required to derive table column definitions from staged files using CREATE TABLE … USING TEMPLATE statements. |
| OWNERSHIP | Table | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER TABLE statement for an *existing* table.   OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege).  Note that in a [managed access schema](../../user-guide/security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A schema cannot contain tables and/or views with the same name. When creating a table:

  > + If a view with the same name already exists in the schema, an error is returned and the table is not created.
  > + If a table with the same name already exists in the schema, an error is returned and the table is not created, unless the
  >   optional `OR REPLACE` keyword is included in the command.
  >
  >   > **Important:**
  >   >
  >   > Using `OR REPLACE` is the equivalent of using [DROP TABLE](drop-table.md) on the existing table and then creating a new table
  >   > with the same name; however, the dropped table is not permanently removed from the system. Instead, it is retained in
  >   > Time Travel. This is important to note because dropped tables in Time Travel can be recovered, but they also contribute to data
  >   > storage for your account. For more information, see [Storage costs for Time Travel and Fail-safe](../../user-guide/data-cdp-storage-costs.md).
  >   >
  >   > CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
  >   >
  >   > This means that any queries concurrent with the CREATE OR REPLACE TABLE operation use either the old or new table version.
  >   >
  >   > Recreating or swapping a table drops its change data. Any stream on the table becomes [stale](../../user-guide/streams-intro.md). In
  >   > addition, any stream on a view that has this table as an underlying table, becomes stale. A stale stream is unreadable.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* Similar to [reserved keywords](../reserved-keywords.md), ANSI-reserved function names
  ([CURRENT_DATE](../functions/current_date.md), [CURRENT_TIMESTAMP](../functions/current_timestamp.md), etc.) cannot be used as column
  names.
* CREATE OR ALTER TABLE:

  For more information, see CREATE OR ALTER TABLE usage notes.
* CREATE TABLE … CLONE:

  If the source table has clustering keys, then the new table has clustering keys. By default, Automatic Clustering is suspended
  for the new table – even if Automatic Clustering was not suspended for the source table.

* CREATE TABLE … FROM ARCHIVE OF:

  + Using this command requires the OWNERSHIP privilege on the source table.
  + Specifying column definitions, policies, tags, or other constraints isn’t supported. Snowflake automatically retrieves
    the table schema, policies, tags, and constraints from the source table.
  + The WHERE clause is required. Reading archived data is expensive, and should be performed infrequently.
    Filtering results using the WHERE clause helps you minimize costs by ensuring that Snowflake reads only the data that you
    require from archival storage.
  + To estimate the number of files that Snowflake will retrieve from archive storage, run the [EXPLAIN](explain.md) command before
    this operation. The output includes a `createTableFromArchiveData` operation and displays `ARCHIVE OF <table>` in
    the `objects` column for the TableScan operation. For more information, see [Estimate retrieval costs with EXPLAIN](../../user-guide/storage-management/storage-lifecycle-policies-retrieving-archived-data.md).
  + To see a history of data retrieval from archive storage, use the [ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY view](../account-usage/archive_storage_data_retrieval_usage_history.md).
  + To retrieve data from the COLD tier of archive storage, Snowflake must first restore the files from external cloud storage. This process
    can take up to 48 hours.

    To support this process, set the following parameters appropriately:

    - [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md) must be at least 48 hours.
    - [ABORT_DETACHED_QUERY](../parameters.md) must be FALSE.

    COLD storage tier restore operations support a maximum of 1 million files per restore operation.
  + If you cancel a CREATE TABLE operation that retrieves data from archive storage, you might still incur retrieval costs.
* CREATE TABLE … CHANGE_TRACKING = TRUE:

  When change tracking is enabled, the table is locked for the duration of the operation.
  Locks can cause latency with some associated DDL/DML operations.
  For more information, refer to [Resource locking](../transactions.md).
* CREATE TABLE … LIKE:

  If the source table has clustering keys, then the new table has clustering keys. By default, Automatic Clustering is not
  suspended for the new table – even if Automatic Clustering was suspended for the source table.
* CREATE TABLE … AS SELECT (CTAS):

  + If the aliases for the column names in the [SELECT](select.md) list are valid columns, then the column definitions
    are not required in the CTAS statement; if omitted, the column names and types are inferred from the underlying query:

    > ```sqlsyntax
    > CREATE TABLE <table_name> AS SELECT ...
    > ```

    Alternatively, the names can be explicitly specified using the following syntax:

    > ```sqlsyntax
    > CREATE TABLE <table_name> ( <col1_name> , <col2_name> , ... ) AS SELECT ...
    > ```

    The number of column names specified must match the number of [SELECT](select.md) list items in the query; the
    types of the columns are inferred from the types produced by the query.
  + When clustering keys are specified in a CTAS statement:

    - Column definitions are required and must be explicitly specified in the statement.
    - By default, [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) is enabled for the new table even if Automatic Clustering is
      suspended for the source table.
    - The data is clustered when the new table is created. A clustered table generates a query plan
      that includes a sort operation and takes longer to create than an equivalent table that is not clustered. For example, the
      second of these commands is likely to take longer than the first:

      ```sqlexample
      CREATE TABLE ctas_large_table
        AS SELECT * FROM large_table;

      CREATE TABLE ctas_clustered_large_table CLUSTER BY (timestamp)
        AS SELECT * FROM large_table;
      ```

      Alternatively, you can create a table with rows in sorted order by using an ORDER BY clause in the CTAS query.

* CREATE OR REPLACE … COPY TAGS

  + You don’t need privileges on the tags to use COPY TAGS.
  + You can use COPY TAGS with a CREATE OR REPLACE … CLONE statement, a CREATE OR REPLACE … LIKE statement,
    or a WITH TAG clause. Tags from both sources are combined. If both sources set the same tag, the value from
    the replaced table (carried over by COPY TAGS) takes precedence.
  + If you rename a tagged column in the statement, the column in the new table will not retain the tag.
  + If you change the data type of a tagged column — for example, changing `NUMBER(8)` to `NUMBER(16)` — the column in the new table
    will not retain the tag.
  + If you swap column names in the statement, the tag stays with the column based on its name. For example, suppose only column `a` has a
    tag and you run the following command to swap the names of columns `a` and `b`:

    ```sqlexample
    CREATE OR REPLACE TABLE dst1 COPY TAGS AS SELECT b AS a, a AS b FROM src1
    ```

    Only column `a` is still tagged in the new table, although it contains the data from column `b` in the source table.
* Inside a transaction, any DDL statement (including CREATE TEMPORARY/TRANSIENT TABLE) commits
  the transaction before executing the DDL statement itself. The DDL statement then runs in its own transaction. The
  next statement after the DDL statement starts a new transaction. Therefore, you can’t create, use, and drop a
  temporary or transient table within a single transaction. If you want to use a temporary or transient table inside a
  transaction, then create the table before the transaction, and drop the table after the transaction.
* Recreating a table (using the optional `OR REPLACE` keyword) drops its history, which makes any stream on the table stale.
  A stale stream is unreadable.
* A single masking policy that uses conditional columns can be applied to multiple tables provided that the column structure of the table
  matches the columns specified in the policy.
* When creating a table with a masking policy on one or more table columns, or a row access policy added to the table, use the
  [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the table
  protected by a row access policy.

* For creating a table with the WITH STORAGE LIFECYCLE POLICY clause:

  + You must have the necessary privileges to apply the policy. For information about required privileges, see
    [Storage lifecycle policy privileges](../../user-guide/security-access-control-privileges.md).
  + A table can have only one attached storage lifecycle policy.
  + The number of columns must match the argument count in the policy function signature, and the column data must be compatible with the argument types.
  + Associated policies aren’t affected if you rename table columns. Snowflake associates policies to tables by using the column IDs.
  + In order to evaluate and apply storage lifecycle policy expressions, Snowflake internally and temporarily bypasses any governance policies on a table.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER TABLE usage notes

* **Limitations**

  + Currently only supports permanent, temporary, and transient tables. Read-only, external, dynamic, Apache Iceberg™, and hybrid tables
    are *not* supported.
  + All limitations of the [ALTER TABLE](alter-table.md) command apply.
  + Currently does *not* support the following:

    - CREATE TABLE … AS SELECT (CTAS) variant syntax.
    - CREATE TABLE … USING TEMPLATE variant syntax.
    - CREATE TABLE … LIKE variant syntax.
    - CREATE TABLE … CLONE variant syntax.
* **Table parameters and properties**

  + The absence of a property or parameter that was previously set in the modified table definition results in unsetting it.
  + Unsetting an explicit [parameter](../parameters.md) value results in setting it to the default parameter value.
    If the parameter is set on the schema or database that contain the table, the table inherits the parameter value set on
    the schema or database.
* **Data governance**

  + Setting or unsetting a tag or policy on a table or column using a CREATE OR ALTER TABLE statement is not supported.

    Existing policies or tags are *not* altered by a CREATE OR ALTER statement and remain unchanged.
* **Constraints**

  > Setting or unsetting an inline primary key changes the nullability of the column accordingly. This aligns with the behavior of
  > the CREATE TABLE command, but is different from the behavior of the ALTER TABLE command.
* **Columns**

  + New columns can only be added to the end of the column list.
  + Columns cannot be renamed. If you attempt to rename a column, the column is dropped and a new column is added.
  + The default value for a column can only be modified to use a sequence.
  + The default sequence for a column (for example, `SET DEFAULT seq_name.NEXTVAL`) can only be changed if the column
    already has a sequence.
  + For more information about modifying columns, see [ALTER TABLE … ALTER COLUMN](alter-table-column.md).
* **Collation**

  + Collation specifications cannot be altered.
  + Setting the [DEFAULT_DDL_COLLATION](../parameters.md) parameter in the CREATE OR ALTER TABLE command
    sets the default collation specification for existing columns, which ensures the CREATE OR ALTER TABLE command
    yields the same results as the CREATE TABLE command. Therefore, you can’t use the CREATE OR ALTER TABLE command to set the
    DEFAULT_DDL_COLLATION parameter on a table that has existing text columns. You can, however, make collations explicit
    for existing columns when changing the DEFAULT_DDL_COLLATION parameter for a table.

    For example, create a new table `my_table` and set the default collation specification for the table to ‘fr’:

    ```sqlexample
    CREATE OR ALTER TABLE my_table (
      a INT PRIMARY KEY,
      b VARCHAR(20)
    )
    DEFAULT_DDL_COLLATION = 'fr';
    ```

    The collation specification for column `b` is ‘fr’ and cannot be changed. To change the default collation specification for
    table `my_table`, you must explicitly set the collation for text column `b` in the CREATE OR ALTER statement:

    ```sqlexample
    CREATE OR ALTER TABLE my_table (
      a INT PRIMARY KEY,
      b VARCHAR(200) COLLATE 'fr'
    )
    DEFAULT_DDL_COLLATION = 'de';
    ```

* **Atomicity**

  The CREATE OR ALTER TABLE command currently does not guarantee atomicity. This means that if a CREATE OR ALTER TABLE statement
  fails during execution, it is possible that a subset of changes might have been applied to the table. If there is a possibility
  of partial changes, the error message, in most cases, includes the following text:

  ```output
  CREATE OR ALTER execution failed. Partial updates may have been applied.
  ```

  For example, if the statement is attempting to drop column `A` and add a new column `B` to a table, and the
  statement is aborted, it is possible that column `A` was dropped but column `B` was not added.

  > **Note:**
  >
  > If changes are partially applied, the resulting table is still in a valid state, and you can use additional ALTER TABLE
  > statements to complete the original set of changes.

  To recover from partial updates, Snowflake recommends the following recovery mechanisms:

  + Fix forward

    - Re-execute the CREATE OR ALTER TABLE statement. If the statements succeeds on the second attempt, the target
      state is achieved.
    - Investigate the error message. If possible, fix the error and re-execute the CREATE OR ALTER TABLE statement.
  + Roll back

    If it is not possible to fix forward, Snowflake recommends manually rolling back partial changes:

    - Investigate the state of the table using the [DESCRIBE TABLE](desc-table.md) and [SHOW TABLES](show-tables.md) commands. Determine which partial
      changes were applied, if any.
    - If any partial changes were applied, execute the appropriate ALTER TABLE statements to transform the table back to its
      original state.

      > **Note:**
      >
      > In some cases, you might not be able to undo partial changes. For more information, see the supported and unsupported
      > actions for modifying column properties in the [ALTER TABLE … ALTER COLUMN](alter-table-column.md) topic.
  + If you need help recovering from a partial update, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Examples

### Basic examples

Create a simple table in the current database and insert a row in the table:

> ```sqlexample
> CREATE TABLE mytable (amount NUMBER);
>
> +-------------------------------------+
> | status                              |
> |-------------------------------------|
> | Table MYTABLE successfully created. |
> +-------------------------------------+
>
> INSERT INTO mytable VALUES(1);
>
> SHOW TABLES like 'mytable';
>
> +---------------------------------+---------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
> | created_on                      | name    | database_name | schema_name | kind  | comment | cluster_by | rows | bytes | owner        | retention_time |
> |---------------------------------+---------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------|
> | Mon, 11 Sep 2017 16:32:28 -0700 | MYTABLE | TESTDB        | PUBLIC      | TABLE |         |            |    1 |  1024 | ACCOUNTADMIN | 1              |
> +---------------------------------+---------+---------------+-------------+-------+---------+------------+------+-------+--------------+----------------+
>
> DESC TABLE mytable;
>
> +--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name   | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
> |--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | AMOUNT | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> ```

Create a simple table and specify comments for both the table and the column in the table:

> ```sqlexample
> CREATE TABLE example (col1 NUMBER COMMENT 'a column comment') COMMENT='a table comment';
>
> +-------------------------------------+
> | status                              |
> |-------------------------------------|
> | Table EXAMPLE successfully created. |
> +-------------------------------------+
>
> SHOW TABLES LIKE 'example';
>
> +---------------------------------+---------+---------------+-------------+-------+-----------------+------------+------+-------+--------------+----------------+
> | created_on                      | name    | database_name | schema_name | kind  | comment         | cluster_by | rows | bytes | owner        | retention_time |
> |---------------------------------+---------+---------------+-------------+-------+-----------------+------------+------+-------+--------------+----------------|
> | Mon, 11 Sep 2017 16:35:59 -0700 | EXAMPLE | TESTDB        | PUBLIC      | TABLE | a table comment |            |    0 |     0 | ACCOUNTADMIN | 1              |
> +---------------------------------+---------+---------------+-------------+-------+-----------------+------------+------+-------+--------------+----------------+
>
> DESC TABLE example;
>
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+------------------+
> | name | type         | kind   | null? | default | primary key | unique key | check | expression | comment          |
> |------+--------------+--------+-------+---------+-------------+------------+-------+------------+------------------|
> | COL1 | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | a column comment |
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+------------------+
> ```

### CTAS examples

Create a table by selecting from an existing table:

> ```sqlexample
> CREATE TABLE mytable_copy (b) AS SELECT * FROM mytable;
>
> DESC TABLE mytable_copy;
>
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
> |------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | B    | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
>
> CREATE TABLE mytable_copy2 AS SELECT b+1 AS c FROM mytable_copy;
>
> DESC TABLE mytable_copy2;
>
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
> |------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | C    | NUMBER(39,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
>
> SELECT * FROM mytable_copy2;
>
> +---+
> | C |
> |---|
> | 2 |
> +---+
> ```

More advanced example of creating a table by selecting from an existing table; in this example, the values in the `summary_amount`
column in the new table are derived from two columns in the source table:

> ```sqlexample
> CREATE TABLE testtable_summary (name, summary_amount) AS SELECT name, amount1 + amount2 FROM testtable;
> ```

Create a table by selecting columns from a staged Parquet data file:

> ```sqlexample
> CREATE OR REPLACE TABLE parquet_col (
>   custKey NUMBER DEFAULT NULL,
>   orderDate DATE DEFAULT NULL,
>   orderStatus VARCHAR(100) DEFAULT NULL,
>   price VARCHAR(255)
> )
> AS SELECT
>   $1:o_custkey::number,
>   $1:o_orderdate::date,
>   $1:o_orderstatus::text,
>   $1:o_totalprice::text
> FROM @my_stage;
>
> +-----------------------------------------+
> | status                                  |
> |-----------------------------------------|
> | Table PARQUET_COL successfully created. |
> +-----------------------------------------+
>
> DESC TABLE parquet_col;
>
> +-------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name        | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
> |-------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | CUSTKEY     | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | ORDERDATE   | DATE         | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | ORDERSTATUS | VARCHAR(100) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | PRICE       | VARCHAR(255) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +-------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> ```

### CREATE TABLE … LIKE examples

Create a table with the same column definitions as another table, but with no rows:

> ```sqlexample
> CREATE TABLE mytable (amount NUMBER);
>
> INSERT INTO mytable VALUES(1);
>
> SELECT * FROM mytable;
>
> +--------+
> | AMOUNT |
> |--------|
> |      1 |
> +--------+
>
> CREATE TABLE mytable_2 LIKE mytable;
>
> DESC TABLE mytable_2;
>
> +--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name   | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
> |--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | AMOUNT | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +--------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
>
> SELECT * FROM mytable_2;
>
> +--------+
> | AMOUNT |
> |--------|
> +--------+
> ```

### CREATE TABLE examples that set parameters and properties

Create a table with a multi-column clustering key:

> ```sqlexample
> CREATE TABLE mytable (date TIMESTAMP_NTZ, id NUMBER, content VARIANT) CLUSTER BY (date, id);
>
> SHOW TABLES LIKE 'mytable';
>
> +---------------------------------+---------+---------------+-------------+-------+---------+------------------+------+-------+--------------+----------------+
> | created_on                      | name    | database_name | schema_name | kind  | comment | cluster_by       | rows | bytes | owner        | retention_time |
> |---------------------------------+---------+---------------+-------------+-------+---------+------------------+------+-------+--------------+----------------|
> | Mon, 11 Sep 2017 16:20:41 -0700 | MYTABLE | TESTDB        | PUBLIC      | TABLE |         | LINEAR(DATE, ID) |    0 |     0 | ACCOUNTADMIN | 1              |
> +---------------------------------+---------+---------------+-------------+-------+---------+------------------+------+-------+--------------+----------------+
> ```

Specify collation for columns in a table:

> ```sqlexample
> CREATE OR REPLACE TABLE collation_demo (
>   uncollated_phrase VARCHAR,
>   utf8_phrase VARCHAR COLLATE 'utf8',
>   english_phrase VARCHAR COLLATE 'en',
>   spanish_phrase VARCHAR COLLATE 'es');
>
> INSERT INTO collation_demo (
>       uncollated_phrase,
>       utf8_phrase,
>       english_phrase,
>       spanish_phrase)
>    VALUES (
>      'pinata',
>      'pinata',
>      'pinata',
>      'piñata');
> ```

### CREATE TABLE … USING TEMPLATE examples

Create a table where the column definitions are derived from a set of staged files that contain Avro, Parquet, or ORC data.

Note that the `mystage` stage and `my_parquet_format` file format referenced in the statement must already exist. A set of files
must already be staged in the cloud storage location referenced in the stage definition.

The following example creates a table using the detected schema from staged files and sorts the columns by `order_id`.
It builds on an example in the [INFER_SCHEMA](../functions/infer_schema.md) topic.

> ```sqlexample
> CREATE TABLE mytable
>   USING TEMPLATE (
>     SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
>     WITHIN GROUP (ORDER BY order_id)
>       FROM TABLE(
>         INFER_SCHEMA(
>           LOCATION=>'@mystage',
>           FILE_FORMAT=>'my_parquet_format'
>         )
>       ));
> ```

Note that sorting the columns by `order_id` only applies if all staged files share a single schema. If the set of staged data
files includes multiple schemas with shared column names, the order represented in the `order_id` column might not match any
single file.

> **Note:**
>
> Using `*` for `ARRAY_AGG(OBJECT_CONSTRUCT())` might result in an error if the returned result is larger than 128 MB.
> We recommend that you avoid using `*` for larger result sets, and only use the required columns, `COLUMN NAME`,
> `TYPE`, and `NULLABLE`, for the query. Optional column `ORDER_ID` can be included when using
> `WITHIN GROUP (ORDER BY order_id)`.

### Temporary table examples

Create a temporary table that is dropped automatically at the end of the session:

> ```sqlexample
> CREATE TEMPORARY TABLE demo_temporary (i INTEGER);
> CREATE TEMP TABLE demo_temp (i INTEGER);
> ```

For compatibility with other vendors, Snowflake also supports using the keywords below as synonyms for TEMPORARY:

> ```sqlexample
> CREATE LOCAL TEMPORARY TABLE demo_local_temporary (i INTEGER);
> CREATE LOCAL TEMP TABLE demo_local_temp (i INTEGER);
>
> CREATE GLOBAL TEMPORARY TABLE demo_global_temporary (i INTEGER);
> CREATE GLOBAL TEMP TABLE demo_global_temp (i INTEGER);
>
> CREATE VOLATILE TABLE demo_volatile (i INTEGER);
> ```

### CREATE OR ALTER TABLE examples

Create a table `my_table` using the CREATE OR ALTER TABLE command:

```sqlexample
CREATE OR ALTER TABLE my_table(a INT);
```

> **Note:**
>
> CREATE OR ALTER TABLE statements for existing tables can only be executed by a role
> with the OWNERSHIP privilege on table `my_table`.

Alter table `my_table` to add and modify columns and set the DATA_RETENTION_TIME_IN_DAYS and
DEFAULT_DDL_COLLATION parameters:

```sqlexample
CREATE OR ALTER TABLE my_table(
    a INT PRIMARY KEY,
    b VARCHAR(200)
  )
  DATA_RETENTION_TIME_IN_DAYS = 5
  DEFAULT_DDL_COLLATION = 'de';
```

Unset the DATA_RETENTION_TIME_IN_DAYS parameter. The absence of a parameter in the modified table definition results in unsetting it.
In this case, unsetting the DATA_RETENTION_TIME_IN_DAYS parameter for the table resets it to the default value of `1`:

```sqlexample
CREATE OR ALTER TABLE my_table(
    a INT PRIMARY KEY,
    c VARCHAR(200)
  )
  DEFAULT_DDL_COLLATION = 'de';
```

The CREATE OR ALTER TABLE command supports adding columns at the end of the column list. If you attempt to rename an existing column, the existing
column is dropped, and a new column with the new column name is added. This might result in data loss if data exists in the original column.

The following example illustrates this behavior.

1. Create a table:

   ```sqlexample
   CREATE OR ALTER TABLE my_table(
       a INT PRIMARY KEY,
       b INT
     );
   ```
2. Insert data into table `my_table`:

   ```sqlexample
   INSERT INTO my_table VALUES (1, 2), (2, 3);

   SELECT * FROM my_table;
   ```

   Returns:

   ```output
   +---+---+
   | A | B |
   |---+---|
   | 1 | 2 |
   | 2 | 3 |
   +---+---+
   ```
3. Attempt to rename column `b`:

   ```sqlexample
   CREATE OR ALTER TABLE my_table(
       a INT PRIMARY KEY,
       c INT
     );
   ```

   Column `b` is dropped and column `c` is added:

   ```sqlexample
   SELECT * FROM my_table;
   ```

   Returns:

   ```output
   +---+------+
   | A | C    |
   |---+------|
   | 1 | NULL |
   | 2 | NULL |
   +---+------+
   ```

   > **Note:**
   >
   > You can recover dropped columns using [Time Travel](../../user-guide/data-time-travel.md).

Setting or unsetting an inline primary key changes the nullability of the column in a way that aligns with the behavior of the
CREATE TABLE command, but is different from the behavior of the ALTER TABLE command. For example, adding a PRIMARY KEY constraint
on a column using an ALTER TABLE statement does not change column nullability.

The following example illustrates this behavior.

1. Create a table:

   ```sqlexample
   CREATE TABLE t(a INT);
   ```
2. Alter the table to add a PRIMARY KEY constraint:

   ```sqlexample
   CREATE OR ALTER TABLE t(a INT PRIMARY KEY);
   ```

   Column `a` is now the primary key and is set to NOT NULL:

   ```sqlexample
   DESC TABLE t;
   ```

   Returns:

   ```output
   +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
   | name | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
   |------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
   | A    | NUMBER(38,0) | COLUMN | N     | NULL    | Y           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
   +------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
   ```
3. Replace table `t`:

   ```sqlexample
   CREATE OR REPLACE TABLE t(a INT);
   ```
4. Insert a NULL value:

   ```sqlexample
   INSERT INTO t VALUES (null);
   ```
5. Add a PRIMARY KEY constraint to column `a`.

   The NULL value in column `a` causes the following statement to fail:

   ```sqlexample
   CREATE OR ALTER TABLE t(a INT PRIMARY KEY);
   ```

   Returns:

   ```output
   001471 (42601): SQL compilation error:
   Column 'A' contains null values. Not null constraint cannot be added.
   ```

---
title: CREATE TAG
source: https://docs.snowflake.com/en/sql-reference/sql/create-tag.md
section: SQL Commands
---

# CREATE TAG

Creates a new tag or replaces an existing tag in the system.

This command supports the following variants:

* CREATE OR ALTER TAG: Creates a tag if it doesn’t exist or alters an existing tag.

See also:
:   [ALTER TAG](alter-tag.md) , [SHOW TAGS](show-tags.md) , [DROP TAG](drop-tag.md) , [UNDROP TAG](undrop-tag.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] TAG [ IF NOT EXISTS ] <name>
    [ ALLOWED_VALUES '<val_1>' [ , '<val_2>' [ , ... ] ] ]
    [ PROPAGATE = { ON_DEPENDENCY_AND_DATA_MOVEMENT | ON_DEPENDENCY | ON_DATA_MOVEMENT }
      [ ON_CONFLICT = { '<string>' | ALLOWED_VALUES_SEQUENCE } ] ]
    [ COMMENT = '<string_literal>' ]
```

## Variant syntax

### CREATE OR ALTER TAG

Creates a new tag if it doesn’t already exist, or transforms an existing tag into the tag defined in the statement.
A CREATE OR ALTER TAG statement follows the syntax rules of a CREATE TAG statement and has the same limitations as an
[ALTER TAG](alter-tag.md) statement.

Supported alterations include changes to the ALLOWED_VALUES and COMMENT properties.

For more information, see CREATE OR ALTER TAG usage notes.

```sqlsyntax
CREATE OR ALTER TAG <name>
  [ ALLOWED_VALUES '<val_1>' [ , '<val_2>' [ , ... ] ] ]
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Identifier for the tag. Assign the tag string value on an [object](../../user-guide/object-tagging/introduction.md) using either a
    [CREATE <object>](create.md) statement or an [ALTER <object>](alter.md) statement.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. “My object”). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`ALLOWED_VALUES 'val_1' [ , 'val_2' [ , ... ] ]`
:   Specifies a comma-separated list of the possible string values that can be assigned to the tag when the tag is set on an
    [object](../../user-guide/object-tagging/introduction.md) using the corresponding [CREATE <object>](create.md) or
    [ALTER <object>](alter.md) command.

    Must come before all other parameters to work.

    The maximum number of tag values in this list is 5,000.

    If a tag is configured to automatically propagate to target objects, the order of values in the allowed list can affect how conflicts are
    resolved. For more information, see [Tag propagation conflicts](../../user-guide/object-tagging/propagation.md).

    Default: NULL (all string values are allowed, including an empty string value (that is, `' '`)).

`PROPAGATE = { ON_DEPENDENCY_AND_DATA_MOVEMENT | ON_DEPENDENCY | ON_DATA_MOVEMENT }`
:   [Enterprise Edition Feature](../../user-guide/intro-editions.md)

    This parameter requires Enterprise Edition or higher. To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    Specifies that the tag will be [automatically propagated](../../user-guide/object-tagging/propagation.md) from source objects to target
    objects. You can configure the tag to propagate when there is an [object dependency](../../user-guide/object-tagging/propagation.md),
    [data movement](../../user-guide/object-tagging/propagation.md), or both.

    `ON_DEPENDENCY_AND_DATA_MOVEMENT`
    :   Propagates the tag when there is an object dependency or data movement.

    `ON_DEPENDENCY`
    :   Propagates the tag for object dependencies, but not for data movement.

    `ON_DATA_MOVEMENT`
    :   Propagates the tag when there is data movement, but not for object dependencies.

`ON_CONFLICT = { 'string' | ALLOWED_VALUES_SEQUENCE }`
:   [Enterprise Edition Feature](../../user-guide/intro-editions.md)

    This parameter requires Enterprise Edition or higher. To inquire about upgrading, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

    Specifies what happens when there is a conflict between the values of [propagated tags](../../user-guide/object-tagging/propagation.md).

    If you don’t set this parameter and there is a conflict, the value of the tag is set to the string `CONFLICT`.

    Possible values are:

    `'string'`
    :   When there is a conflict, the value of the tag is set to the specified string.

    `ALLOWED_VALUES_SEQUENCE`
    :   The order of the values in the ALLOWED_VALUES property of the tag determines which value is used when there is a conflict. For example,
        suppose you created a tag with the following statement:

        ```sqlexample
        CREATE TAG my_tag ALLOWED_VALUES 'blue', 'red'
          PROPAGATE = ON_DEPENDENCY
          ON_CONFLICT = ALLOWED_VALUES_SEQUENCE;
        ```

        If there is a conflict, then the value of `my_tag` will be `blue` because it comes before `red` in the allowed values list.

    Default: Set the value of the tag to `CONFLICT`.

`COMMENT = 'string_literal'`
:   Specifies a comment for the tag.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE TAG | Schema |  |
| OWNERSHIP | Tag | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER TAG statement for an *existing* tag. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on tag DDL and privileges, see [Access control privileges](../../user-guide/object-tagging/work.md).

## General usage notes

* You must set the ALLOWED_VALUES parameter before all other parameters, such as COMMENT.
* For more information about how tags can be associated with Snowflake objects, see [Introduction to object tagging](../../user-guide/object-tagging/introduction.md).
* For more information about tag DDL authorization, see [required privileges](../../user-guide/object-tagging/work.md).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## CREATE OR ALTER TAG usage notes

* When you use this command, all unspecified parameters are reset. For example, if you specify a new comment only, then the PROPAGATE
  parameter no longer enables tag propagation.
* All limitations of the [ALTER TAG](alter-tag.md) command apply.
* Setting or unsetting a masking policy is not supported.

## Examples

Create a tag with the key `cost_center`.

```sqlexample
CREATE TAG cost_center COMMENT = 'cost_center tag';
```

Update `cost_center` to include new allowed values and unset the comment:

```sqlexample
CREATE OR ALTER TAG cost_center ALLOWED_VALUES 'finance', 'engineering', 'sales';
```

---
title: CREATE TASK
source: https://docs.snowflake.com/en/sql-reference/sql/create-task.md
section: SQL Commands
---

# CREATE TASK

Creates a new [task](../../user-guide/tasks-intro.md) in the current/specified schema or replaces an existing task.

This command supports the following variants:

* CREATE OR ALTER TASK: Creates a task if it doesn’t exist or alters an existing task.
* CREATE TASK … CLONE: Creates a clone of an existing task.

See also:
:   [ALTER TASK](alter-task.md) , [DROP TASK](drop-task.md) , [SHOW TASKS](show-tasks.md) , [DESCRIBE TASK](desc-task.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

> **Important:**
>
> Newly created or cloned tasks are created suspended. For information about running suspended tasks, see [ALTER TASK … RESUME](alter-task.md) or [EXECUTE TASK](execute-task.md).

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] TASK [ IF NOT EXISTS ] <name>
    [ WITH TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
    [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
    [ { WAREHOUSE = <string> }
      | { USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = <string> } ]
    [ SCHEDULE = { '<num> { HOURS | MINUTES | SECONDS }'
      | 'USING CRON <expr> <time_zone>' } ]
    [ CONFIG = <configuration_string> ]
    [ OVERLAP_POLICY = { NO_OVERLAP | ALLOW_CHILD_OVERLAP | ALLOW_ALL_OVERLAP } ]
    [ <session_parameter> = <value>
      [ , <session_parameter> = <value> ... ] ]
    [ USER_TASK_TIMEOUT_MS = <num> ]
    [ SUSPEND_TASK_AFTER_NUM_FAILURES = <num> ]
    [ ERROR_INTEGRATION = <integration_name> ]
    [ SUCCESS_INTEGRATION = <integration_name> ]
    [ LOG_LEVEL = '<log_level>' ]
    [ COMMENT = '<string_literal>' ]
    [ FINALIZE = <string> ]
    [ TASK_AUTO_RETRY_ATTEMPTS = <num> ]
    [ USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS = <num> ]
    [ TARGET_COMPLETION_INTERVAL = '<num> { HOURS | MINUTES | SECONDS }' ]
    [ SERVERLESS_TASK_MIN_STATEMENT_SIZE = '{ XSMALL | SMALL
      | MEDIUM | LARGE | XLARGE | XXLARGE }' ]
    [ SERVERLESS_TASK_MAX_STATEMENT_SIZE = '{ XSMALL | SMALL
      | MEDIUM | LARGE | XLARGE | XXLARGE }' ]
  [ AFTER <string> [ , <string> , ... ] ]
  [ EXECUTE AS USER <user_name> ]
  [ WHEN <boolean_expr> ]
  AS
    <sql>
```

## Variant syntax

### CREATE OR ALTER TASK

Creates a new task if it doesn’t already exist, or transforms an existing task into the task defined in the statement.
A CREATE OR ALTER TASK statement follows the syntax rules of a CREATE TASK statement and has the same limitations as an
[ALTER TASK](alter-task.md) statement.

Supported task alterations include:

* Change task properties and parameters. For example, SCHEDULE, USER_TASK_TIMEOUT_MS, or COMMENT.
* Set, unset, or change task predecessors.
* Set, unset, or change task condition (WHEN clause).
* Change the task definition (AS clause).

For more information, see CREATE OR ALTER TASK usage notes.

```sqlsyntax
CREATE OR ALTER TASK <name>
    [ { WAREHOUSE = <string> }
      | { USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = <string> } ]
    [ SCHEDULE = { '<num> { HOURS | MINUTES | SECONDS }'
      | 'USING CRON <expr> <time_zone>' } ]
    [ CONFIG = <configuration_string> ]
    [ OVERLAP_POLICY = { NO_OVERLAP | ALLOW_CHILD_OVERLAP | ALLOW_ALL_OVERLAP } ]
    [ USER_TASK_TIMEOUT_MS = <num> ]
    [ <session_parameter> = <value>
      [ , <session_parameter> = <value> ... ] ]
    [ SUSPEND_TASK_AFTER_NUM_FAILURES = <num> ]
    [ ERROR_INTEGRATION = <integration_name> ]
    [ SUCCESS_INTEGRATION = <integration_name> ]
    [ COMMENT = '<string_literal>' ]
    [ FINALIZE = <string> ]
    [ TASK_AUTO_RETRY_ATTEMPTS = <num> ]
  [ AFTER <string> [ , <string> , ... ] ]
  [ EXECUTE AS USER <user_name> ]
  [ WHEN <boolean_expr> ]
  AS
    <sql>
```

### CREATE TASK … CLONE

Creates a new task with the same parameter values:

> ```sqlsyntax
> CREATE [ OR REPLACE ] TASK <name> CLONE <source_task>
>   [ ... ]
> ```

For more details, see [CREATE <object> … CLONE](create-clone.md).

> **Note:**
>
> Cloning tasks using CREATE TASK <name> CLONE, or cloning a schema containing tasks,
> copies all underlying task properties unless explicitly overridden.

## Required parameters

`name`
:   String that specifies the identifier for the task; must be unique for the schema in which the task is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes, such as `"My object"`. Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`sql`
:   Any one of the following:

    * Single SQL statement
    * Call to a stored procedure
    * Procedural logic using [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md)

    The SQL code is executed when the task runs. Verify that the `{sql}` executes as expected before using it in a task.

### Clone tasks in a task graph

> For task graphs, you might also need to make clones of each dependent task (that is, each child task or finalizer task); for example:
>
> 1. Clone the task (for example, `CREATE TASK new_task_name CLONE old_task_name`).
> 2. Find dependent tasks by using the [TASK_DEPENDENTS](../functions/task_dependents.md) function; for example:
>
>    ```sqlexample
>    SELECT * FROM TABLE(INFORMATION_SCHEMA.TASK_DEPENDENTS('old_task_name'));
>    ```
> 3. Clone the dependent tasks (for example, `CREATE TASK new_child_task CLONE old_child_task`).
> 4. Update the new dependent tasks to use the new cloned task name (`ALTER TASK new_child_task ADD AFTER new_task_name`).

## Optional parameters

`CREATE OR REPLACE TASK` or . `CREATE TASK IF NOT EXISTS`

> * `..OR REPLACE`
>   Replaces an existing task with the same name. If the task doesn’t exist, this clause is ignored.
>
>   Consider the following behaviors when you replace a task:
>
>   + The recreated task is suspended by default.
>   + If a standalone or root task is recreated, the next scheduled run of the task is cancelled.
>   + CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.
>   + Tasks with large definitions can cause errors. If you experience an error due to task size, try using stored procedure or making your task definition less complex.
>
>   When you replace a task, any ongoing task run is completed.
>
>   + To stop an ongoing task run before replacing it with CREATE OR REPLACE TASK, use the [SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS](../functions/system_user_task_cancel_ongoing_executions.md) function.
>   + To stop an ongoing task run after you replace it with CREATE OR REPLACE TASK:
>
>     1. Find the query ID of the ongoing run; for example:
>
>        ```sqlexample
>        select name, query_id, state, scheduled_time, error_message
>        from table(information_schema.task_history(task_name => 'my_task'));
>        ```
>     2. Cancel the query using the [SYSTEM$CANCEL_QUERY](../functions/system_cancel_query.md) function with the query ID, for example:
>
>        ```sqlexample
>        select system$cancel_query('query_id');
>        ```
>     3. Monitor the task run for a few seconds until the cancel completes, for example:
>
>        ```sqlexample
>        select name, query_id, state, scheduled_time, error_message
>        from table(information_schema.task_history(task_name => 'my_task'));
>        ```
> * `...IF NOT EXISTS`
>   Creates a new task only if a task with the same name doesn’t already exist. If the task already exists, this clause is ignored.
>
> > **Note:**
> >
> > * The `CREATE OR REPLACE` and `CREATE IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.

`WAREHOUSE = string` or . `USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = string`

> `WAREHOUSE = string`
> :   Specifies the virtual warehouse that provides compute resources for task runs.
>
>     Omit this parameter to use serverless compute resources for runs of this task. Snowflake automatically resizes and scales serverless
>     compute resources as required for each workload. When a schedule is specified for a task, Snowflake adjusts the resource size to
>     complete future runs of the task within the specified time frame. To specify the initial warehouse size for the task, set the
>     `USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = string` parameter.
>
> `USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = string`
> :   Applied only to serverless tasks.
>
>     Specifies the size of the compute resources to provision for the first run of the task, before a task history is available for
>     Snowflake to determine an ideal size. Once a task has successfully completed a few runs, Snowflake ignores this parameter setting.
>
>     Note that if the task history is unavailable for a given task, the compute resources revert to this initial size.
>
>     > **Note:**
>     >
>     > If a `WAREHOUSE = string` parameter value is specified, then setting this parameter produces a user error.
>
>     The size is equivalent to the compute resources available when creating a warehouse (using
>     [CREATE WAREHOUSE](create-warehouse.md)), such as `SMALL`, `MEDIUM`, or `LARGE`. The largest size supported by the parameter
>     is `XXLARGE`. If the parameter is omitted, the first runs of the task are executed using a medium-sized (`MEDIUM`) warehouse.
>
>     You can change the initial size (using [ALTER TASK](alter-task.md)) after the task is created but
>     before it has run successfully once. Changing the parameter after the first run of this task starts has no effect on the
>     compute resources for current or future task runs.
>
>     Note that suspending and resuming a task doesn’t remove the task history used to size the compute resources. The task history is
>     only removed if the task is recreated (using the CREATE OR REPLACE TASK syntax).
>
>     For more information about this parameter, see [USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE](../parameters.md).

`SCHEDULE = ...`
:   Specifies the schedule for periodically running the task:

    > **Note:**
    >
    > * For [Triggered tasks](../../user-guide/tasks-triggered.md), a schedule is not required. For other tasks, a schedule must be defined for a standalone task or the root task in a [task graph](../../user-guide/tasks-graphs.md);
    >   otherwise, the task only runs if manually executed using [EXECUTE TASK](execute-task.md).
    > * A schedule cannot be specified for child tasks in a task graph.

    * `'USING CRON expr time_zone'`
      :   Specifies a cron expression and time zone for periodically running the task. Supports a subset of standard cron utility syntax.

      + `'expr'`: The cron expression consists of the following fields:

        ```bash
        # __________ minute (0-59)
        # | ________ hour (0-23)
        # | | ______ day of month (1-31, or L)
        # | | | ____ month (1-12, JAN-DEC)
        # | | | | __ day of week (0-6, SUN-SAT, or L)
        # | | | | |
        # | | | | |
          * * * * *
        ```

        The following special characters are supported:

        `*`
        :   Wildcard. Specifies any occurrence of the field.

        `L`
        :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of
            a given month. In the day-of-month field, it specifies the last day of the month.

        `/n`
        :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
            specified in the month field, then the task is scheduled for April, July, and October, which is every 3 months, starting with the
            4th month of the year. The same schedule is maintained in subsequent years. That is, the task is not scheduled to run in
            January (3 months after the October run).

        Timing examples:

        | SCHEDULE Value | Description |
        | --- | --- |
        | `* * * * * UTC` | Every minute. UTC time zone. |
        | `0/5 * * * * UTC` | Every five minutes, starting at the top of the hour. UTC time zone. |
        | `5 * * * * UTC` | The 5th minute of every hour. UTC time zone. |
        | `30 3 * * * UTC` | Every night at 3:30 a.m. UTC time zone. |
        | `0 6,18 * * * UTC` | Twice daily, at 6:00 a.m. and 6:00 p.m.UTC time zone. |
        | `0 3 * * MON-FRI UTC` | Weekdays at 3:00 a.m. UTC time zone. |
        | `0 0 1 * * UTC` | At midnight on the first day of every month. UTC time zone. |
        | `0 0 L * * UTC` | At midnight on the last day of every month. UTC time zone. |
      > > **Note:**
      > > + The cron expression defines all valid run times for the task. Snowflake attempts to run a task based on this schedule;
      > >   however, any valid run time is skipped if a previous run hasn’t completed before the next valid run time starts.
      > > + When both a specific day of month and day of week are included in the cron expression, then the task is scheduled on days
      > >   satisfying either the day of month or day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
      > >   schedules a task at midnight on any 10th to 20th day of the month and also on any Tuesday or Thursday outside of those dates.
      > > + The shortest granularity of time in cron is minutes. To set a task to run in a shorter interval, use the `SCHEDULE = ' <num> SECONDS'` parameter instead. For example, `SCHEDULE = '10 SECONDS'` runs the task every 10 seconds.
      > > + If a task is resumed during the minute defined in its cron expression,
      > >   the first scheduled run of the task is the next occurrence of the instance of the cron expression. For example, if task
      > >   scheduled to run daily at midnight (`USING CRON 0 0 * * *`) is resumed at midnight plus 5 seconds (`00:00:05`), the
      > >   first task run is scheduled for the following midnight.

      + `'time_zone'`: The cron time zone for the task. The time zone is specified as a string literal. For a list of time zones, see the [list of tz database time zones](https://wikipedia.org/wiki/List_of_tz_database_time_zones)
        (in Wikipedia). Example:

        | SCHEDULE Value | Description |
        | --- | --- |
        | `0 3 * * * America/Los_Angeles` | Every night at 3:00 a.m., Pacific Standard Time / Pacific Daylight Time (PST/PDT) time zone |

        > **Note:**
        > - The cron expression currently evaluates against the specified time zone only. Altering the [TIMEZONE](../parameters.md) parameter value for the account (or setting the value at the user or session level) does *not* change the time zone for the task.
        > - For time zones that observe daylight saving time, tasks scheduled during daylight saving time transitions can have unexpected behaviors. Examples:
        > - During the change from daylight saving time to standard time, a task scheduled to start at 1:00 a.m. in the America/Los_Angeles time zone (`0 1 * * * America/Los_Angeles`) would run twice: at 1:00 a.m., and then again when 1:59:59 a.m. shifts to 1:00:00 a.m. local time.
        > - During the change from standard time to daylight saving time, a task scheduled to start at 2:00 a.m. in the America/Los_Angeles time zone (`0 2 * * * America/Los_Angeles`) would not run because the local time shifts from 1:59:59 a.m. to 3:00:00 a.m.
        >
        > To avoid unexpected task executions due to daylight saving time, consider the following:
        >
        > - Don’t schedule tasks to start between 1:00 a.m. and 2:59 a.m.
        > - Manually adjust the cron expression for tasks scheduled between 1 a.m. and 3 a.m. twice each year to compensate for the time change.
        > - Use a time format that does not apply daylight saving time, such as UTC. Do not change the time zone for the task.

    > * `'num { HOURS | MINUTES | SECONDS }'`
    >   :   Specifies an interval of wait time between runs of the task.
    >
    >       Snowflake sets the base interval time when the task is resumed ([ALTER TASK … RESUME](alter-task.md)) or when a different interval is set ([ALTER TASK … SET SCHEDULE](alter-task.md)).
    >
    >       For example, if an INTERVAL value of `10 MINUTES` is set and the task is enabled at 9:03 a.m., then the task runs at 9:13 a.m., 9:23 a.m., and
    >       so on.
    >
    >       Snowflake ensures that a task won’t run before the set interval; however, Snowflake can’t guarantee task runs at precisely the specified interval.
    >
    >       Values: `{ 10 - 691200 } SECONDS`, `{ 1 - 11520 } MINUTES`, or `{ 1-192 } HOURS` (That is, from 10 seconds to the equivalent of 8 days). Accepts positive integers only.
    >
    >       Also supports the notations: HOUR, MINUTE, SECOND, and H, M, S.

    * `CONFIG = configuration_string`
      :   Specifies the default configuration string in valid JSON format that all tasks in a
          [task graph](../../user-guide/tasks-graphs.md) can access.
          This default configuration can be overridden for a single execution by using the
          [EXECUTE TASK](execute-task.md) command.

          Syntax:

          ```sqlsyntax
          CONFIG=$${"string1": value1 [, "string2": value2, ...] }$$
          ```

          Examples:

          ```sqlexample
          CONFIG=$${"learning_rate": 0.1}$$
          ```

          ```sqlexample
          CONFIG=$${"environment": "production", "path": "/prod_directory/"}$$
          ```

          > **Note:**
          > + To share information with tasks in a task graph, you must define this parameter in the root task.
          > + You can set this parameter on standalone tasks, but doing so doesn’t affect the task behavior.

`OVERLAP_POLICY = NO_OVERLAP | ALLOW_CHILD_OVERLAP | ALLOW_ALL_OVERLAP`
:   Specifies the overlap policy for task graph runs, controlling whether multiple instances of the task graph can run concurrently and the level of parallelism allowed.

    > **Note:**
    >
    > * You can only set this parameter on a root task. The setting applies to all tasks in the task graph.

    * `NO_OVERLAP`: Executes tasks serially with no parallelism. Snowflake schedules the next run of a root task only after all
      child tasks in the task graph finish running. If the cumulative time to run all tasks in the task graph exceeds the scheduled
      interval defined for the root task, Snowflake skips at least one task graph run.
    * `ALLOW_CHILD_OVERLAP`: Allows child task parallelism. When the next scheduled run time for the root task occurs while any
      child task is still running, Snowflake starts a new instance of the task graph. If the root task itself is still running when
      the next scheduled run time occurs, Snowflake skips that scheduled run.
    * `ALLOW_ALL_OVERLAP`: Allows unlimited true parallelism. Multiple instances of the entire task graph, including the root
      task, can run concurrently. When the next scheduled run time occurs, Snowflake starts a new instance of the task graph
      immediately, regardless of whether any task (including the root task) is still running.

    Default: `NO_OVERLAP`

`session_parameter = value [ , session_parameter = value ... ]`
:   Specifies a comma-separated list of session parameters to set for the session when the task runs. A task supports all session
    parameters. For the complete list, see [Session parameters](../parameters.md).

    > **Note:**
    >
    > The following session parameter configurations aren’t supported for tasks:
    >
    > * [SEARCH_PATH](../parameters.md) set to any value.
    > * [AUTOCOMMIT = FALSE](../parameters.md).

`USER_TASK_TIMEOUT_MS = num`
:   Specifies the time limit on a single run of the task before it times out (in milliseconds).

    > **Note:**
    >
    > * Before you increase the time limit on a task significantly, consider whether the SQL statement initiated by the task could be
    >   optimized (either by rewriting the statement or using a stored procedure) or the warehouse size should be increased.
    > * When both [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md) and USER_TASK_TIMEOUT_MS are set, the timeout is the lowest non-zero value of the two parameters.
    > * When both [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../parameters.md) and USER_TASK_TIMEOUT_MS are set, the value of USER_TASK_TIMEOUT_MS takes precedence.
    >
    > For more information about this parameter, see [USER_TASK_TIMEOUT_MS](../parameters.md).

    Values: `0` - `604800000` (7 days). A value of `0` specifies that the maximum timeout value is enforced.

    Default: `3600000` (1 hour)

`SUSPEND_TASK_AFTER_NUM_FAILURES = num`
:   Specifies the number of consecutive failed task runs after which the current task is suspended automatically. Failed task runs
    include runs in which the SQL code in the task body either produces a user error or times out. Task runs that are skipped,
    canceled, or that fail due to a system error are considered indeterminate and aren’t included in the count of failed task runs.

    Set the parameter on a standalone task or the root task in a task graph. When the parameter is set to a value greater than `0`, the
    following behavior applies to runs of the standalone task or task graph:

    * Standalone tasks are automatically suspended after the specified number of consecutive task runs either fail or time out.
    * The root task is automatically suspended after the run of any single task in a task graph fails or times out the specified
      number of times in consecutive runs.

    When the parameter is set to `0`, failed tasks aren’t automatically suspended.

    The setting applies to tasks that rely on either serverless compute resources or virtual warehouse compute resources.

    For more information about this parameter, see [SUSPEND_TASK_AFTER_NUM_FAILURES](../parameters.md).

    Values: `0` - No upper limit.

    Default: `10`

`ERROR_INTEGRATION = 'integration_name'`
:   Required only when configuring a task to send error notifications using Amazon Simple Notification Service (SNS), Microsoft Azure Event Grid, or Google Pub/Sub.

    Specifies the name of the notification integration used to communicate with Amazon SNS, MS Azure Event Grid, or Google Pub/Sub. For more information, see
    [Set up error notifications for tasks](../../user-guide/tasks-errors.md).

`SUCCESS_INTEGRATION = 'integration_name'`
:   Required only when configuring a task to send success notifications using Amazon Simple Notification Service (SNS), Microsoft Azure Event Grid, or Google Pub/Sub.

    Specifies the name of the notification integration used to communicate with Amazon SNS, MS Azure Event Grid, or Google Pub/Sub. For more information, see
    [Set up error notifications for tasks](../../user-guide/tasks-errors.md).

`LOG_LEVEL = 'log_level'`
:   Specifies the severity level of [events for this task](../../user-guide/tasks-events.md) that are ingested and made available in
    the active event table. Events at the specified level (and at more severe levels) are ingested.

    For more information about levels, see [LOG_LEVEL](../parameters.md). For information about setting the log level, see
    [Setting levels for logging, metrics, and tracing](../../developer-guide/logging-tracing/telemetry-levels.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the task.

    Default: No value

`AFTER string [ , string , ... ]`
:   Specifies one or more predecessor tasks for the current task. Use this option to create a [task graph](../../user-guide/tasks-graphs.md) or
    add this task to an existing task graph. A task graph is a series of tasks that starts with a scheduled root task and is linked together
    by dependencies.

    Note that the structure of a task graph can be defined after all of its component tasks are created. Execute
    [ALTER TASK](alter-task.md) … ADD AFTER statements to specify the predecessors for each task in the planned task graph.

    A task runs after all of its predecessor tasks have finished their own runs successfully (after a brief lag).

    > **Note:**
    >
    > * The root task should have a defined schedule. Each child task must have one or more defined predecessor tasks, specified
    >   using the `AFTER` parameter, to link the tasks together.
    > * A single task is limited to 100 predecessor tasks and 100 child tasks. In addition, a task graph is limited to a maximum of 1000 tasks
    >   total (including the root task) in either a resumed or suspended state.
    > * Accounts are currently limited to a maximum of 30000 resumed tasks.
    > * All tasks in a task graph must have the same task owner. A single role must have the OWNERSHIP privilege on all of the tasks in
    >   the task graph.
    > * All tasks in a task graph must exist in the same schema.
    > * The root task must be suspended before any task is recreated (using the CREATE OR REPLACE TASK syntax) or a child task
    >   is added (using CREATE TASK … AFTER or ALTER TASK … ADD AFTER) or removed (using ALTER TASK … REMOVE AFTER).
    > * If any task in a task graph is cloned, the role that clones the task becomes the owner of the clone by default.
    >
    >   + If the owner of the original task creates the clone, then the task clone retains the link between the task and the predecessor
    >     task. This means the same predecessor task triggers both the original task and the task clone.
    >   + If another role creates the clone, then the task clone can have a schedule but not a predecessor.
    > * Current limitations:
    >
    >   + Snowflake guarantees that at most one instance of a task with a defined schedule is running at a given time; however, we cannot
    >     provide the same guarantee for tasks with a defined predecessor task.

`WHEN boolean_expr`
:   Specifies a Boolean SQL expression; multiple conditions joined with AND/OR are supported. When a task is triggered (based on its
    `SCHEDULE` or `AFTER` setting), it validates the conditions of the expression to determine whether to execute. If the
    conditions of the expression are not met, then the task skips the current run. Any tasks that identify this task as a
    predecessor also don’t run.

    The following are supported in a task WHEN clause:

    * [SYSTEM$STREAM_HAS_DATA](../functions/system_stream_has_data.md) is supported for evaluation in the SQL expression.

      This function indicates whether a specified stream contains change tracking data. You can use this function to evaluate whether the specified stream contains
      change data before starting the current run. If the result is FALSE, then the task doesn’t run.

      > **Note:**
      >
      > [SYSTEM$STREAM_HAS_DATA](../functions/system_stream_has_data.md) is designed to avoid returning a FALSE value even when the stream contains
      > change data. However, this function isn’t guaranteed to avoid returning a TRUE value when the stream contains no change data.
    * [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](../functions/system_get_predecessor_return_value.md) is supported for evaluation in the SQL expression.

      This function retrieves the return value for the predecessor task in a task graph. The return value can be used as part of
      a boolean expression. When using SYSTEM$GET_PREDECESSOR_RETURN_VALUE, you can cast the returned value to
      the appropriate numeric, string, or boolean type if required.

      Simple examples include:

      ```sqlexample
      WHEN NOT SYSTEM$GET_PREDECESSOR_RETURN_VALUE('task_name')::BOOLEAN
      ```

      ```sqlexample
      WHEN SYSTEM$GET_PREDECESSOR_RETURN_VALUE('task_name') != 'VALIDATION'
      ```

      ```sqlexample
      WHEN SYSTEM$GET_PREDECESSOR_RETURN_VALUE('task_name')::FLOAT < 0.2
      ```

      > **Note:**
      >
      > Use of [PARSE_JSON](../functions/parse_json.md) in TASK … WHEN expressions isn’t supported as it requires warehouse based compute resources.
    * [Boolean operators](../operators-logical.md) such as AND, OR, NOT, and others.

      Simple example that runs whenever data changes in either of two streams:

      ```sqlexample
      CREATE TASK my_task
          WAREHOUSE = my_warehouse
          WHEN SYSTEM$STREAM_HAS_DATA('my_customer_stream')
          OR   SYSTEM$STREAM_HAS_DATA('my_order_stream')
          AS
            SELECT CURRENT_TIMESTAMP;
      ```
    * Casts between numeric, string, and boolean types.
    * [Comparison operators](../operators-comparison.md) such as equal, not equal, greater than, less than, and others.

    Validating the conditions of the WHEN expression does not require compute resources. The validation is instead processed in the cloud
    services layer. A nominal charge accrues each time a task evaluates its WHEN condition and doesn’t run. The charges accumulate each time
    the task is triggered until it runs. At that time, the charge is converted to Snowflake credits and added to the compute resource usage
    for the task run.

    Generally the compute time to validate the condition is insignificant compared to task execution time. As a best practice, align
    scheduled and actual task runs as closely as possible. Avoid task schedules that don’t align with task runs. For
    example, if data is inserted into a table with a stream roughly every 24 hours, don’t schedule a task that checks for stream data
    every minute. The charge to validate the WHEN expression with each run is generally insignificant, but the charges are cumulative.

    Note that daily consumption of cloud services that falls below the
    [10% quota of the daily usage of the compute resources](../../user-guide/cost-understanding-compute.md) accumulates no cloud services charges.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

`FINALIZE = string`
:   Specifies the name of a root task that the finalizer task is associated with. Finalizer tasks run after all other tasks in the task graph run to completion. You can define the SQL of a finalizer task to handle notifications and the release and cleanup of resources that a task graph uses. For more information, see [Finalizer task](../../user-guide/tasks-graphs.md).

    * A root task can only have one finalizer task. If you create multiple finalizer tasks for a root task, the task creation will fail.
    * A finalizer task cannot have any child tasks. Any command attempting to make the finalizer task a predecessor will fail.
    * A finalizer task cannot have a schedule. Creating a finalizer task with a schedule will fail.

    Default: No value

`TASK_AUTO_RETRY_ATTEMPTS = num`
:   Specifies the number of automatic task graph retry attempts. If any task graphs complete in a FAILED state, Snowflake can automatically
    retry the task graphs from the last task in the graph that failed.

    The automatic task graph retry is disabled by default. To enable this feature, set TASK_AUTO_RETRY_ATTEMPTS to a value greater than `0`
    on the root task of a task graph.

    Note that this parameter must be set to the root task of a task graph. If it’s set to a child task, an error will be returned.

    Values: `0` - `30`.

    Default: `0`

`USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS = num`
:   Defines how frequently a task can execute in seconds. If data changes occur more often than the specified minimum, changes will be
    grouped and processed together.

    The task will run every 12 hours even if this value is set to more than 12 hours.

    Values: Minimum `10`, maximum `604800`.

    Default: `30`

`TARGET_COMPLETION_INTERVAL = 'num { HOURS | MINUTES | SECONDS }'`
:   Specifies the desired task completion time. This parameter only applies to serverless tasks. This property is only set on a Task.

    This parameter is required when you create serverless [Triggered tasks](../../user-guide/tasks-triggered.md).

    Values: `{ 10 - 86400 } SECONDS`, `{ 1 - 1440 } MINUTES`, or `{ 1-24 } HOURS` (That is, from 10 seconds to the equivalent of 1 day). Accepts positive integers only.

    Also supports the notations: HOUR, MINUTE, SECOND, and H, M, S.

    Default: Snowflake resizes serverless compute resources to complete before the next scheduled execution time.

`SERVERLESS_TASK_MIN_STATEMENT_SIZE = string`
:   Specifies the minimum allowed warehouse size for the serverless task. This parameter only applies to serverless tasks. This parameter can be specified on the Task, Schema, Database, or Account. Precedence follows the standard parameter hierarchy.

    Values: Minimum `XSMALL`, Maximum `XXLARGE`. Values are consistent with [WAREHOUSE_SIZE values](create-warehouse.md).

    Also supports the notation: X2LARGE.

    Default: `XSMALL`

    Note that if both SERVERLESS_TASK_MIN_STATEMENT_SIZE and USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE are specified, SERVERLESS_TASK_MIN_STATEMENT_SIZE must be equal to or smaller than USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE.

`SERVERLESS_TASK_MAX_STATEMENT_SIZE = string`
:   Specifies the maximum allowed warehouse size for the serverless task. This parameter only applies to serverless tasks. This parameter can be specified on the Task, Schema, Database, or Account. Precedence follows the standard parameter hierarchy.

    Values: Minimum `XSMALL`, Maximum `XXLARGE`.

    Also supports the notation: X2LARGE.

    Default: `XXLARGE`

    If both SERVERLESS_TASK_MIN_STATEMENT_SIZE and SERVERLESS_TASK_MAX_STATEMENT_SIZE are specified, SERVERLESS_TASK_MIN_STATEMENT_SIZE must be less than or equal to SERVERLESS_TASK_MAX_STATEMENT_SIZE. SERVERLESS_TASK_MAX_STATEMENT_SIZE must be equal to or greater than USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE

`EXECUTE AS USER user_name`
:   Runs the task on behalf of a specified user account. The user who runs the command must have permissions granted by using the [GRANT IMPERSONATE ON USER TO ROLE](grant-privilege-user.md) command.

    For more information, see [Run tasks with user privileges](../../user-guide/tasks-intro.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| EXECUTE TASK | Account | Required to run any tasks the role owns. Revoking the EXECUTE TASK privilege on a role prevents all subsequent task runs from starting under that role. |
| EXECUTE MANAGED TASK | Account | Required only for tasks that rely on serverless compute resources for runs. |
| CREATE TASK | Schema |  |
| USAGE | Warehouse | Required only for tasks that rely on user-managed warehouses for runs. |
| OWNERSHIP | Task | Required only when executing a CREATE OR ALTER TASK statement for an *existing* task.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Tasks run using the task owner’s privileges. For the list of minimum required privileges to run tasks, see
  [Task security](../../user-guide/tasks-intro.md).

  Run the SQL statement or call the stored procedure, as the task owner role, before you include it in a task definition to ensure
  the role has the required privileges on objects referenced by the SQL or stored procedure.
* For serverless tasks:

  + Serverless compute resources for a task can range from the equivalent of `XSMALL` to `XXLARGE` in warehouse sizes. To request a
    size increase, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
  + Individual tasks in a task graph can use serverless or user-managed compute resources. Using the serverless compute for
    all tasks in the task graph isn’t required.
* If a task fails with an unexpected error, you can receive a notification about the error.
  For more information on configuring task error notifications, see [Set up error notifications for tasks](../../user-guide/tasks-errors.md).
* By default, a DML statement executed without explicitly starting a transaction is automatically committed on success or rolled back on
  failure at the end of the statement. This behavior is called *autocommit* and is controlled with the [AUTOCOMMIT](../parameters.md) parameter.
  This parameter must be set to TRUE. If the AUTOCOMMIT parameter is set to FALSE at the account level, then set the parameter to
  TRUE for the individual task (using ALTER TASK … SET AUTOCOMMIT = TRUE); otherwise, any DML statement executed by the task fails.
* Only one task should consume data from a stream. Create multiple streams for the same table to be consumed by more than one task. When a
  task consumes the data in a stream using a DML statement, the stream advances the offset and change data is no longer available for the
  next task to consume.
* The `OVERLAP_POLICY` parameter replaces the deprecated `ALLOW_OVERLAPPING_EXECUTION` parameter. For backward compatibility,
  `ALLOW_OVERLAPPING_EXECUTION = TRUE` maps to `OVERLAP_POLICY = ALLOW_CHILD_OVERLAP`, and
  `ALLOW_OVERLAPPING_EXECUTION = FALSE` maps to `OVERLAP_POLICY = NO_OVERLAP`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## CREATE OR ALTER TASK usage notes

* All limitations of the [ALTER TASK](alter-task.md) command apply.
* A task cannot be resumed or suspended using the CREATE OR ALTER TASK command. To resume or suspend a task, use the ALTER TASK command.
* Setting or unsetting a tag is not supported; however existing tags are *not* altered by a CREATE OR ALTER statement and remain unchanged.

## Examples

### Single SQL statement

Create a serverless task that queries the current timestamp every hour starting at 9:00 a.m. and ending at 5:00 p.m. on Sundays
(America/Los_Angeles time zone).

The initial warehouse size is XSMALL:

```sqlexample
CREATE TASK t1
  SCHEDULE = 'USING CRON 0 9-17 * * SUN America/Los_Angeles'
  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL'
  AS
    SELECT CURRENT_TIMESTAMP;
```

Same as the previous example, but the task relies on a user-managed warehouse to provide the compute resources for runs:

```sqlexample
CREATE TASK mytask_hour
  WAREHOUSE = mywh
  SCHEDULE = 'USING CRON 0 9-17 * * SUN America/Los_Angeles'
  AS
    SELECT CURRENT_TIMESTAMP;
```

Create a serverless task that inserts the current timestamp into a table every hour. The task sets the [TIMESTAMP_INPUT_FORMAT](../parameters.md)
parameter for the session in which the task runs. This session parameter specifies the format of the inserted timestamp:

```sqlexample
CREATE TASK t1
  SCHEDULE = '60 MINUTES'
  TIMESTAMP_INPUT_FORMAT = 'YYYY-MM-DD HH24'
  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL'
  AS
    INSERT INTO mytable(ts) VALUES(CURRENT_TIMESTAMP);
```

Create a task that inserts the current timestamp into a table every 5 minutes:

```sqlexample
CREATE TASK mytask_minute
  WAREHOUSE = mywh
  SCHEDULE = '5 MINUTES'
  AS
    INSERT INTO mytable(ts) VALUES(CURRENT_TIMESTAMP);
```

Create a task that inserts change tracking data for INSERT operations from a stream into a table every 5 minutes. The task polls the
stream using the SYSTEM$STREAM_HAS_DATA function to determine whether change data exists and, if the result is `FALSE`, skips the
current run:

```sqlexample
CREATE TASK mytask1
  WAREHOUSE = mywh
  SCHEDULE = '5 MINUTES'
  WHEN
    SYSTEM$STREAM_HAS_DATA('MYSTREAM')
  AS
    INSERT INTO mytable1(id,name) SELECT id, name FROM mystream WHERE METADATA$ACTION = 'INSERT';
```

Create a serverless child task in a task graph and add multiple predecessor tasks. The child task runs only after all specified predecessor
tasks have successfully completed their own runs.

Suppose that the root task for a task graph is `task1` and that `task2`, `task3`, and `task4`
are child tasks of `task1`. This example adds child task `task5` to the task graph and specifies
`task2`, `task3`, and `task4` as predecessor tasks:

```sqlexample
-- Create task5 and specify task2, task3, task4 as predecessors tasks.
-- The new task is a serverless task that inserts the current timestamp into a table column.
CREATE TASK task5
  AFTER task2, task3, task4
AS
  INSERT INTO t1(ts) VALUES(CURRENT_TIMESTAMP);
```

### Stored procedure

Create a task named `my_copy_task` that calls a stored procedure to unload data from the `mytable` table to the named `mystage`
stage (using [COPY INTO <location>](copy-into-location.md)) every hour:

```sqlexample
-- Create a stored procedure that unloads data from a table
-- The COPY statement in the stored procedure unloads data to files in a path identified by epoch time (using the Date.now() method)
CREATE OR REPLACE PROCEDURE my_unload_sp()
  returns string not null
  language javascript
  AS
    $$
      var my_sql_command = ""
      var my_sql_command = my_sql_command.concat("copy into @mystage","/",Date.now(),"/"," from mytable overwrite=true;");
      var statement1 = snowflake.createStatement( {sqlText: my_sql_command} );
      var result_set1 = statement1.execute();
    return my_sql_command; // Statement returned for info/debug purposes
    $$;

-- Create a task that calls the stored procedure every hour
CREATE TASK my_copy_task
  WAREHOUSE = mywh
  SCHEDULE = '60 MINUTES'
  AS
    CALL my_unload_sp();
```

### Multiple SQL statements

Create a task that executes multiple SQL statements. In this example, the task modifies the TIMESTAMP_OUTPUT_FORMAT for the session and
then queries the CURRENT_TIMESTAMP function.

```sqlexample
CREATE OR REPLACE TASK test_logging
  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL'
  SCHEDULE = 'USING CRON  0 * * * * America/Los_Angeles'
  AS
    BEGIN
      ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
      SELECT CURRENT_TIMESTAMP;
    END;
```

### Procedural logic using Snowflake Scripting

Create a task that declares a variable, uses the variable, and returns the value of the variable every 15 seconds:

```sqlexample
CREATE TASK t1
  USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE = 'XSMALL'
  SCHEDULE = '15 SECONDS'
  AS
    DECLARE
      radius_of_circle float;
      area_of_circle float;
    BEGIN
      radius_of_circle := 3;
      area_of_circle := pi() * radius_of_circle * radius_of_circle;
      return area_of_circle;
    END;
```

### Root task with configuration

Create a task that specifies configuration, and then reads that configuration.

```sqlexample
CREATE OR REPLACE TASK root_task_with_config
  WAREHOUSE=mywarehouse
  SCHEDULE='10 m'
  CONFIG=$${"output_dir": "/temp/test_directory/", "learning_rate": 0.1}$$
  AS
    BEGIN
      LET OUTPUT_DIR STRING := SYSTEM$GET_TASK_GRAPH_CONFIG('output_dir')::string;
      LET LEARNING_RATE DECIMAL := SYSTEM$GET_TASK_GRAPH_CONFIG('learning_rate')::DECIMAL;
    ...
    END;
```

### Finalizer task

Create a finalizer task, associated with the root task of a task graph, that sends an email alert after task completion. For more
information about finalizer tasks, see [Finalizer task](../../user-guide/tasks-graphs.md).

```sqlexample
CREATE TASK finalize_task
  WAREHOUSE = my_warehouse
  FINALIZE = my_root_task
  AS
    CALL SYSTEM$SEND_EMAIL(
      'my_email_int',
      'first.last@example.com, first2.last2@example.com',
      'Email Alert: Task A has finished.',
      'Task A has successfully finished.\nStart Time: 10:10:32\nEnd Time: 12:15:45\nTotal Records Processed: 115678'
    );
```

### Triggered task

Create a triggered task, associated with a stream, that inserts data from the specified stream into the table every time there is new data in the stream. For more information, see [Triggered tasks](../../user-guide/tasks-triggered.md).

```sqlsyntax
CREATE TASK triggeredTask
  WAREHOUSE = my_warehouse
  WHEN system$stream_has_data('my_stream')
  AS
    INSERT INTO my_downstream_table
    SELECT * FROM my_stream;

ALTER TASK triggeredTask RESUME;
```

### Create and alter a simple task using the CREATE OR ALTER TASK command

Create a task `my_task` to execute every hour in warehouse `my_warehouse`:

```sqlexample
CREATE OR ALTER TASK my_task
  WAREHOUSE = my_warehouse
  SCHEDULE = '60 MINUTES'
  AS
    SELECT PI();
```

Alter task `my_task` to execute after task `my_other_task` and update the task definition:

```sqlexample
CREATE OR ALTER TASK my_task
  WAREHOUSE = regress
  AFTER my_other_task
  AS
    SELECT 2 * PI();
```

---
title: CREATE TYPE
source: https://docs.snowflake.com/en/sql-reference/sql/create-type.md
section: SQL Commands
---

# CREATE TYPE

Creates a [user-defined type](../data-types-user-defined.md).

See also:
:   [ALTER TYPE](alter-type.md) , [DESCRIBE TYPE](desc-type.md) , [SHOW TYPES](show-types.md) , [DROP TYPE](drop-type.md) , [UNDROP TYPE](undrop-type.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] TYPE [ IF NOT EXISTS ] <name> AS <type>
  [ COMMENT = '<string_literal>' ]
```

## Required parameters

`name`
:   Specifies the identifier for the user-defined type; must be unique for the schema in which the user-defined type
    is created.

    The name can’t be the same as a Snowflake type name. For example, the type name can’t be `array` or `geometry`.

    If the name is the same as a [Snowflake keyword](../reserved-keywords.md), it must be specified
    in double quotes.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`AS type`
:   An existing [Snowflake data type](../../sql-reference-data-types.md) definition.

    The specified type definition is the *base type* for the user-defined type being created.

    `type` can’t be another user-defined type.

## Optional parameters

`COMMENT = 'string_literal'`
:   Specifies a comment for the user-defined type.

    Default: No value

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE TYPE | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

Use the CREATE TYPE command to create a user-defined type based on the NUMBER data type:

```sqlexample
CREATE TYPE age AS NUMBER(3,0);
```

Create a user-defined type based on the OBJECT data type:

```sqlexample
CREATE TYPE path AS OBJECT(
  relative BOOLEAN,
  segments ARRAY(STRING)
);
```

For more examples, see [Examples for user-defined data types](../data-types-user-defined.md).

---
title: CREATE USER
source: https://docs.snowflake.com/en/sql-reference/sql/create-user.md
section: SQL Commands
---

# CREATE USER

Creates a new user or replaces an existing user in the system. For more details, see [User management](../../user-guide/admin-user-management.md).

> **Note:**
>
> Only user administrators (that is, users with the USERADMIN role or higher), or another role with the CREATE USER privilege on the account,
> can create users.

See also:
:   [DROP USER](drop-user.md) , [ALTER USER](alter-user.md) , [DESCRIBE USER](desc-user.md) , [SHOW PARAMETERS](show-parameters.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] USER [ IF NOT EXISTS ] <name>
  [ objectProperties ]
  [ objectParams ]
  [ sessionParams ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   PASSWORD = '<string>'
>   LOGIN_NAME = <string>
>   DISPLAY_NAME = <string>
>   FIRST_NAME = <string>
>   MIDDLE_NAME = <string>
>   LAST_NAME = <string>
>   EMAIL = <string>
>   MUST_CHANGE_PASSWORD = { TRUE | FALSE }
>   DISABLED = { TRUE | FALSE }
>   ALLOWED_INTERFACES = ( <list_of_interfaces> )
>   DAYS_TO_EXPIRY = <integer>
>   MINS_TO_UNLOCK = <integer>
>   DEFAULT_WAREHOUSE = <string>
>   DEFAULT_NAMESPACE = <string>
>   DEFAULT_ROLE = <string>
>   DEFAULT_SECONDARY_ROLES = { ( 'ALL' ) | () }
>   MINS_TO_BYPASS_MFA = <integer>
>   RSA_PUBLIC_KEY = <string>
>   RSA_PUBLIC_KEY_FP = <string>
>   RSA_PUBLIC_KEY_2 = <string>
>   RSA_PUBLIC_KEY_2_FP = <string>
>   TYPE = { PERSON | SERVICE | LEGACY_SERVICE }
>   WORKLOAD_IDENTITY = ( <list_of_properties> )
>   COMMENT = '<string_literal>'
> ```
>
> ```sqlsyntax
> objectParams ::=
>   ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR = TRUE | FALSE
>   ENABLE_UNREDACTED_SECURE_OBJECT_ERROR = TRUE | FALSE
>   NETWORK_POLICY = <string>
> ```
>
> ```sqlsyntax
> sessionParams ::=
>   ABORT_DETACHED_QUERY = TRUE | FALSE
>   AUTOCOMMIT = TRUE | FALSE
>   BINARY_INPUT_FORMAT = <string>
>   BINARY_OUTPUT_FORMAT = <string>
>   DATE_INPUT_FORMAT = <string>
>   DATE_OUTPUT_FORMAT = <string>
>   DEFAULT_NULL_ORDERING = <string>
>   ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS = TRUE | FALSE
>   ERROR_ON_NONDETERMINISTIC_MERGE = TRUE | FALSE
>   ERROR_ON_NONDETERMINISTIC_UPDATE = TRUE | FALSE
>   JSON_INDENT = <num>
>   LOCK_TIMEOUT = <num>
>   OPT_OUT_ERROR_LOGGING = TRUE | FALSE
>   QUERY_TAG = <string>
>   ROWS_PER_RESULTSET = <num>
>   SIMULATED_DATA_SHARING_CONSUMER = <string>
>   STATEMENT_TIMEOUT_IN_SECONDS = <num>
>   STRICT_JSON_OUTPUT = TRUE | FALSE
>   TIMESTAMP_DAY_IS_ALWAYS_24H = TRUE | FALSE
>   TIMESTAMP_INPUT_FORMAT = <string>
>   TIMESTAMP_LTZ_OUTPUT_FORMAT = <string>
>   TIMESTAMP_NTZ_OUTPUT_FORMAT = <string>
>   TIMESTAMP_OUTPUT_FORMAT = <string>
>   TIMESTAMP_TYPE_MAPPING = <string>
>   TIMESTAMP_TZ_OUTPUT_FORMAT = <string>
>   TIMEZONE = <string>
>   TIME_INPUT_FORMAT = <string>
>   TIME_OUTPUT_FORMAT = <string>
>   TRANSACTION_DEFAULT_ISOLATION_LEVEL = <string>
>   TWO_DIGIT_CENTURY_START = <num>
>   UNSUPPORTED_DDL_ACTION = <string>
>   USE_CACHED_RESULT = TRUE | FALSE
>   WEEK_OF_YEAR_POLICY = <num>
>   WEEK_START = <num>
> ```

> **Note:**
>
> For readability, the complete list of session parameters that can be set for a user is not included here. For a complete list of all
> session parameters, with their descriptions, as well as account and object parameters, see [Parameters](../parameters.md).

## Required parameters

`name`
:   Identifier for the user; must be unique for your account.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

> **Note:**
>
> The user does not use this value to log into Snowflake; instead, the user uses the value specified for the `LOGIN_NAME`
> property to log in. However, if no login name is explicitly specified for the user, the user name/identifier serves as the default
> login name.

## Optional object properties (`objectProperties`)

`PASSWORD = 'string'`
:   The password for the user must be enclosed in single or double quotes. If no password is specified, the user cannot log into Snowflake
    until a password has been explicitly specified for them.

    If the password uses the backslash (i.e. `\`) character, escape the character with a backslash or use double dollar sign (i.e. `$$`)
    delimiters when specifying the password in a SQL command. For details, refer to [String & binary data types](../data-types-text.md).

    For more information about passwords in Snowflake, refer to [Password policies](../../user-guide/password-authentication.md).

    Default: `NULL`

`LOGIN_NAME = string`
:   Name that the user enters to log into the system. Login names for users must be unique across your entire account.

    A login name can be any string, including spaces and non-alphanumeric characters, such as exclamation points (`!`), percent signs
    (`%`), and asterisks (`*`); however, if the string contains spaces or non-alphanumeric characters, it must be enclosed in single
    or double quotes. Login names are always case-insensitive.

    Snowflake allows specifying different user and login names to enable using common identifiers (e.g. email addresses) for login.

    Default: User’s name/identifier (i.e. if no value is specified, the value specified for `name` is used as the login name)

`DISPLAY_NAME = string`
:   Name displayed for the user in the Snowflake web interface.

    Default: User’s name/identifier (i.e. if no value is specified, the value specified for `name` is used as the display name)

`FIRST_NAME = string` , . `MIDDLE_NAME = string` , . `LAST_NAME = string`
:   First, middle, and last name of the user.

    Default: `NULL`

`EMAIL = string`
:   Email address for the user.

    An email address is not required to use Snowflake; however, to access the Snowflake Community to open support tickets or contribute to
    the community forums, a valid email address must be specified for the user.

    We recommend specifying a business email address rather than a personal email address. User email addresses are visible to all other
    users in your Snowflake account.

    Default: `NULL`

`MUST_CHANGE_PASSWORD = TRUE | FALSE`
:   Specifies whether the user is forced to change their password on next login (including their first/initial login) into the system.

    Default: `FALSE`

`DISABLED = TRUE | FALSE`
:   Specifies whether the user is disabled, which prevents the following actions:

    * For a new user, the user is locked out of Snowflake and cannot log in.
    * For an existing user, setting the property aborts all their currently-running queries and does not allow the user to issue any new
      queries; the user is also immediately locked out of Snowflake and cannot log back in.

    Default: `FALSE`

`ALLOWED_INTERFACES = ( {  'ALL' | 'interface' [ , ... ] } )`
:   Specifies which Snowflake interfaces the user can access.

    If you specify `('ALL')`, the user can access Snowsight and all other interfaces that can be
    specified for this property. If you specify one or more interfaces, the user can only access the interfaces
    specified and can’t interact with any Snowflake data outside of the interfaces specified.

    For `interface`, you can specify one or more of the following values in a comma-delimited list:

    > `SNOWFLAKE_INTELLIGENCE`
    > :   The user can access [Snowflake Intelligence](../../user-guide/snowflake-cortex/snowflake-intelligence.md).
    >
    > `STREAMLIT`
    > :   The user can access Streamlit apps through the app-viewer URLs.

    Default: `('ALL')`

`DAYS_TO_EXPIRY = integer`
:   Specifies the number of days after which the user status is set to “Expired” and the user is no longer allowed to log in. This is useful
    for defining temporary users (that is, users who should only have access to Snowflake for a limited time period).

    Setting `DAYS_TO_EXPIRY` for [account administrators](../../user-guide/security-access-control-considerations.md) (that is, users with the ACCOUNTADMIN role) is not
    allowed. If you set `DAYS_TO_EXPIRY` for [account administrators](../../user-guide/security-access-control-considerations.md), Snowflake ignores the setting.

    Once set, the value counts down to `0`, but doesn’t stop. A negative value indicates the status for the user is “Expired”. To reset
    the value, use [ALTER USER](alter-user.md) to set the following values:

    * To re-enable the user as a temporary user, set the value to a value greater than `0`.
    * To specify the user as a permanent user, set the value to `NULL` or `0`.

    Default: `NULL`

`MINS_TO_UNLOCK = integer`
:   Specifies the number of minutes until the temporary lock on the user login is cleared. To protect against unauthorized user login,
    Snowflake places a temporary lock on a user after five consecutive unsuccessful login attempts:

    * A positive value indicates the status for the user is “Locked”.
    * Once the value counts down to `0` (or a negative value), the lock is cleared and the user is allowed to log in again.
    * When the user successfully logs into Snowflake, the value resets to `NULL`.

    When creating a user, this property can be set to prevent them from logging in until the specified amount of time passes.

    To remove a lock immediately for a user, use [ALTER USER](alter-user.md) and specify a value of `0` for this parameter.

    Default: `NULL`

`DEFAULT_WAREHOUSE = string`
:   Specifies the virtual warehouse that is active by default for the user’s session upon login.

    A user can specify or change their current default virtual warehouse using [ALTER USER](alter-user.md). In addition, after starting a session
    (i.e. logging in), a user can change the virtual warehouse for the session using [USE WAREHOUSE](use-warehouse.md).

    Note that the CREATE USER operation does not verify that the warehouse exists.

    Default: `NULL`

`DEFAULT_NAMESPACE = string`
:   Specifies the namespace (database only or database and schema) that is active by default for the user’s session upon login:

    * To specify a database only, enter the database name.
    * To specify a schema, enter the fully-qualified schema name in the form of `db_name.schema_name`.

    A user can specify or change their current default namespace using [ALTER USER](alter-user.md). In addition, after starting a session
    (i.e. logging in), a user can change the namespace for their session using [USE DATABASE](use-database.md) or [USE SCHEMA](use-schema.md).

    Note that the CREATE USER operation does not verify that the namespace exists.

    Default: `NULL`

`DEFAULT_ROLE = string`
:   Specifies the primary role that is active by default for the user’s session upon login. The primary role is a single role that
    authorizes the execution of [CREATE <object>](create.md) statements or any other SQL action. The permissions to perform these
    actions can be granted to the primary role or any lower role in the role hierarchy.

    Note that specifying a default role for a user does not grant the role to the user. The role must be granted explicitly to the
    user using the [GRANT ROLE](grant-role.md) command. In addition, the CREATE USER operation does not verify that the role exists.

    A user can specify or change their current default role using [ALTER USER](alter-user.md). In addition, after starting a session (i.e. logging in),
    a user can change the role for the session using [USE ROLE](use-role.md). In either case, they can only choose from roles that have been
    explicitly granted to them.

    Default: `NULL`

`DEFAULT_SECONDARY_ROLES = ( 'ALL' ) | ()`
:   Specifies the set of secondary roles that are active for the user’s session upon login. Secondary roles are a set of roles that authorize
    any SQL action other than the execution of CREATE *<object>* statements. The permissions to perform these actions can be granted
    to the primary role, secondary roles, or any lower roles in the role hierarchies.

    Note that specifying a default secondary role for a user does not grant the role to the user. The role must also be granted
    explicitly to the user using the GRANT ROLE command.

    The following values are supported:

    > `('ALL')`
    > :   All roles that have been granted to the user.
    >
    >     Note that the set of roles is reevaluated when each SQL statement executes. If additional roles are granted to the user, and that
    >     user executes a new SQL statement, the newly granted roles are active secondary roles for the new SQL statement. The same logic
    >     applies to roles that are revoked from a user.
    >
    > `()`
    > :   No roles.

    Default: `ALL`

`MINS_TO_BYPASS_MFA = integer`
:   Specifies the number of minutes to temporarily bypass MFA for the user.

    This property can be used to allow a MFA-enrolled user to temporarily bypass MFA during login in the event that their MFA device is
    not available.

`RSA_PUBLIC_KEY = string`
:   Specifies the user’s RSA public key; used for [key pair authentication](../../user-guide/key-pair-auth.md).

`RSA_PUBLIC_KEY_FP = string`
:   Specifies the fingerprint of the user’s RSA public key; used for [key pair authentication](../../user-guide/key-pair-auth.md).

`RSA_PUBLIC_KEY_2 = string`
:   Specifies the user’s second RSA public key; used to rotate the public and private keys for
    [key pair authentication](../../user-guide/key-pair-auth.md) based on an expiration schedule set by your organization.

`RSA_PUBLIC_KEY_2_FP = string`
:   Specifies the fingerprint of the user’s second RSA public key; used to rotate the public and private keys for
    [key pair authentication](../../user-guide/key-pair-auth.md) based on an expiration schedule set by your organization.

`TYPE = { PERSON | SERVICE | LEGACY_SERVICE }`
:   Specifies the type of user. You can set this property to differentiate between human, service, and legacy service users. For information
    about the characteristics of these types of users, see [Types of users](../../user-guide/admin-user-management.md).

    `PERSON`
    :   User is a human user who can interact with Snowflake.

    `SERVICE`
    :   User is a service or application that interacts with Snowflake without human intervention.

    `LEGACY_SERVICE`
    :   A user with their `TYPE` property set to `LEGACY_SERVICE` represents a non-interactive integration. It is similar to
        `SERVICE`, but allows password and SAML authentication.

        > **Note:**
        >
        > The LEGACY_SERVICE type is being deprecated. Use the SERVICE type for services and applications. For a timeline of the deprecation of
        > LEGACY_SERVICE, see [Planning for the deprecation of single-factor password sign-ins](../../user-guide/security-mfa-rollout.md).

    Default: `PERSON`

`WORKLOAD_IDENTITY = ( list_of_properties )`
:   Configures the user to authenticate by using [workload identity federation](../../user-guide/workload-identity-federation.md).

    The following list shows the properties:

    `TYPE = { AWS | AZURE | GCP | OIDC }`
    :   Specifies the provider that issues the attestation that is sent by the application or workload to Snowflake.

    `ARN = 'string'`
    :   Required for `TYPE=AWS`. Not valid for other types.

        Specifies the Amazon Resource Identifier (ARN) that uniquely identifies the AWS user or role that is associated with the instance
        authenticating to Snowflake. Snowflake accepts the following forms of [IAM identifiers](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference_identifiers.html):

        * `arn:aws:iam::account:user/user_name_with_path`
        * `arn:aws:iam::account:role/role_name_with_path`
        * `arn:aws:sts::account:assumed_role/role_name/role_session_name`

        For help obtaining the ARN, see [Configure Snowflake](../../user-guide/workload-identity-federation.md).

    `ISSUER = 'string'`
    :   Required for `TYPE=AZURE` and `TYPE=OIDC`. Not valid for other types.

        * For `TYPE=AZURE`, specifies the Entra ID tenant’s Authority URL in the following form:

          `https://login.microsoftonline.com/tenant/v2.0`

          For help obtaining this URL, see [Configure Microsoft Azure](../../user-guide/workload-identity-federation.md).
        * For `TYPE=OIDC`, specifies the OpenID Connect (OIDC) issuer URL. An OIDC provider is identified by its issuer URL.

          For examples of how to obtain this issuer URL for different OIDC providers, [Use cases](../../user-guide/workload-identity-federation.md).

    `SUBJECT = 'string'`
    :   Required for `TYPE=AZURE`, `TYPE=GCP`, and `TYPE=OIDC`. Not valid for other types.

        * For `TYPE=AZURE`, specifies the case-sensitive Object ID (Principal ID) of the managed identity assigned to the Azure workload.
        * For `TYPE=GCP`, specifies the `uniqueId` property of the service account associated with the workload that is connecting to
          Snowflake.

          For help obtaining this identifier, see [Configure Snowflake](../../user-guide/workload-identity-federation.md).
        * For `TYPE=OIDC`, specifies the identifier of the workload that is connecting to Snowflake. The format of the value is specific to the
          OIDC provider that is issuing the attestation.

          For examples of how to construct the subject of an attestation issued by an OIDC provider, see [Use cases](../../user-guide/workload-identity-federation.md).

    `OIDC_AUDIENCE_LIST = ( 'string' [ , 'string' ... ] )`
    :   Optional for `TYPE=OIDC`. Not valid for other types.

        Specifies which values must be present in the `aud` claim of the ID token issued by the OIDC provider. Snowflake
        accepts the attestation if the `aud` claim contains at least one of the specified audiences.

        If omitted or empty, the audience is assumed to be `snowflakecomputing.com`.

`COMMENT = 'string_literal'`
:   Specifies a comment for the user.

    Default: `NULL`

## Optional object parameters (`objectParams`)

`ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR = { TRUE | FALSE }`
:   Controls how queries that fail due to syntax or parsing errors show up in a query history. If FALSE, the contents of a
    failed query is redacted from the views, pages, and functions that provide a query history.

    This parameter controls behavior for the user viewing the query history, not the user who executed the query.

    Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR parameter.

`ENABLE_UNREDACTED_SECURE_OBJECT_ERROR = { TRUE | FALSE }`
:   Controls whether error messages related to secure objects are redacted in metadata. For more information about
    error message redaction for secure objects, see [Secure objects: Redaction of information in error messages](../../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

    Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_SECURE_OBJECT_ERROR parameter.

    When using the ALTER USER command to set the parameter to `TRUE` for a particular user, modify the user that you want to see the
    redacted error messages in metadata, not the user who caused the error.

`NETWORK_POLICY = string`
:   Specifies an existing [network policy](../../user-guide/network-policies.md) is active for the user. The network policy restricts the
    list of user IP addresses when exchanging an authorization code for an access or refresh token and when using a refresh token to
    obtain a new access token.

    If this parameter is not set, the network policy for the account (if any) is used instead.

## Optional session parameters (`sessionParams`)

Specifies one (or more) session parameter defaults to set for the user (separated by blank spaces, commas, or new lines). These defaults
are set each time the user logs into Snowflake and initiates session. The user can always change these defaults themselves within the
session using [ALTER SESSION](alter-session.md).

For the complete list of session parameters, including their default values, that can be specified for a user, see
[Parameters](../parameters.md).

## Optional parameters

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE USER | Account | Only the USERADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The `TYPE` property of a new user object can’t be `NULL`. You can’t set the `TYPE` property of an existing user to `NULL`.
  Running a CREATE USER command without setting the `TYPE` property sets the `TYPE` property for that user to `PERSON`.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

* The OR REPLACE and IF NOT EXISTS clauses are mutually exclusive. They can’t both be used in the same statement.
* CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

## Examples

Create a user with all default properties, a default role, and a basic password that must be changed by the user after their first
login:

> ```sqlexample
> CREATE USER user1 PASSWORD='abc123' DEFAULT_ROLE = myrole DEFAULT_SECONDARY_ROLES = ('ALL') MUST_CHANGE_PASSWORD = TRUE;
> ```

---
title: CREATE VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/create-view.md
section: SQL Commands
---

# CREATE VIEW

Creates a new view in the current/specified schema, based on a query of one or more existing tables (or any other valid query expression).

This command supports the following variants:

* CREATE OR ALTER VIEW: Creates a view if it doesn’t exist or alters an existing view.

See also:
:   [ALTER VIEW](alter-view.md) , [DROP VIEW](drop-view.md) , [SHOW VIEWS](show-views.md) , [DESCRIBE VIEW](desc-view.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] [ SECURE ] [ { [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | VOLATILE } ] [ RECURSIVE ] VIEW [ IF NOT EXISTS ] <name>
  [ ( <column_list> ) ]
  [ <col1> [ WITH ] MASKING POLICY <policy_name> [ USING ( <col1> , <cond_col1> , ... ) ]
           [ WITH ] PROJECTION POLICY <policy_name>
           [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ , <col2> [ ... ] ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] AGGREGATION POLICY <policy_name> [ ENTITY KEY ( <col_name> [ , <col_name> ... ] ) ] ]
  [ [ WITH ] JOIN POLICY <policy_name> [ ALLOWED JOIN KEYS ( <col_name> [ , ... ] ) ] ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
  [ WITH CONTACT ( <purpose> = <contact_name> [ , <purpose> = <contact_name> ... ] ) ]
  AS <select_statement>
```

## Variant syntax

### CREATE OR ALTER VIEW

Creates a new view if it doesn’t already exist, or updates the properties of an existing view to match those defined in the statement.
A CREATE OR ALTER VIEW statement follows the syntax rules of a CREATE VIEW statement and has the same limitations as an
[ALTER VIEW](alter-view.md) statement.

The following modifications are supported:

* Converting to (or reverting from) a secure view.
* Adding, overwriting, removing a comment for a view or a view’s columns.
* Enabling or disabling change tracking for a view.

For more information, see CREATE OR ALTER VIEW usage notes and [CREATE OR ALTER <object>](create-or-alter.md).

```sqlsyntax
CREATE OR ALTER [ SECURE ] [ { [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | VOLATILE } ] [ RECURSIVE ] VIEW <name>
  [ ( <column_list> ) ]
  [ CHANGE_TRACKING =  { TRUE | FALSE } ]
  [ COMMENT = '<string_literal>' ]
  AS <select_statement>
```

## Required parameters

`name`
:   Specifies the identifier for the view; must be unique for the schema in which the view is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

`select_statement`
:   Specifies the query used to create the view. Can be on one or more source tables or any other valid [SELECT](select.md) statement. This
    query serves as the text/definition for the view and is displayed in the [SHOW VIEWS](show-views.md) output and the
    [VIEWS](../info-schema/views.md) Information Schema view.

## Optional parameters

`SECURE`
:   Specifies that the view is secure. For more information about secure views, see [Working with Secure Views](../../user-guide/views-secure.md).

    Default: No value (view is not secure)

`{ [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | VOLATILE }`
:   Specifies that the view persists only for the duration of the [session](../../user-guide/session-policies.md) that you created it in. A
    temporary view and all its contents are dropped at the end of the session.

    The synonyms and abbreviations for `TEMPORARY` (e.g. `GLOBAL TEMPORARY`) are provided for compatibility with other databases
    (e.g. to prevent errors when migrating CREATE VIEW statements). Views created with any of these keywords appear and behave identically to
    a view created with the `TEMPORARY` keyword.

    Default: No value. If a view is not declared as `TEMPORARY`, the view is permanent.

    If you want to avoid unexpected conflicts, avoid naming temporary views after views that already exist in the schema.

    If you created a temporary view with the same name as another view in the schema, all queries and operations used on the view only affect
    the temporary view in the session, until you drop the temporary view. If you drop the view, you drop the temporary view, and not the view
    that already exists in the schema.

`RECURSIVE`
:   Specifies that the view can refer to itself using recursive syntax without necessarily using a CTE (common table
    expression). For more information about recursive views in general, and the RECURSIVE keyword in particular,
    see [Recursive Views (Non-materialized Views Only)](../../user-guide/views-introduction.md) and the recursive view examples below.

    Default: No value (view is not recursive, or is recursive only by using a CTE)

`column_list`
:   If you want to change the name of a column or add a comment to a column in the new view,
    include a column list that specifies the column names and (if needed) comments about
    the columns. (You do not need to specify the data types of the columns.)

    If any of the columns in the view are based on expressions (not just simple column names), then you must supply
    a column name for each column in the view. For example, the column names are required in the following case:

    ```sqlexample
    CREATE VIEW v1 (pre_tax_profit, taxes, after_tax_profit) AS
        SELECT revenue - cost, (revenue - cost) * tax_rate, (revenue - cost) * (1.0 - tax_rate)
        FROM table1;
    ```

    You can specify an optional comment for each column. For example:

    ```sqlexample
    CREATE VIEW v1 (pre_tax_profit COMMENT 'revenue minus cost',
                    taxes COMMENT 'assumes taxes are a fixed percentage of profit',
                    after_tax_profit)
        AS
        SELECT revenue - cost, (revenue - cost) * tax_rate, (revenue - cost) * (1.0 - tax_rate)
        FROM table1;
    ```

    Comments are particularly helpful when column names are cryptic.

    To view comments, use [DESCRIBE VIEW](desc-view.md).

`MASKING POLICY = policy_name`
:   Specifies the [masking policy](../../user-guide/security-column-intro.md) to set on a column.

`USING ( col_name , cond_col_1 ... )`
:   Specifies the arguments to pass into the conditional masking policy SQL expression.

    The first column in the list specifies the column for the policy conditions to mask or tokenize the data and must match the
    column to which the masking policy is set.

    The additional columns specify the columns to evaluate to determine whether to mask or tokenize the data in each row of the query result
    when a query is made on the first column.

    If the USING clause is omitted, Snowflake treats the conditional masking policy as a normal
    [masking policy](../../user-guide/security-column-intro.md).

`PROJECTION POLICY policy_name`
:   Specifies the [projection policy](../../user-guide/projection-policies.md) to set on a column.

`CHANGE_TRACKING = { TRUE | FALSE }`
:   Specifies whether to enable change tracking on the view.

    * `TRUE` enables change tracking on the view. This setting adds a pair of hidden columns to the source table and begins
      storing change tracking metadata in the columns. These columns consume a small amount of storage.

      The change-tracking metadata can be queried using the [CHANGES](../constructs/changes.md) clause for
      [SELECT](select.md) statements, or by creating and querying one or more streams on the table.
    * `FALSE` does not enable change tracking on the view.

`COPY GRANTS`
:   Retains the access permissions from the original view when a new view is created using the `OR REPLACE` clause.

    The parameter copies all privileges, except OWNERSHIP, from the existing view to the new view. The new view does not
    inherit any future grants defined for the object type in the schema. By default, the role that executes the CREATE VIEW statement owns
    the new view.

    If the parameter is not included in the CREATE VIEW statement, then the new view does not inherit any explicit access
    privileges granted on the original view but does inherit any future grants defined for the object type in the schema.

    Note that the operation to copy grants occurs atomically with the CREATE VIEW statement (i.e. within the same transaction).

    Default: No value (grants are not copied)

`COMMENT = 'string_literal'`
:   Specifies a comment for the view.

    Default: No value

`ROW ACCESS POLICY policy_name ON ( col_name [ , col_name ... ] )`
:   Specifies the [row access policy](../../user-guide/security-row-intro.md) to set on a view.

`AGGREGATION POLICY policy_name [ ENTITY KEY ( col_name [ , col_name ... ] ) ]`
:   Specifies the [aggregation policy](../../user-guide/aggregation-policies.md) to set on a view.

    Use the optional ENTITY KEY parameter to define which columns uniquely identity an entity within the view. For more information, see
    [Implementing entity-level privacy with aggregation policies](../../user-guide/aggregation-policies-entity-privacy.md).

`JOIN POLICY policy_name [ ALLOWED JOIN KEYS ( col_name [ , ... ] ) ]`
:   Specifies the [join policy](../../user-guide/join-policies.md) to set on a view.

    Use the optional ALLOWED JOIN KEYS parameter to define which columns are allowed to be used as joining columns when
    this policy is in effect. For more information, see [Join policies](../../user-guide/join-policies.md).

    This parameter is not supported by the CREATE OR ALTER variant syntax.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`WITH CONTACT ( purpose = contact [ , purpose = contact ...] )`
:   Associate the new object with one or more [contacts](../../user-guide/contacts-using.md).

    Specify the WITH CONTACT clause after all other clauses except the AS clause (if that clause is supported by this command).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE VIEW | Schema | Required to create a new view. |
| SELECT | Table, external table, view | Required on any tables and/or views queried in the view definition. |
| APPLY | Masking policy, row access policy, tag | Required only when applying a masking policy, row access policy, object tags, or any combination of these [governance](../../guides-overview-govern.md) features when creating views. |
| OWNERSHIP | View | * A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object   that already exists in the schema. * Required to execute a CREATE OR ALTER VIEW statement for an *existing* view.   OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege).  Note that in a [managed access schema](../../user-guide/security-access-control-configure.md), only the schema owner (i.e. the role with the OWNERSHIP privilege on the schema) or a role with the MANAGE GRANTS privilege can grant or revoke privileges on objects in the schema, including future grants. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* A view definition can include an [ORDER BY](../constructs/order-by.md) clause
  (e.g. `create view v1 as select * from t1 ORDER BY column1`). However, Snowflake recommends excluding
  the `ORDER BY` clause from most view definitions. If the view is used in contexts that don’t benefit from sorting,
  then the `ORDER BY` clause adds unnecessary costs. For example, when the view is used in a join, and the join
  column is not the same as the `ORDER BY` column, the extra cost to sort the view’s results is typically wasted.
  If you need to sort the query results, it’s usually more efficient to specify `ORDER BY` in the query that uses
  the view, rather than in the view itself.
* If you specify the [CURRENT_DATABASE](../functions/current_database.md) or [CURRENT_SCHEMA](../functions/current_schema.md) function in the
  definition of the view, the function returns the database or schema that contains the view, not the database or schema in
  use for the session.
* The definition for a view is limited to 95KB.
* Nesting levels are limited to a maximum of 20. An attempt to create a view that is nested more than 20 times will fail.
* View definitions are not dynamic. A view is not automatically updated if the underlying sources are modified such that they no longer
  match the view definition, particularly when columns are dropped. For example:

  + A view is created referencing a specific column in a source table, and the column is subsequently dropped from the table.
  + A view is created using `SELECT *` from a table, and changes are made to the columns in the table, such as:

    - A column is dropped.
    - A column is added.
    - The column order changes.

  In these scenarios, querying the view returns a column-related error.
* When you create a view, the view’s columns inherit the [collation specifications](../collation.md)
  of the columns in the source tables.
* If a source table for a view is dropped, querying the view returns an `object does not exist` error.
* A schema cannot contain a table and view with the same name. A CREATE VIEW statement produces an error if a table with the same name
  already exists in the schema.
* When a view is created, [unqualified](../name-resolution.md) references to tables and other database
  objects are resolved in the view’s schema, not in the session’s current schema. Similarly, objects that are
  partially qualified (i.e. schema.object) are resolved in the view’s database, not in the session’s current database.

  The `SEARCH_PATH` session parameter (if present) is ignored.
* Using `OR REPLACE` is the equivalent of using [DROP VIEW](drop-view.md) on the existing view and then creating a new view with the same
  name.

  CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

  This means that any queries concurrent with the CREATE OR REPLACE VIEW operation use either the old or new view version.

  Recreating or swapping a view drops its change data, which makes any stream on the view stale. A
  [stale](../../user-guide/streams-intro.md) stream is unreadable.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* Using `COPY GRANTS`:

  + Data sharing:

    - If the existing secure view was shared to another account, the replacement view is also shared.
    - If the existing secure view was shared with your account as a data consumer, and access was further granted to other roles in the
      account (using GRANT IMPORTED PRIVILEGES on the parent database), access is also granted to the replacement view.
  + The [SHOW GRANTS](show-grants.md) output for the replacement view lists the grantee for the copied privileges as the role
    that executed the CREATE VIEW statement, with the current timestamp when the statement was executed.
* When you create a view and then grant privileges on that view to a role, the role can use the view even if the role does not have
  privileges on the underlying table(s) that the view accesses. This means that you can create a view to give a role access to only
  a subset of a table. For example, you can create a view that accesses medical billing information but not medical diagnosis
  information in the same table. Then you can grant privileges on that view to the “accountant” role so that the accountants
  can look at the billing information without seeing the patient’s diagnosis.
* By design, the [SHOW VIEWS](show-views.md) command does not provide information about secure views. To view information about a secure view,
  you must use the [VIEWS](../info-schema/views.md) view in the Information Schema and you must use the role that owns
  the view.
* A recursive view must provide a column name list.
* When defining recursive views, prevent infinite recursion. The WHERE clause in the recursive view definition should enable the
  recursion to stop eventually, typically by running out of data after processing the last level of a hierarchy of data.
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* When creating a view with a masking policy on one or more view columns, or a row access policy added to the view, use the
  [POLICY_CONTEXT](../functions/policy_context.md) function to simulate a query on the column(s) protected by a masking policy and the
  view protected by a row access policy.
* Avoid creating views that use streams as source objects, including those using the CHANGES clause (for example, `CHANGES (...) AT(STREAM => ...)`).
  This setup only works if the same role owns both the view and the source streams. For example, the same role, or a lower role in a role hierarchy,
  has the OWNERSHIP privilege on the view and source streams.

  Instead, create views directly from the source objects that you want to track. Then, create streams on those views.

  For more information, see [Streams on views](../../user-guide/streams-intro.md).

## Porting notes

* Some vendors support the `FORCE` keyword:

  > ```sqlexample
  > CREATE OR REPLACE FORCE VIEW ...
  > ```

  Snowflake accepts the `FORCE` keyword, but does not support it. In other words, you do not get a syntax error if you use this
  keyword, but using `FORCE` does not force the creation of a view if the underlying database objects (table(s) or view(s))
  do not already exist. Attempting to create a view of a non-existent table or view results in an error message even if the
  `FORCE` keyword is used.
* When looking up the tables in a view, some vendors search for unqualified table names in the active schema; Snowflake searches
  for unqualified table names
  [in the same schema as the view](../../user-guide/views-introduction.md).
  When porting to Snowflake, consider updating views to use fully-qualified table names.

## CREATE OR ALTER VIEW usage notes

* All limitations of the [ALTER VIEW](alter-view.md) command apply.
* This command *doesn’t* support the following:

  + Renaming a view using the RENAME TO parameter.
  + Adding or changing tags and policies. Any existing tags and policies are preserved.
  + Converting a TEMPORARY view into a permanent view, or vice versa.
  + Reordering columns in a view definition.

## Examples

### Basic examples

Create a view in the current schema, with a comment, that selects all the rows from a table:

> ```sqlexample
> CREATE VIEW myview COMMENT='Test view' AS SELECT col1, col2 FROM mytable;
>
> SHOW VIEWS;
>
> +---------------------------------+-------------------+----------+---------------+-------------+----------+-----------+--------------------------------------------------------------------------+
> | created_on                      | name              | reserved | database_name | schema_name | owner    | comment   | text                                                                     |
> |---------------------------------+-------------------+----------+---------------+-------------+----------+-----------+--------------------------------------------------------------------------|
> | Thu, 19 Jan 2017 15:00:37 -0800 | MYVIEW            |          | MYTEST1       | PUBLIC      | SYSADMIN | Test view | CREATE VIEW myview COMMENT='Test view' AS SELECT col1, col2 FROM mytable |
> +---------------------------------+-------------------+----------+---------------+-------------+----------+-----------+--------------------------------------------------------------------------+
> ```

The next example is the same as the previous example, except the view is secure:

> ```sqlexample
> CREATE OR REPLACE SECURE VIEW myview COMMENT='Test secure view' AS SELECT col1, col2 FROM mytable;
>
> SELECT is_secure FROM information_schema.views WHERE table_name = 'MYVIEW';
> ```

The following shows two ways of creating recursive views:

> First, create and load the table:
>
> ```sqlexample
> CREATE OR REPLACE TABLE employees (title VARCHAR, employee_ID INTEGER, manager_ID INTEGER);
> ```
>
> ```sqlexample
> INSERT INTO employees (title, employee_ID, manager_ID) VALUES
>     ('President', 1, NULL),  -- The President has no manager.
>         ('Vice President Engineering', 10, 1),
>             ('Programmer', 100, 10),
>             ('QA Engineer', 101, 10),
>         ('Vice President HR', 20, 1),
>             ('Health Insurance Analyst', 200, 20);
> ```
>
> Create a view using a recursive CTE, and then query the view.
>
> ```sqlexample
> CREATE VIEW employee_hierarchy (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>    WITH RECURSIVE employee_hierarchy_cte (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>       -- Start at the top of the hierarchy ...
>       SELECT title, employee_ID, manager_ID, NULL AS "MGR_EMP_ID (SHOULD BE SAME)", 'President' AS "MGR TITLE"
>         FROM employees
>         WHERE title = 'President'
>       UNION ALL
>       -- ... and work our way down one level at a time.
>       SELECT employees.title,
>              employees.employee_ID,
>              employees.manager_ID,
>              employee_hierarchy_cte.employee_id AS "MGR_EMP_ID (SHOULD BE SAME)",
>              employee_hierarchy_cte.title AS "MGR TITLE"
>         FROM employees INNER JOIN employee_hierarchy_cte
>        WHERE employee_hierarchy_cte.employee_ID = employees.manager_ID
>    )
>    SELECT *
>       FROM employee_hierarchy_cte
> );
> ```
>
> ```sqlexample
> SELECT *
>     FROM employee_hierarchy
>     ORDER BY employee_ID;
> +----------------------------+-------------+------------+-----------------------------+----------------------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MGR_EMP_ID (SHOULD BE SAME) | MGR TITLE                  |
> |----------------------------+-------------+------------+-----------------------------+----------------------------|
> | President                  |           1 |       NULL |                        NULL | President                  |
> | Vice President Engineering |          10 |          1 |                           1 | President                  |
> | Vice President HR          |          20 |          1 |                           1 | President                  |
> | Programmer                 |         100 |         10 |                          10 | Vice President Engineering |
> | QA Engineer                |         101 |         10 |                          10 | Vice President Engineering |
> | Health Insurance Analyst   |         200 |         20 |                          20 | Vice President HR          |
> +----------------------------+-------------+------------+-----------------------------+----------------------------+
> ```
>
> Create a view using the keyword RECURSIVE, and then query the view.
>
> ```sqlexample
> CREATE RECURSIVE VIEW employee_hierarchy_02 (title, employee_ID, manager_ID, "MGR_EMP_ID (SHOULD BE SAME)", "MGR TITLE") AS (
>       -- Start at the top of the hierarchy ...
>       SELECT title, employee_ID, manager_ID, NULL AS "MGR_EMP_ID (SHOULD BE SAME)", 'President' AS "MGR TITLE"
>         FROM employees
>         WHERE title = 'President'
>       UNION ALL
>       -- ... and work our way down one level at a time.
>       SELECT employees.title,
>              employees.employee_ID,
>              employees.manager_ID,
>              employee_hierarchy_02.employee_id AS "MGR_EMP_ID (SHOULD BE SAME)",
>              employee_hierarchy_02.title AS "MGR TITLE"
>         FROM employees INNER JOIN employee_hierarchy_02
>         WHERE employee_hierarchy_02.employee_ID = employees.manager_ID
> );
> ```
>
> ```sqlexample
> SELECT *
>     FROM employee_hierarchy_02
>     ORDER BY employee_ID;
> +----------------------------+-------------+------------+-----------------------------+----------------------------+
> | TITLE                      | EMPLOYEE_ID | MANAGER_ID | MGR_EMP_ID (SHOULD BE SAME) | MGR TITLE                  |
> |----------------------------+-------------+------------+-----------------------------+----------------------------|
> | President                  |           1 |       NULL |                        NULL | President                  |
> | Vice President Engineering |          10 |          1 |                           1 | President                  |
> | Vice President HR          |          20 |          1 |                           1 | President                  |
> | Programmer                 |         100 |         10 |                          10 | Vice President Engineering |
> | QA Engineer                |         101 |         10 |                          10 | Vice President Engineering |
> | Health Insurance Analyst   |         200 |         20 |                          20 | Vice President HR          |
> +----------------------------+-------------+------------+-----------------------------+----------------------------+
> ```

### CREATE OR ALTER VIEW examples

#### Basic example

Create a table `my_table` with one column:

```sqlexample
CREATE OR ALTER TABLE my_table(a INT);
```

Create a view named `v2` that selects column `a` from table `my_table`:

```sqlexample
CREATE OR ALTER VIEW v2(one)
  AS SELECT a FROM my_table;
```

Create or alter view `v2`. Add or update the COMMENT and CHANGE_TRACKING properties for the view:

```sqlexample
CREATE OR ALTER VIEW v2(one)
  COMMENT = 'fff'
  CHANGE_TRACKING = true
  AS SELECT a FROM my_table;
```

Create or alter view `v2` to add a comment to a column:

```sqlexample
CREATE OR ALTER VIEW v2(one COMMENT 'bar')
  COMMENT = 'foo'
  AS SELECT a FROM my_table;
```

#### Unset a property previously set on view

The [absence of a previously set property](create-or-alter.md) in the CREATE OR ALTER VIEW statement results
in unsetting it. In the following example, unset the COMMENT property for the view `v2` from the previous example:

```sqlexample
CREATE OR ALTER VIEW v2(one COMMENT 'bar')
  CHANGE_TRACKING = true
  AS SELECT a FROM my_table;
```

---
title: CREATE WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/create-warehouse.md
section: SQL Commands
---

# CREATE WAREHOUSE

Creates a new [virtual warehouse](../../user-guide/warehouses-overview.md) in the system.

Initial creation of a virtual warehouse might take some time to provision the compute resources, unless the warehouse is created initially
in a `SUSPENDED` state.

This command supports the following variants:

* CREATE OR ALTER WAREHOUSE: Creates a new warehouse if it doesn’t exist or alters an existing warehouse.

See also:
:   [ALTER WAREHOUSE](alter-warehouse.md) , [DESCRIBE WAREHOUSE](desc-warehouse.md) , [DROP WAREHOUSE](drop-warehouse.md) , [SHOW WAREHOUSES](show-warehouses.md)

    [CREATE OR ALTER <object>](create-or-alter.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] WAREHOUSE [ IF NOT EXISTS ] <name>
       [ [ WITH ] objectProperties ]
       [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
       [ objectParams ]
```

Where:

> ```sqlsyntax
> objectProperties ::=
>   WAREHOUSE_TYPE = { STANDARD | 'SNOWPARK-OPTIMIZED' }
>   WAREHOUSE_SIZE = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE | X5LARGE | X6LARGE }
>   GENERATION = { '1' | '2' }
>   RESOURCE_CONSTRAINT = { STANDARD_GEN_1 | STANDARD_GEN_2 | MEMORY_1X | MEMORY_1X_x86 | MEMORY_16X | MEMORY_16X_x86 | MEMORY_64X | MEMORY_64X_x86 }
>   MAX_CLUSTER_COUNT = <num>
>   MIN_CLUSTER_COUNT = <num>
>   SCALING_POLICY = { STANDARD | ECONOMY }
>   AUTO_SUSPEND = { <num> | NULL }
>   AUTO_RESUME = { TRUE | FALSE }
>   INITIALLY_SUSPENDED = { TRUE | FALSE }
>   RESOURCE_MONITOR = <monitor_name>
>   COMMENT = '<string_literal>'
>   ENABLE_QUERY_ACCELERATION = { TRUE | FALSE }
>   QUERY_ACCELERATION_MAX_SCALE_FACTOR = <num>
> ```
>
> ```sqlsyntax
> objectParams ::=
>   MAX_CONCURRENCY_LEVEL = <num>
>   STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
>   STATEMENT_TIMEOUT_IN_SECONDS = <num>
> ```

## Variant syntax

### CREATE OR ALTER WAREHOUSE

Creates a new warehouse if it doesn’t already exist, or transforms an existing warehouse into the warehouse defined in the statement.
A CREATE OR ALTER WAREHOUSE statement follows the syntax rules of a CREATE WAREHOUSE statement and has the same limitations as an
[ALTER WAREHOUSE](alter-warehouse.md) statement.

The following modifications are supported when altering a warehouse:

* Changing warehouse properties and parameters. For example, WAREHOUSE_TYPE, AUTO_RESUME or MAX_CLUSTER_COUNT.

For more information, see CREATE OR ALTER WAREHOUSE usage notes.

```sqlexample
CREATE OR ALTER WAREHOUSE <name>
     [ [ WITH ] objectProperties ]
     [ objectParams ]

objectProperties ::=
  WAREHOUSE_TYPE = { STANDARD | 'SNOWPARK-OPTIMIZED' }
  WAREHOUSE_SIZE = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE | X5LARGE | X6LARGE }
  GENERATION = { '1' | '2' }
  RESOURCE_CONSTRAINT = { STANDARD_GEN_1 | STANDARD_GEN_2 | MEMORY_1X | MEMORY_1X_x86 | MEMORY_16X | MEMORY_16X_x86 | MEMORY_64X | MEMORY_64X_x86 }
  MAX_CLUSTER_COUNT = <num>
  MIN_CLUSTER_COUNT = <num>
  SCALING_POLICY = { STANDARD | ECONOMY }
  AUTO_SUSPEND = { <num> | NULL }
  AUTO_RESUME = { TRUE | FALSE }
  INITIALLY_SUSPENDED = { TRUE | FALSE }
  RESOURCE_MONITOR = <monitor_name>
  COMMENT = '<string_literal>'
  ENABLE_QUERY_ACCELERATION = { TRUE | FALSE }
  QUERY_ACCELERATION_MAX_SCALE_FACTOR = <num>

objectParams ::=
  MAX_CONCURRENCY_LEVEL = <num>
  STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = <num>
  STATEMENT_TIMEOUT_IN_SECONDS = <num>
```

## Required parameters

`name`
:   Identifier for the virtual warehouse; must be unique for your account.

    In addition, the identifier must start with an alphabetic character and can’t contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Optional properties (`objectProperties`)

`WAREHOUSE_TYPE = { STANDARD | 'SNOWPARK-OPTIMIZED' }`
:   Specifies the warehouse type.

    Valid values:
    :   * `STANDARD`, `'STANDARD'`
        * `'SNOWPARK-OPTIMIZED'`

    Default:
    :   `STANDARD`

    > **Note:**
    >
    > To use a value that contains a hyphen (`'SNOWPARK-OPTIMIZED'`), you must enclose the value in single quotes, as shown.

`WAREHOUSE_SIZE = { XSMALL | SMALL | MEDIUM | LARGE | XLARGE | XXLARGE | XXXLARGE | X4LARGE | X5LARGE | X6LARGE }`
:   Specifies the size of the virtual warehouse. The size determines the amount of compute resources in each cluster in the warehouse and,
    therefore, the number of credits consumed while the warehouse is running.

    Valid values:
    :   | Supported Values | Synonyms |
        | --- | --- |
        | `XSMALL` | `'X-SMALL'` |
        | `SMALL` |  |
        | `MEDIUM` |  |
        | `LARGE` |  |
        | `XLARGE` | `'X-LARGE'` |
        | `XXLARGE` | `X2LARGE` , `'2X-LARGE'` |
        | `XXXLARGE` | `X3LARGE` , `'3X-LARGE'` |
        | `X4LARGE` | `'4X-LARGE'` |
        | `X5LARGE` | `'5X-LARGE'` |
        | `X6LARGE` | `'6X-LARGE'` |

    Default:
    :   `XSMALL`

    > **Note:**
    >
    > * X5LARGE and X6LARGE sizes for Snowpark-optimized warehouses are only supported with the MEMORY_16X resource constraint.
    > * X5LARGE and X6LARGE sizes aren’t supported for standard warehouses that use the STANDARD_GEN_2 resource constraint.
    > * The default size for Snowpark-optimized warehouses is MEDIUM.
    > * To use a value that contains a hyphen (for example, `'2X-LARGE'`), you must enclose the value in single quotes, as shown.
    > * Larger warehouse sizes 5X-Large and 6X-Large are generally available in all Amazon Web Services (AWS) and Microsoft Azure regions.
    >
    >   Larger warehouse sizes are in preview in US Government regions (requires FIPS support on ARM).

`GENERATION = { '1' | '2' }`
:   Specifies the warehouse generation for standard warehouses. This parameter provides a simplified way to set the warehouse generation,
    instead of using RESOURCE_CONSTRAINT = STANDARD_GEN_1 or STANDARD_GEN_2.

    Valid values:
    :   * `'1'`: Uses generation 1 compute resources. Equivalent to
          `RESOURCE_CONSTRAINT = STANDARD_GEN_1`.
        * `'2'`: Uses generation 2 compute resources. Equivalent to
          `RESOURCE_CONSTRAINT = STANDARD_GEN_2`.

    Default:
    :   `'1'` (generation 1 compute resources)

    > **Note:**
    >
    > * Values must be enclosed in single quotes (for example, `'1'`, not `1`).
    > * GENERATION applies only to standard warehouses (`WAREHOUSE_TYPE = STANDARD`).
    > * When both GENERATION and RESOURCE_CONSTRAINT are specified, any mismatch results in an error.
    > * You can’t use GENERATION with Snowpark-optimized warehouses or memory-based resource constraints (MEMORY_1X, MEMORY_16X, MEMORY_64X).

`RESOURCE_CONSTRAINT = { STANDARD_GEN_1 | STANDARD_GEN_2 | MEMORY_1X| MEMORY_1X_x86 | MEMORY_16X | MEMORY_16X_x86 | MEMORY_64X | MEMORY_64X_x86 }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    The 1 TB resource constraints (MEMORY_64X and MEMORY_64X_x86) are available as a preview feature.
    The 1 TB constraints are available only on the Amazon Web Services (AWS) cloud platform.

    All other MEMORY_\* resource constraint sizes are generally available and are available for all cloud platforms.

    Specifies the memory and CPU architecture for [Snowpark-optimized warehouses](../../user-guide/warehouses-snowpark-optimized.md),
    or generation 1 or [generation 2 capabilities for standard warehouses](../../user-guide/warehouses-gen2.md).

    The following table includes the valid values for the property, available memory, CPU architecture, and the minimum warehouse
    size required for the `resource_constraint` setting.
    For more information about regions and cloud service providers where generation 2 standard warehouses
    are available, see [Snowflake generation 2 standard warehouses](../../user-guide/warehouses-gen2.md).

    > Valid values:
    >
    > | Value | Memory (up to) | CPU architecture | Min warehouse size required | Max warehouse size |
    > | --- | --- | --- | --- | --- |
    > | `STANDARD_GEN_1` | 16 GB | Standard | XSMALL | X6LARGE |
    > | `STANDARD_GEN_2` | 16 GB | Standard (generation 2) | XSMALL | X4LARGE |
    > | `MEMORY_1X` | 16 GB | Standard | XSMALL | X4LARGE |
    > | `MEMORY_1X_x86` | 16 GB | x86 | XSMALL | X4LARGE |
    > | `MEMORY_16X` | 256 GB | Standard | MEDIUM | X6LARGE |
    > | `MEMORY_16X_x86` | 256 GB | x86 | MEDIUM | X4LARGE |
    > | `MEMORY_64X` | 1 TB | Standard | LARGE | X4LARGE |
    > | `MEMORY_64X_x86` | 1 TB | x86 | LARGE | X4LARGE |
    >
    > Default value:
    > :   `MEMORY_16X` for Snowpark-optimized warehouses. For standard warehouses, the default depends on
    >     Gen2 support for your cloud service provider region and whether your organization was created after
    >     Gen2 support became available in that region. For more information, see
    >     [Default value for the RESOURCE_CONSTRAINT for standard warehouses](../../user-guide/warehouses-gen2.md).
    >
    > > **Tip:**
    > >
    > > For standard warehouses, consider using the GENERATION parameter instead of STANDARD_GEN_1 and STANDARD_GEN_2 values.
    > > The GENERATION parameter provides a simpler way to specify the warehouse generation.
    > > Specify `GENERATION = '2'` or `GENERATION = '1'`. The quotes are required around the
    > > generation number.

`MAX_CLUSTER_COUNT = num`
:   Specifies the maximum number of clusters for a multi-cluster warehouse. For a single-cluster warehouse, this value is always `1`.

    Valid values:
    :   `1` to an upper limit that varies depending on warehouse size.

        Note that specifying a value greater than `1` indicates the warehouse is a multi-cluster warehouse; however, the value can only be set
        to a higher value in [Snowflake Enterprise Edition](../../user-guide/intro-editions.md) (or higher).

        For more information, including the upper limit for each warehouse size, see [Multi-cluster warehouses](../../user-guide/warehouses-multicluster.md).

    Default:
    :   `1` (single-cluster warehouse)

    > **Tip:**
    >
    > For Snowflake Enterprise Edition (or higher), we recommend always setting the value greater than `1` to help maintain
    > high-availability and optimal performance of a multi-cluster warehouse. This also helps ensure continuity in the unlikely event that a
    > cluster fails.

`MIN_CLUSTER_COUNT = num`
:   Specifies the minimum number of clusters for a multi-cluster warehouse (only applies to multi-cluster warehouses).

    Valid values:
    :   `1` to the value of `MAX_CLUSTER_COUNT`. The upper limit for `MAX_CLUSTER_COUNT` varies depending on the warehouse size.

        `MIN_CLUSTER_COUNT` must be equal to or less than `MAX_CLUSTER_COUNT`:

        * If both parameters are equal, the warehouse runs in Maximized mode.
        * If `MIN_CLUSTER_COUNT` is less than `MAX_CLUSTER_COUNT`, the warehouse runs in Auto-scale mode.

        For more information, including the upper limit for each warehouse size, see [Multi-cluster warehouses](../../user-guide/warehouses-multicluster.md).

    Default:
    :   `1`

`SCALING_POLICY = { STANDARD | ECONOMY }`
:   Specifies the policy for automatically starting and shutting down clusters in a multi-cluster warehouse running in Auto-scale mode.

    Valid values:
    :   * `STANDARD`: Minimizes queuing by starting clusters.
        * `ECONOMY`: Conserves credits by favoring keeping running clusters fully-loaded.

        For a more detailed description, see [Setting the scaling policy for a multi-cluster warehouse](../../user-guide/warehouses-multicluster.md).

    Default:
    :   `STANDARD`

`AUTO_SUSPEND = { num | NULL }`
:   Specifies the number of seconds of inactivity after which a warehouse is automatically suspended.

    Valid values:
    :   Any integer `0` or greater, or `NULL`:

        * The background process that suspends a warehouse runs approximately every 30 seconds and therefore, the setting for
          this property isn’t intended for enabling precise control over warehouse suspension.
        * Setting a value less than 30, or a value that isn’t a multiple of 30, is allowed but might not result in the expected
          behavior due to the 30 second poll interval for warehouse suspension.
        * Setting a `0` or `NULL` value means the warehouse never suspends.

    Default:
    :   `600` (the warehouse suspends automatically after 10 minutes of inactivity)

    > **Important:**
    >
    > Setting `AUTO_SUSPEND` to `0` or `NULL` is not recommended, unless your query workloads require a continually
    > running warehouse. Note that this can result in significant consumption of credits (and corresponding charges), particularly for
    > larger warehouses.

`AUTO_RESUME = { TRUE | FALSE }`
:   Specifies whether to automatically resume a warehouse when a SQL statement (for example, query) is submitted to it.

    Valid values:
    :   * `TRUE`: The warehouse resumes when a new query is submitted.
        * `FALSE`: The warehouse only resumes when explicitly resumed using [ALTER WAREHOUSE](alter-warehouse.md) or through the Snowflake web
          interface.

    Default:
    :   `TRUE` (the warehouse resumes automatically when a SQL statement is submitted to it)

`INITIALLY_SUSPENDED = { TRUE | FALSE }`
:   Specifies whether the warehouse is created initially in the ‘Suspended’ state.

    Valid values:
    :   * `TRUE`: The warehouse is created, but suspended.
        * `FALSE`: The warehouse starts running after it is created.

    Default:
    :   `FALSE`

`RESOURCE_MONITOR = monitor_name`
:   Specifies the name of a resource monitor that is explicitly assigned to the warehouse. When a resource monitor is explicitly assigned
    to a warehouse, the monitor controls the monthly credits used by the warehouse (and all other warehouses to which the monitor is
    assigned).

    Valid values:
    :   Any existing resource monitor.

        For more details, see [Working with resource monitors](../../user-guide/resource-monitors.md).

    Default:
    :   No value (no resource monitor assigned to the warehouse)

    > **Tip:**
    >
    > To view all resource monitors and their identifiers, use the [SHOW RESOURCE MONITORS](show-resource-monitors.md) command.

`COMMENT = 'string_literal'`
:   Specifies a comment for the warehouse.

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

### Query acceleration properties

`ENABLE_QUERY_ACCELERATION = { TRUE | FALSE }`
:   Specifies whether to enable the [query acceleration service](../../user-guide/query-acceleration-service.md) for queries that rely on this
    warehouse for compute resources.

    > Valid values:
    > :   * `TRUE` Enables Query Acceleration
    >     * `FALSE` Disables Query Acceleration
    >
    > Default:
    > :   `FALSE`: Query Acceleration is disabled

`QUERY_ACCELERATION_MAX_SCALE_FACTOR = num`
:   Specifies the maximum scale factor for leasing compute resources for query acceleration. The scale factor is used as a multiplier based
    on [warehouse size](../../user-guide/warehouses-overview.md).

    Setting the QUERY_ACCELERATION_MAX_SCALE_FACTOR to 0 eliminates the limit and allows queries to lease as many resources as necessary and
    as available to service the query.

    Regardless of the QUERY_ACCELERATION_MAX_SCALE_FACTOR value, the amount of available compute resources for query acceleration is bound by
    the available resources in the service and the number of other concurrent requests. For more details, refer to
    [Adjusting the scale factor](../../user-guide/query-acceleration-service.md).

    Valid values:
    :   `0` to `100`

    Default:
    :   `8`

## Optional parameters (`objectParams`)

`MAX_CONCURRENCY_LEVEL = num`
:   Object parameter that specifies the concurrency level for SQL statements (i.e. queries and DML) executed by a warehouse cluster.

    For a detailed description of this parameter, see [MAX_CONCURRENCY_LEVEL](../parameters.md).

`STATEMENT_QUEUED_TIMEOUT_IN_SECONDS = num`
:   Object parameter that specifies the time, in seconds, a SQL statement (query, DDL, DML, etc.) can be queued on a warehouse before it is
    canceled by the system.

    For a detailed description of this parameter, see [STATEMENT_QUEUED_TIMEOUT_IN_SECONDS](../parameters.md).

`STATEMENT_TIMEOUT_IN_SECONDS = num`
:   Object parameter that specifies the time, in seconds, after which a running SQL statement (query, DDL, DML, etc.) is canceled by the system.

    For a detailed description of this parameter, see [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE WAREHOUSE | Account | Only the SYSADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| OWNERSHIP | Warehouse | Required to execute a CREATE OR ALTER WAREHOUSE statement for an *existing* warehouse.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## General usage notes

* Creating a virtual warehouse automatically sets it as the warehouse in use for the current session (equivalent to using the
  [USE WAREHOUSE](use-warehouse.md) command for the warehouse).

  To change the warehouse in use for the current session, execute an explicit USE WAREHOUSE statement after the
  CREATE WAREHOUSE statement. For example, create warehouse `my_wh` but continue to use the current warehouse, not `my_wh`,
  to execute additional statements:

  ```sqlexample
  SET current_wh_name = (SELECT CURRENT_WAREHOUSE());

  CREATE OR REPLACE WAREHOUSE my_wh
    WAREHOUSE_SIZE = 'XSMALL';

  USE WAREHOUSE IDENTIFIER($current_wh_name);
  ```
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).
* Using `OR REPLACE` is the equivalent of using [DROP WAREHOUSE](drop-warehouse.md) on the existing warehouse and then
  creating a new warehouse with the same name.

  CREATE OR REPLACE *<object>* statements are atomic. That is, when an object is replaced, the old object is deleted and the new object is created in a single transaction.

  Any queries running on the dropped warehouse are aborted.
* The `OR REPLACE` and `IF NOT EXISTS` clauses are mutually exclusive. They can’t both be used in the same statement.
* Initial creation and resumption of a Snowpark-optimized virtual warehouse may take longer than standard warehouses.

## CREATE OR ALTER WAREHOUSE usage notes

**Limitations**

* All limitations of the [ALTER WAREHOUSE](alter-warehouse.md) command apply.
* The INITIALLY_SUSPENDED property can’t be altered (SET or UNSET).

**Warehouse parameters and properties**

* The absence of a property or parameter that was previously set in the modified warehouse definition results in unsetting it.
* Unsetting an explicit parameter value results in setting it to the default parameter value.

**Data governance**

* Setting or unsetting a tag or policy on a warehouse using a CREATE OR ALTER WAREHOUSE statement is *not* supported.
* Existing policies or tags can’t be not altered by a CREATE OR ALTER WAREHOUSE statement and remain unchanged.

## Billing and Pricing

For information on Snowpark-optimized warehouse credit consumption, see
`Table 1` in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Examples

### Basic examples

Create an X-Large warehouse:

> ```sqlexample
> CREATE OR REPLACE WAREHOUSE my_wh WITH WAREHOUSE_SIZE = 'X-LARGE';
> ```

Create a Large warehouse in a suspended state:

> ```sqlexample
> CREATE OR REPLACE WAREHOUSE my_wh WAREHOUSE_SIZE = LARGE INITIALLY_SUSPENDED = TRUE;
> ```

Create an X-Large Snowpark-optimized warehouse named `so_warehouse` with 256 GB memory for
Snowpark workloads that require x86 Python:

```sqlexample
CREATE WAREHOUSE so_warehouse WITH
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  WAREHOUSE_SIZE = XLARGE
  RESOURCE_CONSTRAINT = 'MEMORY_16X_x86';
```

Create a Large generation 2 standard warehouse:

```sqlexample
CREATE WAREHOUSE gen2_wh WITH
  WAREHOUSE_SIZE = LARGE
  GENERATION = '2';
```

### CREATE OR ALTER WAREHOUSE examples

#### Create a simple warehouse

The following example shows how to use CREATE OR ALTER WAREHOUSE to create a Snowpark-optimized
warehouse, then modify its AUTO_RESUME setting.

```sqlexample
CREATE OR ALTER WAREHOUSE so_warehouse
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  WAREHOUSE_SIZE = 'X-LARGE'
  RESOURCE_CONSTRAINT = 'MEMORY_16X_X86'
  AUTO_RESUME = TRUE
  COMMENT = 'Snowpark warehouse for ingestion';

CREATE OR ALTER WAREHOUSE so_warehouse
  WAREHOUSE_TYPE = 'SNOWPARK-OPTIMIZED'
  WAREHOUSE_SIZE = 'X-LARGE'
  RESOURCE_CONSTRAINT = 'MEMORY_16X_X86'
  AUTO_RESUME = FALSE
  COMMENT = 'Snowpark warehouse for ingestion (disabled for auto-resume)';
```

#### Create a Gen1 warehouse and alter it to Gen2

The following example demonstrates how CREATE OR ALTER WAREHOUSE works with the GENERATION
parameter, first creating a warehouse with generation 1 resources, then altering it to use
generation 2 resources.

```sqlexample
-- Create a new warehouse with GENERATION = '1'
CREATE OR ALTER WAREHOUSE test_gen_warehouse
  WITH WAREHOUSE_SIZE = XSMALL
    GENERATION = '1'
    AUTO_SUSPEND = 60
    INITIALLY_SUSPENDED = TRUE;

-- Verify that it was created
SHOW WAREHOUSES LIKE 'test_gen_warehouse'
  ->> SELECT "name", "resource_constraint" FROM $1;

-- Alter it to GENERATION = '2'
CREATE OR ALTER WAREHOUSE test_gen_warehouse
  WITH WAREHOUSE_SIZE = SMALL
    GENERATION = '2'
    AUTO_SUSPEND = 120;

-- Verify that it was altered
SHOW WAREHOUSES LIKE 'test_gen_warehouse'
  ->> SELECT "name", "resource_constraint" FROM $1;

-- Clean up when done
DROP WAREHOUSE test_gen_warehouse;
```

---
title: CREATE | ALTER TABLE … CONSTRAINT
source: https://docs.snowflake.com/en/sql-reference/sql/create-table-constraint.md
section: SQL Commands
---

# CREATE | ALTER TABLE … CONSTRAINT

This topic describes how to create constraints by specifying a CONSTRAINT clause in a
[CREATE TABLE](create-table.md), [CREATE HYBRID TABLE](create-hybrid-table.md),
or [ALTER TABLE](alter-table.md) statement:

* An inline constraint is specified as part of the individual column definition.
* An out-of-line constraint is specified as an independent clause:

  + When creating a table, the clause is part of the column definitions for the table.
  + When altering a table, the clause is specified as an explicit `ADD` action for the table.

For more information, see [Constraints](../constraints.md).

If you are creating or altering [hybrid tables](../../user-guide/tables-hybrid.md), the syntax for defining constraints is the same; however, the rules and requirements are different.

## Syntax for inline constraints

```sqlsyntax
CREATE TABLE <name> (
  <col1_name> <col1_type>  [ NOT NULL ] { inlineUniquePK | inlineFK | inlineCH }
  [ , <col2_name> <col2_type> [ NOT NULL ] { inlineUniquePK | inlineFK | inlineCH } ]
  [ , ... ]
)

ALTER TABLE <name> ADD COLUMN
  <col_name> <col_type> [ NOT NULL ] { inlineUniquePK | inlineFK | inlineCH }
```

Where:

> ```sqlsyntax
> inlineUniquePK ::=
>   [ CONSTRAINT <constraint_name> ]
>   { UNIQUE | PRIMARY KEY }
>   [ [ NOT ] ENFORCED ]
>   [ [ NOT ] DEFERRABLE ]
>   [ INITIALLY { DEFERRED | IMMEDIATE } ]
>   [ { ENABLE | DISABLE } ]
>   [ { VALIDATE | NOVALIDATE } ]
>   [ { RELY | NORELY } ]
> ```
>
> ```sqlsyntax
> inlineFK ::=
>   [ CONSTRAINT <constraint_name> ]
>   [ FOREIGN KEY ]
>   REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
>   [ MATCH { FULL | SIMPLE | PARTIAL } ]
>   [ ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
>        [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ] ]
>   [ [ NOT ] ENFORCED ]
>   [ [ NOT ] DEFERRABLE ]
>   [ INITIALLY { DEFERRED | IMMEDIATE } ]
>   [ { ENABLE | DISABLE } ]
>   [ { VALIDATE | NOVALIDATE } ]
>   [ { RELY | NORELY } ]
> ```
>
> ```sqlsyntax
> inlineCH ::=
>   [ CONSTRAINT <constraint_name> ] CHECK ( <expr> )
>   [ ENABLE { VALIDATE | NOVALIDATE } ]
> ```

## Syntax for out-of-line constraints

```sqlsyntax
CREATE TABLE <name> ... (
  <col1_name> <col1_type>
  [ , <col2_name> <col2_type> , ... ]
  [ , { outoflineUniquePK | outoflineFK | outoflineCH } ]
  [ , { outoflineUniquePK | outoflineFK | outoflineCH } ]
  [ , ... ]
)

ALTER TABLE <name> ... ADD { outoflineUniquePK | outoflineFK | outoflineCH }
```

Where:

> ```sqlsyntax
> outoflineUniquePK ::=
>   [ CONSTRAINT <constraint_name> ]
>   { UNIQUE | PRIMARY KEY } ( <col_name> [ , <col_name> , ... ] )
>   [ [ NOT ] ENFORCED ]
>   [ [ NOT ] DEFERRABLE ]
>   [ INITIALLY { DEFERRED | IMMEDIATE } ]
>   [ { ENABLE | DISABLE } ]
>   [ { VALIDATE | NOVALIDATE } ]
>   [ { RELY | NORELY } ]
>   [ COMMENT '<string_literal>' ]
> ```
>
> ```sqlsyntax
> outoflineFK ::=
>   [ CONSTRAINT <constraint_name> ]
>   FOREIGN KEY ( <col_name> [ , <col_name> , ... ] )
>   REFERENCES <ref_table_name> [ ( <ref_col_name> [ , <ref_col_name> , ... ] ) ]
>   [ MATCH { FULL | SIMPLE | PARTIAL } ]
>   [ ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
>        [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ] ]
>   [ [ NOT ] ENFORCED ]
>   [ [ NOT ] DEFERRABLE ]
>   [ INITIALLY { DEFERRED | IMMEDIATE } ]
>   [ { ENABLE | DISABLE } ]
>   [ { VALIDATE | NOVALIDATE } ]
>   [ { RELY | NORELY } ]
>   [ COMMENT '<string_literal>' ]
> ```
>
> ```sqlsyntax
> outoflineCH ::=
>   [ CONSTRAINT <constraint_name> ] CHECK ( <expr> )
>   [ ENABLE { VALIDATE | NOVALIDATE } ]
> ```

## Constraint properties

For compatibility with other databases, and for use with hybrid tables, Snowflake provides constraint properties.
The properties that can be specified for a constraint depend on the type:

* Some properties apply to all keys (unique, primary, and foreign).
* Other properties apply only to foreign keys.

> **Important:**
>
> For standard Snowflake tables, these properties are provided to facilitate migrating from other databases. They are not
> enforced or maintained by Snowflake. This means that the defaults can be changed for these properties, but changing the
> defaults results in Snowflake not creating the constraint.
>
> An exception is the RELY property. If you have ensured that the data in your standard tables complies with UNIQUE, PRIMARY
> KEY, and FOREIGN KEY constraints, you can set the RELY property for those constraints. See also
> [Setting the RELY Constraint Property to Eliminate Unnecessary Joins](../../user-guide/join-elimination.md).
>
> If you are creating or altering [hybrid tables](../../user-guide/tables-hybrid.md), the rules and requirements are different.
> See [Overview of constraints](../constraints-overview.md).

Most of the supported constraint properties are ANSI SQL standard properties; however, the following properties are Snowflake extensions:

* ENABLE | DISABLE
* VALIDATE | NOVALIDATE
* RELY | NORELY

You can also define a comment within an out-of-line constraint definition; see Comments on constraints.

### Properties (for all constraints)

The following properties apply to all constraints (the order of the properties is interchangeable):

```sqlsyntax
[ NOT ] ENFORCED
[ NOT ] DEFERRABLE
INITIALLY { DEFERRED | IMMEDIATE }
{ ENABLE | DISABLE }
{ VALIDATE | NOVALIDATE }
{ RELY | NORELY }
```

`{ ENFORCED | NOT ENFORCED }`
:   Specifies whether the constraint is enforced in a transaction. For standard tables, NOT NULL is the
    *only* type of constraint that is enforced by Snowflake, regardless of this property.

    For hybrid tables, you can’t set the NOT ENFORCED property on PRIMARY KEY, FOREIGN KEY, and UNIQUE constraints.
    Setting this property results in an “invalid constraint property” error.

    See also [Referential Integrity Constraints](../../user-guide/table-considerations.md).

    Default: NOT ENFORCED

`{ DEFERRABLE | NOT DEFERRABLE }`
:   Specifies whether, in subsequent transactions, the constraint check can be deferred until the end of the transaction.

    Default: NOT DEFERRABLE

`INITIALLY { DEFERRED | IMMEDIATE }`
:   For DEFERRABLE constraints, specifies whether the check for the constraints can be deferred, starting from the next transaction.

    Default: INITIALLY DEFERRED

`{ ENABLE | DISABLE }`
:   Specifies whether the constraint is enabled or disabled. These properties are provided for compatibility with Oracle.

    Default: DISABLE

`{ VALIDATE | NOVALIDATE }`
:   Specifies whether to validate existing data on the table when a constraint is created. Applies only when either
    `{ ENFORCED | NOT ENFORCED }` or `{ ENABLE | DISABLE }` is specified.

    Default for PRIMARY KEY and FOREIGN KEY constraints: NOVALIDATE

    Default for CHECK constraints: VALIDATE

`{ RELY | NORELY }`
:   Specifies whether a constraint in NOVALIDATE mode is taken into account during query rewrite.

    If you have ensured that the data in the table complies with the constraints, you can change this property
    to RELY to indicate that the query optimizer should expect such data integrity. For standard tables, it is your responsibility to
    enforce RELY constraints; otherwise, you might risk unintended behavior and unexpected results.

    If the RELY property is set for a constraint and a violation of referential integrity occurs, DML and CTAS statements might insert
    incorrect data.

    Setting the RELY property might improve query
    performance (for example, by [eliminating unnecessary joins](../../user-guide/join-elimination.md)).

    For related PRIMARY KEY and FOREIGN KEY constraints, set this property on both constraints. For example:

    ```sqlexample
    ALTER TABLE table_with_primary_key ALTER CONSTRAINT a_primary_key_constraint RELY;
    ALTER TABLE table_with_foreign_key ALTER CONSTRAINT a_foreign_key_constraint RELY;
    ```

    Default: NORELY

### Properties (for FOREIGN KEY constraints only)

The following constraint properties apply only to foreign keys (the order of the properties is interchangeable):

```sqlsyntax
MATCH { FULL | SIMPLE | PARTIAL }
ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
   [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
```

`MATCH { FULL | PARTIAL | SIMPLE }`
:   Specifies whether the FOREIGN KEY constraint is satisfied with regard to NULL values in one or more of the columns.

    Default: MATCH FULL

`UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION }`
:   Specifies the action performed when the primary or unique key for the foreign key is updated.

    Default: UPDATE NO ACTION

`DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION }`
:   Specifies the action performed when the primary or unique key for the foreign key is deleted.

    Default: DELETE NO ACTION

### Properties (for CHECK constraints only)

The following constraint properties apply only to CHECK constraints:

```sqlsyntax
CHECK ( <expr> )
```

`CHECK ( expr )`
:   An expression that defines the condition to enforce.

    The expression can contain any of the following items:

    * Table columns defined in the table on which the CHECK constraint operates.
    * Constant values.
    * [Scalar functions](../functions.md) that don’t rely on the environment or execution context.

    The expression can’t contain any of the following items:

    * User-defined functions (UDFs).
    * Aggregate functions, window functions, table functions, or subqueries.
    * System-defined functions that change database state, such as the SYSTEM$CANCEL_ALL_QUERIES function.
    * Non-deterministic system-defined functions, such as the RANDOM function.
    * System-defined functions that rely on the environment or execution context, such as the CURRENT_DATE
      function or the CURRENT_ROLE function.

    For more information, see [CHECK constraints](../constraints-overview.md).

### Non-default values for ENABLE and VALIDATE properties

For syntax compatibility with other databases, Snowflake supports specifying non-default values for constraint properties.

However, for PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints, if you specify ENABLE or VALIDATE (the non-default values
for these properties) when creating a new constraint, *the constraint isn’t created*. This doesn’t apply to RELY. Specifying
RELY does result in the creation of the new constraint.

For CHECK constraints, ENABLE is the default and is required. If you specify DISABLE, then *the CHECK constraint isn’t created*.
Both NOVALIDATE and VALIDATE are supported for new tables. VALIDATE isn’t supported on existing tables.

Snowflake provides a session parameter, [UNSUPPORTED_DDL_ACTION](../parameters.md), which determines whether specifying non-default
values during constraint creation generates an error.

## Comments on constraints

Similar to other database objects and constructs, Snowflake supports comments on constraints:

* Out-of-line constraints support the COMMENT clause within the constraint definition.

  ```sqlexample
  CREATE OR REPLACE TABLE uni (c1 INT, c2 int, CONSTRAINT uni1 UNIQUE(C1) COMMENT 'Unique column');
  ```
* A COMMENT clause within the column definition can be used to comment on the column itself or its constraint:

  ```sqlexample
  CREATE OR REPLACE TABLE uni (c1 INT UNIQUE COMMENT 'Unique column', c2 int);
  ```

Note the following limitations:

* You can’t set comments on constraints by using the [COMMENT](comment.md) command.
* The [DESCRIBE TABLE](desc-table.md) command shows comments defined on columns, but not comments defined on constraints.
  To see comments on constraints, select from the [TABLE_CONSTRAINTS view](../info-schema/table_constraints.md) or the
  [REFERENTIAL_CONSTRAINTS view](../info-schema/referential_constraints.md).
* The COMMENT clause within column and constraint definitions does’t support the equals sign (`=`). Do not specify:

  ```sqlexample
  COMMENT = 'My comment'
  ```

  Use the syntax shown in the previous examples:

  ```sqlexample
  COMMENT 'My comment'
  ```

## Usage notes

* NOT NULL specifies that the column doesn’t allow NULL values:

  > + For standard Snowflake tables, this is the only constraint that is enforced. See [Referential Integrity Constraints](../../user-guide/table-considerations.md).
  > + It can be specified only as an inline constraint within the column definition.
  > + The default is to allow NULL values in columns.
* Multi-column constraints (composite unique or primary keys) can only be defined out-of-line.
* When defining foreign keys, either inline or out-of-line, column name(s) for the referenced table do not need to be specified if the
  signature (name and data type) of the foreign key column(s) and the referenced table’s primary key column(s) exactly match.

* If you create a foreign key, the columns in the REFERENCES clause must be listed in the same order as they were
  listed for the primary key. For example:

  ```sqlexample
  CREATE TABLE parent ... CONSTRAINT primary_key_1 PRIMARY KEY (c_1, c_2) ...
  CREATE TABLE child  ... CONSTRAINT foreign_key_1 FOREIGN KEY (...) REFERENCES parent (c_1, c_2) ...
  ```

  In both cases, the order of the columns is `c_1, c_2`. If the order of the columns in the foreign key had been different
  (for example, `c_2, c_1`), the attempt to create the foreign key would have failed.

## Access control requirements

For creating PRIMARY KEY or UNIQUE constraints:

* When altering an existing table to add the constraint, you must use a role that has the OWNERSHIP privilege on the table.
* When creating a new table, you must use a role that has the CREATE TABLE privilege on the schema where the table will be created.

For creating FOREIGN KEY constraints:

* You must use a role that has the OWNERSHIP privilege on the foreign key table.
* You must use a role that has the REFERENCES privilege on the unique or primary key table.

The REFERENCES privilege can be granted to and revoked from roles using the [GRANT <privileges> … TO ROLE](grant-privilege.md) and
[REVOKE <privileges> … FROM ROLE](revoke-privilege.md) commands:

> ```sqlsyntax
> GRANT REFERENCES ON TABLE <pk_table_name> TO ROLE <role_name>
>
> REVOKE REFERENCES ON TABLE <pk_table_name> FROM ROLE <role_name>
> ```

## Examples of constraints with standard tables

For examples of constraints with hybrid tables, see [CREATE HYBRID TABLE](create-hybrid-table.md).

The example below shows how to create a simple NOT NULL constraint while creating a table, and another NOT NULL
constraint while altering a table:

Create a table and create a constraint at the same time:

```sqlexample
CREATE TABLE table1 (col1 INTEGER NOT NULL);
```

Alter the table to add a column with a constraint:

```sqlexample
ALTER TABLE table1 ADD COLUMN col2 VARCHAR NOT NULL;
```

The following example specifies that the intent of the column is to hold unique values, but makes clear that the
constraint is not actually enforced. This example also demonstrates how to specify a name for the constraint
(“uniq_col3” in this case.)

```sqlexample
ALTER TABLE table1
  ADD COLUMN col3 VARCHAR NOT NULL CONSTRAINT uniq_col3 UNIQUE NOT ENFORCED;
```

The following creates a parent table with a PRIMARY KEY constraint and another table with a FOREIGN KEY constraint
that points to the same columns as the first table’s PRIMARY KEY constraint.

```sqlexample
CREATE TABLE table2 (
  col1 INTEGER NOT NULL,
  col2 INTEGER NOT NULL,
  CONSTRAINT pkey_1 PRIMARY KEY (col1, col2) NOT ENFORCED
);
CREATE TABLE table3 (
  col_a INTEGER NOT NULL,
  col_b INTEGER NOT NULL,
  CONSTRAINT fkey_1 FOREIGN KEY (col_a, col_b) REFERENCES table2 (col1, col2) NOT ENFORCED
);
```

The following example specifies an inline CHECK constraint in a CREATE TABLE statement:

```sqlexample
CREATE TABLE test_check_constraint_orders (
  order_id INT,
  quantity INT CHECK (quantity > 0),
  price NUMBER(10, 2));
```

This CHECK constraint fails for the following DML operations because the
quantity is a negative value or zero:

```sqlexample
INSERT INTO test_check_constraint_orders (order_id, quantity, price)
  VALUES (101, -5, 25.35);
```

```sqlexample
UPDATE test_CHECK_constraint_orders
  SET quantity = 0
  WHERE order_id = 101;
```

The following example specifies an out-of-line CHECK constraint on multiple columns:

```sqlexample
CREATE TABLE test_check_constraint_max_orders (
  order_id INT,
  quantity INT,
  price NUMBER(10, 2),
  max_price NUMBER(10, 2),
  CONSTRAINT chk_price_max CHECK (price < max_price));
```

The CHECK constraint ensures that price doesn’t exceed the maximum price.

The following example specifies an inline CHECK constraint in a CTAS statement:

```sqlexample
CREATE TABLE high_value_products (
  product_id INT,
  product_name VARCHAR(100),
  list_price NUMBER(10, 2),
  CONSTRAINT high_price CHECK (list_price > 100)
  )
  AS SELECT product_id,
            product_name,
            list_price
  FROM products
  WHERE list_price > 100;
```

The CHECK constraint ensures that the new `high_value_products` table only contains items that
are considered to be high-priced.

---
title: DELETE
source: https://docs.snowflake.com/en/sql-reference/sql/delete.md
section: SQL Commands
---

# DELETE

Remove rows from a table. You can use a WHERE clause to specify which rows should be removed. If you need to use a subquery(s) or
additional table(s) to identify the rows to be removed, specify the subquery(s) or table(s) in a USING clause.

> **Important:**
>
> Unlike [TRUNCATE TABLE](truncate-table.md), this command does not delete the external file load history. If you delete rows
> loaded into the table from a staged file, you cannot load the data from that file again unless you modify the file and stage it again.

## Syntax

```sqlsyntax
DELETE FROM <table_name>
            [ USING <additional_table_or_query> [, <additional_table_or_query> ] ]
            [ WHERE <condition> ]
```

## Required parameters

`table_name`
:   Specifies the table from which rows are removed.

## Optional parameters

`USING additional_table_or_query [, ... ]`
:   If you need to refer to additional tables in the WHERE clause to help identify the rows to be removed, then specify those table names in
    the USING clause. You can also use the USING clause to specify subqueries that identify the rows to be removed.

    If you specify a subquery, then put the subquery in parentheses.

    If you specify more than one table or query, use a comma to separate them.

`WHERE condition`
:   Specifies a condition to use to select rows for removal. If this parameter is omitted, all rows in the table are removed, but the table
    remains.

## Usage notes

* When deleting based on a JOIN (by specifying a `USING` clause), it is possible that a row in the target table joins against several
  rows in the `USING` table(s). If the DELETE condition is satisfied for any of the joined combinations, the target row is deleted.

  For example, given tables `tab1` and `tab2` with columns `(k number, v number)`:

  > ```sqlexample
  > select * from tab1;
  >
  > -------+-------+
  >    k   |   v   |
  > -------+-------+
  >    0   |   10  |
  > -------+-------+
  >
  > Select * from tab2;
  >
  > -------+-------+
  >    k   |   v   |
  > -------+-------+
  >    0   |   20  |
  >    0   |   30  |
  > -------+-------+
  > ```

  If you run the following query, the row in `tab1` is joined against both rows of `tab2`:

  > ```sqlexample
  > DELETE FROM tab1 USING tab2 WHERE tab1.k = tab2.k
  > ```

  Because at least one joined pair satisfies the condition, the row is deleted. As a result, after the statement completes, `tab1`
  is empty.

## Examples

Suppose that an organization that leases bicycles uses the following tables:

* The table named leased_bicycles lists the bicycles that were leased out.
* The table named returned_bicycles lists bicycles that have been returned recently. These bicycles need be removed from the table of
  leased bicycles.

Create tables:

> ```sqlexample
> CREATE TABLE leased_bicycles (bicycle_id INTEGER, customer_id INTEGER);
> CREATE TABLE returned_bicycles (bicycle_id INTEGER);
> ```

Load data:

> ```sqlexample
> INSERT INTO leased_bicycles (bicycle_ID, customer_ID) VALUES
>     (101, 1111),
>     (102, 2222),
>     (103, 3333),
>     (104, 4444),
>     (105, 5555);
> INSERT INTO returned_bicycles (bicycle_ID) VALUES
>     (102),
>     (104);
> ```

This example shows how to use the `WHERE` clause to delete a specified row(s). This example deletes by bicycle_ID:

> ```sqlexample
> DELETE FROM leased_bicycles WHERE bicycle_ID = 105;
> +------------------------+
> | number of rows deleted |
> |------------------------|
> |                      1 |
> +------------------------+
> ```

Show the data after the delete:

> ```sqlexample
> SELECT * FROM leased_bicycles ORDER BY bicycle_ID;
> +------------+-------------+
> | BICYCLE_ID | CUSTOMER_ID |
> |------------+-------------|
> |        101 |        1111 |
> |        102 |        2222 |
> |        103 |        3333 |
> |        104 |        4444 |
> +------------+-------------+
> ```

This example shows how to use the `USING` clause to specify rows to be deleted. This `USING` clause specifies the returned_bicycles
table, which lists the IDs of the bicycles to be deleted from the leased_bicycles table. The `WHERE` clause joins the leased_bicycles
table to the returned_bicycles table, and the rows in leased_bicycles that have the same bicycle_ID as the corresponding rows in
returned_bicycles are deleted.

> ```sqlexample
> BEGIN WORK;
> DELETE FROM leased_bicycles
>     USING returned_bicycles
>     WHERE leased_bicycles.bicycle_ID = returned_bicycles.bicycle_ID;
> TRUNCATE TABLE returned_bicycles;
> COMMIT WORK;
> ```

(To avoid trying to remove the same rows again in the future when it might be unnecessary or inappropriate, the returned_bicycles table is
truncated as part of the same transaction.)

Show the data after the delete:

> ```sqlexample
> SELECT * FROM leased_bicycles ORDER BY bicycle_ID;
> +------------+-------------+
> | BICYCLE_ID | CUSTOMER_ID |
> |------------+-------------|
> |        101 |        1111 |
> |        103 |        3333 |
> +------------+-------------+
> ```

Now suppose that another bicycle(s) is returned:

> ```sqlexample
> INSERT INTO returned_bicycles (bicycle_ID) VALUES (103);
> ```

The following query shows a `USING` clause that contains a subquery (rather than a table) to specify which bicycle_IDs to remove from
the leased_bicycles table:

> ```sqlexample
> BEGIN WORK;
> DELETE FROM leased_bicycles
>     USING (SELECT bicycle_ID AS bicycle_ID FROM returned_bicycles) AS returned
>     WHERE leased_bicycles.bicycle_ID = returned.bicycle_ID;
> TRUNCATE TABLE returned_bicycles;
> COMMIT WORK;
> ```

Show the data after the delete:

> ```sqlexample
> SELECT * FROM leased_bicycles ORDER BY bicycle_ID;
> +------------+-------------+
> | BICYCLE_ID | CUSTOMER_ID |
> |------------+-------------|
> |        101 |        1111 |
> +------------+-------------+
> ```

---
title: DESCRIBE <object>
source: https://docs.snowflake.com/en/sql-reference/sql/desc.md
section: SQL Commands
---

# DESCRIBE *<object>*

Describes the details for the specified object.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE <object>](create.md) , [SHOW <objects>](show.md)

## DESCRIBE commands

For specific syntax, usage notes, and examples, see:

**Session/Query Operations:**

> * [DESCRIBE RESULT](desc-result.md)
> * [DESCRIBE TRANSACTION](desc-transaction.md)

**Account Objects:**

> * [DESCRIBE APPLICATION](desc-application.md)
> * [DESCRIBE APPLICATION PACKAGE](desc-application-package.md)
> * [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)
> * [DESCRIBE COMPUTE POOL](desc-compute-pool.md)
> * [DESCRIBE DATABASE](desc-database.md)
> * [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md)
> * [DESCRIBE INTEGRATION](desc-integration.md)
> * [DESCRIBE OPENFLOW DATA PLANE INTEGRATION](desc-oflow-data-plane-integration.md)
> * [DESCRIBE NETWORK POLICY](desc-network-policy.md)
> * [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md)
> * [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md)
> * [DESCRIBE POSTGRES INSTANCE](desc-postgres-instance.md)
> * [DESCRIBE SHARE](desc-share.md)
> * [DESCRIBE SPECIFICATION](desc-specification.md)
> * [DESCRIBE USER](desc-user.md)
> * [DESCRIBE WAREHOUSE](desc-warehouse.md)

**Database Objects:**

> * [DESCRIBE AGENT](desc-agent.md)
> * [DESCRIBE AGGREGATION POLICY](desc-aggregation-policy.md)
> * [DESCRIBE ALERT](desc-alert.md)
> * [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md)
> * [DESCRIBE BACKUP POLICY](desc-backup-policy.md)
> * [DESCRIBE BACKUP SET](desc-backup-set.md)
> * [DESCRIBE CONFIGURATION](desc-configuration.md)
> * [DESCRIBE CORTEX SEARCH SERVICE](desc-cortex-search.md)
> * [DESCRIBE DBT PROJECT](desc-dbt-project.md)
> * [DESCRIBE DCM PROJECT](desc-dcm-project.md)
> * [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md)
> * [DESCRIBE EVENT TABLE](desc-event-table.md)
> * [DESCRIBE EXTERNAL TABLE](desc-external-table.md)
> * [DESCRIBE FEATURE POLICY](desc-feature-policy.md)
> * [DESCRIBE FILE FORMAT](desc-file-format.md)
> * [DESCRIBE FUNCTION](desc-function.md)
> * [DESCRIBE GATEWAY](desc-gateway.md)
> * [DESCRIBE GIT REPOSITORY](desc-git-repository.md)
> * [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)
> * [DESCRIBE JOIN POLICY](desc-join-policy.md)
> * [DESCRIBE LISTING](desc-listing.md)
> * [DESCRIBE MAINTENANCE POLICY](desc-maintenance-policy.md)
> * [DESCRIBE MASKING POLICY](desc-masking-policy.md)
> * [DESCRIBE MATERIALIZED VIEW](desc-materialized-view.md)
> * [DESCRIBE MCP SERVER](desc-mcp-server.md)
> * [DESCRIBE MODEL MONITOR](desc-model-monitor.md)
> * [DESCRIBE NETWORK RULE](desc-network-rule.md)
> * [DESCRIBE NOTEBOOK](desc-notebook.md)
> * [DESCRIBE ONLINE FEATURE TABLE](desc-online-feature-table.md)
> * [DESCRIBE PACKAGES POLICY](desc-packages-policy.md)
> * [DESCRIBE PASSWORD POLICY](desc-password-policy.md)
> * [DESCRIBE PIPE](desc-pipe.md)
> * [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md)
> * [DESCRIBE PROCEDURE](desc-procedure.md)
> * [DESCRIBE PROJECTION POLICY](desc-projection-policy.md)
> * [DESCRIBE ROW ACCESS POLICY](desc-row-access-policy.md)
> * [DESCRIBE SCHEMA](desc-schema.md)
> * [DESCRIBE SECRET](desc-secret.md)
> * [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md)
> * [DESCRIBE SEQUENCE](desc-sequence.md)
> * [DESCRIBE SERVICE](desc-service.md)
> * [DESCRIBE SESSION POLICY](desc-session-policy.md)
> * [DESCRIBE SNAPSHOT](desc-snapshot.md)
> * [DESCRIBE SNAPSHOT POLICY](desc-snapshot-policy.md) (deprecated; prefer [DESCRIBE BACKUP POLICY](desc-backup-policy.md))
> * [DESCRIBE SNAPSHOT SET](desc-snapshot-set.md) (deprecated; prefer [DESCRIBE BACKUP SET](desc-backup-set.md))
> * [DESCRIBE SPECIFICATION](desc-specification.md)
> * [DESCRIBE STAGE](desc-stage.md)
> * [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md)
> * [DESCRIBE STREAMLIT](desc-streamlit.md)
> * [DESCRIBE STREAM](desc-stream.md)
> * [DESCRIBE TABLE](desc-table.md)
> * [DESCRIBE TASK](desc-task.md)
> * [DESCRIBE TYPE](desc-type.md)
> * [DESCRIBE VIEW](desc-view.md)

---
title: DESCRIBE AGENT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-agent.md
section: SQL Commands
---

# DESCRIBE AGENT

Describes the properties of a [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER AGENT](alter-agent.md), [CREATE AGENT](create-agent.md), [DROP AGENT](drop-agent.md), [SHOW AGENTS](show-agents.md), [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](../functions/data_agent_run-snowflake-cortex.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } AGENT <name>
```

## Parameters

`name`
:   Specifies the name for the agent to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides Cortex Agent properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the agent. |
| `database_name` | Database containing the agent. |
| `schema_name` | Schema containing the agent. |
| `owner` | Owner role of the agent. |
| `comment` | Comment text for the agent. |
| `profile` | Agent profile JSON (display_name, avatar, color). |
| `agent_spec` | Complete YAML specification of the agent. |
| `created_on` | Timestamp when the agent was created. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, or MODIFY | Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe a Cortex Agent named `MY_AGENT1` in the `TEST_DATABASE` database and `TEST_SCHEMA` schema:

```sqlexample
DESCRIBE AGENT mydb.myschema.my_agent;
```

The statement in the example prints the following output:

```output
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+
| name  | database_name | schema_name | owner     | comment          | profile                            | agent_spec                       | created_on         |
|--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------|
|| TEST_AGENT | EXAMPLE_DB   | AGENTS | TEST_ROLE | null | {"display_name":"test"} | "{\"models\":{\"orchestration\":\"llama3.1-70B\"},\"nested\":{\"key\":\"value\"}},\"orchestration\":{\"budget\":{\"seconds\":30,\"tokens\":16000}},\"instructions\":{\"response\":\"You will respond in a friendly but concise manner\",\"orchestration\":\"For any revenue question use Analyst; for policy use Search\",\"system\":\"You are a friendly agent.\",\"sample_questions\":[{\"question\":\"question 1\"},{\"question\":\"question 2\"},{\"question\":\"question 3\"}]},\"tools\":[{\"tool_spec\":{\"type\":\"cortex_analyst_text_to_sql\",\"name\":\"Analyst1\",\"description\":\"test\"}},{\"tool_spec\":{\"type\":\"cortex_analyst_sql_exec\",\"name\":\"SQL_exec1\"}},{\"tool_spec\":{\"type\":\"cortex_search\",\"name\":\"Search1\"}},{\"tool_spec\":{\"type\":\"web_search\",\"name\":\"web_search_1\"}},{\"tool_spec\":{\"type\":\"generic\",\"name\":\"get_weather\",\"input_schema\":{\"type\":\"object\",\"properties\":{\"location\":{\"type\":\"string\",\"description\":\"The city and state\"}},\"required\":[\"Location\"]}}}],\"tool_unable_to_answer\":\"I don't know the answer to that\",\"tool_resources\":{\"Analyst1\":{\"semantic_model_file\":\"stage1\"},\"Analyst2\":{\"semantic_view\":\"db.schema.semantic_view\"},\"Search1\":{\"name\":\"db.schema.service_name\",\"Max_results\":\"5\",\"filter\":{\"@eq\":{\"region\":\"North America\"}},\"Title_column\":\"<title_name>\",\"ID_column\":\"<column_name>\"},\"SQL_exec1\":{\"Name\":\"my_warehouse\",\"Timeout\":\"30\",\"AutoExecute\":\"true\"},\"web_search\":{\"name\":\"web_search_1\",\"Function\":\"db/schema/search_web\"}}}" | 2025-09-15 17:04:37.263 +0000 |
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+
```

The following example describes an agent in the current schema:

```sqlexample
DESCRIBE AGENT my_agent;
```

The following example describes the agent as a resource in JSON format:

```sqlexample
DESCRIBE AS RESOURCE AGENT my_agent;
```

---
title: DESCRIBE AGGREGATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-aggregation-policy.md
section: SQL Commands
---

# DESCRIBE AGGREGATION POLICY

Describes the details about an [aggregation policy](../../user-guide/aggregation-policies.md), including the creation date, name, and the
SQL expression.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Aggregation policy DDL reference](../../user-guide/aggregation-policies.md)

## Syntax

```sqlsyntax
DESC[RIBE] AGGREGATION POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the aggregation policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY AGGREGATION POLICY | Account |  |
| APPLY | Aggregation policy |  |
| OWNERSHIP | Aggregation policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on aggregation policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe the aggregation policy:

> ```sqlexample
> DESC AGGREGATION POLICY my_aggpolicy;
> ```

---
title: DESCRIBE ALERT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-alert.md
section: SQL Commands
---

# DESCRIBE ALERT

Describes the properties of an [alert](../../user-guide/alerts.md).

See also:
:   [CREATE ALERT](create-alert.md) , [ALTER ALERT](alter-alert.md), [DROP ALERT](drop-alert.md) , [SHOW ALERTS](show-alerts.md) , [EXECUTE ALERT](execute-alert.md)

## Syntax

```sqlsyntax
DESC[RIBE] ALERT <name>
```

## Required parameters

`name`
:   Identifier for the alert to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR, OPERATE, or OWNERSHIP | Alert | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only returns rows for an alert owner (i.e. the role with the OWNERSHIP privilege on an alert) or a role with the OPERATE
  privilege on an alert.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

See [Viewing details about an alert](../../user-guide/alerts.md).

---
title: DESCRIBE APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-application.md
section: SQL Commands
---

# DESCRIBE APPLICATION

Displays information about a Snowflake Native App.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md), [CREATE APPLICATION PACKAGE](create-application-package.md), [DROP APPLICATION PACKAGE](drop-application-package.md),
    [SHOW APPLICATION PACKAGES](show-application-packages.md),

## Syntax

```sqlsyntax
DESC[RIBE] APPLICATION <name>
```

## Parameters

`name`
:   Specifies the [identifier](../identifiers.md) of the app to
    describe.

## Output

The command displays properties of an app in the following columns:

| Column | Description |
| --- | --- |
| `property` | The name of the property of the app. This column can include the properties listed in the following table. |
| `value` | The value assigned to the property of the app. |

The `property` column can include the following properties of an app:

| Property | Description |
| --- | --- |
| `name` | The name of the app. |
| `source_organization` | The name of the organization of the account containing the application package used to create the app. |
| `source_account` | The account of the application package used to create the app. |
| `source_type` | The source used to create the app. Valid values are `APP_PACKAGE` and `LISTING`. |
| `source` | The name of the application package or listing used to create the app. |
| `version` | The version identifier of the app. |
| `version_label` | The version label of the app. This label is visible to consumer when they install a Snowflake Native App. |
| `patch` | The patch number of the app. |
| `created_on` | The timestamp when the app was created. |
| `last_upgraded_on` | The timestamp of the last upgrade of the app. |
| `restricted_callers_rights` | Indicates that restricted caller’s rights have been enabled for the app. See [Grant restricted caller’s rights to an executable in an app](../../developer-guide/native-apps/ui-consumer-restricted-callers-rights.md) for more information. |
| `share_events_with_provider` | Indicates whether [logging and event sharing](../../developer-guide/native-apps/event-about.md) is enabled for the app. |
| `authorize_telemetry_event_sharing` | The status of the `AUTHORIZE_TELEMETRY_EVENT_SHARING` flag. |
| `log_level` | The log level defined by the provider in the manifest file. |
| `log_event_level` | The log event level defined by the provider in the manifest file. |
| `trace_level` | The trace level defined by the provider in the manifest file. |
| `metric_level` | The metric level defined by the provider in the manifest file. |
| `auditlog_level` | The audit log level defined by the provider in the manifest file. |
| `effective_log_level` | The log level enabled for the app. |
| `effective_log_event_level` | The log event level enabled for the app. |
| `effective_trace_level` | The current trace level configured for the app. |
| `effective_metric_level` | The current metric level configured for the app. |
| `effective_auditlog_level` | The current audit log level configured for the app. |
| `debug_mode` | Indicates whether the app was created using debug mode. |
| `disable_application_redaction` | Indicates if redaction of provider data has been disabled. |
| `upgrade_state` | The current state of the background installation or upgrade of the app. Valid values are:   * `INSTALLING`: The application object is in the process of being created. * `INSTALL_FAILED`: The creation of the application object failed. The application object   remains in the `INSTALL_FAILED` state until it is dropped. See the `UPGRADE_FAILURE_REASON`   column of the DESCRIBE APPLICATION command for information about why the   installation or upgrade failed. * `COMPLETE`: The setup script successfully completed and the application object was created   or upgraded. * `QUEUED`: The application object is queued for upgrade. * `UPGRADING`: The application object is in the process of being upgraded. * `FAILED`: All upgrade attempts failed. The reason for the failure is listed in the   `UPGRADE_FAILURE_REASON` column, if present. The instance remains in the `FAILED` state until   a release directive is updated to point to a different version than the one that the upgrade was   targeting, as defined in the `TARGET_UPGRADE_VERSION` column. * `QUEUED_DELAYED`: The application object is queued for an upgrade that is scheduled for a future time. * `QUEUED_RETRY`: The instance failed one or more upgrade attempts. The reason for the failure   is indicated in `UPGRADE_FAILURE_REASON`: The instance is queued to perform another upgrade attempt. * `DISABLED`: The application object and its upgrades were disabled. In this state the instance will be   inaccessible for consumers, it will not be considered for upgrades and will not block application package   version drop. The reason for the failure is listed in the `UPGRADE_FAILURE_REASON` column, if present. |
| `upgrade_target_version` | The version identifier to which the app is being upgraded. |
| `upgrade_target_patch` | The patch to which the app is being upgraded. |
| `upgrade_attempt` | Indicates whether an upgrade was attempted for the app. |
| `upgrade_task_id` | The internal task identifier for the upgrade attempt. |
| `upgrade_started_on` | The timestamp when the upgrade was initiated. |
| `upgrade_attempted_on` | The timestamp for the last app installation or retry attempt. |
| `upgrade_failure_type` | The reason for an upgrade failure. Possible values are:   * `VERSION_SETUP`: indicates that an error occurred when running the setup script   for the app. This can occur if the setup script contains a syntax error, is empty, etc.   When this error occurs, an email notification is sent to the provider. * `INTERNAL`: indicates an internal Snowflake error, for example, if a required   object does not respond or cannot be found. |
| `upgrade_failure_reason` | The reason the upgrade failed, if applicable. |
| `upgrade_after` | Indicates that the provider has scheduled an upgrade to begin at this time. However, the app may be upgraded before this date and time. For more information, see [Manually upgrade an app](../../developer-guide/native-apps/release-channels-upgrade.md). |
| `upgrade_in_maintenance_window` | If `TRUE` indicates that the provider has scheduled the app to be upgraded during a Snowpark Container Services maintenance window.  This feature is currently in Preview. |
| `previous_version` | The identifier of the previous version of the app. |
| `previous_patch` | The number of the previous patch of the installed app. |
| `previous_version_state` | The state of the previous version of the app. |
| `comment` | Text that provides information about the app. |
| `disablement_reasons` | An array containing the reasons why the app was disabled. For more information, see [Reasons an app can become disabled](../data-sharing-usage/application-state-view.md). |
| `release_channel_name` | The type of release channel. Valid values are `QA`, `ALPHA`, `DEFAULT`. For more information, see [Publish an app using release channels](../../developer-guide/native-apps/release-channels.md). |

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the properties of an app:

```sqlexample
DESC APPLICATION hello_snowflake_app;
```

```output
+------------------------------------+-------------------------------+
| property                           | value                         |
|------------------------------------+-------------------------------|
| name                               | hello_snowflake_app           |
| source_organization                | my_organization               |
| source_account                     | provider_account              |
| source_type                        | APPLICATION PACKAGE           |
| source                             | hello_snowflake_package       |
| version                            | v1_0                          |
| version_label                      | NULL                          |
| patch                              | 0                             |
| created_on                         | 2024-05-25 08:30:41.520 -0700 |
| last_upgraded_on                   |                               |
| share_events_with_provider         | FALSE                         |
| authorize_telemetry_event_sharing  | FALSE                         |
| log_level                          | OFF                           |
| log_event_level                    | OFF                           |
| trace_level                        | OFF                           |
| debug_mode                         | FALSE                         |
| upgrade_state                      | COMPLETE                      |
| upgrade_target_version             | NULL                          |
| upgrade_target_patch               | 0                             |
| upgrade_attempt                    | NULL                          |
| upgrade_task_id                    | NULL                          |
| upgrade_started_on                 |                               |
| upgrade_attempted_on               |                               |
| upgrade_failure_type               | NULL                          |
| upgrade_failure_reason             | NULL                          |
| previous_version                   | NULL                          |
| previous_patch                     | 0                             |
| previous_version_state             | COMPLETE                      |
| comment                            |                               |
+------------------------------------+-------------------------------+
```

---
title: DESCRIBE APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-application-package.md
section: SQL Commands
---

# DESCRIBE APPLICATION PACKAGE

Displays information about an application package.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md),
    [CREATE APPLICATION PACKAGE](create-application-package.md),
    [DROP APPLICATION PACKAGE](drop-application-package.md),
    [SHOW APPLICATION PACKAGES](show-application-packages.md)

## Syntax

```sqlsyntax
DESC[RIBE] APPLICATION PACKAGE <name>
```

## Parameters

`name`
:   Specifies the [identifier](../identifiers.md) of the application package to
    describe.

## Output

The command displays properties of an application package in the following columns:

| Column | Description |
| --- | --- |
| `property` | The name of the property of the application package. This column can include the properties listed in the following table. |
| `value` | The value assigned to the property of the application package. |

The `property` column can include the following properties of an application package:

| Property | Description |
| --- | --- |
| `name` | The name of the application package. |
| `created_on` | The timestamp when the application package was created. |
| `distribution` | The distribution method of the application package. Valid values are `INTERNAL` and `EXTERNAL`. |
| `multiple_instances` | Indicates whether multiple instances of the application package can be installed in a single account. Valid values are `TRUE` and `FALSE`. |
| `uses_container_services` | Indicates whether the application package uses Snowpark Container Services. Valid values are `TRUE` and `FALSE`. |
| `comment` | A description of the application package. |
| `owner` | The owner of the application package. |
| `release-channels` | Indicates whether release channels are enabled for the application package. Valid values are `ENABLED` and `DISABLED`. |
| `listing_auto_refresh` | Indicates whether Cross-Cloud Auto-Fulfillment is enabled for the application package. Valid values are `TRUE` and `FALSE`. |

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the properties of an application package:

```sqlexample
DESC APPLICATION PACKAGE hello_snowflake_app;
```

```output
+------------------------------------+-------------------------------+
| property                           | value                         |
|------------------------------------+-------------------------------|
| name                               | hello_snowflake_app_package   |
| created_on                         | 2025-07-14 14:29:56.927 -0700 |
| distribution                       | INTERNAL                      |
| multiple_instances                 | FALSE                         |
| uses_container_services            | FALSE                         |
| comment                            | My awesome app                |
| owner                              | APP_DEV_ROLE                  |
| release_channels                   | ENABLED                       |
| listing_auto_refresh               | DISABLED                      |
+------------------------------------+-------------------------------+
```

---
title: DESCRIBE AUTHENTICATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-authentication-policy.md
section: SQL Commands
---

# DESCRIBE AUTHENTICATION POLICY

Describes the properties of an [authentication policy](../../user-guide/authentication-policies.md).

See also:
:   [CREATE AUTHENTICATION POLICY](create-authentication-policy.md), [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md), [DROP AUTHENTICATION POLICY](drop-authentication-policy.md), [SHOW AUTHENTICATION POLICIES](show-authentication-policies.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } AUTHENTICATION POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the authentication policy to describe. If the identifier contains spaces or special characters, you must enclose
    the string in double quotation marks. Identifiers enclosed in double quotation marks are case-sensitive. The identifier must meet the
    [identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY AUTHENTICATION POLICY | Account | Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| OWNERSHIP | Authentication policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe an authentication policy named `my_auth_policy`:

```sqlexample
DESC AUTHENTICATION POLICY my_auth_policy;
```

Use the [pipe operator](../operators-flow.md) to select specific output from the DESCRIBE AUTHENTICATION POLICY command:

```sqlexample
DESCRIBE AUTHENTICATION POLICY go_driver_policy
  ->> SELECT "property", "value"
        FROM $1
        WHERE "property" IN('NAME', 'CLIENT_TYPES', 'CLIENT_POLICY');
```

```output
+---------------+--------------------------------------+
| property      | value                                |
|---------------+--------------------------------------|
| NAME          | GO_DRIVER_POLICY                     |
| CLIENT_TYPES  | [DRIVERS]                            |
| CLIENT_POLICY | {GO_DRIVER={MINIMUM_VERSION=3.14.1}} |
+---------------+--------------------------------------+
```

---
title: DESCRIBE AVAILABLE LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/desc-available-listing.md
section: SQL Commands
---

# DESCRIBE AVAILABLE LISTING

Describes the columns in the listings that are available to the user who runs the command. For more information on available listings, see [Listing availability options](https://other-docs.snowflake.com/collaboration/collaboration-listings-about#label-listing-availability).

See also:
:   [CREATE LISTING](create-listing.md), [CREATE APPLICATION](create-application.md), [ALTER LISTING](alter-listing.md),
    [SHOW LISTINGS](show-listings.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } AVAILABLE LISTING <listing_global_name>
```

## Parameters

`listing_global_name`
:   The global listing name to describe.

## Output

The command output provides listing properties and metadata in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `global_name` | Global name of the listing |
| `updated_on` | Date and time the listing was last updated. |
| `first_published_on` | Date and time the listing was first published. |
| `last_published_on` | Date and time the listing was last published. |
| `created_on` | Date and time the listing was created. |
| `title` | Title specified in the listing manifest |
| `subtitle` | Sub title specified in the listing manifest |
| `description` | Listing description. |
| `state` | State of the listing, one of:   * DRAFT * PUBLISHED * UNPUBLISHED |
| `profile` | Provider profile name as specified in the listing manifest. |
| `regions` | The listing regions. |
| `is_monetized` | `true` if the listing is monetized; `false` otherwise. |
| `is_targeted` | `true` if the listing is targeted; `false` otherwise. |
| `is_by_request` | `true` if the listing is by request; `false` otherwise. |
| `is_limited_trial` | `true` if the listing is a limited trial; `false` otherwise. |
| `is_ready_for_import` | `true` If the listing is available in the local region and does not require replication; `false` otherwise. |
| `is_imported` | `true` If the listing was previously imported into the caller’s account; `false` otherwise. |
| `is_application` | `true` If the listing is based on an application; `false` otherwise. |
| `application_data` | Associated application data, such as version or patch, where present. |
| `evaluation_plan` | Associated evaluation plan details, where present. Typically associated with trial listings. |
| `business_needs` | The business needs the listing satisfies. |
| `usage_examples` | Examples provided with the listing. |
| `categories` | The listing categories. |
| `data_attributes` | Data attributes of the listing. |
| `listing_terms` | The listing terms. |
| `resources` | The listing resources. |
| `data_dictionary_url` | Metadata about the data dictionary featured objects. |
| `data_preview_url` | URL of the data preview, if present. |
| `retired_on` | Date and time the listing was retired. Null if not retired. |
| `scheduled_drop_time` | Date and time the listing is scheduled to be dropped. Null if not scheduled. |
| `trial_details` | Details about the trial, if present. |
| `distribution` | Distribution details, if present. |
| `uniform_listing_locator` | The uniform listing locator. For more information about ULLs, see [Configure organizational listings](../../user-guide/collaboration/listings/organizational/org-listing-configure.md). |
| `organization_profile_name` | The associated organization profile name. |
| `is_mountless_queryable` | `true` If the listing can be queried without being mounted; `false` otherwise. |
| `discover_only` | `true` If the listing is discoverable only; `false` otherwise. |
| `approver_contact` | The contact information for the approver, if present. |
| `support_contact` | The contact information for the support, if present. |
| `compliance_badges` | The compliance badges associated with the listing. |
| `request_approval_type` | Displays the organization listing access request type. The access request type defines how discovery targets of a listing submit access requests to the listing approver. Any one of:  * `NULL` * `REQUEST_AND_APPROVE_IN_SNOWFLAKE` indicates access requests are submitted and approved within the Snowflake environment. * `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE` indicates the provider manages access request submissions and approvals independently. The value for external listings is always `NULL`. |

## Examples

Describe the columns in the listing named `MYLISTING`:

```sqlexample
DESC AVAILABLE LISTING MYLISTING;
```

```output
+---------------------+------------------------------+-------------------------------+------------------------------+-------------------------------+------------------------+----------+-------------+-----------+-----------+---------+--------------+-------------+---------------+------------------+---------------------+--------------+----------------+------------------+-----------------+---------------------+----------------+------------+-----------------+----------------------+-----------+---------------------+------------------+------------+---------------------+---------------+-------------------+-----------------------+
| global_name         | updated_on                   | first_published_on            | last_published_on            | created_on                    | title                  | subtitle | description | state     | profile   | regions | is_monetized | is_targeted | is_by_request | is_limited_trial | is_ready_for_import | is_imported  | is_application | application_data | evaluation_plan | business_needs      | usage_examples | categories | data_attributes | listing_terms        | resources | data_dictionary_url | data_preview_url | retired_on | scheduled_drop_time | trial_details | compliance_badges | request_approval_type |
+---------------------+------------------------------+-------------------------------+------------------------------+-------------------------------+------------------------+----------+-------------+-----------+-----------+---------+--------------+-------------+---------------+------------------+---------------------+--------------+----------------+------------------+-----------------+---------------------+----------------+------------+-----------------+----------------------+-----------+---------------------+------------------+------------+---------------------+---------------+-------------------+-----------------------+
| GZDZKY6O            |2023-11-15 13:13:54.840 -0800 | 2023-11-15 13:15:05.751 -0800 | 2023-11-15 13:15:05.751 -0800| 2023-11-15 13:12:48.988 -0800 | public-listing-test-v2 |          | test        | PUBLISHED | GZDZKY57  | ALL     | false        | false       | false.        | false            | false               | false        | true           | "{...}"          | NULL            |  [ {"type":'...' }] | NULL           | HEALTH     |   {...}         |  {"type":"STANDARD"} |  {...}    | NULL                | NULL             | NULL       | NULL                | NULL          | NULL              |  NULL                 |
+---------------------+------------------------------+-------------------------------+------------------------------+-------------------------------+------------------------+----------+-------------+-----------+-----------+---------+--------------+-------------+---------------+------------------+---------------------+--------------+----------------+------------------+-----------------+---------------------+----------------+------------+-----------------+----------------------+-----------+---------------------+------------------+------------+---------------------+---------------+-------------------+-----------------------+
```

---
title: DESCRIBE AVAILABLE ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-available-organization-profile.md
section: SQL Commands
---

# DESCRIBE AVAILABLE ORGANIZATION PROFILE

Describes the active organization profile that can be associated with organizational listings.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } AVAILABLE ORGANIZATION PROFILE <name>
```

## Parameters

`name`
:   Specifies the identifier for the organization profile to describe. Must contain only uppercase characters or numbers.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. See [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The date and time when the organization profile was created. |
| `name` | The name of the organization profile. |
| `title` | The title of the organization profile. |
| `system_generated` | Indicates the organization profile is system generated. |
| `state` | The organization profile state. One of ACTIVE or DRAFT. |
| `description` | The description of the organization profile. |
| `owner_contact` | The contact email of the owner of the organization profile. |
| `approver_contact` | The contact email of the access approver of the organization profile. |
| `logo` | The organization profile logo URL. |
| `can_publish_listings_with_profile` | Whether the current user can publish organizational listings using this organization profile. One of `TRUE` or `FALSE`. |

## Examples

The following example describes the ORGPROFILE organization profile:

```sqlexample
DESCRIBE AVAILABLE ORGANIZATION PROFILE orgprofile;
```

```output
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+--------------------+-----------------------------------+
|created_on               |name         |title                     |system_generated     |state                |description                       |owner_contact        |approver_contact     |logo                |can_publish_listings_with_profile  |
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+--------------------+-----------------------------------+
|2025-01-01 01:01:01.000  |ORGPROFILE   |My Organization Profile   |FALSE                |ACTIVE               |Organization profile description  |test@test.com        |test@test.com        |urn:icon:shield     |TRUE                               |
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+--------------------+-----------------------------------+
```

---
title: DESCRIBE BACKUP POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-backup-policy.md
section: SQL Commands
---

# DESCRIBE BACKUP POLICY

Describes a specific [backup policy](../../user-guide/backups.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE BACKUP POLICY](create-backup-policy.md),
    [ALTER BACKUP POLICY](alter-backup-policy.md),
    [DROP BACKUP POLICY](drop-backup-policy.md),
    [SHOW BACKUP POLICIES](show-backup-policies.md)

## Syntax

```sqlsyntax
DESC[RIBE] BACKUP POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the backup policy to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

> **Note:**
>
> The backup policy is an object that’s inside a specific schema and database. Therefore, the policy
> gets replicated, dropped or undropped, and so on, when those operations are performed on the schema and database
> that contain it. If you can’t drop the backup policy because it’s associated with any backup sets,
> then you also can’t drop the schema or database containing the policy.

To determine whether a backup policy is associated with any backup sets, use the SHOW BACKUP SETS command.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp backup policy was created. |
| `name` | Name of backup policy. |
| `database_name` | Name of database that contains the backup policy. |
| `schema_name` | Name of schema that contains the backup policy. |
| `owner` | Name of the role with the OWNERSHIP privilege on the backup policy. |
| `comment` | Comment for backup policy. |
| `schedule` | Schedule for backup creation. |
| `expire_after_days` | Number of days after backup creation when backup expires. |
| `has_retention_lock` | Indicates whether the policy includes a retention lock.  `Y` if policy has retention lock; `N` otherwise.  For more information, see [Retention lock](../../user-guide/backups.md). |
| `owner` | Name of the role with the OWNERSHIP privilege on the backup set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the backup policy. |

## Examples

Describe a backup policy:

```sqlexample
DESC BACKUP POLICY my_backup_policy;
```

---
title: DESCRIBE BACKUP SET
source: https://docs.snowflake.com/en/sql-reference/sql/desc-backup-set.md
section: SQL Commands
---

# DESCRIBE BACKUP SET

Describes a specific [backup set](../../user-guide/backups.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE BACKUP SET](create-backup-set.md),
    [ALTER BACKUP SET](alter-backup-set.md),
    [DROP BACKUP SET](drop-backup-set.md),
    [SHOW BACKUP SETS](show-backup-sets.md)

## Syntax

```sqlsyntax
DESC[RIBE] BACKUP SET <name>
```

## Parameters

`name`
:   Specifies the identifier for the backup set to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp that the backup set was created. |
| `name` | Name of the backup set. |
| `database_name` | Name of the database that contains the backup set. |
| `schema_name` | Name of the schema that contains the backup set. |
| `object_kind` | Type of the object that the backup set is backing up. |
| `object_name` | Name of the object that the backup set is backing up. |
| `object_database_name` | Name of the database that contains the object being backed up by this backup set. |
| `object_schema_name` | Name of the schema that contains the object being backed up by this backup set. |
| `backup_policy_name` | Name of the backup policy attached to this backup set. |
| `backup_policy_database_name` | Name of the database that contains the backup policy. |
| `backup_policy_schema_name` | Name of the schema that contains the backup policy. |
| `backup_policy_state` | Current state of the backup policy. |
| `owner_role` | Name of the role with the OWNERSHIP privilege on the backup set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the backup set. |
| `comment` | Comment for backup set. |

## Examples

Describe a backup set:

```sqlexample
DESC BACKUP SET my_backup_set;
```

---
title: DESCRIBE CATALOG INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-catalog-integration.md
section: SQL Commands
---

# DESCRIBE CATALOG INTEGRATION

Describes the properties of a [catalog integration](../../user-guide/tables-iceberg.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md) , [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md)

## Syntax

```sqlsyntax
DESC[RIBE] CATALOG INTEGRATION <name>
```

## Parameters

`name`
:   Specifies the identifier for the catalog integration to describe. If the identifier contains spaces or special characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `property` | The name of the property. This column can include the properties listed in the following table. |
| `property_type` | The property type. |
| `property_value` | The value assigned to the property. |
| `property_default` | The default property value. |

The `property` column can include the following properties of catalog integration object:

| Property | Description |
| --- | --- |
| `enabled` | Specifies whether the catalog integration is available to use for Apache Iceberg™ tables. |
| `catalog_source` | The type of catalog source; `ICEBERG_REST`, `POLARIS`, `OBJECT_STORE`, or `GLUE` (for non-REST Glue integrations). |
| `refresh_interval_seconds` | Specifies the number of seconds that Snowflake waits between attempts to poll the external Iceberg catalog for metadata updates for [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md). |
| `rest_authentication` | Specifies the REST authentication parameters for the catalog integration. |
| `rest_config` | Specifies the REST configuration parameters for the catalog integration. |
| `catalog_namespace` | The output for this column is as follows:   * If the catalog integration is for externally managed Iceberg tables, specifies the namespace of the external Iceberg catalog. If the   namespace is specified at the table level only, this column has no value in the function output. * If the catalog integration is for [syncing a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md)   , this column has no value in the function output because this field is   not required. |
| `table_format` | The table format supplied by the catalog; for example, `ICEBERG`. |
| `glue_aws_role_arn` | (AWS Glue) The Amazon Resource Name (ARN) of the IAM role that Snowflake assumes to connect to AWS Glue. |
| `glue_catalog_id` | (AWS Glue) The ID of your AWS account. |
| `glue_region` | (AWS Glue) The AWS Region of your AWS Glue Data Catalog. |
| `glue_aws_iam_user_arn` | (AWS Glue) The ARN of the AWS IAM user created for your Snowflake account when you created the catalog integration. |
| `glue_aws_external_id` | (AWS Glue) The external ID that Snowflake uses to establish a trust relationship with AWS Glue. |
| `comment` | The comment for the catalog integration. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration (catalog) |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe a catalog integration:

```sqlexample
DESC CATALOG INTEGRATION my_catalog_integration;
```

The following shows the output of DESCRIBE CATALOG INTEGRATION for an AWS Glue catalog integration.
The output includes AWS Glue-specific properties (for example, `GLUE_AWS_ROLE_ARN`) and common catalog integration properties.

```output
+-----------------------+---------------+----------------------------------+------------------+
|       property        | property_type |          property_value          | property_default |
+-----------------------+---------------+----------------------------------+------------------+
| ENABLED               | Boolean       | true                             | false            |
| CATALOG_SOURCE        | String        | GLUE                             |                  |
| CATALOG_NAMESPACE     | String        | dbname                           |                  |
| TABLE_FORMAT          | String        | ICEBERG                          |                  |
| GLUE_AWS_ROLE_ARN     | String        | arn:aws:iam::123:role/dummy-role |                  |
| GLUE_CATALOG_ID       | String        | 123456789012                     |                  |
| GLUE_REGION           | String        | us-west-2                        |                  |
| GLUE_AWS_IAM_USER_ARN | String        | arn:aws:iam::123:user/example    |                  |
| GLUE_AWS_EXTERNAL_ID  | String        | exampleGlueExternalId            |                  |
| COMMENT               | String        |                                  |                  |
+-----------------------+---------------+----------------------------------+------------------+
```

---
title: DESCRIBE COMPUTE POOL
source: https://docs.snowflake.com/en/sql-reference/sql/desc-compute-pool.md
section: SQL Commands
---

# DESCRIBE COMPUTE POOL

Describes the properties of a [compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE COMPUTE POOL](create-compute-pool.md) , [ALTER COMPUTE POOL](alter-compute-pool.md), [DROP COMPUTE POOL](drop-compute-pool.md) , [SHOW COMPUTE POOLS](show-compute-pools.md)

## Syntax

```sqlsyntax
DESC[RIBE] COMPUTE POOL <name>
```

## Parameters

`name`
:   Specifies the identifier for the compute pool to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides compute pool properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Compute pool name. |
| `state` | Current state of the compute pool. |
| `min_nodes` | Minimum number of nodes in the compute pool. |
| `max_nodes` | Maximum number of nodes in the compute pool. |
| `instance_family` | Specifies the machine type of nodes in the compute pool. |
| `num_services` | The number of services and jobs running on the compute pool. |
| `num_jobs` | Number of jobs running on the compute pool. |
| `auto_suspend_secs` | Specifies the number of seconds of inactivity after which the compute pool is automatically suspended. |
| `auto_resume` | Specifies whether to automatically resume a compute pool when Snowflake attempts to start a service or job. |
| `active_nodes` | Number of nodes in the compute pool that are active (one or more services or jobs are running). |
| `idle_nodes` | Number of nodes in the compute pool that are idle (no service or job is running). |
| `target_nodes` | Indicates the number of nodes that Snowflake is targeting for your compute pool. If `active_nodes` isn’t equal to the `target_nodes`, Snowflake autoscales the cluster to add or remove the nodes. For more information, see [About the target_nodes compute pool property](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| `placement_group` | Specifies the fault domain into which the compute pool nodes are placed. A fault domain is similar to the cloud provider’s availability zone. For more information, see [Compute pool placement](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| `created_on` | Date and time when the compute pool was created. |
| `resumed_on` | Date and time when the suspended compute pool was resumed. |
| `updated_on` | Date and time when the compute pool was updated using ALTER COMPUTE POOL. |
| `owner` | Role that owns the compute pool. |
| `comment` | Specifies a comment for the compute pool. |
| `is_exclusive` | `true` if the compute pool is created exclusively for a Snowflake Native App; `false` otherwise. |
| `application` | Name of the Snowflake Native App if the compute pool is created exclusively for the app. Otherwise, NULL. |
| `budget` | The name of the [budget](../../user-guide/budgets.md) monitoring the credit usage of the compute pool. |
| `error_code` | Error code, if any, relevant to the STATUS_MESSAGE. Otherwise, this field is empty. For example, when you resize a compute pool:   * If Snowflake encounters a capacity error (new nodes can’t be provisioned), Snowflake returns the error code 392507.  Note that the capacity error indicates the instance type you requested for your compute pool node is currently not available with the cloud provider. You can either wait for the capacity to become available or create a new compute pool with a different instance family. * If you have pending services (including job services) and Snowflake can’t scale up your compute pool, Snowflake returns the error code 392508. |
| `status_message` | Optional message about the status of the compute pool. For example:   * After creating a compute pool, if you run the DESC COMPUTE POOL command, the output might include the status message: “Compute pool is starting for last 1 minute”. * If Snowflake encounters a capacity error when provisioning a node, the output might include the status message: “Compute pool is   starting for the last 3 minutes. We have observed CAPACITY_ERROR.” * If you have pending services (including job services) and Snowflake can’t scale up your compute pool, the output might include   the status message: “Compute pool has reached the maximum node limit. Consider increasing max_nodes using the ALTER COMPUTE POOL command.” |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Compute pool |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the compute pool named `tutorial_compute_pool`:

```sqlexample
DESCRIBE COMPUTE POOL tutorial_compute_pool;
```

Sample output:

```output
+-----------------------+--------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+-----------+---------+--------------+-------------+--------+------------+----------------+-----------------+
| name                  | state  | min_nodes | max_nodes | instance_family | num_services | num_jobs | auto_suspend_secs | auto_resume | active_nodes | idle_nodes | target_nodes | created_on                    | resumed_on                    | updated_on                    | owner     | comment | is_exclusive | application | budget | error_code | status_message | placement_group |
|-----------------------+--------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+-----------+---------+--------------+-------------+--------+------------+----------------+-----------------|
| TUTORIAL_COMPUTE_POOL | ACTIVE |         1 |         1 | CPU_X64_XS      |            3 |        0 |              3600 | true        |            1 |          0 |            1 | 2024-02-24 20:41:31.978 -0800 | 2024-08-08 11:27:01.775 -0700 | 2024-08-18 13:29:08.124 -0700 | TEST_ROLE | NULL    | false        | NULL        | NULL   |            |                |      A          |
+-----------------------+--------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+-----------+---------+--------------+-------------+--------+------------+----------------+-----------------+
```

---
title: DESCRIBE CONFIGURATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-configuration.md
section: SQL Commands
---

# DESCRIBE CONFIGURATION

Describes the properties of a [configuration](../../developer-guide/native-apps/inter-app-communication.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [SHOW CONFIGURATIONS](show-configurations.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } CONFIGURATION <configuration_name> [ IN APPLICATION <app> ]
```

## Parameters

`configuration_name`
:   Specifies the identifier for the configuration to describe.

`app`
:   The name of the app to describe the configuration for.

    If an app runs this command, the parameter is optional and ignored. Listing configurations for another app from within an app is not supported.

    If this command is run directly using a workspace or the Snowflake CLI (that is, by a consumer), the `app` parameter is required.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `property` | The name of the configuration. |
| `value` | The timestamp when the configuration object was created. |

## Usage notes

* When this command is run outside of an app, the consumer role must be granted an application role
  that has access to the configuration. If not, an error is thrown.
* If the consumer role has the MONITOR or OWNERSHIP privilege on the app, the consumer can see all
  configurations in that app, regardless of which application roles they have been granted.

---
title: DESCRIBE CORTEX SEARCH SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-cortex-search.md
section: SQL Commands
---

# DESCRIBE CORTEX SEARCH SERVICE

Describes the properties of a [Cortex Search service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

DESCRIBE can be abbreviated to DESC.

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } CORTEX SEARCH SERVICE <name>;
```

## Parameters

`name`
:   Specifies the identifier for the Cortex Search service.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides the Cortex Search service properties and metadata in the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `name` | TEXT | Name of the service. |
| `database_name` | TEXT | The database in which the service resides. |
| `schema_name` | TEXT | The schema in which the service resides. |
| `target_lag` | TEXT | The maximum amount of time that the service’s content should lag behind updates to the base tables. |
| `warehouse` | TEXT | The warehouse used for service refreshes. |
| `search_column` | TEXT | Name of the search column. |
| `attribute_columns` | TEXT | Comma-separated list of attribute columns in the service. |
| `columns` | TEXT | Comma-separated list of columns in the service. |
| `definition` | TEXT | SQL query used to create the service. |
| `comment` | TEXT | Any comments associated with the service. |
| `service_query_url` | TEXT | URL for querying the service. |
| `source_data_num_rows` | NUMBER | Current number of rows in the materialized source data. |
| `indexing_state` | TEXT | Indexing state of the service; one of SUSPENDED or RUNNING. |
| `indexing_error` | TEXT | Error encountered in the last indexing pipeline, if one exists. |
| `serving_state` | TEXT | Serving state of the Cortex Search Service; one of SUSPENDED or RUNNING. |
| `created_on` | TIMESTAMP_LTZ | Creation time of the Cortex Search Service. |
| `data_timestamp` | TIMESTAMP_LTZ | Time at which the source data was checked for changes resulting in the currently serving index. |
| `embedding_model` | TEXT | The vector embedding model used by the service. |
| `primary_key_columns` | TEXT | Comma-separated list of [primary key column](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) names defined on the service. Empty if no primary key is set. |
| `scoring_profile_count` | NUMBER | The number of [named scoring profiles](../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md) defined in the service. |
| `full_index_build_interval_days` | NUMBER | The target interval, in days, between full index rebuilds. Only applicable to services with [primary keys](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) defined. NULL if not set. |

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the Cortex Search service named `mysvc`:

```sqlexample
DESCRIBE CORTEX SEARCH SERVICE mysvc;
```

---
title: DESCRIBE DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-database.md
section: SQL Commands
---

# DESCRIBE DATABASE

Describes the database. For example, shows the schemas in the database.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER DATABASE](alter-database.md) , [CREATE DATABASE](create-database.md) , [DROP DATABASE](drop-database.md) , [SHOW DATABASES](show-databases.md) , [UNDROP DATABASE](undrop-database.md)

    [DATABASES view](../info-schema/databases.md) (Information Schema)

## Syntax

```sqlsyntax
DESC[RIBE] DATABASE <database_name>
```

## Parameters

`database_name`
:   Specifies the [identifier](../identifiers.md) of the database to describe.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

This demonstrates the DESCRIBE DATABASE command:

```sqlexample
CREATE DATABASE desc_demo;

CREATE SCHEMA sample_schema_1;

CREATE SCHEMA sample_schema_2;

DESCRIBE DATABASE desc_demo;
```

```output
+-------------------------------+--------------------+--------+
| created_on                    | name               | kind   |
|-------------------------------+--------------------+--------|
| 2022-06-23 00:00:00.000 -0700 | INFORMATION_SCHEMA | SCHEMA |
| 2022-06-23 00:00:00.000 -0700 | PUBLIC             | SCHEMA |
| 2022-06-23 01:00:00.000 -0700 | SAMPLE_SCHEMA_1    | SCHEMA |
| 2022-06-23 02:00:00.000 -0700 | SAMPLE_SCHEMA_2    | SCHEMA |
+-------------------------------+--------------------+--------+
```

---
title: DESCRIBE DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-dbt-project.md
section: SQL Commands
---

# DESCRIBE DBT PROJECT

Describes the properties of a [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE DBT PROJECT](create-dbt-project.md), [ALTER DBT PROJECT](alter-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [DROP DBT PROJECT](drop-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } DBT PROJECT <name>
```

## Parameters

`name`
:   Specifies the identifier for the dbt project object to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | The identifier (name) of the dbt project object. |
| `owner` | The role that was used to create the dbt project object. |
| `comment` | The comment associated with the dbt project object. |
| `dbt_version` | The version for the dbt Project. If no value is specified, the system uses version 1.9.4 by default. |
| `dbt_snowflake_version` | The Snowflake version the dbt project object is on. |
| `default_target` | The default execution target (for example, `prod` or `dev`) used by dbt commands executed through Snowflake. |
| `external_access_integrations` | The name of the external access integrations the dbt Project is permitted to use to pull remote dependencies from dbt package hub or GitHub. |

The following columns provide the value of a deprecated parameter:

| Column | Description |
| --- | --- |
| `default_version` | The version of the dbt project object:   * `LAST`: The most recent version of the dbt project object. * `FIRST`: The oldest version of the dbt project object. |
| `default_version_name` | The version identifier in the form `VERSION$num`, where `num` is a positive integer, for example: `VERSION$1`.  The version number begins at `1` when you create a dbt project object and increments by one with each new version of the dbt project object.  Snowflake increments the version identifier when you perform the following tasks:   * Redeploy dbt project from a workspace (runs the ALTER command with the ADD VERSION option). * Update the project by using the [ALTER DBT PROJECT](alter-dbt-project.md) command. * Run the Snow CLI `snow dbt deploy` command with the `--force` option.   Snowflake resets the version identifier to `1` and removes all version aliases when you run the CREATE DBT PROJECT command with the OR REPLACE option. |
| `default_version_alias` | The custom version name alias that you created for a specific version of the dbt project object using the ALTER DBT PROJECT command with the ADD VERSION option. A version name alias always maps to a specific version identifier, such as `VERSION$3`. |
| `default_version_location_uri` | The location URI of the default version. This is read only. |
| `default_version_source_location_uri` | The location URI of the default version’s source files in its Git object. If the dbt project object is not connected to a Git object, this is null. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| MONITOR | dbt project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the dbt project object named `my_dbt_project`:

```sqlexample
DESCRIBE DBT PROJECT my_dbt_project;
```

```output
+----------------+--------------+------------+-------------+-----------------+----------------------+-----------------------+---------------------------------------------------------------+-------------------------------------+-----------------------+----------------+------------------------------+
|      name      |    owner     |  comment   | dbt_version | default_version | default_version_name | default_version_alias | default_version_location_uri                                  | default_version_source_location_uri | dbt_snowflake_version | default_target | external_access_integrations |
+----------------+--------------+------------+-------------+-----------------+----------------------+-----------------------+---------------------------------------------------------------+-------------------------------------+-----------------------+----------------+------------------------------+
| my_dbt_project | ACCOUNTADMIN | My comment | 1.9.4b      | LAST            | VERSION$1            | null                  | snow://dbt/MY_DB.MY_SCHEMA.my_dbt_project/versions/version$1/ | @s1                                 | null                  | dev            | null                         |
+----------------+--------------+------------+-------------+-----------------+----------------------+-----------------------+---------------------------------------------------------------+-------------------------------------+-----------------------+----------------+------------------------------+
```

---
title: DESCRIBE DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-dcm-project.md
section: SQL Commands
---

# DESCRIBE DCM PROJECT

Describes the properties of a [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md) , [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md), [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } DCM PROJECT <name>
```

## Parameters

`name`
:   Specifies the identifier for the DCM project to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

DCM project properties

| Column | Description |
| --- | --- |
| `name` | Name of the DCM project.  Example: `my_project` |
| `created_on` | Timestamp when the DCM project was created.  Example: `2022-01-01 00:00:00` |
| `owner` | User who owns the DCM project. |
| `comment` | User-defined comment associated with the DCM project. |
| `last_executed_version_name` | Name of the last executed DCM project version.  Example: `VERSION$2` |
| `last_executed_version_alias` | Version alias of the last executed DCM project version. |
| `last_executed_version_path` | URI of the last executed version.  Example: `snow://project/MY_DB.PUBLIC.P/versions/version$2/` |
| `last_executed_source_path` | Path to the last executed version sources.  Example: `@project_stg/v1/` |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| READ | DCM project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the DCM project named `my_project`:

```sqlexample
DESCRIBE DCM PROJECT my_project;
```

---
title: DESCRIBE DYNAMIC TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-dynamic-table.md
section: SQL Commands
---

# DESCRIBE DYNAMIC TABLE

Describes the columns in a [dynamic table](../../user-guide/dynamic-tables-about.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE DYNAMIC TABLE](create-dynamic-table.md), [ALTER DYNAMIC TABLE](alter-dynamic-table.md), [DROP DYNAMIC TABLE](drop-dynamic-table.md), [SHOW DYNAMIC TABLES](show-dynamic-tables.md)

## Syntax

```sqlsyntax
DESC[RIBE] DYNAMIC TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the dynamic table to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| SELECT | The dynamic table that you want to describe. | Some metadata is hidden if you don’t have the MONITOR privilege. For more information, see [Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To DESCRIBE a dynamic table, you must be using a role that has MONITOR privilege on the table.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the columns in `my_dynamic_table`:

> ```sqlexample
> DESC DYNAMIC TABLE my_dynamic_table;
> ```

---
title: DESCRIBE EVENT TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-event-table.md
section: SQL Commands
---

# DESCRIBE EVENT TABLE

Describes the columns in an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER TABLE (event tables)](alter-table-event-table.md) , [CREATE EVENT TABLE](create-event-table.md) , [SHOW EVENT TABLES](show-event-tables.md)

## Syntax

```sqlsyntax
DESC[RIBE] EVENT TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the event table to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* This command does not show the object parameters for a table. Instead, use
  [SHOW PARAMETERS IN TABLE …](show-parameters.md).

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the columns in the event table named `my_logged_events`:

> ```sqlexample
> DESC EVENT TABLE my_logged_events;
> ```

---
title: DESCRIBE EXTERNAL TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-external-table.md
section: SQL Commands
---

# DESCRIBE EXTERNAL TABLE

Describes the VALUE column and virtual columns in an external table.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP EXTERNAL TABLE](drop-external-table.md) , [ALTER EXTERNAL TABLE](alter-external-table.md) , [CREATE EXTERNAL TABLE](create-external-table.md) , [SHOW EXTERNAL TABLES](show-external-tables.md)

    [DESCRIBE VIEW](desc-view.md)

## Syntax

```sqlsyntax
DESC[RIBE] [ EXTERNAL ] TABLE <name> [ TYPE =  { COLUMNS | STAGE } ]
```

## Parameters

`name`
:   Specifies the identifier for the external table to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Create an example external table:

> ```sqlexample
> CREATE EXTERNAL TABLE emp ( ... );
> ```

Describe the columns in the table:

> ```sqlexample
> DESC EXTERNAL TABLE emp;
> ```

---
title: DESCRIBE EXTERNAL VOLUME
source: https://docs.snowflake.com/en/sql-reference/sql/desc-external-volume.md
section: SQL Commands
---

# DESCRIBE EXTERNAL VOLUME

Describes the properties of an [external volume](../../user-guide/tables-iceberg.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER EXTERNAL VOLUME](alter-external-volume.md) , [CREATE EXTERNAL VOLUME](create-external-volume.md) , [DROP EXTERNAL VOLUME](drop-external-volume.md) , [SHOW EXTERNAL VOLUMES](show-external-volumes.md)

## Syntax

```sqlsyntax
DESC[RIBE] EXTERNAL VOLUME <name>
```

## Parameters

`name`
:   Specifies the identifier for the external volume to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `parent_property` | The parent property. This column includes the `STORAGE_LOCATIONS` property, which holds a set of named cloud storage locations. |
| `property` | The name of the property. This column can include the properties listed in the following table. |
| `property_type` | The property type. |
| `property_value` | The value assigned to the property. |
| `property_default` | The default property value. |

The `property` column can include the following properties of an external volume object:

| Property | Description |
| --- | --- |
| `comment` | The comment set for the external volume, if any. |
| `allow_writes` | Specifies whether write operations are allowed for the external volume. |
| `storage_location_n` | Details for a cloud storage location associated with the external volume, where `n` is a unique number that distinguishes the location from others in the `STORAGE_LOCATIONS` list; for example, `storage_location_1`.  For more information about storage location properties, see [CREATE EXTERNAL VOLUME](create-external-volume.md). |
| `active` | The name of the [active storage location](../../user-guide/tables-iceberg-storage.md) for the external volume. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | External volume |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe an external volume:

```sqlexample
DESC EXTERNAL VOLUME my_external_volume;
```

The following shows the output of DESCRIBE EXTERNAL VOLUME for an external volume with one storage location.
The property value for `STORAGE_LOCATION_1` is abbreviated for display purposes.

```output
+-------------------+--------------------+---------------+-------------------------------------------------------------------------------------------+------------------+
| parent_property   | property           | property_type | property_value                                                                            | property_default |
|-------------------+--------------------+---------------+-------------------------------------------------------------------------------------------+------------------|
|                   | ALLOW_WRITES       | Boolean       | true                                                                                      | true             |
| STORAGE_LOCATIONS | STORAGE_LOCATION_1 | String        | {"NAME":"my_storage_us_west","STORAGE_PROVIDER":"S3","STORAGE_BASE_URL":"s3://...", ...}  |                  |
| STORAGE_LOCATIONS | ACTIVE             | String        | my_storage_us_west                                                                        |                  |
+-------------------+--------------------+---------------+-------------------------------------------------------------------------------------------+------------------+
```

---
title: DESCRIBE FEATURE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-feature-policy.md
section: SQL Commands
---

# DESCRIBE FEATURE POLICY

Describes the properties of a [feature policy](../../developer-guide/native-apps/ui-consumer-feature-policies.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE FEATURE POLICY](create-feature-policy.md) , [ALTER FEATURE POLICY](alter-feature-policy.md), [DROP FEATURE POLICY](drop-feature-policy.md), [SHOW FEATURE POLICIES](show-feature-policies.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } FEATURE POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the feature policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command displays properties of a feature policy in the following columns:

| Column | Description |
| --- | --- |
| `property` | The name of the feature property policy. This column can include the properties listed in the following table. |
| `value` | The value assigned to the property of the feature policy. |

The `property` column can include the following properties of a feature policy:

| Property | Description |
| --- | --- |
| `created_on` | The timestamp when the feature policy was created. |
| `name` | The name of the feature policy. |
| `owner` | The role that owns the feature policy. |
| `owner_role_type` | The type of role that owns the object: ROLE or DATABASE_ROLE |
| `comment` | A description of the feature policy. |
| `blocked_object_types_for_creation` | The list of objects that the feature policy blocks for creation. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY FEATURE POLICY | Account |  |
| OWNERSHIP or APPLY | Feature policy |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example describes the feature policy named `block_db_policy`:

```sqlexample
DESCRIBE FEATURE POLICY block_db_policy;
```

```output
+------------------------------------+-------------------------------+
| property                           | value                         |
+------------------------------------|-------------------------------+
| created_on                         | 2025-05-23 08:19:49.483 -0700 |
| name                               | BLOCK_CREATE_DB_POLICY        |
| owner                              | ACCOUNTADMIN                  |
| owner_role_type                    | ROLE                          |
| comment                            |                               |
| blocked_object_types_for_creation  | DATABASES                     |
+------------------------------------+-------------------------------+
```

---
title: DESCRIBE FILE FORMAT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-file-format.md
section: SQL Commands
---

# DESCRIBE FILE FORMAT

Describes the property type (for example, `String` or `Integer`), the defined value of the property, and the default value for each property in a file format object definition. For more information about available properties for each file type, see “[Format type options](create-file-format.md)” in [CREATE FILE FORMAT](create-file-format.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP FILE FORMAT](drop-file-format.md) , [ALTER FILE FORMAT](alter-file-format.md) , [CREATE FILE FORMAT](create-file-format.md) , [SHOW FILE FORMATS](show-file-formats.md)

## Syntax

```sqlsyntax
DESC[RIBE] FILE FORMAT <name>
```

## Parameters

`name`
:   Specifies the identifier for the file format to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the file format object named `my_csv_format`:

> ```sqlexample
> DESC FILE FORMAT my_csv_format;
> ```
>
> Output:
>
> > ```output
> > +--------------------------------+---------------+----------------+------------------+
> > | property                       | property_type | property_value | property_default |
> > +--------------------------------+---------------+----------------+------------------+
> > | TYPE                           | String        | csv            | CSV              |
> > | RECORD_DELIMITER               | String        | \n             | \n               |
> > | FIELD_DELIMITER                | String        | ,              | ,                |
> > | FILE_EXTENSION                 | String        |                |                  |
> > | SKIP_HEADER                    | Integer       | 0              | 0                |
> > | PARSE_HEADER                   | Boolean       | FALSE          | FALSE            |
> > | DATE_FORMAT                    | String        | AUTO           | AUTO             |
> > | TIME_FORMAT                    | String        | AUTO           | AUTO             |
> > | TIMESTAMP_FORMAT               | String        | AUTO           | AUTO             |
> > | BINARY_FORMAT                  | String        | HEX            | HEX              |
> > | ESCAPE                         | String        | NONE           | NONE             |
> > | ESCAPE_UNENCLOSED_FIELD        | String        | \\             | \\               |
> > | TRIM_SPACE                     | Boolean       | FALSE          | FALSE            |
> > | FIELD_OPTIONALLY_ENCLOSED_BY   | String        | NONE           | NONE             |
> > | NULL_IF                        | List          | [\\N]          | [\\N]            |
> > | COMPRESSION                    | String        | AUTO           | AUTO             |
> > | ERROR_ON_COLUMN_COUNT_MISMATCH | Boolean       | TRUE           | TRUE             |
> > | VALIDATE_UTF8                  | Boolean       | TRUE           | TRUE             |
> > | SKIP_BLANK_LINES               | Boolean       | FALSE          | FALSE            |
> > | REPLACE_INVALID_CHARACTERS     | Boolean       | FALSE          | FALSE            |
> > | EMPTY_FIELD_AS_NULL            | Boolean       | TRUE           | TRUE             |
> > | SKIP_BYTE_ORDER_MARK           | Boolean       | TRUE           | TRUE             |
> > | ENCODING                       | String        | UTF8           | UTF8             |
> > +--------------------------------+---------------+----------------+------------------+
> > ```

Describe the file format object named `my_json_format`:

> ```sqlexample
> DESC FILE FORMAT `my_json_format`;
> ```
>
> Output:
>
> > ```output
> > +----------------------------+---------------+----------------+------------------+
> > | property                   | property_type | property_value | property_default |
> > +----------------------------+---------------+----------------+------------------+
> > | TYPE                       | String        | JSON           | CSV              |
> > | FILE_EXTENSION             | String        |                |                  |
> > | DATE_FORMAT                | String        | AUTO           | AUTO             |
> > | TIME_FORMAT                | String        | AUTO           | AUTO             |
> > | TIMESTAMP_FORMAT           | String        | AUTO           | AUTO             |
> > | BINARY_FORMAT              | String        | HEX            | HEX              |
> > | TRIM_SPACE                 | Boolean       | FALSE          | FALSE            |
> > | NULL_IF                    | List          | []             | [\\N]            |
> > | COMPRESSION                | String        | AUTO           | AUTO             |
> > | ENABLE_OCTAL               | Boolean       | FALSE          | FALSE            |
> > | ALLOW_DUPLICATE            | Boolean       | FALSE          | FALSE            |
> > | STRIP_OUTER_ARRAY          | Boolean       | FALSE          | FALSE            |
> > | STRIP_NULL_VALUES          | Boolean       | FALSE          | FALSE            |
> > | IGNORE_UTF8_ERRORS         | Boolean       | FALSE          | FALSE            |
> > | REPLACE_INVALID_CHARACTERS | Boolean       | FALSE          | FALSE            |
> > | SKIP_BYTE_ORDER_MARK       | Boolean       | TRUE           | TRUE             |
> > +----------------------------+---------------+----------------+------------------+
> > ```

Describe the file format object named `my_parquet_format`:

> ```sqlexample
> DESC FILE FORMAT `my_parquet_format`;
> ```
>
> Output:
>
> > ```output
> > +----------------+---------------+----------------+------------------+
> > | property       | property_type | property_value | property_default |
> > +----------------+---------------+----------------+------------------+
> > | TYPE           | String        | PARQUET        | CSV              |
> > | TRIM_SPACE     | Boolean       | FALSE          | FALSE            |
> > | NULL_IF        | List          | []             | [\\N]            |
> > | COMPRESSION    | String        | SNAPPY         | AUTO             |
> > | BINARY_AS_TEXT | Boolean       | TRUE           | TRUE             |
> > +----------------+---------------+----------------+------------------+
> > ```

---
title: DESCRIBE FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-function.md
section: SQL Commands
---

# DESCRIBE FUNCTION

Describes the specified user-defined function (UDF) or external function, including the signature (i.e. arguments),
return value, language, and body (i.e. definition).

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP FUNCTION](drop-function.md) , [ALTER FUNCTION](alter-function.md) , [CREATE FUNCTION](create-function.md) , [SHOW USER FUNCTIONS](show-user-functions.md) , [SHOW EXTERNAL FUNCTIONS](show-external-functions.md)

## Syntax

```sqlsyntax
DESC[RIBE] FUNCTION <name> ( [ <arg_data_type> ] [ , ... ] )
```

## Parameters

`name`
:   Specifies the identifier for the function to describe. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`arg_data_type [ , ... ]`
:   Specifies the data type of the argument(s), if any, for the function. The argument data types are necessary because functions support
    name overloading (i.e. two functions in the same schema can have the same name) and the argument data types are used to identify the
    function.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

This demonstrates the DESCRIBE FUNCTION command:

> ```sqlexample
> DESC FUNCTION multiply(number, number);
>
> -----------+----------------------------------+
>  property  |              value               |
> -----------+----------------------------------+
>  signature | (a NUMBER(38,0), b NUMBER(38,0)) |
>  returns   | NUMBER(38,0)                     |
>  language  | SQL                              |
>  body      | a * b                            |
> -----------+----------------------------------+
> ```

---
title: DESCRIBE FUNCTION (DMF)
source: https://docs.snowflake.com/en/sql-reference/sql/desc-function-dmf.md
section: SQL Commands
---

# DESCRIBE FUNCTION (DMF)

Describes the specified data metric function (DMF), including the signature (arguments), return value, language, and body (definition).

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } FUNCTION [ IF EXISTS ] <name>(
  TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ]
  )
```

## Parameters

`name`
:   Specifies the identifier for the function to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`TABLE( arg_data_type [ , ... ] ) [ , TABLE( arg_data_type [ , ... ] ) ]`
:   Specifies the data type of the column arguments for the DMF. The data types are necessary because DMFs support name overloading
    (that is, two DMFs in the same schema can have the same name), and the data types of the argument are used to identify the DMF you want to
    describe.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Data metric function |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe the DMF to view its properties:

```sqlexample
DESC FUNCTION governance.dmfs.count_positive_numbers(
  TABLE(
    NUMBER, NUMBER, NUMBER
  )
);
```

```output
+-----------+---------------------------------------------------------------------+
| property  | value                                                               |
+-----------+---------------------------------------------------------------------+
| signature | (ARG_T TABLE(ARG_C1 NUMBER, ARG_C2 NUMBER, ARG_C3 NUMBER))          |
| returns   | NUMBER(38,0)                                                        |
| language  | SQL                                                                 |
| body      | SELECT COUNT(*) FROM arg_t WHERE arg_c1>0 AND arg_c2>0 AND arg_c3>0 |
+-----------+---------------------------------------------------------------------+
```

---
title: DESCRIBE FUNCTION (Snowpark Container Services)
source: https://docs.snowflake.com/en/sql-reference/sql/desc-function-spcs.md
section: SQL Commands
---

# DESCRIBE FUNCTION (Snowpark Container Services)

Describes the specified [service function](../../developer-guide/snowpark-container-services/working-with-services.md), including the signature (arguments), return value, language, and body (path to the Snowpark Container Services service).

See also:
:   [Service functions](../../developer-guide/snowpark-container-services/working-with-services.md), [CREATE FUNCTION](create-function-spcs.md), [ALTER FUNCTION](alter-function-spcs.md), [DROP FUNCTION](drop-function-spcs.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> ] [ , ... ] )
```

## Required parameters

`name`
:   Specifies the identifier for the service function to describe. If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive.

`( [ arg_name arg_data_type ] [ , ... ] )`
:   Specifies the arguments/inputs for the service function. These should correspond to the arguments that the
    service expects.

    If there are no arguments, then include the parentheses without any argument name(s) and data type(s).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Service function |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

In [Tutorial-1](../../developer-guide/snowpark-container-services/tutorials/tutorial-1.md), you create a service function (my_echo_udf). The following DESC FUNCTION command returns the service function description:

```sqlexample
DESC FUNCTION my_echo_udf(VARCHAR);
```

Example output:

```output
+--------------------+----------------------+
| property           | value                |
|--------------------+----------------------|
| signature          | (INPUTTEXT VARCHAR)  |
| returns            | VARCHAR              |
| language           | NULL                 |
| null handling      | CALLED ON NULL INPUT |
| volatility         | VOLATILE             |
| body               | /echo                |
| headers            | null                 |
| context_headers    | null                 |
| max_batch_rows     | not set              |
| service            | ECHO_SERVICE         |
| service_endpoint   | echoendpoint         |
| max_batch_retries  | 3                    |
| on_batch_failure   | ABORT                |
| batch_timeout_secs | not set              |
+--------------------+----------------------+
```

---
title: DESCRIBE GATEWAY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-gateway.md
section: SQL Commands
---

# DESCRIBE GATEWAY

Describes the properties of a [gateway](../../developer-guide/snowpark-container-services/gateway.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE GATEWAY](create-gateway.md) , [ALTER GATEWAY](alter-gateway.md), [DROP GATEWAY](drop-gateway.md) , [SHOW GATEWAYS](show-gateways.md)

## Syntax

```sqlsyntax
DESC[RIBE] GATEWAY <name>
```

## Parameters

`name`
:   Specifies the identifier for the gateway to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides gateway properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Gateway name. |
| `ingress_url` | Gateway ingress URL. |
| `privatelink_ingress_url` | PrivateLink ingress URL. |
| `database_name` | Database in which the gateway is created. |
| `schema_name` | Schema in which the gateway is created. |
| `owner` | Role that owns the gateway. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `spec` | Gateway specification (YAML format). This column is only shown if the role executing the command has USAGE, MODIFY, or OWNERSHIP privilege on the gateway. |
| `created_on` | Timestamp when the gateway was created. |
| `updated_on` | Timestamp when the gateway was last updated. |
| `comment` | Gateway related comment. |

> **Note:**
>
> If the role used has USAGE, MODIFY, or OWNERSHIP privilege on the gateway, the `spec` column will be shown.
> If not, the other columns will be shown, but not the `spec` column.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE, MODIFY, or OWNERSHIP | Gateway | Any of these privileges allows describing the gateway. Only roles with these privileges can view the spec. |
| USAGE | Database | Required on the database containing the gateway. |
| USAGE | Schema | Required on the schema containing the gateway. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the gateway named `split_gateway`:

```sqlexample
DESCRIBE GATEWAY split_gateway;
```

---
title: DESCRIBE GIT REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-git-repository.md
section: SQL Commands
---

# DESCRIBE GIT REPOSITORY

Describes an existing Snowflake [Git repository clone](../../developer-guide/git/git-overview.md).

See also:
:   [ALTER GIT REPOSITORY](alter-git-repository.md), [CREATE GIT REPOSITORY](create-git-repository.md), [DROP GIT REPOSITORY](drop-git-repository.md), [SHOW GIT BRANCHES](show-git-branches.md),
    [SHOW GIT REPOSITORIES](show-git-repositories.md), [SHOW GIT TAGS](show-git-tags.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } GIT REPOSITORY <name>
```

## Parameters

`name`
:   Specifies the identifier for the Git repository clone to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output includes properties in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date the Git repository clone was created. |
| `name` | Name of the Git repository clone. |
| `database_name` | Name of the database containing this Git repository clone. |
| `schema_name` | Name of the schema containing this Git repository clone. |
| `origin` | URL of the remote Git repository’s origin. |
| `api_integration` | Name of the API integration included in this Git repository clone. |
| `git_credentials` | Name of the secret object in this Git repository clone. |
| `owner` | Role used when this Git repository clone was created. |
| `owner_role_type` | Type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `comment` | Comment specified when this Git repository clone was created. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Git repository | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example generates a description of the `snowflake_extensions` Git repository clone:

```sqlexample
DESCRIBE GIT REPOSITORY snowflake_extensions;
```

The preceding command generates output such as the following:

```output
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| CREATED_ON                    | NAME                 | DATABASE_NAME | SCHEMA_NAME | ORIGIN                                                 | API_INTEGRATION     | GIT_CREDENTIALS           | OWNER        | OWNER_ROLE_TYPE | COMMENT |
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-06-28 08:46:10.886 -0700 | SNOWFLAKE_EXTENSIONS | MY_DB         | MAIN        | https://github.com/my-account/snowflake-extensions.git | GIT_API_INTEGRATION | MY_DB.MAIN.GIT_SECRET     | ACCOUNTADMIN | ROLE            |         |
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

---
title: DESCRIBE ICEBERG TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-iceberg-table.md
section: SQL Commands
---

# DESCRIBE ICEBERG TABLE

Describes either the columns in an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) or the current values,
as well as the default values, for the properties of an Iceberg table.

DESCRIBE can be abbreviated to DESC.

Note that this topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [ALTER ICEBERG TABLE](alter-iceberg-table.md), [DROP ICEBERG TABLE](drop-iceberg-table.md), [CREATE ICEBERG TABLE](create-iceberg-table.md), [SHOW ICEBERG TABLES](show-iceberg-tables.md)

## Syntax

```sqlsyntax
DESC[RIBE] [ ICEBERG ] TABLE <name> [ TYPE =  { COLUMNS | STAGE } ]
```

## Parameters

`name`
:   Specifies the identifier for the table to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`TYPE = COLUMNS | STAGE`
:   Specifies whether to display the columns for the table or the stage properties (including their current and default values) for the
    table.

    Default: `TYPE = COLUMNS`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| SELECT | Iceberg table |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* This command does not show the object parameters for a table. Instead, use
  [SHOW PARAMETERS IN TABLE](show-parameters.md).
* DESC ICEBERG TABLE, [DESCRIBE TABLE](desc-table.md), and [DESCRIBE VIEW](desc-view.md) are interchangeable. Any of these
  commands retrieves the details for the table or view that matches the criteria in the statement; however, `TYPE = STAGE` does
  not apply for views because views don’t have stage properties.
* The output includes a `POLICY NAME` column to indicate the [masking policy](../../user-guide/security-column-intro.md) set on the column.

  If a masking policy isn’t set on the column or if the Snowflake account isn’t Enterprise Edition or higher, Snowflake returns
  `NULL`.
* The command returns the `NAME_MAPPING` column only if you configure Iceberg Compatibility V2
  ([icebergCompatV2](https://github.com/delta-io/delta/blob/master/PROTOCOL.md#iceberg-compatibility-v2)) for the Delta table
  that your Iceberg table is based on.

  > **Note:**
  >
  > To view the `NAME_MAPPING` column, you must also enable the 2025_01 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_01');
  > ```

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Create an example Iceberg table:

> ```sqlexample
> CREATE OR REPLACE ICEBERG TABLE my_iceberg_table
>   CATALOG='my_catalog_integration'
>   EXTERNAL_VOLUME='my_ext_volume'
>   METADATA_FILE_PATH='path/to/metadata/v2.metadata.json';
> ```

Describe the columns in the table:

> ```sqlexample
> DESC ICEBERG TABLE my_iceberg_table ;
> ```

---
title: DESCRIBE INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-integration.md
section: SQL Commands
---

# DESCRIBE INTEGRATION

Describes the properties of an integration.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE INTEGRATION](create-integration.md) , [DROP INTEGRATION](drop-integration.md) , [ALTER INTEGRATION](alter-integration.md) , [SHOW INTEGRATIONS](show-integrations.md)

API integrations:
:   [ALTER API INTEGRATION](alter-api-integration.md) , [CREATE API INTEGRATION](create-api-integration.md)

Catalog integrations:
:   [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [CREATE CATALOG INTEGRATION](create-catalog-integration.md)

External access integrations:
:   [ALTER EXTERNAL ACCESS INTEGRATION](alter-external-access-integration.md) , [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)

Notification integrations:
:   [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md) , [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)

Security integrations:
:   [ALTER SECURITY INTEGRATION](alter-security-integration.md) , [CREATE SECURITY INTEGRATION](create-security-integration.md)

Storage integrations:
:   [ALTER STORAGE INTEGRATION](alter-storage-integration.md) , [CREATE STORAGE INTEGRATION](create-storage-integration.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } [ { API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE } ] INTEGRATION <name>
```

## Parameters

`{ API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE }`
:   Describes an integration of the specified type.

    For more information about some of these types, see the following topics:

    * [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)
    * [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md)

`name`
:   Specifies the identifier for the integration to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* If the integration is an API integration, then the output includes the API_KEY column. The API_KEY displays a masked value if an
  [API key](../external-functions-security.md) was entered. (This does not display either the original unencrypted key or the
  encrypted version of the key.)
* If the security integration has the `TYPE` property set to `OAUTH` (i.e. Snowflake OAuth), Snowflake returns two additional security
  integration properties in the query result that cannot be set with either a CREATE SECURITY INTEGRATION or an ALTER SECURITY INTEGRATION
  command:

  `OAUTH_ALLOWED_AUTHORIZATION_ENDPOINTS`
  :   A list of all supported endpoints for a client application to receive an authorization code from Snowflake.

  `OAUTH_ALLOWED_TOKEN_ENDPOINTS`
  :   A list of all supported endpoints for a client application to exchange an authorization code for an access token or to obtain a refresh
      token.

## Examples

Describe the properties of an integration named `my_int`:

```sqlexample
DESC INTEGRATION my_int;
```

---
title: DESCRIBE JOIN POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-join-policy.md
section: SQL Commands
---

# DESCRIBE JOIN POLICY

Describes the details about a [join policy](../../user-guide/join-policies.md), including the creation date, name, and the SQL expression.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Join policy DDL reference](../../user-guide/join-policies.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } JOIN POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the join policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY JOIN POLICY | Account |  |
| APPLY | Join policy |  |
| OWNERSHIP | Join policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about join policy DDL and privileges, see [Managing join policies](../../user-guide/join-policies.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe a join policy:

```sqlexample
DESCRIBE JOIN POLICY jp3;
```

```output
+------+-----------+-----------------+-----------------------------------------+
| name | signature | return_type     | body                                    |
|------+-----------+-----------------+-----------------------------------------|
| JP3  | ()        | JOIN_CONSTRAINT | JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE) |
+------+-----------+-----------------+-----------------------------------------+
```

---
title: DESCRIBE LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/desc-listing.md
section: SQL Commands
---

# DESCRIBE LISTING

Describes the columns in a [listing](../../collaboration/collaboration-listings-about.md).

See also:
:   [CREATE LISTING](create-listing.md), [ALTER LISTING](alter-listing.md), [SHOW LISTINGS](show-listings.md), [SHOW VERSIONS IN LISTING](show-versions-in-listing.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } LISTING <name>  [ REVISION = { DRAFT | PUBLISHED } ]
```

## Parameters

`name`
:   The identifier, specified when the listing was created, for the listing to describe.
    If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    See [SHOW LISTINGS](show-listings.md) for listing details, including listing **name**.

`REVISION = { DRAFT | PUBLISHED }`
:   Specifies which revision to display.

    For example, If you have a draft of a published listing, you can specify either the draft or published version to display.

    Valid values:
    :   * `DRAFT`: Describe the draft version of the listing.
        * `PUBLISHED`: Describe the published version of the listing.

        Default:
        :   `PUBLISHED`

## Usage notes

* You can describe a listing only if you use a role that has the USAGE, MODIFY, or OWNERSHIP privilege on the listing.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides listing properties and metadata in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `global_name` | Global name of the listing |
| `name` | Name specified when the listing was created. |
| `owner` | The listing owner. |
| `owner_role_type` | The listing owner role type. |
| `created_on` | Date and time the listing was created. |
| `updated_on` | Date and time the listing was last updated. |
| `published_on` | Date and time the listing was last published. |
| `title` | Title specified in the listing manifest |
| `subtitle` | Sub title specified in the listing manifest |
| `description` | The listing description. |
| `listing_terms` | The listing terms. |
| `state` | State of the listing, one of:   * DRAFT * PUBLISHED * UNPUBLISHED |
| `share` | The share identifier for this listing. |
| `application_package` | The application package associated with the listing. |
| `business_needs` | The business needs the listing satisfies. |
| `usage_examples` | An example showing a query of the listing. |
| `data_attributes` | The listing’s attributes, including the refresh rate, geographic coverage, and time range. |
| `categories` | The listing categories. |
| `resources` | Listing resources, such as a documentation link. |
| `profile` | The provider’s profile name. |
| `customized_contact_info` | Provider contact information. |
| `data_dictionary` | Listing metadata. |
| `data_preview` | Preview of the listing data. |
| `comment` | Associated comment, if present. |
| `revisions` | Revision state, for public listings only. |
| `target_accounts` | Comma separated list of target accounts. |
| `regions` | The listing regions. |
| `refresh_schedule` | The listing refresh frequency in minutes. |
| `refresh_type` | The listing refresh type. |
| `review_state` | The listing review state. |
| `rejection_reason` | The reason the listing was rejected. |
| `unpublished_by_admin_reasons` | The reason the listing owner didn’t publish the listing. |
| `is_monetized` | Is monetized flag. |
| `is_application` | Is application flag. If `true`, an application package is attached to the listing. |
| `is_targeted` | Is targeted flag. |
| `is_limited_trial` | Is limited trial flag. |
| `is_by_request` | Is by request flag. |
| `limited_trial_plan` | The plan associated with a limited trial listing. |
| `retired_on` | Date and time the listing was retired. Null if not retired. |
| `scheduled_drop_time` | Date and time the listing is scheduled to be dropped (no longer available to existing consumers). Null if not scheduled. |
| `manifest_yaml` | The entire published manifest when `REVISION` is `PUBLISHED`, and the entire published manifest with draft changes when `REVISION` is `DRAFT`. |
| `distribution` | Distribution details, if present, such as `EXTERNAL`. |
| `is_mountless_queryable` | `true` If the listing can be queried without being mounted; `false` otherwise. |
| `organization_profile_name` | The associated organization profile name. |
| `uniform_listing_locator` | The uniform listing locator (ULL). For more information about ULLs, see [Configure organizational listings](../../user-guide/collaboration/listings/organizational/org-listing-configure.md). |
| `trial_details` | Details associated with trial listings. |
| `approver_contact` | Approver contact information. |
| `support_contact` | Support contact information. |
| `live_version_uri` | Full uniform resource indictor (URI) of the live version of the listing, against which stage operations can be performed. NULL if no live version exists for the listing. |
| `last_committed_version_uri` | Full URI of the last committed version of the listing. |
| `last_committed_version_name` | System-generated name for the last committed version of the listing. |
| `last_committed_version_alias` | User-specified alias for the last committed version of the listing. |
| `published_version_uri` | Full URI of the current published version of the listing. |
| `published_version_name` | System-generated name of the published version of the listing. |
| `published_version_alias` | User-specified alias for the last published version of the listing. |
| `compliance_badges` | Compliance badges associated with this listing, if any. |
| `is_share` | Is share flag. If `true`, the listing was created based on a share. |
| `monetization_version` | Monetization model that the listing uses. |
| `request_approval_type` | Listing access request type. The access request type defines how discovery targets of a listing submit access requests to the listing approver. Any one of:  * `NULL` * `REQUEST_AND_APPROVE_IN_SNOWFLAKE` indicates access requests are submitted and approved within the Snowflake environment. * `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE` indicates the provider manages access request submissions and approvals independently. The value for external listings is always `NULL`. |
| `monetization_display_order` | The order in which pricing plans and offers are displayed to consumers. |
| `legacy_uniform_listing_locator` | Specifies the legacy Uniform Listing Locator (ULL). If an existing organizational listing profile is updated to use a custom organization profile, this column includes the ULL associated with the previous default profile that continues to be valid.  If no profile updates have been made, this column is NULL.  For more information on ULLs, see [Set the Uniform Listing Locator or listing name](../../user-guide/collaboration/listings/organizational/org-listing-configure.md). |
| `share_restrictions` | A flag that indicate whether share restrictions exist on external private listings. |

## Examples

To describe the columns in a listing named `MYLISTING`, run the following command:

```sqlexample
DESC LISTING MYLISTING;
```

---
title: DESCRIBE MAINTENANCE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-maintenance-policy.md
section: SQL Commands
---

# DESCRIBE MAINTENANCE POLICY

Shows the details of a [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md).

See also:
:   [CREATE MAINTENANCE POLICY](create-maintenance-policy.md), [ALTER MAINTENANCE POLICY](alter-maintenance-policy.md), [DROP MAINTENANCE POLICY](drop-maintenance-policy.md), [SHOW MAINTENANCE POLICIES](show-maintenance-policies.md)

## Syntax

```sqlsyntax
DESCRIBE MAINTENANCE POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier of the maintenance policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY MAINTENANCE POLICY | Account |  |
| OWNERSHIP | Maintenance policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Examples

The following example describes a maintenance policy named `my_maintenance_policy`:

```sqlexample
DESCRIBE MAINTENANCE POLICY my_maintenance_policy;
```

---
title: DESCRIBE MASKING POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-masking-policy.md
section: SQL Commands
---

# DESCRIBE MASKING POLICY

Describes the details about a masking policy, including the creation date, name, data type, and SQL expression.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Masking policy DDL](../../user-guide/security-column-intro.md)

## Syntax

```sqlsyntax
DESC[RIBE] MASKING POLICY <name>
```

## Parameters

`name`
:   Identifier for the masking policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY MASKING POLICY | Account |  |
| APPLY | Masking policy |  |
| OWNERSHIP | Masking policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

```sqlexample
DESC MASKING POLICY ssn_mask;
```

```output
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
| Row | name       | signature     | return_type       | body                                                                  |
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
| 1   | SSN_MASK   | (VAL VARCHAR) | VARCHAR(16777216) | case when current_role() in ('ANALYST') then val else '*********' end |
+-----+------------+---------------+-------------------+-----------------------------------------------------------------------+
```

---
title: DESCRIBE MATERIALIZED VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/desc-materialized-view.md
section: SQL Commands
---

# DESCRIBE MATERIALIZED VIEW

Describes the columns in a materialized view.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE MATERIALIZED VIEW](create-materialized-view.md) , [DROP MATERIALIZED VIEW](drop-materialized-view.md) , [ALTER MATERIALIZED VIEW](alter-materialized-view.md) , [SHOW MATERIALIZED VIEWS](show-materialized-views.md)

## Syntax

```sqlsyntax
DESC[RIBE] MATERIALIZED VIEW <name>
```

## Parameters

`name`
:   Specifies the identifier for the materialized view to describe. If the identifier contains spaces or special characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* The command output does not include the view definition. To see the materialized view’s definition, use [SHOW MATERIALIZED VIEWS](show-materialized-views.md)
  or [GET_DDL](../functions/get_ddl.md).
* DESC MATERIALIZED VIEW and [DESCRIBE TABLE](desc-table.md) are interchangeable. Either command retrieves the details for the table
  or view that matches the criteria in the statement.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Example setup:

> ```sqlexample
> CREATE MATERIALIZED VIEW emp_view
>     AS
>     SELECT id "Employee Number", lname "Last Name", location "Home Base" FROM emp;
> ```

Describe the materialized view:

> ```sqlexample
> DESC MATERIALIZED VIEW emp_view;
> ```

```output
+-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
| name            | type         | kind   | null? | default | primary key | unique key | check | expression | comment |
|-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------|
| Employee Number | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
| Last Name       | VARCHAR(50)  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
| Home Base       | VARCHAR(100) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
+-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+
```

---
title: DESCRIBE MCP SERVER
source: https://docs.snowflake.com/en/sql-reference/sql/desc-mcp-server.md
section: SQL Commands
---

# DESCRIBE MCP SERVER

Describes the properties of an MCP (Model Context Protocol) server.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE MCP SERVER](create-mcp-server.md) , [DROP MCP SERVER](drop-mcp-server.md) , [SHOW MCP SERVERS](show-mcp-servers.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } MCP SERVER <name>
```

## Parameters

`name`
:   Specifies the identifier for the MCP server to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the MCP server. |
| `database_name` | Database that contains the MCP server. |
| `schema_name` | Schema that contains the MCP server. |
| `owner` | Role that owns the MCP server. |
| `comment` | Comment for the MCP server. |
| `server_spec` | JSON representation of the MCP server specification, including tools configuration. |
| `created_on` | Date and time when the MCP server was created. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE, MODIFY, or OWNERSHIP | MCP server |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The `server_spec` column contains the complete YAML specification that was provided when the MCP server was created, serialized as JSON.

## Examples

The following example describes the MCP server named `my_mcp_server`:

```sqlexample
DESCRIBE MCP SERVER my_mcp_server;
```

```output
+-----------------+---------------+-------------+--------------+---------+--------------------------------------------------------------------------------------------------------------------------------+----------------------------------------+
|      name       | database_name | schema_name |    owner     | comment |                                                           server_spec                                                          |               created_on               |
+-----------------+---------------+-------------+--------------+---------+--------------------------------------------------------------------------------------------------------------------------------+----------------------------------------+
| MY_MCP_SERVER   | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | [NULL]  | {"version":1,"tools":[{"name":"product-search","identifier":"db.schema.search_service","type":"CORTEX_SEARCH_SERVICE_QUERY"}]} | Fri, 23 Jun 1967 07:00:00.123000 +0000 |
+-----------------+---------------+-------------+--------------+---------+--------------------------------------------------------------------------------------------------------------------------------+----------------------------------------+
```

---
title: DESCRIBE MODEL MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/desc-model-monitor.md
section: SQL Commands
---

# DESCRIBE MODEL MONITOR

Displays information about a specific [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md).
This command displays all the information shown by the [SHOW MODEL MONITORS](show-model-monitors.md) command, plus additional information.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE MODEL MONITOR](create-model-monitor.md),
    [ALTER MODEL MONITOR](alter-model-monitor.md),
    [SHOW MODEL MONITORS](show-model-monitors.md),
    [DROP MODEL MONITOR](drop-model-monitor.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } MODEL MONITOR <monitor_name>
```

## Parameters

`monitor_name`
:   Specifies the identifier for the model monitor to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides model monitor properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the model monitor was created. |
| `name` | Name of the model monitor. |
| `database_name` | Database in which the model monitor is stored. |
| `schema_name` | Schema in which the model monitor is stored. |
| `warehouse_name` | Warehouse used to monitor the model. |
| `refresh_interval` | The refresh interval (target lag) for triggering refresh of the model monitor. |
| `aggregation_window` | The aggregation window for calculating metrics. |
| `model_task` | The task of the model being monitored, either TABULAR_BINARY_CLASSIFICATION or TABULAR_REGRESSION. |
| `monitor_state` | The state of the model monitor:   * ACTIVE: The model monitor is active and operating correctly. * SUSPENDED: Model monitoring is paused. * PARTIALLY_SUSPENDED: An error condition in which one of the underlying tables has stopped refreshing at the expected interval. See DESCRIBE for more details. * UNKNOWN: An error condition in which the state of the underlying tables cannot be identified. |
| `source` | String representation of a JSON object detailing the source table or view on which aggregations are based. If the table does not exist or is not accessible, the value is an empty string. See Table JSON object specification. |
| `baseline` | String representation of a JSON object detailing baseline table being used for monitoring, of which a clone is embedded in the model monitor object. See Table JSON object specification. |
| `model` | String representation of a JSON object containing information specifically about the model being monitored. See Model JSON object specification. |
| `comment` | Comment about the model monitor. |
| *The following columns are the additional columns displayed by DESCRIBE compared to SHOW* |  |
| `aggregation_status` | JSON object containing aggregation status for each dynamic table type.  **Keys:**   * `SOURCE_AGGREGATED` / `ACCURACY_AGGREGATED` (non-segment) * `SOURCE_AGGREGATED_<segment_column>` / `ACCURACY_AGGREGATED_<segment_column>` (segment-specific)   **Values:** `ACTIVE` or `SUSPENDED` |
| `aggregation_last_error` | JSON object containing the last error for each dynamic table type.  **Keys:** Same as `aggregation_status`  **Values:** Error message, or empty string if successful |
| `aggregation_last_data_timestamp` | JSON object containing the last update timestamp for each dynamic table type.  **Keys:** Same as `aggregation_status`  **Values:** Timestamp of last successful update |
| `columns` | A string representation of a JSON object that contains names of columns being used in the source table. See Column JSON object specification. |

### Table JSON object specification

The following is the format of the JSON representation of a table, as used by the `source` and `baseline` columns in the command output:

| `name` | Name of the source or baseline table or view. |
| --- | --- |
| `database_name` | Database in which the table or view is stored. |
| `schema_name` | Schema in which the table or view is stored. |
| `status` | The status of the table:   * ACTIVE: The table or view is accessible by the user. * MASKED: The current user does not have access to the table or view. Values of other fields appear masked (that is, as a series of asterisks). * DELETED: The table or view has been deleted. * NOT_SET: The status has not been set. |

### Model JSON object specification

The following is the format of the JSON representation of a model, as used by the `model` column in the command output:

| Field | Description |
| --- | --- |
| `model_name` | Name of the model being monitored. |
| `version_name` | Version name of the model version being monitored. |
| `function_name` | Name of the specific function being monitored in the specified model version. |
| `database_name` | Database in which the model is stored. |
| `schema_name` | Schema in which the model is stored. |
| `model_status` | The status of the model. Can be ACTIVE, MASKED, or DELETED. MASKED indicates that the user does not have access to the model; other fields show as a series of asterisks. |
| `version_status` | The status of the model version. Can be ACTIVE or DELETED. (MASKED is not a valid status for a model version, because they do not have access control.) |

### Column JSON object specification

The following is the format of the JSON representation of columns, as used by the `columns` column in the command output:

| Field | Description |
| --- | --- |
| `timestamp_column` | Name of the timestamp column in the data source. |
| `id_columns` | An array of string column names that, together, uniquely identify each row in the source data. |
| `prediction_class_columns` | An array of strings naming all prediction class columns in the data source. |
| `prediction_score_columns` | An array of strings naming all prediction score columns in the data source. |
| `actual_class_columns` | An array of strings naming all actual class columns in the data source. |
| `numerical_columns` | An array of strings naming all numerical feature columns that the model monitor uses from the source table. |
| `string_columns` | An array of strings naming all string (categorical) feature columns that the model monitor uses from the source table. |
| `boolean_columns` | An array of strings naming all Boolean (categorical) feature columns that the model monitor uses from the source table. |
| `segment_columns` | An array of strings naming all segment columns in the data source. For existing model monitors created without segments, this field will be an empty array. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Model monitor |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: DESCRIBE NETWORK POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-network-policy.md
section: SQL Commands
---

# DESCRIBE NETWORK POLICY

Describes the properties specified for a network policy.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP NETWORK POLICY](drop-network-policy.md) , [ALTER NETWORK POLICY](alter-network-policy.md) , [CREATE NETWORK POLICY](create-network-policy.md) , [SHOW NETWORK POLICIES](show-network-policies.md)

## Syntax

```sqlsyntax
DESC[RIBE] NETWORK POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the network policy to describe. If the identifier contains spaces or special characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Only the network policy owner (i.e. role with the OWNERSHIP privilege on the network policy) or higher can execute this command.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe a network policy named `mypolicy`:

> ```sqlexample
> DESC NETWORK POLICY mypolicy;
> ```
>
> ```output
> -----------------+---------------+
>       name       |     value     |
> -----------------+---------------+
>  ALLOWED_IP_LIST | 192.168.0.100 |
>  BLOCKED_IP_LIST | 192.168.0.101 |
> -----------------+---------------+
> ```

---
title: DESCRIBE NETWORK RULE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-network-rule.md
section: SQL Commands
---

# DESCRIBE NETWORK RULE

Describes the properties specified for a network rule.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP NETWORK RULE](drop-network-rule.md) , [ALTER NETWORK RULE](alter-network-rule.md) , [CREATE NETWORK RULE](create-network-rule.md) , [SHOW NETWORK RULES](show-network-rules.md)

## Syntax

```sqlsyntax
DESC[RIBE] NETWORK RULE <name>
```

## Parameters

`name`
:   Specifies the identifier for the network rule you want to describe.

    If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are case-sensitive.

## Output

The command output provides network rule properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the network rule was created. |
| `name` | Name of the network rule. |
| `database_name` | Database that contains the schema in which the network rule was created. |
| `schema_name` | Schema in which the network rule was created. |
| `owner` | Role that has the OWNERSHIP privilege on the network rule. |
| `comment` | Descriptive text associated with the network rule. |
| `type` | Value of the network rule’s `TYPE` property. |
| `mode` | Value of the network rule’s `MODE` property. |
| `value_list` | Network identifiers defined in the `VALUE_LIST` property of the network rule. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Network Rule | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe a network rule named `myrule`:

> ```sqlexample
> DESC NETWORK RULE myrule;
> ```

---
title: DESCRIBE NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/desc-notebook.md
section: SQL Commands
---

# DESCRIBE NOTEBOOK

Describes the properties of a [notebook](../../user-guide/ui-snowsight/notebooks.md).

DESCRIBE can be abbreviated to DESC.

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } NOTEBOOK <name>
```

## Parameters

`name`
:   Specifies the identifier for the notebook to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the notebook was created. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | Notebook | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object. Notebook ownerships cannot be transferred. |

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the notebook named `mybook`:

```sqlexample
DESCRIBE NOTEBOOK mybook;
```

---
title: DESCRIBE NOTIFICATION INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-notification-integration.md
section: SQL Commands
---

# DESCRIBE NOTIFICATION INTEGRATION

Describes the properties of a notification integration.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md) , [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md) , [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md) ,
    [DROP INTEGRATION](drop-integration.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } NOTIFICATION INTEGRATION <name>
```

## Parameters

`name`
:   Specifies the identifier for the notification integration to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `property` | The name of the property (see Properties of notification integrations). |
| `property_type` | The data type of the property (for example, `Boolean` or `String`). |
| `property_value` | The value assigned to the property. |
| `property_default` | The default value of the property. |

The `property` column can include the following properties of the notification integration:

Properties of notification integrations

| Property | Description |
| --- | --- |
| `ENABLED` | Specifies whether or not the notification integration is enabled. |
| `DIRECTION` | Specifies whether the notification integration supports sending notifications (`OUTBOUND`) or receiving notifications (`INBOUND`). |
| `COMMENT` | Specifies the comment for the notification integration. |
| Additional properties specific to the notification integration type. | These are the properties that you set when creating or altering the notification integration.  For more information about these properties, see the [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md) or [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md) command for the specific type. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration |  |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the properties of a notification integration named `my_notify_int`:

```sqlexample
DESC INTEGRATION my_notify_int;
```

---
title: DESCRIBE ONLINE FEATURE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-online-feature-table.md
section: SQL Commands
---

# DESCRIBE ONLINE FEATURE TABLE

Describes the columns in an [online feature table](create-online-feature-table.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE ONLINE FEATURE TABLE](create-online-feature-table.md) , [ALTER ONLINE FEATURE TABLE](alter-online-feature-table.md), [DROP ONLINE FEATURE TABLE](drop-online-feature-table.md) , [SHOW ONLINE FEATURE TABLES](show-online-feature-tables.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } ONLINE FEATURE TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the online feature table to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Online feature table | Role that has the MONITOR privilege on the online feature table. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: DESCRIBE OPENFLOW DATA PLANE INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-oflow-data-plane-integration.md
section: SQL Commands
---

# DESCRIBE OPENFLOW DATA PLANE INTEGRATION

Describes the columns in an Openflow data plane integration.

See also:
:   [ALTER OPENFLOW DATA PLANE](alter-oflow-data-plane.md), [SHOW OPENFLOW DATA PLANE INTEGRATIONS](show-oflow-data-plane-integration.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } OPENFLOW DATA PLANE INTEGRATION <name>
```

## Parameters

`name`
:   The identifier for the openflow data plane integration to describe.
    If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    See [SHOW OPENFLOW DATA PLANE INTEGRATIONS](show-oflow-data-plane-integration.md) for openflow data plane integration details, including openflow data plane integration **name**.

## Usage notes

* Openflow data plane integrations cannot be created directly, but rather are created when a deployment is created.
* To DESCRIBE an Openflow data plane integration, you must be using a role that
  has one of USAGE, MODIFY, or OWNERSHIP privilege on the data plane integration.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides properties and metadata for the Openflow data plane integration in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `enabled` | True if enabled, otherwise false. |
| `oauth_redirect_uri` | URI used for OATH2 authentication. |
| `data_plane_id` | Internal identifier for the data plane integration. |
| `event_table` | Fully qualified path to the <DATABASE>.<SCHEMA>.<EVENT TABLE NAME> is specified. |
| `comment` | Associated comment. |

## Examples

Describe the columns in the Openflow data plane integration with the specified name:

```sqlexample
DESC OPENFLOW DATA PLANE INTEGRATION edf6f909-d3ff-49d6-925f-xxxxx;
```

```output
+------------------------------------+----------------------------------+------------------+---------------+
|   enabled  |   oauth_redirect_uri  |   data_plane_id                  |   event_table    |   comment     |
+------------------------------------+----------------------------------+------------------+---------------+
|   true     |   https://...         |   edf6f909-d3ff-49d6-925f-xxxxx  |                  |   Example     |
+------------------------------------+----------------------------------+------------------+---------------+
```

---
title: DESCRIBE ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-organization-profile.md
section: SQL Commands
---

# DESCRIBE ORGANIZATION PROFILE

Describes the properties of an organization profile.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } ORGANIZATION PROFILE <name>
```

## Parameters

`name`
:   Specifies the identifier for the organization profile to describe. Must contain only uppercase characters or numbers.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. See [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The date and time when the organization profile was created. |
| `name` | The organization profile name. |
| `title` | The title of the organization profile. |
| `system_generated` | Indicates the organization profile is system generated and can’t be dropped. One of `TRUE` or `FALSE`. |
| `state` | The organization profile state. One of ACTIVE or DRAFT. |
| `description` | The description of the organization profile. |
| `owner_contact` | The contact email of the owner of the organization profile. |
| `approver_contact` | The contact email of the access approver of the organization profile. |
| `logo` | The organization profile logo URL. |
| `allowed_publishers` | The accounts that are allowed to publish the organizational listing. |
| `manifest_yaml` | The contents of the default organization profile manifest. |
| `live_version_uri` | The URI for the live organization profile version. `NONE` when the URI is unavailable. |
| `published_version_uri` | The URI for the published organization profile version. `NONE` when the URI is unavailable. |
| `published_version_name` | The name of the published organization profile version. |
| `published_version_alias` | The alias for the published organization profile version. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY | Organization profile |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example describes the organization profile named `MYORGANIZATIONPROFILE`:

```sqlexample
DESCRIBE ORGANIZATION PROFILE myorganizationprofile;
```

```output
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+----------------+--------------------------------+----------------------------------------------------------------------------------------+----------------------------------------------------------+----------------------------------------------------------+-------------------------+-------------------------+
|created_on               |name         |title                     |system_generated     |state                |description                       |owner_contact        |approver_contact     |logo            |allowed_publishers              |manifest_yaml                                                                           |live_version_uri                                          |published_version_uri                                     |published_version_name   |published_version_alias  |
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+----------------+--------------------------------+----------------------------------------------------------------------------------------+----------------------------------------------------------+----------------------------------------------------------+-------------------------+-------------------------+
|2025-01-01 01:01:01.000  |ORGPROFILE   |My Organization Profile   |FALSE                |ACTIVE               |Organization profile description  |test@test.com        |test@test.com        |urn:icon:shield |{“all_internal_accounts”: true} | title: "My Organization Profile" description: "Organization profile description". . .  |snow://organization_profile/ORGPROFILE/versions/version$1 |snow://organization_profile/ORGPROFILE/versions/version$1 |VERSION$1                |V1                       |
+-------------------------+-------------+--------------------------+---------------------+---------------------+----------------------------------+---------------------+---------------------+----------------+--------------------------------+----------------------------------------------------------------------------------------+----------------------------------------------------------+----------------------------------------------------------+-------------------------+-------------------------+
```

---
title: DESCRIBE PACKAGES POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-packages-policy.md
section: SQL Commands
---

# DESCRIBE PACKAGES POLICY

Describes the details about a packages policy.

DESCRIBE can be abbreviated to DESC.

## Syntax

```sqlsyntax
DESC[RIBE] PACKAGES POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the packages policy to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Packages policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Packages policy | Enables viewing a packages policy. Grants the ability to view the contents of a packages policy in a SHOW or DESCRIBE command and [INFORMATION_SCHEMA.CURRENT_PACKAGES_POLICY](../info-schema/current_packages_policy.md). Can be granted to a role using the [GRANT <privileges> … TO ROLE](grant-privilege.md) command. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

```sqlexample
DESC PACKAGES POLICY packages_policy_prod_1;
```

---
title: DESCRIBE PASSWORD POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-password-policy.md
section: SQL Commands
---

# DESCRIBE PASSWORD POLICY

Describes the details about a password policy.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DDL commands](../../user-guide/password-authentication.md)

## Syntax

```sqlsyntax
DESC[RIBE] PASSWORD POLICY <name>
```

## Parameters

`name`
:   Identifier for the password policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PASSWORD POLICY | Account |  |
| OWNERSHIP | Password policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on password policy DDL and privileges, see [DDL commands](../../user-guide/password-authentication.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

```sqlexample
DESC PASSWORD POLICY password_policy_prod_1;
```

```output
+-----------------------------------+----------------------------------------+-------------+-----------------------------------------------------------------------------------------------------------------------------------------------+
|   property                        |   value                                |   default   |   description                                                                                                                                 |
+-----------------------------------+----------------------------------------+-------------+-----------------------------------------------------------------------------------------------------------------------------------------------+
|   NAME                            |   PASSWORD_POLICY_PROD_1               |   null      |   Name of password policy.                                                                                                                    |
|   OWNER                           |   PROD_ADMIN                           |   null      |   Owner of password policy.                                                                                                                   |
|   COMMENT                         |   production account password policy   |   null      |   user comment associated to an object in the dictionary                                                                                      |
|   PASSWORD_MIN_LENGTH             |   12                                   |   8         |   Minimum length of new password.                                                                                                             |
|   PASSWORD_MAX_LENGTH             |   24                                   |   256       |   Maximum length of new password.                                                                                                             |
|   PASSWORD_MIN_UPPER_CASE_CHARS   |   2                                    |   1         |   Minimum number of uppercase characters in new password.                                                                                     |
|   PASSWORD_MIN_LOWER_CASE_CHARS   |   2                                    |   1         |   Minimum number of lowercase characters in new password.                                                                                     |
|   PASSWORD_MIN_NUMERIC_CHARS      |   2                                    |   1         |   Minimum number of numeric characters in new password.                                                                                       |
|   PASSWORD_MIN_SPECIAL_CHARS      |   2                                    |   0         |   Minimum number of special characters in new password.                                                                                       |
|   PASSWORD_MIN_AGE_DAYS           |   1                                    |   0         |   Period after a password is changed during which a password cannot be changed again, in days.                                                |
|   PASSWORD_MAX_AGE_DAYS           |   30                                   |   90        |   Period after which password must be changed, in days.                                                                                       |
|   PASSWORD_MAX_RETRIES            |   5                                    |   5         |   Number of attempts users have to enter the correct password before their account is locked.                                                 |
|   PASSWORD_LOCKOUT_TIME_MINS      |   30                                   |   15        |   Period of time for which users will be locked after entering their password incorrectly many times (specified by MAX_RETRIES), in minutes   |
|   PASSWORD_HISTORY                |   5                                    |   24        |   Number of most recent passwords that may not be repeated by the user                                                                        |
+-----------------------------------+----------------------------------------+-------------+-----------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: DESCRIBE PIPE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-pipe.md
section: SQL Commands
---

# DESCRIBE PIPE

Describes the properties specified for a pipe, as well as the default values of the properties.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP PIPE](drop-pipe.md) , [ALTER PIPE](alter-pipe.md) , [CREATE PIPE](create-pipe.md) , [SHOW PIPES](show-pipes.md)

## Syntax

```sqlsyntax
DESC[RIBE] PIPE <name>
```

## Parameters

`name`
:   Specifies the identifier for the pipe to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Returns results only for the pipe owner (i.e. the role with the OWNERSHIP privilege on the pipe), a role with the MONITOR or OPERATE
  privilege on the pipe, or a role with the global MONITOR EXECUTION privilege.
* To determine the current status of a pipe, query the [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) function.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides pipe properties and metadata in the following columns:

```sqlexample
| created_on | name | database_name | schema_name | definition | owner | notification_channel | comment | integration | pattern | error_integration | invalid_reason | kind |
```

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the pipe was created. |
| `name` | The name of the pipe object.  Manually created pipes: This is the name defined in the CREATE PIPE statement.  Default pipe (Snowpipe Streaming high-performance): The value is derived from the target table name; for example, MY_TABLE-STREAMING. |
| `database_name` | The name of the database that contains the Snowpipe object.  Manually created pipe: The name of the database that the pipe object belongs to.  Default pipe (Snowpipe Streaming high-performance): The name of the target table’s database. |
| `schema_name` | The name of the schema that contains the Snowpipe object.  Manually created pipe: The name of the schema that the pipe object belongs to.  Default pipe: The name of the target table’s schema. |
| `definition` | COPY statement that is used to load data from queued files into a Snowflake table. |
| `owner` | The name of the role that possesses the OWNERSHIP privilege on the pipe object.  Named pipe: The name of the role that owns the pipe, which is the role specified in the CREATE PIPE statement or granted ownership later.  Default pipe (Snowpipe Streaming high-performance): This column displays NULL. |
| `notification_channel` | Amazon Resource Name of the Amazon SQS queue for the stage that is named in the DEFINITION column. |
| `comment` | A user-provided or system-generated text string that describes the pipe object.  Named pipe: The user-defined comment that is provided during the CREATE PIPE statement.  Default pipe (Snowpipe Streaming High-Performance): A system-generated string that is always the following sentences: “Default pipe for Snowpipe Streaming High Performance ingestion to a table. Created and managed by Snowflake.” |
| `integration` | Name of the notification integration for pipes that rely on notification events to trigger data loads from Google Cloud Storage or Microsoft Azure cloud storage. |
| `pattern` | PATTERN copy option value in the [COPY INTO <table>](copy-into-table.md) statement in the pipe definition, if the copy option was specified. |
| `error_integration` | Notification integration name for pipes that rely on error events in Amazon S3 cloud storage to trigger notifications. |
| `invalid_reason` | Displays some detailed information for your pipes that may have issues. You can use the provided information to troubleshoot your pipes more effectively along with [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md). If there is no issue with the pipe, the value is NULL. |
| `kind` | The kind of the pipe, which is STAGE. |

Kafka-related columns

| Column | Description |
| --- | --- |
| `broker_integration` | Name of the external access integration used with Kafka. |
| `broker_secret` | Name of the secret used with Kafka. |
| `row_format` | Row format of records: `JSON` or `AVRO`. |
| `schema` | Schema of records represented as variant. |
| `topic` | Name of a synchronized topic. |

## Examples

Describe the `mypipe` pipe created in the examples in [CREATE PIPE](create-pipe.md):

> ```sqlexample
> desc pipe mypipe;
>
> +-------------------------------+--------+---------------+-------------+---------------------------------+----------+---------+
> | created_on                    | name   | database_name | schema_name | definition                      | owner    | comment |
> |-------------------------------+--------+---------------+-------------+---------------------------------+----------+---------|
> | 2017-08-15 06:11:05.703 -0700 | MYPIPE | MYDATABASE    | PUBLIC      | copy into mytable from @mystage | SYSADMIN |         |
> +-------------------------------+--------+---------------+-------------+---------------------------------+----------+---------+
> ```

---
title: DESCRIBE POSTGRES INSTANCE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-postgres-instance.md
section: SQL Commands
---

# DESCRIBE POSTGRES INSTANCE

Describes the properties of a [Snowflake Postgres instance](../../user-guide/snowflake-postgres/about.md).

Use this command to:

* Monitor the [state](../../user-guide/snowflake-postgres/managing-instances.md) of an instance during asynchronous operations like ALTER, CREATE, or FORK.
* Retrieve connection details such as the hostname.
* Check configuration settings like high availability status, Postgres version, and custom server settings.
* View the `origin` field to identify forked instances and their source.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE POSTGRES INSTANCE](create-postgres-instance.md), [ALTER POSTGRES INSTANCE](alter-postgres-instance.md), [DROP POSTGRES INSTANCE](drop-postgres-instance.md), [SHOW POSTGRES INSTANCES](show-postgres-instances.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } POSTGRES INSTANCE <name>
```

## Parameters

`name`
:   Specifies the identifier for the Postgres instance to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

The command returns results in a property/value format rather than columnar output. Each property appears as
a separate row with its corresponding value.

| Property | Description |
| --- | --- |
| `name` | Name of the Postgres instance. |
| `owner` | Role that owns the Postgres instance. |
| `owner_role_type` | Type of the owner role (for example, ROLE or DATABASE_ROLE). |
| `created_on` | Date and time when the Postgres instance was created. |
| `updated_on` | Date and time when the Postgres instance was last updated. |
| `type` | Type of the Postgres instance (for example, PRIMARY). |
| `host` | Hostname used to connect to the Postgres instance. |
| `privatelink_service_identifier` | Identifier for the [Private Link service](../../user-guide/admin-security-privatelink.md), if Private Link is configured for the instance. |
| `compute_family` | [Compute family](../../user-guide/snowflake-postgres/postgres-instance-sizes.md) (instance size) of the Postgres instance. |
| `storage_size_gb` | Storage size allocated to the Postgres instance, in GB. |
| `postgres_version` | Major version of Postgres running on the instance. |
| `postgres_settings` | Custom [Postgres server settings](../../user-guide/snowflake-postgres/postgres-server-settings.md) configured for the instance. |
| `high_availability` | Whether [high availability](../../user-guide/snowflake-postgres/high-availability.md) is enabled for the instance (`true` or `false`). |
| `authentication_authority` | Authentication method used for the instance (currently `POSTGRES`). |
| `maintenance_window_start` | Hour of day (0-23, UTC) when a [maintenance window](../../user-guide/snowflake-postgres/managing-instances.md) can start, or `None` if not set. |
| `state` | Current [state](../../user-guide/snowflake-postgres/managing-instances.md) of the instance. Possible values: `CREATING`, `RESTORING`, `STARTING`, `REPLAYING`, `FINALIZING`, `READY`, `RESTARTING`, `RESUMING`, `SUSPENDING`, `SUSPENDED`. |
| `comment` | Comment for the Postgres instance, or `None` if not set. |
| `origin` | Origin of the Postgres instance (for example, if forked from another instance), or `None` if not a fork. |
| `replicas` | List of [read replicas](../../user-guide/snowflake-postgres/postgres-create-replica.md) associated with the instance. |
| `operations` | Pending or in-progress operations on the instance (for example, resize, upgrade, HA enablement). |
| `network_policy` | [Network policy](../../user-guide/snowflake-postgres/postgres-network.md) attached to the instance, or `None` if not set. |
| `storage_integration` | Storage integration used by the instance, or `None` if not set. |
| `certificate` | [SSL certificate](../../user-guide/snowflake-postgres/postgres-ssl-certs.md) for secure connections to the Postgres instance. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OPERATE or OWNERSHIP | Postgres instance |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* Use this command to check the [state](../../user-guide/snowflake-postgres/managing-instances.md) of an instance during create, modify, or other
  asynchronous operations. The `operations` field is a JSON string that reflects whatever sequence of operations
  happens during a CREATE POSTGRES INSTANCE or ALTER POSTGRES INSTANCE operation. You can wait for the `operations`
  field to become empty, or for one of the tasks to have the value `ready`. The following shows an example of
  the `operations` field value near the end of an ALTER POSTGRES INSTANCE operation to change the
  COMPUTE_FAMILY setting.

```output
 {
   "upgrade" : {
     "state" : "UPGRADING",
     "start" : "2026-02-16 14:13:58.371 -0800",
     "duration" : "3m36s",
     "compute_family" : "BURST_M",
     "tasks" : [ {
       "flavor" : "resize",
       "state" : "creating"
     }, {
       "flavor" : "resize",
       "state" : "finalizing"
     }, {
       "flavor" : "resize",
       "state" : "ready"
     } ]
   }
}
```

## Examples

Describe a Postgres instance:

```sqlexample
DESCRIBE POSTGRES INSTANCE my_postgres;
```

The following shows typical output from that command:

```output
+------------------------------------------------------------------------+
| property                       | value                                 |
|--------------------------------+---------------------------------------|
| name                           | MY_TEST_INSTANCE                      |
| owner                          | ACCOUNTADMIN                          |
| owner_role_type                | ROLE                                  |
| created_on                     | 2026-01-29 10:04:59.485 -0800         |
| updated_on                     | 2026-02-16 13:21:58.018 -0800         |
| type                           | PRIMARY                               |
| host                           | my-instance-hostname.us-west-2.aws    |
|                                | .postgres.snowflake.pp                |
| privatelink_service_identifier | None                                  |
| compute_family                 | BURST_S                               |
| storage_size_gb                | 10                                    |
| postgres_version               | 18                                    |
| postgres_settings              | {}                                    |
| high_availability              | false                                 |
| authentication_authority       | POSTGRES                              |
| maintenance_window_start       | None                                  |
| state                          | READY                                 |
| comment                        | None                                  |
| origin                         | None                                  |
| replicas                       |                                       |
| operations                     | { }                                   |
| network_policy                 | None                                  |
| storage_integration            | None                                  |
| certificate                    | -----BEGIN CERTIFICATE-----           |
|                                | ... several lines of certificate ...  |
|                                | -----END CERTIFICATE-----             |
|                                |                                       |
+------------------------------------------------------------------------+
```

Use SHOW with the [flow operator](../operators-flow.md) to find an instance, then describe it:

```sqlexample
-- Find instances in a specific state
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "state", "postgres_version"
      FROM $1
      WHERE "state" = 'READY' AND "postgres_version" = '17';

-- Then describe a specific instance for full details
DESCRIBE POSTGRES INSTANCE my_postgres;
```

Use the flow operator to extract specific properties:

```sqlexample
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "property", "value"
      FROM $1
      WHERE "property" IN ('name', 'state', 'host',
        'postgres_version', 'high_availability');
```

Check the connection hostname for an instance:

```sqlexample
DESCRIBE POSTGRES INSTANCE my_postgres
  ->> SELECT "value" AS hostname
      FROM $1
      WHERE "property" = 'host';
```

---
title: DESCRIBE PRIVACY POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-privacy-policy.md
section: SQL Commands
---

# DESCRIBE PRIVACY POLICY

Describes the properties of a [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE PRIVACY POLICY](create-privacy-policy.md) , [ALTER PRIVACY POLICY](alter-privacy-policy.md) , [DROP PRIVACY POLICY](drop-privacy-policy.md) , [SHOW PRIVACY POLICIES](show-privacy-policies.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } PRIVACY POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the privacy policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

> The command output provides privacy policy properties and metadata in the following columns:
>
> | Column | Description |
> | --- | --- |
> | `name` | Name of the privacy policy. |
> | `signature` | Signature of the privacy policy. All privacy policies have the same signature, which does not accept any arguments. |
> | `return_type` | Return type of the privacy policy. All privacy policies return PRIVACY_BUDGET, which is an internal data type. |
> | `body` | SQL expression that determines whether the privacy policy returns a privacy budget, and if it does, which one. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PRIVACY POLICY | Account |  |
| APPLY | Privacy policy |  |
| OWNERSHIP | Privacy policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the privacy policy named `myprivpolicy`:

```sqlexample
DESCRIBE PRIVACY POLICY myprivpolicy;
```

```output
+--------------------+---------------+--------------------+-----------------------------------------------+
|   name             |   signature   |   return_type      |   body                                        |
+--------------------+---------------+--------------------+-----------------------------------------------+
|   MYPRIVPOLICY     |   ()          |   PRIVACY_BUDGET   |   PRIVACY_BUDGET(BUDGET_NAME=>'new_budget')   |
+--------------------+---------------+--------------------+-----------------------------------------------+
```

---
title: DESCRIBE PROCEDURE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-procedure.md
section: SQL Commands
---

# DESCRIBE PROCEDURE

Describes the specified stored procedure, including the stored procedure’s signature (i.e. arguments), return value, language, and
body (i.e. definition).

See also:
:   [DROP PROCEDURE](drop-procedure.md) , [ALTER PROCEDURE](alter-procedure.md) , [CREATE PROCEDURE](create-procedure.md) , [SHOW PROCEDURES](show-procedures.md), [SHOW USER PROCEDURES](show-user-procedures.md)

## Syntax

```sqlsyntax
DESC[RIBE] PROCEDURE <procedure_name> ( [ <arg_data_type> [ , <arg_data_type_2> ... ] ] )
```

## Usage notes

* To describe a stored procedure, you must specify the name and the argument data type(s), if any, for the stored procedure. The
  arguments are required because stored procedures support name overloading (i.e. two stored procedures in the same schema can have
  the same name as long as their argument data types are different).
* The `body` property in the output displays the code for the stored procedure.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

This example shows how to describe a stored procedure that has no parameters:

> ```javascript
> DESC PROCEDURE my_pi();
> +---------------+----------------------+
> | property      | value                |
> |---------------+----------------------|
> | signature     | ()                   |
> | returns       | FLOAT                |
> | language      | JAVASCRIPT           |
> | null handling | CALLED ON NULL INPUT |
> | volatility    | VOLATILE             |
> | execute as    | CALLER               |
> | body          |                      |
> |               |   return 3.1415926;  |
> |               |                      |
> +---------------+----------------------+
> ```

This example shows how to describe a stored procedure that has a parameter:

> ```javascript
> DESC PROCEDURE area_of_circle(FLOAT);
> +---------------+------------------------------------------------------------------+
> | property      | value                                                            |
> |---------------+------------------------------------------------------------------|
> | signature     | (RADIUS FLOAT)                                                   |
> | returns       | FLOAT                                                            |
> | language      | JAVASCRIPT                                                       |
> | null handling | CALLED ON NULL INPUT                                             |
> | volatility    | VOLATILE                                                         |
> | execute as    | OWNER                                                            |
> | body          |                                                                  |
> |               |   var stmt = snowflake.createStatement(                          |
> |               |       {sqlText: "SELECT pi() * POW($RADIUS, 2)", binds:[RADIUS]} |
> |               |       );                                                         |
> |               |   var rs = stmt.execute();                                       |
> |               |   rs.next()                                                      |
> |               |   var output = rs.getColumnValue(1);                             |
> |               |   return output;                                                 |
> |               |                                                                  |
> +---------------+------------------------------------------------------------------+
> ```

---
title: DESCRIBE PROJECTION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-projection-policy.md
section: SQL Commands
---

# DESCRIBE PROJECTION POLICY

Describes the details about a [projection policy](../../user-guide/projection-policies.md), including the creation date, name, and the SQL
expression.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Projection policy DDL reference](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
DESC[RIBE] PROJECTION POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the projection policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PROJECTION POLICY | Account |  |
| APPLY | Projection policy |  |
| OWNERSHIP | Projection policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on projection policy DDL and privileges, see [Privileges and commands](../../user-guide/projection-policies.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Describe the projection policy:

> ```sqlexample
> DESC PROJECTION POLICY do_not_project;
> ```

---
title: DESCRIBE RESULT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-result.md
section: SQL Commands
---

# DESCRIBE RESULT

Describes the columns in the result of a query.

Snowflake persists the result of a query for a period of time, after which the result is purged. The query can be from the current session or
any of your other sessions, including past sessions, as long as the limited period has not elapsed. This period is not adjustable. For
more details, see [Using Persisted Query Results](../../user-guide/querying-persisted-results.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [LAST_QUERY_ID](../functions/last_query_id.md) (Context function) , [RESULT_SCAN](../functions/result_scan.md) (Table function)

## Syntax

```sqlsyntax
DESC[RIBE] RESULT { '<query_id>' | LAST_QUERY_ID() }
```

## Parameters

`query_id` or `LAST_QUERY_ID()`
:   Specifies either the ID for a query you executed (within the last 24 hours in any session) or the
    [LAST_QUERY_ID](../functions/last_query_id.md) function, which returns the ID for a query within your current session.

## Usage notes

* To retrieve the ID for a specific query:

  > + Locate the query ID in the web interface. The History  page lists the ID along with each query; however, note
  >   that you can only use this function for queries you have executed.
  > + Execute the [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../functions/query_history.md) table function, which returns a list of queries and their IDs; however,
  >   note that you can only use this function for queries you have executed.
  > + If the query was executed in the current session, execute the [LAST_QUERY_ID](../functions/last_query_id.md) function. For example:
  >
  >   > ```sqlexample
  >   > SELECT LAST_QUERY_ID(-2);
  >   > ```
  >
  >   Note that this is equivalent to using LAST_QUERY_ID() as the input for DESC RESULT.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the columns in the result of the specified query from any of your sessions (within the previous 24 hours):

> ```sqlexample
> DESC RESULT 'f2f07bdb-6a08-4689-9ad8-a1ba968a44b6';
> ```

Describe the columns in the results from your most recent query in the current session:

> ```sqlexample
> SELECT * FROM boston_sales;
>
> +---------------+-------+-------+--------+-------------+---------------------+-------+
> | CITY          | ZIP   | STATE | SQ__FT | TYPE        | SALE_DATE           | PRICE |
> |---------------+-------+-------+--------+-------------+---------------------+-------|
> | MA-Lexington  | 40502 | MA    |    836 | Residential | 0016-01-25T00:00:00 | 59222 |
> | MA-Belmont    | 02478 | MA    |    852 | Residential | 0016-02-21T00:00:00 | 69307 |
> | MA-Winchester | 01890 | MA    |   1122 | Condo       | 0016-01-31T00:00:00 | 89921 |
> +---------------+-------+-------+--------+-------------+---------------------+-------+
>
> DESC RESULT LAST_QUERY_ID();
>
> +-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> | name      | type              | kind   | null? | default | primary key | unique key | check | expression | comment |
> |-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------|
> | CITY      | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | ZIP       | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | STATE     | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | SQ__FT    | NUMBER(38,0)      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | TYPE      | VARCHAR(16777216) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | SALE_DATE | DATE              | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> | PRICE     | NUMBER(38,0)      | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    |
> +-----------+-------------------+--------+-------+---------+-------------+------------+-------+------------+---------+
> ```

---
title: DESCRIBE ROW ACCESS POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-row-access-policy.md
section: SQL Commands
---

# DESCRIBE ROW ACCESS POLICY

Describes a row access policy, including the creation date, name, data type, and SQL expression.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Row access policy DDL](../../user-guide/security-row-intro.md)

## Syntax

```sqlsyntax
DESC[RIBE] ROW ACCESS POLICY <name>;
```

## Parameters

`name`
:   Identifier for the row access policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY ROW ACCESS POLICY | Account |  |
| APPLY | Row access policy |  |
| OWNERSHIP | Row access policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Manage row access policies](../../user-guide/security-row-intro.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

The following example describes a row access policy.

> ```sqlexample
> DESC ROW ACCESS POLICY rap_table_employee_info;
> ```
>
> ```output
> +-------------------------+-------------+-------------+------+
> | name                    |  signature  | return_type | body |
> +-------------------------+-------------+-------------+------+
> | RAP_TABLE_EMPLOYEE_INFO | (V VARCHAR) | BOOLEAN     | true |
> +-------------------------+-------------+-------------+------+
> ```

---
title: DESCRIBE SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/desc-schema.md
section: SQL Commands
---

# DESCRIBE SCHEMA

Describes the schema. For example, lists the tables and views in the schema.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER SCHEMA](alter-schema.md) , [CREATE SCHEMA](create-schema.md) , [DROP SCHEMA](drop-schema.md) , [SHOW SCHEMAS](show-schemas.md) , [UNDROP SCHEMA](undrop-schema.md)

    [SCHEMATA view](../info-schema/schemata.md) (Information Schema)

## Syntax

```sqlsyntax
DESC[RIBE] SCHEMA <schema_name>
```

## Parameters

`schema_name`
:   Specifies the [identifier](../identifiers.md) of the schema to describe.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

This demonstrates the DESCRIBE SCHEMA command:

```sqlexample
CREATE SCHEMA sample_schema_2;
USE SCHEMA sample_schema_2;

CREATE TABLE sample_table_1 (i INTEGER);

CREATE VIEW sample_view_1 AS
    SELECT i FROM sample_table_1;

CREATE MATERIALIZED VIEW sample_mview_1 AS
    SELECT i FROM sample_table_1 WHERE i < 100;

DESCRIBE SCHEMA sample_schema_2;

+-------------------------------+----------------+-------------------+
| created_on                    | name           | kind              |
|-------------------------------+----------------+-------------------|
| 2022-06-23 01:00:00.000 -0700 | SAMPLE_TABLE_1 | TABLE             |
| 2022-06-23 02:00:00.000 -0700 | SAMPLE_VIEW_1  | VIEW              |
| 2022-06-23 03:00:00.000 -0700 | SAMPLE_MVIEW_1 | MATERIALIZED_VIEW |
+-------------------------------+----------------+-------------------+
```

---
title: DESCRIBE SEARCH OPTIMIZATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-search-optimization.md
section: SQL Commands
---

# DESCRIBE SEARCH OPTIMIZATION

Describes the [search optimization configuration](../../user-guide/search-optimization/enabling.md) for a specified table and
its columns.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Search optimization service](../../user-guide/search-optimization-service.md)

## Syntax

```sqlsyntax
DESC[RIBE] SEARCH OPTIMIZATION ON <table_name>;
```

## Parameters

`table_name`
:   Specifies the identifier for the table to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Output

The command prints a table containing information on each search method and target in the search optimization configuration. The
table contains the following columns:

| Column Name | Description |
| --- | --- |
| `expression_id` | Unique identifier for a search method and target. |
| `method` | Search method for optimizing queries for a particular type of predicate:   * EQUALITY (for equality and IN predicates). * SUBSTRING (for predicates that match substrings – e.g. LIKE, ILIKE, etc.). * GEO (for predicates that use GEOGRAPHY types). |
| `target` | Column or VARIANT field that the method applies to. |
| `target_data_type` | Data type of the column or VARIANT field. |
| `active` | Specifies whether or not the expression has finished the initial build of the search access paths for the expression. |

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

See [Displaying the search optimization configuration for a table](../../user-guide/search-optimization/enabling.md).

---
title: DESCRIBE SECRET
source: https://docs.snowflake.com/en/sql-reference/sql/desc-secret.md
section: SQL Commands
---

# DESCRIBE SECRET

Describes the properties of a secret.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER SECRET](alter-secret.md) , [CREATE SECRET](create-secret.md) , [DROP SECRET](drop-secret.md) , [SHOW SECRETS](show-secrets.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } SECRET <name>
```

## Parameters

`name`
:   Specifies the identifier for the secret to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Output

The command output provides secret properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the secret was created. |
| `name` | Name of the secret. |
| `schema_name` | Name of the schema that contains the secret. |
| `database_name` | Name of the database that contains the secret. |
| `owner` | Name of the role that owns the secret. |
| `comment` | Comment for the secret or NULL if a comment is not specified. |
| `secret_type` | Either `OAUTH2`, `PASSWORD`, `GENERIC`, or `SYMMETRIC_KEY`. |
| `username` | The username that is stored in the secret. |
| `oauth_access_token_expiry_time` | The timestamp as a string when the OAuth access token expires. |
| `oauth_refresh_token_expiry_time` | The timestamp as a string when the OAuth refresh token expires or NULL if the secret does not store this value. |
| `oauth_scopes` | A comma-separated list of scopes to use when making a request from the OAuth server by a role with USAGE on the integration during the OAuth client credentials flow or NULL if there are no scopes. |
| `integration_name` | The name of the External API Authentication integration that is referenced in the secret or NULL if the secret does not reference an External API Authentication integration. |
| `algorithm` | The algorithm used, for [symmetric key secrets](create-secret.md). |
| `key_length` | Length of the key used, for [symmetric key secrets](create-secret.md). |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Secret |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Snowflake never returns the `PASSWORD` property value.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Describe the secret:

> ```sqlexample
> DESC SECRET service_now_creds_pw;
> ```

---
title: DESCRIBE SEMANTIC VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/desc-semantic-view.md
section: SQL Commands
---

# DESCRIBE SEMANTIC VIEW

Describes the properties of the logical tables, dimensions, facts, and metrics that make up a
[semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } SEMANTIC VIEW <name>
```

## Parameters

`name`
:   Specifies the identifier for the semantic view to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides the properties and metadata about the logical tables, relationships, facts, dimensions, metrics, and
the semantic view itself.

Each row in the view represents a property of:

* A logical table
* A relationship
* A fact
* A dimension
* A metric
* The semantic view itself

The following is an example of the output of the command:

```output
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
| object_kind  | object_name                  | parent_entity | property                 | property_value                         |
|--------------+------------------------------+---------------+--------------------------+----------------------------------------|
| NULL         | NULL                         | NULL          | COMMENT                  | Comment about the semantic view        |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| ...          | ...                          | ...           | ...                      | ...                                    |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | TABLE                    | CUSTOMERS                              |
| ...          | ...                          | ...           | ...                      | ...                                    |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| ...          | ...                          | ...           | ...                      | ...                                    |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| ...          | ...                          | ...           | ...                      | ...                                    |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | TABLE                    | ORDERS                                 |
| ...          | ...                          | ...           | ...                      | ...                                    |
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
```

As shown above, each row represents a property of a logical table, dimension, relationship, metric, or fact. For example:

* The first row is the value of the `comment` property of the semantic view itself.
* The second row is the value of the `base_table_database_name` property of the logical table named `customers`.

The view includes the following columns:

| Column | Description |
| --- | --- |
| `object_kind` | Type of the object that has the property for this row. The value can be one of the following:   * `TABLE` (the logical tables for the view) * `RELATIONSHIP` * `DIMENSION` * `FACT` * `METRIC` * `DERIVED_METRIC` (for [derived metrics](../../user-guide/views-semantic/sql.md)) * `CUSTOM_INSTRUCTIONS` (for [custom instructions](../../user-guide/views-semantic/sql.md)) * `NULL` (properties that apply to the semantic view itself, such as a comment) |
| `object_name` | Name of the dimension, fact, metric, logical table, or relationship that has the property for this row.  For rows that represent properties of the semantic view itself and rows that represent custom instructions, the value in this column is NULL. |
| `parent_entity` | Name of the parent entity of the dimension, fact, metric, or relationship.  The value of this column is NULL for rows that represent:   * The semantic view itself. * Properties of logical tables. * Properties of [derived metrics](../../user-guide/views-semantic/sql.md). * Properties of [custom instructions](../../user-guide/views-semantic/sql.md). |
| `property` | Name of the property of the logical table, constraint, relationship, dimension, fact, metric, custom instruction, or semantic view.  The value in this column depends on the type of the object (`object_kind`).  See the following sections for details about the properties and their possible values, based on the value in the `object_kind` column:   * For `TABLE`, see Properties for logical tables. * For `CONSTRAINT`, see Properties for constraints. * For `RELATIONSHIP`, see Properties for relationships. * For `FACT`, `DIMENSION`, and `METRIC`, see Properties for facts, dimensions, and metrics. * For `CUSTOM_INSTRUCTION`, see Properties for custom instructions. * For `NULL`, see Properties for semantic views. |
| `property_value` | Value of the property of the logical table, relationship, dimension, fact, metric, custom instruction, or semantic view. |

### Properties for logical tables

If the `object_kind` column contains `TABLE`, the `property` column can contain the following values:

| Property name | Description |
| --- | --- |
| `BASE_TABLE_DATABASE_NAME` | Name of the database containing the logical table. |
| `BASE_TABLE_SCHEMA_NAME` | Name of the schema containing the logical table. |
| `BASE_TABLE_NAME` | Name of the logical table. |
| `SYNONYMS` | [Array](../data-types-semistructured.md) of VARCHAR values, representing the synonyms for the logical table. |
| `PRIMARY_KEY` | [Array](../data-types-semistructured.md) of VARCHAR values, specifying the names of the columns that make up the primary key for the logical table. |

### Properties for constraints

If the `object_kind` column contains `CONSTRAINT`, the row represents a constraint that is used for a
[range join](../../user-guide/views-semantic/sql.md). The `property` column can contain the following values:

| Property name | Description |
| --- | --- |
| `CONSTRAINT_TYPE` | The value is `DISTINCT_RANGE`. |
| `START_COLUMN` | Specifies the name of the column that represents the start of the range. |
| `END_COLUMN` | Specifies the name of the column that represents the end of the range. |

### Properties for relationships

If the `object_kind` column contains `RELATIONSHIP`, the `property` column can contain the following values:

| Property name | Description |
| --- | --- |
| `TABLE` | Name of one of the logical tables in the relationship. |
| `FOREIGN_KEY` | Name of the column in that logical table used in the relationship. |
| `REF_TABLE` | Name of the other logical table in the relationship. |
| `REF_KEY` | One of the following values:   * For relationships that represent [range joins](../../user-guide/views-semantic/sql.md), an array that contains   JSON-formatted strings for objects with the following keys:    + The `start_column` key specifies the name of the column that represents the start of the range.   + The `end_column` key specifies the name of the column that represents the end of the range.   + The `type` key is `RANGE`. * For relationships that represent [ASOF joins](../../user-guide/views-semantic/sql.md), an array that contains the   following elements:    + The name of the column in the first table.   + A JSON object with the following fields:      - `column`: Name of the column in the second table.     - `type`: `ASOF`.  * For other types of relationships, the name of the column in the other logical table in the relationship. |

### Properties for facts, dimensions, and metrics

If the `object_kind` column contains `FACT`, `DIMENSION`, or `METRIC`, the `property` column can contain the
following values:

| Property name | Description |
| --- | --- |
| `TABLE` | Name of the logical table used to define the dimension, fact, or metric. |
| `EXPRESSION` | The SQL expression for the dimension, fact, or metric. |
| `DATA_TYPE` | The SQL data type of the evaluated SQL expression. |
| `ACCESS_MODIFIER` | `PRIVATE` for [private facts and metrics](../../user-guide/views-semantic/sql.md). `PUBLIC` for everything else. |

> **Note:**
>
> For [derived metrics](../../user-guide/views-semantic/sql.md), the `TABLE` property is not present.

In addition, if the row represents a
[dimension that uses a Cortex Search Service](../../user-guide/views-semantic/sql.md), the `property`
column can contain the following values:

| Property name | Description |
| --- | --- |
| `CORTEX_SEARCH_SERVICE_COLUMN_NAME` | The name of the column that the Cortex Search Service allows you to search on. |
| `CORTEX_SEARCH_SERVICE_DATABASE_NAME` | The name of the database that contains the Cortex Search Service. |
| `CORTEX_SEARCH_SERVICE_SCHEMA_NAME` | The name of the schema that contains the Cortex Search Service. |
| `CORTEX_SEARCH_SERVICE_NAME` | The name of the Cortex Search Service. |

### Properties for custom instructions

If the `object_kind` column contains `CUSTOM_INSTRUCTIONS`, the `property` column can contain the following values:

| Property name | Description |
| --- | --- |
| `AI_QUESTION_CATEGORIZATION` | [Custom instructions for Cortex Analyst](../../user-guide/views-semantic/sql.md) that explain how to classify questions. |
| `AI_SQL_GENERATION` | Custom instructions for Cortex Analyst that explain how to generate the SQL statement. |

### Properties for semantic views

If the `object_kind` column is NULL, the `property` column can contain the following values:

| Property name | Description |
| --- | --- |
| `COMMENT` | Comment about the semantic view. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the semantic view named `tpch_rev_analysis`:

```sqlexample
DESC SEMANTIC VIEW tpch_rev_analysis;
```

```output
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
| object_kind  | object_name                  | parent_entity | property                 | property_value                         |
|--------------+------------------------------+---------------+--------------------------+----------------------------------------|
| NULL         | NULL                         | NULL          | COMMENT                  | Comment about the semantic view        |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | CUSTOMERS                    | NULL          | BASE_TABLE_NAME          | CUSTOMER                               |
| TABLE        | CUSTOMERS                    | NULL          | PRIMARY_KEY              | ["C_CUSTKEY"]                          |
| TABLE        | CUSTOMERS                    | NULL          | COMMENT                  | Main table for customer data           |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | TABLE                    | CUSTOMERS                              |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | EXPRESSION               | customers.c_name                       |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | DATA_TYPE                | VARCHAR(25)                            |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | SYNONYMS                 | ["customer name"]                      |
| DIMENSION    | CUSTOMER_NAME                | CUSTOMERS     | COMMENT                  | Name of the customer                   |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | LINE_ITEMS                   | NULL          | BASE_TABLE_NAME          | LINEITEM                               |
| TABLE        | LINE_ITEMS                   | NULL          | PRIMARY_KEY              | ["L_ORDERKEY","L_LINENUMBER"]          |
| TABLE        | LINE_ITEMS                   | NULL          | COMMENT                  | Line items in orders                   |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | REF_TABLE                | ORDERS                                 |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | FOREIGN_KEY              | ["L_ORDERKEY"]                         |
| RELATIONSHIP | LINE_ITEM_TO_ORDERS          | LINE_ITEMS    | REF_KEY                  | ["O_ORDERKEY"]                         |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | EXPRESSION               | l_extendedprice * (1 - l_discount)     |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | DATA_TYPE                | NUMBER(25,4)                           |
| FACT         | DISCOUNTED_PRICE             | LINE_ITEMS    | COMMENT                  | Extended price after discount          |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | TABLE                    | LINE_ITEMS                             |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | EXPRESSION               | CONCAT(l_orderkey, '-', l_linenumber)  |
| FACT         | LINE_ITEM_ID                 | LINE_ITEMS    | DATA_TYPE                | VARCHAR(134217728)                     |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_DATABASE_NAME | SNOWFLAKE_SAMPLE_DATA                  |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_SCHEMA_NAME   | TPCH_SF1                               |
| TABLE        | ORDERS                       | NULL          | BASE_TABLE_NAME          | ORDERS                                 |
| TABLE        | ORDERS                       | NULL          | SYNONYMS                 | ["sales orders"]                       |
| TABLE        | ORDERS                       | NULL          | PRIMARY_KEY              | ["O_ORDERKEY"]                         |
| TABLE        | ORDERS                       | NULL          | COMMENT                  | All orders table for the sales domain  |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | TABLE                    | ORDERS                                 |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | REF_TABLE                | CUSTOMERS                              |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | FOREIGN_KEY              | ["O_CUSTKEY"]                          |
| RELATIONSHIP | ORDERS_TO_CUSTOMERS          | ORDERS        | REF_KEY                  | ["C_CUSTKEY"]                          |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | TABLE                    | ORDERS                                 |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | EXPRESSION               | AVG(orders.count_line_items)           |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | DATA_TYPE                | NUMBER(36,6)                           |
| METRIC       | AVERAGE_LINE_ITEMS_PER_ORDER | ORDERS        | COMMENT                  | Average number of line items per order |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | TABLE                    | ORDERS                                 |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | EXPRESSION               | COUNT(line_items.line_item_id)         |
| FACT         | COUNT_LINE_ITEMS             | ORDERS        | DATA_TYPE                | NUMBER(18,0)                           |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | TABLE                    | ORDERS                                 |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | EXPRESSION               | AVG(orders.o_totalprice)               |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | DATA_TYPE                | NUMBER(30,8)                           |
| METRIC       | ORDER_AVERAGE_VALUE          | ORDERS        | COMMENT                  | Average order value across all orders  |
| DIMENSION    | ORDER_DATE                   | ORDERS        | TABLE                    | ORDERS                                 |
| DIMENSION    | ORDER_DATE                   | ORDERS        | EXPRESSION               | o_orderdate                            |
| DIMENSION    | ORDER_DATE                   | ORDERS        | DATA_TYPE                | DATE                                   |
| DIMENSION    | ORDER_DATE                   | ORDERS        | COMMENT                  | Date when the order was placed         |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | TABLE                    | ORDERS                                 |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | EXPRESSION               | YEAR(o_orderdate)                      |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | DATA_TYPE                | NUMBER(4,0)                            |
| DIMENSION    | ORDER_YEAR                   | ORDERS        | COMMENT                  | Year when the order was placed         |
+--------------+------------------------------+---------------+--------------------------+----------------------------------------+
```

---
title: DESCRIBE SEQUENCE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-sequence.md
section: SQL Commands
---

# DESCRIBE SEQUENCE

Describes a sequence, including the sequence’s interval.

DESCRIBE can be abbreviated to DESC.

See also:
:   [ALTER SEQUENCE](alter-sequence.md) , [CREATE SEQUENCE](create-sequence.md) , [DROP SEQUENCE](drop-sequence.md) , [SHOW SEQUENCES](show-sequences.md)

## Syntax

```sqlsyntax
DESC[RIBE] SEQUENCE <name>
```

## Parameters

`name`
:   Specifies the identifier for the sequence to describe.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

```sqlexample
DESC SEQUENCE my_sequence;
```

---
title: DESCRIBE SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-service.md
section: SQL Commands
---

# DESCRIBE SERVICE

Describes the properties of a
[Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md) (including job services). Use this command for both a service and a service running like a job.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE SERVICE](create-service.md) , [ALTER SERVICE](alter-service.md), [DROP SERVICE](drop-service.md) , [SHOW SERVICES](show-services.md)

## Syntax

```sqlsyntax
DESC[RIBE] SERVICE <name>
```

## Parameters

`name`
:   Specifies the identifier for the service to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides service properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Snowpark Container Services service name. |
| `status` | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR` |
| `database_name` | Database in which the service is created. |
| `schema_name` | Schema in which the service is created. |
| `owner` | Role that owns the service. |
| `compute_pool` | Compute pool name where Snowflake runs the service. |
| `spec` | Service specification file. Output includes this column only if you are using the service’s owner role when executing the command. |
| `dns_name` | Snowflake-assigned DNS name of the service in this format: `service-name.unique-id.svc.spcs.internal`.  The `unique-id` is a 4-8 character long alphanumeric identifier that is unique to a particular instance of a database schema. To find the unique ID for a schema, call the SYSTEM$GET_SERVICE_DNS_DOMAIN function. For example:  ```sqlexample SELECT SYSTEM$GET_SERVICE_DNS_DOMAIN('mydb.myschema'); ```  Note the following:   * If you rename a schema, the identifier remains unchanged. * If you drop and recreate a schema with the same name, the identifier will change.   The DNS name enables service-to-service communications (see [Tutorial 4](../../developer-guide/snowpark-container-services/tutorials/advanced/tutorial-4.md)). |
| `current_instances` | The current number of instances for the service. |
| `target_instances` | The target number of service instances that should be running as determined by Snowflake.  When the `current_instances` value is not equal to the `target_instances` value, Snowflake is either in the process of shutting down or launching service instances.  For example,   * Suppose you create a service with MIN_INSTANCES = 1 and MAX_INSTANCES = 3. While the service is running, Snowflake might   determine that one instance is not enough. In this case, the value of `target_instances` will increase, indicating Snowflake is in the process of launching additional instances.  It is also possible that the `target_instances` value is less than the `current_instances` value, which indicates that Snowflake is   in the process of reducing the number of running instances. * If you create services but the compute pool doesn’t have capacity for the minimum number of instances that you requested, the   value of `target_instances` will be equal to the value of `min_instances`. The value of `current_instances` will be less than the value of `target_instances`. |
| `min_ready_instances` | Indicates the minimum number of service instances that must be ready for Snowflake to consider the service to be ready to process requests. |
| `min_instances` | Minimum number of service instances Snowflake should run. |
| `max_instances` | Maximum number of service instances that Snowflake can scale when needed. |
| `auto_resume` | If true, Snowflake auto-resumes the service, if suspended, when service function is called or when an incoming request (ingress) is received (see [Using a service](../../developer-guide/snowpark-container-services/working-with-services.md)). |
| `external_access_integrations` | List of external access integrations associated with the service. For more information, see [Configure service egress](../../developer-guide/snowpark-container-services/service-network-communications.md). |
| `created_on` | Timestamp when the service was created. |
| `updated_on` | Timestamp when the service was last updated. |
| `resumed_on` | Timestamp when the service was last resumed. |
| `suspended_on` | Timestamp when the service was last suspended. `suspended_on` is set when Snowflake suspends a service and remains unchanged even after the service is resumed. If `suspended_on` is NULL, the service was never suspended. |
| `auto_suspend_secs` | Number of seconds of inactivity after which Snowflake automatically suspends the service. If `auto_suspend_secs` is set to 0 or never set, Snowflake does not automatically suspend the service. |
| `comment` | Service related comment. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `query_warehouse` | When a service container connects to Snowflake to execute a query and does not explicitly specify a warehouse to use, Snowflake uses this warehouse as default. |
| `is_job` | `true` if the service is a job service; `false` otherwise. |
| `is_async_job` | `true` if the job service is running asynchronously. By default, Snowflake executes the job services synchronously. This column is included in the output of the DESC SERVICE, SHOW SERVICES, and SHOW JOB SERVICES commands but not in the output of the SHOW SERVICES EXCLUDING JOBS command. |
| `spec_digest` | The unique and immutable identifier representing the service spec content.  To observe the changes to the value of the `spec_digest` column over time, a service user might execute the SHOW SERVICES command periodically. If the service user notices a change in value, they can infer that the service was upgraded. |
| `is_upgrading` | TRUE, if Snowflake is in the process of upgrading the service. |
| `managing_object_domain` | The domain of the managing object (for example, the domain of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |
| `managing_object_name` | The name of the managing object (for example, the name of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, MONITOR or OPERATE | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the service named `echo_service`:

```sqlexample
DESCRIBE SERVICE echo_service;
```

```output
+--------------+---------+---------------+-------------+-----------+-----------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------+
| name         | status  | database_name | schema_name | owner     | compute_pool          | spec                                                                                                                                                         | dns_name                            | current_instances | target_instances | min_ready_instances | min_instances | max_instances | auto_resume | external_access_integrations | created_on                    | updated_on                    | resumed_on | suspended_on | auto_suspend_secs | comment | owner_role_type | query_warehouse | is_job | is_async_job | spec_digest                                                      | is_upgrading | managing_object_domain | managing_object_name |
|--------------+---------+---------------+-------------+-----------+-----------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------|
| ECHO_SERVICE | RUNNING | TUTORIAL_DB   | DATA_SCHEMA | TEST_ROLE | TUTORIAL_COMPUTE_POOL | ---                                                                                                                                                          | echo-service.k3m6.svc.spcs.internal |                 1 |                1 |                   1 |             1 |             1 | true        | NULL                         | 2024-11-29 12:12:47.310 -0800 | 2024-11-29 12:12:48.843 -0800 | NULL       | NULL         |                 0 | NULL    | ROLE            | NULL            | false  | false        | edaf548eb0c2744a87426529b53aac75756d0ea1c0ba5edb3cbb4295a381f2b4 | false        | NULL                   | NULL                 |
|              |         |               |             |           |                       | spec:                                                                                                                                                        |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |   containers:                                                                                                                                                |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |   - name: "echo"                                                                                                                                             |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     image: "sfengineering-prod1-snowservices-test2.registry.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest" |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     sha256: "@sha256:d04a2d7b7d9bd607df994926e3cc672edcb541474e4888a01703e8bb0dd3f173"                                                                       |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     env:                                                                                                                                                     |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       SERVER_PORT: "8000"                                                                                                                                    |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       CHARACTER_NAME: "Bob"                                                                                                                                  |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     readinessProbe:                                                                                                                                          |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       port: 8000                                                                                                                                             |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       path: "/healthcheck"                                                                                                                                   |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     resources:                                                                                                                                               |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       limits:                                                                                                                                                |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |         memory: "6Gi"                                                                                                                                        |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |         cpu: "1"                                                                                                                                             |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |       requests:                                                                                                                                              |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |         memory: "0.5Gi"                                                                                                                                      |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |         cpu: "0.5"                                                                                                                                           |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |   endpoints:                                                                                                                                                 |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |   - name: "echoendpoint"                                                                                                                                     |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     port: 8000                                                                                                                                               |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |     public: true                                                                                                                                             |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
|              |         |               |             |           |                       |                                                                                                                                                              |                                     |                   |                  |                     |               |               |             |                              |                               |                               |            |              |                   |         |                 |                 |        |              |                                                                  |              |                        |                      |
+--------------+---------+---------------+-------------+-----------+-----------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------+
```

---
title: DESCRIBE SESSION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-session-policy.md
section: SQL Commands
---

# DESCRIBE SESSION POLICY

Describes the details about a session policy.

DESCRIBE can be abbreviated to DESC.

See also:
:   [Session Policy DDL Reference](../../user-guide/session-policies-managing.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } SESSION POLICY <name>
```

## Parameters

`name`
:   Identifier for the session policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY SESSION POLICY | Account |  |
| OWNERSHIP | Session policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on session policy DDL and privileges, see [Managing session policies](../../user-guide/session-policies-managing.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The timestamp when the session policy was created. |
| `name` | Identifier for the session policy. |
| `session_idle_timeout_mins` | For Snowflake Clients and programmatic clients, the number of minutes in which a session can be idle before users must authenticate to Snowflake again. |
| `session_ui_idle_timeout_mins` | For Snowsight, the number of minutes in which a session can be idle before users must authenticate to Snowflake again. |
| `allowed_secondary_roles` | The secondary roles for a session policy, if any. |
| `comment` | Comment for the session policy. |

## Example

```sqlexample
DESC SESSION POLICY session_policy_prod_1;
```

```output
+---------------------------------+-----------------------+---------------------------+------------------------------+-------------------------+--------------------------------------------------+
| created_on                       | name                 | session_idle_timeout_mins | session_ui_idle_timeout_mins | allowed_secondary_roles |  comment                                         |
+---------------------------------+-----------------------+---------------------------+------------------------------+-------------------------+--------------------------------------------------+
| Mon, 11 Jan 2021 00:00:00 -0700 | session_policy_prod_1 | 60                        | 30                           |           []            | session policy for use in the prod_1 environment |
+---------------------------------+-----------------------+---------------------------+------------------------------+-------------------------+--------------------------------------------------+
```

---
title: DESCRIBE SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-share.md
section: SQL Commands
---

# DESCRIBE SHARE

Describes the data objects that are included in a [share](../../user-guide/data-sharing-intro.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP SHARE](drop-share.md) , [ALTER SHARE](alter-share.md) , [CREATE SHARE](create-share.md) , [SHOW SHARES](show-shares.md)

## Syntax

**Providers (outbound share)**

```sqlsyntax
DESC[RIBE] SHARE <name>
```

**Consumers (inbound share)**

```sqlsyntax
DESC[RIBE] SHARE <provider_account>.<share_name>
```

## Parameters

`name`
:   Specifies the identifier for the outbound share to describe. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`provider_account.share_name`
:   Specifies the fully-qualified identifier for the inbound share to describe.

## Usage notes

* Only the ACCOUNTADMIN role has the privileges to describe a share. Executing this command with any role other than ACCOUNTADMIN returns
  an error.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

* The output of the command is different depending on whether you are a provider or consumer:

  + For providers, the names of the objects in the share are prefixed with the database name.
  + For consumers, the names of the objects in the share are prefixed with a database name only if a database has been created from the share.
    If a database has not been created from the share, the objects are prefixed with `<DB>`.
* The `kind` column in the output displays the type of the objects in the share.

## Examples

As a provider, display the objects in the `sales_s` share:

> ```sqlexample
> DESC SHARE sales_s;
>
> +----------+--------------------------------------+-------------------------------+
> | kind     | name                                 | shared_on                     |
> |----------+--------------------------------------+-------------------------------|
> | DATABASE | SALES_DB                             | 2017-06-15 17:03:16.642 -0700 |
> | SCHEMA   | SALES_DB.AGGREGATES_EULA             | 2017-06-15 17:03:16.790 -0700 |
> | TABLE    | SALES_DB.AGGREGATES_EULA.AGGREGATE_1 | 2017-06-15 17:03:16.963 -0700 |
> +----------+--------------------------------------+-------------------------------+
> ```

As a consumer, display the objects in the `sales_s` share provided by account `ab67890`:

> ```sqlexample
> DESC SHARE ab67890.sales_s;
>
> +----------+----------------------------------+---------------------------------+
> | kind     | name                             | shared_on                       |
> |----------+----------------------------------+---------------------------------|
> | DATABASE | <DB>                             | Thu, 15 Jun 2017 17:03:16 -0700 |
> | SCHEMA   | <DB>.AGGREGATES_EULA             | Thu, 15 Jun 2017 17:03:16 -0700 |
> | TABLE    | <DB>.AGGREGATES_EULA.AGGREGATE_1 | Thu, 15 Jun 2017 17:03:16 -0700 |
> +----------+----------------------------------+---------------------------------+
> ```
>
> In this example, a database has not yet been created in the consumer’s account from the `sales_s` share.

---
title: DESCRIBE SNAPSHOT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-snapshot.md
section: SQL Commands
---

# DESCRIBE SNAPSHOT

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Describes the properties of a [snapshot of a block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE SNAPSHOT](create-snapshot.md) , [ALTER SNAPSHOT](alter-snapshot.md), [DROP SNAPSHOT](drop-snapshot.md), [SHOW SNAPSHOTS](show-snapshots.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } SNAPSHOT <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the snapshot. |
| `state` | One of the following values, which indicates the current status of the snapshot:   * INITIALIZED: The snapshot creation is in progress. * CREATED: The snapshot is created and can be used to create a volume. * ERROR: Snapshot creation failed. |
| `database_name` | Database in which the snapshot is created. |
| `schema_name` | Schema in which the snapshot is created. |
| `service_name` | Fully qualified service name from which the snapshot is created. |
| `volume_name` | Volume from the specified service instance for which the snapshot is created. |
| `instance` | ID of the service instance. |
| `size` | Size (in GB) of the snapshot. |
| `comment` | General comment about the snapshot. |
| `owner` | Role that owns the snapshot. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `created_on` | Date and time when the snapshot was created. |
| `encryption` | Encryption type configured for the volume, from which the snapshot was created. Possible values include `SNOWFLAKE_SSE` and `SNOWFLAKE_FULL`. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or USAGE | Snapshot | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the snapshot named `my_snapshot`:

```sqlexample
DESC SNAPSHOT my_snapshot;
```

Output:

```output
+-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------+
| name        | state   | database_name | schema_name | service_name                                       | volume_name | instance | size | comment      | owner     | owner_role_type | created_on                    | updated_on                    | encryption    |
|-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------|
| MY_SNAPSHOT | CREATED | TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_DB.DATA_SCHEMA.MY_SERVICE_WITH_EBS_VOLUME | block-vol1  | 0        |   10 | new snapshot | TEST_ROLE | ROLE            | 2024-05-09 21:36:58.502 -0700 | 2024-05-09 21:38:03.424 -0700 | SNOWFLAKE_SSE |
+-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------+
```

---
title: DESCRIBE SNAPSHOT POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-snapshot-policy.md
section: SQL Commands
---

# DESCRIBE SNAPSHOT POLICY

Describes a specific [snapshot policy](../../user-guide/backups.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md),
    [ALTER SNAPSHOT POLICY — Deprecated](alter-snapshot-policy.md),
    [DROP SNAPSHOT POLICY — Deprecated](drop-snapshot-policy.md),
    [SHOW SNAPSHOT POLICIES — Deprecated](show-snapshot-policies.md)

## Syntax

```sqlsyntax
DESC[RIBE] SNAPSHOT POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot policy to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

> **Note:**
>
> The snapshot policy is an object that’s inside a specific schema and database. Therefore, the policy
> gets replicated, dropped or undropped, and so on, when those operations are performed on the schema and database
> that contain it. If you can’t drop the snapshot policy because it’s associated with any snapshot sets,
> then you also can’t drop the schema or database containing the policy.

To determine whether a snapshot policy is associated with any snapshot sets, use the SHOW SNAPSHOT SETS command.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp snapshot policy was created. |
| `name` | Name of snapshot policy. |
| `database_name` | Name of database that contains the snapshot policy. |
| `schema_name` | Name of schema that contains the snapshot policy. |
| `owner` | Name of the role with the OWNERSHIP privilege on the snapshot policy. |
| `comment` | Comment for snapshot policy. |
| `schedule` | Schedule for snapshot creation. |
| `expire_after_days` | Number of days after snapshot creation when snapshot expires. |
| `has_retention_lock` | Indicates whether the policy includes a retention lock.  `Y` if policy has retention lock; `N` otherwise.  For more information, see [Retention lock](../../user-guide/backups.md). |
| `owner` | Name of the role with the OWNERSHIP privilege on the snapshot set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the snapshot policy. |

## Examples

Describe a snapshot policy:

```sqlexample
DESC SNAPSHOT POLICY my_snapshot_policy;
```

---
title: DESCRIBE SNAPSHOT SET
source: https://docs.snowflake.com/en/sql-reference/sql/desc-snapshot-set.md
section: SQL Commands
---

# DESCRIBE SNAPSHOT SET

Describes a specific [snapshot set](../../user-guide/backups.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md),
    [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md),
    [DROP SNAPSHOT SET — Deprecated](drop-snapshot-set.md),
    [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md)

## Syntax

```sqlsyntax
DESC[RIBE] SNAPSHOT SET <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot set to describe. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp that the snapshot set was created. |
| `name` | Name of the snapshot set. |
| `database_name` | Name of the database that contains the snapshot set. |
| `schema_name` | Name of the schema that contains the snapshot set. |
| `object_kind` | Type of the object that the snapshot set is snapshotting. |
| `object_name` | Name of the object that the snapshot set is snapshotting. |
| `object_database_name` | Name of the database that contains the object being snapshotted by this snapshot set. |
| `object_schema_name` | Name of the schema that contains the object being snapshotted by this snapshot set. |
| `snapshot_policy_name` | Name of the snapshot policy attached to this snapshot set. |
| `snapshot_policy_database_name` | Name of the database that contains the snapshot policy. |
| `snapshot_policy_schema_name` | Name of the schema that contains the snapshot policy. |
| `snapshot_policy_state` | Current state of the snapshot policy. |
| `owner_role` | Name of the role with the OWNERSHIP privilege on the snapshot set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the snapshot set. |
| `comment` | Comment for backup set. |

## Examples

Describe a snapshot set:

```sqlexample
DESC SNAPSHOT SET my_snapshot_set;
```

---
title: DESCRIBE SPECIFICATION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-specification.md
section: SQL Commands
---

# DESCRIBE SPECIFICATION

Describes the details about an [app specification](../../developer-guide/native-apps/requesting-app-specs.md).

## Syntax

```sqlsyntax
{ DESCRIBE | DESC }  SPECIFICATION <name> [ IN APPLICATION <app_name> ];
```

## Parameters

`IN APPLICATION app_name`
:   Specifies the name of the app whose app specification you want to view.

## Usage notes

* Consumers must provide the name of an app using the IN APPLICATION clause.
* An app can run this command without specifying the
  IN APPLICATION clause.

## Output

This command displays the following output:

| Column | Description |
| --- | --- |
| `sequenceNumber` | ID for a version of an app specification. This value is incremented each time a provider changes the [app specification definition](../../developer-guide/native-apps/requesting-app-specs.md). |
| `property` | The name of the property of the app specification. |
| `value` | The value of the property. |

---
title: DESCRIBE STAGE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-stage.md
section: SQL Commands
---

# DESCRIBE STAGE

Describes the values specified for the properties in a stage (file format, copy, and location), as well as the default values for
each property.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP STAGE](drop-stage.md) , [ALTER STAGE](alter-stage.md) , [CREATE STAGE](create-stage.md) , [SHOW STAGES](show-stages.md)

## Syntax

```sqlsyntax
DESC[RIBE] STAGE <name>
```

## Parameters

`name`
:   Specifies the identifier for the stage to describe. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides stage properties and metadata in the following columns.

| Column | Description |
| --- | --- |
| `parent_property` | The parent property to which each stage property belongs. Possible values include STAGE_FILE_FORMAT, STAGE_COPY_OPTIONS, STAGE_LOCATION, STAGE_CREDENTIALS, and DIRECTORY. |
| `property` | The name of the property. For property descriptions, refer to [CREATE STAGE](create-stage.md). |
| `property_type` | The property type. |
| `property_value` | The value assigned to the property. |
| `property_default` | The default property value. |

> **Note:**
>
> For stages with a directory table, the output includes a property named LAST_REFRESHED_ON of type TIMESTAMP. LAST_REFRESHED_ON indicates when the
> metadata for the directory table was last synchronized with the associated files on the stage, either manually or automatically.

## Examples

Describe an internal stage named `my_s3_stage`:

> ```sqlexample
> DESC STAGE my_s3_stage;
> +--------------------+--------------------------------+---------------+-------------------------------------------------------+------------------+
> | parent_property    | property                       | property_type | property_value                                        | property_default |
> |--------------------+--------------------------------+---------------+-------------------------------------------------------+------------------|
> | STAGE_FILE_FORMAT  | TYPE                           | String        | CSV                                                   | CSV              |
> | STAGE_FILE_FORMAT  | RECORD_DELIMITER               | String        | \n                                                    | \n               |
> | STAGE_FILE_FORMAT  | FIELD_DELIMITER                | String        | ,                                                     | ,                |
> | STAGE_FILE_FORMAT  | FILE_EXTENSION                 | String        |                                                       |                  |
> | STAGE_FILE_FORMAT  | SKIP_HEADER                    | Integer       | 0                                                     | 0                |
> | STAGE_FILE_FORMAT  | DATE_FORMAT                    | String        | AUTO                                                  | AUTO             |
> | STAGE_FILE_FORMAT  | TIME_FORMAT                    | String        | AUTO                                                  | AUTO             |
> | STAGE_FILE_FORMAT  | TIMESTAMP_FORMAT               | String        | AUTO                                                  | AUTO             |
> | STAGE_FILE_FORMAT  | BINARY_FORMAT                  | String        | HEX                                                   | HEX              |
> | STAGE_FILE_FORMAT  | ESCAPE                         | String        | NONE                                                  | NONE             |
> | STAGE_FILE_FORMAT  | ESCAPE_UNENCLOSED_FIELD        | String        | \\                                                    | \\               |
> | STAGE_FILE_FORMAT  | TRIM_SPACE                     | Boolean       | false                                                 | false            |
> | STAGE_FILE_FORMAT  | FIELD_OPTIONALLY_ENCLOSED_BY   | String        | NONE                                                  | NONE             |
> | STAGE_FILE_FORMAT  | NULL_IF                        | List          | [\\N]                                                 | [\\N]            |
> | STAGE_FILE_FORMAT  | COMPRESSION                    | String        | AUTO                                                  | AUTO             |
> | STAGE_FILE_FORMAT  | ERROR_ON_COLUMN_COUNT_MISMATCH | Boolean       | true                                                  | true             |
> | STAGE_FILE_FORMAT  | VALIDATE_UTF8                  | Boolean       | true                                                  | true             |
> | STAGE_FILE_FORMAT  | SKIP_BLANK_LINES               | Boolean       | false                                                 | false            |
> | STAGE_FILE_FORMAT  | REPLACE_INVALID_CHARACTERS     | Boolean       | false                                                 | false            |
> | STAGE_FILE_FORMAT  | EMPTY_FIELD_AS_NULL            | Boolean       | true                                                  | true             |
> | STAGE_FILE_FORMAT  | SKIP_BYTE_ORDER_MARK           | Boolean       | true                                                  | true             |
> | STAGE_FILE_FORMAT  | ENCODING                       | String        | UTF8                                                  | UTF8             |
> | STAGE_COPY_OPTIONS | ON_ERROR                       | String        | ABORT_STATEMENT                                       | ABORT_STATEMENT  |
> | STAGE_COPY_OPTIONS | SIZE_LIMIT                     | Long          |                                                       |                  |
> | STAGE_COPY_OPTIONS | PURGE                          | Boolean       | false                                                 | false            |
> | STAGE_COPY_OPTIONS | RETURN_FAILED_ONLY             | Boolean       | false                                                 | false            |
> | STAGE_COPY_OPTIONS | ENFORCE_LENGTH                 | Boolean       | true                                                  | true             |
> | STAGE_COPY_OPTIONS | TRUNCATECOLUMNS                | Boolean       | false                                                 | false            |
> | STAGE_COPY_OPTIONS | FORCE                          | Boolean       | false                                                 | false            |
> | STAGE_LOCATION     | URL                            | String        | ["s3://EXAMPLE-S3-PATH/my-csvfiles/"] |                  |
> | STAGE_CREDENTIALS  | AWS_KEY_ID                     | String        |                                                       |                  |
> | DIRECTORY          | LAST_REFRESHED_ON              | Timestamp     | 2023-05-03 12:50:28.000 -0700                         |                  |
> | DIRECTORY          | ENABLE                         | Boolean       | true                                                  | false            |
> | DIRECTORY          | AUTO_REFRESH                   | Boolean       | false                                                 | false            |
> +--------------------+--------------------------------+---------------+-------------------------------------------------------+------------------+
> ```

---
title: DESCRIBE STORAGE LIFECYCLE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/desc-storage-lifecycle-policy.md
section: SQL Commands
---

# DESCRIBE STORAGE LIFECYCLE POLICY

Describes the properties of a [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE STORAGE LIFECYCLE POLICY](create-storage-lifecycle-policy.md) , [ALTER STORAGE LIFECYCLE POLICY](alter-storage-lifecycle-policy.md) , [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md) , [SHOW STORAGE LIFECYCLE POLICIES](show-storage-lifecycle-policies.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } STORAGE LIFECYCLE POLICY <policy_name>
```

## Parameters

`policy_name`
:   Specifies the identifier for the policy to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | The name of the policy. |
| `signature` | The columns the policy uses to evaluate rows for expiration. |
| `return_type` | A VARCHAR value that contains the data type of the return value. For example BOOLEAN, NUMBER, ARRAY, and OBJECT. |
| `body` | The function body that evaluates whether a row should be expired. |
| `archive_tier` | The archive storage tier; COOL or COLD. |
| `archive_for_days` | The (optional) number of days to archive table rows before expiration. If this property isn’t set, the value is NULL. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY STORAGE LIFECYCLE POLICY | Account | Allows DESC on all storage lifecycle policies in the account. |
| APPLY | Storage lifecycle policy | Allows DESC on the storage lifecycle policy. |
| OWNERSHIP | Storage lifecycle policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* If you’ve ever enabled archive storage for a policy, the `archive_tier` property in the command output
  shows the archive tier (COOL or COLD) that was set. This is true even
  if you transition the archival policy into an expiration policy by using ALTER STORAGE LIFECYCLE POLICY to unset the ARCHIVE_FOR_DAYS parameter.
  You can’t change the archive tier after setting it.

## Examples

The following example describes the storage lifecycle policy named `my_storage_lifecycle_policy`:

```sqlexample
DESCRIBE STORAGE LIFECYCLE POLICY example_slp;
```

Output:

```output
+-----------------------------+----------------+-------------+------+------------------+
| name                        | signature      | return_type | body | archive_for_days |
|-----------------------------+----------------+-------------+------+------------------|
| MY_STORAGE_LIFECYCLE_POLICY | (ARG1 BOOLEAN) | BOOLEAN     | arg1 |              365 |
+-----------------------------+----------------+-------------+------+------------------+
```

---
title: DESCRIBE STREAM
source: https://docs.snowflake.com/en/sql-reference/sql/desc-stream.md
section: SQL Commands
---

# DESCRIBE STREAM

Describes the properties specified for a stream.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP STREAM](drop-stream.md) , [ALTER STREAM](alter-stream.md) , [CREATE STREAM](create-stream.md) , [SHOW STREAMS](show-streams.md)

## Syntax

```sqlsyntax
DESC[RIBE] STREAM <name>
```

## Parameters

`name`
:   Specifies the identifier for the stream to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Create an example stream:

> ```sqlexample
> CREATE STREAM mystream ( ... );
> ```

Describe the columns in the stream:

> ```sqlexample
> DESC STREAM mystream;
> ```

---
title: DESCRIBE STREAMLIT
source: https://docs.snowflake.com/en/sql-reference/sql/desc-streamlit.md
section: SQL Commands
---

# DESCRIBE STREAMLIT

Describes the columns in a Streamlit object.

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE STREAMLIT](create-streamlit.md), [SHOW STREAMLITS](show-streamlits.md), [ALTER STREAMLIT](alter-streamlit.md), [DROP STREAMLIT](drop-streamlit.md)

## Syntax

```sqlsyntax
DESC[RIBE] STREAMLIT <name>
```

## Required parameters

`name`
:   Specifies the identifier for the Streamlit object to describe. If the identifier contains spaces or special
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive.

## Access control requirements

If your role does not own the objects in the following table, then your role
must have the listed
[privileges](../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object |
| --- | --- |
| USAGE | Streamlit object that you describe |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides information about a Streamlit object in the following columns:

| Column | Description |
| --- | --- |
| `title` | Title of the Streamlit object that displays in Snowsight. |
| `main_file` | Name of the Streamlit app’s entrypoint file. |
| `query_warehouse` | Warehouse where queries issued by the Streamlit app are run. |
| `runtime_name` | Runtime environment for the Streamlit app, like `SYSTEM$WAREHOUSE_RUNTIME` or `SYSTEM$ST_CONTAINER_RUNTIME_PY3_11`. |
| `compute_pool` | Compute pool where the Streamlit app runs. This is only used for container runtimes and ignored for warehouse runtimes. |
| `url_id` | Unique ID associated with the Streamlit object. |
| `default_packages` | Default Python packages for the Streamlit application. |
| `user_packages` | Python packages that the user specified in the `environment.yml` file. This is empty if there is no `environment.yml` file and doesn’t apply to container runtimes. |
| `import_urls` | List of URLs that the Streamlit app imports. This doesn’t apply to container runtimes. |
| `external_access_integrations` | List of external access integrations associated with the Streamlit object. |
| `external_access_secrets` | List of external access secrets associated with the Streamlit object. |
| `name` | Unique name of the Streamlit object within its schema. |
| `comment` | Comment associated with the Streamlit object. |
| `default_version` | Default version of the Streamlit object to use when there is no live version. If your app doesn’t already have a live version and the owner opens the app on Snowsight, this is the version that is copied to the live version. |
| `default_version_name` | Name of the default version directory within the Streamlit object’s file system. |
| `default_version_alias` | Unsupported and always null. |
| `default_version_location_uri` | Location URI of the default version’s app files. This is read only. |
| `default_version_source_location_uri` | Location URI of the default version’s source files in its Git object. If the Streamlit object is not connected to a Git object, this is null. |
| `default_version_git_commit_hash` | Git commit hash of the default version of the Streamlit object. If the Streamlit object is not connected to a Git object, this is null. |
| `last_version_name` | Name of the last version directory within the Streamlit object’s file system. |
| `last_version_alias` | Unsupported and always null. |
| `last_version_location_uri` | Location URI of the last version’s app files. This is read only. |
| `last_version_source_location_uri` | Location URI of the last version’s source files in its Git object. If the Streamlit object is not connected to a Git object, this is null. |
| `last_version_git_commit_hash` | Git commit hash of the last version of the Streamlit object. If the Streamlit object is not connected to a Git object, this is null. |
| `live_version_location_uri` | Location URI of the live version of the Streamlit object. This location is readable and writable. Edits in Snowsight are saved in this location. You can remotely update a live app by copying files to this location. |

For Streamlit objects created using the `ROOT_LOCATION` parameter, the command output provides information in the following columns:

| Column | Description |
| --- | --- |
| `name` | Unique name of the Streamlit object within its schema. |
| `title` | Title of the Streamlit object that displays in Snowsight. |
| `root_location` | Location of the Streamlit object’s files. |
| `main_file` | Path to the Streamlit app’s entrypoint file, relative to the root location. |
| `query_warehouse` | Warehouse where queries issued by the Streamlit app are run. |
| `url_id` | Unique ID associated with the Streamlit object. |
| `default_packages` | Default Python packages for the Streamlit app. |
| `user_packages` | Python packages that the user specified in the `environment.yml` file. This is empty if there is no `environment.yml` file. |
| `import_urls` | List of URLs that the Streamlit app imports. |
| `external_access_integrations` | List of external access integrations associated with the Streamlit object. |
| `external_access_secrets` | List of external access secrets associated with the Streamlit object. |

---
title: DESCRIBE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-table.md
section: SQL Commands
---

# DESCRIBE TABLE

Describes either the columns in a table or the set of stage properties for the table (current values and default values).

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP TABLE](drop-table.md) , [ALTER TABLE](alter-table.md) , [CREATE TABLE](create-table.md) , [SHOW TABLES](show-tables.md)

    [DESCRIBE VIEW](desc-view.md)

## Syntax

```sqlsyntax
{ DESCRIBE | DESC } TABLE <name> [ TYPE =  { COLUMNS | STAGE } ]
```

## Parameters

`name`
:   Specifies the identifier for the table to describe. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`TYPE = COLUMNS | STAGE`
:   Specifies whether to display the columns for the table or the set of stage properties for the table (current values and default values).

    Default: `TYPE = COLUMNS`

## Usage notes

* This command does not show the object parameters for a table. Instead, use [SHOW PARAMETERS IN TABLE](show-parameters.md).
* DESCRIBE TABLE and [DESCRIBE VIEW](desc-view.md) are interchangeable. Both commands return details for the specified table or view; however,
  `TYPE = STAGE` does not apply for views because views do not have stage properties.
* If schema evolution is enabled on the table, the output contains a `schema_evolution_record` column. This column was introduced with the [2023_08 Bundle (Generally Enabled)](../../release-notes/bcr-bundles/2023_08_bundle.md). For more information, see [Enable automatic table schema evolution](../../user-guide/data-load-schema-evolution.md).
* The output includes a `policy name` column to indicate the [masking policy](../../user-guide/security-column-intro.md) set directly on the
  column. If the column is protected by a [tag-based masking policy](../../user-guide/tag-based-masking-policies.md), Snowflake returns
  `NULL`.

  If a masking policy is not set directly on the column or if the Snowflake account is not Enterprise Edition or higher, Snowflake returns
  `NULL`.
* The output includes a `privacy domain` column to indicate the [privacy domain](../../user-guide/diff-privacy/differential-privacy-privacy-domains.md)
  set on the column.

  If a privacy domain is not set on the column or if the Snowflake account is not Enterprise Edition or higher, Snowflake returns
  `NULL`.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

When `TYPE = COLUMNS`, the command output provides the following properties and metadata:

| Column | Description |
| --- | --- |
| `name` | Name of the column in the table. |
| `type` | Data type of the column in the table. If [collation](../collation.md) has been specified for the column, the collation specification is included. |
| `kind` | This value is always `COLUMN` for Snowflake tables. |
| `null?` | Whether the column accepts NULL values (`Y` or `N`). |
| `default` | The default value for the column, if any (otherwise `NULL`). |
| `primary key` | Whether the column is the primary key (or part of a multi-column primary key; `Y` or `N`). |
| `unique key` | Whether the column has a UNIQUE constraint (`Y` or `N`). |
| `check` | Reserved for future use. |
| `expression` | Reserved for future use. |
| `comment` | The comment set for the column, if any (otherwise `NULL`). |
| `policy name` | The [masking policy](create-masking-policy.md) set for the column, if any (otherwise `NULL`). |
| `privacy domain` | The [privacy domain](../../user-guide/diff-privacy/differential-privacy-privacy-domains.md) set for the column, if any (otherwise `NULL`). |
| `schema_evolution_record` | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY, SNOWPIPE, or SNOWPIPE_STREAMING). * FileName: The file name that triggered the evolution (NULL for SNOWPIPE_STREAMING). * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeId: A unique identifier of the triggering query or pipe (QUERY ID for COPY, PIPE ID for SNOWPIPE, or NULL for SNOWPIPE_STREAMING). * Pipe name: Fully qualified pipe name that triggered schema evolution (SNOWPIPE_STREAMING only). * Channel name: Channel that triggered schema evolution (SNOWPIPE_STREAMING only). * offsetTokenUpperBound: An offset at or before which schema evolution was triggered (SNOWPIPE_STREAMING only). |

When `TYPE = STAGE`, the command output provides the current and default values for the table’s stage properties. See
Example: Describe stage properties.

## Examples

The following examples show how to describe tables.

### Example: Describe a table that has constraints and other column attributes

Create a table with five columns, two with constraints. Give one column a DEFAULT value and a
comment.

```sqlexample
CREATE OR REPLACE TABLE desc_example(
  c1 INT PRIMARY KEY,
  c2 INT,
  c3 INT UNIQUE,
  c4 VARCHAR(30) DEFAULT 'Not applicable' COMMENT 'This column is rarely populated',
  c5 VARCHAR(100));
```

Describe the columns in the table:

```sqlexample
DESCRIBE TABLE desc_example;
```

```output
+------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+-------------+----------------+-------------------------+
| name | type         | kind   | null? | default          | primary key | unique key | check | expression | comment                         | policy name | privacy domain | schema evolution record |
|------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+-------------+----------------+-------------------------|
| C1   | NUMBER(38,0) | COLUMN | N     | NULL             | Y           | N          | NULL  | NULL       | NULL                            | NULL        | NULL           | NULL                    |
| C2   | NUMBER(38,0) | COLUMN | Y     | NULL             | N           | N          | NULL  | NULL       | NULL                            | NULL        | NULL           | NULL                    |
| C3   | NUMBER(38,0) | COLUMN | Y     | NULL             | N           | Y          | NULL  | NULL       | NULL                            | NULL        | NULL           | NULL                    |
| C4   | VARCHAR(30)  | COLUMN | Y     | 'Not applicable' | N           | N          | NULL  | NULL       | This column is rarely populated | NULL        | NULL           | NULL                    |
| C5   | VARCHAR(100) | COLUMN | Y     | NULL             | N           | N          | NULL  | NULL       | NULL                            | NULL        | NULL           | NULL                    |
+------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+-------------+----------------+-------------------------+
```

### Example: Describe a table that has a masking policy on a column

Create a [normal masking policy](create-masking-policy.md), then recreate the `desc_example` table with the masking policy set on one column. (To run this example, create the `email_mask` masking policy first.)

```sqlexample
CREATE OR REPLACE TABLE desc_example(
  c1 INT PRIMARY KEY,
  c2 INT,
  c3 INT UNIQUE,
  c4 VARCHAR(30) DEFAULT 'Not applicable' COMMENT 'This column is rarely populated',
  c5 VARCHAR(100) WITH MASKING POLICY email_mask);
```

```output
+------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+---------------------------------+----------------+-------------------------+
| name | type         | kind   | null? | default          | primary key | unique key | check | expression | comment                         | policy name                     | privacy domain | schema evolution record |
|------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+---------------------------------+----------------|-------------------------|
| C1   | NUMBER(38,0) | COLUMN | N     | NULL             | Y           | N          | NULL  | NULL       | NULL                            | NULL                            | NULL           | NULL                    |
| C2   | NUMBER(38,0) | COLUMN | Y     | NULL             | N           | N          | NULL  | NULL       | NULL                            | NULL                            | NULL           | NULL                    |
| C3   | NUMBER(38,0) | COLUMN | Y     | NULL             | N           | Y          | NULL  | NULL       | NULL                            | NULL                            | NULL           | NULL                    |
| C4   | VARCHAR(30)  | COLUMN | Y     | 'Not applicable' | N           | N          | NULL  | NULL       | This column is rarely populated | NULL                            | NULL           | NULL                    |
| C5   | VARCHAR(100) | COLUMN | Y     | NULL             | N           | N          | NULL  | NULL       | NULL                            | HT_SENSORS.HT_SCHEMA.EMAIL_MASK | NULL           | NULL                    |
+------+--------------+--------+-------+------------------+-------------+------------+-------+------------+---------------------------------+---------------------------------+----------------+-------------------------+
```

### Example: Describe stage properties

Describe the current stage properties for the same table (only the first five rows are shown here):

```sqlexample
DESCRIBE TABLE desc_example TYPE = STAGE;
```

```output
+--------------------+--------------------------------+---------------+-----------------+------------------+
| parent_property    | property                       | property_type | property_value  | property_default |
|--------------------+--------------------------------+---------------+-----------------+------------------|
| STAGE_FILE_FORMAT  | TYPE                           | String        | CSV             | CSV              |
| STAGE_FILE_FORMAT  | RECORD_DELIMITER               | String        | \n              | \n               |
| STAGE_FILE_FORMAT  | FIELD_DELIMITER                | String        | ,               | ,                |
| STAGE_FILE_FORMAT  | FILE_EXTENSION                 | String        |                 |                  |
| STAGE_FILE_FORMAT  | SKIP_HEADER                    | Integer       | 0               | 0                |
...
```

---
title: DESCRIBE TASK
source: https://docs.snowflake.com/en/sql-reference/sql/desc-task.md
section: SQL Commands
---

# DESCRIBE TASK

Shows information about a task.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP TASK](drop-task.md) , [ALTER TASK](alter-task.md) , [CREATE TASK](create-task.md) , [SHOW TASKS](show-tasks.md)

## Syntax

```sqlsyntax
DESC[RIBE] TASK <name>
```

## Parameters

`name`
:   Specifies the identifier for the task to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Output

The command provides the same output as [SHOW_TASKS](show-tasks.md).

## Usage notes

* Only returns rows for a task owner—that is, the role with the OWNERSHIP privilege on a task—or a role with either the MONITOR
  or OPERATE privilege on a task.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Create an example task:

> ```sqlexample
> CREATE TASK mytask ( ... );
> ```

Show information about the task:

> ```sqlexample
> DESC TASK mytask;
> ```

For output examples, see [SHOW_TASKS](show-tasks.md).

---
title: DESCRIBE TRANSACTION
source: https://docs.snowflake.com/en/sql-reference/sql/desc-transaction.md
section: SQL Commands
---

# DESCRIBE TRANSACTION

Describes the [transaction](../transactions.md), including the start time and the state (running, committed, rolled
back).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CURRENT_TRANSACTION](../functions/current_transaction.md) , [LAST_TRANSACTION](../functions/last_transaction.md) , [BEGIN](begin.md) ,
    [COMMIT](commit.md) , [ROLLBACK](rollback.md) , [SHOW TRANSACTIONS](show-transactions.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } TRANSACTION <transaction_id>
```

## Parameters

`transaction_id`
:   Specifies the identifier of the transaction to describe.

    `transaction_id` must be a literal, not a session variable.

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `id` | Unique identifier of the transaction. |
| `user` | The user ID of the user who ran the transaction. |
| `session name` | The ID of the user session in which the transaction was executed. |
| `started_on` | Date and time that the transaction was created. |
| `state` | The transaction’s completion status, e.g. committed, rolled back, or still running. |
| `ended_on` | Date and time that the transaction finished. |

## Examples

```sqlexample
DESC TRANSACTION 1651535571261000000;
```

---
title: DESCRIBE TYPE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-type.md
section: SQL Commands
---

# DESCRIBE TYPE

Describes a [user-defined type](../data-types-user-defined.md).

DESCRIBE can be abbreviated to DESC.

See also:
:   [CREATE TYPE](create-type.md) , [ALTER TYPE](alter-type.md) , [SHOW TYPES](show-types.md) , [DROP TYPE](drop-type.md) , [UNDROP TYPE](undrop-type.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } TYPE <name>
```

## Parameters

`name`
:   Specifies the identifier for the user-defined type to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides user-defined type properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the user-defined type. |
| `type` | Snowflake type definition that is the base type for the user-defined type. |
| `created_on` | Date and time when the user-defined type was created. |
| `database_name` | Database in which the user-defined type is stored. |
| `schema_name` | Schema in which the user-defined type is stored. |
| `owner` | Name of the role that owns the user-defined type, which is the role that has the OWNERSHIP privilege on the user-defined type. |
| `comment` | The comment set for the type, if any; otherwise, it is `NULL`. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | User-defined type |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Use the DESCRIBE TYPE command to list details about the `age` user-defined type:

```sqlexample
DESC TYPE age;
```

```output
+------+-------------+-------------------------------+---------------+-------------+--------------+---------+
| name | type        | created_on                    | database_name | schema_name | owner        | comment |
|------+-------------+-------------------------------+---------------+-------------+--------------+---------|
| AGE  | NUMBER(3,0) | 2025-11-06 09:23:59.882 -0800 | MY_DB         | MY_SCHEMA   | MY_ROLE      | NULL    |
+------+-------------+-------------------------------+---------------+-------------+--------------+---------+
```

---
title: DESCRIBE USER
source: https://docs.snowflake.com/en/sql-reference/sql/desc-user.md
section: SQL Commands
---

# DESCRIBE USER

Describes a [user](../../user-guide/admin-user-management.md), including the current and default values of the properties of the
user.

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP USER](drop-user.md) , [ALTER USER](alter-user.md) , [CREATE USER](create-user.md) , [SHOW USERS](show-users.md)

## Syntax

```sqlsyntax
{ DESC | DESCRIBE } USER <name>
```

## Parameters

`name`
:   Specifies the identifier for the user to describe.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `property` | The name of the property (see Properties of users). |
| `property_type` | The data type of the property (for example, `Boolean` or `String`). |
| `property_value` | The value assigned to the property. |
| `property_default` | The default value of the property. |

The `property` column can include the following properties of the user:

Properties of users

| Property | Description |
| --- | --- |
| `NAME` | Name of the user. |
| `COMMENT` | Comment about the user. |
| `DISPLAY_NAME` | Name displayed for the user in [Snowsight](../../user-guide/ui-snowsight-gs.md). |
| `TYPE` | Type of the user. For a list of possible values, see [Types of users](../../user-guide/admin-user-management.md). |
| `LOGIN_NAME` | Name that the user enters to log into the system. |
| `FIRST_NAME` | First name of the user. |
| `MIDDLE_NAME` | Middle name of the user. |
| `LAST_NAME` | Last name of the user. |
| `EMAIL` | Email address for the user. |
| `PASSWORD` | Obfuscated password of the user. |
| `MUST_CHANGE_PASSWORD` | If TRUE, the user is forced to change their password on next login (including their first/initial login) into the system. |
| `DISABLED` | If TRUE, the user is [locked out of Snowflake and cannot log back in](../../user-guide/admin-user-management.md). |
| `SNOWFLAKE_LOCK` | If TRUE, the user is locked by Snowflake. When a user is locked, they are unable to log in until the lock is removed. |
| `SNOWFLAKE_SUPPORT` | If TRUE, Snowflake Support is allowed to use the user or account. |
| `DAYS_TO_EXPIRY` | Number of days after which the user status is set to “Expired” and the user is no longer allowed to log in. |
| `MINS_TO_UNLOCK` | Number of minutes until [the temporary lock on the user login is cleared](../../user-guide/admin-user-management.md). |
| `DEFAULT_WAREHOUSE` | Virtual warehouse that is active by default for the user’s session upon logging in. |
| `DEFAULT_NAMESPACE` | Namespace (database only or database and schema) that is active by default for the user’s session upon logging in. |
| `DEFAULT_ROLE` | Primary role that is active by default for the user’s session upon logging in. |
| `DEFAULT_SECONDARY_ROLES` | Set of secondary roles that are active for the user’s session upon logging in. |
| `EXT_AUTHN_DUO` | If TRUE, [Duo](../../user-guide/security-mfa-duo.md) is enabled for the user, which requires the user to use [MFA (multi-factor authentication)](../../user-guide/security-mfa.md) when logging in. |
| `EXT_AUTHN_UID` | Authorization ID used for Duo. |
| `DEFAULT_MFA_METHOD` | [Default MFA method](../../user-guide/security-mfa-second-factor.md) for the user. |
| `HAS_MFA` | If TRUE, the user is enrolled in [multi-factor authentication (MFA)](../../user-guide/security-mfa.md). |
| `HAS_PAT` | If TRUE, the user has one or more [programmatic access tokens](../../user-guide/programmatic-access-tokens.md). |
| `HAS_WORKLOAD_IDENTITY` | If TRUE, the user is configured to authenticate with [workload identity federation](../../user-guide/workload-identity-federation.md). |
| `MINS_TO_BYPASS_MFA` | Number of minutes to [temporarily bypass MFA requirement for the user](../../user-guide/security-mfa.md). |
| `MINS_TO_BYPASS_NETWORK_POLICY` | Number of minutes to [temporarily bypass the requirement of having a network policy for programmatic access tokens](../../user-guide/programmatic-access-tokens.md). |
| `RSA_PUBLIC_KEY` | RSA public key of the user for [key-pair authentication](../../user-guide/key-pair-auth.md). |
| `RSA_PUBLIC_KEY_FP` | Fingerprint of the user’s RSA public key. |
| `RSA_PUBLIC_KEY_LAST_SET_TIME` | Date and time when the RSA public key was last set for the user. |
| `RSA_PUBLIC_KEY_2` | Second RSA public key of the user for use during [key-pair rotation](../../user-guide/key-pair-auth.md). |
| `RSA_PUBLIC_KEY_2_FP` | Fingerprint of the user’s second RSA public key. |
| `RSA_PUBLIC_KEY_2_LAST_SET_TIME` | Date and time when the second RSA public key was last set for the user. |
| `PASSWORD_LAST_SET_TIME` | Date and time when the last non-NULL password was set for the user. If no password was set, the value of this property is NULL. |
| `CUSTOM_LANDING_PAGE_URL` | Reserved for future use. |
| `CUSTOM_LANDING_PAGE_URL_FLUSH_NEXT_UI_LOAD` | Reserved for future use. |
| `IS_FROM_ORGANIZATION_USER` | If TRUE, the user was imported from a global [organization user](../../user-guide/organization-users.md). |

## Access control requirements

Individual users can see their own properties by executing this command and specifying their own `name`.

To view the properties of another user, you must use a role that has the following privilege:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | User |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The user object property `MINS_TO_BYPASS_NETWORK_POLICY` defines the number of minutes in which a user can access Snowflake
  without conforming to an existing [network policy](../../user-guide/network-policies.md). The number of minutes can only be set by
  Snowflake (Default: `NULL`) and is intended as a temporary workaround to allow user access to Snowflake. To set a value for
  this property, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* This command does not show the session parameter defaults for a user. Instead, use [SHOW PARAMETERS IN USER](show-parameters.md).
* The user object property `PASSWORD_LAST_SET_TIME` defaults to `Null` if no password has been set yet. Values of
  `292278994-08-17 07:12:55.807` or `1969-12-31 23:59:59.999` indicate the password was set before the inclusion of this row.
  A value of `1969-12-31 23:59:59.999` can also indicate an expired password and the user needs to change their password.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example describes the user named `my_user`:

```sqlexample
DESCRIBE USER my_user;
```

```output
+--------------------------------------------+-------------------------+---------+--------------------------------------------------------------------------------------------------------------------------------------------+
| property                                   | value                   | default | description                                                                                                                                |
|--------------------------------------------+-------------------------+---------+--------------------------------------------------------------------------------------------------------------------------------------------|
| NAME                                       | JSMITH                  | null    | Name                                                                                                                                       |
| COMMENT                                    | null                    | null    | user comment associated to an object in the dictionary                                                                                     |
| DISPLAY_NAME                               | Jane Smith              | null    | Display name of the associated object                                                                                                      |
| TYPE                                       | PERSON                  | null    | Type of the account, application package, data exchange, data exchange listing, replication group, secret, network rule, or user.          |
| LOGIN_NAME                                 | JSMITH                  | null    | Login name of the user                                                                                                                     |
| FIRST_NAME                                 | Jane                    | null    | First name of the user                                                                                                                     |
| MIDDLE_NAME                                | null                    | null    | Middle name of the user                                                                                                                    |
| LAST_NAME                                  | Smith                   | null    | Last name of the user                                                                                                                      |
| EMAIL                                      | jane.smith@example.com  | null    | Email address of the user                                                                                                                  |
| PASSWORD                                   | ********                | null    | Password of the user                                                                                                                       |
| MUST_CHANGE_PASSWORD                       | false                   | false   | User must change the password                                                                                                              |
| DISABLED                                   | false                   | false   | Whether the entity is disabled                                                                                                             |
| SNOWFLAKE_LOCK                             | false                   | false   | Whether the user, account, or organization is locked by Snowflake                                                                          |
| SNOWFLAKE_SUPPORT                          | false                   | false   | Snowflake Support is allowed to use the user or account                                                                                    |
| DAYS_TO_EXPIRY                             | null                    | null    | User record will be treated as expired after specified number of days                                                                      |
| MINS_TO_UNLOCK                             | null                    | null    | Temporary lock on the user will be removed after specified number of minutes                                                               |
| DEFAULT_WAREHOUSE                          | MY_WAREHOUSE            | null    | Default warehouse for this user                                                                                                            |
| DEFAULT_NAMESPACE                          | MY_DB.MY_SCHEMA         | null    | Default database namespace prefix for this user                                                                                            |
| DEFAULT_ROLE                               | MY_ROLE                 | null    | Primary principal of user session will be set to this role                                                                                 |
| DEFAULT_SECONDARY_ROLES                    | []                      | [ALL]   | The secondary roles will be set to all roles provided here.                                                                                |
| EXT_AUTHN_DUO                              | false                   | false   | Whether Duo Security is enabled as second factor authentication                                                                            |
| EXT_AUTHN_UID                              | null                    | null    | External authentication ID of the user                                                                                                     |
| DEFAULT_MFA_METHOD                         | null                    | null    | Default MFA method for the user                                                                                                            |
| HAS_MFA                                    | true                    | false   | Whether the user is enrolled in multi-factor authentication                                                                                |
| HAS_PAT                                    | true                    | false   | Whether the user has a programmatic access token                                                                                           |
| HAS_FEDERATED_WORKLOAD_AUTHENTICATION      | false                   | false   | Reserved for future use                                                                                                                    |
| MINS_TO_BYPASS_MFA                         | null                    | null    | Temporary bypass MFA for the user for a specified number of minutes                                                                        |
| MINS_TO_BYPASS_NETWORK_POLICY              | null                    | null    | Temporary bypass network policy on the user for a specified number of minutes                                                              |
| RSA_PUBLIC_KEY                             | ...                     | null    | RSA public key of the user                                                                                                                 |
| RSA_PUBLIC_KEY_FP                          | SHA256:...=             | null    | Fingerprint of user's RSA public key.                                                                                                      |
| RSA_PUBLIC_KEY_LAST_SET_TIME               | null                    | null    | The timestamp at which the RSA public key was last set for the user. Defaults to null if no RSA public key has been set yet.               |
| RSA_PUBLIC_KEY_2                           | ...                     | null    | Second RSA public key of the user                                                                                                          |
| RSA_PUBLIC_KEY_2_FP                        | SHA256:...=             | null    | Fingerprint of user's second RSA public key.                                                                                               |
| RSA_PUBLIC_KEY_2_LAST_SET_TIME             | null                    | null    | The timestamp at which the second RSA public key was last set for the user. Defaults to null if no second RSA public key has been set yet. |
| PASSWORD_LAST_SET_TIME                     | 2020-10-08 01:33:13.43  | null    | The timestamp on which the last non-null password was set for the user. Default to null if no password has been set yet.                   |
| CUSTOM_LANDING_PAGE_URL                    | null                    | null    | Reserved for future use                                                                                                                    |
| CUSTOM_LANDING_PAGE_URL_FLUSH_NEXT_UI_LOAD | false                   | false   | Reserved for future use                                                                                                                    |
+--------------------------------------------+-------------------------+---------+--------------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: DESCRIBE VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/desc-view.md
section: SQL Commands
---

# DESCRIBE VIEW

Describes the columns in a view (or table).

DESCRIBE can be abbreviated to DESC.

See also:
:   [DROP VIEW](drop-view.md) , [ALTER VIEW](alter-view.md) , [CREATE VIEW](create-view.md) , [SHOW VIEWS](show-views.md)

    [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
DESC[RIBE] VIEW <name>
```

## Parameters

`name`
:   Specifies the identifier for the view to describe. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* The command output does not include the view definition. Instead, use [SHOW VIEWS](show-views.md).
* DESC VIEW and [DESCRIBE TABLE](desc-table.md) are interchangeable. Either command retrieves the details for the table or view that matches the criteria
  in the statement.
* The output returns a `POLICY NAME` column to indicate the [masking policy](../../user-guide/security-column-intro.md) set on the column.

  If a masking policy is not set on the column or if the Snowflake account is not Enterprise Edition or higher, Snowflake returns
  `NULL`.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Example setup:

> ```sqlexample
> CREATE VIEW emp_view AS SELECT id "Employee Number", lname "Last Name", location "Home Base" FROM emp;
> ```

Describe the view:

> ```sqlexample
> DESC VIEW emp_view;
>
> +-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+-----------------+
> | name            | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name |  privacy domain |
> |-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+-----------------+
> | Employee Number | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL            |
> | Last Name       | VARCHAR(50)  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL            |
> | Home Base       | VARCHAR(100) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL            |
> +-----------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+-----------------+
> ```

---
title: DESCRIBE WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/desc-warehouse.md
section: SQL Commands
---

# DESCRIBE WAREHOUSE

Describes a [virtual warehouse](../../user-guide/warehouses-overview.md). For example, shows the date that the warehouse was created.

You can abbreviate DESCRIBE to DESC.

See also:
:   [ALTER WAREHOUSE](alter-warehouse.md) , [CREATE WAREHOUSE](create-warehouse.md), [DROP WAREHOUSE](drop-warehouse.md) , [SHOW WAREHOUSES](show-warehouses.md)

## Syntax

```sqlsyntax
DESC[RIBE] WAREHOUSE <name>
```

## Parameters

`name`
:   Specifies the [identifier](../identifiers.md) of the warehouse to describe.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | Warehouse |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

This demonstrates the DESCRIBE WAREHOUSE command:

```sqlexample
CREATE WAREHOUSE temporary_warehouse WAREHOUSE_SIZE=XSMALL;
```

```sqlexample
DESCRIBE WAREHOUSE temporary_warehouse;
```

```output
+-------------------------------+---------------------+-----------+
| created_on                    | name                | kind      |
|-------------------------------+---------------------+-----------|
| 2022-06-23 00:00:00.000 -0700 | TEMPORARY_WAREHOUSE | WAREHOUSE |
+-------------------------------+---------------------+-----------+
```

---
title: DROP <object>
source: https://docs.snowflake.com/en/sql-reference/sql/drop.md
section: SQL Commands
---

# DROP *<object>*

Removes the specified object from the system.

See also:
:   [CREATE <object>](create.md) , [SHOW <objects>](show.md) , [UNDROP <object>](undrop.md)

## DROP commands

For specific syntax, usage notes, and examples, see:

**Organization Objects:**

* [DROP ACCOUNT](drop-account.md)

**Account Objects:**

* [DROP APPLICATION](drop-application.md)
* [DROP APPLICATION PACKAGE](drop-application-package.md)
* [DROP AUTHENTICATION POLICY](drop-authentication-policy.md)
* [DROP CATALOG INTEGRATION](drop-catalog-integration.md)
* [DROP COMPUTE POOL](drop-compute-pool.md)
* [DROP CONNECTION](drop-connection.md)
* [DROP DATABASE](drop-database.md)
* [DROP DATABASE ROLE](drop-database-role.md)
* [DROP EXTERNAL VOLUME](drop-external-volume.md)
* [DROP FAILOVER GROUP](drop-failover-group.md)
* [DROP FEATURE POLICY](drop-feature-policy.md)
* [DROP INTEGRATION](drop-integration.md)
* [DROP NETWORK POLICY](drop-network-policy.md)
* [DROP ORGANIZATION PROFILE](drop-organization-profile.md)
* [DROP POSTGRES INSTANCE](drop-postgres-instance.md)
* [DROP REPLICATION GROUP](drop-replication-group.md)
* [DROP RESOURCE MONITOR](drop-resource-monitor.md)
* [DROP ROLE](drop-role.md)
* [DROP SHARE](drop-share.md)
* [DROP USER](drop-user.md)
* [DROP WAREHOUSE](drop-warehouse.md)

**Database Objects:**

* [DROP AGENT](drop-agent.md)
* [DROP AGGREGATION POLICY](drop-aggregation-policy.md)
* [DROP ALERT](drop-alert.md)
* [DROP AUTHENTICATION POLICY](drop-authentication-policy.md)
* [DROP BACKUP POLICY](drop-backup-policy.md)
* [DROP BACKUP SET](drop-backup-set.md)
* [DROP CONTACT](drop-contact.md)
* [DROP CORTEX SEARCH SERVICE](drop-cortex-search.md)
* [DROP DBT PROJECT](drop-dbt-project.md)
* [DROP DCM PROJECT](drop-dcm-project.md)
* [DROP DYNAMIC TABLE](drop-dynamic-table.md)
* [DROP EXPERIMENT](drop-experiment.md)
* [DROP EXTERNAL TABLE](drop-external-table.md)
* [DROP FILE FORMAT](drop-file-format.md)
* [DROP FUNCTION](drop-function.md)
* [DROP GATEWAY](drop-gateway.md)
* [DROP GIT REPOSITORY](drop-git-repository.md)
* [DROP ICEBERG TABLE](drop-iceberg-table.md)
* [DROP IMAGE REPOSITORY](drop-image-repository.md)
* [DROP INDEX](drop-index.md)
* [DROP JOIN POLICY](drop-join-policy.md)
* [DROP LISTING](drop-listing.md)
* [DROP MAINTENANCE POLICY](drop-maintenance-policy.md)
* [DROP MASKING POLICY](drop-masking-policy.md)
* [DROP MATERIALIZED VIEW](drop-materialized-view.md)
* [DROP MCP SERVER](drop-mcp-server.md)
* [DROP MODEL](drop-model.md)
* [DROP MODEL MONITOR](drop-model-monitor.md)
* [DROP NETWORK RULE](drop-network-rule.md)
* [DROP NOTEBOOK](drop-notebook.md)
* [DROP ONLINE FEATURE TABLE](drop-online-feature-table.md)
* [DROP PACKAGES POLICY](drop-packages-policy.md)
* [DROP PASSWORD POLICY](drop-password-policy.md)
* [DROP PIPE](drop-pipe.md)
* [DROP PRIVACY POLICY](drop-privacy-policy.md)
* [DROP PROCEDURE](drop-procedure.md)
* [DROP PROJECTION POLICY](drop-projection-policy.md)
* [DROP ROW ACCESS POLICY](drop-row-access-policy.md)
* [DROP SCHEMA](drop-schema.md)
* [DROP SECRET](drop-secret.md)
* [DROP SEMANTIC VIEW](drop-semantic-view.md)
* [DROP SEQUENCE](drop-sequence.md)
* [DROP SERVICE](drop-service.md)
* [DROP SESSION POLICY](drop-session-policy.md)
* [DROP SNAPSHOT](drop-snapshot.md)
* [DROP SNAPSHOT POLICY — Deprecated](drop-snapshot-policy.md) (deprecated; prefer [DROP BACKUP POLICY](drop-backup-policy.md))
* [DROP SNAPSHOT SET — Deprecated](drop-snapshot-set.md) (deprecated; prefer [DROP BACKUP SET](drop-backup-set.md))
* [DROP STAGE](drop-stage.md)
* [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md)
* [DROP STREAM](drop-stream.md)
* [DROP STREAMLIT](drop-streamlit.md)
* [DROP TABLE](drop-table.md)
* [DROP TAG](drop-tag.md)
* [DROP TASK](drop-task.md)
* [DROP TYPE](drop-type.md)
* [DROP VIEW](drop-view.md)

**Classes:**

* [DROP SNOWFLAKE.ML.ANOMALY_DETECTION](../classes/anomaly-detection/commands/drop-anomaly-detection.md)
* [DROP BUDGET](../classes/budget/commands/drop-budget.md)
* [DROP CLASSIFICATION_PROFILE](../classes/classification_profile/commands/drop-classification-profile.md)
* [DROP CUSTOM_CLASSIFIER](../classes/custom_classifier/commands/drop-custom-classifier.md)
* [DROP SNOWFLAKE.ML.FORECAST](../classes/forecast/commands/drop-forecast.md)

---
title: DROP ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-account.md
section: SQL Commands
---

# DROP ACCOUNT

Drops an account, which initiates the process of [deleting the account](../../user-guide/organizations-manage-accounts-delete.md).

See also:
:   [CREATE ACCOUNT](create-account.md), [SHOW ACCOUNTS](show-accounts.md), [UNDROP ACCOUNT](undrop-account.md)

## Syntax

```sqlsyntax
DROP ACCOUNT [ IF EXISTS ] <name> GRACE_PERIOD_IN_DAYS = <integer>
```

## Parameters

`name`
:   Specifies the name of the account being dropped. As an example, if the full account identifier is `myorg-account123`, then specify
    `account123` as the name. If you do not know the account name, execute the [SHOW ACCOUNTS](show-accounts.md)
    command, and find the name in the `account_name` column.

    The legacy account locator cannot be used to identify the account.

`GRACE_PERIOD_IN_DAYS = integer`
:   Specifies the number of days during which the account can be restored (“undropped”). The minimum is 3 days and the maximum is 90 days.

## Usage notes

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute the command.
* The organization administrator cannot drop the account they are currently logged in to.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

> **Important:**
>
> If the account contains a snapshot set that has an associated snapshot policy with a retention lock, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the account containing the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Example

To drop an account `my_account` and allow a 14-day grace period for restoring the account, enter:

> ```sqlexample
> DROP ACCOUNT my_account GRACE_PERIOD_IN_DAYS = 14;
> ```

---
title: DROP AGENT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-agent.md
section: SQL Commands
---

# DROP AGENT

Removes the specified [Cortex Agent](../../user-guide/snowflake-cortex/cortex-agents.md) with the specified name from the current or specified database and schema.

See also:
:   [ALTER AGENT](alter-agent.md), [CREATE AGENT](create-agent.md), [DESCRIBE AGENT](desc-agent.md), [SHOW AGENTS](show-agents.md), [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](../functions/data_agent_run-snowflake-cortex.md)

## Syntax

```sqlsyntax
DROP AGENT [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the Cortex Agent to be dropped.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP or MODIFY | Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the agent named `my_agent` in the current schema:

```sqlexample
DROP AGENT my_agent;
```

The following example drops the agent named `my_agent` in the `mydb` database and `myschema` schema. This command fails if the agent does not exist:

```sqlexample
DROP AGENT mydb.myschema.my_agent;
```

The following example drops the agent named `my_agent` in the `mydb` database and `myschema` schema only if it exists:

```sqlexample
DROP AGENT IF EXISTS mydb.myschema.my_agent;
```

---
title: DROP AGGREGATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-aggregation-policy.md
section: SQL Commands
---

# DROP AGGREGATION POLICY

Removes an [aggregation policy](../../user-guide/aggregation-policies.md) from the current/specified schema.

See also:
:   [Aggregation policy DDL reference](../../user-guide/aggregation-policies.md)

## Syntax

```sqlsyntax
DROP AGGREGATION POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the aggregation policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Aggregation policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on aggregation policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* Prior to dropping the aggregation policy, execute the following statement to determine if the aggregation policy is set on any tables or
  views.

  ```sqlexample
  SELECT * FROM TABLE(mydb.INFORMATION_SCHEMA.POLICY_REFERENCES(POLICY_NAME=>'my_agg_policy'));
  ```

  For more information, see [Identify aggregation policy references](../../user-guide/aggregation-policies.md).
* An aggregation policy cannot be dropped successfully if it is currently assigned to a table or view.

  Before executing a DROP statement, [detach the aggregation policy](../../user-guide/aggregation-policies.md) from the table or view with an
  ALTER TABLE or ALTER VIEW statement.

## Example

Drop the aggregation policy:

```sqlexample
DROP AGGREGATION POLICY my_aggpolicy;
```

---
title: DROP ALERT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-alert.md
section: SQL Commands
---

# DROP ALERT

Drops an existing [alert](../../user-guide/alerts.md).

See also:
:   [CREATE ALERT](create-alert.md) , [ALTER ALERT](alter-alert.md), [DESCRIBE ALERT](desc-alert.md) , [SHOW ALERTS](show-alerts.md) , [EXECUTE ALERT](execute-alert.md)

## Syntax

```sqlsyntax
DROP ALERT [ IF EXISTS ] <name>
```

## Required parameters

`name`
:   Identifier for the alert to drop. If the identifier contains spaces or special characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Alert | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When an alert is dropped, any current evaluation of the condition of the alert (i.e. a run with an EXECUTING state in the
  [ALERT_HISTORY](../functions/alert_history.md) output) is completed.
* An alert can be dropped by the alert owner (i.e. the role that has the OWNERSHIP privilege on the alert) or a higher role
  without first suspending the alert.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

See [Dropping an alert](../../user-guide/alerts.md).

---
title: DROP APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/drop-application.md
section: SQL Commands
---

# DROP APPLICATION

Removes an application from the system in the Native Apps Framework.

See also:
:   [ALTER APPLICATION](alter-application.md), [CREATE APPLICATION](create-application.md),
    [SHOW APPLICATIONS](show-applications.md)

## Syntax

```sqlsyntax
DROP APPLICATION [ IF EXISTS ] <name> [ CASCADE ]
```

## Required parameters

`name`
:   Specifies the identifier for the application object to drop. If the identifier contains spaces,
    special characters, or mixed-case characters, the entire string must be enclosed in double
    quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Optional parameters

`CASCADE`
:   Drops the application object and all objects owned by the app, including tables with primary or unique
    keys that are referenced by foreign keys in other tables.

    If `CASCADE` is not specified, this command returns an error if the app owns
    objects outside of itself.

    If `CASCADE` is specified all objects owned by the app are dropped, even if those
    objects contain other objects owned by the consumer. For example, if the consumer transfers
    ownership of a schema or table to an account role, but leaves the parent database owned
    by the app, running this command with `CASCADE` also drops those objects.

    To retain objects owned by the application, use the [GRANT OWNERSHIP](grant-ownership.md)
    command to transfer ownership of those objects, then run this command without `CASCADE`.

## Usage notes

* This command can be run by the app owner or a user with the MANAGE GRANTS privilege on the
  app.
* All app roles are dropped when the application object is dropped. Any access granted
  by those roles on objects in the consumer account are lost.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

```sqlexample
DROP APPLICATION hello_snowflake_app;
```

```output
+-------------------------------------------+
| status                                    |
|-------------------------------------------|
| hello_snowflake_app successfully dropped. |
+-------------------------------------------+
```

---
title: DROP APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-application-package.md
section: SQL Commands
---

# DROP APPLICATION PACKAGE

Removes an application package from the system in the Native Apps Framework.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md), [CREATE APPLICATION PACKAGE](create-application-package.md), [SHOW APPLICATION PACKAGES](show-application-packages.md),

## Syntax

```sqlsyntax
DROP APPLICATION PACKAGE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the application package to drop. If the identifier contains spaces, special characters, or mixed-case characters, the entire string must be enclosed in double quotes. Identifiers
    enclosed in double quotes are also case-sensitive.

## Usage notes

* An application package can only be dropped if it is not currently associated with a listing.
* After you run this command, the application package is dropped and becomes unavailable within the
  provider account.

  Any application created from the application package remains visible to the consumer, but is otherwise inaccessible.
  Any attempt to access the application results in an error indicating the application package has been removed.
* A consumer must explicitly run [DROP APPLICATION](drop-application.md) to ensure that objects owned by the application
  have been appropriately transferred to other roles or removed.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

```sqlexample
DROP APPLICATION PACKAGE hello_snowflake_app;
```

```output
+-------------------------------------------+
| status                                    |
|-------------------------------------------|
| hello_snowflake_app successfully dropped. |
+-------------------------------------------+
```

---
title: DROP APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-application-role.md
section: SQL Commands
---

# DROP APPLICATION ROLE

Removes the specified application role from the system.

See also:
:   [CREATE APPLICATION ROLE](create-application-role.md) , [ALTER APPLICATION ROLE](alter-application-role.md) , [SHOW APPLICATION ROLES](show-application-roles.md)

## Syntax

```sqlsyntax
DROP APPLICATION ROLE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the application role to drop. If the identifier contains spaces or
    special characters, the entire string must be enclosed in double quotes. Identifiers enclosed
    in double quotes are also case-sensitive.

## Usage notes

* This command can only be run within the context of an application created using the Native
  App Framework.
* Dropped application roles cannot be recovered; they must be recreated within the application.
* Application roles are not versioned. When dropping an application role from a setup script,
  you must ensure that no running version of the application relies upon the role being
  dropped. Snowflake recommends to either avoid dropping application roles that may be in use or to
  wait until the version that depends on the role being dropped has itself also been
  dropped.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP APPLICATION ROLE APP_ROLE;
> ```

---
title: DROP AUTHENTICATION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-authentication-policy.md
section: SQL Commands
---

# DROP AUTHENTICATION POLICY

Removes an [authentication policy](../../user-guide/authentication-policies.md) from the system.

See also:
:   [CREATE AUTHENTICATION POLICY](create-authentication-policy.md), [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md), [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md), [SHOW AUTHENTICATION POLICIES](show-authentication-policies.md)

## Syntax

```sqlsyntax
DROP AUTHENTICATION POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the authentication policy to drop. If the identifier contains spaces or special characters, you must enclose
    the string in double quotation marks. Identifiers enclosed in double quotation marks are case-sensitive. The identifier must meet the
    [identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Authentication policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You cannot recover dropped authentication policies. You must recreate them.
* You cannot drop an authentication policy if it is set on an account or user.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop an authentication policy named `my_auth_policy`:

```sqlexample
DROP AUTHENTICATION POLICY my_auth_policy;
```

---
title: DROP BACKUP POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-backup-policy.md
section: SQL Commands
---

# DROP BACKUP POLICY

Deletes a [backup](../../user-guide/backups.md) policy.

See also:
:   [CREATE BACKUP POLICY](create-backup-policy.md),
    [ALTER BACKUP POLICY](alter-backup-policy.md),
    [SHOW BACKUP POLICIES](show-backup-policies.md)

## Syntax

```sqlsyntax
DROP BACKUP POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the backup policy.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Backup policy | The role used to delete a backup policy must have the OWNERSHIP privilege on the policy. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

A backup policy can’t be deleted if it is attached to any backup set.

## Examples

Delete the backup policy `hourly_backup_policy`:

```sqlexample
DROP BACKUP POLICY hourly_backup_policy;
```

---
title: DROP BACKUP SET
source: https://docs.snowflake.com/en/sql-reference/sql/drop-backup-set.md
section: SQL Commands
---

# DROP BACKUP SET

Deletes a [backup](../../user-guide/backups.md) set.

See also:
:   [CREATE BACKUP SET](create-backup-set.md),
    [ALTER BACKUP SET](alter-backup-set.md),
    [SHOW BACKUP SETS](show-backup-sets.md)

## Syntax

```sqlsyntax
DROP BACKUP SET <name>
```

## Parameters

`name`
:   Specifies the identifier for the backup set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Backup set | The role used to modify the backup policy for a backup set must have the OWNERSHIP privilege on the set. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

> **Important:**
>
> If the backup policy has a retention lock applied to it, and there are any
> unexpired backups in the backup set, then you can’t delete the backup set.
> In that case, you must wait for all the backups in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a backup policy.
>
> You also can’t drop a backup set if any of the backups it contains have a legal hold applied.

## Examples

Delete the backup set `t1_backups`:

```sqlexample
DROP BACKUP SET t1_backups;
```

---
title: DROP CATALOG INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/drop-catalog-integration.md
section: SQL Commands
---

# DROP CATALOG INTEGRATION

Removes a [catalog integration](../../user-guide/tables-iceberg.md) from the account.

See also:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md) , [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md) , [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
DROP CATALOG INTEGRATION [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the catalog integration to drop. If the identifier contains spaces, special characters,
    or mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed
    in double quotes are also case-sensitive (for example, `"My Catalog"`).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Integration (catalog) | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropped catalog integrations cannot be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* You can’t drop or replace a catalog integration if one or more Apache Iceberg™ tables
  are associated with the catalog integration.

  To view the tables that depend on a catalog integration,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `catalog_name` column.

  > **Note:**
  >
  > The column identifier (`catalog_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "catalog_name" = 'my_catalog_integration_1';
  ```

## Examples

Drop a catalog integration:

> ```sqlexample
> DROP CATALOG INTEGRATION myInt;
> ```

Drop the catalog integration again, but don’t raise an error if the integration doesn’t exist:

> ```sqlexample
> DROP CATALOG INTEGRATION IF EXISTS myInt;
> ```

---
title: DROP COMPUTE POOL
source: https://docs.snowflake.com/en/sql-reference/sql/drop-compute-pool.md
section: SQL Commands
---

# DROP COMPUTE POOL

Removes the specified [compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) from the
account.

See also:
:   [CREATE COMPUTE POOL](create-compute-pool.md) , [ALTER COMPUTE POOL](alter-compute-pool.md), [DESCRIBE COMPUTE POOL](desc-compute-pool.md) , [SHOW COMPUTE POOLS](show-compute-pools.md)

## Syntax

```sqlsyntax
DROP COMPUTE POOL [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the compute pool to be dropped.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Compute pool |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When dropping a compute pool, Snowflake automatically aborts any running jobs. However, Snowflake does not drop running services.
  If services are running this command will fail. You need to explicitly drop all running services before dropping a compute pool.
  You can run [ALTER COMPUTE POOL … STOP ALL](alter-compute-pool.md), which drops both services and jobs. You can also use
  the [DROP SERVICE](drop-service.md) command to drop individual services.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the compute pool named `tutorial_compute_pool`:

```sqlexample
DROP COMPUTE POOL tutorial_compute_pool;
```

```output
+---------------------------------------------+
| status                                      |
|---------------------------------------------|
| TUTORIAL_COMPUTE_POOL successfully dropped. |
+---------------------------------------------+
```

---
title: DROP CONNECTION
source: https://docs.snowflake.com/en/sql-reference/sql/drop-connection.md
section: SQL Commands
---

# DROP CONNECTION

Removes a connection from the account.

See also:
:   [CREATE CONNECTION](create-connection.md) , [ALTER CONNECTION](alter-connection.md) , [SHOW CONNECTIONS](show-connections.md)

## Syntax

```sqlsyntax
DROP CONNECTION [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the connection to drop.

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.
* A primary connection can’t be dropped if it has one or more secondary connections. To drop the primary connection, first promote a secondary
  connection to serve as the primary connection, and then drop the former primary connection. Alternatively, drop all of the secondary connections,
  and then drop the primary connection.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a connection:

> ```sqlexample
> SHOW CONNECTIONS LIKE 't2%';
>
>
> DROP CONNECTION t2;
>
>
> SHOW CONNECTIONS LIKE 't2%';
> ```

Drop the connection again, but don’t raise an error if the connection doesn’t exist:

> ```sqlexample
> DROP CONNECTION IF EXISTS t2;
> ```

---
title: DROP CONTACT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-contact.md
section: SQL Commands
---

# DROP CONTACT

Removes the specified [contact](../../user-guide/contacts-using.md) from the current schema.

See also:
:   [CREATE CONTACT](create-contact.md) , [ALTER CONTACT](alter-contact.md), [SHOW CONTACTS](show-contacts.md)

## Syntax

```sqlsyntax
DROP CONTACT <name>
```

## Parameters

`name`
:   Specifies the identifier of the contact to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

You must have the OWNERSHIP privilege on a contact to drop it.

## Examples

The following example drops the contact named `mycontact`:

```sqlexample
DROP CONTACT mycontact;
```

```output
+---------------------------------+
| status                          |
|---------------------------------|
| MYCONTACT successfully dropped. |
+---------------------------------+
```

---
title: DROP CORTEX SEARCH SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-cortex-search.md
section: SQL Commands
---

# DROP CORTEX SEARCH SERVICE

Removes the specified [Cortex Search service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) from the current schema.

## Syntax

```sqlsyntax
DROP CORTEX SEARCH SERVICE <name>;
```

## Parameters

`name`
:   Specifies the identifier for the Cortex Search service to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access Control Requirements

| Privilege | Object |
| --- | --- |
| OWNERSHIP | Cortex Search service you want to remove. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage Notes

## Examples

The following example drops the Cortex Search service named `mysvc`:

```sqlexample
DROP CORTEX SEARCH SERVICE mysvc;
```

```output
+------------------------------+
| status                       |
|------------------------------|
| mysvc successfully dropped.  |
+------------------------------+
```

---
title: DROP DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-database.md
section: SQL Commands
---

# DROP DATABASE

Removes a database from the system.

See also:
:   [CREATE DATABASE](create-database.md) , [ALTER DATABASE](alter-database.md) , [DESCRIBE DATABASE](desc-database.md) , [SHOW DATABASES](show-databases.md) , [UNDROP DATABASE](undrop-database.md)

## Syntax

```sqlsyntax
DROP DATABASE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier for the database to drop. If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`CASCADE | RESTRICT`
:   Specifies whether the database can be dropped if foreign keys exist that reference any tables in the database:

    * `CASCADE` drops the database and all objects in the database, including tables with primary/unique keys that are referenced by
      foreign keys in other tables.
    * `RESTRICT` returns a warning about existing foreign key references and does not drop the database.

    Default: `CASCADE`

## Usage notes

* Dropping a database does not permanently remove it from the system. A version of the dropped database is retained in
  [Time Travel](../../user-guide/data-time-travel.md) for the number of days specified by the `DATA_RETENTION_TIME_IN_DAYS` parameter
  for the database:

  > 1. Within the Time Travel retention period, a dropped database can be restored using the [UNDROP DATABASE](undrop-database.md) command.
  > 2. When the Time Travel retention period ends, the next state for the dropped database depends on whether it is permanent or transient:
  >
  >    + A permanent database moves into [Fail-safe](../../user-guide/data-failsafe.md). In Fail-safe (7 days), a dropped database can be
  >      recovered, but only by Snowflake. When the database leaves Fail-safe, it is purged.
  >    + A transient database has no Fail-safe, so it is purged when it moves out of Time Travel.
  > 3. Once a dropped database has been purged, it cannot be recovered; it must be recreated.
* Currently, when a database is dropped, the data retention period for child schemas or tables, if explicitly set to be different from the
  retention of the database, is not honored. The child schemas or tables are retained for the same period of time as the database. To honor
  the data retention period for these child objects (schemas or tables), drop them explicitly before you drop the database or
  schema.
* After dropping a database, creating a database with the same name creates a new version of the database. The dropped version of the
  previous database can still be restored using the following method:

  > 1. Rename the current version of the database to a different name.
  > 2. Use the [UNDROP DATABASE](undrop-database.md) command to restore the previous version.
* If a policy or tag is attached a table or view column, dropping the database successfully requires the policy or tag to be self-contained
  within the database and schema. For example, `database_1` contains `policy_1` and `policy_1` is only used in `database_1`.
  Otherwise, a [dangling reference](../../user-guide/database-replication-considerations.md) occurs.
* The DROP operation fails if a session policy or password policy is set on a user or the account.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

> **Important:**
>
> If the database contains a snapshot set that has an associated snapshot policy with a retention lock, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the database containing the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Database replication usage notes

* You can drop a secondary database at any time. Only the database owner (i.e. the role with the OWNERSHIP privilege on the database) can
  drop the database.
* A primary database cannot be dropped if one or more replicas of the database (i.e. secondary databases) exist. To drop the primary
  database, first promote a secondary database to serve as the primary database, and then drop the former primary database. Alternatively,
  drop all of the secondary databases for the primary database, and then drop the primary database.

  Note that only the database owner can drop the database.

## Examples

> ```sqlexample
> DROP DATABASE mytestdb2;
>
> +---------------------------------+
> | status                          |
> |---------------------------------|
> | MYTESTDB2 successfully dropped. |
> +---------------------------------+
>
> SHOW DATABASES LIKE 'mytestdb2';
>
> +------------+------+------------+------------+--------+-------+---------+---------+----------------+
> | created_on | name | is_default | is_current | origin | owner | comment | options | retention_time |
> |------------+------+------------+------------+--------+-------+---------+---------+----------------|
> +------------+------+------------+------------+--------+-------+---------+---------+----------------+
>
> SHOW DATABASES HISTORY LIKE 'mytestdb2';
>
> +---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------+
> | created_on                      | name      | is_default | is_current | origin | owner  | comment | options | retention_time | dropped_on                      |
> |---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------|
> | Wed, 25 Feb 2015 16:16:54 -0800 | MYTESTDB2 | N          | N          |        | PUBLIC |         |         |              1 | Fri, 13 May 2016 17:35:09 -0700 |
> +---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------+
> ```

---
title: DROP DATABASE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-database-role.md
section: SQL Commands
---

# DROP DATABASE ROLE

Removes the specified database role from the system.

See also:
:   [CREATE DATABASE ROLE](create-database-role.md) , [ALTER DATABASE ROLE](alter-database-role.md) , [SHOW DATABASE ROLES](show-database-roles.md)

## Syntax

```sqlsyntax
DROP DATABASE ROLE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) for the database role; must be unique in the database in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified (in the form of `db_name.database_role_name`, the command looks for the database role
    in the current database for the session.

## Usage notes

* Dropped database roles cannot be recovered; they must be recreated.
* Ownership of any objects owned by the dropped database role is transferred to the role that executes the DROP DATABASE ROLE
  command. To transfer ownership of each of these objects to a different database role, use
  [GRANT OWNERSHIP … COPY CURRENT GRANTS](grant-ownership.md).
* If a database role has a future privilege as a grantor or grantee, the database role can only be dropped by a user with a role
  that has the MANAGE GRANTS privilege.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* All current and future grants that name the database role as either the grantor or the grantee are removed when the database role is
  dropped.

  Query the [GRANTS_TO_ROLES](../account-usage/grants_to_roles.md) Account Usage view to retrieve the privilege grants
  that name a specified database role as the grantor or grantee:

  ```sqlsyntax
  SELECT *
    FROM snowflake.account_usage.grants_to_roles
    WHERE grantee_name = upper('<database_name>.<db_role_name>') OR granted_by = upper('<database_name>.<db_role_name>');
  ```

  The following example retrieves the grants where `d1.dr1` is the grantor or grantee:

  ```sqlexample
  SELECT *
    FROM snowflake.account_usage.grants_to_roles
    WHERE grantee_name = upper('d1.dr1') OR granted_by = upper('d1.dr1');
  ```

## Examples

> ```sqlexample
> DROP DATABASE ROLE d1.dr1;
> ```

---
title: DROP DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-dbt-project.md
section: SQL Commands
---

# DROP DBT PROJECT

Removes the specified [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md) from the current or specified schema.

See also:
:   [CREATE DBT PROJECT](create-dbt-project.md), [ALTER DBT PROJECT](alter-dbt-project.md), [DESCRIBE DBT PROJECT](desc-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md)

## Syntax

```sqlsyntax
DROP DBT PROJECT [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the dbt project object to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | dbt project | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the dbt project object named `my_dbt_project` from the current schema:

```sqlexample
DROP DBT PROJECT my_dbt_project;
```

```output
+--------------------------------------+
| status                               |
|--------------------------------------|
| MY_DBT_PROJECT successfully dropped. |
+--------------------------------------+
```

---
title: DROP DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-dcm-project.md
section: SQL Commands
---

# DROP DCM PROJECT

Removes the specified [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) from the current/specified schema.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md), [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md), [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
DROP DCM PROJECT [ IF EXISTS ] <name>
```

## Required parameters

`name`
:   Specifies the identifier for the DCM project to drop.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`IF EXISTS`
:   Optionally specifies to not return an error when the DCM project does not exist.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | DCM project | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* Dropping a DCM project in Snowflake doesn’t remove any objects created by executing the DCM project.

## Examples

Drop the DCM project named `my_project`:

```sqlexample
DROP DCM PROJECT my_project;
```

---
title: DROP DYNAMIC TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-dynamic-table.md
section: SQL Commands
---

# DROP DYNAMIC TABLE

Removes a [dynamic table](../../user-guide/dynamic-tables-about.md) from the current/specified schema.

See also:
:   [CREATE DYNAMIC TABLE](create-dynamic-table.md), [ALTER DYNAMIC TABLE](alter-dynamic-table.md), [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md),
    [SHOW DYNAMIC TABLES](show-dynamic-tables.md), [UNDROP DYNAMIC TABLE](undrop-dynamic-table.md)

## Syntax

```sqlsyntax
DROP DYNAMIC TABLE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the dynamic table to drop. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (e.g. `"My Object"`).

    If the table identifier is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the table in the current schema for the session.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | The dynamic table that you want to drop. |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To drop a dynamic table, you must be using a role that has OWNERSHIP privilege on that dynamic table.
* You can also drop a dynamic table using the [DROP TABLE](drop-table.md) command.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop `my_dynamic_table`:

> ```sqlexample
> DROP DYNAMIC TABLE my_dynamic_table;
> ```
>
> ```sqlexample
> DROP TABLE my_dynamic_table;
> ```

---
title: DROP EXPERIMENT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-experiment.md
section: SQL Commands
---

# DROP EXPERIMENT

Removes the specified [experiment](../../developer-guide/snowflake-ml/experiments.md) from the current/specified schema.

See also:
:   [CREATE EXPERIMENT](create-experiment.md) , [ALTER EXPERIMENT](alter-experiment.md) , [SHOW EXPERIMENTS](show-experiments.md) , [SHOW RUNS IN EXPERIMENT](show-runs-in-experiment.md) , [SHOW RUN … IN EXPERIMENT](show-run-in-experiment.md)

## Syntax

```sqlsyntax
DROP EXPERIMENT <name>;
```

## Parameters

`name`
:   Specifies the identifier for the experiment to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Experiment |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

---
title: DROP EXTERNAL TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-external-table.md
section: SQL Commands
---

# DROP EXTERNAL TABLE

Removes an external table from the current or specified schema. This is a metadata-only operation. None of the files that the
external table refers to are dropped.

See also:
:   [CREATE EXTERNAL TABLE](create-external-table.md) , [ALTER EXTERNAL TABLE](alter-external-table.md) , [SHOW EXTERNAL TABLES](show-external-tables.md) , [DESCRIBE EXTERNAL TABLE](desc-external-table.md)

## Syntax

```sqlsyntax
DROP EXTERNAL TABLE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier for the external table to drop. If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive;
    for example, `"My Object"`.

    If the external table identifier is not fully qualified, in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`, the command looks for the external table in the current schema for the session.

`CASCADE | RESTRICT`
:   Specifies whether the external table can be dropped if foreign keys exist that reference the table:

    * `CASCADE` drops the external table even if it has primary or unique keys that are referenced by foreign keys in other tables.
    * `RESTRICT` returns a warning about existing foreign key references and doesn’t drop the external table.

    Default: `CASCADE`

## Usage notes

* Unlike a standard table, dropping an external table purges it from the system. An external table can’t be recovered by using Time Travel;
  also, there is no UNDROP EXTERNAL TABLE command. A dropped external table must be recreated.
* After dropping an external table, creating an external table with the same name recreates the table. No history from the old version
  of the external table is retained.
* Before dropping an external table, verify that no views reference the table. Dropping an external table referenced by a view
  invalidates the view; that is, querying the view returns an “object does not exist” error.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop an external table:

> ```sqlexample
> SHOW EXTERNAL TABLES LIKE 't2%';
>
> +-------------------------------+------------------+---------------+-------------+-----------------------+---------+-----------------------------------------+------------------+------------------+-------+-----------+----------------------+
> | created_on                    | name             | database_name | schema_name | owner                 | comment | location                                | file_format_name | file_format_type | cloud | region    | notification_channel |
> |-------------------------------+------------------+---------------+-------------+-----------------------+---------+-----------------------------------------+------------------+------------------+-------+-----------+----------------------|
> | 2018-08-06 06:00:42.340 -0700 | T2               | MYDB          | PUBLIC      | MYROLE                |         | @MYDB.PUBLIC.MYSTAGE/                   |                  | JSON             | AWS   | us-east-1 | NULL                 |
> +-------------------------------+------------------+---------------+-------------+-----------------------+---------+-----------------------------------------+------------------+------------------+-------+-----------+----------------------+
>
> DROP EXTERNAL TABLE t2;
>
> +--------------------------+
> | status                   |
> |--------------------------|
> | T2 successfully dropped. |
> +--------------------------+
>
> SHOW EXTERNAL TABLES LIKE 't2%';
>
> +------------+------+---------------+-------------+-------+---------+----------+------------------+------------------+-------+--------+----------------------+
> | created_on | name | database_name | schema_name | owner | comment | location | file_format_name | file_format_type | cloud | region | notification_channel |
> |------------+------+---------------+-------------+-------+---------+----------+------------------+------------------+-------+--------+----------------------|
> +------------+------+---------------+-------------+-------+---------+----------+------------------+------------------+-------+--------+----------------------+
> ```

Drop the table again, but don’t raise an error if the table doesn’t exist:

> ```sqlexample
> DROP EXTERNAL TABLE IF EXISTS t2;
>
> +------------------------------------------------------------+
> | status                                                     |
> |------------------------------------------------------------|
> | Drop statement executed successfully (T2 already dropped). |
> +------------------------------------------------------------+
> ```

---
title: DROP EXTERNAL VOLUME
source: https://docs.snowflake.com/en/sql-reference/sql/drop-external-volume.md
section: SQL Commands
---

# DROP EXTERNAL VOLUME

Removes an [external volume](../../user-guide/tables-iceberg.md) from the account, but retains a version of the
external volume so that it can be recovered using [UNDROP EXTERNAL VOLUME](undrop-external-volume.md). For more information, see Usage Notes (in this topic).

See also:
:   [CREATE EXTERNAL VOLUME](create-external-volume.md) , [ALTER EXTERNAL VOLUME](alter-external-volume.md) , [SHOW EXTERNAL VOLUMES](show-external-volumes.md) , [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md)

## Syntax

```sqlsyntax
DROP EXTERNAL VOLUME [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the external volume to drop. If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | External volume | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You can’t drop or replace an external volume if one or more Iceberg tables
  are associated with the external volume.

  To view the tables that depend on an external volume,
  you can use the [SHOW ICEBERG TABLES](show-iceberg-tables.md) command and
  a query using the [pipe operator](../operators-flow.md) (`->>`) that filters on
  the `external_volume_name` column.

  > **Note:**
  >
  > The column identifier (`external_volume_name`) is case-sensitive.
  > Specify the column identifier exactly as it appears in the SHOW ICEBERG TABLES output.

  For example:

  ```sqlexample
  SHOW ICEBERG TABLES
    ->> SELECT *
          FROM $1
          WHERE "external_volume_name" = 'my_external_volume_1';
  ```
* Dropping an external volume does not permanently remove it from the system. Snowflake retains a version of the dropped external volume in
  [Time Travel](../../user-guide/data-time-travel.md). You can restore a dropped external volume by using
  the [UNDROP EXTERNAL VOLUME](undrop-external-volume.md) command.
* After a dropped external volume has been purged, it cannot be recovered; it must be recreated.
* After dropping an external volume, creating an external volume with the same name creates a new version of the external volume.
  You can restore the dropped version of the previous external volume by following these steps:

  1. Rename the current version of the external volume.
  2. Use the [UNDROP EXTERNAL VOLUME](undrop-external-volume.md) command to restore the previous version.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops an external volume named `my_external_volume`:

> ```sqlexample
> DROP EXTERNAL VOLUME my_external_volume;
> ```

---
title: DROP FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/drop-failover-group.md
section: SQL Commands
---

# DROP FAILOVER GROUP

Removes a [failover group](../../user-guide/account-replication-intro.md) from the account.

See also:
:   [CREATE FAILOVER GROUP](create-failover-group.md) , [ALTER FAILOVER GROUP](alter-failover-group.md) , [SHOW FAILOVER GROUPS](show-failover-groups.md)

## Syntax

```sqlsyntax
DROP FAILOVER GROUP [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the failover group.

## Usage notes

* Only an account administrator (user with the ACCOUNTADMIN role) or the group owner (role with the OWNERSHIP privilege on the group) can
  execute this SQL command.
* A primary failover group can only be successfully dropped if no linked secondary failover groups exist.
* A database that is included in a failover group is not dropped when the failover group is dropped.

  + If a secondary failover group is dropped, any database previously included in the group loses read-only protection and becomes writable.
  + If the secondary failover group is re-created from the same primary failover group as before, the databases in the group are
    overwritten by the databases in the primary failover group during the first refresh. These databases are read-only.
* To retrieve the set of accounts in your organization that are enabled for replication, use
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

### Executed on source account

Drop a failover group named `myfg` in the source account.

```sqlexample
DROP FAILOVER GROUP IF EXISTS myfg;
```

---
title: DROP FEATURE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-feature-policy.md
section: SQL Commands
---

# DROP FEATURE POLICY

Removes the specified [feature policy](../../developer-guide/native-apps/ui-consumer-feature-policies.md).

See also:
:   [CREATE FEATURE POLICY](create-feature-policy.md) , [ALTER FEATURE POLICY](alter-feature-policy.md), [DESCRIBE FEATURE POLICY](desc-feature-policy.md), [SHOW FEATURE POLICIES](show-feature-policies.md)

## Syntax

```sqlsyntax
DROP FEATURE POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the feature policy to drop.

If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
Identifiers enclosed in double quotes are also case-sensitive.

For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Feature policy | This privilege is required to drop a feature policy. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage Notes

* A feature policy can’t be dropped if it is currently applied to an object. Use the
  [ALTER FEATURE POLICY](alter-feature-policy.md) command to un-apply the feature policy
  from the object, then drop the feature policy.

## Examples

The following example drops the feature policy named `block_db_policy`:

```sqlexample
DROP FEATURE POLICY block_db_policy;
```

```output
+---------------------------------------+
| status                                |
|---------------------------------------|
| BLOCK_DB_POLICY successfully dropped. |
+---------------------------------------+
```

---
title: DROP FILE FORMAT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-file-format.md
section: SQL Commands
---

# DROP FILE FORMAT

Removes the specified file format from the current/specified schema.

See also:
:   [CREATE FILE FORMAT](create-file-format.md) , [ALTER FILE FORMAT](alter-file-format.md) , [SHOW FILE FORMATS](show-file-formats.md) , [DESCRIBE FILE FORMAT](desc-file-format.md)

## Syntax

```sqlsyntax
DROP FILE FORMAT [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the file format to drop. If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped file formats cannot be recovered; they must be recreated.
* Dropping a file format that is referenced in another object (e.g. named stage) does not cause errors because the object uses the
  file format defaults in place of the dropped file format.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP FILE FORMAT my_format;
>
> ---------------------------------+
>            status                |
> ---------------------------------+
> MY_FORMAT successfully dropped.  |
> ---------------------------------+
> ```

---
title: DROP FUNCTION
source: https://docs.snowflake.com/en/sql-reference/sql/drop-function.md
section: SQL Commands
---

# DROP FUNCTION

Removes the specified user-defined function (UDF) or external function from the current/specified schema.

See also:
:   [CREATE FUNCTION](create-function.md) , [ALTER FUNCTION](alter-function.md) , [SHOW FUNCTIONS](show-functions.md), [DESCRIBE FUNCTION](desc-function.md)

## Syntax

```sqlsyntax
DROP FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
```

## Parameters

`name`
:   Specifies the identifier for the UDF to drop. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`arg_data_type [ , ... ]`
:   Specifies the data type of the argument(s), if any, for the UDF. The argument types are necessary because UDFs support name
    overloading (i.e. two UDFs in the same schema can have the same name) and the argument types are used to identify the UDF you
    wish to drop.

## Usage notes

**All Languages**

* Dropped functions can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

**Java, Python, and Scala**

* For UDFs that store code in a file (such as a .jar file or .py file) in a stage, the `DROP FUNCTION` command does not remove
  the file. Different UDFs can use different functions/methods in the same file, so the file should not be removed while any UDF
  refers to it. Snowflake does not store a count of the number of references to each staged file and does not remove that staged
  file when there are no remaining references.

## Examples

This demonstrates the DROP FUNCTION command:

> ```sqlexample
> DROP FUNCTION multiply(number, number);
>
> --------------------------------+
>              status             |
> --------------------------------+
>  MULTIPLY successfully dropped. |
> --------------------------------+
> ```

---
title: DROP FUNCTION (DMF)
source: https://docs.snowflake.com/en/sql-reference/sql/drop-function-dmf.md
section: SQL Commands
---

# DROP FUNCTION (DMF)

Removes the specified data metric function (DMF) from the current or specified schema.

## Syntax

```sqlsyntax
DROP FUNCTION [ IF EXISTS ] <name>(
TABLE(  <arg_data_type> [ , ... ] ) [ , TABLE( <arg_data_type> [ , ... ] ) ]
)
```

## Parameters

`name`
:   Identifier for the DMF to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`TABLE( arg_data_type [ , ... ] ) [ , TABLE( arg_data_type [ , ... ] ) ]`
:   Specifies the data type of the column arguments for the DMF. The data types are necessary because DMFs support name overloading
    (that is, two DMFs in the same schema can have the same name), and the data types of the arguments are used to identify the DMF you want to
    drop.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Data metric function |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

Drop a custom DMF from the system:

```sqlexample
DROP FUNCTION governance.dmfs.count_positive_numbers(
  TABLE(
    NUMBER, NUMBER, NUMBER
  )
);
```

---
title: DROP FUNCTION (Snowpark Container Services)
source: https://docs.snowflake.com/en/sql-reference/sql/drop-function-spcs.md
section: SQL Commands
---

# DROP FUNCTION (Snowpark Container Services)

Removes the specified [service function](../../developer-guide/snowpark-container-services/working-with-services.md).

See also:
:   [Service functions](../../developer-guide/snowpark-container-services/working-with-services.md), [CREATE FUNCTION](create-function-spcs.md), [ALTER FUNCTION](alter-function-spcs.md), [DESC FUNCTION](desc-function-spcs.md)

## Syntax

```sqlsyntax
DROP FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] )
```

## Parameters

`name`
:   Specifies the identifier for the service function to drop. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive.

`arg_data_type [ , ... ]`
:   Specifies the data type of the argument(s), if any, for the service function. The argument types are necessary because service functions support name
    overloading (that is, two service functions in the same schema can have the same name) and the argument types are used to identify the UDF you
    wish to drop.

## Usage notes

* Dropped functions can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

This demonstrates the DROP FUNCTION command:

```sqlexample
DROP FUNCTION my_echo_udf(VARCHAR);
```

Example output:

```output
+-----------------------------------+
| status                            |
|-----------------------------------|
| MY_ECHO_UDF successfully dropped. |
+-----------------------------------+
```

---
title: DROP GATEWAY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-gateway.md
section: SQL Commands
---

# DROP GATEWAY

Removes the specified [gateway](../../developer-guide/snowpark-container-services/gateway.md) from the current
or specified schema.

See also:
:   [CREATE GATEWAY](create-gateway.md) , [ALTER GATEWAY](alter-gateway.md), [SHOW GATEWAYS](show-gateways.md) , [DESCRIBE GATEWAY](desc-gateway.md)

## Syntax

```sqlsyntax
DROP GATEWAY [ IF EXISTS ] <name>
```

## Required parameters

`name`
:   Specifies the identifier for the gateway to be dropped.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Gateway | Required to drop the gateway. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the gateway named `split_gateway`:

```sqlexample
DROP GATEWAY split_gateway;
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| SPLIT_GATEWAY successfully dropped. |
+-------------------------------------+
```

---
title: DROP GIT REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-git-repository.md
section: SQL Commands
---

# DROP GIT REPOSITORY

Removes the specified Snowflake Git repository clone from the current/specified schema.

See also:
:   [ALTER GIT REPOSITORY](alter-git-repository.md), [CREATE GIT REPOSITORY](create-git-repository.md), [DESCRIBE GIT REPOSITORY](desc-git-repository.md), [SHOW GIT BRANCHES](show-git-branches.md),
    [SHOW GIT REPOSITORIES](show-git-repositories.md), [SHOW GIT TAGS](show-git-tags.md)

## Syntax

```sqlsyntax
DROP GIT REPOSITORY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the Git repository clone to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Usage notes

* Dropped Git repositories can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP GIT REPOSITORY my_repository;
> ```
>
> ```output
> +-------------------------------------+
> |                status               |
> +-------------------------------------+
> | MY_REPOSITORY successfully dropped. |
> +-------------------------------------+
> ```

---
title: DROP ICEBERG TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-iceberg-table.md
section: SQL Commands
---

# DROP ICEBERG TABLE

Removes an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md) from the current/specified schema, but retains a version of the
Iceberg table so that it can be recovered using [UNDROP ICEBERG TABLE](undrop-iceberg-table.md). For more information, see Usage Notes (in this topic).

Note that this topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)

## Syntax

```sqlsyntax
DROP [ ICEBERG ] TABLE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier for the table to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (for example, `"My Object"`).

    If the table identifier is not fully qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the table in the current schema for the session.

`CASCADE | RESTRICT`
:   Specifies whether the table can be dropped if foreign keys exist that reference the table:

    * `CASCADE` drops the table even if the table has primary/unique keys that are referenced by foreign keys in other tables.
    * `RESTRICT` returns a warning about existing foreign key references and does not drop the table.

    Default: `CASCADE`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Iceberg table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | External volume |  |
| USAGE | Integration (catalog) | Required if the Iceberg table uses an external catalog. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For [externally managed Iceberg tables with writes enabled](../../user-guide/tables-iceberg-externally-managed-writes.md),
  Snowflake also instructs your external Iceberg REST catalog to drop the table.
  Snowflake makes a call to your remote Iceberg catalog, instructing it to drop the table and delete the table’s underlying data and metadata.

  Snowflake only drops the table after confirming that the table has successfully been dropped from the remote catalog.

  > **Note:**
  >
  > If you use the AWS Glue Data Catalog as your external catalog, dropping an externally managed table through Snowflake does not delete
  > the underlying table files. This behavior is specific to the AWS Glue Data Catalog implementation.
* Dropping a table does not permanently remove it from the system. Snowflake retains a version of the dropped table in
  [Time Travel](../../user-guide/data-time-travel.md) for the number of days specified by the `DATA_RETENTION_TIME_IN_DAYS` parameter for
  the table. For more information, see [Metadata and snapshots for Iceberg tables](../../user-guide/tables-iceberg.md).
* Within the Time Travel retention period, you can restore a dropped table by using the [UNDROP ICEBERG TABLE](undrop-iceberg-table.md) command.
* After a dropped table has been purged, it cannot be recovered; it must be recreated.
* After dropping a table, creating a table with the same name creates a new version of the table. You can restore
  the dropped version of the previous table with the following steps:

  1. Rename the current version of the table to a different name.
  2. Use the [UNDROP ICEBERG TABLE](undrop-iceberg-table.md) command to restore the previous version.
* Before you drop a table, verify that no views reference the table. Dropping a table that is referenced by a view
  invalidates the view (querying the view returns an “object does not exist” error).

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a table:

> ```sqlexample
> DROP ICEBERG TABLE t2;
>
> +--------------------------+
> | status                   |
> |--------------------------|
> | T2 successfully dropped. |
> +--------------------------+
> ```

Drop the table again, but don’t raise an error if the table doesn’t exist:

> ```sqlexample
> DROP ICEBERG TABLE IF EXISTS t2;
>
> +------------------------------------------------------------+
> | status                                                     |
> |------------------------------------------------------------|
> | Drop statement executed successfully (T2 already dropped). |
> +------------------------------------------------------------+
> ```

---
title: DROP IMAGE REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-image-repository.md
section: SQL Commands
---

# DROP IMAGE REPOSITORY

Removes the specified [image repository](../../developer-guide/snowpark-container-services/tutorials/tutorial-1.md) from
the current or specified schema.

See also:
:   [CREATE IMAGE REPOSITORY](create-image-repository.md) , [SHOW IMAGE REPOSITORIES](show-image-repositories.md)

## Syntax

```sqlsyntax
DROP IMAGE REPOSITORY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the repository to drop.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Image repository |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropping an image repository while services are running that reference images in that repository can cause problems. Currently
  running service instances and jobs will continue to run, but any attempt to create a new service instance will fail.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the repository named `tutorial_repository`:

```sqlexample
DROP IMAGE REPOSITORY tutorial_repository;
```

```output
+-------------------------------------------+
| status                                    |
|-------------------------------------------|
| TUTORIAL_REPOSITORY successfully dropped. |
+-------------------------------------------+
```

---
title: DROP INDEX
source: https://docs.snowflake.com/en/sql-reference/sql/drop-index.md
section: SQL Commands
---

# DROP INDEX

Drops a secondary index.

See also:
:   [CREATE INDEX](create-index.md) , [SHOW INDEXES](show-indexes.md) , [CREATE HYBRID TABLE](create-hybrid-table.md) , [DROP TABLE](drop-table.md) , [DESCRIBE TABLE](desc-table.md) , [SHOW HYBRID TABLES](show-hybrid-tables.md)

## Syntax

```sqlsyntax
DROP INDEX [ IF EXISTS ] <table_name>.<index_name>
```

## Parameters

`table_name`
:   Specifies the identifier for the table.

`index_name`
:   Specifies the identifier for the index.

## Usage notes

* This command can only be used to drop a *secondary* index. To drop an index that is used to enforce a UNIQUE
  or FOREIGN KEY constraint, use the [ALTER TABLE](alter-table.md) command to drop the constraint.
* Indexes cannot be undropped.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Removes the secondary index `c_idx` on table `t0`:

```sqlexample
DROP INDEX t0.c_idx;
```

---
title: DROP INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/sql/drop-integration.md
section: SQL Commands
---

# DROP INTEGRATION

Removes an integration from the account.

See also:
:   [CREATE INTEGRATION](create-integration.md) , [ALTER INTEGRATION](alter-integration.md) , [SHOW INTEGRATIONS](show-integrations.md) , [DESCRIBE INTEGRATION](desc-integration.md)

API integrations:
:   [CREATE API INTEGRATION](create-api-integration.md)

catalog integrations:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md)

External access integrations:
:   [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)

Notification integrations:
:   [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)

Security integrations:
:   [CREATE SECURITY INTEGRATION](create-security-integration.md)

Storage integrations:
:   [CREATE STORAGE INTEGRATION](create-storage-integration.md)

## Syntax

```sqlsyntax
DROP [ { API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE } ] INTEGRATION [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the integration to drop. If the identifier contains spaces, special characters, or mixed-case characters,
    the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (e.g. `"My Object"`).

`API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE`
:   Specifies the integration type.

## Usage notes

* Dropped integrations cannot be recovered; they must be recreated.
* Disabling or dropping the integrations may not take effect immediately, since integrations may be cached.
  It is recommended to remove the integration privilege from the cloud provider to take effect sooner.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop an integration:

> ```sqlexample
> SHOW INTEGRATIONS LIKE 't2%';
>
> DROP INTEGRATION t2;
>
> SHOW INTEGRATIONS LIKE 't2%';
> ```

Drop the integration again, but don’t raise an error if the integration does not exist:

> ```sqlexample
> DROP INTEGRATION IF EXISTS t2;
> ```

---
title: DROP JOIN POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-join-policy.md
section: SQL Commands
---

# DROP JOIN POLICY

Removes a [join policy](../../user-guide/join-policies.md) from the current/specified schema.

See also:
:   [Join policy DDL reference](../../user-guide/join-policies.md)

## Syntax

```sqlsyntax
DROP JOIN POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the join policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Join policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about join policy DDL and privileges, see [Managing join policies](../../user-guide/join-policies.md).

## Usage notes

* Prior to dropping the join policy, execute the following statement to determine if the policy is set on any tables or
  views.

  ```sqlexample
  SELECT * FROM TABLE(mydb.INFORMATION_SCHEMA.POLICY_REFERENCES(POLICY_NAME=>'my_join_policy'));
  ```

  For more information, see [Getting information about tables and views attached to join policies](../../user-guide/join-policies.md).
* A join policy cannot be dropped successfully if it is currently assigned to a table or view.

  Before executing a DROP statement, [detach the join policy](../../user-guide/join-policies.md) from the table or view with an ALTER TABLE or ALTER VIEW statement.

## Example

Drop a join policy:

```sqlexample
DROP JOIN POLICY my_join_policy;
```

---
title: DROP LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/drop-listing.md
section: SQL Commands
---

# DROP LISTING

Removes the specified [listing](../../collaboration/collaboration-listings-about.md) from the system and immediately revokes access for all consumers.

> **Important:**
>
> Before dropping a listing, ensure that:
>
> * The listing is in state DRAFT or UNPUBLISH. For more information about changing listing states, see [ALTER LISTING](alter-listing.md).
> * Previously published listings are not mounted by any consumers.

See also:

> [CREATE LISTING](create-listing.md), [ALTER LISTING](alter-listing.md), [DESCRIBE LISTING](desc-listing.md), [SHOW LISTINGS](show-listings.md), [SHOW VERSIONS IN LISTING](show-versions-in-listing.md), [Listing manifest reference](../../progaccess/listing-manifest-reference.md)

## Syntax

```sqlsyntax
DROP LISTING <name>
```

## Parameters

`name`
:   The identifier of the listing to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Only the listing owner, the role with the OWNERSHIP privilege on the listing, has the privileges to drop a listing.
  Executing this command with any other role returns an error.
* Dropped listings cannot be recovered; they must be recreated.
* Dropping a listing automatically invokes the retirement process for all public and monetized listings.
  Additionally, for other listing types the listing is dropped immediately, and all consumer access automatically revoked.
* Provider account Listing Auto-Fulfillment (LAF) replication groups don’t get dropped when you drop a private listing. To resolve this issue after you drop a private listing, revoke the existing grants on the replication group and then drop the replication group. For example:

  ```sqlexample
  GRANT OWNERSHIP ON REPLICATION GROUP myrg TO ROLE accountadmin
  REVOKE CURRENT GRANTS;
  DROP REPLICATION GROUP myrg;
  ```

## Examples

> ```sqlexample
> DROP LISTING IF EXISTS MYLISTING
> ```
>
> ```output
> +----------------------------------+
> | status                           |
> |----------------------------------|
> | MYLISTING successfully dropped. |
> +----------------------------------+
> ```

---
title: DROP MAINTENANCE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-maintenance-policy.md
section: SQL Commands
---

# DROP MAINTENANCE POLICY

Removes a [maintenance policy](../../developer-guide/native-apps/consumer-maintenance-policies.md) from the current or specified schema. The command
fails if the maintenance policy is applied to an app or account.

See also:
:   [CREATE MAINTENANCE POLICY](create-maintenance-policy.md), [ALTER MAINTENANCE POLICY](alter-maintenance-policy.md), [SHOW MAINTENANCE POLICIES](show-maintenance-policies.md)

## Syntax

```sqlsyntax
DROP MAINTENANCE POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier of the maintenance policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| DROP MAINTENANCE POLICY | Maintenance policy |  |
| OWNERSHIP | Maintenance policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Examples

The following example drops a maintenance policy named `my_maintenance_policy`:

```sqlexample
DROP MAINTENANCE POLICY my_maintenance_policy;
```

---
title: DROP MANAGED ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-managed-account.md
section: SQL Commands
---

# DROP MANAGED ACCOUNT

Removes a managed account, including all objects created in the account, and immediately restricts access to the account. Currently
used by data providers to create reader accounts for their consumers. For more details, see [Manage reader accounts](../../user-guide/data-sharing-reader-create.md).

See also:
:   [CREATE MANAGED ACCOUNT](create-managed-account.md) , [SHOW MANAGED ACCOUNTS](show-managed-accounts.md)

## Syntax

```sqlsyntax
DROP MANAGED ACCOUNT <name>
```

## Usage notes

* This command can be executed by users with the ACCOUNTADMIN role (or a role that has been granted the CREATE ACCOUNT global privilege).
* This operation can not be undone.

## Examples

```sqlexample
DROP MANAGED ACCOUNT reader_acct1;

  +------------------------------------+
  | status                             |
  |------------------------------------|
  | READER_ACCT1 successfully dropped. |
  +------------------------------------+
```

---
title: DROP MASKING POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-masking-policy.md
section: SQL Commands
---

# DROP MASKING POLICY

Removes a masking policy from the system.

See also:
:   [Masking policy DDL](../../user-guide/security-column-intro.md)

## Syntax

```sqlsyntax
DROP MASKING POLICY <name>
```

## Parameters

`name`
:   Identifier for the masking policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Masking policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* Prior to dropping a masking policy, execute the following statement to determine if any masking policies are applied to columns. For
  more information, see [POLICY_REFERENCES](../functions/policy_references.md).

  ```sqlexample
  SELECT * from table(information_schema.policy_references(policy_name=>'<string>'));
  ```
* A masking policy cannot be dropped successfully if it is currently assigned to a column or a tag.

  Before executing a DROP statement, UNSET the masking policy from the column with an [ALTER TABLE … ALTER COLUMN](alter-table-column.md) or [ALTER VIEW](alter-view.md)
  statement, and, if necessary, unset the masking policy from the tag using an [ALTER TAG](alter-tag.md) statement.
* You can drop a masking policy that’s in use by a table inside a [backup](../../user-guide/backups.md).

## Example

```sqlexample
DROP MASKING POLICY ssn_mask;
```

---
title: DROP MATERIALIZED VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/drop-materialized-view.md
section: SQL Commands
---

# DROP MATERIALIZED VIEW

Removes the specified materialized view from the current/specified schema.

See also:
:   [ALTER MATERIALIZED VIEW](alter-materialized-view.md) , [CREATE MATERIALIZED VIEW](create-materialized-view.md) , [SHOW MATERIALIZED VIEWS](show-materialized-views.md) , [DESCRIBE MATERIALIZED VIEW](desc-materialized-view.md)

## Syntax

```sqlsyntax
DROP MATERIALIZED VIEW [ IF EXISTS ] <view_name>
```

## Usage notes

* Dropping a materialized view does not update references to that view. For example, if you create a view named “V1” on top of a
  materialized view, and then you drop the materialized view, the definition of view “V1” will become out of date.
* Dropped materialized views can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP MATERIALIZED VIEW mv1;
>
> ---------------------------+
>            status          |
> ---------------------------+
>  MV1 successfully dropped. |
> ---------------------------+
> ```

---
title: DROP MCP SERVER
source: https://docs.snowflake.com/en/sql-reference/sql/drop-mcp-server.md
section: SQL Commands
---

# DROP MCP SERVER

Removes the specified MCP (Model Context Protocol) server from the current/specified schema.

See also:
:   [CREATE MCP SERVER](create-mcp-server.md) , [DESCRIBE MCP SERVER](desc-mcp-server.md) , [SHOW MCP SERVERS](show-mcp-servers.md)

## Syntax

```sqlsyntax
DROP MCP SERVER [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the MCP server to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| OWNERSHIP or MODIFY | MCP server |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage Notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* Dropping an MCP server removes the server object and its tool configurations. Any MCP clients currently connected to the server will lose access.
* The underlying objects referenced by the MCP server tools (Cortex Search Services, Cortex Agents, UDFs, stored procedures) are not affected by dropping the MCP server.

## Examples

The following example drops the MCP server named `my_mcp_server`:

```sqlexample
DROP MCP SERVER my_mcp_server;
```

```output
+----------------------------------------+
| status                                 |
|----------------------------------------|
| MY_MCP_SERVER successfully dropped.    |
+----------------------------------------+
```

The following example drops the MCP server named `my_mcp_server` if it exists:

```sqlexample
DROP MCP SERVER IF EXISTS my_mcp_server;
```

---
title: DROP MODEL
source: https://docs.snowflake.com/en/sql-reference/sql/drop-model.md
section: SQL Commands
---

# DROP MODEL

Removes a machine learning model from the current/specified schema.

See also:
:   [CREATE MODEL](create-model.md) , [ALTER MODEL](alter-model.md) , [SHOW MODELS](show-models.md)

## Syntax

```sqlsyntax
DROP MODEL <name>
```

## Parameters

`name`
:   Specifies the identifier for the model to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    If the model identifier is not fully-qualified (in the form of `db_name.schema_name.model_name` or
    `schema_name.model_name`), the command looks for the model in the current schema for the session.

## Usage notes

* All versions in the model are dropped along with the model.
* There is no UNDROP MODEL command. To restore a dropped model, train and log it again.

---
title: DROP MODEL MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/drop-model-monitor.md
section: SQL Commands
---

# DROP MODEL MONITOR

Removes the specified [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md) from the
current or specified schema. Dropped monitors cannot be recovered; they must be recreated.

See also:
:   [CREATE MODEL MONITOR](create-model-monitor.md),
    [ALTER MODEL MONITOR](alter-model-monitor.md),
    [SHOW MODEL MONITORS](show-model-monitors.md),
    [DESCRIBE MODEL MONITOR](desc-model-monitor.md)

## Syntax

```sqlsyntax
DROP MODEL MONITOR [ IF EXISTS ] <monitor_name>;
```

## Parameters

`monitor_name`
:   Specifies the identifier for the model monitor to drop. If the identifier contains spaces, special characters, or
    mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are
    also case-sensitive.

    If the model identifier is not fully qualified (in the form of `db_name.schema_name.monitor_name` or
    `schema_name.monitor_name`)), the command looks for the model in the current schema for the session.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Model monitor | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

---
title: DROP NETWORK POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-network-policy.md
section: SQL Commands
---

# DROP NETWORK POLICY

Removes the specified network policy from the system.

> **Note:**
>
> Only security administrators (i.e. users with the SECURITYADMIN role) can drop network policies.

See also:
:   [CREATE NETWORK POLICY](create-network-policy.md) , [ALTER NETWORK POLICY](alter-network-policy.md) , [SHOW NETWORK POLICIES](show-network-policies.md) , [DESCRIBE NETWORK POLICY](desc-network-policy.md)

    [ALTER ACCOUNT](alter-account.md)

## Syntax

```sqlsyntax
DROP NETWORK POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the network policy to drop. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Only the network policy owner (i.e. role with the OWNERSHIP privilege on the network policy) or higher can execute this command.
* Dropped network policies cannot be recovered; they must be recreated.
* A network policy cannot be dropped if it is currently assigned to an account, security integration, or user.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a network policy named `mypolicy`:

> ```sqlexample
> DROP NETWORK POLICY mypolicy;
> ```

---
title: DROP NETWORK RULE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-network-rule.md
section: SQL Commands
---

# DROP NETWORK RULE

Removes the specified network rule from the system.

See also:
:   [CREATE NETWORK RULE](create-network-rule.md) , [ALTER NETWORK RULE](alter-network-rule.md) , [SHOW NETWORK RULES](show-network-rules.md) , [DESCRIBE NETWORK RULE](desc-network-rule.md)

## Syntax

```sqlsyntax
DROP NETWORK RULE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the network rule to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Network Rule | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropped network rules can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a network rule named `myrule`:

> ```sqlexample
> DROP NETWORK RULE myrule;
> ```

---
title: DROP NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/drop-notebook.md
section: SQL Commands
---

# DROP NOTEBOOK

Removes the specified [notebook](../../user-guide/ui-snowsight/notebooks.md) from the current/specified schema, but retains a version of the
notebook so that it can be recovered using [UNDROP NOTEBOOK](undrop-notebook.md).

## Syntax

```sqlsyntax
DROP NOTEBOOK <name>
```

## Parameters

`name`
:   Specifies the identifier for the notebook to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | Notebook | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example drops the notebook named `mynotebook`:

```sqlexample
DROP NOTEBOOK mynotebook;
```

---
title: DROP ONLINE FEATURE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-online-feature-table.md
section: SQL Commands
---

# DROP ONLINE FEATURE TABLE

Removes the specified [online feature table](create-online-feature-table.md) from the current/specified
schema.

See also:
:   [CREATE ONLINE FEATURE TABLE](create-online-feature-table.md) , [ALTER ONLINE FEATURE TABLE](alter-online-feature-table.md), [DESCRIBE ONLINE FEATURE TABLE](desc-online-feature-table.md) , [SHOW ONLINE FEATURE TABLES](show-online-feature-tables.md)

## Syntax

```sqlsyntax
DROP ONLINE FEATURE TABLE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the online feature table to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`IF EXISTS`
:   Specifies to not return an error if the online feature table does not exist.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Online feature table | Role that has the OWNERSHIP privilege on the online feature table. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage Notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the online feature table named `my_online_feature_table`:

```sqlexample
DROP ONLINE FEATURE TABLE my_online_feature_table;
```

```output
+------------------------------------------------+
| status                                         |
|------------------------------------------------|
| MY_ONLINE_FEATURE_TABLE successfully dropped. |
+------------------------------------------------+
```

The following example drops the online feature table named `my_online_feature_table` if it exists:

```sqlexample
DROP ONLINE FEATURE TABLE IF EXISTS my_online_feature_table;
```

---
title: DROP ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-organization-profile.md
section: SQL Commands
---

# DROP ORGANIZATION PROFILE

Removes an organization profile.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
DROP ORGANIZATION PROFILE <name>
```

## Parameters

`name`
:   Specifies the identifier for the organization profile to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. For information about identifier syntax, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Organization profile | Executing this command with any other role returns an error. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropped organization profiles cannot be recovered; they must be recreated. An organization profile cannot be dropped if it is associated with an organizational listing.

## Examples

The following example drops the organization profile named `MYORGANIZATIONPROFILE`:

```sqlexample
DROP ORGANIZATION PROFILE myorganizationprofile;
```

```output
+---------------------------------------------+
| status                                      |
|---------------------------------------------|
| MYORGANIZATIONPROFILE successfully dropped. |
+---------------------------------------------+
```

---
title: DROP ORGANIZATION USER
source: https://docs.snowflake.com/en/sql-reference/sql/drop-organization-user.md
section: SQL Commands
---

# DROP ORGANIZATION USER

Removes an [organization user](../../user-guide/organization-users.md) from the organization.

See also:
:   [CREATE ORGANIZATION USER](create-organization-user.md) , [ALTER ORGANIZATION USER](alter-organization-user.md) , [SHOW ORGANIZATION USERS](show-organization-users.md)

## Syntax

```sqlsyntax
DROP ORGANIZATION USER [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the organization user; must be unique for your organization.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case
    sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Organization user |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropped organization users cannot be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

```sqlexample
DROP ORGANIZATION USER joe;
```

---
title: DROP ORGANIZATION USER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/drop-organization-user-group.md
section: SQL Commands
---

# DROP ORGANIZATION USER GROUP

Removes an [organization user group](../../user-guide/organization-users.md) from the organization.

See also:
:   [CREATE ORGANIZATION USER GROUP](create-organization-user-group.md) , [ALTER ORGANIZATION USER GROUP](alter-organization-user-group.md) , [SHOW ORGANIZATION USER GROUPS](show-organization-user-groups.md)

## Syntax

```sqlsyntax
DROP ORGANIZATION USER GROUP [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the organization user group; must be unique for your organization.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also case
    sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Organization user group |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If an organization user group is dropped, the local users that were created in a regular account when the group was imported are also
  deleted. These users can’t be recovered. You’d have to recreate the local users by creating a new organization user group with the
  organization users, and then importing the group into the regular account.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

```sqlexample
DROP ORGANIZATION USER GROUP data_stewards;
```

---
title: DROP PACKAGES POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-packages-policy.md
section: SQL Commands
---

# DROP PACKAGES POLICY

Removes a packages policy from the system.

## Syntax

```sqlsyntax
DROP PACKAGES POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the packages policy to drop. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Packages policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* A packages policy cannot be dropped successfully if it is currently attached to an account. Before executing a DROP statement,
  to UNSET the packages policy from the account,
  run [ALTER ACCOUNT UNSET PACKAGES POLICY](alter-account.md).

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

---
title: DROP PASSWORD POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-password-policy.md
section: SQL Commands
---

# DROP PASSWORD POLICY

Removes a password policy from the system.

See also:
:   [DDL commands](../../user-guide/password-authentication.md)

## Syntax

```sqlsyntax
DROP PASSWORD POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the password policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Password policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on password policy DDL and privileges, see [DDL commands](../../user-guide/password-authentication.md).

## Usage notes

* Prior to dropping a password policy, execute the following statement to determine if any password policies are applied to the account or
  users in the account. For more information, see [POLICY_REFERENCES](../functions/policy_references.md).

  ```sqlexample
  SELECT * from table(information_schema.policy_references(policy_name=>'<string>'));
  ```
* A password policy cannot be dropped successfully if it is currently attached to an account or user. Before executing a DROP statement,
  UNSET the password policy from the account with an [ALTER ACCOUNT](alter-account.md) statement or unset the password policy from a
  user with an [ALTER USER](alter-user.md) statement.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

Drop a password policy:

> ```sqlexample
> DROP PASSWORD POLICY password_policy_production_1;
> ```

---
title: DROP PIPE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-pipe.md
section: SQL Commands
---

# DROP PIPE

Removes the specified pipe from the current/specified schema.

See also:
:   [CREATE PIPE](create-pipe.md) , [ALTER PIPE](alter-pipe.md) , [SHOW PIPES](show-pipes.md) , [DESCRIBE PIPE](desc-pipe.md)

## Syntax

```sqlsyntax
DROP PIPE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the pipe to drop. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped pipes can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP PIPE mypipe;
>
> +------------------------------+
> | status                       |
> |------------------------------|
> | MYPIPE successfully dropped. |
> +------------------------------+
> ```

---
title: DROP POSTGRES INSTANCE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-postgres-instance.md
section: SQL Commands
---

# DROP POSTGRES INSTANCE

Removes the specified [Snowflake Postgres instance](../../user-guide/snowflake-postgres/about.md) from the account.

See also:
:   [CREATE POSTGRES INSTANCE](create-postgres-instance.md), [ALTER POSTGRES INSTANCE](alter-postgres-instance.md), [DESCRIBE POSTGRES INSTANCE](desc-postgres-instance.md), [SHOW POSTGRES INSTANCES](show-postgres-instances.md)

## Syntax

```sqlsyntax
DROP POSTGRES INSTANCE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the Postgres instance to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Postgres instance |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Currently, dropped Postgres instances can’t be recovered; you must recreate them. However, if you have
  created a [fork](../../user-guide/snowflake-postgres/postgres-point-in-time-recovery.md) of the instance,
  the fork remains independent and unaffected. To make it easier to recreate instances later, you might use
  DESC POSTGRES INSTANCE to capture the details of each instance before dropping it.
* When this command is issued, Snowflake terminates the Postgres instance and releases the associated compute resources.
  Billing for compute resources stops after the instance is fully terminated.
* All data stored in the Postgres instance is permanently deleted. Ensure you have backed up any important data
  before dropping the instance.
* If the instance has [high availability](../../user-guide/snowflake-postgres/high-availability.md) enabled, the
  HA standby is also dropped along with the primary instance.
* If the instance has [read replicas](../../user-guide/snowflake-postgres/postgres-create-replica.md), those replicas
  are also dropped when the primary instance is dropped.
* [Forked instances](../../user-guide/snowflake-postgres/postgres-point-in-time-recovery.md) are independent copies.
  Dropping the source instance doesn’t affect any instances that were forked from it.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a Postgres instance:

```sqlexample
DROP POSTGRES INSTANCE my_postgres;
```

Drop a Postgres instance only if it exists:

```sqlexample
DROP POSTGRES INSTANCE IF EXISTS my_postgres;
```

Use the [flow operator](../operators-flow.md) to find an instance to drop:

```sqlexample
-- Find the oldest instance
-- Then use SET and IDENTIFIER() to drop it
SET oldest_instance = (
  SHOW POSTGRES INSTANCES
    ->> SELECT "name"
        FROM $1
        ORDER BY "created_on"
        LIMIT 1
);

DROP POSTGRES INSTANCE IDENTIFIER($oldest_instance);
```

Find instances below a storage threshold before dropping:

```sqlexample
-- Identify small instances
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "storage_size", "created_on"
      FROM $1
      WHERE "storage_size" < 50
      ORDER BY "storage_size";

DROP POSTGRES INSTANCE some_extremely_small_instance;
```

Check ownership before attempting to drop:

```sqlexample
SHOW GRANTS ON POSTGRES INSTANCE my_postgres;

-- Verify that you have OWNERSHIP privilege, then drop
DROP POSTGRES INSTANCE my_postgres;
```

---
title: DROP PRIVACY POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-privacy-policy.md
section: SQL Commands
---

# DROP PRIVACY POLICY

Removes the specified [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) from the current/specified schema.

See also:
:   [CREATE PRIVACY POLICY](create-privacy-policy.md) , [ALTER PRIVACY POLICY](alter-privacy-policy.md) , [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) , [SHOW PRIVACY POLICIES](show-privacy-policies.md)

## Syntax

```sqlsyntax
DROP PRIVACY POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the privacy policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Privacy policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

A privacy policy cannot be dropped successfully if it is currently assigned to a table or view.

Before executing a DROP statement, execute the following statement to determine if the privacy policy is set on any tables or views.

```sqlexample
SELECT * FROM TABLE(mydb.INFORMATION_SCHEMA.POLICY_REFERENCES(POLICY_NAME=>'my_privacy_policy'));
```

For each table or view, use [ALTER TABLE … DROP PRIVACY POLICY …](alter-table.md) or
[ALTER VIEW … DROP PRIVACY POLICY …](alter-view.md) to [detach the privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) from the
table or view.

## Examples

The following example drops the privacy policy named `myprivpolicy`:

```sqlexample
DROP PRIVACY POLICY myprivpolicy;
```

```output
+------------------------------------+
| status                             |
|------------------------------------|
| MYPRIVPOLICY successfully dropped. |
+------------------------------------+
```

---
title: DROP PROCEDURE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-procedure.md
section: SQL Commands
---

# DROP PROCEDURE

Removes the specified stored procedure from the current/specified schema.

See also:
:   [CREATE PROCEDURE](create-procedure.md) , [ALTER PROCEDURE](alter-procedure.md) , [SHOW PROCEDURES](show-procedures.md) , [DESCRIBE PROCEDURE](desc-procedure.md), [SHOW USER PROCEDURES](show-user-procedures.md)

## Syntax

```sqlsyntax
DROP PROCEDURE [ IF EXISTS ] <procedure_name> ( [ <arg_data_type> , ... ] )
```

## Usage notes

**All Languages**

* For each argument defined for the procedure, the data type for the argument must be specified. This is required because overloading of
  procedure names is supported and the data type(s) for the argument(s) are required to identify the procedure.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

**Java, Python, and Scala**

* For procedures that store code in a file (such as a .jar file or .py file) in a stage, the `DROP PROCEDURE` command does not remove
  the file. Different procedures can use different functions/methods in the same file, so the file should not be removed
  while any procedure refers to it. Snowflake does not store a count of the number of references to each staged file and
  does not remove that staged file when there are no remaining references.

## Examples

> ```sqlexample
> DROP PROCEDURE add_accounting_user(varchar);
>
> -------------------------------------------+
>              status                        |
> -------------------------------------------+
>  ADD_ACCOUNTING_USER successfully dropped. |
> -------------------------------------------+
> ```

---
title: DROP PROJECTION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-projection-policy.md
section: SQL Commands
---

# DROP PROJECTION POLICY

Removes a [projection policy](../../user-guide/projection-policies.md) from the current/specified schema.

See also:
:   [Projection policy DDL reference](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
DROP PROJECTION POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the projection policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Projection policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on projection policy DDL and privileges, see [Privileges and commands](../../user-guide/projection-policies.md).

## Usage notes

* Prior to dropping the projection policy, execute the following statement to determine if the projection policy is set on any columns.

  ```sqlexample
  SELECT * from table(mydb.information_schema.policy_references(policy_name=>'do_not_project'));
  ```

  For more information, see [Identify projection policy references](../../user-guide/projection-policies.md).
* A projection policy cannot be dropped successfully if it is currently assigned to a column.

  Before executing a DROP statement, UNSET the projection policy from the column with an [ALTER TABLE … ALTER COLUMN](alter-table-column.md) or an
  [ALTER VIEW](alter-view.md) statement.

## Example

Drop the projection policy:

```sqlexample
DROP PROJECTION POLICY do_not_project;
```

---
title: DROP REPLICATION GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/drop-replication-group.md
section: SQL Commands
---

# DROP REPLICATION GROUP

Removes a [replication group](../../user-guide/account-replication-intro.md) from the account.

See also:
:   [CREATE REPLICATION GROUP](create-replication-group.md) , [ALTER REPLICATION GROUP](alter-replication-group.md) , [SHOW REPLICATION GROUPS](show-replication-groups.md)

## Syntax

```sqlsyntax
DROP REPLICATION GROUP [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the replication group.

## Usage notes

* Only a user with a role with the OWNERSHIP privilege on the group can execute this SQL command.
* A primary replication group can only be successfully dropped if no linked secondary replication groups exist.
* A database that is included in a replication group is not dropped when the replication group is dropped.

  + If a secondary replication group is dropped, any database previously included in the group loses read-only protection and becomes writable.
  + If the secondary replication group is re-created from the same primary replication group as before, the databases in the group are
    overwritten by the databases in the primary replication group during the first refresh. These databases are read-only.
* To retrieve the set of accounts in your organization that are enabled for replication, use
  [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md).
* To retrieve the list of replication groups in your organization, use [SHOW REPLICATION GROUPS](show-replication-groups.md). The
  `allowed_accounts` column lists all target accounts enabled for object replication from a source account.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop the replication group `myrg`:

```sqlexample
DROP REPLICATION GROUP myrg;
```

---
title: DROP RESOURCE MONITOR
source: https://docs.snowflake.com/en/sql-reference/sql/drop-resource-monitor.md
section: SQL Commands
---

# DROP RESOURCE MONITOR

Removes the specified [resource monitor](../../user-guide/resource-monitors.md) from the system.

See also:
:   [CREATE RESOURCE MONITOR](create-resource-monitor.md) , [ALTER RESOURCE MONITOR](alter-resource-monitor.md) , [SHOW RESOURCE MONITORS](show-resource-monitors.md)

## Syntax

```sqlsyntax
DROP RESOURCE MONITOR [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the resource monitor to drop. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped resource monitors cannot be recovered; they must be recreated.
* Dropping a resource monitor immediately enables resuming any assigned warehouses that have been suspended due to the monitor reaching
  its monthly threshold.

  For more information, see [Working with resource monitors](../../user-guide/resource-monitors.md).

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop resource monitor `my_rm`, but don’t raise an error if the resource monitor doesn’t exist:

```sqlexample
DROP RESOURCE MONITOR IF EXISTS my_rm;
```

---
title: DROP ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-role.md
section: SQL Commands
---

# DROP ROLE

Removes the specified role from the system.

See also:
:   [CREATE ROLE](create-role.md) , [ALTER ROLE](alter-role.md) , [SHOW ROLES](show-roles.md)

## Syntax

```sqlsyntax
DROP ROLE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the role to drop. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped roles cannot be recovered; they must be recreated.
* The current primary role cannot be dropped. An attempt to drop this role returns an error. For example:

  ```sqlexample
  CREATE ROLE bobr_primary;

  GRANT ROLE bobr_primary to USER bobr;

  USE ROLE bobr_primary;

  DROP ROLE bobr_primary;
  ```

  ```output
  SQL execution error: Cannot drop role BOBR_PRIMARY as it is the current primary role.
  ```

  For more information, see [Active roles](../../user-guide/security-access-control-overview.md) and [Authorization through primary role and secondary roles](../../user-guide/security-access-control-overview.md).
* A role cannot be dropped if it has the OWNERSHIP privilege on a shared database. Use the [GRANT OWNERSHIP](grant-ownership.md) command to transfer the
  OWNERSHIP privilege on the shared database first, and then drop the role.
* Ownership of any objects owned by the dropped role is transferred to the role that executes the DROP ROLE command. To transfer
  ownership of each of these objects to a different role, use
  [GRANT OWNERSHIP … COPY CURRENT GRANTS](grant-ownership.md).
* All current and future grants that name the role as either the grantor or the grantee are revoked when the role is dropped.

  Query the [GRANTS_TO_ROLES](../account-usage/grants_to_roles.md) Account Usage view to retrieve the privilege grants
  that name a specified role as the grantor or grantee:

  ```sqlexample
  SELECT *
    FROM SNOWFLAKE.ACCOUNT_USAGE.GRANTS_TO_ROLES
    WHERE grantee_name = UPPER('<role_name>') OR granted_by = UPPER('<role_name>');
  ```

  The following example retrieves the grants where `myrole` is the grantor or grantee:

  ```sqlexample
  SELECT *
    FROM SNOWFLAKE.ACCOUNT_USAGE.GRANTS_TO_ROLES
    WHERE grantee_name = UPPER('myrole') OR granted_by = UPPER('myrole');
  ```
* If a role is a grantor of roles to users, dropping the role revokes these grants automatically.
* Revoking grants happens as the DROP ROLE command executes. If there are thousands or millions of grants to revoke, the DROP ROLE
  command might time out. It is safe to rerun the command to continue execution where the previous invocation stopped.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* You can drop a role that’s in use by an object inside a [backup](../../user-guide/backups.md). Doing so might take a long time
  if there are backups. That’s because Snowflake rewrites the metadata for grants associated with the objects
  inside backups when a role is dropped.

## Examples

```sqlexample
DROP ROLE myrole;
```

---
title: DROP ROW ACCESS POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-row-access-policy.md
section: SQL Commands
---

# DROP ROW ACCESS POLICY

Removes a row access policy from the system.

See also:
:   [Row access policy DDL](../../user-guide/security-row-intro.md)

## Syntax

```sqlsyntax
DROP ROW ACCESS POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the row access policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Schema or  Row access policy | The schema that contains the row access policy.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Manage row access policies](../../user-guide/security-row-intro.md).

## Usage notes

* Prior to dropping a row access policy, execute the following statement to determine if the row access policy is applied to any tables or
  views. For more information, see [POLICY_REFERENCES](../functions/policy_references.md).

  ```sqlexample
  SELECT * from table(information_schema.policy_references(policy_name=>'<string>'));
  ```
* A row access policy cannot be dropped successfully if it is currently attached to a resource. Before executing a DROP statement, detach
  the row access policy from the table or view with an ALTER TABLE or ALTER VIEW statement as shown in
  [ALTER TABLE](alter-table.md) or [ALTER VIEW](alter-view.md).
* Snowflake does not support `UNDROP` with row access policy objects. Using `UNDROP` triggers an error message. For more information
  on this error message, see [Troubleshoot row access policies](../../user-guide/security-row-intro.md).
* If a table column has a row access policy attached to it, the column cannot be dropped from the table.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* You can drop a row access policy that’s in use by a table inside a [backup](../../user-guide/backups.md). In that case, you
  can’t immediately use the table after you restore it from the backup. To use the restored table, run an
  ALTER TABLE … DROP ALL ROW ACCESS POLICIES command after restoring it.

## Example

The following example drops a row access policy from a table.

> ```sqlexample
> DROP ROW ACCESS POLICY rap_table_employee_info;
> ```

---
title: DROP SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/drop-schema.md
section: SQL Commands
---

# DROP SCHEMA

Removes a schema from the current/specified database.

See also:
:   [CREATE SCHEMA](create-schema.md) , [ALTER SCHEMA](alter-schema.md) , [DESCRIBE SCHEMA](desc-schema.md) , [SHOW SCHEMAS](show-schemas.md) , [UNDROP SCHEMA](undrop-schema.md)

## Syntax

```sqlsyntax
DROP SCHEMA [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier for the schema to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    If the schema identifier is not fully-qualified (in the form of `db_name.schema_name`), the command looks for the schema
    in the current database for the session.

`CASCADE | RESTRICT`
:   Specifies whether the schema can be dropped if foreign keys exist that reference any tables in the schema:

    * `CASCADE` drops the schema and all objects in the schema, including tables with primary/unique keys that are referenced by
      foreign keys in other tables.
    * `RESTRICT` returns a warning about existing foreign key references and does not drop the schema.

    Default: `CASCADE`

## Usage notes

* Dropping a schema does not permanently remove it from the system. A version of the dropped schema is retained in
  [Time Travel](../../user-guide/data-time-travel.md) for the number of days specified by the `DATA_RETENTION_TIME_IN_DAYS`
  parameter for the schema:

  > 1. Within the Time Travel retention period, a dropped schema can be restored using the [UNDROP SCHEMA](undrop-schema.md) command.
  > 2. When the Time Travel retention period ends, the next state for the dropped schema depends on whether it is permanent or transient:
  >
  >    + A permanent schema moves into [Fail-safe](../../user-guide/data-failsafe.md). In Fail-safe (7 days), a dropped schema can be
  >      recovered, but only by Snowflake. When the schema leaves Fail-safe, it is purged.
  >    + A transient schema has no Fail-safe, so it is purged when it moves out of Time Travel.
  > 3. Once a dropped schema has been purged, it cannot be recovered; it must be recreated.
* Currently, when a schema is dropped, the data retention period for child tables, if explicitly set to be different from the retention
  of the schema, is not honored. The child tables are retained for the same period of time as the schema. To honor the data retention
  period for these tables, drop them explicitly before you drop the schema.
* After dropping a schema, creating a schema with the same name creates a new version of the schema. The dropped version of the previous
  schema can still be restored using the following method:

  > 1. Rename the current version of the schema to a different name.
  > 2. Use the [UNDROP SCHEMA](undrop-schema.md) command to restore the previous version.
* In a [catalog-linked database](../../user-guide/tables-iceberg-catalog-linked-database.md) that allows writes, this command
  simultaneously drops the schema from your catalog-linked database and its corresponding namespace from your remote catalog.
* If a policy or tag is attached a table or view column, dropping the schema successfully requires the policy or tag to be self-contained
  within the database and schema. For example, `database_1` contains `policy_1` and `policy_1` is only used in `database_1`.
  Otherwise, a [dangling reference](../../user-guide/database-replication-considerations.md) occurs.
* The DROP operation fails if a session policy or password policy is set on a user or the account.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

> **Important:**
>
> If the schema contains a snapshot set that has an associated snapshot policy with a retention lock, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the schema containing the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.

## Examples

Drop a schema named `myschema` (from the [CREATE SCHEMA](create-schema.md) examples):

> ```sqlexample
> DROP SCHEMA myschema;
>
> +--------------------------------+
> | status                         |
> |--------------------------------|
> | MYSCHEMA successfully dropped. |
> +--------------------------------+
>
> SHOW SCHEMAS;
>
> +---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+
> | created_on                      | name               | is_default | is_current | database_name | owner  | comment                                                   | options | retention_time |
> |---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------|
> | Fri, 13 May 2016 17:26:07 -0700 | INFORMATION_SCHEMA | N          | N          | MYTESTDB      |        | Views describing the contents of schemas in this database |         |              1 |
> | Tue, 17 Mar 2015 16:57:04 -0700 | PUBLIC             | N          | Y          | MYTESTDB      | PUBLIC |                                                           |         |              1 |
> +---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+
> ```

---
title: DROP SECRET
source: https://docs.snowflake.com/en/sql-reference/sql/drop-secret.md
section: SQL Commands
---

# DROP SECRET

Removes a secret from the system.

See also:
:   [ALTER SECRET](alter-secret.md) , [CREATE SECRET](create-secret.md) , [DESCRIBE SECRET](desc-secret.md) , [SHOW SECRETS](show-secrets.md)

## Syntax

```sqlsyntax
DROP SECRET [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the secret to drop. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Secret | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a secret:

> ```sqlexample
> DROP SECRET service_now_creds;
> ```

---
title: DROP SEMANTIC VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/drop-semantic-view.md
section: SQL Commands
---

# DROP SEMANTIC VIEW

Removes the specified [semantic view](../../user-guide/views-semantic/overview.md) from the current/specified schema.

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
DROP SEMANTIC VIEW [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the semantic view to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Semantic view | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example drops the semantic view named `my_semantic_view`:

```sqlexample
DROP SEMANTIC VIEW my_semantic_view;
```

---
title: DROP SEQUENCE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-sequence.md
section: SQL Commands
---

# DROP SEQUENCE

Removes a sequence from the current/specified schema.

See also:
:   [CREATE SEQUENCE](create-sequence.md) , [ALTER SEQUENCE](alter-sequence.md) , [SHOW SEQUENCES](show-sequences.md) , [DESCRIBE SEQUENCE](desc-sequence.md)

## Syntax

```sqlsyntax
DROP SEQUENCE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier of the sequence to drop.

    If the sequence identifier is not fully-qualified (in the form of `db_name.schema_name.sequence_name` or
    `schema_name.sequence_name`), the command looks for the sequence in the current schema for the session.

`CASCADE | RESTRICT`
:   Snowflake allows the keywords `CASCADE` and `RESTRICT` syntactically, but does not act on them. For example,
    dropping a sequence with the `CASCADE` keyword does not actually drop a table that uses the sequence.
    Dropping a sequence with the `RESTRICT` keyword does not issue a warning if a table is still using the sequence.

## Usage notes

* To drop a sequence, you must be using a role that has ownership privilege on the sequence.
* After dropping a sequence, creating a sequence with the same name creates a new version of the sequence. The
  new sequence does not resume generating numbers where the old sequence left off.
* Before dropping a sequence, verify that no tables or other database objects reference the sequence.
* If the dropped sequence was referenced in the `DEFAULT` clause of a table, then calling `GET_DDL()` for that
  table results in an error, rather than in the DDL that created the table.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a sequence:

> ```sqlexample
> DROP SEQUENCE IF EXISTS invoice_sequence_number;
> ```

---
title: DROP SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-service.md
section: SQL Commands
---

# DROP SERVICE

Removes the specified
[Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md) from the current
or specified schema. The containers for the service are terminated.

See also:
:   [CREATE SERVICE](create-service.md) , [ALTER SERVICE](alter-service.md), [SHOW SERVICES](show-services.md) , [DESCRIBE SERVICE](desc-service.md)

## Syntax

```sqlsyntax
DROP SERVICE [ IF EXISTS ] <name> [ FORCE ]
```

## Required parameters

`name`
:   Specifies the identifier for the service to be dropped.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`FORCE`
:   Drops the service (including job services) and the associated block storage volumes.

    If `FORCE` is not specified and the service uses a
    [block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md)
    an error is returned.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

The following example drops the service named `my_tutorial`:

```sqlexample
DROP SERVICE my_tutorial;
```

```output
+-----------------------------------+
| status                            |
|-----------------------------------|
| MY_TUTORIAL successfully dropped. |
+-----------------------------------+
```

---
title: DROP SESSION POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-session-policy.md
section: SQL Commands
---

# DROP SESSION POLICY

Removes a session policy from the system.

See also:
:   [Session Policy DDL Reference](../../user-guide/session-policies-managing.md)

## Syntax

```sqlsyntax
DROP SESSION POLICY [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the session policy; must be unique for your account.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Session policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on session policy DDL and privileges, see [Managing session policies](../../user-guide/session-policies-managing.md).

## Usage notes

* Prior to dropping a session policy, execute the following statement to determine if any session policies are applied to accounts or
  users. For more information, see [POLICY_REFERENCES](../functions/policy_references.md).

  ```sqlexample
  SELECT * from table(information_schema.policy_references(policy_name=>'<string>'));
  ```
* A session policy cannot be dropped successfully if it is currently attached to an account or user. Before executing a DROP statement,
  UNSET the session policy from the account with an [ALTER ACCOUNT](alter-account.md) statement or unset the session policy from a
  user with an [ALTER USER](alter-user.md) statement.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

```sqlexample
DROP SESSION POLICY session_policy_production_1;
```

---
title: DROP SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-share.md
section: SQL Commands
---

# DROP SHARE

Removes the specified [share](../../user-guide/data-sharing-intro.md) from the system and immediately revokes access for all consumers
(i.e. accounts who have created a database from the share).

See also:
:   [CREATE SHARE](create-share.md) , [ALTER SHARE](alter-share.md) , [SHOW SHARES](show-shares.md) , [DESCRIBE SHARE](desc-share.md)

## Syntax

```sqlsyntax
DROP SHARE <name>
```

## Parameters

`name`
:   Specifies the identifier for the share to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Only the share owner, the role with the OWNERSHIP privilege on the share, has the privileges to drop a share.
  Executing this command with any other role returns an error.
* Dropped shares cannot be recovered; they must be recreated.
* Dropping a share does not affect the database in the share (or any of the objects in the database).

> **Important:**
>
> Before dropping a share, consider the downstream impact of performing this operation:
>
> * Consumer accounts that have created databases from the share will no longer be able to query these databases.
> * Recreating a share with the same name as a previous share does not restore the databases created (by any consumers) from the share.
>   Each consumer must create a new database from the new share.
> * A dropped share can not be restored. The share must be created again using the [CREATE SHARE](create-share.md) command and then
>   configured using [GRANT <privilege> … TO SHARE](grant-privilege-share.md) and [ALTER SHARE](alter-share.md).

## Examples

> ```sqlexample
> DROP SHARE sales_s;
>
> +-------------------------------+
> | status                        |
> |-------------------------------|
> | SALES_S successfully dropped. |
> +-------------------------------+
> ```

---
title: DROP SNAPSHOT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-snapshot.md
section: SQL Commands
---

# DROP SNAPSHOT

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Removes a [snapshot of a block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md). A snapshot is persisted data that the customer pays for. DROP SNAPSHOT tells Snowflake to delete that data. The data is no longer available for use as a snapshot and the customer no longer pays for it.

See also:
:   [CREATE SNAPSHOT](create-snapshot.md) , [ALTER SNAPSHOT](alter-snapshot.md), [DESCRIBE SNAPSHOT](desc-snapshot.md), [SHOW SNAPSHOTS](show-snapshots.md)

## Syntax

```sqlsyntax
DROP SNAPSHOT [ IF EXISTS ] <name>;
```

## Parameters

`name`
:   Specifies the identifier for the snapshot to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Snapshot | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* Dropping a snapshot does not immediately remove it from the system. A version of the dropped snapshot is retained in [Time Travel](../../user-guide/data-time-travel.md) for
  the number of days specified by the DATA_RETENTION_TIME_IN_DAYS parameter for the parent schema, database, or account:

  + Within the Time Travel retention period, a dropped snapshot can be restored using the UNDROP SNAPSHOT command.
  + After the Time Travel retention period, it is permanently removed; it must be recreated.

  For more information, see [Data retention period](../../user-guide/data-time-travel.md).
* To immediately drop a snapshot without retention, set DATA_RETENTION_TIME_IN_DAYS to 0 at the schema level where the snapshot resides. This setting also affects the retention period for other objects within that schema.

## Examples

The following example drops the snapshot named `example_snapshot`:

```sqlexample
DROP SNAPSHOT example_snapshot;
```

```output
+----------------------------------------+
| status                                 |
|----------------------------------------|
| EXAMPLE_SNAPSHOT successfully dropped. |
+----------------------------------------+
```

---
title: DROP SNAPSHOT POLICY — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/drop-snapshot-policy.md
section: SQL Commands
---

# DROP SNAPSHOT POLICY — *Deprecated*

Deletes a [snapshot](../../user-guide/backups.md) policy.

See also:
:   [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md),
    [ALTER SNAPSHOT POLICY — Deprecated](alter-snapshot-policy.md),
    [SHOW SNAPSHOT POLICIES — Deprecated](show-snapshot-policies.md)

## Syntax

```sqlsyntax
DROP SNAPSHOT POLICY <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot policy.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Snapshot policy | The role used to delete a snapshot policy must have the OWNERSHIP privilege on the policy. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

A snapshot policy can’t be deleted if it is attached to any snapshot set.

## Examples

Delete the snapshot policy `hourly_snapshot_policy`:

```sqlexample
DROP SNAPSHOT POLICY hourly_snapshot_policy;
```

---
title: DROP SNAPSHOT SET — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/drop-snapshot-set.md
section: SQL Commands
---

# DROP SNAPSHOT SET — *Deprecated*

Deletes a [snapshot](../../user-guide/backups.md) set.

See also:
:   [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md),
    [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md),
    [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md)

## Syntax

```sqlsyntax
DROP SNAPSHOT SET <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Snapshot set | The role used to modify the snapshot policy for a snapshot set must have the OWNERSHIP privilege on the set. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

> **Important:**
>
> If the snapshot policy has a retention lock applied to it, and there are any
> unexpired snapshots in the snapshot set, then you can’t delete the snapshot set.
> In that case, you must wait for all the snapshots in the set to expire.
> This restriction applies even to privileged roles such as ACCOUNTADMIN, and to Snowflake support.
> For that reason, be careful when specifying retention lock and a long expiration
> period in a snapshot policy.
>
> You also can’t drop a snapshot set if any of the snapshots it contains have a legal hold applied.

## Examples

Delete the snapshot set `t1_snapshots`:

```sqlexample
DROP SNAPSHOT SET t1_snapshots;
```

---
title: DROP STAGE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-stage.md
section: SQL Commands
---

# DROP STAGE

Removes the specified named internal or external stage from the current/specified schema. The status of the files in the stage depends on
the stage type:

* For an internal stage, all of the files in the stage are purged from Snowflake, regardless of their load status. This
  prevents the files from continuing to using storage and, consequently, accruing storage charges. However, this also means that the
  staged files cannot be recovered after a stage is dropped.
* For an external stage, only the stage itself is dropped; any data files in the referenced external location (Amazon S3, Google Cloud
  Storage, or Microsoft Azure) are not removed.

See also:
:   [CREATE STAGE](create-stage.md) , [ALTER STAGE](alter-stage.md) , [SHOW STAGES](show-stages.md) , [DESCRIBE STAGE](desc-stage.md)

## Syntax

```sqlsyntax
DROP STAGE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the stage to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped stages cannot be recovered; they must be recreated.
* This command cannot be used to drop the stage associated with a table or user; only named stages (internal or external) can be dropped.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP STAGE my_stage;
>
> --------------------------------+
>              status             |
> --------------------------------+
>  MY_STAGE successfully dropped. |
> --------------------------------+
> ```

---
title: DROP STORAGE LIFECYCLE POLICY
source: https://docs.snowflake.com/en/sql-reference/sql/drop-storage-lifecycle-policy.md
section: SQL Commands
---

# DROP STORAGE LIFECYCLE POLICY

Removes the specified [storage lifecycle policy](../../user-guide/storage-management/storage-lifecycle-policies.md) from the current or specified schema.

See also:
:   [CREATE STORAGE LIFECYCLE POLICY](create-storage-lifecycle-policy.md) , [ALTER STORAGE LIFECYCLE POLICY](alter-storage-lifecycle-policy.md) , [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) , [SHOW STORAGE LIFECYCLE POLICIES](show-storage-lifecycle-policies.md)

## Syntax

```sqlsyntax
DROP STORAGE LIFECYCLE POLICY [ IF EXISTS ] <policy_name>
```

## Parameters

`policy_name`
:   Specifies the identifier for the storage lifecycle policy to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Schema or  Storage lifecycle policy | The schema that contains the policy.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

* Snowflake doesn’t support undropping storage lifecycle policy objects.
* If a table column has a storage lifecycle policy attached to it, you can’t drop the column from the table.
* You can’t drop a database or schema that contains a storage lifecycle policy attached to an object that belongs to a different database or schema.
* You can’t drop a storage lifecycle policy that is attached to a table. Remove the policy association before dropping the storage lifecycle policy.
* When you undrop a table or schema with an attached policy, the policy association is restored.

## Examples

The following example drops the storage lifecycle policy named `example_slp`:

```sqlexample
DROP STORAGE LIFECYCLE POLICY example_slp;
```

Output:

```output
+-----------------------------------+
| status                            |
|-----------------------------------|
| EXAMPLE_SLP successfully dropped. |
+-----------------------------------+
```

---
title: DROP STREAM
source: https://docs.snowflake.com/en/sql-reference/sql/drop-stream.md
section: SQL Commands
---

# DROP STREAM

Removes a stream from the current/specified schema.

See also:
:   [CREATE STREAM](create-stream.md) , [ALTER STREAM](alter-stream.md) , [SHOW STREAMS](show-streams.md) , [DESCRIBE STREAM](desc-stream.md)

## Syntax

```sqlsyntax
DROP STREAM [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the stream to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (e.g. `"My Object"`).

    If the stream identifier is not fully-qualified (in the form of `db_name.schema_name.stream_name` or
    `schema_name.stream_name`), the command looks for the stream in the current schema for the session.

## Usage notes

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a stream:

> ```sqlexample
> SHOW STREAMS LIKE 't2%';
>
>
> DROP STREAM t2;
>
>
> SHOW STREAMS LIKE 't2%';
> ```

Drop the stream again, but don’t raise an error if the stream does not exist:

> ```sqlexample
> DROP STREAM IF EXISTS t2;
> ```

---
title: DROP STREAMLIT
source: https://docs.snowflake.com/en/sql-reference/sql/drop-streamlit.md
section: SQL Commands
---

# DROP STREAMLIT

Removes the specified Streamlit object from the current/specified schema.

See also:
:   [CREATE STREAMLIT](create-streamlit.md), [SHOW STREAMLITS](show-streamlits.md), [DESCRIBE STREAMLIT](desc-streamlit.md), [UNDROP STREAMLIT](undrop-streamlit.md), [ALTER STREAMLIT](alter-streamlit.md)

## Syntax

```sqlsyntax
DROP STREAMLIT [IF EXISTS] <name>
```

## Required parameters

`name`
:   Specifies the identifier for the Streamlit object to drop. If the identifier contains spaces, special characters, or
    mixed-case characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are
    also case-sensitive (e.g. `"My Object"`).

    If the Streamlit object identifier is not fully-qualified (in the form of
    `db_name.schema_name.streamlit_name` or `schema_name.streamlit_name`), the command looks for
    the Streamlit object in the current schema for the session.

## Access control requirements

Your role must have the following [privileges](../../user-guide/security-access-control-overview.md) on objects:

| Privilege | Object |
| --- | --- |
| OWNERSHIP | Streamlit object that you remove |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* For Streamlit objects created using ROOT_LOCATION, this command does not drop the underlying stage because
  the owner of the Streamlit object may not be the owner of the stage. Additionally, multiple Streamlit objects
  may point to the same stage. If you need to drop the corresponding stage, use the [DROP STAGE](drop-stage.md) command.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

---
title: DROP TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-table.md
section: SQL Commands
---

# DROP TABLE

Removes a table from the current or specified schema, but retains a version of the table so that it can be recovered by using
[UNDROP TABLE](undrop-table.md). For information, see Usage Notes.

See also:
:   [CREATE TABLE](create-table.md) , [ALTER TABLE](alter-table.md) , [SHOW TABLES](show-tables.md) , [TRUNCATE TABLE](truncate-table.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
DROP TABLE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

## Parameters

`name`
:   Specifies the identifier for the table to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (for example, `"My Object"`).

    If the table identifier is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the table in the current schema for the session.

`CASCADE | RESTRICT`
:   Specifies whether the table can be dropped if foreign keys exist that reference the table:

    * CASCADE: Drops the table even if the table has primary or unique keys that are referenced by foreign keys in other tables.
    * RESTRICT: Returns a warning about existing foreign key references and doesn’t drop the table.

    Default: CASCADE for standard tables; RESTRICT for hybrid tables. See also Dropping hybrid tables.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Table | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Dropping a table does not permanently remove it from the system. A version of the dropped table is retained in
  [Time Travel](../../user-guide/data-time-travel.md) for the number of days specified by the
  [data retention period](../../user-guide/data-time-travel.md) for the table:

  > + Within the Time Travel retention period, you can restore a dropped table by using the [UNDROP TABLE](undrop-table.md) command.
  > + Changing the Time Travel retention period for the account or for a parent object (a database or a schema) *after*
  >   you drop a table doesn’t change the Time Travel retention period for the dropped table.
  >   For more information, see the [note in the Time Travel topic](../../user-guide/data-time-travel.md).
  > + When the Time Travel retention period ends, the next state for the dropped table depends on whether it is permanent, transient, or
  >   temporary:
  >
  >   - A permanent table moves into [Fail-safe](../../user-guide/data-failsafe.md). In Fail-safe (7 days), a dropped table can be recovered,
  >     but only by Snowflake. When the table leaves Fail-safe, it is purged.
  >   - A transient or temporary table has no Fail-safe, so it is purged when it moves out of Time Travel.
  >
  >     > **Note:**
  >     >
  >     > A long-running Time Travel query delays the movement of any data and objects (tables, schemas, and databases) in the account into
  >     > Fail-safe, until the query completes. The purging of temporary and transient tables is delayed in the same way.
  >   - After a dropped table is purged, it can’t be recovered; it must be recreated.
* After you drop a table, creating a table with the same name creates a new version of the table. You can still restore the dropped version of the
  previous table by following these steps:

  1. Rename the current version of the table.
  2. Use the [UNDROP TABLE](undrop-table.md) command to restore the previous version of the table.
* Before dropping a table, verify that *no views reference the table*. Dropping a table referenced by a view invalidates the view
  (that is, querying the view returns an “object does not exist” error).
* To drop a table, you must use a role that has OWNERSHIP privilege on the table.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Dropping hybrid tables

When you drop a hybrid table without specifying the RESTRICT or CASCADE option, and the hybrid table
has a primary-key/foreign-key or unique-key/foreign-key relationship with another table, the DROP TABLE
command fails with an error. The default behavior is RESTRICT.

For example:

```sqlexample
CREATE OR REPLACE HYBRID TABLE ht1(
  col1 NUMBER(38,0) NOT NULL,
  col2 NUMBER(38,0) NOT NULL,
  CONSTRAINT pkey_ht1 PRIMARY KEY (col1, col2));

CREATE OR REPLACE HYBRID TABLE ht2(
  cola NUMBER(38,0) NOT NULL,
  colb NUMBER(38,0) NOT NULL,
  colc NUMBER(38,0) NOT NULL,
  CONSTRAINT pkey_ht2 PRIMARY KEY (cola),
  CONSTRAINT fkey_ht1 FOREIGN KEY (colb, colc) REFERENCES ht1(col1,col2));

DROP TABLE ht1;
```

```output
SQL compilation error:
Cannot drop the table because of dependencies
```

The DROP TABLE command fails in this case. If necessary, you can override the default behavior by specifying
CASCADE in the DROP TABLE command.

```sqlexample
DROP TABLE ht1 CASCADE;
```

Alternatively in this case, you could drop the dependent table `ht2` first, then drop table `ht1`.

## Examples

Drop a table:

> ```sqlexample
> SHOW TABLES LIKE 't2%';
>
> +---------------------------------+------+---------------+-------------+-----------+------------+------------+------+-------+--------------+----------------+
> | created_on                      | name | database_name | schema_name | kind      | comment    | cluster_by | rows | bytes | owner        | retention_time |
> |---------------------------------+------+---------------+-------------+-----------+------------+------------+------+-------+--------------+----------------+
> | Tue, 17 Mar 2015 16:48:16 -0700 | T2   | TESTDB        | PUBLIC      | TABLE     |            |            |    5 | 4096  | PUBLIC       |              1 |
> +---------------------------------+------+---------------+-------------+-----------+------------+------------+------+-------+--------------+----------------+
>
> DROP TABLE t2;
>
> +--------------------------+
> | status                   |
> |--------------------------|
> | T2 successfully dropped. |
> +--------------------------+
>
> SHOW TABLES LIKE 't2%';
>
> +------------+------+---------------+-------------+------+---------+------------+------+-------+-------+----------------+
> | created_on | name | database_name | schema_name | kind | comment | cluster_by | rows | bytes | owner | retention_time |
> |------------+------+---------------+-------------+------+---------+------------+------+-------+-------+----------------|
> +------------+------+---------------+-------------+------+---------+------------+------+-------+-------+----------------+
> ```

Drop the table again, but don’t raise an error if the table does not exist:

> ```sqlexample
> DROP TABLE IF EXISTS t2;
>
> +------------------------------------------------------------+
> | status                                                     |
> |------------------------------------------------------------|
> | Drop statement executed successfully (T2 already dropped). |
> +------------------------------------------------------------+
> ```

---
title: DROP TAG
source: https://docs.snowflake.com/en/sql-reference/sql/drop-tag.md
section: SQL Commands
---

# DROP TAG

Removes a tag from the system.

For information about this command and tag references, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

See also:
:   [CREATE TAG](create-tag.md) , [ALTER TAG](alter-tag.md) , [SHOW TAGS](show-tags.md) , [UNDROP TAG](undrop-tag.md)

## Syntax

```sqlsyntax
DROP TAG [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Identifier for the tag.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Tag | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on tag DDL and privileges, see [Access control privileges](../../user-guide/object-tagging/work.md).

## Usage notes

* Prior to dropping a tag, determine all of the objects the tag is assigned to by calling the Account Usage table function
  [TAG_REFERENCES_WITH_LINEAGE](../functions/tag_references_with_lineage.md).
* A tag can be dropped if it is currently assigned to an [object](../../user-guide/object-tagging/introduction.md). If dropping the tag was
  unintentional, execute an [UNDROP TAG](undrop-tag.md) command. Note that the UNDROP TAG command restores the tag assignments
  prior to the DROP TAG operation.
* A tag cannot be dropped if a masking policy is [assigned](alter-tag.md) to the tag.

  In this scenario, unset the masking policy from the tag first and then execute the DROP TAG statement.
* For more information on tag DDL authorization, see [required privileges](../../user-guide/object-tagging/work.md).

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Example

The following example drops a tag:

> ```sqlexample
> DROP TAG cost_center;
> ```

---
title: DROP TASK
source: https://docs.snowflake.com/en/sql-reference/sql/drop-task.md
section: SQL Commands
---

# DROP TASK

Removes a task from the current/specified schema.

See also:
:   [CREATE TASK](create-task.md) , [ALTER TASK](alter-task.md) , [SHOW TASKS](show-tasks.md) , [DESCRIBE TASK](desc-task.md)

## Syntax

```sqlsyntax
DROP TASK [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the task to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive
    (e.g. `"My Object"`).

    If the task identifier is not fully-qualified (in the form of `db_name.schema_name.task_name` or
    `schema_name.task_name`), the command looks for the task in the current schema for the session.

## Usage notes

* When a task is dropped, any current run of the task (i.e. a run with an EXECUTING state in the
  [TASK_HISTORY](../functions/task_history.md) output) is completed. To abort the run of the specified task, execute the
  [SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS](../functions/system_user_task_cancel_ongoing_executions.md) function.
* The root task in a [task graph](../../user-guide/tasks-graphs.md) must be suspended before any task in the task graph is dropped.
* A standalone task can be dropped by the task owner (i.e. the role that has the OWNERSHIP privilege on the task) or a higher role
  without first suspending the task.
* If a predecessor task in a task graph is dropped, then all former child tasks that identified this task as the predecessor become either
  standalone tasks or root tasks, depending on whether other tasks identify these former child tasks as their predecessor. These former
  child tasks are suspended by default and must be resumed manually.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

Drop a task:

> ```sqlexample
> SHOW TASKS LIKE 't2%';
>
>
> DROP TASK t2;
>
>
> SHOW TASKS LIKE 't2%';
> ```

Drop the task again, but don’t raise an error if the task does not exist:

> ```sqlexample
> DROP TASK IF EXISTS t2;
> ```

---
title: DROP TYPE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-type.md
section: SQL Commands
---

# DROP TYPE

Removes a [user-defined type](../data-types-user-defined.md).

See also:
:   [CREATE TYPE](create-type.md) , [ALTER TYPE](alter-type.md) , [DESCRIBE TYPE](desc-type.md) , [SHOW TYPES](show-types.md) , [UNDROP TYPE](undrop-type.md)

## Syntax

```sqlsyntax
DROP TYPE <name>
```

## Parameters

`name`
:   Specifies the identifier for the user-defined type to drop.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | User-defined type | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Before you drop a user-defined type, verify that *no* tables or other database objects reference the
  user-defined type.

  You can run SHOW commands to determine which database objects reference a user-defined type. For example,
  the following query runs the [SHOW COLUMNS](show-columns.md) command with the
  [pipe operator](../operators-flow.md) (`->>`) to return tables with columns of data types
  that include the text `AGE`:

  ```sqlexample
  SHOW COLUMNS ->>
    SELECT "table_name", "data_type"
      FROM $1
      WHERE "data_type" LIKE '%AGE%';
  ```
* If a user-defined type is dropped and a query directly references a column of that type, the query fails.
  Queries that don’t directly reference the column of the dropped type run normally.

## Examples

Use the DROP TYPE command to drop the `age` user-defined type:

```sqlexample
DROP TYPE age;
```

---
title: DROP USER
source: https://docs.snowflake.com/en/sql-reference/sql/drop-user.md
section: SQL Commands
---

# DROP USER

Removes the specified user from the system.

See also:
:   [CREATE USER](create-user.md) , [ALTER USER](alter-user.md) , [SHOW USERS](show-users.md) , [DESCRIBE USER](desc-user.md)

## Syntax

```sqlsyntax
DROP USER [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the user to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped users cannot be recovered; they must be recreated.

  If you want to disable a user, use [ALTER USER](alter-user.md) and set `DISABLED = TRUE` instead.
* If there is a conflict between a local user object and an [organization user](../../user-guide/organization-users.md), a user that
  corresponds to the organization user is automatically created when you drop the local user.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

> **Important:**
>
> When you drop a user, the folders, worksheets, and dashboards owned by that user become inaccessible and **do not** transfer to another user
> unless sharing is enabled.
>
> Share recipients with [View, View + Run, and Edit permissions](../../user-guide/ui-snowsight-worksheets.md)
> will retain their assigned permissions and can still access the shared folders, worksheets, and dashboards. However, only users with Edit
> permissions can modify or delete the shared folders, worksheets, and dashboards. If you don’t give Edit permissions to at least one other
> user before you drop the owner, that owner’s folders, worksheets, and dashboards cannot be deleted.
>
> If a dropped user’s worksheets do not have sharing enabled, an administrator can [recover up to 500 worksheets owned by the user](../../user-guide/ui-snowsight-worksheets.md).

> **Caution:**
>
> Any worksheets in the Classic Console will be permanently deleted, and dashboards will be inaccessible if they were not previously shared
> with another user.

## Examples

> ```sqlexample
> DROP USER user1;
> ```

---
title: DROP VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/drop-view.md
section: SQL Commands
---

# DROP VIEW

Removes the specified view from the current/specified schema.

See also:
:   [CREATE VIEW](create-view.md) , [ALTER VIEW](alter-view.md) , [SHOW VIEWS](show-views.md) , [DESCRIBE VIEW](desc-view.md)

## Syntax

```sqlsyntax
DROP VIEW [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the view to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    If the view identifier is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the view in the current schema for the session.

## Usage notes

* Dropped views can’t be recovered; they must be recreated.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

## Examples

> ```sqlexample
> DROP VIEW myview;
> ```
>
> ```output
> ------------------------------+
>            status             |
> ------------------------------+
>  MYVIEW successfully dropped. |
> ------------------------------+
> ```

---
title: DROP WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/drop-warehouse.md
section: SQL Commands
---

# DROP WAREHOUSE

Removes the specified [virtual warehouse](../../user-guide/warehouses-overview.md) from the system.

See also:
:   [ALTER WAREHOUSE](alter-warehouse.md) , [CREATE WAREHOUSE](create-warehouse.md) , [DESCRIBE WAREHOUSE](desc-warehouse.md) , [SHOW WAREHOUSES](show-warehouses.md)

## Syntax

```sqlsyntax
DROP WAREHOUSE [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the warehouse to drop. If the identifier contains spaces, special characters, or mixed-case characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Dropped warehouses can’t be recovered; they must be recreated.
* When this command is issued, Snowflake aborts any queries being processed by the specified warehouse and shuts down the compute
  resources utilized by the warehouse. Metering on the compute resources for the warehouse stops after all running statements complete.

* When the IF EXISTS clause is specified and the target object doesn’t exist, the command completes successfully
  without returning an error.

> **Tip:**
>
> To prevent in-progress queries from being aborted for a dropped warehouse (i.e. you wish the queries to be completed):
>
> 1. First suspend the warehouse.
> 2. After all the queries have completed, drop the warehouse.

---
title: EXECUTE ALERT
source: https://docs.snowflake.com/en/sql-reference/sql/execute-alert.md
section: SQL Commands
---

# EXECUTE ALERT

Manually executes an [alert](../../user-guide/alerts.md) independent of the schedule for the alert.

> **Note:**
>
> You cannot use EXECUTE ALERT to execute an [alert on new data](../../user-guide/alerts.md).

See also:
:   [CREATE ALERT](create-alert.md) , [ALTER ALERT](alter-alert.md) , [DROP ALERT](drop-alert.md) , [SHOW ALERTS](show-alerts.md) , [DESCRIBE ALERT](desc-alert.md)

## Syntax

```sqlsyntax
EXECUTE ALERT <name>
```

## Parameters

`name`
:   Identifier for the alert to execute.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| EXECUTE ALERT | Account |  |
| OWNERSHIP or OPERATE | Alert |  |
| USAGE | Warehouse | Required on the warehouse used for the alert. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Alerts always run with the privileges of the owner of the alert, even if a different role with the OPERATE privilege uses
  EXECUTE ALERT to execute the alert.
* If the alert is currently suspended, the EXECUTE ALERT command executes the alert but does not resume the alert. The alert
  remains suspended.
* If the alert is currently running (meaning that the state of the alert in the [ALERT_HISTORY](../functions/alert_history.md)
  table function output or the [ALERT_HISTORY view](../account-usage/alert_history.md) is `EXECUTING`), the
  EXECUTE ALERT command schedules another run of the alert to start immediately after the current run is completed.
* If the alert is currently scheduled (meaning that the state of the alert in the ALERT_HISTORY table function output or the
  ALERT_HISTORY view is `SCHEDULED`), the scheduled run is replaced with the requested run and the current timestamp is set
  to the scheduled time.

  However, if the scheduled time has passed but the alert has not yet transitioned to the `EXECUTING` state, the scheduled run
  occurs as usual. (The scheduled run is not replaced with the run requested by the EXECUTE ALERT command.)

## Examples

The following statement manually triggers an alert named `myalert`:

```sqlexample
EXECUTE ALERT myalert;
```

---
title: EXECUTE DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/execute-dbt-project.md
section: SQL Commands
---

# EXECUTE DBT PROJECT

Executes the specified [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md) or the dbt project in a Snowflake workspace using the dbt command and command-line options specified.

See also:
:   [CREATE DBT PROJECT](create-dbt-project.md), [ALTER DBT PROJECT](alter-dbt-project.md), [DESCRIBE DBT PROJECT](desc-dbt-project.md), [DROP DBT PROJECT](drop-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md)

## Syntax

Executes the dbt project object with the specified name.

```sqlsyntax
EXECUTE DBT PROJECT [ IF EXISTS ] <name>
  [ ARGS = '[ <dbt_command> ] [ --<dbt_cli_option> <option_value_1> [ ... ] ] [ ... ]' ]
  [ DBT_VERSION = 'version_number' ]
```

## Variant syntax

Executes the dbt project that is saved in a workspace with the specified workspace name. The user who owns the workspace must be the user who runs this command variant.

```sqlsyntax
EXECUTE DBT PROJECT [ IF EXISTS ] [ FROM WORKSPACE <name> ]
  [ ARGS = '[ <dbt_command> ] [ --<dbt_cli_option> <option_value_1> [ ... ] [ ... ] ]' ]
  [ DBT_VERSION = 'version_number' ]
  [ PROJECT_ROOT = '<subdirectory_path>' ]
```

## Required parameters

`name`
:   When executing a dbt project object, specifies the name of the dbt project object to execute.

    When executing a dbt project by using the FROM WORKSPACE option, specifies the name of the workspace for dbt Projects on Snowflake. The workspace name is always specified in reference to the `public` schema in the user’s personal database, which is indicated by `user$`.

    We recommend enclosing the workspace name in double quotes because workspace names are case-sensitive and can contain special characters.

    The following example shows a workspace name reference:

    `user$.public."My dbt Project Workspace"`

## Optional parameters

`ARGS = '[ dbt_command ] [ --dbt_cli_option option_value_1 [ ... ] [ ... ] ]'`
:   Specifies the [dbt command](https://docs.getdbt.com/reference/dbt-commands) and supported [command-line options](https://docs.getdbt.com/reference/global-configs/about-global-configs#available-flags) to run when the dbt project executes. This is a literal string that must conform to the syntax and requirements of dbt CLI commands.

    If no value is specified, the dbt project executes with the [dbt command](https://docs.getdbt.com/reference/dbt-commands) and [command-line options](https://docs.getdbt.com/reference/global-configs/about-global-configs#available-flags) specified in the [dbt project object definition](create-dbt-project.md). If you specify dbt CLI options without specifying a dbt command, the dbt `run` command executes by default.

    Default: No value

`DBT_VERSION = 'version_number'`
:   Specifies a version for the dbt Project.

    Default: When you execute a dbt project, the system uses the default version you specified when creating the dbt project. If none was specified, the system uses `1.9.4` by default.

    For more information, see [Supported dbt Core versions for dbt Projects on Snowflake](../../user-guide/data-engineering/dbt-projects-on-snowflake-dbt-core-versions.md).

`PROJECT_ROOT = 'subdirectory_path'`
:   Specifies the subdirectory path to the `dbt_project.yml` file within the dbt project object or workspace. This parameter is only supported when executing a dbt project by using the FROM WORKSPACE option.

    If no value is specified, the dbt project executes with the `dbt_project.yml` file in the root directory of the dbt project object.

    If no `dbt_project.yml` file exists in the root directory or in the PROJECT_ROOT subdirectory, an error occurs.

    Default: No value

## Output

| Column | Description |
| --- | --- |
| `0|1 Success` | `TRUE` if the dbt project executed successfully; otherwise, `FALSE`. If the dbt project fails to execute, an exception message is returned. |
| `EXCEPTION` | Any exception message returned by the dbt project execution. If the dbt project executes successfully, the string `None` is returned. |
| `STDOUT` | The standard output returned by the dbt project execution. |
| `OUTPUT_ARCHIVE_URL` | The URL of the output archive that contains output files of the dbt project execution. This includes log files and artifacts that dbt writes to the `/target` directory. For more information, see [About dbt artifacts](https://docs.getdbt.com/reference/artifacts/dbt-artifacts) in dbt documentation. Selecting this link directly results in an error; however, you can use this URL to retrieve dbt project files and output. For more information, see [Access dbt artifacts and logs programmatically](../../user-guide/data-engineering/dbt-projects-on-snowflake-monitoring-observability.md). |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE | dbt project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

> **Note:**
>
> The dbt command specified in EXECUTE DBT PROJECT runs with the privileges of the `role` specified in the `outputs` block of the projects `profiles.yml` file. Operations are further restricted to only those privileges granted to the Snowflake user calling EXECUTE DBT PROJECT. Both the user and the role specified must have the required privileges to use the `warehouse`, perform operations on the `database` and `schema` specified in the project’s `profiles.yml` file, and perform operations on any other Snowflake objects that the dbt model specifies.

## Examples

* Default run command with target and models specified
* Explicit test command with target and models specified
* Explicit run command with downstream models specified
* Run and test dbt projects using production tasks

### Default run command with target and models specified

Execute a dbt `run` targeting the `dev` profile in the `dbt_project.yml` file in the root directory of the dbt project object and selecting three models from the project DAG. No `run` command is explicitly specified and is executed by default.

```sqlexample
EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project
  ARGS = '--select simple_customers combined_bookings prepped_data --target dev';
```

### Explicit test command with target and models specified

Execute a dbt `test` command targeting the `prod` profile in the `dbt_project.yml` file in the root directory of the dbt project object and selecting three models from the project DAG.

```sqlexample
EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project
  ARGS = '--select simple_customers combined_bookings prepped_data --target prod';
```

### Explicit run command with downstream models specified

Execute a dbt `run` command targeting the `dev` profile in the `dbt_project.yml` file and selecting all models downstream of the `simple_customers` model using the dbt `+` notation.

```sqlexample
EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project
  ARGS = 'run --select simple_customers+ --target dev';
```

### Run and test dbt projects using production tasks

Create a task for a production dbt target that executes a dbt `run` command on a six-hour interval. Then create a task that executes the dbt `test` command after each dbt `run` task completes. The EXECUTE DBT PROJECT command for each task targets the `prod` profile in the `dbt_project.yml` file in the root directory of the dbt project object.

```sqlexample
CREATE OR ALTER TASK my_database.my_schema.run_dbt_project
  WAREHOUSE = my_warehouse
  SCHEDULE = '6 hours'
AS
  EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project args='run --target prod';

CREATE OR ALTER TASK change_this.public.test_dbt_project
        WAREHOUSE = my_warehouse
        AFTER run_dbt_project
AS
  EXECUTE DBT PROJECT my_database.my_schema.my_dbt_project args='test --target prod';
```

### Override the project’s pinned version at execution time for testing or temporary needs

`my_dbt_project` is pinned to 1.9.4. This execution overrides the dbt project’s default 1.9.4 version:

```sqlexample
EXECUTE DBT PROJECT finance_analytics
  DBT_VERSION = '1.10.15'
```

---
title: EXECUTE DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/execute-dcm-project.md
section: SQL Commands
---

# EXECUTE DCM PROJECT

Executes one of the following actions on a [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md):

* `EXECUTE DCM PROJECT <name> PLAN` performs a dry run of the DCM project to analyze the changes that would be applied to the target
  during a deployment, but doesn’t apply any changes.
* `EXECUTE DCM PROJECT <name> DEPLOY` deploys the changes defined in the project’s definition files to the account.
* `EXECUTE DCM PROJECT <name> REFRESH ALL` refreshes dynamic tables managed by the DCM project.
* `EXECUTE DCM PROJECT <name> TEST ALL` tests all expectations from attached data metric functions managed by the DCM project.
* `EXECUTE DCM PROJECT <name> PREVIEW` returns a data sample of the current definitions specified in the source path for
  the specified table, view, or dynamic table.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md), [ALTER DCM PROJECT](alter-dcm-project.md), [DESCRIBE DCM PROJECT](desc-dcm-project.md), [DROP DCM PROJECT](drop-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md), [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
EXECUTE DCM PROJECT <name>
  PLAN
  [ USING [ CONFIGURATION <config_name> ] [ (<expr>, [, <expr>, ...]) ] ]
  FROM '<source-files_path>'

EXECUTE DCM PROJECT <name>
  DEPLOY [ AS '<deployment_name_alias>' ]
  [ USING [ CONFIGURATION <name> ] [ (<expr>, [, <expr>, ...]) ] ]
  FROM '<source-files_path>'

EXECUTE DCM PROJECT <name>
  REFRESH ALL

EXECUTE DCM PROJECT <name>
  TEST ALL

EXECUTE DCM PROJECT <name>
  PREVIEW <fully_qualified_table_object_name>
  USING CONFIGURATION <config_name>
  FROM '<source_files_path>'
```

## Required parameters

`name`
:   Specifies the identifier for the DCM project to execute.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`PLAN`
:   Instructs Snowflake to perform a dry run of the DCM project. For a dry run, Snowflake analyzes the changes that
    would be applied to the target during a deployment, but doesn’t apply any changes.

`DEPLOY [ AS 'deployment_name_alias' ]`
:   Deploys the changes defined in the project’s definition files to the account; optionally specifies an alias for the deployment.

`FROM 'source_files_path'`
:   Specifies the directory that contains the source files for the DCM project. The directory must contain a manifest file and at least one
    definition file in `/sources/definitions/`. The manifest file provides the templating values in case a configuration was specified.

`REFRESH ALL`
:   Refreshes all dynamic tables that are currently managed by the DCM project.

`TEST ALL`
:   Tests all data quality expectations attached to tables, dynamic tables, or views which are currently managed by the DCM project.

`PREVIEW fully_qualified_table_object_name`
:   Returns a data sample of the current definitions specified in the source path for the specified table, view, or dynamic table -
    independent of any deployed state.

## Optional parameters

`USING CONFIGURATION config_name`
:   Specifies the configuration to use. This lets you customize deployments for different environments, such as development, staging, or
    production, without using different project definition files.

    If the configuration name is not in all uppercase, enclose it in double quotes.

`USING ( expr [, expr , ... ] )`
:   Optionally specifies template variable values. Using this option overrides any default or configuration values for this specific variable.
    The single expression must have the following form: `<variable_name> => <variable_value>`. For lists, use the following form:
    `<variable_name> => [<value1>, <value2>, ...]`. For example: `wh_size => 'MEDIUM'` or `teams =>
    ['TEAM_A', 'TEAM_B']`.

    This lets you customize deployments for different environments, such as development, staging, or production, without using different
    project definition files.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | DCM project | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

After the DCM project executes, this command returns the following output depending on the variation:

* PLAN and DEPLOY: A single row containing a JSON object with the change log.
* PREVIEW: A result set.
* REFRESH ALL: A single row containing a JSON object that contains the full response.
* TEST ALL: A single row containing a JSON object that contains the full response.

### PLAN and DEPLOY output

> **Note:**
>
> During the preview phase, the exact output format can be subject to change.

The standard plan output contains the following information about the plan execution in JSON format:

```text
{
  "version": 2,
  "metadata": {
    "timestamp": <timestamp>,
    "query_id": <query_id>,
    "project_name": <project_name>,
    "user": <user>,
    "role_name": <role_name>,
    "command": <command>
  },
  "changeset": [
    {
      "type": <type>,
      "object_id": {
        "domain": <domain>,
        "name": <name>,
        "fqn": <fqn>,
        "database": <database>,
        "schema": <schema>
      },
      "changes": [
        {
          "kind": <kind>,
          "attribute_name": <attribute_name>,
          "value": <value>,
          "changes": [
            {
              "kind": <kind>,
              "attribute_name": <attribute_name>,
              "value": <value>
            }
          ]
        }
      ]
    }
  ]
}
```

| Property | Description |
| --- | --- |
| `version` | Schema version of the output format. Version 2 is the latest and only supported version. |
| `metadata` | Contextual information about the execution. |
| `metadata.timestamp` | ISO 8601 timestamp of when the command was executed. |
| `metadata.query_id` | Unique identifier for the query that produced this plan. |
| `metadata.project_name` | Fully qualified name of the DCM Project object. |
| `metadata.user` | Name of the user who executed the command. |
| `metadata.role_name` | Active role used to execute the command. |
| `metadata.command` | The command that was executed. `PLAN` or `DEPLOY`. |
| `changeset` | An array of change entries. Each entry represents one object that would be or was created, altered, or dropped. An empty array indicates the project definitions are already in sync with the account. |
| `changeset[].type` | The planned action for the object. Possible values: `CREATE`, `ALTER`, `DROP`. |
| `changeset[].object_id` | Identifies the target object. |
| `changeset[].object_id.domain` | The Snowflake object type. |
| `changeset[].object_id.name` | Name of the object. |
| `changeset[].object_id.fqn` | Fully qualified name of the object. |
| `changeset[].object_id.database` | Database containing the object. Omitted for account-level objects. |
| `changeset[].object_id.schema` | Schema containing the object. Omitted for database-level and account-level objects. |
| `changeset[].changes` | An array of change descriptors detailing the specific attribute modifications. |
| `changeset[].changes[].kind` | The type of change. Possible values: `set`, `changed`, `unset`, `nested`, `collection`. The value of `kind` determines the remaining keys in the object. |
| `changeset[].changes[].attribute_name` | Name of the attribute being set or changed. Present when `kind` is `set`, `changed`, or `unset`. |
| `changeset[].changes[].value` | The new value for the attribute. Present when `kind` is `set` or `changed`. |
| `changeset[].changes[].prev_value` | The previous value of the attribute before the change. Present only when `kind` is `changed`. |
| `changeset[].changes[].collection_name` | Name of the collection being modified (for example, `columns`, `constraints`, `privileges`, `expectations`). Present only when `kind` is `collection`. |
| `changeset[].changes[].id_label` | Label used to identify items within the collection (for example, `name`). Present only on certain collections. |
| `changeset[].changes[].changes` | A nested array of collection item descriptors. Present only when `kind` is `collection`. |
| `changeset[].changes[].changes[].kind` | The type of change to the collection item. Possible values: `added`, `removed`, `modified`. |
| `changeset[].changes[].changes[].item_id` | Identifies the item within the collection. Can be a string or an object, depending on the collection type. |
| `changeset[].changes[].changes[].changes` | An array of further change descriptors for this item. Present for `added` and `modified` items. Always absent for `removed` items. |

An example of a plan output:

```text
{
  "version": 2,
  "metadata": {
    "timestamp": <timestamp>,
    "query_id": <query_id>,
    "project_name": <project_name>,
    "user": <user>,
    "role_name": <role_name>,
    "command": <command>
  },
  "changeset": [
    {
      "type": "CREATE",
      "object_id": {
        "domain": "TABLE",
        "name": "CUSTOMER_SUMMARY",
        "fqn": "MY_DB.ANALYTICS.CUSTOMER_SUMMARY",
        "database": "MY_DB",
        "schema": "ANALYTICS"
      },
      "changes": [
        {
          "kind": "set",
          "attribute_name": "warehouse_size",
          "value": "XSMALL"
        },
        {
          "kind": "set",
          "attribute_name": "query",
          "value": "SELECT customer_id, SUM(amount) AS total FROM orders GROUP BY customer_id"
        }
      ]
    },
    {
      "type": "ALTER",
      "object_id": {
        "domain": "DYNAMIC_TABLE",
        "name": "ORDER_DETAILS",
        "fqn": "MY_DB.ANALYTICS.ORDER_DETAILS",
        "database": "MY_DB",
        "schema": "ANALYTICS"
      },
      "changes": [
        {
          "kind": "changed",
          "attribute_name": "warehouse_size",
          "value": "SMALL",
          "prev_value": "XSMALL"
        },
        {
          "kind": "collection",
          "collection_name": "columns",
          "id_label": "name",
          "changes": [
            {
              "kind": "added",
              "item_id": "DISCOUNT_AMOUNT",
              "changes": [
                {
                  "kind": "set",
                  "attribute_name": "data_type",
                  "value": "NUMBER(10,2)"
                }
              ]
            },
            {
              "kind": "modified",
              "item_id": "ORDER_STATUS",
              "changes": [
                {
                  "kind": "changed",
                  "attribute_name": "data_type",
                  "value": "VARCHAR(50)",
                  "prev_value": "VARCHAR(20)"
                }
              ]
            },
            {
              "kind": "removed",
              "item_id": "LEGACY_FLAG"
            }
          ]
        }
      ]
    },
    {
      "type": "DROP",
      "object_id": {
        "domain": "VIEW",
        "name": "OLD_REPORT_VIEW",
        "fqn": "MY_DB.ANALYTICS.OLD_REPORT_VIEW",
        "database": "MY_DB",
        "schema": "ANALYTICS"
      },
      "changes": []
    }
  ]
}
```

### REFRESH ALL output

The JSON output contains the results of the dynamic table refresh operation in the following format:

```text
{
  "dts_refresh_result": {
    "refreshed_tables": [
      {
        "table_name": <table_name>,
        "statistics": {
          "inserted_rows": <inserted_rows>,
          "deleted_rows": <deleted_rows>
        },
        "data_timestamp": <data_timestamp>
      }
    ]
  }
}
```

| Property | Description |
| --- | --- |
| `dts_refresh_result` | Contains the results of the dynamic table refresh operation. |
| `refreshed_tables[]` | An array of entries, one for each dynamic table that was refreshed. |
| `table_name` | Fully qualified name of the dynamic table that was refreshed. |
| `statistics` | Refresh statistics for the table. |
| `inserted_rows` | Number of rows inserted during the refresh. |
| `deleted_rows` | Number of rows deleted during the refresh. |
| `data_timestamp` | ISO 8601 timestamp representing the point-in-time freshness of the data after the refresh. |

An example of the JSON output for a dynamic table refresh:

```text
{
  "dts_refresh_result": {
    "refreshed_tables": [
      {
        "table_name": "db.schema.my_dynamic_table",
        "statistics": {
          "inserted_rows": 150,
          "deleted_rows": 30
        },
        "data_timestamp": "2026-03-16T12:00:00.000Z"
      }
    ]
  }
}
```

### TEST ALL output

The TEST output contains the overall status and expectations with their values in the following format:

> **Note:**
>
> During the preview phase, the exact output format can be subject to change.

```text
{
  "status": <status>,
  "expectations": [
    {
      "table_name": <table_name>,
      "metric_database": <metric_database>,
      "metric_schema": <metric_schema>,
      "metric_name": <metric_name>,
      "expectation_name": <expectation_name>,
      "expectation_expression": <expectation_expression>,
      "value": <value>,
      "expectation_violated": <expectation_violated>,
      "column_names": <column_names>
    }
  ]
}
```

| Property | Description |
| --- | --- |
| `status` | Overall result of the test run. Possible values: `SUCCESSFUL` (all expectations met), `FAILED` (one or more expectations violated). |
| `expectations[]` | An array of expectation results, one for each data quality expectation evaluated. |
| `table_name` | Fully qualified name of the table or view on which the expectation was evaluated. |
| `metric_database` | Database that contains the data metric function. |
| `metric_schema` | Schema that contains the data metric function. |
| `metric_name` | Name of the data metric function (for example, `NULL_COUNT`, `MIN`, `UNIQUE_COUNT`). |
| `expectation_name` | Name of the expectation as defined in the project. |
| `expectation_expression` | Boolean expression that the metric value is evaluated against (for example, `value = 0`, `value >= 0`). |
| `value` | The result of the data metric function evaluation. Present only when `expectation_violated` is `false`. |
| `expectation_violated` | Whether the expectation was violated. `true` if the metric value did not satisfy the expectation expression; `false` otherwise. |
| `column_names` | An array of column names on which the data metric function was evaluated. |

An example of the JSON output for a data quality test:

```text
{
  "status": "FAILED",
  "expectations": [
    {
      "table_name": "db.schema.my_table",
      "metric_database": "SNOWFLAKE",
      "metric_schema": "CORE",
      "metric_name": "NULL_COUNT",
      "expectation_name": "no_nulls_in_id",
      "expectation_expression": "value = 0",
      "value": 0,
      "expectation_violated": false,
      "column_names": ["ID"]
    },
    {
      "table_name": "db.schema.my_table",
      "metric_database": "SNOWFLAKE",
      "metric_schema": "CORE",
      "metric_name": "UNIQUE_COUNT",
      "expectation_name": "unique_id_check",
      "expectation_expression": "value >= 100",
      "value": null,
      "expectation_violated": true,
      "column_names": ["ID"]
    }
  ]
}
```

## Usage notes

When executing a DCM project with EXECUTE DCM PROJECT PLAN, the output of the command is the same as for the actual deployment. The
difference is that no changes to the affected account are applied. This feature allows you to verify whether the rendered definition files
have a valid syntax, what changes would be applied to the account, and wether the project owner role has the required privileges to apply
these changes.

To avoid unintended changes and catch errors, always run EXECUTE DCM PROJECT PLAN before you deploy a DCM project.

### Support for template variables

Template variables let you dynamically choose the content of the parameterized definitions files during the DCM project execution. You can
use template variables in the following ways:

See the Template variable examples section for examples.

## Examples

### Basic examples

Execute a DCM project in PLAN mode to validate changes to a project without applying them:

```sqlexample
EXECUTE DCM PROJECT my_project
  PLAN
  FROM '@my_database.my_schema.my_stage/my_project';
```

Execute a DCM project in DEPLOY mode (to apply changes) to specify a deployment alias and
a configuration named PROD:

```sqlexample
EXECUTE DCM PROJECT my_project
  DEPLOY AS "my_update"
  USING CONFIGURATION PROD
  FROM '@my_database.my_schema.my_stage/my_project';
```

### Template variable examples

The following examples demonstrate how you can specify the value
for template variables in an EXECUTE DCM PROJECT statement.

**Override the template variable defined in the DCM project’s manifest file**

1. Define a template variable named `desc` in the manifest file:

   ```yaml
   manifest_version: 2
   type: DCM_PROJECT
   default_target: DCM_DEV
   targets:
     DCM_DEV:
       desc: "created by hello world project"
   ```
2. Create a definition file that uses the template variable:

   ```sqlexample
   DEFINE DATABASE NEW_DB;
   DEFINE TABLE NEW_DB.PUBLIC.TBL (ID INT) COMMENT = '{{desc}}';
   ```
3. Call the EXECUTE DCM PROJECT command in DEPLOY mode,
   and specify a value for the `desc` variable to override its default value in the manifest:

   ```sqlexample
   EXECUTE DCM PROJECT MY_PROJECT DEPLOY
     USING CONFIGURATION FIRST_CONFIG (desc => 'This object is mine')
     FROM '/my/project/source';
   ```

**Provide a value for a template variable not defined in the manifest file**

1. Create a definition file with the desired commands:

   ```sqlexample
   DEFINE DATABASE NEW_DB;
   DEFINE TABLE NEW_DB.PUBLIC.TBL (ID INT) COMMENT = '{{desc_new}}';
   ```
2. Call the EXECUTE DCM PROJECT command, and specify a value for the `desc_new` variable:

   ```sqlexample
   EXECUTE DCM PROJECT MY_PROJECT (desc_new => 'This object is mine');
   ```

---
title: EXECUTE IMMEDIATE
source: https://docs.snowflake.com/en/sql-reference/sql/execute-immediate.md
section: SQL Commands
---

# EXECUTE IMMEDIATE

Executes a string that contains a SQL statement or a
[Snowflake Scripting statement](../../developer-guide/snowflake-scripting/blocks.md).

You can use EXECUTE IMMEDIATE to do the following:

* In a Snowflake Scripting block, execute dynamic SQL, where parts of the SQL statement aren’t known
  until runtime. For examples, see Executing dynamic SQL in a Snowflake Scripting block.
* Set a session variable to a SQL statement, and reference the session variable to run the SQL statement.
  For an example, see Setting a session variable to a statement and executing it.
* If you are using SnowSQL or Snowsight, run a Snowflake Scripting anonymous block.
  For an example, see Running an anonymous block in SnowSQL or Snowsight.

## Syntax

```sqlsyntax
EXECUTE IMMEDIATE '<string_literal>'
    [ USING ( <bind_variable> [ , <bind_variable> ... ] ) ]

EXECUTE IMMEDIATE <variable>
    [ USING ( <bind_variable> [ , <bind_variable> ... ] ) ]

EXECUTE IMMEDIATE $<session_variable>
    [ USING ( <bind_variable> [ , <bind_variable> ... ] ) ]
```

## Required parameters

`'string_literal'` or . `variable` or . `session_variable`
:   A string literal, Snowflake Scripting [variable](../../developer-guide/snowflake-scripting/variables.md), or
    [session variable](../session-variables.md) that contains a statement. A statement can be any of the following:

    * A single SQL statement
    * A stored procedure call
    * A control-flow statement (for example, [looping](../../developer-guide/snowflake-scripting/loops.md) or
      [branching](../../developer-guide/snowflake-scripting/branch.md) statement)
    * A [block](../../developer-guide/snowflake-scripting/blocks.md)

    If you use a session variable, the length of the statement must not exceed the
    [maximum size of a session variable (256 bytes)](../session-variables.md).

## Optional parameters

`USING ( bind_variable [ , bind_variable ... ] )`
:   Specifies one or more bind variables that hold values to be used in the cursor’s query definition (for example,
    in a WHERE clause).

## Returns

EXECUTE IMMEDIATE returns the result of the executed statement. For example, if the string or variable contained a SELECT
statement, the result set of the SELECT statement is returned.

## Usage notes

* The `string_literal`, `variable`, or `session_variable` must contain only one statement.
  (A [block](../../developer-guide/snowflake-scripting/blocks.md) is considered one statement, even if the body of the block
  contains multiple statements.)
* A `session_variable` must be preceded by a dollar sign (`$`).
* A local `variable` must not be preceded by a dollar sign (`$`).

## Examples

The following are examples that use the EXECUTE IMMEDIATE command.

### Executing dynamic SQL in a Snowflake Scripting block

The following examples execute dynamic SQL in a Snowflake Scripting block.

#### Executing statements that contain variables

This example executes statements that are defined in two local variables in a
[Snowflake Scripting stored procedure](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).
This example also demonstrates that EXECUTE IMMEDIATE works not only with a string literal, but also
with an expression that evaluates to a string (VARCHAR).

```sqlexample
CREATE PROCEDURE execute_immediate_local_variable()
RETURNS VARCHAR
AS
DECLARE
  v1 VARCHAR DEFAULT 'CREATE TABLE temporary1 (i INTEGER)';
  v2 VARCHAR DEFAULT 'INSERT INTO temporary1 (i) VALUES (76)';
  result INTEGER DEFAULT 0;
BEGIN
  EXECUTE IMMEDIATE v1;
  EXECUTE IMMEDIATE v2  ||  ',(80)'  ||  ',(84)';
  result := (SELECT SUM(i) FROM temporary1);
  RETURN result::VARCHAR;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE PROCEDURE execute_immediate_local_variable()
RETURNS VARCHAR
AS
$$
DECLARE
  v1 VARCHAR DEFAULT 'CREATE TABLE temporary1 (i INTEGER)';
  v2 VARCHAR DEFAULT 'INSERT INTO temporary1 (i) VALUES (76)';
  result INTEGER DEFAULT 0;
BEGIN
  EXECUTE IMMEDIATE v1;
  EXECUTE IMMEDIATE v2  ||  ',(80)'  ||  ',(84)';
  result := (SELECT SUM(i) FROM temporary1);
  RETURN result::VARCHAR;
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL execute_immediate_local_variable();
```

```output
+----------------------------------+
| EXECUTE_IMMEDIATE_LOCAL_VARIABLE |
|----------------------------------|
| 240                              |
+----------------------------------+
```

#### Executing a statement that contains bind variables

This example uses EXECUTE IMMEDIATE to execute a SELECT statement that contains bind variables
in the USING parameter in a Snowflake Scripting stored procedure. First create the table and insert
the data:

```sqlexample
CREATE OR REPLACE TABLE invoices (id INTEGER, price NUMBER(12, 2));

INSERT INTO invoices (id, price) VALUES
  (1, 11.11),
  (2, 22.22);
```

Create the stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE min_max_invoices_sp(
    minimum_price NUMBER(12,2),
    maximum_price NUMBER(12,2))
  RETURNS TABLE (id INTEGER, price NUMBER(12, 2))
  LANGUAGE SQL
AS
DECLARE
  rs RESULTSET;
  query VARCHAR DEFAULT 'SELECT * FROM invoices WHERE price > ? AND price < ?';
BEGIN
  rs := (EXECUTE IMMEDIATE :query USING (minimum_price, maximum_price));
  RETURN TABLE(rs);
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE OR REPLACE PROCEDURE min_max_invoices_sp(
    minimum_price NUMBER(12,2),
    maximum_price NUMBER(12,2))
  RETURNS TABLE (id INTEGER, price NUMBER(12, 2))
  LANGUAGE SQL
AS
$$
DECLARE
  rs RESULTSET;
  query VARCHAR DEFAULT 'SELECT * FROM invoices WHERE price > ? AND price < ?';
BEGIN
  rs := (EXECUTE IMMEDIATE :query USING (minimum_price, maximum_price));
  RETURN TABLE(rs);
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL min_max_invoices_sp(20, 30);
```

```output
+----+-------+
| ID | PRICE |
|----+-------|
|  2 | 22.22 |
+----+-------+
```

### Setting a session variable to a statement and executing it

This example executes a statement defined in a session variable:

```sqlexample
SET stmt =
$$
    SELECT PI();
$$
;
```

```sqlexample
EXECUTE IMMEDIATE $stmt;
```

```output
+-------------+
|        PI() |
|-------------|
| 3.141592654 |
+-------------+
```

### Running an anonymous block in SnowSQL or Snowsight

When you run a [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) anonymous block
in [SnowSQL](../../user-guide/snowsql.md) or [Snowsight](../../user-guide/ui-snowsight-gs.md), you must specify the block as
a string literal (delimited by single quotes or double dollar signs), and you must pass the
block to the EXECUTE IMMEDIATE command. For more information, see
[Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md).

This example runs an anonymous block passed to the EXECUTE IMMEDIATE command:

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  radius_of_circle FLOAT;
  area_of_circle FLOAT;
BEGIN
  radius_of_circle := 3;
  area_of_circle := PI() * radius_of_circle * radius_of_circle;
  RETURN area_of_circle;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|    28.274333882 |
+-----------------+
```

---
title: EXECUTE IMMEDIATE FROM
source: https://docs.snowflake.com/en/sql-reference/sql/execute-immediate-from.md
section: SQL Commands
---

# EXECUTE IMMEDIATE FROM

EXECUTE IMMEDIATE FROM executes the SQL statements specified in a file in a stage. The file can contain
SQL statements or [Snowflake Scripting blocks](../../developer-guide/snowflake-scripting/blocks.md). The statements must be syntactically
correct SQL statements.

You can use the EXECUTE IMMEDIATE FROM command to execute the statements in a file from any Snowflake session.

This feature provides a mechanism to control the deployment and management of your Snowflake objects and code. For example, you can execute
a stored script to create a standard Snowflake environment for all your accounts. The configuration script might include statements
that create users, roles, databases, and schemas for every new account.

## Jinja2 templating

EXECUTE IMMEDIATE FROM can also execute a template file using the Jinja2 templating language.
A template can contain variables and expressions, enabling the use of loops, conditionals, variable substitution, macros, and more.
Templates can also include other templates and can import macros defined in other files located on a stage.

For more information about the templating language, see the [Jinja2 documentation](https://jinja.palletsprojects.com/).

The template file to be executed must be:

* A syntactically valid Jinja2 template.
* Located in a stage or [Git repository clone](../../developer-guide/git/git-overview.md).
* Able to render syntactically valid SQL statements.

Templating enables more flexible control structures and parameterization using environment variables. For example, you can use
a template to dynamically choose the deployment target of the objects defined in the script. To use a template to render a
SQL script, use the templating directive or add a
USING clause with at least one template variable.

### Templating directive

You can use either one of the two templating directives.

The recommended directive uses valid SQL syntax:

```sqlexample
--!jinja
```

Optionally, you can use the alternative directive:

```sqlexample
#!jinja
```

> **Note:**
>
> Only a byte order mark and up to 10 whitespace characters (newlines, tabs, spaces) may be placed in front of the directive.
> Any characters that come after the directive on the same line will be ignored.

### Using content from staged files in a template

A template can load other staged files either directly through the
[SnowflakeFile API](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/files)
or through Jinja2’s [include](https://jinja.palletsprojects.com/en/stable/templates/#include),
[import](https://jinja.palletsprojects.com/en/stable/templates/#import), and
[inheritance](https://jinja.palletsprojects.com/en/stable/templates/#template-inheritance) features.

Files can be referenced by absolute paths:

```sqlexample
{% include "@my_stage/path/to/my_template" %}
{% import "@my_stage/path/to/my_template" as my_template %}
{% extends "@my_stage/path/to/my_template" %}
{{ SnowflakeFile.open("@my_stage/path/to/my_template", 'r', require_scoped_url = False).read() }}
```

Include, import, and extends also support relative paths while the SnowflakeFile API supports scoped Snowflake file URLs:

```sqlexample
{% include "my_template" %}
{% import "../my_template" as my_template %}
{% extends "/path/to/my_template" %}
```

See also:
:   [EXECUTE IMMEDIATE](execute-immediate.md)

## Syntax

```sqlsyntax
EXECUTE IMMEDIATE
  FROM { absoluteFilePath | relativeFilePath }
  [ USING ( <key> => <value> [ , <key> => <value> [ , ... ] ]  )  ]
  [ DRY_RUN = { TRUE | FALSE } ]
```

Where:

> ```sqlsyntax
> absoluteFilePath ::=
>    @[ <namespace>. ]<stage_name>/<path>/<filename>
> ```
>
> ```sqlsyntax
> relativeFilePath ::=
>   '[ / | ./ | ../ ]<path>/<filename>'
> ```

## Required parameters

### Absolute file path (`absoluteFilePath`)

`namespace`
:   Database and/or schema in which the internal or external stage resides, in the form of `database_name.schema_name`
    or `schema_name`. The namespace is optional if a database and schema are currently in use for the user session; otherwise,
    it is required.

`stage_name`
:   Name of the internal or external stage.

`path`
:   Case-sensitive path to the file in the stage.

`filename`
:   Name of the file to execute. It must contain syntactically correct and valid SQL statements. Each statement must be
    separated by a semicolon.

### Relative file path (`relativeFilePath`)

`path`
:   Case-sensitive relative path to the file in the stage. Relative paths support established conventions such as a leading `/`
    to indicate the root of a stage’s file system, `./` to refer to the current directory (the directory the parent file is
    located in) and `../` to refer to the parent directory. For more information, see Usage notes.

`filename`
:   Name of the file to execute. It must contain syntactically correct and valid SQL statements. Each statement must be
    separated by a semicolon.

## Optional parameters

`USING ( <key> => <value> [ , <key> => <value> [ , ... ] ]  )`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Allows you to pass one or more key-value pairs that can be used to parameterize template expansion. The key-value pairs
    must form a comma-separated list.

    When the USING clause is present, the file is first rendered as a Jinja2 template
    before being executed as a SQL script.

    Where:

    > * `key` is the name of the template variable. The template variable name can optionally be enclosed in double quotes
    >   (`"`).
    > * `value` is the value to assign to the variable in the template. String values must be enclosed in `'` or
    >   `$$`. For an example, see Templating usage notes.

`DRY_RUN = { TRUE | FALSE }`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    Specifies whether to preview the rendered file without executing it as a SQL script.

    * `TRUE` returns the rendered file contents without executing the SQL statements.
    * `FALSE` renders SQL statements from the template and executes those statements.

    Default: `FALSE`

## Returns

EXECUTE IMMEDIATE FROM returns:

* The result of the last statement in the file if all statements are successfully executed.
* The error message, if any statement in the file failed.

  If there is an error in any statement in the file, the EXECUTE IMMEDIATE FROM command fails and returns the error message
  of the failed statement.

  > **Note:**
  >
  > If the EXECUTE IMMEDIATE FROM command fails and returns an error message, any statements in the file prior to the failed statement
  > have successfully completed.

## Access control requirements

* The [role](../../user-guide/security-access-control-overview.md) used to execute the EXECUTE IMMEDIATE FROM command must have the
  USAGE (external stage) or READ (internal stage) privilege on the stage where the file is located.

* The role used to execute the file can only execute the statements in the file for which it has privileges.
  For example, if there is a CREATE TABLE statement in the file, the role must have the
  [necessary privileges to create a table](create-table.md) in the account or the statement fails.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The SQL statements in a file to be executed can include EXECUTE IMMEDIATE FROM statements:

  + Nested EXECUTE IMMEDIATE FROM statements *can* use relative file paths.

    Relative paths are evaluated in respect to the stage and file path of the parent file. If the relative file path starts with
    `/`, the path starts at the root directory of the stage containing the parent file.

    For an example, see Examples.
  + Relative file paths must be enclosed in single quotes (`'`) or `$$`.
  + The maximum execution depth for nested files is 5.
* Absolute file paths can optionally be enclosed in single quotes (`'`) or `$$`.
* The file to be executed cannot be larger than 10MB in size.
* The file to be executed must be encoded in [UTF-8](https://en.wikipedia.org/wiki/UTF-8).
* The file to be executed must be uncompressed. If you use the [PUT](put.md) command to upload a file to an internal
  stage, you must explicitly set the [AUTO_COMPRESS parameter](put.md) to FALSE.

  For example, upload `my_file.sql` to `my_stage`:

  ```sqlexample
  PUT file://~/sql/scripts/my_file.sql @my_stage/scripts/
    AUTO_COMPRESS=FALSE;
  ```
* The execution of all files in a directory is not supported. For example, `EXECUTE IMMEDIATE FROM @stage_name/scripts/`
  results in an error.

## Templating usage notes

* Variable names in templates are case-sensitive.
* The template variable name can be optionally enclosed in double quotes. Enclosing the variable name can be useful if any
  [reserved keywords](../reserved-keywords.md) are used as variable names.
* The following parameter types are supported in the USING clause:

  + String. Must be enclosed by `'` or `$$`. For example, `USING (a => 'a', b => $$b$$)`.
  + Number (decimal and integer). For example, `USING (a => 1, b => -1.23)`.
  + Boolean. For example, `USING (a => TRUE, b => FALSE)`.
  + NULL. For example, `USING (a => NULL)`.

    > **Note:**
    >
    > The Jinja2 templating engine interprets a NULL value as the Python NoneType type.
  + [Session variables](../session-variables.md). For example, `USING (a => $var)`. Only session variables holding
    values of supported data types are allowed.
  + [Bind variables](../bind-variables.md). For example, `USING (a => :var)`. Only bind variables
    holding values of supported data types are allowed. You can use bind variables to pass stored procedure arguments to a template.
* Files in Snowflake Git repositories or in Snowflake Native Apps cannot be accessed from the template.
* The maximum result size for template rendering is 100,000 bytes.
* Templates are rendered using the Jinja2 version 3.1.6 templating engine.

## Troubleshooting EXECUTE IMMEDIATE FROM errors

This section contains some common errors that result from an EXECUTE IMMEDIATE FROM statement and how you can resolve them.

* File errors
* Stage errors
* Access control errors
* Templating errors

### File errors

|  |  |
| --- | --- |
| Error | ```output 001501 (02000): File '<directory_name>' not found in stage '<stage_name>'. ``` |
| Cause | There are multiple causes for this error:   * The file does not exist. * The file name is the root of a directory. For example `@stage_name/scripts/`. |
| Solution | Verify the name of the file and confirm the file exists. Executing all the files in a directory is not supported. |

|  |  |
| --- | --- |
| Error | ```output 001503 (42601): Relative file references like '<filename.sql>' cannot be used in top-level EXECUTE IMMEDIATE calls. ``` |
| Cause | The statement was executed using a relative file path outside of a file execution. |
| Solution | A relative file path can only be used in EXECUTE IMMEDIATE FROM statements in a file. Use the absolute file path for the file. For more information, see Usage notes. |

|  |  |
| --- | --- |
| Error | ```output 001003 (42000): SQL compilation error: syntax error line <n> at position <m> unexpected '<string>'. ``` |
| Cause | The file contains SQL syntax errors. |
| Solution | Fix the syntax errors in the file and reupload the file to the stage. |

### Stage errors

|  |  |
| --- | --- |
| Error | ```output 002003 (02000): SQL compilation error: Stage '<stage_name>' does not exist or not authorized. ``` |
| Cause | The stage does not exist or you do not have access to the stage. |
| Solution | * Verify the name of the stage and confirm the stage exists. * Execute the statement using a role that has the required privileges to access the stage. For more information, see   Access control requirements. |

### Access control errors

|  |  |
| --- | --- |
| Error | ```output 003001 (42501): Uncaught exception of type 'STATEMENT_ERROR' in file <file_name> on line <n> at position <m>: SQL access control error: Insufficient privileges to operate on schema '<schema_name>' ``` |
| Cause | The role used to execute the statement does not have the privileges required to execute some or all of the statements in the file. |
| Solution | Use a role that has the appropriate privileges to execute the statements in the file. For more information, see Access control requirements. |

See also: Stage errors.

### Templating errors

|  |  |
| --- | --- |
| Error | ```output 001003 (42000): SQL compilation error: syntax error line [n] at position [m] unexpected '{'. ``` |
| Cause | The file contains templating constructs (for example, `{{ table_name }}`) but is not rendered using the templating engine. If the template is not rendered, the lines of text in the template are executed as SQL statements. The templating constructs in the file are likely to result in SQL syntax errors. |
| Solution | Add a templating directive or re-execute the statement with the USING clause and specify at least one template variable. |

|  |  |
| --- | --- |
| Error | ```output 000005 (XX000): Python Interpreter Error: jinja2.exceptions.UndefinedError: '<key>' is undefined in template processing ``` |
| Cause | If any variables used in the template are left unspecified in the USING clause, an error occurs. |
| Solution | Verify the names and number of variables in the template and update the USING clause to include values for all template variables. |

|  |  |
| --- | --- |
| Error | ```output 001510 (42601): Unable to use value of template variable '<key>' ``` |
| Cause | The value for the variable `key` is an unsupported type. |
| Solution | Verify that you are using a supported parameter type for the template variable value. For more information, see the Templating usage notes. |

|  |  |
| --- | --- |
| Error | ```output 001518 (42601): Size of expanded template exceeds limit of 100,000 bytes. ``` |
| Cause | The size of the rendered template exceeds the current limit. |
| Solution | Split your templated file into multiple smaller templates and add a new script to execute them sequentially, while passing down template variables to the nested scripts. |

## Examples

### Basic example

This example executes the file `create-inventory.sql` located in stage `my_stage`.

1. Create a file named `create-inventory.sql` with the following statements:

   ```sqlexample
   CREATE OR REPLACE TABLE my_inventory(
     sku VARCHAR,
     price NUMBER
   );

   EXECUTE IMMEDIATE FROM './insert-inventory.sql';

   SELECT sku, price
     FROM my_inventory
     ORDER BY price DESC;
   ```
2. Create a file named `insert-inventory.sql` with the following statements:

   ```sqlexample
   INSERT INTO my_inventory
     VALUES ('XYZ12345', 10.00),
            ('XYZ81974', 50.00),
            ('XYZ34985', 30.00),
            ('XYZ15324', 15.00);
   ```
3. Create an internal stage `my_stage`:

   ```sqlexample
   CREATE STAGE my_stage;
   ```
4. Upload both local files to the stage using the [PUT](put.md) command:

   ```sqlexample
   PUT file://~/sql/scripts/create-inventory.sql @my_stage/scripts/
     AUTO_COMPRESS=FALSE;

   PUT file://~/sql/scripts/insert-inventory.sql @my_stage/scripts/
     AUTO_COMPRESS=FALSE;
   ```
5. Execute the `create-inventory.sql` script located in `my_stage`:

   ```sqlexample
   EXECUTE IMMEDIATE FROM @my_stage/scripts/create-inventory.sql;
   ```

   Returns:

   ```output
   +----------+-------+
   | SKU      | PRICE |
   |----------+-------|
   | XYZ81974 |    50 |
   | XYZ34985 |    30 |
   | XYZ15324 |    15 |
   | XYZ12345 |    10 |
   +----------+-------+
   ```

### A simple template example

1. Create a template file `setup.sql` with two variables and the templating directive:

   ```sqlexample
   --!jinja

   CREATE SCHEMA {{env}};

   CREATE TABLE RAW (COL OBJECT)
       DATA_RETENTION_TIME_IN_DAYS = {{retention_time}};
   ```
2. Create a stage — *optional* if you already have a stage to which you can upload files.

   For example, create an internal stage in Snowflake:

   ```sqlexample
   CREATE STAGE my_stage;
   ```
3. Upload the file to your stage.

   For example, use the [PUT](put.md) command from your local environment to upload file `setup.sql`
   to stage `my_stage`:

   ```sqlexample
   PUT file://path/to/setup.sql @my_stage/scripts/
     AUTO_COMPRESS=FALSE;
   ```
4. Execute the file `setup.sql`:

   ```sqlexample
   EXECUTE IMMEDIATE FROM @my_stage/scripts/setup.sql
       USING (env=>'dev', retention_time=>0);
   ```

### A template example with macros, conditionals, loops, and imports

1. Create a template file containing a macro definition.

   For example, create a file `macros.jinja` in your local environment:

   ```sqlexample
   {%- macro get_environments(deployment_type) -%}
     {%- if deployment_type == 'prod' -%}
       {{ "prod1,prod2" }}
     {%- else -%}
       {{ "dev,qa,staging" }}
     {%- endif -%}
   {%- endmacro -%}
   ```
2. Create a template file and add the templating directive (`--!jinja2`) to the top of the file.

   After the templating directive, add an `import` statement to import the macro defined in the
   file that you created in the previous step.
   For example, create a file `setup-env.sql` in your local environment:

   ```sqlexample
   --!jinja2
   {% from "macros.jinja" import get_environments %}

   {%- set environments = get_environments(DEPLOYMENT_TYPE).split(",") -%}

   {%- for environment in environments -%}
     CREATE DATABASE {{ environment }}_db;
     USE DATABASE {{ environment }}_db;
     CREATE TABLE {{ environment }}_orders (
       id NUMBER,
       item VARCHAR,
       quantity NUMBER);
     CREATE TABLE {{ environment }}_customers (
       id NUMBER,
       name VARCHAR);
   {% endfor %}
   ```
3. Create a stage — *optional* if you already have a stage to which you can upload files.

   For example, create an internal stage in Snowflake:

   ```sqlexample
   CREATE STAGE my_stage;
   ```
4. Upload the file to your stage.

   For example, use the [PUT](put.md) command from your local environment to upload the files
   `setup-env.sql` and `macros.jinja` to the stage `my_stage`:

   ```sqlexample
   PUT file://path/to/setup-env.sql @my_stage/scripts/
     AUTO_COMPRESS=FALSE;
   PUT file://path/to/macros.jinja @my_stage/scripts/
     AUTO_COMPRESS=FALSE;
   ```
5. Preview the SQL statements rendered by the template to check for any problems with your Jinja2 code:

   ```sqlexample
   EXECUTE IMMEDIATE FROM @my_stage/scripts/setup-env.sql
     USING (DEPLOYMENT_TYPE => 'prod') DRY_RUN = TRUE;
   ```

   Returns:

   ```output
   +----------------------------------+
   | rendered file contents           |
   |----------------------------------|
   | --!jinja2                        |
   | CREATE DATABASE prod1_db;        |
   |   USE DATABASE prod1_db;         |
   |   CREATE TABLE prod1_orders (    |
   |     id NUMBER,                   |
   |     item VARCHAR,                |
   |     quantity NUMBER);            |
   |   CREATE TABLE prod1_customers ( |
   |     id NUMBER,                   |
   |     name VARCHAR);               |
   | CREATE DATABASE prod2_db;        |
   |   USE DATABASE prod2_db;         |
   |   CREATE TABLE prod2_orders (    |
   |     id NUMBER,                   |
   |     item VARCHAR,                |
   |     quantity NUMBER);            |
   |   CREATE TABLE prod2_customers ( |
   |     id NUMBER,                   |
   |     name VARCHAR);               |
   |                                  |
   +----------------------------------+
   ```
6. Execute the file `setup-env.sql`:

   ```sqlexample
   EXECUTE IMMEDIATE FROM @my_stage/scripts/setup-env.sql
     USING (DEPLOYMENT_TYPE => 'prod');
   ```

---
title: EXECUTE JOB SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/execute-job-service.md
section: SQL Commands
---

# EXECUTE JOB SERVICE

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Executes a Snowpark Container Services service as a job.

A service, created using [CREATE SERVICE](create-service.md), is long-running and you must explicitly stop it when it is no longer needed. On the other hand, a job, created using EXECUTE JOB SERVICE, is a service that terminates when your code exits, similar to a stored procedure. When all containers exit, the job is done.

By default, the job runs synchronously; the EXECUTE JOB SERVICE command finishes only after all containers exit.

Alternatively, you can run the job service asynchronously by specifying the optional `ASYNC` parameter. In this case, the command returns immediately while the job is running. You can use the [DESCRIBE SERVICE](desc-service.md) command to poll for job completion and then call the [SYSTEM$WAIT_FOR_SERVICES](../functions/system_wait_for_services.md) function to wait for the job to complete.

After a job service completes, Snowflake automatically cleans up the resources allocated to the job service to help reduce costs. You can still access job metadata for up to 30 days by using the [DESCRIBE SERVICE](desc-service.md) and [SHOW SERVICES](show-services.md) commands. After 30 days, Snowflake automatically deletes the job.

When the job is done, if no other jobs or services are running on that compute pool node, Snowflake might consider the node is idle and reclaim it. When that happens, SYSTEM$GET_SERVICE_LOGS will not return local container logs from the job containers. You might consider persisting the container logs to an event table. For more information, see [Publishing and accessing container logs](../../developer-guide/snowpark-container-services/monitoring-services.md).

Note that the command parameters must be specified in a specific order. For more information, see Usage Notes.

See also:
:   [SYSTEM$GET_SERVICE_STATUS — Deprecated](../functions/system_get_service_status.md) , [SYSTEM$GET_SERVICE_LOGS](../functions/system_get_service_logs.md)

## Syntax

```sqlsyntax
EXECUTE JOB SERVICE
  IN COMPUTE POOL <compute_pool_name>
  {
     fromSpecification
     | fromSpecificationTemplate
  }
  [ NAME = [<db>.<schema>.]<name> ]
  [ ASYNC = { TRUE | FALSE } ]
  [ REPLICAS = = <num> ]
  [ QUERY_WAREHOUSE = <warehouse_name> ]
  [ COMMENT = '<string_literal>']
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <EAI_name> [ , ... ] ) ]
```

Where:

> ```sqlsyntax
> fromSpecification ::=
>   {
>     FROM @<stage> SPECIFICATION_FILE = '<yaml_file_stage_path>'
>     | FROM SPECIFICATION <specification_text>
>   }
> ```
>
> ```sqlsyntax
> fromSpecificationTemplate ::=
>   {
>     FROM @<stage> SPECIFICATION_TEMPLATE_FILE = '<yaml_file_stage_path>'
>     | FROM SPECIFICATION_TEMPLATE <specification_text>
>   }
>   USING ( <key> => <value> [ , <key> => <value> [ , ... ] ]  )
> ```

## Required parameters

`IN COMPUTE POOL compute_pool_name`
:   Specifies the name of the compute pool in your account on which to run the service.

`FROM stage`
:   Specifies the Snowflake internal stage where the specification file is stored; for example, `@tutorial_stage`.

`SPECIFICATION_FILE = 'yaml_file_stage_path'`
:   Specifies the path to the [service specification](../../developer-guide/snowpark-container-services/specification-reference.md)
    file on the stage; for example, `'some-dir/echo_spec.yaml'`.

`SPECIFICATION_TEMPLATE_FILE = 'yaml_file_stage_path'`
:   Specifies the path to the [service specification](../../developer-guide/snowpark-container-services/specification-reference.md)
    template file on the stage; for example, `'some-dir/echo_template_spec.yaml'`. When `SPECIFICATION_TEMPLATE_FILE` is specified, the `USING` parameter is required.

`FROM SPECIFICATION specification_text`
:   Specifies [service specification](../../developer-guide/snowpark-container-services/specification-reference.md). You can use
    a [pair of dollar signs](../data-types-text.md) (`$$`) to delimit the beginning and ending of the
    specification string.

`FROM SPECIFICATION_TEMPLATE specification_text`
:   Specifies [service specification](../../developer-guide/snowpark-container-services/specification-reference.md). You can use a
    [pair of dollar signs](../data-types-text.md) (`$$`) to delimit the beginning and ending of the
    specification string. When `SPECIFICATION_TEMPLATE` is specified, the `USING` parameter is required.

## Optional parameters

`NAME = [db.schema.]name`
:   The name (that is the identifier) for the service, that executes like a job; it must be unique for the schema in which the service
    is created.

    Quoted names for special characters or case-sensitive names are not supported. The same constraint also applies to database
    and schema names where you create a service. That is, database and schema names without quotes are valid when creating a
    service.

    Default: If not specified, Snowflake generates a name for the service in a format `JOB_<query_job_uuid>`.

`ASYNC = { TRUE | FALSE }`
:   Specifies whether to execute the job asynchronously.

    Default: FALSE

`REPLICAS = num`
:   Specifies the number of job replicas to run. For more information,
    see [Run multiple replicas of a job service (batch jobs)](../../developer-guide/snowpark-container-services/working-with-services.md).

    Default: 1.

`QUERY_WAREHOUSE = warehouse_name`
:   Warehouse to use if a service container connects to Snowflake to execute a query but does not explicitly specify a warehouse
    to use. This parameter also supports object references in Native Apps. For more information, see [Request references and object-level privileges from consumers](../../developer-guide/native-apps/requesting-refs.md).

    Default: none.

`EXTERNAL_ACCESS_INTEGRATIONS = ( EAI_name [ , ... ] )`
:   Specifies the names of the [external access integrations](../../developer-guide/external-network-access/creating-using-external-network-access.md)
    that allow your job to access external sites. The names in this list are case-sensitive. By default, application containers don’t have
    permission to access the internet. If you want to allow your job to access an external site, create an External Access Integration
    (EAI), and configure your job to use that integration. For more
    information, see [Configure service egress](../../developer-guide/snowpark-container-services/service-network-communications.md).

`COMMENT = 'string_literal'`
:   Specifies a comment for the service.

    Default: No value

`TAG ( tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ] )`
:   Specifies the [tag](../../user-guide/object-tagging/introduction.md) name and the tag string value.

    The tag value is always a string, and the maximum number of characters for the tag value is 256.

    For information about specifying tags in a statement, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

`USING ( key => value [ , key => value [ , ... ] ]  )`
:   Lets you provide values to parameterize specification template expansion.

    `USING` is required when using a specification template (`FROM SPECIFICATION_TEMPLATE_FILE` or `FROM SPECIFICATION_TEMPLATE`). The key-value pairs must form a comma-separated list.

    Where:

    * `key` is the name of the template variable. The template variable name can optionally be enclosed in double quotes
      (`"`).
    * `value` is the value to assign to the variable in the template. String values must be enclosed in `'` or
      `$$`. The value must either be alphanumeric or valid JSON.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SERVICE | Schema |  |
| USAGE | Compute pool |  |
| READ | Stage | This is the stage where the specification is stored. |
| READ | Image Repository | Repository of images referenced by the specification. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When calling EXECUTE JOB SERVICE, the parameters should be provided in this order: specify compute pool, followed by other properties, and finally the service specification (either provide specification file name on stage or inline specification).
* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

## Examples

### Execute a job asynchronously

Execute a Snowpark Container Services job service asynchronously.

```sqlexample-yaml
EXECUTE JOB SERVICE
  IN COMPUTE POOL tutorial_compute_pool
  NAME = tutorial_db.data_schema.example_job
  ASYNC = TRUE
  FROM @tutorial_stage
  FROM SPECIFICATION $$
  <job specification>
  $$;
```

### Execute a job with block storage mounted

Execute a job service with block storage configured in the specification.

```sqlexample-yaml
EXECUTE JOB SERVICE
  IN COMPUTE POOL tutorial_compute_pool
  NAME=tutorial_job_service
  FROM SPECIFICATION $$
  spec:
    container:
    - name: main
      image: /tutorial_db/data_schema/tutorial_repository/my_job_image:latest
      volumeMounts:
        - name: block-vol1
          mountPath: /opt/block/path
    volumes:
    - name: block-vol1
      source: block
      size: 10Gi
      blockConfig:
        iops: 4000
        throughput: 200
  $$;
```

The command does not specify the optional `ASYNC` parameter. Therefore, Snowflake executes the command synchronously.

### Execute a batch job

Run 3 instances of a job service by specifying REPLICAS parameter.

```sqlexample-yaml
EXECUTE JOB SERVICE
  IN COMPUTE POOL my_pool
  NAME = tutorial_2_job_service
  REPLICAS = 3
  FROM SPECIFICATION $$
  spec:
    containers:
    - name: main
      image: my_repo/my_job_image:latest
$$;
```

Use [SHOW SERVICE INSTANCES IN SERVICE](show-service-instances-in-service.md) command to find the status of each job service replica.

```sqlexample
SHOW SERVICE INSTANCES IN SERVICE tutorial_2_job_service;
```

Example output:

```output
+---------------+-------------+------------------------+----------------+-------------+-----------+------------------------------------------------------------------+----------------------+----------------------+--------------+
| database_name | schema_name | service_name           | service_status | instance_id | status    | spec_digest                                                      | creation_time        | start_time           | ip_address   |
|---------------+-------------+------------------------+----------------+-------------+-----------+------------------------------------------------------------------+----------------------+----------------------+--------------|
| TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_2_JOB_SERVICE | DONE           | 0           | SUCCEEDED | 80b42d8e1ec39dbaa7e2b9b6591e4b0cc11f74304703f56b50e1dfc10f421ac5 | 2025-08-07T00:44:49Z | 2025-08-07T00:44:49Z | 10.244.0.11  |
| TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_2_JOB_SERVICE | DONE           | 1           | SUCCEEDED | 80b42d8e1ec39dbaa7e2b9b6591e4b0cc11f74304703f56b50e1dfc10f421ac5 | 2025-08-07T00:44:49Z | 2025-08-07T00:44:57Z | 10.244.0.12  |
| TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_2_JOB_SERVICE | DONE           | 2           | SUCCEEDED | 80b42d8e1ec39dbaa7e2b9b6591e4b0cc11f74304703f56b50e1dfc10f421ac5 | 2025-08-07T00:44:49Z | 2025-08-07T00:44:49Z | 10.244.0.203 |
+---------------+-------------+------------------------+----------------+-------------+-----------+------------------------------------------------------------------+----------------------+----------------------+--------------+
```

In the output, the `instance_id` and `status` columns show the replica number and its status.

---
title: EXECUTE NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/execute-notebook.md
section: SQL Commands
---

# EXECUTE NOTEBOOK

Executes the notebook outside the Notebook Editor. For example, you can run EXECUTE NOTEBOOK by itself from a worksheet, nest it within another
Snowflake executable such as a stored procedure or task, or use it in a third-party orchestrator.

The command runs the latest code from all cells in the notebook. Results are accessible from the Notebook Editor.

> **Note:**
>
> EXECUTE NOTEBOOK also requires the QUERY_WAREHOUSE parameter to be set, otherwise an error occurs. To set the QUERY_WAREHOUSE parameter,
> use the [ALTER NOTEBOOK](alter-notebook.md) command.

## Syntax

```sqlsyntax
EXECUTE NOTEBOOK <name>([ <parameter_string> [ , ... ] ]);
```

## Required parameters

`name`
:   Specifies the identifier (i.e. name) for the notebook; must be unique for the schema in which the notebook is created. Must be fully qualified
    if the notebook is not stored in the current `database.schema` you are operating in.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (for example, “My object”). Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`parameter_string`
:   Optionally pass in arguments to a notebook. In a Python cell in the notebook, you can access these arguments by using the `sys.argv` [variable](https://docs.python.org/3/library/sys.html#sys.argv).

    Only strings are supported; other data types (such as integers or booleans) are interpreted as NULL.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Notebook | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

When you run a notebook using the EXECUTE NOTEBOOK command:

* Python cells are executed on the compute pool defined by the RUNTIME parameter.
* SQL and Snowpark queries are executed using the virtual warehouse specified in the WAREHOUSE parameter.
* Snowflake doesn’t support embedding the `EXECUTE NOTEBOOK` command in a task that is configured to [EXECUTE AS USER](../../user-guide/tasks-intro.md).
  You will not see an error message when creating such a task, but when the task is executed, it will fail.
* When you execute a notebook that uses a compute pool, the Python code runs on the compute pool. However, you might see activity in
  Query History showing that a warehouse was used to run the EXECUTE NOTEBOOK command. This is expected behavior. The warehouse is
  used briefly to initialize the notebook execution environment, but it does not consume any warehouse credits. All code execution is
  handled by the compute pool.

## Example

The following example triggers the default version of the specified notebook without passing in any arguments:

```sqlexample
EXECUTE NOTEBOOK MY_DB.PUBLIC.MY_NOTEBOOK();
```

## Pass parameters to a notebook

You can optionally pass in arguments when running a notebook. In Python cells, you can access these arguments by using the `sys.argv` variable, which is [a built-in Python list that holds command-line arguments](https://docs.python.org/3/library/sys.html#sys.argv).

You can use arguments to customize notebook behavior; for example, you can pass in input values, specify a target
environment, or adjust execution logic based on these arguments.

### Example

```sqlexample
EXECUTE NOTEBOOK MY_DATABASE.PUBLIC.MY_NOTEBOOK(
  'parameter_string a,b,c,d',
  'target_database=PROD_DB'
);
```

In a Python cell in the notebook, you can access each argument as a string in the `sys.argv` list.

To learn how to access and use these arguments from a notebook (including how to parse lists or extract key-value pairs), see [Develop and run code in Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks-develop-run.md).

---
title: EXECUTE NOTEBOOK PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/execute-notebook-project.md
section: SQL Commands
---

# EXECUTE NOTEBOOK PROJECT

Executes a notebook stored in a notebook project (NPO). This command runs the notebook in a non-interactive (headless) mode and is useful for CI/CD
pipelines and other orchestrated workflows where you want to pass parameters or lock dependency versions for repeatable runs. The command can be run from:

* SQL files.
* Other Snowflake executables (Tasks).
* External orchestrators that issue SQL (for example, Airflow, Prefect, Dagster, CI/CD systems).

The command runs the notebook file you specify as `MAIN_FILE` using the runtime, compute pool, warehouse, and external access integrations you
configure.

> **Important:**
>
> Before triggering a non-interactive run, ensure that your notebook sets its execution context (database and schema) or uses fully qualified
> object names. For more information, see [Editing and running notebooks in Workspaces](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-edit-run.md).

See also:
[CREATE NOTEBOOK PROJECT](create-notebook-project.md), [CREATE TASK](create-task.md), [CI/CD workflow scenario](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-workflow-scenarios.md),
[Observability and logging for Notebooks in Workspaces](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-observability-logging.md), [Running notebooks with parameters](../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-parameters.md)

## Syntax

```sqlsyntax
EXECUTE NOTEBOOK PROJECT <database_name>.<schema_name>.<project_name>
  MAIN_FILE = 'notebook.ipynb'
  COMPUTE_POOL = '<compute_pool_name>'
  QUERY_WAREHOUSE = '<warehouse_name>'
  RUNTIME = '<runtime_version>'
  [ ARGUMENTS = '<parameter_string>' ]
  [ REQUIREMENTS_FILE = '<path/to/requirements.txt>' ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ];
```

## Required parameters

`database_name.schema_name.project_name`
:   Fully qualified identifier of the notebook project to execute.

    Must reference an existing notebook project created with [CREATE NOTEBOOK PROJECT](create-notebook-project.md).

    Must be fully qualified unless it resides in the current DATABASE and SCHEMA.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`MAIN_FILE = 'notebook_file_name.ipynb'`
:   Specifies the main notebook file within the workspace to execute (`path/to/notebook.ipynb`).

    Must be an `.ipynb` notebook file located in the workspace referenced by the project.

    The path is relative to the workspace root.

`COMPUTE_POOL = 'compute_pool_name'`
:   Specifies the compute pool used when executing the notebook on a Container Runtime.

    Required when the notebook runtime uses Snowpark Container Services.

`QUERY_WAREHOUSE = 'warehouse_name'`
:   Specifies the virtual warehouse used for executing SQL and Snowpark queries from the notebook.

    Required if the notebook performs SQL or Snowpark operations and no warehouse is otherwise configured.

    When using container runtimes, the warehouse handles query pushdown; Python executes on the compute pool.

`RUNTIME = 'runtime_version'`
:   Specifies the runtime image/version for executing the notebook (for example, `'1.0' or '2.2-CPU-PY3.11'`).

    Determines the Python version and execution environment used for the notebook execution.

    Corresponds to a Container Runtime image (CPU or GPU) or warehouse runtime variant.

## Optional parameters

Depending on how the project and runtime are configured, you may need to set the following parameters. The descriptions below define their
purpose and typical usage.

`ARGUMENTS = 'parameter_string'`
:   Optionally passes one or more string arguments to the notebook at runtime, which appear as command-line arguments in the `sys.argv` list.
    Arguments are useful for making notebook logic dynamic (for example, selecting an environment such as `env prod`).

    To pass multiple arguments, specify them in a single string separated by spaces. The arguments are parsed into `sys.argv` using
    whitespace as the delimiter. In a Python cell, access the arguments using `sys.argv[0]` for the notebook name, `sys.argv[1]` for
    the first argument, and so on.

    Only strings are supported; other data types (such as integers or Booleans) are interpreted as NULL.

    Examples:

    ```sqlexample
    ARGUMENTS = 'env prod';
    ```

    ```python
    import sys
    print(sys.argv)
    ```

`REQUIREMENTS_FILE = '<path/to/requirements.txt>'`
:   Optionally specifies a `requirements.txt` file in a workspace or on a stage to pre-install exact versions of libraries (such as pandas
    or scikit-learn) and other Python dependencies before notebook execution. Pinning dependencies is critical for idempotency and helps
    make notebook runs more repeatable, reducing errors caused by changes in library versions. The file must be accessible to the executing role.

`EXTERNAL_ACCESS_INTEGRATIONS = ( integration_name [ , ... ] )`
:   Specifies one or more external access integrations that the notebook can use during execution.

    Required when the notebook makes outbound network calls (for example, to external APIs).

    Each integration name must reference an existing external access integration.

    Multiple external access integrations can be specified in a comma-separated list inside the parentheses.

    Example:

    ```sqlexample
    EXTERNAL_ACCESS_INTEGRATIONS = (http_eai, s3_eai);
    ```

    > **Note:**
    >
    > The Snowflake-managed PyPI network rule `SNOWFLAKE.EXTERNAL_ACCESS.PYPI_RULE` is only accessible to the ACCOUNTADMIN role.
    > Consequently, using this rule in an External Access Integration (EAI) for notebook objects or scheduled tasks may cause them to fail.
    > To avoid this, create a user-defined network rule for PyPI and reference it in your external access integration. For more information,
    > see [Snowflake-managed egress network rules](../../user-guide/network-rules.md).

## Access control requirements

The role executing `EXECUTE NOTEBOOK PROJECT` must have either OWNERSHIP or USAGE privileges on the notebook project object (NPO).

In addition, the executing role must have USAGE and MONITOR on the query warehouse, and USAGE or OWNERSHIP on:

* The compute pool.
* The database and schema containing the notebook project.
* Tasks and external access integrations referenced by the command.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* It is not possible to use the EXECUTE NOTEBOOK PROJECT command from a notebook.
* You can call `EXECUTE NOTEBOOK PROJECT` from tasks, thus enabling notebook runs as part of larger workflows.
* Snowflake doesn’t support embedding the `EXECUTE NOTEBOOK PROJECT` command in a task that is configured to run using the [EXECUTE AS USER](../../user-guide/tasks-intro.md) clause.
  You will not see an error message when creating such a task, but when the task is executed, it will fail.
* Cell output visibility is restricted to the user who initiated the execution. Other users can’t view the output of cells executed through this command.
* When you run a notebook using the `EXECUTE NOTEBOOK PROJECT` command:

  + Notebook code is executed on the compute pool specified by the COMPUTE_POOL parameter using the runtime specified by the RUNTIME parameter.
  + SQL and Snowpark queries are executed using the warehouse specified by the QUERY_WAREHOUSE parameter.

## Examples

Execute a notebook project:

```sqlexample
EXECUTE NOTEBOOK PROJECT "sales_detection_db"."schema"."DEFAULT_PROJ_B32BCFD4"
  MAIN_FILE = 'notebook_file.ipynb'
  COMPUTE_POOL = 'test_X_CPU'
  QUERY_WAREHOUSE = 'ENG_INFRA_WH'
  RUNTIME = 'V2.2-CPU-PY3.10'
  ARGUMENTS = 'env prod'
  REQUIREMENTS_FILE = 'path/to/requirements.txt'
  EXTERNAL_ACCESS_INTEGRATIONS = ('test_EAI');
```

---
title: EXECUTE TASK
source: https://docs.snowflake.com/en/sql-reference/sql/execute-task.md
section: SQL Commands
---

# EXECUTE TASK

Manually triggers an asynchronous single run of a task (either a standalone task or the root task in a
[task graph](../../user-guide/tasks-graphs.md)) independent of the schedule defined for the task.

A successful run of a root task triggers a cascading run of child tasks in the task graph as their precedent task completes, as though the
root task had run on its defined schedule.

Additionally, you can manually trigger the re-execution of a previously failed task.

See also:
:   [CREATE TASK](create-task.md) , [DESCRIBE TASK](desc-task.md) , [ALTER TASK](alter-task.md) , [DROP TASK](drop-task.md) , [SHOW TASKS](show-tasks.md)

## Syntax

```sqlsyntax
EXECUTE TASK <name>
  [ USING CONFIG = <configuration_string> ]

EXECUTE TASK <name> RETRY LAST
```

## Parameters

`name`
:   Identifier for the standalone task or root task to run. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`USING CONFIG = configuration_string`
:   Specifies a configuration string in valid JSON format for this single execution.
    This parameter creates a new execution with the dynamically specified configuration, but doesn’t modify the task definition.

    Snowflake merges the dynamic configuration with the *default* configuration, which is
    the CONFIG parameter that you set in the task definition with CREATE or ALTER.
    For matching fields, Snowflake uses the dynamically specified values. For non-matching fields, Snowflake uses the
    values from the default configuration. For an example, see Use a dynamic CONFIG.

    The configuration string follows the same format as the CONFIG parameter in [CREATE TASK](create-task.md)
    or [ALTER TASK](alter-task.md):

    ```sqlsyntax
    CONFIG = $${"string1": value1 [, "string2": value2, ...] }$$
    ```

    Example:

    ```sqlexample
    CONFIG = $${"learning_rate": 0.2, "environment": "testing"}$$
    ```

`RETRY LAST`
:   Re-execute the last failed task of the [task graph](../../user-guide/tasks-graphs.md) with `name` restarting from where the tasks failed.

    To re-execute a task the following conditions must be met:

    * The last task graph run must be in state FAILED or CANCELED.
    * The task graph must not have been modified since it was last run.
    * The last failed graph run’s first attempt must have been executed in the last 14 days.

    To view task history, see either the [TASK_HISTORY](../functions/task_history.md) table function or the [Tasks page on Snowsight](../../user-guide/ui-snowsight-tasks.md).

    > **Note:**
    >
    > RETRY LAST creates a new graph run which begins execution at the last failed task(s).
    >
    > Specifically, all FAILED or CANCELED task runs are immediately re-executed,
    > and associated child tasks are scheduled if all of their predecessors execute successfully.
    >
    > Additionally the new task graph run produced by the retry will have an ATTEMPT NUMBER that is one greater than the previous failed
    > graph run, and share the same GRAPH_RUN_GROUP_ID as the retried, or original task graph run.

## Usage notes

* Executing a task requires either the OWNERSHIP or OPERATE privilege on the task.

  When the EXECUTE TASK command triggers a task run, Snowflake verifies that the role with the OWNERSHIP privilege on the task also has
  the USAGE privilege on the warehouse assigned to the task, as well as the global EXECUTE TASK privilege; if not, an error is produced.

  Tasks always run with the privileges of the original owner role, even if a different role with the OPERATE privilege uses EXECUTE TASK to
  run the task.
* By default, Snowflake runs tasks by using the system user with the privileges of the task owner role.
  To run a task as a specific user, configure the task with EXECUTE AS USER. For more information, see [Run tasks with user privileges](../../user-guide/tasks-intro.md).
* For the USING CONFIG option:

  + If the task graph is currently executing and you run this command, Snowflake waits for the current execution to
    complete before starting a new execution with the dynamic configuration.
  + If you run this command multiple times while a task is executing, Snowflake uses the configuration from the most recent
    command for the next run. Previous configurations are replaced and won’t be executed.
  + The dynamic configuration only applies to the single execution triggered by this command. Subsequent
    scheduled runs use the default CONFIG parameter from the task definition.
* The SQL command can only execute a standalone task or the root task in a task graph. If a child task is input, the command returns a
  user error.
* Manually executing a standalone or root task establishes a version of the task. The standalone task or entire task graph completes its
  run with this version. For more information about task versions, see [Versioning of task runs](../../user-guide/tasks-intro.md).
* A suspended root task is run without resuming the task; there is no need to explicitly resume the root task before you execute
  this SQL command. However, EXECUTE TASK does not automatically resume child tasks in the task graph. The command skips any child
  tasks that are suspended.

  To recursively resume all dependent tasks tied to a root task in a task graph, query the
  [SYSTEM$TASK_DEPENDENTS_ENABLE](../functions/system_task_dependents_enable.md) function rather than enabling each task individually (using ALTER TASK …
  RESUME).

  As a best practice when testing new or modified task graphs, set the root task to run on its intended production schedule
  but leave it in a suspended state. When you have tested the task graph successfully, resume the root task. Note that you must
  resume any suspended child tasks in the task graph for testing; otherwise, they are skipped during runs of the task graph.
* If no instance of the task is running, a new run starts immediately.
* If another instance is scheduled (that is, if the task shows a SCHEDULED state in the [TASK_HISTORY](../functions/task_history.md)
  output), the requested run replaces the scheduled run. The requested run starts immediately, using the current timestamp as the scheduled time.
* If the task or task graph is currently queueing or executing (that is, if the task shows an EXECUTING state in the
  [TASK_HISTORY](../functions/task_history.md) output), then the current run continues using the
  [task version](../../user-guide/tasks-intro.md) that was current when the command was executed. A new run is then scheduled to start,
  at a time depending on the task type:

  + For standalone tasks, a new run is scheduled to start after the current run completes.
  + For task graphs, the behavior depends on the OVERLAP_POLICY setting. For more information, see
    [OVERLAP_POLICY](create-task.md) in the CREATE TASK documentation.

  If the EXECUTE TASK command is executed again before the next scheduled run starts, the requested run replaces the scheduled run.
* If a task fails with an unexpected error, you can receive a notification about the error.
  For more information on configuring task error notifications refer to [Set up error notifications for tasks](../../user-guide/tasks-errors.md).
* To view the task information you can either:

  + In Snowsight, in the navigation menu, select Transformation » Tasks.
  + Call the [COMPLETE_TASK_GRAPHS](../functions/complete_task_graphs.md) table function, and examine the results.

## Examples

The following examples show how to manually trigger a task run and how to use a dynamic CONFIG.

### Manually trigger a task run

Manually trigger a run of a task named `mytask`:

```sqlexample
EXECUTE TASK mytask;
```

### Use a dynamic CONFIG

Create a root task named `my_root_task` with a default configuration:

```sqlexample
CREATE OR REPLACE TASK my_root_task
  WAREHOUSE = regress
  SCHEDULE = '10 m'
  CONFIG = $${
    "environment": "production",
    "output_paths": {
      "logs": "/prod/logs",
      "results": "/prod/results"
    }
  }$$
  AS ...;
```

Now, execute the task and specify a dynamic configuration:

```sqlexample
EXECUTE TASK my_root_task
  USING CONFIG=$${
    "output_paths": {
      "results": "/temp/testing"
    }
  }$$;
```

The following example shows the resulting configuration for this execution:

```json
{
  "environment": "production",
  "output_paths": {
    "logs": "/prod/logs",
    "results": "/temp/testing"
  }
}
```

The `environment` field and the `output_paths.logs` field remain unchanged from the default configuration;
only `output_paths.results` is updated with the dynamic value.

---
title: EXPLAIN
source: https://docs.snowflake.com/en/sql-reference/sql/explain.md
section: SQL Commands
---

# EXPLAIN

Returns the logical execution plan for the specified SQL statement.

An explain plan shows the operations (for example, table scans and joins) that Snowflake would perform to execute the
query.

See also:
:   [SYSTEM$EXPLAIN_PLAN_JSON](../functions/system_explain_plan_json.md) ,
    [SYSTEM$EXPLAIN_JSON_TO_TEXT](../functions/system_explain_json_to_text.md) ,
    [EXPLAIN_JSON](../functions/explain_json.md)

## Syntax

```sqlsyntax
EXPLAIN [ USING { TABULAR | JSON | TEXT } ] <statement>
```

## Parameters

`statement`
:   This is the SQL statement for which you want the explain plan.

`USING output_format`
:   This optional clause specifies the output format. The possible output formats are:

    * JSON: JSON output is easier to store in a table and query.
    * TABULAR: tabular output is generally more human-readable than JSON output.
    * TEXT: formatted text output is generally more human-readable than JSON output.

    The default is TABULAR.

## Output

The output contains the following information:

| Column | Description |
| --- | --- |
| `step` | Most queries contain a single step, but some are executed as multiple distinct steps. This column denotes to which step the operation belongs. |
| `id` | Unique identifier assigned to each operation in the query plan. |
| `parentOperators` | Array of identifiers for the operation’s parent nodes. In the query profile, a parent is shown above its child with a link connecting the two. |
| `operation` | Name of the operation, for example, Result, Filter, TableScan, Join, or CreateTableFromArchiveData. |
| `objects` | Name of the object referenced by a table scan operation, for example, table, materialized view, secure view, or ARCHIVE OF <table>. |
| `alias` | Alias of a referenced object, if the object has been given an alias in the query. |
| `expressions` | List of expressions relevant to the current operation such as filters, join predicates, projections, aggregations, etc. |
| `partitionsTotal` | The total number of micro-partitions in the referenced database object. |
| `partitionsAssigned` | The number of partitions from the referenced object that are left after compile-time pruning, i.e. the number of partitions that might be scanned by the query. |
| `bytesAssigned` | The number of bytes contained in the partitionsAssigned. |

## Usage notes

* EXPLAIN compiles the SQL statement, but does not execute it, so EXPLAIN does not require a running warehouse.
* The EXPLAIN plan might differ depending on the size of the current warehouse. If you run EXPLAIN outside of a
  current warehouse, Snowflake constructs the EXPLAIN plan based on the capacity of an XSMALL warehouse.
* Although EXPLAIN does not consume any compute credits, the compilation of the query does consume Cloud Service
  credits, just as other metadata operations do.
* To post-process the output of this command, you can:

  + Use the [RESULT_SCAN](../functions/result_scan.md) function, which treats the output as a table that can be
    queried.
  + Generate the output in JSON format and insert the JSON-formatted output into a table for analysis later.
    If you store the output in JSON format, you can use the function [SYSTEM$EXPLAIN_JSON_TO_TEXT](../functions/system_explain_json_to_text.md) or
    [EXPLAIN_JSON](../functions/explain_json.md) to convert the JSON to a more human readable format (either tabular or formatted text).
* The assignedPartitions and assignedBytes values are upper bound estimates for query execution. Runtime optimizations
  such as join pruning can reduce the number of partitions and bytes scanned during query execution.
* The EXPLAIN plan is the “logical” explain plan. It shows the operations that will be performed, and their
  logical relationship to each other. The actual execution order of the operations in the plan does not necessarily
  match the logical order shown by the plan.
* If any of the database objects in the EXPLAIN statement are INFORMATION_SCHEMA objects, the statement fails with error
  `EXPLAIN command has insufficient privilege on object <objName>`.

## Examples

This example shows the EXPLAIN output for a simple query against two small tables.

> Create the tables:
>
> > ```sqlexample
> > CREATE TABLE Z1 (ID INTEGER);
> > CREATE TABLE Z2 (ID INTEGER);
> > CREATE TABLE Z3 (ID INTEGER);
> > ```
>
> Generate the EXPLAIN plan in tabular format for the query:
>
> > ```sqlexample
> > EXPLAIN USING TABULAR SELECT Z1.ID, Z2.ID
> >     FROM Z1, Z2
> >     WHERE Z2.ID = Z1.ID;
> > +------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------+
> > | step | id   | parentOperators | operation   | objects                      | alias | expressions              | partitionsTotal | partitionsAssigned | bytesAssigned |
> > |------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------|
> > | NULL | NULL |            NULL | GlobalStats | NULL                         | NULL  | NULL                     |               2 |                  2 |          1024 |
> > |    1 |    0 |            NULL | Result      | NULL                         | NULL  | Z1.ID, Z2.ID             |            NULL |               NULL |          NULL |
> > |    1 |    1 |             [0] | InnerJoin   | NULL                         | NULL  | joinKey: (Z2.ID = Z1.ID) |            NULL |               NULL |          NULL |
> > |    1 |    2 |             [1] | TableScan   | TESTDB.TEMPORARY_DOC_TEST.Z2 | NULL  | ID                       |               1 |                  1 |           512 |
> > |    1 |    3 |             [1] | JoinFilter  | NULL                         | NULL  | joinKey: (Z2.ID = Z1.ID) |            NULL |               NULL |          NULL |
> > |    1 |    4 |             [3] | TableScan   | TESTDB.TEMPORARY_DOC_TEST.Z1 | NULL  | ID                       |               1 |                  1 |           512 |
> > +------+------+-----------------+-------------+------------------------------+-------+--------------------------+-----------------+--------------------+---------------+
> > ```
>
> Generate the EXPLAIN plan for the query as formatted text:
>
> > ```sqlexample
> > EXPLAIN USING TEXT SELECT Z1.ID, Z2.ID
> >     FROM Z1, Z2
> >     WHERE Z2.ID = Z1.ID;
> > +------------------------------------------------------------------------------------------------------------------------------------+
> > | content                                                                                                                            |
> > |------------------------------------------------------------------------------------------------------------------------------------|
> > | GlobalStats:                                                                                                                       |
> > |     partitionsTotal=2                                                                                                              |
> > |     partitionsAssigned=2                                                                                                           |
> > |     bytesAssigned=1024                                                                                                             |
> > | Operations:                                                                                                                        |
> > | 1:0     ->Result  Z1.ID, Z2.ID                                                                                                     |
> > | 1:1          ->InnerJoin  joinKey: (Z2.ID = Z1.ID)                                                                                 |
> > | 1:2               ->TableScan  TESTDB.TEMPORARY_DOC_TEST.Z2  ID  {partitionsTotal=1, partitionsAssigned=1, bytesAssigned=512}      |
> > | 1:3               ->JoinFilter  joinKey: (Z2.ID = Z1.ID)                                                                           |
> > | 1:4                    ->TableScan  TESTDB.TEMPORARY_DOC_TEST.Z1  ID  {partitionsTotal=1, partitionsAssigned=1, bytesAssigned=512} |
> > |                                                                                                                                    |
> > +------------------------------------------------------------------------------------------------------------------------------------+
> > ```
>
> Generate the EXPLAIN plan for the query as JSON:
>
> > ```sqlexample
> > EXPLAIN USING JSON SELECT Z1.ID, Z2.ID
> >     FROM Z1, Z2
> >     WHERE Z2.ID = Z1.ID;
> > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > | content                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                           |
> > |---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> > | {"GlobalStats":{"partitionsTotal":2,"partitionsAssigned":2,"bytesAssigned":1024},"Operations":[[{"id":0,"operation":"Result","expressions":["Z1.ID","Z2.ID"]},{"id":1,"parentOperators":[0],"operation":"InnerJoin","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":2,"parentOperators":[1],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z2"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512},{"id":3,"parentOperators":[1],"operation":"JoinFilter","expressions":["joinKey: (Z2.ID = Z1.ID)"]},{"id":4,"parentOperators":[3],"operation":"TableScan","objects":["TESTDB.TEMPORARY_DOC_TEST.Z1"],"expressions":["ID"],"partitionsAssigned":1,"partitionsTotal":1,"bytesAssigned":512}]]} |
> > +---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> > ```

---
title: GET
source: https://docs.snowflake.com/en/sql-reference/sql/get.md
section: SQL Commands
---

# GET

Downloads data files from one of the following [internal stage](../../user-guide/data-load-overview.md)
types to a local directory or folder on a client machine:

* Named internal stage.
* Internal stage for a specified table.
* Internal stage for the current user.

You can use this command to download data files after unloading data from a table onto a
Snowflake stage using the [COPY INTO <location>](copy-into-location.md) command.

For more information about using the GET command, see [Unload into a Snowflake stage](../../user-guide/data-unload-snowflake.md).

See also:
:   [LIST](list.md) , [PUT](put.md) , [REMOVE](remove.md) , [COPY FILES](copy-files.md)

## Syntax

```sqlsyntax
GET internalStage file://<local_directory_path>
    [ PARALLEL = <integer> ]
    [ PATTERN = '<regex_pattern>'' ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```

## Required parameters

`internalStage`
:   Specifies the location in Snowflake from which to download the files:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are downloaded from the specified named internal stage. |
    > | `@[namespace.]%table_name[/path]` | Files are downloaded from the stage for the specified table. |
    > | `@~[/path]` | Files are downloaded from the stage for the current user. |

    Where:

    * `namespace` is the database and/or schema in which the named internal stage or table resides. It is optional if a
      database and schema are currently in use within the session; otherwise, it is required.
    * `path` is an optional case-sensitive path for files in the cloud storage location (that is, files have names that begin with a
      common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud
      storage services. If `path` is specified, but no file is explicitly named in the path, all data files in the path are
      downloaded.

    > **Note:**
    >
    > If the stage name or path includes spaces or special characters, it must be enclosed in single quotes (example: `'@"my stage"'`
    > for a stage named `"my stage"`).

`file://local_directory_path`
:   Specifies the local directory path on the client machine where the files are downloaded:

    Linux/macOS:
    :   You must include the initial forward slash in the path (example: `file:///tmp/load`).

        If the directory path includes special characters, the entire file URI must be enclosed in single quotes.

    Windows:
    :   You must include the drive and backslash in the path (example: `file://C:tempload`).

        If the directory path includes special characters, the entire file URI must be enclosed in single quotes. Note that
        the drive and path separator is a forward slash (`/`) in enclosed URIs (example: `'file://C:/Users/%Username%/Data 2025-01'`).

    > **Note:**
    >
    > The GET command returns an error if you specify a filename as part of the path, except if you use the
    > [JDBC driver](../../developer-guide/jdbc/jdbc.md) or [ODBC driver](../../developer-guide/odbc/odbc.md). If you specify a filename when
    > using either driver, the driver treats the filename as part of the directory path and creates a subdirectory with the specified
    > filename.
    >
    > For example, if you specify `file:///tmp/load/file.csv`, the JDBC or ODBC driver creates a subdirectory named `file.csv/`
    > under the path `/tmp/load/`. The GET command then downloads the staged files into this new subdirectory.

## Optional parameters

`PARALLEL = integer`
:   Specifies the number of threads to use for downloading the files. The granularity unit for downloading is one file.

    Increasing the number of threads can improve performance when downloading large files.

    Supported values: Any integer value from `1` (no parallelism) to `99` (use 99 threads for downloading files).

    Default: `10`

`PATTERN = 'regex_pattern'`
:   Specifies a regular expression pattern for filtering files to download. The command lists all files in the specified `path`
    and applies the regular expression pattern on each of the files found.

    Default: No value (all files in the specified stage are downloaded)

## Usage notes

* GET does not support the following actions:

  + Downloading files from external stages. To download files from external stages, use the utilities
    provided by your cloud service.
  + Downloading multiple files with divergent directory paths. The command
    *does not* preserve stage directory structure when transferring files to your client machine.

    For example, the following GET statement returns an error since you can’t download multiple files named `tmp.parquet` that are in
    different subdirectories on the stage.

    ```sqlexample
    GET @my_int_stage my_target_path PATTERN = "tmp.parquet";
    ```
* The [ODBC driver](../../developer-guide/odbc/odbc.md) supports GET with Snowflake accounts hosted on the following platforms:

> * Amazon Web Services (using ODBC Driver Version 2.17.5 and higher).
> * Google Cloud (using ODBC Driver Version 2.21.5 and higher).
> * Microsoft Azure (using ODBC Driver Version 2.20.2 and higher).

* The command cannot be executed from the Worksheets  page in either Snowflake web interface; instead, use the
  SnowSQL client to download data files, or check the documentation for the specific Snowflake client to verify support for this command.
* The command does not rename files.
* Downloaded files are automatically decrypted using the same key that was used to encrypt the file when it was either uploaded
  (using [PUT](put.md)) or unloaded from a table (using [COPY INTO <location>](copy-into-location.md)).
* For the [PUT](put.md) and GET commands,
  an EXECUTION_STATUS of `success` in the [QUERY_HISTORY](../account-usage/query_history.md)
  does *not* mean that data files were successfully uploaded or downloaded.
  Instead, the status indicates that Snowflake received authorization to proceed with the file transfer.

## Examples

Download all files in the stage for the `mytable` table to the `/tmp/data` local directory (in a Linux or macOS environment):

> ```sqlexample
> GET @%mytable file:///tmp/data/;
> ```

Download files from the `myfiles` path in the stage for the current user to the `/tmp/data` local directory (in a Linux or
macOS environment):

> ```sqlexample
> GET @~/myfiles file:///tmp/data/;
> ```

For additional examples, see [Unload into a Snowflake stage](../../user-guide/data-unload-snowflake.md).

---
title: GRANT <privilege> … TO SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-privilege-share.md
section: SQL Commands
---

# GRANT *<privilege>* … TO SHARE

Grants access privileges for databases and other supported database objects (schemas, UDFs, tables, and views) to a share. Granting
privileges on these objects effectively adds the objects to the share, which can then be shared with one or more consumer accounts.

For more details, see [About Secure Data Sharing](../../user-guide/data-sharing-intro.md) and [Create and configure shares](../../user-guide/data-sharing-provider.md).

See also:
:   [REVOKE <privilege> … FROM SHARE](revoke-privilege-share.md)

    [GRANT <privileges> … TO ROLE](grant-privilege.md)

## Syntax

```sqlsyntax
GRANT objectPrivilege ON
     {  DATABASE <name>
      | SCHEMA <name>
      | FUNCTION <name>
      | SEMANTIC VIEW <name>
      | { TABLE <name> | ALL TABLES IN SCHEMA <schema_name> }
      | { EXTERNAL TABLE <name> | ALL EXTERNAL TABLES IN SCHEMA <schema_name> }
      | { ICEBERG TABLE <name> | ALL ICEBERG TABLES IN SCHEMA <schema_name> }
      | { DYNAMIC TABLE <name> | ALL DYNAMIC TABLES IN SCHEMA <schema_name> }
      | TAG <name>
      | VIEW <name>  }
  TO SHARE <share_name>
```

Where:

```sqlsyntax
objectPrivilege ::=
-- For DATABASE
   REFERENCE_USAGE [ , ... ]
-- For DATABASE, FUNCTION, or SCHEMA
   USAGE [ , ... ]
-- For SEMANTIC VIEW
   { REFERENCES | SELECT } [ , ... ]
-- For TABLE
   EVOLVE SCHEMA [ , ... ]
-- For EXTERNAL TABLE, ICEBERG TABLE, TABLE, or VIEW
   SELECT [ , ... ]
-- For TAG
   READ
```

## Parameters

`name`
:   Specifies the identifier for the object for which the specified privilege is granted.

`schema_name`
:   Specifies the identifier for the schema for which the specified privilege is granted for all tables.

`share_name`
:   Specifies the identifier for the share from which the specified privilege is granted.

## Usage notes

* The USAGE privilege on only a single database can be granted to a share; however, within that database, privileges on multiple schemas,
  UDFs, tables, and views can be granted to the share.
* Privileges on individual objects must be granted to a share in separate GRANT statements. The only exception is the SELECT privilege on
  tables (including Apache Iceberg™ tables). Using an `ALL` clause, you can grant SELECT on all tables in a specified schema to a share.
* The SELECT privilege on views can only be granted on secure views. Attempting to grant the SELECT privilege on a non-secure view to a
  share returns an error.
* The USAGE privilege can only be granted on secure UDFs. Attempting to grant the USAGE privilege on a non-secure UDF to a share returns
  an error.
* Currently, sharing a UDF that references an object from another database is not supported. For example, if you attempt to grant USAGE
  on a UDF that references a secure view from another database, an error is returned.
* Use the REFERENCE_USAGE privilege when sharing a secure view that references objects belonging to multiple databases, as follows:

  + The REFERENCE_USAGE privilege must be granted individually on each database.
  + The REFERENCE_USAGE privilege must be granted on a database before granting the SELECT privilege on a secure view to a share.

  For more details, see [Share data from multiple databases](../../user-guide/data-sharing-multiple-db.md).
* [Secure Data Sharing](../../user-guide/data-sharing-intro.md): Data providers cannot add new objects to a share automatically using
  future grants. That is, data providers cannot grant privileges on future objects to a share using
  GRANT *<privilege>* … TO SHARE statements.
* You cannot reshare a database or database objects created from a share. If you attempt to grant the USAGE privilege on a database or
  database objects created from a share to a different share, an error is returned.
* If you specify a `TABLE` object that is an *Iceberg* table, the command grants the privilege on that Iceberg table.

## Examples

This is an example of sharing objects from a single database:

> ```sqlexample
> GRANT USAGE ON DATABASE mydb TO SHARE share1;
>
> GRANT USAGE ON SCHEMA mydb.public TO SHARE share1;
>
> GRANT USAGE ON FUNCTION mydb.shared_schema.function1 TO SHARE share1;
>
> GRANT USAGE ON FUNCTION mydb.shared_schema.function2 TO SHARE share1;
>
> GRANT SELECT ON ALL TABLES IN SCHEMA mydb.public TO SHARE share1;
>
> GRANT SELECT ON ALL EXTERNAL TABLES IN SCHEMA mydb.public TO SHARE share1;
>
> GRANT SELECT ON ALL ICEBERG TABLES IN SCHEMA mydb.public TO SHARE share1;
>
> GRANT SELECT ON ALL DYNAMIC TABLES IN SCHEMA mydb.public TO SHARE share1;
>
> GRANT USAGE ON SCHEMA mydb.shared_schema TO SHARE share1;
>
> GRANT SELECT ON VIEW mydb.shared_schema.view1 TO SHARE share1;
>
> GRANT SELECT ON VIEW mydb.shared_schema.view3 TO SHARE share1;
>
> GRANT SELECT ON ICEBERG TABLE mydb.shared_schema.iceberg_table_1 TO SHARE share1;
>
> GRANT SELECT ON DYNAMIC TABLE mydb.public TO SHARE share1;
> ```

This is an example of sharing a secure view that references objects from a different database:

> ```sqlexample
> CREATE SECURE VIEW view2 AS SELECT * FROM database2.public.sampletable;
>
> GRANT USAGE ON DATABASE database1 TO SHARE share1;
>
> GRANT USAGE ON SCHEMA database1.schema1 TO SHARE share1;
>
> GRANT REFERENCE_USAGE ON DATABASE database2 TO SHARE share1;
>
> GRANT SELECT ON VIEW view2 TO SHARE share1;
> ```

---
title: GRANT <privileges> … TO APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/grant-privilege-application.md
section: SQL Commands
---

# GRANT *<privileges>* … TO APPLICATION

Grants one or more access privileges on a securable object to an application. The privileges that can be
granted are object-specific.

Variations:
:   [REVOKE <privileges> … FROM APPLICATION](revoke-privilege-application.md)

## Syntax

```sqlsyntax
GRANT {  { globalPrivileges } ON ACCOUNT
       | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { USER | RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
      }
    TO APPLICATION <name>
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      CREATE {
       COMPUTE POOL | DATABASE | WAREHOUSE
      }
      | BIND SERVICE ENDPOINT
      | EXECUTE MANAGED TASK
      | MANAGE WAREHOUSES
      | READ SESSION
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET | CREATE { DATABASE ROLE | SCHEMA }
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=
ADD SEARCH OPTIMIZATION
| CREATE {
    ALERT | EXTERNAL TABLE | FILE FORMAT | FUNCTION
    | IMAGE REPOSITORY | MATERIALIZED VIEW | PIPE | PROCEDURE
    | { AGGREGATION | MASKING | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY
    | SECRET | SEMANTIC VIEW | SEQUENCE | SERVICE | SNAPSHOT | STAGE | STREAM
    | TAG | TABLE | TASK | VIEW
  }
| MODIFY | MONITOR | USAGE
[ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DYNAMIC TABLE
     OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { INSERT | SELECT } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | MASKING | PACKAGES | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     READ, USAGE [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
```

For more details about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `ALERT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `NETWORK RULE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PROCEDURE`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `STAGE`
    * `STREAM`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`

`object_type_plural`
:   Plural form of `object_type` (e.g. `TABLES`, `VIEWS`).

    Bulk grants on pipes are not allowed.

`name`
:   Specifies the identifier for the recipient application (the application to which the privileges are granted).

## Usage notes

* Granting OWNERSHIP privileges on an object or all objects of a specified type in a schema or database to an application, or transferring ownership of the object from one application to another application, is not allowed.
* Any ACCOUNT level privilege grant (not REVOKE) that is not in the current application version manifest is not allowed.

## Example

Grant the SELECT privilege on a view to an application:

```sqlexample
GRANT SELECT ON VIEW data.views.credit_usage
  TO APPLICATION app_snowflake_credits;
```

---
title: GRANT <privileges> … TO APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-privilege-application-role.md
section: SQL Commands
---

# GRANT *<privileges>* … TO APPLICATION ROLE

Grants one or more access privileges on a securable schema-level object to an application role. The privileges that can be granted are
object-specific.

For more details about roles and securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

Variations:
:   [GRANT OWNERSHIP](grant-ownership.md) , [REVOKE <privileges> … FROM APPLICATION ROLE](revoke-privilege-application-role.md)

## Syntax

```sqlsyntax
GRANT {
        { schemaPrivileges         | ALL [ PRIVILEGES ] } ON SCHEMA <schema_name>
        | { schemaObjectPrivileges | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
        | { schemaObjectPrivileges | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN SCHEMA <schema_name>
      }
    TO APPLICATION ROLE <name> [ WITH GRANT OPTION ]
```

Where:

```sqlsyntax
schemaPrivileges ::=
  {
    ADD SEARCH OPTIMIZATION
    | CREATE {
        ALERT | EXTERNAL TABLE | FILE FORMAT | FUNCTION
        | IMAGE REPOSITORY | MATERIALIZED VIEW | PIPE | PROCEDURE
        | { AGGREGATION | MASKING | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY
        | SECRET | SEMANTIC VIEW | SEQUENCE | SERVICE | SNAPSHOT | STAGE | STREAM
        | TAG | TABLE | TASK | VIEW
      }
    | MODIFY | MONITOR | USAGE
  }
  [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DYNAMIC TABLE
     OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { INSERT | SELECT } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | MASKING | PACKAGES | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     READ, USAGE [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
```

For more details about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `ALERT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `NETWORK RULE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PROCEDURE`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `STAGE`
    * `STREAM`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`

`object_type_plural`
:   Plural form of `object_type` (e.g. `TABLES`, `VIEWS`).

    Note that bulk grants on pipes are not allowed.

`name`
:   Specifies the identifier for the recipient application role (i.e. the role to which the privileges are granted).

## Optional parameters

`ON FUTURE`
:   Specifies that privileges are granted on new (i.e. future) schema objects of a specified type rather than existing objects. Future grants
    can be revoked at any time using [REVOKE <privileges> … FROM APPLICATION ROLE](revoke-privilege-application-role.md) with the ON FUTURE keywords; any privileges granted
    on existing objects are retained. For more information about future grants, see Future Grants on Schema Objects in this topic.

`WITH GRANT OPTION`
:   If specified, allows the recipient application role to grant the privileges to other application roles.

    Default: No value, which means the recipient application role cannot grant the privileges to other application roles.

    > **Note:**
    >
    > The WITH GRANT OPTION clause does not support the IMPORTED PRIVILEGES privilege. For more information, refer to
    > [Granting privileges on an imported database](../../user-guide/data-share-consumers.md).

## Usage notes

You must use an application role to grant and revoke privileges on objects in an application.

This command has different restrictions depending on whether you are the application provider or consumer.

The application consumer cannot do the following with respect to an application role:

* Grant or revoke object privileges with respect to an application role.
* Grant an application role to a database or share, or revoke an application role from a database or share.
* Grant an application role to same application or a different application, or revoke an application role from the same application or a
  different application.

These items apply the application provider with respect to an application role.

* To grant the OWNERSHIP privilege on an object or all objects of a specified type in a schema to an application role, transferring
  ownership of the object from one application role to another application role, use the [GRANT OWNERSHIP](grant-ownership.md) command.
* Multiple privileges can be specified for the same object type in a single GRANT statement with each privilege separated by commas.

  However, only privileges held and grantable by the application role executing the GRANT command are actually granted to the target
  application role. A warning message is returned for any privileges that could not be granted.
* Privileges granted to a particular application role are automatically inherited by any other application roles to which the application
  role is granted, as well as any other higher-level application roles within the role hierarchy.

  For more details, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).
* In managed access schemas:

  + The OWNERSHIP privilege on objects can only be transferred to a subordinate role of the schema owner.
  + For stages:

    - USAGE only applies to external stages.
    - READ
    - WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ privilege must
      first be granted on the stage.

  For more details about external and internal stages, refer to [CREATE STAGE](create-stage.md) and Access Control Requirements
  (in this topic).
* When granting privileges on an individual UDF or stored procedure, you must specify the data types of the arguments, if any,
  using the syntax shown below:

  ```sqlsyntax
  <udf_or_stored_procedure_name> ( [ <arg_data_type> [ , ... ] ] )
  ```

  Snowflake uses argument data types to resolve UDFs and stored procedures that have the same name within a schema. For more
  information, refer [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

## Access control requirements

* This command can only be executed from within the application.
* Privileges can only be granted or revoked on objects owned by the application. To determine these objects,
  use the [SHOW OBJECTS](show-objects.md) command:

  ```sqlexample
  SHOW OBJECTS OWNED BY APPLICATION myapp;
  ```
* Regarding managed access schemas:

  + In managed access schemas (i.e. schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax), object owners lose
    the ability to make grant decisions.

    The following roles can grant privileges on objects in a managed access schema:

    - The application role because this role is the schema owner (i.e. the role with the OWNERSHIP privilege on the schema).
    - A role that inherits the application role.
    - A role with the global MANAGE GRANTS privilege can grant privileges on objects in the schema.

      A role that holds the global MANAGE GRANTS privilege can grant additional privileges to the current (grantor) role.
  + Refer to Future Grants on Schema Objects (in this topic) for the access control requirements of future grants in managed access
    schemas.

## Future grants on schema objects

The notes in these sections apply when assigning future grants on objects in a schema (i.e. when using the ON FUTURE keywords).

### Considerations

* When future grants are defined on the same object type for a schema, the schema-level grants take precedence over the database
  level grants, and the database level grants are ignored. This behavior applies to privileges on future objects granted to one application
  role or different application roles.

### Restrictions and limitations

* No more than one future grant of the OWNERSHIP privilege is allowed on each securable object type.

* Future grants cannot be defined on objects of the following types:

  + Compute pool
  + External function
  + Image repository
  + Organization profile
  + Policy objects:

    - Aggregation policy
    - Join policy
    - Masking policy
    - Packages policy
    - Projection policy
    - Row access policy
    - Session policy
    - Storage lifecycle policy
  + Snapshot
  + Tag

* A future grant of the OWNERSHIP privilege on objects of a specified type in a database do not apply to new objects in a managed
  access schema.
* The following restrictions apply to future grants on objects in a managed access schema:

  + A future grant of the OWNERSHIP privilege on objects can only be applied to a subordinate role of the schema owner
    (i.e. the role that has the OWNERSHIP privilege on the schema).
  + Before ownership of a managed access schema can be transferred to a different role, all open future grants of the OWNERSHIP
    privilege must be revoked using [REVOKE <privileges> … FROM ROLE](revoke-privilege.md) with the ON FUTURE keywords.
* Future grants are not applied when renaming or swapping a table.
* Future grants are supported on named stages with the following restrictions:

  + The WRITE privilege cannot be specified without the READ privilege.
  + The READ privilege cannot be revoked if the WRITE privilege is present.
  + For internal stages, only future grants with the READ or WRITE privilege are materialized.
  + For external stages, only future grants with the USAGE privileges are materialized.
* In a managed access schema, the application role and a role with the global MANAGE GRANTS privilege can grant privileges on future
  objects in the managed access schema.

  In standard schemas, the global MANAGE GRANTS privilege is required to grant privileges on future objects in the schema.

## Example

Grant the SELECT privilege on a view to an application role:

```sqlexample
GRANT SELECT ON VIEW data.views.credit_usage
  TO APPLICATION ROLE app_snowflake_credits;
```

---
title: GRANT <privileges> … TO ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-privilege.md
section: SQL Commands
---

# GRANT *<privileges>* … TO ROLE

Grants one or more access privileges on a securable object to a role or database role. The privileges that can be granted are object-specific.

For information on granting privileges on securable objects to a share, see [GRANT <privilege> … TO SHARE](grant-privilege-share.md).

Roles:
:   The privileges that can be granted to roles are grouped into the following categories:

    * Global privileges.
    * Privileges for account objects, such as resource monitors, virtual warehouses, and databases.
    * Privileges for schemas.
    * Privileges for schema objects, such as tables, views, stages, file formats, UDFs, and sequences.

Database roles:
:   The privileges that can be granted to database roles are grouped into the following categories:

    * Privileges for the database that contains the database role.
    * Privileges for schemas in the database that contains the database role.
    * Privileges for schema objects, such as tables, views, stages, file formats, UDFs, and sequences in the database that contains the
      database role.

For more details about roles and securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

Variations:
:   [GRANT OWNERSHIP](grant-ownership.md) , [GRANT <privilege> … TO SHARE](grant-privilege-share.md)

See also:
:   [REVOKE <privileges> … FROM ROLE](revoke-privilege.md)

## Syntax

Account roles:

```sqlsyntax
GRANT {  { globalPrivileges         | ALL [ PRIVILEGES ] } ON ACCOUNT
       | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { USER | RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { FUTURE SCHEMAS IN DATABASE <db_name> }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> } }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
      }
  TO [ ROLE ] <role_name> [ WITH GRANT OPTION ]
```

Database roles:

```sqlsyntax
GRANT {  { CREATE SCHEMA | MODIFY | MONITOR | USAGE } [ , ... ] } ON DATABASE <object_name>
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { FUTURE SCHEMAS IN DATABASE <db_name> }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> } }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
      }
  TO DATABASE ROLE <database_role_name> [ WITH GRANT OPTION ]
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      CREATE {
          ACCOUNT | APPLICATION | APPLICATION PACKAGE | COMPUTE POOL | LISTING
          | DATABASE | EXTERNAL VOLUME | FAILOVER GROUP | INTEGRATION | NETWORK POLICY
          | ORGANIZATION LISTING | ORGANIZATION PROFILE | REPLICATION GROUP | ROLE | SHARE
       | USER | WAREHOUSE
      }
      | ATTACH POLICY | AUDIT | BIND SERVICE ENDPOINT
      | APPLY {
         { AGGREGATION | AUTHENTICATION | JOIN | MASKING | PACKAGES | PASSWORD
           | PROJECTION | ROW ACCESS | SESSION | STORAGE LIFECYCLE } POLICY
         | CONTACT
         | TAG }
      | EXECUTE { ALERT | DATA METRIC FUNCTION | MANAGED ALERT | MANAGED TASK | TASK }
      | IMPORT { SHARE | ORGANIZATION LISTING }
 | MANAGE { ACCOUNT SUPPORT CASES | EVENT SHARING | GRANTS | LISTING AUTO FULFILLMENT | ORGANIZATION SUPPORT CASES | SHARE TARGET | USER SUPPORT CASES | VISIBILITY | WAREHOUSES }
      | MODIFY { LOG LEVEL | TRACE LEVEL | SESSION LOG LEVEL | SESSION TRACE LEVEL }
      | MONITOR { EXECUTION | SECURITY | USAGE }
      | OVERRIDE SHARE RESTRICTIONS | PURCHASE DATA EXCHANGE LISTING | RESOLVE ALL
      | READ SESSION
      | READ UNREDACTED ERROR TABLE
      | USE AI FUNCTIONS
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For APPLICATION PACKAGE
    { ATTACH LISTING | DEVELOP | INSTALL | MANAGE VERSIONS | MANAGE RELEASES } [ , ... ]
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET | CREATE { DATABASE ROLE | SCHEMA }
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For ORGANIZATION PROFILE
   { MODIFY } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { IMPERSONATE | MODIFY PROGRAMMATIC AUTHENTICATION METHODS | MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=

    ADD SEARCH OPTIMIZATION | APPLYBUDGET
  | CREATE {
       AGENT | ALERT | CONTACT | CORTEX SEARCH SERVICE | DATA METRIC FUNCTION | DATASET
      | DBT PROJECT | EVENT TABLE | EXPERIMENT | FILE FORMAT | FUNCTION
      | GATEWAY | { GIT | IMAGE } REPOSITORY | MCP SERVER
      | MODEL | NETWORK RULE | NOTEBOOK | PIPE | PROCEDURE
      | { AGGREGATION | AUTHENTICATION | MASKING | PACKAGES
         | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION
         | STORAGE LIFECYCLE } POLICY
      | SECRET | SEQUENCE | SERVICE | SNAPSHOT | SNAPSHOT POLICY | SNAPSHOT SET
      | STAGE | STREAM | STREAMLIT
      | SNOWFLAKE.CORE.BUDGET
      | SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
      | SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER
      | SNOWFLAKE.ML.ANOMALY_DETECTION | SNOWFLAKE.ML.CLASSIFICATION
         | SNOWFLAKE.ML.FORECAST | SNOWFLAKE.ML.TOP_INSIGHTS
      | SNOWFLAKE.ML.DOCUMENT_INTELLIGENCE
      | [ { DYNAMIC | EXTERNAL | ICEBERG | INTERACTIVE | ONLINE FEATURE } ] TABLE
      | TAG | TASK | TYPE | WORKSPACE | [ { MATERIALIZED | SEMANTIC } ] VIEW
      }
   | MODIFY | MONITOR | USAGE
   [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For AGENT
     { MODIFY | MONITOR | USAGE } [ , ... ]
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For CONTACT
     { APPLY | MODIFY } [ , ... ]
  -- For CORTEX SEARCH SERVICE
     { OPERATE | USAGE } [ , ... ]
  -- For DATA METRIC FUNCTION
     USAGE [ , ... ]
  -- For DATASET, FILE FORMAT, FUNCTION (UDF or external function), MODEL, PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For SNAPSHOT POLICY or SNAPSHOT SET (for WORM snapshots)
     USAGE [ , ... ]
  -- For DBT PROJECT
     USAGE, MONITOR [ , ... ]
  -- For DYNAMIC TABLE
     MONITOR, OPERATE, SELECT [ , ... ]
  -- For EXPERIMENT
     { CREATE | MODIFY | USAGE } [ , ... ]
  -- For EVENT TABLE
     { APPLYBUDGET | DELETE | OWNERSHIP | REFERENCES | SELECT | TRUNCATE } [ , ... ]
  -- For GATEWAY
     { CREATE | MODIFY | USAGE } [ , ... ]
  -- For GIT REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For HYBRID TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For ICEBERG TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For INTERACTIVE TABLE
     { REFERENCES | SELECT } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
 -- For MCP SERVER
     { MODIFY | USAGE } [ , ... ]
  -- For ONLINE FEATURE TABLE
     { MONITOR | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | AUTHENTICATION | MASKING | JOIN | PACKAGES | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION | STORAGE LIFECYCLE } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     { READ | USAGE } [ , ... ]
  -- For SEMANTIC VIEW
     { SELECT | REFERENCES | MONITOR } [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For STREAMLIT
     USAGE [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | SELECT ERROR TABLE | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
  -- For WORKSPACE
     { READ | WRITE } [ , ... ]
```

For more details about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `AGENT`
    * `AGGREGATION POLICY`
    * `ALERT`
    * `AUTHENTICATION POLICY`
    * `CORTEX SEARCH SERVICE`
    * `DATA METRIC FUNCTION`
    * `DATASET`
    * `DBT PROJECT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXPERIMENT`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `GATEWAY`
    * `GIT REPOSITORY`
    * `IMAGE REPOSITORY`
    * `ICEBERG TABLE`
    * `INTERACTIVE TABLE`
    * `JOIN POLICY`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `MCP SERVER`
    * `MODEL`
    * `MODEL MONITOR`
    * `NETWORK RULE`
    * `NOTEBOOK`
    * `ONLINE FEATURE TABLE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PRIVACY POLICY`
    * `PROCEDURE`
    * `PROJECTION POLICY`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SERVICE`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `SNAPSHOT`
    * `SNAPSHOT POLICY`
    * `SNAPSHOT SET`
    * `STAGE`
    * `STORAGE LIFECYCLE POLICY`
    * `STREAM`
    * `STREAMLIT`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`
    * `WORKSPACE`

`object_type_plural`
:   Plural form of `object_type` (for example, `TABLES`, `VIEWS`).

    Note that bulk grants on pipes are not allowed.

`role_name`
:   Specifies the identifier for the recipient role (that is, the role to which the privileges are granted).

`database_role_name`
:   Specifies the identifier for the recipient database role (that is, the role to which the privileges are granted). If the identifier is not
    fully qualified in the form of `db_name.database_role_name`, the command looks for the database role in the current database
    for the session.

    All privileges are limited to the database that contains the database role, as well as other objects in the same database.

## Optional parameters

`FUTURE`
:   Specifies that privileges are granted on new (that is, future) database or schema objects of a specified type (such as tables or views) rather
    than on existing objects. Note that future grants can be revoked at any time using [REVOKE <privileges> … FROM ROLE](revoke-privilege.md) with the
    ON FUTURE parameter; any privileges granted on existing objects are retained. For more information about future grants, see
    Future grants on database or schema objects in this topic.

`WITH GRANT OPTION`
:   If specified, allows the recipient role to grant the privileges to other roles.

    Default: No value, which means the recipient role cannot grant the privileges to other roles.

    > **Note:**
    >
    > The WITH GRANT OPTION parameter does not support the IMPORTED PRIVILEGES privilege. For more information, see
    > [Granting privileges on an imported database](../../user-guide/data-share-consumers.md).

## Usage notes

* Privileges cannot be granted or revoked directly on any class. You can, however, create an instance of a class and
  grant [instance roles](../snowflake-db-classes.md) to an account role. Grant the CREATE <class_name> privilege on the schema to enable a
  role to create an instance of a class.
* OWNERSHIP is a valid privilege across all object types that support future grants.
* To grant the OWNERSHIP privilege on an object (or all objects of a specified type in a schema) to a role, transferring ownership of the
  object from one role to another role, use [GRANT OWNERSHIP](grant-ownership.md) instead. The GRANT OWNERSHIP command has a different
  syntax.
* Multiple privileges can be specified for the same object type in a single GRANT statement (with each privilege separated by commas), or
  the special `ALL [ PRIVILEGES ]` keyword can be used to grant all applicable privileges to the specified object type. Note,
  however, that only privileges held and grantable by the role executing the GRANT command are actually granted to the target role. A
  warning message is returned for any privileges that could not be granted.

  + You cannot specify this keyword for tags.
  + This keyword does not grant privileges on a class if you try to grant `ALL` privileges on a schema. To allow a role to create
    instances of a particular class, grant the CREATE privilege directly as shown in the Classes example.
* Privileges granted to a particular role are automatically inherited by any other roles to which the role is granted, as well as any other
  higher-level roles within the role hierarchy. For more details, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).
* For databases, the IMPORTED PRIVILEGES privilege only applies to shared databases (that is, databases created from a share). For more details,
  see [Consume imported data](../../user-guide/data-share-consumers.md). Note that the IMPORTED PRIVILEGES privilege cannot be granted to a database role.
* For schemas and objects in schemas, an `ALL object_type_plural in container` option is provided to grant privileges on all
  objects of the same type within the container (that is, a database or schema). This is a convenience option; internally, the command is expanded
  into a series of individual GRANT commands on each object. Only objects that currently exist within the container are affected.

  However, note that, in the Snowflake model, bulk granting of privileges is not a recommended practice. Instead, Snowflake recommends
  creating a shared role and using the role to create objects that are automatically accessible to all users who have been granted the role.

  You cannot specify ALL TAGS or ALL MASKING POLICIES.
* In managed access schemas:

  + The OWNERSHIP privilege on objects can only be transferred to a subordinate role of the schema owner.
* For stages:

  + USAGE only applies to external stages.
  + READ | WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ privilege must
    first be granted on the stage.

  For more details about external and internal stages, see [CREATE STAGE](create-stage.md).
* When granting privileges on an individual UDF or stored procedure, you must specify the data types of the arguments, if any,
  using the following syntax:

  ```sqlsyntax
  <udf_or_stored_procedure_name> ( [ <arg_data_type> [ , ... ] ] )
  ```

  Snowflake uses argument data types to resolve UDFs or stored procedures that have the same name within a schema. For more
  information, see [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).
* For dynamic tables, the receiving role must be granted the USAGE privilege on the database and schema that contains the dynamic table, and
  on the warehouse used to refresh the table. For more information, see [Dynamic table access control](../../user-guide/dynamic-tables-privileges.md).
* To grant privileges on hybrid tables, use the standard TABLE or TABLES keyword. You cannot specify HYBRID TABLE or HYBRID TABLES.

## Access control requirements

Granting privileges on individual objects:
:   In general, a role with any one of the following sets of privileges can grant privileges on an object to other roles:

    * The global MANAGE GRANTS privilege.

      Only the SECURITYADMIN and ACCOUNTADMIN system roles have the MANAGE GRANTS privilege; however, the privilege can be granted
      to custom roles.
    * The OWNERSHIP privilege on the object. When granting privileges on schema objects (e.g. tables and views), the role must
      also have the USAGE privilege on the parent database and schema.
    * If a privilege was granted to a role with the WITH GRANT OPTION parameter included in the
      GRANT *<privileges>* … TO ROLE statement, the role can grant the same privilege to other roles.

    In managed access schemas (that is, schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax), object owners lose
    the ability to make grant decisions. Only the schema owner (that is, the role with the OWNERSHIP privilege on the schema) or a
    role with the global MANAGE GRANTS privilege can grant privileges on objects in the schema.

    Note that a role that holds the global MANAGE GRANTS privilege can grant additional privileges to the current (grantor) role.

Defining grants on future objects of a specified type:
:   **Database level**

    The global MANAGE GRANTS privilege is required to grant privileges on future objects in a database. Only the SECURITYADMIN and
    ACCOUNTADMIN system roles have the MANAGE GRANTS privilege; however, the privilege can be granted to custom roles.

    **Schema level**

    In managed access schemas (that is, schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax), either the schema owner
    (that is, the role with the OWNERSHIP privilege on the schema) or a role with the global MANAGE GRANTS privilege can grant privileges
    on future objects in the schema.

    In standard schemas, the global MANAGE GRANTS privilege is required to grant privileges on future objects in the schema.

    **Database roles**

    To grant future privileges to a database role, a role with the global MANAGE GRANTS privilege, such as SECURITYADMIN, also
    requires the USAGE privilege on the database that contains the database role.

For more information about defining grants on future objects of a specified type, see
Future grants on database or schema objects (in this topic).

## Future grants on database or schema objects

The notes in this section apply when assigning future grants on objects in a schema or a database; that is, when using the ON FUTURE parameter.

For more information, see [managed access schemas](../../user-guide/security-access-control-configure.md).

### Considerations

* When future grants are defined on the same object type for a database and a schema in the same database, the schema-level
  grants take precedence over the database level grants, and the database level grants are ignored. This behavior applies to privileges
  on future objects granted to one role or different roles.

  For example, the following statements grant different privileges on objects of the same type at the database and schema levels.

  Grant the SELECT privilege on all future tables in database `d1` to role `r1`. This grant gives access to all future tables in all
  schemas in `d1`:

  ```sqlexample
  GRANT SELECT ON FUTURE TABLES IN DATABASE d1 TO ROLE r1;
  ```

  Grant the INSERT and DELETE privileges on all future tables only in the `d1.s1` schema to role `r2`.

  ```sqlexample
  GRANT INSERT, DELETE ON FUTURE TABLES IN SCHEMA d1.s1 TO ROLE r2;
  ```

  The future grants assigned to the `r1` role are ignored completely. When new tables are created in schema `d1.s1`, only the
  future privileges defined on tables for the `r2` role are granted.
* Database-level future grants apply to both regular and
  [managed access schemas](../../user-guide/security-access-control-configure.md).

### Restrictions and limitations

* No more than one future grant of the OWNERSHIP privilege is allowed on each securable object type.

* Future grants cannot be defined on objects of the following types:

  + Compute pool
  + External function
  + Image repository
  + Organization profile
  + Policy objects:

    - Aggregation policy
    - Join policy
    - Masking policy
    - Packages policy
    - Projection policy
    - Row access policy
    - Session policy
    - Storage lifecycle policy
  + Snapshot
  + Tag

* A future grant of the OWNERSHIP privilege on objects of a specified type in a database do not apply to new objects in a managed
  access schema.
* The following restrictions apply to future grants on objects in a managed access schema:

  + A future grant of the OWNERSHIP privilege on objects can only be applied to a subordinate role of the schema owner (that is, the role
    that has the OWNERSHIP privilege on the schema).
  + Before ownership of a managed access schema can be transferred to a different role, all open future grants of the OWNERSHIP
    privilege must be revoked using [REVOKE <privileges> … FROM ROLE](revoke-privilege.md) with the ON FUTURE parameter.
* Future grants are not applied when renaming or swapping a table.
* Future grants are supported on named stages with the following restrictions:

  + The WRITE privilege cannot be specified without the READ privilege.
  + The READ privilege cannot be revoked if the WRITE privilege is present.
  + For internal stages, only future grants with the READ or WRITE privilege are materialized.
  + For external stages, only future grants with the USAGE privileges are materialized.

## Examples

### Roles

Grant the necessary privileges to operate (that is, suspend or resume) the `report_wh` warehouse to the `analyst` role:

> ```sqlexample
> GRANT OPERATE ON WAREHOUSE report_wh TO ROLE analyst;
> ```

Repeat the previous example, but also allow the `analyst` role to grant the privilege to other roles:

> ```sqlexample
> GRANT OPERATE ON WAREHOUSE report_wh TO ROLE analyst WITH GRANT OPTION;
> ```

Grant the SELECT privilege on all existing tables in the `mydb.myschema` schema to the `analyst` role:

> ```sqlexample
> GRANT SELECT ON ALL TABLES IN SCHEMA mydb.myschema to ROLE analyst;
> ```

Grant all privileges on two UDFs in the `mydb.myschema` schema to the `analyst` role:

> ```sqlexample
> GRANT ALL PRIVILEGES ON FUNCTION mydb.myschema.add5(number) TO ROLE analyst;
>
> GRANT ALL PRIVILEGES ON FUNCTION mydb.myschema.add5(string) TO ROLE analyst;
> ```
>
> > **Note:**
> >
> > The UDFs have different arguments, which is how Snowflake uniquely identifies UDFs with the same name. For more details about
> > UDF naming, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

Grant USAGE privilege on a stored procedure in the `mydb.myschema` schema to the `analyst` role:

> ```sqlexample
> GRANT USAGE ON PROCEDURE mydb.myschema.myprocedure(number) TO ROLE analyst;
> ```
>
> > **Note:**
> >
> > Stored procedure names (like UDF names) can be overloaded, so you must specify the data type of the arguments(s). For more details about
> > name overloading, see [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

Grant the WRITE privilege on a shared workspace in the `mydb.myschema` schema to the `analyst` role:

> ```sqlexample
> GRANT WRITE ON WORKSPACE mydb.myschema.my_workspace TO ROLE analyst;
> ```

Grant the CREATE PROVISIONED THROUGHPUT privilege to a role:

> ```sqlexample
> GRANT CREATE PROVISIONED THROUGHPUT ON ACCOUNT TO ROLE myrole;
> ```

Grant the privilege to create materialized views in the specified schema:

> ```sqlexample
> GRANT CREATE MATERIALIZED VIEW ON SCHEMA mydb.myschema TO ROLE myrole;
> ```

Grant the SELECT and INSERT privileges on all future tables created in the `mydb.myschema` schema to the `role1` role:

> ```sqlexample
> GRANT SELECT, INSERT ON FUTURE TABLES IN SCHEMA mydb.myschema TO ROLE role1;
> ```

Grant the USAGE privilege on all future schemas in the `mydb` database to the `role1` role:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> GRANT USAGE ON FUTURE SCHEMAS IN DATABASE mydb TO ROLE role1;
> ```

Grant ALL PRIVILEGES on all tables in a given schema to a given role. Note that this grant applies to both standard tables and hybrid tables
in the specified schema:

> ```sqlexample
> GRANT ALL PRIVILEGES ON ALL TABLES IN SCHEMA ht_schema TO ROLE ht_role;
> ```

### Database roles

Create a database role `dr1` in the `mydb.myschema` schema:

> ```sqlexample
> CREATE DATABASE ROLE mydb.myschema.dr1;
> ```

Grant the SELECT privilege on all existing tables in the `mydb.myschema` schema to the `mydb.dr1` database role:

> ```sqlexample
> GRANT SELECT ON ALL TABLES IN SCHEMA mydb.myschema
>   TO DATABASE ROLE mydb.dr1;
> ```

Grant all privileges on two UDFs in the `mydb.myschema` schema to the `mydb.dr1` database role:

> ```sqlexample
> GRANT ALL PRIVILEGES ON FUNCTION mydb.myschema.add5(number)
>   TO DATABASE ROLE mydb.dr1;
>
> GRANT ALL PRIVILEGES ON FUNCTION mydb.myschema.add5(string)
>   TO DATABASE ROLE mydb.dr1;
> ```
>
> > **Note:**
> >
> > The UDFs have different arguments, which is how Snowflake uniquely identifies UDFs with the same name. For more details about UDF naming,
> > see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

Grant usage privilege on a stored procedure in the `mydb.myschema` schema to the `mydb.dr1` database role:

> ```sqlexample
> GRANT USAGE ON PROCEDURE mydb.myschema.myprocedure(number)
>   TO DATABASE ROLE mydb.dr1;
> ```
>
> > **Note:**
> >
> > Stored procedure names (like UDF names) can be overloaded, so you must specify the data type of the arguments(s). For more
> > details about overloading stored procedures, see [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

Grant the privilege to create materialized views in the specified schema to the `mydb.dr1` database role:

> ```sqlexample
> GRANT CREATE MATERIALIZED VIEW ON SCHEMA mydb.myschema
>   TO DATABASE ROLE mydb.dr1;
> ```

Grant the SELECT and INSERT privileges on all future tables created in the `mydb.myschema` schema to the `mydb.dr1` database role:

> ```sqlexample
> GRANT SELECT,INSERT ON FUTURE TABLES IN SCHEMA mydb.myschema
>   TO DATABASE ROLE mydb.dr1;
> ```

Grant the USAGE privilege on all future schemas in the `mydb` database to the `dr1` role:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> GRANT USAGE ON FUTURE SCHEMAS IN DATABASE mydb
>   TO DATABASE ROLE mydb.dr1;
> ```

Grant future privileges to the database role `mydb.dr1` using the SECURITYADMIN role:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> GRANT USAGE ON DATABASE mydb TO ROLE SECURITYADMIN;
>
> USE ROLE SECURITYADMIN;
>
> GRANT SELECT, INSERT ON FUTURE TABLES IN SCHEMA mydb.myschema
>   TO DATABASE ROLE mydb.dr1;
> ```
>
> > **Note:**
> >
> > The SECURITYADMIN role requires the USAGE privilege on a database that contains a database role in order to grant future privileges
> > to that database role.

Show that a user `testuser`, with role `public` and granted database role `dr1` , can select and insert to a new table in `mydb.myschema`:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
> CREATE USER testuser DEFAULT_ROLE=public DEFAULT_SECONDARY_ROLES=all;
> CREATE TABLE mydb.myschema.test_table (id INT, name VARCHAR(100));
>
> USE ROLE SECURITYADMIN;
> GRANT DATABASE ROLE mydb.dr1 TO USER testuser;
> ```

When logged in as `testuser`:

> ```sqlexample
> INSERT INTO mydb.myschema.test_table (id, name)
>   VALUES (1, 'Test Record');
>
> SELECT * FROM mydb.myschema.test_table;
> ```

Expected output:

> ```output
> -- +----+-------------+
> -- | ID | NAME        |
> -- +----+-------------+
> -- | 1  | Test Record |
> -- +----+-------------+
> ```

### Classes

To allow an account role to create budgets in a schema, grant the CREATE SNOWFLAKE.CORE.BUDGET privilege on the schema to the role:

> ```sqlexample
> USE ROLE ACCOUNTADMIN;
>
> GRANT CREATE SNOWFLAKE.CORE.BUDGET ON SCHEMA budgets_db.budgets_schema
>   TO ROLE budget_admin;
> ```

To allow an account role to create an ML Function model or instance (forecast, anomaly detection, or classification) in a schema,
grant the appropriate privilege on the schema to the role. The following privileges are available.

* CREATE SNOWFLAKE.ML.ANOMALY_DETECTION
* CREATE SNOWFLAKE.ML.CLASSIFICATION
* CREATE SNOWFLAKE.ML.FORECAST
* CREATE SNOWFLAKE.ML.TOP_INSIGHTS

---
title: GRANT <privileges> … TO USER
source: https://docs.snowflake.com/en/sql-reference/sql/grant-privilege-user.md
section: SQL Commands
---

# GRANT *<privileges>* … TO USER

Grants one or more access privileges on a securable object to a user. The privileges that can be granted are object-specific.

For more information about roles and securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about privileges, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

See also:
:   [GRANT <privileges> … TO ROLE](grant-privilege.md) , [REVOKE <privileges> … FROM USER](revoke-privilege-user.md)

## Syntax

```sqlsyntax
GRANT {  { globalPrivileges         | ALL [ PRIVILEGES ] } ON ACCOUNT
       | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { USER | RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
       | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
       | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> } }
      }
  TO [ USER ] <user_name> [ WITH GRANT OPTION ]
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      | ATTACH POLICY | AUDIT | BIND SERVICE ENDPOINT
      | APPLY {
         { AGGREGATION | AUTHENTICATION | JOIN | MASKING | PACKAGES | PASSWORD
           | PROJECTION | ROW ACCESS | SESSION } POLICY
         | TAG }
      | EXECUTE { ALERT | DATA METRIC FUNCTION | MANAGED ALERT | MANAGED TASK | TASK }
      | IMPORT SHARE
      | MANAGE { ACCOUNT SUPPORT CASES | EVENT SHARING | GRANTS | LISTING AUTO FULFILLMENT | ORGANIZATION SUPPORT CASES | USER SUPPORT CASES | WAREHOUSES }
      | MODIFY { LOG LEVEL | TRACE LEVEL | SESSION LOG LEVEL | SESSION TRACE LEVEL }
      | MONITOR { EXECUTION | SECURITY | USAGE }
      | OVERRIDE SHARE RESTRICTIONS | PURCHASE DATA EXCHANGE LISTING | RESOLVE ALL
      | READ SESSION
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=

    ADD SEARCH OPTIMIZATION | APPLYBUDGET
   | MODIFY | MONITOR | USAGE
   [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DATA METRIC FUNCTION
     USAGE [ , ... ]
  -- For DYNAMIC TABLE
     MONITOR, OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { APPLYBUDGET | DELETE | REFERENCES | SELECT | TRUNCATE } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), MODEL, PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For GIT REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For HYBRID TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For ICEBERG TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | AUTHENTICATION | MASKING | JOIN | PACKAGES | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     { READ | USAGE } [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For STREAMLIT
     USAGE [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | SELECT ERROR TABLE | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
```

For more information about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `AGENT`
    * `AGGREGATION POLICY`
    * `ALERT`
    * `AUTHENTICATION POLICY`
    * `CORTEX SEARCH SERVICE`
    * `DATA METRIC FUNCTION`
    * `DATASET`
    * `DBT PROJECT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXPERIMENT`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `GATEWAY`
    * `GIT REPOSITORY`
    * `IMAGE REPOSITORY`
    * `ICEBERG TABLE`
    * `INTERACTIVE TABLE`
    * `JOIN POLICY`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `MCP SERVER`
    * `MODEL`
    * `MODEL MONITOR`
    * `NETWORK RULE`
    * `NOTEBOOK`
    * `ONLINE FEATURE TABLE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PRIVACY POLICY`
    * `PROCEDURE`
    * `PROJECTION POLICY`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SERVICE`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `SNAPSHOT`
    * `SNAPSHOT POLICY`
    * `SNAPSHOT SET`
    * `STAGE`
    * `STORAGE LIFECYCLE POLICY`
    * `STREAM`
    * `STREAMLIT`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`
    * `WORKSPACE`

`object_type_plural`
:   Plural form of `object_type` (for example `TABLES`, `VIEWS`).

    Note that bulk grants on pipes are not allowed.

`user_name`
:   Specifies the identifier for the recipient user (the user to which the privileges are granted).

## Optional parameters

`WITH GRANT OPTION`
:   If specified, allows the recipient user to grant the privileges to other roles or users.

    Default: No value, which means the recipient role cannot grant the privileges to other roles or users.

    > **Note:**
    >
    > The WITH GRANT OPTION parameter does not support the IMPORTED PRIVILEGES privilege. For more information, see
    > [Granting privileges on an imported database](../../user-guide/data-share-consumers.md).

## Usage notes

* Privileges assigned directly to users are only effective when the user has all secondary roles enabled.
* Granting privileges directly to users may increase the proliferation of grants in your account. Outside of person-to-person sharing
  scenarios, we recommend that you grant privileges to roles to manage access that users need in Snowflake.
* [Future grants](grant-privilege.md) is not available.
* CREATE and OWNERSHIP privileges cannot be granted to users.

* Privileges cannot be granted or revoked directly on any class.

* Multiple privileges can be specified for the same object type in a single GRANT statement (with each privilege separated by commas).
  Alternatively, the special `ALL [ PRIVILEGES ]` keyword can be used to grant all applicable privileges to the specified object type.

  > **Note:**
  > + Only privileges held and grantable by the user executing the GRANT command are actually granted to the target role. A warning message
  >   is returned for any privileges not granted.
  > + You cannot specify `ALL [ PRIVILEGES ]` for tags.
  > + `ALL [ PRIVILEGES ]` does not grant privileges on a *class* if you try to grant `ALL [ PRIVILEGES ]` on a *schema*.

* For schemas and objects in schemas, an `ALL object_type_plural IN container` option is provided to grant privileges on all
  objects of the same type within the container (that is, database or schema). This option provides convenience. Internally, the command is
  expanded into a series of individual GRANT commands on each object. This option only affects objects that currently exist within the
  container.

  > **Note:**
  >
  > Bulk granting of privileges is not a recommended practice in the Snowflake model. Instead, Snowflake recommends creating a shared role and
  > then using that role to create objects that are automatically accessible to all users who have been granted the role.

  You cannot specify ALL TAGS or ALL MASKING POLICIES.

* For stages:

  + USAGE only applies to external stages.
  + READ | WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ
    privilege must first be granted on the stage.

  For more information about external and internal stages, see [CREATE STAGE](create-stage.md).
* When granting privileges on an individual UDF or stored procedure, you must specify the data types of the arguments, if any,
  using syntax such as `udf_or_stored_procedure_name ( [ arg_data_type [ , ... ] ] )`. Snowflake uses argument data types to
  resolve UDFs or stored procedures that have the same name within a schema. For more information, see [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).
* For dynamic tables, the receiving user must be granted the USAGE privilege on the database and schema that contains the dynamic table, and
  on the warehouse used to refresh the table. For more information, see [Dynamic table access control](../../user-guide/dynamic-tables-privileges.md).
* When granting privileges on an individual UDF, you must specify the data types for the arguments, if any, for the UDF using syntax such as
  `udf_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument data types to resolve UDFs that
  have the same name within a schema. For more information, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).
* When granting privileges on an individual stored procedure, you must specify the data types for the arguments, if any, for the
  procedure using syntax such as `procedure_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument
  data types to resolve stored procedures that have the same name within a schema.

  For more information, see [managed access schemas](../../user-guide/security-access-control-configure.md).

## Access control requirements

Granting privileges on individual objects:
:   In general, a role or user with any one of the following privileges can grant privileges on an object to other users:

    * The global `MANAGE GRANTS` privilege.

      Only the SECURITYADMIN and ACCOUNTADMIN system roles have the MANAGE GRANTS privilege; however, the privilege can be granted
      to custom roles or users.
    * The OWNERSHIP privilege.

      The role that has OWNERSHIP privilege on the object.
    * The USAGE privilege.
      When granting privileges on schema objects (for example, tables and views), the role or user must also have the USAGE privilege on the
      parent database and schema.

    If a privilege has been granted to a user with the `GRANT privileges ... TO USER WITH GRANT OPTION` command, then that user can
    re-grant that same privilege to other users or roles.

    In [managed access schemas](../../user-guide/security-access-control-configure.md) (schemas created using the `CREATE SCHEMA ... WITH MANAGED ACCESS`)
    syntax, object owners lose the ability to make grant decisions. Only the schema owner (the role with the OWNERSHIP privilege on the
    schema) or a role with the global MANAGE GRANTS privilege can grant privileges on objects in that schema.

    > **Note:**
    >
    > A role that holds the global MANAGE GRANTS privilege can grant additional privileges to the current (grantor) role or user.

## Examples

To grant the USAGE privilege on a Streamlit application to a specific user, `joe`:

```sqlexample
GRANT USAGE ON STREAMLIT streamlit_db.streamlit_schema.streamlit_app TO USER joe;
```

To grant the USAGE privilege on a procedure to a specific user, `user1`:

```sqlexample
GRANT USAGE ON PROCEDURE mydb.myschema.myprocedure(number) TO USER user1;
```

---
title: GRANT APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-application-role.md
section: SQL Commands
---

# GRANT APPLICATION ROLE

Assigns an application role to an account role, another application role, an application, or a user.

This command creates a “parent-child” relationship between the application role and the role
to which it is granted, also referred to as a role hierarchy.

For more details, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [ALTER APPLICATION ROLE](alter-application-role.md), [CREATE APPLICATION ROLE](create-application-role.md),
    [REVOKE APPLICATION ROLE](revoke-application-role.md), [SHOW APPLICATION ROLES](show-application-roles.md)

## Syntax

```sqlsyntax
GRANT APPLICATION ROLE <name> TO  { ROLE <parent_role_name> | APPLICATION ROLE <application_role> | APPLICATION <application_name> | USER <user_name> }
```

## Parameters

`name`
:   Specifies the identifier for the application role to grant. If the identifier contains spaces or
    special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are also case-sensitive.

`ROLE parent_role_name`
:   Grants the application role to the specified account role.

`APPLICATION ROLE application_role`
:   Grants the application role to the specified application role. This grant creates a role
    hierarchy of application roles.

    An application role can be granted to either an account role or another application role in the
    same application. If the parent role is an application role and the identifier is not fully
    qualified in the form of `application_name.application_role_name`, the command looks
    for the application role in the current application for the session.

`APPLICATION application_name`
:   Grants the application role to the specified application.

`USER user_name`
:   Grants the application role to the specified user.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege or role | Object | Notes |
| --- | --- | --- |
| ACCOUNTADMIN | Application role | A user with this role can grant a [Budgets application role](../../user-guide/budgets.md) to a custom role. |
| OWNERSHIP | Role | Role that is granted to an account role or another application role. However the application owner can grant an application role to another application role or account role. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Only the application owner can grant an application role to other roles or users. Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

Granting an application role to another application role can only be performed within the
context of an installed application, for example in application setup script.

## Examples

Grants an application role `app_role` to a different application role `other_app_role`:

```sqlexample
GRANT APPLICATION ROLE app_role to APPLICATION ROLE other_app_role;
```

Grants an application role `app_role` to a user `user1`:

```sqlexample
GRANT APPLICATION ROLE app_role to USER user1;
```

---
title: GRANT CALLER
source: https://docs.snowflake.com/en/sql-reference/sql/grant-caller.md
section: SQL Commands
---

# GRANT CALLER

Grants [caller grants](../../developer-guide/restricted-callers-rights.md) to a role. If an executable owned by the role runs with restricted caller’s rights, it can only run with the caller’s privileges specified by the caller grants.

Variations of the GRANT CALLER command are as follows:

* GRANT CALLER — Grant caller grants on a specific object. Each caller grant created by the statement allows the executable to
  run with a specified privilege.
* GRANT ALL CALLER PRIVILEGES — Grant caller grants on a specific object. The caller grants created by the statement allow the
  executable to run with all of the caller’s privileges.
* GRANT INHERITED CALLER — Grant caller grants on all current and future objects of the same type when they share a common schema, database,
  or account. Each caller grant created by the statement allows the executable to run with a specified privilege.
* GRANT ALL INHERITED CALLER PRIVILEGES — Grant caller grants on all current and future objects of the same type when they share a common
  schema, database, or account. The caller grants created by the statement allow the executable to run with all of the caller’s privileges.

## Syntax

```sqlsyntax
GRANT CALLER <object_privilege> [ , <object_privilege> ... ]
  ON <object_type> <object_name>
  TO { ROLE | DATABASE ROLE | APPLICATION } <grantee_name>

GRANT ALL CALLER PRIVILEGES
  ON <object_type> <object_name>
  TO { ROLE | DATABASE ROLE | APPLICATION } <grantee_name>

GRANT INHERITED CALLER <object_privilege> [ , <object_privilege> ... ]
  ON ALL <object_type_plural>
  IN { ACCOUNT | DATABASE <db_name> | SCHEMA <schema_name> | APPLICATION <app_name> | APPLICATION PACKAGE <app_pkg_name> }
  TO { ROLE | DATABASE ROLE | APPLICATION } <grantee_name>

GRANT ALL INHERITED CALLER PRIVILEGES
  ON ALL <object_type_plural>
  IN { ACCOUNT | DATABASE <db_name> | SCHEMA <schema_name> | APPLICATION <app_name> | APPLICATION PACKAGE <app_pkg_name> }
  TO { ROLE | DATABASE ROLE | APPLICATION } <grantee_name>
```

## Parameters

`object_privilege [ , object_privilege ... ]`
:   The object privileges that executables can run with. For a list of privileges for a specific object type, see
    [Access control privileges](../../user-guide/security-access-control-privileges.md).

    Use a comma-delimited list to specify more than one object privilege.

`ON object_type object_name`
:   Allows an executable to run with the specified `object_privilege` when accessing the specified object (`object_name`). Use
    the singular form of `object_type`, for example, `TABLE` or `WAREHOUSE`.

`ON ALL object_type_plural IN ACCOUNT` or . `ON ALL object_type_pluarl IN DATABASE db_name` or . `ON ALL object_type_plural IN SCHEMA schema_name` or . `ON ALL object_type_plural IN APPLICATION app_name` or . `ON ALL object_type_plural IN APPLICATION PACKAGE app_pkg_name`
:   Allows an executable to run with object-level privileges when accessing an object of the specified type. Use the plural form of
    the object type, for example, `TABLES` or `WAREHOUSES`.

    You can use the GRANT statement to control access to all objects in the current account, or just objects in the specified
    database, schema, application, or application package.

`TO ROLE grantee_name` or . `TO DATABASE ROLE grantee_name`
:   Owner of the executables that you want to secure with caller grants.

    If you specify a database role, privileges are limited to objects in the same database as the database role.

`TO APPLICATION app_name`
:   Specifies a Snowflake Native App as the grantee.

    > **Note:**
    >
    > If you specify IN ACCOUNT not all object types are supported when using TO APPLICATION.
    > Only the following objects are supported:
    >
    > * DATABASE
    > * APPLICATION PACKAGE
    > * APPLICATION
    > * Object types that are contained within a database or schema.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE CALLER GRANTS | Account | The account-level MANAGE CALLER GRANTS privilege pertains to caller grants only. It does not allow you to grant privileges to roles. |
| Any privilege | All specified objects | You need at least one privilege on the objects specified in the caller grant. For example, granting a caller grant on a table requires that you have at least one privilege on that table. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Executables owned by `owner_role` that access a `v1` view can run with the SELECT privilege on the view:

> ```sqlexample
> GRANT CALLER SELECT ON VIEW v1 TO owner_role;
> ```

Executables owned by `owner_role` that access any table in the `db.sch` schema can run with the caller’s SELECT and INSERT privileges.

> ```sqlexample
> GRANT INHERITED CALLER SELECT, INSERT ON ALL TABLES IN SCHEMA db.sch TO ROLE owner_role;
> ```

Executables owned by `owner_role` that access schemas in the current account can run with all of the caller’s privileges on the schemas.

> ```sqlexample
> GRANT ALL INHERITED CALLER PRIVILEGES ON ALL SCHEMAS IN ACCOUNT TO ROLE owner_role;
> ```

Executables owned by the `db.r` database role that access the `db.sch1.t1` table can run with the SELECT privilege on the table.

> ```sqlexample
> GRANT CALLER SELECT ON TABLE db.sch1.t1 TO DATABASE ROLE db.r;
> ```

Executables owned by `owner_role` that access the `my_db` database can run with all of the caller’s privileges on the database.

> ```sqlexample
> GRANT ALL CALLER PRIVILEGES ON DATABASE my_db TO ROLE owner_role;
> ```

---
title: GRANT DATABASE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-database-role.md
section: SQL Commands
---

# GRANT DATABASE ROLE

Assigns a database role to an [account role, another database role](../../user-guide/security-access-control-overview.md), or a user. A user with
OWNERSHIP privilege on a database role can grant that database role to either an account role, another database role, or a user in the same
database. Granting a database role to another role creates a “parent-child” relationship (also referred to as a *role hierarchy*) between
the database role and the other role. For specific limitations on database roles, see [Database roles and role hierarchies](../../user-guide/security-access-control-overview.md).

See also:
:   [REVOKE DATABASE ROLE](revoke-database-role.md) , [GRANT ROLE](grant-role.md) , [REVOKE ROLE](revoke-role.md) , [GRANT <privileges> … TO ROLE](grant-privilege.md)

## Syntax

```sqlsyntax
GRANT DATABASE ROLE <name> TO { DATABASE ROLE <parent_role_name> | ROLE <parent_role_name> | USER <user_name> }

GRANT DATABASE ROLE <name> TO APPLICATION <app_name>
```

## Parameters

`name`
:   Specifies the identifier (name) for the database role; must be unique in the database in which the database role is created.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

    If the identifier is not fully qualified in the form of `db_name.database_role_name`, the command looks for the database role
    in the current database for the session.

`ROLE parent_role_name`
:   Grants the database role to the specified account role.

`DATABASE ROLE parent_role_name`
:   Grants the database role to the specified database role. If the parent role is a database role and the identifier is not fully qualified
    in the form of `db_name.database_role_name`, the command looks for the database role in the current database for the session.

`APPLICATION app_name`
:   Grants the database role to the specified Snowflake Native App.

`USER user_name`
:   Grants the database role to the specified user.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege or role | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Database role | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Examples

Grants the database role `analyst` to the SYSADMIN role:

```sqlexample
GRANT DATABASE ROLE analyst TO ROLE SYSADMIN;
```

Grants the database role `dr1` to the database role `dr2`:

```sqlexample
GRANT DATABASE ROLE dr1 TO DATABASE ROLE dr2;
```

Grants the database role `db1` to the Snowflake Native App named `hello_snowflake_app`:

```sqlexample
GRANT DATABASE ROLE db1 TO APPLICATION hello_snowflake_app;
```

Grants the database role `dr3` to the user `user1`:

```sqlexample
GRANT DATABASE ROLE dr3 TO USER user1;
```

---
title: GRANT DATABASE ROLE … TO SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-database-role-share.md
section: SQL Commands
---

# GRANT DATABASE ROLE … TO SHARE

Grants a database role to a share. Granting a database role effectively adds privileges on a single database to the share, which can then
be shared with one or more consumer accounts.

After consumers create a database from the share, they can grant the shared database roles to roles in their account to allow users with
those roles to access database objects in the share.

For more details, see [About Secure Data Sharing](../../user-guide/data-sharing-intro.md) and [Create and configure shares](../../user-guide/data-sharing-provider.md).

See also:
:   [REVOKE DATABASE ROLE … FROM SHARE](revoke-database-role-share.md)

## Syntax

```sqlsyntax
GRANT DATABASE ROLE <name>
  TO SHARE <share_name>
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) for the database role; must be unique in the database in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified (in the form of `db_name.database_role_name`, the command looks for the database role
    in the current database for the session.

`share_name`
:   Specifies the identifier for the share from which the specified database role is granted.

## Usage notes

* Granting a database role to a share fails if any DDL or other restricted privilege was granted to the database role. A database role can
  only grant permissions for read-only activity on a database and its objects.
* A shared database role does not support future grants. Snowflake returns the following error message depending on the action that you
  take:

  + Grant future privileges on an object to a database role and grant the database role to the share:

    ```sqlexample
    GRANT SELECT ON FUTURE TABLES IN SCHEMA sh TO DATABASE ROLE dbr1;
    GRANT DATABASE ROLE dbr1 TO SHARE myshare;
    ```

    ```output
    Cannot share a database role with future grants to it.
    ```
  + Grant the database role to a share and grant future privileges on an object to the database role:

    ```sqlexample
    GRANT DATABASE ROLE dbr1 TO SHARE myshare;
    GRANT SELECT ON FUTURE TABLES IN SCHEMA sh TO DATABASE ROLE dbr1;
    ```

    ```output
    Cannot grant future grants to a database role that is granted to a share.
    ```

  Use the following commands to identify whether you have future grants associated with a database role to avoid these error messages:

  ```sqlexample
  SHOW FUTURE GRANTS IN DATABASE parent_db;
  SHOW FUTURE GRANTS IN shared_schema;
  ```

## Examples

Grant the database role `dr1` in database `d1` to share `share1`:

> ```sqlexample
> GRANT DATABASE ROLE d1.dr1 TO SHARE share1;
> ```

---
title: GRANT OWNERSHIP
source: https://docs.snowflake.com/en/sql-reference/sql/grant-ownership.md
section: SQL Commands
---

# GRANT OWNERSHIP

Transfers ownership of an object or all objects of a specified type in a schema from one role to another role. *Role* refers to either
a role or a database role.

OWNERSHIP is a special type of privilege that can only be granted from one role to another role; it cannot be revoked. For more details,
see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

This command is a variation of [GRANT <privileges> … TO ROLE](grant-privilege.md).

See also:
:   [REVOKE <privileges> … FROM ROLE](revoke-privilege.md)

## Syntax

**For object types that are not an instance of a class:**

```sqlsyntax
GRANT OWNERSHIP
  { ON {
            <object_type> <object_name>
          | ALL <object_type_plural> IN { DATABASE <database_name> | SCHEMA <schema_name> }
       }
    | ON FUTURE <object_type_plural> IN { DATABASE <database_name> | SCHEMA <schema_name> }
  }
  TO { ROLE <role_name> | DATABASE ROLE <database_role_name> }
  [ { REVOKE | COPY } CURRENT GRANTS ]
```

**For an instance of a class:**

```sqlsyntax
GRANT OWNERSHIP
  ON  <class_name> <instance_name>
  TO { ROLE <role_name> | DATABASE ROLE <database_role_name> }
  [ { REVOKE | COPY } CURRENT GRANTS ]
```

## Required parameters

`object_name`
:   Specifies the identifier for the object on which you are transferring ownership.

`object_type`
:   Specifies the type of object.

    One of the following:

    * `AGENT`
    * `AGGREGATION POLICY`
    * `ALERT`
    * `AUTHENTICATION POLICY`
    * `COMPUTE POOL`
    * `CORTEX SEARCH SERVICE`
    * `DATA METRIC FUNCTION`
    * `DATABASE`
    * `DATABASE ROLE`
    * `DBT PROJECT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXPERIMENT`
    * `EXTERNAL TABLE`
    * `EXTERNAL VOLUME`
    * `FAILOVER GROUP`
    * `FILE FORMAT`
    * `FUNCTION`
    * `GATEWAY`
    * `GIT REPOSITORY`
    * `ICEBERG TABLE`
    * `IMAGE REPOSITORY`
    * `INTEGRATION`
    * `JOIN POLICY`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `MCP SERVER`
    * `NETWORK POLICY`
    * `NETWORK RULE`
    * `NOTEBOOK`
    * `ONLINE FEATURE TABLE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PRIVACY POLICY`
    * `PROCEDURE`
    * `PROJECTION POLICY`
    * `REPLICATION GROUP`
    * `RESOURCE MONITOR`
    * `ROLE`
    * `ROW ACCESS POLICY`
    * `SCHEMA`
    * `SEMANTIC VIEW`
    * `SESSION POLICY`
    * `SECRET`
    * `SEQUENCE`
    * `SNAPSHOT`
    * `SNAPSHOT POLICY`
    * `SNAPSHOT SET`
    * `STAGE`
    * `STORAGE LIFECYCLE POLICY`
    * `STREAM`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `USER`
    * `VIEW`
    * `WAREHOUSE`
    * `WORKSPACE`

`object_type_plural`
:   Plural form of `object_type` (e.g. `TABLES`, `VIEWS`).

    Note that bulk grants on pipes and data metric functions are not allowed.

`role_name`
:   The identifier for the role to which the object ownership is transferred.

`database_role_name`
:   The identifier for the database role to which the object ownership is transferred. If the identifier is not fully qualified (in the
    form of `db_name.database_role_name`, the command looks for the database role in the current database for the session.

    Ownership is limited to objects in the database that contains the database role.

## Optional parameters

`[ REVOKE | COPY ] CURRENT GRANTS`
:   Specifies whether to remove or transfer all existing outbound privileges on the object when ownership is transferred to a new role:

    > **Note:**
    >
    > *Outbound* privileges refer to any privileges granted on the individual object whose ownership is changing.
    >
    > When transferring ownership of a role, current grants refers to any roles that were granted to the current role (to create a role
    > hierarchy). If ownership of a role is transferred with the current grants copied, then
    > the output of the SHOW GRANTS command shows the new owner as the grantor of any child roles to the current role.

    `REVOKE`
    :   Enforces RESTRICT semantics, which require removing all outbound privileges on an object before transferring ownership to a new role.
        This is intended to protect the new owning role from unknowingly inheriting the object with privileges already granted on it.

        After transferring ownership, the privileges for the object must be explicitly re-granted on the role.

        Note that the REVOKE keyword does not work when granting ownership of future objects of a specified type in a database or schema to
        a role (using GRANT OWNERSHIP ON FUTURE `<object_type>`).

    `COPY`
    :   Transfers ownership of an object along with a copy of any existing outbound privileges on the object. After the transfer, the new
        owner is identified in the system as the grantor of the copied outbound privileges (that is, in the [SHOW GRANTS](show-grants.md) output for the
        object, the new owner is listed in the GRANTED_BY column for all privileges). As a result, any privileges that were subsequently
        re-granted before the change in ownership are no longer dependent on the original grantor role.

        Revoking a privilege using [REVOKE <privileges> … FROM ROLE](revoke-privilege.md) with the `CASCADE` option does not recursively revoke these formerly
        dependent grants. The grants must be explicitly revoked.

        The `COPY` parameter requires at least one of the following:

        * An active role has the MANAGE GRANTS privilege on the account.
        * An active role is the new owner (or a higher) role. The system role PUBLIC is naturally captured by this requirement because PUBLIC is
          granted to every role.

        The active role considers both primary and secondary roles. For more information, see [Active roles](../../user-guide/security-access-control-overview.md).

    Default: None. Neither operation is performed on any existing outbound privileges.

    > > **Note:**
    > >
    > > A GRANT OWNERSHIP statement fails if existing outbound privileges on the object are neither revoked nor copied.

## Usage notes

* You cannot transfer the OWNERSHIP privilege for the following objects:

  + `APPLICATION ROLE`
  + `CONNECTION`

    Only the ACCOUNTADMIN role can have the OWNERSHIP privilege on a connection object.
  + Instances of a [class](../snowflake-db-classes.md).
  + Machine learning objects (that is, models, model versions, and model monitors).
  + `SERVICE`
  + `SHARE`
* The GRANT OWNERSHIP statement is blocked if outbound (that is, dependent) privileges exist on the object. The object owner (or a higher role)
  can explicitly copy all current privileges to the new owning role (using the `COPY CURRENT GRANTS` option) or revoke all outbound
  privileges on the object before transferring ownership (using the `REVOKE CURRENT GRANTS` option).

  For role objects, if you do not specify these clauses, the GRANT OWNERSHIP statement is not blocked when transferring a role to a new
  owner role. The new owner role is updated. However, a `SHOW GRANTS OF ROLE transferred_role` command shows two rows for the
  transferred role being granted to the same user:

  + In the `granted_by` column, the value in one row is for the grant by the original owner role.
  + In the `granted_by` column, the value in the other row is for the grant by the new owner role.

  Snowflake prevents the GRANT OWNERSHIP … REVOKE CURRENT GRANTS command on a shared database. For details, see the Shared database
  example in this topic.
* The transfer of ownership only affects existing objects at the time the command is issued. Any objects created after the command is
  issued are owned by the role in use when the object is created.
* Transferring ownership of objects of the following types is blocked unless additional conditions are met:

  Pipes:
  :   The pipe must be paused.

  Tasks:
  :   You must suspend the scheduled task. Snowflake suspends all tasks in the container automatically if all tasks in a specified database or schema are transferred to another role. Tasks transferred to the same role using the `COPY CURRENT GRANTS` option are also suspended automatically. For more information, see [Task security](../../user-guide/tasks-intro.md).
* When future grants on the same object type are defined at both the database and
  schema level, the schema-level grants take precedence over the database-level grants, and
  the database-level grants are ignored.
* To grant ownership on a materialized view, use `GRANT OWNERSHIP ON VIEW`. There is no separate
  `GRANT OWNERSHIP ON MATERIALIZED VIEW` statement.
* To grant ownership on a hybrid table, use `GRANT OWNERSHIP ON TABLE`. There is no separate
  `GRANT OWNERSHIP ON HYBRID TABLE` statement.
* You cannot transfer the OWNERSHIP privilege on a share, nor can you transfer the OWNERSHIP privilege on a connection. Only the ACCOUNTADMIN role can own the connection.
* For granting the OWNERSHIP privilege on dynamic tables, ensure the receiving role has the USAGE privilege on the database and schema
  that contains the dynamic table, and on the warehouse used to refresh the table. Otherwise, subsequent scheduled refreshes fail.
* For granting the OWNERSHIP privilege on future dynamic tables:

  + If the dynamic table is set to initialize on creation (that is, `INITIALIZE = ON_CREATE`), ensure the new role has
    [sufficient privileges](../../user-guide/dynamic-tables-privileges.md) on referenced objects. Otherwise, the initial refresh fails and results in
    an error stating that the object cannot be found.
  + If the dynamic table is set to initialize on schedule (that is, `INITIALIZE = ON_SCHEDULE`), ensure the new role has
    [sufficient privileges](../../user-guide/dynamic-tables-privileges.md) on referenced objects. Otherwise, the subsequent scheduled refreshes fail.
* When you transfer ownership of an Apache Iceberg™ table to a different role,
  Snowflake doesn’t transfer the OWNERSHIP privilege on the external volume
  (and catalog integration if the table is externally managed) associated with the table.

  To give the target role full control over the table and its related objects,
  you must grant the OWNERSHIP privilege on the external volume and catalog integration to the role.
* After the ownership of a notebook is transferred to a new role, the original owner role loses all access to the notebook.
* **Database roles:**

  Ownership can only be transferred on objects in the same database as the database role.
* Transferring ownership on an external table or its parent database blocks automatic refreshes of the table metadata
  by setting the `AUTO_REFRESH` property to `FALSE`. To reset the property after you transfer ownership,
  use the [ALTER EXTERNAL TABLE](alter-external-table.md) command.

## Examples

### Roles

Revoke all outbound privileges on the `mydb` database, currently owned by the `manager` role, before transferring ownership
to the `analyst` role:

> ```sqlexample
> REVOKE ALL PRIVILEGES ON DATABASE mydb FROM ROLE manager;
>
> GRANT OWNERSHIP ON DATABASE mydb TO ROLE analyst;
>
> GRANT ALL PRIVILEGES ON DATABASE mydb TO ROLE analyst;
> ```
>
> Note that this example illustrates the default (and recommended) multi-step process for transferring ownership.

In a single step, revoke all privileges on the existing tables in the `mydb.public` schema and transfer ownership of the tables
(along with a copy of their current privileges) to the `analyst` role:

> ```sqlexample
> GRANT OWNERSHIP ON ALL TABLES IN SCHEMA mydb.public TO ROLE analyst COPY CURRENT GRANTS;
> ```

Grant ownership on the `mydb.public.mytable` table to the `analyst` role along with a copy of all current outbound privileges
on the table:

> ```sqlexample
> GRANT OWNERSHIP ON TABLE mydb.public.mytable TO ROLE analyst COPY CURRENT GRANTS;
> ```

Grant ownership on a notebook called `mynotebook` from the `data_science` role to the `finance` role:

> ```sqlexample
> USE ROLE data_science;
> GRANT OWNERSHIP ON NOTEBOOK db_one.schema_one.mynotebook TO ROLE finance;
> ```

### Database roles

In a single step, revoke all privileges on the existing tables in the `mydb.public` schema and transfer ownership of the tables
(along with a copy of their current privileges) to the `mydb.dr1` database role:

> ```sqlexample
> GRANT OWNERSHIP ON ALL TABLES IN SCHEMA mydb.public
>   TO DATABASE ROLE mydb.dr1
>   COPY CURRENT GRANTS;
> ```

Grant ownership on the `mydb.public.mytable` table to the `mydb.dr1` database role along with a copy of all current outbound
privileges on the table:

> ```sqlexample
> GRANT OWNERSHIP ON TABLE mydb.public.mytable
>   TO ROLE mydb.dr1
>   COPY CURRENT GRANTS;
> ```

### Shared database

To transfer the OWNERSHIP privilege on a shared database, use these commands:

> ```sqlexample
> REVOKE USAGE ON DATABASE mydb FROM SHARE myshare;
> GRANT OWNERSHIP ON DATABASE mydb TO ROLE r2;
> GRANT USAGE ON DATABASE mydb TO ROLE r2;
> ```

If necessary, re-grant the database to the share using a [GRANT <privilege> … TO SHARE](grant-privilege-share.md) command.

---
title: GRANT ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-role.md
section: SQL Commands
---

# GRANT ROLE

Assigns a role to a user or another role:

* Granting a role to another role creates a “parent-child” relationship between the roles (also referred to as a *role hierarchy*).
* Granting a role to a user enables the user to perform all operations allowed by the role (through the access privileges granted to the role).

For more details, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [REVOKE ROLE](revoke-role.md)

    [GRANT DATABASE ROLE](grant-database-role.md) , [REVOKE DATABASE ROLE](revoke-database-role.md)

    [GRANT <privileges> … TO ROLE](grant-privilege.md)

## Syntax

```sqlsyntax
GRANT ROLE <name> TO { ROLE <parent_role_name> | USER <user_name> }
```

## Parameters

`name`
:   Specifies the identifier for the role to grant. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`ROLE parent_role_name`
:   Grants the role to the specified role.

`USER user_name`
:   Grants the role to the specified user.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Role | Role that is granted to a user or another role. |

Alternatively, use a role with the global MANAGE GRANTS privilege. Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The system-defined roles, including PUBLIC, do not need to be granted to other roles because the role hierarchy for these roles is
  defined and maintained by Snowflake.

## Examples

```sqlexample
GRANT ROLE analyst TO ROLE SYSADMIN;
```

```sqlexample
GRANT ROLE analyst TO USER user1;
```

---
title: GRANT SERVICE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/grant-service-role.md
section: SQL Commands
---

# GRANT SERVICE ROLE

Assigns a service role to an account role, application role, or database role. For more information, see [Managing service-related privileges](../../developer-guide/snowpark-container-services/working-with-services.md).

See also:
:   [REVOKE SERVICE ROLE](revoke-service-role.md), [SHOW ROLES IN SERVICE](show-roles-in-service.md),
    [SHOW GRANTS](show-grants.md)

## Syntax

```sqlsyntax
GRANT SERVICE ROLE <name> TO
{
  ROLE <role_name>                     |
  APPLICATION ROLE <application_role_name>  |
  DATABASE ROLE <database_role_name>
}
```

## Parameters

`name`
:   Specifies the identifier for the service role to grant. If the identifier contains spaces or
    special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in
    double quotes are also case-sensitive.

    Specify the service role name in the following format:

    > `service-name!service-role-name`

    For example, `echo_service!echoendpoint_role`.

`ROLE role_name`
:   Name of the account role to grant the service role to.

`APPLICATION ROLE application_role_name`
:   Name of the application role to grant the service role to.

`DATABASE ROLE database_role_name`
:   Name of the database role to grant the service role to.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege or role | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Service | Only the service owner can grant the service role. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following command grants the `echoendpoint_role` service role defined in the `echo_service` service specification to the `service_function_user_role` role.

```sqlexample
GRANT SERVICE ROLE echo_service!echoendpoint_role TO ROLE service_function_user_role;
```

---
title: INSERT
source: https://docs.snowflake.com/en/sql-reference/sql/insert.md
section: SQL Commands
---

# INSERT

Updates a table by inserting one or more rows into the table. The values inserted into each column in the table can be explicitly-specified
or the results of a query.

See also:
:   [INSERT (multi-table)](insert-multi-table.md)

## Syntax

```sqlsyntax
INSERT [ OVERWRITE ] INTO <target_table> [ ( <target_col_name> [ , ... ] ) ]
       {
           VALUES ( { <value> | DEFAULT | NULL } [ , ... ] ) [ , ( ... ) ]
         | <query>
       }
```

## Required parameters

`target_table`
:   Specifies the target table into which to insert rows.

`VALUES ( value | DEFAULT | NULL [ , ... ] )  [ , ( ... ) ]`
:   Specifies one or more values to insert into the corresponding columns in the target table.

    In a `VALUES` clause, you can specify the following:

    * `value`: Inserts the explicitly-specified value. The value can be a literal or an expression
      that evaluates to a single value.
    * `DEFAULT`: Inserts the default value for the corresponding column in the target table.
    * `NULL`: Inserts a `NULL` value.

    Each value in the clause must be separated by a comma.

    You can insert multiple rows by specifying additional sets of values in the clause. For more information, see the
    Usage notes and the Examples.

`query`
:   Specify a [query](../constructs.md) statement that returns values to be inserted into the corresponding
    columns. This allows you to insert rows into a target table from one or more source tables.

## Optional parameters

`OVERWRITE`
:   Specifies that the target table should be truncated before inserting the values into the table. Note that specifying this option does
    not affect the access control privileges on the table.

    INSERT statements with `OVERWRITE` can be processed within the scope of the current transaction, which avoids DDL statements that
    commit a transaction, such as:

    > ```sqlexample
    > DROP TABLE t;
    > CREATE TABLE t AS SELECT * FROM ... ;
    > ```

    Default: No value (the target table is not truncated before performing the inserts).

`( target_col_name [ , ... ] )`
:   Specifies one or more columns in the target table into which the corresponding values are inserted. The number of target columns specified
    must match the number of specified values or columns (if the values are the results of a query) in the `VALUES` clause.

    Default: No value (all the columns in the target table are updated).

## Usage notes

* Using a single INSERT command, you can insert multiple rows into a table by specifying additional sets of values separated by commas in
  the VALUES clause.

  For example, the following clause would insert 3 rows in a 3-column table, with values `1`, `2`, and `3` in the first two rows and
  values `2`, `3`, and `4` in the third row:

  ```sqlexample
  VALUES ( 1, 2, 3 ) ,
         ( 1, 2, 3 ) ,
         ( 2, 3, 4 )
  ```
* To use the OVERWRITE option on INSERT, you must use a role that has DELETE privilege on the table because OVERWRITE will delete the
  existing records in the table.
* Some types of expressions can’t be specified in the VALUES clause, including the following expressions:

  + Subqueries

    For example:

    ```sqlexample
    ... VALUES (SELECT id FROM other_table)
    ```
  + Values of the [semi-structured](../data-types-semistructured.md) or
    [structured](../data-types-structured.md) data type.

    For example:

    ```sqlexample
    ... VALUES (ARRAY_CONSTRUCT(1, 2, 3))
    ```
  + [Window functions](../functions-window.md)

    For example:

    ```sqlexample
    ... VALUES (ROW_NUMBER() OVER (...))
    ```
  + [Aggregate functions](../functions-aggregation.md)

    For example:

    ```sqlexample
    ... VALUES (SUM(x))
    ```

  As an alternative to the VALUES clause, specify the expression in a `query` clause. For example, you
  can replace the following expression:

  ```sqlexample
  INSERT INTO table1 (ID, varchar1, variant1)
      VALUES (4, 'Fourier', PARSE_JSON('{ "key1": "value1", "key2": "value2" }'));
  ```

  with this expression:

  ```sqlexample
  INSERT INTO table1 (ID, varchar1, variant1)
      SELECT 4, 'Fourier', PARSE_JSON('{ "key1": "value1", "key2": "value2" }');
  ```
* The VALUES clause is limited to 200,000 rows. This limit applies to a single INSERT INTO … VALUES
  statement and a single INSERT INTO … SELECT … FROM VALUES statement. Consider using the
  [COPY INTO <table>](copy-into-table.md) command to perform a bulk data load. For more information
  about using the VALUES clause in a SELECT statement, see [VALUES](../constructs/values.md).
* For information about inserting data into hybrid tables, see [Loading data](../../user-guide/tables-hybrid-create.md).

## Examples

The following examples use the INSERT command.

### Single row insert using a query

Convert three string values to dates or timestamps and insert them into a single row in the `mytable` table:

```sqlexample
CREATE OR REPLACE TABLE mytable (
  col1 DATE,
  col2 TIMESTAMP_NTZ,
  col3 TIMESTAMP_NTZ);

DESC TABLE mytable;
```

```output
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type             | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| COL1 | DATE             | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| COL2 | TIMESTAMP_NTZ(9) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| COL3 | TIMESTAMP_NTZ(9) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

```sqlexample
INSERT INTO mytable
  SELECT
    TO_DATE('2013-05-08T23:39:20.123'),
    TO_TIMESTAMP('2013-05-08T23:39:20.123'),
    TO_TIMESTAMP('2013-05-08T23:39:20.123');

SELECT * FROM mytable;
```

```output
+------------+-------------------------+-------------------------+
| COL1       | COL2                    | COL3                    |
|------------+-------------------------+-------------------------|
| 2013-05-08 | 2013-05-08 23:39:20.123 | 2013-05-08 23:39:20.123 |
+------------+-------------------------+-------------------------+
```

Similar to previous example, but specify to update only the first and third columns in the table:

```sqlexample
INSERT INTO mytable (col1, col3)
  SELECT
    TO_DATE('2013-05-08T23:39:20.123'),
    TO_TIMESTAMP('2013-05-08T23:39:20.123');

SELECT * FROM mytable;
```

```output
+------------+-------------------------+-------------------------+
| COL1       | COL2                    | COL3                    |
|------------+-------------------------+-------------------------|
| 2013-05-08 | 2013-05-08 23:39:20.123 | 2013-05-08 23:39:20.123 |
| 2013-05-08 | NULL                    | 2013-05-08 23:39:20.123 |
+------------+-------------------------+-------------------------+
```

### Multi-row insert using explicitly-specified values

Create the `employees` table and insert four rows of data into it by providing sets of values in a
comma-separated list in the VALUES clause:

```sqlexample
CREATE TABLE employees (
  first_name VARCHAR,
  last_name VARCHAR,
  workphone VARCHAR,
  city VARCHAR,
  postal_code VARCHAR);

INSERT INTO employees
  VALUES
    ('May', 'Franklin', '1-650-249-5198', 'San Francisco', 94115),
    ('Gillian', 'Patterson', '1-650-859-3954', 'San Francisco', 94115),
    ('Lysandra', 'Reeves', '1-212-759-3751', 'New York', 10018),
    ('Michael', 'Arnett', '1-650-230-8467', 'San Francisco', 94116);

SELECT * FROM employees;
```

```output
+------------+-----------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+-----------+----------------+---------------+-------------|
| May        | Franklin  | 1-650-249-5198 | San Francisco | 94115       |
| Gillian    | Patterson | 1-650-859-3954 | San Francisco | 94115       |
| Lysandra   | Reeves    | 1-212-759-3751 | New York      | 10018       |
| Michael    | Arnett    | 1-650-230-8467 | San Francisco | 94116       |
+------------+-----------+----------------+---------------+-------------+
```

In multi-row inserts, make sure that the data types of the inserted values are consistent across the rows because the data type of the
first row is used as a guide. Create a table and insert two rows:

```sqlexample
CREATE OR REPLACE TABLE demo_insert_type_mismatch (v VARCHAR);
```

The first insert works as expected:

```sqlexample
INSERT INTO demo_insert_type_mismatch (v) VALUES
  ('three'),
  ('four');
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       2 |
+-------------------------+
```

The second insert fails because the data type of the value in the second row (`'d'`) is a string, which is
different from the numeric data type of the value in the first row (`3`). The insert fails even though both values
can be [coerced](../data-type-conversion.md) to VARCHAR, which is the data type of the column in
the table. The insert fails even though the data type of the value `'d'` is the same as the data type of column `v`:

```sqlexample
INSERT INTO demo_insert_type_mismatch (v) VALUES
  (3),
  ('d');
```

```output
100038 (22018): DML operation to table DEMO_INSERT_TYPE_MISMATCH failed on column V with error: Numeric value 'd' is not recognized
```

When the data types are consistent across the rows, the insert succeeds, and both numeric values are coerced to the VARCHAR data type:

```sqlexample
INSERT INTO demo_insert_type_mismatch (v) VALUES
  (3),
  (4);
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       2 |
+-------------------------+
```

### Multi-row insert using query

Insert multiple rows of data from the `contractors` table into the `employees` table:

* Select only those rows where the `worknum` column contains area code `650`.
* Insert a NULL value in the `city` column.

```sqlexample
SELECT * FROM employees;
```

```output
+------------+-----------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+-----------+----------------+---------------+-------------|
| May        | Franklin  | 1-650-249-5198 | San Francisco | 94115       |
| Gillian    | Patterson | 1-650-859-3954 | San Francisco | 94115       |
| Lysandra   | Reeves    | 1-212-759-3751 | New York      | 10018       |
| Michael    | Arnett    | 1-650-230-8467 | San Francisco | 94116       |
+------------+-----------+----------------+---------------+-------------+
```

```sqlexample
CREATE TABLE contractors (
  contractor_first VARCHAR,
  contractor_last VARCHAR,
  worknum VARCHAR,
  city VARCHAR,
  zip_code VARCHAR);

INSERT INTO contractors
  VALUES
    ('Bradley', 'Greenbloom', '1-650-445-0676', 'San Francisco', 94110),
    ('Cole', 'Simpson', '1-212-285-8904', 'New York', 10001),
    ('Laurel', 'Slater', '1-650-633-4495', 'San Francisco', 94115);

SELECT * FROM contractors;
```

```output
+------------------+-----------------+----------------+---------------+----------+
| CONTRACTOR_FIRST | CONTRACTOR_LAST | WORKNUM        | CITY          | ZIP_CODE |
|------------------+-----------------+----------------+---------------+----------|
| Bradley          | Greenbloom      | 1-650-445-0676 | San Francisco | 94110    |
| Cole             | Simpson         | 1-212-285-8904 | New York      | 10001    |
| Laurel           | Slater          | 1-650-633-4495 | San Francisco | 94115    |
+------------------+-----------------+----------------+---------------+----------+
```

```sqlexample
INSERT INTO employees(first_name, last_name, workphone, city, postal_code)
  SELECT contractor_first, contractor_last, worknum, NULL, zip_code
    FROM contractors
    WHERE CONTAINS(worknum,'650');

SELECT * FROM employees;
```

```output
+------------+------------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME  | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+------------+----------------+---------------+-------------|
| May        | Franklin   | 1-650-249-5198 | San Francisco | 94115       |
| Gillian    | Patterson  | 1-650-859-3954 | San Francisco | 94115       |
| Lysandra   | Reeves     | 1-212-759-3751 | New York      | 10018       |
| Michael    | Arnett     | 1-650-230-8467 | San Francisco | 94116       |
| Bradley    | Greenbloom | 1-650-445-0676 | NULL          | 94110       |
| Laurel     | Slater     | 1-650-633-4495 | NULL          | 94115       |
+------------+------------+----------------+---------------+-------------+
```

Insert multiple rows of data from the `contractors` table into the `employees` table using a common table expression:

```sqlexample
INSERT INTO employees (first_name, last_name, workphone, city, postal_code)
  WITH cte AS
    (SELECT contractor_first AS first_name,
            contractor_last AS last_name,
            worknum AS workphone,
            city,
            zip_code AS postal_code
       FROM contractors)
  SELECT first_name, last_name, workphone, city, postal_code
    FROM cte;
```

Insert columns from two tables (`emp_addr`, `emp_ph`) into a third table (`emp`) using an INNER JOIN on the `id`
column in the source tables:

```sqlexample
INSERT INTO emp (id, first_name, last_name, city, postal_code, ph)
  SELECT a.id, a.first_name, a.last_name, a.city, a.postal_code, b.ph
    FROM emp_addr a
    INNER JOIN emp_ph b ON a.id = b.id;
```

### Multi-row insert for JSON data

Insert two JSON objects into a VARIANT column in a table:

```sqlexample
CREATE TABLE prospects (column1 VARIANT);

INSERT INTO prospects
  SELECT PARSE_JSON(column1)
  FROM VALUES
  ('{
    "_id": "57a37f7d9e2b478c2d8a608b",
    "name": {
      "first": "Lydia",
      "last": "Williamson"
    },
    "company": "Miralinz",
    "email": "lydia.williamson@miralinz.info",
    "phone": "+1 (914) 486-2525",
    "address": "268 Havens Place, Dunbar, Rhode Island, 02801"
  }')
  , ('{
    "_id": "57a37f7d622a2b1f90698c01",
    "name": {
      "first": "Denise",
      "last": "Holloway"
    },
    "company": "DIGIGEN",
    "email": "denise.holloway@digigen.net",
    "phone": "+1 (979) 587-3021",
    "address": "441 Dover Street, Ada, New Mexico, 87105"
  }');
```

### Insert using OVERWRITE

This example uses INSERT with OVERWRITE to rebuild the `sf_employees` table from `employees` after new records were added
to the `employees` table.

Here is the initial data for both tables:

```sqlexample
SELECT * FROM employees;
```

```output
+------------+-----------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+-----------+----------------+---------------+-------------|
| May        | Franklin  | 1-650-111-1111 | San Francisco | 94115       |
| Gillian    | Patterson | 1-650-222-2222 | San Francisco | 94115       |
| Lysandra   | Reeves    | 1-212-222-2222 | New York      | 10018       |
| Michael    | Arnett    | 1-650-333-3333 | San Francisco | 94116       |
+------------+-----------+----------------+---------------+-------------+
```

```sqlexample
SELECT * FROM sf_employees;
```

```output
+------------+-----------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+-----------+----------------+---------------+-------------|
| Mary       | Smith     | 1-650-999-9999 | San Francisco | 94115       |
+------------+-----------+----------------+---------------+-------------+
```

This statement inserts rows into the `sf_employees` table using the OVERWRITE clause:

```sqlexample
INSERT OVERWRITE INTO sf_employees
  SELECT * FROM employees
  WHERE city = 'San Francisco';
```

Because the INSERT used the OVERWRITE clause, the old rows from `sf_employees` are gone:

```sqlexample
SELECT * FROM sf_employees;
```

```output
+------------+-----------+----------------+---------------+-------------+
| FIRST_NAME | LAST_NAME | WORKPHONE      | CITY          | POSTAL_CODE |
|------------+-----------+----------------+---------------+-------------|
| May        | Franklin  | 1-650-111-1111 | San Francisco | 94115       |
| Gillian    | Patterson | 1-650-222-2222 | San Francisco | 94115       |
| Michael    | Arnett    | 1-650-333-3333 | San Francisco | 94116       |
+------------+-----------+----------------+---------------+-------------+
```

### Write to a v3 Apache Iceberg™ table

The following example inserts a row into an Apache Iceberg™ table that conforms to v3 of the Apache Iceberg™ table specification:

```sqlexample
INSERT INTO my_v3_iceberg_table (id, payload) VALUES (1, PARSE_JSON('{"name": "Alice", "age": 30}'));
```

---
title: INSERT (multi-table)
source: https://docs.snowflake.com/en/sql-reference/sql/insert-multi-table.md
section: SQL Commands
---

# INSERT (multi-table)

Updates multiple tables by inserting one or more rows with column values (from a query) into the tables. Supports both unconditional and
conditional inserts.

See also:
:   [INSERT](insert.md)

## Syntax

```sqlsyntax
-- Unconditional multi-table insert
INSERT [ OVERWRITE ] ALL
  intoClause [ ... ]
<subquery>

-- Conditional multi-table insert
INSERT [ OVERWRITE ] { FIRST | ALL }
  { WHEN <condition> THEN intoClause [ ... ] }
  [ ... ]
  [ ELSE intoClause ]
<subquery>
```

Where:

> ```sqlsyntax
> intoClause ::=
>   INTO <target_table> [ ( <target_col_name> [ , ... ] ) ] [ VALUES ( { <source_col_name> | DEFAULT | NULL } [ , ... ] ) ]
> ```

## Required parameters

`ALL`
:   Unconditional multi-table insert only

    Specifies that each row executes every `INTO` clause in the INSERT statement.

    > **Note:**
    >
    > If the `FIRST` keyword is specified in an unconditional multi-table insert (or the `ALL` keyword is not specified),
    > Snowflake returns a syntax error.

`FIRST` or `ALL`
:   Conditional multi-table insert only

    `FIRST`
    :   Specifies that each row executes only the first `WHEN` clause for which the condition evaluates to TRUE. If no `WHEN`
        clause evaluates to TRUE, then the `ELSE` clause, if present, executes.

    `ALL`
    :   Specifies that each row executes all `WHEN` clauses. If no `WHEN` clause evaluates to TRUE, then the `ELSE`
        clause, if present, executes.

    > **Note:**
    >
    > * A conditional multi-table insert must contain at least one `WHEN` clause.
    > * Each `WHEN` clause can contain multiple `INTO` clauses and the `INTO` clauses can insert into the same target
    >   table.
    > * To always execute a `WHEN` clause, use:
    >
    >   > `WHEN 1=1 THEN ...`

`condition`
:   Conditional multi-table insert only

    Specifies the condition that must evaluate to TRUE in order for the values specified in the `INTO` clause to be inserted. The
    condition can be a [SELECT](select.md) list.

`target_table`
:   Specifies a target table into which to insert rows. The same table may be referenced more than once (in separate `WHEN` clauses).

    Multiple tables can be targeted by including a `INTO` clause for each table.

`subquery`
:   Specifies the [SELECT](select.md) list that determines the source of the values to be inserted into the target tables.

## Optional parameters

`OVERWRITE`
:   Specifies to truncate the target tables before inserting into the tables, while retaining access control privileges on the tables.

    INSERT statements with `OVERWRITE` can be processed within the scope of the current transaction, which avoids DDL statements that
    commit a transaction, such as:

    > ```sqlexample
    > DROP TABLE t;
    > CREATE TABLE t AS SELECT * FROM ... ;
    > ```

    Default: No value (the target tables are not truncated before performing the inserts)

`( target_col_name [ , ... ] )`
:   Specifies one or more columns in the target table into which the values from the corresponding column in the source is inserted. The
    number of target columns specified must match the number of values specified in the source.

    Default: No value (all the columns in the target table are updated)

`VALUES ( source_col_name | DEFAULT | NULL [ , ... ] )`
:   Specifies one or more values to insert into the corresponding columns in the target table. The values can be:

    * `source_col_name`: Specifies the column in the source that contains the value to be inserted into the corresponding column in
      the target table.
    * `DEFAULT`: Inserts the default value for the corresponding column in the target table.
    * `NULL`: Inserts a `NULL` value.

    Each value in the clause must be separated by a comma. Also, the number of values specified must match the number of columns specified
    for the target table.

    Default: No value (values from all the columns in the source are inserted into the corresponding columns in the target table)

## Usage notes

* In an `INTO` clause, the `VALUES` clause is optional. If it is omitted, the values from the [SELECT](select.md) list are inserted
  into the target table in their natural order.
* Expressions in `WHEN` clauses (for conditional multi-table inserts) and `VALUES` clauses can only reference the subquery
  via an alias. The alias must be one of the following:

  > + Explicit alias specified for a [SELECT](select.md) expression.
  > + Default alias for an expression.
  > + Positional alias ($1, $2, etc.).

  In addition, columns and expressions of the subquery that are not in the outermost [SELECT](select.md) list can not be referenced in
  `WHEN` and `VALUES` clauses. For details, see Examples (in this topic).
* In each row produced by the `subquery`, the value in `source_col_name` must be compatible with the data type of the
  corresponding `target_col_name`. This rule applies even to rows that would be filtered out by the `condition` in the
  `WHEN` clause. The order of operations does not guarantee that the filter in the `WHEN` clause is applied before the value in
  `source_col_name` is evaluated for data type compatibility.

## Examples

### Unconditional multi-table inserts

Insert each row in the `src` table twice into tables `t1` and `t2`. In this example, the inserted rows are not
identical; each of the inserted rows has different values/orders because we use the VALUES clause to vary the data:

> ```sqlexample
> INSERT ALL
>   INTO t1
>   INTO t1 (c1, c2, c3) VALUES (n2, n1, DEFAULT)
>   INTO t2 (c1, c2, c3)
>   INTO t2 VALUES (n3, n2, n1)
> SELECT n1, n2, n3 from src;
>
> -- If t1 and t2 need to be truncated before inserting, OVERWRITE must be specified
> INSERT OVERWRITE ALL
>   INTO t1
>   INTO t1 (c1, c2, c3) VALUES (n2, n1, DEFAULT)
>   INTO t2 (c1, c2, c3)
>   INTO t2 VALUES (n3, n2, n1)
> SELECT n1, n2, n3 from src;
> ```

### Conditional multi-table inserts

The next two examples show how to create conditional multi-table inserts by
using `WHEN` clauses and an `ELSE` clause to decide which table(s), if any, each row is inserted into.

These examples also show the difference between using `INSERT ALL` and
`INSERT FIRST`.

Execute all `WHEN` clauses with an `ELSE` clause:

* Rows where `n1 > 100` also satisfy the condition `n1 > 10` and are therefore inserted in `t1` twice when the
  `ALL` keyword is used.
* Rows where `n1 <= 10` satisfy the `ELSE` case and are inserted in `t2`.

  ```sqlexample
  INSERT ALL
    WHEN n1 > 100 THEN
      INTO t1
    WHEN n1 > 10 THEN
      INTO t1
      INTO t2
    ELSE
      INTO t2
  SELECT n1 from src;
  ```

If the table src contains 3 rows, in which n1 has the values 1, 11, and 101,
then after the INSERT statement the tables t1 and t2 will hold the values shown
below:

t1:

|  |  |
| --- | --- |
| 101 | 101 > 100, so the first `WHEN` clause inserts into t1 |
| 101 | 101 > 10, so the second `WHEN` clause also inserts into t1 |
| 11 | 11 > 10, so the second `WHEN` clause inserts into t1 |

The row with `n1 = 1` is not inserted into t1 because it does not satisfy
any `WHEN` clause that inserts into t1, and because the `ELSE`
clause does not insert into t1.

t2:

|  |  |
| --- | --- |
| 101 | 101 > 10, so the second `WHEN` clause inserts into t2. (The row also qualifies for the clause `WHEN n1 > 100`; however, that clause does not insert into t2.) |
| 11 | 11 > 10, so the second `WHEN` clause inserts into t2 |
| 1 | the row didn’t satisfy any of the `WHEN` clauses, so it’s inserted into t2 by the `ELSE` clause |

The next example is similar to the previous example, except with a `FIRST` clause.

> ```sqlexample
> INSERT FIRST
>   WHEN n1 > 100 THEN
>     INTO t1
>   WHEN n1 > 10 THEN
>     INTO t1
>     INTO t2
>   ELSE
>     INTO t2
> SELECT n1 from src;
> ```

If the table src contains 3 rows, in which n1 has the values 1, 11, and 101, then after the INSERT statement the tables t1 and t2 will
hold the values shown below:

t1:

|  |  |
| --- | --- |
| 101 | 101 > 100, so the first `WHEN` clause inserts into t1 |
| 11 | 11 > 10, so the second `WHEN` clause inserts into t1 |

The row with `n1 = 1` is not inserted into t1 because it does not satisfy any `WHEN` clause that inserts into t1, and because
the `ELSE` clause does not insert into t1.

Unlike in the previous example, which used `ALL`, the row with `n1 = 101` is inserted into t1 only once because the first
`WHEN` clause evaluates to TRUE so the second `WHEN` clause is ignored.

t2:

|  |  |
| --- | --- |
| 11 | 11 > 10, so the second `WHEN` clause inserts into t2 |
| 1 | the row didn’t satisfy any of the `WHEN` clauses, so it’s inserted into t2 by the `ELSE` clause |

The row `n1 = 101` is not inserted into t2 because 101 is greater than 100, so it matches the first `WHEN` clause, but the
first `WHEN` clause doesn’t insert into t2, and the statement doesn’t check any of the other `WHEN` clauses or use the
`ELSE` clause because the row already qualified for the first `WHEN` clause.

### Multi-table inserts with aliases and references

Insert values using a positional alias (`$1`), an explicit alias (`an_alias`), and a default alias (`"10 + 20"`);
this example inserts a single row with values `(1, 50, 30)` into table `t1`:

> ```sqlexample
> INSERT ALL
>   INTO t1 VALUES ($1, an_alias, "10 + 20")
> SELECT 1, 50 AS an_alias, 10 + 20;
> ```

Illustrate inserting values from columns that must be selected to be referenced (`b` and `c` in table `src`):

> ```sqlexample
> -- Returns error
>   INSERT ALL
>     WHEN c > 10 THEN
>       INTO t1 (col1, col2) VALUES (a, b)
>   SELECT a FROM src;
>
> -- Completes successfully
>   INSERT ALL
>     WHEN c > 10 THEN
>       INTO t1 (col1, col2) VALUES (a, b)
>   SELECT a, b, c FROM src;
> ```

Illustrate inserting values from a column that cannot be referenced (`src1.key`); instead, it must be selected and aliased:

> ```sqlexample
> -- Returns error
>   INSERT ALL
>     INTO t1 VALUES (src1.key, a)
>   SELECT src1.a AS a
>   FROM src1, src2 WHERE src1.key = src2.key;
>
> -- Completes successfully
>   INSERT ALL
>     INTO t1 VALUES (key, a)
>   SELECT src1.key AS key, src1.a AS a
>   FROM src1, src2 WHERE src1.key = src2.key;
> ```

---
title: LIST
source: https://docs.snowflake.com/en/sql-reference/sql/list.md
section: SQL Commands
---

# LIST

Returns a list of files from one of the following Snowflake storage features:

* Stage

  + Named internal
  + Named external
  + For a specified table
  + For the current user
* [Git repository clone in Snowflake](../../developer-guide/git/git-overview.md)

LIST can be abbreviated to LS.

See also:
:   [REMOVE](remove.md), [PUT](put.md), [COPY INTO <table>](copy-into-table.md), [COPY INTO <location>](copy-into-location.md), [GET](get.md)

## Syntax

The syntax differs depending on whether you’re listing files in a stage or a Git repository clone.

### For a stage

```sqlsyntax
LIST { internalStage | externalStage } [ PATTERN = '<regex_pattern>' ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```
>
> ```sqlsyntax
> externalStage ::=
>   @[<namespace>.]<ext_stage_name>[/<path>]
> ```

### For a Git repository clone

```sqlsyntax
LIST repositoryClone [ PATTERN = '<regex_pattern>' ]
```

Where:

> ```sqlsyntax
> repositoryClone ::=
>   @[<namespace>.]<repository_clone>/<path>
> ```

## Required parameters

### For a stage

`internalStage | externalStage`
:   Specifies the location where the data files are staged:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are in the specified named internal stage. |
    > | `@[namespace.]ext_stage_name[/path]` | Files are in the specified named external stage. |
    > | `@[namespace.]%table_name[/path]` | Files are in the stage for the specified table. |
    > | `@~[/path]` | Files are in the stage for the current user. |

    Where:

    * `namespace` is the database and/or schema in which the named stage or table resides. It is optional if a database and
      schema are currently in use within the session; otherwise, it is required.
    * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
      common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud storage
      services.

    > **Note:**
    >
    > If the stage name or path includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'` for a
    > stage named `"my stage"`).

    > **Tip:**
    >
    > Specifying a path provides a scope for the LIST command, potentially reducing the amount of time required to run the command.

### For a Git repository clone

`repositoryClone`
:   Specifies the name of the [repository clone](create-git-repository.md) and the branch, tag, or commit for
    which to list files.

    `@[namespace.]repository_clone/path`

    When listing files from a Git repository clone, the `path` is required and must begin with one of the following:

    > |  |  |
    > | --- | --- |
    > | `branches/branch_name` | List files from the specified branch. |
    > | `tags/tag_name` | List files from the specified tag. |
    > | `commits/commit_hash` | List files from the commit specified by the commit hash. |

    > **Note:**
    >
    > If the repository clone name or path includes spaces or special characters, it must be enclosed in single quotes (for example,
    > `'@"my repository"'` for a repository named `"my repository"`).

## Optional parameters

`PATTERN = 'regex_pattern'`
:   Specifies a regular expression pattern for filtering files from the output. The command lists all files in the specified `path`
    and applies the regular expression pattern on each of the files found.

## Usage notes

* To run this command with an external stage that uses a storage integration,
  you must use a role that has or inherits the USAGE privilege on the storage integration.

  For more information, see [Stage privileges](../../user-guide/security-access-control-privileges.md).
* In contrast to named stages, table and user stages are not first-class database objects; rather, they are implicit stages associated with
  the table/user. As such, they have no grantable privileges of their own:

  + You can always list files in your user stage (i.e. no privileges are required).
  + To list files in a table stage, you must use a role that has the OWNERSHIP privilege on the table.
  + PATTERN supports the [Java Pattern class](https://docs.oracle.com/javase/8/docs/api/java/util/regex/Pattern.html) syntax.

## Output

The command returns columns in the following tables. Column values differ depending on whether you’re using LIST with a stage or Git
repository clone.

### For a stage

| Column | Data type | Description |
| --- | --- | --- |
| name | VARCHAR | Name of the staged file |
| size | NUMBER | Size of the file compressed (in bytes) |
| md5 | VARCHAR | The MD5 column stores an MD5 hash of the contents of the staged data file.  For internal stages with default encryption (SNOWFLAKE_FULL), during upload the source file is encrypted with a random key, and its resulting MD5 digest will always differ from the original local file.  Amazon S3 stages report the value via the S3 eTag field, which might not be an MD5 hash of the file contents.  For Google Cloud stages that use a customer-managed encryption key (CMEK), md5 is expected to be NULL.  For more information, see [Customer-managed encryption keys](https://cloud.google.com/storage/docs/encryption/customer-managed-keys). |
| sha1 | VARCHAR | Not used |
| last_modified | VARCHAR | Timestamp when the file was last updated in the stage |

### For a Git repository clone

| Column | Data type | Description |
| --- | --- | --- |
| name | VARCHAR | Full file path with extension |
| size | NUMBER | Size of the file compressed (in bytes) |
| md5 | VARCHAR | Not used |
| sha1 | VARCHAR | A unique identifier generated by applying the SHA-1 hashing algorithm to the file’s contents. It is used by Git to track and reference the exact version of a file in the repository, and can be used to detect changes in the file’s content. |
| last_modified | VARCHAR | Timestamp of the commit associated with the listed files. This does not necessarily indicate when the file content was last changed. |

## Examples

### For a stage

List all the files in the stage for the `mytable` table:

```sqlexample
LIST @%mytable;
```

List all the files in the `path1` path of the `mystage` named stage:

```sqlexample
LIST @mystage/path1;
```

List the files that match a regular expression (i.e. all file names containing the string `data_0`) in the stage for the `mytable`
table:

```sqlexample
LIST @%mytable PATTERN='.*data_0.*';
```

List the files in the `/analysis/` path of the `my_csv_stage` named stage that match a regular expression (i.e. all file names containing
the string `data_0`):

```sqlexample
LIST @my_csv_stage/analysis/ PATTERN='.*data_0.*';
```

Use the abbreviated form of the command to list all the files in the stage for the current user:

```sqlexample
LS @~;
```

### For a Git repository clone

For examples, see [View a list of repository files](../../developer-guide/git/git-operations.md).

---
title: MERGE
source: https://docs.snowflake.com/en/sql-reference/sql/merge.md
section: SQL Commands
---

# MERGE

Inserts, updates, and deletes values in a table that are based on values in a second table or a subquery. Merging can be
useful if the second table is a change log that contains new rows (to be inserted), modified rows (to be updated),
or marked rows (to be deleted) in the target table.

The command supports semantics for handling the following cases:

* Values that match (for updates and deletes).
* Values that don’t match (for inserts).

See also:
:   [DELETE](delete.md) , [UPDATE](update.md)

## Syntax

```sqlsyntax
MERGE INTO <target_table>
  USING <source>
  ON <join_expr>
  { matchedClause | notMatchedClause } [ ... ]
```

Where:

> ```sqlsyntax
> matchedClause ::=
>   WHEN MATCHED
>     [ AND <case_predicate> ]
>     THEN { UPDATE { ALL BY NAME | SET <col_name> = <expr> [ , <col_name> = <expr> ... ] } | DELETE } [ ... ]
> ```
>
> ```sqlsyntax
> notMatchedClause ::=
>    WHEN NOT MATCHED
>      [ AND <case_predicate> ]
>      THEN INSERT { ALL BY NAME | [ ( <col_name> [ , ... ] ) ] VALUES ( <expr> [ , ... ] ) }
> ```

## Parameters

`target_table`
:   Specifies the table to merge.

`source`
:   Specifies the table or subquery to join with the target table.

`join_expr`
:   Specifies the expression on which to join the target table and source.

### `matchedClause` (for updates or deletes)

`WHEN MATCHED ... AND case_predicate`
:   Optionally specifies an expression which, when true, causes the matching case to be executed.

    Default: No value (matching case is always executed)

`WHEN MATCHED ... THEN { UPDATE { ALL BY NAME | SET ... } | DELETE }`
:   Specifies the action to perform when the values match.

    `ALL BY NAME`
    :   Updates all columns in the target table with values from the source. Each column in
        the target table is updated with the values of the column with the same name from the source.

        The target table and source must have the same number of columns and the same names for all of the
        columns. However, the column order can be different between the target table and the source.

    `SET col_name = expr [ , col_name = expr ... ]`
    :   Updates the specified column in the target table by using the corresponding expression for the new column
        value (can refer to both the target and source relations).

        In a single `SET` subclause, you can specify multiple columns to update.

    `DELETE`
    :   Deletes the rows in the target table when they match the source.

### `notMatchedClause` (for inserts)

`WHEN NOT MATCHED ... AND case_predicate`
:   Optionally specifies an expression which, when true, causes the not-matching case to be executed.

    Default: No value (not-matching case is always executed)

`WHEN NOT MATCHED ... THEN INSERT` . `{ ALL BY NAME | [ ( col_name [ , ... ] ) ] VALUES ( expr [ , ... ] ) }`
:   Specifies the action to perform when the values don’t match.

    `ALL BY NAME`
    :   Inserts all columns in the target table with values from the source. Each column in
        the target table is inserted with the values of the column with the same name from the source.

        The target table and source must have the same number of columns and the same names for all of the
        columns. However, the column order can be different between the target table and the source.

    `( col_name [ , ... ] )`
    :   Optionally specifies one or more columns in the target table to be inserted with values from the source.

        Default: No value (all columns in the target table are inserted)

    `VALUES ( expr [ , ... ] )`
    :   Specifies the corresponding expressions for the inserted column values (must refer to the source relations).

## Usage notes

* A single MERGE statement can include multiple matching and not-matching clauses (that is, `WHEN MATCHED ...` and
  `WHEN NOT MATCHED ...`).
* Any matching or not-matching clause that omits the `AND` subclause (default behavior) must be the last of its clause
  type in the statement (for example, a `WHEN MATCHED ...` clause can’t be followed by a `WHEN MATCHED AND ...` clause). Doing
  so results in an unreachable case, which returns an error.

## Duplicate join behavior

When multiple rows in the source table match a single row in the target table, the results can be deterministic or nondeterministic.
This section describes MERGE behavior for these use cases.

### Nondeterministic results for UPDATE and DELETE

When a merge joins a row in the target table against multiple rows in the source, the following join conditions produce nondeterministic
results (that is, the system is unable to determine the source value to use to update or delete the target row):

* A target row is selected to be updated with multiple values (for example, `WHEN MATCHED ... THEN UPDATE`).
* A target row is selected to be both updated and deleted (for example, `WHEN MATCHED ... THEN UPDATE` , `WHEN MATCHED ... THEN DELETE`).

In this situation, the outcome of the merge depends on the value specified for the [ERROR_ON_NONDETERMINISTIC_MERGE](../parameters.md) session
parameter:

* If TRUE (default value), the merge returns an error.
* If FALSE, one row from among the duplicates is selected to perform the update or delete; the row selected is not defined.

### Deterministic results for UPDATE and DELETE

Deterministic merges always complete without error. A merge is deterministic if it meets *at least one* of the following conditions
for each target row:

* One or more source rows satisfy the `WHEN MATCHED ... THEN DELETE` clauses, and no other source rows satisfy any
  `WHEN MATCHED` clauses
* Exactly one source row satisfies a `WHEN MATCHED ... THEN UPDATE` clause, and no other source rows satisfy any
  `WHEN MATCHED` clauses.

This makes MERGE semantically equivalent to the [UPDATE](update.md) and [DELETE](delete.md) commands.

> **Note:**
>
> To avoid errors when multiple rows in the data source (that is, the source table or subquery) match the target table based on the ON
> condition, use [GROUP BY](../constructs/group-by.md) in the source clause to ensure that each target row joins against one row
> (at most) in the source.
>
> In the following example, assume `src` includes multiple rows with the same `k` value. It’s ambiguous which values (`v`) will
> be used to update rows in the target row with the same value of `k`. By using the MAX function and GROUP BY, the query clarifies exactly
> which value of `v` from `src` is used:
>
> ```sqlexample
> MERGE INTO target
>   USING (SELECT k, MAX(v) AS v FROM src GROUP BY k) AS b
>   ON target.k = b.k
>   WHEN MATCHED THEN UPDATE SET target.v = b.v
>   WHEN NOT MATCHED THEN INSERT (k, v) VALUES (b.k, b.v);
> ```

### Deterministic results for INSERT

Deterministic merges always complete without error.

If the MERGE statement contains a `WHEN NOT MATCHED ... THEN INSERT` clause, and if there are no matching rows in the target, and if the
source contains duplicate values, then the target gets one copy of the row for *each* copy in the source. For an example,
see Perform a merge with source duplicates.

## Examples

The following examples use the MERGE command:

* Perform a basic merge that updates values
* Perform a basic merge with multiple operations
* Perform a merge by using ALL BY NAME
* Perform a merge with source duplicates
* Perform a merge with deterministic and nondeterministic results
* Perform a merge based on DATE values

### Perform a basic merge that updates values

The following example performs a basic merge that updates values in the target table by using values from the source
table. Create and load two tables:

```sqlexample
CREATE OR REPLACE TABLE merge_example_target (id INTEGER, description VARCHAR);

INSERT INTO merge_example_target (id, description) VALUES
  (10, 'To be updated (this is the old value)');

CREATE OR REPLACE TABLE merge_example_source (id INTEGER, description VARCHAR);

INSERT INTO merge_example_source (id, description) VALUES
  (10, 'To be updated (this is the new value)');
```

Display the values in the tables:

```sqlexample
SELECT * FROM merge_example_target;
```

```output
+----+---------------------------------------+
| ID | DESCRIPTION                           |
|----+---------------------------------------|
| 10 | To be updated (this is the old value) |
+----+---------------------------------------+
```

```sqlexample
SELECT * FROM merge_example_source;
```

```output
+----+---------------------------------------+
| ID | DESCRIPTION                           |
|----+---------------------------------------|
| 10 | To be updated (this is the new value) |
+----+---------------------------------------+
```

Run the MERGE statement:

```sqlexample
MERGE INTO merge_example_target
  USING merge_example_source
  ON merge_example_target.id = merge_example_source.id
  WHEN MATCHED THEN
    UPDATE SET merge_example_target.description = merge_example_source.description;
```

```output
+------------------------+
| number of rows updated |
|------------------------|
|                      1 |
+------------------------+
```

Display the new values in the target table (the source table is unchanged):

```sqlexample
SELECT * FROM merge_example_target;
```

```output
+----+---------------------------------------+
| ID | DESCRIPTION                           |
|----+---------------------------------------|
| 10 | To be updated (this is the new value) |
+----+---------------------------------------+
```

```sqlexample
SELECT * FROM merge_example_source;
```

```output
+----+---------------------------------------+
| ID | DESCRIPTION                           |
|----+---------------------------------------|
| 10 | To be updated (this is the new value) |
+----+---------------------------------------+
```

### Perform a basic merge with multiple operations

Perform a basic merge with a mix of operations (INSERT, UPDATE, and DELETE).

Create and load two tables:

```sqlexample
CREATE OR REPLACE TABLE merge_example_mult_target (
  id INTEGER,
  val INTEGER,
  status VARCHAR);

INSERT INTO merge_example_mult_target (id, val, status) VALUES
  (1, 10, 'Production'),
  (2, 20, 'Alpha'),
  (3, 30, 'Production');

CREATE OR REPLACE TABLE merge_example_mult_source (
  id INTEGER,
  marked VARCHAR,
  isnewstatus INTEGER,
  newval INTEGER,
  newstatus VARCHAR);

INSERT INTO merge_example_mult_source (id, marked, isnewstatus, newval, newstatus) VALUES
  (1, 'Y', 0, 10, 'Production'),
  (2, 'N', 1, 50, 'Beta'),
  (3, 'N', 0, 60, 'Deprecated'),
  (4, 'N', 0, 40, 'Production');
```

Display the values in the tables:

```sqlexample
SELECT * FROM merge_example_mult_target;
```

```output
+----+-----+------------+
| ID | VAL | STATUS     |
|----+-----+------------|
|  1 |  10 | Production |
|  2 |  20 | Alpha      |
|  3 |  30 | Production |
+----+-----+------------+
```

```sqlexample
SELECT * FROM merge_example_mult_source;
```

```output
+----+--------+-------------+--------+------------+
| ID | MARKED | ISNEWSTATUS | NEWVAL | NEWSTATUS  |
|----+--------+-------------+--------+------------|
|  1 | Y      |           0 |     10 | Production |
|  2 | N      |           1 |     50 | Beta       |
|  3 | N      |           0 |     60 | Deprecated |
|  4 | N      |           0 |     40 | Production |
+----+--------+-------------+--------+------------+
```

The following merge example performs the following actions on the `merge_example_mult_target` table:

* Deletes the row with `id` set to `1` because the `marked` column for the row with the same `id` is
  `Y` in `merge_example_mult_source`.
* Updates the `val` and `status` values in the row with `id` set to `2` with values in the row with the same
  `id` in `merge_example_mult_source`, because `isnewstatus` is set to `1` for the same row in
  `merge_example_mult_source`.
* Updates the `val` value in the row with `id` set to `3` with the value in the row with the same
  `id` in `merge_example_mult_source`. The MERGE statement doesn’t update the `status` value in `merge_example_mult_target`
  because `isnewstatus` is set to `0` for this row in `merge_example_mult_source`.
* Inserts the row with `id` set to `4` because the row exists in `merge_example_mult_source` and there is no
  matching row in `merge_example_mult_target`.

```sqlexample
MERGE INTO merge_example_mult_target
  USING merge_example_mult_source
  ON merge_example_mult_target.id = merge_example_mult_source.id
  WHEN MATCHED AND merge_example_mult_source.marked = 'Y'
    THEN DELETE
  WHEN MATCHED AND merge_example_mult_source.isnewstatus = 1
    THEN UPDATE SET val = merge_example_mult_source.newval, status = merge_example_mult_source.newstatus
  WHEN MATCHED
    THEN UPDATE SET val = merge_example_mult_source.newval
  WHEN NOT MATCHED
    THEN INSERT (id, val, status) VALUES (
      merge_example_mult_source.id,
      merge_example_mult_source.newval,
      merge_example_mult_source.newstatus);
```

```output
+-------------------------+------------------------+------------------------+
| number of rows inserted | number of rows updated | number of rows deleted |
|-------------------------+------------------------+------------------------|
|                       1 |                      2 |                      1 |
+-------------------------+------------------------+------------------------+
```

To see the results of the merge, display the values in the `merge_example_mult_target` table:

```sqlexample
SELECT * FROM merge_example_mult_target ORDER BY id;
```

```output
+----+-----+------------+
| ID | VAL | STATUS     |
|----+-----+------------|
|  2 |  50 | Beta       |
|  3 |  60 | Production |
|  4 |  40 | Production |
+----+-----+------------+
```

### Perform a merge by using ALL BY NAME

The following example performs a merge that inserts and updates values in the target table by using values from the
source table. The example uses the `WHEN MATCHED ... THEN ALL BY NAME` and
`WHEN NOT MATCHED ... THEN ALL BY NAME` subclauses to specify that the merge applies to all columns.

Create two tables with the same number of columns and the same names for the columns,
but with a different order for two of the columns:

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_all (
  id INTEGER,
  x INTEGER,
  y VARCHAR);

CREATE OR REPLACE TABLE merge_example_source_all (
  id INTEGER,
  y VARCHAR,
  x INTEGER);
```

Load the tables:

```sqlexample
INSERT INTO merge_example_target_all (id, x, y) VALUES
  (1, 10, 'Skiing'),
  (2, 20, 'Snowboarding');

INSERT INTO merge_example_source_all (id, y, x) VALUES
  (1, 'Skiing', 10),
  (2, 'Snowboarding', 25),
  (3, 'Skating', 30);
```

Display the values in the tables:

```sqlexample
SELECT * FROM merge_example_target_all;
```

```output
+----+----+--------------+
| ID |  X | Y            |
|----+----+--------------|
|  1 | 10 | Skiing       |
|  2 | 20 | Snowboarding |
+----+----+--------------+
```

```sqlexample
SELECT * FROM merge_example_source_all;
```

```output
+----+--------------+----+
| ID | Y            |  X |
|----+--------------+----|
|  1 | Skiing       | 10 |
|  2 | Snowboarding | 25 |
|  3 | Skating      | 30 |
+----+--------------+----+
```

Run the MERGE statement:

```sqlexample
MERGE INTO merge_example_target_all
  USING merge_example_source_all
  ON merge_example_target_all.id = merge_example_source_all.id
  WHEN MATCHED THEN
    UPDATE ALL BY NAME
  WHEN NOT MATCHED THEN
    INSERT ALL BY NAME;
```

```output
+-------------------------+------------------------+
| number of rows inserted | number of rows updated |
|-------------------------+------------------------|
|                       1 |                      2 |
+-------------------------+------------------------+
```

Display the new values in the target table:

```sqlexample
SELECT *
  FROM merge_example_target_all
  ORDER BY id;
```

```output
+----+----+--------------+
| ID |  X | Y            |
|----+----+--------------|
|  1 | 10 | Skiing       |
|  2 | 25 | Snowboarding |
|  3 | 30 | Skating      |
+----+----+--------------+
```

### Perform a merge with source duplicates

Perform a merge in which the source has duplicate values and the target has no matching values. All copies of the source
record are inserted into the target. For more information, see Deterministic results for INSERT.

Truncate both tables and load new rows into the source table that include duplicates:

```sqlexample
TRUNCATE table merge_example_target;

TRUNCATE table merge_example_source;

INSERT INTO merge_example_source (id, description) VALUES
  (50, 'This is a duplicate in the source and has no match in target'),
  (50, 'This is a duplicate in the source and has no match in target');
```

The `merge_example_target` has no values. Display the values in the
`merge_example_source` table:

```sqlexample
SELECT * FROM merge_example_source;
```

```output
+----+--------------------------------------------------------------+
| ID | DESCRIPTION                                                  |
|----+--------------------------------------------------------------|
| 50 | This is a duplicate in the source and has no match in target |
| 50 | This is a duplicate in the source and has no match in target |
+----+--------------------------------------------------------------+
```

Run the MERGE statement:

```sqlexample
MERGE INTO merge_example_target
  USING merge_example_source
  ON merge_example_target.id = merge_example_source.id
  WHEN MATCHED THEN
    UPDATE SET merge_example_target.description = merge_example_source.description
  WHEN NOT MATCHED THEN
    INSERT (id, description) VALUES
      (merge_example_source.id, merge_example_source.description);
```

```output
+-------------------------+------------------------+
| number of rows inserted | number of rows updated |
|-------------------------+------------------------|
|                       2 |                      0 |
+-------------------------+------------------------+
```

Display the new values in the target table:

```sqlexample
SELECT * FROM merge_example_target;
```

```output
+----+--------------------------------------------------------------+
| ID | DESCRIPTION                                                  |
|----+--------------------------------------------------------------|
| 50 | This is a duplicate in the source and has no match in target |
| 50 | This is a duplicate in the source and has no match in target |
+----+--------------------------------------------------------------+
```

### Perform a merge with deterministic and nondeterministic results

Merge records by using joins that produce nondeterministic and deterministic results.

Create and load two tables:

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_orig (k NUMBER, v NUMBER);

INSERT INTO merge_example_target_orig VALUES (0, 10);

CREATE OR REPLACE TABLE merge_example_src (k NUMBER, v NUMBER);

INSERT INTO merge_example_src VALUES (0, 11), (0, 12), (0, 13);
```

When you perform the merge in the following example, multiple updates conflict with each other. If
the [ERROR_ON_NONDETERMINISTIC_MERGE](../parameters.md) session parameter is set to `true`, the MERGE statement
returns an error. Otherwise, the MERGE statement updates `merge_example_target_clone.v` with a value
(for example, `11`, `12`, or `13`) from one of the duplicate rows (row not defined):

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_clone
  CLONE merge_example_target_orig;

MERGE INTO  merge_example_target_clone
  USING merge_example_src
  ON merge_example_target_clone.k = merge_example_src.k
  WHEN MATCHED THEN UPDATE SET merge_example_target_clone.v = merge_example_src.v;
```

Updates and deletes conflict with each other. If the [ERROR_ON_NONDETERMINISTIC_MERGE](../parameters.md) session
parameter is set to `true`, the MERGE statement returns an error. Otherwise, the MERGE statement either deletes the row
or updates `merge_example_target_clone.v` with a value (for example, `12` or `13`) from one of the
duplicate rows (row not defined):

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_clone
  CLONE merge_example_target_orig;

MERGE INTO merge_example_target_clone
  USING merge_example_src
  ON merge_example_target_clone.k = merge_example_src.k
  WHEN MATCHED AND merge_example_src.v = 11 THEN DELETE
  WHEN MATCHED THEN UPDATE SET merge_example_target_clone.v = merge_example_src.v;
```

Multiple deletes don’t conflict with each other. Joined values that don’t match any clause don’t prevent
the delete (`merge_example_src.v = 13`). The MERGE statement succeeds and the target row is deleted:

```sqlexample
CREATE OR REPLACE TABLE target CLONE merge_example_target_orig;

MERGE INTO merge_example_target_clone
  USING merge_example_src
  ON merge_example_target_clone.k = merge_example_src.k
  WHEN MATCHED AND merge_example_src.v <= 12 THEN DELETE;
```

Joined values that don’t match any clause don’t prevent an update (`merge_example_src.v = 12, 13`).
The MERGE statement succeeds and the target row is set to `target.v = 11`:

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_clone CLONE target_orig;

MERGE INTO merge_example_target_clone
  USING merge_example_src
  ON merge_example_target_clone.k = merge_example_src.k
  WHEN MATCHED AND merge_example_src.v = 11
    THEN UPDATE SET merge_example_target_clone.v = merge_example_src.v;
```

Use GROUP BY in the source clause to ensure that each target row joins against one row
in the source:

```sqlexample
CREATE OR REPLACE TABLE merge_example_target_clone CLONE merge_example_target_orig;

MERGE INTO merge_example_target_clone
  USING (SELECT k, MAX(v) AS v FROM merge_example_src GROUP BY k) AS b
  ON merge_example_target_clone.k = b.k
  WHEN MATCHED THEN UPDATE SET merge_example_target_clone.v = b.v
  WHEN NOT MATCHED THEN INSERT (k, v) VALUES (b.k, b.v);
```

### Perform a merge based on DATE values

In the following example, the `members` table stores the names, addresses, and current fees (`members.fee`) paid to a
local gym. The `signup` table stores each member’s signup date (`signup.date`). The MERGE statement applies a standard
$40 fee to members who joined the gym more than 30 days ago, after the free trial expired:

```sqlexample
MERGE INTO members m
  USING (SELECT id, date
    FROM signup
    WHERE DATEDIFF(day, CURRENT_DATE(), signup.date::DATE) < -30) s
  ON m.id = s.id
  WHEN MATCHED THEN UPDATE SET m.fee = 40;
```

---
title: PUT
source: https://docs.snowflake.com/en/sql-reference/sql/put.md
section: SQL Commands
---

# PUT

Uploads one or more data files from a local file system onto an [internal stage](../../user-guide/data-load-local-file-system-create-stage.md).

After you upload files onto an internal stage, you can load data from the files into a table using the [COPY INTO <table>](copy-into-table.md) command.

> **Note:**
>
> * PUT does not support uploading files onto an external stage. To upload files to an external stage, use the utilities provided
>   by your cloud service.
> * [snowflake.snowpark.FileOperation.put](/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.FileOperation.put) from Snowpark stored procedures does support external stages and bills at normal data transfer rates.
> * The [ODBC driver](../../developer-guide/odbc/odbc.md) supports PUT with Snowflake accounts hosted on the following platforms:
>
>   + Amazon Web Services
>   + Google Cloud
>   + Microsoft Azure

See also:
:   [GET](get.md) , [LIST](list.md) , [REMOVE](remove.md) , [COPY FILES](copy-files.md) , [CREATE STAGE](create-stage.md) , [Overview of data loading](../../user-guide/data-load-overview.md)

## Syntax

```sqlsyntax
PUT file://<absolute_path_to_file>/<filename> internalStage
    [ PARALLEL = <integer> ]
    [ AUTO_COMPRESS = TRUE | FALSE ]
    [ SOURCE_COMPRESSION = AUTO_DETECT | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE ]
    [ OVERWRITE = TRUE | FALSE ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```

## Required parameters

`file://absolute_path_to_file/filename`
:   Specifies the URI for the data files on the client machine, where:

    * `absolute_path_to_file` is the local directory path to the files to upload.
    * `filename` is the name of the file to upload. You can use wildcard characters (`*`, `?`) to upload multiple files. If the
      directory path or filename includes special characters or spaces, enclose the entire file URI in single quotes.

      > **Attention:**
      >
      > Be careful when selecting multiple files using a PUT query. PUT queries that match a large number of files can have significant cost and performance consequences.

    The URI formatting differs depending on your client operating system.

    > Linux/macOS:
    > :   Specify the absolute path to the file from the root directory (`/`).
    >     For example, for a file named `my-data.csv` use `file:///my/file/path/my-data.csv`.
    >
    > Windows:
    > :   Specify the absolute path from the root of the drive where the file or files are located.
    >     For example, for a file named `my-data.csv` use `file://C:temp\my-data.csv`.
    >
    >     If the file path includes special characters, you must enclose the entire path in single quotes and change
    >     the drive and path separator from a backward slash to a forward slash (`/`).
    >     For example, for a file named `my$data.csv`, use: `'file://C:/temp/my$data.csv'`.

    > **Note:**
    >
    > Snowflake doesn’t support tar (tape archive) files.

`internalStage`
:   Specifies the internal stage location to upload the files onto:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are uploaded onto the specified named internal stage. |
    > | `@[namespace.]%table_name[/path]` | Files are uploaded onto the stage for the specified table. |
    > | `@~[/path]` | Files are uploaded onto the stage for the current user. |

    Where:

    * `namespace` is the database or schema that contains the named internal stage or table. It is optional if a
      database and schema are in use within the session.
    * `path` is an optional case-sensitive path for files in the cloud storage location that limits access to a set of files. Paths
      are alternatively called *prefixes* or *folders* by different cloud storage services.

    > **Note:**
    >
    > If the stage name or path includes spaces or special characters, enclose it in single quotes. For example, use `'@"my stage"'`
    > for a stage named `"my stage"`.

## Optional parameters

`PARALLEL = integer`
:   Specifies the number of threads to use for uploading files. Snowflake uploads separate batches of data files by size:

    * Files that are smaller than 64 MB (compressed or uncompressed) are staged in parallel as individual files.
    * Larger files are automatically split into chunks, staged concurrently, and reassembled in the target stage. A single thread can
      upload multiple chunks.

    Increasing the number of threads can improve performance when uploading large files.

    Supported values: Any integer value from `1` (no parallelism) to `99` (use 99 threads for uploading files).

    Default: `4`

    > **Note:**
    >
    > A 16 MB limit applies to older versions of Snowflake drivers, including:
    >
    > * JDBC Driver versions prior to 3.12.1.
    > * ODBC Driver versions prior to 2.20.5.
    > * Python Connector versions prior to 2.2.0.

`AUTO_COMPRESS = TRUE | FALSE`
:   Specifies whether Snowflake uses gzip to compress files during upload:

    * `TRUE`: Snowflake compresses the files (if they are not already compressed).
    * `FALSE`: Snowflake doesn’t compress the files.

    This option does not support other compression types. To use a different compression type, compress the file separately before
    executing the PUT command. Then, identify the compression type using the `SOURCE_COMPRESSION` option.

    Ensure your local folder has sufficient space for Snowflake to compress the data files before staging them. If necessary, set the
    `TEMP`, `TMPDIR` or `TMP` environment variable in your operating system to point to a local folder that contains additional
    free space.

    Default: `TRUE`

`SOURCE_COMPRESSION = AUTO_DETECT | GZIP | BZ2 | BROTLI | ZSTD | DEFLATE | RAW_DEFLATE | NONE`
:   Specifies the method of compression used on already-compressed files that are being staged:

    > | Supported Values | Notes |
    > | --- | --- |
    > | `AUTO_DETECT` | Compression algorithm detected automatically, except for Brotli-compressed files, which cannot currently be detected automatically. If you’re uploading Brotli-compressed files, explicitly use `BROTLI` instead of `AUTO_DETECT`. |
    > | `GZIP` | Doesn’t support the `*.tar.gz` file format. |
    > | `BZ2` | Doesn’t support the `*.tar.bz2` file format. |
    > | `BROTLI` | Must be used if uploading Brotli-compressed files. |
    > | `ZSTD` | Zstandard v0.8 (and higher) supported. |
    > | `DEFLATE` | Deflate-compressed files (with zlib header, RFC1950). |
    > | `RAW_DEFLATE` | Raw Deflate-compressed files (without header, RFC1951). |
    > | `NONE` | Data files have not been compressed. |

    Default: `AUTO_DETECT`

    > **Note:**
    >
    > Snowflake uses this option to detect how the data files were compressed so that they can be uncompressed and the data extracted
    > for uploading; it does not use this option to compress the files.
    >
    > Loading files that were compressed with other utilities is not currently supported.

`OVERWRITE = TRUE | FALSE`
:   Specifies whether Snowflake overwrites an existing file with the same name during upload:

    * `TRUE`: An existing file with the same name is overwritten.
    * `FALSE`: An existing file with the same name is not overwritten.

      Snowflake performs a LIST operation on the stage in the background, which can affect the performance of the PUT operation.

      If attempts to PUT a file fail because a file with the same name exists in the target stage, you can take the following actions:

      + Load the data from the existing file into one or more tables, and remove the file from the stage. Then PUT a file with new or
        updated data onto the stage.
      + Rename the local file, and then attempt the PUT operation again.
      + Set `OVERWRITE = TRUE` in the PUT statement. Do this only if it’s safe to overwrite the existing (staged) file with the same name.

    If your Snowflake account is hosted on Google Cloud, PUT statements don’t recognize when the OVERWRITE parameter is
    set to TRUE. A PUT operation always overwrites any existing files in the target stage with the local files you’re uploading.

    The following clients support the OVERWRITE option for Snowflake accounts hosted on Amazon Web Services or Microsoft Azure:

    > * SnowSQL
    > * Snowflake ODBC Driver
    > * Snowflake JDBC Driver
    > * Snowflake Connector for Python

    Supported values: TRUE, FALSE.

    Default: `FALSE`.

## Usage notes

* The command cannot be executed from the Worksheets  page in either Snowflake web interface; instead, use the
  [SnowSQL client](../../user-guide/snowsql.md) or [Drivers](../../developer-guide/drivers.md) to upload data files,
  or check the documentation for a specific Snowflake client to verify support for this command.

  Alternatively, you can [use the Snowsight UI to upload files onto a name internal stage](../../user-guide/data-load-local-file-system-stage-ui.md).
* File-globbing patterns, like wildcards, are supported unless the files that match the pattern have divergent directory paths.
  The command *does not* support uploading multiple files with divergent directory paths, because Snowflake
  doesn’t preserve file system directory structure when uploading files onto your stage.

  For example, the following PUT statement returns an error since you
  can’t specify multiple files in nested subdirectories.

  ```sqlexample
  PUT file:///tmp/data/** @my_int_stage AUTO_COMPRESS=FALSE;
  ```
* The command does not create or rename files.
* All files stored on internal stages for data loading and unloading operations are automatically encrypted using AES-256 strong encryption
  on the server side. By default, Snowflake provides additional client-side encryption with a 128-bit key
  (with the option to configure a 256-bit key). For more information, see [encryption types for internal stages](create-stage.md).
* The command ignores any duplicate files you attempt to upload to the same stage. A duplicate file is an unmodified file with the same
  name as an already-staged file.

  To overwrite an already-staged file, you must modify the file you are uploading so that its contents are different from the staged file,
  which results in a new checksum for the newly-staged file.
* For the PUT and [GET](get.md) commands,
  an EXECUTION_STATUS of `success` in the [QUERY_HISTORY](../account-usage/query_history.md)
  does *not* mean that data files were successfully uploaded or downloaded.
  Instead, the status indicates that Snowflake received authorization to proceed with the file transfer.

> **Tip:**
>
> For security reasons, the command times out after a set period of time. This can occur when uploading large, uncompressed data files. To
> avoid timeout issues, we recommend compressing large data files using one of the supported compression types before uploading the files.
> Then, specify the compression type for the files using the `SOURCE_COMPRESSION` option.
>
> You can also consider increasing the value of the `PARALLEL` option, which can help with performance when uploading large data files.
>
> Furthermore, to take advantage of parallel operations when loading data into tables (using the
> [COPY INTO <table>](copy-into-table.md) command), we recommend using data files ranging in size from roughly 100 to 250 MB
> compressed. If your data files are larger, consider using a third-party tool to split them into smaller files before compressing
> and uploading them.

## Examples

### Linux and macOS

**Load a file onto an internal stage**

Load a file named `mydata.csv` in the `/tmp/data` directory to an internal stage named
`my_int_stage`:

```sqlexample
PUT file:///tmp/data/mydata.csv @my_int_stage;
```

**Load a file onto a table stage**

Load a file named `orders_001.csv` in the `/tmp/data` directory to the stage for the
`orderstiny_ext` table, with automatic data compression disabled:

```sqlexample
PUT file:///tmp/data/orders_001.csv @%orderstiny_ext
  AUTO_COMPRESS = FALSE;
```

**Load multiple files onto an internal stage**

Use wildcard characters in the filename to upload multiple files:

```sqlexample
PUT file:///tmp/data/orders_*01.csv @my_int_stage
  AUTO_COMPRESS = FALSE;
```

**Specify a file path with special characters**

Enclose a file path with special characters or spaces in single quotes:

```sqlexample
PUT 'file:///tmp/data/orders 001.csv' @my_int_stage
  AUTO_COMPRESS = FALSE;
```

### Windows

**Load a file onto the current user’s stage**

Load a file named `mydata.csv` in the `C:\temp\data` directory onto the stage for the current
user, with automatic data compression enabled:

```sqlexample
PUT file://C:\temp\data\mydata.csv @~
  AUTO_COMPRESS = TRUE;
```

**Specify a file path with special characters**

To specify a Windows file path with special characters, you must
enclose the path in single quotes and change backslashes to forward slashes.

In this example, the file name contains a space (`my data.csv`):

```sqlexample
PUT 'file://C:/temp/data/my data.csv' @my_int_stage
  AUTO_COMPRESS = TRUE;
```

---
title: REMOVE
source: https://docs.snowflake.com/en/sql-reference/sql/remove.md
section: SQL Commands
---

# REMOVE

Removes files from either an external (external cloud storage) or internal (i.e. Snowflake) stage.

For internal stages, the following stage types are supported:

* Named internal stage
* Stage for a specified table
* Stage for the current user

REMOVE can be abbreviated to RM.

See also:
:   [LIST](list.md)

## Syntax

```sqlsyntax
REMOVE { internalStage | externalStage } [ PATTERN = '<regex_pattern>' ]
```

Where:

> ```sqlsyntax
> internalStage ::=
>     @[<namespace>.]<int_stage_name>[/<path>]
>   | @[<namespace>.]%<table_name>[/<path>]
>   | @~[/<path>]
> ```
>
> ```sqlsyntax
> externalStage ::=
>     @[<namespace>.]<ext_stage_name>[/<path>]
> ```

## Required parameters

`internalStage | externalStage`
:   Specifies the location where the data files are staged:

    > |  |  |
    > | --- | --- |
    > | `@[namespace.]int_stage_name[/path]` | Files are in the specified named internal stage. |
    > | `@[namespace.]ext_stage_name[/path]` | Files are in the specified named external stage. |
    > | `@[namespace.]%table_name[/path]` | Files are in the stage for the specified table. |
    > | `@~[/path]` | Files are in the stage for the current user. |

    Where:

    * `namespace` is the database and/or schema in which the named internal stage or table resides. It is optional if a
      database and schema are currently in use within the session; otherwise, it is required.
    * `path` is an optional case-sensitive path for files in the cloud storage location (i.e. files have names that begin with a
      common string) that limits access to a set of files. Paths are alternatively called *prefixes* or *folders* by different cloud
      storage services.

    > **Note:**
    >
    > If the stage name or path includes spaces or special characters, it must be enclosed in single quotes (e.g. `'@"my stage"'`
    > for a stage named `"my stage"`).

## Optional parameters

`PATTERN = 'regex_pattern'`
:   Specifies a regular expression pattern for filtering files to remove. The command lists all files in the specified `path`
    and applies the regular expression pattern on each of the files found.

## Usage notes

* If you are loading data from a file on a stage, do not remove the staged files until the data has been loaded successfully. To check if the data has been loaded successfully, use the [COPY_HISTORY](../functions/copy_history.md) command. Check the `STATUS` column to determine if the data from the file has been loaded. Note that if the status is `Load in progress`, removing the staged file can result in partial loads and data loss.
* To run this command with an external stage that uses a storage integration,
  you must use a role that has or inherits the USAGE privilege on the storage integration.

  For more information, see [Stage privileges](../../user-guide/security-access-control-privileges.md).
* Removing files from an external stage requires granting the following role or permission to Snowflake in your cloud storage account:

  | Cloud Storage Service | Role or Permission | Instructions |
  | --- | --- | --- |
  | Amazon S3 | `s3:DeleteObject` | [Configuring secure access to Amazon S3](../../user-guide/data-load-s3-config.md) |
  | Google Cloud Storage | `storage.objects.delete` | [Configure an integration for Google Cloud Storage](../../user-guide/data-load-gcs-config.md) |
  | Microsoft Azure (Blob storage) | `Storage Blob Data Contributor` | [Configure an Azure container for loading data](../../user-guide/data-load-azure-config.md) |
* The command removes all directories and files that match a specified path. For example, the following statement would match any of
  the following objects in the `mytable` table stage:

  > + `myobject.csv.gz` (file)
  > + `myobject` (directory)
  > + `myobject_new` (directory)
  >
  > ```sqlsyntax
  > rm @%mytable/myobject;
  > ```
* To remove all files for a specific directory, include a forward-slash (`/`) at the end of the path. For example:

  > ```sqlsyntax
  > rm @%mytable/myobject/;
  > ```
* If a REMOVE statement is interrupted before it has completed running, any files already removed by the statement are not restored.

## Examples

Remove all files from the `path1/subpath2` path in a named internal or external stage named `mystage`:

> ```sqlexample
> REMOVE @mystage/path1/subpath2;
> ```

Remove all files from the stage for the `orders` table:

> ```sqlexample
> REMOVE @%orders;
> ```

Use the abbreviated form of the command to remove files whose names match the pattern `*jun*` from the stage for the current user:

> ```sqlexample
> RM @~ pattern='.*jun.*';
> ```

---
title: REVOKE <privilege> … FROM SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-privilege-share.md
section: SQL Commands
---

# REVOKE *<privilege>* … FROM SHARE

Revokes access privileges for databases and other supported database objects (schemas, tables, and views) from a share. Revoking
privileges on these objects effectively removes the objects from the share, disabling access to the objects granted via the database
role in all consumer accounts that have created a database from the share.

For more details, see [About Secure Data Sharing](../../user-guide/data-sharing-intro.md) and [Create and configure shares](../../user-guide/data-sharing-provider.md).

See also:
:   [GRANT <privilege> … TO SHARE](grant-privilege-share.md)

    [REVOKE <privileges> … FROM ROLE](revoke-privilege.md)

## Syntax

```sqlsyntax
REVOKE objectPrivilege ON
     {  DATABASE <name>
      | SCHEMA <name>
      | SEMANTIC VIEW <name>
      | { TABLE <name> | ALL TABLES IN SCHEMA <schema_name> }
      | { EXTERNAL TABLE <name> | ALL EXTERNAL TABLES IN SCHEMA <schema_name> }
      | { ICEBERG TABLE <name> | ALL ICEBERG TABLES IN SCHEMA <schema_name> }
      | { DYNAMIC TABLE <name> | ALL DYNAMIC TABLES IN SCHEMA <schema_name> }
      | { VIEW <name> | ALL VIEWS IN SCHEMA <schema_name> }  }
  FROM SHARE <share_name>
```

Where:

```sqlsyntax
objectPrivilege ::=
-- For DATABASE
   REFERENCE_USAGE [ , ... ]
-- For DATABASE, FUNCTION, or SCHEMA
   USAGE [ , ... ]
-- For SEMANTIC VIEW
   { REFERENCES | SELECT } [ , ... ]
-- For TABLE
   EVOLVE SCHEMA [ , ... ]
-- For EXTERNAL TABLE, ICEBERG TABLE, TABLE, or VIEW
   SELECT [ , ... ]
-- For TAG
   READ
```

## Parameters

`name`
:   Specifies the identifier for the object (database, schema, table, or secure view) for which the specified privilege is revoked.

`schema_name`
:   Specifies the identifier for the schema for which the specified privilege is revoked for all tables or views.

`share_name`
:   Specifies the identifier for the share for which the specified privilege is revoked.

## Usage notes

* Each object privilege must be revoked individually from a role, except for tables, Apache Iceberg™ tables, and views.
  Using an `ALL` clause, you can revoke the SELECT privilege from all the tables or views in the specified schema from a role.
* If you specify a `TABLE` object that is an *Iceberg* table, the command revokes the privilege from that Iceberg table.

## Examples

> ```sqlexample
> REVOKE SELECT ON VIEW mydb.shared_schema.view1 FROM SHARE share1;
>
> REVOKE SELECT ON VIEW mydb.shared_schema.view3 FROM SHARE share1;
>
> REVOKE USAGE ON SCHEMA mydb.shared_schema FROM SHARE share1;
>
> REVOKE SELECT ON ALL TABLES IN SCHEMA mydb.public FROM SHARE share1;
>
> REVOKE SELECT ON ALL ICEBERG TABLES IN SCHEMA mydb.public FROM SHARE share1;
>
> REVOKE SELECT ON ALL DYNAMIC TABLES IN SCHEMA mydb.public FROM SHARE share1;
>
> REVOKE SELECT ON ICEBERG TABLE mydb.shared_schema.iceberg_table_1 FROM SHARE share1;
>
> REVOKE SELECT ON DYNAMIC TABLE mydb TO SHARE share1;
>
> REVOKE USAGE ON SCHEMA mydb.public FROM SHARE share1;
>
> REVOKE USAGE ON DATABASE mydb FROM SHARE share1;
> ```

This example disallows a shared secure view to reference objects from a different database:

> ```sqlexample
> REVOKE REFERENCE_USAGE ON DATABASE database2 FROM SHARE share1;
> ```

---
title: REVOKE <privileges> … FROM APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-privilege-application.md
section: SQL Commands
---

# REVOKE *<privileges>* … FROM APPLICATION

Revokes one or more access privileges on a securable object from an application. The privileges that can be revoked are
object-specific.

For more details about roles and securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

Variations:
:   [GRANT <privileges> … TO APPLICATION](grant-privilege-application.md)

## Syntax

```sqlsyntax
REVOKE {  { globalPrivileges } ON ACCOUNT
        | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { USER | RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
        | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
        | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
       }
     FROM APPLICATION <name>
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      CREATE {
       COMPUTE POOL | DATABASE | WAREHOUSE
      }
      | BIND SERVICE ENDPOINT
      | EXECUTE MANAGED TASK
      | MANAGE WAREHOUSES
      | READ SESSION
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET | CREATE { DATABASE ROLE | SCHEMA }
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=
ADD SEARCH OPTIMIZATION
| CREATE {
    ALERT | EXTERNAL TABLE | FILE FORMAT | FUNCTION
    | IMAGE REPOSITORY | MATERIALIZED VIEW | PIPE | PROCEDURE
    | { AGGREGATION | MASKING | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY
    | SECRET | SEMANTIC VIEW | SEQUENCE | SERVICE | SNAPSHOT | STAGE | STREAM
    | TAG | TABLE | TASK | VIEW
  }
| MODIFY | MONITOR | USAGE
[ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DYNAMIC TABLE
     OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { INSERT | SELECT } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { MASKING | PACKAGES | PASSWORD | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     READ, USAGE [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
```

For more information about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `ALERT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `NETWORK RULE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PROCEDURE`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `STAGE`
    * `STREAM`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`

`object_type_plural`
:   Plural form of `object_type` (e.g. `TABLES`, `VIEWS`).

    Bulk grants on pipes are not allowed.

`name`
:   Specifies the identifier for the recipient application (the role to which the privileges are granted).

## Security requirements

Revoking privileges on individual objects:
:   You can use an [active role](../../user-guide/security-access-control-overview.md) that meets either of the following criteria, or a
    [higher role](../../user-guide/security-access-control-overview.md), to revoke privileges on an object from other application
    roles:

    * The role is identified as the *grantor* of the privilege in the GRANTED_BY column in the [SHOW GRANTS](show-grants.md) output.

      If you have multiple instances of a privilege grant on the specified object, only the instances granted by the active grantor role
      are revoked.
    * The role has the global MANAGE GRANTS privilege.

      If you have multiple instances of a privilege grant on the specified object, all instances are revoked.

      Note that only the SECURITYADMIN system role and higher have the MANAGE GRANTS privilege by default; however, the privilege can be
      granted to custom roles.

    In managed access schemas (schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax), only the schema owner (the
    role with the OWNERSHIP privilege on the schema) or a role with the global MANAGE GRANTS privilege, or a higher role, can revoke
    privileges on objects in the schema.

## Usage notes

* Privileges cannot be granted or revoked directly on any class. You can, however, create an instance of a class and
  revoke [instance roles](../snowflake-db-classes.md) from an
  account role. Revoke the CREATE <class_name> privilege on the schema to prevent a role from creating an instance of a
  class.
* A privilege can be granted to a role multiple times by different grantors. A REVOKE *<privilege>* statement only revokes grants for which
  the active role, or a lower role in a hierarchy, is the grantor. Any additional grants of a specified privilege by other grantors are
  ignored.

  Also note that a REVOKE *<privilege>* statement is successful even if no privileges are revoked. A REVOKE *<privilege>* statement only
  returns an error if a specified privilege has dependent grants and the CASCADE clause is omitted in the statement.
* Multiple privileges can be specified for the same object type in a single GRANT statement (with each privilege separated by commas),
  or the special `ALL [ PRIVILEGES ]` keyword can be used to grant all applicable privileges to the specified object type. Note,
  however, that only privileges held and grantable by the role executing the GRANT command are actually granted to the target role.
  A warning message is returned for any privileges that could not be granted.

  You cannot specify this keyword for tags.
* For databases, the IMPORTED PRIVILEGES privilege only applies to shared databases (i.e. databases created from a share). For more
  details, see [Consume imported data](../../user-guide/data-share-consumers.md).
* For schemas and objects in schemas, an option is provided to grant privileges on all objects of the same type within the container
  (database or schema). This is a convenience option; internally, the command is expanded into a series of individual GRANT commands
  on each object. Only objects that currently exist within the container are affected.

  However, note that, in the Snowflake model, bulk granting of privileges is not a recommended practice. Instead, Snowflake recommends
  creating a shared role and using the role to create objects that are automatically accessible to all users who have been granted the
  role.
* For stages:

  + USAGE only applies to external stages.
  + READ | WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ privilege
    must first be granted on the stage.

  For more details about external and internal stages, see [CREATE STAGE](create-stage.md).
* When granting privileges on an individual UDF, you must specify the data types for the arguments, if any, for the UDF in the form of
  `udf_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument data types to resolve UDFs that
  have the same name within a schema. For more details, see
  [User-defined functions overview](../../developer-guide/udf/udf-overview.md).
* When granting privileges on an individual stored procedure, you must specify the data types for the arguments, if any, for the
  procedure in the form of `procedure_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument
  data types to resolve stored procedures that have the same name within a schema.

  For more information, see [managed access schemas](../../user-guide/security-access-control-configure.md).

## Example

Revoke the SELECT privilege on a view from an application:

```sqlexample
REVOKE SELECT ON VIEW data.views.credit_usage
  FROM APPLICATION app_snowflake_credits;
```

---
title: REVOKE <privileges> … FROM APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-privilege-application-role.md
section: SQL Commands
---

# REVOKE *<privileges>* … FROM APPLICATION ROLE

Revokes one or more access privileges on a securable schema-level object from an application role. The privileges that can be revoked are
object-specific.

For more details about roles and securable objects, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

Variations:
:   [GRANT OWNERSHIP](grant-ownership.md) , [GRANT <privileges> … TO APPLICATION ROLE](grant-privilege-application-role.md)

## Syntax

Account roles:

```sqlsyntax
REVOKE [ GRANT OPTION FOR ]
    {
    | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
    | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { FUTURE SCHEMAS IN DATABASE <db_name> }
    | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN SCHEMA <schema_name> }
    | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
    }
  FROM APPLICATION ROLE <name> [ RESTRICT | CASCADE ]
```

Where:

```sqlsyntax
schemaPrivileges ::=
  {
    ADD SEARCH OPTIMIZATION
    | CREATE {
        ALERT | EXTERNAL TABLE | FILE FORMAT | FUNCTION
        | IMAGE REPOSITORY | MATERIALIZED VIEW | PIPE | PROCEDURE
        | { AGGREGATION | MASKING | PASSWORD | PROJECTION | ROW ACCESS | SESSION } POLICY
        | SECRET | SEMANTIC VIEW | SEQUENCE | SERVICE | SNAPSHOT | STAGE | STREAM
        | TAG | TABLE | TASK | VIEW
      }
    | MODIFY | MONITOR | USAGE
  }
  [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DYNAMIC TABLE
     OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { INSERT | SELECT } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { MASKING | PACKAGES | PASSWORD | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     READ, USAGE [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
```

For more details about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are granted.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `ALERT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `NETWORK RULE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PROCEDURE`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `STAGE`
    * `STREAM`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`

`object_type_plural`
:   Plural form of `object_type` (e.g. `TABLES`, `VIEWS`).

    Note that bulk grants on pipes are not allowed.

`name`
:   Specifies the identifier for the recipient application role (i.e. the role to which the privileges are granted).

## Optional parameters

`FUTURE`
:   If specified, only removes privileges granted on new (i.e. future) schema objects of a specified type (e.g. tables or views) rather than
    existing objects. Note that any privileges granted on existing objects are retained.

`RESTRICT | CASCADE`
:   If specified, determines whether the revoke operation succeeds or fails for the privileges, based on the whether the privileges had been
    re-granted to another application role.

    `RESTRICT`
    :   If the privilege being revoked has been re-granted to another application role, the REVOKE command fails.

    `CASCADE`
    :   If the privilege being revoked has been re-granted, the REVOKE command recursively revokes these dependent grants. If the same
        privilege on an object has been granted to the target role by a different grantor (parallel grant), that grant is not affected and the
        target role retains the privilege.

    Default: `RESTRICT`

## Security requirements

Revoking privileges on individual objects:
:   You can use an [active role](../../user-guide/security-access-control-overview.md) that meets either of the following criteria, or a
    [higher role](../../user-guide/security-access-control-overview.md), to revoke privileges on an object from other application
    roles:

    * The role is identified as the *grantor* of the privilege in the GRANTED_BY column in the [SHOW GRANTS](show-grants.md) output.

      If you have multiple instances of a privilege grant on the specified object, only the instances granted by the active grantor role
      are revoked.
    * The role has the global MANAGE GRANTS privilege.

      If you have multiple instances of a privilege grant on the specified object, all instances are revoked.

      Note that only the SECURITYADMIN system role and higher have the MANAGE GRANTS privilege by default; however, the privilege can be
      granted to custom roles.

    The following roles can revoke privileges from objects in a managed access schema
    (i.e. schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax):

    * The application role because this role is the schema owner (i.e. has the OWNERSHIP privilege on the schema).
      (i.e. the role with the OWNERSHIP privilege on the schema)
    * A role with the global MANAGE GRANTS privilege.

Revoking grants on future objects of a specified type:
:   In managed access schemas, either the application role or a role with the global MANAGE GRANTS privilege can revoke privileges on
    future objects in the schema.

    In standard schemas, the global MANAGE GRANTS privilege is required to revoke privileges on future objects in the schema.

## Usage notes

* A privilege can be granted to an application role multiple times by different grantors. A REVOKE *<privilege>* statement only revokes
  grants for which the active role, or a lower role in a hierarchy, is the grantor. Any additional grants of a specified privilege by other
  grantors are ignored.

  A REVOKE *<privilege>* statement is successful even if no privileges are revoked. A REVOKE *<privilege>* statement only
  returns an error if a specified privilege has dependent grants and the CASCADE clause is omitted in the statement.
* When revoking privileges on an individual UDF, you must specify the data types for the arguments, if any, for the UDF in the form of
  `udf_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument data types to resolve UDFs that
  have the same name within a schema. For more details, refer [User-defined functions overview](../../developer-guide/udf/udf-overview.md).
* When revoking privileges on an individual stored procedure, you must specify the data types for the arguments, if any, for the
  procedure in the form of `procedure_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument
  data types to resolve stored procedures that have the same name within a schema.
* **Future grants:** Revoking future grants only drops grants of privileges for future objects of a specified type. Any
  privileges granted on existing objects are retained.

  For more information, see [managed access schemas](../../user-guide/security-access-control-configure.md).

## Example

Revoke the SELECT privilege on a view from an application role:

```sqlexample
REVOKE SELECT ON VIEW data.views.credit_usage
  FROM APPLICATION ROLE app_snowflake_credits;
```

---
title: REVOKE <privileges> … FROM ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-privilege.md
section: SQL Commands
---

# REVOKE *<privileges>* … FROM ROLE

Removes one or more privileges on a securable object from a role or database role. The privileges that can be revoked are object-specific.

Roles:
:   The privileges that can be revoked from roles are grouped into the following categories:

    * Global privileges
    * Privileges for account objects (resource monitors, virtual warehouses, and databases)
    * Privileges for schemas
    * Privileges for schema objects (tables, views, stages, file formats, UDFs, and sequences)

Database roles:
:   The privileges that can be revoked from database roles are grouped into the following categories:

    * Privileges for the database that contains the database role.
    * Privileges for schemas in the database that contains the database role.
    * Privileges for schema objects (tables, views, stages, file formats, UDFs, and sequences) in the database that contains the database role.

See also:
:   [GRANT <privileges> … TO ROLE](grant-privilege.md) , [GRANT OWNERSHIP](grant-ownership.md)

    [REVOKE <privilege> … FROM SHARE](revoke-privilege-share.md)

## Syntax

Account roles:

```sqlsyntax
REVOKE [ GRANT OPTION FOR ]
    {
       { globalPrivileges         | ALL [ PRIVILEGES ] } ON ACCOUNT
     | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
     | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
     | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { FUTURE SCHEMAS IN DATABASE <db_name> }
     | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN SCHEMA <schema_name> }
     | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
    }
  FROM [ ROLE ] <role_name> [ RESTRICT | CASCADE ]
```

Database roles:

```sqlsyntax
REVOKE [ GRANT OPTION FOR ]
    {
       { CREATE SCHEMA | MODIFY | MONITOR | USAGE } [ , ... ] } ON DATABASE <object_name>
       { globalPrivileges         | ALL [ PRIVILEGES ] } ON ACCOUNT
     | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | EXTERNAL VOLUME } <object_name>
     | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
     | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { FUTURE SCHEMAS IN DATABASE <db_name> }
     | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN SCHEMA <schema_name> }
     | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON FUTURE <object_type_plural> IN { DATABASE <db_name> | SCHEMA <schema_name> }
    }
  FROM DATABASE ROLE <database_role_name> [ RESTRICT | CASCADE ]
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      CREATE {
          ACCOUNT | APPLICATION | APPLICATION PACKAGE | COMPUTE POOL | LISTING
          | DATABASE | EXTERNAL VOLUME | FAILOVER GROUP | INTEGRATION | NETWORK POLICY
          | ORGANIZATION LISTING | ORGANIZATION PROFILE | REPLICATION GROUP | ROLE | SHARE
       | USER | WAREHOUSE
      }
      | ATTACH POLICY | AUDIT | BIND SERVICE ENDPOINT
      | APPLY {
         { AGGREGATION | AUTHENTICATION | JOIN | MASKING | PACKAGES | PASSWORD
           | PROJECTION | ROW ACCESS | SESSION | STORAGE LIFECYCLE } POLICY
         | CONTACT
         | TAG }
      | EXECUTE { ALERT | DATA METRIC FUNCTION | MANAGED ALERT | MANAGED TASK | TASK }
      | IMPORT { SHARE | ORGANIZATION LISTING }
 | MANAGE { ACCOUNT SUPPORT CASES | EVENT SHARING | GRANTS | LISTING AUTO FULFILLMENT | ORGANIZATION SUPPORT CASES | SHARE TARGET | USER SUPPORT CASES | VISIBILITY | WAREHOUSES }
      | MODIFY { LOG LEVEL | TRACE LEVEL | SESSION LOG LEVEL | SESSION TRACE LEVEL }
      | MONITOR { EXECUTION | SECURITY | USAGE }
      | OVERRIDE SHARE RESTRICTIONS | PURCHASE DATA EXCHANGE LISTING | RESOLVE ALL
      | READ SESSION
      | READ UNREDACTED ERROR TABLE
      | USE AI FUNCTIONS
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For APPLICATION PACKAGE
    { ATTACH LISTING | DEVELOP | INSTALL | MANAGE VERSIONS | MANAGE RELEASES } [ , ... ]
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET | CREATE { DATABASE ROLE | SCHEMA }
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For ORGANIZATION PROFILE
   { MODIFY } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { IMPERSONATE | MODIFY PROGRAMMATIC AUTHENTICATION METHODS | MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=

    ADD SEARCH OPTIMIZATION | APPLYBUDGET
  | CREATE {
       AGENT | ALERT | CONTACT | CORTEX SEARCH SERVICE | DATA METRIC FUNCTION | DATASET
      | DBT PROJECT | EVENT TABLE | EXPERIMENT | FILE FORMAT | FUNCTION
      | GATEWAY | { GIT | IMAGE } REPOSITORY | MCP SERVER
      | MODEL | NETWORK RULE | NOTEBOOK | PIPE | PROCEDURE
      | { AGGREGATION | AUTHENTICATION | MASKING | PACKAGES
         | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION
         | STORAGE LIFECYCLE } POLICY
      | SECRET | SEQUENCE | SERVICE | SNAPSHOT | SNAPSHOT POLICY | SNAPSHOT SET
      | STAGE | STREAM | STREAMLIT
      | SNOWFLAKE.CORE.BUDGET
      | SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
      | SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER
      | SNOWFLAKE.ML.ANOMALY_DETECTION | SNOWFLAKE.ML.CLASSIFICATION
         | SNOWFLAKE.ML.FORECAST | SNOWFLAKE.ML.TOP_INSIGHTS
      | SNOWFLAKE.ML.DOCUMENT_INTELLIGENCE
      | [ { DYNAMIC | EXTERNAL | ICEBERG | INTERACTIVE | ONLINE FEATURE } ] TABLE
      | TAG | TASK | TYPE | WORKSPACE | [ { MATERIALIZED | SEMANTIC } ] VIEW
      }
   | MODIFY | MONITOR | USAGE
   [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For AGENT
     { MODIFY | MONITOR | USAGE } [ , ... ]
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For CONTACT
     { APPLY | MODIFY } [ , ... ]
  -- For CORTEX SEARCH SERVICE
     { OPERATE | USAGE } [ , ... ]
  -- For DATA METRIC FUNCTION
     USAGE [ , ... ]
  -- For DATASET, FILE FORMAT, FUNCTION (UDF or external function), MODEL, PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For SNAPSHOT POLICY or SNAPSHOT SET (for WORM snapshots)
     USAGE [ , ... ]
  -- For DBT PROJECT
     USAGE, MONITOR [ , ... ]
  -- For DYNAMIC TABLE
     MONITOR, OPERATE, SELECT [ , ... ]
  -- For EXPERIMENT
     { CREATE | MODIFY | USAGE } [ , ... ]
  -- For EVENT TABLE
     { APPLYBUDGET | DELETE | OWNERSHIP | REFERENCES | SELECT | TRUNCATE } [ , ... ]
  -- For GATEWAY
     { CREATE | MODIFY | USAGE } [ , ... ]
  -- For GIT REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For HYBRID TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For ICEBERG TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For INTERACTIVE TABLE
     { REFERENCES | SELECT } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
 -- For MCP SERVER
     { MODIFY | USAGE } [ , ... ]
  -- For ONLINE FEATURE TABLE
     { MONITOR | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | AUTHENTICATION | MASKING | JOIN | PACKAGES | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION | STORAGE LIFECYCLE } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     { READ | USAGE } [ , ... ]
  -- For SEMANTIC VIEW
     { SELECT | REFERENCES | MONITOR } [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For STREAMLIT
     USAGE [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | SELECT ERROR TABLE | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
  -- For WORKSPACE
     { READ | WRITE } [ , ... ]
```

For more details about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are revoked.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `AGENT`
    * `AGGREGATION POLICY`
    * `ALERT`
    * `AUTHENTICATION POLICY`
    * `CORTEX SEARCH SERVICE`
    * `DATA METRIC FUNCTION`
    * `DATASET`
    * `DBT PROJECT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXPERIMENT`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `GATEWAY`
    * `GIT REPOSITORY`
    * `IMAGE REPOSITORY`
    * `ICEBERG TABLE`
    * `INTERACTIVE TABLE`
    * `JOIN POLICY`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `MCP SERVER`
    * `MODEL`
    * `MODEL MONITOR`
    * `NETWORK RULE`
    * `NOTEBOOK`
    * `ONLINE FEATURE TABLE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PRIVACY POLICY`
    * `PROCEDURE`
    * `PROJECTION POLICY`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SERVICE`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `SNAPSHOT`
    * `SNAPSHOT POLICY`
    * `SNAPSHOT SET`
    * `STAGE`
    * `STORAGE LIFECYCLE POLICY`
    * `STREAM`
    * `STREAMLIT`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`
    * `WORKSPACE`

`object_type_plural`
:   Plural form of `object_type` (for example, `TABLES`, `VIEWS`).

`role_name`
:   Specifies the identifier for the recipient role (that is, the role from which the privileges are revoked).

`database_role_name`
:   Specifies the identifier for the recipient database role (that is, the role from which the privileges are revoked). If the identifier is not
    fully qualified (in the form of `db_name.database_role_name`), the command looks for the database role in the current database
    for the session.

## Optional parameters

`GRANT OPTION FOR`
:   If specified, removes the ability for the recipient role to grant the privileges to another role.

    Default: No value

`ON FUTURE`
:   If specified, only removes privileges granted on new (that is, future) schema objects of a specified type (such as tables or views) rather than
    existing objects. Note that any privileges granted on existing objects are retained.

`RESTRICT | CASCADE`
:   If specified, determines whether the revoke operation succeeds or fails for the privileges, based on the whether the privileges had been
    re-granted to another role.

    * `RESTRICT`: If the privilege being revoked has been re-granted to another role, the REVOKE command fails.
    * `CASCADE`: If the privilege being revoked has been re-granted, the REVOKE command recursively revokes these dependent grants.
      If the same privilege on an object has been granted to the target role by a different grantor (parallel grant), that grant is not
      affected and the target role retains the privilege.

    Default: `RESTRICT`

## Security requirements

Revoking privileges on individual objects:
:   An [active role](../../user-guide/security-access-control-overview.md) that meets either of the following criteria, or a
    [higher role](../../user-guide/security-access-control-overview.md), can be used to revoke privileges on an object from other roles:

    * The role is identified as the *grantor* of the privilege in the GRANTED_BY column in the [SHOW GRANTS](show-grants.md) output.

      If multiple instances of a privilege have been granted on the specified object, only the instances granted by the active grantor role
      are revoked.
    * The role has the global MANAGE GRANTS privilege.

      If multiple instances of a privilege have been granted on the specified object, all instances are revoked.

      Note that only the SECURITYADMIN system role and higher have the MANAGE GRANTS privilege by default; however, the privilege can be
      granted to custom roles.

    In managed access schemas (that is, schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax), only the schema owner (that is, the
    role with the OWNERSHIP privilege on the schema) or a role with the global MANAGE GRANTS privilege, or a higher role, can revoke
    privileges on objects in the schema.

Revoking grants on future objects of a specified type:
:   **Database level**

    The global MANAGE GRANTS privilege is required to revoke privileges on future objects in a database. Only the SECURITYADMIN system role
    and higher have the MANAGE GRANTS privilege; however, the privilege can be granted to custom roles.

    **Schema level**

    In managed access schemas (that is, schemas created using the CREATE SCHEMA … WITH MANAGED ACCESS syntax),
    either the schema owner (that is, the role with the OWNERSHIP privilege on the schema) or a role with the
    global MANAGE GRANTS privilege can revoke privileges on future objects in the schema.

    In standard schemas, the global MANAGE GRANTS privilege is required to revoke privileges on future objects
    in the schema.

## Usage notes

* Privileges cannot be granted or revoked directly on any class. You can, however, create an instance of a class and
  revoke [instance roles](../snowflake-db-classes.md) from an
  account role. Revoke the CREATE <class_name> privilege on the schema to prevent a role from creating an instance of a
  class.
* A privilege can be granted to a role multiple times by different grantors. A REVOKE *<privilege>* statement only revokes grants for which
  the active role, or a lower role in a hierarchy, is the grantor. Any additional grants of a specified privilege by other grantors are
  ignored.

  Also note that a REVOKE *<privilege>* statement is successful even if no privileges are revoked. A REVOKE *<privilege>* statement only
  returns an error if a specified privilege has dependent grants and the CASCADE clause is omitted in the statement.
* Multiple privileges can be specified for the same object type in a single GRANT statement (with each privilege separated by commas),
  or the special `ALL [ PRIVILEGES ]` keyword can be used to grant all applicable privileges to the specified object type. Note,
  however, that only privileges held and grantable by the role executing the GRANT command are actually granted to the target role.
  A warning message is returned for any privileges that could not be granted.

  You cannot specify this keyword for tags.
* Privileges granted to a particular role are automatically inherited by any other roles to which the role is granted, as well as any
  other higher-level roles within the role hierarchy. For more details, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).
* For databases, the IMPORTED PRIVILEGES privilege only applies to shared databases (that is, databases created from a share). For more
  details, see [Consume imported data](../../user-guide/data-share-consumers.md).
* For schemas and objects in schemas, an option is provided to grant privileges on all objects of the same type within the container
  (that is, a database or schema). This is a convenience option; internally, the command is expanded into a series of individual GRANT commands
  on each object. Only objects that currently exist within the container are affected.

  However, note that, in the Snowflake model, bulk granting of privileges is not a recommended practice. Instead, Snowflake recommends
  creating a shared role and using the role to create objects that are automatically accessible to all users who have been granted the
  role.
* For stages:

  + USAGE only applies to external stages.
  + READ | WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ privilege
    must first be granted on the stage.

  For more details about external and internal stages, see [CREATE STAGE](create-stage.md).
* For storage integrations:

  + To run the following commands using an external stage that relies on a storage integration,
    you must use a role that has or inherits the USAGE privilege on the storage integration.

    - [LIST](list.md)
    - [REMOVE](remove.md)
    - [COPY INTO <table>](copy-into-table.md)
    - [COPY INTO <location>](copy-into-location.md)

    If you revoke the USAGE privilege from the role, the role
    can’t run these commands. For more information, see [Stage privileges](../../user-guide/security-access-control-privileges.md).
  + Revoking the USAGE privilege on a storage integration does not block a role from querying external tables associated with the storage
    integration. Querying an external table does not require the USAGE privilege on its underlying storage integration.
* When granting privileges on an individual UDF, you must specify the data types for the arguments, if any, for the UDF in the form of
  `udf_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument data types to resolve UDFs that
  have the same name within a schema. For an example, see Examples (in this topic). For more details, see
  [User-defined functions overview](../../developer-guide/udf/udf-overview.md).
* When granting privileges on an individual stored procedure, you must specify the data types for the arguments, if any, for the
  procedure in the form of `procedure_name ( [ arg_data_type , ... ] )`. This is required because Snowflake uses argument
  data types to resolve stored procedures that have the same name within a schema.
* OWNERSHIP is a valid privilege across all object types that support future grants.
* **Future grants:** Revoking future grants only drops grants of privileges for future objects of a specified type. Any
  privileges granted on existing objects are retained.

  For more information, see [managed access schemas](../../user-guide/security-access-control-configure.md).
* To revoke privileges on hybrid tables, use the standard TABLE or TABLES keyword. You cannot specify HYBRID TABLE or HYBRID TABLES.
* To revoke privileges on interactive tables, use the standard TABLE or TABLES keyword. You cannot specify INTERACTIVE TABLE or INTERACTIVE TABLES.

## Examples

### Roles

Revoke the privilege to create a warehouse in the account from the `analyst` role:

```sqlexample
REVOKE CREATE WAREHOUSE ON ACCOUNT FROM ROLE analyst;
```

Revoke the necessary privileges to operate (that is, suspend or resume) the `report_wh` warehouse
from the `analyst` role:

```sqlexample
REVOKE OPERATE ON WAREHOUSE report_wh FROM ROLE analyst;
```

Revoke only the GRANT OPTION privilege for the OPERATE privilege on the `report_wh` warehouse from the
`analyst` role. The role retains the OPERATE privilege but can no longer grant the OPERATE privilege
on the warehouse to other roles:

```sqlexample
REVOKE GRANT OPTION FOR OPERATE ON WAREHOUSE report_wh FROM ROLE analyst;
```

Revoke the SELECT privilege on all existing tables in the `mydb.myschema` schema from the `analyst` role:

```sqlexample
REVOKE SELECT ON ALL TABLES IN SCHEMA mydb.myschema from ROLE analyst;
```

Revoke all privileges on two UDFs (with the same name in the current schema) from the `analyst` role:

```sqlexample
REVOKE ALL PRIVILEGES ON FUNCTION add5(number) FROM ROLE analyst;

REVOKE ALL PRIVILEGES ON FUNCTION add5(string) FROM ROLE analyst;
```

Note that the UDFs have different arguments, which is how Snowflake uniquely identifies UDFs with the same name.
For more details about UDF naming, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

Revoke all privileges on two stored procedures (with the same name in the current schema) from the `analyst` role:

```sqlexample
REVOKE ALL PRIVILEGES ON PROCEDURE clean_schema(string) FROM ROLE analyst;

REVOKE ALL PRIVILEGES ON procedure clean_schema(string, string) FROM ROLE analyst;
```

Note that the two stored procedures have different arguments, which is how Snowflake uniquely identifies procedures
with the same name.

Revoke the SELECT and INSERT privileges granted on all future tables created in the `mydb.myschema` schema from the
`role1` role:

```sqlexample
REVOKE SELECT, INSERT ON FUTURE TABLES IN SCHEMA mydb.myschema
  FROM ROLE role1;
```

Revoke the USAGE privilege on a notebook called `mynotebook` from the `finance` role:

> ```sqlexample
> REVOKE USAGE ON NOTEBOOK db_one.schema_one.mynotebook FROM ROLE finance;
> ```

### Database roles

Revoke the SELECT privilege on all existing tables in the `mydb.myschema` schema from the `mydb.dr1` database role:

```sqlexample
REVOKE SELECT ON ALL TABLES IN SCHEMA mydb.myschema
  FROM DATABASE ROLE mydb.dr1;
```

Revoke all privileges on two UDFs (with the same name in the current schema) from the `mydb.dr1` database role:

```sqlexample
REVOKE ALL PRIVILEGES ON FUNCTION add5(number)
  FROM DATABASE ROLE mydb.dr1;

REVOKE ALL PRIVILEGES ON FUNCTION add5(string)
  FROM DATABASE ROLE mydb.dr1;
```

Note that the UDFs have different arguments, which is how Snowflake uniquely identifies UDFs with the same name.
For more details about UDF naming, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

Revoke all privileges on two stored procedures (with the same name in the current schema) from the
`mydb.dr1` database role:

```sqlexample
REVOKE ALL PRIVILEGES ON PROCEDURE clean_schema(string)
  FROM DATABASE ROLE mydb.dr1;

REVOKE ALL PRIVILEGES ON procedure clean_schema(string, string)
  FROM DATABASE ROLE mydb.dr1;
```

Note that the two stored procedures have different arguments, which is how Snowflake uniquely identifies procedures
with the same name.

Revoke the SELECT and INSERT privileges granted on all future tables created in the `mydb.myschema` schema
from the `mydb.dr1` database role:

```sqlexample
REVOKE SELECT,INSERT ON FUTURE TABLES IN SCHEMA mydb.myschema
  FROM DATABASE ROLE mydb.dr1;
```

Revoke the USAGE privilege on a notebook from the `mydb.dr1` database role:

```sqlexample
REVOKE USAGE ON NOTEBOOK db_one.schema_one.mynotebook
  FROM DATABASE ROLE mydb.dr1;
```

---
title: REVOKE <privileges> … FROM USER
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-privilege-user.md
section: SQL Commands
---

# REVOKE *<privileges>* … FROM USER

Removes one or more privileges on a securable object from a user. The privileges that can be revoked are object-specific.

See also:

> [GRANT <privileges> … TO USER](grant-privilege-user.md)

## Syntax

```sqlsyntax
REVOKE [ GRANT OPTION FOR ]
    {
       { globalPrivileges         | ALL [ PRIVILEGES ] } ON ACCOUNT
     | { accountObjectPrivileges  | ALL [ PRIVILEGES ] } ON { RESOURCE MONITOR | WAREHOUSE | COMPUTE POOL | DATABASE | INTEGRATION | CONNECTION | FAILOVER GROUP | REPLICATION GROUP | EXTERNAL VOLUME } <object_name>
     | { schemaPrivileges         | ALL [ PRIVILEGES ] } ON { SCHEMA <schema_name> | ALL SCHEMAS IN DATABASE <db_name> }
     | { schemaObjectPrivileges   | ALL [ PRIVILEGES ] } ON { <object_type> <object_name> | ALL <object_type_plural> IN SCHEMA <schema_name> }
    }
  FROM [ USER ] <user_name> [ RESTRICT | CASCADE ]
```

Where:

```sqlsyntax
globalPrivileges ::=
  {
      | ATTACH POLICY | AUDIT | BIND SERVICE ENDPOINT
      | APPLY {
         { AGGREGATION | AUTHENTICATION | JOIN | MASKING | PACKAGES | PASSWORD
           | PROJECTION | ROW ACCESS | SESSION } POLICY
         | TAG }
      | EXECUTE { ALERT | DATA METRIC FUNCTION | MANAGED ALERT | MANAGED TASK | TASK }
      | IMPORT SHARE
      | MANAGE { ACCOUNT SUPPORT CASES | EVENT SHARING | GRANTS | LISTING AUTO FULFILLMENT | ORGANIZATION SUPPORT CASES | USER SUPPORT CASES | WAREHOUSES }
      | MODIFY { LOG LEVEL | TRACE LEVEL | SESSION LOG LEVEL | SESSION TRACE LEVEL }
      | MONITOR { EXECUTION | SECURITY | USAGE }
      | OVERRIDE SHARE RESTRICTIONS | PURCHASE DATA EXCHANGE LISTING | RESOLVE ALL
      | READ SESSION
  }
  [ , ... ]
```

```sqlsyntax
accountObjectPrivileges ::=
-- For COMPUTE POOL
   { MODIFY | MONITOR | OPERATE | USAGE } [ , ... ]
-- For CONNECTION
   { FAILOVER } [ , ... ]
-- For DATABASE
   { APPLYBUDGET
   | IMPORTED PRIVILEGES | MODIFY | MONITOR | USAGE } [ , ... ]
-- For EXTERNAL VOLUME
   { USAGE } [ , ... ]
-- For FAILOVER GROUP
   { FAILOVER | MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For INTEGRATION
   { USAGE | USE_ANY_ROLE } [ , ... ]
-- For REPLICATION GROUP
   { MODIFY | MONITOR | REPLICATE } [ , ... ]
-- For RESOURCE MONITOR
   { MODIFY | MONITOR } [ , ... ]
-- For USER
   { MONITOR } [ , ... ]
-- For WAREHOUSE
   { APPLYBUDGET | MODIFY | MONITOR | USAGE | OPERATE } [ , ... ]
```

```sqlsyntax
schemaPrivileges ::=

    ADD SEARCH OPTIMIZATION | APPLYBUDGET
   | MODIFY | MONITOR | USAGE
   [ , ... ]
```

```sqlsyntax
schemaObjectPrivileges ::=
  -- For ALERT
     { MONITOR | OPERATE } [ , ... ]
  -- For DATA METRIC FUNCTION
     USAGE [ , ... ]
  -- For DYNAMIC TABLE
     MONITOR, OPERATE, SELECT [ , ...]
  -- For EVENT TABLE
     { APPLYBUDGET | DELETE | REFERENCES | SELECT | TRUNCATE } [ , ... ]
  -- For FILE FORMAT, FUNCTION (UDF or external function), MODEL, PROCEDURE, SECRET, SEQUENCE, SNAPSHOT, or TYPE
     USAGE [ , ... ]
  -- For GIT REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For HYBRID TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For IMAGE REPOSITORY
     { READ, WRITE } [ , ... ]
  -- For ICEBERG TABLE
     { APPLYBUDGET | DELETE | INSERT | REFERENCES | SELECT | TRUNCATE | UPDATE } [ , ... ]
  -- For MATERIALIZED VIEW
     { APPLYBUDGET | REFERENCES | SELECT } [ , ... ]
  -- For PIPE
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For { AGGREGATION | AUTHENTICATION | MASKING | JOIN | PACKAGES | PASSWORD | PRIVACY | PROJECTION | ROW ACCESS | SESSION } POLICY or TAG
     APPLY [ , ... ]
  -- For SECRET
     { READ | USAGE } [ , ... ]
  -- For SEMANTIC VIEW
     REFERENCES [ , ... ]
  -- For SERVICE
     { MONITOR | OPERATE } [ , ... ]
  -- For external STAGE
     USAGE [ , ... ]
  -- For internal STAGE
     READ [ , WRITE ] [ , ... ]
  -- For STREAM
     SELECT [ , ... ]
  -- For STREAMLIT
     USAGE [ , ... ]
  -- For TABLE
     { APPLYBUDGET | DELETE | EVOLVE SCHEMA | INSERT | REFERENCES | SELECT | SELECT ERROR TABLE | TRUNCATE | UPDATE } [ , ... ]
  -- For TAG
     READ
  -- For TASK
     { APPLYBUDGET | MONITOR | OPERATE } [ , ... ]
  -- For VIEW
     { REFERENCES | SELECT } [ , ... ]
```

For more information about the privileges supported for each object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

## Required parameters

`object_name`
:   Specifies the identifier for the object on which the privileges are revoked.

`object_type`
:   Specifies the type of object for schema-level objects.

    * `AGENT`
    * `AGGREGATION POLICY`
    * `ALERT`
    * `AUTHENTICATION POLICY`
    * `CORTEX SEARCH SERVICE`
    * `DATA METRIC FUNCTION`
    * `DATASET`
    * `DBT PROJECT`
    * `DYNAMIC TABLE`
    * `EVENT TABLE`
    * `EXPERIMENT`
    * `EXTERNAL TABLE`
    * `FILE FORMAT`
    * `FUNCTION`
    * `GATEWAY`
    * `GIT REPOSITORY`
    * `IMAGE REPOSITORY`
    * `ICEBERG TABLE`
    * `INTERACTIVE TABLE`
    * `JOIN POLICY`
    * `MASKING POLICY`
    * `MATERIALIZED VIEW`
    * `MCP SERVER`
    * `MODEL`
    * `MODEL MONITOR`
    * `NETWORK RULE`
    * `NOTEBOOK`
    * `ONLINE FEATURE TABLE`
    * `PACKAGES POLICY`
    * `PASSWORD POLICY`
    * `PIPE`
    * `PRIVACY POLICY`
    * `PROCEDURE`
    * `PROJECTION POLICY`
    * `ROW ACCESS POLICY`
    * `SECRET`
    * `SEMANTIC VIEW`
    * `SERVICE`
    * `SESSION POLICY`
    * `SEQUENCE`
    * `SNAPSHOT`
    * `SNAPSHOT POLICY`
    * `SNAPSHOT SET`
    * `STAGE`
    * `STORAGE LIFECYCLE POLICY`
    * `STREAM`
    * `STREAMLIT`
    * `TABLE`
    * `TAG`
    * `TASK`
    * `TYPE`
    * `VIEW`
    * `WORKSPACE`

`object_type_plural`
:   Plural form of `object_type` (for example, `TABLES`, `VIEWS`).

`user_name`
:   Specifies the identifier for the recipient user (the user from which the privileges are revoked).

## Optional parameters

`GRANT OPTION FOR`
:   If specified, removes the ability for the recipient user to grant the privileges to another role or user.

    Default: No value

`RESTRICT | CASCADE`
:   If specified, determines whether the revoke operation succeeds or fails for the privileges, based on the whether the privileges had been
    re-granted to another role or user.

    * `RESTRICT`: If the privilege being revoked has been re-granted to another role or user, the REVOKE command fails.
    * `CASCADE`: If the privilege being revoked has been re-granted, the REVOKE command recursively revokes these dependent grants.
      If the same privilege on an object has been granted to the target user by a different grantor (parallel grant), that grant is not
      affected and the target user retains the privilege.

    Default: `RESTRICT`

## Usage notes

* Privileges cannot be granted or revoked directly on any class.

* A privilege can be granted to a user multiple times by different grantors. A `REVOKE privilege` statement only revokes grants for
  which the user is the grantor. Any additional grants of a specified privilege by other grantors are ignored.

  Also note that a `REVOKE privilege` statement is successful even if no privileges are revoked. `REVOKE privilege` only
  returns an error if a specified privilege has dependent grants and the CASCADE clause is omitted from the statement.
* Multiple privileges can be specified for the same object type in a single GRANT statement (with each privilege separated by commas),
  or the special `ALL [ PRIVILEGES ]` keyword can be used to grant all applicable privileges to the specified object type. Note,
  however, that only privileges held and grantable by the role or user executing the GRANT command are actually granted to the target user.
  A warning message is returned for any privileges that could not be granted.

  You cannot specify the `ALL [ PRIVILEGES ]` keyword for tags.

* For stages:

  + USAGE only applies to external stages.
  + READ | WRITE only applies to internal stages. In addition, to grant the WRITE privilege on an internal stage, the READ privilege
    must first be granted on the stage.

  For more information about external and internal stages, see [CREATE STAGE](create-stage.md).
* For storage integrations:

  + To run the following commands using an external stage that relies on a storage integration, the USAGE privilege on the storage
    integration must be directly granted to the user or use a role that has or inherits the privilege.

    - [LIST](list.md)
    - [REMOVE](remove.md)
    - [COPY INTO <table>](copy-into-table.md)
    - [COPY INTO <location>](copy-into-location.md)

    If you revoke the USAGE privilege from the user, the user cannot run these commands. For more information, see
    [Stage privileges](../../user-guide/security-access-control-privileges.md).
  + Revoking the USAGE privilege on a storage integration does not block a user from querying external tables associated with the
    storage integration. Querying an external table does not require the USAGE privilege on its underlying storage integration.

## Access control requirements

Revoking privileges on individual objects:
:   An [active role](../../user-guide/security-access-control-overview.md) or user that meets either of the following criteria, or a
    [higher role](../../user-guide/security-access-control-overview.md), can be used to revoke privileges on an object from users:

    * The role or user is identified as the *grantor* of the privilege in the GRANTED_BY column in the [SHOW GRANTS](show-grants.md)
      output.

      If multiple instances of a privilege have been granted on the specified object, only the instances granted by the active grantor role
      are revoked.
    * The role or user has the global MANAGE GRANTS privilege.

      If multiple instances of a privilege have been granted on the specified object, all instances are revoked.

      Note that only the SECURITYADMIN system role and higher have the MANAGE GRANTS privilege by default; however, the privilege can be
      granted to custom roles.

    In [managed access schemas](../../user-guide/security-access-control-configure.md) (schemas created using the `CREATE SCHEMA ... WITH MANAGED ACCESS`)
    syntax, only the schema owner (the role with the OWNERSHIP privilege on the schema), a role or user with the global MANAGE GRANTS
    privilege, or a higher role can revoke privileges on objects in the schema.

## Examples

To revoke the USAGE privilege on a Streamlit application from a specific user, `joe`:

```sqlexample
REVOKE USAGE ON STREAMLIT streamlit_db.streamlit_schema.streamlit_app FROM USER joe;
```

To revoke the USAGE privilege on a procedure from a specific user, `user1`:

```sqlexample
REVOKE USAGE ON PROCEDURE mydb.myschema.myprocedure(number) FROM USER user1;
```

---
title: REVOKE APPLICATION ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-application-role.md
section: SQL Commands
---

# REVOKE APPLICATION ROLE

Revokes an application role from an account role or another application role.

See also:
:   [ALTER APPLICATION ROLE](alter-application-role.md), [CREATE APPLICATION ROLE](create-application-role.md), [GRANT APPLICATION ROLE](grant-application-role.md),
    [SHOW APPLICATION ROLES](show-application-roles.md)

## Syntax

```sqlsyntax
REVOKE APPLICATION ROLE <name> FROM { ROLE <parent_role_name> | APPLICATION ROLE <application_role> | APPLICATION <application> }
```

## Parameters

`name`
:   Specifies the identifier for the application role to revoke. If the identifier contains spaces or special
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`FROM ROLE parent_role_name`
:   Revokes the application role from the specified account role.

`APPLICATION ROLE application_role`
:   Revokes the role from the specified application role.

`APPLICATION ROLE application`
:   Revokes the role from the specified application.

## Usage notes

An application role may only be revoked from another application role within the context of
the installed application, for example within the application setup script.

## Examples

```sqlexample
REVOKE APPLICATION ROLE app_role FROM APPLICATION ROLE other_role;
```

---
title: REVOKE CALLER
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-caller.md
section: SQL Commands
---

# REVOKE CALLER

Revokes privileges that were previously granted to an executable owner using a
[caller grant](../../developer-guide/restricted-callers-rights.md).

Variations of the REVOKE CALLER command are as follows:

* REVOKE CALLER — Revoke specific privileges on a specific object.
* REVOKE ALL CALLER PRIVILEGES — Revoke all privileges on a specific object. The executable will not be
  able to run with any privileges from the caller when it tries to access the object.
* REVOKE INHERITED CALLER — Revoke caller grants on all current and future objects of the same type when they share a common schema, database,
  or account. Only privileges in a specified list are revoked.
* REVOKE ALL INHERITED CALLER PRIVILEGES — Revoke caller grants on all current and future objects of the same type when they share a common
  schema, database, or account. All privileges are revoked; the executable will not be able to run with any privileges from the caller.

## Syntax

```sqlsyntax
REVOKE CALLER <object_privilege> [ , <object_privilege> ... ]
  ON <object_type> <object_name>
  FROM { ROLE | DATABASE ROLE } <grantee_name>

REVOKE ALL CALLER PRIVILEGES
  ON <object_type> <object_name>
  FROM { ROLE | DATABASE ROLE } <grantee_name>

REVOKE INHERITED CALLER <object_privilege> [ , <object_privilege> ... ]
  ON ALL <object_type_plural>
  IN { ACCOUNT | DATABASE <db_name> | SCHEMA <schema_name> | APPLICATION <app_name> | APPLICATION PACKAGE <app_pkg_name> }
  FROM { ROLE | DATABASE ROLE } <grantee_name>

REVOKE ALL INHERITED CALLER PRIVILEGES
  ON ALL <object_type_plural>
  IN { ACCOUNT | DATABASE <db_name> | SCHEMA <schema_name> | APPLICATION <app_name> | APPLICATION PACKAGE <app_pkg_name> }
  FROM { ROLE | DATABASE ROLE } <grantee_name>
```

## Parameters

`object_privilege [ , object_privilege ... ]`
:   The object privileges that you want to revoke. Executables owned by the specified role can no longer
    run with these privileges. For a list of privileges for a specific object type, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

    Use a comma-delimited list to specify more than one object privilege.

`ON object_type object_name`
:   The object, including its type, that you want to revoke privileges for. Use the singular form of `object_type`, for example, `TABLE` or `WAREHOUSE`.

`ON ALL object_type_plural IN ACCOUNT` or . `ON ALL object_type_plural IN DATABASE db_name` or . `ON ALL object_type_plural IN SCHEMA schema_name` or . `ON ALL object_type_plural IN APPLICATION app_name` or . `ON ALL object_type_plural IN APPLICATION PACKAGE app_pkg_name`
:   Revokes privileges on all objects of a certain type. Use the plural form of the object type, for example, `TABLES` or `WAREHOUSES`.

    You can use the REVOKE statement to revoke access to all objects in the current account or just to objects in the specified database,
    schema, application, or application package.

`FROM ROLE grantee_name` or . `FROM DATABASE ROLE grantee_name`
:   Executable owner who was previously granted a caller grant.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MANAGE CALLER GRANTS | Account | The account-level MANAGE CALLER GRANTS privilege pertains to caller grants only. It does not allow you to revoke privileges from roles. |
| Any privilege | All specified objects | You need at least one privilege on the objects specified in the REVOKE command. For example, revoking a caller grant on a table requires that you have at least one privilege on that table. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Executables owned by `owner_role` can no longer run with the caller’s privileges when they access views in the current account.

> ```sqlexample
> REVOKE ALL INHERITED CALLER PRIVILEGES ON ALL VIEWS IN ACCOUNT FROM ROLE owner_role;
> ```

Executables owned by `owner_role` can no longer run with the USAGE privilege when they access the `db.sch1` schema.

> ```sqlexample
> REVOKE CALLER USAGE ON SCHEMA db.sch1 FROM ROLE owner_role;
> ```

---
title: REVOKE DATABASE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-database-role.md
section: SQL Commands
---

# REVOKE DATABASE ROLE

Revokes a database role from an [account role or another database role](../../user-guide/security-access-control-overview.md).

See also:
:   [GRANT DATABASE ROLE](grant-database-role.md) , [GRANT ROLE](grant-role.md) , [REVOKE ROLE](revoke-role.md) , [GRANT <privileges> … TO ROLE](grant-privilege.md)

## Syntax

```sqlsyntax
REVOKE DATABASE ROLE <name> FROM { ROLE | DATABASE ROLE } <parent_role_name>

REVOKE DATABASE ROLE <name> FROM APPLICATION <app_name>
```

## Parameters

`name`
:   Specifies the identifier for the database role to revoke. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`DATABASE ROLE parent_role_name`
:   Revokes the database role from the specified database role.

`ROLE parent_role_name`
:   Revokes the database role from the specified account role.

`APPLICATION app_name`
:   Revokes the database role from the specified Snowflake Native App.

## Examples

Revokes the database role named `analyst` from the account role named `SYSADMIN`.

```sqlexample
REVOKE DATABASE ROLE analyst FROM ROLE SYSADMIN;
```

Revokes the database role named `dr1` from another database role named `dr2`.

```sqlexample
REVOKE DATABASE ROLE dr1 FROM DATABASE ROLE dr2;
```

Revokes the database role named `dr1` from the Snowflake Native App named `hello_snowflake_app`.

```sqlexample
REVOKE DATABASE ROLE dr1 FROM APPLICATION hello_snowflake_app;
```

---
title: REVOKE DATABASE ROLE … FROM SHARE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-database-role-share.md
section: SQL Commands
---

# REVOKE DATABASE ROLE … FROM SHARE

Revokes a database role from a share.

Revoking a database role effectively removes privileges on objects granted to the database role from the share, disabling access to the
objects in all consumer accounts that have created a database from the share.

For more details, see [About Secure Data Sharing](../../user-guide/data-sharing-intro.md) and [Create and configure shares](../../user-guide/data-sharing-provider.md).

See also:
:   [GRANT DATABASE ROLE … TO SHARE](grant-database-role-share.md)

## Syntax

```sqlsyntax
REVOKE DATABASE ROLE <name>
  FROM SHARE <share_name>
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) for the database role; must be unique in the database in which the role is created.

    The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier
    string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    If the identifier is not fully qualified (in the form of `db_name.database_role_name`, the command looks for the database role
    in the current database for the session.

`share_name`
:   Specifies the identifier for the share to which the specified database role is revoked.

## Usage notes

None.

## Examples

Revoke the database role `dr1` in database `d1` from share `share1`:

> ```sqlexample
> REVOKE DATABASE ROLE d1.dr1 FROM SHARE share1;
> ```

---
title: REVOKE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-role.md
section: SQL Commands
---

# REVOKE ROLE

Removes a role from another role or a user.

See also:
:   [GRANT ROLE](grant-role.md)

## Syntax

```sqlsyntax
REVOKE ROLE <name> FROM { ROLE <parent_role_name> | USER <user_name> }
```

## Parameters

`name`
:   Specifies the identifier for the role to revoke. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

`ROLE parent_role_name`
:   Revokes the role from the specified role.

`USER user_name`
:   Revokes the role from the specified user.

## Examples

```sqlexample
REVOKE ROLE analyst FROM ROLE SYSADMIN;
```

```sqlexample
REVOKE ROLE analyst FROM USER user1;
```

---
title: REVOKE SERVICE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/revoke-service-role.md
section: SQL Commands
---

# REVOKE SERVICE ROLE

Revokes a service role from an account role, application role, or database role. For more information, see [Managing service-related privileges](../../developer-guide/snowpark-container-services/working-with-services.md).

See also:
:   [GRANT SERVICE ROLE](grant-service-role.md), [SHOW ROLES IN SERVICE](show-roles-in-service.md), [SHOW GRANTS](show-grants.md)

## Syntax

```sqlsyntax
REVOKE SERVICE ROLE <name> FROM
{
  ROLE <role_name>                     |
  APPLICATION ROLE <application_role_name>  |
  DATABASE ROLE <database_role_name>
}
```

## Parameters

`name`
:   Specifies the identifier for the service role to revoke. If the identifier contains spaces or special
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    Specify the service role name in the following format:

    > `service-name!service-role-name`

    For example, `echo_service!echoendpoint_role`.

`ROLE role_name`
:   Name of the account role to revoke the service role from.

`APPLICATION ROLE application_role`
:   Name of the application role to revoke the service role from.

`DATABASE ROLE database_name`
:   Name of the database role to revoke the service role from.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege or role | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Service | Only the service owner can revoke the service role. |

## Examples

The following command revokes the `echoendpoint_role` service role defined in the `echo_service` service specification from the `service_function_user_role` role.

```sqlexample
REVOKE SERVICE ROLE echo_service!echoendpoint_role FROM ROLE service_function_user_role;
```

---
title: ROLLBACK
source: https://docs.snowflake.com/en/sql-reference/sql/rollback.md
section: SQL Commands
---

# ROLLBACK

Rolls back an open transaction in the current session.

See also:
:   [BEGIN](begin.md) , [COMMIT](commit.md) , [SHOW TRANSACTIONS](show-transactions.md) , [DESCRIBE TRANSACTION](desc-transaction.md)

## Syntax

```sqlsyntax
ROLLBACK [ WORK ]
```

## Parameters

`WORK`
:   Optional keyword that provides compatibility with other database systems.

## Examples

Begin a transaction, insert some values into a table, and then complete the transaction by rolling back the changes made in the transaction:

```sqlexample
SELECT COUNT(*) FROM A1;

+----------+
| COUNT(*) |
|----------+
|        0 |
+----------+

BEGIN NAME T4;

SELECT CURRENT_TRANSACTION();

+-----------------------+
| CURRENT_TRANSACTION() |
|-----------------------+
| 1432071523422         |
+-----------------------+

INSERT INTO A1 VALUES (1), (2);

+-------------------------+
| number of rows inserted |
|-------------------------+
| 2                       |
+-------------------------+

ROLLBACK;

SELECT COUNT(*) FROM A1;

+----------+
| COUNT(*) |
|----------+
|        0 |
+----------+

SELECT CURRENT_TRANSACTION();

+-----------------------+
| CURRENT_TRANSACTION() |
|-----------------------+
| [NULL]                |
+-----------------------+

SELECT LAST_TRANSACTION();

+--------------------+
| LAST_TRANSACTION() |
|--------------------+
| 1432071523422      |
+--------------------+
```

---
title: SELECT
source: https://docs.snowflake.com/en/sql-reference/sql/select.md
section: SQL Commands
---

# SELECT

SELECT can be used as either a statement or as a clause within other statements:

* As a statement, the SELECT statement is the most commonly executed SQL statement; it queries the database and retrieves a set of rows.
* As a clause, SELECT defines the set of columns returned by a query.

See also:
:   [Query syntax](../constructs.md)

## Syntax

The following sections describe the syntax for this command:

* Selecting all columns
* Selecting specific columns

### Selecting all columns

```sqlsyntax
[ ... ]
SELECT [ { ALL | DISTINCT } ]
       [ TOP <n> ]
       [{<object_name>|<alias>}.]*

       [ ILIKE '<pattern>' ]

       [ EXCLUDE
         {
           <col_name> | ( <col_name>, <col_name>, ... )
         }
       ]

       [ REPLACE
         {
           ( <expr> AS <col_name> [ , <expr> AS <col_name>, ... ] )
         }
       ]

       [ RENAME
         {
           <col_name> AS <col_alias>
           | ( <col_name> AS <col_alias>, <col_name> AS <col_alias>, ... )
         }
       ]
```

You can specify the following combinations of keywords after SELECT \*. The keywords must be in the order shown below:

```sqlsyntax
SELECT * ILIKE ... REPLACE ...
```

```sqlsyntax
SELECT * ILIKE ... RENAME ...
```

```sqlsyntax
SELECT * ILIKE ... REPLACE ... RENAME ...
```

```sqlsyntax
SELECT * EXCLUDE ... REPLACE ...
```

```sqlsyntax
SELECT * EXCLUDE ... RENAME ...
```

```sqlsyntax
SELECT * EXCLUDE ... REPLACE ... RENAME ...
```

```sqlsyntax
SELECT * REPLACE ... RENAME ...
```

### Selecting specific columns

```sqlsyntax
[ ... ]
SELECT [ { ALL | DISTINCT } ]
       [ TOP <n> ]
       {
         [{<object_name>|<alias>}.]<col_name>
         | [{<object_name>|<alias>}.]$<col_position>
         | <expr>
       }
       [ [ AS ] <col_alias> ]
       [ , ... ]
[ ... ]
```

A trailing comma is supported in a column list. For example, the following SELECT statement is supported:

```sqlexample
SELECT emp_id,
       name,
       dept,
  FROM employees;
```

For more information about SELECT as a statement and the other clauses within the statement, see
[Query syntax](../constructs.md).

## Parameters

`ALL | DISTINCT`
:   Specifies whether to perform duplicate elimination on the result set:

    * `ALL` includes all values in the result set.
    * `DISTINCT` eliminates duplicate values from the result set.

    Default: `ALL`

`TOP n`
:   Specifies the maximum number of results to return. See [TOP <n>](../constructs/top_n.md).

`object_name` or . `alias`
:   Specifies the object identifier or object alias as defined in the [FROM](../constructs/from.md) clause.

`*`
:   The asterisk is shorthand to indicate that the output should include all columns of the specified object, or all columns of
    all objects if `*` is not qualified with an object name or alias. The columns are returned in the order shown by
    executing the [DESCRIBE](desc.md) command on the object.

    When you specify `*`, you can also specify `ILIKE`, `EXCLUDE`, `REPLACE`, and `RENAME`:

    `ILIKE 'pattern'`
    :   Specifies that only the columns that match `pattern` should be included in the results.

        In `pattern`, you can use the following SQL wildcards:

        * Use an underscore (`_`) to match any single character.
        * Use a percent sign (`%`) to match any sequence of zero or more characters.

        To match a sequence anywhere within the column name, begin and end the pattern with `%`.

        Matching is case-insensitive.

        If no columns match the specified pattern, a compilation error occurs (`001080 (42601): ... SELECT with no columns`).

    `EXCLUDE col_name` . `EXCLUDE (col_name, col_name, ...)`
    :   Specifies the columns that should be excluded from the results.

        If you are selecting from multiple tables, use `SELECT table_name.*` to specify that you want to select all columns
        from a specific table, and specify the unqualified column name in `EXCLUDE`. For example:

        ```sqlexample
        SELECT table_a.* EXCLUDE column_in_table_a ,
          table_b.* EXCLUDE column_in_table_b
          ...
        ```

    `REPLACE (expr AS col_name [ , expr AS col_name, ...] )`
    :   Replaces the value of `col_name` with the value of the evaluated expression `expr`.

        For example, to prepend the string `'DEPT-'` to the values in the `department_id` column, use:

        ```sqlexample
        SELECT * REPLACE ('DEPT-' || department_id AS department_id) ...
        ```

        For `col_name`:

        * The column must exist and cannot be filtered out by `ILIKE` or `EXCEPT`.
        * You cannot specify the same column more than once in the list of replacements.
        * If the column is in multiple tables (for example, in both tables in a join), the statement fails with an “ambiguous column”
          error.

        `expr` must evaluate to a single value.

    `RENAME col_name AS col_alias` . `RENAME (col_name AS col_alias, col_name AS col_alias, ...)`
    :   Specifies the column aliases that should be used in the results.

        If you are selecting from multiple tables, use `SELECT table_name.*` to specify that you want to select all columns
        from a specific table, and specify the unqualified column name in `RENAME`. For example:

        ```sqlexample
        SELECT table_a.* RENAME column_in_table_a AS col_alias_a,
          table_b.* RENAME column_in_table_b AS col_alias_b
          ...
        ```

    > **Note:**
    >
    > When specifying a combination of keywords after `SELECT *`:
    >
    > * You cannot specify both `ILIKE` and `EXCLUDE`.
    > * If you specify `EXCLUDE` with `RENAME` or `REPLACE`:
    >
    >   + You must specify `EXCLUDE` before `RENAME` or `REPLACE`:
    >
    >     ```sqlexample
    >     SELECT * EXCLUDE col_a RENAME col_b AS alias_b ...
    >     ```
    >
    >     ```sqlexample
    >     SELECT * EXCLUDE employee_id REPLACE ('DEPT-' || department_id AS department_id) ...
    >     ```
    >   + You cannot specify the same column in `EXCLUDE` and `RENAME`.
    > * If you specify `ILIKE` with `RENAME` or `REPLACE`, you must specify `ILIKE` first:
    >
    >   ```sqlexample
    >   SELECT * ILIKE '%id%' RENAME department_id AS department ...
    >   ```
    >
    >   ```sqlexample
    >   SELECT * ILIKE '%id%' REPLACE ('DEPT-' || department_id AS department_id) ...
    >   ```
    > * If you specify `REPLACE` and `RENAME`:
    >
    >   + You must specify `REPLACE` first:
    >
    >     ```sqlexample
    >     SELECT * REPLACE ('DEPT-' || department_id AS department_id) RENAME employee_id as employee ...
    >     ```
    >   + You can specify the same column name in `REPLACE` and `RENAME`:
    >
    >     ```sqlexample
    >     SELECT * REPLACE ('DEPT-' || department_id AS department_id) RENAME department_id as department ...
    >     ```

`col_name`
:   Specifies the column identifier as defined in the [FROM](../constructs/from.md) clause.

`$col_position`
:   Specifies the position of the column (1-based) as defined in the [FROM](../constructs/from.md) clause. If a column is
    referenced from a table, this number can’t exceed the maximum number of columns in the table.

`expr`
:   Specifies an expression, such as a mathematical expression, that evaluates
    to a specific value for any given row.

`[ AS ] col_alias`
:   Specifies the column alias assigned to the resulting expression. This is used as the display name in a top-level SELECT list, and the column name in an inline view.

    Do not assign a column alias that is the same as the name of another column referenced in the query.
    For example, if you are selecting columns named `prod_id` and `product_id`, do not alias `prod_id` as `product_id`.
    See Error case: Specifying an alias that matches another column name.

## Usage notes

* Aliases and identifiers are case-insensitive by default. To preserve case, enclose them within double quotes (`"`). For more
  information, see [Object identifiers](../identifiers.md).
* Without an ORDER BY clause, the results returned by SELECT are an unordered set. Running the same query repeatedly against the
  same tables might result in a different output order every time. If order matters, use the `ORDER BY` clause.
* SELECT can be used not only as an independent statement, but also as a clause in other statements, for example
  `INSERT INTO ... SELECT ...;`. SELECT can also be used in a
  [subquery](../../user-guide/querying-subqueries.md) within a statement.
* In many cases, when you use a column alias for an expression (i.e. `expr AS col_alias`) in other parts of the same
  query (in JOIN, FROM, WHERE, GROUP BY, other column expressions, etc.), the expression is evaluated only once.

  However, note that in some cases, the expression can be evaluated multiple times, which can result in different values for the
  alias used in different parts of the same query.

## Examples

A few simple examples are provided below.

* Setting up the data for the examples
* Examples of selecting all columns (SELECT \*)
* Examples of selecting specific columns (SELECT colname)

Many additional examples are included in other parts of the documentation, including the detailed descriptions of
[Query syntax](../constructs.md).

For examples related to querying an event table (whose schema is predefined by Snowflake), refer to
[Viewing log messages](../../developer-guide/logging-tracing/logging-accessing-messages.md) and [Viewing trace data](../../developer-guide/logging-tracing/tracing-accessing-events.md).

### Setting up the data for the examples

Some of the queries below use the following tables and data:

> ```sqlexample
> CREATE TABLE employee_table (
>     employee_ID INTEGER,
>     last_name VARCHAR,
>     first_name VARCHAR,
>     department_ID INTEGER
>     );
>
> CREATE TABLE department_table (
>     department_ID INTEGER,
>     department_name VARCHAR
>     );
> ```
>
> ```sqlexample
> INSERT INTO employee_table (employee_ID, last_name, first_name, department_ID) VALUES
>     (101, 'Montgomery', 'Pat', 1),
>     (102, 'Levine', 'Terry', 2),
>     (103, 'Comstock', 'Dana', 2);
>
> INSERT INTO department_table (department_ID, department_name) VALUES
>     (1, 'Engineering'),
>     (2, 'Customer Support'),
>     (3, 'Finance');
> ```

### Examples of selecting all columns (SELECT \*)

* Selecting all columns in the table
* Selecting all columns with names that match a pattern
* Selecting all columns except one column
* Selecting all columns except two or more columns
* Selecting all columns and renaming one column
* Selecting all columns and renaming multiple columns
* Selecting all columns with names that match a pattern and renaming a column
* Selecting all columns, excluding a column, and renaming multiple columns
* Selecting all columns and replacing the value of a column
* Selecting all columns, replacing the value of a column, and renaming the column
* Selecting all columns with names that match a pattern and replacing the value in a column
* Selecting all columns from multiple tables, excluding a column, and renaming a column

#### Selecting all columns in the table

This example shows how to select all columns in `employee_table`:

```sqlexample
SELECT * FROM employee_table;
```

```output
+-------------+------------+------------+---------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME | DEPARTMENT_ID |
|-------------+------------+------------+---------------|
|         101 | Montgomery | Pat        |             1 |
|         102 | Levine     | Terry      |             2 |
|         103 | Comstock   | Dana       |             2 |
+-------------+------------+------------+---------------+
```

#### Selecting all columns with names that match a pattern

This example shows how to select all columns in `employee_table` with names that contain `id`:

```sqlexample
SELECT * ILIKE '%id%' FROM employee_table;
```

```output
+-------------+---------------+
| EMPLOYEE_ID | DEPARTMENT_ID |
|-------------+---------------|
|         101 |             1 |
|         102 |             2 |
|         103 |             2 |
+-------------+---------------+
```

#### Selecting all columns except one column

This example shows how to select all columns in `employee_table` except for the `department_id` column:

```sqlexample
SELECT * EXCLUDE department_id FROM employee_table;
```

```output
+-------------+------------+------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME |
|-------------+------------+------------|
|         101 | Montgomery | Pat        |
|         102 | Levine     | Terry      |
|         103 | Comstock   | Dana       |
+-------------+------------+------------+
```

#### Selecting all columns except two or more columns

This example shows how to select all columns in `employee_table` except for the `department_id` and `employee_id` columns:

```sqlexample
SELECT * EXCLUDE (department_id, employee_id) FROM employee_table;
```

```output
+------------+------------+
| LAST_NAME  | FIRST_NAME |
|------------+------------|
| Montgomery | Pat        |
| Levine     | Terry      |
| Comstock   | Dana       |
+------------+------------+
```

#### Selecting all columns and renaming one column

This example shows how to select all columns in `employee_table` and rename the `department_id` column:

```sqlexample
SELECT * RENAME department_id AS department FROM employee_table;
```

```output
+-------------+------------+------------+------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME | DEPARTMENT |
|-------------+------------+------------+------------|
|         101 | Montgomery | Pat        |          1 |
|         102 | Levine     | Terry      |          2 |
|         103 | Comstock   | Dana       |          2 |
+-------------+------------+------------+------------+
```

#### Selecting all columns and renaming multiple columns

This example shows how to select all columns in `employee_table` and rename the `department_id` and `employee_id` columns:

```sqlexample
SELECT * RENAME (department_id AS department, employee_id AS id) FROM employee_table;
```

```output
+-----+------------+------------+------------+
|  ID | LAST_NAME  | FIRST_NAME | DEPARTMENT |
|-----+------------+------------+------------|
| 101 | Montgomery | Pat        |          1 |
| 102 | Levine     | Terry      |          2 |
| 103 | Comstock   | Dana       |          2 |
+-----+------------+------------+------------+
```

#### Selecting all columns, excluding a column, and renaming multiple columns

This example shows how to select all columns in `employee_table`, exclude the `first_name` column, and rename the
`department_id` and `employee_id` columns:

```sqlexample
SELECT * EXCLUDE first_name RENAME (department_id AS department, employee_id AS id) FROM employee_table;
```

```output
+-----+------------+------------+
|  ID | LAST_NAME  | DEPARTMENT |
|-----+------------+------------|
| 101 | Montgomery |          1 |
| 102 | Levine     |          2 |
| 103 | Comstock   |          2 |
+-----+------------+------------+
```

#### Selecting all columns with names that match a pattern and renaming a column

This example shows how to select all columns in `employee_table` with names that contain `id` and rename the
`department_id` column:

```sqlexample
SELECT * ILIKE '%id%' RENAME department_id AS department FROM employee_table;
```

```output
+-------------+------------+
| EMPLOYEE_ID | DEPARTMENT |
|-------------+------------|
|         101 |          1 |
|         102 |          2 |
|         103 |          2 |
+-------------+------------+
```

#### Selecting all columns and replacing the value of a column

This example shows how to select all columns in `employee_table` and replace the value in the `department_id` column with
the ID prepended with `DEPT-`:

```sqlexample
SELECT * REPLACE ('DEPT-' || department_id AS department_id) FROM employee_table;
```

```output
+-------------+------------+------------+---------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME | DEPARTMENT_ID |
|-------------+------------+------------+---------------|
|         101 | Montgomery | Pat        | DEPT-1        |
|         102 | Levine     | Terry      | DEPT-2        |
|         103 | Comstock   | Dana       | DEPT-2        |
+-------------+------------+------------+---------------+
```

#### Selecting all columns, replacing the value of a column, and renaming the column

This example shows how to select all columns in `employee_table`, replace the value in the `department_id` column with
the ID prepended with `DEPT-`, and rename the column:

```sqlexample
SELECT * REPLACE ('DEPT-' || department_id AS department_id) RENAME department_id AS department FROM employee_table;
```

```output
+-------------+------------+------------+------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME | DEPARTMENT |
|-------------+------------+------------+------------|
|         101 | Montgomery | Pat        | DEPT-1     |
|         102 | Levine     | Terry      | DEPT-2     |
|         103 | Comstock   | Dana       | DEPT-2     |
+-------------+------------+------------+------------+
```

#### Selecting all columns with names that match a pattern and replacing the value in a column

This example shows how to select all columns in `employee_table` with names that contain `id` and prepending `DEPT-` to the
values in the `department_id` column:

```sqlexample
SELECT * ILIKE '%id%' REPLACE('DEPT-' || department_id AS department_id) FROM employee_table;
```

```output
+-------------+---------------+
| EMPLOYEE_ID | DEPARTMENT_ID |
|-------------+---------------|
|         101 | DEPT-1        |
|         102 | DEPT-2        |
|         103 | DEPT-2        |
+-------------+---------------+
```

#### Selecting all columns from multiple tables, excluding a column, and renaming a column

This example joins two tables and selects all columns from both tables except one column from `employee_table`. The example also
renames one of the columns selected from `department_table`.

```sqlexample
SELECT
  employee_table.* EXCLUDE department_id,
  department_table.* RENAME department_name AS department
FROM employee_table INNER JOIN department_table
  ON employee_table.department_id = department_table.department_id
ORDER BY department, last_name, first_name;
```

```output
+-------------+------------+------------+---------------+------------------+
| EMPLOYEE_ID | LAST_NAME  | FIRST_NAME | DEPARTMENT_ID | DEPARTMENT       |
|-------------+------------+------------+---------------+------------------|
|         103 | Comstock   | Dana       |             2 | Customer Support |
|         102 | Levine     | Terry      |             2 | Customer Support |
|         101 | Montgomery | Pat        |             1 | Engineering      |
+-------------+------------+------------+---------------+------------------+
```

### Examples of selecting specific columns (SELECT colname)

* Selecting a single column by name
* Selecting multiple columns by name from joined tables
* Selecting a column by position
* Specifying an alias for a column in the output
* Error case: Specifying an alias that matches another column name

#### Selecting a single column by name

This example shows how to look up an employee’s last name if you know their ID.

```sqlexample
SELECT last_name FROM employee_table WHERE employee_ID = 101;
+------------+
| LAST_NAME  |
|------------|
| Montgomery |
+------------+
```

#### Selecting multiple columns by name from joined tables

This example lists each employee and the name of the department that each employee works in. The output is in order by department
name, and within each department the employees are in order by name. This query uses a join to relate the information in one table
to the information in another table.

```sqlexample
SELECT department_name, last_name, first_name
    FROM employee_table INNER JOIN department_table
        ON employee_table.department_ID = department_table.department_ID
    ORDER BY department_name, last_name, first_name;
+------------------+------------+------------+
| DEPARTMENT_NAME  | LAST_NAME  | FIRST_NAME |
|------------------+------------+------------|
| Customer Support | Comstock   | Dana       |
| Customer Support | Levine     | Terry      |
| Engineering      | Montgomery | Pat        |
+------------------+------------+------------+
```

#### Selecting a column by position

This example shows how to use `$` to specify a column by column number, rather than by column name:

```sqlexample
SELECT $2 FROM employee_table ORDER BY $2;
+------------+
| $2         |
|------------|
| Comstock   |
| Levine     |
| Montgomery |
+------------+
```

#### Specifying an alias for a column in the output

This example shows that the output columns do not need to be taken directly from the tables in the `FROM` clause; the output columns
can be general expressions. This example calculates the area of a circle that has a radius of 2.0. This example also shows how to use
a column alias so that the output has a meaningful column name:

```sqlexample
SELECT pi() * 2.0 * 2.0 AS area_of_circle;
+----------------+
| AREA_OF_CIRCLE |
|----------------|
|   12.566370614 |
+----------------+
```

#### Error case: Specifying an alias that matches another column name

This example demonstrates why it is not recommended to use a column alias that matches
the name of another column that is used in the query. This GROUP BY query results in a
SQL compiler error, not an ambiguous column error.
The alias `prod_id` that is assigned to `product_id` in `table1` matches the name
of the `prod_id` column in `table2`. The simplest solution to this error is to give
the column a different alias.

```sqlexample
CREATE OR REPLACE TABLE table1 (product_id NUMBER);

CREATE OR REPLACE TABLE table2 (prod_id NUMBER);

SELECT t1.product_id AS prod_id, t2.prod_id
  FROM table1 AS t1 JOIN table2 AS t2
    ON t1.product_id=t2.prod_id
  GROUP BY prod_id, t2.prod_id;
```

```output
001104 (42601): SQL compilation error: error line 1 at position 7
'T1.PRODUCT_ID' in select clause is neither an aggregate nor in the group by clause.
```

---
title: SET
source: https://docs.snowflake.com/en/sql-reference/sql/set.md
section: SQL Commands
---

# SET

Initializes the value of a [session variable](../session-variables.md) to the result of a SQL expression.

See also:
:   [SHOW VARIABLES](show-variables.md) , [UNSET](unset.md)

## Syntax

```sqlsyntax
SET <var> = <expr>

SET ( <var> [ , <var> ... ] )  = ( <expr> [ , <expr> ... ] )
```

## Parameters

`var`
:   Specifies the identifier for the variable to initialize.

`expr`
:   Specifies the SQL expression for the variable.

## Usage notes

* You can set multiple variables in the same statement.
* If you specify complex expressions, a running virtual warehouse might be required in the session.
* The number of expressions must match the number of variables to initialize.
* The size of string or binary variables is limited to 256 bytes.
* The identifier (i.e. name) for a SQL variable is limited to 256 characters.
* Variable names such as `CURRENT` or `PUBLIC` are reserved for future use by Snowflake and cannot be used.

## Examples

These two examples use constants to set variables:

```sqlexample
SET V1 = 10;

SET V2 = 'example';
```

This example sets more than one variable at a time:

```sqlexample
SET (V1, V2) = (10, 'example');
```

This example sets the variable to the value of a non-trivial expression that uses a SQL query:

```sqlexample
SET id_threshold = (SELECT COUNT(*)/2 FROM table1);
```

The following example shows the result when a SET command evaluates all of the expressions on the right-hand side of the assignment operator
before setting the first expression on the left-hand side of the operator. Note that the value of the variable named `max` is set
based on the old value of `min`, not the new value.

```sqlexample
SET (min, max) = (40, 70);
```

```sqlexample
SET (min, max) = (50, 2 * $min);

SELECT $max;
```

```output
+------+
| $MAX |
|------|
|   80 |
+------+
```

---
title: SHOW <objects>
source: https://docs.snowflake.com/en/sql-reference/sql/show.md
section: SQL Commands
---

# SHOW *<objects>*

Lists the existing objects for the specified object type. The output includes metadata for the objects, including:

* Common properties (name, creation timestamp, owning role, comment, etc.)
* Object-specific properties

See also:
:   [CREATE <object>](create.md) , [DESCRIBE <object>](desc.md)

## SHOW commands

For specific syntax, usage notes, and examples, see:

**Account Operations**

> * [SHOW ACCOUNTS](show-accounts.md)
> * [SHOW CONNECTIONS](show-connections.md)
> * [SHOW GLOBAL ACCOUNTS](show-global-accounts.md)
> * [SHOW ORGANIZATION ACCOUNTS](show-organization-accounts.md)
> * [SHOW PARAMETERS](show-parameters.md)
> * [SHOW REGIONS](show-regions.md)
> * [SHOW RELEASE DIRECTIVES](show-release-directives.md)
> * [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md)
> * [SHOW VERSIONS IN APPLICATION PACKAGE](show-versions.md)

**Session / User Operations:**

> * [SHOW LOCKS](show-locks.md)
> * [SHOW PARAMETERS](show-parameters.md)
> * [SHOW TRANSACTIONS](show-transactions.md)
> * [SHOW VARIABLES](show-variables.md)

**Account Objects:**

> * [SHOW APPLICATIONS](show-applications.md)
> * [SHOW APPLICATION PACKAGES](show-application-packages.md)
> * [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md)
> * [SHOW COMPUTE POOLS](show-compute-pools.md)
> * [SHOW COMPUTE POOL INSTANCE FAMILIES](show-compute-pool-instance-families.md)
> * [SHOW DATABASE ROLES](show-database-roles.md)
> * [SHOW DATABASES](show-databases.md)
> * [SHOW EXTERNAL VOLUMES](show-external-volumes.md)
> * [SHOW FAILOVER GROUPS](show-failover-groups.md)
> * [SHOW INTEGRATIONS](show-integrations.md)
> * [SHOW FEATURE POLICIES](show-feature-policies.md)
> * [SHOW FUNCTIONS](show-functions.md)
> * [SHOW GRANTS](show-grants.md)
> * [SHOW NETWORK POLICIES](show-network-policies.md)
> * [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)
> * [SHOW OPENFLOW DATA PLANE INTEGRATIONS](show-oflow-data-plane-integration.md)
> * [SHOW ORGANIZATION PROFILES](show-organization-profiles.md)
> * [SHOW PARAMETERS](show-parameters.md)
> * [SHOW POSTGRES INSTANCES](show-postgres-instances.md)
> * [SHOW REPLICATION DATABASES](show-replication-databases.md)
> * [SHOW REPLICATION GROUPS](show-replication-groups.md)
> * [SHOW RESOURCE MONITORS](show-resource-monitors.md)
> * [SHOW ROLES](show-roles.md)
> * [SHOW SHARES](show-shares.md)
> * [SHOW SPECIFICATIONS](show-specifications.md)
> * [SHOW USER PROGRAMMATIC ACCESS TOKENS](show-user-programmatic-access-tokens.md)
> * [SHOW USERS](show-users.md)
> * [SHOW WAREHOUSES](show-warehouses.md)

**Database Objects:**

> * [SHOW AGENTS](show-agents.md)
> * [SHOW AGGREGATION POLICIES](show-aggregation-policies.md)
> * [SHOW ALERTS](show-alerts.md)
> * [SHOW AUTHENTICATION POLICIES](show-authentication-policies.md)
> * [SHOW BACKUP POLICIES](show-backup-policies.md)
> * [SHOW BACKUP SETS](show-backup-sets.md)
> * [SHOW CHANNELS](show-channels.md)
> * [SHOW CLASSES](show-classes.md)
> * [SHOW COLUMNS](show-columns.md)
> * [SHOW CONFIGURATIONS](show-configurations.md)
> * [SHOW CONTACTS](show-contacts.md)
> * [SHOW CORTEX SEARCH SERVICES](show-cortex-search.md)
> * [SHOW DATA METRIC FUNCTIONS](show-data-metric-functions.md)
> * [SHOW DATASETS](show-datasets.md)
> * [SHOW DBT PROJECTS](show-dbt-projects.md)
> * [SHOW DCM PROJECTS](show-dcm-projects.md)
> * [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)
> * [SHOW DYNAMIC TABLES](show-dynamic-tables.md)
> * [SHOW ENTITIES IN DCM PROJECT](show-entities-in-dcm-project.md)
> * [SHOW EVENT TABLES](show-event-tables.md)
> * [SHOW EXPERIMENTS](show-experiments.md)
> * [SHOW EXTERNAL FUNCTIONS](show-external-functions.md)
> * [SHOW EXTERNAL TABLES](show-external-tables.md)
> * [SHOW FILE FORMATS](show-file-formats.md)
> * [SHOW FUNCTIONS](show-functions.md)
> * [SHOW GATEWAYS](show-gateways.md)
> * [SHOW GIT BRANCHES](show-git-branches.md)
> * [SHOW GIT REPOSITORIES](show-git-repositories.md)
> * [SHOW GIT TAGS](show-git-tags.md)
> * [SHOW GRANTS IN DCM PROJECT](show-grants-in-dcm-project.md)
> * [SHOW HYBRID TABLES](show-hybrid-tables.md)
> * [SHOW ICEBERG TABLES](show-iceberg-tables.md)
> * [SHOW INDEXES](show-indexes.md)
> * [SHOW IMAGE REPOSITORIES](show-image-repositories.md)
> * [SHOW JOIN POLICIES](show-join-policies.md)
> * [SHOW LISTINGS](show-listings.md)
> * [SHOW MAINTENANCE POLICIES](show-maintenance-policies.md)
> * [SHOW MASKING POLICIES](show-masking-policies.md)
> * [SHOW MATERIALIZED VIEWS](show-materialized-views.md)
> * [SHOW MCP SERVERS](show-mcp-servers.md)
> * [SHOW MODEL MONITORS](show-model-monitors.md)
> * [SHOW MODELS](show-models.md)
> * [SHOW NETWORK RULES](show-network-rules.md)
> * [SHOW NOTEBOOKS](show-notebooks.md)
> * [SHOW NOTEBOOK PROJECTS](show-notebook-projects.md)
> * [SHOW OBJECTS](show-objects.md)
> * [SHOW OBJECTS OWNED BY APPLICATION](show-objects-owned-by-application.md)
> * [SHOW ONLINE FEATURE TABLES](show-online-feature-tables.md)
> * [SHOW PACKAGES POLICIES](show-packages-policies.md)
> * [SHOW PASSWORD POLICIES](show-password-policies.md)
> * [SHOW PIPES](show-pipes.md)
> * [SHOW PRIVACY POLICIES](show-privacy-policies.md)
> * [SHOW PROCEDURES](show-procedures.md)
> * [SHOW PROJECTION POLICIES](show-projection-policies.md)
> * [SHOW ROW ACCESS POLICIES](show-row-access-policies.md)
> * [SHOW SCHEMAS](show-schemas.md)
> * [SHOW SECRETS](show-secrets.md)
> * [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md)
> * [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md)
> * [SHOW SEMANTIC FACTS](show-semantic-facts.md)
> * [SHOW SEMANTIC METRICS](show-semantic-metrics.md)
> * [SHOW SEMANTIC VIEWS](show-semantic-views.md)
> * [SHOW SEQUENCES](show-sequences.md)
> * [SHOW SERVICES](show-services.md)
> * [SHOW SESSION POLICIES](show-session-policies.md)
> * [SHOW SNAPSHOT POLICIES — Deprecated](show-snapshot-policies.md) (deprecated; prefer [SHOW BACKUP POLICIES](show-backup-policies.md))
> * [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md) (deprecated; prefer [SHOW BACKUP SETS](show-backup-sets.md))
> * [SHOW SNAPSHOTS](show-snapshots.md)
> * [SHOW SPECIFICATIONS](show-specifications.md)
> * [SHOW STAGES](show-stages.md)
> * [SHOW STORAGE LIFECYCLE POLICIES](show-storage-lifecycle-policies.md)
> * [SHOW STREAMLITS](show-streamlits.md)
> * [SHOW STREAMS](show-streams.md)
> * [SHOW TABLES](show-tables.md)
> * [SHOW TAGS](show-tags.md)
> * [SHOW TASKS](show-tasks.md)
> * [SHOW TYPES](show-types.md)
> * [SHOW USER FUNCTIONS](show-user-functions.md)
> * [SHOW USER PROCEDURES](show-user-procedures.md)
> * [SHOW VERSIONS IN DATASET](show-versions-in-dataset.md)
> * [SHOW VERSIONS IN DBT PROJECT](show-versions-in-dbt-project.md)
> * [SHOW VERSIONS IN LISTING](show-versions-in-listing.md)
> * [SHOW VERSIONS IN MODEL](show-versions-in-model.md)
> * [SHOW VIEWS](show-views.md)
> * [SHOW WORKSPACES](show-workspaces.md)

**Classes:**

> * [SHOW SNOWFLAKE.ML.ANOMALY_DETECTION](../classes/anomaly-detection/commands/show-anomaly-detection.md)
> * [SHOW BUDGET](../classes/budget/commands/show-budget.md)
> * [SHOW SNOWFLAKE.ML.CLASSIFICATION](../classes/classification/commands/show-classification.md)
> * [SHOW CLASSIFICATION_PROFILE](../classes/classification_profile/commands/show-classification-profile.md)
> * [SHOW CUSTOM_CLASSIFIER](../classes/custom_classifier/commands/show-custom-classifiers.md)
> * [SHOW SNOWFLAKE.ML.FORECAST](../classes/forecast/commands/show-forecast.md)

---
title: SHOW ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-accounts.md
section: SQL Commands
---

# SHOW ACCOUNTS

Lists all the accounts in your organization, excluding [managed accounts](../../user-guide/data-sharing-reader-create.md).

See also:
:   [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md), [SHOW MANAGED ACCOUNTS](show-managed-accounts.md)

## Syntax

```sqlsyntax
SHOW ACCOUNTS [ HISTORY ] [ LIKE '<pattern>' ]
```

## Parameters

`HISTORY`
:   Optionally includes dropped accounts that have not yet been deleted. The output of SHOW ACCOUNTS HISTORY includes additional
    columns related to dropped accounts.

    Default: No value (dropped accounts are not included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The command output provides global account properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `organization_name` | Name of the organization. |
| `account_name` | User-defined name that identifies an account within the organization. |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `edition` | [Snowflake Edition](../../user-guide/intro-editions.md) of the account. |
| `account_url` | Preferred Snowflake account URL that includes the values of organization_name and account_name. |
| `created_on` | Date and time when the account was created. |
| `comment` | Comment for the account. |
| `account_locator` | System-assigned identifier of the account. |
| `account_locator_url` | Legacy Snowflake account URL syntax that includes the region_name and account_locator. |
| `managed_accounts` | Indicates how many [managed accounts](../../user-guide/data-sharing-reader-create.md) have been created by the account. |
| `consumption_billing_entity_name` | Name of the consumption billing entity. |
| `marketplace_consumer_billing_entity_name` | Name of the marketplace consumer billing entity. |
| `marketplace_provider_billing_entity_name` | Name of the marketplace provider billing entity. |
| `old_account_url` | If the original [account URL](../../user-guide/organizations-connect.md) was saved when the account was renamed, provides the original URL. If the original account URL was dropped, the value is NULL even if the account was renamed. |
| `is_org_admin` | Indicates whether the ORGADMIN role is enabled in an account. If TRUE, the role is enabled. |
| `dropped_on` [1] | Date and time when the account was last dropped. |
| `scheduled_deletion_time` [1] | Date and time when the account is scheduled to be permanently deleted. Accounts are deleted within one hour after the scheduled time. |
| `restored_on` [1] | Date and time when the account was last restored. |
| `account_old_url_saved_on` | If the original account URL was saved when the account was renamed, provides the date and time when the original account URL was saved. |
| `account_old_url_last_used` | If the original account URL was saved when the account was renamed, indicates the last time the account was accessed using the original URL. |
| `organization_old_url` | If the account’s organization was changed in a way that created a new [account URL](../../user-guide/organizations-connect.md) and the original account URL was saved, provides the original account URL. If the original account URL was dropped, the value is NULL even if the organization changed. |
| `organization_old_url_saved_on` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, provides the date and time when the original account URL was saved. |
| `organization_old_url_last_used` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, indicates the last time the account was accessed using the original account URL. |
| `moved_to_organization` [1] | If the account was moved to a different organization, provides the name of that organization. |
| `moved_on` [1] | Date and time when the account was moved to a different organization. |
| `organization_URL_expiration_on` [1] | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, provides the date and time when the original account URL will be dropped. Dropped URLs cannot be used to access the account. |
| `is_events_account` | Indicates whether an account is an events account. For more information, see [Use logging and event tracing for an app](../../developer-guide/native-apps/event-about.md). |
| `is_organization_account` | Indicates whether an account is the [organization account](../../user-guide/organization-accounts.md). |

[1]
(1,2,3,4,5,6)

This column is only displayed when the HISTORY keyword is specified for the command.

## Access control requirements

When an [organization administrator](../../user-guide/organization-administrators.md) runs this command, the output includes all of the
columns.

You can also use a role with one of the following privileges to run the command, but only a subset of the columns are returned:

* CREATE LISTING
* CREATE DATA EXCHANGE LISTING
* CREATE ORGANIZATION LISTING

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the accounts whose name starts with `myaccount`:

```sqlexample
SHOW ACCOUNTS LIKE 'myaccount%';
```

---
title: SHOW AGENTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-agents.md
section: SQL Commands
---

# SHOW AGENTS

Lists the [Cortex Agents](../../user-guide/snowflake-cortex/cortex-agents.md) for which you have access privileges.

See also:
:   [ALTER AGENT](alter-agent.md), [CREATE AGENT](create-agent.md), [DROP AGENT](drop-agent.md), [DESCRIBE AGENT](desc-agent.md), [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](../functions/data_agent_run-snowflake-cortex.md)

## Syntax

```sqlsyntax
SHOW AGENTS
  [ LIKE '<pattern>' ]
  [ IN { ACCOUNT | DATABASE <db_name> | SCHEMA [<db_name>.]<schema_name> } ]
  [ STARTS WITH '<string>' ]
  [ LIMIT <rows> [ FROM '<string_from>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The command output provides Cortex Agent properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp when the agent was created. |
| `name` | Name of the agent. |
| `database_name` | Database containing the agent. |
| `schema_name` | Schema containing the agent. |
| `owner` | Owner role of the agent. |
| `comment` | Comment text for the agent. |
| `profile` | Agent profile JSON (display_name, avatar, color). |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, MODIFY, or MONITOR | Agent |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

List all agents in the current schema:

```sqlexample
SHOW AGENTS;
```

Sample output:

```output
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+
| created_on         | name  | database_name | schema_name | owner     | comment          | profile                            |
|--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------|
| 2025-09-15 17:04:37.263 +0000 | TEST_AGENT | EXAMPLE_DB   | AGENTS | TEST_ROLE | null | {"display_name":"test"} |
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+
```

The following example lists agents in a specific schema:

```sqlexample
SHOW AGENTS IN SCHEMA mydb.myschema;
```

The following example lists agents in a specific database:

```sqlexample
SHOW AGENTS IN DATABASE mydb;
```

The following example lists all agents in the account:

```sqlexample
SHOW AGENTS IN ACCOUNT;
```

The following example lists agents with names that start with `my_agent`:

```sqlexample
SHOW AGENTS LIKE 'my_agent%';
```

The following example lists the first 10 agents. The second statement lists the first 10 agents, started from the agent named `AGENT_NAME`.

```sqlexample
SHOW AGENTS LIMIT 10;
SHOW AGENTS LIMIT 10 FROM 'AGENT_NAME';
```

The following example lists agents as resources in JSON format:

```sqlexample
SHOW AS RESOURCE AGENTS;
```

---
title: SHOW AGGREGATION POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-aggregation-policies.md
section: SQL Commands
---

# SHOW AGGREGATION POLICIES

Lists information about existing [aggregation policies](../../user-guide/aggregation-policies.md), including the creation date, database and
schema names, owner, and any available comments.

See also:
:   [Aggregation policy DDL reference](../../user-guide/aggregation-policies.md)

## Syntax

```sqlsyntax
SHOW AGGREGATION POLICIES  [ LIKE '<pattern>' ]
                           [ IN
                               {
                                 ACCOUNT                  |

                                 DATABASE [ <database_name> ] |

                                 SCHEMA [ <schema_name> ]     |
                               }
                           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY AGGREGATION POLICY | Account |  |
| APPLY | Aggregation policy |  |
| OWNERSHIP | Aggregation policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on aggregation policy DDL and privileges, see [Privileges and commands](../../user-guide/aggregation-policies.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW AGGREGATION POLICIES;
```

---
title: SHOW ALERTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-alerts.md
section: SQL Commands
---

# SHOW ALERTS

Lists the [alerts](../../user-guide/alerts.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE ALERT](create-alert.md) , [ALTER ALERT](alter-alert.md), [DROP ALERT](drop-alert.md) , [DESCRIBE ALERT](desc-alert.md) , [EXECUTE ALERT](execute-alert.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] ALERTS [ LIKE '<pattern>' ]
                      [ IN
                            {
                              ACCOUNT                                         |

                              DATABASE                                        |
                              DATABASE <database_name>                        |

                              SCHEMA                                          |
                              SCHEMA <schema_name>                            |
                              <schema_name>

                              APPLICATION <application_name>                  |
                              APPLICATION PACKAGE <application_package_name>  |
                            }
                      ]
                      [ STARTS WITH '<name_string>' ]
                      [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind` (shows NULL for all alerts)
    * `database_name`
    * `schema_name`
    * `schedule`
    * `state`

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the alert was created. |
| `name` | Name of the alert. |
| `database_name` | Database in which the alert is stored. |
| `schema_name` | Schema in which the alert is stored. |
| `owner` | Role that owns the alert (i.e. has the OWNERSHIP privilege on the alert) |
| `comment` | Comment for the alert. |
| `warehouse` | Warehouse that provides the required resources to run the alert. |
| `schedule` | Schedule for evaluating the condition for the alert. |
| `state` | Specifies the state of the alert. An alert can have one of the following states:   * `suspended` * `started` |
| `condition` | The text of the SQL statement that serves as the condition when the alert should be triggered. |
| `action` | The text of the SQL statement that should be executed when the alert is triggered. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR, OPERATE, or OWNERSHIP | Alert | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Only returns rows for an alert owner (that is, the role with the OWNERSHIP privilege on an alert) or a role with the OPERATE
  privilege on an alert.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

See [Viewing details about an alert](../../user-guide/alerts.md).

---
title: SHOW APPLICATION PACKAGES
source: https://docs.snowflake.com/en/sql-reference/sql/show-application-packages.md
section: SQL Commands
---

# SHOW APPLICATION PACKAGES

Lists the application packages for which you have access privileges across your entire account in the Native Apps Framework.

The output returns application package metadata and properties, ordered lexicographically by name.
This is important to note if you wish to filter the results using the provided filters.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md), [CREATE APPLICATION PACKAGE](create-application-package.md), [DROP APPLICATION PACKAGE](drop-application-package.md)

## Syntax

```sqlsyntax
SHOW APPLICATION PACKAGES [ LIKE '<pattern>' ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ];
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

```sqlexample
SHOW APPLICATION PACKAGES;
```

```output
+-------------------------------+-------------------------+------------+------------+--------------+----------------+----------+---------+----------------+------------+-------------------+-----------+
| created_on                    | name                    | is_default | is_current | distribution | owner          | comment  | options | retention_time | dropped_on | application_class | type      |
| 2023-06-02 16:28:31.371 -0700 | hello_snowflake_package | N          | N          | INTERNAL     | ACCOUNTADMIN   |          |         | 1              | NULL       | NULL              | NATIVE    |
+-------------------------------+-------------------------+------------+------------+--------------+----------------+----------+---------+----------------+------------+-------------------+-----------+
```

---
title: SHOW APPLICATION ROLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-application-roles.md
section: SQL Commands
---

# SHOW APPLICATION ROLES

Lists the application roles in the specified app for which you have access privileges.

See also:
:   [ALTER APPLICATION ROLE](alter-application-role.md), [CREATE APPLICATION ROLE](create-application-role.md), [GRANT APPLICATION ROLE](grant-application-role.md),
    [REVOKE APPLICATION ROLE](revoke-application-role.md)

## Syntax

```sqlsyntax
SHOW APPLICATION ROLES [ LIKE <pattern> ] IN APPLICATION <name>
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Required parameters

`name`
:   Specifies the app whose application roles you want to view.

## Optional parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The output of this command displays `SNOWFLAKE` in the `owner` column.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

To view all application roles in a specific app:

```sqlexample
SHOW APPLICATION ROLES IN APPLICATION hello_snowflake_app;
```

To view up to ten application roles in the app named `myapp` after the first application role named `app_role2`:

```sqlexample
SHOW APPLICATION ROLES IN APPLICATION myapp LIMIT 10 FROM 'app_role2';
```

To view application roles with a name that includes the substring ‘role’ in the app named `myapp`:

```sqlexample
SHOW APPLICATION ROLES like '%role%' IN APPLICATION myapp;
```

---
title: SHOW APPLICATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-applications.md
section: SQL Commands
---

# SHOW APPLICATIONS

Lists the Snowflake Native Apps that you have access privileges for across your entire account.

The output returns metadata and properties for the app, ordered lexicographically
by name. This is important to note if you want to filter the results using the provided filters.

See also:
:   [ALTER APPLICATION](alter-application.md),
    [CREATE APPLICATION](create-application.md),
    [DESCRIBE APPLICATION](desc-application.md),
    [DROP APPLICATION](drop-application.md)

## Syntax

```sqlsyntax
SHOW APPLICATIONS [ LIKE '<pattern>' ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ];
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides app properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the app was created. |
| `name` | The name of the app. |
| `is_default` | Specifies whether the app is in the default namespace for the user. |
| `is_current` | Specifies whether the app is in the current session context. |
| `source_type` | Specifies the source of the app. The following values are valid:   * APPLICATION PACKAGE * LISTING |
| `source` | The name of the application package or listing used to create the app. |
| `owner` | The role used to create the app. |
| `comment` | Text that provides information about the app. |
| `version` | The version identifier used to create the app. |
| `label` | The version label of the app. This label is visible to consumers when they install an app. |
| `patch` | The patch number used to create the app. |
| `options` | For an app, this field is always empty. |
| `retention_time` | The retention time of the app. |
| `upgrade_state` | The current state of the background installation or upgrade of the app. See [Application version upgrade states](../data-sharing-usage/application-state-view.md) for more information. |
| `disablement_reasons` | The reason why the app was disabled. For more information, see [Disabled apps](../../developer-guide/native-apps/release-channels-upgrade.md). |
| `last_upgraded_on` | The timestamp when the app was last upgraded. |
| `release_channel_name` | The name of the release channel used to create the app. If the app was not created from a release channel, the value of this property is `default`. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

```sqlexample
SHOW APPLICATIONS;
```

```output
+-------------------------------+------------------------+------------+------------+---------------------+----------------------------+---------------+---------+---------------------+-----------------+-------+---------+----------------+---------------+-----------+
| created_on                    | name                   | is_default | is_current | source_type         | source                     | owner         | comment | version             | label           | patch | options | retention_time | upgrade_state | type      |
|-------------------------------+------------------------+------------+------------+---------------------+----------------------------+---------------+---------+---------------------+-----------------+-------+---------+----------------|---------------+-----------+
| 2023-02-03 10:14:09.828 -0800 | hello_snowflake_app    | N          | Y          | APPLICATION PACKAGE | hello_snowflake_package    | PROVIDER_ROLE |         | v1                  | Version v1      |     0 |         | 1              | COMPLETE      | NATIVE    |
| 2023-03-22 16:12:40.373 -0700 | PRODUCTION_APP         | Y          | Y          | APPLICATION PACKAGE | hello_snowflake_package    | PROVIDER_ROLE |         | v2                  | Version v2      |     0 |         | 1              | COMPLETE      | NATIVE    |
+-------------------------------+------------------------+------------+------------+---------------------+----------------------------+---------------+---------+---------------------+-----------------+-------+---------+----------------+---------------+-----------+
```

---
title: SHOW AUTHENTICATION POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-authentication-policies.md
section: SQL Commands
---

# SHOW AUTHENTICATION POLICIES

Lists [authentication policy](../../user-guide/authentication-policies.md) information, including the creation date, database and
schema names, owner, and any available comments.

See also:
:   [CREATE AUTHENTICATION POLICY](create-authentication-policy.md), [ALTER AUTHENTICATION POLICY](alter-authentication-policy.md), [DESCRIBE AUTHENTICATION POLICY](desc-authentication-policy.md), [DROP AUTHENTICATION POLICY](drop-authentication-policy.md)

## Syntax

```sqlsyntax
SHOW AUTHENTICATION POLICIES
  [ LIKE '<pattern>' ]
  [ IN
       {
         ACCOUNT                                         |

         DATABASE                                        |
         DATABASE <database_name>                        |

         SCHEMA                                          |
         SCHEMA <schema_name>                            |

         APPLICATION <application_name>                  |
         APPLICATION PACKAGE <application_package_name>  |
       }
    |
    ON
       {
         ACCOUNT           |
         USER <user_name>  |
       }
  ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`[ ON ... ]`
:   Lists the policies that are effective on the specified object. This command considers precedence.
    For example, listing policies on a user will show the account or built-in policy that is effective
    for the user if there is no policy set specifically on the user. Specify one of the following:

    `ACCOUNT`
    :   Returns policies effective on the account.

    `USER user_name`
    :   Returns policies effective on the specified user.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY AUTHENTICATION POLICY | Account | Only the SECURITYADMIN role, or a higher role, has this privilege by default. The privilege can be granted to additional roles as needed. |
| OWNERSHIP | Authentication policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the authentication policy was created. |
| `name` | Name of the authentication policy. |
| `database_name` | Name of the database where the authentication policy was created. NULL for the built-in system policy. |
| `schema_name` | Name of the schema where the authentication policy was created. NULL for the built-in system policy. |
| `kind` | `AUTHENTICATION_POLICY` |
| `owner` | Role that owns the authentication policy. |
| `comment` | Comment that was defined for the authentication policy when it was created or altered. |
| `owner_role_type` | Role type of the owner. |
| `options` | For SHOW AUTHENTICATION POLICIES ON, details about how the policy is set. |
| `set_on` | For SHOW AUTHENTICATION POLICIES ON, the object type where the policy is set: USER, ACCOUNT, or SYSTEM. |

## Examples

The following example returns one authentication policy, which belongs to the current database and schema:

```sqlexample
SHOW AUTHENTICATION POLICIES;
```

```output
+-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------+
| created_on                    | name                         | database_name | schema_name    | kind                  | owner        | comment                                                       | owner_role_type | options |
|-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------|
| 2025-09-10 16:38:57.530 -0700 | RESTRICT_CLIENT_TYPES_POLICY | CLIENTS_DB    | CLIENTS_SCHEMA | AUTHENTICATION_POLICY | CLIENTS_ROLE | Auth policy that only allows access through the web interface | ROLE            |         |
+-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------+
```

The following example returns all of the authentication policies in the account:

```sqlexample
SHOW AUTHENTICATION POLICIES IN ACCOUNT;
```

```output
+-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------+
| created_on                    | name                         | database_name | schema_name    | kind                  | owner        | comment                                                       | owner_role_type | options |
|-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------|
| 2025-09-10 16:38:57.530 -0700 | RESTRICT_CLIENT_TYPES_POLICY | CLIENTS_DB    | CLIENTS_SCHEMA | AUTHENTICATION_POLICY | CLIENTS_ROLE | Auth policy that only allows access through the web interface | ROLE            |         |
| 2025-06-25 13:37:11.092 -0700 | MULTIPLE_AUTH_MODES          | POLICY1_DB    | POLICY1_SCHEMA | AUTHENTICATION_POLICY | POLICY1_ROLE |                                                               | ROLE            |         |
+-------------------------------+------------------------------+---------------+----------------+-----------------------+--------------+---------------------------------------------------------------+-----------------+---------+
```

---
title: SHOW AVAILABLE LISTINGS
source: https://docs.snowflake.com/en/sql-reference/sql/show-available-listings.md
section: SQL Commands
---

# SHOW AVAILABLE LISTINGS

Lists the listings that are available to the user who runs the command.
For more information, see [Listing availability options](https://other-docs.snowflake.com/collaboration/collaboration-listings-about#label-listing-availability).

See also:
:   [CREATE LISTING](create-listing.md), [CREATE APPLICATION](create-application.md), [ALTER LISTING](alter-listing.md), [DESCRIBE LISTING](desc-listing.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
SHOW AVAILABLE LISTINGS

SHOW [ TERSE ] AVAILABLE LISTINGS
    [ LIMIT <rows> ]
    [ IS_IMPORTED = TRUE ]
    [ IS_ORGANIZATION = TRUE ]
    [ IS_SHARED_WITH_ME = TRUE ]
```

## Parameters

`LIMIT rows`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

`TERSE`
:   Optionally returns output containing only the following columns:

    * `created_on`
    * `global_name`
    * `profile`
    * `title`

`IS_SHARED_WITH_ME = TRUE`
:   Optional, shows only listings shared privately with the current user.

    > | Property value | Behavior |
    > | --- | --- |
    > | Not set | All listings are returned. |
    > | TRUE | Only listings shared privately with the current user are returned. |

`IS_IMPORTED = TRUE`
:   Optional, shows only imported listings, but filters returned results according to:

    | Property value | Behavior |
    | --- | --- |
    | Not set | All listings are returned. |
    | TRUE | Only imported listings are returned. |

`IS_ORGANIZATION = TRUE`
:   Optional, shows only organization level listings.

    | Property value | Behavior |
    | --- | --- |
    | Not set | All listings are returned. |
    | TRUE | Shows organization level listings. |

## Usage notes

Only one of filters `IS_IMPORTED`, `IS_ORGANIZATION`, or `IS_SHARED_WITH_ME` may be specified at a time.

---
title: SHOW AVAILABLE OFFERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-available-offers.md
section: SQL Commands
---

# SHOW AVAILABLE OFFERS

Lists the [offers](../../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md) that are available to the user who runs the command.

## Syntax

```sqlsyntax
SHOW AVAILABLE OFFERS [ LIKE '<pattern>' ] IN LISTING <listing>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN LISTING listing`
:   The listing associated with the offer you want shown.

## Output

The command output provides offer properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | The offer name. |
| `state` | Offer status, one of:   * DRAFT * PUBLISHED * WITHDRAWN |
| `state_updated_on` | The date and time the offer state was last updated. |
| `access_start_date_preference` | The preferred date for consumer listing access, one of:   * OFFER_ACCEPTED_DATE * SPECIFIC_DATE |
| `contract_value` | The total contract value. |
| `contract_type` | The contract type, one of:   * SUBSCRIPTION * LIMITED_TIME * PAY_AS_YOU_GO |
| `contract_duration_months` | The contract duration in months. |
| `invoice_start_date_preference` | The preferred invoicing start date, one of:   * OFFER_ACCEPTED_DATE * SPECIFIC_DATE * FIRST_DAY_NEXT_MONTH |
| `invoice_start_time` | The date and time invoicing started. |
| `is_default` | Specifies a default offer is included with the pricing plan, one of:   * TRUE * FALSE (default) |
| `display_name` | The offer name visible to consumers. |
| `expiration_time` | The date and time the offer expires. |
| `payment_terms` | Additional pricing plan parameters, one of:   * PAYMENT_TYPE * INSTALLMENT_SCHEDULE * ALLOWED_PAYMENT_METHODS |
| `access_end_time` | The date and time consumers lose access to the listing. |
| `access_start_time` | The date and time consumers can access the listing. |
| `discount` | The offer discount. |
| `target_consumer` | The consumer the offer targets. |
| `terms_of_service` | The terms of service associated with the offer. |
| `additional_information` | Additional offer information. |
| `pricing_plan` | The pricing plan associated with the offer. |
| `updated_on` | The date and time the offer was last updated. |

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| PURCHASE DATA EXCHANGE LISTING | Global | This privilege grants the ability to purchase a paid listing. If you don’t have a role with this privilege, contact your account administrator. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all available offers with names that start with `myoffer` in `mylisting`:

```sqlexample
SHOW AVAILABLE OFFERS LIKE 'MYOFFER%' IN LISTING MYLISTING;
```

---
title: SHOW AVAILABLE ORGANIZATION PROFILES
source: https://docs.snowflake.com/en/sql-reference/sql/show-available-organization-profiles.md
section: SQL Commands
---

# SHOW AVAILABLE ORGANIZATION PROFILES

Lists the organization profiles available in the user’s organization.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW ORGANIZATION PROFILES](show-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
SHOW AVAILABLE ORGANIZATION PROFILES
```

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The date and time when the organization profile was created. |
| `name` | The name of the organization profile. |
| `system_generated` | Indicates the organization profile is system generated. |
| `state` | The organization profile state. One of ACTIVE or DRAFT. |
| `organization_name` | The name of the organization associated with the organization profile. |
| `title` | The title of the organization profile. |
| `description` | The description of the organization profile. |
| `owner_contact` | The contact email of the owner of the organization profile. |
| `approver_contact` | The contact email of the access approver of the organization profile. |
| `can_publish_listings_with_profile` | Whether the current user can publish organizational listings using this organization profile. One of `TRUE` or `FALSE`. |

## Examples

The following example lists the organization profiles that you have the privileges to access:

```sqlexample
SHOW AVAILABLE ORGANIZATION PROFILES;
```

```output
+-------------------------+-------------+---------------------+---------------------+---------------------+------------------------+---------------------------------+---------------------+---------------------+-----------------------------------+
|created_on               |name         |system_generated     |state                |organization_name    |title                   |description                      |owner_contact        |approver_contact     |can_publish_listings_with_profile  |
+-------------------------+-------------+---------------------+---------------------+---------------------+------------------------+---------------------------------+---------------------+---------------------+-----------------------------------+
|2025-01-01 01:01:01.000  |ORGPROFILE   |FALSE                |ACTIVE               |TESTORG              |My Organization Profile |Organization profile description |test@test.com        |test@test.com        |TRUE                               |
+-------------------------+-------------+---------------------+---------------------+---------------------+------------------------+---------------------------------+---------------------+---------------------+-----------------------------------+
```

---
title: SHOW BACKUP POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-backup-policies.md
section: SQL Commands
---

# SHOW BACKUP POLICIES

Lists all the [backup](../../user-guide/backups.md) policies in your account for which you have access privileges.

See also:
:   [CREATE BACKUP POLICY](create-backup-policy.md),
    [ALTER BACKUP POLICY](alter-backup-policy.md),
    [DROP BACKUP POLICY](drop-backup-policy.md)

## Syntax

```sqlsyntax
SHOW BACKUP POLICIES
   [ LIKE '<pattern>' ]
   [ IN { ACCOUNT | DATABASE | DATABASE <db_name> | SCHEMA | SCHEMA <schema_name> }
     [ STARTS WITH '<name_string>' ]
     [ LIMIT <rows> ]
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    `STARTS WITH 'name_string'`
    :   Optionally filters the command output based on the characters that appear at the beginning of
        the object name. The string must be enclosed in single quotes and is case sensitive.

        For example, the following strings return different results:

        `... STARTS WITH 'B' ...`

        `... STARTS WITH 'b' ...`

        . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output)

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

To determine whether a backup policy is associated with any backup sets, use the SHOW BACKUP SETS command.

> **Note:**
>
> The backup policy is an object that’s inside a specific schema and database. Therefore, the policy
> gets replicated, dropped or undropped, and so on, when those operations are performed on the schema and database
> that contain it. If you can’t drop the backup policy because it’s associated with any backup sets,
> then you also can’t drop the schema or database containing the policy.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp backup policy was created. |
| `name` | Name of backup policy. |
| `database_name` | Name of database that contains the backup policy. |
| `schema_name` | Name of schema that contains the backup policy. |
| `owner` | Name of the role with the OWNERSHIP privilege on the backup policy. |
| `comment` | Comment for backup policy. |
| `schedule` | Schedule for backup creation. |
| `expire_after_days` | Number of days after backup creation when backup expires. |
| `has_retention_lock` | Indicates whether the policy includes a retention lock.  `Y` if policy has retention lock; `N` otherwise.  For more information, see [Retention lock](../../user-guide/backups.md). |
| `owner` | Name of the role with the OWNERSHIP privilege on the backup set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the backup policy. |

## Examples

List all backup policies you have privileges for in the current account:

```sqlexample
SHOW BACKUP POLICIES IN ACCOUNT;
```

---
title: SHOW BACKUP SETS
source: https://docs.snowflake.com/en/sql-reference/sql/show-backup-sets.md
section: SQL Commands
---

# SHOW BACKUP SETS

Lists all the [backup](../../user-guide/backups.md) sets for which you have access privileges.
The scope of this command can be your entire account, or a specified database or schema.

See also:
:   [CREATE BACKUP SET](create-backup-set.md),
    [ALTER BACKUP SET](alter-backup-set.md),
    [DROP BACKUP SET](drop-backup-set.md)

## Syntax

```sqlsyntax
SHOW BACKUP SETS
   [ LIKE '<pattern>' ]
   [ IN { ACCOUNT | DATABASE | DATABASE <db_name> | SCHEMA | SCHEMA <schema_name> }
     [ STARTS WITH '<name_string>' ]
     [ LIMIT <rows> ]
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    `STARTS WITH 'name_string'`
    :   Optionally filters the command output based on the characters that appear at the beginning of
        the object name. The string must be enclosed in single quotes and is case sensitive.

        For example, the following strings return different results:

        `... STARTS WITH 'B' ...`

        `... STARTS WITH 'b' ...`

        . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output)

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp that the backup set was created. |
| `name` | Name of the backup set. |
| `database_name` | Name of the database that contains the backup set. |
| `schema_name` | Name of the schema that contains the backup set. |
| `object_kind` | Type of the object that the backup set backs up. |
| `object_name` | Name of the object that the backup set backs up. |
| `object_database_name` | Name of the database that contains the object that is backed up by this backup set. |
| `object_schema_name` | Name of the schema that contains the object that is backed up by this backup set. |
| `backup_policy_name` | Name of the backup policy attached to this backup set. |
| `backup_policy_database_name` | Name of the database that contains the backup policy. |
| `backup_policy_schema_name` | Name of the schema that contains the backup policy. |
| `backup_policy_state` | Current state of the backup policy. |
| `owner_role` | Name of the role with the OWNERSHIP privilege on the backup set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the backup set. |
| `comment` | Comment for backup set. |

## Examples

List all backup sets that you have privileges for in the current account:

```sqlexample
SHOW BACKUP SETS IN ACCOUNT;
```

List backup sets that include `T1` in the name:

```sqlexample
SHOW BACKUP SETS LIKE '%T1%';
```

---
title: SHOW BACKUPS IN BACKUP SET
source: https://docs.snowflake.com/en/sql-reference/sql/show-backups-in-backup-set.md
section: SQL Commands
---

# SHOW BACKUPS IN BACKUP SET

Lists all the [backups](../../user-guide/backups.md) in a backup set.

See also:
:   [CREATE BACKUP SET](create-backup-set.md),
    [ALTER BACKUP SET](alter-backup-set.md),
    [SHOW BACKUP SETS](show-backup-sets.md)

## Syntax

```sqlsyntax
SHOW BACKUPS IN BACKUP SET <name>
  [ LIMIT <rows> ]
```

## Parameters

`name`
:   Specifies the identifier for the backup set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`LIMIT rows`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output)

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| OWNERSHIP | You must have the OWNERSHIP privilege on the backup set to see the backups that it contains. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp when the backup is created. |
| `backup_id` | Snowflake-generated identifier of the backup. The backup ID is a UUID value, in the format returned by the [UUID_STRING](../functions/uuid_string.md) function. |
| `backup_set_name` | Name of backup set that contains the backup. |
| `database_name` | Name of database that contains the backup set. |
| `schema_name` | Name of schema that contains the backup set. |
| `expire_on` | Timestamp when the backup expires. |

## Examples

List all backups in backup set `t1_backups`:

```sqlexample
SHOW BACKUPS IN BACKUP SET t1_backups;
```

Show the creation date and backup ID for the oldest backup in backup set `t1_backups`:

```sqlexample
SHOW BACKUPS IN BACKUP SET t1_backups ->>
  SELECT "created_on", "backup_id" FROM $1
    ORDER BY "created_on" LIMIT 1;
```

Show the backup ID and the date and time when the final backup in backup set `t1_backups` will expire.
This example presumes that the backup policy doesn’t include a schedule, or the backup policy is suspended
for the backup set, so that no new backups are being added to the backup set. You’re just waiting for
all the existing backups to expire so that you can drop the backup set.

```sqlexample
SHOW BACKUPS IN BACKUP SET t1_backups ->>
  SELECT "expire_on", "backup_id" FROM $1
    ORDER BY "expire_on" DESC LIMIT 1;
```

---
title: SHOW CALLER GRANTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-caller-grants.md
section: SQL Commands
---

# SHOW CALLER GRANTS

Lists the [caller grants](../../developer-guide/restricted-callers-rights.md) being used to implement restricted caller’s rights.

## Syntax

```sqlsyntax
SHOW CALLER GRANTS
{
{ ON <object_type> <object_name> | ON ACCOUNT }
| TO { ROLE | DATABASE ROLE }  <owner_name>
}
```

## Parameters

`ON object_type object_name` or . `ON ACCOUNT`
:   Specifies whether to list the caller grants on a specific object or list all caller grants involving the account.

    Use the singular form of `object_type`, for example, `TABLE` or `WAREHOUSE`.

`TO ROLE <owner_name>` or . `TO DATABASE ROLE <owner_name>`
:   Specifies an executable owner, which lists all caller grants that have been granted to that owner.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the caller grant was granted. |
| `privilege` | Privilege that executables owned by `grantee_name` can run with. |
| `granted_on` | Type of object that is subject to the caller grant, regardless of whether it was granted directly on an object or on all objects of that type. |
| `name` | If the caller grant was granted directly on a specific object, specifies the name of the object. |
| `is_inherited` | If `TRUE`, the caller grant was granted to all objects of a certain type using a GRANT INHERITED CALLER or GRANT ALL INHERITED CALLER PRIVILEGES statement.  If `FALSE`, the caller grant was granted directly on the `name` object. |
| `inherited_from` | If the caller grant was granted to all objects of a certain type using a GRANT INHERITED CALLER or GRANT ALL INHERITED CALLER PRIVILEGES statement, indicates the level at which it was granted. One of `ACCOUNT`, `DATABASE`, or `SCHEMA`. |
| `inherited_from_database` | If `inherited_from` is a database (including an application or application package), specifies the name of the database. If `inherited_from` is a schema, specifies the name of the database that contains the schema. |
| `inherited_from_schema` | If `inherited_from` is a schema, specifies the name of the schema. |
| `granted_to` | Type of executable owner to which the caller grant was granted. One of `ROLE` or `DATABASE ROLE`. |
| `grantee_name` | Name of the executable owner to which the caller grant was granted. |

## Access control requirements

Anyone can execute a SHOW CALLER GRANTS TO … command to list caller grants that have been granted to a specific executable owner.

Executing a SHOW CALLER GRANTS ON … command requires the following privilege:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any privilege | Specified object | You need at least one privilege on the object specified in the SHOW CALLER GRANTS command. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* When executing a SHOW CALLER GRANTS ON … statement, different rows of the output can indicate different things. For example, one row
  could indicate a caller grant was granted directly on an object while another row indicates that the object was specified with an IN clause
  in the GRANT statement. For more information, see [List caller grants](../../developer-guide/restricted-callers-rights.md).
* When a user executes SHOW CALLER GRANTS, the results only contain objects to which they have at least one privilege. For more information,
  see [Conditional output](../../developer-guide/restricted-callers-rights.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List caller grants that have been granted on the table `t1`.

> ```sqlexample
> SHOW CALLER GRANTS ON TABLE t1;
> ```

List all of the caller grants that have been granted for the current account. This includes grants directly on the account
(GRANT CALLER … ON ACCOUNT) and grants to all objects in an account (GRANT INHERITED CALLER … IN ACCOUNT).

> ```sqlexample
> SHOW CALLER GRANTS ON ACCOUNT;
> ```

List all of the caller grants that have been granted to the database role `db.owner_role`.

> ```sqlexample
> SHOW CALLER GRANTS TO DATABASE ROLE db.owner_role;
> ```

---
title: SHOW CATALOG INTEGRATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-catalog-integrations.md
section: SQL Commands
---

# SHOW CATALOG INTEGRATIONS

Lists the [catalog integrations](../../user-guide/tables-iceberg.md) in your account.
The output returns integration metadata and properties.

See also:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md) , [ALTER CATALOG INTEGRATION](alter-catalog-integration.md) , [DROP CATALOG INTEGRATION](drop-catalog-integration.md) , [DESCRIBE CATALOG INTEGRATION](desc-catalog-integration.md)

## Syntax

```sqlsyntax
SHOW CATALOG INTEGRATIONS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration |  |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the catalog integration. |
| `enabled` | Specifies whether the catalog integration is available to use for Apache Iceberg™ tables. |
| `type` | Type of the integration. The value is always CATALOG. |
| `category` | Category of the integration. The value is always CATALOG. |
| `comment` | String (literal) that specifies a comment for the integration. |
| `created_on` | Date and time when the catalog integration was created. |

For more information about the properties that can be specified for a catalog integration, see [CREATE CATALOG INTEGRATION](create-catalog-integration.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all catalog integrations:

```sqlexample
SHOW CATALOG INTEGRATIONS;
```

Show all the catalog integrations whose name starts with `demo` that you have privileges to view:

```sqlexample
SHOW CATALOG INTEGRATIONS LIKE 'demo%';
```

---
title: SHOW CHANNELS
source: https://docs.snowflake.com/en/sql-reference/sql/show-channels.md
section: SQL Commands
---

# SHOW CHANNELS

Lists the [Snowpipe Streaming channels](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) for which you have access privileges. This command can be used to list the channels for a specified table, database or schema
(or the current database/schema for the session), or your entire account.

See also:

> * [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md)
> * [SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN](../functions/system_snowpipe_streaming_update_channel_offset_token.md)

## Syntax

```sqlsyntax
SHOW CHANNELS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>            |

                  TABLE                    |
                  TABLE <table_name>       |

                  PIPE                     |
                  PIPE <pipe_name>
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `TABLE`, . `TABLE table_name`
    :   Returns records for the table.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides pipe properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the channel is created. |
| name | Name of the channel. |
| database_name | The name of the database where the Snowpipe Streaming channel is logically stored.  Named channel: The name of the database where the user-defined channel is created.  Default pipe (high-performance): The name of the target table’s database. |
| schema_name | The name of the schema where the Snowpipe Streaming channel is logically stored.  Named channel: The name of the schema where the user-defined channel is created.  Default pipe (high-performance): The name of the target table’s schema. |
| table_name | The name of the Snowflake table that the channel is mapped to for data ingestion.  For Snowpipe Streaming classic channels: The column shows the name of the target table, establishing the channel’s mapping.  For default pipes (High-Performance): The column is populated with the name of the target table, providing context for the default pipe’s destination. |
| client_sequencer | For internal use. |
| row_sequencer | For internal use. |
| offset_token | String used to track the ingestion process. |
| parent_name | Table or pipe where the channel is mapped to. |
| parent_domain | Domain (table or pipe) where the channel is mapped to. |

## Examples

Show all the channels that you have privileges to view in the `public` schema in the `mydb` database:

> ```sqlexample
> use database mydb;
>
> show channels;
>
> +-------------------------------+-----------+---------------+------------------+------------------------+------------------+---------------+--------------+
> | created_on                    | name      | database_name | schema_name      | table_name             | client_sequencer | row_sequencer | offset_token |
> |-------------------------------+-----------+---------------+------------------+------------------------+------------------+---------------+--------------+
> | 2023-05-05 17:13:17.579 -0700 | CHANNEL8  | TEST_DB1      | STREAMING_INGEST | STREAMING_INGEST_TABLE | 7                | 1             | 0            |
> |                               |           |               |                  |                        |                  |               |              |
> +-------------------------------+-----------+---------------+------------------+------------------------+------------------+---------------+--------------+
> ```

Show all the channels for a specific pipe:

> ```sqlexample
> show channels in pipe MY_PIPE;
> ```

---
title: SHOW CLASSES
source: https://docs.snowflake.com/en/sql-reference/sql/show-classes.md
section: SQL Commands
---

# SHOW CLASSES

Lists all available classes.

See also:
:   [SHOW FUNCTIONS](show-functions.md) , [SHOW PROCEDURES](show-procedures.md) , [SHOW ROLES](show-roles.md)

    [Snowflake classes](../snowflake-db-classes.md)

## Syntax

```sqlsyntax
SHOW CLASSES [ LIKE '<pattern>' ]
             [ IN DATABASE [ <db_name> ] ]
             [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN DATABASE db_name`
:   Specifies the scope of the command, which determines whether the command lists records only for the current/specified database or
    across your entire account.

    The `DATABASE` keyword is not required; you can set the scope by specifying only the database name. Likewise, the database name
    is not required if the session currently has a database in use.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The `owner` and `owner_role_type` columns don’t return a value.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all classes in the Snowflake database:

```sqlexample
SHOW CLASSES IN DATABASE SNOWFLAKE;

+-------------------------------+-----------------------+---------------+-------------+---------+---------+-------+------------------+-----------------+
| created_on                    | name                  | database_name | schema_name | version | comment | owner | is_service_class | owner_role_type |
|-------------------------------+-----------------------+---------------+-------------+---------+---------+-------|------------------|-----------------+
| 2023-04-17 11:48:31.222 -0700 | ANOMALY_DETECTION     | SNOWFLAKE     | ML          | NULL    | NULL    |       | false            |                 |
| 2023-05-26 10:01:24.852 -0700 | FORECAST              | SNOWFLAKE     | ML          | NULL    | NULL    |       | false            |                 |
+-------------------------------+-----------------------+---------------+-------------+---------+---------+-------+------------------+-----------------+
```

---
title: SHOW COLUMNS
source: https://docs.snowflake.com/en/sql-reference/sql/show-columns.md
section: SQL Commands
---

# SHOW COLUMNS

Lists the columns in the tables or views and the dimensions, facts, and metrics in the
[semantic views](../../user-guide/views-semantic/overview.md) for which you have access privileges. This command can be used to list
the columns, dimensions, facts, and metrics for the following objects:

* The specified table or view.
* All tables and views in the specified schema or in the schema that is currently in use.
* All tables and views in the specified database or in the database that is currently in use.
* All tables and views in your account.

See also:
:   [DESCRIBE TABLE](desc-table.md)

    [COLUMNS view](../info-schema/columns.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW COLUMNS [ LIKE '<pattern>' ]
             [ IN { ACCOUNT | DATABASE [ <database_name> ] | SCHEMA [ <schema_name> ] | TABLE | [ TABLE ] <table_name> | VIEW | [ VIEW ] <view_name> } | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> ]
```

## Parameters

`LIKE '<pattern>'`
:   Filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL wildcard
    characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

`IN { ACCOUNT | DATABASE [ <database_name> ] | SCHEMA [ <schema_name> ] | TABLE | [ TABLE ] <table_name> | VIEW | [ VIEW ] <view_name> | APPLICATION <application_name>  | APPLICATION PACKAGE <application_package_name> }`
:   Specifies the scope of the command, which determines whether the command lists records only for the current/specified database,
    schema, table, or view, or across your entire account:

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify the keyword `TABLE` without a `table_name`, then:

    * If there is a current database, then:

      + If there is a current schema, then the command retrieves records for the current schema in the current database.
      + If there is no current schema, then the command retrieves records for all schemas in the current database.
    * If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify a `<table_name>` (with or without the keyword `TABLE`), then:

    * If you specify a fully-qualified `<table_name>` (e.g. `my_database_name.my_schema_name.my_table_name`),
      then the command retrieves all records for the specified table.
    * If you specify a schema-qualified `<table_name>` (e.g. `my_schema_name.my_table_name`), then:

      + If a current database exists, then the command retrieves all records for the specified table.
      + If no current database exists, then the command displays an error similar to
        `Cannot perform SHOW <object_type>. This session does not have a current database...`.
    * If you specify an unqualified `<table_name>`, then:

      + If a current database and current schema exist, then the command retrieves records for the specified table in the current
        schema of the current database.
      + If no current database exists or no current schema exists, then the command displays an error similar to:
        `SQL compilation error: <object> does not exist or not authorized.`.

    If you specify the `VIEW` keyword or a view name, the rules for views parallel the rules for tables.

    If you specify the `APPLICATION` or `APPLICATION PACKAGE` keywords, records for the specified Snowflake Native App Framework application or
    application package are returned.

    Default: Depends on whether the session currently has a database in use:

    > * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    > * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

## Usage notes

* You can use the `VIEW` keyword and specify a view name for standard views, materialized views, and semantic views.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

> **Note:**
>
> The column names in the output table for the SHOW COLUMNS command are lowercase (that is,
> `table_name`, `schema_name`, `column_name`, and so on). However, the values in
> the `column_name` column reflect the column name that is stored. For example, if a column name
> is added without being enclosed in double quotes using the `ALTER TABLE ... ADD COLUMN MYCOLUMN`
> statement, the column name is stored in uppercase and appears as `MYCOLUMN` in the `column_name`
> column.

## Output

The command output provides column properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `table_name` | Name of the table or view that the column, dimension, fact, or metric belongs to. |
| `schema_name` | Schema for the table. |
| `column_name` | Name of the column, dimension, fact, or metric. |
| `data_type` | JSON object containing the data type and applicable properties of the column, dimension, fact, or metric.  The `type` key-value pair specifies the data type of the column, dimension, fact, or metric.  For [string](../data-types-text.md) and [numeric](../data-types-numeric.md) data types, `type` specifies one of the following values:   * `TEXT` for all string types. * `FIXED` for all fixed-point numeric types. * `REAL` for all floating-point numeric types.   The other key-value pairs describe the properties that are applicable to the particular data type. For example:   * If `type` is `TEXT` or `BINARY`, the additional key-value pairs can include `length`, `byteLength`,   `nullable`, and `fixed`. * If `type` is `FIXED`, `TIME`, `TIMESTAMP_NTZ`, `TIMESTAMP_LTZ`, or `TIMESTAMP_TZ`, the additional key-value   pairs can include `precision`, `scale`, and `nullable`. * If `type` is `REAL`, `DATE`, or `BOOLEAN`, the additional key-value pairs can include `nullable`. |
| `null?` | Whether the column can contain NULL values. |
| `default` | Default value, if any, defined for the column. |
| `kind` | One of the following values:   * `COLUMN` for columns in tables, views, and materialized views. * `DIMENSION` for dimensions in [semantic views](../../user-guide/views-semantic/overview.md). * `FACT` for facts in semantic views. * `METRIC` for metrics in semantic views. |
| `expression` |  |
| `comment` | Comment, if any, for the column, dimension, fact, or metric. |
| `database_name` | Database for the table. |
| `autoincrement` | Auto-increment start and increment values, if any, for the column. If the column has the NOORDER property, the value includes `NOORDER` (for example, `IDENTITY START 1 INCREMENT 1 NOORDER`). Otherwise, the value includes `ORDER`. |
| `schema_evolution_record` | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY, SNOWPIPE, or SNOWPIPE_STREAMING). * FileName: The file name that triggered the evolution (NULL for SNOWPIPE_STREAMING). * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeId: A unique identifier of the triggering query or pipe (QUERY ID for COPY, PIPE ID for SNOWPIPE, or NULL for SNOWPIPE_STREAMING). * Pipe name: Fully qualified pipe name that triggered schema evolution (SNOWPIPE_STREAMING only). * Channel name: Channel that triggered schema evolution (SNOWPIPE_STREAMING only). * offsetTokenUpperBound: An offset at or before which schema evolution was triggered (SNOWPIPE_STREAMING only). |

## Examples

The following example creates a table and then runs the SHOW COLUMNS command to list the
columns in the table:

```sqlexample
CREATE OR REPLACE TABLE test_show_columns (
  n1 NUMBER DEFAULT 5,
  n2_int INTEGER DEFAULT n1+5,
  n3_bigint BIGINT AUTOINCREMENT,
  n4_dec DECIMAL IDENTITY (1,10),
  f1 FLOAT,
  f2_double DOUBLE,
  f3_real REAL,
  s1 STRING,
  s2_var VARCHAR,
  s3_char CHAR,
  s4_text TEXT,
  "s5_case_sensitive" VARCHAR,
  b1 BINARY,
  b2_var VARBINARY,
  bool1 BOOLEAN,
  d1 DATE,
  t1 TIME,
  ts1 TIMESTAMP,
  ts2_ltz TIMESTAMP_LTZ,
  ts3_ntz TIMESTAMP_NTZ,
  ts4_tz TIMESTAMP_TZ);

SHOW COLUMNS IN TABLE test_show_columns;
```

```output
+-------------------+----------------+-------------------+---------------------------------------------------------------------------------------+-------+--------------------------+--------+------------+---------+---------------+---------------------------------------+-------------------------+
| table_name        | schema_name    | column_name       | data_type                                                                             | null? | default                  | kind   | expression | comment | database_name | autoincrement                         | schema_evolution_record |
|-------------------+----------------+-------------------+---------------------------------------------------------------------------------------+-------+--------------------------+--------+------------+---------+---------------+---------------------------------------+-------------------------|
| TEST_SHOW_COLUMNS | MY_SCHEMA      | N1                | {"type":"FIXED","precision":38,"scale":0,"nullable":true}                             | true  | 5                        | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | N2_INT            | {"type":"FIXED","precision":38,"scale":0,"nullable":true}                             | true  | TEST_SHOW_COLUMNS.N1 + 5 | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | N3_BIGINT         | {"type":"FIXED","precision":38,"scale":0,"nullable":true}                             | true  |                          | COLUMN |            |         | MY_DB         | IDENTITY START 1 INCREMENT 1 NOORDER  | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | N4_DEC            | {"type":"FIXED","precision":38,"scale":0,"nullable":true}                             | true  |                          | COLUMN |            |         | MY_DB         | IDENTITY START 1 INCREMENT 10 NOORDER | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | F1                | {"type":"REAL","nullable":true}                                                       | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | F2_DOUBLE         | {"type":"REAL","nullable":true}                                                       | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | F3_REAL           | {"type":"REAL","nullable":true}                                                       | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | S1                | {"type":"TEXT","length":16777216,"byteLength":16777216,"nullable":true,"fixed":false} | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | S2_VAR            | {"type":"TEXT","length":16777216,"byteLength":16777216,"nullable":true,"fixed":false} | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | S3_CHAR           | {"type":"TEXT","length":1,"byteLength":4,"nullable":true,"fixed":false}               | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | S4_TEXT           | {"type":"TEXT","length":16777216,"byteLength":16777216,"nullable":true,"fixed":false} | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | s5_case_sensitive | {"type":"TEXT","length":16777216,"byteLength":16777216,"nullable":true,"fixed":false} | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | B1                | {"type":"BINARY","length":8388608,"byteLength":8388608,"nullable":true,"fixed":true}  | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | B2_VAR            | {"type":"BINARY","length":8388608,"byteLength":8388608,"nullable":true,"fixed":false} | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | BOOL1             | {"type":"BOOLEAN","nullable":true}                                                    | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | D1                | {"type":"DATE","nullable":true}                                                       | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | T1                | {"type":"TIME","precision":0,"scale":9,"nullable":true}                               | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | TS1               | {"type":"TIMESTAMP_NTZ","precision":0,"scale":9,"nullable":true}                      | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | TS2_LTZ           | {"type":"TIMESTAMP_LTZ","precision":0,"scale":9,"nullable":true}                      | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | TS3_NTZ           | {"type":"TIMESTAMP_NTZ","precision":0,"scale":9,"nullable":true}                      | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
| TEST_SHOW_COLUMNS | MY_SCHEMA      | TS4_TZ            | {"type":"TIMESTAMP_TZ","precision":0,"scale":9,"nullable":true}                       | true  |                          | COLUMN |            |         | MY_DB         |                                       | NULL                    |
+-------------------+----------------+-------------------+---------------------------------------------------------------------------------------+-------+--------------------------+--------+------------+---------+---------------+---------------------------------------+-------------------------+
```

---
title: SHOW COMPUTE POOL INSTANCE FAMILIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-compute-pool-instance-families.md
section: SQL Commands
---

# SHOW COMPUTE POOL INSTANCE FAMILIES

Lists the available [compute pool instance families](../../developer-guide/snowpark-container-services/working-with-compute-pool.md)
that you can use to create a compute pool.

See also:
:   [CREATE COMPUTE POOL](create-compute-pool.md) , [ALTER COMPUTE POOL](alter-compute-pool.md)

## Syntax

```sqlsyntax
SHOW COMPUTE POOL INSTANCE FAMILIES
```

## Output

The command output provides compute pool instance family properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Instance family name. |
| `description` | Instance family description. |
| `vcpu` | Number of vCPUs that are accessible to the user. |
| `memory_gib` | Memory in GiB that is accessible to the user. |
| `storage_gib` | Storage in GiB that is accessible to the user. |
| `gpu` | Name of the GPU if applicable, else an empty string. |
| `gpu_count` | Count of GPUs if applicable, else 0. |
| `gpu_memory_gib` | GPU Memory available per GPU if applicable, else 0. |
| `current_node_usage` | Number of nodes of this type currently in use by your Snowflake account. |
| `message` | Additional information about the instance family. |

## Examples

The following command lists the compute pool instance families:

```sqlexample
SHOW COMPUTE POOL INSTANCE FAMILIES;
```

---
title: SHOW COMPUTE POOLS
source: https://docs.snowflake.com/en/sql-reference/sql/show-compute-pools.md
section: SQL Commands
---

# SHOW COMPUTE POOLS

Lists the [compute pools](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) in your account for which you have access privileges.

See also:
:   [CREATE COMPUTE POOL](create-compute-pool.md) , [ALTER COMPUTE POOL](alter-compute-pool.md), [DESCRIBE COMPUTE POOL](desc-compute-pool.md) , [DROP COMPUTE POOL](drop-compute-pool.md)

## Syntax

```sqlsyntax
SHOW COMPUTE POOLS [ LIKE '<pattern>' ]
                   [ STARTS WITH '<name_string>' ]
                   [ LIMIT <ROWS> [ FROM '<name-string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, MONITOR, or OPERATE | Compute pool |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides compute pool properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Compute pool name. |
| `state` | State of the compute pool. For more information, see [Compute pool lifecycle](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| `min_nodes` | Minimum number of nodes in the compute pool. |
| `max_nodes` | Maximum number of nodes in the compute pool. |
| `instance_family` | Machine type of nodes in the compute pool. |
| `num_services` | Number of services running on the compute pool. |
| `num_jobs` | Number of jobs running on the compute pool. |
| `auto_suspend_secs` | Number of seconds of inactivity after which Snowflake automatically suspends the compute pool. |
| `auto_resume` | Whether to automatically resume a compute pool when Snowflake starts a service or job. |
| `active_nodes` | Number of nodes in the compute pool that are active (one or more services or jobs are running). |
| `idle_nodes` | Number of nodes in the compute pool that are idle (no service or job is running). |
| `target_nodes` | Indicates the number of nodes that Snowflake is targeting for your compute pool. If `active_nodes` isn’t equal to the `target_nodes`, Snowflake autoscales the cluster to add or remove the nodes. For more information, see [About the target_nodes compute pool property](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| `placement_group` | Specifies the fault domain into which the compute pool nodes are placed. A fault domain is similar to the cloud provider’s availability zone. For more information, see [Compute pool placement](../../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| `created_on` | Date and time when the compute pool was created. |
| `resumed_on` | Date and time when the suspended compute pool was resumed. |
| `updated_on` | Date and time when the compute pool was updated using ALTER COMPUTE POOL. |
| `owner` | Role that owns the compute pool. |
| `comment` | Specifies a comment for the compute pool. |
| `is_exclusive` | `true` if the compute pool is created exclusively for a Snowflake Native App; `false` otherwise. |
| `application` | Name of the Snowflake Native App if the compute pool is created exclusively for the app. Otherwise, NULL. |
| `budget` | The name of the [budget](../../user-guide/budgets.md) monitoring the credit usage of the compute pool. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following command lists the compute pools for which you have access privileges in the current account:

```sqlexample
SHOW COMPUTE POOLS;
```

The following command lists one compute pool:

```sqlexample
SHOW COMPUTE POOLS LIMIT 1;
```

The following command lists compute pools with names containing “tu”:

```sqlexample
SHOW COMPUTE POOLS LIKE '%tu%';
```

The following command lists two compute pools with names containing “my_pool”:

```sqlexample
SHOW COMPUTE POOLS LIKE '%my_pool%' LIMIT 2;
```

Sample output:

```output
+-------------------------+-----------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+--------------+-------------+--------+-----------------+
| name                    | state     | min_nodes | max_nodes | instance_family | num_services | num_jobs | auto_suspend_secs | auto_resume | active_nodes | idle_nodes | target_nodes | created_on                    | resumed_on                    | updated_on                    | owner        | comment | is_exclusive | application | budget | placement_group |
|-------------------------+-----------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+--------------+-------------+--------|-----------------|
| TUTORIAL_COMPUTE_POOL   | ACTIVE    |         1 |         1 | CPU_X64_XS      |            3 |        0 |              3600 | true        |            1 |          0 |            1 | 2024-02-24 20:41:31.978 -0800 | 2024-08-08 11:27:01.775 -0700 | 2024-08-18 13:47:08.150 -0700 | TEST_ROLE    | NULL    | false        | NULL        | NULL   |      A          |
| TUTORIAL_COMPUTE_POOL_2 | SUSPENDED |         1 |         1 | CPU_X64_XS      |            0 |        0 |              3600 | true        |            0 |          0 |            0 | 2024-01-15 21:23:09.744 -0800 | 2024-04-06 15:24:50.541 -0700 | 2024-08-18 13:46:08.110 -0700 | ACCOUNTADMIN | NULL    | false        | NULL        | NULL   |      NULL       |
+-------------------------+-----------+-----------+-----------+-----------------+--------------+----------+-------------------+-------------+--------------+------------+--------------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+--------------+-------------+--------+-----------------+
```

---
title: SHOW CONFIGURATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-configurations.md
section: SQL Commands
---

# SHOW CONFIGURATIONS

Lists the [configurations](../../developer-guide/native-apps/inter-app-communication.md) in the specified app for which you have access privileges.

See also:
:   [DESCRIBE CONFIGURATION](desc-configuration.md)

## Syntax

```sqlsyntax
SHOW CONFIGURATIONS [ IN APPLICATION <app> ]
```

## Parameters

`app`
:   The name of the app to show the configurations for. If an app runs this command, the parameter is optional. If this command is run directly using a workspace or the Snowflake CLI, the `app` parameter is required.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | The name of the configuration. |
| `created_on` | The timestamp when the configuration object was created. |
| `updated_on` | The timestamp when the configuration object was last updated. |
| `type` | One of the following values: `APPLICATION_NAME` and `STRING`. |
| `status` | One of the following values: `PENDING`, `DONE` |
| `value` | The value set by the consumer. |
| `value_updated_on` | The timestamp when the value was set or unset. |
| `label` | A user-friendly name to be displayed in the UI. |
| `description` | A description of the configuration. |
| `application_roles` | The app roles that have access to the configuration. This field returns the most up-to-date names of the app roles, but the value may have been updated. If an app role has been dropped, it will not be returned in this field. |

## Usage notes

* When this command is run outside of an app, a configuration will only be returned if the consumer role is granted an application role
  that has access to the configuration. However, if the consumer role has MONITOR or OWNERSHIP privilege on the app, the consumer can
  see all the configurations in that app, regardless of which application roles they have been granted.

---
title: SHOW CONNECTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-connections.md
section: SQL Commands
---

# SHOW CONNECTIONS

Lists the [connections](../../user-guide/client-redirect.md) for which you have access privileges.

The output returns connection metadata and properties, ordered by connection name (see Output in this topic for descriptions of the
output columns). This is important to note if you intend to filter the results using the provided filters.

See also:
:   [CREATE CONNECTION](create-connection.md) , [ALTER CONNECTION](alter-connection.md) , [DROP CONNECTION](drop-connection.md)

## Syntax

```sqlsyntax
SHOW CONNECTIONS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The command output provides connection properties and metadata in the following columns. The command output for organizations that span multiple [region groups](../../user-guide/admin-account-identifier.md) includes an additional
`region_group` column.

| Column | Description |
| --- | --- |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time when the connection was created. |
| `account_name` | Name of the account. An organization administrator can change the account name. |
| `name` | Name of the connection. |
| `comment` | Comment for the connection. |
| `is_primary` | Indicates whether the connection is a primary connection. |
| `primary` | Organization name, account name, and connection name of the primary connection. This value can be copied into the AS REPLICA OF clause of the [CREATE CONNECTION](create-connection.md) command when creating secondary connections. |
| `failover_allowed_to_accounts` | A list of any accounts that the primary connection can redirect to. |
| `connection_url` | Connection URL that users pass to a client to establish a connection to Snowflake. |
| `organization_name` | Name of your Snowflake organization. |
| `account_locator` | Account locator in a region. |

For more information about the properties that can be specified for a connection, see [CREATE CONNECTION](create-connection.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all the connections whose name starts with `test`:

> ```sqlexample
> SHOW CONNECTIONS LIKE 'test%';
> ```

---
title: SHOW CONTACTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-contacts.md
section: SQL Commands
---

# SHOW CONTACTS

Lists the [contacts](../../user-guide/contacts-using.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE CONTACT](create-contact.md) , [ALTER CONTACT](alter-contact.md) , [DROP CONTACT](drop-contact.md)

## Syntax

```sqlsyntax
SHOW CONTACTS [ LIKE '<pattern>' ]
          [ IN
              {
                ACCOUNT                  |

                DATABASE                 |
                DATABASE <database_name> |

                SCHEMA                   |
                SCHEMA <schema_name>     |
                <schema_name>
              }
          ]
          [ STARTS WITH '<name_string>' ]
          [ LIMIT <rows> ]
          [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the contact was created. |
| `name` | Name of the contact. |
| `database_name` | Name of the database that contains the contact. |
| `schema_name` | Name of the schema that contains the contact. |
| `owner` | Role that has the OWNERSHIP privilege on the contact. |
| `comment` | User-specified string describing the contact, if specified. |
| `owner_role_type` | The type of role that has OWNERSHIP privilege on the contact. Either ROLE, DATABASE_ROLE, or APPLICATION (if a Snowflake Native App owns the object). |
| `email_distribution_list` | Email addresses associated with the contact. |
| `url` | URL associated with the contact. |
| `entries_in_users` | If user names are associated with the contact, displays how many users are associated. |
| `users` | Array of the users associated with the contact. |

## Access control requirements

Executing SHOW CONTACTS requires the USAGE privilege on the schema that contains the contact.

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the contacts that you have the privileges to view in the PUBLIC schema of the `mydb` database:

```sqlexample
USE DATABASE mydb;

SHOW CONTACTS;
```

---
title: SHOW CORTEX SEARCH SERVICES
source: https://docs.snowflake.com/en/sql-reference/sql/show-cortex-search.md
section: SQL Commands
---

# SHOW CORTEX SEARCH SERVICES

Lists the [Cortex Search services](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) for which you have access
privileges.

## Syntax

```sqlsyntax
SHOW CORTEX SEARCH SERVICES
  [ LIKE PATTERN '<pattern>' ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

> `LIKE 'pattern'`
> :   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
>     wildcard characters (`%` and `_`).
>
>     For example, the following patterns return the same results:
>
>     `... LIKE '%testing%' ...`
>
>     `... LIKE '%TESTING%' ...`
>
>     . Default: No value (no filtering is applied to the output).
>
> `STARTS WITH 'name_string'`
> :   Optionally filters the command output based on the characters that appear at the beginning of
>     the object name. The string must be enclosed in single quotes and is case sensitive.
>
>     For example, the following strings return different results:
>
>     `... STARTS WITH 'B' ...`
>
>     `... STARTS WITH 'b' ...`
>
>     . Default: No value (no filtering is applied to the output)
>
> `LIMIT rows [ FROM 'name_string' ]`
> :   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
>     returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.
>
>     The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
>     specified number of rows following the first row whose object name matches the specified string:
>
>     * The string must be enclosed in single quotes and is case sensitive.
>     * The string does not have to include the full object name; partial names are supported.
>
>     Default: No value (no limit is applied to the output)
>
>     > **Note:**
>     >
>     > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
>     > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
>     > returned.
>     >
>     > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
>     > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
>     >
>     > For example:
>     >
>     > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
>     > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
>     > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides the Cortex Search service properties and metadata in the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `created_on` | TIMESTAMP_LTZ | Creation time of the Cortex Search Service. |
| `name` | TEXT | Name of the service. |
| `schema_name` | TEXT | The schema in which the service resides. |
| `database_name` | TEXT | The database in which the service resides. |
| `warehouse` | TEXT | The warehouse used for service refreshes. |
| `target_lag` | TEXT | The maximum amount of time that the service’s content should lag behind updates to the base tables. |
| `comment` | TEXT | Any comments associated with the service. |
| `definition` | TEXT | SQL query used to create the service. |
| `search_column` | TEXT | Name of the search column. |
| `attribute_columns` | TEXT | Comma-separated list of attribute columns in the service. |
| `columns` | TEXT | Comma-separated list of columns in the service. |
| `primary_key_columns` | TEXT | Comma-separated list of [primary key column](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) names defined on the service. Empty if no primary key is set. |
| `scoring_profile_count` | NUMBER | The number of [named scoring profiles](../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md) defined in the service. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists the Cortex Search service that you have the privileges to view in the PUBLIC schema of the `mydb` database:

```sqlexample
USE DATABASE mydb;

SHOW CORTEX SEARCH SERVICES;
```

---
title: SHOW DATA METRIC FUNCTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-data-metric-functions.md
section: SQL Commands
---

# SHOW DATA METRIC FUNCTIONS

Lists the [data metric functions](../../user-guide/data-quality-intro.md) (DMFs) for which you have access privileges.

You can use this command to list the DMFs in the current database and schema for the session, a specified database or
schema, or your entire account.

See also:
:   [CREATE DATA METRIC FUNCTION](create-data-metric-function.md) , [ALTER FUNCTION (DMF)](alter-function-dmf.md), [DESCRIBE FUNCTION (DMF)](desc-function-dmf.md) , [DROP FUNCTION (DMF)](drop-function-dmf.md)

## Syntax

```sqlsyntax
SHOW DATA METRIC FUNCTIONS
  [ LIKE '<pattern>' ]
  [ IN
      {
        ACCOUNT                  |

        DATABASE                 |
        DATABASE <database_name> |

        SCHEMA                   |
        SCHEMA <schema_name>     |
        <schema_name>
      }
  ]
  [ STARTS WITH '<name_string>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

## Output

The command output provides DMF properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp at which the function was created. |
| `name` | Name of the function. |
| `schema_name` | Name of the schema that the function exists in. NULL for built-in functions. |
| `is_builtin` | `Y` if the function is a built-in function; `N` otherwise. |
| `is_aggregate` | `Y` if the function is an aggregate function; `N` otherwise. |
| `is_ansi` | `Y` if the function is defined as part of the ANSI SQL standard; `N` otherwise. |
| `min_num_arguments` | Minimum number of arguments. |
| `max_num_arguments` | Maximum number of arguments. |
| `arguments` | Shows the data types of the arguments and of the return value. |
| `description` | Description of the function. |
| `catalog_name` | Name of the database that the function exists in. NULL for built-in functions. |
| `is_table_function` | `Y` if the function is a table function; `N` otherwise. |
| `valid_for_clustering` | `Y` if the function can be used in a CLUSTER BY expression; `N` otherwise. |
| `is_secure` | `Y` if the function is a secure function; `N` otherwise. |
| `is_external_function` | `Y` if the function is an external function; `N` otherwise. |
| `language` | * For built-in functions, this column shows `SQL`. * For user-defined functions, this column shows the language in which the function was written, such as `JAVASCRIPT` or `SQL`. See [SHOW USER FUNCTIONS](show-user-functions.md). * For external functions, this column shows `EXTERNAL`. |
| `is_memoizable` | `Y` if the function is memoizable; `N` otherwise. |
| `is_data_metric` | `Y` if the function is a DMF; `N` otherwise. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Data metric function |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the DMFs that you have the privileges to view in the `dmfs` schema of the
`governance` database:

```sqlexample
USE SCHEMA governance.dmfs;

SHOW DATA METRIC FUNCTIONS;
```

```output
+--------------------------+------------------------+-------------+------------+--------------+---------+-------------------+-------------------+--------------------------------------------------------------------------------------------+-----------------------+--------------+-------------------+----------------------+-----------+----------------------+----------+---------------+----------------+
| created_on               | name                   | schema_name | is_builtin | is_aggregate | is_ansi | min_num_arguments | max_num_arguments | arguments                                                                                  | description           | catalog_name | is_table_function | valid_for_clustering | is_secure | is_external_function | language | is_memoizable | is_data_metric |
+--------------------------+------------------------+-------------+------------+--------------+---------+-------------------+-------------------+--------------------------------------------------------------------------------------------+-----------------------+--------------+-------------------+----------------------+-----------+----------------------+----------+---------------+----------------+
| 2023-12-11T23:30:02.785Z | COUNT_POSITIVE_NUMBERS | DMFS        | N          | N            | N       | 1                 | 1                 | "COUNT_POSITIVE_NUMBERS(TABLE(NUMBER, NUMBER, NUMBER)) RETURNS NUMBER"                     | user-defined function | GOVERNANCE   | N                 | N                    | N         | N                    | SQL      | N             | Y              |
+--------------------------+------------------------+-------------+------------+--------------+---------+-------------------+-------------------+--------------------------------------------------------------------------------------------+-----------------------+--------------+-------------------+----------------------+-----------+----------------------+----------+---------------+----------------+
```

---
title: SHOW DATABASE ROLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-database-roles.md
section: SQL Commands
---

# SHOW DATABASE ROLES

Lists all the database roles in the specified database.

> **Important:**
>
> A user with any active role that has been granted any privilege on the active database (e.g. USAGE) can list the database roles in the
> database. However, this does not necessarily mean the role allows users to use the database roles to perform SQL actions. To use a
> database role, it must first be granted to an account role that users can activate in a user session, or to an account role lower in a
> hierarchy.
>
> This is a part of Discretionary Access Control and Role-Based Access Control. For more information, see
> [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [SHOW GRANTS](show-grants.md) , [CREATE DATABASE ROLE](create-database-role.md) , [ALTER DATABASE ROLE](alter-database-role.md) , [DROP DATABASE ROLE](drop-database-role.md)

## Syntax

```sqlsyntax
SHOW DATABASE ROLES IN DATABASE <name>
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Required parameters

`name`
:   Specifies the name of the database.

    The command returns an error if you do not specify the name identifier.

## Optional parameters

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* This command only supports showing database roles in a specific database.

  You can’t use this command to show database roles in the account.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

Return up to ten database roles in the database named `mydb` after the first database role named `db_role2`:

```sqlexample
SHOW DATABASE ROLES IN DATABASE mydb LIMIT 10 FROM 'db_role2';
```

---
title: SHOW DATABASES
source: https://docs.snowflake.com/en/sql-reference/sql/show-databases.md
section: SQL Commands
---

# SHOW DATABASES

Lists the databases for which you have access privileges across your entire account, including dropped databases that are still within
the Time Travel retention period and, therefore, can be undropped.

The output returns database metadata and properties, ordered lexicographically by database name. This is important to note if you wish
to filter the results using the provided filters.

See also:
:   [CREATE DATABASE](create-database.md) , [ALTER DATABASE](alter-database.md) , [DESCRIBE DATABASE](desc-database.md) , [DROP DATABASE](drop-database.md) , [UNDROP DATABASE](undrop-database.md)

    [DATABASES view](../info-schema/databases.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW [ TERSE ] DATABASES [ HISTORY ] [ LIKE '<pattern>' ]
                                     [ STARTS WITH '<name_string>' ]
                                     [ LIMIT <rows> [ FROM '<name_string>' ] ]
                                     [ WITH PRIVILEGES <object_privilege> [ , <object_privilege> [ , ... ] ] ]
```

## Parameters

`TERSE`
:   Optionally returns output containing only the following columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

    Note that `kind`, `database_name`, and `schema_name` always display `NULL` because the columns are not
    applicable for databases.

    Default: No value (all columns are included in the output)

`HISTORY`
:   Optionally includes dropped databases that have not yet been purged (i.e. they are still within their respective Time Travel
    retention periods). If multiple versions of a dropped database exist, the output displays a row for each version. The output also
    includes an additional `dropped_on` column, which displays:

    * Date and timestamp (for dropped databases).
    * `NULL` (for active databases).

    Default: No value (dropped databases are not included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

`WITH PRIVILEGES object_privilege [ , object_privilege [ , ... ] ]`
:   Optionally limits rows to objects for which the [active role](../../user-guide/security-access-control-overview.md) for the current
    user has been granted all of the specified privileges in the list on the object.

    If a CREATE <object> privilege is included in the privileges list, the command excludes objects for which secondary roles have
    been granted privileges. This is because only the primary role has the authorization to create objects. For more information, see
    [Authorization through primary role and secondary roles](../../user-guide/security-access-control-overview.md).

`OBJECT_VISIBILITY`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account,
    enabling users without explicit access privileges to find objects and request access.

## Usage notes

* The `HISTORY` and `WITH PRIVILEGES` parameters are mutually exclusive; they cannot both be used in the same statement.
* For a [personal database](../../user-guide/personal-databases.md), the value in the `kind` column is `PERSONAL DATABASE`.
* For [catalog-linked databases](../../user-guide/tables-iceberg-catalog-linked-database.md), the `kind` column is `CATALOG-LINKED DATABASE`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all databases that you have privileges to view in your account:

```sqlexample
SHOW DATABASES;
```

```output
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+----------+-----------------+-------------------+-------------------------------------+
| created_on                      | name      | is_default | is_current | origin | owner  | comment | options | retention_time | kind     | owner_role_type | object_visibility | data_quality_monitoring_settings    |
|---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+----------|-----------------|-------------------|-------------------------------------|
| Tue, 17 Mar 2015 16:57:04 -0700 | MYTESTDB  | N          | Y          |        | PUBLIC |         |         | 1              | STANDARD | ROLE            | NULL              | NULL                                |
| Wed, 25 Feb 2015 17:30:04 -0800 | SALES1    | N          | N          |        | PUBLIC |         |         | 1              | STANDARD | ROLE            | NULL              | NULL                                |
| Fri, 13 Feb 2015 19:21:49 -0800 | DEMO1     | N          | N          |        | PUBLIC |         |         | 1              | STANDARD | ROLE            | NULL              | NULL                                |
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+----------+-----------------+-------------------+-------------------------------------+
```

Show all databases that you have privileges to view in the system, including dropped databases (this example builds on the
[DROP DATABASE](drop-database.md) examples):

```sqlexample
SHOW DATABASES HISTORY;
```

```output
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------+----------+-----------------+-------------------+-------------------------------------+
| created_on                      | name      | is_default | is_current | origin | owner  | comment | options | retention_time | dropped_on                      | kind     | owner_role_type | object_visibility | data_quality_monitoring_settings    |
|---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------|----------|-----------------|-------------------|-------------------------------------|
| Tue, 17 Mar 2015 16:57:04 -0700 | MYTESTDB  | N          | Y          |        | PUBLIC |         |         | 1              | [NULL]                          | STANDARD | ROLE            | NULL              | NULL                                |
| Wed, 25 Feb 2015 17:30:04 -0800 | SALES1    | N          | N          |        | PUBLIC |         |         | 1              | [NULL]                          | STANDARD | ROLE            | NULL              | NULL                                |
| Fri, 13 Feb 2015 19:21:49 -0800 | DEMO1     | N          | N          |        | PUBLIC |         |         | 1              | [NULL]                          | STANDARD | ROLE            | NULL              | NULL                                |
| Wed, 25 Feb 2015 16:16:54 -0800 | MYTESTDB2 | N          | N          |        | PUBLIC |         |         | 1              | Fri, 13 May 2016 17:35:09 -0700 | STANDARD | ROLE            | NULL              | NULL                                |
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+---------------------------------+----------+-----------------+-------------------+-------------------------------------+
```

Show all databases that you have been granted the USAGE and MODIFY privileges on:

```sqlexample
SHOW DATABASES WITH PRIVILEGES USAGE, MODIFY;
```

```output
+-------------------------------+------------+------------+------------+---------------------------+--------------+---------+---------+----------------+-------------------+-----------------+-------------------+-------------------------------------+
| created_on                    | name       | is_default | is_current | origin                    | owner        | comment | options | retention_time | kind              | owner_role_type | object_visibility | data_quality_monitoring_settings    |
|-------------------------------+------------+------------+------------+---------------------------+--------------+---------+---------+----------------+-------------------+-----------------|-------------------|-------------------------------------|
| 2023-01-27 14:33:11.417 -0800 | BOOKS_DB   | N          | N          |                           | DATA_ADMIN   |         |         | 1              | STANDARD          | ROLE            | NULL              | NULL                                |
| 2023-09-15 15:22:51.111 -0700 | TEST_DB    | N          | N          |                           | ACCOUNTADMIN |         |         | 4              | STANDARD          | ROLE            | NULL              | NULL                                |
| 2023-08-18 13:33:01.024 -0700 | SNOWFLAKE  | N          | N          | SNOWFLAKE.ACCOUNT_USAGE   |              |         |         | 0              | APPLICATION       |                 | NULL              | NULL                                |
+-------------------------------+------------+------------+------------+---------------------------+--------------+---------+---------+----------------+-------------------+-----------------+-------------------+-------------------------------------+
```

---
title: SHOW DATABASES IN FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/show-databases-in-failover-group.md
section: SQL Commands
---

# SHOW DATABASES IN FAILOVER GROUP

Lists databases in a [failover group](../../user-guide/account-replication-intro.md).

See also:
:   [SHOW LISTINGS IN FAILOVER GROUP](show-listings-in-failover-group.md), [SHOW SHARES IN FAILOVER GROUP](show-shares-in-failover-group.md)

## Syntax

```sqlsyntax
SHOW DATABASES IN FAILOVER GROUP <name>
```

## Parameters

`name`
:   Specifies the identifier for the failover group.

## Usage notes

* Executing this command requires a role with either the OWNERSHIP or MONITOR privilege on the failover group. The command
  returns results only for a role with the MONITOR privilege on a database.
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the databases in the failover group `myfg`:

```sqlexample
SHOW DATABASES IN FAILOVER GROUP myfg;
```

---
title: SHOW DATABASES IN REPLICATION GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/show-databases-in-replication-group.md
section: SQL Commands
---

# SHOW DATABASES IN REPLICATION GROUP

Lists databases in a [replication group](../../user-guide/account-replication-intro.md).

See also:
:   [SHOW SHARES IN REPLICATION GROUP](show-shares-in-replication-group.md)

## Syntax

```sqlsyntax
SHOW DATABASES IN REPLICATION GROUP <name>
```

## Parameters

`name`
:   Specifies the identifier for the replication group.

## Usage notes

* Executing this command requires a role with either the OWNERSHIP or MONITOR privilege on the replication group. The command
  returns results only for a role with the MONITOR privilege on a database.
* To retrieve the list of replication (and failover) groups in your organization, use [SHOW REPLICATION GROUPS](show-replication-groups.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the databases in the replication group `myrg`:

```sqlexample
SHOW DATABASES IN REPLICATION GROUP myrg;
```

---
title: SHOW DATASETS
source: https://docs.snowflake.com/en/sql-reference/sql/show-datasets.md
section: SQL Commands
---

# SHOW DATASETS

Displays information about the datasets in your account.
You can show all datasets or use the IN subcommand to only display results at the schema or database level.

See also:
:   [CREATE DATASET](create-dataset.md) , [ALTER DATASET](alter-dataset.md)

## Syntax

```sqlsyntax
SHOW DATASETS
  [ LIKE '<pattern>' ]
  [ IN { SCHEMA <schema_name> | DATABASE <db_name> | ACCOUNT } ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Optional parameters

`LIKE pattern`
:   Restricts the list of returned datasets to those matching the specified pattern.

`IN SCHEMA <schema_name> | DATABASE <db_name> | ACCOUNT`
:   Restricts the list of returned datasets to those in the specified schema or database within an account.

`DATABASE db_name`
:   Restricts the list of returned datasets to those in the specified database.
    If you specify a database without `db_name` and no database is in use, they keyword has no
    effect on the output.

`SCHEMA schema_name`
:   By default, returns records for the schema in use. You can also specify a `schema_name`.

`STARTS WITH name_string`
:   Uses the string that you specify to limit the datasets returned.
    The names of the datasets returned have the same beginning characters as the specified string.

`LIMIT rows [ FROM name_string ]`
:   Limits the number of returned datasets to the specified number of rows.
    The optional FROM clause specifies the starting point for the returned datasets.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or USAGE | Dataset | Provides the privilege to show the datasets within the account. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example shows two datasets in the PUBLIC schema:

```sqlexample
SHOW DATASETS IN SCHEMA PUBLIC LIMIT 2;
```

---
title: SHOW DBT PROJECTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-dbt-projects.md
section: SQL Commands
---

# SHOW DBT PROJECTS

Lists the [dbt project objects](../../user-guide/data-engineering/dbt-projects-on-snowflake.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE DBT PROJECT](create-dbt-project.md), [ALTER DBT PROJECT](alter-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [DROP DBT PROJECT](drop-dbt-project.md), SHOW DBT PROJECTS

## Syntax

```sqlsyntax
SHOW DBT PROJECTS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>
                }
           ]
           [ STARTS WITH '<name_string>' ]
           [ LIMIT <rows> ]
           [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `external_access_integrations` | The name of the external access integrations the dbt Project is permitted to use to pull remote dependencies from dbt package hub or GitHub. |
| `name` | The identifier of the dbt project object. |
| `database_name` | The name of the database in which the dbt project object is defined. |
| `schema_name` | The name of the schema in which the dbt project object is defined. |
| `created_on` | Date and time when the dbt project object was created. |
| `updated_on` | Date and time when the dbt project object was last updated. |
| `owner` | The name of the role that owns the dbt project object. |
| `comment` | The comment associated with the dbt project object. |
| `dbt_version` | The version for the dbt Project. If no value is specified, the system uses version 1.9.4 by default. |
| `dbt_snowflake_version` | The Snowflake version the dbt project object is on. |
| `default_target` | The default execution target (for example, `prod` or `dev`) used by dbt commands executed through Snowflake. |

The following columns provide the value of a deprecated parameter:

| Column | Description |
| --- | --- |
| `default_version` | The version of the dbt project object:   * `LAST`: The most recent version of the dbt project object. * `FIRST`: The oldest version of the dbt project object. |
| `default_version_name` | The version identifier in the form `VERSION$num`, where `num` is a positive integer, for example: `VERSION$1`.  The version number begins at `1` when you create a dbt project object and increments by one with each new version of the dbt project object.  Snowflake increments the version identifier when you perform the following tasks:   * Redeploy dbt project from a workspace (runs the ALTER command with the ADD VERSION option). * Update the project by using the [ALTER DBT PROJECT](alter-dbt-project.md) command. * Run the Snow CLI `snow dbt deploy` command with the `--force` option.   Snowflake resets the version identifier to `1` and removes all version aliases when you run the CREATE DBT PROJECT command with the OR REPLACE option. |
| `default_version_alias` | The custom version name alias that you created for a specific version of the dbt project object using the ALTER DBT PROJECT command with the ADD VERSION option. A version name alias always maps to a specific version identifier, such as `VERSION$3`. |
| `default_version_location_uri` | The location URI of the default version. This is read only. |
| `default_version_source_location_uri` | The location URI of the default version’s source files in its Git object. If the dbt project object is not connected to a Git object, this is null. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE | dbt project |
| MONITOR | dbt project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the dbt project objects that you have privileges to view in the `public` schema of the `my_db` database:

```sqlexample
SHOW DBT PROJECTS IN DATABASE my_db;
```

```output
+-----------------------------+----------------+---------------+-------------+-------------------------------+-------------------------------+--------------+---------+-----------------+----------------------+-----------------------+------------------------------------------------------------+-------------------------------------+-----------------------+----------------+
| external_access_integrations |    name        | database_name | schema_name |          created_on           |          updated_on           |    owner     | comment | default_version | default_version_name | default_version_alias | default_version_location_uri                               | default_version_source_location_uri | dbt_snowflake_version | default_target |
+-----------------------------+----------------+---------------+-------------+-------------------------------+-------------------------------+--------------+---------|-----------------|----------------------|-----------------------+------------------------------------------------------------+-------------------------------------+-----------------------+----------------+
| my_ext_integration_1        | COSMOS         | MY_DB         | PUBLIC      | 2025-04-29 17:21:25.413 -0700 | 2025-04-29 17:21:29.462 -0700 | ACCOUNTADMIN |         | LAST            | VERSION$1            | null                  | snow://dbt/MY_DB.PUBLIC.COSMOS/versions/version$1/         | @s1                                 | 1.9.2b                | null           |
| my_ext_integration_1        | Jaffle_shop    | MY_DB         | PUBLIC      | 2025-03-25 12:36:16.574 -0700 | 2025-03-25 12:36:17.833 -0700 | ACCOUNTADMIN |         | LAST            | VERSION$1            | null                  | snow://dbt/MY_DB.PUBLIC.Jaffle_shop/versions/version$1/    | @s1                                 | 1.9.2b                | prod           |
| my_ext_integration_2        | MY_DBT_PROJECT | MY_DB         | PUBLIC      | 2025-05-02 13:42:36.306 -0700 | 2025-05-02 13:42:38.584 -0700 | ACCOUNTADMIN |         | LAST            | VERSION$1            | null                  | snow://dbt/MY_DB.PUBLIC.MY_DBT_PROJECT/versions/version$1/ | @s1                                 | 1.9.2b                | dev            |
| null                        | MY_SHOP        | MY_DB         | PUBLIC      | 2025-04-29 17:15:27.295 -0700 | 2025-04-29 17:15:28.709 -0700 | ACCOUNTADMIN |         | LAST            | VERSION$1            | null                  | snow://dbt/MY_DB.PUBLIC.MY_SHOP/versions/version$1/        | @s1                                 | 1.9.2b                | null           |
+-----------------------------+----------------+---------------+-------------+-------------------------------+-------------------------------+--------------+---------+-----------------+----------------------+-----------------------+------------------------------------------------------------+-------------------------------------+-----------------------+----------------+
```

---
title: SHOW DCM PROJECTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-dcm-projects.md
section: SQL Commands
---

# SHOW DCM PROJECTS

Lists the [DCM projects](../../user-guide/dcm-projects/dcm-projects-overview.md) for which you have at least READ privilege.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md) , [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md), [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DEPLOYMENTS IN DCM PROJECT](show-deployments-in-dcm-project.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] DCM PROJECTS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>
                }
           ]
           [ LIMIT <rows> ]
```

## Required parameters

None.

## Optional parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the DCM project was created. |
| `name` | Name of the DCM project. |
| `database_name` | Database in which the DCM project is stored. |
| `schema_name` | Schema in which the DCM project is stored. |
| `comment` | Comment for the DCM project. |
| `owner` | Role that owns the DCM project. |
| `kind` | Always `DCM Project`. |
| `last_executed_deployment_time` | Timestamp of the last executed deployment. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| READ | DCM project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

List the DCM projects that you have the privileges to view in the PUBLIC schema of the `mydb` database:

```sqlexample
USE DATABASE mydb;

SHOW DCM PROJECTS;
```

Show the available DCM projects in the `my_schema` schema:

```sqlexample
SHOW DCM PROJECTS IN SCHEMA my_schema;
```

Show the available DCM projects in the `my_db` database:

```sqlexample
SHOW DCM PROJECTS IN DATABASE my_db;
```

Show the available DCM projects whose names begin with `my_`:

```sqlexample
SHOW DCM PROJECTS LIKE 'my_%';
```

---
title: SHOW DELEGATED AUTHORIZATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-delegated-authorizations.md
section: SQL Commands
---

# SHOW DELEGATED AUTHORIZATIONS

Lists the active delegated authorizations for which you have access privileges. This command can be used to list the
DELEGATED AUTHORIZATIONS for a specified user or integration (or the current user), or your entire account.

## Syntax

```sqlsyntax
SHOW DELEGATED AUTHORIZATIONS

SHOW DELEGATED AUTHORIZATIONS BY USER <username>

SHOW DELEGATED AUTHORIZATIONS TO SECURITY INTEGRATION <integration_name>
```

## Variants

`SHOW DELEGATED AUTHORIZATIONS BY USER username`
:   Lists all the active delegated authorizations that have been approved by a user. This variant requires the MODIFY privilege
    on the user.

`SHOW DELEGATED AUTHORIZATIONS TO SECURITY INTEGRATION integration_name`
:   Lists all the active delegated authorizations that have been approved for an integration. This variant requires the
    ACCOUNTADMIN role.

For more details on each of these variants, see:

* [Viewing Delegated Authorizations for OAuth User Consent](../../user-guide/oauth-consent.md)
* [Display OAuth Consents in OAuth Partner Applications](../../user-guide/oauth-partner.md)

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

List all delegated authorizations for your account:

> ```sqlexample
> SHOW DELEGATED AUTHORIZATIONS;
>
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> | created_on                    | user_name | role_name | integration_name  | integration_status |
> |-------------------------------+-----------+-----------+-------------------+--------------------|
> | 2018-11-27 07:43:10.914 -0800 | JSMITH    | PUBLIC    | MY_OAUTH_INT1     | ENABLED            |
> | 2018-11-27 08:14:56.123 -0800 | MJONES    | PUBLIC    | MY_OAUTH_INT2     | ENABLED            |
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> ```

List all delegated authorizations for a specified user:

> ```sqlexample
> SHOW DELEGATED AUTHORIZATIONS BY USER jsmith;
>
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> | created_on                    | user_name | role_name | integration_name  | integration_status |
> |-------------------------------+-----------+-----------+-------------------+--------------------|
> | 2018-11-27 07:43:10.914 -0800 | JSMITH    | PUBLIC    | MY_OAUTH_INT1     | ENABLED            |
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> ```

List all delegated authorizations for a specified integration:

> ```sqlexample
> SHOW DELEGATED AUTHORIZATIONS TO SECURITY INTEGRATION my_oauth_int2;
>
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> | created_on                    | user_name | role_name | integration_name  | integration_status |
> |-------------------------------+-----------+-----------+-------------------+--------------------|
> | 2018-11-27 08:14:56.123 -0800 | MJONES    | PUBLIC    | MY_OAUTH_INT2     | ENABLED            |
> +-------------------------------+-----------+-----------+-------------------+--------------------+
> ```

---
title: SHOW DEPLOYMENTS IN DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/show-deployments-in-dcm-project.md
section: SQL Commands
---

# SHOW DEPLOYMENTS IN DCM PROJECT

Shows all deployments for the specified [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md).

The command returns deployment metadata and properties, ordered by creation date.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md), [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md), [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md)

## Syntax

```sqlsyntax
SHOW DEPLOYMENTS IN DCM PROJECT <name> [ LIMIT <rows> ]
```

## Required parameters

`IN DCM PROJECT name`
:   Specifies the identifier of the DCM project that contains the deployments to list.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The command output provides deployment properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the deployment was created. |
| `name` | Name of the deployment. |
| `alias` | User-specified deployment alias. |
| `deployment_file_path` | Full location URL for the deployment. Example: `snow://project/MY_DB.PUBLIC.P/deployment/deployment$2/` |
| `source_file_path` | Source location where this deployment is created from. This is the value provided with FROM `<source_location>`. |
| `git_commit_hash` | The Git commit hash that indicates the deployment of files in the Git repository from which the DCM project originates. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| MONITOR | DCM project |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Show all deployments of the DCM project named `my_project`:

```sqlexample
SHOW DEPLOYMENTS IN DCM PROJECT my_project;
```

---
title: SHOW DYNAMIC TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-dynamic-tables.md
section: SQL Commands
---

# SHOW DYNAMIC TABLES

Lists the [dynamic tables](../../user-guide/dynamic-tables-about.md) for which you have access privileges. The command can be used to list dynamic
tables for the current/specified database or schema, or across your entire account.

See also:
:   [CREATE DYNAMIC TABLE](create-dynamic-table.md), [ALTER DYNAMIC TABLE](alter-dynamic-table.md), [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md), [DROP DYNAMIC TABLE](drop-dynamic-table.md),
    [SHOW OBJECTS](show-objects.md), [TABLES view](../info-schema/tables.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW DYNAMIC TABLES [ LIKE '<pattern>' ]
                    [ IN
                      {
                           ACCOUNT              |

                           DATABASE             |
                           DATABASE <db_name>   |

                           SCHEMA               |
                           SCHEMA <schema_name> |
                           <schema_name>
                      }
                    ]
                    [ STARTS WITH '<name_string>' ]
                    [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| SELECT | The dynamic tables that you want to list. | Some metadata is hidden if you don’t have the MONITOR privilege. For more information, see [Privileges to view a dynamic table’s metadata](../../user-guide/dynamic-tables-privileges.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To SHOW a dynamic table, you must be using a role that has MONITOR privilege on the table.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the dynamic table was created. |
| `name` | Name of the dynamic table. |
| `database_name` | Database in which the dynamic table is stored. |
| `schema_name` | Schema in which the dynamic table is stored. |
| `cluster_by` | The clustering key(s) for the dynamic table. |
| `rows` | Number of rows in the table. |
| `bytes` | Number of bytes that will be scanned if the entire dynamic table is scanned in a query. . . Note that this number may be different than the number of actual physical bytes (i.e. bytes stored on-disk) for the table. |
| `owner` | Role that owns the dynamic table. |
| `target_lag` | The maximum duration that the dynamic table’s content should lag behind real time. `NULL` if `scheduler` is set to `DISABLE`. |
| `scheduler` | Specifies whether the dynamic table is to be refreshed automatically by Snowflake’s Dynamic Table scheduler. `ENABLE` if the dynamic table is scheduled automatically. `DISABLE` if the dynamic table isn’t scheduled automatically. `NULL` if the `SCHEDULER` attribute wasn’t explicitly set (the dynamic table is scheduler-managed by default). |
| `refresh_mode` | Returns `INCREMENTAL` if the dynamic table uses incremental refreshes, or `FULL` if it recomputes the whole table on every refresh. |
| `refresh_mode_reason` | Explanation for why the refresh mode was chosen. If Snowflake chose `FULL` when `INCREMENTAL` is supported, the output provides a reason for why it thinks full refresh performs better. NULL if no pertinent information is available. |
| `warehouse` | Warehouse that provides the required resources to perform the incremental refreshes. |
| `comment` | Comment for the dynamic table. |
| `text` | The text of the command that created this dynamic table (e.g. `CREATE DYNAMIC TABLE ...`). |
| `automatic_clustering` | Whether auto-clustering is enabled on the dynamic table. Not currently supported for dynamic tables. |
| `scheduling_state` | Displays RUNNING for dynamic tables that are actively scheduling refreshes and SUSPENDED for suspended dynamic tables. |
| `last_suspended_on` | Timestamp of last suspension. |
| `is_clone` | TRUE if the dynamic table is a clone; else FALSE. |
| `is_replica` | TRUE if the dynamic table is a replica; else FALSE. |
| `is_iceberg` | TRUE if the dynamic table is a dynamic Apache Iceberg™ table; else FALSE. |
| `data_timestamp` | Timestamp of the data in the base object(s) that is included in the dynamic table. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . . Database-level roles, for example `DATABASE_ROLE`, can’t be owners. The owner of a dynamic table must have the USAGE privilege on the warehouse. Since the warehouse is an account-level object, a database role, which operates at the database level, can’t be granted access to it. . . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `immutable_where` | Displays the IMMUTABLE WHERE [immutability constraint](../../user-guide/dynamic-tables-immutability-constraints.md) set on the dynamic table. Displays NULL if there is none. |
| `execute_as_user` | Displays the user name of a user refreshing a dynamic table using impersonated privileges (EXECUTE AS USER). NULL if executed as the system user (default). INVALID if the specified user ID is no longer valid (for example, user dropped). For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../user-guide/dynamic-tables-privileges.md). |

## Examples

Show all the dynamic tables with names that start with `product_` in the `mydb.myschema` schema:

> ```sqlexample
> SHOW DYNAMIC TABLES LIKE 'product_%' IN SCHEMA mydb.myschema;
> ```

---
title: SHOW ENDPOINTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-endpoints.md
section: SQL Commands
---

# SHOW ENDPOINTS

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Lists the endpoints in a
[Snowpark Container Services service](../../developer-guide/snowpark-container-services/working-with-services.md) (or a job service). Use the command to list endpoints in a service or service running as a job.

See also:
:   [CREATE SERVICE](create-service.md) , [ALTER SERVICE](alter-service.md), [DROP SERVICE](drop-service.md) , [SHOW SERVICES](show-services.md)

## Syntax

```sqlsyntax
SHOW ENDPOINTS IN SERVICE <name>
```

## Parameters

`name`
:   Specifies the identifier for the service whose endpoints to list.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The command output provides service properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | User-friendly endpoint name that represents the corresponding port. |
| `port` | The network port the service is listening on. NULL, when `portRange` is specified. |
| `port_range` | The network port range the service is listening on. NULL, when `port` is specified. |
| `protocol` | Supported network protocol (TCP, HTTP, or HTTPS). The default is HTTP. Public endpoints and service functions (see [Using a service](../../developer-guide/snowpark-container-services/working-with-services.md)) require HTTP or HTTPS. |
| `is_public` | True, if the endpoint is public, accessible from internet. |
| `ingress_url` | Endpoint URL accessible from the internet. |
| `privatelink_ingress_url` | Endpoint URL accessible via Private Connectivity. The column is returned only for [Business Critical](../../user-guide/intro-editions.md) accounts. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists endpoints exposed by `echo_service` service:

```sqlexample
SHOW ENDPOINTS IN SERVICE echo_service;
```

```output
+--------------+------+------------+----------+-----------+------------------------------------------------------------------------------+-----------------------------------------------+
| name         | port | port_range | protocol | is_public | ingress_url                                                                  | privatelink_ingress_url                       |
|--------------+------+------------+----------+-----------+------------------------------------------------------------------------------|-----------------------------------------------*
| echoendpoint | 8080 |            | HTTP     | true      | d7qoajz-orgname-acctname.pp-snowflakecomputing.app                           | d7qoajz.spcs.pdxaac.privatelink.snowflake.app |
+--------------+------+------------+----------+-----------+------------------------------------------------------------------------------+-----------------------------------------------*
```

---
title: SHOW ENTITIES IN DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/show-entities-in-dcm-project.md
section: SQL Commands
---

# SHOW ENTITIES IN DCM PROJECT

Shows all Snowflake objects that are currently managed by a specified [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md).

It provides a mixed list of fully qualified names for all objects. To see any results, users need both READ privilege on the DCM project
and READ privilege on the managed object itself.

> **Note:**
>
> The result does not necessarily match the entities of the most recent deployment. Objects that were manually dropped or detached from the
> project, will not be listed here.

The command returns object metadata and properties, ordered by creation date.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md), [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md), [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md)

## Syntax

```sqlsyntax
SHOW ENTITIES IN DCM PROJECT <name> [ LIMIT <rows> ]

SHOW ENTITIES LIKE <pattern> IN DCM PROJECT <name>;

SHOW ENTITIES IN DCM PROJECT <name> STARTS WITH <prefix>;

SHOW ENTITIES IN DCM PROJECT <name> LIMIT <n> FROM <cursor>;
```

## Required parameters

`IN DCM PROJECT name`
:   Specifies the identifier of the DCM project that contains the deployments to list.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

| Column | Description |
| --- | --- |
| `CREATED_ON` | creation timestamp (LTZ) |
| `NAME` | fully-qualified name of the object (FQN), suitable for DESC |
| `OBJECT_TYPE` | object type |
| `OWNER` | owning role, per-domain conventions |
| `COMMENT` | user-specified comment |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| READ | * DCM project * Managed object |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Show all entities in the `my_project` DCM project:

```sqlexample
SHOW ENTITIES IN DCM PROJECT my_project;
```

Show all entities in the `my_project` DCM project that start with `my_`:

```sqlexample
SHOW ENTITIES LIKE 'my_%' IN DCM PROJECT my_project;
```

Show all dynamic tables in the `my_project` DCM project:

```sqlexample
SHOW ENTITIES IN DCM PROJECT my_project
  ->> SELECT * FROM $1 WHERE "object_type" = 'DYNAMIC_TABLE';
```

---
title: SHOW EVENT TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-event-tables.md
section: SQL Commands
---

# SHOW EVENT TABLES

Lists the [event tables](../../developer-guide/logging-tracing/event-table-setting-up.md) for which you have access privileges, including
dropped tables that are still within the Time Travel retention period and, therefore, can be undropped. The command can be used to list
event tables for the current/specified database or schema, or across your entire account.

The output returns table metadata and properties, ordered lexicographically by database, schema, and event table name (see
Output in this topic for descriptions of the output columns). This is important to note if you wish to filter the results using
the provided filters.

See also:
:   [CREATE EVENT TABLE](create-event-table.md), [ALTER TABLE (event tables)](alter-table-event-table.md), [DROP TABLE](drop-table.md),
    [UNDROP TABLE](undrop-table.md)

    [TABLES view](../info-schema/tables.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW [ TERSE ] EVENT TABLES [ LIKE '<pattern>' ]
  [ IN { ACCOUNT | DATABASE [ <db_name> ] | SCHEMA [ <schema_name> ] } ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN ACCOUNT | DATABASE [ db_name ] | SCHEMA [ schema_name ]`
:   Optionally specifies the scope of the command, which determines whether the command lists records only for the current/specified database or schema, or across your entire account.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the current
      database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the event table was created. |
| `name` | Name of the event table. |
| `database_name` | Database in which the event table is stored. |
| `schema_name` | Schema in which the event table is stored. |
| `owner` | Role that owns the event table. |
| `comment` | Comment for the event table. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

For more information about the properties that can be specified for an event table, see [CREATE EVENT TABLE](create-event-table.md).

## Usage notes

* If an account (or database or schema) has a large number of event tables, then searching the entire account (or table or schema)
  can consume a significant amount of compute resources.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the event tables whose name starts with `mylogs` that you have privileges to view in the `tpch.public`
schema:

> ```sqlexample
> SHOW EVENT TABLES LIKE 'mylogs%' IN tpch.public;
> ```

---
title: SHOW EXPERIMENTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-experiments.md
section: SQL Commands
---

# SHOW EXPERIMENTS

Lists the [experiments](../../developer-guide/snowflake-ml/experiments.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE EXPERIMENT](create-experiment.md) , [ALTER EXPERIMENT](alter-experiment.md), [SHOW RUNS IN EXPERIMENT](show-runs-in-experiment.md) , [DROP EXPERIMENT](drop-experiment.md), [SHOW RUN … IN EXPERIMENT](show-run-in-experiment.md)

## Syntax

```sqlsyntax
SHOW EXPERIMENTS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                      |
                  DATABASE [ <database_name> ] |
                  SCHEMA [ <schema_name> ]
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the experiment was created. |
| `name` | The identifier for the experiment. |
| `database_name` | The database that the experiment is stored in. |
| `schema_name` | The schema that the experiment is stored in. |
| `owner` | The role that owns the experiment. |
| `runs` | A JSON array containing the names of runs in the experiment. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

---
title: SHOW EXTERNAL FUNCTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-external-functions.md
section: SQL Commands
---

# SHOW EXTERNAL FUNCTIONS

Lists all the external functions created for your account.

For more information, see [Writing external functions](../external-functions.md).

See also:
:   [SHOW FUNCTIONS](show-functions.md) ,
    [SHOW USER FUNCTIONS](show-user-functions.md),
    [CREATE EXTERNAL FUNCTION](create-external-function.md) ,
    [ALTER FUNCTION](alter-function.md)

## Syntax

```sqlsyntax
SHOW EXTERNAL FUNCTIONS [ LIKE '<pattern>' ]
           [ IN { APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> }  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

## Usage notes

* The commands [SHOW FUNCTIONS](show-functions.md) and [SHOW USER FUNCTIONS](show-user-functions.md) also display information
  about external functions.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all external functions:

> ```sqlexample
> SHOW EXTERNAL FUNCTIONS;
> ```

Show only external functions matching the specified regular expression:

> ```sqlexample
> SHOW EXTERNAL FUNCTIONS LIKE 'SQUARE%';
> ```

---
title: SHOW EXTERNAL TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-external-tables.md
section: SQL Commands
---

# SHOW EXTERNAL TABLES

Lists the external tables for which you have access privileges. The command can be used to list external tables for the current or specified
database or schema, or across your entire account.

The output returns external table metadata and properties, ordered lexicographically by database, schema, and external table name.
For more information, see Output in this topic for descriptions of the output columns. This behavior is important to understand if you want to filter the results by using
the provided filters.

See also:
:   [CREATE EXTERNAL TABLE](create-external-table.md) , [DROP EXTERNAL TABLE](drop-external-table.md) , [ALTER EXTERNAL TABLE](alter-external-table.md) , [DESCRIBE EXTERNAL TABLE](desc-external-table.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] EXTERNAL TABLES [ LIKE '<pattern>' ]
                               [ IN
                                        {
                                          ACCOUNT                                         |

                                          DATABASE                                        |
                                          DATABASE <database_name>                        |

                                          SCHEMA                                          |
                                          SCHEMA <schema_name>                            |
                                          <schema_name>

                                          APPLICATION <application_name>                  |
                                          APPLICATION PACKAGE <application_package_name>  |
                                        }
                               ]
                               [ STARTS WITH '<name_string>' ]
                               [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the external table was created. |
| name | Name of the external table. |
| database_name | Database for the schema for the external table. |
| schema_name | Schema for the external table. |
| invalid | TRUE if either the stage or file format referenced in the external table description is dropped. |
| invalid_reason | Reason why the external table is invalid, when the INVALID column shows a TRUE value. |
| owner | Role that owns the external table. |
| comment | Comment for the external table. |
| stage | Fully qualified name of the stage referenced in the external table definition. |
| location | External stage and folder path in the external table definition. NULL for external tables in an imported share in a data consumer account. |
| file_format_name | Named file format in the external table definition. Doesn’t display a file format specified in the stage definition. |
| file_format_type | File format type specified in the external table definition. Doesn’t display a file format type specified in the stage definition. |
| cloud | Cloud in which the staged data files are located. |
| region | Region in which the staged data files are located. |
| notification_channel | Amazon Resource Name of the Amazon SQS queue for the external table. |
| last_refreshed_on | Timestamp that indicates when the metadata for the external table was last synchronized with the latest set of associated files in the external stage and path, either manually or automatically. |
| table_format | Table format of the staged files that are referenced by the external table. Possible values: DELTA, UNSPECIFIED. |
| last_refresh_details | Supports future functionality; currently NULL only. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

For more information about the properties that can be specified for an external table, see [CREATE EXTERNAL TABLE](create-external-table.md).

## Usage notes

* This command doesn’t list external tables that have been dropped.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the external tables whose name starts with `line` that you have privileges to view in the `tpch.public` schema:

> ```sqlexample
> SHOW EXTERNAL TABLES LIKE 'line%' IN tpch.public;
> ```

---
title: SHOW EXTERNAL VOLUMES
source: https://docs.snowflake.com/en/sql-reference/sql/show-external-volumes.md
section: SQL Commands
---

# SHOW EXTERNAL VOLUMES

Lists the [external volumes](../../user-guide/tables-iceberg.md) in your account for which you have access privileges.

The output returns external volume metadata and properties.

See also:
:   [CREATE EXTERNAL VOLUME](create-external-volume.md) , [DROP EXTERNAL VOLUME](drop-external-volume.md) , [ALTER EXTERNAL VOLUME](alter-external-volume.md) , [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md)

## Syntax

```sqlsyntax
SHOW EXTERNAL VOLUMES [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | External volume | To see a particular external volume in the output for SHOW EXTERNAL VOLUMES, a role must have the USAGE privilege on that external volume. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the external volume. |
| `allow_writes` | Signifies whether Snowflake can write files to the storage location(s). |
| `comment` | Comment for the external volume. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all external volumes:

> ```sqlexample
> SHOW EXTERNAL VOLUMES;
> ```

Show all the external volumes whose name starts with `aws` that you have privileges to view:

> ```sqlexample
> SHOW EXTERNAL VOLUMES LIKE 'aws%';
> ```

---
title: SHOW FAILOVER GROUPS
source: https://docs.snowflake.com/en/sql-reference/sql/show-failover-groups.md
section: SQL Commands
---

# SHOW FAILOVER GROUPS

Lists the primary and secondary [failover groups](../../user-guide/account-replication-intro.md) in your account,
as well as the failover groups in other accounts that are associated with your account.

For the other accounts:

* Lists the primary failover groups enabled for replication and failover to this account.
* Lists the secondary failover groups linked to groups in this account.

See also:
:   [CREATE FAILOVER GROUP](create-failover-group.md) , [ALTER FAILOVER GROUP](alter-failover-group.md) , [DROP FAILOVER GROUP](drop-failover-group.md)

## Syntax

```sqlsyntax
SHOW FAILOVER GROUPS [ IN ACCOUNT <account> ]
```

## Parameters

`account`
:   Specifies the identifier for the account. Account name is a unique identifier within your organization. For more details about account
    name, see [Format 1 (preferred): Account name in your organization](../../user-guide/admin-account-identifier.md).

## Usage notes

* Executing this command requires a role with any one of the following privileges on a failover group:

  + FAILOVER
  + MONITOR
  + OWNERSHIP
  + REPLICATE
* The output of SHOW FAILOVER GROUPS only includes groups of type `FAILOVER`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command returns the following columns:

| Column | Description |
| --- | --- |
| `region_group` | Region group where the account is located. **Note:** this column is only visible to organizations that span multiple [Region groups](../../user-guide/admin-account-identifier.md). |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time failover group was created. |
| `account_name` | Name of the account. |
| `name` | Name of the failover group. |
| `type` | Type of group. Valid value is `FAILOVER`. |
| `comment` | Comment string. |
| `is_primary` | Indicates whether the failover group is the primary group. |
| `primary` | Name of the primary group. |
| `object_types` | List of specified object types enabled for replication and failover. |
| `allowed_integration_types` | A list of integration types that are enabled for replication.  Snowflake always includes this column in the output even if integrations were not specified in the CREATE FAILOVER GROUP or ALTER FAILOVER GROUP command. |
| `allowed_accounts` | List of accounts enabled for replication and failover. |
| `organization_name` | Name of your Snowflake organization. |
| `account_locator` | Account locator in a region. |
| `replication_schedule` | Scheduled interval for refresh; NULL if no replication schedule is set. |
| `secondary_state` | Current state of scheduled refresh. Valid values are `started` or `suspended`. NULL if no replication schedule is set. |
| `next_scheduled_refresh` | Date and time of the next scheduled refresh. |
| `owner` | Name of the role with the OWNERSHIP privilege on the failover group. NULL if the failover group is in a different region. |
| `is_listing_auto_fulfillment_group` | TRUE if the replication group is used for [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md). FALSE otherwise. |

## Examples

List failover groups in account `myaccount1`.

```sqlexample
SHOW FAILOVER GROUPS IN ACCOUNT myaccount1;

+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+------------+-----------------------------------+
| snowflake_region | created_on                    | account_name | name | type     | comment | is_primary | primary               | object_types                                | allowed_integration_types |  allowed_accounts                            | organization_name | account_locator   | replication_schedule | secondary_state | next_scheduled_refresh        | owner      | is_listing_auto_fulfillment_group |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+------------+-----------------------------------+
| AWS_US_EAST_1    | 2021-10-25 19:08:15.209 -0700 | MYACCOUNT1   | MYFG | FAILOVER |         | true       | MYORG.MYACCOUNT1.MYFG | DATABASES, ROLES, USERS, WAREHOUSES, SHARES |                           | MYORG.MYACCOUNT1.MYFG,MYORG.MYACCOUNT2.MYFG  | MYORG             | MYACCOUNT1LOCATOR | 10 MINUTE            | NULL            |                               | MYROLE     | false                             |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+------------+-----------------------------------+
| AWS_US_WEST_2    | 2021-10-25 19:08:15.209 -0700 | MYACCOUNT2   | MYFG | FAILOVER |         | false      | MYORG.MYACCOUNT1.MYFG |                                             |                           |                                              | MYORG             | MYACCOUNT2LOCATOR | 10 MINUTE            | STARTED         | 2022-03-06 12:10:35.280 -0800 | NULL       | false                             |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+------------+-----------------------------------+
```

---
title: SHOW FEATURE POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-feature-policies.md
section: SQL Commands
---

# SHOW FEATURE POLICIES

Lists the [feature policies](../../developer-guide/native-apps/ui-consumer-feature-policies.md) for which you have access privileges.

See also:
:   [CREATE FEATURE POLICY](create-feature-policy.md) , [ALTER FEATURE POLICY](alter-feature-policy.md), [DESCRIBE FEATURE POLICY](desc-feature-policy.md), [DROP FEATURE POLICY](drop-feature-policy.md)

## Syntax

```sqlsyntax
SHOW FEATURE POLICIES
  [ IN
    {
      ACCOUNT                                        |
      APPLICATION {app_name}                         |
      APPLICATION PACKAGE {app_package_name}         |
      DATABASE {database_name}                       |
      SCHEMA {schema_name}                           |
    }
  ]

SHOW FEATURE POLICIES ON ACCOUNT

SHOW FEATURE POLICIES ON APPLICATION <application_name>
```

## Parameters

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns information about feature policies created in the specified account.

    `APPLICATION app_name`
    :   Returns information about feature policies created in the specified app.

    `APPLICATION PACKAGE app_package_name`
    :   Returns information about feature policies created in the specified application package.

    `DATABASE database_name`
    :   Returns information about feature policies created in the specified database.

    `SCHEMA schema_name`
    :   Returns information about feature policies created in the specified schema.

`ON ACCOUNT`
:   Shows the feature policies that have been applied to the current account.

`ON APPLICATION app_name`
:   Shows the feature policies that have been applied on the specified app. This command also
    displays feature policies that are inherited from those applied on the account.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Feature policy | This privilege is required to use this command. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | The timestamp when the policy was created. |
| `name` | The name of the policy. |
| `database_name` | The name of the database containing the policy. |
| `schema_name` | The name of the schema containing the policy. |
| `kind` | The type of feature policy. Currently, only `FEATURE_POLICY` is supported. |
| `owner` | The role that owns the feature policy. |
| `comment` | A comment containing information about the policy. |
| `owner_role_type` | The type of the role that owns the feature policy. |
| `options` | Currently, always NULL. |

## Examples

The following example lists the feature policies that you have the privileges to view
in the current account:

```sqlexample
SHOW FEATURE POLICIES;
```

The following example lists the feature policies that you have the privileges to view
in an app named `hello_snowflake_app`:

```sqlexample
SHOW FEATURE POLICIES IN APPLICATION hello_snowflake_app;
```

The following example lists the feature policies that have been applied on the current account:

```sqlexample
SHOW FEATURE POLICIES ON ACCOUNT
```

---
title: SHOW FILE FORMATS
source: https://docs.snowflake.com/en/sql-reference/sql/show-file-formats.md
section: SQL Commands
---

# SHOW FILE FORMATS

Lists the file formats for which you have access privileges. This command can be used to list the file formats for a specified
database or schema (or the current database/schema for the session), or your entire account.

See also:
:   [CREATE FILE FORMAT](create-file-format.md) , [DROP FILE FORMAT](drop-file-format.md) , [ALTER FILE FORMAT](alter-file-format.md) , [DESCRIBE FILE FORMAT](desc-file-format.md)

## Syntax

```sqlsyntax
SHOW FILE FORMATS [ LIKE '<pattern>' ]
                  [ IN
                       {
                          ACCOUNT                                         |

                          DATABASE                                        |
                          DATABASE <database_name>                        |

                          SCHEMA                                          |
                          SCHEMA <schema_name>                            |
                          <schema_name>

                          APPLICATION <application_name>                  |
                          APPLICATION PACKAGE <application_package_name>  |
                       }
                  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The command output provides file format properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| format_options | Values of all options for the file format type. Returns the default value for any option that is not explicitly defined. |
| created_on | Date and time when the file format was created. |
| name | Name of the file format. |
| database_name | Database in which the file format is stored. |
| schema_name | Schema in which the file format is stored. |
| type | File format type: CSV, JSON, Avro, ORC, Parquet, or XML. |
| owner | Role that owns the file format. |
| comment | Comment for the file format. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* The output of this command might include objects with names like `SN_TEMP_OBJECT_<n>` (where `<n>` is a number). These are
  temporary objects that are created by the [Snowpark](../../developer-guide/snowpark/index.md) library on behalf of the user.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following examples are all equivalent:

> ```sqlexample
> USE DATABASE testdb;
>
> SHOW FILE FORMATS;
> ```
>
> ```output
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | created_on                      | name      | database_name | schema_name | type | owner        | comment | owner_role_type |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | Wed, 29 Apr 2015 18:59:03 -0700 | MY_FORMAT | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | CSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | VSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | TSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> ```
>
> ```sqlexample
> SHOW FILE FORMATS IN DATABASE testdb;
> ```
>
> ```output
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | created_on                      | name      | database_name | schema_name | type | owner        | comment | owner_role_type |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | Wed, 29 Apr 2015 18:59:03 -0700 | MY_FORMAT | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | CSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | VSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | TSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> ```
>
> ```sqlexample
> SHOW FILE FORMATS IN SCHEMA testdb.public;
> ```
>
> ```output
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | created_on                      | name      | database_name | schema_name | type | owner        | comment | owner_role_type |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> | Wed, 29 Apr 2015 18:59:03 -0700 | MY_FORMAT | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | CSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | VSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> | Mon, 27 Apr 2015 17:49:12 -0700 | TSV       | TESTDB        | PUBLIC      | CSV  | ACCOUNTADMIN |         | ROLE            |
> +---------------------------------+-----------+---------------+-------------+------+--------------+---------+-----------------+
> ```

---
title: SHOW FUNCTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-functions.md
section: SQL Commands
---

# SHOW FUNCTIONS

Lists all functions that you have privileges to access, including built-in, user-defined, and external functions.

For a command that lists only user-defined functions, see [SHOW USER FUNCTIONS](show-user-functions.md).

See also:
:   [SHOW USER FUNCTIONS](show-user-functions.md) , [SHOW EXTERNAL FUNCTIONS](show-external-functions.md) , [SHOW FUNCTIONS IN MODEL](show-functions-in-model.md) , [CREATE FUNCTION](create-function.md) , [DROP FUNCTION](drop-function.md) , [ALTER FUNCTION](alter-function.md) ,
    [DESCRIBE FUNCTION](desc-function.md)

## Syntax

```sqlsyntax
SHOW FUNCTIONS [ LIKE '<pattern>' ]
  [ IN
    {
      ACCOUNT                       |

      CLASS <class_name>            |

      DATABASE                      |
      DATABASE <database_name>      |

      SCHEMA                        |
      SCHEMA <schema_name>          |
      <schema_name>
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `CLASS class_name`
    :   Returns records for the specified class (`class_name`).

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully-qualified `schema_name` (e.g. `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The command output provides function properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp at which the function was created. |
| `name` | Name of the function. |
| `schema_name` | Name of the schema that the function exists in. NULL for built-in functions. |
| `is_builtin` | `Y` if the function is a built-in function; `N` otherwise. |
| `is_aggregate` | `Y` if the function is an aggregate function; `N` otherwise. |
| `is_ansi` | `Y` if the function is defined as part of the ANSI SQL standard; `N` otherwise. |
| `min_num_arguments` | Minimum number of arguments. |
| `max_num_arguments` | Maximum number of arguments. |
| `arguments` | Shows the data types of the arguments and of the return value. |
| `description` | Description of the function. |
| `catalog_name` | Name of the database that the function exists in. NULL for built-in functions. |
| `is_table_function` | `Y` if the function is a table function; `N` otherwise. |
| `valid_for_clustering` | `Y` if the function can be used in a CLUSTER BY expression; `N` otherwise. |
| `is_secure` | `Y` if the function is a secure function; `N` otherwise. |
| `is_external_function` | `Y` if the function is an external function; `N` otherwise. |
| `language` | * For built-in functions, this column shows `SQL`. * For user-defined functions, this column shows the language in which the function was written, such as `JAVASCRIPT` or `SQL`. See [SHOW USER FUNCTIONS](show-user-functions.md). * For external functions, this column shows `EXTERNAL`. |
| `is_memoizable` | `Y` if the function is memoizable; `N` otherwise. |
| `is_data_metric` | `Y` if the function is a DMF; `N` otherwise. |

## Usage notes

* If you specify `CLASS`, the command only returns the following columns:

  ```output
  | name | min_num_arguments | max_num_arguments | arguments | descriptions | language |
  ```

* The output of this command might include objects with names like `SN_TEMP_OBJECT_<n>` (where `<n>` is a number). These are
  temporary objects that are created by the [Snowpark](../../developer-guide/snowpark/index.md) library on behalf of the user.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all functions:

```sqlexample
SHOW FUNCTIONS;
```

Show only functions matching the specified regular expression:

```sqlexample
SHOW FUNCTIONS LIKE 'SQUARE';
```

```output
------------+--------+-------------+------------+--------------+---------+-------------------+-------------------+----------------------------------------------------------------------+------------------------------------------------------------+----------+---------------+----------------+
 created_on | name   | schema_name | is_builtin | is_aggregate | is_ansi | min_num_arguments | max_num_arguments |                               arguments                              |                      description                           | language | is_memoizable | is_data_metric |
------------+--------+-------------+------------+--------------+---------+-------------------+-------------------+----------------------------------------------------------------------+------------------------------------------------------------+----------+---------------+----------------+
            | SQUARE |             | Y          | N            | Y       | 1                 | 1                 | SQUARE(NUMBER(38,0)) RETURN NUMBER(38,0), SQUARE(FLOAT) RETURN FLOAT | Compute the square of the input expression.                | SQL      | N             | N              |
------------+--------+-------------+------------+--------------+---------+-------------------+-------------------+----------------------------------------------------------------------+------------------------------------------------------------+----------+---------------+----------------+
```

---
title: SHOW FUNCTIONS IN MODEL
source: https://docs.snowflake.com/en/sql-reference/sql/show-functions-in-model.md
section: SQL Commands
---

# SHOW FUNCTIONS IN MODEL

Lists functions defined in machine learning models.

For more information, see [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md).

See also:
:   [SHOW FUNCTIONS](show-functions.md) , [SHOW MODELS](show-models.md) , [SHOW VERSIONS IN MODEL](show-versions-in-model.md)

## Syntax

```sqlsyntax
SHOW FUNCTIONS [ LIKE '<pattern>' ] IN MODEL <model_name>
               [ VERSION <version_name> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`MODEL model_name`, . `MODEL model_name VERSION version_name`
:   Returns records for the specified version (`version_name`) of the specified machine learning model (`model_name`).

    If a version is not specified, records are displayed for the model’s default version.

## Output

The SHOW FUNCTIONS IN MODEL command output provides function properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | The timestamp at which the function was created. |
| `name` | The function’s name. |
| `version_name` | The name of the model version that the function belongs to. |
| `min_num_arguments` | The minimum number of arguments to the function. |
| `max_num_arguments` | The maximum number of arguments to the function. |
| `arguments` | The data types of the arguments as a JSON-formatted string. |
| `return_type` | The data type of the return value. |
| `description` | Description of the function. |
| `language` | The language in which the function was written, such as “PYTHON”. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW GATEWAYS
source: https://docs.snowflake.com/en/sql-reference/sql/show-gateways.md
section: SQL Commands
---

# SHOW GATEWAYS

Lists the [gateway](../../developer-guide/snowpark-container-services/gateway.md) for which you have access privileges.

See also:
:   [CREATE GATEWAY](create-gateway.md) , [ALTER GATEWAY](alter-gateway.md), [DROP GATEWAY](drop-gateway.md) , [DESCRIBE GATEWAY](desc-gateway.md)

## Syntax

```sqlsyntax
SHOW GATEWAYS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>
                }
           ]
           [ STARTS WITH '<name_string>' ]
           [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The command output provides gateway properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the gateway was created. |
| `name` | Gateway name. |
| `database_name` | Database in which the gateway is created. |
| `schema_name` | Schema in which the gateway is created. |
| `owner` | Role that owns the gateway. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `gateway_type` | The type of gateway. |
| `comment` | Gateway related comment. |

> **Note:**
>
> Only gateways on which the role used has USAGE, MODIFY, or OWNERSHIP privilege will be shown.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE, MODIFY, or OWNERSHIP | Gateway | Only gateways on which the role has one of these privileges will be shown. |
| USAGE | Database | Required on the database containing the gateways. |
| USAGE | Schema | Required on the schema containing the gateways. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists gateways in a specific schema:

```sqlexample
SHOW GATEWAYS IN SCHEMA db.schema;
```

The following example lists gateways in the current database and schema for the session:

```sqlexample
SHOW GATEWAYS;
```

The following example lists gateways with names containing “split”:

```sqlexample
SHOW GATEWAYS LIKE '%split%';
```

The following example lists one gateway:

```sqlexample
SHOW GATEWAYS LIMIT 1;
```

---
title: SHOW GIT BRANCHES
source: https://docs.snowflake.com/en/sql-reference/sql/show-git-branches.md
section: SQL Commands
---

# SHOW GIT BRANCHES

Lists the branches in the specified Snowflake Git repository clone.

See also:
:   [SHOW GIT TAGS](show-git-tags.md), [CREATE GIT REPOSITORY](create-git-repository.md), [SHOW GIT REPOSITORIES](show-git-repositories.md)

## Syntax

```sqlsyntax
SHOW GIT BRANCHES [ LIKE '<pattern>' ] IN [ GIT REPOSITORY ] <repository_name>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN [ GIT REPOSITORY ] repository_name`
:   Specifies the Git repository clone containing the branches to show.

## Output

The command output provides Git branches properties in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the branch. |
| `path` | Path of the branch. |
| `checkouts` | Currently shows no data. |
| `commit_hash` | Commit hash of the branch. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| READ | Git repository | Git repository clone containing the branches to show |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists branches in the Git repository clone `snowflake_extensions`.

```sqlexample
SHOW GIT BRANCHES IN snowflake_extensions;
```

The preceding command generates output such as the following:

```output
--------------------------------------------------------------------------------
| name | path           | checkouts | commit_hash                              |
--------------------------------------------------------------------------------
| main | /branches/main |           | 0f81b1487dfc822df9f73ac6b3096b9ea9e42d69 |
--------------------------------------------------------------------------------
```

---
title: SHOW GIT REPOSITORIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-git-repositories.md
section: SQL Commands
---

# SHOW GIT REPOSITORIES

Lists the [Git repository clones](../../developer-guide/git/git-overview.md) that you have privileges to access.

The [SHOW STAGES](show-stages.md) command also lists Snowflake Git repositories. In the SHOW STAGES output, a Snowflake Git
repository has the value `GIT REPOSITORY` in its `type` column.

See also:
:   [ALTER GIT REPOSITORY](alter-git-repository.md), [CREATE GIT REPOSITORY](create-git-repository.md), [DESCRIBE GIT REPOSITORY](desc-git-repository.md), [DROP GIT REPOSITORY](drop-git-repository.md),
    [SHOW GIT BRANCHES](show-git-branches.md), [SHOW GIT TAGS](show-git-tags.md)

## Syntax

```sqlsyntax
SHOW GIT REPOSITORIES [ LIKE '<pattern>' ]
  [ IN
      {
        ACCOUNT                  |

        DATABASE                 |
        DATABASE <database_name> |

        SCHEMA                   |
        SCHEMA <schema_name>     |
        <schema_name>
      }
  ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The command output provides Git repository clone properties in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date the Git repository clone was created. |
| `name` | Name of the Git repository clone. |
| `database_name` | Name of the database containing this Git repository clone. |
| `schema_name` | Name of the schema containing this Git repository clone. |
| `origin` | URL of the remote Git repository’s origin. |
| `api_integration` | Name of the API integration included in this Git repository clone. |
| `git_credentials` | Name of the secret object in this Git repository clone. |
| `owner` | Role used when this Git repository clone was created. |
| `owner_role_type` | Type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `comment` | Comment specified when this Git repository clone was created. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Git repository | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Schema |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists repositories in the current schema.

```sqlexample
SHOW GIT REPOSITORIES;
```

The preceding command generates output such as the following:

```output
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| CREATED_ON                    | NAME                 | DATABASE_NAME | SCHEMA_NAME | ORIGIN                                                  | API_INTEGRATION     | GIT_CREDENTIALS              | OWNER        | OWNER_ROLE_TYPE | COMMENT |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-06-28 08:46:10.886 -0700 | SNOWFLAKE_EXTENSIONS | MY_DB         | MAIN        | https://github.com/my-account/snowflake-extensions.git  | GIT_API_INTEGRATION | MY_DB.MAIN.EXTENSIONS_SECRET | ACCOUNTADMIN | ROLE            |         |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
| 2023-06-28 08:46:10.886 -0700 | SNOWFLAKE_AI         | MY_DB         | MAIN        | https://github.com/my-account/snowflake-AI.git          | GIT_API_INTEGRATION | MY_DB.MAIN.AI_SECRET         | ACCOUNTADMIN | ROLE            |         |
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```

---
title: SHOW GIT TAGS
source: https://docs.snowflake.com/en/sql-reference/sql/show-git-tags.md
section: SQL Commands
---

# SHOW GIT TAGS

Lists the tags in the specified Snowflake [Git repository clone](../../developer-guide/git/git-overview.md).

See also:
:   [ALTER GIT REPOSITORY](alter-git-repository.md), [CREATE GIT REPOSITORY](create-git-repository.md), [DESCRIBE GIT REPOSITORY](desc-git-repository.md), [DROP GIT REPOSITORY](drop-git-repository.md),
    [SHOW GIT BRANCHES](show-git-branches.md), [SHOW GIT REPOSITORIES](show-git-repositories.md)

## Syntax

```sqlsyntax
SHOW GIT TAGS [ LIKE '<pattern>' ] IN [ GIT REPOSITORY ] <repository_name>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN [ GIT REPOSITORY ] repository_name`
:   Specifies the Git repository clone containing the tags to show.

## Output

The command output provides Git tags properties in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the tag. |
| `path` | Path of the tag. |
| `commit_hash` | Commit hash of the tag. |
| `author` | Author of the tag. |
| `message` | Commit message for the tag. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| READ | Git repository | Git repository clone containing the tags to show |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists tags in the Git repository clone `snowflake_extensions`.

```sqlexample
SHOW GIT TAGS IN snowflake_extensions;
```

The preceding command generates output such as the following:

```output
-----------------------------------------------------------------------------------------------------------------------------------------------
| name    | path          | commit_hash                              | author                                     | message                   |
-----------------------------------------------------------------------------------------------------------------------------------------------
| example | /tags/example | 16e262d401297cd097d5d6c266c80ff9f7e1e4be | Gladys Kravits (gladyskravits@example.com) | Example code for preview. |
-----------------------------------------------------------------------------------------------------------------------------------------------
```

---
title: SHOW GLOBAL ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-global-accounts.md
section: SQL Commands
---

# SHOW GLOBAL ACCOUNTS

Lists all the accounts in your organization that are enabled for replication and indicates the Snowflake Region in which each account
is located.

Currently, linking accounts in your organization for replication requires assistance from Snowflake Support.

See also:
:   [SHOW REPLICATION DATABASES](show-replication-databases.md)

## Syntax

```sqlsyntax
SHOW GLOBAL ACCOUNTS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The command output provides global account properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `region_group` | Region group where the account is located. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time when the account was created. |
| `name` | Name of the account. |
| `comment` | Comment for the account. |
| `is_org_admin` | Indicates whether the ORGADMIN role is enabled in an account. If TRUE, the role is enabled. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the global accounts whose name starts with `myaccount`:

```sqlexample
SHOW GLOBAL ACCOUNTS LIKE 'myaccount%';
```

---
title: SHOW GRANTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-grants.md
section: SQL Commands
---

# SHOW GRANTS

Lists all access control privileges that have been explicitly granted to roles, users, and shares.

For more information about privileges and roles, refer to [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about shares, refer to [About Secure Data Sharing](../../user-guide/data-sharing-intro.md).

> **Note:**
>
> SHOW GRANTS is a special variation that uses different syntax from all the other [SHOW <objects>](show.md) commands.

## Syntax

```sqlsyntax
SHOW GRANTS [ LIMIT <rows> ]

SHOW GRANTS ON ACCOUNT [ LIMIT <rows> ]

SHOW GRANTS ON <object_type> <object_name> [ LIMIT <rows> ]

SHOW GRANTS TO {
  APPLICATION <app_name>
  | APPLICATION ROLE [ <app_name>. ]<app_role_name>
  | SERVICE ROLE <service_name>!<service_role_name>
  | <class_name> ROLE <instance_name>!<instance_role_name>
  | ROLE <role_name>
  | SHARE <share_name> [ IN APPLICATION PACKAGE <app_package_name> ]
  | USER <user_name>
} [ LIMIT <rows> ]

SHOW GRANTS OF {
  APPLICATION ROLE <app_role_name>
  | SERVICE ROLE <service_name>!<service_role_name>
  | ROLE <role_name>
} [ LIMIT <rows> ]

SHOW GRANTS OF SHARE <share_name> [ LIMIT <rows> ]

SHOW FUTURE GRANTS IN SCHEMA { <schema_name> } [ LIMIT <rows> ]

SHOW FUTURE GRANTS IN DATABASE { <database_name> } [ LIMIT <rows> ]

SHOW FUTURE GRANTS TO ROLE <role_name> [ LIMIT <rows> ]

SHOW FUTURE GRANTS TO DATABASE ROLE <database_role_name>
```

## Variants

`SHOW GRANTS`
:   Syntactically equivalent to `SHOW GRANTS TO USER current_user`. Lists all the roles granted to the current user.

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

`SHOW GRANTS ON ...`
:   `ACCOUNT`
    :   Lists all the account-level (i.e. global) privileges that have been granted to roles.

    `object_type object_name`
    :   Lists all privileges that have been granted on the object.

        For database roles, you can use the fully qualified name, `database_name.database_role_name`, or the relative name,
        `database_role_name`. If you use the relative name for the database role, Snowflake uses the database in session to resolve the
        relative name of the database role.

`SHOW GRANTS TO ...`
:   `APPLICATION app_name`
    :   Lists all the privileges and roles granted to the application.

    `APPLICATION ROLE [ app_name. ]app_role_name`
    :   Lists all the privileges and roles granted to the application role.

        The name of the application, `app_name`, is optional. If not specified, Snowflake uses the current application. If the
        application is not a database, this command does not return results.

    `SERVICE ROLE service_name!service_role_name`
    :   Lists the service endpoints for which the service role is granted the USAGE privilege.

    `class_name ROLE instance_name!instance_role_name`
    :   Lists all the privileges and roles granted to the [instance role](../snowflake-db-classes.md).

        If the database and schema that contains the `class_name` is not [in use](use.md) or is not specified in
        your [search path](../snowflake-db-classes.md), specify the fully-qualified name of the class. For example,
        `SNOWFLAKE.CORE.BUDGET`.

        For details, see the instance role example.

    `ROLE role_name`
    :   Lists all privileges and roles granted to the role. If the role has a grant on a temporary object, then the grant only exists in the
        session that the temporary object was created.

        SHOW GRANTS TO ROLE PUBLIC exposes the following *irrevocable* database roles granted to the public role:

        * ALERT_VIEWER
        * CLASSIFICATION_VIEWER
        * CORE_VIEWER
        * DATA_PRIVACY_VIEWER
        * ML_USER
        * MONITORING_VIEWER
        * NOTIFICATION_VIEWER
        * SNOWFLAKE_TEMPLATE_SNOWGIT_VIEWER
        * SPCS_REGISTRY_VIEWER

    `SHARE share_name`
    :   Lists all the privileges granted to the share.

    `SHARE share_name IN APPLICATION PACKAGE app_package_name`
    :   Lists all of the privileges and roles granted to a share in the application package.

    `USER user_name`
    :   Lists all the roles granted to the user. Note that the PUBLIC role, which is automatically available to every user, is not listed.

`SHOW GRANTS OF...`
:   `APPLICATION ROLE [ app_name. ]app_role`
    :   Lists all the users and roles to which the application role has been granted.

        The name of the application, `app_name`, is optional. If not specified, Snowflake uses the current application. If the
        application is not a database, this command does not return results.

    `SERVICE ROLE service_name!service_role_name`
    :   Lists all the users and roles to which the service role has been granted.

    `ROLE role_name`
    :   Lists all users and roles to which the role has been granted.

    `SHARE share_name`
    :   Lists all the accounts that are consuming the share. Accounts that have not yet consumed the share are excluded.
        To see all accounts that have been added to a share, query the SNOWFLAKE.ACCOUNT_USAGE.SHARES view.

`SHOW FUTURE GRANTS IN ...`
:   `SCHEMA database_name.schema_name`
    :   Lists all privileges on new (i.e. future) objects of a specified type in the schema granted to a role. `database_name.` specifies the database in which the schema resides and is optional when querying a schema in the current database.

    `DATABASE database_name`
    :   Lists all privileges on new (i.e. future) objects of a specified type in the database granted to a role.

`SHOW FUTURE GRANTS TO ROLE role_name`
:   Lists all privileges on new (i.e. future) objects of a specified type in a database or schema granted to the role.

`SHOW FUTURE GRANTS TO DATABASE ROLE database_role_name`
:   Lists all privileges on new (i.e. future) objects of a specified type in a database or schema granted to the database role.

    A shared database role does not support future grants. For details, see the usage notes in the [GRANT DATABASE ROLE … TO SHARE](grant-database-role-share.md) command.

## Usage notes

* The `granted_by` column indicates the role that authorized a privilege grant to the grantee. The authorization role is known as the
  *grantor*.

  When you grant privileges on an object to a role using [GRANT <privileges> … TO ROLE](grant-privilege.md), the following authorization rules
  determine which role is listed as the grantor of the privilege:

  1. If an [active role](../../user-guide/security-access-control-overview.md) is the object owner (i.e. has the OWNERSHIP privilege on the
     object), that role is the grantor.
  2. If an active role holds the specified permission with the grant option authorized (i.e., the privilege was granted to the active role
     with the GRANT *<privileges>* … TO ROLE *<role_name>* WITH GRANT OPTION, where *<role_name>* is one of the active roles). If so, the
     role that holds the privilege with the grant option authorized is the grantor role. Note that if multiple active roles meet this
     criterion, it is non-deterministic which of the roles becomes the grantor role.
  3. If an active role holds the global MANAGE GRANTS privilege, the grantor role is the object owner, not the role that held the
     MANAGE GRANTS privilege. That is, the MANAGE GRANTS privilege allows a role to impersonate the object owner for the purposes of
     granting privileges on that object.

  If the `granted_by` column is empty, the privilege was granted by the Snowflake SYSTEM role. Certain internal operations are
  performed with this role. Grants of privileges authorized by the SYSTEM role cannot be modified by customers.
* When using the SHOW GRANTS … TO SHARE IN APPLICATION PACKAGE syntax:

  + The `grantee_name` column specifies the name of the application package.
  + The `granted_to` column specifies `APPLICATION PACKAGE SHARE`.
* The `granted_by_role_type` column specifies the type of grantor role that performed the grant: `ROLE`, `DATABASE_ROLE`, or
  `APPLICATION_ROLE`. This column only appears in the output when using the SHOW GRANTS ON syntax.
* A data sharing consumer can only view the privileges on objects that are [granted to the share](grant-privilege-share.md), such as
  SELECT on a table. Depending on how the grants are set up, the output of a SHOW GRANTS command that is run by the consumer might show
  empty values for shared objects in the following columns: `granted_to`, `grantee_name`, `granted_by_role_type`, and
  `granted_by`. For example:

  + If an account role owns the shared object, the consumer cannot view the OWNERSHIP privilege on shared objects because the consumer
    cannot access (resolve) the role that owns the object (account roles are not shared).
  + If a database role owns the shared object and the provider shares the database role, the consumer can view the OWNERSHIP privilege on
    the shared object because they can resolve the shared database role.
* The `grant_options` column returns `FALSE` when you run a SHOW GRANTS ON <object_type> <object_name> command for an object in the
  managed access schema.
* The `privilege` column includes the OWNERSHIP and MANAGE GRANTS privileges for the role that owns the managed access schema when
  you run a SHOW GRANTS ON SCHEMA <managed_access_schema> command.
* With database roles and the SHOW FUTURE GRANTS TO DATABASE ROLE syntax, the command returns results for database roles that are not
  granted to a share.

  In the data sharing consumer account, this command does not return any rows when a shared database role is granted future
  privileges. However, depending on your account and the timing of support for future privileges to database roles in this command, you
  might see this error message:

  > ```output
  > Invalid state of the shared database role. Please revoke the future grants to the shared database role.
  > ```
  >
  > As the consumer, ask the provider to revoke the future grants from the shared database role.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List all privileges that have been granted on the `sales` database:

```sqlexample
SHOW GRANTS ON DATABASE sales;
```

```output
+---------------------------------+-----------+------------+------------+------------+--------------+--------------+----------------------+--------------+
| created_on                      | privilege | granted_on | name       | granted_to | grantee_name | grant_option | granted_by_role_type | granted_by   |
+---------------------------------+-----------+------------+------------+------------+--------------+--------------+----------------------+--------------+
| Thu, 07 Jul 2016 05:22:29 -0700 | OWNERSHIP | DATABASE   | REALESTATE | ROLE       | ACCOUNTADMIN | true         | ROLE                 | ACCOUNTADMIN |
| Thu, 07 Jul 2016 12:14:12 -0700 | USAGE     | DATABASE   | REALESTATE | ROLE       | PUBLIC       | false        | ROLE                 | ACCOUNTADMIN |
+---------------------------------+-----------+------------+------------+------------+--------------+--------------+----------------------+--------------+
```

List all privileges granted to the `analyst` role:

```sqlexample
SHOW GRANTS TO ROLE analyst;
```

```output
+---------------------------------+------------------+------------+------------+------------+--------------+--------------+------------+
| created_on                      | privilege        | granted_on | name       | granted_to | grantee_name | grant_option | granted_by |
|---------------------------------+------------------+------------+------------+------------+--------------+--------------+------------+
| Wed, 17 Dec 2014 18:19:37 -0800 | CREATE WAREHOUSE | ACCOUNT    | DEMOENV    | ROLE       |  ANALYST     | false        | SYSADMIN   |
+---------------------------------+------------------+------------+------------+------------+--------------+--------------+------------+
```

List all privileges granted to the `public` role:

```sqlexample
SHOW GRANTS TO ROLE public;
```

(example trimmed to show only the irrevocable database roles granted to the public role)

```output
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| created_on                      | privilege | granted_on    | name                              | granted_to | grantee_name | grant_option | granted_by |
|---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------|
| ...                             |           |               |                                   |            |              |              |            |
|---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | ALERT_VIEWER                      | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | CLASSIFICATION_VIEWER             | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | CORE_VIEWER                       | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | DATA_PRIVACY_VIEWER               | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+-------- -----+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | ML_USER                           | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | MONITORING_VIEWER                 | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | NOTIFICATION_VIEWER               | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | SNOWFLAKE_TEMPLATE_SNOWGIT_VIEWER | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| 2023-08-18 13:33:01.156 -0700   | USAGE     | DATABASE_ROLE | SPCS_REGISTRY_VIEWER              | ROLE       | PUBLIC       | false        |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
| ...                             |           |               |                                   |            |              |              |            |
+---------------------------------+-----------+---------------+-----------------------------------+------------+--------------+--------------+------------+
```

List all the roles granted to the `user1` user:

```sqlexample
SHOW GRANTS TO USER user1;
```

```output
+-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+--------------+---------------+
| created_on                    | privilege | granted_on | name                      |  role     | granted_to | grantee_name | grant_option | granted_by    |
|-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+------------------------------|
| 2025-05-07 09:08:43.773 -0800 | USAGE     | DATABASE   | test_db                   | null      | USER       | user1        | false        | SECURITYADMIN |
| 2025-05-07 09:08:55.253 -0800 | USAGE     | SCHEMA     | test_db.test_sch          | null      | USER       | user1        | false        | SECURITYADMIN |
| 2025-05-07 09:08:55.253 -0800 | SELECT    | TABLE      | test_db.test_sch.test_tbl | null      | USER       | user1        | false        | SECURITYADMIN |
| 2025-05-07 09:08:34.838 -0800 | USAGE     | WAREHOUSE  | test_wh                   | null      | USER       | user1        | false        | SECURITYADMIN |
+-------------------------------+-----------+------------+---------------------------+-----------+------------+--------------+--------------+---------------+
```

Show all privileges granted on an interactive table:

```sqlexample
SHOW GRANTS ON TABLE my_interactive_tbl;
```

```output
+-------------------------------+------------+-------------------+----------------------------------+------------+--------------+--------------+--------------+----------------------+
| created_on                    | privilege  | granted_on        | name                             | granted_to | grantee_name | grant_option | granted_by   | granted_by_role_type |
|-------------------------------+------------+-------------------+----------------------------------+------------+--------------+--------------+--------------+----------------------|
| 2025-11-06 22:41:29.679 +0000 | OWNERSHIP  | INTERACTIVE_TABLE | MYDB.MYSCHEMA.MY_INTERACTIVE_TBL | ROLE       | ACCOUNTADMIN | true         | ACCOUNTADMIN | ROLE                 |
| 2025-11-06 22:41:30.794 +0000 | REFERENCES | INTERACTIVE_TABLE | MYDB.MYSCHEMA.MY_INTERACTIVE_TBL | ROLE       | ANALYST      | false        | ACCOUNTADMIN | ROLE                 |
| 2025-11-06 22:41:30.564 +0000 | SELECT     | INTERACTIVE_TABLE | MYDB.MYSCHEMA.MY_INTERACTIVE_TBL | USER       | USER1        | false        | ACCOUNTADMIN | ROLE                 |
+-------------------------------+------------+-------------------+----------------------------------+------------+--------------+--------------+--------------+----------------------+
```

List all roles and users who have been granted the `analyst` role:

```sqlexample
SHOW GRANTS OF ROLE analyst;
```

```output
+---------------------------------+---------+------------+--------------+---------------+
| created_on                      | role    | granted_to | grantee_name | granted_by    |
|---------------------------------+---------+------------+--------------+---------------|
| Tue, 05 Jul 2016 16:16:34 -0700 | ANALYST | ROLE       | ANALYST_US   | SECURITYADMIN |
| Tue, 05 Jul 2016 16:16:34 -0700 | ANALYST | ROLE       | DBA          | SECURITYADMIN |
| Fri, 08 Jul 2016 10:21:30 -0700 | ANALYST | USER       | JOESM        | SECURITYADMIN |
+---------------------------------+---------+------------+--------------+---------------+
```

List all privileges granted on future objects in the `sales.public` schema:

```sqlexample
SHOW FUTURE GRANTS IN SCHEMA sales.public;
```

```output
+-------------------------------+-----------+----------+---------------------------+----------+-----------------------+--------------+
| created_on                    | privilege | grant_on | name                      | grant_to | grantee_name          | grant_option |
|-------------------------------+-----------+----------+---------------------------+----------+-----------------------+--------------|
| 2018-12-21 09:22:26.946 -0800 | INSERT    | TABLE    | SALES.PUBLIC.<TABLE>      | ROLE     | ROLE1                 | false        |
| 2018-12-21 09:22:26.946 -0800 | SELECT    | TABLE    | SALES.PUBLIC.<TABLE>      | ROLE     | ROLE1                 | false        |
+-------------------------------+-----------+----------+---------------------------+----------+-----------------------+--------------+
```

List all roles privileges granted to the instance role named `cost.budgets.my_budget!ADMIN`:

```sqlexample
SHOW GRANTS TO SNOWFLAKE.CORE.BUDGET ROLE cost.budgets.my_budget!ADMIN;
```

```output
+-------------------------------+-----------+------------+----------------------------------------------------------------------------------------------------------------------------------------+
| created_on                    | privilege | granted_on | name                                                                                                                                   |
+-------------------------------+-----------+------------+----------------------------------------------------------------------------------------------------------------------------------------+
| 2023-10-31 15:57:41.489 +0000 | USAGE     | ROLE       | SNOWFLAKE.CORE.BUDGET!ADMIN                                                                                                            |
| 2023-09-25 22:56:12.798 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!ACTIVATE():VARCHAR(16777216)                                                                                     |
| 2023-09-25 22:56:13.304 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!ADD_RESOURCE(TARGET_REF VARCHAR):VARCHAR(16777216)                                                               |
| 2023-09-25 22:56:12.863 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_ACTIVATION_DATE():DATE                                                                                       |
| 2023-09-25 22:56:12.412 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_BUDGET_NAME():VARCHAR(16777216)                                                                              |
| 2023-09-25 22:56:11.510 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_CONFIG():TABLE: ()                                                                                           |
| 2023-09-25 22:56:13.432 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_LINKED_RESOURCES():TABLE: ()                                                                                 |
| 2023-09-25 22:56:11.582 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_MEASUREMENT_TABLE():TABLE: ()                                                                                |
| 2023-09-25 22:56:12.153 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_NOTIFICATION_EMAIL():VARCHAR(16777216)                                                                       |
| 2023-09-25 22:56:12.016 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_NOTIFICATION_INTEGRATION_NAME():VARCHAR(16777216)                                                            |
| 2023-09-25 22:56:12.286 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_NOTIFICATION_MUTE_FLAG():VARCHAR(16777216)                                                                   |
| 2023-09-25 22:56:13.068 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_SERVICE_TYPE_USAGE(SERVICE_TYPE VARCHAR):TABLE: ()                                                           |
| 2023-09-25 22:56:13.245 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_SERVICE_TYPE_USAGE(SERVICE_TYPE VARCHAR, TIME_DEPART VARCHAR, USER_TIMEZONE VARCHAR, TIME_LOWER_BOUND VARCHA |
| 2023-09-25 22:56:12.595 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_SPENDING_HISTORY():TABLE: ()                                                                                 |
| 2023-09-25 22:56:12.732 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_SPENDING_HISTORY(TIME_LOWER_BOUND VARCHAR, TIME_UPPER_BOUND VARCHAR):TABLE: ()                               |
| 2023-09-25 22:56:11.716 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!GET_SPENDING_LIMIT():NUMBER(38,0)                                                                                |
| 2023-09-25 22:56:13.367 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!REMOVE_RESOURCE(TARGET_REF VARCHAR):VARCHAR(16777216)                                                            |
| 2023-09-25 22:56:11.856 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!SET_EMAIL_NOTIFICATIONS(NOTIFICATION_CHANNEL_NAME VARCHAR, EMAIL VARCHAR):VARCHAR(16777216)                      |
| 2023-09-25 22:56:12.349 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!SET_NOTIFICATION_MUTE_FLAG(USER_MUTE_FLAG BOOLEAN):VARCHAR(16777216)                                             |
| 2023-09-25 22:56:11.780 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!SET_SPENDING_LIMIT(SPENDING_LIMIT FLOAT):VARCHAR(16777216)                                                       |
| 2023-09-25 22:56:12.475 +0000 | USAGE     | PROCEDURE  | SNOWFLAKE.CORE.BUDGET!SET_TASK_SCHEDULE(NEW_SCHEDULE VARCHAR):VARCHAR(16777216)                                                        |
+-------------------------------+-----------+------------+----------------------------------------------------------------------------------------------------------------------------------------+
```

---
title: SHOW GRANTS IN DCM PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/show-grants-in-dcm-project.md
section: SQL Commands
---

# SHOW GRANTS IN DCM PROJECT

`SHOW GRANTS IN DCM PROJECT` lists all grants deployed and managed by the specified [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md).

`SHOW FUTURE GRANTS IN DCM PROJECT` lists all grants that will be deployed and managed by the specified [DCM project](../../user-guide/dcm-projects/dcm-projects-overview.md) when the next deployment is executed.

The command returns grant metadata and properties, ordered by creation date.

See also:
:   [CREATE DCM PROJECT](create-dcm-project.md) , [ALTER DCM PROJECT](alter-dcm-project.md), [DESCRIBE DCM PROJECT](desc-dcm-project.md) , [DROP DCM PROJECT](drop-dcm-project.md), [EXECUTE DCM PROJECT](execute-dcm-project.md), [SHOW DCM PROJECTS](show-dcm-projects.md)

## Syntax

```sqlsyntax
SHOW GRANTS IN DCM PROJECT <name> [ LIMIT <rows> ]

SHOW FUTURE GRANTS IN DCM PROJECT <name> [ LIMIT <rows> ]
```

## Required parameters

`IN DCM PROJECT name`
:   Specifies the identifier of the DCM project that contains the deployments to list.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Optional parameters

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

SHOW GRANTS IN DCM PROJECT returns the following output, including one row per grant:

* `CREATED_ON`
* `PRIVILEGE`
* `GRANTED_ON` - object type, such as DATABASE, TABLE, ROLE
* `NAME` - object name
* `GRANTED_TO` - such as ROLE, DATABASE ROLE, SHARE
* `GRANTEE_NAME`
* `GRANT_OPTION`
* `GRANTED_BY`
* `GRANTED_BY_ROLE_TYPE`

---
title: SHOW HYBRID TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-hybrid-tables.md
section: SQL Commands
---

# SHOW HYBRID TABLES

Lists the [hybrid tables](../../user-guide/tables-hybrid.md) for which you have access privileges.

The command can be used to list hybrid tables for the current/specified database or schema, or across your entire account.

This command returns different output columns than [SHOW TABLES](show-tables.md).

The output returns hybrid table metadata and properties, ordered lexicographically by database, schema, and the name of the
hybrid table (see Output in this topic for descriptions of the output columns). This is important to note if you wish to
filter the results using the provided filters.

Note that this topic refers to hybrid tables as simply “tables” except where specifying *hybrid tables* avoids confusion.

See also:
:   [CREATE INDEX](create-index.md), [DROP INDEX](drop-index.md) , [SHOW INDEXES](show-indexes.md) , [CREATE HYBRID TABLE](create-hybrid-table.md) , [DROP TABLE](drop-table.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] [ HYBRID ] TABLES [ LIKE '<pattern>' ]
                                 [ IN { ACCOUNT | DATABASE [ <db_name> ] | SCHEMA [ <schema_name> ] } ]
                                 [ STARTS WITH '<name_string>' ]
                                 [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`

      The `kind` column value is always HYBRID TABLE.
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`HYBRID`
:   Returns hybrid tables only.

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN  ACCOUNT | DATABASE [ db_name ] | SCHEMA [ schema_name ]`
:   Optionally specifies the scope of the command, which determines whether the command lists records only for the
    current/specified database or schema, or across your entire account.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default. The command returns the objects you have privileges to view in the current
      database.
    * No database: `ACCOUNT` is the default. The command returns the objects you have privileges to view in your account.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* If an account (or database or schema) has a large number of hybrid tables, then searching the entire account (or database or
  schema) can consume a significant amount of compute resources.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Output

> **Note:**
>
> The following output schema is for the SHOW HYBRID TABLES command. For information about the output of SHOW TABLES,
> see Identifying hybrid tables with SHOW TABLES (in this topic).

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the table was created. |
| `name` | Name of the table. |
| `database_name` | Database in which the table is stored. |
| `schema_name` | Schema in which the table is stored. |
| `owner` | Role that owns the table. |
| `rows` | Number of rows in the table. |
| `bytes` | Number of bytes that will be scanned if the entire table is scanned in a query. Note that this number may be different from the number of actual physical bytes (that is, bytes stored on-disk) for the table. |
| `comment` | Comment for the table. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

> **Note:**
>
> Numbers in the `rows` and `bytes` columns might not be accurate if data is changing constantly (for example, if new data is being continuously inserted into the hybrid table).
>
> You might see NULL values in these columns after initially creating and loading the table; a background compaction operation runs periodically and updates these statistics. For large loads, the initial compaction itself might take a long time. In the meantime, you can run a `SELECT COUNT(*)` query on the table to get an accurate row count.

## Identifying hybrid tables with SHOW TABLES

The [SHOW TABLES](show-tables.md) command output has a column that indicates whether a table is a hybrid table.

This column appears in addition to the regular SHOW TABLES [output columns](show-tables.md).

The column has the following name and possible values:

| Column Name | Values |
| --- | --- |
| is_hybrid | `Y` if the table is a hybrid table; otherwise, `N`. |

## Examples

Show all the hybrid tables whose name starts with `product_` that you have privileges to view in the `mydb.myschema` schema:

```sqlexample
SHOW HYBRID TABLES LIKE 'product_%' IN mydb.myschema;
```

---
title: SHOW ICEBERG TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-iceberg-tables.md
section: SQL Commands
---

# SHOW ICEBERG TABLES

Lists the [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md) for which you have access privileges.

The command can be used to list Iceberg tables for the current/specified database or schema, or across your entire account.

This command returns different output columns than [SHOW TABLES](show-tables.md).
The output returns Iceberg table metadata and properties, ordered lexicographically by database, schema, and Iceberg table name (see
Output in this topic for descriptions of the output columns). This is important to note if you want to filter the results using the
provided filters.

Note that this topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md) , [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [SHOW TABLES](show-tables.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] [ ICEBERG ] TABLES [ LIKE '<pattern>' ]
                                  [ IN
                                        {
                                          ACCOUNT                  |

                                          DATABASE                 |
                                          DATABASE <database_name> |

                                          SCHEMA                   |
                                          SCHEMA <schema_name>     |
                                          <schema_name>
                                        }
                                  ]
                                  [ STARTS WITH '<name_string>' ]
                                  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`

      The `kind` column value is always ICEBERG TABLE.
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`ICEBERG`
:   Returns Iceberg tables only.

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| SELECT | Iceberg table | To see a particular Iceberg table in the output for SHOW ICEBERG TABLES, a role must have the SELECT privilege on that table. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If an account (or database or schema) has a large number of Iceberg tables, then searching the entire account (or database or schema)
  can consume a significant amount of compute resources.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Output

> **Note:**
>
> The following output schema is for the SHOW ICEBERG TABLES command. For information about the output of SHOW TABLES,
> see Identifying Iceberg Tables with SHOW TABLES (in this topic).

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the table was created. |
| `name` | Name of the table. |
| `database_name` | Database in which the table is stored. |
| `schema_name` | Schema in which the table is stored. |
| `owner` | Role that owns the table. |
| `external_volume_name` | Name of the external volume where the Iceberg table data and metadata are stored. |
| `catalog_name` | Name of the catalog integration object associated with the Iceberg table when the table is not managed by Snowflake. `SNOWFLAKE` when the table is managed by Snowflake. |
| `iceberg_table_type` | Type of Iceberg table. `UNMANAGED` if the table is not managed by Snowflake. `NOT ICEBERG` otherwise. |
| `catalog_table_name` | Name of the table as recognized by the catalog. |
| `catalog_namespace` | For externally managed tables, the namespace that was defined when the table was created. If not defined at the table level, the default namespace associated with the catalog integration used by the table. For Snowflake-managed tables that you sync with Snowflake Open Catalog, this field isn’t required, so the value is `null`. |
| `base_location` | Relative path from the `EXTERNAL_VOLUME` location to the table metadata and data files. |
| `can_write_metadata` | Signifies whether Snowflake can write metadata to the location specified by the `base_location`. |
| `comment` | Comment for the table. |
| `name_mapping` | List of objects with information about table columns that use [column projection](https://iceberg.apache.org/spec/#column-projection). For more information, see name_mapping. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `catalog_sync_name` | Denotes the name of the catalog integration for Snowflake Open Catalog that the Snowflake-managed Apache Iceberg™ table syncs to. If the table doesn’t sync with Snowflake Open Catalog or isn’t managed by Snowflake, the value is `NULL`. |
| `auto_refresh_status` | The automated refresh status for an externally managed Iceberg table. This column displays the same results for the table as the [SYSTEM$AUTO_REFRESH_STATUS](../functions/system_auto_refresh_status.md) function. |
| `partition_specs` | List of objects describing the Apache Iceberg™ partition specifications for the table, as found in the Iceberg metadata file. Includes specification for both Snowflake-managed and externally managed Iceberg tables. For more information, see partition_specs. |
| `current_partition_spec_id` | ID for the partition spec that is currently active for the Iceberg table. This ID corresponds to a value for `spec-id` in `partition_specs`. |

### name_mapping

The `name_mapping` output column provides information about table columns that use [column projection](https://iceberg.apache.org/spec/#column-projection).

If a table doesn’t contain any columns with an associated name mapping, the output column has a value of `[NULL]`. Otherwise, the value is
a list of objects, where each object corresponds to a column that has an associated name mapping (sometimes referred to as a mapped field).
Each object can contain the following three properties:

* `field-id`: The Iceberg field ID.
* `names`: A list of name strings for the field.
* `fields`: A list of field mappings for the child fields of struct, map, or list columns.

For example:

```json
[
  {
    "field-id": 1,
    "names": [
      "id",
      "record_id"
    ]
  },
  {
    "field-id": 2,
    "names": [
      "data"
    ]
  },
  {
    "field-id": 3,
    "names": [
      "location"
    ],
    "fields": [
      {
        "field-id": 4,
        "names": [
          "latitude",
          "lat"
        ]
      },
      {
        "field-id": 5,
        "names": [
          "longitude",
          "long"
        ]
      }
    ]
  }
]
```

> **Note:**
>
> Field IDs can be non-consecutive if a column, or a field in a [structured type](../data-types-structured.md)
> column doesn’t have an associated name mapping.

### partition_specs

Each object in the `partition_specs` column includes a `spec-id`,
followed by the fields for the partition specification. Each field is an OBJECT value with the
following key-value pairs:

* `name`: The name of the partition.
* `transform`: The transformation applied to the source column to generate a partition value. This value determines how data is grouped
  into partitions.
* `source-id`: The identifier of the original table column or field that is used for partitioning.
* `field-id`: The partition field ID. This field identifies a partition field and is unique in a partition specification. However, for
  Iceberg v2 table metadata, the field ID is unique across all partition specifications.

For example:

```json
[ {
    "spec-id" : 0,
    "fields" : [ {
      "name" : "COL1",
      "transform" : "identity",
      "source-id" : 1,
      "field-id" : 1000
      }, {
      "name" : "COL1_trunc_100",
      "transform" : "truncate[100]",
      "source-id" : 1,
      "field-id" : 1001
      }
    ]
} ]
```

The example shows one partition specification; however, a table can have multiple partition specifications.

## Examples

Show all the Iceberg tables whose name starts with `glue` that you have privileges to view in the `tpch.public` schema:

> ```sqlexample
> SHOW ICEBERG TABLES LIKE 'glue%' IN tpch.public;
> ```

## Identifying Iceberg tables with SHOW TABLES

The [SHOW TABLES](show-tables.md) command output has a column that indicates whether a table is an Iceberg table.
This column appears in addition to the regular SHOW TABLES [output columns](show-tables.md).

The column has the following name and possible values:

| Column name | Values |
| --- | --- |
| is_iceberg | `Y` if the table is an Iceberg table; `N` otherwise. |

---
title: SHOW IMAGE REPOSITORIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-image-repositories.md
section: SQL Commands
---

# SHOW IMAGE REPOSITORIES

Lists the [image repositories](../../developer-guide/snowpark-container-services/tutorials/tutorial-1.md) for which you
have access privileges.

You can use this command to list the repositories in the current database and schema for the session, a specified database or
schema, or your entire account.

See also:
:   [CREATE IMAGE REPOSITORY](create-image-repository.md) , [DROP IMAGE REPOSITORY](drop-image-repository.md)

## Syntax

```sqlsyntax
SHOW IMAGE REPOSITORIES [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

Any of the following repository privileges grants the permission to see repositories in the SHOW IMAGE REPOSITORIES output. If you
don’t have any of these privileges, SHOW IMAGE REPOSITORIES will return an empty result.

| Privilege | Object | Notes |
| --- | --- | --- |
| READ | Image repository | To pull an image from a repository, the role requires this permission. |
| WRITE | Image repository | To push an image to a repository, the role requires this permission. |
| OWNERSHIP | Image repository | To create a repository, the role requires this permission. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides repository properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the repository was created. |
| `database_name` | Database in which the repository was created. |
| `schema_name` | Schema in which the repository was created. |
| `repository_url` | URL of the image repository. You need this URL to push (for example, `docker push`) or pull (for example, `docker pull`) images from the repository. |
| `owner` | Role that owns the repository. |
| `owner_role_type` | The type of role that owns the object; either ROLE or DATABASE_ROLE. |
| `comment` | Description for the repository. |
| `encryption` | Encryption type configured for the image repository. |
| `privatelink_repository_url` | URL of the image repository, accessible only via Private Connectivity. The column is returned only for [Business Critical](../../user-guide/intro-editions.md) accounts. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following two examples list repositories in the current database and current schema:

```sqlexample
SHOW IMAGE REPOSITORIES;
```

```sqlexample
SHOW IMAGE REPOSITORIES IN SCHEMA;
```

The following example lists repositories in the current database and the specified schema:

```sqlexample
SHOW IMAGE REPOSITORIES IN SCHEMA sc1;
```

The following example lists repositories in the current database and all schemas:

```sqlexample
SHOW IMAGE REPOSITORIES IN DATABASE;
```

The following example lists repositories in the specified database and all schemas:

```sqlexample
SHOW IMAGE REPOSITORIES IN DATABASE db1;
```

The following example lists repositories in the current account (all databases and all schemas):

```sqlexample
SHOW IMAGE REPOSITORIES IN ACCOUNT;
```

Sample output:

```output
+-------------------------------+---------------------+---------------+-------------+-----------------------------------------------------------------------------------------------------------------------+-----------+-----------------+---------+---------------+--------------------------------------------------------------+
| created_on                    | name                | database_name | schema_name | repository_url                                                                                                        | owner     | owner_role_type | comment | encryption    | privatelink_repository_url                                   |
|-------------------------------+---------------------+---------------+-------------+-----------------------------------------------------------------------------------------------------------------------+-----------+-----------------+---------+---------------|--------------------------------------------------------------+
| 2024-04-18 13:41:53.481 -0700 | TUTORIAL_REPOSITORY | TUTORIAL_DB   | DATA_SCHEMA | orgname-acctname.registry-dev.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository                      | TEST_ROLE | ROLE            |         | SNOWFLAKE_SSE | orgname-acctname.registry.privatelink.snowflakecomputing.com |
+-------------------------------+---------------------+---------------+-------------+-----------------------------------------------------------------------------------------------------------------------+-----------+-----------------+---------+---------------+--------------------------------------------------------------+
```

---
title: SHOW IMAGES IN IMAGE REPOSITORY
source: https://docs.snowflake.com/en/sql-reference/sql/show-images-in-image-repository.md
section: SQL Commands
---

# SHOW IMAGES IN IMAGE REPOSITORY

Lists the images in an [image repository](../../developer-guide/snowpark-container-services/working-with-registry-repository.md).

See also:
:   [CREATE IMAGE REPOSITORY](create-image-repository.md), [DROP IMAGE REPOSITORY](drop-image-repository.md),
    [SHOW IMAGE REPOSITORIES](show-image-repositories.md)

## Syntax

```sqlsyntax
SHOW IMAGES IN IMAGE REPOSITORY <name>
```

## Parameters

`name`
:   Name of the image repository.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Image repository |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the image was uploaded to the image repository. |
| `image_name` | Image name |
| `tags` | Image tags |
| `digest` | SHA256 digest of the image |
| `image_path` | Image path (`database_name/schema_name/repository_name/image_name:image_tag`) |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the images in the `tutorial_repository` image repository.

```sqlexample
SHOW IMAGES IN IMAGE REPOSITORY tutorial_db.data_schema.tutorial_repository;
```

```output
+-------------------------------+-----------------------+--------+-------------------------------------------------------------------------+--------------------------------------------------------------------------+
| created_on                    | image_name            | tags   | digest                                                                  | image_path                                                               |
|-------------------------------+-----------------------+--------+-------------------------------------------------------------------------+--------------------------------------------------------------------------|
| 2024-04-18 13:51:35.000 -0700 | my_echo_service_image | latest | sha256:70421668b2635b2996c6d5bc80627cf6d98c0716948b5f60d198d6411d4b4681 | tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest |
+-------------------------------+-----------------------+--------+-------------------------------------------------------------------------+--------------------------------------------------------------------------+
```

---
title: SHOW INDEXES
source: https://docs.snowflake.com/en/sql-reference/sql/show-indexes.md
section: SQL Commands
---

# SHOW INDEXES

Lists all the indexes in your account for which you have access privileges.

See also:
:   [CREATE HYBRID TABLE](create-hybrid-table.md) , [CREATE INDEX](create-index.md) , [DROP INDEX](drop-index.md) , [DROP TABLE](drop-table.md) , [DESCRIBE TABLE](desc-table.md) , [SHOW HYBRID TABLES](show-hybrid-tables.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] INDEXES
  [ LIKE '<pattern>' ]
  [ IN { ACCOUNT | DATABASE [ <database_name> ] | SCHEMA [ <schema_name> ] | TABLE | TABLE <table_name> } ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN { ACCOUNT | DATABASE [ database_name ] | SCHEMA [ schema_name ] | TABLE | TABLE table_name }`
:   Filters the output by the specified database, schema, table, or account.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify the keyword `TABLE` without a `table_name`, then:

    * If there is a current database, then:

      + If there is a current schema, then the command retrieves records for the current schema in the current database.
      + If there is no current schema, then the command retrieves records for all schemas in the current database.
    * If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify a `<table_name>` (with or without the keyword `TABLE`), then:

    * If you specify a fully-qualified `<table_name>` (e.g. `my_database_name.my_schema_name.my_table_name`),
      then the command retrieves all records for the specified table.
    * If you specify a schema-qualified `<table_name>` (e.g. `my_schema_name.my_table_name`), then:

      + If a current database exists, then the command retrieves all records for the specified table.
      + If no current database exists, then the command displays an error similar to
        `Cannot perform SHOW <object_type>. This session does not have a current database...`.
    * If you specify an unqualified `<table_name>`, then:

      + If a current database and current schema exist, then the command retrieves records for the specified table in the current
        schema of the current database.
      + If no current database exists or no current schema exists, then the command displays an error similar to:
        `SQL compilation error: <object> does not exist or not authorized.`.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the index was created. |
| `name` | Name of the index. |
| `is_unique` | Whether the index is a unique index. |
| `columns` | List of indexed columns. |
| `included_columns` | List of covered columns. |
| `table` | Name of the table. |
| `database_name` | Database in which the index is stored. |
| `schema_name` | Schema in which the index is stored. |
| `owner` | Role that owns the index. |
| `owner_role_type` | Role type of the owner. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

These SHOW INDEX examples use the current database and schema.

Return a terse list of indexes that contain the string `DEVICE` in their names:

```sqlexample
SHOW TERSE INDEXES LIKE '%DEVICE%';
```

```output
+-------------------------------+---------------------------------------+-----------------+---------------+-------------+
| created_on                    | name                                  | kind            | database_name | schema_name |
|-------------------------------+---------------------------------------+-----------------+---------------+-------------|
| 2024-08-29 12:24:49.197 -0700 | SYS_INDEX_SENSOR_DATA_DEVICE1_PRIMARY | KEY_VALUE_INDEX | HT_SENSORS    | HT_SCHEMA   |
| 2024-08-29 12:24:49.197 -0700 | DEVICE_IDX                            | KEY_VALUE_INDEX | HT_SENSORS    | HT_SCHEMA   |
| 2024-08-29 14:03:36.537 -0700 | SYS_INDEX_SENSOR_DATA_DEVICE2_PRIMARY | KEY_VALUE_INDEX | HT_SENSORS    | HT_SCHEMA   |
| 2024-08-29 14:03:36.537 -0700 | DEVICE_IDX                            | KEY_VALUE_INDEX | HT_SENSORS    | HT_SCHEMA   |
+-------------------------------+---------------------------------------+-----------------+---------------+-------------+
```

Only return indexes that have covered columns (`included_columns`). Use the [pipe operator](../operators-flow.md)
(`->>`) to select specific rows and columns from the full output of the SHOW INDEXES command.

```sqlexample
SHOW INDEXES
  ->> SELECT "name",
             "is_unique",
             "table",
             "columns",
             "included_columns",
             "database_name",
             "schema_name"
        FROM $1
        WHERE "included_columns" != '[]';
```

The following output shows the SELECT query result only. One index qualifies for the WHERE clause condition:

```output
+------------+-----------+---------------------+-------------+------------------+---------------+-------------+
| name       | is_unique | table               | columns     | included_columns | database_name | schema_name |
|------------+-----------+---------------------+-------------+------------------+---------------+-------------|
| DEVICE_IDX | N         | SENSOR_DATA_DEVICE2 | [DEVICE_ID] | [TEMPERATURE]    | HT_SENSORS    | HT_SCHEMA   |
+------------+-----------+---------------------+-------------+------------------+---------------+-------------+
```

---
title: SHOW INTEGRATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-integrations.md
section: SQL Commands
---

# SHOW INTEGRATIONS

Lists the integrations in your account.

The output returns integration metadata and properties.

See also:
:   [CREATE INTEGRATION](create-integration.md) , [DROP INTEGRATION](drop-integration.md) , [ALTER INTEGRATION](alter-integration.md) , [DESCRIBE INTEGRATION](desc-integration.md)

API integrations:
:   [CREATE API INTEGRATION](create-api-integration.md)

Catalog integrations:
:   [CREATE CATALOG INTEGRATION](create-catalog-integration.md)

External access integrations:
:   [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)

Notification integrations:
:   [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)

Security integrations:
:   [CREATE SECURITY INTEGRATION](create-security-integration.md)

Storage integrations:
:   [CREATE STORAGE INTEGRATION](create-storage-integration.md)

## Syntax

```sqlsyntax
SHOW [ { API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE } ] INTEGRATIONS [ LIKE '<pattern>' ]
```

## Parameters

`{ API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE }`
:   Returns integrations of the specified type only.

    For more information about some of these types, see the following topics:

    * [SHOW CATALOG INTEGRATIONS](show-catalog-integrations.md)
    * [SHOW NOTIFICATION INTEGRATIONS](show-notification-integrations.md)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration |  |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Currently, only the `API | CATALOG | EXTERNAL ACCESS | NOTIFICATION | SECURITY | STORAGE` parameter is supported.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides integration properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the integration |
| `type` | Type of the integration |
| `category` | Category of the integration |
| `enabled` | Current status of the integration, either TRUE (enabled) or FALSE (disabled) |
| `comment` | Comment for the integration |
| `created_on` | Date and time when the integration was created |

For more information about the properties that can be specified for an integration, see the following topic for the integration by type:

* [CREATE API INTEGRATION](create-api-integration.md)
* [CREATE CATALOG INTEGRATION](create-catalog-integration.md)
* [CREATE EXTERNAL ACCESS INTEGRATION](create-external-access-integration.md)
* [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md)
* [CREATE SECURITY INTEGRATION](create-security-integration.md)
* [CREATE STORAGE INTEGRATION](create-storage-integration.md)

## Examples

Show all notification integrations:

> ```sqlexample
> SHOW NOTIFICATION INTEGRATIONS;
> ```

Show all the integrations whose name starts with `line` that you have privileges to view:

> ```sqlexample
> SHOW INTEGRATIONS LIKE 'line%';
> ```

---
title: SHOW JOIN POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-join-policies.md
section: SQL Commands
---

# SHOW JOIN POLICIES

Lists information about existing [join policies](../../user-guide/join-policies.md), including the creation date, database and schema names, owner, and any available comments.

See also:
:   [Join policy DDL reference](../../user-guide/join-policies.md)

## Syntax

```sqlsyntax
SHOW JOIN POLICIES  [ LIKE '<pattern>' ]
                           [ IN
                               {
                                 ACCOUNT |
                                 DATABASE [ <database_name> ] |
                                 SCHEMA [ <schema_name> ] |
                               }
                           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY JOIN POLICY | Account |  |
| APPLY | Join policy |  |
| OWNERSHIP | Join policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For more information about join policy DDL and privileges, see [Managing join policies](../../user-guide/join-policies.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW JOIN POLICIES;
```

```output
+-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------+
| created_on                    | name | database_name | schema_name    | kind        | owner        | comment | owner_role_type | options |
|-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------|
| 2024-12-04 15:15:49.591 -0800 | JP1  | POLICY1_DB    | POLICY1_SCHEMA | JOIN_POLICY | POLICY1_ROLE |         | ROLE            |         |
+-------------------------------+------+---------------+----------------+-------------+--------------+---------+-----------------+---------+
```

---
title: SHOW LISTINGS
source: https://docs.snowflake.com/en/sql-reference/sql/show-listings.md
section: SQL Commands
---

# SHOW LISTINGS

Lists the [listings](../../collaboration/collaboration-listings-about.md) that you have privileges to access.
Shows only listings where the user running the command has any of USAGE, MODIFY, or OWNERSHIP against the listing.

See also:
:   [CREATE LISTING](create-listing.md), [ALTER LISTING](alter-listing.md), [DESCRIBE LISTING](desc-listing.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
SHOW LISTINGS [ LIKE '<pattern>' ]
              [ STARTS WITH '<name_string>' ]
              [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Optional parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* You can show a listing only if you use a role that has the USAGE, MODIFY, or OWNERSHIP privilege on the listing.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides listing properties and metadata in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `global_name` | Global name of the listing |
| `name` | Name specified when the listing was created. |
| `title` | Title specified in the listing manifest. |
| `subtitle` | Sub title specified in the listing manifest. |
| `profile` | Provider profile name as specified in the listing manifest. |
| `created_on` | Date and time when the listing was created. |
| `updated_on` | Date and time when the listing was last updated. |
| `published_on` | Date and time when the listing was last published. |
| `state` | State of the listing, one of:   * DRAFT * PUBLISHED * UNPUBLISHED |
| `review_state` | Review state for public listings only, one of:   * UNSENT * PENDING * REJECTED * APPROVED * CANCELLED |
| `comment` | Associated comment, if present. |
| `owner` | Listing owner. |
| `owner_role_type` | Owner role type. |
| `regions` | List of regions where a public listing is available. |
| `target_accounts` | Comma separated list of target accounts. |
| `is_monetized` | Is monetized flag. |
| `is_application` | Is application flag. If `true` a Snowflake Native App is attached to the listing. |
| `is_targeted` | Is targeted flag. |
| `is_limited_trial` | Whether the listing is available for limited trial before purchasing. |
| `is_by_request` | Whether the listing is a personalized listing. |
| `distribution` | Whether the listing is an EXTERNAL or ORGANIZATION listing. |
| `is_mountless_queryable` | Whether the listing can be queried by a consumer without mounting using the Uniform Listing Locator (ULL) for the listing. |
| `rejected_on` | Date and time when the public listing for approval was last rejected. |
| `organization_profile_name` | The profile associated with the ORGANIZATION listing. |
| `uniform_listing_locator` | The ULL tha allows consumers to access the organization listing without mounting. |
| `detailed_target_accounts` | Private listing target account details with company name included. |
| `compliance_badges` | List of compliance certifications that were approved by Snowflake’s compliance team for the listing, if any. Available certifications include:   * SOC2 * HIPAA * ISO27001 |

## Examples

Show all the listings with names that start with `MYLISTING`:

> ```sqlexample
> SHOW LISTINGS LIKE 'MYLISTING%'
> ```

Show ten listings starting from listing `MYLISTING`:

> ```sqlexample
> SHOW LISTINGS LIMIT 10 FROM 'MYLISTING%'
> ```

---
title: SHOW LISTINGS IN FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/show-listings-in-failover-group.md
section: SQL Commands
---

# SHOW LISTINGS IN FAILOVER GROUP

Shows the listings in a [failover group](../../user-guide/account-replication-intro.md).

See also:
:   [SHOW DATABASES IN FAILOVER GROUP](show-databases-in-failover-group.md), [SHOW SHARES IN FAILOVER GROUP](show-shares-in-failover-group.md)

## Syntax

```sqlsyntax
SHOW LISTINGS IN FAILOVER GROUP <name>
```

## Parameters

`name`
:   Specifies the identifier for the failover group.

## Access control requirements

To review the roles that are required to monitor replication and failover on group objects in the system, see [Replication privileges](../../user-guide/account-replication-considerations.md).

## Usage notes

* Executing this command requires a role with either the OWNERSHIP or MONITOR privilege on the failover group. The command
  only returns objects for which the current user’s current role has been granted at least one access privilege.
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides listing properties and metadata in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `global_name` | Global name of the listing |
| `name` | Name specified when the listing was created. |
| `title` | Title specified in the listing manifest. |
| `subtitle` | Sub title specified in the listing manifest. |
| `profile` | Provider profile name as specified in the listing manifest. |
| `created_on` | Date and time when the listing was created. |
| `updated_on` | Date and time when the listing was last updated. |
| `published_on` | Date and time when the listing was last published. |
| `state` | State of the listing, one of:   * DRAFT * PUBLISHED * UNPUBLISHED |
| `review_state` | Review state for public listings only, one of:   * UNSENT * PENDING * REJECTED * APPROVED * CANCELLED |
| `comment` | Associated comment, if present. |
| `owner` | Listing owner. |
| `owner_role_type` | Owner role type. |
| `regions` | List of regions where a public listing is available. |
| `target_accounts` | Comma separated list of target accounts. |
| `is_monetized` | Is monetized flag. |
| `is_application` | Is application flag. If `true` a Snowflake Native App is attached to the listing. |
| `is_targeted` | Is targeted flag. |
| `is_limited_trial` | Whether the listing is available for limited trial before purchasing. |
| `is_by_request` | Whether the listing is a personalized listing. |
| `distribution` | Whether the listing is an EXTERNAL or ORGANIZATION listing. |
| `is_mountless_queryable` | Whether the listing can be queried by a consumer without mounting using the Uniform Listing Locator (ULL) for the listing. |
| `rejected_on` | Date and time when the public listing for approval was last rejected. |
| `organization_profile_name` | The profile associated with the ORGANIZATION listing. |
| `uniform_listing_locator` | The ULL tha allows consumers to access the organization listing without mounting. |
| `detailed_target_accounts` | Private listing target account details with company name included. |
| `compliance_badges` | List of compliance certifications that were approved by Snowflake’s compliance team for the listing, if any. Available certifications include:   * SOC2 * HIPAA * ISO27001 |

## Examples

List the listings in the failover group `myfg`:

```sqlexample
SHOW LISTINGS IN FAILOVER GROUP myfg;
```

---
title: SHOW LOCKS
source: https://docs.snowflake.com/en/sql-reference/sql/show-locks.md
section: SQL Commands
---

# SHOW LOCKS

Lists all running transactions that have locks on resources. The command can be used to show locks for the current user in all the
user’s sessions or all users in the account.

For information about transactions and resource locking, see [Transactions](../transactions.md).

See also:
:   [SHOW TRANSACTIONS](show-transactions.md)

## Syntax

```sqlsyntax
SHOW LOCKS [ IN ACCOUNT ]
```

## Parameters

`IN ACCOUNT`
:   Returns all locks across all users in the account. This parameter only applies when executed by users with the ACCOUNTADMIN role
    (account administrators).

    For all other roles, the function only shows locks across all sessions for the current user.

## Output

The command output shows lock metadata in the following columns:

| Column | Description |
| --- | --- |
| `resource` | A fully qualified table name or a transaction ID. |
| `type` | `PARTITIONS` (for standard table locks) or `ROW` (for hybrid table locks). |
| `transaction` | Transaction ID (a signed 64-bit integer). |
| `transaction_started_on` | Timestamp that specifies when the transaction started executing. |
| `status` | Current status of the transaction: `HOLDING` or `WAITING`. |
| `acquired_on` | Timestamp that specifies when the lock was acquired. |
| `query_id` | Internal/system-generated identifier for the SQL statement. |
| `session` | Session ID (visible to users with the ACCOUNTADMIN role only). |

## Usage notes

* The command output includes the IDs for all running transactions that have locks on resources. These IDs can be used as input for
  [SYSTEM$ABORT_TRANSACTION](../functions/system_abort_transaction.md) to abort a specified transaction.
* For hybrid tables, this command displays a lock only if a transaction is blocked, or is blocking another transaction.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

In this example, a transaction is holding a lock on the specified standard table (the table named in the `resource` column).

```sqlexample
SHOW LOCKS;
```

```output
+---------------------------+------------+---------------------+-------------------------------+---------+-------------------------------+--------------------------------------+
| resource                  | type       |         transaction | transaction_started_on        | status  | acquired_on                   | query_id                             |
|---------------------------+------------+---------------------+-------------------------------+---------+-------------------------------+--------------------------------------|
| CALIBAN_DB.PUBLIC.WEATHER | PARTITIONS | 1721330303831000000 | 2024-07-18 12:18:23.831 -0700 | HOLDING | 2024-07-18 12:18:49.832 -0700 | 01b5c1c6-0002-8691-0000-a9950068a0c6 |
+---------------------------+------------+---------------------+-------------------------------+---------+-------------------------------+--------------------------------------+
```

In this example, a transaction is holding a row-level lock on a hybrid table. Another transaction is waiting on
that lock.

```sqlexample
SHOW LOCKS;
```

```output
+---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------+
| resource            | type |         transaction | transaction_started_on        | status  | acquired_on | query_id                             |
|---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------|
| 1721165584820000000 | ROW  | 1721165584820000000 | 2024-07-16 14:33:04.820 -0700 | HOLDING | NULL        |                                      |
| 1721165584820000000 | ROW  | 1721165674582000000 | 2024-07-16 14:34:34.582 -0700 | WAITING | NULL        | 01b5b715-0002-852b-0000-a99500665352 |
+---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------+
```

---
title: SHOW MAINTENANCE POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-maintenance-policies.md
section: SQL Commands
---

# SHOW MAINTENANCE POLICIES

Lists the [maintenance policies](../../developer-guide/native-apps/consumer-maintenance-policies.md) applied to the specified account or app.

See also:
:   [CREATE MAINTENANCE POLICY](create-maintenance-policy.md), [ALTER MAINTENANCE POLICY](alter-maintenance-policy.md), [DROP MAINTENANCE POLICY](drop-maintenance-policy.md)

## Syntax

```sqlsyntax
SHOW MAINTENANCE POLICIES { ON | IN } { ACCOUNT | APPLICATION <app_name> | <entity_type> <entity_name> }
```

`ACCOUNT`
:   Shows the maintenance policies applied to the account.

`APPLICATION <app_name>`
:   Shows the maintenance policies applied to the specified app.

## Parameters

`{ ON | IN }`
:   Specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `APPLICATION app_name`
    :   Returns records for the specified app.

    `IN entity_type entity_name`
    :   Returns records for the specified entity. Specify one of the following for the `entity_type`:

        * `DATABASE`
        * `APPLICATION PACKAGE`
        * `SCHEMA`

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY MAINTENANCE POLICY | Account |  |
| OWNERSHIP | Maintenance policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Examples

The following example shows all maintenance policies applied to the account:

```sqlexample
SHOW MAINTENANCE POLICIES ON ACCOUNT;
```

Show maintenance policies for a specific app:

```sqlexample
SHOW MAINTENANCE POLICIES ON APPLICATION my_app;
```

Show maintenance policies for a specific database:

```sqlexample
SHOW MAINTENANCE POLICIES IN DATABASE my_database;
```

Show maintenance policies for a specific app package:

```sqlexample
SHOW MAINTENANCE POLICIES IN APPLICATION PACKAGE my_app_package;
```

Show maintenance policies for a specific schema:

```sqlexample
SHOW MAINTENANCE POLICIES IN SCHEMA my_schema;
```

---
title: SHOW MANAGED ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-managed-accounts.md
section: SQL Commands
---

# SHOW MANAGED ACCOUNTS

Lists the managed accounts created for your account. Currently used by data providers to create reader accounts for their consumers. For
more details, see [Manage reader accounts](../../user-guide/data-sharing-reader-create.md).

See also:
:   [CREATE MANAGED ACCOUNT](create-managed-account.md) , [DROP MANAGED ACCOUNT](drop-managed-account.md)

## Syntax

```sqlsyntax
SHOW MANAGED ACCOUNTS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Usage notes

* The command can be executed by users with the ACCOUNTADMIN role (or a role that has been granted the MONITOR USAGE global privilege).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

The command output provides managed account properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `account_name` | Name of the account. |
| `cloud` | Cloud in which the managed account is located. For reader accounts, this is always the same as the cloud for the provider account. |
| `region` | Region in which the managed account is located. For reader accounts, this is always the same as the region for the provider account. |
| `account_locator` | Legacy identifier for the account. |
| `created_on` | Date and time when the managed account was created. |
| `account_url` | [Account URL](../../user-guide/organizations-connect.md) that is used to connect to the account, in the account name format. The [account identifier](../../user-guide/admin-account-identifier.md) in this format follows the pattern `<orgname>-<account_name>`. |
| `account_locator_url` | Account URL that is used to connect to the account, in the legacy account locator format. |
| `is_reader` | Specifies whether the managed account is a reader account (for sharing data). |
| `comment` | Comment for the managed account. |
| `region_group` | Region group in which the managed account is located. |
| `old_account_url` | If the original [account URL](../../user-guide/organizations-connect.md) was saved when the account was renamed, provides the original URL. If the original account URL was dropped, the value is NULL even if the account was renamed. |
| `account_old_url_saved_on` | If the original account URL was saved when the account was renamed, provides the date and time when the original account URL was saved. |
| `account_old_url_last_used` | If the original account URL was saved when the account was renamed, indicates the last time the account was accessed using the original URL. |
| `organization_old_url` | If the account’s organization was changed in a way that created a new [account URL](../../user-guide/organizations-connect.md) and the original account URL was saved, provides the original account URL. If the original account URL was dropped, the value is NULL even if the organization changed. |
| `organization_old_url_saved_on` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, provides the date and time when the original account URL was saved. |
| `organization_old_url_last_used` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, indicates the last time the account was accessed using the original account URL. |

## Examples

```sqlexample
SHOW MANAGED ACCOUNTS;
```

```output
+--------------+-------+-----------+---------+-------------------------------+--------------------------------------------+----------------------------------------+-----------+---------+----------------+
| name         | cloud | region    | locator | created_on                    | url                                        |  account_locator_url                   | is_reader | comment |  region_group  |
|--------------+-------+-----------+---------+-------------------------------+--------------------------------------------+----------------------------------------+-----------+---------|----------------|
| ACCT1        | aws   | us-west-2 | RE47190 | 2018-05-30 14:38:54.479 -0700 | https://bazco-acct1.snowflakecomputing.com  |  https://re47190.snowflakecomputing.com | true    |         |     PUBLIC     |
+--------------+-------+-----------+---------+-------------------------------+--------------------------------------------+----------------------------------------+-----------+---------+----------------+
```

---
title: SHOW MASKING POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-masking-policies.md
section: SQL Commands
---

# SHOW MASKING POLICIES

Lists masking policy information, including the creation date, database and schema names, owner, and any available comments.

See also:
:   [Masking policy DDL](../../user-guide/security-column-intro.md)

## Syntax

```sqlsyntax
SHOW MASKING POLICIES  [ LIKE '<pattern>' ]
                       [ IN
                            {
                              ACCOUNT                                         |

                              DATABASE                                        |
                              DATABASE <database_name>                        |

                              SCHEMA                                          |
                              SCHEMA <schema_name>                            |
                              <schema_name>

                              APPLICATION <application_name>                  |
                              APPLICATION PACKAGE <application_package_name>  |
                            }
                       ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY MASKING POLICY | Account |  |
| APPLY | Masking policy |  |
| OWNERSHIP | Masking policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on masking policy DDL and privileges, see [Managing Column-level Security](../../user-guide/security-column-intro.md).

## Usage notes

* The OPTIONS column returns an empty string (that is, `""`) when the masking policy property `EXEMPT_OTHER_POLICIES` is set to
  `FALSE`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW MASKING POLICIES IN SCHEMA governance.policies;
```

```output
+-------------------------------+------------+---------------+-------------+----------------+---------------+------------------------------+-----------------------------------+-----------------+
| created_on                    | name       | database_name | schema_name | kind           | owner         | comment                      | options                           | owner_role_type |
+-------------------------------+------------+---------------+-------------+----------------+---------------+------------------------------+-----------------------------------+-----------------+
| 2022-08-13 16:59:59.733 +0000 | EMAIL_MASK | GOVERNANCE    | POLICIES    | MASKING_POLICY | MASKING_ADMIN | SPECIFY IN ROW ACCESS POLICY | {“EXEMPT_OTHER_POLICIES”: "TRUE"} | ROLE            |
+-------------------------------+------------+---------------+-------------+----------------+---------------+------------------------------+-----------------------------------+-----------------+
```

---
title: SHOW MATERIALIZED VIEWS
source: https://docs.snowflake.com/en/sql-reference/sql/show-materialized-views.md
section: SQL Commands
---

# SHOW MATERIALIZED VIEWS

Lists the materialized views that you have privileges to access.

For more information about materialized views, see [Working with Materialized Views](../../user-guide/views-materialized.md).

See also:
:   [CREATE MATERIALIZED VIEW](create-materialized-view.md) , [ALTER MATERIALIZED VIEW](alter-materialized-view.md) , [DROP MATERIALIZED VIEW](drop-materialized-view.md) , [DESCRIBE MATERIALIZED VIEW](desc-materialized-view.md)

## Syntax

```sqlsyntax
SHOW MATERIALIZED VIEWS [ LIKE '<pattern>' ]
                        [ IN
                             {
                               ACCOUNT                                         |

                               DATABASE                                        |
                               DATABASE <database_name>                        |

                               SCHEMA                                          |
                               SCHEMA <schema_name>                            |
                               <schema_name>

                               APPLICATION <application_name>                  |
                               APPLICATION PACKAGE <application_package_name>  |
                             }
                        ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* The output columns are similar to the output columns for [SHOW TABLES](show-tables.md), but includes the following additional columns:

  + refreshed_on: time of the last DML operation on the base table that was processed by a
    [“refresh” operation](../../user-guide/views-materialized.md).
  + compacted_on: time of the last DML operation on the base table that was processed by a
    [“compaction” operation](../../user-guide/views-materialized.md).
  + behind_by: If the background process that updates the materialized view
    with changes from the base table has not yet brought the materialized view
    up to date, then this column shows approximately how many seconds the
    materialized view is “behind” the base table. Note that even if this shows
    that the materialized view is not up to date, any queries on the
    materialized view will still return up-to-date results (they just might
    take a little longer as extra information is retrieved from the base table).
* The command SHOW VIEWS also shows information about materialized views.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

The command output provides materialized view properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | The timestamp at which the materialized view was created. |
| name | The name of the materialized view. |
| reserved | Reserved for future use. |
| database_name | The name of the database in which the materialized view exists. |
| schema_name | The name of the schema in which the materialized view exists. |
| cluster_by | Information about the clustering columns (if the materialized view is clustered). |
| rows | The number of rows in the materialized view. |
| bytes | The number of bytes of data in the materialized view. |
| source_database_name | The name of the database in which the materialized view’s base table exists. |
| source_schema_name | The name of the schema in which the materialized view’s base table exists. |
| source_table_name | The name of the materialized view’s base table. |
| refreshed_on | The timestamp of the last DML operation on the base table that was processed by a [“refresh” operation](../../user-guide/views-materialized.md). |
| compacted_on | The timestamp of the last DML operation on the base table that was processed by a [“compaction” operation](../../user-guide/views-materialized.md). |
| owner | The owner of the materialized view. |
| invalid | True if the materialized view is currently invalid (for example, if the base table dropped a column that the view used); false otherwise. |
| invalid_reason | The reason (if any) that the materialized view is currently invalid. |
| behind_by | How far the updates of the materialized view are behind the updates of the base table. |
| comment | Optional comment. |
| text | The text of the command that created this materialized view (e.g. CREATE MATERIALIZED VIEW …). |
| is_secure | True if the materialized view is a secure view; false otherwise. |
| automatic_clustering | True if the view is clustered and the clustering is automatic. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Examples

Show all materialized views:

> ```sqlexample
> SHOW MATERIALIZED VIEWS;
> ```

Show only materialized views with names matching the specified regular expression:

> ```sqlexample
> SHOW MATERIALIZED VIEWS LIKE 'mv1%';
>
> +-------------------------------+------+----------+---------------+-------------+------------+------+-------+----------------------+--------------------+-------------------+-------------------------------+--------------+----------+---------+----------------+-----------+---------+--------------------------------------------+-----------+----------------------+-----------------+
> | created_on                    | name | reserved | database_name | schema_name | cluster_by | rows | bytes | source_database_name | source_schema_name | source_table_name | refreshed_on                  | compacted_on | owner    | invalid | invalid_reason | behind_by | comment | text                                       | is_secure | automatic_clustering | owner_role_type |
> |-------------------------------+------+----------+---------------+-------------+------------+------+-------+----------------------+--------------------+-------------------+-------------------------------+--------------+----------+---------+----------------+-----------+---------+--------------------------------------------+-----------|----------------------+-----------------|
> | 2018-10-05 17:13:17.579 -0700 | MV1  |          | TEST_DB1      | PUBLIC      |            |    0 |     0 | TEST_DB1             | PUBLIC             | INVENTORY         | 2018-10-05 17:13:50.373 -0700 | NULL         | SYSADMIN | false   | NULL           | 0s        |         | CREATE OR REPLACE MATERIALIZED VIEW mv1 AS | false     | OFF                  | ROLE            |
> |                               |      |          |               |             |            |      |       |                      |                    |                   |                               |              |          |         |                |           |         |       SELECT ID, price FROM inventory;     |           |                      |                 |          |
> +-------------------------------+------+----------+---------------+-------------+------------+------+-------+----------------------+--------------------+-------------------+-------------------------------+--------------+----------+---------+----------------+-----------+---------+--------------------------------------------+-----------+----------------------+-----------------+
> ```

---
title: SHOW MCP SERVERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-mcp-servers.md
section: SQL Commands
---

# SHOW MCP SERVERS

Lists the MCP (Model Context Protocol) servers for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE MCP SERVER](create-mcp-server.md) , [DESCRIBE MCP SERVER](desc-mcp-server.md) , [DROP MCP SERVER](drop-mcp-server.md)

## Syntax

```sqlsyntax
SHOW MCP SERVERS [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |
                  DATABASE                 |
                  DATABASE <database_name> |
                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the MCP server was created. |
| `name` | Name of the MCP server. |
| `database_name` | Database that contains the MCP server. |
| `schema_name` | Schema that contains the MCP server. |
| `owner` | Role that owns the MCP server. |
| `comment` | Comment for the MCP server. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE | Database (when IN DATABASE is specified) |
| USAGE | Schema (when IN SCHEMA is specified) |

The command returns records for MCP servers based on the privileges held by the role used to execute the command.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the MCP servers that you have the privileges to view in the PUBLIC schema of the `mydb` database:

```sqlexample
USE DATABASE mydb;

SHOW MCP SERVERS;
```

```output
|               created_on               |       name        | database_name | schema_name |    owner     |           comment            |
------------------------------------------+-------------------+---------------+-------------+--------------+------------------------------
| Fri, 23 Jun 1967 07:00:00.123000 +0000 | TEST_MCP_SERVER   | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | [NULL]                       |
| Fri, 23 Jun 1967 07:00:00.123000 +0000 | TEST_MCP_SERVER_2 | TEST_DATABASE | TEST_SCHEMA | ACCOUNTADMIN | Test MCP server with comment |
```

The following example lists the MCP servers in the specified database:

```sqlexample
SHOW MCP SERVERS IN DATABASE mydb;
```

The following example lists the MCP servers in the specified schema:

```sqlexample
SHOW MCP SERVERS IN SCHEMA mydb.public;
```

The following example lists all MCP servers in the account:

```sqlexample
SHOW MCP SERVERS IN ACCOUNT;
```

---
title: SHOW MFA METHODS
source: https://docs.snowflake.com/en/sql-reference/sql/show-mfa-methods.md
section: SQL Commands
---

# SHOW MFA METHODS

Lists the [second factors of authentication](../../user-guide/security-mfa-second-factor.md) that a user enrolled in multi-factor
authentication uses to sign in to Snowflake.

## Syntax

```sqlsyntax
SHOW MFA METHODS [ FOR USER <user> ]
```

## Parameters

`[ FOR USER user ]`
:   Specifies the user for whom you want to list second factors of authentication. Omitting this clause returns the authentication methods
    of the current user.

    Only users with the ACCOUNTADMIN role can use this clause.

## Usage notes

Executing this command without the FOR USER clause returns the authentication methods for the current user.

## Output

The command output provides information about authentication methods in the following columns:

| Column | Description |
| --- | --- |
| `name` | System-generated name of the authentication method. |
| `type` | Type of second factor of authentication. Possible values are:   * `PASSKEY` - User can use a passkey as their second factor of authentication. * `TOTP` - User can use a time-based one-time passcode from an authenticator app as their second factor of authentication. * `DUO` - User can use Duo as their second factor of authentication. |
| `comment` | User-specified name of the authentication method. This name appears in Snowsight when authenticating.  Empty if Duo is the second factor of authentication. |
| `last_used` | Date and time when the user last authenticated with the authentication method.  Empty if Duo is the second factor of authentication. |
| `created_on` | Date and time when the user configured the authentication method for themselves.  Empty if Duo is the second factor of authentication. |

## Examples

As an administrator, find the second factors of authentication that user `joe` configured for himself.

```sqlexample
USE ROLE ACCOUNTADMIN;

SHOW MFA METHODS FOR USER joe;
```

List the second factors of authentication of the current user.

```sqlexample
SHOW MFA METHODS;
```

---
title: SHOW MODEL MONITORS
source: https://docs.snowflake.com/en/sql-reference/sql/show-model-monitors.md
section: SQL Commands
---

# SHOW MODEL MONITORS

Lists all [model monitor](../../developer-guide/snowflake-ml/model-registry/model-observability.md) that you can access in
the current or specified schema and displays information about each one.

See also:
:   [CREATE MODEL MONITOR](create-model-monitor.md),
    [ALTER MODEL MONITOR](alter-model-monitor.md),
    [DESCRIBE MODEL MONITOR](desc-model-monitor.md),
    [DROP MODEL MONITOR](drop-model-monitor.md)

## Syntax

```sqlsyntax
SHOW MODEL MONITORS
[ LIKE <pattern> ]
[ IN
    {
      ACCOUNT                  |

      DATABASE                 |
      DATABASE <database_name> |

      SCHEMA                   |
      SCHEMA <schema_name>     |
      <schema_name>
    }
 ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The command output provides model monitor properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the model monitor was created. |
| `name` | Name of the model monitor. |
| `database_name` | Database in which the model monitor is stored. |
| `schema_name` | Schema in which the model monitor is stored. |
| `warehouse_name` | Warehouse used to monitor the model. |
| `refresh_interval` | The refresh interval (target lag) for triggering refresh of the model monitor. |
| `aggregation_window` | The aggregation window for calculating metrics. |
| `model_task` | The task of the model being monitored, either TABULAR_BINARY_CLASSIFICATION or TABULAR_REGRESSION. |
| `monitor_state` | The state of the model monitor:   * ACTIVE: The model monitor is active and operating correctly. * SUSPENDED: Model monitoring is paused. * PARTIALLY_SUSPENDED: An error condition in which one of the underlying tables has stopped refreshing at the expected interval. See DESCRIBE for more details. * UNKNOWN: An error condition in which the state of the underlying tables cannot be identified. |
| `source` | String representation of a JSON object detailing the source table or view on which aggregations are based. If the table does not exist or is not accessible, the value is an empty string. See Table JSON object specification. |
| `baseline` | String representation of a JSON object detailing baseline table being used for monitoring, of which a clone is embedded in the model monitor object. See Table JSON object specification. |
| `model` | String representation of a JSON object containing information specifically about the model being monitored. See Model JSON object specification. |
| `comment` | Comment about the model monitor. |

### Table JSON object specification

The following is an example of the JSON representation of a table, view, or other table-like object, as used by the `source` and `baseline` columns in the command output:

| `name` | Name of the source or baseline table or view. |
| --- | --- |
| `database_name` | Database in which the table or view is stored. |
| `schema_name` | Schema in which the table or view is stored. |
| `status` | The status of the table:   * ACTIVE: The table or view is accessible by the user. * MASKED: The current user does not have access to the table or view. Values of other fields appear masked (that is, as a series of asterisks). * DELETED: The table or view has been deleted. * NOT_SET: The property has not been set. Only applicable for baseline data. |

### Model JSON object specification

The following is an example of the JSON representation of a model, as used by the `model` column in the command output:

| Field | Description |
| --- | --- |
| `model_name` | Name of the model being monitored. |
| `version_name` | Version name of the model version being monitored. |
| `function_name` | Name of the specific function being monitored in the specified model version. |
| `database_name` | Database in which the model is stored. |
| `schema_name` | Schema in which the model is stored. |
| `model_status` | The status of the model. Can be ACTIVE, MASKED, or DELETED. MASKED indicates that the user does not have access to the model; other fields show as a series of asterisks. |
| `version_status` | The status of the model version. Can be ACTIVE or DELETED. (MASKED is not a valid status for a model version, because they do not have access control.) |

## Access control requirements

| Privilege | Target |
| --- | --- |
| Any | Model monitor |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW MODELS
source: https://docs.snowflake.com/en/sql-reference/sql/show-models.md
section: SQL Commands
---

# SHOW MODELS

Lists the machine learning models that you have privileges to access.

The output returns table metadata and properties, ordered lexicographically by database, schema, and model name (see Output in this
topic for descriptions of the output columns). This is important to note if you wish to filter the results using the provided filters.

See also:
:   [CREATE MODEL](create-model.md) , [DROP MODEL](drop-model.md) , [ALTER MODEL](alter-model.md), [SHOW VERSIONS IN MODEL](show-versions-in-model.md)

## Syntax

```sqlsyntax
SHOW MODELS [ LIKE '<pattern>' ]
            [ IN { DATABASE [ <db_name> ] | SCHEMA [ <schema_name> ] } ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN DATABASE [ db_name ] | SCHEMA [ schema_name ]`
:   Optionally specifies the scope of the command, which determines whether the command lists models only in the current/specified
    database or schema.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the models you have privileges to view in the current
      database).
    * No database: Account scope is the default (i.e. the command returns the models you have privileges to view in your account).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the model was created. |
| name | Name of the model. |
| model_type | The type of the model, either USER_MODEL for models that contain user code, or CORTEX_FINETUNED for models created with [Cortex Fine-tuning](../../user-guide/snowflake-cortex/cortex-finetuning.md) |
| database_name | Database in which the model is stored. |
| schema_name | Schema in which the model is stored. |
| owner | Role that owns the model. |
| comment | Comment for the model. |
| versions | JSON array listing versions of the model. |
| default_version_name | Version of the model used when referring to the model without a version. |
| aliases | A SQL object mapping [model version aliases](../../developer-guide/snowflake-ml/model-registry/overview.md) to the corresponding model version name. |

## Usage notes

* Results are sorted by database name, schema name, and then model name. This means results for a database can contain models from multiple schemas
  and might break pagination. In order for pagination to work as expected, you must execute the SHOW MODELS statement for a single schema. You can
  use the IN SCHEMA `schema_name` parameter to the SHOW MODELS command. Alternatively, you can use the schema in the current context by
  executing a USE SCHEMA `schema_name` before executing SHOW MODELS.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW NETWORK POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-network-policies.md
section: SQL Commands
---

# SHOW NETWORK POLICIES

Lists all network policies defined in the system.

See also:
:   [ALTER NETWORK POLICY](alter-network-policy.md) , [CREATE NETWORK POLICY](create-network-policy.md) , [DESCRIBE NETWORK POLICY](desc-network-policy.md) , [DROP NETWORK POLICY](drop-network-policy.md)

## Syntax

```sqlsyntax
SHOW NETWORK POLICIES
```

## Usage notes

* Only the network policy owner (that is, role with the OWNERSHIP privilege on the network policy) or higher can execute this command.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

List all network policies:

> ```sqlexample
> SHOW NETWORK POLICIES;
> ```
>
> ```output
> +-------------------------------+----------+---------+----------------------------+----------------------------+---------------------------------------------------------------------+
> | created_on                    | name     | comment | entries_in_allowed_ip_list | entries_in_blocked_ip_list | entries_in_allowed_network_rules | entries_in_blocked_network_rules |
> |-------------------------------+----------+---------+----------------------------+----------------------------+----------------------------------+----------------------------------|
> | 2016-04-29 13:22:34.034 -0700 | Policy1  |         |                          2 |                          1 |                                 0|                                0 |
> | 2016-04-28 17:31:59.269 -0700 | Policy2  |         |                          1 |                          0 |                                 0|                                0 |
> +-------------------------------+----------+---------+----------------------------+----------------------------+----------------------------------+----------------------------------+
> ```

---
title: SHOW NETWORK RULES
source: https://docs.snowflake.com/en/sql-reference/sql/show-network-rules.md
section: SQL Commands
---

# SHOW NETWORK RULES

Lists all network rules defined in the system.

See also:
:   [ALTER NETWORK RULE](alter-network-rule.md) , [CREATE NETWORK RULE](create-network-rule.md) , [DESCRIBE NETWORK RULE](desc-network-rule.md) , [DROP NETWORK RULE](drop-network-rule.md)

## Syntax

```sqlsyntax
SHOW NETWORK RULES [ LIKE '<pattern>' ]
                   [ IN { ACCOUNT | DATABASE [ <db_name> ] | [ SCHEMA ] [ <schema_name> ] } ]
                   [ STARTS WITH '<name_string>' ]
                   [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN ACCOUNT | [ DATABASE ] db_name | [ SCHEMA ] schema_name`
:   Optionally specifies the scope of the command, which determines whether the command lists records only for the current/specified
    database or schema, or across your entire account:

    The `DATABASE` or `SCHEMA` keyword is not required; you can set the scope by specifying only the database or schema name.
    Likewise, the database or schema name is not required if the session currently has a database in use:

    * If `DATABASE` or `SCHEMA` is specified without a name and the session does not currently have a database in use, the
      parameter has no effect on the output.
    * If `SCHEMA` is specified with a name and the session does not currently have a database in use, the schema name must
      be fully qualified with the database name (e.g. `testdb.testschema`).

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides network rule properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the network rule was created. |
| `name` | Name of the network rule. |
| `database_name` | Database that contains the schema in which the network rule was created. |
| `schema_name` | Schema in which the network rule was created. |
| `owner` | Role that has the OWNERSHIP privilege on the network rule. |
| `comment` | Descriptive text associated with the network rule. |
| `type` | Value of the network rule’s `TYPE` property. |
| `mode` | Value of the network rule’s `MODE` property. |
| `entries_in_valuelist` | Number of network identifiers specified in the `VALUE_LIST` property of the network rule. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Network Rule | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Schema |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

List all network rules:

```sqlexample
SHOW NETWORK RULES;
```

To see the current list of Snowflake-managed network rules, run the following command:

```sqlexample
SHOW NETWORK RULES IN SNOWFLAKE.NETWORK_SECURITY;
```

> **Note:**
>
> The SHOW command doesn’t explicitly expose IP addresses, only the number of IP addresses per rule.

To see your current Snowflake-managed network rules, including IP addresses, use the [NETWORK_RULES view](../account-usage/network_rules.md).

---
title: SHOW NOTEBOOK PROJECTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-notebook-projects.md
section: SQL Commands
---

# SHOW NOTEBOOK PROJECTS

Lists the notebook projects (Snowflake `NOTEBOOK` objects) visible to the current role.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE NOTEBOOK PROJECT](create-notebook-project.md), [EXECUTE NOTEBOOK PROJECT](execute-notebook-project.md), [SHOW NOTEBOOKS](show-notebooks.md), [DESCRIBE NOTEBOOK](desc-notebook.md)

## Syntax

```sqlsyntax
SHOW NOTEBOOK PROJECTS;

SHOW NOTEBOOK PROJECTS IN SCHEMA <database_name>.<schema_name>;

SHOW NOTEBOOK PROJECTS IN DATABASE <database_name>;

SHOW NOTEBOOK PROJECTS IN ACCOUNT;
```

## Parameters

`IN SCHEMA <database_name>.<schema_name>`
:   Lists notebook projects in the specified schema.

`IN DATABASE <database_name>`
:   Lists notebook projects in all schemas of the specified database.

`IN ACCOUNT`
:   Lists all notebook projects in the account that are visible to the current role.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp of creation. |
| `name` | Name of the notebook project. |
| `database_name` | Database containing the notebook project. |
| `schema_name` | Schema containing the notebook project. |
| `owner` | The role that owns the notebook project. |
| `comment` | Comment associated with the notebook project. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | Database | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE or OWNERSHIP | Schema | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Returns all Snowflake `NOTEBOOK` objects visible to the current role.
* Use [DESCRIBE NOTEBOOK](desc-notebook.md) or `GET_DDL('NOTEBOOK', ...)` to inspect contents.
* Identifiers containing special characters must be double-quoted.

## Examples

List all notebook projects visible to the current role:

```sqlexample
SHOW NOTEBOOK PROJECTS;
```

List notebook projects in a specific schema:

```sqlexample
SHOW NOTEBOOK PROJECTS IN SCHEMA TESTDB.TESTSCHEMA;
```

List notebook projects in a specific database:

```sqlexample
SHOW NOTEBOOK PROJECTS IN DATABASE TESTDB;
```

List notebook projects in the account:

```sqlexample
SHOW NOTEBOOK PROJECTS IN ACCOUNT;
```

---
title: SHOW NOTEBOOKS
source: https://docs.snowflake.com/en/sql-reference/sql/show-notebooks.md
section: SQL Commands
---

# SHOW NOTEBOOKS

Lists the [notebooks](../../user-guide/ui-snowsight/notebooks.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order is important to note if you want to
filter the results.

## Syntax

```sqlsyntax
SHOW NOTEBOOKS [ LIKE '<pattern>' ]
               [ IN
                     {
                       ACCOUNT                  |

                       DATABASE                 |
                       DATABASE <database_name> |

                       SCHEMA                   |
                       SCHEMA <schema_name>     |
                       <schema_name>
                     }
               ]
               [ STARTS WITH '<name_string>' ]
               [ LIMIT <rows> ]
               [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

> `LIKE 'pattern'`
> :   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
>     wildcard characters (`%` and `_`).
>
>     For example, the following patterns return the same results:
>
>     `... LIKE '%testing%' ...`
>
>     `... LIKE '%TESTING%' ...`
>
>     . Default: No value (no filtering is applied to the output).
>
> `[ IN ... ]`
> :   Optionally specifies the scope of the command. Specify one of the following:
>
>     `ACCOUNT`
>     :   Returns records for the entire account.
>
>     `DATABASE`, . `DATABASE db_name`
>     :   Returns records for the current database in use or for a specified database (`db_name`).
>
>         If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.
>
>         > **Note:**
>         >
>         > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
>         >
>         > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
>         > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
>         > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.
>
>     `SCHEMA`, . `SCHEMA schema_name`
>     :   Returns records for the current schema in use or a specified schema (`schema_name`).
>
>         `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).
>
>         If no database is in use, specifying `SCHEMA` has no effect on the output.
>
>     If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:
>
>     * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
>       same effect as specifying `IN DATABASE`.
>     * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
>       same effect as specifying `IN ACCOUNT`.
>
> `STARTS WITH 'name_string'`
> :   Optionally filters the command output based on the characters that appear at the beginning of
>     the object name. The string must be enclosed in single quotes and is case sensitive.
>
>     For example, the following strings return different results:
>
>     `... STARTS WITH 'B' ...`
>
>     `... STARTS WITH 'b' ...`
>
>     . Default: No value (no filtering is applied to the output)
>
> `LIMIT rows`
> :   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
>     example, the number of existing objects is less than the specified limit.
>
>     Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the notebook was created. |
| `name` | Name of the notebook object. |
| `database_name` | Database in which the notebook is stored. |
| `schema_name` | Schema in which the notebook is stored. |
| `comment` | Comment for the notebook object. |
| `owner` | Role that owns the notebook object. |
| `query_warehouse` | Warehouse where queries issued in the notebook are run. |
| `url_id` | Unique ID associated with the notebook object. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `code_warehouse` | Warehouse where the notebook kernel is run. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE or OWNERSHIP | Notebook | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the notebooks that you have the privileges to view in the current schema:

```sqlexample
SHOW NOTEBOOKS;
```

The following example lists notebooks with names that start with `test`:

```sqlexample
SHOW NOTEBOOKS STARTS WITH 'test';
```

Returns:

```output
+--------------------------------+--------------+---------------+----------------------------------------------------------------------------------+--------+-----------------+----------------------+-----------------+------------------------------+
| created_on                     | name         | database_name | schema_name | comment                                                            | owner  | query_warehouse | url_id               | owner_role_type | code_warehouse               |
+--------------------------------+--------------+---------------+----------------------------------------------------------------------------------+--------+-----------------+----------------------+-----------------+------------------------------+
|  2024-03-20 06:37:08.402 +0000 | test_notebook| PUBLIC        | PUBLIC      | {"lastUpdatedUser":"309334439262","lastUpdatedTime":1711566800002} | PUBLIC | HLEVE1          | 2mbdchin3kn2tlzgqtca | ROLE            | SYSTEM$STREAMLIT_NOTEBOOK_WH |
+--------------------------------+--------------+---------------+-------------+--------------------------------------------------------------------+--------+-----------------+----------------------+-----------------+------------------------------+
```

---
title: SHOW NOTIFICATION INTEGRATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-notification-integrations.md
section: SQL Commands
---

# SHOW NOTIFICATION INTEGRATIONS

Lists the notification integrations in your account.

The output includes metadata and properties of each notification integration.

See also:
:   [CREATE NOTIFICATION INTEGRATION](create-notification-integration.md) , [ALTER NOTIFICATION INTEGRATION](alter-notification-integration.md) , [DESCRIBE NOTIFICATION INTEGRATION](desc-notification-integration.md),
    [DROP INTEGRATION](drop-integration.md)

## Syntax

```sqlsyntax
SHOW NOTIFICATION INTEGRATIONS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the notification integration. |
| `type` | Type of the notification integration. The value can be one of the following:   * `QUEUE - AZURE_STORAGE_QUEUE`: For   [inbound notifications](create-notification-integration-queue-inbound-azure.md) from Azure Event   Grid topics. * `QUEUE - GCP_PUBSUB`: For   [inbound](create-notification-integration-queue-inbound-gcp.md) and   [outbound notifications](create-notification-integration-queue-outbound-gcp.md) to and from Google   Pub/Sub topics. * `QUEUE - AWS_SNS`: For   [outbound notifications](create-notification-integration-queue-outbound-aws.md) to Amazon SNS   topics. * `QUEUE - AZURE_EVENT_GRID`: For   [outbound notifications](create-notification-integration-queue-outbound-azure.md) to Azure Event   Grid topics. * `EMAIL`: For [email notifications](create-notification-integration-email.md). * `WEBHOOK`: For [webhook notifications](create-notification-integration-webhooks.md). |
| `category` | Category of the integration. For notification integrations, this is always `NOTIFICATION`. |
| `enabled` | Indicates whether or not the notification integration is enabled:   * If `true`, the notification integration is enabled. * If `false`, the notification integration is disabled. |
| `comment` | Comment for the notification integration. |
| `created_on` | Date and time when the notification integration was created. |
| `direction` | Indicates whether the integration supports sending or receiving notifications. The value can be one of the following:   * `OUTBOUND`: Snowflake uses the integration to send notifications to a third-party messaging service.  This value appears for notification integrations with any of the following properties:    + `TYPE=QUEUE` and `DIRECTION=OUTBOUND`   + `TYPE=EMAIL`   + `TYPE=WEBHOOK` * `INBOUND`: Snowflake uses the integration to receive notifications from a third-party messaging service.  This value appears for notification integrations that do not specify DIRECTION=OUTBOUND. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Integration |  |
| OWNERSHIP | Integration | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command does not require a running warehouse to execute.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all notification integrations:

```sqlexample
SHOW NOTIFICATION INTEGRATIONS;
```

```output
+-----------------------------+-----------------------------+--------------+---------+---------+-------------------------------+-----------+
| name                        | type                        | category     | enabled | comment | created_on                    | direction |
|-----------------------------+-----------------------------+--------------+---------+---------+-------------------------------+-----------|
| MY_AZURE_INBOUND_QUEUE_INT  | QUEUE - AZURE_STORAGE_QUEUE | NOTIFICATION | true    | NULL    | 2025-03-08 11:34:55.861 -0800 | INBOUND   |
| MY_GCP_INBOUND_QUEUE_INT    | QUEUE - GCP_PUBSUB          | NOTIFICATION | true    | NULL    | 2025-03-08 11:35:35.163 -0800 | INBOUND   |
| MY_GCP_OUTBOUND_QUEUE_INT   | QUEUE - GCP_PUBSUB          | NOTIFICATION | true    | NULL    | 2025-03-08 11:37:06.487 -0800 | OUTBOUND  |
| MY_AWS_OUTBOUND_QUEUE_INT   | QUEUE - AWS_SNS             | NOTIFICATION | true    | NULL    | 2025-03-08 11:36:13.072 -0800 | OUTBOUND  |
| MY_EMAIL_INT                | EMAIL                       | NOTIFICATION | true    | NULL    | 2025-03-08 11:38:55.866 -0800 | OUTBOUND  |
| MY_AZURE_OUTBOUND_QUEUE_INT | QUEUE - AZURE_EVENT_GRID    | NOTIFICATION | true    | NULL    | 2025-03-08 11:36:40.822 -0800 | OUTBOUND  |
| MY_WEBHOOK_INT              | WEBHOOK                     | NOTIFICATION | true    | NULL    | 2025-03-08 11:40:17.336 -0800 | OUTBOUND  |
+-----------------------------+-----------------------------+--------------+---------+---------+-------------------------------+-----------+
```

---
title: SHOW OBJECTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-objects.md
section: SQL Commands
---

# SHOW OBJECTS

Lists the tables and views for which you have access privileges. This command can be used to list the tables and views for a specified
database or schema (or the current database/schema for the session), or your entire account.

## Syntax

```sqlsyntax
SHOW [ TERSE ] OBJECTS [ LIKE '<pattern>' ]
                       [ IN
                             {
                               ACCOUNT                                         |

                               DATABASE                                        |
                               DATABASE <database_name>                        |

                               SCHEMA                                          |
                               SCHEMA <schema_name>                            |
                               <schema_name>

                               APPLICATION <application_name>                  |
                               APPLICATION PACKAGE <application_package_name>  |
                             }
                       ]
                       [ STARTS WITH '<name_string>' ]
                       [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output).

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the object was created. |
| name | Name of the object. |
| database_name | Database in which the object is stored. |
| schema_name | Schema in which the object is stored. |
| kind | Object type: TABLE, VIEW. |
| comment | Comment for the object. |
| cluster_by | Column(s) defined as clustering key(s) for the object. |
| rows | Number of rows in the object. |
| bytes | Number of bytes that will be scanned if the entire object is scanned in a query. Note that this number may be different than the number of actual physical bytes (i.e. bytes stored on-disk) for the object. |
| owner | Role that owns the object. |
| retention_time | Number of days that modified and deleted data is retained for Time Travel. |
| is_hybrid | `Y` if the object is a hybrid table; `N` otherwise. |
| is_dynamic | `Y` if the object is a dynamic table; `N` otherwise. |
| is_iceberg | `Y` if the object is an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md); `N` otherwise. |

## Usage notes

* For a [personal database](../../user-guide/personal-databases.md), the value in the `kind` column is `PERSONAL DATABASE`.
* Personal databases can appear in the output when the command is run by a role with sufficient privileges (for example, ACCOUNTADMIN).
* To view objects in a specific personal database, use:

  ```sqlsyntax
  SHOW OBJECTS IN DATABASE "USER$<username>";
  ```
* For materialized views and semantic views, the `kind` column contains `VIEW`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all tables and views whose names start with `HT_` that you have privileges to see in the current database:

```sqlexample
SHOW OBJECTS IN DATABASE STARTS WITH 'HT_';
```

```output
+-------------------------------+------------------------+---------------+----------------+-------+---------+------------+---------+-----------+--------------+----------------+-----------------+--------+-----------+------------+
| created_on                    | name                   | database_name | schema_name    | kind  | comment | cluster_by |    rows |     bytes | owner        | retention_time | owner_role_type | budget | is_hybrid | is_dynamic |
|-------------------------------+------------------------+---------------+----------------+-------+---------+------------+---------+-----------+--------------+----------------+-----------------+--------+-----------+------------|
| 2024-05-13 19:08:41.946 -0700 | HT_PRECIP              | HYBRID1_DB    | HYBRID1_SCHEMA | TABLE |         |            |       0 |         0 | HYBRID1_ROLE | 1              | ROLE            | NULL   | Y         | N          |
| 2024-08-23 11:44:13.694 -0700 | HT_SENSOR_DATA_DEVICE1 | HYBRID1_DB    | HYBRID1_SCHEMA | TABLE |         |            | 2678400 | 133920000 | HYBRID1_ROLE | 1              | ROLE            | NULL   | Y         | N          |
| 2024-05-13 16:37:29.217 -0700 | HT_WEATHER             | HYBRID1_DB    | HYBRID1_SCHEMA | TABLE |         |            |      55 |      2985 | HYBRID1_ROLE | 1              | ROLE            | NULL   | Y         | N          |
| 2024-07-18 12:17:27.381 -0700 | HT_WEATHER             | HYBRID1_DB    | PUBLIC         | TABLE |         |            |      55 |      3040 | ACCOUNTADMIN | 1              | ROLE            | NULL   | Y         | N          |
+-------------------------------+------------------------+---------------+----------------+-------+---------+------------+---------+-----------+--------------+----------------+-----------------+--------+-----------+------------+
```

---
title: SHOW OBJECTS OWNED BY APPLICATION
source: https://docs.snowflake.com/en/sql-reference/sql/show-objects-owned-by-application.md
section: SQL Commands
---

# SHOW OBJECTS OWNED BY APPLICATION

Lists the objects owned by an app that exists outside the app.

See also:
:   [SHOW APPLICATIONS](show-applications.md)

## Syntax

```sqlsyntax
SHOW OBJECTS OWNED BY APPLICATION <app_name>
```

## Parameters

`app_name`
:   The name of the app whose objects you want to list.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MANAGE GRANTS | App | One of these privileges is required to view the objects owned by the app. |

## Output

| Column | Description |
| --- | --- |
| `created_on` | The timestamp when the object was created. |
| `name` | The name of the object owned by the app |
| `type` | The type of object, for example COMPUTE_POOL. |

## Examples

```sqlexample
SHOW OBJECTS OWNED BY APPLICATION hello_snowflake_app;
```

```output
+---------------------------------+----------------------+---------------------+
| created_on                      | name                 | object_type         |
|---------------------------------|----------------------|---------------------|
| 2024-11-20 17:56:08.887 -0800   | HELLO_SNOWFLAKE_APP  | COMPUTE_POOL        |
+---------------------------------+----------------------+---------------------+
```

---
title: SHOW OFFERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-offers.md
section: SQL Commands
---

# SHOW OFFERS

Provides information about all [offers](../../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md) added to a listing.

## Syntax

```sqlsyntax
SHOW OFFERS [ LIKE '<pattern>' ] IN LISTING <listing>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN LISTING listing`
:   The listing associated with the offer you want shown.

## Output

The command output provides offer properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | The offer name. |
| `state` | Offer status, one of:   * DRAFT * PUBLISHED * WITHDRAWN |
| `state_updated_on` | The date and time the offer state was last updated. |
| `access_start_date_preference` | The preferred date for consumer listing access, one of:   * OFFER_ACCEPTED_DATE * SPECIFIC_DATE |
| `comment` | Comments about the offer added by the provider. |
| `contract_value` | The total contract value. |
| `contract_type` | The contract type, one of:   * SUBSCRIPTION * LIMITED_TIME * PAY_AS_YOU_GO |
| `contract_duration_months` | The contract duration in months. |
| `invoice_start_date_preference` | The preferred invoicing start date, one of:   * OFFER_ACCEPTED_DATE * SPECIFIC_DATE * FIRST_DAY_NEXT_MONTH |
| `invoice_start_time` | The date and time invoicing started. |
| `is_default` | Specifies a default offer is included with the pricing plan, one of:   * TRUE * FALSE (default) |
| `display_name` | The offer name visible to consumers. |
| `expiration_time` | The date and time the offer expires. |
| `payment_terms` | Additional pricing plan parameters, one of:   * PAYMENT_TYPE * INSTALLMENT_SCHEDULE * ALLOWED_PAYMENT_METHODS |
| `pricing_plan_name` | The pricing plan associated with the offer. |
| `access_end_time` | The date and time consumers lose access to the listing. |
| `access_start_time` | The date and time consumers can access the listing. |
| `discount` | The offer discount. |
| `target_consumer` | The consumer the offer targets. |
| `terms_of_service` | The terms of service associated with the offer. |
| `additional_information` | Additional information about the offer. |
| `updated_on` | The date the offer was last updated. |

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE LISTING | Account | Only the ACCOUNTADMIN role has this privilege by default. The privilege can be granted to additional roles as needed. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all the offers with names that start with `myoffer` in `mylisting`:

```sqlexample
SHOW OFFERS LIKE 'MYOFFER%' IN LISTING MYLISTING;
```

---
title: SHOW ONLINE FEATURE TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-online-feature-tables.md
section: SQL Commands
---

# SHOW ONLINE FEATURE TABLES

Lists the [online feature tables](create-online-feature-table.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE ONLINE FEATURE TABLE](create-online-feature-table.md) , [ALTER ONLINE FEATURE TABLE](alter-online-feature-table.md), [DESCRIBE ONLINE FEATURE TABLE](desc-online-feature-table.md) , [DROP ONLINE FEATURE TABLE](drop-online-feature-table.md)

## Syntax

```sqlsyntax
SHOW ONLINE FEATURE TABLES [ LIKE '<pattern>' ]
                            [ IN
                               {
                                 ACCOUNT                  |
                                 DATABASE                 |
                                 DATABASE <database_name> |
                                 SCHEMA                   |
                                 SCHEMA <schema_name>     |
                                 <schema_name>
                               }
                            ]
                            [ STARTS WITH '<name_string>' ]
                            [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Creation time of the online feature table. |
| `name` | Name of the online feature table. |
| `database_name` | The database in which the online feature table resides. |
| `schema_name` | The schema in which the online feature table resides. |
| `rows` | Number of rows in the storage. |
| `bytes` | Number of bytes that will be scanned if the entire online feature table is scanned in a query.  Note that this number may be different than the number of actual physical bytes stored for the table. |
| `owner` | Role that owns the online feature table. |
| `source` | Name of the source of the online feature table data. |
| `target_lag` | The maximum duration that the online feature table’s content should lag behind real time. |
| `warehouse` | The warehouse used for online feature table refreshes. |
| `timestamp_column` | The timestamp column specified when the online feature table was created. |
| `refresh_mode` | `INCREMENTAL` if the table refreshes the data from source incrementally, or `FULL` if it ingests the full data source on every refresh. |
| `refresh_mode_reason` | Explanation for why the refresh mode was chosen. If Snowflake chose `FULL` when `INCREMENTAL` is supported, the output provides a reason for why it thinks full refresh performs better. `NULL` if no pertinent information is available. |
| `scheduling_state` | Displays `RUNNING` for online feature tables that are actively scheduling refreshes and `SUSPENDED` for suspended online feature tables. |
| `comment` | Comment for the online feature table. |

> **Note:**
>
> Numbers in the `rows` and `bytes` columns might not be accurate if data is changing frequently. You can run a `SELECT COUNT(*)` query on the table to get an accurate row count.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Schema | Role that has the USAGE privilege on the schema. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the online feature tables that you have the privileges to view in the PUBLIC schema of the `mydb` database:

```sqlexample
USE DATABASE mydb;

SHOW ONLINE FEATURE TABLES;
```

The following example lists all online feature tables in the current account that start with `feature_`:

```sqlexample
SHOW ONLINE FEATURE TABLES STARTS WITH 'feature_' IN ACCOUNT;
```

The following example lists online feature tables with names that match the pattern `%test%` in the `analytics` schema:

```sqlexample
SHOW ONLINE FEATURE TABLES LIKE '%test%' IN SCHEMA analytics;
```

---
title: SHOW OPENFLOW DATA PLANE INTEGRATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-oflow-data-plane-integration.md
section: SQL Commands
---

# SHOW OPENFLOW DATA PLANE INTEGRATIONS

List OPENFLOW DATA PLANE INTEGRATIONS.
Shows only OPENFLOW DATA PLANE INTEGRATIONS where the user running the command
has any of USAGE, MODIFY, or OWNERSHIP against the OPENFLOW DATA PLANE INTEGRATION.

See also:
:   [ALTER OPENFLOW DATA PLANE](alter-oflow-data-plane.md), [DESCRIBE OPENFLOW DATA PLANE INTEGRATION](desc-oflow-data-plane-integration.md)

## Syntax

```sqlsyntax
SHOW OPENFLOW DATA PLANE INTEGRATIONS [ LIKE '<pattern>' ]
              [ STARTS WITH '<name_string>' ]
              [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Optional parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* Openflow data plane integrations cannot be created directly, but rather are created when a deployment is created.
* To SHOW an OPENFLOW DATA PLANE INTEGRATION, you must be using a role that has USAGE, MODIFY, or OWNERSHIP privilege on the Openflow data plane integration.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides Openflow data plane integration properties and metadata in the following columns:

|  |  |
| --- | --- |
| Column | Description |
| `name` | Name of the Openflow data plane integration |
| `type` | Always `OPENFLOW_DATA_PLANE` |
| `category` | Always `OPENFLOW_DATA_PLANE` |
| `enabled` | True if enabled, otherwise false |
| `comment` | Associated comment. |
| `created_on` | Date and time the data plane integration was created |
| `data_plane_id` | Internal identifier for the data plane integration |

## Examples

Show all the data plane integrations with names that start with MYDATAPLANE:

> ```sqlexample
> SHOW OPENFLOW DATA PLANE INTEGRATIONS LIKE 'MYDATAPLANE%'
> ```

---
title: SHOW ORGANIZATION ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-organization-accounts.md
section: SQL Commands
---

# SHOW ORGANIZATION ACCOUNTS

Lists the [organization account](../../user-guide/organization-accounts.md) of the organization.

> **Important:**
>
> Previously, this command was used to list all accounts in the organization, but has been repurposed to list the organization account. If
> you want to list all accounts in the organization, use [SHOW ACCOUNTS](show-accounts.md).

See also:
:   [CREATE ORGANIZATION ACCOUNT](create-organization-account.md), [ALTER ORGANIZATION ACCOUNT](alter-organization-account.md)

## Syntax

```sqlsyntax
SHOW ORGANIZATION ACCOUNTS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Usage notes

* Only users with the GLOBALORGADMIN role can run this command, which means it can only be run from the organization account.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides global account properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `organization_name` | Name of the organization. |
| `account_name` | User-defined name that identifies an account within the organization. |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `edition` | [Snowflake Edition](../../user-guide/intro-editions.md) of the account. |
| `account_url` | Preferred Snowflake account URL that includes the values of organization_name and account_name. |
| `created_on` | Date and time when the account was created. |
| `comment` | Comment for the account. |
| `account_locator` | System-assigned identifier of the account. |
| `account_locator_url` | Legacy Snowflake account URL syntax that includes the region_name and account_locator. |
| `managed_accounts` | Indicates how many [managed accounts](../../user-guide/data-sharing-reader-create.md) have been created by the account. |
| `consumption_billing_entity_name` | Name of the consumption billing entity. |
| `marketplace_consumer_billing_entity_name` | Name of the marketplace consumer billing entity. |
| `marketplace_provider_billing_entity_name` | Name of the marketplace provider billing entity. |
| `old_account_url` | If the original [account URL](../../user-guide/organizations-connect.md) was saved when the account was renamed, provides the original URL. If the original account URL was dropped, the value is NULL even if the account was renamed. |
| `is_org_admin` | Indicates whether the ORGADMIN role is enabled in an account. If TRUE, the role is enabled. |
| `account_old_url_saved_on` | If the original account URL was saved when the account was renamed, provides the date and time when the original account URL was saved. |
| `account_old_url_last_used` | If the original account URL was saved when the account was renamed, indicates the last time the account was accessed using the original URL. |
| `organization_old_url` | If the account’s organization was changed in a way that created a new [account URL](../../user-guide/organizations-connect.md) and the original account URL was saved, provides the original account URL. If the original account URL was dropped, the value is NULL even if the organization changed. |
| `organization_old_url_saved_on` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, provides the date and time when the original account URL was saved. |
| `organization_old_url_last_used` | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, indicates the last time the account was accessed using the original account URL. |
| `is_events_account` | Indicates whether an account is an events account. For more information, see [Use logging and event tracing for an app](../../developer-guide/native-apps/event-about.md). |
| `is_organization_account` | Indicates whether an account is the [organization account](../../user-guide/organization-accounts.md). |

## Examples

Show information about the organization account:

```sqlexample
SHOW ORGANIZATION ACCOUNTS;
```

---
title: SHOW ORGANIZATION PROFILES
source: https://docs.snowflake.com/en/sql-reference/sql/show-organization-profiles.md
section: SQL Commands
---

# SHOW ORGANIZATION PROFILES

Lists the organization profiles for which you have access privileges.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md), [SHOW VERSIONS IN ORGANIZATION PROFILE](show-versions-in-organization-profile.md)

## Syntax

```sqlsyntax
SHOW ORGANIZATION PROFILES
```

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The date and time when the organization profile was created. |
| `name` | The organization profile name. |
| `system_generated` | Indicates the organization profile is system generated and can’t be dropped. One of `TRUE` or `FALSE`. |
| `state` | The organization profile state. One of ACTIVE or DRAFT. |
| `organization_name` | The name of the organization associated with the organization profile. |
| `title` | The title of the organization profile. |
| `description` | The description of the organization profile. |
| `owner_contact` | The contact email of the owner of the organization profile. |
| `approver_contact` | The contact email of the access approver of the organization profile. |
| `owner` | The owner role of the organization profile. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MODIFY or a privileged role, such as ACCOUNTADMIN or SECURITYADMIN | Organization profile |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example lists the organization profiles that you have privileges to access:

```sqlexample
SHOW ORGANIZATION PROFILES;
```

```output
+-------------------------+-----------------+---------------------+---------------------+---------------------+---------------------------+----------------------------------+---------------------+---------------------+---------------------+
|created_on               |name             |system_generated     |state                |organization_name    |title                      |description                       |owner_contact        |approver_contact     |owner                |
+-------------------------+-----------------+---------------------+---------------------+---------------------+---------------------------+----------------------------------+---------------------+---------------------+---------------------+
| 2025-01-01 01:01:01.000 |ORGPROFILE       |FALSE                |ACTIVE               |TESTORG              |My Organization Profile    |Organization profile description  |test@test.com        |test@test.com        |ACCOUNTADMIN         |
+-------------------------+-----------------+---------------------+---------------------+---------------------+---------------------------+----------------------------------+---------------------+---------------------+---------------------+
```

---
title: SHOW ORGANIZATION USER GROUPS
source: https://docs.snowflake.com/en/sql-reference/sql/show-organization-user-groups.md
section: SQL Commands
---

# SHOW ORGANIZATION USER GROUPS

Lists [organization user groups](../../user-guide/organization-users.md).

* If the command is executed in the organization account, it lists all organization user groups in the organization.
* If the command is executed in a regular account, it lists the organization user groups that are available to the account.

See also:
:   [CREATE ORGANIZATION USER GROUP](create-organization-user-group.md) , [ALTER ORGANIZATION USER GROUP](alter-organization-user-group.md) , [DROP ORGANIZATION USER GROUP](drop-organization-user-group.md)

## Syntax

```sqlsyntax
SHOW ORGANIZATION USER GROUPS
```

## Parameters

None

## Access control requirements

The access control requirements for this command vary depending on the account where it is being executed.

Regular account:
:   Executing this command in a regular account requires the ACCOUNTADMIN role.

Organization account:
:   A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
    [privileges](../../user-guide/security-access-control-overview.md) at a minimum:

    | Privilege | Object | Notes |
    | --- | --- | --- |
    | MANAGE ORGANIZATION USER GROUPS | Account | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

    For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

    For general information about roles and privilege grants for performing SQL actions on
    [securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the organization user group. |
| `is_imported` | When executed from a regular account, indicates whether the organization user group has been added to the account successfully. If TRUE, the organization user group was added and the role of the same name created. |
| `created_on` | Date and time when the organization user group was created. |
| `is_grantable` | When executed from a regular account, indicates whether the role that was imported from the organization user group can be granted to a local, account-specific role. If `TRUE`, the role imported from the organization user group can be granted to account-specific roles. |

## Examples

Show information about the organization user groups in the organization:

```sqlexample
SHOW ORGANIZATION USER GROUPS;
```

---
title: SHOW ORGANIZATION USERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-organization-users.md
section: SQL Commands
---

# SHOW ORGANIZATION USERS

Lists [organization users](../../user-guide/organization-users.md). Administrators in the organization account can use this command to list all
organization users in the organization. Administrators in a regular account use this command to list all organization users in a specific
organization user group that was added to the account.

See also:
:   [CREATE ORGANIZATION USER](create-organization-user.md) , [ALTER ORGANIZATION USER](alter-organization-user.md) , [DROP ORGANIZATION USER](drop-organization-user.md)

## Syntax

```sqlsyntax
SHOW ORGANIZATION USERS [ IN ORGANIZATION USER GROUP <org_user_group> ]
```

## Parameters

`IN ORGANIZATION USER GROUP org_user_group`
:   Name of an organization user group. This command displays all of the organization users in the specified group.

    Required when the command is executed by the account administrator in a regular account.

## Access control requirements

The access control requirements for this command vary depending on the account where it is being executed.

Regular account:
:   Executing this command in a regular account requires the ACCOUNTADMIN role.

Organization account:
:   A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
    [privileges](../../user-guide/security-access-control-overview.md) at a minimum:

    | Privilege | Object | Notes |
    | --- | --- | --- |
    | MANAGE ORGANIZATION USERS | Account | By default, only the GLOBALORGADMIN and USERADMIN system roles in the organization account have this privilege. |

    For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

    For general information about roles and privilege grants for performing SQL actions on
    [securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the organization user. |
| `created_on` | Date and time when the organization user was created. |
| `is_imported` | When executed from a regular account, indicates whether the organization user in the specified organization user group was successfully imported. |
| `display_name` | Name displayed for the user in Snowsight. |
| `login_name` | Name that the user enters to log into the system. |
| `first_name` | First name of the organization user. |
| `middle_name` | Middle name of the organization user. |
| `last_name` | Last name of the organization user. |
| `email` | Email address of the organization user. |
| `comment` | User-specified description of the organization user object. |

## Examples

As an organization administrator, list all of the organization users in the organization:

```sqlexample
USE ROLE GLOBALORGADMIN;

SHOW ORGANIZATION USERS;
```

As an account administrator, show information about the organization users in the organization user group `data_stewards`:

```sqlexample
USE ROLE ACCOUNTADMIN;

SHOW ORGANIZATION USERS IN ORGANIZATION USER GROUP data_stewards;
```

---
title: SHOW PACKAGES POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-packages-policies.md
section: SQL Commands
---

# SHOW PACKAGES POLICIES

Lists packages policy information.

## Syntax

```sqlsyntax
SHOW PACKAGES POLICIES [ IN
                            {
                              SCHEMA                   |
                              SCHEMA <schema_name>     |
                              <schema_name>
                            }
                       ]
```

## Parameters

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `SCHEMA`, . `SCHEMA schema_name`, . `schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

## Output

The command output provides policy properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the policy was created. |
| `name` | Name of the policy. |
| `database_name` | Database in which the policy is stored. |
| `schema_name` | Schema in which the policy is stored. |
| `kind` | The kind of policy. |
| `owner` | Role that owns the policy (i.e. has the OWNERSHIP privilege on the policy) |
| `comment` | Comment for the policy. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Packages policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |
| USAGE | Packages policy | Also grants the ability to execute a SHOW or DESCRIBE command on the packages policy. Can be granted to a role using the [GRANT <privileges> … TO ROLE](grant-privilege.md) command. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW PACKAGES POLICIES;
```

---
title: SHOW PARAMETERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-parameters.md
section: SQL Commands
---

# SHOW PARAMETERS

Lists all the account, session, and object parameters that can be set, as well as the current and default values for each parameter:

* Account parameters can only be set at the account level.
* Session parameters can be set at the account, user, and session level.
* Object parameters can be set at the account and object level.

If a parameter has been explicitly set, the output of this command also shows the level at which the parameter has been set.

For descriptions of the different parameter types, as well as detailed descriptions for each parameter, see [Parameters](../parameters.md).

## Syntax

```sqlsyntax
SHOW PARAMETERS
  [ LIKE '<pattern>' ]
  [ { IN | FOR } {
        { SESSION | ACCOUNT }
      | { USER | WAREHOUSE | DATABASE | SCHEMA | TASK } [ <name> ]
      | TABLE [ <table_or_view_name> ]
    } ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN | FOR`
:   `IN ...` or `FOR ...` specifies the scope of the command, which determines the parameters that are returned:

    `SESSION`
    :   Returns all the session parameters and their settings for the current session. A user can change these parameters for their session
        using [ALTER SESSION](alter-session.md).

    `ACCOUNT`
    :   Returns a list of the account, session, and object parameters that can be set at the account level. A user with the ACCOUNTADMIN role
        (i.e. account administrator) can change these parameters via [ALTER ACCOUNT](alter-account.md). For more information, see
        [Parameter management](../../user-guide/admin-account-management.md).

    `USER [ name ]`
    :   Returns a list of the session parameter defaults that are set for the specified user (or the current user) each time the user
        logs in.

        * If no user is specified, the command returns results for the current user.
        * An administrator with the appropriate user privileges can change the session parameter defaults for a user using [ALTER USER](alter-user.md).
        * Individual users can also change their session parameter defaults using [ALTER USER](alter-user.md).

    `WAREHOUSE | DATABASE | SCHEMA | TASK [ name ]`
    :   Returns the object parameters that can be set for the current/specified object. Users with the appropriate privileges can change these
        parameters using the corresponding [ALTER <object>](alter.md) command.

    `TABLE [ table_or_view_name ]`
    :   Returns the object parameters that can be set for the specified table or view. Users with the appropriate privileges can change these
        parameters using the [ALTER TABLE](alter-table.md) command.

        Use `TABLE` as the domain for all table-like objects, such as tables, views, and materialized views.

    Default: `SESSION`

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all the session parameters that can be set for the current session:

> ```sqlexample
> SHOW PARAMETERS;
>
> +-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | key                                 | value                            | default                          | level   | description                                                                                                                                                                         |
> |-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> | ABORT_DETACHED_QUERY                | false                            | false                            | SESSION | If true, Snowflake will automatically abort queries when it detects that the client has disappeared.                                                                                |
> | AUTOCOMMIT                          | true                             | true                             | SESSION | The autocommit property determines whether is statement should to be implicitly                                                                                                     |
> |                                     |                                  |                                  |         | wrapped within a transaction or not. If autocommit is set to true, then a                                                                                                           |
> |                                     |                                  |                                  |         | statement that requires a transaction is executed within a transaction                                                                                                              |
> |                                     |                                  |                                  |         | implicitly. If autocommit is off then an explicit commit or rollback is required                                                                                                    |
> |                                     |                                  |                                  |         | to close a transaction. The default autocommit value is true.                                                                                                                       |
> | AUTOCOMMIT_API_SUPPORTED            | true                             | true                             |         | Whether autocommit feature is enabled for this client. This parameter is for                                                                                                        |
> |                                     |                                  |                                  |         | Snowflake use only.                                                                                                                                                                 |
> | BINARY_INPUT_FORMAT                 | HEX                              | HEX                              |         | input format for binary                                                                                                                                                             |
> | BINARY_OUTPUT_FORMAT                | HEX                              | HEX                              |         | display format for binary                                                                                                                                                           |
> | CLIENT_SESSION_KEEP_ALIVE           | false                            | false                            |         | If true, client session will not expire automatically                                                                                                                               |
> | DATE_INPUT_FORMAT                   | AUTO                             | AUTO                             |         | input format for date                                                                                                                                                               |
> | DATE_OUTPUT_FORMAT                  | YYYY-MM-DD                       | YYYY-MM-DD                       |         | display format for date                                                                                                                                                             |
> | ERROR_ON_NONDETERMINISTIC_MERGE     | true                             | true                             |         | raise an error when attempting to merge-update a row that joins many rows                                                                                                           |
> | ERROR_ON_NONDETERMINISTIC_UPDATE    | false                            | false                            |         | raise an error when attempting to update a row that joins many rows                                                                                                                 |
> | LOCK_TIMEOUT                        | 43200                            | 43200                            |         | Number of seconds to wait while trying to lock a resource, before timing out                                                                                                        |
> |                                     |                                  |                                  |         | and aborting the statement. A value of 0 turns off lock waiting i.e. the                                                                                                            |
> |                                     |                                  |                                  |         | statement must acquire the lock immediately or abort. If multiple resources                                                                                                         |
> |                                     |                                  |                                  |         | need to be locked by the statement, the timeout applies separately to each                                                                                                          |
> |                                     |                                  |                                  |         | lock attempt.                                                                                                                                                                       |
> | QUERY_TAG                           |                                  |                                  |         | String (up to 2000 characters) used to tag statements executed by the session                                                                                                       |
> | QUOTED_IDENTIFIERS_IGNORE_CASE      | false                            | false                            |         | If true, the case of quoted identifiers is ignored                                                                                                                                  |
> | ROWS_PER_RESULTSET                  | 0                                | 0                                |         | maxium number of rows in a result set                                                                                                                                               |
> | STATEMENT_QUEUED_TIMEOUT_IN_SECONDS | 0                                | 0                                |         | Timeout in seconds for queued statements: statements will automatically be canceled if they are queued on a warehouse for longer than this amount of time; disabled if set to zero. |
> | STATEMENT_TIMEOUT_IN_SECONDS        | 0                                | 0                                |         | Timeout in seconds for statements: statements will automatically be canceled if they run for longer than this amount of time; disabled if set to zero.                              |
> | TIMESTAMP_DAY_IS_ALWAYS_24H         | false                            | true                             | SYSTEM  | If set, arithmetic on days always uses 24 hours per day,                                                                                                                            |
> |                                     |                                  |                                  |         | possibly not preserving the time (due to DST changes)                                                                                                                               |
> | TIMESTAMP_INPUT_FORMAT              | AUTO                             | AUTO                             |         | input format for timestamp                                                                                                                                                          |
> | TIMESTAMP_LTZ_OUTPUT_FORMAT         |                                  |                                  |         | Display format for TIMESTAMP_LTZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                 |
> | TIMESTAMP_NTZ_OUTPUT_FORMAT         | YYYY-MM-DD HH24:MI:SS.FF3        | YYYY-MM-DD HH24:MI:SS.FF3        | SYSTEM  | Display format for TIMESTAMP_NTZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                 |
> | TIMESTAMP_OUTPUT_FORMAT             | YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM | YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM | SYSTEM  | Default display format for all timestamp types.                                                                                                                                     |
> | TIMESTAMP_TYPE_MAPPING              | TIMESTAMP_NTZ                    | TIMESTAMP_NTZ                    | SYSTEM  | If TIMESTAMP type is used, what specific TIMESTAMP* type it should map to:                                                                                                          |
> |                                     |                                  |                                  |         |   TIMESTAMP_LTZ (default), TIMESTAMP_NTZ or TIMESTAMP_TZ                                                                                                                            |
> | TIMESTAMP_TZ_OUTPUT_FORMAT          |                                  |                                  |         | Display format for TIMESTAMP_TZ values. If empty, TIMESTAMP_OUTPUT_FORMAT is used.                                                                                                  |
> | TIMEZONE                            | America/Los_Angeles              | America/Los_Angeles              |         | time zone                                                                                                                                                                           |
> | TIME_INPUT_FORMAT                   | AUTO                             | AUTO                             |         | input format for time                                                                                                                                                               |
> | TIME_OUTPUT_FORMAT                  | HH24:MI:SS                       | HH24:MI:SS                       |         | display format for time                                                                                                                                                             |
> | TRANSACTION_ABORT_ON_ERROR          | false                            | false                            |         | If this parameter is true, and a statement issued within a non-autocommit                                                                                                           |
> |                                     |                                  |                                  |         | transaction returns with an error, then the non-autocommit transaction is                                                                                                           |
> |                                     |                                  |                                  |         | aborted. All statements issued inside that transaction will fail until an                                                                                                           |
> |                                     |                                  |                                  |         | commit or rollback statement is executed to close that transaction.                                                                                                                 |
> | TRANSACTION_DEFAULT_ISOLATION_LEVEL | READ COMMITTED                   | READ COMMITTED                   |         | The default isolation level when starting a starting a transaction, when no                                                                                                         |
> |                                     |                                  |                                  |         | isolation level was specified                                                                                                                                                       |
> | TWO_DIGIT_CENTURY_START             | 1970                             | 1970                             |         | For 2-digit dates, defines a century-start year.                                                                                                                                    |
> |                                     |                                  |                                  |         | For example, when set to 1980:                                                                                                                                                      |
> |                                     |                                  |                                  |         |   - parsing a string '79' will produce 2079                                                                                                                                         |
> |                                     |                                  |                                  |         |   - parsing a string '80' will produce 1980                                                                                                                                         |
> | UNSUPPORTED_DDL_ACTION              | ignore                           | ignore                           |         | The action to take upon encountering an unsupported ddl statement                                                                                                                   |
> | USE_CACHED_RESULT                   | true                             | true                             |         | If enabled, query results can be reused between successive invocations of the same query as long as the original result has not expired                                             |
> +-------------------------------------+----------------------------------+----------------------------------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> ```
>
> Note that the output for this example does not include any of the account or object parameters because they cannot be set at the session level.
>
> For more information about account parameters, as well as setting parameters at the account level, see [Parameter management](../../user-guide/admin-account-management.md).

Show all the object parameters that can be set for the specified warehouse (`testwh`):

> ```sqlexample
> SHOW PARAMETERS IN WAREHOUSE testwh;
>
> +-------------------------------------+--------+---------+-------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> | key                                 | value  | default | level | description                                                                                                                                                                                                                   |
> |-------------------------------------+--------+---------+-------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
> | MAX_CONCURRENCY_LEVEL               | 8      | 8       |       | Concurrency level for SQL statements (i.e. queries and DML) executed by a warehouse cluster (used to determine when statements are queued or additional clusters are started). Small SQL statements count as a fraction of 1. |
> | STATEMENT_QUEUED_TIMEOUT_IN_SECONDS | 0      | 0       |       | Timeout in seconds for queued statements: statements will automatically be canceled if they are queued on a warehouse for longer than this amount of time; disabled if set to zero.                                           |
> | STATEMENT_TIMEOUT_IN_SECONDS        | 172800 | 172800  |       | Timeout in seconds for statements: statements are automatically canceled if they run for longer; if set to zero, max value (604800) is enforced.                                                                              |
> +-------------------------------------+--------+---------+-------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
> ```

Show all the object parameters that can be set for the current database (`testdb`):

> ```sqlexample
> USE DATABASE testdb;
>
> SHOW PARAMETERS IN DATABASE;
>
> +-----------------------------+-------+---------+-------+------------------------------------------------------------------+
> | key                         | value | default | level | description                                                      |
> |-----------------------------+-------+---------+-------+------------------------------------------------------------------|
> | DATA_RETENTION_TIME_IN_DAYS | 1     | 1       |       | number of days to retain the old version of deleted/updated data |
> +-----------------------------+-------+---------+-------+------------------------------------------------------------------+
> ```

---
title: SHOW PASSWORD POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-password-policies.md
section: SQL Commands
---

# SHOW PASSWORD POLICIES

Lists password policy information, including the creation date, database and schema names, owner, and any available comments.

See also:
:   [DDL commands](../../user-guide/password-authentication.md)

## Syntax

```sqlsyntax
SHOW PASSWORD POLICIES [ LIKE '<pattern>' ]
                       [ IN
                            {
                              ACCOUNT                                         |

                              DATABASE                                        |
                              DATABASE <database_name>                        |

                              SCHEMA                                          |
                              SCHEMA <schema_name>                            |

                              APPLICATION <application_name>                  |
                              APPLICATION PACKAGE <application_package_name>  |
                            }
                         |
                         ON
                            {
                              ACCOUNT           |
                              USER <user_name>  |
                            }
                       ]
                       [ STARTS WITH '<name_string>' ]
                       [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`[ ON ... ]`
:   Lists the policies that are effective on the specified object. This command considers precedence.
    For example, listing policies on a user will show the account or built-in policy that is effective
    for the user if there is no policy set specifically on the user. Specify one of the following:

    `ACCOUNT`
    :   Returns policies effective on the account.

    `USER user_name`
    :   Returns policies effective on the specified user.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PASSWORD POLICY | Account |  |
| OWNERSHIP | Password policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on password policy DDL and privileges, see [DDL commands](../../user-guide/password-authentication.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the password policy was created. |
| `name` | Name of the password policy. |
| `database_name` | Name of the database where the password policy was created. NULL for the built-in system policy. |
| `schema_name` | Name of the schema where the password policy was created. NULL for the built-in system policy. |
| `kind` | `PASSWORD_POLICY` |
| `owner` | Role that owns the password policy. |
| `comment` | Comment that was defined for the password policy when it was created or altered. |
| `owner_role_type` | Role type of the owner. |
| `options` | For SHOW PASSWORD POLICIES ON, details about how the policy is set. |
| `set_on` | For SHOW PASSWORD POLICIES ON, the object type where the policy is set: USER, ACCOUNT, or SYSTEM. |

## Example

```sqlexample
SHOW PASSWORD POLICIES;
```

```output
+---------------------------------+------------------------+------------+------------------------------------+---------+
| CREATED_ON                      | NAME                   | OWNER      | COMMENT                            | options |
+---------------------------------+------------------------+------------+------------------------------------+---------+
| Fri, 10 Dec 2021 00:00:00 -0700 | PASSWORD_POLICY_PROD_1 | PROD_ADMIN | production account password policy | ""      |
+---------------------------------+------------------------+------------+------------------------------------+---------+
```

---
title: SHOW PIPES
source: https://docs.snowflake.com/en/sql-reference/sql/show-pipes.md
section: SQL Commands
---

# SHOW PIPES

Lists the pipes for which you have access privileges. This command can be used to list the pipes for a specified database or schema
(or the current database/schema for the session), or your entire account.

See also:
:   [ALTER PIPE](alter-pipe.md) , [CREATE PIPE](create-pipe.md) , [DESCRIBE PIPE](desc-pipe.md) , [DROP PIPE](drop-pipe.md)

## Syntax

```sqlsyntax
SHOW PIPES [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                                         |

                  DATABASE                                        |
                  DATABASE <database_name>                        |

                  SCHEMA                                          |
                  SCHEMA <schema_name>                            |
                  <schema_name>

                  APPLICATION <application_name>                  |
                  APPLICATION PACKAGE <application_package_name>  |
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* Returns results only for the pipe owner (that is, the role with the OWNERSHIP privilege on the pipe), a role with the MONITOR or OPERATE
  privilege on the pipe, or a role with the global MONITOR EXECUTION privilege.
* To determine the current status of a pipe, query the [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) function.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

The command output provides pipe properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the pipe was created. |
| `name` | The name of the pipe object.  Manually created pipes: This is the name defined in the CREATE PIPE statement.  Default pipe (Snowpipe Streaming high-performance): The value is derived from the target table name; for example, MY_TABLE-STREAMING. |
| `database_name` | The name of the database that contains the Snowpipe object.  Manually created pipe: The name of the database that the pipe object belongs to.  Default pipe (Snowpipe Streaming high-performance): The name of the target table’s database. |
| `schema_name` | The name of the schema that contains the Snowpipe object.  Manually created pipe: The name of the schema that the pipe object belongs to.  Default pipe: The name of the target table’s schema. |
| `definition` | COPY statement used to load data from queued files into a Snowflake table. |
| `owner` | The name of the role that possesses the OWNERSHIP privilege on the pipe object.  Named pipe: The name of the role that owns the pipe, which is the role specified in the CREATE PIPE statement or granted ownership later.  Default pipe (Snowpipe Streaming high-performance): This column displays NULL. |
| `notification_channel` | Amazon Resource Name of the Amazon SQS queue for the stage named in the DEFINITION column. |
| `comment` | A user-provided or system-generated text string that describes the pipe object.  Named pipe: The user-defined comment that is provided during the CREATE PIPE statement.  Default pipe (Snowpipe Streaming High-Performance): A system-generated string that is always the following sentences: “Default pipe for Snowpipe Streaming High Performance ingestion to a table. Created and managed by Snowflake.” |
| `integration` | Name of the notification integration for pipes that rely on notification events to trigger data loads from Google Cloud Storage or Microsoft Azure cloud storage. |
| `pattern` | PATTERN copy option value in the [COPY INTO <table>](copy-into-table.md) statement in the pipe definition, if the copy option was specified. |
| `error_integration` | Notification integration name for pipes that rely on error events in Amazon S3 cloud storage to trigger notifications. |
| `owner_role_type` | The type of entity that currently owns the object.  Standard ownership: The type of object that holds the OWNERSHIP privilege. For a standard Snowflake role owner, the value is ROLE. If a Snowflake Native App owns the object, the value is APPLICATION.  Default pipe (Snowpipe Streaming High-Performance): This column displays NULL.  Deleted objects: If the pipe object was deleted, this column displays NULL, as a deleted object no longer has an active owner role. |
| `invalid_reason` | Displays some detailed information for your pipes that might have issues. You can use the provided information to troubleshoot your pipes more effectively along with [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md). If there is no issue with the pipe, the value is NULL. |
| `kind` | The kind of the pipe, which is STAGE. |

## Examples

Show all the pipes that you have privileges to view in the `public` schema in the `mydb` database:

> ```sqlexample
> use database mydb;
>
> show pipes;
> ```

---
title: SHOW POSTGRES INSTANCES
source: https://docs.snowflake.com/en/sql-reference/sql/show-postgres-instances.md
section: SQL Commands
---

# SHOW POSTGRES INSTANCES

Lists the [Snowflake Postgres instances](../../user-guide/snowflake-postgres/about.md) for which you have access privileges.

See also:
:   [CREATE POSTGRES INSTANCE](create-postgres-instance.md), [ALTER POSTGRES INSTANCE](alter-postgres-instance.md), [DESCRIBE POSTGRES INSTANCE](desc-postgres-instance.md), [DROP POSTGRES INSTANCE](drop-postgres-instance.md)

## Syntax

```sqlsyntax
SHOW POSTGRES INSTANCES [ LIKE '<pattern>' ]
                        [ STARTS WITH '<name_string>' ]
                        [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the Postgres instance. |
| `owner` | Role that owns the Postgres instance. |
| `owner_role_type` | Type of the owner role (for example, ROLE or DATABASE_ROLE). |
| `created_on` | Date and time when the Postgres instance was created. |
| `updated_on` | Date and time when the Postgres instance was last updated. |
| `type` | Type of the Postgres instance. |
| `origin` | Origin of the Postgres instance (for example, if forked from another instance). |
| `host` | Hostname used to connect to the Postgres instance. |
| `privatelink_service_identifier` | Identifier for the Private Link service, if Private Link is configured for the instance. |
| `compute_family` | [Compute family](../../user-guide/snowflake-postgres/postgres-instance-sizes.md) (instance size) of the Postgres instance. |
| `authentication_authority` | Authentication method used for the instance (currently `POSTGRES`). |
| `storage_size` | Storage size allocated to the Postgres instance, in GB. |
| `postgres_version` | Major version of Postgres running on the instance. |
| `postgres_settings` | Custom [Postgres server settings](../../user-guide/snowflake-postgres/postgres-server-settings.md) configured for the instance. |
| `is_ha` | Whether [high availability](../../user-guide/snowflake-postgres/high-availability.md) is enabled for the instance. |
| `retention_time` | Data retention time for the instance. |
| `state` | Current [state](../../user-guide/snowflake-postgres/managing-instances.md) of the instance. Possible values: `CREATING`, `RESTORING`, `STARTING`, `REPLAYING`, `FINALIZING`, `READY`, `RESTARTING`, `RESUMING`, `SUSPENDING`, `SUSPENDED`. |
| `comment` | Comment for the Postgres instance. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OPERATE or OWNERSHIP | Postgres instance | Only instances for which you have one of these privileges appear in the output. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Use this command to monitor the state and configuration of your Postgres instances for capacity planning,
  troubleshooting, and auditing purposes.
* Common use cases include checking instance states during operations, identifying instances that need upgrades,
  and reviewing storage usage across your account.

## Examples

List all Postgres instances in the account:

```sqlexample
SHOW POSTGRES INSTANCES;
```

List Postgres instances with names starting with `prod`:

```sqlexample
SHOW POSTGRES INSTANCES STARTS WITH 'PROD';
```

List Postgres instances matching a pattern:

```sqlexample
SHOW POSTGRES INSTANCES LIKE 'DEV_%';
```

Use the [flow operator](../operators-flow.md) to filter and select specific columns:

```sqlexample
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "state", "compute_family", "storage_size"
      FROM $1
      WHERE "state" = 'READY'
      ORDER BY "name";
```

Find all instances with high availability enabled:

```sqlexample
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "compute_family", "is_ha", "postgres_version"
      FROM $1
      WHERE "is_ha" = 'true';
```

Get a summary of storage usage across all instances:

```sqlexample
SHOW POSTGRES INSTANCES
  ->> SELECT "name", "storage_size", "created_on"
      FROM $1
      WHERE "storage_size" > 100
      ORDER BY "storage_size" DESC;
```

---
title: SHOW PRICING PLANS
source: https://docs.snowflake.com/en/sql-reference/sql/show-pricing-plans.md
section: SQL Commands
---

# SHOW PRICING PLANS

Lists visible and hidden [pricing plans](../../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md).

## Syntax

```sqlsyntax
SHOW PRICING PLANS [ LIKE '<pattern>' ] IN LISTING <listing>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN LISTING listing`
:   The listing associated with the pricing plan you want shown.

## Output

The command output provides pricing plan properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | The pricing plan display name. |
| `state` | Pricing plan status, one of:   * DRAFT * PUBLISHED * RETIRED |
| `display_name` | The pricing plan name visible to providers. |
| `currency` | The pricing plan currency code. |
| `pricing_model` | The pricing plan pricing model, one of:   * FLAT_FEE * USAGE_BASED |
| `usage_details` | The pricing plan usage details. |
| `base_fee` | The pricing plan base fee. |
| `billing_duration_months` | The pricing plan billing duration in months. |
| `sales_motion` | The pricing plan sales method, one of:   * SELF_SERVE * TALK_TO_SALES |
| `comment` | Comments about the pricing plan added by the provider. |
| `metadata` | The pricing plan metadata added by the provider. |
| `visibility` | The pricing plan visibility, one of:   * VISIBLE * HIDDEN |
| `contract_type` | The pricing plan contract type, one of:   * SUBSCRIPTION * LIMITED_TIME |
| `contract_duration_months` | The pricing plan duration in months. |
| `updated_on` | The date and time the pricing plan was last updated. |

## Usage notes

* You can show a pricing plan only if the listing exists and has a DRAFT or PUBLISHED status.
* You can show a pricing plan only if you use a role that has the [Global CREATE LISTING privilege](../../user-guide/data-exchange-marketplace-privileges.md).

## Examples

Show all the pricing plans with names that start with `mypricingplan` in listing `mylisting`:

```sqlexample
SHOW PRICING PLANS LIKE 'MYPRICINGPLAN%' IN LISTING 'MYLISTING';
```

---
title: SHOW PRIMARY KEYS
source: https://docs.snowflake.com/en/sql-reference/sql/show-primary-keys.md
section: SQL Commands
---

# SHOW PRIMARY KEYS

Lists primary keys for one or more tables. You can specify the following options:

* A single table
* All tables in the current or specified schema
* All tables in the current or specified database
* All tables in the current account

## Syntax

```sqlsyntax
SHOW [ TERSE ] PRIMARY KEYS
    [ IN { ACCOUNT | DATABASE [ <database_name> ] | SCHEMA [ <schema_name> ] | TABLE | [ TABLE ] <table_name> } ]
```

## Parameters

`TERSE`
:   This clause is accepted in the syntax but has no effect on the output.

`IN { ACCOUNT | DATABASE [ <database_name> ] | SCHEMA [ <schema_name> ] | TABLE | [ TABLE ] <table_name> }`
:   Specifies the scope of the command, which determines whether the command lists records only for the current or specified database,
    schema, table, or account.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify the keyword `TABLE` without a `table_name`, then:

    * If there is a current database, then:

      + If there is a current schema, then the command retrieves records for the current schema in the current database.
      + If there is no current schema, then the command retrieves records for all schemas in the current database.
    * If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    If you specify a `<table_name>` (with or without the keyword `TABLE`), then:

    * If you specify a fully-qualified `<table_name>` (e.g. `my_database_name.my_schema_name.my_table_name`),
      then the command retrieves all records for the specified table.
    * If you specify a schema-qualified `<table_name>` (e.g. `my_schema_name.my_table_name`), then:

      + If a current database exists, then the command retrieves all records for the specified table.
      + If no current database exists, then the command displays an error similar to
        `Cannot perform SHOW <object_type>. This session does not have a current database...`.
    * If you specify an unqualified `<table_name>`, then:

      + If a current database and current schema exist, then the command retrieves records for the specified table in the current
        schema of the current database.
      + If no current database exists or no current schema exists, then the command displays an error similar to:
        `SQL compilation error: <object> does not exist or not authorized.`.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (that is, the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (that is, the command returns the objects you have privileges to view in your account).

## Usage notes

* For each single-column primary key, the output contains one row.
* For each multi-column primary key, the output contains one row for each column in the primary key.
* If an account (or database or schema) has a large number of tables, searching the entire account (or table or schema)
  can consume a significant amount of compute resources.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

> **Important:**
>
> For standard tables, Snowflake does not enforce PRIMARY KEY constraints; however, they are enforced on
> [hybrid tables](../../user-guide/tables-hybrid.md).

## Output

The command output provides primary key properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the table was created. |
| `database_name` | Database in which the table is stored. |
| `schema_name` | Schema in which the table is stored. |
| `table_name` | Name of the table. |
| `column_name` | Name of the column in the primary key. |
| `key_sequence` | If the primary key is composed of multiple columns, the number in the `key_sequence` column indicates the order of those columns in the primary key. For example, if the primary key is defined as `CONSTRAINT pkey1 PRIMARY KEY (column_x, column_y)`, the `key_sequence` number for `column_x` is 1 and the key_sequence number for `column_y` is 2. |
| `comment` | The comment (if any) specified for the constraint when the constraint was created. |
| `constraint_name` | The name of the constraint. |

## Examples

```sqlexample
SHOW PRIMARY KEYS;

SHOW PRIMARY KEYS IN ACCOUNT;

SHOW PRIMARY KEYS IN DATABASE;

SHOW PRIMARY KEYS IN DATABASE my_database;

SHOW PRIMARY KEYS IN SCHEMA;

SHOW PRIMARY KEYS IN SCHEMA my_schema;

SHOW PRIMARY KEYS IN SCHEMA my_database.my_schema;

SHOW PRIMARY KEYS IN my_table;

SHOW PRIMARY KEYS IN my_database.my_schema.my_table;
```

---
title: SHOW PRIVACY POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-privacy-policies.md
section: SQL Commands
---

# SHOW PRIVACY POLICIES

Lists the [privacy policies](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) for which you have access privileges.

You can use this command to list the privacy policies in the current database and schema for the session, a specified database or schema,
or your entire account.

See also:
:   [CREATE PRIVACY POLICY](create-privacy-policy.md) , [ALTER PRIVACY POLICY](alter-privacy-policy.md) , [DESCRIBE PRIVACY POLICY](desc-privacy-policy.md) , [DROP PRIVACY POLICY](drop-privacy-policy.md)

## Syntax

```sqlsyntax
SHOW PRIVACY POLICIES [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT
                  | DATABASE [ <database_name> ]
                  | SCHEMA [ <schema_name> ]
                }
           ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PRIVACY POLICY | Account |  |
| APPLY | Privacy policy |  |
| OWNERSHIP | Privacy policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the privacy policies that you have the privileges to view in the `PRIVACY_POLICY_DB` database:

```sqlexample
USE DATABASE privacy_policy_db;
SHOW PRIVACY POLICIES;
```

```output
+---------------------------------+----------------+-------------------------------------+-------------------------------------+----------------+--------------+---------+-----------------+---------+
| created_on                      | name           | database_name                       | schema_name                         | kind           | owner        | comment | owner_role_type | options |
|---------------------------------+----------------+-------------------------------------+-------------------------------------+----------------+--------------+---------+-----------------+---------|
| Fri, 23 Jun 2021 07:00:00 +0000 | MY_PRIV_POLICY | PRIVACY_POLICY_DB                   | PRIVACY_POLICY_SH                   | PRIVACY_POLICY | ACCOUNTADMIN |         | ROLE            |         |
+---------------------------------+----------------+-------------------------------------+-------------------------------------+----------------+--------------+---------+-----------------+---------+
```

---
title: SHOW PRIVILEGES
source: https://docs.snowflake.com/en/sql-reference/sql/show-privileges.md
section: SQL Commands
---

# SHOW PRIVILEGES

Lists the privileges granted to an application.

## Syntax

```sqlsyntax
SHOW PRIVILEGES IN APPLICATION <name>
```

## Parameters

`name`
:   Specifies the name of the application.

## Output

Specifies the privileges granted to an application.

| Column | Description |
| --- | --- |
| privilege | The name of the privilege as specified in the manifest file. |
| description | A description of the privilege, which is specified in the manifest file. For details, refer to [Access control privileges](../../user-guide/security-access-control-privileges.md). |
| is_granted | Specifies if the consumer has granted the privilege. |
| is_grantable | Specifies if the user running the command has an [activated role](../../user-guide/security-access-control-overview.md) that can grant this privilege |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW PROCEDURES
source: https://docs.snowflake.com/en/sql-reference/sql/show-procedures.md
section: SQL Commands
---

# SHOW PROCEDURES

Lists all stored procedures that you have privileges to access, including built-in and user-defined procedures.

For a command that lists only user-defined procedures, see [SHOW USER PROCEDURES](show-user-procedures.md).

See also:
:   [ALTER PROCEDURE](alter-procedure.md) , [CREATE PROCEDURE](create-procedure.md) , [DROP PROCEDURE](drop-procedure.md) , [DESCRIBE PROCEDURE](desc-procedure.md)

## Syntax

```sqlsyntax
SHOW PROCEDURES [ LIKE '<pattern>' ]
  [ IN
    {
      ACCOUNT                                         |

      CLASS <class_name>                              |

      DATABASE                                        |
      DATABASE <database_name>                        |

      SCHEMA                                          |
      SCHEMA <schema_name>                            |
      <schema_name>

      APPLICATION <application_name>                  |
      APPLICATION PACKAGE <application_package_name>  |
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* If you specify `CLASS`, the command only returns the following columns:

  ```output
  | name | min_num_arguments | max_num_arguments | arguments | descriptions | language |
  ```

## Output

The command output provides procedure properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp at which the stored procedure was created. |
| `name` | Name of the stored procedure. |
| `schema_name` | Name of the schema in which the stored procedure exists. |
| `is_builtin` | `Y` if the stored procedure is built-in (rather than user-defined); `N` otherwise. |
| `is_aggregate` | Not applicable currently. |
| `is_ansi` | `Y` if the stored procedure is defined in the ANSI standard; `N` otherwise. |
| `min_num_arguments` | Minimum number of arguments. |
| `max_num_arguments` | Maximum number of arguments. |
| `arguments` | Data types of the arguments and of the return types. Optional arguments are displayed with the `DEFAULT` keyword. For [Snowflake Scripting stored procedures](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md), `OUT` is displayed for output arguments. |
| `description` | Description of the stored procedure. |
| `catalog_name` | Name of the database in which the stored procedure exists. |
| `is_table_function` | `Y` if the stored procedure returns tabular data; `N` otherwise. |
| `valid_for_clustering` | Not applicable currently. |
| `is_secure` | `Y` if the stored procedure is a secure stored procedure; `N` otherwise. |

## Examples

Show all procedures:

```sqlexample
SHOW PROCEDURES;
```

This example shows how to use `SHOW PROCEDURE` on a stored procedure that has a parameter. This also shows how to limit the list of
procedures to those that match the specified regular expression.

```javascript
SHOW PROCEDURES LIKE 'area_of_%';
+-------------------------------+----------------+--------------------+------------+--------------+---------+-------------------+-------------------+------------------------------------+------------------------+-----------------------+-------------------+----------------------+-----------+
| created_on                    | name           | schema_name        | is_builtin | is_aggregate | is_ansi | min_num_arguments | max_num_arguments | arguments                          | description            | catalog_name          | is_table_function | valid_for_clustering | is_secure |
|-------------------------------+----------------+--------------------+------------+--------------+---------+-------------------+-------------------+------------------------------------+------------------------+-----------------------+-------------------+----------------------+-----------|
| 1967-06-23 00:00:00.123 -0700 | AREA_OF_CIRCLE | TEMPORARY_DOC_TEST | N          | N            | N       |                 1 |                 1 | AREA_OF_CIRCLE(FLOAT) RETURN FLOAT | user-defined procedure | TEMPORARY_DOC_TEST_DB | N                 | N                    | N         |
+-------------------------------+----------------+--------------------+------------+--------------+---------+-------------------+-------------------+------------------------------------+------------------------+-----------------------+-------------------+----------------------+-----------+
```

The output columns are similar to the output columns for [SHOW FUNCTIONS](show-functions.md) and
[SHOW USER FUNCTIONS](show-user-functions.md). For stored procedures, some of these columns are not currently meaningful
(e.g. `is_aggregate`, `valid_for_clustering`), but are reserved for future use.

---
title: SHOW PROJECTION POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-projection-policies.md
section: SQL Commands
---

# SHOW PROJECTION POLICIES

Lists [projection policy](../../user-guide/projection-policies.md) information, including the creation date, database and schema names,
owner, and any available comments.

See also:
:   [Projection policy DDL reference](../../user-guide/projection-policies.md)

## Syntax

```sqlsyntax
SHOW PROJECTION POLICIES [ LIKE '<pattern>' ]
                         [ IN
                              {
                                ACCOUNT                  |

                                DATABASE [ <database_name> ] |

                                SCHEMA [ <schema_name> ]     |
                              }
                         ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY PROJECTION POLICY | Account |  |
| APPLY | Projection policy |  |
| OWNERSHIP | Projection policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on projection policy DDL and privileges, see [Privileges and commands](../../user-guide/projection-policies.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW PROJECTION POLICIES;
```

---
title: SHOW REFERENCES
source: https://docs.snowflake.com/en/sql-reference/sql/show-references.md
section: SQL Commands
---

# SHOW REFERENCES

Lists the references defined for an application in the manifest file and the references the
consumer has associated to the application.

## Syntax

```sqlsyntax
SHOW REFERENCES IN APPLICATION <name>
```

## Parameters

`name`
:   Specifies the name of the application.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Application | To run this command you must have the ownership privilege on the app. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

This command displays the following information about the references defined for the application:

| Column | Description |
| --- | --- |
| name | The name of the reference. |
| label | The label of the reference as specified in the manifest file. |
| description | A description of the reference and what it does. |
| privileges | The privileges that the reference requires. Refer to [Object types and privileges that a reference can contain](../../developer-guide/native-apps/requesting-refs.md) for the list of privileges that a reference can require for an object. |
| object_type | The type of object associated with the reference. Refer to [Object types and privileges that a reference can contain](../../developer-guide/native-apps/requesting-refs.md) for a list of the supported objects for a reference. |
| multi-valued | Indicates if the reference requires more than one type of object. |
| object_name | The name of the object specified by the reference after the consumer associates the object with the application. |
| schema_name | The name of the schema of the object associated with this reference or NULL if no object has been associated or if the associated object is an account object. |
| database_name | The name of database of the object associated with this reference or NULL if one of the following is true:   * No object is specified in the reference definition. * The object is not a database or database object. |
| alias | A name that uniquely identifies a reference to an object, including the object name, scope and privileges |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW REGIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-regions.md
section: SQL Commands
---

# SHOW REGIONS

Lists all the [regions](../../user-guide/intro-regions.md) in which accounts can be created. This command returns the Snowflake Region name,
the cloud provider (AWS, Google Cloud Platform, or Microsoft Azure) that hosts the account, and the cloud provider’s name for the region.

See also:
:   [CURRENT_REGION](../functions/current_region.md)

## Syntax

```sqlsyntax
SHOW REGIONS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The command output provides region properties and metadata in the following columns. The command output for organizations that span multiple [region groups](../../user-guide/admin-account-identifier.md) includes an additional
`region_group` column.

| Column | Description |
| --- | --- |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `cloud` | Name of the cloud provider that hosts the account. |
| `region` | Region where the account is located; i.e. the cloud provider’s name for the region. |
| `display_name` | Human-readable cloud region name, e.g. `US West (Oregon)` |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW RELEASE CHANNELS
source: https://docs.snowflake.com/en/sql-reference/sql/show-release-channels.md
section: SQL Commands
---

# SHOW RELEASE CHANNELS

Lists the
[release channels](../../developer-guide/native-apps/release-channels.md) for
an application package or listing.

## Syntax

```sqlsyntax
SHOW RELEASE CHANNELS IN APPLICATION PACKAGE <application_package_name>

SHOW RELEASE CHANNELS IN LISTING <listing_name>
```

## Parameters

`application_package_name`
:   Specifies the identifier of the application package.

`listing_name`
:   Specifies the identifier of the listing.

## Output

The command output displays release channel properties and metadata in the following columns:

**Output for application packages**

| Column | Description |
| --- | --- |
| `name` | The type of the release channel. The following values are possible: `QA`, `ALPHA`, and `DEFAULT`. |
| `description` | A description of the release channel. |
| `versions` | The versions defined in the release channel. |
| `default_version_name` | The name of the version specified in the default release directive of the release channel. |
| `default_patch_number` | The patch number in the default release directive of the release channel. |
| `targets` | The target accounts added to the release channel. This only applies to the nondefault channels.  You cannot add target accounts to the default channel. However, you can add targets to the custom release directives of the default release channel. |
| `created_on` | The timestamp when the release channel was created. |
| `updated_on` | The timestamp when the release channel was last updated. |

**Output for listings**

| Column | Description |
| --- | --- |
| `name` | The type of the release channel. The following values are possible: `QA`, `ALPHA`, and `DEFAULT`. |
| `version` | The version of the app included in the listing. |
| `patch` | The patch number of the app included in the listing. |
| `description` | A description of the release channel. |
| `created_on` | The timestamp when the release channel was created. |
| `updated_on` | The timestamp when the release channel was last updated. |

---
title: SHOW RELEASE DIRECTIVES
source: https://docs.snowflake.com/en/sql-reference/sql/show-release-directives.md
section: SQL Commands
---

# SHOW RELEASE DIRECTIVES

Lists the release directives defined for an application package.

The output returns metadata and properties for the release directives in an application package,
ordered lexicographically by name. This is important to note if you want to filter the results
using the provided filters.

See also:
:   [ALTER APPLICATION PACKAGE](alter-application-package.md), [CREATE APPLICATION PACKAGE](create-application-package.md),
    [DROP APPLICATION PACKAGE](drop-application-package.md), [SHOW APPLICATION PACKAGES](show-application-packages.md)

## Syntax

```sqlsyntax
SHOW RELEASE DIRECTIVES [ LIKE '<pattern>' ]
  IN APPLICATION PACKAGE <name>
  [ FOR RELEASE CHANNEL <release_channel> ]
```

## Parameters

`name`
:   Specifies the identifier of the application package.

`LIKE 'pattern'`
:   Optionally filters the command output by the version name specified in the application
    package. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%v1%' ...`

    `... LIKE '%V1%' ...`

    . Default: No value (no filtering is applied to the output).

`FOR RELEASE CHANNEL release_channel`
:   Returns only the release directives defined for the specified release channels.

## Output

The command output provides release directive properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Specifies the name of the release directive. For the default release directive, the name is `DEFAULT`. |
| `target_type` | Specifies the type of target for the directive. The following values are possible:   * DEFAULT * ACCOUNT |
| `target_name` | Specifies the name of the organization or account. The value for the default release directive is always `NULL`. |
| `created_on` | Specifies the timestamp when the release directive was created. |
| `version` | Specifies the application version literal if applicable; if not, the value is NULL. |
| `patch` | Specifies the patch number of the application version if applicable; if not, the value is NULL. |
| `modified_on` | Specifies the timestamp when the release directive was last modified or NULL if it hasn’t been modified. |
| `active_regions` | Specifies the list of Snowflake regions where the release directive is allowed to affect upgrades. This value is ignored when `RELEASE_STATUS` is `HOLDING`. |
| `pending_regions` | Specifies the list of Snowflake regions where the release directive will be applied in the future. Upgrade progress in active regions is monitored for a period before new regions are activated. |
| `release_status` | Specifies the current release status. The following values are possible:   * IN_PROGRESS: Upgrades are proceeding in the listed `ACTIVE_REGIONS`. * HOLDING: Upgrades are temporarily suspended. * DEPLOYED: Upgrades are permitted in all regions where the app is installed. |
| `deployed_on` | Specifies the time and date the release directive was deployed. When too many target regions are identified as unhealthy during deployment, the release directive temporarily moves to `HOLDING`. |
| `release_channel` | Specifies the release channel the release directive belongs to. |
| `upgrade_in_maintenance_window` | TRUE if the upgrade is configured to respect consumer maintenance windows, otherwise FALSE. |
| `upgrade_deadline` | The deadline by which the upgrade must be completed. After this time, the system automatically upgrades the application regardless of the consumer’s maintenance policy. |

## Usage notes

* This command requires the OWNERSHIP privilege, the MANAGE
  RELEASES privilege, or the MANAGE VERSIONS privilege on the application package.
* The command returns results for release directives that match the privileges granted to the role that
  executes this command.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

```sqlexample
SHOW RELEASE DIRECTIVES IN APPLICATION PACKAGE hello_snowflake_package;
```

```output
+---------+-------------+---------------------------------+-------------------------------+---------+-------+-------------------------------+------------------------+--------------------------+----------------+-------------------------------+
| name    | target_type | target_name                     | created_on                    | version | patch | modified_on                   | active_regions         | pending_regions          | release_status | deployed_on                   |
|---------+-------------+---------------------------------+-------------------------------+---------+-------+-------------------------------+------------------------+--------------------------+----------------+-------------------------------+
| DEFAULT | DEFAULT     | NULL                            | 2023-04-02 14:55:17.304 -0700 | V2      |     0 | 2023-04-02 15:47:08.673 -0700 | PUBLIC.AWS_AP_SOUTH_1  | PUBLIC.AWS_AP_SOUTH_1    | IN PROGRESS    |                               |
| NEW_RD  | ACCOUNT     | [PROVIDER_DEV.PROVIDER_AWS]     | 2023-04-02 16:30:44.443 -0700 | V1      |     1 | 2023-04-03 07:10:42.428 -0700 | ALL                    |                          | DEPLOYED       | 2023-04-03 07:10:42.428 -0700 |         |
+---------+-------------+---------------------------------+-------------------------------+---------+-------+-------------------------------+------------------------+--------------------------+----------------+-------------------------------+
```

---
title: SHOW REPLICATION ACCOUNTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-replication-accounts.md
section: SQL Commands
---

# SHOW REPLICATION ACCOUNTS

Lists all the accounts in your organization that are enabled for replication and indicates the [region](../../user-guide/intro-regions.md) in
which each account is located.

> **Note:**
>
> Use this SQL command instead of [SHOW GLOBAL ACCOUNTS](show-global-accounts.md), which is deprecated.

See also:
:   [SHOW REPLICATION DATABASES](show-replication-databases.md)

## Syntax

```sqlsyntax
SHOW REPLICATION ACCOUNTS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Output

The command output provides replication account properties and metadata in the following columns. The command output for organizations that span multiple [region groups](../../user-guide/admin-account-identifier.md) includes an additional
`region_group` column.

| Column | Description |
| --- | --- |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time when the account was created. |
| `account_name` | Account name in your organization. |
| `account_locator` | Account locator in a region. |
| `comment` | Comment for the account. |
| `organization_name` | Name of your Snowflake organization. |
| `is_org_admin` | Indicates whether the ORGADMIN role is enabled in an account. If TRUE, the role is enabled. |

## Usage notes

* Only account administrators (users with the ACCOUNTADMIN role) can execute this SQL command.
* Returns results only when an account administrator (user with the ACCOUNTADMIN role) executes the command in a replication-enabled
  account.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the global accounts whose name starts with `myaccount`:

```sqlexample
SHOW REPLICATION ACCOUNTS LIKE 'myaccount%';
```

---
title: SHOW REPLICATION DATABASES
source: https://docs.snowflake.com/en/sql-reference/sql/show-replication-databases.md
section: SQL Commands
---

# SHOW REPLICATION DATABASES

Lists all the primary and secondary databases (that is to say, all the databases for which replication has been enabled) in your account
and indicates the [region](../../user-guide/intro-regions.md) in which each account is located.

See also:
:   [SHOW REPLICATION ACCOUNTS](show-replication-accounts.md)

## Syntax

```sqlsyntax
SHOW REPLICATION DATABASES [ LIKE '<pattern>' ]
                           [ WITH PRIMARY <account_identifier>.<primary_db_name> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`WITH PRIMARY {account_identifier}.{primary_db_name}`
:   Specifies the scope of the command, which determines whether the command lists records only for the specified primary database.
    The `account_identifier` can be in the form `org_name.account_name` or `snowflake_region.account_locator`.
    See [Account identifiers for replication and failover](../../user-guide/admin-account-identifier.md) for details.

## Output

The command output provides primary and secondary database properties and metadata in the following columns. The command output for organizations that span multiple [region groups](../../user-guide/admin-account-identifier.md) includes an additional
`region_group` column.

| Column | Description |
| --- | --- |
| `region_group` | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. **Note**: This column is only displayed for organizations that span multiple region groups. |
| `snowflake_region` | Snowflake Region where the account that stores the database is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time when the database was created. |
| `account_name` | Name of the account in which the database is stored. |
| `name` | Name of the database. |
| `comment` | Comment for the database. |
| `is_primary` | Whether the database is a primary database; otherwise, is a secondary database. |
| `primary` | Fully-qualified name of a primary database, including the region, account, and database name. |
| `replication_allowed_to_accounts` | Where `IS_PRIMARY` is TRUE, shows the fully-qualified names of accounts where replication has been enabled for this primary database. A secondary database can be created in each of these accounts. |
| `failover_allowed_to_accounts` | Where `IS_PRIMARY` is TRUE, shows the fully-qualified names of accounts where failover has been enabled for this primary database. A secondary database can be created in each of these accounts for business continuity and disaster recovery. |
| `organization_name` | Name of your Snowflake organization. |
| `account_locator` | Account locator in a region. |

## Usage notes

* Returns results for a role with any privilege on the database (for example, USAGE or MONITOR).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the replication databases whose name starts with `mydb`:

```sqlexample
SHOW REPLICATION DATABASES LIKE 'mydb%';
```

Show all the secondary databases for the `myorg.account1.mydb1` org, account, and primary database, respectively:

```sqlexample
SHOW REPLICATION DATABASES WITH PRIMARY myorg.account1.mydb1;
```

---
title: SHOW REPLICATION GROUPS
source: https://docs.snowflake.com/en/sql-reference/sql/show-replication-groups.md
section: SQL Commands
---

# SHOW REPLICATION GROUPS

Displays information about [replication groups and failover groups](../../user-guide/account-replication-intro.md).

* Lists each primary or secondary replication or failover group in this account.
* Lists primary replication and failover groups in other accounts enabled for replication to this account.
* Lists secondary replication and failover groups in other accounts linked to groups in this account.

See also:
:   [CREATE REPLICATION GROUP](create-replication-group.md) , [ALTER REPLICATION GROUP](alter-replication-group.md) , [DROP REPLICATION GROUP](drop-replication-group.md)

## Syntax

```sqlsyntax
SHOW REPLICATION GROUPS [ IN ACCOUNT <account> ]
```

## Parameters

`account`
:   Specifies the identifier for the account.

## Output

The command returns the following columns:

| Column | Description |
| --- | --- |
| `region_group` | Region group where the account is located. **Note:** this column is only visible to organizations that span multiple [Region groups](../../user-guide/admin-account-identifier.md). |
| `snowflake_region` | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| `created_on` | Date and time replication or failover group was created. |
| `account_name` | Name of the account. |
| `name` | Name of the replication or failover group. |
| `type` | Type of group. Valid values are `REPLICATION` or `FAILOVER`. |
| `comment` | Comment string. |
| `is_primary` | Indicates whether the replication or failover group is the primary group. |
| `primary` | Name of the primary group. |
| `object_types` | List of specified object types enabled for replication (and failover in the case of a `FAILOVER` group). |
| `allowed_integration_types` | A list of integration types that are enabled for replication.  Snowflake always includes this column in the output even if integrations were not specified in the CREATE *<object>* or ALTER *<object>* command. |
| `allowed_accounts` | List of accounts enabled for replication and failover. |
| `organization_name` | Name of your Snowflake organization. |
| `account_locator` | Account locator in a region. |
| `replication_schedule` | Scheduled interval for refresh; NULL if no replication schedule is set. |
| `secondary_state` | Current state of scheduled refresh. Valid values are `started` or `suspended`. NULL if no replication schedule is set. |
| `next_scheduled_refresh` | Date and time of the next scheduled refresh. |
| `owner` | Name of the role with the OWNERSHIP privilege on the replication or failover group. NULL if the replication or failover group is in a different region. |
| `is_listing_auto_fulfillment_group` | TRUE if the replication group is used for [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md). FALSE otherwise. |

## Usage notes

* Executing this command requires a role with any one of the following privileges on a replication group:

  + MONITOR
  + OWNERSHIP
  + REPLICATE
* The output of SHOW REPLICATION GROUPS includes groups of types `FAILOVER` and `REPLICATION`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List replication groups in `myaccount1`:

```sqlexample
SHOW REPLICATION GROUPS IN ACCOUNT myaccount1;

+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+---------+-----------------------------------+
| snowflake_region | created_on                    | account_name | name | type     | comment | is_primary | primary               | object_types                                | allowed_integration_types | allowed_accounts                             | organization_name | account_locator   | replication_schedule | secondary_state | next_scheduled_refresh        | owner   | is_listing_auto_fulfillment_group |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+---------+-----------------------------------+
| AWS_US_EAST_1    | 2021-10-25 19:08:15.209 -0700 | MYACCOUNT1   | MYFG | FAILOVER |         | true       | MYORG.MYACCOUNT1.MYFG | DATABASES, ROLES, USERS, WAREHOUSES, SHARES |                           | MYORG.MYACCOUNT1.MYFG,MYORG.MYACCOUNT2.MYFG  | MYORG             | MYACCOUNT1LOCATOR | 10 MINUTE            |                 |                               | MYROLE  | false                             |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+---------+-----------------------------------+
| AWS_US_WEST_2    | 2021-10-25 19:08:15.209 -0700 | MYACCOUNT2   | MYFG | FAILOVER |         | false      | MYORG.MYACCOUNT1.MYFG |                                             |                           |                                              | MYORG             | MYACCOUNT2LOCATOR | 10 MINUTE            | STARTED         | 2022-03-06 12:10:35.280 -0800 | NULL    | false                             |
+------------------+-------------------------------+--------------+------+----------+---------+------------+-----------------------+---------------------------------------------+---------------------------+----------------------------------------------+-------------------+-------------------+----------------------+-----------------+-------------------------------+---------+-----------------------------------+
```

---
title: SHOW RESOURCE MONITORS
source: https://docs.snowflake.com/en/sql-reference/sql/show-resource-monitors.md
section: SQL Commands
---

# SHOW RESOURCE MONITORS

Lists all the resource monitors in your account for which you have access privileges.

See also:
:   [ALTER RESOURCE MONITOR](alter-resource-monitor.md) , [CREATE RESOURCE MONITOR](create-resource-monitor.md) , [DROP RESOURCE MONITOR](drop-resource-monitor.md)

## Syntax

```sqlsyntax
SHOW RESOURCE MONITORS [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Usage notes

* The command output includes a `level` column with the following values:

  + `WAREHOUSE`: The resource monitor is assigned to one or more warehouses and, therefore, is monitoring the credit usage for
    the assigned warehouse(s).
  + `ACCOUNT`: The resource monitor is assigned at the account-level and, therefore, monitoring the credit usage for your entire
    account.
  + `NULL`: The resource monitor is not assigned to the account or any warehouses and, therefore, is not monitoring any credit
    usage.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

---
title: SHOW ROLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-roles.md
section: SQL Commands
---

# SHOW ROLES

Lists all the roles which you can view across your entire account, including the system-defined roles and any custom roles that exist.

> **Important:**
>
> Snowflake allows users to list roles; however, the ability to list roles is not the same as using any role. Knowing the names of
> roles does not allow any additional access.
>
> This is a part of Discretionary Access Control and Role-Based Access Control. For more information, see
> [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [SHOW GRANTS](show-grants.md) , [CREATE ROLE](create-role.md) , [ALTER ROLE](alter-role.md) , [DROP ROLE](drop-role.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] ROLES
  [ LIKE '<pattern>' ]
  [ IN CLASS <class_name> ]
  [ STARTS WITH '<name_string>']
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of columns:

    `is_default`
    :   Specifies whether the role used to run the command is the user’s default role.

    `is_current`
    :   Specifies whether the role used to run the command is the user’s current role.

    `is_inherited`
    :   Specifies whether the role used to run the command inherits the specified role.

    `is_from_organization_user_group`
    :   If TRUE, the role was imported from an [organization user group](../../user-guide/organization-users.md).

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN CLASS class_name`
:   Returns records for the specified class (`class_name`).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* If you specify `CLASS`, only the following columns are returned:

  ```output
  | created_on | name | comment |
  ```

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all roles:

> ```sqlexample
> SHOW ROLES;
> ```
>
> ```output
> ---------------------------------+---------------+------------+------------+--------------+-------------------+------------------+---------------+---------------+--------------------------+
>            created_on            |     name      | is_default | is_current | is_inherited | assigned_to_users | granted_to_roles | granted_roles |     owner     |         comment          |
> ---------------------------------+---------------+------------+------------+--------------+-------------------+------------------+---------------+---------------+--------------------------+
>  Fri, 05 Dec 2014 16:25:06 -0800 | ACCOUNTADMIN  | Y          | Y          | N            | 1                 | 0                | 2             |               |                          |
>  Mon, 15 Dec 2014 17:58:33 -0800 | ANALYST       | N          | N          | N            | 0                 | 6                | 0             | SECURITYADMIN | Data analyst             |
>  Fri, 05 Dec 2014 16:25:06 -0800 | PUBLIC        | N          | N          | Y            | 0                 | 0                | 0             |               |                          |
>  Fri, 05 Dec 2014 16:25:06 -0800 | SECURITYADMIN | N          | N          | Y            | 0                 | 1                | 0             |               |                          |
>  Fri, 05 Dec 2014 16:25:06 -0800 | SYSADMIN      | N          | N          | Y            | 5                 | 1                | 2             |               |                          |
> ---------------------------------+---------------+------------+------------+--------------+-------------------+------------------+---------------+---------------+--------------------------+
> ```

In this example:

* The ACCOUNTADMIN system-defined role is the current role and default role for the current (i.e. logged-in) user.
* In addition to the four system-defined roles, one custom role (ANALYST) has been created. The role is owned by the SECURITYADMIN
  system-defined role.

Return up to ten account roles in the account after the first role named `my_role2`:

```sqlexample
SHOW ROLES LIMIT 10 FROM 'my_role2';
```

---
title: SHOW ROLES IN SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/show-roles-in-service.md
section: SQL Commands
---

# SHOW ROLES IN SERVICE

Lists all the service roles associated with a service. These are the roles defined in the service specification. For more information, see [Managing service-related privileges](../../developer-guide/snowpark-container-services/working-with-services.md).

See also:
:   [REVOKE SERVICE ROLE](revoke-service-role.md), [GRANT SERVICE ROLE](grant-service-role.md),
    [SHOW GRANTS](show-grants.md)

## Syntax

```sqlsyntax
SHOW ROLES IN SERVICE <name>
```

## Parameters

`name`
:   Name of the service.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Service |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Output

The command output provides the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the service role was created |
| `name` | Service role name |
| `comment` | Comment, if any, for the service role |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the service roles in the `echo_service` service.

```sqlexample
SHOW ROLES IN SERVICE echo_service;
```

```output
+-------------------------------+-------------------------+------------+
| created_on                    |   name                      |  comment   |
+-------------------------------+-------------------------+------------+
| 2024-04-29 14:58:50.063 -0700 |   ALL_ENDPOINTS_USAGE   |            |
+-------------------------------+-------------------------+------------+
```

---
title: SHOW ROW ACCESS POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-row-access-policies.md
section: SQL Commands
---

# SHOW ROW ACCESS POLICIES

Lists the row access policies for which you have access privileges. Returns information that includes the creation date, database and
schema names, owner, and any available comments.

See also:
:   [Row access policy DDL](../../user-guide/security-row-intro.md)

## Syntax

```sqlsyntax
SHOW ROW ACCESS POLICIES [ LIKE '<pattern>' ]
                         [ LIMIT <rows> [ FROM '<name_string>' ] ]
                         [ IN
                              {
                                ACCOUNT                                         |

                                DATABASE                                        |
                                DATABASE <database_name>                        |

                                SCHEMA                                          |
                                SCHEMA <schema_name>                            |
                                <schema_name>

                                APPLICATION <application_name>                  |
                                APPLICATION PACKAGE <application_package_name>  |
                              }
                         ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY ROW ACCESS POLICY | Account |  |
| APPLY | Row access policy |  |
| OWNERSHIP | Row access policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on row access policy DDL and privileges, see [Manage row access policies](../../user-guide/security-row-intro.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example is representative of a user with the ACCOUNTADMIN role executing the query.

> ```sqlexample
> SHOW ROW ACCESS POLICIES;
> ```
>
> ```output
> +---------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> |          created_on             | name | database_name | schema_name |       kind        |    owner     | comment | options | owner_role_type |
> |---------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> | Fri, 23 Jun 1967 00:00:00 -0700  | P1   | RLS_AUTHZ_DB  | S_D_1       | ROW_ACCESS_POLICY | ACCOUNTADMIN |         | ""      | ROLE           |
> | Fri, 23 Jun 1967 00:00:00 -0700  | P2   | RLS_AUTHZ_DB  | S_D_2       | ROW_ACCESS_POLICY | ACCOUNTADMIN |         | ""      | ROLE           |
> +---------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> ```

The following example is representative of a role that does not have USAGE on the parent schema in which row access policies exist and is
not the ACCOUNTADMIN role.

> ```sqlexample
> SHOW ROW ACCESS POLICIES;
> ```
>
> ```output
> +--------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> |         created_on             | name | database_name | schema_name |       kind        |    owner     | comment | options | owner_role_type |
> |--------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> +--------------------------------+------+---------------+-------------+-------------------+--------------+---------+---------+-----------------+
> ```

---
title: SHOW RUN … IN EXPERIMENT
source: https://docs.snowflake.com/en/sql-reference/sql/show-run-in-experiment.md
section: SQL Commands
---

# SHOW RUN … IN EXPERIMENT

Displays logged parameters or metrics for [experiment runs](../../developer-guide/snowflake-ml/experiments.md).

See also:
:   [CREATE EXPERIMENT](create-experiment.md) , [ALTER EXPERIMENT](alter-experiment.md), [SHOW EXPERIMENTS](show-experiments.md) , [DROP EXPERIMENT](drop-experiment.md) , [SHOW RUNS IN EXPERIMENT](show-runs-in-experiment.md)

## Syntax

```sqlsyntax
SHOW RUN METRICS [ LIKE '<pattern>' ]
  IN EXPERIMENT <experiment_name> [ RUN <run_name> ]
  [ LIMIT <rows> [ FROM <name_string> ] ]

SHOW RUN PARAMETERS [ LIKE '<pattern>' ]
  IN EXPERIMENT <experiment_name> [ RUN <run_name> ]
  [ LIMIT <rows> [ FROM <name_string> ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`METRICS`
:   Display metrics logged for runs.

`PARAMETERS`
:   Display parameters logged for runs.

`IN EXPERIMENT experiment_name`
:   The name of the experiment containing the runs to query.

`RUN run_name`
:   The name of an individual run to query.

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the SHOW RUN METRICS command includes the following columns, which describe the properties and metadata of runs:

| Column | Description |
| --- | --- |
| `run_name` | The name of the run. |
| `name` | The name of the metric. |
| `step` | The step of the metric value. |
| `value` | The value of the metric at the specified step. |

The output of the SHOW RUN PARAMETERS command includes the following columns, which describe the properties and metadata of runs:

| Column | Description |
| --- | --- |
| `run_name` | The name of the run. |
| `name` | The name of the parameter. |
| `value` | The value of the parameter. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Experiment |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

---
title: SHOW RUNS IN EXPERIMENT
source: https://docs.snowflake.com/en/sql-reference/sql/show-runs-in-experiment.md
section: SQL Commands
---

# SHOW RUNS IN EXPERIMENT

Lists the runs in an [experiment](../../developer-guide/snowflake-ml/experiments.md).

See also:
:   [CREATE EXPERIMENT](create-experiment.md) , [ALTER EXPERIMENT](alter-experiment.md), [SHOW EXPERIMENTS](show-experiments.md) , [DROP EXPERIMENT](drop-experiment.md) , [SHOW RUN … IN EXPERIMENT](show-run-in-experiment.md)

## Syntax

```sqlsyntax
SHOW RUNS [ LIKE '<pattern>' ] IN EXPERIMENT <name>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`name`
:   Specifies the identifier of the experiment to inspect.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the run was created. |
| `name` | The identifier for the run. |
| `database_name` | The database that the run is stored in. |
| `schema_name` | The schema that the run is stored in. |
| `experiment_name` | The experiment that the run belongs to. |
| `metadata` | A JSON object containing the run status and metrics.  The `status` field of the run indicates if it’s `RUNNING` or `FINISHED`.  The `metrics` field reports the metrics of the run. Only the latest metric value (the one with the highest `step`) is included. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Experiment |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

---
title: SHOW SCHEMAS
source: https://docs.snowflake.com/en/sql-reference/sql/show-schemas.md
section: SQL Commands
---

# SHOW SCHEMAS

Lists the schemas for which you have access privileges, including dropped schemas that are still within the Time Travel retention period
and, therefore, can be undropped. The command can be used to list schemas for the current/specified database, or across your entire
account.

The output returns schema metadata and properties, ordered lexicographically by database and schema name. This is important to note if
you wish to filter the results using the provided filters.

See also:
:   [CREATE SCHEMA](create-schema.md) , [ALTER SCHEMA](alter-schema.md) , [DESCRIBE SCHEMA](desc-schema.md) , [DROP SCHEMA](drop-schema.md) , [UNDROP SCHEMA](undrop-schema.md)

    [SCHEMATA view](../info-schema/schemata.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW [ TERSE ] SCHEMAS
  [ HISTORY ]
  [ LIKE '<pattern>' ]
  [ IN { ACCOUNT | DATABASE [ <db_name> ] | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> } ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
  [ WITH PRIVILEGES <object_privilege> [ , <object_privilege> [ , ... ] ] ]
```

## Parameters

`TERSE`
:   Returns output containing only the following columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

    Note that `kind` and `schema_name` always display `NULL` because `kind` is not applicable for schemas and
    `schema_name` is redundant with `name`.

    Default: No value (all columns are included in the output)

`HISTORY`
:   Includes dropped schemas that have not yet been purged (i.e. they are still within their respective Time Travel retention periods).
    If multiple versions of a dropped schema exist, the output displays a row for each version. The output also includes an additional
    `dropped_on` column, which displays:

    * Date and timestamp (for dropped schemas)
    * `NULL` (for active schemas).

    Default: No value (dropped schemas are not included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN { ACCOUNT | [ DATABASE ] [ db_name ] | APPLICATION application_name | APPLICATION PACKAGE application_package_name  }`
:   Specifies the scope of the command, which determines whether the command lists records only for the current/specified database or
    across your entire account.

    The `APPLICATION` and `APPLICATION PACKAGE` keywords are not required, but they specify the scope for the named Snowflake Native App.

    The `DATABASE` keyword is not required; you can set the scope by specifying only the database name. Likewise, the database name
    is not required if the session currently has a database in use.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

`WITH PRIVILEGES object_privilege [ , object_privilege [ , ... ] ]`
:   Optionally limits rows to objects for which the [active role](../../user-guide/security-access-control-overview.md) for the current
    user has been granted all of the specified privileges in the list on the object.

    If a CREATE <object> privilege is included in the privileges list, the command excludes objects for which secondary roles have
    been granted privileges. This is because only the primary role has the authorization to create objects. For more information, see
    [Authorization through primary role and secondary roles](../../user-guide/security-access-control-overview.md).

`OBJECT_VISIBILITY`
:   [Preview Feature](../../release-notes/preview-features.md) — Open

    Available to all accounts.

    This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account,
    enabling users without explicit access privileges to find objects and request access.

## Usage notes

* When you specify the scope to either `APPLICATION` or the database named `SNOWFLAKE`, the `owner` column returns
  `SNOWFLAKE` as the owner for the schema named `LOCAL`. For example:

  > ```sqlexample
  > SHOW SCHEMAS IN APPLICATION my_app;
  > SHOW SCHEMAS IN DATABASE SNOWFLAKE;
  > ```

  The `owner` column returns:

  > ```output
  > +-----+-------+-----+-----------+-----+
  > | ... | name  | ... | owner     | ... |
  > +-----+-------+-----+-----------+-----+
  > | ... | LOCAL | ... | SNOWFLAKE | ... |
  > +-----+-------+-----+-----------+-----+
  > ```

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* The `HISTORY` and `WITH PRIVILEGES` parameters are mutually exclusive; they cannot both be used in the same statement.

## Examples

Show all schemas in the current database, `mytestdb`, that you have privileges to view:

```sqlexample
SHOW SCHEMAS;
```

```output
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+-----------------+-------------------+
| created_on                      | name               | is_default | is_current | database_name | owner  | comment                                                   | options | retention_time | owner_role_type | object_visibility |
|---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+-----------------+-------------------+
| Fri, 13 May 2016 17:58:37 -0700 | INFORMATION_SCHEMA | N          | N          | MYTESTDB      |        | Views describing the contents of schemas in this database |         |              1 | ROLE            | NULL              |
| Wed, 25 Feb 2015 16:16:54 -0800 | PUBLIC             | N          | Y          | MYTESTDB      | PUBLIC |                                                           |         |              1 | ROLE            | NULL              |
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+-----------------+-------------------+
```

Show all schemas in the current database, `mytestdb`, that you have privileges to view, including dropped schemas (this example
builds on the [DROP SCHEMA](drop-schema.md) examples):

```sqlexample
SHOW SCHEMAS HISTORY;
```

```output
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+---------------------------------+-----------------+-------------------+
| created_on                      | name               | is_default | is_current | database_name | owner  | comment                                                   | options | retention_time | dropped_on                      | owner_role_type | object_visibility |
|---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+---------------------------------+-----------------+-------------------+
| Fri, 13 May 2016 17:59:50 -0700 | INFORMATION_SCHEMA | N          | N          | MYTESTDB      |        | Views describing the contents of schemas in this database |         |              1 | NULL                            |                 | NULL              |
| Wed, 25 Feb 2015 16:16:54 -0800 | PUBLIC             | N          | Y          | MYTESTDB      | PUBLIC |                                                           |         |              1 | NULL                            | ROLE            | NULL              |
| Tue, 17 Mar 2015 16:42:29 -0700 | MYSCHEMA           | N          | N          | MYTESTDB      | PUBLIC |                                                           |         |              1 | Fri, 13 May 2016 17:25:32 -0700 | ROLE            | NULL              |
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+---------------------------------+-----------------+-------------------+
```

Show all schemas in the current database that you have been granted the USAGE privilege on:

```sqlexample
SHOW SCHEMAS WITH PRIVILEGES USAGE;
```

```output
+-------------------------------+----------------+------------+------------+-----------------------------------------------------------+--------------+---------+---------+----------------+-----------------+-------------------+
| created_on                    | name           | is_default | is_current | database_name                                             | owner        | comment | options | retention_time | owner_role_type | object_visibility |
|-------------------------------+----------------+------------+------------+-----------------------------------------------------------+--------------+---------+---------+----------------+-----------------+-------------------+
| 2023-01-27 15:01:12.940 -0800 | PUBLIC         | N          | N          | BOOKS_DB                                                  | DATA_ADMIN   |         |         | 1              | ROLE            | NULL              |
| 2023-09-15 15:22:51.164 -0700 | PUBLIC         | N          | N          | TEST_DB                                                   | ACCOUNTADMIN |         |         | 4              | ROLE            | NULL              |
| 2023-01-13 10:58:49.584 -0800 | ACCOUNT_USAGE  | N          | N          | SNOWFLAKE                                                 |              |         |         | 1              |                 | NULL              |
+-------------------------------+----------------+------------+------------+-----------------------------------------------------------+--------------+---------+---------+----------------+-----------------+-------------------+
```

---
title: SHOW SECRETS
source: https://docs.snowflake.com/en/sql-reference/sql/show-secrets.md
section: SQL Commands
---

# SHOW SECRETS

Lists the secrets for which you have rights to see. This command can be used to list the secrets for a specified database
or schema (or the current database/schema for the session), or your entire account.

See also:
:   [ALTER SECRET](alter-secret.md) , [CREATE SECRET](create-secret.md) , [DESCRIBE SECRET](desc-secret.md) , [DROP SECRET](drop-secret.md)

## Syntax

```sqlsyntax
SHOW SECRETS [ LIKE '<pattern>' ]
             [ IN { ACCOUNT | [ DATABASE ] <db_name> | [ SCHEMA ] <schema_name> | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> } ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Secret |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Snowflake never returns the `PASSWORD` property value.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

---
title: SHOW SEMANTIC DIMENSIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-semantic-dimensions.md
section: SQL Commands
---

# SHOW SEMANTIC DIMENSIONS

Lists the dimensions in the [semantic views](../../user-guide/views-semantic/overview.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
SHOW SEMANTIC DIMENSIONS [ LIKE '<pattern>' ]
                         [ IN
                              {
                                <semantic_view_name>           |

                                ACCOUNT                        |

                                DATABASE                       |
                                DATABASE <db_name>             |

                                SCHEMA                         |
                                SCHEMA <db_name>.<schema_name>
                              }
                         ]
                         [ STARTS WITH '<name_string>' ]
                         [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `semantic_view_name`
    :   Returns records for the specified semantic view.

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

    `SCHEMA`, . `SCHEMA db_name.schema_name`
    :   Returns records for the current schema in use or a specified schema (`db_name.schema_name`). You must specify the
        fully qualified name of the schema.

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `database_name` | Name of the database that contains the semantic view. |
| `schema_name` | Name of the schema that contains the semantic view. |
| `semantic_view_name` | Name of the semantic view that contains the dimension. |
| `table_name` | Name of the logical table for the dimension. |
| `name` | Name of the dimension. |
| `data_type` | Data type of the dimension. |
| `synonyms` | Alternative names or synonyms for the dimension. |
| `comment` | Comment about the dimension. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the dimensions for semantic views that you have any privilege on. The list includes dimensions in
semantic views in the current schema of the current database.

```sqlexample
SHOW SEMANTIC DIMENSIONS;
```

```output
+---------------+-------------+--------------------+------------+-------------------------+-------------+-------------------+--------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name                    | data_type   | synonyms          | comment                        |
|---------------+-------------+--------------------+------------+-------------------------+-------------+-------------------+--------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_COUNTRY_CODE   | VARCHAR(15) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_MARKET_SEGMENT | VARCHAR(10) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_NAME           | VARCHAR(25) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_NATION_NAME    | VARCHAR(25) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_REGION_NAME    | VARCHAR(25) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | NATION     | NATION_NAME             | VARCHAR(25) | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | ORDER_DATE              | DATE        | NULL              | NULL                           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_NAME           | VARCHAR(25) | ["customer name"] | Name of the customer           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_DATE              | DATE        | NULL              | Date when the order was placed |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_YEAR              | NUMBER(4,0) | NULL              | Year when the order was placed |
+---------------+-------------+--------------------+------------+-------------------------+-------------+-------------------+--------------------------------+
```

The following example lists the dimensions for the semantic view named `tpch_rev_analysis` in the current schema of the current database:

```sqlexample
SHOW SEMANTIC DIMENSIONS IN tpch_rev_analysis;
```

```output
+---------------+-------------+--------------------+------------+---------------+-------------+-------------------+--------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name          | data_type   | synonyms          | comment                        |
|---------------+-------------+--------------------+------------+---------------+-------------+-------------------+--------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_NAME | VARCHAR(25) | ["customer name"] | Name of the customer           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_DATE    | DATE        | NULL              | Date when the order was placed |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_YEAR    | NUMBER(4,0) | NULL              | Year when the order was placed |
+---------------+-------------+--------------------+------------+---------------+-------------+-------------------+--------------------------------+
```

---
title: SHOW SEMANTIC DIMENSIONS FOR METRIC
source: https://docs.snowflake.com/en/sql-reference/sql/show-semantic-dimensions-for-metric.md
section: SQL Commands
---

# SHOW SEMANTIC DIMENSIONS FOR METRIC

Lists the dimensions that you can return when querying a specific metric in a
[semantic view](../../user-guide/views-semantic/overview.md).

When you specify a dimension and a metric in a semantic view query, the logical table for the dimension must be related to the
logical table for the metric. In addition, the logical table for the dimension must have an equal or lower level of granularity
than the logical table for the metric.

To determine which dimensions meet this criteria, you can run this command.

For details, see [Choosing the dimensions that you can return for a given metric](../../user-guide/views-semantic/querying.md).

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
SHOW SEMANTIC DIMENSIONS [ LIKE '<pattern>' ]
                         IN <semantic_view_name>
                         FOR METRIC <metric_name>
                         [ STARTS WITH '<name_string>' ]
                         [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN semantic_view_name`
:   Specifies the name of the semantic view containing the dimensions and metric.

`FOR METRIC metric_name`
:   Specifies the name of the metric for which to show associated dimensions.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `table_name` | Name of the logical table for the dimension. |
| `name` | Name of the dimension. |
| `data_type` | Data type of the dimension. |
| `required` | Indicates whether the dimension is required for the metric. |
| `synonyms` | Alternative names or synonyms for the dimension. |
| `comment` | Comment about the dimension. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the dimensions that you can specify in a query for the `order_average_value` metric in the
`tpch_rev_analysis` semantic view:

```sqlexample
SHOW SEMANTIC DIMENSIONS IN tpch_rev_analysis FOR METRIC order_average_value;
```

```output
+------------+---------------+-------------+----------+-------------------+--------------------------------+
| table_name | name          | data_type   | required | synonyms          | comment                        |
|------------+---------------+-------------+----------+-------------------+--------------------------------|
| CUSTOMERS  | CUSTOMER_NAME | VARCHAR(25) | false    | ["customer name"] | Name of the customer           |
| ORDERS     | ORDER_DATE    | DATE        | false    | NULL              | Date when the order was placed |
| ORDERS     | ORDER_YEAR    | NUMBER(4,0) | false    | NULL              | Year when the order was placed |
+------------+---------------+-------------+----------+-------------------+--------------------------------+
```

The following example lists the dimensions that are required when you query a window function metric.

This example uses the semantic view that you defined in [Defining window function metrics](../../user-guide/views-semantic/querying.md). The example returns
the dimensions that you can specify in the query for the `avg_7_days_sales_quantity` metric.

```sqlexample
SHOW SEMANTIC DIMENSIONS IN sv_window_function_example FOR METRIC avg_7_days_sales_quantity;
```

```output
+------------+-----------+--------------+----------+----------+---------+
| table_name | name      | data_type    | required | synonyms | comment |
|------------+-----------+--------------+----------+----------+---------|
| DATE       | DATE      | DATE         | true     | NULL     | NULL    |
| DATE       | D_DATE_SK | NUMBER(38,0) | false    | NULL     | NULL    |
| DATE       | YEAR      | NUMBER(38,0) | true     | NULL     | NULL    |
+------------+-----------+--------------+----------+----------+---------+
```

Note that the `required` column contains `true` for the `date` and `year` dimensions. This is because the definition of
the `avg_7_days_sales_quantity` metric specifies the `date` and `year` dimensions in PARTITION BY EXCLUDING:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW sv_window_function_example
  ...
  METRICS (
    ...
      store_sales.avg_7_days_sales_quantity as AVG(total_sales_quantity)
        OVER (PARTITION BY EXCLUDING date.date, date.year ORDER BY date.date
          RANGE BETWEEN INTERVAL '6 days' PRECEDING AND CURRENT ROW)
        WITH SYNONYMS = ('Running 7-day average of total sales quantity'),
```

Because of this, the `date` and `year` dimensions are required in any query of the `avg_7_days_sales_quantity` metric. You
must specify these dimensions in the query:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  sv_window_function_example
  DIMENSIONS date.date, date.year
  METRICS store_sales.avg_7_days_sales_quantity
);
```

---
title: SHOW SEMANTIC FACTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-semantic-facts.md
section: SQL Commands
---

# SHOW SEMANTIC FACTS

Lists the facts in the [semantic views](../../user-guide/views-semantic/overview.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
SHOW SEMANTIC FACTS [ LIKE '<pattern>' ]
                    [ IN
                         {
                           <semantic_view_name>           |

                           ACCOUNT                        |

                           DATABASE                       |
                           DATABASE <db_name>             |

                           SCHEMA                         |
                           SCHEMA <db_name>.<schema_name>
                         }
                    ]
                    [ STARTS WITH '<name_string>' ]
                    [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `semantic_view_name`
    :   Returns records for the specified semantic view.

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

    `SCHEMA`, . `SCHEMA db_name.schema_name`
    :   Returns records for the current schema in use or a specified schema (`db_name.schema_name`). You must specify the
        fully qualified name of the schema.

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `database_name` | Name of the database that contains the semantic view. |
| `schema_name` | Name of the schema that contains the semantic view. |
| `semantic_view_name` | Name of the semantic view that contains the fact. |
| `table_name` | Name of the logical table for the fact. |
| `name` | Name of the fact. |
| `data_type` | Data type of the fact. |
| `synonyms` | Alternative names or synonyms for the fact. |
| `comment` | Comment about the fact. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |
| REFERENCES or OWNERSHIP | Semantic view | One of these privileges is required if you want the output to include [private facts](../../user-guide/views-semantic/sql.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the facts for semantic views that you have any privilege on. The list includes facts in semantic
views in the current schema of the current database.

```sqlexample
SHOW SEMANTIC FACTS;
```

```output
+---------------+-------------+--------------------+------------+------------------------+--------------------+----------+-------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name                   | data_type          | synonyms | comment                       |
|---------------+-------------+--------------------+------------+------------------------+--------------------+----------+-------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | C_CUSTOMER_ORDER_COUNT | NUMBER(18,0)       | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | LINEITEM   | LINE_ITEM_ID           | VARCHAR(134217728) | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | NATION     | N_NAME                 | VARCHAR(25)        | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | COUNT_LINE_ITEMS       | NUMBER(18,0)       | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | O_ORDERKEY             | NUMBER(38,0)       | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | REGION     | R_NAME                 | VARCHAR(25)        | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | LINE_ITEMS | DISCOUNTED_PRICE       | NUMBER(25,4)       | NULL     | Extended price after discount |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | LINE_ITEMS | LINE_ITEM_ID           | VARCHAR(134217728) | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | COUNT_LINE_ITEMS       | NUMBER(18,0)       | NULL     | NULL                          |
+---------------+-------------+--------------------+------------+------------------------+--------------------+----------+-------------------------------+
```

The following example lists the facts for the semantic view named `tpch_rev_analysis` in the current schema of the current
database:

```sqlexample
SHOW SEMANTIC FACTS IN tpch_rev_analysis;
```

```output
+---------------+-------------------+--------------------+------------+------------------+--------------------+----------+-------------------------------+
| database_name | schema_name       | semantic_view_name | table_name | name             | data_type          | synonyms | comment                       |
|---------------+-------------------+--------------------+------------+------------------+--------------------+----------+-------------------------------|
| MY_DB         | MY_SCHEMA         | TPCH_REV_ANALYSIS  | LINE_ITEMS | DISCOUNTED_PRICE | NUMBER(25,4)       | NULL     | Extended price after discount |
| MY_DB         | MY_SCHEMA         | TPCH_REV_ANALYSIS  | LINE_ITEMS | LINE_ITEM_ID     | VARCHAR(134217728) | NULL     | NULL                          |
| MY_DB         | MY_SCHEMA         | TPCH_REV_ANALYSIS  | ORDERS     | COUNT_LINE_ITEMS | NUMBER(18,0)       | NULL     | NULL                          |
+---------------+-------------------+--------------------+------------+------------------+--------------------+----------+-------------------------------+
```

---
title: SHOW SEMANTIC METRICS
source: https://docs.snowflake.com/en/sql-reference/sql/show-semantic-metrics.md
section: SQL Commands
---

# SHOW SEMANTIC METRICS

Lists the metrics in the [semantic views](../../user-guide/views-semantic/overview.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC VIEWS](show-semantic-views.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md)

## Syntax

```sqlsyntax
SHOW SEMANTIC METRICS [ LIKE '<pattern>' ]
                      [ IN
                           {
                             <semantic_view_name>           |

                             ACCOUNT                        |

                             DATABASE                       |
                             DATABASE <db_name>             |

                             SCHEMA                         |
                             SCHEMA <db_name>.<schema_name>
                           }
                      ]
                      [ STARTS WITH '<name_string>' ]
                      [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `semantic_view_name`
    :   Returns records for the specified semantic view.

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

    `SCHEMA`, . `SCHEMA db_name.schema_name`
    :   Returns records for the current schema in use or a specified schema (`db_name.schema_name`). You must specify the
        fully qualified name of the schema.

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `database_name` | Name of the database that contains the semantic view. |
| `schema_name` | Name of the schema that contains the semantic view. |
| `semantic_view_name` | Name of the semantic view that contains the metric. |
| `table_name` | Name of the logical table for the metric. |
| `name` | Name of the metric. |
| `data_type` | Data type of the metric. |
| `synonyms` | Alternative names or synonyms for the metric. |
| `comment` | Comment about the metric. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |
| REFERENCES or OWNERSHIP | Semantic view | One of these privileges is required if you want the output to include [private metrics](../../user-guide/views-semantic/sql.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the metrics for semantic views that you have any privilege on. The list includes metrics in semantic
views in the current schema of the current database.

```sqlexample
SHOW SEMANTIC METRICS;
```

```output
+---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name                         | data_type    | synonyms | comment                                |
|---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_COUNT               | NUMBER(18,0) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | CUSTOMER   | CUSTOMER_ORDER_COUNT         | NUMBER(30,0) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | AVERAGE_LINE_ITEMS_PER_ORDER | NUMBER(36,6) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | ORDER_AVERAGE_VALUE          | NUMBER(30,8) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | ORDERS     | ORDER_COUNT                  | NUMBER(18,0) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_ANALYSIS      | SUPPLIER   | SUPPLIER_COUNT               | NUMBER(18,0) | NULL     | NULL                                   |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_COUNT               | NUMBER(18,0) | NULL     | Count of number of customers           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | AVERAGE_LINE_ITEMS_PER_ORDER | NUMBER(36,6) | NULL     | Average number of line items per order |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_AVERAGE_VALUE          | NUMBER(30,8) | NULL     | Average order value across all orders  |
+---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------+
```

The following example lists the metrics for the semantic view named `tpch_rev_analysis` in the current schema of the current database:

```sqlexample
SHOW SEMANTIC METRICS IN tpch_rev_analysis;
```

```output
+---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------+
| database_name | schema_name | semantic_view_name | table_name | name                         | data_type    | synonyms | comment                                |
|---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------|
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | CUSTOMERS  | CUSTOMER_COUNT               | NUMBER(18,0) | NULL     | Count of number of customers           |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | AVERAGE_LINE_ITEMS_PER_ORDER | NUMBER(36,6) | NULL     | Average number of line items per order |
| MY_DB         | MY_SCHEMA   | TPCH_REV_ANALYSIS  | ORDERS     | ORDER_AVERAGE_VALUE          | NUMBER(30,8) | NULL     | Average order value across all orders  |
+---------------+-------------+--------------------+------------+------------------------------+--------------+----------+----------------------------------------+
```

---
title: SHOW SEMANTIC VIEWS
source: https://docs.snowflake.com/en/sql-reference/sql/show-semantic-views.md
section: SQL Commands
---

# SHOW SEMANTIC VIEWS

Lists the [semantic views](../../user-guide/views-semantic/overview.md) for which you have access privileges. You can list
views for the current or specified schema.

The output returns view metadata and properties, ordered lexicographically by database, schema, and semantic view name. This is
important to note if you want to filter the results using the provided filters.

See also:
:   [CREATE SEMANTIC VIEW](create-semantic-view.md) , [ALTER SEMANTIC VIEW](alter-semantic-view.md) , [DESCRIBE SEMANTIC VIEW](desc-semantic-view.md) , [DROP SEMANTIC VIEW](drop-semantic-view.md) , [SHOW SEMANTIC DIMENSIONS](show-semantic-dimensions.md) , [SHOW SEMANTIC DIMENSIONS FOR METRIC](show-semantic-dimensions-for-metric.md) , [SHOW SEMANTIC FACTS](show-semantic-facts.md) , [SHOW SEMANTIC METRICS](show-semantic-metrics.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] SEMANTIC VIEWS [ LIKE '<pattern>' ]
  [ IN
       {
         ACCOUNT                                         |

         DATABASE                                        |
         DATABASE <database_name>                        |

         SCHEMA                                          |
         SCHEMA <schema_name>                            |
         <schema_name>
       }
  ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`

      The `kind` column value is always `SEMANTIC_VIEW`.
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides semantic view properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the semantic view was created. |
| `name` | Name of the semantic view. |
| `kind` | View type. This is always `SEMANTIC_VIEW`.  This column only appears in the output if you specify TERSE. |
| `database_name` | Database in which the semantic view is stored. |
| `schema_name` | Schema in which the semantic view is stored. |
| `comment` | Comment about the semantic view. |
| `owner` | Role that owns the semantic view. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | Semantic view |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the semantic views in the database that is currently in use:

```sqlexample
SHOW SEMANTIC VIEWS;
```

```output
+-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------+
| created_on                    | name                 | database_name | schema_name | comment | owner   | owner_role_type | extension |
|-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------+
| 2025-04-10 08:29:02.732 -0700 | MY_SEMANTIC_VIEW_1   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:21.117 -0700 | MY_SEMANTIC_VIEW_2   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:38.040 -0700 | MY_SEMANTIC_VIEW_3   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:33.161 -0700 | MY_SEMANTIC_VIEW_4   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:46.294 -0700 | MY_SEMANTIC_VIEW_5   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:58.480 -0700 | MY_SEMANTIC_VIEW_6   | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-02-28 16:16:04.002 -0800 | O_TPCH_SEMANTIC_VIEW | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-03-21 07:03:54.120 -0700 | TPCH_REV_ANALYSIS    | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
+-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------+
```

The following example includes only a subset of the output columns:

```sqlexample
SHOW TERSE SEMANTIC VIEWS;
```

```output
+-------------------------------+-----------------------+---------------+---------------+-------------------+
| created_on                    | name                  | kind          | database_name | schema_name       |
|-------------------------------+-----------------------+---------------+---------------+-------------------|
| 2025-04-10 08:29:02.732 -0700 | MY_SEMANTIC_VIEW_1    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-04-10 08:29:21.117 -0700 | MY_SEMANTIC_VIEW_2    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-04-10 08:29:38.040 -0700 | MY_SEMANTIC_VIEW_3    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-04-10 08:47:33.161 -0700 | MY_SEMANTIC_VIEW_4    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-04-10 08:47:46.294 -0700 | MY_SEMANTIC_VIEW_5    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-04-10 08:47:58.480 -0700 | MY_SEMANTIC_VIEW_6    | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-02-28 16:16:04.002 -0800 | O_TPCH_SEMANTIC_VIEW  | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
| 2025-03-21 07:03:54.120 -0700 | TPCH_REV_ANALYSIS     | SEMANTIC_VIEW | MY_DB         | MY_SCHEMA         |
+-------------------------------+-----------------------+---------------+---------------+-------------------+
```

The following example displays the semantic views with names that have the string `tpch`:

```sqlexample
SHOW SEMANTIC VIEWS LIKE '%tpch%';
```

```output
+-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------+
| created_on                    | name                 | database_name | schema_name | comment | owner   | owner_role_type | extension |
|-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------|
| 2025-02-28 16:16:04.002 -0800 | O_TPCH_SEMANTIC_VIEW | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-03-21 07:03:54.120 -0700 | TPCH_REV_ANALYSIS    | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
+-------------------------------+----------------------+---------------+-------------+---------+---------+-----------------+-----------+
```

The following example displays the semantic views with names that start with `MY_SEMANTIC_VIEW`:

```sqlexample
SHOW SEMANTIC VIEWS STARTS WITH 'MY_SEMANTIC_VIEW';
```

```output
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
| created_on                    | name               | database_name | schema_name | comment | owner   | owner_role_type | extension |
|-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------|
| 2025-04-10 08:29:02.732 -0700 | MY_SEMANTIC_VIEW_1 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:21.117 -0700 | MY_SEMANTIC_VIEW_2 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:38.040 -0700 | MY_SEMANTIC_VIEW_3 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:33.161 -0700 | MY_SEMANTIC_VIEW_4 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:46.294 -0700 | MY_SEMANTIC_VIEW_5 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:58.480 -0700 | MY_SEMANTIC_VIEW_6 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
```

The following example displays the first three semantic views with names that start with `MY_SEMANTIC_VIEW`:

```sqlexample
SHOW SEMANTIC VIEWS STARTS WITH 'MY_SEMANTIC_VIEW' LIMIT 3;
```

```output
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
| created_on                    | name               | database_name | schema_name | comment | owner   | owner_role_type | extension |
|-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------|
| 2025-04-10 08:29:02.732 -0700 | MY_SEMANTIC_VIEW_1 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:21.117 -0700 | MY_SEMANTIC_VIEW_2 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:29:38.040 -0700 | MY_SEMANTIC_VIEW_3 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
```

The following example displays the three semantic views with names that start with `MY_SEMANTIC_VIEW` after the view named
`MY_SEMANTIC_VIEW_3`:

```sqlexample
SHOW SEMANTIC VIEWS STARTS WITH 'MY_SEMANTIC_VIEW' LIMIT 3 FROM 'MY_SEMANTIC_VIEW_3';
```

```output
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
| created_on                    | name               | database_name | schema_name | comment | owner   | owner_role_type | extension |
|-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------|
| 2025-04-10 08:47:33.161 -0700 | MY_SEMANTIC_VIEW_4 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:46.294 -0700 | MY_SEMANTIC_VIEW_5 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
| 2025-04-10 08:47:58.480 -0700 | MY_SEMANTIC_VIEW_6 | MY_DB         | MY_SCHEMA   |         | MY_ROLE | ROLE            | NULL      |
+-------------------------------+--------------------+---------------+-------------+---------+---------+-----------------+-----------+
```

---
title: SHOW SEQUENCES
source: https://docs.snowflake.com/en/sql-reference/sql/show-sequences.md
section: SQL Commands
---

# SHOW SEQUENCES

Lists all the sequences for which you have access privileges. This command can be used to list the sequences for a specified schema or
database (or the current schema/database for the session), or your entire account.

See also:
:   [SEQUENCES view](../info-schema/sequences.md) (Information Schema) , [CREATE SEQUENCE](create-sequence.md) , [ALTER SEQUENCE](alter-sequence.md) , [DROP SEQUENCE](drop-sequence.md) ,
    [DESCRIBE SEQUENCE](desc-sequence.md)

## Syntax

```sqlsyntax
SHOW SEQUENCES [ LIKE '<pattern>' ]
               [ IN
                    {
                      ACCOUNT                                         |

                      DATABASE                                        |
                      DATABASE <database_name>                        |

                      SCHEMA                                          |
                      SCHEMA <schema_name>                            |
                      <schema_name>

                      APPLICATION <application_name>                  |
                      APPLICATION PACKAGE <application_package_name>  |
                    }
               ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

---
title: SHOW SERVICE CONTAINERS IN SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/show-service-containers-in-service.md
section: SQL Commands
---

# SHOW SERVICE CONTAINERS IN SERVICE

Lists the containers in all instances of a [service](../../developer-guide/snowpark-container-services/working-with-services.md).

If Snowflake encounters issues executing one or more of your service containers, this command provides visibility into the status of individual containers. Similarly, during a rolling upgrade, it shows the version of your service code running in each container.

See also:
:   [Snowpark Container Services overview](../../developer-guide/snowpark-container-services/overview.md), [CREATE SERVICE](create-service.md), [SHOW SERVICES](show-services.md), [SHOW SERVICE INSTANCES IN SERVICE](show-service-instances-in-service.md)

## Syntax

```sqlsyntax
SHOW SERVICE CONTAINERS IN SERVICE <name>
```

## Parameters

`name`
:   Specifies the identifier for the service whose containers to list.

    Quoted names for special characters or case-sensitive names are not supported.

## Output

The command output provides properties and metadata of the service containers in the following columns:

| Column | Description |
| --- | --- |
| `database_name` | Database in which the service is created. |
| `schema_name` | Schema in which the service is created. |
| `service_name` | Name of the service. |
| `service_status` | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR`   The value in this column is the same as the `status` column in the output of the [DESCRIBE SERVICE](desc-service.md). |
| `instance_id` | ID of the service instance (this is the index of the service instance starting from 0). When there are no service instances running (that is, service is either SUSPENDED or PENDING), instance_id and instance_status are returned as NULL. Also, container related fields in the output are also returned as NULL. |
| `instance_status` | One of the following values, which indicates the current status of the service instance:   * `PENDING`: The service instance is currently being deployed and is not yet ready to serve requests. * `READY`: All containers in the service instance are ready; the service instance is ready to serve requests. * `FAILED`: At least one container in the service instance has exited with a failure. * `TERMINATING`: The service instance is in the process of termination and will be removed after the process is complete. * `SUCCEEDED`: The service is a job service and all containers in the service instance have terminated successfully.   Note that for a given service instance, as identified by the `instance_id` column, the value in the `instance_status` column matches the value in the `status` column in the output of the SHOW SERVICE INSTANCES IN SERVICE command. |
| `container_name` | Name of the container. If no containers are running (that is, the service is in a SUSPENDED or PENDING state), the container name is returned as NULL, and all container-specific field values are also NULL. |
| `status` | Service container status. Currently supported status values include the following:   * `PENDING`: The container is currently being deployed. * `READY`: The container started and the readiness probe returned HTTP 200 OK status. * `DONE`: The container exited with a 0 exit code. * `FAILED`: The container exited with a non-zero exit code (exit code 0 indicates success). * `TERMINATING`: The container is shutting down due to an error, restart, completion, or deletion. * `UNKNOWN`: Snowflake could not retrieve the container status. Contact support. |
| `message` | Additional clarification about status. For example, when status is FAILED, Snowflake might provide additional information. |
| `image_name` | Image name used to create the service. |
| `image_digest` | The unique and immutable identifier representing the image content. |
| `restart_count` | Number of times Snowflake restarted the service. |
| `start_time` | Date and time when the container started. |
| `last_exit_code` | Indicates the exit code when the container last exited. For service containers, Snowflake restarts the container if it exits prematurely. The exit code is represented as an integer value:   * `NULL`: The container is currently running and has never exited. * 0: The container’s last exit was successful. * Non-zero value: The container encountered a failure. |
| `last_restart_time` | Provides the timestamp of the most recent restart of the container by Snowflake. A NULL value indicates the container never restarted. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP or MONITOR | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists containers of the `echo_service` service in the current database and schema for the session:

```sqlexample
SHOW SERVICE CONTAINERS IN SERVICE echo_service;
```

Sample output:

```output
+---------------+-------------+--------------+----------------+-------------+-----------------+----------------+--------+---------+----------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+----------------------+----------------+-------------------+
| database_name | schema_name | service_name | service_status | instance_id | instance_status | container_name | status | message | image_name                                                                                                                                         | image_digest                                                            | restart_count | start_time           | last_exit_code | last_restart_time |
|---------------+-------------+--------------+----------------+-------------+-----------------+----------------+--------+---------+----------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+----------------------+----------------+-------------------|
| TUTORIAL_DB   | DATA_SCHEMA | ECHO_SERVICE | RUNNING        | 0           | READY           | echo           | READY  | Running | orgname.acctname.registry-dev.snowflakecomputing.com/tutorial_db/data_schema/tutorial_repository/my_echo_service_image:latest                      | sha256:d04a2d7b7d9bd607df994926e3cc672edcb541474e4888a01703e8bb0dd3f173 |             0 | 2025-04-25T06:01:38Z |           NULL | NULL              |
+---------------+-------------+--------------+----------------+-------------+-----------------+----------------+--------+---------+----------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------+---------------+----------------------+----------------+-------------------+
```

---
title: SHOW SERVICE INSTANCES IN SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/show-service-instances-in-service.md
section: SQL Commands
---

# SHOW SERVICE INSTANCES IN SERVICE

Lists instances of a [service](../../developer-guide/snowpark-container-services/working-with-services.md).

The command output offers visibility into auto-scaling and rolling upgrades by displaying the status of each individual service instance.

See also:
:   [Snowpark Container Services overview](../../developer-guide/snowpark-container-services/overview.md), [CREATE SERVICE](create-service.md), [SHOW SERVICES](show-services.md), [SHOW SERVICE CONTAINERS IN SERVICE](show-service-containers-in-service.md)

## Syntax

```sqlsyntax
SHOW SERVICE INSTANCES IN SERVICE <name>
```

## Parameters

`name`
:   Specifies the identifier for the service whose instances to list.

    Quoted names for special characters or case-sensitive names are not supported.

## Output

The command output provides properties and metadata of the service instances in the following columns:

| Column | Description |
| --- | --- |
| `database_name` | Database in which the service is created. |
| `schema_name` | Schema in which the service is created. |
| `service_name` | Name of the service. |
| `service_status` | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR`   Note that the value in this column is the same as the `status` column in the output of the [DESCRIBE SERVICE](desc-service.md). |
| `instance_id` | ID of the service instance (this is the index of the service instance starting from 0). |
| `status` | One of the following values, which indicates the current status of the service instance:   * `PENDING`: The service instance is currently being deployed and is not yet ready to serve requests. * `READY`: All containers in the service instance are ready; the service instance is ready to serve requests. * `FAILED`: At least one container in the service instance has exited with a failure. * `TERMINATING`: The service instance is in the process of termination and will be removed after the process is complete. * `SUCCEEDED`: The service is a job service and all containers in the service instance have terminated successfully. |
| `spec_digest` | The unique and immutable identifier that represents the service specification content. |
| `creation_time` | The time when Snowflake started creating the service instance. |
| `start_time` | The time when Snowflake acknowledged the service instance is running on a node. |
| `ip_address` | IP address of the service instance. Other instances of the same service (or other services) can use this IP address to connect to a specific service instance.  When you’re running multiple service instances, you can implement leader election among the instances of a service by electing the instance with `instance_id` 0 as the leader. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP or MONITOR | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists instances of the `echo_service` service in the current database and schema for the session:

```sqlexample
SHOW SERVICE INSTANCES IN SERVICE echo_service;
```

Sample output:

```output
+---------------+-------------+--------------+----------------+-------------+--------+------------------------------------------------------------------+----------------------+----------------------+------------+
| database_name | schema_name | service_name | service_status | instance_id | status | spec_digest                                                      | creation_time        | start_time           | ip_address |
|---------------+-------------+--------------+----------------+-------------+--------+------------------------------------------------------------------+----------------------+----------------------+------------|
| TUTORIAL_DB   | DATA_SCHEMA | ECHO_SERVICE | RUNNING        | 0           | READY  | 2831c241b8d64104fbc562d60764d7abd28602c70b6a8357341e8c8210b79da4 | 2025-04-25T06:01:32Z | 2025-04-25T06:01:32Z | 10.244.0.9 |
+---------------+-------------+--------------+----------------+-------------+--------+------------------------------------------------------------------+----------------------+----------------------+------------+
```

---
title: SHOW SERVICE VOLUMES IN SERVICE
source: https://docs.snowflake.com/en/sql-reference/sql/show-service-volumes-in-service.md
section: SQL Commands
---

# SHOW SERVICE VOLUMES IN SERVICE

Lists the storage volumes for all instances of a [service](../../developer-guide/snowpark-container-services/working-with-services.md).
For each mounted volume, the output includes a line for every
container mounting that volume. The output shows only volumes that are mounted to at least one container
in the service; volumes specified but unused by any container aren’t included.

See also:
:   [Snowpark Container Services overview](../../developer-guide/snowpark-container-services/overview.md),
    [CREATE SERVICE](create-service.md), [SHOW SERVICES](show-services.md),
    [SHOW SERVICE INSTANCES IN SERVICE](show-service-instances-in-service.md), [SHOW SERVICE CONTAINERS IN SERVICE](show-service-containers-in-service.md),
    [SHOW <objects>](show.md)

## Syntax

```sqlsyntax
SHOW SERVICE VOLUMES IN SERVICE <name>
```

## Parameters

`name`
:   Specifies the name of the service for which to display the list of mounted volumes.

    Quoted names for special characters or case-sensitive names aren’t supported.

## Output

The command output provides properties of service volumes in the following columns:

| Column | Description |
| --- | --- |
| `volume_name` | Name of the volume |
| `instance_id` | ID of the service instance, which is the index of the service instance starting from 0. |
| `container_name` | Name of the container to which a volume is mounted. |
| `volume_type` | Type of the volume. This can be one of the following types:   * `block` * `stage` * `local` * `memory`   For a detailed description of volume types, see [service specification](../../developer-guide/snowpark-container-services/specification-reference.md). |
| `size` | Size of the volume in the format of `numberGi`. |
| `iops` | Only applicable to block volumes. Shows the configured input/output operations per second for each block volume. |
| `throughput` | Only applicable to block volumes. Shows the configured throughput for each block volume. |
| `encryption` | Only applicable to stage and block volumes. In the case of block volumes, it shows the configured volume encryption type. For a detailed description of block volumes encryption types, see [Encryption Support for block storage volumes](../../developer-guide/snowpark-container-services/block-storage-volume.md). In the case of stage volumes, it shows the encryption type of the underlying stage. USAGE or OWNERSHIP privilege on a stage for the caller is required to get the stage encryption information. |
| `snapshot_used` | Only applicable to block volumes. Shows which snapshot was used to create the volume. The snapshot is listed in this column only if you are using a role that has been granted to the USAGE or OWNERSHIP privilege on the snapshot. |
| `stage_source` | Only applicable to stage volumes. Shows the fully qualified name for a stage that is used for the stage volume. |
| `volume_mounts` | Comma-separated list of paths where the volume is mounted in the given container. |

If a field is applicable to specific volume types, it is populated with NULL for every other volume type.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or MONITOR | Service | None |
| OWNERSHIP or USAGE | Snapshot | Without access to a block storage snapshot, Snowflake populates the `snapshot_used` field with an authorization error, but the command doesn’t fail. |
| OWNERSHIP or USAGE | Stage | Without access to a stage, Snowflake populates the encryption field with an authorization error for stage volumes, but the command doesn’t fail. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the volumes for the `echo_service` service:

```sqlexample
SHOW SERVICE VOLUMES IN SERVICE echo_service;
```

Sample output:

```output
+----------------+-------------+----------------+-------------+--------+--------+------------+----------------+---------------+--------------+---------------------------+
| volume_name    | instance_id | container_name | volume_type |  size  |  iops  | throughput |   encryption   | snapshot_used | stage_source |       volume_mounts       |
+----------------+-------------+----------------+-------------+--------+--------+------------+----------------+---------------+--------------+---------------------------+
| block-volume-1 | 0           | main           | block       | 1Gi    | 3000   | 125        | SNOWFLAKE_SSE  | [NULL]        | [NULL]       | /tmp/block1               |
| block-volume-1 | 0           | secondary      | block       | 1Gi    | 3000   | 125        | SNOWFLAKE_SSE  | [NULL]        | [NULL]       | /data/shared              |
| block-volume-2 | 0           | main           | block       | 50Gi   | 3500   | 150        | SNOWFLAKE_FULL | [NULL]        | [NULL]       | /tmp/block2               |
| local-volume   | 0           | main           | local       | [NULL] | [NULL] | [NULL]     | [NULL]         | [NULL]        | [NULL]       | /tmp/local                |
| memory-volume  | 0           | main           | memory      | 512Mi  | [NULL] | [NULL]     | [NULL]         | [NULL]        | [NULL]       | /tmp/memory, /tmp/memory2 |
| memory-volume  | 0           | secondary      | memory      | 512Mi  | [NULL] | [NULL]     | [NULL]         | [NULL]        | [NULL]       | /cache/memory             |
+----------------+-------------+----------------+-------------+--------+--------+------------+----------------+---------------+--------------+---------------------------+
```

---
title: SHOW SERVICES
source: https://docs.snowflake.com/en/sql-reference/sql/show-services.md
section: SQL Commands
---

# SHOW SERVICES

Lists the [Snowpark Container Services services](../../developer-guide/snowpark-container-services/working-with-services.md) (including job services) for
which you have access privileges.

* The SHOW SERVICES output also includes services running as jobs (see [EXECUTE JOB SERVICE](execute-job-service.md)).
* SHOW JOB SERVICES provides only the list of services running as jobs.
* SHOW SERVICES EXCLUDE JOBS output does not include services running as jobs.

See also:
:   [CREATE SERVICE](create-service.md) , [ALTER SERVICE](alter-service.md), [DROP SERVICE](drop-service.md) , [DESCRIBE SERVICE](desc-service.md), [SHOW SERVICE INSTANCES IN SERVICE](show-service-instances-in-service.md), [SHOW SERVICE CONTAINERS IN SERVICE](show-service-containers-in-service.md)

## Syntax

```sqlsyntax
SHOW [ JOB ] SERVICES [ EXCLUDE JOBS ] [ LIKE '<pattern>' ]
           [ IN
                {
                  ACCOUNT                  |

                  DATABASE                 |
                  DATABASE <database_name> |

                  SCHEMA                   |
                  SCHEMA <schema_name>     |
                  <schema_name>            |

                  COMPUTE POOL <compute_pool_name>
                }
           ]
           [ STARTS WITH '<name_string>' ]
           [ LIMIT <rows> [ FROM '<name_string>' ] ]
           [ OF TYPE <workload_type> [ , ... ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

`OF TYPE workload_type [ , ... ]`
:   Optionally filters the command output by the workload types. For a list of available workload types, see [ALLOWED_SPCS_WORKLOAD_TYPES](../parameters.md). The filter is case-insensitive.

    Default: ALL

## Output

The command output provides service properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Snowpark Container Services service name. |
| `status` | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR` |
| `database_name` | Database in which the service is created. |
| `schema_name` | Schema in which the service is created. |
| `owner` | Role that owns the service. |
| `compute_pool` | Compute pool name where Snowflake runs the service. |
| `dns_name` | Snowflake-assigned DNS name of the service in this format: `service-name.unique-id.svc.spcs.internal`.  The `unique-id` is a 4-8 character long alphanumeric identifier that is unique to a particular instance of a database schema. To find the unique ID for a schema, call the SYSTEM$GET_SERVICE_DNS_DOMAIN function. For example:  ```sqlexample SELECT SYSTEM$GET_SERVICE_DNS_DOMAIN('mydb.myschema'); ```  Note the following:   * If you rename a schema, the identifier remains unchanged. * If you drop and recreate a schema with the same name, the identifier will change.   The DNS name enables service-to-service communications (see [Tutorial 4](../../developer-guide/snowpark-container-services/tutorials/advanced/tutorial-4.md)). |
| `current_instances` | The current number of instances for the service. |
| `target_instances` | The target number of service instances that should be running as determined by Snowflake.  When the `current_instances` value is not equal to the `target_instances` value, Snowflake is either in the process of shutting down or launching service instances.  For example, consider the following:   * Suppose you create a service with MIN_INSTANCES = 1 and MAX_INSTANCES = 3. While the service is running, Snowflake might   determine that one instance is not enough. In this case, the value of `target_instances` will increase, indicating Snowflake is in the process of launching additional instances.  It’s also possible that the `target_instances` value is less than the `current_instances` value, which indicates that Snowflake is   in the process of reducing the number of running instances. * If you create services but the compute pool does’t have capacity for the minimum number of instances that you requested, the   value of `target_instances` will be equal to the value of `min_instances`. The value of `current_instances` will be less than the value of `target_instances`. |
| `min_ready_instances` | Indicates the minimum service instances that must be ready for Snowflake to consider the service is ready to process requests. |
| `min_instances` | Minimum number of service instances Snowflake should run. |
| `max_instances` | Maximum number of service instances that Snowflake can scale when needed. |
| `auto_resume` | If `true`, Snowflake auto-resumes the service, if suspended, when service function is called or when an incoming request (ingres) is received (see [Using a service](../../developer-guide/snowpark-container-services/working-with-services.md)). |
| `external_access_integrations` | List of external access integrations associated with the service. For more information, see [Configure service egress](../../developer-guide/snowpark-container-services/service-network-communications.md). |
| `created_on` | Date and time when the service was created. |
| `updated_on` | Date and time when service is last updated. |
| `resumed_on` | Timestamp when the service was last resumed. |
| `suspended_on` | Timestamp when the service was last suspended. `suspended_on` is set when Snowflake suspends a service and remains unchanged even after the service is resumed. If `suspended_on` is NULL, the service was never suspended. |
| `auto_suspend_secs` | Number of seconds of inactivity after which Snowflake automatically suspends the service. If `auto_suspend_secs` is set to 0 or never set, Snowflake does not automatically suspend the service. |
| `comment` | Service related comment. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `query_warehouse` | When a service container connects to Snowflake to execute a query and does not explicitly specify a warehouse to use, Snowflake uses this warehouse as default. |
| `is_job` | `true` if the service is a job service; `false` otherwise. SHOW JOB SERVICES and SHOW SERVICES EXCLUDE JOBS do not include this column in the output. |
| `is_async_job` | If TRUE, the job service is running asynchronously. By default, Snowflake executes the job services synchronously. This column is included in the output of the SHOW SERVICES, and SHOW JOB SERVICES commands but not in the output of the SHOW SERVICES EXCLUDING JOBS command. |
| `spec_digest` | The unique and immutable identifier representing the service spec content.  To observe the changes to the value of the `spec_digest` column over time, a service user might execute the SHOW SERVICES command periodically. If the service user notices a change in value, they can infer that the service was upgraded. |
| `is_upgrading` | TRUE, if Snowflake is in the process of upgrading the service. |
| `managing_object_domain` | The domain of the managing object (for example, the domain of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |
| `managing_object_name` | The name of the managing object (for example, the name of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any one of these privileges: OWNERSHIP, USAGE, MONITOR or OPERATE | Service |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists services in the current database and schema for the session:

```sqlexample
SHOW SERVICES;
```

Sample output:

```output
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------+
| name         | status  | database_name | schema_name | owner     | compute_pool          | dns_name                            | current_instances | target_instances | min_ready_instances | min_instances | max_instances | auto_resume | external_access_integrations | created_on                    | updated_on                    | resumed_on | suspended_on | auto_suspend_secs | comment | owner_role_type | query_warehouse | is_job | is_async_job | spec_digest                                                      | is_upgrading | managing_object_domain | managing_object_name |
|--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------|
| ECHO_SERVICE | RUNNING | TUTORIAL_DB   | DATA_SCHEMA | TEST_ROLE | TUTORIAL_COMPUTE_POOL | echo-service.k3m6.svc.spcs.internal |                 1 |                1 |                   1 |             1 |             1 | true        | NULL                         | 2024-11-29 12:12:47.310 -0800 | 2024-11-29 12:12:48.843 -0800 | NULL       | NULL         |                 0 | NULL    | ROLE            | NULL            | false  | false        | edaf548eb0c2744a87426529b53aac75756d0ea1c0ba5edb3cbb4295a381f2b4 | false        | NULL                   | NULL                 |
+--------------+---------+---------------+-------------+-----------+-----------------------+-------------------------------------+-------------------+------------------+---------------------+---------------+---------------+-------------+------------------------------+-------------------------------+-------------------------------+------------+--------------+-------------------+---------+-----------------+-----------------+--------+--------------+------------------------------------------------------------------+--------------+------------------------+----------------------+
```

The following example lists one service:

```sqlexample
SHOW SERVICES LIMIT 1;
```

The following example lists services with names containing “echo”:

```sqlexample
SHOW SERVICES LIKE '%echo%';
```

The following example lists one service with a name containing “echo”:

```sqlexample
SHOW SERVICES LIKE '%echo%' LIMIT 1;
```

The following example lists only services running as a job:

```sqlexample
SHOW JOB SERVICES;
```

---
title: SHOW SESSION POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-session-policies.md
section: SQL Commands
---

# SHOW SESSION POLICIES

Lists session policy information, including the creation date, database and schema names, owner, and any available comments.

See also:
:   [Session Policy DDL Reference](../../user-guide/session-policies-managing.md)

## Syntax

```sqlsyntax
SHOW SESSION POLICIES
  [ LIKE '<pattern>' ]
  [ IN
       {
         ACCOUNT                                         |

         DATABASE                                        |
         DATABASE <database_name>                        |

         SCHEMA                                          |
         SCHEMA <schema_name>                            |

         APPLICATION <application_name>                  |
         APPLICATION PACKAGE <application_package_name>  |
       }
    |
    ON
       {
         ACCOUNT           |
         USER <user_name>  |
       }
  ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`[ ON ... ]`
:   Lists the policies that are effective on the specified object. This command considers precedence.
    For example, listing policies on a user will show the account or built-in policy that is effective
    for the user if there is no policy set specifically on the user. Specify one of the following:

    `ACCOUNT`
    :   Returns policies effective on the account.

    `USER user_name`
    :   Returns policies effective on the specified user.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY SESSION POLICY | Account |  |
| OWNERSHIP | Session policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on session policy DDL and privileges, see [Managing session policies](../../user-guide/session-policies-managing.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Example

```sqlexample
SHOW SESSION POLICIES;
```

```output
----------------------------------+-----------------------+---------------+-------------+----------------+--------------+--------------------------------------------------+---------+
         created_on               | name                  | database_name | schema_name |      kind      |  owner       |   comment                                        | options |
----------------------------------+-----------------------+---------------+-------------+----------------+--------------+--------------------------------------------------+---------+
  Mon, 11 Jan 2021 00:00:00 -0700 | session_policy_prod_1 | MY_DB         | MY_SCHEMA   | SESSION_POLICY | POLICY_ADMIN | session policy for use in the prod_1 environment | ""      |
----------------------------------+-----------------------+---------------+-------------+----------------+--------------+--------------------------------------------------+---------+
```

---
title: SHOW SHARED CONTENT IN APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/show-shared-content.md
section: SQL Commands
---

# SHOW SHARED CONTENT IN APPLICATION PACKAGE

Shows all of the objects for which you have access privileges that have been shared from a Declarative Native App application package.

## Syntax

```sqlsyntax
SHOW SHARED CONTENT IN APPLICATION PACKAGE <pkg_name> FOR VERSION <version_name>
```

## Parameters

`pkg_name`
:   Specifies the package (`pkg_name`) containing the shared objects.

`FOR VERSION version_name`
:   Specifies the version (`version_name`) of the package containing the shared objects.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `version_name` | The automatically generated version name for the shared object. If the object is part of the live version, the value is `LIVE`. |
| `database_name` | The name of the database containing the shared object. |
| `schema_name` | The name of the schema containing the shared object. |
| `entity_name` | The name of the shared object. |
| `entity_type` | The type of the shared object, for example, TABLE, VIEW, or NOTEBOOK. |

## Access control requirements

This command requires a role with the relevant privilege on the entities returned. For example, if the application package contains a shared table, the role must have the USAGE privilege on the database and schema containing the table, and the SELECT privilege on the table.

## Examples

The following example shows how to use the SHOW SHARED CONTENT IN APPLICATION PACKAGE command to list all of the objects in a specific version of a Declarative Native App application package.

```sqlexample
SHOW SHARED CONTENT IN APPLICATION PACKAGE decl_share_app_pkg FOR VERSION VERSION$2;
```

```output
+-------------------------------------------------------------------------------+
| version_name | database_name | schema_name     | entity_name    | entity_type |
|--------------+---------------+-----------------+----------------+-------------|
| VERSION$2    | DB_TO_SHARE   | SCHEMA_TO_SHARE | TABLE_TO_SHARE | TABLE       |
+-------------------------------------------------------------------------------+
```

---
title: SHOW SHARES
source: https://docs.snowflake.com/en/sql-reference/sql/show-shares.md
section: SQL Commands
---

# SHOW SHARES

Lists all [shares](../../user-guide/data-sharing-intro.md) available in the system:

* Outbound shares (to consumers) that have been created in your account (as a provider).
* Inbound shares (from providers) that are available for your account to consume.

See also:
:   [CREATE SHARE](create-share.md) , [ALTER SHARE](alter-share.md) , [DROP SHARE](drop-share.md) , [DESCRIBE SHARE](desc-share.md)

## Syntax

```sqlsyntax
SHOW SHARES [ LIKE '<pattern>' ]
            [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The command lists shares only for users with a role that has the IMPORT SHARE privilege.

  + By default, the ACCOUNTADMIN role has this privilege.

    - For the ACCOUNTADMIN role, this command lists all outbound shares created in the account.
    - For other roles with the IMPORT SHARE privilege, this command lists only the outbound shares owned by the active role of the session.
  + A user with the ACCOUNTADMIN role can delegate this privilege. See [Enable non-ACCOUNTADMIN roles to perform data sharing tasks](../../user-guide/security-access-privileges-shares.md).
  > **Note:**
  >
  > Executing this command without sufficient privileges returns empty results.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

* The `kind` column displays:

  + `INBOUND` indicates the share is available to your account to consume (i.e. you can create a database from the share).
  + `OUTBOUND` indicates that your account is sharing data with other accounts and this share was created in your account.
* For `OUTBOUND` shares, if accounts have been added to the share, the `to` column displays these accounts. The maximum number
  of accounts displayed in this column is three; however, there is no hard limit on the number of accounts that can be added to a share.

## Examples

Show all shares that have been created in your account or are available to consume by your account:

> ```sqlexample
> SHOW SHARES;
>
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> | created_on                    | kind     | owner_account        | name          | database_name         | to               | owner        | comment                                | listing_global_name |                  |
> |-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------|---------------------|
> | 2016-07-09 19:18:09.821 -0700 | INBOUND  | SNOW.MY_TEST_ACCOUNT | SAMPLE_DATA   | SNOWFLAKE_SAMPLE_DATA |                  |              | Sample data sets provided by Snowflake |                     |
> | 2017-06-15 17:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT | SALES_S       | SALES_DB              | XY12345, YZ23456 | ACCOUNTADMIN |                                        |                     |
> +-------------------------------+----------+----------------------+---------------+-----------------------+------------------+--------------+----------------------------------------+---------------------+
> ```

Show all shares that have been created in your account or are available to consume by your account that include the string ‘SNOW’:

> ```sqlexample
> SHOW SHARES LIMIT 5 FROM 'SNOW';
>
> +-------------------------------+----------+-------------------------+-----------------+----------------+------------------+--------------+---------+---------------------+
> | created_on                    | kind     | owner_account           | name            | database_name  | to               | owner        | comment | listing_global_name |
> |-------------------------------+----------+-------------------------+-----------------+----------------+------------------+--------------+---------+---------------------|
> | 2020-07-07 19:18:09.821 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT    | SNOW_DATA       | EXAMPLE        |                  | ACCOUNTADMIN |         |                     |
> | 2020-07-10 19:18:09.821 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT    | DATA_SNOWS      | EXAMPLE        |                  | ACCOUNTADMIN |         |                     |
> | 2022-08-18 12:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT    | SNOW_DATA       | ALFALFA_DB     | AB12345, YZ23456 | ACCOUNTADMIN |         |                     |
> | 2022-08-18 13:04:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT    | SNOW_SHARE      | SALES_DB       | AB12345          | ACCOUNTADMIN |         |                     |
> | 2022-08-18 14:02:40.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT    | SNOWIER_SHARE   | SALES_DB       |                  | ACCOUNTADMIN |         |                     |
> +-------------------------------+----------+-------------------------+-----------------+----------------+------------------+--------------+---------+---------------------+
> ```

Show all shares that have been created in your account or are available to consume by your account that start with SNOW, sorted in
lexicographic order:

> ```sqlexample
> SHOW SHARES STARTS WITH 'SNOW' LIMIT 5 FROM 'A';
>
> +-------------------------------+----------+------------------------+------------------------+----------------+------------------+--------------+---------+---------------------+
> | created_on                    | kind     | owner_account          |  name                  | database_name  | to               | owner        | comment | listing_global_name |
> |-------------------------------+----------+------------------------+------------------------+----------------+------------------+--------------+---------+---------------------|
> | 2020-07-07 19:18:09.821 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT   | SNOW_DATA              | EXAMPLE        |                  | ACCOUNTADMIN |         |                     |
> | 2022-08-18 12:02:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT   | SNOW_DATA              | ALFALFA_DB     | AB12345, YZ23456 | ACCOUNTADMIN |         |                     |
> | 2022-08-18 14:02:40.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT   | SNOWIER_SHARE          | SALES_DB       |                  | ACCOUNTADMIN |         |                     |
> | 2022-08-20 15:03:50.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT   | SNOWY_SHARE            | SALES_DB       |                  | ACCOUNTADMIN |         |                     |
> | 2022-08-18 13:04:29.625 -0700 | OUTBOUND | SNOW.MY_TEST_ACCOUNT   | SNOW_SHARE             | SALES_DB       | AB12345          | ACCOUNTADMIN |         |                     |
> +-------------------------------+----------+------------------------+------------------------+----------------+------------------+--------------+---------+---------------------+
> ```

---
title: SHOW SHARES IN FAILOVER GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/show-shares-in-failover-group.md
section: SQL Commands
---

# SHOW SHARES IN FAILOVER GROUP

Lists shares in a [failover group](../../user-guide/account-replication-intro.md).

See also:
:   [SHOW DATABASES IN FAILOVER GROUP](show-databases-in-failover-group.md), [SHOW LISTINGS IN FAILOVER GROUP](show-listings-in-failover-group.md)

## Syntax

```sqlsyntax
SHOW SHARES IN FAILOVER GROUP <name>
```

## Parameters

`name`
:   Specifies the identifier for the failover group.

## Usage notes

* Executing this command requires a role with either the OWNERSHIP or MONITOR privilege on the failover group. The command
  returns results only for a role with the MONITOR privilege on a share.
* To retrieve the list of failover groups in your organization, use [SHOW FAILOVER GROUPS](show-failover-groups.md).
* If the failover group has listings, then this command will return the shares that are automatically managed in the failover group as part
  of the [listing support in Business Continuity and Disaster Recovery](../../collaboration/listings-bcdr.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the shares in the failover group `myrg`:

```sqlexample
SHOW SHARES IN FAILOVER GROUP myrg;
```

---
title: SHOW SHARES IN REPLICATION GROUP
source: https://docs.snowflake.com/en/sql-reference/sql/show-shares-in-replication-group.md
section: SQL Commands
---

# SHOW SHARES IN REPLICATION GROUP

Lists shares in a [replication group](../../user-guide/account-replication-intro.md).

See also:
:   [SHOW DATABASES IN REPLICATION GROUP](show-databases-in-replication-group.md)

## Syntax

```sqlsyntax
SHOW SHARES IN REPLICATION GROUP <name>
```

## Parameters

`name`
:   Specifies the identifier for the replication group.

## Usage notes

* Executing this command requires a role with either the OWNERSHIP or MONITOR privilege on the replication group. The command
  returns results only for a role with the MONITOR privilege on a share.
* To retrieve the list of replication groups in your organization, use [SHOW REPLICATION GROUPS](show-replication-groups.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

List the shares in the replication group `myrg`:

```sqlexample
SHOW SHARES IN REPLICATION GROUP myrg;
```

---
title: SHOW SNAPSHOT POLICIES — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/show-snapshot-policies.md
section: SQL Commands
---

# SHOW SNAPSHOT POLICIES — *Deprecated*

Lists all the [snapshot](../../user-guide/backups.md) policies in your account for which you have access privileges.

See also:
:   [CREATE SNAPSHOT POLICY — Deprecated](create-snapshot-policy.md),
    [ALTER SNAPSHOT POLICY — Deprecated](alter-snapshot-policy.md),
    [DROP SNAPSHOT POLICY — Deprecated](drop-snapshot-policy.md)

## Syntax

```sqlsyntax
SHOW SNAPSHOT POLICIES
   [ LIKE '<pattern>' ]
   [ IN { ACCOUNT | DATABASE | DATABASE <db_name> | SCHEMA | SCHEMA <schema_name> }
     [ STARTS WITH '<name_string>' ]
     [ LIMIT <rows> [ FROM '<name_string>' ]
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    `STARTS WITH 'name_string'`
    :   Optionally filters the command output based on the characters that appear at the beginning of
        the object name. The string must be enclosed in single quotes and is case sensitive.

        For example, the following strings return different results:

        `... STARTS WITH 'B' ...`

        `... STARTS WITH 'b' ...`

        . Default: No value (no filtering is applied to the output)

    `LIMIT rows [ FROM 'name_string' ]`
    :   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
        returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

        The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
        specified number of rows following the first row whose object name matches the specified string:

        * The string must be enclosed in single quotes and is case sensitive.
        * The string does not have to include the full object name; partial names are supported.

        Default: No value (no limit is applied to the output)

        > **Note:**
        >
        > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
        > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
        > returned.
        >
        > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
        > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
        >
        > For example:
        >
        > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
        > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
        > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

To determine whether a snapshot policy is associated with any snapshot sets, use the SHOW SNAPSHOT SETS command.

> **Note:**
>
> The snapshot policy is an object that’s inside a specific schema and database. Therefore, the policy
> gets replicated, dropped or undropped, and so on, when those operations are performed on the schema and database
> that contain it. If you can’t drop the snapshot policy because it’s associated with any snapshot sets,
> then you also can’t drop the schema or database containing the policy.

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp snapshot policy was created. |
| `name` | Name of snapshot policy. |
| `database_name` | Name of database that contains the snapshot policy. |
| `schema_name` | Name of schema that contains the snapshot policy. |
| `owner` | Name of the role with the OWNERSHIP privilege on the snapshot policy. |
| `comment` | Comment for snapshot policy. |
| `schedule` | Schedule for snapshot creation. |
| `expire_after_days` | Number of days after snapshot creation when snapshot expires. |
| `has_retention_lock` | Indicates whether the policy includes a retention lock.  `Y` if policy has retention lock; `N` otherwise.  For more information, see [Retention lock](../../user-guide/backups.md). |
| `owner` | Name of the role with the OWNERSHIP privilege on the snapshot set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the snapshot policy. |

## Examples

List all snapshot policies you have privileges for in the current account:

```sqlexample
SHOW SNAPSHOT POLICIES IN ACCOUNT;
```

---
title: SHOW SNAPSHOT SETS — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/show-snapshot-sets.md
section: SQL Commands
---

# SHOW SNAPSHOT SETS — *Deprecated*

Lists all the [snapshot](../../user-guide/backups.md) sets for which you have access privileges.
The scope of this command can be your entire account, or a specified database or schema.

See also:
:   [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md),
    [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md),
    [DROP SNAPSHOT SET — Deprecated](drop-snapshot-set.md)

## Syntax

```sqlsyntax
SHOW SNAPSHOT SETS
   [ LIKE '<pattern>' ]
   [ IN { ACCOUNT | DATABASE | DATABASE <db_name> | SCHEMA | SCHEMA <schema_name> }
     [ STARTS WITH '<name_string>' ]
     [ LIMIT <rows> [ FROM '<name_string>' ]
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    `STARTS WITH 'name_string'`
    :   Optionally filters the command output based on the characters that appear at the beginning of
        the object name. The string must be enclosed in single quotes and is case sensitive.

        For example, the following strings return different results:

        `... STARTS WITH 'B' ...`

        `... STARTS WITH 'b' ...`

        . Default: No value (no filtering is applied to the output)

    `LIMIT rows [ FROM 'name_string' ]`
    :   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
        returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

        The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
        specified number of rows following the first row whose object name matches the specified string:

        * The string must be enclosed in single quotes and is case sensitive.
        * The string does not have to include the full object name; partial names are supported.

        Default: No value (no limit is applied to the output)

        > **Note:**
        >
        > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
        > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
        > returned.
        >
        > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
        > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
        >
        > For example:
        >
        > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
        > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
        > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp that the snapshot set was created. |
| `name` | Name of the snapshot set. |
| `database_name` | Name of the database that contains the snapshot set. |
| `schema_name` | Name of the schema that contains the snapshot set. |
| `object_kind` | Type of the object that the snapshot set is snapshotting. |
| `object_name` | Name of the object that the snapshot set is snapshotting. |
| `object_database_name` | Name of the database that contains the object being snapshotted by this snapshot set. |
| `object_schema_name` | Name of the schema that contains the object being snapshotted by this snapshot set. |
| `snapshot_policy_name` | Name of the snapshot policy attached to this snapshot set. |
| `snapshot_policy_database_name` | Name of the database that contains the snapshot policy. |
| `snapshot_policy_schema_name` | Name of the schema that contains the snapshot policy. |
| `snapshot_policy_state` | Current state of the snapshot policy. |
| `owner_role` | Name of the role with the OWNERSHIP privilege on the snapshot set. |
| `owner_role_type` | Type of role with the OWNERSHIP privilege on the snapshot set. |
| `comment` | Comment for backup set. |

## Examples

List all snapshot sets that you have privileges for in the current account:

```sqlexample
SHOW SNAPSHOT SETS IN ACCOUNT;
```

List snapshot sets that include `T1` in the name:

```sqlexample
SHOW SNAPSHOT SETS LIKE '%T1%';
```

---
title: SHOW SNAPSHOTS
source: https://docs.snowflake.com/en/sql-reference/sql/show-snapshots.md
section: SQL Commands
---

# SHOW SNAPSHOTS

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Lists the [snapshots of block storage volumes](../../developer-guide/snowpark-container-services/block-storage-volume.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE SNAPSHOT](create-snapshot.md), [ALTER SNAPSHOT](alter-snapshot.md), [DESCRIBE SNAPSHOT](desc-snapshot.md), [DROP SNAPSHOT](drop-snapshot.md)

## Syntax

```sqlsyntax
SHOW SNAPSHOTS [ LIKE '<pattern>' ]
               [ IN
                   {
                       ACCOUNT                  |

                       DATABASE                 |
                       DATABASE <database_name> |

                       SCHEMA                   |
                       SCHEMA <schema_name>     |
                       <schema_name>            |
                   }
               ]
               [ STARTS WITH '<name_string>' ]
               [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the snapshot. |
| `state` | One of the following values, which indicates the current status of the snapshot:   * INITIALIZED: The snapshot creation is in progress. * CREATED: The snapshot is created and can be used to create a volume. * ERROR: Snapshot creation failed. |
| `database_name` | Database in which the snapshot is created. |
| `schema_name` | Schema in which the snapshot is created. |
| `service_name` | Fully qualified service name from which the snapshot is created. |
| `volume_name` | Volume from the specified service instance for which the snapshot is created. |
| `instance` | ID of the service instance. |
| `size` | Size (in GB) of the snapshot. |
| `comment` | General comment about the snapshot. |
| `owner` | Role that owns the snapshot. |
| `owner_role_type` | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| `created_on` | Date and time when the snapshot was created. |
| `encryption` | Encryption type configured for the volume, from which the snapshot was created. Possible values include `SNOWFLAKE_SSE` and `SNOWFLAKE_FULL`. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or USAGE | Snapshot | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

The following example lists the snapshots in the current database and schema:

```sqlexample
SHOW SNAPSHOTS;
```

Output:

```output
+-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------+
| name        | state   | database_name | schema_name | service_name                                       | volume_name | instance | size | comment      | owner     | owner_role_type | created_on                    | updated_on                    | encryption    |
|-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------|
| MY_SNAPSHOT | CREATED | TUTORIAL_DB   | DATA_SCHEMA | TUTORIAL_DB.DATA_SCHEMA.MY_SERVICE_WITH_EBS_VOLUME | block-vol1  | 0        |   10 | new snapshot | TEST_ROLE | ROLE            | 2024-05-09 21:36:58.502 -0700 | 2024-05-09 21:38:03.424 -0700 | SNOWFLAKE_SSE |
+-------------+---------+---------------+-------------+----------------------------------------------------+-------------+----------+------+--------------+-----------+-----------------+-------------------------------+-------------------------------+---------------+
```

---
title: SHOW SNAPSHOTS IN SNAPSHOT SET — Deprecated
source: https://docs.snowflake.com/en/sql-reference/sql/show-snapshots-in-snapshot-set.md
section: SQL Commands
---

# SHOW SNAPSHOTS IN SNAPSHOT SET — *Deprecated*

Lists all the [snapshots](../../user-guide/backups.md) in a snapshot set.

See also:
:   [CREATE SNAPSHOT SET — Deprecated](create-snapshot-set.md),
    [ALTER SNAPSHOT SET — Deprecated](alter-snapshot-set.md),
    [SHOW SNAPSHOT SETS — Deprecated](show-snapshot-sets.md)

## Syntax

```sqlsyntax
SHOW SNAPSHOTS IN SNAPSHOT SET <name>
```

## Parameters

`name`
:   Specifies the identifier for the snapshot set.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Notes |
| --- | --- |
| OWNERSHIP | You must have the OWNERSHIP privilege on the snapshot set to see the snapshots that it contains. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

| Column | Description |
| --- | --- |
| `created_on` | Timestamp snapshot is created. |
| `snapshot_id` | Snowflake-generated identifier of the snapshot. The snapshot ID is a UUID value, in the format returned by the [UUID_STRING](../functions/uuid_string.md) function. |
| `snapshot_set_name` | Name of snapshot set that contains the snapshot. |
| `database_name` | Name of database that contains the snapshot set. |
| `schema_name` | Name of schema that contains the snapshot set. |
| `expire_on` | Timestamp when the snapshot expires. |

## Examples

List all snapshots in snapshot set `t1_snapshots`:

```sqlexample
SHOW SNAPSHOTS IN SNAPSHOT SET t1_snapshots;
```

Show the creation date and snapshot ID for the oldest snapshot in snapshot set `t1_snapshots`:

```sqlexample
SHOW SNAPSHOTS IN SNAPSHOT SET t1_snapshots ->>
  SELECT "created_on", "snapshot_id" FROM $1
    ORDER BY "created_on" LIMIT 1;
```

Show the snapshot ID and the date and time when the final snapshot in snapshot set `t1_snapshots` will expire.
This example presumes that the snapshot policy doesn’t include a schedule, or the snapshot policy is suspended
for the snapshot set, so that no new snapshots are being added to the snapshot set. You’re just waiting for
all the existing snapshots to expire so that you can drop the snapshot set.

```sqlexample
SHOW SNAPSHOTS IN SNAPSHOT SET t1_snapshots ->>
  SELECT "expire_on", "snapshot_id" FROM $1
    ORDER BY "expire_on" DESC LIMIT 1;
```

---
title: SHOW SPECIFICATIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-specifications.md
section: SQL Commands
---

# SHOW SPECIFICATIONS

Lists the app specifications that have been defined for an app.

See also:
:   [DESCRIBE SPECIFICATION](desc-specification.md), [ALTER APPLICATION SET SPECIFICATION](alter-application-set-app-spec.md)

## Syntax

```sqlsyntax
SHOW [ { APPROVED | DECLINED | PENDING } ] SPECIFICATIONS [ IN APPLICATION <app_name> ];
```

## Parameters

`APPROVED | DECLINED | PENDING`
:   Narrows the output to app specifications with one of these statuses.

`IN APPLICATION app_name`
:   Specifies the name of the app whose app specification you want to view.

## Usage notes

* Consumers must provide the name of an app using the IN APPLICATION clause.
* An app can run this command without specifying the
  IN APPLICATION clause.

## Output

The command output provides information about the properties of an app specification in the following
columns:

| Column | Description |
| --- | --- |
| `name` | Name of the app specification. |
| `requested_on` | Timestamp when the app specification was requested. |
| `type` | Type of app specification. Supported values are: `EXTERNAL_ACCESS`, `SECURITY_INTEGRATION`, `LISTING`, and `CONNECTION`. |
| `sequence_number` | ID for a version of an app specification. This value is incremented each time a provider changes the [app specification definition](../../developer-guide/native-apps/requesting-app-specs.md). |
| `status` | Specifies the current status of the app specification. Possible values are:   * APPROVED: The consumer approved the app specification. * PENDING: The app specification is waiting for the consumer to approve or   decline. * DECLINED: The consumer declined the app specification. |
| `status_updated_on` | Timestamp of the last status change. |
| `label` | Name of the app specification that is displayed to the consumer in Snowsight. |
| `description` | Description of the app specification that is displayed to the consumer in Snowsight. |
| `definition` | Values that are part of the [app specification definition](../../developer-guide/native-apps/requesting-app-specs.md). The values of this column depend on the type of app specification. |

## Examples

Show all specifications that you have privileges to view:

```sqlexample
SHOW SPECIFICATIONS;
```

---
title: SHOW STAGES
source: https://docs.snowflake.com/en/sql-reference/sql/show-stages.md
section: SQL Commands
---

# SHOW STAGES

Lists all the stages for which you have access privileges. This command can be used to list the stages for a specified schema or
database (or the current schema/database for the session), or your entire account.

See also:
:   [CREATE STAGE](create-stage.md) , [ALTER STAGE](alter-stage.md) , [DROP STAGE](drop-stage.md) , [DESCRIBE STAGE](desc-stage.md)

## Syntax

```sqlsyntax
SHOW STAGES [ LIKE '<pattern>' ]
            [ IN
                 {
                   ACCOUNT                                         |

                   DATABASE                                        |
                   DATABASE <database_name>                        |

                   SCHEMA                                          |
                   SCHEMA <schema_name>                            |
                   <schema_name>

                   APPLICATION <application_name>                  |
                   APPLICATION PACKAGE <application_package_name>  |
                 }
            ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Output

The command output provides stage properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the stage was created. |
| name | Name of the stage. |
| database_name | Database in which the stage is stored. |
| schema_name | Schema in which the stage is stored. |
| url | URL for the external stage; blank for an internal stage. |
| has_credentials | Indicates that the external stage has access credentials; always `N` for an internal stage. |
| has_encryption_key | Indicates that the external stage contains encrypted files; always `N` for an internal stage. |
| owner | Role that owns the stage. |
| comment | Comment for the stage. |
| region | Region where the stage is located. |
| type | Indicates whether the stage is an external stage or internal stage, as well as whether the internal stage is permanent or temporary. |
| cloud | Cloud provider; always `NULL` for an internal stage. |
| notification_channel | Amazon Resource Name of the Amazon SQS queue for the stage. Deprecated column. |
| storage_integration | Storage integration associated with the stage; always `NULL` for an internal stage. |
| endpoint | The S3-compatible API endpoint associated with the stage; always `NULL` for stages that are not S3-compatible. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| directory_enabled | Indicates whether the stage has a directory table enabled. `Y` if a directory table is enabled, `N` if not enabled. |

For more information about the stage properties, see [CREATE STAGE](create-stage.md).

---
title: SHOW STORAGE LIFECYCLE POLICIES
source: https://docs.snowflake.com/en/sql-reference/sql/show-storage-lifecycle-policies.md
section: SQL Commands
---

# SHOW STORAGE LIFECYCLE POLICIES

Lists the [storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md) for which you have access privileges.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

See also:
:   [CREATE STORAGE LIFECYCLE POLICY](create-storage-lifecycle-policy.md) , [ALTER STORAGE LIFECYCLE POLICY](alter-storage-lifecycle-policy.md) , [DESCRIBE STORAGE LIFECYCLE POLICY](desc-storage-lifecycle-policy.md) , [DROP STORAGE LIFECYCLE POLICY](drop-storage-lifecycle-policy.md)

## Syntax

```sqlsyntax
SHOW STORAGE LIFECYCLE POLICIES
  [ LIKE '<pattern>' ]
  [ IN
        {
          ACCOUNT                  |

          DATABASE                 |
          DATABASE <database_name> |

          SCHEMA                   |
          SCHEMA <schema_name>     |
          <schema_name>
        }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time the policy was created. |
| `name` | The name of the policy. |
| `database_name` | The name of the database the policy is associated with. |
| `schema_name` | The name of the schema the policy uses. |
| `kind` | The type of storage lifecycle policy. |
| `owner` | The name of the role that created the policy. |
| `comment` | An optional comment that describes the policy. |
| `owner_role_type` | The type of role that the owner of the policy used to create the policy. |
| `options` | Optional parameters added to the policy to change how the policy behaves:   * `archive_for_days`: Number of days to archive rows before expiration. If this property isn’t set for the policy, the value is NULL. * `archive_tier`: The storage tier for the policy; COOL or COLD. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| APPLY STORAGE LIFECYCLE POLICY | Account | Allows SHOW on all storage lifecycle policies in the account. |
| APPLY | Storage lifecycle policy | Allows SHOW on the policy. |
| OWNERSHIP | Storage lifecycle policy | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the storage lifecycle policies that you have the privileges to view in the PUBLIC schema of the `mydb`
database:

```sqlexample
USE DATABASE mydb;

SHOW STORAGE LIFECYCLE POLICIES;
```

Output:

```output
+----------------------------------------+------------------+---------------------------+-------------------------------+--------------------------+--------------+-------------------+-----------------+---------------------------+
|               created_on               |       name       |       database_name       |          schema_name          |           kind           |    owner     |      comment      | owner_role_type |          options          |
+----------------------------------------+------------------+---------------------------+-------------------------------+--------------------------+--------------+-------------------+-----------------+---------------------------+
| Fri, 23 Jun 1967 07:00:00.123000 +0000 | MY_POLICY        | MYDB                      | PUBLIC                        | STORAGE_LIFECYCLE_POLICY | TESTACCOUNT  | identity          | ROLE            | {"ARCHIVE_FOR_DAYS":null} |
| Fri, 23 Jun 1967 07:00:00.123000 +0000 | MY_SECOND_POLICY | MYDB                      | PUBLIC                        | STORAGE_LIFECYCLE_POLICY | TESTACCOUNT  | identity with UDF | ROLE            | {"ARCHIVE_FOR_DAYS":365}  |
| Fri, 23 Jun 1967 07:00:00.123000 +0000 | MY_THIRD_POLICY  | MYDB                      | PUBLIC                        | STORAGE_LIFECYCLE_POLICY | TESTACCOUNT  | always true       | ROLE            | {"ARCHIVE_FOR_DAYS":180}  |
+----------------------------------------+------------------+---------------------------+-------------------------------+--------------------------+--------------+-------------------+-----------------+---------------------------+
```

---
title: SHOW STREAMLITS
source: https://docs.snowflake.com/en/sql-reference/sql/show-streamlits.md
section: SQL Commands
---

# SHOW STREAMLITS

Lists the Streamlit objects for which you have access privileges.

See also:
:   [CREATE STREAMLIT](create-streamlit.md), [DESCRIBE STREAMLIT](desc-streamlit.md), [ALTER STREAMLIT](alter-streamlit.md), [DROP STREAMLIT](drop-streamlit.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] STREAMLITS [ LIKE '<pattern>' ]
                          [ IN
                                {
                                  ACCOUNT                   |

                                  DATABASE                  |
                                  DATABASE <db_name>        |

                                  SCHEMA
                                  SCHEMA <schema_name>      |
                                  <schema_name>             |
                                }
                          ]
                          [ LIMIT <rows> [ FROM '<name_string>' ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`

      The `kind` column value is always Streamlit.
    * `database_name`
    * `schema_name`
    * `title`
    * `url_id`

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Access control requirements

If your role does not own the objects in the following table, then your role
must have the listed
[privileges](../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Streamlit objects that you list | Anyone can execute this command, but only Streamlit objects for which you have USAGE privileges are returned in the output. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* You can use this command to list Streamlit objects for the current/specified database or schema, or across your entire account.
* The command doesn’t list Streamlit objects that have been dropped.
* The command doesn’t require a running warehouse to run.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides Streamlit object properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the Streamlit object was created. |
| `name` | Name of the Streamlit object. |
| `database_name` | Database in which the Streamlit object is stored. |
| `schema_name` | Schema in which the Streamlit object is stored. |
| `title` | Title of the Streamlit app that displays in Snowsight. |
| `comment` | Comment for the Streamlit object. |
| `owner` | Role that owns the Streamlit object. |
| `query_warehouse` | Warehouse where queries issued by the Streamlit application are run. |
| `url_id` | Unique ID associated with the Streamlit object. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

---
title: SHOW STREAMS
source: https://docs.snowflake.com/en/sql-reference/sql/show-streams.md
section: SQL Commands
---

# SHOW STREAMS

Lists the streams for which you have access privileges. The command can be used to list streams for the current/specified database
or schema, or across your entire account.

The output returns stream metadata and properties, ordered lexicographically by database, schema, and stream name (see Output
in this topic for descriptions of the output columns). This is important to note if you wish to filter the results using the provided
filters.

See also:
:   [CREATE STREAM](create-stream.md) , [ALTER STREAM](alter-stream.md) , [DROP STREAM](drop-stream.md) , [DESCRIBE STREAM](desc-stream.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] STREAMS [ LIKE '<pattern>' ]
                       [ IN { ACCOUNT | DATABASE [ <db_name> ] | [ SCHEMA ] [ <schema_name> ] | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> } ]
                       [ STARTS WITH '<name_string>' ]
                       [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind` (rename of `type` column in full set of columns)
    * `database_name`
    * `schema_name`
    * `tableOn` (rename of `table_name` column in full set of columns)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN ACCOUNT | [ DATABASE ] db_name | [ SCHEMA ] schema_name | APPLICATION application_name | APPLICATION PACKAGE application_package_name`
:   Specifies the scope of the command, which determines whether the command lists records only for the current/specified database or
    schema, or across your entire account:

    The `APPLICATION` and `APPLICATION PACKAGE` keywords are not required, but they specify the scope for the named Snowflake Native App.

    The `DATABASE` or `SCHEMA` keyword is not required; you can set the scope by specifying only the database or schema name.
    Likewise, the database or schema name is not required if the session currently has a database in use.

    * If `DATABASE` or `SCHEMA` is specified without a name and the session does not currently have a database in use, the
      parameter has no effect on the output.
    * If `SCHEMA` is specified with a name and the session does not currently have a database in use, the schema name must
      be fully qualified with the database name (e.g. `testdb.testschema`).

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides stream properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the stream was created. |
| `name` | Name of the stream. |
| `database_name` | Database for the schema for the stream. |
| `schema_name` | Schema for the stream. |
| `owner` | Role that owns the stream. |
| `comment` | Comment for the stream. |
| `table_name` | Table whose DML updates are tracked by the stream. |
| `source_type` | Source object for the stream: table, view, directory table, or external table. |
| `base_tables` | Underlying tables for the view. This column applies to streams on views only. |
| `type` | Type of the stream; currently DELTA only. |
| `stale` | Indicates whether the stream was last read before the `stale_after` time (see below). If this is `TRUE`, the stream may be stale. When a stream is stale, it cannot be read. Recreate the stream to resume reading from it. To prevent a stream from becoming stale, consume the stream before `stale_after`. |
| `mode` | Displays `APPEND_ONLY` if the stream is an append-only stream. . Displays `INSERT_ONLY` if the stream only returns information for inserted rows; currently applies to streams on external tables only. . For streams on tables, the column displays `DEFAULT`. |
| `stale_after` | Timestamp when the stream became or may become stale if not consumed. . . This value is calculated by adding the retention period for the source table (i.e. the larger of the [DATA_RETENTION_TIME_IN_DAYS](../parameters.md) or [MAX_DATA_EXTENSION_TIME_IN_DAYS](../parameters.md) parameter setting) to the last time the stream was read. If the data retention period is set at the schema or database level, the current role and account must have access to the relevant object (schema, database, or shared tables/views) to obtain an accurate `stale_after` timestamp. . . This time can be inaccurate in a few cases: . - Some time can elapse between when the stream is *permitted* to become stale and when the underlying data is actually dropped. During this period, `stale_after` will be in the past, but reading from the stream may succeed. The duration of this period is subject to change, so you should not depend on it. . - If parameters affecting table retention are increased, streams that are already stale will remain stale, but the `stale_after` time might be in the future. |
| `invalid_reason` | Reason why the stream cannot be queried successfully. This column supports future functionality. Currently, the only value returned is `N/A`. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

For more information about the properties that can be specified for a stream, see [CREATE STREAM](create-stream.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns source object names for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the streams whose name starts with `line` that you have privileges to view in the `tpch.public` schema:

> ```sqlexample
> SHOW STREAMS LIKE 'line%' IN tpch.public;
> ```

---
title: SHOW TABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-tables.md
section: SQL Commands
---

# SHOW TABLES

Lists the tables for which you have access privileges, including dropped tables that are still within the Time Travel retention period
and, therefore, can be undropped. The command can be used to list tables for the current/specified database or schema, or across your
entire account.

The output returns table metadata and properties, ordered lexicographically by database, schema, and table name (see Output in this
topic for descriptions of the output columns). This is important to note if you want to filter the results using the provided filters.

See also:
:   [CREATE TABLE](create-table.md) , [DROP TABLE](drop-table.md) , [UNDROP TABLE](undrop-table.md) , [ALTER TABLE](alter-table.md) , [DESCRIBE TABLE](desc-table.md)

    [TABLES view](../info-schema/tables.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW [ TERSE ] TABLES [ HISTORY ] [ LIKE '<pattern>' ]
                                  [ IN
                                        {
                                          ACCOUNT                                         |

                                          DATABASE                                        |
                                          DATABASE <database_name>                        |

                                          SCHEMA                                          |
                                          SCHEMA <schema_name>                            |
                                          <schema_name>

                                          APPLICATION <application_name>                  |
                                          APPLICATION PACKAGE <application_package_name>  |
                                        }
                                  ]
                                  [ STARTS WITH '<name_string>' ]
                                  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`

      The `kind` column value is always TABLE.
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`HISTORY`
:   Optionally includes dropped tables that have not yet been purged (i.e. they are still within their respective Time Travel retention
    periods). If multiple versions of a dropped table exist, the output displays a row for each version. The output also includes an
    additional `dropped_on` column, which displays:

    * Date and timestamp (for dropped tables).
    * `NULL` (for active tables).

    Default: No value (dropped tables are not included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the table was created. |
| `name` | Name of the table. |
| `database_name` | Database in which the table is stored. |
| `schema_name` | Schema in which the table is stored. |
| `kind` | Table type: TABLE (for permanent tables), TEMPORARY, or TRANSIENT. |
| `comment` | Comment for the table. |
| `cluster_by` | Column(s) defined as clustering key(s) for the table. |
| `rows` | Number of rows in the table. Returns NULL for external tables. |
| `bytes` | Number of bytes that will be scanned if the entire table is scanned in a query. Note that this number may be different than the number of actual physical bytes (i.e. bytes stored on-disk) for the table. |
| `owner` | Role that owns the table. |
| `retention_time` | Number of days that modified and deleted data is retained for Time Travel. |
| `dropped_on` | Date and time when the table was dropped; NULL if the table is active. This column is only displayed when the HISTORY keyword is specified for the command. |
| `automatic_clustering` | If [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) is enabled for your account, specifies whether it is explicitly enabled (`ON`) or disabled (`OFF`) for the table. This column is not displayed if Automatic Clustering is not enabled for your account. |
| `change_tracking` | If `ON`, change tracking is enabled. You can query this change tracking data using [streams](../../user-guide/streams-intro.md) or the [CHANGES](../constructs/changes.md) clause for [SELECT](select.md) statements. If `OFF`, change tracking is currently disabled but could be [enabled](../../user-guide/streams-manage.md). |
| `search_optimization` | If `ON`, the table has the [search optimization service](../../user-guide/search-optimization-service.md) enabled. Otherwise, the value is `OFF`. |
| `search_optimization_progress` | Percentage of the table that has been optimized for search. This value increases when optimization is first added to a table and when maintenance is done on the search optimization service. Before you measure the performance improvement of search optimization on a newly-optimized table, wait until this shows that the table has been fully optimized. |
| `search_optimization_bytes` | Number of additional bytes of storage that the search optimization service consumes for this table. |
| `is_external` | `Y` if it is an external table; `N` otherwise. |
| `enable_schema_evolution` | `Y` if the table has [schema evolution](../../user-guide/data-load-schema-evolution.md) enabled; `N` otherwise. You can enable automatic table schema evolution by using the [CREATE TABLE](create-table.md) or [ALTER TABLE](alter-table.md) commands. |
| `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `is_event` | `Y` if it is an event table; `N` otherwise. |
| `is_hybrid` | `Y` if it is a hybrid table; `N` otherwise. |
| `is_iceberg` | `Y` if the table is an [Apache Iceberg™ table](../../user-guide/tables-iceberg.md); `N` otherwise. |
| `is_immutable` | `Y` if the table was created with the [READ ONLY](create-table.md) property; `N` otherwise. |

For more information about the properties that can be specified for a table, see [CREATE TABLE](create-table.md).

> **Note:**
>
> For cloned tables and tables with deleted data, the `bytes` displayed for the table may be different than the number of physical
> bytes for the table:
>
> * A cloned table does not utilize additional data storage until new rows are added to the table or existing rows in the table are
>   modified or deleted. If few or no changes have been made to the table, the number of bytes displayed is more than the
>   actual physical bytes stored for the table.
> * Data deleted from a table is maintained in Snowflake until both the Time Travel retention period (default is 1 day) and Fail-safe
>   period (7 days) for the data have passed. During these two periods, the number of bytes displayed is less than the actual
>   physical bytes stored for the table.
>
> For more detailed information about table size in bytes as it relates to cloning, Time Travel, and Fail-safe, see the
> [TABLE_STORAGE_METRICS](../info-schema/table_storage_metrics.md) Information Schema view.

## Usage notes

* If an account (or database or schema) has a large number of tables, then searching the entire account (or table or schema)
  can consume a significant amount of compute resources.
* In the output, results are sorted by database name, schema name, and then table name. This means results for a database
  can contain tables from multiple schemas and might break pagination. In order for pagination to work as expected, you
  must execute the SHOW TABLES command for a single schema. You can use the IN SCHEMA `schema_name` parameter to
  the SHOW TABLES command. Alternatively, you can use the schema in the current context by executing a USE SCHEMA command
  before executing a SHOW TABLES command.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

These examples show all of the tables that you have privileges to view based on the specified parameters.

Run SHOW TABLES on tables in the [Sample data sets](../../user-guide/sample-data.md). The examples use the TERSE parameter to limit the output.

> Show all the tables with a name that starts with `LINE` in the `tpch_sf1` schema:
>
> ```sqlexample
> SHOW TERSE TABLES IN tpch_sf1 STARTS WITH 'LINE';
> ```
>
> ```output
> +-------------------------------+----------+-------+-----------------------+-------------+
> | created_on                    | name     | kind  | database_name         | schema_name |
> |-------------------------------+----------+-------+-----------------------+-------------|
> | 2016-07-08 13:41:59.960 -0700 | LINEITEM | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> +-------------------------------+----------+-------+-----------------------+-------------+
> ```
>
> Show all of the tables with a name that includes the substring `PART` in the `tpch_sf1` schema:
>
> ```sqlexample
> SHOW TERSE TABLES LIKE '%PART%' IN tpch_sf1;
> ```
>
> ```output
> +-------------------------------+-----------+-------+-----------------------+-------------+
> | created_on                    | name      | kind  | database_name         | schema_name |
> |-------------------------------+-----------+-------+-----------------------+-------------|
> | 2016-07-08 13:41:59.960 -0700 | JPART     | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> | 2016-07-08 13:41:59.960 -0700 | JPARTSUPP | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> | 2016-07-08 13:41:59.960 -0700 | PART      | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> | 2016-07-08 13:41:59.960 -0700 | PARTSUPP  | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> +-------------------------------+-----------+-------+-----------------------+-------------+
> ```
>
> Show the tables in the `tpch_sf1` schema, but limit the output to three rows, and start with the table
> names that begin with `J`:
>
> ```sqlexample
> SHOW TERSE TABLES IN tpch_sf1 LIMIT 3 FROM 'J';
> ```
>
> ```output
> +-------------------------------+-----------+-------+-----------------------+-------------+
> | created_on                    | name      | kind  | database_name         | schema_name |
> |-------------------------------+-----------+-------+-----------------------+-------------|
> | 2016-07-08 13:41:59.960 -0700 | JCUSTOMER | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> | 2016-07-08 13:41:59.960 -0700 | JLINEITEM | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> | 2016-07-08 13:41:59.960 -0700 | JNATION   | TABLE | SNOWFLAKE_SAMPLE_DATA | TPCH_SF1    |
> +-------------------------------+-----------+-------+-----------------------+-------------+
> ```

Show a dropped table using the HISTORY parameter.

> Create a table in your current schema, then drop it:
>
> ```sqlexample
> CREATE OR REPLACE TABLE test_show_tables_history(c1 NUMBER);
>
> DROP TABLE test_show_tables_history;
> ```
>
> Use the HISTORY parameter to include dropped tables in the command output:
>
> ```sqlexample
> SHOW TABLES HISTORY LIKE 'test_show_tables_history';
> ```
>
> In the output, the `dropped_on` column shows the date and time when the table was dropped.

Sort the tables to show the newest tables first.

```sqlexample
SHOW TERSE TABLES ->> SELECT * FROM $1 ORDER BY "created_on" DESC;
```

Sort the tables to show the tables with the most data first. The `"bytes"` columns
isn’t available in the SHOW TERSE TABLES output, so this example filters the
output of the full SHOW TABLES command.

```sqlexample
SHOW TABLES
  ->> SELECT "name", "database_name", "schema_name", "bytes" FROM $1 ORDER BY "bytes" DESC;
```

Transform the SHOW TABLES output into a set of fully qualified table names.

```sqlexample
SHOW TABLES IN ACCOUNT
  ->> SELECT "database_name" || '.' || "schema_name" || '.' || "name" AS fully_qualified_name
        FROM $1
        ORDER BY fully_qualified_name;
```

---
title: SHOW TAGS
source: https://docs.snowflake.com/en/sql-reference/sql/show-tags.md
section: SQL Commands
---

# SHOW TAGS

Lists the tag information.

See also:
:   [CREATE TAG](create-tag.md) , [ALTER TAG](alter-tag.md) , [DROP TAG](drop-tag.md) , [UNDROP TAG](undrop-tag.md)

## Syntax

```sqlsyntax
SHOW TAGS [ LIKE '<pattern>' ]
          [ IN
               {
                 ACCOUNT                                         |

                 DATABASE                                        |
                 DATABASE <database_name>                        |

                 SCHEMA                                          |
                 SCHEMA <schema_name>                            |
                 <schema_name>

                 APPLICATION <application_name>                  |
                 APPLICATION PACKAGE <application_package_name>  |
               }
          ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this SQL command must have at least one of the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE | Schema | This privilege must match the schema containing the tag. |
| APPLY TAG | Account |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on tag DDL and privileges, see [Access control privileges](../../user-guide/object-tagging/work.md).

## Usage notes

* The ALLOWED_VALUES column specifies the possible string values that can be assigned to the tag when the tag is set
  on an [object](../../user-guide/object-tagging/introduction.md) or NULL if the tag does not have any specified allowed values. For details, see
  [Set a list of allowed tag values](../../user-guide/object-tagging/work.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show tags in a given schema:

> ```sqlexample
> SHOW TAGS IN SCHEMA my_db.my_schema;
> ```
>
> ```output
> ------------------------------+----------------+---------------+-------------+--------------+--------------------+----------------+-----------------+
>                    created_on | name           | database_name | schema_name | owner        | comment            | allowed_values | owner_role_type |
> ------------------------------+----------------+---------------+-------------+--------------+--------------------+----------------+-----------------+
> 2021-03-20 21:09:38.317 +0000 | CLASSIFICATION | MY_DB         | MY_SCHEMA   | ACCOUNTADMIN | secure information | [NULL]         | ROLE            |
> 2021-03-20 21:08:59.000 +0000 | COST_CENTER    | MY_DB         | MY_SCHEMA   | ACCOUNTADMIN | cost_center tag    | [NULL]         | ROLE            |
> ------------------------------+----------------+---------------+-------------+--------------+--------------------+----------------+-----------------+
> ```

---
title: SHOW TASKS
source: https://docs.snowflake.com/en/sql-reference/sql/show-tasks.md
section: SQL Commands
---

# SHOW TASKS

Lists the tasks for which you have access privileges. The command can be used to list tasks for the current/specified database or schema,
or across your entire account.

The output returns task metadata and properties, ordered lexicographically by database, schema, and task name (see Output in this topic
for descriptions of the output columns). This is important to note if you wish to filter the results using the provided filters.

See also:
:   [CREATE TASK](create-task.md) , [ALTER TASK](alter-task.md) , [DROP TASK](drop-task.md) , [DESCRIBE TASK](desc-task.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] TASKS [ LIKE '<pattern>' ]
                     [ IN { ACCOUNT | DATABASE [ <db_name> ] | [ SCHEMA ] [ <schema_name> ] | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> } ]
                     [ STARTS WITH '<name_string>' ]
                     [ ROOT ONLY ]
                     [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only a subset of the output columns:

    * created_on
    * name
    * kind (shows NULL for all task records)
    * database_name
    * schema_name
    * schedule

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN ACCOUNT | DATABASE [ db_name ] | SCHEMA [ schema_name ] | APPLICATION application_name | APPLICATION PACKAGE application_package_name`
:   Optionally specifies the scope of the command, which determines whether the command lists records only for the current/specified database or schema, or across your entire account.

    The `APPLICATION` and `APPLICATION PACKAGE` keywords are not required, but they specify the scope for the named Snowflake Native App.

    If you specify the keyword `ACCOUNT`, then the command retrieves records for all schemas in all databases
    of the current account.

    If you specify the keyword `DATABASE`, then:

    * If you specify a `db_name`, then the command retrieves records for all schemas of the specified database.
    * If you don’t specify a `db_name`, then:

      + If there is a current database, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and schemas in the account.

    If you specify the keyword `SCHEMA`, then:

    * If you specify a qualified schema name (for example, `my_database.my_schema`), then the command
      retrieves records for the specified database and schema.
    * If you specify an unqualified `schema_name`, then:

      + If there is a current database, then the command retrieves records for the specified schema in the current database.
      + If there is no current database, then the command displays the error
        `SQL compilation error: Object does not exist, or operation cannot be performed`.
    * If you don’t specify a `schema_name`, then:

      + If there is a current database, then:

        - If there is a current schema, then the command retrieves records for the current schema in the current database.
        - If there is no current schema, then the command retrieves records for all schemas in the current database.
      + If there is no current database, then the command retrieves records for all databases and all schemas in the account.

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default; that is, the command returns the objects that you have privileges to view in the database.
    * No database: `ACCOUNT` is the default; that is, the command returns the objects that you have privileges to view in your account.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`ROOT ONLY`
:   Filters the command output to return only root tasks (tasks with no predecessors).

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides task properties and metadata in the following columns:

| Column Name | Description |
| --- | --- |
| created_on | Date and time when the task was created. |
| name | Name of the task. |
| id | Unique identifier for each task. Note that recreating a task (using CREATE OR REPLACE TASK) essentially creates a new task, which has a new ID. |
| database_name | Database in which the task is stored. |
| schema_name | Schema in which the task is stored. |
| owner | Role that owns the task; that is, has the OWNERSHIP privilege on the task |
| comment | Comment for the task. |
| warehouse | Warehouse that provides the required resources to run the task. |
| schedule | Schedule for running the task. Displays NULL if no schedule is specified or the task is a triggered task. |
| scheduling_mode | Displays whether a serverless task is FIXED or FLEXIBLE.   * For FIXED, the task execution is based on the user-specified schedule for the task. * For FLEXIBLE, the task execution is based on the user-specified schedule and target completion interval for the task. |
| target_completion_interval | Target completion interval for a serverless task. Used to determine compute resource size for execution. |
| predecessors | JSON array of any tasks that are identified in the AFTER parameter for the task; that is, predecessor tasks. When they are run successfully to completion, these tasks trigger the current task. Individual task names in the array are fully qualified; that is, they include the container database and schema names. . . Displays an empty array if the task has no predecessor. |
| state | ‘started’ or ‘suspended’ based on the current state of the task. |
| definition | SQL statements executed when the task runs. |
| condition | Condition specified in the WHEN clause for the task. |
| allow_overlapping_execution | For root tasks in a [task graph](../../user-guide/tasks-graphs.md), displays TRUE if overlapping execution of the task graph is explicitly allowed. For child tasks in a task graph, displays NULL. |
| error_integration | Name of the notification integration used to access Amazon Simple Notification Service (SNS), Google Pub/Sub, or Microsoft Azure Event Grid to relay error notifications for the task. |
| success_integration | Name of the notification integration that is used to access Amazon Simple Notification Service (SNS), Google Pub/Sub, or Microsoft Azure Event Grid to relay success notifications for the task. |
| last_committed_on | Timestamp when a [version](../../user-guide/tasks-intro.md) of the task was last set. If no version was set—that is, if the task wasn’t resumed or manually run after it was created—the value is NULL. |
| last_suspended_on | Timestamp when the task was last suspended. Displays the timestamps for both the root tasks and the child tasks. If the task hasn’t been suspended yet, the value is NULL. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| config | For the root task in a task graph, displays the default configuration set in the task definition with [CREATE TASK](create-task.md) or [ALTER TASK](alter-task.md). For child tasks in a task graph, displays NULL. |
| last_suspended_reason | Displays the reason why the task was suspended. The possible reasons include the following:  * USER_SUSPENDED: The user suspended the task by running the `alter task <name> suspend` command. * SCHEMA_OR_DATABASE_DELETED: The schema or database of the task was dropped. * GRANT_OWNERSHIP: The user transferred the ownership of the task to another role by running the `grant ownership` command. * SUSPENDED_DUE_TO_ERRORS: The task failed a certain number of consecutive times and was suspended. You can set the [SUSPEND_TASK_AFTER_NUM_FAILURES](../parameters.md) parameter for the number of failures required to suspend this task. * CHILD_BECAME_ROOT: The task was previously a child task in a task graph, but all predecessors of the child task were removed and the child task became a root task. * FINALIZER_BECAME_ROOT: The task was previously a finalizer task in a task graph, but the finalization was removed and the task became a root task. * MATCHING_OWNER_NOT_FOUND: During [task replication](../../user-guide/account-replication-considerations.md), the role that owns the task was not found on the secondary database.  Displays NULL if the task has never been suspended, or if the task was last suspended before the column was introduced with [2023_08 Bundle (Generally Enabled)](../../release-notes/bcr-bundles/2023_08_bundle.md). |
| task_relations | JSON object that describes the task relationships, including predecessor tasks and finalizer tasks. The object can contain the following fields:   * `Predecessors`: Array of fully qualified names of tasks identified in the AFTER parameter for the task. When all predecessor tasks run successfully to   completion, they trigger the current task.   If the task has no predecessors, this is an empty array. * `FinalizerTask`: Fully qualified name of the finalizer task associated with this root task.   Displayed only for root tasks that have a finalizer task. * `FinalizedRootTask`: Fully qualified name of the root task that this task finalizes. Displayed only for finalizer tasks.   Examples:   * Root task with a finalizer task: `{"Predecessors":[],"FinalizerTask":"MY_DB.MY_SCHEMA.FINALIZE_LONG_RUNNING_TASK"}` * Finalizer task: `{"FinalizedRootTask":"MY_DB.MY_SCHEMA.LONG_RUNNING_TASK","Predecessors":[]}` * Task with predecessors: `{"Predecessors":["MY_DB.MY_SCHEMA.DEFTASK"]}` |
| execute_as_user | Shows as NULL if executed as the system user (default). Shows the user name of a user running a task using impersonated privileges (EXECUTE AS USER). To learn more, see [Run tasks with user privileges](../../user-guide/tasks-intro.md). |

For more information about the properties that can be specified for a task, see [CREATE TASK](create-task.md).

## Usage notes

* Only returns rows for a task owner—that is, the role with the OWNERSHIP privilege on a task—or a role with either the MONITOR
  or OPERATE privilege on a task.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the tasks whose name starts with `line` that you have privileges to view in the `tpch.public` schema:

> ```sqlexample
> SHOW TASKS LIKE 'line%' IN tpch.public;
> ```

Show all the tasks that you have privileges to view in the `tpch.public` schema:

> ```sqlexample
> SHOW TASKS IN tpch.public;
> ```

---
title: SHOW TELEMETRY EVENT DEFINITIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-telemetry-event-definitions.md
section: SQL Commands
---

# SHOW TELEMETRY EVENT DEFINITIONS

Lists the [event definitions](../../developer-guide/native-apps/event-definition.md) for the specified app.

## Syntax

```sqlsyntax
SHOW TELEMETRY EVENT DEFINITIONS IN APPLICATION <name>
```

## Parameters

`name`
:   Specifies the identifier for the app. If the identifier contains
    spaces, special characters, or mixed-case characters, the entire string must be enclosed
    in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Output

Shows information about the event definitions for an app.

| Column | Description |
| --- | --- |
| `name` | The name of the event definition. Event definition names begin with the `SNOWFLAKE$` prefix. |
| `type` | The type of event definition. See [Configure event definitions for an app](../../developer-guide/native-apps/event-definition.md) for more information. |
| `sharing` | Specifies if the event definition is `MANDATORY` or `OPTIONAL`. |
| `status` | Specifies if the event definition is enabled in the consumer account. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

```sqlexample
SHOW TELEMETRY EVENT DEFINITIONS IN APPLICATION hello_snowflake;
```

```output
+--------------------------+----------------+---------------+--------------+
|   name                   |   type         |   sharing     |   status     |
+--------------------------+----------------+---------------+--------------+
|   SNOWFLAKE$DEBUG_LOGS   |   DEBUG_LOGS   |   OPTIONAL    |   ENABLED    |
|   SNOWFLAKE$TRACES       |   TRACES       |   MANDATORY   |   ENABLED    |
+--------------------------+----------------+---------------+--------------+
```

---
title: SHOW TRANSACTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-transactions.md
section: SQL Commands
---

# SHOW TRANSACTIONS

List all running transactions. The command can be used to show transactions for the current user or all users in the account.

See also:
:   [SHOW LOCKS](show-locks.md)

## Syntax

```sqlsyntax
SHOW TRANSACTIONS [ IN ACCOUNT ]
```

## Parameters

`IN ACCOUNT`
:   Shows all transactions across all users in the account. It can only be used by users with the ACCOUNTADMIN role (i.e. account administrators).

## Output

The command output shows transaction metadata in the following columns:

| Column | Description |
| --- | --- |
| `id` | Transaction ID (a signed 64-bit integer). |
| `user` | Current user. |
| `session` | Session ID. |
| `name` | User-defined name or system-generated name (UUID) for the transaction. |
| `started_on` | Timestamp that specifies when the transaction started executing. |
| `state` | Transaction state: `running`. |
| `scope` | ID of the operation that created a stored procedure in a scoped transaction. `0` for non-scoped transactions. |

## Usage notes

* The command output includes the IDs for all running transactions. These IDs can be used as input for
  [SYSTEM$ABORT_TRANSACTION](../functions/system_abort_transaction.md) to abort a specified transaction.
* A stored procedure that contains a transaction can be called from within another transaction. These
  transactions are separate but “scoped.” The values in the `scope` column are useful for discovering whether two transactions are in the same scope.
  For more information, see [Scoped transactions](../transactions.md).

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Example

In this example, two sessions are being run by the same user, with one transaction in progress for each session.

```sqlexample
SHOW TRANSACTIONS;
```

```output
+---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------+
|                  id | user    |         session | name                                 | started_on                    | state   | scope |
|---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------|
| 1721165674582000000 | CALIBAN | 186457423713330 | 551f494d-90ed-438d-b32b-1161396c3a22 | 2024-07-16 14:34:34.582 -0700 | running |     0 |
| 1721165584820000000 | CALIBAN | 186457423749354 | a092aa44-9a0a-4955-9659-123b35c0efeb | 2024-07-16 14:33:04.820 -0700 | running |     0 |
+---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------+
```

---
title: SHOW TYPES
source: https://docs.snowflake.com/en/sql-reference/sql/show-types.md
section: SQL Commands
---

# SHOW TYPES

Lists the [user-defined types](../data-types-user-defined.md) for which you have access privileges.
Use this command to list the user-defined types for a specified schema or database, the current schema or
database for the session, or your entire account.

See also:
:   [CREATE TYPE](create-type.md) , [ALTER TYPE](alter-type.md) , [DESCRIBE TYPE](desc-type.md) , [DROP TYPE](drop-type.md) , [UNDROP TYPE](undrop-type.md)

## Syntax

```sqlsyntax
SHOW TYPES [ LIKE '<pattern>' ]
               [ IN
                    {
                      ACCOUNT                                         |

                      DATABASE                                        |
                      DATABASE <database_name>                        |

                      SCHEMA                                          |
                      SCHEMA <schema_name>                            |
                      <schema_name>

                      APPLICATION <application_name>                  |
                      APPLICATION PACKAGE <application_package_name>  |
                    }
               ]
           [ STARTS WITH '<name_string>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following parameters:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

## Output

The command output provides user-defined type properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `name` | Name of the user-defined type. |
| `type` | Snowflake type definition that is the base type for the user-defined type. |
| `created_on` | Date and time when the user-defined type was created. |
| `database_name` | Database in which the user-defined type is stored. |
| `schema_name` | Schema in which the user-defined type is stored. |
| `owner` | Name of the role that owns the user-defined type (that is, the role that has the OWNERSHIP privilege on the user-defined type). |
| `comment` | The comment set for the type, if any (otherwise `NULL`). |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any | User-defined type |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

Use the SHOW TYPES command to list details about the `age` user-defined type:

```sqlexample
SHOW TYPES LIKE 'age';
```

```output
+------+-------------+-------------------------------+---------------+----------------+--------------+---------+
| name | type        | created_on                    | database_name | schema_name    | owner        | comment |
|------+-------------+-------------------------------+---------------+----------------+--------------+---------|
| AGE  | NUMBER(3,0) | 2025-10-28 09:09:43.279 -0700 | MY_DB         | MY_SCHEMA      | MY_ROLE      | NULL    |
+------+-------------+-------------------------------+---------------+----------------+--------------+---------+
```

---
title: SHOW USER FUNCTIONS
source: https://docs.snowflake.com/en/sql-reference/sql/show-user-functions.md
section: SQL Commands
---

# SHOW USER FUNCTIONS

Lists all user-defined functions (UDFs) for which you have access privileges. Use this command to list the UDFs for a specified
database or schema (or the current database/schema for the session), or across your entire account.

For a command that lists all functions, including built-in functions, see [SHOW FUNCTIONS](show-functions.md).

See also:
:   [SHOW FUNCTIONS](show-functions.md), [SHOW EXTERNAL FUNCTIONS](show-external-functions.md), [FUNCTIONS view](../info-schema/functions.md) (Information Schema),
    [FUNCTIONS view](../account-usage/functions.md) (Account Usage)

## Syntax

```sqlsyntax
SHOW USER FUNCTIONS [ LIKE '<pattern>' ]
  [ IN
    {
      ACCOUNT                                         |

      DATABASE                                        |
      DATABASE <database_name>                        |

      SCHEMA                                          |
      SCHEMA <schema_name>                            |
      <schema_name>

      APPLICATION <application_name>                  |
      APPLICATION PACKAGE <application_package_name>  |
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The command output provides user function properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp at which the user-defined function (UDF) was created. |
| `name` | Name of the UDF. |
| `schema_name` | Name of the schema in which the UDF exists. |
| `is_builtin` | Always `N` for user-defined functions. See [SHOW FUNCTIONS](show-functions.md) for a command to list all functions, including built-in functions. |
| `is_aggregate` | `Y` if the function is an aggregate function; `N` otherwise. |
| `is_ansi` | Not applicable currently. |
| `min_num_arguments` | Minimum number of arguments to the UDF. |
| `max_num_arguments` | Maximum number of arguments to the UDF. |
| `arguments` | Data types of the arguments and return value. |
| `description` | Description of the UDF. |
| `catalog_name` | Name of the database in which the UDF exists. |
| `is_table_function` | `Y` if the UDF is a table function; `N` otherwise. |
| `valid_for_clustering` | `Y` if the UDF can be used in a CLUSTER BY expression; `N` otherwise. |
| `is_secure` | `Y` if the UDF is a secure UDF; `N` otherwise. |
| `secrets` | Map of [secret](create-secret.md) values specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| `external_access_integrations` | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter. |
| `is_external_function` | `Y` if the function is an external function; `N` otherwise. See [SHOW EXTERNAL FUNCTIONS](show-external-functions.md) for a command to list external functions. |
| `language` | Programming language of the UDF (for example, `PYTHON` or `SQL`). |
| `is_memoizable` | `Y` if the function is [memoizable](../../developer-guide/udf/sql/udf-sql-scalar-functions.md); `N` otherwise. |
| `is_data_metric` | `Y` if the function is a [data metric function](../../user-guide/data-quality-intro.md); `N` otherwise. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all the UDFs that you have privileges to view in the current database:

> ```sqlexample
> SHOW USER FUNCTIONS LIKE 'ALLOWED_REGIONS%' IN SCHEMA;
> ```
>
> ```output
> ---------------------------------+--------------------------+-------------+------------+--------------+---------+-------------------+-------------------+-----------------------------------------+-----------------------+----------------+-------------------+----------------------+-----------+---------+-----------------------------+----------------------+----------+---------------+----------------+
>           created_on             |           name           | schema_name | is_builtin | is_aggregate | is_ansi | min_num_arguments | max_num_arguments |                arguments                |      description      |  catalog_name  | is_table_function | valid_for_clustering | is_secure | secrets | external_access_integration | is_external_function | language | is_memoizable | is_data_metric |
> ---------------------------------+--------------------------+-------------+------------+--------------+---------+-------------------+-------------------+-----------------------------------------+-----------------------+----------------+-------------------+----------------------+-----------+---------+-----------------------------+----------------------+----------+---------------+----------------+
>  Fri, 23 Jun 1967 00:00:00 -0700 | ALLOWED_REGIONS          | PUBLIC      | N          | N            | N       | 0                 | 0                 | ALLOWED_REGIONS() RETURN ARRAY          | user-defined function | MEMO_FUNC_TEST | N                 | N                    | N         |         |                             | N                    | SQL      | Y             | N              |
>  Fri, 23 Jun 1967 00:00:00 -0700 | ALLOWED_REGIONS_NON_MEMO | PUBLIC      | N          | N            | N       | 0                 | 0                 | ALLOWED_REGIONS_NON_MEMO() RETURN ARRAY | user-defined function | MEMO_FUNC_TEST | N                 | N                    | N         |         |                             | N                    | SQL      | N             | N              |
> ---------------------------------+--------------------------+-------------+------------+--------------+---------+-------------------+-------------------+-----------------------------------------+-----------------------+----------------+-------------------+----------------------+-----------+---------+-----------------------------+----------------------+----------+---------------+----------------+
> ```

---
title: SHOW USER PROCEDURES
source: https://docs.snowflake.com/en/sql-reference/sql/show-user-procedures.md
section: SQL Commands
---

# SHOW USER PROCEDURES

Lists all user-defined procedures for which you have access privileges. Use this command to list the user-defined procedures for a specified
database or schema (or the current database/schema for the session), application, or for your entire account.

For a command that lists all procedures, including both built-in and user-defined procedures, see [SHOW PROCEDURES](show-procedures.md).

See also:
:   [SHOW PROCEDURES](show-procedures.md), [PROCEDURES view](../info-schema/procedures.md) (Information Schema),
    [PROCEDURES view](../account-usage/procedures.md) (Account Usage), SHOW USER PROCEDURES

## Syntax

```sqlsyntax
SHOW USER PROCEDURES [ LIKE '<pattern>' ]
  [ IN
    {
      ACCOUNT                                         |

      DATABASE                                        |
      DATABASE <database_name>                        |

      SCHEMA                                          |
      SCHEMA <schema_name>                            |
      <schema_name>

      APPLICATION <application_name>                  |
      APPLICATION PACKAGE <application_package_name>  |
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    `APPLICATION application_name`, . `APPLICATION PACKAGE application_package_name`
    :   Returns records for the named Snowflake Native App or application package.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Output

The command output lists user procedure properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Timestamp at which the procedure was created. |
| `name` | Name of the procedure. |
| `schema_name` | Name of the schema in which the procedure exists. |
| `is_builtin` | `Y` if the procedure is built in; `N` otherwise (always `N` for user-created procedures). |
| `is_aggregate` | Not applicable currently. |
| `is_ansi` | Not applicable currently. |
| `min_num_arguments` | Minimum number of arguments to the procedure. |
| `max_num_arguments` | Maximum number of arguments to the procedure. |
| `arguments` | Data types of the arguments and return value. For [Snowflake Scripting stored procedures](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md), `OUT` is displayed for output arguments. |
| `description` | Description of the procedure. |
| `catalog_name` | Name of the database in which the procedure exists. |
| `is_table_function` | `Y` if the procedure returns a table; `N` otherwise. |
| `valid_for_clustering` | `Y` if the procedure can be used in a CLUSTER BY expression; `N` otherwise. |
| `is_secure` | `Y` if the procedure is a secure procedure; `N` otherwise. |
| `secrets` | Map of [secret](create-secret.md) values specified by the procedure’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| `external_access_integrations` | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the procedure’s EXTERNAL_ACCESS_INTEGRATION parameter. |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show procedures that you have privileges to view in the current schema whose names begin with `GET_`:

> ```sqlexample
> SHOW USER PROCEDURES LIKE 'GET_%' IN SCHEMA;
> ```
>
> ```output
> -------------------------------+-----------------+-------------+------------+--------------+---------+-------------------+-------------------+---------------------------------------+------------------------+--------------+-------------------+----------------------+-----------+---------+------------------------------+
>           created_on           | name            | schema_name | is_builtin | is_aggregate | is_ansi | min_num_arguments | max_num_arguments | arguments                             | description            | catalog_name | is_table_function | valid_for_clustering | is_secure | secrets | external_access_integrations |
> -------------------------------+-----------------+-------------+------------+--------------+---------+-------------------+-------------------+---------------------------------------+------------------------+--------------+-------------------+----------------------+-----------+---------+------------------------------+
>  2023-01-27 15:01:13.862 -0800 | GET_FILE        | PUBLIC      | N          | N            | N       | 1                 | 1                 | GET_FILE(VARCHAR) RETURN VARCHAR      | user-defined procedure | BOOKS_DB     | N                 | N                    | N         |         |                              |
>  2023-03-23 10:38:10.423 -0700 | GET_NUM_RESULTS | PUBLIC      | N          | N            | N       | 1                 | 1                 | GET_NUM_RESULTS(VARCHAR) RETURN FLOAT | user-defined procedure | BOOKS_DB     | N                 | N                    | N         |         |                              |
>  2023-03-23 09:47:55.840 -0700 | GET_RESULTS     | PUBLIC      | N          | N            | N       | 1                 | 1                 | GET_RESULTS(VARCHAR) RETURN TABLE ()  | user-defined procedure | BOOKS_DB     | Y                 | N                    | N         |         |                              |
> -------------------------------+-----------------+-------------+------------+--------------+---------+-------------------+-------------------+---------------------------------------+------------------------+--------------+-------------------+----------------------+-----------+---------+------------------------------+
> ```

---
title: SHOW USER PROGRAMMATIC ACCESS TOKENS
source: https://docs.snowflake.com/en/sql-reference/sql/show-user-programmatic-access-tokens.md
section: SQL Commands
---

# SHOW USER PROGRAMMATIC ACCESS TOKENS

Lists the [programmatic access tokens](../../user-guide/programmatic-access-tokens.md) associated with a user.

> **Note:**
>
> The list includes programmatic access tokens that have expired within the past 30 days. To view information about tokens that
> have expired more than 30 days ago, query the [CREDENTIALS view](../account-usage/credentials.md).

See also:
:   [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-add-programmatic-access-token.md) ,
    [ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-modify-programmatic-access-token.md) ,
    [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-rotate-programmatic-access-token.md) ,
    [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](alter-user-remove-programmatic-access-token.md)

## Syntax

```sqlsyntax
SHOW USER { PROGRAMMATIC ACCESS TOKENS | PATS } [ FOR USER <username> ]
```

You can use the keyword PATS as a shorter way of specifying the keywords PROGRAMMATIC ACCESS TOKENS.

## Parameters

`FOR USER username`
:   Lists the programmatic access tokens for the specified user.

    Default: Lists the programmatic access tokens for the current user.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

The command output includes the following columns, which provide properties and metadata for each programmatic access token:

| Column | Description |
| --- | --- |
| `name` | The name of the programmatic access token. |
| `user_name` | The username associated with the programmatic access token.  If the user associated with the programmatic access token was removed from the account, then Snowflake returns the user ID instead of the username. You can find information about a removed user by using the [USERS view](../account-usage/users.md) in the [ACCOUNT_USAGE](../account-usage.md) schema. |
| `role_restriction` | The name of the role that the programmatic access token inherits privileges from. |
| `expires_at` | The timestamp when the programmatic access token expires. |
| `status` | The status of the programmatic access token. This column can be one of the following values:   * `ACTIVE`: The programmatic access token can be used to authenticate and has not expired yet. * `EXPIRED`: The programmatic access token cannot be used to authenticate because the expiration date has passed. * `DISABLED`: The programmatic access token is [disabled](../../user-guide/programmatic-access-tokens.md) because user login access is disabled or   the user is locked out of logging in. |
| `comment` | A user-provided comment about the programmatic access token. |
| `created_on` | The date when the programmatic access token was created. |
| `created_by` | The username or user ID of the user who created the programmatic access token. |
| `mins_to_bypass_required_network_policy` | The number of minutes during which a user can use this token to access Snowflake without being subject to an active [network policy](../../user-guide/network-policies.md). |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY | User | Required only when displaying programmatic access tokens for a human user other than yourself or a service user. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command lists all programmatic access tokens for a given user, not all programmatic access tokens for an account.
* The programmatic access token secret is never returned after creation.
* After seven days, expired programmatic access tokens are deleted and no longer appear in the output of the command.

## Examples

Show information about programmatic access tokens associated with the user `example_user`:

```sqlexample
SHOW USER PROGRAMMATIC ACCESS TOKENS FOR USER example_user;
```

---
title: SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS
source: https://docs.snowflake.com/en/sql-reference/sql/show-user-workload-identity-authentication-methods.md
section: SQL Commands
---

# SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS

Lists the [workload identity federation](../../user-guide/workload-identity-federation.md) settings for a service user.

## Syntax

```sqlsyntax
SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS [ FOR USER <username> ]
```

## Parameters

`FOR USER username`
:   > Lists the workload identity federation settings for the specified user.

    If no user is specified, the command lists the settings for the current user.

## Output

| Column | Description |
| --- | --- |
| `name` | Name of the service user. |
| `type` | The identity provider that is issuing attestations for the service user. Possible values are:   * `AWS`: AWS Identity and Access Management (AWS IAM) is the identity provider, which indicates the workload is running on AWS. * `AZURE`: Microsoft Entra ID is the identity provider, which indicates the workload is running on Microsoft Azure. * `GCP`: Google Accounts is the identity provider, which indicates the workload is running on Google Cloud. * `OIDC`: An OpenID Connect (OIDC) provider is the identity provider. |
| `comment` | Reserved for future use. |
| `last_used` | Date and time that the service user last used workload identity federation to authenticate to Snowflake. |
| `created_on` | Date and time that someone ran a CREATE USER or ALTER USER command to set the `WORKLOAD_IDENTITY` parameter. |
| `additional_info` | Additional details about how the service user is configured to use workload identity federation. The details depend on the value in the `type` column.   * For `TYPE = 'AWS'`, the column contains an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `awsPartition` key, the value is the AWS partition for the federated identity.   + For the `awsAccount` key, the value is the AWS account identifier for the federated identity.   + For the `type` key, the value is the type of the federated identity. This can be `IAM_USER` or `IAM_ROLE`.   + For the `iamRole` key, the value is the name of the federated IAM role or user. * For `TYPE = 'AZURE'`, the column contains an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `issuer` key, the value is the Entra ID tenant’s Authority URL.   + For the `subject` key, the value is the Object ID (Principal ID) assigned to the Azure workload that is using a managed identity. * For `TYPE = 'GCP'`, the column contains an [OBJECT](../data-types-semistructured.md) value with the following key-value pair:    + For the `subject` key, the value is the `uniqueId` property of the Google Cloud service account associated with the federated workload. * For `TYPE = 'OIDC'`, the column contains an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `issuer` key, the value is the issuer URL of the OpenID Connect (OIDC) provider.   + For the `subject` key, the value is the identifier of the federated workload.   + For the `audienceList` key, the value is the custom audiences that are allowed in an OIDC ID token. An empty value means the default audience `snowflakecomputing.com` is required. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MONITOR | User | Required only when displaying workload identity federation settings for a different service user. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

Show workload identity authentication settings for the user `example_service_user`:

```sqlexample
SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS FOR USER example_service_user;
```

---
title: SHOW USERS
source: https://docs.snowflake.com/en/sql-reference/sql/show-users.md
section: SQL Commands
---

# SHOW USERS

Lists all [users](../../user-guide/admin-user-management.md) in the system.

See also:
:   [CREATE USER](create-user.md) , [ALTER USER](alter-user.md) , [DROP USER](drop-user.md) , [DESCRIBE USER](desc-user.md)

## Syntax

```sqlsyntax
SHOW [ TERSE ] USERS
  [ LIKE '<pattern>' ]
  [ STARTS WITH '<name_string>' ]
  [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Returns only the following output columns:

    * `name`
    * `created_on`
    * `display_name`
    * `first_name`
    * `last_name`
    * `email`
    * `org_identity`
    * `comment`
    * `has_password`
    * `has_rsa_public_key`
    * `type`
    * `has_mfa`
    * `has_pat`
    * `has_federated_workload_authentication`
    * `is_from_organization_user`

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the user. |
| `created_on` | Date and time when the user was created. |
| `login_name` | Name that the user enters to log into the system. |
| `display_name` | Name displayed for the user in [Snowsight](../../user-guide/ui-snowsight-gs.md). |
| `first_name` | First name of the user. |
| `last_name` | Last name of the user. |
| `email` | Email addresss for the user. |
| `mins_to_unlock` | Number of minutes until [the temporary lock on the user login is cleared](../../user-guide/admin-user-management.md). |
| `days_to_expiry` | Number of days after which the user status is set to “Expired” and the user is no longer allowed to log in. |
| `comment` | Comment about the user. |
| `disabled` | If TRUE, the user is [locked out of Snowflake and cannot log back in](../../user-guide/admin-user-management.md). |
| `must_change_password` | If TRUE, the user is forced to change their password on next login (including their first/initial login) into the system. |
| `snowflake_lock` | If TRUE, the user is locked by Snowflake. When a user is locked, they are unable to log in until the lock is removed. |
| `default_warehouse` | Virtual warehouse that is active by default for the user’s session upon logging in. |
| `default_namespace` | Namespace (database only or database and schema) that is active by default for the user’s session upon logging in. |
| `default_role` | Primary role that is active by default for the user’s session upon logging in. |
| `default_secondary_roles` | Set of secondary roles that are active for the user’s session upon logging in. |
| `ext_authn_duo` | If TRUE, [Duo](../../user-guide/security-mfa-duo.md) is enabled for the user, which requires the user to use [MFA (multi-factor authentication)](../../user-guide/security-mfa.md) when logging in. |
| `ext_authn_uid` | Authorization ID used for Duo. |
| `mins_to_bypass_mfa` | Number of minutes to [temporarily bypass MFA requirement for the user](../../user-guide/security-mfa.md). |
| `owner` | Role that owns the user. |
| `last_success_login` | Date and time when the user last logged in to the Snowflake. |
| `expires_at_time` | Date and time when the user’s status is set to `EXPIRED` and the user can no longer log in. |
| `locked_until_time` | Number of minutes until the temporary lock on the user login is cleared. |
| `has_password` | If TRUE, the user has a password. |
| `has_rsa_public_key` | If TRUE, the user has a public key for [key-pair authentication](../../user-guide/key-pair-auth.md). |
| `type` | Type of the user. For a list of possible values, see [Types of users](../../user-guide/admin-user-management.md). |
| `has_mfa` | If TRUE, the user is enrolled in [multi-factor authentication (MFA)](../../user-guide/security-mfa.md). |
| `has_pat` | If TRUE, the user has one or more [programmatic access tokens](../../user-guide/programmatic-access-tokens.md). |
| `has_workload_identity` | If TRUE, the user is configured to authenticate with [workload identity federation](../../user-guide/workload-identity-federation.md). |
| `is_from_organization_user` | If TRUE, the user was imported from a global [organization user](../../user-guide/organization-users.md). |

## Access control requirements

Any user can execute the SHOW USERS command. The output always includes the username in the `name` column.

For the other columns, Snowflake filters the output based upon the privileges granted to the user’s
[active role](../../user-guide/security-access-control-overview.md). The values in the other columns are returned if the active role
has either of the following privileges:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | User |  |
| MANAGE GRANTS | Account |  |

Otherwise, the other columns contain NULL.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* If the account has more than 10,000 users, you can use the LIMIT … FROM …
  parameter to return smaller sets of users.

  For example, you can run `SHOW USERS LIMIT 10000 FROM my_user` to return the next 10000 users starting from the user named
  `my_user`.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

The following example lists the users in the account:

```sqlexample
SHOW USERS;
```

```output
+--------------+-------------------------------+---------------+--------------+------------+-----------+------------------------+----------------+----------------+---------+----------+----------------------+----------------+-------------------+-------------------+--------------+-------------------------+---------------+---------------+--------------------+--------------+-------------------------------+-----------------+-------------------+--------------+--------------------+--------+---------+---------+---------------------------------------+
| name         | created_on                    | login_name    | display_name | first_name | last_name | email                  | mins_to_unlock | days_to_expiry | comment | disabled | must_change_password | snowflake_lock | default_warehouse | default_namespace | default_role | default_secondary_roles | ext_authn_duo | ext_authn_uid | mins_to_bypass_mfa | owner        | last_success_login            | expires_at_time | locked_until_time | has_password | has_rsa_public_key | type   | has_mfa | has_pat | has_federated_workload_authentication |
|--------------+-------------------------------+---------------+------------- +------------+-----------+------------------------+----------------+----------------+---------+----------+----------------------+----------------+-------------------+-------------------+--------------+-------------------------+---------------+---------------+--------------------+--------------+-------------------------------+-----------------+-------------------+--------------+--------------------+--------+---------+---------+---------------------------------------|
| MY_USER_NAME | 2020-04-28 12:24:38.722 -0700 | MY_LOGIN_NAME | Jane Smith   | Jane       | Smith     | jane.smith@example.com | NULL           | NULL           | NULL    | false    | false                | false          | MY_WAREHOUSE      | MY_DB.MY_SCHEMA   | MY_ROLE      | []                      | false         | NULL          | NULL               | ACCOUNTADMIN | 2025-06-12 15:02:22.783 -0700 | NULL            | NULL              | true         | true               | PERSON | true    | true    | false                                 |
+--------------+-------------------------------+---------------+--------------+------------+-----------+------------------------+----------------+----------------+---------+----------+----------------------+----------------+-------------------+-------------------+--------------+-------------------------+---------------+---------------+--------------------+--------------+-------------------------------+-----------------+-------------------+--------------+--------------------+--------+---------+---------+---------------------------------------+
```

---
title: SHOW VARIABLES
source: https://docs.snowflake.com/en/sql-reference/sql/show-variables.md
section: SQL Commands
---

# SHOW VARIABLES

Lists all [variables](../session-variables.md) defined in the current session.

See also:
:   [SET](set.md) , [UNSET](unset.md)

## Syntax

```sqlsyntax
SHOW VARIABLES [ LIKE '<pattern>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

---
title: SHOW VERSIONS IN APPLICATION PACKAGE
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions.md
section: SQL Commands
---

# SHOW VERSIONS IN APPLICATION PACKAGE

Lists the versions defined in the specified application package.

See also:
:   [ALTER APPLICATION](alter-application.md), [CREATE APPLICATION](create-application.md), [DESCRIBE APPLICATION](desc-application.md), [DROP APPLICATION](drop-application.md)

## Syntax

```sqlsyntax
SHOW VERSIONS [ LIKE <pattern> ]
  IN APPLICATION PACKAGE <name>;
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN APPLICATION PACKAGE name`
:   Specifies the identifier for the application package whose versions you want to view.

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Example

```sqlexample
SHOW VERSIONS IN APPLICATION PACKAGE hello_snowflake_app;
```

```output
+----------------+-------+---------+---------+-------------------------------+------------+-----------+-------------+-------+---------------+
| version        | patch | label   | comment | created_on                    | dropped_on | log_level | trace_level | state | review_status |
|----------------+-------+---------+---------+-------------------------------+------------+-----------+-------------+-------+---------------|
| V1_0           |     0 | NULL    | NULL    | 2023-05-10 17:11:47.696 -0700 | NULL       | OFF       | OFF         | READY | NOT_REVIEWED  |
+----------------+-------+---------+---------+-------------------------------+------------+-----------+-------------+-------+---------------+
```

---
title: SHOW VERSIONS IN DATASET
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions-in-dataset.md
section: SQL Commands
---

# SHOW VERSIONS IN DATASET

Displays information about the datasets in your account at either the schema or database level.

See also:
:   [SHOW DATASETS](show-datasets.md) , [ALTER DATASET](alter-dataset.md), [CREATE DATASET](create-dataset.md)

## Syntax

```sqlsyntax
SHOW VERSIONS [ LIKE '<pattern>' ] IN DATASET <dataset_name>
  [ LIMIT <rows>]
```

## Parameters

`IN DATASET dataset_name`
:   Name of dataset for which versions are displayed.

`LIKE pattern`
:   Restricts the list of returned datasets to those matching the specified pattern. Matching is case-insensitive.

`LIMIT num`
:   Limits the maximum number of rows returned.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP or USAGE | Dataset | Provides the privilege to show the dataset versions within the account. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Regarding metadata:

  > **Attention:**
  >
  > Customers should ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered as metadata when using the Snowflake service. For more information, see [Metadata fields in Snowflake](../metadata.md).

---
title: SHOW VERSIONS IN DBT PROJECT
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions-in-dbt-project.md
section: SQL Commands
---

# SHOW VERSIONS IN DBT PROJECT

Displays a list of all versions of a [dbt project object](../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

See also:
:   [ALTER DBT PROJECT](alter-dbt-project.md), [DESCRIBE DBT PROJECT](desc-dbt-project.md), [EXECUTE DBT PROJECT](execute-dbt-project.md), [SHOW DBT PROJECTS](show-dbt-projects.md), [DROP DBT PROJECT](drop-dbt-project.md)

## Syntax

```sqlsyntax
SHOW VERSIONS IN DBT PROJECT <name>
  [ LIMIT <number> ]
```

## Parameters

`name`
:   String that specifies the identifier (that is, the name) for the dbt project object within Snowflake; must be unique for the schema in which
    the dbt project is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`LIMIT rows`
:   Optionally limits the maximum number of rows returned. The actual number of rows returned might be less than the specified limit. For
    example, the number of existing objects is less than the specified limit.

    Default: No value (no limit is applied to the output).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| USAGE | The dbt project object |
| MONITOR | The dbt project object |
| OWNERSHIP | The dbt project object |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Output

The command output provides table properties and metadata about versions of dbt Projects in the following columns:

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the dbt project object was created. |
| `name` | The auto-assigned name of the dbt project version. For example, `VERSION$1`. |
| `alias` | The alias for the dbt Project you assigned (for example, `ALTER DBT PROJECT <name> ADD VERSION <alias> FROM ...`). Null if not specified. |
| `location_uri` | Full URL of the dbt project version. |
| `is_default` | TRUE if the default version of the dbt project object points to this version. |
| `is_live` | TRUE if the dbt project version is a live version of the listing. |
| `is_first` | TRUE if the dbt Project is the first version. |
| `is_last` | TRUE if the dbt Project is the last version. |
| `comment` | Comment set on the dbt Project. |
| `source_location_uri` | The source location URI where this dbt project version is created from. |
| `git_commit_hash` | The git commit hash, if the dbt project version was created from a git source. |

## Examples

Show all versions of `my_dbt_project`:

```sqlexample
SHOW VERSIONS IN DBT PROJECT my_dbt_project;
```

```output
+---------------------------------+-----------+-------+----------------------------------------------------------------------+------------+---------+----------+---------+---------+---------------------+-----------------+
|             created_on          | name      | alias |  location_uri                                                        | is_default | is_live | is_first | is_last | comment | source_location_uri | git_commit_hash |
+---------------------------------+-----------+-------+----------------------------------------------------------------------+------------+--------------------+---------+---------+---------------------+-----------------+
|   2025-01-08 11:18:24.550 -0800 | VERSION$2 | null  |  snow://dbtproject/mydb.my_schema.my_dbt_project/versions/version$2/ | TRUE       | FALSE   | FALSE    |  TRUE   | null    | null                | null            |
|   2025-01-08 11:17:32.894 -0800 | VERSION$1 | null  |  snow://dbtproject/mydb.my_schema.my_dbt_project/versions/version$2/ | FALSE      | FALSE   | TRUE     |  FALSE  | null    | null                | null            |
+---------------------------------+-----------+------------------------------+-----------------------------------------------+------------+--------------------+---------+---------+---------------------+-----------------+
```

---
title: SHOW VERSIONS IN LISTING
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions-in-listing.md
section: SQL Commands
---

# SHOW VERSIONS IN LISTING

Lists and provides details of all listing versions.

See also:
:   [CREATE LISTING](create-listing.md), [ALTER LISTING](alter-listing.md), [DESCRIBE LISTING](desc-listing.md), [DROP LISTING](drop-listing.md)

## Syntax

```sqlsyntax
SHOW VERSIONS IN LISTING <name>
  [ LIMIT <rows> ]
```

## Parameters

`name`
:   Specifies the listing identifier (name). If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive.

    For more information, see [Identifier Requirements](../identifiers-syntax.md).

`LIMIT rows`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | Date and time the version was created. |
| `name` | The system generated name of the version. |
| `alias` | The user specified alias of the version. |
| `location_url` | Full URL of the version, against which stage operations can be performed. |
| `is_default` | Identifies the listing version that is published. |
| `is_live` | Identifies if the version is a live version of the listing. |
| `is_first` | Identifies if the version is the first listing version. |
| `is_last` | Identifies if the version is the last listing version. |
| `comment` | Optional comments for the listing version. |
| `source_location_url` | The source location URL where this version is created from. |
| `git_commit_hash` | The git commit hash, if the version is created from a git source. |

## Access control requirements

* To show listing versions, you must be using a role that has USAGE or OWNERSHIP privileges on the listing.

## Usage notes

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

## Examples

Show all versions of the MYLISTING listing:

```sqlexample
SHOW VERSIONS IN LISTING MYLISTING
```

```output
+-----------------------------------+------------------------------+------------------------------+-----------------------------------------------+--------------------+--------------------+--------------------+--------------------+---------------------------------------------+---------------------------------------+---------------------------------------+
|             created_on            |             name             |             alias            |                  location_uri                 |     is_default     |       is_live      |      is_first      |       is_last      |                   comment                   |          source_location_uri          |             git_commit_hash           |
+-----------------------------------+------------------------------+------------------------------+-----------------------------------------------+--------------------+--------------------+--------------------+--------------------+---------------------------------------------+---------------------------------------+---------------------------------------+
|   2025-01-08 11:18:39.921 -0800   |                              |                              |  snow://listing/MYLISTING/versions/live/      |        FALSE       |        TRUE        |        FALSE       |       FALSE        |                                             |            @listingstage              |                                       |
|   2025-01-08 11:18:24.550 -0800   |        VERSION$2             |                              |  snow://listing/MYLISTING/versions/version$2/ |        TRUE        |        FALSE       |        FALSE       |       TRUE         |                                             |            @listingstage              |                                       |
|   2025-01-08 11:17:32.894 -0800   |        VERSION$1             |                              |  snow://listing/MYLISTING/versions/version$1/ |        FALSE       |        FALSE       |        TRUE        |       FALSE        |                                             |            @listingstage              |                                       |
+-----------------------------------+------------------------------+------------------------------+-----------------------------------------------+--------------------+--------------------+--------------------+--------------------+---------------------------------------------+---------------------------------------+---------------------------------------+
```

---
title: SHOW VERSIONS IN MODEL
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions-in-model.md
section: SQL Commands
---

# SHOW VERSIONS IN MODEL

Lists the versions in a machine learning model. Models may have multiple versions, one of which must be designated as
the default (see [ALTER MODEL](alter-model.md)).

The output returns table metadata and properties, ordered lexicographically by database, schema, and model name (see
Output in this topic for descriptions of the output columns). This is important to note if you wish to filter the
results using the provided filters.

See also:
:   [CREATE MODEL](create-model.md) , [DROP MODEL](drop-model.md) , [ALTER MODEL](alter-model.md), [SHOW MODELS](show-models.md)

## Syntax

```sqlsyntax
SHOW VERSIONS [ LIKE '<pattern>' ] IN MODEL <model_name>
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN MODEL model_name`
:   Specifies the identifier of the model that contains the versions to be listed. If the identifier contains spaces,
    special characters, or mixed-case characters, the entire identifier must be enclosed in double quotes. Identifiers
    enclosed in double quotes are also case-sensitive (e.g. `"My Object"`).

    If the model identifier is not fully-qualified (in the form of `db_name.schema_name.model_name` or
    `schema_name.model_name`), the command looks for the model in the current schema for the session.

## Output

The command output provides table properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the model version was created. |
| name | Name of the version. |
| aliases | Aliases of the model version, if any, including any you have assigned using [ALTER MODEL](alter-model.md) and any system aliases (DEFAULT, FIRST, or LAST) that apply. If a model version has no aliases, this column contains an empty ARRAY ([]). |
| database_name | Database in which the version is stored. |
| schema_name | Schema in which the version is stored. |
| model_name | Name of the model that this version belongs to. |
| is_default_version | Boolean value indicating whether this version is the model’s default version. |
| functions | JSON array of the names of the functions available in this version. |
| metadata | JSON object containing metadata as key-value pairs (`{}` if no metadata is specified). |
| user_data | JSON object from the `user_data` section of the model definition manifest (`{}` if no user data is specified). |

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

---
title: SHOW VERSIONS IN ORGANIZATION PROFILE
source: https://docs.snowflake.com/en/sql-reference/sql/show-versions-in-organization-profile.md
section: SQL Commands
---

# SHOW VERSIONS IN ORGANIZATION PROFILE

Lists the organization profile versions for which you have access privileges.

See also:
:   [ALTER ORGANIZATION PROFILE](alter-organization-profile.md), [CREATE ORGANIZATION PROFILE](create-organization-profile.md), [DESCRIBE AVAILABLE ORGANIZATION PROFILE](desc-available-organization-profile.md), [DESCRIBE ORGANIZATION PROFILE](desc-organization-profile.md), [DROP ORGANIZATION PROFILE](drop-organization-profile.md), [SHOW AVAILABLE ORGANIZATION PROFILES](show-available-organization-profiles.md)

## Syntax

```sqlsyntax
SHOW VERSIONS IN ORGANIZATION PROFILE <name>
```

## Parameters

`name`
:   Specifies the identifier for the organization profile on which you want to list organization profile versions. Must contain only uppercase characters or numbers.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case sensitive. See [Identifier requirements](../identifiers-syntax.md).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `created_on` | The date and time when the organization profile was created. |
| `name` | The organization profile name. |
| `alias` | The user-defined alias for the organization profile version. |
| `location_uri` | The URI for the organization profile version. |
| `is_live` | The organization profile version is live. One of `TRUE` or `FALSE`. |
| `is_default` | The organization profile version is the default. One of `TRUE` or `FALSE`. Must be `FALSE` when `is_live` is `TRUE`. |
| `is_first` | The organization profile is the first version. One of `TRUE` or `FALSE`. |
| `is_last` | The organization profile is the last version. One of `TRUE` or `FALSE`. |
| `comment` | Comments added by users. |
| `source_location_uri` | The source location URI for the organization profile version. |
| `git_commit_hash` | The git commit hash for the organization profile version when it’s created from a git source. `NONE` when a git commit hash is unavailable. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| MODIFY | Organization profile |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Examples

The following example lists the organization profile versions in the organization profile `MYORGANIZATIONPROFILE` that you have the privileges to access:

```sqlexample
SHOW VERSIONS IN ORGANIZATION PROFILE myorganizationprofile;
```

```output
+------------------------+---------------------+---------------------+-----------------------------------------------+---------------------+---------------------+-------------+-----------------+---------------------+---------------------+----------------+
|created_on              |name                 |alias                |location_uri                                   |is_live              |is_default           |is_first     |is_last          |comment              |source_location_uri  |git_commit_hash |
+------------------------+---------------------+---------------------+-----------------------------------------------+---------------------+---------------------+-------------+-----------------+---------------------+---------------------+----------------+
|2025-01-01 01:01:01.000 |VERSION$1            |V1                   |snow://notebook/mynotebook/versions/version$1  |TRUE                 |FALSE                |TRUE         |FALSE            |                     |@TESTDB.PUBLIC.STAGE |NONE            |
+------------------------+---------------------+---------------------+-----------------------------------------------+---------------------+---------------------+-------------+-----------------+---------------------+---------------------+----------------+
```

---
title: SHOW VIEWS
source: https://docs.snowflake.com/en/sql-reference/sql/show-views.md
section: SQL Commands
---

# SHOW VIEWS

Lists the views, including secure views, for which you have access privileges. The command can be used to list views for the
current/specified database or schema, or across your entire account.

The output returns view metadata and properties, ordered lexicographically by database, schema, and view name. This is important to note
if you wish to filter the results using the provided filters.

See also:
:   [ALTER VIEW](alter-view.md) , [CREATE VIEW](create-view.md) , [DROP VIEW](drop-view.md) , [DESCRIBE VIEW](desc-view.md)

    [VIEWS view](../info-schema/views.md) (Information Schema)

## Syntax

```sqlsyntax
SHOW [ TERSE ] VIEWS [ LIKE '<pattern>' ]
                     [ IN { ACCOUNT | DATABASE [ <db_name> ] | [ SCHEMA ] [ <schema_name> ] | APPLICATION <application_name> | APPLICATION PACKAGE <application_package_name> } ]
                     [ STARTS WITH '<name_string>' ]
                     [ LIMIT <rows> [ FROM '<name_string>' ] ]
```

## Parameters

`TERSE`
:   Optionally returns only a subset of the output columns:

    * `created_on`
    * `name`
    * `kind`
    * `database_name`
    * `schema_name`

    Default: No value (all columns are included in the output)

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`IN ACCOUNT | [ DATABASE ] db_name | [ SCHEMA ] schema_name | [ APPLICATION ] application_name | [ APPLICATION PACKAGE ] application_package_name`
:   Optionally specifies the scope of the command, which determines whether the command lists records only for the current/specified
    database or schema, or across your entire account:

    The `APPLICATION` and `APPLICATION PACKAGE` keywords are not required, but they specify the scope for the named Snowflake Native App.

    The `DATABASE` or `SCHEMA` keyword is not required; you can set the scope by specifying only the database or schema name.
    Likewise, the database or schema name is not required if the session currently has a database in use:

    * If `DATABASE` or `SCHEMA` is specified without a name and the session does not currently have a database in use, the
      parameter has no effect on the output.
    * If `SCHEMA` is specified with a name and the session does not currently have a database in use, the schema name must
      be fully qualified with the database name (e.g. `testdb.testschema`).

    Default: Depends on whether the session currently has a database in use:

    * Database: `DATABASE` is the default (i.e. the command returns the objects you have privileges to view in the database).
    * No database: `ACCOUNT` is the default (i.e. the command returns the objects you have privileges to view in your account).

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The command output provides view properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | The timestamp at which the view was created. |
| name | The name of the view. |
| reserved | (Reserved for future use.) |
| kind | The kind of view, either `VIEW` or `MATERIALIZED_VIEW`. |
| database_name | The name of the database in which the view exists. |
| schema_name | The name of the schema in which the view exists. |
| owner | The owner of the view. |
| comment | Optional comment. |
| text | The text of the command that created the view (e.g. CREATE VIEW …). |
| is_secure | True if the view is a secure view; false otherwise. |
| is_materialized | True if the view is a materialized view; false otherwise. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| change_tracking | Either `ON` or `OFF`. `ON` indicates enabled, and you query the change tracking data using streams or the CHANGES clause for SELECT statements. `OFF` indicates disabled, but you can optionally [enable](../../user-guide/streams-manage.md) change tracking as needed. |

## Usage notes

* By design, the command output includes secure views, but does not provide certain information about these views unless you are using
  the role that has ownership of the view. To view details for secure views, you must use the role that owns the view or use the
  [VIEWS](../info-schema/views.md) view in the Information Schema.

* The output of this command might include objects with names like `SN_TEMP_OBJECT_<n>` (where `<n>` is a number). These are
  temporary objects that are created by the [Snowpark](../../developer-guide/snowpark/index.md) library on behalf of the user.

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The value for `LIMIT rows` can’t exceed `10000`. If `LIMIT rows` is omitted, the command results in an error
  if the result set is larger than ten thousand rows.

  To view results for which more than ten thousand records exist, either include `LIMIT rows` or query the corresponding
  view in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show all views whose names start with `line` that you have privileges to see in the `mydb.public` schema:

> ```sqlexample
> SHOW VIEWS LIKE 'line%' IN mydb.public;
>
> +-------------------------------+---------+----------+---------------+-------------+----------+---------+-------------------------------------------------------+-----------+-----------------+-----------------+-----------------+
> | created_on                    | name    | reserved | database_name | schema_name | owner    | comment | text                                                  | is_secure | is_materialized | change_tracking | owner_role_type |
> +-------------------------------+---------+----------+---------------+-------------+----------+---------+-------------------------------------------------------+-----------+-----------------+-----------------+-----------------+
> | 2019-05-24 18:41:14.247 -0700 | liners1 |          | MYDB          | PUBLIC      | SYSADMIN |         | create materialized views liners1 as select * from t; | false     | false           | on              | ROLE            |
> +-------------------------------+---------+----------+---------------+-------------+----------+---------+-------------------------------------------------------+-----------+-----------------+-----------------+-----------------+
> ```

---
title: SHOW WAREHOUSES
source: https://docs.snowflake.com/en/sql-reference/sql/show-warehouses.md
section: SQL Commands
---

# SHOW WAREHOUSES

Lists all the [virtual warehouses](../../user-guide/warehouses-overview.md) in your account for which you have access privileges.

See also:
:   [ALTER WAREHOUSE](alter-warehouse.md) , [CREATE WAREHOUSE](create-warehouse.md) , [DESCRIBE WAREHOUSE](desc-warehouse.md) , [DROP WAREHOUSE](drop-warehouse.md)

## Syntax

```sqlsyntax
SHOW WAREHOUSES
  [ LIKE '<pattern>' ]
  [ WITH PRIVILEGES <objectPrivilege> [ , <objectPrivilege> [ , ... ] ] ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`WITH PRIVILEGES object_privilege [ , object_privilege [ , ... ] ]`
:   Optionally limits rows to objects for which the [active role](../../user-guide/security-access-control-overview.md) for the current
    user has been granted all of the specified privileges in the list on the object.

    If a CREATE <object> privilege is included in the privileges list, the command excludes objects for which secondary roles have
    been granted privileges. This is because only the primary role has the authorization to create objects. For more information, see
    [Authorization through primary role and secondary roles](../../user-guide/security-access-control-overview.md).

## Output

The columns in the output provide the following information. For accounts that have the [query acceleration service](../../user-guide/query-acceleration-service.md) feature enabled, the output provides additional information.

> | Column | Description |
> | --- | --- |
> | `name` | Name of the warehouse. |
> | `state` | Whether the warehouse is:  active/running (`STARTED`), inactive (`SUSPENDED`), or resizing (`RESIZING`). |
> | `type` | Warehouse type. STANDARD and SNOWPARK-OPTIMIZED are the only currently supported types. |
> | `size` | Size of the warehouse (X-Small, Small, Medium, Large, X-Large, etc.) |
> | `min_cluster_count` | Minimum number of clusters for the (multi-cluster) warehouse (always 1 for single-cluster warehouses). |
> | `max_cluster_count` | Maximum number of clusters for the (multi-cluster) warehouse (always 1 for single-cluster warehouses). |
> | `started_clusters` | Number of clusters currently started. |
> | `running` | Number of SQL statements that are being executed by the warehouse. |
> | `queued` | Number of SQL statements that are queued for the warehouse. |
> | `is_default` | Whether the warehouse is the default for the current user. |
> | `is_current` | Whether the warehouse is in use for the session.  Only one warehouse can be in use at a time for a session. To specify or change the warehouse for a session, use the [USE WAREHOUSE](use-warehouse.md) command. |
> | `is_interactive` | Whether the warehouse is an [interactive warehouse](../../user-guide/interactive.md) (`Y`) or not (`N`). Currently, the interactive warehouse feature is only available on Amazon Web Services (AWS). |
> | `auto_suspend` | Period of inactivity, in seconds, after which a running warehouse will automatically suspend and stop using credits.  A value of `null` indicates the warehouse never automatically suspends. |
> | `auto_resume` | Whether the warehouse, if suspended, automatically resumes when a query is submitted to the warehouse. |
> | `available` | Percentage of the warehouse compute resources that are provisioned and available. |
> | `provisioning` | Percentage of the warehouse compute resources that are in the process of provisioning. |
> | `quiescing` | Percentage of the warehouse compute resources that are executing SQL statements, but will be shut down once the queries complete. |
> | `other` | Percentage of the warehouse compute resources that are in a state other than `available`, `provisioning`, or `quiescing`. |
> | `created_on` | Date and time when the warehouse was created. |
> | `resumed_on` | Date and time when the warehouse was last started or restarted. |
> | `updated_on` | Date and time when the warehouse was last updated, which includes changing any of the properties of the warehouse or changing the state (`STARTED`, `SUSPENDED`, `RESIZING`) of the warehouse. |
> | `owner` | Role that owns the warehouse. |
> | `comment` | Comment for the warehouse. |
> | `enable_query_acceleration` | Whether the [query acceleration service](../../user-guide/query-acceleration-service.md) is enabled for the warehouse. |
> | `query_acceleration_max_scale_factor` | [Maximum scale factor](create-warehouse.md) for the query acceleration service. |
> | `resource_monitor` | ID of [resource monitor](../../user-guide/resource-monitors.md) explicitly assigned to the warehouse; controls the monthly credit usage for the warehouse. |
> | `actives` , `pendings` , `failed` , `suspended` , `uuid` | These five columns are for internal use and will be removed in a future release. |
> | `scaling_policy` | Policy that determines when additional clusters (in a multi-cluster warehouse) are automatically started and shut down. |
> | `owner_role_type` | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
> | `resource_constraint` | If type is `SNOWPARK-OPTIMIZED`, one of:   * `MEMORY_1X`, `MEMORY_1X_x86`, `MEMORY_16X`, `MEMORY_16X_x86`, `MEMORY_64X`, `MEMORY_64X_x86`.   Otherwise `NULL`. |
> | `generation` | The [generation](../../user-guide/warehouses-gen2.md) type of the warehouse. |

For more information about the properties that can be specified for a warehouse, see [CREATE WAREHOUSE](create-warehouse.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

## Examples

Show warehouses with names that start with `test` that you have privileges to view:

```sqlexample
SHOW WAREHOUSES LIKE 'test%';
```

```output
+---------------+-----------+--------------------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------+------------------+--------------------+
| name          | state     | type               | size    | min_cluster_count | max_cluster_count | started_clusters | running | queued | is_default | is_current | auto_suspend | auto_resume | available | provisioning | quiescing | other | created_on                    | resumed_on                    | updated_on                    | owner        | comment | enable_query_acceleration | query_acceleration_max_scale_factor | resource_monitor | actives | pendings | failed | suspended | uuid     | scaling_policy | owner_role_type | resource_constraint | generation |
|---------------+-----------+--------------------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------------------------|---------------------+
| TEST1         | SUSPENDED | STANDARD           | Medium  |                 1 |                 1 |                0 |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2023-01-27 14:57:07.768 -0800 | 2023-05-10 16:17:49.258 -0700 | 2023-05-10 16:17:49.258 -0700 | MY_ROLE      |         | true                      |                                   8 | null             |       0 |        0 |      0 |         4 | 76064    | STANDARD       | ROLE            | NULL  | 1                 +
| TEST2         | SUSPENDED | STANDARD           | X-Small |                 1 |                 1 |                0 |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2023-01-27 14:57:07.953 -0800 | 1969-12-31 16:00:00.000 -0800 | 2023-01-27 14:57:08.356 -0800 | MY_ROLE      |         | true                      |                                  16 | MYTEST_RM        |       0 |        0 |      0 |         1 | 76116    | STANDARD       |  ROLE           | NULL  | 2                 +
| TEST3         | SUSPENDED | STANDARD           | Small   |                 1 |                 1 |                0 |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2023-08-08 10:26:45.534 -0700 | 2023-08-08 10:26:45.681 -0700 | 2023-08-08 10:26:45.681 -0700 | MY_ROLE      |         | false                     |                                   8 | null             |       0 |        0 |      0 |         2 | 19464517 | STANDARD       | ROLE            | NULL   | NULL             +
| TEST4         | RESUMING  | SNOWPARK-OPTIMIZED | Large   |                 1 |                 1 |                0 |       0 |      0 | N          | Y          |          600 | true        |           |              |           |       | 2023-09-21 17:29:58.165 -0700 | 2023-09-21 17:29:58.165 -0700 | 2023-09-21 17:29:58.207 -0700 | MY_ROLE      |         | false                     |                                   8 | null             |       0 |        0 |      0 |         0 | 19464585 | STANDARD       | ROLE            | MEMORY_16X_X86 | NULL             +
+---------------+-----------+--------------------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+---------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------+-----------------+---------------------+
```

Show warehouses that you have been granted the MODIFY and OPERATE privileges on:

```sqlexample
SHOW WAREHOUSES WITH PRIVILEGES MODIFY, OPERATE;
```

```output
+------------------------------+-----------+----------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+-------------------------------------------------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------+-----------------+---------------------+
| name                         | state     | type     | size    | min_cluster_count | max_cluster_count | started_clusters | running | queued | is_default | is_current | auto_suspend | auto_resume | available | provisioning | quiescing | other | created_on                    | resumed_on                    | updated_on                    | owner        | comment                                         | enable_query_acceleration | query_acceleration_max_scale_factor | resource_monitor | actives | pendings | failed | suspended | uuid     | scaling_policy | owner_role_type |
|------------------------------+-----------+----------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+-------------------------------------------------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------+-----------------+---------------------+
| TEST_WH                      | SUSPENDED | STANDARD | X-Small |                 1 |                 1 |                0 |       0 |      0 | Y          | Y          |          600 | true        |           |              |           |       | 2023-01-27 14:57:07.768 -0800 | 2024-07-30 13:39:24.118 -0700 | 2024-07-30 13:39:24.118 -0700 | TEST_ROLE    |                                                 | true                      |                                  32 | TEST_RM          |       0 |        0 |      0 |         1 | 76056    | STANDARD       | ROLE            | NULL                +
| SNOWPARK_DEMO                | SUSPENDED | STANDARD | X-Large |                 1 |                 1 |                0 |       0 |      0 | N          | N          |          600 | true        |           |              |           |       | 2023-01-27 14:57:07.903 -0800 | 2023-04-10 11:47:03.146 -0700 | 2023-04-10 11:47:03.146 -0700 | ACCOUNTADMIN | Created by straut for Snowpark quickstart       | false                     |                                   8 | null             |       0 |        0 |      0 |        16 | 76104    | STANDARD       | ROLE            | NULL                +
| TASTY_DEV_WH                 | SUSPENDED | STANDARD | X-Small |                 1 |                 1 |                0 |       0 |      0 | N          | N          |           60 | true        |           |              |           |       | 2023-10-25 16:25:43.681 -0700 | 2023-10-25 16:25:43.681 -0700 | 2023-10-25 16:25:43.711 -0700 | SYSADMIN     | developer warehouse for tasty bytes             | false                     |                                   8 | null             |       0 |        0 |      0 |         1 | 19464633 | STANDARD       | ROLE            | NULL                +
| TB_DOCS_WH                   | SUSPENDED | STANDARD | X-Small |                 1 |                 1 |                0 |       0 |      0 | N          | N          |           60 | true        |           |              |           |       | 2024-07-24 15:02:32.172 -0700 | 2024-07-24 15:33:30.502 -0700 | 2024-07-24 15:33:30.502 -0700 | SYSADMIN     | developer warehouse for tasty bytes             | false                     |                                   8 | null             |       0 |        0 |      0 |         1 | 19465097 | STANDARD       | ROLE            | NULL                +
+------------------------------+-----------+----------+---------+-------------------+-------------------+------------------+---------+--------+------------+------------+--------------+-------------+-----------+--------------+-----------+-------+-------------------------------+-------------------------------+-------------------------------+--------------+-------------------------------------------------+---------------------------+-------------------------------------+------------------+---------+----------+--------+-----------+----------+----------------+-----------------+---------------------+
```

Show certain details about warehouses by filtering and reordering data from the full SHOW WAREHOUSES output.
This stored procedure runs a SHOW WAREHOUSES command, then calls the [RESULT_SCAN](../functions/result_scan.md) function to filter and transform
the result set from the most recent SQL command. You can use this technique to generate different types of reports
if you don’t need the entire output of a SHOW command.

```sqlexample
CREATE OR REPLACE PROCEDURE started_and_suspended_warehouses()
  RETURNS TABLE(name VARCHAR, status VARCHAR, type VARCHAR, size VARCHAR)
  LANGUAGE SQL
  AS
  $$
    DECLARE
      res RESULTSET;
    BEGIN
      SHOW WAREHOUSES;
      res := (SELECT "name" name, "state" state, "type" type, "size" size
        FROM TABLE(RESULT_SCAN(LAST_QUERY_ID(-1)))
        WHERE "state" IN ('STARTED','SUSPENDED')
        ORDER BY "state", "name");
      RETURN TABLE(res);
    END;
  $$
  ;

CALL started_and_suspended_warehouses();
```

```output
+------------------------------+-----------+--------------------+---------+
| NAME                         | STATUS    | TYPE               | SIZE    |
|------------------------------+-----------+--------------------+---------|
| COMPUTE_WH                   | STARTED   | STANDARD           | X-Small |
| DEFAULT_SIZE                 | SUSPENDED | STANDARD           | Small   |
| DEFAULT_SIZE_2               | SUSPENDED | STANDARD           | X-Small |
| MEDIUM                       | SUSPENDED | SNOWPARK-OPTIMIZED | Medium  |
| PRIV_WH                      | SUSPENDED | STANDARD           | X-Small |
| SYSTEM$STREAMLIT_NOTEBOOK_WH | SUSPENDED | STANDARD           | X-Small |
| XSMALL                       | SUSPENDED | STANDARD           | Medium  |
+------------------------------+-----------+--------------------+---------+
```

---
title: SHOW WORKSPACES
source: https://docs.snowflake.com/en/sql-reference/sql/show-workspaces.md
section: SQL Commands
---

# SHOW WORKSPACES

Lists the [workspaces](../../user-guide/ui-snowsight/workspaces.md) for which you have access privileges. Each workspace is associated with a specific database and schema, allowing for organized data access and collaboration.

You can use this command to list objects in the current database and schema for the session, a specified database or schema, or
your entire account.

The output includes the metadata and properties for each object. The objects are sorted lexicographically by database, schema,
and object name (see Output in this topic for descriptions of the output columns). The order of rows in the results is important
to note if you want to filter the results.

## Syntax

```sqlsyntax
SHOW WORKSPACES [ LIKE '<pattern>' ]
                [ IN
                     {
                       ACCOUNT                  |

                       DATABASE                 |
                       DATABASE <database_name> |

                       SCHEMA                   |
                       SCHEMA <schema_name>     |
                       <schema_name>
                     }
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

`ACCOUNT`
:   Returns records for the entire account.

`DATABASE`, . `DATABASE db_name`
:   Returns records for the current database in use or for a specified database (`db_name`).

    If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

    > **Note:**
    >
    > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
    >
    > Objects with the same name are only displayed once if no `IN` clause is used.

`SCHEMA`, . `SCHEMA schema_name`
:   Returns records for the current schema in use or a specified schema (`schema_name`).

    `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

    If no database is in use, specifying `SCHEMA` has no effect on the output.

If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

* If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
  same effect as specifying `IN DATABASE`.
* If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
  same effect as specifying `IN ACCOUNT`.

`STARTS WITH 'name_string'`
:   Optionally filters the command output based on the characters that appear at the beginning of
    the object name. The string must be enclosed in single quotes and is case sensitive.

    For example, the following strings return different results:

    `... STARTS WITH 'B' ...`

    `... STARTS WITH 'b' ...`

    . Default: No value (no filtering is applied to the output)

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Output

The output of the command includes the following columns, which describe the properties and metadata of the object:

| Column | Description |
| --- | --- |
| `name` | Name of the workspace object. |
| `database_name` | Database in which the workspace is stored. |
| `schema_name` | Schema in which the workspace is stored. |
| `created_on` | Date and time when the workspace was created. |
| `updated_on` | Date and time when the workspace was last updated. |
| `owner` | Role that owns the workspace object. |
| `comment` | Comment for the workspace object. |

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| Any privilege (for example, READ, WRITE, or OWNERSHIP) | Workspace | A workspace appears in the results if the active role has any privilege on it. OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the GRANT OWNERSHIP command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The command doesn’t require a running warehouse to execute.
* The command only returns objects for which the current user’s current role has been granted at least one access privilege.
* The MANAGE GRANTS access privilege implicitly allows its holder to see every object in the account. By default, only the account
  administrator (users with the ACCOUNTADMIN role) and security administrator (users with the SECURITYADMIN role) have the
  MANAGE GRANTS privilege.

* To post-process the output of this command, you can use the [pipe operator](../operators-flow.md)
  (`->>`) or the [RESULT_SCAN](../functions/result_scan.md) function. Both constructs treat the output as a
  result set that you can query.

  For example, you can use the pipe operator or RESULT_SCAN function to select specific columns from the SHOW
  command output or filter the rows.

  When you refer to the output columns, use [double-quoted identifiers](../identifiers-syntax.md) for
  the column names. For example, to select the output column `type`, specify `SELECT "type"`.

  You must use double-quoted identifiers because the output column names for SHOW commands are in lowercase.
  The double quotes ensure that the column names in the SELECT list or WHERE clause match the column names
  in the SHOW command output that was scanned.

* The command returns a maximum of ten thousand records for the specified object type, as dictated by the access privileges for the role
  used to execute the command. Any records above the ten thousand records limit aren’t returned, even with a filter applied.

  To view results for which more than ten thousand records exist, query the corresponding view (if one exists) in the [Snowflake Information Schema](../info-schema.md).

* Executing the command for schema-level objects only returns an object if the current role also has at least one privilege on the
  parent database and schema.

## Examples

The following example lists the workspaces that you have the privileges to view in the current schema:

```sqlexample
SHOW WORKSPACES;
```

The following example lists workspaces with names that start with `test`:

```sqlexample
SHOW WORKSPACES STARTS WITH 'test';
```

---
title: TRUNCATE MATERIALIZED VIEW
source: https://docs.snowflake.com/en/sql-reference/sql/truncate-materialized-view.md
section: SQL Commands
---

# TRUNCATE MATERIALIZED VIEW

Removes all rows from a materialized view, but leaves the view intact (including all privileges and constraints on the materialized view).

Note that this is different from [DROP MATERIALIZED VIEW](drop-materialized-view.md), which removes the materialized view from the system.

See also:
:   [ALTER MATERIALIZED VIEW](alter-materialized-view.md) , [CREATE MATERIALIZED VIEW](create-materialized-view.md)

## Syntax

```sqlsyntax
TRUNCATE MATERIALIZED VIEW <name>
```

## Parameters

`name`
:   Specifies the identifier for the materialized view to truncate. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive (e.g. `"My Object"`).

    If the materialized view identifier is not fully-qualified (in the form of `db_name.schema_name.materialized_view_name`
    or `schema_name.materialized_view_name`), then the command looks for the materialized view in the current schema for the
    session.

## Usage notes

* Snowflake no longer supports truncation of materialized views.
* If you truncate a materialized view, the background maintenance service automatically updates the materialized view. If
  any queries are executed on the view while it is in the process of being updated, Snowflake ensures consistent results
  by retrieving any rows, as needed, from the base table.

  However, the maintenance service uses computing resources to update the materialized view and it is usually more efficient
  (i.e. less costly) to let an out-of-date materialized view “catch up” naturally over time than to truncate the view. As such,
  we do not generally recommend truncating a materialized view.
* Although each query on the view will still show up-to-date results, the query might run more slowly as Snowflake
  updates the materialized view or looks up data in the base table.

## Examples

This feature has been obsoleted.

---
title: TRUNCATE TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/truncate-table.md
section: SQL Commands
---

# TRUNCATE TABLE

Removes all rows from a table but leaves the table intact (including all privileges and constraints on the table). Also deletes the load
metadata for the table, which allows the same files to be loaded into the table again after the command completes.

Note that this is different from [DROP TABLE](drop-table.md), which removes the table from the system but retains a version of the table
(along with its load history) so that they can be recovered.

See also:
:   [CREATE TABLE](create-table.md)

## Syntax

```sqlsyntax
TRUNCATE [ TABLE ] [ IF EXISTS ] <name>

TRUNCATE [ TABLE ] [ IF EXISTS ] ERROR_TABLE( <base_table_name> )
```

## Parameters

`name`
:   Specifies the identifier for the table to truncate. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive (for example, `"My Object"`).

    If the table identifier is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the table in the current schema for the session.

`ERROR_TABLE( base_table_name )`
:   Truncates the error table associated with the specified base table. For more information about error tables, see
    [DML error logging](../../user-guide/data-load-overview.md).

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes. Identifiers
    enclosed in double quotes are also case-sensitive (for example, `"My Object"`).

    If the table identifier is not fully-qualified (in the form of `db_name.schema_name.table_name` or
    `schema_name.table_name`), the command looks for the table in the current schema for the session.

## Usage notes

* Both [DELETE](delete.md) and TRUNCATE TABLE maintain deleted data for recovery purposes (i.e. using Time Travel) for the data retention period.
  However, when a table is truncated, the load metadata cannot be recovered.
* The `TABLE` keyword is optional if the table name is fully qualified or a database and schema are currently in use for the session.

## Examples

The following example truncates a table:

1. Create a basic table and insert data:

   ```sqlexample
   CREATE OR REPLACE TABLE temp_test_truncate (i number);

   INSERT INTO temp_test_truncate SELECT seq8() FROM table(generator(rowcount=>20)) v;

   SELECT COUNT (*) FROM temp_test_truncate;
   ```

   ```output
   +-----------+
   | COUNT (*) |
   |-----------|
   |        20 |
   +-----------+
   ```
2. Truncate the table:

   ```sqlexample
   TRUNCATE TABLE IF EXISTS temp_test_truncate;
   ```
3. Verify that the table is now empty:

   ```sqlexample
   SELECT COUNT (*) FROM temp_test_truncate;
   ```

   ```output
   +-----------+
   | COUNT (*) |
   |-----------|
   |         0 |
   +-----------+
   ```

---
title: UNDROP <object>
source: https://docs.snowflake.com/en/sql-reference/sql/undrop.md
section: SQL Commands
---

# UNDROP *<object>*

Restores the specified object to the system. This command is part of the [Time Travel](../../user-guide/data-time-travel.md) feature.

See also:
:   [CREATE <object>](create.md) , [DROP <object>](drop.md) , [SHOW <objects>](show.md)

## UNDROP commands

For specific syntax, usage notes, and examples, see:

**Organization Objects:**

* [UNDROP ACCOUNT](undrop-account.md)

**Account Objects:**

> * [UNDROP DATABASE](undrop-database.md)

**Database Objects:**

> * [UNDROP DYNAMIC TABLE](undrop-dynamic-table.md)
> * [UNDROP EXTERNAL VOLUME](undrop-external-volume.md)
> * [UNDROP ICEBERG TABLE](undrop-iceberg-table.md)
> * [UNDROP NOTEBOOK](undrop-notebook.md)
> * [UNDROP SCHEMA](undrop-schema.md)
> * [UNDROP SNAPSHOT](undrop-snapshot.md)
> * [UNDROP STREAMLIT](undrop-streamlit.md)
> * [UNDROP TABLE](undrop-table.md)
> * [UNDROP TAG](undrop-tag.md)
> * [UNDROP TYPE](undrop-type.md)

---
title: UNDROP ACCOUNT
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-account.md
section: SQL Commands
---

# UNDROP ACCOUNT

Restores a [dropped account](../../user-guide/organizations-manage-accounts-delete.md) that has not yet been permanently deleted
(a dropped account that is within its grace period).

To obtain a list of dropped accounts that can be restored, refer to [Viewing dropped accounts](../../user-guide/organizations-manage-accounts-delete.md).

See also:
:   [CREATE ACCOUNT](create-account.md), [DROP ACCOUNT](drop-account.md), [SHOW ACCOUNTS](show-accounts.md)

## Syntax

```sqlsyntax
UNDROP ACCOUNT <name>
```

## Parameters

`name`
:   Specifies the name of the account being restored. As an example, if the full account identifier is `myorg-account123`, then
    specify `account123` as the name.

    The legacy account locator cannot be used to identify the account.

## Usage notes

* Only [organization administrators](../../user-guide/organization-administrators.md) can execute the command.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Example

To restore the dropped account `myaccount123`, which was still within the grace period, enter:

```sqlexample
UNDROP ACCOUNT myaccount123;
```

---
title: UNDROP DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-database.md
section: SQL Commands
---

# UNDROP DATABASE

Restores the most recent version of a dropped database.

See also:
:   [CREATE DATABASE](create-database.md) , [ALTER DATABASE](alter-database.md) , [DESCRIBE DATABASE](desc-database.md) , [DROP DATABASE](drop-database.md) , [SHOW DATABASES](show-databases.md)

## Syntax

```sqlsyntax
UNDROP DATABASE <name>
```

## Parameters

`name`
:   Specifies the identifier for the database to restore. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* If a database with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

* Hybrid tables that belong to the specified database are not undropped.
* If you have multiple dropped databases with the same name, you can use the [IDENTIFIER keyword](../identifier-literal.md)
  with the system-generated identifier (from the [DATABASES view](../account-usage/databases.md)) to specify which database to
  restore. The name of the restored database remains the same. See Examples.

  > **Note:**
  >
  > You can only use the system-generated identifier with the IDENTIFIER() keyword when executing the UNDROP command for notebooks, tables, block storage snapshots, schemas, and databases.

## Examples

### Basic example

Restore the most recent version of a dropped database (this example builds on the [DROP DATABASE](drop-database.md) examples):

```sqlexample
UNDROP DATABASE mytestdb2;
```

```output
+-------------------------------------------+
| status                                    |
|-------------------------------------------|
| Database MYTESTDB2 successfully restored. |
+-------------------------------------------+
```

```sqlexample
SHOW DATABASES HISTORY;
```

```output
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+------------+
| created_on                      | name      | is_default | is_current | origin | owner  | comment | options | retention_time | dropped_on |
|---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+------------|
| Tue, 17 Mar 2015 16:57:04 -0700 | MYTESTDB  | N          | Y          |        | PUBLIC |         |         |              1 | [NULL]     |
| Tue, 17 Mar 2015 17:06:32 -0700 | MYTESTDB2 | N          | N          |        | PUBLIC |         |         |              1 | [NULL]     |
| Wed, 25 Feb 2015 17:30:04 -0800 | SALES1    | N          | N          |        | PUBLIC |         |         |              1 | [NULL]     |
| Fri, 13 Feb 2015 19:21:49 -0800 | DEMO1     | N          | N          |        | PUBLIC |         |         |              1 | [NULL]     |
+---------------------------------+-----------+------------+------------+--------+--------+---------+---------+----------------+------------+
```

### UNDROP database using the database ID

Restore a dropped database by ID using IDENTIFIER(). You can find the database ID of the specific database to restore using the
`database_id` column in the [DATABASES view](../account-usage/databases.md). For example, if you have multiple dropped
databases named `my_database`, and you want to restore the second-to-last dropped database `my_database`, follow
these steps:

1. Find the database ID of the dropped database in the Account Usage DATABASES view:

   ```sqlexample
   SELECT database_id,
     database_name,
     created,
     deleted,
     comment
   FROM SNOWFLAKE.ACCOUNT_USAGE.DATABASES
   WHERE database_name = 'MY_DATABASE'
   AND deleted IS NOT NULL
   ORDER BY deleted;
   ```

   ```output
   +-------------+---------------+-------------------------------+-------------------------------+---------+
   | DATABASE_ID | DATABASE_NAME | CREATED                       | DELETED                       | COMMENT |
   |-------------+---------------+-------------------------------+-------------------------------+---------|
   |         494 | MY_DATABASE   | 2024-07-01 17:51:33.380 -0700 | 2024-07-01 17:51:46.228 -0700 | NULL    |
   |         492 | MY_DATABASE   | 2024-07-01 17:51:52.560 -0700 | 2024-07-01 17:52:39.881 -0700 | NULL    |
   |         493 | MY_DATABASE   | 2024-07-01 17:52:39.849 -0700 | 2024-07-01 17:52:44.562 -0700 | NULL    |
   +-------------+---------------+-------------------------------+-------------------------------+---------+
   ```
2. Undrop `my_database` by database ID. To restore the second-to-last deleted database, use database ID `492` from the output of
   the previous statement. After you execute the following statement, the database is restored with its original name, `my_database`:

   ```sqlexample
   UNDROP DATABASE IDENTIFIER(492);
   ```

---
title: UNDROP DYNAMIC TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-dynamic-table.md
section: SQL Commands
---

# UNDROP DYNAMIC TABLE

Restores the most recent version of a dropped [dynamic table](../../user-guide/dynamic-tables-about.md).

See also:
:   [CREATE DYNAMIC TABLE](create-dynamic-table.md), [ALTER DYNAMIC TABLE](alter-dynamic-table.md), [DESCRIBE DYNAMIC TABLE](desc-dynamic-table.md),
    [SHOW DYNAMIC TABLES](show-dynamic-tables.md), [DROP DYNAMIC TABLE](drop-dynamic-table.md)

## Syntax

```sqlsyntax
UNDROP DYNAMIC TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the dynamic table to restore.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | The dynamic table that you want to undrop. |  |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* To undrop a dynamic table, you must be using a role that has OWNERSHIP privilege
  on that dynamic table.
* If a table with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Examples

Restore the most recent version of a dropped dynamic table:

```sqlexample
UNDROP DYNAMIC TABLE my_dynamic_table;
```

---
title: UNDROP EXTERNAL VOLUME
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-external-volume.md
section: SQL Commands
---

# UNDROP EXTERNAL VOLUME

Restores the most recent version of a dropped [external volume](../../user-guide/tables-iceberg.md).

See also:
:   [CREATE EXTERNAL VOLUME](create-external-volume.md) , [ALTER EXTERNAL VOLUME](alter-external-volume.md), [DESCRIBE EXTERNAL VOLUME](desc-external-volume.md) , [SHOW EXTERNAL VOLUMES](show-external-volumes.md) ,
    [DROP EXTERNAL VOLUME](drop-external-volume.md)

## Syntax

```sqlsyntax
UNDROP EXTERNAL VOLUME <name>
```

## Parameters

`name`
:   Specifies the identifier for the external volume to restore.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Usage notes

* If an external volume with the same name already exists, the UNDROP command returns an error.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Examples

Restore the most recent version of a dropped external volume named `my_external_volume`:

```sqlexample
UNDROP EXTERNAL VOLUME my_external_volume;
```

---
title: UNDROP ICEBERG TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-iceberg-table.md
section: SQL Commands
---

# UNDROP ICEBERG TABLE

Restores the most recent version of a dropped [Apache Iceberg™ table](../../user-guide/tables-iceberg.md).

This topic refers to Iceberg tables as simply “tables” except where specifying *Iceberg tables* avoids confusion.

See also:
:   [CREATE ICEBERG TABLE](create-iceberg-table.md) , [ALTER ICEBERG TABLE](alter-iceberg-table.md) , [DROP ICEBERG TABLE](drop-iceberg-table.md) ,
    [SHOW ICEBERG TABLES](show-iceberg-tables.md) , [DESCRIBE ICEBERG TABLE](desc-iceberg-table.md)

## Syntax

```sqlsyntax
UNDROP ICEBERG TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the table to restore. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* This command isn’t supported for tables in a catalog-linked database.
* Restoring Iceberg tables is only supported in the current schema or current database, even if the table name is fully qualified.
* If an Iceberg table with the same name already exists, an error is returned.
* To undrop an Iceberg table whose external volume has been dropped, undrop the external volume first. You can’t undrop the Iceberg table
  by creating a new external volume with same name as the dropped external volume.
* You can’t restore a table that uses an external catalog if the associated catalog integration has been dropped.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Examples

Restore the most recent version of a dropped table `my_iceberg_table`:

```sqlexample
UNDROP ICEBERG TABLE my_iceberg_table;
```

---
title: UNDROP NOTEBOOK
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-notebook.md
section: SQL Commands
---

# UNDROP NOTEBOOK

Restores the most recent version of a dropped notebook.

See also:
:   [CREATE NOTEBOOK](create-notebook.md) , [ALTER NOTEBOOK](alter-notebook.md) , [DROP NOTEBOOK](drop-notebook.md) , [SHOW NOTEBOOKS](show-notebooks.md) , [DESCRIBE NOTEBOOK](desc-notebook.md)

## Syntax

```sqlsyntax
UNDROP NOTEBOOK <name>
```

## Parameters

`name`
:   Specifies the identifier for the notebook to restore. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Notebooks can only be restored to the database and schema that contained the notebook at the time of deletion.
* If a notebook with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Example

The following example restores the most recent version of a dropped notebook named `mynotebook` (this example builds on the examples
provided for [DROP NOTEBOOK](drop-notebook.md)):

```sqlexample
UNDROP NOTEBOOK mynotebook;
```

```output
+--------------------------------------------+
| status                                     |
|--------------------------------------------|
| Notebook mynotebook successfully restored. |
+--------------------------------------------+
```

---
title: UNDROP SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-schema.md
section: SQL Commands
---

# UNDROP SCHEMA

Restore the most recent version of a dropped schema.

See also:
:   [CREATE SCHEMA](create-schema.md) , [ALTER SCHEMA](alter-schema.md) , [DESCRIBE SCHEMA](desc-schema.md) , [DROP SCHEMA](drop-schema.md) , [SHOW SCHEMAS](show-schemas.md)

## Syntax

```sqlsyntax
UNDROP SCHEMA <name>
```

## Parameters

`name`
:   Specifies the identifier for the schema to restore. If the identifier contains spaces or special characters, the entire string must be
    enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* A schema can only be restored to the database that contained the schema at the time of its deletion. For example, if you
  create and drop schema `s1` in database `db1`, then change the current database to `db2` and attempt to restore
  schema `s1` by ID (or fully-qualified name, `db1.s1`), schema `s1` is restored in database `db1` rather than in the
  current database, `db2`.
* If a schema with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

* Hybrid tables that belong to the specified schema are not undropped.
* If you have multiple dropped schemas with the same name, you can use the [IDENTIFIER keyword](../identifier-literal.md)
  with the system-generated identifier (from the [SCHEMATA view](../account-usage/schemata.md)) to specify which schema to restore.
  The name of the restored schema remains the same. See Examples.

  > **Note:**
  >
  > You can only use the system-generated identifier with the IDENTIFIER() keyword when executing the UNDROP command for notebooks, tables, block storage snapshots, schemas, and databases.

## Examples

### Basic example

Restore the most recent version of a dropped schema (this example builds on the examples provided for [DROP SCHEMA](drop-schema.md)):

```sqlexample
UNDROP SCHEMA myschema;
```

```output
+----------------------------------------+
| status                                 |
|----------------------------------------|
| Schema MYSCHEMA successfully restored. |
+----------------------------------------+
```

```sqlexample
SHOW SCHEMAS HISTORY;
```

```output
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+------------+
| created_on                      | name               | is_default | is_current | database_name | owner  | comment                                                   | options | retention_time | dropped_on |
|---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+------------|
| Fri, 13 May 2016 17:26:07 -0700 | INFORMATION_SCHEMA | N          | N          | MYTESTDB      |        | Views describing the contents of schemas in this database |         |              1 | [NULL]     |
| Tue, 17 Mar 2015 17:18:42 -0700 | MYSCHEMA           | N          | N          | MYTESTDB      | PUBLIC |                                                           |         |              1 | [NULL]     |
| Tue, 17 Mar 2015 16:57:04 -0700 | PUBLIC             | N          | Y          | MYTESTDB      | PUBLIC |                                                           |         |              1 | [NULL]     |
+---------------------------------+--------------------+------------+------------+---------------+--------+-----------------------------------------------------------+---------+----------------+------------+
```

### UNDROP schema using the schema ID

Restore a dropped schema by ID using IDENTIFIER(). You can find the schema ID of the specific schema to undrop using the `schema_id`
column in the [SCHEMATA view](../account-usage/schemata.md). For example, if you have multiple dropped schemas named `s1`, and you want
to restore the second-to-last dropped schema `s1`, follow these steps:

1. Find the schema ID of the dropped schema in the Account Usage SCHEMATA view:

   ```sqlexample
   SELECT schema_id,
     schema_name,
     catalog_name,
     created,
     deleted,
     comment
   FROM SNOWFLAKE.ACCOUNT_USAGE.SCHEMATA
   WHERE schema_name = 'S1'
   AND catalog_name = 'DB1'
   AND deleted IS NOT NULL
   ORDER BY deleted;
   ```

   ```output
   +-----------+-------------+---------------+-------------------------------+-------------------------------+---------+
   | SCHEMA_ID | SCHEMA_NAME | CATALOG_NAME  | CREATED                       | DELETED                       | COMMENT |
   |-----------+-------------+---------------+-------------------------------+-------------------------------+---------|
   |       797 | S1          | DB1           | 2024-07-01 17:53:01.955 -0700 | 2024-07-01 17:53:11.889 -0700 | NULL    |
   |       798 | S1          | DB1           | 2024-07-01 17:53:11.889 -0700 | 2024-07-01 17:53:16.327 -0700 | NULL    |
   |       799 | S1          | DB1           | 2024-07-01 17:53:16.327 -0700 | 2024-07-01 17:53:25.066 -0700 | NULL    |
   +-----------+-------------+---------------+-------------------------------+-------------------------------+---------+
   ```
2. Undrop schema `s1` using schema ID. To restore the second-to-last deleted schema, use schema ID `798` from the output of the previous
   statement. After you execute the following statement, the schema is restored with its original name, `s1`:

   ```sqlexample
   UNDROP SCHEMA IDENTIFIER(798);
   ```

---
title: UNDROP SNAPSHOT
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-snapshot.md
section: SQL Commands
---

# UNDROP SNAPSHOT

> **Note:**
>
> This operation is not currently covered by the Service Level set forth in
> [Snowflake’s Support Policy and Service Level Agreement](https://www.snowflake.com/legal/support-policy-and-service-level-agreement/).

Restores a previously removed [snapshot of a block storage volume](../../developer-guide/snowpark-container-services/block-storage-volume.md). After Snowflake restores the snapshot, the data is available for use.

See also:
:   [Managing snapshots](../../developer-guide/snowpark-container-services/block-storage-volume.md), [DROP SNAPSHOT](drop-snapshot.md), [CREATE SNAPSHOT](create-snapshot.md)

## Syntax

```sqlsyntax
UNDROP SNAPSHOT { <name> | IDENTIFIER( <id> ) }
 [ RENAME TO <new_snapshot_name> ];
```

## Parameters

`name`
:   Specifies the name of the snapshot to restore. If you specify a snapshot name, the command restores the most recently dropped snapshot with that name.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`IDENTIFIER( id )`
:   Specifies the system-generated identifier for the snapshot to restore.

    If you have multiple dropped snapshots with the same name, you can query the [BLOCK_STORAGE_SNAPSHOTS view](../account-usage/block_storage_snapshots.md) to get the system-generated identifier of the dropped snapshot that you want to restore. Then, use the [IDENTIFIER keyword](../identifier-literal.md) to specify that you want to restore this snapshot. The restored snapshot keeps its original name.

    For an example of restoring a snapshot by system-generated identifier, see Examples.

    > **Note:**
    >
    > You can only use the system-generated identifier with the IDENTIFIER() keyword when executing the UNDROP command for notebooks, tables, block storage snapshots, schemas, and databases.

`RENAME TO new_snapshot_name`
:   Specifies the name for the snapshot after it is restored. This lets you restore the snapshot to a different name.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Snapshot | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Snapshots can only be restored to the database and schema where the snapshot was located at the time of deletion. For example, if you
  create and drop a snapshot in schema `s1`, then change the current schema in your session to `s2` and attempt to undrop the
  snapshot, the snapshot will be restored in schema `s1`, not in the current schema `s2`.
* If a snapshot with the same name already exists, UNDROP SNAPSHOT returns an error.
  In this case, you have the option to specify a different name by using `RENAME TO` parameter.
* UNDROP SNAPSHOT relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be
  restored only if the object was deleted within the [data retention period](../../user-guide/data-time-travel.md).
  The default retention period is 24 hours. After the data retention period has passed, you can’t restore the snapshot.

## Examples

### Restore snapshot using name

The following example restores a previously dropped snapshot named `example_snapshot`:

```sqlexample
UNDROP SNAPSHOT example_snapshot;
```

```output
+--------------------------------------------------+
| status                                           |
|--------------------------------------------------|
| Snapshot EXAMPLE_SNAPSHOT successfully restored. |
+--------------------------------------------------+
```

### Restore snapshot using ID

Restore a dropped snapshot by ID using [IDENTIFIER()](../identifier-literal.md). You can find the snapshot ID of the specific snapshot to restore by using the snapshot_id column in the [BLOCK_STORAGE_SNAPSHOTS view](../account-usage/block_storage_snapshots.md) view. For example, if you have multiple dropped snapshots named `MY_SNAPSHOT`, and you want to restore the second-to-last dropped snapshot `MY_SNAPSHOT`, follow these steps:

1. In the Account Usage BLOCK_STORAGE_SNAPSHOTS view, find the snapshot ID of the dropped snapshot:

   ```sqlexample
   SELECT snapshot_id,
       snapshot_name,
       database_name,
       schema_name,
       created_on,
       deleted_on
     FROM SNOWFLAKE.ACCOUNT_USAGE.BLOCK_STORAGE_SNAPSHOTS
     WHERE database_name = 'TUTORIAL_DB'
       AND schema_name = 'DATA_SCHEMA'
       AND snapshot_name = 'MY_SNAPSHOT'
       AND deleted_on IS NOT NULL
     ORDER BY deleted_on;
   ```

   Example output:

   ```output
   +-------------+---------------+---------------+-------------+-------------------------------+-------------------------------+
   | SNAPSHOT_ID | SNAPSHOT_NAME | DATABASE_NAME | SCHEMA_NAME | CREATED_ON                    | DELETED_ON                    |
   |-------------+---------------+---------------+-------------+-------------------------------+-------------------------------|
   |           1 | MY_SNAPSHOT   | TUTORIAL_DB   | DATA_SCHEMA | 2025-09-06 09:51:47.131 -0700 | 2025-09-15 14:21:49.683 -0700 |
   +-------------+---------------+---------------+-------------+-------------------------------+-------------------------------+
   ```
2. Undrop `MY_SNAPSHOT` by snapshot ID; to restore the second-to-last deleted snapshot, use snapshot ID 1 from the output of the previous statement.

   After you execute the following statement, the snapshot is restored with its original name, `MY_SNAPSHOT`:

   ```sqlexample
   UNDROP SNAPSHOT IDENTIFIER(1);
   ```

---
title: UNDROP STREAMLIT
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-streamlit.md
section: SQL Commands
---

# UNDROP STREAMLIT

Restores the most recent version of a dropped Streamlit object.

See also:
:   [CREATE STREAMLIT](create-streamlit.md) , [ALTER STREAMLIT](alter-streamlit.md) , [DROP STREAMLIT](drop-streamlit.md) , [SHOW STREAMLITS](show-streamlits.md) , [DESCRIBE STREAMLIT](desc-streamlit.md)

## Syntax

```sqlsyntax
UNDROP STREAMLIT <name>
```

## Parameters

`name`
:   Specifies the identifier for the Streamlit object to restore.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

If your role does not own the objects in the following table, then your role
must have the listed
[privileges](../../user-guide/security-access-control-overview.md) on those objects:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Streamlit object that you restore |  |
| CREATE STREAMLIT | Schema where you restore the Streamlit object |  |
| USAGE | Warehouse used by the Streamlit app |  |
| USAGE | Compute pool used by the Streamlit app | This privilege is only required if your app has a COMPUTE_POOL. |
| USAGE | External access integrations used by the Streamlit app | This privilege is only required if your app has EXTERNAL_ACCESS_INTEGRATIONS. |
| USAGE | Secrets used by the Streamlit app | This privilege is only required if your app has SECRETS. |
| CREATE STAGE | Schema where you restore the Streamlit object | This privilege is only required to undrop Streamlit objects that were created with the legacy ROOT_LOCATION parameter. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Streamlit object can only be restored to the database and schema that contained the Streamlit object at the time of deletion.
* If a Streamlit with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Example

The following example restores the most recent version of a dropped Streamlit named `hello_streamlit`:

```sqlexample
UNDROP STREAMLIT hello_streamlit;
```

---
title: UNDROP TABLE
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-table.md
section: SQL Commands
---

# UNDROP TABLE

Restores the most recent version of a dropped table.

See also:
:   [CREATE TABLE](create-table.md) , [ALTER TABLE](alter-table.md) , [DROP TABLE](drop-table.md) , [SHOW TABLES](show-tables.md) , [DESCRIBE TABLE](desc-table.md)

## Syntax

```sqlsyntax
UNDROP TABLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the table to restore. If the identifier contains spaces or special characters, the entire string must
    be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* Tables can only be restored to the database and schema that contained the table at the time of deletion. For example, if you
  create and drop table `t1` in schema `s1`, then change the current schema to `s2` and attempt to restore table `t1`
  by ID (or qualified name, `s1.t1`), table `t1` is restored in schema `s1` rather than in the current schema, `s2`.
* If a table with the same name already exists, an error is returned.
* If you have multiple dropped tables with the same name, you can use the [IDENTIFIER keyword](../identifier-literal.md)
  with the system-generated identifier (from the [TABLES view](../account-usage/tables.md)) to specify which table to restore.
  The name of the restored table remains the same. See Examples.

  > **Note:**
  >
  > You can only use the system-generated identifier with the IDENTIFIER() keyword when executing the UNDROP command for notebooks, tables, block storage snapshots, schemas, and databases.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

* You cannot undrop a hybrid table.

## Examples

### Basic example

Restore the most recent version of a dropped table (this example builds on the examples provided for [DROP TABLE](drop-table.md)):

```sqlexample
UNDROP TABLE t2;
```

```output
+---------------------------------+
| status                          |
|---------------------------------|
| Table T2 successfully restored. |
+---------------------------------+
```

### UNDROP table using the table ID

Restore a dropped table by ID using IDENTIFIER(). You can find the table ID of the specific table to undrop using the `table_id`
column in the [TABLES view](../account-usage/tables.md). For example, if you have multiple dropped tables named `my_table`, and
you want to restore the second-to-last dropped table `my_table`, follow these steps:

1. Find the table ID of the dropped table in the Account Usage TABLES view:

   ```sqlexample
   SELECT table_id,
     table_name,
     table_schema,
     table_catalog,
     created,
     deleted,
     comment
   FROM SNOWFLAKE.ACCOUNT_USAGE.TABLES
   WHERE table_catalog = 'DB1'
   AND table_schema = 'S1'
   AND table_name = 'MY_TABLE'
   AND deleted IS NOT NULL
   ORDER BY deleted;
   ```

   ```output
   +----------+------------+--------------+---------------+-------------------------------+-------------------------------+---------+
   | TABLE_ID | TABLE_NAME | TABLE_SCHEMA | TABLE_CATALOG | CREATED                       | DELETED                       | COMMENT |
   |----------+------------+--------------+---------------+-------------------------------+-------------------------------+---------|
   |   408578 | MY_TABLE   | S1           | DB1           | 2024-07-01 15:39:07.565 -0700 | 2024-07-01 15:40:28.161 -0700 | NULL    |
   +----------+------------+--------------+---------------+-------------------------------+-------------------------------+---------+
   |   408607 | MY_TABLE   | S1           | DB1           | 2024-07-01 17:43:07.565 -0700 | 2024-07-01 17:44:28.161 -0700 | NULL    |
   +----------+------------+--------------+---------------+-------------------------------+-------------------------------+---------+
   ```
2. Undrop `my_table` by table ID. To restore the second-to-last deleted table, use table ID `408578` from the output of the previous
   statement. After you execute the following statement, the table is restored with its original name, `my_table`:

   ```sqlexample
   UNDROP TABLE IDENTIFIER(408578);
   ```

---
title: UNDROP TAG
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-tag.md
section: SQL Commands
---

# UNDROP TAG

Restores the most recent version of a tag to the system.

For details about this command and tag references, see [Tag quotas](../../user-guide/object-tagging/introduction.md).

See also:
:   [CREATE TAG](create-tag.md) , [ALTER TAG](alter-tag.md) , [DROP TAG](drop-tag.md) , [SHOW TAGS](show-tags.md)

## Syntax

```sqlsyntax
UNDROP TAG <name>
```

## Parameters

`name`
:   Identifier for the tag.

    The identifier value must start with an alphabetic character and cannot contain spaces or special characters unless the entire
    identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Tag | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

For additional details on tag DDL and privileges, see [Access control privileges](../../user-guide/object-tagging/work.md).

## Usage notes

* Restoring tags is only supported in the current schema or current database, even if the table name is fully-qualified.
* If the tag was assigned to one or more objects when the [DROP TAG](drop-tag.md) command was executed, the UNDROP command
  restores the tag assignments to the objects. For details, see [Tag quotas](../../user-guide/object-tagging/introduction.md).
* If a tag with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Example

The following example restores the most recent version of the tag named `cost_center`:

> ```sqlexample
> UNDROP TAG cost_center;
> ```

---
title: UNDROP TYPE
source: https://docs.snowflake.com/en/sql-reference/sql/undrop-type.md
section: SQL Commands
---

# UNDROP TYPE

Restores the most recent version of a [user-defined type](../data-types-user-defined.md).

See also:
:   [CREATE TYPE](create-type.md) , [ALTER TYPE](alter-type.md) , [DESCRIBE TYPE](desc-type.md) , [SHOW TYPES](show-types.md) , [DROP TYPE](drop-type.md)

## Syntax

```sqlsyntax
UNDROP TYPE <name>
```

## Parameters

`name`
:   Specifies the identifier for the user-defined type to restore.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | User-defined type | OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* Restoring user-defined types is only supported in the current schema or current database, even if the type name is fully-qualified.
* If a user-defined type with the same name already exists, an error is returned.

* UNDROP relies on the Snowflake [Time Travel](../../user-guide/data-time-travel.md) feature. An object can be restored only if
  the object was deleted within the [Data retention period](../../user-guide/data-time-travel.md). The default value is 24 hours.

## Example

Use the UNDROP TYPE command to restore the most recent version of the `age` user-defined type:

```sqlexample
UNDROP TYPE age;
```

---
title: UNSET
source: https://docs.snowflake.com/en/sql-reference/sql/unset.md
section: SQL Commands
---

# UNSET

Drops a [session variable](../session-variables.md).

See also:
:   [SHOW VARIABLES](show-variables.md) , [SET](set.md)

## Syntax

```sqlsyntax
UNSET <var>

UNSET ( <var> [ , <var> ... ] )
```

## Parameters

`var`
:   Specifies the identifier for the variable to drop.

## Usage notes

* The command supports dropping multiple variables in the same statement.
* The command does not require a running warehouse to execute.

## Examples

```sqlexample
UNSET V1;

UNSET V2;

UNSET (V1, V2);
```

---
title: UPDATE
source: https://docs.snowflake.com/en/sql-reference/sql/update.md
section: SQL Commands
---

# UPDATE

Updates specified rows in the target table with new values.

## Syntax

```sqlsyntax
UPDATE <target_table>
       SET <col_name> = <value> [ , <col_name> = <value> , ... ]
        [ FROM <additional_tables> ]
        [ WHERE <condition> ]
```

## Required parameters

`target_table`
:   Specifies the table to update.

`col_name`
:   Specifies the name of a column in `target_table`. Do not include the table name. For example, `UPDATE t1 SET t1.col = 1`
    is invalid.

`value`
:   Specifies the new value to set in `col_name`.

## Optional parameters

`FROM additional_tables`
:   Specifies one or more tables to use for selecting rows to update or for setting new values. Note that repeating the target table results
    in a self-join.

`WHERE condition`
:   Expression that specifies the rows in the target table to update.

    Default: No value (all rows of the target table are updated)

## Usage notes

* When a [FROM](../constructs/from.md) clause contains a [JOIN](../constructs/join.md) between
  tables (e.g. `t1` and `t2`), a target row in `t1` may join against (i.e. match) more than one row in table `t2`. When
  this occurs, the target row is called a *multi-joined row*. When updating a multi-joined row, the
  [ERROR_ON_NONDETERMINISTIC_UPDATE](../parameters.md) session parameter controls the outcome of the update:

  + If `FALSE` (default value), no error is returned and one of the joined rows is used to update the target row; however, the
    selected joined row is nondeterministic.
  + IF `TRUE`, an error is returned, including an example of the values of a target row that joins multiple rows.

  To set the parameter:

  > ```sqlexample
  > ALTER SESSION SET ERROR_ON_NONDETERMINISTIC_UPDATE=TRUE;
  > ```

## Examples

Perform a standard update using two tables:

> ```sqlexample
> UPDATE t1
>   SET number_column = t1.number_column + t2.number_column, t1.text_column = 'ASDF'
>   FROM t2
>   WHERE t1.key_column = t2.t1_key and t1.number_column < 10;
> ```

Update with join that produces nondeterministic results:

> ```sqlexample
> select * from target;
>
> +---+----+
> | K |  V |
> |---+----|
> | 0 | 10 |
> +---+----+
>
> Select * from src;
>
> +---+----+
> | K |  V |
> |---+----|
> | 0 | 11 |
> | 0 | 12 |
> | 0 | 13 |
> +---+----+
>
> -- Following statement joins all three rows in src against the single row in target
> UPDATE target
>   SET v = src.v
>   FROM src
>   WHERE target.k = src.k;
>
> +------------------------+-------------------------------------+
> | number of rows updated | number of multi-joined rows updated |
> |------------------------+-------------------------------------|
> |                      1 |                                   1 |
> +------------------------+-------------------------------------+
> ```
>
> * With [ERROR_ON_NONDETERMINISTIC_UPDATE](../parameters.md) = FALSE, the statement randomly updates the single row in `target` using
>   values from one of the following rows in `src`:
>
>   > `(0, 11)` , `(0, 12)` , `(0,13)`
> * With [ERROR_ON_NONDETERMINISTIC_UPDATE](../parameters.md) = TRUE, an error is returned reporting a duplicate DML row `[0, 10]`.

To avoid this nondeterministic behavior and error, use a 1-to-1 join:

> ```sqlexample
> UPDATE target SET v = b.v
>   FROM (SELECT k, MIN(v) v FROM src GROUP BY k) b
>   WHERE target.k = b.k;
> ```
>
> This statement results in the single row in `target` updated to `(0, 11)` (values from the row with the minimum value for
> `v` in `src`) and will never result in an error.

---
title: USE <object>
source: https://docs.snowflake.com/en/sql-reference/sql/use.md
section: SQL Commands
---

# USE *<object>*

Specifies the role, warehouse, database, or schema to use for the current session.

## USE commands

For specific syntax, usage notes, and examples, see:

* [USE ROLE](use-role.md)
* [USE SECONDARY ROLES](use-secondary-roles.md)
* [USE WAREHOUSE](use-warehouse.md)
* [USE DATABASE](use-database.md)
* [USE SCHEMA](use-schema.md)

## Viewing the current session context

To view the current role, secondary roles, database, schema, and warehouse for the session, use the corresponding context functions.
For example:

```sqlexample
SELECT CURRENT_ROLE(),
       CURRENT_SECONDARY_ROLES(),
       CURRENT_WAREHOUSE(),
       CURRENT_DATABASE(),
       CURRENT_SCHEMA();
```

```output
+----------------+--------------------------+---------------------+--------------------+------------------+
| CURRENT_ROLE() | CURRENT_SECONDARY_ROLES  | CURRENT_WAREHOUSE() | CURRENT_DATABASE() | CURRENT_SCHEMA() |
|----------------+--------------------------+---------------------+--------------------+------------------|
| SYSADMIN       | ALL                      | MYWH                | MYTESTDB           | PUBLIC           |
+----------------+--------------------------+---------------------+--------------------+------------------+
```

For more details, see [Context functions](../functions-context.md).

---
title: USE DATABASE
source: https://docs.snowflake.com/en/sql-reference/sql/use-database.md
section: SQL Commands
---

# USE DATABASE

Specifies the active/current database for the session:

* If a database is not specified for a session, any objects referenced in queries and other SQL statements executed in
  the session must be fully qualified with the database and schema, also known as the *namespace*, for the object
  (in the form of `db_name.schema_name.object_name`). For more information about fully-qualified object names,
  see [Object name resolution](../name-resolution.md).
* If a database is specified for a session but the schema is not specified for a session, any objects referenced in queries
  and other SQL statements executed in the session must be qualified with the schema for the object (in the form of
  `schema_name.object_name`).
* If the database and schema are specified for a user session, unqualified object names are allowed in SQL statements and
  queries.

See also:
:   [CREATE DATABASE](create-database.md) , [ALTER DATABASE](alter-database.md) , [DROP DATABASE](drop-database.md) , [SHOW DATABASES](show-databases.md)

## Syntax

```sqlsyntax
USE [ DATABASE ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the database to use for the session. If the identifier contains spaces or special characters, the
    entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* The DATABASE keyword does not need to be specified.
* USE DATABASE automatically specifies PUBLIC as the current schema, unless the PUBLIC schema doesn’t exist (e.g. it has been dropped).
  To specify a different schema for a session, use the [USE SCHEMA](use-schema.md) command.

## Examples

The following example specifies the database to use for subsequent SQL commands:

```sqlexample
USE DATABASE mydb;
```

The following example shows how commands that refer to objects using unqualified names
produce different output after a USE command to switch databases. The schemas, tables,
table data, and so on can differ from one database to another.

When the [SHOW SCHEMAS](show-schemas.md) command is run in the context of `database_one`,
it produces output reflecting the objects in that database:

```sqlexample
USE DATABASE database_one;
SHOW SCHEMAS ->> SELECT "created_on", "name" FROM $1 ORDER BY "created_on";

+-------------------------------+--------------------+
| 2025-07-11 14:34:24.386 -0700 | PUBLIC             |
| 2025-07-11 14:42:23.509 -0700 | TEST_SCHEMA        |
| 2025-07-11 14:42:29.158 -0700 | STAGING_SCHEMA     |
| 2025-07-11 14:45:43.124 -0700 | INFORMATION_SCHEMA |
+-------------------------------+--------------------+
```

After a USE command switches to the `database_two` database, the SHOW SCHEMAS
command produces output reflecting a different set of objects:

```sqlexample
USE DATABASE database_two;
SHOW SCHEMAS ->> SELECT "created_on", "name" FROM $1 ORDER BY "created_on";
```

```output
+-------------------------------+--------------------+
| 2025-07-11 14:34:31.496 -0700 | PUBLIC             |
| 2025-07-11 14:43:04.394 -0700 | PRODUCTION_SCHEMA  |
| 2025-07-11 14:44:23.006 -0700 | DASHBOARDS_SCHEMA  |
| 2025-07-11 14:45:54.372 -0700 | INFORMATION_SCHEMA |
+-------------------------------+--------------------+
```

The following example changes from one database to another, then back to
the original database. The name of the original database is stored in a
variable. Run the following commands:

```sqlexample
SELECT CURRENT_DATABASE();
SET original_database = (SELECT CURRENT_DATABASE());
USE DATABASE database_two;
SELECT CURRENT_DATABASE();
USE DATABASE IDENTIFIER($original_database);
SELECT CURRENT_DATABASE();
```

The output for these commands shows how the current database value changes:

```output
>SELECT CURRENT_DATABASE();
+--------------+
| DATABASE_ONE |
+--------------+

>SET original_database = (SELECT CURRENT_DATABASE());

>USE DATABASE database_two;
>SELECT CURRENT_DATABASE();
+--------------+
| DATABASE_TWO |
+--------------+

>USE DATABASE IDENTIFIER($original_database);
>SELECT CURRENT_DATABASE();
+--------------+
| DATABASE_ONE |
+--------------+
```

---
title: USE ROLE
source: https://docs.snowflake.com/en/sql-reference/sql/use-role.md
section: SQL Commands
---

# USE ROLE

Specifies the active/current primary role for the session. The currently active primary role sets the context that determines whether the
current user has the necessary privileges to execute [CREATE <object>](create.md) statements or perform any other SQL action.

Authorization to perform any SQL action other than creating objects can be provided by secondary roles.

For more information, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [USE SECONDARY ROLES](use-secondary-roles.md) , [CREATE ROLE](create-role.md) , [ALTER ROLE](alter-role.md) , [DROP ROLE](drop-role.md) , [SHOW ROLES](show-roles.md)

## Syntax

```sqlsyntax
USE ROLE <name>
```

## Parameters

`name`
:   Specifies the identifier for the role to use for the session. If the identifier contains spaces or special characters, the entire string
    must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Usage notes

* To use a role, the role must have been granted to the user.
* Only a single primary role can be active at a time in a user session.

  [Secondary roles](../../user-guide/security-access-control-overview.md) enable you to perform SQL actions using
  the combined privileges of the other roles granted to you.

## Examples

```sqlexample
USE ROLE myrole;
```

---
title: USE SCHEMA
source: https://docs.snowflake.com/en/sql-reference/sql/use-schema.md
section: SQL Commands
---

# USE SCHEMA

Specifies the active/current schema for the session:

* If a database is not specified for a session, any objects referenced in queries and other SQL statements executed in
  the session must be fully qualified with the database and schema, also known as the *namespace*, for the object
  (in the form of `db_name.schema_name.object_name`). For more information about fully-qualified object names,
  see [Object name resolution](../name-resolution.md).
* If a database is specified for a session but the schema is not specified for a session, any objects referenced in queries
  and other SQL statements executed in the session must be qualified with the schema for the object (in the form of
  `schema_name.object_name`).
* If the database and schema are specified for a user session, unqualified object names are allowed in SQL statements and
  queries.

See also:
:   [CREATE SCHEMA](create-schema.md) , [ALTER SCHEMA](alter-schema.md) , [DROP SCHEMA](drop-schema.md) , [SHOW SCHEMAS](show-schemas.md)

## Syntax

```sqlsyntax
USE [ SCHEMA ] [<db_name>.]<name>
```

## Parameters

`[db_name.]name`
:   Specifies the identifier for the schema to use for the session. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

    The SCHEMA keyword is optional if the schema name is fully qualified (in the form of `db_name.schema_name`).

    The database name (`db_name`) is optional if the database is specified in the user session and the SCHEMA keyword
    is included.

## Examples

Use the `myschema` schema with the database specified in the user session:

```sqlexample
USE SCHEMA myschema;
```

Use the `myschema` schema in the `mydb` database:

```sqlexample
USE mydb.myschema;
```

The following example shows how commands that refer to objects using unqualified names
produce different output after a USE command to switch schemas. The tables, table data,
views, user-defined functions, and so on can differ from one schema to another.

When the [SHOW TABLES](show-tables.md) command is run in the context of `schema_one`,
it produces output reflecting the objects in that schema:

```sqlexample
USE SCHEMA schema_one;
SHOW TABLES ->> SELECT "created_on", "name" FROM $1 ORDER BY "created_on";
```

```output
+-------------------------------+-----------+
| created_on                    | name      |
|-------------------------------+-----------|
| 2025-07-13 23:48:49.129 -0700 | TABLE_ABC |
| 2025-07-13 23:49:50.329 -0700 | TABLE_DEF |
+-------------------------------+-----------+
```

After a USE command switches to the `schema_two` schema, the SHOW TABLES command
produces output reflecting a different set of objects:

```sqlexample
USE SCHEMA schema_two;
SHOW TABLES ->> SELECT "created_on", "name" FROM $1 ORDER BY "created_on";
```

```output
+-------------------------------+-----------+
| created_on                    | name      |
|-------------------------------+-----------|
| 2025-07-13 23:52:06.144 -0700 | TABLE_IJK |
| 2025-07-13 23:53:29.851 -0700 | TABLE_XYZ |
+-------------------------------+-----------+
```

The following example changes from one schema to another, then back to
the original schema. The name of the original schema is stored in a
variable. Run the following commands:

```sqlexample
SELECT CURRENT_SCHEMA();
SET original_schema = (SELECT CURRENT_SCHEMA());
USE SCHEMA schema_two;
SELECT CURRENT_SCHEMA();
USE SCHEMA IDENTIFIER($original_schema);
SELECT CURRENT_SCHEMA();
```

The output for these commands shows how the current schema value changes:

```output
>SELECT CURRENT_SCHEMA();
+------------+
| SCHEMA_ONE |
+------------+

>SET original_schema = (SELECT CURRENT_SCHEMA());

>USE SCHEMA schema_two;
>SELECT CURRENT_SCHEMA();
+------------+
| SCHEMA_TWO |
+------------+

>USE SCHEMA IDENTIFIER($original_schema);
>SELECT CURRENT_SCHEMA();
+------------+
| SCHEMA_ONE |
+------------+
```

---
title: USE SECONDARY ROLES
source: https://docs.snowflake.com/en/sql-reference/sql/use-secondary-roles.md
section: SQL Commands
---

# USE SECONDARY ROLES

Specifies the active/current secondary roles for the session. The currently-active secondary roles set the context that determines whether
the current user has the necessary privileges to perform SQL actions.

Note that authorization to execute [CREATE <object>](create.md) statements to create objects is provided by the primary role.

For more information, see [secondary role enforcement](../../user-guide/security-access-control-overview.md).

See also:
:   [USE ROLE](use-role.md)

## Syntax

```sqlsyntax
USE SECONDARY ROLES {
      ALL
    | NONE
    | <role_name> [ , <role_name> ... ]
  }
```

## Parameters

`ALL`
:   All roles that have been granted to the user in addition to the current active primary role.

    Note that the set of roles is reevaluated when each SQL statement executes. If additional roles are granted to the user, and that user
    executes a new SQL statement, the newly granted roles are active secondary roles for the new SQL statement. The same logic applies to
    roles that are revoked from a user.

`NONE`
:   Disables secondary roles. The authorization for all SQL actions is provided via the primary role.

`role_name [ , role_name ... ]`
:   Activates the specified roles as secondary roles. The secondary roles can be user-defined account roles or system roles. Specify the role
    name as it is stored in Snowflake.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

## Usage notes

* When specifying individual role names:

  + Each named role must have been granted to the current user. The command immediately
    validates each specified role; if any role has not been granted, the command fails with
    an error.
  + The command records the desired set of secondary roles for the session. The roles
    activated for each subsequent SQL statement might be a subset of the desired set,
    for example, if a session policy restricts certain secondary roles.
* When `ALL` is specified, the command doesn’t validate role grants up front. Instead, the
  active secondary roles are determined dynamically when each SQL statement executes. This
  means newly granted roles are activated automatically, and revoked roles are no longer
  active, without needing to reissue the command.
* If a session policy restricts which secondary roles can be activated, the command still
  succeeds but might return an informational message indicating that the activated secondary
  roles will be limited by the policy.

## Examples

```sqlexample
USE SECONDARY ROLES ALL;
```

```sqlexample
USE SECONDARY ROLES test_role_1, test_role_2;
```

---
title: USE WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/sql/use-warehouse.md
section: SQL Commands
---

# USE WAREHOUSE

Specifies the active/current [virtual warehouse](../../user-guide/warehouses-overview.md) for the session.
You must specify a warehouse for a session, and the warehouse must be running
before you can execute queries and DML statements in the session.

To view the current warehouse for a session, call the [CURRENT_WAREHOUSE](../functions/current_warehouse.md) context function.

See also:
:   [ALTER WAREHOUSE](alter-warehouse.md) , [CREATE WAREHOUSE](create-warehouse.md) , [SHOW WAREHOUSES](show-warehouses.md)

## Syntax

```sqlsyntax
USE WAREHOUSE <name>
```

## Parameters

`name`
:   Specifies the identifier for the warehouse to use for the session. If the identifier contains spaces or special characters, the entire
    string must be enclosed in double quotes. Identifiers enclosed in double quotes are also case-sensitive.

## Examples

The following example specifies the warehouse where the current session
performs its work:

```sqlexample
USE WAREHOUSE mywarehouse;
```

The following example changes from one warehouse to another, then back to
the original warehouse. The name of the original warehouse is stored in a
variable. Run the following commands:

```sqlexample
SELECT CURRENT_WAREHOUSE();
SET original_warehouse = (SELECT CURRENT_WAREHOUSE());
USE WAREHOUSE warehouse_two;
SELECT CURRENT_WAREHOUSE();
USE WAREHOUSE IDENTIFIER($original_warehouse);
SELECT CURRENT_WAREHOUSE();
```

The output for these commands shows how the current warehouse value changes:

```output
>SELECT CURRENT_WAREHOUSE();
+---------------------+
| WAREHOUSE_ONE       |
+---------------------+

>SET original_warehouse = (SELECT CURRENT_WAREHOUSE());

>USE WAREHOUSE warehouse_two;
>SELECT CURRENT_WAREHOUSE();
+---------------------+
| WAREHOUSE_TWO       |
+---------------------+

>USE WAREHOUSE IDENTIFIER($original_warehouse);
>SELECT CURRENT_WAREHOUSE();
+---------------------+
| WAREHOUSE_ONE       |
+---------------------+
```

## Account Usage

SNOWFLAKE.ACCOUNT_USAGE schema views for querying account-level activity and history.

---
title: ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/access_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ACCESS_HISTORY view

This Account Usage view can be used to query the access history of Snowflake objects (e.g. table, view, column) within the last 365 days
(1 year).

## Columns

This section consists of tables that do the following:

* Provide a sample value for each column.
* Provide a description of each column in the view.
* Provide a description for each field in the JSON array for the `base_objects_accessed`, `direct_objects_accessed`, and
  `objects_modified` columns.
* Provide a description for each field in the object for the `object_modified_by_ddl` column.

### Sample column values

The following table provides a sample value for each column in the view.

| Column name | Example |
| --- | --- |
| `query_id` | `a0fda135-d678-4184-942b-c3411ae8d1ce` |
| `query_start_time` | `2022-01-25 16:17:47.388 +0000` |
| `user_name` | `JSMITH` |
| `direct_objects_accessed` | ```sqljson [   {     "objectDomain": "FUNCTION",     "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",     "objectId": "2",     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",     "dataType": "NUMBER(38,0)"   },   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "GOVERNANCE.TABLES.T1"   } ] ``` |
| `base_objects_accessed` | ```sqljson [   {     "objectDomain": "FUNCTION",     "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",     "objectId": "2",     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",     "dataType": "NUMBER(38,0)"   },   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "GOVERNANCE.TABLES.T1"   } ] ``` |
| `objects_modified` | ```sqljson [   {     "objectDomain": "STRING",     "objectId":  NUMBER,     "objectName": "STRING",     "columns": [       {         "columnId": "NUMBER",         "columnName": "STRING",         "baseSources": [           {             "columnName": STRING,             "objectDomain": "STRING",             "objectId": NUMBER,             "objectName": "STRING"           }         ],         "directSources": [           {             "columnName": STRING,             "objectDomain": "STRING",             "objectId": NUMBER,             "objectName": "STRING"           }         ]       }     ]   },   ... ] ``` |
| `object_modified_by_ddl` | ```sqljson {   "objectDomain": STRING,   "objectName": STRING,   "objectId": NUMBER,   "operationType": STRING,   "properties": ARRAY } ``` |
| `policies_referenced` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "SSN",         "policies": [           {               "policyName": "governance.policies.ssn_mask",               "policyId": 68811,               "policyKind": "MASKING_POLICY"           }         ]       }     ],     "objectDomain": "VIEW",     "objectId": 66564,     "objectName": "GOVERNANCE.VIEWS.V1",     "policies": [       {         "policyName": "governance.policies.rap1",         "policyId": 68813,         "policyKind": "ROW_ACCESS_POLICY"       }     ]   } ] ``` |

### Column descriptions

The following table provides a description of each column in the view.

If a column contains `-1` in a number field or `TRUNCATED` in a string field, information in the column might have been truncated. For
more information, see Usage notes: Truncation.

| Column Name | Data Type | Description |
| --- | --- | --- |
| `query_id` | VARCHAR | An internal, system-generated identifier for the SQL statement. This value is also mentioned in the [QUERY_HISTORY view](query_history.md). |
| `query_start_time` | TIMESTAMP_LTZ | The statement start time (UTC time zone). |
| `user_name` | VARCHAR | The user who issued the query. |
| `direct_objects_accessed` | ARRAY | A JSON array of data objects such as user-defined functions (i.e. UDFs and UDTFs), stored procedures, tables, views, and columns directly named in the query explicitly or through shortcuts such as using an asterisk (i.e. `*`).  Virtual columns can be returned in this field.  For additional notes about UDFs, see Usage notes. |
| `base_objects_accessed` | ARRAY | A JSON array of all base data objects to execute a query, including columns, external functions, UDFs, and stored procedures.  In this example, the fields in the first array specify a UDF. These same fields in the first array also specify a stored procedure, when applicable.  Note the following:   * This field specifies view names or view columns, including virtual columns, if a shared view is accessed in a data sharing consumer   account. * For additional notes about UDFs, see Usage notes. |
| `objects_modified` | ARRAY | A JSON array that specifies the objects that were associated with a write operation in the query.  The UDF and stored procedure array is the same as what is shown earlier and appears in the arrays for `baseSources` and `directSources` depending on how the access took place. For brevity, this example omits the UDF and stored procedure array.  For additional notes about UDFs, see Usage notes. |
| `object_modified_by_ddl` | OBJECT | Specifies the DDL operation on a database, schema, table, view, and column. These operations also include statements that specify a row access policy on a table or view, a masking policy on a column, and tag updates (e.g. set a tag, change a tag value) on the object or column. |
| `policies_referenced` | ARRAY | Specifies information about the enforced masking policy set on the column and the enforced row access policy set on the table, including policies set on intermediate objects or columns. |
| `parent_query_id` | VARCHAR | The query ID of the parent job or NULL if the job does not have a parent. |
| `root_query_id` | VARCHAR | The query ID of the top most job in the chain or NULL if the job does not have a parent. |
| `event_source` | VARCHAR | Indicates the source of the event that resulted in an access history record. Possible values include the following:   * `snowflake_sql` — Events generated by SQL statements that were executed within Snowflake. * `horizon_irc` — Events generated by calls made to the [Horizon Iceberg REST Catalog API](../../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md). |
| `additional_properties` | VARIANT | Provides operational metadata for the source of the event. |

### JSON field descriptions

The following table defines the fields in the JSON array for the `base_objects_accessed`, `direct_objects_accessed`, and
`objects_modified` columns.

| Field | Data Type | Description |
| --- | --- | --- |
| accountName [1] | VARCHAR | The account locator of the consumer account that queried the provider’s data object. If the query wasn’t executed by a consumer, this field is omitted. |
| columnId | NUMBER | A column ID that is unique within the account. This value is identical to the columnID in the [COLUMNS view](columns.md). |
| columnName | VARCHAR | The name of the accessed column. For policies, specifies the column on which the masking policy is set. |
| objectId | NUMBER | An identifier for the object, which is unique within a given account and domain. This number will match:   * The `TABLE_ID` number for a table, view, or materialized view. You can obtain this value from [TABLES view](tables.md), [VIEWS view](views.md), or [MATERIALIZED_VIEW_REFRESH_HISTORY view](materialized_view_refresh_history.md). * If a stage was accessed, this number will match the:    + `NAME` identifier for a user stage (see [USERS view](users.md))   + `TABLE_ID` number for a table stage (see [TABLES view](tables.md))   + `STAGE_ID` number for a name stage (see [STAGES view](stages.md)) |
| objectName | VARCHAR | The fully qualified name of the object that was accessed.  If a masking policy is set on a column or a row access policy is set on a table or view, the value refers to the fully qualified name of the table or view on which the row access policy is set or the table or view that has a masking policy set on one of its columns.  If a stage was accessed, this value will be the:   * `username` (User stage). * `table_name` (Table stage). * `stage_name` (Named stage). |
| objectDomain | VARCHAR | The type of object. For a list of supported objects, see [Supported Objects](../../user-guide/access-history.md).  Note that `FUNCTION` specifies UDFs, UDTFs, and external functions.  For data access policies, specifies the domain of the object on which the policy is set. |
| location | VARCHAR | The URL of the external location when the data access is an external location (e.g. `s3://mybucket/a.csv`). . If the query does not access a stage, this field is omitted. |
| stageKind | VARCHAR | When writing to a stage, one of the following: `Table | User | Internal Named | External Named` If the query does not access a stage, this field is omitted. |
| baseSources | VARCHAR | The columns that serve as the source columns for the columns specified by `directSources`. These columns facilitate column lineage. |
| directSources | VARCHAR | The columns specifically mentioned in the data write portion of the SQL statement that serves as the source columns in the target table to which data is written. These columns facilitate column lineage. |
| policyName | VARCHAR | The fully-qualified name of the policy. |
| policyId | NUMBER | An identifier for the policy, which is unique within a given account and domain. This value matches the identifier for a masking policy in the [MASKING_POLICIES view](masking_policies.md) or the identifier for a row access policy in the [ROW_ACCESS_POLICIES view](row_access_policies.md). |
| policyKind | VARCHAR | Either: MASKING_POLICY or ROW_ACCESS_POLICY |
| argumentSignature | VARCHAR | The name and data type for each argument in the UDF or stored procedure. |
| dataType |  | The data type of the return value for a UDF or stored procedure.  This value helps to differentiate two or more UDFs that have the same name but different return types. |
| joinObjects | VARCHAR | If a query contains a join, returns an array containing the joined objects and type of join. |
| joinObject | VARCHAR | The table or view that was joined with the accessed object. |
| type | VARCHAR | The type of join, as described in [JOIN](../constructs/join.md), [ASOF JOIN](../constructs/asof-join.md), and [LATERAL](../constructs/join-lateral.md). |

[1]

This field is found in the ACCESS_HISTORY view of the ORGANIZATION_USAGE schema, but not the ACCESS_HISTORY view of the ACCOUNT_USAGE schema.

### Object field descriptions for `object_modified_by_ddl`

The following table describes the fields of objects in the `object_modified_by_ddl` column.

| Field | Data type | Description |
| --- | --- | --- |
| objectDomain | VARCHAR | Type of the object defined or modified by the DDL operation. For more information about supported object types, see [Supported Objects](../../user-guide/access-history.md). |
| objectId | NUMBER | The identifier for the object, which is unique within a given account and domain, defined or modified by the DDL operation. |
| objectName | VARCHAR | The fully qualified name of the object defined or modified by the DDL operation. |
| operationType | VARCHAR | The SQL keyword that specifies the operation on the table, view, or column. For ALTER, CREATE, and DROP, this can also apply to listings and shares. For GRANT and REVOKE, this can also apply to shares. The following values are supported: ALTER | CREATE | DESCRIBE | DROP | REPLACE | UNDROP | REFRESH | SHOW | SUSPEND | RESUME | GRANT | REVOKE |
| properties | ARRAY | A JSON array that specifies the object or column properties when you create, modify, drop, or undrop the object or column. There are two types of properties: atomic and compound. |

For the `properties` JSON array:

* Atomic: one value per property (e.g. a `comment` has a single string value, the `enabled` property is a boolean and has one value).
* Compound: the property is multi-valued (e.g. `allowed_values` for a tag, masking policy).

Compound properties are recorded in a JSON array. For example, if a table contains a single column named EMAIL, the column is recorded as
follows:

```json
"columns": {
  "email": {
    "objectId": {
      "value": 1
    },
    "subOperationType": "ADD"
  }
}
```

In the previous example,

* `objectId` specifies the identifier for the column or object, except for allowed tag values, which don’t have an identifier.
* `subOperationType` can be one of the following values:

  + `ADD` specifies adding a compound property (for example, adding a column, setting allowed values).
  + `DROP` specifies removing a compound property.
  + `ALTER` specifies modifying a compound property.

#### CREATE or ALTER LISTING properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when `operationType` is CREATE or ALTER for a *listing*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample CREATE EXTERNAL LISTING my_listing SHARE my_share   AS $$my_manifest$$ ``` | ```json "manifest": {   "value": "my_manifest" }, "share": {   "value": "my_share" } ``` |
| ```sqlexample ALTER LISTING my_listing   AS $$my_manifest$$ ``` | ```json "manifest": {   "value": "my_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   ADD TARGETS $$my_targets_manifest$$; ``` | ```json "addTargets": {   "value": "my_targets_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   REMOVE TARGETS $$my_targets_manifest$$; ``` | ```json "removeTargets": {   "value": "my_targets_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   ADD VERSION V3   FROM @listing_db.listing_schema.stage1; ``` | ```json "manifestStageLocation": {   "value": "@listing_db.listing_schema.stage1" }, "versionAlias": {   "value": "V3" } ``` |

#### CREATE or ALTER SHARE properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when the `operationType` is CREATE or ALTER for a *share*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample CREATE SHARE my_share   SECURE_OBJECTS_ONLY=FALSE; ``` | ```json "secureObjectsOnly": {   "value": false } ``` |
| ```sqlexample ALTER SHARE my_share   SET ACCOUNTS = acc1, acc2; ``` | ```json "accountsToSet": {   "value": [ "acc1", "acc2" ] } ``` |
| ```sqlexample ALTER SHARE my_share   ADD ACCOUNTS = acc1, acc2   SHARE_RESTRICTIONS = false; ``` | ```json "accountsToAdd": {  "value": [ "acc1", "acc2" ] }, "shareRestrictions": {   "value": false } ``` |
| ```sqlexample ALTER SHARE my_share   REMOVE ACCOUNTS = acc1, acc2; ``` | ```json "accountsToRemove": {   "value": [ "acc1", "acc2" ] } ``` |

#### GRANT TO SHARE or REVOKE FROM SHARE properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when the `operationType` is GRANT TO or REVOKE FROM for a *share*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample GRANT USAGE ON DATABASE my_db   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "PRIVILEGES": [       "USAGE"     ],     "SECURABLE_OBJECT_DOMAIN": "Database",     "SECURABLE_OBJECT_ID": 1234,     "SECURABLE_OBJECT_NAME": "MY_DB"   } } ``` |
| ```sqlexample GRANT SELECT ON ALL TABLES IN SCHEMA my_db.my_sch   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "PRIVILEGES": [       "SELECT"     ],     "SECURABLE_OBJECT_DOMAIN": "Table",     "SECURABLE_OBJECT_SCOPE": "MY_DB.MY_SCH",     "SECURABLE_OBJECT_SCOPE_DOMAIN": "Schema"   } } ``` |
| ```sqlexample GRANT DATABASE ROLE my_db.my_role   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "ROLES": [       "MY_DB.MY_ROLE"     ]   } } ``` |
| ```sqlexample REVOKE SELECT ON VIEW my_db.my_sch.my_view   FROM SHARE my_share; ``` | ```json "revoke": {   "value": {     "PRIVILEGES": [       "SELECT"     ],     "SECURABLE_OBJECT_DOMAIN": "View",     "SECURABLE_OBJECT_ID": 6789,     "SECURABLE_OBJECT_NAME": "MY_DB.MY_SCH.MY_VIEW"   } } ``` |

## Usage notes

Latency and historical data:
:   * The view displays data starting from February 22, 2021.
    * Latency for the view may be up to 180 minutes (3 hours).

Ancestor queries:
:   The `parent_query_id` and `root_query_id` columns begin to record data starting on January 15-16, 2024, depending on when
    your Snowflake account was updated based on the `2023_08` behavior change bundle transitioning to enabled by default. This date is
    necessary to distinguish between the following records in the view:

    * Queries that ran before the bundle was enabled by default.
    * Queries that ran after the feature was enabled by default but do not have a value in the `parent_query_id`.

General notes:
:   * For increased performance, filter queries on the `query_start_time` column and choose narrower time ranges. For sample queries,
      see [Querying the ACCESS_HISTORY View](../../user-guide/access-history.md).
    * Secure Views. The log record contains the underlying base table (i.e. `base_objects_accessed`) to generate the view. Examples
      include queries on other Account Usage and Organization Usage views and queries on base tables for extract, transform, and load
      (i.e. ETL) operations.
    * Records in the QUERY_HISTORY view do not always get recorded in the
      ACCESS_HISTORY view. The structure of the SQL statement determines whether Snowflake records an entry in the ACCESS_HISTORY view.
    * Specifying the `USING` clause while querying this view might cause non-referenced columns to be recorded in
      `direct_objects_accessed` field. As a workaround, replace the `USING` clause with a `JOIN ... ON ...` clause.
      For details, refer to:

      + [JOIN and USING](../constructs/join.md) (in the JOIN reference topic)
      + [Tracking Sensitive stage data movement](../../user-guide/access-history.md) (in the Access History query example)

Read query notes:
:   This view supports read queries of the following type:

    * SELECT, including CREATE TABLE … AS SELECT (i.e. CTAS).

      + Snowflake records the SELECT subquery in a CTAS operation.
    * CREATE TABLE … CLONE

      + Snowflake records the source table in a CLONE operation.
    * COPY INTO … TABLE

      + Snowflake logs this query only when the table is specified as the source in a FROM clause.
    * DML operations that read data (e.g. contains a SELECT subquery, specifies certain columns in WHERE or JOIN): INSERT … SELECT,
      UPDATE, DELETE, and MERGE.
    * UDFs and [Tabular SQL UDFs (UDTFs)](../../developer-guide/udf/sql/udf-sql-tabular-functions.md) if tables are included in queries inside the functions. This is
      logged in the `base_objects_accessed` field.

Write operation notes:
:   This view supports write operations of the following type:

    * GET `<internal_stage>`
    * PUT `<internal_stage>`
    * DELETE
    * TRUNCATE
    * INSERT

      + INSERT INTO … FROM SELECT \*
      + INSERT INTO TABLE … VALUES ()
    * MERGE INTO … FROM SELECT \*
    * UPDATE

      + UPDATE TABLE … FROM SELECT \* FROM …
      + UPDATE TABLE … WHERE …
    * Data loading statements:

      + COPY INTO TABLE FROM internalStage
      + COPY INTO TABLE FROM externalStage
      + COPY INTO TABLE FROM externalLocation
    * Data unloading statements:

      + COPY INTO internalStage FROM TABLE
      + COPY INTO externalStage FROM TABLE
      + COPY INTO externalLocation FROM TABLE
    * CREATE:

      + CREATE DATABASE … CLONE
      + CREATE SCHEMA … CLONE
      + CREATE TABLE … CLONE
      + CREATE TABLE … AS SELECT
    * For write operations that call the [CASE](../functions/case.md) function to determine the columns to access, such as a CTAS
      statement with the CASE function in the SELECT query, all columns referenced in every CASE branch are recorded in the
      `base_objects_accessed` column, the `direct_objects_accessed` column, or both columns depending on how the CTAS statement
      is written.

Data sharing notes:
:   If a Data Sharing provider account shares objects to Data Sharing consumer accounts through a share:

    * **Provider accounts:** The queries and logs on the shared objects executed in the provider account are not visible to
      Data Sharing consumer accounts.
    * **Consumer accounts:** The queries on the data share executed in the consumer account are logged and only visible to
      the consumer account, not the Data Sharing provider account.

      For example, if the provider shares a table and a view built from the table to the consumer account, and there is a query on the
      shared view, Snowflake records the shared view access in the `base_objects_accessed` column. This record, which includes the
      `columnName` and `objectName` values, allows the consumer to know which object was accessed in their account and also protects
      the provider because the underlying table (via the `objectId` and `columnId`) is not revealed to the consumer.
    * For column lineage:

      If a data sharing provider makes a view available to the data sharing consumer, the source columns for the view are not visible to the
      consumer because the columns originate from the data sharing provider.

      If the data sharing consumer moves data from the shared view to a table, Snowflake does not record the view columns as
      `baseSources` for the newly created table.
    * For shared UDFs and UDTFs:

      + In the consumer account, the local ACCESS_HISTORY view records the UDF/UDTF that was shared by the provider when the shared UDF/UDTF
        is invoked by the consumer.
      + In the provider account, the local ACCESS_HISTORY view records provider usage of a shared UDF/UDTF. Users in the consumer account
        cannot view how the provider account uses the shared UDF/UDTF.
    * For tracking policy references:

      The `policies_referenced` column contains policies that are local to the account that queries the data.

      If a provider shares a policy-protected table and a consumer accesses this table, the consumer cannot see the policy the provider set
      on the table or its columns.

      If a consumer creates a view (`v1`) from the shared object, sets a policy to the view (`v1`) or its columns, and a user in the
      consumer account accesses the protected view (`v1`) or another view (`v2`) created from the protected view (`v1`), the
      ACCESS_HISTORY view in the consumer account contains the policy that protects the view (`v1`) and its columns. The provider cannot
      see the record that corresponds to `v1`.

Hybrid tables:
:   Short-running queries that operate exclusively against hybrid tables will no
    longer generate a record in the QUERY_HISTORY view, in [QUERY_HISTORY view](query_history.md), or
    in the output of the QUERY_HISTORY table function. To monitor such queries, use the
    [AGGREGATE_QUERY_HISTORY](aggregate_query_history.md).

    To monitor Access History for such queries, use the
    [AGGREGATE_ACCESS_HISTORY](aggregate_access_history.md).
    This view allows you to more easily monitor high-throughput operational
    workloads for Access History.

Snowflake Native App Framework notes:
:   Some queries related to a Snowflake Native App are redacted. For details, see [Information redacted from SQL commands and views](../../developer-guide/native-apps/redacted-content.md).

Tag-based masking notes:
:   If a user accesses a table or view protected by a [tag-based masking policy](../../user-guide/tag-based-masking-policies.md), the
    `policies_referenced` column contains the masking policy applied through the tag when Snowflake enforces the masking policy on the
    protected column.

    The ACCESS_HISTORY view does not record any tag information.

UDFs & Stored Procedure notes:
:   These notes apply to external functions, UDFs and UDTFs for all languages, including when these functions have the `SECURE` property,
    and stored procedures with owner’s rights and caller’s rights:

    Column details:

    * The `direct_objects_accessed` column records explicit mention of these functions and procedures in a query.

      Snowflake does not record nested UDFs (i.e. a UDF mentioned in the definition of another UDF) in this column.
    * The `base_objects_accessed` column records external functions, shared functions, non-SQL UDFs, and stored procedures that are
      called in a query.
    * The `objects_modified` column records:

      + The UDF/UDTF when the result of calling the function copies the result to another column.
      + The UDF, UDTF, and an external function can be recorded in the arrays for `baseSources` and `directSources` depending on how the
        query is written.

Not supported:
:   This view does not log accesses of the following types:

    * Snowflake-provided [table functions](../functions-table.md), [Account Usage](../account-usage.md) views, and
      [Organization Usage](../organization-usage.md) views.
    * [RESULT_SCAN](../functions/result_scan.md) to obtain prior results.
    * An Access History record is generated when DDL operations are performed on
      [sequences](../../user-guide/querying-sequences.md). It is not generated when a sequence is used in any other
      operations, including generating new values.
    * Intermediate views accessed between the base table and direct object.

      For example, consider a query on View_A with the following object structure: View_A » View_B » View_C » Base_Table.

      The ACCESS_HISTORY view records the query on View_A and the Base_Table, not View_B and View_C.
    * The operations to update streams.
    * Data movement resulting from replication.
    * Failed queries, although logged in the QUERY_HISTORY view, will *not* be logged in the ACCESS_HISTORY view.

## Usage Notes: Column Lineage

These additional notes pertain to column lineage:

Supported operations:
:   Column lineage tracks details for the following SQL operations:

    * [CREATE TABLE … AS SELECT](../sql/create-table.md) (CTAS)
    * [CREATE TABLE … CLONE](../sql/create-table.md)
    * [INSERT … SELECT …](../sql/insert.md)
    * [MERGE](../sql/merge.md)
    * [UPDATE](../sql/update.md), two possible variations, for example:

      + Self-update:

        ```sqlexample
        UPDATE mydb.s1.t1 SET col_1 = col_1 + 1;
        ```
      + Two table update:

        ```sqlexample
        UPDATE mydb.s1.t1 FROM mydb.s2.t2 SET t1.col1 = t2.col1;
        ```
    * [ALTER TABLE](../sql/alter-table.md) … RENAME TO

Query Conditions:
:   * [Query profile/plan](../../user-guide/ui-snowsight-activity.md)

      The query plan Snowflake writes determines whether the ACCESS_HISTORY view contains column lineage. If a column needs to be
      evaluated as part of the query plan, Snowflake contains the column in the ACCESS_HISTORY view, even if the end result of the query plan
      is that the column is not included in the end result.

      For example, consider the following [INSERT](../sql/insert.md) statement with a `WHERE` clause for a particular column value:

      > ```sqlexample
      > insert into a(c1)
      > select c2
      > from b
      > where c3 > 1;
      > ```

      Even if the WHERE clause evaluates to `FALSE`, Snowflake records the `c2` column as a source column for the `c1` column. The
      `c3` column is not listed as a source column for either `baseSources` or `directSources`.
    * Masked columns:

      + The masked column is always listed in the `directSources` field.
      + The record in the `baseSources` field depends on the policy definition. For example:

        - If the masking policy conditions use a [CASE](../functions/case.md) function, then all of the columns referenced in each of
          the CASE branches are recorded in the `baseSources` field.
        - If the masking policy conditions only specify a constant value (e.g. `*****`), then the `baseSources` field is empty.
    * UDFs:

      + When passing a column as an argument to a UDF and writing the result to another column, the column that is passed as the argument
        is recorded in the `directSources` field. For example:

        > ```sqlexample
        > insert into A(col1) select f(col2) from B;
        > ```

        In this example, Snowflake records `col2` in the `directSources` field because the column is an argument for the UDF named
        `f`.
      + The record in the `baseSources` field depends on the UDF definition.

View columns:
:   View columns are not considered to be source columns and are not listed in the `baseSources` field when data from a view column
    is copied to a table column. The view columns in this case are listed in the `directSources` field.

EXISTS Subquery:
:   Columns that are referenced in the [EXISTS](../operators-subquery.md) subquery clause are not considered to be source
    columns.

## Usage Notes: `object_modified_by_ddl` Column

`IF [ NOT ] EXISTS` clauses: The `object_modified_by_ddl` column only records `CREATE` or `REPLACE` when creating
or modifying an object.

The column records these changes based on the following SQL operations. The DROP and UNDROP operations apply to tables and views, not
columns.

```sqlexample
CREATE OR REPLACE

ALTER ... { SET | UNSET }

ALTER ... ADD ROW ACCESS POLICY

ALTER ... DROP ROW ACCESS POLICY

ALTER ... DROP ALL ROW ACCESS POLICIES

DROP | UNDROP
```

The following table summarizes the relationship between DDL operations, supported domains, and the properties Snowflake records.

| Operation | Domain | Properties | Notes |
| --- | --- | --- | --- |
| CREATE [ OR REPLACE ] | TABLE | EXTERNAL TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | Column name, column identifier. | CREATE DATABASE and CREATE SCHEMA operations do not have properties recorded. |
| CREATE | TABLE … { AS SELECT | USING TEMPLATE | LIKE | CLONE } | Column name, column identifier. | Snowflake records the creation source for LIKE and CLONE operations.  Snowflake does not record the creation source when the source object is from a share or with USING TEMPLATE. |
| ALTER … RENAME TO  ALTER TABLE … RENAME COLUMN | TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | DATABASE | SCHEMA | The new name of the object or column. |  |
| ALTER … SWAP WITH | TABLE | SCHEMA | DATABASE | objectName, objectId, objectDomain | There are two records in the view, one for each swap target. Each record contains the same query identifier value. |
| ALTER … { ADD | DROP } COLUMN | TABLE | Column name, column identifier, and the ADD or DROP subOperationType. |  |
| DROP | TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | DATABASE | SCHEMA | Snowflake does not record properties for these operations. |  |
| UNDROP | TABLE | ICEBERG TABLE | SCHEMA | DATABASE | Snowflake does not record properties for these operations. |  |

## Usage notes: Truncation

When a record exceeds the size limit for the view, Snowflake applies a progressive truncation strategy that preserves the most critical audit information while reducing the record size. Truncation adheres to the following general guidelines:

* Column-level information is truncated before object-level information.
* Lineage information is truncated before data access and data protection policy information.
* Query-level metadata columns (`query_id`, `query_start_time`, and `user_name`) are always preserved.

When truncating information, Snowflake replaces numbers with `-1` and replaces strings with `TRUNCATED`. These sentinel elements indicate that information has been truncated.

The following sections describe the order in which records are truncated. Truncation stops as soon as the record fits within the size constraints.

Phase 1: Truncate column lineage in the `object_modified` column
:   ```json
      {
      "objectDomain": "Stream",
      "objectId":  1105,
      "objectName": "\"NESTED_ALERT_PIPELINE_ALERT_eK1VYsLDcTcpqPAA\"",
      "columns": [
        {
          "columnId": -1,
          "columnName": "TRUNCATED",
        }
      ]
    }
    ```

Phase 2: Truncate column information in the `policies_referenced` column
:   ```json
    [
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED",
          }
        ],
        "objectDomain": "VIEW",
        "objectId": 66564,
        "objectName": "GOVERNANCE.VIEWS.V1",
        "policies": [
          {
            "policyName": "governance.policies.rap1",
            "policyId": 68813,
            "policyKind": "ROW_ACCESS_POLICY"
          }
      ]
      }
    ]
    ```

Phase 3: Truncate column access information in the `base_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "Function",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 4: Truncate column access information in the `direct_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "Function",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 5: Truncate column properties in the `object_modified_by_ddl` column
:   ```json
    {
      "objectDomain": "Table",
      "objectId": 20196,
      "objectName": "MY_DB.PUBLIC.T2",
      "operationType": "REPLACE",
      "properties": {
        "columns": "TRUNCATED",
      }
    }
    ```

Phase 6: Truncate column access information in the `provider_base_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "FUNCTION",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 7: Truncate column information in the `provider_policies_referenced` column
:   ```json
    [
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED",
          }
        ],
        "objectDomain": "VIEW",
        "objectId": 66564,
        "objectName": "GOVERNANCE.VIEWS.V1",
        "policies": [
          {
            "policyName": "governance.policies.rap1",
            "policyId": 68813,
            "policyKind": "ROW_ACCESS_POLICY"
          }
        ]
      }
    ]
    ```

Phase 8: Replace information in columns with a single sentinel record
:   > As the last phase in the truncation process, Snowflake replaces all the information in a column with a single sentinel object. Snowflake replaces information in the following order:
    >
    > * `policies_referenced` column
    > * `objects_modified` column
    > * `base_objects_accessed` column
    > * `provider_base_objects_accessed` column (ORGANIZATION_USAGE schema only)
    > * `provider_policies_referenced` column (ORGANIZATION_USAGE schema only)

    The following is an example of a sentinel object found in a column:

    > ```json
    > [
    >   {
    >     "objectDomain": "TRUNCATED",
    >     "objectId": -1,
    >     "objectName": "TRUNCATED",
    >   }
    > ]
    > ```

---
title: ACCOUNT_USAGE.ONLINE_FEATURE_TABLE_REFRESH_HISTORY
source: https://docs.snowflake.com/en/sql-reference/account-usage/online_feature_table_refresh_history.md
section: Account Usage
---

# ACCOUNT_USAGE.ONLINE_FEATURE_TABLE_REFRESH_HISTORY

This Account Usage view displays information for online feature table refresh history.

See also:
:   [ONLINE_FEATURE_TABLE_REFRESH_HISTORY](../functions/online-feature-table-refresh-history.md) (Information Schema)

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| `NAME` | TEXT | Name of the online feature table. |
| `SCHEMA_NAME` | TEXT | Name of the schema that contains the online feature table. |
| `DATABASE_NAME` | TEXT | Name of the database that contains the online feature table. |
| `QUALIFIED_NAME` | TEXT | Fully qualified name of the online feature table. |
| `STATE` | TEXT | Status of the refresh for the online feature table. The status can be one of the following:   * `EXECUTING`: refresh in progress. * `SUCCEEDED`: refresh completed successfully. * `FAILED`: refresh failed during execution. * `CANCELLED`: refresh was canceled before completion. |
| `STATE_CODE` | TEXT | Code representing the current state of the refresh. |
| `STATE_MESSAGE` | TEXT | Description of the current state of the refresh. |
| `REFRESH_START_TIME` | TIMESTAMP_LTZ | Time when the refresh job started. |
| `REFRESH_END_TIME` | TIMESTAMP_LTZ | Time when the refresh completed. |
| `REFRESH_TRIGGER` | TEXT | One of:   * `SCHEDULED`: normal background refresh to meet target lag. * `MANUAL`: user/task ran `ALTER ONLINE FEATURE TABLE <name> REFRESH` command. * `CREATION`: refresh performed during the creation DDL statement, triggered by the creation of the online feature table. |
| `REFRESH_ACTION` | TEXT | One of:   * `NO_DATA`: no new data in base tables. Doesn’t apply to the initial refresh of newly created online feature tables regardless of whether or not the base tables have data. * `REINITIALIZE`: base table changed. * `FULL`: Full refresh, because refresh mode of the online feature table is set to FULL. * `INCREMENTAL`: normal incremental refresh. |

## Usage notes

* Online Feature Table refresh history in this view may lag by up to 3 hours.
* To query this view, use a role that is granted the SNOWFLAKE database role USAGE_VIEWER.
* The [ONLINE_FEATURE_TABLE_REFRESH_HISTORY](../functions/online-feature-table-refresh-history.md) function offers up-to-date refresh history.

## Access control requirements

| Privilege | Object | Notes |
| --- | --- | --- |
| USAGE_VIEWER database role | SNOWFLAKE database | Required to query Account Usage views. |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

---
title: AGGREGATE_ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/aggregate_access_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# AGGREGATE_ACCESS_HISTORY view

This Account Usage view provides aggregated [Access History](../../user-guide/access-history.md) for all workloads in Snowflake.
When a workload involves highly recurrent transactional queries, the access pattern of those queries is also frequently repeated. It is more efficient to view such access history information in an aggregation.

The AGGREGATE_ACCESS_HISTORY view contains similar data to the
[ACCESS_HISTORY view](access_history.md), aggregated over time for
repeated queries in one-minute intervals.

This view also provides access history information associated with both
analytical and transactional queries. In contrast, note that the
[ACCESS_HISTORY view](access_history.md) contains access history
information associated only with queries that appear in the
[QUERY_HISTORY view](query_history.md), and does not include
certain short-running transactional queries.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| INTERVAL_START_TIME | TIMESTAMP_LTZ | Start time of the window of measurement. |
| INTERVAL_END_TIME | TIMESTAMP_LTZ | End time of the window of measurement. |
| QUERY_PARAMETERIZED_HASH | TEXT | Unique ID to identify identical parameterized queries. See [QUERY_PARAMETERIZED_HASH column](aggregate_query_history.md). |
| USER_NAME | TEXT | User who issued the query. |
| CALLS | NUMBER | The number of times the access behavior occurred during the window of time specified by INTERVAL_START_TIME and INTERVAL_END_TIME and triggered by a specific parameterized query and user. |
| DIRECT_OBJECTS_ACCESSED | ARRAY | A JSON array of data objects such as user-defined functions (i.e. UDFs and UDTFs), stored procedures, tables, views, and columns directly named in the query explicitly or through shortcuts such as using an asterisk (i.e. `*`).  Virtual columns can be returned in this field.  For additional notes about UDFs, see [Usage notes](access_history.md). |
| BASE_OBJECTS_ACCESSED | ARRAY | A JSON array of all base data objects to execute a query, including columns, external functions, UDFs, and stored procedures.  In the example in [ACCESS_HISTORY view](access_history.md), the fields in the first array specify a UDF. These same fields in the first array also specify a stored procedure, when applicable.  Note the following:   * This field specifies view names or view columns, including virtual columns, if a shared view is accessed in a data sharing consumer   account.   For additional notes about UDFs, see [Usage notes](access_history.md). |
| OBJECTS_MODIFIED | ARRAY | A JSON array that specifies the objects that were associated with a write operation in the query.  The UDF and stored procedure array is the same as what appears in the arrays for `baseSources` and `directSources` in the examples in [ACCESS_HISTORY view](access_history.md), depending on how the access took place. For brevity, the example omits the UDF and stored procedure array  For additional notes about UDFs, see [Usage notes](access_history.md). |
| OBJECT_MODIFIED_BY_DDL | OBJECT | Specifies the DDL operation on a database, schema, table, view, and column. These operations also include statements that specify a row access policy on a table or view, a masking policy on a column, and tag updates (e.g. set a tag, change a tag value) on the object or column. |
| POLICIES_REFERENCED | ARRAY | Specifies information about the enforced masking policy set on the column and the enforced row access policy set on the table, including policies set on intermediate objects or columns. |

The fields in the JSON array for the DIRECT_OBJECTS_ACCESSED, BASE_OBJECTS_ACCESSED, OBJECTS_MODIFIED, and POLICIES_REFERENCED columns are
described below.

| Field | Data Type | Description |
| --- | --- | --- |
| columnId | NUMBER | A column ID that is unique within the account. This value is identical to the value in the `column_id` column in the [COLUMNS](columns.md) view. |
| columnName | TEXT | The name of the accessed column. For policies, specifies the column on which the masking policy is set. |
| objectId | NUMBER | An identifier for the object, which is unique within a given account and domain. This number will match:   * The value in the `TABLE_ID` column in the [TABLE](tables.md), [VIEWS](views.md),   and [MATERIALIZED_VIEW_REFRESH_HISTORY](materialized_view_refresh_history.md) views. * If a stage was accessed, this number will match the:    + `NAME` identifier for a [user](users.md) (User stage).   + `TABLE_ID` number for a [table](tables.md) (Table stage).   + `STAGE_ID` number for a [stage](stages.md) (Named stage). |
| objectName | TEXT | The fully qualified name of the object that was accessed.  If a masking policy is set on a column or a row access policy is set on a table or view, the value refers to the fully qualified name of the table or view on which the row access policy is set or the table or view that has a masking policy set on one of its columns.  If a stage was accessed, this value will be the:   * `username` (User stage). * `table_name` (Table stage). * `stage_name` (Named stage). |
| objectDomain | TEXT | One of the following: `EXTERNAL TABLE`, `FUNCTION`, `MATERIALIZED VIEW`, `PROCEDURE`, `STAGE`, `STREAM`, or `VIEW`.  Note that `FUNCTION` specifies UDFs, UDTFs, and external functions.  For policies, specifies the domain of the object on which the row access policy is set. |
| location | TEXT | The URL of the external location when data is accessed from an external location (for example, `s3://mybucket/a.csv`).  If the query does not access a stage, this field is omitted. |
| stageKind | TEXT | When writing to a stage, one of the following: `Table`, `User`, `Internal Named`, or `External Named`.  If the query does not access a stage, this field is omitted. |
| baseSources | TEXT | The columns that serve as the source columns for the columns specified by `directSources`. These columns facilitate column lineage. |
| directSources | TEXT | The columns specifically mentioned in the data write portion of the SQL statement that serves as the source columns in the target table to which data is written. These columns facilitate column lineage. |
| policyName | TEXT | The fully-qualified name of the policy. |
| policyId | NUMBER | An identifier for the policy, which is unique within a given account and domain. This value matches the identifier for a masking policy in the [MASKING_POLICIES view](masking_policies.md) or the identifier for a row access policy in the [ROW_ACCESS_POLICIES view](row_access_policies.md) |
| policyKind | TEXT | Either: MASKING_POLICY or ROW_ACCESS_POLICY |
| argumentSignature | TEXT | The name and data type for each argument in the UDF or stored procedure. |
| dataType |  | The data type of the return value for a UDF or stored procedure.  This value helps to differentiate two or more UDFs that have the same name but different return types. |

The fields for the OBJECT_MODIFIED_BY_DDL column are described below.

| Field | Data type | Description |
| --- | --- | --- |
| objectDomain | TEXT | The domain of the object defined or modified by the DDL operation, which includes [all objects that can be tagged](../../user-guide/object-tagging/introduction.md) and `MASKING POLICY`, `ROW ACCESS POLICY`, and `TAG`. |
| objectId | NUMBER | The identifier for the object, which is unique within a given account and domain, defined or modified by the DDL operation. |
| objectName | TEXT | The fully qualified name of the object defined or modified by the DDL operation. |
| operationType | TEXT | The SQL keyword that specifies the operation on the table, view, or column: `ALTER`, `CREATE`, `DROP`, `REPLACE`, or `UNDROP`. |
| properties | ARRAY | A JSON array that specifies the object or column properties when you create, modify, drop, or undrop the object or column. There are two types of properties: atomic and compound. |

For the `properties` field:

* Atomic: one value per property (e.g. a `comment` has a single string value, the `enabled` property is a boolean and has one value).
* Compound: the property is multi-valued (e.g. `allowed_values` for a tag, masking policy).

Compound properties are recorded in a JSON array. For example, if a table contains a single column named EMAIL, the column is recorded as
follows:

```sqljson
columns: {
  "email": {
    objectId: {
      "value": 1
    },
    "subOperationType": "ADD"
  }
}
```

The `subOperationType` value can be one of the following:

* `ADD` specifies adding a compound property (e.g. add a column, set allowed values).
* `DROP` specifies removing a compound property.
* `ALTER` specifies modifying a compound property.

The `objectId` specifies the identifier for the column or object, except for allowed tag values which do not have an
identifier.

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* This Account Usage view can be used to query the aggregated access history of Snowflake objects (e.g. table, view, column) within the last 365 days (1 year).

---
title: AGGREGATE_QUERY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/aggregate_query_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# AGGREGATE_QUERY_HISTORY view

This Account Usage view enables you to monitor and track execution of statements
over time. It contains similar data to the QUERY_HISTORY view but is aggregated
in one-minute intervals for repeated SQL statements. You can use this view to
monitor your workload and analyze performance.

In addition to queries against hybrid tables, all queries that you execute in
Snowflake are included in AGGREGATE_QUERY_HISTORY. However, AGGREGATE_QUERY_HISTORY
is particularly useful for monitoring and analyzing Unistore workloads
that execute a small number of distinct statements repeatedly at high throughput.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CALLS | NUMBER | Number of times the statement (query + query plan) was executed in the aggregation interval. |
| INTERVAL_START_TIME | TIMESTAMP_LTZ | Start time of the window of measurement (in the local time zone). |
| INTERVAL_END_TIME | TIMESTAMP_LTZ | End time of the window of measurement (in the local time zone). |
| QUERY_PARAMETERIZED_HASH | TEXT | Unique ID to identify identical parameterized queries. See QUERY_PARAMETERIZED_HASH column. |
| QUERY_TEXT | TEXT | Sample text of the SQL statement. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that was in use. |
| DATABASE_NAME | TEXT | Database that was in use at the time of the query. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that was in use. |
| SCHEMA_NAME | TEXT | Schema that was in use at the time of the query. |
| QUERY_TYPE | TEXT | DML, query, etc. If the query failed, then the query type may be UNKNOWN. |
| SESSION_ID | NUMBER | Session that executed the statement. |
| USER_NAME | TEXT | User who issued the query. |
| ROLE_NAME | TEXT | Role that was active in the session at the time of the query. |
| ROLE_TYPE | TEXT | Specifies `APPLICATION`, `DATABASE_ROLE`, or `ROLE` that executed the query. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that was used. |
| WAREHOUSE_NAME | TEXT | Warehouse that the query executed on, if any. |
| WAREHOUSE_SIZE | TEXT | Size of the warehouse when this statement executed. |
| WAREHOUSE_TYPE | TEXT | Type of the warehouse when this statement executed. |
| QUERY_TAG | TEXT | Query tag set for this statement through the QUERY_TAG session parameter. |
| IS_CLIENT_GENERATED_STATEMENT | BOOLEAN | Indicates whether the query was client-generated. |
| RELEASE_VERSION | TEXT | Release version in the format of `major_release.minor_release.patch_release`. |
| ERRORS | ARRAY | List of error codes and messages that occurred during the aggregation interval. Each error is in the format of `{"code": "code1", "message": "msg1", "count": 10}`. |
| TOTAL_ELAPSED_TIME | OBJECT | Elapsed time (in milliseconds). |
| BYTES_SCANNED | OBJECT | Number of bytes scanned by this statement. |
| PERCENTAGE_SCANNED_FROM_CACHE | OBJECT | The percentage of data scanned from the local disk cache. The value ranges from 0.0 to 1.0. Multiply by 100 to get a true percentage. |
| BYTES_WRITTEN | OBJECT | Number of bytes written (e.g. when loading into a table). |
| BYTES_WRITTEN_TO_RESULT | OBJECT | Number of bytes written to a result object. For example, `select * from . . .` would produce a set of results in tabular format representing each field in the selection. . . In general, the results object represents whatever is produced as a result of the query, and `BYTES_WRITTEN_TO_RESULT` represents the size of the returned result. |
| BYTES_READ_FROM_RESULT | OBJECT | Number of bytes read from a result object. |
| ROWS_PRODUCED | OBJECT | Number of rows produced by this statement. |
| ROWS_INSERTED | OBJECT | Number of rows inserted by the query. |
| ROWS_UPDATED | OBJECT | Number of rows updated by the query. |
| ROWS_DELETED | OBJECT | Number of rows deleted by the query. |
| ROWS_UNLOADED | OBJECT | Number of rows unloaded during data export. |
| BYTES_DELETED | OBJECT | Number of bytes deleted by the query. |
| PARTITIONS_SCANNED | OBJECT | Number of micro-partitions scanned. |
| PARTITIONS_TOTAL | OBJECT | Total micro-partitions of all tables included in this query. |
| BYTES_SPILLED_TO_LOCAL_STORAGE | OBJECT | Volume of data spilled to local disk. |
| BYTES_SPILLED_TO_REMOTE_STORAGE | OBJECT | Volume of data spilled to remote disk. |
| BYTES_SENT_OVER_THE_NETWORK | OBJECT | Volume of data sent over the network. |
| COMPILATION_TIME | OBJECT | Compilation time (in milliseconds). |
| EXECUTION_TIME | OBJECT | Execution time (in milliseconds). |
| QUEUED_PROVISIONING_TIME | OBJECT | Time (in milliseconds) spent in the warehouse queue, waiting for the warehouse compute resources to provision, due to warehouse creation, resume, or resize. |
| QUEUED_REPAIR_TIME | OBJECT | Time (in milliseconds) spent in the warehouse queue, waiting for compute resources in the warehouse to be repaired. |
| QUEUED_OVERLOAD_TIME | OBJECT | Time (in milliseconds) spent in the warehouse queue, due to the warehouse being overloaded by the current query workload. |
| TRANSACTION_BLOCKED_TIME | OBJECT | Time (in milliseconds) spent blocked by a concurrent DML. |
| OUTBOUND_DATA_TRANSFER_CLOUD | TEXT | Target cloud provider for statements that unload data to another region and/or cloud. |
| OUTBOUND_DATA_TRANSFER_REGION | TEXT | Target region for statements that unload data to another region and/or cloud. |
| OUTBOUND_DATA_TRANSFER_BYTES | OBJECT | Number of bytes transferred in statements that unload data to another region and/or cloud. |
| INBOUND_DATA_TRANSFER_CLOUD | TEXT | Source cloud provider for statements that load data from another region and/or cloud. |
| INBOUND_DATA_TRANSFER_REGION | TEXT | Source region for statements that load data from another region and/or cloud. |
| INBOUND_DATA_TRANSFER_BYTES | OBJECT | Number of bytes transferred in a replication operation from another account. The source account could be in the same region or a different region than the current account. |
| LIST_EXTERNAL_FILES_TIME | OBJECT | Time (in milliseconds) spent listing external files. |
| CREDITS_USED_CLOUD_SERVICES | OBJECT | Number of credits used for cloud services. |
| EXTERNAL_FUNCTION_TOTAL_INVOCATIONS | OBJECT | Aggregate number of times that this query called remote services. For important details, see the Usage Notes. |
| EXTERNAL_FUNCTION_TOTAL_SENT_ROWS | OBJECT | Total number of rows that this query sent in all calls to all remote services. |
| EXTERNAL_FUNCTION_TOTAL_RECEIVED_ROWS | OBJECT | Total number of rows that this query received from all calls to all remote services. |
| EXTERNAL_FUNCTION_TOTAL_SENT_BYTES | OBJECT | Total number of bytes that this query sent in all calls to all remote services. |
| EXTERNAL_FUNCTION_TOTAL_RECEIVED_BYTES | OBJECT | Total number of bytes that this query received from all calls to all remote services. |
| QUERY_LOAD_PERCENT | OBJECT | The approximate percentage of active compute resources in the warehouse for this query execution. |
| QUERY_ACCELERATION_BYTES_SCANNED | OBJECT | Number of bytes scanned by the [query acceleration service](../../user-guide/query-acceleration-service.md). |
| QUERY_ACCELERATION_PARTITIONS_SCANNED | OBJECT | Number of partitions scanned by the query acceleration service. |
| QUERY_ACCELERATION_UPPER_LIMIT_SCALE_FACTOR | OBJECT | Upper limit [scale factor](../../user-guide/query-acceleration-service.md) that a [query would have benefited from](../../user-guide/query-acceleration-service.md). |
| CHILD_QUERIES_WAIT_TIME | OBJECT | Time (in milliseconds) to complete the cached lookup when calling a [memoizable function](../../developer-guide/udf/sql/udf-sql-scalar-functions.md). |
| HYBRID_TABLE_REQUESTS_THROTTLED_COUNT | NUMBER | Number of hybrid table queries that were throttled. |

The OBJECT data type contains the following fields:

| Field Name | Description |
| --- | --- |
| [sum](../functions/sum.md) | Sum across all executions within the aggregation interval. |
| [avg](../functions/avg.md) | Average across all executions within the aggregation interval. |
| [stddev](../functions/stddev.md) | Standard deviation across all executions within the aggregation interval. |
| [min](../functions/min.md) | Minimum across all executions within the aggregation interval. |
| [median](../functions/median.md) | Median across all executions within the aggregation interval. |
| [p90](../functions/percentile_cont.md) | 90th percentile across all executions within the aggregation interval. |
| [p99](../functions/percentile_cont.md) | 99th percentile across all executions within the aggregation interval. |
| [p99.9](../functions/percentile_cont.md) | 99.9th percentile across all executions within the aggregation interval. |
| [max](../functions/max.md) | Maximum across all executions within the aggregation interval. |

> **Note:**
>
> The following columns of the type OBJECT do not contain a `sum` field:
>
> * PERCENTAGE_SCANNED_FROM_CACHE
> * QUERY_LOAD_PERCENT
> * QUERY_ACCELERATION_UPPER_LIMIT_SCALE_FACTOR

### QUERY_PARAMETERIZED_HASH column

The QUERY_PARAMETERIZED_HASH column contains a hash value that is computed based on the parameterized query, which means the version of the query after parameterizing all literals.

For example, the following queries have the same QUERY_PARAMETERIZED_HASH value:

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'TIM'
```

```sqlexample
SELECT * FROM table1 WHERE table1.name = 'AIHUA'
```

The QUERY_PARAMETERIZED_HASH value has the following restrictions:

> * The constant literal must be in the following binary functions on predicates: equal, not equal, greater (or equal) than, smaller (or equal) than.
> * The aliases must be the same.

As long as there are difference in the SQL text, the QUERY_HASH and QUERY_PARAMETERIZED_HASH values will be different, with the following exceptions:

> * Identifier/session variable/stage name are case insensitive.
> * White space differences are ignored.
> * Literals satisfying the binary predicate rule mentioned above.

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

## Examples

You can use the AGGREGATE_QUERY_HISTORY view to monitor for potential problems with errors, queueing, lock blocking, or hybrid table throttling.
You typically want these metrics to be consistently low. If you see a spike in any of these metrics, it may indicate a problem:

> ```sqlexample
> SET (START_DATE, END_DATE) = ('2023-11-01', '2023-11-08');
>
> WITH time_issues AS
> (
>     SELECT
>         interval_start_time
>         , SUM(transaction_blocked_time:"sum") AS transaction_blocked_time
>         , SUM(queued_provisioning_time:"sum") AS queued_provisioning_time
>         , SUM(queued_repair_time:"sum") AS queued_repair_time
>         , SUM(queued_overload_time:"sum") AS queued_overload_time
>         , SUM(hybrid_table_requests_throttled_count) AS hybrid_table_requests_throttled_count
>     FROM snowflake.account_usage.aggregate_query_history
>     WHERE TRUE
>         AND interval_start_time > $START_DATE
>         AND interval_start_time < $END_DATE
>     GROUP BY ALL
> ),
> errors AS
> (
>     SELECT
>         interval_start_time
>         , SUM(value:"count") as error_count
>     FROM
>     (
>         SELECT
>             a.interval_start_time
>             , e.*
>         FROM
>             snowflake.account_usage.aggregate_query_history a,
>             TABLE(FLATTEN(input => errors)) e
>         WHERE TRUE
>             AND interval_start_time > $START_DATE
>             AND interval_start_time < $END_DATE
>     )
>     GROUP BY ALL
> )
> SELECT
>     time_issues.interval_start_time
>     , error_count
>     , transaction_blocked_time
>     , queued_provisioning_time
>     , queued_repair_time
>     , queued_overload_time
>     , hybrid_table_requests_throttled_count
> FROM
>     time_issues FULL JOIN errors ON errors.interval_start_time = time_issues.interval_start_time
> ;
> ```

You can query the view to monitor your overall workload throughput and concurrency. Many workloads have a regular cyclical pattern.
Any unexpected spikes or drops may be worth investigating.

For example, monitor throughput and concurrency for warehouse `my_warehouse` in the first week of November:

```sqlexample
SELECT
    interval_start_time
    , SUM(calls) AS execution_count
    , SUM(calls) / 60 AS queries_per_second
    , COUNT(DISTINCT session_id) AS unique_sessions
    , COUNT(user_name) AS unique_users
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
    AND warehouse_name = 'MY_WAREHOUSE'
    AND interval_start_time > '2023-11-01'
    AND interval_start_time < '2023-11-08'
GROUP BY
    interval_start_time
;
```

The most common and heavily repeated queries can be a good place to focus any efforts to optimize or improve the efficiency of
your workload. You can query the view to identify top queries for a workload by execution count.

For example, identify the top queries by execution count for warehouse `my_warehouse`:

```sqlexample
SELECT
    query_parameterized_hash
    , ANY_VALUE(query_text)
    , SUM(calls) AS execution_count
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
    AND warehouse_name = 'MY_WAREHOUSE'
    AND interval_start_time > '2023-11-01'
    AND interval_start_time < '2023-11-08'
GROUP BY
    query_parameterized_hash
ORDER BY execution_count DESC
;
```

To identify slowest queries by average total latency:

```sqlexample
SELECT
    query_parameterized_hash
    , any_value(query_text)
    , SUM(total_elapsed_time:"sum"::NUMBER) / SUM (calls) as avg_latency
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
    AND warehouse_name = 'MY_WAREHOUSE'
    AND interval_start_time > '2023-07-01'
    AND interval_start_time < '2023-07-08'
GROUP BY
    query_parameterized_hash
ORDER BY avg_latency DESC
;
```

To analyze performance over time for a specific query of interest:

```sqlexample
SELECT
    interval_start_time
    , total_elapsed_time:"avg"::number avg_elapsed_time
    , total_elapsed_time:"min"::number min_elapsed_time
    , total_elapsed_time:"p90"::number p90_elapsed_time
    , total_elapsed_time:"p99"::number p99_elapsed_time
    , total_elapsed_time:"max"::number max_elapsed_time
FROM snowflake.account_usage.aggregate_query_history
WHERE TRUE
    AND query_parameterized_hash = '<123456>'
    AND interval_start_time > '2023-07-01'
    AND interval_start_time < '2023-07-08'
ORDER BY interval_start_time DESC
;
```

---
title: AGGREGATION_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/aggregation_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# AGGREGATION_POLICIES view

This Account Usage view provides the aggregation policies in your account.

Each row in this view corresponds to a different aggregation policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the aggregation policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the aggregation policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema that contains the aggregation policy. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the aggregation policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the aggregation policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the aggregation policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Aggregation policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the aggregation policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the aggregation policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the aggregation policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the aggregation policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: ALERT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/alert_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ALERT_HISTORY view

This Account Usage view enables you to retrieve the history of [alert](../../user-guide/alerts.md) usage within the last 365 days
(1 year). The view displays one row for each run of a alert in the history.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the alert. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the alert. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the alert. |
| ACTION | VARCHAR | The text of the SQL statement that serves as the action for the alert. |
| ACTION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the action of the alert. |
| CONDITION | VARCHAR | The text of the SQL statement that serves as the condition for the alert. |
| CONDITION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the condition of the alert. |
| ERROR_CODE | NUMBER | Error code, if the alert returned an error or failed to execute (e.g. if the current user did not have privileges to execute the alert). |
| ERROR_MESSAGE | VARCHAR | Error message, if the alert returned an error. |
| STATE | VARCHAR | Status of the alert. This can be one of the following:   * SCHEDULED: The alert will execute at the time specified by the SCHEDULED_TIME column. This status does not apply to   [alerts on new data](../../user-guide/alerts.md). * EXECUTING: The condition or action of the alert is currently executing. * FAILED: The alert failed. Either the alert condition or alert action encountered an error that prevented it from being   executed. * CANCELLED: The alert execution was cancelled (e.g. when the alert is suspended). * CONDITION_FALSE: The condition was evaluated successfully but returned no data. As a result, the action was not executed.   This status does not apply to [alerts on new data](../../user-guide/alerts.md). * CONDITION_FAILED: The evaluation of the condition failed. For details on the failure, check the ERROR_CODE and   ERROR_MESSAGE columns. * ACTION_FAILED: The condition was evaluated successfully, but the execution of the action failed. For details on the   failure, check the ERROR_CODE and ERROR_MESSAGE columns. * TRIGGERED: The condition was evaluated successfully, and the action was executed successfully. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the scheduled alert is/was scheduled to start running.  Note that we make a best effort to ensure absolute precision, but only guarantee that alerts do not execute *before* the scheduled time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the alert completed, or NULL if SCHEDULED_TIME is in the future or if the alert is still running. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database containing the schema. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema. |
| SCHEDULED_FROM | VARCHAR | Specifies what initiated the alert. The column contains one of the following values:   * `SCHEDULE`: The alert was scheduled to run normally, as described in SCHEDULE clause of   [CREATE ALERT](../sql/create-alert.md). * `EXECUTE ALERT`: The alert was scheduled to run using [EXECUTE ALERT](../sql/execute-alert.md). * `TRIGGER`: The [alert on new data](../../user-guide/alerts.md) was run because the underlying table or view   contains new data. |

## Usage notes

* Latency for the view may be up to 45 minutes.

* For increased performance, filter queries on the COMPLETED_TIME or SCHEDULED_TIME column.

## Examples

Retrieve records for the 10 most recent completed alert runs:

> ```sqlexample
> SELECT name, condition, condition_query_id, action, action_query_id, state
> FROM snowflake.account_usage.alert_history
> LIMIT 10;
> ```

Retrieve records for alert runs completed in the past hour:

> ```sqlexample
> SELECT name, condition, condition_query_id, action, action_query_id, state
> FROM snowflake.account_usage.alert_history
> WHERE COMPLETED_TIME > DATEADD(hours, -1, CURRENT_TIMESTAMP());
> ```

---
title: ANOMALIES_DAILY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/anomalies_daily.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ANOMALIES_DAILY view

This Account Usage view provides insights into whether [cost anomalies](../../user-guide/cost-anomalies.md) occurred in the account.

Each row provides the consumption on a specific day, and whether that consumption was a cost anomaly.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| DATE | DATE | Day in UTC when the consumption occurred. |
| ANOMALY_ID | VARCHAR | System-generated identifier. |
| IS_ANOMALY | BOOLEAN | If true, consumption has been identified as a cost anomaly because it has gone outside the range of the upper and lower bound. |
| ACTUAL_VALUE | NUMBER | Amount of consumption measured in credits. |
| UPPER_BOUND | NUMBER | Predicted highest level of consumption based on the anomaly-detecting algorithm, measured in credits. Consumption levels above this value are considered an anomaly. |
| LOWER_BOUND | NUMBER | Predicted lowest level of consumption based on the anomaly-detecting algorithm, measured in credits. Consumption levels below this value are considered an anomaly. |
| FORECASTED_VALUE | NUMBER | Predicted consumption based on the anomaly-detecting algorithm, measured in credits. |

## Usage notes

Latency for the view might be up to 8 hours.

---
title: APPLICATION_CALLBACK_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_callback_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# APPLICATION_CALLBACK_HISTORY view

The `APPLICATION_CALLBACK_HISTORY` view provides a history of callback invocations for Snowflake Native Apps in your Snowflake account. Each row in the view represents a callback invocation, including the callback type, state, and any error information.

For more information about callbacks, see [Callbacks](../../developer-guide/native-apps/callbacks.md).

The retention time for this view is 365 days (1 year).

## Columns

The following table provides definitions for the `APPLICATION_CALLBACK_HISTORY` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| TYPE | VARCHAR | The callback type as defined in the manifest file. |
| APPLICATION_NAME | VARCHAR | The name of the app that defines the callback. |
| STATE | VARCHAR | The state of the callback execution. Possible values are: `QUEUED`, `SCHEDULED`, `EXECUTING`, `COMPLETED`, `FAILED`, `ABORTED`. For descriptions of each state, see [Callback states](../functions/application_callback_history.md). |
| STARTED_ON | TIMESTAMP_LTZ | The timestamp when the callback was invoked. |
| COMPLETED_ON | TIMESTAMP_LTZ | The completion timestamp. NULL if the callback has not yet completed. |
| TRIGGERING_QUERY_ID | VARCHAR | The query ID of the SQL statement that triggered the callback. NULL if not applicable. |
| QUERY_ID | VARCHAR | The query ID of the callback procedure execution. |
| ERROR_CODE | VARCHAR | The error code. NULL unless STATE is `FAILED` or `ABORTED`. |
| ERROR_MESSAGE | VARCHAR | The error message. NULL unless STATE is `FAILED` or `ABORTED`. This column is redacted unless the app is installed on the same account as the app package. |

## Examples

Retrieve the callback history for all applications in the current account:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_CALLBACK_HISTORY;
```

Retrieve the callback history for a specific app:

```sqlexample
SELECT *
FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_CALLBACK_HISTORY
WHERE APPLICATION_NAME = 'my_app'
ORDER BY STARTED_ON DESC;
```

Retrieve only failed or aborted callback invocations:

```sqlexample
SELECT *
FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_CALLBACK_HISTORY
WHERE STATE IN ('FAILED', 'ABORTED')
ORDER BY STARTED_ON DESC;
```

---
title: APPLICATION_CONFIGURATION_VALUE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_configuration_value_history.md
section: Account Usage
---

# APPLICATION_CONFIGURATION_VALUE_HISTORY view

This Account Usage view provides a history of value changes for application configurations in the account.

For more information about application configuration, see [Application configuration](../../developer-guide/native-apps/app-configuration.md).
The retention time for this view is 365 days, meaning that data older than 365 days will not be available in the view.
Columns
———————————————————————-

The following table provides definitions for the `APPLICATION_CONFIGURATION_VALUE_HISTORY` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| ID | STRING | The system-generated unique identifier for the app configuration. |
| NAME | STRING | The system-generated unique identifier for the app configuration. |
| APPLICATION_ID | STRING | The system-generated unique identifier for the application that contains the configuration. |
| APPLICATION_NAME | STRING | The name of the application that the configuration is in. |
| CREATED_ON | TIMESTAMP | The timestamp when the configuration object was created. |
| MODIFIED_ON | TIMESTAMP | The timestamp when the configuration object was last updated. |
| TYPE | STRING | The type of the configuration. Possible values are APPLICATION_NAME and STRING. |
| STATUS | STRING | The status of the configuration. Possible values are PENDING and DONE. |
| SENSITIVE | BOOLEAN | Whether the value is sensitive or not. |
| VALUE | STRING | The value that is set by the consumer.  For application configurations of the APPLICATION_NAME type, this is the most up-to-date name of the application specified by the consumer. This may not be the same as initially provided if the application has been renamed. If the application has been dropped, no value will be shown here, as if the value is not set.  When `SENSITIVE=TRUE`, the value is hidden, unless the executing role is the application owning the configuration. |
| VALUE_UPDATED_ON | TIMESTAMP | The last updated timestamp when the value was set or unset. |
| LABEL | STRING | A user-friendly name to be displayed in the UI, provided by the provider. |
| DESCRIPTION | STRING | The description of the configuration. |
| APPLICATION_ROLES | STRING | The comma-separated app role names that have access to the configuration.  This displays the most up-to-date names, even if roles have been renamed. If an application role has been dropped, it will not be included in the output list. |

## Examples

Retrieve the history of value changes for application configurations in the current account:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_CONFIGURATION_VALUE_HISTORY;
```

---
title: APPLICATION_CONFIGURATIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_configurations.md
section: Account Usage
---

# APPLICATION_CONFIGURATIONS view

This Account Usage view displays a row for each application configuration currently defined in the specified or current database where the account usage schema is located.

For more information about application configuration, see [Application configuration](../../developer-guide/native-apps/app-configuration.md).

## Columns

The following table provides definitions for the `APPLICATION_CONFIGURATIONS` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | The system-generated ID for the application configuration. |
| NAME | STRING | The name of the configuration. |
| APPLICATION_ID | NUMBER | The system-generated ID for the application that the configuration is in. |
| APPLICATION_NAME | STRING | The name of the application that the configuration is in. |
| CREATED_ON | TIMESTAMP | The timestamp when the configuration object was created. |
| MODIFIED_ON | TIMESTAMP | The timestamp when the configuration object was last updated. |
| DELETED_ON | TIMESTAMP | The timestamp when the configuration object was deleted. |
| TYPE | STRING | The type of the configuration. Possible values are APPLICATION_NAME and STRING. |
| STATUS | STRING | The status of the configuration. Possible values are PENDING and DONE. |
| SENSITIVE | BOOLEAN | Whether the value is sensitive or not. |
| VALUE | STRING | The value that is set by the consumer.  For application configurations of the APPLICATION_NAME type, this is the most up-to-date name of the application specified by the consumer. This may not be the same as initially provided if the application has been renamed. If the application has been dropped, no value will be shown here, as if the value is not set.  When `SENSITIVE=TRUE`, the value is hidden, unless the executing role is the application owning the configuration. |
| VALUE_UPDATED_ON | TIMESTAMP | The last updated timestamp when the value was set or unset. |
| LABEL | STRING | A user-friendly name to be displayed in the UI, provided by the provider. |
| DESCRIPTION | STRING | The description of the configuration. |
| APPLICATION_ROLES | STRING | The comma-separated app role names that have access to the configuration.  This displays the most up-to-date names, even if roles have been renamed. If an application role has been dropped, it will not be included in the output list. |

## Usage notes

* The retention time for deleted app configurations is 365 days. Data older than 365 days will not be available in the view.

## Examples

Retrieve all listings in the current account:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_CONFIGURATIONS;
```

---
title: APPLICATION_DAILY_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_daily_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# APPLICATION_DAILY_USAGE_HISTORY view

Use this view to return the daily credit and storage usage for Snowflake Native Apps in an account within the last 365 days
(1 year).

## Columns

The following table provides definitions for the APPLICATION_DAILY_USAGE_HISTORY view columns.

| Field | Data type | Description |
| --- | --- | --- |
| APPLICATION_NAME | VARCHAR | The application name. |
| APPLICATION_ID | NUMBER | An internal, system-generated identifier for the application. |
| LISTING_GLOBAL_NAME | VARCHAR | The listing global name that appears in Snowflake Marketplace or in the data exchange hosting the application. |
| USAGE_DATE | DATE | The date the Snowflake Native App usage occurred. |
| CREDITS_USED | NUMBER | The number of credits consumed by the Snowflake Native App in a day. |
| CREDITS_USED_BREAKDOWN | ARRAY | An array of data objects that identify the Snowflake service that consumed daily credits. See CREDITS_USED_BREAKDOWN array for formatting. |
| STORAGE_BYTES | NUMBER | The daily average of storage bytes used by the Snowflake Native App. |
| STORAGE_BYTES_BREAKDOWN | ARRAY | An array of data objects that identify the type and number of storage bytes used. See STORAGE_BYTES_BREAKDOWN array for formatting. |

## Usage notes

* The maximum latency for this view is one day.
* Usage is attributed to the start day when usage events span multiple days.
* The APPLICATION_DAILY_USAGE_HISTORY view and the Snowsight cost management tools can return different daily credit and storage usage values. This discrepancy is caused by the methods used to determine daily credit and storage usage. To determine these values, the APPLICATION_DAILY_USAGE_HISTORY view uses the current session’s [TIMEZONE](../parameters.md) parameter and the Snowsight cost management tools use Coordinated Universal Time (UTC). To resolve any discrepancies, Snowflake recommends setting the TIMEZONE parameter to UTC.

### CREDITS_USED_BREAKDOWN array

The CREDITS_USED_BREAKDOWN array provides details about the services that consumed daily credits.

Example:

```sqljson
[
  {
    "credits": 0.005840921,
    "serviceType": "AUTO_CLUSTERING"
  },
  {
    "credits": 0.115940725,
    "serviceType": "SERVERLESS_TASK"
  },
  {
    "credits": 6.033448041,
    "serviceType": "SNOWPARK_CONTAINER_SERVICES"
  }
]
```

The following table provides descriptions for the key-value pairs in the objects in the array.

| Field | Data type | Description |
| --- | --- | --- |
| `credits` | DECIMAL | Number of credits consumed by the service type specified by `serviceType` on the usage date. |
| `serviceType` | VARCHAR | The service type, which can be one of the following values:   * `AUTO_CLUSTERING` — See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `DATA_QUALITY_MONITORING` — See [Introduction to data quality checks](../../user-guide/data-quality-intro.md). * `MATERIALIZED_VIEW` — See [Working with Materialized Views](../../user-guide/views-materialized.md). * `PIPE` — See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `SEARCH_OPTIMIZATION` — See [Search optimization service](../../user-guide/search-optimization-service.md). * `SERVERLESS_TASK` — See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPARK_CONTAINER_SERVICES` — See [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). * `WAREHOUSE_METERING` — See [Overview of warehouses](../../user-guide/warehouses-overview.md). |

The following are used in the determination of credit consumption:

* The credits used by objects in the Snowflake Native App. For example, auto-clustering on tables in the Snowflake Native App.
* The credits used by the warehouses owned by the Snowflake Native App.
* The credits used by the compute pools dedicated to the Snowflake Native App.

### STORAGE_BYTES_BREAKDOWN array

The STORAGE_BYTES_BREAKDOWN array provides details about the services that consumed storage.

Example:

```sqljson
[
  {
    "bytes": 34043221,
    "storageType": "DATABASE"
  },
  {
    "bytes": 109779541,
    "storageType": "FAILSAFE"
  }
]
```

The following table provides descriptions for the key-value pairs in the objects in the array.

| Field | Data type | Description |
| --- | --- | --- |
| `bytes` | INTEGER | Number of storage bytes used. |
| `storageType` | VARCHAR | The storage type, which can be one of the following values:   * `DATABASE`: Database storage. * `FAILSAFE`: [Fail-safe storage](../../user-guide/data-failsafe.md). * `HYBRID_TABLE`: Storage for [hybrid tables](../../user-guide/tables-hybrid.md). |

Only data stored in the Snowflake Native App is used to determine storage byte consumption. External databases created by the Snowflake Native App are not included in the determination of this value.

## Examples

Retrieve the daily credit and storage usage for a Snowflake Native App in an account and order the results by usage date:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.APPLICATION_DAILY_USAGE_HISTORY
  ORDER BY usage_date DESC;
```

---
title: APPLICATION_SPECIFICATION_STATUS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_specification_status_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# APPLICATION_SPECIFICATION_STATUS_HISTORY view

The `APPLICATION_SPECIFICATION_STATUS_HISTORY` view provides a history of the status changes for app specifications in your Snowflake account. Each row in the view represents a change in the status of an app specification, including when it was approved or declined or when the app creates a new request

The retention time for this view is 365 days, meaning that data older than 365 days will not be available in the view.

## Columns

The following table provides definitions for the `APPLICATION_SPECIFICATION_STATUS_HISTORY` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | An internal, system-generated identifier for the app specification. |
| NAME | TEXT | The name of the app specification. |
| APPLICATION_ID | NUMBER | An internal, system-generated identifier for the app that contains the app specification. |
| APPLICATION_NAME | TEXT | The name of the app that contains the app specification. |
| TYPE | TEXT | The type of app specification. Possible values are EXTERNAL_ACCESS, SECURITY_INTEGRATION, and LISTING. |
| SEQUENCE_NUMBER | NUMBER | The sequence number of the app specification. |
| USER_NAME | TEXT | The name of the user that updated the app specification status. This field is empty if the app specification is a new pending request created by the app. |
| STATUS | TEXT | The status of the app specification. Possible values are: APPROVED, PENDING, DECLINED. |
| STATUS_UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the app specification was last updated, including when it was created, approved, or declined. |
| LABEL | TEXT | A label containing the name of the app specification that is displayed to consumer in Snowsight. |
| DESCRIPTION | TEXT | A description on the app specification that is displayed to the consumer. |
| DEFINITION | TEXT | The fields that comprise the app specification definition. For more information, see [Overview of app specifications](../../developer-guide/native-apps/requesting-app-specs.md). |

---
title: APPLICATION_SPECIFICATIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/application_specifications.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# APPLICATION_SPECIFICATIONS view

The `APPLICATION_SPECIFICATIONS` displays a list of all app specifications created your Snowflake account.
Each row in the view represents an app specification. Deleted app specifications are also included in the view.
Because an app specification can be deleted and recreated with the same name. To differentiate between app specifications
with the same name, use the ID column.

The retention time for deleted app specifications is 365 days. Data older than 365 days will not be available in the view.

## Columns

The following table provides definitions for the `APPLICATION_SPECIFICATIONS` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | An internal, system-generated identifier for the app specification. |
| NAME | TEXT | The name of the app specification. |
| APPLICATION_ID | NUMBER | An internal, system-generated identifier for the app that contains the app specification. |
| APPLICATION_NAME | TEXT | The name of the app that contains the app specification. |
| CREATED | TIMESTAMP_LTZ | The timestamp when the app specification was created. |
| DELETED | TIMESTAMP_LTZ | The timestamp when the app specification was deleted. This value is NULL if the app specification has not been deleted. |
| TYPE | TEXT | The type of app specification. Possible values are EXTERNAL_ACCESS, SECURITY_INTEGRATION, and LISTING. |
| SEQUENCE_NUMBER | NUMBER | The sequence number of the app specification. |
| REQUESTED_ON | TIMESTAMP_LTZ | The timestamp when the app created the app specification. |
| STATUS | TEXT | The status of the app specification. Possible values are: APPROVED, PENDING, or DECLINED. |
| STATUS_UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the app specification was last updated, including when it was created, approved, or declined. |
| LABEL | TEXT | A label containing the name of the app specification that is displayed to consumer in Snowsight. |
| DESCRIPTION | TEXT | A description of the app specification. This description is displayed to the consumer. |
| DEFINITION | TEXT | The fields that comprise the app specification definition. For more information, |

---
title: ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/archive_storage_data_retrieval_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY view

This Account Usage view displays a history of archived data retrieval (in bytes) for your
account over the past 12 months (one year).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| OBJECT_TYPE | VARCHAR | The type of the retrieved object; for example, `TABLE`. |
| OBJECT_ID | NUMBER | Internal or system-generated identifier for the retrieved object. |
| OBJECT_NAME | VARCHAR | Name of the retrieved object. |
| SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the retrieved object. |
| SCHEMA_NAME | VARCHAR | Name of the schema for the retrieved object. |
| DATABASE_ID | NUMBER | Internal, Snowflake-generated identifier of the database for the retrieved object. |
| DATABASE_NAME | VARCHAR | Name of the database for the retrieved object. |
| BYTES | NUMBER | Bytes retrieved from archive storage. |
| ARCHIVE_STORAGE_TIER | VARCHAR | The archive storage tier from which Snowflake retrieved the object; for example, `COOL` or `COLD`. |

## Usage notes

* Latency for the view is up to 1 hour.
* The view contains historical data for the past 12 months (one year).
* For cost information related to data retrieval from archive storage,
  see [billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md).

---
title: AUTOMATIC_CLUSTERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/automatic_clustering_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# AUTOMATIC_CLUSTERING_HISTORY view

This Account Usage view can be used to query the [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) history. The information returned by the view includes the credits consumed, bytes updated, and rows updated each time a table is reclustered.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | NUMBER | Number of credits billed for automatic clustering during the START_TIME and END_TIME window. |
| NUM_BYTES_RECLUSTERED | NUMBER | Number of bytes reclustered during the START_TIME and END_TIME window. |
| NUM_ROWS_RECLUSTERED | NUMBER | Number of rows reclustered during the START_TIME and END_TIME window. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance that the object belongs to. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```
* A row might be clustered multiple times, depending on data skew, clustering key distribution, and reordering required for micro-partitions. A large table with poor initial clustering might need multiple passes to reach an optimally clustered state. Therefore, the NUM_ROWS_RECLUSTERED value for a table could be as high as the total number of rows in the table or even higher.

---
title: BACKUP_OPERATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/backup_operation_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BACKUP_OPERATION_HISTORY view

This Account Usage view provides information about the backup operations that were performed for
[backup sets](../../user-guide/backups.md).
Snowflake returns one row for each operation performed on backups within backup sets over the last year.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The timestamp at which the backup operation started. |
| END_TIME | TIMESTAMP_LTZ | The timestamp at which the backup operation ended. |
| BACKUP_SET_ID | NUMBER | The local backup set ID. |
| BACKUP_ID | VARCHAR | The unique identifier of backup being worked on. |
| OPERATION_TYPE | VARCHAR | Could be one of the below operations:   * CREATE * EXPIRE * RESTORE * ADD_LEGAL_HOLD * REMOVE_LEGAL_HOLD |
| Query_ID | VARCHAR | Internal system-generated identifier for the SQL statement. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* Snowflake retains the history data for 365 days (approximately one year).

---
title: BACKUP_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/backup_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BACKUP_POLICIES view

This Account Usage view provides information about [backup policies](../../user-guide/backups.md)
and their properties.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the backup policy. |
| NAME | VARCHAR | Name of the backup policy. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the backup policy. |
| SCHEMA_NAME | VARCHAR | Schema that the backup policy belongs to. |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the backup policy. |
| CATALOG_NAME | VARCHAR | Database that the backup policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for backup creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after backup creation when backup should be expired and automatically deleted. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if the policy has a retention lock; N otherwise.  Retention lock protects backups from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the backup policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the backup policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the backup policy was deleted. |
| COMMENT | VARCHAR | Comment for the backup policy. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: BACKUP_SETS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/backup_sets.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BACKUP_SETS view

This Account Usage view provides information about [backup sets](../../user-guide/backups.md) and their properties.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the backup set. |
| NAME | VARCHAR | Name of the backup set |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the backup set. |
| SCHEMA_NAME | VARCHAR | Schema that the backup set belongs to. |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the backup set. |
| CATALOG_NAME | VARCHAR | Database that the backup set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the backup set is backing up. |
| OBJECT_ID | NUMBER | ID of object that the backup set is backing up. |
| OBJECT_NAME | VARCHAR | Name of object that the backup set is backing up. |
| OBJECT_SCHEMA_ID | NUMBER | ID of schema that contains the object that is backed up by this backup set. |
| OBJECT_SCHEMA_NAME | VARCHAR | Name of schema that contains the object that is backed up by this backup set. |
| OBJECT_CATALOG_ID | NUMBER | ID of database that contains the object that is backed up by this backup set. |
| OBJECT_CATALOG_NAME | VARCHAR | Name of database that contains the object that is backed up by this backup set. |
| BACKUP_POLICY_ID | NUMBER | ID of backup policy attached to this backup set. |
| BACKUP_POLICY_NAME | VARCHAR | Name of backup policy attached to this backup set. |
| BACKUP_POLICY_SCHEMA_ID | NUMBER | ID of the schema that contains the backup policy. |
| BACKUP_POLICY_SCHEMA_NAME | VARCHAR | Name of the schema that contains the backup policy. |
| BACKUP_POLICY_CATALOG_ID | NUMBER | ID of the database that contains the backup policy. |
| BACKUP_POLICY_CATALOG_NAME | VARCHAR | Name of the database that contains the backup policy. |
| OWNER | VARCHAR | Name of the role that owns the backup set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup set. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the backup set was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the backup set was deleted. |
| COMMENT | VARCHAR | Comment for the backup set. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: BACKUP_STORAGE_USAGE view
source: https://docs.snowflake.com/en/sql-reference/account-usage/backup_storage_usage.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BACKUP_STORAGE_USAGE view

This Account Usage view provides information about storage usage for [backups](../../user-guide/backups.md).

> **Note:**
>
> The same tables might be included in multiple table backups, schema backups, and database backups.
> Therefore, the numbers of bytes shown in this view don’t entirely answer questions about how much storage
> you can save by deleting a backup or a backup set. The same data files might be retained as part of
> a different backup set.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| BACKUP_SET_ID | NUMBER | Internal system-generated identifier for the backup set. |
| BACKUP_ID | VARCHAR | Internal system-generated identifier for the backup. |
| LOGICAL_BYTES | NUMBER | Number of bytes created when this backup is restored. |
| INCREMENTAL_BYTES_FROM_PREVIOUS_BACKUP | NUMBER | Number of logical bytes of the micro-partitions that *are* in this backup, but *aren’t* in the previous backup within the same backup set.  For the oldest active backup in a backup set, this is 0. |
| DECREMENTAL_BYTES_FROM_PREVIOUS_BACKUP | NUMBER | Number of logical bytes of the micro-partitions that *aren’t* in this backup, but *are* in the previous backup within the same backup set.  For the oldest active backup in a backup set, this is 0. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: BACKUPS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/backups.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BACKUPS view

This Account Usage view provides information on [backups](../../user-guide/backups.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the backup.  Note: this is not the local ID, this is the globally unique UUID of the Backup. |
| BACKUP_SET_ID | NUMBER | ID of backup set that contains the backup. |
| BACKUP_SET_NAME | VARCHAR | Name of backup set that contains the backup. |
| BACKUP_SET_SCHEMA_ID | NUMBER | ID of schema that the backup set belongs to. |
| BACKUP_SET_SCHEMA | VARCHAR | Name of schema that the backup set belongs to. |
| BACKUP_SET_CATALOG_ID | NUMBER | ID of database that the backup set belongs to. |
| BACKUP_SET_CATALOG | VARCHAR | Name of database that the backup set belongs to. |
| CREATED | TIMESTAMP_LTZ | Timestamp at which backup was created. |
| DELETED | TIMESTAMP_LTZ | Timestamp at which the backup was deleted.  This column isn’t displayed by the SHOW command, because the SHOW command output doesn’t include deleted objects. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which backup will be expired and deleted. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | True if backup is under legal hold; False otherwise. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: BLOCK_STORAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/block_storage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BLOCK_STORAGE_HISTORY view

Use the BLOCK_STORAGE_HISTORY view in the ACCOUNT_USAGE schema to query the average daily block storage and snapshot usage for an account within the last 365 days.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | TIMESTAMP_LTZ | Date of this storage usage record. The date is based on the local time zone. |
| STORAGE_TYPE | VARCHAR | `BLOCK_STORAGE` or `SNAPSHOT`. |
| COMPUTE_POOL_NAME | VARCHAR | Name of the compute pool associated with this storage usage. For the `SNAPSHOT` storage type, this field is NULL. |
| BYTES | NUMBER | Average number of bytes used on the given date. |
| ADDITIONAL_IOPS | NUMBER | Average number of additional IOPS used on the given date. |
| ADDITIONAL_THROUGHPUT | NUMBER | Average amount of additional throughput (MiB per second) used on the given date. |

## Usage notes

* Latency for the view can be up to 180 minutes (3 hours).
* The view provides daily block storage and snapshot usage within the last 365 days (1 year) for an account.
* Snapshots are not associated with compute pools; therefore, for snapshots the view has the NULL value in the COMPUTE_POOL_NAME column.
* The BYTES column shows average usage of block storage volumes for a specific day, for a specific storage type, and for a specific compute pool (where appropriate) in the Snowflake account. For example, consider the following:

  + You use a 10 GiB block volume for 6 hours on 2024-02-01 for compute pool POOL_1. Using 10 GiB for 6 hours is equivalent to 2.5 GiB per day
    (10 GiB \* 6/24 hours = 2.5 GiB per day = 2,684,354,560 bytes per day).
  + You use a 10 GiB block volume for 12 hours on 2024-02-01 for another compute pool POOL_2. Using 10 GiB for 12 hours is equivalent to 5 GiB
    per day (10 GiB \* 12/24 hours = 5 GiB per day = 5,368,709,120 bytes per day).
  + You use a 20 GiB snapshot for 24 hours on 2024-02-01. Using 20 GiB for 24 hours is equivalent to 20 GiB per day = 21,474,836,480 bytes per day.

  Suppose that you query the BLOCK_STORAGE_HISTORY view:

  ```sqlexample
  SELECT * FROM snowflake.account_usage.BLOCK_STORAGE_HISTORY
  ```

  The query returns the following results:

  ```output
  +-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
  | USAGE_DATE                    | STORAGE_TYPE       | COMPUTE_POOL_NAME       |       BYTES    |       ADDITIONAL_IOPS |       ADDITIONAL_THROUGHPUT |
  |-------------------------------+--------------------+-------------------------+----------------|-----------------------|-----------------------------|
  | 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_1                  | 2,684,354,560  | 250.000000000         | 25.000000000                |
  | 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_2                  | 5,368,709,120  | 0.50000000            | 0.500000000                 |
  | 2025-02-01 00:00:00.000 -0700 | SNAPSHOT           | NULL                    | 21,474,836,480 | 0.000000000           | 0.000000000                 |
  +-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
  ```
* The additional IOPS (ADDITIONAL_IOPS) and throughput (ADDITIONAL_THROUGHPUT) values show the amount that your [configured values](../../developer-guide/snowpark-container-services/block-storage-volume.md) exceed their default values. For example, on AWS, the block configuration default IOPS is 3,000, and the default throughput is 125 MiB/second. If you configure an AWS block device with 4,000 IOPS and 225 MiB/second throughput, the additional IOPS would be 1,000 (4,000 - 3,000), and the additional throughput would be 100 MiB/second (225 - 125).

  The following three examples illustrate how you can get this information from the BLOCK_STORAGE_HISTORY view. Suppose that your account is set up with the following:

  + Your account provisioned a 10 GiB block volume (as part of a service) with 1000 additional IOPS and 100 MiB/second additional throughput for 6 hours on 2025-02-01 for compute pool `pool_1`. If you query the view, you can get the following information from the `additional_iops` and `additional_throughput` columns:

    - Using 10 GiB for 6 hours equals 2.5 GiB per day (10 GiB x 6/24 hours = 2.5 GiB = 2,684,354,560 bytes per day).
    - Using 1000 additional IOPS for 6 hours equals 250 IOPS per day (1000 IOPS \* 6/24 hours = 250 IOPS per day).
    - Using 100 additional MiB/second for 6 hours equals average 25 MiB/second per day (100 MiB \* 6/24 hours = 25 MiB per day).
  + Your account is provisioned a 10 GiB block volume (as part of a service) with 1 additional IOPS and 1 MiB/s additional throughput for 12 hours on 2025-02-01 for compute pool `POOL_2`.

    - Using 10 GiB for 12 hours equals 5 GiB per day (10 GiB \* 12/24 hours = 5 GiB per day = 5,368,709,120 bytes per day).
    - 1 additional IOPS used for 12 hours equals 0.5 IOPS per day (1 IOPS \* 12/24 hours = 0.5 IOPS per day).
    - 1 additional MiB/second throughput MiB/s used for 12 hours equals 0.5 MiB/second per day (1 MiB \* 12/24 hours = 0.5 MiB per day)
  + You use a 20 GiB snapshot for 24 hours on 2025-02-01. Using 20 GiB for 24 hours is equivalent to 20 GiB per day.

  When you query the view:

  ```sqlexample
  SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.BLOCK_STORAGE_HISTORY;
  ```

  The `bytes`, `additional_iops`, and `additional_throughput` columns in the query output provide this information:

  ```output
  +-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
  | USAGE_DATE                    | STORAGE_TYPE       | COMPUTE_POOL_NAME       |       BYTES    |       ADDITIONAL_IOPS |       ADDITIONAL_THROUGHPUT |
  |-------------------------------+--------------------+-------------------------+----------------|-----------------------|-----------------------------|
  | 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_1                  | 2,684,354,560  | 250.000000000         | 25.000000000                |
  | 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_2                  | 5,368,709,120  | 0.50000000            | 0.500000000                 |
  | 2025-02-01 00:00:00.000 -0700 | SNAPSHOT           | NULL                    | 21,474,836,480 | 0.000000000           | 0.000000000                 |
  +-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
  ```

  > **Note:**
  > + If you attach multiple block volumes to a compute pool, the view aggregates the usage and returns one row.
  > + If there are multiple snapshots present on a given day, the view aggregates the usage and returns one row.
  > + If you attach a single block volume to a compute pool and use it for three days, then the view returns three rows because the view reports daily usage for each compute pool having block volumes attached.

---
title: BLOCK_STORAGE_SNAPSHOTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/block_storage_snapshots.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# BLOCK_STORAGE_SNAPSHOTS view

This Account Usage view displays a row for each [block storage snapshot](../../developer-guide/snowpark-container-services/block-storage-volume.md) in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SNAPSHOT_ID | NUMBER | ID of the snapshot. |
| SNAPSHOT_NAME | VARCHAR | Name of the snapshot. |
| DATABASE_ID | VARCHAR | Internal, Snowflake-generated identifier of the database that the snapshot belongs to. |
| DATABASE_NAME | VARCHAR | Name of the database that the snapshot belongs to. |
| SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema that the snapshot belongs to. |
| SCHEMA_NAME | VARCHAR | Name of the schema that the snapshot belongs to. |
| SERVICE_ID | NUMBER | ID of the service for which the snapshot is created. |
| SERVICE_NAME | VARCHAR | Name of the service for which the snapshot is created. |
| VOLUME _NAME | VARCHAR | Volume from the specified service for which the snapshot is created. |
| INSTANCE | NUMBER | ID of the service instance for which the snapshot is created. |
| SIZE | NUMBER | Size in GB of the snapshot. |
| ENCRYPTION | VARCHAR | [Encryption type of the volume](../../developer-guide/snowpark-container-services/block-storage-volume.md) from which the snapshot was created. |
| COMMENT | VARCHAR | General comment about the snapshot. |
| OWNER | VARCHAR | Role that owns the snapshot. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the snapshot. |
| CREATED_ON | TIMESTAMP_LTZ | Creation time of the snapshot. |
| LAST_ALTERED_ON | TIMESTAMP_LTZ | Last altered time of the snapshot. |
| DELETED_ON | TIMESTAMP_LTZ | Deletion time of the snapshot. |

## Usage notes

* Latency for the view might be up to 180 minutes (3 hours).

## Example

```sqlexample
SELECT *
FROM SNOWFLAKE.ACCOUNT_USAGE.BLOCK_STORAGE_SNAPSHOTS;
```

---
title: CATALOG_LINKED_DATABASE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/catalog_linked_database_usage_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# CATALOG_LINKED_DATABASE_USAGE_HISTORY view

Use this Account Usage view to view the credit usage for your
[catalog-linked databases](../../user-guide/tables-iceberg-catalog-linked-database.md)
within the last 12 months.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the catalog-linked database operation took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the catalog-linked database operation took place. |
| DATABASE_ID | NUMBER | Internal identifier for the catalog-linked database that consumed credits. |
| DATABASE_NAME | VARCHAR | Name of the catalog-linked database that consumed credits. |
| CREDITS_USED_COMPUTE | NUMBER(38,9) | Number of credits used by the catalog-linked database for table creation operations between the START_TIME and END_TIME. The cost for this usage is described in Table 5 of the [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) on the Snowflake website. See the Snowflake-managed compute column for the Automated Refresh and Data Registration row. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER(38,9) | Number of credits used by the catalog-linked database for automatic table discovery, schema creation or deletion, and table deletion between the START_TIME and END_TIME. Usage for cloud services is charged only if the daily consumption of cloud services exceeds 10% of the daily usage of virtual warehouses. For more information, see [Understanding billing for cloud services usage](../../user-guide/cost-understanding-compute.md). |
| CREDITS_USED | NUMBER(38,9) | Number of credits billed for this catalog-linked database between the START_TIME and END_TIME. |
|  |  |  |

---
title: CLASS_INSTANCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/class_instances.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CLASS_INSTANCES view

This Account Usage view displays a row for each instance of a class defined in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the instance. |
| NAME | VARCHAR | Name of the instance. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the instance. |
| SCHEMA_NAME | VARCHAR | Name of the schema the instance belongs to. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the instance. |
| DATABASE_NAME | VARCHAR | Name of the database the instance belongs to. |
| CLASS_ID | NUMBER | Internal/system-generated identifier for the class the instance is instantiated from. |
| CLASS_NAME | VARCHAR | Name of the class the instance is instantiated from. |
| CLASS_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the class the instance is instantiated from. |
| CLASS_SCHEMA_NAME | VARCHAR | Name of the schema of the class the instance is instantiated from. |
| CLASS_DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the class the instance is instantiated from. |
| CLASS_DATABASE_NAME | VARCHAR | Name of the database of the class the instance is instantiated from. |
| OWNER_NAME | VARCHAR | Name of the role that owns the instance. |
| OWNER_ROLE_TYPE | VARCHAR | The internal/system-generated identifier of the role that owns the instance of the class. |
| CREATED | TIMESTAMP_LTZ | Date and time when the instance was created. |
| DELETED | TIMESTAMP_LTZ | Date and time when the instance was deleted. |
| COMMENT | VARCHAR | Comment for the instance. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* The view only displays the instances for which the current role for the session has been granted access privileges.

## Examples

The following example finds all instances of the [ANOMALY_DETECTION](../classes/anomaly_detection.md) class:

```sqlexample
SELECT NAME, DATABASE_NAME, SCHEMA_NAME, CLASS_NAME
  FROM SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES
  WHERE CLASS_NAME = 'ANOMALY_DETECTION';
```

The following example joins this view with [TABLES view](tables.md) on the INSTANCE_ID column to find the tables
that belong to each instance:

```sqlexample
SELECT a.TABLE_NAME,
       b.NAME AS instance_name,
       b.CLASS_NAME
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLES a
  JOIN SNOWFLAKE.ACCOUNT_USAGE.CLASS_INSTANCES b
  ON a.INSTANCE_ID = b.ID
  WHERE b.DELETED IS NULL;
```

---
title: CLASSES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/classes.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CLASSES view

This Account Usage view displays a row for each [class](../snowflake-db-classes.md)
in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the class. |
| NAME | VARCHAR | Name of the class. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the class. |
| SCHEMA_NAME | VARCHAR | Name of the schema the class belongs to. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the class. |
| DATABASE_NAME | VARCHAR | Name of the database the class belongs to. |
| OWNER_NAME | VARCHAR | Name of the role that owns the class. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the class was created. |
| DELETED | TIMESTAMP_LTZ | Date and time when the class was deleted. |
| COMMENT | VARCHAR | Comment for the class. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

## Examples

The following example finds all classes in the account:

```sqlexample
SELECT name, database_name, schema_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.CLASSES;
```

---
title: COLUMN_QUERY_PRUNING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/column_query_pruning_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# COLUMN_QUERY_PRUNING_HISTORY view

Use this Account Usage view to gain a better understanding of data access patterns during
query execution, including some column-level details, such as the “access type” and candidate
[search optimization expressions](../../user-guide/search-optimization-service.md) that are potentially beneficial.

You can use this view in combination with the [TABLE_QUERY_PRUNING_HISTORY view](table_query_pruning_history.md). For example,
you can identify access to target tables by using the TABLE_QUERY_PRUNING_HISTORY view, then
identify frequently used columns on those tables by using the COLUMN_QUERY_PRUNING_HISTORY view.

Each row in this view represents the query pruning history for a specific column within a given time interval. The data is
aggregated per column, per table, per interval, and includes metrics such as the number of queries executed, partitions scanned,
partitions pruned, rows scanned, rows pruned, and rows matched.

See also [TABLE_PRUNING_HISTORY view](table_pruning_history.md) and [Query Pruning](../../user-guide/tables-clustering-micropartitions.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| INTERVAL_START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the queries were executed and completed. |
| INTERVAL_END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the queries were executed and completed. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that was queried. |
| TABLE_NAME | VARCHAR | Name of the table that was queried. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table that was queried. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table that was queried. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table that was queried. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table that was queried. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that was used to run the queries. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that ran the queries. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| COLUMN_ID | NUMBER | Internal/system-generated identifier for the column accessed from the table that was queried. |
| COLUMN_NAME | VARCHAR | Name of the column accessed from the table that was queried. |
| VARIANT_PATH | VARCHAR | Path to the semi-structured data being accessed (if applicable). NULL if the column accessed does not have a semi-structured data type. |
| ACCESS_TYPE | VARCHAR | Type of access performed on the column (`WHERE` or `JOIN` condition). |
| NUM_QUERIES | NUMBER | Number of queries executed in this time range with this specific QUERY_HASH value, using this warehouse, accessing this column (and variant path if applicable) on this table with this type of access. |
| AGGREGATE_QUERY_ELAPSED_TIME | NUMBER | Total elapsed time (in milliseconds) for queries defined by NUM_QUERIES. This total includes queueing and other time not associated with compilation and execution. |
| AGGREGATE_QUERY_COMPILATION_TIME | NUMBER | Total compilation time (in milliseconds) for queries defined by NUM_QUERIES. |
| AGGREGATE_QUERY_EXECUTION_TIME | NUMBER | Total execution time (in milliseconds) for queries defined by NUM_QUERIES. |
| PARTITIONS_SCANNED | NUMBER | Number of partitions scanned on this table for queries defined by NUM_QUERIES. |
| PARTITIONS_PRUNED | NUMBER | Number of partitions pruned on this table for queries defined by NUM_QUERIES. These partitions were eliminated during query processing and not scanned, improving the efficiency of the query. |
| ROWS_SCANNED | NUMBER | Number of rows scanned on this table for queries defined by NUM_QUERIES. |
| ROWS_PRUNED | NUMBER | Number of rows pruned on this table for queries defined by NUM_QUERIES. These rows were eliminated during query processing and not scanned, improving the efficiency of the query. |
| ROWS_MATCHED | NUMBER | Number of rows that matched the WHERE clause filters while scanning this table for the queries defined by NUM_QUERIES. |
| SEARCH_OPTIMIZATION_SUPPORTED_EXPRESSIONS | ARRAY | List of supported search optimization expressions on this column that could potentially speed up scanning this table for the queries defined by NUM_QUERIES. |

## Usage notes

* Latency for the view may be up to 4 hours.
* Data is retained for 1 year.
* This view does not include pruning information for [hybrid tables](../../user-guide/tables-hybrid.md).
* Users and roles that have been granted the USAGE_VIEWER database role can access this view. For more information, see
  [SNOWFLAKE database roles](../snowflake-db-roles.md).
* The ACCESS_TYPE column contains one of the following values:

  + `WHERE`: The column is used in a filter condition in the [WHERE](../constructs/where.md) clause.
  + `JOIN`: The column is used in a condition for a [JOIN](../constructs/join.md) operation.
* The access behavior shown in this view reflects the actual query plan that was executed, which might be different from the original query text. For example, if a HAVING clause does not reference aggregated results produced by the GROUP BY clause, it might be optimized and rewritten as a WHERE clause, and the ACCESS_TYPE value will be `WHERE`.
* For complex filtering conditions that can’t benefit from a pushdown optimization, rows might not be filtered out during the table scan operation, even if they do not match the filtering condition. Therefore, these rows are counted in the ROWS_MATCHED value.
* Currently, the SEARCH_OPTIMIZATION_SUPPORTED_EXPRESSIONS column only suggests the EQUALITY and SUBSTRING [search methods](../sql/alter-table.md).
* This view retains data for the 1,000 longest-running table scans per query. Only extremely complex queries
  exceed this number of scans so data is rarely omitted.

## Example

For a given day, return column-level pruning history for queries against a specific table:

```sqlexample
SELECT interval_start_time, table_name, column_name, access_type, num_queries,
    rows_scanned, rows_pruned, rows_matched,
    search_optimization_supported_expressions::VARCHAR as search_optim
  FROM SNOWFLAKE.ACCOUNT_USAGE.COLUMN_QUERY_PRUNING_HISTORY
  WHERE interval_start_time like '2025-04-24%' AND table_name='SENSOR_DATA_TS'
  ORDER BY 3, 1;
```

```output
+-------------------------------+----------------+-------------+-------------+-------------+--------------+-------------+--------------+-----------------------------+
| INTERVAL_START_TIME           | TABLE_NAME     | COLUMN_NAME | ACCESS_TYPE | NUM_QUERIES | ROWS_SCANNED | ROWS_PRUNED | ROWS_MATCHED | SEARCH_OPTIM                |
|-------------------------------+----------------+-------------+-------------+-------------+--------------+-------------+--------------+-----------------------------|
| 2025-04-24 14:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |            5 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 14:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |            5 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 15:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 15:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 15:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |            5 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 15:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 19:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 19:00:00.000 -0700 | SENSOR_DATA_TS | DEVICE_ID   | WHERE       |           1 |      2678400 |     2678400 |      2678400 | ["EQUALITY(\"DEVICE_ID\")"] |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      3262387 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      2678400 |     2678400 |       394106 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      1227686 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      2678400 |     2678400 |       216642 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      2678400 |     2678400 |       216642 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      1227686 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |       820272 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      3262387 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      3262387 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      5356800 |           0 |      1227686 | NULL                        |
| 2025-04-24 17:00:00.000 -0700 | SENSOR_DATA_TS | TEMPERATURE | WHERE       |           1 |      2678400 |     2678400 |       216642 | NULL                        |
+-------------------------------+----------------+-------------+-------------+-------------+--------------+-------------+--------------+-----------------------------+
```

The `sensor_data_ts` table in this query contains 5356800 rows of synthetic time-series data. Exactly half of the rows in the table (2678400) were
pruned for a number of queries that filtered the `device_id` and `temperature` columns in WHERE clause conditions.

The `device_id` column is suggested as a target for a search optimization that uses the EQUALITY search method. Table scans might benefit from the addition of this
search optimization.

> **Tip:**
>
> You can use the [ARRAY_TO_STRING](../functions/array_to_string.md) function to convert the SEARCH_OPTIMIZATION_SUPPORTED_EXPRESSIONS column to a string for easier
> readability. For example:
>
> ```sqlexample
> ARRAY_TO_STRING(search_optimization_supported_expressions, ', ')
> ```

---
title: COLUMNS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/columns.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# COLUMNS view

This Account Usage view displays a row for each column in the tables defined in the account.

See also:
:   [DATABASES view](databases.md)

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| COLUMN_ID | NUMBER | Internal/system-generated identifier for the column. |
| COLUMN_NAME | TEXT | Name of the column. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table or view for the column. |
| TABLE_NAME | TEXT | Table or view that the column belongs to. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the table or view for the column. |
| TABLE_SCHEMA | TEXT | Schema that the table or view belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table or view for the column. |
| TABLE_CATALOG | TEXT | Database that the table or view belongs to. |
| ORDINAL_POSITION | NUMBER | Ordinal position of the column in the table/view. |
| COLUMN_DEFAULT | TEXT | Default value of the column. |
| IS_NULLABLE | TEXT | Whether the column allows NULL values. |
| DATA_TYPE | TEXT | Data type of the column.  This column shows the standard Snowflake data type of the column. The DATA_TYPE_ALIAS column displays the original data type name that was specified for the column when the table was created, or when the column was altered. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string columns. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string columns. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric columns. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric columns. |
| NUMERIC_SCALE | NUMBER | Scale of numeric columns. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | TEXT | Not applicable for Snowflake. |
| INTERVAL_PRECISION | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | TEXT | Not applicable for Snowflake. |
| COLLATION_CATALOG | TEXT | Not applicable for Snowflake. |
| COLLATION_SCHEMA | TEXT | Not applicable for Snowflake. |
| COLLATION_NAME | TEXT | Not applicable for Snowflake. |
| DOMAIN_CATALOG | TEXT | Not applicable for Snowflake. |
| DOMAIN_SCHEMA | TEXT | Not applicable for Snowflake. |
| DOMAIN_NAME | TEXT | Not applicable for Snowflake. |
| UDT_CATALOG | TEXT | Not applicable for Snowflake. |
| UDT_SCHEMA | TEXT | Not applicable for Snowflake. |
| UDT_NAME | TEXT | Not applicable for Snowflake. |
| SCOPE_CATALOG | TEXT | Not applicable for Snowflake. |
| SCOPE_SCHEMA | TEXT | Not applicable for Snowflake. |
| SCOPE_NAME | TEXT | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | TEXT | Not applicable for Snowflake. |
| DTD_IDENTIFIER | TEXT | Not applicable for Snowflake. |
| IS_SELF_REFERENCING | TEXT | Not applicable for Snowflake. |
| IS_IDENTITY | TEXT | Whether the column is an identity column. |
| IDENTITY_GENERATION | TEXT | Whether an identity column’s value is always generated or only generated by default. Snowflake only supports `BY DEFAULT`. |
| IDENTITY_START | TEXT | Not applicable for Snowflake. |
| IDENTITY_INCREMENT | TEXT | Not applicable for Snowflake. |
| IDENTITY_MAXIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_MINIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_CYCLE | TEXT | Whether the value of an identity column allows cycling. Snowflake only supports `NO CYCLE`. |
| IDENTITY_ORDERED | TEXT | If `YES`, the column is an identity column and has the ORDER property. If `NO`, the column is an identity column and has the NOORDER property. |
| SCHEMA_EVOLUTION_RECORD | TEXT | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY, SNOWPIPE, or SNOWPIPE_STREAMING). * FileName: The file name that triggered the evolution (NULL for SNOWPIPE_STREAMING). * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeId: A unique identifier of the triggering query or pipe (QUERY ID for COPY, PIPE ID for SNOWPIPE, or NULL for SNOWPIPE_STREAMING). * Pipe name: Fully qualified pipe name that triggered schema evolution (SNOWPIPE_STREAMING only). * Channel name: Channel that triggered schema evolution (SNOWPIPE_STREAMING only). * offsetTokenUpperBound: An offset at or before which schema evolution was triggered (SNOWPIPE_STREAMING only). |
| COMMENT | TEXT | Comment for the column. |
| DELETED | TIMESTAMP_LTZ | Date and time when the column was deleted. |
| DATA_TYPE_ALIAS | TEXT | The data type alias or synonym specified for the column when the table was created or when the column was last altered.  For example, the BIGINT type is synonymous with the NUMBER type. If BIGINT was specified as the type for a column, then BIGINT is displayed in this DATA_TYPE_ALIAS column.  For columns in tables that were created before the [2025_07 behavior change bundle](../../release-notes/bcr-bundles/2025_07_bundle.md) was enabled, and not altered after the behavior change, the value in this column is NULL. For more information, see [COLUMNS view (multiple schemas): New column](../../release-notes/bcr-bundles/2025_07/bcr-2061.md). |

## Usage notes

* Latency for the view may be up to 90 minutes.

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.

## Examples

The following example retrieves all columns in the `myTable` table defined in the `mydb` database:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.COLUMNS
  WHERE
    table_catalog = 'mydb' AND
    table_name = 'myTable' AND
    deleted IS NULL;
```

---
title: COMPLETE_TASK_GRAPHS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/complete_task_graphs.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# COMPLETE_TASK_GRAPHS view

You can use the Account Usage view to query the status of completed *graph* runs, such as runs that executed successfully, failed, or were
cancelled. A graph is currently defined as a single scheduled task or a [task graph](../../user-guide/tasks-graphs.md) composed of a scheduled
root task and one or more child tasks. For the purposes of this function, *root task* refers to either the single scheduled task or the
root task in a task graph.

The view avoids the 10,000 row limitation of the [COMPLETE_TASK_GRAPHS](../functions/complete_task_graphs.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_NAME | TEXT | Name of the root task. |
| DATABASE_NAME | TEXT | Name of the database that contains the graph. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the graph. |
| STATE | TEXT | State of the graph run:   * `SUCCEEDED`: All tasks in the graph ran successfully to completion, or the root task run succeeded and one or more child task runs were skipped. * `FAILED`: One or more task runs in the graph failed, or the root task run succeeded and one or more child task runs failed. * `CANCELLED`: One or more task runs in the graph were cancelled, or the root task run succeeded and one or more child task runs were cancelled.   Note that if the state of the root task run is SKIPPED, the function does not return a row for the run. |
| SCHEDULED_FROM | TEXT | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| FIRST_ERROR_TASK_NAME | TEXT | Name of the first task in the graph that returned an error; returns NULL if no task produced an error. |
| FIRST_ERROR_CODE | NUMBER | Error code of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| FIRST_ERROR_MESSAGE | TEXT | Error message of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the root task was scheduled to start running. Note that we make a best effort to ensure absolute precision, but only guarantee that tasks do not execute *before* the scheduled time. |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the root task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| NEXT_SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the standalone or root task (in a [DAG](../../user-guide/tasks-graphs.md) of tasks) is next scheduled to start running, assuming the current run of the standalone task or [DAG](../../user-guide/tasks-graphs.md) started at the SCHEDULED_TIME time completes in time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the last task in the [DAG](../../user-guide/tasks-graphs.md) was completed. |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a [DAG](../../user-guide/tasks-graphs.md). This ID matches the ID column value in the SHOW TASKS output for the same task. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the [DAG](../../user-guide/tasks-graphs.md) that was run, or is scheduled to be run. |
| RUN_ID | NUMBER | Time when the standalone or root task in a [DAG](../../user-guide/tasks-graphs.md) is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system may reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run prior to retry. You may use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| CONFIG | TEXT | Displays the graph level configuration used during the graph run if explicitly set. Otherwise displays NULL. |
| GRAPH_RUN_GROUP_ID | TEXT | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Usage notes

* Latency for the view may be up to 45 minutes.

* The view only displays objects for which the current role for the session has been granted access privileges.

## Examples

Retrieve records for the 10 most recent task graph runs completed in your account:

```sqlexample
select root_task_name, state from snowflake.account_usage.complete_task_graphs
  limit 10;
```

---
title: COMPUTE_POOLS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/compute_pools.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# COMPUTE_POOLS view

Use this view to get a historical view of compute pools (creation, deletion) in your account for the last 365 days.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Compute pool name. |
| IS_SUSPENDED | BOOLEAN | Whether the pool is currently suspended. |
| MIN_NODES | NUMBER | Minimum number of nodes in the compute pool. |
| MAX_NODES | NUMBER | Maximum number of nodes in the compute pool. |
| INSTANCE_FAMILY | VARCHAR | Machine type of nodes in the compute pool. |
| AUTO_SUSPEND_SECS | NUMBER | Number of seconds of inactivity after which the compute pool is automatically suspended. |
| AUTO_RESUME | BOOLEAN | Whether the compute pool is automatically resumed when Snowflake attempts to start a service or job. |
| CREATED | TIMESTAMP | Date and time when the compute pool was created. |
| LAST_RESUMED | TIMESTAMP | Date and time when the suspended compute pool was last resumed. |
| LAST_ALTERED | TIMESTAMP | Date and time when the compute pool was last updated. |
| DELETED | TIMESTAMP | Date and time when the compute pool was deleted. |
| OWNER | VARCHAR | Role name that owns the compute pool. |
| OWNER_ROLE_TYPE | VARCHAR | Type of the role that owns the compute pool. |
| IS_EXCLUSIVE | BOOLEAN | Whether the compute pool was created for an application. |
| APPLICATION_NAME | VARCHAR | Application name for which the compute pool was created. Null if the compute pool was not created for an application or if the application no longer exists. |
| APPLICATION_ID | VARCHAR | Application ID for which the compute pool was created. Null if the compute pool was not created for an application. |
| COMMENT | VARCHAR | A comment. |

## Usage notes

* Latency for the view can be up to 180 minutes (3 hours).

---
title: CONTACT_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/contact_references.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CONTACT_REFERENCES view

This Account Usage view can be used to identify the associations between [contacts](../../user-guide/contacts-using.md) and the objects to
which they have been added.

Contact lineage is not included in this view. For example, if a contact is associated with a schema, the view does not have records for
associations between the contact and all the tables in the schema even though the tables inherit the association from the schema.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| CONTACT_DATABASE | VARCHAR | Name of the database in which the contact exists. |
| CONTACT_SCHEMA | VARCHAR | Name of schema in which the contact exists. |
| CONTACT_ID | NUMBER | Internal/system-generated identifier for the contact. |
| CONTACT_NAME | VARCHAR | Name of a contact. |
| CONTACT_PURPOSE | VARCHAR | Purpose that was specified when the contact was associated with the object. |
| OBJECT_DATABASE | VARCHAR | Name of the database that contains the referenced object. If the object is not a database or schema object, the value is empty. |
| OBJECT_SCHEMA | VARCHAR | Name of the schema that contains the referenced object. If the referenced object is not a schema object (for example, a warehouse), the value is empty. |
| OBJECT_ID | NUMBER | Internal/system-generated identifier of the referenced object. |
| OBJECT_NAME | VARCHAR | Name of the referenced object. |
| OBJECT_DELETED | TIMESTAMP_LTZ | Date and time when the referenced object was dropped or when its parent object was dropped. |
| OBJECT_DOMAIN | VARCHAR | Type of the referenced object. |

## Usage notes

* Latency for the view is 2 hours.

---
title: CONTACTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/contacts.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CONTACTS view

This Account Usage view displays a row for each [contact](../../user-guide/contacts-using.md) in the account.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| CONTACT_ID | NUMBER | Internal/system-generated identifier of the contact. |
| CONTACT_NAME | VARCHAR | Name of the contact. |
| CONTACT_SCHEMA_ID | NUMBER | Internal/system-generated identifier of the schema in which the contact exists. |
| CONTACT_SCHEMA | VARCHAR | Name of the schema in which the contact exists. |
| CONTACT_DATABASE_ID | NUMBER | Internal/system-generated identifier of the database in which the contact exists. |
| CONTACT_DATABASE | VARCHAR | Name of the database in which the contact exists. |
| CONTACT_OWNER | VARCHAR | Name of the role that owns the contact. |
| CONTACT_OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the object. Either ROLE, DATABASE_ROLE, or APPLICATION (if a Snowflake Native App owns the object).  Deleted contacts have a NULL value. |
| CONTACT_USERS | ARRAY | Array of Snowflake users to contact. |
| CONTACT_EMAIL_DISTRIBUTION_LIST | VARCHAR | Email address used to communicate with the contact. |
| CONTACT_URL | VARCHAR | URL used to communicate with the contact. |
| COMMENT | VARCHAR | Comments for the contact, if any. |
| CREATED | TIMESTAMP_LTZ | Date and time when the contact was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the contact was dropped or the date and time when its parent was dropped. |

## Usage notes

* Latency for the view is 2 hours.

---
title: COPY_FILES_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/copy_files_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# COPY_FILES_HISTORY view

This Account Usage view includes information about compute credit usage, number of bytes copied, and number of files copied for the
following operations:

* Using [COPY FILES](../sql/copy-files.md) to copy files from a source stage to an output stage.
* Cloning named internal stages.

See also:
:   [COPY FILES](../sql/copy-files.md) , [CREATE <object> … CLONE](../sql/create-clone.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| DATABASE_ID | NUMBER | ID of the database from which the files are copied. You can map this to the ENTITY_ID in the [METERING_HISTORY view](metering_history.md) view. |
| DATABASE_NAME | VARCHAR | Name of the database from which the staged files are copied. |
| SUB_SERVICE_TYPE | VARCHAR | Type of service that is copying files, which can be one of the following:   * `COPY STAGE FILES`: See [COPY FILES](../sql/copy-files.md). * `SCHEMA CLONE`: See [CREATE <object> … CLONE](../sql/create-clone.md). * `DATABASE CLONE`: See [CREATE <object> … CLONE](../sql/create-clone.md). |
| JOB_ROOT_ENTITY_ID | NUMBER | Entity ID for the root job; varies by SUB_SERVICE_TYPE. For COPY STAGE FILES, indicates the ID of the stage from which files are copied. For SCHEMA CLONE, indicates the schema ID. For DATABASE CLONE, indicates the database ID. |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the copy operation took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the copy operation took place. |
| CREDITS_USED | NUMBER | Number of compute credits used by warehouses and serverless compute resources between the START_TIME and END_TIME. |
| BYTES_COPIED | NUMBER | Number of bytes copied from the root entity (stage, schema, or database) between the START_TIME and END_TIME. |
| FILES_COPIED | NUMBER | Number of files copied from the root entity (stage, schema, or database) between the START_TIME and END_TIME. |

---
title: COPY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/copy_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# COPY_HISTORY view

This Account Usage view can be used to query Snowflake data loading history for the last 365 days (1 year). The view displays load activity
for both [COPY INTO <table>](../sql/copy-into-table.md) statements and continuous data loading using
[Snowpipe](../../user-guide/data-load-snowpipe-intro.md). The view avoids the 10,000 row limitation of
the [LOAD_HISTORY view](../info-schema/load_history.md).

You can also view data loading details in Snowsight. See [Monitor data loading activity by using Copy History](../../user-guide/data-load-monitor.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_NAME | VARCHAR | Name of the source file and relative path to the file. |
| STAGE_LOCATION | VARCHAR | Name of the stage where the source file is located. |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Date and time of when the file finished loading. |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file. |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file; `NULL` if STATUS is `Load in progress`. |
| FILE_SIZE | NUMBER | Observed size of the source file in the internal or external stage before it loads. If the file is compressed, this shows the compressed size. If the file is uncompressed, this shows the uncompressed size. |
| FIRST_ERROR_MESSAGE | VARCHAR | First error of the source file. |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error. |
| FIRST_ERROR_CHARACTER_POS | NUMBER | Position of the first error character. |
| FIRST_ERROR_COLUMN_NAME | VARCHAR | Column name of the first error. |
| ERROR_COUNT | NUMBER | Number of error rows in the source file. |
| ERROR_LIMIT | NUMBER | If the number of errors reaches this limit, then abort. |
| STATUS | VARCHAR | Status: `Loaded`, `Load failed`, `Partially loaded`, or `Load skipped`. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the target table. |
| TABLE_NAME | VARCHAR | Name of the target table.TABLE_NAME |
| TABLE_SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the table. |
| TABLE_SCHEMA_NAME | VARCHAR | Name of the schema in which the target table resides. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table. |
| TABLE_CATALOG_NAME | VARCHAR | Name of the database in which the target table resides. |
| PIPE_CATALOG_NAME | VARCHAR | Name of the database in which the pipe resides. |
| PIPE_SCHEMA_NAME | VARCHAR | Name of the schema in which the pipe resides. |
| PIPE_NAME | VARCHAR | Name of the pipe defining the load parameters; `NULL` for COPY statement loads. |
| PIPE_RECEIVED_TIME | TIMESTAMP_LTZ | Date and time when the INSERT request for the file loaded through the pipe was received; `NULL` for COPY statement loads. |
| FIRST_COMMIT_TIME | TIMESTAMP_LTZ | Date and time when the first chunk of the file is committed. Snowpipe may load a file in multiple chunks that are separately committed. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Usage notes

* In most cases, latency for the view may be up to 120 minutes (2 hours). The latency for a given table’s copy history may be up to 2 days
  if both of the following conditions are true:

  + Fewer than 32 DML statements have been added to the given table since it was last updated in COPY_HISTORY.
  + Fewer than 100 rows have been added to the given table since it was last updated in COPY_HISTORY.

* The view only includes COPY INTO commands that executed to completion, with or without errors.
* Dropping or recreating a table object removes the load history metadata for bulk data load deduplication (COPY INTO *<table>* statements) into the table.
* Renaming a table object updates the corresponding TABLE_NAME entries in the copy history.
* Dropping or recreating a pipe object doesn’t remove the load history metadata for the pipe.
* The view only displays objects for which the current role for the session has been granted access privileges.
* After the replication of copy history, the COPY_HISTORY Account Usage view shows the history only after the latest truncate operation on the target table. This is different from the view without replication, which shows a complete copy history.

## Examples

Retrieve records for the 10 most recent COPY INTO commands executed:

```sqlexample
select file_name, error_count, status, last_load_time from snowflake.account_usage.copy_history
  order by last_load_time desc
  limit 10;
```

---
title: CORTEX_AGENT_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_agent_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_AGENT_USAGE_HISTORY view

The CORTEX_AGENT_USAGE_HISTORY view can be used to query the usage
history of Cortex Agents.

> **Note:**
>
> This view does not include requests originating from Snowflake Intelligence. Requests originating from Snowflake Intelligence are recorded in the [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY](snowflake_intelligence_usage_history_view.md) view.

The information in the view includes the number of credits consumed each time a user interacts
with Cortex Agents. A request results in one or more calls to underlying tools, for example, Cortex Analyst and Cortex Search. Each row in the view represents a call to the agent and provides detail about
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, and the agent ID.
For more information about Cortex billing, see [Cost considerations](../../user-guide/snowflake-cortex/cortex-agents.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start time when the Cortex Agent message request was received. |
| END_TIME | TIMESTAMP_LTZ | End time when the Cortex Agent message response was sent. |
| USER_ID | NUMBER | The unique identifier of the user who made the request. |
| USER_NAME | VARCHAR | The name of the user who made the request. |
| USER_TAGS | ARRAY | Tags associated with the user. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “ACCOUNT” or “USER”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| REQUEST_ID | VARCHAR | The unique identifier for the request. |
| PARENT_REQUEST_ID | VARCHAR | The identifier of the parent request, if applicable. |
| AGENT_DATABASE_ID | NUMBER | The unique identifier of the agent database. |
| AGENT_DATABASE_NAME | VARCHAR | The name of the agent database. |
| AGENT_SCHEMA_ID | NUMBER | The unique identifier of the agent schema. |
| AGENT_SCHEMA_NAME | VARCHAR | The name of the agent schema. |
| AGENT_ID | NUMBER | The unique identifier of the agent. |
| AGENT_NAME | VARCHAR | The name of the agent. |
| AGENT_TAGS | ARRAY | Tags associated with the Agent. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “DATABASE” or “CORTEX_AGENT”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| TOKEN_CREDITS | NUMBER | The number of token credits used for the request. Used for user-level budgeting. |
| TOKENS | NUMBER | Sum of the tokens used by the Cortex Agent. |
| TOKENS_GRANULAR | ARRAY | Granular breakdown of token usage by request, service type (cortex_agents, cortex_analyst), and model. Includes input, cache_read_input, cache_write_input, and output token counts per model. The “unknown” model name is used when a model is not present in the pricing data. Each object in the array contains the following value pairs:   * `request_id`: The unique identifier for the request. * `service_type`: The service type, such as “cortex_agents” or “cortex_analyst”. * `model`: The model name used for the request. * `input`: Number of input tokens. * `cache_read_input`: Number of cache read input tokens. * `cache_write_input`: Number of cache write input tokens. * `output`: Number of output tokens. * `start_time`: The start time of the request. |
| CREDITS_GRANULAR | ARRAY | Granular breakdown of credit usage by request, service type (cortex_agents, cortex_analyst), and model. Includes input, cache_read_input, cache_write_input, and output credit values per model. The “unknown” model name is used when a model is not present in the pricing data. Each object in the array contains the following value pairs:   * `request_id`: The unique identifier for the request. * `service_type`: The service type, such as “cortex_agents” or “cortex_analyst”. * `model`: The model name used for the request. * `input`: Credit value for input tokens. * `cache_read_input`: Credit value for cache read input tokens. * `cache_write_input`: Credit value for cache write input tokens. * `output`: Credit value for output tokens. * `start_time`: The start time of the request. |
| METADATA | OBJECT | Additional metadata, including:   * `role_id`: ID of the primary role used for the request. * `role_name`: Name of the primary role used for the request. * `interaction_interface`: The interface through which the Cortex Agent was accessed (for example, `agent_admin_ui`, `sql_function`, `microsoft_teams`, or `external`). Contains NULL if the interface is unknown or the record predates the introduction of this field. * `ai_functions_credits`: Credits consumed by [AI functions](../../user-guide/snowflake-cortex/aisql.md) invoked during the request. Contains NULL if no AI functions were used. |

## Examples

Retrieve Cortex Agent usage history:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AGENT_USAGE_HISTORY;
```

```output
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------+
| START_TIME                    | END_TIME                      | USER_ID | USER_NAME | USER_TAGS                                                                                                                                                                                                                                                | REQUEST_ID                           | PARENT_REQUEST_ID | AGENT_DATABASE_ID | AGENT_DATABASE_NAME | AGENT_SCHEMA_ID | AGENT_SCHEMA_NAME | AGENT_ID | AGENT_NAME | AGENT_TAGS                                                                                                                                                                                                                                                | TOKEN_CREDITS | TOKENS | TOKENS_GRANULAR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | CREDITS_GRANULAR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                 | METADATA                                                                                                          |
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------+
| 2026-02-06 10:11:51.642 +0000 | 2026-02-06 10:11:55.932 +0000 | 42563   | JKOWAL    | [{"level": "ACCOUNT", "tag_database": "SI", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "engineering"}, {"level": "USER", "tag_database": "FINANCE", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "engineering"}] | 5caf3de3-86b2-4896-b706-9f2d7629d337 | NULL              | 234               | finance             | 4231            | analytics         | 9234     | agent1     | [{"level": "DATABASE", "tag_database": "SI", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}, {"level": "CORTEX_AGENT", "tag_database": "FINANCE", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}] | 20.000000000  | 1900   | [{"5caf3de3-86b2-4896-b706-9f2d7629d337": {"cortex_agents": {"modelX": {"input": 100, "cache_read_input": 300, "cache_write_input": 400, "output": 200}}, "start_time": "2026-02-06 10:11:51.642 +0000"}}, {"a98b2946-4a7d-4028-9b19-1dab89fbf6c7": {"cortex_analyst": {"modelY": {"input": 100, "output": 200}, "modelZ": {"input": 100, "output": 200}}, "start_time": "2026-02-06 10:11:52.313 +0000"}}, {"996abb8b-678a-440d-9061-d186b6acc91b": {"cortex_analyst": {"unknown": {"input": 100, "output": 200}}, "start_time": "2026-02-06 10:11:53.112 +0000"}}] | [{"5caf3de3-86b2-4896-b706-9f2d7629d337": {"cortex_agents": {"modelX": {"input": 1, "cache_read_input": 2, "cache_write_input": 3, "output": 4}}, "start_time": "2026-02-06 10:11:51.642 +0000"}}, {"a98b2946-4a7d-4028-9b19-1dab89fbf6c7": {"cortex_analyst": {"modelY": {"input": 1, "output": 4}, "modelZ": {"input": 1, "output": 4}}, "start_time": "2026-02-06 10:11:52.313 +0000"}}, {"996abb8b-678a-440d-9061-d186b6acc91b": {"cortex_analyst": {"unknown": {"input": 0, "output": 0}}, "start_time": "2026-02-06 10:11:53.112 +0000"}}] | {"role_id": 12720, "role_name": "ENGINEER", "interaction_interface": "sql_function", "ai_functions_credits": 0.5} |
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------------------------------------------------------------------------------------------------------+
```

---
title: CORTEX_AI_FUNCTIONS_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_ai_functions_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_AI_FUNCTIONS_USAGE_HISTORY view

This Account Usage view can be used to query the usage history of [Cortex AI Functions](../../user-guide/snowflake-cortex/aisql.md).

The view includes the number of tokens and credits consumed each time a Cortex Function is called, aggregated in one
hour windows. The view also includes relevant metadata, such as the warehouse ID, start and end times of the function
execution, and the name of the function and the model, if specified. Each row represents the usage for a single function
call.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the usage aggregation window. The window resolution is 1 hour. For example, if a query began at 05:30 and completed at 08:30, four records appear in the usage view, one each for the 5:00, 6:00, 7:00, and 8:00 aggregation windows. |
| END_TIME | TIMESTAMP_LTZ | End of the usage aggregation window. |
| FUNCTION_NAME | VARCHAR | Name of the Cortex AI Function called. Usage history contains a row for each function called in a query. |
| MODEL_NAME | VARCHAR | Model name. Empty for Cortex AI Functions where a model is not specified as an argument. Usage history contains a row for each model used in a query. |
| QUERY_ID | VARCHAR | The ID of the query in which the function was called. |
| WAREHOUSE_ID | NUMBER | System-generated identifier for the warehouse used by the query calling the Cortex AI Function. |
| ROLE_NAMES | ARRAY | Roles associated with the query. The primary role is the first element of the array. |
| QUERY_TAG | VARCHAR | The tag, if any, associated with the query in which the function was called. |
| USER_ID | VARCHAR | System-generated identifier for the user that executed the query calling the Cortex AI Function. |
| METRICS | ARRAY | A breakdown of usage metrics for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. See Metrics column below for more details. |
| CREDITS | NUMBER | Number of credits billed for Cortex AI Function usage based on metrics for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. Does not include warehouse usage credits. |
| IS_COMPLETED | BOOLEAN | Whether the query was completed in this aggregation window. |

## Metrics column

The metrics column contains a breakdown of usage metrics for the specified function and model for the combination of
QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. Each element contains a `key` object (with `metric` type and `unit`
fields) and a `value`. The structure varies by metering method, as follows:

* **Token-based metering** (most AI Functions): Bills by token count, either as separate input and output token counts or as total token count, depending on the function.

  Example: `` [{"key":{"metric":"input","unit":"tokens"},"value":17},{"key":{"metric":"output","unit":"tokens"},"value":65}]`<br>`[{"key":{"metric":"total","unit":"tokens"},"value":527}] ``
* **Page-based metering** (AI_PARSE_DOCUMENT): Bills by page.

  Example: `[{"key":{"metric":"total","unit":"pages"},"value":3}]`

## Usage notes

* This view includes only usage that occurred on or after January 5, 2026.
* User ID attribution, Query tag, and Roles fields are available for data acquired after February 16, 2026.
* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* The view tracks both function calls that have completed and calls that are still in progress.
* Running queries are updated every 30 minutes (best effort) with a SLA of one hour.
* The credit rate usage is determined based on the function called, model used and the tokens processed as outlined in the
  [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_AISQL_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_aisql_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_AISQL_USAGE_HISTORY view

The CORTEX_AISQL_USAGE_HISTORY view can be used to query the usage history of [Cortex AI Functions](../../user-guide/snowflake-cortex/aisql.md).

The information in the view includes the number of credits consumed each time an AI function is called, aggregated in
one-hour increment, based on the time each query completed. The view also includes relevant metadata, such as the user
ID, query ID, function, and model. Each row in the view represents the usage for a specific combination of function, model, query, and
warehouse. For more information on Cortex billing, see [Cost considerations](../../user-guide/snowflake-cortex/aisql.md).

> **Important:**
>
> This view replaces both the [CORTEX_FUNCTIONS_USAGE_HISTORY](cortex_functions_usage_history.md) and
> [CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY](cortex_functions_query_usage_history.md) views.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_TIME | TIMESTAMP_LTZ | The date and the beginning of the hour (in the local time zone) in which this usage record was billed. Usage is not recorded until the query completes, so this timestamp represents the hour in which the query completed. For example, if a query begins at 05:30 and completes at 08:30, the record is aggregated in the 08:00-09:00 hour. |
| MODEL_NAME | TEXT | Name of the model used in the query. A query can use more than one model; in this case, usage history includes a row for each model. |
| FUNCTION_NAME | TEXT | The name of the Cortex AI Function called. A query can use more than one function; in this case, usage history includes a row for each function. |
| TOKEN_CREDITS | NUMBER | Number of credits billed for Cortex AI Function usage based on tokens processed for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. Does not include warehouse usage credits. |
| TOKENS | NUMBER | Number of tokens processed for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. |
| TOKEN_CREDITS_GRANULAR | OBJECT | A SQL object that provides a breakdown of credits billed by token type (input or output) for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. |
| TOKENS_GRANULAR | OBJECT | A SQL object that provides a breakdown of tokens processed by token type (input or output) for the specified function and model for the combination of QUERY_ID, MODEL_NAME, and WAREHOUSE_ID. |
| QUERY_ID | TEXT | The ID of the query in which the function was called. |
| QUERY_TAG | TEXT | The tag, if any, associated with the query in which the function was called. |
| USER_ID | TEXT | The internal ID of the user who invoked the function.  For more information about authenticating, see [Authenticating to the server](../../developer-guide/sql-api/authenticating.md). |
| WAREHOUSE_ID | TEXT | The ID of the virtual warehouse that processed the query in which the function was called. |

## Usage notes

* This view includes only usage that occurred on or after November 17, 2025.
* Billing is reported only after the query completes, and the timestamp is the hour in which the query completed.
* Credit usage is based on the number of tokes processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_ANALYST_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_analyst_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_ANALYST_USAGE_HISTORY view

The CORTEX_ANALYST_USAGE_HISTORY view can be used to query the usage history of [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md).

The information in the view includes the number of credits consumed each time Cortex Analyst is called, aggregated in one-hour increments.
The view also includes relevant metadata, such as the start and end times of the messages and the number of messages sent.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the Cortex Analyst message request was received. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the Cortex Analyst message response was sent. |
| REQUEST_COUNT | NUMBER | The number of messages sent to Cortex Analyst. |
| CREDITS | NUMBER | The number of credits billed for a set of messages sent to Cortex Analyst. |
| USERNAME | TEXT | The username of the user who sent the Cortex Analyst message request. The username is included with the session.  For more information about authenticating, see [Authenticating to the server](../../developer-guide/sql-api/authenticating.md). |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Credit rate usage is based on the number of messages processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_CODE_CLI_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_code_cli_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_CODE_CLI_USAGE_HISTORY view

The CORTEX_CODE_CLI_USAGE_HISTORY view can be used to query the usage history of [Cortex Code CLI](../../user-guide/cortex-code/cortex-code-cli.md).

The information in the view includes the number of credits consumed each time a user interacts
with Cortex Code CLI. Each row in the view represents a single request and provides detail about
the aggregated tokens and credits as well as a granular breakdown by model. The view also includes
relevant metadata, such as the user ID and request ID.

> **Note:**
>
> This view does not include requests originating from Cortex Code in Snowsight. Requests originating from Cortex Code in Snowsight are recorded in the [CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY](cortex_code_snowsight_usage_history.md) view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USER_ID | NUMBER | The unique identifier of the user who made the request. |
| USER_TAGS | ARRAY | Tags associated with the user. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “ACCOUNT” or “USER”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| REQUEST_ID | VARCHAR | The unique identifier for the request. |
| PARENT_REQUEST_ID | VARCHAR | The identifier of the parent request, if applicable. |
| USAGE_TIME | TIMESTAMP_TZ | The timestamp when the usage was recorded. |
| TOKEN_CREDITS | NUMBER | The number of token credits used for the request. |
| TOKENS | NUMBER | The total number of tokens used for the request. |
| TOKENS_GRANULAR | OBJECT | Granular breakdown of token usage by model. Each key is a model name, and each value is an object containing the following fields:   * `input`: Number of input tokens. * `cache_read_input`: Number of cache read input tokens. * `cache_write_input`: Number of cache write input tokens. * `output`: Number of output tokens. |
| CREDITS_GRANULAR | OBJECT | Granular breakdown of credit usage by model. Each key is a model name, and each value is an object containing the following fields:   * `input`: Credit value for input tokens. * `cache_read_input`: Credit value for cache read input tokens. * `cache_write_input`: Credit value for cache write input tokens. * `output`: Credit value for output tokens. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Credit rate usage is based on the number of tokens processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Examples

Retrieve Cortex Code CLI usage history:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_CODE_CLI_USAGE_HISTORY;
```

Retrieve total credits consumed per user in the last 30 days:

```sqlexample
SELECT USER_ID,
       SUM(TOKEN_CREDITS) AS TOTAL_CREDITS
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_CODE_CLI_USAGE_HISTORY
  WHERE USAGE_TIME >= DATEADD('day', -30, CURRENT_TIMESTAMP())
  GROUP BY USER_ID
  ORDER BY TOTAL_CREDITS DESC;
```

---
title: CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_code_snowsight_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY view

The CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY view can be used to query the usage history of [Cortex Code in Snowsight](../../user-guide/cortex-code/cortex-code-snowsight.md).

The information in the view includes the number of credits consumed each time a user interacts
with Cortex Code in Snowsight. Each row in the view represents a single request and provides detail about
the aggregated tokens and credits as well as a granular breakdown by model. The view also includes
relevant metadata, such as the user ID and request ID.

> **Note:**
>
> This view does not include requests originating from Cortex Code CLI. Requests originating from Cortex Code CLI are recorded in the [CORTEX_CODE_CLI_USAGE_HISTORY](cortex_code_cli_usage_history.md) view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USER_ID | NUMBER | The unique identifier of the user who made the request. |
| USER_TAGS | ARRAY | Tags associated with the user. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “ACCOUNT” or “USER”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| REQUEST_ID | VARCHAR | The unique identifier for the request. |
| PARENT_REQUEST_ID | VARCHAR | The identifier of the parent request, if applicable. |
| USAGE_TIME | TIMESTAMP_TZ | The timestamp when the usage was recorded. |
| TOKEN_CREDITS | NUMBER | The number of token credits used for the request. |
| TOKENS | NUMBER | The total number of tokens used for the request. |
| TOKENS_GRANULAR | OBJECT | Granular breakdown of token usage by model. Each key is a model name, and each value is an object containing the following fields:   * `input`: Number of input tokens. * `cache_read_input`: Number of cache read input tokens. * `cache_write_input`: Number of cache write input tokens. * `output`: Number of output tokens. |
| CREDITS_GRANULAR | OBJECT | Granular breakdown of credit usage by model. Each key is a model name, and each value is an object containing the following fields:   * `input`: Credit value for input tokens. * `cache_read_input`: Credit value for cache read input tokens. * `cache_write_input`: Credit value for cache write input tokens. * `output`: Credit value for output tokens. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Credit rate usage is based on the number of tokens processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Examples

Retrieve Cortex Code in Snowsight usage history:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY;
```

Retrieve total credits consumed per user in the last 30 days:

```sqlexample
SELECT USER_ID,
       SUM(TOKEN_CREDITS) AS TOTAL_CREDITS
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_CODE_SNOWSIGHT_USAGE_HISTORY
  WHERE USAGE_TIME >= DATEADD('day', -30, CURRENT_TIMESTAMP())
  GROUP BY USER_ID
  ORDER BY TOTAL_CREDITS DESC;
```

---
title: CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_document_processing_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view

This Account Usage view displays document processing function activity, including [PARSE_DOCUMENT (SNOWFLAKE.CORTEX)](../functions/parse_document-snowflake-cortex.md),
[AI_EXTRACT](../functions/ai_extract.md), and `<model_build_name>!PREDICT` calls. It shows pages processed and credits
used, aggregated hourly by function and model. The view includes metadata such as the following:

* Warehouse ID
* Execution timestamps
* Function names
* Model names

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | A unique identifier for the SQL query |
| CREDITS_USED | NUMBER(38,9) | The number of credits billed for Cortex Document processing functions for the specified query |
| START_TIME | TIMESTAMP_LTZ | Start of the hourly time range in which the query usage took place. |
| END_TIME | TIMESTAMP_LTZ | End of the hourly time range in which the query usage took place. |
| FUNCTION_NAME | TEXT | The name of the Cortex Document processing function |
| MODEL_NAME | TEXT | The name of the model |
| OPERATION_NAME | TEXT | The name of the operation  Valid values:   * `inference` * `train` |
| PAGE_COUNT | NUMBER | The number of pages processed |
| DOCUMENT_COUNT | NUMBER | The number of documents processed |
| FEATURE_COUNT | NUMBER | The number of data values defined for document processing operations that involve entry extraction |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Credit rate usage is based on the number of messages processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_FINE_TUNING_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_fine_tuning_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_FINE_TUNING_USAGE_HISTORY view

This Account Usage view can be used to query the training usage history of [Cortex Fine-tuning](../../user-guide/snowflake-cortex/cortex-finetuning.md).
This view includes the number of tokens processed and the training credits consumed by Cortex Fine-tuning jobs, aggregated by the job’s base model and the hour in which the job completed. This view only contains credits consumed for
fine-tuning training but not costs for using the fine-tuned model in inference, costs for storage, or costs associated with data replication. For
inference usage, see [CORTEX_FUNCTIONS_USAGE_HISTORY view](cortex_functions_usage_history.md). For more information, see
[Cost considerations](../../user-guide/snowflake-cortex/cortex-finetuning.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the Cortex Fine-tuning job terminated. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the Cortex Fine-tuning job terminated. |
| MODEL_NAME | VARCHAR | Name of the base model. |
| TOKEN_CREDITS | NUMBER | Number of credits billed for Cortex Fine-tuning usage based on tokens processed by training jobs that terminated during the specified time range. |
| TOKENS | NUMBER | Number of tokens billed for Cortex Fine-tuning jobs terminated during the specified time range. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* In some cases where a model is used but is not billed, the model column may be empty.

---
title: CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_functions_query_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view

> **Important:**
>
> This view is no longer updated. Use the [CORTEX_AISQL_USAGE_HISTORY](cortex_aisql_usage_history.md) view instead.

The CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view can be used to view the usage history of each [Cortex Functions](../../user-guide/snowflake-cortex/aisql.md) query in a Snowflake account. For more information, see [Cost considerations](../../user-guide/snowflake-cortex/aisql.md).

The information in the view includes the number of tokens and credits consumed for each query.

The view also includes relevant metadata, such as the model name and the ID of the warehouse running the queries.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FUNCTION_NAME | VARCHAR | Function name for the model. |
| MODEL_NAME | VARCHAR | Model name used in the query. A query can have more than one model. For queries with multiple models, the usage history includes a row for each model. |
| QUERY_ID | VARCHAR | Query ID |
| TOKENS | NUMBER | Number of tokens used for the (`QUERY_ID`, `MODEL_NAME`, `WAREHOUSE_ID`) combination. |
| TOKEN_CREDITS | NUMBER | Tokens converted to credits for the (`QUERY_ID`, `MODEL_NAME`, `WAREHOUSE_ID`) combination. |
| WAREHOUSE_ID | VARCHAR | ID of the warehouse used to run the query. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Query usage data might take a few hours to appear in the CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view.
* Credit rate usage is based on the number of messages processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_FUNCTIONS_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_functions_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_FUNCTIONS_USAGE_HISTORY view

> **Important:**
>
> This view is no longer updated. Use the [CORTEX_AISQL_USAGE_HISTORY](cortex_aisql_usage_history.md) view instead.

This Account Usage view can be used to query the usage history of [Cortex Functions](../../user-guide/snowflake-cortex/aisql.md) such
as COMPLETE and TRANSLATE. The information in the view includes the number of tokens and credits consumed each time a Cortex Function is
called, aggregated in one hour increments based on function and model. The view also includes relevant metadata, such as the warehouse ID,
start and end times of the function execution, and the name of the function and the model, if specified.

> **Note:**
>
> The view might not include usage information on functions called with recently added models. A new model can take up to 2 weeks to
> be included in this view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the Cortex LLM function usage took place. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the Cortex LLM function usage took place. |
| FUNCTION_NAME | VARCHAR | Name of the Cortex LLM function. |
| MODEL_NAME | VARCHAR | Model name. Empty for Cortex LLM functions where a model is not specified as an argument. |
| WAREHOUSE_ID | NUMBER | System-generated identifier for the warehouse used by the query calling the Cortex LLM function. |
| TOKENS | NUMBER | Number of tokens billed. |
| TOKEN_CREDITS | NUMBER | Number of credits billed for Cortex LLM functions usage based on tokens processed for the specified function and model (if applicable) during the START_TIME and END_TIME window. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* The credit rate usage is determined based on the function called, model used and the tokens processed as outlined in the
  [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
* In some cases where a model is used but is not billed, the model column may be empty.

---
title: CORTEX_PROVISIONED_THROUGHPUT_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_provisioned_throughput_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_PROVISIONED_THROUGHPUT_USAGE_HISTORY view

This Account Usage view lets you retrieve billing data for provisioned throughputs.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROVISIONED_THROUGHPUT_ID | VARCHAR | UUID identifying the provisioned throughput. |
| INTERVAL_START_TIME | TIMESTAMP_TZ | Start of the measurement interval for the billing period. |
| INTERVAL_END_TIME | TIMESTAMP_TZ | End of the measurement interval for the billing period. |
| CLOUD_SERVICE_PROVIDER | VARCHAR | Host cloud provider. |
| MODEL_NAME | VARCHAR | Configured model name. |
| TERM_START_DATE | DATE | Start of the provisioned throughput’s term. |
| TERM_END_DATE | DATE | End of the provisioned throughput’s term. |
| PTU_COUNT | NUMBER | Number of PTUs active. |
| PTU_CREDITS | NUMBER(38,9) | Number of credits billed during this interval. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).

---
title: CORTEX_REST_API_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_rest_api_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_REST_API_USAGE_HISTORY view

Query the CORTEX_REST_API_USAGE_HISTORY view to see the history of Cortex REST API calls.

The information in the view includes the number of tokens processed and credits consumed for each REST API request. The view also includes
relevant metadata, such as the request ID, model name, user ID, and inference region. For more information on Cortex billing, see
[Cost considerations](../../user-guide/snowflake-cortex/aisql.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The beginning of the time range for the usage history. |
| END_TIME | TIMESTAMP_LTZ | The end of the time range for the usage history. |
| REQUEST_ID | TEXT | The unique identifier for the REST API request. |
| MODEL_NAME | TEXT | Name of the model used in the REST API call. |
| TOKENS | NUMBER | Number of tokens processed for the REST API request. |
| TOKENS_GRANULAR | OBJECT | A SQL object that provides a breakdown of tokens processed by token type (input or output) for the REST API request. |
| USER_ID | TEXT | The internal ID of the user who invoked the REST API.  For more information about authenticating, see [Authenticating to the server](../../developer-guide/sql-api/authenticating.md). |
| INFERENCE_REGION | TEXT | The region in which the inference was performed. |

## Usage notes

* The view provides up-to-date usage information for an account within the last 365 days (1 year).
* Credit usage is based on the number of tokens processed, as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

---
title: CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_search_batch_query_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY view

This Account Usage view can be used to query the usage history of [Cortex Search batch search queries](../../user-guide/snowflake-cortex/cortex-search/batch-cortex-search.md).
Batch search queries incur three types of cost:

* **Serving cost**: Charged based on the search index data size and the duration of the batch search query.
* **Query embedding cost**: Charged based on the number of tokens embedded from the input workload. Standard embedding costs apply.
* **Virtual warehouse compute cost**: Charged for the virtual warehouse used to run the batch search query. This cost isn’t included in this view.

This view tracks serving cost and query embedding cost only. It includes the credits consumed, billable indexed data, billable duration,
and token usage for each batch search query submitted to a Cortex Search Service.
For more information, see [Cost considerations](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_TZ | Start of the specified time range in which the Cortex Search batch search query usage took place. |
| END_TIME | TIMESTAMP_TZ | End of the specified time range in which the Cortex Search batch search query usage took place. |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement that invoked the batch search query. |
| DATABASE_NAME | VARCHAR | Name of the database in which the Cortex Search Service resides. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database in which the Cortex Search Service resides. |
| SCHEMA_NAME | VARCHAR | Name of the schema in which the Cortex Search Service resides. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the Cortex Search Service resides. |
| SERVICE_NAME | VARCHAR | Name of the Cortex Search Service. |
| SERVICE_ID | NUMBER | Internal/system-generated identifier for the Cortex Search Service. |
| CONSUMPTION_TYPE | VARCHAR | The category of consumption incurred for the batch search query. One of: “BATCH_SERVING” (serving cost based on indexed data size and query duration) or “BATCH_EMBED_TEXT_TOKENS” (query embedding cost based on input tokens). |
| CREDITS_USED | NUMBER | Number of credits consumed for the batch search query for the specified CONSUMPTION_TYPE. |
| BILLABLE_INDEXED_DATA_BYTES | NUMBER | For CONSUMPTION_TYPE = “BATCH_SERVING”, the number of bytes of indexed data billed during the batch search query. NULL for other consumption types. |
| BILLABLE_DURATION_SECONDS | NUMBER | For CONSUMPTION_TYPE = “BATCH_SERVING”, the duration in seconds billed for the batch search query. NULL for other consumption types. |
| MODEL_NAME | VARCHAR | For CONSUMPTION_TYPE = “BATCH_EMBED_TEXT_TOKENS”, the name of the embedding model used to generate vector embeddings for the batch search query input. NULL for other consumption types. |
| TOKENS | NUMBER | For CONSUMPTION_TYPE = “BATCH_EMBED_TEXT_TOKENS”, the number of input tokens consumed for query embedding. NULL for other consumption types. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Each row represents a single batch search query identified by QUERY_ID.
* Serving cost is incurred per gigabyte-hour of indexed data, metered by the billable duration of the batch search query.
* Query embedding cost is incurred per input token. Unlike interactive search, query embedding for batch search queries is a billable cost.
* For daily-level aggregated batch usage, you can also query the [CORTEX_SEARCH_DAILY_USAGE_HISTORY](cortex_search_daily_usage_history.md) view
  with `CONSUMPTION_TYPE = 'BATCH'`.

---
title: CORTEX_SEARCH_DAILY_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_search_daily_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_SEARCH_DAILY_USAGE_HISTORY view

This Account Usage view can be used to query the daily usage history of [Cortex Search](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md),
with consumption broken out by category. The information in this view includes the number of credits consumed per day for a Cortex Search Service
for serving, embedding text, and batch search queries, but not the other costs associated with a Cortex Search Service.
For more information, see [Cost considerations](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | TIMESTAMP_LTZ | Start of the specified time range in which the Cortex Search serving usage took place. |
| DATABASE_NAME | VARCHAR | Name of the database in which the Cortex Search Service resides. |
| SCHEMA_NAME | VARCHAR | Name of the schema in which the Cortex Search Service resides. |
| SERVICE_NAME | VARCHAR | Name of the Cortex Search Service. |
| SERVICE_ID | NUMBER | ID of the Cortex Search Service. |
| CONSUMPTION_TYPE | VARCHAR | The category of consumption incurred. One of “SERVING”, “EMBED_TEXT_TOKENS”, or “BATCH”. |
| CREDITS | NUMBER | Number of credits billed for Cortex Search usage on the USAGE_DATE date for the specified CONSUMPTION_TYPE. |
| MODEL_NAME | VARCHAR | For CONSUMPTION_TYPE = “EMBED_TEXT_TOKENS”, the name of the embedding model used to generate vector embeddings (nullable). |
| TOKENS | VARCHAR | For CONSUMPTION_TYPE = “EMBED_TEXT_TOKENS”, the number of input tokens consumed (nullable). |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Serving costs are incurred per gigabyte-month of indexed data, metered at one-second resolution. You can get an estimate of
  the indexed data size for a given service using the credit rate defined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
* EMBED_TEXT_TOKENS cost is incurred per input token.
* BATCH cost includes both serving and query embedding costs incurred by batch search queries. For per-query details, see the
  [CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY](cortex_search_batch_query_usage_history.md) view.

---
title: CORTEX_SEARCH_SERVING_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/cortex_search_serving_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CORTEX_SEARCH_SERVING_USAGE_HISTORY view

This Account Usage view can be used to query the hourly serving usage history of [Cortex Search](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).
The information in this view includes the number of serving credits consumed per hour for a Cortex Search Service. This view
only contains credits consumed for serving, not the other costs associated with a Cortex Search Service. For more information, see
[Cost considerations](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the Cortex Search serving usage took place. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the Cortex Search usage took place. |
| DATABASE_NAME | VARCHAR | Name of the database in which the Cortex Search Service resides. |
| SCHEMA_NAME | VARCHAR | Name of the schema in which the Cortex Search Service resides. |
| SERVICE_NAME | VARCHAR | Name of the Cortex Search Service. |
| SERVICE_ID | NUMBER | ID of the Cortex Search Service. |
| CREDITS | NUMBER | Number of credits billed for Cortex Search serving usage based on the size of indexed data during the START_TIME and END_TIME window. |

## Usage notes

* The view provides up-to-date credit usage for an account within the last 365 days (1 year).
* Serving credits are incurred per GB-mo of indexed data, metered at the second-level. One may get an estimate of
  the indexed data size for a given service using the credit rate defined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).
* This view may show small discrepancies in incurred cost from hour-to-hour based on the second-level metering.

---
title: CREDENTIALS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/credentials.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# CREDENTIALS view

This Account Usage view includes a row for each credential used as a first or
[second factor](../../user-guide/security-mfa-second-factor.md) for authentication. This view includes rows for the following types
of credentials:

* [Programmatic access tokens](../../user-guide/programmatic-access-tokens.md)
* [Passkeys](../../user-guide/security-mfa-second-factor.md)
* [Time-based one-time passcodes (TOTPs)](../../user-guide/security-mfa-second-factor.md)
* [Workload identity federation](../../user-guide/workload-identity-federation.md)

> **Note:**
>
> This view does not include information about [Duo authenticators](../../user-guide/security-mfa-duo.md) (Duo push and passcodes).
>
> To determine if a user has configured Duo as a second factor for authentication, you can run the
> [SHOW MFA METHODS](../sql/show-mfa-methods.md) command.

This view does not include credentials that have been deleted.

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| CREDENTIAL_ID | NUMBER | Internal/system-generated identifier for the credential. |
| NAME | VARCHAR | Name of the credential. |
| USER_NAME | VARCHAR | Name of the user associated with the credential. |
| TYPE | VARCHAR | Type of the credential. These types include:   * `PASSKEY`: [Passkey](../../user-guide/security-mfa-second-factor.md). * `PAT`: [Programmatic access token](../../user-guide/programmatic-access-tokens.md). * `TOTP`: [Time-based one-time passcode](../../user-guide/security-mfa-second-factor.md). * `AWS`: AWS Identity and Access Management (AWS IAM) is the identity provider, which indicates the workload is running on AWS. See   [Workload identity federation](../../user-guide/workload-identity-federation.md). * `AZURE`: Microsoft Entra ID is the identity provider, which indicates the workload is running on Microsoft Azure. See   [Workload identity federation](../../user-guide/workload-identity-federation.md). * `GCP`: Google Accounts is the identity provider, which indicates the workload is running on Google Cloud. See   [Workload identity federation](../../user-guide/workload-identity-federation.md). * `OIDC`: An OpenID Connect (OIDC) provider is the identity provider. See [Workload identity federation](../../user-guide/workload-identity-federation.md). |
| DOMAIN | VARCHAR | Domain of the credential. The domains include:   * `MFA_METHOD`: The credential is used as a   [second factor of authentication](../../user-guide/security-mfa-second-factor.md). * `PROGRAMMATIC_ACCESS_TOKEN`: [Programmatic access token](../../user-guide/programmatic-access-tokens.md). * `WORKLOAD_IDENTITY_FEDERATION_METHOD`: [Workload identity federation](../../user-guide/workload-identity-federation.md).   A given domain can have one or more possible types (specified in the TYPE column). |
| COMMENT | VARCHAR | Comment about the credential. |
| STATUS | VARCHAR | Status of the credential. The status depends on the value in the TYPE column:   * For `TYPE = 'PAT'` ([programmatic access tokens](../../user-guide/programmatic-access-tokens.md)), the status can be one   of the following:    + `ACTIVE`: The programmatic access token can be used to authenticate and has not expired yet.   + `EXPIRED`: The programmatic access token cannot be used to authenticate because the expiration date has passed.   + `DISABLED`: The programmatic access token is [disabled](../../user-guide/programmatic-access-tokens.md) because user login access is disabled or     the user is locked out of logging in. * For other types of credentials, the status can be one of the following:    + `PENDING`: The user started the enrollment process for an MFA method but has not completed the process. For example,     the user started registering an authenticator but never finished the setup process for the authenticator. As a result,     the MFA method is not considered to be valid yet.   + `ENROLLED`: The user has completed the enrollment process for the MFA method, and the MFA method can be used for     second-factor authentication. |
| ADDITIONAL_DETAILS | OBJECT | Additional details about the credential. The additional details depend on the type of the credential (the value in the TYPE column):   * For `TYPE = 'PAT'` ([programmatic access tokens](../../user-guide/programmatic-access-tokens.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT` key, the value is an integer representing the number of minutes     during which the [requirement of having a network policy](../../user-guide/programmatic-access-tokens.md) is bypassed. You can     specify this value when [generating the token](../../user-guide/programmatic-access-tokens.md).   + For the `ROLE_RESTRICTION` key, the value is an array of the roles that are used for privilege evaluation and     object creation during the session authenticated with this token. You can specify these roles when     [generating the token](../../user-guide/programmatic-access-tokens.md).   + For the `ROTATED_TO` key, the value is the name of the newer token that this token was replaced by during     [rotation](../../user-guide/programmatic-access-tokens.md). These key-value pairs are present only if the corresponding properties are set in the token. For example:  ```json   {     "MINS_TO_BYPASS_NETWORK_POLICY_REQUIREMENT":       60,     "ROLE_RESTRICTION": [       "MY_ROLE"     ],     "ROTATED_TO": "MY_PAT_NAME"   }   ```  If none of these are specified for the token, the column contains an empty object (`{}`). * For `TYPE = 'PASSKEY'` ([passkey](../../user-guide/security-mfa-second-factor.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the key-value pair `aaguid`. For example:  ```json   {     "aaguid": "a12345678-..."   }   ``` * For `TYPE = 'TOTP'` ([time-based one-time passcode](../../user-guide/security-mfa-second-factor.md)), the column contains NULL. * For `TYPE = 'AWS'` ([workload identity federation](../../user-guide/workload-identity-federation.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `aws_partition` key, the value is the AWS partition for the federated identity.   + For the `aws_account` key, the value is the AWS account identifier for the federated identity.   + For the `type` key, the value is the type of the federated identity. This can be `IAM_USER` or `IAM_ROLE`.   + For the `iam_role` key, the value is the name of the federated IAM role or user. * For `TYPE = 'AZURE'` ([workload identity federation](../../user-guide/workload-identity-federation.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `issuer` key, the value is the Entra ID tenant’s Authority URL.   + For the `subject` key, the value is the Object ID (Principal ID) assigned to the Azure workload that is using a     managed identity. * For `TYPE = 'GCP'` ([workload identity federation](../../user-guide/workload-identity-federation.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `subject` key, the value is the `uniqueId` property of the Google Cloud service account associated with the     federated workload. * For `TYPE = 'OIDC'` ([workload identity federation](../../user-guide/workload-identity-federation.md)), the column contains   an [OBJECT](../data-types-semistructured.md) value with the following key-value pairs:    + For the `issuer` key, the value is the issuer URL of the OpenID Connect (OIDC) provider.   + For the `subject` key, the value is the identifier of the federated workload.   + For the `audience_list` key, the value is the custom audiences that are allowed in an OIDC ID token. An empty value means     the default audience `snowflakecomputing.com` is required. |
| CREATED_BY | VARCHAR | Name of the user who created the credential. |
| LAST_ALTERED_BY | VARCHAR | Name of the user who last modified the credential. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the credential was created. |
| LAST_USED_ON | TIMESTAMP_LTZ | Date and time when the credential was last used for authentication. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the credential was last modified. |
| EXPIRATION_DATE | TIMESTAMP_LTZ | Date and time when the credential expires. |

## Usage notes

* Latency for the view might be up to two hours.
* If a programmatic access token is generated soon after a user is created, the information about that user in this view might
  be incomplete. It might take some time for the user information to be included in the view.

## Examples

The following example returns rows for [programmatic access tokens](../../user-guide/programmatic-access-tokens.md):

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CREDENTIALS WHERE type = 'PAT';
```

```output
+---------------+---------------+--------------+------+---------------------------+-------------------+--------+--------------------+--------------+-----------------+-------------------------+-------------------------+-------------------------+
| CREDENTIAL_ID | NAME          | USER_NAME    | TYPE | DOMAIN                    | COMMENT           | STATUS | ADDITIONAL_DETAILS | CREATED_BY   | LAST_ALTERED_BY | CREATED_ON              | LAST_USED_ON            | LAST_ALTERED            |
|---------------+---------------+--------------+------+---------------------------+-------------------+--------+--------------------+--------------+-----------------+-------------------------+-------------------------+-------------------------|
|      19464837 | EXAMPLE_TOKEN | EXAMPLE_USER | PAT  | PROGRAMMATIC_ACCESS_TOKEN | My token for APIs | ACTIVE | {}                 | EXAMPLE_USER | EXAMPLE_USER    | 2025-04-14 22:05:19.661 | 2025-04-14 22:05:19.661 | 2025-04-14 22:05:19.661 |
+---------------+---------------+--------------+------+---------------------------+-------------------+--------+--------------------+--------------+-----------------+-------------------------+-------------------------+-------------------------+
```

---
title: DATA_CLASSIFICATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_classification_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_CLASSIFICATION_HISTORY view

This Account Usage view displays all historical sensitive data classification results for each table in the account. Unlike
[DATA_CLASSIFICATION_LATEST view](data_classification_latest.md), which shows only the most recent classification per table,
this view shows all classification events over time, limited to the last 365 days.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that was classified. |
| TABLE_NAME | VARCHAR | Name of the table. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table. |
| RESULT | VARIANT | Classification result at the time of classification. For a description of the JSON object, see the output of the [SYSTEM$GET_CLASSIFICATION_RESULT](../functions/system_get_classification_result.md) function. |
| TRIGGER_TYPE | VARCHAR | Mode of the classification trigger: `MANUAL` or `AUTO CLASSIFICATION`, where `MANUAL` indicates that someone called a system function to initiate the classification process. |
| CLASSIFIED_ON | TIMESTAMP_LTZ | Time when the classification was performed. |
| TABLE_DELETED_ON | TIMESTAMP_LTZ | Date and time when the object or parent object was dropped. NULL if the object has not been deleted. |

## Usage notes

* Latency for this view might be up to three hours.
* Data is retained for 365 days (one year). Rows are removed only when a classification event is older than one year.
* Unlike [DATA_CLASSIFICATION_LATEST view](data_classification_latest.md), this view retains data for classification events
  even when the associated table, schema, or database is dropped. The `TABLE_NAME`, `SCHEMA_NAME`, and `DATABASE_NAME`
  columns reflect the table and its database/schema location recorded for that classification result, but do not preserve historical
  object names across subsequent rename operations. If a table is later moved to a different schema and reclassified, a new row
  reflects the new location. The `TABLE_DELETED_ON` column is non-null if the table has been dropped.

For more information on how to query this view, see [Query the classification history](../../user-guide/classify-results.md).

---
title: DATA_CLASSIFICATION_LATEST view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_classification_latest.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_CLASSIFICATION_LATEST view

This Account Usage view displays one row for the most recent result of a classified table for each classified table. Each row corresponds
to a different table.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | Number | Internal/system-generated identifier for the table that was classified. |
| TABLE_NAME | VARCHAR | Name of the table. |
| SCHEMA_ID | Number | Internal/system-generated identifier for the schema that contains the table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table. |
| DATABASE_ID | Number | Internal/system-generated identifier for the database that contains the table. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table. |
| RESULT | VARIANT | Latest classification result. For a description of the JSON object, see the output of the [SYSTEM$GET_CLASSIFICATION_RESULT](../functions/system_get_classification_result.md) function. |
| STATUS | VARCHAR | One of the following: `CLASSIFIED` or `REVIEWED`. |
| TRIGGER_TYPE | VARCHAR | Mode of the classification trigger: `MANUAL` or `AUTO CLASSIFICATION`, where `MANUAL` indicates that someone called a system function to initiate the classification process. |
| LAST_CLASSIFIED_ON | TIMESTAMP_LTZ | Time when the table was last successfully classified. |
| LAST_CLASSIFICATION_ATTEMPT | TIMESTAMP_LTZ | Timestamp of the last sensitive data classification attempt. If the value is greater than `LAST_CLASSIFIED_ON`, it indicates that the last sensitive data classification attempt resulted in a failure. |
| ERROR_MESSAGE | VARCHAR | Error message from the last sensitive data classification attempt, if it resulted in a failure. |

## Usage notes

* Latency for this view might be up to three hours.
* This view retains data for as long as the table exists.
* A row in the view is removed when the following occur:

  + A table is dropped or renamed.
  + The table is reclassified.

---
title: DATA_METRIC_FUNCTION_EXPECTATIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_metric_function_expectations.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_METRIC_FUNCTION_EXPECTATIONS view

This Account Usage view lists the [expectations](../../user-guide/data-quality-expectations.md) in an account. It lists the expectations that
were added to an association between a data metric function (DMF) and an object.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| METRIC_DATABASE_NAME | VARCHAR | Database that contains the data metric function. |
| METRIC_SCHEMA_NAME | VARCHAR | Schema that contains the data metric function. |
| METRIC_NAME | VARCHAR | Name of the data metric function. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the metric arguments. |
| DATA_TYPE | VARCHAR | Return data type of the data metric function. |
| REF_DATABASE_NAME | VARCHAR | Database that contains the object that is associated with the data metric function. |
| REF_SCHEMA_NAME | VARCHAR | Schema that contains the object that is associated with the data metric function. |
| REF_ENTITY_NAME | VARCHAR | Name of the table or view that is associated with the data metric function. |
| REF_ENTITY_DOMAIN | VARCHAR | Type of the object (table, view) that the data metric function is associated with. |
| REF_ARGUMENTS | ARRAY | Reference arguments used to evaluate the rule. |
| REF_ID | VARCHAR | System-generated identifier for the association of the data metric function to the table or view. |
| EXPECTATION_ID | VARCHAR | System-generated identifier. |
| EXPECTATION_NAME | VARCHAR | Name that was given to the expectation when it was added to the association between the DMF and the object. |
| EXPECTATION_EXPRESSION | VARCHAR | Boolean expression of the expectation. See [Defining what meets the expectation](../../user-guide/data-quality-expectations.md). |

## Usage notes

Latency for the view might be up to 30 minutes.

---
title: DATA_METRIC_FUNCTION_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_metric_function_references.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_METRIC_FUNCTION_REFERENCES view

This Account Usage view can be used to identify data metric function objects and their references in your account.

The view is complementary to the Information Schema table function [DATA_METRIC_FUNCTION_REFERENCES](../functions/data_metric_function_references.md).

## Columns

The view returns the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `metric_database_name` | VARCHAR | The database that stores the data metric function. |
| `metric_schema_name` | VARCHAR | The schema that stores the data metric function. |
| `metric_name` | VARCHAR | The name of the data metric function. |
| `argument_signature` | VARCHAR | The type signature of the metrics arguments. |
| `data_type` | VARCHAR | The return data type of the data metric function. |
| `ref_database_name` | VARCHAR | The database name that contains the object on which the data metric function is added. |
| `ref_schema_name` | VARCHAR | The schema name that contains the object on which the data metric function is added. |
| `ref_entity_name` | VARCHAR | The name of the table or view on which the data metric function is set. |
| `ref_entity_domain` | VARCHAR | The object type (table, view) on which the data metric function is set. |
| `ref_arguments` | ARRAY | Identifies the reference arguments used to evaluate the rule. |
| `ref_id` | VARCHAR | A unique identifier for the association of the data metric function to the table or view. |
| `schedule` | VARCHAR | The schedule to run the data metric function on the table or view. The value for the schedule is always the most recent and effective schedule. |
| `schedule_status` | VARCHAR | The status of the metrics association. One of the following:  `STARTED`  The data metric association on the table or view is scheduled to run.  `SUSPENDED`  The data metric association on the table or view is not scheduled to run. This value also occurs when the role in use that calls the function does not have the OWNERSHIP privilege on the table.  When querying the Account Usage view, the following values are visible by default; however, when calling the table function you must use a role with the OWNERSHIP privilege on the table to see these values:  `SUSPENDED_TABLE_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`  One of the following:   * The table is dropped. * The schema or database that contains the table is dropped * The schema or database that contains the table cannot be resolved by the table owner role.  “Resolved” means the role that calls the function does not have the appropriate privileges on the schema or database that   contains the table.  `SUSPENDED_DATA_METRIC_FUNCTION_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`  One of the following:   * The DMF is dropped. * The schema or database that contains the DMF is dropped. * The schema or database that contains the DMF cannot be resolved by the table owner role.  `SUSPENDED_TABLE_COLUMN_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`  One of the following:   * The target table column is dropped. * The schema or database that contains the column is dropped. * The schema or database that contains the column cannot be resolved by the table owner role.  `SUSPENDED_INSUFFICIENT_PRIVILEGE_TO_EXECUTE_DATA_METRIC_FUNCTION`  The table owner role does not have the EXECUTE DATA METRIC FUNCTION privilege.  `SUSPENDED_ACTIVE_EVENT_TABLE_DOES_NOT_EXIST_OR_NOT_AUTHORIZED`  The event table is not set at the account level. |
| `data_quality_notification_status` | VARCHAR | Reserved for future use. |
| `anomaly_detection_status` | VARCHAR | Indicates whether [anomaly detection](../../user-guide/data-quality-anomaly.md) is enabled for the association between the DMF and the object. If the value is `TRAINING_IN_PROGRESS`, see [About the training period](../../user-guide/data-quality-anomaly.md). |
| `anomaly_detection_sensitivity_level` | VARCHAR | The sensitivity level of anomaly detection. For more information, see [Adjust the sensitivity level of anomaly detection](../../user-guide/data-quality-anomaly.md). |
| `use_role` | VARCHAR | The access control role used to execute the metric function. |
| `level` | VARCHAR | The level at which the metric function is associated with the object. TABLE for all table-like objects. |
| `exclude_table_types` | VARCHAR | Reserved for future use. |

## Usage notes

* Latency for the view might be up to 3 hours.
* To query this view, use a role that is granted either of these [database roles](../snowflake-db-roles.md) at a minimum:
  SNOWFLAKE.GOVERNANCE_VIEWER or SNOWFLAKE.USAGE_VIEWER.

---
title: DATA_QUALITY_MONITORING_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_quality_monitoring_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_QUALITY_MONITORING_USAGE_HISTORY view

The DATA_QUALITY_MONITORING_USAGE_HISTORY view in the ACCOUNT_USAGE schema records the daily credit consumption for data metric function
evaluations on tables in an account within the last 365 days (1 year).

See also:
:   [Introduction to data quality checks](../../user-guide/data-quality-intro.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | NUMBER | Number of credits billed for data metric function evaluations on the table. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table monitored by data metric functions. |
| TABLE_NAME | VARCHAR | Name of the table. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that stores the table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that stores the table. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that stores the table. |
| DATABASE_NAME | VARCHAR | Name of the database that stores the table. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

---
title: DATA_TRANSFER_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/data_transfer_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATA_TRANSFER_HISTORY view

This Account Usage view can be used to query the history of data transferred from Snowflake tables into a different cloud storage provider’s network (i.e. from Snowflake on AWS, Google Cloud Platform, or Microsoft Azure into
the other cloud provider’s network) and/or geographical region within the last 365 days (1 year). The view includes the history for your entire Snowflake account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range in which the data transfer took place. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range in which the data transfer took place. |
| SOURCE_CLOUD | VARCHAR | Name of the cloud provider where the data transfer originated: Amazon Web Services (AWS), Google Cloud Platform, or Microsoft Azure. |
| SOURCE_REGION | VARCHAR | Region where the data transfer originated. |
| TARGET_CLOUD | VARCHAR | Name of the cloud provider where the data was sent: AWS, Google Cloud Platform, or Microsoft Azure. |
| TARGET_REGION | VARCHAR | Region where the data was sent. |
| BYTES_TRANSFERRED | VARIANT | Number of bytes transferred during the START_TIME and END_TIME window. |
| TRANSFER_TYPE | VARCHAR | Type of operation that caused the transfer. [COPY](../sql/copy-into-location.md), [COPY_FILES](../sql/copy-files.md), [DATA_LAKE](../../user-guide/tables-iceberg.md), [EXTERNAL_ACCESS](../../developer-guide/external-network-access/external-network-access-overview.md), [EXTERNAL_FUNCTION](../external-functions.md), [INTERNAL](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md), [REPLICATION](../../user-guide/account-replication-intro.md), [SNOWPARK_CONTAINER_SERVICES](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

---
title: DATABASE_REPLICATION_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/database_replication_usage_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATABASE_REPLICATION_USAGE_HISTORY view

This Account Usage view can be used to query the database replication history.
The returned results include the database name, credits consumed, and bytes transferred for replication.
Usage data is retained for 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the replication usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the replication usage took place. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| CREDITS_USED | NUMBER | Total number of credits used for database replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for database replication during the START_TIME and END_TIME window. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* Results are only returned for secondary databases in the target account.

---
title: DATABASE_STORAGE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/database_storage_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATABASE_STORAGE_USAGE_HISTORY view

This Account Usage view can be used to query the average daily storage usage, in bytes, for databases in the account for the last 365 days (1 year). The data includes:

* All data stored in tables in the database(s).
* All historical data maintained in Fail-safe for the database(s).

See also:
:   [STORAGE_DAILY_HISTORY view](../organization-usage/storage_daily_history.md) , [STORAGE_USAGE view](storage_usage.md) , [TABLE_STORAGE_METRICS view](table_storage_metrics.md)

> **Note:**
>
> This view isn’t designed to reconcile with your Snowflake bill. As a result, the sum of database-level usage in this view won’t equal the billed storage for your account.
>
> For a view that more closely reflects billed storage at the account and organization level, see [STORAGE_DAILY_HISTORY view](../organization-usage/storage_daily_history.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Date (in the local time zone) of this storage usage record. It is recommended that you change the query session to use the UTC time zone instead (e.g. `ALTER SESSION SET TIMEZONE='UTC'`). |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| DELETED | TIMESTAMP_LTZ | Date and time when the database was dropped; NULL for active databases. |
| AVERAGE_DATABASE_BYTES | FLOAT | Number of bytes of database storage used, including bytes currently in Time Travel. |
| AVERAGE_FAILSAFE_BYTES | FLOAT | Number of bytes of Fail-safe storage used. |
| AVERAGE_HYBRID_TABLE_STORAGE_BYTES | FLOAT | Number of bytes of hybrid table storage used (data in the row store). |
| AVERAGE_ARCHIVE_STORAGE_COOL_BYTES | FLOAT | Average number of bytes (including active bytes, time travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md)) of table storage used in the COOL storage tier. |
| AVERAGE_ARCHIVE_STORAGE_COLD_BYTES | FLOAT | Average number of bytes (including active bytes, time travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md)) of table storage used in the COLD storage tier. |
| AVERAGE_COOL_FAILSAFE_BYTES | FLOAT | Average number of bytes of Fail-safe storage used in the COOL storage tier. |
| AVERAGE_COLD_FAILSAFE_BYTES | FLOAT | Average number of bytes of Fail-safe storage used in the COLD storage tier. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* This view is suitable for comparing relative storage usage between databases in an account over time. It isn’t suitable for calculating exact database-level storage charges.
* To approximate database-level storage costs for chargeback, combine each database’s share of usage in this view with your total billed storage from [STORAGE_DAILY_HISTORY view](../organization-usage/storage_daily_history.md) or your invoice. Treat the result as an approximation, not an exact billing figure.
* > **Note:**
  >
  > With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
  > this view includes new columns for storage lifecycle policies.
  > To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
  > ```

---
title: DATABASES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/databases.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DATABASES view

This Account Usage view displays a row for each database defined in your account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| DATABASE_OWNER | VARCHAR | Name of the role that owns the database. |
| IS_TRANSIENT | VARCHAR | Whether the database is transient. |
| COMMENT | VARCHAR | Comment for the database. |
| CREATED | TIMESTAMP_LTZ | Date and time when the database was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the database was dropped. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| RESOURCE_GROUP | VARCHAR | For internal use. |
| TYPE | VARCHAR | Specifies the type of database. Valid values are: . . - APPLICATION: a Snowflake Native App. . - APPLICATION_PACKAGE: an application package. . - STANDARD: a normal database. . - IMPORTED DATABASE: a database created from a share. . - PERSONAL DATABASE: a personal database linked to its owner. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OBJECT_VISIBILITY | OBJECT | `OBJECT_VISIBILITY`  [Preview Feature](../../release-notes/preview-features.md) — Open  Available to all accounts.  This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account, enabling users without explicit access privileges to find objects and request access. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* The view displays all of the databases in an account.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: DOCUMENT_AI_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/document_ai_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DOCUMENT_AI_USAGE_HISTORY view

The DOCUMENT_AI_USAGE_HISTORY view can be used to query the job history for Document AI.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the hourly time range in which the query usage took place. |
| END_TIME | TIMESTAMP_LTZ | End of the hourly time range in which the query usage took place. |
| CREDITS_USED | NUMBER(38,9) | Number of credits used for Document AI compute between START_TIME and END_TIME. |
| QUERY_ID | VARCHAR | A unique identifier for the SQL query. |
| OPERATION_NAME | TEXT | Name of the Document AI operation: `Inference` (entity extraction) or `Inference-Table-Extraction` (table extraction). |
| PAGE_COUNT | NUMBER | Number of pages processed. |
| DOCUMENT_COUNT | NUMBER | Number of documents processed. |
| FEATURE_COUNT | NUMBER | Number of data values defined to be extracted. |

---
title: DYNAMIC_TABLE_REFRESH_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/dynamic_table_refresh_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# DYNAMIC_TABLE_REFRESH_HISTORY view

This Account Usage view displays information for dynamic table refresh history.

See also:
:   [DYNAMIC_TABLE_REFRESH_HISTORY](../functions/dynamic_table_refresh_history.md) (Information Schema)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the dynamic table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the dynamic table. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the dynamic table. |
| ID | NUMBER | Internal, Snowflake-generated identifier for the dynamic table. |
| SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema that contains the dynamic table. |
| DATABASE_ID | NUMBER | Internal, Snowflake-generated identifier of the database that contains the dynamic table. |
| STATE | VARCHAR | Status of the refresh for the dynamic table. This can be one of the following: . - EXECUTING: refresh in progress. . - SUCCEEDED: refresh completed successfully. . - FAILED: refresh failed during execution. . - CANCELLED: refresh was canceled before execution. . - UPSTREAM_FAILED: refresh not performed due to an upstream failed refresh. |
| STATE_CODE | VARCHAR | Code representing the current state of the refresh. |
| STATE_MESSAGE | VARCHAR | Description of the current state of the refresh. |
| QUERY_ID | VARCHAR | ID of the SQL statement that produced the results for the dynamic table. |
| DATA_TIMESTAMP | TIMESTAMP_LTZ | Transactional timestamp when the refresh was evaluated. (This might be slightly before the actual time of the refresh.) All data, in base objects, that arrived before this timestamp is currently included in the dynamic table. |
| REFRESH_START_TIME | TIMESTAMP_LTZ | Time when the refresh job started. |
| REFRESH_END_TIME | TIMESTAMP_LTZ | Time when the refresh completed. |
| COMPLETION_TARGET | TIMESTAMP_LTZ | Time by which this refresh should complete to keep lag under the TARGET_LAG parameter for the dynamic table. This is equal to the DATA_TIMESTAMP of the last refresh + TARGET_LAG. |
| QUALIFIED_NAME | VARCHAR | Fully qualified name of the dynamic table as it appears in the graph of dynamic tables. You can use this to join the output with the output of the [DYNAMIC_TABLE_GRAPH_HISTORY](../functions/dynamic_table_graph_history.md) function. |
| LAST_COMPLETED_DEPENDENCY | OBJECT | Contains the following properties: . - `qualified_name`: The qualified name of the latest dependency to become available. . - `data_timestamp`: The refresh version of that dependency. |
| STATISTICS | OBJECT | Contains the following properties: . - `numInsertedRows`: The number of inserted rows. . - `numDeletedRows`: The number of rows that were deleted. . - `numCopiedRows`: The number of rows that were copied unchanged. . - `numAddedPartitions`: The number of added partitions. . - `numRemovedPartitions` : The number of removed partitions. . - `queuedTimeMs`: The time (in milliseconds) spent in the queued state. . - `compilationTimeMs`: The time (in milliseconds) spent compiling the refresh query. . - `executionTimeMs`: The time (in milliseconds) spent executing the refresh query. . For successful refreshes, this column includes both the row/partition statistics and the time distribution information. For failed refreshes, this column is populated with the time distribution information only. . For example: If an UPDATE statement updates 1 row in a partition with 10 rows. Then the metrics above show 1 row inserted, 1 deleted, and 9 copied. Additionally, 1 partition is removed and 1 partition added. |
| REFRESH_ACTION | VARCHAR | One of: . - NO_DATA - no new data in base tables. Doesn’t apply to the initial refresh of newly created dynamic tables regardless of whether or not the base tables have data. . - REINITIALIZE - base table changed or source table of a cloned dynamic table was refreshed during clone. . - FULL - Full refresh, because dynamic table contains query elements that are not incrementalizable (see SHOW DYNAMIC TABLE refresh_mode_reason) or because full refresh was cheaper than incremental refresh. . - INCREMENTAL - normal incremental refresh. |
| REFRESH_TRIGGER | VARCHAR | One of: . - SCHEDULED - normal background refresh to meet target lag or downstream target lag. . - MANUAL - user/task used ALTER DYNAMIC TABLE <name> REFRESH . - CREATION - refresh performed during the creation DDL statement, triggered by the creation of the dynamic table or any consumer dynamic tables. |
| TARGET_LAG_SEC | NUMBER | Describes the target lag value for the dynamic tables at the time the refresh occurred. |
| GRAPH_HISTORY_VALID_FROM | TIMESTAMP_NTZ | Encodes the VALID_FROM timestamp of the DYNAMIC_TABLE_GRAPH_HISTORY table function when the refresh occurred to clarify which version of a dynamic table a specific refresh corresponds to. This value can also be NULL if the corresponding dynamic table hasn’t been created. |

## Usage notes

* Latency for the view may be up to 3 hours.
* To query this view, use a role that is granted the SNOWFLAKE.USAGE_VIEWER [database role](../snowflake-db-roles.md).

## Examples

Find failed dynamic table refreshes during the past week.

> ```sqlexample
> SELECT
>     data_timestamp,
>     database_name,
>     schema_name,
>     name,
>     state,
>     state_message,
>     query_id
>   FROM snowflake.account_usage.dynamic_table_refresh_history
>   WHERE state = 'FAILED' AND data_timestamp >= dateadd(WEEK, -1, current_date())
>   ORDER BY data_timestamp DESC
>   LIMIT 10;
> ```

---
title: ELEMENT_TYPES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/element_types.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# ELEMENT_TYPES view

This Account Usage view displays a row for each [structured ARRAY type](../data-types-structured.md) in an
object (a column in a table) in the account.

Each row describes the type of the element in the structured ARRAY.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| COLLECTION_TYPE_IDENTIFIER | VARCHAR | Type identifier. Use this to join on:   * The DTD_IDENTIFIER column in the [COLUMNS view](../info-schema/columns.md). * The DTD_IDENTIFIER column in this view (for nested types). * The DTD_IDENTIFIER column in the [FIELDS view](../info-schema/fields.md) (for nested types). |
| OBJECT_ID | VARCHAR | Internal/system-generated identifier for the object that uses this ARRAY type (e.g. name of a table). |
| OBJECT_NAME | VARCHAR | Name of the object that uses this ARRAY type (e.g. name of a table). |
| OBJECT_TYPE | VARCHAR | Type of the object that uses this ARRAY type:   * TABLE (if used by a column) |
| OBJECT_SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema of the object that uses this ARRAY type. |
| OBJECT_SCHEMA | VARCHAR | Schema that contains the object that uses this ARRAY type. |
| OBJECT_CATALOG_ID | VARCHAR | Internal/system-generated identifier for the database of the object that uses this ARRAY type. |
| OBJECT_CATALOG | VARCHAR | Database that contains the object that uses this ARRAY type. |
| DATA_TYPE | VARCHAR | Data type of the element. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string elements. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string elements. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric elements. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric elements. |
| NUMERIC_SCALE | NUMBER | Scale of numeric elements. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | VARCHAR | Not applicable for Snowflake. |
| INTERVAL_PRECISION | NUMBER | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| COLLATION_CATALOG | VARCHAR | Not applicable for Snowflake. |
| COLLATION_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | The collation specification for this element |
| UDT_CATALOG | VARCHAR | Not applicable for Snowflake. |
| UDT_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| UDT_NAME | VARCHAR | Not applicable for Snowflake. |
| SCOPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| SCOPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| SCOPE_NAME | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | NUMBER | Maximum cardinality. Currently, this is always set to NULL. |
| DTD_IDENTIFIER | VARCHAR | Nested type identifier. Use this to join on:   * The COLLECTION_TYPE_IDENTIFIER column in this view. * The ROW_IDENTIFIER column in the [FIELDS view](../info-schema/fields.md) (for nested types). |
| IS_NULLABLE | VARCHAR | `Y` if the structured ARRAY allows NULL values; `N` otherwise. |
| DELETED | TIMESTAMP_LTZ | Date and time when the object was dropped. |

## Usage notes

* Latency for the view may be up to 90 minutes.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.

---
title: EVENT_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/event_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# EVENT_USAGE_HISTORY view

This view can be used to query the history of data loaded into Snowflake event tables within the last 365 days (1 year).

The view displays the history of data loaded and credits billed for your entire Snowflake account.

For more information about event tables, refer to [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md).

For more information about logging and tracing, refer to [Logging, tracing, and metrics](../../developer-guide/logging-tracing/logging-tracing-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time range (in the UTC time zone) in which data loading took place. |
| END_TIME | TIMESTAMP_LTZ | End of the time range (in the UTC time zone) in which data loading took place. |
| CREDITS_USED | NUMBER | Number of credits billed for loading data into the event table during the START_TIME and END_TIME window. |
| BYTES_INGESTED | NUMBER | Number of bytes of data loaded during the START_TIME and END_TIME window. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

---
title: EXTERNAL_ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/external_access_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# EXTERNAL_ACCESS_HISTORY view

This Account Usage view can be used to query the history of external access performed by procedure or UDF handler code within the last
365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | TEXT | ID of the query or job that called the UDF or procedure performing external access. |
| HOSTNAMES | TEXT | Name of the hosts accessed. |
| STATUS | TEXT | Status of the attempt to connect to the external location. One of the following values:   * `Success` if the connection was successful * `Deny` if the connection was denied |
| IP | VARCHAR | IP address for the external network location. |
| SOURCE_CLOUD | TEXT | Name of the cloud provider where the data transfer originated. One of the following:   * `aws` * `gcp` * `azure` |
| SOURCE_REGION | TEXT | Region where the data transfer originated. |
| TARGET_CLOUD | TEXT | Name of the cloud provider to which the data was sent. One of the following:   * `aws` * `gcp` * `azure` * `internet` (for regions not on a cloud provider) |
| TARGET_REGION | TEXT | Region to which the data was sent. `internet` for regions not on a cloud provider. |
| SENT_BYTES | VARIANT | Number of bytes sent to the external endpoint. |
| RECEIVED_BYTES | VARIANT | Number of bytes received from the external endpoint. |

## Usage notes

General notes:

* Each row in the view represents a single IP address that the procedure or UDF accesses. As a result, there might be multiple
  rows with different IP addresses, but with the same query ID. There might also be multiple hostnames mapped to the same IP
  address.
* Latency for the view might be up to 180 minutes (3 hours).

---
title: FEATURE_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/feature_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# FEATURE_POLICIES view

This Account Usage view provides the
[feature policies](../../developer-guide/native-apps/ui-consumer-feature-policies.md) in your account.

Each row in this view corresponds to a different feature policy.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the feature policy. |
| NAME | TEXT | Name of the feature policy. |
| SCHEMA_ID | TEXT | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | TEXT | Schema to which the feature policy belongs. |
| DATABASE_ID | TEXT | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | TEXT | Database to which the feature policy belongs. |
| OWNER | TEXT | Name of the role that owns the feature policy. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example ROLE. If a Snowflake Native App owns the object, the value is APPLICATION. Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| BLOCKED_OBJECT_TYPES_FOR_CREATION | TEXT | A comma-separated list of object types that the feature policy blocks for creation. See [Feature Policies](../../developer-guide/native-apps/ui-consumer-feature-policies.md) for more information. |
| COMMENT | TEXT | Comments entered for the feature policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the feature policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. |
| DELETED | TIMESTAMP_LTZ | Date and time when the feature policy was dropped. |

## Usage notes

* Latency for the view may be up to 120 minutes (two hours).

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FIELDS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/fields.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# FIELDS view

This Account Usage view displays a row for each field in a [structured OBJECT type](../data-types-structured.md)
and a row for the key and value in a [MAP](../data-types-structured.md) in an object (a column in a table) in the
account.

For MAPs, the view contains separate rows for the key and value.

Each row describes the type of the element in the structured ARRAY.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROW_IDENTIFIER | VARCHAR | Type identifier. Use this to join on:   * The DTD_IDENTIFIER column in the [COLUMNS view](../info-schema/columns.md). * The DTD_IDENTIFIER column in the [ELEMENT_TYPES view](../info-schema/element_types.md) (for nested types). * The DTD_IDENTIFIER column in this view (for nested types). |
| FIELD_NAME | VARCHAR | One of the following values:   * For structured OBJECTs, the name of the key. * For MAPs, KEY for the key or VALUE for the value. |
| OBJECT_ID | VARCHAR | Internal/system-generated identifier for the object that uses this OBJECT or MAP type (e.g. name of a table). |
| OBJECT_NAME | VARCHAR | Name of the object that uses this OBJECT or MAP type (e.g. name of a table). |
| OBJECT_TYPE | VARCHAR | Type of the object that uses this OBJECT or MAP type:   * TABLE (if used by a column) |
| OBJECT_SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema for the object that uses this OBJECT or MAP type. |
| OBJECT_SCHEMA | VARCHAR | Schema that contains the object that uses this OBJECT or MAP type. |
| OBJECT_CATALOG_ID | VARCHAR | Internal/system-generated identifier for the database for the object that uses this OBJECT or MAP type. |
| OBJECT_CATALOG | VARCHAR | Database that contains the object that uses this OBJECT or MAP type. |
| ORDINAL_POSITION | NUMBER | The ordinal position of the key in the OBJECT or MAP. The position is 1-based.  For MAPs, the ordinal position of the key is 1, and the ordinal position of the value is 2. |
| DATA_TYPE | VARCHAR | Data type of the value (for OBJECTs) or the key or value (for MAPs). |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string keys or values. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string keys or values. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric keys or values. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric keys or values. |
| NUMERIC_SCALE | NUMBER | Scale of numeric keys or values. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | VARCHAR | Not applicable for Snowflake. |
| INTERVAL_PRECISION | NUMBER | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| COLLATION_CATALOG | VARCHAR | Not applicable for Snowflake. |
| COLLATION_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | The collation specification for this keys or values. |
| UDT_CATALOG | VARCHAR | Not applicable for Snowflake. |
| UDT_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| UDT_NAME | VARCHAR | Not applicable for Snowflake. |
| SCOPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| SCOPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| SCOPE_NAME | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | NUMBER | Maximum cardinality. Currently, this is always set to NULL. |
| DTD_IDENTIFIER | VARCHAR | Nested type identifier. Use this to join on:   * The COLLECTION_TYPE_IDENTIFIER column in the [ELEMENT_TYPES view](../info-schema/element_types.md). * The ROW_IDENTIFIER column in this view (for nested types). |
| IS_NULLABLE | VARCHAR | `Y` if the structured OBJECT or MAP allows NULL values; `N` otherwise. |
| DELETED | TIMESTAMP_LTZ | Date and time when the object was dropped. |

## Usage notes

* Latency for the view may be up to 90 minutes.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.

---
title: FILE_FORMATS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/file_formats.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# FILE_FORMATS view

This Account Usage view displays a row for each file format defined in the account.

File formats are named objects that can be used for loading/unloading data. For more information, see [CREATE FILE FORMAT](../sql/create-file-format.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_FORMAT_ID | NUMBER | Internal/system-generated identifier for the file format. |
| FILE_FORMAT_NAME | VARCHAR | Name of the file format, |
| FILE_FORMAT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the file format. |
| FILE_FORMAT_SCHEMA | VARCHAR | Schema that the file format belongs to. |
| FILE_FORMAT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the file format. |
| FILE_FORMAT_CATALOG | VARCHAR | Database that the file format belongs to. |
| FILE_FORMAT_OWNER | VARCHAR | Name of the role that owns the file format. |
| FILE_FORMAT_TYPE | VARCHAR | File format type of the file format (`CSV`, `JSON`, etc.). |
| RECORD_DELIMITER | VARCHAR | Character that separates records. |
| FIELD_DELIMITER | VARCHAR | Character that separates fields. |
| SKIP_HEADER | NUMBER | Number of lines skipped at the start of the file. |
| DATE_FORMAT | VARCHAR | Date format. |
| TIME_FORMAT | VARCHAR | Time format. |
| TIMESTAMP_FORMAT | VARCHAR | Timestamp format. |
| BINARY_FORMAT | VARCHAR | Binary format. |
| ESCAPE | VARCHAR | String used as the escape character for any field values. |
| ESCAPE_UNENCLOSED_FIELD | VARCHAR | String used as the escape character for unenclosed field values. |
| TRIM_SPACE | BOOLEAN | Whether whitespace is removed from fields. |
| FIELD_OPTIONALLY_ENCLOSED_BY | VARCHAR | Character used to enclose strings. |
| NULL_IF | VARCHAR | A list of strings to be replaced by null. |
| COMPRESSION | VARCHAR | Compression method for the data file. |
| ERROR_ON_COLUMN_COUNT_MISMATCH | VARCHAR | Whether to generate a parsing error if the number of fields in an input file does not match the number of columns in the corresponding table. |
| CREATED | TIMESTAMP_LTZ | Date and time when the file format was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the file format was dropped. |
| COMMENT | VARCHAR | Comment for the file format. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FUNCTIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/functions.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# FUNCTIONS view

This Account Usage view displays a row for each user-defined function (UDF) defined in the account.

For more information about UDFs, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FUNCTION_ID | NUMBER | Internal/system-generated identifier for the UDF. |
| FUNCTION_NAME | VARCHAR | Name of the UDF. |
| FUNCTION_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the UDF. |
| FUNCTION_SCHEMA | VARCHAR | Schema which the UDF belongs to. |
| FUNCTION_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the UDF. |
| FUNCTION_CATALOG | VARCHAR | Database which the UDF belongs to. |
| FUNCTION_OWNER | VARCHAR | Name of the role that owns the UDF. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the UDF’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric return value. |
| FUNCTION_LANGUAGE | VARCHAR | Language of the UDF. |
| FUNCTION_DEFINITION | VARCHAR | UDF definition. |
| VOLATILITY | VARCHAR | Whether the UDF is volatile or immutable. |
| IS_NULL_CALL | VARCHAR | Whether the UDF is called when input is null. |
| CREATED | TIMESTAMP_LTZ | Date and time when the UDF was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the UDF was dropped. |
| COMMENT | VARCHAR | Comment for the function. |
| IS_EXTERNAL [1] | VARCHAR(3) | `YES` if the function is an [external function](../external-functions.md); otherwise, `NO`. |
| API_INTEGRATION [1] | VARCHAR | Name of the API integration object to authenticate the call to the proxy service. |
| CONTEXT_HEADERS [1] | VARCHAR | Context header information for the external function. |
| MAX_BATCH_ROWS [1] | NUMBER | Maximum number of rows in each batch sent to the proxy service. |
| COMPRESSION [1] | VARCHAR | Type of compression. |
| PACKAGES | VARCHAR | Packages requested by the function. |
| RUNTIME_VERSION | VARCHAR | Runtime version of the language used by the function. NULL if the function is SQL or JavaScript. |
| INSTALLED_PACKAGES | VARCHAR | All packages installed by the function. Output for Python functions only. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| IS_MEMOIZABLE | VARCHAR(3) | `YES` if the function is [memoizable](../../developer-guide/udf/sql/udf-sql-scalar-functions.md); otherwise, `NO`. |
| IS_DATA_METRIC | VARCHAR(3) | `YES` if the function is a [data metric function](../../user-guide/data-quality-intro.md); otherwise, `NO`. |
| SECRETS | JSON map | Map of [secrets](../sql/create-secret.md) specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| EXTERNAL_ACCESS_INTEGRATIONS | VARCHAR | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter. |
| IS_AGGREGATE | VARCHAR(3) | `YES` if the function is an aggregate function; otherwise, `NO`. |

[1]
(1,2,3,4,5)

These fields apply only to [Writing external functions](../external-functions.md).

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: GRANTS_TO_ROLES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/grants_to_roles.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# GRANTS_TO_ROLES view

This Account Usage view can be used to query access control privileges that have been granted to an account role, application, application
role, database role, instance role, or user.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is granted to the role. |
| MODIFIED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is updated. |
| PRIVILEGE | VARCHAR | Name of the privilege added to the role. |
| GRANTED_ON | VARCHAR | Object kind, such as `TABLE` or `DATABASE`, on which the privilege is granted. |
| NAME | VARCHAR | Name of the object on which the privilege is granted. |
| TABLE_CATALOG | VARCHAR | Name of the database for the current table or the name of the database that stores the instance of a class. |
| TABLE_SCHEMA | VARCHAR | Name of the schema for the current table or the name of the schema that stores the instance of a class. |
| GRANTED_TO | VARCHAR | `ACCOUNT ROLE`, `APPLICATION`, `APPLICATION_ROLE`, `DATABASE_ROLE`, `INSTANCE_ROLE`, or `USER`. |
| GRANTEE_NAME | VARCHAR | Identifier for the recipient role, the role to which the privilege is granted, or the name of the Snowflake Native App object. |
| GRANT_OPTION | BOOLEAN | `TRUE / FALSE`. If set to `TRUE`, the recipient role can grant the privilege to other roles. |
| GRANTED_BY | VARCHAR | Indicates the role that authorized a privilege grant to the grantee. `GRANTED_BY` displays empty for privileges granted by the SNOWFLAKE system role. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is revoked. |
| GRANTED_BY_ROLE_TYPE | VARCHAR | Either `APPLICATION`, `ROLE` or `DATABASE_ROLE`. |
| OBJECT_INSTANCE | VARCHAR | The fully-qualified name of the object that contains the instance role for a particular class in the format `database.schema.class`. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The GRANTS_TO_ROLES view shows a subset of all supported objects. The supported set is subject to change. The view is updated periodically
  to include support for new objects.
* The view does not contain grants to database roles from databases created from shares.
* The view does not contain grants on dropped objects.
* The `GRANTED_BY` column indicates the role that authorized a privilege grant to the grantee. The authorization role is known as the
  *grantor*.

  When you grant privileges on an object to a role using [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md), the following authorization rules
  determine which role is listed as the grantor of the privilege:

  1. If an [active role](../../user-guide/security-access-control-overview.md) is the object owner (i.e. has the OWNERSHIP privilege on the
     object), that role is the grantor.
  2. If an active role was given privileges on the object by a GRANT PRIVILEGE … WITH GRANT OPTION statement, then the active role is the
     grantor. If multiple active roles meet this criterion and one of these active roles is the primary role, then the primary role is the
     grantor. If there are multiple active roles, and none of them are the primary role, Snowflake randomly selects one of the roles as the
     grantor.
  3. If an active role holds the global MANAGE GRANTS privilege, the grantor role is the object owner, not the role that held the
     MANAGE GRANTS privilege. That is, the MANAGE GRANTS privilege allows a role to impersonate the object owner for the purposes of
     granting privileges on that object.

  The `GRANTED_BY` column displays empty for privileges granted by the Snowflake SYSTEM role. Certain internal operations are
  performed with this role. Grants of privileges authorized by the SYSTEM role cannot be modified by customers.

---
title: GRANTS_TO_SHARES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/grants_to_shares.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# GRANTS_TO_SHARES view

This Account Usage view can be used to query access control privileges that have been granted to a share. The information in this view could have a latency of up to 3 hours.

## Columns

The following table provides definitions for the GRANTS_TO_SHARES view columns.

| Column | Data type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | The date and time when the privilege was granted to the share. |
| MODIFIED_ON | TIMESTAMP_LTZ | The date and time when the privilege was last updated. |
| DELETED_ON | TIMESTAMP_LTZ | The date and time when the privilege was revoked from the share. This value is null if the privilege hasn’t been revoked. |
| PRIVILEGE | VARCHAR | The name of the privilege granted on the object. |
| GRANTED_ON | VARCHAR | The kind of the object on which the privilege was granted. |
| OBJECT_NAME | VARCHAR | The name of the object on which the privilege was granted. |
| OBJECT_DATABASE | VARCHAR | The database that contains the object on which the privilege was granted. A null value indicates that the object is not database-scoped. |
| OBJECT_SCHEMA | VARCHAR | The schema that contains the object on which the privilege was granted. A null value indicates that the object is not schema-scoped. |
| SHARE_NAME | VARCHAR | The name of the share to which the privilege was granted. |
| GRANTED_BY | VARCHAR | The role that granted the privilege. A null value indicates that the privilege is a system grant. |
| GRANTED_BY_ROLE_TYPE | VARCHAR | The type of role that granted the privilege. Values are `ROLE` and `DATABASE_ROLE`. |

## Usage notes

* This view doesn’t include access control privileges to the shares that have been dropped.
* This view records current grants and historical grants, including grants that were revoked or granted again.
* This view supports common data object types that can be granted to a share, including Database, Schema, Table, View, Function, Database
  Role, and so on.

---
title: GRANTS_TO_USERS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/grants_to_users.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# GRANTS_TO_USERS view

This Account Usage view can be used to query the roles that have been granted to a user.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the role is granted. |
| DELETED_ON | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the role is revoked. |
| ROLE | VARCHAR | Identifier for the role granted to the user. |
| GRANTED_TO | VARCHAR | For this view, the value is `USER`. |
| GRANTEE_NAME | VARCHAR | Name of the user to whom the privilege is granted. |
| GRANTED_BY | VARCHAR | Identifier for the role that granted the privilege. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The GRANTS_TO_USERS view **does not** include grants of privileges and non-account roles to users. For that information, see the
  [GRANTS_TO_ROLES view](../organization-usage/grants_to_roles.md).
* This view records current grants and historical grants, including grants that were revoked and granted again. When a single grant occurs
  and as long as it remains active (that is, not revoked):

  + The view includes one row for the grant of the same role to the same user.
  + A regrant of the same role to the same user is not recorded as a new row. Instead, the DELETED_ON column remains NULL while the grant
    is active.
* When a grant is revoked from the user, the DELETED_ON column for the grant is updated from NULL to the timestamp when the grant was
  revoked.
* After revoking the role from the user, a grant of the same role to the same user is recorded in a new row. In this new row, the
  DELETED_ON column value is NULL because the grant is now active.

---
title: HYBRID_TABLE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/hybrid_table_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# HYBRID_TABLE_USAGE_HISTORY view

> **Note:**
>
> As of March 1, 2026, Snowflake no longer bills customers for hybrid table requests,
> and metering was disabled soon after this pricing change took effect. Any new data
> in the view as of March 1, 2026, will not be billed to customers, and you can still
> query the historical data in the view.

This Account Usage view displays consumption of hybrid table requests
(serverless compute resources), in terms of credits billed for
your entire Snowflake account, within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| OBJECT_TYPE | TEXT | Type of object referenced for scope of consumption: `ACCOUNT` for hybrid tables in your account. |
| OBJECT_ID | NUMBER | Internal identifier of object referenced for scope of consumption: `NULL` because scope of consumption for hybrid tables is tracked at the account level. |
| OBJECT_NAME | TEXT | Name of object referenced for scope of consumption: `NULL` because scope of consumption for hybrid tables is tracked at the account level. |
| START_TIME | TIMESTAMP_LTZ | Date and start time (in the local time zone) when usage of hybrid tables occurred. |
| END_TIME | TIMESTAMP_LTZ | Date and end time (in the local time zone) when usage of hybrid tables occurred. |
| CREDITS_USED | NUMBER | Number of credits used for hybrid table requests between the values for `START_TIME` and `END_TIME`. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* This view may return usage data that is slightly inconsistent with metrics
  returned in [METERING_DAILY_HISTORY view](metering_daily_history.md) and
  [METERING_HISTORY view](metering_history.md). The discrepancy in the
  calculation of credits used is due to rounding during division.

## Examples

The following queries return the total number of credits used by hybrid tables in your
account over specific periods of time.

The first query returns credits used for all time (the past year):

```sqlexample
SELECT SUM(credits_used) AS total_credits
  FROM SNOWFLAKE.ACCOUNT_USAGE.HYBRID_TABLE_USAGE_HISTORY;
```

The second query returns credits used over the past 5 days. Alternatively, you could specify some number
of weeks or months:

```sqlexample
SELECT SUM(credits_used) AS total_credits
  FROM SNOWFLAKE.ACCOUNT_USAGE.HYBRID_TABLE_USAGE_HISTORY
  WHERE start_time >= DATEADD(day, -5, CURRENT_TIMESTAMP());
```

---
title: HYBRID_TABLES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/hybrid_tables.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# HYBRID_TABLES view

This Account Usage view displays a row for each hybrid table defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | ID of the hybrid table. |
| NAME | TEXT | Name of the hybrid table. |
| SCHEMA_ID | NUMBER | ID of the schema to which the hybrid table belongs. |
| SCHEMA_NAME | TEXT | Schema to which the hybrid table belongs. |
| DATABASE_ID | NUMBER | ID of the database to which the hybrid table belongs. |
| DATABASE_NAME | TEXT | Database to which the hybrid table belongs. |
| OWNER | TEXT | Owner of the hybrid table. |
| ROW_COUNT | NUMBER | Approximate row count of the hybrid table. |
| BYTES | NUMBER | Approximate size in bytes of the row store of the hybrid table. |
| RETENTION_TIME | NUMBER | Retention time for data in the hybrid table. |
| CREATED | TIMESTAMP_LTZ | Creation time of the hybrid table. |
| LAST_ALTERED | TIMESTAMP_LTZ | Last time this hybrid table was altered by a DDL statement, a TRUNCATE or INSERT OVERWRITE statement, or a compaction job. Note that regular DML operations are not recorded here. |
| DELETED | TIMESTAMP_LTZ | Date and time when the hybrid table was dropped. |
| COMMENT | TEXT | Comment for the hybrid table. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

---
title: ICEBERG_STORAGE_OPTIMIZATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/iceberg_storage_optimization_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ICEBERG_STORAGE_OPTIMIZATION_HISTORY view

Use this Account Usage view to query Iceberg storage optimization jobs, which includes *data compaction*, within the last 365 days (1 year)
for Apache Iceberg™ tables in your account. You can query jobs for the following Iceberg tables:

* Snowflake-managed tables
* Open Catalog-managed tables

> **Note:**
>
> * Snowflake starts billing for data compaction of data files for Snowflake-managed Iceberg tables on October 20th, 2025.
> * To enable or disable data compaction on Snowflake-managed Iceberg tables, see [Set data compaction](../../user-guide/tables-iceberg-manage.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the operations were performed. |
| END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the operations were performed. |
| CREDITS_USED | NUMBER | Number of credits billed for data compaction during the START_TIME and END_TIME window. |
| NUM_BYTES_SCANNED | NUMBER | Number of bytes scanned during the START_TIME and END_TIME window. |
| NUM_ROWS_WRITTEN | NUMBER | Number of rows compacted during the START_TIME and END_TIME window. |
| TABLE_ID | NUMBER | Internal, system-generated identifier for the Iceberg table in Snowflake. |
| TABLE_NAME | VARCHAR | Name of the Iceberg table defined in Snowflake. |
| ICEBERG_TABLE_UUID | VARCHAR | Apache Iceberg™ table identifier, generated by the external Iceberg engine or catalog. |
| SCHEMA_ID | VARCHAR | System-generated identifier for the Snowflake schema that the table is in. |
| SCHEMA_NAME | VARCHAR | Name of the schema the table is in. |
| DATABASE_ID | NUMBER | System-generated identifier for the Snowflake database that the schema and table belong to. |
| DATABASE_NAME | VARCHAR | Name of the database that the schema and table belong to. |
| INSTANCE_ID | NUMBER | Internal, system-generated identifier for the instance that the object belongs to. |

## Usage notes

* Latency for the view is up to 2 hours.
* The view contains historical usage data for the last 365 days.
* The USAGE_VIEWER role is granted the SELECT privilege on this view. For more information, see [SNOWFLAKE database roles](../snowflake-db-roles.md).
* This view doesn’t include data compaction information for externally managed Iceberg table that aren’t managed by Open Catalog.

## Examples

The following example shows how to filter for tables whose number of credits billed is more than a specified amount:

```sqlexample
  SELECT
      table_name,
      start_time,
      credits_used
    FROM SNOWFLAKE.ACCOUNT_USAGE.ICEBERG_STORAGE_OPTIMIZATION_HISTORY
    WHERE credits_used > 0.0005
    ORDER BY
      credits_used DESC;

The query returns the following results:
```

```output
+------------------+-------------------------------+--------------+
| TABLE_NAME       | START_TIME                    | CREDITS_USED |
+------------------+-------------------------------+--------------+
| my_iceberg_table | 2025-09-15 09:00:00.000 -0700 | 0.000529445  |
| my_iceberg_table | 2025-09-15 08:00:00.000 -0700 | 0.000516791  |
+------------------+-------------------------------+--------------+
```

---
title: INDEX_COLUMNS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/index_columns.md
section: Account Usage
---

# INDEX_COLUMNS view

This Account Usage schema view displays a row for each column in the indexes defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | ID of the column. |
| NAME | TEXT | Name of the column. |
| INDEX_ID | NUMBER | ID of the index. |
| INDEX_NAME | TEXT | Name of the index. |
| TABLE_ID | NUMBER | ID of the hybrid table. |
| TABLE_NAME | TEXT | Name of the hybrid table. |
| SCHEMA_ID | TEXT | ID of the schema to which the hybrid table belongs. |
| SCHEMA_NAME | TEXT | Schema to which the hybrid table belongs. |
| DATABASE_ID | NUMBER | ID of the database to which the hybrid table belongs. |
| DATABASE_NAME | TEXT | Database to which the hybrid table belongs. |
| KEY_SEQUENCE | NUMBER | Position of the column in the index. |
| INDEX_OWNER | TEXT | Owner of the index. |
| IS_UNIQUE | TEXT | With `YES` or `NO`, indicates whether this index is a unique index. |
| IS_INCLUDED_COLUMN | TEXT | With `YES` or `NO`, indicates whether this column is covered by an index. |
| CONSTRAINT_NAME | TEXT | Name of the constraint that is associated with this index. |
| STATUS | TEXT | Status of this index. |
| CREATED | TIMESTAMP_LTZ | Time of creation for this index. |
| DELETED | TIMESTAMP_LTZ | Date and time when the hybrid table was dropped. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

---
title: INDEXES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/indexes.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# INDEXES view

This Account Usage view displays a row for each index defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | ID of the index. |
| NAME | TEXT | Name of the index. |
| TABLE_ID | NUMBER | ID of the hybrid table. |
| TABLE_NAME | TEXT | Name of the hybrid table. |
| SCHEMA_ID | TEXT | ID of the schema to which the hybrid table belongs. |
| SCHEMA_NAME | TEXT | Schema to which the hybrid table belongs. |
| DATABASE_ID | NUMBER | ID of the database to which the hybrid table belongs. |
| DATABASE_NAME | TEXT | Database to which the hybrid table belongs. |
| OWNER | TEXT | Owner of the hybrid table. |
| IS_UNIQUE | TEXT | With `YES` or `NO`, indicates whether this index is a unique index. |
| CONSTRAINT_NAME | TEXT | Name of the constraint that is associated with this index. |
| STATUS | TEXT | Latest status of this index. |
| CREATED | TIMESTAMP_LTZ | Time of creation for this index. |
| DELETED | TIMESTAMP_LTZ | Date and time when the hybrid table was dropped. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

Latency for the view may be up to 180 minutes (3 hours).

---
title: INGRESS_NETWORK_ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/ingress_network_access_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# INGRESS_NETWORK_ACCESS_HISTORY view

This Account Usage view can be used to query any network access attempts to your Snowflake account within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Time, (in the UTC time zone) of the ingress event occurrence. |
| REQUEST_ID | VARCHAR | Ingress request ID. |
| REQUEST METHOD | VARCHAR. | Ingress request method, such as GET or POST. |
| REQUEST PATH | VARCHAR | Ingress request path. |
| USER_NAME | VARCHAR. | User associated with this event. |
| CLIENT_IP | VARCHAR | Client IP. |
| CLIENT_PRIVATELINK_ID | VARCHAR | Client private link ID. |
| BYTES_RX | NUMBER | Bytes transferred into Snowflake. |
| BYTES_TX | NUMBER | Bytes transferred out of Snowflake. |
| IS_SUCCESS | BOOLEAN | Whether the user’s request was successful or not. |
| ERROR_CODE | VARCHAR | Error code, if the request was not successful. |
| ERROR_MESSAGE | VARCHAR | Error message returned to the user, if the request was not successful. |
| JOB_UUID | VARCHAR | Number automatically assigned to a query job. |
| REQUEST_AUTHORITY | VARCHAR | The URL that a client used to access a Snowflake ingress location. |

## Usage notes

* Latency for the view may be up to 240 minutes (4 hours).

* Although the [INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](internal_stage_network_access_history.md) is user-enabled, the INGRESS_NETWORK_ACCESS_HISTORY view is enabled by default
  in your Snowflake account.

---
title: INTERNAL_DATA_TRANSFER_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/internal_data_transfer_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# INTERNAL_DATA_TRANSFER_HISTORY view

Use this view to get a historical view of Snowpark Container Services internal data transfers in your account for the last 365 days.

This view reports the following two types of internal data transfers:

* **SERVICE_FUNCTION:** When a [service function](../../developer-guide/snowpark-container-services/working-with-services.md) is invoked, it sends a request to its associated service. Note that the query invoking the service functions executes in a warehouse, while the service runs in a compute pool. There is an internal data transfer cost associated with it. The view captures any data exchanged during the request and response as an internal data transfer of the SERVICE_FUNCTION type.
* **COMPUTE_POOL:** Through [service-to-service communication](../../developer-guide/snowpark-container-services/working-with-services.md), a service can transfer data to another service running in a different compute pool. This incurs internal data transfer costs, which the view reports as data transfer cost with the COMPUTE_POOL transfer type.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the data transfer took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the data transfer took place. |
| TRANSFER_TYPE | VARCHAR | It is either `SERVICE_FUNCTION` or `COMPUTE_POOL`. |
| COMPUTE_POOL_NAME | VARCHAR | If the transfer type is `SERVICE_FUNCTION`, it represents the name of the compute pool that the service function interacts with. If the transfer type is `COMPUTE_POOL`, it represents the source compute pool that initiated the traffic. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred during the START_TIME and END_TIME window. |

## Usage notes

* Latency for the view can be up to 180 minutes (3 hours).

---
title: INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/internal_stage_network_access_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view

This Account Usage view can be used to query any network access attempts to an internal stage within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Cloud service provider event timestamp. |
| EVENT_ID | VARCHAR | Cloud service provider event ID. |
| EVENT_NAME | VARCHAR | Cloud service provider event name. |
| EVENT_TYPE | VARCHAR | Cloud service provider event type. |
| CLOUD_PROVIDER | VARCHAR | Cloud service provider. |
| USER_NAME | VARCHAR | User associated with this event. |
| CLIENT_IP | VARCHAR | Client IP accessing the internal stage. |
| CLIENT_PRIVATELINK_ID | VARCHAR | Client private link ID accessing the internal stage: for example, a VPCE ID. |
| BYTES_IN | NUMBER | Bytes transferred into the stage. |
| BYTES_OUT | NUMBER | Bytes transferred out of the stage. |
| IS_SUCCESS | BOOLEAN | Whether the user’s request was successful or not. |
| ERROR_CODE | VARCHAR | Error code, if the request was not successful. |
| ERROR_MESSAGE | VARCHAR | Error message returned to the user, if the request was not successful. |
| AUTHENTICATION_METHOD | VARCHAR | Cloud service provider authentication method, such as `AuthHeader` or `QueryString`. |
| STAGE_PATH | VARCHAR | Network directory path for the internal stage location. For example, the path appearing after the AWS bucket name in the URL. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* To enable the view in your account, call the [SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS](../functions/system_opt_in_internal_stage_network_logs.md) function.
* To disable the view in your account, call the [SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS](../functions/system_opt_out_internal_stage_network_logs.md) function.

* Network access record collection starts at the time you enable the view.
* Network access records are retained for 1 year, starting at the time you enable the view.

---
title: JOIN_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/join_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# JOIN_POLICIES view

This Account Usage view lists the [join policies](../../user-guide/join-policies.md) in your account.

Each row in this view corresponds to a different join policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the policy. |
| POLICY_NAME | VARCHAR | Name of the policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema that contains the policy. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The view only displays objects for which the current role for the session has been granted access privileges.

## Example

```sqlexample
SELECT policy_name, policy_body, created
  FROM SNOWFLAKE.ACCOUNT_USAGE.JOIN_POLICIES
  WHERE policy_name='JP2' AND created LIKE '2024-11-26%';
```

```output
+-------------+----------------------------------------------------------+-------------------------------+
| POLICY_NAME | POLICY_BODY                                              | CREATED                       |
|-------------+----------------------------------------------------------+-------------------------------|
| JP2         | CASE                                                     | 2024-11-26 11:22:54.848 -0800 |
|             |           WHEN CURRENT_ROLE() = 'ACCOUNTADMIN'           |                               |
|             |             THEN JOIN_CONSTRAINT(JOIN_REQUIRED => FALSE) |                               |
|             |           ELSE JOIN_CONSTRAINT(JOIN_REQUIRED => TRUE)    |                               |
|             |         END                                              |                               |
+-------------+----------------------------------------------------------+-------------------------------+
```

---
title: LISTINGS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/listings.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# LISTINGS view

This Account Usage view returns all the listings owned by the current account, including dropped listings. The information in this view has a latency of up to 3 hours.

## Columns

The following table provides definitions for the LISTINGS view columns.

| Column | Data type | Description |
| --- | --- | --- |
| GLOBAL_NAME | VARCHAR | The global name of the listing. |
| NAME | VARCHAR | The object name of the listing. |
| OWNER | VARCHAR | The name of the role that owns the listing. |
| CREATED_ON | TIMESTAMP_LTZ | The timestamp when the listing was created. |
| UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the listing was last updated. |
| PUBLISHED_ON | TIMESTAMP_LTZ | The timestamp when the listing was published. |
| DELETED_ON | TIMESTAMP_LTZ | The timestamp when the listing was deleted. This value is NULL if the listing hasn’t been deleted. |
| TITLE | VARCHAR | The title of the listing. |
| SUBTITLE | VARCHAR | The subtitle of the listing. |
| DESCRIPTION | VARCHAR | The description of the listing. |
| LISTING_TERMS | OBJECT | The terms of service associated with the listing. |
| STATE | VARCHAR | The current state of the listing. |
| SHARE | VARCHAR | The name of the share associated with the listing. |
| APPLICATION_PACKAGE | VARCHAR | The name of the application package associated with the listing. This is only populated if `IS_APPLICATION` is true. |
| DATA_ATTRIBUTES | OBJECT | Data attributes associated with the listing. |
| CATEGORIES | VARCHAR | Categories associated with the listing. |
| PROFILE | VARCHAR | The profile attached to the external listing. |
| CUSTOMIZED_CONTACT_INFO | VARCHAR | Customized contact information associated with the listing. |
| COMMENT | VARCHAR | Comment associated with the listing, if any. |
| TARGETS | OBJECT | Targets consolidating external/organizational listings with regions. |
| AUTO_FULFILLMENT | OBJECT | Auto-fulfillment information associated with the listing. |
| IS_SHARE | BOOLEAN | Indicates whether this is a data share listing. |
| IS_APPLICATION | BOOLEAN | Indicates whether this is an application listing. |
| DISTRIBUTION | VARCHAR | The distribution of the listing. Possible values are `EXTERNAL` and `ORGANIZATION`. |
| ORGANIZATION_PROFILE_NAME | VARCHAR | The organization profile attached to the listing. |
| UNIFORM_LISTING_LOCATOR | VARCHAR | The uniform listing locator (ULL) of the listing. |
| APPROVER_CONTACT | VARCHAR | The approver contact information associated with the listing. |
| SUPPORT_CONTACT | VARCHAR | The support contact information associated with the listing. |
| RESHARING | OBJECT | Resharing configuration of the listing. |

---
title: LOAD_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/load_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# LOAD_HISTORY view

This Account Usage view enables you to retrieve the history of data loaded into tables using the [COPY INTO <table>](../sql/copy-into-table.md) command within the last 365 days (1 year). The view displays one row for each file loaded.

> **Note:**
>
> This view does not return the history of data loaded using Snowpipe. For this historical information, query the [COPY_HISTORY](copy_history.md) view instead.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the target table |
| TABLE_NAME | VARCHAR | Name of target table |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the target table |
| SCHEMA_NAME | VARCHAR | Schema of target table |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the target table |
| CATALOG_NAME | VARCHAR | Database of target table |
| FILE_NAME | VARCHAR | Name of source file |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Date and time (in the UTC time zone) of the load record |
| STATUS | VARCHAR | Status: `LOADED`, `LOAD FAILED`, or `PARTIALLY LOADED` |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file |
| FIRST_ERROR_MESSAGE | VARCHAR | First error of the source file |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error |
| FIRST_ERROR_CHARACTER_POSITION | NUMBER | Position of the first error character |
| FIRST_ERROR_COL_NAME | VARCHAR | Column name of the first error |
| ERROR_COUNT | NUMBER | Number of error rows in the source file |
| ERROR_LIMIT | NUMBER | If the number of error reach this limit, then abort |

## Usage notes

* In most cases, latency for the view may be up to 90 minutes. The latency for a given table’s load history in the view may be up to 2 days if both of the following conditions are true:

  + Fewer than 32 DML statements have been added to the given table since it was last updated in LOAD_HISTORY.
  + Fewer than 100 rows have been added to the given table since it was last updated in LOAD_HISTORY.

* The view only includes COPY INTO commands that executed to completion, with or without errors. No record is added if the transaction is rolled back, for example, or if the ON_ERROR = ABORT_STATEMENT copy option is included in the COPY INTO *<table>* statement and a detected error in a data file aborts the load operation.
* When including a WHERE clause that references the `LAST_LOAD_TIME` column, you can specify any day of the week. For example, April 1, 2016 was a Friday; however, specifying Sunday instead does not
  affect the query results:

  ```sqlexample
  WHERE last_load_time > 'Sun, 01 Apr 2016 16:00:00 -0800'
  ```

* After the replication of load history, the LOAD_HISTORY Account Usage view shows the history only after the latest truncate operation on the target table. This is different from the view without replication, which shows a complete data loading history.

## Examples

Retrieve records for the 10 most recent COPY INTO commands executed:

> ```sqlexample
> SELECT file_name, last_load_time FROM snowflake.account_usage.load_history
>   ORDER BY last_load_time DESC
>   LIMIT 10;
> ```

---
title: LOCK_WAIT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/lock_wait_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# LOCK_WAIT_HISTORY view

This Account Usage view includes the history of [transactions](../transactions.md) that wait on locks.
For details, see [Analyzing blocked transactions with the LOCK_WAIT_HISTORY view](../transactions.md).

## Columns

| OBJECT_ID | NUMBER | Internal/system-generated identifier for the blocking object (such as a table) on which the transaction is waiting for a lock. |
| --- | --- | --- |
| LOCK_TYPE | VARCHAR | Type of lock. Valid values are `PARTITION`, `STREAM`, `TABLE`, and `ROW`. `ROW` is shown for hybrid table locks. |
| OBJECT_NAME | VARCHAR | Identifier for the object (such as a table) on which the transaction is waiting for a lock. `ROW` is shown for hybrid table locks. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the object on which the transaction is waiting for a lock. `0` is shown for hybrid tables. |
| SCHEMA_NAME | VARCHAR | Identifier for the schema of the object on which the transaction is waiting for a lock. NULL is shown for `ROW` locks. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the object on which the transaction is waiting for a lock. |
| DATABASE_NAME | VARCHAR | Identifier for the database of the object on which the transaction is waiting for a lock. |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement that is waiting on the lock. |
| TRANSACTION_ID | NUMBER | Internal/system-generated [identifier for the transaction](../transactions.md) with the statement that is waiting on the lock. Can be joined with the [QUERY_HISTORY view](query_history.md) for additional details about the statements in the transaction. |
| REQUESTED_AT | TIMESTAMP_LTZ | Timestamp when the lock was requested by the transaction waiting for the lock. |
| ACQUIRED_AT | TIMESTAMP_LTZ | Timestamp when the lock was acquired by the transaction holding the lock. |
| BLOCKER_QUERIES | VARIANT | JSON array of objects. Each object is a blocker query with the following properties:   * `is_snowflake`: TRUE if the query is a background process run by Snowflake (e.g., automatic maintenance of   materialized views). * `query_id`: Query ID of the current statement in the blocker transaction that blocked the statement. Empty if   `is_snowflake` is true. * `transaction_id`: ID of the blocker transaction. Empty if `is_snowflake` is true.   There may be up to 20 objects in this array. |

## Usage notes

* The first blocker query ID that is returned in the `blocker_queries` array is the ID of the query that was being executed
  in the transaction that holds the lock when the transaction waiting for the lock started waiting.
  Note that it is possible that queries prior to that query in the blocker transaction also acquired the lock and should be investigated.
* Each row in the output represents a transaction waiting on a lock. Note that there may be other transactions ahead
  of that transaction, waiting on the same lock.

## Examples

Find all the blocked transactions that requested locks within the past 24 hours:

```sqlexample
SELECT query_id, object_name, transaction_id, blocker_queries
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOCK_WAIT_HISTORY
  WHERE requested_at >= DATEADD('hours', -24, CURRENT_TIMESTAMP());
```

For additional examples, see [Analyzing blocked transactions with the LOCK_WAIT_HISTORY view](../transactions.md).

---
title: LOGIN_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/login_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# LOGIN_HISTORY view

This Account Usage view can be used to query login attempts by Snowflake users within the last 365 days (1 year).

Details about the error codes/messages for login attempts that were unsuccessful can be found in the following documentation:

* [Federated authentication & SSO error codes](../../user-guide/errors-saml.md)
* [Multi-factor authentication (MFA) error codes](../../user-guide/security-mfa-duo.md)
* [OAuth error codes](../../user-guide/oauth-snowflake-overview.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| READER_ACCOUNT_NAME | VARCHAR | Name of the reader account for the user authentication event. This column is only included in the view in the READER_ACCOUNT_USAGE schema. |
| EVENT_ID | NUMBER | Internal/system-generated identifier for the login attempt. |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Time (in the UTC time zone) of the event occurrence. |
| EVENT_TYPE | VARCHAR | Event type, such as LOGIN for authentication events. |
| USER_NAME | VARCHAR | User associated with this event. |
| CLIENT_IP | VARCHAR | IP address where the request originated. |
| REPORTED_CLIENT_TYPE | VARCHAR | Reported type of the client software, such as JDBC_DRIVER, ODBC_DRIVER, and so on. This information is not authenticated. |
| REPORTED_CLIENT_VERSION | VARCHAR | Reported version of the client software. This information is not authenticated. |
| FIRST_AUTHENTICATION_FACTOR | VARCHAR | Method used to authenticate the user (the first factor in multi factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR | VARCHAR | The second factor in multi factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| IS_SUCCESS | VARCHAR | Whether the user’s request was successful or not. |
| ERROR_CODE | NUMBER | Error code, if the request was not successful. |
| ERROR_MESSAGE | VARCHAR | Error message returned to the user, if the request was not successful. |
| RELATED_EVENT_ID | NUMBER | Reserved for future use. |
| CONNECTION | VARCHAR | Name of the connection used by the client, or NULL if the client is not using a connection URL. A connection is a Snowflake object that is part of [Client Redirect](../../user-guide/client-redirect.md). It represents a connection URL that you can use to fail over to another account for business continuity and disaster recovery. . , NOTE: If a client authenticates through an identity provider (IdP) that is configured with the account URL rather than the connection URL, the IdP directs the client to the account URL after authentication is complete. The CONNECTION column for this login event is NULL. See [Authentication and Client Redirect](../../user-guide/client-redirect.md). |
| CLIENT_PRIVATE_LINK_ID | VARCHAR | If the user logged in using [private connectivity](../../user-guide/private-connectivity-inbound.md), specifies the identifier of the endpoint from which the request originated. |
| FIRST_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](credentials.md) used to authenticate the user (the first factor in multi-factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](credentials.md) used for the second factor in multi-factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| LOGIN_DETAILS | VARCHAR | Displays details for each login event, including malicious IP protection category name, risk category, and blocking status. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* `INTERNAL_SNOWFLAKE_IP/0.0.0.0` appears as the client IP for login events triggered by internal Snowflake operations that support
  your usage. For example:

  + Because worksheets exist as unique sessions, when a user accesses a worksheet in [Snowsight](../../user-guide/ui-snowsight-gs.md),
    Snowflake creates a login event that originates from `INTERNAL_SNOWFLAKE_IP/0.0.0.0`.
  + When a Snowpark Container Services [service](../../developer-guide/snowpark-container-services/overview.md) logs into Snowflake, the client
    IP is masked to `INTERNAL_SNOWFLAKE_IP/0.0.0.0`.
* This view doesn’t record the activity of internal users the system defines to perform various operations, such as maintaining
  Snowsight worksheets.
* To see the blocking status of potentially malicious IP addresses, examine the LOGIN_DETAILS column output. For examples, see [View network login details](../../user-guide/malicious-ip-protection.md).

---
title: MASKING_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/masking_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# MASKING_POLICIES view

This Account Usage view provides the masking policies in your account.

Each row in this view corresponds to a different masking policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the masking policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the masking policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema to which the masking policy belongs. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the masking policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the masking policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the masking policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Masking policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the masking policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the masking policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the masking policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OPTIONS | VARIANT | The value for the EXEMPT_OTHER_POLICIES property in the policy. If set to `TRUE`, the column returns `{ "EXEMPT_OTHER_POLICIES: "TRUE" }`. If the property is set to `FALSE` or not set at all, the column returns NULL. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: MATERIALIZED_VIEW_REFRESH_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/materialized_view_refresh_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# MATERIALIZED_VIEW_REFRESH_HISTORY view

This Account Usage view can be used to query the [materialized views](../../user-guide/views-materialized.md) refresh history. The information returned by the view includes the view name and credits consumed each time a materialized view is refreshed.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | VARCHAR | Number of credits billed for materialized view maintenance during the START_TIME and END_TIME window. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the materialized view. |
| TABLE_NAME | VARCHAR | Name of the materialized view. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the materialized view. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the materialized view. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the materialized view. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the materialized view. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

* The history is displayed in increments of 1 hour.

---
title: METERING_DAILY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/metering_daily_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# METERING_DAILY_HISTORY view

The METERING_DAILY_HISTORY view in the ACCOUNT_USAGE schema can be used to return the daily credit usage and a cloud services rebate for an account within the last 365 days (1 year).

> **Note:**
>
> As of March 1, 2026, Snowflake no longer bills customers for hybrid table requests,
> and metering was disabled soon after this pricing change took effect. Any new data
> in the view as of March 1, 2026, will not be billed to customers, and you can still
> query the historical data in the view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Type of service that is consuming credits. The following list includes many, **but not all**, of the possible service types:   * `AI_SERVICES`: See [Snowflake Cortex AI Functions (including LLM functions)](../../user-guide/snowflake-cortex/aisql.md) and [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md). * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTO_CLUSTERING`: See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `BACKUP`: See [Backups for disaster recovery and immutable storage](../../user-guide/backups.md). * `COPY_FILES`: See [COPY FILES](../sql/copy-files.md). * `DATA_QUALITY_MONITORING`: See [Introduction to data quality checks](../../user-guide/data-quality-intro.md). * `FAILSAFE_RECOVERY`: See [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md). * `HYBRID_TABLE_REQUESTS`: See [Hybrid tables](../../user-guide/tables-hybrid.md). * `MATERIALIZED_VIEW`: See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OPENFLOW_COMPUTE_BYOC`: See [Openflow BYOC cost and scaling considerations](../../user-guide/data-integration/openflow/cost-byoc.md). * `OPENFLOW_COMPUTE_SNOWFLAKE`: See [Openflow Snowflake Deployment cost and scaling considerations](../../user-guide/data-integration/openflow/cost-spcs.md). * `PIPE`: See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `POSTGRES_COMPUTE`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `POSTGRES_COMPUTE_HA`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `QUERY_ACCELERATION`: See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md). * `REPLICATION`: See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `SEARCH_OPTIMIZATION`: See [Search optimization service](../../user-guide/search-optimization-service.md). * `SENSITIVE_DATA_CLASSIFICATION`: See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS`: See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK`: See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPARK_CONTAINER_SERVICES`: See [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). * `SNOWPIPE_STREAMING`: See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION`: Compute cost to apply a policy on a target table and expire or archive rows (policy execution). See [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md). * `TELEMETRY_DATA_INGEST`: See [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md). * `TRUST_CENTER`: See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING`: See [Overview of warehouses](../../user-guide/warehouses-overview.md). * `WAREHOUSE_METERING_READER`: See [Manage reader accounts](../../user-guide/data-sharing-reader-create.md). |
| USAGE_DATE | DATE | Date when the usage took place. |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits billed for warehouses, serverless compute, and [Openflow](../../user-guide/data-integration/openflow/about.md) resources in the day. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER | Number of credits billed for cloud services in the day. Always `0` when the SERVICE_TYPE is one of the Openflow types. |
| CREDITS_USED | NUMBER | Sum of CREDITS_USED_COMPUTE and CREDITS_USED_CLOUD_SERVICES. |
| CREDITS_ADJUSTMENT_CLOUD_SERVICES | NUMBER | Number of credits [adjusted for cloud services](../../user-guide/cost-understanding-compute.md). This is a negative value (e.g. `-9`). |
| CREDITS_BILLED | NUMBER | Total number of credits billed for the account in the day. This is a sum of CREDITS_USED_COMPUTE, CREDITS_USED_CLOUD_SERVICES, and CREDITS_ADJUSTMENT_CLOUD_SERVICES. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

## Example

[Usage for cloud services](../../user-guide/cost-understanding-compute.md) is billed only if the daily consumption of cloud
services exceeds 10% of the daily usage of virtual warehouses. This query returns how much of cloud services consumption was actually
billed for a particular day, ordered by the highest billed amount.

```sqlexample
SELECT
    usage_date,
    credits_used_cloud_services,
    credits_adjustment_cloud_services,
    credits_used_cloud_services + credits_adjustment_cloud_services AS billed_cloud_services
FROM snowflake.account_usage.metering_daily_history
WHERE usage_date >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    AND credits_used_cloud_services > 0
ORDER BY 4 DESC;
```

---
title: METERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/metering_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# METERING_HISTORY view

The METERING_HISTORY view in the ACCOUNT_USAGE schema can be used to return the hourly credit usage for an account within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Type of service that is consuming credits. The following list includes many, **but not all**, of the possible service types:   * `AI_SERVICES`: See [Snowflake Cortex AI Functions (including LLM functions)](../../user-guide/snowflake-cortex/aisql.md) and [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md). * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTO_CLUSTERING`: See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `BACKUP`: See [Backups for disaster recovery and immutable storage](../../user-guide/backups.md). * `COPY_FILES`: See [COPY FILES](../sql/copy-files.md). * `DATA_QUALITY_MONITORING`: See [Introduction to data quality checks](../../user-guide/data-quality-intro.md). * `FAILSAFE_RECOVERY`: See [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md). * `HYBRID_TABLE_REQUESTS`: See [Hybrid tables](../../user-guide/tables-hybrid.md). * `MATERIALIZED_VIEW`: See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OPENFLOW_COMPUTE_BYOC`: See [Openflow BYOC cost and scaling considerations](../../user-guide/data-integration/openflow/cost-byoc.md). * `OPENFLOW_COMPUTE_SNOWFLAKE`: See [Openflow Snowflake Deployment cost and scaling considerations](../../user-guide/data-integration/openflow/cost-spcs.md). * `PIPE`: See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `POSTGRES_COMPUTE`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `POSTGRES_COMPUTE_HA`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `QUERY_ACCELERATION`: See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md). * `REPLICATION`: See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `SEARCH_OPTIMIZATION`: See [Search optimization service](../../user-guide/search-optimization-service.md). * `SENSITIVE_DATA_CLASSIFICATION`: See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS`: See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK`: See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPARK_CONTAINER_SERVICES`: See [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). * `SNOWPIPE_STREAMING`: See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION`: Compute cost to apply a policy on a target table and expire or archive rows (policy execution). See [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md). * `TELEMETRY_DATA_INGEST`: See [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md). * `TRUST_CENTER`: See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING`: See [Overview of warehouses](../../user-guide/warehouses-overview.md). * `WAREHOUSE_METERING_READER`: See [Manage reader accounts](../../user-guide/data-sharing-reader-create.md). |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| ENTITY_ID | NUMBER | A system-generated identifier for the entity associated with the service.  In most cases, this is the internal ID of the monitored entity; for example, a pipe, task, or replication group.  When the SERVICE_TYPE is COPY_FILES, this column shows the ID of the database, schema, or stage from which files are copied.  If the SERVICE_TYPE is an Openflow type, the value is NULL.  If the SERVICE_TYPE is Snowpipe Streaming, this shows the ID of the relevant pipe; which is the default pipe ID for the default pipe. |
| ENTITY_TYPE | VARCHAR | Type of Snowflake resource that consumed credits, such as WAREHOUSE, TASK, or TABLE. Note that TABLE is used for all table-like objects. |
| NAME | VARCHAR | The name of the service or object associated with the cost entry, which varies significantly based on the SERVICE_TYPE.  Standard (General): This column shows the name of the service type itself; for example, REPLICATION, TASK.  SNOWPIPE_STREAMING: This service type generates two distinct cost entries, and the NAME column varies for each:   * Cost entry 1 (table name): The value is the name of the Snowflake target table. For the high-performance default pipe, the name is derived from the target table name and appended with -STREAMING; for example, MY_TABLE-STREAMING. * Cost entry 2 (client string): The value is a colon-separated string in the format: SNOWPIPE_STREAMING:CLIENT_NAME:SNOWFLAKE_PROVIDED_ID. This is used for tracking client-side costs.   COPY_FILES: The value is the name of the database from which the files are copied.  Openflow Types: The value is NULL. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier of the database associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific database; for example, a warehouse or compute pool. |
| DATABASE_NAME | VARCHAR | Name of the database associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific database. |
| SCHEMA_ID | NUMBER | Internal or system-generated identifier of the schema associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific schema. |
| SCHEMA_NAME | VARCHAR | Name of the schema associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific schema. |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used by warehouses, serverless compute, and [Openflow](../../user-guide/data-integration/openflow/about.md) resources in the hour. |
| CREDITS_USED_CLOUD_ SERVICES | NUMBER | Number of credits used for cloud services in the hour. Always `0` when the SERVICE_TYPE is one of the Openflow types. |
| CREDITS_USED | NUMBER | Total number of credits used for the account in the hour. This is a sum of CREDITS_USED_COMPUTE and CREDITS_USED_CLOUD_SERVICES. This value does not take into account the adjustment for cloud services, and may therefore be greater than your actual credit consumption. |
| BYTES | NUMBER | When the service type is `auto_clustering`, indicates the number of bytes reclustered during the START_TIME and END_TIME window. When the service type is `pipe`, indicates the number of bytes inserted during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, indicates the number of bytes migrated during the START_TIME and END_TIME window. When the service type is `COPY_FILES`, columns are aggregated at the database level. |
| ROWS | NUMBER | When the service type is `auto_clustering`, indicates number of rows reclustered during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, indicates the number of rows migrated during the START_TIME and END_TIME window. |
| FILES | NUMBER | When the service type is `pipe`, indicates number of files loaded during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, this is NULL. When the service type is `COPY_FILES`, columns are aggregated at the database level. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours), except for the CREDITS_USED_CLOUD_SERVICES column. Latency for
  CREDITS_USED_CLOUD_SERVICES may be up to 6 hours.
* Latency for showing the credit consumption of `SNOWPIPE_STREAMING` may be up to 12 hours.

---
title: NETWORK_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/network_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# NETWORK_POLICIES view

This Account Usage view returns one row for each network policy in your account.

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for network policy. |
| NAME | VARCHAR | Network policy name. |
| OWNER | VARCHAR | Name of role that owns the network policy. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| COMMENT | VARCHAR | Comment for the network policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time that the network policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time that the network policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time that the network policy was dropped. |
| ALLOWED_IP_LIST | VARCHAR | List of allowed IPv4 addresses and CIDR block ranges in the corresponding network policy. |
| BLOCKED_IP_LIST | VARCHAR | List of blocked IPv4 addresses and CIDR block ranges in the corresponding network policy. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: NETWORK_RULE_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/network_rule_references.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# NETWORK_RULE_REFERENCES view

This Account Usage view returns one row for each network rule that is associated with an external access integration or a network policy.

The view is complementary to the Information Schema table function [NETWORK_RULE_REFERENCES](../functions/network_rule_references.md).

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| `network_rule_db` | VARCHAR | Database name that contains the network rule. |
| `network_rule_schema` | VARCHAR | Schema that contains the network rule. |
| `network_rule_id` | NUMBER | Internal/system-defined identifier for the network rule. |
| `network_rule_mode` | VARCHAR | Either: `ingress` or `egress`. |
| `network_rule_type` | VARCHAR | Either: `AWSLinkId` `AzureLinkId`, `HOST_PORT`, or `IPV4` |
| `network_rule_name` | VARCHAR | Name of the network rule. |
| `container_id` | NUMBER | Internal system-defined identifier for the container. |
| `container_name` | VARCHAR | Name of the external access integration or network policy with which the network rule is associated. |
| `container_type` | VARCHAR | Either: `INTEGRATION` or `NETWORK_POLICY` |
| `action_type` | VARCHAR | Either: `ALLOW` or `BLOCK` |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: NETWORK_RULES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/network_rules.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# NETWORK_RULES view

This Account Usage view returns one row for each network rule in your account.

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| `id` | NUMBER | Internal system-generated identifier for network rule. |
| `name` | VARCHAR | Network rule name |
| `schema_id` | NUMBER | Internal system-generated identifier for the schema that contains the network rule. |
| `schema_name` | VARCHAR | Name of the schema that contains the network rule. |
| `database_id` | NUMBER | Internal system-generated identifier for the database that contains the network rule. |
| `database_name` | VARCHAR | Name of the database that contains the network rule. |
| `owner` | VARCHAR | Name of role that owns the network rule. |
| `owner_role_type` | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `comment` | VARCHAR | Comment for the network rule (if any). |
| `created` | TIMESTAMP_LTZ | Date and time that the network rule was created. |
| `last_altered` | TIMESTAMP_LTZ | Date and time that the network rule was last altered. |
| `deleted` | TIMESTAMP_LTZ | Date and time the network rule was dropped. |
| `mode` | VARCHAR | Mode of the network rule. For supported values, see [CREATE NETWORK RULE](../sql/create-network-rule.md). |
| `type` | VARCHAR | Type of network rule. For supported values, see [CREATE NETWORK RULE](../sql/create-network-rule.md). |
| `value_list` | VARCHAR | List of values for the network rule. For supported values, see [CREATE NETWORK RULE](../sql/create-network-rule.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.

## Example

To see your current Snowflake-managed rules, *including* IP addresses, query the NETWORK_RULES view and filter on rows where the database is SNOWFLAKE and the schema is NETWORK_SECURITY:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.NETWORK_RULES
  WHERE DATABASE = 'SNOWFLAKE' AND SCHEMA = 'NETWORK_SECURITY';
```

---
title: NOTEBOOKS_CONTAINER_RUNTIME_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/notebooks_container_runtime_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# NOTEBOOKS_CONTAINER_RUNTIME_HISTORY view

You can use the NOTEBOOKS_CONTAINER_RUNTIME_HISTORY view in the ACCOUNT_USAGE schema to return the hourly credit usage for notebooks running on Snowpark Container Services within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| NOTEBOOK_NAME | VARCHAR | The name of the notebook (running on Snowpark Container Services) that incurred the credit usage. |
| NOTEBOOK_ID | NUMBER | The ID of the notebook that incurred the credit usage. |
| USER_NAME | VARCHAR | The name of the user associated with the notebook. NULL if the notebook was not run interactively. |
| USER_ID | NUMBER | The ID of the user associated with the notebook. NULL if the notebook was not run interactively. |
| COMPUTE_POOL_NAME | VARCHAR | The name of the compute pool associated with the notebook. |
| COMPUTE_POOL_ID | NUMBER | The ID of the compute pool associated with the notebook. |
| SERVICE_NAME | VARCHAR | The name of the service associated with the notebook. |
| SERVICE_ID | NUMBER | The ID of the service associated with the notebook. |
| NOTEBOOK_EXECUTION_TIME_SECS | NUMBER | The run time of the notebook in the given hour. |
| CREDITS | NUMBER(38, 9) | The number of credits that the notebook used in the hour. |

## Usage notes

* Latency for the view might be up to 180 minutes (3 hours).

* The view provides hourly container notebook credit usage for an account within the last 365 days (1 year).
* The credit rate usage is determined based on the machine type (instance family) of the compute pool, as outlined in the consumption table.

---
title: OBJECT_ACCESS_REQUEST_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/object_access_request_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# OBJECT_ACCESS_REQUEST_HISTORY view

This Account Usage view allows consumers to access audit logs that track the submission, rejection, and approval of object access requests through the Internal Marketplace to maintain security and compliance.

## Columns

The following table provides a description of each column in the view.

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | The organization name of current account. |
| ACCOUNT_NAME | VARCHAR | The account name of the current account. |
| TIMESTAMP | TIMESTAMP_LTZ | The state transition event. |
| USER_REGION | VARCHAR | The region of the requester or approver. |
| USER_ACCOUNT_NAME | VARCHAR | The account name of the requester or approver. |
| USER_NAME | VARCHAR | The username of the requester or approver. |
| USER_EMAIL | VARCHAR | The email of the requester or approver. |
| USER_COMMENT | VARCHAR | For CREATE_REQUEST and CANCEL REQUEST, this shows the reason for access provided by the requester.  For APPROVE_REQUEST, DENY_REQUEST, and AUTO_APPROVE_REQUEST, this is the comment for approval/denial provided by the approver. |
| ACTION | VARCHAR | The requester or approver action. This can be one of the following:   * CREATE_REQUEST * CANCEL_REQUEST * APPROVE_REQUEST * DENY_REQUEST * AUTO_APPROVE_REQUEST |
| REQUEST_ID | VARCHAR | The UUID of the request. This can be used to track the history of the request. |
| OBJECT_DOMAIN | VARCHAR | The requested object domain. Currently, this can only be DATA_EXCHANGE_LISTING. |
| OBJECT_REGION | VARCHAR | The snowflake region name where the requested object is located. |
| OBJECT_ACCOUNT_NAME | VARCHAR | The account where the requested object is located. |
| OBJECT_NAME | VARCHAR | The name of the requested object. |
| GRANTEE_TO_AUTHORIZE | VARCHAR | For CREATE_REQUEST and CANCEL REQUEST, this shows the role provided by the requester.  For APPROVE_REQUEST, DENY_REQUEST, and AUTO_APPROVE_REQUEST, this is the role granted or denied by the approver. |
| GRANTEE_TYPE | VARCHAR | The type of the grantee. Currently, this can only be ROLE. |

## Usage notes

For approver-initiated actions, such as APPROVE_REQUEST, DENY_REQUEST, and AUTO_APPROVE_REQUEST, the requester can’t see the approver’s USER_ACCOUNT_NAME, USER_NAME, USER_EMAIL, and OBJECT_ACCOUNT_NAME.

---
title: OBJECT_DEPENDENCIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/object_dependencies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# OBJECT_DEPENDENCIES view

This Account Usage view displays object dependencies. An object dependency results when an object references a base object but does not
materialize or copy data, such as when a view references a table.

For example, while creating a view from a single table, the view is dependent on the table. Snowflake returns one row to record the
dependency of the view on the table.

However, if creating the view is dependent on two tables, Snowflake returns one row to record the dependency of the view on the first table
and, separately, one row to record the dependency of the view on the second table. This pattern continues for however many dependencies
there are for a given object.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REFERENCED_DATABASE | TEXT | The parent database of the referenced object. |
| REFERENCED_SCHEMA | TEXT | The parent schema of the referenced object. |
| REFERENCED_OBJECT_NAME | TEXT | The name of the referenced object. |
| REFERENCED_OBJECT_ID | NUMBER | The object ID of the referenced object. |
| REFERENCED_OBJECT_DOMAIN | TEXT | The domain (e.g. `TABLE`, `VIEW`) of the referenced object. |
| REFERENCING_DATABASE | TEXT | The parent database of the referencing object. |
| REFERENCING_SCHEMA | TEXT | The parent schema of the referencing object. |
| REFERENCING_OBJECT_NAME | TEXT | The name of the referencing object. |
| REFERENCING_OBJECT_ID | NUMBER | The object ID of the referencing object. |
| REFERENCING_OBJECT_DOMAIN | TEXT | The domain (e.g. `TABLE`, `VIEW`) of the referencing object. |
| DEPENDENCY_TYPE | TEXT | The type of dependency (`BY_ID`, `BY_NAME`, or `BY_NAME_AND_ID`). |

## Usage notes

* Latency for this view may be up to three hours.

* For a complete list of supported objects and their dependency type, see [Supported object dependencies](../../user-guide/object-dependencies.md).
* Data movement, such as when data is copied or materialized from one object to another, does not result in an object dependency. For
  example, CREATE TABLE AS SELECT (CTAS), INSERT, or MERGE operations on tables result in data movement and are not included in this view.
* This view was backfilled on January 22, 2022 to include dependencies prior to making the view available. Snowflake continues to record
  dependencies after this date.

  Note that if a view or [UDF](../../developer-guide/udf/udf-overview.md) was invalid due to a missing dependency prior to this date and
  the missing dependency is fixed later, Snowflake does not record the dependency for the view or UDF.

  For example, if you created a view that depends on a table on December 1, 2021, dropped the table on the same day, and then undropped the
  table on February 1, 2022, Snowflake does not record that the view depends on the table.

  As a workaround, create or replace the view or UDF to so that this view records the dependency.

### Data sharing usage notes

General notes:
:   The view updates assume the share is not deleted.

    The view schema (i.e. column names, data types, and values) remains the same with these exceptions:

    * The value for the REFERENCED_OBJECT_ID column in the consumer account is always NULL for a shared object.

      This value prevents a customer from discovering the source object in the provider account.
    * The value for REFERENCED_OBJECT_DOMAIN is `TABLE` for all table-like objects.

Snowflake objects:
:   Shared objects, such as Account Usage views, are now supported as referenced objects.

    For example, if a user-defined view depends on data from another Account Usage view, such as LOGIN_HISTORY, the OBJECT_DEPENDENCIES view
    in the consumer account specifies the LOGIN_HISTORY view as the referenced object.

Rename notes:
:   When a provider renames a shared database, shared schema, or shared object:

    * The consumer OBJECT_DEPENDENCIES view record shows the record of the original name for the database, schema, or object prior to
      the renaming, not the renamed object.

      Newly renamed shared objects are not shown in the consumer OBJECT_DEPENDENCIES view to prevent the consumer from determining the object
      lifecycle in the provider account. A new referencing object would need to refer to the newly renamed object in order for the renamed
      object to appear in the local OBJECT_DEPENDENCIES view in the consumer account.
    * Renaming the shared database preserves the dependency in the consumer account.
    * Renaming a shared schema or shared objects in a shared schema breaks the dependency in the consumer account.

    If the consumer renames a shared database, all existing dependencies on that database break. Consequently, Snowflake removes the
    corresponding records from the OBJECT_DEPENDENCIES view in the consumer account.

    For example, the shared database contains a view named `db1_shared.views.view_1_shared`. The consumer renames the shared database to
    `mydb`. The view now has a fully-qualified name of `mydb.views.view_1_shared`. Snowflake removes the row specifying
    `db1_shared.views.view_1_shared` in the consumer’s OBJECT_DEPENDENCIES view because the dependency on the database named
    `db1_shared` is broken.

Not supported:
:   The `BY_ID` dependency type for referenced objects is not supported.

    * [Limitations](../../user-guide/object-dependencies.md)
    * [Object dependencies with snowflake features and services](../../user-guide/object-dependencies.md)

---
title: OPENFLOW_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/openflow_usage_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# OPENFLOW_USAGE_HISTORY view

This Account Usage view returns the hourly runtime credit usage for an account within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| DATA_PLANE_ID | VARCHAR | ID of the data plane which incurred the credit usage. |
| DATA_PLANE_NAME | VARCHAR | Name of the data plane which incurred the credit usage. |
| DATA_PLANE_TYPE | VARCHAR | Type of the data plane. Supported values include:  * `BYOC` * `SNOWFLAKE` |
| DATA_PLANE_CREDITS_USED | NUMBER | Number of compute credits the data plane used in the hour. The data plane credits are only incurred for SNOWFLAKE data planes. For `BYOC`, there are no credits incurred for data planes and customers are charged credits only for runtime usage. |
| RUNTIME_ID | VARCHAR | ID of the runtime which incurred the credit usage. |
| RUNTIME_NAME | VARCHAR | Name of the runtime which incurred the credit usage. |
| RUNTIME_TYPE | VARCHAR | Type of the runtime. Supported values include:   * `RUNTIME` * `READ_ONLY RUNTIME` |
| RUNTIME_CREDITS_USED | NUMBER | Number of compute credits the runtime used in the hour. This does not include the credits used by the data plane or the credits used to ingest the data in Snowflake. |

---
title: OUTBOUND_PRIVATELINK_ENDPOINTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/outbound_privatelink_endpoints.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# OUTBOUND_PRIVATELINK_ENDPOINTS view

This Account Usage view displays one row for each private endpoint that has been created for
[outbound private connectivity](../../user-guide/private-connectivity-outbound.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROVIDER_RESOURCE_ID | VARCHAR | Identifier of the AWS service or the Microsoft Azure resource that the endpoint connects to. |
| HOSTNAME | VARCHAR | Hostname of the AWS service or Microsoft Azure resource that the endpoint connects to. |
| SUBRESOURCE | VARCHAR | Subresource of the Microsoft Azure resource that the endpoint connects to. Endpoints for AWS do not have a subresource. |
| SNOWFLAKE_RESOURCE_ID | VARCHAR | Identifier of the private endpoint that connects to the AWS service or Microsoft Azure resource. For AWS, this is the VPCE_ID of the endpoint. For Microsoft Azure, this is the resource ID of the endpoint. |
| ENDPOINT_STATE | VARCHAR | Current state of the endpoint. One of the following:   * `PENDING_CREATION`: The endpoint is still being created. * `CREATED`: The endpoint is created and ready to use. This state indicates that Snowflake received a response from the cloud provider   about the endpoint being successfully created. * `FAILED`: The endpoint is in an unexpected state on the cloud provider, and cannot be used. * `PENDING_DELETION`: The endpoint is on the deletion queue, but can be restored. * `DELETING`: The endpoint is being deleted on the cloud provider and cannot be restored. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the endpoint was created. |
| LAST_ALTERED_ON | TIMESTAMP_LTZ | Date and time when the endpoint state last changed. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time when the endpoint was deleted. NULL if an endpoint has not been deleted, including deprovisioned endpoints that haven’t been deleted yet. |

## Usage notes

* Latency for this view might be up to 2 hours.
* Users with the SECURITY_VIEWER database role can access this view.
* Data for deleted endpoints is retained for 1 year.
* For endpoints created during the preview of the outbound private connectivity feature (before November 2024), values in the
  LAST_ALTERED_ON column might be the time at which the data became available in the OUTBOUND_PRIVATELINK_ENDPOINTS view, not the creation
  times of the endpoints.

---
title: PASSWORD_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/password_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PASSWORD_POLICIES view

This Account Usage view provides the user-defined [password policies](../../user-guide/password-authentication.md) in your account.

Each row in this view corresponds to a different password policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the policy. |
| ID | NUMBER | Internal/system-generated identifier for the password policy. |
| SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | VARCHAR | Schema to which the password policy belongs. |
| DATABASE_ID | VARCHAR | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | VARCHAR | Database to which the password policy belongs. |
| OWNER | VARCHAR | Name of the role that owns the password policy. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PASSWORD_MIN_LENGTH | NUMBER | Minimum password length allowed for the policy. |
| PASSWORD_MAX_LENGTH | NUMBER | Maximum password length allowed for the policy. |
| PASSWORD_MIN_UPPER_CASE_CHARS | NUMBER | Minimum number of uppercase characters allowed for the policy. |
| PASSWORD_MIN_LOWER_CASE_CHARS | NUMBER | Minimum number of lowercase characters allowed for the policy. |
| PASSWORD_MIN_NUMERIC_CHARS | NUMBER | Minimum number of numeric characters allowed for the policy. |
| PASSWORD_MIN_SPECIAL_CHARS | NUMBER | Minimum number of special characters allowed for the policy. |
| PASSWORD_MIN_AGE_DAYS | NUMBER | The number of days a user must wait before a recently changed password can be changed again. |
| PASSWORD_MAX_AGE_DAYS | NUMBER | Maximum number of days password is valid. |
| PASSWORD_MAX_RETRIES | NUMBER | Maximum number of password attempts allowed. |
| PASSWORD_LOCKOUT_TIME_MINS | NUMBER | Minimum time in minutes before password can be retried. |
| COMMENT | VARCHAR | Comments entered for the password policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the password policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the password policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the password policy was dropped. |
| PASSWORD_HISTORY | NUMBER | The number of the most recent passwords that Snowflake stores. These stored passwords cannot be repeated when a user updates their password value. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: PIPE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/pipe_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PIPE_USAGE_HISTORY view

This Account Usage view can be used to query the history of data loaded into tables
using [Snowpipe](../../user-guide/data-load-snowpipe-intro.md) or the history of credits used for
[Iceberg automated refresh](../../user-guide/tables-iceberg-auto-refresh.md) within the last 365 days (1 year).

The view displays the history of data loaded and credits billed for your entire Snowflake account.
You can use the `pipe_name` column to filter the view for a specific pipe or Iceberg table with automated refresh.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PIPE_ID | NUMBER | Internal/system-generated identifier for the pipe used for the data load. Displays NULL if no pipe name was specified in the query. Each row includes the totals for all pipes in use within the time range. |
| PIPE_NAME | VARCHAR | Name of the pipe or Iceberg table with automated refresh. Displays NULL for the internal (hidden) pipe object used to refresh the metadata for an external table or Delta-based Iceberg table. |
| START_TIME | TIMESTAMP_LTZ | Start time of the period when data-ingestion information is aggregated. |
| END_TIME | TIMESTAMP_LTZ | End time of the period when data-ingestion information is aggregated. |
| CREDITS_USED | NUMBER | Number of credits billed for Snowpipe data loads during the START_TIME and END_TIME window. |
| BYTES_INSERTED | FLOAT | Number of bytes loaded during the START_TIME and END_TIME window. |
| FILES_INSERTED | VARIANT | Number of files loaded during the START_TIME and END_TIME window. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```
* Occasionally, the data compaction and maintenance process can consume Snowflake credits. For example, the returned results might show that you consumed credits with 0 BYTES_INSERTED and 0 FILES_INSERTED. This means that your data is not being loaded, but the data compaction and maintenance process has consumed some credits.

* Snowflake bills for auto-refresh notifications in external tables and directory tables on internal named stages and external stages at a rate equivalent to the Snowpipe file charge. You can estimate charges incurred by your external table and directory table auto-refresh
  notifications by examining this PIPE_USAGE_HISTORY view or querying the [PIPE_USAGE_HISTORY](../functions/pipe_usage_history.md) function. Note that the auto-refresh pipes will be listed under a NULL pipe
  name. You can also view your external table auto-refresh notification history at the table-level/stage-level granularity by using the
  Information Schema table function [AUTO_REFRESH_REGISTRATION_HISTORY](../functions/auto_refresh_registration_history.md).

  To avoid charges for auto-refresh notifications, perform a manual refresh for external tables and directory tables. For external tables, the
  ALTER EXTERNAL TABLE <name> REFRESH … statement can be used to manually synchronize your external table to external storage. For directory
  tables, the ALTER STAGE <name> REFRESH … statement can be used to manually synchronize the directory to external storage.
* Snowflake does not bill Snowpipe file charges for [Iceberg automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

## Examples

This query provides the pipe usage history for a pipe named `my_auto_refresh_pipe` starting on a particular date:

```sqlexample
SELECT
    pipe_id,
    start_time,
    end_time,
    credits_used,
    bytes_inserted,
    files_inserted
  FROM SNOWFLAKE.ACCOUNT_USAGE.PIPE_USAGE_HISTORY
  WHERE pipe_name = 'my_auto_refresh_pipe'
  AND START_TIME >= '2025-04-01';
```

This query displays the credits used for automated refresh charges for an Iceberg table named `iceberg_glue_table`
starting on a particular date:

```sqlexample
SELECT
    pipe_id,
    start_time,
    end_time,
    credits_used,
  FROM SNOWFLAKE.ACCOUNT_USAGE.PIPE_USAGE_HISTORY
  WHERE pipe_name = 'iceberg_glue_table'
  AND START_TIME >= '2025-04-01';
```

---
title: PIPES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/pipes.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PIPES view

This Account Usage view displays a row for each pipe defined in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PIPE_ID | NUMBER | Internal or system-generated identifier for the pipe. |
| PIPE_NAME | VARCHAR | The name of the pipe object.  For manually created pipes, this is the name defined in the CREATE PIPE statement.  For the Snowpipe Streaming high-performance default pipe, this is derived from the target table name; for example, `MY_TABLE-STREAMING`. |
| PIPE_SCHEMA_ID | NUMBER | Internal or system-generated identifier for the schema that the pipe belongs to.  For the default pipe, this corresponds to the target table’s schema ID. |
| PIPE_SCHEMA | VARCHAR | Schema that the pipe belongs to.  For the default pipe, this corresponds to the target table’s schema. |
| PIPE_CATALOG_ID | NUMBER | Internal or system-generated identifier for the database that the pipe belongs to.  For the default pipe, this corresponds to the target table’s database ID. |
| PIPE_CATALOG | VARCHAR | Name of the database that the pipe belongs to.  For the default pipe, this corresponds to the target table’s database. |
| IS_AUTOINGEST_ENABLED | VARCHAR | Whether AUTO-INGEST is enabled for the pipe. Represents future functionality. |
| NOTIFICATION_CHANNEL_NAME | VARCHAR | Amazon Resource Name of the Amazon SQS queue for the stage named in the DEFINITION column. Represents future functionality. |
| PIPE_OWNER | VARCHAR | Name of the role that owns the pipe.  Returns NULL for the default pipe. |
| DEFINITION | VARCHAR | COPY statement used to load data from queued files into a Snowflake table. |
| CREATED | TIMESTAMP_LTZ | Creation time of the pipe. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this pipe.  Returns the following message for the default pipe: “Default pipe for Snowpipe Streaming High Performance ingestion to a table. Created and managed by Snowflake.” |
| PATTERN | VARCHAR | PATTERN copy option value in the [COPY INTO <table>](../sql/copy-into-table.md) statement in the pipe definition, if the copy option was specified. |
| DELETED | TIMESTAMP_LTZ | Date and time when the pipe was deleted. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object; for example, ROLE.  If a Snowflake Native App owns the object, the value is APPLICATION.  Snowflake returns NULL if you delete the object because a deleted object doesn’t have an owner role.  Returns NULL for the default pipe. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

The following example joins this view with [PIPE_USAGE_HISTORY view](pipe_usage_history.md) on the PIPE_ID column to track the credit usage associated with each unique PIPE object:

```sqlexample
select a.PIPE_CATALOG as PIPE_CATALOG,
       a.PIPE_SCHEMA as PIPE_SCHEMA,
       a.PIPE_NAME as PIPE_NAME,
       b.CREDITS_USED as CREDITS_USED
from SNOWFLAKE.ACCOUNT_USAGE.PIPES a join SNOWFLAKE.ACCOUNT_USAGE.PIPE_USAGE_HISTORY b
on a.pipe_id = b.pipe_id
where b.START_TIME > date_trunc(month, current_date);
```

---
title: POLICY_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/policy_references.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# POLICY_REFERENCES view

This Account Usage view lists policy objects and their references in your account.

The view supports aggregation, masking, network, projection, row access, and storage lifecycle policies.

The view is complementary to the Information Schema table function [POLICY_REFERENCES](../functions/policy_references.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The database in which the policy is set. |
| POLICY_SCHEMA | VARCHAR | The schema in which the policy is set. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the policy. |
| POLICY_NAME | VARCHAR | The name of the policy. |
| POLICY_KIND | VARCHAR(17) | The type of policy. |
| REF_DATABASE_NAME | VARCHAR | The name of the database containing an object that the queried object references. |
| REF_SCHEMA_NAME | VARCHAR | The name of the schema containing an object that the queried object references. |
| REF_ENTITY_NAME | VARCHAR | The name of the object (i.e. table_name, view_name, external_table_name) on which the policy is set. |
| REF_ENTITY_DOMAIN | VARCHAR | The object type (i.e. table, view) on which the policy is set. |
| REF_COLUMN_NAME | VARCHAR | The column name on which the policy is set. |
| REF_ARG_COLUMN_NAMES | VARCHAR | Returns NULL for rows in the query result in which a Column-level Security masking policy is set. |
| TAG_DATABASE | VARCHAR | The name of the database containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_SCHEMA | VARCHAR | The name of the schema containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_NAME | VARCHAR | The name of the tag that has a policy assigned to it or NULL if a policy is not assigned to the tag. |
| POLICY_STATUS | VARCHAR | Specifies the status of the policy, which can be one of four possible values: `ACTIVE`, `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`, `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`, or `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`. |

Note the following for the POLICY_STATUS column:

> `ACTIVE`
> :   Specifies that the column (i.e. REF_COLUMN_NAME) is only associated with a single policy.
>
> `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`
> :   Specifies that multiple masking policies are assigned to the same column.
>
> `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`
> :   Specifies that the policy (i.e. POLICY_NAME) is a conditional masking policy and the table (i.e. REF_ENTITY_NAME) does not have a
>     column with the same name.
>
> `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`
> :   Specifies that the policy is a conditional masking policy and the table has a column with the same name but a different data type than
>     the data type in the masking policy signature.

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

---
title: POSTGRES_STORAGE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/postgres_storage_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# POSTGRES_STORAGE_USAGE_HISTORY view

This Account Usage view can be used to query the hourly storage used in byte-months for [Postgres instances](../../user-guide/snowflake-postgres/about.md)
in the account for the last 365 days (1 year). The data includes all data stored on the instance.

## Columns

The following table provides definitions for the POSTGRES_STORAGE_USAGE_HISTORY view columns.

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| TOTAL_BYTE_MONTHS_STANDARD | NUMBER | Number of byte-months of standard storage used by Postgres instances. |
| TOTAL_BYTE_MONTHS_HA | NUMBER | Number of byte-months of high-availability storage used by Postgres instances. |

## Usage notes

* The maximum latency for this view is three hours.
* The POSTGRES_STORAGE_USAGE_HISTORY view and the Snowsight cost management tools can return different daily storage usage
  values. This discrepancy is caused by the methods used to determine storage usage. To determine these values, the
  POSTGRES_STORAGE_USAGE_HISTORY view uses the current session’s [TIMEZONE](../parameters.md) parameter and the Snowsight cost
  management tools use Coordinated Universal Time (UTC). To resolve any discrepancies, Snowflake recommends setting the TIMEZONE
  parameter to UTC.

---
title: PRIVACY_BUDGETS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/privacy_budgets.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PRIVACY_BUDGETS view

This Account Usage view lets you retrieve the privacy budgets associated with
[privacy policies](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) in an account.

For more information about viewing privacy budgets, see [View a privacy budget](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md).

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| `database_name` | VARCHAR | Database that contains the privacy policy. |
| `schema_name` | VARCHAR | Schema that contains the privacy policy. |
| `policy_name` | VARCHAR | Name of the privacy policy. |
| `budget_name` | VARCHAR | Name of the privacy budget in the privacy policy. |
| `consumer_id` | VARCHAR | Organization and account where users executed queries that incurred privacy loss. |
| `budget_spent` | FLOAT | Cumulative privacy loss since the last time the [privacy budget was refreshed](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md). |

## Usage notes

* Latency for the view may be up to 24 hours.
* A privacy budget only appears if analysts associated with the privacy budget have incurred privacy loss or if an administrator has
  [reset the privacy budget](../../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md).

---
title: PRIVACY_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/privacy_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PRIVACY_POLICIES view

This Account Usage view provides the [privacy policies](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) in your account.

Each row in this view corresponds to a different privacy policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| `policy_id` | NUMBER | Internal/system-generated identifier for the privacy policy. |
| `policy_name` | VARCHAR | Name of the privacy policy. |
| `policy_schema_id` | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| `policy_schema` | VARCHAR | Schema that contains the privacy policy. |
| `policy_catalog_id` | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| `policy_catalog` | VARCHAR | Database to which the privacy policy belongs. |
| `policy_owner` | VARCHAR | Name of the role that owns the privacy policy. |
| `policy_signature` | VARCHAR | Type signature of the privacy policy’s arguments. |
| `policy_return_type` | VARCHAR | Return value data type. |
| `policy_body` | VARCHAR | Privacy policy definition. |
| `policy_comment` | VARIANT | Comments entered for the privacy policy (if any). |
| `created` | TIMESTAMP_LTZ | Date and time when the privacy policy was created. |
| `last_altered` | TIMESTAMP_LTZ | Date and time when the privacy policy was last altered. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the privacy policy was dropped. |
| `owner_role_type` | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: PROCEDURES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/procedures.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PROCEDURES view

This Account Usage view displays a row for each stored procedure defined in the account.

For more information about stored procedures, see [Stored procedures overview](../../developer-guide/stored-procedure/stored-procedures-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROCEDURE_CATALOG | VARCHAR | Database to which the stored procedure belongs. |
| PROCEDURE_SCHEMA | VARCHAR | Schema to which the stored procedure belongs. |
| PROCEDURE_NAME | VARCHAR | Name of the stored procedure. |
| PROCEDURE_OWNER | VARCHAR | Name of the role that owns the stored procedure. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the stored procedure’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric return value. |
| PROCEDURE_LANGUAGE | VARCHAR | Language of the stored procedure. |
| PROCEDURE_DEFINITION | VARCHAR | Stored procedure definition. |
| CREATED | TIMESTAMP_LTZ | Creation time of the stored procedure. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this stored procedure. |
| DELETED | TIMESTAMP_LTZ | Date and time when the procedure was dropped. |
| RUNTIME_VERSION | VARCHAR | Runtime version of the language used by the procedure. |
| PACKAGES | VARCHAR | Packages requested by the procedure. |
| INSTALLED_PACKAGES | VARCHAR | All packages installed by the function. Output for Python procedures only. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PROCEDURE_SCHEMA_ID | NUMBER | Internal/system-generated identifier of the schema to which the stored procedure belongs. |
| PROCEDURE_CATALOG_ID | NUMBER | Internal/system-generated identifier of the database to which the stored procedure belongs. |
| SECRETS | JSON map | Map of [secrets](../sql/create-secret.md) specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| EXTERNAL_ACCESS_INTEGRATIONS | VARCHAR | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: PROJECTION_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/projection_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# PROJECTION_POLICIES view

This Account Usage view provides the projection policies in your account.

Each row in this view corresponds to a different projection policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the projection policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the projection policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema that contains the projection policy. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the projection policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the projection policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the projection policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Projection policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the projection policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the projection policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the projection policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the projection policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: QUERY_ACCELERATION_ELIGIBLE view
source: https://docs.snowflake.com/en/sql-reference/account-usage/query_acceleration_eligible.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# QUERY_ACCELERATION_ELIGIBLE view

This Account Usage view can be used to identify queries that are eligible for the
[query acceleration service](../../user-guide/query-acceleration-service.md) (QAS).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| QUERY_TEXT | VARCHAR | Text of the SQL statement. |
| START_TIME | TIMESTAMP_LTZ | Statement start time. |
| END_TIME | TIMESTAMP_LTZ | Statement end time. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that the query executed on. |
| WAREHOUSE_SIZE | VARCHAR | Size of the warehouse when this statement executed. |
| ELIGIBLE_QUERY_ACCELERATION_TIME | NUMBER | Amount of query execution time (in seconds) eligible for the query acceleration service. |
| UPPER_LIMIT_SCALE_FACTOR | NUMBER | Upper limit [scale factor](../sql/create-warehouse.md) for the given query. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_PARAMETERIZED_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 180 minutes (three hours).

* Query acceleration is supported for the following SQL commands:

  > + SELECT
  > + INSERT
  > + CREATE TABLE AS SELECT (CTAS)
  > + COPY INTO <table>

  For more information about query eligibility, see [Eligible queries](../../user-guide/query-acceleration-service.md).
* This view only includes eligible queries that have *not* been accelerated. If you have enabled
  the query acceleration service and previously QAS-eligible queries are now accelerated, they
  are not included in this view.

## Examples

Identify the warehouses with the most queries eligible in a given period of time for the query acceleration service:

```sqlexample
SELECT warehouse_name, COUNT(query_id) AS num_eligible_queries
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time >= '2024-06-01 00:00'::TIMESTAMP
  AND end_time <= '2024-06-07 00:00'::TIMESTAMP
  GROUP BY warehouse_name
  ORDER BY num_eligible_queries DESC;
```

For more example queries, see [Identifying queries and warehouses with the QUERY_ACCELERATION_ELIGIBLE view](../../user-guide/query-acceleration-service.md).

---
title: QUERY_ACCELERATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/query_acceleration_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# QUERY_ACCELERATION_HISTORY view

This Account Usage view can be used to query the history of queries accelerated by the
[query acceleration service](../../user-guide/query-acceleration-service.md). The information returned by the view includes the warehouse name
and the credits consumed by the query acceleration service.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | NUMBER | Number of credits billed for the query acceleration service during the START_TIME and END_TIME window. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |

## Usage notes

* Billing history is not necessarily updated immediately. Latency for the view may be up to 180 minutes (3 hours).

* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

## Examples

This query returns the total number of credits used by each warehouse in your account for the query acceleration service
(month-to-date):

```sqlexample
SELECT warehouse_name,
       SUM(credits_used) AS total_credits_used
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ACCELERATION_HISTORY
  WHERE start_time >= DATE_TRUNC(month, CURRENT_DATE)
  GROUP BY 1
  ORDER BY 2 DESC;
```

---
title: QUERY_ATTRIBUTION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/query_attribution_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# QUERY_ATTRIBUTION_HISTORY view

This Account Usage view can be used to determine the compute cost of a given query run on warehouses in your account
in the last 365 days (1 year).

For more information, see [Viewing cost by tag in SQL](../../user-guide/cost-attributing.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| PARENT_QUERY_ID | VARCHAR | Query ID of the parent query or NULL if the query does not have a parent. |
| ROOT_QUERY_ID | VARCHAR | Query ID of the topmost query in the chain or NULL if the query does not have a parent. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that the query was executed on. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that the query executed on. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_TAG | VARCHAR | Query tag set for this statement through the [QUERY_TAG](../parameters.md) session parameter. |
| USER_NAME | VARCHAR | User who issued the query. |
| START_TIME | TIMESTAMP_LTZ | Time when query execution started (in the local time zone). |
| END_TIME | TIMESTAMP_LTZ | Time when query execution ended (in the local time zone). |
| CREDITS_ATTRIBUTED_COMPUTE | FLOAT | Number of credits attributed to this query. Includes only the credit usage for the query execution and doesn’t include any warehouse idle time. |
| CREDITS_USED_QUERY_ACCELERATION | FLOAT | Number of credits consumed by the [Query Acceleration Service](../../user-guide/query-acceleration-service.md) to accelerate the query. NULL if the query is not accelerated. . . The total cost for an accelerated query is the sum of this column and the CREDITS_ATTRIBUTED_COMPUTE column. |

## Usage notes

* Latency for this view can be up to eight hours.
* This view displays results for any role granted the USAGE_VIEWER or GOVERNANCE_VIEWER
  [database role](../snowflake-db-roles.md).

* The value in the `credits_attributed_compute` column contains the warehouse credit usage for executing the query,
  inclusive of any resizing and/or autoscaling of multi-cluster warehouse(s). This cost is attributed based on
  the weighted average of the resource consumption.

  The value doesn’t include any credit usage for warehouse idle time. Idle time is a period
  of time in which no queries are running in the warehouse and can be measured at the warehouse level.

  The value doesn’t include any other credit usage that is incurred as a result of query execution.
  For example, the following are not included in the query cost:

  + Data transfer costs
  + Storage costs
  + Cloud services costs
  + Costs for serverless features
  + Costs for tokens processed by AI services
* For queries that are executed concurrently, the cost of the warehouse is attributed to individual queries based on the
  weighted average of their resource consumption during a given time interval.
* Short-running queries (<= ~100ms) are currently too short for per query cost attribution and are not included in the view.
* Data for all columns is available starting from mid-August, 2024. Some data prior to this date might be available in the view, but
  might be incomplete.

## Examples

### Query costs for related queries

To determine the costs of a specific query and similar queries using the query parameterized hash, replace `<query_id>`
and execute the following statements:

```sqlexample
SET query_id = '<query_id>';

WITH query_hash_of_query AS (
  SELECT query_parameterized_hash
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE query_id = $query_id
  LIMIT 1
)
SELECT
  query_parameterized_hash,
  COUNT (*) AS query_count,
  SUM(credits_attributed_compute) AS recurrent_query_attributed_credits
FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
WHERE start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
  AND start_time < CURRENT_DATE
  AND query_parameterized_hash = (SELECT query_parameterized_hash FROM query_hash_of_query)
GROUP BY ALL;
```

### Query costs for the current user

To determine the costs of queries executed by the current user for the current month, execute the following statement:

```sqlexample
SELECT user_name, SUM(credits_attributed_compute) AS credits
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
  WHERE user_name = CURRENT_USER()
    AND start_time >= DATE_TRUNC('MONTH', CURRENT_DATE)
    AND start_time < CURRENT_DATE
  GROUP BY user_name;
```

For an example of attributing warehouse costs to users, see [Resources shared by users from different departments](../../user-guide/cost-attributing.md).

### Query costs for stored procedures

For stored procedures that issue multiple hierarchical queries, you can compute the attributed query costs for the
procedure by using the root query ID for the procedure.

1. To find the root query ID for a stored procedure, use the [ACCESS_HISTORY view](access_history.md). For example,
   to find the root query ID for a stored procedure, set the `query_id` and execute the following statements:

   ```sqlexample
   SET query_id = '<query_id>';

   SELECT query_id,
          parent_query_id,
          root_query_id,
          direct_objects_accessed
     FROM SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY
     WHERE query_id = $query_id;
   ```

   For more information, see [Ancestor queries with stored procedures](../../user-guide/access-history.md).
2. To sum the query cost for the entire procedure, replace `<root_query_id>` and execute the following statements:

   ```sqlexample
   SET query_id = '<root_query_id>';

   SELECT SUM(credits_attributed_compute) AS total_attributed_credits
     FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_ATTRIBUTION_HISTORY
     WHERE (root_query_id = $query_id OR query_id = $query_id);
   ```

### Additional examples

For more examples, see [Resources shared by users from different departments](../../user-guide/cost-attributing.md).

---
title: QUERY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/query_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# QUERY_HISTORY view

This Account Usage view can be used to query Snowflake query history by various dimensions (time range, session, user, warehouse, and so on) within the last 365 days (1 year).

The view is available in both the ACCOUNT_USAGE and READER_ACCOUNT_USAGE schemas with the following differences:

* The following columns are available *only* in the reader account view:

  + `reader_account_name`
  + `reader_account_deleted_on`

Alternatively, you can call the Information Schema table function, also named QUERY_HISTORY; however, note that the table function restricts
the results to activity over the past 7 days, versus 365 days for the Account Usage view. See the
[description of the QUERY_HISTORY function](../functions/query_history.md).

See also:

* [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../functions/query_history.md) (Information Schema table function)
* [Monitor query activity with Query History](../../user-guide/ui-snowsight-activity.md) (Snowsight dashboard)
* [Use the Grouped Query History view in Snowsight](../../user-guide/ui-snowsight-activity.md)

## Columns

The *Available only in reader account usage views* column in the following table indicates whether the QUERY_HISTORY column is available in the
[READER_ACCOUNT_USAGE](../account-usage.md) schema.

| Column Name | Data Type | Description | Available only in reader account usage views |
| --- | --- | --- | --- |
| `reader_account_name` | VARCHAR | Name of the reader account in which the SQL statement was executed. | ✔ |
| `query_id` | VARCHAR | Internal/system-generated identifier for the SQL statement. | ✔ |
| `query_text` | VARCHAR | Text of the SQL statement. The limit is 100K characters. Longer SQL statements are truncated. |  |
| `database_id` | NUMBER | internal/system-generated identifier for the database that was in use. | ✔ |
| `database_name` | VARCHAR | database that was specified in the context of the query at compilation. | ✔ |
| `schema_id` | NUMBER | Internal/system-generated identifier for the schema that was in use. | ✔ |
| `schema_name` | VARCHAR | Schema that was specified in the context of the query at compilation. | ✔ |
| `query_type` | VARCHAR | DML, query, etc. If the query failed, then the query type may be UNKNOWN. |  |
| `session_id` | NUMBER | Session that executed the statement. | ✔ |
| `authn_event_id` | NUMBER | ID for the event for the authentication of the user for this query. This ID corresponds to the value in the `event_id` column in the [LOGIN_HISTORY](login_history.md) view. |  |
| `user_name` | VARCHAR | User who issued the query. |  |
| `role_name` | VARCHAR | Role that was active in the session at the time of the query. | ✔ |
| `warehouse_id` | NUMBER | Internal/system-generated identifier for the warehouse that was used. | ✔ |
| `warehouse_name` | VARCHAR | Warehouse that the query executed on, if any. | ✔ |
| `warehouse_size` | VARCHAR | Size of the warehouse when this statement executed. | ✔ |
| `warehouse_type` | VARCHAR | Type of the warehouse when this statement executed. | ✔ |
| `cluster_number` | NUMBER | The cluster (in a multi-cluster warehouse) that this statement executed on. | ✔ |
| `query_tag` | VARCHAR | Query tag set for this statement through the QUERY_TAG session parameter. | ✔ |
| `execution_status` | VARCHAR | Execution status for the query. Valid values: `success`, `fail`, `incident`. | ✔ |
| `error_code` | NUMBER | Error code, if the query returned an error | ✔ |
| `error_message` | VARCHAR | Error message, if the query returned an error. The limit is 5K characters. Longer error messages are truncated. | ✔ |
| `start_time` | TIMESTAMP_LTZ | Statement start time (in the local time zone) | ✔ |
| `end_time` | TIMESTAMP_LTZ | Statement end time (in the local time zone). | ✔ |
| `total_elapsed_time` | NUMBER | Elapsed time (in milliseconds). | ✔ |
| `bytes_scanned` | NUMBER | Number of bytes scanned by this statement. | ✔ |
| `percentage_scanned_from_cache` | FLOAT | Percentage of data scanned from the local disk cache. The value ranges from 0.0 to 1.0. Multiply by 100 to get a true percentage. |  |
| `bytes_written` | NUMBER | Number of bytes written (e.g. when loading into a table). |  |
| `bytes_written_to_result` | NUMBER | Number of bytes written to a result object. For example, `SELECT * FROM ...` would produce a set of results in tabular format representing each field in the selection. . . In general, the results object represents whatever is produced as a result of the query, and `bytes_written_to_result` represents the size of the returned result. |  |
| `bytes_read_from_result` | NUMBER | Number of bytes read from a result object. |  |
| `rows_produced` | NUMBER | The number of rows produced by this statement. The `rows_produced` column will be deprecated in a future release. The value in the `rows_produced` column doesn’t always reflect the logical number of rows affected by a query. Snowflake recommends using the `rows_inserted`, `rows_updated`, `rows_written_to_result`, or `rows_deleted` columns instead. | ✔ |
| `rows_inserted` | NUMBER | Number of rows inserted by the query. |  |
| `rows_updated` | NUMBER | Number of rows updated by the query. |  |
| `rows_deleted` | NUMBER | Number of rows deleted by the query. |  |
| `rows_unloaded` | NUMBER | Number of rows unloaded during data export. |  |
| `bytes_deleted` | NUMBER | Number of bytes deleted by the query. |  |
| `partitions_scanned` | NUMBER | Number of micro-partitions scanned. |  |
| `partitions_total` | NUMBER | Total micro-partitions of all tables included in this query. |  |
| `bytes_spilled_to_local_storage` | NUMBER | Volume of data spilled to local disk. |  |
| `bytes_spilled_to_remote_storage` | NUMBER | Volume of data spilled to remote disk. |  |
| `bytes_sent_over_the_network` | NUMBER | Volume of data sent over the network. |  |
| `compilation_time` | NUMBER | Compilation time (in milliseconds) | ✔ |
| `execution_time` | NUMBER | Execution time (in milliseconds) | ✔ |
| `queued_provisioning_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for the warehouse compute resources to provision, due to warehouse creation, resume, or resize. | ✔ |
| `queued_repair_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for compute resources in the warehouse to be repaired. | ✔ |
| `queued_overload_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, due to the warehouse being overloaded by the current query workload. | ✔ |
| `transaction_blocked_time` | NUMBER | Time (in milliseconds) spent blocked by a concurrent DML. | ✔ |
| `outbound_data_transfer_cloud` | VARCHAR | Target cloud provider for statements that unload data to another region and/or cloud. | ✔ |
| `outbound_data_transfer_region` | VARCHAR | Target region for statements that unload data to another region and/or cloud. | ✔ |
| `outbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in statements that unload data from Snowflake tables. | ✔ |
| `inbound_data_transfer_cloud` | VARCHAR | Source cloud provider for statements that load data from another region and/or cloud. | ✔ |
| `inbound_data_transfer_region` | VARCHAR | Source region for statements that load data from another region and/or cloud. | ✔ |
| `inbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in a replication operation from another account. The source account could be in the same region or a different region than the current account. | ✔ |
| `list_external_files_time` | NUMBER | Time (in milliseconds) spent listing external files. |  |
| `credits_used_cloud_services` | NUMBER | Number of credits used for cloud services. This value does not take into account the [adjustment for cloud services](../../user-guide/cost-understanding-compute.md), and may therefore be greater than the credits that are billed. To determine how many credits were actually billed, run queries against the [METERING_DAILY_HISTORY view](metering_daily_history.md). | ✔ |
| `reader_account_deleted_on` | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the reader account is deleted. | ✔ |
| `release_version` | VARCHAR | Release version in the format of `major_release.minor_release.patch_release`. |  |
| `external_function_total_invocations` | NUMBER | The aggregate number of times that this query called remote services. For important details, see the Usage Notes. |  |
| `external_function_total_sent_rows` | NUMBER | The total number of rows that this query sent in all calls to all remote services. |  |
| `external_function_total_received_rows` | NUMBER | The total number of rows that this query received from all calls to all remote services. |  |
| `external_function_total_sent_bytes` | NUMBER | The total number of bytes that this query sent in all calls to all remote services. |  |
| `external_function_total_received_bytes` | NUMBER | The total number of bytes that this query received from all calls to all remote services. |  |
| `query_load_percent` | NUMBER | The approximate percentage of active compute resources in the warehouse for this query execution. |  |
| `is_client_generated_statement` | BOOLEAN | Indicates whether the query was client-generated. |  |
| `query_acceleration_bytes_scanned` | NUMBER | Number of bytes scanned by the [query acceleration service](../../user-guide/query-acceleration-service.md). |  |
| `query_acceleration_partitions_scanned` | NUMBER | Number of partitions scanned by the query acceleration service. |  |
| `query_acceleration_upper_limit_scale_factor` | NUMBER | Upper limit [scale factor](../../user-guide/query-acceleration-service.md) that a [query would have benefited from](../../user-guide/query-acceleration-service.md). |  |
| `transaction_id` | NUMBER | [ID of the transaction](../transactions.md) that contains the statement or 0 if the statement is not executed within a transaction. |  |
| `child_queries_wait_time` | NUMBER | Time (in milliseconds) to complete the cached lookup when calling a [memoizable function](../../developer-guide/udf/sql/udf-sql-scalar-functions.md). |  |
| `role_type` | VARCHAR | Specifies whether an APPLICATION, DATABASE_ROLE, or ROLE executed the query. |  |
| `query_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |  |
| `query_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |  |
| `query_parameterized_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |  |
| `query_parameterized_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |  |
| `secondary_role_stats` | VARCHAR | A JSON-formatted string that contains three fields regarding secondary roles that were evaluated in the query: a list of secondary roles or `ALL` depending on the session, a count of the number of secondary roles, and the internal/system-generated ID for each secondary role. The count and number of IDs have a maximum of 50. |  |
| `rows_written_to_result` | NUMBER | Number of rows written to a result object. For CREATE TABLE AS SELECT (CTAS) and all DML operations, this result is `1`. |  |
| `query_retry_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by actionable errors. For more information, see Query retry columns. |  |
| `query_retry_cause` | VARCHAR | Error that caused the query to retry. If there is no query retry, the field is NULL. For more information, see Query retry columns. |  |
| `fault_handling_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by errors that are *not* actionable. For more information, see Query retry columns. |  |
| `user_type` | VARCHAR | The type of the user executing the query. It’s the same as the `type` column in the [USERS view](users.md). If a Snowpark Container Services service executes the query, the user type is SNOWFLAKE_SERVICE. |  |
| `user_database_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |  |
| `user_database_id` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL |  |
| `user_schema_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |  |
| `user_schema_id` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema; otherwise, it’s NULL. |  |
| `bind_values` | ARRAY | Bind values in serialized form. If the query contains no bind values, then this column contains an empty array. If the array is too large or the [ALLOW_BIND_VALUES_ACCESS](../parameters.md) parameter is set to `FALSE`, this column contains NULL. For more information, see [Retrieve bind variable values](../bind-variables.md). |  |
|  |  |  |  |

## Usage notes

### General

* Latency for the view may be up to 45 minutes.

* The values for the columns
  `external_function_total_invocations`, `external_function_total_sent_rows`,
  `external_function_total_received_rows`, `external_function_total_sent_bytes`, and `external_function_total_received_bytes`
  are affected by many factors, including:

  + The number of external functions in the SQL statement.
  + The number of rows per batch sent to each remote service.
  + The number of retries due to transient errors (for example, because a response was not received within the expected time).
* If you want to filter on client-generated query statements, use
  [QUERY_HISTORY](../functions/query_history.md) (an Information Schema table function).
* Canceled queries are identified by their `error_message` text (`SQL execution canceled`), not by their `execution_status` value.

### Query retry columns

A query might need to be retried one or more times in order to successfully complete. There can be multiple causes that result in a query
retry. Some of these causes are *actionable*, that is, a user can make changes to reduce or eliminate query retries for a specific query.
For example, if a query is retried due to an out of memory error, modifying warehouse settings might resolve the issue.

Some query retries are caused by a fault that is not actionable. That is, there is no change a user can make to prevent the
query retry. For example, a network outage might result in a query retry. In this case, there is no change to the query or to the
warehouse that executes it that can prevent the query retry.

The QUERY_RETRY_TIME, QUERY_RETRY_CAUSE, and FAULT_HANDLING_TIME columns can help you optimize queries that are retried and better
understand fluctuations in query performance.

### Query history for hybrid tables

The following notes explain when records are logged in the QUERY_HISTORY view for queries against hybrid tables:

* Short-running queries that operate exclusively against hybrid tables do not generate a record in this
  view or [QUERY_HISTORY](../functions/query_history.md) (Information
  Schema table function). To monitor such queries, use the
  [AGGREGATE_QUERY_HISTORY](aggregate_query_history.md) view.
  This view allows you to more easily monitor high-throughput operational
  workloads for trends and issues.
* Short-running queries that operate exclusively against hybrid tables do not provide a query profile
  that you can inspect in Snowsight.
* Queries against hybrid tables do generate both a record in the QUERY_HISTORY view and a query profile if any of the
  following conditions are met:

  + A query is executed against any table type other than the hybrid table type. This
    condition ensures that there is no behavior change for any existing
    non-Unistore workloads.
  + A query fails with an EXECUTION_STATUS of `failed_with_incident` (see
    [QUERY_HISTORY](../functions/query_history.md)). This
    condition ensures that you can investigate and report the specific failed
    query to receive assistance.
  + A query is running longer than approximately 500 milliseconds. This
    condition ensures that you can investigate performance issues for slow queries.
  + Query result size is too large.
  + A query is associated with a Snowflake transaction.
  + A query contains a system function with side effects.
  + A query is not one of the following statement types: SELECT, INSERT,
    DELETE, UPDATE, MERGE.
  + A query is executed from SnowSQL, Snowsight, or Classic Console. This
    condition ensures that you can manually generate a full query profile to
    investigate performance issues for any specific query even if it is not
    categorized as long-running.
  + Even if a query does not meet any of these criteria, queries can be
    periodically sampled to generate a record in the QUERY_HISTORY view and a
    query profile to help your investigation.

### PUT and GET commands

For the [PUT](../sql/put.md) and [GET](../sql/get.md) commands,
an EXECUTION_STATUS of `success` in the QUERY_HISTORY
does *not* mean that data files were successfully uploaded or downloaded.
Instead, the status indicates that Snowflake received authorization to proceed with the file transfer.

---
title: QUERY_INSIGHTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/query_insights.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# QUERY_INSIGHTS view

This Account Usage view displays a row for each [insight produced for a query](../../user-guide/query-insights.md).

## Columns

| Column name | Type | Description |
| --- | --- | --- |
| `start_time` | TIMESTAMP_LTZ | Start time of the query. |
| `end_time` | TIMESTAMP_LTZ | End time of the query. |
| `total_elapsed_time` | NUMBER | Total elapsed time of the query (in milliseconds). |
| `query_id` | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| `query_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| `query_parameterized_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| `warehouse_id` | VARCHAR | Internal/system-generated identifier for the warehouse that was used. |
| `warehouse_name` | VARCHAR | Warehouse that the query executed on, if any. |
| `insight_instance_id` | NUMBER | Internal/system-generated identifier for the insight. |
| `insight_type_id` | VARCHAR | Identifier of the [insight type](../../user-guide/query-insights.md). |
| `message` | VARIANT | Structured information and details about the insight. |
| `suggestions` | ARRAY | Array of strings, each containing a recommended action for the insight. |
| `is_opportunity` | BOOLEAN | If `true`, the insight includes suggestions to improve query performance. For example:   * For an insight with the type ID `QUERY_INSIGHT_NO_FILTER_ON_TOP_OF_TABLE_SCAN`, this column contains `true` because   the insight includes suggestions for improving performance. * For an insight with the type ID `QUERY_INSIGHT_FILTER_WITH_CLUSTERING_KEY`, this column contains `false` because the   insight does not include suggestions for improving performance. |
| `insight_topic` | VARCHAR | Label that identifies the type of performance impact detected by this insight. For the list of labels, see Insight topics. |

### Insight topics

For the `insight_topic` column, the label can be one of the following:

* `TABLE_SCAN`: Insights about the efficiency of accessing tables. This label applies to the following types of insights:

  + [QUERY_INSIGHT_NO_FILTER_ON_TOP_OF_TABLE_SCAN](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_INAPPLICABLE_FILTER_ON_TABLE_SCAN](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_UNSELECTIVE_FILTER](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_LIKE_WITH_LEADING_WILDCARD](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_FILTER_WITH_CLUSTERING_KEY](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_SEARCH_OPTIMIZATION_USED](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_SNOWFLAKE_OPTIMA](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_SEARCH_OPTIMIZATION_AND_SNOWFLAKE_OPTIMA](../../user-guide/query-insights.md)
* `JOIN`: Insights about the efficiency of JOIN operations in the query. This label applies to the following types of insights:

  + [QUERY_INSIGHT_JOIN_WITH_NO_JOIN_CONDITION](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_INEFFICIENT_JOIN_CONDITION](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_NESTED_EXPLODING_JOIN](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_EXPLODING_JOIN](../../user-guide/query-insights.md)
* `AGGREGATE`: Insights about the efficiency of aggregate operations in the query. This label applies to the following types of
  insights:

  + [QUERY_INSIGHT_INEFFICIENT_AGGREGATE](../../user-guide/query-insights.md)
* `UNION`: Insights about the efficiency of UNION operations in the query. This label applies to the following types of
  insights:

  + [QUERY_INSIGHT_UNNECESSARY_UNION_DISTINCT](../../user-guide/query-insights.md)
* `WAREHOUSE`: Insights about the warehouse used for the query. This label applies to the following types of insights:

  + [QUERY_INSIGHT_REMOTE_SPILLAGE](../../user-guide/query-insights.md)
  + [QUERY_INSIGHT_QUEUED_OVERLOAD](../../user-guide/query-insights.md)

## Usage notes

* Latency for the view may be up to 90 minutes.

## Examples

The following example returns the query insights for the query with the ID
`01bd3a9d-0910-8327-0000-09717704c032`:

```sqlexample
SELECT query_id, insight_type_id, message, suggestions
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_INSIGHTS
  WHERE query_id = '01bd3a9d-0910-8327-0000-09717704c032';
```

The following example returns the query insights for queries that have the same
[hash of parameterized query text](../../user-guide/query-hash.md). These are queries that use the same SELECT statement except for
the literals specified in the statement.

```sqlexample
SELECT query_id, insight_type_id, message, suggestions
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_INSIGHTS
  WHERE query_parameterized_hash = '4bb66effc1a3c8b4e94a728f7caaa736';
```

The following example returns the query insights for queries that ran during the past week:

```sqlexample
SELECT query_id, insight_type_id, message, suggestions
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_INSIGHTS
  WHERE start_time > TO_DATE(DATEADD(DAY, -7, CURRENT_DATE()));
```

The following example returns the query insights for queries that ran during the past week and took more than an hour to complete:

```sqlexample
SELECT query_id, insight_type_id, message, suggestions
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_INSIGHTS
  WHERE start_time > TO_DATE(DATEADD(DAY, -7, CURRENT_DATE()))
    AND total_duration > 3600000;
```

The following example returns the query insights for queries that ran during the past week, took more than an hour to complete,
and used the warehouse with the ID `84412315`:

```sqlexample
SELECT query_id, insight_type_id, message, suggestions
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_INSIGHTS
  WHERE start_time > TO_DATE(DATEADD(DAY, -7, CURRENT_DATE()))
    AND total_duration > 3600000
    AND warehouse_id = 84412315;
```

---
title: REFERENTIAL_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/referential_constraints.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# REFERENTIAL_CONSTRAINTS view

This Account Usage view displays a row for each FOREIGN KEY constraint that is defined for tables in the account.

FOREIGN KEY constraints are used to enforce referential integrity. For more information, see
[Constraints](../constraints.md) and [Referential Integrity Constraints](../../user-guide/table-considerations.md).

To return information about other constraint types (as well as FOREIGN KEY constraints), query the [TABLE_CONSTRAINTS view](table_constraints.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the constraint. |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to |
| CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint |
| UNIQUE_CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the current constraint. |
| UNIQUE_CONSTRAINT_CATALOG | VARCHAR | Database of the unique constraint referenced by the current constraint. |
| UNIQUE_CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| UNIQUE_CONSTRAINT_SCHEMA | VARCHAR | Schema of the unique constraint referenced by the current constraint. |
| UNIQUE_CONSTRAINT_NAME | VARCHAR | Name of the unique constraint referenced by the current constraint. |
| MATCH_OPTION | VARCHAR | Match option for the constraint. |
| UPDATE_RULE | VARCHAR | Update Rule for the current constraint. |
| DELETE_RULE | VARCHAR | Delete Rule for the current constraint. |
| COMMENT | VARCHAR | Comment for the constraint. |
| CREATED | TIMESTAMP_LTZ | Date and time when the constraint was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the constraint was dropped. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: REPLICATION_GROUP_REFRESH_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/replication_group_refresh_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# REPLICATION_GROUP_REFRESH_HISTORY view

This Account Usage view can be used to query the refresh history for a specified
[replication or failover group](../../user-guide/account-replication-intro.md).

See also:
:   [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../functions/replication_group_refresh_history.md) (Information Schema table function)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPLICATION_GROUP_NAME | VARCHAR | Name of the secondary replication or failover group. |
| REPLICATION_GROUP_ID | NUMBER | Internal/system-generated identifier for the replication or failover group. |
| PHASE_NAME | VARCHAR | Current phase in the replication operation. For the list of phases, see the Usage notes. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication operation began. |
| END_TIME | TIMESTAMP_LTZ | Time when the replication operation finished, if applicable. `NULL` if it is in progress. |
| JOB_UUID | VARCHAR | Query ID for the refresh job. |
| TOTAL_BYTES | VARIANT | A JSON object that provides detailed information about refreshed databases:   * `totalBytesToReplicate`: Total number of bytes expected to be replicated. * `bytesUploaded`: Actual number of bytes uploaded. * `bytesDownloaded`: Actual number of bytes downloaded. * `databases`: List of JSON objects containing the following fields for each member database:    + `name`: Name of the database.   + `totalBytesToReplicate`: Total bytes expected to be replicated for the database. |
| OBJECT_COUNT | VARIANT | A JSON object that provides detailed information about refreshed objects:   * `totalObjects`: Total number of objects in the replication or failover group. * `completedObjects`: Total number of objects completed. * `objectTypes`: List of JSON objects containing the following fields for each type:    + `objectType`: Type of object (for example users, roles, grants, warehouses, schemas, tables, columns, etc).   + `totalObjects`: Total number of objects of this type in the replication or failover group.   + `completedObjects`: Total number of objects of this type that were completed. |
| PRIMARY_SNAPSHOT_TIMESTAMP | TIMESTAMP_LTZ | Timestamp when the primary snapshot was created. |
| ERROR | VARIANT | NULL if the refresh operation is successful. If the refresh operation fails, returns a JSON object that provides detailed information about the error:   * `errorCode`: Error code of the failure. * `errorMessage`: Error message of the failure. |

## Usage notes

* Latency for the view may be up to 180 minutes (three hours).

  To view real-time refresh progress, use the [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../functions/replication_group_refresh_history.md) table function.

* Results are only returned for secondary failover or replication groups in the current account (the target account).
* The following is the list of phases in the order processed:

  | # | Phase name | Description |
  | --- | --- | --- |
  | 1 | `SECONDARY_SYNCHRONIZING_MEMBERSHIP` | The secondary replication or failover group receives information from the primary group about the objects included in the group, and updates its membership metadata. |
  | 2 | `SECONDARY_UPLOADING_INVENTORY` | The secondary replication or failover group sends an inventory of its objects in the target account to the primary group. |
  | 3 | `PRIMARY_UPLOADING_METADATA` | The primary replication or failover group creates a snapshot of metadata in the source account and sends it to the secondary group. |
  | 4 | `PRIMARY_UPLOADING_DATA` | The primary replication or failover group copies the files the secondary group needs to reconcile any deltas between the objects in the source and target accounts. |
  | 5 | `SECONDARY_DOWNLOADING_METADATA` | The secondary replication or failover group applies the snapshot of the metadata that was sent by the primary. The metadata updates are not applied atomically and instead applied over time. |
  | 6 | `SECONDARY_DOWNLOADING_DATA` | The secondary replication or failover group copies the files sent by the primary group to the target account. |
  | 7 | `COMPLETED` / `FAILED` / `CANCELED` | Refresh operation status. |

## Examples

To retrieve the refresh history for the secondary failover group `myfg`, execute the following statement:

```sqlexample
SELECT phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM SNOWFLAKE.ACCOUNT_USAGE.REPLICATION_GROUP_REFRESH_HISTORY
  WHERE replication_group_name = 'MYFG';
```

To retrieve the last refresh record for each replication or failover group, execute the following statement:

```sqlexample
SELECT replication_group_name, phase_name,
       start_time, end_time,
       total_bytes, object_count, error,
       ROW_NUMBER() OVER (
         PARTITION BY replication_group_name
         ORDER BY end_time DESC
       ) AS row_num
  FROM SNOWFLAKE.ACCOUNT_USAGE.REPLICATION_GROUP_REFRESH_HISTORY
  QUALIFY row_num = 1;
```

---
title: REPLICATION_GROUP_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/replication_group_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# REPLICATION_GROUP_USAGE_HISTORY view

This Account Usage view can be used to query the replication history for a specified
[replication or failover group](../../user-guide/account-replication-intro.md).

The returned results include the replication or
failover group name, credits consumed, and bytes transferred for replication. Usage data is retained for 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the replication usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the replication usage took place. |
| REPLICATION_GROUP_NAME | VARCHAR | Name of the secondary replication or failover group. |
| REPLICATION_GROUP_ID | NUMBER | Internal/system-generated identifier for the replication or failover group. |
| CREDITS_USED | NUMBER | Total number of credits used for replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for replication during the START_TIME and END_TIME window. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* Results are only returned for secondary failover or replication groups in the target account.
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

---
title: REPLICATION_GROUPS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/replication_groups.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# REPLICATION_GROUPS view

This Account Usage view displays a row for each
[replication group and failover group](../../user-guide/account-replication-intro.md) in the account.

The returned results include details such as the replication or failover group name,
the types of objects that it applies to, and its schedule for replication refresh operations.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED | TIMESTAMP_LTZ | Date and time the replication or failover group was created. |
| DELETED | TIMESTAMP_LTZ | Date and time the replication or failover group was deleted. |
| NAME | VARCHAR | Name of the replication or failover group. |
| TYPE | VARCHAR | Type of group. Valid values are REPLICATION or FAILOVER. |
| COMMENT | VARCHAR | Comment string. |
| OBJECT_TYPES | VARCHAR | List of specified object types enabled for replication (and failover in the case of a FAILOVER group). |
| ALLOWED_INTEGRATION_TYPES | VARCHAR | List of integration types that are enabled for replication. Snowflake always includes this column in the output, even if integrations weren’t specified in the CREATE or ALTER command. |
| REPLICATION_SCHEDULE | VARCHAR | Scheduled interval for refresh; NULL if no replication schedule is set. |
| OWNER | VARCHAR | Name of the role with the OWNERSHIP privilege on the replication or failover group. |
| IS_LISTING_AUTO_FULFILLMENT_GROUP | BOOLEAN | TRUE if the replication group is used for Cross-Cloud Auto-Fulfillment. FALSE otherwise. |
| ERROR_INTEGRATION | VARCHAR | The name of the notification integration for the replication group or failover group to which the error notification is sent in cases of refresh failures. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

## Examples

The following example returns the active failover groups for your Snowflake account:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.REPLICATION_GROUPS
  WHERE type = 'FAILOVER' AND deleted IS NULL
  ORDER BY name;
```

---
title: REPLICATION_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/replication_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# REPLICATION_USAGE_HISTORY view

This Account Usage view can be used to query the replication history for a specified database. The returned results include the database name, credits consumed, and bytes transferred for replication. Usage data is retained for 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the replication usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the replication usage took place. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| CREDITS_USED | NUMBER | Total number of credits used for database replication during the START_TIME and END_TIME window. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for database replication during the START_TIME and END_TIME window. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The view displays data starting from September 1, 2019.

---
title: RESOURCE_MONITORS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/resource_monitors.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# RESOURCE_MONITORS view

This Account Usage view displays the resource monitors that have been created in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| READER_ACCOUNT_NAME | VARCHAR | Name of the reader account where the resource monitor was created. Column only included in view in READER_ACCOUNT_USAGE schema. |
| NAME | VARCHAR | Name of the resource monitor. |
| CREATED | TIMESTAMP_LTZ | Date and time when the resource monitor was created. |
| CREDIT_QUOTA | VARIANT | Monthly credit quota for the resource monitor. |
| USED_CREDITS | VARIANT | Number of credits used in the current monthly billing cycle by all the warehouses associated with the resource monitor. |
| REMAINING_CREDITS | FLOAT | Number of credits still available to use in the current monthly billing cycle. |
| OWNER | VARCHAR | Name of the role that owns the resource monitor. |
| WAREHOUSES | VARCHAR | Names of the warehouses that are associate with the resource monitor. |
| NOTIFY | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, notifications are sent. |
| SUSPEND | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, assigned warehouses are suspended but currently running queries are allowed to complete. |
| SUSPEND_IMMEDIATE | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, all assigned warehouses are suspended immediately, including those running queries. |
| LEVEL | VARCHAR | Indicates whether it is an account-level or a warehouse-level resource monitor. |
| READER_ACCOUNT_DELETED_ON | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the reader account is deleted. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: ROLES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/roles.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ROLES view

This Account Usage view can be used to query a list of all roles defined in the account. The data is retained for 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROLE_ID | NUMBER | Internal/system-generated identifier for the role. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the role was created. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the role was deleted. |
| NAME | VARCHAR | Name of the role. |
| COMMENT | VARCHAR | Comment for the role. |
| OWNER | VARCHAR | Role with the OWNERSHIP privilege on the object. |
| ROLE_TYPE | VARCHAR | Either `ROLE`, `DATABASE_ROLE`, `INSTANCE_ROLE`, or `APPLICATION_ROLE`. |
| ROLE_DATABASE_NAME | VARCHAR | Name of the database that contains the database role if the role is a database role. |
| ROLE_INSTANCE_ID | NUMBER | Internal/system-generated identifier for the class instance that the role belongs to. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| IS_FROM_ORGANIZATION_USER_GROUP | BOOLEAN | If TRUE, the role was imported from an [organization user group](../../user-guide/organization-users.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view does not include database roles for databases created from shares.

### Internal Snowflake role for Snowsight

The first time [Snowsight](../../user-guide/ui-snowsight.md) is accessed in an account, Snowflake creates the internal APPADMIN and
WORKSHEETS_APP_RL roles to support the web interface. These roles are used to cache query results in an internal stage in your account.
This cached data is encrypted and protected by the key hierarchy for the account. The limited privileges granted to these internal roles
only allow Snowsight to access the internal stage to store those results. Thes roles cannot list objects in your account or access
data in your tables. For more information, see [Getting started with Snowsight](../../user-guide/ui-snowsight-gs.md).

---
title: ROW_ACCESS_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/row_access_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# ROW_ACCESS_POLICIES view

This Account Usage view displays a row for each row access policy defined in your account.

Each row corresponds to a different row access policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the row access policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the row access policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema to which the row access policy belongs. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the row access policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the row access policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the row access policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Row access policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the row access policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the row access policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the row access policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OPTIONS | VARIANT | The value for the EXEMPT_OTHER_POLICIES property in the policy. If set to `TRUE`, the column returns `{ "EXEMPT_OTHER_POLICIES: "TRUE" }`. If the property is set to `FALSE` or not set at all, the column returns NULL. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only returns rows if at least one row access policy has been created.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Example

Obtain all of the row access policies created in your account, ordered by the timestamp on which the policy was created:

> ```sqlexample
> select policy_name, policy_signature, created
> from row_access_policies
> order by created
> ;
> ```

---
title: SCHEMATA view
source: https://docs.snowflake.com/en/sql-reference/account-usage/schemata.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SCHEMATA view

This Account Usage view displays a row for each schema in the account except the ACCOUNT_USAGE, READER_ACCOUNT_USAGE, and INFORMATION_SCHEMA schemas.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema. |
| SCHEMA_NAME | VARCHAR | Name of the schema. |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the schema. |
| CATALOG_NAME | VARCHAR | Database that the schema belongs to. |
| SCHEMA_OWNER | VARCHAR | Name of the role that owns the schema. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| IS_TRANSIENT | VARCHAR | Whether the schema is transient. |
| IS_MANAGED_ACCESS | VARCHAR | Whether the schema is a managed access schema. |
| DEFAULT_CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| SQL_PATH | VARCHAR | Not applicable for Snowflake. |
| COMMENT | VARCHAR | Comment for the schema. |
| CREATED | TIMESTAMP_LTZ | Date and time when the schema was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the schema was dropped. |
| SCHEMA_TYPE | VARCHAR | Specifies the schema type. Valid values are: . . - STANDARD: normal schema. . - VERSIONED: versioned schema. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| SCHEMA_TYPE | VARCHAR | Type of schema. Possible values are `STANDARD` and `VERSIONED`. |
| VERSION_NAME | VARCHAR | Name of the schema if it is a versioned schema. NULL otherwise. |
| VERSIONED_SCHEMA_ID | NUMBER | Internal/system-generated identifier if the schema is a versioned schema. NULL, otherwise. |
| OBJECT_VISIBILITY | OBJECT | `OBJECT_VISIBILITY`  [Preview Feature](../../release-notes/preview-features.md) — Open  Available to all accounts.  This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account, enabling users without explicit access privileges to find objects and request access. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SEARCH_OPTIMIZATION_BENEFITS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/search_optimization_benefits.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEARCH_OPTIMIZATION_BENEFITS view

This Account Usage view can be used to determine the efficacy of pruning due to
[search optimization](../../user-guide/search-optimization-service.md).

This view provides information about pruning, similar to the information provided by the [TABLE_PRUNING_HISTORY view](table_pruning_history.md). Note that
TABLE_PRUNING_HISTORY view provides information about all pruning, as opposed to pruning due to search optimization.

You can use this view to compare the effects on pruning before and after adding search optimization to a table. When you query
this view, compare the number of partitions pruned due to search optimization (`PARTITIONS_PRUNED_ADDITIONAL`) against the
total number of partitions pruned (`PARTITIONS_PRUNED_DEFAULT + PARTITIONS_PRUNED_ADDITIONAL`).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the queries were executed. |
| END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the queries were executed. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that was queried. |
| TABLE_NAME | VARCHAR | Name of the table that was queried. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table that was queried. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table that was queried. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table that was queried. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table that was queried. |
| NUM_SCANS | NUMBER | Number of scan operations (from all queries on the table during the START_TIME and END_TIME window) that used [search optimization](../../user-guide/search-optimization-service.md) to improve pruning. Note that a given query might result in multiple scan operations on the same table. |
| PARTITIONS_SCANNED | NUMBER | Number of partitions scanned during the scan operations described in `NUM_SCANS`. |
| PARTITIONS_PRUNED_DEFAULT | NUMBER | Number of partitions that were pruned as a result of the default (natural) ordering of data for the queries described in `NUM_SCANS`. These partitions were eliminated during query processing, improving the efficiency of the query. |
| PARTITIONS_PRUNED_ADDITIONAL | NUMBER | Number of partitions that were pruned as a result of [search optimization](../../user-guide/search-optimization-service.md) for the queries described in `NUM_SCANS`. These partitions were eliminated during query processing, improving the efficiency of the query. |

## Usage Notes

* Latency for the view may be up to 6 hours.
* This view retains data for the 1,000 longest-running table scans per query. Only extremely complex queries
  exceed this number of scans so data is rarely omitted.

## Examples

List the top five tables that have benefited the most from search optimization within the last seven days:

```sqlexample
SELECT
    table_id,
    ANY_VALUE(table_name) AS table_name,
    SUM(num_scans) AS total_num_scans,
    SUM(partitions_pruned_default) AS total_partitions_pruned_default,
    SUM(partitions_pruned_additional) AS total_partitions_pruned_additional,
    SUM(partitions_scanned) AS total_partitions_scanned
  FROM SNOWFLAKE.ACCOUNT_USAGE.SEARCH_OPTIMIZATION_BENEFITS
  WHERE start_time >= DATEADD(day, -7, CURRENT_TIMESTAMP())
  GROUP BY table_id
  ORDER BY
    total_partitions_pruned_additional / GREATEST(total_partitions_pruned_default + total_partitions_pruned_additional, 1) DESC,
    total_partitions_pruned_additional DESC
  LIMIT 5;
```

The example above uses [GREATEST](../functions/greatest.md) to avoid dividing by zero when the number of partitions pruned is
zero.

---
title: SEARCH_OPTIMIZATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/search_optimization_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEARCH_OPTIMIZATION_HISTORY view

This Account Usage view can be used to query the [search](../../user-guide/search-optimization-service.md) history. The information returned by the view includes the search optimization service name and credits consumed by the service.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | NUMBER | Number of credits billed for the search optimization service during the START_TIME and END_TIME window. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the search optimization service. |
| TABLE_NAME | VARCHAR | This is a system-generated alias that contains the ID of the table for which search optimization was enabled; that ID is embedded inside a string of the form “SEARCH OPTIMIZATION ON TABLE_ID: <optimized_table_id>”. For example, if you enable search optimization on a table named `accounts`, and if `accounts` has ID 1200, then the TABLE_NAME (alias) shown in this column will be “SEARCH OPTIMIZATION ON TABLE_ID: 1200”. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the search optimization service. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the search optimization service. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the search optimization service. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the search optimization service. |

## Usage notes

* Billing history is not necessarily updated immediately. Latency for the view may be up to 180 minutes (3 hours).

* Remember that the TABLE_ID column and the TABLE_NAME column do not refer to the same database object.

  + The TABLE_ID identifies the search optimization service instance.
  + The TABLE_NAME shows the table ID of the base table, which is the table
    on which the search optimization service is enabled.
* The output contains one row for each search optimization maintenance operation that is executed. Each optimization
  operation updates information about one table. The number of operations executed on each table depends on the number and
  size of updates to the data in that table.

  You can use combinations of aggregate functions and GROUP BY clauses to aggregate costs per table, or across all tables.
* The view shows only base table IDs, not base table names, so the view does not directly show costs associated with base
  tables by name.
* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

---
title: SECRETS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/secrets.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SECRETS view

This Account Usage view provides the [secrets](../sql/create-secret.md) in your account.

Each row in this view corresponds to a different secret.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| `id` | NUMBER | Internal, system-generated identifier for the secret. |
| `name` | VARCHAR | Name of the secret. |
| `schema_id` | NUMBER | Internal, system-generated identifier for the schema of the secret. |
| `schema` | VARCHAR | Schema that the secret belongs to. |
| `database_id` | NUMBER | Internal, system-generated identifier for the database of the secret. |
| `database` | VARCHAR | Database that the secret belongs to. |
| `owner` | VARCHAR | Name of the role that owns the secret; NULL if it has been dropped. |
| `owner_role_type` | VARCHAR(13) | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| `secret_type` | VARCHAR | The type of secret (`GENERIC_STRING`, `OAUTH2`, or `PASSWORD`). |
| `oauth_access_token_expiry_timestamp` | TIMESTAMP_LTZ(6) | The expiry time of the OAuth access token stored in the secret. |
| `oauth_refresh_token_expiry_timestamp` | TIMESTAMP_LTZ(6) | The expiry time of the OAuth refresh token stored in the secret. |
| `oauth_scopes` | VARCHAR | A comma-separated list of scopes to use when making a request from the OAuth server by a role with USAGE on the integration during the OAuth client credentials flow. |
| `api_authentication_integration_name` | VARCHAR | The name of the API Authentication Integration used by this secret for authentication. |
| `comment` | VARCHAR | Comment for the secret. |
| `created_on` | TIMESTAMP_LTZ(6) | Date and time when the secret was created. |
| `last_altered_on` | TIMESTAMP_LTZ(6) | Date and time when the secret was last altered. |
| `deleted_on` | TIMESTAMP_LTZ(6) | Date and time when the secret was dropped. |
| `algorithm` | VARCHAR | Algorithm used to generate the key for a symmetric key secret. |
| `key_length` | VARCHAR | Length of the key used for a symmetric key secret. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* Sensitive values that the secret stores, such as the values for username, password, and OAuth refresh token, are not reported in this
  view.

---
title: SEMANTIC_DIMENSIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_dimensions.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_DIMENSIONS view

This ACCOUNT_USAGE view displays a row for each dimension defined in a [semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SEMANTIC_DIMENSIONS view (Information Schema)](../info-schema/semantic_dimensions.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_dimension_id` | NUMBER | ID of the dimension in the semantic view. |
| `semantic_dimension_name` | VARCHAR | Name of the dimension in the semantic view. |
| `semantic_table_id` | NUMBER | ID of the semantic table the dimension belongs to. |
| `semantic_table_name` | VARCHAR | Name of the semantic table the dimension belongs to. |
| `semantic_view_id` | NUMBER | ID of the semantic view. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `semantic_view_schema_id` | NUMBER | ID of the schema to which the semantic view belongs. |
| `semantic_view_schema_name` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_database_id` | NUMBER | ID of the database to which the semantic view belongs. |
| `semantic_view_database_name` | VARCHAR | Database to which the semantic view belongs. |
| `data_type` | VARCHAR | Data type of the dimension expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the dimension. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the dimension. |
| `created` | TIMESTAMP_LTZ | Creation time of the dimension. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the dimension was dropped. |
| `comment` | VARCHAR | Description of the dimension. |
| `cortex_search_service_database` | VARCHAR | Name of the database containing the [Cortex Search Service that the dimension uses](../../user-guide/views-semantic/sql.md). |
| `cortex_search_service_schema` | VARCHAR | Name of the schema containing the Cortex Search Service that the dimension uses. |
| `cortex_search_service` | VARCHAR | Name of the Cortex Search Service that the dimension uses. |
| `cortex_search_service_column` | VARCHAR | Name of the column that the Cortex Search Service allows you to search on, if the dimension uses a Cortex Search Service. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the list of all dimensions for the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_DIMENSIONS
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
+-----------------------+------------------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------+----------------------+----------+-------------------------------+-------------------------------+---------+---------+
| SEMANTIC_DIMENSION_ID | SEMANTIC_DIMENSION_NAME            | SEMANTIC_TABLE_ID | SEMANTIC_TABLE_NAME | SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | DATA_TYPE   | EXPRESSION           | SYNONYMS | CREATED                       | LAST_ALTERED                  | DELETED | COMMENT |
|-----------------------+------------------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------+----------------------+----------+-------------------------------+-------------------------------+---------+---------|
|                   391 | D_CUSTOMER_REGION_NAME_FROM_REGION |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | region.d_region_name | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   392 | D_CUSTOMER_NATION_NAME             |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | nation.d_nation_name | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   393 | D_CUSTOMER_MARKET_SEGMENT          |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(10) | c_mktsegment         | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   387 | D_NATION_NAME                      |                98 | NATION              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | n_name               | NULL     | 2025-02-28 16:16:04.388 -0800 | 2025-02-28 16:16:04.388 -0800 | NULL    | NULL    |
|                   389 | D_REGION_NAME                      |                97 | REGION              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | r_name               | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   394 | D_CUSTOMER_COUNTRY_CODE            |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(15) | LEFT(c_phone, 2)     | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   390 | D_CUSTOMER_REGION_NAME             |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | nation.d_region_name | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                   388 | D_REGION_NAME                      |                98 | NATION              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | VARCHAR(25) | region.d_region_name | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
+-----------------------+------------------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------+----------------------+----------+-------------------------------+-------------------------------+---------+---------+
```

---
title: SEMANTIC_FACTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_facts.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_FACTS view

This ACCOUNT_USAGE view displays a row for each fact defined in a [semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SEMANTIC_FACTS view (Information Schema)](../info-schema/semantic_facts.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_fact_id` | NUMBER | ID of the fact in the semantic view. |
| `semantic_fact_name` | VARCHAR | Name of the fact in the semantic view. |
| `semantic_table_id` | NUMBER | ID of the semantic table the fact belongs to. |
| `semantic_table_name` | VARCHAR | Name of the semantic table the fact belongs to. |
| `semantic_view_id` | NUMBER | ID of the semantic view. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `semantic_view_schema_id` | NUMBER | ID of the schema to which the semantic view belongs. |
| `semantic_view_schema_name` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_database_id` | NUMBER | ID of the database to which the semantic view belongs. |
| `semantic_view_database_name` | VARCHAR | Database to which the semantic view belongs. |
| `data_type` | VARCHAR | Data type of the fact expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the fact. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the fact. |
| `comment` | VARCHAR | Description of the fact. |
| `created` | TIMESTAMP_LTZ | Creation time of the fact. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the fact was dropped. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the list of all facts for the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_FACTS
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
+------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------+----------+-------------------------------+-------------------------------+---------+---------+
| SEMANTIC_FACT_ID | SEMANTIC_FACT_NAME     | SEMANTIC_TABLE_ID | SEMANTIC_TABLE_NAME | SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | DATA_TYPE    | EXPRESSION               | SYNONYMS | CREATED                       | LAST_ALTERED                  | DELETED | COMMENT |
|------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------+----------+-------------------------------+-------------------------------+---------+---------|
|              386 | A_CUSTOMER_ORDER_COUNT |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(18,0) | COUNT(orders.d_orderkey) | NULL     | 2025-02-28 16:16:04.388 -0800 | 2025-02-28 16:16:04.388 -0800 | NULL    | NULL    |
|              385 | D_ORDERKEY             |               100 | ORDERS              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(38,0) | o_orderkey               | NULL     | 2025-02-28 16:16:04.388 -0800 | 2025-02-28 16:16:04.388 -0800 | NULL    | NULL    |
+------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------+----------+-------------------------------+-------------------------------+---------+---------+
```

---
title: SEMANTIC_METRICS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_metrics.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_METRICS view

This ACCOUNT_USAGE view displays a row for each metric defined in a [semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SEMANTIC_METRICS view (Information Schema)](../info-schema/semantic_metrics.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_metric_id` | NUMBER | ID of the metric in the semantic view. |
| `semantic_metric_name` | VARCHAR | Name of the metric in the semantic view. |
| `semantic_table_id` | NUMBER | ID of the logical table the metric belongs to. |
| `semantic_table_name` | VARCHAR | Name of the logical table the metric belongs to. |
| `semantic_view_id` | NUMBER | Internal, Snowflake-generated identifier for the semantic view in which the metric is defined. |
| `semantic_view_name` | VARCHAR | Name of the semantic view in which the metric is defined. |
| `semantic_view_schema_id` | NUMBER | Internal, Snowflake-generated identifier for the schema that the semantic view belongs to. |
| `semantic_view_schema_name` | VARCHAR | Schema that the semantic view belongs to. |
| `semantic_view_database_id` | NUMBER | Internal, Snowflake-generated identifier for the database that the semantic view belongs to. |
| `semantic_view_database_name` | VARCHAR | Database that the semantic view belongs to. |
| `data_type` | VARCHAR | Data type of the metric expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the metric. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the metric. |
| `comment` | VARCHAR | Description of the metric. |
| `created` | TIMESTAMP_LTZ | Creation time of the metric. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the metric was dropped. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the list of all metrics for the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_METRICS
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
i+--------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------------------+----------+-------------------------------+-------------------------------+---------+---------+
| SEMANTIC_METRIC_ID | SEMANTIC_METRIC_NAME   | SEMANTIC_TABLE_ID | SEMANTIC_TABLE_NAME | SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | DATA_TYPE    | EXPRESSION                           | SYNONYMS | CREATED                       | LAST_ALTERED                  | DELETED | COMMENT |
|--------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------------------+----------+-------------------------------+-------------------------------+---------+---------|
|                396 | M_CUSTOMER_ORDER_COUNT |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(30,0) | SUM(customer.a_customer_order_count) | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                395 | M_CUSTOMER_COUNT       |                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(18,0) | COUNT(c_custkey)                     | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                398 | M_SUPPLIER_COUNT       |               102 | SUPPLIER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(18,0) | COUNT(s_suppkey)                     | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
|                397 | M_ORDER_COUNT          |               100 | ORDERS              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | NUMBER(18,0) | COUNT(o_orderkey)                    | NULL     | 2025-02-28 16:16:04.389 -0800 | 2025-02-28 16:16:04.389 -0800 | NULL    | NULL    |
+--------------------+------------------------+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+--------------+--------------------------------------+----------+-------------------------------+-------------------------------+---------+---------+
```

---
title: SEMANTIC_RELATIONSHIPS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_relationships.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_RELATIONSHIPS view

This ACCOUNT_USAGE view displays a row for each relationship defined in a
[semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SEMANTIC_RELATIONSHIPS view (Information Schema)](../info-schema/semantic_relationships.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_relationship_id` | NUMBER | Internal, Snowflake-generated identifier for the relationship in the semantic view. |
| `semantic_relationship_name` | VARCHAR | Name of the relationship in the semantic view. |
| `semantic_view_id` | NUMBER | Internal, Snowflake-generated identifier for the semantic view in which the relationship is defined. |
| `semantic_view_name` | VARCHAR | Name of the semantic view in which the relationship is defined. |
| `semantic_view_schema_id` | NUMBER | Internal, Snowflake-generated identifier for the schema that the semantic view belongs to. |
| `semantic_view_schema_name` | VARCHAR | Schema that the semantic view belongs to. |
| `semantic_view_database_id` | NUMBER | Internal, Snowflake-generated identifier for the database that the semantic view belongs to. |
| `semantic_view_database_name` | VARCHAR | Database that the semantic view belongs to. |
| `semantic_table_id` | NUMBER | Internal, Snowflake-generated identifier for the logical table being referenced. |
| `semantic_table_name` | VARCHAR | Name of the logical table referencing the other table. |
| `ref_semantic_table_id` | NUMBER | Internal, Snowflake-generated identifier for the logical table referencing the other table. |
| `ref_semantic_table_name` | VARCHAR | Name of the logical table being referenced. |
| `foreign_keys` | ARRAY(VARCHAR) | List of the names of the columns referring to the columns of the other table. |
| `ref_keys` | ARRAY(VARCHAR) | One of the following values:   * For relationships that represent [range joins](../../user-guide/views-semantic/sql.md), an array that contains   JSON-formatted strings for objects with the following keys:    + The `start_column` key specifies the name of the column that represents the start of the range.   + The `end_column` key specifies the name of the column that represents the end of the range.   + The `type` key is `RANGE`. * For relationships that represent [ASOF joins](../../user-guide/views-semantic/sql.md), an array that contains the   following elements:    + The name of the column in the first table.   + A JSON object with the following fields:      - `column`: Name of the column in the second table.     - `type`: `ASOF`.  * For other types of relationships, an array containing the name of the column in the other logical table in the relationship. |
| `created` | TIMESTAMP_LTZ | Creation time of the relationship. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the relationship was dropped. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the list of all relationships for the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_RELATIONSHIPS
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
+--------------------------+-------------------------------------------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------------+---------------------+-----------------+-----------------------+-------------------------+-----------------+-------------------------------+-------------------------------+---------+
| SEMANTIC_RELATIONSHIP_ID | SEMANTIC_RELATIONSHIP_NAME                            | SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | SEMANTIC_TABLE_ID | SEMANTIC_TABLE_NAME | FOREIGN_KEYS    | REF_SEMANTIC_TABLE_ID | REF_SEMANTIC_TABLE_NAME | REF_KEYS        | CREATED                       | LAST_ALTERED                  | DELETED |
|--------------------------+-------------------------------------------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------------+---------------------+-----------------+-----------------------+-------------------------+-----------------+-------------------------------+-------------------------------+---------|
|                       99 | SYS_RELATIONSHIP_67ae9bb4-652a-4985-8dc5-c99fdf7f4276 |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       |               100 | ORDERS              | [               |                    99 | CUSTOMER                | [               | 2025-02-28 16:16:04.321 -0800 | 2025-02-28 16:16:04.321 -0800 | NULL    |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     |   "O_CUSTKEY"   |                       |                         |   "C_CUSTKEY"   |                               |                               |         |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     | ]               |                       |                         | ]               |                               |                               |         |
|                      100 | SYS_RELATIONSHIP_906b4d92-582a-4bef-b2c1-9a69e8f61af1 |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       |               101 | LINEITEM            | [               |                   100 | ORDERS                  | [               | 2025-02-28 16:16:04.363 -0800 | 2025-02-28 16:16:04.363 -0800 | NULL    |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     |   "L_ORDERKEY"  |                       |                         |   "O_ORDERKEY"  |                               |                               |         |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     | ]               |                       |                         | ]               |                               |                               |         |
|                      101 | SYS_RELATIONSHIP_fadc2c0f-db3a-48e4-b96a-53ea2767a2b0 |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       |               102 | SUPPLIER            | [               |                    98 | NATION                  | [               | 2025-02-28 16:16:04.376 -0800 | 2025-02-28 16:16:04.376 -0800 | NULL    |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     |   "S_NATIONKEY" |                       |                         |   "N_NATIONKEY" |                               |                               |         |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     | ]               |                       |                         | ]               |                               |                               |         |
|                       98 | SYS_RELATIONSHIP_8c9ad09e-0ba4-489f-aabb-0503ef80e11b |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       |                99 | CUSTOMER            | [               |                    98 | NATION                  | [               | 2025-02-28 16:16:04.309 -0800 | 2025-02-28 16:16:04.309 -0800 | NULL    |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     |   "C_NATIONKEY" |                       |                         |   "N_NATIONKEY" |                               |                               |         |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     | ]               |                       |                         | ]               |                               |                               |         |
|                       97 | SYS_RELATIONSHIP_8529b4a7-eaff-4c36-888f-d9e1ad2683de |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       |                98 | NATION              | [               |                    97 | REGION                  | [               | 2025-02-28 16:16:04.294 -0800 | 2025-02-28 16:16:04.294 -0800 | NULL    |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     |   "N_REGIONKEY" |                       |                         |   "R_REGIONKEY" |                               |                               |         |
|                          |                                                       |                  |                      |                         |                           |                           |                             |                   |                     | ]               |                       |                         | ]               |                               |                               |         |
+--------------------------+-------------------------------------------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-------------------+---------------------+-----------------+-----------------------+-------------------------+-----------------+-------------------------------+-------------------------------+---------+
```

---
title: SEMANTIC_TABLES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_tables.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_TABLES view

This ACCOUNT_USAGE view displays a row for each logical table defined in a
[semantic view](../../user-guide/views-semantic/overview.md).

See also:
:   [SEMANTIC_TABLES view (Information Schema)](../info-schema/semantic_tables.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_table_id` | NUMBER | Internal, Snowflake-generated identifier for the table in the semantic view. |
| `semantic_table_name` | VARCHAR | Name of the table in the semantic view. |
| `semantic_view_id` | NUMBER | Internal, Snowflake-generated identifier for the semantic view in which the table is defined. |
| `semantic_view_name` | VARCHAR | Name of the semantic view in which the table is defined. |
| `semantic_view_schema_id` | NUMBER | Internal, Snowflake-generated identifier for the schema that the semantic view belongs to. |
| `semantic_view_schema_name` | VARCHAR | Schema that the semantic view belongs to. |
| `semantic_view_database_id` | NUMBER | Internal, Snowflake-generated identifier for the database that the semantic view belongs to. |
| `semantic_view_database_name` | VARCHAR | Database that the semantic view belongs to. |
| `base_table_name` | VARCHAR | Name of the base table. |
| `base_table_schema_name` | VARCHAR | Schema that the base table belongs to. |
| `base_table_database_name` | VARCHAR | Database that the base table belongs to. |
| `primary_keys` | ARRAY(VARCHAR) | List of the primary key columns of the table. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the table. |
| `comment` | VARCHAR | Comment for the table. |
| `distinct_ranges` | ARRAY(OBJECT) | Array of OBJECT values, which describe the [constraints for the logical table containing the range](../../user-guide/views-semantic/sql.md). Each object contains the following key-value pairs:   * `constraint_name`: The name of the constraint. * `end_column`: The name of the column that represents the end of the range. * `start_column`: The name of the column that represents the start of the range. |
| `created` | TIMESTAMP_LTZ | Creation time of the table. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `deleted` | TIMESTAMP_LTZ | Date and time when the table was dropped. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the list of all logical tables for the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_TABLES
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+------------------+----------+-------------------------------+-------------------------------+---------+---------+
| SEMANTIC_TABLE_ID | SEMANTIC_TABLE_NAME | SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | PRIMARY_KEYS     | SYNONYMS | CREATED                       | LAST_ALTERED                  | DELETED | COMMENT |
|-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+------------------+----------+-------------------------------+-------------------------------+---------+---------|
|               101 | LINEITEM            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.363 -0800 | 2025-02-28 16:16:04.363 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "L_ORDERKEY",  |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             |   "L_LINENUMBER" |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
|                99 | CUSTOMER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.309 -0800 | 2025-02-28 16:16:04.309 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "C_CUSTKEY"    |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
|               100 | ORDERS              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.321 -0800 | 2025-02-28 16:16:04.321 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "O_ORDERKEY"   |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
|               102 | SUPPLIER            |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.376 -0800 | 2025-02-28 16:16:04.376 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "S_SUPPKEY"    |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
|                98 | NATION              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.294 -0800 | 2025-02-28 16:16:04.294 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "N_NATIONKEY"  |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
|                97 | REGION              |               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | [                | NULL     | 2025-02-28 16:16:04.249 -0800 | 2025-02-28 16:16:04.249 -0800 | NULL    | NULL    |
|                   |                     |                  |                      |                         |                           |                           |                             |   "R_REGIONKEY"  |          |                               |                               |         |         |
|                   |                     |                  |                      |                         |                           |                           |                             | ]                |          |                               |                               |         |         |
+-------------------+---------------------+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+------------------+----------+-------------------------------+-------------------------------+---------+---------+
```

---
title: SEMANTIC_VIEWS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/semantic_views.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEMANTIC_VIEWS view

This ACCOUNT_USAGE view displays a row for each [semantic view](../../user-guide/views-semantic/overview.md) in the account.

See also:
:   [SEMANTIC_VIEWS view (Information Schema)](../info-schema/semantic_views.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_id` | NUMBER | Internal, Snowflake-generated identifier for the semantic view. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `semantic_view_schema_id` | NUMBER | Internal, Snowflake-generated identifier for the schema that the semantic view belongs to. |
| `semantic_view_schema_name` | VARCHAR | Schema that the semantic view belongs to. |
| `semantic_view_database_id` | NUMBER | Internal, Snowflake-generated identifier for the database that the semantic view belongs to. |
| `semantic_view_database_name` | VARCHAR | Database that the semantic view belongs to. |
| `owner` | VARCHAR | Name of the role that owns the semantic view. |
| `created` | TIMESTAMP_LTZ | Creation time of the semantic view. |
| `last_altered` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. |
| `deleted` | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| `comment` | VARCHAR | Comment for the semantic view. |

## Usage notes

* Latency for the view can be up to 120 minutes (2 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the semantic view `O_TPCH_SEMANTIC_VIEW` in the database `MY_DB`:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SEMANTIC_VIEWS
  WHERE semantic_view_name = 'O_TPCH_SEMANTIC_VIEW'
    AND semantic_view_database_name = 'MY_DB';
```

```output
+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-----------------+-------------------------------+-------------------------------+---------+---------+
| SEMANTIC_VIEW_ID | SEMANTIC_VIEW_NAME   | SEMANTIC_VIEW_SCHEMA_ID | SEMANTIC_VIEW_SCHEMA_NAME | SEMANTIC_VIEW_DATABASE_ID | SEMANTIC_VIEW_DATABASE_NAME | OWNER           | CREATED                       | LAST_ALTERED                  | DELETED | COMMENT |
|------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-----------------+-------------------------------+-------------------------------+---------+---------|
|               49 | O_TPCH_SEMANTIC_VIEW |                      92 | MY_SCHEMA                 |                         7 | MY_DB                       | DYOSHINAGA_ROLE | 2025-02-28 16:16:04.002 -0800 | 2025-02-28 16:16:04.589 -0800 | NULL    | NULL    |
+------------------+----------------------+-------------------------+---------------------------+---------------------------+-----------------------------+-----------------+-------------------------------+-------------------------------+---------+---------+
```

---
title: SEQUENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/sequences.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SEQUENCES view

This Account Usage view displays a row for each sequence defined in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SEQUENCE_ID | NUMBER | Internal/system-generated identifier for the sequence. |
| SEQUENCE_NAME | VARCHAR | Name of the sequence. |
| SEQUENCE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the sequence. |
| SEQUENCE_SCHEMA | VARCHAR | Schema that the sequence belongs to. |
| SEQUENCE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the sequence. |
| SEQUENCE_CATALOG | VARCHAR | Database that the sequence belongs to. |
| SEQUENCE_OWNER | VARCHAR | Name of the role that owns the sequence. |
| DATA_TYPE | VARCHAR | Data type of the sequence. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of the data type of the sequence. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision of the data type of the sequence. |
| NUMERIC_SCALE | NUMBER | Scale of the data type of the sequence. |
| START_VALUE | VARCHAR | Initial value of the sequence. |
| MINIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| NEXT_VALUE | VARCHAR | Next value that the sequence will produce. |
| INCREMENT | VARCHAR | Increment of the sequence generator. |
| CYCLE_OPTION | VARCHAR | Not applicable for Snowflake. |
| ORDERED | VARCHAR | If `YES`, the sequence has the ORDER property. If `NO`, the sequence has the NOORDER property. |
| CREATED | TIMESTAMP_LTZ | Date and time when the sequence was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the sequence was dropped. |
| COMMENT | VARCHAR | Comment for the sequence. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SERVERLESS_ALERT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/serverless_alert_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SERVERLESS_ALERT_HISTORY view

This Account Usage view can be used to query the [serverless alert](../../user-guide/alerts.md) usage history.
The information returned by the view includes the serverless alert name and credits consumed by serverless alert usage.

See also:
:   [SERVERLESS_ALERT_HISTORY function](../functions/serverless_alert_history.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | VARCHAR | Number of credits billed for serverless alert usage during the START_TIME and END_TIME window. |
| ALERT_ID | NUMBER | Internal/system-generated identifier for the serverless alert. |
| ALERT_NAME | VARCHAR | Name of the serverless alert. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the serverless alert. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the serverless alert. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the serverless alert. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the serverless alert. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

## Example

The following SQL statement queries for the credits used by the two most recent executions of serverless alerts:

```sqlexample
SELECT
    start_time,
    end_time,
    alert_id,
    alert_name,
    credits_used,
    schema_id,
    schema_name,
    database_id,
    database_name,
  FROM SNOWFLAKE.ACCOUNT_USAGE.SERVERLESS_ALERT_HISTORY
  LIMIT 2;
```

```output
+---------------------------------+---------------------------------+----------+---------------------+--------------+-----------+-------------+-------------+---------------+
|           START_TIME            |            END_TIME             | ALERT_ID |     ALERT_NAME      | CREDITS_USED | SCHEMA_ID | SCHEMA_NAME | DATABASE_ID | DATABASE_NAME |
+---------------------------------+---------------------------------+----------+---------------------+--------------+-----------+-------------+-------------+---------------+
| Tue, 10 Sep 2024 17:57:00 -0700 | Tue, 10 Sep 2024 17:58:00 -0700 | 202      | MY_SERVERLESS_ALERT | 0.000869065  | 52        | SCTEST      | 30          | DBTEST        |
| Tue, 10 Sep 2024 18:57:00 -0700 | Tue, 10 Sep 2024 18:58:00 -0700 | 202      | MY_SERVERLESS_ALERT | 0.000841918  | 52        | SCTEST      | 30          | DBTEST        |
+---------------------------------+---------------------------------+----------+---------------------+--------------+-----------+-------------+-------------+---------------+
```

---
title: SERVERLESS_TASK_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/serverless_task_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SERVERLESS_TASK_HISTORY view

This Account Usage view can be used to query the [serverless task](../../user-guide/tasks-intro.md) usage history. The information
returned by the view includes the serverless task name and credits consumed by serverless task usage.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the specified time range. |
| END_TIME | TIMESTAMP_LTZ | End of the specified time range. |
| CREDITS_USED | VARCHAR | Number of credits billed for serverless task usage during the START_TIME and END_TIME window. |
| TASK_ID | NUMBER | Internal/system-generated identifier for the serverless task. |
| TASK_NAME | VARCHAR | Name of the serverless task. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the serverless task. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the serverless task. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the serverless task. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the serverless task. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: SERVICES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/services.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SERVICES view

This SERVICES view in the Account Usage schema is similar to SERVICES view in information schema except this view includes deleted Snowpark Container Services services. For more information about difference in these schemas, see
[Differences between Account Usage and Information Schema](../account-usage.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_ID | NUMBER | Internal/system-generated identifier for the service. |
| SERVICE_NAME | VARCHAR | Name of the service. |
| SERVICE_CATALOG_ID | NUMBER | Internal, Snowflake-generated identifier of the database for the service. |
| SERVICE_CATALOG | VARCHAR | Database that the service belongs to. |
| SERVICE_SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the service. |
| SERVICE_SCHEMA | VARCHAR | Schema that the service belongs to. |
| SERVICE_OWNER | VARCHAR | Name of the role that owns the service. App instance name if in an app. |
| SERVICE_OWNER_ROLE_TYPE | VARCHAR | Type of the owner role. |
| COMPUTE_POOL_ID | NUMBER | Identifier of the compute pool that runs the service. |
| COMPUTE_POOL_NAME | VARCHAR | Compute pool where the job was executed. |
| DNS_NAME | VARCHAR | DNS name associated with the service. |
| MIN_READY_INSTANCES | NUMBER | Minimum service instances that must be ready for Snowflake to consider the service is ready to process requests. |
| MIN_INSTANCES | NUMBER | Minimum instances for the service. |
| MAX_INSTANCES | NUMBER | Maximum instances for the service. |
| AUTO_RESUME | BOOLEAN | Flag that determines if the service can be auto resumed. |
| QUERY_WAREHOUSE | VARCHAR | Name of the default query warehouse of the service. |
| CREATED | TIMESTAMP_LTZ | Creation time of the service. |
| LAST_ALTERED | TIMESTAMP_LTZ | Last altered time of the service. |
| LAST_RESUMED | TIMESTAMP_LTZ | Last resumed time of the service. |
| DELETED | TIMESTAMP_LTZ | Deletion time of the service. |
| COMMENT | VARCHAR | Comment for this service. |
| IS_JOB | BOOLEAN | `true` if the service is a job service; `false` otherwise. |

## Example

```sqlexample
SELECT *
FROM snowflake.account_usage.services
WHERE service_name LIKE '%myservice%';
```

---
title: SESSION_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/session_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SESSION_POLICIES view

This Account Usage view provides the [session policies](../../user-guide/session-policies.md) in your account.

Each row in this view corresponds to a different session policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the session policy. |
| NAME | VARCHAR | Name of the session policy. |
| SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | VARCHAR | Schema to which the session policy belongs. |
| DATABASE_ID | VARCHAR | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | VARCHAR | Database to which the session policy belongs. |
| OWNER | VARCHAR | Name of the role that owns the session policy. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| SESSION_IDLE_TIMEOUT_MINS | NUMBER | Session idle timeout in minutes for the policy. |
| SESSION_UI_IDLE_TIMEOUT_MINS | NUMBER | UI session idle timeout in minutes for the policy. |
| COMMENT | VARCHAR | Comments entered for the session policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the session policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the session policy was dropped. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SESSIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/sessions.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SESSIONS view

This Account Usage view provides information on the session, including information on the authentication method to Snowflake and the
Snowflake login event. Snowflake returns one row for each session created over the last year.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SESSION_ID | Number | The unique identifier for the current session. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the session was created. |
| USER_NAME | String | The user name of the user. |
| AUTHENTICATION_METHOD | String | The authentication method used to access Snowflake. |
| LOGIN_EVENT_ID | Number | The unique identifier for the login event. |
| CLIENT_APPLICATION_VERSION | String | The version number (e.g. 3.8.7) of the Snowflake-provided client application used to create the remote session to Snowflake. |
| CLIENT_APPLICATION_ID | String | The identifier for the Snowflake-provided client application used to create the remote session to Snowflake (e.g. JDBC 3.8.7) |
| CLIENT_ENVIRONMENT | String | The environment variables (e.g. operating system, OCSP mode) of the client used to create a remote session to Snowflake. |
| CLIENT_BUILD_ID | String | The build number (e.g. 41897) of the third-party client application used to create a remote session to Snowflake, if available. For example, a third-party Java application that uses the JDBC driver to connect to Snowflake. |
| CLIENT_VERSION | String | The version number (e.g. 47154) of the third-party client application that uses a Snowflake-provided client to create a remote session to Snowflake, if available. . |
| ACCESS_TIME | TIMESTAMP_LTZ | Date and time when the session was last used. |
| IS_OPEN | BOOLEAN | Whether the session is currently open (TRUE) or closed (FALSE). |
| CLOSED_REASON | String | The reason why a Snowflake session closed. NULL for sessions that are currently open. One of the following for closed sessions: DROP_USER, LOGOUT, FORCED_LOGOUT, ABANDONED, OAUTH_CRITICAL_CHANGE_INTEGRATION, DROP_ACCOUNT, OAUTH_CONSENT_REVOKED, TASK_COMPLETED, SFC_FORCED_LOGOUT. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The view displays data starting from July 20-21, 2020.

* The SESSIONS view does not currently track SQL API transient sessions.
* This view does not record the activity of internal users the system defines to perform various operations
  (e.g. maintain Snowsight worksheets).

---
title: SHARES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/shares.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SHARES view

This Account Usage view returns all the shares owned by the current account, including dropped shares. The information in this view has a latency of up to 3 hours.

## Columns

The following table provides definitions for the SHARES view columns.

| Column | Data type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | The timestamp when the share was created. |
| MODIFIED_ON | TIMESTAMP_LTZ | The timestamp when the share was last updated. |
| DELETED_ON | TIMESTAMP_LTZ | The timestamp when the share was deleted. This value is NULL if the share hasn’t been deleted. |
| NAME | VARCHAR | The name of the share. |
| OWNER | VARCHAR | The name of the role that owns the share. |
| COMMENT | VARCHAR | Comment associated with the share, if any. |
| DATABASE_NAME | VARCHAR | The name of the primary database associated with the share. This field is empty if no database has been granted to the share. |
| SECURE_OBJECTS_ONLY | BOOLEAN | Indicates whether the share can only have secure objects granted to it. |
| TARGET_ACCOUNTS | VARCHAR | A comma-separated list of target accounts the share is shared with (outbound). This field is empty if the share has no target accounts. |
| LISTING_GLOBAL_NAME | VARCHAR | Global unique name of the listing associated with the share, if any. |

---
title: SNAPSHOT_OPERATION_HISTORY view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/account-usage/snapshot_operation_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNAPSHOT_OPERATION_HISTORY view — *Deprecated*

This Account Usage view provides information about the snapshot operations that were performed for
[snapshot sets](../../user-guide/backups.md).
Snowflake returns one row for each operation performed on snapshots within snapshot sets over the last year.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The timestamp at which the snapshot operation started. |
| END_TIME | TIMESTAMP_LTZ | The timestamp at which the snapshot operation ended. |
| SNAPSHOT_SET_ID | NUMBER | The local snapshot set ID. |
| SNAPSHOT_ID | VARCHAR | The unique identifier of snapshot being worked on. |
| OPERATION_TYPE | VARCHAR | Could be one of the below operations:   * CREATE * EXPIRE * RESTORE * ADD_LEGAL_HOLD * REMOVE_LEGAL_HOLD |
| Query_ID | VARCHAR | Internal system-generated identifier for the SQL statement. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* Snowflake retains the history data for 365 days (approximately one year).

---
title: SNAPSHOT_POLICIES view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/account-usage/snapshot_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNAPSHOT_POLICIES view — *Deprecated*

This Account Usage view provides information about [snapshot policies](../../user-guide/backups.md)
and their properties.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the snapshot policy. |
| NAME | VARCHAR | Name of the snapshot policy. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the snapshot policy. |
| SCHEMA_NAME | VARCHAR | Schema that the snapshot policy belongs to. |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the snapshot policy. |
| CATALOG_NAME | VARCHAR | Database that the snapshot policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for snapshot creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after snapshot creation when snapshot should be expired and automatically deleted. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if the policy has a retention lock; N otherwise.  Retention lock protects snapshots from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the snapshot policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the snapshot policy was deleted. |
| COMMENT | VARCHAR | Comment for the snapshot policy. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SNAPSHOT_SETS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/account-usage/snapshot_sets.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNAPSHOT_SETS view — *Deprecated*

This Account Usage view provides information about [snapshot sets](../../user-guide/backups.md) and their properties.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the snapshot set. |
| NAME | VARCHAR | Name of the snapshot set |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the snapshot set. |
| SCHEMA_NAME | VARCHAR | Schema that the snapshot set belongs to |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the snapshot set. |
| CATALOG_NAME | VARCHAR | Database that the snapshot set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the snapshot set is snapshotting. |
| OBJECT_ID | NUMBER | ID of object that the snapshot set is snapshotting. |
| OBJECT_NAME | VARCHAR | Name of object that the snapshot set is snapshotting. |
| OBJECT_SCHEMA_ID | NUMBER | ID of schema that contains the object being snapshotted by this snapshot set. |
| OBJECT_SCHEMA_NAME | VARCHAR | Name of schema that contains the object being snapshotted by this snapshot set. |
| OBJECT_CATALOG_ID | NUMBER | ID of database that contains the object being snapshotted by this snapshot set. |
| OBJECT_CATALOG_NAME | VARCHAR | Name of database that contains the object being snapshotted by this snapshot set. |
| SNAPSHOT_POLICY_ID | NUMBER | ID of snapshot policy attached to this snapshot set. |
| SNAPSHOT_POLICY_NAME | VARCHAR | Name of snapshot policy attached to this snapshot set. |
| SNAPSHOT_POLICY_SCHEMA_ID | NUMBER | ID of the schema that contains the snapshot policy. |
| SNAPSHOT_POLICY_SCHEMA_NAME | VARCHAR | Name of the schema that contains the snapshot policy. |
| SNAPSHOT_POLICY_CATALOG_ID | NUMBER | ID of the database that contains the snapshot policy. |
| SNAPSHOT_POLICY_CATALOG_NAME | VARCHAR | Name of the database that contains the snapshot policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot set. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the snapshot set was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the snapshot set was deleted. |
| COMMENT | VARCHAR | Comment for the snapshot set. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SNAPSHOT_STORAGE_USAGE view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/account-usage/snapshot_storage_usage.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNAPSHOT_STORAGE_USAGE view — *Deprecated*

This Account Usage view provides information about storage usage for [snapshots](../../user-guide/backups.md).

> **Note:**
>
> The same tables might be included in multiple table snapshots, schema snapshots, and database snapshots.
> Therefore, the numbers of bytes shown in this view don’t entirely answer questions about how much storage
> you can save by deleting a snapshot or a snapshot set. The same data files might be retained as part of
> a different snapshot set.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| SNAPSHOT_SET_ID | NUMBER | Internal system-generated identifier for the snapshot set. |
| SNAPSHOT_ID | VARCHAR | Internal system-generated identifier for the snapshot. |
| LOGICAL_BYTES | NUMBER | Number of bytes created when this snapshot is restored. |
| INCREMENTAL_BYTES_FROM_PREVIOUS_SNAPSHOT | NUMBER | Number of logical bytes of the micro-partitions that *are* in this snapshot, but *aren’t* in the previous snapshot within the same snapshot set.  For the oldest active snapshot in a snapshot set, this is 0. |
| DECREMENTAL_BYTES_FROM_PREVIOUS_SNAPSHOT | NUMBER | Number of logical bytes of the micro-partitions that *aren’t* in this snapshot, but *are* in the previous snapshot within the same snapshot set.  For the oldest active snapshot in a snapshot set, this is 0. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: SNAPSHOTS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/account-usage/snapshots.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNAPSHOTS view — *Deprecated*

This Account Usage view provides information on [snapshots](../../user-guide/backups.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the snapshot.  Note: this is not the local ID, this is the globally unique UUID of the Snapshot. |
| SNAPSHOT_SET_ID | NUMBER | ID of snapshot set that contains the snapshot. |
| SNAPSHOT_SET_NAME | VARCHAR | Name of snapshot set that contains the snapshot. |
| SNAPSHOT_SET_SCHEMA_ID | NUMBER | ID of schema that the snapshot set belongs to. |
| SNAPSHOT_SET_SCHEMA | VARCHAR | Name of schema that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG_ID | NUMBER | ID of database that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG | VARCHAR | Name of database that the snapshot set belongs to. |
| CREATED | TIMESTAMP_LTZ | Timestamp at which snapshot was created. |
| DELETED | TIMESTAMP_LTZ | Timestamp at which the snapshot was deleted.  This column isn’t displayed by the SHOW command, because the SHOW command output doesn’t include deleted objects. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which snapshot will be expired and deleted. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | True if snapshot is under legal hold; False otherwise. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/snowflake_intelligence_usage_history_view.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view

The SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view can be used to query the usage
history of Snowflake Intelligence.

> **Note:**
>
> This view does not include requests originating from Cortex Agents. Requests originating from Cortex Agents are recorded in the [CORTEX_AGENT_USAGE_HISTORY](cortex_agent_usage_history.md) view.

The information in the view includes the number of credits consumed each time a user interacts
with Snowflake Intelligence. A request results in one or more calls to underlying agents and any
tools, for example, Cortex Analyst and Cortex Search. Each row in the view represents a call to the agent and provides detail on
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, Snowflake Intelligence ID, and the agent ID.
For more information about Cortex billing, see [Cost considerations](../../user-guide/snowflake-cortex/cortex-agents.md).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start time when the Snowflake Intelligence message request was received. |
| END_TIME | TIMESTAMP_LTZ | End time when the Snowflake Intelligence message response was sent. |
| USER_ID | NUMBER | The unique identifier of the user who made the request. |
| USER_NAME | VARCHAR | The name of the user who made the request. |
| USER_TAGS | ARRAY | Tags associated with the user. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “ACCOUNT” or “USER”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| REQUEST_ID | VARCHAR | The unique identifier for the request. |
| PARENT_REQUEST_ID | VARCHAR | The identifier of the parent request, if applicable. |
| SNOWFLAKE_INTELLIGENCE_ID | NUMBER | The unique identifier of the Snowflake Intelligence instance. |
| SNOWFLAKE_INTELLIGENCE_NAME | VARCHAR | The name of the Snowflake Intelligence instance. |
| SNOWFLAKE_INTELLIGENCE_TAGS | ARRAY | Tags associated with the Snowflake Intelligence instance. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “ACCOUNT” or “SNOWFLAKE_INTELLIGENCE”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| AGENT_DATABASE_ID | NUMBER | The unique identifier of the agent database. |
| AGENT_DATABASE_NAME | VARCHAR | The name of the agent database. |
| AGENT_SCHEMA_ID | NUMBER | The unique identifier of the agent schema. |
| AGENT_SCHEMA_NAME | VARCHAR | The name of the agent schema. |
| AGENT_ID | NUMBER | The unique identifier of the agent. |
| AGENT_NAME | VARCHAR | The name of the agent. |
| AGENT_TAGS | ARRAY | Tags associated with the Agent. Each object in the array contains the following value pairs:   * `level`: The level at which the tag is applied (for example, “SCHEMA” or “CORTEX_AGENT”). * `tag_database`: The database where the tag is defined. * `tag_schema`: The schema where the tag is defined. * `tag_name`: The name of the tag. * `tag_value`: The value of the tag. |
| TOKEN_CREDITS | NUMBER | The number of token credits used for the request. Used for user-level budgeting. |
| TOKENS | NUMBER | Sum of the tokens used by the Snowflake Intelligent agent. |
| TOKENS_GRANULAR | ARRAY | Granular breakdown of token usage by request, service type (cortex_agents, cortex_analyst), and model. Includes input, cache_read_input, cache_write_input, and output token counts per model. The “unknown” model name is used when a model is not present in the pricing data. Each object in the array contains the following value pairs:   * `request_id`: The unique identifier for the request. * `service_type`: The service type, such as “cortex_agents” or “cortex_analyst”. * `model`: The model name used for the request. * `input`: Number of input tokens. * `cache_read_input`: Number of cache read input tokens. * `cache_write_input`: Number of cache write input tokens. * `output`: Number of output tokens. * `start_time`: The start time of the request. |
| CREDITS_GRANULAR | ARRAY | Granular breakdown of credit usage by request, service type (cortex_agents, cortex_analyst), and model. Includes input, cache_read_input, cache_write_input, and output credit values per model. The “unknown” model name is used when a model is not present in the pricing data. Each object in the array contains the following value pairs:   * `request_id`: The unique identifier for the request. * `service_type`: The service type, such as “cortex_agents” or “cortex_analyst”. * `model`: The model name used for the request. * `input`: Credit value for input tokens. * `cache_read_input`: Credit value for cache read input tokens. * `cache_write_input`: Credit value for cache write input tokens. * `output`: Credit value for output tokens. * `start_time`: The start time of the request. |
| METADATA | OBJECT | Additional metadata, including:   * `role_id`: ID of the primary role used for the request. * `role_name`: Name of the primary role used for the request. * `ai_functions_credits`: Credits consumed by [AI functions](../../user-guide/snowflake-cortex/aisql.md) invoked during the request. Contains NULL if no AI functions were used. |

## Examples

Retrieve Snowflake Intelligence usage history:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY;
```

```output
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+---------------------------+-----------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------+
| START_TIME                    | END_TIME                      | USER_ID | USER_NAME | USER_TAGS                                                                                                                                                                                                                                                | REQUEST_ID                           | PARENT_REQUEST_ID | SNOWFLAKE_INTELLIGENCE_ID | SNOWFLAKE_INTELLIGENCE_NAME | SNOWFLAKE_INTELLIGENCE_TAGS                                                                                                                                                                                                                                        | AGENT_DATABASE_ID | AGENT_DATABASE_NAME | AGENT_SCHEMA_ID | AGENT_SCHEMA_NAME | AGENT_ID | AGENT_NAME | AGENT_TAGS                                                                                                                                                                                                                                              | TOKEN_CREDITS | TOKENS | TOKENS_GRANULAR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                      | CREDITS_GRANULAR                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               | METADATA                                                                 |
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+---------------------------+-----------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------+
| 2026-02-06 10:11:51.642 +0000 | 2026-02-06 10:11:55.932 +0000 | 42563   | JKOWAL    | [{"level": "ACCOUNT", "tag_database": "SI", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "engineering"}, {"level": "USER", "tag_database": "FINANCE", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "engineering"}] | 9a5bbdce-7427-4166-b839-de8adc81e5cf | NULL              | 123456                    | finance_analytics_si        | [{"level": "ACCOUNT", "tag_database": "SI", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}, {"level": "SNOWFLAKE_INTELLIGENCE", "tag_database": "FINANCE", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}] | 234               | finance             | 4231            | analytics         | 9234     | agent1     | [{"level": "SCHEMA", "tag_database": "SI", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}, {"level": "CORTEX_AGENT", "tag_database": "FINANCE", "tag_schema": "AGENTS", "tag_name": "cost-center", "tag_value": "finance"}] | 20.000000000  | 1900   | [{"9a5bbdce-7427-4166-b839-de8adc81e5cf": {"cortex_agents": {"modelX": {"input": 100, "cache_read_input": 300, "cache_write_input": 400, "output": 200}}, "start_time": "2026-02-06 10:11:51.642 +0000"}}, {"28adaa9a-cbde-4293-bce2-157c807e0dd7": {"cortex_analyst": {"modelY": {"input": 100, "output": 200}, "modelZ": {"input": 100, "output": 200}}, "start_time": "2026-02-06 10:11:52.642 +0000"}}, {"1444950f-6f5b-493f-800c-f62bb307d21c": {"cortex_analyst": {"unknown": {"input": 100, "output": 200}}, "start_time": "2026-02-06 10:11:53.331 +0000"}}] | [{"9a5bbdce-7427-4166-b839-de8adc81e5cf": {"cortex_agents": {"modelX": {"input": 1, "cache_read_input": 2, "cache_write_input": 3, "output": 4}}, "start_time": "2026-02-06 10:11:51.642 +0000"}}, {"28adaa9a-cbde-4293-bce2-157c807e0dd7": {"cortex_agent": {"modelY": {"input": 1, "output": 4}, "modelZ": {"input": 1, "output": 4}}, "start_time": "2026-02-06 10:11:52.642 +0000"}}, {"1444950f-6f5b-493f-800c-f62bb307d21c": {"cortex_analyst": {"unknown": {"input": 0, "output": 0}}, "start_time": "2026-02-06 10:11:53.331 +0000"}}] | {"role_id": 12720, "role_name": "ENGINEER", "ai_functions_credits": 0.5} |
+-------------------------------+-------------------------------+---------+-----------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------+-------------------+---------------------------+-----------------------------+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+-------------------+---------------------+-----------------+-------------------+----------+------------+---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+---------------+--------+----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------------------------------------------------------+
```

---
title: SNOWPARK_CONTAINER_SERVICES_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/snowpark_container_services_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNOWPARK_CONTAINER_SERVICES_HISTORY view

The SNOWPARK_CONTAINER_SERVICES_HISTORY view in the ACCOUNT_USAGE schema can be used to return the hourly
[compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) credit usage for an account within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| COMPUTE_POOL_NAME | VARCHAR | Name of the compute pool which incurred the credit usage. |
| IS_EXCLUSIVE | BOOLEAN | TRUE, if the compute pool was created for an [application](../../developer-guide/native-apps/native-apps-about.md). |
| APPLICATION_NAME | VARCHAR | The name of the application for which the compute pool was created. NULL if the compute pool was not created for an application or if the application no longer exists. |
| APPLICATION_ID | VARCHAR | The ID of the application for which the compute pool was created; otherwise NULL. |
| CREDITS_USED | NUMBER | Number of credits the compute pool used in the hour. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* The view provides hourly [compute pool](../../developer-guide/snowpark-container-services/working-with-compute-pool.md) credit usage for an account within the last 365 days (1 year).
* The credit rate usage is determined based on the machine type (instance family) of the compute pool, as outlined in the consumption table.

---
title: SNOWPIPE_STREAMING_CHANNEL_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/snowpipe_streaming_channel_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNOWPIPE_STREAMING_CHANNEL_HISTORY view

This Account Usage view provides a historical record of pipeline errors, enabling users to monitor performance trends. This view displays key metrics such as processed data volume, error rates, and latency.

You can use this Account Usage view to query the error history for a specific pipe or channel.

> **Note:**
>
> The SNOWPIPE_STREAMING_CHANNEL_HISTORY view only applies to [Snowpipe Streaming with high-performance architecture](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACCOUNT_ID | NUMBER | The ID of the Snowflake account. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the rowset channel history was created. |
| CHANNEL_ID | NUMBER | The internal, system-generated ID of the Snowpipe Streaming channel. |
| CHANNEL_NAME | VARCHAR | The user-defined name of the Snowpipe Streaming channel. |
| PIPE_ID | NUMBER | The internal ID of the Snowpipe object associated with this Snowpipe Streaming channel. |
| END_OFFSET | VARCHAR | The last offset token processed and included in this specific channel history record. |
| TABLE_ID | NUMBER | The internal ID of the target table for this Snowpipe Streaming channel. |
| TABLE_NAME | VARCHAR | The name of the target table for this Snowpipe Streaming channel. |
| TABLE_SCHEMA_ID | NUMBER | The internal ID of the schema containing the target table. |
| TABLE_SCHEMA_NAME | VARCHAR | The name of the schema containing the target table. |
| TABLE_DATABASE_ID | NUMBER | The internal ID of the database containing the target table. |
| TABLE_DATABASE_NAME | VARCHAR | The name of the database containing the target table. |
| PIPE_NAME | VARCHAR | The name of the Snowpipe object associated with the current Snowpipe Streaming channel history entry.  Named pipes: The value is the user-defined name of the Snowpipe object associated with the channel.  Default pipe: The value is automatically derived from the target table name; for example, MY_TABLE-STREAMING. |
| PIPE_SCHEMA_ID | NUMBER | The internal identifier for the schema associated with the Snowpipe Streaming channel.  Named pipes: The value is the internal ID of the schema that contains the user-defined Snowpipe object.  Default pipe: The value is the internal ID of the schema that contains the target table. |
| PIPE_SCHEMA_NAME | VARCHAR | The name of the schema associated with the Snowpipe Streaming channel.  Named pipes: The value is the user-defined name of the schema that contains the Snowpipe object.  Default pipe: The value is the name of the schema that contains the target table. |
| PIPE_DATABASE_ID | NUMBER | The internal identifier for the database associated with the Snowpipe Streaming channel.  Named pipes: The value is the internal ID of the database that contains the user-defined Snowpipe object.  Default pipe: The value is the internal ID of the database that contains the target table. |
| PIPE_DATABASE_NAME | VARCHAR | The name of the database associated with the Snowpipe Streaming channel.  Named pipes: The value is the user-defined name of the database that contains the Snowpipe object.  Default pipe: The value is the name of the database that contains the target table. |
| LAST_ERROR_OFFSET_UPPER_BOUND | VARCHAR | The upper bound of the offset token range of the last rowset that encountered errors during this historical period. |
| LAST_ERROR_MESSAGE | VARCHAR | The last error message encountered while writing data to the channel. This column displays a redacted error message when an error is encountered. |
| SNOWFLAKE_PROCESSING_LATENCY_MS | NUMBER | The average latency, in milliseconds, observed by the Snowflake service in processing rowsets for this channel during this historical period. |
| ROWS_INSERTED | NUMBER | The total number of rows successfully inserted through this channel during this historical period. |
| ROWS_PARSED | NUMBER | The total number of rows parsed (processed) by the channel during this historical period. |
| ROW_ERROR_COUNT | NUMBER | The total number of rows that encountered errors and were not inserted through this channel during this historical period. |

## Usage notes

* The Snowpipe Streaming high-performance architecture only supports ON_ERROR=CONTINUE. Other ON_ERROR options are not supported.

---
title: SNOWPIPE_STREAMING_CLIENT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/snowpipe_streaming_client_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNOWPIPE_STREAMING_CLIENT_HISTORY view

This Account Usage view can be used to query the amount of time spent loading data into Snowflake tables using [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) within the last 365 days (1 year). The view displays the amount of data loaded and timestamp of the Snowpipe Streaming client calls for your entire Snowflake account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CLIENT_NAME | VARCHAR | Name of the Snowpipe Streaming ingest client. |
| SNOWFLAKE_PROVIDED_ID | VARCHAR | Internal/system-generated identifier for the Snowpipe Streaming ingest client used for the data load. |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Start of the time (in the local time zone) range in which data loading took place. |
| EVENT_TYPE | VARCHAR | Type of the event. |
| BLOB_SIZE_BYTES | NUMBER | The blob size in bytes. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

## Examples

Query the amount of time spent loading data into Snowflake tables using Snowpipe Streaming within the last 365 days.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SNOWPIPE_STREAMING_CLIENT_HISTORY;
```

The query returns the following results.

> ```sqlexample
> +----------------+----------------------------+------------------------------+--------------+----------------+
> |    CLIENT_NAME |    SNOWFLAKE_PROVIDED_ID   |              EVENT_TIMESTAMP |   EVENT_TYPE | BLOB_SIZE_BYTES|
> |----------------+--------------------------- +------------------------------+--------------|----------------|
> |      MY_CLIENT |FE0B1xJrBAAL3bAAUz1M9876nMCd| 2023-02-04 02:07:34.000 +0000| BLOB_PERSIST |           1,648|
> |      MY_CLIENT |D1CIBBPGGFyprBanMvAA1234V3ss| 2023-02-04 02:15:54.000 +0000| BLOB_PERSIST |           3,120|
> +----------------+----------------------------+------------------------------+--------------+----------------+
> ```

Query the hourly credits consumed by each client loading data into Snowflake tables using Snowpipe Streaming within the last 365 days.

```sqlexample
SELECT COUNT(DISTINCT event_timestamp) AS client_seconds, date_trunc('hour',event_timestamp) AS event_hour, client_seconds*0.000002777777778 as credits, client_name, snowflake_provided_id
FROM SNOWFLAKE.ACCOUNT_USAGE.SNOWPIPE_STREAMING_CLIENT_HISTORY
GROUP BY event_hour, client_name, snowflake_provided_id;
```

Note that there can be multiple events per second. The credits are consumed only by the actual time spent, and not by the number of events.

---
title: SNOWPIPE_STREAMING_FILE_MIGRATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/snowpipe_streaming_file_migration_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# SNOWPIPE_STREAMING_FILE_MIGRATION_HISTORY view

This Account Usage view can be used to query the history of data migrated into Snowflake tables using [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) within the last 365 days (1 year). The view displays the number of rows and bytes migrated and credits used for migration billed for your entire Snowflake account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time (in the local time zone) range in which data migration took place. |
| END_TIME | TIMESTAMP_LTZ | End of the time (in the local time zone) range in which data migration took place. |
| CREDITS_USED | FLOAT | Number of credits billed for Snowpipe Streaming data migration during the START_TIME and END_TIME window. |
| NUM_BYTES_MIGRATED | NUMBER | Number of bytes migrated during the START_TIME and END_TIME window. |
| NUM_ROWS_MIGRATED | NUMBER | Number of rows migrated during the START_TIME and END_TIME window. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the target table that the Snowpipe Streaming client loads data into. |
| TABLE_NAME | VARCHAR | The name of the target table that the Snowpipe Streaming client loads data into. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that the target table belongs to. |
| SCHEMA_NAME | VARCHAR | The name of the schema that the target table belongs to. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that the target table belongs to. |
| DATABASE_NAME | VARCHAR | The name of the database that the target table belongs to. |

## Usage notes

* Latency for the view may be up to 12 hours.

* Note that file migration sometimes may be pre-empted by clustering or other DML operations. Migration may not always occur and therefore the migration history will be empty even after 12 hours.
* The NUM_BYTES_MIGRATED and NUM_ROWS_MIGRATED columns only show the number of bytes and rows processed during the migration process. These numbers may not equal the actual numbers of rows and bytes inserted by Snowpipe Streaming to the table because some rows and bytes are processed outside of the migration process due to clustering or other DML operations.

  For example, Snowpipe Streaming inserts 1M rows and the table has 1M rows, but the NUM_ROWS_MIGRATED column in the migration history view only shows 800K rows. This is because the other 200K rows are processed outside of the migration process.

## Examples

Query the history of data migrated into Snowflake tables using Snowpipe Streaming within the last 365 days.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.SNOWPIPE_STREAMING_FILE_MIGRATION_HISTORY;
```

The query returns the following results.

> ```sqlexample
> +-------------------------------+-------------------------------+--------------+--------------------+------------------+----------+-----------------+------------+--------------+---------------+--------------+
> | START_TIME                    | END_TIME                      | CREDITS_USED | NUM_BYTES_MIGRATED | NUM_ROWS_MIGRATED| TABLE_ID |      TABLE_NAME | SCHEMA_ID  |  SCHEMA_NAME |   DATABASE_ID | DATABASE_NAME|
> |-------------------------------+-------------------------------+--------------+--------------------+------------------+----------+----------------------------------------------------------------------------|
> |2023-02-08 19:00:00.000 +0000  |2023-02-08 20:00:00.000 +0000  | 0.0000325    |                 0  |                0 |  16849926| STREAMING_TABLE |   101351   |   SNOW       |  3166         |STREAMING     |
> |2023-02-07 19:00:00.000 +0000  |2023-02-07 20:00:00.000 +0000  | 0.000096761  |             7,850  |               39 |  16849926| STREAMING_TABLE |   101351   |   SNOW       |  3166         |STREAMING     |
> +-------------------------------+-------------------------------+--------------+--------------------+------------------+----------+-----------------+------------+--------------+---------------|--------------+
> ```

---
title: STAGE_STORAGE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/stage_storage_usage_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# STAGE_STORAGE_USAGE_HISTORY view

This Account Usage view can be used to query the average daily data storage usage, in bytes, within the last 365 days (1 year) for all the Snowflake internal stages in the account, including:

* Named internal stages.
* Default staging areas (for tables and users).

> **Note:**
>
> This view returns stage storage usage within the last 365 days (1 year).

See also:
:   [DATABASE_STORAGE_USAGE_HISTORY view](database_storage_usage_history.md) , [STORAGE_USAGE view](storage_usage.md) , [WAREHOUSE_METERING_HISTORY view](warehouse_metering_history.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Date of this storage usage record. |
| AVERAGE_STAGE_BYTES | NUMBER | Number of bytes of stage storage used. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

---
title: STAGES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/stages.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# STAGES view

This Account Usage view displays a row for each stage defined in the account.

Stages are named objects that can be used for loading/unloading data. For more information, see [CREATE STAGE](../sql/create-stage.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| STAGE_ID | NUMBER | Internal/system-generated identifier for the stage. |
| STAGE_NAME | VARCHAR | Name of the stage. |
| STAGE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the stage. |
| STAGE_SCHEMA | VARCHAR | Schema that the stage belongs to. |
| STAGE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the stage. |
| STAGE_CATALOG | VARCHAR | Database that the stage belongs to. |
| STAGE_URL | VARCHAR | If the stage is external, location of the stage; NULL if it is internal. |
| STAGE_REGION | VARCHAR | If the stage is external, region where the stage resides; NULL if it is internal. |
| STAGE_TYPE | VARCHAR | Type of stage (`Internal Named`, or `External Named`). |
| STAGE_OWNER | VARCHAR | Name of the role that owns the stage; NULL if it has been dropped. |
| COMMENT | VARCHAR | Comment for the stage. |
| CREATED | TIMESTAMP_LTZ | Date and time when the stage was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the stage was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| STORAGE_INTEGRATION | VARCHAR | The name of the storage integration associated with the stage; NULL for internal stages or stages that do not use a storage integration. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: STORAGE_LIFECYCLE_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/storage_lifecycle_policies.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# STORAGE_LIFECYCLE_POLICIES view

This Account Usage view displays [storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md).
Each row in this view corresponds to a different storage lifecycle policy.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the storage lifecycle policy. |
| ID | NUMBER | Internal/system-generated identifier for the storage lifecycle policy. |
| SCHEMA_ID | TEXT | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | TEXT | Schema to which the storage lifecycle policy belongs. |
| DATABASE_ID | TEXT | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | TEXT | Database to which the storage lifecycle policy belongs. |
| OWNER | TEXT | Name of the role that owns the storage lifecycle policy. |
| SIGNATURE | TEXT | Type signature of the storage lifecycle policy’s arguments. |
| RETURN_TYPE | TEXT | Return value data type. |
| BODY | TEXT | Storage lifecycle policy definition. |
| COMMENT | TEXT | Comments entered for the storage lifecycle policy. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was created. |
| LAST_ALTERED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was last altered. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was dropped. |
| OPTIONS | OBJECT | Storage lifecycle policy options, including ARCHIVE_FOR_DAYS (number of days to keep data in current tier) and ARCHIVE_TIER (target storage tier). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

---
title: STORAGE_LIFECYCLE_POLICY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/storage_lifecycle_policy_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# STORAGE_LIFECYCLE_POLICY_HISTORY view

This Account Usage view provides the aggregated execution history of
[storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md) in your account.
This view shows historical data from the past 12 months and only includes policy executions
that have completed successfully or with failures. The view doesn’t include queued, currently executing, or cancelled
policy executions.

Each row in this view corresponds to a different storage lifecycle policy execution.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The name of the database that contains the storage lifecycle policy. |
| POLICY_SCHEMA | VARCHAR | The name of the schema that contains the storage lifecycle policy. |
| POLICY_NAME | VARCHAR | The name of the storage lifecycle policy. |
| REF_ENTITY_DB | VARCHAR | The name of the database that contains the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_SCHEMA | VARCHAR | The name of the schema that contains the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_NAME | VARCHAR | The name of the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_DOMAIN | VARCHAR | The domain (type) of the object that the storage lifecycle policy is attached to; for example, Table. |
| STATE | VARCHAR | The aggregated state of the storage lifecycle policy execution: SUCCEEDED or FAILED (completed executions only). |
| START_TIME | TIMESTAMP_LTZ | Earliest timestamp of when any task in the storage lifecycle policy execution started. |
| END_TIME | TIMESTAMP_LTZ | Latest timestamp of when any task in the storage lifecycle policy execution completed. |
| EXECUTION_RESULT | VARIANT | JSON object containing detailed results for each task type in the storage lifecycle policy execution. The object can be of type EXPIRE, ARCHIVE, or EXPIRE_ARCHIVE ARCHIVE. Each nested object contains: start_time, end_time, state, and error details. |
| POLICY_BODY | VARCHAR | The body of the storage lifecycle policy. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view contains historical data for the past 12 months (one year).
* The view only shows completed policy executions. It doesn’t include queued, currently executing, or cancelled policy executions.

---
title: STORAGE_REQUEST_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/storage_request_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# STORAGE_REQUEST_HISTORY view

This Account Usage view displays historical data for storage request usage within the last 365 days (1 year).
The view tracks HTTP requests made by external query engines through
[Snowflake Horizon Catalog](../../user-guide/snowflake-horizon.md) to access
[Iceberg tables that use Snowflake storage](../../user-guide/tables-iceberg-internal-storage.md).

See also:
:   [Snowflake storage for Apache Iceberg™ tables](../../user-guide/tables-iceberg-internal-storage.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the data aggregation window. |
| END_TIME | TIMESTAMP_LTZ | End of the data aggregation window. |
| OPERATION_TYPE | VARCHAR | The type of operation: `Class 1` (PUT, COPY, POST, PATCH, and LIST operations) or `Class 2` (GET and SELECT operations). |
| COUNT | NUMBER | Total number of API calls during the aggregation window. |

## Usage notes

* Latency for the view may be up to 6 hours.
* This view tracks requests that are billed under the `STORAGE_REQUEST-1` (Class 1) and
  `STORAGE_REQUEST-2` (Class 2) SKUs on the billing report.
* This view only tracks requests for Iceberg tables that use Snowflake storage. For Iceberg tables
  that use customer-owned external storage (buckets), this view doesn’t apply.
* Snowflake doesn’t bill your account when you use the Snowflake query engine to directly access
  Iceberg tables. Only requests made through Horizon Catalog by external query engines are tracked
  in this view.
* For billing rates, see Table 3(g) of the
  [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Examples

Retrieve the storage request counts for the past 7 days:

```sqlexample
SELECT
  START_TIME,
  END_TIME,
  OPERATION_TYPE,
  COUNT
FROM SNOWFLAKE.ACCOUNT_USAGE.STORAGE_REQUEST_HISTORY
WHERE START_TIME >= DATEADD(day, -7, CURRENT_TIMESTAMP())
ORDER BY START_TIME DESC;
```

Calculate total requests by operation type for the past month:

```sqlexample
SELECT
  OPERATION_TYPE,
  SUM(COUNT) AS TOTAL_REQUESTS
FROM SNOWFLAKE.ACCOUNT_USAGE.STORAGE_REQUEST_HISTORY
WHERE START_TIME >= DATEADD(month, -1, CURRENT_TIMESTAMP())
GROUP BY OPERATION_TYPE;
```

---
title: STORAGE_USAGE view
source: https://docs.snowflake.com/en/sql-reference/account-usage/storage_usage.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# STORAGE_USAGE view

This Account Usage view displays the average daily data storage usage, in bytes, within the last 365 days (1 year) across the entire account, including data in:

* Database tables.
* Files in all internal stages.

See also:
:   [STORAGE_DAILY_HISTORY view](../organization-usage/storage_daily_history.md) , [DATABASE_STORAGE_USAGE_HISTORY view](database_storage_usage_history.md) , [STAGE_STORAGE_USAGE_HISTORY view](stage_storage_usage_history.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Date of this storage usage record. The date is based on the local time zone. It is recommended that you change the query session to use the UTC time zone instead (for example, `ALTER SESSION SET TIMEZONE='UTC'`). |
| STORAGE_BYTES | NUMBER | Number of bytes of table storage used, including bytes currently in Time Travel. |
| STAGE_BYTES | NUMBER | Number of bytes of stage storage used by files in all internal stages (named, table, and user). |
| FAILSAFE_BYTES | NUMBER | Number of bytes of Fail-safe storage used. |
| HYBRID_TABLE_STORAGE_BYTES | NUMBER | Number of bytes of hybrid table storage used (data in the row store). |
| ARCHIVE_STORAGE_COOL_BYTES | NUMBER | Number of all bytes of table storage used in the COOL storage tier, including active bytes, Fail-safe bytes, Time Travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |
| ARCHIVE_STORAGE_COLD_BYTES | NUMBER | Number of all bytes of table storage used in the COLD storage tier, including active bytes, Fail-safe bytes, Time Travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |
| ARCHIVE_STORAGE_RETRIEVAL_TEMP_BYTES | NUMBER | Number of bytes used in the standard storage tier, during data retrieval from the COLD storage tier. |

## Usage notes

* In the ACCOUNT_USAGE schema, latency for the view is up to 120 minutes (2 hours).
* In the READER_ACCOUNT_USAGE schema, latency is up to 24 hours.
* This view uses a different measurement approach than the one used for billing, so the values here won’t match your invoice exactly. For the storage view that most closely reflects billed storage at the account and organization level, see [STORAGE_DAILY_HISTORY view](../organization-usage/storage_daily_history.md).

---
title: TABLE_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/table_constraints.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLE_CONSTRAINTS view

This Account Usage view displays a row for each table constraint that is defined for the tables in the account.

This view returns information about the following constraint types:

* PRIMARY KEY
* FOREIGN KEY
* UNIQUE

For general information about constraints, see [Constraints](../constraints.md).

See also:
:   [REFERENTIAL_CONSTRAINTS view](referential_constraints.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_ID | NUMBER | Internal/system-generated identifier for the constraint. |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint. |
| CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to. |
| CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the constraint. |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that the constraint belongs to. |
| TABLE_NAME | VARCHAR | Name of the current table. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the current table. |
| TABLE_SCHEMA | VARCHAR | Name of the schema for the current table. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the current table. |
| TABLE_CATALOG | VARCHAR | Name of the database for the current table. |
| CONSTRAINT_TYPE | VARCHAR | Type of the constraint (`PRIMARY KEY`, `UNIQUE KEY`, or `FOREIGN KEY`). |
| IS_DEFERRABLE | VARCHAR | Whether evaluation of the constraint can be deferred; by default, always `N`. |
| INITIALLY_DEFERRED | VARCHAR | Whether evaluation of the constraint is deferrable and initially deferred; by default, always `Y`. |
| ENFORCED | VARCHAR | Whether the constraint is enforced; by default, always `N`. |
| COMMENT | VARCHAR | Comment for the constraint. |
| CREATED | TIMESTAMP_LTZ | Date and time when the constraint was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the constraint was dropped. |
| RELY | VARCHAR | Whether a constraint in NOVALIDATE mode is taken into account during query rewrite. For details, see [Constraint properties](../sql/create-table-constraint.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: TABLE_DML_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/table_dml_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLE_DML_HISTORY view

This Account Usage view can be used to determine the magnitude and effects of the DML operations performed on a table. Note that
these DML operations include ones initiated by [Snowpipe](../../user-guide/data-load-snowpipe-intro.md) but exclude operations initiated
by background maintenance services
(for example, [Automatic Clustering](../../user-guide/tables-auto-reclustering.md), maintenance for materialized views and
[search optimization](../../user-guide/search-optimization-service.md)).

You can query this view with the [QUERY_HISTORY view](query_history.md) and the
[LOAD_HISTORY view](load_history.md) to identify the DML operations that have a significant impact. This can
help you to identify opportunities for optimization.

In addition, you can query this view with the [AUTOMATIC_CLUSTERING_HISTORY view](automatic_clustering_history.md) and the
[SEARCH_OPTIMIZATION_HISTORY view](search_optimization_history.md) to visualize the relationship between these DML operations and the
credits charged for Automatic Clustering and the search optimization service. (These services can be triggered by DML operations.)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the DML operations were performed. |
| END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the DML operations were performed. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table modified by the DML operations. |
| TABLE_NAME | VARCHAR | Name of the table modified by the DML operations. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table modified by the DML operations. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table modified by the DML operations. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table modified by the DML operations. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table modified by the DML operations. |
| ROWS_ADDED | NUMBER | Number of rows added by DML operations performed by users on the table during the START_TIME and END_TIME window. |
| ROWS_REMOVED | NUMBER | Number of rows removed by DML operations performed by users on the table during the START_TIME and END_TIME window. |
| ROWS_UPDATED | NUMBER | Number of rows updated by DML operations performed by users on the table during the START_TIME and END_TIME window. |

## Usage notes

* Latency for the view may be up to 6 hours.
* This view does not include DML operations on [hybrid tables](../../user-guide/tables-hybrid.md).

## Examples

The following example returns the top five tables that had the most rows added, removed, and updated by DML operations within the
last seven days.

```sqlexample
SELECT
    table_id,
    ANY_VALUE(table_name) AS table_name,
    SUM(rows_added) AS total_rows_added,
    SUM(rows_removed) AS total_rows_removed,
    SUM(rows_updated) AS total_rows_updated
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLE_DML_HISTORY
  WHERE start_time >= DATEADD(day, -7, CURRENT_TIMESTAMP())
  GROUP BY table_id
  ORDER BY total_rows_added + total_rows_removed + total_rows_updated DESC
  LIMIT 5;
```

```output
+----------+----------------------+------------------+--------------------+--------------------+
| TABLE_ID | TABLE_NAME           | TOTAL_ROWS_ADDED | TOTAL_ROWS_REMOVED | TOTAL_ROWS_UPDATED |
|----------+----------------------+------------------+--------------------+--------------------|
|   338948 | SENSOR_DATA_TS       |          5356800 |             259200 |                  0 |
|   338950 | SENSOR_DATA_DEVICE2  |          2678400 |                  0 |                  0 |
|   341006 | SENSOR_DATA_30_ROWS  |               30 |                  0 |                  0 |
|   341004 | SENSOR_DATA_12_HOURS |               12 |                  0 |                  0 |
|   340005 | SENSOR_DATA_12_HOURS |               12 |                  0 |                  0 |
+----------+----------------------+------------------+--------------------+--------------------+
```

---
title: TABLE_PRUNING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/table_pruning_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLE_PRUNING_HISTORY view

This Account Usage view can be used to determine the efficiency of pruning for all tables,
and to understand how a table’s default (natural) ordering of data affects pruning.

You can compare the number of partitions pruned (`PARTITIONS_PRUNED`) to the
total number of partitions scanned and pruned (`PARTITIONS_SCANNED + PARTITIONS_PRUNED`).

Each row in this view represents the pruning history for a specific table within a given time interval.
The data is aggregated by time interval and includes information about the number of scans, partitions
scanned, partitions pruned, rows scanned, and rows pruned.

You can also use this view to compare the effects on pruning before and after enabling
[Automatic Clustering](../../user-guide/tables-auto-reclustering.md) and
[search optimization](../../user-guide/search-optimization-service.md) for a table.

See also [TABLE_QUERY_PRUNING_HISTORY view](table_query_pruning_history.md) and
[COLUMN_QUERY_PRUNING_HISTORY view](column_query_pruning_history.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the queries were executed and completed. |
| END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the queries were executed and completed. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that was queried. |
| TABLE_NAME | VARCHAR | Name of the table that was queried. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table that was queried. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table that was queried. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table that was queried. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table that was queried. |
| NUM_SCANS | NUMBER | Number of scan operations from all queries (including SELECT statements and DML statements) on the table during the START_TIME and END_TIME window. Note that a given query might result in multiple scan operations on the same table. |
| PARTITIONS_SCANNED | NUMBER | Number of partitions scanned during the scan operations described in `NUM_SCANS`. |
| PARTITIONS_PRUNED | NUMBER | Number of partitions pruned for the queries described in `NUM_SCANS`. These partitions were eliminated during query processing, improving the efficiency of the query. |
| ROWS_SCANNED | NUMBER | Number of rows scanned during the scan operations described in `NUM_SCANS`. |
| ROWS_PRUNED | NUMBER | Number of rows pruned for the queries described in `NUM_SCANS`. These rows were eliminated during query processing, improving the efficiency of the query. |

## Usage notes

* Latency for the view may be up to 6 hours.
* This view does not include pruning information for [hybrid tables](../../user-guide/tables-hybrid.md).
* This view retains data for the 1,000 longest-running table scans per query. Only extremely complex queries
  exceed this number of scans so data is rarely omitted.

## Examples

List the top five tables that had the worst pruning efficiency within the last seven days:

```sqlexample
SELECT
    table_id,
    ANY_VALUE(table_name) AS table_name,
    SUM(num_scans) AS total_num_scans,
    SUM(partitions_scanned) AS total_partitions_scanned,
    SUM(partitions_pruned) AS total_partitions_pruned,
    SUM(rows_scanned) AS total_rows_scanned,
    SUM(rows_pruned) AS total_rows_pruned
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLE_PRUNING_HISTORY
  WHERE start_time >= DATEADD(day, -7, CURRENT_TIMESTAMP())
  GROUP BY table_id
  ORDER BY
    total_partitions_pruned / GREATEST(total_partitions_scanned + total_partitions_pruned, 1),
    total_partitions_scanned DESC
  LIMIT 5;
```

```output
+----------+----------------+-----------------+--------------------------+-------------------------+--------------------+-------------------+
| TABLE_ID | TABLE_NAME     | TOTAL_NUM_SCANS | TOTAL_PARTITIONS_SCANNED | TOTAL_PARTITIONS_PRUNED | TOTAL_ROWS_SCANNED | TOTAL_ROWS_PRUNED |
|----------+----------------+-----------------+--------------------------+-------------------------+--------------------+-------------------|
|   308226 | SENSOR_DATA_TS |              11 |                       21 |                       1 |           52500000 |           2500000 |
|   185364 | MATCH          |              16 |                       14 |                       2 |             240968 |             34424 |
|   209932 | ORDER_HEADER   |               2 |                      300 |                      56 |          421051748 |          75350790 |
|   209922 | K7_T1          |             261 |                      261 |                      52 |              30421 |              3272 |
|   338948 | SENSOR_DATA_TS |               9 |                       15 |                       3 |           38880000 |           8035200 |
+----------+----------------+-----------------+--------------------------+-------------------------+--------------------+-------------------+
```

The example above uses [GREATEST](../functions/greatest.md) to avoid dividing by zero when the sum of the number
of partitions scanned and the number of partitions pruned is zero.

---
title: TABLE_QUERY_PRUNING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/table_query_pruning_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLE_QUERY_PRUNING_HISTORY view

Use this Account Usage view to gain a better understanding of data access patterns during
query execution.

You can use this view in combination with the [COLUMN_QUERY_PRUNING_HISTORY view](column_query_pruning_history.md). For example,
you can identify access to target tables by using the TABLE_QUERY_PRUNING_HISTORY view, then
identify frequently used columns on those tables by using the COLUMN_QUERY_PRUNING_HISTORY view.

In particular, these views can help you make a more educated choice for
[clustering keys](../../user-guide/tables-clustering-keys.md).

Each row in this view represents the query pruning history for a specific table within a given time interval. The data is
aggregated by time interval and includes information about the number of queries executed, partitions scanned, partitions pruned,
rows scanned, rows pruned, and rows matched.

See also [TABLE_PRUNING_HISTORY view](table_pruning_history.md) and [Query Pruning](../../user-guide/tables-clustering-micropartitions.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| INTERVAL_START_TIME | TIMESTAMP_LTZ | Start of the time range (on the hour mark) during which the queries were executed. |
| INTERVAL_END_TIME | TIMESTAMP_LTZ | End of the time range (on the hour mark) during which the queries were executed. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that was queried. |
| TABLE_NAME | VARCHAR | Name of the table that was queried. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table that was queried. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table that was queried. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table that was queried. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table that was queried. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that was used to run the queries. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that ran the queries. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| NUM_QUERIES | NUMBER | Number of queries executed in this time range with this specific QUERY_HASH value, using this warehouse, accessing this table. |
| AGGREGATE_QUERY_ELAPSED_TIME | NUMBER | Total elapsed time (in milliseconds) for queries defined by NUM_QUERIES. This total includes queueing and other time not associated with compilation and execution. |
| AGGREGATE_QUERY_COMPILATION_TIME | NUMBER | Total compilation time (in milliseconds) for queries defined by NUM_QUERIES. |
| AGGREGATE_QUERY_EXECUTION_TIME | NUMBER | Total execution time (in milliseconds) for queries defined by NUM_QUERIES. |
| PARTITIONS_SCANNED | NUMBER | Number of partitions scanned on this table for queries defined by NUM_QUERIES. |
| PARTITIONS_PRUNED | NUMBER | Number of partitions pruned on this table for queries defined by NUM_QUERIES. These partitions were eliminated during query processing and not scanned, improving the efficiency of the query. |
| ROWS_SCANNED | NUMBER | Number of rows scanned on this table for queries defined by NUM_QUERIES. |
| ROWS_PRUNED | NUMBER | Number of rows pruned on this table for queries defined by NUM_QUERIES. These rows were eliminated during query processing and not scanned, improving the efficiency of the query. |
| ROWS_MATCHED | NUMBER | Number of rows that matched the WHERE clause filters while scanning this table for the queries defined by NUM_QUERIES. |

## Usage notes

* Latency for the view may be up to 4 hours.
* Data is retained for 1 year.
* This view does not include pruning information for [hybrid tables](../../user-guide/tables-hybrid.md).
* For complex filtering conditions that can’t benefit from a pushdown optimization, rows might not be filtered out during the table scan operation, even if they do not match the filtering condition. Therefore, these rows are counted in the ROWS_MATCHED value.
* Users and roles that have been granted the USAGE_VIEWER database role can access this view. For more information, see
  [SNOWFLAKE database roles](../snowflake-db-roles.md).
* This view retains data for the 1,000 longest-running table scans per query. Only extremely complex queries
  exceed this number of scans so data is rarely omitted.

## Examples

The first query is a simple functional example that returns the pruning history for queries against a specific table
on a specific date where at least one row was pruned. Each row in the result belongs to a specific one-hour time window
for queries that were completed on the date specified in the WHERE clause (INTERVAL_START_TIME).

The `sensor_data_ts` table in this query contains 5356800 rows of synthetic time-series data. Exactly half of the rows in the table (2678400) were pruned for all of the queries shown here. The number of matched rows varies for these queries.

```sqlexample
SELECT interval_start_time, interval_end_time, table_id, table_name,
    num_queries, query_hash, rows_scanned, rows_pruned, rows_matched
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLE_QUERY_PRUNING_HISTORY
  WHERE interval_start_time LIKE '2025-04-24%'
    AND table_name='SENSOR_DATA_TS'
    AND rows_pruned > 0
  ORDER BY 1;
```

```output
+-------------------------------+-------------------------------+----------+----------------+-------------+----------------------------------+--------------+-------------+--------------+
| INTERVAL_START_TIME           | INTERVAL_END_TIME             | TABLE_ID | TABLE_NAME     | NUM_QUERIES | QUERY_HASH                       | ROWS_SCANNED | ROWS_PRUNED | ROWS_MATCHED |
|-------------------------------+-------------------------------+----------+----------------+-------------+----------------------------------+--------------+-------------+--------------|
| 2025-04-24 14:00:00.000 -0700 | 2025-04-24 15:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 833f4ec4ebbda62c7882e1839faec799 |      2678400 |     2678400 |            5 |
| 2025-04-24 14:00:00.000 -0700 | 2025-04-24 15:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 94d16d2fa0892247d27066e45b58d3e4 |      2678400 |     2678400 |            5 |
| 2025-04-24 15:00:00.000 -0700 | 2025-04-24 16:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 24e89f5c01209d7b395f56559f893dc8 |      2678400 |     2678400 |      2678400 |
| 2025-04-24 15:00:00.000 -0700 | 2025-04-24 16:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 71c9c6570ef849e66f83af0625b793a2 |      2678400 |     2678400 |      2678400 |
| 2025-04-24 15:00:00.000 -0700 | 2025-04-24 16:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | c75cb64d446c1ba222ac14ebd1923641 |      2678400 |     2678400 |      2678400 |
| 2025-04-24 15:00:00.000 -0700 | 2025-04-24 16:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 5a3784c59fc788804c903d96698dd969 |      2678400 |     2678400 |            5 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 069a076d4d6850e3d242fccf498c7c6d |      2678400 |     2678400 |       216642 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 4c9c5aacb7a61fc6858d107c5c46fb14 |      2678400 |     2678400 |       216642 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 3e509721380b262906c62c76107e46c9 |      2678400 |     2678400 |      2678400 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 9f7e607fe48faa18e332f65cde49f037 |      2678400 |     2678400 |      2678400 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | b4488d8a84ab18b00dd6b2fead4a4cb4 |      2678400 |     2678400 |       394106 |
| 2025-04-24 17:00:00.000 -0700 | 2025-04-24 18:00:00.000 -0700 |   652324 | SENSOR_DATA_TS |           1 | 157d775a79c5bae120fb5db9f7d8d027 |      2678400 |     2678400 |       216642 |
+-------------------------------+-------------------------------+----------+----------------+-------------+----------------------------------+--------------+-------------+--------------+
```

The following example calculates a “pruning ratio” for each table to help determine the pruning efficiency for queries
run on a given warehouse at a given time. The query also returns the number of partitions scanned per query, which
helps you to understand query performance with respect to the volume of data that has to be scanned.

Given the results of this query, users might conclude that while `sensor_data_ts` is accessed much more than `sensor_data1`,
these queries typically take less time and scan far fewer micro-partitions.

```sqlexample
SELECT
    SUM(aggregate_query_execution_time) as sum_exec_time,
    SUM(num_queries) as sum_num_queries,
    SUM(partitions_pruned)/SUM(partitions_pruned+partitions_scanned) AS pruning_ratio,
    SUM(partitions_scanned)/SUM(num_queries) AS partitions_scanned_per_query,
    table_name,
    schema_name,
    database_name
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLE_QUERY_PRUNING_HISTORY
  WHERE interval_start_time > '2025-04-25 12:00:00.000 -0700'
    AND warehouse_name = 'SENSORS_WH'
  GROUP BY ALL
  ORDER BY 1 DESC;
```

```output
+---------------+-----------------+---------------+------------------------------+----------------+----------------+---------------+
| SUM_EXEC_TIME | SUM_NUM_QUERIES | PRUNING_RATIO | PARTITIONS_SCANNED_PER_QUERY | TABLE_NAME     | SCHEMA_NAME    | DATABASE_NAME |
|---------------+-----------------+---------------+------------------------------+----------------+----------------+---------------|
|       1938743 |           19283 |      0.230000 |                  1800.000000 | SENSOR_DATA1   | SENSORS_SCHEMA | SENSORS_DB    |
|        123732 |           39320 |      0.950000 |                    12.000000 | SENSOR_DATA_TS | SENSORS_SCHEMA | SENSORS_DB    |
+---------------+-----------------+---------------+------------------------------+----------------+----------------+---------------+
```

---
title: TABLE_STORAGE_METRICS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/table_storage_metrics.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLE_STORAGE_METRICS view

This Account Usage view displays table-level storage utilization information, which is used to calculate the storage billing for each table in the account, including tables that have been dropped, but are still incurring storage costs.

In addition to table metadata, the view displays the number of storage bytes billed for each table. Snowflake breaks down the bytes into the following categories:

* Active bytes, representing data in the table that can be queried.
* Deleted bytes that are still accruing storage charges because they have not been purged yet from the system. These bytes are classified into the following sub-categories:

  > + Bytes in Time Travel (recently deleted, but still within the Time Travel retention period for the table).
  > + Bytes in Fail-safe (deleted bytes that are past the Time Travel retention period, but within the Fail-safe period for the table).
  > + Bytes retained for clones (deleted bytes that are no longer in Time Travel or Fail-safe, but are still retained because clones of the table reference the bytes).

In other words, rows are maintained in this view until the corresponding tables are no longer billed for any storage, regardless of various states that the data in the tables may be in (active, Time Travel, Fail-safe, or retained for clones).

For more details about data storage in tables, see [Data storage considerations](../../user-guide/tables-storage-considerations.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the table. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table. |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| CLONE_GROUP_ID | NUMBER | Unique identifier for the oldest clone ancestor of this table. Same as ID if the table is not a clone. |
| IS_TRANSIENT | VARCHAR | ‘YES’ if table is transient or temporary, otherwise ‘NO’. Transient and temporary tables have no Fail-safe period. |
| ACTIVE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the active state for the table. For Iceberg table storage, active bytes aren’t billed to *Iceberg* tables. For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md). |
| TIME_TRAVEL_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Time Travel state for the table. |
| FAILSAFE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Fail-safe state for the table. |
| RETAINED_FOR_CLONE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are retained after deletion because they are referenced by one or more clones of this table, or by [WORM backups](../../user-guide/backups.md) that contain the table. |
| DELETED | BOOLEAN | TRUE if table has been dropped or recreated. |
| TABLE_CREATED | TIMESTAMP_LTZ | Date and time when the table was created. |
| TABLE_DROPPED | TIMESTAMP_LTZ | Date and time when the table was dropped. NULL if table has not been dropped. |
| TABLE_ENTERED_FAILSAFE | TIMESTAMP_LTZ | Date and time when the table, if dropped, entered the Fail-safe state, or NULL. In this state, the table cannot be restored using UNDROP. For transient tables, which aren’t recoverable using Fail-safe, this column indicates when the time travel retention period has passed. |
| SCHEMA_CREATED | TIMESTAMP_LTZ | Date and time when the schema for the table was created. |
| SCHEMA_DROPPED | TIMESTAMP_LTZ | Date and time when the schema for the table was dropped. |
| CATALOG_CREATED | TIMESTAMP_LTZ | Date and time when the database for the table was created. |
| CATALOG_DROPPED | TIMESTAMP_LTZ | Date and time when the database for the table was dropped. |
| COMMENT | VARCHAR | Comment for the table. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| ARCHIVE_STORAGE_COOL_ACTIVE_BYTES | NUMBER | The number of bytes in the cool storage tier owned by (and billed to) this table that are in the active state for the table. |
| ARCHIVE_STORAGE_COLD_ACTIVE_BYTES | NUMBER | The number of bytes in the cold storage tier owned by (and billed to) this table that are in the active state for the table. |
| ARCHIVE_STORAGE_COOL_TIME_TRAVEL_BYTES | NUMBER | The number of bytes in the cool storage tier owned by (and billed to) this table that are in the Time Travel state for the table. |
| ARCHIVE_STORAGE_COLD_TIME_TRAVEL_BYTES | NUMBER | The number of bytes in the cold storage tier owned by (and billed to) this table that are in the Time Travel state for the table. |
| ARCHIVE_STORAGE_COOL_FAILSAFE_BYTES | NUMBER | The number of bytes owned by (and billed to) this table in the COOL storage tier that are in the Fail-safe state for the table. |
| ARCHIVE_STORAGE_COLD_FAILSAFE_BYTES | NUMBER | The number of bytes owned by (and billed to) this table in the COLD storage tier that are in the Fail-safe state for the table. |
| ARCHIVE_STORAGE_COOL_EARLY_DELETION_PENALTY_BYTES | NUMBER | The number of penalty bytes deleted early and billed for) that are in the COOL storage tier. For more information, see [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |
| ARCHIVE_STORAGE_COLD_EARLY_DELETION_PENALTY_BYTES | NUMBER | The number of penalty bytes deleted early and billed for) that are in the COLD storage tier. For more information, see [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |

## Usage notes

* Latency for the view may be up to 90 minutes.
* Storage metrics for hybrid tables are not tracked in this view. For information about storage consumption for hybrid tables,
  see [Evaluate cost for hybrid tables](../../user-guide/tables-hybrid-cost.md).
* > **Note:**
  >
  > With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
  > this view includes new columns for storage lifecycle policies.
  > To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
  > ```

* `ID` and `CLONE_GROUP_ID`:

  > + `ID` does not change for a table throughout its lifecycle, including if the table is renamed or dropped.
  > + `CLONE_GROUP_ID` is the ID of the oldest ancestor of a clone, including if the table has been dropped, but is still accruing storage costs. For example:
  >
  >   > 1. Table `t2` is cloned from `t1`.
  >   > 2. Table `t3` is cloned from `t2`.
  >
  >   All three tables list the `ID` for `t1` as their `CLONE_GROUP_ID`, even if `t1` is dropped and eventually purged from Snowflake.
  > + If the IDs are identical, the table is not a clone.
  > + Storage bytes are always owned by, and therefore billed to, the table where the bytes were initially added. If the table is then cloned, storage metrics for these initial bytes never transfer
  >   to the clones, even if the bytes are deleted from the source table.
* Cloned tables share the same underlying storage (at the micro-partition level) until either the original table or cloned table is modified. With each change made to either table, the table takes
  “ownership” of the changed bytes.
* Dropped tables are displayed in the view as long as they still incur storage costs:

  > + Dropped tables retain their active storage metrics, indicating how many bytes will be active if the table is restored.
  > + Dropped tables in the Time Travel retention period for the table can be restored using the UNDROP command.
  > + Dropped tables in Fail-safe (`TABLE_ENTERED_FAILSAFE` is not `NULL`) will potentially display `NULL` values in most columns, except for:
  >
  >   > ID columns:
  >   > :   `ID` , `CLONE_GROUP_ID`
  >   >
  >   > Bytes columns:
  >   > :   `ACTIVE_BYTES` , `TIME_TRAVEL_BYTES` , `FAILSAFE_BYTES` , `RETAINED_FOR_CLONE_BYTES`
  >
  >   These tables cannot be restored using the UNDROP command.
* When data is deleted from a table with a Time Travel retention period of 0 days, asynchronous background processes purge the active bytes
  or move them directly into Fail-safe storage, depending on the table type. This may take a short time to complete. During that time, the
  `TIME_TRAVEL_BYTES` column may contain a non-zero value even when the Time Travel retention period is 0 days.
* `FAILSAFE_BYTES` denotes bytes that have passed beyond Time Travel. All such bytes are billed to the current table.
* If multiple rows have the same value in the `TABLE_NAME` column, this indicates that multiple versions of the table exist. A version is created each time a table is dropped and a new table
  with the same name is created, including when a [CREATE OR REPLACE TABLE](../sql/create-table.md) command is issued on an existing table. Note that the current version will have a
  `NULL` value for the `TABLE_DROPPED` column; all other versions will have a timestamp value. This is important to note because each version of a table incurs storage costs associated with
  Time Travel (and Fail-safe, if the table is permanent).
* Any data in the `DELETED` column prior to August 2018 may not be accurate.
* In some cases, active bytes might include bytes for data in a dropped column. For more information,
  see the [usage notes](../sql/alter-table.md) for ALTER TABLE.
* For Iceberg tables:

  + Snowflake doesn’t bill for [Iceberg table](../../user-guide/tables-iceberg.md) storage when the table uses
    an external volume that you manage. However, if the table uses
    [Snowflake Storage](../../user-guide/tables-iceberg-internal-storage.md) (`EXTERNAL_VOLUME = SNOWFLAKE_MANAGED`),
    Snowflake charges for the storage.
    For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md).
  + If the table is externally managed,
    this view might display inaccurate storage utilization information.

---
title: TABLES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/tables.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TABLES view

This Account Usage view displays a row for each table and view in the account.

See also:
:   [COLUMNS view](columns.md) , [VIEWS view](views.md), [TABLES view](../info-schema/tables.md) (Information Schema)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal, Snowflake-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| TABLE_SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the table. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal, Snowflake-generated identifier of the database for the table. |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the table. |
| TABLE_TYPE | VARCHAR | Indicates the table type. Valid values are `BASE TABLE`, `TEMPORARY TABLE`, `EXTERNAL TABLE`, `EVENT TABLE`, `VIEW`, or `MATERIALIZED VIEW`. |
| IS_TRANSIENT | VARCHAR | Indicates whether the table is transient. |
| CLUSTERING_KEY | VARCHAR | Column(s) and/or expression(s) that comprise the clustering key for the table. |
| ROW_COUNT | NUMBER | Number of rows in the table. |
| BYTES | NUMBER | Number of bytes accessed by a scan of the table. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| SELF_REFERENCING_COLUMN_NAME | VARCHAR | Not applicable for Snowflake. |
| REFERENCE_GENERATION | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_NAME | VARCHAR | Not applicable for Snowflake. |
| IS_INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_TYPED | VARCHAR | Not applicable for Snowflake. |
| COMMIT_ACTION | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Date and time when the table was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| DELETED | TIMESTAMP_LTZ | Date and time when the table was dropped. |
| AUTO_CLUSTERING_ON | VARCHAR | Status of Automatic Clustering for a table. For details, see [Viewing the Automatic Clustering status for a table](../../user-guide/tables-auto-reclustering.md). |
| COMMENT | VARCHAR | Comment for the table. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| IS_ICEBERG | VARCHAR | Indicates whether the table is an [Iceberg table](../../user-guide/tables-iceberg.md). Valid values are `YES` or `NO`. |
| IS_DYNAMIC | VARCHAR | Indicates whether the table is a [dynamic table](../../user-guide/dynamic-tables-about.md). Valid values are `YES` or `NO`. |
| IS_HYBRID | VARCHAR | Indicates whether the table is a [hybrid table](../../user-guide/tables-hybrid.md). Valid values are `YES` or `NO`. |
| ARCHIVE_STORAGE_COOL_ROW_COUNT | NUMBER | The number of rows that are in the COOL storage tier. |
| ARCHIVE_STORAGE_COOL_BYTES | NUMBER | The number of bytes accessed by retrieving data from the COOL storage tier. |
| ARCHIVE_STORAGE_COLD_ROW_COUNT | NUMBER | The number of rows that are in the COLD storage tier. |
| ARCHIVE_STORAGE_COLD_BYTES | NUMBER | The number of bytes accessed by retrieving data from the COLD storage tier. |

## Usage notes

* Latency for the view may be up to 90 minutes.
* > **Note:**
  >
  > With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
  > this view includes new columns for storage lifecycle policies.
  > To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
  > ```

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* Querying the `SUM(BYTES)` for a table does not represent the total storage usage, because the amount does not include Time Travel and Fail-safe usage.
* Using the value in the LAST_ALTERED column for Time Travel is *not* recommended and can return unexpected results for the following
  reaons:

  + Time Travel can only be used to query historical data modified by a [DML operation](../../user-guide/data-time-travel.md).
  + The LAST_ALTERED column inludes both DML and DDL operations (see the next usage note).
  + For DML operations, the value in the LAST_ALTERED column is the timestamp at the beginning of the statement execution rather than
    the time of the commit of the transaction containing this statement.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.
* The LAST_ALTERED column does not necessarily indicate the last refreshed time for external tables.
  To retrieve the last refreshed time for an auto-refreshed external table, you can use the
  [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](../functions/system_external_table_pipe_status.md) function, which returns
  information such as the timestamp of the last file Snowflake has registered.

## Examples

Retrieve the total size (in bytes) of all active tables in all schemas in your account:

```sqlexample
SELECT table_schema, SUM(bytes)
  FROM SNOWFLAKE.ACCOUNT_USAGE.TABLES
  WHERE deleted IS NULL
  GROUP BY table_schema;
```

---
title: TAG_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/tag_references.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TAG_REFERENCES view

This Account Usage view can be used to identify the associations between objects and tags.

This view only records the direct relationship between the object and the tag. [Tag inheritance](../../user-guide/object-tagging/inheritance.md) is not included in this view.

The view is complementary to the information schema table function [TAG_REFERENCES](../functions/tag_references.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TAG_DATABASE | VARCHAR | The database in which the tag is set. |
| TAG_SCHEMA | VARCHAR | The schema in which the tag is set. |
| TAG_ID | NUMBER | Internal/system-generated identifier for the tag. Note that for system tags this value is NULL. |
| TAG_NAME | VARCHAR | The name of the tag. This is the `key` in the `key = 'value'` pair of the tag. |
| TAG_VALUE | VARCHAR | The value of tag. This is the `'value'` in the `key = 'value'` pair of the tag. |
| OBJECT_DATABASE | VARCHAR | Database name of the referenced object for database and schema objects. If the object is not a database or schema object, the value is empty. |
| OBJECT_SCHEMA | VARCHAR | Schema name of the referenced object (for schema objects). If the referenced object is not a schema object (e.g. warehouse), this value is empty. |
| OBJECT_ID | NUMBER | Internal identifier of the referenced object. |
| OBJECT_NAME | VARCHAR | Name of the referenced object if the tag association is on the object. If the tag association is on a column, Snowflake returns the parent table name. |
| OBJECT_DELETED | TIMESTAMP_LTZ | Date and time when the associated or parent object was dropped. |
| DOMAIN | VARCHAR | Domain of the reference object (e.g. table, view) if the tag association is on the object. For columns, the domain is COLUMN if the tag association is on a column. For more information, see [supported domains](../functions/tag_references.md). |
| COLUMN_ID | NUMBER | The local identifier of the reference column; not applicable if the tag association is not a column. |
| COLUMN_NAME | VARCHAR | Name of the referenced column; not applicable if the tag association is not a column. |
| APPLY_METHOD | VARCHAR | Specifies how the tag got assigned to the object.   * `CLASSIFIED`: The tag was automatically applied to a column that was classified as containing sensitive data. See [About tag mapping](../../user-guide/classify-auto.md). * `MANUAL`: Someone manually set the tag on the object using a CREATE <object> command or ALTER <object> command. See [Set a tag](../../user-guide/object-tagging/work.md). * `PROPAGATED`: The tag was automatically propagated from one object to another. See [Automatic tag propagation with user-defined tags](../../user-guide/object-tagging/propagation.md). * `NULL`: Legacy record. * `NONE`: Legacy record. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not contain information about columns that have been deleted.
* The TAG_DATABASE_ID column is not included in this view. To obtain this value in your query result, perform a JOIN operation with the
  [TAGS view](tags.md).

## Examples

Return the tag references for your Snowflake account:

> ```sqlexample
> select tag_name, tag_value, domain, object_id
> from snowflake.account_usage.tag_references
> order by tag_name, domain, object_id;
> ```

Return the active objects that have tag associations in your Snowflake account. The addition of the specified WHERE clause filters the
objects that are deleted:

> ```sqlexample
> select tag_name, tag_value, domain, object_id
> from snowflake.account_usage.tag_references
> where object_deleted is null
> order by tag_name, domain, object_id;
> ```

---
title: TAGS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/tags.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TAGS view

This Account Usage view lists the tags in an account.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TAG_ID | NUMBER | The local identifier of a tag. |
| TAG_NAME | TEXT | The name of a tag. |
| TAG_SCHEMA_ID | NUMBER | The local identifier of the tag schema. |
| TAG_SCHEMA | TEXT | The name of schema in which the tag exists. |
| TAG_DATABASE_ID | NUMBER | The local identifier of the database in which the tag exists. |
| TAG_DATABASE | TEXT | The name of the database in which the tag exists. |
| TAG_OWNER | TEXT | The name of the role that owns the tag. |
| TAG_COMMENT | VARIANT | Comments for the tag, if any. |
| CREATED | TIMESTAMP_LTZ | Date and time when the tag was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the tag was dropped, or the date and time when its parents were dropped. |
| ALLOWED_VALUES | VARIANT | Specifies the possible string values that can be assigned to the tag when the tag is set on an [object](../../user-guide/object-tagging/introduction.md) or NULL if the tag does not have any specified allowed values. For details, see [Set a list of allowed tag values](../../user-guide/object-tagging/work.md). |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PROPAGATE | VARCHAR | Indicates whether the tag is configured for automatic propagation. Possible values are the following:   * NULL — Tag is not propagated. * `ON_DEPENDENCY` — Tag is propagated when there is an object dependency (for example, creating a view from a tagged table). * `ON_DATA_MOVEMENT` — Tag is propagated when there is data movement (for example, using a CTAS statement to create a table   from a tagged table). * `ON_DEPENDENCY_AND_DATA_MOVEMENT` — Tag is propagated for both object dependencies and data movement. |
| ON_CONFLICT | VARCHAR | If the tag is configured for automatic propagation, indicates what happens when the value of the tag being propagated conflicts with the value that was specified when the tag was manually applied to the same object. For more information, see [Tag propagation conflicts](../../user-guide/object-tagging/propagation.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Return the tag references for your Snowflake account:

> ```sqlexample
> select * from snowflake.account_usage.tags
> order by tag_name;
> ```

---
title: TASK_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/task_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TASK_HISTORY view

This Account Usage view enables you to retrieve the history of [task](../../user-guide/tasks-intro.md) usage within the last 365 days (1 year).
The view displays one row for each run of a task in the history.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the task. |
| QUERY_TEXT | VARCHAR | Text of the SQL statement. |
| CONDITION_TEXT | VARCHAR | Text of WHEN condition the task evaluates when determining whether to run. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the task. |
| TASK_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the task. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the task. |
| TASK_DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the task. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the task is/was scheduled to start running. Note that we make a best effort to ensure absolute precision, but only guarantee that tasks do not execute *before* the scheduled time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the task completed. |
| STATE | VARCHAR | Status of the completed task: SUCCEEDED, FAILED, CANCELLED, or SKIPPED. Note that the view does not return SCHEDULED or EXECUTING task runs. To retrieve the task history details for runs in a scheduled or executing state, query the [TASK_HISTORY](../functions/task_history.md) table function in the Information Schema. The timed-out tasks always have a `FAILED` state in the task history. |
| RETURN_VALUE | VARCHAR | Value set for the predecessor task in a [task graph](../../user-guide/tasks-graphs.md). The return value is explicitly set by calling the [SYSTEM$SET_RETURN_VALUE](../functions/system_set_return_value.md) function by the predecessor task. |
| QUERY_ID | VARCHAR | ID of the SQL statement executed by the task. Can be joined with the QUERY_HISTORY view for additional details about the execution of the statement or stored procedure. |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| ERROR_CODE | VARCHAR | Error code, if the statement returned an error. |
| ERROR_MESSAGE | VARCHAR | Error message, if the statement returned an error. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the task graph that was run, or is scheduled to be run. Each incremental increase in the value represents one or more modifications to tasks in the task graph. If the root task is recreated (using CREATE OR REPLACE TASK), then the version number restarts from 1. |
| RUN_ID | NUMBER | Time when the standalone or root task in a task graph is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system may reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run prior to retry. You may use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ROOT_TASK_ID | VARCHAR | Unique identifier for the root task in a task graph. This ID matches the ID column value in the SHOW TASKS output for the same task. |
| SCHEDULED_FROM | VARCHAR | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| CONFIG | VARCHAR | Displays the graph level configuration if set for the root task, otherwise displays NULL. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_PARAMETERIZED_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
| GRAPH_RUN_GROUP_ID | VARCHAR | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Usage notes

* Latency for the view may be up to 45 minutes.

* For increased performance, filter queries on the COMPLETED_TIME or SCHEDULED_TIME column.

## Examples

Retrieve records for the 10 most recent completed task runs:

> ```sqlexample
> SELECT query_text, completed_time
> FROM snowflake.account_usage.task_history
> ORDER BY completed_time DESC
> LIMIT 10;
> ```

Retrieve records for task runs completed in the past hour:

> ```sqlexample
> SELECT query_text, completed_time
> FROM snowflake.account_usage.task_history
> WHERE completed_time > DATEADD(hours, -1, CURRENT_TIMESTAMP());
> ```

---
title: TASK_VERSIONS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/task_versions.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TASK_VERSIONS view

This Account Usage view enables you to retrieve the history of [task versions](../../user-guide/tasks-intro.md). The returned rows
indicate the tasks that comprised a [task graph](../../user-guide/tasks-graphs.md) and their properties at a given time.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a DAG. This ID matches the ID column value in the SHOW TASKS output for the same task. Matches ROOT_TASK_ID in [COMPLETE_TASK_GRAPHS view](complete_task_graphs.md) and [TASK_HISTORY view](task_history.md). |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the task. Matches GRAPH_VERSION in [COMPLETE_TASK_GRAPHS view](complete_task_graphs.md). |
| GRAPH_VERSION_CREATED_ON | TIMESTAMP_LTZ | Date and time when this version of the task graph was saved. |
| NAME | TEXT | Name of the task. |
| ID | TEXT | Unique identifier for each task. Note that recreating a task (using CREATE OR REPLACE TASK) essentially creates a new task, which has a new ID. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contained the task. |
| DATABASE_NAME | TEXT | Name of the database in which the task is stored. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contained the task. |
| SCHEMA_NAME | TEXT | Name of the schema in which the task is stored. |
| OWNER | TEXT | Role that owns the task (that is, has the OWNERSHIP privilege on the task). |
| COMMENT | TEXT | Comment for the task. |
| WAREHOUSE_NAME | TEXT | Warehouse that provides the required resources to run the task. |
| SCHEDULE | TEXT | Schedule for running the task. Displays NULL if no schedule is specified. |
| PREDECESSORS | ARRAY | JSON array of any tasks identified in the AFTER parameter for the task (that is, predecessor tasks). When run successfully to completion, these tasks trigger the current task. Individual task names in the array are fully qualified (that is, include the container database and schema names). Displays an empty array if the task has no predecessor. |
| STATE | TEXT | Current state of the task: `started` or `suspended`. `NULL` for root tasks (tasks with no predecessors). |
| DEFINITION | TEXT | SQL statements executed when the task runs. |
| CONDITION_TEXT | TEXT | Condition specified in the WHEN clause for the task. |
| ALLOW_OVERLAPPING_EXECUTION | BOOLEAN | For root tasks in a DAG, displays TRUE if overlapping execution of the DAG is explicitly allowed. For child tasks in a DAG, displays NULL. |
| ERROR_INTEGRATION | TEXT | Name of the notification integration used to access Amazon Simple Notification Service (SNS), Google Pub/Sub, or Microsoft Azure Event Grid to relay error notifications for the task. |
| LAST_COMMITTED_ON | TIMESTAMP_LTZ | Timestamp when a version of the task was last set. If no version has been set (that is, if the task has not been resumed or manually executed after it was created), the value is NULL. |
| LAST_SUSPENDED_ON | TIMESTAMP_LTZ | Timestamp when the task was last suspended. If the task has not been suspended yet, the value is NULL. |
| TARGET_COMPLETION_INTERVAL | TEXT | The window of time when the task should perform. Only used for serverless tasks. Optional for serverless tasks, required for [serverless triggered tasks](../../user-guide/tasks-intro.md). |
| SCHEDULING_MODE | TEXT | Reserved for future functionality. Displays UNKNOWN. |

## Usage notes

Latency for the view may be up to 3 hours.

## Examples

Retrieve the tasks from a specific task graph based on the ROOT_TASK_ID and GRAPH_VERSION:

> ```sqlexample
> SELECT *
> FROM snowflake.account_usage.task_versions
> WHERE ROOT_TASK_ID = 'afb36ccc-. . .-b746f3bf555d' AND GRAPH_VERSION = 3;
> ```

Retrieve the task runs for a particular task graph and its descendant tasks from task_history, with additional task information from task_versions.

> ```sqlexample
> SELECT
> task_history.* rename state AS task_run_state,
> task_versions.state AS task_state,
> task_versions.graph_version_created_on,
> task_versions.warehouse_name,
> task_versions.comment,
> task_versions.schedule,
> task_versions.predecessors,
> task_versions.allow_overlapping_execution,
> task_versions.error_integration
> FROM snowflake.account_usage.task_history
> JOIN snowflake.account_usage.task_versions using (root_task_id, graph_version)
> WHERE task_history.ROOT_TASK_ID = 'afb36ccc-. . .-b746f3bf555d'
> ```

---
title: TRI_SECRET_SECURE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/tri-secret-secure-history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TRI_SECRET_SECURE_HISTORY view

This Account Usage view provides information about [Tri-Secret Secure](../../user-guide/security-encryption-tss.md)
customer-managed keys (CMKs) within the last year (365 days).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | System-generated ID that uniquely identifies a CMK used in the account. |
| REGISTERED_BY_USER_ID | NUMBER | Identifies the user who registered the CMK. Also appears in the ACCOUNT_USAGE view if a customer self-registered the CMK. |
| CMK_ACTIVATION_STATUS | VARCHAR | Activation status of the CMK. Valid values are:   * PENDING_ACTIVATION * ACTIVATED * PENDING_DEACTIVATION * DISABLED |
| CMK_IDENTIFIER | VARCHAR | Displays a value that you can use to locate your customer managed key. For example:   * AWS: key ARN * Azure: `https://mykeyvault.vault.azure.com/keys/my-rsa-key` * Google Cloud: `projects/PROJECT_ID/locations/LOCATION/keyRings/KEY_RING/cryptoKeys/KEY_NAME` |
| IS_REGISTERED | BOOLEAN | Whether the key represented by the CMK identifier is registered or not. |
| REGISTERED_ON | TIMESTAMP_LTZ | Identifies the last time when a user registered the CMK. |
| ACTIVATED_ON | TIMESTAMP_LTZ | Identifies the last time when a user activated the CMK. |
| DISABLED_ON | TIMESTAMP_LTZ | Identifies the last time when a user disabled the CMK. |
| UPDATED_ON | TIMESTAMP_LTZ | Identifies the time of the last update to this view. |

## Usage notes

* This view requires the SECURITY_VIEWER role.
* Latency for the view may be up to 120 minutes (2 hours).

## Examples

Retrieve all Tri-Secret Secure history records:

> ```sqlexample
> SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TRI_SECRET_SECURE_HISTORY;
> ```

---
title: TRUST_CENTER_FINDINGS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/trust_center_findings.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TRUST_CENTER_FINDINGS view

This Account Usage view shows security violations discovered by [Trust Center scanners](../../user-guide/trust-center/overview.md).

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | System identifier of the account that had the finding. |
| PROVIDER_ID | VARCHAR | System identifier of the provider of the scanner package. |
| SCANNER_PACKAGE_ID | VARCHAR | System identifier of the scanner package. |
| SCANNER_ID | VARCHAR | System identifier of the scanner. |
| SEVERITY | VARCHAR | Severity of the finding, as assigned by the scanner [LOW, MEDIUM, HIGH, CRITICAL]. |
| STATE | VARCHAR | State of the finding [OPEN, RESOLVED, RESOLVED MANUALLY]. |
| CREATED_ON | TIMESTAMP_LTZ | The time at which the finding was initially created. |
| UPDATED_ON | TIMESTAMP_LTZ | The time at which the finding was last updated. |

## Usage notes

Latency for the view may be up to 60 minutes (1 hour).

---
title: TYPES view
source: https://docs.snowflake.com/en/sql-reference/account-usage/types.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# TYPES view

This Account Usage view displays a row for each [user-defined type](../data-types-user-defined.md)
defined in the account.

See also:
:   [TYPES view](../info-schema/types.md) (Information Schema) ,
    [TYPES view](../organization-usage/types.md) (Organization Usage)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TYPE_ID | NUMBER | Internal/system-generated identifier for the type. |
| TYPE_NAME | VARCHAR | Name of the type. |
| TYPE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the type. |
| TYPE_SCHEMA | VARCHAR | Schema that contains the type. |
| TYPE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database that contains the type. |
| TYPE_CATALOG | VARCHAR | Database that contains the type. |
| TYPE_OWNER | VARCHAR | Name of the role that owns the type. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| BASE_DATA_TYPE | VARCHAR | Underlying data type of the user-defined type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters for VARCHAR types. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes for VARCHAR types. |
| NUMERIC_PRECISION | NUMBER | Numeric precision for NUMBER types. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision for NUMBER types. |
| NUMERIC_SCALE | NUMBER | Numeric scale for NUMBER types. |
| DATETIME_PRECISION | NUMBER | Fractional seconds precision for TIMESTAMP types. |
| CHECK_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| IS_NULLABLE_DEFAULT | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Date and time when the type was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the type was dropped. |
| COMMENT | VARCHAR | Comment for this type. |

## Usage notes

* Latency for the view might be up to 120 minutes (2 hours).
* The view only displays objects for which the current role for the session has been granted access privileges.
* The view doesn’t recognize the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve all user-defined types in the account:

```sqlexample
SELECT type_name, type_catalog, type_schema, type_owner, base_data_type
  FROM SNOWFLAKE.ACCOUNT_USAGE.TYPES
  ORDER BY created DESC;
```

Retrieve user-defined types that have been dropped:

```sqlexample
SELECT type_name, type_catalog, type_schema, deleted
  FROM SNOWFLAKE.ACCOUNT_USAGE.TYPES
  WHERE deleted IS NOT NULL
  ORDER BY deleted DESC;
```

---
title: USERS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/users.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# USERS view

This Account Usage view can be used to query a list of all users in the account.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| USER_ID | NUMBER | Internal/system-generated identifier for the user. |
| NAME | VARCHAR | A unique identifier for the user. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user’s account was created. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user’s account was deleted. |
| LOGIN_NAME | VARCHAR | Name that the user enters to log into the system. |
| DISPLAY_NAME | VARCHAR | Name displayed for the user in the Snowflake web interface. |
| FIRST_NAME | VARCHAR | First name of the user. |
| LAST_NAME | VARCHAR | Last name of the user. |
| EMAIL | VARCHAR | Email address for the user. |
| MUST_CHANGE_PASSWORD | BOOLEAN | Specifies whether the user is forced to change their password on their next login. |
| HAS_PASSWORD | BOOLEAN | Specifies whether a password was created for the user. |
| COMMENT | VARCHAR | Comment for the user. |
| DISABLED | VARIANT | Specified whether the user account is disabled preventing the user from logging in to the Snowflake and running queries. |
| SNOWFLAKE_LOCK | VARIANT | Specifies whether a temporary lock has been placed on the user’s account. |
| DEFAULT_WAREHOUSE | VARCHAR | The virtual warehouse that is active by default for the user’s session upon login. |
| DEFAULT_NAMESPACE | VARCHAR | The namespace (database only or database and schema) that is active by default for the user’s session upon login. |
| DEFAULT_ROLE | VARCHAR | The role that is active by default for the user’s session upon login. |
| EXT_AUTHN_DUO | BOOLEAN | Specifies whether Duo Security is enabled for the user, which requires the user to use MFA (multi-factor authorization) for login. |
| EXT_AUTHN_UID | VARCHAR | The authorization ID used for Duo Security. |
| HAS_MFA | BOOLEAN | Specifies whether the user is enrolled for multi-factor authentication. |
| BYPASS_MFA_UNTIL | TIMESTAMP_LTZ | The number of minutes to temporarily bypass MFA for the user. |
| LAST_SUCCESS_LOGIN | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user last logged in to the Snowflake. |
| EXPIRES_AT | TIMESTAMP_LTZ | The date and time when the user’s status is set to `EXPIRED` and the user can no longer log in. This is useful for defining temporary users (e.g. users who should only have access to Snowflake for a limited time period). |
| LOCKED_UNTIL_TIME | TIMESTAMP_LTZ | Specifies the number of minutes until the temporary lock on the user login is cleared. |
| HAS_RSA_PUBLIC_KEY | BOOLEAN | Specifies whether RSA public key used for key pair authentication has been set up for the user. |
| PASSWORD_LAST_SET_TIME | TIMESTAMP_LTZ | The timestamp on which the last non-null password was set for the user. Default to null if no password has been set yet or if Snowflake is unable to determine the timestamp for the user before the inclusion of this column. |
| OWNER | VARCHAR | Specifies the role with the OWNERSHIP privilege on the object. |
| DEFAULT_SECONDARY_ROLE | VARCHAR | Specifies the default secondary role for the user (that is, ALL) or NULL if not set. |
| HAS_PAT | BOOLEAN | If TRUE, a [programmatic access token (PAT)](../../user-guide/programmatic-access-tokens.md) has been generated for the user. |
| HAS_WORKLOAD_IDENTITY | BOOLEAN | If TRUE, the user is configured to use [workload identity federation](../../user-guide/workload-identity-federation.md) to authenticate with Snowflake. |
| TYPE | VARCHAR | Specifies the [type of user](../../user-guide/admin-user-management.md). |
| DATABASE_NAME | VARCHAR | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |
| DATABASE_ID | NUMBER | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL. |
| SCHEMA_NAME | VARCHAR | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |
| SCHEMA_ID | NUMBER | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema; otherwise, it’s NULL. |
| IS_FROM_ORGANIZATION_USER | BOOLEAN | If TRUE, the user was imported from an [organization user](../../user-guide/organization-users.md). |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).

* The `LAST_SUCCESS_LOGIN` column may have a value that differs from the `last_success_login` column in the
  SHOW USERS command output because of different methodologies used to record near-real-time and historical logins. The
  column might have a NULL value if the login history data for the user is outside the one-year retention period of
  historical data.
* Columns that are not applicable to service users (that is, users with `TYPE=SERVICE`) contain NULL values. For example,
  `HAS_PASSWORD` contains NULL values for service users.
* The `deletedOn` column might not be accurate for Snowpark Container Services [service user](../../developer-guide/snowpark-container-services/spcs-execute-sql.md). For services created before release 8.42.0, the `deletedOn` column of the service user shows as empty even if the associated service is dropped; For services created after release 8.42.0, the `deletedOn` column of the service user shows as the deletion time of the associating service.

### Internal Snowflake User for Snowsight

The first time [Snowsight](../../user-guide/ui-snowsight.md) is accessed in an account, Snowflake creates an internal WORKSHEETS_APP_USER user to support the web interface. This user is used to cache query results in an internal stage in an account. For more information, see [Getting started with Snowsight](../../user-guide/ui-snowsight-gs.md).

---
title: VIEWS view
source: https://docs.snowflake.com/en/sql-reference/account-usage/views.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# VIEWS view

This Account Usage view displays a row for each view in the account, not including the views in the ACCOUNT_USAGE, READER_ACCOUNT_USAGE, and INFORMATION_SCHEMA schemas.

See also:
:   [TABLES view](tables.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the view. |
| TABLE_NAME | VARCHAR | Name of the view. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that the view belongs to. |
| TABLE_SCHEMA | VARCHAR | Schema that the view belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database that the view belongs to. |
| TABLE_CATALOG | VARCHAR | Database that the view belongs to. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the view. |
| VIEW_DEFINITION | VARCHAR | Text of the query expression for the view. |
| CHECK_OPTION | VARCHAR | Not applicable for Snowflake. |
| IS_UPDATABLE | VARCHAR | Not applicable for Snowflake. |
| INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_SECURE | VARCHAR | Specifies whether the view is secure. |
| CREATED | TIMESTAMP_LTZ | Date and time when the view was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| DELETED | TIMESTAMP_LTZ | Date and time when the view was deleted. |
| COMMENT | VARCHAR | Comment for the view. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 90 minutes.

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.

---
title: WAREHOUSE_EVENTS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/warehouse_events_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# WAREHOUSE_EVENTS_HISTORY view

This Account Usage view can be used to return the events that have been triggered for the single-cluster and multi-cluster warehouses
in your account in the last 365 days (1 year).

Supported events include:

* Creating, dropping, or altering a warehouse, including resizing the warehouse.
* Resuming or suspending a warehouse.
* Resuming, suspending, or resizing a cluster in a warehouse (single-cluster and multi-cluster warehouses).
* Stopping or starting additional clusters in a warehouse (multi-cluster warehouses only).

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_LTZ | The timestamp when the event is triggered. |
| WAREHOUSE_ID | NUMBER | The unique warehouse ID (assigned by Snowflake) that corresponds to the warehouse name in your account. |
| WAREHOUSE_NAME | VARCHAR | The name of the warehouse in your account. |
| CLUSTER_NUMBER | NUMBER | If an event was triggered for a specific cluster in a multi-cluster warehouse, the number of the cluster (starting with 1) for which the event was triggered; if the event was triggered for all clusters in the warehouse or is not applicable for a single-cluster warehouse, NULL is displayed. |
| EVENT_NAME | VARCHAR | Name of the event. For the list of possible values, see below. |
| EVENT_REASON | VARCHAR | The cause of the event. For the list of possible values, see below. |
| EVENT_STATE | VARCHAR | State of an event that might take time to complete: STARTED or COMPLETED. |
| USER_NAME | VARCHAR | User who initiated the event. |
| ROLE_NAME | VARCHAR | Role that was active in the session at the time the event was initiated. |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| SIZE | VARCHAR | Current size of the warehouse at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| CLUSTER_COUNT | VARCHAR | Number of warehouse clusters at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| WAREHOUSE_TYPE | VARCHAR | One of `STANDARD` or `SNOWPARK-OPTIMIZED`. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| RESOURCE_CONSTRAINT | VARCHAR | One of: . - `STANDARD_GEN_1` . - `STANDARD_GEN_2` . - `MEMORY_1X` . - `MEMORY_1X_x86` . - `MEMORY_16X` . - `MEMORY_16X_x86` . - `MEMORY_64X` . - `MEMORY_64X_x86` . This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. It’s also NULL for standard warehouses created before the release of the STANDARD_GEN_2 feature. |

### EVENT_NAME descriptions

The following sections describe the valid values for the EVENT_NAME column for warehouse-related and cluster-related events.

#### Warehouse-related events

The following table describes the valid values for the EVENT_NAME column for warehouse-related events:

| EVENT_NAME | Description |
| --- | --- |
| CONVERT_WAREHOUSE | Triggered by the conversion of a warehouse from standard to Snowpark-optimized, from Snowpark-optimized to standard, from Gen1 to Gen2, or Gen2 to Gen1. This event happens whether the warehouse is running or suspended when the conversion happens.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | CONVERT_TO_SNOWPARK_OPTIMIZED, CONVERT_TO_STANDARD, or CONVERT_RESOURCE_CONSTRAINT |   Cost impact: Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  **Tip:** For information about cost implications of changing the RESOURCE_CONSTRAINT property, see [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](../../user-guide/warehouses-gen2.md). |
| CREATE_WAREHOUSE | Triggered by the creation of a new warehouse, which can occur when a user manually creates a warehouse or when an account is provisioned and the default warehouse is automatically created in the account.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: None if the cluster is created with INITIALLY_SUSPENDED = TRUE. Otherwise, metering starts when all compute resources are provisioned for the warehouse or the warehouse starts processing statements (if the warehouse starts processing statements before the resources are fully provisioned). |
| DROP_WAREHOUSE | Triggered when an existing warehouse is dropped; all currently executing queries on the warehouse are stopped and the compute resources are released.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: Metering on the compute resources for the warehouse stops after all currently executing queries complete. |
| ALTER_WAREHOUSE | Triggered when the properties of an existing warehouse are changed, including resizing the warehouse. If the warehouse is resized, additional RESIZE_WAREHOUSE events are triggered. This event can also trigger RESUME_WAREHOUSE or SUSPEND_WAREHOUSE events.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: Depends on the event(s) that are triggered by the ALTER statement. |
| RESIZE_WAREHOUSE | Triggered by changing the size of a warehouse, which increases or decreases the compute resources in each cluster in the warehouse. For a running warehouse, this event also triggers a RESIZE_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | STARTED | | Event reason | WAREHOUSE_RESIZE |   Cost impact: Resizing a running warehouse adds or removes compute resources in each cluster in the warehouse. Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  Resizing a suspended warehouse does not provision any new resources for the warehouse. |
| RESUME_WAREHOUSE | Triggered when a suspended warehouse is resumed or a new warehouse is created with INITIALLY_SUSPENDED = FALSE. This event also triggers a RESUME_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (applies to all clusters) | | Event state | STARTED | | Event reason | WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME |   Cost impact: Metering begins after all the compute resources are provisioned for the warehouse. |
| SUSPEND_WAREHOUSE | Triggered when a running warehouse is suspended. This event also triggers a SUSPEND_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (applies to all clusters) | | Event state | STARTED | | Event reason | WAREHOUSE_AUTOSUSPEND or WAREHOUSE_SUSPEND |   Cost impact: Metering on the compute resources for the warehouse stops after all running statements complete. |
| WAREHOUSE_CONSISTENT | Triggered when pending changes to a warehouse complete. For more information, see Usage notes.   |  |  | | --- | --- | | Cluster number | NULL | | Event state | COMPLETED | | Event reason | NULL |   Cost impact: None. Metering occurs for the warehouse event that is logged with the STARTED state before the WAREHOUSE_CONSISTENT event.  For more information, see the cost impact of the warehouse events described in the previous rows. |

#### Cluster-related events

The following table describes the valid values for the EVENT_NAME column for cluster-related events:

| EVENT_NAME | Description |
| --- | --- |
| CONVERT_CLUSTER | Triggered by the conversion of a warehouse from standard to Snowpark-optimized, or from Snowpark-optimized to standard. This event is only emitted when the conversion happens while the warehouse is running.   |  |  | | --- | --- | | Cluster number | Number of the converted cluster (always `1` for a single-cluster warehouse) | | Event state | COMPLETED or STARTED | | Event reason | CONVERT_TO_SNOWPARK_OPTIMIZED or CONVERT_TO_STANDARD |   Cost impact: Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  **Tip:** For information about cost implications of changing the RESOURCE_CONSTRAINT property, see [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](../../user-guide/warehouses-gen2.md). |
| RESUME_CLUSTER | Triggered when a suspended cluster is resumed.   |  |  | | --- | --- | | Cluster number | Number of the resumed cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason | * WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME (single-cluster warehouse) * MULTICLUSTER_SPINUP (multi-cluster warehouse) |   Cost impact: Metering starts on the compute resources for the cluster after they are provisioned. |
| SUSPEND_CLUSTER | Triggered when a running cluster is suspended.   |  |  | | --- | --- | | Cluster number | Number of the suspended cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason: | * WAREHOUSE_AUTOSUSPEND or WAREHOUSE_SUSPEND (single-cluster warehouse) * MULTICLUSTER_SPINDOWN (multi-cluster warehouse) * RESOURCE_MONITOR_SUSPEND |   Cost impact: Metering stops on the compute resources for the cluster after all currently executing queries complete. |
| RESIZE_CLUSTER | Triggered when a cluster is resized, usually as a result of resizing a warehouse.   |  |  | | --- | --- | | Cluster number | Number of the resized cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason | * WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME (single-cluster warehouse) * MULTICLUSTER_SPINDOWN or MULTICLUSTER_SPINUP (multi-cluster warehouse) * WAREHOUSE_RESIZE |   Cost impact: Depends on whether compute resources are added or removed due to resizing. Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries. |
| SPINUP_CLUSTER | Triggered when a cluster is started (multi-cluster warehouse only); usually happens when the mininimum or maximum cluster size is increased.   |  |  | | --- | --- | | Cluster number | Number of the cluster that was started | | Event state | STARTED | | Event reason | * WAREHOUSE_RESIZE (single-cluster warehouse) * MULTICLUSTER_SPINUP (multi-cluster warehouse) |   Cost impact: Metering starts on the compute resources for the cluster after they are provisioned. |
| SPINDOWN_CLUSTER | Triggered when a running cluster is shut down (multi-cluster warehouse only); usually happens when the minimum or maximum cluster size is decreased.   |  |  | | --- | --- | | Cluster number | Number of the cluster that was shut down | | Event state | STARTED | | Event reason | * WAREHOUSE_RESIZE (single-cluster warehouse) * MULTICLUSTER_SPINDOWN (multi-cluster warehouse) |   Cost impact: Metering stops on the compute resources for the cluster after all currently executing queries complete. |

### EVENT_REASON descriptions

The following table describes the valid values for the EVENT_REASON column:

| EVENT_REASON | Description |
| --- | --- |
| WAREHOUSE_AUTORESUME | A suspended warehouse was resumed automatically because AUTO_RESUME is enabled for the warehouse and a SQL statement was submitted to the warehouse. |
| WAREHOUSE_RESUME | A suspended warehouse was resumed manually by a user. |
| WAREHOUSE_AUTOSUSPEND | A running warehouse was suspended automatically because AUTO_SUSPEND is enabled for the warehouse and the defined period of inactivity for AUTO_SUSPEND has passed. |
| WAREHOUSE_SUSPEND | A running warehouse was suspended manually by a user. |
| WAREHOUSE_RESIZE | A warehouse was resized. |
| RESOURCE_MONITOR_SUSPEND | A warehouse was suspended because the credit quota for the resource monitor for the warehouse was reached. |
| MULTICLUSTER_SPINUP | A new or suspended cluster was provisioned in a multi-cluster warehouse; not applicable to single-cluster warehouses. |
| MULTICLUSTER_SPINDOWN | A running cluster was shut down in a multi-cluster warehouse; not applicable to single-cluster warehouses. |

## Usage notes

* Latency for the view may be up to three hours.

* An event can produce multiple rows in the view if it triggers additional, related events.
* The value for the EVENT_REASON, USER_NAME, ROLE_NAME, and QUERY_ID columns is NULL for a WAREHOUSE_CONSISTENT event.
* The WAREHOUSE_CONSISTENT event might share the same timestamp with another warehouse event and be listed out of order.

### Warehouse event that indicates that an operation has completed

Events that create a warehouse, change the size of the warehouse or the number of clusters, or suspend a warehouse are not atomic
operations. This means that some small amount of time is required for these operations to fully complete.

For example, if a warehouse is suspended using an ALTER WAREHOUSE … SUSPEND statement, any queries that are currently executing on the
warehouse must complete (or time out) before it can be suspended. In some cases, multiple warehouse events might be in-flight
(for example, resize and suspend). When all warehouse events have completed, the warehouse is in a *consistent* state.

If a warehouse event is logged with the STARTED state in the EVENT_STATE column, it is never
logged with a COMPLETED state. Instead, an event logged with the STARTED state is always followed by a subsequent WAREHOUSE_CONSISTENT
event. If multiple warehouse events are logged with the STARTED event state, those events coalesce to the same WAREHOUSE_CONSISTENT event.

If a warehouse event is logged with the COMPLETED state in the EVENT_STATE column, no subsequent WAREHOUSE_CONSISTENT event follows
unless another pending event is logged with a STARTED state.

## Examples

### View events history for the previous week

View the events history for warehouse `my_wh` for the previous week by executing the following statement:

```sqlexample
SELECT timestamp, warehouse_name, cluster_number,
       event_name, event_reason, event_state,
       size, cluster_count
  FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_EVENTS_HISTORY
  WHERE warehouse_name = 'MY_WH'
  AND timestamp > DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY timestamp DESC;
```

### Example events history results

#### Events history for a statement with no pending changes

An ALTER WAREHOUSE statement is logged with the COMPLETED state when there are no additional changes pending. For example,
the following statement updates the comment for warehouse `my_wh`:

```sqlexample
ALTER WAREHOUSE my_wh SET
  COMMENT = 'Updated comment for warehouse';
```

This statement results in the following row in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-26 16:42:13.513 +0000 | MY_WH | ALTER_WAREHOUSE | COMPLETED | NULL | NULL |

#### Events history for a statement that is followed by a WAREHOUSE_CONSISTENT event

When an ALTER WAREHOUSE statement changes the warehouse size, additional events follow. For example, resize warehouse
`my_wh`:

```sqlexample
ALTER WAREHOUSE my_wh SET
  WAREHOUSE_SIZE = 'SMALL';
```

This statement results in the following rows in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-05-29 15:13:05.874 +0000 | MY_WH | ALTER_WAREHOUSE | STARTED | NULL | NULL |
| 2024-05-29 15:13:05.874 +0000 | MY_WH | RESIZE_WAREHOUSE | STARTED | NULL | NULL |
| 2024-05-29 15:13:06.036 +0000 | MY_WH | WAREHOUSE_CONSISTENT | COMPLETED | SMALL | 1 |
| 2024-05-29 15:13:06.036 +0000 | MY_WH | RESIZE_CLUSTER | COMPLETED | NULL | NULL |

#### Events history for a Snowflake-initiated warehouse event

When Snowflake resumes a multi-cluster warehouse, the following warehouse events are logged:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-23 17:04:11.618 +0000 | MY_WH | SPINUP_CLUSTER | STARTED | NULL | NULL |
| 2024-04-23 17:04:11.657 +0000 | MY_WH | RESUME_CLUSTER | STARTED | NULL | NULL |
| 2024-04-23 17:04:11.657 +0000 | MY_WH | WAREHOUSE_CONSISTENT | COMPLETED | LARGE | 5 |

---
title: WAREHOUSE_LOAD_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/warehouse_load_history.md
section: Account Usage
---

Schema:
:   [ACCOUNT_USAGE](../account-usage.md)

# WAREHOUSE_LOAD_HISTORY view

This Account Usage view can be used to analyze the workload on your warehouse within a specified date range.

See also:
:   [WAREHOUSE_METERING_HISTORY view](warehouse_metering_history.md)

## Columns

> **Note:**
>
> For the output columns of this view, the query load value is the ratio of the total execution time (in seconds) of all queries in a specific state in an interval by the total time (in seconds) for that interval.
>
> For example, if 276 seconds was the total time for 4 queries in a 5 minute (300 second) interval, then the query load value is 276 / 300 = 0.92.

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The start of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The end of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| AVG_RUNNING | NUMBER(38,9) | Query load value for queries executed. |
| AVG_QUEUED_LOAD | NUMBER(38,9) | Query load value for queries queued because the warehouse was overloaded. |
| AVG_QUEUED_PROVISIONING | NUMBER(38,9) | Query load value for queries queued because the warehouse was being provisioned. |
| AVG_BLOCKED | NUMBER(38,9) | Query load value for queries blocked by a transaction lock. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

* Load history is shown in 5-minute intervals.

---
title: WAREHOUSE_METERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/account-usage/warehouse_metering_history.md
section: Account Usage
---

Schemas:
:   [ACCOUNT_USAGE](../account-usage.md) , [READER_ACCOUNT_USAGE](../account-usage.md)

# WAREHOUSE_METERING_HISTORY view

This Account Usage view can be used to return the hourly credit usage for a single warehouse (or all the warehouses in your account) within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| READER_ACCOUNT_NAME | VARCHAR | Name of the reader account where the warehouse usage took place. Column only included in view in READER_ACCOUNT_USAGE schema. |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the warehouse usage took place. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| CREDITS_USED | NUMBER | Total number of credits used for the warehouse in the hour. This is a sum of CREDITS_USED_COMPUTE and CREDITS_USED_CLOUD_SERVICES. This value does not take into account the [adjustment for cloud services](../../user-guide/cost-understanding-compute.md), and may therefore be greater than the credits that are billed. To determine how many credits were actually billed, run queries against the [METERING_DAILY_HISTORY view](metering_daily_history.md). |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used for the warehouse in the hour. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER | Number of credits used for cloud services in the hour. |
| CREDITS_ATTRIBUTED_COMPUTE_QUERIES | NUMBER | Number of credits attributed to queries in the hour. . . Includes only the credit usage for query execution and doesn’t include warehouse idle time usage. |

## Usage notes

* In the ACCOUNT_USAGE schema, latency for the view is up to 180 minutes (3 hours), except for the CREDITS_USED_CLOUD_SERVICES column. Latency for
  CREDITS_USED_CLOUD_SERVICES is up to 6 hours.
* In the READER_ACCOUNT_USAGE schema, latency for the view is up to 24 hours.
* Warehouse idle time is not included in the CREDITS_ATTRIBUTED_COMPUTE_QUERIES column.

  See Examples for a query that calculates the cost of idle time.

* If you want to reconcile the data in this view with a corresponding view in the [ORGANIZATION USAGE schema](../organization-usage.md), you must first set the timezone of the session to UTC. Before querying the Account Usage view, execute:

  > ```sqlexample
  > ALTER SESSION SET TIMEZONE = UTC;
  > ```

## Examples

For example, to determine the cost of idle time for each warehouse for the last 10 days, execute the following statement:

```sqlexample
SELECT
  (SUM(credits_used_compute) -
    SUM(credits_attributed_compute_queries)) AS idle_cost,
  warehouse_name
FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
WHERE start_time >= DATEADD('days', -10, CURRENT_DATE())
  AND end_time < CURRENT_DATE()
GROUP BY warehouse_name;
```

## Organization Usage

SNOWFLAKE.ORGANIZATION_USAGE schema views for organization-level monitoring.

---
title: ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/access_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ACCESS_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query the access history of Snowflake objects (e.g. table, view, column).

## Columns

This section consists of tables that do the following:

* Provide a sample value for each column.
* Provide a description of each column in the view.
* Provide a description for each field in the JSON array for the `base_objects_accessed`, `direct_objects_accessed`,
  `objects_modified`, `provider_base_objects_accessed`, and `provider_policies_referenced` columns.
* Provide a description for each field in the object for the `object_modified_by_ddl` column.

### Sample column values

The following table provides a sample value for each column in the view.

| Column name | Example |
| --- | --- |
| `query_id` | `a0fda135-d678-4184-942b-c3411ae8d1ce` |
| `query_start_time` | `2022-01-25 16:17:47.388 +0000` |
| `user_name` | `JSMITH` |
| `direct_objects_accessed` | ```sqljson [   {     "objectDomain": "FUNCTION",     "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",     "objectId": "2",     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",     "dataType": "NUMBER(38,0)"   },   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "GOVERNANCE.TABLES.T1"   } ] ``` |
| `base_objects_accessed` | ```sqljson [   {     "objectDomain": "FUNCTION",     "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",     "objectId": "2",     "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",     "dataType": "NUMBER(38,0)"   },   {     "columns": [       {         "columnId": 68610,         "columnName": "CONTENT"       }     ],     "objectDomain": "Table",     "objectId": 66564,     "objectName": "GOVERNANCE.TABLES.T1"   } ] ``` |
| `objects_modified` | ```sqljson [   {     "objectDomain": "STRING",     "objectId":  NUMBER,     "objectName": "STRING",     "columns": [       {         "columnId": "NUMBER",         "columnName": "STRING",         "baseSources": [           {             "columnName": STRING,             "objectDomain": "STRING",             "objectId": NUMBER,             "objectName": "STRING"           }         ],         "directSources": [           {             "columnName": STRING,             "objectDomain": "STRING",             "objectId": NUMBER,             "objectName": "STRING"           }         ]       }     ]   },   ... ] ``` |
| `object_modified_by_ddl` | ```sqljson {   "objectDomain": STRING,   "objectName": STRING,   "objectId": NUMBER,   "operationType": STRING,   "properties": ARRAY } ``` |
| `policies_referenced` | ```sqljson [   {     "columns": [       {         "columnId": 68610,         "columnName": "SSN",         "policies": [           {               "policyName": "governance.policies.ssn_mask",               "policyId": 68811,               "policyKind": "MASKING_POLICY"           }         ]       }     ],     "objectDomain": "VIEW",     "objectId": 66564,     "objectName": "GOVERNANCE.VIEWS.V1",     "policies": [       {         "policyName": "governance.policies.rap1",         "policyId": 68813,         "policyKind": "ROW_ACCESS_POLICY"       }     ]   } ] ``` |

### Column descriptions

The following tables provide descriptions of each column in the view.

If a column contains `-1` in a number field or `TRUNCATED` in a string field, information in the column might have been truncated. For
more information, see Usage notes: Truncation.

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| `organization_name` | VARCHAR | Name of the organization. |
| `account_locator` | VARCHAR | System-defined identifier for an account. |
| `account_name` | VARCHAR | User-defined name that identifies an account within the organization. |
| `provider_base_objects_accessed` | ARRAY | Specifies the data objects in the provider’s account that were accessed by the consumer query.  Assumes the provider used an [organizational listing](../../user-guide/collaboration/listings/organizational/org-listing-about.md) to share the data object with the consumer. |
| `provider_policies_referenced` | ARRAY | If a consumer query accessed base objects that are protected by a policy in the provider’s account, this column lists the policy.  Assumes the provider used an [organizational listing](../../user-guide/collaboration/listings/organizational/org-listing-about.md) to share the data object with the consumer. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| `query_id` | VARCHAR | An internal, system-generated identifier for the SQL statement. This value is also mentioned in the [QUERY_HISTORY view](query_history.md). |
| `query_start_time` | TIMESTAMP_LTZ | The statement start time (UTC time zone). |
| `user_name` | VARCHAR | The user who issued the query. |
| `direct_objects_accessed` | ARRAY | A JSON array of data objects such as user-defined functions (i.e. UDFs and UDTFs), stored procedures, tables, views, and columns directly named in the query explicitly or through shortcuts such as using an asterisk (i.e. `*`).  Virtual columns can be returned in this field.  For additional notes about UDFs, see Usage notes. |
| `base_objects_accessed` | ARRAY | A JSON array of all base data objects to execute a query, including columns, external functions, UDFs, and stored procedures.  In this example, the fields in the first array specify a UDF. These same fields in the first array also specify a stored procedure, when applicable.  Note the following:   * This field specifies view names or view columns, including virtual columns, if a shared view is accessed in a data sharing consumer   account. * For additional notes about UDFs, see Usage notes. |
| `objects_modified` | ARRAY | A JSON array that specifies the objects that were associated with a write operation in the query.  The UDF and stored procedure array is the same as what is shown earlier and appears in the arrays for `baseSources` and `directSources` depending on how the access took place. For brevity, this example omits the UDF and stored procedure array.  For additional notes about UDFs, see Usage notes. |
| `object_modified_by_ddl` | OBJECT | Specifies the DDL operation on a database, schema, table, view, and column. These operations also include statements that specify a row access policy on a table or view, a masking policy on a column, and tag updates (e.g. set a tag, change a tag value) on the object or column. |
| `policies_referenced` | ARRAY | Specifies information about the enforced masking policy set on the column and the enforced row access policy set on the table, including policies set on intermediate objects or columns. |
| `parent_query_id` | VARCHAR | The query ID of the parent job or NULL if the job does not have a parent. |
| `root_query_id` | VARCHAR | The query ID of the top most job in the chain or NULL if the job does not have a parent. |
| `event_source` | VARCHAR | Indicates the source of the event that resulted in an access history record. Possible values include the following:   * `snowflake_sql` — Events generated by SQL statements that were executed within Snowflake. * `horizon_irc` — Events generated by calls made to the [Horizon Iceberg REST Catalog API](../../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md). |
| `additional_properties` | VARIANT | Provides operational metadata for the source of the event. |

### JSON field descriptions

The following table defines the fields in the JSON array for the `base_objects_accessed`, `direct_objects_accessed`, and
`objects_modified`, `provider_base_objects_accessed`, and `provider_policies_referenced` columns.

| Field | Data Type | Description |
| --- | --- | --- |
| accountName [1] | VARCHAR | The account locator of the consumer account that queried the provider’s data object. If the query wasn’t executed by a consumer, this field is omitted. |
| columnId | NUMBER | A column ID that is unique within the account. This value is identical to the columnID in the [COLUMNS view](columns.md). |
| columnName | VARCHAR | The name of the accessed column. For policies, specifies the column on which the masking policy is set. |
| objectId | NUMBER | An identifier for the object, which is unique within a given account and domain. This number will match:   * The `TABLE_ID` number for a table, view, or materialized view. You can obtain this value from [TABLES view](tables.md), [VIEWS view](views.md), or [MATERIALIZED_VIEW_REFRESH_HISTORY view](materialized_view_refresh_history.md). * If a stage was accessed, this number will match the:    + `NAME` identifier for a user stage (see [USERS view](users.md))   + `TABLE_ID` number for a table stage (see [TABLES view](tables.md))   + `STAGE_ID` number for a name stage (see [STAGES view](stages.md)) |
| objectName | VARCHAR | The fully qualified name of the object that was accessed.  If a masking policy is set on a column or a row access policy is set on a table or view, the value refers to the fully qualified name of the table or view on which the row access policy is set or the table or view that has a masking policy set on one of its columns.  If a stage was accessed, this value will be the:   * `username` (User stage). * `table_name` (Table stage). * `stage_name` (Named stage). |
| objectDomain | VARCHAR | The type of object. For a list of supported objects, see [Supported Objects](../../user-guide/access-history.md).  Note that `FUNCTION` specifies UDFs, UDTFs, and external functions.  For data access policies, specifies the domain of the object on which the policy is set. |
| location | VARCHAR | The URL of the external location when the data access is an external location (e.g. `s3://mybucket/a.csv`). . If the query does not access a stage, this field is omitted. |
| stageKind | VARCHAR | When writing to a stage, one of the following: `Table | User | Internal Named | External Named` If the query does not access a stage, this field is omitted. |
| baseSources | VARCHAR | The columns that serve as the source columns for the columns specified by `directSources`. These columns facilitate column lineage. |
| directSources | VARCHAR | The columns specifically mentioned in the data write portion of the SQL statement that serves as the source columns in the target table to which data is written. These columns facilitate column lineage. |
| policyName | VARCHAR | The fully-qualified name of the policy. |
| policyId | NUMBER | An identifier for the policy, which is unique within a given account and domain. This value matches the identifier for a masking policy in the [MASKING_POLICIES view](masking_policies.md) or the identifier for a row access policy in the [ROW_ACCESS_POLICIES view](row_access_policies.md). |
| policyKind | VARCHAR | Either: MASKING_POLICY or ROW_ACCESS_POLICY |
| argumentSignature | VARCHAR | The name and data type for each argument in the UDF or stored procedure. |
| dataType |  | The data type of the return value for a UDF or stored procedure.  This value helps to differentiate two or more UDFs that have the same name but different return types. |
| joinObjects | VARCHAR | If a query contains a join, returns an array containing the joined objects and type of join. |
| joinObject | VARCHAR | The table or view that was joined with the accessed object. |
| type | VARCHAR | The type of join, as described in [JOIN](../constructs/join.md), [ASOF JOIN](../constructs/asof-join.md), and [LATERAL](../constructs/join-lateral.md). |

[1]

This field is found in the ACCESS_HISTORY view of the ORGANIZATION_USAGE schema, but not the ACCESS_HISTORY view of the ACCOUNT_USAGE schema.

### Object field descriptions for `object_modified_by_ddl`

The following table describes the fields of objects in the `object_modified_by_ddl` column.

| Field | Data type | Description |
| --- | --- | --- |
| objectDomain | VARCHAR | Type of the object defined or modified by the DDL operation. For more information about supported object types, see [Supported Objects](../../user-guide/access-history.md). |
| objectId | NUMBER | The identifier for the object, which is unique within a given account and domain, defined or modified by the DDL operation. |
| objectName | VARCHAR | The fully qualified name of the object defined or modified by the DDL operation. |
| operationType | VARCHAR | The SQL keyword that specifies the operation on the table, view, or column. For ALTER, CREATE, and DROP, this can also apply to listings and shares. For GRANT and REVOKE, this can also apply to shares. The following values are supported: ALTER | CREATE | DESCRIBE | DROP | REPLACE | UNDROP | REFRESH | SHOW | SUSPEND | RESUME | GRANT | REVOKE |
| properties | ARRAY | A JSON array that specifies the object or column properties when you create, modify, drop, or undrop the object or column. There are two types of properties: atomic and compound. |

For the `properties` JSON array:

* Atomic: one value per property (e.g. a `comment` has a single string value, the `enabled` property is a boolean and has one value).
* Compound: the property is multi-valued (e.g. `allowed_values` for a tag, masking policy).

Compound properties are recorded in a JSON array. For example, if a table contains a single column named EMAIL, the column is recorded as
follows:

```json
"columns": {
  "email": {
    "objectId": {
      "value": 1
    },
    "subOperationType": "ADD"
  }
}
```

In the previous example,

* `objectId` specifies the identifier for the column or object, except for allowed tag values, which don’t have an identifier.
* `subOperationType` can be one of the following values:

  + `ADD` specifies adding a compound property (for example, adding a column, setting allowed values).
  + `DROP` specifies removing a compound property.
  + `ALTER` specifies modifying a compound property.

#### CREATE or ALTER LISTING properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when `operationType` is CREATE or ALTER for a *listing*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample CREATE EXTERNAL LISTING my_listing SHARE my_share   AS $$my_manifest$$ ``` | ```json "manifest": {   "value": "my_manifest" }, "share": {   "value": "my_share" } ``` |
| ```sqlexample ALTER LISTING my_listing   AS $$my_manifest$$ ``` | ```json "manifest": {   "value": "my_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   ADD TARGETS $$my_targets_manifest$$; ``` | ```json "addTargets": {   "value": "my_targets_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   REMOVE TARGETS $$my_targets_manifest$$; ``` | ```json "removeTargets": {   "value": "my_targets_manifest" } ``` |
| ```sqlexample ALTER LISTING my_listing   ADD VERSION V3   FROM @listing_db.listing_schema.stage1; ``` | ```json "manifestStageLocation": {   "value": "@listing_db.listing_schema.stage1" }, "versionAlias": {   "value": "V3" } ``` |

#### CREATE or ALTER SHARE properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when the `operationType` is CREATE or ALTER for a *share*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample CREATE SHARE my_share   SECURE_OBJECTS_ONLY=FALSE; ``` | ```json "secureObjectsOnly": {   "value": false } ``` |
| ```sqlexample ALTER SHARE my_share   SET ACCOUNTS = acc1, acc2; ``` | ```json "accountsToSet": {   "value": [ "acc1", "acc2" ] } ``` |
| ```sqlexample ALTER SHARE my_share   ADD ACCOUNTS = acc1, acc2   SHARE_RESTRICTIONS = false; ``` | ```json "accountsToAdd": {  "value": [ "acc1", "acc2" ] }, "shareRestrictions": {   "value": false } ``` |
| ```sqlexample ALTER SHARE my_share   REMOVE ACCOUNTS = acc1, acc2; ``` | ```json "accountsToRemove": {   "value": [ "acc1", "acc2" ] } ``` |

#### GRANT TO SHARE or REVOKE FROM SHARE properties of OBJECT_MODIFIED_BY_DDL

The following table describes available `properties` arrays when the `operationType` is GRANT TO or REVOKE FROM for a *share*.

| Command | Properties of OBJECT_MODIFIED_BY_DDL |
| --- | --- |
| ```sqlexample GRANT USAGE ON DATABASE my_db   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "PRIVILEGES": [       "USAGE"     ],     "SECURABLE_OBJECT_DOMAIN": "Database",     "SECURABLE_OBJECT_ID": 1234,     "SECURABLE_OBJECT_NAME": "MY_DB"   } } ``` |
| ```sqlexample GRANT SELECT ON ALL TABLES IN SCHEMA my_db.my_sch   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "PRIVILEGES": [       "SELECT"     ],     "SECURABLE_OBJECT_DOMAIN": "Table",     "SECURABLE_OBJECT_SCOPE": "MY_DB.MY_SCH",     "SECURABLE_OBJECT_SCOPE_DOMAIN": "Schema"   } } ``` |
| ```sqlexample GRANT DATABASE ROLE my_db.my_role   TO SHARE my_share; ``` | ```json "grant": {   "value": {     "ROLES": [       "MY_DB.MY_ROLE"     ]   } } ``` |
| ```sqlexample REVOKE SELECT ON VIEW my_db.my_sch.my_view   FROM SHARE my_share; ``` | ```json "revoke": {   "value": {     "PRIVILEGES": [       "SELECT"     ],     "SECURABLE_OBJECT_DOMAIN": "View",     "SECURABLE_OBJECT_ID": 6789,     "SECURABLE_OBJECT_NAME": "MY_DB.MY_SCH.MY_VIEW"   } } ``` |

## Usage notes

General notes:
:   * For increased performance, filter queries on the `query_start_time` column and choose narrower time ranges. For sample queries,
      see [Querying the ACCESS_HISTORY View](../../user-guide/access-history.md).
    * Secure Views. The log record contains the underlying base table (i.e. `base_objects_accessed`) to generate the view. Examples
      include queries on other Account Usage and Organization Usage views and queries on base tables for extract, transform, and load
      (i.e. ETL) operations.
    * Records in the QUERY_HISTORY view do not always get recorded in the
      ACCESS_HISTORY view. The structure of the SQL statement determines whether Snowflake records an entry in the ACCESS_HISTORY view.
    * Specifying the `USING` clause while querying this view might cause non-referenced columns to be recorded in
      `direct_objects_accessed` field. As a workaround, replace the `USING` clause with a `JOIN ... ON ...` clause.
      For details, refer to:

      + [JOIN and USING](../constructs/join.md) (in the JOIN reference topic)
      + [Tracking Sensitive stage data movement](../../user-guide/access-history.md) (in the Access History query example)

Read query notes:
:   This view supports read queries of the following type:

    * SELECT, including CREATE TABLE … AS SELECT (i.e. CTAS).

      + Snowflake records the SELECT subquery in a CTAS operation.
    * CREATE TABLE … CLONE

      + Snowflake records the source table in a CLONE operation.
    * COPY INTO … TABLE

      + Snowflake logs this query only when the table is specified as the source in a FROM clause.
    * DML operations that read data (e.g. contains a SELECT subquery, specifies certain columns in WHERE or JOIN): INSERT … SELECT,
      UPDATE, DELETE, and MERGE.
    * UDFs and [Tabular SQL UDFs (UDTFs)](../../developer-guide/udf/sql/udf-sql-tabular-functions.md) if tables are included in queries inside the functions. This is
      logged in the `base_objects_accessed` field.

Write operation notes:
:   This view supports write operations of the following type:

    * GET `<internal_stage>`
    * PUT `<internal_stage>`
    * DELETE
    * TRUNCATE
    * INSERT

      + INSERT INTO … FROM SELECT \*
      + INSERT INTO TABLE … VALUES ()
    * MERGE INTO … FROM SELECT \*
    * UPDATE

      + UPDATE TABLE … FROM SELECT \* FROM …
      + UPDATE TABLE … WHERE …
    * Data loading statements:

      + COPY INTO TABLE FROM internalStage
      + COPY INTO TABLE FROM externalStage
      + COPY INTO TABLE FROM externalLocation
    * Data unloading statements:

      + COPY INTO internalStage FROM TABLE
      + COPY INTO externalStage FROM TABLE
      + COPY INTO externalLocation FROM TABLE
    * CREATE:

      + CREATE DATABASE … CLONE
      + CREATE SCHEMA … CLONE
      + CREATE TABLE … CLONE
      + CREATE TABLE … AS SELECT
    * For write operations that call the [CASE](../functions/case.md) function to determine the columns to access, such as a CTAS
      statement with the CASE function in the SELECT query, all columns referenced in every CASE branch are recorded in the
      `base_objects_accessed` column, the `direct_objects_accessed` column, or both columns depending on how the CTAS statement
      is written.

Data sharing notes:
:   If a Data Sharing provider account shares objects to Data Sharing consumer accounts through a share:

    * **Provider accounts:** The queries and logs on the shared objects executed in the provider account are not visible to
      Data Sharing consumer accounts.
    * **Consumer accounts:** The queries on the data share executed in the consumer account are logged and only visible to
      the consumer account, not the Data Sharing provider account.

      For example, if the provider shares a table and a view built from the table to the consumer account, and there is a query on the
      shared view, Snowflake records the shared view access in the `base_objects_accessed` column. This record, which includes the
      `columnName` and `objectName` values, allows the consumer to know which object was accessed in their account and also protects
      the provider because the underlying table (via the `objectId` and `columnId`) is not revealed to the consumer.
    * For column lineage:

      If a data sharing provider makes a view available to the data sharing consumer, the source columns for the view are not visible to the
      consumer because the columns originate from the data sharing provider.

      If the data sharing consumer moves data from the shared view to a table, Snowflake does not record the view columns as
      `baseSources` for the newly created table.
    * For shared UDFs and UDTFs:

      + In the consumer account, the local ACCESS_HISTORY view records the UDF/UDTF that was shared by the provider when the shared UDF/UDTF
        is invoked by the consumer.
      + In the provider account, the local ACCESS_HISTORY view records provider usage of a shared UDF/UDTF. Users in the consumer account
        cannot view how the provider account uses the shared UDF/UDTF.
    * For tracking policy references:

      The `policies_referenced` column contains policies that are local to the account that queries the data.

      If a provider shares a policy-protected table and a consumer accesses this table, the consumer cannot see the policy the provider set
      on the table or its columns.

      If a consumer creates a view (`v1`) from the shared object, sets a policy to the view (`v1`) or its columns, and a user in the
      consumer account accesses the protected view (`v1`) or another view (`v2`) created from the protected view (`v1`), the
      ACCESS_HISTORY view in the consumer account contains the policy that protects the view (`v1`) and its columns. The provider cannot
      see the record that corresponds to `v1`.

Hybrid tables:
:   Short-running queries that operate exclusively against hybrid tables will no
    longer generate a record in the QUERY_HISTORY view, in [QUERY_HISTORY view](query_history.md), or
    in the output of the QUERY_HISTORY table function. To monitor such queries, use the
    [AGGREGATE_QUERY_HISTORY](../account-usage/aggregate_query_history.md).

    To monitor Access History for such queries, use the
    [AGGREGATE_ACCESS_HISTORY](../account-usage/aggregate_access_history.md).
    This view allows you to more easily monitor high-throughput operational
    workloads for Access History.

Snowflake Native App Framework notes:
:   Some queries related to a Snowflake Native App are redacted. For details, see [Information redacted from SQL commands and views](../../developer-guide/native-apps/redacted-content.md).

Tag-based masking notes:
:   If a user accesses a table or view protected by a [tag-based masking policy](../../user-guide/tag-based-masking-policies.md), the
    `policies_referenced` column contains the masking policy applied through the tag when Snowflake enforces the masking policy on the
    protected column.

    The ACCESS_HISTORY view does not record any tag information.

UDFs & Stored Procedure notes:
:   These notes apply to external functions, UDFs and UDTFs for all languages, including when these functions have the `SECURE` property,
    and stored procedures with owner’s rights and caller’s rights:

    Column details:

    * The `direct_objects_accessed` column records explicit mention of these functions and procedures in a query.

      Snowflake does not record nested UDFs (i.e. a UDF mentioned in the definition of another UDF) in this column.
    * The `base_objects_accessed` column records external functions, shared functions, non-SQL UDFs, and stored procedures that are
      called in a query.
    * The `objects_modified` column records:

      + The UDF/UDTF when the result of calling the function copies the result to another column.
      + The UDF, UDTF, and an external function can be recorded in the arrays for `baseSources` and `directSources` depending on how the
        query is written.

Not supported:
:   This view does not log accesses of the following types:

    * Snowflake-provided [table functions](../functions-table.md), [Account Usage](../account-usage.md) views, and
      [Organization Usage](../organization-usage.md) views.
    * [RESULT_SCAN](../functions/result_scan.md) to obtain prior results.
    * An Access History record is generated when DDL operations are performed on
      [sequences](../../user-guide/querying-sequences.md). It is not generated when a sequence is used in any other
      operations, including generating new values.
    * Intermediate views accessed between the base table and direct object.

      For example, consider a query on View_A with the following object structure: View_A » View_B » View_C » Base_Table.

      The ACCESS_HISTORY view records the query on View_A and the Base_Table, not View_B and View_C.
    * The operations to update streams.
    * Data movement resulting from replication.
    * Failed queries, although logged in the QUERY_HISTORY view, will *not* be logged in the ACCESS_HISTORY view.

## Usage Notes: Column Lineage

These additional notes pertain to column lineage:

Supported operations:
:   Column lineage tracks details for the following SQL operations:

    * [CREATE TABLE … AS SELECT](../sql/create-table.md) (CTAS)
    * [CREATE TABLE … CLONE](../sql/create-table.md)
    * [INSERT … SELECT …](../sql/insert.md)
    * [MERGE](../sql/merge.md)
    * [UPDATE](../sql/update.md), two possible variations, for example:

      + Self-update:

        ```sqlexample
        UPDATE mydb.s1.t1 SET col_1 = col_1 + 1;
        ```
      + Two table update:

        ```sqlexample
        UPDATE mydb.s1.t1 FROM mydb.s2.t2 SET t1.col1 = t2.col1;
        ```
    * [ALTER TABLE](../sql/alter-table.md) … RENAME TO

Query Conditions:
:   * [Query profile/plan](../../user-guide/ui-snowsight-activity.md)

      The query plan Snowflake writes determines whether the ACCESS_HISTORY view contains column lineage. If a column needs to be
      evaluated as part of the query plan, Snowflake contains the column in the ACCESS_HISTORY view, even if the end result of the query plan
      is that the column is not included in the end result.

      For example, consider the following [INSERT](../sql/insert.md) statement with a `WHERE` clause for a particular column value:

      > ```sqlexample
      > insert into a(c1)
      > select c2
      > from b
      > where c3 > 1;
      > ```

      Even if the WHERE clause evaluates to `FALSE`, Snowflake records the `c2` column as a source column for the `c1` column. The
      `c3` column is not listed as a source column for either `baseSources` or `directSources`.
    * Masked columns:

      + The masked column is always listed in the `directSources` field.
      + The record in the `baseSources` field depends on the policy definition. For example:

        - If the masking policy conditions use a [CASE](../functions/case.md) function, then all of the columns referenced in each of
          the CASE branches are recorded in the `baseSources` field.
        - If the masking policy conditions only specify a constant value (e.g. `*****`), then the `baseSources` field is empty.
    * UDFs:

      + When passing a column as an argument to a UDF and writing the result to another column, the column that is passed as the argument
        is recorded in the `directSources` field. For example:

        > ```sqlexample
        > insert into A(col1) select f(col2) from B;
        > ```

        In this example, Snowflake records `col2` in the `directSources` field because the column is an argument for the UDF named
        `f`.
      + The record in the `baseSources` field depends on the UDF definition.

View columns:
:   View columns are not considered to be source columns and are not listed in the `baseSources` field when data from a view column
    is copied to a table column. The view columns in this case are listed in the `directSources` field.

EXISTS Subquery:
:   Columns that are referenced in the [EXISTS](../operators-subquery.md) subquery clause are not considered to be source
    columns.

## Usage Notes: `object_modified_by_ddl` Column

`IF [ NOT ] EXISTS` clauses: The `object_modified_by_ddl` column only records `CREATE` or `REPLACE` when creating
or modifying an object.

The column records these changes based on the following SQL operations. The DROP and UNDROP operations apply to tables and views, not
columns.

```sqlexample
CREATE OR REPLACE

ALTER ... { SET | UNSET }

ALTER ... ADD ROW ACCESS POLICY

ALTER ... DROP ROW ACCESS POLICY

ALTER ... DROP ALL ROW ACCESS POLICIES

DROP | UNDROP
```

The following table summarizes the relationship between DDL operations, supported domains, and the properties Snowflake records.

| Operation | Domain | Properties | Notes |
| --- | --- | --- | --- |
| CREATE [ OR REPLACE ] | TABLE | EXTERNAL TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | Column name, column identifier. | CREATE DATABASE and CREATE SCHEMA operations do not have properties recorded. |
| CREATE | TABLE … { AS SELECT | USING TEMPLATE | LIKE | CLONE } | Column name, column identifier. | Snowflake records the creation source for LIKE and CLONE operations.  Snowflake does not record the creation source when the source object is from a share or with USING TEMPLATE. |
| ALTER … RENAME TO  ALTER TABLE … RENAME COLUMN | TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | DATABASE | SCHEMA | The new name of the object or column. |  |
| ALTER … SWAP WITH | TABLE | SCHEMA | DATABASE | objectName, objectId, objectDomain | There are two records in the view, one for each swap target. Each record contains the same query identifier value. |
| ALTER … { ADD | DROP } COLUMN | TABLE | Column name, column identifier, and the ADD or DROP subOperationType. |  |
| DROP | TABLE | VIEW | MATERIALIZED VIEW | ICEBERG TABLE | DATABASE | SCHEMA | Snowflake does not record properties for these operations. |  |
| UNDROP | TABLE | ICEBERG TABLE | SCHEMA | DATABASE | Snowflake does not record properties for these operations. |  |

## Usage notes: Truncation

When a record exceeds the size limit for the view, Snowflake applies a progressive truncation strategy that preserves the most critical audit information while reducing the record size. Truncation adheres to the following general guidelines:

* Column-level information is truncated before object-level information.
* Lineage information is truncated before data access and data protection policy information.
* Query-level metadata columns (`query_id`, `query_start_time`, and `user_name`) are always preserved.

When truncating information, Snowflake replaces numbers with `-1` and replaces strings with `TRUNCATED`. These sentinel elements indicate that information has been truncated.

The following sections describe the order in which records are truncated. Truncation stops as soon as the record fits within the size constraints.

Phase 1: Truncate column lineage in the `object_modified` column
:   ```json
      {
      "objectDomain": "Stream",
      "objectId":  1105,
      "objectName": "\"NESTED_ALERT_PIPELINE_ALERT_eK1VYsLDcTcpqPAA\"",
      "columns": [
        {
          "columnId": -1,
          "columnName": "TRUNCATED",
        }
      ]
    }
    ```

Phase 2: Truncate column information in the `policies_referenced` column
:   ```json
    [
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED",
          }
        ],
        "objectDomain": "VIEW",
        "objectId": 66564,
        "objectName": "GOVERNANCE.VIEWS.V1",
        "policies": [
          {
            "policyName": "governance.policies.rap1",
            "policyId": 68813,
            "policyKind": "ROW_ACCESS_POLICY"
          }
      ]
      }
    ]
    ```

Phase 3: Truncate column access information in the `base_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "Function",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 4: Truncate column access information in the `direct_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "Function",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 5: Truncate column properties in the `object_modified_by_ddl` column
:   ```json
    {
      "objectDomain": "Table",
      "objectId": 20196,
      "objectName": "MY_DB.PUBLIC.T2",
      "operationType": "REPLACE",
      "properties": {
        "columns": "TRUNCATED",
      }
    }
    ```

Phase 6: Truncate column access information in the `provider_base_objects_accessed` column
:   ```json
    [
      {
        "objectDomain": "FUNCTION",
        "objectName": "GOVERNANCE.FUNCTIONS.RETURN_SUM",
        "objectId": "2",
        "argumentSignature": "(NUM1 NUMBER, NUM2 NUMBER)",
        "dataType": "NUMBER(38,0)"
      },
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED"
          }
        ],
        "objectDomain": "Table",
        "objectId": 66564,
        "objectName": "GOVERNANCE.TABLES.T1"
      }
    ]
    ```

Phase 7: Truncate column information in the `provider_policies_referenced` column
:   ```json
    [
      {
        "columns": [
          {
            "columnId": -1,
            "columnName": "TRUNCATED",
          }
        ],
        "objectDomain": "VIEW",
        "objectId": 66564,
        "objectName": "GOVERNANCE.VIEWS.V1",
        "policies": [
          {
            "policyName": "governance.policies.rap1",
            "policyId": 68813,
            "policyKind": "ROW_ACCESS_POLICY"
          }
        ]
      }
    ]
    ```

Phase 8: Replace information in columns with a single sentinel record
:   > As the last phase in the truncation process, Snowflake replaces all the information in a column with a single sentinel object. Snowflake replaces information in the following order:
    >
    > * `policies_referenced` column
    > * `objects_modified` column
    > * `base_objects_accessed` column
    > * `provider_base_objects_accessed` column (ORGANIZATION_USAGE schema only)
    > * `provider_policies_referenced` column (ORGANIZATION_USAGE schema only)

    The following is an example of a sentinel object found in a column:

    > ```json
    > [
    >   {
    >     "objectDomain": "TRUNCATED",
    >     "objectId": -1,
    >     "objectName": "TRUNCATED",
    >   }
    > ]
    > ```

---
title: ACCOUNTS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/accounts.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ACCOUNTS view

The ACCOUNTS view in the ORGANIZATION_USAGE schema can be used to obtain details about the accounts in an organization.

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_NAME | VARCHAR | User-defined name that identifies an account within the organization. |
| CREATED_ON | TIMESTAMP | Date and time when the account was created. |
| REGION | VARCHAR | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account is located. |
| EDITION | VARCHAR | [Snowflake Edition](../../user-guide/intro-editions.md) of the account. |
| IS_ORG_ADMIN | BOOLEAN | Indicates whether the [ORGADMIN role](../../user-guide/organization-administrators.md) is enabled in an account. |
| IS_LOCKED | BOOLEAN | Indicates whether the account is locked. To determine if it was locked because it was dropped, look for a date and time in the SCHEDULED_DELETION_TIME column. If an account is unexpectedly locked, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_URL | VARCHAR | Preferred Snowflake [account URL](../../user-guide/organizations-connect.md) that includes the values of organization_name and account_name. |
| ACCOUNT_OLD_URL | VARCHAR | If the original [account URL](../../user-guide/organizations-connect.md) was saved when the account was renamed, provides the original URL. If the original account URL was dropped, the value is NULL even if the account was renamed. |
| ACCOUNT_OLD_URL_LAST_USED | VARCHAR | If the original account URL was saved when the account was renamed, indicates the last time the account was accessed using the original URL. |
| ORGANIZATION_OLD_URL | VARCHAR | If the account’s organization was changed in a way that created a new [account URL](../../user-guide/organizations-connect.md) and the original account URL was saved, provides the original account URL. If the original account URL was dropped, the value is NULL even if the organization changed. |
| ORGANIZATION_OLD_URL_LAST_USED | VARCHAR | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, indicates the last time the account was accessed using the original account URL. |
| ACCOUNT_LOCATOR | VARCHAR | [System-assigned identifier](../../user-guide/admin-account-identifier.md) of the account. |
| MANAGED_ACCOUNTS | VARCHAR | Indicates how many [reader accounts](../../user-guide/data-sharing-reader-create.md) have been created by the account. |
| IS_MANAGED | BOOLEAN | Indicates whether the account is a reader account. If `true`, the account is a reader account. |
| PARENT_ACCOUNT | VARCHAR | For reader accounts, provides the name of the parent account that is providing the reader account to consumers. |
| CONSUMPTION_BILLING_ENTITY_NAME | VARCHAR | Name of the consumption billing entity associated with an account. |
| MARKETPLACE_CONSUMER_BILLING_ENTITY_NAME | VARCHAR | Name of the marketplace consumer billing entity associated with an account. |
| MARKETPLACE_PROVIDER_BILLING_ENTITY_NAME | VARCHAR | Name of the marketplace provider billing entity associated with an account. |
| ALTERED_ON | TIMESTAMP | Date and time of the most recent change to the account. |
| SCHEDULED_DELETION_TIME | TIMESTAMP | Date and time when a [dropped account](../../user-guide/organizations-manage-accounts-delete.md) will be permanently deleted. |
| DELETED_ON | TIMESTAMP | Date and time when the account was permanently deleted. |
| MOVED_ON | TIMESTAMP | Date and time when the account was moved from the current organization to a different one. |
| COMMENT | VARCHAR | Comment associated with the account. |
| IS_EVENTS_ACCOUNT | BOOLEAN | Indicates whether an account is an events account. For more information, see [Use logging and event tracing for an app](../../developer-guide/native-apps/event-about.md). |

## Usage notes

* Latency for the view may be up to 24 hours.
* Deleted accounts are removed from the view after one year.

---
title: ALERT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/alert_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ALERT_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view enables you to retrieve the history of [alert](../../user-guide/alerts.md) usage. The view displays one row for
each run of an alert in the history.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the alert. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the alert. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the alert. |
| ACTION | VARCHAR | The text of the SQL statement that serves as the action for the alert. |
| ACTION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the action of the alert. |
| CONDITION | VARCHAR | The text of the SQL statement that serves as the condition for the alert. |
| CONDITION_QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement executed as the condition of the alert. |
| ERROR_CODE | NUMBER | Error code, if the alert returned an error or failed to execute (e.g. if the current user did not have privileges to execute the alert). |
| ERROR_MESSAGE | VARCHAR | Error message, if the alert returned an error. |
| STATE | VARCHAR | Status of the alert. This can be one of the following:   * SCHEDULED: The alert will execute at the time specified by the SCHEDULED_TIME column. This status does not apply to   [alerts on new data](../../user-guide/alerts.md). * EXECUTING: The condition or action of the alert is currently executing. * FAILED: The alert failed. Either the alert condition or alert action encountered an error that prevented it from being   executed. * CANCELLED: The alert execution was cancelled (e.g. when the alert is suspended). * CONDITION_FALSE: The condition was evaluated successfully but returned no data. As a result, the action was not executed.   This status does not apply to [alerts on new data](../../user-guide/alerts.md). * CONDITION_FAILED: The evaluation of the condition failed. For details on the failure, check the ERROR_CODE and   ERROR_MESSAGE columns. * ACTION_FAILED: The condition was evaluated successfully, but the execution of the action failed. For details on the   failure, check the ERROR_CODE and ERROR_MESSAGE columns. * TRIGGERED: The condition was evaluated successfully, and the action was executed successfully. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the scheduled alert is/was scheduled to start running.  Note that we make a best effort to ensure absolute precision, but only guarantee that alerts do not execute *before* the scheduled time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the alert completed, or NULL if SCHEDULED_TIME is in the future or if the alert is still running. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database containing the schema. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema. |
| SCHEDULED_FROM | VARCHAR | Specifies what initiated the alert. The column contains one of the following values:   * `SCHEDULE`: The alert was scheduled to run normally, as described in SCHEDULE clause of   [CREATE ALERT](../sql/create-alert.md). * `EXECUTE ALERT`: The alert was scheduled to run using [EXECUTE ALERT](../sql/execute-alert.md). * `TRIGGER`: The [alert on new data](../../user-guide/alerts.md) was run because the underlying table or view   contains new data. |

## Usage notes

* Latency for the view may be up to 24 hours.

* For increased performance, filter queries on the COMPLETED_TIME or SCHEDULED_TIME column.

## Examples

Retrieve records for the 10 most recent completed alert runs:

> ```sqlexample
> SELECT account_name, name, condition, condition_query_id, action, action_query_id, state
>   FROM snowflake.organization_usage.alert_history
>   LIMIT 10;
> ```

Retrieve records for alert runs completed in the past hour:

> ```sqlexample
> SELECT account_name, name, condition, condition_query_id, action, action_query_id, state
> FROM snowflake.organization_usage.alert_history
> WHERE COMPLETED_TIME > DATEADD(hours, -1, CURRENT_TIMESTAMP());
> ```

---
title: ANOMALIES_IN_CURRENCY_DAILY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/anomalies_in_currency_daily.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ANOMALIES_IN_CURRENCY_DAILY view

This Organization Usage view provides insights into whether [cost anomalies](../../user-guide/cost-anomalies.md) occurred in accounts in the
organization.

Each row provides the consumption of an account on a specific day, and whether that consumption was a cost anomaly.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| DATE | DATE | Day in UTC when the consumption occurred. |
| ANOMALY_ID | VARCHAR | System-generated identifier. |
| IS_ANOMALY | BOOLEAN | If true, consumption has been identified as a cost anomaly because it has gone outside the range of the upper and lower bound. |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account where consumption occurred. |
| ACCOUNT_LOCATOR | VARCHAR | Account locator of the account where consumption occurred. |
| REGION | VARCHAR | Snowflake region where the account is located. |
| ACTUAL_VALUE | NUMBER | Amount of consumption measured in CURRENCY. |
| CURRENCY | VARCHAR | Unit of measure for the consumption. |
| UPPER_BOUND | NUMBER | Predicted highest level of consumption based on the anomaly-detecting algorithm, measured in CURRENCY. Consumption levels above this value are considered an anomaly. |
| LOWER_BOUND | NUMBER | Predicted lowest level of consumption based on the anomaly-detecting algorithm, measured in CURRENCY. Consumption levels below this value are considered an anomaly. |
| FORECASTED_VALUE | NUMBER | Predicted consumption based on the anomaly-detecting algorithm, measured in CURRENCY. |

## Usage notes

Latency for the view might be up to 8 hours.

---
title: AUTOMATIC_CLUSTERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/automatic_clustering_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# AUTOMATIC_CLUSTERING_HISTORY view

The AUTOMATIC_CLUSTERING_HISTORY view in the ORGANIZATION_USAGE schema
is used for querying the [Automatic Clustering](../../user-guide/tables-auto-reclustering.md) history for
your organization’s tables within a specified date range. The information
returned by the function includes the credits consumed, bytes updated, and rows
updated each time a table is reclustered.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization in which the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date when automatic clustering usage occurred. |
| CREDITS_USED | NUMBER | Number of credits billed for automatic clustering during the day specified by the USAGE_DATE value. |
| NUM_BYTES_RECLUSTERED | NUMBER | Number of bytes reclustered during the day specified by the USAGE_DATE value. |
| NUM_ROWS_RECLUSTERED | NUMBER | Number of rows reclustered during the day specified by the USAGE_DATE value. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the table. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the table. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the table. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the table. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).
* A row might be clustered multiple times, depending on data skew, clustering key distribution, and reordering required for micro-partitions. A large table with poor initial clustering might need multiple passes to reach an optimally clustered state. Therefore, the NUM_ROWS_RECLUSTERED value for a table could be as high as the total number of rows in the table or even higher.

---
title: BACKUP_OPERATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/backup_operation_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# BACKUP_OPERATION_HISTORY view

This Organization Usage view provides information on operations performed on backups.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The timestamp at which the backup operation started. |
| END_TIME | TIMESTAMP_LTZ | The timestamp at which the backup operation ended. |
| BACKUP_SET_ID | NUMBER | The local backup set ID. |
| BACKUP_ID | VARCHAR | The unique identifier of backup being worked on. |
| OPERATION_TYPE | VARCHAR | Could be either of the below operations:   * CREATE * EXPIRE * RESTORE * ADD_LEGAL_HOLD * REMOVE_LEGAL_HOLD |
| QUERY_ID | VARCHAR | Internal system-generated identifier for the SQL statement. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: BACKUP_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/backup_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# BACKUP_POLICIES view

This Organization Usage view provides information on backup policies.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the backup policy. |
| NAME | VARCHAR | Name of the backup policy. |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the backup policy. |
| SCHEMA_NAME | VARCHAR | Schema that the backup policy belongs to. |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the backup policy. |
| CATALOG_NAME | VARCHAR | Database that the backup policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for backup creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after backup creation when backup should be expired. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if policy has retention lock; N otherwise.  Retention lock protects backups from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the backup policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the backup policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the backup policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the backup policy was deleted. |
| COMMENT | VARCHAR | Comment for the backup policy. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: BACKUP_SETS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/backup_sets.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# BACKUP_SETS view

This Organization Usage view provides information on backup sets.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the backup set. |
| NAME | VARCHAR | Name of the backup set |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the backup set. |
| SCHEMA_NAME | VARCHAR | Schema that the backup set belongs to. |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the backup set. |
| CATALOG_NAME | VARCHAR | Database that the backup set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the backup set backs up. |
| OBJECT_ID | NUMBER | ID of object that the backup set backs up. |
| OBJECT_NAME | VARCHAR | Name of object that the backup set backs up. |
| OBJECT_SCHEMA_ID | NUMBER | ID of schema that contains the object that is backed up by this backup set. |
| OBJECT_SCHEMA_NAME | VARCHAR | Name of schema that contains the object that is backed up by this backup set. |
| OBJECT_CATALOG_ID | NUMBER | ID of database that contains the object that is backed up by this backup set. |
| OBJECT_CATALOG_NAME | VARCHAR | Name of database that contains the object that is backed up by this backup set. |
| BACKUP_POLICY_ID | NUMBER | ID of backup policy attached to this backup set. |
| BACKUP_POLICY_NAME | VARCHAR | Name of backup policy attached to this backup set. |
| BACKUP_POLICY_SCHEMA_ID | NUMBER | ID of the schema that contains the backup policy. |
| BACKUP_POLICY_SCHEMA_NAME | VARCHAR | Name of the schema that contains the backup policy. |
| BACKUP_POLICY_CATALOG_ID | NUMBER | ID of the database that contains the backup policy. |
| BACKUP_POLICY_CATALOG_NAME | VARCHAR | Name of the database that contains the backup policy. |
| OWNER | VARCHAR | Name of the role that owns the backup set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup set. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the backup set was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the backup set was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the backup set was deleted. |
| COMMENT | VARCHAR | Comment for the backup set. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: BACKUPS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/backups.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# BACKUPS view

This Organization Usage view provides information on backups.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the backup.  Note: this is not the local ID, this is the globally unique UUID of the Backup. |
| BACKUP_SET_ID | NUMBER | ID of backup set that contains the backup. |
| BACKUP_SET_NAME | VARCHAR | Name of backup set that contains the backup. |
| BACKUP_SET_SCHEMA_ID | NUMBER | ID of schema that the backup set belongs to. |
| BACKUP_SET_SCHEMA | VARCHAR | Name of schema that the backup set belongs to. |
| BACKUP_SET_CATALOG_ID | NUMBER | ID of database that the backup set belongs to. |
| BACKUP_SET_CATALOG | VARCHAR | Name of database that the backup set belongs to. |
| CREATED | TIMESTAMP_LTZ | Timestamp at which backup was created. |
| DELETED | TIMESTAMP_LTZ | Timestamp at which backup was deleted. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which backup will be expired. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | Y if backup is under legal hold; N otherwise. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: CLASS_INSTANCES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/class_instances.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# CLASS_INSTANCES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each instance of a class defined in the account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the instance. |
| NAME | VARCHAR | Name of the instance. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the instance. |
| SCHEMA_NAME | VARCHAR | Name of the schema the instance belongs to. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the instance. |
| DATABASE_NAME | VARCHAR | Name of the database the instance belongs to. |
| CLASS_ID | NUMBER | Internal/system-generated identifier for the class the instance is instantiated from. |
| CLASS_NAME | VARCHAR | Name of the class the instance is instantiated from. |
| CLASS_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the class the instance is instantiated from. |
| CLASS_SCHEMA_NAME | VARCHAR | Name of the schema of the class the instance is instantiated from. |
| CLASS_DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the class the instance is instantiated from. |
| CLASS_DATABASE_NAME | VARCHAR | Name of the database of the class the instance is instantiated from. |
| OWNER_NAME | VARCHAR | Name of the role that owns the instance. |
| OWNER_ROLE_TYPE | VARCHAR | The internal/system-generated identifier of the role that owns the instance of the class. |
| CREATED | TIMESTAMP_LTZ | Date and time when the instance was created. |
| DELETED | TIMESTAMP_LTZ | Date and time when the instance was deleted. |
| COMMENT | VARCHAR | Comment for the instance. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays the instances for which the current role for the session has been granted access privileges.

## Examples

The following example finds all instances of the [ANOMALY_DETECTION](../classes/anomaly_detection.md) class:

```sqlexample
SELECT ACCOUNT_NAME, NAME, DATABASE_NAME, SCHEMA_NAME, CLASS_NAME
  FROM snowflake.organization_usage.class_instances
  WHERE CLASS_NAME = 'ANOMALY_DETECTION';
```

The following example joins this view with [TABLES view](tables.md) on the INSTANCE_ID column to find the tables
that belong to each instance:

```sqlexample
SELECT a.TABLE_NAME,
       b.NAME AS instance_name,
       b.CLASS_NAME
  FROM SNOWFLAKE.ORGANIZATION_USAGE.TABLES a
  JOIN SNOWFLAKE.ORGANIZATION_USAGE.CLASS_INSTANCES b
  ON a.INSTANCE_ID = b.ID
  WHERE b.DELETED IS NULL;
```

---
title: CLASSES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/classes.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# CLASSES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each [class](../snowflake-db-classes.md)
in the account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the class. |
| NAME | VARCHAR | Name of the class. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the class. |
| SCHEMA_NAME | VARCHAR | Name of the schema the class belongs to. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the class. |
| DATABASE_NAME | VARCHAR | Name of the database the class belongs to. |
| OWNER_NAME | VARCHAR | Name of the role that owns the class. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the class was created. |
| DELETED | TIMESTAMP_LTZ | Date and time when the class was deleted. |
| COMMENT | VARCHAR | Comment for the class. |

## Usage notes

Latency for the view may be up to 24 hours.

## Examples

The following example finds all classes in the account:

```sqlexample
SELECT account_name, name, database_name, schema_name
  FROM snowflake.organization_usage.classes;
```

---
title: COLUMNS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/columns.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# COLUMNS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each column in the tables defined in an account.

See also:
:   [DATABASES view](databases.md)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column | Data Type | Description |
| --- | --- | --- |
| COLUMN_ID | NUMBER | Internal/system-generated identifier for the column. |
| COLUMN_NAME | TEXT | Name of the column. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table or view for the column. |
| TABLE_NAME | TEXT | Table or view that the column belongs to. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the table or view for the column. |
| TABLE_SCHEMA | TEXT | Schema that the table or view belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table or view for the column. |
| TABLE_CATALOG | TEXT | Database that the table or view belongs to. |
| ORDINAL_POSITION | NUMBER | Ordinal position of the column in the table/view. |
| COLUMN_DEFAULT | TEXT | Default value of the column. |
| IS_NULLABLE | TEXT | Whether the column allows NULL values. |
| DATA_TYPE | TEXT | Data type of the column.  This column shows the standard Snowflake data type of the column. The DATA_TYPE_ALIAS column displays the original data type name that was specified for the column when the table was created, or when the column was altered. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string columns. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string columns. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric columns. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric columns. |
| NUMERIC_SCALE | NUMBER | Scale of numeric columns. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | TEXT | Not applicable for Snowflake. |
| INTERVAL_PRECISION | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | TEXT | Not applicable for Snowflake. |
| COLLATION_CATALOG | TEXT | Not applicable for Snowflake. |
| COLLATION_SCHEMA | TEXT | Not applicable for Snowflake. |
| COLLATION_NAME | TEXT | Not applicable for Snowflake. |
| DOMAIN_CATALOG | TEXT | Not applicable for Snowflake. |
| DOMAIN_SCHEMA | TEXT | Not applicable for Snowflake. |
| DOMAIN_NAME | TEXT | Not applicable for Snowflake. |
| UDT_CATALOG | TEXT | Not applicable for Snowflake. |
| UDT_SCHEMA | TEXT | Not applicable for Snowflake. |
| UDT_NAME | TEXT | Not applicable for Snowflake. |
| SCOPE_CATALOG | TEXT | Not applicable for Snowflake. |
| SCOPE_SCHEMA | TEXT | Not applicable for Snowflake. |
| SCOPE_NAME | TEXT | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | TEXT | Not applicable for Snowflake. |
| DTD_IDENTIFIER | TEXT | Not applicable for Snowflake. |
| IS_SELF_REFERENCING | TEXT | Not applicable for Snowflake. |
| IS_IDENTITY | TEXT | Whether the column is an identity column. |
| IDENTITY_GENERATION | TEXT | Whether an identity column’s value is always generated or only generated by default. Snowflake only supports `BY DEFAULT`. |
| IDENTITY_START | TEXT | Not applicable for Snowflake. |
| IDENTITY_INCREMENT | TEXT | Not applicable for Snowflake. |
| IDENTITY_MAXIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_MINIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_CYCLE | TEXT | Whether the value of an identity column allows cycling. Snowflake only supports `NO CYCLE`. |
| IDENTITY_ORDERED | TEXT | If `YES`, the column is an identity column and has the ORDER property. If `NO`, the column is an identity column and has the NOORDER property. |
| SCHEMA_EVOLUTION_RECORD | TEXT | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY, SNOWPIPE, or SNOWPIPE_STREAMING). * FileName: The file name that triggered the evolution (NULL for SNOWPIPE_STREAMING). * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeId: A unique identifier of the triggering query or pipe (QUERY ID for COPY, PIPE ID for SNOWPIPE, or NULL for SNOWPIPE_STREAMING). * Pipe name: Fully qualified pipe name that triggered schema evolution (SNOWPIPE_STREAMING only). * Channel name: Channel that triggered schema evolution (SNOWPIPE_STREAMING only). * offsetTokenUpperBound: An offset at or before which schema evolution was triggered (SNOWPIPE_STREAMING only). |
| COMMENT | TEXT | Comment for the column. |
| DELETED | TIMESTAMP_LTZ | Date and time when the column was deleted. |
| DATA_TYPE_ALIAS | TEXT | The data type alias or synonym specified for the column when the table was created or when the column was last altered.  For example, the BIGINT type is synonymous with the NUMBER type. If BIGINT was specified as the type for a column, then BIGINT is displayed in this DATA_TYPE_ALIAS column.  For columns in tables that were created before the [2025_07 behavior change bundle](../../release-notes/bcr-bundles/2025_07_bundle.md) was enabled, and not altered after the behavior change, the value in this column is NULL. For more information, see [COLUMNS view (multiple schemas): New column](../../release-notes/bcr-bundles/2025_07/bcr-2061.md). |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.

## Examples

The following example retrieves all columns in the `myTable` table defined in the `mydb` database:

```sqlexample
SELECT *
  FROM snowflake.organization_usage.columns
  WHERE
    table_catalog = 'mydb' AND
    table_name = 'myTable' AND
    deleted IS NULL;
```

---
title: COMPLETE_TASK_GRAPHS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/complete_task_graphs.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# COMPLETE_TASK_GRAPHS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

You can use the Organization Usage view to query the status of completed *graph* runs, such as runs that executed successfully, failed, or
were cancelled. A graph is currently defined as a single scheduled task or a [task graph](../../user-guide/tasks-graphs.md) composed of a
scheduled root task and one or more child tasks. For the purposes of this function, *root task* refers to either the single scheduled task
or the root task in a task graph.

The view avoids the 10,000 row limitation of the [COMPLETE_TASK_GRAPHS](../functions/complete_task_graphs.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_NAME | TEXT | Name of the root task. |
| DATABASE_NAME | TEXT | Name of the database that contains the graph. |
| SCHEMA_NAME | TEXT | Name of the schema that contains the graph. |
| STATE | TEXT | State of the graph run:   * `SUCCEEDED`: All tasks in the graph ran successfully to completion, or the root task run succeeded and one or more child task runs were skipped. * `FAILED`: One or more task runs in the graph failed, or the root task run succeeded and one or more child task runs failed. * `CANCELLED`: One or more task runs in the graph were cancelled, or the root task run succeeded and one or more child task runs were cancelled.   Note that if the state of the root task run is SKIPPED, the function does not return a row for the run. |
| SCHEDULED_FROM | TEXT | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| FIRST_ERROR_TASK_NAME | TEXT | Name of the first task in the graph that returned an error; returns NULL if no task produced an error. |
| FIRST_ERROR_CODE | NUMBER | Error code of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| FIRST_ERROR_MESSAGE | TEXT | Error message of the error returned by the task named in FIRST_ERROR_TASK_NAME; returns NULL if no task produced an error. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the root task was scheduled to start running. Note that we make a best effort to ensure absolute precision, but only guarantee that tasks do not execute *before* the scheduled time. |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the root task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| NEXT_SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the standalone or root task (in a [DAG](../../user-guide/tasks-graphs.md) of tasks) is next scheduled to start running, assuming the current run of the standalone task or [DAG](../../user-guide/tasks-graphs.md) started at the SCHEDULED_TIME time completes in time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the last task in the [DAG](../../user-guide/tasks-graphs.md) was completed. |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a [DAG](../../user-guide/tasks-graphs.md). This ID matches the ID column value in the SHOW TASKS output for the same task. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the [DAG](../../user-guide/tasks-graphs.md) that was run, or is scheduled to be run. |
| RUN_ID | NUMBER | Time when the standalone or root task in a [DAG](../../user-guide/tasks-graphs.md) is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system may reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run prior to retry. You may use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| CONFIG | TEXT | Displays the graph level configuration used during the graph run if explicitly set. Otherwise displays NULL. |
| GRAPH_RUN_GROUP_ID | TEXT | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.

## Examples

Retrieve records for the 10 most recent task graph runs completed in your organization:

```sqlexample
SELECT account_name, root_task_name, state
FROM snowflake.organization_usage.complete_task_graphs
  LIMIT 10;
```

---
title: CONTRACT_ITEMS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/contract_items.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# CONTRACT_ITEMS view

The CONTRACT_ITEMS view in the ORGANIZATION_USAGE schema can be used to return the contract information for an organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| CONTRACT_NUMBER | VARCHAR | Snowflake contract number for the organization. |
| START_DATE | DATE | The start date for the Snowflake contract or the date the CONTRACT_ITEM goes into effect for the organization. |
| END_DATE | DATE | The end date for the Snowflake contract or the date the CONTRACT_ITEM stops being used for the organization. |
| EXPIRATION_DATE | DATE | The expiration date for the Snowflake contract or the date after which either the Renewal Contract goes into effect if signed within 30 days or the Snowflake relationship is terminated. |
| CONTRACT_ITEM | VARCHAR | One of capacity, additional capacity, or free usage. |
| CURRENCY | VARCHAR | The currency for the CONTRACT_ITEM. |
| AMOUNT | NUMBER (38,2) | The amount for the CONTRACT_ITEM measured in CURRENCY, not credits. |
| CONTRACT_MODIFIED_DATE | DATE | The date (in the UTC timezone) the CONTRACT_ITEM was last modified. |

## Usage notes

* Latency for the view may be up to 24 hours.
* If multiple organizations draw down from the same capacity contract, only the primary organization can access this view. The primary
  organization is also known as the funding organization.
* This view shows only the active contract for the organization.
* Customers who signed a contract through a Snowflake reseller cannot access data in this view.
* Data is retained indefinitely.
* This view does not include data generated prior to June 2020. To obtain data before this date, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: COPY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/copy_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# COPY_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

The view displays load activity for both [COPY INTO <table>](../sql/copy-into-table.md) statements and continuous data loading using
[Snowpipe](../../user-guide/data-load-snowpipe-intro.md). The view avoids the 10,000 row limitation of
the [LOAD_HISTORY view](../info-schema/load_history.md).

You can also view data loading details in Snowsight. See [Monitor data loading activity by using Copy History](../../user-guide/data-load-monitor.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_NAME | VARCHAR | Name of the source file and relative path to the file. |
| STAGE_LOCATION | VARCHAR | Name of the stage where the source file is located. |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Date and time of when the file finished loading. |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file. |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file; `NULL` if STATUS is `Load in progress`. |
| FILE_SIZE | NUMBER | Observed size of the source file in the internal or external stage before it loads. If the file is compressed, this shows the compressed size. If the file is uncompressed, this shows the uncompressed size. |
| FIRST_ERROR_MESSAGE | VARCHAR | First error of the source file. |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error. |
| FIRST_ERROR_CHARACTER_POS | NUMBER | Position of the first error character. |
| FIRST_ERROR_COLUMN_NAME | VARCHAR | Column name of the first error. |
| ERROR_COUNT | NUMBER | Number of error rows in the source file. |
| ERROR_LIMIT | NUMBER | If the number of errors reaches this limit, then abort. |
| STATUS | VARCHAR | Status: `Loaded`, `Load failed`, `Partially loaded`, or `Load skipped`. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the target table. |
| TABLE_NAME | VARCHAR | Name of the target table.TABLE_NAME |
| TABLE_SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the table. |
| TABLE_SCHEMA_NAME | VARCHAR | Name of the schema in which the target table resides. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table. |
| TABLE_CATALOG_NAME | VARCHAR | Name of the database in which the target table resides. |
| PIPE_CATALOG_NAME | VARCHAR | Name of the database in which the pipe resides. |
| PIPE_SCHEMA_NAME | VARCHAR | Name of the schema in which the pipe resides. |
| PIPE_NAME | VARCHAR | Name of the pipe defining the load parameters; `NULL` for COPY statement loads. |
| PIPE_RECEIVED_TIME | TIMESTAMP_LTZ | Date and time when the INSERT request for the file loaded through the pipe was received; `NULL` for COPY statement loads. |
| FIRST_COMMIT_TIME | TIMESTAMP_LTZ | Date and time when the first chunk of the file is committed. Snowpipe may load a file in multiple chunks that are separately committed. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Usage notes

* In most cases, latency for the view may be up to 24 hours. The latency for a given table’s copy history may be up to 2 days
  if both of the following conditions are true:

  + Fewer than 32 DML statements have been added to the given table since it was last updated in COPY_HISTORY.
  + Fewer than 100 rows have been added to the given table since it was last updated in COPY_HISTORY.

* The view only includes COPY INTO commands that executed to completion, with or without errors.
* Dropping or recreating a table object removes the load history metadata for bulk data load deduplication (COPY INTO *<table>* statements) into the table.
* Renaming a table object updates the corresponding TABLE_NAME entries in the copy history.
* Dropping or recreating a pipe object doesn’t remove the load history metadata for the pipe.
* The view only displays objects for which the current role for the session has been granted access privileges.
* After the replication of copy history, the COPY_HISTORY Account Usage view shows the history only after the latest truncate operation on the target table. This is different from the view without replication, which shows a complete copy history.

## Examples

Retrieve records for the 10 most recent COPY INTO commands executed:

```sqlexample
SELECT account_name, file_name, error_count, status, last_load_time
  FROM snowflake.organization_usage.copy-history
  ORDER BY last_load_time desc
  LIMIT 10;
```

---
title: DATA_TRANSFER_DAILY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/data_transfer_daily_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# DATA_TRANSFER_DAILY_HISTORY view

The DATA_TRANSFER_DAILY_HISTORY view in the ORGANIZATION_USAGE schema can be used to query the history of data transferred from Snowflake tables into a different cloud storage provider’s network (i.e. from Snowflake on Amazon Web Services (AWS), Google Cloud Platform, or Microsoft Azure into the other cloud provider’s network) and/or geographical region within the last 365 days (1 year).

The view includes the history of data transfer for all accounts in your Snowflake organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Either `DATA_TRANSFER` or [INTERNAL_DATA_TRANSFER](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). |
| ORGANIZATION_NAME | VARCHAR | Name of the organization . |
| ACCOUNT_NAME | VARCHAR | Name of the account. |
| USAGE_DATE | DATE | Date (in the UTC time zone) in which the usage took place. |
| TB_TRANSFERED | FLOAT | Number of terabytes transferred during the USAGE_DATE. |
| REGION | VARCHAR | ID of the Snowflake Region where the account is located. |
| ACCOUNT_LOCATOR | VARCHAR | Account locator for the account. |

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: DATA_TRANSFER_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/data_transfer_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# DATA_TRANSFER_HISTORY view

The DATA_TRANSFER_HISTORY view in the ORGANIZATION_USAGE schema can be
used to query the history of data transferred from Snowflake tables into a
different cloud storage provider’s network (i.e. from Snowflake on AWS, Google
Cloud Platform, or Microsoft Azure into another cloud provider’s network)
and/or geographical region within a specified date range. The function returns
the history for your entire Snowflake organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this transfer history record. |
| SOURCE_CLOUD | VARCHAR | Name of the cloud provider for the platform where the data transfer originated: Amazon Web Services (AWS), Google Cloud Platform, or Microsoft Azure. |
| SOURCE_REGION | VARCHAR | Region where the data transfer originated. |
| TARGET_CLOUD | VARCHAR | Name of the cloud provider for the platform where the data was sent: AWS, Google Cloud Platform, or Microsoft Azure. |
| TARGET_REGION | VARCHAR | Region where the data was sent. |
| BYTES_TRANSFERRED | VARIANT | Number of bytes transferred during the usage date. |
| TRANSFER_TYPE | VARCHAR | Type of operation that caused the transfer. [COPY](../sql/copy-into-location.md), [COPY_FILES](../sql/copy-files.md), [DATA_LAKE](../../user-guide/tables-iceberg.md), [REPLICATION](../../user-guide/account-replication-intro.md), [EXTERNAL_FUNCTION](../external-functions.md), [INTERNAL](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: DATABASE_STORAGE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/database_storage_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# DATABASE_STORAGE_USAGE_HISTORY view

The DATABASE_STORAGE_USAGE_HISTORY view in the ORGANIZATION_USAGE schema
can be used to query the average daily storage usage, in bytes, for all the
databases in your organization within a specified date range. The results
include:

* All data stored in tables and materialized views in the database(s).
* All historical data maintained in Fail-safe for the database(s).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this storage usage record. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| AVERAGE_DATABASE_BYTES | FLOAT | Number of bytes of database storage used, including bytes currently in Time Travel. |
| AVERAGE_FAILSAFE_BYTES | FLOAT | Number of bytes of Fail-safe storage used. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DELETED | TIMESTAMP_LTZ | Date and time when the database was dropped; NULL for active databases. |
| AVERAGE_HYBRID_TABLE_STORAGE_BYTES | FLOAT | Number of bytes of hybrid table storage used (data in the row store). |
| AVERAGE_ARCHIVE_STORAGE_COOL_BYTES | FLOAT | Average number of bytes (including active bytes, time travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md)) of table storage used in the COOL storage tier. |
| AVERAGE_ARCHIVE_STORAGE_COLD_BYTES | FLOAT | Average number of bytes (including active bytes, time travel bytes, and bytes subject to [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md)) of table storage used in the COLD storage tier. |
| AVERAGE_COOL_FAILSAFE_BYTES | FLOAT | Average number of bytes of Fail-safe storage used in the COOL storage tier. |
| AVERAGE_COLD_FAILSAFE_BYTES | FLOAT | Average number of bytes of Fail-safe storage used in the COLD storage tier. |

## Usage notes

Latency for the view may be up to 24 hours (1 day).

> **Note:**
>
> With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
> this view includes new columns for storage lifecycle policies.
> To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
> in your account.
>
> To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
> execute the following statement:
>
> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
> ```

---
title: DATABASES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/databases.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# DATABASES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each database defined in an account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| DATABASE_OWNER | VARCHAR | Name of the role that owns the database. |
| IS_TRANSIENT | VARCHAR | Whether the database is transient. |
| COMMENT | VARCHAR | Comment for the database. |
| CREATED | TIMESTAMP_LTZ | Date and time when the database was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the database was dropped. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| RESOURCE_GROUP | VARCHAR | For internal use. |
| TYPE | VARCHAR | Specifies the type of database. Valid values are: . . - APPLICATION: a Snowflake Native App. . - APPLICATION_PACKAGE: an application package. . - STANDARD: a normal database. . - IMPORTED DATABASE: a database created from a share. . - PERSONAL DATABASE: a personal database linked to its owner. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OBJECT_VISIBILITY | OBJECT | `OBJECT_VISIBILITY`  [Preview Feature](../../release-notes/preview-features.md) — Open  Available to all accounts.  This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account, enabling users without explicit access privileges to find objects and request access. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view displays all of the databases in an account.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FEATURE_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/feature_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# FEATURE_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view provides the
[feature policies](../../developer-guide/native-apps/ui-consumer-feature-policies.md) in your organization.

Each row in this view corresponds to a different feature policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the feature policy. |
| NAME | TEXT | Name of the feature policy. |
| SCHEMA_ID | TEXT | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | TEXT | Schema to which the feature policy belongs. |
| DATABASE_ID | TEXT | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | TEXT | Database to which the feature policy belongs. |
| OWNER | TEXT | Name of the role that owns the feature policy. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example ROLE. If a Snowflake Native App owns the object, the value is APPLICATION. Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| BLOCKED_OBJECT_TYPES_FOR_CREATION | TEXT | A comma-separated list of object types that the feature policy blocks for creation. See [Feature Policies](../../developer-guide/native-apps/ui-consumer-feature-policies.md) for more information. |
| COMMENT | TEXT | Comments entered for the feature policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the feature policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. |
| DELETED | TIMESTAMP_LTZ | Date and time when the feature policy was dropped. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FILE_FORMATS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/file_formats.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# FILE_FORMATS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each file format defined in an account.

File formats are named objects that can be used for loading/unloading data. For more information, see [CREATE FILE FORMAT](../sql/create-file-format.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_FORMAT_ID | NUMBER | Internal/system-generated identifier for the file format. |
| FILE_FORMAT_NAME | VARCHAR | Name of the file format, |
| FILE_FORMAT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the file format. |
| FILE_FORMAT_SCHEMA | VARCHAR | Schema that the file format belongs to. |
| FILE_FORMAT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the file format. |
| FILE_FORMAT_CATALOG | VARCHAR | Database that the file format belongs to. |
| FILE_FORMAT_OWNER | VARCHAR | Name of the role that owns the file format. |
| FILE_FORMAT_TYPE | VARCHAR | File format type of the file format (`CSV`, `JSON`, etc.). |
| RECORD_DELIMITER | VARCHAR | Character that separates records. |
| FIELD_DELIMITER | VARCHAR | Character that separates fields. |
| SKIP_HEADER | NUMBER | Number of lines skipped at the start of the file. |
| DATE_FORMAT | VARCHAR | Date format. |
| TIME_FORMAT | VARCHAR | Time format. |
| TIMESTAMP_FORMAT | VARCHAR | Timestamp format. |
| BINARY_FORMAT | VARCHAR | Binary format. |
| ESCAPE | VARCHAR | String used as the escape character for any field values. |
| ESCAPE_UNENCLOSED_FIELD | VARCHAR | String used as the escape character for unenclosed field values. |
| TRIM_SPACE | BOOLEAN | Whether whitespace is removed from fields. |
| FIELD_OPTIONALLY_ENCLOSED_BY | VARCHAR | Character used to enclose strings. |
| NULL_IF | VARCHAR | A list of strings to be replaced by null. |
| COMPRESSION | VARCHAR | Compression method for the data file. |
| ERROR_ON_COLUMN_COUNT_MISMATCH | VARCHAR | Whether to generate a parsing error if the number of fields in an input file does not match the number of columns in the corresponding table. |
| CREATED | TIMESTAMP_LTZ | Date and time when the file format was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the file format was dropped. |
| COMMENT | VARCHAR | Comment for the file format. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FUNCTIONS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/functions.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# FUNCTIONS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each user-defined function (UDF) defined in an account.

For more information about UDFs, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| FUNCTION_ID | NUMBER | Internal/system-generated identifier for the UDF. |
| FUNCTION_NAME | VARCHAR | Name of the UDF. |
| FUNCTION_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the UDF. |
| FUNCTION_SCHEMA | VARCHAR | Schema which the UDF belongs to. |
| FUNCTION_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the UDF. |
| FUNCTION_CATALOG | VARCHAR | Database which the UDF belongs to. |
| FUNCTION_OWNER | VARCHAR | Name of the role that owns the UDF. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the UDF’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric return value. |
| FUNCTION_LANGUAGE | VARCHAR | Language of the UDF. |
| FUNCTION_DEFINITION | VARCHAR | UDF definition. |
| VOLATILITY | VARCHAR | Whether the UDF is volatile or immutable. |
| IS_NULL_CALL | VARCHAR | Whether the UDF is called when input is null. |
| CREATED | TIMESTAMP_LTZ | Date and time when the UDF was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the UDF was dropped. |
| COMMENT | VARCHAR | Comment for the function. |
| IS_EXTERNAL [1] | VARCHAR(3) | `YES` if the function is an [external function](../external-functions.md); otherwise, `NO`. |
| API_INTEGRATION [1] | VARCHAR | Name of the API integration object to authenticate the call to the proxy service. |
| CONTEXT_HEADERS [1] | VARCHAR | Context header information for the external function. |
| MAX_BATCH_ROWS [1] | NUMBER | Maximum number of rows in each batch sent to the proxy service. |
| COMPRESSION [1] | VARCHAR | Type of compression. |
| PACKAGES | VARCHAR | Packages requested by the function. |
| RUNTIME_VERSION | VARCHAR | Runtime version of the language used by the function. NULL if the function is SQL or JavaScript. |
| INSTALLED_PACKAGES | VARCHAR | All packages installed by the function. Output for Python functions only. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| IS_MEMOIZABLE | VARCHAR(3) | `YES` if the function is [memoizable](../../developer-guide/udf/sql/udf-sql-scalar-functions.md); otherwise, `NO`. |
| IS_DATA_METRIC | VARCHAR(3) | `YES` if the function is a [data metric function](../../user-guide/data-quality-intro.md); otherwise, `NO`. |
| SECRETS | JSON map | Map of [secrets](../sql/create-secret.md) specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| EXTERNAL_ACCESS_INTEGRATIONS | VARCHAR | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter. |
| IS_AGGREGATE | VARCHAR(3) | `YES` if the function is an aggregate function; otherwise, `NO`. |

[1]
(1,2,3,4,5)

These fields apply only to [Writing external functions](../external-functions.md).

## Usage notes

* Latency for the view can be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: GRANTS_TO_ROLES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/grants_to_roles.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# GRANTS_TO_ROLES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query access control privileges that have been granted to an account role, application,
application role, database role, instance role, or user.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is granted to the role. |
| MODIFIED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is updated. |
| PRIVILEGE | VARCHAR | Name of the privilege added to the role. |
| GRANTED_ON | VARCHAR | Object kind, such as `TABLE` or `DATABASE`, on which the privilege is granted. |
| NAME | VARCHAR | Name of the object on which the privilege is granted. |
| TABLE_CATALOG | VARCHAR | Name of the database for the current table or the name of the database that stores the instance of a class. |
| TABLE_SCHEMA | VARCHAR | Name of the schema for the current table or the name of the schema that stores the instance of a class. |
| GRANTED_TO | VARCHAR | `ACCOUNT ROLE`, `APPLICATION`, `APPLICATION_ROLE`, `DATABASE_ROLE`, `INSTANCE_ROLE`, or `USER`. |
| GRANTEE_NAME | VARCHAR | Identifier for the recipient role, the role to which the privilege is granted, or the name of the Snowflake Native App object. |
| GRANT_OPTION | BOOLEAN | `TRUE / FALSE`. If set to `TRUE`, the recipient role can grant the privilege to other roles. |
| GRANTED_BY | VARCHAR | Indicates the role that authorized a privilege grant to the grantee. `GRANTED_BY` displays empty for privileges granted by the SNOWFLAKE system role. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the privilege is revoked. |
| GRANTED_BY_ROLE_TYPE | VARCHAR | Either `APPLICATION`, `ROLE` or `DATABASE_ROLE`. |
| OBJECT_INSTANCE | VARCHAR | The fully-qualified name of the object that contains the instance role for a particular class in the format `database.schema.class`. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The GRANTS_TO_ROLES view shows a subset of all supported objects. The supported set is subject to change. The view is updated periodically
  to include support for new objects.
* The view does not contain grants to database roles from databases created from shares.
* The view does not contain grants on dropped objects.
* The `GRANTED_BY` column indicates the role that authorized a privilege grant to the grantee. The authorization role is known as the
  *grantor*.

  When you grant privileges on an object to a role using [GRANT <privileges> … TO ROLE](../sql/grant-privilege.md), the following authorization rules
  determine which role is listed as the grantor of the privilege:

  1. If an [active role](../../user-guide/security-access-control-overview.md) is the object owner (i.e. has the OWNERSHIP privilege on the
     object), that role is the grantor.
  2. If an active role was given privileges on the object by a GRANT PRIVILEGE … WITH GRANT OPTION statement, then the active role is the
     grantor. If multiple active roles meet this criterion and one of these active roles is the primary role, then the primary role is the
     grantor. If there are multiple active roles, and none of them are the primary role, Snowflake randomly selects one of the roles as the
     grantor.
  3. If an active role holds the global MANAGE GRANTS privilege, the grantor role is the object owner, not the role that held the
     MANAGE GRANTS privilege. That is, the MANAGE GRANTS privilege allows a role to impersonate the object owner for the purposes of
     granting privileges on that object.

  The `GRANTED_BY` column displays empty for privileges granted by the Snowflake SYSTEM role. Certain internal operations are
  performed with this role. Grants of privileges authorized by the SYSTEM role cannot be modified by customers.

For more information about how to use this view, see [Monitoring access control privileges in your account](../../user-guide/security-access-control-considerations.md).

---
title: GRANTS_TO_USERS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/grants_to_users.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# GRANTS_TO_USERS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query the roles that have been granted to a user.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the role is granted. |
| DELETED_ON | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the role is revoked. |
| ROLE | VARCHAR | Identifier for the role granted to the user. |
| GRANTED_TO | VARCHAR | For this view, the value is `USER`. |
| GRANTEE_NAME | VARCHAR | Name of the user to whom the privilege is granted. |
| GRANTED_BY | VARCHAR | Identifier for the role that granted the privilege. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The GRANTS_TO_USERS view **does not** include grants of privileges and non-account roles to users. For that information, see the
  [GRANTS_TO_ROLES view](grants_to_roles.md).
* This view records current grants and historical grants, including grants that were revoked and granted again. When a single grant occurs
  and as long as it remains active (that is, not revoked):

  + The view includes one row for the grant of the same role to the same user.
  + A regrant of the same role to the same user is not recorded as a new row. Instead, the DELETED_ON column remains NULL while the grant
    is active.
* When a grant is revoked from the user, the DELETED_ON column for the grant is updated from NULL to the timestamp when the grant was
  revoked.
* After revoking the role from the user, a grant of the same role to the same user is recorded in a new row. In this new row, the
  DELETED_ON column value is NULL because the grant is now active.

---
title: LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view

This view in the ORGANIZATION_USAGE schema can be used to estimate the costs associated with
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md).

When a data product is fulfilled to a region, Snowflake uses a managed account associated with your provider account, called a
*secure share area*, to store the data product in each region with consumer demand. Your provider account incurs costs associated
with the secure share areas in other regions.

For more details, see [Auto-fulfillment costs](../../collaboration/provider-understand-cost-auto-fulfillment.md).

> **Note:**
>
> Because this view provides estimated values, the usage and currency values might not match the values in the
> [USAGE_IN_CURRENCY_DAILY view](usage_in_currency_daily.md) or your usage statement.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| CONTRACT_NUMBER | VARCHAR | Snowflake contract number for the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the secure share area where the usage occurred. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the secure share area where the usage occurred. |
| REGION | VARCHAR | Name of the region where the secure share area is located. |
| SERVICE_LEVEL | VARCHAR | Service level (edition) of the secure share area. See [Snowflake editions](../../user-guide/intro-editions.md). |
| USAGE_DATE | DATE | Date (in UTC format) in which the secure share area usage took place. |
| SERVICE_TYPE | VARCHAR | Can be one of:   * DATA_TRANSFER * REPLICATION * STORAGE |
| CURRENCY | VARCHAR | Currency of the usage. |
| ESTIMATED_USAGE | NUMBER (38,9) | Estimated amount of usage to be charged based on SERVICE_TYPE. Units of USAGE depend on the SERVICE_TYPE. For example, when the SERVICE_TYPE is REPLICATION, USAGE is measured in credits. When the SERVICE_TYPE is DATA_TRANSFER or STORAGE, USAGE is measured in terabytes. |
| ESTIMATED_USAGE_IN_CURRENCY | NUMBER (38,9) | Estimated amount to be charged for the SERVICE_TYPE for USAGE on the USAGE_DATE. |
| PROVIDER_ACCOUNT_REGION | VARCHAR | Name of the region where the provider account that shared the data product is located. If NULL, the usage could not be attributed to a specific provider account. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | Name of the provider account that shared the data product that incurred the usage in the secure share area. If NULL, the usage could not be attributed to a specific provider account. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | Locator for the provider account that shared the data product that incurred the usage in the secure share area. If NULL, the usage could not be attributed to a specific provider account. |

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: LOAD_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/load_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# LOAD_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view enables you to retrieve the history of data loaded into tables using the
[COPY INTO <table>](../sql/copy-into-table.md) command. The view displays one row for each file loaded.

> **Note:**
>
> This view does not return the history of data loaded using Snowpipe. For this historical information, query
> the [COPY_HISTORY](copy_history.md) view instead.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the target table |
| TABLE_NAME | VARCHAR | Name of target table |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the target table |
| SCHEMA_NAME | VARCHAR | Schema of target table |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the target table |
| CATALOG_NAME | VARCHAR | Database of target table |
| FILE_NAME | VARCHAR | Name of source file |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Date and time (in the UTC time zone) of the load record |
| STATUS | VARCHAR | Status: `LOADED`, `LOAD FAILED`, or `PARTIALLY LOADED` |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file |
| FIRST_ERROR_MESSAGE | VARCHAR | First error of the source file |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error |
| FIRST_ERROR_CHARACTER_POSITION | NUMBER | Position of the first error character |
| FIRST_ERROR_COL_NAME | VARCHAR | Column name of the first error |
| ERROR_COUNT | NUMBER | Number of error rows in the source file |
| ERROR_LIMIT | NUMBER | If the number of error reach this limit, then abort |

## Usage notes

* In most cases, latency for the view may be up to 24 hours. The latency for a given table’s load history in the view may be up to 2 days
  if both of the following conditions are true:

  + Fewer than 32 DML statements have been added to the given table since it was last updated in LOAD_HISTORY.
  + Fewer than 100 rows have been added to the given table since it was last updated in LOAD_HISTORY.

* The view only includes COPY INTO commands that executed to completion, with or without errors. No record is added if the transaction is rolled back, for example, or if the ON_ERROR = ABORT_STATEMENT copy option is included in the COPY INTO *<table>* statement and a detected error in a data file aborts the load operation.
* When including a WHERE clause that references the `LAST_LOAD_TIME` column, you can specify any day of the week. For example, April 1, 2016 was a Friday; however, specifying Sunday instead does not
  affect the query results:

  ```sqlexample
  WHERE last_load_time > 'Sun, 01 Apr 2016 16:00:00 -0800'
  ```

* After the replication of load history, the LOAD_HISTORY Organization Usage view shows the history only after the latest truncate operation
  on the target table. This is different from the view without replication, which shows a complete data loading history.

## Examples

Retrieve records for the 10 most recent COPY INTO commands executed:

> ```sqlexample
> SELECT account_name, file_name, last_load_time
> FROM snowflake.organization_usage.load_history
>   ORDER BY last_load_time DESC
>   LIMIT 10;
> ```

---
title: LOCK_WAIT_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/lock_wait_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# LOCK_WAIT_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view includes the history of [transactions](../transactions.md) that wait on locks.
For details, see [Analyzing blocked transactions with the LOCK_WAIT_HISTORY view](../transactions.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| OBJECT_ID | NUMBER | Internal/system-generated identifier for the blocking object (such as a table) on which the transaction is waiting for a lock. |
| --- | --- | --- |
| LOCK_TYPE | VARCHAR | Type of lock. Valid values are `PARTITION`, `STREAM`, `TABLE`, and `ROW`. `ROW` is shown for hybrid table locks. |
| OBJECT_NAME | VARCHAR | Identifier for the object (such as a table) on which the transaction is waiting for a lock. `ROW` is shown for hybrid table locks. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the object on which the transaction is waiting for a lock. `0` is shown for hybrid tables. |
| SCHEMA_NAME | VARCHAR | Identifier for the schema of the object on which the transaction is waiting for a lock. NULL is shown for `ROW` locks. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database of the object on which the transaction is waiting for a lock. |
| DATABASE_NAME | VARCHAR | Identifier for the database of the object on which the transaction is waiting for a lock. |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement that is waiting on the lock. |
| TRANSACTION_ID | NUMBER | Internal/system-generated [identifier for the transaction](../transactions.md) with the statement that is waiting on the lock. Can be joined with the [QUERY_HISTORY view](query_history.md) for additional details about the statements in the transaction. |
| REQUESTED_AT | TIMESTAMP_LTZ | Timestamp when the lock was requested by the transaction waiting for the lock. |
| ACQUIRED_AT | TIMESTAMP_LTZ | Timestamp when the lock was acquired by the transaction holding the lock. |
| BLOCKER_QUERIES | VARIANT | JSON array of objects. Each object is a blocker query with the following properties:   * `is_snowflake`: TRUE if the query is a background process run by Snowflake (e.g., automatic maintenance of   materialized views). * `query_id`: Query ID of the current statement in the blocker transaction that blocked the statement. Empty if   `is_snowflake` is true. * `transaction_id`: ID of the blocker transaction. Empty if `is_snowflake` is true.   There may be up to 20 objects in this array. |

## Usage notes

* The first blocker query ID that is returned in the `blocker_queries` array is the ID of the query that was being executed
  in the transaction that holds the lock when the transaction waiting for the lock started waiting.
  Note that it is possible that queries prior to that query in the blocker transaction also acquired the lock and should be investigated.
* Each row in the output represents a transaction waiting on a lock. Note that there may be other transactions ahead
  of that transaction, waiting on the same lock.

## Examples

Find all the blocked transactions that requested locks within the past 24 hours:

```sqlexample
SELECT account_name, query_id, object_name, transaction_id, blocker_queries
  FROM snowflake.organization_usage.alert_history.lock_wait_history
  WHERE requested_at >= DATEADD('hours', -24, CURRENT_TIMESTAMP());
```

For additional examples, see [Analyzing blocked transactions with the LOCK_WAIT_HISTORY view](../transactions.md).

---
title: LOGIN_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/login_history.md
section: Organization Usage
---

Schemas:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# LOGIN_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query login attempts by Snowflake users.

Details about the error codes/messages for login attempts that were unsuccessful can be found in the following documentation:

* [Federated authentication & SSO error codes](../../user-guide/errors-saml.md)
* [Multi-factor authentication (MFA) error codes](../../user-guide/security-mfa-duo.md)
* [OAuth error codes](../../user-guide/oauth-snowflake-overview.md)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| READER_ACCOUNT_NAME | VARCHAR | Name of the reader account for the user authentication event. This column is only included in the view in the READER_ACCOUNT_USAGE schema. |
| EVENT_ID | NUMBER | Internal/system-generated identifier for the login attempt. |
| EVENT_TIMESTAMP | TIMESTAMP_LTZ | Time (in the UTC time zone) of the event occurrence. |
| EVENT_TYPE | VARCHAR | Event type, such as LOGIN for authentication events. |
| USER_NAME | VARCHAR | User associated with this event. |
| CLIENT_IP | VARCHAR | IP address where the request originated. |
| REPORTED_CLIENT_TYPE | VARCHAR | Reported type of the client software, such as JDBC_DRIVER, ODBC_DRIVER, and so on. This information is not authenticated. |
| REPORTED_CLIENT_VERSION | VARCHAR | Reported version of the client software. This information is not authenticated. |
| FIRST_AUTHENTICATION_FACTOR | VARCHAR | Method used to authenticate the user (the first factor in multi factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR | VARCHAR | The second factor in multi factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| IS_SUCCESS | VARCHAR | Whether the user’s request was successful or not. |
| ERROR_CODE | NUMBER | Error code, if the request was not successful. |
| ERROR_MESSAGE | VARCHAR | Error message returned to the user, if the request was not successful. |
| RELATED_EVENT_ID | NUMBER | Reserved for future use. |
| CONNECTION | VARCHAR | Name of the connection used by the client, or NULL if the client is not using a connection URL. A connection is a Snowflake object that is part of [Client Redirect](../../user-guide/client-redirect.md). It represents a connection URL that you can use to fail over to another account for business continuity and disaster recovery. . , NOTE: If a client authenticates through an identity provider (IdP) that is configured with the account URL rather than the connection URL, the IdP directs the client to the account URL after authentication is complete. The CONNECTION column for this login event is NULL. See [Authentication and Client Redirect](../../user-guide/client-redirect.md). |
| CLIENT_PRIVATE_LINK_ID | VARCHAR | If the user logged in using [private connectivity](../../user-guide/private-connectivity-inbound.md), specifies the identifier of the endpoint from which the request originated. |
| FIRST_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](../account-usage/credentials.md) used to authenticate the user (the first factor in multi-factor authentication, if used). |
| SECOND_AUTHENTICATION_FACTOR_ID | VARCHAR | ID of the [credential](../account-usage/credentials.md) used for the second factor in multi-factor authentication. If the user did not use multi-factor authentication, this value is NULL. |
| LOGIN_DETAILS | VARCHAR | Displays details for each login event, including malicious IP protection category name, risk category, and blocking status. |

## Usage notes

* Latency for the view may be up to 24 hours.

* `INTERNAL_SNOWFLAKE_IP/0.0.0.0` appears as the client IP for login events triggered by internal Snowflake operations that support
  your usage. For example:

  + Because worksheets exist as unique sessions, when a user accesses a worksheet in [Snowsight](../../user-guide/ui-snowsight-gs.md),
    Snowflake creates a login event that originates from `INTERNAL_SNOWFLAKE_IP/0.0.0.0`.
  + When a Snowpark Container Services [service](../../developer-guide/snowpark-container-services/overview.md) logs into Snowflake, the client
    IP is masked to `INTERNAL_SNOWFLAKE_IP/0.0.0.0`.
* This view doesn’t record the activity of internal users the system defines to perform various operations, such as maintaining
  Snowsight worksheets.
* To see the blocking status of potentially malicious IP addresses, examine the LOGIN_DETAILS column output. For examples, see [View network login details](../../user-guide/malicious-ip-protection.md).

---
title: MASKING_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/masking_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# MASKING_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view provides the masking policies in your account.

Each row in this view corresponds to a different masking policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the masking policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the masking policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema to which the masking policy belongs. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the masking policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the masking policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the masking policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Masking policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the masking policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the masking policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the masking policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OPTIONS | VARIANT | The value for the EXEMPT_OTHER_POLICIES property in the policy. If set to `TRUE`, the column returns `{ "EXEMPT_OTHER_POLICIES: "TRUE" }`. If the property is set to `FALSE` or not set at all, the column returns NULL. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: MATERIALIZED_VIEW_REFRESH_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/materialized_view_refresh_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# MATERIALIZED_VIEW_REFRESH_HISTORY view

The MATERIALIZED_VIEW_REFRESH_HISTORY view in the ORGANIZATION_USAGE
schema is used for querying the
[materialized views](../../user-guide/views-materialized.md) refresh history for
a specified materialized view within a specified date range. The information
returned by the function includes the view name and credits consumed each time
a materialized view is refreshed.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account (user-defined). |
| ACCOUNT_LOCATOR | VARCHAR | Locator of the account (system-defined). |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this refresh history record. |
| CREDITS_USED | NUMBER | Number of credits billed for materialized view maintenance during the USAGE_DATE. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the materialized view. |
| TABLE_NAME | VARCHAR | Name of the materialized view. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the materialized view. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the materialized view. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the materialized view. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the materialized view. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: METERING_DAILY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/metering_daily_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# METERING_DAILY_HISTORY view

The METERING_DAILY_HISTORY view in the ORGANIZATION_USAGE schema can be used to return the daily credit usage and a cloud services rebate for an organization within the last 365 days (1 year).

> **Note:**
>
> As of March 1, 2026, Snowflake no longer bills customers for hybrid table requests,
> and metering was disabled soon after this pricing change took effect. Any new data
> in the view as of March 1, 2026, will not be billed to customers, and you can still
> query the historical data in the view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Type of service that is consuming credits. The following list includes many, **but not all**, of the possible service types:   * `AI_SERVICES`: See [Snowflake Cortex AI Functions (including LLM functions)](../../user-guide/snowflake-cortex/aisql.md) and [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md). * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTO_CLUSTERING`: See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `BACKUP`: See [Backups for disaster recovery and immutable storage](../../user-guide/backups.md). * `COPY_FILES`: See [COPY FILES](../sql/copy-files.md). * `DATA_QUALITY_MONITORING`: See [Introduction to data quality checks](../../user-guide/data-quality-intro.md). * `FAILSAFE_RECOVERY`: See [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md). * `HYBRID_TABLE_REQUESTS`: See [Hybrid tables](../../user-guide/tables-hybrid.md). * `MATERIALIZED_VIEW`: See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OPENFLOW_COMPUTE_BYOC`: See [Openflow BYOC cost and scaling considerations](../../user-guide/data-integration/openflow/cost-byoc.md). * `OPENFLOW_COMPUTE_SNOWFLAKE`: See [Openflow Snowflake Deployment cost and scaling considerations](../../user-guide/data-integration/openflow/cost-spcs.md). * `PIPE`: See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `POSTGRES_COMPUTE`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `POSTGRES_COMPUTE_HA`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `QUERY_ACCELERATION`: See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md). * `REPLICATION`: See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `SEARCH_OPTIMIZATION`: See [Search optimization service](../../user-guide/search-optimization-service.md). * `SENSITIVE_DATA_CLASSIFICATION`: See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS`: See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK`: See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPARK_CONTAINER_SERVICES`: See [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). * `SNOWPIPE_STREAMING`: See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION`: Compute cost to apply a policy on a target table and expire or archive rows (policy execution). See [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md). * `TELEMETRY_DATA_INGEST`: See [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md). * `TRUST_CENTER`: See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING`: See [Overview of warehouses](../../user-guide/warehouses-overview.md). * `WAREHOUSE_METERING_READER`: See [Manage reader accounts](../../user-guide/data-sharing-reader-create.md). |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| USAGE_DATE | DATE | The date (in the UTC time zone) in which the usage took place. |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used for warehouses and serverless compute resources during the USAGE_DATE. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER | Number of credits used for cloud services during the USAGE_DATE. |
| CREDITS_USED | NUMBER | Total of CREDITS_USED_COMPUTE plus CREDITS_USED_CLOUD_SERVICES. |
| CREDITS_ADJUSTMENT_CLOUD_SERVICES | NUMBER | Number of credits [adjusted for cloud services](../../user-guide/cost-understanding-compute.md). This is a negative value (e.g. -9). |
| CREDITS_BILLED | NUMBER | Total number of credits billed for the account in the day. This is a sum of CREDITS_USED_COMPUTE, CREDITS_USED_CLOUD_SERVICES, and CREDITS_ADJUSTMENT_CLOUD_SERVICES. |
| REGION | VARCHAR | ID of the Snowflake Region where the account is located. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account where the usage took place. |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* The data is retained for 365 days (1 year).

## Example

[Usage for cloud services](../../user-guide/cost-understanding-compute.md) is billed only if the daily consumption of cloud
services exceeds 10% of the daily usage of virtual warehouses. This query returns how much of cloud services consumption was actually
billed for a particular day, ordered by the highest billed amount.

```sqlexample
SELECT
    usage_date,
    credits_used_cloud_services,
    credits_adjustment_cloud_services,
    credits_used_cloud_services + credits_adjustment_cloud_services AS billed_cloud_services
FROM snowflake.organization_usage.metering_daily_history
WHERE usage_date >= DATEADD(month,-1,CURRENT_TIMESTAMP())
    AND credits_used_cloud_services > 0
ORDER BY 4 DESC;
```

---
title: METERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/metering_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# METERING_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

The METERING_HISTORY view in the ORGANIZATION_USAGE schema can be used to return the hourly credit usage for each account in the organization.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Type of service that is consuming credits. The following list includes many, **but not all**, of the possible service types:   * `AI_SERVICES`: See [Snowflake Cortex AI Functions (including LLM functions)](../../user-guide/snowflake-cortex/aisql.md) and [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md). * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE`: See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTO_CLUSTERING`: See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `BACKUP`: See [Backups for disaster recovery and immutable storage](../../user-guide/backups.md). * `COPY_FILES`: See [COPY FILES](../sql/copy-files.md). * `DATA_QUALITY_MONITORING`: See [Introduction to data quality checks](../../user-guide/data-quality-intro.md). * `FAILSAFE_RECOVERY`: See [Understanding and viewing Fail-safe](../../user-guide/data-failsafe.md). * `HYBRID_TABLE_REQUESTS`: See [Hybrid tables](../../user-guide/tables-hybrid.md). * `MATERIALIZED_VIEW`: See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OPENFLOW_COMPUTE_BYOC`: See [Openflow BYOC cost and scaling considerations](../../user-guide/data-integration/openflow/cost-byoc.md). * `OPENFLOW_COMPUTE_SNOWFLAKE`: See [Openflow Snowflake Deployment cost and scaling considerations](../../user-guide/data-integration/openflow/cost-spcs.md). * `PIPE`: See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `POSTGRES_COMPUTE`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `POSTGRES_COMPUTE_HA`: See [Snowflake Postgres](../../user-guide/snowflake-postgres/about.md). * `QUERY_ACCELERATION`: See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md). * `REPLICATION`: See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `SEARCH_OPTIMIZATION`: See [Search optimization service](../../user-guide/search-optimization-service.md). * `SENSITIVE_DATA_CLASSIFICATION`: See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS`: See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK`: See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPARK_CONTAINER_SERVICES`: See [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md). * `SNOWPIPE_STREAMING`: See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION`: Compute cost to apply a policy on a target table and expire or archive rows (policy execution). See [Storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md). * `TELEMETRY_DATA_INGEST`: See [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md). * `TRUST_CENTER`: See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING`: See [Overview of warehouses](../../user-guide/warehouses-overview.md). * `WAREHOUSE_METERING_READER`: See [Manage reader accounts](../../user-guide/data-sharing-reader-create.md). |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the usage took place. |
| ENTITY_ID | NUMBER | A system-generated identifier for the entity associated with the service.  In most cases, this is the internal ID of the monitored entity; for example, a pipe, task, or replication group.  When the SERVICE_TYPE is COPY_FILES, this column shows the ID of the database, schema, or stage from which files are copied.  If the SERVICE_TYPE is an Openflow type, the value is NULL.  If the SERVICE_TYPE is Snowpipe Streaming, this shows the ID of the relevant pipe; which is the default pipe ID for the default pipe. |
| ENTITY_TYPE | VARCHAR | Type of Snowflake resource that consumed credits, such as WAREHOUSE, TASK, or TABLE. Note that TABLE is used for all table-like objects. |
| NAME | VARCHAR | The name of the service or object associated with the cost entry, which varies significantly based on the SERVICE_TYPE.  Standard (General): This column shows the name of the service type itself; for example, REPLICATION, TASK.  SNOWPIPE_STREAMING: This service type generates two distinct cost entries, and the NAME column varies for each:   * Cost entry 1 (table name): The value is the name of the Snowflake target table. For the high-performance default pipe, the name is derived from the target table name and appended with -STREAMING; for example, MY_TABLE-STREAMING. * Cost entry 2 (client string): The value is a colon-separated string in the format: SNOWPIPE_STREAMING:CLIENT_NAME:SNOWFLAKE_PROVIDED_ID. This is used for tracking client-side costs.   COPY_FILES: The value is the name of the database from which the files are copied.  Openflow Types: The value is NULL. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier of the database associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific database; for example, a warehouse or compute pool. |
| DATABASE_NAME | VARCHAR | Name of the database associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific database. |
| SCHEMA_ID | NUMBER | Internal or system-generated identifier of the schema associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific schema. |
| SCHEMA_NAME | VARCHAR | Name of the schema associated with the resource of type `ENTITY_TYPE`. Contains a NULL value when the resource isn’t associated with a specific schema. |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used by warehouses, serverless compute, and [Openflow](../../user-guide/data-integration/openflow/about.md) resources in the hour. |
| CREDITS_USED_CLOUD_ SERVICES | NUMBER | Number of credits used for cloud services in the hour. Always `0` when the SERVICE_TYPE is one of the Openflow types. |
| CREDITS_USED | NUMBER | Total number of credits used for the account in the hour. This is a sum of CREDITS_USED_COMPUTE and CREDITS_USED_CLOUD_SERVICES. This value does not take into account the adjustment for cloud services, and may therefore be greater than your actual credit consumption. |
| BYTES | NUMBER | When the service type is `auto_clustering`, indicates the number of bytes reclustered during the START_TIME and END_TIME window. When the service type is `pipe`, indicates the number of bytes inserted during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, indicates the number of bytes migrated during the START_TIME and END_TIME window. When the service type is `COPY_FILES`, columns are aggregated at the database level. |
| ROWS | NUMBER | When the service type is `auto_clustering`, indicates number of rows reclustered during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, indicates the number of rows migrated during the START_TIME and END_TIME window. |
| FILES | NUMBER | When the service type is `pipe`, indicates number of files loaded during the START_TIME and END_TIME window. When the service type is `SNOWPIPE_STREAMING`, this is NULL. When the service type is `COPY_FILES`, columns are aggregated at the database level. |

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: OBJECT_DEPENDENCIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/object_dependencies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# OBJECT_DEPENDENCIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays one row for each object dependency.

For example, while creating a view from a single table, the view is dependent on the table. Snowflake returns one row to record the
dependency of the view on the table.

However, if creating the view is dependent on two tables, Snowflake returns one row to record the dependency of the view on the first table
and, separately, one row to record the dependency of the view on the second table. This pattern continues for however many dependencies
there are for a given object.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| REFERENCED_DATABASE | TEXT | The parent database of the referenced object. |
| REFERENCED_SCHEMA | TEXT | The parent schema of the referenced object. |
| REFERENCED_OBJECT_NAME | TEXT | The name of the referenced object. |
| REFERENCED_OBJECT_ID | NUMBER | The object ID of the referenced object. |
| REFERENCED_OBJECT_DOMAIN | TEXT | The domain (e.g. `TABLE`, `VIEW`) of the referenced object. |
| REFERENCING_DATABASE | TEXT | The parent database of the referencing object. |
| REFERENCING_SCHEMA | TEXT | The parent schema of the referencing object. |
| REFERENCING_OBJECT_NAME | TEXT | The name of the referencing object. |
| REFERENCING_OBJECT_ID | NUMBER | The object ID of the referencing object. |
| REFERENCING_OBJECT_DOMAIN | TEXT | The domain (e.g. `TABLE`, `VIEW`) of the referencing object. |
| DEPENDENCY_TYPE | TEXT | The type of dependency (`BY_ID`, `BY_NAME`, or `BY_NAME_AND_ID`). |

## Usage notes

* Latency for this view may be up to 24 hours.

* For a complete list of supported objects and their dependency type, see [Supported object dependencies](../../user-guide/object-dependencies.md).
* Data movement, such as when data is copied or materialized from one object to another, does not result in an object dependency. For
  example, CREATE TABLE AS SELECT (CTAS), INSERT, or MERGE operations on tables result in data movement and are not included in this view.
* This view was backfilled on January 22, 2022 to include dependencies prior to making the view available. Snowflake continues to record
  dependencies after this date.

  Note that if a view or [UDF](../../developer-guide/udf/udf-overview.md) was invalid due to a missing dependency prior to this date and
  the missing dependency is fixed later, Snowflake does not record the dependency for the view or UDF.

  For example, if you created a view that depends on a table on December 1, 2021, dropped the table on the same day, and then undropped the
  table on February 1, 2022, Snowflake does not record that the view depends on the table.

  As a workaround, create or replace the view or UDF to so that this view records the dependency.

### Data sharing usage notes

General notes:
:   The view updates assume the share is not deleted.

    The view schema (i.e. column names, data types, and values) remains the same with these exceptions:

    * The value for the REFERENCED_OBJECT_ID column in the consumer account is always NULL for a shared object.

      This value prevents a customer from discovering the source object in the provider account.
    * The value for REFERENCED_OBJECT_DOMAIN is `TABLE` for all table-like objects.

Snowflake objects:
:   Shared objects, such as Account Usage views, are now supported as referenced objects.

    For example, if a user-defined view depends on data from another Account Usage view, such as LOGIN_HISTORY, the OBJECT_DEPENDENCIES view
    in the consumer account specifies the LOGIN_HISTORY view as the referenced object.

Rename notes:
:   When a provider renames a shared database, shared schema, or shared object:

    * The consumer OBJECT_DEPENDENCIES view record shows the record of the original name for the database, schema, or object prior to
      the renaming, not the renamed object.

      Newly renamed shared objects are not shown in the consumer OBJECT_DEPENDENCIES view to prevent the consumer from determining the object
      lifecycle in the provider account. A new referencing object would need to refer to the newly renamed object in order for the renamed
      object to appear in the local OBJECT_DEPENDENCIES view in the consumer account.
    * Renaming the shared database preserves the dependency in the consumer account.
    * Renaming a shared schema or shared objects in a shared schema breaks the dependency in the consumer account.

    If the consumer renames a shared database, all existing dependencies on that database break. Consequently, Snowflake removes the
    corresponding records from the OBJECT_DEPENDENCIES view in the consumer account.

    For example, the shared database contains a view named `db1_shared.views.view_1_shared`. The consumer renames the shared database to
    `mydb`. The view now has a fully-qualified name of `mydb.views.view_1_shared`. Snowflake removes the row specifying
    `db1_shared.views.view_1_shared` in the consumer’s OBJECT_DEPENDENCIES view because the dependency on the database named
    `db1_shared` is broken.

Not supported:
:   The `BY_ID` dependency type for referenced objects is not supported.

    * [Limitations](../../user-guide/object-dependencies.md)
    * [Object dependencies with snowflake features and services](../../user-guide/object-dependencies.md)

---
title: PASSWORD_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/password_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# PASSWORD_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view provides the user-defined [password policies](../../user-guide/password-authentication.md) in an account.

Each row in this view corresponds to a different password policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the policy. |
| ID | NUMBER | Internal/system-generated identifier for the password policy. |
| SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | VARCHAR | Schema to which the password policy belongs. |
| DATABASE_ID | VARCHAR | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | VARCHAR | Database to which the password policy belongs. |
| OWNER | VARCHAR | Name of the role that owns the password policy. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PASSWORD_MIN_LENGTH | NUMBER | Minimum password length allowed for the policy. |
| PASSWORD_MAX_LENGTH | NUMBER | Maximum password length allowed for the policy. |
| PASSWORD_MIN_UPPER_CASE_CHARS | NUMBER | Minimum number of uppercase characters allowed for the policy. |
| PASSWORD_MIN_LOWER_CASE_CHARS | NUMBER | Minimum number of lowercase characters allowed for the policy. |
| PASSWORD_MIN_NUMERIC_CHARS | NUMBER | Minimum number of numeric characters allowed for the policy. |
| PASSWORD_MIN_SPECIAL_CHARS | NUMBER | Minimum number of special characters allowed for the policy. |
| PASSWORD_MIN_AGE_DAYS | NUMBER | The number of days a user must wait before a recently changed password can be changed again. |
| PASSWORD_MAX_AGE_DAYS | NUMBER | Maximum number of days password is valid. |
| PASSWORD_MAX_RETRIES | NUMBER | Maximum number of password attempts allowed. |
| PASSWORD_LOCKOUT_TIME_MINS | NUMBER | Minimum time in minutes before password can be retried. |
| COMMENT | VARCHAR | Comments entered for the password policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the password policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the password policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the password policy was dropped. |
| PASSWORD_HISTORY | NUMBER | The number of the most recent passwords that Snowflake stores. These stored passwords cannot be repeated when a user updates their password value. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: PIPE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/pipe_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# PIPE_USAGE_HISTORY view

The PIPE_USAGE_HISTORY view in the ORGANIZATION_USAGE schema can be used
to query the history of data loaded into Snowflake tables and Apache Iceberg™ tables using
[Snowpipe](../../user-guide/data-load-snowpipe-intro.md) within a specified date range.
It includes the history of data loaded and credits billed for your
entire Snowflake organization.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION | VARCHAR | Name of the region where the account is located. |
| PIPE_ID | NUMBER | Internal/system-generated identifier for the pipe used for the data load. Displays NULL if no pipe name was specified in the query. Each row includes the totals for all pipes in use within the time range. |
| PIPE_NAME | VARCHAR | Name of the pipe. Displays NULL for the internal (hidden) pipe object used to refresh the metadata for an external table. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this usage history record. |
| CREDITS_USED | NUMBER | Number of credits billed for Snowpipe data loads during the USAGE_DATE. |
| BYTES_INSERTED | VARIANT | Number of bytes loaded during the USAGE_DATE. |
| FILES_INSERTED | VARIANT | Number of files loaded during the USAGE_DATE. |
| BYTES_BILLED | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing visibility into Snowpipe’s cost implications directly within these history views. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: PIPES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/pipes.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# PIPES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each pipe defined in an account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| PIPE_ID | NUMBER | Internal or system-generated identifier for the pipe. |
| PIPE_NAME | VARCHAR | The name of the pipe object.  For manually created pipes, this is the name defined in the CREATE PIPE statement.  For the Snowpipe Streaming high-performance default pipe, this is derived from the target table name; for example, `MY_TABLE-STREAMING`. |
| PIPE_SCHEMA_ID | NUMBER | Internal or system-generated identifier for the schema that the pipe belongs to.  For the default pipe, this corresponds to the target table’s schema ID. |
| PIPE_SCHEMA | VARCHAR | Schema that the pipe belongs to.  For the default pipe, this corresponds to the target table’s schema. |
| PIPE_CATALOG_ID | NUMBER | Internal or system-generated identifier for the database that the pipe belongs to.  For the default pipe, this corresponds to the target table’s database ID. |
| PIPE_CATALOG | VARCHAR | Name of the database that the pipe belongs to.  For the default pipe, this corresponds to the target table’s database. |
| IS_AUTOINGEST_ENABLED | VARCHAR | Whether AUTO-INGEST is enabled for the pipe. Represents future functionality. |
| NOTIFICATION_CHANNEL_NAME | VARCHAR | Amazon Resource Name of the Amazon SQS queue for the stage named in the DEFINITION column. Represents future functionality. |
| PIPE_OWNER | VARCHAR | Name of the role that owns the pipe.  Returns NULL for the default pipe. |
| DEFINITION | VARCHAR | COPY statement used to load data from queued files into a Snowflake table. |
| CREATED | TIMESTAMP_LTZ | Creation time of the pipe. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this pipe.  Returns the following message for the default pipe: “Default pipe for Snowpipe Streaming High Performance ingestion to a table. Created and managed by Snowflake.” |
| PATTERN | VARCHAR | PATTERN copy option value in the [COPY INTO <table>](../sql/copy-into-table.md) statement in the pipe definition, if the copy option was specified. |
| DELETED | TIMESTAMP_LTZ | Date and time when the pipe was deleted. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object; for example, ROLE.  If a Snowflake Native App owns the object, the value is APPLICATION.  Snowflake returns NULL if you delete the object because a deleted object doesn’t have an owner role.  Returns NULL for the default pipe. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

The following example joins this view with [PIPE_USAGE_HISTORY view](../account-usage/pipe_usage_history.md) on the PIPE_ID column to track the credit usage associated with each unique PIPE object:

```sqlexample
select a.PIPE_CATALOG as PIPE_CATALOG,
       a.PIPE_SCHEMA as PIPE_SCHEMA,
       a.PIPE_NAME as PIPE_NAME,
       b.CREDITS_USED as CREDITS_USED
from SNOWFLAKE.ORGANIZATION_USAGE.PIPES a join SNOWFLAKE.ORGANIZATION_USAGE.PIPE_USAGE_HISTORY b
on a.pipe_id = b.pipe_id
where b.START_TIME > date_trunc(month, current_date);
```

---
title: POLICY_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/policy_references.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# POLICY_REFERENCES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view lists policy objects and their references in each account.

The view supports aggregation, masking, network, projection, row access, and storage lifecycle policies.

The view is complementary to the Information Schema table function [POLICY_REFERENCES](../functions/policy_references.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The database in which the policy is set. |
| POLICY_SCHEMA | VARCHAR | The schema in which the policy is set. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the policy. |
| POLICY_NAME | VARCHAR | The name of the policy. |
| POLICY_KIND | VARCHAR(17) | The type of policy. |
| REF_DATABASE_NAME | VARCHAR | The name of the database containing an object that the queried object references. |
| REF_SCHEMA_NAME | VARCHAR | The name of the schema containing an object that the queried object references. |
| REF_ENTITY_NAME | VARCHAR | The name of the object (i.e. table_name, view_name, external_table_name) on which the policy is set. |
| REF_ENTITY_DOMAIN | VARCHAR | The object type (i.e. table, view) on which the policy is set. |
| REF_COLUMN_NAME | VARCHAR | The column name on which the policy is set. |
| REF_ARG_COLUMN_NAMES | VARCHAR | Returns NULL for rows in the query result in which a Column-level Security masking policy is set. |
| TAG_DATABASE | VARCHAR | The name of the database containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_SCHEMA | VARCHAR | The name of the schema containing the tag that has a policy assigned to the tag or NULL if a policy is not assigned to the tag. |
| TAG_NAME | VARCHAR | The name of the tag that has a policy assigned to it or NULL if a policy is not assigned to the tag. |
| POLICY_STATUS | VARCHAR | Specifies the status of the policy, which can be one of four possible values: `ACTIVE`, `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`, `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`, or `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`. |

Note the following for the POLICY_STATUS column:

> `ACTIVE`
> :   Specifies that the column (i.e. REF_COLUMN_NAME) is only associated with a single policy.
>
> `MULTIPLE_MASKING_POLICY_ASSIGNED_TO_THE_COLUMN`
> :   Specifies that multiple masking policies are assigned to the same column.
>
> `COLUMN_IS_MISSING_FOR_SECONDARY_ARG`
> :   Specifies that the policy (i.e. POLICY_NAME) is a conditional masking policy and the table (i.e. REF_ENTITY_NAME) does not have a
>     column with the same name.
>
> `COLUMN_DATATYPE_MISMATCH_FOR_SECONDARY_ARG`
> :   Specifies that the policy is a conditional masking policy and the table has a column with the same name but a different data type than
>     the data type in the masking policy signature.

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: PROCEDURES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/procedures.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# PROCEDURES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each stored procedure defined in each account.

For more information about stored procedures, see [Stored procedures overview](../../developer-guide/stored-procedure/stored-procedures-overview.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROCEDURE_CATALOG | VARCHAR | Database to which the stored procedure belongs. |
| PROCEDURE_SCHEMA | VARCHAR | Schema to which the stored procedure belongs. |
| PROCEDURE_NAME | VARCHAR | Name of the stored procedure. |
| PROCEDURE_OWNER | VARCHAR | Name of the role that owns the stored procedure. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the stored procedure’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric return value. |
| PROCEDURE_LANGUAGE | VARCHAR | Language of the stored procedure. |
| PROCEDURE_DEFINITION | VARCHAR | Stored procedure definition. |
| CREATED | TIMESTAMP_LTZ | Creation time of the stored procedure. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this stored procedure. |
| DELETED | TIMESTAMP_LTZ | Date and time when the procedure was dropped. |
| RUNTIME_VERSION | VARCHAR | Runtime version of the language used by the procedure. |
| PACKAGES | VARCHAR | Packages requested by the procedure. |
| INSTALLED_PACKAGES | VARCHAR | All packages installed by the function. Output for Python procedures only. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PROCEDURE_SCHEMA_ID | NUMBER | Internal/system-generated identifier of the schema to which the stored procedure belongs. |
| PROCEDURE_CATALOG_ID | NUMBER | Internal/system-generated identifier of the database to which the stored procedure belongs. |
| SECRETS | JSON map | Map of [secrets](../sql/create-secret.md) specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| EXTERNAL_ACCESS_INTEGRATIONS | VARCHAR | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: QUERY_ACCELERATION_ELIGIBLE view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/query_acceleration_eligible.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# QUERY_ACCELERATION_ELIGIBLE view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to identify queries that are eligible for the
[query acceleration service](../../user-guide/query-acceleration-service.md) (QAS).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| QUERY_TEXT | VARCHAR | Text of the SQL statement. |
| START_TIME | TIMESTAMP_LTZ | Statement start time. |
| END_TIME | TIMESTAMP_LTZ | Statement end time. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that the query executed on. |
| WAREHOUSE_SIZE | VARCHAR | Size of the warehouse when this statement executed. |
| ELIGIBLE_QUERY_ACCELERATION_TIME | NUMBER | Amount of query execution time (in seconds) eligible for the query acceleration service. |
| UPPER_LIMIT_SCALE_FACTOR | NUMBER | Upper limit [scale factor](../sql/create-warehouse.md) for the given query. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_PARAMETERIZED_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 24 hours.

* Query acceleration is supported for the following SQL commands:

  > + SELECT
  > + INSERT
  > + CREATE TABLE AS SELECT (CTAS)
  > + COPY INTO <table>

  For more information about query eligibility, see [Eligible queries](../../user-guide/query-acceleration-service.md).
* This view only includes eligible queries that have *not* been accelerated. If you have enabled
  the query acceleration service and previously QAS-eligible queries are now accelerated, they
  are not included in this view.

## Examples

Identify the warehouses with the most queries eligible in a given period of time for the query acceleration service:

```sqlexample
SELECT account_name, warehouse_name, COUNT(query_id) AS num_eligible_queries
  FROM SNOWFLAKE.ORGANIZATION_USAGE.QUERY_ACCELERATION_ELIGIBLE
  WHERE start_time >= '2024-06-01 00:00'::TIMESTAMP
  AND end_time <= '2024-06-07 00:00'::TIMESTAMP
  GROUP BY warehouse_name
  ORDER BY num_eligible_queries DESC;
```

For more example queries, see [Identifying queries and warehouses with the QUERY_ACCELERATION_ELIGIBLE view](../../user-guide/query-acceleration-service.md).

---
title: QUERY_ACCELERATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/query_acceleration_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# QUERY_ACCELERATION_HISTORY view

The QUERY_ACCELERATION_HISTORY view in the ORGANIZATION_USAGE schema is used for querying the history of queries accelerated
by the [query acceleration service](../../user-guide/query-acceleration-service.md). The information returned by the view
includes the warehouse name and the credits consumed by the query acceleration service.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account (user-defined). |
| ACCOUNT_LOCATOR | VARCHAR | Account locator of the account (system-defined). |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) when queries were accelerated. |
| CREDITS_USED | NUMBER | Number of credits billed for the query acceleration service during the USAGE_DATE. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that the queries were executed on. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that the queries were executed on. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).

---
title: QUERY_ATTRIBUTION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/query_attribution_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# QUERY_ATTRIBUTION_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to determine the compute cost of a given query run on warehouses in your organization.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| PARENT_QUERY_ID | VARCHAR | Query ID of the parent query or NULL if the query does not have a parent. |
| ROOT_QUERY_ID | VARCHAR | Query ID of the topmost query in the chain or NULL if the query does not have a parent. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse that the query was executed on. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse that the query executed on. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_TAG | VARCHAR | Query tag set for this statement through the [QUERY_TAG](../parameters.md) session parameter. |
| USER_NAME | VARCHAR | User who issued the query. |
| START_TIME | TIMESTAMP_LTZ | Time when query execution started (in the local time zone). |
| END_TIME | TIMESTAMP_LTZ | Time when query execution ended (in the local time zone). |
| CREDITS_ATTRIBUTED_COMPUTE | FLOAT | Number of credits attributed to this query. Includes only the credit usage for the query execution and doesn’t include any warehouse idle time. |
| CREDITS_USED_QUERY_ACCELERATION | FLOAT | Number of credits consumed by the [Query Acceleration Service](../../user-guide/query-acceleration-service.md) to accelerate the query. NULL if the query is not accelerated. . . The total cost for an accelerated query is the sum of this column and the CREDITS_ATTRIBUTED_COMPUTE column. |

## Usage notes

* Latency for the view may be up to 24 hours.
* The QUERY_ATTRIBUTE_HISTORY view in the ACCOUNT_USAGE schema contains most of the same columns as the QUERY_ATTRIBUTE_HISTORY view in the ORGANIZATION_USAGE schema. For sample queries against the ACCOUNT_USAGE view, see [Examples](../account-usage/query_attribution_history.md). Simply replace SNOWFLAKE.ACCOUNT_USAGE with SNOWFLAKE.ORGANIZATION_USAGE in the queries to find organization-level information.

---
title: QUERY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/query_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# QUERY_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query Snowflake query history by various dimensions (time range, session, user, warehouse, etc.) within the last 365 days (1 year).

The view is available in both the ORGANIZATION_USAGE and READER_ACCOUNT_USAGE schemas with the following differences:

* The following columns are available *only* in the reader account view:

  + READER_ACCOUNT_NAME
  + READER_ACCOUNT_DELETED_ON

Alternatively, you can call the Information Schema table function, also named QUERY_HISTORY. See the
[description of the QUERY_HISTORY function](../functions/query_history.md).

See also:

> [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../functions/query_history.md) (Information Schema table function),
> [Monitor query activity with Query History](../../user-guide/ui-snowsight-activity.md) (Snowsight dashboard),
> [Use the Grouped Query History view in Snowsight](../../user-guide/ui-snowsight-activity.md)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| `organization_name` | VARCHAR | Name of the organization. |
| `account_locator` | VARCHAR | System-generated identifier for the account. |
| `account_name` | VARCHAR | User-defined identifier for the account. |

**Additional columns**

The *Available only in reader account usage views* column in the following table indicates whether the QUERY_HISTORY column is available in the
[READER_ACCOUNT_USAGE](../account-usage.md) schema.

| Column Name | Data Type | Description | Available only in reader account usage views |
| --- | --- | --- | --- |
| `reader_account_name` | VARCHAR | Name of the reader account in which the SQL statement was executed. | ✔ |
| `query_id` | VARCHAR | Internal/system-generated identifier for the SQL statement. | ✔ |
| `query_text` | VARCHAR | Text of the SQL statement. The limit is 100K characters. Longer SQL statements are truncated. |  |
| `database_id` | NUMBER | internal/system-generated identifier for the database that was in use. | ✔ |
| `database_name` | VARCHAR | database that was specified in the context of the query at compilation. | ✔ |
| `schema_id` | NUMBER | Internal/system-generated identifier for the schema that was in use. | ✔ |
| `schema_name` | VARCHAR | Schema that was specified in the context of the query at compilation. | ✔ |
| `query_type` | VARCHAR | DML, query, etc. If the query failed, then the query type may be UNKNOWN. |  |
| `session_id` | NUMBER | Session that executed the statement. | ✔ |
| `authn_event_id` | NUMBER | ID for the event for the authentication of the user for this query. This ID corresponds to the value in the `event_id` column in the [LOGIN_HISTORY](../account-usage/login_history.md) view. |  |
| `user_name` | VARCHAR | User who issued the query. |  |
| `role_name` | VARCHAR | Role that was active in the session at the time of the query. | ✔ |
| `warehouse_id` | NUMBER | Internal/system-generated identifier for the warehouse that was used. | ✔ |
| `warehouse_name` | VARCHAR | Warehouse that the query executed on, if any. | ✔ |
| `warehouse_size` | VARCHAR | Size of the warehouse when this statement executed. | ✔ |
| `warehouse_type` | VARCHAR | Type of the warehouse when this statement executed. | ✔ |
| `cluster_number` | NUMBER | The cluster (in a multi-cluster warehouse) that this statement executed on. | ✔ |
| `query_tag` | VARCHAR | Query tag set for this statement through the QUERY_TAG session parameter. | ✔ |
| `execution_status` | VARCHAR | Execution status for the query. Valid values: `success`, `fail`, `incident`. | ✔ |
| `error_code` | NUMBER | Error code, if the query returned an error | ✔ |
| `error_message` | VARCHAR | Error message, if the query returned an error. The limit is 5K characters. Longer error messages are truncated. | ✔ |
| `start_time` | TIMESTAMP_LTZ | Statement start time (in the local time zone) | ✔ |
| `end_time` | TIMESTAMP_LTZ | Statement end time (in the local time zone). | ✔ |
| `total_elapsed_time` | NUMBER | Elapsed time (in milliseconds). | ✔ |
| `bytes_scanned` | NUMBER | Number of bytes scanned by this statement. | ✔ |
| `percentage_scanned_from_cache` | FLOAT | Percentage of data scanned from the local disk cache. The value ranges from 0.0 to 1.0. Multiply by 100 to get a true percentage. |  |
| `bytes_written` | NUMBER | Number of bytes written (e.g. when loading into a table). |  |
| `bytes_written_to_result` | NUMBER | Number of bytes written to a result object. For example, `SELECT * FROM ...` would produce a set of results in tabular format representing each field in the selection. . . In general, the results object represents whatever is produced as a result of the query, and `bytes_written_to_result` represents the size of the returned result. |  |
| `bytes_read_from_result` | NUMBER | Number of bytes read from a result object. |  |
| `rows_produced` | NUMBER | The number of rows produced by this statement. The `rows_produced` column will be deprecated in a future release. The value in the `rows_produced` column doesn’t always reflect the logical number of rows affected by a query. Snowflake recommends using the `rows_inserted`, `rows_updated`, `rows_written_to_result`, or `rows_deleted` columns instead. | ✔ |
| `rows_inserted` | NUMBER | Number of rows inserted by the query. |  |
| `rows_updated` | NUMBER | Number of rows updated by the query. |  |
| `rows_deleted` | NUMBER | Number of rows deleted by the query. |  |
| `rows_unloaded` | NUMBER | Number of rows unloaded during data export. |  |
| `bytes_deleted` | NUMBER | Number of bytes deleted by the query. |  |
| `partitions_scanned` | NUMBER | Number of micro-partitions scanned. |  |
| `partitions_total` | NUMBER | Total micro-partitions of all tables included in this query. |  |
| `bytes_spilled_to_local_storage` | NUMBER | Volume of data spilled to local disk. |  |
| `bytes_spilled_to_remote_storage` | NUMBER | Volume of data spilled to remote disk. |  |
| `bytes_sent_over_the_network` | NUMBER | Volume of data sent over the network. |  |
| `compilation_time` | NUMBER | Compilation time (in milliseconds) | ✔ |
| `execution_time` | NUMBER | Execution time (in milliseconds) | ✔ |
| `queued_provisioning_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for the warehouse compute resources to provision, due to warehouse creation, resume, or resize. | ✔ |
| `queued_repair_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, waiting for compute resources in the warehouse to be repaired. | ✔ |
| `queued_overload_time` | NUMBER | Time (in milliseconds) spent in the warehouse queue, due to the warehouse being overloaded by the current query workload. | ✔ |
| `transaction_blocked_time` | NUMBER | Time (in milliseconds) spent blocked by a concurrent DML. | ✔ |
| `outbound_data_transfer_cloud` | VARCHAR | Target cloud provider for statements that unload data to another region and/or cloud. | ✔ |
| `outbound_data_transfer_region` | VARCHAR | Target region for statements that unload data to another region and/or cloud. | ✔ |
| `outbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in statements that unload data from Snowflake tables. | ✔ |
| `inbound_data_transfer_cloud` | VARCHAR | Source cloud provider for statements that load data from another region and/or cloud. | ✔ |
| `inbound_data_transfer_region` | VARCHAR | Source region for statements that load data from another region and/or cloud. | ✔ |
| `inbound_data_transfer_bytes` | NUMBER | Number of bytes transferred in a replication operation from another account. The source account could be in the same region or a different region than the current account. | ✔ |
| `list_external_files_time` | NUMBER | Time (in milliseconds) spent listing external files. |  |
| `credits_used_cloud_services` | NUMBER | Number of credits used for cloud services. This value does not take into account the [adjustment for cloud services](../../user-guide/cost-understanding-compute.md), and may therefore be greater than the credits that are billed. To determine how many credits were actually billed, run queries against the [METERING_DAILY_HISTORY view](metering_daily_history.md). | ✔ |
| `reader_account_deleted_on` | TIMESTAMP_LTZ | Time and date (in the UTC time zone) when the reader account is deleted. | ✔ |
| `release_version` | VARCHAR | Release version in the format of `major_release.minor_release.patch_release`. |  |
| `external_function_total_invocations` | NUMBER | The aggregate number of times that this query called remote services. For important details, see the Usage Notes. |  |
| `external_function_total_sent_rows` | NUMBER | The total number of rows that this query sent in all calls to all remote services. |  |
| `external_function_total_received_rows` | NUMBER | The total number of rows that this query received from all calls to all remote services. |  |
| `external_function_total_sent_bytes` | NUMBER | The total number of bytes that this query sent in all calls to all remote services. |  |
| `external_function_total_received_bytes` | NUMBER | The total number of bytes that this query received from all calls to all remote services. |  |
| `query_load_percent` | NUMBER | The approximate percentage of active compute resources in the warehouse for this query execution. |  |
| `is_client_generated_statement` | BOOLEAN | Indicates whether the query was client-generated. |  |
| `query_acceleration_bytes_scanned` | NUMBER | Number of bytes scanned by the [query acceleration service](../../user-guide/query-acceleration-service.md). |  |
| `query_acceleration_partitions_scanned` | NUMBER | Number of partitions scanned by the query acceleration service. |  |
| `query_acceleration_upper_limit_scale_factor` | NUMBER | Upper limit [scale factor](../../user-guide/query-acceleration-service.md) that a [query would have benefited from](../../user-guide/query-acceleration-service.md). |  |
| `transaction_id` | NUMBER | [ID of the transaction](../transactions.md) that contains the statement or 0 if the statement is not executed within a transaction. |  |
| `child_queries_wait_time` | NUMBER | Time (in milliseconds) to complete the cached lookup when calling a [memoizable function](../../developer-guide/udf/sql/udf-sql-scalar-functions.md). |  |
| `role_type` | VARCHAR | Specifies whether an APPLICATION, DATABASE_ROLE, or ROLE executed the query. |  |
| `query_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |  |
| `query_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |  |
| `query_parameterized_hash` | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |  |
| `query_parameterized_hash_version` | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |  |
| `secondary_role_stats` | VARCHAR | A JSON-formatted string that contains three fields regarding secondary roles that were evaluated in the query: a list of secondary roles or `ALL` depending on the session, a count of the number of secondary roles, and the internal/system-generated ID for each secondary role. The count and number of IDs have a maximum of 50. |  |
| `rows_written_to_result` | NUMBER | Number of rows written to a result object. For CREATE TABLE AS SELECT (CTAS) and all DML operations, this result is `1`. |  |
| `query_retry_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by actionable errors. For more information, see Query retry columns. |  |
| `query_retry_cause` | VARCHAR | Error that caused the query to retry. If there is no query retry, the field is NULL. For more information, see Query retry columns. |  |
| `fault_handling_time` | NUMBER | Total execution time (in milliseconds) for query retries caused by errors that are *not* actionable. For more information, see Query retry columns. |  |
| `user_type` | VARCHAR | The type of the user executing the query. It’s the same as the `type` column in the [USERS view](users.md). If a Snowpark Container Services service executes the query, the user type is SNOWFLAKE_SERVICE. |  |
| `user_database_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |  |
| `user_database_id` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL |  |
| `user_schema_name` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |  |
| `user_schema_id` | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema; otherwise, it’s NULL. |  |
| `bind_values` | ARRAY | Bind values in serialized form. If the query contains no bind values, then this column contains an empty array. If the array is too large or the [ALLOW_BIND_VALUES_ACCESS](../parameters.md) parameter is set to `FALSE`, this column contains NULL. For more information, see [Retrieve bind variable values](../bind-variables.md). |  |
|  |  |  |  |

## Usage notes

### General

* Latency for the view may be up to 24 hours.

* The values for the columns
  `external_function_total_invocations`, `external_function_total_sent_rows`,
  `external_function_total_received_rows`, `external_function_total_sent_bytes`, and `external_function_total_received_bytes`
  are affected by many factors, including:

  + The number of external functions in the SQL statement.
  + The number of rows per batch sent to each remote service.
  + The number of retries due to transient errors (for example, because a response was not received within the expected time).
* If you want to filter on client-generated query statements, use
  [QUERY_HISTORY](../functions/query_history.md) (an Information Schema table function).
* Canceled queries are identified by their `error_message` text (`SQL execution canceled`), not by their `execution_status` value.

### Query retry columns

A query might need to be retried one or more times in order to successfully complete. There can be multiple causes that result in a query
retry. Some of these causes are *actionable*, that is, a user can make changes to reduce or eliminate query retries for a specific query.
For example, if a query is retried due to an out of memory error, modifying warehouse settings might resolve the issue.

Some query retries are caused by a fault that is not actionable. That is, there is no change a user can make to prevent the
query retry. For example, a network outage might result in a query retry. In this case, there is no change to the query or to the
warehouse that executes it that can prevent the query retry.

The QUERY_RETRY_TIME, QUERY_RETRY_CAUSE, and FAULT_HANDLING_TIME columns can help you optimize queries that are retried and better
understand fluctuations in query performance.

### Query history for hybrid tables

The following notes explain when records are logged in the QUERY_HISTORY view for queries against hybrid tables:

* Short-running queries that operate exclusively against hybrid tables do not generate a record in this
  view or [QUERY_HISTORY](../functions/query_history.md) (Information
  Schema table function). To monitor such queries, use the
  [AGGREGATE_QUERY_HISTORY](../account-usage/aggregate_query_history.md) view.
  This view allows you to more easily monitor high-throughput operational
  workloads for trends and issues.
* Short-running queries that operate exclusively against hybrid tables do not provide a query profile
  that you can inspect in Snowsight.
* Queries against hybrid tables do generate both a record in the QUERY_HISTORY view and a query profile if any of the
  following conditions are met:

  + A query is executed against any table type other than the hybrid table type. This
    condition ensures that there is no behavior change for any existing
    non-Unistore workloads.
  + A query fails with an EXECUTION_STATUS of `failed_with_incident` (see
    [QUERY_HISTORY](../functions/query_history.md)). This
    condition ensures that you can investigate and report the specific failed
    query to receive assistance.
  + A query is running longer than approximately 500 milliseconds. This
    condition ensures that you can investigate performance issues for slow queries.
  + Query result size is too large.
  + A query is associated with a Snowflake transaction.
  + A query contains a system function with side effects.
  + A query is not one of the following statement types: SELECT, INSERT,
    DELETE, UPDATE, MERGE.
  + A query is executed from SnowSQL, Snowsight, or Classic Console. This
    condition ensures that you can manually generate a full query profile to
    investigate performance issues for any specific query even if it is not
    categorized as long-running.
  + Even if a query does not meet any of these criteria, queries can be
    periodically sampled to generate a record in the QUERY_HISTORY view and a
    query profile to help your investigation.

### PUT and GET commands

For the [PUT](../sql/put.md) and [GET](../sql/get.md) commands,
an EXECUTION_STATUS of `success` in the [QUERY_HISTORY](../account-usage/query_history.md)
does *not* mean that data files were successfully uploaded or downloaded.
Instead, the status indicates that Snowflake received authorization to proceed with the file transfer.

---
title: RATE_SHEET_DAILY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/rate_sheet_daily.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# RATE_SHEET_DAILY view

The RATE_SHEET_DAILY view in the ORGANIZATION_USAGE schema returns the effective rates used for calculating usage in the organization
currency based on credits used for all Snowflake accounts in your organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATE | DATE | Date (in the UTC time zone) for the effective price. |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| CONTRACT_NUMBER | VARCHAR | Snowflake contract number for the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account. |
| REGION | VARCHAR | Name of the region where the account is located. |
| SERVICE_LEVEL | VARCHAR | Service level of the Snowflake account (Standard, Enterprise, Business Critical, etc.). |
| USAGE_TYPE | VARCHAR | Corresponds to the Usage Category column in a billing statement, which exists for backward compatibility only. Use the BILLING_TYPE, RATING_TYPE, SERVICE_TYPE, and IS_ADJUSTMENT columns for billing reconciliation. |
| CURRENCY | VARCHAR | The currency of the EFFECTIVE_RATE. |
| EFFECTIVE_RATE | NUMBER(38, 2) | The rate after applying any applicable discounts per the contract for the organization. |
| SERVICE_TYPE | VARCHAR | Type of usage, for example, `snowpipe` for usage related to the Snowpipe feature. |
| RATING_TYPE | VARCHAR | Indicates how the usage in the record is rated, or priced. Possible values include:   * `compute` * `storage` * `other` |
| BILLING_TYPE | VARCHAR | Indicates what is being charged or credited. Possible billing types include:   * `consumption` — Usage associated with compute credits, storage costs, and data transfer costs. * `rebate` — Usage covered by the credits awarded to the organization when it shared data with another organization. * `priority support` — Charges for priority support services. This charge is associated with a stipulation in a contract, not with an account. * `vps_deployment_fee` — Charges for a [Virtual Private Snowflake](../../user-guide/intro-editions.md) deployment. * `support_credit` — Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake. |
| IS_ADJUSTMENT | BOOLEAN | Indicates whether the record is an adjustment to usage. |

## Usage notes

* Latency for the view may be up to 24 hours.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments/credits, mid-month contract amendments, or Snowflake account transfers from one organization to another.
* Customers who signed a contract through a Snowflake reseller cannot access data in this view.
* Data is retained indefinitely.
* This view does not include data generated prior to June 2020. To obtain data before this date, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: REFERENTIAL_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/referential_constraints.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# REFERENTIAL_CONSTRAINTS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each FOREIGN KEY constraint that is defined for tables in each account.

FOREIGN KEY constraints are used to enforce referential integrity. For more information, see
[Constraints](../constraints.md) and [Referential Integrity Constraints](../../user-guide/table-considerations.md).

To return information about other constraint types (as well as FOREIGN KEY constraints), query the [TABLE_CONSTRAINTS view](table_constraints.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the constraint. |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to |
| CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint |
| UNIQUE_CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the current constraint. |
| UNIQUE_CONSTRAINT_CATALOG | VARCHAR | Database of the unique constraint referenced by the current constraint. |
| UNIQUE_CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| UNIQUE_CONSTRAINT_SCHEMA | VARCHAR | Schema of the unique constraint referenced by the current constraint. |
| UNIQUE_CONSTRAINT_NAME | VARCHAR | Name of the unique constraint referenced by the current constraint. |
| MATCH_OPTION | VARCHAR | Match option for the constraint. |
| UPDATE_RULE | VARCHAR | Update Rule for the current constraint. |
| DELETE_RULE | VARCHAR | Delete Rule for the current constraint. |
| COMMENT | VARCHAR | Comment for the constraint. |
| CREATED | TIMESTAMP_LTZ | Date and time when the constraint was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the constraint was dropped. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: REMAINING_BALANCE_DAILY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/remaining_balance_daily.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# REMAINING_BALANCE_DAILY view

The REMAINING_BALANCE_DAILY view in the ORGANIZATION_USAGE schema can be used to return the daily remaining balance and on demand
consumption daily for an organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| CONTRACT_NUMBER | VARCHAR | Contract number for the organization. |
| DATE | DATE | The date of the FREE_USAGE_BALANCE or CAPACITY_BALANCE in the UTC time zone. |
| CURRENCY | VARCHAR | The currency of the FREE_USAGE_BALANCE or CAPACITY_BALANCE or ON_DEMAND_CONSUMPTION_BALANCE. |
| FREE_USAGE_BALANCE | NUMBER (38,2) | The amount of free usage in currency that is available for use as of the date. This is the end of day balance. |
| CAPACITY_BALANCE | NUMBER (38,2) | The amount of capacity in currency that is available for use as of the date. This is the end of day balance. |
| ON_DEMAND_CONSUMPTION_BALANCE | NUMBER (38,2) | The amount of consumption at on demand prices that will be invoiced given that all the free usage and capacity balances have been exhausted. This is a negative value (e.g. -250) until the invoice is paid. This is the end of day balance. |
| ROLLOVER_BALANCE | NUMBER (38,2) | The amount of rollover balance in currency that is available for use at the end of the date. At the end of a contract term, it is calculated as sum(AMOUNT) from the CONTRACT_ITEMS view - sum(USAGE_IN_CURRENCY) from the USAGE_IN_CURRENCY_DAILY view. |
| MARKETPLACE_CAPACITY_DRAWDOWN_BALANCE | NUMBER(38,2) | Amount of CAPACITY_BALANCE that is available for purchases in the Snowflake Marketplace. |

## Usage notes

* Latency for the view may be up to 72 hours.
* If multiple organizations draw down from the same capacity contract, only the primary organization can access this view. The primary
  organization is also known as the funding organization.
* On demand consumption balance resets after month close (typically on the 3rd or 4th day of the next month) after it is invoiced and paid.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments/credits or contract amendments
  between Snowflake organizations.
* Customers who signed a contract through a Snowflake reseller cannot access data in this view.
* Data is retained indefinitely.
* This view does not include data generated prior to June 2020. To obtain data before this date, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: REPLICATION_GROUP_REFRESH_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/replication_group_refresh_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# REPLICATION_GROUP_REFRESH_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query the refresh history for a specified
[replication or failover group](../../user-guide/account-replication-intro.md).

See also:
:   [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../functions/replication_group_refresh_history.md) (Information Schema table function)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPLICATION_GROUP_NAME | VARCHAR | Name of the secondary replication or failover group. |
| REPLICATION_GROUP_ID | NUMBER | Internal/system-generated identifier for the replication or failover group. |
| PHASE_NAME | VARCHAR | Current phase in the replication operation. For the list of phases, see the Usage notes. |
| START_TIME | TIMESTAMP_LTZ | Time when the replication operation began. |
| END_TIME | TIMESTAMP_LTZ | Time when the replication operation finished, if applicable. `NULL` if it is in progress. |
| JOB_UUID | VARCHAR | Query ID for the refresh job. |
| TOTAL_BYTES | VARIANT | A JSON object that provides detailed information about refreshed databases:   * `totalBytesToReplicate`: Total number of bytes expected to be replicated. * `bytesUploaded`: Actual number of bytes uploaded. * `bytesDownloaded`: Actual number of bytes downloaded. * `databases`: List of JSON objects containing the following fields for each member database:    + `name`: Name of the database.   + `totalBytesToReplicate`: Total bytes expected to be replicated for the database. |
| OBJECT_COUNT | VARIANT | A JSON object that provides detailed information about refreshed objects:   * `totalObjects`: Total number of objects in the replication or failover group. * `completedObjects`: Total number of objects completed. * `objectTypes`: List of JSON objects containing the following fields for each type:    + `objectType`: Type of object (for example users, roles, grants, warehouses, schemas, tables, columns, etc).   + `totalObjects`: Total number of objects of this type in the replication or failover group.   + `completedObjects`: Total number of objects of this type that were completed. |
| PRIMARY_SNAPSHOT_TIMESTAMP | TIMESTAMP_LTZ | Timestamp when the primary snapshot was created. |
| ERROR | VARIANT | NULL if the refresh operation is successful. If the refresh operation fails, returns a JSON object that provides detailed information about the error:   * `errorCode`: Error code of the failure. * `errorMessage`: Error message of the failure. |

## Usage notes

* Latency for the view may be up to 24 hours.

  To view real-time refresh progress, use the [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](../functions/replication_group_refresh_history.md) table function.

* Results are only returned for secondary failover or replication groups in the current account (the target account).
* The following is the list of phases in the order processed:

  | # | Phase name | Description |
  | --- | --- | --- |
  | 1 | `SECONDARY_SYNCHRONIZING_MEMBERSHIP` | The secondary replication or failover group receives information from the primary group about the objects included in the group, and updates its membership metadata. |
  | 2 | `SECONDARY_UPLOADING_INVENTORY` | The secondary replication or failover group sends an inventory of its objects in the target account to the primary group. |
  | 3 | `PRIMARY_UPLOADING_METADATA` | The primary replication or failover group creates a snapshot of metadata in the source account and sends it to the secondary group. |
  | 4 | `PRIMARY_UPLOADING_DATA` | The primary replication or failover group copies the files the secondary group needs to reconcile any deltas between the objects in the source and target accounts. |
  | 5 | `SECONDARY_DOWNLOADING_METADATA` | The secondary replication or failover group applies the snapshot of the metadata that was sent by the primary. The metadata updates are not applied atomically and instead applied over time. |
  | 6 | `SECONDARY_DOWNLOADING_DATA` | The secondary replication or failover group copies the files sent by the primary group to the target account. |
  | 7 | `COMPLETED` / `FAILED` / `CANCELED` | Refresh operation status. |

## Examples

To retrieve the refresh history for the secondary failover group `myfg`, execute the following statement:

```sqlexample
SELECT account_name, phase_name, start_time, end_time,
       total_bytes, object_count, error
  FROM SNOWFLAKE.ORGANIZATION_USAGE.REPLICATION_GROUP_REFRESH_HISTORY
  WHERE replication_group_name = 'MYFG';
```

To retrieve the last refresh record for each replication or failover group, execute the following statement:

```sqlexample
SELECT account_name, replication_group_name, phase_name,
       start_time, end_time,
       total_bytes, object_count, error,
       ROW_NUMBER() OVER (
         PARTITION BY replication_group_name
         ORDER BY end_time DESC
       ) AS row_num
  FROM SNOWFLAKE.ORGANIZATION_USAGE.REPLICATION_GROUP_REFRESH_HISTORY
  QUALIFY row_num = 1;
```

---
title: REPLICATION_GROUP_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/replication_group_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# REPLICATION_GROUP_USAGE_HISTORY view

The REPLICATION_GROUP_USAGE_HISTORY view in the ORGANIZATION_USAGE schema can be used to query the replication history for
replication and failover groups in your organization within a specified date range. The view includes the name of the
replication or failover group, credits consumed, and bytes transferred for replication.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this usage record. |
| REPLICATION_GROUP_NAME | VARCHAR | Name of the replication or failover group. |
| REPLICATION_GROUP_ID | NUMBER | Internal/system-generated identifier for the replication or failover group. |
| CREDITS_USED | NUMBER | Total number of credits used for replication during the USAGE_DATE. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for replication during the USAGE_DATE. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: REPLICATION_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/replication_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# REPLICATION_USAGE_HISTORY view

The REPLICATION_USAGE_HISTORY view in the ORGANIZATION_USAGE schema can
be used to query the replication history for databases in your organization
within a specified date range. The view includes
the database name, credits consumed, and bytes transferred for replication.

> **Note:**
>
> This view only displays replication usage for database replication.
> To view usage for replication using [replication and failover groups](../../user-guide/account-replication-intro.md),
> see the [REPLICATION_GROUP_USAGE_HISTORY view](replication_group_usage_history.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this usage record. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| CREDITS_USED | NUMBER | Total number of credits used for database replication during the USAGE_DATE. |
| BYTES_TRANSFERRED | VARCHAR | Number of bytes transferred for database replication during the USAGE_DATE. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: RESOURCE_MONITORS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/resource_monitors.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# RESOURCE_MONITORS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays the resource monitors that have been created in the accounts within the organization. It does not
include resource monitors from reader accounts.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column | Data Type | Description |
| --- | --- | --- |
| RESOURCE_MONITOR_ID | NUMBER | System identifier. |
| NAME | VARCHAR | Name of the resource monitor. |
| CREATED | TIMESTAMP_LTZ | Date and time when the resource monitor was created. |
| CREDIT_QUOTA | VARIANT | Monthly credit quota for the resource monitor. |
| USED_CREDITS | VARIANT | Number of credits used in the current monthly billing cycle by all the warehouses associated with the resource monitor. |
| REMAINING_CREDITS | FLOAT | Number of credits still available to use in the current monthly billing cycle. |
| OWNER | VARCHAR | Name of the role that owns the resource monitor. |
| NOTIFY | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, notifications are sent. |
| SUSPEND | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, assigned warehouses are suspended but currently running queries are allowed to complete. |
| SUSPEND_IMMEDIATE | NUMBER | Percentage of the credit quota. When consumption reaches this threshold, all assigned warehouses are suspended immediately, including those running queries. |
| WAREHOUSES | VARCHAR | Names of the warehouses that are associated with the resource monitor. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: ROLES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/roles.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ROLES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query a list of all roles defined in each account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROLE_ID | NUMBER | Internal/system-generated identifier for the role. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the role was created. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the role was deleted. |
| NAME | VARCHAR | Name of the role. |
| COMMENT | VARCHAR | Comment for the role. |
| OWNER | VARCHAR | Role with the OWNERSHIP privilege on the object. |
| ROLE_TYPE | VARCHAR | Either `ROLE`, `DATABASE_ROLE`, `INSTANCE_ROLE`, or `APPLICATION_ROLE`. |
| ROLE_DATABASE_NAME | VARCHAR | Name of the database that contains the database role if the role is a database role. |
| ROLE_INSTANCE_ID | NUMBER | Internal/system-generated identifier for the class instance that the role belongs to. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| IS_FROM_ORGANIZATION_USER_GROUP | BOOLEAN | If TRUE, the role was imported from an [organization user group](../../user-guide/organization-users.md). |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view does not include database roles for databases created from shares.

### Internal Snowflake role for Snowsight

The first time [Snowsight](../../user-guide/ui-snowsight.md) is accessed in an account, Snowflake creates the internal APPADMIN and
WORKSHEETS_APP_RL roles to support the web interface. These roles are used to cache query results in an internal stage in your account.
This cached data is encrypted and protected by the key hierarchy for the account. The limited privileges granted to these internal roles
only allow Snowsight to access the internal stage to store those results. Thes roles cannot list objects in your account or access
data in your tables. For more information, see [Getting started with Snowsight](../../user-guide/ui-snowsight-gs.md).

---
title: ROW_ACCESS_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/row_access_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# ROW_ACCESS_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each row access policy defined in an account.

Each row corresponds to a different row access policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_NAME | VARCHAR | Name of the row access policy. |
| POLICY_ID | NUMBER | Internal/system-generated identifier for the row access policy. |
| POLICY_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema in which the policy resides. |
| POLICY_SCHEMA | VARCHAR | Schema to which the row access policy belongs. |
| POLICY_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database in which the policy resides. |
| POLICY_CATALOG | VARCHAR | Database to which the row access policy belongs. |
| POLICY_OWNER | VARCHAR | Name of the role that owns the row access policy. |
| POLICY_SIGNATURE | VARCHAR | Type signature of the row access policy’s arguments. |
| POLICY_RETURN_TYPE | VARCHAR | Return value data type. |
| POLICY_BODY | VARCHAR | Row access policy definition. |
| POLICY_COMMENT | VARIANT | Comments entered for the row access policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the row access policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the row access policy was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| OPTIONS | VARIANT | The value for the EXEMPT_OTHER_POLICIES property in the policy. If set to `TRUE`, the column returns `{ "EXEMPT_OTHER_POLICIES: "TRUE" }`. If the property is set to `FALSE` or not set at all, the column returns NULL. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only returns rows if at least one row access policy has been created.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Example

Obtain all of the row access policies created in your account, ordered by the timestamp on which the policy was created:

> ```sqlexample
> select account_name, policy_name, policy_signature, created
> from row_access_policies
> order by created
> ;
> ```

---
title: SCHEMATA view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/schemata.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SCHEMATA view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each schema in an account except the READER_ACCOUNT_USAGE, and
INFORMATION_SCHEMA schemas.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema. |
| SCHEMA_NAME | VARCHAR | Name of the schema. |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the schema. |
| CATALOG_NAME | VARCHAR | Database that the schema belongs to. |
| SCHEMA_OWNER | VARCHAR | Name of the role that owns the schema. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| IS_TRANSIENT | VARCHAR | Whether the schema is transient. |
| IS_MANAGED_ACCESS | VARCHAR | Whether the schema is a managed access schema. |
| DEFAULT_CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| SQL_PATH | VARCHAR | Not applicable for Snowflake. |
| COMMENT | VARCHAR | Comment for the schema. |
| CREATED | TIMESTAMP_LTZ | Date and time when the schema was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the schema was dropped. |
| SCHEMA_TYPE | VARCHAR | Specifies the schema type. Valid values are: . . - STANDARD: normal schema. . - VERSIONED: versioned schema. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| SCHEMA_TYPE | VARCHAR | Type of schema. Possible values are `STANDARD` and `VERSIONED`. |
| VERSION_NAME | VARCHAR | Name of the schema if it is a versioned schema. NULL otherwise. |
| VERSIONED_SCHEMA_ID | NUMBER | Internal/system-generated identifier if the schema is a versioned schema. NULL, otherwise. |
| OBJECT_VISIBILITY | OBJECT | `OBJECT_VISIBILITY`  [Preview Feature](../../release-notes/preview-features.md) — Open  Available to all accounts.  This property controls the [discoverability of the objects](../../user-guide/ui-snowsight/object-visibility-universal-search.md) in the account, enabling users without explicit access privileges to find objects and request access. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SEARCH_OPTIMIZATION_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/search_optimization_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SEARCH_OPTIMIZATION_HISTORY view

The SEARCH_OPTIMIZATION_HISTORY view in the ORGANIZATION_USAGE schema
is used for querying
the [search optimization service](../../user-guide/search-optimization-service.md)
maintenance history for a specified table within a specified date range. The
information returned by the function includes the table name and credits
consumed each time a search optimization maintenance operation occurred.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date (in the UTC time zone) of this usage record. |
| CREDITS_USED | NUMBER | Number of credits billed for the search optimization service during the USAGE_DATE. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the search optimization service. |
| TABLE_NAME | VARCHAR | This is a system-generated alias that contains the ID of the table for which search optimization was enabled; that ID is embedded inside a string of the form “SEARCH OPTIMIZATION ON TABLE_ID: <optimized_table_id>”. For example, if you enable search optimization on a table named `accounts`, and if `accounts` has ID 1200, then the TABLE_NAME (alias) shown in this column will be “SEARCH OPTIMIZATION ON TABLE_ID: 1200”. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the search optimization service. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the search optimization service. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the search optimization service. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the search optimization service. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).

---
title: SEQUENCES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/sequences.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SEQUENCES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each sequence defined in an account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| SEQUENCE_ID | NUMBER | Internal/system-generated identifier for the sequence. |
| SEQUENCE_NAME | VARCHAR | Name of the sequence. |
| SEQUENCE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the sequence. |
| SEQUENCE_SCHEMA | VARCHAR | Schema that the sequence belongs to. |
| SEQUENCE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the sequence. |
| SEQUENCE_CATALOG | VARCHAR | Database that the sequence belongs to. |
| SEQUENCE_OWNER | VARCHAR | Name of the role that owns the sequence. |
| DATA_TYPE | VARCHAR | Data type of the sequence. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of the data type of the sequence. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision of the data type of the sequence. |
| NUMERIC_SCALE | NUMBER | Scale of the data type of the sequence. |
| START_VALUE | VARCHAR | Initial value of the sequence. |
| MINIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| NEXT_VALUE | VARCHAR | Next value that the sequence will produce. |
| INCREMENT | VARCHAR | Increment of the sequence generator. |
| CYCLE_OPTION | VARCHAR | Not applicable for Snowflake. |
| ORDERED | VARCHAR | If `YES`, the sequence has the ORDER property. If `NO`, the sequence has the NOORDER property. |
| CREATED | TIMESTAMP_LTZ | Date and time when the sequence was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the sequence was dropped. |
| COMMENT | VARCHAR | Comment for the sequence. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SESSION_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/session_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SESSION_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view provides the [session policies](../../user-guide/session-policies.md) in your account.

Each row in this view corresponds to a different session policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the session policy. |
| NAME | VARCHAR | Name of the session policy. |
| SCHEMA_ID | VARCHAR | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | VARCHAR | Schema to which the session policy belongs. |
| DATABASE_ID | VARCHAR | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | VARCHAR | Database to which the session policy belongs. |
| OWNER | VARCHAR | Name of the role that owns the session policy. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| SESSION_IDLE_TIMEOUT_MINS | NUMBER | Session idle timeout in minutes for the policy. |
| SESSION_UI_IDLE_TIMEOUT_MINS | NUMBER | UI session idle timeout in minutes for the policy. |
| COMMENT | VARCHAR | Comments entered for the session policy (if any). |
| CREATED | TIMESTAMP_LTZ | Date and time when the session policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the session policy was dropped. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SESSIONS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/sessions.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SESSIONS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view provides information on the session, including information on the authentication method to Snowflake and the
Snowflake login event. Snowflake returns one row for each session created over the last year.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| SESSION_ID | Number | The unique identifier for the current session. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the session was created. |
| USER_NAME | String | The user name of the user. |
| AUTHENTICATION_METHOD | String | The authentication method used to access Snowflake. |
| LOGIN_EVENT_ID | Number | The unique identifier for the login event. |
| CLIENT_APPLICATION_VERSION | String | The version number (e.g. 3.8.7) of the Snowflake-provided client application used to create the remote session to Snowflake. |
| CLIENT_APPLICATION_ID | String | The identifier for the Snowflake-provided client application used to create the remote session to Snowflake (e.g. JDBC 3.8.7) |
| CLIENT_ENVIRONMENT | String | The environment variables (e.g. operating system, OCSP mode) of the client used to create a remote session to Snowflake. |
| CLIENT_BUILD_ID | String | The build number (e.g. 41897) of the third-party client application used to create a remote session to Snowflake, if available. For example, a third-party Java application that uses the JDBC driver to connect to Snowflake. |
| CLIENT_VERSION | String | The version number (e.g. 47154) of the third-party client application that uses a Snowflake-provided client to create a remote session to Snowflake, if available. . |
| ACCESS_TIME | TIMESTAMP_LTZ | Date and time when the session was last used. |
| IS_OPEN | BOOLEAN | Whether the session is currently open (TRUE) or closed (FALSE). |
| CLOSED_REASON | String | The reason why a Snowflake session closed. NULL for sessions that are currently open. One of the following for closed sessions: DROP_USER, LOGOUT, FORCED_LOGOUT, ABANDONED, OAUTH_CRITICAL_CHANGE_INTEGRATION, DROP_ACCOUNT, OAUTH_CONSENT_REVOKED, TASK_COMPLETED, SFC_FORCED_LOGOUT. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The SESSIONS view does not currently track SQL API transient sessions.
* This view does not record the activity of internal users the system defines to perform various operations
  (e.g. maintain Snowsight worksheets).

---
title: SNAPSHOT_OPERATION_HISTORY view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/organization-usage/snapshot_operation_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SNAPSHOT_OPERATION_HISTORY view — *Deprecated*

This Organization Usage view provides information on operations performed on snapshots.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The timestamp at which the snapshot operation started. |
| END_TIME | TIMESTAMP_LTZ | The timestamp at which the snapshot operation ended. |
| SNAPSHOT_SET_ID | NUMBER | The local snapshot set ID. |
| SNAPSHOT_ID | VARCHAR | The unique identifier of snapshot being worked on. |
| OPERATION_TYPE | VARCHAR | Could be either of the below operations:   * CREATE * EXPIRE * RESTORE * ADD_LEGAL_HOLD * REMOVE_LEGAL_HOLD |
| QUERY_ID | VARCHAR | Internal system-generated identifier for the SQL statement. |

## Usage notes

* Latency for the view may be up to 360 minutes (6 hours).

---
title: SNAPSHOT_POLICIES view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/organization-usage/snapshot_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SNAPSHOT_POLICIES view — *Deprecated*

This Organization Usage view provides information on snapshot policies.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the snapshot policy. |
| NAME | VARCHAR | Name of the snapshot policy. |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the snapshot policy. |
| SCHEMA_NAME | VARCHAR | Schema that the snapshot policy belongs to. |
| CATALOG_ID | NUMBER | Internal system-generated identifier for the database of the snapshot policy. |
| CATALOG_NAME | VARCHAR | Database that the snapshot policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for snapshot creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after snapshot creation when snapshot should be expired. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if policy has retention lock; N otherwise.  Retention lock protects snapshots from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the snapshot policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the snapshot policy was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the snapshot policy was deleted. |
| COMMENT | VARCHAR | Comment for the snapshot policy. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: SNAPSHOT_SETS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/organization-usage/snapshot_sets.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SNAPSHOT_SETS view — *Deprecated*

This Organization Usage view provides information on snapshot sets.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal system-generated identifier for the snapshot set. |
| NAME | VARCHAR | Name of the snapshot set. |
| SCHEMA_ID | NUMBER | Internal system-generated identifier for the schema of the snapshot set. |
| SCHEMA_NAME | VARCHAR | Schema that the snapshot set belongs to. |
| CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the snapshot set. |
| CATALOG_NAME | VARCHAR | Database that the snapshot set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the snapshot set is snapshotting. |
| OBJECT_ID | NUMBER | ID of object that the snapshot set is snapshotting. |
| OBJECT_NAME | VARCHAR | Name of object that the snapshot set is snapshotting. |
| OBJECT_SCHEMA_ID | NUMBER | ID of schema that contains the object being snapshotted by this snapshot set. |
| OBJECT_SCHEMA_NAME | VARCHAR | Name of schema that contains the object being snapshotted by this snapshot set. |
| OBJECT_CATALOG_ID | NUMBER | ID of database that contains the object being snapshotted by this snapshot set. |
| OBJECT_CATALOG_NAME | VARCHAR | Name of database that contains the object being snapshotted by this snapshot set. |
| SNAPSHOT_POLICY_ID | NUMBER | ID of snapshot policy attached to this snapshot set. |
| SNAPSHOT_POLICY_NAME | VARCHAR | Name of snapshot policy attached to this snapshot set. |
| SNAPSHOT_POLICY_SCHEMA_ID | NUMBER | ID of the schema that contains the snapshot policy. |
| SNAPSHOT_POLICY_SCHEMA_NAME | VARCHAR | Name of the schema that contains the snapshot policy. |
| SNAPSHOT_POLICY_CATALOG_ID | NUMBER | ID of the database that contains the snapshot policy. |
| SNAPSHOT_POLICY_CATALOG_NAME | VARCHAR | Name of the database that contains the snapshot policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot set. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the snapshot set was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the snapshot set was last altered. |
| DELETED | TIMESTAMP_LTZ | Date and time when the snapshot set was deleted. |
| COMMENT | VARCHAR | Comment for the snapshot set. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: SNAPSHOTS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/organization-usage/snapshots.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# SNAPSHOTS view — *Deprecated*

This Organization Usage view provides information on snapshots.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the snapshot.  Note: this is not the local ID, this is the globally unique UUID of the Snapshot. |
| SNAPSHOT_SET_ID | NUMBER | ID of snapshot set that contains the snapshot. |
| SNAPSHOT_SET_NAME | VARCHAR | Name of snapshot set that contains the snapshot. |
| SNAPSHOT_SET_SCHEMA_ID | NUMBER | ID of schema that the snapshot set belongs to. |
| SNAPSHOT_SET_SCHEMA | VARCHAR | Name of schema that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG_ID | NUMBER | ID of database that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG | VARCHAR | Name of database that the snapshot set belongs to. |
| CREATED | TIMESTAMP_LTZ | Timestamp at which snapshot was created. |
| DELETED | TIMESTAMP_LTZ | Timestamp at which snapshot was deleted. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which snapshot will be expired. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | Y if snapshot is under legal hold; N otherwise.  This column isn’t displayed by the SHOW command, because the SHOW command output doesn’t include deleted objects. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: STAGE_STORAGE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/stage_storage_usage_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# STAGE_STORAGE_USAGE_HISTORY view

The STAGE_STORAGE_USAGE_HISTORY view in the ORGANIZATION_USAGE schema can be
used to query the average daily data storage usage, in bytes, for all
the Snowflake stages in your organization within the last 12 months.

The output includes storage for:

* Named internal stages.
* Default staging areas (for tables and users).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| ACCOUNT_LOCATOR | VARCHAR | Name of the account locator. |
| REGION | VARCHAR | Name of the region where the account is located. |
| USAGE_DATE | DATE | Date of this storage usage record. |
| AVERAGE_STAGE_BYTES | NUMBER | Number of bytes of stage storage used. |

## Usage notes

* Latency for the view may be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

---
title: STAGES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/stages.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# STAGES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each stage defined in an account.

Stages are named objects that can be used for loading/unloading data. For more information, see [CREATE STAGE](../sql/create-stage.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| STAGE_ID | NUMBER | Internal/system-generated identifier for the stage. |
| STAGE_NAME | VARCHAR | Name of the stage. |
| STAGE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the stage. |
| STAGE_SCHEMA | VARCHAR | Schema that the stage belongs to. |
| STAGE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the stage. |
| STAGE_CATALOG | VARCHAR | Database that the stage belongs to. |
| STAGE_URL | VARCHAR | If the stage is external, location of the stage; NULL if it is internal. |
| STAGE_REGION | VARCHAR | If the stage is external, region where the stage resides; NULL if it is internal. |
| STAGE_TYPE | VARCHAR | Type of stage (`Internal Named`, or `External Named`). |
| STAGE_OWNER | VARCHAR | Name of the role that owns the stage; NULL if it has been dropped. |
| COMMENT | VARCHAR | Comment for the stage. |
| CREATED | TIMESTAMP_LTZ | Date and time when the stage was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the stage was dropped. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| STORAGE_INTEGRATION | VARCHAR | The name of the storage integration associated with the stage; NULL for internal stages or stages that do not use a storage integration. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: STORAGE_DAILY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/storage_daily_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# STORAGE_DAILY_HISTORY view

The STORAGE_DAILY_HISTORY view in the ORGANIZATION_USAGE schema can be used to query the average daily storage usage, in bytes, for all accounts in the organization for the last 365 days (1 year).

Of the storage views that Snowflake provides, this view most closely reflects the storage that contributes to your bill at the account and organization level. Use it for high-level analysis and reporting of billed storage trends.

See also:
:   [DATABASE_STORAGE_USAGE_HISTORY view](../account-usage/database_storage_usage_history.md) , [STORAGE_USAGE view](../account-usage/storage_usage.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | The type of service, which can be one of `STORAGE`, `STORAGE_READER`. |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account (user-defined). |
| USAGE_DATE | DATE | The date (in the UTC time zone) on which the usage took place. |
| AVERAGE_BYTES | NUMBER | Average number of bytes of database storage and stage storage used on this date, including data in Time Travel and Fail-safe. |
| REGION | VARCHAR | ID of the Snowflake Region where the account is located. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account (system-defined). |
| CREDITS | NUMBER | Total number of storage credits used for the account on this date. (Calculated as AVERAGE_BYTES, converted to tebibytes, divided by the number of days in the month.) |

## Usage notes

* Latency for the view may be up to 120 minutes (2 hours).
* The data is retained for 365 days (1 year).
* For the authoritative record of storage charges, refer to your invoice.
* Other storage views, such as [DATABASE_STORAGE_USAGE_HISTORY view](../account-usage/database_storage_usage_history.md) and [STORAGE_USAGE view](../account-usage/storage_usage.md), use different measurement approaches and won’t match the values in this view.

---
title: STORAGE_LIFECYCLE_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/storage_lifecycle_policies.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# STORAGE_LIFECYCLE_POLICIES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays
[storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md)
in your organization.

Each row in this view corresponds to a different storage lifecycle policy.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | TEXT | Name of the storage lifecycle policy. |
| ID | NUMBER | Internal/system-generated identifier for the storage lifecycle policy. |
| SCHEMA_ID | TEXT | Internal/system-generated identifier for the schema in which the policy resides. |
| SCHEMA | TEXT | Schema to which the storage lifecycle policy belongs. |
| DATABASE_ID | TEXT | Internal/system-generated identifier for the database in which the policy resides. |
| DATABASE | TEXT | Database to which the storage lifecycle policy belongs. |
| OWNER | TEXT | Name of the role that owns the storage lifecycle policy. |
| SIGNATURE | TEXT | Type signature of the storage lifecycle policy’s arguments. |
| RETURN_TYPE | TEXT | Return value data type. |
| BODY | TEXT | Storage lifecycle policy definition. |
| COMMENT | TEXT | Comments entered for the storage lifecycle policy (if any). |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was created. |
| LAST_ALTERED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was last altered. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time when the storage lifecycle policy was dropped. |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, either ROLE or DATABASE_ROLE. |
| OPTIONS | OBJECT | Storage lifecycle policy options, including ARCHIVE_FOR_DAYS (number of days to keep data in current tier) and ARCHIVE_TIER (target storage tier). |

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: STORAGE_LIFECYCLE_POLICY_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/storage_lifecycle_policy_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# STORAGE_LIFECYCLE_POLICY_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays the aggregated execution history of
[storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies.md).
This view shows historical data from the past 12 months and only includes policy executions
that have completed successfully or with failures. The view doesn’t include queued, currently executing, or cancelled
policy executions.

Each row in this view corresponds to a different storage lifecycle policy execution.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| POLICY_DB | VARCHAR | The name of the database that contains the storage lifecycle policy. |
| POLICY_SCHEMA | VARCHAR | The name of the schema that contains the storage lifecycle policy. |
| POLICY_NAME | VARCHAR | The name of the storage lifecycle policy. |
| REF_ENTITY_DB | VARCHAR | The name of the database that contains the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_SCHEMA | VARCHAR | The name of the schema that contains the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_NAME | VARCHAR | The name of the object that the storage lifecycle policy is attached to. |
| REF_ENTITY_DOMAIN | VARCHAR | The domain (type) of the object that the storage lifecycle policy is attached to; for example, Table. |
| STATE | VARCHAR | The aggregated state of the storage lifecycle policy execution: SUCCEEDED or FAILED (completed executions only). |
| START_TIME | TIMESTAMP_LTZ | Earliest timestamp of when any task in the storage lifecycle policy execution started. |
| END_TIME | TIMESTAMP_LTZ | Latest timestamp of when any task in the storage lifecycle policy execution completed. |
| EXECUTION_RESULT | VARIANT | JSON object containing detailed results for each task type in the storage lifecycle policy execution. The object can be of type EXPIRE, ARCHIVE, or EXPIRE_ARCHIVE ARCHIVE. Each nested object contains: start_time, end_time, state, and error details. |
| POLICY_BODY | VARCHAR | The body of the storage lifecycle policy. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view contains historical data for the past 12 months (one year).
* The view only shows completed policy executions. It doesn’t include queued, currently executing, or cancelled policy executions.

---
title: TABLE_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/table_constraints.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TABLE_CONSTRAINTS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each table constraint that is defined for the tables in an account.

This view returns information about the following constraint types:

* PRIMARY KEY
* FOREIGN KEY
* UNIQUE

For general information about constraints, see [Constraints](../constraints.md).

See also:
:   [REFERENTIAL_CONSTRAINTS view](referential_constraints.md)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_ID | NUMBER | Internal/system-generated identifier for the constraint. |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint. |
| CONSTRAINT_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the constraint. |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to. |
| CONSTRAINT_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the constraint. |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to. |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the table that the constraint belongs to. |
| TABLE_NAME | VARCHAR | Name of the current table. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the current table. |
| TABLE_SCHEMA | VARCHAR | Name of the schema for the current table. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the current table. |
| TABLE_CATALOG | VARCHAR | Name of the database for the current table. |
| CONSTRAINT_TYPE | VARCHAR | Type of the constraint (`PRIMARY KEY`, `UNIQUE KEY`, or `FOREIGN KEY`). |
| IS_DEFERRABLE | VARCHAR | Whether evaluation of the constraint can be deferred; by default, always `N`. |
| INITIALLY_DEFERRED | VARCHAR | Whether evaluation of the constraint is deferrable and initially deferred; by default, always `Y`. |
| ENFORCED | VARCHAR | Whether the constraint is enforced; by default, always `N`. |
| COMMENT | VARCHAR | Comment for the constraint. |
| CREATED | TIMESTAMP_LTZ | Date and time when the constraint was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the constraint was dropped. |
| RELY | VARCHAR | Whether a constraint in NOVALIDATE mode is taken into account during query rewrite. For details, see [Constraint properties](../sql/create-table-constraint.md). |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: TABLE_STORAGE_METRICS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/table_storage_metrics.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TABLE_STORAGE_METRICS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays table-level storage utilization information, which is used to calculate the storage billing for each
table in the account, including tables that have been dropped, but
are still incurring storage costs.

In addition to table metadata, the view displays the number of storage bytes billed for each table. Snowflake breaks down the bytes into the
following categories:

* Active bytes, representing data in the table that can be queried.
* Deleted bytes that are still accruing storage charges because they have not been purged yet from the system. These bytes are classified
  into the following sub-categories:

  + Bytes in Time Travel (i.e. recently deleted, but still within the Time Travel retention period for the table).
  + Bytes in Fail-safe (i.e. deleted bytes that are past the Time Travel retention period, but within the Fail-safe period for the table).
  + Bytes retained for clones (i.e. deleted bytes that are no longer in Time Travel or Fail-safe, but are still retained because clones of the table reference the bytes).

In other words, rows are maintained in this view until the corresponding tables are no longer billed for any storage, regardless of various
states that the data in the tables may be in (i.e. active, Time Travel, Fail-safe, or retained for clones).

For more details about data storage in tables, see [Data storage considerations](../../user-guide/tables-storage-considerations.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | Internal/system-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema of the table. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database of the table. |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| CLONE_GROUP_ID | NUMBER | Unique identifier for the oldest clone ancestor of this table. Same as ID if the table is not a clone. |
| IS_TRANSIENT | VARCHAR | ‘YES’ if table is transient or temporary, otherwise ‘NO’. Transient and temporary tables have no Fail-safe period. |
| ACTIVE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the active state for the table. For Iceberg table storage, active bytes aren’t billed to *Iceberg* tables. For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md). |
| TIME_TRAVEL_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Time Travel state for the table. |
| FAILSAFE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Fail-safe state for the table. |
| RETAINED_FOR_CLONE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are retained after deletion because they are referenced by one or more clones of this table, or by [WORM backups](../../user-guide/backups.md) that contain the table. |
| DELETED | BOOLEAN | TRUE if table has been dropped or recreated. |
| TABLE_CREATED | TIMESTAMP_LTZ | Date and time when the table was created. |
| TABLE_DROPPED | TIMESTAMP_LTZ | Date and time when the table was dropped. NULL if table has not been dropped. |
| TABLE_ENTERED_FAILSAFE | TIMESTAMP_LTZ | Date and time when the table, if dropped, entered the Fail-safe state, or NULL. In this state, the table cannot be restored using UNDROP. For transient tables, which aren’t recoverable using Fail-safe, this column indicates when the time travel retention period has passed. |
| SCHEMA_CREATED | TIMESTAMP_LTZ | Date and time when the schema for the table was created. |
| SCHEMA_DROPPED | TIMESTAMP_LTZ | Date and time when the schema for the table was dropped. |
| CATALOG_CREATED | TIMESTAMP_LTZ | Date and time when the database for the table was created. |
| CATALOG_DROPPED | TIMESTAMP_LTZ | Date and time when the database for the table was dropped. |
| COMMENT | VARCHAR | Comment for the table. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| ARCHIVE_STORAGE_COOL_ACTIVE_BYTES | NUMBER | The number of bytes in the cool storage tier owned by (and billed to) this table that are in the active state for the table. |
| ARCHIVE_STORAGE_COLD_ACTIVE_BYTES | NUMBER | The number of bytes in the cold storage tier owned by (and billed to) this table that are in the active state for the table. |
| ARCHIVE_STORAGE_COOL_TIME_TRAVEL_BYTES | NUMBER | The number of bytes in the cool storage tier owned by (and billed to) this table that are in the Time Travel state for the table. |
| ARCHIVE_STORAGE_COLD_TIME_TRAVEL_BYTES | NUMBER | The number of bytes in the cold storage tier owned by (and billed to) this table that are in the Time Travel state for the table. |
| ARCHIVE_STORAGE_COOL_FAILSAFE_BYTES | NUMBER | The number of bytes owned by (and billed to) this table in the COOL storage tier that are in the Fail-safe state for the table. |
| ARCHIVE_STORAGE_COLD_FAILSAFE_BYTES | NUMBER | The number of bytes owned by (and billed to) this table in the COLD storage tier that are in the Fail-safe state for the table. |
| ARCHIVE_STORAGE_COOL_EARLY_DELETION_PENALTY_BYTES | NUMBER | The number of penalty bytes deleted early and billed for) that are in the COOL storage tier. For more information, see [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |
| ARCHIVE_STORAGE_COLD_EARLY_DELETION_PENALTY_BYTES | NUMBER | The number of penalty bytes deleted early and billed for) that are in the COLD storage tier. For more information, see [minimum storage duration charges](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). |

## Usage notes

* Latency for the view may be up to 24 hours.
* > **Note:**
  >
  > With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
  > this view includes new columns for storage lifecycle policies.
  > To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
  > ```

* `ID` and `CLONE_GROUP_ID`:

  > + `ID` does not change for a table throughout its lifecycle, including if the table is renamed or dropped.
  > + `CLONE_GROUP_ID` is the ID of the oldest ancestor of a clone, including if the table has been dropped, but is still accruing storage costs. For example:
  >
  >   > 1. Table `t2` is cloned from `t1`.
  >   > 2. Table `t3` is cloned from `t2`.
  >
  >   All three tables list the `ID` for `t1` as their `CLONE_GROUP_ID`, even if `t1` is dropped and eventually purged from Snowflake.
  > + If the IDs are identical, the table is not a clone.
  > + Storage bytes are always owned by, and therefore billed to, the table where the bytes were initially added. If the table is then cloned, storage metrics for these initial bytes never transfer
  >   to the clones, even if the bytes are deleted from the source table.
* Cloned tables share the same underlying storage (at the micro-partition level) until either the original table or cloned table is modified. With each change made to either table, the table takes
  “ownership” of the changed bytes.
* Dropped tables are displayed in the view as long as they still incur storage costs:

  > + Dropped tables retain their active storage metrics, indicating how many bytes will be active if the table is restored.
  > + Dropped tables in the Time Travel retention period for the table can be restored using the UNDROP command.
  > + Dropped tables in Fail-safe (`TABLE_ENTERED_FAILSAFE` is not `NULL`) will potentially display `NULL` values in most columns, except for:
  >
  >   > ID columns:
  >   > :   `ID` , `CLONE_GROUP_ID`
  >   >
  >   > Bytes columns:
  >   > :   `ACTIVE_BYTES` , `TIME_TRAVEL_BYTES` , `FAILSAFE_BYTES` , `RETAINED_FOR_CLONE_BYTES`
  >
  >   These tables cannot be restored using the UNDROP command.
* When data is deleted from a table with a Time Travel retention period of 0 days, asynchronous background processes purge the active bytes
  or move them directly into Fail-safe storage, depending on the table type. This may take a short time to complete. During that time, the
  `TIME_TRAVEL_BYTES` column may contain a non-zero value even when the Time Travel retention period is 0 days.
* `FAILSAFE_BYTES` denotes bytes that have passed beyond Time Travel. All such bytes are billed to the current table.
* If multiple rows have the same value in the `TABLE_NAME` column, this indicates that multiple versions of the table exist. A version is created each time a table is dropped and a new table
  with the same name is created, including when a [CREATE OR REPLACE TABLE](../sql/create-table.md) command is issued on an existing table. Note that the current version will have a
  `NULL` value for the `TABLE_DROPPED` column; all other versions will have a timestamp value. This is important to note because each version of a table incurs storage costs associated with
  Time Travel (and Fail-safe, if the table is permanent).
* Any data in the `DELETED` column prior to August 2018 may not be accurate.
* In some cases, active bytes might include bytes for data in a dropped column. For more information,
  see the [usage notes](../sql/alter-table.md) for ALTER TABLE.
* For Iceberg tables:

  + Snowflake doesn’t bill for [Iceberg table](../../user-guide/tables-iceberg.md) storage when the table uses
    an external volume that you manage. However, if the table uses
    [Snowflake Storage](../../user-guide/tables-iceberg-internal-storage.md) (`EXTERNAL_VOLUME = SNOWFLAKE_MANAGED`),
    Snowflake charges for the storage.
    For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md).
  + If the table is externally managed,
    this view might display inaccurate storage utilization information.

---
title: TABLES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/tables.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TABLES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each table and view in an account.

See also:
:   [COLUMNS view](columns.md) , [VIEWS view](views.md), [TABLES view](../info-schema/tables.md) (Information Schema)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal, Snowflake-generated identifier for the table. |
| TABLE_NAME | VARCHAR | Name of the table. |
| TABLE_SCHEMA_ID | NUMBER | Internal, Snowflake-generated identifier of the schema for the table. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal, Snowflake-generated identifier of the database for the table. |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the table. |
| TABLE_TYPE | VARCHAR | Indicates the table type. Valid values are `BASE TABLE`, `TEMPORARY TABLE`, `EXTERNAL TABLE`, `EVENT TABLE`, `VIEW`, or `MATERIALIZED VIEW`. |
| IS_TRANSIENT | VARCHAR | Indicates whether the table is transient. |
| CLUSTERING_KEY | VARCHAR | Column(s) and/or expression(s) that comprise the clustering key for the table. |
| ROW_COUNT | NUMBER | Number of rows in the table. |
| BYTES | NUMBER | Number of bytes accessed by a scan of the table. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| SELF_REFERENCING_COLUMN_NAME | VARCHAR | Not applicable for Snowflake. |
| REFERENCE_GENERATION | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_NAME | VARCHAR | Not applicable for Snowflake. |
| IS_INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_TYPED | VARCHAR | Not applicable for Snowflake. |
| COMMIT_ACTION | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Date and time when the table was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| DELETED | TIMESTAMP_LTZ | Date and time when the table was dropped. |
| AUTO_CLUSTERING_ON | VARCHAR | Status of Automatic Clustering for a table. For details, see [Viewing the Automatic Clustering status for a table](../../user-guide/tables-auto-reclustering.md). |
| COMMENT | VARCHAR | Comment for the table. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| IS_ICEBERG | VARCHAR | Indicates whether the table is an [Iceberg table](../../user-guide/tables-iceberg.md). Valid values are `YES` or `NO`. |
| IS_DYNAMIC | VARCHAR | Indicates whether the table is a [dynamic table](../../user-guide/dynamic-tables-about.md). Valid values are `YES` or `NO`. |
| IS_HYBRID | VARCHAR | Indicates whether the table is a [hybrid table](../../user-guide/tables-hybrid.md). Valid values are `YES` or `NO`. |
| ARCHIVE_STORAGE_COOL_ROW_COUNT | NUMBER | The number of rows that are in the COOL storage tier. |
| ARCHIVE_STORAGE_COOL_BYTES | NUMBER | The number of bytes accessed by retrieving data from the COOL storage tier. |
| ARCHIVE_STORAGE_COLD_ROW_COUNT | NUMBER | The number of rows that are in the COLD storage tier. |
| ARCHIVE_STORAGE_COLD_BYTES | NUMBER | The number of bytes accessed by retrieving data from the COLD storage tier. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command executed by a user who holds the MANAGE GRANTS privilege.
* Querying the `SUM(BYTES)` for a table does not represent the total storage usage, because the amount does not include Time Travel and Fail-safe usage.
* Using the value in the LAST_ALTERED column for Time Travel is *not* recommended and can return unexpected results for the following
  reaons:

  + Time Travel can only be used to query historical data modified by a [DML operation](../../user-guide/data-time-travel.md).
  + The LAST_ALTERED column inludes both DML and DDL operations (see the next usage note).
  + For DML operations, the value in the LAST_ALTERED column is the timestamp at the beginning of the statement execution rather than
    the time of the commit of the transaction containing this statement.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.
* The LAST_ALTERED column does not necessarily indicate the last refreshed time for external tables.
  To retrieve the last refreshed time for an auto-refreshed external table, you can use the
  [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](../functions/system_external_table_pipe_status.md) function, which returns
  information such as the timestamp of the last file Snowflake has registered.

* > **Note:**
  >
  > With [BCR-2127](../../release-notes/bcr-bundles/2025_07/bcr-2127.md),
  > this view includes new columns for storage lifecycle policies.
  > To view storage lifecycle policy columns, you must enable the 2025_07 behavior change bundle
  > in your account.
  >
  > To [enable this bundle in your account](../../release-notes/bcr-bundles/managing-behavior-change-releases.md),
  > execute the following statement:
  >
  > ```sqlexample
  > SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_07');
  > ```

## Examples

Retrieve the total size (in bytes) of all active tables in all schemas in your account:

> ```sqlexample
> SELECT account_name, table_schema, SUM(bytes)
>     FROM SNOWFLAKE.ORGANIZATION_USAGE.TABLES
>     WHERE deleted IS NULL
>     GROUP BY table_schema;
> ```

---
title: TAG_REFERENCES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/tag_references.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TAG_REFERENCES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to identify the associations between objects and tags.

This view only records the direct relationship between the object and the tag. [Tag inheritance](../../user-guide/object-tagging/inheritance.md) is not included in this view.

The view is complementary to the information schema table function [TAG_REFERENCES](../functions/tag_references.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| TAG_DATABASE | VARCHAR | The database in which the tag is set. |
| TAG_SCHEMA | VARCHAR | The schema in which the tag is set. |
| TAG_ID | NUMBER | Internal/system-generated identifier for the tag. Note that for system tags this value is NULL. |
| TAG_NAME | VARCHAR | The name of the tag. This is the `key` in the `key = 'value'` pair of the tag. |
| TAG_VALUE | VARCHAR | The value of tag. This is the `'value'` in the `key = 'value'` pair of the tag. |
| OBJECT_DATABASE | VARCHAR | Database name of the referenced object for database and schema objects. If the object is not a database or schema object, the value is empty. |
| OBJECT_SCHEMA | VARCHAR | Schema name of the referenced object (for schema objects). If the referenced object is not a schema object (e.g. warehouse), this value is empty. |
| OBJECT_ID | NUMBER | Internal identifier of the referenced object. |
| OBJECT_NAME | VARCHAR | Name of the referenced object if the tag association is on the object. If the tag association is on a column, Snowflake returns the parent table name. |
| OBJECT_DELETED | TIMESTAMP_LTZ | Date and time when the associated or parent object was dropped. |
| DOMAIN | VARCHAR | Domain of the reference object (e.g. table, view) if the tag association is on the object. For columns, the domain is COLUMN if the tag association is on a column. For more information, see [supported domains](../functions/tag_references.md). |
| COLUMN_ID | NUMBER | The local identifier of the reference column; not applicable if the tag association is not a column. |
| COLUMN_NAME | VARCHAR | Name of the referenced column; not applicable if the tag association is not a column. |
| APPLY_METHOD | VARCHAR | Specifies how the tag got assigned to the object.   * `CLASSIFIED`: The tag was automatically applied to a column that was classified as containing sensitive data. See [About tag mapping](../../user-guide/classify-auto.md). * `MANUAL`: Someone manually set the tag on the object using a CREATE <object> command or ALTER <object> command. See [Set a tag](../../user-guide/object-tagging/work.md). * `PROPAGATED`: The tag was automatically propagated from one object to another. See [Automatic tag propagation with user-defined tags](../../user-guide/object-tagging/propagation.md). * `NULL`: Legacy record. * `NONE`: Legacy record. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not contain information about columns that have been deleted.
* The TAG_DATABASE_ID column is not included in this view. To obtain this value in your query result, perform a JOIN operation with the
  [TAGS view](../account-usage/tags.md).

## Examples

Return the tag references for your Snowflake account:

> ```sqlexample
> select account_name, tag_name, tag_value, domain, object_id
> from snowflake.organization_usage.tag_references
> order by tag_name, domain, object_id;
> ```

Return the active objects that have tag associations in your Snowflake account. The addition of the specified WHERE clause filters the
objects that are deleted:

> ```sqlexample
> select account_name, tag_name, tag_value, domain, object_id
> from snowflake.organization_usage.tag_references
> where object_deleted is null
> order by tag_name, domain, object_id;
> ```

---
title: TAGS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/tags.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TAGS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view lists the tags in an account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| TAG_ID | NUMBER | The local identifier of a tag. |
| TAG_NAME | TEXT | The name of a tag. |
| TAG_SCHEMA_ID | NUMBER | The local identifier of the tag schema. |
| TAG_SCHEMA | TEXT | The name of schema in which the tag exists. |
| TAG_DATABASE_ID | NUMBER | The local identifier of the database in which the tag exists. |
| TAG_DATABASE | TEXT | The name of the database in which the tag exists. |
| TAG_OWNER | TEXT | The name of the role that owns the tag. |
| TAG_COMMENT | VARIANT | Comments for the tag, if any. |
| CREATED | TIMESTAMP_LTZ | Date and time when the tag was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the tag was dropped, or the date and time when its parents were dropped. |
| ALLOWED_VALUES | VARIANT | Specifies the possible string values that can be assigned to the tag when the tag is set on an [object](../../user-guide/object-tagging/introduction.md) or NULL if the tag does not have any specified allowed values. For details, see [Set a list of allowed tag values](../../user-guide/object-tagging/work.md). |
| OWNER_ROLE_TYPE | TEXT | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| PROPAGATE | VARCHAR | Indicates whether the tag is configured for automatic propagation. Possible values are the following:   * NULL — Tag is not propagated. * `ON_DEPENDENCY` — Tag is propagated when there is an object dependency (for example, creating a view from a tagged table). * `ON_DATA_MOVEMENT` — Tag is propagated when there is data movement (for example, using a CTAS statement to create a table   from a tagged table). * `ON_DEPENDENCY_AND_DATA_MOVEMENT` — Tag is propagated for both object dependencies and data movement. |
| ON_CONFLICT | VARCHAR | If the tag is configured for automatic propagation, indicates what happens when the value of the tag being propagated conflicts with the value that was specified when the tag was manually applied to the same object. For more information, see [Tag propagation conflicts](../../user-guide/object-tagging/propagation.md). |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Return the tag references for your Snowflake account:

> ```sqlexample
> select * from snowflake.organization_usage.tags
> order by tag_name;
> ```

---
title: TASK_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/task_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TASK_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view enables you to retrieve the history of [task](../../user-guide/tasks-intro.md) usage.
The view displays one row for each run of a task in the history.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the task. |
| QUERY_TEXT | VARCHAR | Text of the SQL statement. |
| CONDITION_TEXT | VARCHAR | Text of WHEN condition the task evaluates when determining whether to run. |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the task. |
| TASK_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the task. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the task. |
| TASK_DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contains the task. |
| SCHEDULED_TIME | TIMESTAMP_LTZ | Time when the task is/was scheduled to start running. Note that we make a best effort to ensure absolute precision, but only guarantee that tasks do not execute *before* the scheduled time. |
| COMPLETED_TIME | TIMESTAMP_LTZ | Time when the task completed. |
| STATE | VARCHAR | Status of the completed task: SUCCEEDED, FAILED, CANCELLED, or SKIPPED. Note that the view does not return SCHEDULED or EXECUTING task runs. To retrieve the task history details for runs in a scheduled or executing state, query the [TASK_HISTORY](../functions/task_history.md) table function in the Information Schema. The timed-out tasks always have a `FAILED` state in the task history. |
| RETURN_VALUE | VARCHAR | Value set for the predecessor task in a [task graph](../../user-guide/tasks-graphs.md). The return value is explicitly set by calling the [SYSTEM$SET_RETURN_VALUE](../functions/system_set_return_value.md) function by the predecessor task. |
| QUERY_ID | VARCHAR | ID of the SQL statement executed by the task. Can be joined with the QUERY_HISTORY view for additional details about the execution of the statement or stored procedure. |
| QUERY_START_TIME | TIMESTAMP_LTZ | Time when the query in the task definition started to run. This timestamp aligns with the start time for the query returned by QUERY_HISTORY. |
| ERROR_CODE | VARCHAR | Error code, if the statement returned an error. |
| ERROR_MESSAGE | VARCHAR | Error message, if the statement returned an error. |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the task graph that was run, or is scheduled to be run. Each incremental increase in the value represents one or more modifications to tasks in the task graph. If the root task is recreated (using CREATE OR REPLACE TASK), then the version number restarts from 1. |
| RUN_ID | NUMBER | Time when the standalone or root task in a task graph is/was originally scheduled to start running. Format is epoch time (in milliseconds). . . *Original* scheduled time refers to rare instances when the system may reschedule the same task to run at a different time to retry it or rebalance the load. If that happens, RUN_ID shows the original scheduled run time and SCHEDULED_TIME shows the rescheduled run time. . . Note that RUN_ID may not be a unique identifier for the current task/graph run prior to retry. You may use GRAPH_RUN_GROUP_ID column as a replacement for RUN_ID. |
| ROOT_TASK_ID | VARCHAR | Unique identifier for the root task in a task graph. This ID matches the ID column value in the SHOW TASKS output for the same task. |
| SCHEDULED_FROM | VARCHAR | One of:  * `SCHEDULE`: The task was scheduled to run normally, as described in SCHEDULE or AFTER clauses of [CREATE TASK](../sql/create-task.md). * `EXECUTE_TASK`: The task was scheduled to run with [EXECUTE TASK](../sql/execute-task.md). * `MANUAL RETRY`: The task was scheduled to run with [EXECUTE TASK … RETRY LAST](../sql/execute-task.md). * `AUTOMATIC RETRY`: The task was configured to retry on failure and the previous execution failed. For more information, see [Automatically retry failed task runs](../../user-guide/tasks-intro.md). * `TRIGGER` : The task was run because the stream, in the `WHEN` clause of the task, contained new data.  For runs of child tasks in a task graph, the column returns the same value as the root task run. |
| ATTEMPT_NUMBER | NUMBER | Integer representing the number of attempts to run this task. Initially one. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| CONFIG | VARCHAR | Displays the graph level configuration if set for the root task, otherwise displays NULL. |
| QUERY_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the canonicalized SQL text. |
| QUERY_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_HASH`. |
| QUERY_PARAMETERIZED_HASH | VARCHAR | The [hash value](../../user-guide/query-hash.md) computed based on the parameterized query. |
| QUERY_PARAMETERIZED_HASH_VERSION | NUMBER | The [version of the logic](../../user-guide/query-hash.md) used to compute `QUERY_PARAMETERIZED_HASH`. |
| GRAPH_RUN_GROUP_ID | VARCHAR | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same GRAPH_RUN_GROUP_ID. The combination of GRAPH_RUN_GROUP_ID, and ATTEMPT_NUMBER can be used to uniquely identify a graph run. |
| BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

## Usage notes

* Latency for the view may be up to 24 hours.

* For increased performance, filter queries on the COMPLETED_TIME or SCHEDULED_TIME column.

## Examples

Retrieve records for the 10 most recent completed task runs:

> ```sqlexample
> SELECT account_name, query_text, completed_time
> FROM snowflake.organization_usage.task_history
> ORDER BY completed_time DESC
> LIMIT 10;
> ```

Retrieve records for task runs completed in the past hour:

> ```sqlexample
> SELECT account_name, query_text, completed_time
> FROM snowflake.organization_usage.task_history
> WHERE completed_time > DATEADD(hours, -1, CURRENT_TIMESTAMP());
> ```

---
title: TASK_VERSIONS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/task_versions.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TASK_VERSIONS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view enables you to retrieve the history of [task versions](../../user-guide/tasks-intro.md). The returned rows
indicate the tasks that comprised a [task graph](../../user-guide/tasks-graphs.md) and their properties at a given time.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROOT_TASK_ID | TEXT | Unique identifier for the root task in a DAG. This ID matches the ID column value in the SHOW TASKS output for the same task. Matches ROOT_TASK_ID in [COMPLETE_TASK_GRAPHS view](complete_task_graphs.md) and [TASK_HISTORY view](task_history.md). |
| GRAPH_VERSION | NUMBER | Integer identifying the version of the task. Matches GRAPH_VERSION in [COMPLETE_TASK_GRAPHS view](complete_task_graphs.md). |
| GRAPH_VERSION_CREATED_ON | TIMESTAMP_LTZ | Date and time when this version of the task graph was saved. |
| NAME | TEXT | Name of the task. |
| ID | TEXT | Unique identifier for each task. Note that recreating a task (using CREATE OR REPLACE TASK) essentially creates a new task, which has a new ID. |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database that contained the task. |
| DATABASE_NAME | TEXT | Name of the database in which the task is stored. |
| SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contained the task. |
| SCHEMA_NAME | TEXT | Name of the schema in which the task is stored. |
| OWNER | TEXT | Role that owns the task (that is, has the OWNERSHIP privilege on the task). |
| COMMENT | TEXT | Comment for the task. |
| WAREHOUSE_NAME | TEXT | Warehouse that provides the required resources to run the task. |
| SCHEDULE | TEXT | Schedule for running the task. Displays NULL if no schedule is specified. |
| PREDECESSORS | ARRAY | JSON array of any tasks identified in the AFTER parameter for the task (that is, predecessor tasks). When run successfully to completion, these tasks trigger the current task. Individual task names in the array are fully qualified (that is, include the container database and schema names). Displays an empty array if the task has no predecessor. |
| STATE | TEXT | Current state of the task: `started` or `suspended`. `NULL` for root tasks (tasks with no predecessors). |
| DEFINITION | TEXT | SQL statements executed when the task runs. |
| CONDITION_TEXT | TEXT | Condition specified in the WHEN clause for the task. |
| ALLOW_OVERLAPPING_EXECUTION | BOOLEAN | For root tasks in a DAG, displays TRUE if overlapping execution of the DAG is explicitly allowed. For child tasks in a DAG, displays NULL. |
| ERROR_INTEGRATION | TEXT | Name of the notification integration used to access Amazon Simple Notification Service (SNS), Google Pub/Sub, or Microsoft Azure Event Grid to relay error notifications for the task. |
| LAST_COMMITTED_ON | TIMESTAMP_LTZ | Timestamp when a version of the task was last set. If no version has been set (that is, if the task has not been resumed or manually executed after it was created), the value is NULL. |
| LAST_SUSPENDED_ON | TIMESTAMP_LTZ | Timestamp when the task was last suspended. If the task has not been suspended yet, the value is NULL. |
| TARGET_COMPLETION_INTERVAL | TEXT | The window of time when the task should perform. Only used for serverless tasks. Optional for serverless tasks, required for [serverless triggered tasks](../../user-guide/tasks-intro.md). |
| SCHEDULING_MODE | TEXT | Reserved for future functionality. Displays UNKNOWN. |

## Usage notes

Latency for the view may be up to 24 hours.

## Examples

Retrieve the tasks from a specific task graph based on the ROOT_TASK_ID and GRAPH_VERSION:

> ```sqlexample
> SELECT *
> FROM snowflake.organization_usage.task_versions
> WHERE ROOT_TASK_ID = 'afb36ccc-. . .-b746f3bf555d' AND GRAPH_VERSION = 3;
> ```

Retrieve the task runs for a particular task graph and its descendant tasks from task_history, with additional task information from
task_versions.

> ```sqlexample
> SELECT
> task_history.* rename state AS task_run_state,
> task_versions.state AS task_state,
> task_versions.graph_version_created_on,
> task_versions.warehouse_name,
> task_versions.comment,
> task_versions.schedule,
> task_versions.predecessors,
> task_versions.allow_overlapping_execution,
> task_versions.error_integration
> FROM snowflake.organization_usage.task_history
> JOIN snowflake.organization_usage.task_versions using (root_task_id, graph_version)
> WHERE task_history.ROOT_TASK_ID = 'afb36ccc-. . .-b746f3bf555d'
> ```

---
title: TRUST_CENTER_FINDINGS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/trust_center_findings.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TRUST_CENTER_FINDINGS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view shows security violations discovered by [Trust Center scanners](../../user-guide/trust-center/overview.md).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column | Data Type | Description |
| --- | --- | --- |
| ID | NUMBER | System identifier of the account that had the finding. |
| PROVIDER_ID | VARCHAR | System identifier of the provider of the scanner package. |
| SCANNER_PACKAGE_ID | VARCHAR | System identifier of the scanner package. |
| SCANNER_ID | VARCHAR | System identifier of the scanner. |
| SEVERITY | VARCHAR | Severity of the finding, as assigned by the scanner [LOW, MEDIUM, HIGH, CRITICAL]. |
| STATE | VARCHAR | State of the finding [OPEN, RESOLVED, RESOLVED MANUALLY]. |
| CREATED_ON | TIMESTAMP_LTZ | The time at which the finding was initially created. |
| UPDATED_ON | TIMESTAMP_LTZ | The time at which the finding was last updated. |

## Usage notes

* Latency for the view may be up to 24 hours.

---
title: TYPES view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/types.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# TYPES view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each [user-defined type](../data-types-user-defined.md)
defined in an account.

See also:
:   [TYPES view](../info-schema/types.md) (Information Schema) ,
    [TYPES view](../account-usage/types.md) (Account Usage)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data type | Description |
| --- | --- | --- |
| TYPE_ID | NUMBER | Internal/system-generated identifier for the type. |
| TYPE_NAME | VARCHAR | Name of the type. |
| TYPE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that contains the type. |
| TYPE_SCHEMA | VARCHAR | Schema that contains the type. |
| TYPE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database that contains the type. |
| TYPE_CATALOG | VARCHAR | Database that contains the type. |
| TYPE_OWNER | VARCHAR | Name of the role that owns the type. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| BASE_DATA_TYPE | VARCHAR | Underlying data type of the user-defined type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters for VARCHAR types. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes for VARCHAR types. |
| NUMERIC_PRECISION | NUMBER | Numeric precision for NUMBER types. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision for NUMBER types. |
| NUMERIC_SCALE | NUMBER | Numeric scale for NUMBER types. |
| DATETIME_PRECISION | NUMBER | Fractional seconds precision for TIMESTAMP types. |
| CHECK_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| IS_NULLABLE_DEFAULT | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Date and time when the type was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| DELETED | TIMESTAMP_LTZ | Date and time when the type was dropped. |
| COMMENT | VARCHAR | Comment for this type. |

## Usage notes

* Latency for the view might be up to 24 hours.
* The view only displays objects for which the current role for the session has been granted access privileges.
* The view doesn’t recognize the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve all user-defined types in the organization:

```sqlexample
SELECT type_name, type_catalog, type_schema, type_owner, base_data_type
  FROM SNOWFLAKE.ORGANIZATION_USAGE.TYPES
  ORDER BY created DESC;
```

Retrieve user-defined types that have been dropped:

```sqlexample
SELECT type_name, type_catalog, type_schema, deleted
  FROM SNOWFLAKE.ORGANIZATION_USAGE.TYPES
  WHERE deleted IS NOT NULL
  ORDER BY deleted DESC;
```

---
title: USAGE_IN_CURRENCY_DAILY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/usage_in_currency_daily.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# USAGE_IN_CURRENCY_DAILY view

The USAGE_IN_CURRENCY_DAILY view in the ORGANIZATION_USAGE schema can be used to return the daily credit usage and usage in currency for
an organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| CONTRACT_NUMBER | VARCHAR | Snowflake contract number for the organization. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage was consumed. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account where the usage was consumed. |
| REGION | VARCHAR | Name of the region where the account is located. |
| SERVICE_LEVEL | VARCHAR | Service level (edition) of the Snowflake account (Standard, Enterprise, Business Critical, etc.). |
| USAGE_DATE | DATE | Date (in UTC format) in which the usage took place. |
| USAGE_TYPE | VARCHAR | Corresponds to the Usage Category column in a billing statement, which exists for backward compatibility only. Use the BILLING_TYPE, RATING_TYPE, SERVICE_TYPE, and IS_ADJUSTMENT columns for billing reconciliation. |
| USAGE | NUMBER (38,3) | Total amount of usage charged based on SERVICE_TYPE. The unit of the USAGE depends on the RATING_TYPE. For example, when the RATING_TYPE is `compute`, USAGE is measured in credits. When the RATING_TYPE is `data transfer` or `storage`, the usage is rated in terabytes. |
| CURRENCY | VARCHAR | Currency of the usage. |
| USAGE_IN_CURRENCY | NUMBER (38,2) | Total amount charged for the USAGE_TYPE for USAGE on the USAGE_DATE. |
| BALANCE_SOURCE | VARCHAR | Source of the funds used to pay for the daily usage. The source can be one of the following:   * `capacity` — Usage paid with credits remaining on an organization’s capacity commitment. * `rollover` — Usage paid with rollover credits. When an organization renews a capacity commitment, unused credits are added to the   balance of the new contract as rollover credits. * `free usage` — Usage covered by the free credits provided to the organization. * `overage` — Usage that was paid at on-demand pricing, which occurs when an organization has exhausted its capacity, rollover,   and free credits. * `rebate` — Usage covered by the credits awarded to the organization when it shared data with another organization. |
| BILLING_TYPE | VARCHAR | Indicates what is being charged or credited. Possible billing types include:   * `consumption` — Usage associated with compute credits, storage costs, and data transfer costs. * `rebate` — Usage covered by the credits awarded to the organization when it shared data with another organization. * `priority support` — Charges for priority support services. This charge is associated with a stipulation in a contract, not with an account. * `vps_deployment_fee` — Charges for a [Virtual Private Snowflake](../../user-guide/intro-editions.md) deployment. * `support_credit` — Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake. |
| RATING_TYPE | VARCHAR | Indicates how the usage in the record is rated, or priced. Possible values include:   * `compute` * `data_transfer` * `storage` * `other` |
| SERVICE_TYPE | VARCHAR | Type of usage. The following list includes many, but not all, of the possible service types:   * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTOMATIC_CLUSTERING` — See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `CLOUD_SERVICES` — See [Cloud service credit usage](../../user-guide/cost-understanding-compute.md). * `COPY_FILES` — See [COPY FILES](../sql/copy-files.md). * `DATA_TRANSFER` — See [Understanding data transfer cost](../../user-guide/cost-understanding-data-transfer.md). * `EGRESS_COST_OPTIMIZER` — See [Optimizing data transfer costs with Egress Cost Optimizer](../../collaboration/provider-listings-auto-fulfillment-eco.md). * `INTERNAL_DATA_TRANSFER` — See costs associated with [Snowpark Container Services](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). * `LOGGING` — See [Logging, tracing, and metrics](../../developer-guide/logging-tracing/logging-tracing-overview.md). * `MATERIALIZED_VIEW` — See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OUTBOUND_PRIVATELINK_DATA_PROCESSED` — See [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md). * `OUTBOUND_PRIVATELINK_ENDPOINTS` — See [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md). * `REPLICATION` — See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `QUERY_ACCELERATION` — See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md) * `SEARCH_OPTIMIZATION` — See [Search optimization service](../../user-guide/search-optimization-service.md) * `SENSITIVE_DATA_CLASSIFICATION` — See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS` — See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK` — See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPIPE` — See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `SNOWPIPE_STREAMING` — See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE` — See [Understanding storage cost](../../user-guide/cost-understanding-data-storage.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `TRUST_CENTER` — See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING` — See [Virtual warehouse credit usage](../../user-guide/cost-understanding-compute.md). Does not indicate usage of serverless or cloud services compute. |
| IS_ADJUSTMENT | BOOLEAN | Indicates whether the record is an adjustment to usage. |

## Usage notes

* Latency for the view may be up to 72 hours.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments/credits, contract amendments,
  or Snowflake account transfers between organizations.
* Customers who signed a contract through a Snowflake reseller cannot access data in this view.
* Data is retained indefinitely.
* This view does not include data generated prior to June 2020. To obtain data before this date, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: USERS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/users.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# USERS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to query a list of all users in each account.

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| USER_ID | NUMBER | Internal/system-generated identifier for the user. |
| NAME | VARCHAR | A unique identifier for the user. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user’s account was created. |
| DELETED_ON | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user’s account was deleted. |
| LOGIN_NAME | VARCHAR | Name that the user enters to log into the system. |
| DISPLAY_NAME | VARCHAR | Name displayed for the user in the Snowflake web interface. |
| FIRST_NAME | VARCHAR | First name of the user. |
| LAST_NAME | VARCHAR | Last name of the user. |
| EMAIL | VARCHAR | Email address for the user. |
| MUST_CHANGE_PASSWORD | BOOLEAN | Specifies whether the user is forced to change their password on their next login. |
| HAS_PASSWORD | BOOLEAN | Specifies whether a password was created for the user. |
| COMMENT | VARCHAR | Comment for the user. |
| DISABLED | VARIANT | Specified whether the user account is disabled preventing the user from logging in to the Snowflake and running queries. |
| SNOWFLAKE_LOCK | VARIANT | Specifies whether a temporary lock has been placed on the user’s account. |
| DEFAULT_WAREHOUSE | VARCHAR | The virtual warehouse that is active by default for the user’s session upon login. |
| DEFAULT_NAMESPACE | VARCHAR | The namespace (database only or database and schema) that is active by default for the user’s session upon login. |
| DEFAULT_ROLE | VARCHAR | The role that is active by default for the user’s session upon login. |
| EXT_AUTHN_DUO | BOOLEAN | Specifies whether Duo Security is enabled for the user, which requires the user to use MFA (multi-factor authorization) for login. |
| EXT_AUTHN_UID | VARCHAR | The authorization ID used for Duo Security. |
| HAS_MFA | BOOLEAN | Specifies whether the user is enrolled for multi-factor authentication. |
| BYPASS_MFA_UNTIL | TIMESTAMP_LTZ | The number of minutes to temporarily bypass MFA for the user. |
| LAST_SUCCESS_LOGIN | TIMESTAMP_LTZ | Date and time (in the UTC time zone) when the user last logged in to the Snowflake. |
| EXPIRES_AT | TIMESTAMP_LTZ | The date and time when the user’s status is set to `EXPIRED` and the user can no longer log in. This is useful for defining temporary users (e.g. users who should only have access to Snowflake for a limited time period). |
| LOCKED_UNTIL_TIME | TIMESTAMP_LTZ | Specifies the number of minutes until the temporary lock on the user login is cleared. |
| HAS_RSA_PUBLIC_KEY | BOOLEAN | Specifies whether RSA public key used for key pair authentication has been set up for the user. |
| PASSWORD_LAST_SET_TIME | TIMESTAMP_LTZ | The timestamp on which the last non-null password was set for the user. Default to null if no password has been set yet or if Snowflake is unable to determine the timestamp for the user before the inclusion of this column. |
| OWNER | VARCHAR | Specifies the role with the OWNERSHIP privilege on the object. |
| DEFAULT_SECONDARY_ROLE | VARCHAR | Specifies the default secondary role for the user (that is, ALL) or NULL if not set. |
| HAS_PAT | BOOLEAN | If TRUE, a [programmatic access token (PAT)](../../user-guide/programmatic-access-tokens.md) has been generated for the user. |
| HAS_WORKLOAD_IDENTITY | BOOLEAN | If TRUE, the user is configured to use [workload identity federation](../../user-guide/workload-identity-federation.md) to authenticate with Snowflake. |
| TYPE | VARCHAR | Specifies the [type of user](../../user-guide/admin-user-management.md). |
| DATABASE_NAME | VARCHAR | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |
| DATABASE_ID | NUMBER | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL. |
| SCHEMA_NAME | VARCHAR | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |
| SCHEMA_ID | NUMBER | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema; otherwise, it’s NULL. |
| IS_FROM_ORGANIZATION_USER | BOOLEAN | If TRUE, the user was imported from an [organization user](../../user-guide/organization-users.md). |

## Usage notes

* Latency for the view may be up to 24 hours.

* The `LAST_SUCCESS_LOGIN` column may have a value that differs from the `last_success_login` column in the
  SHOW USERS command output because of different methodologies used to record near-real-time and historical logins. The
  column might have a NULL value if the login history data for the user is outside the one-year retention period of
  historical data.
* Columns that are not applicable to service users (that is, users with `TYPE=SERVICE`) contain NULL values. For example,
  `HAS_PASSWORD` contains NULL values for service users.
* The `deletedOn` column might not be accurate for Snowpark Container Services [service user](../../developer-guide/snowpark-container-services/spcs-execute-sql.md). For services created before release 8.42.0, the `deletedOn` column of the service user shows as empty even if the associated service is dropped; For services created after release 8.42.0, the `deletedOn` column of the service user shows as the deletion time of the associating service.

### Internal Snowflake User for Snowsight

The first time [Snowsight](../../user-guide/ui-snowsight.md) is accessed in an account, Snowflake creates an internal WORKSHEETS_APP_USER user to support the web interface. This user is used to cache query results in an internal stage in an account. For more information, see [Getting started with Snowsight](../../user-guide/ui-snowsight-gs.md).

---
title: VIEWS view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/views.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# VIEWS view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view displays a row for each view in an account, not including the views in the ACCOUNT_USAGE,
READER_ACCOUNT_USAGE, and INFORMATION_SCHEMA schemas.

See also:
:   [TABLES view](tables.md)

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_ID | NUMBER | Internal/system-generated identifier for the view. |
| TABLE_NAME | VARCHAR | Name of the view. |
| TABLE_SCHEMA_ID | NUMBER | Internal/system-generated identifier for the schema that the view belongs to. |
| TABLE_SCHEMA | VARCHAR | Schema that the view belongs to. |
| TABLE_CATALOG_ID | NUMBER | Internal/system-generated identifier for the database that the view belongs to. |
| TABLE_CATALOG | VARCHAR | Database that the view belongs to. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the view. |
| VIEW_DEFINITION | VARCHAR | Text of the query expression for the view. |
| CHECK_OPTION | VARCHAR | Not applicable for Snowflake. |
| IS_UPDATABLE | VARCHAR | Not applicable for Snowflake. |
| INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_SECURE | VARCHAR | Specifies whether the view is secure. |
| CREATED | TIMESTAMP_LTZ | Date and time when the view was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| DELETED | TIMESTAMP_LTZ | Date and time when the view was deleted. |
| COMMENT | VARCHAR | Comment for the view. |
| INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* Latency for the view may be up to 24 hours.

* The view does not recognize the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command
  executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.

---
title: WAREHOUSE_EVENTS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/warehouse_events_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# WAREHOUSE_EVENTS_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to return the events that have been triggered for the single-cluster and multi-cluster warehouses
in your account.

Supported events include:

* Creating, dropping, or altering a warehouse, including resizing the warehouse.
* Resuming or suspending a warehouse.
* Resuming, suspending, or resizing a cluster in a warehouse (single-cluster and multi-cluster warehouses).
* Stopping or starting additional clusters in a warehouse (multi-cluster warehouses only).

## Columns

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column name | Data Type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_LTZ | The timestamp when the event is triggered. |
| WAREHOUSE_ID | NUMBER | The unique warehouse ID (assigned by Snowflake) that corresponds to the warehouse name in your account. |
| WAREHOUSE_NAME | VARCHAR | The name of the warehouse in your account. |
| CLUSTER_NUMBER | NUMBER | If an event was triggered for a specific cluster in a multi-cluster warehouse, the number of the cluster (starting with 1) for which the event was triggered; if the event was triggered for all clusters in the warehouse or is not applicable for a single-cluster warehouse, NULL is displayed. |
| EVENT_NAME | VARCHAR | Name of the event. For the list of possible values, see below. |
| EVENT_REASON | VARCHAR | The cause of the event. For the list of possible values, see below. |
| EVENT_STATE | VARCHAR | State of an event that might take time to complete: STARTED or COMPLETED. |
| USER_NAME | VARCHAR | User who initiated the event. |
| ROLE_NAME | VARCHAR | Role that was active in the session at the time the event was initiated. |
| QUERY_ID | VARCHAR | Internal/system-generated identifier for the SQL statement. |
| SIZE | VARCHAR | Current size of the warehouse at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| CLUSTER_COUNT | VARCHAR | Number of warehouse clusters at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| WAREHOUSE_TYPE | VARCHAR | One of `STANDARD` or `SNOWPARK-OPTIMIZED`. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| RESOURCE_CONSTRAINT | VARCHAR | One of: . - `STANDARD_GEN_1` . - `STANDARD_GEN_2` . - `MEMORY_1X` . - `MEMORY_1X_x86` . - `MEMORY_16X` . - `MEMORY_16X_x86` . - `MEMORY_64X` . - `MEMORY_64X_x86` . This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. It’s also NULL for standard warehouses created before the release of the STANDARD_GEN_2 feature. |

### EVENT_NAME descriptions

The following sections describe the valid values for the EVENT_NAME column for warehouse-related and cluster-related events.

#### Warehouse-related events

The following table describes the valid values for the EVENT_NAME column for warehouse-related events:

| EVENT_NAME | Description |
| --- | --- |
| CONVERT_WAREHOUSE | Triggered by the conversion of a warehouse from standard to Snowpark-optimized, from Snowpark-optimized to standard, from Gen1 to Gen2, or Gen2 to Gen1. This event happens whether the warehouse is running or suspended when the conversion happens.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | CONVERT_TO_SNOWPARK_OPTIMIZED, CONVERT_TO_STANDARD, or CONVERT_RESOURCE_CONSTRAINT |   Cost impact: Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  **Tip:** For information about cost implications of changing the RESOURCE_CONSTRAINT property, see [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](../../user-guide/warehouses-gen2.md). |
| CREATE_WAREHOUSE | Triggered by the creation of a new warehouse, which can occur when a user manually creates a warehouse or when an account is provisioned and the default warehouse is automatically created in the account.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: None if the cluster is created with INITIALLY_SUSPENDED = TRUE. Otherwise, metering starts when all compute resources are provisioned for the warehouse or the warehouse starts processing statements (if the warehouse starts processing statements before the resources are fully provisioned). |
| DROP_WAREHOUSE | Triggered when an existing warehouse is dropped; all currently executing queries on the warehouse are stopped and the compute resources are released.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: Metering on the compute resources for the warehouse stops after all currently executing queries complete. |
| ALTER_WAREHOUSE | Triggered when the properties of an existing warehouse are changed, including resizing the warehouse. If the warehouse is resized, additional RESIZE_WAREHOUSE events are triggered. This event can also trigger RESUME_WAREHOUSE or SUSPEND_WAREHOUSE events.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | COMPLETED or STARTED | | Event reason | None (N/A) |   Cost impact: Depends on the event(s) that are triggered by the ALTER statement. |
| RESIZE_WAREHOUSE | Triggered by changing the size of a warehouse, which increases or decreases the compute resources in each cluster in the warehouse. For a running warehouse, this event also triggers a RESIZE_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (N/A) | | Event state | STARTED | | Event reason | WAREHOUSE_RESIZE |   Cost impact: Resizing a running warehouse adds or removes compute resources in each cluster in the warehouse. Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  Resizing a suspended warehouse does not provision any new resources for the warehouse. |
| RESUME_WAREHOUSE | Triggered when a suspended warehouse is resumed or a new warehouse is created with INITIALLY_SUSPENDED = FALSE. This event also triggers a RESUME_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (applies to all clusters) | | Event state | STARTED | | Event reason | WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME |   Cost impact: Metering begins after all the compute resources are provisioned for the warehouse. |
| SUSPEND_WAREHOUSE | Triggered when a running warehouse is suspended. This event also triggers a SUSPEND_CLUSTER event for each cluster in the warehouse.   |  |  | | --- | --- | | Cluster number | None (applies to all clusters) | | Event state | STARTED | | Event reason | WAREHOUSE_AUTOSUSPEND or WAREHOUSE_SUSPEND |   Cost impact: Metering on the compute resources for the warehouse stops after all running statements complete. |
| WAREHOUSE_CONSISTENT | Triggered when pending changes to a warehouse complete. For more information, see [Usage notes](../account-usage/warehouse_events_history.md).   |  |  | | --- | --- | | Cluster number | NULL | | Event state | COMPLETED | | Event reason | NULL |   Cost impact: None. Metering occurs for the warehouse event that is logged with the STARTED state before the WAREHOUSE_CONSISTENT event.  For more information, see the cost impact of the warehouse events described in the previous rows. |

#### Cluster-related events

The following table describes the valid values for the EVENT_NAME column for cluster-related events:

| EVENT_NAME | Description |
| --- | --- |
| CONVERT_CLUSTER | Triggered by the conversion of a warehouse from standard to Snowpark-optimized, or from Snowpark-optimized to standard. This event is only emitted when the conversion happens while the warehouse is running.   |  |  | | --- | --- | | Cluster number | Number of the converted cluster (always `1` for a single-cluster warehouse) | | Event state | COMPLETED or STARTED | | Event reason | CONVERT_TO_SNOWPARK_OPTIMIZED or CONVERT_TO_STANDARD |   Cost impact: Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries.  **Tip:** For information about cost implications of changing the RESOURCE_CONSTRAINT property, see [considerations for changing RESOURCE_CONSTRAINT while a warehouse is running or suspended](../../user-guide/warehouses-gen2.md). |
| RESUME_CLUSTER | Triggered when a suspended cluster is resumed.   |  |  | | --- | --- | | Cluster number | Number of the resumed cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason | * WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME (single-cluster warehouse) * MULTICLUSTER_SPINUP (multi-cluster warehouse) |   Cost impact: Metering starts on the compute resources for the cluster after they are provisioned. |
| SUSPEND_CLUSTER | Triggered when a running cluster is suspended.   |  |  | | --- | --- | | Cluster number | Number of the suspended cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason: | * WAREHOUSE_AUTOSUSPEND or WAREHOUSE_SUSPEND (single-cluster warehouse) * MULTICLUSTER_SPINDOWN (multi-cluster warehouse) * RESOURCE_MONITOR_SUSPEND |   Cost impact: Metering stops on the compute resources for the cluster after all currently executing queries complete. |
| RESIZE_CLUSTER | Triggered when a cluster is resized, usually as a result of resizing a warehouse.   |  |  | | --- | --- | | Cluster number | Number of the resized cluster (always `1` for a single-cluster warehouse) | | Event state | STARTED | | Event reason | * WAREHOUSE_AUTORESUME or WAREHOUSE_RESUME (single-cluster warehouse) * MULTICLUSTER_SPINDOWN or MULTICLUSTER_SPINUP (multi-cluster warehouse) * WAREHOUSE_RESIZE |   Cost impact: Depends on whether compute resources are added or removed due to resizing. Newly added resources start metering when they are provisioned. Removed resources stop metering after they finish processing any currently executing queries. |
| SPINUP_CLUSTER | Triggered when a cluster is started (multi-cluster warehouse only); usually happens when the mininimum or maximum cluster size is increased.   |  |  | | --- | --- | | Cluster number | Number of the cluster that was started | | Event state | STARTED | | Event reason | * WAREHOUSE_RESIZE (single-cluster warehouse) * MULTICLUSTER_SPINUP (multi-cluster warehouse) |   Cost impact: Metering starts on the compute resources for the cluster after they are provisioned. |
| SPINDOWN_CLUSTER | Triggered when a running cluster is shut down (multi-cluster warehouse only); usually happens when the minimum or maximum cluster size is decreased.   |  |  | | --- | --- | | Cluster number | Number of the cluster that was shut down | | Event state | STARTED | | Event reason | * WAREHOUSE_RESIZE (single-cluster warehouse) * MULTICLUSTER_SPINDOWN (multi-cluster warehouse) |   Cost impact: Metering stops on the compute resources for the cluster after all currently executing queries complete. |

### EVENT_REASON descriptions

The following table describes the valid values for the EVENT_REASON column:

| EVENT_REASON | Description |
| --- | --- |
| WAREHOUSE_AUTORESUME | A suspended warehouse was resumed automatically because AUTO_RESUME is enabled for the warehouse and a SQL statement was submitted to the warehouse. |
| WAREHOUSE_RESUME | A suspended warehouse was resumed manually by a user. |
| WAREHOUSE_AUTOSUSPEND | A running warehouse was suspended automatically because AUTO_SUSPEND is enabled for the warehouse and the defined period of inactivity for AUTO_SUSPEND has passed. |
| WAREHOUSE_SUSPEND | A running warehouse was suspended manually by a user. |
| WAREHOUSE_RESIZE | A warehouse was resized. |
| RESOURCE_MONITOR_SUSPEND | A warehouse was suspended because the credit quota for the resource monitor for the warehouse was reached. |
| MULTICLUSTER_SPINUP | A new or suspended cluster was provisioned in a multi-cluster warehouse; not applicable to single-cluster warehouses. |
| MULTICLUSTER_SPINDOWN | A running cluster was shut down in a multi-cluster warehouse; not applicable to single-cluster warehouses. |

## Usage notes

* Latency for the view may be up to 24 hours.

* An event can produce multiple rows in the view if it triggers additional, related events.
* The value for the EVENT_REASON, USER_NAME, ROLE_NAME, and QUERY_ID columns is NULL for a WAREHOUSE_CONSISTENT event.
* The WAREHOUSE_CONSISTENT event might share the same timestamp with another warehouse event and be listed out of order.

### Warehouse event that indicates that an operation has completed

Events that create a warehouse, change the size of the warehouse or the number of clusters, or suspend a warehouse are not atomic
operations. This means that some small amount of time is required for these operations to fully complete.

For example, if a warehouse is suspended using an ALTER WAREHOUSE … SUSPEND statement, any queries that are currently executing on the
warehouse must complete (or time out) before it can be suspended. In some cases, multiple warehouse events might be in-flight
(for example, resize and suspend). When all warehouse events have completed, the warehouse is in a *consistent* state.

If a warehouse event is logged with the STARTED state in the EVENT_STATE column, it is never
logged with a COMPLETED state. Instead, an event logged with the STARTED state is always followed by a subsequent WAREHOUSE_CONSISTENT
event. If multiple warehouse events are logged with the STARTED event state, those events coalesce to the same WAREHOUSE_CONSISTENT event.

If a warehouse event is logged with the COMPLETED state in the EVENT_STATE column, no subsequent WAREHOUSE_CONSISTENT event follows
unless another pending event is logged with a STARTED state.

## Examples

### View events history for the previous week

View the events history for warehouse `my_wh` for the previous week by executing the following statement:

```sqlexample
SELECT account_name, timestamp, warehouse_name, cluster_number,
       event_name, event_reason, event_state,
       size, cluster_count
  FROM SNOWFLAKE.ORGANIZATION_USAGE.WAREHOUSE_EVENTS_HISTORY
  WHERE warehouse_name = 'MY_WH'
  AND timestamp > DATEADD('day', -7, CURRENT_TIMESTAMP())
  ORDER BY timestamp DESC;
```

### Example events history results

#### Events history for a statement with no pending changes

An ALTER WAREHOUSE statement is logged with the COMPLETED state when there are no additional changes pending. For example,
the following statement updates the comment for warehouse `my_wh`:

```sqlexample
ALTER WAREHOUSE my_wh SET
  COMMENT = 'Updated comment for warehouse';
```

This statement results in the following row in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-26 16:42:13.513 +0000 | MY_WH | ALTER_WAREHOUSE | COMPLETED | NULL | NULL |

#### Events history for a statement that is followed by a WAREHOUSE_CONSISTENT event

When an ALTER WAREHOUSE statement changes the warehouse size, additional events follow. For example, resize warehouse
`my_wh`:

```sqlexample
ALTER WAREHOUSE my_wh SET
  WAREHOUSE_SIZE = 'SMALL';
```

This statement results in the following rows in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-05-29 15:13:05.874 +0000 | MY_WH | ALTER_WAREHOUSE | STARTED | NULL | NULL |
| 2024-05-29 15:13:05.874 +0000 | MY_WH | RESIZE_WAREHOUSE | STARTED | NULL | NULL |
| 2024-05-29 15:13:06.036 +0000 | MY_WH | WAREHOUSE_CONSISTENT | COMPLETED | SMALL | 1 |
| 2024-05-29 15:13:06.036 +0000 | MY_WH | RESIZE_CLUSTER | COMPLETED | NULL | NULL |

#### Events history for a Snowflake-initiated warehouse event

When Snowflake resumes a multi-cluster warehouse, the following warehouse events are logged:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-23 17:04:11.618 +0000 | MY_WH | SPINUP_CLUSTER | STARTED | NULL | NULL |
| 2024-04-23 17:04:11.657 +0000 | MY_WH | RESUME_CLUSTER | STARTED | NULL | NULL |
| 2024-04-23 17:04:11.657 +0000 | MY_WH | WAREHOUSE_CONSISTENT | COMPLETED | LARGE | 5 |

---
title: WAREHOUSE_LOAD_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/warehouse_load_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# WAREHOUSE_LOAD_HISTORY view

> **Important:**
>
> This view is only available in the organization account. For more information, see [Premium views in the organization account](../../user-guide/organization-accounts-premium-views.md).

This Organization Usage view can be used to analyze the workload on your warehouse within a specified date range.

See also:
:   [WAREHOUSE_METERING_HISTORY view](warehouse_metering_history.md)

## Columns

> **Note:**
>
> For the output columns of this view, the query load value is the ratio of the total execution time (in seconds) of all queries in a
> specific state in an interval by the total time (in seconds) for that interval.
>
> For example, if 276 seconds was the total time for 4 queries in a 5 minute (300 second) interval, then the query load value is
> 276 / 300 = 0.92.

**Organization-level columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization. |
| ACCOUNT_LOCATOR | VARCHAR | System-generated identifier for the account. |
| ACCOUNT_NAME | VARCHAR | User-defined identifier for the account. |

**Additional columns**

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_LTZ | The start of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The end of the specified time range (in the UTC time zone) in which the warehouse usage took place. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| AVG_RUNNING | NUMBER(38,9) | Query load value for queries executed. |
| AVG_QUEUED_LOAD | NUMBER(38,9) | Query load value for queries queued because the warehouse was overloaded. |
| AVG_QUEUED_PROVISIONING | NUMBER(38,9) | Query load value for queries queued because the warehouse was being provisioned. |
| AVG_BLOCKED | NUMBER(38,9) | Query load value for queries blocked by a transaction lock. |
|  |  |  |

## Usage notes

* Latency for the view may be up to 24 hours.

* Load history is shown in 5-minute intervals.

---
title: WAREHOUSE_METERING_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/organization-usage/warehouse_metering_history.md
section: Organization Usage
---

Schema:
:   [ORGANIZATION_USAGE](../organization-usage.md)

# WAREHOUSE_METERING_HISTORY view

This Organization Usage view can be used to return the hourly credit usage for one or more warehouses across all the accounts in your organization
within the last 365 days (1 year).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the organization where the usage took place. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage took place. |
| REGION | VARCHAR | Name of the region where the account is located. |
| SERVICE_TYPE | VARCHAR | The type of service, which identifies whether the usage is for a standard or reader account. Valid values: WAREHOUSE_METERING or WAREHOUSE_METERING_READER. |
| START_TIME | TIMESTAMP_LTZ | The date and beginning of the hour (in the local time zone) in which the warehouse usage took place. |
| END_TIME | TIMESTAMP_LTZ | The date and end of the hour (in the local time zone) in which the warehouse usage took place. |
| WAREHOUSE_ID | NUMBER | Internal/system-generated identifier for the warehouse. |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| CREDITS_USED | NUMBER | Total number of credits used by the warehouse in the hour. This is the sum of CREDITS_USED_COMPUTE and CREDITS_USED_CLOUD_SERVICES. This value does not take into account the [adjustment for cloud services](../../user-guide/cost-understanding-compute.md), and may therefore be greater than the credits that are billed. To determine how many credits were actually billed, run queries against the [METERING_DAILY_HISTORY view](metering_daily_history.md). |
| CREDITS_USED_COMPUTE | NUMBER | Number of credits used for the warehouse in the hour. |
| CREDITS_USED_CLOUD_SERVICES | NUMBER | Number of credits used for cloud services in the hour. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account where the usage took place. |

## Usage notes

* Latency for the view may be up to 1440 minutes (24 hours).

## Information Schema

Snowflake Information Schema views and table functions for metadata queries.

---
title: APPLICABLE_ROLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/applicable_roles.md
section: Information Schema
---

# APPLICABLE_ROLES view

This Information Schema view displays one row for each role grant applied to the currently authenticated user.

For more information about roles and grants, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [ENABLED_ROLES view](enabled_roles.md) , [OBJECT_PRIVILEGES view](object_privileges.md) , [TABLE_PRIVILEGES view](table_privileges.md) , [GRANTS_TO_USERS view](../account-usage/grants_to_users.md) ,
    [GRANTS_TO_ROLES view](../account-usage/grants_to_roles.md) , [SHOW GRANTS](../sql/show-grants.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| GRANTEE | VARCHAR | Role or user to whom the privilege is granted |
| ROLE_NAME | VARCHAR | Name of the role |
| ROLE_OWNER | VARCHAR | Owner of the role |
| IS_GRANTABLE | VARCHAR | Whether this role can be granted to others |

## Usage notes

The view does not display any information about [database roles](../../user-guide/security-access-control-considerations.md).

---
title: APPLICATION_CONFIGURATIONS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/application_configurations.md
section: Information Schema
---

# APPLICATION_CONFIGURATIONS view

This Information Schema view displays a row for each application configuration currently defined in the specified or current database where the information schema is located.

For more information about application configuration, see [Application configuration](../../developer-guide/native-apps/app-configuration.md).

## Columns

The following table provides definitions for the `APPLICATION_CONFIGURATIONS` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| NAME | STRING | The name of the configuration. |
| APPLICATION_NAME | STRING | The name of the application that the configuration is in. |
| CREATED_ON | TIMESTAMP | The timestamp when the configuration object was created. |
| UPDATED_ON | TIMESTAMP | The timestamp when the configuration object was last updated. |
| TYPE | STRING | The type of the configuration. Possible values are APPLICATION_NAME and STRING. |
| STATUS | STRING | The status of the configuration. Possible values are PENDING and DONE. |
| SENSITIVE | BOOLEAN | Whether the value is sensitive or not. |
| VALUE | STRING | The value that is set by the consumer.  For application configurations of the APPLICATION_NAME type, this is the most up-to-date name of the application specified by the consumer. This may not be the same as initially provided if the application has been renamed. If the application has been dropped, no value will be shown here, as if the value is not set.  When `SENSITIVE=TRUE`, the value is hidden, unless the executing role is the application owning the configuration. |
| VALUE_UPDATED_ON | TIMESTAMP | The last updated timestamp when the value was set or unset. |
| LABEL | STRING | A user-friendly name to be displayed in the UI, provided by the provider. |
| DESCRIPTION | STRING | The description of the configuration. |
| APPLICATION_ROLES | STRING | The comma-separated app role names that have access to the configuration.  This displays the most up-to-date names, even if roles have been renamed. If an application role has been dropped, it will not be included in the output list. |

## Usage notes

* The view only displays configurations for which the current role for the session has been granted access privileges.
* The view does not include configurations that have been dropped.

## Examples

Retrieve all listings in the current account:

```sqlexample
SELECT * FROM <any_database>.INFORMATION_SCHEMA.APPLICATION_CONFIGURATIONS;
```

---
title: APPLICATION_SPECIFICATION view
source: https://docs.snowflake.com/en/sql-reference/info-schema/application_specifications.md
section: Information Schema
---

# APPLICATION_SPECIFICATION view

This Information Schema view displays a row for each app specification request currently defined
in the specified or current database where the information schema is located.

For more information about app specification, see
[Overview of app specifications](../../developer-guide/native-apps/requesting-app-specs.md).

## Columns

The following table provides definitions for the `APPLICATION_SPECIFICATIONS` view columns.

| Column | Data type | Description |
| --- | --- | --- |
| NAME | TEXT | The name of the app specification. |
| APPLICATION_NAME | TEXT | The name of the app that contains the app specification. |
| TYPE | TEXT | The type of app specification. Possible values are EXTERNAL_ACCESS, SECURITY_INTEGRATION, and LISTING. |
| SEQUENCE_NUMBER | NUMBER | The sequence number of the app specification. |
| REQUESTED_ON | TIMESTAMP_LTZ | The timestamp when the app created the app specification. |
| STATUS | TEXT | The status of the app specification. Possible values are: APPROVED, PENDING, or DECLINED. |
| STATUS_UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the app specification was last updated, including when it was created, approved, or declined. |
| LABEL | TEXT | A label containing the name of the app specification that is displayed to consumer in Snowsight. |
| DESCRIPTION | TEXT | A description of the app specification. This description is displayed to the consumer. |
| DEFINITION | TEXT | The fields that comprise the app specification definition. |

---
title: BACKUP_POLICIES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/backup_policies.md
section: Information Schema
---

# BACKUP_POLICIES view

This Information Schema view provides information on backup policies.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| BACKUP_POLICY_NAME | VARCHAR | Name of the backup policy. |
| BACKUP_POLICY_SCHEMA | VARCHAR | Schema that the backup policy belongs to. |
| BACKUP_POLICY_CATALOG | VARCHAR | Database that the backup policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for backup creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after backup creation when backup should be expired and automatically deleted. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if the policy has a retention lock; N otherwise.  Retention lock protects backups from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the backup policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the backup policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the backup policy. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: BACKUP_SETS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/backup_sets.md
section: Information Schema
---

# BACKUP_SETS view

This Information Schema view provides information on backup sets.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| BACKUP_SET_NAME | VARCHAR | Name of the backup set. |
| BACKUP_SET_SCHEMA | VARCHAR | Schema that the backup set belongs to. |
| BACKUP_SET_CATALOG | VARCHAR | Database that the backup set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the backup set is backing up. |
| OBJECT_NAME | VARCHAR | Name of object that the backup set is backing up. |
| OBJECT_SCHEMA | VARCHAR | Name of schema that contains the object that is backed up by this backup set. |
| OBJECT_CATALOG | VARCHAR | Name of database that contains the object that is backed up by this backup set. |
| BACKUP_POLICY_NAME | VARCHAR | Name of backup policy attached to this backup set. |
| BACKUP_POLICY_SCHEMA | VARCHAR | Name of the schema that contains the backup policy. |
| BACKUP_POLICY_CATALOG | VARCHAR | Name of the database that contains the backup policy. |
| OWNER | VARCHAR | Name of the role that owns the backup set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the backup set. Account role or Database role. |
| CREATED | TIMESTAMP | Date and time when the backup set was created. |
| LAST_ALTERED | TIMESTAMP | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the backup set. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: BACKUPS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/backups.md
section: Information Schema
---

# BACKUPS view

This Information Schema view provides information on backups.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the backup.  Note: this is not the local ID, this is the globally unique UUID of the backup. |
| CREATED | TIMESTAMP_LTZ | Timestamp at which backup was created. |
| BACKUP_SET_NAME | VARCHAR | Name of backup set that contains the backup. |
| BACKUP_SET_SCHEMA | VARCHAR | Name of schema that the backup set belongs to. |
| BACKUP_SET_CATALOG | VARCHAR | Name of database that the backup set belongs to. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which backup will be expired and deleted. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | Y if backup is under legal hold; N otherwise. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: CHECK_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/check_constraints.md
section: Information Schema
---

# CHECK_CONSTRAINTS view

This Information Schema view displays a row for each [CHECK constraint](../constraints-overview.md)
defined in the specified or current database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_CATALOG | VARCHAR | Database that the CHECK constraint belongs to. |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the CHECK constraint belongs to. |
| CONSTRAINT_TABLE | VARCHAR | Table or view that the CHECK constraint belongs to. |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint with the CHECK clause. |
| CHECK_CLAUSE | VARCHAR | Condition enforced by the CHECK constraint. |

## Usage notes

The view only displays objects for which the current role for the session has been granted access privileges.

## Examples

Retrieve all of the CHECK constraints applied to tables in the `mydb` database:

```sqlexample
USE DATABASE mydb;

SELECT * FROM INFORMATION_SCHEMA.CHECK_CONSTRAINTS;
```

---
title: CLASS_INSTANCE_FUNCTIONS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/class_instance_functions.md
section: Information Schema
---

# CLASS_INSTANCE_FUNCTIONS view

This Information Schema view displays a row for each function in a
[class](../snowflake-db-classes.md) instance.

See also:
:   [CLASS_INSTANCES view](class_instances.md),
    [CLASS_INSTANCE_PROCEDURES view](class_instance_procedures.md),
    [SHOW FUNCTIONS](../sql/show-functions.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FUNCTION_NAME | VARCHAR | Name of the function. |
| FUNCTION_INSTANCE_NAME | VARCHAR | Name of the class instance to which the function belongs. |
| FUNCTION_INSTANCE_SCHEMA | VARCHAR | Name of the schema to which the class instance belongs. |
| FUNCTION_INSTANCE_DATABASE | VARCHAR | Name of the database to which the class instance belongs. |
| FUNCTION_OWNER | VARCHAR | Name of the role that owns the function. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the function’s arguments. |
| DATA_TYPE | VARCHAR | Data type of the return value. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string type return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string type return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric type return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric type return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric type return value. |
| FUNCTION_LANGUAGE | VARCHAR | Language of the function. |
| FUNCTION_DEFINITION | VARCHAR | Function definition. |
| VOLATILITY | VARCHAR | Whether the function is volatile or immutable. |
| IS_NULL_CALL | VARCHAR | ‘YES’ if the function is called on null input. |
| IS_SECURE | VARCHAR | ‘YES’ if the function is [secure](../../developer-guide/secure-udf-procedure.md). |
| CREATED | TIMESTAMP_LTZ | Date and time when the function was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this function. |
| IS_EXTERNAL [1] | VARCHAR | ‘YES’ if the function is an [external function](../external-functions.md). |
| API_INTEGRATION [1] | VARCHAR | Name of the API integration object to authenticate the call to the proxy service. |
| CONTEXT_HEADERS [1] | VARCHAR | Context header information for the external function. |
| MAX_BATCH_ROWS [1] | NUMBER | Maximum number of rows in each batch sent to the proxy service. |
| COMPRESSION [1] | VARCHAR | Type of compression. |
| PACKAGES | VARCHAR | Packages requested by the function. |
| RUNTIME_VERSION | VARCHAR | Runtime version of the language used by the function. NULL if the function is SQL or JavaScript. |
| INSTALLED_PACKAGES | VARCHAR | All packages installed by the function. Output for Python functions only. |
| IS_MEMOIZABLE | VARCHAR | ‘YES’ if the function is memoizable, ‘NO’ otherwise. |

[1]
(1,2,3,4,5)

These fields apply only to [Writing external functions](../external-functions.md).

## Usage notes

* The view only displays objects for which the current role for the session has been granted
  an instance role with access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the functions for class instances in the `mydatabase` database:

```sqlexample
SELECT function_name,
       function_instance_name AS instance_name,
       argument_signature,
       data_type AS return_value_data_type
    FROM mydatabase.INFORMATION_SCHEMA.CLASS_INSTANCE_FUNCTIONS;
```

---
title: CLASS_INSTANCE_PROCEDURES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/class_instance_procedures.md
section: Information Schema
---

# CLASS_INSTANCE_PROCEDURES view

This Information Schema view displays a row for each procedure in a
[class](../snowflake-db-classes.md) instance.

See also:
:   [CLASS_INSTANCES view](class_instances.md),
    [CLASS_INSTANCE_FUNCTIONS view](class_instance_functions.md),
    [SHOW PROCEDURES](../sql/show-procedures.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROCEDURE_NAME | VARCHAR | Name of the stored procedure. |
| PROCEDURE_INSTANCE_NAME | VARCHAR | Name of the class instance to which the procedure belongs. |
| PROCEDURE_INSTANCE_SCHEMA | VARCHAR | Name of the schema to which the class instance belongs. |
| PROCEDURE_INSTANCE_DATABASE | VARCHAR | Name of the database to which the class instance belongs. |
| PROCEDURE_OWNER | VARCHAR | Name of the role that owns the stored procedure. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the stored procedure’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string return value. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | VARCHAR | Scale of numeric return value. |
| PROCEDURE_LANGUAGE | VARCHAR | Language of the stored procedure. |
| PROCEDURE_DEFINITION | VARCHAR | Stored procedure definition. |
| CREATED | TIMESTAMP_LTZ | Date and time the stored procedure was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the stored procedure. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted
  an instance role with access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the procedures for instances in the `mydatabase` database:

```sqlexample
SELECT procedure_name,
       procedure_instance_name,
       argument_signature,
       data_type AS return_value_data_type
    FROM mydatabase.INFORMATION_SCHEMA.CLASS_INSTANCE_PROCEDURES;
```

---
title: CLASS_INSTANCES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/class_instances.md
section: Information Schema
---

# CLASS_INSTANCES view

This Information Schema view displays a row for each [class](../snowflake-db-classes.md)
instance in a database.

See also:
:   [CLASS_INSTANCE_FUNCTIONS view](class_instance_functions.md),
    [CLASS_INSTANCE_PROCEDURES view](class_instance_procedures.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the instance. |
| SCHEMA_NAME | VARCHAR | Name of the schema the instance belongs to. |
| DATABASE_NAME | VARCHAR | Name of the database the instance belongs to. |
| CLASS_NAME | VARCHAR | Name of the class the instance is instantiated from. |
| CLASS_SCHEMA_NAME | VARCHAR | Name of the schema of the class the instance is instantiated from. |
| CLASS_DATABASE_NAME | VARCHAR | Name of the database of the class the instance is instantiated from. |
| VERSION | VARCHAR | Current version of the instance. |
| OWNER | VARCHAR | Name of the role that owns the instance. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the instance was created. |
| COMMENT | VARCHAR | Comment for the instance. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not include instances that have been dropped. To view dropped instances, use
  the Account Usage [CLASS_INSTANCES view](../account-usage/class_instances.md) instead.

## Examples

Retrieve the names of all instances, and the class they were instantiated from, in the `mydatabase` database:

```sqlexample
SELECT name, class_name, class_schema_name, class_database_name
    FROM mydatabase.INFORMATION_SCHEMA.CLASS_INSTANCES;
```

---
title: CLASSES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/classes.md
section: Information Schema
---

# CLASSES view

This Information Schema view displays a row for each [class](../snowflake-db-classes.md)
in the database.

See also:
:   [CLASSES view](../account-usage/classes.md),
    [CLASS_INSTANCES view](class_instances.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the instance. |
| SCHEMA_NAME | VARCHAR | Name of the schema the instance belongs to. |
| DATABASE_NAME | VARCHAR | Name of the database the instance belongs to. |
| VERSION | VARCHAR | Version of the class that is currently active in this account. |
| OWNER | VARCHAR | Name of the role that owns the instance. |
| OWNER_ROLE_TYPE | VARCHAR | The internal/system-generated identifier of the role that owns the instance of the class. |
| IS_SERVICE_CLASS | VARCHAR | TRUE if the class is a SERVICE class. |
| CREATED | TIMESTAMP_LTZ | Date and time when the instance was created. |
| COMMENT | VARCHAR | Comment for the instance. |

## Usage notes

The view only displays objects for which the current role for the session has been granted access privileges.

## Examples

Retrieve all classes in the SNOWFLAKE database:

```sqlexample
SELECT name, schema_name, database_name, version
    FROM SNOWFLAKE.INFORMATION_SCHEMA.CLASSES;
```

---
title: COLUMNS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/columns.md
section: Information Schema
---

# COLUMNS view

This Information Schema view displays a row for each column in the tables defined in the specified (or current) database.

See also:
:   [DATABASES view](databases.md)

## Columns

| Column | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | TEXT | Database that the table belongs to. |
| TABLE_SCHEMA | TEXT | Schema that the table belongs to. |
| TABLE_NAME | TEXT | Table or view that the column belongs to. |
| COLUMN_NAME | TEXT | Name of the column. |
| ORDINAL_POSITION | NUMBER | Ordinal position of the column in the table. |
| COLUMN_DEFAULT | TEXT | Default value of the column. |
| IS_NULLABLE | TEXT | ‘YES’ if the column may contain NULL, ‘NO’ otherwise. |
| DATA_TYPE | TEXT | Data type of the column.  This column shows the standard Snowflake data type of the column. The DATA_TYPE_ALIAS column displays the original data type name that was specified for the column when the table was created, or when the column was altered. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string columns. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string columns. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric columns. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric columns. |
| NUMERIC_SCALE | NUMBER | Scale of numeric columns. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | TEXT | Not applicable for Snowflake. |
| INTERVAL_PRECISION | NUMBER | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | TEXT | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | TEXT | Not applicable for Snowflake. |
| COLLATION_CATALOG | TEXT | Not applicable for Snowflake. |
| COLLATION_SCHEMA | TEXT | Not applicable for Snowflake. |
| COLLATION_NAME | TEXT | Not applicable for Snowflake. |
| DOMAIN_CATALOG | TEXT | Not applicable for Snowflake. |
| DOMAIN_SCHEMA | TEXT | Not applicable for Snowflake. |
| DOMAIN_NAME | TEXT | Not applicable for Snowflake. |
| UDT_CATALOG | TEXT | Not applicable for Snowflake. |
| UDT_SCHEMA | TEXT | Not applicable for Snowflake. |
| UDT_NAME | TEXT | Not applicable for Snowflake. |
| SCOPE_CATALOG | TEXT | Not applicable for Snowflake. |
| SCOPE_SCHEMA | TEXT | Not applicable for Snowflake. |
| SCOPE_NAME | TEXT | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | NUMBER | Not applicable for Snowflake. |
| DTD_IDENTIFIER | TEXT | Not applicable for Snowflake. |
| IS_SELF_REFERENCING | TEXT | Not applicable for Snowflake. |
| IS_IDENTITY | TEXT | Whether this column is an identity column. |
| IDENTITY_GENERATION | TEXT | Whether an identity column’s value is always generated or only generated by default. Snowflake only supports BY DEFAULT. |
| IDENTITY_START | TEXT | The START value from `CREATE TABLE ... (columnX ... AUTOINCREMENT START <#> ...)`. |
| IDENTITY_INCREMENT | TEXT | The INCREMENT value from `CREATE TABLE ... (columnX ... AUTOINCREMENT INCREMENT <#> ...)`. |
| IDENTITY_MAXIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_MINIMUM | TEXT | Not applicable for Snowflake. |
| IDENTITY_CYCLE | TEXT | Whether the value of an identity column may cycle. Snowflake only supports NO CYCLE. |
| IDENTITY_ORDERED | TEXT | If `YES`, the column is an identity column and has the ORDER property. If `NO`, the column is an identity column and has the NOORDER property. |
| SCHEMA_EVOLUTION_RECORD | TEXT | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY, SNOWPIPE, or SNOWPIPE_STREAMING). * FileName: The file name that triggered the evolution (NULL for SNOWPIPE_STREAMING). * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeId: A unique identifier of the triggering query or pipe (QUERY ID for COPY, PIPE ID for SNOWPIPE, or NULL for SNOWPIPE_STREAMING). * Pipe name: Fully qualified pipe name that triggered schema evolution (SNOWPIPE_STREAMING only). * Channel name: Channel that triggered schema evolution (SNOWPIPE_STREAMING only). * offsetTokenUpperBound: An offset at or before which schema evolution was triggered (SNOWPIPE_STREAMING only). |
| COMMENT | TEXT | Comment for this column. |
| DATA_TYPE_ALIAS | TEXT | The data type alias or synonym specified for the column when the table was created or when the column was last altered.  For example, the BIGINT type is synonymous with the NUMBER type. If BIGINT was specified as the type for a column, then BIGINT is displayed in this DATA_TYPE_ALIAS column.  For columns in tables that were created before the [2025_07 behavior change bundle](../../release-notes/bcr-bundles/2025_07_bundle.md) was enabled, and not altered after the behavior change, the value in this column is NULL. For more information, see [COLUMNS view (multiple schemas): New column](../../release-notes/bcr-bundles/2025_07/bcr-2061.md). |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.

## Examples

Retrieve all columns in the `myTable` table defined in the `mydb` database:

```sqlexample
USE DATABASE mydb;
SELECT *
    FROM INFORMATION_SCHEMA.COLUMNS WHERE table_name = 'myTable';
```

---
title: CORTEX_SEARCH_SERVICE_SCORING_PROFILES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/cortex_search_service_scoring_profiles.md
section: Information Schema
---

# CORTEX_SEARCH_SERVICE_SCORING_PROFILES view

This Information Schema view displays a row for each Cortex Search Service named scoring profile in the current or specified database.

For more information about named scoring profiles, see [Named scoring profiles](../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_CATALOG | VARCHAR | The database in which the service is defined. |
| SERVICE_SCHEMA | VARCHAR | The schema in which the service is defined. |
| SERVICE_NAME | VARCHAR | The name of the search service to which the profile belongs. |
| PROFILE_NAME | VARCHAR | The name of the scoring profile. |
| SCORING_PROFILE | VARCHAR | The scoring profile configuration as a JSON-format string. |

## Example

The following statement lists the named scoring profiles that are in the current database.

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.CORTEX_SEARCH_SERVICE_SCORING_PROFILES;
```

---
title: CORTEX_SEARCH_SERVICES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/cortex_search.md
section: Information Schema
---

# CORTEX_SEARCH_SERVICES view

This view shows existing Cortex Search Services in the current or specified database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_CATALOG | VARCHAR | Database that the service belongs to. |
| SERVICE_SCHEMA | VARCHAR | Schema that the service belongs to. |
| SERVICE_NAME | VARCHAR | Name of the service. |
| CREATED | TIMESTAMP_LTZ | Creation time of the service. |
| DEFINITION | VARCHAR | SQL query used to create the service. |
| SEARCH_COLUMN | VARCHAR | Name of the search column. |
| ATTRIBUTE_COLUMNS | VARCHAR | Comma-separated list of attribute columns in the service. |
| COLUMNS | VARCHAR | Comma-separated list of all columns included in the service. |
| TARGET_LAG | VARCHAR | Target lag for refreshing the service. |
| WAREHOUSE | VARCHAR | Name of the warehouse used for refreshing the service. |
| COMMENT | VARCHAR | Comment for this service. |
| SERVICE_QUERY_URL | VARCHAR | URL for querying the service. |
| OWNER | VARCHAR | Role that owns the service. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role of the service owner (one of DATABASE_ROLE or ROLE). |
| DATA_TIMESTAMP | TIMESTAMP_LTZ | Time at which the source data was checked for changes resulting in the currently serving index. |
| SOURCE_DATA_BYTES | NUMBER | Current size, in bytes, of the materialized source data. |
| SOURCE_DATA_NUM_ROWS | NUMBER | Current number of rows in the materialized source data. |
| INDEXING_STATE | VARCHAR | Indexing state of the service (one of SUSPENDED or RUNNING). |
| INDEXING_ERROR | VARCHAR | Error encountered in the last indexing pipeline, if one exists. |
| SERVING_STATE | VARCHAR | Serving state of the service (one of SUSPENDED or RUNNING). |
| SERVING_DATA_BYTES | NUMBER | Size of the billable serving data, in bytes. |
| EMBEDDING_MODEL | VARCHAR | The vector embedding model used by the service. |
| PRIMARY_KEY_COLUMNS | VARCHAR | Comma-separated list of primary key column names defined on the service. Empty if no primary key is set. |

## Example

```sqlexample
SELECT * FROM SNOWFLAKE.INFORMATION_SCHEMA.CORTEX_SEARCH_SERVICES;
```

---
title: CURRENT_PACKAGES_POLICY view
source: https://docs.snowflake.com/en/sql-reference/info-schema/current_packages_policy.md
section: Information Schema
---

# CURRENT_PACKAGES_POLICY view

This Information Schema view displays a row for each Snowpark packages policy created on the current account by the
[CREATE PACKAGES POLICY](../sql/create-packages-policy.md) command. For details, see
[Packages policies](../../developer-guide/udf/python/packages-policy.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | The name of the package policy |
| LANGUAGE | VARCHAR | The programming language the packages policy applies to |
| ALLOWLIST | VARCHAR | The list of package specs that are allowed |
| BLOCKLIST | VARCHAR | The list of package specs that are blocked |
| ADDITIONAL_CREATION_BLOCKLIST | VARCHAR | The list of package specs that are blocked at creation time |
| COMMENT | VARCHAR | A comment about the packages policy |

## Usage notes

Currently, package policies are supported for Python.

---
title: DATABASES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/databases.md
section: Information Schema
---

# DATABASES view

This Information Schema view displays a row for each database defined in your account.

> **Note:**
>
> This view uses Snowflake terminology of “database” whereas all the other Information Schema views use the standard INFORMATION_SCHEMA terminology of “catalog”. The two terms have the same meaning.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_NAME | VARCHAR | Name of the database. |
| DATABASE_OWNER | VARCHAR | Name of the role that owns the database. |
| IS_TRANSIENT | VARCHAR | Whether this is a transient database. |
| COMMENT | VARCHAR | Comment for this database. |
| CREATED | TIMESTAMP_LTZ | Creation time of the database. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| TYPE | VARCHAR | Specifies the type of database. Valid values are: . . - APPLICATION : a Snowflake Native App. . - APPLICATION_PACKAGE : an application package. . - STANDARD: a normal database. . - IMPORTED DATABASE: a database created from a share. . - PERSONAL DATABASE: a personal database, linked to its owner. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* This view contains all of the databases in the account (regardless of the database’s INFORMATION_SCHEMA used to query the view).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: DCM_DEPLOYMENT_HISTORY
source: https://docs.snowflake.com/en/sql-reference/info-schema/dcm_deployment_history.md
section: Information Schema
---

Categories:
:   [Information Schema](../info-schema.md) , [Table functions](../functions-table.md)

# DCM_DEPLOYMENT_HISTORY

This table function returns the deployment history for DCM project objects. You can use it to
query successful and failed deployments, including timestamps, status, error details, and summary
statistics. The function provides role-based access and low-latency results.

## Syntax

```sqlsyntax
DCM_DEPLOYMENT_HISTORY(
      [ PROJECT_NAME => '<string>' ]
      [, START_TIME_RANGE_START => <constant_expr> ]
      [, START_TIME_RANGE_END => <constant_expr> ]
      [, RESULT_LIMIT => <integer> ] )
```

## Arguments

All arguments are optional.

`PROJECT_NAME => 'string'`
:   Fully qualified name of the DCM project. If not provided, the function returns history for all
    projects accessible by the current role.

`START_TIME_RANGE_START => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format) marking the start of the time range for retrieving
    deployment events.

    Default: 7 days ago.

`START_TIME_RANGE_END => constant_expr`
:   Timestamp (in TIMESTAMP_LTZ format) marking the end of the time range for retrieving
    deployment events.

    Default: current timestamp.

`RESULT_LIMIT => integer`
:   Maximum number of rows to return.

    Default: `10000`.

## Output

The function returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `QUERY_UUID` | VARCHAR | Unique identifier of the query that executed the deployment. |
| `PROJECT_NAME` | VARCHAR | Name of the DCM project that was deployed. |
| `START_TIMESTAMP` | TIMESTAMP_LTZ | Timestamp of when the deployment execution started. |
| `END_TIMESTAMP` | TIMESTAMP_LTZ | Timestamp of when the deployment execution completed or failed. |
| `DEPLOYMENT_NAME` | VARCHAR | Internal deployment identifier (for example, `DEPLOYMENT$1`, `DEPLOYMENT$2`). |
| `DEPLOYMENT_ALIAS` | VARCHAR | User-specified alias for the deployment. Empty if no alias was provided. |
| `STATUS` | VARCHAR | Result of the deployment. Possible values: `SUCCESSFUL`, `FAILED`, `CANCELED`. |
| `PHASE` | VARCHAR | The phase of the execution. Possible values: `PLAN`, `DEPLOY`, `INIT`. |
| `CONFIGURATION_PROFILE` | VARCHAR | Name of the configuration profile used for the deployment. Empty if no configuration was specified. |
| `ERROR_MESSAGE` | VARCHAR | Error message if the deployment failed. Empty for successful deployments. |
| `ERROR_CODE` | VARCHAR | Error code if the deployment failed. Empty for successful deployments. |
| `DATABASE_NAME` | VARCHAR | Database that contains the DCM project. |
| `SCHEMA_NAME` | VARCHAR | Schema that contains the DCM project. |
| `EXECUTOR_ROLE` | VARCHAR | Role that executed the deployment command. |
| `STATS` | VARIANT | JSON object containing summary statistics of the deployment, broken down by category. Each category contains counts of `created`, `altered`, and `dropped` items. Categories include `entities` (managed objects), `columns`, `grants`, and `dmfAttachments` (data metric function expectations). |

## Usage notes

* When calling an Information Schema table function, the session must have an INFORMATION_SCHEMA
  schema in use or the function name must be fully qualified. For more details, see
  [Snowflake Information Schema](../info-schema.md).

## Examples

Retrieve deployment history for a specific project, limited to 3 results:

> ```sqlexample
> SELECT
>   PROJECT_NAME,
>   START_TIMESTAMP,
>   DEPLOYMENT_NAME,
>   DEPLOYMENT_ALIAS,
>   STATUS,
>   CONFIGURATION_PROFILE,
>   EXECUTOR_ROLE
> FROM
>   TABLE (MY_DB.INFORMATION_SCHEMA.DCM_DEPLOYMENT_HISTORY(
>     project_name => 'MY_DB.PROJECTS.MY_PROJECT',
>     result_limit => 3
>   ));
> ```
>
> ```text
> +----------------+-----------------------------+--------------+------------------+------------+-----------------------+------------------+
> | PROJECT_NAME   | START_TIMESTAMP             | DEPLOYMENT   | DEPLOYMENT       | STATUS     | CONFIGURATION_PROFILE | EXECUTOR_ROLE    |
> |                |                             | _NAME        | _ALIAS           |            |                       |                  |
> +----------------+-----------------------------+--------------+------------------+------------+-----------------------+------------------+
> | MY_PROJECT     | 2026-03-20 09:15:22.254     | DEPLOYMENT$3 | staging update   | SUCCESSFUL | STAGE                 | PROJECT_DEPLOYER |
> | MY_PROJECT     | 2026-03-19 14:30:10.927     | DEPLOYMENT$2 |                  | FAILED     | DEV                   | PROJECT_DEPLOYER |
> | MY_PROJECT     | 2026-03-18 11:00:05.339     | DEPLOYMENT$1 | initial deploy   | SUCCESSFUL | DEV                   | PROJECT_DEPLOYER |
> +----------------+-----------------------------+--------------+------------------+------------+-----------------------+------------------+
> ```

The `STATS` column contains a JSON object with the following structure:

> ```text
> {
>   "columns": {
>     "altered": 0,
>     "created": 12,
>     "dropped": 0
>   },
>   "dmfAttachments": {
>     "altered": 0,
>     "created": 2,
>     "dropped": 0
>   },
>   "entities": {
>     "altered": 1,
>     "created": 5,
>     "dropped": 0
>   },
>   "grants": {
>     "altered": 0,
>     "created": 4,
>     "dropped": 0
>   }
> }
> ```

Retrieve all columns for all projects accessible by the current role within the last 24 hours:

> ```sqlexample
> SELECT *
> FROM TABLE (INFORMATION_SCHEMA.DCM_DEPLOYMENT_HISTORY(
>   start_time_range_start => DATEADD(hours, -24, CURRENT_TIMESTAMP())
> ));
> ```

---
title: ELEMENT_TYPES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/element_types.md
section: Information Schema
---

# ELEMENT_TYPES view

This Information Schema view displays a row for each [structured ARRAY type](../data-types-structured.md) in an
object (a column in a table) in the specified (or current) database.

Each row describes the type of the element in the structured ARRAY.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| OBJECT_CATALOG | VARCHAR | Database that contains the object that uses this ARRAY type. |
| OBJECT_SCHEMA | VARCHAR | Schema that contains the object that uses this ARRAY type. |
| OBJECT_NAME | VARCHAR | Name of the object that uses this ARRAY type (e.g. name of a table). |
| OBJECT_TYPE | VARCHAR | Type of the object that uses this ARRAY type:   * TABLE (if used by a column) |
| COLLECTION_TYPE_IDENTIFIER | VARCHAR | Type identifier. Use this to join on:   * The DTD_IDENTIFIER column in the [COLUMNS view](columns.md). * The DTD_IDENTIFIER column in this view (for nested types). * The DTD_IDENTIFIER column in the [FIELDS view](fields.md) (for nested types). |
| DATA_TYPE | VARCHAR | Data type of the element. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string elements. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string elements. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric elements. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric elements. |
| NUMERIC_SCALE | NUMBER | Scale of numeric elements. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | VARCHAR | Not applicable for Snowflake. |
| INTERVAL_PRECISION | NUMBER | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| COLLATION_CATALOG | VARCHAR | Not applicable for Snowflake. |
| COLLATION_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | The collation specification for this element |
| UDT_CATALOG | VARCHAR | Not applicable for Snowflake. |
| UDT_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| UDT_NAME | VARCHAR | Not applicable for Snowflake. |
| SCOPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| SCOPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| SCOPE_NAME | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | NUMBER | Maximum cardinality. Currently, this is always set to NULL. |
| DTD_IDENTIFIER | VARCHAR | Nested type identifier. Use this to join on:   * The COLLECTION_TYPE_IDENTIFIER column in this view. * The ROW_IDENTIFIER column in the [FIELDS view](fields.md) (for nested types). |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.

  The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to the
  [SHOW COLUMNS](../sql/show-columns.md) command when both are executed by a user who holds the MANAGE GRANTS privilege.

---
title: ENABLED_ROLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/enabled_roles.md
section: Information Schema
---

# ENABLED_ROLES view

This Information Schema view displays a row for each currently-enabled role in the session. A role is enabled if it is currently in use in the session or it has been granted to the role that is currently
in use.

For more information about roles, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [APPLICABLE_ROLES view](applicable_roles.md) , [OBJECT_PRIVILEGES view](object_privileges.md) , [TABLE_PRIVILEGES view](table_privileges.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ROLE_NAME | VARCHAR | Name of the role |
| ROLE_OWNER | VARCHAR | Owner of the role |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view always displays the PUBLIC role because it is always enabled.
* The view does not display any information about [database roles](../../user-guide/security-access-control-considerations.md).

---
title: EVENT_TABLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/event_tables.md
section: Information Schema
---

# EVENT_TABLES view

This Information Schema view displays a row for each event table and view in the specified (or current) database, including the views in
the INFORMATION_SCHEMA schema itself.

See also:
:   [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md), [VIEWS view](views.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | VARCHAR | Database that the event table belongs to |
| TABLE_SCHEMA | VARCHAR | Schema that the event table belongs to |
| TABLE_NAME | VARCHAR | Name of the event table |
| TABLE_OWNER | VARCHAR | Name of the role that owns the event table |
| CREATED | TIMESTAMP_LTZ | Creation time of the event table |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this event table |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the
  MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are executed with a role that was
  granted the MANAGE GRANTS privilege.

  This behavior also applies to other account-level [privileges](../../user-guide/security-access-control-privileges.md) and Information
  Schema views for which there is a corresponding SHOW command.
* The view does not include event tables that have been dropped. To view dropped tables, use [SHOW EVENT TABLES](../sql/show-event-tables.md) instead.
* To view only event tables in your queries, filter using a WHERE clause, e.g.:

  > `... WHERE table_schema != 'INFORMATION_SCHEMA'`
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve the names of all event tables in all schemas in the `mydatabase` database:

```sqlexample
SELECT TABLE_NAME
    FROM mydatabase.information_schema.event_tables;
```

---
title: EXTERNAL_TABLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/external_tables.md
section: Information Schema
---

# EXTERNAL_TABLES view

This Information Schema view displays a row for each external table in the specified (or current) database.

See also:
:   [COLUMNS view](columns.md) , [VIEWS view](views.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to |
| TABLE_NAME | VARCHAR | Name of the table |
| TABLE_OWNER | VARCHAR | Name of the role that owns the table |
| CREATED | TIMESTAMP_LTZ | Creation time of the table |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| AUTO_CLUSTERING_ON | BOOLEAN | Whether automatic clustering is enabled for the table |
| COMMENT | VARCHAR | Comment for this table |
| LOCATION | VARCHAR | External stage where the files containing data to be read are staged |
| FILE_FORMAT_NAME | VARCHAR | Named file format that describes the staged data files to scan when querying the external table |
| FILE_FORMAT_TYPE | VARCHAR | Format type of the staged data files to scan when querying the external table |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* The view does not include external tables that have been dropped.
* To view only external tables in your queries, filter using a WHERE clause, e.g.:

  > `... WHERE table_schema != 'INFORMATION_SCHEMA'`
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.
* The LAST_ALTERED column does not necessarily indicate the last refreshed time for external tables.
  To retrieve the last refreshed time for an auto-refreshed external table, you can use the
  [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](../functions/system_external_table_pipe_status.md) function, which returns
  information such as the timestamp of the last file Snowflake has registered.

## Examples

Retrieve the list of all external tables in all schemas in the `mydatabase` database:

> ```sqlexample
> SELECT table_name, last_altered FROM mydatabase.information_schema.external_tables;
> ```

---
title: FIELDS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/fields.md
section: Information Schema
---

# FIELDS view

This Information Schema view displays a row for each field in a
[structured OBJECT type](../data-types-structured.md) and a row for the key and value in a
[MAP](../data-types-structured.md) in an object (a column in a table) in the specified (or current) database.

For MAPs, the view contains separate rows for the key and value.

Each row describes the type of the element in the structured ARRAY.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| OBJECT_CATALOG | VARCHAR | Database that contains the object that uses this OBJECT or MAP type. |
| OBJECT_SCHEMA | VARCHAR | Schema that contains the object that uses this OBJECT or MAP type. |
| OBJECT_NAME | VARCHAR | Name of the object that uses this OBJECT or MAP type (e.g. name of a table). |
| OBJECT_TYPE | VARCHAR | Type of the object that uses this OBJECT or MAP type:   * TABLE (if used by a column) |
| ROW_IDENTIFIER | VARCHAR | Type identifier. Use this to join on:   * The DTD_IDENTIFIER column in the [COLUMNS view](columns.md). * The DTD_IDENTIFIER column in the [ELEMENT_TYPES view](element_types.md) (for nested types). * The DTD_IDENTIFIER column in this view (for nested types). |
| FIELD_NAME | VARCHAR | One of the following values:   * For structured OBJECTs, the name of the key. * For MAPs, KEY for the key or VALUE for the value. |
| ORDINAL_POSITION | NUMBER | The ordinal position of the key in the OBJECT or MAP. The position is 1-based.  For MAPs, the ordinal position of the key is 1, and the ordinal position of the value is 2. |
| DATA_TYPE | VARCHAR | Data type of the value (for OBJECTs) or the key or value (for MAPs). |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters of string keys or values. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes of string keys or values. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric keys or values. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric keys or values. |
| NUMERIC_SCALE | NUMBER | Scale of numeric keys or values. |
| DATETIME_PRECISION | NUMBER | Not applicable for Snowflake. |
| INTERVAL_TYPE | VARCHAR | Not applicable for Snowflake. |
| INTERVAL_PRECISION | NUMBER | Not applicable for Snowflake. |
| CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| COLLATION_CATALOG | VARCHAR | Not applicable for Snowflake. |
| COLLATION_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | The collation specification for this keys or values. |
| UDT_CATALOG | VARCHAR | Not applicable for Snowflake. |
| UDT_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| UDT_NAME | VARCHAR | Not applicable for Snowflake. |
| SCOPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| SCOPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| SCOPE_NAME | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_CARDINALITY | NUMBER | Maximum cardinality. Currently, this is always set to NULL. |
| DTD_IDENTIFIER | VARCHAR | Nested type identifier. Use this to join on:   * The COLLECTION_TYPE_IDENTIFIER column in the [ELEMENT_TYPES view](element_types.md). * The ROW_IDENTIFIER column in this view (for nested types). |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.

  The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to the
  [SHOW COLUMNS](../sql/show-columns.md) command when both are executed by a user who holds the MANAGE GRANTS privilege.

---
title: FILE_FORMATS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/file_formats.md
section: Information Schema
---

# FILE_FORMATS view

This Information Schema view displays a row for each file format defined in the specified (or current) database.

File formats are named objects that can be used for loading/unloading data. For more information, see [CREATE FILE FORMAT](../sql/create-file-format.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FILE_FORMAT_CATALOG | VARCHAR | Database that the file format belongs to |
| FILE_FORMAT_SCHEMA | VARCHAR | Schema that the file format belongs to |
| FILE_FORMAT_NAME | VARCHAR | Name of the file format |
| FILE_FORMAT_OWNER | VARCHAR | Name of the role that owns the file format |
| FILE_FORMAT_TYPE | VARCHAR | Type of the file format |
| RECORD_DELIMITER | VARCHAR | Character that separates records |
| FIELD_DELIMITER | VARCHAR | Character that separates fields |
| SKIP_HEADER | NUMBER | Number of lines skipped at the start of the file |
| DATE_FORMAT | VARCHAR | Date format |
| TIME_FORMAT | VARCHAR | Time format |
| TIMESTAMP_FORMAT | VARCHAR | Timestamp format |
| BINARY_FORMAT | VARCHAR | Binary format |
| ESCAPE | VARCHAR | String used as the escape character for any field values |
| ESCAPE_UNENCLOSED_FIELD | VARCHAR | String used as the escape character for unenclosed field values |
| TRIM_SPACE | VARCHAR | Whether whitespace is removed from fields |
| FIELD_OPTIONALLY_ENCLOSED_BY | VARCHAR | Character used to enclose strings |
| NULL_IF | VARCHAR | A list of strings to be replaced by null |
| COMPRESSION | VARCHAR | Compression method for the data file |
| ERROR_ON_COLUMN_COUNT_MISMATCH | VARCHAR | Whether to generate a parsing error if the number of fields in an input file does not match the number of columns in the corresponding table |
| CREATED | TIMESTAMP_LTZ | Creation time of the file format |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this file format |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: FUNCTIONS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/functions.md
section: Information Schema
---

# FUNCTIONS view

This Information Schema view displays a row for each user-defined function (UDF), external function, or data metric function defined in the
specified (or current) database.

For more information about external functions, see [Writing external functions](../external-functions.md).
For more information about UDFs, see [User-defined functions overview](../../developer-guide/udf/udf-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| FUNCTION_CATALOG | VARCHAR | Database to which the function belongs. |
| FUNCTION_SCHEMA | VARCHAR | Schema to which the function belongs. |
| FUNCTION_NAME | VARCHAR | Function name. |
| FUNCTION_OWNER | VARCHAR | Name of the role that owns the function. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the function’s arguments. |
| DATA_TYPE | VARCHAR | Data type of the function’s return value. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER(9,0) | Maximum length in characters of a string return value. |
| CHARACTER_OCTET_LENGTH | NUMBER(9,0) | Maximum length in bytes of a string return value. |
| NUMERIC_PRECISION | NUMBER(9,0) | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER(9,0) | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER(9,0) | Scale of numeric return value. |
| FUNCTION_LANGUAGE | VARCHAR | Language of the function’s handler. |
| FUNCTION_DEFINITION | VARCHAR | Definition of the function’s handler. |
| VOLATILITY | VARCHAR | VOLATILE if the function is [volatile](../sql/create-function.md); IMMUTABLE if it is [immutable](../sql/create-function.md). |
| IS_NULL_CALL | VARCHAR(3) | `YES` if the function is [called on null input](../sql/create-function.md); otherwise, `NO`. |
| IS_SECURE | VARCHAR(3) | `YES` if the function is [secure](../../developer-guide/secure-udf-procedure.md); otherwise, `NO`. |
| CREATED | TIMESTAMP_LTZ | Creation time of the function. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the function. |
| IS_EXTERNAL [1] | VARCHAR(3) | `YES` if the function is an [external function](../external-functions.md); otherwise, `NO`. |
| API_INTEGRATION [1] | VARCHAR | Name of the [API integration object to authenticate](../external-functions-introduction.md) the call to the proxy service an external function makes. |
| CONTEXT_HEADERS [1] | VARCHAR | Context header information for the external function. |
| MAX_BATCH_ROWS [1] | NUMBER(9,0) | Maximum number of rows in each batch sent to the proxy service for an external function. |
| REQUEST_TRANSLATOR [1] | VARCHAR | Name of the external function’s [request translator](../external-functions-translators.md) (if any). |
| RESPONSE_TRANSLATOR [1] | VARCHAR | Name of the external function’s [response translator](../external-functions-translators.md) (if any). |
| COMPRESSION [1] | VARCHAR | Type of compression used for serializing function payload. |
| IMPORTS | VARCHAR | Names of files (including their stage location and path) containing imported libraries. |
| HANDLER | VARCHAR | Name of the handler function or class. |
| TARGET_PATH | VARCHAR | Path to the stage in which Snowflake stores the compiled result of [inline handler code](../../developer-guide/inline-or-staged.md). |
| RUNTIME_VERSION | VARCHAR | Runtime version of the function’s handler language; NULL if the function handler is written in SQL or JavaScript. |
| PACKAGES | VARCHAR | Names of packages specified in the PACKAGES clause of the [CREATE FUNCTION](../sql/create-function.md) statement. Currently, this column applies only when the handler is written in Python, Java, or Scala. |
| INSTALLED_PACKAGES | VARCHAR | Names of all packages installed by the function. This includes packages specified by the PACKAGES clause as well as their installed dependencies. Currently, this column applies only when the handler is written in Python. |
| IS_MEMOIZABLE | VARCHAR(3) | `YES` if the function is [memoizable](../../developer-guide/udf/sql/udf-sql-scalar-functions.md); otherwise, `NO`. |
| IS_DATA_METRIC | VARCHAR(3) | `YES` if the function is a [data metric function](../../user-guide/data-quality-intro.md); otherwise, `NO`. |
| IS_AGGREGATE | VARCHAR(3) | `YES` if the function is an aggregate function; otherwise, `NO`. |

[1]
(1,2,3,4,5,6,7)

These fields apply only to [Writing external functions](../external-functions.md).

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
  The view does not honor the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command when both are
  executed by a user who holds the MANAGE GRANTS privilege.

* Omitting a length for the VARCHAR type results in a VARCHAR that specifies the default maximum length. For more information, see
  [VARCHAR](../data-types-text.md).

* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: HYBRID_TABLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/hybrid_tables.md
section: Information Schema
---

# HYBRID_TABLES view

This Information Schema view displays a row for each hybrid table defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CATALOG | TEXT | Database to which the hybrid table belongs. |
| SCHEMA | TEXT | Schema to which the hybrid table belongs. |
| NAME | TEXT | Name of the hybrid table. |
| OWNER | TEXT | Owner of the hybrid table. |
| ROW_COUNT | NUMBER | Approximate row count of the hybrid table. |
| BYTES | NUMBER | Approximate size in bytes of the row store of the hybrid table. |
| RETENTION_TIME | NUMBER | Retention time for data in the hybrid table. |
| CREATED | TIMESTAMP_LTZ | Creation time of the hybrid table. |
| LAST_ALTERED | TIMESTAMP_LTZ | The last time this hybrid table was altered by a DDL statement, a TRUNCATE or INSERT OVERWRITE statement, or a compaction job. Note that regular DML operations are not recorded here. |
| COMMENT | TEXT | Comment for the hybrid table. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are executed with a role that was granted the MANAGE GRANTS privilege.
* Just as with SHOW TABLES and SHOW HYBRID TABLES, the bytes and row count are approximate.

---
title: INDEX_COLUMNS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/index_columns.md
section: Information Schema
---

# INDEX_COLUMNS view

This Information Schema view displays a row for each column in the indexes defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | TEXT | Database to which the hybrid table belongs. |
| TABLE_SCHEMA | TEXT | Schema to which the hybrid table belongs. |
| TABLE_NAME | TEXT | Name of the hybrid table where the index is defined. |
| INDEX_NAME | TEXT | Name of the index on the hybrid table. |
| NAME | TEXT | Name of the column that is participating in the index. |
| KEY_SEQUENCE | NUMBER | Position of the column in the index, starting from 1. |
| INDEX_OWNER | TEXT | Owner of the index. |
| IS_UNIQUE | TEXT | With `YES` or `NO`, indicates whether this index is a unique index. |
| CONSTRAINT_NAME | TEXT | Name of the constraint that is associated with this index. |
| STATUS | TEXT | Status of this index. |
| CREATED | TIMESTAMP_LTZ | Time of creation for this index. |
| IS_INCLUDED_COLUMN | TEXT | With `YES` or `NO`, indicates whether this column is covered by an index. |

---
title: INDEXES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/indexes.md
section: Information Schema
---

# INDEXES view

This Information Schema view displays a row for each index defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | TEXT | Database to which the hybrid table belongs. |
| TABLE_SCHEMA | TEXT | Schema to which the hybrid table belongs. |
| TABLE_NAME | TEXT | Name of the hybrid table where the index is defined. |
| NAME | TEXT | Name of the index on the hybrid table. |
| OWNER | TEXT | Owner of the index. |
| IS_UNIQUE | TEXT | With `YES` or `NO`, indicates whether this index is a unique index. |
| CONSTRAINT_NAME | TEXT | Name of the constraint that is associated with this index. |
| STATUS | TEXT | Status of this index. |
| CREATED | TIMESTAMP_LTZ | Time of creation for this index. |

---
title: INFORMATION_SCHEMA_CATALOG_NAME view
source: https://docs.snowflake.com/en/sql-reference/info-schema/information_schema_catalog_name.md
section: Information Schema
---

# INFORMATION_SCHEMA_CATALOG_NAME view

This Information Schema view identifies the database (or catalog, in SQL terminology) that contains the INFORMATION_SCHEMA schema.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CATALOG_NAME | VARCHAR | The name of the database in which this information_schema resides. |

## Usage notes

* This view always contains a single row.

---
title: LISTINGS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/listings.md
section: Information Schema
---

# LISTINGS view

This Information Schema view displays all listings for which the current role has been granted access privileges. This view provides real time information with no latency of data.

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| GLOBAL_NAME | VARCHAR | The global name of the listing. |
| NAME | VARCHAR | The name of the listing. |
| OWNER | VARCHAR | The name of the role that owns the listing. |
| CREATED_ON | TIMESTAMP_LTZ | The timestamp when the listing was created. |
| UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the listing was last updated. |
| PUBLISHED_ON | TIMESTAMP_LTZ | The timestamp when the listing was published. |
| TITLE | VARCHAR | The title of the listing. |
| SUBTITLE | VARCHAR | The subtitle of the listing. |
| DESCRIPTION | VARCHAR | The description of the listing. |
| LISTING_TERMS | VARCHAR | The terms of service associated with the listing. |
| STATE | VARCHAR | The current state of the listing. |
| SHARE | VARCHAR | The name of the share associated with the listing. |
| APPLICATION_PACKAGE | VARCHAR | The name of the application package associated with the listing. |
| DATA_ATTRIBUTES | VARCHAR | Data attributes associated with the listing. |
| CATEGORIES | VARCHAR | Categories associated with the listing. |
| PROFILE | VARCHAR | The profile attached to the external listing. |
| CUSTOMIZED_CONTACT_INFO | VARCHAR | Customized contact information associated with the listing. |
| COMMENT | VARCHAR | Comment associated with the listing, if any. |
| TARGETS | VARCHAR | The targets consolidating external/organization listings with regions. |
| AUTO_FULFILLMENT | VARCHAR | Auto-fulfillment information associated with the listing. |
| IS_SHARE | BOOLEAN | Indicates whether this is a data share listing. |
| IS_APPLICATION | BOOLEAN | Indicates whether this is an application listing. |
| DISTRIBUTION | VARCHAR | The distribution of the listing. Possible values are `EXTERNAL` and `ORGANIZATION`. |
| IS_MOUNTLESS_QUERYABLE | BOOLEAN | Indicates whether the listing is mountless queryable. |
| ORGANIZATION_PROFILE_NAME | VARCHAR | The organization profile attached to the listing, if any. |
| UNIFORM_LISTING_LOCATOR | VARCHAR | The uniform listing locator (ULL) of the listing. |
| APPROVER_CONTACT | VARCHAR | The approver contact information associated with the listing. |
| SUPPORT_CONTACT | VARCHAR | The support contact information associated with the listing. |
| RESHARING | VARCHAR | Resharing configuration of the listing. |

## Usage notes

The view doesn’t capture deleted listings.

## Examples

Retrieve all listings in the current account:

```sqlexample
SELECT * FROM <any_database>.INFORMATION_SCHEMA.LISTINGS;
```

---
title: LOAD_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/info-schema/load_history.md
section: Information Schema
---

# LOAD_HISTORY view

This Information Schema view enables you to retrieve the history of data loaded into tables using the [COPY INTO <table>](../sql/copy-into-table.md) command within the last 14 days. The view displays one row for each file loaded.

> **Note:**
>
> This view does not return the history of data loaded using Snowpipe. For this historical information, query the [COPY_HISTORY](../functions/copy_history.md) table function instead.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SCHEMA_NAME | VARCHAR | Schema of target table |
| FILE_NAME | VARCHAR | Name of source file |
| TABLE_NAME | VARCHAR | Name of target table |
| LAST_LOAD_TIME | TIMESTAMP_LTZ | Timestamp of the load record |
| STATUS | VARCHAR | Status: `LOADED`, `LOAD FAILED`, or `PARTIALLY LOADED` |
| ROW_COUNT | NUMBER | Number of rows loaded from the source file |
| ROW_PARSED | NUMBER | Number of rows parsed from the source file |
| FIRST_ERROR_MESSAGE | VARCHAR | First error of the source file |
| FIRST_ERROR_LINE_NUMBER | NUMBER | Line number of the first error |
| FIRST_ERROR_CHARACTER_POSITION | NUMBER | Position of the first error character |
| FIRST_ERROR_COL_NAME | VARCHAR | Column name of the first error |
| ERROR_COUNT | NUMBER | Number of error rows in the source file |
| ERROR_LIMIT | NUMBER | If the number of errors reaches this limit, then abort |

## Usage notes

* The historical data for COPY INTO commands is removed from the view when a table is dropped.
* The view only includes COPY INTO commands that executed to completion, with or without errors. No record is added if the transaction is rolled back, for example, or if the ON_ERROR = ABORT_STATEMENT copy option is included in the COPY INTO *<table>* statement and a detected error in a data file aborts the load operation.
* This view returns an upper limit of 10,000 rows. To avoid this limitation, use the [LOAD_HISTORY view](../account-usage/load_history.md) (Account Usage), [COPY_HISTORY function](../functions/copy_history.md) (Information Schema), or the [COPY_HISTORY view](../account-usage/copy_history.md) (Account Usage).
* When including a WHERE clause that references the `LAST_LOAD_TIME` column, you can specify any day of the week. For example, April 1, 2016 was a Friday; however, specifying Sunday instead does not
  affect the query results:

  ```sqlexample
  WHERE last_load_time > 'Sun, 01 Apr 2016 16:00:00 -0800'
  ```

* The LOAD_HISTORY view shows load history only after the latest truncate operation on the target table. This applies to the LOAD_HISTORY views before and after
  [replication](../../user-guide/account-replication-intro.md).

## Examples

Retrieve the history of data loaded into the `MYDB.PUBLIC.MYTABLE` table since April 1, 2016, assuming that April 1 occurred within the previous 14 days:

> ```sqlexample
> USE DATABASE mydb;
>
> SELECT table_name, last_load_time
>   FROM information_schema.load_history
>   WHERE schema_name=current_schema() AND
>   table_name='MYTABLE' AND
>   last_load_time > 'Fri, 01 Apr 2016 16:00:00 -0800';
> ```

Retrieve records for the 10 most recent COPY INTO commands executed against the `MYDB` database:

> ```sqlexample
> USE DATABASE mydb;
>
> SELECT table_name, last_load_time
>   FROM information_schema.load_history
>   ORDER BY last_load_time DESC
>   LIMIT 10;
> ```

---
title: MODEL_VERSIONS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/model_versions.md
section: Information Schema
---

# MODEL_VERSIONS view

This Information Schema view displays a row for each machine learning model version defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_NAME | TEXT | Database to which the model version belongs. |
| SCHEMA_NAME | TEXT | Schema to which the model version belongs. |
| MODEL_NAME | TEXT | Model to which the model version belongs. |
| MODEL_VERSION_NAME | TEXT | Name of the model version. |
| VERSION_ALIASES | ARRAY | List of aliases of the model version. |
| COMMENT | TEXT | Comment for the model version. |
| OWNER | TEXT | Name of the role that owns the model version. |
| CREATED | TIMESTAMP_LTZ | Date and time when model version was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time when the model version was last updated. |
| FUNCTIONS | TEXT | Functions in the model version. |
| MODEL_TYPE | TEXT | Type of the model to which the model version belongs. |
| PYTHON_VERSION | TEXT | Version of Python required by the model version. |
| LANGUAGE | TEXT | Language in which the model version is implemented. |
| DEPENDENCIES | TEXT | Dependencies of the model version. |
| METADATA | TEXT | Metadata of the model version. |
| USERDATA | TEXT | User data of the model version. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The view does not include model versions that have been dropped.

## Examples

Retrieve the names of all model versions, functions, and the model they belong to, in the `mydatabase` database:

```sqlexample
SELECT model_version_name, model_name, functions, schema_name, database_name
    FROM mydatabase.INFORMATION_SCHEMA.MODEL_VERSIONS;
```

---
title: OBJECT_PRIVILEGES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/object_privileges.md
section: Information Schema
---

# OBJECT_PRIVILEGES view

This Information Schema view displays a row for each access privilege granted for all objects defined in your account. It includes the privileges displayed in the [TABLE_PRIVILEGES view](table_privileges.md) and
[USAGE_PRIVILEGES view](usage_privileges.md).

For more information about privileges and their impact on object access, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [APPLICABLE_ROLES view](applicable_roles.md) , [ENABLED_ROLES view](enabled_roles.md) , [TABLE_PRIVILEGES view](table_privileges.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| GRANTOR | VARCHAR | Role who granted the privilege |
| GRANTEE | VARCHAR | Role to whom the privilege is granted |
| GRANTED_TO | VARCHAR | Type of object that has been granted the privilege |
| OBJECT_CATALOG | VARCHAR | Database containing the object on which the privilege is granted |
| OBJECT_SCHEMA | VARCHAR | Schema containing the object on which the privilege is granted |
| OBJECT_NAME | VARCHAR | Name of the object on which the privilege is granted |
| OBJECT_TYPE | VARCHAR | Type of the object on which the privilege is granted |
| PRIVILEGE_TYPE | VARCHAR | Type of the granted privilege |
| IS_GRANTABLE | VARCHAR | Whether the privilege was granted WITH GRANT OPTION |
| CREATED | TIMESTAMP_LTZ | Creation time of the privilege |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: PACKAGES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/packages.md
section: Information Schema
---

# PACKAGES view

This Information Schema view displays a row for each Snowpark package version supported for use in the PACKAGES clause in the
[CREATE FUNCTION](../sql/create-function.md) and [CREATE PROCEDURE](../sql/create-procedure.md) commands. For Python, this view also displays a
row for each version of a third-party package that you can install. For details, see
[Using third-party packages](../../developer-guide/udf/python/udf-python-packages.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PACKAGE_NAME | VARCHAR | The name of the package |
| VERSION | VARCHAR | The version number of the package |
| LANGUAGE | VARCHAR | The programming language for the package |

## Usage notes

Currently, the package versions with the following are supported:

* `language = java`
* `language = python`
* `language = scala`

## Examples

List all available versions of the pandas package for Python:

> ```sqlexample
> SELECT package_name, version
>   FROM information_schema.packages
>   WHERE language = 'python'
>     AND package_name = 'pandas'
>   ORDER BY version;
> ```

---
title: PIPES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/pipes.md
section: Information Schema
---

# PIPES view

This Information Schema view displays a row for each pipe defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PIPE_CATALOG | VARCHAR | Database that the pipe belongs to |
| PIPE_SCHEMA | VARCHAR | Schema that the pipe belongs to |
| PIPE_NAME | VARCHAR | Name of the pipe |
| PIPE_OWNER | VARCHAR | Name of the role that owns the pipe |
| DEFINITION | VARCHAR | COPY statement used to load data from queued files into a Snowflake table. |
| IS_AUTOINGEST_ENABLED | VARCHAR | Whether AUTO-INGEST is enabled for the pipe. Represents future functionality. |
| NOTIFICATION_CHANNEL_NAME | VARCHAR | Amazon Resource Name of the Amazon SQS queue for the stage named in the DEFINITION column. Represents future functionality. |
| CREATED | TIMESTAMP_LTZ | Creation time of the pipe |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this pipe |
| PATTERN | VARCHAR | PATTERN copy option value in the [COPY INTO <table>](../sql/copy-into-table.md) statement in the pipe definition, if the copy option was specified. |

## Usage notes

* Returns results only for the pipe owner (i.e. the role with the OWNERSHIP privilege on the pipe) or a role with the MONITOR privilege on
  the pipe.
* To determine the current status of a pipe, query the [SYSTEM$PIPE_STATUS](../functions/system_pipe_status.md) function.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: PROCEDURES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/procedures.md
section: Information Schema
---

# PROCEDURES view

This Information Schema view displays a row for each stored procedure defined in the specified (or current) database.

For more information about stored procedures, see [Stored procedures overview](../../developer-guide/stored-procedure/stored-procedures-overview.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| PROCEDURE_CATALOG | VARCHAR | Database that the stored procedure belongs to. |
| PROCEDURE_SCHEMA | VARCHAR | Schema that the stored procedure belongs to. |
| PROCEDURE_NAME | VARCHAR | Name of the stored procedure. |
| PROCEDURE_OWNER | VARCHAR | Name of the role that owns the stored procedure. |
| ARGUMENT_SIGNATURE | VARCHAR | Type signature of the stored procedure’s arguments. |
| DATA_TYPE | VARCHAR | Return value data type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length of string return value, in characters. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length of string return value, in bytes. |
| NUMERIC_PRECISION | NUMBER | Numeric precision of numeric return value. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of precision of numeric return value. |
| NUMERIC_SCALE | NUMBER | Scale of numeric return value. |
| PROCEDURE_LANGUAGE | VARCHAR | Programming language of the stored procedure. |
| PROCEDURE_DEFINITION | VARCHAR | Definition of the stored procedure. |
| CREATED | TIMESTAMP_LTZ | Creation time of the stored procedure. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this stored procedure. |
| EXTERNAL_ACCESS_INTEGRATIONS | VARCHAR | Names of [external access integrations](../../developer-guide/external-network-access/external-network-access-overview.md) specified by the procedure’s EXTERNAL_ACCESS_INTEGRATION parameter. |
| SECRETS | JSON map | Map of [secrets](../sql/create-secret.md) specified by the procedure’s SECRETS parameter, where map keys are secret variable names and map values are secret object names. |
| OWNER_ROLE_TYPE | VARCHAR | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The
  view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: REFERENTIAL_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/referential_constraints.md
section: Information Schema
---

# REFERENTIAL_CONSTRAINTS view

This Information Schema view displays a row for each FOREIGN KEY constraint that is defined for tables
in the specified (or current) database.

FOREIGN KEY constraints are used to enforce referential integrity. For more information, see
[Constraints](../constraints.md) and [Referential Integrity Constraints](../../user-guide/table-considerations.md).

To return information about other constraint types (as well as FOREIGN KEY constraints), query the [TABLE_CONSTRAINTS view](table_constraints.md).

See also:
:   [TABLE_CONSTRAINTS view](table_constraints.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint |
| UNIQUE_CONSTRAINT_CATALOG | VARCHAR | Database of the unique constraint referenced by the current constraint |
| UNIQUE_CONSTRAINT_SCHEMA | VARCHAR | Schema of the unique constraint referenced by the current constraint |
| UNIQUE_CONSTRAINT_NAME | VARCHAR | Name of the unique constraint referenced by the current constraint |
| MATCH_OPTION | VARCHAR | Match option for the constraint |
| UPDATE_RULE | VARCHAR | Update Rule for the current constraint |
| DELETE_RULE | VARCHAR | Delete Rule for the current constraint |
| COMMENT | VARCHAR | Comment for this constraint |
| CREATED | TIMESTAMP_LTZ | Creation time of the constraint |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Return information about all of the FOREIGN KEY constraints in the current database.

```sqlexample
SELECT * FROM INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS;
```

```output
+--------------------+-------------------+-----------------------------------------------------+---------------------------+--------------------------+-----------------------------------------------------+--------------+-------------+-------------+---------+-------------------------------+-------------------------------+
| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME                                     | UNIQUE_CONSTRAINT_CATALOG | UNIQUE_CONSTRAINT_SCHEMA | UNIQUE_CONSTRAINT_NAME                              | MATCH_OPTION | UPDATE_RULE | DELETE_RULE | COMMENT | CREATED                       | LAST_ALTERED                  |
|--------------------+-------------------+-----------------------------------------------------+---------------------------+--------------------------+-----------------------------------------------------+--------------+-------------+-------------+---------+-------------------------------+-------------------------------|
| HTABLES_DB         | HTABLES_SCHEMA    | SYS_CONSTRAINT_51118aaf-1ee6-4548-bc9a-f87e65d92528 | HTABLES_DB                | HTABLES_SCHEMA           | SYS_CONSTRAINT_aad16788-491a-4e68-b0e3-30d48a33a1c1 | FULL         | NO ACTION   | NO ACTION   | NULL    | 2024-09-19 13:51:37.355 -0700 | 2024-09-19 13:51:37.608 -0700 |
| HTABLES_DB         | HTABLES_SCHEMA    | SYS_CONSTRAINT_c97bfe9b-6098-4b8a-b796-e341071db72a | HTABLES_DB                | HTABLES_SCHEMA           | SYS_CONSTRAINT_0bd41d0f-11f7-4366-82a3-f03f31fcce7e | FULL         | NO ACTION   | NO ACTION   | NULL    | 2024-05-28 18:21:43.899 -0700 | 2024-05-28 18:21:44.268 -0700 |
+--------------------+-------------------+-----------------------------------------------------+---------------------------+--------------------------+-----------------------------------------------------+--------------+-------------+-------------+---------+-------------------------------+-------------------------------+
```

Join this view to the [TABLE_CONSTRAINTS view](table_constraints.md) to get the names of referencing tables that have FOREIGN KEY constraints:

```sqlexample
SELECT tc.constraint_catalog, tc.constraint_schema, tc.constraint_name, tc.table_name, tc.constraint_type, tc.enforced
  FROM INFORMATION_SCHEMA.TABLE_CONSTRAINTS tc
    JOIN INFORMATION_SCHEMA.REFERENTIAL_CONSTRAINTS rc ON tc.constraint_name=rc.constraint_name;
```

```output
+--------------------+-------------------+-----------------------------------------------------+------------+-----------------+----------+
| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME                                     | TABLE_NAME | CONSTRAINT_TYPE | ENFORCED |
|--------------------+-------------------+-----------------------------------------------------+------------+-----------------+----------|
| HTABLES_DB         | HTABLES_SCHEMA    | SYS_CONSTRAINT_51118aaf-1ee6-4548-bc9a-f87e65d92528 | HTFK       | FOREIGN KEY     | YES      |
| HTABLES_DB         | HTABLES_SCHEMA    | SYS_CONSTRAINT_c97bfe9b-6098-4b8a-b796-e341071db72a | HT619      | FOREIGN KEY     | YES      |
+--------------------+-------------------+-----------------------------------------------------+------------+-----------------+----------+
```

---
title: REPLICATION_DATABASES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/replication_databases.md
section: Information Schema
---

# REPLICATION_DATABASES view

This Information Schema view displays a row for each primary and secondary database (i.e. database for which replication has been enabled) in your organization.

> **Note:**
>
> This view uses Snowflake terminology of “database”, whereas other Information Schema views use the standard INFORMATION_SCHEMA terminology of “catalog”. The two terms have the same meaning.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account that stores the database is located. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake Region where the account that stores the database is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| ACCOUNT_NAME | VARCHAR | Name of the account in which the database is stored. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| COMMENT | VARCHAR | Comment for the database. |
| CREATED | TIMESTAMP_LTZ | Date and time when the database was created. |
| IS_PRIMARY | VARCHAR | Whether the database is a primary database; otherwise, is a secondary database. |
| PRIMARY | VARCHAR | Name of the primary database. |
| REPLICATION_ALLOWED_TO_ACCOUNTS | VARCHAR | Where `IS_PRIMARY` is TRUE, shows the fully-qualified names of accounts where replication has been enabled for this primary database. |
| FAILOVER_ALLOWED_TO_ACCOUNTS | VARCHAR | Where `IS_PRIMARY` is TRUE, shows the fully-qualified names of accounts where failover has been enabled for this primary database. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.

---
title: REPLICATION_GROUPS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/replication_groups.md
section: Information Schema
---

# REPLICATION_GROUPS view

This Information Schema view displays a row for each primary and secondary replication and/or failover group in your organization.

> **Note:**
>
> This view uses Snowflake terminology of “database”, whereas other Information Schema views use the standard INFORMATION_SCHEMA
> terminology of “catalog”. The two terms have the same meaning.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account that stores the replication or failover group is located. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake Region where the account is located. A Snowflake Region is a distinct location within a cloud platform region that is isolated from other Snowflake Regions. A Snowflake Region can be either multi-tenant or single-tenant (for a Virtual Private Snowflake account). |
| CREATED_ON | TIMESTAMP_LTZ | Date and time replication or failover group was created. |
| ACCOUNT_NAME | VARCHAR | Name of the account. |
| NAME | VARCHAR | Name of the replication or failover group. |
| TYPE | VARCHAR | Type of group. Valid values are `REPLICATION` or `FAILOVER`. |
| COMMENT | VARCHAR | Comment string. |
| IS_PRIMARY | VARCHAR | Indicates whether the replication or failover group is the primary group. |
| PRIMARY | VARCHAR | Name of the primary group. |
| OBJECT_TYPES | VARCHAR | List of specified object types enabled for replication (and failover in the case of a `FAILOVER` group). |
| ALLOWED_INTEGRATION_TYPES | VARCHAR | List of integration types that are enabled for replication. Snowflake always includes this column in the output even if integrations were not specified in the CREATE *<object>* or ALTER *<object>* command. |
| ALLOWED_ACCOUNTS | VARCHAR | List of accounts enabled for replication and failover. |
| ORGANIZATION_NAME | VARCHAR | Name of your Snowflake organization. |
| ACCOUNT_LOCATOR | VARCHAR | Account locator in a region. |
| REPLICATION_SCHEDULE | VARCHAR | Scheduled interval for refresh; NULL if no replication schedule is set. |
| SECONDARY_STATE | VARCHAR | Current state of scheduled refresh. Valid values are `started` or `suspended`. NULL if no replication schedule is set. |
| NEXT_SCHEDULED_REFRESH | TIMESTAMP_LTZ | Date and time of the next scheduled refresh. |
| OWNER | VARCHAR | Name of the role with the OWNERSHIP privilege on the replication or failover group. NULL if the replication or failover group is in a different region. |
| IS_LISTING_AUTO_FULFILLMENT_GROUP | BOOLEAN | TRUE if the replication group is used for [Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md). FALSE otherwise. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.

---
title: SCHEMATA view
source: https://docs.snowflake.com/en/sql-reference/info-schema/schemata.md
section: Information Schema
---

# SCHEMATA view

This Information Schema view displays a row for each schema in the specified (or current) database, including the INFORMATION_SCHEMA schema itself.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CATALOG_NAME | VARCHAR | Database that the schema belongs to |
| SCHEMA_NAME | VARCHAR | Name of the schema |
| SCHEMA_OWNER | VARCHAR | Name of the role that owns the schema |
| IS_TRANSIENT | VARCHAR | Whether this is a transient schema |
| IS_MANAGED_ACCESS | VARCHAR | Whether the schema is a managed access schema |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel |
| DEFAULT_CHARACTER_SET_CATALOG | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_CHARACTER_SET_NAME | VARCHAR | Not applicable for Snowflake. |
| SQL_PATH | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Creation time of the schema |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this schema |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SEMANTIC_DIMENSIONS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_dimensions.md
section: Information Schema
---

# SEMANTIC_DIMENSIONS view

This Information Schema view displays a row for each dimension in a semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_DIMENSIONS view (Account Usage)](../account-usage/semantic_dimensions.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_catalog` | VARCHAR | Database to which the semantic view belongs. |
| `semantic_view_schema` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `table_name` | VARCHAR | Name of the semantic table the dimension belongs to. |
| `name` | VARCHAR | Name of the dimension. |
| `data_type` | VARCHAR | Data type of the dimension expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the dimension. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the dimension. |
| `comment` | VARCHAR | Description of the dimension. |
| `cortex_search_service_database_name` | VARCHAR | Name of the database containing the [Cortex Search Service that the dimension uses](../../user-guide/views-semantic/sql.md). |
| `cortex_search_service_schema_name` | VARCHAR | Name of the schema containing the Cortex Search Service that the dimension uses. |
| `cortex_search_service_name` | VARCHAR | Name of the Cortex Search Service that the dimension uses. |
| `cortex_search_service_column_name` | VARCHAR | Name of the column that the Cortex Search Service allows you to search on, if the dimension uses a Cortex Search Service. |

---
title: SEMANTIC_FACTS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_facts.md
section: Information Schema
---

# SEMANTIC_FACTS view

This Information Schema view displays a row for each fact in a semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_FACTS view (Account Usage)](../account-usage/semantic_facts.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_catalog` | VARCHAR | Database to which the semantic view belongs. |
| `semantic_view_schema` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `table_name` | VARCHAR | Name of the semantic table the fact belongs to. |
| `name` | VARCHAR | Name of the fact. |
| `data_type` | VARCHAR | Data type of the fact expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the fact. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the fact. |
| `comment` | VARCHAR | Description of the fact. |

## Access control requirements

[Private facts](../../user-guide/views-semantic/sql.md) are included only if you are using a role that has been
[granted the REFERENCES or OWNERSHIP privilege on the semantic view](../../user-guide/views-semantic/sql.md).

Otherwise, the view lists only the public facts.

---
title: SEMANTIC_METRICS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_metrics.md
section: Information Schema
---

# SEMANTIC_METRICS view

This Information Schema view displays a row for each metric in a semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_METRICS view (Account Usage)](../account-usage/semantic_metrics.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_catalog` | VARCHAR | Database to which the semantic view belongs. |
| `semantic_view_schema` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `table_name` | VARCHAR | Name of the semantic table the metric belongs to. |
| `name` | VARCHAR | Name of the metric. |
| `data_type` | VARCHAR | Data type of the metric expression. |
| `expression` | VARCHAR | The SQL expression used to calculate the metric. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the metric. |
| `comment` | VARCHAR | Description of the metric. |

## Access control requirements

[Private metrics](../../user-guide/views-semantic/sql.md) are included only if you are using a role that has been
[granted the REFERENCES or OWNERSHIP privilege on the semantic view](../../user-guide/views-semantic/sql.md).

Otherwise, the view lists only the public metrics.

---
title: SEMANTIC_RELATIONSHIPS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_relationships.md
section: Information Schema
---

# SEMANTIC_RELATIONSHIPS view

This Information Schema view displays a row for each relationship in a semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_RELATIONSHIPS view (Account Usage)](../account-usage/semantic_relationships.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_catalog` | VARCHAR | Database to which the semantic view belongs. |
| `semantic_view_schema` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `name` | VARCHAR | Name of the semantic relationship. |
| `table_name` | VARCHAR | Name of the semantic table referencing the other table. |
| `foreign_keys` | ARRAY(VARCHAR) | List of the names of the columns referring to the columns of the other table. |
| `ref_table_name` | VARCHAR | Name of the semantic table being referenced. |
| `ref_keys` | ARRAY(VARCHAR) | One of the following values:   * For relationships that represent [range joins](../../user-guide/views-semantic/sql.md), an array that contains   JSON-formatted strings for objects with the following keys:    + The `start_column` key specifies the name of the column that represents the start of the range.   + The `end_column` key specifies the name of the column that represents the end of the range.   + The `type` key is `RANGE`. * For relationships that represent [ASOF joins](../../user-guide/views-semantic/sql.md), an array that contains the   following elements:    + The name of the column in the first table.   + A JSON object with the following fields:      - `column`: Name of the column in the second table.     - `type`: `ASOF`.  * For other types of relationships, an array containing the name of the column in the other logical table in the relationship. |

---
title: SEMANTIC_TABLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_tables.md
section: Information Schema
---

# SEMANTIC_TABLES view

This Information Schema view displays a row for each logical table in a semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_TABLES view (Account Usage)](../account-usage/semantic_tables.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `semantic_view_catalog` | VARCHAR | Database to which the semantic view belongs. |
| `semantic_view_schema` | VARCHAR | Schema to which the semantic view belongs. |
| `semantic_view_name` | VARCHAR | Name of the semantic view. |
| `name` | VARCHAR | Name of the semantic table. |
| `base_table_catalog` | VARCHAR | Database to which the base table belongs. |
| `base_table_schema` | VARCHAR | Schema to which the base table belongs. |
| `base_table_name` | VARCHAR | Name of the base table. |
| `primary_keys` | ARRAY(VARCHAR) | List of the primary key columns of the table. |
| `synonyms` | ARRAY(VARCHAR) | List of the synonyms for the table. |
| `distinct_ranges` | ARRAY(OBJECT) | Array of OBJECT values, which describe the [constraints for the logical table containing the range](../../user-guide/views-semantic/sql.md). Each object contains the following key-value pairs:   * `constraint_name`: The name of the constraint. * `end_column`: The name of the column that represents the end of the range. * `start_column`: The name of the column that represents the start of the range. |
| `comment` | VARCHAR | Description of the semantic table. |

---
title: SEMANTIC_VIEWS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/semantic_views.md
section: Information Schema
---

# SEMANTIC_VIEWS view

This Information Schema view displays a row for each semantic view in the specified (or current) database.

See also:
:   [SEMANTIC_VIEWS view (Account Usage)](../account-usage/semantic_views.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `catalog` | VARCHAR | Database to which the semantic view belongs. |
| `schema` | VARCHAR | Schema to which the semantic view belongs. |
| `name` | VARCHAR | Name of the semantic view. |
| `owner` | VARCHAR | Owner of the semantic view. |
| `created` | TIMESTAMP_LTZ | Creation time of the view. |
| `comment` | VARCHAR | Description of the semantic view. |

---
title: SEQUENCES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/sequences.md
section: Information Schema
---

# SEQUENCES view

This Information Schema view displays a row for each sequence defined in the specified (or current) database.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| SEQUENCE_CATALOG | VARCHAR | Database that the sequence belongs to |
| SEQUENCE_SCHEMA | VARCHAR | Schema that the sequence belongs to |
| SEQUENCE_NAME | VARCHAR | Name of the sequence |
| SEQUENCE_OWNER | VARCHAR | Name of the role that owns the sequence |
| DATA_TYPE | VARCHAR | Data type of the sequence |
| NUMERIC_PRECISION | NUMBER | Numeric precision of the data type of the sequence |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision of the data type of the sequence |
| NUMERIC_SCALE | NUMBER | Scale of the data type of the sequence |
| START_VALUE | VARCHAR | Initial value of the sequence |
| MINIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| MAXIMUM_VALUE | VARCHAR | Not applicable for Snowflake. |
| NEXT_VALUE | VARCHAR | Next value that the sequence will produce |
| INCREMENT | VARCHAR | Increment of the sequence generator |
| CYCLE_OPTION | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Creation time of the sequence |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| ORDERED | VARCHAR | If `YES`, the sequence has the ORDER property. If `NO`, the sequence has the NOORDER property. |
| COMMENT | VARCHAR | Comment for this sequence |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SERVICES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/services.md
section: Information Schema
---

# SERVICES view

This view shows existing Snowpark Container Services services in the database.

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| SERVICE_CATALOG | TEXT | Database that the service belongs to. |
| SERVICE_SCHEMA | TEXT | Schema that the service belongs to. |
| SERVICE_NAME | TEXT | Name of the service. |
| SERVICE_OWNER | TEXT | Name of the role that owns the service. App instance name if in an app. |
| SERVICE_OWNER_ROLE_TYPE | TEXT | Type of the owner role. |
| COMPUTE_POOL_NAME | TEXT | Compute pool where the job was executed. |
| DNS_NAME | TEXT | DNS name associated with the service. |
| CURRENT_INSTANCES | NUMBER | The current number of instances for the service. |
| TARGET_INSTANCES | NUMBER | The target number of service instances that should be running as determined by Snowflake.  When the CURRENT_INSTANCES value is not equal to the TARGET_INSTANCES value, Snowflake is either in the process of shutting down or launching service instances.  For example, consider the following:   * Suppose you create a service with MIN_INSTANCES = 1 and MAX_INSTANCES = 3. While the service is running, Snowflake might   determine that one instance is not enough. In this case, the value of TARGET_INSTANCES will increase, indicating Snowflake is in the process of launching additional instances.  It’s also possible that the TARGET_INSTANCES value is less than the CURRENT_INSTANCES value, which indicates that Snowflake is in the process of reducing the number of running instances. * If you create services but the compute pool doesn’t have capacity for the minimum number of instances that you requested, the   value of TARGET_INSTANCES will be equal to the value of MIN_INSTANCES. The value of CURRENT_INSTANCES will be less than the value of TARGET_INSTANCES. |
| MIN_READY_INSTANCES | INT | Minimum service instances that must be ready for Snowflake to consider the service is ready to process requests. |
| MIN_INSTANCES | INT | Minimum instances for the service. |
| MAX_INSTANCES | INT | Maximum instances for the service. |
| AUTO_RESUME | BOOLEAN | Flag that determines if the service can be auto resumed. |
| QUERY_WAREHOUSE | TEXT | Name of the default query warehouse of the service. |
| CREATED | TIMESTAMP_LTZ | Creation time of the service. |
| LAST_ALTERED | TIMESTAMP_LTZ | Last altered time of the service. |
| LAST_RESUMED | TIMESTAMP_LTZ | Last resumed time of the service. |
| COMMENT | TEXT | Comment for this service. |
| IS_JOB | BOOLEAN | `true` if the service is a job service; `false` otherwise. |
| SPEC_DIGEST | VARCHAR | The unique and immutable identifier representing the service spec content.  To observe the changes to the value of the SPEC_DIGEST column over time, a service user might execute the SHOW SERVICES command periodically. If the service user notices a change in value, they can infer that the service was upgraded. |
| IS_UPGRADING | BOOLEAN | TRUE, if Snowflake is in the process of upgrading the service. |
| MANAGING_OBJECT_DOMAIN | VARCHAR | The domain of the managing object (for example, the domain of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |
| MANAGING_OBJECT_NAME | VARCHAR | The name of the managing object (for example, the name of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |

## Example

```sqlexample
SELECT *
FROM my_database.information_schema.services
WHERE service_name LIKE '%myservice_%';
```

---
title: SHARES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/shares.md
section: Information Schema
---

# SHARES view

This Information Schema view displays all outbound and inbound shares for which the current role has been granted access privileges. This view provides real time information with no latency of data.

## Columns

| Column | Data type | Description |
| --- | --- | --- |
| CREATED_ON | TIMESTAMP_LTZ | The timestamp when the share was created. |
| KIND | VARCHAR | The kind of the share, outbound or inbound. |
| OWNER_ACCOUNT | VARCHAR | The owner account of the share. |
| NAME | VARCHAR | The name of the share. |
| DATABASE_NAME | VARCHAR | The name of the primary database associated with the share. This field is empty if no database has been granted to the share. |
| TO | VARCHAR | A comma-separated list of target accounts the share is shared with (outbound). This field is empty if the share has no target accounts. |
| OWNER | VARCHAR | The name of the role that owns the share. |
| COMMENT | VARCHAR | Comment associated with the share, if any. |
| LISTING_GLOBAL_NAME | VARCHAR | Global unique name of the listing associated with the share, if any. |
| SECURE_OBJECTS_ONLY | VARCHAR | Indicates whether the share can only have secure objects granted to it. |

## Usage notes

The view doesn’t capture deleted shares.

## Examples

Retrieve all shares in the current account:

```sqlexample
SELECT * FROM <any_database>.INFORMATION_SCHEMA.SHARES;
```

---
title: SNAPSHOT_POLICIES view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/info-schema/snapshot_policies.md
section: Information Schema
---

# SNAPSHOT_POLICIES view — *Deprecated*

This Information Schema view provides information on snapshot policies.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| SNAPSHOT_POLICY_NAME | VARCHAR | Name of the snapshot policy |
| SNAPSHOT_POLICY_SCHEMA | VARCHAR | Schema that the snapshot policy belongs to. |
| SNAPSHOT_POLICY_CATALOG | VARCHAR | Database that the snapshot policy belongs to. |
| SCHEDULE | VARCHAR | Schedule for snapshot creation. |
| EXPIRE_AFTER_DAYS | NUMBER | Days after snapshot creation when snapshot should be expired and automatically deleted. |
| HAS_RETENTION_LOCK | VARCHAR | Indicates whether the policy includes a retention lock. Y if the policy has a retention lock; N otherwise.  Retention lock protects snapshots from being deleted by anyone for the defined retention period. The retention lock also prevents the retention period from being decreased on the policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot policy. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot policy. Account role or Database role. |
| CREATED | TIMESTAMP_LTZ | Date and time when the snapshot policy was created. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the snapshot policy. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SNAPSHOT_SETS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/info-schema/snapshot_sets.md
section: Information Schema
---

# SNAPSHOT_SETS view — *Deprecated*

This Information Schema view provides information on snapshot sets.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| SNAPSHOT_SET_NAME | VARCHAR | Name of the snapshot set. |
| SNAPSHOT_SET_SCHEMA | VARCHAR | Schema that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG | VARCHAR | Database that the snapshot set belongs to. |
| OBJECT_KIND | VARCHAR | Type of object that the snapshot set is snapshotting. |
| OBJECT_NAME | VARCHAR | Name of object that the snapshot set is snapshotting. |
| OBJECT_SCHEMA | VARCHAR | Name of schema that contains the object being snapshotted by this snapshot set. |
| OBJECT_CATALOG | VARCHAR | Name of database that contains the object being snapshotted by this snapshot set. |
| SNAPSHOT_POLICY_NAME | VARCHAR | Name of snapshot policy attached to this snapshot set. |
| SNAPSHOT_POLICY_SCHEMA | VARCHAR | Name of the schema that contains the snapshot policy. |
| SNAPSHOT_POLICY_CATALOG | VARCHAR | Name of the database that contains the snapshot policy. |
| OWNER | VARCHAR | Name of the role that owns the snapshot set. |
| OWNER_ROLE_TYPE | VARCHAR | Type of role that owns the snapshot set. Account role or Database role. |
| CREATED | TIMESTAMP | Date and time when the snapshot set was created. |
| LAST_ALTERED | TIMESTAMP | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for the snapshot set. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: SNAPSHOTS view — Deprecated
source: https://docs.snowflake.com/en/sql-reference/info-schema/snapshots.md
section: Information Schema
---

# SNAPSHOTS view — *Deprecated*

This Information Schema view provides information on snapshots.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Snowflake-generated identifier of the snapshot.  Note: this is not the local ID, this is the globally unique UUID of the Snapshot |
| CREATED | TIMESTAMP_LTZ | Timestamp at which snapshot was created. |
| SNAPSHOT_SET_NAME | VARCHAR | Name of snapshot set that contains the snapshot. |
| SNAPSHOT_SET_SCHEMA | VARCHAR | Name of schema that the snapshot set belongs to. |
| SNAPSHOT_SET_CATALOG | VARCHAR | Name of database that the snapshot set belongs to. |
| EXPIRATION_SCHEDULED_FOR | TIMESTAMP_LTZ | Timestamp at which snapshot will be expired and deleted. |
| IS_UNDER_LEGAL_HOLD | BOOLEAN | Y if snapshot is under legal hold; N otherwise. |

## Usage notes

* Latency for the view may be up to 180 minutes (3 hours).

---
title: STAGES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/stages.md
section: Information Schema
---

# STAGES view

This Information Schema view displays a row for each stage defined in the specified (or current) database.

Stages are named objects that can be used for loading/unloading data. For more information, see [CREATE STAGE](../sql/create-stage.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| STAGE_CATALOG | VARCHAR | Database that the stage belongs to. |
| STAGE_SCHEMA | VARCHAR | Schema that the stage belongs to. |
| STAGE_NAME | VARCHAR | Name of the stage. |
| STAGE_URL | VARCHAR | Location of an external stage. |
| STAGE_REGION | VARCHAR | Region where the stage resides. |
| STAGE_TYPE | VARCHAR | Type of stage (`Internal Named`, or `External Named`). |
| STAGE_OWNER | VARCHAR | Name of the role that owns the stage. |
| COMMENT | VARCHAR | Comment for this stage. |
| CREATED | TIMESTAMP_LTZ | Creation time of the stage. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the MANAGE GRANTS privilege and consequently may show less
  information compared to a SHOW command when both are executed by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

---
title: TABLE_CONSTRAINTS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/table_constraints.md
section: Information Schema
---

# TABLE_CONSTRAINTS view

This Information Schema view displays a row for each table constraint that is defined in the specified (or current) database.
This view returns information about the following constraint types:

* PRIMARY KEY
* FOREIGN KEY
* UNIQUE

For general information about constraints, see [Constraints](../constraints.md).

See also:
:   [REFERENTIAL_CONSTRAINTS view](referential_constraints.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSTRAINT_CATALOG | VARCHAR | Database that the constraint belongs to |
| CONSTRAINT_SCHEMA | VARCHAR | Schema that the constraint belongs to |
| CONSTRAINT_NAME | VARCHAR | Name of the constraint |
| TABLE_CATALOG | VARCHAR | Name of the database of the current table |
| TABLE_SCHEMA | VARCHAR | Name of the schema of the current table |
| TABLE_NAME | VARCHAR | Name of the current table |
| CONSTRAINT_TYPE | VARCHAR | Type of the constraint |
| IS_DEFERRABLE | VARCHAR | Whether evaluation of the constraint can be deferred |
| INITIALLY_DEFERRED | VARCHAR | Whether evaluation of the constraint is deferrable and initially deferred |
| ENFORCED | VARCHAR | Whether the constraint is enforced |
| COMMENT | VARCHAR | Comment for this constraint |
| CREATED | TIMESTAMP_LTZ | Creation time of the constraint |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| RELY | VARCHAR | Whether a constraint in NOVALIDATE mode is taken into account during query rewrite. For details, see [Constraint properties](../sql/create-table-constraint.md). |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Example

Create a hybrid table with a multi-column PRIMARY KEY constraint and a comment on the constraint. Query the view to get information about the constraint.

```sqlexample
CREATE OR REPLACE HYBRID TABLE HT2PK
  (col1 NUMBER(38,0) NOT NULL,
  col2 NUMBER(38,0) NOT NULL,
  col3 VARCHAR(16777216),
  CONSTRAINT PKEY_2 PRIMARY KEY (col1, col2) COMMENT 'Primary key on two columns');

SELECT constraint_name, table_name, constraint_type, enforced, comment
  FROM INFORMATION_SCHEMA.TABLE_CONSTRAINTS
  WHERE COMMENT IS NOT NULL;
```

```output
+-----------------+------------+-----------------+----------+----------------------------+
| CONSTRAINT_NAME | TABLE_NAME | CONSTRAINT_TYPE | ENFORCED | COMMENT                    |
|-----------------+------------+-----------------+----------+----------------------------|
| PKEY_2          | HT2PK      | PRIMARY KEY     | YES      | Primary key on two columns |
+-----------------+------------+-----------------+----------+----------------------------+
```

Return a list of constraints on all tables that have names beginning with `HT`:

```sqlexample
SELECT constraint_name, table_name, constraint_type, enforced, comment
  FROM INFORMATION_SCHEMA.TABLE_CONSTRAINTS
  WHERE table_name LIKE 'HT%'
  ORDER BY table_name;
```

```output
+-----------------------------------------------------+------------------------+-----------------+----------+----------------------------+
| CONSTRAINT_NAME                                     | TABLE_NAME             | CONSTRAINT_TYPE | ENFORCED | COMMENT                    |
|-----------------------------------------------------+------------------------+-----------------+----------+----------------------------|
| SYS_CONSTRAINT_da2e8533-5501-4862-ae42-0a7798d578eb | HT01                   | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_5b3c6d13-f607-4ef6-a147-0026bae98c71 | HT1                    | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_d5887706-0e3b-4d5b-8787-e3327cdf4851 | HT100                  | PRIMARY KEY     | YES      | NULL                       |
| PK1                                                 | HT1PK                  | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_f1d1e153-cc32-477c-9a24-5c049e40ca0a | HT239                  | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_fe27c4f3-23f6-4091-92c4-5acd53cc5029 | HT239                  | UNIQUE          | YES      | NULL                       |
| PKEY_2                                              | HT2PK                  | PRIMARY KEY     | YES      | Primary key on two columns |
| SYS_CONSTRAINT_0bd41d0f-11f7-4366-82a3-f03f31fcce7e | HT616                  | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_6124310b-5f50-4009-a5c0-dc1b5a89b0bc | HT616                  | UNIQUE          | YES      | NULL                       |
| SYS_CONSTRAINT_bf3d76ba-de1e-4227-954f-9f53de777ed4 | HT619                  | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_c97bfe9b-6098-4b8a-b796-e341071db72a | HT619                  | FOREIGN KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_6e02d776-1759-449e-aece-467aaaefcfc8 | HTFK                   | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_51118aaf-1ee6-4548-bc9a-f87e65d92528 | HTFK                   | FOREIGN KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_fe27c4f3-23f6-4091-92c4-5acd53cc5029 | HTLIKE                 | UNIQUE          | YES      | NULL                       |
| SYS_CONSTRAINT_f1d1e153-cc32-477c-9a24-5c049e40ca0a | HTLIKE                 | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_aad16788-491a-4e68-b0e3-30d48a33a1c1 | HTPK                   | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_0bdff17e-e90a-4929-99c5-98e3597e3069 | HTT1                   | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_39e9110f-7a72-454e-bfe2-0a26eca97e7c | HT_PRECIP              | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_6acd8274-04e7-4b22-b9ae-29185b979219 | HT_SENSOR_DATA_DEVICE1 | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_39e9110f-7a72-454e-bfe2-0a26eca97e7c | HT_WEATHER             | PRIMARY KEY     | YES      | NULL                       |
| SYS_CONSTRAINT_843d828a-900d-409e-a57d-8f27b602eccf | HT_WEATHER             | PRIMARY KEY     | YES      | NULL                       |
+-----------------------------------------------------+------------------------+-----------------+----------+----------------------------+
```

---
title: TABLE_PRIVILEGES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/table_privileges.md
section: Information Schema
---

# TABLE_PRIVILEGES view

This Information Schema view displays a row for each table privilege that has been granted to each role in the specified (or current) database.

For more information about roles and privileges, see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

See also:
:   [APPLICABLE_ROLES view](applicable_roles.md) , [ENABLED_ROLES view](enabled_roles.md) , [OBJECT_PRIVILEGES view](object_privileges.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| GRANTOR | VARCHAR | Role who granted the table privilege |
| GRANTEE | VARCHAR | Role to whom the table privilege is granted |
| GRANTED_TO | VARCHAR | Type of object that has been granted the privilege |
| TABLE_CATALOG | VARCHAR | Database containing the table on which the privilege is granted |
| TABLE_SCHEMA | VARCHAR | Schema containing the table on which the privilege is granted |
| TABLE_NAME | VARCHAR | Name of the table on which the privilege is granted |
| PRIVILEGE_TYPE | VARCHAR | Type of the granted privilege |
| IS_GRANTABLE | VARCHAR | Whether the privilege was granted WITH GRANT OPTION |
| WITH_HIERARCHY | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Creation time of the privilege |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.
* The PRIVILEGE_TYPE column contains Snowflake privilege types. For example, the owner of a table has the OWNERSHIP privilege, rather than each of the separate privileges (e.g. SELECT, INSERT, DELETE,
  UPDATE).

---
title: TABLE_STORAGE_METRICS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/table_storage_metrics.md
section: Information Schema
---

# TABLE_STORAGE_METRICS view

This view displays table-level storage utilization information, which is used to calculate the storage billing for each table in the account, including tables that have been dropped, but are still incurring
storage costs.

In addition to table metadata, the view displays the number of storage bytes billed for each table. Snowflake breaks down the bytes into the following categories:

* Active bytes, representing data in the table that can be queried.
* Deleted bytes that are still accruing storage charges because they have not been purged yet from the system. These bytes are classified into the following sub-categories:

  > + Bytes in Time Travel (recently deleted, but still within the Time Travel retention period for the table).
  > + Bytes in Fail-safe (deleted bytes that are past the Time Travel retention period, but within the Fail-safe period for the table).
  > + Bytes retained for clones (deleted bytes that are no longer in Time Travel or Fail-safe, but are still retained because clones of the table reference the bytes).

In other words, rows are maintained in this view until the corresponding tables are no longer billed for any storage, regardless of various states that the data in the tables may be in (active, Time Travel, Fail-safe, or retained for clones).

For more details about data storage in tables, see [Data storage considerations](../../user-guide/tables-storage-considerations.md).

> **Note:**
>
> To query this view, you must use the ACCOUNTADMIN role. The view is visible to other views and can be queried, but the queries will return no rows.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_NAME | VARCHAR | Name of the table. |
| ID | NUMBER | Unique identifier for the table. |
| CLONE_GROUP_ID | NUMBER | Unique identifier for the oldest clone ancestor of this table. Same as ID if the table is not a clone. |
| IS_TRANSIENT | VARCHAR | ‘YES’ if table is transient or temporary, otherwise ‘NO’. Transient and temporary tables have no Fail-safe period. |
| ACTIVE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the active state for the table. For Iceberg table storage, active bytes aren’t billed to *Iceberg* tables. For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md). |
| TIME_TRAVEL_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Time Travel state for the table. |
| FAILSAFE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are in the Fail-safe state for the table. |
| RETAINED_FOR_CLONE_BYTES | NUMBER | Bytes owned by (and billed to) this table that are retained after deletion because they are referenced by one or more clones of this table, or by [WORM backups](../../user-guide/backups.md) that contain the table. |
| TABLE_CREATED | TIMESTAMP_LTZ | Date and time at which the table was created. |
| TABLE_DROPPED | TIMESTAMP_LTZ | Date and time at which the table was dropped. NULL if table has not been dropped. |
| TABLE_ENTERED_FAILSAFE | TIMESTAMP_LTZ | Date and time at which the table, if dropped, entered the Fail-safe state, or NULL. In this state, the table cannot be restored using UNDROP. |
| CATALOG_CREATED | TIMESTAMP_LTZ | Date and time at which the database containing the table was created. |
| CATALOG_DROPPED | TIMESTAMP_LTZ | Date and time at which the database containing the table was dropped. |
| SCHEMA_CREATED | TIMESTAMP_LTZ | Date and time at which the schema containing the table was created. |
| SCHEMA_DROPPED | TIMESTAMP_LTZ | Date and time at which the schema containing the table was dropped. |
| COMMENT | VARCHAR | Comment for the table. |

## Usage notes

* There may be a 1-2 hour delay in updating storage related statistics for `active_bytes`, `time_travel_bytes`,
  `failsafe_bytes`, and `retained_for_clone_bytes`.
* ID and CLONE_GROUP_ID:

  > + ID does not change for a table throughout its lifecycle, including if the table is renamed or dropped.
  > + CLONE_GROUP_ID is the ID of the oldest ancestor of a clone, including if the table has been dropped, but is still accruing storage costs. For example:
  >
  >   > - Table `t2` is cloned from `t1`.
  >   > - Table `t3` is cloned from `t2`.
  >
  >   All three tables list the ID for `t1` as their CLONE_GROUP_ID, even if `t1` is dropped and eventually purged from Snowflake.
  > + If ID and CLONE_GROUP_ID are identical, the table is not a clone.
  > + Storage bytes are always owned by, and therefore billed to, the table where the bytes were initially added. If the table is then cloned, storage metrics for these initial bytes never transfer
  >   to the clones, even if the bytes are deleted from the source table.
* Cloned tables share the same underlying storage (at the micro-partition level) until either the original table or cloned table is modified. With each change made to either table, the table takes
  “ownership” of the changed bytes.
* Dropped tables are displayed in the view as long as they still incur storage costs:

  > + Dropped tables retain their active storage metrics, indicating how many bytes will be active if the table is restored.
  > + Dropped tables in the Time Travel retention period for the table can be restored using UNDROP.
  > + Dropped tables in Fail-safe (TABLE_ENTERED_FAILSAFE not NULL) will potentially display NULL values in most columns, except for:
  >
  >   > ID columns:
  >   > :   ID , CLONE_GROUP_ID
  >   >
  >   > Bytes columns:
  >   > :   ACTIVE_BYTES , TIME_TRAVEL_BYTES , FAILSAFE_BYTES , RETAINED_FOR_CLONE_BYTES
  >
  >   These tables cannot be restored using UNDROP.
* When data is deleted from a table with a Time Travel retention period of 0 days, asynchronous background processes purge the active bytes
  or move them directly into Fail-safe storage, depending on the table type. This may take a short time to complete. During that time, the
  `TIME_TRAVEL_BYTES` column may contain a non-zero value even when the Time Travel retention period is 0 days.
* `FAILSAFE_BYTES` denotes bytes that have passed beyond Time Travel. All such bytes are billed to the current table.
* If multiple rows have the same value in the TABLE_NAME column, this indicates that multiple versions of the table exist. A version is created each time a table is dropped and a new table with the same
  name is created, including when a [CREATE OR REPLACE TABLE](../sql/create-table.md) command is issued on an existing table. Note that the current version will have a NULL value for the
  TABLE_DROPPED column; all other versions will have a timestamp value. This is important to note because each version of a table incurs storage costs associated with Time Travel (and Fail-safe, if the
  table is permanent).
* In some cases, active bytes might include bytes for data in a dropped column. For more information,
  see the [usage notes](../sql/alter-table.md) for ALTER TABLE.
* For Iceberg tables:

  + Snowflake doesn’t bill for [Iceberg table](../../user-guide/tables-iceberg.md) storage when the table uses
    an external volume that you manage. However, if the table uses
    [Snowflake Storage](../../user-guide/tables-iceberg-internal-storage.md) (`EXTERNAL_VOLUME = SNOWFLAKE_MANAGED`),
    Snowflake charges for the storage.
    For more information, see [Iceberg table billing](../../user-guide/tables-iceberg.md).
  + If the table is externally managed,
    this view might display inaccurate storage utilization information.

---
title: TABLES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/tables.md
section: Information Schema
---

# TABLES view

This Information Schema view displays a row for each table and view in the specified (or current) database, including the views in the INFORMATION_SCHEMA schema itself.

See also:
:   [COLUMNS view](columns.md) , [VIEWS view](views.md) , [TABLES view](../account-usage/tables.md) (Account Usage)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | VARCHAR | Database that the table belongs to. |
| TABLE_SCHEMA | VARCHAR | Schema that the table belongs to. |
| TABLE_NAME | VARCHAR | Name of the table. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the table. |
| TABLE_TYPE | VARCHAR | Indicates the table type. Valid values are `BASE TABLE`, `TEMPORARY TABLE`, `EXTERNAL TABLE`, `EVENT TABLE`, `VIEW`, or `MATERIALIZED VIEW`. |
| IS_TRANSIENT | VARCHAR | Indicates whether this is a transient table. |
| CLUSTERING_KEY | VARCHAR | Clustering key for the table. |
| ROW_COUNT | NUMBER | Number of rows in the table. |
| BYTES | NUMBER | Number of bytes accessed by a scan of the table. |
| RETENTION_TIME | NUMBER | Number of days that historical data is retained for Time Travel. |
| SELF_REFERENCING_COLUMN_NAME | VARCHAR | Not applicable for Snowflake. |
| REFERENCE_GENERATION | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_CATALOG | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_SCHEMA | VARCHAR | Not applicable for Snowflake. |
| USER_DEFINED_TYPE_NAME | VARCHAR | Not applicable for Snowflake. |
| IS_INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_TYPED | VARCHAR | Not applicable for Snowflake. |
| COMMIT_ACTION | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Creation time of the table. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| AUTO_CLUSTERING_ON | BOOLEAN | Indicates whether automatic clustering is enabled for the table. |
| COMMENT | VARCHAR | Comment for this table. |
| IS_TEMPORARY | VARCHAR | Indicates whether this is a temporary table. Valid values are `YES` and `NO`. |
| IS_ICEBERG | VARCHAR | Indicates whether the table is an [Iceberg table](../../user-guide/tables-iceberg.md). Valid values are `YES` or `NO`. |
| IS_DYNAMIC | VARCHAR | Indicates whether the table is a [dynamic table](../../user-guide/dynamic-tables-about.md). Valid values are `YES` or `NO`. |
| IS_IMMUTABLE | VARCHAR | Indicates whether the table was created with the [READ ONLY](../sql/create-table.md) property. Valid values are `YES` or `NO`. |
| IS_HYBRID | VARCHAR | Indicates whether the table is a [hybrid table](../../user-guide/tables-hybrid.md). Valid values are `YES` or `NO`. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the
  MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are executed with a role that was
  granted the MANAGE GRANTS privilege.

  This behavior also applies to other account-level [privileges](../../user-guide/security-access-control-privileges.md) and Information
  Schema views for which there is a corresponding SHOW command.
* Querying the sum(bytes) for a table does not represent the total storage usage, because the amount does not include Time Travel and Fail-safe usage.
* The view does not include tables that have been dropped. To view dropped tables, use [SHOW TABLES](../sql/show-tables.md) instead.
* To view only tables in your queries, filter using a WHERE clause, e.g.:

  > `... WHERE table_schema != 'INFORMATION_SCHEMA'`
* Using the value in the LAST_ALTERED column for Time Travel is *not* recommended and can return unexpected results for the following
  reaons:

  + Time Travel can only be used to query historical data modified by a [DML operation](../../user-guide/data-time-travel.md).
  + The LAST_ALTERED column inludes both DML and DDL operations (see the next usage note).
  + For DML operations, the value in the LAST_ALTERED column is the timestamp at the beginning of the statement execution rather than
    the time of the commit of the transaction containing this statement.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.
* The LAST_ALTERED column does not necessarily indicate the last refreshed time for external tables.
  To retrieve the last refreshed time for an auto-refreshed external table, you can use the
  [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](../functions/system_external_table_pipe_status.md) function, which returns
  information such as the timestamp of the last file Snowflake has registered.

## Examples

Retrieve the size (in bytes) of all tables in all schemas in the `mydatabase` database:

```sqlexample
SELECT table_schema, SUM(bytes)
  FROM mydatabase.INFORMATION_SCHEMA.TABLES
  GROUP BY TABLE_SCHEMA;
```

---
title: TYPES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/types.md
section: Information Schema
---

# TYPES view

This Information Schema view displays a row for each [user-defined type](../data-types-user-defined.md)
defined in the specified or current database.

See also:
:   [TYPES view](../account-usage/types.md) (Account Usage) ,
    [TYPES view](../organization-usage/types.md) (Organization Usage)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| TYPE_CATALOG | VARCHAR | Database that contains the type. |
| TYPE_SCHEMA | VARCHAR | Schema that contains the type. |
| TYPE_NAME | VARCHAR | Name of the type. |
| TYPE_OWNER | VARCHAR | Name of the role that owns the type. |
| BASE_DATA_TYPE | VARCHAR | Underlying data type of the user-defined type. |
| CHARACTER_MAXIMUM_LENGTH | NUMBER | Maximum length in characters for VARCHAR types. |
| CHARACTER_OCTET_LENGTH | NUMBER | Maximum length in bytes for VARCHAR types. |
| NUMERIC_PRECISION | NUMBER | Numeric precision for NUMBER types. |
| NUMERIC_PRECISION_RADIX | NUMBER | Radix of the numeric precision for NUMBER types. |
| NUMERIC_SCALE | NUMBER | Numeric scale for NUMBER types. |
| DATETIME_PRECISION | NUMBER | Fractional seconds precision for TIMESTAMP types. |
| CHECK_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| DEFAULT_EXPRESSION | VARCHAR | Not applicable for Snowflake. |
| IS_NULLABLE_DEFAULT | VARCHAR | Not applicable for Snowflake. |
| COLLATION_NAME | VARCHAR | Not applicable for Snowflake. |
| CREATED | TIMESTAMP_LTZ | Creation time of the type. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| COMMENT | VARCHAR | Comment for this type. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view doesn’t
  honor the MANAGE GRANTS privilege and consequently might show less information compared to a SHOW command when both are executed
  by a user who holds the MANAGE GRANTS privilege.
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

## Examples

Retrieve all user-defined types in the `mydb` database:

```sqlexample
SELECT type_name, type_owner, base_data_type
  FROM mydb.INFORMATION_SCHEMA.TYPES;
```

Retrieve all user-defined types in a specific schema:

```sqlexample
SELECT type_name, type_owner, base_data_type
  FROM mydb.INFORMATION_SCHEMA.TYPES
  WHERE type_schema = 'MY_SCHEMA';
```

---
title: USAGE_PRIVILEGES view
source: https://docs.snowflake.com/en/sql-reference/info-schema/usage_privileges.md
section: Information Schema
---

# USAGE_PRIVILEGES view

In accordance with the ANSI standard, this view displays a row for each privilege defined for sequences in the specified (or current) database.

To view privileges on other types of objects, use the [OBJECT_PRIVILEGES view](object_privileges.md) view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| GRANTOR | VARCHAR | Role who granted the usage privilege |
| GRANTEE | VARCHAR | Role to whom the usage privilege is granted |
| GRANTED_TO | VARCHAR | Type of object that has been granted the privilege |
| OBJECT_CATALOG | VARCHAR | Database containing the object on which the privilege is granted |
| OBJECT_SCHEMA | VARCHAR | Schema containing the object on which the privilege is granted |
| OBJECT_NAME | VARCHAR | Name of the object on which the privilege is granted |
| OBJECT_TYPE | VARCHAR | Type of the object on which the privilege is granted |
| PRIVILEGE_TYPE | VARCHAR | Type of the granted privilege |
| IS_GRANTABLE | VARCHAR | Whether the privilege was granted WITH GRANT OPTION |
| CREATED | TIMESTAMP_LTZ | Creation time of the privilege |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges.

---
title: VIEWS view
source: https://docs.snowflake.com/en/sql-reference/info-schema/views.md
section: Information Schema
---

# VIEWS view

This Information Schema view displays a row for each view in the specified (or current) database, including the INFORMATION_SCHEMA views for the database.

See also:
:   [TABLES view](tables.md)

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| TABLE_CATALOG | VARCHAR | Database that the view belongs to. |
| TABLE_SCHEMA | VARCHAR | Schema that the view belongs to. |
| TABLE_NAME | VARCHAR | Name of the view. |
| TABLE_OWNER | VARCHAR | Name of the role that owns the view. |
| VIEW_DEFINITION | VARCHAR | Text of the view’s query expression. |
| CHECK_OPTION | VARCHAR | Not applicable for Snowflake. |
| IS_UPDATABLE | VARCHAR | Not applicable for Snowflake. |
| INSERTABLE_INTO | VARCHAR | Not applicable for Snowflake. |
| IS_SECURE | VARCHAR | Specifies whether the view is secure. |
| CREATED | TIMESTAMP_LTZ | Creation time of the view. |
| LAST_ALTERED | TIMESTAMP_LTZ | Date and time the object was last altered by a DML, DDL, or background metadata operation. See Usage Notes. |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view.  All supported table/view DDL operations update this field:   * { CREATE | ALTER | DROP | UNDROP } TABLE * { CREATE | ALTER | DROP } VIEW   All ALTER TABLE operations update this field, including setting or unsetting a table parameter (for example, COMMENT, DATA_RETENTION_TIME, etc.) and changes to table columns (ADD / MODIFY / RENAME / DROP).  For more information, see the Usage Notes. |
| LAST_DDL_BY | VARCHAR | The current username for the user who executed the last DDL operation. If the user has been dropped, shows `DROPPED_USER(<id>)`.  For dropped users, you can join the `<id>` with the USER_ID column in the USERS view of the ACCOUNT_USAGE or ORGANIZATION_USAGE schema. |
| COMMENT | VARCHAR | Comment for this view. |

## Usage notes

* The view only displays objects for which the current role for the session has been granted access privileges. The view does not honor the
  MANAGE GRANTS privilege and consequently may show less information compared to a SHOW command when both are executed with a role that was
  granted the MANAGE GRANTS privilege.

  This behavior also applies to other account-level [privileges](../../user-guide/security-access-control-privileges.md) and Information
  Schema views for which there is a corresponding SHOW command.
* To remove the INFORMATION_SCHEMA views from your queries, filter using a WHERE clause, e.g.:

  > `... WHERE table_schema != 'INFORMATION_SCHEMA'`
* The LAST_ALTERED column is updated when the following operations are performed on an object:

  + DDL operations.
  + DML operations (for tables only). This column is updated even when no rows are affected by the DML statement.
  + Background maintenance operations on metadata performed by Snowflake.

  For views and tables, use the LAST_DDL column for the last modification time for an object.
* The value in the LAST_DDL column is updated as follows:

  > + When a table or view is created, the LAST_DDL timestamp is the same as the CREATED timestamp.
  > + When a table or view is dropped, the LAST_DDL timestamp is the same as the DELETED timestamp.
  > + Last DDL data is not available for operations that occurred before the columns were
  >   [added](../../release-notes/bcr-bundles/2023_01/bcr-891.md). The new DDL fields contain `null` until a DDL operation is executed.
  > + For replicated databases, the LAST_DDL and LAST_DDL_BY fields are only updated for objects in the primary database. After failover, the
  >   LAST_DDL and LAST_DDL_BY fields are updated for DDL operations for the tables and views in the newly promoted primary database. These
  >   fields will remain unchanged for objects in the now secondary database.
  > + For objects in secondary databases that are newly created during a refresh operation, these fields are `null`.

## SQL Classes

Reference for Snowflake ML classes: FORECAST, ANOMALY_DETECTION, CLASSIFICATION, and more.

---
title: <budget_name>!ADD_CUSTOM_ACTION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_custom_action.md
section: SQL Classes
---

# <budget_name>!ADD_CUSTOM_ACTION

Associates a stored procedure with a budget so that the procedure is called when projected or actual spending reaches a specified
threshold. The procedure must be associated by [reference](../../../references.md).

For more information, see [Custom actions for budgets](../../../../user-guide/budgets/custom-actions.md).

## Syntax

```sqlsyntax
<budget_name>!ADD_CUSTOM_ACTION (
  { '<stored_procedure_reference>' | <reference_statement> },
  { <array_of_arguments> | <array_construct_statement> },
  [ { 'ACTUAL' | 'PROJECTED' }, ]
  <threshold> )
```

## Arguments

`'stored_procedure_reference'`
:   The serialized string representation that resolves to a procedure. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the procedure to be associated with the budget.

`array_of_arguments`
:   Array of arguments to pass to the stored procedure.

`array_construct_statement`
:   An [ARRAY_CONSTRUCT](../../../functions/array_construct.md) statement that returns an array constructed from zero, one, or more
    inputs.

`{ 'ACTUAL' | 'PROJECTED'}`
:   Controls whether an action is triggered based on the actual or projected spend.

    `'ACTUAL'` — The stored procedure is called when the actual spend reaches the `threshold`.
    `'PROJECTED` — The stored procedure is called when spending is projected to reach the `threshold`.

    If omitted, defaults to `PROJECTED`.

`threshold`
:   Percentage of the budget limit. The stored procedure is called when Snowflake determines that actual or projected spending exceeds this
    percentage of the budget limit.

    Specify a number between 0 and 1,000, inclusive.

## Returns

Returns a VARCHAR value that indicates whether or not the procedure was successfully associated with the budget.

If the procedure could not be associated with the budget, the method returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the stored procedure.
* USAGE privilege on the stored procedure.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Associate the `alert_team` stored procedure with the `budget_db.sch1.my_budget` budget so that it is
called when spending is forecast to reach 75% of the budget limit:

```sqlexample
CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, string, string)', 'SESSION', 'USAGE'),
  ARRAY_CONSTRUCT('admin@example.com', 'Budget Alert', 'Spending at 75% of budget limit'),
  'PROJECTED',
  75);
```

Associate the `alert_team` stored procedure with the `budget_db.sch1.my_budget` budget so that it is called when
spending has reached 90% of the budget limit:

```sqlexample
CALL budget_db.sch1.my_budget!ADD_CUSTOM_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.alert_team(string, number)', 'SESSION', 'USAGE'),
  ARRAY_CONSTRUCT('Critical budget threshold', 90),
  'ACTUAL',
  90);
```

---
title: <budget_name>!ADD_NOTIFICATION_INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_notification_integration.md
section: SQL Classes
---

# <budget_name>!ADD_NOTIFICATION_INTEGRATION

Adds a queue or webhook [notification integration](../../../sql/create-notification-integration.md) to a
[custom budget or the account budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](get_notification_integrations.md),
    [<budget_name>!REMOVE_NOTIFICATION_INTEGRATION](remove_notification_integration.md)

## Syntax

```sqlsyntax
<budget_name>!ADD_NOTIFICATION_INTEGRATION( '<integration_name>' )
```

## Arguments

`'integration_name'`
:   The name of the queue or webhook notification integration to add to the budget.

## Returns

Returns a VARCHAR value that indicates whether or not the notification integration was successfully added.

* If the notification integration was added successfully, the method returns `Integration added successfully`.
* Otherwise, the method returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

In addition, you must grant the following privileges to the SNOWFLAKE application:

* The USAGE privilege on the notification integration.

If the notification integration is for a webhook that uses a secret object, you must also grant the following privileges to the
SNOWFLAKE application:

* The READ privilege on that secret.
* The USAGE privilege on the schema containing that secret.
* The USAGE privilege on the database containing that schema.

For information, see:

* [Setting up email notification](../../../../user-guide/budgets/notifications.md)
* [Setting up queue notification](../../../../user-guide/budgets/notifications.md)
* [Setting up webhook notification](../../../../user-guide/budgets/notifications.md)

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

The following example adds the notification integration `budgets_notification_integration` to the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!ADD_NOTIFICATION_INTEGRATION(
  'budgets_notification_integration',
);
```

---
title: <budget_name>!ADD_RESOURCE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_resource.md
section: SQL Classes
---

# <budget_name>!ADD_RESOURCE

Add an object to a [custom budget](../../../../user-guide/budgets.md). The object must be added by
[reference](../../../references.md).

See also:
:   [<budget_name>!REMOVE_RESOURCE](remove_resource.md),
    [<budget_name>!GET_LINKED_RESOURCES](get_linked_resources.md)

## Syntax

```sqlsyntax
<budget_name>!ADD_RESOURCE( { '<object_reference>' | <reference_statement> } )
```

## Arguments

`'object_reference'`
:   The serialized string representation that resolves to an object. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the object to be added to the budget.

> **Note:**
>
> If you want to add a Snowflake Native App to a budget, when you call SYSTEM$REFERENCE, specify `'DATABASE'` (not `'APPLICATION'`)
> for the `object_type` argument.
>
> See Adding a Snowflake Native App to a budget.

## Returns

Returns a VARCHAR value that indicates whether or not the object was successfully added to the budget. For example:

```output
Successfully added resource to resource group
```

If the object could not be added to the budget, the function returns an error message. See
[You can’t add or remove objects from a custom budget](../../../../user-guide/budgets/troubleshoot.md).

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the object being added (for schema objects).
* APPLYBUDGET privilege on the object being added.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only add objects to *custom budgets*.
* If you are directly adding individual objects, you can only add an object to one custom budget. In this case, if an object is currently
  included in one custom budget and you add that object to a second custom budget, Budgets removes the object from the first custom budget
  without issuing a warning.

  This behavior does not apply to using tags to add objects to budgets; an object with one or more tags can be
  included in multiple custom budgets if you are using tags to add the object to the budgets.
* You cannot create a reference for the SNOWFLAKE database; and you cannot add it
  to a budget.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

The following examples demonstrate how to add an object to a custom budget:

* Adding a table to a budget
* Adding a Snowflake Native App to a budget

### Adding a table to a budget

* The following example creates and returns a reference for the `t1` table:

  ```sqlexample
  SELECT SYSTEM$REFERENCE('TABLE', 't1', 'SESSION', 'APPLYBUDGET');
  ```

  The statement returns the reference in the output.

  ```output
  ENT_REF_TABLE_5862683050074_5AEB8D58FB3ACF249F2E35F365A9357C46BB00D7
  ```

  The following statement uses the string literal for this reference to add the `t1` table to the
  `budget_db.budget_schema.my_budget` budget:

  ```sqlexample
  CALL budget_db.budget_schema.my_budget!ADD_RESOURCE(
    'ENT_REF_TABLE_5862683050074_5AEB8D58FB3ACF249F2E35F365A9357C46BB00D7');
  ```
* The following example adds the `t2` table to the `budget_db.budget_schema.my_budget` budget, using a SQL statement
  to specify the reference:

  ```sqlexample
  CALL budget_db.budget_schema.my_budget!ADD_RESOURCE(
    SELECT SYSTEM$REFERENCE('TABLE', 't2', 'SESSION', 'APPLYBUDGET'));
  ```

### Adding a Snowflake Native App to a budget

The following example adds the `my_app` application to the `budget_db.budget_schema.my_budget` budget.

Note that when calling [SYSTEM$REFERENCE](../../../functions/system_reference.md), you must pass in `'DATABASE'` (not `'APPLICATION'`)
for the `object_type` argument.

```sqlexample
CALL budget_db.budget_schema.my_budget!ADD_RESOURCE(
  SELECT SYSTEM$REFERENCE('DATABASE', 'my_app', 'SESSION', 'APPLYBUDGET'));
```

## Error messages

For a list of common error messages and their causes and solutions, see [You can’t add or remove objects from a custom budget](../../../../user-guide/budgets/troubleshoot.md).

---
title: <budget_name>!ADD_RESOURCE_TAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_resource_tag.md
section: SQL Classes
---

# <budget_name>!ADD_RESOURCE_TAG

Adds a tag to a custom budget. All resources that are tagged with the specified tag-value pair are included in the budget.

> **Important:**
>
> This method is being deprecated. Use [<budget_name>!SET_RESOURCE_TAGS](set_resource_tags.md) instead.

## Syntax

```sqlsyntax
<budget_name>!ADD_RESOURCE_TAG(
    { '<tag_reference>' | <reference_statement> },
    '<tag_value>')
```

## Arguments

`'tag_reference'`
:   The serialized string representation that resolves to a tag. This string is the output of the
    [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the tag to be added to the budget.

`'tag_value'`
:   The value of the tag you are adding to the budget.

## Returns

Returns a VARCHAR value that indicates whether or not the tag was successfully added to the budget.

If the tag could not be added to the budget, the function returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the tag.
* APPLYBUDGET privilege on the tag being added.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only add tags to *custom budgets*.
* Snowflake doesn’t start showing usage for the added resources until the budget is refreshed, which can take up to six hours. If you want
  to view usage sooner, run the [REFRESH_USAGE](refresh_usage.md) method.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Retrieve the tag reference before calling the method to add a tag.
:   The following statement creates and returns a reference for the `cost_center` tag:

    ```sqlexample
    SELECT SYSTEM$REFERENCE(
      'TAG',
      'cost_mgmt_db.tags.cost_center',
      'SESSION',
      'APPLYBUDGET');
    ```

    The statement returns the reference in the output.

    ```output
    ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2
    ```

    The following statement uses the string literal for this reference to add the `cost_center = 'sales'` tag-value combination to the
    `budget_db.budget_schema.my_budget` budget:

    ```sqlexample
    CALL budget_db.budget_schema.my_budget!ADD_RESOURCE_TAG(
      'ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2',
      'sales');
    ```

Include the SYSTEM$REFERENCE function in the argument directly
:   After executing the following statement, the budget will track all objects that are tagged with the tag-value combination
    `team_tag = 'finance'`.

    > ```sqlexample
    > CALL budget_db.budget_schema.my_budget!ADD_RESOURCE_TAG(
    >     (SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')),
    >     'finance');
    > ```

---
title: <budget_name>!ADD_SHARED_RESOURCE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_shared_resource.md
section: SQL Classes
---

# <budget_name>!ADD_SHARED_RESOURCE

Adds a shared resource to a [custom budget](../../../../user-guide/budgets.md). When you add a shared resource, consumption is tracked only if the
resource is used by certain users. These users are tagged with a tag-value pair that was added to the budget using the
[SET_USER_TAGS](set_user_tags.md) method.

For more information, see [Using budgets for AI features (shared resources)](../../../../user-guide/budgets/budget-shared-resources.md).

## Syntax

```sqlsyntax
<budget_name>!ADD_SHARED_RESOURCE( '<domain>' [ , '<instance>' ] )
```

## Arguments

`'domain'`
:   The type of resource being added to the budget. Valid values:

    * `AI FUNCTION`
    * `CORTEX CODE`
    * `CORTEX AGENT`
    * `SNOWFLAKE INTELLIGENCE`

    Unless you specify a second argument, the budget tracks consumption for all resources within the specified domain.

`'instance'`
:   Optional. Specifies a specific resource within the selected `domain` to add to the budget.

    For domains that support instance-level selection (such as `AI FUNCTION` and `CORTEX CODE`), this argument allows you to track a specific function or interface.

    If you don’t specify a second argument, the budget tracks all instances within the domain.

    Examples:

    * AI Functions: `AI_CLASSIFY`, `AI_COMPLETE`
    * Cortex Code: `CORTEX_CODE_CLI`, `CORTEX_CODE_SNOWSIGHT`

    Instance-level selection is not applicable to all domains. For example, `CORTEX AGENT` and `SNOWFLAKE INTELLIGENCE` are currently tracked at the domain level only.

## Returns

Returns a VARCHAR value that indicates whether or not the resource was successfully added to the budget.

If the resource could not be added to the budget, the function returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the resource being added (for schema objects).

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only add shared resources to *custom budgets*.
* To verify the results of the method, call the [GET_BUDGET_SCOPE](get_budget_scope.md) method.
* When all objects of the specified entity type are added (for example, all AI Functions), you can’t add individual resources of that type.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Add all AI Functions to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('AI FUNCTION');
```

Add a specific AI function to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('AI FUNCTION', 'AI_CLASSIFY');
```

Add all Cortex Code workloads to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('CORTEX CODE');
```

Add the Cortex Code CLI workload to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('CORTEX CODE', 'CORTEX_CODE_CLI');
```

Add the Cortex Code Snowsight workload to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('CORTEX CODE', 'CORTEX_CODE_SNOWSIGHT');
```

Add Cortex Agent workloads to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('CORTEX AGENT');
```

> **Note:**
>
> Cortex Agent budgets are available at the domain level only. To track the cost of specific agents, use resource budgets through resource tags instead. For more information, see [Resource budgets for Cortex Agents](../../../../user-guide/snowflake-cortex/cortex-agents-resource-budgets.md).

Add all Snowflake Intelligence workloads to the budget:

```sqlexample
CALL finance_budget!ADD_SHARED_RESOURCE('SNOWFLAKE INTELLIGENCE');
```

> **Note:**
>
> Snowflake Intelligence budgets are available at the domain level only.

---
title: <budget_name>!ADD_TAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/add_tag.md
section: SQL Classes
---

# <budget_name>!ADD_TAG

Adds a tag to a custom budget. The tag must be added by [reference](../../../references.md).

> **Important:**
>
> This method has been deprecated. Use [<budget_name>!SET_RESOURCE_TAGS](set_resource_tags.md) instead.

## Syntax

```sqlsyntax
<budget_name>!ADD_TAG(
    { '<tag_reference>' | <reference_statement> },
    '<tag_value>')
```

## Arguments

`'tag_reference'`
:   The serialized string representation that resolves to a tag. This string is the output of the
    [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the tag to be added to the budget.

`'tag_value'`
:   The value of the tag you are adding to the budget.

## Returns

Returns a VARCHAR value that indicates whether or not the tag was successfully added to the budget.

If the tag could not be added to the budget, the function returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the tag.
* APPLYBUDGET privilege on the tag being added.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only add tags to *custom budgets*.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Retrieve the tag reference before calling the method to add a tag.
:   The following statement creates and returns a reference for the `cost_center` tag:

    ```sqlexample
    SELECT SYSTEM$REFERENCE(
      'TAG',
      'cost_mgmt_db.tags.cost_center',
      'SESSION',
      'APPLYBUDGET');
    ```

    The statement returns the reference in the output.

    ```output
    ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2
    ```

    The following statement uses the string literal for this reference to add the `cost_center = 'sales'` tag/value combination to the
    `budget_db.budget_schema.my_budget` budget:

    ```sqlexample
    CALL budget_db.budget_schema.my_budget!ADD_TAG(
      'ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2',
      'sales');
    ```

Include the SYSTEM$REFERENCE function in the argument directly
:   After executing the following statement, the budget will track all objects that are tagged with the tag/value combination
    `team_tag = 'finance'`.

    > ```sqlexample
    > CALL budget_db.budget_schema.my_budget!ADD_TAG(
    >     (SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')),
    >     'finance');
    > ```

---
title: <budget_name>!CONFIRM_CUSTOM_ACTIONS_ACCESS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/confirm_custom_actions_access.md
section: SQL Classes
---

# <budget_name>!CONFIRM_CUSTOM_ACTIONS_ACCESS

Validate that the stored procedures associated with [custom actions](../../../../user-guide/budgets/custom-actions.md) are still valid and that required access control privileges are still granted.

To fix any problems, see [Stored procedure requirements](../../../../user-guide/budgets/custom-actions.md).

See also:
:   [<budget_name>!ADD_CUSTOM_ACTION](add_custom_action.md), [<budget_name>!GET_CUSTOM_ACTIONS](get_custom_actions.md)

## Syntax

```sqlsyntax
<budget_name>!CONFIRM_CUSTOM_ACTIONS_ACCESS()
```

## Returns

The method returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| PROCEDURE_FQN | VARCHAR | Fully qualified name of the stored procedure. |
| IS_VALID | BOOLEAN | If TRUE, the stored procedure is still valid and the SNOWFLAKE application still has the required privileges on the procedure. |
| REASON | VARCHAR | Explanation of why the custom action is no longer valid. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contain the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Verify the stored procedures and permissions for budget `my_budget` in schema `budget_db.sch1`:

```sqlexample
CALL budget_db.sch1.my_budget!CONFIRM_CUSTOM_ACTIONS_ACCESS();
```

Verify the stored procedures and permissions for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!CONFIRM_CUSTOM_ACTIONS_ACCESS();
```

---
title: <budget_name>!GET_BUDGET_SCOPE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_budget_scope.md
section: SQL Classes
---

# <budget_name>!GET_BUDGET_SCOPE

Returns the resources and tags that have been added to a [custom budget](../../../../user-guide/budgets.md). Helps determine which resource
consumption is tracked by the budget.

The list does not include:

* Objects that were added automatically (for example, compute pools and warehouses created and owned by a Snowflake Native App).
* Objects that were added when a tag was added to the budget.

## Syntax

```sqlsyntax
<budget_name>!GET_BUDGET_SCOPE()
```

## Returns

The method returns a JSON object with the following keys:

`resource_tags`
:   The resource tags that have been added to the budget. Resources belong to the budget if they are tagged with these tags. Contains
    the following fields:

    `operator`
    :   The matching logic used for resource tags. Can be one of the following values:

        * `UNION`: A resource is included in the budget if it is tagged with *any* of the tag-value pairs in the `tags` array.
        * `INTERSECTION`: A resource must be tagged with *all* of the tag-value pairs in the `tags` array to be included in the budget.

    `tags`
    :   An array of tag objects, each with the following fields:

        `tagId`
        :   Internal identifier for the tag.

        `tagDatabase`
        :   Database that contains the tag.

        `tagSchema`
        :   Schema that contains the tag.

        `tagName`
        :   Name of the tag.

        `tagValues`
        :   Array of tag values associated with the tag.

`resources`
:   An array of resources that have been added directly to the budget. Each object contains the following fields:

    `resourceId`
    :   Internal identifier for the resource.

    `resourceName`
    :   Name of the resource.

    `resourceDomain`
    :   Domain of the resource (for example, `WAREHOUSE`, `DATABASE`, `TABLE`).

    `schemaName`
    :   Schema that contains the resource.

    `databaseName`
    :   Database that contains the resource.

## Access control requirements

The following minimum privileges and roles are required to view results for custom budgets:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Get all resources and tags that have been added to the `budget_db.budget_schema.my_budget` budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_BUDGET_SCOPE();
```

---
title: <budget_name>!GET_CONFIG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_config.md
section: SQL Classes
---

# <budget_name>!GET_CONFIG

View the configuration properties for a [budget](../../../../user-guide/budgets.md).

## Syntax

```sqlsyntax
<budget_name>!GET_CONFIG()
```

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NOTIFICATION_EMAIL | VARCHAR | The email address(es) that receive budget notifications. If there is more than one email address, the function returns a comma-separated list. |
| LAST_NOTIFICATION_TIME | NUMBER | UTC timestamp when the last notification was sent. If no notifications were sent out yet, the value in this column is `-1`. |
| SPEND_LIMIT | NUMBER | The spending limit (in credits) for the budget. |
| NOTIFICATION_MUTE_FLAG | BOOLEAN | TRUE if notifications are muted for the budget. |
| BUDGET_TYPE | VARCHAR | Type of budget. Valid values are: `ACCOUNT_ROOT_BUDGET` or `USER_BUDGET` |
| IS_ACTIVE | BOOLEAN | TRUE if the account budget has been activated.  *This column is only available for the account budget.* |
| ACTIVATION_TIMESTAMP | TIMESTAMP_TZ | Date and time the account budget was activated.  *This column is only available for the account budget.* |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the budget configuration properties for `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_CONFIG();
```

View the budget configuration properties for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_CONFIG();
```

---
title: <budget_name>!GET_CUSTOM_ACTIONS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_custom_actions.md
section: SQL Classes
---

# <budget_name>!GET_CUSTOM_ACTIONS

ListS all [custom actions](../../../../user-guide/budgets/custom-actions.md) associated with a budget.

See also:
:   [<budget_name>!ADD_CUSTOM_ACTION](add_custom_action.md), [<budget_name>!REMOVE_CUSTOM_ACTIONS](remove_custom_actions.md)

## Syntax

```sqlsyntax
<budget_name>!GET_CUSTOM_ACTIONS()
```

## Returns

The method returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ACTION_ID | VARCHAR | Unique identifier for the combination of the stored procedure fully qualified name, array of arguments, threshold, and trigger type. |
| PROCEDURE_FQN | VARCHAR | Fully qualified name of the stored procedure. |
| PROCEDURE_ARGS | ARRAY | Array of arguments passed to the stored procedure. |
| SPEND_STRATEGY | VARCHAR | Whether the custom action is triggered based on projected consumption or actual consumption. Valid values: `PROJECTED` or `ACTUAL`. |
| THRESHOLD | NUMBER | Percentage of the budget limit that triggers the stored procedure. |
| LAST_TRIGGER_ATTEMPT_TIME | TIMESTAMP_TZ | Last time the budget attempted to trigger the action, in UTC. |
| ADDED_TIMESTAMP | TIMESTAMP_TZ | Time when the action was added to the budget, in local time zone. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

List all custom actions for budget `my_budget` in schema `budget_db.sch1`:

```sqlexample
CALL budget_db.sch1.my_budget!GET_CUSTOM_ACTIONS();
```

List all custom actions for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_CUSTOM_ACTIONS();
```

---
title: <budget_name>!GET_CYCLE_START_ACTION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_cycle_start_action.md
section: SQL Classes
---

# <budget_name>!GET_CYCLE_START_ACTION

Returns the [user-defined action](../../../../user-guide/budgets/cycle-start-actions.md) that is triggered when the budget cycle restarts.

See also:
:   [<budget_name>!SET_CYCLE_START_ACTION](set_cycle_start_action.md), [<budget_name>!REMOVE_CYCLE_START_ACTION](remove_cycle_start_action.md)

## Syntax

```sqlsyntax
<budget_name>!GET_CYCLE_START_ACTION()
```

## Returns

The method returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ACTION_UUID | VARCHAR | Unique identifier for the cycle-start action. |
| PROCEDURE_FQN | VARCHAR | Fully qualified name of the stored procedure. |
| PROCEDURE_ARGS | ARRAY | Array of arguments passed to the stored procedure. |
| ADDED_TIMESTAMP | TIMESTAMP_TZ | Time when the action was added to the budget, in local time zone. |
| LAST_TRIGGERED_TIMESTAMP | TIMESTAMP_TZ | Last time the budget triggered the action, in UTC. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Get the cycle-start action for budget `my_budget` in schema `budget_db.sch1`:

```sqlexample
CALL budget_db.sch1.my_budget!GET_CYCLE_START_ACTION();
```

Get the cycle-start action for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_CYCLE_START_ACTION();
```

---
title: <budget_name>!GET_LINKED_RESOURCES
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_linked_resources.md
section: SQL Classes
---

# <budget_name>!GET_LINKED_RESOURCES

List the objects that we explicitly added to a [custom budget](../../../../user-guide/budgets.md).

The list does not include:

* Objects that were added automatically (for example, compute pools and warehouses created and owned by a Snowflake Native App).
* Objects that were added when a tag was added to the budget.

> **Important:**
>
> This method is being deprecated. Use [<budget_name>!GET_BUDGET_SCOPE](get_budget_scope.md) instead.

## Syntax

```sqlsyntax
<budget_name>!GET_LINKED_RESOURCES()
```

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RESOURCE_ID | NUMBER | Internal identifier for the object. |
| NAME | VARCHAR | Name of the object. |
| DOMAIN | VARCHAR | Domain of the object. Valid values:   * `COMPUTE_POOL` * `DATABASE` * `MATERIALIZED_VIEW` * `PIPE` * `SCHEMA` * `TABLE` * `TASK` * `WAREHOUSE`   **Note:** If the object is a Snowflake Native App, the value in this column is `DATABASE` (not `APPLICATION`). |
| SCHEMA_NAME | VARCHAR | Name of the schema that contains the object. NULL if the object is not a schema-level object. |
| DATABASE_NAME | VARCHAR | Name of the database that contains the object. NULL if the object is not a database-level or schema-level object. |

## Access control requirements

The following minimum privileges and roles are required to view results for custom budgets:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Get all objects that were added to the `budget_db.budget_schema.my_budget` budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_LINKED_RESOURCES();
```

---
title: <budget_name>!GET_LINKED_TAGS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_linked_tags.md
section: SQL Classes
---

# <budget_name>!GET_LINKED_TAGS

List the tags that have been added to a [custom budget](../../../../user-guide/budgets.md).

> **Important:**
>
> This method has been deprecated. Use [<budget_name>!GET_BUDGET_SCOPE](get_budget_scope.md) instead.

## Syntax

```sqlsyntax
<budget_name>!GET_LINKED_TAGS()
```

## Returns

The method returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| TAG_ID | NUMBER | System-generated identifier. |
| TAG_VALUE | VARCHAR | Value of the tag. |
| TAG_DATABASE | VARCHAR | Database that contains the tag. |
| TAG_SCHEMA | VARCHAR | Schema that contains the tag. |
| TAG_NAME | VARCHAR | Name of the tag. |

## Access control requirements

The following minimum privileges and roles are required to view results for custom budgets:

* ADMIN or VIEWER [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Get all tags that were added to the `budget_db.budget_schema.my_budget` budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_LINKED_TAGS();
```

---
title: <budget_name>!GET_MEASUREMENT_TABLE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_measurement_table.md
section: SQL Classes
---

# <budget_name>!GET_MEASUREMENT_TABLE

View the credit usage data collected by the [budget](../../../../user-guide/budgets.md) maintenance task. For more
information, see [Understand budget costs](../../../../user-guide/budgets/cost.md).

## Syntax

```sqlsyntax
<budget_name>!GET_MEASUREMENT_TABLE()
```

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.
* The following role is required to view results for the *account budget*:

  Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| MEASUREMENT_TIME | NUMBER | UTC timestamp when the measurement was taken. |
| SERVICE_TYPE | VARCHAR | [Type of service](../../../../user-guide/budgets.md) that is consuming credits, which can be one of the following:   * `AUTO_CLUSTERING` * `DATA_QUALITY_MONITORING` * `HYBRID_TABLE_REQUESTS` * `MATERIALIZED_VIEW` * `PIPE` * `QUERY_ACCELERATION` * `SEARCH_OPTIMIZATION` * `SERVERLESS_ALERTS` * `SERVERLESS_TASK` * `SNOWPIPE_STREAMING` * `WAREHOUSE_METERING` * `WAREHOUSE_METERING_READER` |
| CREDITS_SPENT | NUMBER | Number of credits spent. |
| UPDATED_TIME | NUMBER | UTC timestamp when the measurement was updated. |

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the credit usage data collected for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_MEASUREMENT_TABLE();
```

View the credit usage data collected for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_MEASUREMENT_TABLE();
```

---
title: <budget_name>!GET_NOTIFICATION_EMAIL
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_notification_email.md
section: SQL Classes
---

# <budget_name>!GET_NOTIFICATION_EMAIL

Returns the email address(es) configured to receive budget notifications for a [budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md),
    [<budget_name>!GET_NOTIFICATION_MUTE_FLAG](get_notification_mute_flag.md),
    [<budget_name>!SET_EMAIL_NOTIFICATIONS](set_email_notifications.md),
    [<budget_name>!SET_NOTIFICATION_MUTE_FLAG](set_notification_mute_flag.md)

## Syntax

```sqlsyntax
<budget_name>!GET_NOTIFICATION_EMAIL()
```

## Returns

* An email address or comma-separated list of email addresses.
* An empty string if the notification email address is not set.

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the email address(es) configured to receive notifications for `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_NOTIFICATION_EMAIL();
```

View the email address(es) configured to receive notifications for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_NOTIFICATION_EMAIL();
```

---
title: <budget_name>!GET_NOTIFICATION_INTEGRATION_NAME
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_notification_integration_name.md
section: SQL Classes
---

# <budget_name>!GET_NOTIFICATION_INTEGRATION_NAME

Returns the name of the email notification integration configured for a [budget](../../../../user-guide/budgets.md).

To get the names of the notification integrations for cloud provider queues and webhooks, call
[<budget_name>!GET_NOTIFICATION_INTEGRATIONS](get_notification_integrations.md) instead.

See also:
:   [<budget_name>!GET_NOTIFICATION_EMAIL](get_notification_email.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](get_notification_integrations.md),
    [<budget_name>!GET_NOTIFICATION_MUTE_FLAG](get_notification_mute_flag.md),
    [<budget_name>!SET_EMAIL_NOTIFICATIONS](set_email_notifications.md),
    [<budget_name>!SET_NOTIFICATION_MUTE_FLAG](set_notification_mute_flag.md)

## Syntax

```sqlsyntax
<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME()
```

## Returns

* The name of the notification integration.
* An empty string if the notification email address is not set.

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the name of the notification integration used by `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_NOTIFICATION_INTEGRATION_NAME();
```

View the name of the notification integration used by the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_NOTIFICATION_INTEGRATION_NAME();
```

---
title: <budget_name>!GET_NOTIFICATION_INTEGRATIONS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_notification_integrations.md
section: SQL Classes
---

# <budget_name>!GET_NOTIFICATION_INTEGRATIONS

Returns information about the queue and webhook notification integrations associated with a
[custom budget or the account budget](../../../../user-guide/budgets.md).

To get the name of the email notification integration associated with the budget, call
[<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md) instead.

See also:
:   [<budget_name>!ADD_NOTIFICATION_INTEGRATION](add_notification_integration.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md),
    [<budget_name>!REMOVE_NOTIFICATION_INTEGRATION](remove_notification_integration.md)

## Syntax

```sqlsyntax
<budget_name>!GET_NOTIFICATION_INTEGRATIONS()
```

## Returns

Returns tabular data containing information about the notification integrations associated with the budget. The data includes a
row for each queue or webhook notification integration associated with the budget. The rows include the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `integration_name` | VARCHAR | Name of the notification integration. |
| `last_notification_time` | NUMBER | UTC timestamp when the last notification was sent. If no notifications were sent out yet, the value in this column is `-1`. |
| `added_date` | DATE | Date when the notification integration was added to the budget. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the names of the queue and webhook notification integrations, if any, for `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_NOTIFICATION_INTEGRATIONS();
```

View the names of the queue and webhook notification integrations, if any, for the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_NOTIFICATION_INTEGRATIONS();
```

---
title: <budget_name>!GET_NOTIFICATION_MUTE_FLAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_notification_mute_flag.md
section: SQL Classes
---

# <budget_name>!GET_NOTIFICATION_MUTE_FLAG

View the status of the notification mute flag for a [budget](../../../../user-guide/budgets.md). If the mute
flag is set to TRUE, no notifications are sent about the budget’s spending limit.

See also:
:   [<budget_name>!GET_NOTIFICATION_EMAIL](get_notification_email.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md),
    [<budget_name>!SET_EMAIL_NOTIFICATIONS](set_email_notifications.md),
    [<budget_name>!SET_NOTIFICATION_MUTE_FLAG](set_notification_mute_flag.md)

## Syntax

```sqlsyntax
<budget_name>!GET_NOTIFICATION_MUTE_FLAG()
```

## Returns

* `TRUE` if notifications are disabled.
* `FALSE` if notifications are enabled.

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the state of the notification mute flag for `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_NOTIFICATION_MUTE_FLAG();
```

View the state of the notification mute flag for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_NOTIFICATION_MUTE_FLAG();
```

---
title: <budget_name>!GET_NOTIFICATION_THRESHOLD
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_notification_threshold.md
section: SQL Classes
---

# <budget_name>!GET_NOTIFICATION_THRESHOLD

Returns the notification threshold for a [budget](../../../../user-guide/budgets.md). Notifications are sent when Snowflake predicts that spending
will exceed the threshold, which is a percentage of the budget limit.

## Syntax

```sqlsyntax
<budget_name>!GET_NOTIFICATION_THRESHOLD();
```

## Returns

Returns a VARCHAR value containing the notification threshold percentage.

## Access control requirements

The following minimum privileges and roles are required to call this method for *custom budgets*:

> * Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
> * USAGE privilege on the database and schema that contains the budget instance.

The following minimum privileges and roles are required to call this method for the *account budget*:

> * Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Get the notification threshold for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_NOTIFICATION_THRESHOLD();
```

Get the notification threshold for the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_NOTIFICATION_THRESHOLD();
```

---
title: <budget_name>!GET_REFRESH_TIER
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_refresh_tier.md
section: SQL Classes
---

# <budget_name>!GET_REFRESH_TIER

Retrieves the current [refresh interval of a budget](../../../../user-guide/budgets.md). The budget refresh interval controls how long it takes for a
budget to be refreshed with the most current consumption data.

See also:
:   [<budget_name>!SET_REFRESH_TIER](set_refresh_tier.md)

## Syntax

```sqlsyntax
<budget_name>!GET_REFRESH_TIER()
```

## Returns

Returns one of the following VARCHAR values:

* `'TIER_1H'` — The budget refresh interval is one hour.
* `'TIER_6H'` — The budget refresh interval is up to 6.5 hours.

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the refresh interval for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_REFRESH_TIER();
```

View the refresh interval for the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!GET_REFRESH_TIER();
```

---
title: <budget_name>!GET_RESOURCE_TAGS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_resource_tags.md
section: SQL Classes
---

# <budget_name>!GET_RESOURCE_TAGS

Lists the tags that have been added to a [custom budget](../../../../user-guide/budgets.md) using the [ADD_RESOURCE_TAG](add_resource_tag.md)
method. Resources tagged with these tag-value pairs are included in the budget.

> **Important:**
>
> This method is being deprecated. Use [<budget_name>!GET_BUDGET_SCOPE](get_budget_scope.md) instead.

## Syntax

```sqlsyntax
<budget_name>!GET_RESOURCE_TAGS()
```

## Returns

The method returns the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| TAG_ID | NUMBER | System-generated identifier. |
| TAG_VALUE | VARCHAR | Value of the tag. |
| TAG_DATABASE | VARCHAR | Database that contains the tag. |
| TAG_SCHEMA | VARCHAR | Schema that contains the tag. |
| TAG_NAME | VARCHAR | Name of the tag. |

## Access control requirements

The following minimum privileges and roles are required to view results for custom budgets:

* ADMIN or VIEWER [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Get all tags that were added to the `budget_db.budget_schema.my_budget` budget using the ADD_RESOURCE_TAG method:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_RESOURCE_TAGS();
```

---
title: <budget_name>!GET_SERVICE_TYPE_USAGE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_service_type_usage.md
section: SQL Classes
---

# <budget_name>!GET_SERVICE_TYPE_USAGE

View the credit usage for a [budget](../../../../user-guide/budgets.md) by service type.

> **Important:**
>
> This method has been deprecated. Use [<budget_name>!GET_SERVICE_TYPE_USAGE_V2](get_service_type_usage_v2.md) instead.

## Syntax

```sqlsyntax
<budget_name>!GET_SERVICE_TYPE_USAGE( SERVICE_TYPE => '<service_type>' ,
                                      TIME_DEPART => '<time_interval>' ,
                                      USER_TIMEZONE => '<timezone>' ,
                                      TIME_LOWER_BOUND => <constant_expr> ,
                                      TIME_UPPER_BOUND => <constant_expr>
                                    )
```

## Arguments

`SERVICE_TYPE => service_type`
:   The service type used to limit results.

    Valid values:

    > [Type of service](../../../../user-guide/budgets.md) that is consuming credits, which can be one of the following:
    >
    > * `AUTO_CLUSTERING`
    > * `DATA_QUALITY_MONITORING`
    > * `HYBRID_TABLE_REQUESTS`
    > * `MATERIALIZED_VIEW`
    > * `PIPE`
    > * `QUERY_ACCELERATION`
    > * `SEARCH_OPTIMIZATION`
    > * `SERVERLESS_ALERTS`
    > * `SERVERLESS_TASK`
    > * `SNOWPIPE_STREAMING`
    > * `WAREHOUSE_METERING`
    > * `WAREHOUSE_METERING_READER`

`TIME_DEPART => time_interval`
:   Time interval used to delineate usage records. Each row displays service usage by the specified time interval.

    Valid values:

    * HOUR, hour
    * DAY, day
    * WEEK, week

`USER_TIMEZONE => timezone`
:   String specifying the user’s timezone. Budget metering is based on the UTC timezone.

`TIME_LOWER_BOUND => constant_expr`
:   The start of the time range during which the spending occurred.

`TIME_UPPER_BOUND => constant_expr`
:   The end of the time range during which the spending occurred.

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | TIMESTAMP_TZ | Date and time the usage occurred. |
| ENTITY_ID | NUMBER | Internal identifier for the object in the budget. |
| NAME | VARCHAR | Name of the metered object. |
| CREDITS_USED | FLOAT | Number of credits used. This is the sum of CREDITS_COMPUTE and CREDITS_CLOUD. |
| CREDITS_COMPUTE | FLOAT | Number of compute credits used. |
| CREDITS_CLOUD | FLOAT | Number of cloud service credits used. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* For `timezone`, you can specify a [time zone name](https://data.iana.org/time-zones/tzdb-2025b/zone1970.tab) or a [link name](https://data.iana.org/time-zones/tzdb-2025b/backward) from release
  2025b of the [IANA Time Zone Database](https://www.iana.org/time-zones) (e.g. `America/Los_Angeles`, `Europe/London`, `UTC`,
  `Etc/GMT`, etc.).

  > **Note:**
  > + Time zone names are case-sensitive and must be enclosed in single quotes (e.g. `'UTC'`).
  > + Snowflake does not support the majority of timezone [abbreviations](https://en.wikipedia.org/wiki/List_of_time_zone_abbreviations) (e.g. `PDT`, `EST`, etc.) because a
  >   given abbreviation might refer to one of several different time zones. For example, `CST` might refer to Central
  >   Standard Time in North America (UTC-6), Cuba Standard Time (UTC-5), and China Standard Time (UTC+8).
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the daily credits spent for each warehouse in the past week for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_SERVICE_TYPE_USAGE(
   SERVICE_TYPE => 'WAREHOUSE_METERING',
   TIME_DEPART => 'day',
   USER_TIMEZONE => 'UTC',
   TIME_LOWER_BOUND => dateadd('day', -7, current_timestamp()),
   TIME_UPPER_BOUND => current_timestamp()
);
```

## Error messages

To troubleshoot issues that can occur when you call this method, see [You can’t successfully call the GET_SERVICE_TYPE_USAGE method](../../../../user-guide/budgets/troubleshoot.md).

---
title: <budget_name>!GET_SERVICE_TYPE_USAGE_V2
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_service_type_usage_v2.md
section: SQL Classes
---

# <budget_name>!GET_SERVICE_TYPE_USAGE_V2

View the credit usage for a [budget](../../../../user-guide/budgets.md) by service type.

## Syntax

```sqlsyntax
<budget_name>!GET_SERVICE_TYPE_USAGE_V2( '<start_month>' , '<end_month>' )
```

## Arguments

`'start_month'`
:   Specifies the start of the time period for which you want to return usage information. Specified in the format `YYYY-MM`.

`'end_month'`
:   Specifies the end of the time period for which you want to return usage information. Specified in the format `YYYY-MM`.

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERVICE_TYPE | VARCHAR | Lists the [service](../../../../user-guide/budgets.md) that used credits. |
| ENTITY_TYPE | VARCHAR | Type of object associated with the credit consumption. All table-like objects such as tables, views, materialized views, and external tables have a value of `TABLE`. |
| ENTITY_ID | NUMBER | Internal identifier for the object in the budget. |
| NAME | VARCHAR | Name of the object associated with the credit consumption. |
| CREDITS_USED | FLOAT | Number of credits used. This is the sum of CREDITS_COMPUTE and CREDITS_CLOUD. |
| CREDITS_COMPUTE | FLOAT | Number of compute credits used. |
| CREDITS_CLOUD | FLOAT | Number of cloud service credits used. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Example

Return credits consumed by objects associated with the budget `my_budget` in January, February, and March of 2025:

```sqlexample
CALL db.sch1.my_budget!GET_SERVICE_TYPE_USAGE_V2('2025-01', '2025-03');
```

---
title: <budget_name>!GET_SHARED_RESOURCES
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_shared_resources.md
section: SQL Classes
---

# <budget_name>!GET_SHARED_RESOURCES

Lists the shared resources that have been added to a [custom budget](../../../../user-guide/budgets.md) using the
[ADD_SHARED_RESOURCE](add_shared_resource.md) method.

## Syntax

```sqlsyntax
<budget_name>!GET_SHARED_RESOURCES()
```

## Returns

The method returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RESOURCE_ID | NUMBER | System identifier of the resource. |
| NAME | VARCHAR | Name of the specific resource, or NULL if all resources of the domain type are included in the budget. |
| DOMAIN | VARCHAR | The type of resource. |

## Access control requirements

The following minimum privileges and roles are required to view results for custom budgets:

* ADMIN or VIEWER [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Get all shared resources that were added to the `budget_db.budget_schema.my_budget` budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_SHARED_RESOURCES();
```

---
title: <budget_name>!GET_SPENDING_HISTORY
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_spending_history.md
section: SQL Classes
---

# <budget_name>!GET_SPENDING_HISTORY

View the spending history for a [budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!GET_SERVICE_TYPE_USAGE](get_service_type_usage.md)

## Syntax

```sqlsyntax
<budget_name>!GET_SPENDING_HISTORY( [ TIME_LOWER_BOUND => <constant_expr> ,
                                      TIME_UPPER_BOUND => <constant_expr> ] )
```

## Optional arguments

`TIME_LOWER_BOUND => constant_expr,` . `TIME_UPPER_BOUND => constant_expr`
:   Time range (in UTC timestamp format) during which the spending occurred.

    You must set both lower and upper time bounds to limit the results by a time range.

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| MEASUREMENT_DATE | DATE | Date when the usage occurred. |
| SERVICE_TYPE | VARCHAR | [Type of service](../../../../user-guide/budgets.md) that is consuming credits, which can be one of the following:   * `AUTO_CLUSTERING` * `DATA_QUALITY_MONITORING` * `HYBRID_TABLE_REQUESTS` * `MATERIALIZED_VIEW` * `PIPE` * `QUERY_ACCELERATION` * `SEARCH_OPTIMIZATION` * `SERVERLESS_ALERTS` * `SERVERLESS_TASK` * `SNOWPIPE_STREAMING` * `WAREHOUSE_METERING` * `WAREHOUSE_METERING_READER` |
| CREDITS_SPENT | FLOAT | Number of credits used. |

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.
  + [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the spending history for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_SPENDING_HISTORY();
```

View the spending history for the last 7 days for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_SPENDING_HISTORY(
  TIME_LOWER_BOUND=>dateadd('days', -7, current_timestamp()),
  TIME_UPPER_BOUND=>current_timestamp()
);
```

---
title: <budget_name>!GET_SPENDING_LIMIT
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/get_spending_limit.md
section: SQL Classes
---

# <budget_name>!GET_SPENDING_LIMIT

View the spending limit for a [budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!SET_SPENDING_LIMIT](set_spending_limit.md)

## Syntax

```sqlsyntax
<budget_name>!GET_SPENDING_LIMIT()
```

## Returns

* The number of credits set as the spending limit for the budget.
* `-1` if the spending limit is not set.

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  + Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

View the spending limit for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!GET_SPENDING_LIMIT();
```

View the spending limit for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!GET_SPENDING_LIMIT();
```

---
title: <budget_name>!REFRESH_USAGE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/refresh_usage.md
section: SQL Classes
---

# <budget_name>!REFRESH_USAGE

Causes the budget to retrieve consumption data so that the budget can compare it to the spending limit without waiting for the next
automatic retrieval of data.

## Syntax

```sqlsyntax
<budget_name>!REFRESH_USAGE()
```

## Returns

Returns a VARCHAR value that indicates whether the usage was successfully refreshed.

## Access control requirements

The following minimum privileges and roles are required to call this method for custom budgets:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* It takes a few minutes for the budget to be refreshed with new usage data.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Retrieve consumption data for the `budget_db.budget_schema.my_budget` budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!REFRESH_USAGE();
```

---
title: <budget_name>!REMOVE_CUSTOM_ACTIONS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_custom_actions.md
section: SQL Classes
---

# <budget_name>!REMOVE_CUSTOM_ACTIONS

Remove one or more [custom actions](../../../../user-guide/budgets/custom-actions.md) from a budget.

See also:
:   [<budget_name>!ADD_CUSTOM_ACTION](add_custom_action.md), [<budget_name>!GET_CUSTOM_ACTIONS](get_custom_actions.md)

## Syntax

```sqlsyntax
<budget_name>!REMOVE_CUSTOM_ACTIONS()

<budget_name>!REMOVE_CUSTOM_ACTIONS( <threshold> )

<budget_name>!REMOVE_CUSTOM_ACTIONS( <threshold>, '<stored_procedure>' )
```

## Arguments

`threshold`
:   Threshold percentage at which custom actions are triggered. If you don’t specify a procedure name, all custom actions set for this threshold
    are removed.

`'stored_procedure'`
:   Fully qualified name of the stored procedure associated with the custom action. Snowflake removes all custom actions that match the
    specified stored procedure and threshold.

    > **Note:**
    >
    > When passing the fully qualified name of the procedure, use the `PROCEDURE_FQN` value from the output of the
    > [GET_CUSTOM_ACTIONS](get_custom_actions.md) method.

## Returns

Returns a VARCHAR value indicating the number of custom actions that were successfully removed.

## Access control requirements

The following privileges and roles are required to call this method for a budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove all custom actions from budget `my_budget` in schema `budget_db.sch1`:

```sqlexample
CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS();
```

Remove all custom actions that are triggered when consumption reaches 75% of the budget limit:

```sqlexample
CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS(75);
```

Remove the custom action that calls the `code_db.sch1.my_sp` stored procedure when consumption reaches 75% of the budget limit:

```sqlexample
CALL budget_db.sch1.my_budget!REMOVE_CUSTOM_ACTIONS(75, 'code_db.sch1.my_sp');
```

---
title: <budget_name>!REMOVE_CYCLE_START_ACTION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_cycle_start_action.md
section: SQL Classes
---

# <budget_name>!REMOVE_CYCLE_START_ACTION

Removes the [user-defined action](../../../../user-guide/budgets/cycle-start-actions.md) that is triggered when the budget cycle restarts.

See also:
:   [<budget_name>!SET_CYCLE_START_ACTION](set_cycle_start_action.md), [<budget_name>!GET_CYCLE_START_ACTION](get_cycle_start_action.md)

## Syntax

```sqlsyntax
<budget_name>!REMOVE_CYCLE_START_ACTION()
```

## Returns

Returns a VARCHAR value indicating whether the cycle start action was successfully removed.

## Access control requirements

The following privileges and roles are required to call this method for a budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove the cycle-start action from budget `my_budget` in schema `budget_db.sch1`:

```sqlexample
CALL budget_db.sch1.my_budget!REMOVE_CYCLE_START_ACTION();
```

---
title: <budget_name>!REMOVE_NOTIFICATION_INTEGRATION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_notification_integration.md
section: SQL Classes
---

# <budget_name>!REMOVE_NOTIFICATION_INTEGRATION

Removes a queue or webhook notification integration from a [custom budget or the account budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!ADD_NOTIFICATION_INTEGRATION](add_notification_integration.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](get_notification_integrations.md)

## Syntax

```sqlsyntax
<budget_name>!REMOVE_NOTIFICATION_INTEGRATION( '<integration_name>' )
```

## Arguments

`'integration_name'`
:   The name of the queue or webhook notification integration to remove from the budget.

## Returns

Returns a VARCHAR value that indicates whether or not the notification integration was successfully removed.

* If the notification integration was removed successfully, the method returns `Integration removed successfully`.
* Otherwise, the method returns an error message.

## Access control requirements

The following minimum privileges and roles are required to call this method on a budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove the notification integration `budgets_notification_integration` from custom budget `my_budget` in schema
`budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!REMOVE_NOTIFICATION_INTEGRATION(
  'budgets_notification_integration');
```

Remove the notification integration `budgets_notification_integration` from the account budget:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!REMOVE_NOTIFICATION_INTEGRATION(
  'budgets_notification_integration');
```

---
title: <budget_name>!REMOVE_RESOURCE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_resource.md
section: SQL Classes
---

# <budget_name>!REMOVE_RESOURCE

Remove an object from a [custom budget](../../../../user-guide/budgets.md). The object must be removed by
[reference](../../../references.md).

See also:
:   [<budget_name>!ADD_RESOURCE](add_resource.md),
    [<budget_name>!GET_LINKED_RESOURCES](get_linked_resources.md)

## Syntax

```sqlsyntax
<budget_name>!REMOVE_RESOURCE( { '<object_reference>' | <reference_statement> } )
```

## Arguments

`'object_reference'`
:   The serialized string representation that resolves to an object. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the object to be removed from the
    budget.

> **Note:**
>
> If you want to add a Snowflake Native App to a budget, when you call SYSTEM$REFERENCE, specify `'DATABASE'` (not `'APPLICATION'`)
> for the `object_type` argument.
>
> See Removing a Snowflake Native App from a budget.

## Returns

Returns a VARCHAR value that indicates whether or not the object was successfully removed from the budget. For example:

```output
Successfully removed resource from resource group
```

If the object could not be removed from the budget, the function returns an error message. See
[You can’t add or remove objects from a custom budget](../../../../user-guide/budgets/troubleshoot.md).

## Access control requirements

The following minimum privileges and roles are required to call this method on a *custom budget*:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.
* USAGE privilege on the database and schema that contain the object (for schema objects).
* APPLYBUDGET privilege on the object being removed.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

The following examples demonstrate how to remove an object from a custom budget:

* Removing a table from a budget
* Removing a Snowflake Native App from a budget

### Removing a table from a budget

* The following example creates and returns a reference for the `t1` table:

  ```sqlexample
  SELECT SYSTEM$REFERENCE('TABLE', 't1', 'SESSION', 'APPLYBUDGET');
  ```

  The statement returns the reference in the output.

  ```output
  ENT_REF_TABLE_5862683050074_5AEB8D58FB3ACF249F2E35F365A9357C46BB00D7
  ```

  The following statement uses the string literal for this reference to remove the `t1` table from the
  `budget_db.budget_schema.my_budget` budget:

  ```sqlexample
  CALL budget_db.budget_schema.my_budget!REMOVE_RESOURCE(
    'ENT_REF_TABLE_5862683050074_5AEB8D58FB3ACF249F2E35F365A9357C46BB00D7');
  ```
* The following example removes the `t2` table from the `budget_db.budget_schema.my_budget` budget, using a SQL
  statement to specify the reference:

  ```sqlexample
  CALL budget_db.budget_schema.my_budget!REMOVE_RESOURCE(
    SELECT SYSTEM$REFERENCE('TABLE', 't2', 'SESSION', 'APPLYBUDGET')
  ```

### Removing a Snowflake Native App from a budget

The following example removes the `my_app` application from the `budget_db.budget_schema.my_budget` budget.

Note that when calling [SYSTEM$REFERENCE](../../../functions/system_reference.md), you must pass in `'DATABASE'` (not `'APPLICATION'`)
for the `object_type` argument.

```sqlexample
CALL budget_db.budget_schema.my_budget!REMOVE_RESOURCE(
  SELECT SYSTEM$REFERENCE('DATABASE', 'my_app', 'SESSION', 'APPLYBUDGET'));
```

## Error messages

For a list of common error messages and their causes and solutions, see [You can’t add or remove objects from a custom budget](../../../../user-guide/budgets/troubleshoot.md).

---
title: <budget_name>!REMOVE_RESOURCE_TAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_resource_tag.md
section: SQL Classes
---

# <budget_name>!REMOVE_RESOURCE_TAG

Removes a tag-value combination from a custom budget. When this tag-value pair was added to the budget using the
[ADD_RESOURCE_TAG](add_resource_tag.md) method, all resources tagged with the pair were included in the budget. Removing the tag-value
pair removes the tagged resources from the budget.

> **Important:**
>
> This method is being deprecated. Use [<budget_name>!SET_RESOURCE_TAGS](set_resource_tags.md) instead.

## Syntax

```sqlsyntax
<budget_name>!REMOVE_RESOURCE_TAG(
    { '<tag_reference>' | <reference_statement> },
    'tag_value' )
```

## Arguments

`'tag_reference'`
:   The serialized string representation that resolves to an tag. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the tag to be removed from the
    budget.

`'tag_value'`
:   Specifies the value of the tag-value combination that you are removing from the budget.

    If the tag was added to the budget with a different value, the tag continues to be associated with the budget after removing this
    specific tag-value combination.

## Returns

Returns a VARCHAR value that indicates whether or not the tag-value combination was successfully removed from the budget.

If the tag could not be removed from the budget, the function returns an error message.

## Access control requirements

The following minimum privileges and roles are required to call this method on a *custom budget*:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.
* USAGE privilege on the database and schema that contain the tag.
* APPLYBUDGET privilege on the tag being removed.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Retrieve the tag reference before calling the method to remove the tag-value combination.
:   The following statement creates and returns a reference for the `cost_center` tag:

    ```sqlexample
    SELECT SYSTEM$REFERENCE(
      'TAG',
      'cost_mgmt_db.tags.cost_center',
      'SESSION',
      'APPLYBUDGET');
    ```

    The statement returns the reference in the output.

    ```output
    ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2
    ```

    The following statement uses the string literal for this reference to add the `cost_center = 'sales'` tag-value combination to the
    `budget_db.budget_schema.my_budget` budget:

    ```sqlexample
    CALL budget_db.budget_schema.my_budget!REMOVE_RESOURCE_TAG(
      'ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2',
      'sales');
    ```

Include the SYSTEM$REFERENCE function in the argument directly
:   After executing the following statement, the budget will no longer track objects that are tagged with the tag-value combination
    `team_tag = 'finance'`.

    > ```sqlexample
    > CALL budget_db.budget_schema.my_budget!REMOVE_RESOURCE_TAG(
    >     (SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')),
    >     'finance');
    > ```

---
title: <budget_name>!REMOVE_SHARED_RESOURCE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_shared_resource.md
section: SQL Classes
---

# <budget_name>!REMOVE_SHARED_RESOURCE

Removes a shared resource from a [custom budget](../../../../user-guide/budgets.md). Shared resources are added to the budget using the
[ADD_SHARED_RESOURCE](add_shared_resource.md) method.

## Syntax

```sqlsyntax
<budget_name>!REMOVE_SHARED_RESOURCE( '<domain>' [ , '<ai_function>' ] )
```

## Arguments

`'domain'`
:   The type of resource being removed from the budget. Valid values:

    * `AI FUNCTION`

    Unless you specify a second argument, the budget stops tracking consumption for all AI functions.

`'ai_function'`
:   Optional. When the `domain` is `AI FUNCTION`, specifies a specific AI function to remove from the budget.

## Returns

Returns a VARCHAR value that indicates whether or not the resource was successfully removed from the budget.

If the resource could not be removed from the budget, the function returns an error message.

## Access control requirements

The following minimum privileges and roles are required to call this method on a *custom budget*:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.
* USAGE privilege on the database and schema that contain the resource (for schema objects).

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove all AI Functions from the budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!REMOVE_SHARED_RESOURCE('AI FUNCTION');
```

Remove the AI_COMPLETE function from the budget:

```sqlexample
CALL budget_db.budget_schema.my_budget!REMOVE_SHARED_RESOURCE(
  'AI FUNCTION',
  (SELECT SYSTEM$REFERENCE('FUNCTION', 'AI_COMPLETE')));
```

---
title: <budget_name>!REMOVE_TAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/remove_tag.md
section: SQL Classes
---

# <budget_name>!REMOVE_TAG

Removes a tag/value combination from a custom budget. The tag must be removed by [reference](../../../references.md).

> **Important:**
>
> This method has been deprecated. Use [<budget_name>!SET_RESOURCE_TAGS](set_resource_tags.md) instead.

## Syntax

```sqlsyntax
<budget_name>!REMOVE_TAG(
    { '<tag_reference>' | <reference_statement> },
    'tag_value' )
```

## Arguments

`'tag_reference'`
:   The serialized string representation that resolves to an tag. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the tag to be removed from the
    budget.

`'tag_value'`
:   Specifies the value of the tag/value combination that you are removing from the budget.

    If the tag was added to the budget with a different value, the tag continues to be associated with the budget after removing this
    specific tag/value combination.

## Returns

Returns a VARCHAR value that indicates whether or not the tag/value combination was successfully removed from the budget.

If the tag could not be removed from the budget, the function returns an error message.

## Access control requirements

The following minimum privileges and roles are required to call this method on a *custom budget*:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contains the budget instance.
* USAGE privilege on the database and schema that contain the tag.
* APPLYBUDGET privilege on the tag being removed.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* This method can only be called on *custom budget* instances.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Retrieve the tag reference before calling the method to remove the tag/value combination.
:   The following statement creates and returns a reference for the `cost_center` tag:

    ```sqlexample
    SELECT SYSTEM$REFERENCE(
      'TAG',
      'cost_mgmt_db.tags.cost_center',
      'SESSION',
      'APPLYBUDGET');
    ```

    The statement returns the reference in the output.

    ```output
    ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2
    ```

    The following statement uses the string literal for this reference to add the `cost_center = 'sales'` tag/value combination to the
    `budget_db.budget_schema.my_budget` budget:

    ```sqlexample
    CALL budget_db.budget_schema.my_budget!REMOVE_TAG(
      'ENT_REF_TAG_10382726315710_8A8626AE765E29446C38A217CAD093FCC9A454C2',
      'sales');
    ```

Include the SYSTEM$REFERENCE function in the argument directly
:   After executing the following statement, the budget will no longer track objects that are tagged with the tag/value combination
    `team_tag = 'finance'`.

    > ```sqlexample
    > CALL budget_db.budget_schema.my_budget!REMOVE_TAG(
    >     (SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')),
    >     'finance');
    > ```

---
title: <budget_name>!SET_CYCLE_START_ACTION
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_cycle_start_action.md
section: SQL Classes
---

# <budget_name>!SET_CYCLE_START_ACTION

Associates a stored procedure with a budget so that the procedure is called when the budget cycle restarts. The procedure must be associated by [reference](../../../references.md).

For more information, see [Cycle-start actions for budgets](../../../../user-guide/budgets/cycle-start-actions.md).

## Syntax

```sqlsyntax
<budget_name>!SET_CYCLE_START_ACTION (
  { '<stored_procedure_reference>' | <reference_statement> },
  { <array_of_arguments> | <array_construct_statement> } )
```

## Arguments

`'stored_procedure_reference'`
:   The serialized string representation that resolves to a procedure. This string is the output of
    the [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

`reference_statement`
:   A [SYSTEM$REFERENCE](../../../functions/system_reference.md) statement that creates a reference for the procedure to be associated with the budget.

`array_of_arguments`
:   Array of arguments to pass to the stored procedure.

`array_construct_statement`
:   An [ARRAY_CONSTRUCT](../../../functions/array_construct.md) statement that returns an array constructed from zero, one, or more
    inputs.

## Returns

Returns a VARCHAR value that indicates whether or not the procedure was successfully associated with the budget.

If the procedure could not be associated with the budget, the method returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain the stored procedure.
* USAGE privilege on the stored procedure.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Associate the `reset_resources` stored procedure with the `budget_db.sch1.my_budget` budget so that it is
called when the budget cycle restarts:

```sqlexample
CALL budget_db.sch1.my_budget!SET_CYCLE_START_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.reset_resources(STRING, STRING)'),
  ARRAY_CONSTRUCT('admin@example.com', 'Budget cycle restarted'));
```

Associate the `enable_access` stored procedure with the `budget_db.sch1.my_budget` budget so that it is called when
the budget cycle restarts:

```sqlexample
CALL budget_db.sch1.my_budget!SET_CYCLE_START_ACTION(
  SYSTEM$REFERENCE('PROCEDURE', 'code_db.sch1.enable_access(STRING)'),
  ARRAY_CONSTRUCT('Re-enable resources for new budget cycle'));
```

---
title: <budget_name>!SET_EMAIL_NOTIFICATIONS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_email_notifications.md
section: SQL Classes
---

# <budget_name>!SET_EMAIL_NOTIFICATIONS

Set the email addresses to receive [budgets](../../../../user-guide/budgets.md) notifications.

See also:
:   [<budget_name>!GET_NOTIFICATION_EMAIL](get_notification_email.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md),
    [<budget_name>!GET_NOTIFICATION_MUTE_FLAG](get_notification_mute_flag.md),
    [<budget_name>!SET_NOTIFICATION_MUTE_FLAG](set_notification_mute_flag.md)

## Syntax

```sqlsyntax
<budget_name>!SET_EMAIL_NOTIFICATIONS( [ '<notification_integration>', ]
                                       '<email> [ , <email> [ , ... ] ]' )
```

## Required arguments

`'email [ , email [ , ... ] ]'`
:   Specifies the email addresses to receive budget notification emails. Each email address in the list must be
    [verified](../../../../user-guide/notifications/email-notifications.md).

## Optional arguments

`'notification_integration'`
:   Specifies the identifier for the [email notification integration](../../../../user-guide/notifications/email-notifications.md).

    If the ALLOWED_RECIPIENTS parameter is set for the notification integration, each `email` in the notifications list
    must be included in the ALLOWED_RECIPIENTS list for the notification integration. Otherwise, you can include any verified
    email address in the notifications list.

## Returns

```output
The email integration is updated.
```

## Access control requirements

* The following minimum privileges and roles are required to call this method for *custom budgets*:

  + ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The minimum role required to call this method for the *account budget* is the BUDGET_ADMIN
  [application role](../../../../user-guide/budgets.md).

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.
* If you are using a notification integration, the USAGE privilege on the notification integration must be granted to
  APPLICATION SNOWFLAKE:

  ```sqlexample
  GRANT USAGE ON INTEGRATION budgets_notification_integration
    TO APPLICATION SNOWFLAKE;
  ```

## Examples

Send email notifications for budget `my_budget` in the `budgets_db.budgets_schema` schema to
[costadmin@domain.com](mailto:costadmin%40domain.com) and [budgetadmin@domain.com](mailto:budgetadmin%40domain.com):

```sqlexample
CALL budgets_db.budgets_schema.my_budget!SET_EMAIL_NOTIFICATIONS(
   'costadmin@domain.com, budgetadmin@domain.com');
```

Send email notifications for the account budget to [budgetadmin@domain.com](mailto:budgetadmin%40domain.com):

```sqlexample
CALL snowflake.local.account_root_budget!SET_EMAIL_NOTIFICATIONS(
   'budgets_notification', 'budgetadmin@domain.com');
```

## Error messages

For a list of common error messages and their causes and solutions, see [You can’t set email notifications for a budget](../../../../user-guide/budgets/troubleshoot.md).

---
title: <budget_name>!SET_NOTIFICATION_MUTE_FLAG
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_notification_mute_flag.md
section: SQL Classes
---

# <budget_name>!SET_NOTIFICATION_MUTE_FLAG

Enable or disable notifications for a [budget](../../../../user-guide/budgets.md).

See also:
:   [<budget_name>!GET_NOTIFICATION_EMAIL](get_notification_email.md),
    [<budget_name>!GET_NOTIFICATION_INTEGRATION_NAME](get_notification_integration_name.md),
    [<budget_name>!GET_NOTIFICATION_MUTE_FLAG](get_notification_mute_flag.md),
    [<budget_name>!SET_EMAIL_NOTIFICATIONS](set_email_notifications.md)

## Syntax

```sqlsyntax
<budget_name>!SET_NOTIFICATION_MUTE_FLAG( { TRUE | FALSE } );
```

## Arguments

`{ TRUE | FALSE }`
:   * TRUE to disable notifications.
    * FALSE to enable notifications.

    Default: FALSE

## Returns

```output
The notification mute flag has been updated to <true | false>.
```

## Access control requirements

* The following minimum privileges and roles are required to call this method for *custom budgets*:

  + Any [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following minimum privileges and roles are required to call this method for the *account budget*:

  Any [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Disable notifications for budget `my_budget` in schema `budget_db.budget_schema`:

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_NOTIFICATION_MUTE_FLAG(TRUE);
```

Enable notifications for the account budget:

```sqlexample
CALL snowflake.local.account_root_budget!SET_NOTIFICATION_MUTE_FLAG(FALSE);
```

---
title: <budget_name>!SET_NOTIFICATION_THRESHOLD
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_notification_threshold.md
section: SQL Classes
---

# <budget_name>!SET_NOTIFICATION_THRESHOLD

Sets a notification threshold for a [budget](../../../../user-guide/budgets.md). Notifications are sent when Snowflake predicts that spending will
exceed the threshold.

## Syntax

```sqlsyntax
<budget_name>!SET_NOTIFICATION_THRESHOLD( <threshold_percent> );
```

## Arguments

`threshold_percent`
:   Percentage of the budget limit. Notifications are sent when Snowflake determines that spending will exceed this percentage of the budget
    limit.

    Accepted values: 0 - 1000

## Returns

Returns a VARCHAR value that indicates whether or not the notification threshold was successfully added.

## Access control requirements

The following minimum privileges and roles are required to call this method for *custom budgets*:

> * ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
> * USAGE privilege on the database and schema that contains the budget instance.

The following role is required to call this method for the *account budget*:

> * BUDGET_ADMIN [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

The following example sets the notification threshold of the account budget to 10% of the budget limit:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_NOTIFICATION_THRESHOLD(10);
```

---
title: <budget_name>!SET_REFRESH_TIER
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_refresh_tier.md
section: SQL Classes
---

# <budget_name>!SET_REFRESH_TIER

Sets the [refresh interval of a budget](../../../../user-guide/budgets.md). The budget refresh interval controls how long it takes for a
budget to be refreshed with the most current consumption data.

See also:
:   [<budget_name>!GET_REFRESH_TIER](get_refresh_tier.md)

## Syntax

```sqlsyntax
<budget_name>!SET_REFRESH_TIER( '<refresh_interval>' )
```

## Arguments

`'refresh_interval'`
:   Sets the budget refresh interval. Specify one of the following values:

    * `TIER_1H`: Sets the budget refresh interval to one hour. Setting the budget refresh interval to one hour increases the cost of the
      budget.
    * `TIER_6H`: Sets the budget refresh interval to the default of up to 6.5 hours.

    Default: `TIER_6H`

## Returns

Returns a VARCHAR value that indicates whether the refresh interval was successfully updated.

## Access control requirements

The following minimum privileges and roles are required to call this method for *custom budgets*:

> * ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
> * USAGE privilege on the database and schema that contains the budget instance.

The following role is required to call this method for the *account budget*:

> * BUDGET_ADMIN [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* Setting the budget refresh interval to one hour increases the cost of the budget by a factor of 12 compared to the default interval.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Set the refresh interval for a custom budget to one hour:

```sqlexample
CALL my_database.my_schema.my_budget!SET_REFRESH_TIER('TIER_1H');
```

Revert the refresh interval for the same budget back to the default (6.5 hours):

```sqlexample
CALL my_database.my_schema.my_budget!SET_REFRESH_TIER('TIER_6H');
```

Set the account root budget to the one-hour interval:

```sqlexample
CALL SNOWFLAKE.LOCAL.ACCOUNT_ROOT_BUDGET!SET_REFRESH_TIER('TIER_1H');
```

---
title: <budget_name>!SET_RESOURCE_TAGS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_resource_tags.md
section: SQL Classes
---

# <budget_name>!SET_RESOURCE_TAGS

Adds tags to a custom budget so that resources that are tagged with the specified tag-value pairs are included in the budget.

You can configure the budget so that a resource is included if it is tagged with *any* of the specified tags (UNION) or configure it so a resource is included only if they are tagged with *all* of the specified tags (INTERSECTION).

Calling the method replaces any existing tags that were added to the budget.

## Syntax

```sqlsyntax
<budget_name>!SET_RESOURCE_TAGS( <tag-pairs>, <operation_mode> )
```

## Arguments

`tag_pairs`
:   An [ARRAY](../../../data-types-semistructured.md) value that specifies tag references and tag values.

    A tag reference is a serialized string representation that resolves to a [tag](../../../../user-guide/object-tagging/introduction.md). This string is the output of the
    [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

    Each element in the array should be an array containing a tag reference and a tag value. For example:

    ```sqlexample
    [
      [ 'ENT_REF_TAG_10382726315710_8A8626AE765E2' , 'finance' ],
      ...
    ]
    ```

`operation_mode`
:   Specifies the matching logic to use for the specified tags. You can specify one of the following values:

    * `'UNION'`: Usage by a user is included in the budget if the user is tagged with *any* of the specified tag-value pairs. This corresponds to OR logic.
    * `'INTERSECTION'`: Usage by a user is included in the budget only if the user is tagged with *all* of the specified tag-value pairs. This corresponds to AND logic.

## Returns

Returns a VARCHAR value that indicates whether or not the tags were successfully set on the budget.

If the tags could not be set on the budget, the function returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain each tag.
* APPLYBUDGET privilege on each tag being added.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only set tags on custom budgets.
* By default, you can add up to 20 resource tags to the budget. To increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* To verify the results of the method, call the [GET_BUDGET_SCOPE](get_budget_scope.md) method.
* Snowflake doesn’t start showing usage for the added resources until the budget is refreshed, which can take up to six hours. If you want
  to view usage sooner, run the [REFRESH_USAGE](refresh_usage.md) method.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Use the `my_budget` budget to track all objects that are tagged with *either* the tag-value combination `cost_center = 'sales'` or the
tag-value combination `team_tag = 'finance'`.

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_RESOURCE_TAGS(
  [
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.cost_center', 'SESSION', 'APPLYBUDGET')), 'sales'],
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')), 'finance']
  ],
  'UNION');
```

Use the `my_budget` budget to track all objects that are tagged with *both* `cost_center = 'sales'` and `team_tag = 'finance'`.

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_RESOURCE_TAGS(
  [
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.cost_center', 'SESSION', 'APPLYBUDGET')), 'sales'],
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')), 'finance']
  ],
  'INTERSECTION');
```

---
title: <budget_name>!SET_SPENDING_LIMIT
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_spending_limit.md
section: SQL Classes
---

# <budget_name>!SET_SPENDING_LIMIT

Set the spending limit for a [budget](../../../../user-guide/budgets.md). The spending limit is expressed
in number of credits.

See also:
:   [<budget_name>!GET_SPENDING_LIMIT](get_spending_limit.md)

## Syntax

```sqlsyntax
<budget_name>!SET_SPENDING_LIMIT(<number>)
```

## Arguments

`number`
:   The number of credits allocated to the budget per month. When total usage for all objects assigned to the budget reaches this
    number for the current month, the budget is considered to be at 100% of the spending limit.

    For the account budget, all [supported objects](../../../../user-guide/budgets/custom-budget.md) contribute to the credit
    usage.

    If a value is not specified for a budget, the budget has no spending limit, will never reach 100% usage, and will not
    trigger notifications.

    Default: -1 (no spending limit).

## Returns

```output
The spending limit has been updated to <n> credits.
```

## Access control requirements

* The following minimum privileges and roles are required to view results for *custom budgets*:

  + ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
  + USAGE privilege on the database and schema that contains the budget instance.
* The following role is required to view results for the *account budget*:

  BUDGET_ADMIN [application role](../../../../user-guide/budgets.md) for the account budget.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* The `number` argument must be a positive integer.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Set the spending limit for the account budget to 500 credits per month:

```sqlexample
CALL snowflake.local.account_root_budget!SET_SPENDING_LIMIT(500);
```

Set the spending limit for budget `my_database.my_schema.my_budget` to 100 credits per month.

```sqlexample
CALL my_database.my_schema.my_budget!SET_SPENDING_LIMIT(100);
```

---
title: <budget_name>!SET_USER_TAGS
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/set_user_tags.md
section: SQL Classes
---

# <budget_name>!SET_USER_TAGS

Adds user tags to a custom budget. Consumption by a shared resource counts toward the budget’s spending limit only if the resource is acted upon by a user with the specified tag-value pairs. For more information, see [Using budgets for AI features (shared resources)](../../../../user-guide/budgets/budget-shared-resources.md).

You can configure the budget so that usage is included if a user is tagged with *any* of the specified tags (UNION) or configure it so usage is included only if the user is tagged with *all* of the specified tags (INTERSECTION).

Calling the method replaces any existing user tags that were added to the budget.

## Syntax

```sqlsyntax
<budget_name>!SET_USER_TAGS( <tag-pairs>, <operation_mode> )
```

## Arguments

`tag_pairs`
:   An [ARRAY](../../../data-types-semistructured.md) value that specifies tag references and tag values.

    A tag reference is a serialized string representation that resolves to a [tag](../../../../user-guide/object-tagging/introduction.md). This string is the output of the
    [SYSTEM$REFERENCE](../../../functions/system_reference.md) function.

    Each element in the array should be an array containing a tag reference and a tag value. For example:

    ```sqlexample
    [
      [ 'ENT_REF_TAG_10382726315710_8A8626AE765E2' , 'finance' ],
      ...
    ]
    ```

`operation_mode`
:   Specifies the matching logic to use for the specified tags. You can specify one of the following values:

    * `'UNION'`: Usage by a user is included in the budget if the user is tagged with *any* of the specified tag-value pairs. This corresponds to OR logic.
    * `'INTERSECTION'`: Usage by a user is included in the budget only if the user is tagged with *all* of the specified tag-value pairs. This corresponds to AND logic.

## Returns

Returns a VARCHAR value that indicates whether or not the tags were successfully set on the budget.

If the tags could not be set on the budget, the function returns an error message.

## Access control requirements

The following privileges and roles are required to call this method for a custom budget:

* ADMIN [instance role](../../../../user-guide/budgets.md) for the budget instance.
* USAGE privilege on the database and schema that contain the budget instance.
* USAGE privilege on the database and schema that contain each tag.
* APPLYBUDGET privilege on each tag being added.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only set tags on *custom budgets*.
* By default, you can add up to 20 user tags to the budget. To increase this limit, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
* To verify the results of the method, call the [GET_BUDGET_SCOPE](get_budget_scope.md) method.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Use the `my_budget` budget to track consumption when shared resources are acted upon by users tagged with *either* the tag-value combination
`cost_center = 'sales'` or the tag-value combination `team_tag = 'finance'`.

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_USER_TAGS(
  [
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.cost_center', 'SESSION', 'APPLYBUDGET')), 'sales'],
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')), 'finance']
  ],
  'UNION');
```

Use the `my_budget` budget to track consumption when shared resources are acted upon by users tagged with *both* `cost_center = 'sales'` and `team_tag = 'finance'`.

```sqlexample
CALL budget_db.budget_schema.my_budget!SET_USER_TAGS(
  [
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.cost_center', 'SESSION', 'APPLYBUDGET')), 'sales'],
      [(SELECT SYSTEM$REFERENCE('TAG', 'cost_mgmt_db.tags.team_tag', 'SESSION', 'APPLYBUDGET')), 'finance']
  ],
  'INTERSECTION');
```

---
title: <classification_profile_name>!DESCRIBE
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/describe.md
section: SQL Classes
---

# <classification_profile_name>!DESCRIBE

Describes the properties of an instance of the CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!DESCRIBE()
```

## Output

The output includes the criteria you specified when [creating](../commands/create-classification-profile.md) the instance and is formatted as
follows:

```output
{
   "auto_tag": true | false ,
   "maximum_classification_validity_days": <integer>,
   "minimum_object_age_for_classification_days": <integer>
   "column_tag_map": <object>
   "custom_classifiers": <object>,
}
```

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Describe the classification profile:

```sqlexample
SELECT my_classification_profile!DESCRIBE();
```

```output
+--------------------------------------------------------+
|         MY_CLASSIFICATION_PROFILE!DESCRIBE()           |
+--------------------------------------------------------+
|   {                                                    |
|     "auto_tag": true,                                  |
|     "maximum_classification_validity_days": 30,        |
|     "column_tag_map": [                                |
|       {                                                |
|         "semantic_categories": [                       |
|           "NAME"                                       |
|         ],                                             |
|         "tag_name": "test_cc_db.test_cc_schema.pii_r3",|
|         "tag_value": "important"                       |
|       },                                               |
|      "custom_classifiers": {                           |
|        "PII": {                                        |
|          "SC1": {                                      |
|            "col_name_regex": "my_name",                |
|             "description": "a new semantic category",  |
|             "privacy_category": "IDENTIFIER",          |
|             "threshold": 0.8,                          |
|             "value_regex": "\\\\d{{2}}-\\\\d{{2}}"     |
|          },                                            |
|          "SC2": {                                      |
|            "privacy_category": "IDENTIFIER",           |
|            "threshold": 0.8,                           |
|            "value_regex": "\\\\d{{3}}-\\\\d{{3}}|\\\\d"|
|          }                                             |
|        }                                               |
|      },                                                |
       "minimum_object_age_for_classification_days": 1   |
|   }                                                    |
+--------------------------------------------------------+
```

---
title: <classification_profile_name>!SET_AUTO_TAG
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_auto_tag.md
section: SQL Classes
---

# <classification_profile_name>!SET_AUTO_TAG

Specifies whether to enable auto-tagging for the instance of the CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_AUTO_TAG( <boolean_value> )
```

## Arguments

`boolean_value`
:   Specifies whether to enable auto-tagging for the instance of the CLASSIFICATION_PROFILE class.

    TRUE enables auto-tagging.

    FALSE disables auto-tagging.

    Default: FALSE

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Enable auto-tagging for the classification profile:

```sqlexample
CALL my_classification_profile!SET_AUTO_TAG(true);
```

Disable auto-tagging for classification profile:

```sqlexample
CALL my_classification_profile!SET_AUTO_TAG(false);
```

---
title: <classification_profile_name>!SET_CLASSIFY_VIEWS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_classify_views.md
section: SQL Classes
---

# <classification_profile_name>!SET_CLASSIFY_VIEWS

Specifies whether to classify views during sensitive data classification.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_CLASSIFY_VIEWS( <boolean_value> )
```

## Arguments

`boolean_value`
:   Specifies whether to enable classification of views for the instance of the CLASSIFICATION_PROFILE class.

    TRUE enables the classification of views.

    FALSE disables the classification of views. Only tables will be classified by sensitive data classification.

    Default: FALSE

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Configure the classification profile so that views are classified along with tables:

```sqlexample
CALL my_classification_profile!SET_CLASSIFY_VIEWS(true);
```

Disable view classification for the classification profile:

```sqlexample
CALL my_classification_profile!SET_CLASSIFY_VIEWS(false);
```

---
title: <classification_profile_name>!SET_CUSTOM_CLASSIFIERS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_custom_classifiers.md
section: SQL Classes
---

# <classification_profile_name>!SET_CUSTOM_CLASSIFIERS

Adds [custom classifiers](../../../../user-guide/classify-custom.md) to an existing classification profile so sensitive data can be automatically
classified with custom classification semantic and privacy categories.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_CUSTOM_CLASSIFIERS( <object> )
```

## Arguments

`object`
:   An [OBJECT](../../../data-types-semistructured.md) value that specifies the custom classifiers to add to the classification profile.

    Each key in the object specifies the name of an instance of the [CUSTOM_CLASSIFIER class](../../custom_classifier.md).

    The value of each key specifies the [custom_classifier!LIST](../../custom_classifier/methods/list.md) method of the custom classifier instance.

## Returns

Returns a successful status message or an error message.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

```sqlexample
CALL my_classification_profile!SET_CUSTOM_CLASSIFIERS(
  {
    'medical_codes': medical_codes!list(),
    'finance_codes': finance_codes!list()
  });
```

---
title: <classification_profile_name>!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_enable_tag_based_sensitive_data_exclusion.md
section: SQL Classes
---

# <classification_profile_name>!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION

Enables or disables the ability to exclude certain data from sensitive data classification. When enabled, all objects tagged with
`SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION='TRUE'` are skipped during sensitive data classification.

For more information, see [Excluding data from sensitive data classification](../../../../user-guide/classify-auto-exclude.md).

## Syntax

```sqlsyntax
<classification_profile_name>!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION( <boolean_value> )
```

## Arguments

`boolean_value`
:   Determines whether tag-based sensitive data exclusion is enabled for the classification profile.

    When set to TRUE, objects tagged with `SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION='TRUE'` are excluded from
    sensitive data classification.

    When set to FALSE, all objects are included in sensitive data classification regardless of whether the
    `SKIP_SENSITIVE_DATA_CLASSIFICATION` tag is set on an object.

    Default: FALSE

## Returns

Returns a successful status message or an error message.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Enable tag-based sensitive data exclusion for the `my_classification_profile` instance:

```sqlexample
CALL my_classification_profile!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION(TRUE);
```

Disable tag-based sensitive data exclusion for the `test_profile` instance:

```sqlexample
CALL test_profile!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION(FALSE);
```

---
title: <classification_profile_name>!SET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_maximum_classification_validity_days.md
section: SQL Classes
---

# <classification_profile_name>!SET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS

Specifies the maximum number of days to wait before a table is eligible to be automatically classified for an instance of the
CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS( <days> )
```

## Arguments

`days`
:   Specifies the number of days since the last classification event before a table can be classified again using automatic classification.

    The value must be greater than `0`.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Set the minimum number of days to be `5` before a table can be automatically classified again:

```sqlexample
CALL my_classification_profile!SET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS(5);
```

---
title: <classification_profile_name>!SET_MINIMUM_OBJECT_AGE_FOR_CLASSIFICATION_DAYS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_minimum_object_age_for_classification_days.md
section: SQL Classes
---

# <classification_profile_name>!SET_MINIMUM_OBJECT_AGE_FOR_CLASSIFICATION_DAYS

Specifies the minimum number of days an object must exist before it is eligible to be automatically classified by an instance of the
CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_MINIMUM_OBJECT_AGE_FOR_CLASSIFICATION_DAYS( <days> )
```

## Arguments

`days`
:   Specifies the INTEGER number of days to wait before a table is eligible to be classified.

    The value must be equal to or greater than `0`.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Set the minimum number of days to be `1` before a table can be automatically classified:

```sqlexample
CALL my_classification_profile!SET_MINIMUM_OBJECT_AGE_FOR_CLASSIFICATION_DAYS(1);
```

---
title: <classification_profile_name>!SET_SNOWFLAKE_SEMANTIC_CATEGORIES
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_snowflake_semantic_categories.md
section: SQL Classes
---

# <classification_profile_name>!SET_SNOWFLAKE_SEMANTIC_CATEGORIES

Configures the classification profile to limit which types of data (semantic categories) to classify as sensitive. Snowflake
classifies data only if it belongs to the subset of [native semantic categories](../../../../user-guide/classify-native.md) that you
specify using this method.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_SNOWFLAKE_SEMANTIC_CATEGORIES( <array> )
```

## Arguments

`array`
:   An [ARRAY](../../../data-types-semistructured.md) value that specifies a list of Snowflake [native semantic categories](../../../../user-guide/classify-native.md)
    (types of data) and optional locales to use for classification. Snowflake identifies data as sensitive only if the data is classified as
    belonging to the specified categories (and locales, if provided).

    The array can contain objects with the following keys:

    * `category` — Required string that specifies a native semantic category.
    * `country_codes` — Optional array that specifies two-letter country codes. Snowflake identifies data as belonging to a category
      only if a semantic subcategory exists for the specified locales.

      To determine if a semantic subcategory exists for a locale and obtain the two-letter code for a country, see
      [Native semantic categories of sensitive data classification](../../../../user-guide/classify-native.md).

## Returns

Returns a successful status message or an error message.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Configure a classification profile so that data is classified only if it belongs to the NAME and NATIONAL_IDENTIFIER semantic categories:

```sqlexample
CALL my_classification_profile!SET_SNOWFLAKE_SEMANTIC_CATEGORIES(
  [
    {'category': 'NAME'},
    {'category': 'NATIONAL_IDENTIFIER'}
  ]);
```

Configure a classification profile so that data is classified only if Snowflake identifies it as a tax identifier in Italy (IT) or France (FR):

```sqlexample
CALL my_classification_profile!SET_SNOWFLAKE_SEMANTIC_CATEGORIES(
  [
    {
      'category': 'TAX_IDENTIFIER',
      'country_codes': ['IT', 'FR']
    }
  ]);
```

Combine global semantic categories with country-specific categories:

```sqlexample
CALL my_classification_profile!SET_SNOWFLAKE_SEMANTIC_CATEGORIES(
  [
    {'category': 'NAME'},
    {
      'category': 'PASSPORT',
      'country_codes': ['US']
    }
  ]);
```

---
title: <classification_profile_name>!SET_TAG_MAP
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/set_tag_map.md
section: SQL Classes
---

# <classification_profile_name>!SET_TAG_MAP

Adds a JSON object to an instance of the CLASSIFICATION_PROFILE class to map user-defined [tags](../../../../user-guide/object-tagging/introduction.md) to the
SEMANTIC_CATEGORY system tag.

## Syntax

```sqlsyntax
<classification_profile_name>!SET_TAG_MAP( <object> )
```

## Arguments

`object`
:   An [OBJECT](../../../data-types-semistructured.md) that maps one or more user-defined tags to the SEMANTIC_CATEGORY system tag.

    `'column_tag_map': [ ... ]`
    :   An array of objects that have the following key-value pairs:

        `'tag_name': 'string'`
        :   The fully qualified name of the tag.

            For more information, see [Identifier requirements](../../../identifiers-syntax.md).

        `'tag_value':'string'`
        :   The string value of the tag.

            Optional: If not specified, you must also omit the `semantic_categories` key. If omitted, the `tag_name` tag is applied to
            every column to which the SEMANTIC_CATEGORY system tag is applied, and the value of the user-defined tag will match the value of the
            SEMANTIC_CATEGORY tag.

        `'semantic_categories': [ 'category' [ , 'category' ... ] ]`
        :   A comma-separated list of [native categories](../../../../user-guide/classify-native.md). The `tag_name` user-defined tag is mapped to
            instances where the value of the SEMANTIC_CATEGORY tag is one of the specified native categories.

            Optional: If not specified, you must also omit the `tag_value` key. If omitted, the `tag_name` tag is applied to every
            column to which the system SEMANTIC_CATEGORY tag is applied, and the value of the user-defined tag will match the value of the
            SEMANTIC_CATEGORY tag.

## Returns

Returns a successful status message or an error message. For more information, see [About tag mapping](../../../../user-guide/classify-auto.md).

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

* If the same tag and semantic category is mapped to two different values, then the order of the objects in the `column_tag_map`
  determines the tag and string value to set on a column. Order the `column_tag_map` arrays from highest preference to lowest
  preference.

## Examples

Map a single tag and its value to the `my_classification_profile` instance:

```sqlexample
CALL my_classification_profile!SET_TAG_MAP(
  {
    'column_tag_map':[
      {
        'tag_name':'tag_db.sch.pii',
        'tag_value':'important',
        'semantic_categories':['NAME']
      }
    ]
  }
);
```

---
title: <classification_profile_name>!UNSET_CUSTOM_CLASSIFIERS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/unset_custom_classifiers.md
section: SQL Classes
---

# <classification_profile_name>!UNSET_CUSTOM_CLASSIFIERS

Remove [custom classifiers](../../../../user-guide/classify-custom.md) from the classification profile.

## Syntax

```sqlsyntax
<classification_profile_name>!UNSET_CUSTOM_CLASSIFIERS()
```

## Returns

Returns a successful status message or an error message.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove custom classifiers from the `my_classification_profile` profile:

```sqlexample
CALL my_classification_profile!UNSET_CUSTOM_CLASSIFIERS();
```

---
title: <classification_profile_name>!UNSET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/unset_maximum_classification_validity_days.md
section: SQL Classes
---

# <classification_profile_name>!UNSET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS

Unsets the maximum number of days to wait before a table is eligible to be automatically classified for an instance of the
CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!UNSET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS()
```

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Unset the minimum number of days before a table can be automatically classified again:

```sqlexample
CALL my_classification_profile!UNSET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS();
```

---
title: <classification_profile_name>!UNSET_SNOWFLAKE_SEMANTIC_CATEGORIES
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/unset_snowflake_semantic_categories.md
section: SQL Classes
---

# <classification_profile_name>!UNSET_SNOWFLAKE_SEMANTIC_CATEGORIES

Removes the semantic category restrictions from the classification profile so that all [native semantic categories](../../../../user-guide/classify-native.md) are used during classification.

## Syntax

```sqlsyntax
<classification_profile_name>!UNSET_SNOWFLAKE_SEMANTIC_CATEGORIES()
```

## Returns

Returns a successful status message or an error message.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Remove semantic category restrictions from the `my_classification_profile` profile:

```sqlexample
CALL my_classification_profile!UNSET_SNOWFLAKE_SEMANTIC_CATEGORIES();
```

---
title: <classification_profile_name>!UNSET_TAG_MAP
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/methods/unset_tag_map.md
section: SQL Classes
---

# <classification_profile_name>!UNSET_TAG_MAP

Unsets a tag mapping from an instance of the CLASSIFICATION_PROFILE class.

## Syntax

```sqlsyntax
<classification_profile_name>!UNSET_TAG_MAP()
```

## Returns

Returns a successful status message or an error message. For more information, see [About tag mapping](../../../../user-guide/classify-auto.md).

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `classification_profile`!PRIVACY_USER | The classification profile instance. | The account role that calls this method must be granted this instance role on the classification profile. The role used to create the instance is automatically granted this instance role. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
return value of this method. Instead, call each method in a separate SQL statement.

## Examples

Unset the tag mapping from the `my_classification_profile` instance:

```sqlexample
CALL my_classification_profile!UNSET_TAG_MAP();
```

---
title: <instance_name>!GET_DRIVERS
source: https://docs.snowflake.com/en/sql-reference/classes/top-insights/methods/get_drivers.md
section: SQL Classes
---

# <instance_name>!GET_DRIVERS

Finds the most important dimensions in a dataset, builds segments from those dimensions, and then determines which of
those segments most influenced the metric.

GET_DRIVERS is well-suited to extracting root causes from datasets that have a large number of dimensions. Continuous
dimensions are also supported without pre-processing them into categorical dimensions, and the results can indicate
dimensions with negative conditions (for example, “region is not North America”).

If you need to select specific columns from the data returned by this method, use
[RESULT_SCAN](../../../functions/result_scan.md).

## Syntax

```sqlsyntax
<model_name>!GET_DRIVERS(
  INPUT_DATA => <input_data>,
  LABEL_COLNAME => '<label_colname>',
  METRIC_COLNAME => '<metric_colname>'
);
```

INPUT_DATA
:   A [reference](../../../references.md) to a table, view, or query. All columns other than the ones specified
    by LABEL_COLNAME and METRIC_COLNAME are taken as dimensions to be considered by Top Insights. Numeric columns are
    taken to be continuous dimensions, while string and Boolean columns are considered categorical dimensions. To treat a
    numeric column as a categorical dimension, cast it to a string.

LABEL_COLNAME
:   The name of a Boolean column in INPUT_DATA designated as the label that indicates control data (FALSE) vs test data
    (TRUE).

METRIC_COLNAME
:   The name of a [FLOAT](../../../data-types-numeric.md) column in INPUT_DATA representing the value of interest that has
    been influenced by the included dimensions.

## Output

| Column | Type | Description |
| --- | --- | --- |
| CONTRIBUTOR | [ARRAY](../../../data-types-semistructured.md) | ARRAY of strings describing a segment or insight from the algorithm. |
| METRIC_CONTROL | [FLOAT](../../../data-types-numeric.md) | The total value of the metric in the control period in a specific segment. |
| METRIC_TEST | [FLOAT](../../../data-types-numeric.md) | The total value of the metric in the test period in a specific segment. |
| CONTRIBUTION | [FLOAT](../../../data-types-numeric.md) | The absolute impact of the segment on the change in the metric. |
| RELATIVE_CONTRIBUTION | [FLOAT](../../../data-types-numeric.md) | The impact of the segment as a proportion of the overall change in the metric between test and control. |
| GROWTH_RATE | [FLOAT](../../../data-types-numeric.md) | The change in the metric in the segment as a proportion of the metric in the control group in the segment. |

## Usage Notes

* Execution time scales with the number of dimensions and the cardinality of those dimensions.
* The input metric must be an individual observation or an aggregate.
* For categorical dimensions having more than 25 values, Top Insights uses only the top 25 most influential values to create segments.

## Examples

See [Examples](../../../../user-guide/ml-functions/top-insights.md).

---
title: <model_name>!DETECT_ANOMALIES
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/methods/detect_anomalies.md
section: SQL Classes
---

# <model_name>!DETECT_ANOMALIES

Detects and reports anomalies in the input data passed to the method. This is a method of the anomaly detector object that you create by executing the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../commands/create-anomaly-detection.md) command.

The method returns a table that labels each row of the input data as anomalous or not.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

```sqlsyntax
<model_name>!DETECT_ANOMALIES(
  INPUT_DATA => <reference_to_data_to_analyze>,
  TIMESTAMP_COLNAME => '<timestamp_column_name>',
  TARGET_COLNAME => '<target_column_name>',
  [ CONFIG_OBJECT => <configuration_object>, ]
  [ SERIES_COLNAME => '<series_column_name>' ]
)
```

> **Note:**
>
> `model_name` is the object that you create by executing the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../commands/create-anomaly-detection.md) command.

## Arguments

**Required:**

`INPUT_DATA => reference_to_data_to_analyze`
:   A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to the table, view, or query that returns
    the data to analyze.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

`TIMESTAMP_COLNAME => 'timestamp_column_name'`
:   The name of the column containing the timestamps (TIMESTAMP_NTZ) in the time-series data.

`TARGET_COLNAME => 'target_column_name'`
:   The name of the column containing the data to analyze (type NUMERIC or FLOAT).

**Optional:**

`SERIES_COLNAME => 'series_column_name'`
:   Name of the column containing the identifier for the series (for multi-series data). This column should be a
    VARIANT because it can be any type of value or values from multiple columns in an array.

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the anomaly detection job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.99 | Value between 0 and 1 that specifies the percentage of the observations that should be marked as anomalies:  * For less strict anomaly detection (that is, identifying fewer observations marked as anomalies), specify a higher value. * For more strict anomaly detection (that is, identifying more observations as anomalies), reduce this value. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) that specifies the error handling for the anomaly detection task. This is most useful when detecting anomalies in multiple series. Supported values are:   * `'abort'`: Abort the operation if an error is encountered in any time series. * `'skip'`: Skip any time series where anomaly detection encounters an error. This allows anomaly detection   to succeed for other time series. Series that failed are absent from the output. |

## Returns

The function returns the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| TS | TIMESTAMP_NTZ | The timestamps of the data |
| Y | FLOAT | The values for the time series |
| FORECAST | FLOAT | The predicted value at the timestamp. |
| LOWER_BOUND | FLOAT | The lower bound of the value within the prediction interval. Values that are lower than this are flagged as anomalies. |
| UPPER_BOUND | FLOAT | The upper bound of the value within the prediction interval. Values that are higher than this are flagged as anomalies. |
| IS_ANOMALY | BOOLEAN | True if the value is an anomaly; False if not. |
| PERCENTILE | FLOAT | The corresponding percentile of the observed Y value given the prediction interval.  If the percentile is outside of `((1 - alpha) / 2, 1 - (1 - alpha) / 2)`, the value is flagged as an anomaly. For example, if the prediction interval is 0.95, a percentile of 0.96 **would not** be an anomaly, but a percentile of 0.98 would be.  If the `prediction_interval` field is not specified in the configuration object, the default is 0.99. |
| DISTANCE | FLOAT | The multiple of the standard deviation from the FORECAST column (z-score) |

## Usage notes

* The columns for the data specified in the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../commands/create-anomaly-detection.md) command (in the INPUT_DATA
  constructor argument) must match the columns for the data specified in the INPUT_DATA argument of this method.

  For example, if you passed the SERIES_COLNAME argument to the [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](../commands/create-anomaly-detection.md) command, you must also pass the
  SERIES_COLNAME argument to this method. If you omitted the SERIES_COLNAME argument in the command, you must omit that argument here.
* If the column names specified by the TIMESTAMP_COLNAME or TARGET_COLNAME arguments do not exist in the table, view, or query
  specified by the INPUT_DATA argument, an error occurs.

---
title: <model_name>!EXPLAIN_FEATURE_IMPORTANCE
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/methods/explain_feature_importance.md
section: SQL Classes
---

# <model_name>!EXPLAIN_FEATURE_IMPORTANCE

Returns the relative feature importance for each feature used by the model.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

```sqlsyntax
<model_name>!EXPLAIN_FEATURE_IMPORTANCE();
```

## Returns

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| RANK | [INTEGER](../../../data-types-numeric.md) | The importance rank of a feature for a specific series |
| FEATURE_NAME | [VARCHAR](../../../data-types-text.md) | The name of the feature used to train the model `aggregated_endogenous_features` represents all features derived as transformations of your target variable. |
| IMPORTANCE_SCORE | [FLOAT](../../../data-types-numeric.md) | The feature’s importance score: a value in [0, 1], with 0 being the lowest possible importance, and 1 the highest. |
| FEATURE_TYPE | [VARCHAR](../../../data-types-text.md) | The source of the feature, one of:   * `user_provided` * `derived_from_timestamp` * `derived_from_endogenous` |

---
title: <model_name>!EXPLAIN_FEATURE_IMPORTANCE
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/methods/explain_feature_importance.md
section: SQL Classes
---

# <model_name>!EXPLAIN_FEATURE_IMPORTANCE

Returns the relative feature importance for each feature used by the model.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

```sqlsyntax
<model_name>!EXPLAIN_FEATURE_IMPORTANCE();
```

## Output

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| RANK | [INTEGER](../../../data-types-numeric.md) | The importance rank of a feature for a particular series. |
| FEATURE_NAME | [VARCHAR](../../../data-types-text.md) | The name of the feature used to train the model. `aggregated_endogenous_features` represents all features derived as transformations of the target variable. |
| IMPORTANCE_SCORE | [FLOAT](../../../data-types-numeric.md) | The feature’s importance score: a value in [0, 1], with 0 being the lowest possible importance, and 1 the highest. |
| FEATURE_TYPE | [VARCHAR](../../../data-types-text.md) | The source of the feature. One of:   * `user_provided`: Feature data provided by the user. * `derived_from_timestamp`: Periodic feature (e.g. day, week, or month) derived from timestamp data. * `derived_from_endogenous`: Features derived from a transformation of the target variable. |

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: <model_name>!FORECAST
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/methods/forecast.md
section: SQL Classes
---

# <model_name>!FORECAST

Generates a forecast from the previously trained model `model_name`.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

The required arguments vary depending on what use case the model was trained for.

**For single-series models without exogenous variables:**

```sqlsyntax
<name>!FORECAST(
  FORECASTING_PERIODS => <forecasting_periods>,
  [ CONFIG_OBJECT => <config_object> ]
);
```

**For single-series models with exogenous variables:**

```sqlsyntax
<name>!FORECAST(
  INPUT_DATA => <input_data>,
  TIMESTAMP_COLNAME => '<timestamp_colname>',
  [ CONFIG_OBJECT => <config_object> ]
);
```

**For multiple-series models without exogenous variables:**

```sqlsyntax
<name>!FORECAST(
  SERIES_VALUE => <series>,
  FORECASTING_PERIODS => <forecasting_periods>,
  [ CONFIG_OBJECT => <config_object> ]
);
```

**For multiple-series models with exogenous variables:**

```sqlsyntax
<name>!FORECAST(
  SERIES_VALUE => <series>,
  SERIES_COLNAME => <series_colname>,
  INPUT_DATA => <input_data>,
  TIMESTAMP_COLNAME => '<timestamp_colname>',
  [ CONFIG_OBJECT => <config_object> ]
);
```

## Arguments

**Required:**

Not all of the following arguments are required for every use case.

`FORECASTING_PERIODS => forecasting_periods`
:   Required for forecasts without exogenous variables.

    The number of steps ahead to forecast. The interval between steps is inferred by the model during training.

`INPUT_DATA => input_data`
:   Required for forecasts with exogenous variables.

    A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to a table, view, or query
    that contains the future timestamps and values of the exogenous variables (additional user-provided features) that
    were passed as `input_data` when training the model. Using a reference allows the forecasting process, which
    runs with limited privileges, to use your privileges to access the data. Columns are matched between this argument and
    the original exogenous training data by name.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

`TIMESTAMP_COLNAME => 'timestamp_colname'`
:   Required for forecasts with exogenous variables.

    The name of the column in `input_data` containing the timestamps.

`SERIES_COLNAME => 'series_colname'`
:   Required for multi-series forecasts with exogenous variables.

    The name of the column in `input_data` specifying the series.

`SERIES_VALUE => series`
:   Required for multi-series forecasts.

    The time series to forecast. Can be a single value (e.g., `'Series A'::variant`) or a [VARIANT](../../../data-types-semistructured.md), but must specify a series that
    the model has been trained on. If not specified, all trained series are predicted.

**Optional:**

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the forecast job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.95 | A value greater than or equal to 0.0 and less than 1.0. The default value of 0.95 means 95% of future points are expected to fall within the interval [lower_bound, upper_bound] from the forecast result. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) specifying the error handling method. This is most useful when forecasting multiple series. Supported values are:   * `'abort'`: Abort the model forecasting operation if an error is encountered in any time series. * `'skip'`: Skip any time series where forecasting encounters an error. This allows forecasting   to succeed for other time series. Series that failed are absent from the model output. |

## Output

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| TS | [TIMESTAMP_NTZ](../../../data-types-datetime.md) | Timestamp. |
| FORECAST | [FLOAT](../../../data-types-numeric.md) | Forecast target value. |
| LOWER_BOUND | [FLOAT](../../../data-types-numeric.md) | Lower boundary of prediction interval. |
| UPPER_BOUND | [FLOAT](../../../data-types-numeric.md) | Upper boundary of prediction interval. |

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: <model_name>!SHOW_CONFUSION_MATRIX
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_confusion_matrix.md
section: SQL Classes
---

# <model_name>!SHOW_CONFUSION_MATRIX

Returns a table containing the number of instances of each combination of actual class and predicted class in models
where evaluation was enabled at instantiation. You can use this dataset to plot a confusion matrix. This method takes no
arguments. See [Confusion Matrix in show_confusion_matrix](../../../../user-guide/ml-functions/classification.md).

## Output

| Column | Type | Description |
| --- | --- | --- |
| `dataset_type` | [VARCHAR](../../../data-types-text.md) | The name of the dataset used for metrics calculation, currently EVAL. |
| `actual_class` | [VARCHAR](../../../data-types-text.md) | The actual class. |
| `predicted_class` | [VARCHAR](../../../data-types-text.md) | The predicted class. |
| `count` | [INTEGER](../../../data-types-numeric.md) | The number of instances of the given combination of actual and predicted class. |
| `logs` | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_EVALUATION_METRICS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/methods/show_evaluation_metrics.md
section: SQL Classes
---

# <model_name>!SHOW_EVALUATION_METRICS

Returns out-of-sample evaluation metrics.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

You can call this method to retrieve the cross-validation metrics generated when the model was trained, or you
can call it with additional data that was not available at training time (out-of-sample data) and receive
metrics based on how well the model predicts that data.

**Return time-series cross-validation metrics generated at training time:**

These metrics are available only if `evaluate=TRUE` in the `CONFIG_OBJECT` during model construction
(this is the default).

```sqlsyntax
<model_name>!SHOW_EVALUATION_METRICS();
```

**Compute cross-validation metrics on additional out-of-sample data:**

```sqlsyntax
<model_name>!SHOW_EVALUATION_METRICS(
  INPUT_DATA => <input_data>,
  [ SERIES_COLNAME => '<series_colname>', ]
  TIMESTAMP_COLNAME => '<timestamp_colname>',
  TARGET_COLNAME => '<target_colname>',
  LABEL_COLNAME => '<label_column_name>',
  [ CONFIG_OBJECT => <config_object> ]
);
```

## Arguments

The following arguments apply only to the additional out-of-sample data use case.

**Required:**

Not all of the following arguments are required for every use case.

`INPUT_DATA => input_data`
:   A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to a table, view, or query
    that contains the future timestamps and values of the target and any exogenous variables used during training. Columns
    are matched between this argument and the original exogenous training data by name.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

`TIMESTAMP_COLNAME => 'timestamp_colname'`
:   Name of the column containing the timestamps in `input_data`.

`TARGET_COLNAME => 'target_colname'`
:   Name of the column containing the target (dependent value) in `input_data`.

`LABEL_COLNAME => 'label_column_name'`
:   Name of the column containing the labels for the data. Labels are Boolean (true/false) values indicating
    whether a given row is a known anomaly. If you do not have labeled data, pass an empty string (`''`) for this argument.

**Optional:**

`SERIES_COLNAME => 'series_colname'`
:   Name of the column in `input_data` specifying the series.

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the evaluation job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.99 | A value greater than or equal to 0.0 and less than 1.0. The default value of 0.95 means 95% of future points are expected to fall within the interval [lower_bound, upper_bound] derived from the forecast result. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) specifying the error handling method. This is most useful when forecasting multiple series. Supported values are:   * `'abort'`: Abort the model forecasting operation if an error is encountered in any time series. * `'skip'`: Skip any time series where forecasting encounters an error. This allows forecasting   to succeed for other time series. Series that fail are absent from the model output. |

## Output

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| ERROR_METRIC | [VARCHAR](../../../data-types-text.md) | The name of the error metric used. The method returns the following metrics:  Point Metrics:  * `MAE`: [Mean Absolute Error](https://en.wikipedia.org/wiki/Mean_absolute_error). * `MAPE`: [Mean Absolute Percentage Error](https://en.wikipedia.org/wiki/Mean_absolute_percentage_error). * `MDA`: [Mean Directional Accuracy](https://en.wikipedia.org/wiki/Mean_directional_accuracy). * `MSE`: [Mean Squared Error](https://en.wikipedia.org/wiki/Mean_squared_error). * `SMAPE`: [Symmetric Mean Absolute Percentage Error](https://en.wikipedia.org/wiki/Symmetric_mean_absolute_percentage_error).  Interval Metrics: These metrics use the `prediction_interval` argument from the [Evaluation configuration](../commands/create-anomaly-detection.md).  * `COVERAGE_INTERVAL`: The proportion of actual values that fall within the prediction interval. * `WINKLER_ALPHA`: [Winkler Score](https://otexts.com/fpp3/distaccuracy.html#winkler-score). |
| LOGS | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_EVALUATION_METRICS
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_evaluation_metrics.md
section: SQL Classes
---

# <model_name>!SHOW_EVALUATION_METRICS

Returns evaluation metrics for each class in models where evaluation was enabled at instantiation. This method takes no
arguments. See [Metrics in show_evaluation_metrics](../../../../user-guide/ml-functions/classification.md).

## Output

| Column | Type | Description |
| --- | --- | --- |
| `dataset_type` | [VARCHAR](../../../data-types-text.md) | The name of the dataset used for metrics calculation, currently EVAL. |
| `class` | [VARCHAR](../../../data-types-text.md) | The predicted class. Each class has its own set of metrics, which are provided in multiple rows. |
| `error_metric` | [VARCHAR](../../../data-types-text.md) | The error metric name. Can include Precision, Recall, F1, etc. |
| `metric_value` | [FLOAT](../../../data-types-numeric.md) | The error metric value |
| `logs` | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_EVALUATION_METRICS
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/methods/show_evaluation_metrics.md
section: SQL Classes
---

# <model_name>!SHOW_EVALUATION_METRICS

Returns out-of-sample evaluation metrics generated using time-series cross validation.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

You can call this method to retrieve the cross-validation metrics generated when the model was trained, or you
can call it with additional data that was not available at training time (out-of-sample data) and receive
metrics based on how well the model predicts that data.

**Return time series cross-validation metrics generated at training time:**

These metrics are available only if `evaluate=TRUE` in the `CONFIG_OBJECT` during model construction
(this is the default).

```sqlsyntax
<model_name>!SHOW_EVALUATION_METRICS();
```

**Compute cross-validation metrics on additional out-of-sample data:**

```sqlsyntax
<model_name>!SHOW_EVALUATION_METRICS(
  INPUT_DATA => <input_data>,
  [ SERIES_COLNAME => '<series_colname>', ]
  TIMESTAMP_COLNAME => '<timestamp_colname>',
  TARGET_COLNAME => '<target_colname>',
  [ CONFIG_OBJECT => <config_object> ]
);
```

## Arguments

The following arguments only apply to the additional out-of-sample data use case.

**Required:**

Not all of the following arguments are required for every use case.

`INPUT_DATA => input_data`
:   A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to a table, view, or query
    that contains the future timestamps and values of the target and any exogenous variables used during training. Columns
    are matched between this argument and the original exogenous training data by name.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

`TIMESTAMP_COLNAME => 'timestamp_colname'`
:   Name of the column containing the timestamps in `input_data`.

`TARGET_COLNAME => 'target_colname'`
:   Name of the column containing the target (dependent value) in `input_data`.

**Optional:**

`SERIES_COLNAME => 'series_colname'`
:   The name of the column in `input_data` specifying the series.

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the evaluation job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.95 | A value greater than or equal to 0.0 and less than 1.0. The default value of 0.95 means 95% of future points are expected to fall within the interval [lower_bound, upper_bound] derived from the forecast result. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) specifying the error handling method. This is most useful when forecasting multiple series. Supported values are:   * `'abort'`: Abort the model forecasting operation if an error is encountered in any time series. * `'skip'`: Skip any time series where forecasting encounters an error. This allows forecasting   to succeed for other time series. Series that fail are absent from the model output. |

## Output

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| ERROR_METRIC | [VARCHAR](../../../data-types-text.md) | The name of the error metric used. The method returns the following metrics:  Point Metrics:  * `MAE`: [Mean Absolute Error](https://en.wikipedia.org/wiki/Mean_absolute_error). * `MAPE`: [Mean Absolute Percentage Error](https://en.wikipedia.org/wiki/Mean_absolute_percentage_error). * `MDA`: [Mean Directional Accuracy](https://en.wikipedia.org/wiki/Mean_directional_accuracy). * `MSE`: [Mean Squared Error](https://en.wikipedia.org/wiki/Mean_squared_error). * `SMAPE`: [Symmetric Mean Absolute Percentage Error](https://en.wikipedia.org/wiki/Symmetric_mean_absolute_percentage_error).  Interval Metrics: These metrics use the `prediction_interval` argument from the [Evaluation configuration](../commands/create-forecast.md).  * `COVERAGE_INTERVAL`: The proportion of actual values that fall within the prediction interval. * `WINKLER_ALPHA`: [Winkler Score](https://otexts.com/fpp3/distaccuracy.html#winkler-score). |
| LOGS | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: <model_name>!SHOW_FEATURE_IMPORTANCE
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_feature_importance.md
section: SQL Classes
---

# <model_name>!SHOW_FEATURE_IMPORTANCE

Returns the relative feature importance for each feature used by the model. This method takes no arguments.

## Syntax

```sqlsyntax
<model_name>!SHOW_FEATURE_IMPORTANCE();
```

## Output

| Column | Type | Description |
| --- | --- | --- |
| `rank` | [INTEGER](../../../data-types-numeric.md) | The importance rank of a feature. |
| `feature` | [VARCHAR](../../../data-types-text.md) | The name of the feature used to train the model. |
| `score` | [FLOAT](../../../data-types-numeric.md) | The feature’s importance score: a value in [0, 1], with 0 being the lowest possible importance, and 1 the highest. |
| `feature_type` | [VARCHAR](../../../data-types-text.md) | The source of the feature. Currently this is always `user_provided`, which denotes feature data provided by the user. |

---
title: <model_name>!SHOW_GLOBAL_EVALUATION_METRICS
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_global_evaluation_metrics.md
section: SQL Classes
---

# <model_name>!SHOW_GLOBAL_EVALUATION_METRICS

Returns overall evaluation metrics for models where evaluation was enabled at instantiation. This method
takes no arguments. See [Metrics in show_global_evaluation_metrics](../../../../user-guide/ml-functions/classification.md).

## Output

| Column | Type | Description |
| --- | --- | --- |
| `dataset_type` | [VARCHAR](../../../data-types-text.md) | The name of the dataset used for metrics calculation, currently EVAL. |
| `average_type` | [VARCHAR](../../../data-types-text.md) | The method of aggregation used to calculate overall metrics from the individual class metrics, currently MACRO. |
| `error_metric` | [VARCHAR](../../../data-types-text.md) | The error metric name. Can include Precision, Recall, F1, etc. |
| `metric_value` | [FLOAT](../../../data-types-numeric.md) | The error metric value |
| `logs` | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_THRESHOLD_METRICS
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_threshold_metrics.md
section: SQL Classes
---

# <model_name>!SHOW_THRESHOLD_METRICS

Returns raw counts and metrics for a specific threshold for each class in models where evaluation was enabled at instantiation.
This method takes no arguments. See [Metrics in show_threshold_metrics](../../../../user-guide/ml-functions/classification.md).

## Output

| Column | Type | Description |
| --- | --- | --- |
| `dataset_type` | [VARCHAR](../../../data-types-text.md) | The name of the dataset used for metrics calculation, currently EVAL. |
| `class` | [VARCHAR](../../../data-types-text.md) | The predicted class. Each class has its own set of metrics, which are provided in multiple rows. |
| `threshold` | [FLOAT](../../../data-types-numeric.md) | Threshold used to generate predictions. |
| `precision` | [FLOAT](../../../data-types-numeric.md) | Precision for the given class. The ratio of true positives to the total predicted positives. |
| `recall` | [FLOAT](../../../data-types-numeric.md) | Recall for the given class. Also called “sensitivity.” The ratio of true positives to the total actual positives. |
| `f1` | [FLOAT](../../../data-types-numeric.md) | F1 score for the given class. |
| `tpr` | [FLOAT](../../../data-types-numeric.md) | True positive rate for the given class. |
| `fpr` | [FLOAT](../../../data-types-numeric.md) | False positive rate for the given class. |
| `tp` | [INTEGER](../../../data-types-numeric.md) | Total count of true positives in the given class. |
| `fp` | [INTEGER](../../../data-types-numeric.md) | Total count of false positives in the given class. |
| `tn` | [INTEGER](../../../data-types-numeric.md) | Total count of true negatives in the given class. |
| `fn` | [INTEGER](../../../data-types-numeric.md) | Total count of false negatives in the given class. |
| `accuracy` | [FLOAT](../../../data-types-numeric.md) | The accuracy (ratio of correct predictions, both positive and negative, to the total number of predictions) for the given class. |
| `support` | [INTEGER](../../../data-types-numeric.md) | The support (true positives plus false negatives) for the given class. |
| `logs` | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_TRAINING_LOGS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/methods/show_training_logs.md
section: SQL Classes
---

# <model_name>!SHOW_TRAINING_LOGS

Returns logs from model training. Output is non-NULL only when `'ON_ERROR' = 'SKIP'` is set in the training
`CONFIG_OBJECT`.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

```sqlsyntax
<model_name>!SHOW_TRAINING_LOGS();
```

## Returns

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series).  **Note:** Your single-series results may not have a SERIES column. [See recent change](../../../../release-notes/bcr-bundles/un-bundled/bcr-cortex-forecast-anomaly-detection-series-column.md). |
| LOGS | [OBJECT](../../../data-types-semistructured.md) | A log of errors encountered during training. The value for the key `Errors` is an array of training errors. If no errors were encountered, the LOGS column is NULL. |

## Examples

See [Detecting Anomalies](../../../../user-guide/ml-functions/anomaly-detection.md).

---
title: <model_name>!SHOW_TRAINING_LOGS
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/show_training_logs.md
section: SQL Classes
---

# <model_name>!SHOW_TRAINING_LOGS

Returns the logs generated during training, if available.

## Syntax

```sqlsyntax
<model_name>!SHOW_TRAINING_LOGS();
```

## Output

| Column | Type | Description |
| --- | --- | --- |
| `colname` | [VARCHAR](../../../data-types-text.md) | The column name that logs are reported for. |
| `logs` | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

---
title: <model_name>!SHOW_TRAINING_LOGS
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/methods/show_training_logs.md
section: SQL Classes
---

# <model_name>!SHOW_TRAINING_LOGS

Returns logs from model training. Output is non-NULL only when `'ON_ERROR' = 'SKIP'` is set in the training
`CONFIG_OBJECT`; otherwise the entire model fails to train.

If you need to select specific columns from the data returned by this method, you can call the method in the FROM clause of a
SELECT statement. See [Selecting columns from SQL class instance methods that return tabular data](../../../snowflake-db-classes.md).

## Syntax

```sqlsyntax
<model_name>!SHOW_TRAINING_LOGS();
```

## Output

| Column | Type | Description |
| --- | --- | --- |
| SERIES | [VARIANT](../../../data-types-semistructured.md) | Series value (NULL if model was trained with single time series). |
| LOGS | [OBJECT](../../../data-types-semistructured.md) | Object containing errors encountered during training. Currently the only key is `Errors`, an array of errors. If no errors were encountered, the logs object is NULL. |

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: account_root_budget!ACTIVATE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/activate.md
section: SQL Classes
---

# account_root_budget!ACTIVATE

Activate the account budget. You must activate the account budget in order
to use the [budgets](../../../../user-guide/budgets.md) feature.

See also:
:   [account_root_budget!DEACTIVATE](deactivate.md)

## Syntax

```sqlsyntax
CALL account_root_budget!ACTIVATE()
```

## Returns

```output
activated
```

## Access control requirements

Only a user with the ACCOUNTADMIN role or a role granted the following privileges can activate the account budget:

* Application role SNOWFLAKE.BUDGET_ADMIN
* [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* After the account budget is activated:

  + You must set the spending limit in order for the budget to start tracking credit usage.
  + You must [set up notifications for the budget](../../../../user-guide/budgets/notifications.md). If you do not set up notifications
    for the budget, no notifications will be sent out.
* This method is only available on the account budget. Custom budgets do not require activation.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Activate the account budget for your account:

```sqlexample
CALL snowflake.local.account_root_budget!ACTIVATE();
```

## Error messages

To troubleshoot issues with account budget activation, see [You can’t activate the account budget](../../../../user-guide/budgets/troubleshoot.md).

---
title: account_root_budget!DEACTIVATE
source: https://docs.snowflake.com/en/sql-reference/classes/budget/methods/deactivate.md
section: SQL Classes
---

# account_root_budget!DEACTIVATE

Deactivate the account [budget](../../../../user-guide/budgets.md).

See also:
:   [account_root_budget!ACTIVATE](activate.md)

## Syntax

```sqlsyntax
CALL account_root_budget!DEACTIVATE()
```

## Returns

```output
Deactivated!
```

## Access control requirements

The role used to call this method must be granted the following role and privilege:

* BUDGET_ADMIN [application role](../../../../user-guide/budgets.md)
* [Snowflake database role](../../../snowflake-db-roles.md) USAGE_VIEWER

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* After you deactivate the account budget, you can no longer create new custom budgets using Snowsight.
  However, you can continue to create custom budgets using SQL.
* This method is only available on the account budget. Custom budgets can’t be deactivated. They must be dropped using
  the [DROP BUDGET](../commands/drop-budget.md) command.
* Calling this method does not return the object. Because of this, you can’t use method chaining to call another method on the
  return value of this method. Instead, call each method in a separate SQL statement.

## Example

Deactivate the account budget for your account:

```sqlexample
CALL snowflake.local.account_root_budget!DEACTIVATE();
```

---
title: ALTER BUDGET
source: https://docs.snowflake.com/en/sql-reference/classes/budget/commands/alter-budget.md
section: SQL Classes
---

# ALTER BUDGET

*Fully qualified name*: SNOWFLAKE.CORE.BUDGET

Modifies the properties of a *custom* budget:

* Renames the budget.
* Sets or unsets a tag.
* Sets or unsets the comment.

See also:
:   [CREATE BUDGET](create-budget.md),
    [SHOW BUDGET](show-budget.md),
    [DROP BUDGET](drop-budget.md)

## Syntax

```sqlsyntax
ALTER SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name> RENAME TO <new_name>

ALTER SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name> SET COMMENT = '<string_literal>'

ALTER SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name> UNSET COMMENT

ALTER SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]

ALTER SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) of the budget.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

`SET ...`
:   Specifies one or more budget properties to be set.

    `COMMENT = 'string_literal'`
    :   Sets the comment of the budget. This can also be done using the [COMMENT](../../../sql/comment.md) command.

    `TAG tag_name = 'tag_value' [ , tag_name = 'tag_value' , ... ]`
    :   Specifies the [tag](../../../../user-guide/object-tagging/introduction.md) name and the tag string value.

        The tag value is always a string, and the maximum number of characters for the tag value is 256.

        For information about specifying tags in a statement, see [Tag quotas](../../../../user-guide/object-tagging/introduction.md).

`UNSET ...`
:   Specifies one (or more) properties and/or parameters to unset for the budget, which resets them to the defaults:

    * `COMMENT`
    * `TAG tag_name [ , tag_name ... ]`

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege / Role | Object | Notes |
| --- | --- | --- |
| ADMIN | Budget | The role used to modify the properties of a custom budget must be granted this [instance role](../../../../user-guide/budgets.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* You can only modify the properties for a *custom* budget.
* To refer to this class by its unqualified name, include the database and schema of the class in your
  [search path](../../../snowflake-db-classes.md).

## Examples

Set the tag `dept` for the budget `my_budget` in the current schema:

```sqlexample
ALTER SNOWFLAKE.CORE.BUDGET my_budget SET TAG dept = 'finance';
```

---
title: ALTER SNOWFLAKE.ML.CLASSIFICATION
source: https://docs.snowflake.com/en/sql-reference/classes/classification/commands/alter-classification.md
section: SQL Classes
---

# ALTER SNOWFLAKE.ML.CLASSIFICATION

You can change the name, description, and tags of a classification model object using forms of the ALTER command. Models
themselves are immutable and cannot be updated in place. To update a model, drop the existing model and train a new one.

See also:
:   [CREATE SNOWFLAKE.ML.CLASSIFICATION](create-classification.md)

## Syntax

Rename a model:

```sqlsyntax
ALTER SNOWFLAKE.ML.CLASSIFICATION [ IF EXISTS ] <name>
    RENAME TO '<new_model_name>';
```

Set or change a [tag](../../../../user-guide/object-tagging/introduction.md) value on a model:

```sqlsyntax
ALTER SNOWFLAKE.ML.CLASSIFICATION  [ IF EXISTS ] <name>
    SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ];
```

Set or change a model’s comment:

```sqlsyntax
ALTER SNOWFLAKE.ML.CLASSIFICATION [ IF EXISTS ] <name>
    SET COMMENT = '<string_literal>';
```

Remove a [tag](../../../../user-guide/object-tagging/introduction.md) from a model:

```sqlsyntax
ALTER SNOWFLAKE.ML.CLASSIFICATION [ IF EXISTS ] <name>
    UNSET TAG <tag_name> [ , <tag_name> ... ];
```

Remove a model’s comment:

```sqlsyntax
ALTER SNOWFLAKE.ML.CLASSIFICATION [ IF EXISTS ] <name>
    UNSET COMMENT;
```

---
title: ANOMALY_DETECTION (SNOWFLAKE.ML)
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly_detection.md
section: SQL Classes
---

# ANOMALY_DETECTION (SNOWFLAKE.ML)

Anomaly detection allows you to detect outliers in your time series data by using a machine learning algorithm. You use [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](anomaly-detection/commands/create-anomaly-detection.md) to create
and train a detection model, and then use the [<model_name>!DETECT_ANOMALIES](anomaly-detection/methods/detect_anomalies.md) method to detect anomalies.

> **Important:**
>
> **Legal notice.** This Snowflake ML function is powered by machine learning technology, which you, not Snowflake, determine when and how to use. Machine
> learning technology and results provided may be inaccurate, inappropriate, or biased.
> Snowflake provides you with the machine learning models that you can use within your own workflows. Decisions based on machine
> learning outputs, including those built into automatic pipelines, should have human oversight and review processes
> to ensure model-generated content is accurate.
> Snowflake provides algorithms (without any pretraining) and you’re responsible for the data that you provide the algorithm (for example, for training and inference) and the decisions you make using the resulting model’s output.
> Queries for this feature or function are treated as any
> other SQL query and may be considered [metadata](../metadata.md).
>
> **Metadata.** When you use Snowflake ML functions, Snowflake logs generic error messages returned by an ML
> function. These error logs help us troubleshoot issues that arise and improve these functions to serve you better.
>
> For further information, see [Snowflake AI Trust and Safety FAQ](https://www.snowflake.com/en/legal/snowflake-ai-trust-and-safety/).

## ANOMALY_DETECTION commands

* [CREATE SNOWFLAKE.ML.ANOMALY_DETECTION](anomaly-detection/commands/create-anomaly-detection.md)
* [DROP SNOWFLAKE.ML.ANOMALY_DETECTION](anomaly-detection/commands/drop-anomaly-detection.md)
* [SHOW SNOWFLAKE.ML.ANOMALY_DETECTION](anomaly-detection/commands/show-anomaly-detection.md)

## ANOMALY_DETECTION methods

* [<model_name>!DETECT_ANOMALIES](anomaly-detection/methods/detect_anomalies.md)
* [<model_name>!EXPLAIN_FEATURE_IMPORTANCE](anomaly-detection/methods/explain_feature_importance.md)
* [<model_name>!SHOW_EVALUATION_METRICS](anomaly-detection/methods/show_evaluation_metrics.md)
* [<model_name>!SHOW_TRAINING_LOGS](anomaly-detection/methods/show_training_logs.md)

---
title: ANOMALY_INSIGHTS (SNOWFLAKE.LOCAL)
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly_insights.md
section: SQL Classes
---

# ANOMALY_INSIGHTS (SNOWFLAKE.LOCAL)

The ANOMALY_INSIGHTS [class](../snowflake-db-classes.md) is used to identify and investigate [cost anomalies](../../user-guide/cost-anomalies.md).

Snowflake instantiates a single instance of the ANOMALY_INSIGHTS class. You never create an instance of the class.

## ANOMALY_INSIGHTS methods

* [ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS](anomaly-insights/methods/get_account_anomalies_in_credits.md)
* [ANOMALY_INSIGHTS!GET_ACCOUNT_NOTIFICATION_EMAILS](anomaly-insights/methods/get_account_notification_emails.md)
* [ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA](anomaly-insights/methods/get_daily_consumption_anomaly_data.md)
* [ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE](anomaly-insights/methods/get_hourly_consumption_by_service_type.md)
* [ANOMALY_INSIGHTS!GET_HOURLY_SPEND_FOR_ANOMALY](anomaly-insights/methods/get_hourly_spend_for_anomaly.md)
* [ANOMALY_INSIGHTS!GET_ORG_NOTIFICATION_EMAILS](anomaly-insights/methods/get_org_notification_emails.md)
* [ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION](anomaly-insights/methods/get_top_accounts_by_consumption.md)
* [ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE](anomaly-insights/methods/get_top_queries_from_warehouse.md)
* [ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE](anomaly-insights/methods/get_top_warehouses_on_date.md)
* [ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS](anomaly-insights/methods/set_account_notification_emails.md)
* [ANOMALY_INSIGHTS!SET_ORG_NOTIFICATION_EMAILS](anomaly-insights/methods/set_org_notification_emails.md)

---
title: ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_account_anomalies_in_credits.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS

Returns daily consumption for the current account, and identifies whether that consumption is considered a
[cost anomaly](../../../../user-guide/cost-anomalies.md).

> **Note:**
>
> This method returns consumption with credits as the unit of measure. If you want to return consumption in a currency instead, see
> [ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA](get_daily_consumption_anomaly_data.md).

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS(
  '<start_date>',
  '<end_date>' )
```

## Arguments

`'start_date'`
:   Specifies the beginning of the time period for which consumption data is returned.

    Data type: DATE

`'end_date'`
:   Specifies the end of the time period for which consumption data is returned.

    Data type: DATE

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Day in Coordinated Universal Time (UTC) when the consumption occurred. |
| CONSUMPTION | NUMBER (38,2) | Amount of consumption, measured in credits. |
| FORECASTED_CONSUMPTION | NUMBER (38,2) | Predicted consumption based on the anomaly-detecting algorithm, measured in credits. |
| UPPER_BOUND | NUMBER (38,2) | Predicted highest level of consumption based on the anomaly-detecting algorithm, measured in credits. Consumption levels above this value are considered an anomaly. |
| LOWER_BOUND | NUMBER (38,2) | Predicted lowest level of consumption based on the anomaly-detecting algorithm, measured in credits. Consumption levels below this value are considered an anomaly. |
| IS_ANOMALY | BOOLEAN | If `TRUE`, consumption was identified as a cost anomaly because it has gone outside the range of the upper and lower bound. |
| CURRENCY_TYPE | VARCHAR | Unit of measure for the consumption, which is always `CREDITS`. |
| ANOMALY_ID | VARCHAR | System-generated identifier. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Example

The following example identifies anomalies in the current account based on consumption between January 1, 2024, and March 31, 2024:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS(
  '2024-01-01', '2024-03-31');
```

---
title: ANOMALY_INSIGHTS!GET_ACCOUNT_NOTIFICATION_EMAILS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_account_notification_emails.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_ACCOUNT_NOTIFICATION_EMAILS

Returns the email addresses where notifications are sent when there is an [account-level cost anomaly](../../../../user-guide/cost-anomalies.md) in
the current account.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ACCOUNT_NOTIFICATION_EMAILS()
```

## Arguments

None.

## Output

Returns a table with the following column:

| Column name | Data type | Description |
| --- | --- | --- |
| EMAIL_LIST | VARCHAR | Comma-delimited list of email addresses where notifications are sent when there is a cost anomaly in the current account. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role

## Usage notes

This method retrieves the email notification list for the account in which it is called.

## Example

The following example returns the email addresses where notifications are sent.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ACCOUNT_NOTIFICATION_EMAILS();
```

---
title: ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_daily_consumption_anomaly_data.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA

Returns daily consumption for a specific account or the entire organization, and identifies whether that consumption is considered a
[cost anomaly](../../../../user-guide/cost-anomalies.md).

> **Note:**
>
> This method returns consumption with a currency as the unit of measure. If you want to return consumption in credits instead, see
> [ANOMALY_INSIGHTS!GET_ACCOUNT_ANOMALIES_IN_CREDITS](get_account_anomalies_in_credits.md).

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
  '<start_date>',
  '<end_date>',
  <account_name> )
```

## Arguments

`'start_date'`
:   Specifies the beginning of the time period for which consumption data is returned.

    Data type: DATE

`'end_date'`
:   Specifies the end of the time period for which consumption data is returned.

    Data type: DATE

`account_name`
:   Specifies an expression that determines the account(s) for which consumption data is returned. You can specify the following values:

    * `'account_name'`: Returns data for the specified account. You must specify the account name, not the account locator.
    * `CURRENT_ACCOUNT_NAME()`: Returns data for the current account.
    * `NULL`: Returns data for the entire organization, not a specific account.

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| USAGE_DATE | DATE | Day in UTC when the consumption occurred. |
| CONSUMPTION | NUMBER (38,2) | Amount of consumption measured in CURRENCY_TYPE. |
| FORECASTED_CONSUMPTION | NUMBER (38,2) | Predicted consumption based on the anomaly-detecting algorithm, measured in CURRENCY_TYPE. |
| UPPER_BOUND | NUMBER (38,2) | Predicted highest level of consumption based on the anomaly-detecting algorithm, measured in CURRENCY_TYPE. Consumption levels above this value are considered an anomaly. |
| LOWER_BOUND | NUMBER (38,2) | Predicted lowest level of consumption based on the anomaly-detecting algorithm, measured in CURRENCY_TYPE. Consumption levels below this value are considered an anomaly. |
| IS_ANOMALY | BOOLEAN | If true, consumption has been identified as a cost anomaly because it has gone outside the range of the upper and lower bound. |
| CURRENCY_TYPE | VARCHAR | Unit of measure for the consumption. For information about why the unit of measure is credits or a currency, see [Unit of measure for cost data](../../../../user-guide/cost-anomalies.md). |
| ANOMALY_ID | VARCHAR | System-generated identifier. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* ORGANIZATION_BILLING_VIEWER application role in the organization account
* SNOWFLAKE.APP_ORGANIZATION_BILLING_VIEWER application role in an ORGADMIN-enabled account

## Usage notes

To return data for a different account or the entire organization, you must execute this method from the
[organization account](../../../../user-guide/organization-accounts.md) or an
[ORGADMIN-enabled account](../../../../user-guide/organization-administrators.md).

## Example

Identify organization-level anomalies based on consumption between January 1, 2024, and March 31, 2024:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
  '2024-01-01', '2024-03-31', NULL);
```

Identify anomalies in the current account based on consumption between January 1, 2024, and March 31, 2024:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
  '2024-01-01', '2024-03-31', current_account_name());
```

Identify anomalies in the account `prod_acct1` based on consumption between January 1, 2024, and March 31, 2024:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_DAILY_CONSUMPTION_ANOMALY_DATA(
  '2024-01-01', '2024-03-31', 'prod_acct1');
```

---
title: ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_hourly_consumption_by_service_type.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE

Returns the hourly consumption in the current account on a specific day, broken down by service type. You can optionally limit the results
to the service types that are consuming the most credits on that day.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE(
  '<date>',
  <number_of_types> )
```

## Arguments

`'date'`
:   Specifies the day for which you want to return consumption data.

    Data type: DATE

`number_of_types`
:   Specifies the number of service types to return, ranked by total consumption on the specified day.

    To return all service types that consumed credits on the specified day, specify `NULL`.

    Data type: NUMBER

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| HOUR | NUMBER | A number between 0 and 23 (inclusive) that specifies the hour of the day during which consumption occurred. |
| SERVICE_TYPE | VARCHAR | The service type that consumed the credits (for example, `AI_SERVICES`). |
| CREDITS | NUMBER | The number of credits consumed by the service type during the specified hour. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Usage notes

* A day is defined by a 24-hour period in UTC. This might differ from a user’s local time zone.
* This method returns consumption data for the current account. It cannot be used to return data for other accounts or the entire
  organization.
* This method returns credits consumed for the account (not currency).
* The top service types are determined by total consumption across the entire day. After the top service types are identified, the
  method returns the hourly consumption for each of those service types.

## Examples

Return the two service types that consumed the most credits on January 15, 2026. The output includes the hourly consumption for each service
type.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE(
  '2026-01-15', 2);
```

Return the hourly consumption on January 15, 2026, for all service types that had consumption on that day:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE(
  '2026-01-15',
  NULL);
```

---
title: ANOMALY_INSIGHTS!GET_HOURLY_SPEND_FOR_ANOMALY
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_hourly_spend_for_anomaly.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_HOURLY_SPEND_FOR_ANOMALY

Returns the hourly consumption in the current account on a specific day.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_SPEND_FOR_ANOMALY(
  '<date>' )
```

## Arguments

`'date'`
:   Specifies the day for which you want to return consumption data.

    Data type: DATE

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| HOUR | INTEGER | Specifies the hour of the day during which consumption occurred. |
| CONSUMPTION | NUMBER | Specifies the amount of consumption during the hour in credits. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Usage notes

* A day is defined by a 24-hour period in UTC. This might differ from a user’s local time zone.
* This method returns consumption data for the current account. It cannot be used to return data for other accounts or the entire
  organization.
* This method returns credits consumed for the account (not currency).

## Example

The following example returns the hourly consumption on October 17, 2024.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_HOURLY_SPEND_FOR_ANOMALY('2024-10-17');
```

---
title: ANOMALY_INSIGHTS!GET_ORG_NOTIFICATION_EMAILS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_org_notification_emails.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_ORG_NOTIFICATION_EMAILS

Returns the email addresses where notifications are sent when there is an [organization-level cost anomaly](../../../../user-guide/cost-anomalies.md).
An organization-level anomaly occurs when the aggregate consumption for all accounts falls outside an expected range.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ORG_NOTIFICATION_EMAILS()
```

## Arguments

None.

## Output

Returns a table with the following column:

| Column name | Data type | Description |
| --- | --- | --- |
| EMAIL_LIST | VARCHAR | Comma-delimited list of email addresses where notifications are sent when there is an organization-level cost anomaly. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.ORGANIZATION_BILLING_VIEWER application role in the organization account
* SNOWFLAKE.APP_ORGANIZATION_BILLING_VIEWER application role in an ORGADMIN-enabled account

## Example

The following example returns the list of email addresses that are notified when there is an organization-level cost anomaly.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_ORG_NOTIFICATION_EMAILS();
```

---
title: ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_top_accounts_by_consumption.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION

Returns accounts with the highest absolute change in consumption between a given date and the previous date. Helps investigate
[organization-level cost anomalies](../../../../user-guide/cost-anomalies.md).

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION(
  '<date>',
  <number_of_accounts> )
```

## Arguments

`'date'`
:   Specifies the date for which you want to return consumption data.

    Data type: DATE

`number_of_accounts`
:   Limits the number of accounts returned by the method. For example, if you specify `5`, the method returns only the top five accounts in
    the organization in terms of change in consumption.

    Data type: NUMBER

## Output

Returns a table with the following columns. Results are ordered by largest daily change in absolute value.

| Column name | Data type | Description |
| --- | --- | --- |
| ACCOUNT_NAME | VARCHAR | Name of the account where consumption occurred. |
| CONSUMPTION | NUMBER | Amount of consumption measured in CURRENCY. |
| CURRENCY | VARCHAR | Unit of measure for the consumption. For information about why the unit of measure is credits or a currency, see [Unit of measure for cost data](../../../../user-guide/cost-anomalies.md). |
| COST_CHANGE | NUMBER | Difference between consumption on the specified day and the previous day. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Usage notes

You must call this method from the [organization account](../../../../user-guide/organization-accounts.md) or an
[ORGADMIN-enabled account](../../../../user-guide/organization-administrators.md).

## Example

The following example returns the top seven accounts in terms of change in consumption when comparing December 16, 2024, and December 17,
2024.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_ACCOUNTS_BY_CONSUMPTION('2024-12-17', 7);
```

---
title: ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_top_queries_from_warehouse.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE

Returns the queries in a warehouse that consumed the most credits. Helps investigate
[account-level cost anomalies](../../../../user-guide/cost-anomalies.md) in the current account.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE(
  <warehouse_id>,
  '<date>',
  <number_of_queries> )
```

## Arguments

`warehouse_id`
:   Specifies the internal/system-generated identifier for the warehouse that ran the queries.

    You can find the warehouse ID by calling the [ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE](get_top_warehouses_on_date.md) method or querying the
    [WAREHOUSE_METERING_HISTORY view](../../../account-usage/warehouse_metering_history.md).

    Data type: NUMBER

`'date'`
:   Specifies the date for which you want to return consumption data.

    Data type: DATE

`number_of_queries`
:   Limits the number of queries returned by the method. For example, if you specify `5`, the method returns only the top five queries in
    terms of credits consumed.

    Data type: NUMBER

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse used to execute the query. |
| CONSUMPTION | NUMBER | Credits consumed by the query. |
| USERNAME | VARCHAR | User who executed the query. |
| QUERY_ID | VARCHAR | Query ID. |
| DURATION_MS | NUMBER | How long it took the query to execute, in milliseconds. |
| START_TIME | DATETIME | Date and time the user started executing the query. |
| QUERY_TAG | VARCHAR | Query tag, if any, applied to the query. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Usage notes

* This method returns consumption data for the current account. It cannot be used to return data for other accounts or the entire
  organization.
* You cannot use this method to return a currency as the unit of measure for the consumption.

## Example

Returns the top six queries that consumed the most credits on December 1, 2024, using a warehouse whose Warehouse ID is `838`.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_QUERIES_FROM_WAREHOUSE(838, '2024-12-01', 6);
```

---
title: ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/get_top_warehouses_on_date.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE

Returns warehouses with the highest change in consumption for a given date, determined by comparing the specified day with the previous day.
Helps investigate account-level and organization-level [cost anomalies](../../../../user-guide/cost-anomalies.md).

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
  '<date>',
  <number_of_warehouses>,
  <account_name> )
```

## Arguments

`'date'`
:   Specifies the date for which you want to return consumption data.

    Data type: DATE

`number_of_warehouses`
:   Limits the number of warehouses returned by the method. For example, if you specify `5`, the method returns only the top five warehouses
    in terms of change in consumption.

    Data type: NUMBER

`account_name`
:   Specifies an expression that determines the account(s) for which consumption data is returned. You can specify the following values:

    * `'account_name'`: Returns warehouse data for the specified account. You must specify the account name, not the account locator.
    * `CURRENT_ACCOUNT_NAME()`: Returns warehouse data for the current account.
    * `NULL`: Returns warehouse data for the entire organization, not a specific account.

## Output

Returns a table with the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| WAREHOUSE_NAME | VARCHAR | Name of the warehouse. |
| WAREHOUSE_ID | NUMBER | System-generated identifier of the warehouse. |
| CONSUMPTION | NUMBER (38,9) | Amount of consumption on the specified day in credits. |
| COST_CHANGE | NUMBER (38,9) | Difference between consumption on the specified day and the previous day. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role
* SNOWFLAKE.APP_USAGE_VIEWER application role

## Usage notes

* To return data for a different account or the entire organization, you must execute this method from the
  [organization account](../../../../user-guide/organization-accounts.md) or an
  [ORGADMIN-enabled account](../../../../user-guide/organization-administrators.md).
* You cannot use this method to return a currency as the unit of measure for the consumption.

## Example

Returns the top six warehouses in the organization in terms of change in consumption when comparing August 9, 2024, and August 10, 2024.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE('2024-08-10', 6, NULL);
```

Returns the top five warehouses in the current account in terms of change in consumption when comparing December 8, 2024, and December 9,
2024.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
  '2024-12-09', 5, CURRENT_ACCOUNT_NAME());
```

Returns the top three warehouses in the account `my_acct` in terms of change in consumption when comparing November 8, 2024, and November 9,
2024.

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!GET_TOP_WAREHOUSES_ON_DATE(
  '2024-11-09', 5, 'my_acct');
```

---
title: ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/set_account_notification_emails.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS

Defines the list of email addresses that will receive a notification when there is an
[account-level cost anomaly](../../../../user-guide/cost-anomalies.md) in the current account.

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS(
  '<email_address> [, <email_address> ... ]' )
```

## Arguments

`'email_address [, email_address ... ]'`
:   Comma-delimited list of email addresses that will receive a notification when there is an account-level cost anomaly.

    Each email address must have been [verified by the user](../../../../user-guide/ui-snowsight-profile.md), otherwise it is ignored.

## Output

Returns a table with the following column:

| Column name | Data type | Description |
| --- | --- | --- |
| EMAIL_LIST | VARCHAR | Comma-delimited list of email addresses where notifications are sent when there is an account-level cost anomaly in the current account. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.APP_USAGE_ADMIN application role

## Usage notes

* This method sets the email notification list for the account in which it is called.
* Executing this method overwrites email addresses that were previously added to the notification list.
* Each email address must have been [verified by the user](../../../../user-guide/ui-snowsight-profile.md).
* You can use a group email address, such as a distribution list, for notifications, but this email address must be verified. Before adding
  a group email address to the notification list, you might need to create a new Snowflake user with the group email address so you can
  verify it.

## Example

Set the email notification list so that users with email addresses `user1@example.com` and `user2@example.com` receive a notification
when there is a cost anomaly in the current account:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS(
  'user1@example.com, user2@example.com');
```

---
title: ANOMALY_INSIGHTS!SET_ORG_NOTIFICATION_EMAILS
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-insights/methods/set_org_notification_emails.md
section: SQL Classes
---

# ANOMALY_INSIGHTS!SET_ORG_NOTIFICATION_EMAILS

Defines the list of email addresses that will receive a notification when there is an
[organization-level cost anomaly](../../../../user-guide/cost-anomalies.md).

An organization-level anomaly occurs when the aggregate consumption all of accounts falls outside an expected range. If you want to define
a list of email addresses that will receive a notification when a specific account has a cost anomaly, see
[ANOMALY_INSIGHTS!SET_ACCOUNT_NOTIFICATION_EMAILS](set_account_notification_emails.md).

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

## Syntax

```sqlsyntax
SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!SET_ORG_NOTIFICATION_EMAILS(
  '<email_address> [, <email_address> ... ]' )
```

## Arguments

`'email_address [, email_address ... ]'`
:   Comma-delimited list of email addresses that will receive a notification when there is an organization-level cost anomaly.

## Output

Returns a table with the following column:

| Column name | Data type | Description |
| --- | --- | --- |
| EMAIL_LIST | VARCHAR | Comma-delimited list of email addresses where notifications are sent when there is an organization-level cost anomaly. |

## Access control requirements

Users with any of the following roles can call this method:

* ACCOUNTADMIN system role
* GLOBALORGADMIN system role
* SNOWFLAKE.ORGANIZATION_BILLING_VIEWER application role in the organization account
* SNOWFLAKE.APP_ORGANIZATION_BILLING_VIEWER application role in an ORGADMIN-enabled account

## Usage notes

* Each email address must have been [verified by the user](../../../../user-guide/ui-snowsight-profile.md).
* You can use a group email address, such as a distribution list, for notifications, but this email address must be verified. Before adding
  a group email address to the notification list, you might need to create a new Snowflake user with the group email address so you can
  verify it.

## Example

Set the email notification list so that users with email addresses `user1@example.com` and `user2@example.com` receive a notification
when there is an organization-level cost anomaly:

```sqlexample
CALL SNOWFLAKE.LOCAL.ANOMALY_INSIGHTS!SET_ORG_NOTIFICATION_EMAILS(
  'user1@example.com, user2@example.com');
```

---
title: CREATE BUDGET
source: https://docs.snowflake.com/en/sql-reference/classes/budget/commands/create-budget.md
section: SQL Classes
---

# CREATE BUDGET

*Fully qualified name*: SNOWFLAKE.CORE.BUDGET

Creates a new budget instance or replaces and existing budget instance in the current or
specified schema.

See also:
:   [ALTER BUDGET](alter-budget.md),
    [SHOW BUDGET](show-budget.md),
    [DROP BUDGET](drop-budget.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.CORE.BUDGET [ IF NOT EXISTS ] <name> ()
  [ [ WITH ] COMMENT = '<string_literal>' ]
```

## Parameters

`name`:
:   Specifies the identifier for the budget. The identifier must start with an alphabetic character and cannot contain spaces or
    special characters unless the identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double
    quotes are also case-sensitive.

    For more details, refer to [Identifier requirements](../../../identifiers-syntax.md).

## Optional parameters

`COMMENT = 'string_literal'`:
:   Specifies a comment for the budget.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege / Role | Object | Notes |
| --- | --- | --- |
| CREATE SNOWFLAKE.CORE.BUDGET | Schema | The role used to create a budget must be granted this privilege on the schema in which the budget is created. |
| SNOWFLAKE.BUDGET_CREATOR | Role | The role used to create a budget must be granted this [database role](../../../../user-guide/security-access-control-considerations.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* To refer to this class by its unqualified name, include the database and schema of the class in your
  [search path](../../../snowflake-db-classes.md).
* [Replication](../../../../user-guide/account-replication-intro.md) is supported only for instances
  of the [CUSTOM_CLASSIFIER](../../custom_classifier.md) class.
* An account can contain a maximum of 100 custom budgets.

## Examples

Create budget `my_budget` in the current schema:

```sqlexample
CREATE SNOWFLAKE.CORE.BUDGET my_budget();
```

---
title: CREATE CLASSIFICATION_PROFILE
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/commands/create-classification-profile.md
section: SQL Classes
---

# CREATE CLASSIFICATION_PROFILE

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE

Creates a new instance of the CLASSIFICATION_PROFILE class or replaces an existing instance of the CLASSIFICATION_PROFILE class in the
current or specified schema.

> **Important:**
>
> If you execute a CREATE OR REPLACE command, the classification profile is removed from all databases and schemas, which turns off
> automatic classification.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  [ IF NOT EXISTS ] <classification_profile_name> (  <config_object> )
```

## Parameters

`classification_profile_name`
:   Specifies the identifier (name) for the instance of the CLASSIFICATION_PROFILE class; must be unique for the schema in which the object
    is created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

## Constructor arguments

`config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure sensitive data classification.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `minimum_object_age_for_` `classification_days` | INTEGER |  | Required: Specifies the minimum number of days an object must exist in order to be classified.  To classify objects immediately, specify `0`. |
    | `maximum_classification_` `validity_days` | INTEGER |  | Optional: Specifies the number of days since the last classification event before a table is classified again using automatic classification.  Specify this value to ensure that tables are reclassified. If you omit this key, objects are never reclassified.  The value must be greater than or equal to `1`. |
    | `snowflake_semantic_` `categories` | ARRAY |  | Optional: Specifies a list of Snowflake [native semantic categories](../../../../user-guide/classify-native.md) (types of data) and optional countries to use for classification. Snowflake identifies data as sensitive only if the data is classified as belonging to the specified categories (and countries, if provided). The array can contain the following keys:   * `category` — Required string that specifies a native semantic category. * `country_codes` — Optional array that specifies two-letter country codes. Snowflake identifies data as belonging to a category   only if a semantic subcategory exists for the specified country.  To determine if a semantic subcategory exists for a country and obtain the two-letter code for a country, see   [native semantic categories](../../../../user-guide/classify-native.md).   See [Classify data using a subset of native semantic categories](../../../../user-guide/classify-auto.md). |
    | `auto_tag` | BOOLEAN | TRUE | Optional: When `TRUE`, sets the recommended classification system tags on the columns in the specified object when the classification process is complete.  When `FALSE`, automatic tagging does not occur. |
    | `tag_map` | OBJECT |  | Optional: Maps one or more user-defined tags to the SEMANTIC_CATEGORY system tag.  See Tag map. |
    | `custom_classifiers` | OBJECT |  | Optional: Specifies [custom classifiers](../../../../user-guide/classify-custom.md) that are used when automatically classifying data.  Each key in the object specifies the name of an instance of the [CUSTOM_CLASSIFIER class](../../custom_classifier.md).  The value of each key specifies the [custom_classifier!LIST](../../custom_classifier/methods/list.md) method of the custom classifier instance. |
    | `enable_tag_based_` `sensitive_data_exclusion` | BOOLEAN | FALSE | Optional: When `TRUE`, objects tagged with the SNOWFLAKE.CORE.SKIP_SENSITIVE_DATA_CLASSIFICATION system tag are excluded from sensitive data classification.  When `FALSE`, tag-based sensitive data exclusion is disabled and all objects are classified regardless of system tags.  For more information, see [Excluding data from sensitive data classification](../../../../user-guide/classify-auto-exclude.md). |
    | `classify_views` | BOOLEAN | FALSE | Optional: When `FALSE`, views are excluded from sensitive data classification. When `TRUE`, views are automatically classified along with tables. |

### Tag map

An [OBJECT](../../../data-types-semistructured.md) that maps one or more user-defined tags to the SEMANTIC_CATEGORY system tag.

`'column_tag_map': [ ... ]`
:   An array of objects that have the following key-value pairs:

    `'tag_name': 'string'`
    :   The fully qualified name of the tag.

        For more information, see [Identifier requirements](../../../identifiers-syntax.md).

    `'tag_value':'string'`
    :   The string value of the tag.

        Optional: If not specified, you must also omit the `semantic_categories` key. If omitted, the `tag_name` tag is applied to
        every column to which the SEMANTIC_CATEGORY system tag is applied, and the value of the user-defined tag will match the value of the
        SEMANTIC_CATEGORY tag.

    `'semantic_categories': [ 'category' [ , 'category' ... ] ]`
    :   A comma-separated list of [native categories](../../../../user-guide/classify-native.md). The `tag_name` user-defined tag is mapped to
        instances where the value of the SEMANTIC_CATEGORY tag is one of the specified native categories.

        Optional: If not specified, you must also omit the `tag_value` key. If omitted, the `tag_name` tag is applied to every
        column to which the system SEMANTIC_CATEGORY tag is applied, and the value of the user-defined tag will match the value of the
        SEMANTIC_CATEGORY tag.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege/role | Object |
| --- | --- |
| CLASSIFICATION_ADMIN database role | n/a |
| CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE privilege | Schema |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Methods

You can call the following methods on the instance of the CLASSIFICATION_PROFILE class that you create:

* [<classification_profile_name>!DESCRIBE](../methods/describe.md)
* [<classification_profile_name>!SET_AUTO_TAG](../methods/set_auto_tag.md)
* [<classification_profile_name>!SET_CLASSIFY_VIEWS](../methods/set_classify_views.md)
* [<classification_profile_name>!SET_CUSTOM_CLASSIFIERS](../methods/set_custom_classifiers.md)
* [<classification_profile_name>!SET_ENABLE_TAG_BASED_SENSITIVE_DATA_EXCLUSION](../methods/set_enable_tag_based_sensitive_data_exclusion.md)
* [<classification_profile_name>!SET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS](../methods/set_maximum_classification_validity_days.md)
* [<classification_profile_name>!SET_MINIMUM_OBJECT_AGE_FOR_CLASSIFICATION_DAYS](../methods/set_minimum_object_age_for_classification_days.md)
* [<classification_profile_name>!SET_SNOWFLAKE_SEMANTIC_CATEGORIES](../methods/set_snowflake_semantic_categories.md)
* [<classification_profile_name>!SET_TAG_MAP](../methods/set_tag_map.md)
* [<classification_profile_name>!UNSET_CUSTOM_CLASSIFIERS](../methods/unset_custom_classifiers.md)
* [<classification_profile_name>!UNSET_MAXIMUM_CLASSIFICATION_VALIDITY_DAYS](../methods/unset_maximum_classification_validity_days.md)
* [<classification_profile_name>!UNSET_SNOWFLAKE_SEMANTIC_CATEGORIES](../methods/unset_snowflake_semantic_categories.md)
* [<classification_profile_name>!UNSET_TAG_MAP](../methods/unset_tag_map.md)

## Usage notes

* Executing a CREATE OR REPLACE command removes the classification profile from all databases and schemas, which turns off automatic
  classification.
* To refer to this class by its unqualified name, include the database and schema of the class in your
  [search path](../../../snowflake-db-classes.md).
* If the same tag and semantic category is mapped to two different values, then the order of the objects in the `column_tag_map`
  determines the tag and string value to set on a column. Order the `column_tag_map` arrays from highest preference to lowest
  preference.

## Examples

Create an instance and specify basic criteria to automatically classify tables in a database:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days': 0,
      'maximum_classification_validity_days': 30,
      'auto_tag': true,
      'classify_views': false
    });
```

Create an instance and specify the tag mapping to a single tag:

```sqlexample
CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE my_classification_profile(
  {
    'minimum_object_age_for_classification_days':0,
    'auto_tag':true,
    'tag_map':{
      'column_tag_map':[
        {
          'tag_name':'tag_db.sch.pii'
        }
      ]
    }
  }
);
```

Create an instance and specify the tag mapping to different tag values:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  my_classification_profile(
    {
      'minimum_object_age_for_classification_days':0,
      'auto_tag':true,
      'tag_map': {
        'column_tag_map':[
          {
            'tag_name':'test_ac_db.test_ac_schema.pii',
            'tag_value':'important',
            'semantic_categories':['NAME']
          },
          {
            'tag_name':'test_ac_db.test_ac_schema.pii',
            'tag_value':'pii',
            'semantic_categories':['EMAIL','NATIONAL_IDENTIFIER']
          }
        ]
      }
    }
  );
```

Create an instance and specify custom classifiers for the sensitive data classification process:

```sqlexample
CREATE SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE my_classification_profile(
  {
    'minimum_object_age_for_classification_days':0,
    'auto_tag':true,
    'custom_classifiers': {
      'medical_codes': medical_codes!list(),
      'finance_codes': finance_codes!list()
    }
  }
);
```

---
title: CREATE CUSTOM_CLASSIFIER
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/commands/create-custom-classifier.md
section: SQL Classes
---

# CREATE CUSTOM_CLASSIFIER

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

Creates a new custom classification instance or replaces an existing custom classification instance in the current or specified schema.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER
[ IF NOT EXISTS ] <custom_classifier_name>()
```

## Parameters

`custom_classifier_name()`
:   Specifies the identifier (name) for the instance; the name must be unique for the schema in which the object is created. You must add the
    parentheses at the end of the identifier when creating the object.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

## Arguments

None.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Database role | Object | Notes |
| --- | --- | --- |
| CLASSIFICATION_ADMIN | Database role | The account role that creates the object must be granted this database role.  This database role exists in the shared SNOWFLAKE database. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Methods

You can call the following methods on the custom classification instance that you create:

* [custom_classifier!ADD_REGEX](../methods/add_regex.md)
* [custom_classifier!DELETE_CATEGORY](../methods/delete_category.md)
* [custom_classifier!LIST](../methods/list.md)

## Usage notes

SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER is a name that Snowflake defines and maintains. Use this object name every time you want to create
an instance of this class. Alternatively, update your [search path](../../../snowflake-db-classes.md) to make it easier to use the instance.

## Examples

Create a custom classifier named `medical_codes`:

```sqlexample
CREATE OR REPLACE SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER internal_ids();
```

---
title: CREATE SNOWFLAKE.ML.ANOMALY_DETECTION
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/commands/create-anomaly-detection.md
section: SQL Classes
---

# CREATE SNOWFLAKE.ML.ANOMALY_DETECTION

Creates a new anomaly detection model or replaces an existing one using
the training data you provide.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.ML.ANOMALY_DETECTION <model_name>(
  INPUT_DATA => <reference_to_training_data>,
  [ SERIES_COLNAME => '<series_column_name>', ]
  TIMESTAMP_COLNAME => '<timestamp_column_name>',
  TARGET_COLNAME => '<target_column_name>',
  LABEL_COLNAME => '<label_column_name>',
  [ CONFIG_OBJECT => <config_object> ]
)
[ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
[ COMMENT = '<string_literal>' ]
```

## Parameters

`model_name`
:   Specifies the identifier (*model_name*) for the anomaly detector object; must be unique for the schema in which the object is
    created.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
    entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
    case-sensitive. For more details, see [Identifier requirements](../../../identifiers-syntax.md).

## Constructor arguments

**Required:**

`INPUT_DATA => reference_to_training_data`
:   Specifies a [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to the table, view, or query that returns
    the training data for the model.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

`TIMESTAMP_COLNAME => 'timestamp_column_name'`
:   Specifies the name of the column containing the timestamps (TIMESTAMP_NTZ) in the time series data.

`TARGET_COLNAME => 'target_column_name'`
:   Specifies the name of the column containing the data (NUMERIC or FLOAT) to analyze.

`LABEL_COLNAME => 'label_column_name'`
:   Specifies the name of the column containing the labels for the data. Labels are Boolean (true/false) values indicating
    whether a given row is a known anomaly. If you do not have labeled data, pass an empty string (`''`) for this argument.

**Optional:**

`SERIES_COLNAME => 'series_column_name'`
:   Name of the column containing the identifier for the series (for multi-series data). This column should be a
    VARIANT because it can be any kind of value or a combination of values from more than one column in an array.

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the model training job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `aggregation_categorical` | [STRING](../../../data-types-text.md) | `'MODE'` | The aggregation method for categorical features. Supported values are:   * `'MODE'`: The most frequent value. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `aggregation_numeric` | [STRING](../../../data-types-text.md) | `'MEAN'` | The aggregation method for numeric features. Supported values are:   * `'MEAN'`: The average of the values. * `'MEDIAN'`: The middle value. * `MODE`: The most frequent value. * `'MIN'`: The smallest value. * `'MAX'`: The largest value. * `'SUM'`: The total of the values. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `aggregation_target` | [STRING](../../../data-types-text.md) | Same as `aggregation_numeric`, or `'MEAN'` if not specified | The aggregation method for the target value. Supported values are:   * `'MEAN'`: The average of the values. * `'MEDIAN'`: The middle value. * `MODE`: The most frequent value. * `'MIN'`: The smallest value. * `'MAX'`: The largest value. * `'SUM'`: The total of the values. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `evaluate` | [BOOLEAN](../../../data-types-logical.md) | TRUE | Whether evaluation metrics should be generated. If TRUE, additional models are trained for cross-validation using the parameters in the `evaluation_config`. |
    | `evaluation_config` | [OBJECT](../../../data-types-semistructured.md) | See Evaluation configuration. | An optional config object to specify how out-of-sample evaluation metrics should be generated. See next section. |
    | `frequency` | [STRING](../../../data-types-text.md) | n/a | The frequency of the time series. If not specified, the model infers the frequency. The value must be a string representing a time period, such as `'1 day'`. Supported units include seconds, minutes, hours, days, weeks, months, quarters, and years. You may use singular (“hour”) or plural (“hours”) for the interval name, but may not abbreviate. |
    | `lower_bound` | [FLOAT](../../../data-types-numeric.md) or NULL | NULL | The lower bound for the target value. If specified, the model will not predict values below this threshold. |
    | `upper_bound` | [FLOAT](../../../data-types-numeric.md) or NULL | NULL | The upper bound for the target value. If specified, the model will not predict values above this threshold. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) that specifies the error handling method for training. This is most useful when training multiple series. Supported values are:   * `'abort'`: Abort training if an error is encountered in any time series. * `'skip'`: Skip any time series where training encounters an error. This allows training to succeed for other time series.   To see which series failed during model training, call the model’s [<model_name>!SHOW_TRAINING_LOGS](../methods/show_training_logs.md) method. |

## Evaluation configuration

The `evaluation_config` object contains key-value pairs that configure cross-validation. These parameters are from the scikit-learn
[TimeSeriesSplit](https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.TimeSeriesSplit.html)
cross-validator.

| Key | Type | Default | Description |
| --- | --- | --- | --- |
| `n_splits` | [INTEGER](../../../data-types-numeric.md) | 5 | Number of splits. |
| `max_train_size` | [INTEGER](../../../data-types-numeric.md) or NULL (no maximum). | NULL | Maximum size for a single training set. |
| `test_size` | [INTEGER](../../../data-types-numeric.md) or NULL. | NULL | Used to limit the size of the test set. |
| `gap` | [INTEGER](../../../data-types-numeric.md) | 0 | Number of samples to exclude from the end of each training set before the test set. |
| `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.95 | The prediction interval used in calculating interval metrics. |

## Usage notes

* If the column names specified by the TIMESTAMP_COLNAME, TARGET_COLNAME, or LABEL_COLNAME arguments do not exist in the table,
  view, or query specified by the INPUT_DATA argument, an error occurs.
* [Replication](../../../../user-guide/account-replication-intro.md) is supported only for instances
  of the [CUSTOM_CLASSIFIER](../../custom_classifier.md) class.

## Examples

For a representative example, see the [anomaly detection example](../../../../user-guide/ml-functions/anomaly-detection.md).

---
title: CREATE SNOWFLAKE.ML.CLASSIFICATION
source: https://docs.snowflake.com/en/sql-reference/classes/classification/commands/create-classification.md
section: SQL Classes
---

# CREATE SNOWFLAKE.ML.CLASSIFICATION

Creates a new classification model or replaces an existing model in the current or specified schema.

See also:
:   [DROP SNOWFLAKE.ML.CLASSIFICATION](drop-classification.md)

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.ML.CLASSIFICATION [ IF NOT EXISTS ] <model_name> (
    INPUT_DATA => <input_data>,
    TARGET_COLNAME => '<target_colname>',
    [CONFIG_OBJECT => <config_object>],
)
[ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
[ COMMENT = '<string_literal>' ]
```

## Parameters

*Required*

`input_data`
:   A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to the training data.
    Using a reference allows the training process, which runs with limited privileges, to use your active role’s
    privileges to access the data. You can use a reference to a table or a view if your data is already in that form, or
    you can use a [query reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to provide the query to be executed
    to obtain the data.

    INPUT_DATA must contain the entire training data to be consumed by the classification model. Any columns that are
    not named in the TARGET_COLNAME arguments are considered training variables (features). The order of the columns in
    the input data is not important.

    Feature columns must be STRING, NUMERIC, or BOOLEAN. STRING and BOOLEAN columns are treated as categorical features,
    while NUMERIC columns are considered continuous features. To treat a numeric column as categorical, cast it to STRING.

`target_colname`
:   Name of the column containing the label (target value) for each row in the training data. The target column may be
    BOOLEAN, NUMERIC, or STRING.

*Optional*

`config_object`
:   An [OBJECT](../../../data-types-semistructured.md) whose key-value pairs specify additional training options.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | evaluate | [BOOLEAN](../../../data-types-logical.md) | TRUE | Whether evaluation metrics should be generated. If TRUE, then additional model is trained for evaluation using the parameters in the `evaluation_config`. |
    | on_error | STRING | ‘ABORT’ | String constant that specifies the error handling method for the model training task. Supported values are:   * `'ABORT'`: Abort the entire training operation if any row results in an error. * `'SKIP'`: Skip rows that result in an error. The error is shown instead of the results. |
    | evaluation_config | [OBJECT](../../../data-types-semistructured.md) | NULL | A optional configuration object to specify how out-of-sample evaluation metrics should be generated. Currently, there is only one such option.   * `test_fraction` (FLOAT): The fraction of the dataset that should be used as test (evaluation) data.   If evaluation configuration is not specified, the default behavior is to try to include a minimum of 500 instances of the minority class in the evaluation set and to limit the total test fraction of 20% of the dataset. This approach maintains balance in model evaluation and training, particularly for minority classes. |

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege / Role | Object | Notes |
| --- | --- | --- |
| CREATE SNOWFLAKE.ML.CLASSIFICATION | Schema | The role used to create a budget must be granted this privilege on the schema in which the budget is created. |
| OWNERSHIP | Schema | A role must be granted or inherit the OWNERSHIP privilege on the object to create a temporary object that has the same name as the object that already exists in the schema. |
| `model_name`!mladmin | SNOWFLAKE.ML.CLASSIFICATION instance | This role, scoped to the model itself, is initially granted to the owner, who can grant it to others to allow them to call all of the model’s methods. See [Model Roles and Usage Privileges](../../../../user-guide/ml-functions/classification.md). |
| `model_name`!mlconsumer | SNOWFLAKE.ML.CLASSIFICATION instance | This role, scoped to the model itself, is initially granted to the owner, who can grant it to others to allow them to call the model’s prediction methods (such as `PREDICT`). See [Model Roles and Usage Privileges](../../../../user-guide/ml-functions/classification.md). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Example

See [Examples](../../../../user-guide/ml-functions/classification.md).

---
title: CREATE SNOWFLAKE.ML.FORECAST
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/commands/create-forecast.md
section: SQL Classes
---

# CREATE SNOWFLAKE.ML.FORECAST

Creates a new forecast model from the training data you provide or replaces the forecast model of the same name.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.ML.FORECAST [ IF NOT EXISTS ] <model_name>(
  INPUT_DATA => <input_data>,
  [ SERIES_COLNAME => '<series_colname>', ]
  TIMESTAMP_COLNAME => '<timestamp_colname>',
  TARGET_COLNAME => '<target_colname>',
  [ CONFIG_OBJECT => <config_object> ]
)
[ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
[ COMMENT = '<string_literal>' ]
```

> **Note:**
>
> Using named arguments makes argument order irrelevant and results in more readable code.
> However, you can also use positional arguments, as in the following example:
>
> ```sqlsyntax
> CREATE SNOWFLAKE.ML.FORECAST <name>(
>   '<input_data>', '<series_colname>', '<timestamp_colname>', '<target_colname>'
> );
> ```

## Parameters

`model_name`
:   Specifies the identifier for the model; must be unique for the schema in which the model is created.

    If the model identifier is not fully qualified (in the form of `db_name.schema_name.name` or
    `schema_name.name`), the command creates the model in the current schema for the session.

    In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters
    unless the entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in
    double quotes are also case-sensitive.

    For more details, see [Identifier requirements](../../../identifiers-syntax.md).

## Constructor arguments

**Required:**

`INPUT_DATA => input_data`
:   A [reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to the input data. Using a reference allows the
    training process, which runs with limited privileges, to use your privileges to access the data. You can use a reference to a
    table or a view if your data is already in that form, or you can use a
    [query reference](../../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to provide the query to be executed to obtain the data.

    To create this reference, you can use the [TABLE keyword](../../../snowflake-db-classes.md) with the table name, view name,
    or query, or you can call the [SYSTEM$REFERENCE](../../../functions/system_reference.md) or
    [SYSTEM$QUERY_REFERENCE](../../../functions/system_query_reference.md) function.

    The referenced data is the entire training data consumed by the forecasting model. If `input_data` contains any
    columns that are not named as `timestamp_colname`, `target_colname`, or `series_colname`, they
    are considered exogenous variables (additional features). Order of the columns in the input data is not important.

    Your input data must have columns with appropriate types for your use case. See
    [Examples](../../../../user-guide/ml-functions/forecasting.md) for details on each use case.

    | Use Case | Columns and types |
    | --- | --- |
    | Single time series | * Timestamp column: [TIMESTAMP_NTZ](../../../data-types-datetime.md). * Target value column: [FLOAT](../../../data-types-numeric.md). |
    | Multiple time series | * Series column: [VARIANT](../../../data-types-semistructured.md) containing numeric values and text. * Timestamp column: [TIMESTAMP_NTZ](../../../data-types-datetime.md). * Target value column: [FLOAT](../../../data-types-numeric.md). |
    | Single time series with exogenous variables | * Timestamp column: [TIMESTAMP_NTZ](../../../data-types-datetime.md). * Target value column: [FLOAT](../../../data-types-numeric.md). * Exogenous feature columns: [numeric](../../../data-types-numeric.md) or [text](../../../data-types-text.md). |
    | Multiple time series with exogenous variables | * Series column: [VARIANT](../../../data-types-semistructured.md) containing numeric values and text. * Timestamp column: [TIMESTAMP_NTZ](../../../data-types-datetime.md). * Target value column: [FLOAT](../../../data-types-numeric.md). * Exogenous feature columns: [numeric](../../../data-types-numeric.md) or [text](../../../data-types-text.md). |

`TIMESTAMP_COLNAME => 'timestamp_colname'`
:   Name of the column containing the timestamps in `input_data`.

`TARGET_COLNAME => 'target_colname'`
:   Name of the column containing the target (dependent value) in `input_data`.

**Optional:**

`SERIES_COLNAME => 'series_colname'`
:   For multiple time-series models, the name of the column defining the multiple time series in `input_data`.
    This column can be a value of any type, or an array of values from one or more other columns, as shown in
    [Forecast on multiple series](../../../../user-guide/ml-functions/forecasting.md).

    If you are providing arguments positionally, this must be the *second* argument.

`CONFIG_OBJECT => config_object`
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs used to configure the model training job.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `aggregation_categorical` | [STRING](../../../data-types-text.md) | `'MODE'` | The aggregation method for categorical features. Supported values are:   * `'MODE'`: The most frequent value. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `aggregation_numeric` | [STRING](../../../data-types-text.md) | `'MEAN'` | The aggregation method for numeric features. Supported values are:   * `'MEAN'`: The average of the values. * `'MEDIAN'`: The middle value. * `MODE`: The most frequent value. * `'MIN'`: The smallest value. * `'MAX'`: The largest value. * `'SUM'`: The total of the values. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `aggregation_target` | [STRING](../../../data-types-text.md) | Same as `aggregation_numeric`, or `'MEAN'` if not specified | The aggregation method for the target value. Supported values are:   * `'MEAN'`: The average of the values. * `'MEDIAN'`: The middle value. * `MODE`: The most frequent value. * `'MIN'`: The smallest value. * `'MAX'`: The largest value. * `'SUM'`: The total of the values. * `'FIRST'`: The earliest value. * `'LAST'`: The latest value. |
    | `aggregation_column` | [Object](../../../data-types-semistructured.md) | n/a | An object containing key-value pairs (both strings) that specify the aggregation method for specific columns. The key is the column name, and the value is the aggregation method. If a column is not specified, the model uses the method specified by `aggregation_numeric` or `aggregation_categorical`, or the default for that column type (`MEAN` for numeric, `MODE` for categorical). |
    | `evaluate` | [BOOLEAN](../../../data-types-logical.md) | TRUE | Whether evaluation metrics should be generated. If TRUE, then additional models are trained for cross-validation using the parameters in the `evaluation_config`. |
    | `evaluation_config` | [OBJECT](../../../data-types-semistructured.md) | See Evaluation configuration below. | A optional config object to specify how out-of-sample evaluation metrics should be generated. |
    | `frequency` | [STRING](../../../data-types-text.md) | n/a | The frequency of the time series. If not specified, the model infers the frequency. The value must be a string representing a time period, such as `'1 day'`. Supported units include seconds, minutes, hours, days, weeks, months, quarters, and years. You may use singular (“hour”) or plural (“hours”) for the interval name, but may not abbreviate. |
    | `method` | [STRING](../../../data-types-text.md) | `'best'` | String (constant) that specifies the algorithm used to train the model. Supported values are:   * `'best'`: Uses an ensemble of models to determine the best algorithm for the data. This ensemble   includes [Prophet](https://facebook.github.io/prophet/),   [ARIMA](https://en.wikipedia.org/wiki/Autoregressive_integrated_moving_average) ,   [Exponential Smoothing](https://en.wikipedia.org/wiki/Exponential_smoothing) , and a   [gradient boosting machine (GBM)](https://en.wikipedia.org/wiki/Gradient_boosting) based algorithm. * `'fast'`: Uses a single algorithm - a GBM based algorithm - to train the model. This option is faster   than the `'best'` option, but may not be as accurate. We recommend using `'fast'` when your training data   has 10,000 or more individual series. |
    | `lower_bound` | [FLOAT](../../../data-types-numeric.md) or NULL | NULL | The lower bound for the target value. If specified, the model will not predict values below this threshold. |
    | `upper_bound` | [FLOAT](../../../data-types-numeric.md) or NULL | NULL | The upper bound for the target value. If specified, the model will not predict values above this threshold. |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) that specifies the error handling method for the model training task. This is most useful when training multiple series. Supported values are:   * `'abort'`: Abort the training operation if an error is encountered in any time series. * `'skip'`: Skip any time series where training encounters an error. This allows model training to   succeed for other time series. To see which series failed, use the model’s   [<model_name>!SHOW_TRAINING_LOGS](../methods/show_training_logs.md) method. |

## Evaluation configuration

The `evaluation_config` object contains key-value pairs that configure cross-validation. These parameters are from scikit-learn’s
[TimeSeriesSplit](https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.TimeSeriesSplit.html).

> | Key | Type | Default | Description |
> | --- | --- | --- | --- |
> | `n_splits` | [INTEGER](../../../data-types-numeric.md) | 1 | Number of splits. |
> | `max_train_size` | [INTEGER](../../../data-types-numeric.md) or NULL (no maximum). | NULL | Maximum size for a single training set. |
> | `test_size` | [INTEGER](../../../data-types-numeric.md) or NULL. | NULL | Used to limit the size of the test set. |
> | `gap` | [INTEGER](../../../data-types-numeric.md) | 0 | Number of samples to exclude from the end of each training set before the test set. |
> | `prediction_interval` | [FLOAT](../../../data-types-numeric.md) | 0.95 | The prediction interval used in calculating interval metrics. |

## Usage notes

[Replication](../../../../user-guide/account-replication-intro.md) is supported only for instances
of the [CUSTOM_CLASSIFIER](../../custom_classifier.md) class.

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: CREATE SNOWFLAKE.ML.TOP_INSIGHTS
source: https://docs.snowflake.com/en/sql-reference/classes/top-insights/commands/create-top-insights.md
section: SQL Classes
---

# CREATE SNOWFLAKE.ML.TOP_INSIGHTS

Creates a new Top Insights instance or replaces an existing one. You must instantiate this class to gain access to the
method that provides insights into your data, called GET_DRIVERS. The instance does not store any data or settings. In
most cases, you do not need to create more than one instance of this class.

## Syntax

```sqlsyntax
CREATE [ OR REPLACE ] SNOWFLAKE.ML.TOP_INSIGHTS [ IF NOT EXISTS ] <instance_name>()
[ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
[ COMMENT = '<string_literal>' ]
```

## Usage notes

Use the IF NOT EXISTS form of this command to make sure that the instance exists before you call the GET_DRIVERS method.

[Replication](../../../../user-guide/account-replication-intro.md) is supported only for instances
of the [CUSTOM_CLASSIFIER](../../custom_classifier.md) class.

## Examples

See [Examples](../../../../user-guide/ml-functions/top-insights.md).

---
title: custom_classifier!ADD_REGEX
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/methods/add_regex.md
section: SQL Classes
---

# `custom_classifier`!ADD_REGEX

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

Adds categories and a regular expression to the custom classifier, while optionally specifying a regular expression for the column name and
a comment.

## Syntax

```sqlsyntax
<custom_classifier>!ADD_REGEX(
  SEMANTIC_CATEGORY => '<custom_category>' ,
  PRIVACY_CATEGORY => { 'IDENTIFIER' | 'QUASI-IDENTIFIER' | 'SENSITIVE' } ,
  VALUE_REGEX => '<regular_expression>' ,
  [ COL_NAME_REGEX => <regular_expression> ] ,
  [ DESCRIPTION => <string> ] ,
  [ THRESHOLD => <number> ]
)
```

## Arguments

**Required:**

`SEMANTIC_CATEGORY => custom_category`
:   Specifies the name of the custom category (that is, type of information).

`PRIVACY_CATEGORY => { 'IDENTIFIER' | 'QUASI-IDENTIFIER' | 'SENSITIVE' }`
:   Specifies the sensitivity of the data, and can be one of the following values: `'IDENTIFIER'`, `'QUASI_IDENTIFIER'`, or
    `'SENSITIVE'`.

`VALUE_REGEX => regular_expression`
:   Specifies the regular expression to match the values in a column.

    You can test the syntax of the regular expression by calling the [REGEXP_LIKE](../../../functions/regexp_like.md) function.

**Optional:**

`COL_NAME_REGEX => regular_expression`
:   Specifies the regular expression to match the name of the column that you want to classify.

`DESCRIPTION => string`
:   Specifies a comment describing the custom category or the custom classifier that implements it.

`THRESHOLD => number`
:   Specifies the threshold value for the scoring rule. For more information, see [Threshold for custom categories](../../../../user-guide/classify-custom.md).

    The acceptable range is greater than `0.0` and less than or equal to `1.0`.

    Default: `0.8`.

## Output

Returns a status message indicating the association of the category with the custom classifier in this format:
`classifier_name:category_name`.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `custom_classifier`!PRIVACY_USER | The custom classification instance. | The account role that calls this method must be granted this instance role on the custom classifier.  By default, the account role used to create the instance can call this method. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

* Calling this method multiple times gives an additive result for the number of regular expressions associated with the instance.
* Call each method in a separate SQL statement (no method chaining).
* All regular expression searches for classification purposes are case-insensitive.
* Test the regular expression before adding a regular expression to the custom classification instance. For example,
  use the [[ NOT ] REGEXP](../../../functions/regexp.md) function to make sure that only values that match the regex are returned in the result:

  ```sqlsyntax
  SELECT <col_to_classify>
  FROM <table_with_col_to_classify>
  WHERE <col_to_classify> REGEXP('<regex>');
  ```

  For details, see [String functions (regular expressions)](../../../functions-regexp.md).

## Examples

Add categories and a regular expression to the `medical_codes` instance:

```sqlexample
CALL internal_ids!ADD_REGEX(
  SEMANTIC_CATEGORY => 'EMPLOYEE_ID',
  PRIVACY_CATEGORY => 'IDENTIFIER',
  VALUE_REGEX => '^[0-9]{6}$',
  COL_NAME_REGEX => 'EMP.*ID.*',
  DESCRIPTION => 'Add a regex to identify employee IDs in a column',
  THRESHOLD => 0.8
);
```

Returns:

```output
+---------------+
|   ADD_REGEX   |
+---------------+
| EMPLOYEE_ID   |
+---------------+
```

Create a custom classifier that uses the default threshold and doesn’t use a regular expression to match column names:

```sqlexample
CALL medical_codes!ADD_REGEX(
  SEMANTIC_CATEGORY => 'ICD_10_CODES',
  PRIVACY_CATEGORY => 'IDENTIFIER',
  VALUE_REGEX => '[A-TV-Z][0-9][0-9AB]\.?[0-9A-TV-Z]{0,4}'
);
```

---
title: custom_classifier!DELETE_CATEGORY
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/methods/delete_category.md
section: SQL Classes
---

# `custom_classifier`!DELETE_CATEGORY

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

Deletes the specified semantic category with its associated privacy category, regular expression, and comment from the instance.

## Syntax

```sqlsyntax
<custom_classifier>!DELETE_CATEGORY( '<semantic_category>' )
```

## Arguments

`semantic_category`
:   Specifies the identifier (name) for the semantic category that you added to the instance when calling the
    [custom_classifier!ADD_REGEX](add_regex.md) method.

## Output

Returns a status message indicating the deletion of the specified semantic category.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `custom_classifier`!PRIVACY_USER | The custom classification instance. | The account role that calls this method must be granted this instance role on the custom classifier.  By default, the account role used to create the instance can call this method. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Call the method in a separate SQL statement (no method chaining).

## Examples

Delete a category from the `medical_codes` instance:

```sqlexample
CALL medical_codes!DELETE_CATEGORY('IC_10_CODES');
```

Returns:

```output
+------------------------------+
|       DELETE_CATEGORY        |
+------------------------------+
| Deleted category IC_10_CODES |
+------------------------------+
```

---
title: custom_classifier!LIST
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/methods/list.md
section: SQL Classes
---

# `custom_classifier`!LIST

Lists each custom classification semantic category system tag with its associated regular expressions for the column name and values in the
column, the description, and the privacy category tag.

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

## Syntax

```sqlsyntax
<custom_classifier>!LIST()
```

## Arguments

None.

## Output

Returns a JSON object with the following structure:

```sqljson
{
  "semantic_category_name": {
    "col_name_regex": "string",
    "description": "string",
    "privacy_category": "string",
    "threshold": number,
    "value_regex": "string"
   }
}
```

Each field value corresponds to the value that you specify when calling the
[custom_classifier!ADD_REGEX](add_regex.md) method.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Instance role | Object | Notes |
| --- | --- | --- |
| `custom_classifier`!PRIVACY_USER | The custom classifier instance. | The role that calls this method must be granted the instance role.  By default, the account role used to create the instance can call this method. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Call each method in a separate SQL statement (no method chaining).

## Examples

```sqlexample
SELECT internal_ids!LIST();
```

Returns:

```output
+--------------------------------------------------------------------------------+
| INTERNAL_IDS!LIST()                                                            |
+--------------------------------------------------------------------------------+
| {                                                                              |
|   "EMPLOYEE_ID": {                                                             |
|     "col_name_regex": "EMP.*ID.*",                                             |
|     "description": "Add a regex to identify employee IDs in a column",         |
|     "privacy_category": "IDENTIFIER",                                          |
|     "threshold": 0.8,                                                          |
|     "value_regex": "^[0-9]{6}$"                                                |
|   }                                                                            |
| }                                                                              |
+--------------------------------------------------------------------------------+
```

---
title: DROP BUDGET
source: https://docs.snowflake.com/en/sql-reference/classes/budget/commands/drop-budget.md
section: SQL Classes
---

# DROP BUDGET

*Fully qualified name*: SNOWFLAKE.CORE.BUDGET

Removes an instance of a *custom* budget.

See also:
:   [CREATE BUDGET](create-budget.md),
    [ALTER BUDGET](alter-budget.md),
    [SHOW BUDGET](show-budget.md)

## Syntax

```sqlsyntax
DROP SNOWFLAKE.CORE.BUDGET [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier (i.e. name) of the budget.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | Budget | The role used to drop a budget must be granted this privilege on the budget. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For more information, see [Budgets roles and privileges](../../../../user-guide/budgets.md).

## Usage notes

* To refer to this class by its unqualified name, include the database and schema of the class in your
  [search path](../../../snowflake-db-classes.md).
* Dropped budgets cannot be recovered; they must be recreated.

## Examples

Drop budget `my_budget` in the current schema:

```sqlexample
DROP SNOWFLAKE.CORE.BUDGET my_budget;
```

---
title: DROP CLASSIFICATION_PROFILE
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/commands/drop-classification-profile.md
section: SQL Classes
---

# DROP CLASSIFICATION_PROFILE

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE

Drops a classification profile instance in the current or specified schema.

## Syntax

```sqlsyntax
DROP SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  [ IF EXISTS ] <classification_profile_name>
```

## Parameters

`classification_profile_name`
:   Specifies the identifier of the instance of the CLASSIFICATION_PROFILE class.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| OWNERSHIP | SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE instance |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Dropped instances cannot be recovered; they must be recreated.

## Examples

Drop the classification profile instance:

```sqlexample
DROP SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE my_classification_profile;
```

---
title: DROP CUSTOM_CLASSIFIER
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/commands/drop-custom-classifier.md
section: SQL Classes
---

# DROP CUSTOM_CLASSIFIER

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER

Drops a custom classification instance in the current or specified schema.

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

## Syntax

```sqlsyntax
DROP SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER
[ IF EXISTS ] <custom_classifier_name>
```

## Parameters

`custom_classifier_name`
:   Specifies the identifier (name) for the instance.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../../../identifiers-syntax.md).

## Arguments

None.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object |
| --- | --- |
| OWNERSHIP | SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER instance |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Dropped instances cannot be recovered; they must be recreated.

## Examples

```sqlexample
DROP SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER data.classifiers.internal_ids;
```

---
title: DROP SNOWFLAKE.ML.ANOMALY_DETECTION
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/commands/drop-anomaly-detection.md
section: SQL Classes
---

# DROP SNOWFLAKE.ML.ANOMALY_DETECTION

Removes the specified model from the current or specified schema. Dropped models cannot be recovered; they must be recreated.

## Syntax

```sqlsyntax
DROP SNOWFLAKE.ML.ANOMALY_DETECTION [IF EXISTS] <model_name>;
```

## Parameters

`model_name`
:   Specifies the identifier for the model to drop. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive.

    If the model identifier is not fully qualified (in the form of `db_name.schema_name.name` or
    `schema_name.name`)), the command looks for the model in the current schema for the session.

## Examples

For a representative example, see the [anomaly detection example](../../../../user-guide/ml-functions/anomaly-detection.md).

---
title: DROP SNOWFLAKE.ML.CLASSIFICATION
source: https://docs.snowflake.com/en/sql-reference/classes/classification/commands/drop-classification.md
section: SQL Classes
---

# DROP SNOWFLAKE.ML.CLASSIFICATION

Drop an instance of a classification model.

See also:
:   [CREATE SNOWFLAKE.ML.CLASSIFICATION](create-classification.md).

## Syntax

```sqlsyntax
DROP SNOWFLAKE.ML.CLASSIFICATION [ IF EXISTS ] <name>
```

## Parameters

`name`
:   Specifies the identifier for the classification model. The identifier must start with an alphabetic character and cannot contain spaces or
    special characters unless the identifier string is enclosed in double quotes (e.g. `"My object"`). Identifiers enclosed in double
    quotes are also case-sensitive.

    If the model identifier is not fully-qualified (in the form of `db_name.schema_name.model_name` or
    `schema_name.model`), the command looks for the model in the current schema for the session.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege / Role | Object | Notes |
| --- | --- | --- |
| OWNERSHIP privilege | Classification model | The role used to drop a classification model must be granted this privilege on the model. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Usage notes

Dropped classification models cannot be recovered; they must be recreated.

## Examples

Drop classification model `my_model` in the current schema:

```sqlexample
DROP SNOWFLAKE.ML.CLASSIFICATION my_model;
```

---
title: DROP SNOWFLAKE.ML.FORECAST
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/commands/drop-forecast.md
section: SQL Classes
---

# DROP SNOWFLAKE.ML.FORECAST

Removes the specified model from the current or specified schema. Dropped models cannot be recovered; they must be recreated.

## Syntax

```sqlsyntax
DROP SNOWFLAKE.ML.FORECAST [ IF EXISTS ] <model_name>;
```

## Parameters

`model_name`
:   Specifies the identifier for the model to drop. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive.

    If the model identifier is not fully qualified (in the form of `db_name.schema_name.name` or
    `schema_name.name`)), the command looks for the model in the current schema for the session.

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: DROP SNOWFLAKE.ML.TOP_INSIGHTS
source: https://docs.snowflake.com/en/sql-reference/classes/top-insights/commands/drop-top-insights.md
section: SQL Classes
---

# DROP SNOWFLAKE.ML.TOP_INSIGHTS

Removes the specified Top Insights instance from the current or specified schema. Dropped instances cannot be recovered.

## Syntax

```sqlsyntax
DROP SNOWFLAKE.ML.TOP_INSIGHTS [ IF EXISTS ] <instance_name>;
```

## Parameters

`instance_name`
:   Specifies the identifier for the instance to drop. If the identifier contains spaces, special characters, or mixed-case
    characters, the entire string must be enclosed in double quotes. Identifiers enclosed in double quotes are also
    case-sensitive.

    If the instance identifier is not fully qualified (in the form of `db_name.schema_name.name` or
    `schema_name.name`)), the command looks for the instance in the current schema for the session.

---
title: model_name!PREDICT
source: https://docs.snowflake.com/en/sql-reference/classes/classification/methods/predict.md
section: SQL Classes
---

# model_name!PREDICT

Generates a classification prediction from the previously trained model `model_name`.

## Syntax

```sqlsyntax
<model_name>!PREDICT(
    INPUT_DATA => <input_data>,
    [CONFIG_OBJECT => <config_object>]
)
```

## Arguments

*Required*

INPUT_DATA
:   An [OBJECT](../../../data-types-semistructured.md) containing key-value pairs that map feature names to their values. Use
    [wildcard expansion in an object literal](../../../data-types-semistructured.md) to automatically create key-value pairs from a table, as in:

    ```sqlexample
    SELECT model_binary!PREDICT(INPUT_DATA => {*})
        as prediction from prediction_purchase_data;
    ```

    The feature names in the object must match the names and types specified at training time. Missing or extraneous features are ignored.

*Optional*

CONFIG_OBJECT
:   An [OBJECT](../../../data-types-semistructured.md) whose key-value pairs specify additional training options.

    | Key | Type | Default | Description |
    | --- | --- | --- | --- |
    | `on_error` | [STRING](../../../data-types-text.md) | `'ABORT'` | String (constant) that specifies the error handling method for the model inference task. Supported values are:   * `'ABORT'`: Abort the entire prediction operation if any row results in an error. * `'SKIP'`: Skip rows that result in an error. The error is shown instead of the results. |

## Output

> | Column | Type | Description |
> | --- | --- | --- |
> | PREDICTION | [VARIANT](../../../data-types-semistructured.md) | Prediction results as an [OBJECT](../../../data-types-semistructured.md) containing the following keys.   | Key | Type | Description | | --- | --- | --- | | `class` | [STRING](../../../data-types-text.md) | The predicted label with the highest probability. | | `probability` | [VARIANT](../../../data-types-semistructured.md) | An [OBJECT](../../../data-types-semistructured.md) containing the probabilities of each predicted class. For each class, the key is the class name, and the value is the predicted probability of the class. | |
> | LOGS | [VARIANT](../../../data-types-semistructured.md) | Contains error or warning messages. |

## Examples

See [Examples](../../../../user-guide/ml-functions/classification.md).

---
title: SHOW BUDGET
source: https://docs.snowflake.com/en/sql-reference/classes/budget/commands/show-budget.md
section: SQL Classes
---

# SHOW BUDGET

*Fully qualified name*: SNOWFLAKE.CORE.BUDGET

Lists budgets for which you have access privileges.

SHOW SNOWFLAKE.CORE.BUDGET INSTANCES is an alias for SHOW SNOWFLAKE.CORE.BUDGET.

See also:
:   [SYSTEM$SHOW_BUDGETS_IN_ACCOUNT](../../../functions/system_show_budgets_in_account.md),
    [CREATE BUDGET](create-budget.md)

## Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.CORE.BUDGET           |
  SHOW SNOWFLAKE.CORE.BUDGET INSTANCES
}
  [ LIKE '<pattern>' ]
  [ IN
        {
          ACCOUNT                  |

          DATABASE                 |
          DATABASE <database_name> |

          SCHEMA                   |
          SCHEMA <schema_name>     |
          <schema_name>
        }
  ]
  [ LIMIT <rows> [ FROM '<name_string>' ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

`LIMIT rows [ FROM 'name_string' ]`
:   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    returned might be less than the specified limit. For example, the number of existing objects is less than the specified limit.

    The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    specified number of rows following the first row whose object name matches the specified string:

    * The string must be enclosed in single quotes and is case sensitive.
    * The string does not have to include the full object name; partial names are supported.

    Default: No value (no limit is applied to the output)

    > **Note:**
    >
    > For SHOW commands that support both the `FROM 'name_string'` and `STARTS WITH 'name_string'` clauses, you can combine
    > both of these clauses in the same statement. However, both conditions must be met or they cancel out each other and no results are
    > returned.
    >
    > In addition, objects are returned in lexicographic order by name, so `FROM 'name_string'` only returns rows with a higher
    > lexicographic value than the rows returned by `STARTS WITH 'name_string'`.
    >
    > For example:
    >
    > * `... STARTS WITH 'A' LIMIT ... FROM 'B'` would return no results.
    > * `... STARTS WITH 'B' LIMIT ... FROM 'A'` would return no results.
    > * `... STARTS WITH 'A' LIMIT ... FROM 'AB'` would return results (if any rows match the input strings).

## Usage notes

* To refer to this class by its unqualified name, include the database and schema of the class in your
  [search path](../../../snowflake-db-classes.md).
* The system function [SYSTEM$SHOW_BUDGETS_IN_ACCOUNT](../../../functions/system_show_budgets_in_account.md) might execute faster than
  the SHOW command but doesn’t include owner fields in the output.
* The order of results is not guaranteed.

## Output

The command output provides budget properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the table was created. |
| name | Name of the budget. |
| database_name | Name of the database that contains the budget. |
| schema_name | Name of the schema that contains the budget. |
| current_version | Version of the BUDGET class used to create the budget instance. |
| comment | Comment for the budget. |
| owner | Role that owns the budget. |
| owner_role_type | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

## Examples

List budgets in the `budget_db.budget_schema` schema:

```sqlexample
SHOW SNOWFLAKE.CORE.BUDGET INSTANCES IN SCHEMA budget_db.budget_schema;
```

List budgets in the `budget_db.budget_schema` schema that include `dept` in the name of the budget:

```sqlexample
SHOW SNOWFLAKE.CORE.BUDGET LIKE '%DEPT%' IN SCHEMA budget_db.budget_schema;
```

---
title: SHOW CLASSIFICATION_PROFILE
source: https://docs.snowflake.com/en/sql-reference/classes/classification_profile/commands/show-classification-profile.md
section: SQL Classes
---

# SHOW CLASSIFICATION_PROFILE

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE

Lists all classification profile instances.

## Syntax

```sqlsyntax
SHOW SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE
  [ LIKE <pattern> ]
  [ IN
    {
      ACCOUNT                  |

      DATABASE                 |
      DATABASE <database_name> |

      SCHEMA                   |
      SCHEMA <schema_name>     |
      <schema_name>
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege/role | Object | Notes |
| --- | --- | --- |
| <classification_profile>!PRIVACY_USER [instance role](../../../snowflake-db-classes.md) | n/a |  |

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Output

Provides custom classifier instance properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the classification profile instance was created. |
| name | Name of the classification profile instance. |
| database_name | Database that stores the classification profile instance. |
| schema_name | Schema that stores the classification profile instance. |
| current_version | The version of the classification profile instance. Snowflake automatically updates the version number. |
| comment | Comment for the classification profile instance. |
| owner | The role that owns the classification profile instance. |

## Examples

List the classification profiles that you can access:

```sqlexample
SHOW SNOWFLAKE.DATA_PRIVACY.CLASSIFICATION_PROFILE;
```

---
title: SHOW CUSTOM_CLASSIFIER
source: https://docs.snowflake.com/en/sql-reference/classes/custom_classifier/commands/show-custom-classifiers.md
section: SQL Classes
---

# SHOW CUSTOM_CLASSIFIER

*Fully qualified name:* SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER

See also:
:   [Using custom classifiers to implement custom semantic categories](../../../../user-guide/classify-custom-using.md)

Lists all custom classification instances that you can access.

SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER INSTANCES is an alias for SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER.

## Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER           |
  SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER INSTANCES
}
  [ LIKE <pattern> ]
  [ IN
    {
      ACCOUNT                  |

      DATABASE                 |
      DATABASE <database_name> |

      SCHEMA                   |
      SCHEMA <schema_name>     |
      <schema_name>
    }
  ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

The order of results is not guaranteed.

## Access control requirements

A [role](../../../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| OWNERSHIP | The custom classification instance. | Users with the ACCOUNTADMIN admin role can list instances with this command. |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../../../user-guide/security-access-control-overview.md).

## Output

Provides custom classifier instance properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the custom classification instance was created. |
| name | Name of the custom classification instance. |
| database_name | Database that stores the custom classification instance. |
| schema_name | Schema that stores the custom classification instance. |
| current_version | The version of the custom classification instance. Snowflake automatically updates the version number. |
| comment | Comment for the custom classification instance. |
| owner | The role that owns the custom classification instance. |

## Examples

List all of the custom classifiers that you can access:

```sqlexample
SHOW SNOWFLAKE.DATA_PRIVACY.CUSTOM_CLASSIFIER;
```

Returns:

```output
+----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
| created_on                       | name          | database_name | schema_name | current_version | comment | owner       |
+----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
| 2023-09-08 07:00:00.123000+00:00 | INTERNAL_IDS  | DATA          | CLASSIFIERS | 1.0             | None    | DATA_OWNER  |
+----------------------------------+---------------+---------------+-------------+-----------------+---------+-------------+
```

---
title: SHOW SNOWFLAKE.ML.ANOMALY_DETECTION
source: https://docs.snowflake.com/en/sql-reference/classes/anomaly-detection/commands/show-anomaly-detection.md
section: SQL Classes
---

# SHOW SNOWFLAKE.ML.ANOMALY_DETECTION

Lists all anomaly detection models.

SHOW SNOWFLAKE.ML.ANOMALY_DETECTION INSTANCES is an alias for SHOW SNOWFLAKE.ML.ANOMALY_DETECTION.

## Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.ML.ANOMALY_DETECTION           |
  SHOW SNOWFLAKE.ML.ANOMALY_DETECTION INSTANCES
}
  [ LIKE <pattern> ]
  [ IN
      {
        ACCOUNT                  |

        DATABASE                 |
        DATABASE <database_name> |

        SCHEMA                   |
        SCHEMA <schema_name>     |
        <schema_name>
      }
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

The order of results is not guaranteed.

## Output

Model properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the model was created |
| name | Name of the model |
| database_name | Database in which the model is stored |
| schema_name | Schema in which the model is stored |
| current_version | The version of the model algorithm |
| comment | Comment for the model |
| owner | The role that owns the model |

## Examples

For a representative example, see the [anomaly detection example](../../../../user-guide/ml-functions/anomaly-detection.md).

---
title: SHOW SNOWFLAKE.ML.CLASSIFICATION
source: https://docs.snowflake.com/en/sql-reference/classes/classification/commands/show-classification.md
section: SQL Classes
---

# SHOW SNOWFLAKE.ML.CLASSIFICATION

Lists all classification models.

SHOW SNOWFLAKE.ML.CLASSIFICATION INSTANCES is an alias for SHOW SNOWFLAKE.ML.CLASSIFICATION.

See also:
:   [CREATE SNOWFLAKE.ML.CLASSIFICATION](create-classification.md)

## Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.ML.CLASSIFICATION           |
  SHOW SNOWFLAKE.ML.CLASSIFICATION INSTANCES
}
                                 [ LIKE <pattern> ]
                                 [ IN
                                     {
                                         ACCOUNT                  |

                                         DATABASE                 |
                                         DATABASE <database_name> |

                                         SCHEMA                   |
                                         SCHEMA <schema_name>     |
                                         <schema_name>
                                      }
                                  ]
```

## Usage notes

The order of results is not guaranteed.

## Output

The command output provides model properties and metadata in the following columns.

| Column | Description |
| --- | --- |
| `created_on` | Date and time when the model was created. |
| `name` | Name of the model. |
| `database_name` | Database in which the model is stored. |
| `schema_name` | Schema in which the model is stored. |
| `current_version` | The version of the model. |
| `comment` | Comment for the model. |
| `owner` | The role that owns the model. |

---
title: SHOW SNOWFLAKE.ML.FORECAST
source: https://docs.snowflake.com/en/sql-reference/classes/forecast/commands/show-forecast.md
section: SQL Classes
---

# SHOW SNOWFLAKE.ML.FORECAST

Lists all forecasting models.

SHOW SNOWFLAKE.ML.FORECAST INSTANCES is an alias for SHOW SNOWFLAKE.ML.FORECAST.

## Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.ML.FORECAST           |
  SHOW SNOWFLAKE.ML.FORECAST INSTANCES
}
  [ LIKE <pattern> ]
  [ IN
      {
        ACCOUNT                  |

        DATABASE                 |
        DATABASE <database_name> |

        SCHEMA                   |
        SCHEMA <schema_name>     |
        <schema_name>
      }
   ]
```

## Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

## Usage notes

The order of results is not guaranteed.

## Output

The command output provides model properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the model was created |
| name | Name of the model |
| database_name | Database in which the model is stored |
| schema_name | Schema in which the model is stored |
| current_version | The version of the model algorithm |
| comment | Comment for the model |
| owner | The role that owns the model |

## Examples

See [Examples](../../../../user-guide/ml-functions/forecasting.md).

---
title: SHOW SNOWFLAKE.ML.TOP_INSIGHTS
source: https://docs.snowflake.com/en/sql-reference/classes/top-insights/commands/show-top-insights.md
section: SQL Classes
---

# SHOW SNOWFLAKE.ML.TOP_INSIGHTS

Lists all Top Insights class instances.

SHOW SNOWFLAKE.ML.TOP_INSIGHTS INSTANCES is an alias for SHOW SNOWFLAKE.ML.TOP_INSIGHTS.

# Syntax

```sqlsyntax
{
  SHOW SNOWFLAKE.ML.TOP_INSIGHTS           |
  SHOW SNOWFLAKE.ML.TOP_INSIGHTS INSTANCES
}
  [ LIKE <pattern> ]
  [ IN
      {
        ACCOUNT                  |

        DATABASE                 |
        DATABASE <database_name> |

        SCHEMA                   |
        SCHEMA <schema_name>     |
        <schema_name>
      }
   ]
```

# Parameters

`LIKE 'pattern'`
:   Optionally filters the command output by object name. The filter uses case-insensitive pattern matching, with support for SQL
    wildcard characters (`%` and `_`).

    For example, the following patterns return the same results:

    `... LIKE '%testing%' ...`

    `... LIKE '%TESTING%' ...`

    . Default: No value (no filtering is applied to the output).

`[ IN ... ]`
:   Optionally specifies the scope of the command. Specify one of the following:

    `ACCOUNT`
    :   Returns records for the entire account.

    `DATABASE`, . `DATABASE db_name`
    :   Returns records for the current database in use or for a specified database (`db_name`).

        If you specify `DATABASE` without `db_name` and no database is in use, the keyword has no effect on the output.

        > **Note:**
        >
        > Using SHOW commands without an `IN` clause in a database context can result in fewer than expected results.
        >
        > Objects with the same name are only displayed once if no `IN` clause is used. For example, if you have table `t1` in
        > `schema1` and table `t1` in `schema2`, and they are both in scope of the database context you’ve specified (that is, the database
        > you’ve selected is the parent of `schema1` and `schema2`), then SHOW TABLES only displays one of the `t1` tables.

    `SCHEMA`, . `SCHEMA schema_name`
    :   Returns records for the current schema in use or a specified schema (`schema_name`).

        `SCHEMA` is optional if a database is in use or if you specify the fully qualified `schema_name` (for example, `db.schema`).

        If no database is in use, specifying `SCHEMA` has no effect on the output.

    If you omit `IN ...`, the scope of the command depends on whether the session currently has a database in use:

    * If a database is currently in use, the command returns the objects you have privileges to view in the database. This has the
      same effect as specifying `IN DATABASE`.
    * If no database is currently in use, the command returns the objects you have privileges to view in your account. This has the
      same effect as specifying `IN ACCOUNT`.

# Usage notes

The order of results is not guaranteed.

# Output

The command output provides instance properties and metadata in the following columns:

| Column | Description |
| --- | --- |
| created_on | Date and time when the instance was created |
| name | Name of the instance |
| database_name | Database in which the instance is stored |
| schema_name | Schema in which the instance is stored |
| current_version | The version of the instance |
| comment | Comment for the instance |
| owner | The role that owns the instance |

## SQL General Reference

SQL parameters, data types, identifiers, constraints, Snowflake Scripting, and other reference topics.

---
title: Account & session DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-other.md
section: SQL General Reference
---

# Account & session DDL

The following DDL commands are used to view and manage account-level and session operations, including:

* Viewing parameters at multiple levels in the system (account, session, object).
* Setting parameters at the account-level and within a session.
* Using a role, warehouse, database, or schema within a session.
* Using multi-statement transactions within a session.
* Setting and using SQL variables within a session.

## Account parameters & functions

|  |  |
| --- | --- |
| [ALTER ACCOUNT](sql/alter-account.md) | For setting parameters at the account-level; can only be performed by users with the ACCOUNTADMIN role. |
| [SHOW FUNCTIONS](sql/show-functions.md) | Displays system-defined functions, as well as any user-defined functions. |
| [SHOW PARAMETERS](sql/show-parameters.md) | For viewing parameter settings for the account. |

## Accounts

|  |  |
| --- | --- |
| [CREATE ACCOUNT](sql/create-account.md) | Used to create accounts in an organization. |
| [DROP ACCOUNT](sql/drop-account.md) |  |
| [SHOW ACCOUNTS](sql/show-accounts.md) | Lists the accounts in an organization. |
| [SHOW ORGANIZATION ACCOUNTS](sql/show-organization-accounts.md) | Use SHOW ACCOUNTS instead. |
| [SHOW REGIONS](sql/show-regions.md) |  |
| [UNDROP ACCOUNT](sql/undrop-account.md) |  |

## Managed accounts

|  |  |
| --- | --- |
| [CREATE MANAGED ACCOUNT](sql/create-managed-account.md) | Currently used to create [reader accounts](../user-guide/data-sharing-reader-create.md) for providers who wish to share data with non-Snowflake customers. |
| [DROP MANAGED ACCOUNT](sql/drop-managed-account.md) |  |
| [SHOW MANAGED ACCOUNTS](sql/show-managed-accounts.md) |  |

## Replication and failover/failback

|  |  |
| --- | --- |
| [ALTER CONNECTION](sql/alter-connection.md) |  |
| [CREATE CONNECTION](sql/create-connection.md) |  |
| [DROP CONNECTION](sql/drop-connection.md) |  |
| [SHOW CONNECTIONS](sql/show-connections.md) |  |
| [SHOW GLOBAL ACCOUNTS](sql/show-global-accounts.md) | Deprecated. Use [SHOW REPLICATION ACCOUNTS](sql/show-replication-accounts.md) instead. |
| [SHOW REPLICATION ACCOUNTS](sql/show-replication-accounts.md) |  |
| [SHOW REPLICATION DATABASES](sql/show-replication-databases.md) |  |

## Session parameters

|  |  |
| --- | --- |
| [ALTER SESSION](sql/alter-session.md) | For setting parameters within a session; can be performed by any user. |
| [SHOW PARAMETERS](sql/show-parameters.md) | For viewing parameter settings for the session (or account); can also be used to view parameter settings for a specified object. |

## Session context

|  |  |
| --- | --- |
| [USE ROLE](sql/use-role.md) | Specifies the primary role to use in the session. |
| [USE SECONDARY ROLES](sql/use-secondary-roles.md) | Specifies the secondary roles to use in the session. |
| [USE WAREHOUSE](sql/use-warehouse.md) | Specifies the virtual warehouse to use in the session. |
| [USE DATABASE](sql/use-database.md) | Specifies the database to use in the session. |
| [USE SCHEMA](sql/use-schema.md) | Specifies the schema to use in the session (specified schema must be in the current database for the session). |

See also:
:   [Context functions](functions-context.md)

## Queries

|  |  |
| --- | --- |
| [DESCRIBE RESULT](sql/desc-result.md) | Describes the columns in the results from a specified query (must have been executed within the last 24 hours). |
| [SHOW LOCKS](sql/show-locks.md) | For use with multi-statement transactions. |

## Session transactions

|  |  |
| --- | --- |
| [BEGIN](sql/begin.md) | For use with multi-statement transactions. |
| [COMMIT](sql/commit.md) | For use with multi-statement transactions. |
| [DESCRIBE TRANSACTION](sql/desc-transaction.md) | Describes the state of the transaction (e.g. committed, rolled back, running), etc. |
| [ROLLBACK](sql/rollback.md) | For use with multi-statement transactions. |
| [SHOW TRANSACTIONS](sql/show-transactions.md) | Lists all running transactions. |

## SQL variables

|  |  |
| --- | --- |
| [SET](sql/set.md) | For defining SQL variables in the session. |
| [SHOW VARIABLES](sql/show-variables.md) | For showing SQL variables in the session. |
| [UNSET](sql/unset.md) | For dropping SQL variables in the session. |

---
title: ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/snowflake_telemetry_add_row_access_policy_on_events_view.md
section: SQL General Reference
---

# ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW

> **Note:**
>
> Using row access policies on the default event table is an [Enterprise Edition](../../user-guide/intro-editions.md) feature.

Binds a [row access policy](../../user-guide/security-row-intro.md) to the [EVENTS_VIEW](../telemetry/events_view.md) by
specifying an array of the table’s columns. The EVENTS_VIEW is a view on the [default event table](../../developer-guide/logging-tracing/event-table-setting-up.md).

The EVENTS_ADMIN role includes the USAGE privilege on this procedure.

## Syntax

```sqlsyntax
SNOWFLAKE.TELEMETRY.ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(
  <row_access_policy_reference>,
  <apply_on_columns>
)
```

## Arguments

`row_access_policy_reference`
:   A [reference](../references.md) to a row access policy object to apply for rows in the EVENTS_VIEW.

`apply_on_columns`
:   Array of view column names on which the policy should be applied.

    For the list of allowed column names, see [Event table columns](../../developer-guide/logging-tracing/event-table-columns.md).

## Returns

On successful execution, the procedure returns a string indicating success. Otherwise, the procedure returns an error.

## Usage notes

This stored procedure uses owner’s rights. For more details, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

## Examples

Code in the following example binds the `ROW_ACCESS_POLICY` policy to two columns in the EVENTS_VIEW:

```sqlexample
CALL SNOWFLAKE.TELEMETRY.ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(
  SYSTEM$REFERENCE('ROW_ACCESS_POLICY', 'mydb.myschema.mypolicy', 'SESSION', 'APPLY'),
  ARRAY_CONSTRUCT('record_type', 'resource_attributes')
);
```

---
title: Aggregate functions
source: https://docs.snowflake.com/en/sql-reference/functions-aggregation.md
section: SQL General Reference
---

# Aggregate functions

Aggregate functions operate on values across rows to perform mathematical calculations such as sum, average, counting, minimum/maximum values, standard
deviation, and estimation, as well as some non-mathematical operations.

An aggregate function takes multiple rows (actually, zero, one, or more rows) as input and produces a single output.
In contrast, scalar functions take one row as input and produce one row (one value) as output.

An aggregate function always returns exactly one row, even when the input contains zero rows. Typically, if
the input contains zero rows, the output is NULL. However, an aggregate function could return `0`, an empty string, or
some other value when passed zero rows.

## List of functions (by sub-category)

| Function Name | Notes |
| --- | --- |
| **General Aggregation** |  |
| [ANY_VALUE](functions/any_value.md) |  |
| [AVG](functions/avg.md) |  |
| [CORR](functions/corr.md) |  |
| [COUNT](functions/count.md) |  |
| [COUNT_IF](functions/count_if.md) |  |
| [COVAR_POP](functions/covar_pop.md) |  |
| [COVAR_SAMP](functions/covar_samp.md) |  |
| [LISTAGG](functions/listagg.md) |  |
| [MAX](functions/max.md) |  |
| [MAX_BY](functions/max_by.md) |  |
| [MEDIAN](functions/median.md) |  |
| [MIN](functions/min.md) |  |
| [MIN_BY](functions/min_by.md) |  |
| [MODE](functions/mode.md) |  |
| [PERCENTILE_CONT](functions/percentile_cont.md) | Uses different syntax than the other aggregate functions. |
| [PERCENTILE_DISC](functions/percentile_disc.md) | Uses different syntax than the other aggregate functions. |
| [STDDEV, STDDEV_SAMP](functions/stddev.md) | STDDEV and STDDEV_SAMP are aliases. |
| [STDDEV_POP](functions/stddev_pop.md) |  |
| [SUM](functions/sum.md) |  |
| [VAR_POP](functions/var_pop.md) |  |
| [VAR_SAMP](functions/var_samp.md) |  |
| [VARIANCE_POP](functions/variance_pop.md) | Alias for [VAR_POP](functions/var_pop.md). |
| [VARIANCE , VARIANCE_SAMP](functions/variance.md) | Alias for [VAR_SAMP](functions/var_samp.md). |
| **Bitwise Aggregation** |  |
| [BITAND_AGG](functions/bitand_agg.md) |  |
| [BITOR_AGG](functions/bitor_agg.md) |  |
| [BITXOR_AGG](functions/bitxor_agg.md) |  |
| **Boolean Aggregation** |  |
| [BOOLAND_AGG](functions/booland_agg.md) |  |
| [BOOLOR_AGG](functions/boolor_agg.md) |  |
| [BOOLXOR_AGG](functions/boolxor_agg.md) |  |
| **Hash** |  |
| [HASH_AGG](functions/hash_agg.md) |  |
| **Semi-structured Data Aggregation** |  |
| [ARRAY_AGG](functions/array_agg.md) |  |
| [OBJECT_AGG](functions/object_agg.md) |  |
| **Linear Regression** |  |
| [REGR_AVGX](functions/regr_avgx.md) |  |
| [REGR_AVGY](functions/regr_avgy.md) |  |
| [REGR_COUNT](functions/regr_count.md) |  |
| [REGR_INTERCEPT](functions/regr_intercept.md) |  |
| [REGR_R2](functions/regr_r2.md) |  |
| [REGR_SLOPE](functions/regr_slope.md) |  |
| [REGR_SXX](functions/regr_sxx.md) |  |
| [REGR_SXY](functions/regr_sxy.md) |  |
| [REGR_SYY](functions/regr_syy.md) |  |
| **Statistics and Probability** |  |
| [KURTOSIS](functions/kurtosis.md) |  |
| [SKEW](functions/skew.md) |  |
| **Counting Distinct Values** |  |
| [ARRAY_UNION_AGG](functions/array_union_agg.md) |  |
| [ARRAY_UNIQUE_AGG](functions/array_unique_agg.md) |  |
| [BITMAP_BIT_POSITION](functions/bitmap_bit_position.md) |  |
| [BITMAP_BUCKET_NUMBER](functions/bitmap_bucket_number.md) |  |
| [BITMAP_COUNT](functions/bitmap_count.md) |  |
| [BITMAP_CONSTRUCT_AGG](functions/bitmap_construct_agg.md) |  |
| [BITMAP_OR_AGG](functions/bitmap_or_agg.md) |  |
| **Cardinality Estimation** . (**using** [HyperLogLog](../user-guide/querying-approximate-cardinality.md)) |  |
| [APPROX_COUNT_DISTINCT](functions/approx_count_distinct.md) | Alias for [HLL](functions/hll.md). |
| [DATASKETCHES_HLL](functions/datasketches_hll.md) |  |
| [DATASKETCHES_HLL_ACCUMULATE](functions/datasketches_hll_accumulate.md) |  |
| [DATASKETCHES_HLL_COMBINE](functions/datasketches_hll_combine.md) |  |
| [DATASKETCHES_HLL_ESTIMATE](functions/datasketches_hll_estimate.md) | Not an aggregate function; uses scalar input from [DATASKETCHES_HLL_ACCUMULATE](functions/datasketches_hll_accumulate.md) or [DATASKETCHES_HLL_COMBINE](functions/datasketches_hll_combine.md). |
| [HLL](functions/hll.md) |  |
| [HLL_ACCUMULATE](functions/hll_accumulate.md) |  |
| [HLL_COMBINE](functions/hll_combine.md) |  |
| [HLL_ESTIMATE](functions/hll_estimate.md) | Not an aggregate function; uses scalar input from [HLL_ACCUMULATE](functions/hll_accumulate.md) or [HLL_COMBINE](functions/hll_combine.md). |
| [HLL_EXPORT](functions/hll_export.md) |  |
| [HLL_IMPORT](functions/hll_import.md) |  |
| **Similarity Estimation** . (**using** [MinHash](../user-guide/querying-approximate-similarity.md)) |  |
| [APPROXIMATE_JACCARD_INDEX](functions/approximate_jaccard_index.md) | Alias for [APPROXIMATE_SIMILARITY](functions/approximate_similarity.md). |
| [APPROXIMATE_SIMILARITY](functions/approximate_similarity.md) |  |
| [MINHASH](functions/minhash.md) |  |
| [MINHASH_COMBINE](functions/minhash_combine.md) |  |
| **Frequency Estimation** . (**using** [Space-Saving](../user-guide/querying-approximate-frequent-values.md)) |  |
| [APPROX_TOP_K](functions/approx_top_k.md) |  |
| [APPROX_TOP_K_ACCUMULATE](functions/approx_top_k_accumulate.md) |  |
| [APPROX_TOP_K_COMBINE](functions/approx_top_k_combine.md) |  |
| [APPROX_TOP_K_ESTIMATE](functions/approx_top_k_estimate.md) | Not an aggregate function; uses scalar input from [APPROX_TOP_K_ACCUMULATE](functions/approx_top_k_accumulate.md) or [APPROX_TOP_K_COMBINE](functions/approx_top_k_combine.md). |
| **Percentile Estimation** . (**using** [t-Digest](../user-guide/querying-approximate-percentile-values.md)) |  |
| [APPROX_PERCENTILE](functions/approx_percentile.md) |  |
| [APPROX_PERCENTILE_ACCUMULATE](functions/approx_percentile_accumulate.md) |  |
| [APPROX_PERCENTILE_COMBINE](functions/approx_percentile_combine.md) |  |
| [APPROX_PERCENTILE_ESTIMATE](functions/approx_percentile_estimate.md) | Not an aggregate function; uses scalar input from [APPROX_PERCENTILE_ACCUMULATE](functions/approx_percentile_accumulate.md) or [APPROX_PERCENTILE_COMBINE](functions/approx_percentile_combine.md). |
| **Aggregation Utilities** |  |
| [GROUPING](functions/grouping.md) | Not an aggregate function, but can be used in conjunction with aggregate functions to determine the level of aggregation for a row produced by a [GROUP BY](constructs/group-by.md) query. |
| [GROUPING_ID](functions/grouping_id.md) | Alias for [GROUPING](functions/grouping.md). |
| **AI Functions** |  |
| [AI_AGG](functions/ai_agg.md) |  |
| [AI_SUMMARIZE_AGG](functions/ai_summarize_agg.md) |  |
| **Vector Aggregation** |  |
| [VECTOR_AVG](functions/vector_avg.md) |  |
| [VECTOR_MAX](functions/vector_max.md) |  |
| [VECTOR_MIN](functions/vector_min.md) |  |
| [VECTOR_SUM](functions/vector_sum.md) |  |
| **Semantic views** |  |
| [AGG](functions/agg.md) |  |

## Introductory example

The following example illustrates the difference between an aggregate function ([AVG](functions/avg.md)) and a scalar function ([COS](functions/cos.md)). The scalar function returns one output row for each input
row, while the aggregate function returns one output row for multiple input rows:

Create a table and populate it with values:

```sqlexample
CREATE TABLE simple (x INTEGER, y INTEGER);
INSERT INTO simple (x, y) VALUES
    (10, 20),
    (20, 44),
    (30, 70);
```

Query the table:

```sqlexample
SELECT x, y
    FROM simple
    ORDER BY x,y;
```

```output
+----+----+
|  X |  Y |
|----+----|
| 10 | 20 |
| 20 | 44 |
| 30 | 70 |
+----+----+
```

The scalar function returns one output row for each input row.

```sqlexample
SELECT COS(x)
    FROM simple
    ORDER BY x;
```

```output
+---------------+
|        COS(X) |
|---------------|
| -0.8390715291 |
|  0.4080820618 |
|  0.1542514499 |
+---------------+
```

The aggregate function returns one output row for multiple input rows:

```sqlexample
SELECT SUM(x)
    FROM simple;
```

```output
+--------+
| SUM(X) |
|--------|
|     60 |
+--------+
```

## Aggregate functions and NULL values

Some aggregate functions ignore NULL values. For example, [AVG](functions/avg.md) calculates the average of values `1`, `5`, and `NULL` to be `3`,
based on the following formula:

> `(1 + 5) / 2 = 3`

In both the numerator and the denominator, only the two non-NULL values are used.

If all of the values passed to the aggregate function are NULL, then the aggregate function returns NULL.

Some aggregate functions can be passed more than one column. For example:

```sqlexample
SELECT COUNT(col1, col2) FROM table1;
```

In these instances, the aggregate function ignores a row if any individual column is NULL.

For example, in the following query, [COUNT](functions/count.md) returns `1`, not `4`, because three of the four rows contain at least one NULL
value in the selected columns:

Create a table and populate it with values:

```sqlexample
CREATE OR REPLACE TABLE test_null_aggregate_functions (x INT, y INT);
INSERT INTO test_null_aggregate_functions (x, y) VALUES
  (1, 2),         -- No NULLs.
  (3, NULL),      -- One but not all columns are NULL.
  (NULL, 6),      -- One but not all columns are NULL.
  (NULL, NULL);   -- All columns are NULL.
```

Query the table:

```sqlexample
SELECT COUNT(x, y) FROM test_null_aggregate_functions;
```

```output
+-------------+
| COUNT(X, Y) |
|-------------|
|           1 |
+-------------+
```

If [SUM](functions/sum.md) is called with an expression that references two or more columns, and if one or more of those columns
is NULL, then the expression evaluates to NULL, and the row is ignored:

```sqlexample
SELECT SUM(x + y) FROM test_null_aggregate_functions;
```

```output
+------------+
| SUM(X + Y) |
|------------|
|          3 |
+------------+
```

This behavior differs from the behavior of [GROUP BY](constructs/group-by.md), which does not discard rows when some columns are NULL:

```sqlexample
SELECT x AS X_COL, y AS Y_COL
  FROM test_null_aggregate_functions
  GROUP BY x, y;
```

```output
+-------+-------+
| X_COL | Y_COL |
|-------+-------|
|     1 |     2 |
|     3 |  NULL |
|  NULL |     6 |
|  NULL |  NULL |
+-------+-------+
```

---
title: AI_GENERATE_TABLE_DESC
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/ai_generate_table_desc.md
section: SQL General Reference
---

# AI_GENERATE_TABLE_DESC

Generates and returns a description for a table or view. Optionally, the stored procedure can also generate descriptions for the columns of
the table or view.

The stored procedure uses the [Snowflake Cortex COMPLETE function](../functions/complete-snowflake-cortex.md) to
automatically generate descriptions.

## Syntax

```sqlsyntax
AI_GENERATE_TABLE_DESC(
  <table_name>
  [ , <config_object> ] )
```

## Required arguments

`table_name`
:   Specifies the table or view that you want to generate a description for.

## Optional arguments

`config_object`
:   An [OBJECT](../data-types-semistructured.md) that specifies whether you want to generate column descriptions and use sample data for those
    descriptions. You can use an [OBJECT constant](../data-types-semistructured.md) to specify this object.

    The OBJECT value has the following structure:

    ```sqlexample
    {
      'describe_columns': <boolean>,
      'use_table_data': <boolean>
    {
    ```

    `describe_columns`
    :   If set to TRUE, the stored procedure generates descriptions for all columns of the table.

    `use_table_data`
    :   If set to TRUE, the stored procedure uses sample data from the table to generate column descriptions, which can improve the accuracy of
        the descriptions. If FALSE, the stored procedure relies on metadata to generate the descriptions.

## Returns

Returns a JSON string with the following fields:

`COLUMNS`
:   Contains an array of columns for which descriptions were generated. This field is only returned if descriptions were generated for columns.

    The array contains the following fields for each column of the table:

    `database_name`
    :   Database that contains the column.

    `description`
    :   Description of the column that was generated by the stored procedure.

    `name`
    :   Name of the column.

    `schema_name`
    :   Schema that contains the column.

    `table_name`
    :   Table or view that contains the column.

`TABLE`
:   Contains an array that includes the description of the table along with general information about the table. The array consists of the
    following fields:

    `database_name`
    :   Database that contains the table.

    `description`
    :   Description of the table that was generated by the stored procedure.

    `name`
    :   Name of the table or view.

    `schema_name`
    :   Schema that contains the table.

## Access control requirements

Users must have the following privileges and roles to call the AI_GENERATE_TABLE_DESCRIPTION stored procedure:

* SELECT privilege on the table or view.
* SNOWFLAKE.CORTEX_USER database role.

## Usage notes

* Your region must support the LLM used by Snowflake Cortex to generate the descriptions. Check the
  [availability of the COMPLETE function](../../user-guide/snowflake-cortex/aisql.md). If the COMPLETE function is not supported in your region,
  you must enable [cross-region inference](../../user-guide/snowflake-cortex/cross-region-inference.md) to use the feature.

## Examples

Generate a description for view `v1`.

```sqlexample
CALL AI_GENERATE_TABLE_DESC( 'v1');
```

```output
{
  "TABLE": [
    {
      "database_name": "mydb",
      "description": " The table contains records of customer addresses. Each record includes a name and zip code.",
      "name": "v1",
      "schema_name": "sch1"
    }
  ]
}
```

Generate descriptions for the table `hr_data` and all of its columns. Use metadata only to generate the descriptions.

```sqlexample
CALL AI_GENERATE_TABLE_DESC(
  'mydb.sch1.hr_data',
  {
    'describe_columns': true,
    'use_table_data': false
  });
```

```output
{
  "COLUMNS": [
    {
      "database_name": "mydb",
      "description": "A column holding data of type DecimalType representing age values.",
      "name": "AGE",
      "schema_name": "sch1",
      "table_name": "hr_data"
    },
    {
      "database_name": "mydb",
      "description": "The first name of the employee.",
      "name": "FNAME",
      "schema_name": "sch1",
      "table_name": "hr_data"
    }
  ],
  "TABLE": [
    {
      "database_name": "mydb",
      "description": " The table contains records of employee data, specifically demographic information. Each record includes an employee's age and name.",
      "name": "hr_data",
      "schema_name": "sch1"
    }
  ]
}
```

---
title: All commands (alphabetical)
source: https://docs.snowflake.com/en/sql-reference/sql-all.md
section: SQL General Reference
---

# All commands (alphabetical)

This topic provides a list of all DDL and DML commands, as well as the SELECT command and other related commands, in alphabetical order.

| Command Name | Summary |
| --- | --- |
| **A** |  |
| [ALTER <object>](sql/alter.md) | Modifies the metadata of an account-level or database object, or the parameters for a session. |
| [ALTER ACCOUNT](sql/alter-account.md) | Modifies an account. |
| [ALTER AGENT](sql/alter-agent.md) | Modifies the properties or specification for an existing [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md). |
| [ALTER AGGREGATION POLICY](sql/alter-aggregation-policy.md) | Replaces the existing rules or comment of an [aggregation policy](../user-guide/aggregation-policies.md). |
| [ALTER ALERT](sql/alter-alert.md) | Modifies the properties of an existing alert and suspends or resumes an existing [alert](../user-guide/alerts.md). |
| [ALTER API INTEGRATION](sql/alter-api-integration.md) | Modifies the properties of an existing API integration. |
| [ALTER APPLICATION](sql/alter-application.md) | Modifies the properties of an installed Snowflake Native App. |
| [ALTER APPLICATION DROP SPECIFICATION](sql/alter-application-drop-app-spec.md) | Drops an app specification from an app. |
| [ALTER APPLICATION DROP CONFIGURATION DEFINITION](sql/alter-application-drop-configuration-definition.md) | Deletes the [app configuration definition](../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App. |
| [ALTER APPLICATION PACKAGE](sql/alter-application-package.md) | Modifies the properties of an existing application package. |
| [ALTER APPLICATION PACKAGE … MODIFY RELEASE CHANNEL](sql/alter-application-package-release-channel.md) | Modifies the release channels defined for an existing application package. |
| [ALTER APPLICATION PACKAGE … RELEASE DIRECTIVE](sql/alter-application-package-release-directive.md) | Modifies the properties of an existing application package. |
| [ALTER APPLICATION PACKAGE … VERSION](sql/alter-application-package-version.md) | Modifies the versioning of an existing application package in the Snowflake Native App Framework. |
| [ALTER APPLICATION ROLE](sql/alter-application-role.md) | Modifies the properties for an existing application role. |
| [ALTER APPLICATION … { APPROVE | DECLINE} SPECIFICATION](sql/alter-application-sequence-number.md) | Approves or declines an [app specification](../developer-guide/native-apps/requesting-app-specs.md) using the specified sequence number. |
| [ALTER APPLICATION SET SPECIFICATION](sql/alter-application-set-app-spec.md) | Creates or updates an [app specification](../developer-guide/native-apps/requesting-app-specs.md) for a Snowflake Native App. |
| [ALTER APPLICATION SET CONFIGURATION DEFINITION](sql/alter-application-set-configuration-definition.md) | Creates or updates an [app configuration](../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App. |
| [ALTER APPLICATION SET CONFIGURATION VALUE](sql/alter-application-set-configuration-value.md) | Sets a value in an [app configuration definition](../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App. |
| [ALTER APPLICATION UNSET CONFIGURATION](sql/alter-application-unset-configuration.md) | Unsets an [app configuration definition](../developer-guide/native-apps/inter-app-communication.md) for a Snowflake Native App. |
| [ALTER AUTHENTICATION POLICY](sql/alter-authentication-policy.md) | Modifies the properties of an [authentication policy](../user-guide/authentication-policies.md). |
| [ALTER BACKUP POLICY](sql/alter-backup-policy.md) | Modifies the properties of a [backup](../user-guide/backups.md) policy. |
| [ALTER BACKUP SET](sql/alter-backup-set.md) | Modifies the properties for a [backup](../user-guide/backups.md) set. |
| [ALTER CATALOG INTEGRATION](sql/alter-catalog-integration.md) | Modifies the properties of an existing [catalog integration](../user-guide/tables-iceberg.md). |
| [ALTER COMPUTE POOL](sql/alter-compute-pool.md) | Modifies the properties of an existing [compute pool](../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| [ALTER CONNECTION](sql/alter-connection.md) | Modifies the properties for an existing [connection](../user-guide/client-redirect.md). |
| [ALTER CONTACT](sql/alter-contact.md) | Modifies the properties of an existing [contact](../user-guide/contacts-using.md). |
| [ALTER CORTEX SEARCH SERVICE](sql/alter-cortex-search.md) | Suspends, resumes, or modifies the properties of an existing [Cortex Search service](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). |
| [ALTER DATABASE](sql/alter-database.md) | Modifies the properties for an existing database. |
| [ALTER DATABASE (catalog-linked)](sql/alter-database-catalog-linked.md) | Modifies the properties for an existing [catalog-linked database](../user-guide/tables-iceberg-catalog-linked-database.md). |
| [ALTER DATABASE ROLE](sql/alter-database-role.md) | Modifies the properties for an existing database role. |
| [ALTER DATASET](sql/alter-dataset.md) | Modifies a dataset by adding or dropping dataset versions. |
| [ALTER DATASET … ADD VERSION](sql/alter-dataset-add-version.md) | Adds a version to a dataset. |
| [ALTER DATASET … DROP VERSION](sql/alter-dataset-drop-version.md) | Drops a dataset version. |
| [ALTER DBT PROJECT](sql/alter-dbt-project.md) | Modifies the properties of an existing [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md). |
| [ALTER DCM PROJECT](sql/alter-dcm-project.md) | Modifies the properties of an existing [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [ALTER DYNAMIC TABLE](sql/alter-dynamic-table.md) | Modifies the properties of a [dynamic table](../user-guide/dynamic-tables-about.md). |
| [ALTER EXPERIMENT](sql/alter-experiment.md) | Modifies the properties of an existing [experiment](../developer-guide/snowflake-ml/experiments.md). |
| [ALTER EXTERNAL ACCESS INTEGRATION](sql/alter-external-access-integration.md) | Modifies the properties of an existing [external access integration](../developer-guide/external-network-access/creating-using-external-network-access.md). |
| [ALTER EXTERNAL TABLE](sql/alter-external-table.md) | Modifies the properties, columns, or constraints for an existing external table. |
| [ALTER EXTERNAL VOLUME](sql/alter-external-volume.md) | Modifies the properties for an existing [external volume](../user-guide/tables-iceberg.md). |
| [ALTER FAILOVER GROUP](sql/alter-failover-group.md) | Modifies the properties for an existing [failover group](../user-guide/account-replication-intro.md). |
| [ALTER FEATURE POLICY](sql/alter-feature-policy.md) | Alters or renames a [feature policy](../developer-guide/native-apps/ui-consumer-feature-policies.md). |
| [ALTER FILE FORMAT](sql/alter-file-format.md) | Modifies the properties for an existing file format object. |
| [ALTER FUNCTION](sql/alter-function.md) | Modifies the properties of an existing user-defined or external function. |
| [ALTER FUNCTION (DMF)](sql/alter-function-dmf.md) | Modifies the properties of an existing data metric function (DMF). |
| [ALTER FUNCTION (Snowpark Container Services)](sql/alter-function-spcs.md) | Modifies the properties of an existing [service function](../developer-guide/snowpark-container-services/working-with-services.md). |
| [ALTER GATEWAY](sql/alter-gateway.md) | Modifies the configuration of an existing [gateway](../developer-guide/snowpark-container-services/gateway.md). |
| [ALTER GIT REPOSITORY](sql/alter-git-repository.md) | Modifies the properties of a Snowflake [Git repository clone](../developer-guide/git/git-overview.md). |
| [ALTER ICEBERG TABLE](sql/alter-iceberg-table.md) | Modifies properties such as clustering options and tags for an existing [Apache Iceberg™ table](../user-guide/tables-iceberg.md). |
| [ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)](sql/alter-iceberg-table-alter-column-set-data-type.md) | Modifies (evolves) a [structured type](data-types-structured.md) column in a Snowflake-managed [Apache Iceberg™ table](../user-guide/tables-iceberg.md). |
| [ALTER ICEBERG TABLE … CONVERT TO MANAGED](sql/alter-iceberg-table-convert-to-managed.md) | Converts an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) that uses an external Iceberg catalog into a table that uses Snowflake as the catalog (a Snowflake-managed Iceberg table). |
| [ALTER ICEBERG TABLE … REFRESH](sql/alter-iceberg-table-refresh.md) | Refreshes the metadata for an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) that uses an external Iceberg catalog. |
| [ALTER INTEGRATION](sql/alter-integration.md) | Modifies the properties for an existing integration. |
| [ALTER JOIN POLICY](sql/alter-join-policy.md) | Replaces the existing rules or comment for a [join policy](../user-guide/join-policies.md). |
| [ALTER LISTING](sql/alter-listing.md) | Modifies the properties of a [listings](../collaboration/collaboration-listings-about.md) with an inline YAML manifest, or from a file located in a stage location. |
| [ALTER MAINTENANCE POLICY](sql/alter-maintenance-policy.md) | Modifies an existing [maintenance policy](../developer-guide/native-apps/consumer-maintenance-policies.md). |
| [ALTER MASKING POLICY](sql/alter-masking-policy.md) | Replaces the existing masking policy rules with new rules or a new comment and allows the renaming of a masking policy. |
| [ALTER MATERIALIZED VIEW](sql/alter-materialized-view.md) | Alters a materialized view in the current/specified schema. |
| [ALTER MODEL](sql/alter-model.md) | Modifies the properties for an existing model, including its name, tags, default version, or comment. |
| [ALTER MODEL … ADD VERSION](sql/alter-model-add-version.md) | Adds a new version to an existing model from an existing model version. |
| [ALTER MODEL … DROP VERSION](sql/alter-model-drop-version.md) | Removes a version from the specified machine learning model. |
| [ALTER MODEL … MODIFY VERSION](sql/alter-model-modify-version.md) | Modifies a version of a model, changing the version’s comment or metadata. |
| [ALTER MODEL MONITOR](sql/alter-model-monitor.md) | Modifies the properties of a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md). |
| [ALTER NETWORK POLICY](sql/alter-network-policy.md) | Modifies the properties for an existing network policy. |
| [ALTER NETWORK RULE](sql/alter-network-rule.md) | Modifies an existing network rule. |
| [ALTER NOTEBOOK](sql/alter-notebook.md) | Modifies the properties of an existing [notebook](../user-guide/ui-snowsight/notebooks.md). |
| [ALTER NOTIFICATION INTEGRATION](sql/alter-notification-integration.md) | Modifies the properties for an existing notification integration. |
| [ALTER NOTIFICATION INTEGRATION (email)](sql/alter-notification-integration-email.md) | Modifies the properties for an existing notification integration for [sending email messages](../user-guide/notifications/email-notifications.md). |
| [ALTER NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](sql/alter-notification-integration-queue-inbound-azure.md) | Modifies the properties for an existing notification integration for receiving messages from an Azure Event Grid topic. |
| [ALTER NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](sql/alter-notification-integration-queue-inbound-gcp.md) | Modifies the properties for an existing notification integration for receiving messages from a Google Pub/Sub topic. |
| [ALTER NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](sql/alter-notification-integration-queue-outbound-aws.md) | Modifies the properties for an existing notification integration for [sending a message to an Amazon SNS topic](../user-guide/notifications/creating-notification-integration-amazon-sns.md). |
| [ALTER NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](sql/alter-notification-integration-queue-outbound-azure.md) | Modifies the properties for an existing notification integration for [sending a message to an Azure Event Grid topic](../user-guide/notifications/creating-notification-integration-azure-event-grid.md). |
| [ALTER NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](sql/alter-notification-integration-queue-outbound-gcp.md) | Modifies the properties for an existing notification integration for [sending a message to a Google Pub/Sub topic](../user-guide/notifications/creating-notification-integration-google-pubsub.md). |
| [ALTER NOTIFICATION INTEGRATION (webhooks)](sql/alter-notification-integration-webhooks.md) | Modifies the properties for an existing notification integration for a [webhook](../user-guide/notifications/webhook-notifications.md). |
| [ALTER OPENFLOW DATA PLANE](sql/alter-oflow-data-plane.md) | Modifies an Openflow data plane integration. |
| [ALTER ONLINE FEATURE TABLE](sql/alter-online-feature-table.md) | Modifies the properties of an existing [online feature table](sql/create-online-feature-table.md). |
| [ALTER ORGANIZATION ACCOUNT](sql/alter-organization-account.md) | Modifies the properties of an existing [organization account](../user-guide/organization-accounts.md). |
| [ALTER ORGANIZATION PROFILE](sql/alter-organization-profile.md) | Modifies the properties of an [organization profile](../user-guide/collaboration/organization-profiles/org-profiles-create-manage.md) using an inline YAML manifest, or using a YAML manifest file located in a stage location. |
| [ALTER ORGANIZATION USER](sql/alter-organization-user.md) | Modifies the properties of an existing [organization user](../user-guide/organization-users.md). |
| [ALTER ORGANIZATION USER GROUP](sql/alter-organization-user-group.md) | Modifies the properties of an existing [organization user group](../user-guide/organization-users.md). |
| [ALTER PACKAGES POLICY](sql/alter-packages-policy.md) | Modifies the properties for an existing [packages policy](../developer-guide/udf/python/packages-policy.md). |
| [ALTER PASSWORD POLICY](sql/alter-password-policy.md) | Modifies the properties for an existing password policy. |
| [ALTER PIPE](sql/alter-pipe.md) | Modifies a limited set of properties for an existing pipe object. |
| [ALTER POSTGRES INSTANCE](sql/alter-postgres-instance.md) | Modifies the properties of an existing [Snowflake Postgres instance](../user-guide/snowflake-postgres/about.md). |
| [ALTER PRIVACY POLICY](sql/alter-privacy-policy.md) | Modifies the properties of an existing [privacy policy](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md). |
| [ALTER PROCEDURE](sql/alter-procedure.md) | Modifies the properties for an existing stored procedure. |
| [ALTER PROJECTION POLICY](sql/alter-projection-policy.md) | Replaces the existing [projection policy](../user-guide/projection-policies.md) rules with new rules or a new comment and allows the renaming of a projection policy. |
| [ALTER REPLICATION GROUP](sql/alter-replication-group.md) | Modifies the properties for an existing [replication group](../user-guide/account-replication-intro.md). |
| [ALTER RESOURCE MONITOR](sql/alter-resource-monitor.md) | Modifies the properties and triggers for an existing [resource monitor](../user-guide/resource-monitors.md). |
| [ALTER ROLE](sql/alter-role.md) | Modifies the properties for an existing [custom role](../user-guide/security-access-control-overview.md). |
| [ALTER ROW ACCESS POLICY](sql/alter-row-access-policy.md) | Modifies the properties for an existing row access policy, including renaming the policy or replacing the policy rules. |
| [ALTER SCHEMA](sql/alter-schema.md) | Modifies the properties for an existing schema, including renaming the schema or swapping it with another schema, and changing the Time Travel data retention period (if you are using Snowflake Enterprise Edition or higher). |
| [ALTER SECRET](sql/alter-secret.md) | Modifies the properties of an existing secret. |
| [ALTER SECURITY INTEGRATION](sql/alter-security-integration.md) | Modifies the properties for an existing security integration. |
| [ALTER SECURITY INTEGRATION (External API Authentication)](sql/alter-security-integration-api-auth.md) | Modifies the properties of an existing security integration created for External API Authentication. |
| [ALTER SECURITY INTEGRATION (AWS IAM Authentication)](sql/alter-security-integration-aws-iam.md) | Modifies the properties of an existing security integration created for authenticating with AWS IAM. |
| [ALTER SECURITY INTEGRATION (External OAuth)](sql/alter-security-integration-oauth-external.md) | Modifies the properties of an existing security integration created for External OAuth. |
| [ALTER SECURITY INTEGRATION (Snowflake OAuth)](sql/alter-security-integration-oauth-snowflake.md) | Modifies the properties of an existing security integration created for a Snowflake OAuth client. |
| [ALTER SECURITY INTEGRATION (SAML2)](sql/alter-security-integration-saml2.md) | Modifies the properties of an existing SAML2 security integration. |
| [ALTER SECURITY INTEGRATION (SCIM)](sql/alter-security-integration-scim.md) | Modifies the properties of an existing SCIM security integration. |
| [ALTER SEMANTIC VIEW](sql/alter-semantic-view.md) | Modifies the comment for an existing [semantic view](../user-guide/views-semantic/overview.md) or renames a semantic view. |
| [ALTER SEQUENCE](sql/alter-sequence.md) | Modifies the properties for an existing sequence. |
| [ALTER SERVICE](sql/alter-service.md) | Modifies [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) configuration, upgrades the code for the service, and allows you to suspend or resume a service. |
| [ALTER SESSION](sql/alter-session.md) | Sets parameters that change the behavior for the current session. |
| [ALTER SESSION POLICY](sql/alter-session-policy.md) | Modifies the properties for an existing session policy. |
| [ALTER SHARE](sql/alter-share.md) | Modifies the properties for an existing [share](../user-guide/data-sharing-intro.md). |
| [ALTER SNAPSHOT](sql/alter-snapshot.md) | Modifies the properties of an existing [snapshot of a block storage volume](../developer-guide/snowpark-container-services/block-storage-volume.md). |
| [ALTER SNAPSHOT POLICY — Deprecated](sql/alter-snapshot-policy.md) | Modifies the properties of a [snapshot](../user-guide/backups.md) policy. |
| [ALTER SNAPSHOT SET — Deprecated](sql/alter-snapshot-set.md) | Modifies the properties for a [snapshot](../user-guide/backups.md) set. |
| [ALTER STAGE](sql/alter-stage.md) | Modifies the properties for an existing named internal or external stage. |
| [ALTER STORAGE INTEGRATION](sql/alter-storage-integration.md) | Modifies the properties for an existing storage integration. |
| [ALTER STORAGE LIFECYCLE POLICY](sql/alter-storage-lifecycle-policy.md) | Modifies the properties of an existing [storage lifecycle policy](../user-guide/storage-management/storage-lifecycle-policies.md). |
| [ALTER STREAM](sql/alter-stream.md) | Modifies the properties, columns, or constraints for an existing [stream](../user-guide/streams-intro.md). |
| [ALTER STREAMLIT](sql/alter-streamlit.md) | Modifies the properties of an existing Streamlit object. |
| [ALTER TABLE](sql/alter-table.md) | Modifies the properties, columns, or constraints for an existing table. |
| [ALTER TABLE … ALTER COLUMN](sql/alter-table-column.md) | This topic describes how to modify one or more column properties for a table using an `ALTER COLUMN` clause in a [ALTER TABLE](sql/alter-table.md) statement. |
| [ALTER TABLE (event tables)](sql/alter-table-event-table.md) | Modifies the properties, columns, or constraints for an existing [event table](../developer-guide/logging-tracing/event-table-setting-up.md). |
| [ALTER TAG](sql/alter-tag.md) | Modifies the properties for an existing tag, including renaming the tag and setting a masking policy on a tag. |
| [ALTER TASK](sql/alter-task.md) | Modifies the properties for an existing task. |
| [ALTER TYPE](sql/alter-type.md) | Modifies the properties for an existing [user-defined type](data-types-user-defined.md). |
| [ALTER USER](sql/alter-user.md) | Modifies the properties and object/session parameters for an existing user in the system. |
| [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](sql/alter-user-add-programmatic-access-token.md) | Creates a [programmatic access token](../user-guide/programmatic-access-tokens.md) for a user. |
| [ALTER USER … MODIFY PROGRAMMATIC ACCESS TOKEN (PAT)](sql/alter-user-modify-programmatic-access-token.md) | Changes the name of a [programmatic access token](../user-guide/programmatic-access-tokens.md) or a property of the token. |
| [ALTER USER … REMOVE PROGRAMMATIC ACCESS TOKEN (PAT)](sql/alter-user-remove-programmatic-access-token.md) | Revokes a [programmatic access token](../user-guide/programmatic-access-tokens.md) for a user. |
| [ALTER USER … ROTATE PROGRAMMATIC ACCESS TOKEN (PAT)](sql/alter-user-rotate-programmatic-access-token.md) | Rotates [programmatic access token](../user-guide/programmatic-access-tokens.md), generating a new token secret with an extended expiration time, and expiring the existing token secret. |
| [ALTER VIEW](sql/alter-view.md) | Modifies the properties for an existing view. |
| [ALTER WAREHOUSE](sql/alter-warehouse.md) | Suspends or resumes a [virtual warehouse](../user-guide/warehouses-overview.md), or aborts all queries (and other SQL statements) for a warehouse. |
| **B** |  |
| [BEGIN](sql/begin.md) | Begins a transaction in the current session. |
| **C** |  |
| [CALL](sql/call.md) | Calls a [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md). |
| [CALL (with anonymous procedure)](sql/call-with.md) | Creates and calls an anonymous procedure that is like a [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md) but is not stored for later use. |
| [COMMENT](sql/comment.md) | Adds a comment or overwrites an existing comment for an existing object. |
| [COMMIT](sql/commit.md) | Commits an open transaction in the current session. |
| [COPY FILES](sql/copy-files.md) | Copy files from a source location to an output stage. |
| [COPY INTO <location>](sql/copy-into-location.md) | Unloads data from a table (or query) into one or more files in one of the following locations. |
| [COPY INTO <table>](sql/copy-into-table.md) | Loads data from files to an existing table. |
| [CREATE <object>](sql/create.md) | Creates a new object of the specified type. |
| [CREATE ACCOUNT](sql/create-account.md) | Creates a new account in your organization. |
| [CREATE AGENT](sql/create-agent.md) | Creates a new [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md) object with the specified attributes and specification. |
| [CREATE AGGREGATION POLICY](sql/create-aggregation-policy.md) | Creates a new [aggregation policy](../user-guide/aggregation-policies.md) in the current/specified schema or replaces an existing aggregation policy. |
| [CREATE ALERT](sql/create-alert.md) | Creates a new [alert](../user-guide/alerts.md) in the current schema. |
| [CREATE API INTEGRATION](sql/create-api-integration.md) | Creates a new API integration object in the account or replaces an existing API integration. |
| [CREATE APPLICATION](sql/create-application.md) | Creates a Snowflake Native App based on an application package or listing. |
| [CREATE APPLICATION PACKAGE](sql/create-application-package.md) | Creates a new application package that contains the data content and application logic of Snowflake Native App. |
| [CREATE APPLICATION ROLE](sql/create-application-role.md) | Creates a new application role or replaces an existing application role. |
| [CREATE AUTHENTICATION POLICY](sql/create-authentication-policy.md) | Creates a new [authentication policy](../user-guide/authentication-policies.md) in the current or specified schema or replaces an existing authentication policy. |
| [CREATE BACKUP POLICY](sql/create-backup-policy.md) | Creates a [backup](../user-guide/backups.md) policy. |
| [CREATE BACKUP SET](sql/create-backup-set.md) | Creates a [backup](../user-guide/backups.md) set for a table, a schema, or a database. |
| [CREATE CATALOG INTEGRATION](sql/create-catalog-integration.md) | Creates a new [catalog integration](../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) in the account or replaces an existing catalog integration. |
| [CREATE CATALOG INTEGRATION (AWS Glue)](sql/create-catalog-integration-glue.md) | Creates a new [catalog integration](../user-guide/tables-iceberg.md) in the account or replaces an existing catalog integration for [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) that use AWS Glue as the catalog. |
| [CREATE CATALOG INTEGRATION (Object storage)](sql/create-catalog-integration-object-storage.md) | Creates a new [catalog integration](../user-guide/tables-iceberg.md) in the account or replaces an existing catalog integration for the following sources. |
| [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](sql/create-catalog-integration-open-catalog.md) | Creates a new [catalog integration](../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) that integrate with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) in the account or replaces an existing catalog integration. |
| [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](sql/create-catalog-integration-rest.md) | Creates a new [catalog integration](../user-guide/tables-iceberg.md) in the account or replaces an existing catalog integration for [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) managed in a remote catalog that complies with the open source [Apache Iceberg™ REST OpenAPI specification](https://github.com/apache/iceberg/blob/main/open-api/rest-catalog-open-api.yaml). |
| [CREATE CATALOG INTEGRATION (SAP® Business Data Cloud)](sql/create-catalog-integration-sap.md) | Creates a new catalog integration in the account or replaces an existing catalog integration for SAP® Business Data Cloud to interact with SAP® Data Products managed in the SAP® Business Data Cloud object store. |
| [CREATE <object> … CLONE](sql/create-clone.md) | Creates a copy of an existing object in the system. |
| [CREATE COMPUTE POOL](sql/create-compute-pool.md) | Creates a new [compute pool](../developer-guide/snowpark-container-services/working-with-compute-pool.md) in the current account. |
| [CREATE CONNECTION](sql/create-connection.md) | Creates a new [connection](../user-guide/client-redirect.md) in the account. |
| [CREATE CONTACT](sql/create-contact.md) | Creates a new [contact](../user-guide/contacts-using.md) or replaces an existing contact. |
| [CREATE CORTEX SEARCH SERVICE](sql/create-cortex-search.md) | Creates a new [Cortex Search service](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) or replaces an existing one. |
| [CREATE DATA METRIC FUNCTION](sql/create-data-metric-function.md) | Creates a new data metric function (DMF) in the current or specified schema, or replaces an existing data metric function. |
| [CREATE DATABASE](sql/create-database.md) | Creates a new database in the system. |
| [CREATE DATABASE (catalog-linked)](sql/create-database-catalog-linked.md) | Creates a new [catalog-linked database](../user-guide/tables-iceberg-catalog-linked-database.md) for Apache Iceberg™ tables that use an external Iceberg REST catalog. |
| [CREATE DATABASE ROLE](sql/create-database-role.md) | Create a new [database role](../user-guide/security-access-control-considerations.md) or replace an existing database role in the system. |
| [CREATE DATASET](sql/create-dataset.md) | Creates a new [machine learning dataset](../developer-guide/snowflake-ml/dataset.md) in the current schema or the schema that you specify. |
| [CREATE DBT PROJECT](sql/create-dbt-project.md) | Creates a new [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md) or replaces an existing dbt project. |
| [CREATE DCM PROJECT](sql/create-dcm-project.md) | Creates a new [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md) or replaces an existing DCM project. |
| [CREATE DYNAMIC TABLE](sql/create-dynamic-table.md) | Creates a [dynamic table](../user-guide/dynamic-tables-about.md), based on a specified query. |
| [CREATE EVENT TABLE](sql/create-event-table.md) | Creates an [event table](../developer-guide/logging-tracing/event-table-setting-up.md) that captures events, including logged messages from functions and procedures. |
| [CREATE EXPERIMENT](sql/create-experiment.md) | Creates a new [experiment](../developer-guide/snowflake-ml/experiments.md) or replaces an existing experiment. |
| [CREATE EXTERNAL ACCESS INTEGRATION](sql/create-external-access-integration.md) | Creates an [external access integration](../developer-guide/external-network-access/creating-using-external-network-access.md) for access to external network locations from a UDF or procedure handler. |
| [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) | Creates a new [external function](external-functions.md). |
| [CREATE EXTERNAL TABLE](sql/create-external-table.md) | Creates a new [external table](../user-guide/tables-external-intro.md) in the current or specified schema or replaces an existing external table. |
| [CREATE EXTERNAL VOLUME](sql/create-external-volume.md) | Creates a new [external volume](../user-guide/tables-iceberg.md) for [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) in the account or replaces an existing external volume. |
| [CREATE FAILOVER GROUP](sql/create-failover-group.md) | Creates a new [failover group](../user-guide/account-replication-intro.md) of specified objects in the system. |
| [CREATE FEATURE POLICY](sql/create-feature-policy.md) | Creates a new [feature policy](../developer-guide/native-apps/ui-consumer-feature-policies.md). |
| [CREATE FILE FORMAT](sql/create-file-format.md) | Creates a named file format that describes a set of staged data to access or load into Snowflake tables. |
| [CREATE FUNCTION](sql/create-function.md) | Creates a new [UDF (user-defined function)](../developer-guide/udf/udf-overview.md). |
| [CREATE FUNCTION (Snowpark Container Services)](sql/create-function-spcs.md) | Creates a [service function](../developer-guide/snowpark-container-services/working-with-services.md). |
| [CREATE GATEWAY](sql/create-gateway.md) | Creates a new [gateway](../developer-guide/snowpark-container-services/gateway.md) in the current schema. |
| [CREATE GIT REPOSITORY](sql/create-git-repository.md) | Creates a Snowflake Git repository clone in the schema or replaces an existing Git repository clone. |
| [CREATE HYBRID TABLE](sql/create-hybrid-table.md) | Creates a new hybrid table in the current/specified schema or replaces an existing table. |
| [CREATE ICEBERG TABLE](sql/create-iceberg-table.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) in the current/specified schema. |
| [CREATE ICEBERG TABLE (AWS Glue as the Iceberg catalog)](sql/create-iceberg-table-aws-glue.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) in the current/specified schema using an Iceberg table that is registered in the AWS Glue Data Catalog. |
| [CREATE ICEBERG TABLE (Delta files in object storage)](sql/create-iceberg-table-delta.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) in the current/specified schema using Delta table files in object storage (external cloud storage). |
| [CREATE ICEBERG TABLE (Iceberg files in object storage)](sql/create-iceberg-table-iceberg-files.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) in the current/specified schema using Iceberg files in object storage (external cloud storage). |
| [CREATE ICEBERG TABLE (Iceberg REST catalog)](sql/create-iceberg-table-rest.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) in the current/specified schema for an Iceberg REST catalog. |
| [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](sql/create-iceberg-table-snowflake.md) | Creates or replaces an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) that uses [Snowflake as the Iceberg catalog](../user-guide/tables-iceberg.md) in the current/specified schema. |
| [CREATE IMAGE REPOSITORY](sql/create-image-repository.md) | Creates a new [image repository](../developer-guide/snowpark-container-services/working-with-registry-repository.md) in the current schema. |
| [CREATE INDEX](sql/create-index.md) | Creates a new secondary index in an existing [hybrid table](../user-guide/tables-hybrid.md) and populates the index with data. |
| [CREATE INTEGRATION](sql/create-integration.md) | Creates a new integration in the system or replaces an existing integration. |
| [CREATE INTERACTIVE TABLE](sql/create-interactive-table.md) | Creates a new [interactive table](../user-guide/interactive.md) in the current/specified schema or replaces an existing table. |
| [CREATE INTERACTIVE WAREHOUSE](sql/create-interactive-warehouse.md) | Creates a new interactive [virtual warehouse](../user-guide/warehouses-overview.md) optimized for low-latency, high-concurrency workloads with interactive tables. |
| [CREATE JOIN POLICY](sql/create-join-policy.md) | Creates a new [join policy](../user-guide/join-policies.md) in the current/specified schema or replaces an existing join policy. |
| [CREATE LISTING](sql/create-listing.md) | Create a free listing to share directly with specific consumers, with an inline YAML manifest, or from a file located in a stage location. |
| [CREATE MAINTENANCE POLICY](sql/create-maintenance-policy.md) | Creates a new [maintenance policy](../developer-guide/native-apps/consumer-maintenance-policies.md) in the current or specified schema. |
| [CREATE MANAGED ACCOUNT](sql/create-managed-account.md) | Creates a new managed account. |
| [CREATE MASKING POLICY](sql/create-masking-policy.md) | Creates a new masking policy in the current/specified schema or replaces an existing masking policy. |
| [CREATE MATERIALIZED VIEW](sql/create-materialized-view.md) | Creates a new materialized view in the current/specified schema, based on a query of an existing table, and populates the view with data. |
| [CREATE MCP SERVER](sql/create-mcp-server.md) | Creates a new MCP (Model Context Protocol) server or replaces an existing MCP server. |
| [CREATE MODEL](sql/create-model.md) | Creates a new machine learning model in the current/specified schema or replaces an existing model. |
| [CREATE MODEL MONITOR](sql/create-model-monitor.md) | Create or replace a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md) in the current or specified schema. |
| [CREATE NETWORK POLICY](sql/create-network-policy.md) | Creates a network policy or replaces an existing network policy. |
| [CREATE NETWORK RULE](sql/create-network-rule.md) | Creates a network rule or replaces an existing network rule. |
| [CREATE NOTEBOOK](sql/create-notebook.md) | Creates a new [Snowflake notebook](../user-guide/ui-snowsight/notebooks.md) or replaces an existing notebook. |
| [CREATE NOTEBOOK PROJECT](sql/create-notebook-project.md) |  |
| [CREATE NOTIFICATION INTEGRATION](sql/create-notification-integration.md) | Creates a new notification integration in the account or replaces an existing integration. |
| [CREATE NOTIFICATION INTEGRATION (email)](sql/create-notification-integration-email.md) | Creates a new notification integration in the account or replaces an existing integration for [sending email messages](../user-guide/notifications/email-notifications.md). |
| [CREATE NOTIFICATION INTEGRATION (inbound from an Azure Event Grid topic)](sql/create-notification-integration-queue-inbound-azure.md) | Creates a new notification integration in the account or replaces an existing integration for receiving messages from an Azure Event Grid topic. |
| [CREATE NOTIFICATION INTEGRATION (inbound from a Google Pub/Sub topic)](sql/create-notification-integration-queue-inbound-gcp.md) | Creates a new notification integration in the account or replaces an existing integration for receiving messages from a Google Pub/Sub topic. |
| [CREATE NOTIFICATION INTEGRATION (outbound to an Amazon SNS topic)](sql/create-notification-integration-queue-outbound-aws.md) | Creates a new notification integration in the account or replaces an existing integration for [sending a message to an Amazon SNS topic](../user-guide/notifications/creating-notification-integration-amazon-sns.md). |
| [CREATE NOTIFICATION INTEGRATION (outbound to an Azure Event Grid topic)](sql/create-notification-integration-queue-outbound-azure.md) | Creates a new notification integration in the account or replaces an existing integration for [sending a message to an Azure Event Grid topic](../user-guide/notifications/creating-notification-integration-azure-event-grid.md). |
| [CREATE NOTIFICATION INTEGRATION (outbound to a Google Pub/Sub topic)](sql/create-notification-integration-queue-outbound-gcp.md) | Creates a new notification integration in the account or replaces an existing integration for [sending a message to a Google Pub/Sub topic](../user-guide/notifications/creating-notification-integration-google-pubsub.md). |
| [CREATE NOTIFICATION INTEGRATION (webhooks)](sql/create-notification-integration-webhooks.md) | Creates a new notification integration or replaces an existing integration for a [webhook](../user-guide/notifications/webhook-notifications.md). |
| [CREATE ONLINE FEATURE TABLE](sql/create-online-feature-table.md) | Creates a new online feature table in the current/specified schema or replaces an existing table. |
| [CREATE OR ALTER <object>](sql/create-or-alter.md) | CREATE OR ALTER commands are DDL commands that combine the functionality of the CREATE command and the ALTER command, enabling you to define an object using the syntax supported by the CREATE <object> command with the limitations of the ALTER <object> command. |
| [CREATE ORGANIZATION ACCOUNT](sql/create-organization-account.md) | Creates a new [organization account](../user-guide/organization-accounts.md). |
| [CREATE ORGANIZATION LISTING](sql/create-organization-listing.md) | Create an organization listing to share data products securely within your organization. |
| [CREATE ORGANIZATION PROFILE](sql/create-organization-profile.md) | Create the organization profile that forms part of the Uniform Listing Locator (ULL) used to publish organizational listings or query organizational listing information without mounting the listing. |
| [CREATE ORGANIZATION USER](sql/create-organization-user.md) | Creates a new [organization user](../user-guide/organization-users.md). |
| [CREATE ORGANIZATION USER GROUP](sql/create-organization-user-group.md) | Creates a new [organization user group](../user-guide/organization-users.md). |
| [CREATE PACKAGES POLICY](sql/create-packages-policy.md) | Creates a new [packages policy](../developer-guide/udf/python/packages-policy.md) or replaces an existing packages policy. |
| [CREATE PASSWORD POLICY](sql/create-password-policy.md) | Creates a new password policy or replaces an existing password policy. |
| [CREATE PIPE](sql/create-pipe.md) | Creates a new pipe in the system for defining the [COPY INTO <table>](sql/copy-into-table.md) statement used by [Snowpipe](../user-guide/data-load-snowpipe-intro.md) to load data from an ingestion queue, or by [Snowpipe Streaming with high-performance architecture](../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) to load data from a streaming source directly into tables. |
| [CREATE POSTGRES INSTANCE](sql/create-postgres-instance.md) | Creates a new [Snowflake Postgres instance](../user-guide/snowflake-postgres/about.md) or creates a fork of an existing instance. |
| [CREATE PRIVACY POLICY](sql/create-privacy-policy.md) | Creates a new [privacy policy](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) or replaces an existing privacy policy. |
| [CREATE PROCEDURE](sql/create-procedure.md) | Creates a new [stored procedure](../developer-guide/stored-procedure/stored-procedures-usage.md). |
| [CREATE PROJECTION POLICY](sql/create-projection-policy.md) | Creates a new [projection policy](../user-guide/projection-policies.md) in the current/specified schema or replaces an existing projection policy. |
| [CREATE PROVISIONED THROUGHPUT](sql/create-provisioned-throughput.md) | Creates a new [Provisioned Throughput resource](../user-guide/snowflake-cortex/provisioned-throughput.md) or replaces an existing one. |
| [CREATE REPLICATION GROUP](sql/create-replication-group.md) | Creates a new [replication group](../user-guide/account-replication-intro.md) of specified objects in the system. |
| [CREATE RESOURCE MONITOR](sql/create-resource-monitor.md) | Creates a new [resource monitor](../user-guide/resource-monitors.md). |
| [CREATE ROLE](sql/create-role.md) | Create a new role or replace an existing role in the system. |
| [CREATE ROW ACCESS POLICY](sql/create-row-access-policy.md) | Creates a new row access policy in the current/specified schema or replaces an existing row access policy. |
| [CREATE SCHEMA](sql/create-schema.md) | Creates a new schema in the current database. |
| [CREATE SECRET](sql/create-secret.md) | Creates a new secret in the current or specified schema or replaces an existing secret. |
| [CREATE SECURITY INTEGRATION](sql/create-security-integration.md) | Creates a new security integration in the account or replaces an existing integration. |
| [CREATE SECURITY INTEGRATION (External API Authentication)](sql/create-security-integration-api-auth.md) | Creates a new security integration for external API Authentication in the account or replaces an existing integration. |
| [CREATE SECURITY INTEGRATION (AWS IAM Authentication)](sql/create-security-integration-aws-iam.md) | Creates a new security integration for external authentication using Amazon Web Services (AWS) Identity and Access Management (IAM). |
| [CREATE SECURITY INTEGRATION (External OAuth)](sql/create-security-integration-oauth-external.md) | Creates a new External OAuth security integration in the account or replaces an existing integration. |
| [CREATE SECURITY INTEGRATION (Snowflake OAuth)](sql/create-security-integration-oauth-snowflake.md) | Creates a new Snowflake OAuth security integration in the account or replaces an existing integration. |
| [CREATE SECURITY INTEGRATION (SAML2)](sql/create-security-integration-saml2.md) | Creates a new SAML2 security integration in the account or replaces an existing integration. |
| [CREATE SECURITY INTEGRATION (SCIM)](sql/create-security-integration-scim.md) | Creates a new SCIM security integration in the account or replaces an existing integration. |
| [CREATE SEMANTIC VIEW](sql/create-semantic-view.md) | Creates a new [semantic view](../user-guide/views-semantic/overview.md) in the current/specified schema. |
| [CREATE SEQUENCE](sql/create-sequence.md) | Creates a new sequence, which can be used for generating sequential, unique numbers. |
| [CREATE SERVICE](sql/create-service.md) | Creates a new [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) in the current schema. |
| [CREATE SESSION POLICY](sql/create-session-policy.md) | Creates a new session policy or replaces an existing session policy. |
| [CREATE SHARE](sql/create-share.md) | Creates a new, empty [share](../user-guide/data-sharing-intro.md). |
| [CREATE SNAPSHOT](sql/create-snapshot.md) | Creates or replaces a [snapshot of a block storage volume](../developer-guide/snowpark-container-services/block-storage-volume.md) for a specified volume and service instance. |
| [CREATE SNAPSHOT POLICY — Deprecated](sql/create-snapshot-policy.md) | Creates a [snapshot](../user-guide/backups.md) policy. |
| [CREATE SNAPSHOT SET — Deprecated](sql/create-snapshot-set.md) | Creates a [snapshot](../user-guide/backups.md) set for a table, a schema, or a database. |
| [CREATE STAGE](sql/create-stage.md) | Creates a new named *internal* or *external* stage to use for loading data from files into Snowflake tables and unloading data from tables into files. |
| [CREATE STORAGE INTEGRATION](sql/create-storage-integration.md) | Creates a new storage integration in the account or replaces an existing integration. |
| [CREATE STORAGE LIFECYCLE POLICY](sql/create-storage-lifecycle-policy.md) | Creates a new [storage lifecycle policy](../user-guide/storage-management/storage-lifecycle-policies.md) in the current or specified schema, or replaces an existing policy. |
| [CREATE STREAM](sql/create-stream.md) | Creates a new stream in the current/specified schema or replaces an existing [stream](../user-guide/streams-intro.md). |
| [CREATE STREAMLIT](sql/create-streamlit.md) | Creates a new Streamlit object in Snowflake or replaces an existing Streamlit object in the same schema. |
| [CREATE TABLE](sql/create-table.md) | Creates a new table in the current/specified schema, replaces an existing table, or alters an existing table. |
| [CREATE | ALTER TABLE … CONSTRAINT](sql/create-table-constraint.md) | This topic describes how to create constraints by specifying a CONSTRAINT clause in a [CREATE TABLE](sql/create-table.md), [CREATE HYBRID TABLE](sql/create-hybrid-table.md), or [ALTER TABLE](sql/alter-table.md) statement. |
| [CREATE TAG](sql/create-tag.md) | Creates a new tag or replaces an existing tag in the system. |
| [CREATE TASK](sql/create-task.md) | Creates a new [task](../user-guide/tasks-intro.md) in the current/specified schema or replaces an existing task. |
| [CREATE TYPE](sql/create-type.md) | Creates a [user-defined type](data-types-user-defined.md). |
| [CREATE USER](sql/create-user.md) | Creates a new user or replaces an existing user in the system. |
| [CREATE OR ALTER VERSIONED SCHEMA](sql/create-versioned-schema.md) | Creates a new versioned schema or modifies an existing versioned schema. |
| [CREATE VIEW](sql/create-view.md) | Creates a new view in the current/specified schema, based on a query of one or more existing tables (or any other valid query expression). |
| [CREATE WAREHOUSE](sql/create-warehouse.md) | Creates a new [virtual warehouse](../user-guide/warehouses-overview.md) in the system. |
| **D** |  |
| [DELETE](sql/delete.md) | Remove rows from a table. |
| [DESCRIBE <object>](sql/desc.md) | Describes the details for the specified object. |
| [DESCRIBE AGENT](sql/desc-agent.md) | Describes the properties of a [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md). |
| [DESCRIBE AGGREGATION POLICY](sql/desc-aggregation-policy.md) | Describes the details about an [aggregation policy](../user-guide/aggregation-policies.md), including the creation date, name, and the SQL expression. |
| [DESCRIBE ALERT](sql/desc-alert.md) | Describes the properties of an [alert](../user-guide/alerts.md). |
| [DESCRIBE APPLICATION](sql/desc-application.md) | Displays information about a Snowflake Native App. |
| [DESCRIBE APPLICATION PACKAGE](sql/desc-application-package.md) | Displays information about an application package. |
| [DESCRIBE AUTHENTICATION POLICY](sql/desc-authentication-policy.md) | Describes the properties of an [authentication policy](../user-guide/authentication-policies.md). |
| [DESCRIBE AVAILABLE LISTING](sql/desc-available-listing.md) | Describes the columns in the listings that are available to the user who runs the command. |
| [DESCRIBE AVAILABLE ORGANIZATION PROFILE](sql/desc-available-organization-profile.md) | Describes the active organization profile that can be associated with organizational listings. |
| [DESCRIBE BACKUP POLICY](sql/desc-backup-policy.md) | Describes a specific [backup policy](../user-guide/backups.md). |
| [DESCRIBE BACKUP SET](sql/desc-backup-set.md) | Describes a specific [backup set](../user-guide/backups.md). |
| [DESCRIBE CATALOG INTEGRATION](sql/desc-catalog-integration.md) | Describes the properties of a [catalog integration](../user-guide/tables-iceberg.md). |
| [DESCRIBE COMPUTE POOL](sql/desc-compute-pool.md) | Describes the properties of a [compute pool](../developer-guide/snowpark-container-services/working-with-compute-pool.md). |
| [DESCRIBE CONFIGURATION](sql/desc-configuration.md) | Describes the properties of a [configuration](../developer-guide/native-apps/inter-app-communication.md). |
| [DESCRIBE CORTEX SEARCH SERVICE](sql/desc-cortex-search.md) | Describes the properties of a [Cortex Search service](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). |
| [DESCRIBE DATABASE](sql/desc-database.md) | Describes the database. |
| [DESCRIBE DBT PROJECT](sql/desc-dbt-project.md) | Describes the properties of a [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md). |
| [DESCRIBE DCM PROJECT](sql/desc-dcm-project.md) | Describes the properties of a [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [DESCRIBE DYNAMIC TABLE](sql/desc-dynamic-table.md) | Describes the columns in a [dynamic table](../user-guide/dynamic-tables-about.md). |
| [DESCRIBE EVENT TABLE](sql/desc-event-table.md) | Describes the columns in an [event table](../developer-guide/logging-tracing/event-table-setting-up.md). |
| [DESCRIBE EXTERNAL TABLE](sql/desc-external-table.md) | Describes the VALUE column and virtual columns in an external table. |
| [DESCRIBE EXTERNAL VOLUME](sql/desc-external-volume.md) | Describes the properties of an [external volume](../user-guide/tables-iceberg.md). |
| [DESCRIBE FEATURE POLICY](sql/desc-feature-policy.md) | Describes the properties of a [feature policy](../developer-guide/native-apps/ui-consumer-feature-policies.md). |
| [DESCRIBE FILE FORMAT](sql/desc-file-format.md) | Describes the property type (for example, `String` or `Integer`), the defined value of the property, and the default value for each property in a file format object definition. |
| [DESCRIBE FUNCTION](sql/desc-function.md) | Describes the specified user-defined function (UDF) or external function, including the signature (i.e. arguments), return value, language, and body (i.e. definition). |
| [DESCRIBE FUNCTION (DMF)](sql/desc-function-dmf.md) | Describes the specified data metric function (DMF), including the signature (arguments), return value, language, and body (definition). |
| [DESCRIBE FUNCTION (Snowpark Container Services)](sql/desc-function-spcs.md) | Describes the specified [service function](../developer-guide/snowpark-container-services/working-with-services.md), including the signature (arguments), return value, language, and body (path to the Snowpark Container Services service). |
| [DESCRIBE GATEWAY](sql/desc-gateway.md) | Describes the properties of a [gateway](../developer-guide/snowpark-container-services/gateway.md). |
| [DESCRIBE GIT REPOSITORY](sql/desc-git-repository.md) | Describes an existing Snowflake [Git repository clone](../developer-guide/git/git-overview.md). |
| [DESCRIBE ICEBERG TABLE](sql/desc-iceberg-table.md) | Describes either the columns in an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) or the current values, as well as the default values, for the properties of an Iceberg table. |
| [DESCRIBE INTEGRATION](sql/desc-integration.md) | Describes the properties of an integration. |
| [DESCRIBE JOIN POLICY](sql/desc-join-policy.md) | Describes the details about a [join policy](../user-guide/join-policies.md), including the creation date, name, and the SQL expression. |
| [DESCRIBE LISTING](sql/desc-listing.md) | Describes the columns in a [listing](../collaboration/collaboration-listings-about.md). |
| [DESCRIBE MAINTENANCE POLICY](sql/desc-maintenance-policy.md) | Shows the details of a [maintenance policy](../developer-guide/native-apps/consumer-maintenance-policies.md). |
| [DESCRIBE MASKING POLICY](sql/desc-masking-policy.md) | Describes the details about a masking policy, including the creation date, name, data type, and SQL expression. |
| [DESCRIBE MATERIALIZED VIEW](sql/desc-materialized-view.md) | Describes the columns in a materialized view. |
| [DESCRIBE MCP SERVER](sql/desc-mcp-server.md) | Describes the properties of an MCP (Model Context Protocol) server. |
| [DESCRIBE MODEL MONITOR](sql/desc-model-monitor.md) | Displays information about a specific [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md). |
| [DESCRIBE NETWORK POLICY](sql/desc-network-policy.md) | Describes the properties specified for a network policy. |
| [DESCRIBE NETWORK RULE](sql/desc-network-rule.md) | Describes the properties specified for a network rule. |
| [DESCRIBE NOTEBOOK](sql/desc-notebook.md) | Describes the properties of a [notebook](../user-guide/ui-snowsight/notebooks.md). |
| [DESCRIBE NOTIFICATION INTEGRATION](sql/desc-notification-integration.md) | Describes the properties of a notification integration. |
| [DESCRIBE OPENFLOW DATA PLANE INTEGRATION](sql/desc-oflow-data-plane-integration.md) | Describes the columns in an Openflow data plane integration. |
| [DESCRIBE ONLINE FEATURE TABLE](sql/desc-online-feature-table.md) | Describes the columns in an [online feature table](sql/create-online-feature-table.md). |
| [DESCRIBE ORGANIZATION PROFILE](sql/desc-organization-profile.md) | Describes the properties of an organization profile. |
| [DESCRIBE PACKAGES POLICY](sql/desc-packages-policy.md) | Describes the details about a packages policy. |
| [DESCRIBE PASSWORD POLICY](sql/desc-password-policy.md) | Describes the details about a password policy. |
| [DESCRIBE PIPE](sql/desc-pipe.md) | Describes the properties specified for a pipe, as well as the default values of the properties. |
| [DESCRIBE POSTGRES INSTANCE](sql/desc-postgres-instance.md) | Describes the properties of a [Snowflake Postgres instance](../user-guide/snowflake-postgres/about.md). |
| [DESCRIBE PRIVACY POLICY](sql/desc-privacy-policy.md) | Describes the properties of a [privacy policy](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md). |
| [DESCRIBE PROCEDURE](sql/desc-procedure.md) | Describes the specified stored procedure, including the stored procedure’s signature (i.e. arguments), return value, language, and body (i.e. definition). |
| [DESCRIBE PROJECTION POLICY](sql/desc-projection-policy.md) | Describes the details about a [projection policy](../user-guide/projection-policies.md), including the creation date, name, and the SQL expression. |
| [DESCRIBE RESULT](sql/desc-result.md) | Describes the columns in the result of a query. |
| [DESCRIBE ROW ACCESS POLICY](sql/desc-row-access-policy.md) | Describes a row access policy, including the creation date, name, data type, and SQL expression. |
| [DESCRIBE SCHEMA](sql/desc-schema.md) | Describes the schema. |
| [DESCRIBE SEARCH OPTIMIZATION](sql/desc-search-optimization.md) | Describes the [search optimization configuration](../user-guide/search-optimization/enabling.md) for a specified table and its columns. |
| [DESCRIBE SECRET](sql/desc-secret.md) | Describes the properties of a secret. |
| [DESCRIBE SEMANTIC VIEW](sql/desc-semantic-view.md) | Describes the properties of the logical tables, dimensions, facts, and metrics that make up a [semantic view](../user-guide/views-semantic/overview.md). |
| [DESCRIBE SEQUENCE](sql/desc-sequence.md) | Describes a sequence, including the sequence’s interval. |
| [DESCRIBE SERVICE](sql/desc-service.md) | Describes the properties of a [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) (including job services). |
| [DESCRIBE SESSION POLICY](sql/desc-session-policy.md) | Describes the details about a session policy. |
| [DESCRIBE SHARE](sql/desc-share.md) | Describes the data objects that are included in a [share](../user-guide/data-sharing-intro.md). |
| [DESCRIBE SNAPSHOT](sql/desc-snapshot.md) | Describes the properties of a [snapshot of a block storage volume](../developer-guide/snowpark-container-services/block-storage-volume.md). |
| [DESCRIBE SNAPSHOT POLICY](sql/desc-snapshot-policy.md) | Describes a specific [snapshot policy](../user-guide/backups.md). |
| [DESCRIBE SNAPSHOT SET](sql/desc-snapshot-set.md) | Describes a specific [snapshot set](../user-guide/backups.md). |
| [DESCRIBE SPECIFICATION](sql/desc-specification.md) | Describes the details about an [app specification](../developer-guide/native-apps/requesting-app-specs.md). |
| [DESCRIBE STAGE](sql/desc-stage.md) | Describes the values specified for the properties in a stage (file format, copy, and location), as well as the default values for each property. |
| [DESCRIBE STORAGE LIFECYCLE POLICY](sql/desc-storage-lifecycle-policy.md) | Describes the properties of a [storage lifecycle policy](../user-guide/storage-management/storage-lifecycle-policies.md). |
| [DESCRIBE STREAM](sql/desc-stream.md) | Describes the properties specified for a stream. |
| [DESCRIBE STREAMLIT](sql/desc-streamlit.md) | Describes the columns in a Streamlit object. |
| [DESCRIBE TABLE](sql/desc-table.md) | Describes either the columns in a table or the set of stage properties for the table (current values and default values). |
| [DESCRIBE TASK](sql/desc-task.md) | Shows information about a task. |
| [DESCRIBE TRANSACTION](sql/desc-transaction.md) | Describes the [transaction](transactions.md), including the start time and the state (running, committed, rolled back). |
| [DESCRIBE TYPE](sql/desc-type.md) | Describes a [user-defined type](data-types-user-defined.md). |
| [DESCRIBE USER](sql/desc-user.md) | Describes a [user](../user-guide/admin-user-management.md), including the current and default values of the properties of the user. |
| [DESCRIBE VIEW](sql/desc-view.md) | Describes the columns in a view (or table). |
| [DESCRIBE WAREHOUSE](sql/desc-warehouse.md) | Describes a [virtual warehouse](../user-guide/warehouses-overview.md). |
| [DROP <object>](sql/drop.md) | Removes the specified object from the system. |
| [DROP ACCOUNT](sql/drop-account.md) | Drops an account, which initiates the process of [deleting the account](../user-guide/organizations-manage-accounts-delete.md). |
| [DROP AGENT](sql/drop-agent.md) | Removes the specified [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md) with the specified name from the current or specified database and schema. |
| [DROP AGGREGATION POLICY](sql/drop-aggregation-policy.md) | Removes an [aggregation policy](../user-guide/aggregation-policies.md) from the current/specified schema. |
| [DROP ALERT](sql/drop-alert.md) | Drops an existing [alert](../user-guide/alerts.md). |
| [DROP APPLICATION](sql/drop-application.md) | Removes an application from the system in the Native Apps Framework. |
| [DROP APPLICATION PACKAGE](sql/drop-application-package.md) | Removes an application package from the system in the Native Apps Framework. |
| [DROP APPLICATION ROLE](sql/drop-application-role.md) | Removes the specified application role from the system. |
| [DROP AUTHENTICATION POLICY](sql/drop-authentication-policy.md) | Removes an [authentication policy](../user-guide/authentication-policies.md) from the system. |
| [DROP BACKUP POLICY](sql/drop-backup-policy.md) | Deletes a [backup](../user-guide/backups.md) policy. |
| [DROP BACKUP SET](sql/drop-backup-set.md) | Deletes a [backup](../user-guide/backups.md) set. |
| [DROP CATALOG INTEGRATION](sql/drop-catalog-integration.md) | Removes a [catalog integration](../user-guide/tables-iceberg.md) from the account. |
| [DROP COMPUTE POOL](sql/drop-compute-pool.md) | Removes the specified [compute pool](../developer-guide/snowpark-container-services/working-with-compute-pool.md) from the account. |
| [DROP CONNECTION](sql/drop-connection.md) | Removes a connection from the account. |
| [DROP CONTACT](sql/drop-contact.md) | Removes the specified [contact](../user-guide/contacts-using.md) from the current schema. |
| [DROP CORTEX SEARCH SERVICE](sql/drop-cortex-search.md) | Removes the specified [Cortex Search service](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) from the current schema. |
| [DROP DATABASE](sql/drop-database.md) | Removes a database from the system. |
| [DROP DATABASE ROLE](sql/drop-database-role.md) | Removes the specified database role from the system. |
| [DROP DBT PROJECT](sql/drop-dbt-project.md) | Removes the specified [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md) from the current or specified schema. |
| [DROP DCM PROJECT](sql/drop-dcm-project.md) | Removes the specified [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md) from the current/specified schema. |
| [DROP DYNAMIC TABLE](sql/drop-dynamic-table.md) | Removes a [dynamic table](../user-guide/dynamic-tables-about.md) from the current/specified schema. |
| [DROP EXPERIMENT](sql/drop-experiment.md) | Removes the specified [experiment](../developer-guide/snowflake-ml/experiments.md) from the current/specified schema. |
| [DROP EXTERNAL TABLE](sql/drop-external-table.md) | Removes an external table from the current or specified schema. |
| [DROP EXTERNAL VOLUME](sql/drop-external-volume.md) | Removes an [external volume](../user-guide/tables-iceberg.md) from the account, but retains a version of the external volume so that it can be recovered using [UNDROP EXTERNAL VOLUME](sql/undrop-external-volume.md). |
| [DROP FAILOVER GROUP](sql/drop-failover-group.md) | Removes a [failover group](../user-guide/account-replication-intro.md) from the account. |
| [DROP FEATURE POLICY](sql/drop-feature-policy.md) | Removes the specified [feature policy](../developer-guide/native-apps/ui-consumer-feature-policies.md). |
| [DROP FILE FORMAT](sql/drop-file-format.md) | Removes the specified file format from the current/specified schema. |
| [DROP FUNCTION](sql/drop-function.md) | Removes the specified user-defined function (UDF) or external function from the current/specified schema. |
| [DROP FUNCTION (DMF)](sql/drop-function-dmf.md) | Removes the specified data metric function (DMF) from the current or specified schema. |
| [DROP FUNCTION (Snowpark Container Services)](sql/drop-function-spcs.md) | Removes the specified [service function](../developer-guide/snowpark-container-services/working-with-services.md). |
| [DROP GATEWAY](sql/drop-gateway.md) | Removes the specified [gateway](../developer-guide/snowpark-container-services/gateway.md) from the current or specified schema. |
| [DROP GIT REPOSITORY](sql/drop-git-repository.md) | Removes the specified Snowflake Git repository clone from the current/specified schema. |
| [DROP ICEBERG TABLE](sql/drop-iceberg-table.md) | Removes an [Apache Iceberg™ table](../user-guide/tables-iceberg.md) from the current/specified schema, but retains a version of the Iceberg table so that it can be recovered using [UNDROP ICEBERG TABLE](sql/undrop-iceberg-table.md). |
| [DROP IMAGE REPOSITORY](sql/drop-image-repository.md) | Removes the specified [image repository](../developer-guide/snowpark-container-services/tutorials/tutorial-1.md) from the current or specified schema. |
| [DROP INDEX](sql/drop-index.md) | Drops a secondary index. |
| [DROP INTEGRATION](sql/drop-integration.md) | Removes an integration from the account. |
| [DROP JOIN POLICY](sql/drop-join-policy.md) | Removes a [join policy](../user-guide/join-policies.md) from the current/specified schema. |
| [DROP LISTING](sql/drop-listing.md) | Removes the specified [listing](../collaboration/collaboration-listings-about.md) from the system and immediately revokes access for all consumers. |
| [DROP MAINTENANCE POLICY](sql/drop-maintenance-policy.md) | Removes a [maintenance policy](../developer-guide/native-apps/consumer-maintenance-policies.md) from the current or specified schema. |
| [DROP MANAGED ACCOUNT](sql/drop-managed-account.md) | Removes a managed account, including all objects created in the account, and immediately restricts access to the account. |
| [DROP MASKING POLICY](sql/drop-masking-policy.md) | Removes a masking policy from the system. |
| [DROP MATERIALIZED VIEW](sql/drop-materialized-view.md) | Removes the specified materialized view from the current/specified schema. |
| [DROP MCP SERVER](sql/drop-mcp-server.md) | Removes the specified MCP (Model Context Protocol) server from the current/specified schema. |
| [DROP MODEL](sql/drop-model.md) | Removes a machine learning model from the current/specified schema. |
| [DROP MODEL MONITOR](sql/drop-model-monitor.md) | Removes the specified [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md) from the current or specified schema. |
| [DROP NETWORK POLICY](sql/drop-network-policy.md) | Removes the specified network policy from the system. |
| [DROP NETWORK RULE](sql/drop-network-rule.md) | Removes the specified network rule from the system. |
| [DROP NOTEBOOK](sql/drop-notebook.md) | Removes the specified [notebook](../user-guide/ui-snowsight/notebooks.md) from the current/specified schema, but retains a version of the notebook so that it can be recovered using [UNDROP NOTEBOOK](sql/undrop-notebook.md). |
| [DROP ONLINE FEATURE TABLE](sql/drop-online-feature-table.md) | Removes the specified [online feature table](sql/create-online-feature-table.md) from the current/specified schema. |
| [DROP ORGANIZATION PROFILE](sql/drop-organization-profile.md) | Removes an organization profile. |
| [DROP ORGANIZATION USER](sql/drop-organization-user.md) | Removes an [organization user](../user-guide/organization-users.md) from the organization. |
| [DROP ORGANIZATION USER GROUP](sql/drop-organization-user-group.md) | Removes an [organization user group](../user-guide/organization-users.md) from the organization. |
| [DROP PACKAGES POLICY](sql/drop-packages-policy.md) | Removes a packages policy from the system. |
| [DROP PASSWORD POLICY](sql/drop-password-policy.md) | Removes a password policy from the system. |
| [DROP PIPE](sql/drop-pipe.md) | Removes the specified pipe from the current/specified schema. |
| [DROP POSTGRES INSTANCE](sql/drop-postgres-instance.md) | Removes the specified [Snowflake Postgres instance](../user-guide/snowflake-postgres/about.md) from the account. |
| [DROP PRIVACY POLICY](sql/drop-privacy-policy.md) | Removes the specified [privacy policy](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) from the current/specified schema. |
| [DROP PROCEDURE](sql/drop-procedure.md) | Removes the specified stored procedure from the current/specified schema. |
| [DROP PROJECTION POLICY](sql/drop-projection-policy.md) | Removes a [projection policy](../user-guide/projection-policies.md) from the current/specified schema. |
| [DROP REPLICATION GROUP](sql/drop-replication-group.md) | Removes a [replication group](../user-guide/account-replication-intro.md) from the account. |
| [DROP RESOURCE MONITOR](sql/drop-resource-monitor.md) | Removes the specified [resource monitor](../user-guide/resource-monitors.md) from the system. |
| [DROP ROLE](sql/drop-role.md) | Removes the specified role from the system. |
| [DROP ROW ACCESS POLICY](sql/drop-row-access-policy.md) | Removes a row access policy from the system. |
| [DROP SCHEMA](sql/drop-schema.md) | Removes a schema from the current/specified database. |
| [DROP SECRET](sql/drop-secret.md) | Removes a secret from the system. |
| [DROP SEMANTIC VIEW](sql/drop-semantic-view.md) | Removes the specified [semantic view](../user-guide/views-semantic/overview.md) from the current/specified schema. |
| [DROP SEQUENCE](sql/drop-sequence.md) | Removes a sequence from the current/specified schema. |
| [DROP SERVICE](sql/drop-service.md) | Removes the specified [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) from the current or specified schema. |
| [DROP SESSION POLICY](sql/drop-session-policy.md) | Removes a session policy from the system. |
| [DROP SHARE](sql/drop-share.md) | Removes the specified [share](../user-guide/data-sharing-intro.md) from the system and immediately revokes access for all consumers (i.e. accounts who have created a database from the share). |
| [DROP SNAPSHOT](sql/drop-snapshot.md) | Removes a [snapshot of a block storage volume](../developer-guide/snowpark-container-services/block-storage-volume.md). |
| [DROP SNAPSHOT POLICY — Deprecated](sql/drop-snapshot-policy.md) | Deletes a [snapshot](../user-guide/backups.md) policy. |
| [DROP SNAPSHOT SET — Deprecated](sql/drop-snapshot-set.md) | Deletes a [snapshot](../user-guide/backups.md) set. |
| [DROP STAGE](sql/drop-stage.md) | Removes the specified named internal or external stage from the current/specified schema. |
| [DROP STORAGE LIFECYCLE POLICY](sql/drop-storage-lifecycle-policy.md) | Removes the specified [storage lifecycle policy](../user-guide/storage-management/storage-lifecycle-policies.md) from the current or specified schema. |
| [DROP STREAM](sql/drop-stream.md) | Removes a stream from the current/specified schema. |
| [DROP STREAMLIT](sql/drop-streamlit.md) | Removes the specified Streamlit object from the current/specified schema. |
| [DROP TABLE](sql/drop-table.md) | Removes a table from the current or specified schema, but retains a version of the table so that it can be recovered by using [UNDROP TABLE](sql/undrop-table.md). |
| [DROP TAG](sql/drop-tag.md) | Removes a tag from the system. |
| [DROP TASK](sql/drop-task.md) | Removes a task from the current/specified schema. |
| [DROP TYPE](sql/drop-type.md) | Removes a [user-defined type](data-types-user-defined.md). |
| [DROP USER](sql/drop-user.md) | Removes the specified user from the system. |
| [DROP VIEW](sql/drop-view.md) | Removes the specified view from the current/specified schema. |
| [DROP WAREHOUSE](sql/drop-warehouse.md) | Removes the specified [virtual warehouse](../user-guide/warehouses-overview.md) from the system. |
| **E** |  |
| [EXECUTE ALERT](sql/execute-alert.md) | Manually executes an [alert](../user-guide/alerts.md) independent of the schedule for the alert. |
| [EXECUTE DBT PROJECT](sql/execute-dbt-project.md) | Executes the specified [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md) or the dbt project in a Snowflake workspace using the dbt command and command-line options specified. |
| [EXECUTE DCM PROJECT](sql/execute-dcm-project.md) | Executes one of the following actions on a [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [EXECUTE IMMEDIATE](sql/execute-immediate.md) | Executes a string that contains a SQL statement or a [Snowflake Scripting statement](../developer-guide/snowflake-scripting/blocks.md). |
| [EXECUTE IMMEDIATE FROM](sql/execute-immediate-from.md) | EXECUTE IMMEDIATE FROM executes the SQL statements specified in a file in a stage. |
| [EXECUTE JOB SERVICE](sql/execute-job-service.md) | Executes a Snowpark Container Services service as a job. |
| [EXECUTE NOTEBOOK](sql/execute-notebook.md) | Executes the notebook outside the Notebook Editor. |
| [EXECUTE NOTEBOOK PROJECT](sql/execute-notebook-project.md) | Executes a notebook stored in a notebook project (NPO). |
| [EXECUTE TASK](sql/execute-task.md) | Manually triggers an asynchronous single run of a task (either a standalone task or the root task in a [task graph](../user-guide/tasks-graphs.md)) independent of the schedule defined for the task. |
| [EXPLAIN](sql/explain.md) | Returns the logical execution plan for the specified SQL statement. |
| **G** |  |
| [GET](sql/get.md) | Downloads data files from one of the following [internal stage](../user-guide/data-load-overview.md) types to a local directory or folder on a client machine. |
| [GRANT APPLICATION ROLE](sql/grant-application-role.md) | Assigns an application role to an account role, another application role, an application, or a user. |
| [GRANT CALLER](sql/grant-caller.md) | Grants [caller grants](../developer-guide/restricted-callers-rights.md) to a role. |
| [GRANT DATABASE ROLE](sql/grant-database-role.md) | Assigns a database role to an [account role, another database role](../user-guide/security-access-control-overview.md), or a user. |
| [GRANT DATABASE ROLE … TO SHARE](sql/grant-database-role-share.md) | Grants a database role to a share. |
| [GRANT OWNERSHIP](sql/grant-ownership.md) | Transfers ownership of an object or all objects of a specified type in a schema from one role to another role. |
| [GRANT <privileges> … TO ROLE](sql/grant-privilege.md) | Grants one or more access privileges on a securable object to a role or database role. |
| [GRANT <privileges> … TO APPLICATION](sql/grant-privilege-application.md) | Grants one or more access privileges on a securable object to an application. |
| [GRANT <privileges> … TO APPLICATION ROLE](sql/grant-privilege-application-role.md) | Grants one or more access privileges on a securable schema-level object to an application role. |
| [GRANT <privilege> … TO SHARE](sql/grant-privilege-share.md) | Grants access privileges for databases and other supported database objects (schemas, UDFs, tables, and views) to a share. |
| [GRANT <privileges> … TO USER](sql/grant-privilege-user.md) | Grants one or more access privileges on a securable object to a user. |
| [GRANT ROLE](sql/grant-role.md) | Assigns a role to a user or another role. |
| [GRANT SERVICE ROLE](sql/grant-service-role.md) | Assigns a service role to an account role, application role, or database role. |
| **I** |  |
| [INSERT](sql/insert.md) | Updates a table by inserting one or more rows into the table. |
| [INSERT (multi-table)](sql/insert-multi-table.md) | Updates multiple tables by inserting one or more rows with column values (from a query) into the tables. |
| **L** |  |
| [LIST](sql/list.md) | Returns a list of files from one of the following Snowflake storage features. |
| **M** |  |
| [MERGE](sql/merge.md) | Inserts, updates, and deletes values in a table that are based on values in a second table or a subquery. |
| **P** |  |
| [PUT](sql/put.md) | Uploads one or more data files from a local file system onto an [internal stage](../user-guide/data-load-local-file-system-create-stage.md). |
| **R** |  |
| [REMOVE](sql/remove.md) | Removes files from either an external (external cloud storage) or internal (i.e. Snowflake) stage. |
| [REVOKE APPLICATION ROLE](sql/revoke-application-role.md) | Revokes an application role from an account role or another application role. |
| [REVOKE CALLER](sql/revoke-caller.md) | Revokes privileges that were previously granted to an executable owner using a [caller grant](../developer-guide/restricted-callers-rights.md). |
| [REVOKE DATABASE ROLE](sql/revoke-database-role.md) | Revokes a database role from an [account role or another database role](../user-guide/security-access-control-overview.md). |
| [REVOKE DATABASE ROLE … FROM SHARE](sql/revoke-database-role-share.md) | Revokes a database role from a share. |
| [REVOKE <privileges> … FROM ROLE](sql/revoke-privilege.md) | Removes one or more privileges on a securable object from a role or database role. |
| [REVOKE <privileges> … FROM APPLICATION](sql/revoke-privilege-application.md) | Revokes one or more access privileges on a securable object from an application. |
| [REVOKE <privileges> … FROM APPLICATION ROLE](sql/revoke-privilege-application-role.md) | Revokes one or more access privileges on a securable schema-level object from an application role. |
| [REVOKE <privilege> … FROM SHARE](sql/revoke-privilege-share.md) | Revokes access privileges for databases and other supported database objects (schemas, tables, and views) from a share. |
| [REVOKE <privileges> … FROM USER](sql/revoke-privilege-user.md) | Removes one or more privileges on a securable object from a user. |
| [REVOKE ROLE](sql/revoke-role.md) | Removes a role from another role or a user. |
| [REVOKE SERVICE ROLE](sql/revoke-service-role.md) | Revokes a service role from an account role, application role, or database role. |
| [ROLLBACK](sql/rollback.md) | Rolls back an open transaction in the current session. |
| **S** |  |
| [SELECT](sql/select.md) | SELECT can be used as either a statement or as a clause within other statements. |
| [SET](sql/set.md) | Initializes the value of a [session variable](session-variables.md) to the result of a SQL expression. |
| [SHOW <objects>](sql/show.md) | Lists the existing objects for the specified object type. |
| [SHOW ACCOUNTS](sql/show-accounts.md) | Lists all the accounts in your organization, excluding [managed accounts](../user-guide/data-sharing-reader-create.md). |
| [SHOW AGENTS](sql/show-agents.md) | Lists the [Cortex Agents](../user-guide/snowflake-cortex/cortex-agents.md) for which you have access privileges. |
| [SHOW AGGREGATION POLICIES](sql/show-aggregation-policies.md) | Lists information about existing [aggregation policies](../user-guide/aggregation-policies.md), including the creation date, database and schema names, owner, and any available comments. |
| [SHOW ALERTS](sql/show-alerts.md) | Lists the [alerts](../user-guide/alerts.md) for which you have access privileges. |
| [SHOW APPLICATION PACKAGES](sql/show-application-packages.md) | Lists the application packages for which you have access privileges across your entire account in the Native Apps Framework. |
| [SHOW APPLICATION ROLES](sql/show-application-roles.md) | Lists the application roles in the specified app for which you have access privileges. |
| [SHOW APPLICATIONS](sql/show-applications.md) | Lists the Snowflake Native Apps that you have access privileges for across your entire account. |
| [SHOW AUTHENTICATION POLICIES](sql/show-authentication-policies.md) | Lists [authentication policy](../user-guide/authentication-policies.md) information, including the creation date, database and schema names, owner, and any available comments. |
| [SHOW AVAILABLE LISTINGS](sql/show-available-listings.md) | Lists the listings that are available to the user who runs the command. |
| [SHOW AVAILABLE OFFERS](sql/show-available-offers.md) | Lists the [offers](../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md) that are available to the user who runs the command. |
| [SHOW AVAILABLE ORGANIZATION PROFILES](sql/show-available-organization-profiles.md) | Lists the organization profiles available in the user’s organization. |
| [SHOW BACKUP POLICIES](sql/show-backup-policies.md) | Lists all the [backup](../user-guide/backups.md) policies in your account for which you have access privileges. |
| [SHOW BACKUP SETS](sql/show-backup-sets.md) | Lists all the [backup](../user-guide/backups.md) sets for which you have access privileges. |
| [SHOW BACKUPS IN BACKUP SET](sql/show-backups-in-backup-set.md) | Lists all the [backups](../user-guide/backups.md) in a backup set. |
| [SHOW CALLER GRANTS](sql/show-caller-grants.md) | Lists the [caller grants](../developer-guide/restricted-callers-rights.md) being used to implement restricted caller’s rights. |
| [SHOW CATALOG INTEGRATIONS](sql/show-catalog-integrations.md) | Lists the [catalog integrations](../user-guide/tables-iceberg.md) in your account. |
| [SHOW CHANNELS](sql/show-channels.md) | Lists the [Snowpipe Streaming channels](../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) for which you have access privileges. |
| [SHOW CLASSES](sql/show-classes.md) | Lists all available classes. |
| [SHOW COLUMNS](sql/show-columns.md) | Lists the columns in the tables or views and the dimensions, facts, and metrics in the [semantic views](../user-guide/views-semantic/overview.md) for which you have access privileges. |
| [SHOW COMPUTE POOL INSTANCE FAMILIES](sql/show-compute-pool-instance-families.md) | Lists the available [compute pool instance families](../developer-guide/snowpark-container-services/working-with-compute-pool.md) that you can use to create a compute pool. |
| [SHOW COMPUTE POOLS](sql/show-compute-pools.md) | Lists the [compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md) in your account for which you have access privileges. |
| [SHOW CONFIGURATIONS](sql/show-configurations.md) | Lists the [configurations](../developer-guide/native-apps/inter-app-communication.md) in the specified app for which you have access privileges. |
| [SHOW CONNECTIONS](sql/show-connections.md) | Lists the [connections](../user-guide/client-redirect.md) for which you have access privileges. |
| [SHOW CONTACTS](sql/show-contacts.md) | Lists the [contacts](../user-guide/contacts-using.md) for which you have access privileges. |
| [SHOW CORTEX SEARCH SERVICES](sql/show-cortex-search.md) | Lists the [Cortex Search services](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) for which you have access privileges. |
| [SHOW DATA METRIC FUNCTIONS](sql/show-data-metric-functions.md) | Lists the [data metric functions](../user-guide/data-quality-intro.md) (DMFs) for which you have access privileges. |
| [SHOW DATABASE ROLES](sql/show-database-roles.md) | Lists all the database roles in the specified database. |
| [SHOW DATABASES](sql/show-databases.md) | Lists the databases for which you have access privileges across your entire account, including dropped databases that are still within the Time Travel retention period and, therefore, can be undropped. |
| [SHOW DATABASES IN FAILOVER GROUP](sql/show-databases-in-failover-group.md) | Lists databases in a [failover group](../user-guide/account-replication-intro.md). |
| [SHOW DATABASES IN REPLICATION GROUP](sql/show-databases-in-replication-group.md) | Lists databases in a [replication group](../user-guide/account-replication-intro.md). |
| [SHOW DATASETS](sql/show-datasets.md) | Displays information about the datasets in your account. |
| [SHOW DBT PROJECTS](sql/show-dbt-projects.md) | Lists the [dbt project objects](../user-guide/data-engineering/dbt-projects-on-snowflake.md) for which you have access privileges. |
| [SHOW DCM PROJECTS](sql/show-dcm-projects.md) | Lists the [DCM projects](../user-guide/dcm-projects/dcm-projects-overview.md) for which you have at least READ privilege. |
| [SHOW DELEGATED AUTHORIZATIONS](sql/show-delegated-authorizations.md) | Lists the active delegated authorizations for which you have access privileges. |
| [SHOW DEPLOYMENTS IN DCM PROJECT](sql/show-deployments-in-dcm-project.md) | Shows all deployments for the specified [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [SHOW DYNAMIC TABLES](sql/show-dynamic-tables.md) | Lists the [dynamic tables](../user-guide/dynamic-tables-about.md) for which you have access privileges. |
| [SHOW ENDPOINTS](sql/show-endpoints.md) | Lists the endpoints in a [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) (or a job service). |
| [SHOW ENTITIES IN DCM PROJECT](sql/show-entities-in-dcm-project.md) | Shows all Snowflake objects that are currently managed by a specified [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [SHOW EVENT TABLES](sql/show-event-tables.md) | Lists the [event tables](../developer-guide/logging-tracing/event-table-setting-up.md) for which you have access privileges, including dropped tables that are still within the Time Travel retention period and, therefore, can be undropped. |
| [SHOW EXPERIMENTS](sql/show-experiments.md) | Lists the [experiments](../developer-guide/snowflake-ml/experiments.md) for which you have access privileges. |
| [SHOW EXTERNAL FUNCTIONS](sql/show-external-functions.md) | Lists all the external functions created for your account. |
| [SHOW EXTERNAL TABLES](sql/show-external-tables.md) | Lists the external tables for which you have access privileges. |
| [SHOW EXTERNAL VOLUMES](sql/show-external-volumes.md) | Lists the [external volumes](../user-guide/tables-iceberg.md) in your account for which you have access privileges. |
| [SHOW FAILOVER GROUPS](sql/show-failover-groups.md) | Lists the primary and secondary [failover groups](../user-guide/account-replication-intro.md) in your account, as well as the failover groups in other accounts that are associated with your account. |
| [SHOW FEATURE POLICIES](sql/show-feature-policies.md) | Lists the [feature policies](../developer-guide/native-apps/ui-consumer-feature-policies.md) for which you have access privileges. |
| [SHOW FILE FORMATS](sql/show-file-formats.md) | Lists the file formats for which you have access privileges. |
| [SHOW FUNCTIONS](sql/show-functions.md) | Lists all functions that you have privileges to access, including built-in, user-defined, and external functions. |
| [SHOW FUNCTIONS IN MODEL](sql/show-functions-in-model.md) | Lists functions defined in machine learning models. |
| [SHOW GATEWAYS](sql/show-gateways.md) | Lists the [gateway](../developer-guide/snowpark-container-services/gateway.md) for which you have access privileges. |
| [SHOW GIT BRANCHES](sql/show-git-branches.md) | Lists the branches in the specified Snowflake Git repository clone. |
| [SHOW GIT REPOSITORIES](sql/show-git-repositories.md) | Lists the [Git repository clones](../developer-guide/git/git-overview.md) that you have privileges to access. |
| [SHOW GIT TAGS](sql/show-git-tags.md) | Lists the tags in the specified Snowflake [Git repository clone](../developer-guide/git/git-overview.md). |
| [SHOW GLOBAL ACCOUNTS](sql/show-global-accounts.md) | Lists all the accounts in your organization that are enabled for replication and indicates the Snowflake Region in which each account is located. |
| [SHOW GRANTS](sql/show-grants.md) | Lists all access control privileges that have been explicitly granted to roles, users, and shares. |
| [SHOW GRANTS IN DCM PROJECT](sql/show-grants-in-dcm-project.md) | `SHOW GRANTS IN DCM PROJECT` lists all grants deployed and managed by the specified [DCM project](../user-guide/dcm-projects/dcm-projects-overview.md). |
| [SHOW HYBRID TABLES](sql/show-hybrid-tables.md) | Lists the [hybrid tables](../user-guide/tables-hybrid.md) for which you have access privileges. |
| [SHOW ICEBERG TABLES](sql/show-iceberg-tables.md) | Lists the [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) for which you have access privileges. |
| [SHOW IMAGE REPOSITORIES](sql/show-image-repositories.md) | Lists the [image repositories](../developer-guide/snowpark-container-services/tutorials/tutorial-1.md) for which you have access privileges. |
| [SHOW IMAGES IN IMAGE REPOSITORY](sql/show-images-in-image-repository.md) | Lists the images in an [image repository](../developer-guide/snowpark-container-services/working-with-registry-repository.md). |
| [SHOW INDEXES](sql/show-indexes.md) | Lists all the indexes in your account for which you have access privileges. |
| [SHOW INTEGRATIONS](sql/show-integrations.md) | Lists the integrations in your account. |
| [SHOW JOIN POLICIES](sql/show-join-policies.md) | Lists information about existing [join policies](../user-guide/join-policies.md), including the creation date, database and schema names, owner, and any available comments. |
| [SHOW LISTINGS](sql/show-listings.md) | Lists the [listings](../collaboration/collaboration-listings-about.md) that you have privileges to access. |
| [SHOW LISTINGS IN FAILOVER GROUP](sql/show-listings-in-failover-group.md) | Shows the listings in a [failover group](../user-guide/account-replication-intro.md). |
| [SHOW LOCKS](sql/show-locks.md) | Lists all running transactions that have locks on resources. |
| [SHOW MAINTENANCE POLICIES](sql/show-maintenance-policies.md) | Lists the [maintenance policies](../developer-guide/native-apps/consumer-maintenance-policies.md) applied to the specified account or app. |
| [SHOW MANAGED ACCOUNTS](sql/show-managed-accounts.md) | Lists the managed accounts created for your account. |
| [SHOW MASKING POLICIES](sql/show-masking-policies.md) | Lists masking policy information, including the creation date, database and schema names, owner, and any available comments. |
| [SHOW MATERIALIZED VIEWS](sql/show-materialized-views.md) | Lists the materialized views that you have privileges to access. |
| [SHOW MCP SERVERS](sql/show-mcp-servers.md) | Lists the MCP (Model Context Protocol) servers for which you have access privileges. |
| [SHOW MFA METHODS](sql/show-mfa-methods.md) | Lists the [second factors of authentication](../user-guide/security-mfa-second-factor.md) that a user enrolled in multi-factor authentication uses to sign in to Snowflake. |
| [SHOW MODEL MONITORS](sql/show-model-monitors.md) | Lists all [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md) that you can access in the current or specified schema and displays information about each one. |
| [SHOW MODELS](sql/show-models.md) | Lists the machine learning models that you have privileges to access. |
| [SHOW NETWORK POLICIES](sql/show-network-policies.md) | Lists all network policies defined in the system. |
| [SHOW NETWORK RULES](sql/show-network-rules.md) | Lists all network rules defined in the system. |
| [SHOW NOTEBOOK PROJECTS](sql/show-notebook-projects.md) | Lists the notebook projects (Snowflake `NOTEBOOK` objects) visible to the current role. |
| [SHOW NOTEBOOKS](sql/show-notebooks.md) | Lists the [notebooks](../user-guide/ui-snowsight/notebooks.md) for which you have access privileges. |
| [SHOW NOTIFICATION INTEGRATIONS](sql/show-notification-integrations.md) | Lists the notification integrations in your account. |
| [SHOW OBJECTS](sql/show-objects.md) | Lists the tables and views for which you have access privileges. |
| [SHOW OBJECTS OWNED BY APPLICATION](sql/show-objects-owned-by-application.md) | Lists the objects owned by an app that exists outside the app. |
| [SHOW OFFERS](sql/show-offers.md) | Provides information about all [offers](../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md) added to a listing. |
| [SHOW OPENFLOW DATA PLANE INTEGRATIONS](sql/show-oflow-data-plane-integration.md) | List OPENFLOW DATA PLANE INTEGRATIONS. |
| [SHOW ONLINE FEATURE TABLES](sql/show-online-feature-tables.md) | Lists the [online feature tables](sql/create-online-feature-table.md) for which you have access privileges. |
| [SHOW ORGANIZATION ACCOUNTS](sql/show-organization-accounts.md) | Lists the [organization account](../user-guide/organization-accounts.md) of the organization. |
| [SHOW ORGANIZATION PROFILES](sql/show-organization-profiles.md) | Lists the organization profiles for which you have access privileges. |
| [SHOW ORGANIZATION USER GROUPS](sql/show-organization-user-groups.md) | Lists [organization user groups](../user-guide/organization-users.md). |
| [SHOW ORGANIZATION USERS](sql/show-organization-users.md) | Lists [organization users](../user-guide/organization-users.md). |
| [SHOW PACKAGES POLICIES](sql/show-packages-policies.md) | Lists packages policy information. |
| [SHOW PARAMETERS](sql/show-parameters.md) | Lists all the account, session, and object parameters that can be set, as well as the current and default values for each parameter. |
| [SHOW PASSWORD POLICIES](sql/show-password-policies.md) | Lists password policy information, including the creation date, database and schema names, owner, and any available comments. |
| [SHOW PIPES](sql/show-pipes.md) | Lists the pipes for which you have access privileges. |
| [SHOW POSTGRES INSTANCES](sql/show-postgres-instances.md) | Lists the [Snowflake Postgres instances](../user-guide/snowflake-postgres/about.md) for which you have access privileges. |
| [SHOW PRICING PLANS](sql/show-pricing-plans.md) | Lists visible and hidden [pricing plans](../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md). |
| [SHOW PRIMARY KEYS](sql/show-primary-keys.md) | Lists primary keys for one or more tables. |
| [SHOW PRIVACY POLICIES](sql/show-privacy-policies.md) | Lists the [privacy policies](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) for which you have access privileges. |
| [SHOW PRIVILEGES](sql/show-privileges.md) | Lists the privileges granted to an application. |
| [SHOW PROCEDURES](sql/show-procedures.md) | Lists all stored procedures that you have privileges to access, including built-in and user-defined procedures. |
| [SHOW PROJECTION POLICIES](sql/show-projection-policies.md) | Lists [projection policy](../user-guide/projection-policies.md) information, including the creation date, database and schema names, owner, and any available comments. |
| [SHOW REFERENCES](sql/show-references.md) | Lists the references defined for an application in the manifest file and the references the consumer has associated to the application. |
| [SHOW REGIONS](sql/show-regions.md) | Lists all the [regions](../user-guide/intro-regions.md) in which accounts can be created. |
| [SHOW RELEASE CHANNELS](sql/show-release-channels.md) | Lists the [release channels](../developer-guide/native-apps/release-channels.md) for an application package or listing. |
| [SHOW RELEASE DIRECTIVES](sql/show-release-directives.md) | Lists the release directives defined for an application package. |
| [SHOW REPLICATION ACCOUNTS](sql/show-replication-accounts.md) | Lists all the accounts in your organization that are enabled for replication and indicates the [region](../user-guide/intro-regions.md) in which each account is located. |
| [SHOW REPLICATION DATABASES](sql/show-replication-databases.md) | Lists all the primary and secondary databases (that is to say, all the databases for which replication has been enabled) in your account and indicates the [region](../user-guide/intro-regions.md) in which each account is located. |
| [SHOW REPLICATION GROUPS](sql/show-replication-groups.md) | Displays information about [replication groups and failover groups](../user-guide/account-replication-intro.md). |
| [SHOW RESOURCE MONITORS](sql/show-resource-monitors.md) | Lists all the resource monitors in your account for which you have access privileges. |
| [SHOW ROLES](sql/show-roles.md) | Lists all the roles which you can view across your entire account, including the system-defined roles and any custom roles that exist. |
| [SHOW ROLES IN SERVICE](sql/show-roles-in-service.md) | Lists all the service roles associated with a service. |
| [SHOW ROW ACCESS POLICIES](sql/show-row-access-policies.md) | Lists the row access policies for which you have access privileges. |
| [SHOW RUN … IN EXPERIMENT](sql/show-run-in-experiment.md) | Displays logged parameters or metrics for [experiment runs](../developer-guide/snowflake-ml/experiments.md). |
| [SHOW RUNS IN EXPERIMENT](sql/show-runs-in-experiment.md) | Lists the runs in an [experiment](../developer-guide/snowflake-ml/experiments.md). |
| [SHOW SCHEMAS](sql/show-schemas.md) | Lists the schemas for which you have access privileges, including dropped schemas that are still within the Time Travel retention period and, therefore, can be undropped. |
| [SHOW SECRETS](sql/show-secrets.md) | Lists the secrets for which you have rights to see. |
| [SHOW SEMANTIC DIMENSIONS](sql/show-semantic-dimensions.md) | Lists the dimensions in the [semantic views](../user-guide/views-semantic/overview.md) for which you have access privileges. |
| [SHOW SEMANTIC DIMENSIONS FOR METRIC](sql/show-semantic-dimensions-for-metric.md) | Lists the dimensions that you can return when querying a specific metric in a [semantic view](../user-guide/views-semantic/overview.md). |
| [SHOW SEMANTIC FACTS](sql/show-semantic-facts.md) | Lists the facts in the [semantic views](../user-guide/views-semantic/overview.md) for which you have access privileges. |
| [SHOW SEMANTIC METRICS](sql/show-semantic-metrics.md) | Lists the metrics in the [semantic views](../user-guide/views-semantic/overview.md) for which you have access privileges. |
| [SHOW SEMANTIC VIEWS](sql/show-semantic-views.md) | Lists the [semantic views](../user-guide/views-semantic/overview.md) for which you have access privileges. |
| [SHOW SEQUENCES](sql/show-sequences.md) | Lists all the sequences for which you have access privileges. |
| [SHOW SERVICE CONTAINERS IN SERVICE](sql/show-service-containers-in-service.md) | Lists the containers in all instances of a [service](../developer-guide/snowpark-container-services/working-with-services.md). |
| [SHOW SERVICE INSTANCES IN SERVICE](sql/show-service-instances-in-service.md) | Lists instances of a [service](../developer-guide/snowpark-container-services/working-with-services.md). |
| [SHOW SERVICE VOLUMES IN SERVICE](sql/show-service-volumes-in-service.md) | Lists the storage volumes for all instances of a [service](../developer-guide/snowpark-container-services/working-with-services.md). |
| [SHOW SERVICES](sql/show-services.md) | Lists the [Snowpark Container Services services](../developer-guide/snowpark-container-services/working-with-services.md) (including job services) for which you have access privileges. |
| [SHOW SESSION POLICIES](sql/show-session-policies.md) | Lists session policy information, including the creation date, database and schema names, owner, and any available comments. |
| [SHOW SHARED CONTENT IN APPLICATION PACKAGE](sql/show-shared-content.md) | Shows all of the objects for which you have access privileges that have been shared from a Declarative Native App application package. |
| [SHOW SHARES](sql/show-shares.md) | Lists all [shares](../user-guide/data-sharing-intro.md) available in the system. |
| [SHOW SHARES IN FAILOVER GROUP](sql/show-shares-in-failover-group.md) | Lists shares in a [failover group](../user-guide/account-replication-intro.md). |
| [SHOW SHARES IN REPLICATION GROUP](sql/show-shares-in-replication-group.md) | Lists shares in a [replication group](../user-guide/account-replication-intro.md). |
| [SHOW SNAPSHOT POLICIES — Deprecated](sql/show-snapshot-policies.md) | Lists all the [snapshot](../user-guide/backups.md) policies in your account for which you have access privileges. |
| [SHOW SNAPSHOT SETS — Deprecated](sql/show-snapshot-sets.md) | Lists all the [snapshot](../user-guide/backups.md) sets for which you have access privileges. |
| [SHOW SNAPSHOTS](sql/show-snapshots.md) | Lists the [snapshots of block storage volumes](../developer-guide/snowpark-container-services/block-storage-volume.md) for which you have access privileges. |
| [SHOW SNAPSHOTS IN SNAPSHOT SET — Deprecated](sql/show-snapshots-in-snapshot-set.md) | Lists all the [snapshots](../user-guide/backups.md) in a snapshot set. |
| [SHOW SPECIFICATIONS](sql/show-specifications.md) | Lists the app specifications that have been defined for an app. |
| [SHOW STAGES](sql/show-stages.md) | Lists all the stages for which you have access privileges. |
| [SHOW STORAGE LIFECYCLE POLICIES](sql/show-storage-lifecycle-policies.md) | Lists the [storage lifecycle policies](../user-guide/storage-management/storage-lifecycle-policies.md) for which you have access privileges. |
| [SHOW STREAMLITS](sql/show-streamlits.md) | Lists the Streamlit objects for which you have access privileges. |
| [SHOW STREAMS](sql/show-streams.md) | Lists the streams for which you have access privileges. |
| [SHOW TABLES](sql/show-tables.md) | Lists the tables for which you have access privileges, including dropped tables that are still within the Time Travel retention period and, therefore, can be undropped. |
| [SHOW TAGS](sql/show-tags.md) | Lists the tag information. |
| [SHOW TASKS](sql/show-tasks.md) | Lists the tasks for which you have access privileges. |
| [SHOW TELEMETRY EVENT DEFINITIONS](sql/show-telemetry-event-definitions.md) | Lists the [event definitions](../developer-guide/native-apps/event-definition.md) for the specified app. |
| [SHOW TRANSACTIONS](sql/show-transactions.md) | List all running transactions. |
| [SHOW TYPES](sql/show-types.md) | Lists the [user-defined types](data-types-user-defined.md) for which you have access privileges. |
| [SHOW USER FUNCTIONS](sql/show-user-functions.md) | Lists all user-defined functions (UDFs) for which you have access privileges. |
| [SHOW USER PROCEDURES](sql/show-user-procedures.md) | Lists all user-defined procedures for which you have access privileges. |
| [SHOW USER PROGRAMMATIC ACCESS TOKENS](sql/show-user-programmatic-access-tokens.md) | Lists the [programmatic access tokens](../user-guide/programmatic-access-tokens.md) associated with a user. |
| [SHOW USER WORKLOAD IDENTITY AUTHENTICATION METHODS](sql/show-user-workload-identity-authentication-methods.md) | **Related Topics** |
| [SHOW USERS](sql/show-users.md) | Lists all [users](../user-guide/admin-user-management.md) in the system. |
| [SHOW VARIABLES](sql/show-variables.md) | Lists all [variables](session-variables.md) defined in the current session. |
| [SHOW VERSIONS IN APPLICATION PACKAGE](sql/show-versions.md) | Lists the versions defined in the specified application package. |
| [SHOW VERSIONS IN DATASET](sql/show-versions-in-dataset.md) | Displays information about the datasets in your account at either the schema or database level. |
| [SHOW VERSIONS IN DBT PROJECT](sql/show-versions-in-dbt-project.md) | Displays a list of all versions of a [dbt project object](../user-guide/data-engineering/dbt-projects-on-snowflake.md). |
| [SHOW VERSIONS IN LISTING](sql/show-versions-in-listing.md) | Lists and provides details of all listing versions. |
| [SHOW VERSIONS IN MODEL](sql/show-versions-in-model.md) | Lists the versions in a machine learning model. |
| [SHOW VERSIONS IN ORGANIZATION PROFILE](sql/show-versions-in-organization-profile.md) | Lists the organization profile versions for which you have access privileges. |
| [SHOW VIEWS](sql/show-views.md) | Lists the views, including secure views, for which you have access privileges. |
| [SHOW WAREHOUSES](sql/show-warehouses.md) | Lists all the [virtual warehouses](../user-guide/warehouses-overview.md) in your account for which you have access privileges. |
| [SHOW WORKSPACES](sql/show-workspaces.md) | Lists the [workspaces](../user-guide/ui-snowsight/workspaces.md) for which you have access privileges. |
| **T** |  |
| [TRUNCATE MATERIALIZED VIEW](sql/truncate-materialized-view.md) | Removes all rows from a materialized view, but leaves the view intact (including all privileges and constraints on the materialized view). |
| [TRUNCATE TABLE](sql/truncate-table.md) | Removes all rows from a table but leaves the table intact (including all privileges and constraints on the table). |
| **U** |  |
| [UNDROP <object>](sql/undrop.md) | Restores the specified object to the system. |
| [UNDROP ACCOUNT](sql/undrop-account.md) | Restores a [dropped account](../user-guide/organizations-manage-accounts-delete.md) that has not yet been permanently deleted (a dropped account that is within its grace period). |
| [UNDROP DATABASE](sql/undrop-database.md) | Restores the most recent version of a dropped database. |
| [UNDROP DYNAMIC TABLE](sql/undrop-dynamic-table.md) | Restores the most recent version of a dropped [dynamic table](../user-guide/dynamic-tables-about.md). |
| [UNDROP EXTERNAL VOLUME](sql/undrop-external-volume.md) | Restores the most recent version of a dropped [external volume](../user-guide/tables-iceberg.md). |
| [UNDROP ICEBERG TABLE](sql/undrop-iceberg-table.md) | Restores the most recent version of a dropped [Apache Iceberg™ table](../user-guide/tables-iceberg.md). |
| [UNDROP NOTEBOOK](sql/undrop-notebook.md) | Restores the most recent version of a dropped notebook. |
| [UNDROP SCHEMA](sql/undrop-schema.md) | Restore the most recent version of a dropped schema. |
| [UNDROP SNAPSHOT](sql/undrop-snapshot.md) | Restores a previously removed [snapshot of a block storage volume](../developer-guide/snowpark-container-services/block-storage-volume.md). |
| [UNDROP STREAMLIT](sql/undrop-streamlit.md) | Restores the most recent version of a dropped Streamlit object. |
| [UNDROP TABLE](sql/undrop-table.md) | Restores the most recent version of a dropped table. |
| [UNDROP TAG](sql/undrop-tag.md) | Restores the most recent version of a tag to the system. |
| [UNDROP TYPE](sql/undrop-type.md) | Restores the most recent version of a [user-defined type](data-types-user-defined.md). |
| [UNSET](sql/unset.md) | Drops a [session variable](session-variables.md). |
| [UPDATE](sql/update.md) | Updates specified rows in the target table with new values. |
| [USE <object>](sql/use.md) | Specifies the role, warehouse, database, or schema to use for the current session. |
| [USE DATABASE](sql/use-database.md) | Specifies the active/current database for the session. |
| [USE ROLE](sql/use-role.md) | Specifies the active/current primary role for the session. |
| [USE SCHEMA](sql/use-schema.md) | Specifies the active/current schema for the session. |
| [USE SECONDARY ROLES](sql/use-secondary-roles.md) | Specifies the active/current secondary roles for the session. |
| [USE WAREHOUSE](sql/use-warehouse.md) | Specifies the active/current [virtual warehouse](../user-guide/warehouses-overview.md) for the session. |

---
title: All functions (alphabetical)
source: https://docs.snowflake.com/en/sql-reference/functions-all.md
section: SQL General Reference
---

# All functions (alphabetical)

This topic provides a list of all Snowflake system-defined (i.e. built-in) functions, scalar or table, in alphabetical order.

The list includes:

* The name of each function.
* A summary of each function.
* A list of the categories that the function belongs in.

| Function Name | Summary | Category |
| --- | --- | --- |
| **A** |  |  |
| [ABS](functions/abs.md) | Returns the absolute value of a numeric expression. | [Numeric functions](functions-numeric.md) |
| [ACOS](functions/acos.md) | Computes the inverse cosine (arc cosine) of its input; the result is a number in the interval `[0, pi]`. | [Numeric functions](functions-numeric.md) |
| [ACOSH](functions/acosh.md) | Computes the inverse (arc) hyperbolic cosine of its input. | [Numeric functions](functions-numeric.md) |
| [ADD_MONTHS](functions/add_months.md) | Adds or subtracts a specified number of months to a date or timestamp, preserving the end-of-month information. | [Date & time functions](functions-date-time.md) |
| [AGENT_RUN (SNOWFLAKE.CORTEX)](functions/agent_run-snowflake-cortex.md) | Runs a [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md) without an agent object and returns the response as JSON. | [String & binary functions](functions-string.md) |
| [AGG](functions/agg.md) | Evaluates and returns the value of a metric in a [semantic view](../user-guide/views-semantic/overview.md) when you [run a query](../user-guide/views-semantic/querying.md). | [Aggregate functions](functions-aggregation.md) |
| [AI_AGG](functions/ai_agg.md) | Reduces a column of text data using a natural language instruction. | [Aggregate functions](functions-aggregation.md) , [String & binary functions](functions-string.md) |
| [AI_CLASSIFY](functions/ai_classify.md) | Classifies text or images into categories that you specify. | [String & binary functions](functions-string.md) |
| [AI_COMPLETE (Prompt object)](functions/ai_complete-prompt-object.md) | Generates a response (completion) for a prompt object. | [String & binary functions](functions-string.md) |
| [AI_COMPLETE (Single image)](functions/ai_complete-single-file.md) | Generates a response (completion) for a text prompt using a supported language model. | [String & binary functions](functions-string.md) |
| [AI_COMPLETE (Single string)](functions/ai_complete-single-string.md) | Generates a response (completion) for a text prompt using a supported language model. | [String & binary functions](functions-string.md) |
| [AI_COMPLETE](functions/ai_complete.md) | Generates a response (completion) from text or an image using a supported language model. | [String & binary functions](functions-string.md) , [File functions](functions-file.md) |
| [AI_COUNT_TOKENS](functions/ai_count_tokens.md) | Returns an estimate of the number of tokens in a prompt for the specified large language model or task-specific function. | [String & binary functions](functions-string.md) |
| [AI_EMBED](functions/ai_embed.md) | Creates an embedding vector from text or an image. | [String & binary functions](functions-string.md) |
| [AI_EXTRACT (Document AI legacy models)](functions/ai_extract-document-ai.md) | Extracts information from a file using a legacy Document AI model. | [String & binary functions](functions-string.md) |
| [AI_EXTRACT](functions/ai_extract.md) | Extracts information from an input string or file. | [String & binary functions](functions-string.md) |
| [AI_FILTER](functions/ai_filter.md) | Classifies free-form prompt inputs into a boolean. | [String & binary functions](functions-string.md) |
| [AI_PARSE_DOCUMENT](functions/ai_parse_document.md) | Returns the extracted content from a document on a Snowflake stage as a JSON-formatted string. | [String & binary functions](functions-string.md) |
| [AI_REDACT](functions/ai_redact.md) | Detects and redacts personally identifiable information (PII) from unstructured text data. | [String & binary functions](functions-string.md) |
| [AI_SENTIMENT](functions/ai_sentiment.md) | Returns overall and category [sentiment](../user-guide/snowflake-cortex/ai-sentiment.md) in the given input text. | [String & binary functions](functions-string.md) |
| [AI_SIMILARITY](functions/ai_similarity.md) | Computes a similarity score based on the vector cosine similarity value of the inputs’ embedding vectors. | [String & binary functions](functions-string.md) |
| [AI_SUMMARIZE_AGG](functions/ai_summarize_agg.md) | Summarizes a column of text data. | [Aggregate functions](functions-aggregation.md) , [String & binary functions](functions-string.md) |
| [AI_TRANSCRIBE](functions/ai_transcribe.md) | Transcribes text from an audio or video file with optional timestamps and speaker labels. | [File functions](functions-file.md) |
| [AI_TRANSLATE](functions/ai_translate.md) | Translates the given input text from one supported language to another. | [String & binary functions](functions-string.md) |
| [ALERT_HISTORY](functions/alert_history.md) | This INFORMATION_SCHEMA table function can be used to query the history of [alerts](../user-guide/alerts.md) within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [ALL_USER_NAMES](functions/all_user_names.md) | Returns all user names in the current account. | [Context functions](functions-context.md) |
| [ANY_VALUE](functions/any_value.md) | Returns some value of the expression from the group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [APPLICATION_CALLBACK_HISTORY](functions/application_callback_history.md) | Returns information about the history of [callback](../developer-guide/native-apps/callbacks.md) invocations for Snowflake Native Apps in your Snowflake account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [APPLICATION_CONFIGURATION_VALUE_HISTORY](functions/application_configuration_value_history.md) | [Table functions](functions-table.md) (Tables) | [Table functions](functions-table.md) (Tables) |
| [APPLICATION_JSON](functions/application_json.md) | Returns a JSON object that specifies the JSON message to use for a notification. | [Notification functions](functions-notification.md) |
| [APPLICATION_SPECIFICATION_STATUS_HISTORY](functions/application_specification_status_history.md) | Returns information about the history of the [status changes for app specifications](../developer-guide/native-apps/ui-consumer-app-spec.md) in your Snowflake account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [APPROX_COUNT_DISTINCT](functions/approx_count_distinct.md) | Uses HyperLogLog to return an approximation of the distinct cardinality of the input (i.e. `HLL(col1, col2, ... )` returns an approximation of `COUNT(DISTINCT col1, col2, ... )`). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [APPROX_PERCENTILE](functions/approx_percentile.md) | Returns an approximated value for the desired percentile (that is, if column `c` has `n` numbers, APPROX_PERCENTILE(c, p) returns a number such that approximately `n * p` of the numbers in `c` are smaller than the returned number). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [APPROX_PERCENTILE_ACCUMULATE](functions/approx_percentile_accumulate.md) | Returns the internal representation of the t-Digest state (as a JSON object) at the end of aggregation. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROX_PERCENTILE_COMBINE](functions/approx_percentile_combine.md) | Combines (merges) percentile input states into a single output state. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROX_PERCENTILE_ESTIMATE](functions/approx_percentile_estimate.md) | Returns the desired approximated percentile value for the specified t-Digest state. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROX_TOP_K](functions/approx_top_k.md) | Uses Space-Saving to return an approximation of the most frequent values in the input, along with their approximate frequencies. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [APPROX_TOP_K_ACCUMULATE](functions/approx_top_k_accumulate.md) | Returns the Space-Saving summary at the end of aggregation. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROX_TOP_K_COMBINE](functions/approx_top_k_combine.md) | Combines (merges) input states into a single output state. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROX_TOP_K_ESTIMATE](functions/approx_top_k_estimate.md) | Returns the approximate most frequent values and their estimated frequency for the given Space-Saving state. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [APPROXIMATE_JACCARD_INDEX](functions/approximate_jaccard_index.md) | Returns an estimation of the similarity (Jaccard index) of inputs based on their MinHash states. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [APPROXIMATE_SIMILARITY](functions/approximate_similarity.md) | Returns an estimation of the similarity (Jaccard index) of inputs based on their MinHash states. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [ARRAY_AGG](functions/array_agg.md) | Returns the input values, pivoted into an array. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_APPEND](functions/array_append.md) | Returns an array containing all elements from the source array as well as the new element. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_CAT](functions/array_cat.md) | Returns a concatenation of two arrays. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_COMPACT](functions/array_compact.md) | Returns a compacted array with missing and null values removed, effectively converting sparse arrays into dense arrays. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_CONSTRUCT](functions/array_construct.md) | Returns an array constructed from zero, one, or more inputs. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_CONSTRUCT_COMPACT](functions/array_construct_compact.md) | Returns an array constructed from zero, one, or more inputs; the constructed array omits any NULL input values. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_CONTAINS](functions/array_contains.md) | Returns TRUE if the specified value is found in the specified array. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_DISTINCT](functions/array_distinct.md) | Returns a new [ARRAY](data-types-semistructured.md) that contains only the distinct elements from the input ARRAY. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_EXCEPT](functions/array_except.md) | Returns a new [ARRAY](data-types-semistructured.md) that contains the elements from one input ARRAY that are not in another input ARRAY. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_FLATTEN](functions/array_flatten.md) | Flattens an [ARRAY](data-types-semistructured.md) of ARRAYs into a single ARRAY. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_GENERATE_RANGE](functions/array_generate_range.md) | Returns an [ARRAY](data-types-semistructured.md) of integer values within a specified range (e.g. `[2, 3, 4]`). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_INSERT](functions/array_insert.md) | Returns an array containing all elements from the source array as well as the new element. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_INTERSECTION](functions/array_intersection.md) | Returns an array that contains the matching elements in the two input arrays. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_MAX](functions/array_max.md) | Given an input [ARRAY](data-types-semistructured.md), returns the element with the highest value that is not a SQL NULL. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_MIN](functions/array_min.md) | Given an input [ARRAY](data-types-semistructured.md), returns the element with the lowest value that is not a SQL NULL. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_POSITION](functions/array_position.md) | Returns the index of the first occurrence of an element in an array. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_PREPEND](functions/array_prepend.md) | Returns an array containing the new element as well as all elements from the source array. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_REMOVE](functions/array_remove.md) | Given a source [ARRAY](data-types-semistructured.md), returns an ARRAY with elements of the specified value removed. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_REMOVE_AT](functions/array_remove_at.md) | Given a source [ARRAY](data-types-semistructured.md), returns an ARRAY with the element at the specified position removed. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_REVERSE](functions/array_reverse.md) | Returns an [array](data-types-semistructured.md) with the elements of the input array in reverse order. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_SIZE](functions/array_size.md) | Returns the size of the input array. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_SLICE](functions/array_slice.md) | Returns an array constructed from a specified subset of elements of the input array. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_SORT](functions/array_sort.md) | Returns an [ARRAY](data-types-semistructured.md) that contains the elements of the input ARRAY sorted in ascending or descending order. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_TO_STRING](functions/array_to_string.md) | Returns an input array converted to a string by casting all values to strings (using [TO_VARCHAR](functions/to_char.md)) and concatenating them (using the string from the second argument to separate the elements). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAY_UNION_AGG](functions/array_union_agg.md) | Returns an [ARRAY](data-types-semistructured.md) that contains the union of the distinct values from the input arrays in a column. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [ARRAY_UNIQUE_AGG](functions/array_unique_agg.md) | Returns an [ARRAY](data-types-semistructured.md) that contains all of the distinct values from the specified column. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [ARRAYS_OVERLAP](functions/arrays_overlap.md) | Compares whether two arrays have at least one element in common. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAYS_TO_OBJECT](functions/arrays_to_object.md) | Returns an [OBJECT](data-types-semistructured.md) that contains the keys specified by one input [ARRAY](data-types-semistructured.md) and the values specified by another input ARRAY. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ARRAYS_ZIP](functions/arrays_zip.md) | Returns an [array](data-types-semistructured.md) of [objects](data-types-semistructured.md), each of which contains key-value pairs for an nth element in the input arrays. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_<object_type>](functions/as.md) | You can use this family of functions to perform strict casting of VARIANT values to values of other data types. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_ARRAY](functions/as_array.md) | Casts a [VARIANT](data-types-semistructured.md) value to an [ARRAY](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_BINARY](functions/as_binary.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [BINARY](data-types-text.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_BOOLEAN](functions/as_boolean.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [BOOLEAN](data-types-logical.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_CHAR , AS_VARCHAR](functions/as_char-varchar.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [VARCHAR](data-types-text.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_DATE](functions/as_date.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [DATE](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_DECIMAL , AS_NUMBER](functions/as_decimal-number.md) | Casts a [VARIANT](data-types-semistructured.md) value to a fixed-point [NUMBER](data-types-numeric.md) value, with optional precision and scale. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_DOUBLE , AS_REAL](functions/as_double-real.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [floating-point value](data-types-numeric.md). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_INTEGER](functions/as_integer.md) | Casts a [VARIANT](data-types-semistructured.md) value to an [INTEGER](data-types-numeric.md). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_OBJECT](functions/as_object.md) | Casts a [VARIANT](data-types-semistructured.md) value to an [OBJECT](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_TIME](functions/as_time.md) | Casts a [VARIANT](data-types-semistructured.md) value to a [TIME](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [AS_TIMESTAMP_\*](functions/as_timestamp.md) | Casts a [VARIANT](data-types-semistructured.md) value to the respective [timestamp](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [ASCII](functions/ascii.md) | Returns the ASCII code for the first character of a string. | [String & binary functions](functions-string.md) |
| [ASIN](functions/asin.md) | Computes the inverse sine (arc sine) of its argument; the result is a number in the interval `[-pi/2, pi/2]`. | [Numeric functions](functions-numeric.md) |
| [ASINH](functions/asinh.md) | Computes the inverse (arc) hyperbolic sine of its argument. | [Numeric functions](functions-numeric.md) |
| [ATAN](functions/atan.md) | Computes the inverse tangent (arc tangent) of its argument; the result is a number in the interval `[-pi, pi]`. | [Numeric functions](functions-numeric.md) |
| [ATAN2](functions/atan2.md) | Computes the inverse tangent (arc tangent) of the ratio of its two arguments. | [Numeric functions](functions-numeric.md) |
| [ATANH](functions/atanh.md) | Computes the inverse (arc) hyperbolic tangent of its argument. | [Numeric functions](functions-numeric.md) |
| [AUTO_REFRESH_REGISTRATION_HISTORY](functions/auto_refresh_registration_history.md) | This table function can be used to query the history of data files registered in the metadata for a specified external table or directory table and the credits billed for these operations. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [AUTOMATIC_CLUSTERING_HISTORY](functions/automatic_clustering_history.md) | This table function is used for querying the [Automatic Clustering](../user-guide/tables-auto-reclustering.md) history for given tables within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [AVAILABLE_LISTING_REFRESH_HISTORY](functions/available_listing_refresh_history.md) | Returns the past 14 days of refresh history for an available listing or a database mounted from a listing using cross-cloud auto-fulfillment. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [AVAILABLE_LISTINGS](functions/available_listings.md) | Returns all listings that are available for the consumer to discover and access. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [AVG](functions/avg.md) | Returns the average of non-NULL records. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| **B** |  |  |
| [BASE64_DECODE_BINARY](functions/base64_decode_binary.md) | Decodes a Base64-encoded string to a binary. | [String & binary functions](functions-string.md) |
| [BASE64_DECODE_STRING](functions/base64_decode_string.md) | Decodes a Base64-encoded string to a string. | [String & binary functions](functions-string.md) |
| [BASE64_ENCODE](functions/base64_encode.md) | Encodes the input (string or binary) using Base64 encoding. | [String & binary functions](functions-string.md) |
| [[ NOT ] BETWEEN](functions/between.md) | Returns `TRUE` when the input expression (numeric or string) is within the specified lower and upper boundary. | [Conditional expression functions](expressions-conditional.md) |
| [BIND_VALUES](functions/bind_values.md) | This INFORMATION_SCHEMA table function returns information about the values of [bind variables](bind-variables.md) used in queries. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [BIT_LENGTH](functions/bit_length.md) | Returns the length of a string or binary value in bits. | [String & binary functions](functions-string.md) |
| [BITAND](functions/bitand.md) | Returns the bitwise AND of two numeric or binary expressions. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITAND_AGG](functions/bitand_agg.md) | Returns the bitwise AND value of all non-NULL numeric records in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Bitwise expression functions](expressions-byte-bit.md) |
| [BITMAP_BIT_POSITION](functions/bitmap_bit_position.md) | Given a numeric value, returns the relative position for the bit that represents that value in a bitmap. | [Aggregate functions](functions-aggregation.md) |
| [BITMAP_BUCKET_NUMBER](functions/bitmap_bucket_number.md) | Given a numeric value, returns an identifier (“bucket number”) for the bitmap containing the bit that represents the value.. | [Aggregate functions](functions-aggregation.md) |
| [BITMAP_CONSTRUCT_AGG](functions/bitmap_construct_agg.md) | Returns a bitmap with bits set for each distinct value in a group. | [Aggregate functions](functions-aggregation.md) |
| [BITMAP_COUNT](functions/bitmap_count.md) | Given a bitmap that represents the set of distinct values for a column, returns the number of distinct value. | [Aggregate functions](functions-aggregation.md) |
| [BITMAP_OR_AGG](functions/bitmap_or_agg.md) | Returns a bitmap containing the results of a binary OR operation on the input bitmaps. | [Aggregate functions](functions-aggregation.md) |
| [BITNOT](functions/bitnot.md) | Returns the bitwise negation of a numeric or binary expression. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITOR](functions/bitor.md) | Returns the bitwise OR of two numeric or binary expressions. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITOR_AGG](functions/bitor_agg.md) | Returns the bitwise OR value of all non-NULL numeric records in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Bitwise expression functions](expressions-byte-bit.md) |
| [BITSHIFTLEFT](functions/bitshiftleft.md) | Shifts the bits for a numeric or binary expression `n` positions to the left. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITSHIFTRIGHT](functions/bitshiftright.md) | Shifts the bits for a numeric or binary expression `n` positions to the right. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITXOR](functions/bitxor.md) | Returns the bitwise XOR of two numeric or binary expressions. | [Bitwise expression functions](expressions-byte-bit.md) |
| [BITXOR_AGG](functions/bitxor_agg.md) | Returns the bitwise XOR value of all non-NULL numeric records in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Bitwise expression functions](expressions-byte-bit.md) |
| [BOOLAND](functions/booland.md) | Computes the Boolean AND of two numeric expressions. | [Conditional expression functions](expressions-conditional.md) |
| [BOOLAND_AGG](functions/booland_agg.md) | Returns TRUE if all non-NULL Boolean records in a group evaluate to TRUE. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Conditional expression functions](expressions-conditional.md) |
| [BOOLNOT](functions/boolnot.md) | Computes the Boolean NOT of a single numeric expression. | [Conditional expression functions](expressions-conditional.md) |
| [BOOLOR](functions/boolor.md) | Computes the Boolean OR of two numeric expressions. | [Conditional expression functions](expressions-conditional.md) |
| [BOOLOR_AGG](functions/boolor_agg.md) | Returns TRUE if at least one Boolean record in a group evaluates to TRUE. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Conditional expression functions](expressions-conditional.md) |
| [BOOLXOR](functions/boolxor.md) | Computes the Boolean XOR of two numeric expressions; that is, one of the expressions, but not both expressions, is true. | [Conditional expression functions](expressions-conditional.md) |
| [BOOLXOR_AGG](functions/boolxor_agg.md) | Returns TRUE if exactly one Boolean record in the group evaluates to TRUE. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Conditional expression functions](expressions-conditional.md) |
| [BUILD_SCOPED_FILE_URL](functions/build_scoped_file_url.md) | Generates a scoped Snowflake file URL to a staged file using the stage name and relative file path as inputs. | [File functions](functions-file.md) |
| [BUILD_STAGE_FILE_URL](functions/build_stage_file_url.md) | Generates a Snowflake *file URL* to a staged file using the stage name and relative file path as inputs. | [File functions](functions-file.md) |
| **C** |  |  |
| [CASE](functions/case.md) | Works like a cascading “if-then-else” statement. | [Conditional expression functions](expressions-conditional.md) |
| [CAST , ::](functions/cast.md) | Converts a value of one data type into another data type. | [Conversion functions](functions-conversion.md) |
| [CBRT](functions/cbrt.md) | Returns the cubic root of a numeric expression. | [Numeric functions](functions-numeric.md) |
| [CEIL](functions/ceil.md) | Returns values from `input_expr` rounded to the nearest equal or larger integer, or to the nearest equal or larger value with the specified number of places after the decimal point. | [Numeric functions](functions-numeric.md) |
| [CHARINDEX](functions/charindex.md) | Searches for the first occurrence of the first argument in the second argument and, if successful, returns the position (1-based) of the first argument in the second argument. | [String & binary functions](functions-string.md) |
| [CHECK_JSON](functions/check_json.md) | Checks the validity of a JSON document. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [CHECK_XML](functions/check_xml.md) | Checks the validity of an [XML](../user-guide/semistructured-data-formats.md) document. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [CHR , CHAR](functions/chr.md) | Converts a Unicode code point (including 7-bit ASCII) into the character that matches the input Unicode. | [String & binary functions](functions-string.md) |
| [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](functions/classify_text-snowflake-cortex.md) | Classifies free-form text into categories that you provide. | [String & binary functions](functions-string.md) |
| [COALESCE](functions/coalesce.md) | Returns the first non-NULL expression among its arguments, or NULL if all its arguments are NULL. | [Conditional expression functions](expressions-conditional.md) |
| [COLLATE](functions/collate.md) | Returns a copy of the original string, but with the specified `collation_specification` property instead of the original `collation_specification` property. | [String & binary functions](functions-string.md) |
| [COLLATION](functions/collation.md) | Returns the collation specification of the expression. | [String & binary functions](functions-string.md) |
| [COMPLETE (SNOWFLAKE.CORTEX) (multimodal)](functions/complete-snowflake-cortex-multimodal.md) | Given an image and a prompt, generates a response (completion) using a language model. | [String & binary functions](functions-string.md) |
| [COMPLETE (SNOWFLAKE.CORTEX)](functions/complete-snowflake-cortex.md) | Given a prompt, generates a response (completion) using your choice of supported language model. | [String & binary functions](functions-string.md) |
| [COMPLETE_TASK_GRAPHS](functions/complete_task_graphs.md) | Returns the status of a completed *graph* run. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [COMPRESS](functions/compress.md) | Compresses the input string or binary value with a compression method. | [String & binary functions](functions-string.md) |
| [CONCAT , ||](functions/concat.md) | Concatenates one or more strings, or concatenates one or more binary values. | [String & binary functions](functions-string.md) |
| [CONCAT_WS](functions/concat_ws.md) | Concatenates two or more strings, or concatenates two or more binary values, and uses the first argument as a delimiter between the following strings. | [String & binary functions](functions-string.md) |
| [CONDITIONAL_CHANGE_EVENT](functions/conditional_change_event.md) | Returns a window event number for each row within a window partition when the value of the argument `expr1` in the current row is different from the value of `expr1` in the previous row. | [Window functions](functions-window.md) |
| [CONDITIONAL_TRUE_EVENT](functions/conditional_true_event.md) | Returns a window event number for each row within a window partition based on the result of the boolean argument `expr1`. | [Window functions](functions-window.md) |
| [CONTAINS](functions/contains.md) | Returns true if `expr1` contains `expr2`. | [String & binary functions](functions-string.md) |
| [CONVERT_TIMEZONE](functions/convert_timezone.md) | Converts a timestamp to another time zone. | [Date & time functions](functions-date-time.md) |
| [COPY_HISTORY](functions/copy_history.md) | This table function can be used to query Snowflake data loading history along various dimensions within the last 14 days. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [CORR](functions/corr.md) | Returns the correlation coefficient for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [CORTEX_SEARCH_DATA_SCAN](functions/cortex_search_data_scan.md) | This table function returns the data indexed by a [Cortex Search service](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md), including the columns defined in the source query and the computed vector embeddings for the search column. | [Table functions](functions-table.md) |
| [CORTEX_SEARCH_REFRESH_HISTORY](functions/cortex_search_refresh_history.md) | This table function returns information about each refresh (completed and running) of [Cortex Search services](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [COS](functions/cos.md) | Computes the cosine of its argument; the argument should be expressed in radians. | [Numeric functions](functions-numeric.md) |
| [COSH](functions/cosh.md) | Computes the hyperbolic cosine of its argument. | [Numeric functions](functions-numeric.md) |
| [COT](functions/cot.md) | Computes the cotangent of its argument; the argument should be expressed in radians. | [Numeric functions](functions-numeric.md) |
| [COUNT](functions/count.md) | Returns either the number of non-NULL records for the specified columns, or the total number of records. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [COUNT_IF](functions/count_if.md) | Returns the number of records that satisfy a condition or NULL if no records satisfy the condition. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [COUNT_TOKENS (SNOWFLAKE.CORTEX)](functions/count_tokens-snowflake-cortex.md) | Returns the number of tokens in a prompt for the large language model or the task-specific function specified in the argument. | [String & binary functions](functions-string.md) |
| [COVAR_POP](functions/covar_pop.md) | Returns the population covariance for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [COVAR_SAMP](functions/covar_samp.md) | Returns the sample covariance for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [CUME_DIST](functions/cume_dist.md) | Finds the cumulative distribution of a value with regard to other values within the same window partition. | [Window functions](functions-window.md) |
| [CUMULATIVE_PRIVACY_LOSSES](functions/cumulative_privacy_losses.md) | Returns the privacy budgets associated with a specific [privacy policy](../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md). | [Table functions](functions-table.md) |
| [CURRENT_ACCOUNT](functions/current_account.md) | Returns the [account locator](../user-guide/admin-account-identifier.md) used by the user’s current session. | [Context functions](functions-context.md) |
| [CURRENT_ACCOUNT_NAME](functions/current_account_name.md) | Returns the name of the current account. | [Context functions](functions-context.md) |
| [CURRENT_AVAILABLE_ROLES](functions/current_available_roles.md) | Returns a list of all account-level roles granted to the current user. | [Context functions](functions-context.md) |
| [CURRENT_CLIENT](functions/current_client.md) | Returns the version of the client from which the function was called. | [Context functions](functions-context.md) |
| [CURRENT_DATABASE](functions/current_database.md) | Returns the name of the current database, which varies depending on where you call the function. | [Context functions](functions-context.md) |
| [CURRENT_DATE](functions/current_date.md) | Returns the current date of the system. | [Context functions](functions-context.md) |
| [CURRENT_IP_ADDRESS](functions/current_ip_address.md) | Returns the IP address of the client that submitted the request. | [Context functions](functions-context.md) |
| [CURRENT_ORGANIZATION_NAME](functions/current_organization_name.md) | Returns the name of the organization to which the current account belongs. | [Context functions](functions-context.md) |
| [CURRENT_ORGANIZATION_USER](functions/current_organization_user.md) | Returns the name of the user currently logged into the system, but only if the user is an [organization user](../user-guide/organization-users.md). | [Context functions](functions-context.md) |
| [CURRENT_REGION](functions/current_region.md) | Returns the name of the region for the account where the current user is logged in. | [Context functions](functions-context.md) |
| [CURRENT_ROLE](functions/current_role.md) | Returns the name of the [primary role](../user-guide/security-access-control-overview.md) in use for the current session when the primary role is an account-level role or NULL if the role in use for the current session is a database role. | [Context functions](functions-context.md) |
| [CURRENT_ROLE_TYPE](functions/current_role_type.md) | Calling the CURRENT_ROLE_TYPE function returns `ROLE` if the current active (primary) role in the session is an account role. | [Context functions](functions-context.md) |
| [CURRENT_SCHEMA](functions/current_schema.md) | Returns the name of the current schema, which varies depending on where you call the function. | [Context functions](functions-context.md) |
| [CURRENT_SCHEMAS](functions/current_schemas.md) | Returns active search path schemas. | [Context functions](functions-context.md) |
| [CURRENT_SECONDARY_ROLES](functions/current_secondary_roles.md) | Returns the [secondary roles](../user-guide/security-access-control-overview.md) in use for the current session. | [Context functions](functions-context.md) |
| [CURRENT_SESSION](functions/current_session.md) | Returns a unique system identifier for the Snowflake session corresponding to the present connection. | [Context functions](functions-context.md) |
| [CURRENT_STATEMENT](functions/current_statement.md) | Returns the SQL text of the statement that is currently executing. | [Context functions](functions-context.md) |
| [CURRENT_TASK_GRAPHS](functions/current_task_graphs.md) | Returns the status of a *graph* run that is currently scheduled or is executing. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [CURRENT_TIME](functions/current_time.md) | Returns the current time for the system. | [Context functions](functions-context.md) |
| [CURRENT_TIMESTAMP](functions/current_timestamp.md) | Returns the current timestamp for the system in the local time zone. | [Context functions](functions-context.md) |
| [CURRENT_TRANSACTION](functions/current_transaction.md) | Returns the transaction id of an open transaction in the current session. | [Context functions](functions-context.md) |
| [CURRENT_USER](functions/current_user.md) | Returns the name of the user currently logged into the system. | [Context functions](functions-context.md) |
| [CURRENT_VERSION](functions/current_version.md) | Returns the current Snowflake version. | [Context functions](functions-context.md) |
| [CURRENT_WAREHOUSE](functions/current_warehouse.md) | Returns the name of the warehouse in use for the current session. | [Context functions](functions-context.md) |
| **D** |  |  |
| [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](functions/data_agent_run-snowflake-cortex.md) | Runs a [Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md) and returns the response as JSON. | [String & binary functions](functions-string.md) |
| [DATA_METRIC_FUNCTION_EXPECTATIONS](functions/data_metric_function_expectations.md) | Returns information about the [expectations](../user-guide/data-quality-expectations.md) that exist in the account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATA_METRIC_FUNCTION_REFERENCES](functions/data_metric_function_references.md) | Returns a row for each object that has the specified data metric function assigned to the object or returns a row for each data metric function assigned to the specified object. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATA_QUALITY_MONITORING_EXPECTATION_STATUS](functions/data_quality_monitoring_expectation_status.md) | For a specified object, returns a row for every time a data metric function (DMF) with an [expectation](../user-guide/data-quality-expectations.md) was run. | [LOCAL schema](local.md) , [Table functions](functions-table.md) |
| [DATA_QUALITY_MONITORING_RESULTS](functions/data_quality_monitoring_results.md) | Returns a row for each data metric function assigned to the specified object, which includes the evaluation result and other metadata of the data metric function on the object. | [LOCAL schema](local.md) , [Table functions](functions-table.md) |
| [DATA_TRANSFER_HISTORY](functions/data_transfer_history.md) | This table function can be used to query the history of data transferred from Snowflake tables into a different cloud storage provider’s network (i.e. from Snowflake on AWS, Google Cloud Platform, or Microsoft Azure into the other cloud provider’s network) and/or geographical region within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATABASE_REFRESH_HISTORY](functions/database_refresh_history.md) | Returns the refresh history for a secondary database. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB](functions/database_refresh_progress.md) | The DATABASE_REFRESH_PROGRESS family of functions can be used to query the status of a database refresh along various dimensions. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATABASE_REPLICATION_USAGE_HISTORY](functions/database_replication_usage_history.md) | This table function can be used to query the replication history for a specified database within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATABASE_STORAGE_USAGE_HISTORY](functions/database_storage_usage_history.md) | This table function can be used to query the average daily storage usage, in bytes, for a single database (or all the databases in your account) within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DATASKETCHES_HLL](functions/datasketches_hll.md) | Returns an approximation of the distinct cardinality of the input (that is, `DATASKETCHES_HLL(col1)` returns an approximation of `COUNT(DISTINCT col1)`). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [DATASKETCHES_HLL_ACCUMULATE](functions/datasketches_hll_accumulate.md) | Returns the sketch at the end of aggregation. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [DATASKETCHES_HLL_COMBINE](functions/datasketches_hll_combine.md) | Combines (merges) input sketches into a single output sketch. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [DATASKETCHES_HLL_ESTIMATE](functions/datasketches_hll_estimate.md) | Returns the cardinality estimate for the given sketch. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [DATE_FROM_PARTS](functions/date_from_parts.md) | Creates a date from individual numeric components that represent the year, month, and day of the month. | [Date & time functions](functions-date-time.md) |
| [DATE_PART](functions/date_part.md) | Extracts the specified date or time part from a date, time, or timestamp. | [Date & time functions](functions-date-time.md) |
| [DATE_TRUNC](functions/date_trunc.md) | Truncates a DATE, TIME, or TIMESTAMP value to the specified precision. | [Date & time functions](functions-date-time.md) |
| [DATEADD](functions/dateadd.md) | Adds the specified value for the specified date or time part to a date, time, or timestamp. | [Date & time functions](functions-date-time.md) |
| [DATEDIFF](functions/datediff.md) | Calculates the difference between two date, time, or timestamp expressions based on the date or time part requested. | [Date & time functions](functions-date-time.md) |
| [DAYNAME](functions/dayname.md) | Extracts the three-letter day-of-week name from the specified date or timestamp. | [Date & time functions](functions-date-time.md) |
| [DBT_PROJECT_EXECUTION_HISTORY](functions/dbt_project_execution_history.md) | Returns the execution history of [dbt Projects on Snowflake](../user-guide/data-engineering/dbt-projects-on-snowflake.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DECODE](functions/decode.md) | Compares the select expression to each search expression in order. | [Conditional expression functions](expressions-conditional.md) |
| [DECOMPRESS_BINARY](functions/decompress_binary.md) | Decompresses the compressed `BINARY` input parameter. | [String & binary functions](functions-string.md) |
| [DECOMPRESS_STRING](functions/decompress_string.md) | Decompresses the compressed `BINARY` input parameter to a string. | [String & binary functions](functions-string.md) |
| [DECRYPT](functions/decrypt.md) | Decrypts a BINARY value using a VARCHAR passphrase. | [Encryption functions](functions-encryption.md) |
| [DECRYPT_RAW](functions/decrypt_raw.md) | Decrypts a BINARY value using a BINARY key. | [Encryption functions](functions-encryption.md) |
| [DEGREES](functions/degrees.md) | Converts radians to degrees. | [Numeric functions](functions-numeric.md) |
| [DENSE_RANK](functions/dense_rank.md) | Returns the rank of a value within a group of values, without gaps in the ranks. | [Window function syntax and usage](functions-window-syntax.md) |
| [DIV0](functions/div0.md) | Performs division like the division operator (`/`), but returns 0 when the divisor is 0 (rather than reporting an error). | [Numeric functions](functions-numeric.md) |
| [DIV0NULL](functions/div0null.md) | Performs division like the division operator (`/`), but returns 0 when the divisor is 0 or NULL (rather than reporting an error or returning NULL). | [Numeric functions](functions-numeric.md) |
| [ACCEPTED_VALUES (system data metric function)](functions/dmf_accepted_values.md) | Returns the number of records where the value of a column does *not* match a Boolean expression. | [Data metric functions](functions-data-metric.md) |
| [AVG (system data metric function)](functions/dmf_avg.md) | Returns the average value for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [BLANK_COUNT (system data metric function)](functions/dmf_blank_count.md) | Returns the count of column values that are blank for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [BLANK_PERCENT (system data metric function)](functions/dmf_blank_percent.md) | Returns the percentage of column values that are blank for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [DATA_METRIC_SCHEDULED_TIME (system data metric function)](functions/dmf_data_metric_schedule_time.md) | Returns the timestamp for when a DMF is scheduled to run or the current timestamp if the function is called manually. | [Data metric functions](functions-data-metric.md) |
| [DUPLICATE_COUNT (system data metric function)](functions/dmf_duplicate_count.md) | Returns the count of column values that have duplicates, including NULL values. | [Data metric functions](functions-data-metric.md) |
| [FRESHNESS (system data metric function)](functions/dmf_freshness.md) | Returns how much time in seconds has elapsed since a table was last modified. | [Data metric functions](functions-data-metric.md) |
| [MAX (system data metric function)](functions/dmf_max.md) | Returns the maximum value for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [MIN (system data metric function)](functions/dmf_min.md) | Returns the minimum value for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [NULL_COUNT (system data metric function)](functions/dmf_null_count.md) | Returns the total number of NULL values for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [NULL_PERCENT (system data metric function)](functions/dmf_null_percent.md) | Returns the percentage of columns values that are NULL for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [ROW_COUNT (system data metric function)](functions/dmf_row_count.md) | Returns the total number of rows in a table. | [Data metric functions](functions-data-metric.md) |
| [STDDEV (system data metric function)](functions/dmf_stddev.md) | Returns the standard deviation value for the specified column in a table. | [Data metric functions](functions-data-metric.md) |
| [UNIQUE_COUNT (system data metric function)](functions/dmf_unique_count.md) | Returns the total number of unique non-NULL values for the specified columns in a table. | [Data metric functions](functions-data-metric.md) |
| [DP_INTERVAL_HIGH](functions/dp_interval_high.md) | Returns the upper bound of the [noise interval](../user-guide/diff-privacy/differential-privacy-analyst.md), which is used by differential privacy to help analysts determine how much noise has been introduced into query results. | [Differential privacy functions](functions-differential-privacy.md) |
| [DP_INTERVAL_LOW](functions/dp_interval_low.md) | Returns the lower bound of the [noise interval](../user-guide/diff-privacy/differential-privacy-analyst.md), which is used by differential privacy to help analysts determine how much noise has been introduced into query results. | [Differential privacy functions](functions-differential-privacy.md) |
| [DYNAMIC_TABLE_GRAPH_HISTORY](functions/dynamic_table_graph_history.md) | This table function returns information on all [dynamic tables](../user-guide/dynamic-tables-about.md) in the current account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DYNAMIC_TABLE_REFRESH_HISTORY](functions/dynamic_table_refresh_history.md) | This table function returns information about each refresh (completed and running) of [dynamic tables](../user-guide/dynamic-tables-about.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [DYNAMIC_TABLES](functions/dynamic_tables.md) | This table function returns metadata about [dynamic tables](../user-guide/dynamic-tables-about.md), including aggregate lag metrics and the status of the most recent refreshes, within 7 days of the current time. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| **E** |  |  |
| [EDITDISTANCE](functions/editdistance.md) | Computes the Levenshtein distance between two input strings. | [String & binary functions](functions-string.md) |
| [EMAIL_INTEGRATION_CONFIG](functions/email_integration_config.md) | Returns a JSON object that specifies the email notification integration, recipients, and subject line to use for an email notification. | [Notification functions](functions-notification.md) |
| [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](functions/embed_text-snowflake-cortex.md) | Creates a vector embedding of 768 dimensions from English-language text. | [String & binary functions](functions-string.md) |
| [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](functions/embed_text_1024-snowflake-cortex.md) | Creates a vector embedding of 1024 dimensions from text. | [String & binary functions](functions-string.md) |
| [ENCRYPT](functions/encrypt.md) | Encrypts a VARCHAR or BINARY value using a VARCHAR passphrase. | [Encryption functions](functions-encryption.md) |
| [ENCRYPT_RAW](functions/encrypt_raw.md) | Encrypts a BINARY value using a BINARY key. | [Encryption functions](functions-encryption.md) |
| [ENDSWITH](functions/endswith.md) | Returns TRUE if the first expression ends with the second expression. | [String & binary functions](functions-string.md) |
| [ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)](functions/entity_sentiment-snowflake-cortex.md) | Returns sentiment scores for English-language text, including overall sentiment and specific sentiment for specified entities. | [String & binary functions](functions-string.md) |
| [[ NOT ] EQUAL_NULL](functions/equal_null.md) | Compares whether two expressions are equal. | [Conditional expression functions](expressions-conditional.md) |
| [ESTIMATE_REMAINING_DP_AGGREGATES](functions/estimate_remaining_dp_aggregates.md) | Returns the estimated number of aggregation functions that can be run before the limit of a privacy budget is reached. | [Differential privacy functions](functions-differential-privacy.md) , [Table functions](functions-table.md) |
| [EXECUTE_AI_EVALUATION](functions/execute_ai_evaluation.md) | Start or get the status of a Cortex Agent evaluation run. | [System functions](functions-system.md) |
| [EXP](functions/exp.md) | Computes Euler’s number `e` raised to a floating-point value. | [Numeric functions](functions-numeric.md) |
| [EXPLAIN_JSON](functions/explain_json.md) | This function converts an EXPLAIN plan from JSON to a table. | [System functions](functions-system.md) |
| [EXTERNAL_FUNCTIONS_HISTORY](functions/external_functions_history.md) | This table function retrieves the history of external functions called by Snowflake for your entire Snowflake account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [EXTERNAL_TABLE_FILES](functions/external_table_files.md) | This table function can be used to query information about the staged data files included in the metadata for a specified [external table](../user-guide/tables-external-intro.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY](functions/external_table_registration_history.md) | This table function can be used to query information about the metadata history for an external table. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [EXTRACT](functions/extract.md) | Extracts the specified date or time part from a date, time, or timestamp. | [Date & time functions](functions-date-time.md) |
| [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](functions/extract_answer-snowflake-cortex.md) | Extracts an answer to a given question from a text document. | [String & binary functions](functions-string.md) |
| [EXTRACT_SEMANTIC_CATEGORIES](functions/extract_semantic_categories.md) | Returns a set of categories (semantic and privacy) for each supported column in the specified table or view. | [System functions](functions-system.md) |
| **F** |  |  |
| [FACTORIAL](functions/factorial.md) | Computes the factorial of its input. | [Numeric functions](functions-numeric.md) |
| [FILTER](functions/filter.md) | Filters an [array](data-types-semistructured.md) based on the logic in a lambda expression. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [FINETUNE ('CANCEL') (SNOWFLAKE.CORTEX)](functions/finetune-cancel.md) | Cancels the specified fine-tuning job from the current schema. | [String & binary functions](functions-string.md) |
| [FINETUNE ('CREATE') (SNOWFLAKE.CORTEX)](functions/finetune-create.md) | Creates a fine-tuning job. | [String & binary functions](functions-string.md) |
| [FINETUNE ('DESCRIBE') (SNOWFLAKE.CORTEX)](functions/finetune-describe.md) | Describes the properties of a fine-tuning job. | [String & binary functions](functions-string.md) |
| [FINETUNE ('SHOW') (SNOWFLAKE.CORTEX)](functions/finetune-show.md) | Lists all the fine-tuning jobs in the current account. | [String & binary functions](functions-string.md) |
| [FINETUNE (SNOWFLAKE.CORTEX)](functions/finetune-snowflake-cortex.md) | This function lets you create and manage large language models customized for your specific task. | [String & binary functions](functions-string.md) |
| [FIRST_VALUE](functions/first_value.md) | Returns the first value within an ordered group of values. | [Window function syntax and usage](functions-window-syntax.md) |
| [FL_GET_CONTENT_TYPE](functions/fl_get_content_type.md) | Returns the content type (also known as the MIME type) of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_ETAG](functions/fl_get_etag.md) | Returns the content hash (ETAG) of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_FILE_TYPE](functions/fl_get_file_type.md) | Returns the file type (modality) of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_LAST_MODIFIED](functions/fl_get_last_modified.md) | Returns the last modified date of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_RELATIVE_PATH](functions/fl_get_relative_path.md) | Returns the relative path of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_SCOPED_FILE_URL](functions/fl_get_scoped_file_url.md) | Returns the scoped URL of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_SIZE](functions/fl_get_size.md) | Returns the size, in bytes, of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_STAGE](functions/fl_get_stage.md) | Returns the stage name of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_GET_STAGE_FILE_URL](functions/fl_get_stage_file_url.md) | Returns the stage URL of a [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_IS_AUDIO](functions/fl_is_audio.md) | Checks if the input is an audio [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_IS_COMPRESSED](functions/fl_is_compressed.md) | Checks if the input is a compressed [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_IS_DOCUMENT](functions/fl_is_document.md) | Checks if the input is a document [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_IS_IMAGE](functions/fl_is_image.md) | Checks if the input is an image [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FL_IS_VIDEO](functions/fl_is_video.md) | Checks if the input is a video [FILE](data-types-unstructured.md). | [File functions](functions-file.md) |
| [FLATTEN](functions/flatten.md) | Flattens (explodes) compound values into multiple rows. | [Table functions](functions-table.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [FLOOR](functions/floor.md) | Returns values from `input_expr` rounded to the nearest equal or smaller integer, or to the nearest equal or smaller value with the specified number of places after the decimal point. | [Numeric functions](functions-numeric.md) |
| **G** |  |  |
| [GENERATE_COLUMN_DESCRIPTION](functions/generate_column_description.md) | Generates a list of columns from a set of staged files that contain semi-structured data using the [INFER_SCHEMA](functions/infer_schema.md) function output. | [Metadata functions](functions-metadata.md) |
| [GENERATE_POSTGRES_ACCESS_TOKEN_FOR_USER](functions/generate_postgres_access_token_for_user.md) | Generates a short-lived access token for a Snowflake user to use as a password when logging into a Snowflake Postgres instance that has the AUTHENTICATION_AUTHORITY attribute set to POSTGRES_OR_SNOWFLAKE. | Generates a short-lived access token for a Snowflake user to use as a password when logging into a Snowflake Postgres instance that has the AUTHENTICATION_AUTHORITY attribute set to POSTGRES_OR_SNOWFLAKE. |
| [GENERATOR](functions/generator.md) | Creates rows of data based either on a specified number of rows, a specified generation period (in seconds), or both. | [Table functions](functions-table.md) |
| [GET](functions/get.md) | Extracts a value from an [ARRAY](data-types-semistructured.md) or an [OBJECT](data-types-semistructured.md) (or a [VARIANT](data-types-semistructured.md) that contains an ARRAY or OBJECT). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [GET_ABSOLUTE_PATH](functions/get_absolute_path.md) | Retrieves the absolute path of a staged file using the stage name and path of the file relative to its location in the stage as inputs. | [File functions](functions-file.md) |
| [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](functions/get_ai_evaluation_data-snowflake-local.md) | Retrieves evaluation data for a Cortex Agent evaluation run. | [Table functions](functions-table.md) |
| [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](functions/get_ai_observability_logs-snowflake-local.md) | Retrieve log data for a Cortex Agent observability event, such as a warning or failure. | [Table functions](functions-table.md) |
| [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](functions/get_ai_record_trace-snowflake-local.md) | Retrieve a single trace record from a Cortex Agent evaluation run. | [Table functions](functions-table.md) |
| [GET_ANACONDA_PACKAGES_REPODATA](functions/get_anaconda_packages_repodata.md) | Returns a list of third-party packages that are available from Anaconda. | [System functions](functions-system.md) |
| [GET_CONDITION_QUERY_UUID](functions/get_condition_query_uuid.md) | Returns the query ID for the SQL statement executed for the condition of an [alert](../user-guide/alerts.md). | [Context functions](functions-context.md) |
| [GET_CONFIGURATION_VALUE (SYS_CONTEXT function)](functions/get_configuration_value.md) | Returns the current value for the specified configuration. | [Context functions](functions-context.md) |
| [GET_CONTACTS](functions/get_contacts.md) | Returns the [contacts](../user-guide/contacts-using.md) associated with an object. | [Table functions](functions-table.md) |
| [GET_DDL](functions/get_ddl.md) | Returns a DDL statement that can be used to recreate the specified object. | [Metadata functions](functions-metadata.md) |
| [GET_IGNORE_CASE](functions/get_ignore_case.md) | Extracts a field value from an object; returns NULL if either of the arguments is NULL. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [GET_JOB_HISTORY](functions/get_job_history.md) | Returns the job history for [Snowpark Container Services jobs](../developer-guide/snowpark-container-services/working-with-services.md) that ran within the specified time range. | [Table functions](functions-table.md) |
| [GET_LINEAGE (SNOWFLAKE.CORE)](functions/get_lineage-snowflake-core.md) | Given a Snowflake object, returns data lineage information upstream or downstream from that object. | [Table functions](functions-table.md) |
| [GET_OBJECT_REFERENCES](functions/get_object_references.md) | Returns a list of objects that a specified object references. | [Table functions](functions-table.md) |
| [GET_PATH , :](functions/get_path.md) | Extracts a value from semi-structured data using a path name. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [GET_PRESIGNED_URL](functions/get_presigned_url.md) | Generates a pre-signed URL to a file on a stage using the stage name and relative file path as inputs. | [File functions](functions-file.md) |
| [GET_PYTHON_PROFILER_OUTPUT (SNOWFLAKE.CORE)](functions/get_python_profiler_output.md) | Returns output containing a report generated by the [Python code profiler](../developer-guide/stored-procedure/python/procedure-python-profiler.md). | [System functions](functions-system.md) |
| [GET_QUERY_OPERATOR_STATS](functions/get_query_operator_stats.md) | Returns statistics about individual query operators within a query that has completed. | [System functions](functions-system.md) , [Table functions](functions-table.md) |
| [GET_RELATIVE_PATH](functions/get_relative_path.md) | Extracts the path of a staged file relative to its location in the stage using the stage name and absolute file path in cloud storage as inputs. | [File functions](functions-file.md) |
| [GET_STAGE_LOCATION](functions/get_stage_location.md) | Retrieves the URL for an external or internal named stage using the stage name as the input. | [File functions](functions-file.md) |
| [GETBIT](functions/getbit.md) | Given an INTEGER value, returns the value of a bit at a specified position. | [Bitwise expression functions](expressions-byte-bit.md) |
| [GETDATE](functions/getdate.md) | Returns the current timestamp for the system in the local time zone. | [Context functions](functions-context.md) |
| [GETVARIABLE](functions/getvariable.md) | Returns the value associated with a SQL variable name. | [Context functions](functions-context.md) |
| [GREATEST](functions/greatest.md) | Returns the largest value from a list of expressions. | [Conditional expression functions](expressions-conditional.md) |
| [GREATEST_IGNORE_NULLS](functions/greatest_ignore_nulls.md) | Returns the largest non-NULL value from a list of expressions. | [Conditional expression functions](expressions-conditional.md) |
| [GROUPING](functions/grouping.md) | Describes which of a list of expressions are grouped in a row produced by a [GROUP BY](constructs/group-by.md) query. | [Aggregate functions](functions-aggregation.md) |
| [GROUPING_ID](functions/grouping_id.md) | Describes which of a list of expressions are grouped in a row produced by a [GROUP BY](constructs/group-by.md) query. | [Aggregate functions](functions-aggregation.md) |
| **H** |  |  |
| [H3_CELL_TO_BOUNDARY](functions/h3_cell_to_boundary.md) | Returns the [GEOGRAPHY](data-types-geospatial.md) object representing the boundary of an [H3](data-types-geospatial.md) cell. | [Geospatial functions](functions-geospatial.md) |
| [H3_CELL_TO_CHILDREN](functions/h3_cell_to_children.md) | Returns an [array](data-types-semistructured.md) of the INTEGER IDs of the children of an [H3](data-types-geospatial.md) cell for a given resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_CELL_TO_CHILDREN_STRING](functions/h3_cell_to_children_string.md) | Returns an [array](data-types-semistructured.md) of the VARCHAR values containing the hexadecimal IDs of the children of an [H3](data-types-geospatial.md) cell for a given resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_CELL_TO_PARENT](functions/h3_cell_to_parent.md) | Returns the ID of the parent of an [H3](data-types-geospatial.md) cell for a given resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_CELL_TO_POINT](functions/h3_cell_to_point.md) | Returns the [GEOGRAPHY](data-types-geospatial.md) object representing the Point that is the centroid of an [H3](data-types-geospatial.md) cell. | [Geospatial functions](functions-geospatial.md) |
| [H3_COMPACT_CELLS](functions/h3_compact_cells.md) | Returns an [array](data-types-semistructured.md) of [VARIANT](data-types-semistructured.md) values that contain the INTEGER IDs of fewer, larger [H3](data-types-geospatial.md) cells that cover the same area as the H3 cells in the input. | [Geospatial functions](functions-geospatial.md) |
| [H3_COMPACT_CELLS_STRINGS](functions/h3_compact_cells_strings.md) | Returns an [array](data-types-semistructured.md) of [VARIANT](data-types-semistructured.md) values that contain the VARCHAR hexadecimal IDs of fewer, larger [H3](data-types-geospatial.md) cells that cover the same area as the H3 cells in the input. | [Geospatial functions](functions-geospatial.md) |
| [H3_COVERAGE](functions/h3_coverage.md) | Returns an [array](data-types-semistructured.md) of IDs (as INTEGER values) identifying the minimal set of [H3](data-types-geospatial.md) cells that completely cover a shape (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_COVERAGE_STRINGS](functions/h3_coverage_strings.md) | Returns an [array](data-types-semistructured.md) of hexadecimal IDs (as VARCHAR values) identifying the minimal set of [H3](data-types-geospatial.md) cells that completely cover a shape (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_GET_RESOLUTION](functions/h3_get_resolution.md) | Returns the resolution of an [H3](data-types-geospatial.md) cell. | [Geospatial functions](functions-geospatial.md) |
| [H3_GRID_DISK](functions/h3_grid_disk.md) | Returns an [array](data-types-semistructured.md) of the IDs of the [H3](data-types-geospatial.md) cells that are within the k-distance from the specified cell. | [Geospatial functions](functions-geospatial.md) |
| [H3_GRID_DISTANCE](functions/h3_grid_distance.md) | Returns the distance between two [H3](data-types-geospatial.md) cells specified by their IDs. | [Geospatial functions](functions-geospatial.md) |
| [H3_GRID_PATH](functions/h3_grid_path.md) | Returns an [array](data-types-semistructured.md) of the IDs of the [H3](data-types-geospatial.md) cells that represent the line between two cells. | [Geospatial functions](functions-geospatial.md) |
| [H3_INT_TO_STRING](functions/h3_int_to_string.md) | Converts the INTEGER value of an [H3](data-types-geospatial.md) cell ID to hexadecimal format. | [Geospatial functions](functions-geospatial.md) |
| [H3_IS_PENTAGON](functions/h3_is_pentagon.md) | Returns TRUE if the boundary of an [H3](data-types-geospatial.md) cell represents a pentagon. | [Geospatial functions](functions-geospatial.md) |
| [H3_IS_VALID_CELL](functions/h3_is_valid_cell.md) | Returns TRUE if the input represents a valid [H3](data-types-geospatial.md) cell. | [Geospatial functions](functions-geospatial.md) |
| [H3_LATLNG_TO_CELL](functions/h3_latlng_to_cell.md) | Returns the INTEGER value of the [H3](data-types-geospatial.md) cell ID for a given latitude, longitude, and resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_LATLNG_TO_CELL_STRING](functions/h3_latlng_to_cell_string.md) | Returns the [H3](data-types-geospatial.md) cell ID in hexadecimal format (as a VARCHAR value) for a given latitude, longitude, and resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_POINT_TO_CELL](functions/h3_point_to_cell.md) | Returns the INTEGER value of an [H3](data-types-geospatial.md) cell ID for a Point (specified by a [GEOGRAPHY](data-types-geospatial.md) object) at a given resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_POINT_TO_CELL_STRING](functions/h3_point_to_cell_string.md) | Returns the hexadecimal value of an [H3](data-types-geospatial.md) cell ID for a Point (specified by a [GEOGRAPHY](data-types-geospatial.md) object) at a given resolution. | [Geospatial functions](functions-geospatial.md) |
| [H3_POLYGON_TO_CELLS](functions/h3_polygon_to_cells.md) | Returns an [array](data-types-semistructured.md) of INTEGER values of the IDs of [H3](data-types-geospatial.md) cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_POLYGON_TO_CELLS_STRINGS](functions/h3_polygon_to_cells_strings.md) | Returns an [array](data-types-semistructured.md) of VARCHAR values of the hexadecimal IDs of [H3](data-types-geospatial.md) cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_STRING_TO_INT](functions/h3_string_to_int.md) | Converts an [H3](data-types-geospatial.md) cell ID in hexadecimal format to an INTEGER value. | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_COVERAGE](functions/h3_try_coverage.md) | A special version of [H3_COVERAGE](functions/h3_coverage.md) that returns NULL if an error occurs when it attempts to return an [array](data-types-semistructured.md) of IDs (as INTEGER values) identifying the minimal set of [H3](data-types-geospatial.md) cells that completely cover a shape (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_COVERAGE_STRINGS](functions/h3_try_coverage_strings.md) | A special version of [H3_COVERAGE_STRINGS](functions/h3_coverage_strings.md) that returns NULL if an error occurs when it attempts to return an [array](data-types-semistructured.md) of hexadecimal IDs (as VARCHAR values) identifying the minimal set of [H3](data-types-geospatial.md) cells that completely cover a shape (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_GRID_DISTANCE](functions/h3_try_grid_distance.md) | A special version of [H3_GRID_DISTANCE](functions/h3_grid_distance.md) that returns NULL if an error occurs when it attempts to return the distance between two [H3](data-types-geospatial.md) cells. | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_GRID_PATH](functions/h3_try_grid_path.md) | A special version of [H3_GRID_PATH](functions/h3_grid_path.md) that returns NULL if an error occurs when it attempts to return an array of VARIANT values that contain the IDs of the [H3](data-types-geospatial.md) cells that represent the line between two cells. | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_POLYGON_TO_CELLS](functions/h3_try_polygon_to_cells.md) | A special version of [H3_POLYGON_TO_CELLS](functions/h3_polygon_to_cells.md) that returns NULL if an error occurs when it attempts to return an [array](data-types-semistructured.md) of INTEGER values of the IDs of [H3](data-types-geospatial.md) cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_TRY_POLYGON_TO_CELLS_STRINGS](functions/h3_try_polygon_to_cells_strings.md) | A special version of [H3_POLYGON_TO_CELLS_STRINGS](functions/h3_polygon_to_cells_strings.md) that returns NULL if an error occurs when it attempts to return an [array](data-types-semistructured.md) of VARCHAR values of the hexadecimal IDs of [H3](data-types-geospatial.md) cells that have centroids contained by a Polygon (specified by a [GEOGRAPHY](data-types-geospatial.md) object). | [Geospatial functions](functions-geospatial.md) |
| [H3_UNCOMPACT_CELLS](functions/h3_uncompact_cells.md) | Returns an [array](data-types-semistructured.md) of [VARIANT](data-types-semistructured.md) values that contain the INTEGER IDs of [H3](data-types-geospatial.md) cells at the specified resolution that cover the same area as the H3 cells in the input. | [Geospatial functions](functions-geospatial.md) |
| [H3_UNCOMPACT_CELLS_STRINGS](functions/h3_uncompact_cells_strings.md) | Returns an [array](data-types-semistructured.md) of [VARIANT](data-types-semistructured.md) values that contain the VARCHAR hexadecimal IDs of [H3](data-types-geospatial.md) cells at the specified resolution that cover the same area as the H3 cells in the input. | [Geospatial functions](functions-geospatial.md) |
| [HASH](functions/hash.md) | Returns a signed 64-bit hash value. | [Hash functions](functions-hash-scalar.md) |
| [HASH_AGG](functions/hash_agg.md) | Returns an aggregate signed 64-bit hash value over the (unordered) set of input rows. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [HAVERSINE](functions/haversine.md) | Calculates the great-circle distance in kilometers between two points on the Earth’s surface, using the [Haversine formula](https://en.wikipedia.org/wiki/Haversine_formula). | [Geospatial functions](functions-geospatial.md) |
| [HEX_DECODE_BINARY](functions/hex_decode_binary.md) | Decodes a hex-encoded string to a binary. | [String & binary functions](functions-string.md) |
| [HEX_DECODE_STRING](functions/hex_decode_string.md) | Decodes a hex-encoded string to a string. | [String & binary functions](functions-string.md) |
| [HEX_ENCODE](functions/hex_encode.md) | Encodes the input using hexadecimal (also ‘hex’ or ‘base16’) encoding. | [String & binary functions](functions-string.md) |
| [HLL](functions/hll.md) | Uses HyperLogLog to return an approximation of the distinct cardinality of the input (i.e. `HLL(col1, col2, ... )` returns an approximation of `COUNT(DISTINCT col1, col2, ... )`). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [HLL_ACCUMULATE](functions/hll_accumulate.md) | Returns the HyperLogLog state at the end of aggregation. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [HLL_COMBINE](functions/hll_combine.md) | Combines (merges) input states into a single output state. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [HLL_ESTIMATE](functions/hll_estimate.md) | Returns the cardinality estimate for the given HyperLogLog state. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [HLL_EXPORT](functions/hll_export.md) | Converts input in BINARY format to OBJECT format. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [HLL_IMPORT](functions/hll_import.md) | Converts input in OBJECT format to BINARY format. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [HOUR / MINUTE / SECOND](functions/hour-minute-second.md) | Extracts the corresponding time part from a time or timestamp value. | [Date & time functions](functions-date-time.md) |
| **I** |  |  |
| [ICEBERG_TABLE_FILES](functions/iceberg_table_files.md) | Returns information about the data files registered to an externally managed Apache Iceberg™ table at a specified point in time. | [Table functions](functions-table.md) |
| [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](functions/iceberg_table_snapshot_refresh_history.md) | Returns metadata and [snapshot](../user-guide/tables-iceberg.md) information about the most recent refresh history for a specified externally managed Apache Iceberg™ table. | [Table functions](functions-table.md) |
| [IFF](functions/iff.md) | Returns one of two values depending on whether a Boolean expression evaluates to true or false. | [Conditional expression functions](expressions-conditional.md) |
| [IFNULL](functions/ifnull.md) | If `expr1` is NULL, returns `expr2`, otherwise returns `expr1`. | [Conditional expression functions](expressions-conditional.md) |
| [[ NOT ] ILIKE](functions/ilike.md) | Performs a case-insensitive comparison to determine whether a string matches or does not match a specified pattern. | [String & binary functions](functions-string.md) |
| [ILIKE ANY](functions/ilike_any.md) | Performs a case-insensitive comparison to match a string against any of one or more specified patterns. | [String & binary functions](functions-string.md) |
| [[ NOT ] IN](functions/in.md) | Tests whether its argument is or is not one of the members of an explicit list or the result of a subquery. | [Conditional expression functions](expressions-conditional.md) |
| [INFER_SCHEMA](functions/infer_schema.md) | Automatically detects the file metadata schema in a set of staged data files that contain semi-structured data and retrieves the column definitions. | [Table functions](functions-table.md) |
| [INITCAP](functions/initcap.md) | Returns the input string with the first letter of each word in uppercase and the subsequent letters in lowercase. | [String & binary functions](functions-string.md) |
| [INSERT](functions/insert.md) | Replaces a substring of the specified length, starting at the specified position, with a new string or binary value. | [String & binary functions](functions-string.md) |
| [INTEGRATION](functions/integration.md) | Returns a JSON object that specifies the notification integration to use to send a message. | [Notification functions](functions-notification.md) |
| [INTERPOLATE_BFILL, INTERPOLATE_FFILL, INTERPOLATE_LINEAR](functions/interpolate_bfill.md) | Updates rows in a time-series data set to gap-fill missing values based on surrounding values. | [Window functions](functions-window.md) |
| [INVOKER_ROLE](functions/invoker_role.md) | Returns the name of the account-level role of the object executing the query or NULL if the name of the role is a database role. | [Context functions](functions-context.md) |
| [INVOKER_SHARE](functions/invoker_share.md) | Returns the name of the share that directly accessed the table or view where the INVOKER_SHARE function is invoked, otherwise the function returns NULL. | [Context functions](functions-context.md) |
| [IS [ NOT ] DISTINCT FROM](functions/is-distinct-from.md) | Compares whether two expressions are equal (or not equal). | [Conditional expression functions](expressions-conditional.md) |
| [IS [ NOT ] NULL](functions/is-null.md) | Determines whether an expression is NULL or is not NULL. | [Conditional expression functions](expressions-conditional.md) |
| [IS_<object_type>](functions/is.md) | This family of functions serves as Boolean predicates that can be used to determine the data type of a value stored in a VARIANT column. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)](functions/is_application_role_activated.md) | Returns the VARCHAR value `'TRUE'` if an application role is activated in the specified context. | [Context functions](functions-context.md) |
| [IS_APPLICATION_ROLE_IN_SESSION](functions/is_application_role_in_session.md) | Verifies whether the application role is activated in the consumer’s current session. | [Context functions](functions-context.md) |
| [IS_ARRAY](functions/is_array.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains an [ARRAY](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_BINARY](functions/is_binary.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [binary string](data-types-text.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_BOOLEAN](functions/is_boolean.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [BOOLEAN](data-types-logical.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_CHAR , IS_VARCHAR](functions/is_char-varchar.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [string value](data-types-text.md). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_CONFIGURATION_SET (SYS_CONTEXT function)](functions/is_configuration_set.md) | Returns the VARCHAR value `'TRUE'` if the specified configuration has a value set, that is, the configuration’s status is `DONE`. | [Context functions](functions-context.md) |
| [IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)](functions/is_database_role_activated.md) | Returns the VARCHAR value `'TRUE'` if a database role is activated in the current session. | [Context functions](functions-context.md) |
| [IS_DATABASE_ROLE_IN_SESSION](functions/is_database_role_in_session.md) | Verifies whether the database role is in the user’s active primary or secondary role hierarchy for the current session or if the specified column contains a database role that is in the user’s active primary or secondary role hierarchy for the current session. | [Context functions](functions-context.md) |
| [IS_DATE , IS_DATE_VALUE](functions/is_date-value.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [DATE](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_DECIMAL](functions/is_decimal.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [fixed-point number or integer](data-types-numeric.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_DOUBLE , IS_REAL](functions/is_double-real.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains a [floating-point number, fixed-point number, or integer](data-types-numeric.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_GRANTED_TO_INVOKER_ROLE](functions/is_granted_to_invoker_role.md) | Returns TRUE if the role returned by the INVOKER_ROLE function inherits the privileges of the specified role in the argument based on the context in which the function is called. | [Context functions](functions-context.md) |
| [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](functions/is_group_activated.md) | Returns the VARCHAR value `'TRUE'` if the role representing an [organization user group](../user-guide/organization-users.md) is activated in a given context. | [Context functions](functions-context.md) |
| [IS_GROUP_IMPORTED (SYS_CONTEXT function)](functions/is_group_imported.md) | Returns the VARCHAR value `'TRUE'` if the specified group is an [organization user group](../user-guide/organization-users.md) that was imported into the current account. | [Context functions](functions-context.md) |
| [IS_INSTANCE_ROLE_IN_SESSION](functions/is_instance_role_in_session.md) | Verifies whether the user’s active primary or secondary role hierarchy for the session inherits the specified instance role. | [Context functions](functions-context.md) |
| [IS_INTEGER](functions/is_integer.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains an [integer](data-types-numeric.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_NULL_VALUE](functions/is_null_value.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument is a [JSON null](../user-guide/semistructured-considerations.md) value. | [Conditional expression functions](expressions-conditional.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_OBJECT](functions/is_object.md) | Returns TRUE if its [VARIANT](data-types-semistructured.md) argument contains an [OBJECT](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_ORGANIZATION_USER](functions/is_organization_user.md) | Returns TRUE if the argument is a Snowflake user who is an [organization user](../user-guide/organization-users.md). | [Organization user and organization user group functions](functions-organization-users.md) |
| [IS_ORGANIZATION_USER_GROUP](functions/is_organization_user_group.md) | Returns TRUE if the specified [role](../user-guide/security-access-control-overview.md) was created when an administrator added an [organization user group](../user-guide/organization-users.md) to the account. | [Organization user and organization user group functions](functions-organization-users.md) |
| [IS_ORGANIZATION_USER_GROUP_IN_SESSION](functions/is_organization_user_group_in_session.md) | Assuming a role was imported from an [organization user group](../user-guide/organization-users.md), verifies whether the role is in the user’s active primary or secondary role hierarchy for the session. | [Context functions](functions-context.md) |
| [IS_ROLE_ACTIVATED (SYS_CONTEXT function)](functions/is_role_activated.md) | Returns the VARCHAR value `'TRUE'` if an account role is activated in the current session. | [Context functions](functions-context.md) |
| [IS_ROLE_IN_SESSION](functions/is_role_in_session.md) | Verifies whether the specified account role is in the currently active primary or secondary role hierarchy. | [Context functions](functions-context.md) |
| [IS_TIME](functions/is_time.md) | Verifies whether a [VARIANT](data-types-semistructured.md) argument contains a [TIME](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_TIMESTAMP_\*](functions/is_timestamp.md) | Verifies whether a [VARIANT](data-types-semistructured.md) argument contains the respective [timestamp](data-types-datetime.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [IS_USER_IMPORTED (SYS_CONTEXT function)](functions/is_user_imported.md) | Returns the VARCHAR value `'TRUE'` if the specified user is an [organization user](../user-guide/organization-users.md) that was imported into the current account. | [Context functions](functions-context.md) |
| **J** |  |  |
| [JAROWINKLER_SIMILARITY](functions/jarowinkler_similarity.md) | Computes the [Jaro-Winkler similarity](https://en.wikipedia.org/wiki/Jaro%E2%80%93Winkler_distance) between two input strings. | [String & binary functions](functions-string.md) |
| [JSON_EXTRACT_PATH_TEXT](functions/json_extract_path_text.md) | Parses the first argument as a JSON string and returns the value of the element pointed to by the path in the second argument. | [Semi-structured and structured data functions](functions-semistructured.md) |
| **K** |  |  |
| [KURTOSIS](functions/kurtosis.md) | Returns the sample excess kurtosis of non-NULL records. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| **L** |  |  |
| [LAG](functions/lag.md) | Accesses data in a previous row in the same result set without having to join the table to itself. | [Window function syntax and usage](functions-window-syntax.md) |
| [LAST_DAY](functions/last_day.md) | Returns the last day of the specified date part for a date or timestamp. | [Date & time functions](functions-date-time.md) |
| [LAST_QUERY_ID](functions/last_query_id.md) | Returns the ID of a specified query in the current session. | [Context functions](functions-context.md) |
| [LAST_SUCCESSFUL_SCHEDULED_TIME](functions/last_successful_scheduled_time.md) | Returns the timestamp representing the scheduled time for the most recent successful evaluation of the alert condition, where no errors occurred when executing the action. | [Date & time functions](functions-date-time.md) |
| [LAST_TRANSACTION](functions/last_transaction.md) | Returns the transaction ID of the last transaction that was either committed or rolled back in the current session. | [Context functions](functions-context.md) |
| [LAST_VALUE](functions/last_value.md) | Returns the last value within an ordered group of values. | [Window function syntax and usage](functions-window-syntax.md) |
| [LEAD](functions/lead.md) | Accesses data in a subsequent row in the same result set without having to join the table to itself. | [Window function syntax and usage](functions-window-syntax.md) |
| [LEAST](functions/least.md) | Returns the smallest value from a list of expressions. | [Conditional expression functions](expressions-conditional.md) |
| [LEAST_IGNORE_NULLS](functions/least_ignore_nulls.md) | Returns the smallest non-NULL value from a list of expressions. | [Conditional expression functions](expressions-conditional.md) |
| [LEFT](functions/left.md) | Returns a leftmost substring of its input. | [String & binary functions](functions-string.md) |
| [LENGTH, LEN](functions/length.md) | Returns the length of an input [string or binary](data-types-text.md) value. | [String & binary functions](functions-string.md) |
| [[ NOT ] LIKE](functions/like.md) | Performs a case-sensitive comparison to determine whether a string matches or does not match a specified pattern. | [String & binary functions](functions-string.md) |
| [LIKE ALL](functions/like_all.md) | Performs a case-sensitive comparison to match a string against all of one or more specified patterns. | [String & binary functions](functions-string.md) |
| [LIKE ANY](functions/like_any.md) | Performs a case-sensitive comparison to match a string against any of one or more specified patterns. | [String & binary functions](functions-string.md) |
| [LISTAGG](functions/listagg.md) | Returns the concatenated input values, separated by the `delimiter` string. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [LISTING_REFRESH_HISTORY](functions/listing_refresh_history.md) | Returns the past 14 days of refresh history for a cross-cloud auto-fulfillment listing. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [LN](functions/ln.md) | Returns the natural logarithm of a numeric expression. | [Numeric functions](functions-numeric.md) |
| [LOCALTIME](functions/localtime.md) | Returns the current time for the system. | [Context functions](functions-context.md) |
| [LOCALTIMESTAMP](functions/localtimestamp.md) | Returns the current timestamp for the system in the local time zone. | [Context functions](functions-context.md) |
| [LOG](functions/log.md) | Returns the logarithm of a numeric expression. | [Numeric functions](functions-numeric.md) |
| [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](functions/login_history.md) | The LOGIN_HISTORY family of table functions can be used to query login attempts by Snowflake users along various dimensions. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [LOWER](functions/lower.md) | Returns the input string with all characters converted to lowercase. | [String & binary functions](functions-string.md) |
| [LPAD](functions/lpad.md) | Left-pads a string with characters from another string, or left-pads a binary value with bytes from another binary value. | [String & binary functions](functions-string.md) |
| [LTRIM](functions/ltrim.md) | Removes leading characters, including whitespace, from a string. | [String & binary functions](functions-string.md) |
| **M** |  |  |
| [MAP_CAT](functions/map_cat.md) | Returns the concatenatation of two [MAP](data-types-structured.md) values. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_CONTAINS_KEY](functions/map_contains_key.md) | Determines whether the specified [MAP](data-types-structured.md) contains the specified key. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_DELETE](functions/map_delete.md) | Returns a [MAP](data-types-structured.md) based on an existing MAP with one or more keys removed. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_ENTRIES](functions/map_entries.md) | Returns an ARRAY value of key-value pair objects for each entry in a [MAP](data-types-structured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_INSERT](functions/map_insert.md) | Returns a new [MAP](data-types-structured.md) consisting of the input MAP with a new key-value pair inserted. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_KEYS](functions/map_keys.md) | Returns the keys in a [MAP](data-types-structured.md). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_PICK](functions/map_pick.md) | Returns a new [MAP](data-types-structured.md) containing the specified key-value pairs from an existing MAP. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MAP_SIZE](functions/map_size.md) | Returns the size of a [MAP](data-types-structured.md). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [MATERIALIZED_VIEW_REFRESH_HISTORY](functions/materialized_view_refresh_history.md) | This table function is used for querying the [materialized views](../user-guide/views-materialized.md) refresh history for a specified materialized view within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [MAX](functions/max.md) | Returns the maximum value for the records within `expr`. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [MAX_BY](functions/max_by.md) | Finds the row(s) containing the maximum value for a column and returns the value of another column in that row. | [Aggregate functions](functions-aggregation.md) |
| [MD5 , MD5_HEX](functions/md5.md) | Returns a 32-character hex-encoded string containing the 128-bit MD5 message digest. | [String & binary functions](functions-string.md) |
| [MD5_BINARY](functions/md5_binary.md) | Returns a 16-byte `BINARY` value containing the 128-bit MD5 message digest. | [String & binary functions](functions-string.md) |
| [MD5_NUMBER — Obsoleted](functions/md5_number.md) | Returns the 128-bit MD5 message digest interpreted as a signed 128-bit big endian number. | [String & binary functions](functions-string.md) |
| [MD5_NUMBER_LOWER64](functions/md5_number_lower64.md) | Calculates the 128-bit MD5 message digest, interprets it as a signed 128-bit big endian number, and returns the lower 64 bits of the number as an unsigned integer. | [String & binary functions](functions-string.md) |
| [MD5_NUMBER_UPPER64](functions/md5_number_upper64.md) | Calculates the 128-bit MD5 message digest, interprets it as a signed 128-bit big endian number, and returns the upper 64 bits of the number as an unsigned integer. | [String & binary functions](functions-string.md) |
| [MEDIAN](functions/median.md) | Determines the median of a set of values. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [MIN](functions/min.md) | Returns the minimum value for the records within `expr`. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [MIN_BY](functions/min_by.md) | Finds the row(s) containing the minimum value for a column and returns the value of another column in that row. | [Aggregate functions](functions-aggregation.md) |
| [MINHASH](functions/minhash.md) | Returns a MinHash state containing an array of size `k` constructed by applying `k` number of different hash functions to the input rows and keeping the minimum of each hash function. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [MINHASH_COMBINE](functions/minhash_combine.md) | Combines input MinHash states into a single MinHash output state. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window-syntax.md) |
| [MOD](functions/mod.md) | Returns the remainder of input `expr1` divided by input `expr2`. | [Numeric functions](functions-numeric.md) |
| [MODE](functions/mode.md) | Returns the most frequent value for the values within `expr1`. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [MODEL_MONITOR_DRIFT_METRIC](functions/model-monitor-drift-metric.md) | Gets drift metrics from a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md). | [Model monitor functions](functions-model-monitors.md) |
| [MODEL_MONITOR_PERFORMANCE_METRIC](functions/model-monitor-performance-metric.md) | Gets performance metrics from a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md). | [Model monitor functions](functions-model-monitors.md) |
| [MODEL_MONITOR_STAT_METRIC](functions/model-monitor-stat-metric.md) | Gets count metrics from a [model monitor](../developer-guide/snowflake-ml/model-registry/model-observability.md). | [Model monitor functions](functions-model-monitors.md) |
| [MONTHNAME](functions/monthname.md) | Returns the three-letter month name for the specified date or timestamp. | [Date & time functions](functions-date-time.md) |
| [MONTHS_BETWEEN](functions/months_between.md) | Returns the number of months between two DATE or TIMESTAMP values. | [Date & time functions](functions-date-time.md) |
| **N** |  |  |
| [NETWORK_RULE_REFERENCES](functions/network_rule_references.md) | Returns a row for each object with which the specified network rule is associated or returns a row for each network rule associated with the specified container. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [NEXT_DAY](functions/next_day.md) | Returns the date of the first specified day of week (DOW) that occurs after the input date. | [Date & time functions](functions-date-time.md) |
| [NORMAL](functions/normal.md) | Generates a normally-distributed pseudo-random floating point number with specified `mean` and `stddev` (standard deviation). | [Data generation functions](functions-data-generation.md) |
| [NOTIFICATION_HISTORY](functions/notification_history.md) | This table function can be used to query the history of notifications sent through Snowflake. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [NTH_VALUE](functions/nth_value.md) | Returns the nth value (up to 1000) within an ordered group of values. | [Window function syntax and usage](functions-window-syntax.md) |
| [NTILE](functions/ntile.md) | Divides an ordered data set equally into the number of buckets specified by `constant_value`. | [Window function syntax and usage](functions-window-syntax.md) |
| [NULLIF](functions/nullif.md) | Returns NULL if `expr1` is equal to `expr2`, otherwise returns `expr1`. | [Conditional expression functions](expressions-conditional.md) |
| [NULLIFZERO](functions/nullifzero.md) | Returns NULL if the argument evaluates to `0`; otherwise, returns the argument. | [Conditional expression functions](expressions-conditional.md) |
| [NVL](functions/nvl.md) | If `expr1` is NULL, returns `expr2`, otherwise returns `expr1`. | [Conditional expression functions](expressions-conditional.md) |
| [NVL2](functions/nvl2.md) | Returns values depending on whether the first input is NULL. | [Conditional expression functions](expressions-conditional.md) |
| **O** |  |  |
| [OBJECT_AGG](functions/object_agg.md) | Returns one OBJECT per group. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_CONSTRUCT](functions/object_construct.md) | Returns an [OBJECT](data-types-semistructured.md) constructed from the arguments. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_CONSTRUCT_KEEP_NULL](functions/object_construct_keep_null.md) | Returns an [OBJECT](data-types-semistructured.md) constructed from the arguments that retains key-values pairs with NULL values. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_DELETE](functions/object_delete.md) | Returns an object containing the contents of the input (that is, source) object with one or more keys removed. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_INSERT](functions/object_insert.md) | Returns an [OBJECT](data-types-semistructured.md) value consisting of the input OBJECT value with a new key-value pair inserted (or an existing key updated with a new value). | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_KEYS](functions/object_keys.md) | Returns an array containing the list of keys in the top-most level of the input object. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OBJECT_PICK](functions/object_pick.md) | Returns a new [OBJECT](data-types-semistructured.md) containing some of the key-value pairs from an existing object. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [OCTET_LENGTH](functions/octet_length.md) | Returns the length of a string or binary value in bytes. | [String & binary functions](functions-string.md) |
| [ONLINE_FEATURE_TABLE_REFRESH_HISTORY](functions/online-feature-table-refresh-history.md) | This table function returns information about each refresh (completed and running) of [online feature tables](sql/create-online-feature-table.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| **P** |  |  |
| [PARSE_DOCUMENT (SNOWFLAKE.CORTEX)](functions/parse_document-snowflake-cortex.md) | Returns the extracted content from a document on a Snowflake stage as a JSON-formatted string. | [File functions](functions-file.md) |
| [PARSE_IP](functions/parse_ip.md) | Returns a JSON object consisting of all the components from a valid INET (Internet Protocol) or CIDR (Classless Internet Domain Routing) IPv4 or IPv6 string. | [String & binary functions](functions-string.md) |
| [PARSE_JSON](functions/parse_json.md) | Interprets an input string as a JSON document, producing a [VARIANT](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [PARSE_URL](functions/parse_url.md) | Returns an [OBJECT](data-types-semistructured.md) value that consists of all the components (fragment, host, parameters, path, port, query, scheme) in a valid input URL/URI. | [String & binary functions](functions-string.md) |
| [PARSE_XML](functions/parse_xml.md) | Interprets an input string as an [XML](../user-guide/semistructured-data-formats.md) document, producing an [OBJECT](data-types-semistructured.md) value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [PERCENT_RANK](functions/percent_rank.md) | Returns the relative rank of a value within a group of values, specified as a percentage ranging from 0.0 to 1.0. | [Window functions](functions-window.md) |
| [PERCENTILE_CONT](functions/percentile_cont.md) | Return a percentile value based on a continuous distribution of the input column (specified in `order_by_expr`). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [PERCENTILE_DISC](functions/percentile_disc.md) | Returns a percentile value based on a discrete distribution of the input column (specified in `order_by_expr`). | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [PI](functions/pi.md) | Returns the value of pi as a floating-point value. | [Numeric functions](functions-numeric.md) |
| [PIPE_USAGE_HISTORY](functions/pipe_usage_history.md) | This table function can be used to query the history of data loaded into Snowflake tables using [Snowpipe](../user-guide/data-load-snowpipe-intro.md) within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [POLICY_CONTEXT](functions/policy_context.md) | Simulates the results of a query based upon the value of one or more context functions, which lets you determine how policies affect query results. | [Context functions](functions-context.md) |
| [POLICY_REFERENCES](functions/policy_references.md) | Returns a row for each object that has the specified policy assigned to the object or returns a row for each policy assigned to the specified object. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [POSITION](functions/position.md) | Searches for the first occurrence of the first argument in the second argument and, if successful, returns the position (1-based) of the first argument in the second argument. | [String & binary functions](functions-string.md) |
| [POW, POWER](functions/pow.md) | Returns a number (x) raised to the specified power (y). | [Numeric functions](functions-numeric.md) |
| [PREVIOUS_DAY](functions/previous_day.md) | Returns the date of the first specified day of week (DOW) that occurs before the input date. | [Date & time functions](functions-date-time.md) |
| [PROMPT](functions/prompt.md) | The PROMPT function constructs a structured OBJECT containing a template string and a list of arguments. | [Semi-structured and structured data functions](functions-semistructured.md) |
| **Q** |  |  |
| [QUERY_ACCELERATION_HISTORY](functions/query_acceleration_history.md) | The QUERY_ACCELERATION_HISTORY function is used for querying the [query acceleration service](../user-guide/query-acceleration-service.md) history within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [QUERY_HISTORY , QUERY_HISTORY_BY_\*](functions/query_history.md) | You can use the QUERY_HISTORY family of table functions to query Snowflake query history along various dimensions. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| **R** |  |  |
| [RADIANS](functions/radians.md) | Converts degrees to radians. | [Numeric functions](functions-numeric.md) |
| [RANDOM](functions/random.md) | Each call returns a pseudo-random 64-bit integer. | [Data generation functions](functions-data-generation.md) |
| [RANDSTR](functions/randstr.md) | Returns a random string of specified `length`. | [Data generation functions](functions-data-generation.md) |
| [RANK](functions/rank.md) | Returns the rank of a value within an ordered group of values. | [Window functions](functions-window.md) |
| [RATIO_TO_REPORT](functions/ratio_to_report.md) | Returns the ratio of a value within a group to the sum of the values within the group. | [Window functions](functions-window.md) |
| [REDUCE](functions/reduce.md) | Reduces an [array](data-types-semistructured.md) to a single value based on the logic in a lambda expression. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [[ NOT ] REGEXP](functions/regexp.md) | Performs a comparison to determine whether a string matches or does not match a specified pattern. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_COUNT](functions/regexp_count.md) | Returns the number of times that a [pattern](functions-regexp.md) occurs in a string. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_INSTR](functions/regexp_instr.md) | Returns the position of the specified occurrence of the regular expression pattern in the string subject. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_LIKE](functions/regexp_like.md) | Performs a comparison to determine whether a string matches a specified pattern. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_REPLACE](functions/regexp_replace.md) | Returns the subject with the specified pattern — or all occurrences of the pattern — either removed or replaced by a replacement string. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_SUBSTR](functions/regexp_substr.md) | Returns the substring that matches a [regular expression](functions-regexp.md) within a string. | [String functions (regular expressions)](functions-regexp.md) |
| [REGEXP_SUBSTR_ALL](functions/regexp_substr_all.md) | Returns an [ARRAY](data-types-semistructured.md) that contains all substrings that match a [regular expression](functions-regexp.md) within a string. | [String functions (regular expressions)](functions-regexp.md) |
| [REGR_AVGX](functions/regr_avgx.md) | Returns the average of the independent variable for non-null pairs in a group, where `x` is the independent variable and `y` is the dependent variable. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [REGR_AVGY](functions/regr_avgy.md) | Returns the average of the dependent variable for non-null pairs in a group, where `x` is the independent variable and `y` is the dependent variable. | [Aggregate functions](functions-aggregation.md) , [Window functions](functions-window.md) |
| [REGR_COUNT](functions/regr_count.md) | Returns the number of non-null number pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_INTERCEPT](functions/regr_intercept.md) | Returns the intercept of the univariate linear regression line for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_R2](functions/regr_r2.md) | Returns the coefficient of determination for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_SLOPE](functions/regr_slope.md) | Returns the slope of the linear regression line for non-null pairs in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_SXX](functions/regr_sxx.md) | Returns REGR_COUNT(y, x) \* VAR_POP(x) for non-null pairs. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_SXY](functions/regr_sxy.md) | Returns REGR_COUNT(expr1, expr2) \* COVAR_POP(expr1, expr2) for non-null pairs. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_SYY](functions/regr_syy.md) | Returns REGR_COUNT(y, x) \* VAR_POP(y) for non-null pairs. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [REGR_VALX](functions/regr_valx.md) | Returns NULL if the first argument is NULL; otherwise, returns the second argument. | [Conditional expression functions](expressions-conditional.md) |
| [REGR_VALY](functions/regr_valy.md) | Returns NULL if the second argument is NULL; otherwise, returns the first argument. | [Conditional expression functions](expressions-conditional.md) |
| [REPEAT](functions/repeat.md) | Builds a string by repeating the input for the specified number of times. | [String & binary functions](functions-string.md) |
| [REPLACE](functions/replace.md) | Removes all occurrences of a specified substring, and optionally replaces them with another substring. | [String & binary functions](functions-string.md) |
| [REPLICATION_GROUP_DANGLING_REFERENCES](functions/replication_group_dangling_references.md) | Detects cases where an object that’s referenced in a replication group or failover group isn’t actually replicated to the secondary account. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](functions/replication_group_refresh_history.md) | You can use the REPLICATION_GROUP_REFRESH_HISTORY family of table functions to query the replication history for one secondary replication or failover group, or all such groups, within the last 14 days. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL](functions/replication_group_refresh_progress.md) | You can use the REPLICATION_GROUP_REFRESH_PROGRESS family of table functions to query the status of refresh operations for replication or failover groups. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [REPLICATION_GROUP_USAGE_HISTORY](functions/replication_group_usage_history.md) | Returns the replication usage history for secondary replication or failover groups within the last 14 days. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [REPLICATION_USAGE_HISTORY](functions/replication_usage_history.md) | This table function can be used to query the replication history for a specified database within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [REST_EVENT_HISTORY](functions/rest_event_history.md) | Returns a list of SCIM REST API requests made to Snowflake over a specified time interval. | [Table functions](functions-table.md) |
| [RESULT_SCAN](functions/result_scan.md) | Returns the result set of a previous command (within 24 hours of when you ran the query) as if the result was a table. | [Table functions](functions-table.md) |
| [REVERSE](functions/reverse.md) | Reverses the order of characters in a string, or of bytes in a binary value. | [String & binary functions](functions-string.md) |
| [RIGHT](functions/right.md) | Returns a rightmost substring of its input. | [String & binary functions](functions-string.md) |
| [[ NOT ] RLIKE](functions/rlike.md) | Performs a comparison to determine whether a string matches or does not match a specified pattern. | [String functions (regular expressions)](functions-regexp.md) |
| [ROUND](functions/round.md) | Returns rounded values for `input_expr`. | [Numeric functions](functions-numeric.md) |
| [ROW_NUMBER](functions/row_number.md) | Returns a unique row number for each row within a window partition. | [Window function syntax and usage](functions-window-syntax.md) |
| [RPAD](functions/rpad.md) | Right-pads a string with characters from another string, or right-pads a binary value with bytes from another binary value. | [String & binary functions](functions-string.md) |
| [RTRIM](functions/rtrim.md) | Removes trailing characters, including whitespace, from a string. | [String & binary functions](functions-string.md) |
| [RTRIMMED_LENGTH](functions/rtrimmed_length.md) | Returns the length of its argument, minus trailing whitespace, but including leading whitespace. | [String & binary functions](functions-string.md) |
| **S** |  |  |
| [SANITIZE_WEBHOOK_CONTENT](functions/sanitize_webhook_content.md) | Removes placeholders (for example, the SNOWFLAKE_WEBHOOK_SECRET placeholder, which specifies a secret) from the body of a notification message to be sent. | [Notification functions](functions-notification.md) |
| [SCHEDULED_TIME](functions/scheduled_time.md) | Returns the timestamp representing the scheduled time of the current alert. | [Date & time functions](functions-date-time.md) |
| [SEARCH](functions/search.md) | Searches character data (text) in specified columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. | [String & binary functions](functions-string.md) |
| [SEARCH_IP](functions/search_ip.md) | Searches for valid IPv4 and IPv6 addresses in specified character-string columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. | [String & binary functions](functions-string.md) |
| [SEARCH_OPTIMIZATION_HISTORY](functions/search_optimization_history.md) | This table function is used for querying the [search optimization service](../user-guide/search-optimization-service.md) maintenance history for a specified table within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [SEARCH_PREVIEW (SNOWFLAKE.CORTEX)](functions/search_preview-snowflake-cortex.md) | Given a Cortex Search service name, and a query, returns a response from the specified service. | [String & binary functions](functions-string.md) |
| [SENTIMENT (SNOWFLAKE.CORTEX)](functions/sentiment-snowflake-cortex.md) | Returns an overall sentiment score for the given English-language input text. | [String & binary functions](functions-string.md) |
| [SEQ1 / SEQ2 / SEQ4 / SEQ8](functions/seq1.md) | Returns a sequence of monotonically increasing integers, with wrap-around. | [Data generation functions](functions-data-generation.md) |
| [SERVERLESS_ALERT_HISTORY](functions/serverless_alert_history.md) | This table function is used for querying the [serverless alert](../user-guide/alerts.md) usage history. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [SERVERLESS_TASK_HISTORY](functions/serverless_task_history.md) | This table function is used for querying the [serverless task](../user-guide/tasks-intro.md) usage history. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [SET_SYS_CONTEXT](functions/set_sys_context.md) | Sets a value for a specified key in a specified namespace. | [Context functions](functions-context.md) |
| [SHA1 , SHA1_HEX](functions/sha1.md) | Returns a 40-character hex-encoded string containing the 160-bit SHA-1 message digest. | [String & binary functions](functions-string.md) |
| [SHA1_BINARY](functions/sha1_binary.md) | Returns a 20-byte binary containing the 160-bit SHA-1 message digest. | [String & binary functions](functions-string.md) |
| [SHA2 , SHA2_HEX](functions/sha2.md) | Returns a hex-encoded string containing the N-bit SHA-2 message digest, where N is the specified output digest size. | [String & binary functions](functions-string.md) |
| [SHA2_BINARY](functions/sha2_binary.md) | Returns a binary containing the N-bit SHA-2 message digest, where N is the specified output digest size. | [String & binary functions](functions-string.md) |
| [SHOW_PYTHON_PACKAGES_DEPENDENCIES](functions/show_python_packages_dependencies.md) | Returns a list of the dependencies and their versions for the Python packages that were specified. | [System functions](functions-system.md) |
| [SIGN](functions/sign.md) | Returns the sign of its argument. | [Numeric functions](functions-numeric.md) |
| [SIN](functions/sin.md) | Computes the sine of its argument; the argument should be expressed in radians. | [Numeric functions](functions-numeric.md) |
| [SINH](functions/sinh.md) | Computes the hyperbolic sine of its argument. | [Numeric functions](functions-numeric.md) |
| [SKEW](functions/skew.md) | Returns the sample skewness of non-NULL records. | [Aggregate functions](functions-aggregation.md) |
| [SOUNDEX](functions/soundex.md) | Returns a string that contains a phonetic representation of the input string. | [String & binary functions](functions-string.md) |
| [SOUNDEX_P123](functions/soundex_p123.md) | Returns a string that contains a phonetic representation of the input string, and retains the Soundex code number for the second letter when the first and second letters use the same number. | [String & binary functions](functions-string.md) |
| [SPACE](functions/space.md) | Builds a string consisting of the specified number of blank spaces. | [String & binary functions](functions-string.md) |
| [<service_name>!SPCS_CANCEL_JOB](functions/spcs_cancel_job.md) | Cancels a [Snowpark Container Services job](../developer-guide/snowpark-container-services/working-with-services.md); also referred to as job service. | [Table functions](functions-table.md) |
| [<service_name>!SPCS_GET_EVENTS](functions/spcs_get_events.md) | Returns the events that Snowflake collected for the specified service. | [Table functions](functions-table.md) |
| [<service_name>!SPCS_GET_LOGS](functions/spcs_get_logs.md) | Returns the logs that Snowflake collected from containers of the specified service. | [Table functions](functions-table.md) |
| [<service_name>!SPCS_GET_METRICS](functions/spcs_get_metrics.md) | Returns the metrics that Snowflake collected for the specified service. | [Table functions](functions-table.md) |
| [<service_name>!SPCS_WAIT_FOR](functions/spcs_wait_for.md) | Waits for the [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) to reach the specified state, with a timeout. | [Snowpark Container Services functions](functions-spcs.md) |
| [SPLIT](functions/split.md) | Splits a given string with a given separator and returns the result in an array of strings. | [String & binary functions](functions-string.md) |
| [SPLIT_PART](functions/split_part.md) | Splits a given string at a specified character and returns the requested part. | [String & binary functions](functions-string.md) |
| [SPLIT_TEXT_MARKDOWN_HEADER (SNOWFLAKE.CORTEX)](functions/split_text_markdown_header-snowflake-cortex.md) | The SPLIT_TEXT_MARKDOWN_HEADER function splits a Markdown-formatted document into structured text chunks based on header levels. | [String & binary functions](functions-string.md) |
| [SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)](functions/split_text_recursive_character-snowflake-cortex.md) | The SPLIT_TEXT_RECURSIVE_CHARACTER function splits a string into shorter stings, recursively, for preprocessing text to be used with text embedding or search indexing functions. | [String & binary functions](functions-string.md) |
| [SPLIT_TO_TABLE](functions/split_to_table.md) | This table function splits a string (based on a specified delimiter) and flattens the results into rows. | [String & binary functions](functions-string.md) , [Table functions](functions-table.md) |
| [SQRT](functions/sqrt.md) | Returns the square-root of a non-negative numeric expression. | [Numeric functions](functions-numeric.md) |
| [SQUARE](functions/square.md) | Returns the square of a numeric expression (i.e. a numeric expression multiplied by itself). | [Numeric functions](functions-numeric.md) |
| [ST_AREA](functions/st_area.md) | Returns the area of the Polygon(s) in a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_ASEWKB](functions/st_asewkb.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the binary representation of that value in [EWKB (extended well-known binary)](data-types-geospatial.md) format. | [Geospatial functions](functions-geospatial.md) |
| [ST_ASEWKT](functions/st_asewkt.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the text (VARCHAR) representation of that value in [EWKT (extended well-known text)](data-types-geospatial.md) format. | [Geospatial functions](functions-geospatial.md) |
| [ST_ASGEOJSON](functions/st_asgeojson.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the [GeoJSON](data-types-geospatial.md) representation of that value. | [Geospatial functions](functions-geospatial.md) |
| [ST_ASWKB , ST_ASBINARY](functions/st_aswkb.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the binary representation of that value in [WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) format. | [Geospatial functions](functions-geospatial.md) |
| [ST_ASWKT , ST_ASTEXT](functions/st_aswkt.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the text (VARCHAR) representation of that value in [WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) format. | [Geospatial functions](functions-geospatial.md) |
| [ST_AZIMUTH](functions/st_azimuth.md) | Given a Point that represents the origin (the location of the observer) and a specified Point, returns the azimuth in radians. | [Geospatial functions](functions-geospatial.md) |
| [ST_BUFFER](functions/st_buffer.md) | Returns a [GEOMETRY](data-types-geospatial.md) object that represents a MultiPolygon containing the points within a specified distance of the input GEOMETRY object. | [Geospatial functions](functions-geospatial.md) |
| [ST_CENTROID](functions/st_centroid.md) | Returns the Point representing the geometric center of a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_COLLECT](functions/st_collect.md) | There are two forms of ST_COLLECT. | [Geospatial functions](functions-geospatial.md) |
| [ST_CONTAINS](functions/st_contains.md) | Returns TRUE if a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object is completely inside another object of the same type. | [Geospatial functions](functions-geospatial.md) |
| [ST_COVEREDBY](functions/st_coveredby.md) | Returns TRUE if no point in one geospatial object is outside another geospatial object. | [Geospatial functions](functions-geospatial.md) |
| [ST_COVERS](functions/st_covers.md) | Returns TRUE if no point in one geospatial object is outside of another geospatial object. | [Geospatial functions](functions-geospatial.md) |
| [ST_DIFFERENCE](functions/st_difference.md) | Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the points in the first object that are not in the second object (i.e. the difference between the two objects). | [Geospatial functions](functions-geospatial.md) |
| [ST_DIMENSION](functions/st_dimension.md) | Given a value of type [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md), return the “dimension” of the value. | [Geospatial functions](functions-geospatial.md) |
| [ST_DISJOINT](functions/st_disjoint.md) | Returns TRUE if the two [GEOGRAPHY](data-types-geospatial.md) objects or the two [GEOMETRY](data-types-geospatial.md) objects are disjoint (i.e. do not share any portion of space). | [Geospatial functions](functions-geospatial.md) |
| [ST_DISTANCE](functions/st_distance.md) | Returns the minimum great circle distance between two [GEOGRAPHY](data-types-geospatial.md) or the minimum Euclidean distance between two [GEOMETRY](data-types-geospatial.md) objects. | [Geospatial functions](functions-geospatial.md) |
| [ST_DWITHIN](functions/st_dwithin.md) | Returns TRUE if the minimum great circle distance between two points (two [GEOGRAPHY](data-types-geospatial.md) objects) is within the specified distance. | [Geospatial functions](functions-geospatial.md) |
| [ST_ENDPOINT](functions/st_endpoint.md) | Returns the last Point in a LineString. | [Geospatial functions](functions-geospatial.md) |
| [ST_ENVELOPE](functions/st_envelope.md) | Returns the minimum bounding box (a rectangular “envelope”) that encloses a specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_GEOGFROMGEOHASH](functions/st_geogfromgeohash.md) | Returns a [GEOGRAPHY](data-types-geospatial.md) object for the polygon that represents the boundaries of a [geohash](functions/st_geohash.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOGPOINTFROMGEOHASH](functions/st_geogpointfromgeohash.md) | Returns a [GEOGRAPHY](data-types-geospatial.md) object for the Point that represents the center of a [geohash](functions/st_geohash.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOGRAPHYFROMWKB](functions/st_geographyfromwkb.md) | Parses a [WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) or [EWKB (extended well-known binary)](data-types-geospatial.md) input and returns a value of type [GEOGRAPHY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOGRAPHYFROMWKT](functions/st_geographyfromwkt.md) | Parses a [WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) or [EWKT (extended well-known text)](data-types-geospatial.md) input and returns a value of type [GEOGRAPHY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOHASH](functions/st_geohash.md) | Returns the [geohash](https://en.wikipedia.org/wiki/Geohash) for a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_GEOMETRYFROMWKB](functions/st_geometryfromwkb.md) | Parses a [WKB (well-known binary)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary) or EWKB (extended well-known binary) input and returns a value of type [GEOMETRY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOMETRYFROMWKT](functions/st_geometryfromwkt.md) | Parses a [WKT (well-known text)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry) or EWKT (extended well-known text) input and returns a value of type [GEOMETRY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [ST_GEOMFROMGEOHASH](functions/st_geomfromgeohash.md) | Returns a [GEOMETRY](data-types-geospatial.md) object for the polygon that represents the boundaries of a [geohash](https://en.wikipedia.org/wiki/Geohash). | [Geospatial functions](functions-geospatial.md) |
| [ST_GEOMPOINTFROMGEOHASH](functions/st_geompointfromgeohash.md) | Returns a [GEOMETRY](data-types-geospatial.md) object for the point that represents center of a [geohash](https://en.wikipedia.org/wiki/Geohash). | [Geospatial functions](functions-geospatial.md) |
| [ST_HAUSDORFFDISTANCE](functions/st_hausdorffdistance.md) | Returns the discrete [Hausdorff distance](https://en.wikipedia.org/wiki/Hausdorff_distance) between two [GEOGRAPHY](data-types-geospatial.md) objects. | [Geospatial functions](functions-geospatial.md) |
| [ST_INTERPOLATE](functions/st_interpolate.md) | Given an input [GEOGRAPHY](data-types-geospatial.md) object, returns an interpolated object that is within a specified tolerance. | [Geospatial functions](functions-geospatial.md) |
| [ST_INTERSECTION](functions/st_intersection.md) | Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the shape containing the set of points that are common to both input objects (i.e. the intersection of the two objects). | [Geospatial functions](functions-geospatial.md) |
| [ST_INTERSECTION_AGG](functions/st_intersection_agg.md) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the shape containing the combined set of points that are common to the shapes represented by the objects in the column (that is, the intersection of the shapes). | [Geospatial functions](functions-geospatial.md) |
| [ST_INTERSECTS](functions/st_intersects.md) | Returns TRUE if the two [GEOGRAPHY](data-types-geospatial.md) objects or the two [GEOMETRY](data-types-geospatial.md) objects intersect (i.e. share any portion of space). | [Geospatial functions](functions-geospatial.md) |
| [ST_ISVALID](functions/st_isvalid.md) | Returns TRUE if the specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object represents a [valid shape](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) |
| [ST_LENGTH](functions/st_length.md) | Returns the great circle length of the LineString(s) in a [GEOGRAPHY](data-types-geospatial.md) object or the Euclidean length of the LineString(s) in a [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_MAKEGEOMPOINT , ST_GEOMPOINT](functions/st_makegeompoint.md) | Constructs a [GEOMETRY](data-types-geospatial.md) object that represents a Point with the specified longitude and latitude. | [Geospatial functions](functions-geospatial.md) |
| [ST_MAKELINE](functions/st_makeline.md) | Constructs a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object that represents a line connecting the points in the input objects. | [Geospatial functions](functions-geospatial.md) |
| [ST_MAKEPOINT , ST_POINT](functions/st_makepoint.md) | Constructs a [GEOGRAPHY](data-types-geospatial.md) object that represents a point with the specified longitude and latitude. | [Geospatial functions](functions-geospatial.md) |
| [ST_MAKEPOLYGON , ST_POLYGON](functions/st_makepolygon.md) | Constructs a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object that represents a Polygon without holes. | [Geospatial functions](functions-geospatial.md) |
| [ST_MAKEPOLYGONORIENTED](functions/st_makepolygonoriented.md) | Constructs a [GEOGRAPHY](data-types-geospatial.md) object that represents a Polygon without holes. | [Geospatial functions](functions-geospatial.md) |
| [ST_NPOINTS , ST_NUMPOINTS](functions/st_npoints.md) | Returns the number of points in a [GEOGRAPHY](data-types-geospatial.md) or [GEOGRAPHY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_PERIMETER](functions/st_perimeter.md) | Returns the length of the perimeter of the polygon(s) in a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_POINTN](functions/st_pointn.md) | Returns a Point at a specified index in a LineString. | [Geospatial functions](functions-geospatial.md) |
| [ST_SETSRID](functions/st_setsrid.md) | Returns a [GEOMETRY](data-types-geospatial.md) object that has its SRID (spatial reference system identifier) set to the specified value. | [Geospatial functions](functions-geospatial.md) |
| [ST_SIMPLIFY](functions/st_simplify.md) | Given an input [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object that represents a Line or Polygon, returns a simpler approximation of the object. | [Geospatial functions](functions-geospatial.md) |
| [ST_SRID](functions/st_srid.md) | Returns the SRID (spatial reference system identifier) of a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_STARTPOINT](functions/st_startpoint.md) | Returns the first Point in a LineString. | [Geospatial functions](functions-geospatial.md) |
| [ST_SYMDIFFERENCE](functions/st_symdifference.md) | Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the set of points from both input objects that are not part of the intersection of the objects (i.e. the [symmetric difference](https://en.wikipedia.org/wiki/Symmetric_difference) of the two objects). | [Geospatial functions](functions-geospatial.md) |
| [ST_TRANSFORM](functions/st_transform.md) | Converts a [GEOMETRY](data-types-geospatial.md) object from one [spatial reference system (SRS)](https://en.wikipedia.org/wiki/Spatial_reference_system) to another. | [Geospatial functions](functions-geospatial.md) |
| [ST_UNION](functions/st_union.md) | Given two input GEOGRAPHY objects, returns a GEOGRAPHY object that represents the combined set of shapes for both objects (i.e. the union of the two shapes). | [Geospatial functions](functions-geospatial.md) |
| [ST_UNION_AGG](functions/st_union_agg.md) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the combined set of points that are in at least one of the shapes represented by the objects in the column (that is, the union of the shapes). | [Geospatial functions](functions-geospatial.md) |
| [ST_WITHIN](functions/st_within.md) | Returns true if the first geospatial object is fully contained by the second geospatial object. | [Geospatial functions](functions-geospatial.md) |
| [ST_X](functions/st_x.md) | Returns the longitude (X coordinate) of a Point represented by a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_XMAX](functions/st_xmax.md) | Returns the maximum longitude (X coordinate) of all points contained in the specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_XMIN](functions/st_xmin.md) | Returns the minimum longitude (X coordinate) of all points contained in the specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_Y](functions/st_y.md) | Returns the latitude (Y coordinate) of a Point represented by a [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_YMAX](functions/st_ymax.md) | Returns the maximum latitude (Y coordinate) of all points contained in the specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [ST_YMIN](functions/st_ymin.md) | Returns the minimum latitude (Y coordinate) of all points contained in the specified [GEOGRAPHY](data-types-geospatial.md) or [GEOMETRY](data-types-geospatial.md) object. | [Geospatial functions](functions-geospatial.md) |
| [STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY](functions/stage_directory_file_registration_history.md) | This table function can be used to query information about the metadata history for a directory table. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [STAGE_STORAGE_USAGE_HISTORY](functions/stage_storage_usage_history.md) | This table function can be used to query the average daily data storage usage, in bytes, for all the Snowflake stages in your account within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [STARTSWITH](functions/startswith.md) | Returns true if `expr1` starts with `expr2`. | [String & binary functions](functions-string.md) |
| [STDDEV, STDDEV_SAMP](functions/stddev.md) | Returns the sample standard deviation (square root of sample variance) of non-NULL values. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [STDDEV_POP](functions/stddev_pop.md) | Returns the population standard deviation (square root of variance) of non-NULL values. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [STORAGE_LIFECYCLE_POLICY_HISTORY](functions/storage_lifecycle_policy_history.md) | Returns execution history for [storage lifecycle policies](../user-guide/storage-management/storage-lifecycle-policies.md) in your account within the last 14 days. | [Table functions](functions-table.md) |
| [STRIP_NULL_VALUE](functions/strip_null_value.md) | Converts a [JSON null](../user-guide/semistructured-considerations.md) value to a SQL NULL value. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [STRTOK](functions/strtok.md) | Tokenizes a given string and returns the requested part. | [String & binary functions](functions-string.md) |
| [STRTOK_SPLIT_TO_TABLE](functions/strtok_split_to_table.md) | Tokenizes a string with the given set of delimiters and flattens the results into rows. | [String & binary functions](functions-string.md) , [Table functions](functions-table.md) |
| [STRTOK_TO_ARRAY](functions/strtok_to_array.md) | Tokenizes the given string using the given set of delimiters and returns the tokens as an [ARRAY](data-types-semistructured.md) value. | [String & binary functions](functions-string.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [SUBSTR , SUBSTRING](functions/substr.md) | Returns the portion of the [string or binary](data-types-text.md) value from `base_expr`, starting from the character/byte specified by `start_expr`, with optionally limited length. | [String & binary functions](functions-string.md) |
| [SUM](functions/sum.md) | Returns the sum of non-NULL records for `expr`. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [SUMMARIZE (SNOWFLAKE.CORTEX)](functions/summarize-snowflake-cortex.md) | Summarizes the given English-language input text. | [String & binary functions](functions-string.md) |
| [SYS_CONTEXT](functions/sys_context.md) | Returns information about the context in which the function is called. | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$APPLICATION namespace)](functions/sys_context_snowflake_application.md) | Returns information about the context in which a statement is executed within a [Snowflake Native App](../developer-guide/native-apps/native-apps-about.md). | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$ENVIRONMENT namespace)](functions/sys_context_snowflake_environment.md) | Returns information about the environment (the client, current account, and current region) in which the function is called. | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](functions/sys_context_snowflake_organization.md) | Returns information about the current organization. | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](functions/sys_context_snowflake_organization_session.md) | Returns information about the session in which the function is called and the current organization user. | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$SESSION namespace)](functions/sys_context_snowflake_session.md) | Returns information about the session in which the function is called. | [Context functions](functions-context.md) |
| [SYS_CONTEXT (SNOWFLAKE$SESSION_ATTRIBUTES namespace)](functions/sys_context_snowflake_session_attributes.md) | Returns a custom session attribute set using SET_SYS_CONTEXT. | [Context functions](functions-context.md) |
| [SYSDATE](functions/sysdate.md) | Returns the current timestamp for the system in the UTC time zone. | [Context functions](functions-context.md) |
| [SYSTEM$ABORT_SESSION](functions/system_abort_session.md) | Aborts the specified session. | [System functions](functions-system.md) |
| [SYSTEM$ABORT_TRANSACTION](functions/system_abort_transaction.md) | Aborts the specified transaction, if it is running. | [System functions](functions-system.md) |
| [SYSTEM$ACTIVATE_CMK_INFO](functions/system_activate_cmk_info.md) | Activates Tri-Secret Secure in your account, optionally with private connectivity, by using the customer-managed key (CMK) information that you registered for your account. | [System functions](functions-system.md) |
| [SYSTEM$ACTIVATE_CMK_INFO_POSTGRES](functions/system_activate_cmk_info_postgres.md) | Activates Snowflake Postgres Tri-Secret Secure in your account by using the CMK (customer-managed key) information that you registered for your account. | [System functions](functions-system.md) |
| [SYSTEM$ADD_EVENT (for Snowflake Scripting)](functions/system_add_event.md) | Add an event for trace. | [System functions](functions-system.md) |
| [SYSTEM$ADD_REFERENCE](functions/system_add_reference.md) | Called by a Snowflake Native App to associate a consumer reference string to a reference definition. | [System functions](functions-system.md) |
| [SYSTEM$ALLOWLIST](functions/system_allowlist.md) | Returns host names and port numbers to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall. | [System functions](functions-system.md) |
| [SYSTEM$ALLOWLIST_PRIVATELINK](functions/system_allowlist_privatelink.md) | Returns host names and port numbers for [AWS PrivateLink](https://aws.amazon.com/privatelink/), [Azure Private Link](https://azure.microsoft.com/en-us/services/private-link/), and [Google Cloud Private Service Connect](https://cloud.google.com/vpc/docs/configure-private-service-connect-services) deployments to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall. | [System functions](functions-system.md) |
| [SYSTEM$APP_COMPATIBILITY_CHECK](functions/system_app_compatibility_check.md) | Returns the [Snowflake edition](../user-guide/intro-editions.md) of the consumer account where an app is installed. | [System functions](functions-system.md) |
| [SYSTEM$APPLICATION_GET_LOG_LEVEL](functions/system_application_get_log_level.md) | Returns the log level for the specified object. | [System functions](functions-system.md) |
| [SYSTEM$APPLICATION_GET_METRIC_LEVEL](functions/system_application_get_metric_level.md) | Returns the metric level for the specified object. | [System functions](functions-system.md) |
| [SYSTEM$APPLICATION_GET_TRACE_LEVEL](functions/system_application_get_trace_level.md) | Returns the trace level for the specified object. | [System functions](functions-system.md) |
| [SYSTEM$AUTHORIZE_PRIVATELINK](functions/system_authorize_privatelink.md) | Enables private connectivity to the Snowflake service for the current account. | [System functions](functions-system.md) |
| [SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS](functions/system_authorize_stage_privatelink_access.md) | Authorizes Snowflake to access the private endpoint for [Azure private endpoints for internal stages](../user-guide/private-internal-stages-azure.md) and [Google Private Service Connect endpoints for internal stages](../user-guide/private-internal-stages-gcp.md) for the current account. | [System functions](functions-system.md) |
| [SYSTEM$AUTO_REFRESH_STATUS](functions/system_auto_refresh_status.md) | Returns the automated refresh status for an externally managed [Iceberg table](../user-guide/tables-iceberg.md). | [System functions](functions-system.md) |
| [SYSTEM$BEGIN_DEBUG_APPLICATION](functions/system_begin_debug_application.md) | Enables [session debug mode](../developer-guide/native-apps/installing-testing-application.md) for a Snowflake Native App. | [System functions](functions-system.md) |
| [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](functions/system_behavior_change_bundle_status.md) | Returns the status of the specified [behavior change release bundle](../release-notes/behavior-change-policy.md) for the current account. | [System functions](functions-system.md) |
| [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](functions/system_block_internal_stages_public_access.md) | Prevents all public traffic from accessing the internal stage of the current Snowflake account on Microsoft Azure. | [System functions](functions-system.md) |
| [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION](functions/system_block_internal_stages_public_access_with_exception.md) | Prevents public traffic from accessing the internal stage of the current Snowflake account on Microsoft Azure, while allowing access from specified IP addresses or CIDR blocks. | [System functions](functions-system.md) |
| [SYSTEM$CANCEL_ALL_QUERIES](functions/system_cancel_all_queries.md) | Cancels all active/running queries in the specified session. | [System functions](functions-system.md) |
| [SYSTEM$CANCEL_QUERY](functions/system_cancel_query.md) | Cancels the specified query (or statement) if it is currently active/running. | [System functions](functions-system.md) |
| [SYSTEM$CATALOG_LINK_STATUS](functions/system_catalog_link_status.md) | Returns the link status for a specified [catalog-linked database](../user-guide/tables-iceberg-catalog-linked-database.md). | [System functions](functions-system.md) |
| [SYSTEM$CKE_HASH_FUNCTION](functions/system_cke_hash_function.md) | Analyzes [Cortex Knowledge Extensions (CKE)](../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md) usage by mapping `hashedDocumentIds` back to your original document primary keys in the Cortex Search Service. | [System functions](functions-system.md) |
| [SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS](functions/system_cleanup_database_role_grants.md) | Revokes privileges on dropped objects from the share and grants the database role to the share. | [System functions](functions-system.md) |
| [SYSTEM$CLIENT_VERSION_INFO](functions/system_client_version_info.md) | Returns version information for Snowflake clients and drivers. | [System functions](functions-system.md) |
| [SYSTEM$CLIENT_VULNERABILITY_INFO](functions/system_client_vulnerability_info.md) | Returns details about common vulnerabilities and exposures (CVE) fixes and related vulnerabilities for Snowflake clients and drivers. | [System functions](functions-system.md) |
| [SYSTEM$CLUSTERING_DEPTH](functions/system_clustering_depth.md) | Computes the average depth of the table according to the specified columns (or the clustering key defined for the table). | [System functions](functions-system.md) |
| [SYSTEM$CLUSTERING_INFORMATION](functions/system_clustering_information.md) | Returns clustering information, including average clustering depth, for a table based on one or more columns in the table. | [System functions](functions-system.md) |
| [SYSTEM$CLUSTERING_RATIO — Deprecated](functions/system_clustering_ratio.md) | Calculates the clustering ratio for a table, based on one or more columns in the table. | [System functions](functions-system.md) |
| [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](functions/system_commit_move_organization_account.md) | Finalizes the process of moving an [organization account](../user-guide/organization-accounts.md) from one region to another. | [System functions](functions-system.md) |
| [SYSTEM$CONVERT_PIPES_SQS_TO_SNS](functions/system_convert_pipes_sqs_to_sns.md) | Convert pipes using Amazon SQS (Simple Queue Service) notifications to the Amazon Simple Notification Service (SNS) service for an S3 bucket. | [System functions](functions-system.md) |
| [SYSTEM$CREATE_BILLING_EVENT](functions/system_create_billing_event.md) | Creates a [billable event](../developer-guide/native-apps/adding-custom-event-billing.md) that tracks consumer usage of an installed monetized application. | [System functions](functions-system.md) |
| [SYSTEM$CREATE_BILLING_EVENTS](functions/system_create_billing_events.md) | Creates multiple [billable events](../developer-guide/native-apps/adding-custom-event-billing.md) that track consumer usage of installed monetized applications. | [System functions](functions-system.md) |
| [SYSTEM$CURRENT_USER_TASK_NAME](functions/system_current_user_task_name.md) | Returns the name of the task currently executing when invoked from the statement or stored procedure defined by the task. | [System functions](functions-system.md) |
| [SYSTEM$DATA_METRIC_SCAN](functions/system_data_metric_scan.md) | Returns the rows identified by a [data quality metric](../user-guide/data-quality-intro.md) as containing data that fails a data quality check. | [System functions](functions-system.md) , [Table functions](functions-table.md) |
| [SYSTEM$DATABASE_REFRESH_HISTORY — Deprecated](functions/system_database_refresh_history.md) | Returns a JSON object showing the refresh history for a secondary database. | [System functions](functions-system.md) |
| [SYSTEM$DATABASE_REFRESH_PROGRESS , SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB — Deprecated](functions/system_database_refresh_progress.md) | The SYSTEM$DATABASE_REFRESH_PROGRESS family of functions can be used to query the status of a database refresh along various dimensions. | [System functions](functions-system.md) |
| [SYSTEM$DEACTIVATE_CMK_INFO](functions/system_deactivate_cmk_info.md) | De-activates Tri-Secret Secure in your account. | [System functions](functions-system.md) |
| [SYSTEM$DECODE_PAT](functions/system_decode_pat.md) | Returns information about a [programmatic access token](../user-guide/programmatic-access-tokens.md), given the secret for the token. | [System functions](functions-system.md) |
| [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](functions/system_deprovision_privatelink_endpoint.md) | Deprovisions a private connectivity endpoint in the Snowflake VPC or VNet to prevent Snowflake from connecting to an external service by using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS](functions/system_deprovision_privatelink_endpoint_tss.md) | Deprovisions a private connectivity endpoint in the Snowflake VPC or VNet to prevent Snowflake from connecting to an external key management service (KMS) resource using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$DEREGISTER_CMK_INFO](functions/system_deregister_cmk_info.md) | Cancels registration of your currently-registered customer-managed key (CMK) for use with Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$DEREGISTER_CMK_INFO_POSTGRES](functions/system_deregister_cmk_info_postgres.md) | Cancels registration of your currently-registered customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY](functions/system_desc_iceberg_access_identity.md) | Returns information about the Snowflake service principal for a specified external cloud provider in an account. | [System functions](functions-system.md) |
| [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](functions/system_disable_behavior_change_bundle.md) | Disables the behavior changes included in the specified [behavior change release bundle](../release-notes/behavior-change-policy.md) for the current account. | [System functions](functions-system.md) |
| [SYSTEM$DISABLE_DATABASE_REPLICATION](functions/system_disable_database_replication.md) | Disable replication for a primary database and any secondary databases linked to it. | [System functions](functions-system.md) |
| [SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](functions/system_disable_global_data_sharing_for_account.md) | Disables Cross-Cloud Auto-Fulfillment on an account. | [System functions](functions-system.md) |
| [SYSTEM$DISABLE_PREVIEW_ACCESS](functions/system_disable_preview_access.md) | Disables access to [open preview](../release-notes/preview-features.md) and private preview features. | [System functions](functions-system.md) |
| [SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY](functions/system_disable_privatelink_access_only.md) | Unblocks connections for inbound network traffic that are routed over the public internet. | [System functions](functions-system.md) |
| [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](functions/system_enable_behavior_change_bundle.md) | Enables behavior changes included in the specified [behavior change release bundle](../release-notes/behavior-change-policy.md) for the current account. | [System functions](functions-system.md) |
| [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](functions/system_enable_global_data_sharing_for_account.md) | Enables Cross-Cloud Auto-Fulfillment on an account. | [System functions](functions-system.md) |
| [SYSTEM$ENABLE_PREVIEW_ACCESS](functions/system_enable_preview_access.md) | Enables access to [open preview](../release-notes/preview-features.md) features. | [System functions](functions-system.md) |
| [SYSTEM$ENCODE_CKE_PRIMARY_KEY](functions/system_encode_cke_primary_key.md) | Takes one or more [primary key](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) columns from a [Cortex Knowledge Extensions (CKE)](../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md) document and converts them into an encoded representation. | [System functions](functions-system.md) |
| [SYSTEM$END_DEBUG_APPLICATION](functions/system_end_debug_application.md) | Disables [session debug mode](../developer-guide/native-apps/installing-testing-application.md) for a Snowflake Native App. | [System functions](functions-system.md) |
| [SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY](functions/system_enforce_privatelink_access_only.md) | Enforces the behavior that successful connections to your Snowflake account use only your private endpoints. | [System functions](functions-system.md) |
| [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](functions/system_estimate_automatic_clustering_costs.md) | Returns estimated costs associated with enabling [Automatic Clustering](../user-guide/tables-auto-reclustering.md) for a table. | [System functions](functions-system.md) |
| [SYSTEM$ESTIMATE_QUERY_ACCELERATION](functions/system_estimate_query_acceleration.md) | For a previously executed query, this function returns a JSON object that specifies if the query is eligible to benefit from the [query acceleration service](../user-guide/query-acceleration-service.md). | [System functions](functions-system.md) |
| [SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS](functions/system_estimate_search_optimization_costs.md) | Returns the estimated costs of adding [search optimization](../user-guide/search-optimization-service.md) to a given table and configuring specific columns for search optimization. | [System functions](functions-system.md) |
| [SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS](functions/system_evaluate_data_quality_expectations.md) | Returns the [expectations](../user-guide/data-quality-expectations.md) for associations between data metric functions (DMFs) and a table, including whether an expectation is currently violated. | [System functions](functions-system.md) , [Table functions](functions-table.md) |
| [SYSTEM$EXPLAIN_JSON_TO_TEXT](functions/system_explain_json_to_text.md) | This function converts EXPLAIN output from JSON to formatted text. | [System functions](functions-system.md) |
| [SYSTEM$EXPLAIN_PLAN_JSON](functions/system_explain_plan_json.md) | Given the text of a SQL statement, this function generates the EXPLAIN plan in JSON. | [System functions](functions-system.md) |
| [SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW](functions/system_export_tds_from_semantic_view.md) | Returns a [semantic view](../user-guide/views-semantic/overview.md) in Tableau Data Source (TDS) format. | [System functions](functions-system.md) |
| [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](functions/system_external_table_pipe_status.md) | Retrieves a JSON representation of the current refresh status for the internal (hidden) pipe object associated with an external table. | [System functions](functions-system.md) |
| [SYSTEM$FINISH_OAUTH_FLOW](functions/system_finish_oauth_flow.md) | Sets the OAUTH_REFRESH_TOKEN parameter value of the secret passed as an argument in the [SYSTEM$START_OAUTH_FLOW](functions/system_start_oauth_flow.md) call that began the OAuth flow. | [System functions](functions-system.md) |
| [SYSTEM$GENERATE_SAML_CSR](functions/system_generate_saml_csr.md) | Generates a certificate signing request (CSR) with the subject set to the subject of the certificate stored in the [SAML2 integration](sql/create-security-integration-saml2.md) and can specify the `DN` to be used in the CSR. | [System functions](functions-system.md) |
| [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](functions/system_generate_scim_access_token.md) | Returns a new SCIM access token that is valid for six months. | [System functions](functions-system.md) |
| [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](functions/system_get_all_default_columns_overrides.md) | Returns the list of columns that were set by previous calls to [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_set_default_columns_override_for_show_command.md) and [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_set_default_columns_override_for_system_object.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_ALL_REFERENCES](functions/system_get_all_references.md) | Iterates through all associations for a reference and returns information about the associations. | [System functions](functions-system.md) |
| [SYSTEM$GET_AWS_SNS_IAM_POLICY](functions/system_get_aws_sns_iam_policy.md) | Returns an AWS IAM policy statement that must be added to the Amazon SNS topic policy in order to grant the Amazon SQS messaging queue created by Snowflake to subscribe to the topic. | [System functions](functions-system.md) |
| [SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG](functions/system_get_catalog_linked_database_config.md) | Returns the configuration parameters set on the specified [catalog-linked database](../user-guide/tables-iceberg-catalog-linked-database.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_CLASSIFICATION_RESULT](functions/system_get_classification_result.md) | Returns the classification result of the specified object. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_AKV_CONSENT_URL](functions/system_get_cmk_akv_consent_url.md) | Returns a consent URL to the Azure Key Vault account related to customer-managed keys. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_CONFIG](functions/system_get_cmk_config.md) | Returns configuration information for use with customer-managed keys (CMKs) and Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_CONFIG_POSTGRES](functions/system_get_cmk_config_postgres.md) | Returns configuration information for use with customer-managed keys (CMKs) and Snowflake Postgres Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_INFO](functions/system_get_cmk_info.md) | Returns the status of your customer-managed key (CMK) for use with Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_INFO_POSTGRES](functions/system_get_cmk_info_postgres.md) | Returns the status of your customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$GET_CMK_KMS_KEY_POLICY](functions/system_get_cmk_kms_key_policy.md) | Returns an ARRAY containing a snippet of the AWS Key Management Service policy information related to customer-managed keys. | [System functions](functions-system.md) |
| [SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE](functions/system_get_compute_pool_pending_maintenance.md) | Retrieves information about pending Snowflake [maintenance actions for compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md) in the current account. | [System functions](functions-system.md) |
| [SYSTEM$GET_DBT_LOG](functions/system_get_dbt_log.md) | Returns logs for the specified run for a dbt Projects on Snowflake. | [System functions](functions-system.md) |
| [SYSTEM$GET_DEBUG_STATUS](functions/system_get_debug_status.md) | Returns the [session debug mode](../developer-guide/native-apps/installing-testing-application.md) status of the current session. | [System functions](functions-system.md) |
| [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_get_default_columns_override_for_show_command.md) | Returns the list of columns that were set by a previous call to [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_set_default_columns_override_for_show_command.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_get_default_columns_override_for_system_object.md) | Returns the list of columns that were set by a previous call to [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_set_default_columns_override_for_system_object.md) for the specified Snowflake view (for example, for a specific [ACCOUNT_USAGE view](account-usage.md) or [INFORMATION_SCHEMA view](info-schema.md)). | [System functions](functions-system.md) |
| [SYSTEM$GET_DIRECTORY_TABLE_STATUS](functions/system_get_directory_table_status.md) | Returns a list of records that contain the [directory table](../user-guide/data-load-dirtables.md) consistency status for stages in your account. | [System functions](functions-system.md) |
| [SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD](functions/system_get_gcp_kms_cmk_grant_access_cmd.md) | Returns a Google Cloud gcloud command to obtain policy information for the Google Cloud Key Management Service for use with customer-managed keys. | [System functions](functions-system.md) |
| [SYSTEM$GET_HASH_FOR_APPLICATION](functions/system_get_hash_for_application.md) | Returns the hash value for a Snowflake Native App or query ID. | [System functions](functions-system.md) |
| [SYSTEM$GET_ICEBERG_TABLE_INFORMATION](functions/system_get_iceberg_table_information.md) | Returns the location of the root metadata file and status of the latest snapshot for an [Apache Iceberg™ table](../user-guide/tables-iceberg.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS](functions/system_get_instance_family_placement_groups.md) | Returns the list of placement groups supported for the specified [instance family](../developer-guide/snowpark-container-services/working-with-compute-pool.md) for [Snowpark Container Services compute pool nodes](../developer-guide/snowpark-container-services/working-with-compute-pool.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_LOGIN_FAILURE_DETAILS](functions/system_get_login_failure_details.md) | Returns a JSON object that represents an unsuccessful login attempt associated with External OAuth, SAML, or key pair authentication. | [System functions](functions-system.md) |
| [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](functions/system_get_predecessor_return_value.md) | Retrieves the return value for the predecessor task in a [task graph](../user-guide/tasks-graphs.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_PREVIEW_ACCESS_STATUS](functions/system_get_preview_access_status.md) | Determine if access to all preview features is enabled or disabled. | [System functions](functions-system.md) |
| [SYSTEM$GET_PRIVATELINK](functions/system_get_privatelink.md) | Verifies whether your current account is authorized for private connectivity to the Snowflake service. | [System functions](functions-system.md) |
| [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](functions/system_get_privatelink_authorized_endpoints.md) | Returns a list of the authorized endpoints for your current account to use with private connectivity to the Snowflake service. | [System functions](functions-system.md) |
| [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) | Returns a JSON representation of the Snowflake account information necessary to facilitate the self-service configuration of private connectivity to the Snowflake service or Snowflake internal stages. | [System functions](functions-system.md) |
| [SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS](functions/system_get_privatelink_endpoint_registrations.md) | Returns the registered private endpoints that can route your connection to the Snowflake service. | [System functions](functions-system.md) |
| [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](functions/system_get_privatelink_endpoints_info.md) | Returns the status of all private connectivity endpoints that you provision. | [System functions](functions-system.md) |
| [SYSTEM$GET_PURCHASE_ATTRIBUTES](functions/system_get_purchase_attributes.md) | Identifies the behavior of a listing at runtime. | [System functions](functions-system.md) |
| [SYSTEM$GET_REFERENCED_OBJECT_ID_HASH](functions/system_get_referenced_object_id_hash.md) | Returns the hash of the entity ID of the consumer object. | [System functions](functions-system.md) |
| [SYSTEM$GET_RESULTSET_STATUS](functions/system_get_resultset_status.md) | Returns the status of a [RESULTSET](../developer-guide/snowflake-scripting/resultsets.md) in a Snowflake Scripting stored procedure. | [System functions](functions-system.md) |
| [SYSTEM$GET_SERVICE_DNS_DOMAIN](functions/system_get_service_dns_domain.md) | Given a schema name, returns that schema’s DNS namespace hash as a string. | [System functions](functions-system.md) |
| [SYSTEM$GET_SERVICE_LOGS](functions/system_get_service_logs.md) | Retrieves local logs from a [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md) container. | [System functions](functions-system.md) |
| [SYSTEM$GET_SERVICE_STATUS — Deprecated](functions/system_get_service_status.md) | Retrieves the status of a [Snowpark Container Services service](../developer-guide/snowpark-container-services/working-with-services.md). | [System functions](functions-system.md) |
| [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](functions/system_get_snowflake_egress_ip_ranges.md) | Returns a list of egress IP address ranges (as Classless Inter-Domain Routing (CIDR) IP addresses) that you can use to represent Snowflake in a server’s IP allowlist. | [System functions](functions-system.md) |
| [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](functions/system_get_snowflake_platform_info.md) | Returns platform information for the cloud provider that hosts your Snowflake account. | [System functions](functions-system.md) |
| [SYSTEM$GET_TAG](functions/system_get_tag.md) | Returns the tag value associated with the specified Snowflake object or column. | [System functions](functions-system.md) |
| [SYSTEM$GET_TAG_ALLOWED_VALUES](functions/system_get_tag_allowed_values.md) | Returns a comma-separated list of string values that can be set on a [supported object](../user-guide/object-tagging/introduction.md), or NULL to indicate the tag key does not have any specified string values and accepts all [possible](../user-guide/object-tagging/introduction.md) string values. | [System functions](functions-system.md) |
| [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](functions/system_get_tag_on_current_column.md) | Returns the tag string value assigned to the column based upon the specified tag or NULL if a tag is not assigned to the specified column. | [System functions](functions-system.md) |
| [SYSTEM$GET_TAG_ON_CURRENT_TABLE](functions/system_get_tag_on_current_table.md) | Returns the tag string value assigned to the table based upon the specified tag or NULL if a tag is not assigned to the specified table. | [System functions](functions-system.md) |
| [SYSTEM$GET_TASK_GRAPH_CONFIG](functions/system_get_task_graph_config.md) | Returns information from a [task graph](../user-guide/tasks-graphs.md) configuration. | [System functions](functions-system.md) |
| [SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER](functions/system_global_account_set_parameter.md) | Enables replication and failover features for a specified account in an [organization](../user-guide/organizations.md). | [System functions](functions-system.md) |
| [SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT](functions/system_hold_privilege_on_account.md) | Indicates if a privilege has been granted to a Snowflake Native App. | [System functions](functions-system.md) |
| [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](functions/system_initiate_move_organization_account.md) | Starts the process of moving an [organization account](../user-guide/organization-accounts.md) to a new region. | [System functions](functions-system.md) |
| [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](functions/system_internal_stages_public_access_status.md) | Checks to see whether public IP addresses are allowed to access the internal stage of the current Snowflake account on Microsoft Azure. | [System functions](functions-system.md) |
| [SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED](functions/system_is_application_all_mandatory_telemetry_event_definitions_enabled.md) | Indicates that the AUTHORIZE_TELEMETRY_EVENT_SHARING property has been set on the app. | [System functions](functions-system.md) |
| [SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING](functions/system_is_application_authorized_for_telemetry_event_sharing.md) | Indicates that the AUTHORIZE_TELEMETRY_EVENT_SHARING has been set on the app. | [System functions](functions-system.md) |
| [SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT](functions/system_is_application_installed_from_same_account.md) | Shows if an app is installed on the same account as the application package it is based on. | [System functions](functions-system.md) |
| [SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER](functions/system_is_application_sharing_events_with_provider.md) | Shows if event sharing is enabled. | [System functions](functions-system.md) |
| [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](functions/system_is_global_data_sharing_enabled_for_account.md) | Specifies whether Cross-Cloud Auto-Fulfillment is enabled or disabled on an account. | [System functions](functions-system.md) |
| [SYSTEM$IS_LISTING_PURCHASED](functions/system_is_listing_purchased.md) | Returns TRUE if the consumer account querying data has purchased the listing, otherwise returns FALSE. | [System functions](functions-system.md) |
| [SYSTEM$IS_LISTING_TRIAL](functions/system_is_listing_trial.md) | Limits the functionality of a Snowflake Native App based on whether a consumer is trialing the application as part of a [Limited trial listings](../collaboration/collaboration-listings-about.md) or has access to the full data product. | [System functions](functions-system.md) |
| [SYSTEM$LAST_CHANGE_COMMIT_TIME](functions/system_last_change_commit_time.md) | Returns a token that can be used to detect whether a database table or view changed between two calls to the function. | [System functions](functions-system.md) |
| [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](functions/system_link_account_objects_by_name.md) | Adds a global identifier to account objects in the target (current) account that were created using scripts and that match objects with the same names in the source account. | [System functions](functions-system.md) |
| [SYSTEM$LINK_ORGANIZATION_USER](functions/system_link_organization_user.md) | Links an [organization user](../user-guide/organization-users.md) with a user that already exists in the regular account. | [System functions](functions-system.md) |
| [SYSTEM$LINK_ORGANIZATION_USER_GROUP](functions/system_link_organization_user_group.md) | Links an [organization user group](../user-guide/organization-users.md) with an access control role that already exists in the regular account. | [System functions](functions-system.md) |
| [SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES](functions/system_list_application_restricted_features.md) | Returns a JSON object containing a list of restricted features that the consumer has allowed a Snowflake Native App to use. | [System functions](functions-system.md) |
| [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](functions/system_list_iceberg_tables_from_catalog.md) | Lists tables in a remote Apache Iceberg™ REST catalog (including [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview)). | [System functions](functions-system.md) |
| [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](functions/system_list_namespaces_from_catalog.md) | Lists the namespaces in a remote Apache Iceberg™ REST catalog (including [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview)). | [System functions](functions-system.md) |
| [SYSTEM$LOCATE_DBT_ARCHIVE](functions/system_locate_dbt_archive.md) | Returns the URL from which you can retrieve zipped dbt run artifacts for a specified dbt project. | [System functions](functions-system.md) |
| [SYSTEM$LOCATE_DBT_ARTIFACTS](functions/system_locate_dbt_artifacts.md) | Returns the location of artifacts from a specified dbt Project run (for example, `manifest.json`). | [System functions](functions-system.md) |
| [SYSTEM$LOG, SYSTEM$LOG_<level> (for Snowflake Scripting)](functions/system_log.md) | Logs a message at the specified severity level. | [System functions](functions-system.md) |
| [SYSTEM$MIGRATE_SAML_IDP_REGISTRATION](functions/system_migrate_saml_idp_registration.md) | Migrates an existing SAML identity provider (i.e. IdP) configuration as defined by the account parameter [SAML_IDENTITY_PROVIDER](parameters.md) to a security integration. | [System functions](functions-system.md) |
| [SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS](functions/system_opt_in_internal_stage_network_logs.md) | Starts record collection of network access attempts to internal stage locations for this account. | [System functions](functions-system.md) |
| [SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS](functions/system_opt_out_internal_stage_network_logs.md) | Stops record collection of network access attempts to internal stage locations for this account. | [System functions](functions-system.md) |
| [SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY](functions/system_opt_out_malicious_ip_protection_by_category.md) | Disables [Malicious IP Protection](../user-guide/malicious-ip-protection.md) for one or more curated IP categories in the current account. | [System functions](functions-system.md) |
| [SYSTEM$PIPE_FORCE_RESUME](functions/system_pipe_force_resume.md) | Forces a pipe paused using [ALTER PIPE](sql/alter-pipe.md) to resume. | [System functions](functions-system.md) |
| [SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL](functions/system_pipe_rebinding_with_notification_channel.md) | Retries the notification channel binding process when a replicated pipe has not been successfully bound to a notification channel during replication time. | [System functions](functions-system.md) |
| [SYSTEM$PIPE_STATUS](functions/system_pipe_status.md) | Retrieves a JSON representation of the current status of a pipe. | [System functions](functions-system.md) |
| [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](functions/system_provision_privatelink_endpoint.md) | Provisions a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external service by using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](functions/system_provision_privatelink_endpoint_tss.md) | Provisions a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to a key management service (KMS) by using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$QUERY_REFERENCE](functions/system_query_reference.md) | Returns a [query reference](../developer-guide/stored-procedure/stored-procedures-calling-references.md) that you can pass to a stored procedure. | [System functions](functions-system.md) |
| [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](functions/system_read_yaml_from_semantic_view.md) | Returns the [specification of a semantic model (in YAML format)](../user-guide/views-semantic/sql.md) for a [semantic view](../user-guide/views-semantic/overview.md). | [System functions](functions-system.md) |
| [SYSTEM$REFERENCE](functions/system_reference.md) | Returns a [reference](references.md) to an object (a table, view, or function). | [System functions](functions-system.md) |
| [SYSTEM$REGISTER_CMK_INFO](functions/system_register_cmk_info.md) | Registers your customer-managed key (CMK) for use with Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$REGISTER_CMK_INFO_POSTGRES](functions/system_register_cmk_info_postgres.md) | Registers your customer-managed key (CMK) for use with Snowflake Postgres Tri-Secret Secure. | [System functions](functions-system.md) |
| [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](functions/system_register_privatelink_endpoint.md) | Registers a private connectivity endpoint to route your connection to the Snowflake service. | [System functions](functions-system.md) |
| [SYSTEM$REGISTRY_LIST_IMAGES — Deprecated](functions/system_registry_list_images.md) | Lists images in an [image repository](../developer-guide/snowpark-container-services/working-with-registry-repository.md). | [System functions](functions-system.md) |
| [SYSTEM$REMOVE_ALL_REFERENCES](functions/system_remove_all_references.md) | Deletes all associations to the reference. | [System functions](functions-system.md) |
| [SYSTEM$REMOVE_REFERENCE](functions/system_remove_reference.md) | Remove an association from the reference to an object in the consumer account and returns a unique system-generated alias for the reference. | [System functions](functions-system.md) |
| [SYSTEM$REPORT_HEALTH_STATUS](functions/system_report_health_status.md) | Sends [application health information](../developer-guide/native-apps/monitoring.md) from a consumer app to the provider account. | [System functions](functions-system.md) |
| [SYSTEM$RESOLVE_PYTHON_PACKAGES](functions/system_resolve_python_packages.md) | Returns a list of the resolved dependencies and their versions for the Python packages that were specified. | [System functions](functions-system.md) |
| [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](functions/system_restore_privatelink_endpoint.md) | Restores a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external service using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS](functions/system_restore_privatelink_endpoint_tss.md) | Restores a private connectivity endpoint in the Snowflake VPC or VNet to enable Snowflake to connect to an external key management service (KMS) resource by using private connectivity. | [System functions](functions-system.md) |
| [SYSTEM$REVOKE_PRIVATELINK](functions/system_revoke_privatelink.md) | Disables private connectivity to the Snowflake service for the current account. | [System functions](functions-system.md) |
| [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](functions/system_revoke_stage_privatelink_access.md) | Revokes the authorization for Snowflake to access the private endpoint for [Azure private endpoints for internal stages](../user-guide/private-internal-stages-azure.md) and [Google Private Service Connect endpoints for internal stages](../user-guide/private-internal-stages-gcp.md) for the current account. | [System functions](functions-system.md) |
| [SYSTEM$SAP_BDC_LIST_SHARES](functions/system_sap_bdc_list_shares.md) | Lists Data Products shared by SAP® Business Data Cloud with the enrolled catalog integration. | [System functions](functions-system.md) |
| [SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH](functions/system_schedule_async_replication_group_refresh.md) | Starts a refresh operation for a replication group or a failover group, in the background. | [System functions](functions-system.md) |
| [SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG](functions/system_send_notifications_to_catalog.md) | Sends a notification to [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview) to update Snowflake-managed [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) in Open Catalog with the latest table changes, and returns whether the notification was sent successfully along with an error code and error message for the failure, if applicable. | [System functions](functions-system.md) |
| [SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS](functions/system_set_application_restricted_feature_access.md) | Enables a restricted feature for a Snowflake Native App. | [System functions](functions-system.md) |
| [SYSTEM$SET_CATALOG_INTEGRATION](functions/system_set_catalog_integration.md) | Replaces the catalog integration associated with an externally managed [Apache Iceberg™ table](../user-guide/tables-iceberg.md). | [System functions](functions-system.md) |
| [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_set_default_columns_override_for_show_command.md) | Controls the columns that should be returned when the specified [SHOW <objects>](sql/show.md) command is executed. | [System functions](functions-system.md) |
| [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_set_default_columns_override_for_system_object.md) | Controls the columns that should be returned when you select all columns (`SELECT *`) from the specified Snowflake view (for example, from a specific [ACCOUNT_USAGE view](account-usage.md) or [INFORMATION_SCHEMA view](info-schema.md)). | [System functions](functions-system.md) |
| [SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION](functions/system_set_event_sharing_account_for_region.md) | Sets the event account for a region. | [System functions](functions-system.md) |
| [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](functions/system_set_privatelink_endpoint_hostname.md) | Modifies only the host name of an existing [private connectivity endpoint](../user-guide/private-connectivity-outbound.md). | [System functions](functions-system.md) |
| [SYSTEM$SET_REFERENCE](functions/system_set_reference.md) | Called by a Snowflake Native App to associate a consumer reference string to a reference definition. | [System functions](functions-system.md) |
| [SYSTEM$SET_RETURN_VALUE](functions/system_set_return_value.md) | Explicitly sets the return value for a task. | [System functions](functions-system.md) |
| [SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES](functions/system_set_row_timestamp_on_all_supported_tables.md) | Use this system function to bulk enable row timestamps on existing tables. | [System functions](functions-system.md) |
| [SYSTEM$SET_SPAN_ATTRIBUTES (for Snowflake Scripting)](functions/system_set_span_attributes.md) | Sets attribute name and value associated with a span containing trace events. | [System functions](functions-system.md) |
| [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](functions/system_show_active_behavior_change_bundles.md) | Returns an array of the currently available [behavior change release bundles](../release-notes/behavior-change-policy.md), the default state of each bundle, and the actual state of the bundle for the current account. | [System functions](functions-system.md) |
| [SYSTEM$SHOW_BUDGETS_FOR_RESOURCE](functions/system_show_budgets_for_resource.md) | Returns a string containing a list of the [budgets](../user-guide/budgets.md) that track a specified resource (for example, a table or a schema). | [System functions](functions-system.md) |
| [SYSTEM$SHOW_BUDGETS_IN_ACCOUNT](functions/system_show_budgets_in_account.md) | Returns the [budgets](../user-guide/budgets.md) in the account for which you have access privileges. | [System functions](functions-system.md) |
| [SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS](functions/system_show_event_sharing_accounts.md) | Shows event accounts in a provider organization. | [System functions](functions-system.md) |
| [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](functions/system_show_move_organization_account_status.md) | Returns the status of an attempt to move an [organization account](../user-guide/organization-accounts.md). | [System functions](functions-system.md) |
| [SYSTEM$SHOW_OAUTH_CLIENT_SECRETS](functions/system_show_oauth_client_secrets.md) | Returns the client secrets in a string. | [System functions](functions-system.md) |
| [SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES](functions/system_show_sensitive_data_monitored_entities.md) | Returns a JSON array of databases or schemas that are associated with a classification profile, which indicates that objects in these entities are monitored by [sensitive data classification](../user-guide/classify-intro.md). | [System functions](functions-system.md) |
| [SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN](functions/system_snowpipe_streaming_update_channel_offset_token.md) | Updates the offset token for a particular channel used by Snowpipe Streaming with a new offset token. | [System functions](functions-system.md) |
| [SYSTEM$START_OAUTH_FLOW](functions/system_start_oauth_flow.md) | Initiates the OAUTH client flow, returning a URL you use in a browser to complete the OAuth consent process. | [System functions](functions-system.md) |
| [SYSTEM$START_USER_EMAIL_VERIFICATION](functions/system_start_user_email_verification.md) | Starts the [email verification process for a user](../user-guide/notifications/email-notifications.md). | [System functions](functions-system.md) |
| [SYSTEM$STREAM_BACKLOG](functions/system_stream_backlog.md) | Returns the set of table versions between the current [offset](../user-guide/streams-intro.md) for a specified stream and the current timestamp. | [Table functions](functions-table.md) , [System functions](functions-system.md) |
| [SYSTEM$STREAM_GET_TABLE_TIMESTAMP](functions/system_stream_get_table_timestamp.md) | Returns the timestamp in nanoseconds of the latest table version at or before the current offset for the specified stream. | [System functions](functions-system.md) |
| [SYSTEM$STREAM_HAS_DATA](functions/system_stream_has_data.md) | Indicates whether a specified stream contains change data capture (CDC) records. | [System functions](functions-system.md) |
| [SYSTEM$SUPPORTED_DBT_VERSIONS](functions/system_supported_dbt_versions.md) | Returns a JSON array containing the versions that Snowflake supports for dbt Projects. | [System functions](functions-system.md) |
| [SYSTEM$TASK_DEPENDENTS_ENABLE](functions/system_task_dependents_enable.md) | Recursively resumes a specified task and all its dependent tasks. | [System functions](functions-system.md) |
| [SYSTEM$TASK_RUNTIME_INFO](functions/system_task_runtime_info.md) | Returns information about the current task run. | [System functions](functions-system.md) |
| [SYSTEM$TRIGGER_LISTING_REFRESH](functions/system_trigger_listing_refresh.md) | Triggers a one-time, on-demand data refresh for a provider’s databases or listings, accessible to all consumers. | [System functions](functions-system.md) |
| [SYSTEM$TYPEOF](functions/system_typeof.md) | Returns a string representing the SQL data type associated with an expression. | [System functions](functions-system.md) |
| [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](functions/system_unblock_internal_stages_public_access.md) | Allows traffic from public IP addresses to access the internal stage of the current Snowflake account on Microsoft Azure. | [System functions](functions-system.md) |
| [SYSTEM$UNLINK_ORGANIZATION_USER](functions/system_unlink_organization_user.md) | Unlinks a user object from an [organization user](../user-guide/organization-users.md) so it can be managed as a local user going forward. | [System functions](functions-system.md) |
| [SYSTEM$UNLINK_ORGANIZATION_USER_GROUP](functions/system_unlink_organization_user_group.md) | Unlinks an access control role from an [organization user group](../user-guide/organization-users.md) so it can be managed as a local role going forward. | [System functions](functions-system.md) |
| [SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT](functions/system_unregister_privatelink_endpoint.md) | Unregisters a private connectivity endpoint to route your connection to the Snowflake service. | [System functions](functions-system.md) |
| [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_unset_default_columns_override_for_show_command.md) | Clears the list of columns specified by a previous call to [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_set_default_columns_override_for_show_command.md) for a type of object. | [System functions](functions-system.md) |
| [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_unset_default_columns_override_for_system_object.md) | Clears the list of columns specified by a previous call to [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_set_default_columns_override_for_system_object.md) for the specified Snowflake view (for example, for a specific [ACCOUNT_USAGE view](account-usage.md) or [INFORMATION_SCHEMA view](info-schema.md)). | [System functions](functions-system.md) |
| [SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION](functions/system_unset_event_sharing_account_for_region.md) | Unsets the events account for a region. | [System functions](functions-system.md) |
| [SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS](functions/system_user_task_cancel_ongoing_executions.md) | Cancels a run of the specified task that the system has already started to process (that is, a run with an EXECUTING state in the [TASK_HISTORY](functions/task_history.md) output). | [System functions](functions-system.md) |
| [SYSTEM$VALIDATE_STORAGE_INTEGRATION](functions/system_validate_storage_integration.md) | Validates the configuration for a specified storage integration. | [System functions](functions-system.md) |
| [SYSTEM$VERIFY_CATALOG_INTEGRATION](functions/system_verify_catalog_integration.md) | Verifies the configuration for a specified catalog integration for Apache Iceberg™ REST. | [System functions](functions-system.md) |
| [SYSTEM$VERIFY_CMK_INFO](functions/system_verify_cmk_info.md) | Verifies your customer-managed key (CMK) configuration and returns a message about the registered CMK. | [System functions](functions-system.md) |
| [SYSTEM$VERIFY_CMK_INFO_POSTGRES](functions/system_verify_cmk_info_postgres.md) | Verifies your customer-managed key (CMK) configuration for Snowflake Postgres Tri-Secret Secure and returns a message about the registered CMK. | [System functions](functions-system.md) |
| [SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN](functions/system_verify_ext_oauth_token.md) | Determines whether your [External OAuth](../user-guide/oauth-ext-overview.md) access token is valid or has expired and needs to be regenerated. | [System functions](functions-system.md) |
| [SYSTEM$VERIFY_EXTERNAL_VOLUME](functions/system_verify_external_volume.md) | Verifies the configuration for a specified [external volume](../user-guide/tables-iceberg-configure-external-volume.md). | [System functions](functions-system.md) |
| [SYSTEM$WAIT](functions/system_wait.md) | Waits for the specified amount of time before proceeding. | [System functions](functions-system.md) |
| [SYSTEM$WAIT_FOR_SERVICES](functions/system_wait_for_services.md) | Waits for one or more [Snowpark Container Services services](../developer-guide/snowpark-container-services/working-with-services.md) to reach the READY state (or becomes upgraded) before returning. | [System functions](functions-system.md) |
| [SYSTEM$WHITELIST — Deprecated](functions/system_whitelist.md) | Returns hostnames and port numbers to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall. | [System functions](functions-system.md) |
| [SYSTEM$WHITELIST_PRIVATELINK — Deprecated](functions/system_whitelist_privatelink.md) | Returns hostnames and port numbers for [AWS PrivateLink](https://aws.amazon.com/privatelink/), [Azure Private Link](https://azure.microsoft.com/en-us/services/private-link/), and [Google Cloud Private Service Connect](https://cloud.google.com/vpc/docs/configure-private-service-connect-services) deployments to add to your firewall’s allowed list so that you can access Snowflake from behind your firewall. | [System functions](functions-system.md) |
| [SYSTIMESTAMP](functions/systimestamp.md) | Returns the current timestamp for the system. | [Context functions](functions-context.md) |
| **T** |  |  |
| [TAG_REFERENCES](functions/tag_references.md) | Returns a table in which each row displays an association between a tag and value. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [TAG_REFERENCES_ALL_COLUMNS](functions/tag_references_all_columns.md) | Returns a table in which each row displays the tag name and tag value assigned to a specific column. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [TAG_REFERENCES_WITH_LINEAGE](functions/tag_references_with_lineage.md) | Returns a table in which each row displays an association between the specified tag and the Snowflake object to which the tag is associated. | [Account Usage table functions](account-usage.md) , [Table functions](functions-table.md) |
| [TAN](functions/tan.md) | Computes the tangent of its argument; the argument should be expressed in radians. | [Numeric functions](functions-numeric.md) |
| [TANH](functions/tanh.md) | Computes the hyperbolic tangent of its argument. | [Numeric functions](functions-numeric.md) |
| [TASK_DEPENDENTS](functions/task_dependents.md) | This table function returns the list of child [tasks](../user-guide/tasks-intro.md) for a given root task in a [task graph](../user-guide/tasks-graphs.md). | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [TASK_HISTORY](functions/task_history.md) | You can use this table function to query the history of [task](../user-guide/tasks-intro.md) usage within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [TEXT_HTML](functions/text_html.md) | Returns a JSON object that specifies the HTML message to use for a notification. | [Notification functions](functions-notification.md) |
| [TEXT_PLAIN](functions/text_plain.md) | Returns a JSON object that specifies the plain text message to use for a notification. | [Notification functions](functions-notification.md) |
| [TIME_FROM_PARTS](functions/time_from_parts.md) | Creates a time from individual numeric components. | [Date & time functions](functions-date-time.md) |
| [TIME_SLICE](functions/time_slice.md) | Calculates the beginning or end of a “slice” of time, where the length of the slice is a multiple of a standard unit of time (minute, hour, day, etc.). | [Date & time functions](functions-date-time.md) |
| [TIMEADD](functions/timeadd.md) | Adds the specified value for the specified date or time part to a date, time, or timestamp. | [Date & time functions](functions-date-time.md) |
| [TIMEDIFF](functions/timediff.md) | Calculates the difference between two date, time, or timestamp expressions based on the specified date or time part. | [Date & time functions](functions-date-time.md) |
| [TIMESTAMP_FROM_PARTS](functions/timestamp_from_parts.md) | Creates a timestamp from individual numeric components. | [Date & time functions](functions-date-time.md) |
| [TIMESTAMPADD](functions/timestampadd.md) | Adds the specified value for the specified date or time part to a date, time, or timestamp. | [Date & time functions](functions-date-time.md) |
| [TIMESTAMPDIFF](functions/timestampdiff.md) | Calculates the difference between two date, time, or timestamp expressions based on the specified date or time part. | [Date & time functions](functions-date-time.md) |
| [TO_ARRAY](functions/to_array.md) | Converts the input expression to an [ARRAY](data-types-semistructured.md) value. | [Conversion functions](functions-conversion.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [TO_BINARY](functions/to_binary.md) | Converts the input expression to a binary value. | [Conversion functions](functions-conversion.md) |
| [TO_BOOLEAN](functions/to_boolean.md) | Converts the input text or numeric expression to a [BOOLEAN](data-types-logical.md) value. | [Conversion functions](functions-conversion.md) |
| [TO_CHAR , TO_VARCHAR](functions/to_char.md) | Converts the input expression to a string. | [Conversion functions](functions-conversion.md) |
| [TO_DATE , DATE](functions/to_date.md) | Converts an input expression to a date. | [Conversion functions](functions-conversion.md) , [Date & time functions](functions-date-time.md) |
| [TO_DECFLOAT](functions/to_decfloat.md) | Converts an expression to a decimal floating-point number ([DECFLOAT](data-types-numeric.md)). | [Conversion functions](functions-conversion.md) |
| [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](functions/to_decimal.md) | Converts an input expression to a fixed-point number. | [Conversion functions](functions-conversion.md) |
| [TO_DOUBLE](functions/to_double.md) | Converts an expression to a double-precision floating-point number. | [Conversion functions](functions-conversion.md) |
| [TO_FILE](functions/to_file.md) | Constructs a value of type [FILE](data-types-unstructured.md) from a file location or from metadata. | [File functions](functions-file.md) |
| [TO_GEOGRAPHY](functions/to_geography.md) | Parses an input and returns a value of type [GEOGRAPHY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [TO_GEOMETRY](functions/to_geometry.md) | Parses an input and returns a value of type [GEOMETRY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [TO_JSON](functions/to_json.md) | Converts a [VARIANT](data-types-semistructured.md) value to a string containing the JSON representation of the value. | [Conversion functions](functions-conversion.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [TO_OBJECT](functions/to_object.md) | Converts the input value to an [OBJECT](data-types-semistructured.md). | [Conversion functions](functions-conversion.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [TO_QUERY](functions/to_query.md) | Returns a result set based on SQL text and an optional set of arguments that are passed to the SQL text if it is parameterized. | [Table functions](functions-table.md) |
| [TO_TIME , TIME](functions/to_time.md) | Converts an input expression into a time. | [Conversion functions](functions-conversion.md) , [Date & time functions](functions-date-time.md) |
| [TO_TIMESTAMP / TO_TIMESTAMP_\*](functions/to_timestamp.md) | Converts an input expression into the corresponding timestamp. | [Conversion functions](functions-conversion.md) , [Date & time functions](functions-date-time.md) |
| [TO_UUID](functions/to_uuid.md) | Converts the input expression to a [UUID](data-types-uuid.md) value. | [Conversion functions](functions-conversion.md) |
| [TO_VARIANT](functions/to_variant.md) | Converts any value to a [VARIANT](data-types-semistructured.md) value or NULL (if input is NULL). | [Conversion functions](functions-conversion.md) |
| [TO_XML](functions/to_xml.md) | Converts a [VARIANT](data-types-semistructured.md) to a VARCHAR that contains an [XML](../user-guide/semistructured-data-formats.md) representation of the value. | [Conversion functions](functions-conversion.md) , [Semi-structured and structured data functions](functions-semistructured.md) |
| [TRANSFORM](functions/transform.md) | Transforms an [array](data-types-semistructured.md) based on the logic in a lambda expression. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [TRANSLATE (SNOWFLAKE.CORTEX)](functions/translate-snowflake-cortex.md) | Translates the given input text from one supported language to another. | [String & binary functions](functions-string.md) |
| [TRANSLATE](functions/translate.md) | Replaces characters in a string. | [String & binary functions](functions-string.md) |
| [TRIM](functions/trim.md) | Removes leading and trailing characters from a string. | [String & binary functions](functions-string.md) |
| [TRUNCATE , TRUNC](functions/trunc.md) | Rounds the input expression down to the nearest (or equal) value closer to zero. | [Numeric functions](functions-numeric.md) |
| [TRUNCATE, TRUNC](functions/trunc2.md) | Truncates a DATE, TIME, or TIMESTAMP value to the specified precision. | [Date & time functions](functions-date-time.md) |
| [TRY_BASE64_DECODE_BINARY](functions/try_base64_decode_binary.md) | A special version of [BASE64_DECODE_BINARY](functions/base64_decode_binary.md) that returns a NULL value if an error occurs during decoding. | [String & binary functions](functions-string.md) |
| [TRY_BASE64_DECODE_STRING](functions/try_base64_decode_string.md) | A special version of [BASE64_DECODE_STRING](functions/base64_decode_string.md) that returns a NULL value if an error occurs during decoding. | [String & binary functions](functions-string.md) |
| [TRY_CAST](functions/try_cast.md) | A special version of [CAST , ::](functions/cast.md) that is available for a subset of data type conversions. | [Conversion functions](functions-conversion.md) |
| [TRY_COMPLETE (SNOWFLAKE.CORTEX)](functions/try_complete-snowflake-cortex.md) | Performs the same operation as the [COMPLETE](functions/complete-snowflake-cortex.md) function but returns NULL instead of raising an error when the operation cannot be performed. | [String & binary functions](functions-string.md) |
| [TRY_DECRYPT](functions/try_decrypt.md) | A special version of [DECRYPT](functions/decrypt.md) that returns a NULL value if an error occurs during decryption. | [Encryption functions](functions-encryption.md) |
| [TRY_DECRYPT_RAW](functions/try_decrypt_raw.md) | A special version of [DECRYPT_RAW](functions/decrypt_raw.md) that returns a NULL value if an error occurs during decryption. | [Encryption functions](functions-encryption.md) |
| [TRY_HEX_DECODE_BINARY](functions/try_hex_decode_binary.md) | A special version of [HEX_DECODE_BINARY](functions/hex_decode_binary.md) that returns a NULL value if an error occurs during decoding. | [String & binary functions](functions-string.md) |
| [TRY_HEX_DECODE_STRING](functions/try_hex_decode_string.md) | A special version of [HEX_DECODE_STRING](functions/hex_decode_string.md) that returns a NULL value if an error occurs during decoding. | [String & binary functions](functions-string.md) |
| [TRY_PARSE_JSON](functions/try_parse_json.md) | A special version of [PARSE_JSON](functions/parse_json.md) that returns a NULL value if an error occurs during parsing. | [Semi-structured and structured data functions](functions-semistructured.md) |
| [TRY_TO_BINARY](functions/try_to_binary.md) | A special version of [TO_BINARY](functions/to_binary.md) that performs the same operation (i.e. converts an input expression to a binary value), but with error handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error). | [Conversion functions](functions-conversion.md) |
| [TRY_TO_BOOLEAN](functions/try_to_boolean.md) | A special version of [TO_BOOLEAN](functions/to_boolean.md) that performs the same operation (that is, converts an input expression to a Boolean value), but with error-handling support. | [Conversion functions](functions-conversion.md) |
| [TRY_TO_DATE](functions/try_to_date.md) | A special version of the [TO_DATE](functions/to_date.md) function that performs the same operation (i.e. converts an input expression to a date), but with error-handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error). | [Conversion functions](functions-conversion.md) , [Date & time functions](functions-date-time.md) |
| [TRY_TO_DECFLOAT](functions/try_to_decfloat.md) | A special version of [TO_DECFLOAT](functions/to_decfloat.md) that performs the same operation — that is, converts an input expression to a [DECFLOAT](data-types-numeric.md) — but with error-handling support. | [Conversion functions](functions-conversion.md) |
| [TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC](functions/try_to_decimal.md) | A special version of [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](functions/to_decimal.md) that performs the same operation of converting an input expression to a fixed-point number, but has error-handling support so that the function returns NULL if the conversion can’t be performed. | [Conversion functions](functions-conversion.md) |
| [TRY_TO_DOUBLE](functions/try_to_double.md) | A special version of [TO_DOUBLE](functions/to_double.md) that performs the same operation (that is, converts an input expression to a double-precision floating-point number), but with error-handling support (that is, if the conversion can’t be performed, it returns a NULL value instead of raising an error). | [Conversion functions](functions-conversion.md) |
| [TRY_TO_FILE](functions/try_to_file.md) | A version of [TO_FILE](functions/to_file.md) that returns NULL instead of raising an error. | [File functions](functions-file.md) |
| [TRY_TO_GEOGRAPHY](functions/try_to_geography.md) | Parses an input and returns a value of type [GEOGRAPHY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [TRY_TO_GEOMETRY](functions/try_to_geometry.md) | Parses an input and returns a value of type [GEOMETRY](data-types-geospatial.md). | [Geospatial functions](functions-geospatial.md) , [Conversion functions](functions-conversion.md) |
| [TRY_TO_TIME](functions/try_to_time.md) | A special version of [TO_TIME , TIME](functions/to_time.md) that performs the same operation (i.e. converts an input expression into a time), but with error-handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error). | [Conversion functions](functions-conversion.md) |
| [TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*](functions/try_to_timestamp.md) | A special version of [TO_TIMESTAMP / TO_TIMESTAMP_\*](functions/to_timestamp.md) that performs the same operation (i.e. converts an input expression into a timestamp), but with error-handling support (i.e. if the conversion cannot be performed, it returns a NULL value instead of raising an error). | [Conversion functions](functions-conversion.md) |
| [TRY_TO_UUID](functions/try_to_uuid.md) | A special version of [TO_UUID](functions/to_uuid.md) that performs the same operation — that is, converts an input expression to a [UUID](data-types-uuid.md) value — but with error handling support. | [Conversion functions](functions-conversion.md) |
| [TYPEOF](functions/typeof.md) | Returns the type of a value stored in a [VARIANT](data-types-semistructured.md) column. | [Semi-structured and structured data functions](functions-semistructured.md) |
| **U** |  |  |
| [UNICODE](functions/unicode.md) | Returns the Unicode code point for the first Unicode character in a string. | [String & binary functions](functions-string.md) |
| [UNIFORM](functions/uniform.md) | Generates a uniformly-distributed pseudo-random number in the inclusive range [`min`, `max`]. | [Data generation functions](functions-data-generation.md) |
| [UPPER](functions/upper.md) | Returns the input string with all characters converted to uppercase. | [String & binary functions](functions-string.md) |
| [UUID_STRING](functions/uuid_string.md) | Generates either a version 4 (random) or version 5 (named) RFC 4122-compliant universally unique identifier (UUID) as a formatted string. | [String & binary functions](functions-string.md) , [Data generation functions](functions-data-generation.md) |
| **V** |  |  |
| [VALIDATE](functions/validate.md) | Validates the files loaded in a past execution of the [COPY INTO <table>](sql/copy-into-table.md) command and returns all the errors encountered during the load, rather than just the first error. | [Table functions](functions-table.md) |
| [VALIDATE_PIPE_LOAD](functions/validate_pipe_load.md) | This table function can be used to validate data files processed by [Snowpipe](../user-guide/data-load-snowpipe-intro.md) within a specified time range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [VAR_POP](functions/var_pop.md) | Returns the population variance of non-NULL records in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [VAR_SAMP](functions/var_samp.md) | Returns the sample variance of non-NULL records in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [VARIANCE , VARIANCE_SAMP](functions/variance.md) | Returns the sample variance of non-NULL records in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [VARIANCE_POP](functions/variance_pop.md) | Returns the population variance of non-NULL records in a group. | [Aggregate functions](functions-aggregation.md) , [Window function syntax and usage](functions-window-syntax.md) |
| [VECTOR_AVG](functions/vector_avg.md) | Computes the element-wise average of [vectors](../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. | [Vector functions](functions-vector.md) , [Aggregate functions](functions-aggregation.md) |
| [VECTOR_COSINE_SIMILARITY](functions/vector_cosine_similarity.md) | Computes the cosine similarity between two [vectors](../user-guide/snowflake-cortex/vector-embeddings.md). | [Vector functions](functions-vector.md) |
| [VECTOR_INNER_PRODUCT](functions/vector_inner_product.md) | Computes the inner product of two [vectors](../user-guide/snowflake-cortex/vector-embeddings.md). | [Vector functions](functions-vector.md) |
| [VECTOR_L1_DISTANCE](functions/vector_l1_distance.md) | Computes the L1 distance between two [vectors](../user-guide/snowflake-cortex/vector-embeddings.md). | [Vector functions](functions-vector.md) |
| [VECTOR_L2_DISTANCE](functions/vector_l2_distance.md) | Computes the L2 distance between two [vectors](../user-guide/snowflake-cortex/vector-embeddings.md). | [Vector functions](functions-vector.md) |
| [VECTOR_MAX](functions/vector_max.md) | Computes the element-wise maximum of [vectors](../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. | [Vector functions](functions-vector.md) , [Aggregate functions](functions-aggregation.md) |
| [VECTOR_MIN](functions/vector_min.md) | Computes the element-wise minimum of [vectors](../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. | [Vector functions](functions-vector.md) , [Aggregate functions](functions-aggregation.md) |
| [VECTOR_NORMALIZE](functions/vector_normalize.md) | Normalizes a [VECTOR](data-types-vector.md) in the L2 vector space, giving its elements values in the range of [0,1] and giving it a magnitude of 1. | [Vector functions](functions-vector.md) |
| [VECTOR_SUM](functions/vector_sum.md) | Computes the element-wise sum of [vectors](../user-guide/snowflake-cortex/vector-embeddings.md) in an aggregate. | [Vector functions](functions-vector.md) , [Aggregate functions](functions-aggregation.md) |
| [VECTOR_TRUNCATE](functions/vector_truncate.md) | Truncates a [VECTOR](data-types-vector.md) to a smaller dimension. | [Vector functions](functions-vector.md) |
| **W** |  |  |
| [WAREHOUSE_LOAD_HISTORY](functions/warehouse_load_history.md) | This table function can be used to query the activity history (defined as the “query load”) for a single warehouse within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [WAREHOUSE_METERING_HISTORY](functions/warehouse_metering_history.md) | This table function can be used in queries to return the hourly credit usage for a single warehouse (or all the warehouses in your account) within a specified date range. | [Information Schema](info-schema.md) , [Table functions](functions-table.md) |
| [WIDTH_BUCKET](functions/width_bucket.md) | Constructs equi-width histograms, in which the histogram range is divided into intervals of identical size, and returns the bucket number into which the value of an expression falls, after it has been evaluated. | [Numeric functions](functions-numeric.md) |
| **X** |  |  |
| [XMLGET](functions/xmlget.md) | Extracts an [XML](../user-guide/semistructured-data-formats.md) element object (often referred to as simply a *tag*) from the content of the outer XML element based on the name and instance number of the specified tag. | [Semi-structured and structured data functions](functions-semistructured.md) |
| **Y** |  |  |
| [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](functions/year.md) | Extracts the corresponding date part from a date or timestamp. | [Date & time functions](functions-date-time.md) |
| **Z** |  |  |
| [ZEROIFNULL](functions/zeroifnull.md) | Returns 0 if its argument is null; otherwise, returns its argument. | [Conditional expression functions](expressions-conditional.md) |
| [ZIPF](functions/zipf.md) | Returns a Zipf-distributed integer, for `N` elements and characteristic exponent `s`. | [Data generation functions](functions-data-generation.md) |

---
title: APPLICATION_STATE view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/application-state-view.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# APPLICATION_STATE view

This view in the DATA_SHARING_USAGE schema can be used to display information about apps installed
from a listing for all application packages in the current account.

If a listing was published using
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md),
this view displays information for installed apps across all regions.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSUMER_SNOWFLAKE_REGION | VARCHAR | The Snowflake region of the consumer account where the app is installed. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | The organization name of the consumer account. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | The consumer account locator. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | The consumer account name. |
| PROVIDER_SNOWFLAKE_REGION | VARCHAR | The Snowflake region of the provider account that created the application package. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | The provider account locator. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | The provider account name. |
| PACKAGE_NAME | VARCHAR | The current name for the application package in the provider’s account from which the app was installed. |
| APPLICATION_NAME_HASH | VARCHAR | The hash string of the name of the installed app instance in the consumer account. The consumer uses the [SYSTEM$GET_HASH_FOR_APPLICATION](../functions/system_get_hash_for_application.md) function to calculate the hash value of the installed application. The consumer can then use this value when contacting the provider. |
| CREATED_ON | DATETIME | The timestamp when the app instance was first installed. |
| CURRENT_VERSION | VARCHAR | The current version of the app. |
| CURRENT_PATCH | INT | The current patch level of the app. |
| CURRENT_INSTALLED_ON | DATETIME | The timestamp when the current version of the app was installed. |
| PREVIOUS_VERSION_STATE | VARCHAR | The state of the previous version. Possible values are COMPLETE and FINALIZING.   * `COMPLETE` indicates that upgrade is completed and that there are no active   queries being executed from the previous version, if it exists. * `FINALIZING` indicates that the instance has been upgraded from the previous version,   however one or more queries may be still be running that are using the previous version. |
| PREVIOUS_VERSION | VARCHAR | The previous version of the app. |
| PREVIOUS_PATCH | INT | The previous patch level of the app. |
| UPGRADE_STATE | VARCHAR | The version upgrade state of the app. See Application version upgrade states for more information. |
| TARGET_UPGRADE_VERSION | VARCHAR | The target version of the app that is running or pending upgrade. |
| TARGET_UPGRADE_PATCH | INT | The version patch level of the app that is running or pending upgrade. |
| UPGRADE_STARTED_ON | DATETIME | The timestamp when the app upgrade started. |
| UPGRADE_ATTEMPT | INT | The number of attempts to upgrade to the target version or patch. |
| UPGRADE_ATTEMPTED_ON | DATETIME | The timestamp when the most recent upgrade attempt was attempted. |
| UPGRADE_FAILURE_REASON | VARCHAR | A description of the failure if the previous app upgrade failed. |
| LISTING_NAME | VARCHAR | The name of the listing on the data exchange from which the app was installed. |
| LISTING_DISPLAY_NAME | VARCHAR | The display name of the listing. |
| EXCHANGE_NAME | VARCHAR | The data exchange name of the listing from which the app was installed. |
| LAST_HEALTH_STATUS | VARCHAR | The last reported health status of the app. Possible values are:   * OK * FAILED * PAUSED |
| LAST_HEALTH_STATUS_UPDATED_ON | VARCHAR | The timestamp when the health status was last reported. |
| ENABLED_TELEMETRY_EVENT_DEFINITIONS | VARCHAR | A list of event definitions that the consumer has enabled. See [About event definitions](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#about-event-sharing) for more information. |
| UPGRADE_STATE_UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the app entered its current upgrade state. This value is automatically set by Snowflake. |
| DISABLEMENT_REASONS | VARCHAR | An array containing the reasons why the Snowflake Native App was disabled. See Reasons an app can become disabled. |

## Reasons an app can become disabled

The following table lists the possible values for the DISABLEMENT_REASONS column:

| Value | Status description | Is recoverable? |
| --- | --- | --- |
| MANUALLY_DISABLED | The app is disabled by Snowflake | Yes. To re-enable the app, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_INACTIVE | The account becomes inactive by being locked or suspended causing the app to be unavailable. In this state a consumer cannot execute any SQL queries in their account and the app cannot be upgraded. | Yes. The app is automatically re-enabled if the account lock or suspension is removed |
| PACKAGE_VERSION_IS_MISSING | The application package version for the app was dropped by the provider. | Possibly. This can be caused by a temporary platform outage, in which case the app may recover automatically. Otherwise, the provider can work with Snowflake Support to attempt version recovery. Contact the application provider for more details. |
| CMK_ACCESS_DENIED | The consumer manages the encryption key themselves (ENCRYPT_USE_CMK_KMS is enabled) and Snowflake doesn’t have access to this key. | Yes. To re-enable the app, ensure that the cloud provider configuration to retrieve the CMK is correct and that Snowflake has access to the key. |
| LISTING_ACCESS_REVOKED | The listing used to create the app is no longer available. Possible reasons for this status include:   * The provider deleted the listing * The provider manually removed access to the private listing from the consumer account | Possibly. Recoverability depends on the reason why access was revoked.  For example, if the listing was deleted it is not recoverable. If a consumer account was manually removed from the private listing, access to the listing and app can be restored. |
| LISTING_TRIAL_USAGE_EXCEEDED | The application has exceeded the usage limit for a usage-based trial listing. | No |
| LISTING_PAYMENT_REQUIRED | The listing used to install the app is a paid listing and requires payment for further usage. | Yes. The consumer must correctly set up payment for the app. |
| LISTING_TRIAL_TIME_EXCEEDED | The application exceeded the trial duration. | No |
| APPLICATION_PACKAGE_NOT_AVAILABLE | The application package used to create the app no longer exists. The provider may have dropped the corresponding application package. | No |
| APPLICATION_PACKAGE_DISABLED | The application package used to create the app is disabled by the Snowflake. | Yes. The app is re-enabled, if Snowflake re-enables the application package. |
| APPLICATION_SUSPENDED | The app resources for example, tasks, services, and compute pools, are suspended due to the app being disabled.  The suspended objects remain suspended until the app is re-enabled and there are no other reasons the app was disabled. | Yes |
| APPLICATION_SUSPEND_RESUME_IN_PROGRESS | The app resources, for example tasks, services, and compute pools, are currently resuming. | Yes |

## Application version upgrade states

The following are the possible values for the UPGRADE_STATE column:

* `INSTALLING`: The application object is in the process of being created.
* `INSTALL_FAILED`: The creation of the application object failed. The application object
  remains in the `INSTALL_FAILED` state until it is dropped. See the `UPGRADE_FAILURE_REASON`
  column of the [DESCRIBE APPLICATION](../sql/desc-application.md) command for information about why the
  installation or upgrade failed.
* `COMPLETE`: The setup script successfully completed and the application object was created
  or upgraded.
* `QUEUED`: The application object is queued for upgrade.
* `UPGRADING`: The application object is in the process of being upgraded.
* `FAILED`: All upgrade attempts failed. The reason for the failure is listed in the
  `UPGRADE_FAILURE_REASON` column, if present. The instance remains in the `FAILED` state until
  a release directive is updated to point to a different version than the one that the upgrade was
  targeting, as defined in the `TARGET_UPGRADE_VERSION` column.
* `QUEUED_DELAYED`: The application object is queued for an upgrade that is scheduled for a future time.
* `QUEUED_RETRY`: The instance failed one or more upgrade attempts. The reason for the failure
  is indicated in `UPGRADE_FAILURE_REASON`: The instance is queued to perform another upgrade attempt.
* `DISABLED`: The application object and its upgrades were disabled. In this state the instance will be
  inaccessible for consumers, it will not be considered for upgrades and will not block application package
  version drop. The reason for the failure is listed in the `UPGRADE_FAILURE_REASON` column, if present.

## Usage notes

* There is no data retention for this view. If an app is uninstalled the information
  contained in this view is no longer available.

---
title: Arithmetic operators
source: https://docs.snowflake.com/en/sql-reference/operators-arithmetic.md
section: SQL General Reference
---

# Arithmetic operators

Arithmetic operators are used to generate numeric output from one or more input expressions.

The input expressions must be numeric (fixed-point or floating point), except in the following cases:

* The unary operator `+` can take a number string, but the string is implicitly converted to its corresponding numeric value.
* The binary operator `-` can be applied to DATE expressions.

## List of arithmetic operators

| Operator | Syntax | Description |
| --- | --- | --- |
| `+` (unary) | `+a` | Returns `a`, which causes implicit conversion of `a` to a numeric value. If `a` is a string, but the string can’t be converted to a numeric value, an error is returned. |
| `+` | `a + b` | Adds two numeric expressions (`a` and `b`). |
| `-` (unary) | `-a` | Negates the input numeric expression. |
| `-` | `a - b` | Subtracts one numeric expression (`b`) from another (`a`). |
| `-` | `a - b` | Subtracts one date expression (`b`) from another (`a`). The result is an integer number of days. Subtraction is the only arithmetic operation allowed on DATE expressions. |
| `*` | `a * b` | Multiplies two numeric expressions (`a` and `b`). |
| `/` | `a / b` | Divides one numeric expression (`a`) by another (`b`). For functions that return 0 when dividing by 0 or NULL, see [DIV0](functions/div0.md) and [DIV0NULL](functions/div0null.md). |
| `%` | `a % b` | Computes the modulo of numeric expression `a` per `b`. For more information, see [MOD](functions/mod.md). |

## Scale and precision in arithmetic operations

The *scale* and *precision* of the output of an arithmetic operation depends on the scale and precision of the input.

Snowflake uses calculations to preserve scale and precision in the numeric output generated by various arithmetic operations (multiplication,
division, and so on). The following descriptions are used in this section:

Leading digits:
:   Number of digits (`L`) to the left of the decimal point in a numeric value.

Scale:
:   Number of digits (`S`) to the right of the decimal point in a numeric value.

Precision:
:   Total number of digits (`P`) in a numeric value, calculated as the sum of its leading digits and scale (that is, `P = L + S`). Note that precision in Snowflake is always limited to 38.

    Also:

    * Fixed-point data types (NUMBER, DECIMAL, and so on) utilize precision and scale. For example, for the DECIMAL(8,2) data type, precision is 8,
      scale is 2, and the number of leading digits is 6.
    * Floating-point data types (FLOAT, DOUBLE, REAL, and so on) utilize 8-byte doubles.

For outputs, note that these are maximum number of digits; the actual number of digits for any given output might be less.

### Multiplication

When performing multiplication:

* The number of leading digits in the output is the sum of the leading digits in both inputs.
* Snowflake minimizes potential overflow (due to chained multiplication) by adding the number of digits in the scale of both inputs, up to a maximum threshold of 12 digits, unless either of the inputs has
  a scale larger than 12, in which case the larger input scale is used as the output scale.

In other words, assuming a multiplication operation with two inputs (`L1.S1` and `L2.S2`), the maximum number of digits in the output are calculated as follows:

> Leading digits:
> :   `L = L1 + L2`
>
> Scale:
> :   `S = min(S1 + S2, max(S1, S2, 12))`
>
> Precision:
> :   `P = L + S`

> **Note:**
>
> Snowflake performs integer multiplication for numeric values, so intermediate results might cause some overflow; however, the final output won’t overflow.

#### Examples

```sqlexample
SELECT 10.01 n1, 1.1 n2, n1 * n2;
```

```output
+-------+-----+---------+
|    N1 |  N2 | N1 * N2 |
|-------+-----+---------|
| 10.01 | 1.1 |  11.011 |
+-------+-----+---------+
```

```sqlexample
SELECT 10.001 n1, .001 n2, n1 * n2;
```

```output
+--------+-------+----------+
|     I1 |    I2 |  I1 * I2 |
|--------+-------+----------|
| 10.001 | 0.001 | 0.010001 |
+--------+-------+----------+
```

```sqlexample
SELECT .1 n1, .0000000000001 n2, n1 * n2;
```

```output
+-----+-----------------+-----------------+
|  N1 |              N2 |         N1 * N2 |
|-----+-----------------+-----------------|
| 0.1 | 0.0000000000001 | 0.0000000000000 |
+-----+-----------------+-----------------+
```

### Division

When performing division:

* The number of leading digits in the output is the sum of the leading digits of the numerator and the scale of the denominator.
* Snowflake minimizes potential overflow in the output (due to chained division) and loss of scale by adding 6 digits to the scale of the numerator, up to a maximum threshold of 12 digits, unless the
  scale of the numerator is larger than 12, in which case the numerator scale is used as the output scale.

In other words, assuming a division operation with numerator `L1.S1` and denominator `L2.S2`, the maximum number of digits in the output are calculated as follows:

> Leading digits:
> :   `L = L1 + S2`
>
> Scale:
> :   `S = max(S1, min(S1 + 6, 12))`
>
> Precision:
> :   `P = L + S`

If the result of the division operation exceeds the output scale, Snowflake rounds the output (rather than truncating the output).

> **Note:**
>
> Similar to multiplication, intermediate division results might cause some overflow; however, the final output won’t overflow.

#### Examples

```sqlexample
SELECT 2 n1, 7 n2, n1 / n2;
```

```output
+----+----+----------+
| N1 | N2 |  N1 / N2 |
|----+----+----------|
|  2 |  7 | 0.285714 |
+----+----+----------+
```

```sqlexample
SELECT 10.1 n1, 2.1 n2, n1 / n2;
```

```output
+------+-----+-----------+
|   N1 |  N2 |   N1 / N2 |
|------+-----+-----------|
| 10.1 | 2.1 | 4.8095238 |
+------+-----+-----------+
```

```sqlexample
SELECT 10.001 n1, .001 n2, n1 / n2;
```

```output
+--------+-------+-----------------+
|     N1 |    N2 |         N1 / N2 |
|--------+-------+-----------------|
| 10.001 | 0.001 | 10001.000000000 |
+--------+-------+-----------------+
```

```sqlexample
SELECT .1 n1, .0000000000001 n2, n1 / n2;
```

```output
+-----+-----------------+-----------------------+
|  N1 |              N2 |               N1 / N2 |
|-----+-----------------+-----------------------|
| 0.1 | 0.0000000000001 | 1000000000000.0000000 |
+-----+-----------------+-----------------------+
```

### Addition and subtraction

For addition or subtraction:

* The number of leading digits in the output is the largest number of leading digits of the inputs plus 1 (to preserve carried values).
* The scale for the output is the largest scale of the inputs.

In other words, assuming an addition or subtraction operation has two inputs (`L1.S1` and `L2.S2`), the maximum number of digits in the output are calculated as follows:

> Leading digits:
> :   `L = max(L1, L2) + 1`
>
> Scale:
> :   `S = max(S1, S2)`
>
> Precision:
> :   `P = L + S`

### Other N-ary operations

For all other arithmetic operations with more than one numeric input, such as modulo (`a % b` or [MOD](functions/mod.md)):

* The number of leading digits in the output is the largest number of leading digits of the inputs.
* The scale for the output is the largest scale of the inputs.

In other words, assuming an n-ary operation with inputs `L1.S1`, `L2.S2`, etc., the maximum number of digits in the output are calculated as follows:

> Leading digits:
> :   `L = max(L1, L2, ...)`
>
> Scale:
> :   `S = max(S1, S2, ...)`
>
> Precision:
> :   `P = L + S`

### Unary operations

Unary arithmetic operations have the same output precision and scale as the input precision and scale, except for [ROUND](functions/round.md), which allows explicitly specifying the output
scale.

### Bitwise operations

The list of supported bitwise arithmetic operations is available at [Conditional expression functions](expressions-conditional.md).

> **Note:**
>
> * For numeric values, bitwise operations only operate on the leading digits in the input. The output always has a scale of zero.
> * For binary bitwise operations, the output has the same number of leading digits as the maximum leading digits in the input.

---
title: ASOF JOIN
source: https://docs.snowflake.com/en/sql-reference/constructs/asof-join.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# ASOF JOIN

An ASOF JOIN operation combines rows from two tables based on timestamp values that follow each
other, precede each other, or match exactly. For each row in the first (or left) table, the join finds a single
row in the second (or right) table that has the closest timestamp value. The qualifying row on the right side
is the closest match, which could be equal in time, earlier in time, or later in time, depending on the specified
comparison operator.

This topic describes how to use the ASOF JOIN construct in the [FROM](from.md) clause. For a more detailed conceptual
explanation of ASOF joins, see [Analyzing time-series data](../../user-guide/querying-time-series-data.md).

See also [JOIN](join.md), which covers the syntax for other standard join types, such as
inner and outer joins.

## Syntax

The following FROM clause syntax is specific to ASOF JOIN:

```sqlsyntax
FROM <left_table> ASOF JOIN <right_table>
  MATCH_CONDITION ( <left_table.timecol> <comparison_operator> <right_table.timecol> )
  [ ON <table.col> = <table.col> [ AND ... ] | USING ( <column_list> ) ]
```

## Parameters

`FROM`
:   The first (or left) table in the FROM clause is assumed to contain records that either follow (in time),
    precede, or are exactly synchronized with, the records in the second (or right) table. When there is no
    match for a row in the left table, the columns from the right table are null-padded.

    In addition to regular tables and views, any object reference can be used in an ASOF JOIN.
    See [FROM](from.md).

    ASOF JOIN can be used in most contexts where joins are supported. For information about some restrictions, see Usage Notes.

`MATCH_CONDITION ( left_table.timecol comparison_operator right_table.timecol )`
:   This condition names the specific timestamp columns to be compared in each table.

    * The order of tables is important in the condition. The left table must be named first.
    * The parentheses are required.
    * The comparison operator must be one of the following: `>=`, `<=`, `>`, `<`. The equals operator (`=`) is
      not supported.
    * All of the following data types are supported: DATE, TIME, DATETIME, TIMESTAMP, TIMESTAMP_LTZ, TIMESTAMP_NTZ, TIMESTAMP_TZ.
    * You can also use NUMBER columns in the match condition. For example, you might have NUMBER columns that contain UNIX
      timestamps (which define the number of seconds that have elapsed since January 1st, 1970).
    * The data types of the two matched columns don’t have to be exactly the same, but they must be
      [compatible](../intro-summary-data-types.md).

`ON table.col = table.col [ AND ... ]  | USING (column_list)`
:   The optional ON or USING clause defines one or more equality conditions on columns in the two tables, for the purpose of
    logically grouping the results of the query.

    For general information about ON and USING, see [JOIN](join.md). Note that a join specified with USING
    projects one of the joining columns in its intermediate result set, not both. A join specified with an ON clause projects both
    joining columns.

    The following notes are specific to ASOF JOIN:

    * The comparison operator in the ON clause must be the equal sign (=).
    * The ON clause cannot contain disjuncts (conditions connected with OR). Conditions connected with AND are supported.
    * Each side of a condition must refer to only one of the two tables in the join. However, the order of the table references doesn’t matter.
    * Each condition can be enclosed in parentheses, but they aren’t required.

See also More Details on Join Behavior and Specifying a USING condition instead of an ON condition.

## Usage notes

* If no match is found in the right table for a given row, the result is null-padded for the selected columns from the right table. (ASOF joins are similar to left outer joins in this respect.)
* If you use TIME columns in the match condition (as opposed to one of the [timestamp types](../data-types-datetime.md)), you might need to set the TIME_OUTPUT_FORMAT parameter in order to see the exact TIME values that are being compared when you look at ASOF JOIN query results. By default, the display of a TIME column truncates milliseconds. See TIME columns in the match condition.
* You can use more than one ASOF join in the same query as long as all of the syntax rules are followed for each join. Each join must be immediately followed by its own MATCH_CONDITION. You cannot apply a single MATCH_CONDITION to multiple ASOF joins. See Multiple ASOF joins in a query.
* ASOF joins are not supported for joins with LATERAL table functions or LATERAL inline views. For more information about lateral joins, see [LATERAL](join-lateral.md).
* An ASOF join with a self-reference is not allowed in a RECURSIVE common table expression (CTE). For information about CTEs, see [WITH](with.md).
* The EXPLAIN output for ASOF JOIN queries identifies the ON (or USING) conditions and the MATCH_CONDITION. For example, in text or tabular format, output similar to the following text appears above the table scans in the plan:

  ```output
  ->ASOF Join  joinKey: (S.LOCATION = R.LOCATION) AND (S.STATE = R.STATE),
    matchCondition: (S.OBSERVED >= R.OBSERVED)
  ```
* [Query profiles](../../user-guide/ui-snowsight-activity.md) also clearly identify the ASOF JOIN operation in the plan. In this example, you can see that the table scan reads 22M rows from the left table, which are all preserved by the join. The profile also shows the match condition for the join.

* You can specify the ASOF keyword in a [semantic view](../../user-guide/views-semantic/overview.md) to perform the ASOF JOIN
  operation on two logical tables in the view. For information, see [Using a date, time, timestamp, or numeric range to join logical tables](../../user-guide/views-semantic/sql.md).

## More details on join behavior

The optional ON (or USING) conditions for ASOF JOIN provide a way of grouping or partitioning table rows before the final matching rows
are singled out by the required match condition. If you want the rows from the joined tables to be grouped on one or more dimensions
that the tables share (stock symbol, location, city, state, company name, etc.), use an ON condition.
If you don’t use an ON condition, each row from the left table may be matched (by time) with any row from the right table
in the final result set.

In the following example, tables `left_table` and `right_table` have values `A`, `B`, etc.
in column `c1`, and values `1`, `2`, etc. in column `c2`. Column `c3` is a TIME column, and `c4` is a numeric value (column of interest).

First, create and load the two tables:

```sqlexample
CREATE OR REPLACE TABLE left_table (
  c1 VARCHAR(1),
  c2 TINYINT,
  c3 TIME,
  c4 NUMBER(3,2)
);

CREATE OR REPLACE TABLE right_table (
  c1 VARCHAR(1),
  c2 TINYINT,
  c3 TIME,
  c4 NUMBER(3,2)
);

INSERT INTO left_table VALUES
  ('A',1,'09:15:00',3.21),
  ('A',2,'09:16:00',3.22),
  ('B',1,'09:17:00',3.23),
  ('B',2,'09:18:00',4.23);

INSERT INTO right_table VALUES
  ('A',1,'09:14:00',3.19),
  ('B',1,'09:16:00',3.04);
```

```sqlexample
SELECT * FROM left_table ORDER BY c1, c2;
```

```output
+----+----+----------+------+
| C1 | C2 | C3       |   C4 |
|----+----+----------+------|
| A  |  1 | 09:15:00 | 3.21 |
| A  |  2 | 09:16:00 | 3.22 |
| B  |  1 | 09:17:00 | 3.23 |
| B  |  2 | 09:18:00 | 4.23 |
+----+----+----------+------+
```

```sqlexample
SELECT * FROM right_table ORDER BY c1, c2;
```

```output
+----+----+----------+------+
| C1 | C2 | C3       |   C4 |
|----+----+----------+------|
| A  |  1 | 09:14:00 | 3.19 |
| B  |  1 | 09:16:00 | 3.04 |
+----+----+----------+------+
```

If `c1` and `c2` are both ON condition columns in the query, a row in the left table only matches a row in the right table
when `A` and `1`, `A` and `2`, `B` and `1`, or `B` and `2` are found in both tables.
If no match is found for such values, the right table columns are null-padded.

```sqlexample
SELECT *
  FROM left_table l ASOF JOIN right_table r
    MATCH_CONDITION(l.c3>=r.c3)
    ON(l.c1=r.c1 and l.c2=r.c2)
  ORDER BY l.c1, l.c2;
```

```output
+----+----+----------+------+------+------+----------+------+
| C1 | C2 | C3       |   C4 | C1   | C2   | C3       |   C4 |
|----+----+----------+------+------+------+----------+------|
| A  |  1 | 09:15:00 | 3.21 | A    |  1   | 09:14:00 | 3.19 |
| A  |  2 | 09:16:00 | 3.22 | NULL | NULL | NULL     | NULL |
| B  |  1 | 09:17:00 | 3.23 | B    |  1   | 09:16:00 | 3.04 |
| B  |  2 | 09:18:00 | 4.23 | NULL | NULL | NULL     | NULL |
+----+----+----------+------+------+------+----------+------+
```

If the ON conditions are removed, any combination of values in `c1` and `c2` may be matched in the final result.
Only the match condition determines the results.

```sqlexample
SELECT *
  FROM left_table l ASOF JOIN right_table r
    MATCH_CONDITION(l.c3>=r.c3)
  ORDER BY l.c1, l.c2;
```

```output
+----+----+----------+------+----+----+----------+------+
| C1 | C2 | C3       |   C4 | C1 | C2 | C3       |   C4 |
|----+----+----------+------+----+----+----------+------|
| A  |  1 | 09:15:00 | 3.21 | A  |  1 | 09:14:00 | 3.19 |
| A  |  2 | 09:16:00 | 3.22 | B  |  1 | 09:16:00 | 3.04 |
| B  |  1 | 09:17:00 | 3.23 | B  |  1 | 09:16:00 | 3.04 |
| B  |  2 | 09:18:00 | 4.23 | B  |  1 | 09:16:00 | 3.04 |
+----+----+----------+------+----+----+----------+------+
```

## Expected behavior when “ties” exist in the right table

ASOF JOIN queries always attempt to match a single row in the left table with a single row in the right table.
This behavior is true even if two (or more) rows in the right table are identical and qualify for the join. When
such ties exist and you run the same join query multiple times, you might get different results. The results are
non-deterministic because any one of the tying rows might be returned. If you’re unsure about the results of ASOF JOIN
queries, check for exact matches in the timestamp values for rows in the right table.

For example, using the same tables from the examples in the previous section, add a `right_id` column to `right_table`
and insert the following rows:

```sqlexample
CREATE OR REPLACE TABLE right_table
  (c1 VARCHAR(1),
  c2 TINYINT,
  c3 TIME,
  c4 NUMBER(3,2),
  right_id VARCHAR(2));

INSERT INTO right_table VALUES
  ('A',1,'09:14:00',3.19,'A1'),
  ('A',1,'09:14:00',3.19,'A2'),
  ('B',1,'09:16:00',3.04,'B1');

SELECT * FROM right_table ORDER BY 1, 2;
```

```output
+----+----+----------+------+----------+
| C1 | C2 | C3       |   C4 | RIGHT_ID |
|----+----+----------+------+----------|
| A  |  1 | 09:14:00 | 3.19 | A1       |
| A  |  1 | 09:14:00 | 3.19 | A2       |
| B  |  1 | 09:16:00 | 3.04 | B1       |
+----+----+----------+------+----------+
```

Two of the rows are identical except for their `right_id` values. Now run the following ASOF JOIN query:

```sqlexample
SELECT *
  FROM left_table l ASOF JOIN right_table r
    MATCH_CONDITION(l.c3>=r.c3)
  ORDER BY l.c1, l.c2;
```

```output
+----+----+----------+------+----+----+----------+------+----------+
| C1 | C2 | C3       |   C4 | C1 | C2 | C3       |   C4 | RIGHT_ID |
|----+----+----------+------+----+----+----------+------+----------|
| A  |  1 | 09:15:00 | 3.21 | A  |  1 | 09:14:00 | 3.19 | A2       |
| A  |  2 | 09:16:00 | 3.22 | B  |  1 | 09:16:00 | 3.04 | B1       |
| B  |  1 | 09:17:00 | 3.23 | B  |  1 | 09:16:00 | 3.04 | B1       |
| B  |  2 | 09:18:00 | 4.23 | B  |  1 | 09:16:00 | 3.04 | B1       |
+----+----+----------+------+----+----+----------+------+----------+
```

Note that rows `A1` and `A2` from `right_table` both qualify for the join, but only `A2` is returned. On a
subsequent run of the same query, `A1` could be returned instead.

## Rewriting ASOF JOIN queries to reduce scans on the right table

When the cardinality of the ON or USING join column in the left table is lower than the cardinality of the
join column in the right table, the optimizer does not [prune](../../user-guide/tables-clustering-micropartitions.md)
the unmatched rows from the right table. Therefore, more rows than are needed for the join will be scanned
from the right table. This behavior typically occurs when the query includes a highly selective filter on a
non-join column from the left table, and the filter reduces the cardinality of the join column.

You can work around this problem by manually reducing the rows that qualify for the join. For example, the
original query has the following pattern, and `t1.c1` has lower cardinality than `t2.c1`:

```sqlexample
SELECT ...
  FROM t1
    ASOF JOIN t2
      MATCH_CONDITION(...)
      ON t1.c1 = t2.c1
  WHERE t1 ...;
```

You can rewrite the query as follows to manually select the rows from `t2` where `t2.c1` values are
found in `t1.c1`:

```sqlexample
WITH t1 AS (SELECT * FROM t1 WHERE t1 ...)
SELECT ...
  FROM t1
    ASOF JOIN (SELECT * FROM t2 WHERE t2.c1 IN (SELECT t1.c1 FROM t1)) AS t2
      MATCH_CONDITION(...)
      ON t1.c1 = t2.c1;
```

## Using ASOF and MATCH_CONDITION as object names and aliases

Use of the ASOF and MATCH_CONDITION keywords in SELECT command syntax is restricted:

* If a SELECT statement uses ASOF or MATCH_CONDITION as the name of a table, view, or inline view, you must identify it
  as follows:

  + If the object was created with double quotes in the name, use the same double-quoted name.
  + If the object was created without double quotes in the name, use double quotes and capital letters.

  For example, the following statements are no longer allowed and return errors:

  ```sqlexample
  SELECT * FROM asof;

  WITH match_condition AS (SELECT * FROM T1) SELECT * FROM match_condition;
  ```

  If you created the objects with double quotes, fix the problem by using double quotes:

  ```sqlexample
  SELECT * FROM "asof";

  WITH "match_condition" AS (SELECT * FROM T1) SELECT * FROM "match_condition";
  ```

  If you created the objects without double quotes, fix the problem by using double quotes and capital letters:

  ```sqlexample
  SELECT * FROM "ASOF";

  WITH "MATCH_CONDITION" AS (SELECT * FROM T1) SELECT * FROM "MATCH_CONDITION";
  ```

  See also [Unquoted identifiers](../identifiers-syntax.md).
* If a SELECT statement uses ASOF or MATCH_CONDITION as an alias, you must use AS before the alias or double-quote the
  alias. For example, the following statements are no longer allowed and return errors:

  ```sqlexample
  SELECT * FROM t1 asof;

  SELECT * FROM t2 match_condition;
  ```

  Fix the problem in one of the following ways:

  ```sqlexample
  SELECT * FROM t1 AS asof;

  SELECT * FROM t1 "asof";

  SELECT * FROM t2 AS match_condition;

  SELECT * FROM t2 "match_condition";
  ```

## Examples

The following examples demonstrate the expected behavior of ASOF JOIN queries.
Start by running the query under [Joining two tables on the closest match (alignment)](../../user-guide/querying-time-series-data.md), then proceed with
the examples here.

### NULL-padded results

Insert a new row into the `trades` table with a date that’s a day earlier than the existing rows in both
`trades` and `quotes`:

```sqlexample
INSERT INTO trades VALUES('SNOW','2023-09-30 12:02:55.000',3000);
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       1 |
+-------------------------+
```

Now run the first example query again. Note that the query returns four rows, but the new row is null-padded.
There is no row in the `quotes` table that qualifies for the match condition.
The columns from `trades` are returned, and the corresponding columns from `quotes` are null-padded.

```sqlexample
SELECT t.stock_symbol, t.trade_time, t.quantity, q.quote_time, q.price
  FROM trades t ASOF JOIN quotes q
    MATCH_CONDITION(t.trade_time >= quote_time)
    ON t.stock_symbol=q.stock_symbol
  ORDER BY t.stock_symbol;
```

```output
+--------------+-------------------------+----------+-------------------------+--------------+
| STOCK_SYMBOL | TRADE_TIME              | QUANTITY | QUOTE_TIME              |        PRICE |
|--------------+-------------------------+----------+-------------------------+--------------|
| AAPL         | 2023-10-01 09:00:05.000 |     2000 | 2023-10-01 09:00:03.000 | 139.00000000 |
| SNOW         | 2023-09-30 12:02:55.000 |     3000 | NULL                    |         NULL |
| SNOW         | 2023-10-01 09:00:05.000 |     1000 | 2023-10-01 09:00:02.000 | 163.00000000 |
| SNOW         | 2023-10-01 09:00:10.000 |     1500 | 2023-10-01 09:00:08.000 | 165.00000000 |
+--------------+-------------------------+----------+-------------------------+--------------+
```

### Using a different comparison operator in the match condition

Following on from the previous example, the results of the query change again when the comparison operator in the
match condition is changed. The following query specifies the `<=` operator (instead of `>=`):

```sqlexample
SELECT t.stock_symbol, t.trade_time, t.quantity, q.quote_time, q.price
  FROM trades t ASOF JOIN quotes q
    MATCH_CONDITION(t.trade_time <= quote_time)
    ON t.stock_symbol=q.stock_symbol
  ORDER BY t.stock_symbol;
```

```output
+--------------+-------------------------+----------+-------------------------+--------------+
| STOCK_SYMBOL | TRADE_TIME              | QUANTITY | QUOTE_TIME              |        PRICE |
|--------------+-------------------------+----------+-------------------------+--------------|
| AAPL         | 2023-10-01 09:00:05.000 |     2000 | 2023-10-01 09:00:07.000 | 142.00000000 |
| SNOW         | 2023-10-01 09:00:10.000 |     1500 | NULL                    |         NULL |
| SNOW         | 2023-10-01 09:00:05.000 |     1000 | 2023-10-01 09:00:07.000 | 166.00000000 |
| SNOW         | 2023-09-30 12:02:55.000 |     3000 | 2023-10-01 09:00:01.000 | 166.00000000 |
+--------------+-------------------------+----------+-------------------------+--------------+
```

See also Less than and greater than comparison operators.

### Specifying a USING condition instead of an ON condition

You can use an ON condition or a USING condition with ASOF JOIN queries. The following query is equivalent to the
previous query, but it replaces ON with USING. The syntax `USING(stock_symbol)` implies the condition
`t.stock_symbol=q.stock_symbol`.

```sqlexample
SELECT t.stock_symbol, t.trade_time, t.quantity, q.quote_time, q.price
  FROM trades t ASOF JOIN quotes q
    MATCH_CONDITION(t.trade_time <= quote_time)
    USING(stock_symbol)
  ORDER BY t.stock_symbol;
```

### Inner join to a third table

The following example adds a third `companies` table to the join in order to pick the company name for each stock symbol.
You can use a regular INNER JOIN with an ON condition (or some other standard join syntax) to add the third table.
However, note that `USING(stock_symbol)` would not work here because the reference to `stock_symbol` would be ambiguous.

```sqlexample
CREATE OR REPLACE TABLE companies(
  stock_symbol VARCHAR(4),
  company_name VARCHAR(100)
);

 INSERT INTO companies VALUES
  ('NVDA','NVIDIA Corp'),
  ('TSLA','Tesla Inc'),
  ('SNOW','Snowflake Inc'),
  ('AAPL','Apple Inc')
;
```

```sqlexample
SELECT t.stock_symbol, c.company_name, t.trade_time, t.quantity, q.quote_time, q.price
  FROM trades t ASOF JOIN quotes q
    MATCH_CONDITION(t.trade_time >= quote_time)
    ON t.stock_symbol=q.stock_symbol
    INNER JOIN companies c ON c.stock_symbol=t.stock_symbol
  ORDER BY t.stock_symbol;
```

```output
+--------------+---------------+-------------------------+----------+-------------------------+--------------+
| STOCK_SYMBOL | COMPANY_NAME  | TRADE_TIME              | QUANTITY | QUOTE_TIME              |        PRICE |
|--------------+---------------+-------------------------+----------+-------------------------+--------------|
| AAPL         | Apple Inc     | 2023-10-01 09:00:05.000 |     2000 | 2023-10-01 09:00:03.000 | 139.00000000 |
| SNOW         | Snowflake Inc | 2023-09-30 12:02:55.000 |     3000 | NULL                    |         NULL |
| SNOW         | Snowflake Inc | 2023-10-01 09:00:05.000 |     1000 | 2023-10-01 09:00:02.000 | 163.00000000 |
| SNOW         | Snowflake Inc | 2023-10-01 09:00:10.000 |     1500 | 2023-10-01 09:00:08.000 | 165.00000000 |
+--------------+---------------+-------------------------+----------+-------------------------+--------------+
```

### Numbers as timestamps

The following example demonstrates that the match condition can compare numeric values.
In this case, the tables have UNIX timestamp values stored in NUMBER(38,0) columns. `1696150805`
is equivalent to `2023-10-30 10:20:05.000` (three seconds later than `1696150802`).

```sqlexample
SELECT * FROM trades_unixtime;
```

```output
+--------------+------------+----------+--------------+
| STOCK_SYMBOL | TRADE_TIME | QUANTITY |        PRICE |
|--------------+------------+----------+--------------|
| SNOW         | 1696150805 |      100 | 165.33300000 |
+--------------+------------+----------+--------------+
```

```sqlexample
SELECT * FROM quotes_unixtime;
```

```output
+--------------+------------+----------+--------------+--------------+
| STOCK_SYMBOL | QUOTE_TIME | QUANTITY |          BID |          ASK |
|--------------+------------+----------+--------------+--------------|
| SNOW         | 1696150802 |      100 | 166.00000000 | 165.00000000 |
+--------------+------------+----------+--------------+--------------+
```

```sqlexample
SELECT *
  FROM trades_unixtime tu
    ASOF JOIN quotes_unixtime qu
    MATCH_CONDITION(tu.trade_time>=qu.quote_time);
```

```output
+--------------+------------+----------+--------------+--------------+------------+----------+--------------+--------------+
| STOCK_SYMBOL | TRADE_TIME | QUANTITY |        PRICE | STOCK_SYMBOL | QUOTE_TIME | QUANTITY |          BID |          ASK |
|--------------+------------+----------+--------------+--------------+------------+----------+--------------+--------------|
| SNOW         | 1696150805 |      100 | 165.33300000 | SNOW         | 1696150802 |      100 | 166.00000000 | 165.00000000 |
+--------------+------------+----------+--------------+--------------+------------+----------+--------------+--------------+
```

### TIME columns in the match condition

The following examples join tables that contain weather observations. The observations in these tables are recorded in TIME columns.
You can create and load the tables as follows:

```sqlexample
CREATE OR REPLACE TABLE raintime(
  observed TIME(9),
  location VARCHAR(40),
  state VARCHAR(2),
  observation NUMBER(5,2)
);

INSERT INTO raintime VALUES
  ('14:42:59.230', 'Ahwahnee', 'CA', 0.90),
  ('14:42:59.001', 'Oakhurst', 'CA', 0.50),
  ('14:42:44.435', 'Reno', 'NV', 0.00)
;

CREATE OR REPLACE TABLE preciptime(
  observed TIME(9),
  location VARCHAR(40),
  state VARCHAR(2),
  observation NUMBER(5,2)
);

INSERT INTO preciptime VALUES
  ('14:42:59.230', 'Ahwahnee', 'CA', 0.91),
  ('14:42:59.001', 'Oakhurst', 'CA', 0.51),
  ('14:41:44.435', 'Las Vegas', 'NV', 0.01),
  ('14:42:44.435', 'Reno', 'NV', 0.01),
  ('14:40:34.000', 'Bozeman', 'MT', 1.11)
;

CREATE OR REPLACE TABLE snowtime(
  observed TIME(9),
  location VARCHAR(40),
  state VARCHAR(2),
  observation NUMBER(5,2)
);

INSERT INTO snowtime VALUES
  ('14:42:59.199', 'Fish Camp', 'CA', 3.20),
  ('14:42:44.435', 'Reno', 'NV', 3.00),
  ('14:43:01.000', 'Lake Tahoe', 'CA', 4.20),
  ('14:42:45.000', 'Bozeman', 'MT', 1.80)
;
```

When you run the first query, some of the TIME values appear to be exactly the same in the result set (`14:42:59`, `14:42:44`).

```sqlexample
SELECT * FROM preciptime p ASOF JOIN snowtime s MATCH_CONDITION(p.observed>=s.observed)
  ORDER BY p.observed;
```

```output
+----------+-----------+-------+-------------+----------+-----------+-------+-------------+
| OBSERVED | LOCATION  | STATE | OBSERVATION | OBSERVED | LOCATION  | STATE | OBSERVATION |
|----------+-----------+-------+-------------+----------+-----------+-------+-------------|
| 14:40:34 | Bozeman   | MT    |        1.11 | NULL     | NULL      | NULL  |        NULL |
| 14:41:44 | Las Vegas | NV    |        0.01 | NULL     | NULL      | NULL  |        NULL |
| 14:42:44 | Reno      | NV    |        0.01 | 14:42:44 | Reno      | NV    |        3.00 |
| 14:42:59 | Oakhurst  | CA    |        0.51 | 14:42:45 | Bozeman   | MT    |        1.80 |
| 14:42:59 | Ahwahnee  | CA    |        0.91 | 14:42:59 | Fish Camp | CA    |        3.20 |
+----------+-----------+-------+-------------+----------+-----------+-------+-------------+
```

To return a more precise display of TIME values, including milliseconds, run the following [ALTER SESSION](../sql/alter-session.md) command,
then run the ASOF JOIN query again:

```sqlexample
ALTER SESSION SET TIME_OUTPUT_FORMAT = 'HH24:MI:SS.FF3';
```

```output
+----------------------------------+
| status                           |
|----------------------------------|
| Statement executed successfully. |
+----------------------------------+
```

```sqlexample
SELECT * FROM preciptime p ASOF JOIN snowtime s MATCH_CONDITION(p.observed>=s.observed)
  ORDER BY p.observed;
```

```output
+--------------+-----------+-------+-------------+--------------+-----------+-------+-------------+
| OBSERVED     | LOCATION  | STATE | OBSERVATION | OBSERVED     | LOCATION  | STATE | OBSERVATION |
|--------------+-----------+-------+-------------+--------------+-----------+-------+-------------|
| 14:40:34.000 | Bozeman   | MT    |        1.11 | NULL         | NULL      | NULL  |        NULL |
| 14:41:44.435 | Las Vegas | NV    |        0.01 | NULL         | NULL      | NULL  |        NULL |
| 14:42:44.435 | Reno      | NV    |        0.01 | 14:42:44.435 | Reno      | NV    |        3.00 |
| 14:42:59.001 | Oakhurst  | CA    |        0.51 | 14:42:45.000 | Bozeman   | MT    |        1.80 |
| 14:42:59.230 | Ahwahnee  | CA    |        0.91 | 14:42:59.199 | Fish Camp | CA    |        3.20 |
+--------------+-----------+-------+-------------+--------------+-----------+-------+-------------+
```

### Multiple ASOF joins in a query

The following example shows how to connect a sequence of two or more ASOF joins in a single query block.
The three tables (`snowtime`, `raintime`, `preciptime`) all contain weather observations that were recorded in
specific locations at specific times. The column of interest is the `observation` column. The rows are logically grouped by state.

```sqlexample
ALTER SESSION SET TIME_OUTPUT_FORMAT = 'HH24:MI:SS.FF3';

SELECT *
  FROM snowtime s
    ASOF JOIN raintime r
      MATCH_CONDITION(s.observed>=r.observed)
      ON s.state=r.state
    ASOF JOIN preciptime p
      MATCH_CONDITION(s.observed>=p.observed)
      ON s.state=p.state
  ORDER BY s.observed;
```

```output
+--------------+------------+-------+-------------+--------------+----------+-------+-------------+--------------+----------+-------+-------------+
| OBSERVED     | LOCATION   | STATE | OBSERVATION | OBSERVED     | LOCATION | STATE | OBSERVATION | OBSERVED     | LOCATION | STATE | OBSERVATION |
|--------------+------------+-------+-------------+--------------+----------+-------+-------------+--------------+----------+-------+-------------|
| 14:42:44.435 | Reno       | NV    |        3.00 | 14:42:44.435 | Reno     | NV    |        0.00 | 14:42:44.435 | Reno     | NV    |        0.01 |
| 14:42:45.000 | Bozeman    | MT    |        1.80 | NULL         | NULL     | NULL  |        NULL | 14:40:34.000 | Bozeman  | MT    |        1.11 |
| 14:42:59.199 | Fish Camp  | CA    |        3.20 | 14:42:59.001 | Oakhurst | CA    |        0.50 | 14:42:59.001 | Oakhurst | CA    |        0.51 |
| 14:43:01.000 | Lake Tahoe | CA    |        4.20 | 14:42:59.230 | Ahwahnee | CA    |        0.90 | 14:42:59.230 | Ahwahnee | CA    |        0.91 |
+--------------+------------+-------+-------------+--------------+----------+-------+-------------+--------------+----------+-------+-------------+
```

### Less than and greater than comparison operators

Following on from the previous example, two ASOF joins are specified, but this time the first match condition uses the `>`
operator and the second uses the `<` operator. The result is a single row that returns data from all three tables, and three rows
that return data from two of the tables. Many of the columns in the result set are null-padded.

Logically, the query finds only one row where the observed time from the `snowtime` table was later than the observed time from the
`raintime` table but earlier than the observed time from the `preciptime` table.

```sqlexample
SELECT *
  FROM snowtime s
    ASOF JOIN raintime r
      MATCH_CONDITION(s.observed>r.observed)
      ON s.state=r.state
    ASOF JOIN preciptime p
      MATCH_CONDITION(s.observed<p.observed)
      ON s.state=p.state
  ORDER BY s.observed;
```

```output
+--------------+------------+-------+-------------+--------------+-----------+-------+-------------+--------------+----------+-------+-------------+
| OBSERVED     | LOCATION   | STATE | OBSERVATION | OBSERVED     | LOCATION  | STATE | OBSERVATION | OBSERVED     | LOCATION | STATE | OBSERVATION |
|--------------+------------+-------+-------------+--------------+-----------+-------+-------------+--------------+----------+-------+-------------|
| 14:42:44.435 | Reno       | NV    |        3.00 | 14:41:44.435 | Las Vegas | NV    |        0.00 | NULL         | NULL     | NULL  |        NULL |
| 14:42:45.000 | Bozeman    | MT    |        1.80 | NULL         | NULL      | NULL  |        NULL | NULL         | NULL     | NULL  |        NULL |
| 14:42:59.199 | Fish Camp  | CA    |        3.20 | 14:42:59.001 | Oakhurst  | CA    |        0.50 | 14:42:59.230 | Ahwahnee | CA    |        0.91 |
| 14:43:01.000 | Lake Tahoe | CA    |        4.20 | 14:42:59.230 | Ahwahnee  | CA    |        0.90 | NULL         | NULL     | NULL  |        NULL |
+--------------+------------+-------+-------------+--------------+-----------+-------+-------------+--------------+----------+-------+-------------+
```

### Examples of expected error cases

The following examples show queries that return expected syntax errors.

Having declared that `snowtime s` is the left table, you cannot begin the match condition with a reference to the right table, `preciptime p`:

```sqlexample
SELECT * FROM snowtime s ASOF JOIN preciptime p MATCH_CONDITION(p.observed>=s.observed);
```

```output
010002 (42601): SQL compilation error:
MATCH_CONDITION clause is invalid: The left side allows only column references from the left side table, and the right side allows only column references from the right side table.
```

Only the `>=`, `<=`, `>`, and `<` operators are allowed in match conditions:

```sqlexample
SELECT * FROM preciptime p ASOF JOIN snowtime s MATCH_CONDITION(p.observed=s.observed);
```

```output
010001 (42601): SQL compilation error:
MATCH_CONDITION clause is invalid: Only comparison operators '>=', '>', '<=' and '<' are allowed. Keywords such as AND and OR are not allowed.
```

The ON clause for ASOF JOIN must contain equality conditions:

```sqlexample
SELECT *
  FROM preciptime p ASOF JOIN snowtime s
  MATCH_CONDITION(p.observed>=s.observed)
  ON s.state>=p.state;
```

```output
010010 (42601): SQL compilation error:
ON clause for ASOF JOIN must contain conjunctions of equality conditions only. Disjunctions are not allowed. Each side of an equality condition must only refer to either the left table or the right table. S.STATE >= P.STATE is invalid.
```

An ON clause equality condition cannot contain disjunctions:

```sqlexample
SELECT *
  FROM preciptime p ASOF JOIN snowtime s
  MATCH_CONDITION(p.observed>=s.observed)
  ON s.state=p.state OR s.location=p.location;
```

```output
010010 (42601): SQL compilation error:
ON clause for ASOF JOIN must contain conjunctions of equality conditions only. Disjunctions are not allowed. Each side of an equality condition must only refer to either the left table or the right table. (S.STATE = P.STATE) OR (S.LOCATION = P.LOCATION) is invalid.
```

ASOF joins cannot be used with LATERAL inline views:

```sqlexample
SELECT t1.a "t1a", t2.a "t2a"
  FROM t1 ASOF JOIN
    LATERAL(SELECT a FROM t2 WHERE t1.b = t2.b) t2
    MATCH_CONDITION(t1.a >= t2.a)
  ORDER BY 1,2;
```

```output
010004 (42601): SQL compilation error:
ASOF JOIN is not supported for joins with LATERAL table functions or LATERAL views.
```

---
title: ASSOCIATE_SEMANTIC_CATEGORY_TAGS
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/associate_semantic_category_tags.md
section: SQL General Reference
---

# ASSOCIATE_SEMANTIC_CATEGORY_TAGS

> **Note:**
>
> ASSOCIATE_SEMANTIC_CATEGORY_TAGS is a legacy stored procedure. Snowflake recommends using other methods of
> implementing [sensitive data classification](../../user-guide/classify-intro.md).

Takes the results of the EXTRACT_SEMANTIC_CATEGORIES function on a table/view and applies the results as tags on the supported columns
in the table/view.

Before calling this stored procedure, you should first execute the EXTRACT_SEMANTIC_CATEGORIES function on the table/view and determine
whether you are satisfied with the results generated by the classification algorithm.

## Syntax

```sqlsyntax
ASSOCIATE_SEMANTIC_CATEGORY_TAGS( '<object_name>' , <category_extraction_result> )
```

## Arguments

`object_name`
:   The name of the table, external table, view, or materialized view containing the columns to be classified. If a database and schema are
    not in use in the current session, the name must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

`category_extraction_result`
:   The result from executing the EXTRACT_SEMANTIC_CATEGORIES function on the same table/view.

## Usage notes

* Globally-defined stored procedures utilize caller’s rights. For more details, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* The function applies the Classification system tags from the top level of the classification results returned by the
  [EXTRACT_SEMANTIC_CATEGORIES](../functions/extract_semantic_categories.md) function. Alternate values are not applied.

  If you want to apply alternate values:

  + You can store the classification results in a table and edit the results before applying them or
  + Apply the values manually using [ALTER TABLE … MODIFY COLUMN … SET TAG](../sql/alter-table-column.md).
* To unset a Classification system tag from a column, use an ALTER TABLE … MODIFY COLUMN … UNSET TAG statement.
* This stored procedure is no longer being updated to coincide with additional enhancements to
  [Data Classification](../../user-guide/classify-intro.md).

## Examples

Extract the semantic and privacy categories for the `my_db.my_schema.hr_data` table and apply the categories as tags for the table:

> ```sqlexample
> USE ROLE data_engineer;
>
> CALL ASSOCIATE_SEMANTIC_CATEGORY_TAGS('mydb.my_schema.hr_data',
>                                       EXTRACT_SEMANTIC_CATEGORIES('mydb.my_schema.hr_data'));
> ```

Apply the results from [EXTRACT_SEMANTIC_CATEGORIES](../functions/extract_semantic_categories.md) that have been stored in the `classification_results`
table:

> ```sqlexample
> USE ROLE data_engineer;
>
> CALL ASSOCIATE_SEMANTIC_CATEGORY_TAGS('mydb.my_schema.hr_data',
>                                       (SELECT * FROM classification_results));
> ```

Modify the results from [EXTRACT_SEMANTIC_CATEGORIES](../functions/extract_semantic_categories.md) in the `classification_results` table and apply the
tags:

> ```sqlexample
> USE ROLE data_engineer;
>
> UPDATE classification_results SET V =
>     OBJECT_INSERT(V,'LNAME',OBJECT_INSERT(
>         OBJECT_INSERT(V:LNAME,'semantic_category','NAME',TRUE),
>         'privacy_category','IDENTIFIER',TRUE),
>         TRUE
>         );
>
> CALL ASSOCIATE_SEMANTIC_CATEGORY_TAGS('mydb.my_schema.hr_data',
>                                       (SELECT * FROM classification_results));
> ```

---
title: AT | BEFORE
source: https://docs.snowflake.com/en/sql-reference/constructs/at-before.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# AT | BEFORE

The AT or BEFORE clause is used for Snowflake Time Travel. In a query, it is specified in the [FROM](from.md) clause
immediately after the table name, and it determines the point in the past from which historical data is requested for the object:

* The AT keyword specifies that the request is inclusive of any changes made by a statement or transaction with a timestamp equal to the
  specified parameter.
* The BEFORE keyword specifies that the request refers to a point immediately preceding the specified parameter. This point in time is just
  before the statement, identified by its query ID, is completed. For more information, see Using the BEFORE clause.

You can use the same syntax to clone objects; see [CREATE <object> … CLONE](../sql/create-clone.md). If you don’t
specify a point in time for a clone, the clone defaults to the state of the object as of now
(the [CURRENT_TIMESTAMP](../functions/current_timestamp.md) value).

For more information, see [Understanding & using Time Travel](../../user-guide/data-time-travel.md).

See also:
:   [FROM](from.md)

## Syntax

```sqlsyntax
SELECT ...
FROM ...
  { AT | BEFORE }
  (
    { TIMESTAMP => <timestamp> |
      OFFSET => <time_difference> |
      STATEMENT => <id> |
      STREAM => '<name>' }
  )
[ ... ]
```

## Parameters

`TIMESTAMP => timestamp`
:   Specifies an exact date and time to use for Time Travel. The value must be explicitly cast to a TIMESTAMP,
    TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ data type.

    If no explicit cast is specified, the timestamp in the AT clause is treated as a timestamp with the UTC time zone (equivalent to
    TIMESTAMP_NTZ). Using the TIMESTAMP data type for an explicit cast may also result in the value being treated as a TIMESTAMP_NTZ
    value. For details, see [Date & time data types](../data-types-datetime.md).

`OFFSET => time_difference`
:   Specifies the difference in seconds from the current time to use for Time Travel, in the form `-N` where `N`
    can be an integer or arithmetic expression (e.g. `-120` is 120 seconds, `-30*60` is 1800 seconds or 30 minutes).

`STATEMENT => id`
:   Specifies the query ID of a statement to use as the reference point for Time Travel. This parameter supports any statement of one of the
    following types:

    * DML (e.g. INSERT, UPDATE, DELETE)
    * TCL (BEGIN, COMMIT transaction)
    * SELECT

    The query ID must reference a query that has been executed within the last 14 days. If the query ID references a query over 14 days old,
    the following error is returned:

    ```output
    Error: statement <query_id> not found
    ```

    To work around this limitation, use the timestamp for the referenced query.

`STREAM => 'name'`
:   Specifies the identifier (i.e. name) for an existing stream on the queried table or view. The current offset in
    the stream is used as the `AT` or `BEFORE` point in time for returning change data for the source object.

    This keyword is supported only when creating a stream (using [CREATE STREAM](../sql/create-stream.md)) or querying change data (using
    the [CHANGES](changes.md) clause). For examples, see those topics.

## Using the AT TIMESTAMP parameter

In the AT clause, you can specify the TIMESTAMP keyword followed by a string that represents a timestamp and an optional explicit cast to
the TIMESTAMP, TIMESTAMP_TZ, TIMESTAMP_LTZ, or TIMESTAMP_NTZ data type. The following examples are all valid:

```sqlexample
AT ( TIMESTAMP => '2024-06-05 12:30:00'::TIMESTAMP_LTZ )

AT ( TIMESTAMP => '2024-06-05 12:30:00'::TIMESTAMP )

AT ( TIMESTAMP => '2024-06-05 12:30:00' )
```

If no explicit cast is specified, the timestamp in the AT clause is treated as a timestamp with the UTC time zone (equivalent to TIMESTAMP_NTZ).
Using the TIMESTAMP data type for an explicit cast may also result in the value being treated as a TIMESTAMP_NTZ value, as discussed in
[Date & time data types](../data-types-datetime.md).

The explicit cast that you choose affects the results of Time Travel queries because timestamps are interpreted with respect to the
current time zone for the session and the value of the TIMESTAMP_TYPE_MAPPING parameter. For more details about this behavior, see
[Querying Time Travel data in a session with a non-UTC time zone](https://community.snowflake.com/s/article/Querying-time-travel-data-in-a-session-with-a-non-UTC-timezone).

For example, you are running queries in a SQL session where the current time zone is `America/Los_Angeles` and TIMESTAMP_TYPE_MAPPING is set to
`TIMESTAMP_NTZ`. Create a table and immediately insert two rows:

```sqlexample
CREATE OR REPLACE TABLE tt1 (c1 INT, c2 INT);
INSERT INTO tt1 VALUES(1,2);
INSERT INTO tt1 VALUES(2,3);
```

Check the creation time of the table with a SHOW TABLES command:

```sqlexample
SHOW TERSE TABLES LIKE 'tt1';
```

```output
+-------------------------------+------+-------+---------------+----------------+
| created_on                    | name | kind  | database_name | schema_name    |
|-------------------------------+------+-------+---------------+----------------|
| 2024-06-05 15:25:35.557 -0700 | TT1  | TABLE | TRAVEL_DB     | TRAVEL_SCHEMA  |
+-------------------------------+------+-------+---------------+----------------+
```

Note the time zone offset in the `created_on` column. Five minutes later, insert another row:

```sqlexample
INSERT INTO tt1 VALUES(3,4);
```

Now run the following Time Travel query, expecting it to return the first two rows:

```sqlexample
SELECT * FROM tt1 at(TIMESTAMP => '2024-06-05 15:29:00'::TIMESTAMP);
```

```output
000707 (02000): Time travel data is not available for table TT1. The requested time is either beyond the allowed time travel period or before the object creation time.
```

The query fails because the time zone of the session is UTC, and the explicit cast to TIMESTAMP honors that time zone.
Therefore, the table is assumed to have been created *after* the specified timestamp. To solve this problem, run the
query again with an explicit cast to TIMESTAMP_LTZ (local time zone):

```sqlexample
SELECT * FROM tt1 at(TIMESTAMP => '2024-06-05 15:29:00'::TIMESTAMP_LTZ);
```

```output
+----+----+
| C1 | C2 |
|----+----|
|  1 |  2 |
|  2 |  3 |
+----+----+
```

As expected, the query returns the first two rows that were inserted. Finally, run the same query but specify a slightly later timestamp:

```sqlexample
SELECT * FROM tt1 at(TIMESTAMP => '2024-06-05 15:31:00'::TIMESTAMP_LTZ);
```

```output
+----+----+
| C1 | C2 |
|----+----|
|  1 |  2 |
|  2 |  3 |
|  3 |  4 |
+----+----+
```

This query returns all three rows, given the later timestamp.

## Using the BEFORE clause

The STATEMENT parameter in the BEFORE clause must refer to a query ID. The point in the past used by Time Travel is just before the
statement for that query ID is completed rather than before the statement is started. If concurrent queries commit modifications to
the data between the start and end of the statement, these changes are included in your results.

For example, the following statements are being executed on table `my_table` in parallel in two separate threads:

| Time | Thread | Operation | Phase | Description |
| --- | --- | --- | --- | --- |
| `t1` | 1 | INSERT INTO my_table(id) VALUE(1) | Start | Insert starts execution by performing required checks. |
| `t2` | 1 | INSERT INTO my_table(id) VALUE(1) | End | Insert updated `my_table`. |
| `t3` | 1 | DELETE FROM my_table | Start | Delete identifies the list of records to delete (id=1). |
| `t4` | 2 | INSERT INTO my_table(id) VALUE(2) | Start | Insert starts execution by performing required checks. |
| `t5` | 2 | INSERT INTO my_table(id) VALUE(2) | End | Insert updated `my_table`. |
| `t6` | 2 | SELECT \* FROM my_table | End | Thread `2` selects rows from `my_table`. The results include all rows (id=1, id=2). |
| `t7` | 1 | DELETE FROM my_table | End | Delete updates `my_table` deleting all old records present before time `t3` when the delete statement started in thread `1` (id=1). |
| `t8` | 1 | SELECT \* FROM my_table BEFORE(STATEMENT => LAST_QUERY_ID()) | End | SELECT statement uses Time Travel to retrieve historical data from before the completion of the delete operation. The results include the row from the 2nd insert statement that happened concurrently in thread `2` (id=1, id=2). |

As a workaround, you can use a TIMESTAMP parameter that specifies a point in time just before the start of the statement.

## Usage notes

* Data in Snowflake is identified by timestamps that can differ slightly from the exact value of system time.
* The value for TIMESTAMP or OFFSET must be a constant expression.
* The smallest time resolution for TIMESTAMP is milliseconds.
* If requested data is beyond the Time Travel retention period (default is 1 day), the statement fails.

  In addition, if the requested data is within the Time Travel retention period but no historical data is available (e.g. if the retention
  period was extended), the statement fails.
* If the specified Time Travel time is at or before the point in time when the object was created, the statement fails. See
  Using the AT TIMESTAMP parameter.
* When you access historical table data, the results include the columns, default values, etc. from the current definition of the table.
  The same applies to non-materialized views. For example, if you alter a table to add a column, querying for historical data before
  the point in time when the column was added returns results that include the new column.
* Historical data has the same access control requirements as current data. Any changes are applied retroactively.
* The AT and BEFORE clauses do not support selecting historical data from a [CTE](../../user-guide/queries-cte.md).

  For example, the following query is not supported:

  ```sqlexample
  WITH mycte AS
    (SELECT mytable.* FROM mytable)
  SELECT * FROM mycte AT(TIMESTAMP => '2024-03-13 13:56:09.553 +0100'::TIMESTAMP_TZ);
  ```

  However, these clauses are supported in a query in a [WITH](with.md) clause. For example, the following
  query is supported:

  ```sqlexample
  WITH mycte AS
    (SELECT * FROM mytable AT(TIMESTAMP => '2024-03-13 13:56:09.553 +0100'::TIMESTAMP_TZ))
  SELECT * FROM mycte;
  ```
* Time Travel queries against hybrid tables have the following limitations:

  + Only the TIMESTAMP parameter is supported in the AT clause. The OFFSET, STATEMENT, and STREAM parameters are not supported.
  + The value of the TIMESTAMP parameter must be the same for all tables that belong to the same database. If the tables belong
    to different databases, different TIMESTAMP values may be used.
  + The BEFORE clause is not supported.
* CREATE DATABASE … CLONE and CREATE SCHEMA … CLONE commands that use Time Travel and specify the time with the STATEMENT parameter
  return an error if any hybrid tables exist in the specified database. The error prompts you to run the command using the
  [IGNORE HYBRID TABLES parameter](../sql/create-clone.md). When you include this parameter, the command will
  create the cloned database or schema but skip any hybrid tables.

## Troubleshooting

|  |  |
| --- | --- |
| Error | ```output Time travel data is not available for table <tablename> ``` |
| Cause | In some cases, this is caused by using a string where a timestamp is expected. |
| Solution | Cast the string to a timestamp.  ```sqlexample ... AT(TIMESTAMP => '2018-07-27 12:00:00')               -- fails ... AT(TIMESTAMP => '2018-07-27 12:00:00'::TIMESTAMP)    -- succeeds ``` |

## Examples

Select historical data from a table using a specific timestamp. In the first two
examples, which use the TIMESTAMP parameter, `my_table` could be a standard table or a hybrid table.
Subsitute a recent date, time, or timestamp that’s within the retention period.

```sqlexample
SELECT * FROM my_table AT(TIMESTAMP => 'Wed, 26 Jun 2024 09:20:00 -0700'::TIMESTAMP_LTZ);
```

```sqlexample
SELECT * FROM my_table AT(TIMESTAMP => TO_TIMESTAMP(1432669154242, 3));
```

Select historical data from a table as of 5 minutes ago:

```sqlexample
SELECT * FROM my_table AT(OFFSET => -60*5) AS T WHERE T.flag = 'valid';
```

Select historical data from a table up to, but not including any changes made by the specified transaction:

```sqlexample
SELECT * FROM my_table BEFORE(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726');
```

Return the difference in table data resulting from the specified transaction:

```sqlexample
SELECT oldt.* ,newt.*
  FROM my_table BEFORE(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726') AS oldt
    FULL OUTER JOIN my_table AT(STATEMENT => '8e5d0ca9-005e-44e6-b858-a8f5b37c5726') AS newt
    ON oldt.id = newt.id
  WHERE oldt.id IS NULL OR newt.id IS NULL;
```

The following example runs a Time Travel join query against two tables in the same database, one of
which is a hybrid table. The same TIMESTAMP expression must be used for both tables.
Subsitute a recent date, time, or timestamp that’s within the retention period.

```sqlexample
SELECT *
  FROM db1.public.htt1
    AT(TIMESTAMP => '2024-06-05 17:50:00'::TIMESTAMP_LTZ) h
    JOIN db1.public.tt1
    AT(TIMESTAMP => '2024-06-05 17:50:00'::TIMESTAMP_LTZ) t
    ON h.c1=t.c1;
```

---
title: AWAIT (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/await.md
section: SQL General Reference
---

# AWAIT (Snowflake Scripting)

Waits for all [asynchronous child jobs](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md)
that are running to finish or for a specific asynchronous child job that is running for a
[RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md) to finish, then returns
when the all jobs have finished or the specific job has finished, respectively.

AWAIT is a blocking call. You can use an AWAIT statement to block other code from running until
the AWAIT call completes.

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [CANCEL](cancel.md)

## Syntax

```sqlsyntax
AWAIT { ALL | <result_set_name> };
```

Where:

> `ALL`
> :   The stored procedure waits for all asynchronous child jobs that were started before the AWAIT call.
>
> `result_set_name`
> :   The stored procedure waits for the asynchronous child job that is running for the specified RESULTSET
>     to finish.

## Usage notes

* An asynchronous child job is created when the ASYNC keyword is specified for a query.
  For more information, see [Working with asynchronous child jobs](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md).
* When the ASYNC keyword is specified for a query, the stored procedure can’t access the query results
  until an AWAIT statement returns the results.
* When you run an asynchronous child job, “fire and forget” isn’t supported. Therefore, if the stored
  procedure runs a child job that is still running when the stored procedure completes, the child job
  is canceled automatically.
* Snowflake Scripting supports built-in variables that you can use in the code for stored procedures.

  These variables behave in the following ways for asynchronous child jobs:

  + The [SQLID](../../developer-guide/snowflake-scripting/query-id.md) variable is available for the query
    specified for an asynchronous child job immediately after the asynchronous child job is created.
  + The following [built-in variables for exception handling](../../developer-guide/snowflake-scripting/exceptions.md)
    are available after the AWAIT or AWAIT ALL statement associated with the asynchronous child job that
    caused the error runs:

    - SQLCODE
    - SQLERRM
    - SQLSTATE

    When an AWAIT ALL statement is associated with multiple asynchronous child jobs, these built-in variables
    capture information about the first failing asynchronous child job.
  + The following built-in variables related to
    [the number of rows affected by DML commands](../../developer-guide/snowflake-scripting/dml-status.md)
    are available after the AWAIT statement associated with the asynchronous child job for a
    RESULTSET runs:

    - SQLROWCOUNT
    - SQLFOUND
    - SQLNOTFOUND

    These variables aren’t available when an AWAIT ALL statement runs.
* If an asynchronous child job fails, the AWAIT or AWAIT ALL statement associated with the asynchronous job
  fails with an error, and execution of the stored procedure stops. For example, the following stored procedure
  fails and returns an error when execution reaches the AWAIT statement:

  ```sqlexample
  BEGIN
    LET res RESULTSET := ASYNC (SELECT * FROM invalid_table);
    AWAIT res;
  END;
  ```

  ```output
  002003 (42S02): Uncaught exception of type 'STATEMENT_ERROR' on line 2 at position 4 : SQL compilation error:
  Table 'INVALID_TABLE' does not exist or not authorized.
  ```

## Examples

Wait for all asynchronous child jobs to complete:

```sqlexample
AWAIT ALL;
```

Wait for an asynchronous child job that is running for a RESULTSET to complete:

```sqlexample
AWAIT my_result_set;
```

For more examples, see [Examples of using asynchronous child jobs](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md).

---
title: BEGIN … END (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/begin.md
section: SQL General Reference
---

# BEGIN … END (Snowflake Scripting)

`BEGIN` and `END` define a Snowflake Scripting block.

For more information on blocks, see [Understanding blocks in Snowflake Scripting](../../developer-guide/snowflake-scripting/blocks.md).

## Syntax

```sqlsyntax
BEGIN
    <statement>;
    [ <statement>; ... ]
[ EXCEPTION <exception_handler> ]
END;
```

Where:

> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).
>
> `exception_handler`
> :   Specifies how exceptions should be handled. Refer to [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md) and
>     [EXCEPTION (Snowflake Scripting)](exception.md).

## Usage notes

* The keyword `END` must be followed immediately by a semicolon, or followed immediately by a label that is
  immediately followed by a semicolon.
* The keyword `BEGIN` must not be followed immediately by a semicolon.
* `BEGIN` and `END` are usually used inside another language construct, such as a looping or branching construct,
  or inside a stored procedure. However, this is not required. A BEGIN/END block can be the top-level construct inside
  an anonymous block.
* Blocks can be nested.

## Examples

This is a simple example of using `BEGIN` and `END` to group related statements. This example creates two
related tables.

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
    CREATE TABLE parent (ID INTEGER);
    CREATE TABLE child (ID INTEGER, parent_ID INTEGER);
    RETURN 'Completed';
END;
$$
;
```

The next example is similar; the statements are grouped into a block and are also inside a transaction within
that block:

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
    BEGIN TRANSACTION;
    TRUNCATE TABLE child;
    TRUNCATE TABLE parent;
    COMMIT;
    RETURN '';
END;
$$
;
```

In this example, the statements are inside a [branching](../../developer-guide/snowflake-scripting/branch.md) construct.

```sqlexample
IF (both_rows_are_valid) THEN
    BEGIN
        BEGIN TRANSACTION;
        INSERT INTO parent ...;
        INSERT INTO child ...;
        COMMIT;
    END;
END IF;
```

---
title: Binary input and output
source: https://docs.snowflake.com/en/sql-reference/binary-input-output.md
section: SQL General Reference
---

# Binary input and output

Snowflake supports three binary formats or encoding schemes: hex, base64, and UTF-8.

## Overview of supported binary formats

This section describes the supported binary formats.

### hex (default)

The “hex” format refers to the hexadecimal, or base 16, system. In this format, each byte is represented by two characters
(digits from `0` to `9` and letters from `A` to `F`). When using hex to perform conversion:

| From | To | Notes |
| --- | --- | --- |
| Binary | String | hex uses uppercase letters. |
| String | Binary | hex is case-insensitive. |

Hex is the default binary format.

### base64

The “base64” format encodes binary data (or string data) as printable ASCII characters (letters,
digits, and punctuation marks or mathematical operators).
(The base64 encoding scheme is defined in [RFC 4648](https://tools.ietf.org/html/rfc4648).)

Base64-encoded data has the following advantages:

* Because base64-encoded data is pure ASCII text, it can be stored in systems that support ASCII character data but
  not BINARY data. For example, binary data that represents music (digital samples), or UTF data that represents
  Mandarin language characters, can be encoded as ASCII text and stored in systems that support only ASCII characters.
* Because base64-encoded data doesn’t contain control characters (for example, end-of-transmission characters, tab characters),
  base64-encoded data can be transmitted and received without risk that control characters could
  be interpreted as commands rather than as data. Base64-encoded data is compatible with older modems and
  other telecommunications equipment that transmit and receive data one character at a time (without packet headers
  or protocols that indicate which parts of a packet are data and which are header or control information).

Base64-encoded data has the following disadvantages:

* Converting data back and forth between binary and printable ASCII representations consumes computation resources.
* Base64-encoded data requires approximately 1/3 more storage space than the original data.

The following sections provide technical details on base64 encoding.

#### Details of base64 encoding

Each group of three 8-bit bytes (a total of 24 bits) of binary data is re-arranged into four groups of 6 bits each
(still 24 bits). Each of the 64 possible combinations of 6 bits is represented by one of the following 64 printable
ASCII characters:

* Uppercase letters (A - Z)
* Lowercase letters (a - z)
* Decimal digits (0 - 9)
* `+`
* `/`

In addition, the character `=` is used for padding if the length of the input that isn’t an exact multiple of 3.

Because base64-encoded data doesn’t contain whitespace characters (for example, blanks and line breaks), base64-encoded
data can be mixed with whitespace if desired. For example, if the transmitter or receiver has a maximum limit
on line length, the base64-encoded data can be split into individual lines by adding newline characters without
corrupting the data. When using base64 to perform conversion:

| From | To | Notes |
| --- | --- | --- |
| Binary | String | Base64 does not insert any whitespace or line breaks. |
| String | Binary | Base64 ignores all whitespace and line breaks. |

### UTF-8

The UTF-8 format refers to the UTF-8 character encoding for Unicode.

UTF-8 is used for text-to-binary encoding. UTF-8 can’t be used for binary-to-text encoding because not all possible BINARY values
can be converted to valid UTF-8 strings.

This format is convenient for performing one-to-one conversion between binary and string, for reinterpreting the underlying data
as one type or the other rather than actually encoding and decoding.

## Session parameters for binary values

There are two session parameters that determine how binary values are passed into and out of Snowflake:

* [BINARY_INPUT_FORMAT](parameters.md): Specifies the format of VARCHAR input to functions that convert from VARCHAR to BINARY.
  It is used for:

  + Performing conversion to BINARY in the one-argument version of [TO_BINARY](functions/to_binary.md).
  + Loading data into Snowflake (if no file format option is specified; see below for details).

  The parameter can be set to `HEX`, `BASE64`, or `UTF-8` (or `UTF8`).
  The parameter values are case-insensitive. The default is `HEX`.
* [BINARY_OUTPUT_FORMAT](parameters.md): Specifies the format of VARCHAR output from functions that convert from BINARY to VARCHAR.
  It is used for:

  + Performing conversion to VARCHAR in the one-argument version of [TO_CHAR , TO_VARCHAR](functions/to_char.md).
  + Unloading data from Snowflake (if no file format option is specified; see below for details).
  + Displaying binary data in human-readable format (for example, in the Snowflake web interface) when no binary-to-varchar conversion
    was called explicitly.

  The parameter can be set to `HEX` or `BASE64`. The parameter values are case-insensitive. The default is `HEX`.

  > **Note:**
  >
  > Because conversion from binary to string can fail with the UTF-8 format, BINARY_OUTPUT_FORMAT can’t be set to `UTF-8`. To use
  > UTF-8 for conversion in this situation, use the two-argument version of [TO_CHAR , TO_VARCHAR](functions/to_char.md).

The parameters can be set at the account, user, and session levels. Execute the [SHOW PARAMETERS](sql/show-parameters.md) command to
view the current parameter settings that apply to all operations in the current session.

## File format option for loading/unloading binary values

Separate from the binary input and output session parameters, Snowflake provides the BINARY_FORMAT file format option, which can be
used to explicitly control binary formatting when loading data into or unloading data from Snowflake tables.

This option can be set to `HEX`, `BASE64`, or `UTF-8` (values are case-insensitive). The option affects both data
loading and unloading and, similar to other file format options, can be specified in the following ways:

* In a named file format, which can then be referenced in a named stage or directly in a COPY command.
* In a named stage, which can then be referenced directly in a COPY command.
* Directly in a COPY command.

### Data loading

When used for data loading, BINARY_FORMAT specifies the format of binary values in your staged data files. This option
overrides any value set for the BINARY_INPUT_FORMAT parameter in the session (see Session parameters for binary values).

If the option is set to `HEX` or `BASE64`, data loading can fail if the strings in the staged data file aren’t valid hex or
base64. In this case, Snowflake returns an error and then performs the action specified for the ON_ERROR copy option.

### Data unloading

When used in data unloading, the BINARY_FORMAT option specifies the format applied to binary values unloaded to the files in the
specified stage. This option overrides any value set for the BINARY_OUTPUT_FORMAT parameter in the session
(see Session parameters for binary values).

If the option is set to `UTF-8`, data unloading fails if any binary values in the table contain invalid UTF-8. In this case, Snowflake
returns an error.

## Example input/output

BINARY input/output can be confusing because “what you see isn’t necessarily what you get.”

Consider the following example:

```sqlexample
CREATE OR REPLACE TABLE binary_table (v VARCHAR, b BINARY);

INSERT INTO binary_table (v, b)
  SELECT 'AB', TO_BINARY('AB');

SELECT v, b FROM binary_table;
```

```output
+----+----+
| V  | B  |
|----+----|
| AB | AB |
+----+----+
```

The outputs for column `v` (VARCHAR) and column `b` appear to be identical. Yet
the value for column `b` was converted to binary. Why does the value in column
`b` look unchanged?

The answer is that the argument to TO_BINARY is treated as a sequence of
hexadecimal digits (even though it is inside quotes and therefore looks
like a string). The two characters you see are actually interpreted as a pair of
hexadecimal digits that represent one byte of binary data, not two bytes of
string data. (This wouldn’t have worked if the input “string” had
contained characters other than hexadecimal digits; the result would have been an
error message similar to `"String '...' isn't a legal hex-encoded string"`.)

Also, when BINARY data is displayed, by default it is displayed as a
sequence of hexadecimal digits. Thus the data went in as hexadecimal digits
(not a string) and is displayed as hexadecimal digits, so it appears unchanged.

In fact, if the goal was to store the two-character string `AB`, then the code
was wrong. The proper code would use the function [HEX_ENCODE](functions/hex_encode.md)
to convert the string to a sequence of hexadecimal digits (or use another “encode” function to
convert to another format, such as base64) before storing the data.
Examples of that are below.

### Hexadecimal (“HEX”) format example

One way to enter BINARY data is to encode it as a string of hexadecimal
characters, as shown in the following example.

Start by creating a table with a BINARY column:

```sqlexample
CREATE OR REPLACE TABLE demo_binary_hex (b BINARY);
```

If you try to insert an “ordinary” string by using the TO_BINARY function
to try to convert it to a valid BINARY value, it fails:

```sqlexample
INSERT INTO demo_binary_hex (b) SELECT TO_BINARY('HELP', 'HEX');
```

Here’s the error message:

```output
100115 (22000): The following string is not a legal hex-encoded value: 'HELP'
```

This time, explicitly convert the input to a string of hexadecimal digits
before inserting it (this will succeed):

```sqlexample
INSERT INTO demo_binary_hex (b) SELECT TO_BINARY(HEX_ENCODE('HELP'), 'HEX');
```

Now, retrieve the data:

```sqlexample
SELECT TO_VARCHAR(b), HEX_DECODE_STRING(TO_VARCHAR(b)) FROM demo_binary_hex;
```

```output
+---------------+----------------------------------+
| TO_VARCHAR(B) | HEX_DECODE_STRING(TO_VARCHAR(B)) |
|---------------+----------------------------------|
| 48454C50      | HELP                             |
+---------------+----------------------------------+
```

As you can see, by default the output is shown as hexadecimal. To get back
the original string, use the function [HEX_DECODE_STRING](functions/hex_decode_string.md)
(the complement of the function HEX_ENCODE that was used previously to encode the string).

The following query shows in more detail what’s going on internally:

```sqlexample
SELECT 'HELP',
       HEX_ENCODE('HELP'),
       b,
       HEX_DECODE_STRING(HEX_ENCODE('HELP')),
       TO_VARCHAR(b),
       HEX_DECODE_STRING(TO_VARCHAR(b))
  FROM demo_binary_hex;
```

```output
+--------+--------------------+----------+---------------------------------------+---------------+----------------------------------+
| 'HELP' | HEX_ENCODE('HELP') | B        | HEX_DECODE_STRING(HEX_ENCODE('HELP')) | TO_VARCHAR(B) | HEX_DECODE_STRING(TO_VARCHAR(B)) |
|--------+--------------------+----------+---------------------------------------+---------------+----------------------------------|
| HELP   | 48454C50           | 48454C50 | HELP                                  | 48454C50      | HELP                             |
+--------+--------------------+----------+---------------------------------------+---------------+----------------------------------+
```

### BASE64 format example

Before reading this section, consider reading Hexadecimal (“HEX”) format example.
The basic concepts are similar, and Hexadecimal (“HEX”) format example explains them in
more detail.

Start by creating a table with a BINARY column:

```sqlexample
CREATE OR REPLACE TABLE demo_binary_base64 (b BINARY);
```

Insert a row:

```sqlexample
INSERT INTO demo_binary_base64 (b) SELECT TO_BINARY(BASE64_ENCODE('HELP'), 'BASE64');
```

Retrieve that row:

```sqlexample
SELECT 'HELP',
       BASE64_ENCODE('HELP'),
       BASE64_DECODE_STRING(BASE64_ENCODE('HELP')),
       TO_VARCHAR(b, 'BASE64'),
       BASE64_DECODE_STRING(TO_VARCHAR(b, 'BASE64'))
 FROM demo_binary_base64;
```

```output
+--------+-----------------------+---------------------------------------------+-------------------------+-----------------------------------------------+
| 'HELP' | BASE64_ENCODE('HELP') | BASE64_DECODE_STRING(BASE64_ENCODE('HELP')) | TO_VARCHAR(B, 'BASE64') | BASE64_DECODE_STRING(TO_VARCHAR(B, 'BASE64')) |
|--------+-----------------------+---------------------------------------------+-------------------------+-----------------------------------------------|
| HELP   | SEVMUA==              | HELP                                        | SEVMUA==                | HELP                                          |
+--------+-----------------------+---------------------------------------------+-------------------------+-----------------------------------------------+
```

### UTF-8 format example

Start by creating a table with a BINARY column:

```sqlexample
CREATE OR REPLACE TABLE demo_binary_utf8 (b BINARY);
```

Insert a row:

```sqlexample
INSERT INTO demo_binary_utf8 (b) SELECT TO_BINARY('HELP', 'UTF-8');
```

Retrieve that row:

```sqlexample
SELECT 'HELP',
       TO_VARCHAR(b, 'UTF-8')
  FROM demo_binary_utf8;
```

```output
+--------+------------------------+
| 'HELP' | TO_VARCHAR(B, 'UTF-8') |
|--------+------------------------|
| HELP   | HELP                   |
+--------+------------------------+
```

---
title: Bind variables
source: https://docs.snowflake.com/en/sql-reference/bind-variables.md
section: SQL General Reference
---

# Bind variables

Applications can accept data from users and use that data in SQL statements. For
example, an application might ask a user to enter contact information, such as an
address and phone number.

To specify this user input in a SQL statement, you can programmatically construct
a string for the SQL statement by concatenating the user input with the other parts of the
statement. Alternatively, you can use *bind variables*. To use bind variables,
put one or more placeholders in the text of the SQL statement, then specify the
variable (the value to be used) for each placeholder.

## Overview of bind variables

With bind variables, you replace literals in SQL statements with placeholders. For
example, the following SQL statement uses literals for the inserted values:

```sqlexample
INSERT INTO t (c1, c2) VALUES (1, 'Test string');
```

The following SQL statement uses placeholders for the inserted values:

```sqlexample
INSERT INTO t (c1, c2) VALUES (?, ?);
```

Your application code binds data with each placeholder in the SQL statement. The
technique for binding data with a placeholder depends on the programming language.
The syntax of the placeholder also varies by programming language. It is either
`?`, `:varname`, or `%varname`.

## Use bind variables in Javascript stored procedures

You can use [Javascript](../developer-guide/stored-procedure/stored-procedures-javascript.md) to create
stored procedures that run SQL.

To specify bind variables in Javascript code, use `?` placeholders. For example,
the following INSERT statement specifies bind variables for the values inserted into
a table row:

```sqlexample
INSERT INTO t (col1, col2) VALUES (?, ?)
```

In Javascript code, you can use bind variables for the values in most SQL statements.
For information about limitations, see Limitations for bind variables.

For more information about using bind variables in Javascript, see
[Binding variables](../developer-guide/stored-procedure/stored-procedures-javascript.md).

## Use bind variables with Snowflake Scripting

You can use [Snowflake Scripting](../developer-guide/snowflake-scripting/index.md) to create procedural code
that runs SQL, such as code blocks and stored procedures. To specify bind variables in Snowflake Scripting
code, prefix the variable name with a colon. For example, the following INSERT statement specifies a bind variable
named `variable1`:

```sqlexample
INSERT INTO t (c1) VALUES (:variable1)
```

When you run SQL in an [EXECUTE IMMEDIATE](sql/execute-immediate.md) command or an
[OPEN command for a cursor](../developer-guide/snowflake-scripting/cursors.md), you can bind variables with the USING clause.

This example binds variables in an EXECUTE IMMEDIATE command with a USING clause:

```sqlexample
EXECUTE IMMEDIATE :query USING (minimum_price, maximum_price);
```

For the full example that includes this code, see
[Executing a statement that contains bind variables](sql/execute-immediate.md).

When you declare a cursor, you can specify bind parameters (`?` characters) in a SELECT statement. You can then
bind these parameters to variables in the USING clause when you open the cursor.

The following example declares a cursor and specifies bind parameters, then opens the cursor with the USING clause:

```sqlexample
LET c1 CURSOR FOR SELECT id FROM invoices WHERE price > ? AND price < ?;
OPEN c1 USING (minimum_price, maximum_price);
```

Snowflake Scripting also supports numbering bind variables by position and reusing a bind variable in a SQL statement.
For numbered bind variables, each variable declaration is assigned an index, and you can refer to the nth declared
variable with `:n`. For example, the following Snowflake Scripting block specifies bind variable `:1` for
the `i` variable and `:2` for the `v` variable, and it reuses the `:1` bind variable in a SQL statement:

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  i INTEGER DEFAULT 1;
  v VARCHAR DEFAULT 'SnowFlake';
  r RESULTSET;
BEGIN
  CREATE OR REPLACE TABLE snowflake_scripting_bind_demo (id INTEGER, value VARCHAR);
  EXECUTE IMMEDIATE 'INSERT INTO snowflake_scripting_bind_demo (id, value)
    SELECT :1, (:2 || :1)' USING (i, v);
  r := (SELECT * FROM snowflake_scripting_bind_demo);
  RETURN TABLE(r);
END;
$$
;
```

```output
+----+------------+
| ID | VALUE      |
|----+------------|
|  1 | SnowFlake1 |
+----+------------+
```

In Snowflake Scripting code, you can use bind variables for the values in most SQL statements.
For information about limitations, see Limitations for bind variables.

For more information about using bind variables in Snowflake Scripting, see
[Using a variable in a SQL statement (binding)](../developer-guide/snowflake-scripting/variables.md) and [Using an argument in a SQL statement (binding)](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

## Use bind variables with the SQL API

You can use the [Snowflake SQL API](../developer-guide/sql-api/index.md) to access and update data in a Snowflake
database. You can create applications that use the SQL API to submit SQL statements and manage
deployments.

When you submit a request that runs a SQL statement, you can use bind variables for values
in the statement. For more information, see [Using bind variables in a statement](../developer-guide/sql-api/submitting-requests.md).

## Use bind variables with drivers

Using Snowflake [drivers](../developer-guide/drivers.md), you can write applications that
perform operations on Snowflake. The drivers support programming languages such as Go, Java,
and Python. For information about using bind variables in an application for a specific driver,
follow the link for the driver:

* [Go](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Binding_Parameters)
* [JDBC](../developer-guide/jdbc/jdbc-using.md)
* [.NET](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/QueryingData.md#bind-parameter)
* [Node.js](../developer-guide/node-js/nodejs-driver-execute.md)
* [ODBC](../developer-guide/odbc/odbc-using.md)
* [Python](../developer-guide/python-connector/python-connector-example.md)

> **Note:**
>
> The PHP driver does not support bind variables.

## Use bind variables with arrays of values

You can bind an array of values to variables in SQL statements. Using this technique, you
can improve performance by inserting multiple rows in a single batch, which avoids network
round trips and compilations. The use of an array bind is also called a “bulk insert” or
“batch insert.”

> **Note:**
>
> Snowflake supports other data loading methods that are recommended instead of using array binds.
> For more information, see [Load data into Snowflake](../guides-overview-loading-data.md) and
> [Data loading and unloading commands](commands-data-loading.md).

The following is an example of an array bind in Python code:

```python
conn = snowflake.connector.connect( ... )
rows_to_insert = [('milk', 2), ('apple', 3), ('egg', 2)]
conn.cursor().executemany(
            "insert into grocery (item, quantity) values (?, ?)",
            rows_to_insert)
```

This example specifies the following bind list: `[('milk', 2), ('apple', 3), ('egg', 2)]`.
The way an application specifies a bind list depends on the programming language.

This code inserts three rows into the table:

```output
+-------+----+
| C1    | C2 |
|-------+----|
| milk  |  2 |
| apple |  3 |
| egg   |  2 |
+-------+----+
```

For information about using array binds in an application for a specific driver,
follow the link for the driver:

* [Go](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Batch_Inserts_and_Binding_Parameters)
* [JDBC](../developer-guide/jdbc/jdbc-using.md)
* [.NET](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/QueryingData.md#bind-array-variables)
* [Node.js](../developer-guide/node-js/nodejs-driver-execute.md)
* [ODBC](../developer-guide/odbc/odbc-using.md)
* [Python](../developer-guide/python-connector/python-connector-example.md)

> **Note:**
>
> The PHP driver doesn’t support array binds.

### Limitations of using array binds

The following limitations apply to array binds:

* Only INSERT INTO … VALUES statements can contain array bind variables.
* The VALUES clause must be a single-row list of bind variables. For example, the following
  VALUES clause is not allowed:

  ```sqlexample
  VALUES (?,?), (?,?)
  ```

### Insert multiple rows without using array binds

An INSERT statement might use bind variables to insert multiple rows without using
an array bind. The following example inserts values into two rows, but it doesn’t use an array bind.

```sqlexample
INSERT INTO t VALUES (?,?), (?,?);
```

For example, your application can specify a bind list that’s equivalent to the following values, in order,
for the placeholders: `[1,'String1',2,'String2']`. Because the VALUES clause specifies more
than one row, the statement only inserts the exact number of values (four in the example), rather
than a dynamic number of rows.

## Use bind variables with semi-structured data

To bind variables with semi-structured data, bind the variable as a string type, and use functions
such as [PARSE_JSON](functions/parse_json.md) or [ARRAY_CONSTRUCT](functions/array_construct.md).

The following example creates a table with one [VARIANT](data-types-semistructured.md) column and then calls
the PARSE_JSON function to insert semi-structured data into the table with a bind variable:

```sqlexample
CREATE TABLE t (a VARIANT);
-- Code that supplies a bind value for ? of '{'a': 'abc', 'x': 'xyz'}'
INSERT INTO t SELECT PARSE_JSON(a) FROM VALUES (?);
```

The following example queries the table:

```sqlexample
SELECT * FROM t;
```

The query returns the following output:

```output
+---------------+
| A             |
|---------------|
| {             |
|   "a": "abc", |
|   "x": "xyz"  |
| }             |
+---------------+
```

The following statement calls the ARRAY_CONSTRUCT function to insert an array of semi-structured
data into a VARIANT column with a bind variable:

```sqlexample
INSERT INTO t SELECT ARRAY_CONSTRUCT(column1) FROM VALUES (?);
```

Both of these examples can insert a single row, or they can use an array bind to insert multiple rows
in one batch. You can use this technique to insert any type of semi-structured data that is valid in a
VARIANT column.

## Retrieve bind variable values

To retrieve the values of bind variables in a query that has been executed, you can use the
[BIND_VALUES](functions/bind_values.md) table function in the INFORMATION_SCHEMA schema.
With this function, you can retrieve bind variable values from any code that supports bind variables,
including Javascript and Snowflake Scripting code.

You can also access these bind variable values from the `bind_values` column in the output for
the [QUERY_HISTORY Account Usage view](account-usage/query_history.md),
the [QUERY_HISTORY Organization Usage view](organization-usage/query_history.md),
or the [QUERY_HISTORY function](functions/query_history.md).

To prevent bind values from being accessible to users, set the [ALLOW_BIND_VALUES_ACCESS](parameters.md)
account-level parameter to `FALSE`.

You might want to retrieve bind variable values for the following cases:

* **Troubleshooting queries** - When you know the exact bind values used in queries, it’s easier to optimize
  the queries and debug the following types of issues:

  + A query fails to run.
  + A query’s performance is poor.
  + A query isn’t using caches or expected execution plans.
* **Recreating queries for testing** - Developers and DBAs can recreate user-generated queries with bind variable
  values to replicate problems and for stress testing.
* **Auditing and compliance** - For security and compliance purposes, organizations must audit the data that users
  are accessing. They can use bind variable values to determine the exact data retrieved by users.

### Examples that retrieve bind variable values

The following queries return the bind variable values for a previous query:

```sqlexample
SELECT * FROM TABLE(
  INFORMATION_SCHEMA.BIND_VALUES('<query_id_value>'));
```

```sqlexample
SELECT bind_values
  FROM SNOWFLAKE.ACCOUNT_USAGE.QUERY_HISTORY
  WHERE query_id = '<query_id_value>';
```

Replace `query_id_value` with the query ID. You can use the [LAST_QUERY_ID](functions/last_query_id.md)
function to return the ID of a previous query.

> **Note:**
>
> The latency for the QUERY_HISTORY view might be up to 45 minutes.

The following examples use the BIND_VALUES function:

* Snowflake Scripting example that retrieves named bind variables
* Python Connector example that retrieves positional bind variables

#### Snowflake Scripting example that retrieves named bind variables

Run the following Snowflake Scripting anonymous block, which includes a statement that uses bind variables:

```sqlexample
DECLARE
  name STRING;
  temperature FLOAT;
  res RESULTSET;
BEGIN
  name := 'Snowman';
  temperature := -20.14;
  res := (
    SELECT
      CONCAT('Hello ', :NAME, '!') as greeting,
      CONCAT('It is ', :TEMPERATURE, 'deg C today.') as weather
  );
  RETURN LAST_QUERY_ID();
END;
```

Note: If you use [Snowflake CLI](../developer-guide/snowflake-cli/index.md), [SnowSQL](../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  name STRING;
  temperature FLOAT;
  res RESULTSET;
BEGIN
  name := 'Snowman';
  temperature := -20.14;
  res := (
    SELECT
      CONCAT('Hello ', :NAME, '!') as greeting,
      CONCAT('It is ', :TEMPERATURE, 'deg C today.') as weather
  );
  RETURN LAST_QUERY_ID();
END;
$$
;
```

The block returns the query ID of the statement that uses bind variables.

> **Note:**
>
> Your statement will return a different query ID than the one shown in here.

```output
+--------------------------------------+
| anonymous block                      |
|--------------------------------------|
| 01bbe3d6-0109-0863-0000-a99502ffa062 |
+--------------------------------------+
```

To retrieve the bind variables used in the anonymous block, run the following query. Replace
`01bbe3d6-0109-0863-0000-a99502ffa062` with the query ID in your output after running the
anonymous block.

```sqlexample
SELECT * FROM TABLE(
  INFORMATION_SCHEMA.BIND_VALUES('01bbe3d6-0109-0863-0000-a99502ffa062'));
```

```output
+--------------------------------------+----------+-------------+------+---------+
| QUERY_ID                             | POSITION | NAME        | TYPE | VALUE   |
|--------------------------------------+----------+-------------+------+---------|
| 01bbe3d6-0109-0863-0000-a99502ffa062 |     NULL | TEMPERATURE | REAL | -20.14  |
| 01bbe3d6-0109-0863-0000-a99502ffa062 |     NULL | NAME        | TEXT | Snowman |
+--------------------------------------+----------+-------------+------+---------+
```

#### Python Connector example that retrieves positional bind variables

The following Python Connector code uses the BIND_VALUES function to display the values of the
positional bind variables in the output:

```python
cursor = conn.cursor()
print(cursor.execute(
          """
          SELECT
              CONCAT('Hello ', ?, '!') as greeting,
              CONCAT('It is ', ?, 'deg C today.') as weather
          """,
          params=["Snowman", -20.14],
      ).fetch_pandas_all())

query_id = cursor.sfqid
print(f"Bind values for query {query_id} are:")
print(cursor.execute("SELECT * FROM TABLE(INFORMATION_SCHEMA.BIND_VALUES(?))", params=[query_id]).fetch_pandas_all())
```

```output
        GREETING                   WEATHER
0  Hello Snowman!  It is -20.14deg C today.

Bind values for query 01bbe918-0200-0001-0000-000000101145 are:

                               QUERY_ID POSITION  NAME  TYPE    VALUE
0  01bbe918-0200-0001-0000-000000101145        1  None  TEXT  Snowman
1  01bbe918-0200-0001-0000-000000101145        2  None  REAL   -20.14
```

## Limitations for bind variables

The following limitations apply to bind variables:

* Limitations for SELECT statements:

  + Bind variables can’t replace numbers that are part of a data type definition (for example,
    `NUMBER(?)`) or [collation specification](collation.md) (for example,
    `COLLATE ?`).
  + Bind variables can’t be used for the source in a SELECT statement that queries files on a stage.
* Limitations for DDL commands:

  + Bind variables can’t be used in the following DDL commands:

    - CREATE/ALTER INTEGRATION
    - CREATE/ALTER REPLICATION GROUP
    - CREATE/ALTER PIPE
    - CREATE TABLE … USING TEMPLATE
  + Bind variables can’t be used in the following clauses:

    - ALTER COLUMN
    - COMMENT ON CONSTRAINT
  + In CREATE/ALTER commands, bind variables can’t be used for the values of the following parameters:

    - CREDENTIALS
    - DIRECTORY
    - ENCRYPTION
    - IMPORTS
    - PACKAGES
    - REFRESH
    - TAG
    - Parameters that are specific to external tables
  + Bind variables can’t be used for properties that are part of a [FILE FORMAT](sql/create-file-format.md)
    value.
* In COPY INTO commands, bind variables can’t be used for the values of the following parameters:

  + CREDENTIALS
  + ENCRYPTION
  + FILE_FORMAT
* In SHOW commands, bind variables can’t be used in the STARTS WITH parameter.
* Bind variables can’t be used in an EXECUTE IMMEDIATE FROM command.
* Bind variable values can’t be converted automatically from one data type to another when bind variables are used in:

  + Snowflake Scripting code that specifies the data type explicitly
  + DDL statements
  + Stage names

## Security considerations for bind variables

Bind variables don’t mask sensitive data in all cases. For example, the values of bind variables might appear
in error messages and other artifacts.

Bind variables can help to prevent SQL injection attacks when you construct SQL statements with user input. However,
bind variables can present potential security risks. If inputs to SQL statements come from external sources, make sure
they are validated. For more information, see [SQL injection](../developer-guide/stored-procedure/stored-procedures-usage.md).

---
title: Bitwise expression functions
source: https://docs.snowflake.com/en/sql-reference/expressions-byte-bit.md
section: SQL General Reference
---

# Bitwise expression functions

This family of functions can be used to perform bitwise operations on numbers or a group of numeric records.

| Function Name | Syntax | Summary Description |
| --- | --- | --- |
| [BITAND](functions/bitand.md) | `BITAND(a, b)` | Bitwise AND of two numeric or binary expressions (`a` and `b`). |
| [BITAND_AGG](functions/bitand_agg.md) | `BITAND_AGG(a)` | Bitwise AND value of all non-NULL numeric records in a group `a`. |
| [BITNOT](functions/bitnot.md) | `BITNOT(a)` | Bitwise negation of `a` numeric or binary expression. |
| [BITOR](functions/bitor.md) | `BITOR(a, b)` | Bitwise OR of two numeric or binary expressions (`a` and `b`). |
| [BITOR_AGG](functions/bitor_agg.md) | `BITOR_AGG(a)` | Bitwise OR value of all non-NULL numeric records in a group `a`. |
| [BITSHIFTLEFT](functions/bitshiftleft.md) | `BITSHIFTLEFT(a, n)` | Shift the bits for `a` numeric or binary expression `n` positions to the left. |
| [BITSHIFTRIGHT](functions/bitshiftright.md) | `BITSHIFTRIGHT(a, n)` | Shift the bits for `a` numeric or binary expression `n` positions to the right, with sign extension. |
| [BITXOR](functions/bitxor.md) | `BITXOR(a, b)` | Bitwise XOR of two numeric or binary expressions (`a` and `b`). |
| [BITXOR_AGG](functions/bitxor_agg.md) | `BITXOR_AGG(a)` | Bitwise XOR value of all non-NULL numeric records in a group `a`. |
| [GETBIT](functions/getbit.md) | `GETBIT(a, n)` | Return the bit at position `n` in `a` numeric expression. |

---
title: BREAK (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/break.md
section: SQL General Reference
---

# BREAK (Snowflake Scripting)

`BREAK` (or `EXIT`) terminates a loop.

For more information on terminating loops, see [Terminating a loop](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [CONTINUE](continue.md)

## Syntax

```sqlsyntax
{ BREAK | EXIT } [ <label> ] ;
```

Where:

> `label`
> :   An optional label. If the label is specified, the `BREAK` will jump to the statement immediately after
>     the label.
>
>     You can use this to break out of more than one level of a nested loop or a nested branch.

## Usage notes

* `BREAK` and `EXIT` are synonymous.
* If the loop is embedded in another loop(s), you can exit out of not only the current loop, but also an
  enclosing loop, by including the enclosing loop’s label as part of the `BREAK`. For an example, see the examples
  section below.

## Examples

Here is an example of using BREAK to exit not only the current loop, but also an enclosing loop:

```sqlexample
DECLARE
  i INTEGER;
  j INTEGER;
BEGIN
  i := 1;
  j := 1;
  WHILE (i <= 4) DO
    WHILE (j <= 4) DO
      -- Exit when j is 3, even if i is still 1.
      IF (j = 3) THEN
        BREAK outer_loop;
      END IF;
      j := j + 1;
    END WHILE inner_loop;
    i := i + 1;
  END WHILE outer_loop;
  -- Execution resumes here after the BREAK executes.
  RETURN i;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
    DECLARE
        i INTEGER;
        j INTEGER;
    BEGIN
        i := 1;
        j := 1;
        WHILE (i <= 4) DO
            WHILE (j <= 4) DO
                -- Exit when j is 3, even if i is still 1.
                IF (j = 3) THEN
                     BREAK outer_loop;
                END IF;
                j := j + 1;
            END WHILE inner_loop;
            i := i + 1;
        END WHILE outer_loop;
        -- Execution resumes here after the BREAK executes.
        RETURN i;
    END;
$$;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               1 |
+-----------------+
```

---
title: Calling an external function for AWS
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-call.md
section: SQL General Reference
---

# Calling an external function for AWS

This topic describes how to call an external function.

An external function is called like any other [UDF (user-defined function)](../developer-guide/udf/udf-overview.md). (And, like any
other UDF, an external function is called the same way regardless of platform.)

1. If you have not already done so, make sure that your session is using the database and schema that contain the function.

   (External functions are database objects; when you call the function, the database and schema containing the function must be in
   use in your session, or you must specify the fully-qualified name of the function.)

   ```sqlexample
   USE DATABASE <database_name>;
   USE SCHEMA <schema_name>;
   ```
2. If appropriate, and if you have not already done so, grant USAGE privilege on the external function to one or more Snowflake
   roles that need to call the external function.

   (A role must have USAGE or OWNERSHIP privileges on an external function to call it.)

   ```sqlexample
   GRANT USAGE ON FUNCTION <external_function_name>(<parameter_data_type>) TO <role_name>;
   ```

   For example:

   ```sqlexample
   GRANT USAGE ON FUNCTION echo(INTEGER, VARCHAR) TO analyst_role;
   ```
3. Using an appropriate role, call your external function as part of an SQL statement. If you created one of the sample external
   functions supplied by Snowflake, you can call the function as shown below:

   > ```sqlexample
   > SELECT echo(42, 'Adams');
   > ```

   If you used a function name other than `echo`, then replace `echo` with the actual function name.

   The returned value should be similar to:

   > ```sqlexample
   > [0, 42, "Adams"]
   > ```

   Where:

   * `0` is the row number of the returned value.
   * `42, "Adams"` is the returned value.

> **Note:**
>
> Although an external function can usually be called like other UDFs, there are a handful of exceptions. For details,
> see [Execution-time limitations and issues](external-functions-introduction.md).

---
title: Calling an external function for Azure
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-call.md
section: SQL General Reference
---

# Calling an external function for Azure

This topic describes how to call an external function:

An external function is called like any other [UDF (user-defined function)](../developer-guide/udf/udf-overview.md). (And, like any
other UDF, an external function is called the same way regardless of platform.)

1. If you have not already done so, make sure that your session is using the database and schema that contain the function.

   (External functions are database objects; when you call the function, the database and schema containing the function must be in
   use in your session, or you must specify the fully-qualified name of the function.)

   ```sqlexample
   USE DATABASE <database_name>;
   USE SCHEMA <schema_name>;
   ```
2. If appropriate, and if you have not already done so, grant USAGE privilege on the external function to one or more Snowflake
   roles that need to call the external function.

   (A role must have USAGE or OWNERSHIP privileges on an external function to call it.)

   ```sqlexample
   GRANT USAGE ON FUNCTION <external_function_name>(<parameter_data_type>) TO <role_name>;
   ```

   For example:

   ```sqlexample
   GRANT USAGE ON FUNCTION echo(INTEGER, VARCHAR) TO analyst_role;
   ```
3. Using an appropriate role, call your external function as part of an SQL statement. If you created one of the sample external
   functions supplied by Snowflake, you can call the function as shown below:

   > ```sqlexample
   > SELECT echo(42, 'Adams');
   > ```

   If you used a function name other than `echo`, then replace `echo` with the actual function name.

   The returned value should be similar to:

   > ```sqlexample
   > [0, 42, "Adams"]
   > ```

   Where:

   * `0` is the row number of the returned value.
   * `42, "Adams"` is the returned value.

> **Note:**
>
> Although an external function can usually be called like other UDFs, there are a handful of exceptions. For details,
> see [Execution-time limitations and issues](external-functions-introduction.md).

---
title: Calling an external function for GCP
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-call.md
section: SQL General Reference
---

# Calling an external function for GCP

This topic describes how to call an external function:

1. If appropriate, grant USAGE privilege on the external function to one or more Snowflake roles so that the roles can call the external
   function. A role must have USAGE or OWNERSHIP privileges on that external function.
2. Call your external function as you would execute any UDF. For example, if you create the sample function provided by Snowflake:

   > ```sqlexample
   > select my_external_function(42, 'Life, the Universe, and Everything');
   > ```

   If you customized the function name when you created the function, then replace `my_external_function` with the customized name.

   The returned value should be similar to:

   > ```sqlexample
   > [42, "Life, the Universe, and Everything"]
   > ```

> **Note:**
>
> External functions are schema objects so the schema containing the function must be in use in your session or you must specify the
> fully-qualified name of the function when calling it.

---
title: CANCEL (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/cancel.md
section: SQL General Reference
---

# CANCEL (Snowflake Scripting)

Cancels an [asynchronous child job](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md)
that is running for a [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [AWAIT](await.md)

## Syntax

```sqlsyntax
CANCEL <result_set_name> ;
```

Where:

> `result_set_name`
> :   The name of the RESULTSET.

## Usage notes

* An asynchronous child job is created for a RESULTSET when the ASYNC keyword is specified for the query
  that is associated with the RESULTSET.
* If the child job for the RESULTSET has already completed, the CANCEL statement has no effect.

## Examples

```sqlexample
CANCEL my_result_set;
```

---
title: CASE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/case.md
section: SQL General Reference
---

# CASE (Snowflake Scripting)

A `CASE` statement provides a way to specify multiple conditions.

For more information on branching constructs, see [Working with conditional logic](../../developer-guide/snowflake-scripting/branch.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

## Syntax

**Simple CASE statement:**

> ```sqlsyntax
> CASE ( <expression_to_match> )
>     WHEN <expression> THEN
>         <statement>;
>         [ <statement>; ... ]
>     [ WHEN ... ]
>     [ ELSE
>         <statement>;
>         [ <statement>; ... ]
>     ]
> END [ CASE ] ;
> ```

Where:

> `expression_to_match`
> :   The expression to match.
>
> `expression`
> :   If the value of this expression matches the value of `expression_to_match`, then the statements in this clause
>     are executed.
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).

**Searched CASE statement:**

> ```sqlsyntax
> CASE
>     WHEN <boolean_expression> THEN
>         <statement>;
>         [ <statement>; ... ]
>     [ WHEN ... ]
>     [ ELSE
>         <statement>;
>         [ <statement>; ... ]
>     ]
> END [ CASE ] ;
> ```

Where:

> `boolean_expression`
> :   If this expression evaluates to TRUE, then the statements in this clause are executed.
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).

## Usage notes

* If more than one branch of the `CASE` would match the expression, only the first is used.
* When you compare expressions, NULL does not match NULL. If you wish to test explicitly for NULL values, use
  [IS [ NOT ] NULL](../functions/is-null.md).

## Examples

This example demonstrates a simple `CASE` statement:

> ```sqlexample
> CREATE PROCEDURE case_demo_01(v VARCHAR)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
>   BEGIN
>     CASE (v)
>       WHEN 'first choice' THEN
>         RETURN 'one';
>       WHEN 'second choice' THEN
>         RETURN 'two';
>       ELSE
>         RETURN 'unexpected choice';
>     END;
>   END;
> ```
>
> Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
> `execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
> code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):
>
> ```sqlexample
> CREATE PROCEDURE case_demo_01(v VARCHAR)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
> $$
>     BEGIN
>         CASE (v)
>             WHEN 'first choice' THEN
>                 RETURN 'one';
>             WHEN 'second choice' THEN
>                 RETURN 'two';
>             ELSE
>                 RETURN 'unexpected choice';
>        END CASE;
>     END;
> $$
> ;
> ```

When you call this stored procedure, the procedure produces the following output:

> ```sqlexample
> CALL case_demo_01('second choice');
> +--------------+
> | CASE_DEMO_01 |
> |--------------|
> | two          |
> +--------------+
> ```

This example demonstrates a searched `CASE` statement:

> ```sqlexample
> CREATE PROCEDURE case_demo_2(v VARCHAR)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
>   BEGIN
>     CASE
>       WHEN v = 'first choice' THEN
>         RETURN 'one';
>       WHEN v = 'second choice' THEN
>         RETURN 'two';
>       ELSE
>         RETURN 'unexpected choice';
>     END;
>   END;
> ```
>
> Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
> `execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
> code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):
>
> ```sqlexample
> CREATE PROCEDURE case_demo_2(v VARCHAR)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
> $$
>     BEGIN
>         CASE
>             WHEN v = 'first choice' THEN
>                 RETURN 'one';
>             WHEN v = 'second choice' THEN
>                 RETURN 'two';
>             ELSE
>                 RETURN 'unexpected choice';
>        END CASE;
>     END;
> $$
> ;
> ```

When you call this stored procedure, the procedure produces the following output:

> ```sqlexample
> CALL case_demo_2('none of the above');
> +-------------------+
> | CASE_DEMO_2       |
> |-------------------|
> | unexpected choice |
> +-------------------+
> ```

---
title: CHANGES
source: https://docs.snowflake.com/en/sql-reference/constructs/changes.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# CHANGES

The CHANGES clause enables querying the change tracking metadata for a table or view within a specified interval of time
without having to create a stream with an explicit transactional offset. Multiple queries can retrieve the change tracking
metadata between different transactional start and endpoints.

> **Note:**
>
> Change tracking must be enabled on the source table or the source view and its underlying tables. For details, see the usage notes
> (in this topic).

In a query, the CHANGES clause is specified in the [FROM](from.md) clause.

The optional END keyword specifies the end of the change interval. The results are inclusive of the end marker.

## Syntax

```sqlsyntax
SELECT ...
FROM ...
   CHANGES ( INFORMATION => { DEFAULT | APPEND_ONLY } )
   AT ( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> | STREAM => '<name>' } ) | BEFORE ( STATEMENT => <id> )
   [ END( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> } ) ]
[ ... ]
```

## Parameters

`INFORMATION => { DEFAULT | APPEND_ONLY }`
:   Specifies the type of change tracking data to return based on the metadata recorded in each:

    `DEFAULT`
    :   Returns all DML changes to the source object, including inserts, updates, and deletes (including table truncates). This type of change
        tracking compares inserted and deleted rows in the change set to provide the row level delta. As a net effect, for example, a row that
        is inserted and then deleted between two transactional points of time in a table is removed in the delta (i.e. is not returned in the
        query results).

    `APPEND_ONLY`
    :   Returns appended rows only; therefore no join is performed. As a result, querying append-only changes can be much more performant than querying standard (default) changes for extract, load, transform (ELT) and similar scenarios that depend exclusively on row inserts.

`TIMESTAMP => timestamp`
:   Specifies an exact date and time to use for Time Travel. Note that the value must be explicitly cast to a TIMESTAMP.

`OFFSET => time_difference`
:   Specifies the difference in seconds from the current time to use for Time Travel, in the form `-N` where `N` can be an integer or arithmetic expression (e.g. `-120` is 120 seconds, `-30*60`
    is 1800 seconds or 30 minutes).

`STATEMENT => id`
:   Specifies the query ID of a statement to use as the reference point for Time Travel. This parameter supports any statement of one of the following types:

    * DML (e.g. INSERT, UPDATE, DELETE)
    * TCL (BEGIN, COMMIT transaction)
    * SELECT

`STREAM => 'name'`
:   Specifies the identifier (i.e. name) for an existing stream on the queried table or view. The current offset in
    the stream is used as the `AT` point in time for returning change data for the source object.

## Usage notes

* The CHANGES clause is not supported when querying for changes (which are resolved using change-tracking metadata) for
  [directory tables](../../user-guide/data-load-dirtables.md) or [external tables](../../user-guide/tables-external-intro.md).
* Currently, at least one of the following must be true before change tracking metadata is recorded for a table:

  + Change tracking is enabled on the table or view for the interval queried by CHANGES.
  + A stream is created for the table.

  Change tracking can be enabled explicitly by using the [ALTER TABLE](../sql/alter-table.md) command or implicitly when a stream or table is created.

  > ```sqlexample
  > ALTER TABLE mytable SET CHANGE_TRACKING = TRUE;
  > ```

  Both options add hidden columns to the table which store change tracking metadata. The columns consume a small amount of storage.

  To query the change data for a view, change tracking must be enabled on the source view and its underlying tables. For instructions, see
  [Enabling change tracking on views and underlying tables](../../user-guide/streams-manage.md). Additionally, the view is subject to the same limitations as streams on views. For more information, see [Streams on views](../../user-guide/streams-intro.md).
* The [AT | BEFORE](at-before.md) clause is required and sets the current offset for the change tracking metadata.
* The optional END clause sets the end timestamp for the change interval. If no END value is specified, the current timestamp is used as the end of the change interval.

  Note that the END clause is valid only when combined with the CHANGES clause to query change tracking metadata (i.e. this clause cannot be combined with AT|BEFORE when using Time Travel to query historic data for other objects).
* The value for TIMESTAMP or OFFSET must be a constant expression.
* The smallest time resolution for TIMESTAMP is milliseconds.
* If requested data is beyond the Time Travel retention period (default is 1 day), the statement fails.

  In addition, if the requested data is within the Time Travel retention period but no historical data is available (e.g. if the retention period was extended), the statement fails.
* The CHANGES clause computes the changes on the specified interval, without maintaining a durable [offset store](../../user-guide/streams-intro.md). For more information, see [CHANGES clause: Read-only alternative to streams](../../user-guide/streams-intro.md).

## Examples

The following example queries the standard (delta) and append-only change tracking metadata for a table. No END() value is provided, so the current timestamp is used as the endpoint in the transactional interval of time:

```sqlexample
 CREATE OR REPLACE TABLE t1 (
   id number(8) NOT NULL,
   c1 varchar(255) default NULL
 );

-- Enable change tracking on the table.
 ALTER TABLE t1 SET CHANGE_TRACKING = TRUE;

 -- Initialize a session variable for the current timestamp.
 SET ts1 = (SELECT CURRENT_TIMESTAMP());

 INSERT INTO t1 (id,c1)
 VALUES
 (1,'red'),
 (2,'blue'),
 (3,'green');

 DELETE FROM t1 WHERE id = 1;

 UPDATE t1 SET c1 = 'purple' WHERE id = 2;

 -- Query the change tracking metadata in the table during the interval from $ts1 to the current time.
 -- Return the full delta of the changes.
 SELECT *
 FROM t1
   CHANGES(INFORMATION => DEFAULT)
   AT(TIMESTAMP => $ts1);

 +----+--------+-----------------+-------------------+------------------------------------------+
 | ID | C1     | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
 |----+--------+-----------------+-------------------+------------------------------------------|
 |  2 | purple | INSERT          | False             | 1614e92e93f86af6348f15af01a85c4229b42907 |
 |  3 | green  | INSERT          | False             | 86df000054a4d1dc64d5d74a44c3131c4c046a1f |
 +----+--------+-----------------+-------------------+------------------------------------------+

 -- Query the change tracking metadata in the table during the interval from $ts1 to the current time.
 -- Return the append-only changes.
 SELECT *
 FROM t1
   CHANGES(INFORMATION => APPEND_ONLY)
   AT(TIMESTAMP => $ts1);

 +----+-------+-----------------+-------------------+------------------------------------------+
 | ID | C1    | METADATA$ACTION | METADATA$ISUPDATE | METADATA$ROW_ID                          |
 |----+-------+-----------------+-------------------+------------------------------------------|
 |  1 | red   | INSERT          | False             | 6a964a652fa82974f3f20b4f49685de54eeb4093 |
 |  2 | blue  | INSERT          | False             | 1614e92e93f86af6348f15af01a85c4229b42907 |
 |  3 | green | INSERT          | False             | 86df000054a4d1dc64d5d74a44c3131c4c046a1f |
 +----+-------+-----------------+-------------------+------------------------------------------+
```

The following example consumes the append-only changes for a table from a transactional point of time before the rows were deleted from the table:

```sqlexample
CREATE OR REPLACE TABLE t1 (
  id number(8) NOT NULL,
  c1 varchar(255) default NULL
);

-- Enable change tracking on the table.
ALTER TABLE t1 SET CHANGE_TRACKING = TRUE;

-- Initialize a session 'start timestamp' variable for the current timestamp.
SET ts1 = (SELECT CURRENT_TIMESTAMP());

INSERT INTO t1 (id,c1)
VALUES
(1,'red'),
(2,'blue'),
(3,'green');

-- Initialize a session 'end timestamp' variable for the current timestamp.
SET ts2 = (SELECT CURRENT_TIMESTAMP());

DELETE FROM t1 WHERE id = 3;
SET last_query_id = (SELECT LAST_QUERY_ID());

-- Create a table populated by the change data between the start and end timestamps.
CREATE OR REPLACE TABLE t2 (
  c1 varchar(255) default NULL
  )
AS SELECT C1
  FROM t1
  CHANGES(INFORMATION => APPEND_ONLY)
  AT(TIMESTAMP => $ts1)
  END(TIMESTAMP => $ts2);

SELECT * FROM t2;

+-------+
| C1    |
|-------|
| red   |
| blue  |
| green |
+-------+

-- Create a table populated by the change data between the start timestamp and end statement.
-- This example demonstrates that END is inclusive of the statement passed in.
CREATE OR REPLACE TABLE t3 (
  c1 varchar(255) default NULL
  )
AS SELECT C1
  FROM t1
  CHANGES(INFORMATION => DEFAULT)
  AT(TIMESTAMP => $ts1)
  END(STATEMENT => $last_query_id);

+-------+
| C1    |
|-------|
| red   |
| blue  |
+-------+
```

The following example is similar to the previous example. This example uses the current
offset for a stream on the source table as the start point in time for populating the new table
with change data from the source table. Because a stream is created on the source object,
you do not need to explicitly enable change tracking on the object:

```sqlexample
CREATE OR REPLACE TABLE t1 (
  id number(8) NOT NULL,
  c1 varchar(255) default NULL
);

-- Create a stream on the table.
CREATE OR REPLACE STREAM s1 ON TABLE t1;

INSERT INTO t1 (id,c1)
VALUES
(1,'red'),
(2,'blue'),
(3,'green');

-- Initialize a session 'end timestamp' variable for the current timestamp.
SET ts2 = (SELECT CURRENT_TIMESTAMP());

DELETE FROM t1;

-- Create a table populated by the change data between the current
-- s1 offset and the end timestamp.
CREATE OR REPLACE TABLE t2 (
  c1 varchar(255) default NULL
  )
AS SELECT C1
  FROM t1
  CHANGES(INFORMATION => APPEND_ONLY)
  AT(STREAM => 's1')
  END(TIMESTAMP => $ts2);

SELECT * FROM t2;

+-------+
| C1    |
|-------|
| red   |
| blue  |
| green |
+-------+
```

---
title: CLOSE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/close.md
section: SQL General Reference
---

# CLOSE (Snowflake Scripting)

Closes the specified cursor.

For more information on cursors, see [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [DECLARE](declare.md), [OPEN](open.md), [FETCH](fetch.md)

## Syntax

```sqlsyntax
CLOSE <cursor_name> ;
```

Where:

> `cursor_name`
> :   The name of the cursor.

## Usage notes

* After a cursor is closed, the cursor’s current row pointer is invalid. Re-opening the cursor causes the cursor to start from
  the beginning of the new result set.

## Examples

```sqlexample
CLOSE my_cursor_name;
```

For a more complete example of using a cursor, see
[the introductory cursor example](../../developer-guide/snowflake-scripting/cursors.md).

An example using a loop is included in the [documentation on FOR loops](for.md).

---
title: Collation locales supported by Snowflake
source: https://docs.snowflake.com/en/sql-reference/collation-locales.md
section: SQL General Reference
---

# Collation locales supported by Snowflake

Collation allows you to specify alternative rules for comparing strings, which can be used to compare and
sort data according to a particular language or other user-specified rules. For more information, see
[Collation support](collation.md).

Snowflake collation supports the locales in the following table. The table lists each supported
locale and its associated language.

| Locale | Language |
| --- | --- |
| `af` | Afrikaans |
| `af_na` | Afrikaans (Namibia) |
| `af_za` | Afrikaans (South Africa) |
| `agq` | Aghem |
| `agq_cm` | Aghem (Cameroon) |
| `ak` | Akan |
| `ak_gh` | Akan (Ghana) |
| `am` | Amharic |
| `am_et` | Amharic (Ethiopia) |
| `ar` | Arabic |
| `ar_001` | Arabic (World) |
| `ar_ae` | Arabic (United Arab Emirates) |
| `ar_bh` | Arabic (Bahrain) |
| `ar_dj` | Arabic (Djibouti) |
| `ar_dz` | Arabic (Algeria) |
| `ar_eg` | Arabic (Egypt) |
| `ar_eh` | Arabic (Western Sahara) |
| `ar_er` | Arabic (Eritrea) |
| `ar_il` | Arabic (Israel) |
| `ar_iq` | Arabic (Iraq) |
| `ar_jo` | Arabic (Jordan) |
| `ar_km` | Arabic (Comoros) |
| `ar_kw` | Arabic (Kuwait) |
| `ar_lb` | Arabic (Lebanon) |
| `ar_ly` | Arabic (Libya) |
| `ar_ma` | Arabic (Morocco) |
| `ar_mr` | Arabic (Mauritania) |
| `ar_om` | Arabic (Oman) |
| `ar_ps` | Arabic (Palestinian Territories) |
| `ar_qa` | Arabic (Qatar) |
| `ar_sa` | Arabic (Saudi Arabia) |
| `ar_sd` | Arabic (Sudan) |
| `ar_so` | Arabic (Somalia) |
| `ar_ss` | Arabic (South Sudan) |
| `ar_sy` | Arabic (Syria) |
| `ar_td` | Arabic (Chad) |
| `ar_tn` | Arabic (Tunisia) |
| `ar_ye` | Arabic (Yemen) |
| `as` | Assamese |
| `as_in` | Assamese (India) |
| `asa` | Asu |
| `asa_tz` | Asu (Tanzania) |
| `ast` | Asturian |
| `ast_es` | Asturian (Spain) |
| `az` | Azerbaijani |
| `az_cyrl` | Azerbaijani (Cyrillic) |
| `az_cyrl_az` | Azerbaijani (Cyrillic, Azerbaijan) |
| `az_latn` | Azerbaijani (Latin) |
| `az_latn_az` | Azerbaijani (Latin, Azerbaijan) |
| `bas` | Basaa |
| `bas_cm` | Basaa (Cameroon) |
| `be` | Belarusian |
| `be_by` | Belarusian (Belarus) |
| `bem` | Bemba |
| `bem_zm` | Bemba (Zambia) |
| `bez` | Bena |
| `bez_tz` | Bena (Tanzania) |
| `bg` | Bulgarian |
| `bg_bg` | Bulgarian (Bulgaria) |
| `bm` | Bambara |
| `bm_ml` | Bambara (Mali) |
| `bn` | Bangla |
| `bn_bd` | Bangla (Bangladesh) |
| `bn_in` | Bangla (India) |
| `bo` | Tibetan |
| `bo_cn` | Tibetan (China) |
| `bo_in` | Tibetan (India) |
| `br` | Breton |
| `br_fr` | Breton (France) |
| `brx` | Bodo |
| `brx_in` | Bodo (India) |
| `bs` | Bosnian |
| `bs_cyrl` | Bosnian (Cyrillic) |
| `bs_cyrl_ba` | Bosnian (Cyrillic, Bosnia & Herzegovina) |
| `bs_latn` | Bosnian (Latin) |
| `bs_latn_ba` | Bosnian (Latin, Bosnia & Herzegovina) |
| `ca` | Catalan |
| `ca_ad` | Catalan (Andorra) |
| `ca_es` | Catalan (Spain) |
| `ca_fr` | Catalan (France) |
| `ca_it` | Catalan (Italy) |
| `ccp` | Chakma |
| `ccp_bd` | Chakma (Bangladesh) |
| `ccp_in` | Chakma (India) |
| `ce` | Chechen |
| `ce_ru` | Chechen (Russia) |
| `cgg` | Chiga |
| `cgg_ug` | Chiga (Uganda) |
| `chr` | Cherokee |
| `chr_us` | Cherokee (United States) |
| `ckb` | Central Kurdish |
| `ckb_iq` | Central Kurdish (Iraq) |
| `ckb_ir` | Central Kurdish (Iran) |
| `cs` | Czech |
| `cs_cz` | Czech (Czechia) |
| `cy` | Welsh |
| `cy_gb` | Welsh (United Kingdom) |
| `da` | Danish |
| `da_dk` | Danish (Denmark) |
| `da_gl` | Danish (Greenland) |
| `dav` | Taita |
| `dav_ke` | Taita (Kenya) |
| `de` | German |
| `de_at` | German (Austria) |
| `de_be` | German (Belgium) |
| `de_ch` | German (Switzerland) |
| `de_de` | German (Germany) |
| `de_it` | German (Italy) |
| `de_li` | German (Liechtenstein) |
| `de_lu` | German (Luxembourg) |
| `dje` | Zarma |
| `dje_ne` | Zarma (Niger) |
| `dsb` | Lower Sorbian |
| `dsb_de` | Lower Sorbian (Germany) |
| `dua` | Duala |
| `dua_cm` | Duala (Cameroon) |
| `dyo` | Jola-Fonyi |
| `dyo_sn` | Jola-Fonyi (Senegal) |
| `dz` | Dzongkha |
| `dz_bt` | Dzongkha (Bhutan) |
| `ebu` | Embu |
| `ebu_ke` | Embu (Kenya) |
| `ee` | Ewe |
| `ee_gh` | Ewe (Ghana) |
| `ee_tg` | Ewe (Togo) |
| `el` | Greek |
| `el_cy` | Greek (Cyprus) |
| `el_gr` | Greek (Greece) |
| `en` | English |
| `en_001` | English (World) |
| `en_150` | English (Europe) |
| `en_ag` | English (Antigua & Barbuda) |
| `en_ai` | English (Anguilla) |
| `en_as` | English (American Samoa) |
| `en_at` | English (Austria) |
| `en_au` | English (Australia) |
| `en_bb` | English (Barbados) |
| `en_be` | English (Belgium) |
| `en_bi` | English (Burundi) |
| `en_bm` | English (Bermuda) |
| `en_bs` | English (Bahamas) |
| `en_bw` | English (Botswana) |
| `en_bz` | English (Belize) |
| `en_ca` | English (Canada) |
| `en_cc` | English (Cocos [Keeling] Islands) |
| `en_ch` | English (Switzerland) |
| `en_ck` | English (Cook Islands) |
| `en_cm` | English (Cameroon) |
| `en_cx` | English (Christmas Island) |
| `en_cy` | English (Cyprus) |
| `en_de` | English (Germany) |
| `en_dg` | English (Diego Garcia) |
| `en_dk` | English (Denmark) |
| `en_dm` | English (Dominica) |
| `en_er` | English (Eritrea) |
| `en_fi` | English (Finland) |
| `en_fj` | English (Fiji) |
| `en_fk` | English (Falkland Islands) |
| `en_fm` | English (Micronesia) |
| `en_gb` | English (United Kingdom) |
| `en_gd` | English (Grenada) |
| `en_gg` | English (Guernsey) |
| `en_gh` | English (Ghana) |
| `en_gi` | English (Gibraltar) |
| `en_gm` | English (Gambia) |
| `en_gu` | English (Guam) |
| `en_gy` | English (Guyana) |
| `en_hk` | English (Hong Kong SAR China) |
| `en_ie` | English (Ireland) |
| `en_il` | English (Israel) |
| `en_im` | English (Isle of Man) |
| `en_in` | English (India) |
| `en_io` | English (British Indian Ocean Territory) |
| `en_je` | English (Jersey) |
| `en_jm` | English (Jamaica) |
| `en_ke` | English (Kenya) |
| `en_ki` | English (Kiribati) |
| `en_kn` | English (St. Kitts & Nevis) |
| `en_ky` | English (Cayman Islands) |
| `en_lc` | English (St. Lucia) |
| `en_lr` | English (Liberia) |
| `en_ls` | English (Lesotho) |
| `en_mg` | English (Madagascar) |
| `en_mh` | English (Marshall Islands) |
| `en_mo` | English (Macau SAR China) |
| `en_mp` | English (Northern Mariana Islands) |
| `en_ms` | English (Montserrat) |
| `en_mt` | English (Malta) |
| `en_mu` | English (Mauritius) |
| `en_mw` | English (Malawi) |
| `en_my` | English (Malaysia) |
| `en_na` | English (Namibia) |
| `en_nf` | English (Norfolk Island) |
| `en_ng` | English (Nigeria) |
| `en_nl` | English (Netherlands) |
| `en_nr` | English (Nauru) |
| `en_nu` | English (Niue) |
| `en_nz` | English (New Zealand) |
| `en_pg` | English (Papua New Guinea) |
| `en_ph` | English (Philippines) |
| `en_pk` | English (Pakistan) |
| `en_pn` | English (Pitcairn Islands) |
| `en_pr` | English (Puerto Rico) |
| `en_pw` | English (Palau) |
| `en_rw` | English (Rwanda) |
| `en_sb` | English (Solomon Islands) |
| `en_sc` | English (Seychelles) |
| `en_sd` | English (Sudan) |
| `en_se` | English (Sweden) |
| `en_sg` | English (Singapore) |
| `en_sh` | English (St. Helena) |
| `en_si` | English (Slovenia) |
| `en_sl` | English (Sierra Leone) |
| `en_ss` | English (South Sudan) |
| `en_sx` | English (Sint Maarten) |
| `en_sz` | English (Swaziland) |
| `en_tc` | English (Turks & Caicos Islands) |
| `en_tk` | English (Tokelau) |
| `en_to` | English (Tonga) |
| `en_tt` | English (Trinidad & Tobago) |
| `en_tv` | English (Tuvalu) |
| `en_tz` | English (Tanzania) |
| `en_ug` | English (Uganda) |
| `en_um` | English (U.S. Outlying Islands) |
| `en_us` | English (United States) |
| `en_us_posix` | English (United States, Computer) |
| `en_vc` | English (St. Vincent & Grenadines) |
| `en_vg` | English (British Virgin Islands) |
| `en_vi` | English (U.S. Virgin Islands) |
| `en_vu` | English (Vanuatu) |
| `en_ws` | English (Samoa) |
| `en_za` | English (South Africa) |
| `en_zm` | English (Zambia) |
| `en_zw` | English (Zimbabwe) |
| `eo` | Esperanto |
| `es` | Spanish |
| `es_419` | Spanish (Latin America) |
| `es_ar` | Spanish (Argentina) |
| `es_bo` | Spanish (Bolivia) |
| `es_br` | Spanish (Brazil) |
| `es_bz` | Spanish (Belize) |
| `es_cl` | Spanish (Chile) |
| `es_co` | Spanish (Colombia) |
| `es_cr` | Spanish (Costa Rica) |
| `es_cu` | Spanish (Cuba) |
| `es_do` | Spanish (Dominican Republic) |
| `es_ea` | Spanish (Ceuta & Melilla) |
| `es_ec` | Spanish (Ecuador) |
| `es_es` | Spanish (Spain) |
| `es_gq` | Spanish (Equatorial Guinea) |
| `es_gt` | Spanish (Guatemala) |
| `es_hn` | Spanish (Honduras) |
| `es_ic` | Spanish (Canary Islands) |
| `es_mx` | Spanish (Mexico) |
| `es_ni` | Spanish (Nicaragua) |
| `es_pa` | Spanish (Panama) |
| `es_pe` | Spanish (Peru) |
| `es_ph` | Spanish (Philippines) |
| `es_pr` | Spanish (Puerto Rico) |
| `es_py` | Spanish (Paraguay) |
| `es_sv` | Spanish (El Salvador) |
| `es_us` | Spanish (United States) |
| `es_uy` | Spanish (Uruguay) |
| `es_ve` | Spanish (Venezuela) |
| `et` | Estonian |
| `et_ee` | Estonian (Estonia) |
| `eu` | Basque |
| `eu_es` | Basque (Spain) |
| `ewo` | Ewondo |
| `ewo_cm` | Ewondo (Cameroon) |
| `fa` | Persian |
| `fa_af` | Persian (Afghanistan) |
| `fa_ir` | Persian (Iran) |
| `ff` | Fulah |
| `ff_cm` | Fulah (Cameroon) |
| `ff_gn` | Fulah (Guinea) |
| `ff_mr` | Fulah (Mauritania) |
| `ff_sn` | Fulah (Senegal) |
| `fi` | Finnish |
| `fi_fi` | Finnish (Finland) |
| `fil` | Filipino |
| `fil_ph` | Filipino (Philippines) |
| `fo` | Faroese |
| `fo_dk` | Faroese (Denmark) |
| `fo_fo` | Faroese (Faroe Islands) |
| `fr` | French |
| `fr_be` | French (Belgium) |
| `fr_bf` | French (Burkina Faso) |
| `fr_bi` | French (Burundi) |
| `fr_bj` | French (Benin) |
| `fr_bl` | French (St. Barthélemy) |
| `fr_ca` | French (Canada) |
| `fr_cd` | French (Congo - Kinshasa) |
| `fr_cf` | French (Central African Republic) |
| `fr_cg` | French (Congo - Brazzaville) |
| `fr_ch` | French (Switzerland) |
| `fr_ci` | French (Côte d’Ivoire) |
| `fr_cm` | French (Cameroon) |
| `fr_dj` | French (Djibouti) |
| `fr_dz` | French (Algeria) |
| `fr_fr` | French (France) |
| `fr_ga` | French (Gabon) |
| `fr_gf` | French (French Guiana) |
| `fr_gn` | French (Guinea) |
| `fr_gp` | French (Guadeloupe) |
| `fr_gq` | French (Equatorial Guinea) |
| `fr_ht` | French (Haiti) |
| `fr_km` | French (Comoros) |
| `fr_lu` | French (Luxembourg) |
| `fr_ma` | French (Morocco) |
| `fr_mc` | French (Monaco) |
| `fr_mf` | French (St. Martin) |
| `fr_mg` | French (Madagascar) |
| `fr_ml` | French (Mali) |
| `fr_mq` | French (Martinique) |
| `fr_mr` | French (Mauritania) |
| `fr_mu` | French (Mauritius) |
| `fr_nc` | French (New Caledonia) |
| `fr_ne` | French (Niger) |
| `fr_pf` | French (French Polynesia) |
| `fr_pm` | French (St. Pierre & Miquelon) |
| `fr_re` | French (Réunion) |
| `fr_rw` | French (Rwanda) |
| `fr_sc` | French (Seychelles) |
| `fr_sn` | French (Senegal) |
| `fr_sy` | French (Syria) |
| `fr_td` | French (Chad) |
| `fr_tg` | French (Togo) |
| `fr_tn` | French (Tunisia) |
| `fr_vu` | French (Vanuatu) |
| `fr_wf` | French (Wallis & Futuna) |
| `fr_yt` | French (Mayotte) |
| `fur` | Friulian |
| `fur_it` | Friulian (Italy) |
| `fy` | Western Frisian |
| `fy_nl` | Western Frisian (Netherlands) |
| `ga` | Irish |
| `ga_ie` | Irish (Ireland) |
| `gd` | Scottish Gaelic |
| `gd_gb` | Scottish Gaelic (United Kingdom) |
| `gl` | Galician |
| `gl_es` | Galician (Spain) |
| `gsw` | Swiss German |
| `gsw_ch` | Swiss German (Switzerland) |
| `gsw_fr` | Swiss German (France) |
| `gsw_li` | Swiss German (Liechtenstein) |
| `gu` | Gujarati |
| `gu_in` | Gujarati (India) |
| `guz` | Gusii |
| `guz_ke` | Gusii (Kenya) |
| `gv` | Manx |
| `gv_im` | Manx (Isle of Man) |
| `ha` | Hausa |
| `ha_gh` | Hausa (Ghana) |
| `ha_ne` | Hausa (Niger) |
| `ha_ng` | Hausa (Nigeria) |
| `haw` | Hawaiian |
| `haw_us` | Hawaiian (United States) |
| `he` | Hebrew |
| `he_il` | Hebrew (Israel) |
| `hi` | Hindi |
| `hi_in` | Hindi (India) |
| `hr` | Croatian |
| `hr_ba` | Croatian (Bosnia & Herzegovina) |
| `hr_hr` | Croatian (Croatia) |
| `hsb` | Upper Sorbian |
| `hsb_de` | Upper Sorbian (Germany) |
| `hu` | Hungarian |
| `hu_hu` | Hungarian (Hungary) |
| `hy` | Armenian |
| `hy_am` | Armenian (Armenia) |
| `id` | Indonesian |
| `id_id` | Indonesian (Indonesia) |
| `ig` | Igbo |
| `ig_ng` | Igbo (Nigeria) |
| `ii` | Sichuan Yi |
| `ii_cn` | Sichuan Yi (China) |
| `is` | Icelandic |
| `is_is` | Icelandic (Iceland) |
| `it` | Italian |
| `it_ch` | Italian (Switzerland) |
| `it_it` | Italian (Italy) |
| `it_sm` | Italian (San Marino) |
| `it_va` | Italian (Vatican City) |
| `ja` | Japanese |
| `ja_jp` | Japanese (Japan) |
| `go` | Ngomba |
| `jgo_cm` | Ngomba (Cameroon) |
| `jmc` | Machame |
| `jmc_tz` | Machame (Tanzania) |
| `ka` | Georgian |
| `ka_ge` | Georgian (Georgia) |
| `kab` | Kabyle |
| `kab_dz` | Kabyle (Algeria) |
| `kam` | Kamba |
| `kam_ke` | Kamba (Kenya) |
| `kde` | Makonde |
| `kde_tz` | Makonde (Tanzania) |
| `kea` | Kabuverdianu |
| `kea_cv` | Kabuverdianu (Cape Verde) |
| `khq` | Koyra Chiini |
| `khq_ml` | Koyra Chiini (Mali) |
| `ki` | Kikuyu |
| `ki_ke` | Kikuyu (Kenya) |
| `kk` | Kazakh |
| `kk_kz` | Kazakh (Kazakhstan) |
| `kkj` | Kako |
| `kkj_cm` | Kako (Cameroon) |
| `kl` | Kalaallisut |
| `kl_gl` | Kalaallisut (Greenland) |
| `kln` | Kalenjin |
| `kln_ke` | Kalenjin (Kenya) |
| `km` | Khmer |
| `km_kh` | Khmer (Cambodia) |
| `kn` | Kannada |
| `kn_in` | Kannada (India) |
| `ko` | Korean |
| `ko_kp` | Korean (North Korea) |
| `ko_kr` | Korean (South Korea) |
| `kok` | Konkani |
| `kok_in` | Konkani (India) |
| `ks` | Kashmiri |
| `ks_in` | Kashmiri (India) |
| `ksb` | Shambala |
| `ksb_tz` | Shambala (Tanzania) |
| `ksf` | Bafia |
| `ksf_cm` | Bafia (Cameroon) |
| `ksh` | Colognian |
| `ksh_de` | Colognian (Germany) |
| `kw` | Cornish |
| `kw_gb` | Cornish (United Kingdom) |
| `ky` | Kyrgyz |
| `ky_kg` | Kyrgyz (Kyrgyzstan) |
| `lag` | Langi |
| `lag_tz` | Langi (Tanzania) |
| `lb` | Luxembourgish |
| `lb_lu` | Luxembourgish (Luxembourg) |
| `lg` | Ganda |
| `lg_ug` | Ganda (Uganda) |
| `lkt` | Lakota |
| `lkt_us` | Lakota (United States) |
| `ln` | Lingala |
| `ln_ao` | Lingala (Angola) |
| `ln_cd` | Lingala (Congo - Kinshasa) |
| `ln_cf` | Lingala (Central African Republic) |
| `ln_cg` | Lingala (Congo - Brazzaville) |
| `lo` | Lao |
| `lo_la` | Lao (Laos) |
| `lrc` | Northern Luri |
| `lrc_iq` | Northern Luri (Iraq) |
| `lrc_ir` | Northern Luri (Iran) |
| `lt` | Lithuanian |
| `lt_lt` | Lithuanian (Lithuania) |
| `lu` | Luba-Katanga |
| `lu_cd` | Luba-Katanga (Congo - Kinshasa) |
| `luo` | Luo |
| `luo_ke` | Luo (Kenya) |
| `luy` | Luyia |
| `luy_ke` | Luyia (Kenya) |
| `lv` | Latvian |
| `lv_lv` | Latvian (Latvia) |
| `mas` | Masai |
| `mas_ke` | Masai (Kenya) |
| `mas_tz` | Masai (Tanzania) |
| `mer` | Meru |
| `mer_ke` | Meru (Kenya) |
| `mfe` | Morisyen |
| `mfe_mu` | Morisyen (Mauritius) |
| `mg` | Malagasy |
| `mg_mg` | Malagasy (Madagascar) |
| `mgh` | Makhuwa-Meetto |
| `mgh_mz` | Makhuwa-Meetto (Mozambique) |
| `mgo` | Metaʼ |
| `mgo_cm` | Metaʼ (Cameroon) |
| `mk` | Macedonian |
| `mk_mk` | Macedonian (Macedonia) |
| `ml` | Malayalam |
| `ml_in` | Malayalam (India) |
| `mn` | Mongolian |
| `mn_mn` | Mongolian (Mongolia) |
| `mr` | Marathi |
| `mr_in` | Marathi (India) |
| `ms` | Malay |
| `ms_bn` | Malay (Brunei) |
| `ms_my` | Malay (Malaysia) |
| `ms_sg` | Malay (Singapore) |
| `mt` | Maltese |
| `mt_mt` | Maltese (Malta) |
| `mua` | Mundang |
| `mua_cm` | Mundang (Cameroon) |
| `my` | Burmese |
| `my_mm` | Burmese (Myanmar [Burma]) |
| `mzn` | Mazanderani |
| `mzn_ir` | Mazanderani (Iran) |
| `naq` | Nama |
| `naq_na` | Nama (Namibia) |
| `nb` | Norwegian Bokmål |
| `nb_no` | Norwegian Bokmål (Norway) |
| `nb_sj` | Norwegian Bokmål (Svalbard & Jan Mayen) |
| `nd` | North Ndebele |
| `nd_zw` | North Ndebele (Zimbabwe) |
| `nds` | Low German |
| `nds_de` | Low German (Germany) |
| `nds_nl` | Low German (Netherlands) |
| `ne` | Nepali |
| `ne_in` | Nepali (India) |
| `ne_np` | Nepali (Nepal) |
| `nl` | Dutch |
| `nl_aw` | Dutch (Aruba) |
| `nl_be` | Dutch (Belgium) |
| `nl_bq` | Dutch (Caribbean Netherlands) |
| `nl_cw` | Dutch (Curaçao) |
| `nl_nl` | Dutch (Netherlands) |
| `nl_sr` | Dutch (Suriname) |
| `nl_sx` | Dutch (Sint Maarten) |
| `nmg` | Kwasio |
| `nmg_cm` | Kwasio (Cameroon) |
| `nn` | Norwegian Nynorsk |
| `nn_no` | Norwegian Nynorsk (Norway) |
| `nnh` | Ngiemboon |
| `nnh_cm` | Ngiemboon (Cameroon) |
| `nus` | Nuer |
| `nus_ss` | Nuer (South Sudan) |
| `nyn` | Nyankole |
| `nyn_ug` | Nyankole (Uganda) |
| `om` | Oromo |
| `om_et` | Oromo (Ethiopia) |
| `om_ke` | Oromo (Kenya) |
| `or` | Odia |
| `or_in` | Odia (India) |
| `os` | Ossetic |
| `os_ge` | Ossetic (Georgia) |
| `os_ru` | Ossetic (Russia) |
| `pa` | Punjabi |
| `pa_arab` | Punjabi (Arabic) |
| `pa_arab_pk` | Punjabi (Arabic, Pakistan) |
| `pa_guru` | Punjabi (Gurmukhi) |
| `pa_guru_in` | Punjabi (Gurmukhi, India) |
| `pl` | Polish |
| `pl_pl` | Polish (Poland) |
| `ps` | Pashto |
| `ps_af` | Pashto (Afghanistan) |
| `pt` | Portuguese |
| `pt_ao` | Portuguese (Angola) |
| `pt_br` | Portuguese (Brazil) |
| `pt_ch` | Portuguese (Switzerland) |
| `pt_cv` | Portuguese (Cape Verde) |
| `pt_gq` | Portuguese (Equatorial Guinea) |
| `pt_gw` | Portuguese (Guinea-Bissau) |
| `pt_lu` | Portuguese (Luxembourg) |
| `pt_mo` | Portuguese (Macau SAR China) |
| `pt_mz` | Portuguese (Mozambique) |
| `pt_pt` | Portuguese (Portugal) |
| `pt_st` | Portuguese (São Tomé & Príncipe) |
| `pt_tl` | Portuguese (Timor-Leste) |
| `qu` | Quechua |
| `qu_bo` | Quechua (Bolivia) |
| `qu_ec` | Quechua (Ecuador) |
| `qu_pe` | Quechua (Peru) |
| `rm` | Romansh |
| `rm_ch` | Romansh (Switzerland) |
| `rn` | Rundi |
| `rn_bi` | Rundi (Burundi) |
| `ro` | Romanian |
| `ro_md` | Romanian (Moldova) |
| `ro_ro` | Romanian (Romania) |
| `rof` | Rombo |
| `rof_tz` | Rombo (Tanzania) |
| `ru` | Russian |
| `ru_by` | Russian (Belarus) |
| `ru_kg` | Russian (Kyrgyzstan) |
| `ru_kz` | Russian (Kazakhstan) |
| `ru_md` | Russian (Moldova) |
| `ru_ru` | Russian (Russia) |
| `ru_ua` | Russian (Ukraine) |
| `rw` | Kinyarwanda |
| `rw_rw` | Kinyarwanda (Rwanda) |
| `rwk` | Rwa |
| `rwk_tz` | Rwa (Tanzania) |
| `sah` | Sakha |
| `sah_ru` | Sakha (Russia) |
| `saq` | Samburu |
| `saq_ke` | Samburu (Kenya) |
| `sbp` | Sangu |
| `sbp_tz` | Sangu (Tanzania) |
| `se` | Northern Sami |
| `se_fi` | Northern Sami (Finland) |
| `se_no` | Northern Sami (Norway) |
| `se_se` | Northern Sami (Sweden) |
| `seh` | Sena |
| `seh_mz` | Sena (Mozambique) |
| `ses` | Koyraboro Senni |
| `ses_ml` | Koyraboro Senni (Mali) |
| `sg` | Sango |
| `sg_cf` | Sango (Central African Republic) |
| `shi` | Tachelhit |
| `shi_latn` | Tachelhit (Latin) |
| `shi_latn_ma` | Tachelhit (Latin, Morocco) |
| `shi_tfng` | Tachelhit (Tifinagh) |
| `shi_tfng_ma` | Tachelhit (Tifinagh, Morocco) |
| `si` | Sinhala |
| `si_lk` | Sinhala (Sri Lanka) |
| `sk` | Slovak |
| `sk_sk` | Slovak (Slovakia) |
| `sl` | Slovenian |
| `sl_si` | Slovenian (Slovenia) |
| `smn` | Inari Sami |
| `smn_fi` | Inari Sami (Finland) |
| `sn` | Shona |
| `sn_zw` | Shona (Zimbabwe) |
| `so` | Somali |
| `so_dj` | Somali (Djibouti) |
| `so_et` | Somali (Ethiopia) |
| `so_ke` | Somali (Kenya) |
| `so_so` | Somali (Somalia) |
| `sq` | Albanian |
| `sq_al` | Albanian (Albania) |
| `sq_mk` | Albanian (Macedonia) |
| `sq_xk` | Albanian (Kosovo) |
| `sr` | Serbian |
| `sr_cyrl` | Serbian (Cyrillic) |
| `sr_cyrl_ba` | Serbian (Cyrillic, Bosnia & Herzegovina) |
| `sr_cyrl_me` | Serbian (Cyrillic, Montenegro) |
| `sr_cyrl_rs` | Serbian (Cyrillic, Serbia) |
| `sr_cyrl_xk` | Serbian (Cyrillic, Kosovo) |
| `sr_latn` | Serbian (Latin) |
| `sr_latn_ba` | Serbian (Latin, Bosnia & Herzegovina) |
| `sr_latn_me` | Serbian (Latin, Montenegro) |
| `sr_latn_rs` | Serbian (Latin, Serbia) |
| `sr_latn_xk` | Serbian (Latin, Kosovo) |
| `sv` | Swedish |
| `sv_ax` | Swedish (Åland Islands) |
| `sv_fi` | Swedish (Finland) |
| `sv_se` | Swedish (Sweden) |
| `sw` | Swahili |
| `sw_cd` | Swahili (Congo - Kinshasa) |
| `sw_ke` | Swahili (Kenya) |
| `sw_tz` | Swahili (Tanzania) |
| `sw_ug` | Swahili (Uganda) |
| `ta` | Tamil |
| `ta_in` | Tamil (India) |
| `ta_lk` | Tamil (Sri Lanka) |
| `ta_my` | Tamil (Malaysia) |
| `ta_sg` | Tamil (Singapore) |
| `te` | Telugu |
| `te_in` | Telugu (India) |
| `teo` | Teso |
| `teo_ke` | Teso (Kenya) |
| `teo_ug` | Teso (Uganda) |
| `tg` | Tajik |
| `tg_tj` | Tajik (Tajikistan) |
| `th` | Thai |
| `th_th` | Thai (Thailand) |
| `ti` | Tigrinya |
| `ti_er` | Tigrinya (Eritrea) |
| `ti_et` | Tigrinya (Ethiopia) |
| `to` | Tongan |
| `to_to` | Tongan (Tonga) |
| `tr` | Turkish |
| `tr_cy` | Turkish (Cyprus) |
| `tr_tr` | Turkish (Turkey) |
| `tt` | Tatar |
| `tt_ru` | Tatar (Russia) |
| `twq` | Tasawaq |
| `twq_ne` | Tasawaq (Niger) |
| `tzm` | Central Atlas Tamazight |
| `tzm_ma` | Central Atlas Tamazight (Morocco) |
| `ug` | Uyghur |
| `ug_cn` | Uyghur (China) |
| `uk` | Ukrainian |
| `uk_ua` | Ukrainian (Ukraine) |
| `ur` | Urdu |
| `ur_in` | Urdu (India) |
| `ur_pk` | Urdu (Pakistan) |
| `utf8` | UTF-8 (Unicode ordering) |
| `uz` | Uzbek |
| `uz_arab` | Uzbek (Arabic) |
| `uz_arab_af` | Uzbek (Arabic, Afghanistan) |
| `uz_cyrl` | Uzbek (Cyrillic) |
| `uz_cyrl_uz` | Uzbek (Cyrillic, Uzbekistan) |
| `uz_latn` | Uzbek (Latin) |
| `uz_latn_uz` | Uzbek (Latin, Uzbekistan) |
| `vai` | Vai |
| `vai_latn` | Vai (Latin) |
| `vai_latn_lr` | Vai (Latin, Liberia) |
| `vai_vaii` | Vai (Vai) |
| `vai_vaii_lr` | Vai (Vai, Liberia) |
| `vi` | Vietnamese |
| `vi_vn` | Vietnamese (Vietnam) |
| `vun` | Vunjo |
| `vun_tz` | Vunjo (Tanzania) |
| `wae` | Walser |
| `wae_ch` | Walser (Switzerland) |
| `wo` | Wolof |
| `wo_sn` | Wolof (Senegal) |
| `xog` | Soga |
| `xog_ug` | Soga (Uganda) |
| `yav` | Yangben |
| `yav_cm` | Yangben (Cameroon) |
| `yi` | Yiddish |
| `yi_001` | Yiddish (World) |
| `yo` | Yoruba |
| `yo_bj` | Yoruba (Benin) |
| `yo_ng` | Yoruba (Nigeria) |
| `yue` | Cantonese |
| `yue_hans` | Cantonese (Simplified) |
| `yue_hans_cn` | Cantonese (Simplified, China) |
| `yue_hant` | Cantonese (Traditional) |
| `yue_hant_hk` | Cantonese (Traditional, Hong Kong SAR China) |
| `zgh` | Standard Moroccan Tamazight |
| `zgh_ma` | Standard Moroccan Tamazight (Morocco) |
| `zh` | Chinese |
| `zh_hans` | Chinese (Simplified) |
| `zh_hans_cn` | Chinese (Simplified, China) |
| `zh_hans_hk` | Chinese (Simplified, Hong Kong SAR China) |
| `zh_hans_mo` | Chinese (Simplified, Macau SAR China) |
| `zh_hans_sg` | Chinese (Simplified, Singapore) |
| `zh_hant` | Chinese (Traditional) |
| `zh_hant_hk` | Chinese (Traditional, Hong Kong SAR China) |
| `zh_hant_mo` | Chinese (Traditional, Macau SAR China) |
| `zh_hant_tw` | Chinese (Traditional, Taiwan) |
| `zu` | Zulu |
| `zu_za` | Zulu (South Africa) |

---
title: Collation support
source: https://docs.snowflake.com/en/sql-reference/collation.md
section: SQL General Reference
---

# Collation support

Collation allows you to specify alternative rules for comparing [text strings](data-types-text.md),
which can be used to compare and sort data according to a particular language or other user-specified rules.

## Overview of collation support

The following sections explain what collation is and how you use collation when comparing strings:

* Understanding collation
* Uses for collation
* Collation control

### Understanding collation

Text strings in Snowflake are stored using the UTF-8 character set and, by default, strings are compared according to
the Unicode codes that represent the characters in the string.

However, comparing strings based on their UTF-8 character representations might not provide the desired or expected behavior. For example:

* If special characters in a given language do not sort according to that language’s ordering standards, then sorting might return unexpected results.
* You might want the strings to be ordered by other rules, such as ignoring whether the characters are uppercase or lowercase.

Collation allows you to explicitly specify the rules to use for comparing strings, based on:

* Different locales (that is, different character sets for different languages).
* Case-sensitivity (that is, whether to use case-sensitive or case-insensitive string comparisons without explicitly calling the [UPPER](functions/upper.md) or
  [LOWER](functions/lower.md) functions to convert the strings).
* Accent-sensitivity (for example, whether `Z`, `Ź`, and `Ż` are considered the same letter or different letters).
* Punctuation-sensitivity (that is, whether comparisons use only letters or include all characters). For example, if a comparison is punctuation-insensitive, then `A-B-C` and `ABC` are treated as
  equivalent.
* Additional options, such as preferences for sorting based on the first letter in a string and trimming of leading and/or trailing blank spaces.

### Uses for collation

Collation can be used in a wide variety of operations, including (but not limited to):

| Usage | Example | Link |
| --- | --- | --- |
| Simple comparison | `... WHERE column1 = column2 ...` | [WHERE](constructs/where.md) |
| Joins | `... ON table1.column1 = table2.column2 ...` | [JOIN](constructs/join.md) |
| Sorting | `... ORDER BY column1 ...` | [ORDER BY](constructs/order-by.md) |
| Top-K sorting | `... ORDER BY column1 LIMIT N ...` | [LIMIT / FETCH](constructs/limit.md) |
| Aggregation | `... GROUP BY ...` | [GROUP BY](constructs/group-by.md) |
| Window functions | `... PARTITION BY ... ORDER BY ...` | [Window functions](functions-window.md) |
| Scalar functions | `... LEAST(column1, column2, column3) ...` | [Scalar functions](functions.md) |
| Aggregate functions | `... MIN(column1), MAX(column1) ...` | [Aggregate functions](functions-aggregation.md) |
| Data clustering | `... CLUSTER BY (column1) ...` | [Clustering Keys & Clustered Tables](../user-guide/tables-clustering-keys.md) |

### Collation control

Collation control is granular. You can explicitly specify the collation to use for:

* An account, using the account-level parameter [DEFAULT_DDL_COLLATION](parameters.md).
* All columns in all tables added to a database, using the [ALTER DATABASE](sql/alter-database.md) command.
* All columns in all tables added to a schema, using the [ALTER SCHEMA](sql/alter-schema.md) command.
* All columns added to a table, using the [ALTER TABLE](sql/alter-table.md) command.
* Individual columns in a table, using the [CREATE TABLE](sql/create-table.md) command.
* A specific comparison within a SQL statement (for example, `WHERE col1 = col2`). If multiple collations are applied to a
  statement, Snowflake determines the collation to use based on precedence. For more details about precedence, see
  Collation precedence in multi-string operations.

## Collation SQL constructs

You can use the following SQL constructs for collation:

* COLLATE clause for table column definitions
* COLLATE function
* COLLATION function

### COLLATE clause for table column definitions

Adding the optional COLLATE clause to the definition of a table column indicates that the specified collation is used for comparisons and other related operations performed on the data in
the column:

```sqlsyntax
CREATE TABLE <table_name> ( <col_name> <col_type> COLLATE '<collation_specification>'
                            [ , <col_name> <col_type> COLLATE '<collation_specification>' ... ]
                            [ , ... ]
                          )
```

If no COLLATE clause is specified for a column, Snowflake uses the default, which compares strings based on their UTF-8 character representations.

Also, Snowflake supports specifying an empty string for the collation specification (for example, `COLLATE ''`), which is equivalent to specifying no collation for the column.

However, due to precedence, specifying `COLLATE ''` for a column does not have the same effect as explicitly specifying `COLLATE 'utf8'`. For more details, see
Collation precedence in multi-string operations.

You can’t specify the COLLATE clause for indexed columns in [hybrid tables](../user-guide/tables-hybrid.md). For more information, see [Collations on hybrid table columns](sql/create-hybrid-table.md).

To see whether collation has been specified for the columns in a table, use [DESCRIBE TABLE](sql/desc-table.md). When you execute the DESCRIBE TABLE command, collation specifications are in the `type` column in the output. Alternatively, use the COLLATION function to view the collation, if any, for a specific column.

### COLLATE function

The [COLLATE](functions/collate.md) function uses the specified collation on the input string expression:

```sqlsyntax
COLLATE( <expression> , '<collation_specification>' )
```

This function can also be called using infix notation:

```sqlsyntax
<expression> COLLATE '<collation_specification>'
```

This function is particularly useful for explicitly specifying a particular collation for a particular operation (for example,
sorting), but it can also be used to:

* Allow collation in the [SELECT](sql/select.md) clause of a subquery, making all operations on the specified column in the outer query use the collation.
* Create a table using CTAS with a specified collation.

This example valuates using English case-insensitive collation:

```sqlexample
SELECT * FROM t1 WHERE COLLATE(col1 , 'en-ci') = 'Tango';
```

This example sorts the results using German (Deutsch) collation:

```sqlexample
SELECT * FROM t1 ORDER BY COLLATE(col1 , 'de');
```

This example creates a table with a column using French collation:

```sqlexample
CREATE TABLE t2 AS SELECT COLLATE(col1, 'fr') AS col1 FROM t1;
```

This example uses infix notation to create a table with a column using French collation:

```sqlexample
CREATE TABLE t2 AS SELECT col1 COLLATE 'fr' AS col1 FROM t1;
```

### COLLATION function

The [COLLATION](functions/collation.md) function returns the collation specification used by an expression, including a table column:

```sqlsyntax
COLLATION( <expression> )
```

If no collation has been specified for the expression, the function returns NULL.

Typically, if you use this function on a column name, it is best to use DISTINCT to avoid getting one row of output for each row in the table. For example:

```sqlexample
SELECT DISTINCT COLLATION(column1) FROM table1;
```

> **Note:**
>
> This function only returns the collation specification, not its precedence level. For more details about precedence, see Collation precedence in multi-string operations (in this
> topic).

## Collation specifications

When using a COLLATE clause (for a table column) or the COLLATE function (for an expression), you must include a collation specification,
which determines the comparison logic used for the column/expression.

A collation specification consists of a string of one or more specifiers separated by a hyphen (`-`), in the form of:

> `'<specifier>[-<specifier> ...]'`

The following specifiers are supported (for more information, see Supported specifiers in this topic):

* Locale
* Case-sensitivity
* Accent-sensitivity
* Punctuation-sensitivity
* First-letter preference
* Case-conversion
* Space-trimming

Specifiers are case-insensitive and can be in any order, except for locale, which must always be first, if used.

The following sections provide more details about collation specifications:

* Specification examples
* Supported specifiers

### Specification examples

Some examples of collation specification strings include:

* `'de'`: German (Deutsch) locale.
* `'de-ci-pi'`: German locale, with case-insensitive and punctuation-insensitive comparisons.
* `'fr_CA-ai'`: Canadian French locale, with accent-insensitive comparisons.
* `'en_US-trim'`: US English locale, with leading spaces and trailing spaces trimmed before the comparison.

You can also specify an empty string for a collation specification (for example, `COLLATE ''` or `COLLATE(col1, '')`), which indicates to use no collation.

### Supported specifiers

Locale:
:   Specifies the language-specific and country-specific rules to apply.

    Supports valid locale strings, consisting of a language code (required) and country code (optional) in the form of `language_country`. Some locale examples include:

    * `en` - English
    * `en_US` - American English
    * `fr` - French
    * `fr_CA` - Canadian French

    In addition, the `utf8` pseudo-locale specifies Unicode ordering, which is the default. For more details, see Differences in sorting when using UTF-8 or locale collation (in this topic).

    The locale specifier is optional, but, if used, must be the first specifier in the string.

    For the full list of locales supported by Snowflake, see [Collation locales supported by Snowflake](collation-locales.md).

Case-sensitivity:
:   Determines whether case is considered when comparing values. Possible values:

    * `cs` - Case-sensitive (default)
    * `ci` - Case-insensitive

    For example:

    | Collation Specification | Value | Result |
    | --- | --- | --- |
    | `'en-ci'` | `Abc = abc` | True |
    | `'en-cs'` / `en` | `Abc = abc` | False |

Accent-sensitivity:
:   Determines whether accented characters are considered equal to, or different from, their base characters. Possible values:

    * `as` - Accent-sensitive (default)
    * `ai` - Accent-insensitive

    For example:

    | Collation Specification | Value | Result | Notes |
    | --- | --- | --- | --- |
    | `'fr-ai'` | `E = É` | True |  |
    | `'fr-as'` / `'fr'` | `E = É` | False |  |
    | `'en-ai'` | `a = ą` | True | In English, these letters are treated as having only accent differences, so specifying accent-insensitivity results in the values comparing as equal. |
    | `'pl-ai'` | `a = ą` | False | In Polish, these letters are treated as separate base letters, so they always compare as unequal regardless of whether accent-insensitivity is specified. |
    | `'pl-as'` / `'pl'` | `a = ą` | False |  |

    The rules for accent-sensitivity and collation vary between languages. For example, in some languages, collation is always accent-sensitive, and you cannot turn it off even by specifying
    accent-insensitive collation.

Punctuation-sensitivity:
:   Determines whether non-letter characters matter. Possible values:

    * `ps` - Punctuation-sensitive.
    * `pi` - Punctuation-insensitive.

    Note that the default is locale-specific (that is, if punctuation-sensitivity is not specified, locale-specific rules are used). In most cases, the rules are equivalent to `ps`.

    For example:

    | Collation Specification | Value | Result | Notes |
    | --- | --- | --- | --- |
    | `'en-pi'` | `A-B-C = ABC` | True |  |
    | `'en-ps'` | `A-B-C = ABC` | False |  |

First-letter preference:
:   Determines whether, when sorting, uppercase or lowercase letters are first. Possible values:

    * `fl` - Lowercase letters sorted first.
    * `fu` - Uppercase letters sorted first.

    The default is locale-specific (that is, if no value is specified, locale-specific ordering is used). In most cases, the ordering is equivalent to `fl`.

    Also, this specifier has no impact on equality comparisons.

Case-conversion:
:   Results in strings being converted to lowercase or uppercase before comparisons. In some situations, this is faster than full locale-specific collation. Possible values:

    * `upper` - Convert the string to uppercase before comparisons.
    * `lower` - Convert the string to lowercase before comparisons.

    This specifier does not have a default (that is, if no value is specified, neither of the conversions occurs).

Space-trimming:
:   Removes leading/trailing spaces from strings before comparisons. This functionality can be useful for performing comparisons equivalent (except in extremely rare corner cases) in semantics to the SQL CHAR data type.

    Possible values:

    * `trim` - Remove both leading and trailing spaces before comparisons.
    * `ltrim` - Remove only leading spaces before comparisons.
    * `rtrim` - Remove only trailing spaces before comparisons.

    This specifier does not have a default (that is, if no value is specified, trimming is not performed).

    For example:

    | Collation Specification | Value | Result | Notes |
    | --- | --- | --- | --- |
    | `'en-trim'` | `__ABC_ = ABC` | True | For the purposes of these examples, underscore characters represent blank spaces. |
    | `'en-ltrim'` | `__ABC_ = ABC` | False |  |
    | `'en-rtrim'` | `__ABC_ = ABC` | False |  |
    | `'en'` | `__ABC_ = ABC` | False |  |

## Collation implementation details

The following sections provide more detail about support for collation:

* Case-insensitive comparisons
* Differences in sorting when using UTF-8 or locale collation
* Collation precedence in multi-string operations
* Limited support for collation in built-in functions
* Performance implications of using collation
* Additional considerations for using collation

### Case-insensitive comparisons

The following sections describe case-insensitive comparisons:

* Differences when comparing uppercase strings and original strings
* Character weights

#### Differences when comparing uppercase strings and original strings

In some languages, two lowercase characters have the same corresponding uppercase character. For example, some languages support both
dotted and undotted forms of lowercase `I` (for example, `i` and `ı`). Forcing the strings to uppercase affects comparisons.

The following example illustrates the difference:

Create the table:

```sqlexample
CREATE OR REPLACE TABLE test_table (col1 VARCHAR, col2 VARCHAR);
INSERT INTO test_table VALUES ('ı', 'i');
```

Query the data:

```sqlexample
SELECT col1 = col2,
       COLLATE(col1, 'lower') = COLLATE(col2, 'lower'),
       COLLATE(col1, 'upper') = COLLATE(col2, 'upper')
  FROM test_table;
```

```output
+-------------+-------------------------------------------------+-------------------------------------------------+
| COL1 = COL2 | COLLATE(COL1, 'LOWER') = COLLATE(COL2, 'LOWER') | COLLATE(COL1, 'UPPER') = COLLATE(COL2, 'UPPER') |
|-------------+-------------------------------------------------+-------------------------------------------------|
| False       | False                                           | True                                            |
+-------------+-------------------------------------------------+-------------------------------------------------+
```

#### Character weights

Snowflake supports the following collation specifications.

* [ICU](https://en.wikipedia.org/wiki/International_Components_for_Unicode) (International Components for Unicode).
* Snowflake-specific collation specifications (for example, `upper` and `lower`).

For case-insensitive comparison operations defined by the ICU, Snowflake follows the
[Unicode Collation Algorithm (UCA)](http://www.unicode.org/reports/tr10) and considers only the
primary and secondary weights, not the tertiary weights, of Unicode characters. Characters that differ only in their tertiary
weights are treated as identical. For example, using the `en-ci` collation specification, a space and a non-breaking space
are considered identical.

### Differences in sorting when using UTF-8 or locale collation

Strings are always stored internally in Snowflake in UTF-8, and can represent any character in any language supported by UTF-8. Therefore, when no collation is specified, the
behavior is the same as the UTF-8 collation (that is, `'utf8'`).

In Snowflake, `'utf8'` and `'bin'` are equivalent collation specifications. However, these specifications can’t be mixed in a single expression. For example, the following
query returns an error:

```sqlexample
SELECT 'abc' COLLATE 'bin' = 'abc' COLLATE 'utf8';
```

UTF-8 collation is based on the numeric representation of the character as opposed to the alphabetic order of the character.

This is analogous to sorting by the ordinal value of each ASCII character, which is important to note because uppercase letters have ordinal values lower than lowercase letters:

`A = 65`

`B = 66`

`...`

`a = 97`

`b = 98`

`...`

As a result:

* If you sort in UTF-8 order, all uppercase letters are returned before all lowercase letters:

  > `A` , `B` , … , `Y` , `Z` , … , `a` , `b` , … , `y` , `z`
* In contrast, the `'en'` collation specification sorts alphabetically (instead of using the UTF-8 internal representation), resulting in both `A` and `a` returned before both `B` and `b`:

  > `a` , `A` , `b` , `B` , …

Additionally, the differences between the `cs` and `ci` case-sensitivity specifiers affect sorting:

* `cs` (case-sensitive) always returns the lowercase version of a letter before the uppercase version of the same letter. For example, using `'en-cs'`:

  > `a` , `A` , `b` , `B` , …

  Case-sensitive is the default and, therefore, `'en-cs'` and `'en'` are equivalent.
* `ci` (case-insensitive) returns uppercase and lowercase versions of letters randomly with respect to each other, but still before both uppercase and lowercase version of later letters. For
  example, using `'en-ci'`:

  > `A` , `a` , `b` , `B` , …

Some non-alphabetic characters can also be sorted differently depending upon the collation setting. The following example shows that
the plus character (`+`) and minus character (`-`) are sorted differently for different collation settings:

Create the table:

```sqlexample
CREATE OR REPLACE TABLE demo (
    no_explicit_collation VARCHAR,
    en_ci VARCHAR COLLATE 'en-ci',
    en VARCHAR COLLATE 'en',
    utf_8 VARCHAR collate 'utf8');
INSERT INTO demo (no_explicit_collation) VALUES
    ('-'),
    ('+');
UPDATE demo SET
    en_ci = no_explicit_collation,
    en = no_explicit_collation,
    utf_8 = no_explicit_collation;
```

Query the data:

```sqlexample
SELECT MAX(no_explicit_collation), MAX(en_ci), MAX(en), MAX(utf_8)
  FROM demo;
```

```output
+----------------------------+------------+---------+------------+
| MAX(NO_EXPLICIT_COLLATION) | MAX(EN_CI) | MAX(EN) | MAX(UTF_8) |
|----------------------------+------------+---------+------------|
| -                          | +          | +       | -          |
+----------------------------+------------+---------+------------+
```

### Collation precedence in multi-string operations

When performing an operation on two (or more) strings, different collations might be specified for different strings.
Determining the collation to apply depends on how collation was specified for each input and the precedence of each
specifier.

There are three precedence levels (from highest to lowest):

Function:
:   Collation is specified using the COLLATE function in a SQL statement.

Column:
:   Collation was specified in the column definition.

None:
:   No collation is/was specified for a given expression/column, or collation with an empty specification is/was used (for example, `COLLATE(col1, '')` or `col1 STRING COLLATE ''`).

When determining the collation to use, the collation specification with the highest precedence is used. If multiple collations are specified with the same precedence level, their
values are compared, and if they are not equal, an error is returned.

For example, consider a table with the following column-level collation specifications:

```sqlexample
CREATE OR REPLACE TABLE collation_precedence_example(
  col1    VARCHAR,               -- equivalent to COLLATE ''
  col2_fr VARCHAR COLLATE 'fr',  -- French locale
  col3_de VARCHAR COLLATE 'de'   -- German locale
);
```

If the table is used in a statement comparing two strings, collation is applied as follows:

* This comparison uses the `'fr'` collation because the precedence for `col2_fr` is higher than the
  precedence for `col1`:

  ```sqlexample
  ... WHERE col1 = col2_fr ...
  ```
* This comparison uses the `'en'` collation, because it is explicitly specified in the statement,
  which takes precedence over the collation for `col2_fr`:

  ```sqlexample
  ... WHERE col1 COLLATE 'en' = col2_fr ...
  ```
* This comparison returns an error because the expressions have different collations at the same precedence level:

  ```sqlexample
  ... WHERE col2_fr = col3_de ...
  ```
* This comparison uses the `'de'` collation because collation for `col2_fr` has been removed:

  ```sqlexample
  ... WHERE col2_fr COLLATE '' = col3_de ...
  ```
* This comparison returns an error because the expressions have different collations at the same precedence level:

  ```sqlexample
  ... WHERE col2_fr COLLATE 'en' = col3_de COLLATE 'de' ...
  ```

Because explicit collation has higher precedence than no collation, specifying an empty string (or specifying no collation) is
different from explicitly specifying `'utf8'` collation. The last two statements in the following code examples show the difference:

For example, consider a table with the following column-level collation specifications:

```sqlexample
CREATE OR REPLACE TABLE collation_precedence_example2(
  s1 STRING COLLATE '',
  s2 STRING COLLATE 'utf8',
  s3 STRING COLLATE 'fr'
);
```

If the table is used in a statement comparing two strings, collation is applied as follows:

* This comparison uses `'utf8'` because `s1` has no collation and `'utf8'` is the default:

  ```sqlexample
  ... WHERE s1 = 'a' ...
  ```
* This comparison uses `'utf8'` because `s1` has no collation and `s2` has explicit `'utf8'` collation

  ```sqlexample
  ... WHERE s1 = s2 ...
  ```
* This comparison executes without error because `s1` has no collation and `s3` has explicit `fr` collation, so the
  explicit collation takes precedence:

  ```sqlexample
  ... WHERE s1 = s3 ...
  ```
* This comparison causes an error because `s2` and `s3` have different collations specified at the same precedence level:

  ```sqlexample
  ... WHERE s2 = s3 ...
  ```

  ```output
  002322 (42846): SQL compilation error: Incompatible collations: 'fr' and 'utf8'
  ```

### Limited support for collation in built-in functions

Collation is supported in only a subset of string functions. Functions that could reasonably be expected to implement
collation, but do not yet support collation, return an error when used with collation. These error messages are
displayed not only when calling the COLLATE function, but also when calling a string function on a column that was
defined as collated in the CREATE TABLE or ALTER TABLE statement that created that column.

#### Functions that support collation

These functions support collation:

* [[ NOT ] BETWEEN](functions/between.md)
* [CASE](functions/case.md)
* [CHARINDEX](functions/charindex.md)
* [COALESCE](functions/coalesce.md)
* [CONCAT , ||](functions/concat.md)
* [CONTAINS](functions/contains.md)
* [DECODE](functions/decode.md)
* [ENDSWITH](functions/endswith.md)
* [[ NOT ] EQUAL_NULL](functions/equal_null.md)
* [GREATEST](functions/greatest.md)
* [IFF](functions/iff.md)
* [IFNULL](functions/ifnull.md)
* [[ NOT ] ILIKE](functions/ilike.md)
* [ILIKE ANY](functions/ilike_any.md) (partial support)
* [LEAST](functions/least.md)
* [LEFT](functions/left.md)
* [LENGTH, LEN](functions/length.md) (supported without impact)
* [[ NOT ] LIKE](functions/like.md)
* [LIKE ALL](functions/like_all.md) (partial support)
* [LIKE ANY](functions/like_any.md) (partial support)
* [LISTAGG](functions/listagg.md)
* [LPAD](functions/lpad.md)
* [MAX](functions/max.md)
* [MIN](functions/min.md)
* [NULLIF](functions/nullif.md)
* [NVL](functions/nvl.md)
* [NVL2](functions/nvl2.md)
* [POSITION](functions/position.md)
* [REPLACE](functions/replace.md)
* [RIGHT](functions/right.md)
* [RPAD](functions/rpad.md)
* [SPLIT](functions/split.md)
* [SPLIT_PART](functions/split_part.md)
* [STARTSWITH](functions/startswith.md)
* [SUBSTR , SUBSTRING](functions/substr.md) (supported without impact)

Some of these functions have limitations on their use with collation. For information, see the documentation of each
specific function.

This list might expand over time.

> **Caution:**
>
> Some SQL operators and predicates, such as `||` (concatenation) and `LIKE`, are implemented as functions
> (and are available as functions, for example `LIKE()` and `CONCAT()`). If a predicate or operator is implemented as
> a function, and the function does not support collation, then the predicate or operator does not support collation.

See also Collation limitations.

### Performance implications of using collation

Using collation can affect the performance of various database operations:

* Operations involving comparisons might be slower.

  This can impact simple [WHERE](constructs/where.md) clauses, as well as joins, sorts, GROUP BY operations, etc.
* When used with some functions in [WHERE](constructs/where.md) predicates, micro-partition pruning might be less
  efficient.
* Using collation in a [WHERE](constructs/where.md) predicate that is different from the collation specified for the
  column might result in reduced pruning efficiency or the complete elimination of pruning.

### Additional considerations for using collation

* Remember that, despite the similarity in their names, the following collation functions return different results:

  + COLLATE explicitly specifies which collation to use.
  + COLLATION shows which collation is used if none is specified explicitly.
* A column with a collation specification can use characters that are not from the locale for the collation, which might impact sorting.

  For example, if a column is created with a `COLLATE 'en'` clause, the data in the column can contain the non-English character `É`. In this situation, the character `É` is sorted close to
  `E`.
* You can specify collation operations that are not necessarily meaningful.

  For example, you could specify that Polish data is compared to French data using German collation:

  ```sqlexample
  SELECT ... WHERE COLLATE(French_column, 'de') = Polish_column;
  ```

  However, Snowflake does not recommend using the feature this way because it might return unexpected or unintended results.
* After a table column is defined, you cannot change the collation for the column. In other words, after a column has been created with a particular collation using a
  [CREATE TABLE](sql/create-table.md) statement, you cannot use [ALTER TABLE](sql/alter-table.md) to change the
  collation.

  However, you can specify a different collation in a DML statement, such as a [SELECT](sql/select.md) statement, that references the column.
* When you create a view using the [CREATE VIEW](sql/create-view.md) command, the view’s columns inherit the collation
  specifications of the columns in the source tables.

## Differences between `ci` and `upper` / `lower`

The `upper` and `lower` collation specifications can provide better performance than the `ci` collation specification during
string comparison and sorting. However, `upper` and `lower` have slightly different effects from `ci`, as explained in the
next sections:

* Differences in comparisons of widths, spaces, and scripts
* Differences in handling ignorable code points
* Differences when characters are represented by different code points
* Differences with sequences of code points representing a single character
* Differences when changes to case result in multiple code points
* Differences in sort order

### Differences in comparisons of widths, spaces, and scripts

During string comparisons, the `ci` collation specification recognizes that different visual representations
of a character might still refer to the same character, and treats them accordingly. To allow for more performant
comparisons, the `upper` and `lower` collation specifications do not recognize these different visual
representations of a character as the same character.

Specifically, the `ci` collation specification ignores some differences in the following categories,
while the `upper` and `lower` collation specifications do not ignore them:

* Character widths
* Types of spaces
* Character scripts

The following sections include examples that illustrate these differences.

> **Note:**
>
> The comparison behavior of full-width and half-width characters might depend on the locale.

#### Example of comparisons of characters with different widths

Create a table named `different_widths` and insert rows containing characters of different widths:

```sqlexample
CREATE OR REPLACE TABLE different_widths(codepoint STRING, description STRING);

INSERT INTO different_widths VALUES
  ('a', 'ASCII a'),
  ('A', 'ASCII A'),
  ('ａ', 'Full-width a'),
  ('Ａ', 'Full-width A');

SELECT codepoint VISUAL_CHAR,
       'U+'  || TO_CHAR(UNICODE(codepoint), '0XXX') codepoint_representation,
       description
  FROM different_widths;
```

```output
+-------------+--------------------------+--------------+
| VISUAL_CHAR | CODEPOINT_REPRESENTATION | DESCRIPTION  |
|-------------+--------------------------+--------------|
| a           | U+0061                   | ASCII a      |
| A           | U+0041                   | ASCII A      |
| ａ          | U+FF41                   | Full-width a |
| Ａ          | U+FF21                   | Full-width A |
+-------------+--------------------------+--------------+
```

The following query shows that the `ci` collation specification finds one distinct value when comparing the characters.
The `upper` and `lower` collation specifications find two distinct values when comparing the characters.

```sqlexample
SELECT COUNT(*) NumRows,
       COUNT(DISTINCT UNICODE(codepoint)) DistinctCodepoints,
       COUNT(DISTINCT codepoint COLLATE 'en-ci') DistinctCodepoints_EnCi,
       COUNT(DISTINCT codepoint COLLATE 'upper') DistinctCodepoints_Upper,
       COUNT(DISTINCT codepoint COLLATE 'lower') DistinctCodepoints_Lower
  FROM different_widths;
```

```output
+---------+--------------------+-------------------------+--------------------------+--------------------------+
| NUMROWS | DISTINCTCODEPOINTS | DISTINCTCODEPOINTS_ENCI | DISTINCTCODEPOINTS_UPPER | DISTINCTCODEPOINTS_LOWER |
|---------+--------------------+-------------------------+--------------------------+--------------------------|
|       4 |                  4 |                       1 |                        2 |                        2 |
+---------+--------------------+-------------------------+--------------------------+--------------------------+
```

The `ci` collation specification ignores differences in both width and case, which means that it finds no differences
between the characters. The `upper` and `lower` collation specifications only ignore differences in case, so
the half-width characters are considered to be different characters than the full-width characters.

The half-width lowercase `a` is considered to be the same as the half-width uppercase `A`, and the full-width
lowercase `a` is considered to be the same as the full-width uppercase `A`. Therefore, the `upper` and
`lower` collation specifications find two distinct values.

#### Example of comparisons of different types of spaces

Create a table named `different_whitespaces` and insert rows with different types of spaces:

```sqlexample
CREATE OR REPLACE TABLE different_whitespaces(codepoint STRING, description STRING);

INSERT INTO different_whitespaces VALUES
  (' ', 'ASCII space'),
  ('\u00A0', 'Non-breaking space'),
  (' ', 'Ogham space mark'),
  (' ', 'en space'),
  (' ', 'em space');

SELECT codepoint visual_char,
       'U+'  || TO_CHAR(unicode(codepoint), '0XXX')
       codepoint_representation, description
  FROM different_whitespaces;
```

```output
+-------------+--------------------------+--------------------+
| VISUAL_CHAR | CODEPOINT_REPRESENTATION | DESCRIPTION        |
|-------------+--------------------------+--------------------|
|             | U+0020                   | ASCII space        |
|             | U+00A0                   | Non-breaking space |
|             | U+1680                   | Ogham space mark   |
|             | U+2002                   | en space           |
|             | U+2003                   | em space           |
+-------------+--------------------------+--------------------+
```

The following query shows that the `ci` collation specification finds one distinct value when comparing the spaces, which
means that there are no differences between them. The `upper` and `lower` collation specifications find five distinct
values when comparing the spaces, which means that they are all different.

```sqlexample
SELECT COUNT(*) NumRows,
       COUNT(DISTINCT UNICODE(codepoint)) NumDistinctCodepoints,
       COUNT(DISTINCT codepoint COLLATE 'en-ci') DistinctCodepoints_EnCi,
       COUNT(DISTINCT codepoint COLLATE 'upper') DistinctCodepoints_Upper,
       COUNT(DISTINCT codepoint COLLATE 'lower') DistinctCodepoints_Lower
  FROM different_whitespaces;
```

```output
+---------+-----------------------+-------------------------+--------------------------+--------------------------+
| NUMROWS | NUMDISTINCTCODEPOINTS | DISTINCTCODEPOINTS_ENCI | DISTINCTCODEPOINTS_UPPER | DISTINCTCODEPOINTS_LOWER |
|---------+-----------------------+-------------------------+--------------------------+--------------------------|
|       5 |                     5 |                       1 |                        5 |                        5 |
+---------+-----------------------+-------------------------+--------------------------+--------------------------+
```

#### Example of comparisons of characters with different scripts

Create a table named `different_scripts` and insert rows containing characters that use different scripts:

```sqlexample
CREATE OR REPLACE TABLE different_scripts(codepoint STRING, description STRING);

INSERT INTO different_scripts VALUES
  ('1', 'ASCII digit 1'),
  ('¹', 'Superscript 1'),
  ('₁', 'Subscript 1'),
  ('①', 'Circled digit 1'),
  ('੧', 'Gurmukhi digit 1'),
  ('௧', 'Tamil digit 1');

SELECT codepoint VISUAL_CHAR,
       'U+'  || TO_CHAR(UNICODE(codepoint), '0XXX') codepoint_representation,
       description
  FROM different_scripts;
```

```output
+-------------+--------------------------+------------------+
| VISUAL_CHAR | CODEPOINT_REPRESENTATION | DESCRIPTION      |
|-------------+--------------------------+------------------|
| 1           | U+0031                   | ASCII digit 1    |
| ¹           | U+00B9                   | Superscript 1    |
| ₁           | U+2081                   | Subscript 1      |
| ①           | U+2460                   | Circled digit 1  |
| ੧           | U+0A67                   | Gurmukhi digit 1 |
| ௧           | U+0BE7                   | Tamil digit 1    |
+-------------+--------------------------+------------------+
```

The following query shows that the `ci` collation specification finds one distinct value when comparing the characters, which
means that there are no differences between them. The `upper` and `lower` collation specifications find six distinct
values when comparing the characters, which means that they are all different.

```sqlexample
SELECT COUNT(*) NumRows,
       COUNT(DISTINCT UNICODE(codepoint)) DistinctCodepoints,
       COUNT(DISTINCT codepoint COLLATE 'en-ci') DistinctCodepoints_EnCi,
       COUNT(DISTINCT codepoint COLLATE 'upper') DistinctCodepoints_Upper,
       COUNT(DISTINCT codepoint COLLATE 'lower') DistinctCodepoints_Lower
  FROM different_scripts;
```

```output
+---------+--------------------+-------------------------+--------------------------+--------------------------+
| NUMROWS | DISTINCTCODEPOINTS | DISTINCTCODEPOINTS_ENCI | DISTINCTCODEPOINTS_UPPER | DISTINCTCODEPOINTS_LOWER |
|---------+--------------------+-------------------------+--------------------------+--------------------------|
|       6 |                  6 |                       1 |                        6 |                        6 |
+---------+--------------------+-------------------------+--------------------------+--------------------------+
```

### Differences in handling ignorable code points

The Unicode Collation Algorithm specifies that collation elements (code points) can be
[ignorable](https://www.unicode.org/reports/tr10/tr10-36.html#Ignorables_Defn), which means that a code point is not considered
during string comparison and sorting.

* With the `ci` collation specification, these code points are ignored. This can make it difficult to search for or replace
  ignorable code points.
* With the `upper` and `lower` collation specifications, these code points are not ignored.

For example, the code point `U+0001` is ignorable. If you compare this code point to an empty string with the `en-ci`
collation specification, the result is TRUE because `U+0001` is ignored:

```sqlexample
SELECT '\u0001' = '' COLLATE 'en-ci';
```

```output
+-------------------------------+
| '\U0001' = '' COLLATE 'EN-CI' |
|-------------------------------|
| True                          |
+-------------------------------+
```

On the other hand, if you use the `upper` or `lower` collation specification, the result is FALSE because `U+0001` is not
ignored:

```sqlexample
SELECT '\u0001' = '' COLLATE 'upper';
```

```output
+-------------------------------+
| '\U0001' = '' COLLATE 'UPPER' |
|-------------------------------|
| False                         |
+-------------------------------+
```

Similarly, suppose that you call the [REPLACE](functions/replace.md) function to remove this code point from a string.
If you use the `en-ci` collation specification, the function does not remove the code point because `U+0001` is ignored.

As shown in the following example, the string returned by the REPLACE function has the same length as the string passed into the
function because the function does not remove the `U+0001` character.

```sqlexample
SELECT
  LEN('abc\u0001') AS original_length,
  LEN(REPLACE('abc\u0001' COLLATE 'en-ci', '\u0001')) AS length_after_replacement;
```

```output
+-----------------+--------------------------+
| ORIGINAL_LENGTH | LENGTH_AFTER_REPLACEMENT |
|-----------------+--------------------------|
|               4 |                        4 |
+-----------------+--------------------------+
```

On the other hand, if you use the `upper` or `lower` collation specification, the function removes the code point from the
string, returning a shorter string.

```sqlexample
SELECT
  LEN('abc\u0001') AS original_length,
  LEN(REPLACE('abc\u0001' COLLATE 'upper', '\u0001')) AS length_after_replacement;
```

```output
+-----------------+--------------------------+
| ORIGINAL_LENGTH | LENGTH_AFTER_REPLACEMENT |
|-----------------+--------------------------|
|               4 |                        3 |
+-----------------+--------------------------+
```

### Differences when characters are represented by different code points

In Unicode,
[different sequences of code points can represent the same character](https://en.wikipedia.org/wiki/Unicode_equivalence).
For example, the Greek Small Letter Iota with Dialytika and Tonos can be represented by the
[precomposed character](https://en.wikipedia.org/wiki/Precomposed_character) with the code point `U+0390` or by the
sequence of code points `U+03b9` `U+0308` `U+0301` for the decomposed characters.

If you use the `ci` collation specification, the different sequences of code points for a character are treated as the same
character. For example, the code point `U+0390` and the sequence of code points `U+03b9` `U+0308` `U+0301` are treated
as equivalent:

```sqlexample
SELECT '\u03b9\u0308\u0301' = '\u0390' COLLATE 'en-ci';
```

```output
+-------------------------------------------------+
| '\U03B9\U0308\U0301' = '\U0390' COLLATE 'EN-CI' |
|-------------------------------------------------|
| True                                            |
+-------------------------------------------------+
```

In order to improve performance for the `upper` and `lower` collation specifications, the sequences are not handled in the
same way. Two sequences of code points are considered to be equivalent only if they result in the same binary representation
after they are converted to uppercase or lowercase.

For example, using the `upper` specification with the code point `U+0390` and the sequence of code points `U+03b9`
`U+0308` `U+0301` results in characters that are treated as equal:

```sqlexample
SELECT '\u03b9\u0308\u0301' = '\u0390' COLLATE 'upper';
```

```output
+-------------------------------------------------+
| '\U03B9\U0308\U0301' = '\U0390' COLLATE 'UPPER' |
|-------------------------------------------------|
| True                                            |
+-------------------------------------------------+
```

Using the `lower` specification results in characters that are not equal:

```sqlexample
SELECT '\u03b9\u0308\u0301' = '\u0390' COLLATE 'lower';
```

```output
+-------------------------------------------------+
| '\U03B9\U0308\U0301' = '\U0390' COLLATE 'LOWER' |
|-------------------------------------------------|
| False                                           |
+-------------------------------------------------+
```

These differences are less likely to occur when using `upper` (rather than `lower`) because there is only one composite
uppercase code point (`U+0130`), compared to over 100 composite lowercase code points.

### Differences with sequences of code points representing a single character

In cases where a sequence of code points represents a single character, the `ci` collation specification recognizes that the
sequence represents a single character and does not match individual code points in the sequence.

For example, the sequence of code points `U+03b9` `U+0308` `U+0301` represents a single character (the Greek Small Letter
Iota with Dialytika and Tonos). `U+0308` and `U+0301` represent accents applied to `U+03b9`.

For the `ci` collation specification, if you use the [CONTAINS](functions/contains.md) function to determine if the
sequence `U+03b9` `U+0308` contains `U+03b9` or `U+0308`, the function returns FALSE because the sequence `U+03b9`
`U+0308` is treated as a single character:

```sqlexample
SELECT CONTAINS('\u03b9\u0308', '\u03b9' COLLATE 'en-ci');
```

```output
+----------------------------------------------------+
| CONTAINS('\U03B9\U0308', '\U03B9' COLLATE 'EN-CI') |
|----------------------------------------------------|
| False                                              |
+----------------------------------------------------+
```

```sqlexample
SELECT CONTAINS('\u03b9\u0308', '\u0308' COLLATE 'en-ci');
```

```output
+----------------------------------------------------+
| CONTAINS('\U03B9\U0308', '\U0308' COLLATE 'EN-CI') |
|----------------------------------------------------|
| False                                              |
+----------------------------------------------------+
```

To improve performance, the `upper` and `lower` specifications do not treat these sequences as a single character. In the
example above, the CONTAINS function returns TRUE because these specifications treat the sequence of code points as separate
characters:

```sqlexample
SELECT CONTAINS('\u03b9\u0308', '\u03b9' COLLATE 'upper');
```

```output
+----------------------------------------------------+
| CONTAINS('\U03B9\U0308', '\U03B9' COLLATE 'UPPER') |
|----------------------------------------------------|
| True                                               |
+----------------------------------------------------+
```

```sqlexample
SELECT CONTAINS('\u03b9\u0308', '\u0308' COLLATE 'upper');
```

```output
+----------------------------------------------------+
| CONTAINS('\U03B9\U0308', '\U0308' COLLATE 'UPPER') |
|----------------------------------------------------|
| True                                               |
+----------------------------------------------------+
```

### Differences when changes to case result in multiple code points

For some composite characters, the uppercase or lowercase version of the character is represented by a sequence of code points.
For example, the uppercase character for the German character ß is a sequence of two S characters (SS).

Even though ß and SS are equivalent, when you use the `upper` collation specification, searches of ß and SS return different
results. Sequences produced by case conversion either match in their entirety or not at all.

```sqlexample
SELECT CONTAINS('ß' , 's' COLLATE 'upper');
```

```output
+--------------------------------------+
| CONTAINS('SS' , 'S' COLLATE 'UPPER') |
|--------------------------------------|
| False                                |
+--------------------------------------+
```

```sqlexample
SELECT CONTAINS('ss', 's' COLLATE 'upper');
```

```output
+-------------------------------------+
| CONTAINS('SS', 'S' COLLATE 'UPPER') |
|-------------------------------------|
| True                                |
+-------------------------------------+
```

### Differences in sort order

Sorting for the `upper` and `lower` collation specifications works differently from sorting for the `ci` specification:

* With the `ci` specification, strings are sorted by collation key. In general, the collation key can account for case
  sensitivity, accent sensitivity, locale, etc.
* With the `upper` and `lower` specifications, strings are sorted by code point to improve performance.

For example, some characters within the ASCII range (such as `+` and `-`) sort differently:

```sqlexample
SELECT '+' < '-' COLLATE 'en-ci';
```

```output
+---------------------------+
| '+' < '-' COLLATE 'EN-CI '|
|---------------------------|
| False                     |
+---------------------------+
```

```sqlexample
SELECT '+' < '-' COLLATE 'upper';
```

```output
+---------------------------+
| '+' < '-' COLLATE 'UPPER' |
|---------------------------|
| True                      |
+---------------------------+
```

As another example, strings with ignored code points sort in a different order:

```sqlexample
SELECT 'a\u0001b' < 'ab' COLLATE 'en-ci';
```

```output
+-----------------------------------+
| 'A\U0001B' < 'AB' COLLATE 'EN-CI' |
|-----------------------------------|
| False                             |
+-----------------------------------+
```

```sqlexample
SELECT 'a\u0001b' < 'ab' COLLATE 'upper';
```

```output
+-----------------------------------+
| 'A\U0001B' < 'AB' COLLATE 'UPPER' |
|-----------------------------------|
| True                              |
+-----------------------------------+
```

In addition, emojis sort differently:

```sqlexample
SELECT 'abc' < '❄' COLLATE 'en-ci';
```

```output
+-----------------------------+
| 'ABC' < '❄' COLLATE 'EN-CI' |
|-----------------------------|
| False                       |
+-----------------------------+
```

```sqlexample
SELECT 'abc' < '❄' COLLATE 'upper';
```

```output
+-----------------------------+
| 'ABC' < '❄' COLLATE 'UPPER' |
|-----------------------------|
| True                        |
+-----------------------------+
```

## Collation limitations

The following limitations apply to collation:

* Collation is supported only for strings up to 64 MB
* Collation not supported with UDFs
* Collation not supported for strings in VARIANT, ARRAY, or OBJECT values
* Clean rooms support only default collation

### Collation is supported only for strings up to 64 MB

Although the Snowflake VARCHAR data type supports strings up to 128 MB, Snowflake supports collation only when the
resulting string is 64 MB or less. (Some collation operations can lengthen a string.)

### Collation not supported with UDFs

Snowflake does not support collation with UDFs (user-defined functions):

* You cannot return a collated string value from a UDF; the server reports that the actual return type is incompatible with
  the declared return type.
* If you pass a collated string value to a UDF, the collation information is not passed; the UDF sees the string as an uncollated
  string.

### Collation not supported for strings in VARIANT, ARRAY, or OBJECT values

Strings stored inside a VARIANT, OBJECT, or ARRAY value do not include a collation specification. Therefore:

* Comparison of these values always uses the `'utf8'` collation.
* When a VARCHAR value with a collation specification is used to construct an ARRAY, OBJECT, or VARIANT value, the
  collation specification is not preserved.
* You can still compare a value stored inside an ARRAY, OBJECT, or VARIANT by extracting the value, casting to
  VARCHAR, and adding a collation specification. For example:

  ```sqlexample
  COLLATE(VARIANT_COL:fld1::VARCHAR, 'en-ci') = VARIANT_COL:fld2::VARCHAR
  ```

### Clean rooms support only default collation

Clean rooms support only default collation at the account level. You can check this by running SHOW PARAMETERS LIKE
‘DEFAULT_DDL_COLLATION’ IN ACCOUNT;

## Collation examples

The following statement creates a table that uses different collation for each column:

```sqlexample
CREATE OR REPLACE TABLE collation_demo (
  uncollated_phrase VARCHAR,
  utf8_phrase VARCHAR COLLATE 'utf8',
  english_phrase VARCHAR COLLATE 'en',
  spanish_phrase VARCHAR COLLATE 'es');

INSERT INTO collation_demo (
      uncollated_phrase,
      utf8_phrase,
      english_phrase,
      spanish_phrase)
   VALUES (
     'pinata',
     'pinata',
     'pinata',
     'piñata');
```

> **Note:**
>
> Collations don’t affect the set of characters that can be stored. Snowflake supports all UTF-8 characters.

The following query on the table shows the expected values:

```sqlexample
SELECT * FROM collation_demo;
```

```output
+-------------------+-------------+----------------+----------------+
| UNCOLLATED_PHRASE | UTF8_PHRASE | ENGLISH_PHRASE | SPANISH_PHRASE |
|-------------------+-------------+----------------+----------------|
| pinata            | pinata      | pinata         | piñata         |
+-------------------+-------------+----------------+----------------+
```

The following query does not find a match because the character `ñ` does not match `n`:

```sqlexample
SELECT * FROM collation_demo WHERE spanish_phrase = uncollated_phrase;
```

```output
+-------------------+-------------+----------------+----------------+
| UNCOLLATED_PHRASE | UTF8_PHRASE | ENGLISH_PHRASE | SPANISH_PHRASE |
|-------------------+-------------+----------------+----------------|
+-------------------+-------------+----------------+----------------+
```

Changing collation doesn’t force related, but unequal, characters (for example, `ñ` and `n`) to be treated as equal:

```sqlexample
CREATE OR REPLACE TABLE collation_demo1 (
  uncollated_phrase VARCHAR,
  utf8_phrase VARCHAR COLLATE 'utf8',
  english_phrase VARCHAR COLLATE 'en-ai',
  spanish_phrase VARCHAR COLLATE 'es-ai');

INSERT INTO collation_demo1 (
    uncollated_phrase,
    utf8_phrase,
    english_phrase,
    spanish_phrase)
  VALUES (
    'piñata',
    'piñata',
    'piñata',
    'piñata');

SELECT uncollated_phrase = 'pinata',
       utf8_phrase = 'pinata',
       english_phrase = 'pinata',
       spanish_phrase = 'pinata'
  FROM collation_demo1;
```

```output
+------------------------------+------------------------+---------------------------+---------------------------+
| UNCOLLATED_PHRASE = 'PINATA' | UTF8_PHRASE = 'PINATA' | ENGLISH_PHRASE = 'PINATA' | SPANISH_PHRASE = 'PINATA' |
|------------------------------+------------------------+---------------------------+---------------------------|
| False                        | False                  | True                      | False                     |
+------------------------------+------------------------+---------------------------+---------------------------+
```

Only the English phrase returns `True` for the following reasons:

* Uncollated comparisons don’t ignore accents.
* `utf8` collation comparisons don’t ignore accents.
* The `en-ai` and `es-ai` collation comparisons ignore accents, but in Spanish, `ñ` is treated as an
  individual character rather than an accented `n`.

The following examples demonstrate the effect of collation on sort order:

```sqlexample
INSERT INTO collation_demo (spanish_phrase) VALUES
  ('piña colada'),
  ('Pinatubo (Mount)'),
  ('pint'),
  ('Pinta');
```

```sqlexample
SELECT spanish_phrase FROM collation_demo
  ORDER BY spanish_phrase;
```

```output
+------------------+
| SPANISH_PHRASE   |
|------------------|
| Pinatubo (Mount) |
| pint             |
| Pinta            |
| piña colada      |
| piñata           |
+------------------+
```

The following query returns the values in a different order by changing the
collation to from `'es'` (Spanish) to `'utf8'`:

```sqlexample
SELECT spanish_phrase FROM collation_demo
  ORDER BY COLLATE(spanish_phrase, 'utf8');
```

```output
+------------------+
| SPANISH_PHRASE   |
|------------------|
| Pinatubo (Mount) |
| Pinta            |
| pint             |
| piña colada      |
| piñata           |
+------------------+
```

This example shows how to use the COLLATION function to view the collation for an expression, such as a column:

```sqlexample
CREATE OR REPLACE TABLE collation_demo2 (
  c1 VARCHAR COLLATE 'fr',
  c2 VARCHAR COLLATE '');

INSERT INTO collation_demo2 (c1, c2) VALUES
  ('a', 'a'),
  ('b', 'b');
```

```sqlexample
SELECT DISTINCT COLLATION(c1), COLLATION(c2) FROM collation_demo2;
```

```output
+---------------+---------------+
| COLLATION(C1) | COLLATION(C2) |
|---------------+---------------|
| fr            | NULL          |
+---------------+---------------+
```

You can also use [DESCRIBE TABLE](sql/desc-table.md) to view collation information about the columns in a table:

```sqlexample
DESC TABLE collation_demo2;
```

```output
+------+--------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type                           | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+--------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| C1   | VARCHAR(16777216) COLLATE 'fr' | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| C2   | VARCHAR(16777216)              | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+--------------------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

---
title: Comparison operators
source: https://docs.snowflake.com/en/sql-reference/operators-comparison.md
section: SQL General Reference
---

# Comparison operators

Comparison operators are used to test the equality of two input expressions. They are typically used in the [WHERE](constructs/where.md) clause of a query.

| Operator | Syntax | Description |
| --- | --- | --- |
| `=` | `a = b` | `a` is equal to `b`. |
| `!=` | `a != b` | `a` is not equal to `b`. |
| `<>` | `a <> b` | `a` is not equal to `b`. |
| `>` | `a > b` | `a` is greater than `b`. |
| `>=` | `a >= b` | `a` is greater than or equal to `b`. |
| `<` | `a < b` | `a` is less than `b`. |
| `<=` | `a <= b` | `a` is less than or equal to `b`. |

---
title: Conditional expression functions
source: https://docs.snowflake.com/en/sql-reference/expressions-conditional.md
section: SQL General Reference
---

# Conditional expression functions

Conditional expression functions return values based on logical operations
using each expression passed to the function. For example, the `BOOLOR`
function takes two numeric expressions and returns True if either (or both) of
the expressions evaluate to a True (non-zero) value.

* [[ NOT ] BETWEEN](functions/between.md)
* [BOOLAND](functions/booland.md)
* [BOOLNOT](functions/boolnot.md)
* [BOOLOR](functions/boolor.md)
* [BOOLXOR](functions/boolxor.md)
* [CASE](functions/case.md)
* [COALESCE](functions/coalesce.md)
* [DECODE](functions/decode.md)
* [[ NOT ] EQUAL_NULL](functions/equal_null.md)
* [GREATEST](functions/greatest.md)
* [GREATEST_IGNORE_NULLS](functions/greatest_ignore_nulls.md)
* [IFF](functions/iff.md)
* [IFNULL](functions/ifnull.md)
* [[ NOT ] IN](functions/in.md)
* [IS [ NOT ] DISTINCT FROM](functions/is-distinct-from.md)
* [IS [ NOT ] NULL](functions/is-null.md)
* [IS_NULL_VALUE](functions/is_null_value.md)
* [LEAST](functions/least.md)
* [LEAST_IGNORE_NULLS](functions/least_ignore_nulls.md)
* [NULLIF](functions/nullif.md)
* [NULLIFZERO](functions/nullifzero.md)
* [NVL](functions/nvl.md)
* [NVL2](functions/nvl2.md)
* [REGR_VALX](functions/regr_valx.md)
* [REGR_VALY](functions/regr_valy.md)
* [ZEROIFNULL](functions/zeroifnull.md)

---
title: CONNECT BY
source: https://docs.snowflake.com/en/sql-reference/constructs/connect-by.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# CONNECT BY

Joins a table to itself to process hierarchical data in the table. The `CONNECT BY` subclause of the
[FROM](from.md) clause iterates to process the data.

For example, you can create a query that shows a “parts explosion” to
recursively list a component and the sub-components of that component.

The Snowflake syntax for CONNECT BY is mostly compatible with the Oracle syntax.

See also:
:   [WITH](with.md)

## Syntax

The general form of a statement with CONNECT BY is similar to the following
(some variations in order are allowed, but are not shown):

```sqlsyntax
SELECT <column_list> [ , <level_expression> ]
  FROM <data_source>
    START WITH <predicate>
    CONNECT BY [ PRIOR ] <col1_identifier> = [ PRIOR ] <col2_identifier>
           [ , [ PRIOR ] <col3_identifier> = [ PRIOR ] <col4_identifier> ]
           ...
  ...
```

## Parameters

`column_list`
:   This generally follows the rules for the projection clause of a [SELECT](../sql/select.md) statement.

`level_expression`
:   CONNECT BY queries allow some pseudo-columns.
    One of those pseudo-columns is `LEVEL`, which indicates the current level
    of the hierarchy (where level 1 represents the top of the hierarchy).
    The projection clause of the query can use LEVEL as a column.

`data_source`
:   The data source is usually a table, but can be another table-like data source, such as a view, UDTF, etc.

`predicate`
:   The predicate is an expression that selects the first “level” of the
    hierarchy (e.g. the president of the company or the top-level component in
    a parts explosion). The predicate should look similar to a
    [WHERE](where.md) clause, but without the keyword `WHERE`.

    See the Examples section (in this topic) for predicate examples.

`colN_identifier`
:   The CONNECT BY clause should contain one or more expressions similar to those
    used in joins. Specifically, a column in the “current” level of the table
    should refer to a column in the “prior” (higher) level of the table.

    For example, in a manager/employee hierarchy, the clause might look similar to:

    > ```sqlexample
    > ... CONNECT BY manager_ID = PRIOR employee_ID ...
    > ```

    The keyword PRIOR indicates that the value should be taken from the
    prior (higher/parent) level.

    In this example, the current employee’s `manager_ID` should match the prior level’s `employee_ID`.

    The CONNECT BY clause can contain more than one such expression, for example:

    > ```sqlexample
    > ... CONNECT BY y = PRIOR x AND b = PRIOR a ...
    > ```

    Each expression similar to the following should have exactly one occurrence of the keyword PRIOR:

    > ```sqlsyntax
    > CONNECT BY <col_1_identifier> = <col_2_identifier>
    > ```
    >
    > The keyword PRIOR may be on either the left-hand or right-hand side of the `=` sign. For example:
    >
    > ```sqlsyntax
    > CONNECT BY <col_1_identifier> = PRIOR <col_2_identifier>
    > ```
    >
    > or
    >
    > ```sqlsyntax
    > CONNECT BY PRIOR <col_1_identifier> = <col_2_identifier>
    > ```

## Usage notes

* A CONNECT BY clause always joins a table to itself, not to another table.
* Some variations within the projection clause are valid. Although the syntax shows `level_expression`
  occurring after the `column_list`, the level expression(s) can occur in any order.
* The keyword `PRIOR` should occur exactly once in each CONNECT BY expression. `PRIOR` can occur on either
  the left-hand side or the right-hand side of the expression, but not on both.
* A query with CONNECT BY may also contain one or both of the following:

  + Filters in a [WHERE](where.md) clause.
  + [JOINs](join.md) (which may be in either a [FROM](from.md) clause or
    a [WHERE](where.md) clause).

  The order of evaluation is:

  1. JOINs (regardless of whether specified in the WHERE clause or the FROM clause).
  2. CONNECT BY
  3. Filters (other than JOIN filters).

  For example, filters in a WHERE clause are processed after the CONNECT BY.
* The Snowflake implementation of CONNECT BY is mostly compatible with the Oracle implementation; however,
  Snowflake does not support:

  > + NOCYCLE
  > + CONNECT_BY_ISCYCLE
  > + CONNECT_BY_ISLEAF

* Snowflake supports the function `SYS_CONNECT_BY_PATH` when used with the `CONNECT BY` clause.
  `SYS_CONNECT_BY_PATH` returns a string that contains the path from the root to the current element.
  An example is included in the Examples section below.

* Snowflake supports the `CONNECT_BY_ROOT` operator when used with the `CONNECT BY` clause. The `CONNECT_BY_ROOT`
  operator allows the current level to use information from the root level of the hierarchy, even if the root level
  is not the immediate parent of the current level.
  An example is included in the Examples section below.
* The `CONNECT BY` clause can iterate as many times as necessary to process the data. Constructing a query improperly can cause
  an infinite loop. In these cases, the query continues to run until the query succeeds, the query times out (e.g. exceeds the
  number of seconds specified by the [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md) parameter), or you
  [cancel the query](../../user-guide/querying-cancel-statements.md).

  For information on how infinite loops can occur and for guidelines on how to avoid this problem, see
  [Troubleshooting a Recursive CTE](../../user-guide/queries-cte.md).

## Examples

This example uses a CONNECT BY to show the management hierarchy in a table
of employee information. The table and data are shown below:

> > ```sqlexample
> > CREATE OR REPLACE TABLE employees (title VARCHAR, employee_ID INTEGER, manager_ID INTEGER);
> > ```
> >
> > ```sqlexample
> > INSERT INTO employees (title, employee_ID, manager_ID) VALUES
> >     ('President', 1, NULL),  -- The President has no manager.
> >         ('Vice President Engineering', 10, 1),
> >             ('Programmer', 100, 10),
> >             ('QA Engineer', 101, 10),
> >         ('Vice President HR', 20, 1),
> >             ('Health Insurance Analyst', 200, 20);
> > ```
>
> The query and output are shown below:
>
> > ```sqlexample
> > SELECT employee_ID, manager_ID, title
> >   FROM employees
> >     START WITH title = 'President'
> >     CONNECT BY
> >       manager_ID = PRIOR employee_id
> >   ORDER BY employee_ID;
> > +-------------+------------+----------------------------+
> > | EMPLOYEE_ID | MANAGER_ID | TITLE                      |
> > |-------------+------------+----------------------------|
> > |           1 |       NULL | President                  |
> > |          10 |          1 | Vice President Engineering |
> > |          20 |          1 | Vice President HR          |
> > |         100 |         10 | Programmer                 |
> > |         101 |         10 | QA Engineer                |
> > |         200 |         20 | Health Insurance Analyst   |
> > +-------------+------------+----------------------------+
> > ```

This example uses the `SYS_CONNECT_BY_PATH` function to show the hierarchy from the President down to the
current employee:

> ```sqlexample
> SELECT SYS_CONNECT_BY_PATH(title, ' -> '), employee_ID, manager_ID, title
>   FROM employees
>     START WITH title = 'President'
>     CONNECT BY
>       manager_ID = PRIOR employee_id
>   ORDER BY employee_ID;
> +----------------------------------------------------------------+-------------+------------+----------------------------+
> | SYS_CONNECT_BY_PATH(TITLE, ' -> ')                             | EMPLOYEE_ID | MANAGER_ID | TITLE                      |
> |----------------------------------------------------------------+-------------+------------+----------------------------|
> |  -> President                                                  |           1 |       NULL | President                  |
> |  -> President -> Vice President Engineering                    |          10 |          1 | Vice President Engineering |
> |  -> President -> Vice President HR                             |          20 |          1 | Vice President HR          |
> |  -> President -> Vice President Engineering -> Programmer      |         100 |         10 | Programmer                 |
> |  -> President -> Vice President Engineering -> QA Engineer     |         101 |         10 | QA Engineer                |
> |  -> President -> Vice President HR -> Health Insurance Analyst |         200 |         20 | Health Insurance Analyst   |
> +----------------------------------------------------------------+-------------+------------+----------------------------+
> ```

This example uses the `CONNECT_BY_ROOT` keyword to display information from the top of the hierarchy in each row
of output:

> ```sqlexample
> SELECT
> employee_ID, manager_ID, title,
> CONNECT_BY_ROOT title AS root_title
>   FROM employees
>     START WITH title = 'President'
>     CONNECT BY
>       manager_ID = PRIOR employee_id
>   ORDER BY employee_ID;
> +-------------+------------+----------------------------+------------+
> | EMPLOYEE_ID | MANAGER_ID | TITLE                      | ROOT_TITLE |
> |-------------+------------+----------------------------+------------|
> |           1 |       NULL | President                  | President  |
> |          10 |          1 | Vice President Engineering | President  |
> |          20 |          1 | Vice President HR          | President  |
> |         100 |         10 | Programmer                 | President  |
> |         101 |         10 | QA Engineer                | President  |
> |         200 |         20 | Health Insurance Analyst   | President  |
> +-------------+------------+----------------------------+------------+
> ```

This example uses a CONNECT BY to show a “parts explosion”:

> Here is the data:
>
> > ```sqlexample
> > -- The components of a car.
> > CREATE TABLE components (
> >     description VARCHAR,
> >     quantity INTEGER,
> >     component_ID INTEGER,
> >     parent_component_ID INTEGER
> >     );
> >
> > INSERT INTO components (description, quantity, component_ID, parent_component_ID) VALUES
> >     ('car', 1, 1, 0),
> >        ('wheel', 4, 11, 1),
> >           ('tire', 1, 111, 11),
> >           ('#112 bolt', 5, 112, 11),
> >           ('brake', 1, 113, 11),
> >              ('brake pad', 1, 1131, 113),
> >        ('engine', 1, 12, 1),
> >           ('piston', 4, 121, 12),
> >           ('cylinder block', 1, 122, 12),
> >           ('#112 bolt', 16, 112, 12)   -- Can use same type of bolt in multiple places
> >     ;
> > ```
>
> Here are the query and output:
>
> > ```sqlexample
> > SELECT
> >   description,
> >   quantity,
> >   component_id,
> >   parent_component_ID,
> >   SYS_CONNECT_BY_PATH(component_ID, ' -> ') AS path
> >   FROM components
> >     START WITH component_ID = 1
> >     CONNECT BY
> >       parent_component_ID = PRIOR component_ID
> >   ORDER BY path
> >   ;
> > +----------------+----------+--------------+---------------------+----------------------------+
> > | DESCRIPTION    | QUANTITY | COMPONENT_ID | PARENT_COMPONENT_ID | PATH                       |
> > |----------------+----------+--------------+---------------------+----------------------------|
> > | car            |        1 |            1 |                   0 |  -> 1                      |
> > | wheel          |        4 |           11 |                   1 |  -> 1 -> 11                |
> > | tire           |        1 |          111 |                  11 |  -> 1 -> 11 -> 111         |
> > | #112 bolt      |        5 |          112 |                  11 |  -> 1 -> 11 -> 112         |
> > | brake          |        1 |          113 |                  11 |  -> 1 -> 11 -> 113         |
> > | brake pad      |        1 |         1131 |                 113 |  -> 1 -> 11 -> 113 -> 1131 |
> > | engine         |        1 |           12 |                   1 |  -> 1 -> 12                |
> > | #112 bolt      |       16 |          112 |                  12 |  -> 1 -> 12 -> 112         |
> > | piston         |        4 |          121 |                  12 |  -> 1 -> 12 -> 121         |
> > | cylinder block |        1 |          122 |                  12 |  -> 1 -> 12 -> 122         |
> > +----------------+----------+--------------+---------------------+----------------------------+
> > ```

---
title: Constraints
source: https://docs.snowflake.com/en/sql-reference/constraints.md
section: SQL General Reference
---

# Constraints

Constraints define integrity and consistency rules for data stored in tables.
Snowflake provides support for constraints as defined in the ANSI SQL standard,
as well as some extensions for compatibility with other databases, such as Oracle.

> **Important:**
>
> * For standard tables, Snowflake supports defining and maintaining constraints, but
>   doesn’t enforce them, except for NOT NULL and CHECK constraints, which are always enforced.
>
>   Violations of constraints might cause unexpected downstream effects. If you decide to create a
>   constraint that must be relied upon, ensure that your downstream processes can maintain data
>   integrity. For more information, see [Constraint properties](sql/create-table-constraint.md).
>
>   Constraints on standard tables are provided primarily for data modeling purposes and compatibility
>   with other databases, as well as to support client tools that utilize constraints. For example,
>   Tableau supports using constraints to perform join culling (join elimination), which can improve the
>   performance of generated queries and cube refresh.
> * For [hybrid tables](../user-guide/tables-hybrid.md), Snowflake both supports and enforces
>   constraints. Primary key constraints are required and enforced on all hybrid tables, and other
>   constraints are enforced when used.

**Next Topics:**

* [Overview of constraints](constraints-overview.md)
* [Creating constraints](constraints-create.md)
* [Modifying constraints](constraints-alter.md)
* [Dropping constraints](constraints-drop.md)

---
title: Context functions
source: https://docs.snowflake.com/en/sql-reference/functions-context.md
section: SQL General Reference
---

# Context functions

This family of functions allows for the gathering of information about the context in which the statement is executed. These functions are evaluated
at most once per statement.

## List of functions

| Sub-category | Function | Notes |
| --- | --- | --- |
| General context | [CURRENT_CLIENT](functions/current_client.md) |  |
|  | [CURRENT_DATE](functions/current_date.md) |  |
|  | [CURRENT_IP_ADDRESS](functions/current_ip_address.md) |  |
|  | [CURRENT_REGION](functions/current_region.md) |  |
|  | [CURRENT_TIME](functions/current_time.md) |  |
|  | [CURRENT_TIMESTAMP](functions/current_timestamp.md) |  |
|  | [CURRENT_VERSION](functions/current_version.md) |  |
|  | [GETDATE](functions/getdate.md) | Alias for CURRENT_TIMESTAMP. |
|  | [LOCALTIME](functions/localtime.md) | Alias for CURRENT_TIME. |
|  | [LOCALTIMESTAMP](functions/localtimestamp.md) | Alias for CURRENT_TIMESTAMP. |
|  | [SYSDATE](functions/sysdate.md) |  |
|  | [SYSTIMESTAMP](functions/systimestamp.md) |  |
|  | [SYS_CONTEXT](functions/sys_context.md) |  |
| Session context | [ALL_USER_NAMES](functions/all_user_names.md) |  |
|  | [CURRENT_ACCOUNT](functions/current_account.md) | Returns account locator. |
|  | [CURRENT_ACCOUNT_NAME](functions/current_account_name.md) | Returns account name. |
|  | [CURRENT_ORGANIZATION_NAME](functions/current_organization_name.md) |  |
|  | [CURRENT_ORGANIZATION_USER](functions/current_organization_user.md) |  |
|  | [CURRENT_ROLE](functions/current_role.md) |  |
|  | [CURRENT_AVAILABLE_ROLES](functions/current_available_roles.md) |  |
|  | [CURRENT_SECONDARY_ROLES](functions/current_secondary_roles.md) |  |
|  | [CURRENT_SESSION](functions/current_session.md) |  |
|  | [CURRENT_STATEMENT](functions/current_statement.md) |  |
|  | [CURRENT_TRANSACTION](functions/current_transaction.md) |  |
|  | [CURRENT_USER](functions/current_user.md) |  |
|  | [GETVARIABLE](functions/getvariable.md) |  |
|  | [SET_SYS_CONTEXT](functions/set_sys_context.md) |  |
|  | [LAST_QUERY_ID](functions/last_query_id.md) |  |
|  | [LAST_TRANSACTION](functions/last_transaction.md) |  |
| Session object context | [CURRENT_DATABASE](functions/current_database.md) |  |
|  | [CURRENT_ROLE_TYPE](functions/current_role_type.md) |  |
|  | [CURRENT_SCHEMA](functions/current_schema.md) |  |
|  | [CURRENT_SCHEMAS](functions/current_schemas.md) |  |
|  | [CURRENT_WAREHOUSE](functions/current_warehouse.md) |  |
|  | [INVOKER_ROLE](functions/invoker_role.md) |  |
|  | [INVOKER_SHARE](functions/invoker_share.md) |  |
|  | [IS_APPLICATION_ROLE_ACTIVATED (SYS_CONTEXT function)](functions/is_application_role_activated.md) |  |
|  | [IS_APPLICATION_ROLE_IN_SESSION](functions/is_application_role_in_session.md) |  |
|  | [IS_DATABASE_ROLE_IN_SESSION](functions/is_database_role_in_session.md) |  |
|  | [IS_GRANTED_TO_INVOKER_ROLE](functions/is_granted_to_invoker_role.md) |  |
|  | [IS_INSTANCE_ROLE_IN_SESSION](functions/is_instance_role_in_session.md) |  |
|  | [IS_ROLE_ACTIVATED (SYS_CONTEXT function)](functions/is_role_activated.md) |  |
|  | [IS_ROLE_IN_SESSION](functions/is_role_in_session.md) |  |
|  | [POLICY_CONTEXT](functions/policy_context.md) |  |
| Alert context | [GET_CONDITION_QUERY_UUID](functions/get_condition_query_uuid.md) |  |
| Organization context | [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](functions/is_group_activated.md) |  |
|  | [IS_GROUP_IMPORTED (SYS_CONTEXT function)](functions/is_group_imported.md) |  |
|  | [IS_USER_IMPORTED (SYS_CONTEXT function)](functions/is_user_imported.md) |  |

## Usage notes

* Context functions generally do not require arguments (except for [SYS_CONTEXT](functions/sys_context.md)).
* To comply with the ANSI standard, the following context functions can be called without parentheses
  in SQL statements:

  + CURRENT_DATE
  + CURRENT_TIME
  + CURRENT_TIMESTAMP
  + CURRENT_USER
  + LOCALTIME
  + LOCALTIMESTAMP
  > **Note:**
  >
  > If you are setting a [Snowflake Scripting variable](../developer-guide/snowflake-scripting/variables.md)
  > to an expression that calls one of these functions (for example, `my_var := <function_name>();`),
  > you must include the parentheses.

## Examples

Display the current warehouse, database, and schema for the session:

```sqlexample
SELECT CURRENT_WAREHOUSE(), CURRENT_DATABASE(), CURRENT_SCHEMA();
```

```output
+---------------------+--------------------+------------------+
| CURRENT_WAREHOUSE() | CURRENT_DATABASE() | CURRENT_SCHEMA() |
|---------------------+--------------------+------------------+
| MY_WAREHOUSE        | MY_DB              | PUBLIC           |
|---------------------+--------------------+------------------+
```

Display the current date, time, and timestamp (note that parentheses are not required to call these functions):

```sqlexample
SELECT CURRENT_DATE, CURRENT_TIME, CURRENT_TIMESTAMP;
```

```output
+--------------+--------------+-------------------------------+
| CURRENT_DATE | CURRENT_TIME | CURRENT_TIMESTAMP             |
|--------------+--------------+-------------------------------|
| 2024-06-07   | 10:45:15     | 2024-06-07 10:45:15.064 -0700 |
+--------------+--------------+-------------------------------+
```

In a Snowflake Scripting block, call the CURRENT_DATE function without parentheses to set a variable in a
SQL statement:

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  currdate DATE;
BEGIN
  SELECT CURRENT_DATE INTO currdate;
  RETURN currdate;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| 2024-06-07      |
+-----------------+
```

In a Snowflake Scripting block, attempting to set a variable to an expression that calls the CURRENT_DATE
function without parentheses results in an error:

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  today DATE;
BEGIN
  today := CURRENT_DATE;
  RETURN today;
END;
$$
;
```

```output
000904 (42000): SQL compilation error: error line 5 at position 11
invalid identifier 'CURRENT_DATE'
```

The same block returns the current date when the function is called with the parentheses:

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  today DATE;
BEGIN
  today := CURRENT_DATE();
  RETURN today;
END;
$$
;
```

```output
+-----------------+
| anonymous block |
|-----------------|
| 2024-06-07      |
+-----------------+
```

---
title: CONTINUE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/continue.md
section: SQL General Reference
---

# CONTINUE (Snowflake Scripting)

`CONTINUE` (or `ITERATE`) skips the rest of the statements in the iteration of a loop and starts the next iteration of
the loop.

For more information on terminating the current iteration of a loop, see [Terminating an iteration without terminating the loop](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [BREAK](break.md)

## Syntax

```sqlsyntax
{ CONTINUE | ITERATE } [ <label> ] ;
```

Where:

> `label`
> :   An optional label. If the label is specified, the `CONTINUE` will start at the first statement in the loop with
>     the label.
>
>     You can use this to continue more than one level higher in a nested loop or a nested branch.

## Usage notes

* `CONTINUE` and `ITERATE` are synonymous.
* If the loop is embedded in another loop(s), you can break out of not only the current loop and start from the first statement in
  the enclosing loop by including the enclosing loop’s label as part of the `CONTINUE`. For an example, see the examples
  section below.

## Examples

The following loop iterates 3 times. Because the code after the `CONTINUE` statement is not executed, the variable
named `counter2` will be 0 rather than 3.

> ```sqlexample
> DECLARE
>   counter1 NUMBER(8, 0);
>   counter2 NUMBER(8, 0);
> BEGIN
>   counter1 := 0;
>   counter2 := 0;
>   WHILE (counter1 < 3) DO
>     counter1 := counter1 + 1;
>     CONTINUE;
>     counter2 := counter2 + 1;
>   END WHILE;
>   RETURN counter2;
> END;
> ```
>
> Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
> `execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
> code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):
>
> ```sqlexample
> EXECUTE IMMEDIATE $$
> DECLARE
>     counter1 NUMBER(8, 0);
>     counter2 NUMBER(8, 0);
> BEGIN
>     counter1 := 0;
>     counter2 := 0;
>     WHILE (counter1 < 3) DO
>         counter1 := counter1 + 1;
>         CONTINUE;
>         counter2 := counter2 + 1;
>     END WHILE;
>     RETURN counter2;
> END;
> $$;
> ```

Here is the output of executing the example:

```sqlexample
+-----------------+
| anonymous block |
|-----------------|
|               0 |
+-----------------+
```

---
title: Conversion functions
source: https://docs.snowflake.com/en/sql-reference/functions-conversion.md
section: SQL General Reference
---

# Conversion functions

This family of functions can be used to convert an expression of any Snowflake data type to another data type.

## List of functions

| Sub-category | Function | Notes |
| --- | --- | --- |
| **Any data type** | [CAST , ::](functions/cast.md) |  |
| [TRY_CAST](functions/try_cast.md) | Error-handling version of CAST. |
| **Text/character/binary data types** | [TO_CHAR , TO_VARCHAR](functions/to_char.md) |  |
| [TO_BINARY](functions/to_binary.md) |  |
| [TRY_TO_BINARY](functions/try_to_binary.md) | Error-handling version to TO_BINARY. |
| **Numeric data types** | [TO_DECFLOAT](functions/to_decfloat.md) |  |
| [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](functions/to_decimal.md) |  |
| [TO_DOUBLE](functions/to_double.md) |  |
| [TRY_TO_DECFLOAT](functions/try_to_decfloat.md) | Error-handling version of TO_DECFLOAT. |
| [TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC](functions/try_to_decimal.md) | Error-handling versions of TO_DECIMAL, TO_NUMBER, and so on. |
| [TRY_TO_DOUBLE](functions/try_to_double.md) | Error-handling version of TO_DOUBLE. |
| **Boolean data type** | [TO_BOOLEAN](functions/to_boolean.md) |  |
| [TRY_TO_BOOLEAN](functions/try_to_boolean.md) | Error-handling version of TO_BOOLEAN. |
| **Date and time data types** | [TO_DATE , DATE](functions/to_date.md) |  |
| [TO_TIME , TIME](functions/to_time.md) |  |
| [TO_TIMESTAMP / TO_TIMESTAMP_\*](functions/to_timestamp.md) |  |
| [TRY_TO_DATE](functions/try_to_date.md) | Error-handling version of TO_DATE. |
| [TRY_TO_TIME](functions/try_to_time.md) | Error-handling version of TO_TIME. |
| [TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*](functions/try_to_timestamp.md) | Error-handling versions of TO_TIMESTAMP, and so on. |
| **Semi-structured data types** | [TO_ARRAY](functions/to_array.md) |  |
| [TO_OBJECT](functions/to_object.md) |  |
| [TO_VARIANT](functions/to_variant.md) |  |
| **Geospatial data types** | [TO_GEOGRAPHY](functions/to_geography.md) |  |
| [TRY_TO_GEOGRAPHY](functions/try_to_geography.md) | Error-handling version of TO_GEOGRAPHY |
| [ST_GEOGFROMGEOHASH](functions/st_geogfromgeohash.md) |  |
| [ST_GEOGPOINTFROMGEOHASH](functions/st_geogpointfromgeohash.md) |  |
| [ST_GEOGRAPHYFROMWKB](functions/st_geographyfromwkb.md) |  |
| [ST_GEOGRAPHYFROMWKT](functions/st_geographyfromwkt.md) |  |
| [TO_GEOMETRY](functions/to_geometry.md) |  |
| [TRY_TO_GEOMETRY](functions/try_to_geometry.md) | Error-handling version of TO_GEOMETRY |
| [ST_GEOMETRYFROMWKB](functions/st_geometryfromwkb.md) |  |
| [ST_GEOMETRYFROMWKT](functions/st_geometryfromwkt.md) |  |

## Error-handling conversion functions

Conversion functions with a TRY_ prefix are special versions of their respective conversion functions. These functions return a NULL value instead of raising an error when the conversion cannot be performed:

* [TRY_CAST](functions/try_cast.md)
* [TRY_TO_BINARY](functions/try_to_binary.md)
* [TRY_TO_BOOLEAN](functions/try_to_boolean.md)
* [TRY_TO_DATE](functions/try_to_date.md)
* [TRY_TO_DECIMAL, TRY_TO_NUMBER, TRY_TO_NUMERIC](functions/try_to_decimal.md)
* [TRY_TO_DOUBLE](functions/try_to_double.md)
* [TRY_TO_GEOGRAPHY](functions/try_to_geography.md)
* [TRY_TO_GEOMETRY](functions/try_to_geometry.md)
* [TRY_TO_TIME](functions/try_to_time.md)
* [TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*](functions/try_to_timestamp.md)

These functions only support string expressions (i.e. VARCHAR or CHAR data type) as input.

> **Important:**
>
> These error-handling conversion functions are optimized for situations where conversion errors are relatively infrequent:
>
> * If there are no (or very few) errors, they should result in no visible performance impact.
> * If there are a large number of conversion failures, using these functions can result in significantly slower performance. Also, when using them with the VARIANT type, some operations might result in reduced performance.

## Numeric formats in conversion functions

The functions
[TO_DECIMAL , TO_NUMBER , TO_NUMERIC](functions/to_decimal.md), and
[TO_DOUBLE](functions/to_double.md)
accept an optional parameter that specifies the format of the input string,
if the input expression evaluates to a string. For more information
about the values this parameter can have, see
[SQL format models](sql-format-models.md).

## Date and time formats in conversion functions

The following functions allow you to specify the expected date, time, or timestamp format to parse or produce a string:

* [TO_CHAR , TO_VARCHAR](functions/to_char.md)
* [TO_DATE , DATE](functions/to_date.md)
* [TRY_TO_DATE](functions/try_to_date.md)
* [TO_TIME , TIME](functions/to_time.md)
* [TRY_TO_TIME](functions/try_to_time.md)
* [TO_TIMESTAMP / TO_TIMESTAMP_\*](functions/to_timestamp.md)
* [TRY_TO_TIMESTAMP / TRY_TO_TIMESTAMP_\*](functions/try_to_timestamp.md)

You specify the format in an optional argument, using the following case-insensitive elements to describe the format:

| Format element | Description |
| --- | --- |
| `YYYY` | Four-digit [1] year. |
| `YY` | Two-digit [1] year, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when set to `1980`, values of `79` and `80` are parsed as `2079` and `1980`, respectively. |
| `Y` | One-digit or two-digit [2] year without leading zeros, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when the parameter set to `1990`, values of `2005` and `1991` are serialized as `5` and `91`, respectively. |
| `MM` | Two-digit [1] month (`01` = January, and so on). |
| `MO` | One-digit or two-digit [2] month without leading zeros (`1` = January, and so on). |
| `MON` | Abbreviated month name [3]. |
| `MMMM` | Full month name [3]. |
| `DD` | Two-digit [1] day of month (`01` through `31`). |
| `D` | One-digit or two-digit [2] day of month without leading zeros (`1` through `31`). |
| `DY` | Abbreviated day of week. |
| `HH24` | Two digits [1] for hour (`00` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `HH12` | Two digits [1] for hour (`01` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `H24` | One or two digits [2] for hour without leading zeros (`0` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `H12` | One or two digits [2] for hour without leading zeros (`1` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `AM` , `PM` | Ante meridiem (`AM`) / post meridiem (`PM`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `P` | Ante meridiem (`A`) / post meridiem (`P`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `HH` | Synonym for `HH24`. |
| `H` | Synonym for `H24`. |
| `MI` | Two digits [1] for minute (`00` through `59`). |
| `ME` | One or two digits [2] for minute without leading zeros (`0` through `59`). |
| `SS` | Two digits [1] for second (`00` through `59`). |
| `S` | One or two digits [2] for second without leading zeros (`0` through `59`). |
| `FF[0-9]` | Fractional seconds with precision `0` (seconds) to `9` (nanoseconds), e.g. `FF`, `FF0`, `FF3`, `FF9`. Specifying `FF` is equivalent to `FF9` (nanoseconds). |
| `TZH:TZM` , `TZHTZM` , `TZH` | Two-digit [1] time zone hour and minute, offset from UTC. Can be prefixed by `+`/`-` for sign. |
| `UUUU` | Four-digit year in [ISO format](https://en.wikipedia.org/wiki/ISO_8601), which are negative for BCE years. |

[1] The number of digits describes the output produced when serializing values to text. When parsing text, Snowflake accepts up to the specified number of digits. For example, a day number can be one or two digits.

[2] The number of digits describes the output produced when serializing values to text. Parsing isn’t supported. If parsing is required, use an equivalent format that includes leading zeros. These format elements will be enabled in BCR bundle 2026_03.

[3] For the MON format element, the output produced when serializing values to text is the abbreviated month name. For the MMMM format element, the output produced when serializing values to text is the full month name. When parsing text, Snowflake accepts the three-digit abbreviation or the full month name for both MON and MMMM. For example, “January” or “Jan”, “February” or “Feb”, and so on are accepted when parsing text.

> **Note:**
>
> * When a date-only format is used, the associated time is assumed to be midnight on that day.
> * Anything in the format between double quotes or other than the above elements is parsed/formatted without being interpreted.
>   Snowflake recommends always enclosing literal characters in double quotes
>   (for example, `"T"`, `"EST"`, `"Z"`) to ensure they are treated as literals.
> * For more details about valid ranges, number of digits, and best practices, see
>   [Additional information about using date, time, and timestamp formats](date-time-input-output.md).

### Usage notes

Anything in the format between double quotes or other than the above elements is parsed/formatted without being interpreted.

### Examples

Convert a string to a date using a specified input format of `dd/mm/yyyy`. The display format for dates in the output
is determined by the [DATE_OUTPUT_FORMAT](parameters.md) session parameter (default `YYYY-MM-DD`).

```sqlexample
SELECT TO_DATE('3/4/2024', 'dd/mm/yyyy');
```

```output
+-----------------------------------+
| TO_DATE('3/4/2024', 'DD/MM/YYYY') |
|-----------------------------------|
| 2024-04-03                        |
+-----------------------------------+
```

Convert a date to a string, and specify a [date output format](parameters.md)
of `mon dd, yyyy`.

```sqlexample
SELECT TO_VARCHAR('2024-04-05'::DATE, 'mon dd, yyyy');
```

```output
+------------------------------------------------+
| TO_VARCHAR('2024-04-05'::DATE, 'MON DD, YYYY') |
|------------------------------------------------|
| Apr 05, 2024                                   |
+------------------------------------------------+
```

## Binary formats in conversion functions

[TO_CHAR , TO_VARCHAR](functions/to_char.md), and [TO_BINARY](functions/to_binary.md) accept an optional
argument specifying the expected format to parse or produce a string.

The format can be one of the following strings (case-insensitive):

> * HEX
> * BASE64
> * UTF-8

For more information about these formats, see [Overview of supported binary formats](binary-input-output.md).

For examples of using these formats, see the Examples section of
[Binary input and output](binary-input-output.md).

---
title: CORTEX_ANALYST_REQUESTS_V view
source: https://docs.snowflake.com/en/sql-reference/local/cortex_analyst_requests_v.md
section: SQL General Reference
---

Schema:
:   [LOCAL](../local.md)

# CORTEX_ANALYST_REQUESTS_V view

The CORTEX_ANALYST_REQUESTS_V view presents [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md) request log data.

## Columns

| `timestamp` | TIMESTAMP_NTZ | The time the request was received. |
| --- | --- | --- |
| `semantic_model_type` | VARCHAR | The data that was used to provide the semantic model.  This value is one of the following:   * `FILE_ON_STAGE`: The semantic model was loaded from a   [semantic model file](../../user-guide/views-semantic/sql.md) on a Snowflake stage. * `SEMANTIC_VIEW`: The semantic model was loaded from a   [semantic view](../../user-guide/views-semantic/overview.md) definition. |
| `semantic_model_name` | VARCHAR | The name of the semantic model.   * For models loaded from a stage, this is the fully qualified path to the staged file in the form   `@db.schema.stage/spec_path`. * For models loaded from a semantic view, this is the name of the view. |
| `tables_referenced` | VARIANT | An array containing the fully qualified names of all tables referenced by Cortex Analyst. |
| `request_id` | VARCHAR | The internal/system-generated identifier for the request. |
| `user_id` | VARCHAR | The Snowflake identifier for the user that sent the request. |
| `source` | VARCHAR | A JSON object containing metadata about the request source.  This object has the following fields:   * `agent_request_id`: Internal/system-generated identifier for the agent request that generated this request,   if any |
| `generated_sql` | VARCHAR | The SQL statement generated by Cortex Analyst in response to the request’s question. |
| `latest_question` | VARCHAR | The most recent question sent as part of the request. |
| `request_body` | VARIANT | The full body of the HTTP request.  For the structure of this object, see [Request body](../../user-guide/snowflake-cortex/cortex-analyst/rest-api.md). |
| `response_body` | VARIANT | The full body of the HTTP response.  For the structure of this object, see [Non-streaming response](../../user-guide/snowflake-cortex/cortex-analyst/rest-api.md). |
| `response_status_code` | NUMBER | The HTTP response code returned to the client. |
| `warnings` | VARIANT | The list of warnings from Cortex Analyst about the request. |
| `primary_role_name` | VARCHAR | The name of the primary role used by the user who made the request. |
| `response_metadata` | VARIANT | Metadata containing the response generation details for the request |
| `feedback` | ARRAY | Array of objects that contain information for all feedback associated with the request.  Feedback objects are identical to the REST API call request body as described in [Send feedback](../../user-guide/snowflake-cortex/cortex-analyst/rest-api.md). |

## Required privileges

Accessing the CORTEX_ANALYST_REQUESTS_V view requires one of the following roles:

* SNOWFLAKE.CORTEX_ANALYST_REQUESTS_VIEWER
* SNOWFLAKE.CORTEX_ANALYST_REQUESTS_ADMIN

---
title: Creating an asynchronous function on Azure
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-asynchronous.md
section: SQL General Reference
---

# Creating an asynchronous function on Azure

The concepts for creating an asynchronous external functions on Azure are similar to the concepts for
[creating an asynchronous function on AWS](external-functions-creating-aws-sample-asynchronous.md).

However, the AWS code sample cannot be used directly on Azure because you must use the corresponding Azure services like
Azure Functions and Azure Blob Storage. Furthermore, the details of navigating the cloud platform’s user interface differ.

---
title: Creating an external function for AWS using an AWS CloudFormation template
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-template.md
section: SQL General Reference
---

# Creating an external function for AWS using an AWS CloudFormation template

These topics provide detailed instructions for using an AWS (Amazon Web Services) CloudFormation template to create an external
function hosted on AWS.

Snowflake provides a sample template that you can start with. This template hides some details of the creation process.
When you are ready to create your own custom external function, you can either customize a copy of the template or you can
[use the AWS Management Console](external-functions-creating-aws-ui.md) to create the function.

These topics assume that you are already familiar with the AWS Management Console. They describe the general steps that you need
to complete, but do not describe the Console in detail.

**See also:**

* [Planning an external function for AWS](external-functions-creating-aws-planning.md)

**Steps:**

* [Step 1: Use the template to create the remote service (AWS Lambda function) and proxy service (API Gateway)](external-functions-creating-aws-template-services.md)
* [Step 2: Record the Amazon API Gateway URL and the new IAM role ARN](external-functions-creating-aws-template-gateway-url.md)
* [Step 3: Create the API integration for AWS in Snowflake](external-functions-creating-aws-common-api-integration.md)
* [Step 4: Link the API integration for AWS to the proxy service in the Management Console](external-functions-creating-aws-common-api-integration-proxy-link.md)
* [Step 5: Create the external function for AWS in Snowflake](external-functions-creating-aws-common-ext-function.md)

---
title: Creating an external function for AWS using the AWS Management Console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-ui.md
section: SQL General Reference
---

# Creating an external function for AWS using the AWS Management Console

These topics provide detailed instructions for using the AWS Management Console user interface to create an external function
hosted on AWS (Amazon Web Services). You can use these instructions either to create the sample external function provided by
Snowflake or as a guide to create your own external function.

These topics explain how to:

* Create a basic AWS Lambda Function as a remote service and an Amazon API Gateway as a proxy service.
* Create an API integration and the external function itself in Snowflake.
* Link the API integration to the API Management service.
* Secure the API Management service through a security policy.

These topics assume that you are already familiar with the AWS Management Console. They describe the general steps that you need
to complete, but do not describe the Console in detail.

**See also:**

* [Planning an external function for AWS](external-functions-creating-aws-planning.md)

**Steps:**

* [Step 1: Create the remote service (AWS Lambda function) in the Management Console](external-functions-creating-aws-ui-remote-service.md)
* [Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console](external-functions-creating-aws-ui-proxy-service.md)
* [Step 3: Create the API integration for AWS in Snowflake](external-functions-creating-aws-common-api-integration.md)
* [Step 4: Link the API integration for AWS to the proxy service in the Management Console](external-functions-creating-aws-common-api-integration-proxy-link.md)
* [Step 5: Create the external function for AWS in Snowflake](external-functions-creating-aws-common-ext-function.md)

---
title: Creating an external function for Azure using an ARM template
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-template.md
section: SQL General Reference
---

# Creating an external function for Azure using an ARM template

These topics provide detailed instructions for using an ARM (Azure Resource Manager) template to create an external function hosted on
Microsoft Azure.

Snowflake provides a sample template that you can start with. This template hides some details of the creation process and hard-codes
some names (e.g. trigger name) and functionality. When you are ready to create your own custom external function, you can either customize
a copy of the template or you can [use the Azure Portal](external-functions-creating-azure-ui.md) to create the function.

These topics assume that you are already familiar with the Azure Portal. They describe the general steps that you need to complete,
but do not describe the Portal in detail.

**See also:**

* [Planning an external function for Azure](external-functions-creating-azure-planning.md)

**Steps:**

* [Step 1: Create an Azure AD app for the Azure functions app in the Portal](external-functions-creating-azure-template-apps.md)
* [Step 2: Use the template to create the remote service (Azure function) and proxy service (API Management service)](external-functions-creating-azure-template-services.md)
* [Step 3: Create the API integration for Azure in Snowflake](external-functions-creating-azure-common-api-integration.md)
* [Step 4: Link the API integration for Azure to the proxy service in the Portal](external-functions-creating-azure-common-api-integration-proxy-link.md)
* [Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md)
* [Step 6: Update the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-template-security-policy.md)

---
title: Creating an external function for Azure using the Azure Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-ui.md
section: SQL General Reference
---

# Creating an external function for Azure using the Azure Portal

These topics provide detailed instructions for using the Azure Portal user interface to create an external function hosted on
Microsoft Azure. You can use these instructions either to create the sample external function provided by Snowflake or as a guide to
create your own external function.

In these topics, you will learn how to:

* Create a basic Azure Function as a remote service and an Azure API Management service as a proxy service.
* Create an API integration and the external function itself in Snowflake.
* Link the API integration to the API Management service.
* Secure the API Management service through a security policy.

These topics assume that you are already familiar with the Azure Portal. They describe the general steps that you need to complete,
but do not describe the Portal in detail.

**See also:**

* [Planning an external function for Azure](external-functions-creating-azure-planning.md)

**Steps:**

* [Step 1: Create the remote service (Azure function) in the Portal](external-functions-creating-azure-ui-remote-service.md)
* [Step 2: Create the proxy service (Azure API Management service) in the Portal](external-functions-creating-azure-ui-proxy-service.md)
* [Step 3: Create the API integration for Azure in Snowflake](external-functions-creating-azure-common-api-integration.md)
* [Step 4: Link the API integration for Azure to the proxy service in the Portal](external-functions-creating-azure-common-api-integration-proxy-link.md)
* [Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md)
* [Step 6: Create the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-ui-security-policy.md)

---
title: Creating an external function for GCP using the Google Cloud console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-ui.md
section: SQL General Reference
---

# Creating an external function for GCP using the Google Cloud console

These topics provide detailed instructions for using the Google Cloud Console user interface to create an external function hosted on
GCP (Google Cloud Platform). You can use these instructions either to create the sample external function provided by Snowflake or as
a guide to create your own external function.

In these topics, you will learn how to:

* Create a basic Google Cloud Function as a remote service and a Google Cloud API Gateway as a proxy service.
* Create an API integration and the external function itself in Snowflake.
* Secure the API Gateway through a security policy.

These topics assume that you are already familiar with the Google Cloud Console. They describe the general steps that you need to complete,
but do not describe the Console in detail.

> **Tip:**
>
> Google also provides a command-line interface that you can use for many of these steps. For more details, see the GCP
> [gcloud documentation](https://cloud.google.com/api-gateway/docs/quickstart).

**See also:**

* [Planning an external function for GCP](external-functions-creating-gcp-planning.md)

**Steps:**

* [Step 1: Create the remote service (Google Cloud Function) in the console](external-functions-creating-gcp-ui-remote-service.md)
* [Step 2: Create the proxy service (Google Cloud API Gateway) in the console](external-functions-creating-gcp-ui-proxy-service.md)
* [Step 3: Create the API integration for GCP in Snowflake](external-functions-creating-gcp-common-api-integration.md)
* [Step 4: Create the external function for GCP in Snowflake](external-functions-creating-gcp-common-ext-function.md)
* [Step 5: Create a GCP security policy for the proxy service in the console](external-functions-creating-gcp-ui-security-policy.md)

---
title: Creating constraints
source: https://docs.snowflake.com/en/sql-reference/constraints-create.md
section: SQL General Reference
---

# Creating constraints

A constraint can be created at table creation using [CREATE TABLE](sql/create-table.md), or added to a table later using [ALTER TABLE](sql/alter-table.md):

* Single-column constraints can be created inline as part of the column definition.
* Multi-column constraints must be created with a separate out-of-line clause that specifies the columns in the constraint.

To create a constraint, certain access control privileges must be granted on the role used to create the constraint. For more information, see [Access control requirements](sql/create-table-constraint.md).

## Creating constraints inline

The following inline syntax can only be used for single-column constraints:

```sqlsyntax
CREATE [ OR REPLACE ] TABLE <name> (<column_name> <column_type> [ <inline_constraint> ] , ... )

ALTER TABLE <name> ADD COLUMN <column_name> <column_type> [ <inline_constraint> ]
```

For `inline_constraint` syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](sql/create-table-constraint.md).

## Creating constraints out-of-line

The following out-of-line syntax must be used for multi-column constraints, but can also be used for single-column constraints:

```sqlsyntax
CREATE [ OR REPLACE ] TABLE <name> ( ... , [ <outofline_constraint> ], ... )

ALTER TABLE <name> ADD <outofline_constraint>
```

For `outofline_constraint` syntax details, see [CREATE | ALTER TABLE … CONSTRAINT](sql/create-table-constraint.md).

## Constraints in CREATE TABLE … LIKE and CLONE

Snowflake supports creating copies of tables using [CREATE TABLE](sql/create-table.md):

* To create an empty copy, use CREATE TABLE … LIKE.
* To create a clone, use CREATE TABLE … CLONE.

In addition, copies of tables are automatically created when a schema or database is cloned.

Regardless of how a copy is created for a table, the constraints on the original table are also copied. When copying a foreign key with a referencing table (foreign key table) and a referenced table (primary key table), the following scenarios may occur:

* If both tables are copied in the same command (such as during cloning of a schema or database), a new foreign key is created between the new referencing table and the referenced table.
* If only the referencing table is copied, a new foreign key is created on the referencing table, which points to the original primary key table as the referenced table.
* If only the referenced table is copied, no new foreign keys are created, although the primary or unique keys are copied.

As a result, if you copy a referencing and referenced table separately, you must manually create a new foreign key, or change the primary key table for the new foreign key manually.

---
title: Creating external functions on AWS
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws.md
section: SQL General Reference
---

# Creating external functions on AWS

These topics provide detailed instructions for creating an external function for AWS using either the AWS Management Console or an
AWS CloudFormation template provided by Snowflake.

**Next Topics:**

* [Planning an external function for AWS](external-functions-creating-aws-planning.md)
* [Creating an external function for AWS using the AWS Management Console](external-functions-creating-aws-ui.md)
* [Creating an external function for AWS using an AWS CloudFormation template](external-functions-creating-aws-template.md)
* [Calling an external function for AWS](external-functions-creating-aws-call.md)
* [Troubleshooting external functions for AWS](external-functions-creating-aws-troubleshooting.md)

---
title: Creating external functions on GCP
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp.md
section: SQL General Reference
---

# Creating external functions on GCP

These topics explain how to plan, create, call, and troubleshoot external functions hosted on GCP (Google Cloud Platform).

**Next Topics:**

* [Planning an external function for GCP](external-functions-creating-gcp-planning.md)
* [Creating an external function for GCP using the Google Cloud console](external-functions-creating-gcp-ui.md)
* [Calling an external function for GCP](external-functions-creating-gcp-call.md)
* [Troubleshooting external functions for GCP](external-functions-creating-gcp-troubleshooting.md)

---
title: Creating external functions on Microsoft Azure
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure.md
section: SQL General Reference
---

# Creating external functions on Microsoft Azure

These topics provide detailed instructions for creating an external function for Microsoft Azure using either the Azure Portal or an
ARM (Azure Resource Manager) template provided by Snowflake.

**Next Topics:**

* [Planning an external function for Azure](external-functions-creating-azure-planning.md)
* [Creating an external function for Azure using the Azure Portal](external-functions-creating-azure-ui.md)
* [Creating an external function for Azure using an ARM template](external-functions-creating-azure-template.md)
* [Calling an external function for Azure](external-functions-creating-azure-call.md)
* [Troubleshooting external functions for Azure](external-functions-creating-azure-troubleshooting.md)

---
title: Data Definition Language (DDL) commands
source: https://docs.snowflake.com/en/sql-reference/sql-ddl-summary.md
section: SQL General Reference
---

# Data Definition Language (DDL) commands

DDL commands are used to create, manipulate, and modify objects in Snowflake, such as users, virtual warehouses, databases, schemas,
tables, views, columns, functions, and stored procedures.

They are also used to perform many account-level and session operations, such as setting parameters, initializing variables, and
initiating transactions.

The following commands serve as the base for all DDL commands:

* [ALTER <object>](sql/alter.md)
* [COMMENT](sql/comment.md)
* [CREATE <object>](sql/create.md)
* [CREATE OR ALTER <object>](sql/create-or-alter.md)
* [DESCRIBE <object>](sql/desc.md)
* [DROP <object>](sql/drop.md)
* [SHOW <objects>](sql/show.md)
* [USE <object>](sql/use.md)

Each command takes an *object type* and *identifier*, as well as additional parameters and options. The descriptions for the
[individual commands](sql-all.md) provide the syntax and full list of parameters that can be specified for each
command. The descriptions also provide detailed usage notes and examples.

The commands are grouped into the following categories:

* [Account & session DDL](ddl-other.md)
* [User & security DDL](ddl-user-security.md)
* [Warehouse & resource monitor DDL](ddl-virtual-warehouse.md)
* [Database, schema, & share DDL](ddl-database.md)
* [Table, view, & sequence DDL](ddl-table.md)
* [Data loading / unloading DDL](ddl-stage.md)
* [DDL for user-defined functions, external functions, and stored procedures](ddl-udf.md)
* [Data pipeline DDL](ddl-pipeline.md)
* [Listings DDL](ddl-listings.md)
* [Machine learning model DDL](ddl-model.md)

---
title: Data generation functions
source: https://docs.snowflake.com/en/sql-reference/functions-data-generation.md
section: SQL General Reference
---

# Data generation functions

Data generation functions allow you to generate data. Snowflake supports two types of data generation functions:

* Random, which can be useful for testing purposes.

  These functions produce a random value each time. Each value is independent of the other values generated by other calls to the function.
  The underlying algorithm produces pseudo-random values, and thus the values are not truly random or independent, but without knowing the
  algorithm, the values are essentially unpredictable, usually evenly distributed (if the sample size is large), and pseudo-independent of
  each other.
* Controlled distribution, which can be useful for providing unique ID numbers for records that do not already have unique identifiers.

  These functions produce values that are not independent. For example, the [NORMAL](functions/normal.md) function returns values that have an
  approximately “normal” (bell-shaped) distribution based on a specified mean and standard deviation. Thus, each new value generated is at
  least indirectly influenced by previously generated values as the function tries to maintain the specified distribution. As another
  example, the [SEQ](functions/seq1.md) family of functions return a sequence of values.

> **Note:**
>
> The [UNIFORM](functions/uniform.md) function is listed as a controlled-distribution function, but is intended to generate evenly-distributed values.
> In other words, it acts as though it’s a “random” function, but we refer to it as a controlled distribution function because the distribution
> is explicitly specified and because you can choose a data-generation function that produces non-uniform values over a large sample size.

## List of functions

| Function Name | Notes |
| --- | --- |
| **Random** |  |
| [RANDOM](functions/random.md) | Returns a pseudo-random 64-bit integer. |
| [RANDSTR](functions/randstr.md) | Returns a random string of specified length. |
| [UUID_STRING](functions/uuid_string.md) | Returns a random RFC 4122-compliant UUID as a formatted string. |
| **Controlled Distribution** |  |
| [NORMAL](functions/normal.md) | Returns a normal-distributed floating point number, with specified mean and standard deviation. |
| [UNIFORM](functions/uniform.md) | Returns a uniformly random number within the specified range. |
| [ZIPF](functions/zipf.md) | Returns a Zipf-distributed integer. |
| [SEQ1 / SEQ2 / SEQ4 / SEQ8](functions/seq1.md) | Returns a sequence of monotonically increasing integers. |

## Usage notes

* Random distribution functions are deterministic.
* Each random distribution function takes a generator expression, `gen`, as its last argument. The generator expression, `gen`, can
  be constant or variable:

  > + If constant, then the result of the random distribution function is constant (unless there are other, variable arguments, which is currently only supported for the
  >   [RANDSTR](functions/randstr.md) function).
  > + If variable, then the result of the random distribution function is variable.
* Generator expressions must be a type 64-bit integer, although implicit conversions are allowed. Any expression that can be converted into a 64-bit integer can be used as a generator expression.
* The randomness of any random distribution function is directly linked to the randomness of its generator expression. For most practical purposes, the [RANDOM](functions/random.md) data generation
  function is the best choice for randomly-generated integer values.
* Sequences generated by data generation functions are not guaranteed to be ordered and gap-free. This is because the numbers may be generated in parallel, in an unsynchronized fashion.

  For more details about sequences in Snowflake, see [Using Sequences](../user-guide/querying-sequences.md).
* Decimal-float ([DECFLOAT](data-types-numeric.md)) values can’t be used as arguments for data generation functions.

---
title: Data loading / unloading DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-stage.md
section: SQL General Reference
---

# Data loading / unloading DDL

Stages and file formats are named database objects that can be used to simplify and streamline bulk loading data into and unloading data out of database tables.

Pipes are named database objects that define COPY statements for loading micro-batches of data using Snowpipe.

## Stage management

Snowflake supports two types of stages for storing data files used for loading/unloading:

* Internal stages store the files internally within Snowflake.
* External stages store the files in an external location (i.e. S3 bucket) that is referenced by the stage. An external stage specifies location and credential information, if required, for the S3 bucket.

Both external and internal stages can include file format and copy options.

* [CREATE STAGE](sql/create-stage.md)
* [CREATE STAGE … CLONE](sql/create-clone.md)
* [ALTER STAGE](sql/alter-stage.md)
* [DROP STAGE](sql/drop-stage.md)
* [DESCRIBE STAGE](sql/desc-stage.md)
* [SHOW STAGES](sql/show-stages.md)

## File format management

A file format encapsulates information, such as file type (CSV, JSON, etc.) and formatting options specific to each type, for data files used for bulk loading/unloading.

* [CREATE FILE FORMAT](sql/create-file-format.md)
* [CREATE FILE FORMAT … CLONE](sql/create-clone.md)
* [ALTER FILE FORMAT](sql/alter-file-format.md)
* [DROP FILE FORMAT](sql/drop-file-format.md)
* [DESCRIBE FILE FORMAT](sql/desc-file-format.md)
* [SHOW FILE FORMATS](sql/show-file-formats.md)

## Git repository management

A Snowflake [Git repository stage](../developer-guide/git/git-overview.md) represents a local Git repository in Snowflake.

* [CREATE GIT REPOSITORY](sql/create-git-repository.md)
* [ALTER GIT REPOSITORY](sql/alter-git-repository.md)
* [DROP GIT REPOSITORY](sql/drop-git-repository.md)
* [DESCRIBE GIT REPOSITORY](sql/desc-git-repository.md)
* [SHOW GIT BRANCHES](sql/show-git-branches.md)
* [SHOW GIT REPOSITORIES](sql/show-git-repositories.md)
* [SHOW GIT TAGS](sql/show-git-tags.md)

## Pipe management

A pipe encapsulates a single COPY statement for loading a set of data files from an ingestion queue into a table.

* [CREATE PIPE](sql/create-pipe.md)
* [ALTER PIPE](sql/alter-pipe.md)
* [DROP PIPE](sql/drop-pipe.md)
* [DESCRIBE PIPE](sql/desc-pipe.md)
* [SHOW PIPES](sql/show-pipes.md)

---
title: Data Manipulation Language (DML) commands
source: https://docs.snowflake.com/en/sql-reference/sql-dml.md
section: SQL General Reference
---

# Data Manipulation Language (DML) commands

This topic provides links to all the DML commands, grouped by category.

## General DML

Commands for inserting, deleting, updating, and merging data in Snowflake tables:

* [INSERT](sql/insert.md)
* [INSERT (multi-table)](sql/insert-multi-table.md)
* [MERGE](sql/merge.md)
* [UPDATE](sql/update.md)
* [DELETE](sql/delete.md)
* [TRUNCATE TABLE](sql/truncate-table.md)

## Data loading / unloading DML

Commands for bulk copying data into and out of Snowflake tables:

* [COPY INTO <table>](sql/copy-into-table.md) (loading/importing data)
* [COPY INTO <location>](sql/copy-into-location.md) (unloading/exporting data)

See also:
:   [VALIDATE](functions/validate.md) (table function)

## File staging commands (for data loading / unloading)

These commands do not perform any actual DML, but are used to stage and manage files stored in Snowflake locations (named internal stages, table stages,
and user stages), for the purpose of loading and unloading data:

* [PUT](sql/put.md)
* [GET](sql/get.md)
* [LIST](sql/list.md) (can also be used with named external stages)
* [REMOVE](sql/remove.md)

---
title: Data metric functions
source: https://docs.snowflake.com/en/sql-reference/functions-data-metric.md
section: SQL General Reference
---

# Data metric functions

Snowflake provides built-in system data metric functions to measure data quality for tables and views:

* [ACCEPTED_VALUES (system data metric function)](functions/dmf_accepted_values.md)
* [AVG (system data metric function)](functions/dmf_avg.md)
* [BLANK_COUNT (system data metric function)](functions/dmf_blank_count.md)
* [BLANK_PERCENT (system data metric function)](functions/dmf_blank_percent.md)
* [DATA_METRIC_SCHEDULED_TIME (system data metric function)](functions/dmf_data_metric_schedule_time.md)
* [DUPLICATE_COUNT (system data metric function)](functions/dmf_duplicate_count.md)
* [FRESHNESS (system data metric function)](functions/dmf_freshness.md)
* [MAX (system data metric function)](functions/dmf_max.md)
* [MIN (system data metric function)](functions/dmf_min.md)
* [NULL_COUNT (system data metric function)](functions/dmf_null_count.md)
* [NULL_PERCENT (system data metric function)](functions/dmf_null_percent.md)
* [ROW_COUNT (system data metric function)](functions/dmf_row_count.md)
* [STDDEV (system data metric function)](functions/dmf_stddev.md)
* [UNIQUE_COUNT (system data metric function)](functions/dmf_unique_count.md)

For details, see [System data metric functions](../user-guide/data-quality-system-dmfs.md).

---
title: Data pipeline DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-pipeline.md
section: SQL General Reference
---

# Data pipeline DDL

Snowflake provides a full set of DDL commands for creating and managing streams and tasks.

## Stream management

* [CREATE STREAM](sql/create-stream.md)
* [CREATE stream … CLONE](sql/create-clone.md)
* [ALTER STREAM](sql/alter-stream.md)
* [DROP STREAM](sql/drop-stream.md)
* [SHOW STREAMS](sql/show-streams.md)

## Task management

* [CREATE TASK](sql/create-task.md)
* [CREATE task … CLONE](sql/create-clone.md)
* [ALTER TASK](sql/alter-task.md)
* [DROP TASK](sql/drop-task.md)
* [EXECUTE TASK](sql/execute-task.md)
* [SHOW TASKS](sql/show-tasks.md)

---
title: Data type conversion
source: https://docs.snowflake.com/en/sql-reference/data-type-conversion.md
section: SQL General Reference
---

# Data type conversion

In many cases, a value of one data type can be converted to another data type. For example, an
[INTEGER](data-types-numeric.md) value can be converted to a
[floating-point data type](data-types-numeric.md) value. Converting a data type is called *casting*.

## Explicit casting vs implicit casting

Users can explicitly convert a value from one data type to another. This is called *explicit casting*.

In some situations, Snowflake converts a value to another data type automatically. This is called *implicit casting* or *coercion*.

### Explicit casting

Users can explicitly cast a value by using any of the following options:

* The [CAST](functions/cast.md) function.
* The `::` operator, called the *cast operator*.
* The appropriate SQL function; for example, [TO_DOUBLE](functions/to_double.md).

For example, each query casts a string value to a DATE value:

> ```sqlexample
> SELECT CAST('2022-04-01' AS DATE);
>
> SELECT '2022-04-01'::DATE;
>
> SELECT TO_DATE('2022-04-01');
> ```

Casting is allowed in most contexts in which a general expression is allowed, including the WHERE clause. For example:

> ```sqlexample
> SELECT date_column
>   FROM log_table
>   WHERE date_column >= '2022-04-01'::DATE;
> ```

### Implicit casting (coercion)

Coercion occurs when a function (or operator) requires a data type that is different from, but compatible with, the arguments
(or operands).

* Examples for functions or stored procedures:

  + The following code coerces the INTEGER value in column `my_integer_column` to FLOAT so that the value can
    be passed to the function `my_float_function()`, which expects a FLOAT:

    > ```sqlexample
    > SELECT my_float_function(my_integer_column)
    >   FROM my_table;
    > ```
* Examples for operators:

  + The following code coerces the INTEGER value `17` to VARCHAR so that the values can be concatenated by using
    the `||` operator:

    > ```sqlexample
    > SELECT 17 || '76';
    > ```

    The result of this SELECT statement is the string `'1776'`.
  + The following statement coerces the INTEGER value in column `my_integer_column` to FLOAT so that the value can be
    compared to the value `my_float_column` by using the `<` comparison operator:

    > ```sqlexample
    > SELECT ...
    >   FROM my_table
    >   WHERE my_integer_column < my_float_column;
    > ```

Not all contexts — for example, not all operators — support coercion.

## Casting and precedence

When casting inside an expression, the code must take into account the precedence of the cast operator relative to other
operators in the expression.

Consider the following example:

```sqlexample
SELECT height * width::VARCHAR || ' square meters'
  FROM dimensions;
```

The cast operator has higher precedence than the arithmetic operator `*` (multiply), so the statement is
interpreted as shown in the following example:

```sqlexample
... height * (width::VARCHAR) ...
```

To cast the result of the expression `height * width`, use parentheses, as shown in the following example:

```sqlexample
SELECT (height * width)::VARCHAR || ' square meters'
  FROM dimensions;
```

As another example, consider the following statement:

```sqlexample
SELECT -0.0::FLOAT::BOOLEAN;
```

You might expect this to be interpreted as shown in the following example:

```sqlexample
SELECT (-0.0::FLOAT)::BOOLEAN;
```

Therefore, it would be expected to return FALSE (0 = FALSE, 1 = TRUE).

However, the cast operator has higher precedence than the unary minus (negation) operator, so the
statement is interpreted as shown in the following example:

```sqlexample
SELECT -(0.0::FLOAT::BOOLEAN);
```

Therefore, the query results in an error message because the unary minus can’t be applied to a BOOLEAN.

## Data types that can be cast

The following table shows the valid data type conversions in Snowflake. The table also shows which coercions Snowflake
can perform automatically.

> **Note:**
>
> Internally, the [CAST](functions/cast.md) function and the `::` operator call the appropriate conversion
> function. For example, if you cast a NUMBER to a BOOLEAN, Snowflake calls the [TO_BOOLEAN](functions/to_boolean.md)
> function. The usage notes for each conversion function apply when the function is called indirectly by using a cast, and also when
> the function is called directly. For example, if you execute `CAST(my_decimal_column AS BOOLEAN)`, the rules for calling
> TO_BOOLEAN with a DECIMAL value apply. For convenience, the table includes links to the relevant conversion functions.

For more information about conversions between [semi-structured types](data-types-semistructured.md) and
[structured types](data-types-structured.md), see [Converting structured and semi-structured types](data-types-structured.md).

| Source data type | Target data type | Castable | Coercible | Conversion function | Notes |
| --- | --- | --- | --- | --- | --- |
| ARRAY |  |  |  |  |  |
|  | [VARCHAR](data-types-text.md) | ✔ | ❌ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ✔ | [TO_VARIANT](functions/to_variant.md) | None. |
|  | [VECTOR](data-types-vector.md) | ✔ | ✔ |  | Use explicit casting for conversion. For more information, see [Vector conversion](data-types-vector.md). |
| BINARY |  |  |  |  |  |
|  | [VARCHAR](data-types-text.md) | ✔ | ❌ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| BOOLEAN |  |  |  |  |  |
|  | [DECFLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DECFLOAT](functions/to_decfloat.md) | For example, from `FALSE` to `0`. |
|  | [NUMBER](data-types-numeric.md) | ✔ | ❌ | [TO_NUMBER](functions/to_decimal.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | For example, from `TRUE` to `'true'`. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ✔ | [TO_VARIANT](functions/to_variant.md) | None. |
| DATE |  |  |  |  |  |
|  | [TIMESTAMP](data-types-datetime.md) | ✔ | ✔ | [TO_TIMESTAMP](functions/to_timestamp.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| DECFLOAT . *(decimal floating-point numbers)* |  |  |  |  |  |
|  | [BOOLEAN](data-types-logical.md) | ✔ | ✔ | [TO_BOOLEAN](functions/to_boolean.md) | For example, from `0` to `FALSE`. |
|  | [FLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DOUBLE](functions/to_double.md) | None. |
|  | [NUMBER[(p,s)]](data-types-numeric.md) | ✔ | ✔ | [TO_NUMBER](functions/to_decimal.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
| FLOAT . *(floating-point numbers)* |  |  |  |  |  |
|  | [BOOLEAN](data-types-logical.md) | ✔ | ✔ | [TO_BOOLEAN](functions/to_boolean.md) | For example, from `0.0` to `FALSE`. |
|  | [DECFLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DECFLOAT](functions/to_decfloat.md) | None. |
|  | [NUMBER[(p,s)]](data-types-numeric.md) | ✔ | ✔ | [TO_NUMBER](functions/to_decimal.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ✔ | [TO_VARIANT](functions/to_variant.md) | None. |
| GEOGRAPHY |  |  |  |  |  |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| GEOMETRY |  |  |  |  |  |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| Interval data types |  |  |  |  |  |
|  | [NUMBER[(p,s)]](data-types-numeric.md) | ✔ | ❌ | [TO_NUMBER](functions/to_decimal.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
| NUMBER[(p,s)] . *(Fixed-point numbers, including INTEGER)* |  |  |  |  |  |
|  | [BOOLEAN](data-types-logical.md) | ✔ | ✔ | [TO_BOOLEAN](functions/to_boolean.md) | For example, from `0` to `FALSE`. |
|  | [DECFLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DECFLOAT](functions/to_decfloat.md) | None. |
|  | [FLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DOUBLE](functions/to_double.md) | None. |
|  | [Interval data types](data-types-datetime.md) | ✔ | ❌ | — | Cast is only supported for interval data types with a single component: INTERVAL YEAR, INTERVAL DAY, INTERVAL HOUR, INTERVAL MINUTE, and INTERVAL SECOND. |
|  | [TIMESTAMP](data-types-datetime.md) | ✔ | ✔ | [TO_TIMESTAMP](functions/to_timestamp.md) | [1] |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ✔ | [TO_VARIANT](functions/to_variant.md) | None. |
| OBJECT |  |  |  |  |  |
|  | [ARRAY](data-types-semistructured.md) | ✔ | ❌ | [TO_ARRAY](functions/to_array.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ❌ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ✔ | [TO_VARIANT](functions/to_variant.md) | None. |
| TIME |  |  |  |  |  |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| TIMESTAMP |  |  |  |  |  |
|  | [DATE](data-types-datetime.md) | ✔ | ✔ | [TO_DATE , DATE](functions/to_date.md) | None. |
|  | [TIME](data-types-datetime.md) | ✔ | ✔ | [TO_TIME , TIME](functions/to_time.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| VARCHAR |  |  |  |  |  |
|  | [BOOLEAN](data-types-logical.md) | ✔ | ✔ | [TO_BOOLEAN](functions/to_boolean.md) | For example, from `'false'` to `FALSE`. |
|  | [DATE](data-types-datetime.md) | ✔ | ✔ | [TO_DATE , DATE](functions/to_date.md) | None. |
|  | [DECFLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DECFLOAT](functions/to_decfloat.md) | None. |
|  | [FLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DOUBLE](functions/to_double.md) | For example, from `'12.34'` to `12.34`. |
|  | [Interval data types](data-types-datetime.md) | ✔ | ✔ | — | The VARCHAR value is parsed in the same way as an [interval literal](data-types-datetime.md). |
|  | [NUMBER[(p,s)]](data-types-numeric.md) | ✔ | ✔ | [TO_NUMBER](functions/to_decimal.md) | For example, from `'12.34'` to `12.34`. |
|  | [TIME](data-types-datetime.md) | ✔ | ✔ | [TO_TIME , TIME](functions/to_time.md) | None. |
|  | [TIMESTAMP](data-types-datetime.md) | ✔ | ✔ | [TO_TIMESTAMP](functions/to_timestamp.md) | None. |
|  | [UUID](data-types-uuid.md) | ✔ | ✔ | [TO_UUID](functions/to_uuid.md) | None. |
|  | [VARIANT](data-types-semistructured.md) | ✔ | ❌ | [TO_VARIANT](functions/to_variant.md) | None. |
| UUID |  |  |  |  |  |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
| VARIANT |  |  |  |  |  |
|  | [ARRAY](data-types-semistructured.md) | ✔ | ✔ | [TO_ARRAY](functions/to_array.md) | None. |
|  | [BOOLEAN](data-types-logical.md) | ✔ | ✔ | [TO_BOOLEAN](functions/to_boolean.md) | For example, from a VARIANT containing `'false'` to `FALSE`. |
|  | [DATE](data-types-datetime.md) | ✔ | ✔ | [TO_DATE , DATE](functions/to_date.md) | None. |
|  | [FLOAT](data-types-numeric.md) | ✔ | ✔ | [TO_DOUBLE](functions/to_double.md) | None. |
|  | [GEOGRAPHY](data-types-geospatial.md) | ✔ | ❌ | [TO_GEOGRAPHY](functions/to_geography.md) | None. |
|  | [NUMBER[(p,s)]](data-types-numeric.md) | ✔ | ✔ | [TO_NUMBER](functions/to_decimal.md) | None. |
|  | [OBJECT](data-types-semistructured.md) | ✔ | ✔ | [TO_OBJECT](functions/to_object.md) | None. |
|  | [TIME](data-types-datetime.md) | ✔ | ✔ | [TO_TIME , TIME](functions/to_time.md) | None. |
|  | [TIMESTAMP](data-types-datetime.md) | ✔ | ✔ | [TO_TIMESTAMP](functions/to_timestamp.md) | None. |
|  | [VARCHAR](data-types-text.md) | ✔ | ✔ | [TO_VARCHAR](functions/to_char.md) | None. |
|  | [VECTOR](data-types-vector.md) | ✔ | ❌ |  | The VARIANT must contain an ARRAY of type FLOAT or INT. |
| VECTOR |  |  |  |  |  |
|  | [ARRAY](data-types-semistructured.md) | ✔ | ✔ | [TO_ARRAY](functions/to_array.md) | None. |

[1]

NUMBER can be converted to TIMESTAMP because the values are treated as seconds since the beginning of the epoch (1970-01-01 00:00:00).

> **Note:**
>
> For each listed data type — for example, FLOAT — the rules apply to all aliases for that data type. For example, the rules for FLOAT apply to
> DOUBLE, which is an alias for FLOAT.

## Usage notes

Except where stated otherwise, the following rules apply to both explicit casting and implicit casting:

* Conversion depends not only on the data type, but also the value, of the source; for example:

  + The VARCHAR value `'123'` can be converted to a numeric value, but the VARCHAR value `'xyz'` can’t be converted to
    a numeric value.
  + The ability to cast a specific value of type VARIANT depends on the type of the data *inside* the VARIANT. For
    example, if the VARIANT contains a value of type TIME, then you can’t cast the VARIANT value to a TIMESTAMP value,
    because you can’t cast a TIME value to a TIMESTAMP value.
* Snowflake performs implicit conversion of arguments to make
  them compatible. For example, if one of the input expressions is a numeric type, the return type
  is also a numeric type. That is, `SELECT COALESCE('17', 1);` first converts the VARCHAR value `'17'`
  to the NUMBER value `17`, and then returns the first non-NULL value.

  When conversion isn’t possible, implicit conversion fails. For example, `SELECT COALESCE('foo', 1);`
  returns an error because the VARCHAR value `'foo'` can’t be converted to a NUMBER value.

  We recommend passing in arguments of the same type or explicitly converting arguments if needed.

* When implicit conversion converts a non-numeric value to a numeric value, the result is a value
  of type NUMBER(18,5).

  For numeric string arguments that aren’t constants, if NUMBER(18,5) isn’t sufficient to represent
  the numeric value, then cast the argument to a type that
  can represent the value.

* For some pairs of data types, conversion can result in loss of precision; for example:

  + Converting a FLOAT value to an INTEGER value rounds the value.
  + Converting a value from fixed-point numeric — for example, NUMBER(38, 0) — to floating point — for example, FLOAT — can result
    in rounding or truncation if the fixed-point number can’t be precisely represented in a floating point number.
  + Converting a TIMESTAMP value to a DATE value removes the information about the time of day.
* Although Snowflake converts values in some situations where loss of precision can occur, Snowflake doesn’t allow conversion in
  other situations where a loss of precision would occur. For example, Snowflake doesn’t allow conversion when conversion would cause the
  following situations to happen:

  + Truncate a VARCHAR value. For example, Snowflake doesn’t cast VARCHAR(10) to VARCHAR(5), either implicitly or explicitly.
  + Result in the loss of digits other than the least significant digits. For example, the following loss of digits fails:

    ```sqlexample
    SELECT 12.3::FLOAT::NUMBER(3,2);
    ```

    In this example, the number `12.3` has two digits before the decimal point, but the data type `NUMBER(3,2)` has room for
    only one digit before the decimal point.
* When converting from a type with less precision to a type with more precision, conversion uses default values. For example,
  converting a DATE value to a TIMESTAMP_NTZ value causes the hour, minute, second, and fractional seconds to be set to `0`.
* When a FLOAT value is cast to a VARCHAR value, trailing zeros are omitted.

  For example, the following statements create a table and insert a row that contains a VARCHAR value, a FLOAT value, and
  a VARIANT value. The VARIANT value is constructed from JSON that contains a floating-point value represented with trailing zeros:

  ```sqlexample
  CREATE OR REPLACE TABLE convert_test_zeros (
    varchar1 VARCHAR,
    float1 FLOAT,
    variant1 VARIANT);

  INSERT INTO convert_test_zeros SELECT
    '5.000',
    5.000,
    PARSE_JSON('{"Loan Number": 5.000}');
  ```

  The following SELECT statement explicitly casts both the FLOAT column and the FLOAT value inside the VARIANT column to VARCHAR.
  In each case, the VARCHAR contains no trailing zeros:

  ```sqlexample
  SELECT varchar1,
         float1::VARCHAR,
         variant1:"Loan Number"::VARCHAR
    FROM convert_test_zeros;
  ```

  ```output
  +----------+-----------------+---------------------------------+
  | VARCHAR1 | FLOAT1::VARCHAR | VARIANT1:"LOAN NUMBER"::VARCHAR |
  |----------+-----------------+---------------------------------|
  | 5.000    | 5               | 5                               |
  +----------+-----------------+---------------------------------+
  ```
* Some operations can return different data types, depending on a conditional expression. For example, the following
  [IFNULL](functions/ifnull.md) calls return slightly different data types depending on the input values:

  ```sqlexample
  SELECT SYSTEM$TYPEOF(IFNULL(12.3, 0)),
         SYSTEM$TYPEOF(IFNULL(NULL, 0));
  ```

  ```output
  +--------------------------------+--------------------------------+
  | SYSTEM$TYPEOF(IFNULL(12.3, 0)) | SYSTEM$TYPEOF(IFNULL(NULL, 0)) |
  |--------------------------------+--------------------------------|
  | NUMBER(3,1)[SB1]               | NUMBER(1,0)[SB1]               |
  +--------------------------------+--------------------------------+
  ```

  If the expression has more than one possible data type, Snowflake chooses the data type based on the actual result.
  For more information about precision and scale in calculations, see [Scale and precision in arithmetic operations](operators-arithmetic.md).
  If the query generates more than one result — for example, multiple rows of results — Snowflake chooses a data type that
  is capable of holding each of the individual results.
* Some applications, such as SnowSQL, and some graphical user interfaces, such as Snowsight, apply their
  own conversion and formatting rules when they display data. For example, SnowSQL displays BINARY values as a string that contains
  only hexadecimal digits; that string is generated by implicitly calling a conversion function. Therefore, the data that SnowSQL
  displays might not unambiguously indicate which data conversions that Snowflake coerced.

---
title: DATA_QUALITY_MONITORING_ANOMALY_DETECTION_STATUS view
source: https://docs.snowflake.com/en/sql-reference/local/data_quality_monitoring_anomaly_detection_status.md
section: SQL General Reference
---

Schema:
:   [LOCAL](../local.md)

# DATA_QUALITY_MONITORING_ANOMALY_DETECTION_STATUS view

This view displays a row for every time a data metric function (DMF) ran with [anomaly detection](../../user-guide/data-quality-anomaly.md) enabled.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `scheduled_time` | TIMESTAMP_LTZ | The time that the DMF is scheduled to run based on the schedule that you set for the table or view. |
| `change_commit_time` | TIMESTAMP_LTZ | The time that the DMF trigger operation occurred, or `None` if the DMF is not scheduled to run by a trigger operation.  For information about the trigger operation, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md). |
| `measurement_time` | TIMESTAMP_LTZ | The time at which the metric was evaluated. |
| `table_id` | NUMBER | Internal, system-generated identifier of the table that is associated with the DMF. |
| `table_name` | VARCHAR | Name of the table that is associated with the DMF. |
| `table_schema` | VARCHAR | Name of the schema name that contains the table that is associated with the DMF. |
| `table_database` | VARCHAR | Name of the database that contains the table that is associated with the DMF. |
| `metric_id` | NUMBER | Internal, system-generated identifier of the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_return_type` | VARCHAR | Return type of the DMF. |
| `reference_id` | VARCHAR | The ID to uniquely identify the metric entity reference, known as the *association ID*. |
| `value` | VARIANT | The result of the DMF evaluation. |
| `is_anomaly` | BOOLEAN | If TRUE, the `value` returned by the DMF is an anomaly because it was outside the range of `upperbound` and `lowerbound`. |
| `upperbound` | NUMBER | Highest value that should be returned by the DMF based on the anomaly-detecting algorithm. Values returned by the DMF that are above this upper bound are considered anomalies. |
| `lowerbound` | NUMBER | Lowest value that should be returned by the DMF based on the anomaly-detecting algorithm. Values returned by the DMF that are below this lower bound are considered anomalies. |
| `forecast` | NUMBER | Value that the anomaly-detecting algorithm predicted would be returned by the DMF. |

## Access control requirements

The role used to query the view must be granted one of the following application roles:

* SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER
* SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN

---
title: DATA_QUALITY_MONITORING_EXPECTATION_STATUS view
source: https://docs.snowflake.com/en/sql-reference/local/data_quality_monitoring_expectation_status.md
section: SQL General Reference
---

Schema:
:   [LOCAL](../local.md)

# DATA_QUALITY_MONITORING_EXPECTATION_STATUS view

This view displays a row for every time a data metric function (DMF) with an [expectation](../../user-guide/data-quality-expectations.md)
was run in your account.

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| `scheduled_time` | TIMESTAMP_LTZ | The time the DMF is scheduled to run based on the schedule that you set for the table or view. |
| `change_commit_time` | TIMESTAMP_LTZ | The time the DMF trigger operation occurred, or `None` if the DMF is not scheduled to run by a trigger operation.  For information about the trigger operation, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md). |
| `measurement_time` | TIMESTAMP_LTZ | The time at which the metric was evaluated. |
| `table_id` | NUMBER | Internal/system-generated identifier of the table that is associated with the DMF. |
| `table_name` | VARCHAR | Name of the table that is associated with the DMF. |
| `table_schema` | VARCHAR | Name of the schema name that contains the table that is associated with the DMF. |
| `table_database` | VARCHAR | Name of the database that contains the table that is associated with the DMF. |
| `metric_id` | NUMBER | Internal/system-generated identifier of the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_return_type` | VARCHAR | Return type of the DMF. |
| `arguments_ids` | ARRAY | Array of the identifiers of the DMF arguments. Array elements are in the same order as the arguments. |
| `arguments_types` | ARRAY | Array of the domain/type of each argument. Array elements are in the same order as the arguments.  Currently only supports COLUMN type arguments. |
| `arguments_names` | ARRAY | Array of the names of the DMF arguments. For column arguments, each element is the name of a column. Array elements are in the same order as the arguments. |
| `reference_id` | VARCHAR | The ID to uniquely identify the metric entity reference, known as the association ID. |
| `value` | VARIANT | The result of the DMF evaluation. |
| `expectation_name` | VARCHAR | Name that was given to the expectation when it was added to the association between the DMF and the object. |
| `expectation_id` | VARCHAR | System-generated identifier. |
| `expectation_expression` | VARCHAR | Boolean expression of the expectation. See [Defining what meets the expectation](../../user-guide/data-quality-expectations.md). |
| `expectation_violated` | BOOLEAN | If TRUE, the expectation was violated. An expectation is violated when the `expectation_expression` evaluates to FALSE.  A NULL value indicates the evaluation of the expectation failed. |

## Access control requirements

The role used to query the view must be granted one of the following application roles:

* SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER
* SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN

---
title: DATA_QUALITY_MONITORING_RESULTS view
source: https://docs.snowflake.com/en/sql-reference/local/data_quality_monitoring_results.md
section: SQL General Reference
---

Schema:
:   [LOCAL](../local.md)

# DATA_QUALITY_MONITORING_RESULTS view

This view displays a row for each result of calling a [data metric function](../../user-guide/data-quality-intro.md) in your
account.

## Columns

The columns in the view are defined as follows:

| Column name | Data type | Description |
| --- | --- | --- |
| `scheduled_time` | TIMESTAMP_LTZ | The time the DMF is scheduled to run based on the schedule that you set for the table or view. |
| `change_commit_time` | TIMESTAMP_LTZ | The time the DMF trigger operation occurred, or `None` if the DMF is not scheduled to run by a trigger operation.  For information about the trigger operation, see [Adjust the schedule for DMFs](../../user-guide/data-quality-working.md). |
| `measurement_time` | TIMESTAMP_LTZ | The time at which the metric was evaluated. |
| `table_id` | NUMBER | Internal/system-generated identifier of the table that the DMF is associated with. |
| `table_name` | VARCHAR | Name of the table that the DMF is associated with. |
| `table_schema` | VARCHAR | Name of the schema that contains the table that the DMF is associated with. |
| `table_database` | VARCHAR | Name of the database that contains the table that the DMF is associated with. |
| `metric_id` | NUMBER | Internal/system-generated identifier of the DMF. |
| `metric_name` | VARCHAR | Name of the DMF. |
| `metric_schema` | VARCHAR | Name of the schema that contains the DMF. |
| `metric_database` | VARCHAR | Name of the database that contains the DMF. |
| `metric_return_type` | VARCHAR | Return type of the DMF. |
| `arguments_ids` | ARRAY | Array of the identifiers of the DMF arguments. Array elements are in the same order as the arguments. |
| `arguments_types` | ARRAY | Array of the domain/type of each DMF argument. Array elements are in the same order as the arguments.  Currently only supports COLUMN type arguments. |
| `arguments_names` | ARRAY | Array of the names of the DMF arguments. For column arguments, each element is the name of a column. Array elements are in the same order as the arguments. |
| `reference_id` | VARCHAR | The ID to uniquely identify the metric entity reference, known as the association ID. |
| `value` | VARIANT | The result of the DMF evaluation. |

## Access control requirements

The role used to query the view must be granted the SNOWFLAKE.DATA_QUALITY_MONITORING_VIEWER application role or the
SNOWFLAKE.DATA_QUALITY_MONITORING_ADMIN application role.

---
title: Database, schema, & share DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-database.md
section: SQL General Reference
---

# Database, schema, & share DDL

Databases and schemas are used to organize data stored in Snowflake:

* A database is a logical grouping of schemas. Each database belongs to a single Snowflake account.
* A schema is a logical grouping of database objects (tables, views, etc.). Each schema belongs to a single database.

Together, a database and schema comprise a *namespace* in Snowflake. When performing any operations on database objects in Snowflake, the
namespace is inferred from the current database and schema in use for the session. If a database and schema are not in use for the session,
the namespace must be explicitly specified when performing any operations on the objects.

Snowflake provides a full set of DDL commands for creating and managing databases and schemas.

In addition, Snowflake provides DDL for creating and managing shares. A share specifies a set of database objects (schemas, tables, and
secure views) containing data you wish to share with other Snowflake accounts.

## Database management

* [CREATE DATABASE](sql/create-database.md)
* [CREATE DATABASE (catalog-linked)](sql/create-database-catalog-linked.md)
* [CREATE DATABASE … CLONE](sql/create-clone.md)
* [ALTER DATABASE](sql/alter-database.md)
* [ALTER DATABASE (catalog-linked)](sql/alter-database-catalog-linked.md)
* [DESCRIBE DATABASE](sql/desc-database.md)
* [DROP DATABASE](sql/drop-database.md)
* [UNDROP DATABASE](sql/undrop-database.md)
* [USE DATABASE](sql/use-database.md)
* [SHOW DATABASES](sql/show-databases.md)

## Schema management

* [CREATE SCHEMA](sql/create-schema.md)
* [CREATE SCHEMA … CLONE](sql/create-clone.md)
* [ALTER SCHEMA](sql/alter-schema.md)
* [DROP SCHEMA](sql/drop-schema.md)
* [UNDROP SCHEMA](sql/undrop-schema.md)
* [USE SCHEMA](sql/use-schema.md)
* [SHOW SCHEMAS](sql/show-schemas.md)

## Share management

* [CREATE SHARE](sql/create-share.md)
* [ALTER SHARE](sql/alter-share.md)
* [DROP SHARE](sql/drop-share.md)
* [SHOW SHARES](sql/show-shares.md)
* [DESCRIBE SHARE](sql/desc-share.md)

---
title: Date & time data types
source: https://docs.snowflake.com/en/sql-reference/data-types-datetime.md
section: SQL General Reference
---

# Date & time data types

Snowflake supports data types for managing dates, times, and timestamps (combined date + time). Snowflake also supports formats for
string constants used in manipulating dates, times, and timestamps.

## Data types

Snowflake supports the following date and time data types:

* DATE
* DATETIME
* Interval data types
* TIME
* TIMESTAMP_LTZ , TIMESTAMP_NTZ , TIMESTAMP_TZ

> **Note:**
>
> For DATE and TIMESTAMP data, Snowflake recommends using years between 1582 and 9999. Snowflake accepts some
> years outside this range, but years prior to 1582 should be avoided due to
> limitations on the Gregorian Calendar.

### DATE

Snowflake supports a single DATE data type for storing dates (with no time elements).

DATE accepts dates in the most common forms (`YYYY-MM-DD`, `DD-MON-YYYY`, and so on).

In addition, all accepted TIMESTAMP values are valid inputs for dates, but the TIME information is truncated.

### DATETIME

DATETIME is synonymous with TIMESTAMP_NTZ.

### Interval data types

Interval data types store values that represent a duration of time. You can calculate an interval as the difference
between two dates or times. An interval only defines a duration, so it doesn’t have a start or end point in time.
For example, you might define an interval as three years and seven months.

Snowflake supports the following year-month variations of interval data types:

| Data type | Description |
| --- | --- |
| INTERVAL YEAR | Represents a duration of time in years. |
| INTERVAL YEAR TO MONTH | Represents a duration of time in years and months. |
| INTERVAL MONTH | Represents a duration of time in months. |

Snowflake supports the following day-time variations of interval data types:

| Data type | Description |
| --- | --- |
| INTERVAL DAY | Represents a duration of time in days. |
| INTERVAL DAY TO HOUR | Represents a duration of time in days and hours. |
| INTERVAL DAY TO MINUTE | Represents a duration of time in days, hours, and minutes. |
| INTERVAL DAY TO SECOND | Represents a duration of time in days, hours, minutes, seconds, and fractional seconds. |
| INTERVAL HOUR | Represents a duration of time in hours. |
| INTERVAL HOUR TO MINUTE | Represents a duration of time in hours and minutes. |
| INTERVAL HOUR TO SECOND | Represents a duration of time in hours, minutes, seconds, and fractional seconds. |
| INTERVAL MINUTE | Represents a duration of time in minutes. |
| INTERVAL MINUTE TO SECOND | Represents a duration of time in minutes, seconds, and fractional seconds. |
| INTERVAL SECOND | Represents a duration of time in seconds and fractional seconds. |

The following sections describe interval data types in more detail:

* Benefits of interval data types
* Syntax of interval data types
* Representing interval values
* Interval formats
* Examples of interval values
* Operations that involve date and time values
* Functions that accept interval values as arguments
* Examples for interval data types
* Limitations for interval data types

> **Note:**
>
> You can also use interval constants for date and time arithmetic. However,
> interval constants don’t support interval storage as a column type.

#### Benefits of interval data types

Interval data types provide the following benefits:

* Ensure accurate date arithmetic without ambiguity.
* Eliminate the need for manual [conversion and casting](data-type-conversion.md) from
  integer-based durations.
* Optimize storage for data that represents intervals of time.
* Optimize query execution for duration data.
* Simplify the migration of data from third-party databases, such as Databricks, Oracle, and Teradata.
* Comply fully with ANSI standards.

#### Syntax of interval data types

To specify an interval data type, use the following syntax:

```sqlsyntax
INTERVAL { yearMonthQualifier | dayTimeQualifier }
```

Where:

> ```sqlsyntax
> yearMonthQualifier ::=
>   {
>     YEAR [ (<precision>) ] [ TO MONTH ]
>     | MONTH [ (<precision>) ]
>   }
> ```
>
> ```sqlsyntax
> dayTimeQualifier ::=
>   {
>     DAY [ (<precision>) ] [ TO { HOUR | MINUTE | SECOND [ (<fractional_seconds_precision>) ] } ]
>     | HOUR [ (<precision>) ] [ TO { MINUTE | SECOND [ (<fractional_seconds_precision>) ] } ]
>     | MINUTE [ (<precision>) ] [ TO SECOND [ (<fractional_seconds_precision>) ] ]
>     | SECOND [ (<precision>) [ , (<fractional_seconds_precision>) ] ]
>   }
> ```

Properties:

* `precision` is the total number of digits that is allowed. Precision can range from `1` to `9`.

  Default: `9`
* `fractional_seconds_precision` is the number of digits in the fractional part of a second. Time precision
  can range from `0` (seconds) to `9` (nanoseconds).

  Default: `9`

Use this syntax when you are specifying an interval data type. For example, the following table has a `duration`
column of INTERVAL YEAR TO MONTH type:

```sqlexample
CREATE OR REPLACE TEMPORARY TABLE sample_table_with_interval (
  id VARCHAR,
  duration INTERVAL YEAR(2) TO MONTH);
```

#### Representing interval values

You can represent an interval value by using an interval literal or an interval format:

* Interval literals
* Interval formats
* Examples of interval values

##### Interval literals

An interval literal is an expression that specifies a duration of time in a string literal. Use the following syntax
to specify an interval literal:

```sqlsyntax
INTERVAL '[ <sign> ] <string>' { <yearMonthQualifier> | <dayTimeQualifier> }
```

Where:

* `sign` is an optional symbol that specifies a positive (`+`) or negative (`-`) duration of
  time.

  Default: `+`.
* `string` is a value that represents a time duration.
* `yearMonthQualifier` is a qualifier that is defined in Syntax of interval data types.
* `dayTimeQualifier` is a qualifier that is defined in Syntax of interval data types.

##### Interval formats

String literals in specific formats can represent interval values.

To specify values for years and months, use the following format:

```sqlsyntax
'<sign><Y>-<MM>'
```

Where:

* `sign` is a required symbol that specifies a positive (`+`) or negative (`-`) duration of
  time.

  Default: `+`.
* `Y` is the number of years. The number of digits that is allowed (precision) depends on the data type
  of the value.
* `MM` is two digits for the number of months, from `00` to `11`.

To specify values for days, hours, seconds, and fractional seconds, use the following format:

```sqlsyntax
'<sign>[<D>] [<HH24>]:[<MI>]:[<SS>].[<F>]'
```

Where:

* `sign` is a required symbol that specifies a positive (`+`) or negative (`-`) duration of
  time.

  Default: `+`.
* `D` is the number of days. The number of digits that is allowed (precision) depends on the data type
  of the value.

  Omit `D` for values of the following types:

  + INTERVAL HOUR
  + INTERVAL HOUR TO MINUTE
  + INTERVAL HOUR TO SECOND
  + INTERVAL MINUTE
  + INTERVAL MINUTE TO SECOND
  + INTERVAL SECOND
* `HH24` is two digits for the number of hours, from `00` to `23`.

  Omit `HH24` for values of the following types:

  + INTERVAL DAY
  + INTERVAL MINUTE
  + INTERVAL MINUTE TO SECOND
  + INTERVAL SECOND
* `MI` is two digits for the number of minutes, from `00` through `59`.

  Omit `MI` for values of the following types:

  + INTERVAL DAY TO HOUR
  + INTERVAL DAY
  + INTERVAL HOUR
  + INTERVAL SECOND
* `SS` is two digits for the number of seconds, from `00` through `59`.

  Omit `SS` for values of the following types:

  + INTERVAL DAY
  + INTERVAL DAY TO HOUR
  + INTERVAL DAY TO MINUTE
  + INTERVAL HOUR
  + INTERVAL HOUR TO MINUTE
  + INTERVAL MINUTE
* `F` is the number of fractional seconds for the data types that include seconds. The number of digits
  that is allowed (precision) depends on the data type of the value.

The following usage notes apply to string literals in interval format:

* The string literal representation applies when you use the CAST or TO_CHAR function to cast intervals explicitly
  to text strings.
* Leading zeros in a field specify precision.

##### Examples of interval values

The following table shows how to represent various interval values. The values shown in the table conform to the
following rules for interval values:

* For positive values, the plus sign `+` is optional for interval literal values but required for interval
  format values.
* In the interval literal values, the value in parentheses specifies the precision, which is the number of digits
  that is allowed. For example, `YEAR(3)` specifies that three digits are allowed in the year.
* In the interval format values, the primary field (the leading field) does not include leading zeros.
  Subordinate fields use a fixed number of digits. For example, in a YEAR TO MONTH value like `+1-08`,
  the year field has no leading zeros, and the month field uses two digits.

| Duration | Type | Interval literal value | Interval format value |
| --- | --- | --- | --- |
| Positive 5 years | INTERVAL YEAR | `INTERVAL '5' YEAR(2)` | `'+5'` |
| Positive 1 year and 8 months | INTERVAL YEAR TO MONTH | `INTERVAL '1-08' YEAR(3) TO MONTH` | `'+001-08'` |
| Negative 5 months | INTERVAL MONTH | `INTERVAL '-5' MONTH(2)` | `'-5'` |
| Positive 14 months | INTERVAL MONTH | `INTERVAL '14' MONTH(2)` | `'+14'` |
| Negative 44 years and 11 months | INTERVAL YEAR TO MONTH | `INTERVAL '-44-11' YEAR(2) TO MONTH` | `'-44-11'` |
| Positive 11 days, 10 hours, and 9 minutes | INTERVAL DAY TO MINUTE | `INTERVAL '11 10:09' DAY(2) TO MINUTE` | `'+11 10:09'` |
| Positive 2 days, 23 hours, 8 minutes, 23 seconds, and 275 milliseconds | INTERVAL DAY TO SECOND | `INTERVAL '02 23:08:23.275' DAY(2) TO SECOND(3)` | `'+2 23:08:23.275'` |
| Positive 4 seconds and 300 milliseconds | INTERVAL SECOND | `INTERVAL '4.3' SECOND(5, 6)` | `'+4.300000'` |

#### Operations that involve date and time values

The following table shows the data type of the result for valid arithmetic operations that involve interval values:

| First operand | Operator | Second operand | Result type |
| --- | --- | --- | --- |
| Timestamp | `-` | Timestamp | An interval data type |
| Date or timestamp | `+` | Interval | DATE, DATETIME, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ |
| Date or timestamp | `-` | Interval | DATE, DATETIME, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ |
| Interval | `+` | Date or timestamp | DATE, DATETIME, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ |
| Numeric | `*` | Interval | An interval data type |
| Interval | `*` | Numeric | An interval data type |
| Interval | `/` | Numeric | An interval data type |
| Interval | `+` | Interval | An interval data type |
| Interval | `-` | Interval | An interval data type |

For operations that involve two interval values, the values must both be year-month interval values, or
they must both be day-time interval values. Operations that mix year-month interval values and
day-time interval values aren’t supported. When the operation involves two year-month interval values,
the result type is a year-month interval type. When the operation involves two day-time interval
values, the result type is a day-time interval type.

#### Functions that accept interval values as arguments

The following functions accept interval values as arguments:

* [ABS](functions/abs.md)
* [ANY_VALUE](functions/any_value.md)
* [AVG](functions/avg.md)
* [[ NOT ] BETWEEN](functions/between.md)
* [COALESCE](functions/coalesce.md)
* [COUNT](functions/count.md)
* [COUNT_IF](functions/count_if.md)
* [DATE_PART](functions/date_part.md)
* [DAY](functions/year.md)
* [DENSE_RANK](functions/dense_rank.md)
* [[ NOT ] EQUAL_NULL](functions/equal_null.md)
* [EXTRACT](functions/extract.md)
* [FIRST_VALUE](functions/first_value.md)
* [GREATEST](functions/greatest.md)
* [GREATEST_IGNORE_NULLS](functions/greatest_ignore_nulls.md)
* [HOUR](functions/hour-minute-second.md)
* [IFF](functions/iff.md)
* [IFNULL](functions/ifnull.md)
* [[ NOT ] IN](functions/in.md)
* [IS [ NOT ] DISTINCT FROM](functions/is-distinct-from.md)
* [IS [ NOT ] NULL](functions/is-null.md)
* [LAG](functions/lag.md)
* [LAST_VALUE](functions/last_value.md)
* [LEAD](functions/lead.md)
* [LEAST](functions/least.md)
* [LEAST_IGNORE_NULLS](functions/least_ignore_nulls.md)
* [MAX](functions/max.md)
* [MAX_BY](functions/max_by.md)
* [MEDIAN](functions/median.md)
* [MIN](functions/min.md)
* [MINUTE](functions/hour-minute-second.md)
* [MIN_BY](functions/min_by.md)
* [MONTH](functions/year.md)
* [NTH_VALUE](functions/nth_value.md)
* [NVL](functions/nvl.md)
* [NVL2](functions/nvl2.md)
* [PERCENT_RANK](functions/percent_rank.md)
* [PERCENTILE_CONT](functions/percentile_cont.md)
* [PERCENTILE_DISC](functions/percentile_disc.md)
* [RANK](functions/rank.md)
* [SECOND](functions/hour-minute-second.md)
* [SIGN](functions/sign.md)
* [SUM](functions/sum.md)
* [TO_CHAR](functions/to_char.md)
* [TO_VARCHAR](functions/to_char.md)
* [YEAR](functions/year.md)

#### Examples for interval data types

The following examples show how to use interval data types:

* Performing arithmetic by using interval data
* Inserting and querying year-month interval data
* Inserting and querying day-time interval data
* Copying interval data into a table and querying the table

##### Performing arithmetic by using interval data

The following examples perform arithmetic by using interval data.

Add one year and one month to a date:

```sqlexample
SELECT TO_DATE('2024-01-01') + INTERVAL '1-1' YEAR TO MONTH
  AS date_plus_one_year_one_month;
```

```output
+------------------------------+
| DATE_PLUS_ONE_YEAR_ONE_MONTH |
|------------------------------|
| 2025-02-01                   |
+------------------------------+
```

Subtract one year and one month from a date:

```sqlexample
SELECT TO_DATE('2024-01-01') + INTERVAL '-1-1' YEAR TO MONTH
  AS date_plus_one_year_one_month;
```

```output
+------------------------------+
| DATE_PLUS_ONE_YEAR_ONE_MONTH |
|------------------------------|
| 2022-12-01                   |
+------------------------------+
```

Add a period of time to a timestamp:

```sqlexample
SELECT TO_TIMESTAMP('2024-01-01 08:08:08.99') + INTERVAL '1 01:01:01.7878' DAY TO SECOND
  AS date_plus_period_of_time;
```

```output
+--------------------------+
| DATE_PLUS_PERIOD_OF_TIME |
|--------------------------|
| 2024-01-02 09:09:10.777  |
+--------------------------+
```

The following example uses the [SYSTEM$TYPEOF](functions/system_typeof.md) function to show that an INTERVAL DAY TO SECOND value
is returned when a query subtracts two timestamp values:

```sqlexample
SELECT SYSTEM$TYPEOF(TO_TIMESTAMP('2025-10-05 01:02:03') - TO_TIMESTAMP('2025-09-15 11:36:22'))
  AS type;
```

```output
+------------------------------------+
| TYPE                               |
|------------------------------------|
| INTERVAL DAY(9) TO SECOND(9)[SB16] |
+------------------------------------+
```

To view the results of the query in interval format, you can cast the expression to the INTERVAL DAY(2) TO SECOND(2)
data type to specify precision, and then cast to VARCHAR:

```sqlexample
SELECT (TO_TIMESTAMP('2025-10-05 01:02:03') - TO_TIMESTAMP('2025-09-15 11:36:22'))::INTERVAL DAY(2) TO SECOND(2)::VARCHAR
  AS interval_format_result;
```

```output
+------------------------+
| INTERVAL_FORMAT_RESULT |
|------------------------|
| +19 13:25:41.00        |
+------------------------+
```

##### Inserting and querying year-month interval data

Create a table that tracks candidates for open positions with an INTERVAL YEAR TO MONTH column, and insert data:

```sqlexample
CREATE OR REPLACE TABLE candidates (
  name_first VARCHAR,
  name_last VARCHAR,
  duration_of_experience INTERVAL YEAR(2) TO MONTH);

INSERT INTO candidates VALUES ('Jane', 'Smith', '14-4');
INSERT INTO candidates VALUES ('Robert', 'Adams', '0-3');
INSERT INTO candidates VALUES ('Mary', 'Jones', '5-11');
```

When you query the table without casting the `` duration_of_experience` `` column to a data type, the output shows the
column values as the total number of months in each row:

```sqlexample
SELECT name_first,
       name_last,
       duration_of_experience AS months_of_experience
  FROM candidates;
```

```output
+------------+-----------+----------------------+
| NAME_FIRST | NAME_LAST | MONTHS_OF_EXPERIENCE |
|------------+-----------+----------------------|
| Jane       | Smith     |                  172 |
| Robert     | Adams     |                    3 |
| Mary       | Jones     |                   71 |
+------------+-----------+----------------------+
```

When you query the table and cast the `duration_of_experience` column to the VARCHAR data type, the output shows the
column values in interval format:

```sqlexample
SELECT name_first,
       name_last,
       duration_of_experience::VARCHAR AS duration_of_experience
  FROM candidates;
```

```output
+------------+-----------+------------------------+
| NAME_FIRST | NAME_LAST | DURATION_OF_EXPERIENCE |
|------------+-----------+------------------------|
| Jane       | Smith     | +14-04                 |
| Robert     | Adams     | +0-03                  |
| Mary       | Jones     | +5-11                  |
+------------+-----------+------------------------+
```

##### Inserting and querying day-time interval data

Create a table that specifies the timeout duration for various software features with an
INTERVAL HOUR TO SECOND column, and insert data:

```sqlexample
CREATE OR REPLACE TABLE feature_timeouts (
  feature VARCHAR,
  timeout_duration INTERVAL HOUR(2) TO SECOND(0));

INSERT INTO feature_timeouts VALUES ('Feature1', '00:00:30');
INSERT INTO feature_timeouts VALUES ('Feature2', '00:10:00');
INSERT INTO feature_timeouts VALUES ('Feature3', '01:00:00');
```

Query the table and cast the `timeout_duration` column to the VARCHAR data type:

```sqlexample
SELECT feature,
       timeout_duration::VARCHAR AS timeout_duration
  FROM feature_timeouts;
```

```output
+----------+------------------+
| FEATURE  | TIMEOUT_DURATION |
|----------+------------------|
| Feature1 | +0:00:30         |
| Feature2 | +0:10:00         |
| Feature3 | +1:00:00         |
+----------+------------------+
```

##### Copying interval data into a table and querying the table

Complete the following steps to stage a file with interval data, and then copy the file into a table:

1. In a file on your file system, copy the following content:

   ```output
   1,1-2,28 16:15:14.0
   2,-3-2,-54 16:15:14.123
   ```

   This example assumes that the file is named `interval_values.csv` in the `/examples/intervals/` directory.
2. Create a stage:

   ```sqlexample
   CREATE STAGE interval_stage;
   ```
3. In the internal staging location, stage the file:

   ```sqlexample
   PUT file:///examples/intervals/interval_values.csv @~/interval_stage
     AUTO_COMPRESS=false;
   ```
4. Create a table for the data:

   ```sqlexample
   CREATE OR REPLACE TABLE sample_interval_values(
     c1 STRING,
     c2 INTERVAL YEAR(1) TO MONTH,
     c3 INTERVAL DAY(2) TO SECOND(3));
   ```
5. To load the staged file into the table that you created, use the [COPY INTO <table>](sql/copy-into-table.md) command:

   ```sqlexample
   COPY INTO sample_interval_values FROM @~/interval_stage;
   ```
6. To view the loaded data, query the table, and cast to the VARCHAR type to view the loaded data:

   ```sqlexample
   SELECT c1,
          c2::VARCHAR AS YEAR_TO_MONTH,
          c3::VARCHAR AS DAY_TO_SECOND,
     FROM sample_interval_values;
   ```

   ```output
   +----+---------------+------------------+
   | C1 | YEAR_TO_MONTH | DAY_TO_SECOND    |
   |----+---------------+------------------|
   | 1  | +1-02         | +28 16:15:14.000 |
   | 2  | -3-02         | -54 16:15:14.123 |
   +----+---------------+------------------+
   ```

#### Limitations for interval data types

The following limitations apply to interval data types:

* Year-month interval values can’t be combined or compared with day-time interval values.
* Interval constants and values of interval data type can’t be combined or compared.
* Interval constants can’t be inserted into a column that has an interval data type.
* [VARIANT](data-types-semistructured.md) values can’t contain interval values.
* [Structured data type](data-types-structured.md) values can’t contain interval values.
* Interval expressions can’t be used in [user-defined functions (UDFs)](../developer-guide/udf/udf-overview.md)
  or [Snowflake Scripting](../developer-guide/snowflake-scripting/index.md).
* The following types of tables can’t have interval columns:

  + [Clustered tables](../user-guide/tables-clustering-keys.md)
  + [Dynamic tables](../user-guide/dynamic-tables-about.md)
  + [Apache Iceberg™ tables](../user-guide/tables-iceberg.md)
* Queries on interval columns can’t benefit from the [Search optimization service](../user-guide/search-optimization-service.md).

### TIME

Snowflake supports a single TIME data type for storing times in the form of `HH:MI:SS`.

TIME supports an optional precision parameter for fractional seconds (for example, `TIME(3)`).
Time precision can range from 0 (seconds) to 9 (nanoseconds). The default precision is 9.

All TIME values must be between `00:00:00` and `23:59:59.999999999`. TIME internally stores “wallclock” time, and all operations on TIME values are performed
without taking any time zone into consideration.

### TIMESTAMP_LTZ , TIMESTAMP_NTZ , TIMESTAMP_TZ

Snowflake supports three variations of timestamp:

TIMESTAMP_LTZ:
:   TIMESTAMP_LTZ internally stores UTC values with a specified precision. However, all operations are performed in the current session’s time zone, controlled by the
    [TIMEZONE](parameters.md) session parameter.

    Synonymous with TIMESTAMP_LTZ:

    * TIMESTAMPLTZ
    * TIMESTAMP WITH LOCAL TIME ZONE

TIMESTAMP_NTZ:
:   TIMESTAMP_NTZ internally stores “wallclock” time with a specified precision. All operations are performed without taking any time zone into account.

    If the output format contains a time zone, the UTC indicator (`Z`) is displayed.

    TIMESTAMP_NTZ is the default for TIMESTAMP.

    Synonymous with TIMESTAMP_NTZ:

    * TIMESTAMPNTZ
    * TIMESTAMP WITHOUT TIME ZONE
    * DATETIME

TIMESTAMP_TZ:
:   TIMESTAMP_TZ internally stores UTC values together with an associated *time zone offset*. When a time zone isn’t provided, the session time zone offset is used. All
    operations are performed with the time zone offset specific to each record.

    Synonymous with TIMESTAMP_TZ:

    * TIMESTAMPTZ
    * TIMESTAMP WITH TIME ZONE

    TIMESTAMP_TZ values are compared based on their times in UTC. For example, the following comparison between
    different times in different timezones returns TRUE because the two values have equivalent times in UTC.

    ```sqlexample
    SELECT '2024-01-01 00:00:00 +0000'::TIMESTAMP_TZ = '2024-01-01 01:00:00 +0100'::TIMESTAMP_TZ;
    ```

> **Attention:**
>
> TIMESTAMP_TZ currently only stores the *offset* of a given time zone, not the actual *time zone*, at the moment of creation for a given value. This is especially
> important for daylight saving time, which is not utilized by UTC.
>
> For example, with the [TIMEZONE](parameters.md) parameter set to `"America/Los_Angeles"`, converting a value to TIMESTAMP_TZ in January of a given year stores the
> time zone offset of `-0800`. If six months are later added to the value, the `-0800` offset is retained, even though in July the offset for Los Angeles is
> `-0700`. This is because, after the value is created, the actual time zone information (`"America/Los_Angeles"`) is no longer available. The following code
> sample illustrates this behavior:
>
> ```sqlexample
> SELECT '2024-01-01 12:00:00'::TIMESTAMP_TZ;
> ```
>
> ```output
> +-------------------------------------+
> | '2024-01-01 12:00:00'::TIMESTAMP_TZ |
> |-------------------------------------|
> | 2024-01-01 12:00:00.000 -0800       |
> +-------------------------------------+
> ```
>
> ```sqlexample
> SELECT DATEADD(MONTH, 6, '2024-01-01 12:00:00'::TIMESTAMP_TZ);
> ```
>
> ```output
> +--------------------------------------------------------+
> | DATEADD(MONTH, 6, '2024-01-01 12:00:00'::TIMESTAMP_TZ) |
> |--------------------------------------------------------|
> | 2024-07-01 12:00:00.000 -0800                          |
> +--------------------------------------------------------+
> ```

#### TIMESTAMP

TIMESTAMP in Snowflake is a user-specified alias associated with one of the TIMESTAMP_\* variations. In all operations where TIMESTAMP is used, the associated TIMESTAMP_\*
variation is automatically used. The TIMESTAMP data type is never stored in tables.

The TIMESTAMP_\* variation associated with TIMESTAMP is specified by the [TIMESTAMP_TYPE_MAPPING](parameters.md) session parameter. The default is TIMESTAMP_NTZ.

All timestamp variations, as well as the TIMESTAMP alias, support an optional precision parameter for fractional
seconds (for example, `TIMESTAMP(3)`). Timestamp precision can range from 0 (seconds) to 9 (nanoseconds). The default precision is 9.

#### Timestamp examples

These examples create a table using different timestamps.

First, create a table with a TIMESTAMP column (mapped to TIMESTAMP_NTZ):

```sqlexample
ALTER SESSION SET TIMESTAMP_TYPE_MAPPING = TIMESTAMP_NTZ;

CREATE OR REPLACE TABLE ts_test(ts TIMESTAMP);

DESC TABLE ts_test;
```

```output
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type             | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| TS   | TIMESTAMP_NTZ(9) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

Next, explicitly use one of the TIMESTAMP variations (TIMESTAMP_LTZ):

```sqlexample
CREATE OR REPLACE TABLE ts_test(ts TIMESTAMP_LTZ);

DESC TABLE ts_test;
```

```output
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type             | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| TS   | TIMESTAMP_LTZ(9) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

Use TIMESTAMP_LTZ with different time zones:

```sqlexample
CREATE OR REPLACE TABLE ts_test(ts TIMESTAMP_LTZ);

ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';

INSERT INTO ts_test VALUES('2024-01-01 16:00:00');
INSERT INTO ts_test VALUES('2024-01-02 16:00:00 +00:00');
```

This query shows that the time for January 2nd is 08:00 in Los Angeles (which is 16:00 in UTC):

```sqlexample
SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------------+----------+
| TS                            | HOUR(TS) |
|-------------------------------+----------|
| 2024-01-01 16:00:00.000 -0800 |       16 |
| 2024-01-02 08:00:00.000 -0800 |        8 |
+-------------------------------+----------+
```

Next, note that the times change with a different time zone:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/New_York';

SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------------+----------+
| TS                            | HOUR(TS) |
|-------------------------------+----------|
| 2024-01-01 19:00:00.000 -0500 |       19 |
| 2024-01-02 11:00:00.000 -0500 |       11 |
+-------------------------------+----------+
```

Create a table and use TIMESTAMP_NTZ:

```sqlexample
CREATE OR REPLACE TABLE ts_test(ts TIMESTAMP_NTZ);

ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';

INSERT INTO ts_test VALUES('2024-01-01 16:00:00');
INSERT INTO ts_test VALUES('2024-01-02 16:00:00 +00:00');
```

Note that both times from different time zones are converted to the same “wallclock” time:

```sqlexample
SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------+----------+
| TS                      | HOUR(TS) |
|-------------------------+----------|
| 2024-01-01 16:00:00.000 |       16 |
| 2024-01-02 16:00:00.000 |       16 |
+-------------------------+----------+
```

Next, note that changing the session time zone doesn’t affect the results:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/New_York';

SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------+----------+
| TS                      | HOUR(TS) |
|-------------------------+----------|
| 2024-01-01 16:00:00.000 |       16 |
| 2024-01-02 16:00:00.000 |       16 |
+-------------------------+----------+
```

Create a table and use TIMESTAMP_TZ:

```sqlexample
CREATE OR REPLACE TABLE ts_test(ts TIMESTAMP_TZ);

ALTER SESSION SET TIMEZONE = 'America/Los_Angeles';

INSERT INTO ts_test VALUES('2024-01-01 16:00:00');
INSERT INTO ts_test VALUES('2024-01-02 16:00:00 +00:00');
```

Note that the January 1st record inherited the session time zone,
and `America/Los_Angeles` was converted to a numeric time zone offset:

```sqlexample
SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------------+----------+
| TS                            | HOUR(TS) |
|-------------------------------+----------|
| 2024-01-01 16:00:00.000 -0800 |       16 |
| 2024-01-02 16:00:00.000 +0000 |       16 |
+-------------------------------+----------+
```

Next, note that changing the session time zone doesn’t affect the results:

```sqlexample
ALTER SESSION SET TIMEZONE = 'America/New_York';

SELECT ts, HOUR(ts) FROM ts_test;
```

```output
+-------------------------------+----------+
| TS                            | HOUR(TS) |
|-------------------------------+----------|
| 2024-01-01 16:00:00.000 -0800 |       16 |
| 2024-01-02 16:00:00.000 +0000 |       16 |
+-------------------------------+----------+
```

## Supported calendar

Snowflake uses the Gregorian Calendar for all dates and timestamps. The Gregorian Calendar starts in the year 1582, but recognizes prior years, which is important to note
because Snowflake does not adjust dates prior to 1582 (or calculations involving dates prior to 1582) to match the Julian Calendar. The `UUUU` format element
supports negative years.

## Date and time formats

All of these data types accept most non-ambiguous date, time, or date + time formats. See
[Supported formats for AUTO detection](date-time-input-output.md) for the formats that Snowflake recognizes when
[configured to detect the format automatically](date-time-input-output.md).

You can also
[specify the date and time format manually](date-time-input-output.md). When specifying the
format, you can use the case-insensitive elements listed in the following table:

| Format element | Description |
| --- | --- |
| `YYYY` | Four-digit [1] year. |
| `YY` | Two-digit [1] year, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when set to `1980`, values of `79` and `80` are parsed as `2079` and `1980`, respectively. |
| `Y` | One-digit or two-digit [2] year without leading zeros, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when the parameter set to `1990`, values of `2005` and `1991` are serialized as `5` and `91`, respectively. |
| `MM` | Two-digit [1] month (`01` = January, and so on). |
| `MO` | One-digit or two-digit [2] month without leading zeros (`1` = January, and so on). |
| `MON` | Abbreviated month name [3]. |
| `MMMM` | Full month name [3]. |
| `DD` | Two-digit [1] day of month (`01` through `31`). |
| `D` | One-digit or two-digit [2] day of month without leading zeros (`1` through `31`). |
| `DY` | Abbreviated day of week. |
| `HH24` | Two digits [1] for hour (`00` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `HH12` | Two digits [1] for hour (`01` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `H24` | One or two digits [2] for hour without leading zeros (`0` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `H12` | One or two digits [2] for hour without leading zeros (`1` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `AM` , `PM` | Ante meridiem (`AM`) / post meridiem (`PM`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `P` | Ante meridiem (`A`) / post meridiem (`P`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `HH` | Synonym for `HH24`. |
| `H` | Synonym for `H24`. |
| `MI` | Two digits [1] for minute (`00` through `59`). |
| `ME` | One or two digits [2] for minute without leading zeros (`0` through `59`). |
| `SS` | Two digits [1] for second (`00` through `59`). |
| `S` | One or two digits [2] for second without leading zeros (`0` through `59`). |
| `FF[0-9]` | Fractional seconds with precision `0` (seconds) to `9` (nanoseconds), e.g. `FF`, `FF0`, `FF3`, `FF9`. Specifying `FF` is equivalent to `FF9` (nanoseconds). |
| `TZH:TZM` , `TZHTZM` , `TZH` | Two-digit [1] time zone hour and minute, offset from UTC. Can be prefixed by `+`/`-` for sign. |
| `UUUU` | Four-digit year in [ISO format](https://en.wikipedia.org/wiki/ISO_8601), which are negative for BCE years. |

[1] The number of digits describes the output produced when serializing values to text. When parsing text, Snowflake accepts up to the specified number of digits. For example, a day number can be one or two digits.

[2] The number of digits describes the output produced when serializing values to text. Parsing isn’t supported. If parsing is required, use an equivalent format that includes leading zeros. These format elements will be enabled in BCR bundle 2026_03.

[3] For the MON format element, the output produced when serializing values to text is the abbreviated month name. For the MMMM format element, the output produced when serializing values to text is the full month name. When parsing text, Snowflake accepts the three-digit abbreviation or the full month name for both MON and MMMM. For example, “January” or “Jan”, “February” or “Feb”, and so on are accepted when parsing text.

> **Note:**
>
> * When a date-only format is used, the associated time is assumed to be midnight on that day.
> * Anything in the format between double quotes or other than the above elements is parsed/formatted without being interpreted.
>   Snowflake recommends always enclosing literal characters in double quotes
>   (for example, `"T"`, `"EST"`, `"Z"`) to ensure they are treated as literals.
> * For more details about valid ranges, number of digits, and best practices, see
>   [Additional information about using date, time, and timestamp formats](date-time-input-output.md).

### Examples of using date and time formats

The following example uses `FF` to indicate that the output has 9 digits in the fractional seconds field:

```sqlexample
CREATE OR REPLACE TABLE timestamp_demo_table(
  tstmp TIMESTAMP,
  tstmp_tz TIMESTAMP_TZ,
  tstmp_ntz TIMESTAMP_NTZ,
  tstmp_ltz TIMESTAMP_LTZ);
INSERT INTO timestamp_demo_table (tstmp, tstmp_tz, tstmp_ntz, tstmp_ltz) VALUES (
  '2024-03-12 01:02:03.123456789',
  '2024-03-12 01:02:03.123456789',
  '2024-03-12 01:02:03.123456789',
  '2024-03-12 01:02:03.123456789');
```

```sqlexample
ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
ALTER SESSION SET TIMESTAMP_TZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
ALTER SESSION SET TIMESTAMP_LTZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF';
```

```sqlexample
SELECT tstmp, tstmp_tz, tstmp_ntz, tstmp_ltz
  FROM timestamp_demo_table;
```

```output
+-------------------------------+-------------------------------+-------------------------------+-------------------------------+
| TSTMP                         | TSTMP_TZ                      | TSTMP_NTZ                     | TSTMP_LTZ                     |
|-------------------------------+-------------------------------+-------------------------------+-------------------------------|
| 2024-03-12 01:02:03.123456789 | 2024-03-12 01:02:03.123456789 | 2024-03-12 01:02:03.123456789 | 2024-03-12 01:02:03.123456789 |
+-------------------------------+-------------------------------+-------------------------------+-------------------------------+
```

## Date and time constants

*Constants* (also known as *literals*) are fixed data values. Snowflake supports using string constants to specify fixed date, time, or timestamp values. String
constants must always be enclosed between delimiter characters. Snowflake supports using single quotes to delimit string constants.

For example:

```sqlexample
DATE '2024-08-14'
TIME '10:03:56'
TIMESTAMP '2024-08-15 10:59:43'
```

The string is parsed as a DATE, TIME, or TIMESTAMP value based on the input format for the data type, as set through the following parameters:

DATE:
:   [DATE_INPUT_FORMAT](parameters.md)

TIME:
:   [TIME_INPUT_FORMAT](parameters.md)

TIMESTAMP:
:   [TIMESTAMP_INPUT_FORMAT](parameters.md)

For example, to insert a specific date into a column in a table:

```sqlexample
CREATE TABLE t1 (d1 DATE);

INSERT INTO t1 (d1) VALUES (DATE '2024-08-15');
```

## Interval constants

You can use interval constants to add or subtract a period of time to or from a date, time, or timestamp. Interval constants are implemented
using the INTERVAL keyword, which has the following syntax:

```sqlsyntax
{ + | - } INTERVAL '<integer> [ <date_time_part> ] [ , <integer> [ <date_time_part> ] ... ]'
```

As with all string constants, Snowflake requires single quotes to delimit interval constants.

> **Note:**
>
> Interval constants support date and time arithmetic, but they don’t support interval storage as a column type.
> To store interval values in a column, you can use interval data types.

The INTERVAL keyword supports one or more integers and, optionally, one or more date or time parts. For example:

* `INTERVAL '1 year'` represents one year.
* `INTERVAL '4 years, 5 months, 3 hours'` represents four years, five months, and three hours.

If a date or time part isn’t specified, the interval represents seconds (for example, `INTERVAL '2'` is the same
as `INTERVAL '2 seconds'`). Note that this is different from the default unit of time for performing date arithmetic.
For more details, see Simple arithmetic for dates.

For the list of supported date and time parts, see Supported Date and Time Parts for Intervals.

> **Note:**
>
> * The order of interval increments is important. The increments are added or subtracted in the order listed. For example:
>
>   > + `INTERVAL '1 year, 1 day'` first adds or subtracts a year and then a day.
>   > + `INTERVAL '1 day, 1 year'` first adds or subtracts a day and then a year.
>   >
>   > Ordering differences can affect calculations influenced by calendar events, such as leap years:
>   >
>   > ```sqlexample
>   > SELECT TO_DATE ('2019-02-28') + INTERVAL '1 day, 1 year';
>   > ```
>   >
>   > ```output
>   > +---------------------------------------------------+
>   > | TO_DATE ('2019-02-28') + INTERVAL '1 DAY, 1 YEAR' |
>   > |---------------------------------------------------|
>   > | 2020-03-01                                        |
>   > +---------------------------------------------------+
>   > ```
>   >
>   > ```sqlexample
>   > SELECT TO_DATE ('2019-02-28') + INTERVAL '1 year, 1 day';
>   > ```
>   >
>   > ```output
>   > +---------------------------------------------------+
>   > | TO_DATE ('2019-02-28') + INTERVAL '1 YEAR, 1 DAY' |
>   > |---------------------------------------------------|
>   > | 2020-02-29                                        |
>   > +---------------------------------------------------+
>   > ```
> * INTERVAL is not a data type (that is, you can’t define a table column to be of data type INTERVAL). Intervals can only be used in date, time, and timestamp arithmetic.
> * You can’t use an interval with a [SQL variable](session-variables.md). For example, the following query returns an error:
>
>   ```sqlexample
>   SET v1 = '1 year';
>
>   SELECT TO_DATE('2023-04-15') + INTERVAL $v1;
>   ```

### Supported date and time parts for intervals

The INTERVAL keyword supports the following date and time parts as arguments (case-insensitive):

| Date or Time Part | Abbreviations / Variations |
| --- | --- |
| `year` | `y` , `yy` , `yyy` , `yyyy` , `yr` , `years` , `yrs` |
| `quarter` | `q` , `qtr` , `qtrs` , `quarters` |
| `month` | `mm` , `mon` , `mons` , `months` |
| `week` | `w` , `wk` , `weekofyear` , `woy` , `wy` , `weeks` |
| `day` | `d` , `dd` , `days`, `dayofmonth` |
| `hour` | `h` , `hh` , `hr` , `hours` , `hrs` |
| `minute` | `m` , `mi` , `min` , `minutes` , `mins` |
| `second` | `s` , `sec` , `seconds` , `secs` |
| `millisecond` | `ms` , `msec` , `milliseconds` |
| `microsecond` | `us` , `usec` , `microseconds` |
| `nanosecond` | `ns` , `nsec` , `nanosec` , `nsecond` , `nanoseconds` , `nanosecs` , `nseconds` |

### Interval examples

Add a year interval to a specific date:

```sqlexample
SELECT TO_DATE('2023-04-15') + INTERVAL '1 year';
```

```output
+-------------------------------------------+
| TO_DATE('2023-04-15') + INTERVAL '1 YEAR' |
|-------------------------------------------|
| 2024-04-15                                |
+-------------------------------------------+
```

Add an interval of 3 hours and 18 minutes to a specific time:

```sqlexample
SELECT TO_TIME('04:15:29') + INTERVAL '3 hours, 18 minutes';
```

```output
+------------------------------------------------------+
| TO_TIME('04:15:29') + INTERVAL '3 HOURS, 18 MINUTES' |
|------------------------------------------------------|
| 07:33:29                                             |
+------------------------------------------------------+
```

Add a complex interval to the output of the CURRENT_TIMESTAMP function:

```sqlexample
SELECT CURRENT_TIMESTAMP + INTERVAL
    '1 year, 3 quarters, 4 months, 5 weeks, 6 days, 7 minutes, 8 seconds,
    1000 milliseconds, 4000000 microseconds, 5000000001 nanoseconds'
  AS complex_interval1;
```

The following is sample output. The output is different when the current timestamp is different.

```output
+-------------------------------+
| COMPLEX_INTERVAL1             |
|-------------------------------|
| 2026-11-07 18:07:19.875000001 |
+-------------------------------+
```

Add a complex interval with abbreviated date/time part notation to a specific date:

```sqlexample
SELECT TO_DATE('2025-01-17') + INTERVAL
    '1 y, 3 q, 4 mm, 5 w, 6 d, 7 h, 9 m, 8 s,
    1000 ms, 445343232 us, 898498273498 ns'
  AS complex_interval2;
```

```output
+-------------------------------+
| COMPLEX_INTERVAL2             |
|-------------------------------|
| 2027-03-30 07:31:32.841505498 |
+-------------------------------+
```

Query a table of employee information and return the names of employees who were hired within the past two years and three months:

```sqlexample
SELECT name, hire_date
  FROM employees
  WHERE hire_date > CURRENT_DATE - INTERVAL '2 y, 3 month';
```

Filter a TIMESTAMP column named `ts` from a table named `t1` and add four seconds to each returned value:

```sqlexample
SELECT ts + INTERVAL '4 seconds'
  FROM t1
  WHERE ts > TO_TIMESTAMP('2024-04-05 01:02:03');
```

## Simple arithmetic for dates

In addition to using interval constants to add to and subtract from dates, times, and timestamps, you can also
add and subtract days to and from DATE values, in the form of `{ + | - }` `integer`, where `integer`
specifies the number of days to add or subtract.

> **Note:**
>
> TIME and TIMESTAMP values don’t yet support simple arithmetic.

### Date arithmetic examples

Add one day to a specific date:

```sqlexample
SELECT TO_DATE('2024-04-15') + 1;
```

```output
+---------------------------+
| TO_DATE('2024-04-15') + 1 |
|---------------------------|
| 2024-04-16                |
+---------------------------+
```

Subtract four days from a specific date:

```sqlexample
SELECT TO_DATE('2024-04-15') - 4;
```

```output
+---------------------------+
| TO_DATE('2024-04-15') - 4 |
|---------------------------|
| 2024-04-11                |
+---------------------------+
```

Query a table named `employees` and return the names of people who left the company, but were employed more than 365 days:

```
SELECT name
  FROM employees
  WHERE end_date > start_date + 365;
```

---
title: Date & time functions
source: https://docs.snowflake.com/en/sql-reference/functions-date-time.md
section: SQL General Reference
---

# Date & time functions

This family of functions can be used to construct, convert, extract, or modify date, time, and timestamp data.

## List of functions

| Sub-category | Function | Notes |
| --- | --- | --- |
| **Construction** | [DATE_FROM_PARTS](functions/date_from_parts.md) |  |
| [TIME_FROM_PARTS](functions/time_from_parts.md) |  |
| [TIMESTAMP_FROM_PARTS](functions/timestamp_from_parts.md) |  |
| **Extraction** | [DATE_PART](functions/date_part.md) | Accepts all date and time parts (see Supported date and time parts). |
| [DAYNAME](functions/dayname.md) |  |
| [EXTRACT](functions/extract.md) | Alternative for [DATE_PART](functions/date_part.md). |
| [HOUR / MINUTE / SECOND](functions/hour-minute-second.md) | Alternative for [DATE_PART](functions/date_part.md). |
| [LAST_DAY](functions/last_day.md) | Accepts relevant date parts (see Supported date and time parts). |
| [MONTHNAME](functions/monthname.md) |  |
| [NEXT_DAY](functions/next_day.md) |  |
| [PREVIOUS_DAY](functions/previous_day.md) |  |
| [YEAR\* / DAY\* / WEEK\* / MONTH / QUARTER](functions/year.md) | Alternative for [DATE_PART](functions/date_part.md). |
| **Addition/subtraction** | [ADD_MONTHS](functions/add_months.md) |  |
| [DATEADD](functions/dateadd.md) | Accepts relevant date and time parts (see Supported date and time parts). |
| [DATEDIFF](functions/datediff.md) | Accepts relevant date and time parts (see Supported date and time parts). |
| [MONTHS_BETWEEN](functions/months_between.md) |  |
| [TIMEADD](functions/timeadd.md) | Alias for [DATEADD](functions/dateadd.md). |
| [TIMEDIFF](functions/timediff.md) | Alias for [DATEDIFF](functions/datediff.md). |
| [TIMESTAMPADD](functions/timestampadd.md) | Alias for [DATEADD](functions/dateadd.md). |
| [TIMESTAMPDIFF](functions/timestampdiff.md) | Alias for [DATEDIFF](functions/datediff.md). |
| **Truncation** | [DATE_TRUNC](functions/date_trunc.md) | Accepts relevant date and time parts (see Supported date and time parts). |
| [TIME_SLICE](functions/time_slice.md) | Allows a time to be “rounded” to the start of an evenly-spaced interval. |
| [TRUNCATE, TRUNC](functions/trunc2.md) | Alternative for [DATE_TRUNC](functions/date_trunc.md). |
| **Conversion** | [TO_DATE , DATE](functions/to_date.md) | Supports conversions based on string, timestamp, and VARIANT expressions. Supports integers for conversions based on the beginning of the Unix epoch. |
| [TO_TIME , TIME](functions/to_time.md) | Supports conversions based on string, timestamp, and VARIANT expressions. Supports integers for conversions based on the beginning of the Unix epoch. |
| [TO_TIMESTAMP / TO_TIMESTAMP_\*](functions/to_timestamp.md) | Supports conversions based on string, date, timestamp, and VARIANT expressions. Supports numeric expressions and integers for conversions based on the beginning of the Unix epoch. |
| **Time zone** | [CONVERT_TIMEZONE](functions/convert_timezone.md) |  |
| **Alerts** | [LAST_SUCCESSFUL_SCHEDULED_TIME](functions/last_successful_scheduled_time.md) |  |
| [SCHEDULED_TIME](functions/scheduled_time.md) |  |

## Output formats

Several date and time functions return date, time, and timestamp values. The following session parameters
determine the format of the output returned by these functions:

* The display format for times is determined by the [TIME_OUTPUT_FORMAT](parameters.md)
  session parameter (default `HH24:MI:SS`).
* The display format for dates is determined by the [DATE_OUTPUT_FORMAT](parameters.md)
  session parameter (default `YYYY-MM-DD`).
* The display format for timestamps is determined by the timestamp data type returned by the function.
  The following session parameters set the output format for different timestamp data types:

  + [TIMESTAMP_LTZ_OUTPUT_FORMAT](parameters.md)
  + [TIMESTAMP_NTZ_OUTPUT_FORMAT](parameters.md)
  + [TIMESTAMP_TZ_OUTPUT_FORMAT](parameters.md)
  + [TIMESTAMP_OUTPUT_FORMAT](parameters.md)

For more information, see [Date and time input and output formats](date-time-input-output.md).

## Supported date and time parts

Certain functions (as well as their appropriate aliases and alternatives) accept a date or time part as an argument. The following two
tables list the parts (case-insensitive) that you can use with these functions.

| Date parts | Abbreviations / variations | DATEADD | DATEDIFF | DATE_PART | DATE_TRUNC | LAST_DAY |
| --- | --- | --- | --- | --- | --- | --- |
| `year` | `y` , `yy` , `yyy` , `yyyy` , `yr` , `years` , `yrs` | ✔ | ✔ | ✔ | ✔ | ✔ |
| `month` | `mm` , `mon` , `mons` , `months` | ✔ | ✔ | ✔ | ✔ | ✔ |
| `day` | `d` , `dd` , `days`, `dayofmonth` | ✔ | ✔ | ✔ | ✔ |  |
| `dayofweek` [1] | `weekday` , `dow` , `dw` |  |  | ✔ |  |  |
| `dayofweek_iso` [2] | `dayofweekiso` , `weekday_iso` , `dow_iso` , `dw_iso` |  |  | ✔ |  |  |
| `dayofyear` | `yearday` , `doy` , `dy` |  |  | ✔ |  |  |
| `week` [1] | `w` , `wk` , `weekofyear` , `woy` , `wy` | ✔ | ✔ | ✔ | ✔ | ✔ |
| `week_iso` [2] | `weekiso` , `weekofyeariso` , `weekofyear_iso` |  |  | ✔ |  |  |
| `quarter` | `q` , `qtr` , `qtrs` , `quarters` | ✔ | ✔ | ✔ | ✔ | ✔ |
| `yearofweek` [1] |  |  |  | ✔ |  |  |
| `yearofweekiso` [2] |  |  |  | ✔ |  |  |

[1] For usage details, see the next section, which describes how Snowflake handles calendar weeks and weekdays.

[2] Not controlled by the WEEK_START and WEEK_OF_YEAR_POLICY session parameters, as described in the next section.

| Time Parts | Abbreviations / Variations | DATEADD | DATEDIFF | DATE_PART | DATE_TRUNC | LAST_DAY |
| --- | --- | --- | --- | --- | --- | --- |
| `hour` | `h` , `hh` , `hr` , `hours` , `hrs` | ✔ | ✔ | ✔ | ✔ |  |
| `minute` | `m` , `mi` , `min` , `minutes` , `mins` | ✔ | ✔ | ✔ | ✔ |  |
| `second` | `s` , `sec` , `seconds` , `secs` | ✔ | ✔ | ✔ | ✔ |  |
| `millisecond` | `ms` , `msec` , `milliseconds` | ✔ | ✔ |  | ✔ |  |
| `microsecond` | `us` , `usec` , `microseconds` | ✔ | ✔ |  | ✔ |  |
| `nanosecond` | `ns` , `nsec` , `nanosec` , `nsecond` , `nanoseconds` , `nanosecs` , `nseconds` | ✔ | ✔ | ✔ | ✔ |  |
| `epoch_second` | `epoch` , `epoch_seconds` |  |  | ✔ |  |  |
| `epoch_millisecond` | `epoch_milliseconds` |  |  | ✔ |  |  |
| `epoch_microsecond` | `epoch_microseconds` |  |  | ✔ |  |  |
| `epoch_nanosecond` | `epoch_nanoseconds` |  |  | ✔ |  |  |
| `timezone_hour` | `tzh` |  |  | ✔ |  |  |
| `timezone_minute` | `tzm` |  |  | ✔ |  |  |

## Calendar weeks and weekdays

The behavior of week-related functions in Snowflake is controlled by the [WEEK_START](parameters.md) and [WEEK_OF_YEAR_POLICY](parameters.md) session parameters. An important aspect of understanding how these
parameters interact is the concept of ISO weeks.

### ISO weeks

As defined in the [ISO 8601](https://en.wikipedia.org/wiki/ISO_8601) standard (for dates and time formats), ISO weeks always start on Monday and “belong” to the year that contains the Thursday of
that week. This means that a day in one year might belong to a week in a different year:

* For days in early January, the WOY (week of the year) value can be 52 or 53 (i.e. the day belongs to the last week in the previous year).
* For days in late December, the WOY value can be 1 (i.e. the day belongs to the first week in the next year).

Snowflake provides a special set of week-related date functions (and equivalent data parts) whose behavior is consistent with the ISO week semantics:
[DAYOFWEEKISO, WEEKISO, and YEAROFWEEKISO](functions/year.md).

These functions (and date parts) disregard the session parameters (i.e. they always follow the ISO semantics).

For details about how the other week-related date functions are handled, see the following sections:

* First day of the week
* First and last weeks of the year
* Examples

### First day of the week

Most week-related functions are controlled only by the [WEEK_START](parameters.md) session parameter. The function results differ depending on how this parameter is set:

| Function | Parameter set to `0` (default / legacy behavior) | Parameter set to `1` - `7` (Monday - Sunday) |
| --- | --- | --- |
| [DAYOFWEEK](functions/year.md) | Returns `0` (Sunday) to `6` (Saturday). | Returns `1` (defined first day of the week) to `7` (last day of the week relative to the defined first day). |
| [DATE_TRUNC](functions/date_trunc.md) (with a `WEEK` part) | Truncates the input week to start on Monday. | Truncates the input week to start on the defined first day of the week. |
| [LAST_DAY](functions/last_day.md) (with a `WEEK` part) | Returns the Sunday of the input week. | Returns the last day of the input week relative to the defined first day of the week. |
| [DATEDIFF](functions/datediff.md) (with a `WEEK` part) | Calculated using weeks starting on Monday. | Calculated using weeks starting on the defined first day of the week. |

> **Tip:**
>
> The default value for the parameter is `0`, which preserves the legacy Snowflake behavior (ISO-like semantics).
> However, we recommend changing this value to explicitly control the resulting behavior of the functions. The most common
> scenario is to set the parameter to `1`.

### First and last weeks of the year

The [WEEK_OF_YEAR_POLICY](parameters.md) session parameter controls how the [WEEK and YEAROFWEEK](functions/year.md) functions behave.
The parameter can have two values:

* `0`: The affected week-related functions use semantics similar to the ISO semantics, in which a week belongs to a given year
  if at least 4 days of that week are in that year. This means that all the weeks have 7 days, but the first days of January and the
  last days of December might belong to a week in a different year. For this reason, both the [YEAROFWEEK and YEAROFWEEKISO](functions/year.md)
  functions can provide the year that the week belongs to.
* `1`: January 1 always starts the first week of the year, and December 31 is always in the last week of the year. This means
  that the first week and last week in the year might have fewer than 7 days.

This behavior is also influenced by the start day of the week, as controlled by the value set for the [WEEK_START](parameters.md) session parameter:

* `0` or `1`: The behavior is equivalent to the ISO week semantics, with the week starting on Monday.
* `2` to `7`: The “4 days” logic is preserved, but the first day of the week is different.

> **Tip:**
>
> The default value for both parameters is `0`, which preserves the legacy Snowflake behavior (ISO-like semantics). However,
> we recommend changing these values to explicitly control the resulting behavior of the functions. The most common scenario is
> to set both parameters to `1`.

### Examples

These examples query the same set of date functions, but with different values set for the [WEEK_OF_YEAR_POLICY](parameters.md)
and [WEEK_START](parameters.md) session parameters to illustrate how they influence the results of the functions.

The examples use the following data:

```sqlexample
CREATE OR REPLACE TABLE week_examples (d DATE);

INSERT INTO week_examples VALUES
  ('2016-12-30'),
  ('2016-12-31'),
  ('2017-01-01'),
  ('2017-01-02'),
  ('2017-01-03'),
  ('2017-01-04'),
  ('2017-01-05'),
  ('2017-12-30'),
  ('2017-12-31');
```

#### Controlling the first day of the week

Setting WEEK_START to `0` (legacy behavior) or `1` (Monday) does not have a significant effect, as illustrated in the following two examples:

```sqlexample
ALTER SESSION SET WEEK_START = 0;

SELECT d "Date",
       DAYNAME(d) "Day",
       DAYOFWEEK(d) "DOW",
       DATE_TRUNC('week', d) "Trunc Date",
       DAYNAME("Trunc Date") "Trunc Day",
       LAST_DAY(d, 'week') "Last DOW Date",
       DAYNAME("Last DOW Date") "Last DOW Day",
       DATEDIFF('week', '2017-01-01', d) "Weeks Diff from 2017-01-01 to Date"
  FROM week_examples;
```

```output
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
| Date       | Day | DOW | Trunc Date | Trunc Day | Last DOW Date | Last DOW Day | Weeks Diff from 2017-01-01 to Date |
|------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------|
| 2016-12-30 | Fri |   5 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2016-12-31 | Sat |   6 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2017-01-01 | Sun |   0 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2017-01-02 | Mon |   1 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-03 | Tue |   2 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-04 | Wed |   3 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-05 | Thu |   4 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-12-30 | Sat |   6 | 2017-12-25 | Mon       | 2017-12-31    | Sun          |                                 52 |
| 2017-12-31 | Sun |   0 | 2017-12-25 | Mon       | 2017-12-31    | Sun          |                                 52 |
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
```

```sqlexample
ALTER SESSION SET WEEK_START = 1;

SELECT d "Date",
       DAYNAME(d) "Day",
       DAYOFWEEK(d) "DOW",
       DATE_TRUNC('week', d) "Trunc Date",
       DAYNAME("Trunc Date") "Trunc Day",
       LAST_DAY(d, 'week') "Last DOW Date",
       DAYNAME("Last DOW Date") "Last DOW Day",
       DATEDIFF('week', '2017-01-01', d) "Weeks Diff from 2017-01-01 to Date"
  FROM week_examples;
```

```output
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
| Date       | Day | DOW | Trunc Date | Trunc Day | Last DOW Date | Last DOW Day | Weeks Diff from 2017-01-01 to Date |
|------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------|
| 2016-12-30 | Fri |   5 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2016-12-31 | Sat |   6 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2017-01-01 | Sun |   7 | 2016-12-26 | Mon       | 2017-01-01    | Sun          |                                  0 |
| 2017-01-02 | Mon |   1 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-03 | Tue |   2 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-04 | Wed |   3 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-01-05 | Thu |   4 | 2017-01-02 | Mon       | 2017-01-08    | Sun          |                                  1 |
| 2017-12-30 | Sat |   6 | 2017-12-25 | Mon       | 2017-12-31    | Sun          |                                 52 |
| 2017-12-31 | Sun |   7 | 2017-12-25 | Mon       | 2017-12-31    | Sun          |                                 52 |
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
```

* With WEEK_START set to `0`, the DOW for Sunday is `0`.
* With WEEK_START set to `1`, the DOW for Sunday is `7`.

The results differ more significantly if WEEK_START is set to any day other than Monday. For example, setting the parameter to `3` (Wednesday) changes the results of all the week-related functions (columns 3 through 8):

```sqlexample
ALTER SESSION SET WEEK_START = 3;

SELECT d "Date",
       DAYNAME(d) "Day",
       DAYOFWEEK(d) "DOW",
       DATE_TRUNC('week', d) "Trunc Date",
       DAYNAME("Trunc Date") "Trunc Day",
       LAST_DAY(d, 'week') "Last DOW Date",
       DAYNAME("Last DOW Date") "Last DOW Day",
       DATEDIFF('week', '2017-01-01', d) "Weeks Diff from 2017-01-01 to Date"
  FROM week_examples;
```

```output
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
| Date       | Day | DOW | Trunc Date | Trunc Day | Last DOW Date | Last DOW Day | Weeks Diff from 2017-01-01 to Date |
|------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------|
| 2016-12-30 | Fri |   3 | 2016-12-28 | Wed       | 2017-01-03    | Tue          |                                  0 |
| 2016-12-31 | Sat |   4 | 2016-12-28 | Wed       | 2017-01-03    | Tue          |                                  0 |
| 2017-01-01 | Sun |   5 | 2016-12-28 | Wed       | 2017-01-03    | Tue          |                                  0 |
| 2017-01-02 | Mon |   6 | 2016-12-28 | Wed       | 2017-01-03    | Tue          |                                  0 |
| 2017-01-03 | Tue |   7 | 2016-12-28 | Wed       | 2017-01-03    | Tue          |                                  0 |
| 2017-01-04 | Wed |   1 | 2017-01-04 | Wed       | 2017-01-10    | Tue          |                                  1 |
| 2017-01-05 | Thu |   2 | 2017-01-04 | Wed       | 2017-01-10    | Tue          |                                  1 |
| 2017-12-30 | Sat |   4 | 2017-12-27 | Wed       | 2018-01-02    | Tue          |                                 52 |
| 2017-12-31 | Sun |   5 | 2017-12-27 | Wed       | 2018-01-02    | Tue          |                                 52 |
+------------+-----+-----+------------+-----------+---------------+--------------+------------------------------------+
```

#### Controlling the year and days for the first/last weeks of the year

The following example sets both parameters to `0` to follow ISO-like semantics (i.e. week starts on Monday and all weeks have 7 days):

```sqlexample
ALTER SESSION SET WEEK_OF_YEAR_POLICY=0, WEEK_START=0;

SELECT d "Date",
       DAYNAME(d) "Day",
       WEEK(d) "WOY",
       WEEKISO(d) "WOY (ISO)",
       YEAROFWEEK(d) "YOW",
       YEAROFWEEKISO(d) "YOW (ISO)"
  FROM week_examples;
```

```output
+------------+-----+-----+-----------+------+-----------+
| Date       | Day | WOY | WOY (ISO) |  YOW | YOW (ISO) |
|------------+-----+-----+-----------+------+-----------|
| 2016-12-30 | Fri |  52 |        52 | 2016 |      2016 |
| 2016-12-31 | Sat |  52 |        52 | 2016 |      2016 |
| 2017-01-01 | Sun |  52 |        52 | 2016 |      2016 |
| 2017-01-02 | Mon |   1 |         1 | 2017 |      2017 |
| 2017-01-03 | Tue |   1 |         1 | 2017 |      2017 |
| 2017-01-04 | Wed |   1 |         1 | 2017 |      2017 |
| 2017-01-05 | Thu |   1 |         1 | 2017 |      2017 |
| 2017-12-30 | Sat |  52 |        52 | 2017 |      2017 |
| 2017-12-31 | Sun |  52 |        52 | 2017 |      2017 |
+------------+-----+-----+-----------+------+-----------+
```

The next example illustrates the effect of keeping WEEK_OF_YEAR_POLICY set to `0`, but changing WEEK_START to `3` (Wednesday):

```sqlexample
ALTER SESSION SET WEEK_OF_YEAR_POLICY=0, WEEK_START=3;

SELECT d "Date",
       DAYNAME(d) "Day",
       WEEK(d) "WOY",
       WEEKISO(d) "WOY (ISO)",
       YEAROFWEEK(d) "YOW",
       YEAROFWEEKISO(d) "YOW (ISO)"
  FROM week_examples;
```

```output
+------------+-----+-----+-----------+------+-----------+
| Date       | Day | WOY | WOY (ISO) |  YOW | YOW (ISO) |
|------------+-----+-----+-----------+------+-----------|
| 2016-12-30 | Fri |  53 |        52 | 2016 |      2016 |
| 2016-12-31 | Sat |  53 |        52 | 2016 |      2016 |
| 2017-01-01 | Sun |  53 |        52 | 2016 |      2016 |
| 2017-01-02 | Mon |  53 |         1 | 2016 |      2017 |
| 2017-01-03 | Tue |  53 |         1 | 2016 |      2017 |
| 2017-01-04 | Wed |   1 |         1 | 2017 |      2017 |
| 2017-01-05 | Thu |   1 |         1 | 2017 |      2017 |
| 2017-12-30 | Sat |  52 |        52 | 2017 |      2017 |
| 2017-12-31 | Sun |  52 |        52 | 2017 |      2017 |
+------------+-----+-----+-----------+------+-----------+
```

* 2016 now has 53 weeks (instead of 52).
* WOY for Jan 1st, 2017 moves to week 53 (from 52).
* WOY for Jan 2nd and 3rd, 2017 moves to week 53 (from 1).
* YOW for Jan 2nd and 3rd, 2017 moves to 2016 (from 2017).
* WOY (ISO) and YOW (ISO) are not affected by the parameter change.

The last two examples set WEEK_OF_YEAR_POLICY to `1` and set WEEK_START first to `1` (Monday) and then `3` (Wednesday):

```sqlexample
ALTER SESSION SET WEEK_OF_YEAR_POLICY=1, WEEK_START=1;

SELECT d "Date",
       DAYNAME(d) "Day",
       WEEK(d) "WOY",
       WEEKISO(d) "WOY (ISO)",
       YEAROFWEEK(d) "YOW",
       YEAROFWEEKISO(d) "YOW (ISO)"
  FROM week_examples;
```

```output
+------------+-----+-----+-----------+------+-----------+
| Date       | Day | WOY | WOY (ISO) |  YOW | YOW (ISO) |
|------------+-----+-----+-----------+------+-----------|
| 2016-12-30 | Fri |  53 |        52 | 2016 |      2016 |
| 2016-12-31 | Sat |  53 |        52 | 2016 |      2016 |
| 2017-01-01 | Sun |   1 |        52 | 2017 |      2016 |
| 2017-01-02 | Mon |   2 |         1 | 2017 |      2017 |
| 2017-01-03 | Tue |   2 |         1 | 2017 |      2017 |
| 2017-01-04 | Wed |   2 |         1 | 2017 |      2017 |
| 2017-01-05 | Thu |   2 |         1 | 2017 |      2017 |
| 2017-12-30 | Sat |  53 |        52 | 2017 |      2017 |
| 2017-12-31 | Sun |  53 |        52 | 2017 |      2017 |
+------------+-----+-----+-----------+------+-----------+
```

```sqlexample
ALTER SESSION SET week_of_year_policy=1, week_start=3;

SELECT d "Date",
       DAYNAME(d) "Day",
       WEEK(d) "WOY",
       WEEKISO(d) "WOY (ISO)",
       YEAROFWEEK(d) "YOW",
       YEAROFWEEKISO(d) "YOW (ISO)"
  FROM week_examples;
```

```output
+------------+-----+-----+-----------+------+-----------+
| Date       | Day | WOY | WOY (ISO) |  YOW | YOW (ISO) |
|------------+-----+-----+-----------+------+-----------|
| 2016-12-30 | Fri |  53 |        52 | 2016 |      2016 |
| 2016-12-31 | Sat |  53 |        52 | 2016 |      2016 |
| 2017-01-01 | Sun |   1 |        52 | 2017 |      2016 |
| 2017-01-02 | Mon |   1 |         1 | 2017 |      2017 |
| 2017-01-03 | Tue |   1 |         1 | 2017 |      2017 |
| 2017-01-04 | Wed |   2 |         1 | 2017 |      2017 |
| 2017-01-05 | Thu |   2 |         1 | 2017 |      2017 |
| 2017-12-30 | Sat |  53 |        52 | 2017 |      2017 |
| 2017-12-31 | Sun |  53 |        52 | 2017 |      2017 |
+------------+-----+-----+-----------+------+-----------+
```

* With WEEK_OF_YEAR_POLICY set to `1` and WEEK_START set to `1` (Monday):

  + WOY for `2017-01-01` is `1`.
  + Week 1 consists of 1 day.
  + Week 2 starts on `Mon`.

  This usage scenario is generally the most common.
* With WEEK_OF_YEAR_POLICY set to `1` and WEEK_START set to `3` (Wednesday):

  + WOY for 2017-01-01 is still `1`.
  + Week 1 consists of 3 days.
  + Week 2 starts on `Wed`.

In both examples, WOY (ISO) and YOW (ISO) are not affected by the parameter change.

---
title: Date and time input and output formats
source: https://docs.snowflake.com/en/sql-reference/date-time-input-output.md
section: SQL General Reference
---

# Date and time input and output formats

Date and time formats provide a method for representing dates, times, and timestamps.

## How Snowflake determines the input and output formats to use

To determine the input and output formats to use for dates, times, and timestamps, Snowflake uses:

* Session parameters for dates, times, and timestamps
* File format options for loading/unloading dates, times, and timestamps

### Session parameters for dates, times, and timestamps

A set of session parameters determines how date, time, and timestamp data is passed into and out of Snowflake,
as well as the time zone used in the time and timestamp formats that support time zones.

You can set the parameters at the account, user, and session levels. Execute the [SHOW PARAMETERS](sql/show-parameters.md)
command to view the current parameter settings that apply to all operations in the current session.

#### Input formats

The following parameters define which date, time, and timestamp formats are recognized for DML, including COPY,
INSERT, and MERGE operations:

* [DATE_INPUT_FORMAT](parameters.md)
* [TIME_INPUT_FORMAT](parameters.md)
* [TIMESTAMP_INPUT_FORMAT](parameters.md)

The default for all three parameters is AUTO. When the parameter value is set to AUTO, Snowflake attempts to match date,
time, or timestamp strings in any input expression with one of the formats listed in
Supported formats for AUTO detection:

* If a matching format is found, Snowflake accepts the string.
* If no matching format is found, Snowflake returns an error.

#### Output formats

The following parameters define the formats for date and time output from Snowflake:

* [DATE_OUTPUT_FORMAT](parameters.md)
* [TIME_OUTPUT_FORMAT](parameters.md)
* [CSV_TIMESTAMP_FORMAT](parameters.md)
* [TIMESTAMP_OUTPUT_FORMAT](parameters.md)
* [TIMESTAMP_LTZ_OUTPUT_FORMAT](parameters.md)
* [TIMESTAMP_NTZ_OUTPUT_FORMAT](parameters.md)
* [TIMESTAMP_TZ_OUTPUT_FORMAT](parameters.md)

In addition, the following parameter maps the TIMESTAMP data type alias to one of the three TIMESTAMP_\* variations:

* [TIMESTAMP_TYPE_MAPPING](parameters.md)

#### Time zone

The following parameter determines the time zone:

* [TIMEZONE](parameters.md)

### File format options for loading/unloading dates, times, and timestamps

Separate from the input and output format parameters, Snowflake provides three file format options to use when loading data into or unloading data from Snowflake tables:

* DATE_FORMAT
* TIME_FORMAT
* TIMESTAMP_FORMAT

The options can be specified directly in the COPY command or in a named stage or file format object referenced in the COPY command. When specified, these options override the
corresponding input formats (when loading data) or output formats (when unloading data).

#### Data loading

When used in data loading, the options specify the format of the date, time, and timestamp strings in your staged data files. The options override the DATE_INPUT_FORMAT,
TIME_INPUT_FORMAT, or TIMESTAMP_INPUT_FORMAT parameter settings.

The default for all these options is AUTO, meaning the [COPY INTO <table>](sql/copy-into-table.md) command attempts to match all date and timestamp strings in the staged data files
with one of the formats listed in Supported formats for AUTO detection:

* If a matching format is found, Snowflake accepts the string.
* If no matching format is found, Snowflake returns an error and then performs the action specified for the ON_ERROR copy option.

> **Warning:**
>
> Snowflake supports automatic detection of most common date, time, and timestamp formats (see tables below). However, some formats might produce ambiguous results, which can
> cause Snowflake to apply an incorrect format when using AUTO for data loading.
>
> To guarantee correct loading of data, Snowflake strongly recommends explicitly setting the file format options for data loading.

#### Data unloading

When used in data unloading, the options specify the format applied to the dates, times, and timestamps unloaded to the files in specified stage.

The default for all these options is AUTO, meaning Snowflake applies the formatting specified in the following parameters:

* DATE_OUTPUT_FORMAT
* TIME_OUTPUT_FORMAT
* TIMESTAMP_\*_OUTPUT_FORMAT (depending on the TIMESTAMP_TYPE_MAPPING setting)

## About the elements used in input and output formats

In input and output formats that you specify in parameters,
file format options, and
[conversion functions](functions-conversion.md), you can use the elements listed in the table below.

The next sections
also use these elements to describe the formats recognized by Snowflake automatically.

| Format element | Description |
| --- | --- |
| `YYYY` | Four-digit [1] year. |
| `YY` | Two-digit [1] year, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when set to `1980`, values of `79` and `80` are parsed as `2079` and `1980`, respectively. |
| `Y` | One-digit or two-digit [2] year without leading zeros, controlled by the [TWO_DIGIT_CENTURY_START](parameters.md) session parameter. For example, when the parameter set to `1990`, values of `2005` and `1991` are serialized as `5` and `91`, respectively. |
| `MM` | Two-digit [1] month (`01` = January, and so on). |
| `MO` | One-digit or two-digit [2] month without leading zeros (`1` = January, and so on). |
| `MON` | Abbreviated month name [3]. |
| `MMMM` | Full month name [3]. |
| `DD` | Two-digit [1] day of month (`01` through `31`). |
| `D` | One-digit or two-digit [2] day of month without leading zeros (`1` through `31`). |
| `DY` | Abbreviated day of week. |
| `HH24` | Two digits [1] for hour (`00` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `HH12` | Two digits [1] for hour (`01` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `H24` | One or two digits [2] for hour without leading zeros (`0` through `23`). You *must not* specify `AM` / `PM` or `A` / `P`. |
| `H12` | One or two digits [2] for hour without leading zeros (`1` through `12`). You can specify `AM` / `PM` or `A` / `P`. |
| `AM` , `PM` | Ante meridiem (`AM`) / post meridiem (`PM`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `P` | Ante meridiem (`A`) / post meridiem (`P`). Use this only with `HH12` and code:`H12` (*not* with `HH24` or `H24`). |
| `HH` | Synonym for `HH24`. |
| `H` | Synonym for `H24`. |
| `MI` | Two digits [1] for minute (`00` through `59`). |
| `ME` | One or two digits [2] for minute without leading zeros (`0` through `59`). |
| `SS` | Two digits [1] for second (`00` through `59`). |
| `S` | One or two digits [2] for second without leading zeros (`0` through `59`). |
| `FF[0-9]` | Fractional seconds with precision `0` (seconds) to `9` (nanoseconds), e.g. `FF`, `FF0`, `FF3`, `FF9`. Specifying `FF` is equivalent to `FF9` (nanoseconds). |
| `TZH:TZM` , `TZHTZM` , `TZH` | Two-digit [1] time zone hour and minute, offset from UTC. Can be prefixed by `+`/`-` for sign. |
| `UUUU` | Four-digit year in [ISO format](https://en.wikipedia.org/wiki/ISO_8601), which are negative for BCE years. |

[1] The number of digits describes the output produced when serializing values to text. When parsing text, Snowflake accepts up to the specified number of digits. For example, a day number can be one or two digits.

[2] The number of digits describes the output produced when serializing values to text. Parsing isn’t supported. If parsing is required, use an equivalent format that includes leading zeros. These format elements will be enabled in BCR bundle 2026_03.

[3] For the MON format element, the output produced when serializing values to text is the abbreviated month name. For the MMMM format element, the output produced when serializing values to text is the full month name. When parsing text, Snowflake accepts the three-digit abbreviation or the full month name for both MON and MMMM. For example, “January” or “Jan”, “February” or “Feb”, and so on are accepted when parsing text.

> **Note:**
>
> * When a date-only format is used, the associated time is assumed to be midnight on that day.
> * Anything in the format between double quotes or other than the above elements is parsed/formatted without being interpreted.
>   Snowflake recommends always enclosing literal characters in double quotes
>   (for example, `"T"`, `"EST"`, `"Z"`) to ensure they are treated as literals.
> * For more details about valid ranges, number of digits, and best practices, see
>   Additional information about using date, time, and timestamp formats.

## Supported formats for AUTO detection

If instructed to do so, Snowflake automatically detects and processes specific formats for date, time, and timestamp input
strings. The following sections describe the supported formats:

* Date formats
* Time formats
* Timestamp formats

> **Attention:**
>
> Some strings can match multiple formats. For example, `'07-04-2016'` is compatible with both
> `MM-DD-YYYY` and `DD-MM-YYYY`, but has different meanings in each format (July 4 vs. April 7). The fact that a
> matching format is found does not guarantee that the string is parsed as the user intended.
>
> Although automatic date format detection is convenient, it increases the possibility of misinterpretation. Snowflake
> strongly recommends specifying the format explicitly rather than relying on automatic date detection.

### Date formats

For descriptions of the elements used in the formats below, see About the elements used in input and output formats.

| Format | Example | Notes |
| --- | --- | --- |
| **ISO Date Formats** |  |  |
| `YYYY-MM-DD` | `2013-04-28` |  |
| **Other Date Formats** |  |  |
| `DD-MON-YYYY` | `17-DEC-1980` |  |
| `MM/DD/YYYY` | `12/17/1980` | Could produce incorrect dates when loading or operating on dates in common European formats (that is, `DD/MM/YYYY`). For example, 05/02/2013 could be interpreted as May 2, 2013 instead of February 5, 2013. |

When using AUTO date formatting, dashes and slashes aren’t interchangeable. Slashes imply `MM/DD/YYYY` format,
and dashes imply `YYYY-MM-DD` format. Strings such as `'2019/01/02'` or `'01-02-2019'` aren’t interpreted as you might
expect.

### Time formats

For descriptions of the elements used in the formats below, see About the elements used in input and output formats.

| Format | Example | Notes |
| --- | --- | --- |
| **ISO Time Formats** |  |  |
| `HH24:MI:SS.FFTZH:TZM` | `20:57:01.123456789+07:00` |  |
| `HH24:MI:SS.FF` | `20:57:01.123456789` |  |
| `HH24:MI:SS` | `20:57:01` |  |
| `HH24:MI` | `20:57` |  |
| **Internet (RFC) Time Formats** |  |  |
| `HH12:MI:SS.FF AM` | `07:57:01.123456789 AM` |  |
| `HH12:MI:SS AM` | `04:01:07 AM` |  |
| `HH12:MI AM` | `04:01 AM` |  |

The `AM` format element allows values with either `AM` or `PM`.

> **Note:**
>
> Use the `AM` format element only with `HH12` (not with `HH24`).

When a timezone offset (for example, `0800`) occurs immediately after a digit in a time or timestamp string, the timezone
offset must start with `+` or `-`. The sign prevents ambiguity when the fractional seconds or the
time zone offset does not contain the maximum number of allowable digits. For example,
without a separator between the last digit of the fractional seconds and the first digit of the timezone,
the `1` in the time `04:04:04.321200` could be either the last digit of the fractional seconds
(that is, 321 milliseconds) or the first digit of the timezone offset (that is, 12 hours ahead of UTC).

### Timestamp formats

For descriptions of the elements used in the formats below, see About the elements used in input and output formats.

| Format | Example | Notes |
| --- | --- | --- |
| **ISO Timestamp Formats** |  |  |
| `YYYY-MM-DD"T"HH24:MI:SS.FFTZH:TZM` | `2013-04-28T20:57:01.123456789+07:00` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MI:SS.FFTZH:TZM` | `2013-04-28 20:57:01.123456789+07:00` |  |
| `YYYY-MM-DD HH24:MI:SS.FFTZH` | `2013-04-28 20:57:01.123456789+07` |  |
| `YYYY-MM-DD HH24:MI:SS.FF TZH:TZM` | `2013-04-28 20:57:01.123456789 +07:00` |  |
| `YYYY-MM-DD HH24:MI:SS.FF TZHTZM` | `2013-04-28 20:57:01.123456789 +0700` |  |
| `YYYY-MM-DD HH24:MI:SS TZH:TZM` | `2013-04-28 20:57:01 +07:00` |  |
| `YYYY-MM-DD HH24:MI:SS TZHTZM` | `2013-04-28 20:57:01 +0700` |  |
| `YYYY-MM-DD"T"HH24:MI:SS.FF` | `2013-04-28T20:57:01.123456` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MI:SS.FF` | `2013-04-28 20:57:01.123456` |  |
| `YYYY-MM-DD"T"HH24:MI:SS` | `2013-04-28T20:57:01` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MI:SS` | `2013-04-28 20:57:01` |  |
| `YYYY-MM-DD"T"HH24:MI` | `2013-04-28T20:57` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MI` | `2013-04-28 20:57` |  |
| `YYYY-MM-DD"T"HH24` | `2013-04-28T20` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24` | `2013-04-28 20` |  |
| `YYYY-MM-DD"T"HH24:MI:SSTZH:TZM` | `2013-04-28T20:57:01-07:00` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MI:SSTZH:TZM` | `2013-04-28 20:57:01-07:00` |  |
| `YYYY-MM-DD HH24:MI:SSTZH` | `2013-04-28 20:57:01-07` |  |
| `YYYY-MM-DD"T"HH24:MITZH:TZM` | `2013-04-28T20:57+07:00` | The double quotes around the `T` are optional, but recommended (see the tip following this table for details). |
| `YYYY-MM-DD HH24:MITZH:TZM` | `2013-04-28 20:57+07:00` |  |
| **Internet (RFC) Timestamp Formats** |  |  |
| `DY, DD MON YYYY HH24:MI:SS TZHTZM` | `Thu, 21 Dec 2000 16:01:07 +0200` |  |
| `DY, DD MON YYYY HH24:MI:SS.FF TZHTZM` | `Thu, 21 Dec 2000 16:01:07.123456789 +0200` |  |
| `DY, DD MON YYYY HH12:MI:SS AM TZHTZM` | `Thu, 21 Dec 2000 04:01:07 PM +0200` |  |
| `DY, DD MON YYYY HH12:MI:SS.FF AM TZHTZM` | `Thu, 21 Dec 2000 04:01:07.123456789 PM +0200` |  |
| `DY, DD MON YYYY HH24:MI:SS` | `Thu, 21 Dec 2000 16:01:07` |  |
| `DY, DD MON YYYY HH24:MI:SS.FF` | `Thu, 21 Dec 2000 16:01:07.123456789` |  |
| `DY, DD MON YYYY HH12:MI:SS AM` | `Thu, 21 Dec 2000 04:01:07 PM` |  |
| `DY, DD MON YYYY HH12:MI:SS.FF AM` | `Thu, 21 Dec 2000 04:01:07.123456789 PM` |  |
| **Other Timestamp Formats** |  |  |
| `MM/DD/YYYY HH24:MI:SS` | `2/18/2008 02:36:48` | Could produce incorrect dates when loading or operating on dates in common European formats (i.e. `DD/MM/YYYY`). For example, 05/02/2013 could be interpreted as May 2, 2013 instead of February 5, 2013. |
| `DY MON DD HH24:MI:SS TZHTZM YYYY` | `Mon Jul 08 18:09:51 +0000 2013` |  |

When a timezone offset (for example, `0800`) occurs immediately after a digit in a time or timestamp string, the timezone
offset must start with `+` or `-`. The sign prevents ambiguity when the fractional seconds or the
time zone offset does not contain the maximum number of allowable digits. For example,
without a separator between the last digit of the fractional seconds and the first digit of the timezone,
the `1` in the time `04:04:04.321200` could be either the last digit of the fractional seconds
(that is, 321 milliseconds) or the first digit of the timezone offset (that is, 12 hours ahead of UTC).

> **Tip:**
>
> In some of the timestamp formats, the letter `T` is used as a separator between the date and time
> (for example, `'YYYY-MM-DD"T"HH24:MI:SS'`).
>
> The double quotes around the `T` are optional. However, Snowflake recommends always enclosing literal
> characters — such as `T`, timezone abbreviations like `EST`, or any other non-element text — in
> double quotes to avoid ambiguity.
>
> Use the double quotes only in the format specifier, not the actual values. For example:
>
> ```sqlexample
> -- Not recommended: literals T and EST are not quoted
> SELECT TO_TIMESTAMP('2026-01-02T03:04:05 EST', 'YYYY-MM-DDTHH24:MI:SS EST');
>
> -- Recommended: all literal characters are enclosed in double quotes
> SELECT TO_TIMESTAMP('2026-01-02T03:04:05 EST', 'YYYY-MM-DD"T"HH24:MI:SS "EST"');
> ```
>
> The quotes around literal characters must be double quotes, not single quotes.

## Additional information about using date, time, and timestamp formats

The following sections describe requirements and best practices for individual fields in dates, times, and timestamps.

* Valid ranges of values for fields
* Using the correct number of digits with format elements
* Whitespace in values and format specifiers
* Context dependency
* Summary of best practices for specifying the format

### Valid ranges of values for fields

The recommended ranges of values for each field are shown below:

| Field | Values | Notes |
| --- | --- | --- |
| Years | `0001` to `9999` | Some values outside this range might be accepted in some contexts, but Snowflake recommends using only values in this range. For example, the year 0000 is accepted, but is incorrect because in the Gregorian calendar the year 1 A.D. comes immediately after the year 1 B.C.; there is no year 0. |
| Months | `01` to `12` |  |
| Days | `01` to `31` | In months that have fewer than 31 days, the actual maximum is the number of days in the month. |
| Hours | `00` to `23` | Or `01`-`12` if you are using `HH12` format. |
| Minutes | `00` to `59` |  |
| Seconds | `00` to `59` | Snowflake doesn’t support leap seconds or leap-leap seconds; values `60` and `61` are rejected. |
| Fraction | `0` to `999999999` | The number of digits after the decimal point depends in part upon the exact format specifier (for example, `FF3` supports up to 3 digits after the decimal point and `FF9` supports up to 9 digits after the decimal point). You can enter fewer digits than you specified (for example, 1 digit is allowed even if you use `FF9`); trailing zeros aren’t required to fill out the field to the specified width. |

### Using the correct number of digits with format elements

For most fields (year, month, day, hour, minute, and second), the elements (`YYYY`, `MM`, `DD`, and so on) of the
format specifier are two or four characters.

The following rules tell you how many digits you should actually specify in the literal values:

* `YYYY`: You can specify 1, 2, 3, or 4 digits of the year. However, Snowflake recommends specifying 4 digits. If
  necessary, prepend leading zeros. For example, the year 536 A.D. is `0536`.
* `YY`: Specify 1 or 2 digits of the year. However, Snowflake recommends specifying 2 digits. If
  necessary, prepend a leading zero.
* `MM`: Specify one or two digits. For example, January can be represented as `01` or `1`. Snowflake recommends
  using two digits.
* `DD`: Specify one or two digits. Snowflake recommends using two digits.
* `HH12` and `HH24`: Specify one or two digits. Snowflake recommends using two digits.
* `MI`: Specify one or two digits. Snowflake recommends using two digits.
* `SS`: Specify one or two digits. Snowflake recommends using two digits.
* `FF9`: Specify between 1 and 9 digits (inclusive). Snowflake recommends specifying the number of actual
  significant digits. Trailing zeros aren’t required.
* `TZH`: Specify one or two digits. Snowflake recommends using two digits.
* `TZM`: Specify one or two digits. Snowflake recommends using two digits.

For all fields (other than fractional seconds), Snowflake recommends specifying the maximum number of digits. Use leading
zeros if necessary. For example, `0001-02-03 04:05:06 -07:00` follows the recommended format.

For fractional seconds, trailing zeros are optional. In general, it is considered good practice to specify only
the number of digits that are reliable and meaningful. For example, if a time measurement is accurate to three decimal
places (milliseconds), then specifying it as nine digits (for example, `.123000000`) might be misleading.

### Whitespace in values and format specifiers

Snowflake enforces matching whitespace in some, but not all, situations. For example, the following statement
generates an error because there is no space between the days and the hours in the specified value, but there is a
space between `DD` and `HH` in the format specifier:

```sqlexample
SELECT TO_TIMESTAMP('2019-02-2823:59:59 -07:00', 'YYYY-MM-DD HH24:MI:SS TZH:TZM');
```

However, the following statement doesn’t generate an error, even though the value contains a whitespace where the specifier doesn’t:

```sqlexample
SELECT TO_TIMESTAMP('2019-02-28 23:59:59.000000000 -07:00', 'YYYY-MM-DDHH24:MI:SS.FF TZH:TZM');
```

The reason for the difference is that in the former case, the values would be ambiguous if the fields aren’t all
at their maximum width. For example, `213` could be interpreted as 2 days and 13 hours, or as 21 days and 3 hours.
However, `DDHH` is unambiguously the same as `DD HH` (other than the whitespace).

> **Tip:**
>
> Although some whitespace differences are allowed in order to handle variably-formatted data,
> Snowflake recommends that values and specifiers exactly match, including spaces.

### Context dependency

Not all restrictions are enforced equally in all contexts.
For example, some expressions might roll over February 31, while others might not.

### Summary of best practices for specifying the format

These best practices minimize ambiguities and other potential issues in past, current, and projected future versions
of Snowflake:

* Be aware of the dangers of mixing data from sources that use different formats (for example, of mixing data that follows
  the common U.S. format `MM-DD-YYYY` and the common European format `DD-MM-YYYY`).
* Specify the maximum number of digits for each field (except fractional seconds). For example, use four-digit years,
  specifying leading zeros if necessary.
* Specify a blank or the letter `T` between the date and time in a timestamp.
  If using `T`, Snowflake recommends writing it as `"T"` in the format string.
* Enclose literal characters in double quotes within format strings. For example, use
  `YYYY-MM-DD"T"HH24:MI:SS "EST"` rather than `YYYY-MM-DDTHH24:MI:SS EST`. This helps prevent
  literal text from being misinterpreted as format elements.
* Make sure whitespace (and the `"T"` separator between the date and time) are the same in values and in the
  format specifier.
* Use interval arithmetic if you need the equivalent of rollover.
* Be careful when using AUTO formatting. When possible, specify the format, and ensure that values always match the
  specified format.
* Specify the format in the command, because it is safer than specifying the format outside the command, for example in
  a parameter such as DATE_INPUT_FORMAT. (See below.)
* When moving scripts from one environment to another, ensure that date-related parameters, such as DATE_INPUT_FORMAT,
  are the same in the new environment as they were in the old environment (assuming that the values are also in
  the same format).

## Date & time functions

Snowflake provides a set of functions to construct, convert, extract, or modify DATE, TIME, and TIMESTAMP data. For more
information, see [Date & Time Functions](functions-date-time.md).

## AUTO detection of integer-stored date, time, and timestamp values

For integers of seconds or milliseconds stored in a string, Snowflake attempts to determine the correct unit of measurement based
on the length of the value.

> **Note:**
>
> The use of quoted integers as inputs is deprecated.

This example calculates the timestamp equivalent to 1487654321 seconds since the start of the Unix epoch:

```sqlexample
SELECT TO_TIMESTAMP('1487654321');
```

```output
+-------------------------------+
| TO_TIMESTAMP('1487654321')    |
|-------------------------------|
| 2017-02-21 05:18:41.000000000 |
+-------------------------------+
```

Here is a similar calculation using milliseconds since the start of the epoch:

```sqlexample
SELECT TO_TIMESTAMP('1487654321321');
```

```output
+-------------------------------+
| TO_TIMESTAMP('1487654321321') |
|-------------------------------|
| 2017-02-21 05:18:41.321000000 |
+-------------------------------+
```

Depending on the magnitude of the value, Snowflake uses a different unit of measure:

* After the string is converted to an integer, the integer is treated as a number of seconds, milliseconds,
  microseconds, or nanoseconds after the start of the Unix epoch (1970-01-01 00:00:00.000000000 UTC).

  + If the integer is less than 31536000000 (the number of milliseconds in a year), then the value is treated as
    a number of seconds.
  + If the value is greater than or equal to 31536000000 and less than 31536000000000, then the value is treated
    as milliseconds.
  + If the value is greater than or equal to 31536000000000 and less than 31536000000000000, then the value is
    treated as microseconds.
  + If the value is greater than or equal to 31536000000000000, then the value is
    treated as nanoseconds.
* If more than one row is evaluated (for example, if the input is the column name of a table that contains more than
  one row), each value is examined independently to determine if the value represents seconds, milliseconds, microseconds, or
  nanoseconds.

In cases where formatted strings and integers in strings are passed to the function, each value is cast according to the contents
of the string. For example, if you pass a date-formatted string and a string containing an integer to TO_TIMESTAMP, the function
interprets each value correctly according to what each string contains:

```sqlexample
SELECT TO_TIMESTAMP(column1) FROM VALUES ('2013-04-05'), ('1487654321');
```

```output
+-------------------------+
| TO_TIMESTAMP(COLUMN1)   |
|-------------------------|
| 2013-04-05 00:00:00.000 |
| 2017-02-21 05:18:41.000 |
+-------------------------+
```

## Date & time function format best practices

AUTO detection usually determines the correct input format. However, there are situations where it might not be able to make the correct determination.

To avoid this, Snowflake strongly recommends the following best practices (substituting [TO_DATE , DATE](functions/to_date.md) or [TO_TIME , TIME](functions/to_time.md) for [TO_TIMESTAMP](functions/to_timestamp.md), as appropriate):

* Avoid using AUTO format if there is any chance for ambiguous results. Instead, specify an explicit format string by:

  + Setting [TIMESTAMP_INPUT_FORMAT](parameters.md) and other session parameters for dates, timestamps, and times.
    See Session Parameters for Dates, Times, and Timestamps (in this topic).
  + Specifying the format using the following syntax:

    ```sqlsyntax
    TO_TIMESTAMP(<value>, '<format>')
    ```
* For strings containing integer values, specify the scale using the following syntax:

  ```sqlsyntax
  TO_TIMESTAMP(TO_NUMBER(<string_column>), <scale>)
  ```

---
title: DDL for user-defined functions, external functions, and stored procedures
source: https://docs.snowflake.com/en/sql-reference/ddl-udf.md
section: SQL General Reference
---

# DDL for user-defined functions, external functions, and stored procedures

UDFs (user-defined functions) and stored procedures are two programming constructs that allow you to extend Snowflake SQL.

## UDF management

UDFs can be used to perform operations that are not available via the system-defined functions provided by Snowflake. Snowflake provides the following DDL
commands for creating and managing UDFs:

* [CREATE FUNCTION](sql/create-function.md)
* [ALTER FUNCTION](sql/alter-function.md)
* [DROP FUNCTION](sql/drop-function.md)
* [DESCRIBE FUNCTION](sql/desc-function.md)
* [SHOW USER FUNCTIONS](sql/show-user-functions.md)

> **Note:**
>
> UDFs can contain Java, JavaScript, Python, and SQL; however, DDL and DML operations are not supported in UDFs.

## External function management

External functions can be used to perform operations that are not available via the system-defined functions provided
by Snowflake. External functions are a type of UDF, but their syntax is different enough that they have their own
CREATE, ALTER, and SHOW statements.

Snowflake provides the following DDL commands for creating and managing external functions:

* [CREATE EXTERNAL FUNCTION](sql/create-external-function.md)
* [ALTER FUNCTION](sql/alter-function.md)
* [DROP FUNCTION](sql/drop-function.md)
* [SHOW EXTERNAL FUNCTIONS](sql/show-external-functions.md)
* [DESCRIBE FUNCTION](sql/desc-function.md)

External functions use API integrations. Snowflake provides the following DDL commands for creating and managing
API integrations:

* [CREATE API INTEGRATION](sql/create-api-integration.md)
* [ALTER API INTEGRATION](sql/alter-api-integration.md)
* [DROP INTEGRATION](sql/drop-integration.md)
* [SHOW INTEGRATIONS](sql/show-integrations.md)
* [DESCRIBE INTEGRATION](sql/desc-integration.md)

## Stored procedure management

Snowflake provides the following DDL commands for creating and managing stored procedures:

* [CREATE PROCEDURE](sql/create-procedure.md)
* [ALTER PROCEDURE](sql/alter-procedure.md)
* [DROP PROCEDURE](sql/drop-procedure.md)
* [SHOW PROCEDURES](sql/show-procedures.md)
* [DESCRIBE PROCEDURE](sql/desc-procedure.md)

In addition, Snowflake provides the following command for using stored procedures:

* [CALL](sql/call.md)

---
title: DECLARE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/declare.md
section: SQL General Reference
---

# DECLARE (Snowflake Scripting)

Declares one or more Snowflake Scripting variables, cursors, RESULTSETs, nested stored procedures, or exceptions.

For more information, see the following topics:

* [Working with variables](../../developer-guide/snowflake-scripting/variables.md)
* [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md)
* [Working with RESULTSETs](../../developer-guide/snowflake-scripting/resultsets.md)
* [Using nested stored procedures](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md)
* [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md)

See also:
:   [LET](let.md)

## Syntax

```sqlsyntax
DECLARE
  {   <variable_declaration>
    | <cursor_declaration>
    | <resultset_declaration>
    | <nested_stored_procedure_declaration>
    | <exception_declaration> };
  [
    {   <variable_declaration>
      | <cursor_declaration>
      | <resultset_declaration>
      | <nested_stored_procedure_declaration>
      | <exception_declaration> };
    ...
  ]
```

The following sections describe the syntax for each type of declaration in more detail:

* Variable declaration syntax
* Cursor declaration syntax
* RESULTSET declaration syntax
* Nested stored procedure declaration syntax
* Exception declaration syntax

### Variable declaration syntax

Use the following syntax to declare a [variable](../../developer-guide/snowflake-scripting/variables.md):

```sqlsyntax
<variable_declaration> ::=
  <variable_name> [<type>] [ { DEFAULT | := } <expression>]
```

Where:

> `variable_name`
> :   The name of the variable. The name must follow the naming rules for [Object identifiers](../identifiers.md).
>
> `type`
> :   A [SQL data type](../../sql-reference-data-types.md).
>
> `DEFAULT expression` or . `:= expression`
> :   Assigns the value of `expression` to the variable. If both `type` and `expression` are specified, the
>     expression must evaluate to a data type that matches, or can be implicitly [cast](../functions/cast.md) to, the
>     specified `type`.

For example:

> ```sqlexample
> profit NUMBER(38, 2) := 0;
> ```

For a complete example, see Examples.

For more information about variables, see [Working with variables](../../developer-guide/snowflake-scripting/variables.md).

### Cursor declaration syntax

Use the following syntax to declare a [cursor](../../developer-guide/snowflake-scripting/cursors.md):

```sqlsyntax
<cursor_declaration> ::=
  <cursor_name> CURSOR FOR <query>
```

Where:

> `cursor_name`
> :   The name to give the cursor. This can be any valid Snowflake [identifier](../identifiers.md)
>     that is not already in use in this block. The identifier is used by other cursor-related commands, such as
>     `FETCH`.
>
> `query`
> :   The query that defines the result set that the cursor iterates over.
>
>     This can be almost any valid SELECT statement. To specify bind parameters in the SELECT statement, use
>     question marks (`?`). You can bind the parameters to bind variables in the USING clause when you
>     open the cursor.

For example:

> ```sqlexample
> c1 CURSOR FOR SELECT id, price FROM invoices;
> ```

For more information about cursors (including complete examples), see [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md).

### RESULTSET declaration syntax

Use the following syntax to declare a [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md):

```sqlsyntax
<resultset_name> RESULTSET [ { DEFAULT | := } [ ASYNC ] ( <query> ) ] ;
```

Where:

> `resultset_name`
> :   The name to give the RESULTSET.
>
>     The name should be unique within the current scope.
>
>     The name must follow the naming rules for [Object identifiers](../identifiers.md).
>
> `ASYNC`
> :   Runs the query as an [asynchronous child job](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md).
>
>     The query can be any valid SQL statement, including SELECT statements and DML statements, such as INSERT
>     or UPDATE.
>
>     When this keyword is omitted, the stored procedure runs child jobs sequentially, and each child job waits for
>     the running child job to finish before it starts.
>
>     You can use this keyword to run multiple child jobs concurrently, which can improve efficiency and reduce overall
>     run time.
>
>     You can use [AWAIT](await.md) and [CANCEL](cancel.md)
>     statements to manage asynchronous child jobs for a RESULTSET.
>
> `DEFAULT query` or . `:= query`
> :   Assigns the value of `query` to the RESULTSET.

For example:

```sqlexample
res RESULTSET DEFAULT (SELECT col1 FROM mytable ORDER BY col1);
```

For more information about RESULTSETs (including complete examples), see [Working with RESULTSETs](../../developer-guide/snowflake-scripting/resultsets.md).

### Nested stored procedure declaration syntax

Use the following syntax to declare a [nested stored procedure](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md):

```sqlsyntax
<nested_procedure_name> PROCEDURE (
    [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( [ <col_name> <col_data_type> [ , ... ] ] ) }
  AS <nested_procedure_definition>
```

Where:

`nested_procedure_name`
:   The name of the nested stored procedure. The name must follow the naming rules for [Object identifiers](../identifiers.md).

`( [ arg_name arg_data_type ] [ , ... ] )`
:   Specifies the input arguments for the nested stored procedure.

    * For `arg_name`, specify the name of the input argument.
    * For `arg_data_type`, specify [a SQL data type](../../sql-reference-data-types.md).

`RETURNS { result_data_type | TABLE ( [ col_name col_data_type [ , ... ] ] ) }`
:   Specifies the type of the result returned by the stored procedure. Currently, NOT NULL isn’t supported in the
    RETURNS parameter for nested stored procedures.

    * For `RETURNS result_data_type`, specify [a SQL data type](../../sql-reference-data-types.md).
    * For `RETURNS TABLE ( [ col_name col_data_type [ , ... ] ] )`, if you know the
      [Snowflake data types](../../sql-reference-data-types.md) of the columns in the returned table, specify the column names and
      types:

      ```sqlexample
      RETURNS TABLE (sales_date DATE, quantity NUMBER)
      ```

      Otherwise (for example, if you are determining the column types during run time), you can omit the column names and types:

      ```sqlexample
      RETURNS TABLE ()
      ```

      > **Note:**
      >
      > Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
      > applies whether you are creating a stored or anonymous procedure.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE(g GEOGRAPHY)
      >   ...
      > CALL test_return_geography_table_1();
      > ```
      >
      > If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
      >
      > ```none
      > Stored procedure execution error: data type of returned table does not match expected returned table type
      > ```
      >
      > To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
      >
      > ```sqlexample
      > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
      >   RETURNS TABLE()
      >   ...
      > ```
      >
      > ```sqlexample
      > WITH test_return_geography_table_1() AS PROCEDURE
      >   RETURNS TABLE()
      >   ...
      > CALL test_return_geography_table_1();
      > ```

`AS nested_procedure_definition`
:   Defines the code executed by the nested stored procedure. The definition can consist of any valid code.

### Exception declaration syntax

Use the following syntax to declare an [exception](../../developer-guide/snowflake-scripting/exceptions.md):

```sqlsyntax
<exception_name> EXCEPTION [ ( <exception_number> , '<exception_message>' ) ] ;
```

Where:

> `exception_name`
> :   The name to give to the exception.
>
> `exception_number`
> :   A number to uniquely identify the exception. The number must be an integer between -20000 and -20999. The
>     number should not be used for any other exception that exists at the same time.
>
>     Default: -20000
>
> `exception_message`
> :   A message to describe the exception.
>     The message must not contain any double quote characters.
>
>     Default: Empty string.

For example:

> ```sqlexample
> exception_could_not_create_table EXCEPTION (-20003, 'ERROR: Could not create table.');
> ```

For more information about exceptions (including complete examples), see [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md).

## Examples

This example declares a variable named `profit` for use in a Snowflake Scripting anonymous block:

```sqlexample
DECLARE
  profit number(38, 2) DEFAULT 0.0;
BEGIN
  LET cost number(38, 2) := 100.0;
  LET revenue number(38, 2) DEFAULT 110.0;

  profit := revenue - cost;
  RETURN profit;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  profit number(38, 2) DEFAULT 0.0;
BEGIN
  LET cost number(38, 2) := 100.0;
  LET revenue number(38, 2) DEFAULT 110.0;

  profit := revenue - cost;
  RETURN profit;
END;
$$
;
```

For more examples that declare variables, cursors, RESULTSETs, and exceptions, see the following topics:

* [Examples of using variables](../../developer-guide/snowflake-scripting/variables.md)
* [Example of using a cursor](../../developer-guide/snowflake-scripting/cursors.md)
* [Examples of using a RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md)
* [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md)

---
title: DEREGISTER_EXTENSION
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/deregister_extension.md
section: SQL General Reference
---

# DEREGISTER_EXTENSION

Deregisters an extension from the Trust Center.

For more information, see [Using Trust Center extensions](../../user-guide/trust-center/trust-center-extensions.md).

## Syntax

```sqlsyntax
SNOWFLAKE.TRUST_CENTER.DEREGISTER_EXTENSION(
  '<extension_id>')
```

## Arguments

`'extension_id'`
:   The identifier of the extension.

    To find the identifiers for registered extensions, query the
    [EXTENSIONS view](../trust_center/extensions.md).

## Returns

Returns a VARCHAR value:

* If deregistration is successful, the VARCHAR value contains the following message:

  ```output
  Extension is successfully deregistered.
  ```
* If deregistration fails, the VARCHAR value contains an error message. Deregistration can fail for the following reasons:

  + The specified `extension_id` is invalid.

  The following example shows an error message that is returned for an invalid `extension_id`:

  ```output
  Either the extension with given id does not exist or it is already deregistered
  ```

## Examples

The following example deregisters an extension with ID `extension_id`:

```sqlexample
CALL SNOWFLAKE.TRUST_CENTER.DEREGISTER_EXTENSION(
  'extension_id');
```

---
title: Designing high-performance external functions
source: https://docs.snowflake.com/en/sql-reference/external-functions-implementation.md
section: SQL General Reference
---

# Designing high-performance external functions

This topic provides information about concurrency, reliability, and scalability of external functions, including
information about using asynchronous external functions.

## Asynchronous vs. Synchronous remote services

A remote service can be synchronous or asynchronous.

synchronous:
:   A call to a synchronous remote service is a blocking call. The remote service does not send any response
    until the results are ready. The service can’t be polled.

    Synchronous code is easier to implement than asynchronous code.

asynchronous:
:   An asynchronous remote service can be polled while the caller waits for results.

    Asynchronous handling reduces sensitivity to timeouts.

    For more information about asynchronous services, see Microsoft’s description of
    [Asynchronous Request-Reply Pattern](https://docs.microsoft.com/en-us/azure/architecture/patterns/async-request-reply) .
    (The information is not limited to Microsoft Azure.)

A synchronous remote service receives an HTTP POST request, processes the request, and returns the result. Depending
upon how long it takes to process the data, there can be a significant delay between the time that request is
received and the results are returned.

An asynchronous remote service receives an HTTP POST request and returns a (usually almost immediate) acknowledgement
that the request was received. The caller (Snowflake) then executes a polling loop in which it issues one or more HTTP
GET requests (usually with a significant delay between each request) to check the status of asynchronous processing.
A GET does not send any data in the request body, but contains the same headers as the original POST.

Asynchronous remote services are useful when a remote service exceeds the timeouts built into components such as
the proxy service (e.g. Amazon API Gateway).

A remote service is not necessarily purely synchronous or purely asynchronous. A remote service can operate
synchronously and asynchronously at different times, depending upon factors such as the amount of data in the
request, the number of other requests being processed, etc.

Snowflake’s implementation of external functions is generally compatible with both synchronous and asynchronous
third party function libraries.

The diagram below contrasts synchronous and asynchronous processing. The upper path is synchronous. The lower
path (which includes one or more HTTP GET requests) is asynchronous.

To view examples of synchronous and asynchronous external functions,
see [Snowflake Sample Functions](external-functions-creating-aws-ui-remote-service.md).

### Synchronous remote service

Before a user can call an external function, developers and
Snowflake account administrators must configure Snowflake to access the proxy service. Typically,
the steps are done in approximately the order shown below (starting from the right-hand side of the
diagram above and moving leftward towards Snowflake).

1. A developer must write the remote service, and that remote service must be exposed via the HTTPS proxy service.
   For example, the remote service might be a Python function running on AWS Lambda and exposed via a resource in
   the Amazon API Gateway.
2. In Snowflake, an ACCOUNTADMIN or a role with the CREATE INTEGRATION privilege
   must create an “API integration” object that contains authentication
   information that enables Snowflake to communicate with the proxy service. The API integration is created with
   the SQL command [CREATE API INTEGRATION](sql/create-api-integration.md).
3. A Snowflake user must execute the SQL command [CREATE EXTERNAL FUNCTION](sql/create-external-function.md). The user must use
   a role that has USAGE privilege on the API integration and has sufficient privileges to create functions.

   > **Note:**
   >
   > The CREATE EXTERNAL FUNCTION command does not actually create an external function in the sense of loading
   > code that will be “executed outside Snowflake”. Instead, the CREATE EXTERNAL FUNCTION command creates a
   > database object that indirectly references the code that executes outside Snowflake. More precisely,
   > the CREATE EXTERNAL FUNCTION command creates an object that contains:
   >
   > * The URL of the resource in the HTTPS proxy service that acts as a relay function.
   > * The name of the API integration to use to authenticate to the proxy service.
   > * A name that is effectively an alias for the remote service. This alias is used in SQL commands,
   >   for example `SELECT MyAliasForRemoteServiceXYZ(col1) ...;`

The alias in Snowflake, the HTTPS proxy service resource’s name, and the remote service’s name can all be different.
(Using the same name for all three can simplify administration, however.)

Although the steps described above are the most common way of executing an external function, some variations
are allowed. For example:

* The remote service might not be the final step in the chain; the remote service could call yet another
  remote service to do part of the work.
* If the remote service doesn’t accept and return JSON-formatted data, then the HTTPS proxy service’s resource (the
  relay function) could convert the data from JSON format to another format (and convert the returned data
  back to JSON).
* Although Snowflake recommends that the remote service behave as a true function (i.e. a piece of code that
  accepts 0 or more input parameters and returns an output) that has no side effects and keeps no state information,
  this is not strictly required. The remote service could perform other tasks, for example sending alerts if a value
  (such as a temperature reading in the data) is dangerously high. In rare cases, the remote service might keep state
  information, for example the total number of alerts issued.

### Asynchronous remote service

An asynchronous remote service is useful
when a remote service exceeds the timeouts built into components such as the proxy service.

An asynchronous remote service involves the same components (client, Snowflake, proxy service, and remote service)
and the same general steps as described above. However, the details of the HTTP requests and responses are different.

Asynchronous behavior is implemented by the person who writes the remote service (and by Snowflake).
SQL statements are the same for asynchronous remote services as for synchronous remote services.

If you are writing your own remote service and want to make it compatible with Snowflake’s asynchronous handling,
write the remote service to behave as follows:

* When it initially receives an HTTP POST for a specific
  [batch](external-functions-introduction.md) of rows, the remote service returns
  HTTP code 202 (“Processing…”).
* If the remote service receives any HTTP GET requests after the POST but before the output is ready, the
  remote service returns HTTP code 202.
* After the remote service has generated all of the output rows, it waits for the next HTTP GET with the same
  batch ID, and then returns the rows received, along with HTTP code 200 (“Successful completion…”).

In short, for each batch received, the remote service returns 202 until the results are ready, after which
the next GET receives the results and an HTTP 200.

For each batch, Snowflake works with the asynchronous remote service as follows:

1. Snowflake sends an HTTP POST that contains the data to process, along with a unique batch ID.
2. If Snowflake receives an HTTP 202 response, then Snowflake loops until one of the following is true:

   * Snowflake receives the data and an HTTP 200.
   * Snowflake’s internal timeout is reached.
   * Snowflake receives an error (e.g. HTTP response code 5XX).

   In each iteration of the loop, Snowflake delays, then issues an HTTP GET that contains the same batch ID as
   the corresponding HTTP POST’s batch ID, so that the remote service can return information for the correct batch.

   The delay inside the loop starts out short but grows longer for each HTTP 202 response received until Snowflake’s
   timeout is reached.
3. If Snowflake’s timeout is reached before HTTP 200 is returned, then Snowflake aborts the SQL query.

   Currently, Snowflake’s timeout is 10 minutes (600 seconds) and is not user-configurable. This timeout might
   change in the future.

> **Note:**
>
> The frequency with which queries hit timeouts depends in part upon the scalability of the remote service. If your
> remote service times out frequently, then see also the discussion of
> Scalability.

## Scalability

The remote service, the proxy service, and any other steps between Snowflake and the remote service, must be able to
handle the peak workloads sent to them.

Some cloud platform providers have default usage limits or other quotas for proxy services and remote services,
which can limit the throughput of external function calls.

Larger Snowflake [warehouse sizes](../user-guide/warehouses-overview.md) can increase the concurrency with which
requests are sent, which might exceed the proxy service’s quota.

Users can see how many times Snowflake had to retry sending request batches (due to throttling or other errors)
for a query by looking at the value for Retries due to transient errors on the query profile.

### Scalability of the remote service

Developers who write remote services should consider:

* The frequency with which the remote service will be called.
* The number of rows sent per call.
* The resources required to process each row.
* The time distribution of calls (peak vs. average).

Capacity might need to increase over time as the callers change from a few developers and testers to an entire
organization. If the remote service is used by multiple organizations, capacity might need to increase as
the number of organizations increases. Furthermore, as the number and diversity of organizations increase, the size
and timing of workloads might become more difficult to predict.

The remote service provider is responsible for providing enough capacity to handle peak workloads.
Different techniques can be used to scale the service. If the remote service is managed by the author of the remote
service, then the author might need to explicitly provision the service with enough capacity to handle peaks.
Alternatively, the author might decide to use a hosted auto-scaled/elastic service, such as AWS Lambda.

Remote services should return HTTP response code 429 when overloaded. If Snowflake sees HTTP 429,
Snowflake scales back the rate at which it sends rows, and retries sending batches of rows that were not
processed successfully.

For more information about troubleshooting scalability issues, see
Troubleshooting scalability and performance issues.

If remote service invocations time out because each individual invocation takes a long time, rather than
because the system is generally overloaded, then see the description of how to build an
Asynchronous remote service.

### Scalability of the proxy service

The proxy service should also be scalable. Fortunately, proxy services provided by major cloud providers are
generally scalable.

However, some proxy services, including Amazon API Gateway and Azure API Management, have default usage limits. When
the request rate exceeds the limit, these proxy services throttle requests. If necessary, you
might need to ask AWS or Azure to increase your quota on your proxy service.

Users who develop or administer external functions should remember the following platform-specific information:

Amazon API Gateway:
:   The Amazon API Gateway is itself a managed AWS service, which auto-scales to users’ workloads. Users should be
    familiar with various
    [limits of API Gateway](https://docs.aws.amazon.com/apigateway/latest/developerguide/limits.html) .

    The Amazon API Gateway can be configured to help scale the remote service. Specifically, the API Gateway can be
    configured to enable caching and/or throttling of requests to reduce the load on the remote service if needed:

    * [Enable caching](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-caching.html)
    * [Enable throttling](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-request-throttling.html)

    Because throttling can affect timeouts and retries, users might also want to review information about how Snowflake
    handles timeouts and retries:

    * [Account for timeout errors](external-functions-best-practices.md)
    * [Do not assume that the remote service is passed each row exactly once](external-functions-best-practices.md)

Azure API Management Service:
:   For Azure API Management, the limits depend on the SKU chosen for the service. The limits are documented in the
    API Management limits section of the
    [Azure Subscription Service Limits](https://docs.microsoft.com/en-us/azure/azure-resource-manager/management/azure-subscription-service-limits#api-management-limits) .

    Because throttling can affect timeouts and retries, users might also want to review information about how Snowflake
    handles timeouts and retries:

    * [Account for timeout errors](external-functions-best-practices.md)
    * [Do not assume that the remote service is passed each row exactly once](external-functions-best-practices.md)

### Troubleshooting scalability and performance issues

* Use the [QUERY_HISTORY , QUERY_HISTORY_BY_\*](functions/query_history.md) function to observe performance characteristics and help
  debug performance issues.
* Use the [Query History page](../user-guide/ui-snowsight-activity.md) page to see average latency per request.
* Use the [Query History page](../user-guide/ui-snowsight-activity.md) page to see how many times requests were retried
  due to transient errors, including those listed in the section titled [Do not assume that the remote service is passed each row exactly once](external-functions-best-practices.md).
* Monitor your remote service resource usage to see how it scales to the load, and ensure that the
  remote service has enough capacity to serve peak load.
* Utilize logging in the Amazon API Gateway or in the remote service to get per-request details.
* Control the concurrency with which Snowflake sends requests to their remote service. For more details,
  see concurrency.
* Return HTTP Response Code 429 from the remote service when it is overloaded. Return this as early as possible,
  rather than wait for latency to increase.
* Take into account the proxy service timeout. For example, as of July 2020, the timeout for Amazon API
  Gateway is 30 seconds. Timeouts can be caused by various factors, including overloading of the remote service.

Snowflake attempts to retry transient errors/timeouts within a reasonable time, but if the service continues to be
overloaded, and retries do not succeed, eventually the query is aborted.

## Concurrency

Resource requirements depend upon the way that rows are distributed across calls (many parallel
calls with a few rows each vs. one call with the same total number of rows). A system that supports high capacity
does not necessarily support high concurrency, and vice-versa. You should estimate the peak concurrency required, as
well as the largest reasonable individual workloads, and provide enough resources to handle both types of peaks.

Furthermore, the concurrency estimate should take into account that Snowflake can parallelize external function calls.
A single query from a single user might cause multiple calls to the remote service in parallel. Several factors
affect the number of concurrent calls from Snowflake to a proxy service or remote service, including:

* The number of concurrent users who are running queries with external functions.
* The size of each user’s query.
* The amount of compute resources in the virtual warehouse (i.e. [the warehouse size](../user-guide/warehouses-overview.md)).
* The number of [warehouses](../user-guide/warehouses-multicluster.md).

Handling concurrency properly can be particularly complex if external functions have
[side effects](external-functions-best-practices.md). The results can vary depending upon the
order in which user’s rows are processed. (Snowflake recommends that you avoid writing or using remote services that
have side effects.)

## Reliability

Depending upon where the remote service is running, you might need to consider:

* Reliability.
* Error-handling.
* Debugging.
* Upgrading (if the remote service might add new features or need bug fixes).

If the remote service is not stateless, you might also need to consider recovery after failure. (Snowflake
strongly recommends that remote services be stateless.)

For information about timeouts and retries, see [Account for timeout errors](external-functions-best-practices.md) and
[Do not assume that the remote service is passed each row exactly once](external-functions-best-practices.md).

---
title: Differential privacy functions
source: https://docs.snowflake.com/en/sql-reference/functions-differential-privacy.md
section: SQL General Reference
---

# Differential privacy functions

The following functions are associated with [differential privacy](../user-guide/diff-privacy/differential-privacy-overview.md).

| Function | Description |
| --- | --- |
| [DP_INTERVAL_LOW](functions/dp_interval_low.md) | Returns the lower bound of the [noise interval](../user-guide/diff-privacy/differential-privacy-analyst.md). |
| [DP_INTERVAL_HIGH](functions/dp_interval_high.md) | Returns the upper bound of the [noise interval](../user-guide/diff-privacy/differential-privacy-analyst.md). |
| [ESTIMATE_REMAINING_DP AGGREGATES](functions/estimate_remaining_dp_aggregates.md) | Returns the estimated remaining number of aggregation function calls in the current user’s [privacy budget](../user-guide/diff-privacy/differential-privacy-admin-privacy-budgets.md). |

---
title: DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/snowflake_telemetry_drop_row_access_policy_on_events_view.md
section: SQL General Reference
---

# DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW

> **Note:**
>
> Using row access policies on the default event table is an [Enterprise Edition](../../user-guide/intro-editions.md) feature.

Deletes the specified [row access policy](../../user-guide/security-row-intro.md) bound to the
[EVENTS_VIEW](../telemetry/events_view.md).

The EVENTS_ADMIN role includes the USAGE privilege on this procedure.

## Syntax

```sqlsyntax
SNOWFLAKE.TELEMETRY.DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(
  <row_access_policy_reference>
)
```

## Arguments

`row_access_policy_reference`
:   A [reference](../references.md) to a row access policy object for the policy to drop.

## Returns

On successful execution, the procedure returns a string indicating success. Otherwise, the procedure returns an error.

## Usage notes

This stored procedure uses owner’s rights. For more details, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

## Examples

Code in the following example drops the `ROW_ACCESS_POLICY` policy bound to the EVENTS_VIEW:

```sqlexample
CALL SNOWFLAKE.TELEMETRY.DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW(
  SYSTEM$REFERENCE('ROW_ACCESS_POLICY', 'mydb.myschema.mypolicy', 'SESSION', 'APPLY')
);
```

---
title: Dropping constraints
source: https://docs.snowflake.com/en/sql-reference/constraints-drop.md
section: SQL General Reference
---

# Dropping constraints

Constraints are dropped using the following [ALTER TABLE](sql/alter-table.md) commands:

* ALTER TABLE … DROP CONSTRAINT explicitly drops the specified constraint. Similar to modifying constraints,
  you can identify the constraint using the constraint name or column definition along with the constraint type.
  For a primary key, the constraint can also be identified using the PRIMARY KEY keyword.
* ALTER TABLE … DROP COLUMN drops a column and its associated constraints.

By default, when a primary or unique key is dropped, all foreign keys referencing the key being dropped are also dropped,
unless the RESTRICT drop option is specified.

Constraints are also dropped when the associated tables, schemas, or databases are dropped. The DROP commands support the
CASCADE | RESTRICT drop options.

> **Note:**
>
> You can restore dropped tables, schemas, and databases using the UNDROP command. Dropped columns and constraints can’t
> be restored.

## Dropping constraints

You can explicitly drop UNIQUE, PRIMARY KEY, FOREIGN KEY, and CHECK constraints using the ALTER TABLE … DROP CONSTRAINT command:

> ```sqlsyntax
> ALTER TABLE <table_name> DROP { CONSTRAINT <name> | PRIMARY KEY | { UNIQUE | FOREIGN KEY } (<column>, [ ... ] ) } [ CASCADE | RESTRICT ]
> ```

For these constraints, when dropping a FOREIGN KEY constraint or a primary or unique key constraint with no foreign key references,
the constraints are dropped directly.

The default drop option is CASCADE, which means that dropping a unique or primary key with foreign key references drops all the referencing
foreign keys together with the unique or primary key.

If the RESTRICT drop option is specified, when dropping a primary or unique key, an error is returned if there exist foreign keys that
reference the keys being dropped.

## Dropping columns

Dropping columns using ALTER TABLE … DROP COLUMN behaves similarly to dropping constraints:

> ```sqlsyntax
> ALTER TABLE <table_name> DROP COLUMN <name> [ CASCADE | RESTRICT ]
> ```

For PRIMARY KEY, UNIQUE, and FOREIGN KEY constraints, the default drop option is CASCADE, which means any constraint that contains
the column being dropped is also dropped. If a primary or unique key involving the column is referenced by other FOREIGN KEY constraints,
all referencing foreign keys are dropped. If the RESTRICT option is specified, an error is returned if the column has primary or unique
keys with foreign keys references. The drop command only succeeds if there are no constraints defined on or referring to the column being
dropped.

For CHECK constraints that reference a single column, the default drop option is CASCADE. However, for CHECK constraints that reference
multiple columns, the default drop option is RESTRICT, which prevents accidental deletion of constraints that might be required for
data integrity.

## Dropping tables, schemas, and databases

The DROP command drops the specified table, schema, or database and can also be specified to drop all constraints associated with the object:

> ```sqlsyntax
> DROP { TABLE | SCHEMA | DATABASE } <name> [ CASCADE | RESTRICT ]
> ```

Similar to dropping columns and constraints, CASCADE is the default drop option, and all constraints that belong to or references the object
being dropped are also dropped.

For example, when dropping a database, if the database contains a primary or unique key which is referenced by a foreign key from another database,
the referencing foreign keys are also dropped.

If the object is later undropped, all relevant constraints previously dropped are restored.

If the RESTRICT option is specified, an error is returned if any PRIMARY KEY or UNIQUE constraints under the object has foreign key references.

---
title: Encryption functions
source: https://docs.snowflake.com/en/sql-reference/functions-encryption.md
section: SQL General Reference
---

# Encryption functions

Encryption functions encrypt or decrypt VARCHAR or BINARY values.

| Function | Notes |
| --- | --- |
| [ENCRYPT](functions/encrypt.md) | Encrypts VARCHAR or BINARY values using a passphrase. |
| [DECRYPT](functions/decrypt.md) | Decrypts VARCHAR or BINARY values using a passphrase. |
| [TRY_DECRYPT](functions/try_decrypt.md) | Error-handling version of DECRYPT. |
| [ENCRYPT_RAW](functions/encrypt_raw.md) | Encrypts BINARY values using a binary key and an initialization vector. |
| [DECRYPT_RAW](functions/decrypt_raw.md) | Decrypts BINARY values using a binary key and an initialization vector. |
| [TRY_DECRYPT_RAW](functions/try_decrypt_raw.md) | Error-handling version of DECRYPT_RAW. |

---
title: EVALUATE_CANDIDATE_NETWORK_POLICY
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/evaluate_candidate_network_policy.md
section: SQL General Reference
---

# EVALUATE_CANDIDATE_NETWORK_POLICY

Simulates the effect of applying a candidate network policy against historical ingress traffic, without
activating the policy.

Analyzing the output enables administrators to answer the following questions:

* What would this policy have blocked?
* Would legitimate users be affected?

The procedure evaluates all observed ingress client IPs and produces a row-level what-if result.
It doesn’t modify account configuration.

See also:
:   [RECOMMEND_NETWORK_POLICY](recommend_network_policy.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NETWORK_SECURITY.EVALUATE_CANDIDATE_NETWORK_POLICY(
  POLICY_NAME => '<string>'
  [, LOOKBACK_DAYS => <integer> ]
  [, USER_NAME => <string> ])
```

## Arguments

**Required:**

`POLICY_NAME => 'string'`
:   The name of the candidate network policy to evaluate.

**Optional:**

`LOOKBACK_DAYS => 'integer'`
:   The number of days of historical ingress traffic to evaluate against. Controls how far back
    the simulation looks.

    Default: 90

`USER_NAME => 'string'`
:   Filters the evaluation to include only traffic from the specified user.

    Default: No filter; all users are included.

## Returns

Returns a table with (at minimum) the following columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `ACCESS_CLIENT_IP` | VARCHAR | The client IP address observed in historical ingress traffic. |
| `IS_ALLOWED` | VARCHAR | Whether the IP would be allowed (`YES`) or blocked (`NO`) if the candidate policy were activated. |

**Interpretation:**

* `YES` — This IP *would be allowed* if the policy were activated.
* `NO` — This IP *would be blocked* if the policy were activated.

The evaluation results don’t activate the policy. You must activate the recommended network policy if you want to enforce it, by running the
[ALTER ACCOUNT](../sql/alter-account.md) command. For an example, see step 8 in [Generate and evaluate a candidate network policy](../../user-guide/network-policy-advisor.md).

# Access control requirements

A user must have the SECURITYADMIN role at a minimum to run this stored procedure.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The procedure is read-only with respect to account configuration. It doesn’t activate or modify
  any network policies.
* This procedure can’t determine which IP addresses are correct or safe for your organization.
  You must validate results with your IT and security teams before activating the policy.
* Execution time might be 1-2 minutes for accounts with large amounts of historical ingress access data.
* Evaluation results might be dense for high-traffic accounts and might require filtering or
  visualization.
* Each row in the output represents a decision point that administrators should review.

## Examples

Evaluate a candidate network policy using the default lookback window:

```sqlexample
USE ROLE SECURITYADMIN;

CALL SNOWFLAKE.NETWORK_SECURITY.EVALUATE_CANDIDATE_NETWORK_POLICY(
  POLICY_NAME => 'MY_INGRESS_POLICY'
  );
```

Evaluate a candidate network policy against the last 90 days of ingress traffic:

```sqlexample
USE ROLE SECURITYADMIN;

CALL SNOWFLAKE.NETWORK_SECURITY.EVALUATE_CANDIDATE_NETWORK_POLICY(
  POLICY_NAME => 'MY_INGRESS_POLICY',
  LOOKBACK_DAYS => 90
  );
```

Evaluate a candidate network policy against the last 90 days of ingress traffic for a user named `user1`:

```sqlexample
USE ROLE SECURITYADMIN;

CALL SNOWFLAKE.NETWORK_SECURITY.EVALUATE_CANDIDATE_NETWORK_POLICY(
  POLICY_NAME => 'MY_INGRESS_POLICY',
  LOOKBACK_DAYS => 90,
  USER_NAME => 'user1'
  );
```

---
title: EVENTS_VIEW view
source: https://docs.snowflake.com/en/sql-reference/telemetry/events_view.md
section: SQL General Reference
---

# EVENTS_VIEW view

This view displays rows for telemetry data collected in the [default event table](../../developer-guide/logging-tracing/event-table-setting-up.md),
SNOWFLAKE.TELEMETRY.EVENTS.

You can manage access to this view with row access policies. To manage row access policies you create with this view, use the following stored
procedures:

* [ADD_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](../stored-procedures/snowflake_telemetry_add_row_access_policy_on_events_view.md)
* [DROP_ROW_ACCESS_POLICY_ON_EVENTS_VIEW](../stored-procedures/snowflake_telemetry_drop_row_access_policy_on_events_view.md)

## Columns

Columns in this view correspond to columns in an event table you create. For more information, see
[Event table columns](../../developer-guide/logging-tracing/event-table-columns.md).

| Column Name | Data Type | Description |
| --- | --- | --- |
| TIMESTAMP | TIMESTAMP_NTZ | Timestamp when the event record was added. See [TIMESTAMP column](../../developer-guide/logging-tracing/event-table-columns.md). |
| START_TIMESTAMP | TIMESTAMP_NTZ | Event period starting timestamp for metrics and spans. See [START_TIMESTAMP column](../../developer-guide/logging-tracing/event-table-columns.md). |
| OBSERVED_TIMESTAMP | TIMESTAMP_NTZ | A log’s UTC timestamp. Used when capturing logs that do not have an accompanying timestamp. See [OBSERVED_TIMESTAMP column](../../developer-guide/logging-tracing/event-table-columns.md). |
| TRACE | OBJECT | Tracing context. See [TRACE column](../../developer-guide/logging-tracing/event-table-columns.md). |
| RESOURCE | OBJECT | For future use. See [RESOURCE column](../../developer-guide/logging-tracing/event-table-columns.md). |
| RESOURCE_ATTRIBUTES | OBJECT | Attributes that identify the source of an event. See [RESOURCE_ATTRIBUTES column](../../developer-guide/logging-tracing/event-table-columns.md). |
| SCOPE | OBJECT | Scope for signals. See [SCOPE column](../../developer-guide/logging-tracing/event-table-columns.md). |
| SCOPE_ATTRIBUTES | OBJECT | For future use. See [SCOPE_ATTRIBUTES column](../../developer-guide/logging-tracing/event-table-columns.md). |
| RECORD_TYPE | VARCHAR | Type of the value in the RECORD field. See [RECORD_TYPE column](../../developer-guide/logging-tracing/event-table-columns.md). |
| RECORD | OBJECT | Fixed fields for each signal type. See [RECORD column](../../developer-guide/logging-tracing/event-table-columns.md). |
| RECORD_ATTRIBUTES | OBJECT | Variable attributes for each signal type. See [RECORD_ATTRIBUTES column](../../developer-guide/logging-tracing/event-table-columns.md). |
| VALUE | VARIANT | Primary event value. See [VALUE column](../../developer-guide/logging-tracing/event-table-columns.md). |
| EXEMPLARS | ARRAY | Exemplars for metrics. See [EXEMPLARS column](../../developer-guide/logging-tracing/event-table-columns.md). |

---
title: EXCEPTION (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/exception.md
section: SQL General Reference
---

# EXCEPTION (Snowflake Scripting)

Specifies how to handle exceptions raised in the Snowflake Scripting block.

For more information on exceptions, see [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md).

See also:
:   [RAISE](raise.md)

## Syntax

```sqlsyntax
EXCEPTION
    WHEN <exception_name> [ OR <exception_name> ... ] [ { EXIT | CONTINUE } ] THEN
        <statement>;
        [ <statement>; ... ]
    [ WHEN ... ]
    [ WHEN OTHER [ { EXIT | CONTINUE } ] THEN ]
        <statement>;
        [ <statement>; ... ]
```

Where:

> `exception_name`
> :   An exception name defined in the
>     [DECLARE portion of the current block](../../developer-guide/snowflake-scripting/variables.md),
>     or in an enclosing block.
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).

## Usage notes

* Each [block](../../developer-guide/snowflake-scripting/blocks.md) can have its own exception handler.
* Snowflake supports no more than one exception handler per block. However, that handler can catch more than one type
  of exception by having more than one `WHEN` clause.
* The `WHEN OTHER [ { EXIT | CONTINUE } ] THEN` clause catches any exception not yet specified.
* An exception handler applies to statements between the BEGIN and EXCEPTION sections of the block in which
  it is declared. It does’t apply to the DECLARE section of the block.
* An exception handler can handle a specified exception only if that specified exception is in
  [scope](../../developer-guide/snowflake-scripting/variables.md).
* If a stored procedure is intended to return a value, then it should return a value from each possible exit path,
  including each `WHEN` clause of `EXIT` type in the exception handler.
* To use a variable in an exception handler, the variable must be declared in the
  [DECLARE](declare.md) section or passed as an argument to a
  stored procedure. It can’t be declared in the [BEGIN … END](begin.md)
  section. For more information, see [Passing variables to an exception handler in Snowflake Scripting](../../developer-guide/snowflake-scripting/exceptions.md).
* When an exception occurs, the handler conditions are checked in order and the first `WHEN` clause that
  matches is used. The order within a block is top to bottom, and the inner blocks are checked before the outer
  blocks. There is no preference in matching `EXIT` or `CONTINUE` handlers, whichever matches first is used.
* Only one handler can be matched for a statement. However, any exceptions encountered inside of an exception
  handler body can trigger outer block exception handlers.
* Each `WHEN` clause in an exception handler can be one of the following types:

  + `EXIT` - The block runs the statements in the handler and then exits the current block. If the block runs an
    exception of this type, and the block contains statements after the exception handler, those statements
    aren’t run.

    If the block is an inner block, and the exception handler doesn’t contain a `RETURN` statement, then
    execution exits the inner block and continues with the code in the outer block.

    `EXIT` is the default.
  + `CONTINUE` - The block executes the statements in the handler and continues with the statement
    immediately following the one that caused the error.

  An `EXCEPTION` clause can have `WHEN` clauses of both types — `EXIT` and `CONTINUE`.

  For a `WHEN` clause of the `CONTINUE` type, the following usage notes apply:

  + If an error is raised in a [branching construct](../../developer-guide/snowflake-scripting/branch.md),
    then the continuing statement is the statement immediately after the branching construct.
  + If an error is raised in the condition of a [loop](../../developer-guide/snowflake-scripting/loops.md), then
    the continuing statement is the statement immediately after the loop.
  + If an error is raised in the body of a loop, then the continuing statement is the statement in the next iteration
    of the loop. For an example, see Handle an exception and continue.
  + If an error is raised in a [RETURN](return.md) statement, then the
    continuing statement is the statement immediately after the `RETURN` statement.
  + If an error is raised in a
    [nested stored procedure](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md) and the error
    is handled by the outer scope, then the continuing statement is the statement immediately after the stored
    procedure call.
  + Avoid including a `RETURN` statement in a `WHEN` clause of the `CONTINUE` type. If you include a
    `RETURN` statement, then the stored procedure returns without continuing.

  For a `WHEN` clause of the `CONTINUE` type, the following examples show which statement is the statement
  immediately following the one that caused the error for different scenarios. In these examples, the
  `error_expression` is the expression that raised the exception, and the `continue_statement` is the
  statement that the code continues with in the block after the `CONTINUE` handler statements.

  ```sqlexample
  DECLARE
    ...
  BEGIN
    ...
    LET a := <error_expression>;
    <continue_statement>;
    ...
  EXCEPTION
    WHEN <exception_name> CONTINUE THEN
      ...
  END;
  ```

  ```sqlexample
  LET x := <valid_expression>;
  x := <error_expression>;
  <continue_statement>
  ```

  ```sqlexample
  SELECT <statement> INTO <error_expression>;
  <continue_statement>;
  ```

  ```sqlexample
  IF (<error_expression>) THEN
    <statement>
  ELSEIF (<valid_expression>) THEN
    <statement>
  ELSE
    <statement>
  END IF;
  <continue_statement>;
  ```

  ```sqlexample
  CASE (<error_expression>)
    WHEN (<valid_expression>) THEN
      <statement>
    ELSE
      <statement>
  END CASE;
  <continue_statement>
  ```

  ```sqlexample
  CASE (<valid_expression>)
    WHEN (<error_expression>) THEN
      <statement>
    WHEN (<valid_expression>) THEN
      <statement>
    ELSE
      <statement>
  END CASE;
  <continue_statement>
  ```

  ```sqlexample
  FOR i IN <valid_expression> TO <error_expression> DO
    <statement>
  END FOR
  <continue_statement>
  ```

  ```sqlexample
  WHILE <error_expression> DO
    <statement>
  END WHILE;
  <continue_statement>
  ```

  ```sqlexample
  REPEAT
    <statement>
  UNTIL <error_expression>;
  <continue_statement>
  ```

  ```sqlexample
  RETURN <error_expression>;
  <continue_statement>
  ```

  ```sqlexample
  DECLARE
    x int := 0;
    myproc PROCEDURE()
      RETURNS STRING
      AS BEGIN
        x := <error_expression>;
        <statement>
      END;
  BEGIN
    CALL myproc();
    <continue_statement>
    ...
  END;
  ```

## Examples

The following examples declare and raise an exceptions, and handle the exceptions
with exception handlers:

* Handle exceptions of more than one type
* Handle an exception and continue
* Handle exceptions in nested blocks
* Handle multiple exceptions in the same clause and unspecified exceptions
* Handle exceptions by using built-in variables

### Handle exceptions of more than one type

The following example shows an exception handler that is designed to handle more than one type of exception:

```sqlexample
DECLARE
  result VARCHAR;
  exception_1 EXCEPTION (-20001, 'I caught the expected exception.');
  exception_2 EXCEPTION (-20002, 'Not the expected exception!');
BEGIN
  result := 'If you see this, I did not catch any exception.';
  IF (TRUE) THEN
    RAISE exception_1;
  END IF;
  RETURN result;
EXCEPTION
  WHEN exception_2 THEN
    RETURN SQLERRM;
  WHEN exception_1 THEN
    RETURN SQLERRM;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  result VARCHAR;
  exception_1 EXCEPTION (-20001, 'I caught the expected exception.');
  exception_2 EXCEPTION (-20002, 'Not the expected exception!');
BEGIN
  result := 'If you see this, I did not catch any exception.';
  IF (TRUE) THEN
    RAISE exception_1;
  END IF;
  RETURN result;
EXCEPTION
  WHEN exception_2 THEN
    RETURN SQLERRM;
  WHEN exception_1 THEN
    RETURN SQLERRM;
END;
$$;
```

The output shows that the exception handler caught the exception:

```output
+----------------------------------+
| anonymous block                  |
|----------------------------------|
| I caught the expected exception. |
+----------------------------------+
```

### Handle an exception and continue

The following example shows an exception handler with a `WHEN` clause of the `CONTINUE` type:

```sqlexample
DECLARE
  exception_1 EXCEPTION (-20001, 'Catch and continue');
BEGIN
  LET counter := 0;
  IF (TRUE) THEN
    RAISE exception_1;
  END IF;
  counter := counter + 10;
  RETURN 'Counter value: ' || counter;
EXCEPTION
  WHEN exception_1 CONTINUE THEN
    counter := counter +1;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  exception_1 EXCEPTION (-20001, 'Catch and continue');
BEGIN
  LET counter := 0;
  IF (TRUE) THEN
    RAISE exception_1;
  END IF;
  counter := counter + 10;
  RETURN 'Counter value: ' || counter;
EXCEPTION
  WHEN exception_1 CONTINUE THEN
    counter := counter +1;
END;
$$;
```

The output shows that the exception handler caught the exception, executed a statement that added
`1` to the counter, and then executed the next statement after the exception was caught, which
added `10` to the counter:

```output
+-------------------+
| anonymous block   |
|-------------------|
| Counter value: 11 |
+-------------------+
```

The following example shows how an exception handler with a `WHEN` clause of the `CONTINUE` type works
when an error is raised in a loop. The example raises an error on the first iteration because it tries to
divide the value `10` by zero. The `CONTINUE` handler logs the error in the `error_log_table`, and the block
continues with the next iteration of the loop, which divides `10` by `1`. The loop continues to iterate until
`10` is divided by `5` and the loop ends. The output is `2`:

```sqlexample
CREATE TABLE error_log_table (handler_type VARCHAR, error_message VARCHAR);

DECLARE
  x INT := 0;
BEGIN
  FOR i IN 0 TO 5 DO
    x := 10/i;
  END FOR;
  RETURN x;
EXCEPTION
  WHEN EXPRESSION_ERROR CONTINUE THEN
    INSERT INTO error_log_table SELECT 'continue_type', :SQLERRM;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
CREATE TABLE error_log_table (handler_type VARCHAR, error_message VARCHAR);

EXECUTE IMMEDIATE $$
DECLARE
  x INT := 0;
BEGIN
  FOR i IN 0 TO 5 DO
    x := 10/i;
  END FOR;
  RETURN x;
EXCEPTION
  WHEN EXPRESSION_ERROR CONTINUE THEN
    INSERT INTO error_log_table SELECT 'continue_type', :SQLERRM;
END;
$$;
```

```output
+-----------------+
| anonymous block |
|-----------------|
|               2 |
+-----------------+
```

### Handle exceptions in nested blocks

This following example demonstrates nested blocks, and shows that an inner block
can raise an exception declared in either the inner block or in an outer block:

```sqlexample
DECLARE
  e1 EXCEPTION (-20001, 'Exception e1');
BEGIN
  -- Inner block.
  DECLARE
    e2 EXCEPTION (-20002, 'Exception e2');
    selector BOOLEAN DEFAULT TRUE;
  BEGIN
    IF (selector) THEN
      RAISE e1;
    ELSE
      RAISE e2;
    END IF;
  END;
EXCEPTION
  WHEN e1 THEN
    RETURN SQLERRM || ' caught in outer block.';
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  e1 EXCEPTION (-20001, 'Exception e1');
BEGIN
  -- Inner block.
  DECLARE
    e2 EXCEPTION (-20002, 'Exception e2');
    selector BOOLEAN DEFAULT TRUE;
  BEGIN
    IF (selector) THEN
      RAISE e1;
    ELSE
      RAISE e2;
    END IF;
  END;
EXCEPTION
  WHEN e1 THEN
    RETURN SQLERRM || ' caught in outer block.';
END;
$$;
```

The output shows that the exception handler caught the exception:

```output
+-------------------------------------+
| anonymous block                     |
|-------------------------------------|
| Exception e1 caught in outer block. |
+-------------------------------------+
```

This following example is similar to the previous example, but demonstrates nested blocks, each of which has its
own exception handler:

```sqlexample
DECLARE
  result VARCHAR;
  e1 EXCEPTION (-20001, 'Outer exception e1');
BEGIN
  result := 'No error so far (but there will be).';
  DECLARE
    e1 EXCEPTION (-20101, 'Inner exception e1');
  BEGIN
    RAISE e1;
  EXCEPTION
    WHEN e1 THEN
      result := 'Inner exception raised.';
      RETURN result;
  END;
  RETURN result;
EXCEPTION
  WHEN e1 THEN
    result := 'Outer exception raised.';
    RETURN result;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  result VARCHAR;
  e1 EXCEPTION (-20001, 'Outer exception e1');
BEGIN
  result := 'No error so far (but there will be).';
  DECLARE
    e1 EXCEPTION (-20101, 'Inner exception e1');
  BEGIN
    RAISE e1;
  EXCEPTION
    WHEN e1 THEN
      result := 'Inner exception raised.';
      RETURN result;
  END;
  RETURN result;
EXCEPTION
  WHEN e1 THEN
    result := 'Outer exception raised.';
    RETURN result;
END;
$$;
```

> **Note:**
>
> This example uses the same exception name (`e1`) in the outer and inner blocks, which isn’t recommended.
>
> The example does this to illustrate the [scope](../../developer-guide/snowflake-scripting/variables.md) of exception names. The two exceptions with the
> name `e1` are different exceptions.
>
> The `e1` handler in the outer block doesn’t handle the exception e1 that is declared and raised in the inner block.

The output shows that the inner exception handler ran:

```output
+-------------------------+
| anonymous block         |
|-------------------------|
| Inner exception raised. |
+-------------------------+
```

### Handle multiple exceptions in the same clause and unspecified exceptions

The following example fragment shows how to perform two tasks:

* Catch more than one exception in the same clause by using `OR`.
* Catch unspecified exceptions by using `WHEN OTHER THEN`.

```sqlexample
EXCEPTION
  WHEN MY_FIRST_EXCEPTION OR MY_SECOND_EXCEPTION OR MY_THIRD_EXCEPTION THEN
    RETURN 123;
  WHEN MY_FOURTH_EXCEPTION THEN
    RETURN 4;
  WHEN OTHER THEN
    RETURN 99;
```

### Handle exceptions by using built-in variables

The following example shows how to return SQLCODE, SQLERRM (SQL error message), and SQLSTATE
[built-in variable values](../../developer-guide/snowflake-scripting/exceptions.md) when catching an exception:

```sqlexample
DECLARE
  MY_EXCEPTION EXCEPTION (-20001, 'Sample message');
BEGIN
  RAISE MY_EXCEPTION;
EXCEPTION
  WHEN STATEMENT_ERROR THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'STATEMENT_ERROR',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
  WHEN EXPRESSION_ERROR THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'EXPRESSION_ERROR',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
  WHEN OTHER THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'Other error',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  MY_EXCEPTION EXCEPTION (-20001, 'Sample message');
BEGIN
  RAISE MY_EXCEPTION;
EXCEPTION
  WHEN STATEMENT_ERROR THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'STATEMENT_ERROR',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
  WHEN EXPRESSION_ERROR THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'EXPRESSION_ERROR',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
  WHEN OTHER THEN
    RETURN OBJECT_CONSTRUCT('Error type', 'Other error',
                            'SQLCODE', SQLCODE,
                            'SQLERRM', SQLERRM,
                            'SQLSTATE', SQLSTATE);
END;
$$;
```

Running this example produces the following output:

```output
+--------------------------------+
| anonymous block                |
|--------------------------------|
| {                              |
|   "Error type": "Other error", |
|   "SQLCODE": -20001,           |
|   "SQLERRM": "Sample message", |
|   "SQLSTATE": "P0001"          |
| }                              |
+--------------------------------+
```

---
title: Expansion operators
source: https://docs.snowflake.com/en/sql-reference/operators-expansion.md
section: SQL General Reference
---

# Expansion operators

Expansion operators expand a query expression that represents a list into the individual values in
the list. Currently, the spread operator (`**`) is the only expansion operator supported by Snowflake.

## Spread

The spread operator expands an [array](data-types-semistructured.md) into a list of individual values. This
operator is useful for the following use cases:

* Queries containing [IN clauses](functions/in.md).
* Calls to system-defined functions that take a list of values as arguments, such as
  [COALESCE](functions/coalesce.md), [GREATEST](functions/greatest.md), and
  [LEAST](functions/least.md).
* SQL user-defined [functions](../developer-guide/udf/sql/udf-sql-introduction.md) that use an argument
  to provide an array of values.
* Snowflake Scripting [stored procedures](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md)
  that use a bind variable to provide an array of values. For more information about using bind variables
  in Snowflake Scripting, see [Using a variable in a SQL statement (binding)](../developer-guide/snowflake-scripting/variables.md) and
  [Using an argument in a SQL statement (binding)](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

For more information about these use cases, see the
[Snowflake Introduces SQL Spread Operator (\*\*)](https://www.snowflake.com/en/engineering-blog/sql-spread-operator/)
blog post.

### Syntax

```sqlsyntax
** <array>
```

### Limitations

* The input must be an array of constant values, which can be an array of literal values or a bind variable that represents
  an array of literal values.
* Each value in a semi-structured array is of type [VARIANT](data-types-semistructured.md). A VARIANT value can
  contain a value of any other data type. The spread operator supports the following data types for the value
  stored in the VARIANT value:

  + [Numeric](data-types-numeric.md) (for example, INTEGER and NUMERIC)
  + [String & binary](data-types-text.md) (for example, VARCHAR and BINARY)
  + [Logical](data-types-logical.md) (for example, BOOLEAN)
  + [Date & time](data-types-datetime.md) (for example, DATE, TIME, and TIMESTAMP)
* User-defined functions and stored procedures written in languages other than SQL can’t use the
  spread operator.
* Expanding very large arrays with the spread operator might degrade performance.

### Examples

Some of the examples use the data the following table:

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE spread_demo (col1 INT, col2 VARCHAR);

INSERT INTO spread_demo VALUES
  (1, 'a'),
  (2, 'b'),
  (3, 'c'),
  (4, 'd'),
  (5, 'e');

SELECT * FROM spread_demo;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    1 | a    |
|    2 | b    |
|    3 | c    |
|    4 | d    |
|    5 | e    |
+------+------+
```

The following examples use the spread operator.

* Expand an array of literal values in an IN clause
* Expand an array of literal values in a system-defined function call
* Use the spread operator with a bind variable in a SQL user-defined function
* Use the spread operator with a bind variable in a Snowflake Scripting stored procedure

#### Expand an array of literal values in an IN clause

Expand an array of numbers using the spread operator in a query on the `spread_demo` table
created previously:

```sqlexample
SELECT * FROM spread_demo
  WHERE col1 IN (** [3, 4])
  ORDER BY col1;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    3 | c    |
|    4 | d    |
+------+------+
```

Expand an array of strings using the spread operator:

```sqlexample
SELECT * FROM spread_demo
  WHERE col2 IN (** ['b', 'd'])
  ORDER BY col1;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    2 | b    |
|    4 | d    |
+------+------+
```

Use an IN clause in a query with a mix of INTEGER values and expanded array values:

```sqlexample
SELECT * FROM spread_demo
  WHERE col1 IN (** [1, 2], 4, 5)
  ORDER BY col1;
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    1 | a    |
|    2 | b    |
|    4 | d    |
|    5 | e    |
+------+------+
```

#### Expand an array of literal values in a system-defined function call

Expand an array of strings in a call to the COALESCE function:

```sqlexample
SELECT COALESCE(** [NULL, NULL, 'my_string_1', 'my_string_2']) AS first_non_null;
```

```output
+----------------+
| FIRST_NON_NULL |
|----------------|
| my_string_1    |
+----------------+
```

Expand an array of numbers in a call to the GREATEST function:

```sqlexample
SELECT GREATEST(** [1, 2, 5, 4, 5]) AS greatest_value;
```

```output
+----------------+
| GREATEST_VALUE |
|----------------|
|              5 |
+----------------+
```

#### Use the spread operator with a bind variable in a SQL user-defined function

Create a SQL user-defined function that uses the spread operator. The function takes an array as
an argument and then expands the array values to query the `spread_demo` table
created previously:

```sqlexample
CREATE OR REPLACE FUNCTION spread_function_demo(col_1_values ARRAY)
  RETURNS TABLE(
    col1 INT,
    col2 VARCHAR)
AS
$$
   SELECT * FROM spread_demo
     WHERE col1 IN (** col_1_values)
     ORDER BY col1
$$;
```

Query the table using the function:

```sqlexample
SELECT * FROM TABLE(spread_function_demo([1, 3, 5]));
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    1 | a    |
|    3 | c    |
|    5 | e    |
+------+------+
```

#### Use the spread operator with a bind variable in a Snowflake Scripting stored procedure

Create a Snowflake Scripting stored procedure that uses the spread operator. The stored procedure takes
an array as an argument and then expands the array values in a bind variable to query the `spread_demo`
table created previously:

```sqlexample
CREATE OR REPLACE PROCEDURE spread_sp_demo(col_1_values ARRAY)
  RETURNS TABLE(
    col1 INT,
    col2 VARCHAR)
  LANGUAGE SQL
AS
$$
DECLARE
  res RESULTSET;
BEGIN
  res := (SELECT * FROM spread_demo
     WHERE col1 IN (** :col_1_values)
     ORDER BY col1);
  RETURN TABLE(res);
END;
$$;
```

Call the stored procedure:

```sqlexample
CALL spread_sp_demo([2, 4]);
```

```output
+------+------+
| COL1 | COL2 |
|------+------|
|    2 | b    |
|    4 | d    |
+------+------+
```

---
title: EXTENSIONS view
source: https://docs.snowflake.com/en/sql-reference/trust_center/extensions.md
section: SQL General Reference
---

Schema:
:   [TRUST_CENTER](../trust_center.md)

# EXTENSIONS view

The SNOWFLAKE.TRUST_CENTER.EXTENSIONS view displays a row for each extension that is registered with
the Trust Center.

For more information, see [Using Trust Center extensions](../../user-guide/trust-center/trust-center-extensions.md).

The view has the following columns:

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| NAME | VARCHAR | The name of the extension. |
| ID | NUMBER | The identifier of the extension. |
| SOURCE_TYPE | VARCHAR | The source type of the extension:   * `APPLICATION PACKAGE` * `LISTING` |
| SOURCE | VARCHAR | If the source type is `APPLICATION PACKAGE`, the name of the application package. If the source type is `LISTING`, the identifier of the listing. |
| VERSION | VARCHAR | The version of the Snowflake Native App extension that is registered with the Trust Center. |
| PATCH | NUMBER | The patch number of the Native App extension that is registered with the Trust Center. |
| PREVIOUS_VERSION | VARCHAR | The previous version of the Native App extension that is registered with the Trust Center. |
| PREVIOUS_PATCH | NUMBER | The previous patch number of the Native App extension registered with the Trust Center. |
| REGISTRATION_STATE | VARCHAR | The state of the registration of the extension with the Trust Center:   * `COMPLETED` * `IN_PROGRESS` * `FAILED` |
| REGISTRATION_TARGET_VERSION | VARCHAR | The version of the Native App extension that is in the process of being registered with the Trust Center. |
| REGISTRATION_TARGET_PATCH | NUMBER | The patch number of the Native App extension that is in the process of being registered with the Trust Center. |
| REGISTRATION_ATTEMPTED_ON | TIMESTAMP_LTZ | The timestamp of the last attempted registration. |
| REGISTRATION_FAILURE_REASON | VARCHAR | If the last attempted registration failed, the reason for the failure. Otherwise, NULL. |
| REGISTERED_TIMESTAMP | TIMESTAMP_LTZ | The timestamp for when the extension was registered with the Trust Center for the first time. |

---
title: External functions best practices
source: https://docs.snowflake.com/en/sql-reference/external-functions-best-practices.md
section: SQL General Reference
---

# External functions best practices

This topic documents best practices that will improve efficiency and prevent unexpected results that could occur
if a remote service is not designed to be compatible with Snowflake.

You can find additional best practices in the following documents:

* [Optimize the performance and reliability of Azure Functions](https://docs.microsoft.com/en-us/azure/azure-functions/functions-best-practices)

  (Although this is a Microsoft Azure document, much of the advice in it applies to remote services on
  any cloud platform.)

## Use a remote service’s batch API if available

Some remote services offer both batch mode and single-row mode. If the queries that use an external function are
expected to send multiple rows, then Snowflake recommends using the batch mode of the remote service to improve
performance.

This rule does not necessarily apply if:

* Each row is very large (e.g. hundreds of kilobytes or more).
* The remote service processes rows differently if they are received in batches than if they are received
  individually. (For details, see Process one row at a time.)

## Process one row at a time

To minimize networking overhead, Snowflake typically batches rows to send to remote services. The number of batches
and the size of each batch can vary.

In addition, the order of batches can vary, and the order of rows within a batch can vary. Even if the query contains
an ORDER BY clause, the ORDER BY is usually applied after the external function(s) have been called.

Because batch size and row order are not guaranteed, writing a function that returns a value for a row
that depends upon any other row in this batch or previous batches can produce non-deterministic results.

Snowflake strongly recommends that the remote service process each row independently.
The return value for each input row should depend on only that input row, not on other input rows. (Currently,
external functions do not support [window functions](functions-window.md), for example.)

Note also that because batch size is not guaranteed, counting batches is not meaningful.

See also Ensure your external function is stateless.

## Do not assume that the remote service is passed each row exactly once

If Snowflake calls a remote service, and the remote service receives the request and returns a result, but Snowflake
does not receive the result due to a temporary network problem, Snowflake might repeat the request. If Snowflake
retries, the remote service might see the same row twice (or more).

This can cause unexpected effects. For example, because the remote service might get called more than once for the
same value, a remote service that assigns unique IDs might have gaps in the sequence of those IDs. In some cases,
such effects can be reduced by tracking the batch ID in the `sf-external-function-query-batch-id` field of the
request header to determine whether a particular batch of rows has been processed previously. When Snowflake retries a
request for a specific batch, Snowflake uses the same batch ID as it used earlier for the same batch.

Snowflake retries when it receives the following errors:

* All transient network transport errors.
* All requests that fail with 429 status code.
* All requests that fail with 5XX status code.

Requests are retried until a total retry timeout is reached. The total retry timeout is not user-configurable.
Snowflake might adjust this limit in the future.

When the total retry timeout is reached without a successful retry, the query fails.

If your external function call times out when the remote service is working, and all the elements between Snowflake
and the remote service seem to be working, you can try a smaller batch size to see if that reduces the timeout errors.

To learn how to set the maximum batch size, see [CREATE EXTERNAL FUNCTION](sql/create-external-function.md).

## Ensure your external function is stateless

In general, an external function (including the remote service) should avoid storing state information, both:

* Internal state (state that the remote service stores internally).
* External state (state stored outside the remote service, for example state information sent to and/or read from
  another remote service that itself retains state).

If the remote service changes state information and then uses that information to affect future outputs, the function
might return different values than expected.

For example, consider a simple remote service that contains an internal counter and returns the number of rows
received since the remote service first started. If there is a temporary network problem, and Snowflake repeats
a request with the same data, the remote service will count the re-sent rows twice (or more).

For an example involving external state, see Avoid side-effects.

In the rare cases where a function is not stateless, the documentation for callers should say clearly that the
function is not stateless, and the function should be marked volatile.

If a remote service handles requests
[asynchronously](external-functions-implementation.md), then the remote service
author must write the remote service to store and manage some state temporarily. For example, the remote service must
store the HTTP POST request’s batch ID so that if an HTTP GET is received with the same batch ID, the remote service
can return HTTP code 202 when the specified batch is still being processed.

Note that a query can be aborted for various reasons, which means that there is no guarantee that a final GET will
arrive after the remote service has finished generating a result. Remote services that store state for asynchronous
requests should eventually time out and clean up that internal state. The optimal timeout might change in the future,
but currently Snowflake recommends preserving information about asynchronous requests for at least 10 minutes
and preferably 12 hours before deleting it.

## Avoid side-effects

An external function (including the remote service) should avoid side effects, such as changing external
state (information stored outside the remote service).

For example, if the remote service reports out-of-range values to a government agency, that is a side effect.

Side-effects can be useful, but the side-effects of calling an external function are not always predictable.
For example, suppose that you call a remote service that analyzes an anonymized health record and returns a diagnosis.
Suppose also that if the diagnosis is that the patient has a contagious disease, then the diagnosis is reported to an
agency that keeps count of the number of cases of that disease. This is a useful side effect. However, it is
vulnerable to problems such as:

* If an external function call is inside a transaction that is rolled back, the side effects are not rolled back.
* If the remote service is called more than once with the same row (e.g. due to temporary network failures and
  retries), the side-effect could occur more than once. For example, an infected patient might be counted twice
  in the statistics.

There are also situations in which rows could be undercounted rather than overcounted.

In the very rare cases where a function has side effects, the documentation for callers should say clearly what the
side effects are, and the function should be marked volatile.

## Categorize your function as volatile or immutable

Functions can be categorized as volatile or immutable. (The [CREATE EXTERNAL FUNCTION](sql/create-external-function.md)
statement allows the user to specify whether the function is volatile or immutable.)

For an external function to be considered immutable, it should meet the following criteria:

* If given the same input value, the function returns the same output value. (For example, the SQRT function returns
  the same output when given the same input, but the CURRENT_TIMESTAMP function does not necessarily return the same
  output when given the same input.)
* The function has no side effects. (For details, see Avoid side-effects.)

If a function meets these two criteria, then Snowflake can use certain types of optimizations to reduce the number
of rows or batches sent to the remote service. (These optimizations might evolve over time, and are not described in
detail here.)

Snowflake cannot detect or enforce immutability, or factors that affect immutability (for example,
side effects). The writer of a remote service should document whether the remote service meets the criteria
to be labeled immutable. If a remote service has side effects, then the external function that calls that remote
service should be marked volatile, even if the function call returns the same output value for the same input value.
If you are not certain that a remote service is immutable, then any external function that calls that remote service
should be labeled volatile.

## Account for timeout errors

An external function call involves Snowflake, a remote service, a proxy service, and potentially other elements
in the chain. None of these elements know how long a particular function call should take, so none know exactly
when to stop waiting and return a timeout error. Each step might have its own independent timeout. For more
information about timeouts and retries, see Account for timeout errors and retries.

## Minimize latency

To minimize latency and improve performance of external function calls, Snowflake recommends doing the following
when practical:

* Put the API Gateway in the same cloud platform and region as Snowflake instances that call it most frequently (or
  with the largest amount of data).
* If you wrote the remote service (rather than using an existing service), deploy that remote service in
  the same cloud platform and region as it is called from.
* Send as little data as possible. For example, if the remote service will examine inputs values and
  operate on only a subset of them, then it is usually more efficient to filter in SQL and send only the
  relevant rows to the remote service, rather than send all rows to the remote service and let it filter.

  As another example, if you are processing a column that contains large semi-structured
  data values, and the remote service will operate on only a small piece of each of those data values, it is
  usually more efficient to extract the relevant piece using Snowflake SQL and send only that piece, rather
  than send the entire column and have the remote service do the extraction of the small piece before processing.

## Develop and test external functions one step at a time

Snowflake recommends that you test without Snowflake before testing with Snowflake.

During the early stages of developing an external function, use the cloud platform proxy service console (e.g. the
Amazon API Gateway console) and remote service development console (e.g. the AWS Lambda console) to help develop
and test the proxy service and remote service.

For example, if you have developed a Lambda function, you might want to test it extensively through the Lambda console
before testing it by calling it from Snowflake.

Testing through the proxy service console and remote service console usually has the following advantages:

* It can make diagnosing the problem easier because there are fewer places to look for the cause of the problem.
* Viewing the data payload might provide useful debugging information. Snowflake does not show any portion of
  the data payload in error messages; although this enhances security, it can slow debugging.
* Snowflake auto-retries HTTP 5xx errors, which can make debugging slower or more difficult in some situations.
* Testing through Snowflake consumes Snowflake credits in addition to cloud platform credits.

Of course, after you’ve tested the remote service and the proxy service as much as you can without Snowflake, you
should test them with Snowflake. The advantages of testing with Snowflake include:

* You’re testing all the steps involved in the external function.
* Using a Snowflake table as the data source makes it easy to test with large volumes of data to get a realistic
  estimate of the performance of the external function.

Consider the following test cases:

* NULL values.
* “Empty” values (for example, empty strings, empty semi-structured data types).
* Very long VARCHAR and BINARY values, if appropriate.

## Make your remote service asynchronous

If you are writing a remote service, and if your remote service might not return results within the expected timeout,
then consider making your remote service
[asynchronous](external-functions-implementation.md).
For details, see [Asynchronous vs. Synchronous remote services](external-functions-implementation.md).

## Ensure that arguments to the external function correspond to arguments parsed by the remote service

When passing arguments to or from an external function, ensure that the data types are appropriate. If the value
sent can’t fit into the data type being received, the value might be truncated or corrupted, or the remote service
call might fail.

For example, because some Snowflake SQL numeric data types can store larger values than commonly-used JavaScript
data types, de-serializing large numbers from JSON is particularly sensitive in JavaScript.

If you change the number, data types, or order of the arguments to the remote
service, remember to make the corresponding changes to the external function. Currently, the ALTER FUNCTION
command does not have an option to change parameters, so you must drop and re-create the external function to change
the arguments.

---
title: FETCH (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/fetch.md
section: SQL General Reference
---

# FETCH (Snowflake Scripting)

Uses the specified cursor to fetch one or more rows.

For more information on cursors, see [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [DECLARE](declare.md), [OPEN](open.md), [CLOSE](close.md)

## Syntax

```sqlsyntax
FETCH <cursor_name> INTO <variable> [, <variable> ... ] ;
```

Where:

> `cursor_name`
> :   The name of the cursor.
>
> `variable`
> :   The name of the variable into which to retrieve the value of one column of the current row.
>
>     You should have one variable for each column defined in the cursor declaration.
>
>     The variable must already have been [declared](../../developer-guide/snowflake-scripting/variables.md).
>
>     The variable’s data type must be compatible with the value to be fetched.

## Usage notes

* The number of `variable`s should match the number of expressions selected in the `SELECT` clause of
  the cursor declaration.
* If you try to `FETCH` a row after the last row, you get NULL values.
* A RESULTSET or CURSOR does not necessarily cache all the rows of the result set at the time that the query is executed.
  FETCH operations can experience latency.

## Examples

```sqlsyntax
FETCH my_cursor_name INTO my_variable_name ;
```

For a more complete example of using a cursor, see
[the introductory cursor example](../../developer-guide/snowflake-scripting/cursors.md).

An example using a loop is included in the documentation for [FOR loops](../../developer-guide/snowflake-scripting/loops.md).

---
title: File functions
source: https://docs.snowflake.com/en/sql-reference/functions-file.md
section: SQL General Reference
---

# File functions

File functions enable you to access files staged in cloud storage.

## List of functions

| Function Name | Notes |
| --- | --- |
| **Stages** |  |
| [GET_STAGE_LOCATION](functions/get_stage_location.md) | Returns the URL for an external or internal named stage using the stage name as the input. |
| [GET_RELATIVE_PATH](functions/get_relative_path.md) | Extracts the path of a staged file relative to its location in the stage using the stage name and absolute file path in cloud storage as inputs. |
| [GET_ABSOLUTE_PATH](functions/get_absolute_path.md) | Returns the absolute path of a staged file using the stage name and path of the file relative to its location in the stage as inputs. |
| [GET_PRESIGNED_URL](functions/get_presigned_url.md) | Generates the pre-signed URL to a staged file using the stage name and relative file path as inputs. Access files in an external stage using the function. |
| [BUILD_SCOPED_FILE_URL](functions/build_scoped_file_url.md) | Generates a scoped Snowflake file URL to a staged file using the stage name and relative file path as inputs. |
| [BUILD_STAGE_FILE_URL](functions/build_stage_file_url.md) | Generates a Snowflake file URL to a staged file using the stage name and relative file path as inputs. |
| **AI Functions** |  |
| [AI_COMPLETE](functions/ai_complete.md) | Generates a response (completion) from text or an image using a supported language model. |
| [AI_PARSE_DOCUMENT](functions/ai_parse_document.md) | Returns the extracted content from a document on a Snowflake stage as a JSON-formatted string. |
| [AI_TRANSCRIBE](functions/ai_transcribe.md) | Transcribes text from an audio file with optional timestamps and speaker labels. |

The following functions are for use with the FILE data type. For more information, see [Unstructured data types](data-types-unstructured.md).

| Sub-category | Function |
| --- | --- |
| Constructor | [TO_FILE](functions/to_file.md) |
|  | [TRY_TO_FILE](functions/try_to_file.md) |
| Accessors | [FL_GET_CONTENT_TYPE](functions/fl_get_content_type.md) |
|  | [FL_GET_ETAG](functions/fl_get_etag.md) |
|  | [FL_GET_FILE_TYPE](functions/fl_get_file_type.md) |
|  | [FL_GET_LAST_MODIFIED](functions/fl_get_last_modified.md) |
|  | [FL_GET_RELATIVE_PATH](functions/fl_get_relative_path.md) |
|  | [FL_GET_SCOPED_FILE_URL](functions/fl_get_scoped_file_url.md) |
|  | [FL_GET_SIZE](functions/fl_get_size.md) |
|  | [FL_GET_STAGE](functions/fl_get_stage.md) |
|  | [FL_GET_STAGE_FILE_URL](functions/fl_get_stage_file_url.md) |
| Utility Functions | [FL_IS_AUDIO](functions/fl_is_audio.md) |
|  | [FL_IS_COMPRESSED](functions/fl_is_compressed.md) |
|  | [FL_IS_DOCUMENT](functions/fl_is_document.md) |
|  | [FL_IS_IMAGE](functions/fl_is_image.md) |
|  | [FL_IS_VIDEO](functions/fl_is_video.md) |

## Usage notes

* GET_PRESIGNED_URL and BUILD_SCOPED_FILE_URL are non-deterministic functions; the others are deterministic.

---
title: Flow operators
source: https://docs.snowflake.com/en/sql-reference/operators-flow.md
section: SQL General Reference
---

# Flow operators

Flow operators chain SQL statements together, where the results of one statement serve as the input to another statement.
Currently, the pipe operator (`->>`) is the only flow operator supported by Snowflake.

## Pipe

Pipe operators are similar to Unix pipes (`|`) on the command line, but for SQL statements instead of Unix
commands. To use the pipe operator, specify a series of SQL statements separated by the operator. You can specify any
valid SQL statement, such as SHOW, SELECT, CREATE, INSERT, and so on. After the first SQL statement, each
subsequent statement can take the results of any previous statement as input. In the FROM clause, a previous SQL
statement is referenced by a parameter with the dollar sign (`$`) and the pipe number, which is the relative
position of the statement in the chain counting back from the current statement.

The pipe operator chains the following series of SQL statements together, and the comments show the relative
reference numbers for each statement:

```sqlexample
first_st -- Referenced as $4 in last_st, $3 in fourth_st, $2 in third_st, and $1 in second_st
  ->> second_st -- Referenced as $3 in last_st, $2 in fourth_st, and $1 in third_st
  ->> third_st  -- Referenced as $2 in last_st and $1 in fourth_st
  ->> fourth_st -- Referenced as $1 in last_st
  ->> last_st;
```

For example, this series of SQL statements has a pipe number reference in three SELECT statements, and each one takes
the results of the first SELECT statement as input:

```sqlexample
SELECT ...
  ->> SELECT ... FROM $1
  ->> SELECT ... FROM $2
  ->> SELECT ... FROM $3;
```

As shown, you end the chain of SQL statements by placing a semicolon after the last statement. Don’t place a semicolon
after the previous statements in the chain. The output of the entire chain is the final result of the last SQL statement.
Client tools, such as SnowSQL, treat the chain of statements as a single statement.

The pipe operator provides the following benefits:

* Simplifies the execution of dependent SQL statements.
* Improves the readability and flexibility of complex SQL operations.

### Syntax

```sqlsyntax
<sql_statement_1> ->> <sql_statement_2> [ ->> <sql_statement_n> ... ]
```

### Usage notes

* Each statement produces a result that can only be consumed by a subsequent statement in the chain.
* Statements are executed in their specified order. Unlike `RESULT_SCAN(LAST_QUERY_ID())`, the pipe number
  resolves to the correct result set in the chain, whether other queries were run concurrently
  outside of the chain or not.
* When a statement consumes the results of a previous statement, the result set consumed is equivalent to the
  result set returned by the [RESULT_SCAN](functions/result_scan.md) function that was passed the query
  ID of the previous statement.

  For example, these statements limit the output of the SHOW WAREHOUSES command to specific columns:

  ```sqlexample
  SHOW WAREHOUSES;

  SELECT "name", "state", "type", "size"
    FROM TABLE(RESULT_SCAN(LAST_QUERY_ID(-1)));
  ```

  This statement uses the pipe operator to produce the same results:

  ```sqlexample
  SHOW WAREHOUSES
    ->> SELECT "name", "state", "type", "size" FROM $1;
  ```

  The output column names for SHOW and DESCRIBE commands are generated in lowercase. If you consume a
  result set from a SHOW or DESCRIBE command with the pipe operator or the RESULT_SCAN function,
  use [double-quoted identifiers](identifiers-syntax.md) for the column names in the query to
  ensure that they match the column names in the output that was scanned. For example, if the name of an
  output column is `type`, then specify `"type"` for the identifier.
* A query that uses the pipe operator isn’t guaranteed to return rows in the same order as the input result set of
  a previous query in the chain. You can include an ORDER BY clause with the query to specify the order.
* An error raised by any SQL statement stops the execution of the chain, and that error is returned to the client.
* The last statement result is returned to the client.
* The statements are executed as a [Snowflake Scripting](../developer-guide/snowflake-scripting/index.md)
  anonymous block.

### Limitations

* The `$n` parameter is only valid in the FROM clause of a SQL statement.
* Each SQL statement produces a result that can only be consumed by a subsequent statement in the pipe
  chain. The results can’t be consumed outside of the pipe chain, except for the results of the last
  statement.
* Bind variables aren’t supported.
* Using the pipe operator in a multi-statement execution (that is, submitting multiple statements
  separated by `;` rather than `->>` in a single call) from Snowflake client drivers isn’t supported.
* When you use the pipe operator with [Snowflake Scripting](../developer-guide/snowflake-scripting/index.md),
  you can’t combine declaration and assignment of a RESULTSET if you use the pipe operator in the SQL statement.

  For example, the following code returns an error:

  ```sqlexample
  LET res RESULTSET := (SELECT 'myvalue' ->> SELECT $1 FROM $1);
  RETURN TABLE(res);
  ```

  The following example succeeds because it separates the declaration and assignment of a RESULTSET:

  ```sqlexample
  LET res RESULTSET;
  res := (SELECT 'myvalue' ->> SELECT $1 FROM $1);
  RETURN TABLE(res);
  ```

### Examples

The following examples use the pipe operator:

* Select a list of columns for the output of a SHOW command
* Execute queries that take input from queries on multiple tables
* Return the row counts for DML operations in a transaction
* Return the results for inserts into a table that is later dropped

#### Select a list of columns for the output of a SHOW command

Run a SHOW TABLES command, and use the pipe operator to limit the output to the `created_on`, `name`, and
`owner` columns for tables created after April 15, 2025.

```sqlexample
SHOW TABLES
  ->> SELECT "created_on" AS creation_date,
             "name" AS table_name,
             "owner" AS table_owner
        FROM $1
        WHERE creation_date > '2025-04-15'::DATE;
```

```output
+-------------------------------+-------------+--------------+
| CREATION_DATE                 | TABLE_NAME  | TABLE_OWNER  |
|-------------------------------+-------------+--------------|
| 2025-04-16 08:46:16.130 -0700 | TEST_TABLE1 | ACCOUNTADMIN |
| 2025-04-16 09:44:13.701 -0700 | MYTABLE1    | USER_ROLE    |
| 2025-04-16 08:46:32.092 -0700 | MYTABLE2    | USER_ROLE    |
+-------------------------------+-------------+--------------+
```

#### Execute queries that take input from queries on multiple tables

First, create a `dept_pipe_demo` table and an `emp_pipe_demo` table, and insert data into each one:

```sqlexample
CREATE OR REPLACE TABLE dept_pipe_demo (
  deptno NUMBER(2),
  dname VARCHAR(14),
  loc VARCHAR(13)
  ) AS SELECT * FROM VALUES
     (10, 'ACCOUNTING', 'NEW YORK'),
     (20, 'RESEARCH', 'DALLAS'),
     (30, 'SALES', 'CHICAGO'),
     (40, 'OPERATIONS', 'BOSTON');

CREATE OR REPLACE TABLE emp_pipe_demo (
  empno NUMBER(4),
  ename VARCHAR(10),
  sal NUMBER(7,2),
  deptno NUMBER(2)
  ) AS SELECT * FROM VALUES
    (7369, 'SMITH', 800, 20),
    (7499, 'ALLEN', 1600, 30),
    (7521, 'WARD', 1250, 30),
    (7698, 'BLAKE', 2850, 30),
    (7782, 'CLARK', 2450, 10);
```

The following example uses the pipe operator for a chain of SQL statements that perform the following operations:

1. Query the `dept_pipe_demo` table to return rows where `dname` equals `SALES`.
2. Query the `emp_pipe_demo` table for employees with a salary greater than `1500` in the `SALES` department,
   using the results of the previous query as input by specifying `$1` in the WHERE condition of a FROM clause.
3. Run a query that returns the `ename` and `sal` values using the results of the previous query as input
   by specifying `$1` in the FROM clause.

```sqlexample
SELECT * FROM dept_pipe_demo WHERE dname = 'SALES'
  ->> SELECT * FROM emp_pipe_demo WHERE sal > 1500 AND deptno IN (SELECT deptno FROM $1)
  ->> SELECT ename, sal FROM $1 ORDER BY 2 DESC;
```

```output
+-------+---------+
| ENAME |     SAL |
|-------+---------|
| BLAKE | 2850.00 |
| ALLEN | 1600.00 |
+-------+---------+
```

> **Note:**
>
> The purpose of this example is to show how to combine a series of queries with the pipe operator. However, the
> same output can be achieved with a join query, and join queries typically perform better than queries combined
> with the pipe operator.

#### Return the row counts for DML operations in a transaction

Create a table and insert rows one by one. Chaining all the statements lets you use the
pipe operator to examine the result of each INSERT statement, which represents the total number of
rows inserted.

In each of the SELECT statements in the example, the `$1` in the SELECT list is a shorthand reference for
the first column, not a previous result in the pipe. The `$n` parameter for a pipe number is only
valid in the FROM clause.

```sqlexample
CREATE OR REPLACE TABLE test_sql_pipe_dml (a INT, b INT)
  ->> INSERT INTO test_sql_pipe_dml VALUES (1, 2)
  ->> INSERT INTO test_sql_pipe_dml VALUES (3, 4)
  ->> INSERT INTO test_sql_pipe_dml VALUES (5, 6)
  ->> INSERT INTO test_sql_pipe_dml VALUES (7, 8)
  ->> SELECT (SELECT $1 FROM $4) +
             (SELECT $1 FROM $3) +
             (SELECT $1 FROM $2) +
             (SELECT $1 FROM $1)
        AS "Number of rows";
```

```output
+----------------+
| Number of rows |
|----------------|
|              4 |
+----------------+
```

The following example uses the pipe operator for a chain of SQL statements that perform the following operations:

1. Begin a transaction.
2. Insert a row into the previously created table.
3. Delete rows from the table.
4. Update rows in the table.
5. Commit the transaction.
6. Query the number of rows that were affected by each DML operation.

```sqlexample
BEGIN TRANSACTION
  ->> INSERT INTO test_sql_pipe_dml VALUES (1, 2)
  ->> DELETE FROM test_sql_pipe_dml WHERE a = 1
  ->> UPDATE test_sql_pipe_dml SET b = 2
  ->> COMMIT
  ->> SELECT
        (SELECT $1 FROM $4) AS "Inserted rows",
        (SELECT $1 FROM $3) AS "Deleted rows",
        (SELECT $1 FROM $2) AS "Updated rows";
```

```output
+---------------+--------------+--------------+
| Inserted rows | Deleted rows | Updated rows |
|---------------+--------------+--------------|
|             1 |            2 |            3 |
+---------------+--------------+--------------+
```

#### Return the results for inserts into a table that is later dropped

This example uses the pipe operator for a chain of SQL statements that performs the following operations:

1. Create a table with an IDENTITY column.
2. Insert rows into the table.
3. Query the table.
4. Drop the table.
5. Query the results of pipe number `$2` (the SELECT statement).

The result set consumed in the last SELECT statement is equivalent to the result set returned by the
[RESULT_SCAN](functions/result_scan.md) function for the query ID of the previous SELECT statement.

```sqlexample
CREATE OR REPLACE TABLE test_sql_pipe_drop (
    id INT IDENTITY START 10 INCREMENT 1,
    data VARCHAR)
  ->> INSERT INTO test_sql_pipe_drop (data) VALUES ('row1'), ('row2'), ('row3')
  ->> SELECT * FROM test_sql_pipe_drop
  ->> DROP TABLE test_sql_pipe_drop
  ->> SELECT COUNT(*) "Number of rows", MAX(id) AS "Last ID" FROM $2;
```

```output
+----------------+---------+
| Number of rows | Last ID |
|----------------+---------|
|              3 |      12 |
+----------------+---------+
```

---
title: FOR (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/for.md
section: SQL General Reference
---

# FOR (Snowflake Scripting)

A `FOR` loop repeats a sequence of steps a specific number of times. The number of times might be specified by the
user, or might be specified by the number of rows in a [cursor](../../developer-guide/snowflake-scripting/cursors.md). The syntax
of these two types of `FOR` loops is slightly different.

For more information on loops, see [Working with loops](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [BREAK](break.md), [CONTINUE](continue.md)

## Syntax

To loop over all rows in a [cursor](../../developer-guide/snowflake-scripting/cursors.md), use:

> ```sqlsyntax
> FOR <row_variable> IN <cursor_name> DO
>     <statement>;
>     [ <statement>; ... ]
> END FOR [ <label> ] ;
> ```

To loop a specified number of times, use:

> ```sqlsyntax
> FOR <counter_variable> IN [ REVERSE ] <start> TO <end> { DO | LOOP }
>     <statement>;
>     [ <statement>; ... ]
> END { FOR | LOOP } [ <label> ] ;
> ```

Where:

> `row_variable`
> :   Specify a variable name that follows the rules for [Object identifiers](../identifiers.md).
>
>     Do not add a declaration for this variable in the DECLARE or BEGIN … END sections.
>     The name should not already be defined in the scope of the local block.
>
>     The name is valid inside the `FOR` loop, but not outside the `FOR` loop.
>
>     The `row_variable` holds one row from the cursor. Fields within that row are accessed using dot notation. For example:
>
>     > `my_row_variable.my_column_name`
>
>     A more complete example is included in the examples below.
>
> `counter_variable`
> :   Specify a variable name that follows the rules for [Object identifiers](../identifiers.md).
>
>     The name of the `counter_variable` is valid only inside the `FOR` loop.
>     If a variable with the same name is declared outside the loop, the outer variable and the loop variable are separate. Inside the
>     loop, references to that name are resolved to the loop variable.
>
>     The code inside the `FOR` loop is allowed to read the value of the counter variable, but should not change it. For
>     example, do not increment the counter variable manually to change the step size.
>
> `start`
> :   This is the initial value of `counter_variable`.
>
>     The starting value should be an INTEGER or an expression that evaluates to an INTEGER.
>
> `end`
> :   This is the final value of `counter_variable`, after the `counter_variable` has been incremented as you loop.
>
>     The ending value should be an INTEGER or an expression that evaluates to an INTEGER.
>
>     The `end` value should be greater than or equal to the `start` value. If `end` is less than
>     `start`, the loop executes 0 times (even if the `REVERSE` keyword is used).
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).
>
> `cursor_name`
> :   The name of the cursor to iterate through.
>
> `label`
> :   An optional label. Such a label can be a jump target for a [BREAK (Snowflake Scripting)](break.md) or
>     [CONTINUE (Snowflake Scripting)](continue.md) statement. A label must follow the naming rules for
>     [Object identifiers](../identifiers.md).

## Usage notes

* The loop iterates up to and including the `end` point.

  For example, `FOR i IN 1 TO 10` loops 10 times, and during the final iteration the value of `i` is 10.

  If you use the `REVERSE` keyword, then the loop iterates backwards down to and including the `start` value.
* A loop can contain multiple statements. You can use, but are not required to use, a [BEGIN … END (Snowflake Scripting)](begin.md)
  [block](../../developer-guide/snowflake-scripting/blocks.md) to contain those statements.
* The optional keyword `REVERSE` causes Snowflake to start with the `end` value and decrement down to the `start` value.
* Although you can change the value of the `counter_variable` inside the loop, Snowflake recommends that you avoid doing this.
  Changing the value makes the code more difficult to understand.
* If you use the keyword `DO`, then use `END FOR` at the end of the `FOR` loop. If you use the keyword `LOOP`, then use
  `END LOOP` at the end of the `FOR` loop.

## Examples

The following sections contain examples of different kinds of FOR loops:

> **Note:**
>
> For more examples, see [FOR loop](../../developer-guide/snowflake-scripting/loops.md).

### Cursor-based FOR loops

This example shows how to use a [cursor](../../developer-guide/snowflake-scripting/cursors.md) to sum the values in the `price`
column of all the rows returned by a query. This stored procedure behaves somewhat like an aggregate function.

> ```sqlexample
> CREATE or replace TABLE invoices (price NUMBER(12, 2));
> INSERT INTO invoices (price) VALUES
>     (11.11),
>     (22.22);
> ```
>
> ```sqlexample
> CREATE OR REPLACE PROCEDURE for_loop_over_cursor()
> RETURNS FLOAT
> LANGUAGE SQL
> AS
> $$
> DECLARE
>     total_price FLOAT;
>     c1 CURSOR FOR SELECT price FROM invoices;
> BEGIN
>     total_price := 0.0;
>     OPEN c1;
>     FOR rec IN c1 DO
>         total_price := total_price + rec.price;
>     END FOR;
>     CLOSE c1;
>     RETURN total_price;
> END;
> $$
> ;
> ```
>
> Here is the output of the stored procedure:
>
> ```sqlexample
> CALL for_loop_over_cursor();
> +----------------------+
> | FOR_LOOP_OVER_CURSOR |
> |----------------------|
> |                33.33 |
> +----------------------+
> ```

### Counter-based FOR loops

This example shows how to use a `FOR` loop to iterate a specified number of times:

> ```sqlexample
> CREATE PROCEDURE simple_for(iteration_limit INTEGER)
> RETURNS INTEGER
> LANGUAGE SQL
> AS
> $$
>     DECLARE
>         counter INTEGER DEFAULT 0;
>     BEGIN
>         FOR i IN 1 TO iteration_limit DO
>             counter := counter + 1;
>         END FOR;
>         RETURN counter;
>     END;
> $$;
> ```
>
> Here is the output of the stored procedure:
>
> ```sqlexample
> CALL simple_for(3);
> +------------+
> | SIMPLE_FOR |
> |------------|
> |          3 |
> +------------+
> ```

The following example shows how to use the `REVERSE` keyword to count backwards.

> ```sqlexample
> CREATE PROCEDURE reverse_loop(iteration_limit INTEGER)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
> $$
>     DECLARE
>         values_of_i VARCHAR DEFAULT '';
>     BEGIN
>         FOR i IN REVERSE 1 TO iteration_limit DO
>             values_of_i := values_of_i || ' ' || i::varchar;
>         END FOR;
>         RETURN values_of_i;
>     END;
> $$;
> ```
>
> Here is the output of the stored procedure:
>
> ```sqlexample
> CALL reverse_loop(3);
> +--------------+
> | REVERSE_LOOP |
> |--------------|
> |  3 2 1       |
> +--------------+
> ```

The following example shows the behavior when the loop counter variable has the same name (`i`) as a variable that was already
declared. Within the `FOR` loop, references to `i` resolve to the loop counter variable (not to the variable declared outside of
the loop).

> ```sqlexample
> CREATE PROCEDURE p(iteration_limit INTEGER)
> RETURNS VARCHAR
> LANGUAGE SQL
> AS
> $$
>     DECLARE
>         counter INTEGER DEFAULT 0;
>         i INTEGER DEFAULT -999;
>         return_value VARCHAR DEFAULT '';
>     BEGIN
>         FOR i IN 1 TO iteration_limit DO
>             counter := counter + 1;
>         END FOR;
>         return_value := 'counter: ' || counter::varchar || '\n';
>         return_value := return_value || 'i: ' || i::VARCHAR;
>         RETURN return_value;
>     END;
> $$;
> ```
>
> Here is the output of the stored procedure:
>
> ```sqlexample
> CALL p(3);
> +------------+
> | P          |
> |------------|
> | counter: 3 |
> | i: -999    |
> +------------+
> ```

---
title: FOR UPDATE
source: https://docs.snowflake.com/en/sql-reference/constructs/for-update.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# FOR UPDATE

Locks the rows that the query selects until the transaction that contains the query commits or
aborts.

This clause is supported for use with hybrid tables only, and is useful for transactional
workloads in which multiple transactions attempt to access the same rows at the same time.
Rows are locked for update in the sense that other transactions cannot write data to these
rows until the transaction doing the locking has been fully committed or rolled back.
However, other transactions can read the locked rows, and other rows in the same table can be
read, updated, or deleted.

```sqlsyntax
SELECT ...
  FROM ...
  [ ... ]
  FOR UPDATE [ NOWAIT | WAIT <wait_time> ]
```

## Parameters

`NOWAIT`
:   Returns an error if the transaction cannot lock the selected rows immediately.
    NOWAIT is the default.

`WAIT wait_time`
:   Specifies the maximum time (in seconds) that the query waits to acquire row-level locks. If
    the wait time expires, the query returns an error.

## Restrictions

The FOR UPDATE clause:

* Cannot be used with [AUTOCOMMIT transactions](../parameters.md).
* Must be the last clause in the [SELECT statement](../constructs.md).
* Cannot be used in a [CTAS statement](../sql/create-table.md).
* Cannot be used inside [subqueries](../../user-guide/querying-subqueries.md).
* Cannot select from [multiple tables (joins)](join.md) or
  [set operations](../operators-query.md).
* Cannot be used when the query contains:

  + [DISTINCT](../../user-guide/querying-distinct-counts.md)
  + [Aggregation functions](../functions-aggregation.md)
  + [GROUP BY](group-by.md)
  + [HAVING](having.md)
  + [Sequences](../../user-guide/querying-sequences.md)

## Usage notes

Because hybrid tables support the READ COMMITTED isolation level, FOR UPDATE clauses do not
guarantee read stability. For example, assume that a table `T` with a single column named `ID`
contains two rows with values `5` and `10`.

1. The following query is run in transaction `T1`:

   ```sqlexample
   SELECT * FROM T WHERE ID < 20 FOR UPDATE;
   ```

   The query returns the values `5` and `10` and locks those two rows.
2. Another transaction, `T2`, runs the following DELETE operation:

   ```sqlexample
   DELETE FROM T WHERE ID = 5;
   ```

   Transaction `T2` has to wait until `T1` completes (that is, until it commits or rolls back).
3. However, a third transaction, `T3`, can complete the following INSERT operation:

   ```sqlexample
   INSERT INTO T VALUES 12;
   ```
4. A subsequent query in `T1` now returns three values (rows), not two: `5`, `10`, and `12`:

   ```sqlexample
   SELECT * FROM T WHERE ID < 20;
   ```

## Examples

Open a new transaction, select all of the rows from a hybrid table (`ht`), and lock those
rows until the transaction commits. Update some selected rows and run another query before
committing the transaction.

```sqlexample
BEGIN;
...
SELECT * FROM ht ORDER BY c1 FOR UPDATE;
...
UPDATE ht set c1 = c1 + 10 WHERE c1 = 0;
...
SELECT ... ;
...
COMMIT;
```

Apply a maximum wait time of 60 seconds for row locking:

```sqlexample
BEGIN;
...
SELECT * FROM ht FOR UPDATE WAIT 60;
...
COMMIT;
```

---
title: FROM
source: https://docs.snowflake.com/en/sql-reference/constructs/from.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# FROM

Specifies the tables, views, or table functions to use in a [SELECT](../sql/select.md) statement.

See also:
:   [AT | BEFORE](at-before.md) , [CHANGES](changes.md) , [CONNECT BY](connect-by.md) , [JOIN](join.md) , [ASOF JOIN](asof-join.md), [MATCH_RECOGNIZE](match_recognize.md), [PIVOT](pivot.md) ,
    [SAMPLE / TABLESAMPLE](sample.md) , [SEMANTIC_VIEW](semantic_view.md), [UNPIVOT](unpivot.md),
    [Working with joins](../../user-guide/querying-joins.md), [Analyzing time-series data](../../user-guide/querying-time-series-data.md)

## Syntax

```sqlsyntax
SELECT ...
FROM objectReference [ JOIN objectReference [ ... ] ]
[ ... ]
```

Where:

> ```sqlsyntax
> objectReference ::=
>    {
>       [<namespace>.]<object_name>
>            [ AT | BEFORE ( <object_state> ) ]
>            [ CHANGES ( <change_tracking_type> ) ]
>            [ MATCH_RECOGNIZE ]
>            [ PIVOT | UNPIVOT ]
>            [ [ AS ] <alias_name> ]
>            [ SAMPLE ]
>      | <table_function>
>            [ PIVOT | UNPIVOT ]
>            [ [ AS ] <alias_name> ]
>            [ SAMPLE ]
>      | ( VALUES (...) )
>            [ SAMPLE ]
>      | [ LATERAL ] ( <subquery> )
>            [ [ AS ] <alias_name> ]
>      | @[<namespace>.]<stage_name>[/<path>]
>            [ ( FILE_FORMAT => <format_name>, PATTERN => '<regex_pattern>' ) ]
>            [ [ AS ] <alias_name> ]
>      | DIRECTORY( @<stage_name> )
>      | SEMANTIC_VIEW( ... )
>      | ERROR_TABLE( <base_table_name> )
>      | DYNAMIC_TABLE_REFRESH_BOUNDARY( <object_name> )
>            [ AT | BEFORE ( <object_state> ) ]
>            [ CHANGES ( <change_tracking_type> ) ]
>            [ [ AS ] <alias_name> ]
>    }
> ```

## Parameters

`JOIN`
:   Subclause that specifies to perform a join between two or more tables (or views or table functions).
    The join can be an inner join, outer join, or other type of join.
    The join can use the keyword JOIN or an alternative supported join syntax.
    For more details about joins, see [JOIN](join.md) and
    [ASOF JOIN](asof-join.md).

`[ AS ] alias_name`
:   Specifies a name given to the object reference it is attached to. Can be used with any of the other subclauses in the FROM clause.

    Alias names must follow the rules for [Object identifiers](../identifiers.md).

`VALUES`
:   The `VALUES` clause can specify literal values or expressions to be used in the `FROM` clause.
    This clause can contain table and column aliases (not shown in the diagram above).
    For more details about the VALUES clause, see [VALUES](values.md).

### Object or table function clause

`[namespace.]object_name`
:   Specifies the name of the object (table or view) being queried.

    The object name can be qualified using `namespace` (in the form of `db_name.schema_name.object_name` or `schema_name.object_name`). A namespace is not required if
    the context can be derived from the current database and schema for the session.

    When specifying a table/view name to query, you can also specify the following optional subclause:

    > `{ AT | BEFORE } ( object_state )`
    > :   Optional subclause that specifies the time-based or event-based historical state of the table or view for Time Travel. For more details, see [AT | BEFORE](at-before.md).
    >
    > `MATCH_RECOGNIZE`
    > :   Optional subclause for finding sequences of rows that match a pattern. For more details, see [MATCH_RECOGNIZE](match_recognize.md).

`table_function`
:   Specifies a system-defined table function, a UDF table function, or a class method to call within the FROM clause. For details,
    see the following topics:

    * [Using a table function in the FROM clause](../functions-table.md)
    * [Calling a UDTF](../../developer-guide/udf/udf-calling-sql.md)
    * [Selecting columns from SQL class instance methods that return tabular data](../snowflake-db-classes.md)

`{ PIVOT | UNPIVOT }`
:   Optional subclause that specifies to pivot or unpivot the results of the FROM clause. For more details, see [PIVOT](pivot.md) and [UNPIVOT](unpivot.md).

`SAMPLE`
:   Optional subclause that specifies to sample rows from the table/view. For more details, see [SAMPLE / TABLESAMPLE](sample.md).

### Inline view clause

`[ LATERAL ] ( subquery )`
:   Specifies an inline view within the FROM clause. If the optional `LATERAL` keyword is used, then the
    `subquery` can refer to columns from other tables (or views or table functions) that are in the current
    FROM clause and to the left of the inline view.

    For more information about subqueries in general, see [Working with Subqueries](../../user-guide/querying-subqueries.md).

### Staged file clause

`@[namespace.]stage_name[/path]`
:   Specifies a named stage to be queried (or `~` for referring to the stage for the current user or `%` followed by a table name for referring to the stage for the specified table).

    When querying a stage, you can also optionally specify a named file format and pattern:

    > `( FILE_FORMAT => format_name [ , PATTERN => 'regex_pattern' ] )`
    > :   Specifies a named file format object to use for the stage and a pattern to filter the set of files in the stage.

    For more details about querying stages, see [Query data in staged files](../../user-guide/querying-stage.md).

### Directory table clause

`DIRECTORY( @stage_name )`
:   Specifies the name of a stage that includes a [directory table](../../user-guide/data-load-dirtables.md).

### Hierarchical query result

`hierarchical_query_result`
:   A hierarchical query result is the result set from using a clause such as CONNECT BY to query a table of hierarchical
    data. For more details, see [CONNECT BY](connect-by.md).

### Semantic view clause

`SEMANTIC_VIEW(...)`
:   Specifies the [semantic view](../../user-guide/views-semantic/overview.md) that you want to
    [query](../../user-guide/views-semantic/querying.md). For information, see [SEMANTIC_VIEW](semantic_view.md).

### Error table clause

`ERROR_TABLE( base_table_name )`
:   Specifies the name of the base table associated with the error table that you want to query. For information, see
    [DML error logging](../../user-guide/data-load-overview.md).

### Dynamic table refresh boundary clause

`DYNAMIC_TABLE_REFRESH_BOUNDARY( <object_name> )`
:   Wraps a table, view, or dynamic table reference so that upstream dynamic tables reachable through it are not refreshed together with the
    downstream dynamic table. Used in dynamic table definitions to decouple pipelines. For information, see
    [Dynamic table refresh boundary](../../user-guide/dynamic-tables-refresh-boundary.md).

    `DYNAMIC_TABLE_REFRESH_BOUNDARY()` has no effect outside of dynamic table definitions. The wrapped object is read normally.

## Usage notes

* Object names are SQL identifiers. They are case-insensitive by default. To preserve case, enclose them between double quotes
  (`" "`).

## Examples

Create a table and load data into it:

```sqlexample
CREATE TABLE ftable1 (retail_price FLOAT, wholesale_cost FLOAT, description VARCHAR);

INSERT INTO ftable1 (retail_price, wholesale_cost, description)
  VALUES (14.00, 6.00, 'bling');
```

Here is a basic example of using the FROM clause:

```sqlexample
SELECT description, retail_price, wholesale_cost
  FROM ftable1;
```

```output
+-------------+--------------+----------------+
| DESCRIPTION | RETAIL_PRICE | WHOLESALE_COST |
|-------------+--------------+----------------|
| bling       |           14 |              6 |
+-------------+--------------+----------------+
```

The following example is identical to the previous example, but specifies the table name qualified by the schema for the table:

```sqlexample
SELECT description, retail_price, wholesale_cost
  FROM temporary_doc_test.ftable1;
```

```output
+-------------+--------------+----------------+
| DESCRIPTION | RETAIL_PRICE | WHOLESALE_COST |
|-------------+--------------+----------------|
| bling       |           14 |              6 |
+-------------+--------------+----------------+
```

The following example creates an inline view and then uses it in the query:

```sqlexample
SELECT v.profit
  FROM (SELECT retail_price - wholesale_cost AS profit FROM ftable1) AS v;
```

```output
+--------+
| PROFIT |
|--------|
|      8 |
+--------+
```

The following example queries a sample of 10% of the data in the table:

```sqlexample
SELECT *
  FROM sales SAMPLE(10);
```

The following example executes a user-defined table function (UDTF):

```sqlexample
SELECT *
  FROM TABLE(Fibonacci_Sequence_UDTF(6.0::FLOAT));
```

These examples use an `AT` clause to return historical data from the following specified points in the past:

* One day earlier than the current time (`-86400 = -3600 * 24`).
* Specific time and day.

```sqlexample
SELECT *
  FROM sales AT(OFFSET => -86400);

SELECT *
  FROM sales AT(TIMESTAMP => '2018-07-27 12:00:00'::TIMESTAMP);
```

For more information about `AT`, see [AT | BEFORE](at-before.md).

The following example queries files located in a named stage:

```sqlexample
SELECT v.$1, v.$2, ...
  FROM
    @my_stage( FILE_FORMAT => 'csv_format', PATTERN => '.*my_pattern.*') v;
```

The following example retrieves all metadata columns in a [directory table](../../user-guide/data-load-dirtables.md) for a stage named `mystage`:

```sqlexample
SELECT * FROM DIRECTORY(@mystage);
```

The following example retrieves the FILE_URL column values from a directory table for files greater than 100 K bytes in size:

```sqlexample
SELECT FILE_URL FROM DIRECTORY(@mystage) WHERE SIZE > 100000;
```

The following example retrieves the FILE_URL column values from a directory table for comma-separated value files:

```sqlexample
SELECT FILE_URL FROM DIRECTORY(@mystage) WHERE RELATIVE_PATH LIKE '%.csv';
```

---
title: GENERATE_SYNTHETIC_DATA
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/generate_synthetic_data.md
section: SQL General Reference
---

# GENERATE_SYNTHETIC_DATA

The procedure generates synthetic data from one or more tables, based on data from input tables, and returns a table that contains metrics
about the generated data, such as the coefficient of difference (similarity) between the source data and the generated data.

This stored procedure uses the [caller’s rights](../../developer-guide/stored-procedure/stored-procedures-rights.md) to generate the output table.

Read the [requirements](../../user-guide/synthetic-data.md) for running this procedure. If any requirements are not met, the request
will fail before it starts generating data.

[Learn more about synthetic data usage](../../user-guide/synthetic-data.md).

## Syntax

```sqlsyntax
SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA(<configuration_object>)
```

## Arguments

`configuration_object`
:   An [OBJECT](../data-types-semistructured.md) that specifies the details of the request. You can use an
    [OBJECT constant](../data-types-semistructured.md) to specify this object.

    The OBJECT value has the following structure:

    ```javascript
    {
      'datasets': [
        {
          'input_table': '<input_table_name>',
          'output_table' : '<output_table_name>',
          'columns': {
            '<column_name>': {
              <property_name>: <property_value>
            }
            , ...
          }
        }
        , ...
      ],
      'similarity_filter': <boolean>,
      'replace_output_tables': <boolean>,
      'consistency_secret': <session_scoped_reference_string>
    }
    ```

    The OBJECT value contains the following key-value pairs:

    `datasets`
    :   An [array](../data-types-semistructured.md) specifying the data to generate. Each element in the array is an OBJECT
        value that defines a single input-output table pair. You can specify a maximum of five table pairs.

        A child OBJECT value representing a single input-output table pair has the following properties:

        `input_table`
        :   The fully-qualified name of the input table from which to generate synthetic data.
            If the table does not exist or cannot be accessed, Snowflake returns an error message. See
            [Using synthetic data in Snowflake](../../user-guide/synthetic-data.md) for more input table requirements.

            If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
            Identifiers enclosed in double quotes are also case-sensitive.

            For more information, see [Identifier requirements](../identifiers-syntax.md).

        `output_table`
        :   The fully-qualified name of the output table to store the synthetic data generated from `input_table`. The generated table will
            have the same permissions and policies as if the user had called CREATE TABLE with default values. If the table already exists and
            `replace_output_tables=TRUE`, the existing table will be overwritten.

            In addition, the identifier must start with an alphabetic character and cannot contain spaces or special characters unless the
            entire identifier string is enclosed in double quotes (for example, `"My object"`). Identifiers enclosed in double quotes are also
            case-sensitive.

            For more information, see [Identifier requirements](../identifiers-syntax.md).

        `columns`
        :   (*Optional*) An OBJECT value specifying additional properties for specific columns. Each field in the OBJECT defines
            properties for a single column. You do not need to define properties for all columns, or any columns at all. For each
            field:

            * The key is the column name. Properties in the value should be applied to this column.
            * The value is an OBJECT value containing any of the following key-value pairs:

              + `join_key`: BOOLEAN, where TRUE indicates that this is a join key column. This cannot be used in a column labeled
                `categorical`. Column must be a string, numeric, or Boolean value. [Learn about join keys.](../../user-guide/synthetic-data.md)

                Default: FALSE.
              + `categorical`: BOOLEAN, used to specify whether the column is a categorical string. Set to TRUE to enable the output to
                mark the data as non-sensitive, and able to be used in the output. Set to FALSE to redact values from the output.
                If not specified, will be determined by examining the data. Can be specified only for STRING columns. If set to TRUE, you cannot
                specify the `replace` or `join_key` fields for this column

                Default: Inferred based on the column data.
              + `replace`: Specifies an output format for **STRING** values. Can be used only on categorical string columns. The only values
                that can be used with `join_key` columns are `uuid` and `email`. Cannot be used when `categorical` is TRUE. If specified,
                you must provide a value for `consistency_secret`. The following values are supported:

                `replace` values

                | Value | Description |
                | --- | --- |
                | `uuid` | A UUID. Example: `88d99a35-c4be-4022-b06a-41fb4629b46d` |
                | `name` | A first and last name in US locale style. Example: `George Washington` |
                | `first_name` | A first name in US locale style. Example: `George` |
                | `last_name` | A last name in US locale style. Example: `Washington` |
                | `address` | An abbreviated address in US locale style. Example: `1600 Pennsylvania Ave` |
                | `full_address` | A detailed street address in US locale style. Example: `1600 Pennsylvania Ave NW, Washington DC 20500` |
                | `email` | An email address. Example: `bdbQ6OPBS5ScOdJx8bVpFw@example.com` |
                | `phone` | A US-style 10-digit phone number in US locale style. Example: `212-555-1234` |
                | `ssn` | A US-style Social Security number. Example: `123-45-6789` |

                Default:

                - For `join_key` columns, `uuid`
                - For non-join-key columns, the value will be redacted.

    `similarity_filter`
    :   (Optional) Specifies whether to use a similarity filter when creating the synthetic data. Set this to TRUE to use the built-in privacy
        filter to remove rows from the target table that are too similar to rows in the input table. If FALSE, each output table will have the
        same number of rows as its input table; if TRUE, an output table might have fewer rows than its input table. If TRUE, synthetic data
        generation will fail if you have NULL values in any non-string columns.

        Default: FALSE

        For more information, see [Enhancing privacy](../../user-guide/synthetic-data.md).

    `replace_output_tables`
    :   (*Optional*) Specifies whether to overwrite the output synthetic data table when creating the synthetic data. Set this to TRUE to
        overwrite the output table.

        Default: FALSE

    `consistency_secret`
    :   Session-scoped reference STRING for a [symmetric key SECRET](../sql/create-secret.md). Required if either of the following
        conditions are met, (otherwise you can omit this field):

        * If you want consistency for join keys across multiple runs.
        * If `columns.replace` or `columns.join_key = TRUE` are specified on any column, and this procedure is run in an
          [owner’s rights stored procedure](../../developer-guide/stored-procedure/stored-procedures-rights.md).

        If you provide a secret, the procedure generates consistent values for STRING join keys across multiple runs that reuse the same
        consistency secret. If you provide a secret, you must have the READ or OWNERSHIP privilege on this secret.

        If you don’t provide a secret, join keys are consistent between tables in the same run, but not across multiple runs.
        [Learn more about consistency.](../../user-guide/synthetic-data.md)

        Default: No consistency

## Output

| Column Name | Data Type | Description |
| --- | --- | --- |
| `created_on` | TIMESTAMP | Time the synthetic data was generated. |
| `table_name` | VARCHAR | Name of the synthetic table. |
| `table_schema` | VARCHAR | Schema name of the synthetic table. |
| `table_database` | VARCHAR | Database name of the synthetic table. |
| `columns` | VARCHAR | A pair of columns in the synthetic table. |
| `source_table_id` | NUMBER | Internal/system-generated identifier of the input table. |
| `source_table_name` | VARCHAR | Name of the input table. |
| `source_table_schema` | VARCHAR | Schema name of the input table. |
| `source_table_database` | VARCHAR | Database name of the input table. |
| `source_columns` | VARCHAR | Names of the source columns. |
| `metric_type` | ENUM | `correlation_coefficient_difference` - Calculated as the absolute value of the correlation coefficient between two non-join columns in the source table and the same two columns in the generated data.  Currently, `correlation_coefficient_difference` is the only supported metric. This is the difference between the correlation coefficient of every combination of columns in the input table and the same coefficient in the generated data. Each row represents the correlation coefficient difference between one combination of columns. The column name pair is found in these columns: `columns` and `source_columns`. |
| `metric_value` | NUMBER | Value of the metric. |

## Access control requirements

To generate synthetic data, you must use a role with each the following grants:

* USAGE on the warehouse that you want to use for queries.
* SELECT on the input table from which you want to generate synthetic data.
* USAGE on the database and schema that contain the input table, and on the database that contains the output table.
* CREATE TABLE on the schema that contains the output table.
* OWNERSHIP on the output tables. The simplest way to do this is by granting OWNERSHIP to the schema where the output table is
  generated. (However, if someone has applied a FUTURE GRANT on this schema, table ownership will be silently overridden – that is,
  `GRANT OWNERSHIP ON FUTURE TABLES IN SCHEMA db.my_schema TO ROLE some_role` will automatically grant OWNERSHIP to `some_role` on any
  new tables created in schema `my_schema`.)

All users can access the SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA stored procedure. Access is made available using the
SNOWFLAKE.CORE_VIEWER database role, which is granted to the PUBLIC role.

## Usage notes

* The JSON key values must be lowercase.
* You must [accept the Anaconda terms and conditions](../../developer-guide/udf/python/udf-python-packages.md) in your Snowflake account in order to enable this
  feature.
* For additional requirements, see [Requirements](../../user-guide/synthetic-data.md).
* Any timestamps earlier than `1677-09-21 00:12:43.145224193` or later than `2262-04-11 23:47:16.854775807` in the source data are
  coerced to `1677-09-21 00:12:43.145224193` or `2262-04-11 23:47:16.854775807` respectively when generating synthetic data.

## Examples

This example generates synthetic data from an input table containing medical information (blood type, gender, age, and ethnicity). The
response shows the closeness in data between the source and generated tables. The generated synthetic data table is not shown.

**Two columns designated as join keys**

```sqlexample
CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
    'datasets':[
        {
          'input_table': 'syndata_db.sch.faker_source_t',
          'output_table': 'syndata_db.sch.faker_synthetic_t',
          'columns': { 'blood_type': {'join_key': TRUE} , 'ethnicity': {'join_key': TRUE}}
        }
      ]
  });
```

**No columns designated as join keys**

```sqlexample
CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
  'datasets':[
      {
        'input_table': 'syndata_db.sch.faker_source_t',
        'output_table': 'syndata_db.sch.faker_synthetic_t'
      }
    ]
});
```

**Use consistency key to generate consistent values across multiple runs**

```sqlexample
CREATE OR REPLACE SECRET my_db.public.my_consistency_secret
  TYPE = SYMMETRIC_KEY
  ALGORITHM = GENERIC;

CALL SNOWFLAKE.DATA_PRIVACY.GENERATE_SYNTHETIC_DATA({
  'datasets':[
      {
        'input_table': 'CLINICAL_DB.PUBLIC.BASE_TABLE',
        'output_table': 'my_db.public.test_syndata',
        'columns': { 'patient_id': {'join_key': TRUE, 'replace': 'uuid'}}
      }
    ],
    'consistency_secret': SYSTEM$REFERENCE('SECRET', 'MY_CONSISTENCY_SECRET', 'SESSION', 'READ')::STRING,
    'replace_output_tables': TRUE
});
```

**Output from calling the function**

```output
+---------------------------+-------------------+--------------+----------------+------------------------+-------------------+---------------------+-----------------------+------------------------+------------------------------------+----------------+
| CREATED_ON                | TABLE_NAME        | TABLE_SCHEMA | TABLE_DATABASE | COLUMNS                | SOURCE_TABLE_NAME | SOURCE_TABLE_SCHEMA | SOURCE_TABLE_DATABASE | SOURCE_COLUMNS         | METRIC_TYPE                        | METRIC_VALUE   |
+---------------------------+-------------------+--------------+----------------+------------------------+-------------------+---------------------+-----------------------+------------------------+------------------------------------+----------------+
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "BLOOD_TYPE,GENDER"    | faker_source_t    | sch                 | syndata_db            | "BLOOD_TYPE,GENDER"    | CORRELATION_COEFFICIENT_DIFFERENCE | 0.02430214616  |
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "BLOOD_TYPE,AGE"       | faker_source_t    | sch                 | syndata_db            | "BLOOD_TYPE,AGE"       | CORRELATION_COEFFICIENT_DIFFERENCE | 0.001919343586 |
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "BLOOD_TYPE,ETHNICITY" | faker_source_t    | sch                 | syndata_db            | "BLOOD_TYPE,ETHNICITY" | CORRELATION_COEFFICIENT_DIFFERENCE | 0.003720197046 |
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "GENDER,AGE"           | faker_source_t    | sch                 | syndata_db            | "GENDER,AGE"           | CORRELATION_COEFFICIENT_DIFFERENCE | 0.004348586645 |
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "GENDER,ETHNICITY"     | faker_source_t    | sch                 | syndata_db            | "GENDER,ETHNICITY"     | CORRELATION_COEFFICIENT_DIFFERENCE | 0.001171535243 |
| 2024-07-30 09:53:28.439 Z | faker_synthetic_t | sch          | syndata_db     | "AGE,ETHNICITY"        | faker_source_t    | sch                 | syndata_db            | "AGE,ETHNICITY"        | CORRELATION_COEFFICIENT_DIFFERENCE | 0.004265938158 |
+---------------------------+-------------------+--------------+----------------+------------------------+-------------------+---------------------+-----------------------+------------------------+------------------------------------+----------------+
```

---
title: Geospatial data types
source: https://docs.snowflake.com/en/sql-reference/data-types-geospatial.md
section: SQL General Reference
---

# Geospatial data types

Snowflake offers native support for geospatial features such as points, lines, and polygons on the Earth’s surface.

> **Tip:**
>
> You can use the search optimization service to improve query performance.
> For details, see [Search optimization service](../user-guide/search-optimization-service.md).

## Data types

Snowflake provides the following data types for geospatial data:

* The GEOGRAPHY data type, which models Earth as though it were a perfect sphere.
* The GEOMETRY data type, which represents features in a planar (Euclidean, Cartesian)
  coordinate system.

### GEOGRAPHY data type

The GEOGRAPHY data type follows the WGS 84 standard (spatial reference ID 4326; for details, see
<https://epsg.io/4326>).

Points on the earth are represented as degrees of longitude (from -180 degrees to +180 degrees) and latitude
(-90 to +90). Snowflake uses 14 decimal places to store GEOGRAPHY coordinates. When the data includes decimal
places exceeding this limit, the coordinates are rounded to ensure compliance with the specified length constraint.

Altitude currently isn’t supported.

Line segments are interpreted as great circle arcs on the Earth’s surface.

Snowflake also provides
[geospatial functions](functions-geospatial.md) that
operate on the GEOGRAPHY data type.

If you have geospatial data — such as longitude and latitude data, WKT, WKB, GeoJSON, and so on — we suggest converting and
storing the data in GEOGRAPHY columns, rather than keeping the data in their original formats in VARCHAR, VARIANT, or NUMBER columns.
Storing your data in GEOGRAPHY columns can significantly improve the performance of queries that use geospatial functionality.

When the input to a geospatial function for the GEOGRAPHY data type represents a polygon, the starting point and ending point in
the polygon must be the same. Otherwise, the function might return errors.

### GEOMETRY data type

The GEOMETRY data type represents features in a planar (Euclidean, Cartesian) coordinate system.

The coordinates are represented as pairs of real numbers (x, y). Currently, only 2D coordinates are supported.

The units of the X and Y are determined by the [spatial reference system (SRS)](https://en.wikipedia.org/wiki/Spatial_reference_system) associated with the GEOMETRY object.
The spatial reference system is identified by the [spatial reference system identifier (SRID)](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) number. Unless
the SRID is provided when creating the GEOMETRY object or by calling [ST_SETSRID](functions/st_setsrid.md), the SRID is 0.

Snowflake uses 14 decimal places to store GEOMETRY coordinates. When the data includes decimal
places exceeding this limit, the coordinates are rounded to ensure compliance with the specified length constraint.

Snowflake provides a set of
[geospatial functions that operate on the GEOMETRY data type](functions-geospatial.md). For these functions:

* All functions assume planar coordinates, even if the geometry uses a non-planar SRS.
* The measurement functions (for example, [ST_LENGTH](functions/st_length.md)) use the same units as the coordinate system.
* For functions that accept multiple GEOMETRY expressions as arguments (for example, [ST_DISTANCE](functions/st_distance.md)),
  the input expressions must be defined in the same SRS.

## Geospatial input and output

The following sections cover the supported standard formats and object types when reading and writing geospatial data.

* Supported standard input and output formats
* Supported geospatial object types
* Specifying the output format for result sets
* Examples of inserting and querying GEOGRAPHY data

### Supported standard input and output formats

The GEOGRAPHY and GEOMETRY data types support the following standard industry formats for input and output:

* [Well-Known Text](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry)
  (WKT)
* [Well-Known Binary](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Well-known_binary)
  (WKB)
* [Extended WKT and WKB (EWKT and EWKB)](https://en.wikipedia.org/wiki/Well-known_text_representation_of_geometry#Format_variations)
  (see the note on EWKT and EWKB handling)
* [IETF GeoJSON](https://tools.ietf.org/html/rfc7946)
  (see the note on GeoJSON handling)

You might also find the following Open Geospatial Consortium’s Simple Feature Access references helpful:

* [Common Architecture](https://www.opengeospatial.org/standards/sfa)
* [SQL Option](https://www.opengeospatial.org/standards/sfs)

Any departure from these standards is noted explicitly in the Snowflake documentation.

#### GeoJSON handling for GEOGRAPHY values

The WKT and WKB standards specify a format only. The semantics of WKT/WKB objects depend on the reference system (for
example, a plane or a sphere).

The GeoJSON standard, on the other hand, specifies both a format and its semantics: GeoJSON points are explicitly
WGS 84 coordinates, and GeoJSON line segments are planar edges (straight lines).

Contrary to that, the Snowflake GEOGRAPHY data type interprets all line segments, including those input from or
output to GeoJSON format, as great circle arcs. In essence, Snowflake treats GeoJSON as JSON-formatted WKT with spherical
semantics.

#### EWKT and EWKB handling for GEOGRAPHY values

EWKT and EWKB are non-standard formats [introduced by PostGIS](https://postgis.net/docs/ST_GeomFromEWKT.html).
They enhance the WKT and WKB formats by including a [spatial reference system identifier (SRID)](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier), which specifies the
coordinate reference system to use with the data. Snowflake currently supports only WGS84, which maps to SRID=4326.

By default, Snowflake issues an error if an EWKB or EWKT input value contains an SRID other than 4326. Conversely, all EWKB and EWKT output values have SRID=4326.

### Supported geospatial object types

The GEOGRAPHY and GEOMETRY data types can store the following types of geospatial objects:

* WKT / WKB / EWKT / EWKB / GeoJSON geospatial objects:

  + Point
  + MultiPoint
  + LineString
  + MultiLineString
  + Polygon
  + MultiPolygon
  + GeometryCollection
* These GeoJSON-specific geospatial objects:

  + Feature
  + FeatureCollection

### Specifying the output format for result sets

The session parameters [GEOGRAPHY_OUTPUT_FORMAT](parameters.md) and
[GEOMETRY_OUTPUT_FORMAT](parameters.md) control the rendering of GEOGRAPHY and GEOMETRY columns in
result sets (respectively).

These parameters can have one of the following values:

| Parameter value | Description |
| --- | --- |
| `GeoJSON` (default) | The GEOGRAPHY / GEOMETRY result is rendered as an OBJECT in GeoJSON format. |
| `WKT` | The GEOGRAPHY / GEOMETRY result is rendered as a VARCHAR in WKT format. |
| `WKB` | The GEOGRAPHY / GEOMETRY result is rendered as a BINARY in WKB format. |
| `EWKT` | The GEOGRAPHY / GEOMETRY result is rendered as a VARCHAR in EWKT format. |
| `EWKB` | The GEOGRAPHY / GEOMETRY result is rendered as a BINARY in EWKB format. |

For `EWKT` and `EWKB`, the SRID is always 4326 in the output. See EWKT and EWKB handling for GEOGRAPHY values.

This parameter affects all clients, including the Snowflake UI and the SnowSQL command-line client, as well as the
JDBC, ODBC, Node.js, Python, and so on drivers and connectors.

For example, the JDBC Driver returns the following metadata for a GEOGRAPHY-typed result column (column `i` in this
example):

* If `GEOGRAPHY_OUTPUT_FORMAT='GeoJSON'` or `GEOMETRY_OUTPUT_FORMAT='GeoJSON'`:

  + `ResultSetMetaData.getColumnType(i)` returns `java.sql.Types.VARCHAR`.
  + `ResultSetMetaData.getColumnClassName(i)` returns `"java.lang.String"`.
* If `GEOGRAPHY_OUTPUT_FORMAT='WKT'` or `'EWKT'`, or if `GEOMETRY_OUTPUT_FORMAT='WKT'` or `'EWKT'`:

  + `ResultSetMetaData.getColumnType(i)` returns `java.sql.Types.VARCHAR`.
  + `ResultSetMetaData.getColumnClassName(i)` returns `"java.lang.String"`.
* If `GEOGRAPHY_OUTPUT_FORMAT='WKB'` or `'EWKB'`, or if `GEOMETRY_OUTPUT_FORMAT='WKB'` or `'EWKB'`:

  + `ResultSetMetaData.getColumnType(i)` returns `java.sql.Types.BINARY`.
  + `ResultSetMetaData.getColumnClassName(i)` returns `"[B"` (array of byte).

> **Note:**
>
> APIs for retrieving database-specific type names (`getColumnTypeName` in JDBC and the
> `SQL_DESC_TYPE_NAME` descriptor in ODBC) always return `GEOGRAPHY` and `GEOMETRY` for the type name,
> regardless of the values of the `GEOGRAPHY_OUTPUT_FORMAT` and `GEOMETRY_OUTPUT_FORMAT` parameters. For details, see:
>
> * [Snowflake-specific behavior](../developer-guide/jdbc/jdbc-api.md) in the JDBC Driver documentation.
> * [Retrieving results and information about results](../developer-guide/odbc/odbc-api.md) in the ODBC Driver documentation.

### Examples of inserting and querying GEOGRAPHY data

The code below shows sample input and output for the GEOGRAPHY data type. Note the following:

* For the coordinates in WKT, EWKT, and GeoJSON, longitude appears before latitude (for example, `POINT(lon lat)`).

* For the WKB and EWKB output, it is assumed that the [BINARY_OUTPUT_FORMAT](parameters.md) parameter
  is set to `HEX` (the default value for the parameter).

The following example creates a table with a GEOGRAPHY column, inserts data in WKT format, and returns
the data in different output formats.

```sqlexample
CREATE OR REPLACE TABLE geospatial_table (id INTEGER, g GEOGRAPHY);
INSERT INTO geospatial_table VALUES
  (1, 'POINT(-122.35 37.55)'),
  (2, 'LINESTRING(-124.20 42.00, -120.01 41.99)');
```

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='GeoJSON';
```

```sqlexample
SELECT g
  FROM geospatial_table
  ORDER BY id;
```

```output
+------------------------+
| G                      |
|------------------------|
| {                      |
|   "coordinates": [     |
|     -122.35,           |
|     37.55              |
|   ],                   |
|   "type": "Point"      |
| }                      |
| {                      |
|   "coordinates": [     |
|     [                  |
|       -124.2,          |
|       42               |
|     ],                 |
|     [                  |
|       -120.01,         |
|       41.99            |
|     ]                  |
|   ],                   |
|   "type": "LineString" |
| }                      |
+------------------------+
```

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKT';
```

```sqlexample
SELECT g
  FROM geospatial_table
  ORDER BY id;
```

```output
+-------------------------------------+
| G                                   |
|-------------------------------------|
| POINT(-122.35 37.55)                |
| LINESTRING(-124.2 42,-120.01 41.99) |
+-------------------------------------+
```

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='WKB';
```

```sqlexample
SELECT g
  FROM geospatial_table
  ORDER BY id;
```

```output
+------------------------------------------------------------------------------------+
| G                                                                                  |
|------------------------------------------------------------------------------------|
| 01010000006666666666965EC06666666666C64240                                         |
| 010200000002000000CDCCCCCCCC0C5FC00000000000004540713D0AD7A3005EC01F85EB51B8FE4440 |
+------------------------------------------------------------------------------------+
```

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='EWKT';
```

```sqlexample
SELECT g
  FROM geospatial_table
  ORDER BY id;
```

```output
+-----------------------------------------------+
| G                                             |
|-----------------------------------------------|
| SRID=4326;POINT(-122.35 37.55)                |
| SRID=4326;LINESTRING(-124.2 42,-120.01 41.99) |
+-----------------------------------------------+
```

```sqlexample
ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT='EWKB';
```

```sqlexample
SELECT g
  FROM geospatial_table
  ORDER BY id;
```

```output
+--------------------------------------------------------------------------------------------+
| G                                                                                          |
|--------------------------------------------------------------------------------------------|
| 0101000020E61000006666666666965EC06666666666C64240                                         |
| 0102000020E610000002000000CDCCCCCCCC0C5FC00000000000004540713D0AD7A3005EC01F85EB51B8FE4440 |
+--------------------------------------------------------------------------------------------+
```

## Using geospatial data in Snowflake

The following sections cover how to work with geospatial data in Snowflake.

* Understanding the effects of using different SRIDs with GEOMETRY
* Changing the spatial reference system (SRS) and SRID of a GEOMETRY object
* Performing DML operations on GEOGRAPHY and GEOMETRY columns
* Loading geospatial data from stages
* Using geospatial data with Java UDFs
* Using geospatial data with JavaScript UDFs
* Using geospatial data with Python UDFs
* Using GEOGRAPHY objects with H3

### Understanding the effects of using different SRIDs with GEOMETRY

In a GEOMETRY column, you can insert objects that have different [SRIDs](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier). If the column contains more than one SRID, some of the
important performance optimizations aren’t applied. This can result in slower queries, in particular when joining on a geospatial
predicate.

### Changing the spatial reference system (SRS) and SRID of a GEOMETRY object

To change the [SRS](https://en.wikipedia.org/wiki/Spatial_reference_system) and [SRID](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) of an existing GEOMETRY object, call the [ST_TRANSFORM](functions/st_transform.md) function,
passing in the new SRID. The function returns a new GEOMETRY object with the new SRID and the coordinates converted to use the
SRS. For example, to return a GEOMETRY object for `geometry_expression` that uses the SRS for SRID 32633, execute the following
statement:

```sqlexample
SELECT ST_TRANSFORM(geometry_expression, 32633);
```

If the original SRID isn’t set correctly in the existing GEOMETRY object, specify the original SRID as an additional argument.
For example, if `geometry_expression` is a GEOMETRY object that uses the SRID 4326, and you want to transform this to use the
SRID 28992, execute the following statement:

```sqlexample
SELECT ST_TRANSFORM(geometry_expression, 4326, 28992);
```

If a GEOMETRY object uses the correct coordinates for a SRS but has the wrong SRID, you can fix the SRID by calling the
[ST_SETSRID](functions/st_setsrid.md) function. For example, the following statement sets the SRID for
`geometry_expression` to 4326, while leaving the coordinates unchanged:

```sqlexample
SELECT ST_SETSRID(geometry_expression, 4326);
```

### Performing DML operations on GEOGRAPHY and GEOMETRY columns

When a GEOGRAPHY or GEOMETRY column is the target of a DML operation (INSERT, COPY, UPDATE, MERGE, or CREATE TABLE AS…), the
column’s source expression can be any of the following types:

* GEOGRAPHY or GEOMETRY : An expression of type GEOGRAPHY or GEOMETRY is usually the result of a parsing function, a constructor
  function, or an existing GEOGRAPHY or GEOMETRY column. For a complete list of supported functions and categories of functions,
  see [Geospatial functions](functions-geospatial.md).
* VARCHAR: Interpreted as a WKT, WKB (in hex format), EWKT, EWKB (in hex format), or GeoJSON formatted string (see
  [TO_GEOGRAPHY(VARCHAR)](functions/to_geography.md)).
* BINARY: Interpreted as a WKB binary (see [TO_GEOGRAPHY(BINARY)](functions/to_geography.md) and
  [TO_GEOMETRY(BINARY)](functions/to_geometry.md)).
* VARIANT: Interpreted as a GeoJSON object (see [TO_GEOGRAPHY(VARIANT)](functions/to_geography.md) and
  [TO_GEOMETRY(VARIANT)](functions/to_geometry.md)).

### Loading geospatial data from stages

You can load data from CSV or JSON/AVRO files in a stage directly (that is, without copy transforms) into a
GEOGRAPHY column.

* CSV: String values from the corresponding CSV column are parsed as GeoJSON, WKT, EWKT, WKB, or EWKB (see
  [TO_GEOGRAPHY(VARCHAR)](functions/to_geography.md)).
* JSON/AVRO: The JSON values in the file are interpreted as GeoJSON (see
  [TO_GEOGRAPHY(VARIANT)](functions/to_geography.md)).

  See also GeoJSON handling for GEOGRAPHY values.

Loading data from other file formats (Parquet, ORC, and so on) is
possible through a [COPY](sql/copy-into-table.md) transform.

### Using geospatial data with Java UDFs

Java UDFs allow the GEOGRAPHY type as an argument and as a return value. See [SQL-Java Data Type Mappings](../developer-guide/udf-stored-procedure-data-type-mapping.md) and
[Passing a GEOGRAPHY value to an in-line Java UDF](../developer-guide/udf/java/udf-java-cookbook.md) for details.

### Using geospatial data with JavaScript UDFs

JavaScript UDFs allow the GEOGRAPHY or GEOMETRY type as an argument and as a return value.

If a JavaScript UDF has an argument of type GEOGRAPHY or GEOMETRY, that argument is visible as a JSON object in GeoJSON
format inside the UDF body.

If a JavaScript UDF returns GEOGRAPHY or GEOMETRY, the UDF body is expected to return a JSON object in GeoJSON format.

For example, these two JavaScript UDFs are roughly equivalent to the built-in functions ST_X and ST_MAKEPOINT:

```sqlexample
CREATE OR REPLACE FUNCTION my_st_x(g GEOGRAPHY) RETURNS REAL
LANGUAGE JAVASCRIPT
AS
$$
  if (G["type"] != "Point")
  {
     throw "Not a point"
  }
  return G["coordinates"][0]
$$;

CREATE OR REPLACE FUNCTION my_st_makepoint(lng REAL, lat REAL) RETURNS GEOGRAPHY
LANGUAGE JAVASCRIPT
AS
$$
  g = {}
  g["type"] = "Point"
  g["coordinates"] = [ LNG, LAT ]
  return g
$$;
```

### Using geospatial data with Python UDFs

Python UDFs allow the GEOGRAPHY and GEOMETRY types as arguments and as return values.

If a Python UDF has an argument of type GEOGRAPHY or GEOMETRY, that argument is represented as a
GeoJSON object, which is converted to a Python `dict` object inside the UDF body.

If a Python UDF returns GEOGRAPHY or GEOMETRY, the UDF body is expected to return a Python `dict` object
that complies with the structure of GeoJSON.

For example, this Python UDF returns the number of distinct geometries that constitute a composite GEOGRAPHY type:

```sqlexample-python
CREATE OR REPLACE FUNCTION py_numgeographys(geo GEOGRAPHY)
RETURNS INTEGER
LANGUAGE PYTHON
RUNTIME_VERSION = 3.10
PACKAGES = ('shapely')
HANDLER = 'udf'
AS $$
from shapely.geometry import shape, mapping
def udf(geo):
    if geo['type'] not in ('MultiPoint', 'MultiLineString', 'MultiPolygon', 'GeometryCollection'):
        raise ValueError('Must be a composite geometry type')
    else:
        g1 = shape(geo)
        return len(g1.geoms)
$$;
```

Check [Snowflake Labs](https://github.com/Snowflake-Labs/sf-samples/tree/main/samples/geospatial/Python%20UDFs) for more samples
of Python UDFs. Some of them enable complex spatial manipulations or simplify data ingestion. For example,
[this UDF](https://github.com/Snowflake-Labs/sf-samples/blob/main/samples/geospatial/Python%20UDFs/PY_LOAD_GEOFILES.sql) allows
reading formats that aren’t supported natively, such as Shapefiles (.SHP), TAB, KML, GPKG, and others.

> **Note:**
>
> The code samples in Snowflake Labs are intended solely for reference and educational purposes. These code samples aren’t covered
> by any Service Level Agreement.

### Using GEOGRAPHY objects with H3

[H3](https://h3geo.org/docs/) is a [hierarchical geospatial index](https://h3geo.org/docs/highlights/indexing) that partitions
the world into hexagonal cells in a [discrete global grid system](https://en.wikipedia.org/wiki/Discrete_global_grid).

Snowflake provides SQL functions that enable you to use H3 with GEOGRAPHY objects. You can
use these functions to:

* Get the H3 cell ID ([index](https://h3geo.org/docs/core-library/h3Indexing)) for a GEOGRAPHY object that represents a Point (and vice versa).
* Get the IDs of the minimal set of H3 cells that cover a GEOGRAPHY object.
* Get the IDs of the H3 cells that have centroids within a GEOGRAPHY object that represents a Polygon.
* Get the GEOGRAPHY object that represents the boundary of an H3 cell.
* Get the parents and children of a given H3 cell.
* Get the longitude and latitude of the centroid of an H3 cell (and vice versa).
* Get the [resolution](https://h3geo.org/docs/core-library/restable) of an H3 cell.
* Get the hexadecimal representation of an H3 cell ID (and vice versa).

For more information about these functions, see [Geospatial functions](functions-geospatial.md).

## Choosing the geospatial data type to use (GEOGRAPHY or GEOMETRY)

The next sections explain the differences between the GEOGRAPHY and GEOMETRY data types:

* Understanding the differences between GEOGRAPHY and GEOMETRY
* Examples comparing the GEOGRAPHY and GEOMETRY data types
* Understanding the differences in input data validation

### Understanding the differences between GEOGRAPHY and GEOMETRY

Although both the GEOGRAPHY and GEOMETRY data types define geospatial features, the types use different models. The following
table summarizes the differences.

| GEOGRAPHY data type | GEOMETRY data type |
| --- | --- |
| * Defines features on a sphere. * Only the WGS84 coordinate system. [SRID](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) is always 4326. * Coordinates are latitude (-90 to 90) and longitude (-180 to 180) in degrees. * Results of measurement operations (ST_LENGTH, ST_AREA, and so on) are in meters. * Segments are interpreted as great circle arcs on the Earth’s surface. | * Defines features on a plane. * Any coordinate system is supported. * Units of coordinate values are defined by the spatial reference system. * Results of measurement operations (ST_LENGTH, ST_AREA, and so on) are in the same unit as coordinates. For example, if the   input coordinates are in degrees, the results are in degrees. * Segments are interpreted as straight lines on the plane. |

### Examples comparing the GEOGRAPHY and GEOMETRY data types

The following examples compare the output of the geospatial functions when using the GEOGRAPHY and GEOMETRY data types as input.

#### Example 1: Querying the distance between Berlin and San Francisco

The following table compares the output of [ST_DISTANCE](functions/st_distance.md) for GEOGRAPHY types and GEOMETRY types:

| ST_DISTANCE using . GEOGRAPHY input | ST_DISTANCE using . GEOMETRY input |
| --- | --- |
| ```sqlexample SELECT ST_DISTANCE(     ST_POINT(13.4814, 52.5015),     ST_POINT(-121.8212, 36.8252))   AS distance_in_meters; ```  ```output +--------------------+ | DISTANCE_IN_METERS | |--------------------| |   9182410.99227821 | +--------------------+ ``` | ```sqlexample SELECT ST_DISTANCE(     ST_GEOMPOINT(13.4814, 52.5015),     ST_GEOMPOINT(-121.8212, 36.8252))   AS distance_in_degrees; ```  ```output +---------------------+ | DISTANCE_IN_DEGREES | |---------------------| |       136.207708844 | +---------------------+ ``` |

As shown in the example above:

* With GEOGRAPHY input values, the input coordinates are in degrees, and the output value is in meters. (The result is 9,182 km.)
* With GEOMETRY input values, the input coordinates and output value are degrees. (The result is 136.208 degrees.)

#### Example 2: Querying the area of Germany

The following table compares the output of [ST_AREA](functions/st_area.md) for GEOGRAPHY types and GEOMETRY types:

| ST_AREA using . GEOGRAPHY input | ST_AREA using . GEOMETRY input |
| --- | --- |
| ```sqlexample SELECT ST_AREA(border) AS area_in_sq_meters   FROM world_countries   WHERE name = 'Germany'; ```  ```output +-------------------+ | AREA_IN_SQ_METERS | |-------------------| |  356379183635.591 | +-------------------+ ``` | ```sqlexample SELECT ST_AREA(border) as area_in_sq_degrees   FROM world_countries_geom   WHERE name = 'Germany'; ```  ```output +--------------------+ | AREA_IN_SQ_DEGREES | |--------------------| |       45.930026848 | +--------------------+ ``` |

As shown in the example above:

* With GEOGRAPHY input values, the input coordinates are in degrees, the output value is in square meters. The result is
  356,379 km^2.
* With GEOMETRY input values, the input coordinates are in degrees, and the output value is in square degrees. The result is
  45.930 square degrees.

#### Example 3: Querying the names of countries overlapping the line from Berlin to San Francisco

The following table compares the output of [ST_INTERSECTS](functions/st_intersects.md) for GEOGRAPHY types and GEOMETRY types:

| ST_INTERSECTS using . GEOGRAPHY input | ST_INTERSECTS using . GEOMETRY input |
| --- | --- |
| ```sqlexample SELECT name FROM world_countries WHERE   ST_INTERSECTS(border,     TO_GEOGRAPHY(       'LINESTRING(13.4814 52.5015, -121.8212 36.8252)'     )); ```  ```output +--------------------------+ | NAME                     | |--------------------------| |                  Germany | |                  Denmark | |                  Iceland | |                Greenland | |                   Canada | | United States of America | +--------------------------+ ``` | ```sqlexample SELECT name FROM world_countries_geom WHERE   ST_INTERSECTS(border,     TO_GEOMETRY(       'LINESTRING(13.4814 52.5015, -121.8212 36.8252)'     )); ```  ```output +--------------------------+ | NAME                     | |--------------------------| |                  Germany | |                  Belgium | |              Netherlands | |           United Kingdom | | United States of America | +--------------------------+ ``` |
|  |  |

### Understanding the differences in input data validation

To create a GEOMETRY or GEOGRAPHY object for an input shape, you must use a shape that is well-formed and valid, according to the
[OGC rules for Simple Features](https://www.ogc.org/standards/sfa). The next sections explain how the validity of input data differs between GEOMETRY and GEOGRAPHY.

#### A shape can be valid GEOGRAPHY but invalid GEOMETRY

A given shape can be a valid GEOGRAPHY object but an invalid GEOMETRY object, and vice versa.

For example, self-intersecting polygons are disallowed by the OGC rules. A given set of points might define edges that intersect in
the Cartesian domain but not on a sphere. Consider the following polygon:

```none
POLYGON((0 50, 25 50, 50 50, 0 50))
```

In the Cartesian domain, this polygon degrades to a line and, as a result, is invalid.

However, on a sphere, this same polygon doesn’t intersect itself and is valid:

#### Conversion and constructor functions handle validation differently

When the input data is invalid, the GEOMETRY and GEOGRAPHY functions handle validation in different ways:

* Some of the functions for constructing and converting to GEOGRAPHY objects might attempt to repair the shape to handle problems
  such as unclosed loops, spikes, cuts, and self-intersecting loops in polygons. For example, when either the
  [TO_GEOGRAPHY](functions/to_geography.md) function or the
  [ST_MAKEPOLYGON](functions/st_makepolygon.md) function is used to
  construct a polygon, the function corrects the orientation of the loop to prevent the creation of polygons that span more than half of the
  globe. However, the [ST_MAKEPOLYGONORIENTED](functions/st_makepolygonoriented.md) function doesn’t attempt to correct the orientation of
  the loop.

  If the function is successful in repairing the shape, the function returns a GEOGRAPHY object.
* The functions for constructing and converting to GEOMETRY objects (for example, [TO_GEOMETRY](functions/to_geometry.md)) don’t
  support the ability to repair the shape.

## Converting between GEOGRAPHY and GEOMETRY

Snowflake supports converting from a GEOGRAPHY object to a GEOMETRY object (and vice versa). Snowflake also supports
transformations of objects that use different spatial reference systems (SRS).

The following example converts a GEOGRAPHY object that represents a point to a GEOMETRY object with the [SRID](https://en.wikipedia.org/wiki/Spatial_reference_system#Identifier) 0:

```sqlexample
SELECT TO_GEOMETRY(TO_GEOGRAPHY('POINT(-122.306100 37.554162)'));
```

To set the SRID of the new GEOMETRY object, pass the SRID as an argument to the constructor function. For example:

```sqlexample
SELECT TO_GEOMETRY(TO_GEOGRAPHY('POINT(-122.306100 37.554162)', 4326));
```

If you need to set the SRID of an existing GEOMETRY object, see Changing the spatial reference system (SRS) and SRID of a GEOMETRY object.

## Automatic performance optimizations for queries with geospatial predicates

Snowflake automatically implements the following performance optimizations for queries
with geospatial predicates:

* GeoJoin
* Geospatial pruning for GEOMETRY predicates

### GeoJoin

*GeoJoin* is a Snowflake query optimization feature for geospatial joins. It’s a specialized join rewrite optimization
that improves performance when joining tables based on predicates that call geospatial functions, such as ST_INTERSECTS,
ST_CONTAINS, ST_DWITHIN, and so on. For example, a GeoJoin might be used to find all stores within specific geographic
regions.

GeoJoin has the following characteristics:

* Automatically optimizes queries that join tables that use geospatial functions.
* Performs spatial overlap analysis between geographic datasets.

The GeoJoin optimization is triggered automatically by Snowflake’s query optimizer when it detects appropriate geospatial
join patterns in your SQL queries. No additional configuration is required to benefit from improved performance.

### Geospatial pruning for GEOMETRY predicates

Snowflake can improve the performance of some queries that filter on a GEOMETRY column by
[pruning micro-partitions](../user-guide/tables-clustering-micropartitions.md) that can’t contain matching
rows. This optimization uses bounding-box metadata stored with GEOMETRY values to avoid scanning data that is guaranteed
not to satisfy the predicate. That is, Snowflake skips micro-partitions whose stored bounding-box metadata don’t
intersect the bounding box of a constant geometry in your filter.

Snowflake performs this optimization automatically, and no additional configuration is required to benefit
from improved performance.

#### Predicates that can benefit from geospatial pruning

Pruning is designed for filter predicates on a GEOMETRY column where one side is a constant geometry, such as a
literal or constant-foldable expression. For example:

* `ST_INTERSECTS(geom_col, <constant_geometry>)`
* `ST_CONTAINS(geom_col, <constant_geometry>)`
* `ST_COVERS(geom_col, <constant_geometry>)`
* `ST_COVEREDBY(geom_col, <constant_geometry>)`
* `ST_WITHIN(geom_col, <constant_geometry>)`

These are the specific types of predicates that benefit from file-level pruning that uses bounding boxes.

#### How geospatial pruning works

A GEOMETRY value is stored with metadata that includes the following items:

* A bounding box (xmin, ymin, xmax, ymax)
* An SRID value

Snowflake uses file-level bounding-box metadata as the primary signal for pruning.

Snowflake pre-checks micro-partitions for bounding-box intersections before evaluating the exact geometry predicate.
If the metadata indicates that no bounding-box overlap is possible, the micro-partitions can be skipped.

The following illustration shows a bounding-box line with two intersecting shapes and two non-intersecting shapes:

The illustration shows a geospatial query that is similar to the following example:

```sqlexample
SELECT *
  FROM <table>
  WHERE ST_CONTAINS(<geo_column>, <constant_geometry>);
```

For a bounding box that is specified from the geospatial constant (shown in blue), only the micro-partitions that
correspond with bounding boxes that overlap the bounding-box line (shown in green) are scanned. Micro-partitions
that correspond with bounding boxes that don’t overlap (shown in red) are pruned.

#### SRID behavior

Spatial predicates in Snowflake are SRID-sensitive. Therefore, mixing incompatible SRIDs in the same column might
return incorrect results. For the best results and deterministic behavior, keep SRIDs consistent within a GEOMETRY
column, which is also the common real-world pattern.

#### Geospatial pruning for Iceberg tables

For Iceberg GEOMETRY columns, Snowflake tries to use the bounding-box metadata for geospatial pruning, the same way it
does for GEOMETRY columns in standard Snowflake tables. If the necessary statistics are present in the underlying Iceberg
metadata, the same pruning logic applies without modification.

#### Other performance considerations for geospatial pruning

Geospatial pruning works best when the data layout allows Snowflake to skip large ranges of micro-partitions. If rows
with similar locations are spread across many micro-partitions, Snowflake might need to scan a large portion of the
table, even when pruning is used.

Clustering by location can improve pruning efficiency. When you cluster rows by a spatial key derived from your GEOMETRY column,
objects that are near each other are more likely to be stored in the same micro-partitions. As a result, spatial filters,
such as ST_INTERSECTS or ST_CONTAINS with a constant geometry, can prune more micro-partitions and read less data.

Follow these best practices to optimize geospatial pruning:

* Cluster by a discretized spatial index, such as H3 or geohash, computed from a representative point of the
  geometry, such as the centroid. Use a resolution appropriate for your query window sizes.
* If your geometries are large, such as polygons that cover wide areas, consider clustering by multiple keys
  to reduce over-clustering and improve selectivity. For example, cluster by H3 at two resolutions or a coarse
  grid key plus a finer grid key. For datasets with a dominant query pattern — for example, “within a city” or
  “within a tile” — choose a clustering key that aligns with that pattern.

The following example clusters a table by a GEOMETRY column that contains points:

```sqlexample
ALTER TABLE <table_name>
  CLUSTER BY (H3_POINT_TO_CELL(<geo_column>, <h3_resolution>));
```

The following example clusters a table by a GEOMETRY column that contains LineStrings and Polygons:

```sqlexample
ALTER TABLE <table_name>
  CLUSTER BY (H3_POINT_TO_CELL(ST_CENTROID(<geo_column>), <h3_resolution>));
```

> **Note:**
>
> * Clustering doesn’t change query results. It changes how data is organized on storage. The benefits of clustering
>   depend on data distribution, table size, and query patterns.
> * You can monitor pruning effectiveness by comparing bytes scanned and micro-partitions scanned before and after
>   clustering.

#### Limitations for geospatial pruning

The following limitations apply to geospatial pruning:

* It applies only to GEOMETRY (planar) predicates.

  To prune GEOGRAPHY predicates, use [search optimization](../user-guide/search-optimization-service.md).
* It applies only when one predicate argument is a constant.

  If both predicate arguments are columns (for example, `ST_INTERSECTS(a.geom, b.geom)`), this optimization
  doesn’t apply. For such cases, GeoJoin might be used.

## Specifying how invalid geospatial shapes are handled

By default, when you use a [geospatial conversion function](functions-geospatial.md) to convert
data in a supported input format to a GEOGRAPHY or GEOMETRY object, the function
does the following:

1. The function attempts to validate the shape in the input data.
2. The function determines if the shape is valid according to the
   [Open Geospatial Consortium’s Simple Feature Access / Common Architecture](https://www.ogc.org/standards/sfa) standard.
3. If the shape is invalid, the function attempts to repair the data (for example, fixing polygons by closing the rings).
4. If the shape is still invalid after the repairs, the function reports an error and doesn’t create the GEOGRAPHY or GEOMETRY
   object. (For the TRY_\* functions, the functions return NULL, rather than reporting an error.)

With this feature, you have more control over the validation and repair process. You can:

* Allow these conversion functions to create GEOGRAPHY and GEOMETRY objects for invalid shapes.
* Determine if the shape for a GEOGRAPHY or GEOMETRY object is invalid.

### Understanding the effects of invalid shapes on geospatial functions

Different [geospatial functions](functions-geospatial.md) have different effects when you pass in a GEOGRAPHY
or GEOMETRY object for an invalid shape.

#### Effects on GEOMETRY objects

For GEOMETRY objects:

* The following functions return results based on the original invalid shape:

  + [ST_AREA](functions/st_area.md)
  + [ST_ASGEOJSON](functions/st_asgeojson.md)
  + [ST_ASWKB](functions/st_aswkb.md)
  + [ST_ASWKT](functions/st_aswkt.md)
  + [ST_CENTROID](functions/st_centroid.md)
  + [ST_CONTAINS](functions/st_contains.md)
  + [ST_DIMENSION](functions/st_dimension.md)
  + [ST_DISTANCE](functions/st_distance.md)
  + [ST_ENVELOPE](functions/st_envelope.md)
  + [ST_INTERSECTS](functions/st_intersects.md)
  + [ST_LENGTH](functions/st_length.md)
  + [ST_NPOINTS , ST_NUMPOINTS](functions/st_npoints.md)
  + [ST_PERIMETER](functions/st_perimeter.md)
  + [ST_SETSRID](functions/st_setsrid.md)
  + [ST_SRID](functions/st_srid.md)
  + [ST_X](functions/st_x.md)
  + [ST_XMAX](functions/st_xmax.md)
  + [ST_XMIN](functions/st_xmin.md)
  + [ST_Y](functions/st_y.md)
  + [ST_YMAX](functions/st_ymax.md)
  + [ST_YMIN](functions/st_ymin.md)
* The following functions validate the shape and fail with an error if the shape is invalid:

  + [ST_MAKELINE](functions/st_makeline.md)
  + [ST_MAKEPOLYGON](functions/st_makepolygon.md)

#### Effects on GEOGRAPHY objects

For GEOGRAPHY objects:

* The following functions return results based on the original invalid shape:

  + [ST_ASWKB](functions/st_aswkb.md)
  + [ST_ASWKT](functions/st_aswkt.md)
  + [ST_ASGEOJSON](functions/st_asgeojson.md)
  + [ST_AZIMUTH](functions/st_azimuth.md)
  + [ST_COLLECT](functions/st_collect.md)
  + [ST_DIMENSION](functions/st_dimension.md)
  + [ST_GEOHASH](functions/st_geohash.md)
  + [ST_HAUSDORFFDISTANCE](functions/st_hausdorffdistance.md)
  + [ST_MAKELINE](functions/st_makeline.md)
  + [ST_NPOINTS , ST_NUMPOINTS](functions/st_npoints.md)
  + [ST_POINTN](functions/st_pointn.md)
  + [ST_SRID](functions/st_srid.md)
  + [ST_ENDPOINT](functions/st_endpoint.md)
  + [ST_STARTPOINT](functions/st_startpoint.md)
  + [ST_X](functions/st_x.md)
  + [ST_Y](functions/st_y.md)
* The following functions validate the shape and fail with an error if the shape is invalid:

  + [ST_COLLECT](functions/st_collect.md)
  + [ST_MAKEPOLYGON](functions/st_makepolygon.md)
  + [ST_MAKEPOLYGONORIENTED](functions/st_makepolygonoriented.md)
* The following functions return NULL if it isn’t possible to compute the value:

  + [ST_AREA](functions/st_area.md)
  + [ST_CENTROID](functions/st_centroid.md)
  + [ST_CONTAINS](functions/st_contains.md)
  + [ST_COVERS](functions/st_covers.md)
  + [ST_DIFFERENCE](functions/st_difference.md)
  + [ST_DISTANCE](functions/st_distance.md)
  + [ST_DWITHIN](functions/st_dwithin.md)
  + [ST_ENVELOPE](functions/st_envelope.md)
  + [ST_INTERSECTION](functions/st_intersection.md)
  + [ST_INTERSECTION_AGG](functions/st_intersection_agg.md)
  + [ST_INTERSECTS](functions/st_intersects.md)
  + [ST_LENGTH](functions/st_length.md)
  + [ST_PERIMETER](functions/st_perimeter.md)
  + [ST_SIMPLIFY](functions/st_simplify.md)
  + [ST_SYMDIFFERENCE](functions/st_symdifference.md)
  + [ST_UNION](functions/st_union.md)
  + [ST_UNION_AGG](functions/st_union_agg.md)
  + [ST_XMAX](functions/st_xmax.md)
  + [ST_XMIN](functions/st_xmin.md)
  + [ST_YMAX](functions/st_ymax.md)
  + [ST_YMIN](functions/st_ymin.md)

### Working with invalid shapes

The next sections explain how to allow functions to create invalid shapes and how to determine if a GEOGRAPHY or GEOMETRY object
represents an invalid or repaired shape.

#### Allowing conversion functions to create invalid shapes

To allow the following conversion functions to create invalid geospatial objects, pass `TRUE` for the second argument
(`allowInvalid`):

```sqlsyntax
TO_GEOGRAPHY( <input> [, <allowInvalid> ] )
```

```sqlsyntax
ST_GEOGFROMWKB( <input> [, <allowInvalid> ] )
```

```sqlsyntax
ST_GEOGFROMWKT( <input> [, <allowInvalid> ] )
```

```sqlsyntax
TO_GEOMETRY( <input> [, <allowInvalid> ] )
```

```sqlsyntax
ST_GEOMFROMWKB( <input> [, <allowInvalid> ] )
```

```sqlsyntax
ST_GEOMFROMWKT( <input> [, <allowInvalid> ] )
```

By default, the `allowInvalid` argument is `FALSE`.

When you pass `TRUE` for the `allowInvalid` argument, the conversion function returns a GEOGRAPHY or GEOMETRY
object, even when the input shape is invalid and can’t be repaired successfully.

For example, the following input shape is a LineString that consists of the same two Points. Passing `TRUE` for the
`allowInvalid` argument returns a GEOMETRY object that represents an invalid shape:

```sqlexample
SELECT TO_GEOMETRY('LINESTRING(100 102,100 102)', TRUE);
```

#### Determining if a shape is invalid

To determine if a GEOGRAPHY or GEOMETRY object is invalid, call the [ST_ISVALID](functions/st_isvalid.md) function.

The following example checks if an object is valid:

```sqlexample
SELECT TO_GEOMETRY('LINESTRING(100 102,100 102)', TRUE) AS g, ST_ISVALID(g);
```

---
title: Geospatial functions
source: https://docs.snowflake.com/en/sql-reference/functions-geospatial.md
section: SQL General Reference
---

# Geospatial functions

Geospatial functions operate on
[GEOGRAPHY](data-types-geospatial.md) and [GEOMETRY](data-types-geospatial.md) and convert GEOGRAPHY and GEOMETRY
values to and from other representations (such as VARCHAR).

| Sub-category | Function | Notes |
| --- | --- | --- |
| Conversion / Input / Parsing | [ST_GEOGFROMGEOHASH](functions/st_geogfromgeohash.md) | GEOGRAPHY only |
|  | [ST_GEOGPOINTFROMGEOHASH](functions/st_geogpointfromgeohash.md) | GEOGRAPHY only |
|  | [ST_GEOGRAPHYFROMWKB](functions/st_geographyfromwkb.md) | GEOGRAPHY only |
|  | [ST_GEOGRAPHYFROMWKT](functions/st_geographyfromwkt.md) | GEOGRAPHY only |
|  | [ST_GEOMETRYFROMWKB](functions/st_geometryfromwkb.md) | GEOMETRY only |
|  | [ST_GEOMETRYFROMWKT](functions/st_geometryfromwkt.md) | GEOMETRY only |
|  | [ST_GEOMFROMGEOHASH](functions/st_geomfromgeohash.md) | GEOMETRY only |
|  | [ST_GEOMPOINTFROMGEOHASH](functions/st_geompointfromgeohash.md) | GEOMETRY only |
|  | [TO_GEOGRAPHY](functions/to_geography.md) | GEOGRAPHY only |
|  | [TO_GEOMETRY](functions/to_geometry.md) | GEOMETRY only |
|  | [TRY_TO_GEOGRAPHY](functions/try_to_geography.md) | GEOGRAPHY only |
|  | [TRY_TO_GEOMETRY](functions/try_to_geometry.md) | GEOMETRY only |
| Conversion / Output / Formatting | [ST_ASGEOJSON](functions/st_asgeojson.md) |  |
|  | [ST_ASWKB](functions/st_aswkb.md) |  |
|  | [ST_ASBINARY](functions/st_aswkb.md) | Alias for ST_ASWKB |
|  | [ST_ASEWKB](functions/st_asewkb.md) |  |
|  | [ST_ASWKT](functions/st_aswkt.md) |  |
|  | [ST_ASTEXT](functions/st_aswkt.md) | Alias for ST_ASWKT |
|  | [ST_ASEWKT](functions/st_asewkt.md) |  |
|  | [ST_GEOHASH](functions/st_geohash.md) |  |
| Constructor Functions | [ST_MAKELINE](functions/st_makeline.md) |  |
|  | [ST_MAKEGEOMPOINT](functions/st_makegeompoint.md) | GEOMETRY only |
|  | [ST_GEOMPOINT](functions/st_makegeompoint.md) | Alias for ST_MAKEGEOMPOINT |
|  | [ST_MAKEPOINT](functions/st_makepoint.md) | GEOGRAPHY only |
|  | [ST_POINT](functions/st_makepoint.md) | Alias for ST_MAKEPOINT |
|  | [ST_MAKEPOLYGON](functions/st_makepolygon.md) |  |
|  | [ST_POLYGON](functions/st_makepolygon.md) | Alias for ST_MAKEPOLYGON |
|  | [ST_MAKEPOLYGONORIENTED](functions/st_makepolygonoriented.md) | GEOGRAPHY only |
| Accessor Functions | [ST_DIMENSION](functions/st_dimension.md) |  |
|  | [ST_ENDPOINT](functions/st_endpoint.md) |  |
|  | [ST_POINTN](functions/st_pointn.md) |  |
|  | [ST_SRID](functions/st_srid.md) |  |
|  | [ST_STARTPOINT](functions/st_startpoint.md) |  |
|  | [ST_X](functions/st_x.md) |  |
|  | [ST_XMAX](functions/st_xmax.md) |  |
|  | [ST_XMIN](functions/st_xmin.md) |  |
|  | [ST_Y](functions/st_y.md) |  |
|  | [ST_YMAX](functions/st_ymax.md) |  |
|  | [ST_YMIN](functions/st_ymin.md) |  |
| Relationship and Measurement Functions | [HAVERSINE](functions/haversine.md) |  |
|  | [ST_AREA](functions/st_area.md) |  |
|  | [ST_AZIMUTH](functions/st_azimuth.md) |  |
|  | [ST_CONTAINS](functions/st_contains.md) |  |
|  | [ST_COVEREDBY](functions/st_coveredby.md) |  |
|  | [ST_COVERS](functions/st_covers.md) |  |
|  | [ST_DISJOINT](functions/st_disjoint.md) |  |
|  | [ST_DISTANCE](functions/st_distance.md) |  |
|  | [ST_DWITHIN](functions/st_dwithin.md) | GEOGRAPHY only |
|  | [ST_HAUSDORFFDISTANCE](functions/st_hausdorffdistance.md) | GEOGRAPHY only |
|  | [ST_INTERSECTS](functions/st_intersects.md) |  |
|  | [ST_LENGTH](functions/st_length.md) |  |
|  | [ST_NPOINTS](functions/st_npoints.md) |  |
|  | [ST_NUMPOINTS](functions/st_npoints.md) | Alias for ST_NPOINTS |
|  | [ST_PERIMETER](functions/st_perimeter.md) |  |
|  | [ST_WITHIN](functions/st_within.md) |  |
| Transformation Functions | [ST_BUFFER](functions/st_buffer.md) | GEOMETRY only |
|  | [ST_CENTROID](functions/st_centroid.md) |  |
|  | [ST_COLLECT](functions/st_collect.md) (Scalar and Aggregate) | GEOGRAPHY only |
|  | [ST_DIFFERENCE](functions/st_difference.md) | GEOGRAPHY only |
|  | [ST_ENVELOPE](functions/st_envelope.md) | Deprecated for GEOGRAPHY |
|  | [ST_INTERPOLATE](functions/st_interpolate.md) | GEOGRAPHY only |
|  | [ST_INTERSECTION](functions/st_intersection.md) | GEOGRAPHY only |
|  | [ST_INTERSECTION_AGG](functions/st_intersection_agg.md) (Scalar and Aggregate) | GEOGRAPHY only |
|  | [ST_SETSRID](functions/st_setsrid.md) | GEOMETRY only |
|  | [ST_SIMPLIFY](functions/st_simplify.md) |  |
|  | [ST_SYMDIFFERENCE](functions/st_symdifference.md) | GEOGRAPHY only |
|  | [ST_TRANSFORM](functions/st_transform.md) | GEOMETRY only |
|  | [ST_UNION](functions/st_union.md) | GEOGRAPHY only |
|  | [ST_UNION_AGG](functions/st_union_agg.md) (Scalar and Aggregate) | GEOGRAPHY only |
| Utility Functions | [ST_ISVALID](functions/st_isvalid.md) |  |
| H3 Functions | [H3_CELL_TO_BOUNDARY](functions/h3_cell_to_boundary.md) | GEOGRAPHY only |
|  | [H3_CELL_TO_CHILDREN](functions/h3_cell_to_children.md) | GEOGRAPHY only |
|  | [H3_CELL_TO_CHILDREN_STRING](functions/h3_cell_to_children_string.md) | GEOGRAPHY only |
|  | [H3_CELL_TO_PARENT](functions/h3_cell_to_parent.md) | GEOGRAPHY only |
|  | [H3_CELL_TO_POINT](functions/h3_cell_to_point.md) | GEOGRAPHY only |
|  | [H3_COMPACT_CELLS](functions/h3_compact_cells.md) | GEOGRAPHY only |
|  | [H3_COMPACT_CELLS_STRINGS](functions/h3_compact_cells_strings.md) | GEOGRAPHY only |
|  | [H3_COVERAGE](functions/h3_coverage.md) | GEOGRAPHY only |
|  | [H3_COVERAGE_STRINGS](functions/h3_coverage_strings.md) | GEOGRAPHY only |
|  | [H3_GET_RESOLUTION](functions/h3_get_resolution.md) | GEOGRAPHY only |
|  | [H3_GRID_DISTANCE](functions/h3_grid_distance.md) | GEOGRAPHY only |
|  | [H3_GRID_DISK](functions/h3_grid_disk.md) | GEOGRAPHY only |
|  | [H3_GRID_PATH](functions/h3_grid_path.md) | GEOGRAPHY only |
|  | [H3_INT_TO_STRING](functions/h3_int_to_string.md) | GEOGRAPHY only |
|  | [H3_IS_PENTAGON](functions/h3_is_pentagon.md) | GEOGRAPHY only |
|  | [H3_IS_VALID_CELL](functions/h3_is_valid_cell.md) | GEOGRAPHY only |
|  | [H3_LATLNG_TO_CELL](functions/h3_latlng_to_cell.md) | GEOGRAPHY only |
|  | [H3_LATLNG_TO_CELL_STRING](functions/h3_latlng_to_cell_string.md) | GEOGRAPHY only |
|  | [H3_POINT_TO_CELL](functions/h3_point_to_cell.md) | GEOGRAPHY only |
|  | [H3_POINT_TO_CELL_STRING](functions/h3_point_to_cell_string.md) | GEOGRAPHY only |
|  | [H3_POLYGON_TO_CELLS](functions/h3_polygon_to_cells.md) | GEOGRAPHY only |
|  | [H3_POLYGON_TO_CELLS_STRINGS](functions/h3_polygon_to_cells_strings.md) | GEOGRAPHY only |
|  | [H3_STRING_TO_INT](functions/h3_string_to_int.md) | GEOGRAPHY only |
|  | [H3_TRY_COVERAGE](functions/h3_try_coverage.md) | GEOGRAPHY only |
|  | [H3_TRY_COVERAGE_STRINGS](functions/h3_try_coverage_strings.md) | GEOGRAPHY only |
|  | [H3_TRY_GRID_DISTANCE](functions/h3_try_grid_distance.md) | GEOGRAPHY only |
|  | [H3_TRY_GRID_PATH](functions/h3_try_grid_path.md) | GEOGRAPHY only |
|  | [H3_TRY_POLYGON_TO_CELLS](functions/h3_try_polygon_to_cells.md) | GEOGRAPHY only |
|  | [H3_TRY_POLYGON_TO_CELLS_STRINGS](functions/h3_try_polygon_to_cells_strings.md) | GEOGRAPHY only |
|  | [H3_UNCOMPACT_CELLS](functions/h3_uncompact_cells.md) | GEOGRAPHY only |
|  | [H3_UNCOMPACT_CELLS_STRINGS](functions/h3_uncompact_cells_strings.md) | GEOGRAPHY only |

---
title: GROUP BY
source: https://docs.snowflake.com/en/sql-reference/constructs/group-by.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# GROUP BY

Groups rows with the same group-by-item expressions and computes aggregate functions for the resulting group. A GROUP BY
expression can be:

* A column name.
* A number referencing a position in the [SELECT](../sql/select.md) list.
* A general expression.

## GROUP BY extensions

GROUP BY supports the following extensions that provide powerful aggregation capabilities:

* [GROUP BY GROUPING SETS](group-by-grouping-sets.md): Compute multiple GROUP BY clauses in a single statement
* [GROUP BY ROLLUP](group-by-rollup.md): Produce subtotal rows for hierarchical data
* [GROUP BY CUBE](group-by-cube.md) : Produce subtotal rows for all combinations of dimensions

You can combine these extensions with regular GROUP BY columns. For example:

* `GROUP BY x, GROUPING SETS(y, z)`
* `GROUP BY x, ROLLUP(y, z)`
* `GROUP BY x, CUBE(y, z)`

For more information about interpreting NULL values in extension results, see the
[GROUPING](../functions/grouping.md) utility function.

## Syntax

```sqlsyntax
SELECT ...
  FROM ...
  [ ... ]
  GROUP BY groupItem [ , groupItem [ , ... ] ]
  [ ... ]
```

```sqlsyntax
SELECT ...
  FROM ...
  [ ... ]
  GROUP BY ALL
  [ ... ]
```

Where:

> ```sqlsyntax
> groupItem ::= { <column_alias> | <position> | <expr> }
> ```

## Parameters

`column_alias`
:   Column alias appearing in the query block’s [SELECT](../sql/select.md) list.

`position`
:   Position of an expression in the [SELECT](../sql/select.md) list.

`expr`
:   Any expression on tables in the current scope.

`GROUP BY ALL`
:   Specifies that all items in the SELECT list that do not use aggregate functions should be used for grouping.

    For examples, refer to Group by all columns.

## Usage notes

* A GROUP BY clause can reference expressions in the projection clause by name or by position.
  If the GROUP BY clause references by name, each reference is resolved as follows:

  + If the query contains a database object (for example, a table or view) with a matching column name, the reference is resolved to the
    column name.
  + Otherwise, if the projection clause of the SELECT contains an expression alias with a matching name, the reference is
    resolved to the alias.

  For an example, see Precedence when a column name and an alias match.
* If all SELECT items use aggregate functions, specifying GROUP BY ALL is equivalent to specifying the statement without the
  GROUP BY clause.

  For example, the following statement only has SELECT items that use aggregate functions:

  ```sqlexample
  SELECT SUM(amount)
    FROM mytable
    GROUP BY ALL;
  ```

  The statement above is equivalent to not specifying the GROUP by clause:

  ```sqlexample
  SELECT SUM(amount)
    FROM mytable;
  ```

## Examples

The following sections provide examples of using the GROUP BY clause:

* Group by one column
* Group by multiple columns
* Group by all columns
* Precedence when a column name and an alias match

Note that the examples in each section use the data that you set up in Setting up the data for the examples.

### Setting up the data for the examples

The examples in this section use a table named `sales` and a table named `product`. To create these tables and insert the
data needed for the example, run the following commands:

```sqlexample
CREATE TABLE sales (
  product_ID INTEGER,
  retail_price REAL,
  quantity INTEGER,
  city VARCHAR,
  state VARCHAR);

INSERT INTO sales (product_id, retail_price, quantity, city, state) VALUES
  (1, 2.00,  1, 'SF', 'CA'),
  (1, 2.00,  2, 'SJ', 'CA'),
  (2, 5.00,  4, 'SF', 'CA'),
  (2, 5.00,  8, 'SJ', 'CA'),
  (2, 5.00, 16, 'Miami', 'FL'),
  (2, 5.00, 32, 'Orlando', 'FL'),
  (2, 5.00, 64, 'SJ', 'PR');

CREATE TABLE products (
  product_ID INTEGER,
  wholesale_price REAL);
INSERT INTO products (product_ID, wholesale_price) VALUES (1, 1.00);
INSERT INTO products (product_ID, wholesale_price) VALUES (2, 2.00);
```

### Group by one column

This example shows the gross revenue per product, grouped by `product_id` (that is, the total amount of money received for
each product):

```sqlexample
SELECT product_ID, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY product_ID;
```

```output
+------------+---------------+
| PRODUCT_ID | GROSS_REVENUE |
+------------+---------------+
|          1 |          6    |
|          2 |        620    |
+------------+---------------+
```

The following example builds on the previous example, showing the net profit per product, grouped by `product_id`:

```sqlexample
SELECT p.product_ID, SUM((s.retail_price - p.wholesale_price) * s.quantity) AS profit
  FROM products AS p, sales AS s
  WHERE s.product_ID = p.product_ID
  GROUP BY p.product_ID;
```

```output
+------------+--------+
| PRODUCT_ID | PROFIT |
+------------+--------+
|          1 |      3 |
|          2 |    372 |
+------------+--------+
```

### Group by multiple columns

The following example demonstrates how to group by multiple columns:

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY state, city;
```

```output
+-------+---------+---------------+
| STATE |   CITY  | GROSS REVENUE |
+-------+---------+---------------+
|   CA  | SF      |            22 |
|   CA  | SJ      |            44 |
|   FL  | Miami   |            80 |
|   FL  | Orlando |           160 |
|   PR  | SJ      |           320 |
+-------+---------+---------------+
```

### Group by all columns

The following example is equivalent to the example used in Group by multiple columns.

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY ALL;
```

```output
+-------+---------+---------------+
| STATE |   CITY  | GROSS REVENUE |
+-------+---------+---------------+
|   CA  | SF      |            22 |
|   CA  | SJ      |            44 |
|   FL  | Miami   |            80 |
|   FL  | Orlando |           160 |
|   PR  | SJ      |           320 |
+-------+---------+---------------+
```

### Precedence when a column name and an alias match

It is possible (but usually not recommended) to create a query that contains an alias that matches a column name:

```sqlexample
SELECT x, some_expression AS x
  FROM ...
```

If a clause contains a name that matches both a column name and an alias, then the clause uses the column name. The following example demonstrates this behavior using a GROUP BY clause:

Create a table and insert rows:

```sqlexample
CREATE TABLE employees (salary FLOAT, state VARCHAR, employment_state VARCHAR);
INSERT INTO employees (salary, state, employment_state) VALUES
  (60000, 'California', 'Active'),
  (70000, 'California', 'On leave'),
  (80000, 'Oregon', 'Active');
```

The following query returns the sum of the salaries of the employees who are active and the sum of the salaries of the employees who
are on leave:

```sqlexample
SELECT SUM(salary), ANY_VALUE(employment_state)
  FROM employees
  GROUP BY employment_state;
```

```output
+-------------+-----------------------------+
| SUM(SALARY) | ANY_VALUE(EMPLOYMENT_STATE) |
|-------------+-----------------------------|
|      140000 | Active                      |
|       70000 | On leave                    |
+-------------+-----------------------------+
```

The next query uses the alias `state`, which matches the name of a column of the table in the query. When `state` is used in
the GROUP BY clause, Snowflake interprets it as a reference to the column name, not the alias. This query therefore returns the sum of
the salaries of the employees in the state of California and the sum of the salaries of the employees in the state of Oregon,
yet displays `employment_state` information, such as `Active`, rather than the names of states or provinces:

```sqlexample
SELECT SUM(salary), ANY_VALUE(employment_state) AS state
  FROM employees
  GROUP BY state;
```

```output
+-------------+--------+
| SUM(SALARY) | STATE  |
|-------------+--------|
|      130000 | Active |
|       80000 | Active |
+-------------+--------+
```

---
title: GROUP BY CUBE
source: https://docs.snowflake.com/en/sql-reference/constructs/group-by-cube.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# GROUP BY CUBE

GROUP BY CUBE is an extension of the [GROUP BY](group-by.md) clause, similar to
[GROUP BY ROLLUP](group-by-rollup.md). Like ROLLUP, CUBE produces aggregated rows at
multiple levels. However, while ROLLUP creates aggregations that follow a natural hierarchy (for example,
city rolls up to state, and state rolls up to country), CUBE creates aggregations for all possible combinations of the specified
columns. These include both the hierarchical aggregations that ROLLUP would produce and additional
“cross-tabulation” rows that aggregate across each individual dimension independently.

CUBE can be combined with other GROUP BY expressions. For example, you can write
`GROUP BY x, CUBE(y, z)` to group by column `x` in combination with cube
aggregations on `y` and `z`.

A CUBE grouping is equivalent to a series of grouping sets and is essentially a shorter specification. The `N` elements of a CUBE
specification correspond to `2^N GROUPING SETS`.

## See also

* [GROUPING](../functions/grouping.md) (Utility function to identify which grouping level produced each row)
* [GROUP BY GROUPING SETS](group-by-grouping-sets.md)
* [GROUP BY ROLLUP](group-by-rollup.md)

## Syntax

```sqlsyntax
SELECT ...
FROM ...
[ ... ]
GROUP BY [ groupItem [ , groupItem [ , ... ] ] , ] CUBE ( groupItem [ , groupItem [ , ... ] ] )
[ ... ]
```

Where:

> ```sqlsyntax
> groupItem ::= { <column_alias> | <position> | <expr> }
> ```

## Parameters

`column_alias`
:   Column alias appearing in the query block’s [SELECT](../sql/select.md) list.

`position`
:   Position of an expression in the [SELECT](../sql/select.md) list.

`expr`
:   Any expression on tables in the current scope.

## Usage notes

* Snowflake allows up to 7 elements (equivalent to 128 grouping sets) in each cube.

## Examples

Start by creating and loading a table with information about sales from
a chain store that has branches in different cities and states/territories.

> ```sqlexample
> -- Create some tables and insert some rows.
> CREATE TABLE products (product_ID INTEGER, wholesale_price REAL);
> INSERT INTO products (product_ID, wholesale_price) VALUES
>     (1, 1.00),
>     (2, 2.00);
>
> CREATE TABLE sales (product_ID INTEGER, retail_price REAL,
>     quantity INTEGER, city VARCHAR, state VARCHAR);
> INSERT INTO sales (product_id, retail_price, quantity, city, state) VALUES
>     (1, 2.00,  1, 'SF', 'CA'),
>     (1, 2.00,  2, 'SJ', 'CA'),
>     (2, 5.00,  4, 'SF', 'CA'),
>     (2, 5.00,  8, 'SJ', 'CA'),
>     (2, 5.00, 16, 'Miami', 'FL'),
>     (2, 5.00, 32, 'Orlando', 'FL'),
>     (2, 5.00, 64, 'SJ', 'PR');
> ```

Run a cube query that shows profit by city, state, and total across all states.
The example below shows a query that has three “levels”:

* Each city.
* Each state.
* All revenue combined.

This example uses `ORDER BY state, city NULLS LAST` to ensure that each state’s rollup comes immediately after all of
the cities in that state, and that the final rollup appears at the end of the output.

> ```sqlexample
> SELECT state, city, SUM((s.retail_price - p.wholesale_price) * s.quantity) AS profit
>  FROM products AS p, sales AS s
>  WHERE s.product_ID = p.product_ID
>  GROUP BY CUBE (state, city)
>  ORDER BY state, city NULLS LAST
>  ;
> +-------+---------+--------+
> | STATE | CITY    | PROFIT |
> |-------+---------+--------|
> | CA    | SF      |     13 |
> | CA    | SJ      |     26 |
> | CA    | NULL    |     39 |
> | FL    | Miami   |     48 |
> | FL    | Orlando |     96 |
> | FL    | NULL    |    144 |
> | PR    | SJ      |    192 |
> | PR    | NULL    |    192 |
> | NULL  | Miami   |     48 |
> | NULL  | Orlando |     96 |
> | NULL  | SF      |     13 |
> | NULL  | SJ      |    218 |
> | NULL  | NULL    |    375 |
> +-------+---------+--------+
> ```

Some rollup rows contain NULL values. For example, the last row in the table contains a NULL value for the city and
a NULL value for the state because the data is for all cities and states, not a specific city and state.

The [GROUPING](../functions/grouping.md) utility function can help distinguish between NULL values
that result from the cube aggregation versus actual NULL values in the data.

Both GROUP BY CUBE and GROUP BY ROLLUP produce one row for each city/state pair, and both GROUP BY clauses also produce
rollup rows for each individual state and for all states combined. The difference between the two GROUP BY clauses is that
GROUP BY CUBE also produces an output row for each city name (`Miami`, `SJ`, and so on).

Take care when using GROUP BY CUBE on hierarchical data. In this example, the row for `SJ` contains totals for both the city
named `SJ` in the state of `CA` and the city named `SJ` in the territory of `PR`, even though the only relationship between those
cities is that they have the same name. In general, use GROUP BY ROLLUP to analyze hierarchical data, and GROUP BY CUBE to
analyze data across independent axes.

---
title: GROUP BY GROUPING SETS
source: https://docs.snowflake.com/en/sql-reference/constructs/group-by-grouping-sets.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# GROUP BY GROUPING SETS

GROUPING SETS is a powerful extension of the [GROUP BY](group-by.md) clause that computes multiple GROUP BY clauses in a single statement. A *grouping set* is a set of dimension columns.

GROUPING SETS expressions can be combined with other GROUP BY expressions, making this construct an integrated part of the GROUP BY clause rather than a separate construct. For example, you can write `GROUP BY x, GROUPING SETS(y, z)` to group by column `x` in combination with separate groupings on `y` and `z`.

A GROUPING SETS expression is equivalent to the union of two or more [GROUP BY](group-by.md) operations in the same result set. For example:

* `GROUP BY GROUPING SETS((a))` is equivalent to the single grouping set operation `GROUP BY a`.
* `GROUP BY GROUPING SETS((a), (b))` is equivalent to `GROUP BY a UNION ALL GROUP BY b`.

Note that `GROUPING SETS(a, b)` without additional parentheses is logically equivalent to `GROUPING SETS((a), (b))` because both create two separate grouping sets, one for column `a` and one for column `b`. This expression is quite different from `GROUPING SETS((a, b))`, which creates a single grouping set that groups by both columns.

## Syntax

```sqlsyntax
SELECT ...
FROM ...
[ ... ]
GROUP BY [ groupItem [ , groupItem [ , ... ] ] , ] GROUPING SETS ( groupSet [ , groupSet [ , ... ] ] )
[ ... ]
```

Where:

> ```sqlsyntax
> groupItem ::= { <column_alias> | <position> | <expr> }
>
> groupSet ::= groupItem | ( groupItem [ , groupItem [ , ... ] ] )
> ```

## Parameters

`column_alias`
:   Column alias appearing in the query block’s [SELECT](../sql/select.md) list.

`position`
:   Position of an expression in the [SELECT](../sql/select.md) list.

`expr`
:   Any expression on tables in the current scope.

## Usage notes

* Snowflake allows up to 128 grouping sets in the same query block.
* Syntax variations with parentheses:

  + `GROUPING SETS(a, b)` is shorthand for `GROUPING SETS((a), (b))`. Both create two separate grouping sets: one that groups by column `a`, and another that groups by column `b`.
  + `GROUPING SETS((a, b))` creates a single grouping set that groups by both columns `a` and `b` (similar to `GROUP BY a, b`).
* You can combine regular GROUP BY columns with GROUPING SETS: `GROUP BY x, GROUPING SETS(y, z)` groups by column `x` in combination with separate groupings on `y` and `z`.
* The output typically contains some NULL values. Because GROUP BY GROUPING SETS
  merges the results of two or more result sets, each of which was
  grouped by different criteria, some columns that have a single value
  in one result set might have many corresponding values in the
  other result set. For example, if you do a union of a set of
  employees grouped by department with a set grouped by seniority, the
  members of the set with the greatest seniority are not necessarily all
  in the same department, so the value of `department_name` is set to
  NULL. The following examples contain NULL values for this reason.

## See also

* [GROUPING](../functions/grouping.md) (Utility function to identify which grouping level produced each row)
* [GROUP BY ROLLUP](group-by-rollup.md)
* [GROUP BY CUBE](group-by-cube.md)

## Examples

These examples use a table of information about nurses who are trained to
assist in disasters. All of these nurses have a license as nurses (for example,
an RN has a license as a “Registered Nurse”), and an additional license (for example,
in a disaster-related specialty, such as search and rescue, radio
communications, and so on). This example simplifies and uses just two categories
of licenses:

* Nursing: RN (Registered Nurse) and LVN (Licensed Vocational Nurse).
* Amateur (“ham”) Radio: Ham radio licenses include “Technician”, “General”, and “Amateur Extra”.

The following commands create and load the table:

```sqlexample
CREATE or replace TABLE nurses (
  ID INTEGER,
  full_name VARCHAR,
  medical_license VARCHAR,   -- LVN, RN, etc.
  radio_license VARCHAR      -- Technician, General, Amateur Extra
  )
  ;

INSERT INTO nurses
    (ID, full_name, medical_license, radio_license)
  VALUES
    (201, 'Thomas Leonard Vicente', 'LVN', 'Technician'),
    (202, 'Tamara Lolita VanZant', 'LVN', 'Technician'),
    (341, 'Georgeann Linda Vente', 'LVN', 'General'),
    (471, 'Andrea Renee Nouveau', 'RN', 'Amateur Extra')
    ;
```

This query uses GROUP BY GROUPING SETS:

```sqlexample
SELECT COUNT(*), medical_license, radio_license
  FROM nurses
  GROUP BY GROUPING SETS (medical_license, radio_license)
  ORDER BY 3 DESC NULLS FIRST;
```

The first two rows show the count of RNs and LVNs (two types of nursing
licenses). The NULL values in the `radio_license` column for
those two rows are deliberate; the query grouped all of the LVNs together
(and all the RNs together) regardless of their radio license, so the
results can’t show one value in the `radio_license` column for each
row that necessarily applies to all the LVNs or RNs grouped in that row.

The next three rows show the number of nurses with each type of ham radio
license (“Technician”, “General”, and “Amateur Extra”). The NULL value
for `medical_license` in each of those three rows is deliberate because
no single medical license necessarily applies to all members of each
of those rows.

```output
+----------+-----------------+---------------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE |
|----------+-----------------+---------------|
|        3 | LVN             | NULL          |
|        1 | RN              | NULL          |
|        2 | NULL            | Technician    |
|        1 | NULL            | General       |
|        1 | NULL            | Amateur Extra |
+----------+-----------------+---------------+
```

The following example demonstrates the difference between grouping by columns
separately versus grouping by columns together. The query groups by the
combination of both `medical_license` and `radio_license`:

```sqlexample
SELECT COUNT(*), medical_license, radio_license
  FROM nurses
  GROUP BY GROUPING SETS ((medical_license, radio_license))
  ORDER BY 3 DESC NULLS FIRST;
```

This query produces rows where each combination of `medical_license` and
`radio_license` appears with its count. Unlike the previous example, there
are no NULL values in the output because the query groups by both columns
together rather than creating separate groupings for each column.

```output
+----------+-----------------+---------------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE |
|----------+-----------------+---------------|
|        2 | LVN             | Technician    |
|        1 | LVN             | General       |
|        1 | RN              | Amateur Extra |
+----------+-----------------+---------------+
```

The next example shows what happens when some columns contain NULL values.
Start by adding three new nurses who don’t yet have ham radio licenses.

```sqlexample
INSERT INTO nurses
    (ID, full_name, medical_license, radio_license)
  VALUES
    (101, 'Lily Vine', 'LVN', NULL),
    (102, 'Larry Vancouver', 'LVN', NULL),
    (172, 'Rhonda Nova', 'RN', NULL)
    ;
```

Then run the same query as before:

```sqlexample
SELECT COUNT(*), medical_license, radio_license
  FROM nurses
  GROUP BY GROUPING SETS (medical_license, radio_license)
  ORDER BY 3 DESC NULLS FIRST;
```

Why is there now a row that has NULL in both columns? And if all the values are
NULL, why is the COUNT(\*) result equal to 3?

The answer is that the NULL in the `radio_license` column of that row
occurs because three nurses don’t have any radio license. (The query
`SELECT DISTINCT radio_license FROM nurses` now returns four distinct
values: “Technician”, “General”, “Amateur Extra”, and “NULL”.)

The NULL value in the `medical_licenses` column occurs for the same reason that
NULL values occur in the earlier query results: the nurses counted in this
row have different medical licenses, so no one value (`RN` or `LVN`)
necessarily applies to all of the nurses counted in this row.

```output
+----------+-----------------+---------------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE |
|----------+-----------------+---------------|
|        2 | RN              | NULL          |
|        5 | LVN             | NULL          |
|        3 | NULL            | NULL          |
|        2 | NULL            | Technician    |
|        1 | NULL            | General       |
|        1 | NULL            | Amateur Extra |
+----------+-----------------+---------------+
```

The following example demonstrates the combination of regular GROUP BY columns with GROUPING SETS.
This query groups by `medical_license`, and within each medical license group, creates
separate aggregations for each `radio_license` value and for all radio licenses combined:

```sqlexample
SELECT COUNT(*), medical_license, radio_license
  FROM nurses
  GROUP BY medical_license, GROUPING SETS (radio_license, ())
  ORDER BY 3 DESC NULLS FIRST;
```

For each medical license (LVN and RN), the output shows:

* Rows grouped by each specific `radio_license` value (Technician, General, Amateur Extra, or NULL for those without a radio license)
* A summary row with NULL in the `radio_license` column representing all nurses with that medical license, regardless of their radio license

```output
+----------+-----------------+---------------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE |
|----------+-----------------+---------------|
|        2 | LVN             | NULL          |
|        1 | RN              | NULL          |
|        2 | RN              | NULL          |
|        5 | LVN             | NULL          |
|        2 | LVN             | Technician    |
|        1 | LVN             | General       |
|        1 | RN              | Amateur Extra |
+----------+-----------------+---------------+
```

You can compare this output to the output of a GROUP BY query without the GROUPING SETS clause:

```sqlexample
SELECT COUNT(*), medical_license, radio_license
  FROM nurses
  GROUP BY medical_license, radio_license
  ORDER BY 3 DESC NULLS FIRST;
```

```output
+----------+-----------------+---------------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE |
|----------+-----------------+---------------|
|        2 | LVN             | NULL          |
|        1 | RN              | NULL          |
|        2 | LVN             | Technician    |
|        1 | LVN             | General       |
|        1 | RN              | Amateur Extra |
+----------+-----------------+---------------+
```

### Using the GROUPING function

The [GROUPING](../functions/grouping.md) utility function helps identify
which level of aggregation produced each row. This is especially useful for distinguishing
between NULL values that result from the grouping operation versus actual NULL values in
the data.

The GROUPING function returns:

* `0` for a row that is grouped on the specified column
* `1` for a row that is not grouped on the specified column (where NULL appears due to aggregation)

This example adds GROUPING functions to the query to clarify the output:

```sqlexample
SELECT
    COUNT(*),
    medical_license,
    radio_license,
    GROUPING(medical_license) AS grp_medical,
    GROUPING(radio_license) AS grp_radio
  FROM nurses
  GROUP BY GROUPING SETS (medical_license, radio_license);
```

The `grp_medical` and `grp_radio` columns show which columns were used for grouping:

* Rows 1-2: Grouped by `medical_license` (`grp_medical=0`), not by `radio_license` (`grp_radio=1`)
* Rows 3-6: Grouped by `radio_license` (`grp_radio=0`), not by `medical_license` (`grp_medical=1`)
* Row 6: The NULL value in `radio_license` is actual data (`grp_radio=0`), while the NULL in `medical_license` is from aggregation (`grp_medical=1`)

```output
+----------+-----------------+---------------+-------------+-----------+
| COUNT(*) | MEDICAL_LICENSE | RADIO_LICENSE | GRP_MEDICAL | GRP_RADIO |
|----------+-----------------+---------------+-------------+-----------|
|        2 | RN              | NULL          |           0 |         1 |
|        5 | LVN             | NULL          |           0 |         1 |
|        2 | NULL            | Technician    |           1 |         0 |
|        1 | NULL            | General       |           1 |         0 |
|        3 | NULL            | NULL          |           1 |         0 |
|        1 | NULL            | Amateur Extra |           1 |         0 |
+----------+-----------------+---------------+-------------+-----------+
```

---
title: GROUP BY ROLLUP
source: https://docs.snowflake.com/en/sql-reference/constructs/group-by-rollup.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# GROUP BY ROLLUP

GROUP BY ROLLUP is an extension of the [GROUP BY](group-by.md) clause that produces
aggregated rows at multiple levels of a hierarchy (in addition to the detailed grouped rows). For example,
if you group by city and state, ROLLUP produces aggregations for each city/state combination, each state
total, and a grand total across all states. These aggregations are computed using the same aggregate
functions specified in the SELECT clause.

ROLLUP can be combined with other GROUP BY expressions. For example, you can write
`GROUP BY x, ROLLUP(y, z)` to group by column `x` in combination with rollup
aggregations on `y` and `z`.

You can think of rollup as generating multiple result sets, each of which
(after the first) is the aggregate of the previous result set. So, for example,
if you own a chain of retail stores, you might want to see the profit for:

* Each store.
* Each city (large cities might have multiple stores).
* Each state.
* Everything (all stores in all states).

You could create separate reports to get that information, but it is more
efficient to scan the data once.

If you are familiar with the concept of [grouping sets](group-by-grouping-sets.md),
you can think of a ROLLUP grouping as equivalent to a series of grouping sets,
but essentially a shorter specification. The `N` elements of
a ROLLUP specification correspond to `N+1 GROUPING SETS`.

## See also

* [GROUPING](../functions/grouping.md) (Utility function to identify which grouping level produced each row)
* [GROUP BY GROUPING SETS](group-by-grouping-sets.md)
* [GROUP BY CUBE](group-by-cube.md)

## Syntax

```sqlsyntax
SELECT ...
FROM ...
[ ... ]
GROUP BY [ groupItem [ , groupItem [ , ... ] ] , ] ROLLUP ( groupItem [ , groupItem [ , ... ] ] )
[ ... ]
```

Where:

> ```sqlsyntax
> groupItem ::= { <column_alias> | <position> | <expr> }
> ```

## Parameters

`column_alias`
:   Column alias appearing in the query block’s [SELECT](../sql/select.md) list.

`position`
:   Position of an expression in the [SELECT](../sql/select.md) list.

`expr`
:   Any expression on tables in the current scope.

## Usage notes

* As the query is aggregated at higher and higher levels, it shows NULL values
  in more columns of each row. This is appropriate. In the following example,
  for the aggregate at the state level, the `city` column is NULL;
  that’s because the value in the `profit` column does not correspond to one
  city. Similarly, in the final total, which aggregates data from all the
  states and all the cities, the revenue is not from one specific state or one
  specific city, so both the `state` and `city` columns in that row are NULL.
* The query should list the “most significant level” first in the parentheses
  after the ROLLUP. For example, states contain cities, so if you are rolling up
  data across states and cities, the clause should be `GROUP BY ROLLUP (state, city)`

  If you reverse the order of the column names, you get a result that is
  probably not what you want. In the following example, if you reversed the order
  of `city` and `state` in the ROLLUP clause, the result would be incorrect,
  at least in part because both California and Puerto Rico have a city named San Jose (`SJ`),
  and you probably would not want to combine the revenue from the two different San Jose cities,
  except in the final total of all revenue. (An alternative way to avoid combining data from
  different cities with the same name is to create a unique ID for each city and use the ID
  rather than the name in the query.)
* The [GROUPING](../functions/grouping.md) utility function can help distinguish
  between NULL values that result from the rollup aggregation versus actual NULL values in the data.
  GROUPING returns `0` for a row grouped on a specified column and `1` for a row where the column
  shows NULL because of aggregation.

## Examples

Start by creating and loading a table with information about sales at
a chain store that has branches in different cities and states/territories.

> ```sqlexample
> -- Create some tables and insert some rows.
> CREATE TABLE products (product_ID INTEGER, wholesale_price REAL);
> INSERT INTO products (product_ID, wholesale_price) VALUES
>     (1, 1.00),
>     (2, 2.00);
>
> CREATE TABLE sales (product_ID INTEGER, retail_price REAL,
>     quantity INTEGER, city VARCHAR, state VARCHAR);
> INSERT INTO sales (product_id, retail_price, quantity, city, state) VALUES
>     (1, 2.00,  1, 'SF', 'CA'),
>     (1, 2.00,  2, 'SJ', 'CA'),
>     (2, 5.00,  4, 'SF', 'CA'),
>     (2, 5.00,  8, 'SJ', 'CA'),
>     (2, 5.00, 16, 'Miami', 'FL'),
>     (2, 5.00, 32, 'Orlando', 'FL'),
>     (2, 5.00, 64, 'SJ', 'PR');
> ```

Run a rollup query that shows profit by city, state, and total across all
states. The query produces three “levels” of aggregation:

* Each city.
* Each state.
* All revenue combined across all states.

The query uses `ORDER BY state, city NULLS LAST` to ensure that each state’s rollup comes immediately after all of
the cities in that state, and that the final rollup appears at the end of the output.

> ```sqlexample
> SELECT state, city, SUM((s.retail_price - p.wholesale_price) * s.quantity) AS profit
>  FROM products AS p, sales AS s
>  WHERE s.product_ID = p.product_ID
>  GROUP BY ROLLUP (state, city)
>  ORDER BY state, city NULLS LAST
>  ;
> +-------+---------+--------+
> | STATE | CITY    | PROFIT |
> |-------+---------+--------|
> | CA    | SF      |     13 |
> | CA    | SJ      |     26 |
> | CA    | NULL    |     39 |
> | FL    | Miami   |     48 |
> | FL    | Orlando |     96 |
> | FL    | NULL    |    144 |
> | PR    | SJ      |    192 |
> | PR    | NULL    |    192 |
> | NULL  | NULL    |    375 |
> +-------+---------+--------+
> ```

Some rollup rows contain NULL values. For example, the last row in the table contains a NULL value for the city and
a NULL value for the state because the data is for all cities and states, not a specific city and state.

---
title: Hash functions
source: https://docs.snowflake.com/en/sql-reference/functions-hash-scalar.md
section: SQL General Reference
---

# Hash functions

Snowflake provides hash functions, which take input value(s) and return a signed 64-bit numeric value. Hash functions are deterministic.
Snowflake provides both a scalar hash function and an aggregate hash function, both of which are listed here.

> **Note:**
>
> The hash functions are not cryptographic hash functions.
>
> For cryptographic functions, use the SHA families of functions (see [String & binary functions](functions-string.md)).

| Function Name | Notes |
| --- | --- |
| [HASH](functions/hash.md) |  |
| [HASH_AGG](functions/hash_agg.md) | [Aggregate function](functions-aggregation.md). |

---
title: HAVING
source: https://docs.snowflake.com/en/sql-reference/constructs/having.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# HAVING

Filters rows produced by [GROUP BY](group-by.md) that do not satisfy a predicate.

## Syntax

```sqlsyntax
SELECT ...
FROM ...
GROUP BY ...
HAVING <predicate>
[ ... ]
```

## Parameters

`predicate`
:   A [Boolean expression](../data-types-logical.md).

## Usage notes

* The condition specified by the HAVING clause applies to expressions produced by the [GROUP BY](group-by.md).
  Therefore, the same restrictions that apply to [GROUP BY](group-by.md) expressions also apply to the HAVING
  clause. The predicate can only refer to:

  > + Constants.
  > + Expressions that appear in [GROUP BY](group-by.md).
  > + [Aggregate functions](../functions-aggregation.md).
* Expressions in the [SELECT](../sql/select.md) list can be referred to by the column alias defined in the list.

## Examples

Find the departments that have fewer than 10 employees:

> ```sqlexample
> SELECT department_id
> FROM employees
> GROUP BY department_id
> HAVING count(*) < 10;
> ```

---
title: ICEBERG_ACCESS_ERRORS view
source: https://docs.snowflake.com/en/sql-reference/monitoring/iceberg_access_errors.md
section: SQL General Reference
---

Schema:
:   [MONITORING](../monitoring.md)

# ICEBERG_ACCESS_ERRORS view

This MONITORING schema view displays [external volume](../../user-guide/tables-iceberg.md)
access errors for the account.

Use the information in this view to search for and troubleshoot access errors, which can result from situations like the following:

* Snowflake loses privileges to access the external volume storage location.
* Snowflake tries to access files that have been deleted or overwritten.
* Snowflake encounters other storage access issues.

See also:
:   [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)

## Columns

| Column name | Data type | Description |
| --- | --- | --- |
| EXTERNAL_VOLUME_ID | NUMBER | The unique ID of the external volume associated with the error. |
| EXTERNAL_VOLUME_NAME | VARCHAR | The name of the external volume associated with the error. |
| CREATED_ON | TIMESTAMP_LTZ | Date and time when the error was raised. |
| EXTERNAL_VOLUME_PATH | VARCHAR | Full path to the file on the external volume associated with the error. |
| MESSAGE | VARCHAR | The Snowflake error message. |
| STORAGE_METHOD_NAME | VARCHAR | The method (action) tried against the storage location; for example, `findRegionForLocation` or `deleteCurrentFiles`. |
| STORAGE_PROVIDER_ERROR_MESSAGE | VARCHAR | The error message received from your cloud service provider. |

## Examples

Retrieve all storage access errors for the external volume named `my_s3_external_volume`:

```sqlexample
SELECT * FROM snowflake.monitoring.iceberg_access_errors
  WHERE EXTERNAL_VOLUME_NAME ILIKE 'my_s3_external_volume';
```

Retrieve storage access errors that started within the last hour for the external volume named `my_external_volume`:

```sqlexample
SELECT * FROM snowflake.monitoring.iceberg_access_errors
  WHERE EXTERNAL_VOLUME_NAME ILIKE 'my_external_volume'
  AND CREATED_ON > DATEADD(HOUR, -1, CURRENT_TIMESTAMP());
```

---
title: Identifier requirements
source: https://docs.snowflake.com/en/sql-reference/identifiers-syntax.md
section: SQL General Reference
---

# Identifier requirements

Unquoted object identifiers:

* Start with a letter (A-Z, a-z) or an underscore (“_”).
* Contain only letters, underscores, decimal digits (0-9), and dollar signs (“$”).
* Are stored and resolved as uppercase characters (e.g. `id` is stored and resolved as `ID`).

If you put double quotes around an identifier (e.g.
“My identifier with blanks and punctuation.”), the following rules apply:

* The case of the identifier is preserved when storing and resolving the identifier (e.g. `"id"` is stored and resolved as
  `id`).
* The identifier can contain and start with ASCII, extended ASCII, and non-ASCII characters.

  To use the double quote character inside a quoted identifier, use two quotes. For example:

  ```sqlexample
  CREATE TABLE "quote""andunquote""" ...
  ```

  creates a table named:

  ```sqlexample
  quote"andunquote"
  ```

  where the quotation marks are part of the name.

> **Note:**
>
> * Regardless of whether an identifier is unquoted or double-quoted, the maximum number of characters allowed is 255 (including blank spaces).
> * Identifiers can also be specified using string literals, session variables or bind variables. For details, see [SQL variables](session-variables.md).

## Unquoted identifiers

If an identifier is not enclosed in double quotes, it must begin with a letter or underscore (`_`) and cannot contain extended characters or blank
spaces.

The following are all examples of valid identifiers; however, the case of the characters in these identifiers would not be preserved:

```none
myidentifier
MyIdentifier1
My$identifier
_my_identifier
```

Unquoted identifiers are stored and resolved in uppercase. Therefore, an unquoted identifier is equivalent to a capitalized double-quoted
identifier with the same name. For example, the following two statements attempt to create the same table:

```sqlexample
CREATE TABLE mytable(c1 INT, c2 INT);
```

```output
+-------------------------------------+
| status                              |
|-------------------------------------|
| Table MYTABLE successfully created. |
+-------------------------------------+
```

```sqlexample
CREATE TABLE "MYTABLE"(c1 INT, c2 INT);
```

```output
002002 (42710): SQL compilation error:
Object 'MYTABLE' already exists.
```

## Double-quoted identifiers

Delimited identifiers (i.e. identifiers enclosed in double quotes) are case-sensitive and can start with and contain any valid characters,
including:

* Numbers
* Special characters (`.`, `'`, `!`, `@`, `#`, `$`, `%`, `^`, `&`, `*`, etc.)
* Extended ASCII and non-ASCII characters
* Blank spaces

For example:

```none
"MyIdentifier"
"my.identifier"
"my identifier"
"My 'Identifier'"
"3rd_identifier"
"$Identifier"
"идентификатор"
```

> **Important:**
>
> If an object is created using a double-quoted identifier, when referenced in a query or any other SQL statement, the identifier must be specified
> exactly as created, including the double quotes. Failure to include the quotes might result in an `Object does not exist` error (or
> similar type of error).
>
> Also, note that the entire identifier must be enclosed in quotes when referenced in a query/SQL statement. This is particularly important if periods
> (`.`) are used in identifiers because periods are also used in fully-qualified object names to separate each object.
>
> For example:
>
> ```none
> "My.DB"."My.Schema"."Table.1"
> ```

### Exceptions

* Double-quoted identifiers are not supported for the
  [names of user-defined functions (UDFs) and procedures](../developer-guide/udf-stored-procedure-naming-conventions.md) in which the
  handler language is Java, JavaScript, Snowflake Scripting, or SQL.
* You can use only ASCII characters for the names of user-defined functions (UDFs) and procedures in which the handler language is Java.

## Identifier resolution

By default, Snowflake applies the following rules for storing identifiers (at creation/definition time) and resolving them (in queries and other SQL
statements):

* When an identifier is unquoted, it is stored and resolved in uppercase.
* When an identifier is double-quoted, it is stored and resolved exactly as entered, including case.

For example, the following four names are equivalent and all resolve to `TABLENAME`:

```none
TABLENAME
tablename
tableName
TableName
```

In contrast, the following four names are considered to be different, unique values:

```none
"TABLENAME"
"tablename"
"tableName"
"TableName"
```

If these identifiers were used to create objects of the same type (e.g. tables), they would result in the creation of four distinct objects.

## Migrating from databases that treat double-quoted identifiers as case-insensitive

In the ANSI/ISO standard for SQL, identifiers in double quotes (delimited identifiers) are treated as case-sensitive. However,
some companies provide databases that treat double-quoted identifiers as case-insensitive.

If you are migrating your data and applications from one of these databases to Snowflake, those applications might use double
quotes around identifiers that are intended to be case-insensitive. This can prevent Snowflake from resolving the identifiers
correctly. For example, an application might use double quotes around an identifier in lowercase, and the Snowflake database
has the identifier in uppercase.

To work around this limitation, Snowflake provides the [QUOTED_IDENTIFIERS_IGNORE_CASE](parameters.md) session parameter, which
causes Snowflake to treat lowercase letters in double-quoted identifiers as uppercase when creating and finding objects.

See the next sections for details:

* Controlling case using the QUOTED_IDENTIFIERS_IGNORE_CASE parameter
* Impact of changing the parameter

> **Note:**
>
> Changing the value of the parameter can affect your ability to find existing objects. See
> Impact of changing the parameter for details.

### Controlling case using the QUOTED_IDENTIFIERS_IGNORE_CASE parameter

To configure Snowflake to treat alphabetic characters in double-quoted identifiers as uppercase for the session, set the
parameter to TRUE for the session. With this setting, all alphabetical characters in identifiers are stored and resolved as
uppercase characters.

In other words, the following eight names are equivalent and all resolve to `TABLENAME`:

```none
TABLENAME
tablename
tableName
TableName
"TABLENAME"
"tablename"
"tableName"
"TableName"
```

Note that the parameter has no effect on any of the limitations for unquoted identifiers with regards to numbers, extended
characters, and blank spaces.

### Impact of changing the parameter

Changing the [QUOTED_IDENTIFIERS_IGNORE_CASE](parameters.md) session parameter only affects new objects and queries:

* With the default setting of FALSE, if an object is created using a double-quoted identifier with mixed case, Snowflake stores
  the identifier in mixed case.
* If the parameter is later changed to TRUE, Snowflake will not be able to resolve that double-quoted mixed case identifier and
  will not be able retrieve that object.

> **Tip:**
>
> Due to the impact that changing the parameter can have on resolving identifiers, we highly recommend choosing the
> identifier resolution method early in your implementation of Snowflake. Then, have your account administrator set the parameter
> at the account level to enforce this resolution method by default.
>
> Although you can override this parameter at the session level, we don’t encourage changing the parameter from the default,
> unless you have an explicit need to do so.

The following examples illustrate the behavior after changing the parameter from FALSE to TRUE:

```sqlexample
-- Set the default behavior
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = false;

-- Create a table with a double-quoted identifier
CREATE TABLE "One" (i int);  -- stored as "One"

-- Create a table with an unquoted identifier
CREATE TABLE TWO(j int);     -- stored as "TWO"

-- These queries work
SELECT * FROM "One";         -- searches for "One"
SELECT * FROM two;           -- searched for "TWO"
SELECT * FROM "TWO";         -- searches for "TWO"

-- These queries do not work
SELECT * FROM One;           -- searches for "ONE"
SELECT * FROM "Two";         -- searches for "Two"

-- Change to the all-uppercase behavior
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = true;

-- Create another table with a double-quoted identifier
CREATE TABLE "Three"(k int); -- stored as "THREE"

-- These queries work
SELECT * FROM "Two";         -- searches for "TWO"
SELECT * FROM two;           -- searched for "TWO"
SELECT * FROM "TWO";         -- searches for "TWO"
SELECT * FROM "Three";       -- searches for "THREE"
SELECT * FROM three;         -- searches for "THREE"

-- This query does not work now - "One" is not retrievable
SELECT * FROM "One";         -- searches for "ONE"
```

Additionally, if the identifiers for two tables differ only by case, one identifier might resolve to a different table after changing the parameter:

```sqlexample
-- Set the default behavior
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = false;

-- Create a table with a double-quoted identifier
CREATE TABLE "Tab" (i int);  -- stored as "Tab"

-- Create a table with an unquoted identifier
CREATE TABLE TAB(j int);     -- stored as "TAB"

-- This query retrieves "Tab"
SELECT * FROM "Tab";         -- searches for "Tab"

-- Change to the all-uppercase behavior
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = true;

-- This query retrieves "TAB"
SELECT * FROM "Tab";         -- searches for "TAB"
```

---
title: IF (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/if.md
section: SQL General Reference
---

# IF (Snowflake Scripting)

An `IF` statement provides a way to execute a set of statements if a condition is met.

For more information on branching constructs, see [Working with conditional logic](../../developer-guide/snowflake-scripting/branch.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

## Syntax

```sqlsyntax
IF ( <condition> ) THEN
    <statement>;
    [ <statement>; ... ]
[
ELSEIF ( <condition> ) THEN
    <statement>;
    [ <statement>; ... ]
]
[
ELSE
    <statement>;
    [ <statement>; ... ]
]
END IF;
```

Where:

> `condition`
> :   An expression that evaluates to a BOOLEAN.
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).

## Usage notes

* The keyword `THEN` is required.
* `ELSEIF` is one word (no spaces).
* `END IF` is two words.
* After each `THEN` or `ELSE` clause, the body allows the `BEGIN` and `END` keywords, but does not require
  them, even if the body contains more than one `statement`.
* If the `condition` is NULL, then it is treated as FALSE.

## Examples

Here is an example of a Snowflake Scripting `IF` statement inside a stored procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE example_if(flag INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  IF (FLAG = 1) THEN
    RETURN 'one';
  ELSEIF (FLAG = 2) THEN
    RETURN 'two';
  ELSE
    RETURN 'Unexpected input.';
  END IF;
END;
$$
;
```

Here is the command to call the stored procedure, along with the output:

```sqlexample
CALL example_if(3);
```

```output
+-------------------+
| EXAMPLE_IF        |
|-------------------|
| Unexpected input. |
+-------------------+
```

For more examples that use the `IF` statement, see:

* [Working with conditional logic](../../developer-guide/snowflake-scripting/branch.md) - Return different values based on IF conditions
  in a simple anonymous block.
* [Examples for common use cases of Snowflake Scripting](../../developer-guide/snowflake-scripting/use-cases.md) - Execute SQL statements based on IF conditions in loops.
* [BREAK](break.md), [LOOP](loop.md), and [Working with loops](../../developer-guide/snowflake-scripting/loops.md) -
  Execute BREAK statements to terminate a loop based on IF conditions.
* [EXCEPTION](exception.md) - Raise exceptions based on IF conditions.

---
title: INTO
source: https://docs.snowflake.com/en/sql-reference/constructs/into.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# INTO

Sets [Snowflake Scripting variables](../../developer-guide/snowflake-scripting/variables.md) to the values in a row returned by a
SELECT statement. See [Setting variables to the results of a SELECT statement](../../developer-guide/snowflake-scripting/variables.md) for details.

## Syntax

```sqlsyntax
SELECT <expression1>
   [ , <expression2> ]
   [ , <expressionN> ]
[ INTO :<variable1> ]
   [ , :<variable2> ]
   [ , :<variableN> ]
FROM ...
WHERE ...
[ ... ]
```

## Parameters

`expression1`, . `expression2`, . `expressionN`
:   Specifies scalar expressions (e.g. columns in a table specified by the [FROM](from.md) clause).

`variable1`, . `variable2`, . `variableN`
:   [Snowflake Scripting variables](../../developer-guide/snowflake-scripting/variables.md) that should be set to the values in the
    expressions in the SELECT clause.

## Usage notes

* The SELECT statement must return a single row.

## Examples

See [Setting variables to the results of a SELECT statement](../../developer-guide/snowflake-scripting/variables.md).

---
title: Introduction to external functions
source: https://docs.snowflake.com/en/sql-reference/external-functions-introduction.md
section: SQL General Reference
---

# Introduction to external functions

This topic describes external functions, which call executable code that is developed, maintained, stored, and
executed outside Snowflake.

This topic helps you:

* Understand what an external function is.
* Decide whether an external function is the best way for you to implement a
  [UDF (user-defined function).](../developer-guide/udf/udf-overview.md)
* Choose the cloud platform for your external function.

> **Note:**
>
> When using external functions in China, use the [syntax and workflow described for AWS](external-functions-creating-aws.md).

## What is an external function?

An *external function* calls code that is executed outside Snowflake.

The remotely executed code is known as a *remote service*.

Information sent to a remote service is usually relayed through a *proxy service*.

Snowflake stores security-related external function information in an *API integration*.

The diagram below shows the basic information flow from a client program, through Snowflake, and to the remote
service:

Each of the key components is described in more detail below.

External Function:
:   An external function is a type of [UDF](../developer-guide/udf/udf-overview.md).
    Unlike other UDFs, an external function does not contain its own code;
    instead, the external function calls code that is stored and executed outside Snowflake.

    Inside Snowflake, the external function is stored as a database object that contains information that
    Snowflake uses to call the remote service. This stored information includes the URL of the
    proxy service
    that relays information to and from the remote service. This information is specified as
    part of the [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) command.

    The database object that represents the external function is created in a specific database and schema. The
    external function can be called using dot notation to represent the fully-qualified name. For example:

    ```sqlexample
    select my_database.my_schema.my_external_function(col1) from table1;
    ```

Remote Service:
:   The remotely executed code is known as a remote service.

    The remote service must act like a function. For example, it must return a value.

    Snowflake supports *scalar* external functions; the remote service must return exactly one row for each
    row received.

    To be called by the Snowflake external function feature, the remote service must:

    * Accept [JSON](https://www.json.org) inputs and return JSON outputs. (For more information about
      Snowflake-compatible HTTP headers and JSON formatted data, see [Remote service input and output data formats](external-functions-data-format.md).)
    * Expose an HTTPS endpoint.

    For example, a remote service can be implemented as:

    * An AWS Lambda function.
    * A Microsoft Azure Function.
    * An HTTPS server (e.g. Node.js) running on an EC2 instance.

Proxy Service:
:   Snowflake does not call a
    remote service directly.
    Instead, Snowflake calls a proxy service, which relays the data to the remote service.

    The proxy service can increase security by authenticating requests to the remote service.

    The proxy service can support subscription-based billing for a remote service. For example, the proxy service
    can verify that a caller to the remote service is a paid subscriber.

    The proxy service also relays the response from the remote service back to Snowflake.

    Examples of proxy services include:

    * Amazon API Gateway.
    * Microsoft Azure API Management service.

API Integration:
:   An *integration* is a Snowflake object that provides an interface between Snowflake and third-party services.
    An API integration stores information, such as security information, that is needed to work with a proxy service
    or remote service.

    An API integration is created with the [CREATE API INTEGRATION](sql/create-api-integration.md) command.

Users can write and call their own remote services, or call remote services written by third parties. These remote
services can be written using any HTTP server stack, including cloud serverless compute services such as AWS Lambda.

From the perspective of a user running a SQL statement, an external function behaves like any other
[UDF](../developer-guide/udf/udf-overview.md) . External functions follow these rules:

* External functions return a value.
* External functions can accept parameters.
* An external function can appear in any clause of a SQL statement in which other types of
  [UDF](../developer-guide/udf/udf-overview.md) can appear. For example:

  > ```sqlexample
  > select my_external_function_2(column_1, column_2)
  >     from table_1;
  >
  > select col1
  >     from table_1
  >     where my_external_function_3(col2) < 0;
  >
  > create view view1 (col1) as
  >     select my_external_function_5(col1)
  >         from table9;
  > ```
* An external function can be part of a more complex expression:

  ```sqlexample
  select upper(zipcode_to_city_external_function(zipcode))
    from address_table;
  ```
* The returned value can be a compound value, such as a VARIANT that contains JSON.
* External functions can be overloaded; two different functions can have the same name
  but different signatures (different numbers or data types of input parameters).

## How external functions work

Snowflake does not call a remote service directly. Instead, Snowflake calls the remote service through a cloud
provider’s native HTTPS proxy service, for example API Gateway on AWS.

The main steps to call an external function are:

1. A user’s client program passes Snowflake a SQL statement that calls an external function.
2. When evaluating the external function as part of the query execution, Snowflake reads the external function
   definition and the corresponding API integration information.

   * The information from the external function definition includes:

     + The URL of the proxy service.
     + The name of the corresponding API integration.
   * The information from the API integration includes:

     + The proxy service resource to use. The resource contains information about the remote service, such as the
       location of that service.
     + The authentication information for that proxy service resource.

   Snowflake then composes an HTTP POST command that includes:

   * The data to be processed. This data is in JSON format.
   * HTTP header information. (Details are documented in [CREATE EXTERNAL FUNCTION](sql/create-external-function.md).)
   * Authentication information from the API integration.

   Snowflake then sends the POST request to the proxy service.
3. The proxy service receives the POST and then processes and forwards the request to the actual remote service.
   You can loosely think of the proxy service and resource as a “relay function” that calls the remote service.
4. The remote service processes the data and returns the result, which is passed back through the chain to the
   original SQL statement.
5. If the remote service responds with an HTTP code to signal
   [asynchronous](external-functions-implementation.md) processing, then Snowflake
   sends one or more HTTP GET requests to retrieve the result from the remote service. Snowflake continues to send GET
   requests as long as it receives the response code to keep requesting, or until the external function times out
   or returns an error.

Typically, when a query has a large number of rows to send to a remote service, the rows are split into
batches. Batches typically allow more parallelism and faster queries. In some cases, batches reduce
overloading of the remote service.

A remote service returns 1 batch of rows for each batch received. For a scalar external function, the
number of rows in the returned batch is equal to the number of rows in the received batch.

Each batch has a unique batch ID, which is included in each request sent from Snowflake to the remote service.

Retry operations (e.g. due to timeouts) are typically done at the batch level.

## Advantages of external functions

External functions have the following advantages over other [UDFs](../developer-guide/udf/udf-overview.md):

* The code for the remote service can be written in languages that other UDFs cannot be written in,
  including:

  + Go
  + C#
* Remote services can use functions and libraries that can’t be accessed by internal UDFs. For example,
  remote services can interface with commercially available third-party libraries,
  such as machine-learning scoring libraries.
* Developers can write remote services that can be called both from Snowflake and from other software
  written to use the same interface.

## Limitations of external functions

External functions have the following limitations, which can be loosely grouped into creation-time limitations and
execution-time limitations.

### Creation-time limitations and requirements

* Before an external function can be called the first time, an administrator must do
  some configuration work. This work requires knowledge of the cloud platform (e.g. AWS or Microsoft Azure),
  especially about security.
* Snowflake calls remote services indirectly through a cloud HTTP proxy service (such as
  the Amazon API Gateway), so the remote service for an external function must be
  callable from a proxy service. Fortunately, almost any function that can act as
  an HTTPS endpoint can be accessed as an external function via a proxy service.
  The function author must program the proxy service to call the remote service
  (e.g. a function running on AWS Lambda).
* Some cloud platforms might have specific requirements. For example, on AWS, external functions
  require regional endpoints or private endpoints. For more details, see
  Supported platforms. For more details about Amazon API Gateway regional and
  private endpoints, see [Choosing your endpoint type: Regional endpoint vs. Private endpoint](external-functions-creating-aws-planning.md).
* Only functions, not stored procedures, can be written using the external functions feature.
* [Future grants](sql/grant-privilege.md) of privileges on external functions are not supported.

### Execution-time limitations and issues

* Because the remote service is opaque to Snowflake, the optimizer might not be able to perform
  some optimizations that it could perform for equivalent functions.
* External functions have more overhead than functions (both built-in functions and internal UDFs) and
  usually execute more slowly.
* Currently, external functions must be scalar functions. A scalar external function returns a single value for each
  input row.
* Currently, external functions cannot be shared with data consumers via
  [Secure Data Sharing](../user-guide/data-sharing-gs.md).
* The maximum response size per batch is 10MB.
* External functions cannot be used in the following situations:

  + As part of a database object (e.g. table, view, UDF, or masking policy) shared
    via [Secure Data Sharing](../user-guide/data-sharing-intro.md). For example, you cannot create a shared view that uses an
    external function. The following is not supported:

    ```sqlexample
    CREATE VIEW my_shared_view AS SELECT my_external_function(x) ...;
    CREATE SHARE things_to_share;
    ...
    GRANT SELECT ON VIEW my_shared_view TO SHARE things_to_share;
    ...
    ```
  + A `DEFAULT` clause of a `CREATE TABLE` statement. In other words, the default
    value for a column cannot be an expression that calls an external function.
    If you try to include an external function in a `DEFAULT` clause, then the
    `CREATE TABLE` statement fails.
  + A [COPY transformation](../user-guide/data-load-transform.md).
* External functions can raise additional security issues. For example, if you call a
  third party’s function, that party could keep copies of the data passed to the function.

## Billing for external functions usage

Using external functions incurs normal costs associated with:

* [Snowflake warehouse usage.](../user-guide/cost-understanding-compute.md)
* [Data transfer.](../user-guide/cost-understanding-data-transfer.md)

In addition, you might need to pay indirect or third-party charges, including charges by the provider of the remote service. Charges can
vary from vendor to vendor.

> **Note:**
>
> Data sent via Amazon API Gateway Private Endpoints incurs AWS PrivateLink charges for both ingress and egress.

## Supported platforms

### Platforms that support calling an external function

In general, an external function can be called from a Snowflake account on any cloud platform that Snowflake supports:

* Amazon Web Services (AWS)
* Microsoft Azure
* Google Cloud Platform (GCP)

Exceptions are listed below:

* An external function accessed through an AWS API Gateway private endpoint can be accessed only from a Snowflake VPC (Virtual Private
  Cloud) on AWS and in the same AWS region. For more details about private endpoints on AWS, see
  [Choosing your endpoint type: Regional endpoint vs. Private endpoint](external-functions-creating-aws-planning.md).

The SQL syntax for calling an external function is the same on all platforms.

The SQL statements ([CREATE EXTERNAL FUNCTION](sql/create-external-function.md) and
[CREATE API INTEGRATION](sql/create-api-integration.md)) that configure access to these services
are the same for all platforms. However, the clauses within these statements vary, depending upon the platforms
hosting the proxy service and the remote service.

### Platforms that support creating an external function’s remote service and proxy service

Although an external function can be called from any platform, the external function’s remote service and
proxy service must each be created on specific supported platforms.

In many cases, the platform and account for the remote service are the same as the platform and account for the
proxy service. However, that is not required. For example, a SQL query could call an Azure Function (remote
service) via an AWS API Gateway (proxy service). The SQL query itself could be running on a Snowflake instance
running on GCP.

#### Platforms that support a remote service

You need an HTTP server stack to host the remote service. Any HTTP server stack that can support the remote
service should be compatible with external functions.

To create your remote service, you typically need:

* An account with a cloud platform’s provider (e.g. a Microsoft Azure account to create an Azure Function). This
  account provides storage and compute services for the remote service. This account is separate from your
  Snowflake account.

Snowflake provides instructions for creating a remote service as:

* An AWS Lambda function.
* A Microsoft Azure function.
* A Google Cloud Function.

#### Platforms that support a proxy service

You need an instance of a native HTTP proxy service on a cloud platform.

To configure your proxy service, you typically need:

* An account with a cloud platform’s provider (e.g. an Amazon account to use AWS). This account provides
  storage and compute services for the proxy service. This account is separate from your Snowflake account.
* A cloud platform role that has the privileges required to configure a proxy service.
  This cloud platform role is separate from your Snowflake role(s).

The following HTTPS proxy services are supported:

* Amazon API Gateway.
* Microsoft Azure API Management Service.
* Google Cloud API Gateway.

The sections below contain platform-specific information that users should be aware of before choosing a platform.

### Platform-specific restrictions

AWS:

* This feature supports only regional and private endpoints for the Amazon API Gateway. (For a description of the
  different types of endpoints, see
  [endpoints](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-basic-concept.html) .)
* Snowflake external functions and API integrations do not support
  [AWS custom domains](https://docs.aws.amazon.com/apigateway/latest/developerguide/how-to-custom-domains.html).
  To access an Amazon API Gateway from Snowflake, use the default URL generated by AWS, which looks similar to the following:

  ```none
  https://api-id.execute-api.region.amazonaws.com/stage
  ```

---
title: JOIN
source: https://docs.snowflake.com/en/sql-reference/constructs/join.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# JOIN

A `JOIN` operation combines rows from two tables — or other table-like sources, such as
views or table functions — to create a new combined row that can be used in the query.
For a conceptual explanation of joins, see [Working with joins](../../user-guide/querying-joins.md).

This topic describes how to use the `JOIN` subclause in the [FROM](from.md) clause.
The `JOIN` subclause specifies, explicitly or implicitly, how to relate rows
in one table to the corresponding rows in the other table. You can also use the [ASOF JOIN](asof-join.md)
subclause, which is used to join time-series data on timestamp columns when their values closely follow each other,
precede each other, or match exactly.

Although the recommended way to join tables is to use `JOIN` with the `ON` subclause of the `FROM` clause,
an alternative way to join tables is to use the `WHERE` clause. For details, see the documentation for the
[WHERE](where.md) clause.

## Syntax

Use one of the following:

```sqlsyntax
SELECT ...
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                     [ DIRECTED ]
                   ]
                   JOIN <object_ref2>
  [ ON <condition> ]
[ ... ]
```

```sqlsyntax
SELECT *
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                     [ DIRECTED ]
                   ]
                   JOIN <object_ref2>
  [ USING( <column_list> ) ]
[ ... ]
```

```sqlsyntax
SELECT ...
FROM <object_ref1> [
                     {
                       NATURAL [
                                 {
                                   INNER
                                   | { LEFT | RIGHT | FULL } [ OUTER ]
                                 }
                                 [ DIRECTED ]
                               ]
                       | CROSS  [ DIRECTED ]
                     }
                   ]
                   JOIN <object_ref2>
[ ... ]
```

## Parameters

`object_ref1` and `object_ref2`
:   Each object reference is a table or table-like data source.

`JOIN`
:   Use the `JOIN` keyword to specify that the tables should be joined. Combine `JOIN` with other join-related
    keywords — for example, `INNER` or `OUTER` — to specify the type of join.

    The semantics of joins are as follows (for brevity, this topic uses `o1` and
    `o2` for `object_ref1` and `object_ref2`, respectively).

    | Join Type | Semantics |
    | --- | --- |
    | `o1 INNER JOIN o2` | For each row of `o1`, a row is produced for each row of `o2` that matches according to the `ON condition` subclause. (You can also use a comma to specify an inner join. For an example, see the examples section.) If you use `INNER JOIN` without the `ON` clause, or if you use a comma without a `WHERE` clause, the result is the same as using `CROSS JOIN`: a Cartesian product; every row of `o1` paired with every row of `o2`. |
    | `o1 LEFT OUTER JOIN o2` | The result of the inner join is augmented with a row for each row of `o1` that has no matches in `o2`. The result columns referencing `o2` contain null. |
    | `o1 RIGHT OUTER JOIN o2` | The result of the inner join is augmented with a row for each row of `o2` that has no matches in `o1`. The result columns referencing `o1` contain null. |
    | `o1 FULL OUTER JOIN o2` | Returns all joined rows, plus one row for each unmatched left side row (extended with nulls on the right), plus one row for each unmatched right side row (extended with nulls on the left). |
    | `o1 CROSS JOIN o2` | For every possible combination of rows from `o1` and `o2` (that is, Cartesian product), the joined table contains a row consisting of all columns in `o1` followed by all columns in `o2`. A `CROSS JOIN` can’t be combined with an `ON condition` clause. However, you can use a `WHERE` clause to filter the results. |
    | `o1 NATURAL JOIN o2` | A `NATURAL JOIN` is identical to an explicit `JOIN` on the common columns of the two tables, except that the common columns are included only once in the output. (A natural join assumes that columns with the same name, but in different tables, contain corresponding data.) For examples, see the examples section. A `NATURAL JOIN` can be combined with an `OUTER JOIN`. A `NATURAL JOIN` can’t be combined with an `ON condition` clause because the `JOIN` condition is already implied. However, you can use a `WHERE` clause to filter the results. |

    The `DIRECTED` keyword specifies a *directed join*, which enforces the join order of the tables. The first, or left,
    table is scanned before the second, or right, table. For example, `o1 INNER DIRECTED JOIN o2` scans the `o1`
    table before the `o2` table. Directed joins are useful in the following situations:

    * You are migrating workloads into Snowflake that have join order directives.
    * You want to improve performance by scanning join tables in a specific order.

    Default: `INNER JOIN`

    If the word `JOIN` is used without specifying `INNER` or `OUTER`, then the `JOIN` is an inner join.

    If the `DIRECTED` keyword is added, the join type — for example, `INNER`, `LEFT`,
    `RIGHT`, or `FULL` — is required.

    See also:

    * [LATERAL](join-lateral.md)
    * [ASOF JOIN](asof-join.md)

`ON condition`
:   A [Boolean expression](../data-types-logical.md) that defines the rows from the two sides of the `JOIN`
    that are considered to match, for example:

    ```sqlexample
    ON object_ref2.id_number = object_ref1.id_number
    ```

    Conditions are discussed in more detail in the [WHERE](where.md) clause documentation.

    The `ON` clause is prohibited for `CROSS JOIN`.

    The `ON` clause is unnecessary, and prohibited, for
    `NATURAL JOIN` because the join columns are implied.

    For other joins, the `ON` clause is optional. However, omitting
    the `ON` clause results in a Cartesian product; every row of
    `object_ref1` paired with every row of `object_ref2`. A
    Cartesian product can produce a very large volume of output, almost all of
    which consists of pairs of rows that aren’t actually related, which consumes
    a lot of resources and is often a user error.

`USING( column_list )`
:   A list of columns in common between the two tables being joined. These
    columns are used as the join columns. The columns must have the same
    name and meaning in each of the tables being joined.

    For example, suppose that the SQL statement contains:

    ```sqlexample
    ... o1 JOIN o2
        USING (key_column)
    ```

    In the simple case, this would be equivalent to:

    ```sqlexample
    ... o1 JOIN o2
        ON o2.key_column = o1.key_column
    ```

    In the standard JOIN syntax, the projection list (the list of columns
    and other expressions after the SELECT keyword) is `*`. This causes
    the query to return the `key_column` exactly once. The columns
    are returned in the following order:

    * The columns in the `USING` clause in the order specified.
    * The left table columns not specified in the `USING` clause.
    * The right table columns not specified in the `USING` clause.

    For examples of standard and nonstandard usage, see the examples section.

## Usage notes

* The following restrictions apply to table functions other than SQL UDTFs:

  + You can’t specify the `ON`, `USING`, or `NATURAL JOIN` clause in a lateral table
    function, other than a SQL UDTF.

    For example, the following syntax is not allowed:

    ```sqlexample
    SELECT ... FROM my_table
      JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      INNER JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      JOIN TABLE(my_js_udtf(col_a))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      INNER JOIN TABLE(my_js_udtf(col_a))
      ON ... ;
    ```
  + You can’t specify the `ON`, `USING`, or `NATURAL JOIN` clause in an outer lateral join
    to a table function, other than a SQL UDTF.

    For example, the following syntax is not allowed:

    ```sqlexample
    SELECT ... FROM my_table
      LEFT JOIN TABLE(FLATTEN(input=>[a]))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      FULL JOIN TABLE(FLATTEN(input=>[a]))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      LEFT JOIN TABLE(my_js_udtf(a))
      ON ... ;
    ```

    ```sqlexample
    SELECT ... FROM my_table
      FULL JOIN TABLE(my_js_udtf(a))
      ON ... ;
    ```

    Using this syntax results in the following error:

    ```output
    000002 (0A000): Unsupported feature
      'lateral table function called with OUTER JOIN syntax
       or a join predicate (ON clause)'
    ```
  + These restrictions don’t apply if you are using a comma, rather than a JOIN keyword:

    ```sqlexample
    SELECT ... FROM my_table,
      TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
    ```

## Examples

Many of the `JOIN` examples use two tables: `t1` and `t2`. Create these tables and insert data:

```sqlexample
CREATE TABLE t1 (col1 INTEGER);

INSERT INTO t1 (col1) VALUES
  (2),
  (3),
  (4);

CREATE TABLE t2 (col1 INTEGER);

INSERT INTO t2 (col1) VALUES
  (1),
  (2),
  (2),
  (3);
```

The following examples run queries with joins:

* Run a query with an inner join
* Run a query with a left outer join
* Run a query with a right outer join
* Run a query with a full outer join
* Run a query with a cross join
* Run a query with a natural join
* Run a query that combines joins in the FROM clause
* Run queries with joins that use the USING clause

### Run a query with an inner join

The following example runs a query with an inner join:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 INNER JOIN t2
    ON t2.col1 = t1.col1
  ORDER BY 1,2;
```

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
+------+------+
```

Run the same query with an inner-directed join to enforce the join order so that the left table is scanned first:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 INNER DIRECTED JOIN t2
    ON t2.col1 = t1.col1
  ORDER BY 1,2;
```

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
+------+------+
```

### Run a query with a left outer join

The following example runs a query with a left outer join:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 LEFT OUTER JOIN t2
    ON t2.col1 = t1.col1
  ORDER BY 1,2;
```

In the output, there is a NULL value for the row in table `t1` that doesn’t have a matching row
in table `t2`:

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
|    4 | NULL |
+------+------+
```

### Run a query with a right outer join

The following example runs a query with a right outer join:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 RIGHT OUTER JOIN t2
    ON t2.col1 = t1.col1
  ORDER BY 1,2;
```

In the output, there is a NULL value for the row in table `t1` that doesn’t have a matching
row in table `t2`.

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
| NULL |    1 |
+------+------+
```

### Run a query with a full outer join

The following example runs a query with a full outer join:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 FULL OUTER JOIN t2
    ON t2.col1 = t1.col1
  ORDER BY 1,2;
```

Each table has a row that doesn’t have a matching row in the other table, so the output contains two
rows with NULL values:

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
|    4 | NULL |
| NULL |    1 |
+------+------+
```

### Run a query with a cross join

The following example runs a query with a cross join:

> **Note:**
>
> A cross join doesn’t have an ON clause.

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 CROSS JOIN t2
  ORDER BY 1, 2;
```

The output shows that the query produces a Cartesian product:

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    1 |
|    2 |    2 |
|    2 |    2 |
|    2 |    3 |
|    3 |    1 |
|    3 |    2 |
|    3 |    2 |
|    3 |    3 |
|    4 |    1 |
|    4 |    2 |
|    4 |    2 |
|    4 |    3 |
+------+------+
```

A cross join can be filtered by a `WHERE` clause, as shown in the following example:

```sqlexample
SELECT t1.col1, t2.col1
  FROM t1 CROSS JOIN t2
  WHERE t2.col1 = t1.col1
  ORDER BY 1, 2;
```

```output
+------+------+
| COL1 | COL1 |
|------+------|
|    2 |    2 |
|    2 |    2 |
|    3 |    3 |
+------+------+
```

### Run a query with a natural join

The following example shows a query with a natural join. First, create two tables and
insert data:

```sqlexample
CREATE OR REPLACE TABLE d1 (
  id NUMBER,
  name VARCHAR);

INSERT INTO d1 (id, name) VALUES
  (1,'a'),
  (2,'b'),
  (4,'c');

CREATE OR REPLACE TABLE d2 (
  id NUMBER,
  value VARCHAR);

INSERT INTO d2 (id, value) VALUES
  (1,'xx'),
  (2,'yy'),
  (5,'zz');
```

Run a query with a natural join:

```sqlexample
SELECT *
  FROM d1 NATURAL INNER JOIN d2
  ORDER BY id;
```

The output shows that a natural join produces the same output as the corresponding inner join,
except that the output doesn’t include a second copy of the join column:

```output
+----+------+-------+
| ID | NAME | VALUE |
|----+------+-------|
|  1 | a    | xx    |
|  2 | b    | yy    |
+----+------+-------+
```

The following example shows that you can combine natural joins with outer joins:

```sqlexample
SELECT *
  FROM d1 NATURAL FULL OUTER JOIN d2
  ORDER BY id;
```

```output
+----+------+-------+
| ID | NAME | VALUE |
|----+------+-------|
|  1 | a    | xx    |
|  2 | b    | yy    |
|  4 | c    | NULL  |
|  5 | NULL | zz    |
+----+------+-------+
```

### Run a query that combines joins in the FROM clause

You can combine in the `FROM` clause. Create a third table:

```sqlexample
CREATE TABLE t3 (col1 INTEGER);

INSERT INTO t3 (col1) VALUES
  (2),
  (6);
```

Run a query that chains together two joins in the FROM clause:

```sqlexample
SELECT t1.*, t2.*, t3.*
  FROM t1
    LEFT OUTER JOIN t2 ON (t1.col1 = t2.col1)
    RIGHT OUTER JOIN t3 ON (t3.col1 = t2.col1)
  ORDER BY t1.col1;
```

```output
+------+------+------+
| COL1 | COL1 | COL1 |
|------+------+------|
|    2 |    2 |    2 |
|    2 |    2 |    2 |
| NULL | NULL |    6 |
+------+------+------+
```

In such a query, the results are determined based on the joins taking place from left to right,
although the optimizer might reorder the joins if a different join order produces the same result. If the
right outer join is meant to take place before the left outer join, then write the query in the following
way:

```sqlexample
SELECT t1.*, t2.*, t3.*
FROM t1
  LEFT OUTER JOIN
    (t2 RIGHT OUTER JOIN t3 ON (t3.col1 = t2.col1))
  ON (t1.col1 = t2.col1)
ORDER BY t1.col1;
```

```output
+------+------+------+
| COL1 | COL1 | COL1 |
|------+------+------|
|    2 |    2 |    2 |
|    2 |    2 |    2 |
|    3 | NULL | NULL |
|    4 | NULL | NULL |
+------+------+------+
```

### Run queries with joins that use the USING clause

The next two examples show standard (ISO 9075) and nonstandard usage of
the `USING` clause. Both are supported by Snowflake.

This first example shows standard usage. Specifically, the projection list
contains exactly `*`:

```sqlexample
WITH
  l AS (
       SELECT 'a' AS userid
       ),
  r AS (
       SELECT 'b' AS userid
       )
SELECT *
  FROM l LEFT JOIN r USING(userid);
```

Even though the example query joins two tables, and each table has one column,
and the query asks for all columns, the output contains one column, not two:

```output
+--------+
| USERID |
|--------|
| a      |
+--------+
```

The following example shows nonstandard usage. The projection list contains
something other than `*`:

```sqlexample
WITH
  l AS (
       SELECT 'a' AS userid
     ),
  r AS (
       SELECT 'b' AS userid
       )
SELECT l.userid as UI_L,
       r.userid as UI_R
  FROM l LEFT JOIN r USING(userid);
```

The output contains two columns, and the second column contains either a value
from the second table or NULL:

```output
+------+------+
| UI_L | UI_R |
|------+------|
| a    | NULL |
+------+------+
```

---
title: LATERAL
source: https://docs.snowflake.com/en/sql-reference/constructs/join-lateral.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# LATERAL

In a [FROM](from.md) clause, the LATERAL keyword allows an inline view to reference columns
from a table expression that precedes that inline view.

A lateral join behaves more like a correlated subquery than a typical join. A lateral join
behaves as if the server executed a loop similar to the following:

```none
for each row in left_hand_table LHT:
    execute right_hand_subquery RHS using the values from the current row in the LHT
```

Unlike the output of a non-lateral join, the output from a lateral join includes only the rows
generated from the inline view. There is no need for an explicit ON clause to join rows from the left-hand
side to the right-hand side; the relationship is already established because the inline view
references columns from the left-hand table expression.

See also: [Using lateral joins](../../user-guide/lateral-join-using.md).

## When to use LATERAL

LATERAL is a valuable tool for the following use cases:

* **Chaining table functions on nested data**: When you need to flatten arrays within arrays or
  navigate multiple levels of nested JSON, each subsequent table function call must reference
  the output of the previous one. Lateral joins make this possible.
* **Calling table functions with row-specific arguments**: When a table function (such as a UDTF)
  needs to receive different input values for each row from the left-hand table.

For simple cases such as flattening a single-level array, using `TABLE(FLATTEN(...))` without
a lateral join produces the same result. Lateral joins are necessary only when the inline view must reference
columns that are only available from a preceding expression in the FROM clause.

## Syntax

```sqlsyntax
SELECT ...
  FROM <left_hand_table_expression>, LATERAL ( <inline_view> )
...
```

## Parameters

`left_hand_table_expression`
:   This is a source of rows, such as:

    > * A table.
    > * A view.
    > * A subquery.
    > * A table function.
    > * The result of an earlier join.

`inline_view`
:   The `inline_view` can be:

    > * An inline view: a view defined within the statement, and valid only for the duration of the statement.
    > * A subquery.
    > * A table function: either a built-in table function such as FLATTEN or a user-defined table function (UDTF).

    The `inline_view` can’t be a plain table reference. It must be an expression that
    can process or filter rows based on values from the left-hand table expression, such as a
    subquery with a WHERE clause or a table function call.

## Usage notes

* The inline view after the keyword LATERAL can reference columns only from the inline view itself and from
  tables to the left of the inline view in the [FROM](from.md) clause.

  ```sqlexample
  SELECT *
    FROM table_reference_me, LATERAL (...), table_do_not_reference_me ...
  ```
* Although the inline view typically references one or more columns from the `left_hand_table_expression`, it
  is not required to do so.
* Just as INNER JOIN syntax can use either the comma or the keywords INNER JOIN,
  a lateral join can also use the comma or the keywords INNER JOIN. For example:

  ```sqlexample
  FROM departments AS d INNER JOIN LATERAL (...)
  ```
* You can’t specify the ON, USING, or NATURAL JOIN clause in:

  + A lateral table function (other than a SQL UDTF)
  + An outer lateral join to a table function (other than a SQL UDTF)

  For details, see [the usage notes in the JOIN topic](join.md).
* The `left_hand_table_expression` can’t be an UNPIVOT result set. Attempting to
  reference an UNPIVOT alias in a LATERAL join causes an error. As a workaround, materialize
  the UNPIVOT result into a temporary table first, then use that table as the left-hand expression.
  For more information, see [UNPIVOT](unpivot.md).

## Examples

See also [Example: Using a lateral join with the FLATTEN table function](../../user-guide/lateral-join-using.md) and [Using FLATTEN to Filter the Results in a WHERE Clause](../../user-guide/querying-semistructured.md).

The following example uses these tables:

```sqlexample
CREATE TABLE departments (department_id INTEGER, name VARCHAR);
CREATE TABLE employees (employee_ID INTEGER, last_name VARCHAR,
  department_ID INTEGER, project_names ARRAY);

INSERT INTO departments (department_ID, name) VALUES
  (1, 'Engineering'),
  (2, 'Support');
INSERT INTO employees (employee_ID, last_name, department_ID) VALUES
  (101, 'Richards', 1),
  (102, 'Paulson',  1),
  (103, 'Johnson',  2);
```

This following query is a lateral join with a subquery:

```sqlexample
SELECT *
  FROM departments AS d,
    LATERAL (SELECT * FROM employees AS e WHERE e.department_ID = d.department_ID) AS iv2
  ORDER BY employee_ID;
```

```output
+---------------+-------------+-------------+-----------+---------------+---------------+
| DEPARTMENT_ID | NAME        | EMPLOYEE_ID | LAST_NAME | DEPARTMENT_ID | PROJECT_NAMES |
|---------------+-------------+-------------+-----------+---------------+---------------|
|             1 | Engineering |         101 | Richards  |             1 | NULL          |
|             1 | Engineering |         102 | Paulson   |             1 | NULL          |
|             2 | Support     |         103 | Johnson   |             2 | NULL          |
+---------------+-------------+-------------+-----------+---------------+---------------+
```

The following SQL statement is equivalent and produces the same output. It uses the keywords
INNER JOIN instead of the comma in the FROM clause.

```sqlexample
SELECT *
  FROM departments AS d INNER JOIN
    LATERAL (SELECT * FROM employees AS e WHERE e.department_ID = d.department_ID) AS iv2
  ORDER BY employee_ID;
```

### Chaining LATERAL FLATTEN for nested data

LATERAL is required when you need to chain multiple [FLATTEN](../functions/flatten.md)
calls to access nested data structures. In the following example, the second FLATTEN must reference
the output of the first FLATTEN, which is only possible with LATERAL.

```sqlexample
CREATE OR REPLACE TABLE persons AS
  SELECT column1 AS id, PARSE_JSON(column2) AS c
    FROM VALUES
      (12712555,
       '{ "name": { "first": "John", "last": "Smith" },
          "contact": [{ "business": [
            { "type": "phone", "content": "555-1234" },
            { "type": "email", "content": "j.smith@example.com" }
          ]}]}'),
      (98127771,
       '{ "name": { "first": "Jane", "last": "Doe" },
          "contact": [{ "business": [
            { "type": "phone", "content": "555-1236" },
            { "type": "email", "content": "j.doe@example.com" }
          ]}]}');
```

The following query uses two LATERAL FLATTEN calls. The first call flattens the `contact` array, and
the second flattens the `business` array within each contact. The second FLATTEN call references
`f.value`, which comes from the output of the first FLATTEN call.

```sqlexample
SELECT id,
    f1.value:type::STRING AS contact_type,
    f1.value:content::STRING AS contact_details
  FROM persons p,
    LATERAL FLATTEN(INPUT => p.c, PATH => 'contact') f,
    LATERAL FLATTEN(INPUT => f.value:business) f1;
```

```output
+----------+--------------+---------------------+
|       ID | CONTACT_TYPE | CONTACT_DETAILS     |
|----------+--------------+---------------------|
| 12712555 | phone        | 555-1234            |
| 12712555 | email        | j.smith@example.com |
| 98127771 | phone        | 555-1236            |
| 98127771 | email        | j.doe@example.com   |
+----------+--------------+---------------------+
```

This query can’t be written without LATERAL because the second FLATTEN call depends on the output
of the first FLATTEN call.

## LATERAL versus other approaches

The following table summarizes when to use LATERAL compared to other approaches:

| Use case | Recommendation |
| --- | --- |
| Flatten a single-level array | `TABLE(FLATTEN(...))` without LATERAL works the same. LATERAL is optional. |
| Flatten nested arrays (arrays within arrays) | LATERAL is required to chain FLATTEN calls. |
| Filter rows from another table based on the current row | Either a correlated subquery in the SELECT list or LATERAL works. LATERAL can return multiple rows and columns; a correlated subquery in SELECT can’t do this. |
| Call a table function with row-specific input | LATERAL allows the table function to receive different arguments for each row. |

---
title: LET (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/let.md
section: SQL General Reference
---

# LET (Snowflake Scripting)

Assigns an expression to a Snowflake Scripting variable, cursor, or RESULTSET.

For more information on variables, cursors, and RESULTSETs, see:

* [Working with variables](../../developer-guide/snowflake-scripting/variables.md)
* [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md)
* [Working with RESULTSETs](../../developer-guide/snowflake-scripting/resultsets.md)

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [DECLARE](declare.md)

## Syntax

```sqlsyntax
LET { <variable_assignment> | <cursor_assignment> | <resultset_assignment> }
```

The syntax for each type of assignment is described below in more detail.

* Variable assignment syntax
* Cursor assignment syntax
* RESULTSET assignment syntax

### Variable assignment syntax

Use the following syntax to assign an expression to a [variable](../../developer-guide/snowflake-scripting/variables.md).

```sqlsyntax
LET <variable_name> <type> { DEFAULT | := } <expression> ;

LET <variable_name> { DEFAULT | := } <expression> ;
```

Where:

> `variable_name`
> :   The name of the variable. The name must follow the naming rules for [object identifiers](../identifiers.md).
>
> `type`
> :   A [SQL data type](../../sql-reference-data-types.md).
>
> `DEFAULT expression` or . `:= expression`
> :   Assigns the value of `expression` to the variable.
>
>     If both `type` and `expression` are specified, the expression must evaluate to a data type that matches.

For example, the following `LET` statements declare three variables of type [NUMBER](../data-types-numeric.md),
with precision set to `38` and scale set to `2`. All three variables have a default value, using either `DEFAULT`
or `:=` to specify it.

```sqlexample
BEGIN
  ...
  LET profit NUMBER(38, 2) DEFAULT 0.0;
  LET revenue NUMBER(38, 2) DEFAULT 110.0;
  LET cost NUMBER(38, 2) := 100.0;
  ...
```

For more examples, see:

* [Working with variables](../../developer-guide/snowflake-scripting/variables.md)
* [IF statements](../../developer-guide/snowflake-scripting/branch.md)
* [Working with loops](../../developer-guide/snowflake-scripting/loops.md)
* [Examples for common use cases of Snowflake Scripting](../../developer-guide/snowflake-scripting/use-cases.md)

### Cursor assignment syntax

Use one of the following syntaxes to assign an expression to a [cursor](../../developer-guide/snowflake-scripting/cursors.md).

```sqlsyntax
LET <cursor_name> CURSOR FOR <query> ;
```

```sqlsyntax
LET <cursor_name> CURSOR FOR <resultset_name> ;
```

Where:

> `cursor_name`
> :   The name to give the cursor. This can be any valid Snowflake [identifier](../identifiers.md)
>     that is not already in use in this block. The identifier is used by other cursor-related commands, such as [FETCH (Snowflake Scripting)](fetch.md).
>
> `query`
> :   The query that defines the result set that the cursor iterates over.
>
>     This can be almost any valid SELECT statement.
>
> `resultset_name`
> :   The name of the [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md) for the cursor to operate on.

For example, the following `LET` statement declares cursor `c1` for a query:

```sqlexample
BEGIN
  ...
  LET c1 CURSOR FOR SELECT price FROM invoices;
  ...
```

For more examples, see [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md).

### RESULTSET assignment syntax

Use the following syntax to assign an expression to a [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md).

```sqlsyntax
<resultset_name> := ( <query> ) ;
```

Where:

> `resultset_name`
> :   The name to give the RESULTSET.
>
>     The name should be unique within the current scope.
>
>     The name must follow the naming rules for [Object identifiers](../identifiers.md).
>
> `DEFAULT query` or . `:= query`
> :   Assigns the value of `query` to the RESULTSET.

For example, the following `LET` statement declares RESULTSET `res` for a query:

```sqlexample
BEGIN
  ...
  LET res RESULTSET := (SELECT price FROM invoices);
  ...
```

For more examples, see [Working with RESULTSETs](../../developer-guide/snowflake-scripting/resultsets.md).

---
title: LIMIT / FETCH
source: https://docs.snowflake.com/en/sql-reference/constructs/limit.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# LIMIT / FETCH

Constrains the maximum number of rows returned by a statement or subquery. Both LIMIT (PostgreSQL syntax) and FETCH (ANSI syntax) are supported, and produce the same result.

See also:
:   [TOP <n>](top_n.md)

## Syntax

### PostgreSQL syntax

```sqlsyntax
SELECT ...
FROM ...
[ ORDER BY ... ]
LIMIT <count> [ OFFSET <start> ]
[ ... ]
```

### ANSI syntax

```sqlsyntax
SELECT ...
FROM ...
[ ORDER BY ... ]
[ OFFSET <start> ] [ { ROW | ROWS } ] FETCH [ { FIRST | NEXT } ] <count> [ { ROW | ROWS } ] [ ONLY ]
[ ... ]
```

## Parameters

`count`
:   The number of rows returned. Must be a non-negative integer constant.

    The values NULL, empty string (`''`), and `$$$$` are also accepted and are treated as
    “unlimited”; this is useful primarily for connectors and drivers (such as the JDBC driver) if they
    receive an incomplete parameter list when dynamically binding parameters to a statement.

`OFFSET` `start`
:   The row number after which the limited/fetched rows are returned. Must be a non-negative integer constant.

    If `OFFSET` is omitted, the output starts from the first row in the result set.

    The values NULL, empty string (`''`) and `$$$$` are also accepted and are treated as 0
    (i.e. do not skip any rows); this is useful primarily for connectors and drivers (such as the JDBC
    driver) if they receive an incomplete parameter list when dynamically binding parameters to a statement.

`ONLY`
:   Optional keyword that does not affect the output. It is used for emphasis to the
    human reader.

## Usage notes

* An [ORDER BY](order-by.md) clause is not required; however, without an ORDER BY clause, the results are non-deterministic
  because query results are not necessarily in any particular order. To control the results returned, use an ORDER BY clause.
* An ORDER BY clause in a subquery only guarantees ordering within that subquery. The ordering is
  not preserved in outer query levels. When a LIMIT clause depends on an ORDER BY clause from a
  different nesting level, the optimizer might not apply the LIMIT clause as expected, and the
  number of rows returned can differ from the LIMIT value. A COUNT(\*) query on the same subquery
  might also report a different number of rows from the actual number of rows returned.

  For example, in the following query the innermost subquery orders the results, the middle
  subquery limits the output to six rows, and the outer query limits the output to 100 rows. You might expect six rows
  because the inner LIMIT clause is smaller, but because the ORDER BY clause is in a different
  subquery from the LIMIT clause, results are unpredictable and the query might return more or
  fewer than six rows:

  ```sqlexample
  SELECT *
    FROM (
          SELECT *
            FROM (
                   SELECT *
                     FROM my_table
                     ORDER BY col1  -- Ordering: innermost level
                 )
            LIMIT 6                 -- LIMIT: middle level
         )
    LIMIT 100;                      -- LIMIT: outermost level
  ```

  To avoid unpredictable results, keep the ORDER BY clause and the LIMIT (or FETCH) clause at the
  same query level:

  ```sqlexample
  SELECT *
    FROM my_table
    ORDER BY col1
    LIMIT 6;
  ```
* Top-K pruning can improve the performance of queries that include both LIMIT and ORDER BY clauses. For more
  information, see [Top-K pruning for improved query performance](../../user-guide/querying-top-k-pruning-optimization.md).
* TOP `n` and LIMIT `count` are equivalent.
* Both the LIMIT clause and the [SAMPLE](sample.md) clause return a subset of rows from a table. When you use the
  LIMIT clause, Snowflake returns the specified number of rows in the fastest way possible. When you use the SAMPLE
  clause, Snowflake returns rows based on the sampling method specified in the clause.

## Examples

The following examples show the effect of LIMIT. For simplicity, these
queries omit the ORDER BY clause and assume that the output order is
always the same as shown by the first query. **Real-world queries should
include ORDER BY.**

```sqlexample
SELECT c1 FROM testtable;
```

```output
+------+
|   C1 |
|------|
|    1 |
|    2 |
|    3 |
|   20 |
|   19 |
|   18 |
|    1 |
|    2 |
|    3 |
|    4 |
| NULL |
|   30 |
| NULL |
+------+
```

```sqlexample
SELECT c1 FROM testtable LIMIT 3 OFFSET 3;
```

```output
+----+
| C1 |
|----|
| 20 |
| 19 |
| 18 |
+----+
```

```sqlexample
SELECT c1 FROM testtable ORDER BY c1;
```

```output
+------+
|   C1 |
|------|
|    1 |
|    1 |
|    2 |
|    2 |
|    3 |
|    3 |
|    4 |
|   18 |
|   19 |
|   20 |
|   30 |
| NULL |
| NULL |
+------+
```

```sqlexample
SELECT c1 FROM testtable ORDER BY c1 LIMIT 3 OFFSET 3;
```

```output
+----+
| ID |
|----|
|  2 |
|  3 |
|  3 |
+----+
```

The following examples demonstrate the use of NULLs to indicate:

* No limit to the number of rows.
* Start at row one (do not skip any rows).

  ```sqlexample
  CREATE TABLE demo1 (i INTEGER);
  INSERT INTO demo1 (i) VALUES (1), (2);
  ```

  ```sqlexample
  SELECT * FROM demo1 ORDER BY i LIMIT NULL OFFSET NULL;
  ```

  ```output
  +---+
  | I |
  |---|
  | 1 |
  | 2 |
  +---+
  ```

  ```sqlexample
  SELECT * FROM demo1 ORDER BY i LIMIT '' OFFSET '';
  ```

  ```output
  +---+
  | I |
  |---|
  | 1 |
  | 2 |
  +---+
  ```

  ```sqlexample
  SELECT * FROM demo1 ORDER BY i LIMIT $$$$ OFFSET $$$$;
  ```

  ```output
  +---+
  | I |
  |---|
  | 1 |
  | 2 |
  +---+
  ```

---
title: LISTING_ACCESS_HISTORY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-access-history.md
section: SQL General Reference
---

Schema:
:   [Data Sharing Usage](../data-sharing-usage.md)

# LISTING_ACCESS_HISTORY view

This view in the DATA_SHARING_USAGE schema can be used to explore the history of consumers’ usage of your listings.
LISTING_ACCESS_HISTORY provides object-level information about queries run against the data shares or Native Apps attached to your listings. For more information about the data provided by the LISTING_ACCESS_HISTORY view, see the Columns section.

Each row returned by LISTING_ACCESS_HISTORY represents a single time the listing was accessed by a consumer. Because the rows represent queries instead of sessions, it is likely that the same listing will appear multiple times, one row for each query.

A single consumer query can access objects from multiple listings. The QUERY_TOKEN identifies the query that generated a row in the listing access history.
To identify a collection of listing objects accessed by a single consumer query, use the QUERY_TOKEN.

The LISTING_ACCESS_HISTORY view does not allow providers to obtain any private consumer information, such as the actual text of queries. The
view also excludes any objects that are not owned by the provider account. For example, if a consumer joins data from your listing with their own data or another
provider’s data, only listing objects that you own are returned by the LISTING_ACCESS_HISTORY view.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| QUERY_TOKEN | VARCHAR | Unique ID for each query run by a consumer. A QUERY_TOKEN does not correlate with any actual query identifier on the consumer side. |
| QUERY_DATE | DATE | Date when the query was executed. |
| EXCHANGE_NAME | VARCHAR | Snowflake Marketplace or the data exchange where the listing is available. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake region where the consumption occurred. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing in the Snowflake Marketplace or data exchange that provides the share. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the share owner. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | Account name of the share owner. |
| SHARE_NAME | VARCHAR | Name of the data share that consumers accessed. When IS_SHARE is FALSE, the value is NULL. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Account name of the consumer. |
| CONSUMER_ACCOUNT_ORGANIZATION | VARCHAR | Name of the organization for the consumer account. |
| LISTING_OBJECTS_ACCESSED | ARRAY | Use SHARE_OBJECTS_ACCESSED as it contains the same data. When IS_SHARE is FALSE, the value is NULL. See LISTING_OBJECTS_ACCESSED array for formatting. |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account of the consumer is located. |
| CONSUMER_NAME | VARCHAR | Contains the name of the consumer account that accessed, used, or requested a listing. If no name is available, such as for trial accounts, the value is NULL. |
| IS_SHARE | BOOLEAN | TRUE if the access was on a share. When TRUE, the SHARE_OBJECTS_ACCESSED column provides details about the share objects accessed by the consumer query. |
| IS_APPLICATION | BOOLEAN | TRUE if the access was on an application. When TRUE, APPLICATION_OBJECTS_ACCESSED column provides details about the application objects accessed by the consumer query. |
| SHARE_OBJECTS_ACCESSED | ARRAY | Details the share objects accessed by the consumer query. When IS_SHARE is FALSE, the value is NULL. See SHARE_OBJECTS_ACCESSED array for formatting. |
| APPLICATION_OBJECTS_ACCESSED | ARRAY | Details the application objects accessed by the consumer query. When IS_APPLICATION is FALSE, the value is NULL. See APPLICATION_OBJECTS_ACCESSED array. |
| APPLICATION_PACKAGE_NAME | VARCHAR | The current name of the application package from which the application was installed. When IS_APPLICATION is FALSE, the value is NULL. |
| APPLICATION_VERSION | VARCHAR | The version of the application when the query occurred. When IS_APPLICATION is FALSE, the value is NULL |
| APPLICATION_PATCH_ID | NUMBER | The application patch number when the query occurred. When IS_APPLICATION is FALSE, the value is NULL. |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).

### SHARE_OBJECTS_ACCESSED array

The SHARE_OBJECTS_ACCESSED array provides details about the objects in a share accessed by a consumer query. The format of an item in the
array depends on the type of object that was accessed.

Functions:

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "DATABASE_NAME.SCHEMA_NAME.FUNCTION_NAME",
  "objectID": "12345",
  "objectDomain": "Function"
}
```

Stored procedures:

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "DATABASE_NAME.SCHEMA_NAME.PROCEDURE_NAME"
  "objectID":"12345"
  "objectDomain":"Procedure"
}
```

Tables, views, and columns:

```sqljson
[
  {
    "Columns": [
      {
        "columnId": ######,
        "columnName": "column1_name"
      },
      {
        "columnId": ######,
        "columnName": "column2_name"
      }
    ],
    "objectDomain":"VIEW",
    "objectId": ##view_id##,
    "objectName": "DATABASE_1.PUBLIC.VIEW_1"
  },
  {
    "Columns": [
      {
        "columnId": ######,
        "columnName": "column3_name"
      },
      {
        "columnId": ######,
        "columnName": "column4_name"
      }
    ],
    "objectDomain":"TABLE",
    "objectId": ##table_id##,
    "objectName": "DATABASE_2.PUBLIC.TABLE1"
  }
]
```

Cortex Search Services:

```sqljson
[
  {
    "objectDomain":"Cortex Search Service",
    "objectId": 12345,
    "objectName": "DATABASE_2.PUBLIC.SHARED_CKE_NAME",
    "hashedDocumentIds": [##hashed_id1##, ##hashed_id2##],
    "hashVersion": "V1"
  }
]
```

### APPLICATION_OBJECTS_ACCESSED array

The APPLICATION_OBJECTS_ACCESSED array provides details about the objects in a Native App accessed by a consumer query. The format of an item in the array depends on the type of object that was accessed.

Unlike the LISTING_OBJECTS_ACCESSED column array results, APPLICATION_OBJECTS_ACCESSED results containing object IDs are unavailable and database names are masked.

Functions:

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "23662386A408C571B77FDC53691793E4992D1C12.SCHEMA_NAME.FUNCTION_NAME",
  "objectDomain": "Function"
}
```

Stored procedures:

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "23662386A408C571B77FDC53691793E4992D1C12.SCHEMA_NAME.PROCEDURE_NAME"
  "objectDomain":"Procedure"
}
```

Tables, views, and columns:

```sqljson
[
  {
    "Columns": [
      {
        "columnName": "column1_name"
      },
      {
        "columnName": "column2_name"
      }
    ],
    "objectDomain":"VIEW",
    "objectName": "5F3297829072D2E23B852D7787825FF762E74EF3.PUBLIC.VIEW_1"
  },
  {
    "Columns": [
      {
        "columnName": "column3_name"
      },
      {
        "columnName": "column4_name"
      }
    ],
    "objectDomain":"TABLE",
    "objectName": "D85A2CE1531C6C1E077FA701713047305BDF5A83.PUBLIC.TABLE1"
  }
]
```

### LISTING_OBJECTS_ACCESSED array

Use SHARE_OBJECTS_ACCESSED Array instead.

## Examples

This section contains the following example SQL queries for the LISTING_ACCESS_HISTORY view:

* Aggregate view of access over time
* Aggregate view of access over time by consumer
* Access count by column
* Table joins
* Table joins by consumer

### Aggregate view of access over time

An aggregate view of which functions, stored procedures, tables, views, and columns have been accessed (over a certain period) and the total number of times.

```sqlexample
select
  lah.exchange_name,
  lah.listing_global_name,
  lah.share_name,
  los.value:"objectName"::string as object_name,
  coalesce(los.value:"objectDomain"::string, los.value:"objectDomain"::string) as object_type,
  count(distinct lah.query_token) as n_queries,
  count(distinct lah.consumer_account_locator) as n_distinct_consumer_accounts
from SNOWFLAKE.DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY as lah
join lateral flatten(input=>lah.listing_objects_accessed) as los
where true
  and query_date between '2022-03-01' and '2022-04-30'
group by 1,2,3,4,5
order by 1,2,3,4,5;
```

### Aggregate view of access over time by consumer

This example is similar to Aggregate view of access over time, broken down by consumer.

```sqlexample
select
  lah.exchange_name,
  lah.listing_global_name,
  lah.share_name,
  los.value:"objectName"::string as object_name,
  coalesce(los.value:"objectDomain"::string, los.value:"objectDomain"::string) as object_type,
  consumer_account_locator,
  count(distinct lah.query_token) as n_queries
from SNOWFLAKE.DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY as lah
join lateral flatten(input=>lah.listing_objects_accessed) as los
where true
  and query_date between '2022-03-01' and '2022-04-30'
group by 1,2,3,4,5,6
order by 1,2,3,4,5,6;
```

### Access count by column

For a given object (table, view), how many times each column was accessed.

```sqlexample
select
  los.value:"objectDomain"::string as object_type,
  los.value:"objectName"::string as object_name,
  cols.value:"columnName"::string as column_name,
  count(distinct lah.query_token) as n_queries,
  count(distinct lah.consumer_account_locator) as n_distinct_consumer_accounts
from SNOWFLAKE.DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY as lah
join lateral flatten(input=>lah.listing_objects_accessed) as los
join lateral flatten(input=>los.value, path=>'columns') as cols
where true
  and los.value:"objectDomain"::string in ('Table', 'View')
  and query_date between '2022-03-01' and '2022-04-30'
  and los.value:"objectName"::string = 'DATABASE_NAME.SCHEMA_NAME.TABLE_NAME'
  and lah.consumer_account_locator = 'CONSUMER_ACCOUNT_LOCATOR'
group by 1,2,3;
```

### Table joins

A view of which combination of tables are being joined together.

```sqlexample
with
accesses as (
  select
    lah.query_token,
    array_agg(distinct los.value:"objectName"::string) as object_names
  from SNOWFLAKE.DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY as lah
  join lateral flatten(input=>lah.listing_objects_accessed) as los
  where true
    and los.value:"objectDomain"::string in ('Table', 'View')
    and query_date between '2022-03-01' and '2022-04-30'
  group by 1
)
select
  object_names,
  sum(1) as n_queries
from accesses
group by 1
```

### Table joins by consumer

A view of which tables are being joined together (pairs) broken down by consumer.

```sqlexample
with
accesses as (
  select distinct
    los.value:"objectDomain"::string as object_type,
    los.value:"objectName"::string as object_name,
    lah.query_token,
    lah.consumer_account_locator
  from SNOWFLAKE.DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY as lah
  join lateral flatten(input=>lah.listing_objects_accessed) as los
  where true
    and los.value:"objectDomain"::string in ('Table', 'View')
    and query_date between '2022-03-01' and '2022-04-30'
)
select
  a1.object_name as object_name_1,
  a2.object_name as object_name_2,
  a1.consumer_account_locator as consumer_account_locator,
  count(distinct a1.query_token) as n_queries
from accesses as a1
join accesses as a2
  on a1.query_token = a2.query_token
  and a1.object_name < a2.object_name
group by 1,2,3;
```

---
title: LISTING_AUTO_FULFILLMENT_DATABASE_STORAGE_DAILY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-auto-fulfillment-database-storage-daily.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# LISTING_AUTO_FULFILLMENT_DATABASE_STORAGE_DAILY view

This view in the DATA_SHARING_USAGE schema can be used to determine the data storage used by Cross-Cloud Auto-Fulfillment. When a listing
is fulfilled to another region, the data product is stored in the region. This view contains details about how much data is stored in
specific regions, and which listings and databases the data storage is associated with.

You can use this view to help manage the costs associated with Cross-Cloud Auto-Fulfillment.
See [Auto-fulfillment costs](../../collaboration/provider-understand-cost-auto-fulfillment.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the storage usage occurred. |
| SNOWFLAKE_REGION | VARCHAR | [Snowflake region](../../user-guide/admin-account-identifier.md) where the storage usage occurred. |
| USAGE_DATE | DATE | Date in UTC when the storage usage was recorded. |
| DATABASE_NAME | VARCHAR | Name of the database. |
| SOURCE_DATABASE_ID | NUMBER | Internal ID of the source database that contains the data product shared by the provider. |
| DELETED | TIMESTAMP | Time when the database was dropped. NULL for active databases. |
| AVERAGE_DATABASE_BYTES | FLOAT | Number of bytes of database storage used, including data in [Time Travel](../../user-guide/data-time-travel.md). |
| AVERAGE_FAILSAFE_BYTES | FLOAT | Number of bytes of [Fail-safe storage](../../user-guide/data-failsafe.md) used. |
| LISTINGS | ARRAY | List of listings that reference the database in this specific region. Returns an empty array until a listing is successfully fulfilled to a region. |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).
* Stage storage is not included in this view.
* The view only contains data from 2023-04-16 onward.
* In cases where auto-fulfillment is incomplete, the array returned for the LISTINGS column might be empty.
* The view contains data for all data products, whether your data product is a Snowflake Native App or a share.

> **Important:**
>
> This view is intended to help you understand the resources used by Cross-Cloud Auto-Fulfillment. It is not intended to
> be used for billing reconciliation. Instead, refer to the views in the ORGANIZATION_USAGE schema. See
> [View actual costs](../../collaboration/provider-listings-auto-fulfillment-monitor-view-costs.md) for more details.

## Examples

Shows the average storage used in each Snowflake region over a specific time period, grouped by region and database:

```sqlexample
SELECT
   snowflake_region,
   database_name,
   listings,
   AVG(average_database_bytes) AS AVG_storage
FROM snowflake.data_sharing_usage.listing_auto_fulfillment_database_storage_daily
WHERE 1=1
   AND usage_date BETWEEN '2023-04-17' AND '2023-04-30'
GROUP BY 1,2,3
ORDER BY 4 DESC;
```

---
title: LISTING_AUTO_FULFILLMENT_REFRESH_DAILY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-auto-fulfillment-refresh-daily.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# LISTING_AUTO_FULFILLMENT_REFRESH_DAILY view

This view in the DATA_SHARING_USAGE schema can be used to determine the data refreshes performed by Cross-Cloud Auto-Fulfillment.
When a listing is fulfilled to another region, the data product is refreshed on a frequency defined by the listing provider. This view
contains details about how much data is refreshed to specific regions, and which listings and databases the data refreshes are
associated with.

You can use this view to help manage the costs associated with Cross-Cloud Auto-Fulfillment.
See [Auto-fulfillment costs](../../collaboration/provider-understand-cost-auto-fulfillment.md).

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the data refresh occurred. |
| SNOWFLAKE_REGION | VARCHAR | [Snowflake region](../../user-guide/admin-account-identifier.md) where the data refresh occurred. |
| USAGE_DATE | DATE | Date in UTC when the refresh was performed. |
| FULFILLMENT_GROUP_NAME | VARCHAR | Identifier for the auto-fulfillment group used to refresh the data. |
| BYTES_TRANSFERRED | NUMBER | Number of bytes transferred for refreshes in this day. |
| CREDITS_USED | NUMBER | Number of credits used for refreshes in this day. |
| DATABASES | ARRAY | List of databases refreshed in the auto-fulfillment group. Returns an empty array until a listing is successfully fulfilled to a region. |
| LISTINGS | ARRAY | List of listings that reference the databases in this region. Returns an empty array until a listing is successfully fulfilled to a region. |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).
* The view only contains data from 2023-04-16 onward.
* In cases where auto-fulfillment is incomplete, the array returned for the LISTINGS column might be empty.
* The view contains data for all data products, whether your data product is a Snowflake Native App or a share.

> **Important:**
>
> This view is intended to help you understand the resources used by Cross-Cloud Auto-Fulfillment. It is not intended to
> be used for billing reconciliation. Instead, refer to the views in the ORGANIZATION_USAGE schema. See
> [View actual costs](../../collaboration/provider-listings-auto-fulfillment-monitor-view-costs.md).
> for more details.

## Examples

Shows the sum of credits used to refresh the data associated with a specific auto-fulfillment group,
including the associated databases and listings:

```sqlexample
 SELECT
   fulfillment_group_name,
   databases,
   listings,
   SUM(credits_used) AS total_credits_used
FROM snowflake.data_sharing_usage.listing_auto_fulfillment_refresh_daily
GROUP BY 1,2,3
ORDER BY 4 DESC;
```

Shows top databases by credit usage for a given time period:

```sqlexample
 SELECT
   databases,
   listings,
   SUM(credits_used) AS total_credits_used
FROM snowflake.data_sharing_usage.listing_auto_fulfillment_refresh_daily
WHERE 1=1
   AND usage_date BETWEEN '2023-04-17' AND '2023-04-30'
GROUP BY 1,2
ORDER BY 3 DESC;
```

---
title: LISTING_CONSUMPTION_DAILY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-consumption-daily.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# LISTING_CONSUMPTION_DAILY view

This view in the DATA_SHARING_USAGE schema can be used to analyze consumption of a Snowflake Native App or shared data associated with listings
in a data exchange, such as the Snowflake Marketplace. The view returns a record for each consumer account that queried data for a given date.

## Columns

LISTING_CONSUMPTION_DAILY

| Field | Type | Description |
| --- | --- | --- |
| EVENT_DATE | DATETIME | Date of the consumption. |
| EXCHANGE_NAME | VARCHAR | Name of the data exchange or the Snowflake Marketplace to which the listing belongs. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake Region where the consumption occurred. |
| LISTING_NAME | VARCHAR | Identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. Unique for each listing and is used to create the listing URL. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the data product owner. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | Account name of the data product owner. |
| SHARE_NAME | VARCHAR | Share name. If your data product is a Snowflake Native App, this is NULL. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator name of the consumer. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Account name of the consumer. |
| CONSUMER_ORGANIZATION | VARCHAR | Organization name of the consumer. |
| JOBS | NUMBER | Total jobs run that day on the data product. A job is recorded when a consumer query resolves objects included in the data share or Snowflake Native App attached to the listing. |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account of the consumer is located. |
| CONSUMER_NAME | VARCHAR | Contains the company name of the consumer account that accessed, used, or requested a listing. If no name is available, such as for trial accounts, the value is NULL. |
| UNIQUE_USERS_1D | NUMBER | Count of unique users (within the consumer account) who had jobs running on the date of consumption (EVENT_DATE). |
| UNIQUE_USERS_7D | NUMBER | Count of unique users (within the consumer account) who had jobs running within the 7-day period ending on the date of consumption (EVENT_DATE). |
| UNIQUE_USERS_28D | NUMBER | Count of unique users (within the consumer account) who had jobs running within the 28-day period ending on the date of consumption (EVENT_DATE). |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).
* The view contains data for all data products, whether your data product is a Snowflake Native App or a share.

## Examples

Shows top listings by consumption for a given time period:

```sqlexample
 SELECT
   listing_name,
   listing_display_name,
   SUM(jobs) AS jobs
FROM snowflake.data_sharing_usage.listing_consumption_daily
WHERE 1=1
   AND event_date BETWEEN '2021-01-01' AND '2021-01-31'
GROUP BY 1,2
ORDER BY 3 DESC
```

Shows top consumers by listing:

```sqlexample
SELECT
  *,
  ROW_NUMBER() OVER (PARTITION BY listing_name, listing_display_name ORDER BY jobs DESC) AS rank
FROM (
  SELECT
    listing_name,
    listing_display_name,
    consumer_account_locator,
    SUM(jobs) AS jobs
  FROM snowflake.data_sharing_usage.listing_consumption_daily
  WHERE 1=1
    AND event_date BETWEEN '2021-01-01' AND '2021-01-31'
  GROUP BY 1,2,3
)
ORDER BY
  listing_name,
  listing_display_name,
  rank
```

---
title: LISTING_EVENTS_DAILY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-events-daily.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# LISTING_EVENTS_DAILY view

The LISTING_EVENTS_DAILY view in the [DATA_SHARING_USAGE](../data-sharing-usage.md) schema lets you query the daily history of
consumer activity on listings for the Snowflake Marketplace and data exchanges, including:

* Consumer installs a database from a listing.
* Consumer installs a Snowflake Native App.
* Consumer requests unlimited access to a limited trial listing or a free listing where data is not yet available.
* Consumer installs the trial data product for a paid listing or limited trial listing.
* Consumer buys a paid listing from the Snowflake Marketplace.
* Consumer decides to no longer use the paid data for a paid listing.
* Consumer uninstalls a Snowflake Native App or drops an imported database.

The view includes the history of consumer activity for a specific listing.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_DATE | DATE | Date of the event. |
| EXCHANGE_NAME | VARCHAR | Name of the data exchange the listing belongs to, such as the Snowflake Marketplace. |
| EVENT_TYPE | VARCHAR | One of:   * `GET`: Consumer creates a database for a free listing, or installs a Snowflake Native App. * `REQUEST`: Consumer requests a “by request” (personalized) listing, a limited trial listing, or a free listing that’s in a region where the data isn’t yet available. * `TRIAL`: Consumer creates a trial database or installs a trial Snowflake Native App. * `PURCHASE`: Consumer agrees to be invoiced when paid data in a paid listing is queried. * `CANCEL PURCHASE`: Consumer decides to stop using the paid data in a paid listing. * `UNINSTALL`: Consumer uninstalls a Snowflake Native App or drops an imported database. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake Region where the `REQUEST` or `GET` event occurred. |
| LISTING_NAME | VARCHAR | Identifier of the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. Unique for each listing and is used to create the listing URL. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer account. For more information about account identifiers, see [account identifier](../../user-guide/admin-account-identifier.md). |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ORGANIZATION | VARCHAR | Organization name of the consumer account. |
| CONSUMER_EMAIL | VARCHAR | Email address for the consumer account (if available). |
| TERMS_ACCEPTED_DATE | DATETIME | Timestamp when the consumer accepted the listing terms. |
| CONSUMER_METADATA | VARIANT | Other information included by the consumer when the event happened, such as their name or the reason for using a free email address. |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account of the consumer is located. |
| CONSUMER_NAME | VARCHAR | Contains the company name of the consumer account that accessed, used, or requested a listing. If a name is unavailable, such as for trial accounts, the value is NULL. |
| ACCESS_TYPE | VARCHAR | The listing access type. The access type is also called the monetization type. |
| EVENT_TIMESTAMP | DATETIME | The date and time that a listing-related event occurred. |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).
* The view contains data for all data products, whether your data product is a Snowflake Native App or a share.

## Examples

Shows daily count of gets and requests by listing:

```sqlexample
SELECT
  listing_name,
  listing_display_name,
  event_date,
  event_type,
  SUM(1) AS count_gets_requests
FROM snowflake.data_sharing_usage.listing_events_daily
GROUP BY 1,2,3,4
```

---
title: LISTING_TELEMETRY_DAILY view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/listing-telemetry-daily.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# LISTING_TELEMETRY_DAILY view

The LISTING_TELEMETRY_DAILY view in the DATA_SHARING_USAGE schema displays daily telemetry data by data exchange and region.
The view returns a row for each data exchange in your organization and each region where that data exchange is available.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| EXCHANGE_NAME | VARCHAR | Name of the data exchange the listing belongs to, such as the Snowflake Marketplace. |
| EVENT_DATE | DATE | Date of the event. |
| SNOWFLAKE_REGION | VARCHAR | Snowflake Region where the event occurred. If `NONE`, the event occurred for a user that is not signed in to a Snowflake account. |
| LISTING_NAME | VARCHAR | Identifier of the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. Unique for each listing and is used to create the listing URL. |
| EVENT_TYPE | VARCHAR | Event that occurred for the listing. Use in combination with the ACTION column. This can be one of the following:   * GET: Consumer creates a database for a free, paid, or limited trial listing, or installs a Snowflake Native App, depending on the value   of the ACTION column. * REQUEST: Consumer requests a limited trial listing or a free listing in a region where the data is not yet available. * LISTING CLICK: A user clicks the tile for a listing, such as from search or the Snowflake Marketplace page. * LISTING VIEW: A user visits the listing detail page. * UNINSTALL: Consumer uninstalls a Snowflake Native App or drops an imported database. |
| ACTION | VARCHAR | Action that was taken for the event. This can be one of the following:   * STARTED: The consumer selected Get or Request for a listing on the listing details page. * COMPLETED: can be one of the following, depending on the EVENT_TYPE:    + For an EVENT_TYPE of GET, indicates that a consumer installed a Snowflake Native App or created a database from the data product. For     paid and limited trial listings, this indicates that a consumer started a trial or purchased a data product.   + For an EVENT_TYPE of REQUEST, indicates that the provider received a listing request from the consumer.   + For an EVENT_TYPE of UNINSTALL, indicates that the consumer successfully uninstalled a Snowflake Native App or dropped an imported database. * CLICK: For a LISTING CLICK event, indicates that a consumer clicked the tile for a listing, such as from search or   the Snowflake Marketplace homepage. * VIEW: For a LISTING VIEW event, records a listing view. |
| EVENT_COUNT | INTEGER | The total number of times this event action occurred on the event date. |
| CONSUMER_ACCOUNTS_DAILY | INTEGER | The count of distinct accounts that performed the given event action above. |
| CONSUMER_ACCOUNTS_28D | INTEGER | The count of distinct consumer accounts that performed the given event action in the past 28 days. |
| REGION_GROUP | VARCHAR | [Region group](../../user-guide/admin-account-identifier.md) where the account of the consumer is located. If `NONE`, the event occurred for a user that is not signed in to a Snowflake account. |

## Usage notes

* Latency for the view may be up to 2 days.
* The data is retained for 365 days (1 year).
* The view contains data for all data products, whether your data product is a Snowflake Native App or a share.

## Examples

To review the click-through rates for each listing, run the following:

```sqlexample
SELECT
  listing_name,
  listing_display_name,
  event_date,
  SUM(IFF(event_type = 'LISTING CLICK', consumer_accounts_daily, 0)) AS listing_clicks,
  SUM(IFF(event_type IN ('GET', 'REQUEST') and action = 'STARTED', consumer_accounts_daily, 0)) AS get_request_started,
  SUM(IFF(event_type IN ('GET', 'REQUEST') and action = 'COMPLETED', consumer_accounts_daily, 0)) AS get_request_completed,
  get_request_completed / NULLIFZERO(listing_clicks) AS ctr
FROM snowflake.data_sharing_usage.LISTING_TELEMETRY_DAILY
GROUP BY 1,2,3
ORDER BY 1,2,3;
```

To get a clearer sense of how many listing views are from immediate potential customers, you can use the REGION_GROUP field to split
the total count of listing views per day by whether the view was performed by a user signed in to a Snowflake account or not:

```sqlexample
SELECT
  listing_name,
  listing_display_name,
  event_date,
  COUNT_IF(event_type= 'listing_view' AND region_group='NONE') as unknown_user_view_count,
  COUNT_IF(event_type= 'listing_view' AND region_group!='NONE') as known_user_view_count
FROM snowflake.data_sharing_usage.LISTING_TELEMETRY_DAILY
GROUP BY 1,2,3
ORDER BY 1,2,3;
```

---
title: Listings DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-listings.md
section: SQL General Reference
---

# Listings DDL

Snowflake provides a full set of DDL commands for creating and managing [listings](../collaboration/collaboration-listings-about.md).

## Listing management

* [CREATE LISTING](sql/create-listing.md)
* [ALTER LISTING](sql/alter-listing.md)
* [DESCRIBE LISTING](sql/desc-listing.md)
* [SHOW LISTINGS](sql/show-listings.md)
* [DROP LISTING](sql/drop-listing.md)

---
title: Literals and variables as identifiers with IDENTIFIER() syntax
source: https://docs.snowflake.com/en/sql-reference/identifier-literal.md
section: SQL General Reference
---

# Literals and variables as identifiers with IDENTIFIER() syntax

In Snowflake SQL statements, in addition to referring to objects by name (see [Identifier requirements](identifiers-syntax.md)), you can
also use a string literal, session variable, bind variable, or
[Snowflake Scripting variable](../developer-guide/snowflake-scripting/variables.md) to refer to an object. For example, you can
use a session variable that is set to the name of a table in the FROM clause of a SELECT statement. To use an object name
specified in a literal or variable, use IDENTIFIER() syntax.

Using IDENTIFIER() to identify database objects is a best practice because it can make
code more reusable and help to prevent [SQL injection](../developer-guide/stored-procedure/stored-procedures-usage.md) risks.

## Syntax

```sqlsyntax
IDENTIFIER( { string_literal | session_variable | bind_variable | snowflake_scripting_variable } )
```

`string_literal`
:   String identifying the name of the object:

    * The string must either be enclosed by single quotes (`'name'`) or start with a dollar sign (`$name`).
    * The string literal can be a fully-qualified object name (e.g. `'db_name.schema_name.object_name'` or `$db_name.schema_name.object_name`).

`session_variable`
:   A [SQL variable](session-variables.md) that has been set for the session.

`bind_variable`
:   A [bind variable](bind-variables.md), in the form of `?` or `:variable`, which can be used by clients/programmatic interfaces that support binding (JDBC, ODBC, Python, etc.).

`snowflake_scripting_variable`
:   A [Snowflake Scripting variable](../developer-guide/snowflake-scripting/variables.md) that has been set.

## Usage notes

* You can use literals and variables (session or bind) in some cases when you need to identify an object by name (queries, DML,
  DDL, and so on).
* You can use bind variables for object identifiers and bind variables for values in the same query.
* In a FROM clause, you can use
  `TABLE( { string_literal | session_variable | bind_variable | snowflake_scripting_variable } )` as a synonym for
  `IDENTIFIER( { string_literal | session_variable | bind_variable | snowflake_scripting_variable } )`.
* Although IDENTIFIER() uses the syntax of a function, it isn’t a true function and isn’t returned by commands such as
  SHOW FUNCTIONS.

## Examples

The following examples use the IDENTIFIER() syntax.

### Using the IDENTIFIER() syntax with string literals

These examples show you how to refer to an object when a string literal contains the
object identifier.

Create a database:

```sqlexample
CREATE OR REPLACE DATABASE IDENTIFIER('my_db');
```

```output
+--------------------------------------+
| status                               |
|--------------------------------------|
| Database MY_DB successfully created. |
+--------------------------------------+
```

Create a schema:

```sqlexample
CREATE OR REPLACE SCHEMA IDENTIFIER('my_schema');
```

```output
+----------------------------------------+
| status                                 |
|----------------------------------------|
| Schema MY_SCHEMA successfully created. |
+----------------------------------------+
```

Create a table using a case-insensitive table name specified in a string that contains the fully-qualified name:

```sqlexample
CREATE OR REPLACE TABLE IDENTIFIER('my_db.my_schema.my_table') (c1 number);
```

```output
+--------------------------------------+
| status                               |
|--------------------------------------|
| Table MY_TABLE successfully created. |
+--------------------------------------+
```

Create a table using a case-sensitive table name specified in a double-quoted string:

```sqlexample
CREATE OR REPLACE TABLE IDENTIFIER('"my_table"') (c1 number);
```

```output
+--------------------------------------+
| status                               |
|--------------------------------------|
| Table my_table successfully created. |
+--------------------------------------+
```

Show the tables in a schema:

```sqlexample
SHOW TABLES IN SCHEMA IDENTIFIER('my_schema');
```

```output
+-------------------------------+----------+---------------+-------------+-------+---------+---------+
| created_on                    | name     | database_name | schema_name | kind  | comment | ...     |
|-------------------------------+----------+---------------+-------------+-------+---------+---------|
| 2024-07-03 08:55:11.992 -0700 | MY_TABLE | MY_DB         | MY_SCHEMA   | TABLE |         | ...     |
| 2024-07-03 08:56:00.604 -0700 | my_table | MY_DB         | MY_SCHEMA   | TABLE |         | ...     |
+-------------------------------+----------+---------------+-------------+-------+---------+---------+
```

### Using the IDENTIFIER() syntax with session variables

These examples show you how to use a [session variable](session-variables.md) that has
a table name or schema name.

Set a session variable for a schema name:

```sqlexample
SET schema_name = 'my_db.my_schema';
```

Set a session variable for a table name:

```sqlexample
SET table_name = 'my_table';
```

Specify the schema for the current session:

```sqlexample
USE SCHEMA IDENTIFIER($schema_name);
```

Insert values into a table:

```sqlexample
INSERT INTO IDENTIFIER($table_name) VALUES (1), (2), (3);
```

Query a table:

```sqlexample
SELECT * FROM IDENTIFIER($table_name) ORDER BY 1;
```

```output
+----+
| C1 |
|----|
|  1 |
|  2 |
|  3 |
+----+
```

This example shows how to use a session variable that has a function name.

1. Create the function `speed_of_light`:

   > ```sqlexample
   > CREATE FUNCTION speed_of_light()
   > RETURNS INTEGER
   > AS
   >   $$
   >   299792458
   >   $$;
   > ```
2. Call the function by name:

   > ```sqlexample
   > SELECT speed_of_light();
   > ```
   >
   > ```output
   > +------------------+
   > | SPEED_OF_LIGHT() |
   > |------------------|
   > |        299792458 |
   > +------------------+
   > ```
3. Call the function by using the IDENTIFIER() syntax:

   > ```sqlexample
   > SET my_function_name = 'speed_of_light';
   > ```
   >
   > ```sqlexample
   > SELECT IDENTIFIER($my_function_name)();
   > ```
   >
   > ```output
   > +---------------------------------+
   > | IDENTIFIER($MY_FUNCTION_NAME)() |
   > |---------------------------------|
   > |                       299792458 |
   > +---------------------------------+
   > ```

### Using the IDENTIFIER() syntax with bind variables

These examples show you how to use [bind variables](bind-variables.md) to identify objects.

This example shows you how to bind a function name in JDBC. The function is named `speed_of_light`.

```java
String sql_command;

// Create a Statement object to use later.
System.out.println("Create JDBC statement.");
Statement statement = connection.createStatement();
System.out.println("Create function.");
sql_command = "CREATE FUNCTION speed_of_light() RETURNS INTEGER AS $$ 299792458 $$";
statement.execute(sql_command);

System.out.println("Create prepared statement.");
sql_command = "SELECT IDENTIFIER(?)()";
PreparedStatement ps = connection.prepareStatement(sql_command);
// Bind
ps.setString(1, "speed_of_light");
ResultSet rs = ps.executeQuery();
if (rs.next()) {
  System.out.println("Speed of light (m/s) = " + rs.getInt(1));
}
```

The following examples show a variety of SQL statements that can use binding, and a variety of database objects
that can be bound (including schema names and table names):

```sqlexample
USE SCHEMA IDENTIFIER(?);

CREATE OR REPLACE TABLE IDENTIFIER(?) (c1 NUMBER);

INSERT INTO IDENTIFIER(?) values (?), (?), (?);

SELECT t2.c1
  FROM IDENTIFIER(?) AS t1,
       IDENTIFIER(?) AS t2
  WHERE t1.c1 = t2.c1 AND t1.c1 > (?);

DROP TABLE IDENTIFIER(?);
```

### Using the IDENTIFIER() syntax with Snowflake Scripting variables

This example shows how to use a [Snowflake Scripting variable](../developer-guide/snowflake-scripting/variables.md)
for a table name in a SELECT statement:

```sqlexample
BEGIN
  LET res RESULTSET := (SELECT COUNT(*) AS COUNT FROM IDENTIFIER(:table_name));
  ...
```

---
title: Logical data types
source: https://docs.snowflake.com/en/sql-reference/data-types-logical.md
section: SQL General Reference
---

# Logical data types

This topic describes the logical data types supported in Snowflake.

## Data types

Snowflake supports a single logical data type (BOOLEAN).

### BOOLEAN

BOOLEAN can have TRUE or FALSE values. BOOLEAN can also have an UNKNOWN value, which is represented by NULL.
BOOLEAN columns can be used in expressions (for example, a [SELECT](sql/select.md) list),
as well as predicates (for example, a [WHERE](constructs/where.md) clause).

The BOOLEAN data type enables support for [Ternary logic](ternary-logic.md).

## BOOLEAN conversion

Snowflake supports conversion to and from BOOLEAN.

### Conversion to BOOLEAN

Non-BOOLEAN values can be converted to BOOLEAN values explicitly or implicitly.

#### Explicit conversion

You can explicitly convert specific [text string](data-types-text.md) and [numeric](data-types-numeric.md) values
to BOOLEAN values by using the [TO_BOOLEAN](functions/to_boolean.md) or [CAST](functions/cast.md) functions:

String conversion:
:   * Strings converted to TRUE: `'true'`, `'t'`, `'yes'`, `'y'`, `'on'`, `'1'`.
    * Strings converted to FALSE: `'false'`, `'f'`, `'no'`, `'n'`, `'off'`, `'0'`.
    * Conversion is case-insensitive.
    * Other text strings can’t be converted to BOOLEAN values.

Numeric conversion:
:   * Zero (`0`) is converted to FALSE.
    * Any non-zero value is converted to TRUE.

#### Implicit conversion

Snowflake can implicitly convert specific text string and numeric values to BOOLEAN values:

String conversion:
:   * `'true'` is converted to TRUE.
    * `'false'` is converted to FALSE.
    * Conversion is case-insensitive.

Numeric conversion:
:   * Zero (`0`) is converted to FALSE.
    * Any non-zero value is converted to TRUE.

### Conversion from BOOLEAN

BOOLEAN values can be converted to non-BOOLEAN values explicitly or implicitly.

#### Explicit conversion

You can explicitly cast BOOLEAN values to text string or numeric values:

String conversion:
:   * TRUE is converted to `'true'`.
    * FALSE is converted to `'false'`.

Numeric conversion:
:   * TRUE is converted to `1`.
    * FALSE is converted to `0`.

#### Implicit conversion

Snowflake can implicitly convert BOOLEAN values to text string values:

String conversion:
:   * TRUE is converted to `'true'`.
    * FALSE is converted to `'false'`.

## Examples

Create a table and insert values:

```sqlexample
CREATE OR REPLACE TABLE test_boolean(
  b BOOLEAN,
  n NUMBER,
  s STRING);

INSERT INTO test_boolean VALUES
  (true, 1, 'yes'),
  (false, 0, 'no'),
  (NULL, NULL, NULL);

SELECT * FROM test_boolean;
```

```output
+-------+------+------+
| B     |    N | S    |
|-------+------+------|
| True  |    1 | yes  |
| False |    0 | no   |
| NULL  | NULL | NULL |
+-------+------+------+
```

The following query includes a BOOLEAN-typed expression:

```sqlexample
SELECT b, n, NOT b AND (n < 1) FROM test_boolean;
```

```output
+-------+------+-------------------+
| B     |    N | NOT B AND (N < 1) |
|-------+------+-------------------|
| True  |    1 | False             |
| False |    0 | True              |
| NULL  | NULL | NULL              |
+-------+------+-------------------+
```

The following example uses a BOOLEAN column in predicates:

```sqlexample
SELECT * FROM test_boolean WHERE NOT b AND (n < 1);
```

```output
+-------+---+----+
| B     | N | S  |
|-------+---+----|
| False | 0 | no |
+-------+---+----+
```

The following example casts a text value to a BOOLEAN value. The example uses
the [SYSTEM$TYPEOF](functions/system_typeof.md) to show the type of the value
after the conversion.

```sqlexample
SELECT s,
       TO_BOOLEAN(s),
       SYSTEM$TYPEOF(TO_BOOLEAN(s))
  FROM test_boolean;
```

```output
+------+---------------+------------------------------+
| S    | TO_BOOLEAN(S) | SYSTEM$TYPEOF(TO_BOOLEAN(S)) |
|------+---------------+------------------------------|
| yes  | True          | BOOLEAN[SB1]                 |
| no   | False         | BOOLEAN[SB1]                 |
| NULL | NULL          | BOOLEAN[SB1]                 |
+------+---------------+------------------------------+
```

The following example casts a number value to a BOOLEAN value:

```sqlexample
SELECT n,
       TO_BOOLEAN(n),
       SYSTEM$TYPEOF(TO_BOOLEAN(n))
  FROM test_boolean;
```

```output
+------+---------------+------------------------------+
| N    | TO_BOOLEAN(N) | SYSTEM$TYPEOF(TO_BOOLEAN(N)) |
|------+---------------+------------------------------|
| 1    | True          | BOOLEAN[SB1]                 |
| 0    | False         | BOOLEAN[SB1]                 |
| NULL | NULL          | BOOLEAN[SB1]                 |
+------+---------------+------------------------------+
```

In this example, Snowflake implicitly converts a BOOLEAN value to a text value:

```sqlexample
SELECT 'Text for ' || s || ' is ' || b AS result,
       SYSTEM$TYPEOF('Text for ' || s || ' is ' || b) AS type_of_result
  FROM test_boolean;
```

```output
+----------------------+-------------------------+
| RESULT               | TYPE_OF_RESULT          |
|----------------------+-------------------------|
| Text for yes is true | VARCHAR(134217728)[LOB] |
| Text for no is false | VARCHAR(134217728)[LOB] |
| NULL                 | VARCHAR(134217728)[LOB] |
+----------------------+-------------------------+
```

---
title: Logical operators
source: https://docs.snowflake.com/en/sql-reference/operators-logical.md
section: SQL General Reference
---

# Logical operators

Logical operators return the result of a particular Boolean operation on one or two input expressions. Logical operators are also
referred to as Boolean operators.

Logical operators can only be used as a predicate (for example, in the [WHERE](constructs/where.md) clause). Input expressions must be predicates.

See also:
:   [BOOLAND](functions/booland.md) , [BOOLNOT](functions/boolnot.md) , [BOOLOR](functions/boolor.md) , [BOOLXOR](functions/boolxor.md)

## List of logical operators

| Operator | Syntax example | Description |
| --- | --- | --- |
| `AND` | `a AND b` | Matches both expressions (`a` and `b`). |
| `NOT` | `NOT a` | Doesn’t match the expression. |
| `OR` | `a OR b` | Matches either expression. |

The order of precedence of these operators is shown below (from highest to
lowest):

* NOT
* AND
* OR

## Examples

The following examples use logical operators:

* Use logical operators in queries on table data
* Use logical operators in queries on Boolean values
* Show “truth tables” for the logical operators

### Use logical operators in queries on table data

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE logical_test1 (id INT, a INT, b VARCHAR);

INSERT INTO logical_test1 (id, a, b) VALUES (1, 8, 'Up');
INSERT INTO logical_test1 (id, a, b) VALUES (2, 25, 'Down');
INSERT INTO logical_test1 (id, a, b) VALUES (3, 15, 'Down');
INSERT INTO logical_test1 (id, a, b) VALUES (4, 47, 'Up');

SELECT * FROM logical_test1;
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  1 |  8 | Up   |
|  2 | 25 | Down |
|  3 | 15 | Down |
|  4 | 47 | Up   |
+----+----+------+
```

#### Execute queries that use a single logical operator

Use a single logical operator in the WHERE clause of various queries:

```sqlexample
SELECT *
  FROM logical_test1
  WHERE a > 20 AND
        b = 'Down';
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  2 | 25 | Down |
+----+----+------+
```

```sqlexample
SELECT *
  FROM logical_test1
  WHERE a > 20 OR
        b = 'Down';
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  2 | 25 | Down |
|  3 | 15 | Down |
|  4 | 47 | Up   |
+----+----+------+
```

```sqlexample
SELECT *
  FROM logical_test1
  WHERE a > 20 OR
        b = 'Up';
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  1 |  8 | Up   |
|  2 | 25 | Down |
|  4 | 47 | Up   |
+----+----+------+
```

```sqlexample
SELECT *
  FROM logical_test1
  WHERE NOT a > 20;
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  1 |  8 | Up   |
|  3 | 15 | Down |
+----+----+------+
```

#### Show the precedence of logical operators

The following examples show the precedence of the logical operators.

The first example shows that the precedence of AND is higher than the
precedence of OR. The query returns the rows that match these conditions:

* `b` equals `Down`.

OR

* `a` equals `8` AND `b` equals `Up`.

```sqlexample
SELECT *
  FROM logical_test1
  WHERE b = 'Down' OR
        a = 8 AND b = 'Up';
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  1 |  8 | Up   |
|  2 | 25 | Down |
|  3 | 15 | Down |
+----+----+------+
```

You can use parentheses in the WHERE clause to change the precedence. For example,
the following query returns the rows that match these conditions:

* `b` equals `Down` OR `a` equals `8`.

AND

* `b` equals `Up`.

```sqlexample
SELECT *
  FROM logical_test1
  WHERE (b = 'Down' OR a = 8) AND b = 'Up';
```

```output
+----+---+----+
| ID | A | B  |
|----+---+----|
|  1 | 8 | Up |
+----+---+----+
```

The next example shows that the precedence of NOT is higher than the precedence of AND. For example,
the following query returns the rows that match these conditions:

* `a` does NOT equal `15`.

AND

* `b` equals `Down`.

```sqlexample
SELECT *
  FROM logical_test1
  WHERE NOT a = 15 AND b = 'Down';
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  2 | 25 | Down |
+----+----+------+
```

You can use parentheses in the WHERE clause to change the precedence. For example,
the following query returns the rows that do NOT match both of these conditions:

* `a` equals `15`.

AND

* `b` equals `Down`.

```sqlexample
SELECT *
  FROM logical_test1
  WHERE NOT (a = 15 AND b = 'Down');
```

```output
+----+----+------+
| ID |  A | B    |
|----+----+------|
|  1 |  8 | Up   |
|  2 | 25 | Down |
|  4 | 47 | Up   |
+----+----+------+
```

### Use logical operators in queries on Boolean values

Create a table and insert data:

```sqlexample
CREATE OR REPLACE TABLE logical_test2 (a BOOLEAN, b BOOLEAN);

INSERT INTO logical_test2 VALUES (0, 1);

SELECT * FROM logical_test2;
```

```output
+-------+------+
| A     | B    |
|-------+------|
| False | True |
+-------+------+
```

The following query uses the OR operator to return rows where either `a` or `b`
is TRUE:

```sqlexample
SELECT a, b FROM logical_test2 WHERE a OR b;
```

```output
+-------+------+
| A     | B    |
|-------+------|
| False | True |
+-------+------+
```

The following query uses the AND operator to return rows where both `a` and `b`
are both TRUE:

```sqlexample
SELECT a, b FROM logical_test2 WHERE a AND b;
```

```output
+---+---+
| A | B |
|---+---|
+---+---+
```

The following query uses the AND operator and the NOT operator to return rows where
`b` is TRUE and `a` is FALSE:

```sqlexample
SELECT a, b FROM logical_test2 WHERE b AND NOT a;
```

```output
+-------+------+
| A     | B    |
|-------+------|
| False | True |
+-------+------+
```

The following query uses the AND operator and the NOT operator to return rows where
`a` is TRUE and `b` is FALSE:

```sqlexample
SELECT a, b FROM logical_test2 WHERE a AND NOT b;
```

```output
+---+---+
| A | B |
|---+---|
+---+---+
```

### Show “truth tables” for the logical operators

The next few examples show “truth tables” for the logical operators on a Boolean column. For more information about the
behavior of Boolean values in Snowflake, see [Ternary logic](ternary-logic.md).

Create a new table and data:

```sqlexample
CREATE OR REPLACE TABLE logical_test3 (x BOOLEAN);

INSERT INTO logical_test3 (x) VALUES
  (False),
  (True),
  (NULL);
```

This shows the truth table for the OR operator:

```sqlexample
SELECT x AS "OR",
       x OR False AS "FALSE",
       x OR True AS "TRUE",
       x OR NULL AS "NULL"
  FROM logical_test3;
```

```output
+-------+-------+------+------+
| OR    | FALSE | TRUE | NULL |
|-------+-------+------+------|
| False | False | True | NULL |
| True  | True  | True | True |
| NULL  | NULL  | True | NULL |
+-------+-------+------+------+
```

This shows the truth table for the AND operator:

```sqlexample
SELECT x AS "AND",
       x AND False AS "FALSE",
       x AND True AS "TRUE",
       x AND NULL AS "NULL"
  FROM logical_test3;
```

```output
+-------+-------+-------+-------+
| AND   | FALSE | TRUE  | NULL  |
|-------+-------+-------+-------|
| False | False | False | False |
| True  | False | True  | NULL  |
| NULL  | False | NULL  | NULL  |
+-------+-------+-------+-------+
```

---
title: LOOP (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/loop.md
section: SQL General Reference
---

# LOOP (Snowflake Scripting)

A `LOOP` loop does not specify a number of iterations or a terminating condition. The user must explicitly
exit the loop by using [BREAK](break.md) or [RETURN](return.md) inside the loop.

For more information on loops, see [Working with loops](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [BREAK](break.md), [CONTINUE](continue.md), [RETURN](return.md)

## Syntax

```sqlsyntax
LOOP
    <statement>;
    [ <statement>; ... ]
END LOOP [ <label> ] ;
```

Where:

> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).
>
> `label`
> :   An optional label. Such a label can be a jump target for a [BREAK](break.md) or
>     [CONTINUE](continue.md) statement. A label must follow the naming rules for
>     [Object identifiers](../identifiers.md).

## Usage notes

* A `LOOP` repeats until a `BREAK` or `RETURN` is executed. The `BREAK` or `RETURN` command is almost always
  inside a conditional expression (e.g. `IF` or `CASE`).
* A loop can contain multiple statements. You can use, but are not required to use, a [BEGIN … END](begin.md)
  [block](../../developer-guide/snowflake-scripting/blocks.md) to contain those statements.

## Examples

This loop inserts predictable test data into a table:

```sqlexample
CREATE TABLE dummy_data (ID INTEGER);

CREATE PROCEDURE break_out_of_loop()
RETURNS INTEGER
LANGUAGE SQL
AS
$$
    DECLARE
        counter INTEGER;
    BEGIN
        counter := 0;
        LOOP
            counter := counter + 1;
            IF (counter > 5) THEN
                BREAK;
            END IF;
            INSERT INTO dummy_data (ID) VALUES (:counter);
        END LOOP;
        RETURN counter;
    END;
$$
;
```

Here is the output of executing the stored procedure:

```sqlexample
CALL break_out_of_loop();
+-------------------+
| BREAK_OUT_OF_LOOP |
|-------------------|
|                 6 |
+-------------------+
```

Here is the content of the table after calling the stored procedure:

```sqlexample
SELECT *
    FROM dummy_data
    ORDER BY ID;
+----+
| ID |
|----|
|  1 |
|  2 |
|  3 |
|  4 |
|  5 |
+----+
```

For more examples, see [LOOP loop](../../developer-guide/snowflake-scripting/loops.md).

---
title: Machine learning model DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-model.md
section: SQL General Reference
---

# Machine learning model DDL

The following DDL commands are used to create, view, and manage machine-learning models and their versions.

A model is a schema-level object that contains a machine learning model that has been trained and stored in the Snowpark
ML Registry. Model commands let you create and manage models in SQL. You can also create and manage models in Python
using the Snowpark ML Registry API.

Model monitors allow you to monitor the performance of machine learning models you have deployed in Snowflake.

## Machine learning models

|  |  |
| --- | --- |
| [CREATE MODEL](sql/create-model.md) | Creates a new machine learning model in the current/specified schema or replaces an existing model. |
| [ALTER MODEL](sql/alter-model.md) | Modifies the properties for an existing model, including its name, tags, default version, or comment. |
| [SHOW MODELS](sql/show-models.md) | Lists the machine learning models that you have privileges to access. |
| [DROP MODEL](sql/drop-model.md) | Removes a machine learning model from the current/specified schema. |

## Machine learning model versions

|  |  |
| --- | --- |
| [ALTER MODEL … ADD VERSION](sql/alter-model-add-version.md) | Adds a new version to an existing model from an internal stage. |
| [ALTER MODEL … DROP VERSION](sql/alter-model-drop-version.md) | Removes a version from an existing model. |
| [ALTER MODEL … MODIFY VERSION](sql/alter-model-modify-version.md) | Modifies a version of a model, changing the version’s comment or metadata. |
| [SHOW VERSIONS IN MODEL](sql/show-versions-in-model.md) | Lists the versions in a machine learning model. |

## Machine learing model functions

|  |  |
| --- | --- |
| [SHOW FUNCTIONS IN MODEL](sql/show-functions-in-model.md) | Shows the models (methods) attached to a machine learing model. |

## Machine learning model monitors

|  |  |
| --- | --- |
| [CREATE MODEL MONITOR](sql/create-model-monitor.md) | Create a new model monitor. |
| [ALTER MODEL MONITOR](sql/alter-model-monitor.md) | Modify the properties of an existing model monitor, including its refresh interval and warehouse, or suspend or resume it. |
| [SHOW MODEL MONITORS](sql/show-model-monitors.md) | Lists the model monitors that you have privileges to access. |
| [DESCRIBE MODEL MONITOR](sql/desc-model-monitor.md) | Shows the properties of a model monitor. |
| [DROP MODEL MONITOR](sql/drop-model-monitor.md) | Removes a model monitor from the current/specified schema. |

---
title: MATCH_RECOGNIZE
source: https://docs.snowflake.com/en/sql-reference/constructs/match_recognize.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# MATCH_RECOGNIZE

Recognizes matches of a pattern in a set of rows. `MATCH_RECOGNIZE` accepts a set of rows (from a table,
view, subquery, or other source) as input, and returns all matches for a given row pattern within this
set. The pattern is defined similarly to a regular expression.

The clause can return either:

* All the rows belonging to each match.
* One summary row per match.

`MATCH_RECOGNIZE` is typically used to detect events in time series. For example, `MATCH_RECOGNIZE` can search a
stock price history table for shapes like `V` (down followed by up) or `W` (down, up, down, up).

`MATCH_RECOGNIZE` is an optional subclause of the [FROM](from.md) clause.

> **Note:**
>
> You cannot use the MATCH_RECOGNIZE clause in a **recursive** [common table expression (CTE)](../../user-guide/queries-cte.md).

See also:
:   [Identifying Sequences of Rows That Match a Pattern](../../user-guide/match-recognize-introduction.md)

## Syntax

```sqlsyntax
MATCH_RECOGNIZE (
    [ PARTITION BY <expr> [, ... ] ]
    [ ORDER BY <expr> [, ... ] ]
    [ MEASURES <expr> [AS] <alias> [, ... ] ]
    [ ONE ROW PER MATCH |
      ALL ROWS PER MATCH [ { SHOW EMPTY MATCHES | OMIT EMPTY MATCHES | WITH UNMATCHED ROWS } ]
      ]
    [ AFTER MATCH SKIP
          {
          PAST LAST ROW   |
          TO NEXT ROW   |
          TO [ { FIRST | LAST} ] <symbol>
          }
      ]
    PATTERN ( <pattern> )
    DEFINE <symbol> AS <expr> [, ... ]
)
```

## Required subclauses

### DEFINE: Defining symbols

```sqlsyntax
DEFINE <symbol1> AS <expr1> [ , <symbol2> AS <expr2> ]
```

Symbols (also known as “pattern variables”) are the building blocks of the pattern.

A symbol is defined by an expression. If the expression evaluates to true for a row, the symbol is assigned to
that row. A row can be assigned multiple symbols.

Symbols that are not defined in the `DEFINE` clause, but are used in the pattern, are always assigned to all
rows. Implicitly, they are equivalent to the following example:

```sqlexample
...
define
    my_example_symbol as true
...
```

Patterns are defined based on symbols and operators.

### PATTERN: Specifying the pattern to match

```sqlsyntax
PATTERN ( <pattern> )
```

The pattern defines a valid sequence of rows that represents a match. The pattern is defined like a regular
expression (regex) and is built from symbols, operators, and quantifiers.

For example, suppose that symbol `S1` is defined as `stock_price < 55`, and symbol `S2` is defined
as `stock price > 55`. The following pattern specifies a sequence of rows in which the stock price increased
from less than 55 to greater than 55:

```none
PATTERN (S1 S2)
```

The following is a more complex example for a pattern definition:

```none
^ S1 S2*? ( {- S3 -} S4 )+ | PERMUTE(S1, S2){1,2} $
```

The following section describes the individual components of this pattern in detail.

> **Note:**
>
> MATCH_RECOGNIZE uses [backtracking](https://en.wikipedia.org/wiki/Backtracking) to match patterns. As is the case with other
> [regular expression engines that use backtracking](https://en.wikipedia.org/wiki/Regular_expression#Implementations_and_running_times),
> some combinations of patterns and data to match can take a long time to execute, which can result in high computation costs.
>
> To improve performance, define a pattern that is as specific as possible:
>
> * Make sure that each row matches only one symbol or a small number of symbols
> * Avoid using symbols that match every row (e.g. symbols not in the `DEFINE` clause or symbols that are defined as true)
> * Define an upper limit for quantifiers (e.g. `{,10}` instead of `*`).
>
> For example, the following pattern can result in increased costs if no rows match:
>
> ```sqlexample
> symbol1+ any_symbol* symbol2
> ```
>
> If there is an upper limit to the number of rows that you want to match, you can specify that limit in the quantifiers to
> improve performance. In addition, rather than specifying that you want to find `any_symbol` that follows `symbol1`, you can
> look for a row that is not `symbol1` (`not_symbol1`, in this example);
>
> ```sqlexample
> symbol1{1,limit} not_symbol1{,limit} symbol2
> ```
>
> In general, you should monitor the query execution time to verify that the query is not taking longer than expected.

Symbols:

A symbol matches to a row that symbol was assigned to. The following symbols are available:

* `symbol`. For example, `S1`, … , `S4`
  Those are symbols that were defined in the `DEFINE` subclause and are evaluated per row.
  (These can also include symbols that were not defined and are automatically assigned to all rows.)
* `^` (Start of partition.)
  This is a virtual symbol that denotes the start of a partition and has no row associated with it. You can use it
  to require a match to start only at the beginning of a partition.

  For an example, see [Matching Patterns Relative to the Beginning or End of a Partition](../../user-guide/match-recognize-introduction.md).
* `$` (End of partition.)
  This is a virtual symbol that denotes the end of a partition and has no row associated with it. You can use it
  to require a match to end only at the end of a partition.

  For an example, see [Matching Patterns Relative to the Beginning or End of a Partition](../../user-guide/match-recognize-introduction.md).

Quantifiers:

A quantifier can be placed following a symbol or operation. A quantifier denotes the minimum and maximum number of
occurrences of the associated symbol or operation. The following quantifiers are available:

> | Quantifier | Meaning |
> | --- | --- |
> | `+` | 1 or more. For example, `( {- S3 -} S4 )+`. |
> | `*` | 0 or more. For example, `S2*?`. |
> | `?` | 0 or 1. |
> | `{n}` | Exactly n. |
> | `{n,}` | n or more. |
> | `{,m}` | 0 to m. |
> | `{n, m}` | n to m. For example, `PERMUTE(S1, S2){1,2}`. |

By default, quantifiers are in “greedy mode”, which means they try to match the maximum quantity if possible. To put a
quantifier into “reluctant mode”, in which the quantifier tries to match the minimum quantity if possible,
place a `?` after the quantifier (e.g. `S2*?`).

Operators:

Operators specify in which order symbols or other operations should occur in the sequence of rows to form a valid
match. The following operators are available:

> | Operator | Meaning |
> | --- | --- |
> | `... ...` (space) | Concatenation. Specifies that a symbol or operation should follow another one. For example, `S1 S2` means that the condition defined for `S2` should occur after the condition defined for `S1`. |
> | `{- ... -}` | Exclusion. Excludes the contained symbols or operations from the output. For example, `{- S3 -}` excludes operator `S3` from the output. Excluded rows will not appear in the output, but will be included in the evaluation of `MEASURES` expressions. |
> | `( ... )` | Grouping. Used to override the precedence of an operator or to apply the same quantifier for symbols or operations in the group. In this example, the quantifier `+` applies to the sequence `{- S3 -} S4`, not merely `S4`. |
> | `PERMUTE(..., ...)` | Permutation. Matches any permutation of the specified patterns. For example, `PERMUTE(S1, S2)` matches either `S1 S2` or `S2 S1`. `PERMUTE()` takes an unlimited number of arguments. |
> | `... | ...` | Alternative. Specifies that either the first symbol or operation or the other one should occur. For example, `( S3 S4 ) | PERMUTE(S1, S2)`. The alternative operator has precedence over the concatenation operator. |

## Optional subclauses

### ORDER BY: Sorting the rows before matching

`ORDER BY orderItem1 [ , orderItem2 ... ]`
:   Where:

    > ```none
    > orderItem ::= { <column_alias> | <expr> } [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ]
    > ```

    Define the order of the rows as you would for
    [window functions](../functions-window-syntax.md). This is the order in which the individual rows of
    each partition are passed to the `MATCH_RECOGNIZE` operator.

    For more information, see [Partitioning and Sorting the Rows](../../user-guide/match-recognize-introduction.md).

### PARTITION BY: Partitioning the rows into windows

`PARTITION BY <expr1> [ , <expr2> ... ]`
:   Partition the input set of rows as you would for [window functions](../functions-window-syntax.md).
    `MATCH_RECOGNIZE` performs matching individually for each resulting partition.

    Partitioning not only groups rows that are related to each other, but also leverages Snowflake’s
    distributed data processing capability because separate partitions can be processed in parallel.

    For more information about partitioning, see [Partitioning and Sorting the Rows](../../user-guide/match-recognize-introduction.md).

### MEASURES: Specifying additional output columns

```sqlsyntax
MEASURES <expr1> [AS] <alias1> [ ... , <exprN> [AS] <aliasN> ]
```

“Measures” are optional additional columns that are added to the output of the `MATCH_RECOGNIZE` operator.
The expressions in the `MEASURES` subclause have the same capabilities as the expressions in the `DEFINE`
subclause. For further information, see Symbols.

Within the `MEASURES` subclause, the following functions specific to `MATCH_RECOGNIZE` are available:

* `MATCH_NUMBER()`
  Returns the sequential number of the match. The MATCH_NUMBER starts from 1, and is incremented for each match.
* `MATCH_SEQUENCE_NUMBER()`
  Returns the row number within a match. The MATCH_SEQUENCE_NUMBER is sequential and starts from 1.
* `CLASSIFIER()`
  Returns a TEXT value that contains the symbol that the respective row matched. For example, if a row matched
  the symbol `GT75`, then the `CLASSIFIER` function returns the string “GT75”.

> **Note:**
>
> When specifying measures, remember the restrictions mentioned in the
> Limitations on window functions used in DEFINE and MEASURES section.

### ROW(S) PER MATCH: Specifying the rows to return

```sqlsyntax
{
  ONE ROW PER MATCH  |
  ALL ROWS PER MATCH [ { SHOW EMPTY MATCHES | OMIT EMPTY MATCHES | WITH UNMATCHED ROWS } ]
}
```

Specifies which rows are returned for a successful match. This subclause is optional.

* `ALL ROWS PER MATCH`: Return all rows in the match.
* `ONE ROW PER MATCH`: Return one summary row for each match, regardless of how many rows are in the match.
  This is the default.

Be aware of the following special cases:

* Empty Matches: An empty match happens if a pattern is able to match against zero rows. For instance, if the pattern
  is defined as `A*` and the first row at the beginning of a matching attempt is assigned to symbol `B`, then an
  empty match including only that row is generated, because the `*` quantifier in the `A*` pattern allows 0
  occurrences of `A` to be treated as a match. The `MEASURES` expressions are evaluated differently for this row:

  + The CLASSIFIER function returns NULL.
  + Window functions return NULL.
  + The COUNT function returns 0.
* Unmatched Rows: If a row was not matched against the pattern, it is called an unmatched row. `MATCH_RECOGNIZE` can
  be configured to return unmatched rows, too. For unmatched rows, expressions in the `MEASURES` subclause return
  NULL.

* Exclusions

  The exclusion syntax `({- ... -})` in the pattern definition allows users to exclude certain rows from the output.
  If all matched symbols in the pattern were excluded, no row is generated for that match if `ALL ROWS PER MATCH` was
  specified. Note that the MATCH_NUMBER is incremented anyway. Excluded rows are not part of the result, but are
  included for the evaluation of `MEASURES` expressions.

  When using the exclusion syntax, the ROWS PER MATCH subclause can be specified as follows:

  + ONE ROW PER MATCH (default)

    Returns exactly one row for each successful match. The default window function semantic for window functions in
    the `MEASURES` subclause is `FINAL`.

    The output columns of the `MATCH_RECOGNIZE` operator are all expressions given in the `PARTITION BY` subclause
    and all `MEASURES` expressions. All resulting rows of a match are grouped by the expressions given in the
    `PARTITION BY` subclause and the `MATCH_NUMBER` using the `ANY_VALUE` aggregation function for all measures.
    Therefore, if measures evaluate to a different value for different rows of the same match, then the output is
    non-deterministic.

    Omitting the `PARTITION BY` and `MEASURES` subclause results in an error indicating that the result does not
    include any columns.

    For empty matches, a row is generated. Unmatched rows are not part of the output.
  + `ALL ROWS PER MATCH`

    Returns a row for each row that is part of the match, except for rows that were matched to a portion of the
    pattern that was marked for exclusion.

    Excluded rows are still taken into account in computations in the `MEASURES` subclause.

    Matches might overlap based on the `AFTER MATCH SKIP TO` subclause, so the same row might appear multiple times
    in the output.

    The default window function semantic for window functions in the `MEASURES` subclause is `RUNNING`.

    The output columns of the `MATCH_RECOGNIZE` operator are the columns of the set of rows being input and the
    columns defined in the `MEASURES` subclause.

    The following options are available for `ALL ROWS PER MATCH`:

    - `SHOW EMPTY MATCHES (default)`
      Add empty matches to the output. Unmatched rows are not output.
    - `OMIT EMPTY MATCHES`
      Neither empty matches nor unmatched rows are output. However, the MATCH_NUMBER is still incremented by an
      empty match.
    - `WITH UNMATCHED ROWS`
      Adds empty matches and unmatched rows to the output. If this clause is used, then the pattern must not contain
      exclusions.

  For an example that uses exclusion to reduce irrelevant output, see
  [Search for Patterns in Non-Adjacent Rows](../../user-guide/match-recognize-introduction.md).

### AFTER MATCH SKIP: Specifying where to continue after a match

```sqlsyntax
AFTER MATCH SKIP
{
    PAST LAST ROW   |
    TO NEXT ROW   |
    TO [ { FIRST | LAST} ] <symbol>
}
```

This subclause specifies where to continue the matching after a positive match was found.

* `PAST LAST ROW (default)`

  Continue matching after the last row of the current match.

  This prevents matches that contain overlapping rows. For example, if you have a stock pattern that contains
  3 `V` shapes in a row, then `PAST LAST ROW` finds one `W` pattern, not two.
* `TO NEXT ROW`

  Continue matching after the first row of the current match.

  This allows matches that contain overlapping rows. For example, if you have a stock pattern that contains 3 `V`
  shapes in a row, then `TO NEXT ROW` finds two `W` patterns (the first pattern is based on the first two `V`
  shapes, and the second `W` shape is based on the second and third `V` shapes; thus both patterns contain the
  same `V`).
* `TO [ { FIRST | LAST } ] <symbol>`

  Continue matching at the first or last (default) row that was matched to the given symbol.

  At least one row needs to be mapped to the given symbol or an error is raised.

  If this does not skip past the first row of the current match, then an error is raised.

## Usage notes

### Expressions in DEFINE and MEASURES clauses

The `DEFINE` and `MEASURES` clauses allow expressions. Those expressions can be complex and can include
[window functions](../functions-window-syntax.md) and special navigational functions (which are a type of
window function).

In most respects, expressions in `DEFINE` and `MEASURES` follow the rules for expressions elsewhere in Snowflake
SQL syntax. However, there are some differences, which are described below:

Window Functions:
:   Navigational functions allow references to other rows besides the current row. For example, to create an expression
    that defines a drop in price, you need to compare the price in one row to the price in another row.
    The navigational functions are:

    * `PREV( expr [ , offset [, default ] ] )`
      Navigate to the previous row within the current match in the MEASURES subclause.

      This function is currently not available in the DEFINE subclause. Instead, you can use [LAG](../functions/lag.md) which
      navigates to the previous row within the current [window frame](../functions-window-syntax.md).
    * `NEXT( expr [ , offset [ , default ] ] )`
      Navigate to the next row within the current [window frame](../functions-window-syntax.md). This function is
      equivalent to [LEAD](../functions/lead.md).
    * `FIRST( expr )`
      Navigate to the first row of the current match in the MEASURES subclause.

      This function is currently not available in the DEFINE subclause. Instead, you can use [FIRST_VALUE](../functions/first_value.md)
      which navigates to the first row of the current [window frame](../functions-window-syntax.md).
    * `LAST( expr )`
      Navigate to the last row of the current [window frame](../functions-window-syntax.md). This function is similar to
      [LAST_VALUE](../functions/last_value.md), but for LAST the window frame is limited to the current row of the current matching
      attempt when LAST is used within the DEFINE subclause.

    For an example that uses the navigational functions, see
    [Returning Information About the Match](../../user-guide/match-recognize-introduction.md).

    In general, when a window function is used inside a `MATCH_RECOGNIZE` clause, the window function does not require
    its own `OVER (PARTITION BY ... ORDER BY ...)` clause. The window is implicitly determined by
    the `PARTITION BY` and `ORDER BY` of the `MATCH_RECOGNIZE` clause. (However, see
    Limitations on window functions used in DEFINE and MEASURES for some exceptions.)

    In general, the [window frame](../functions-window-syntax.md) is also derived implicitly from the current context in
    which the window function is being used. The lower bound of the frame is defined as described below:

    > In the `DEFINE` subclause:
    >
    > > The frame starts at the beginning of the current matching attempt except when using `LAG`, `LEAD`,
    > > `FIRST_VALUE`, and `LAST_VALUE`.
    >
    > In the `MEASURES` subclause:
    >
    > > The frame starts at the beginning of the match that was found.

    The edges of the window frame can be specified by using either `RUNNING` or `FINAL` semantics.

    > ```sqlsyntax
    > expr ::= ... [ { RUNNING | FINAL } ] windowFunction ...
    > ```
    >
    > `RUNNING`:
    >
    > > In general, the frame ends at the current row. However, the following exceptions exist:
    > >
    > > * In the `DEFINE` subclause, for `LAG`, `LEAD`, `FIRST_VALUE`, `LAST_VALUE`, and `NEXT`,
    > >   the frame ends at the last row of the window.
    > > * In the `MEASURES` subclause, for `PREV`, `NEXT`, `LAG`, and `LEAD`, the frame ends at the
    > >   last row of the window.
    > >
    > > In the `DEFINE` subclause, `RUNNING` is the default (and the only allowed) semantic.
    > >
    > > In the `MEASURES` subclause, when the `ALL ROWS PER MATCH` subclause is used, `RUNNING` is the
    > > default.
    >
    > `FINAL`:
    >
    > > The frame ends at the last row of the match.
    > >
    > > `FINAL` is allowed only in the `MEASURES` subclause. It is the default there when
    > > `ONE ROW PER MATCH` applies.

Symbol Predicates:
:   Expressions within the `DEFINE` and `MEASURES` subclauses allow symbols as predicates for column references.

    ```sqlsyntax
    predicatedColumnReference ::= <symbol>.<column>
    ```

    The `<symbol>` indicates a row that was matched, and the `<column>` identifies a specific column within that row.

    A predicated column reference means that the surrounding window function only looks at rows that were finally
    mapped to the specified symbol.

    Predicated column references can be used outside and inside of a window function. If used outside of a window
    function, `<symbol>.<column>` is the same as `LAST(<symbol>.<column>)`. Inside of a window function,
    all column references either need to be predicated with the same symbol or are all non-predicated.

    The following explains how navigational-related functions behave with predicated column references:

    * `PREV/LAG( ... <symbol>.<column> ... , <offset>)`
      Searches the window frame backwards starting from and including the current row (or last row in case of a
      `FINAL` semantic) for the first row that was finally mapped to the specified `<symbol>`, and then
      goes `<offset>` (default is 1) rows backwards, ignoring the symbol those rows were mapped to. If the searched part of the
      frame does not contain a row mapped to `<symbol>` or the search would go beyond the edge of the frame, then NULL is returned.
    * `NEXT/LEAD( ... <symbol>.<column> ... , <offset>)`
      Searches the window frame backwards starting from and including the current row (or last row in case of
      a `FINAL` semantic) for the first row that was finally mapped to the specified `<symbol>`, and then
      goes `<offset>` (default is 1) rows forward, ignoring the symbol those rows were mapped to. If the searched part of the
      frame does not contain a row mapped to `<symbol>` or the search would go beyond the edge of the frame, then NULL is returned.
    * `FIRST/FIRST_VALUE( ... <symbol>.<column> ... )`
      Searches the window frame forwards starting from and including the first row up to and including the current row
      (or last row in case of a `FINAL` semantic) for the first row that was finally mapped to the specified `<symbol>`.
      If the searched part of the frame does not contain a row mapped to `<symbol>`, NULL is returned.
    * `LAST/LAST_VALUE( ... <symbol>.<column> ... )`
      Searches the window frame backwards starting from and including the current row (or last row in case of a
      `FINAL` semantic) for the first row that was finally mapped to the specified `<symbol>`. If the searched part of
      the frame does not contain a row mapped to `<symbol>`, NULL is returned.

    > **Note:**
    >
    > Restrictions on window functions are documented in the Limitations on window functions used in DEFINE and MEASURES section.

### Limitations on window functions used in DEFINE and MEASURES

Expressions in the `DEFINE` and `MEASURES` subclauses can include window functions. However, there are some
limitations on using window functions in these subclauses. These limitations are shown in the table below:

> | Function | DEFINE (Running) [column/symbol.column] | MEASURES (Running) [column/symbol.column] | MEASURES (Final) [column/symbol.column] |
> | --- | --- | --- | --- |
> | Column | ✔ / ❌ | ✔ / ❌ | ✔ / ✔ |
> | PREV(…) | ❌ / ❌ | ✔ / ❌ | ✔ / ❌ |
> | NEXT(…) | ✔ / ❌ | ✔ / ❌ | ✔ / ❌ |
> | FIRST(…) | ❌ / ❌ | ✔ / ❌ | ✔ / ✔ |
> | LAST(…) | ✔ / ❌ | ✔ / ❌ | ✔ / ✔ |
> | LAG() | ✔ / ❌ | ✔ / ❌ | ✔ / ❌ |
> | LEAD() | ✔ / ❌ | ✔ / ❌ | ✔ / ❌ |
> | FIRST_VALUE() | ✔ / ❌ | ✔ / ❌ | ✔ / ✔ |
> | LAST_VALUE() | ✔ / ❌ | ✔ / ❌ | ✔ / ✔ |
> | Aggregations [1] | ✔ / ❌ | ✔ / ✔ | ✔ / ✔ |
> | Other window functions [1] | ✔ / ❌ | ✔ / ❌ | ✔ / ❌ |

[1]
(1,2)

These functions require an explicit frame definition `(OVER (ROWS BETWEEN ...))` when used in the `DEFINE` clause.

The `MATCH_RECOGNIZE`-specific functions `MATCH_NUMBER()`, `MATCH_SEQUENCE_NUMBER()`, and `CLASSIFIER()` are
currently not available in the `DEFINE` subclause.

## Troubleshooting

### Error message: “SELECT with no columns” when using ONE ROW PER MATCH

When you use the `ONE ROW PER MATCH` clause, only columns and expressions from the `PARTITION BY` and `MEASURES`
subclauses are allowed in the projection clause of the SELECT. If you try to use `MATCH_RECOGNIZE` without either a
`PARTITION BY` or `MEASURES` clause, you get an error similar to `SELECT with no columns`.

For more information about `ONE ROW PER MATCH` vs. `ALL ROWS PER MATCH`,
see [Generating One Row for Each Match vs Generating All Rows for Each Match](../../user-guide/match-recognize-introduction.md).

## Examples

The topic [Identifying Sequences of Rows That Match a Pattern](../../user-guide/match-recognize-introduction.md) contains many examples, including some that are simpler than
most of the examples here. If you are not already familiar with `MATCH_RECOGNIZE`, then you might want to read those
examples first.

Some of the examples below use the following table and data:

> ```sqlexample
> create table stock_price_history (company TEXT, price_date DATE, price INT);
> ```
>
> ```sqlexample
> insert into stock_price_history values
>     ('ABCD', '2020-10-01', 50),
>     ('XYZ' , '2020-10-01', 89),
>     ('ABCD', '2020-10-02', 36),
>     ('XYZ' , '2020-10-02', 24),
>     ('ABCD', '2020-10-03', 39),
>     ('XYZ' , '2020-10-03', 37),
>     ('ABCD', '2020-10-04', 42),
>     ('XYZ' , '2020-10-04', 63),
>     ('ABCD', '2020-10-05', 30),
>     ('XYZ' , '2020-10-05', 65),
>     ('ABCD', '2020-10-06', 47),
>     ('XYZ' , '2020-10-06', 56),
>     ('ABCD', '2020-10-07', 71),
>     ('XYZ' , '2020-10-07', 50),
>     ('ABCD', '2020-10-08', 80),
>     ('XYZ' , '2020-10-08', 54),
>     ('ABCD', '2020-10-09', 75),
>     ('XYZ' , '2020-10-09', 30),
>     ('ABCD', '2020-10-10', 63),
>     ('XYZ' , '2020-10-10', 32);
> ```

The following graph shows the shapes of the curves:

### Report one summary row for each `V` shape

The following query searches for all `V` shapes in the previously presented stock_price_history. The output is
explained in more detail after the query and output.

> ```sqlexample
> SELECT * FROM stock_price_history
>   MATCH_RECOGNIZE(
>     PARTITION BY company
>     ORDER BY price_date
>     MEASURES
>       MATCH_NUMBER() AS match_number,
>       FIRST(price_date) AS start_date,
>       LAST(price_date) AS end_date,
>       COUNT(*) AS rows_in_sequence,
>       COUNT(row_with_price_decrease.*) AS num_decreases,
>       COUNT(row_with_price_increase.*) AS num_increases
>     ONE ROW PER MATCH
>     AFTER MATCH SKIP TO LAST row_with_price_increase
>     PATTERN(row_before_decrease row_with_price_decrease+ row_with_price_increase+)
>     DEFINE
>       row_with_price_decrease AS price < LAG(price),
>       row_with_price_increase AS price > LAG(price)
>   )
> ORDER BY company, match_number;
> +---------+--------------+------------+------------+------------------+---------------+---------------+
> | COMPANY | MATCH_NUMBER | START_DATE | END_DATE   | ROWS_IN_SEQUENCE | NUM_DECREASES | NUM_INCREASES |
> |---------+--------------+------------+------------+------------------+---------------+---------------|
> | ABCD    |            1 | 2020-10-01 | 2020-10-04 |                4 |             1 |             2 |
> | ABCD    |            2 | 2020-10-04 | 2020-10-08 |                5 |             1 |             3 |
> | XYZ     |            1 | 2020-10-01 | 2020-10-05 |                5 |             1 |             3 |
> | XYZ     |            2 | 2020-10-05 | 2020-10-08 |                4 |             2 |             1 |
> | XYZ     |            3 | 2020-10-08 | 2020-10-10 |                3 |             1 |             1 |
> +---------+--------------+------------+------------+------------------+---------------+---------------+
> ```

The output shows one row per match (regardless of how many rows were part of the match).

The output includes the following columns:

* COMPANY: The stock symbol for the company.
* The MATCH_NUMBER is a sequential number identifying which match this was within this data set (e.g. the first match
  has MATCH_NUMBER 1, the second match has MATCH_NUMBER 2, etc.). If the data was partitioned, then the MATCH_NUMBER
  is the sequential number within the partition (in this example, for each company/stock).
* START_DATE: The date at which this occurrence of the pattern starts.
* END_DATE: The date at which this occurrence of the pattern ends.
* ROWS_IN_SEQUENCE: This is the number of rows in the match. For example, the first match is based on the prices
  measured on 4 days (October 1 through October 4), so ROWS_IN_SEQUENCE is 4.
* NUM_DECREASES: This is the number of days (within the match) that the price went down. For example, in the first match, the
  price went down for 1 day and then went up for 2 days, so NUM_DECREASES is 1.
* NUM_INCREASES: This is the number of days (within the match) that the price went up. For example, in the first match, the
  price went down for 1 day and then went up for 2 days, so NUM_INCREASES is 2.

### Report all rows for all matches for one company

This example returns all rows within each match (not just one summary row per match). This pattern searches for
rising prices of the ‘ABCD’ company:

> ```sqlexample
> select price_date, match_number, msq, price, cl from
>   (select * from stock_price_history where company='ABCD') match_recognize(
>     order by price_date
>     measures
>         match_number() as "MATCH_NUMBER",
>         match_sequence_number() as msq,
>         classifier() as cl
>     all rows per match
>     pattern(ANY_ROW UP+)
>     define
>         ANY_ROW AS TRUE,
>         UP as price > lag(price)
> )
> order by match_number, msq;
> +------------+--------------+-----+-------+---------+
> | PRICE_DATE | MATCH_NUMBER | MSQ | PRICE | CL      |
> |------------+--------------+-----+-------+---------|
> | 2020-10-02 |            1 |   1 |    36 | ANY_ROW |
> | 2020-10-03 |            1 |   2 |    39 | UP      |
> | 2020-10-04 |            1 |   3 |    42 | UP      |
> | 2020-10-05 |            2 |   1 |    30 | ANY_ROW |
> | 2020-10-06 |            2 |   2 |    47 | UP      |
> | 2020-10-07 |            2 |   3 |    71 | UP      |
> | 2020-10-08 |            2 |   4 |    80 | UP      |
> +------------+--------------+-----+-------+---------+
> ```

### Omit empty matches

This searches for price ranges above the average of the whole chart of a company. This example omits empty matches.
Note, however, that empty matches nonetheless increment the MATCH_NUMBER:

> ```sqlexample
> select * from stock_price_history match_recognize(
>     partition by company
>     order by price_date
>     measures
>         match_number() as "MATCH_NUMBER"
>     all rows per match omit empty matches
>     pattern(OVERAVG*)
>     define
>         OVERAVG as price > avg(price) over (rows between unbounded
>                                   preceding and unbounded following)
> )
> order by company, price_date;
> +---------+------------+-------+--------------+
> | COMPANY | PRICE_DATE | PRICE | MATCH_NUMBER |
> |---------+------------+-------+--------------|
> | ABCD    | 2020-10-07 |    71 |            7 |
> | ABCD    | 2020-10-08 |    80 |            7 |
> | ABCD    | 2020-10-09 |    75 |            7 |
> | ABCD    | 2020-10-10 |    63 |            7 |
> | XYZ     | 2020-10-01 |    89 |            1 |
> | XYZ     | 2020-10-04 |    63 |            4 |
> | XYZ     | 2020-10-05 |    65 |            4 |
> | XYZ     | 2020-10-06 |    56 |            4 |
> | XYZ     | 2020-10-08 |    54 |            6 |
> +---------+------------+-------+--------------+
> ```

### Demonstrate the WITH UNMATCHED ROWS option

This example demonstrates the `WITH UNMATCHED ROWS option`. Like the
Omit empty matches example above, this example searches for price ranges
above the average price of each company’s chart. Note that the quantifier in this query is `+`, while the
quantifier in the previous query was `*`:

> ```sqlexample
> select * from stock_price_history match_recognize(
>     partition by company
>     order by price_date
>     measures
>         match_number() as "MATCH_NUMBER",
>         classifier() as cl
>     all rows per match with unmatched rows
>     pattern(OVERAVG+)
>     define
>         OVERAVG as price > avg(price) over (rows between unbounded
>                                  preceding and unbounded following)
> )
> order by company, price_date;
> +---------+------------+-------+--------------+---------+
> | COMPANY | PRICE_DATE | PRICE | MATCH_NUMBER | CL      |
> |---------+------------+-------+--------------+---------|
> | ABCD    | 2020-10-01 |    50 |         NULL | NULL    |
> | ABCD    | 2020-10-02 |    36 |         NULL | NULL    |
> | ABCD    | 2020-10-03 |    39 |         NULL | NULL    |
> | ABCD    | 2020-10-04 |    42 |         NULL | NULL    |
> | ABCD    | 2020-10-05 |    30 |         NULL | NULL    |
> | ABCD    | 2020-10-06 |    47 |         NULL | NULL    |
> | ABCD    | 2020-10-07 |    71 |            1 | OVERAVG |
> | ABCD    | 2020-10-08 |    80 |            1 | OVERAVG |
> | ABCD    | 2020-10-09 |    75 |            1 | OVERAVG |
> | ABCD    | 2020-10-10 |    63 |            1 | OVERAVG |
> | XYZ     | 2020-10-01 |    89 |            1 | OVERAVG |
> | XYZ     | 2020-10-02 |    24 |         NULL | NULL    |
> | XYZ     | 2020-10-03 |    37 |         NULL | NULL    |
> | XYZ     | 2020-10-04 |    63 |            2 | OVERAVG |
> | XYZ     | 2020-10-05 |    65 |            2 | OVERAVG |
> | XYZ     | 2020-10-06 |    56 |            2 | OVERAVG |
> | XYZ     | 2020-10-07 |    50 |         NULL | NULL    |
> | XYZ     | 2020-10-08 |    54 |            3 | OVERAVG |
> | XYZ     | 2020-10-09 |    30 |         NULL | NULL    |
> | XYZ     | 2020-10-10 |    32 |         NULL | NULL    |
> +---------+------------+-------+--------------+---------+
> ```

### Demonstrate symbol predicates in the MEASURES clause

This example shows the use of `<symbol>.<column>` notation with symbol predicates:

> ```sqlexample
> SELECT company, price_date, price, "FINAL FIRST(LT45.price)", "FINAL LAST(LT45.price)"
>     FROM stock_price_history
>        MATCH_RECOGNIZE (
>            PARTITION BY company
>            ORDER BY price_date
>            MEASURES
>                FINAL FIRST(LT45.price) AS "FINAL FIRST(LT45.price)",
>                FINAL LAST(LT45.price)  AS "FINAL LAST(LT45.price)"
>            ALL ROWS PER MATCH
>            AFTER MATCH SKIP PAST LAST ROW
>            PATTERN (LT45 LT45)
>            DEFINE
>                LT45 AS price < 45.00
>            )
>     WHERE company = 'ABCD'
>     ORDER BY price_date;
> +---------+------------+-------+-------------------------+------------------------+
> | COMPANY | PRICE_DATE | PRICE | FINAL FIRST(LT45.price) | FINAL LAST(LT45.price) |
> |---------+------------+-------+-------------------------+------------------------|
> | ABCD    | 2020-10-02 |    36 |                      36 |                     39 |
> | ABCD    | 2020-10-03 |    39 |                      36 |                     39 |
> | ABCD    | 2020-10-04 |    42 |                      42 |                     30 |
> | ABCD    | 2020-10-05 |    30 |                      42 |                     30 |
> +---------+------------+-------+-------------------------+------------------------+
> ```

---
title: Metadata fields in Snowflake
source: https://docs.snowflake.com/en/sql-reference/metadata.md
section: SQL General Reference
---

# Metadata fields in Snowflake

The data contained in metadata fields may be processed outside of your Snowflake Region. It is your responsibility to ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other regulated data is entered into any metadata field when using the Snowflake Service.

When creating an object in Snowflake, metadata fields may be captured. The most common metadata fields are:

* Object definitions, such as a policy, an external function, or a view definition.
* Object properties, such as an object name or an object comment.
* Listing and profile fields, such as listing and organization descriptions.

> **Attention:**
>
> For objects defined through SQL, metadata fields are usually populated by any fields entered as part of [CREATE <object>](sql/create.md),
> and [ALTER <object>](sql/alter.md), or method call statements for a given object. Creating or manipulating objects in other languages,
> such as Python, may also populate metadata fields based on the object’s definitions and properties.
>
> When using these commands, ensure that no personal data (other than for a User object), sensitive data, export-controlled data, or other
> regulated data populates any metadata fields.

In addition to the above fields, the following table sets forth additional potential metadata fields in the Snowflake Service. Metadata is
“Usage Data” as defined in our [Terms of Service](https://www.snowflake.com/legal/terms-of-service/) or other agreement between you and
Snowflake covering use of the Snowflake Service.

Snowflake updates this table regularly as new features and services are added. If you have questions about how Snowflake tracks data or
about sensitive information in the actual query text please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |  |
| --- | --- | --- |
| Additional Metadata | Query literals | [Query Data in Snowflake](../guides-overview-queries.md)  [QUERY_HISTORY view](account-usage/query_history.md) |
|  | Manifests | [Snowflake Native App manifest file](../developer-guide/native-apps/manifest-overview.md)  [Listing manifest reference](../progaccess/listing-manifest-reference.md) |
|  | Snowpark Container Services specification file | [Service specification reference](../developer-guide/snowpark-container-services/specification-reference.md) |
|  | Listing information, manifest file, and profiles | [Listing fields](../collaboration/provider-listings-reference.md)  [Listing manifest reference](../progaccess/listing-manifest-reference.md)  [Provider profile fields](../collaboration/provider-profiles-managing.md) |
|  | Custom instructions for Snowflake Copilot | [Using Snowflake Copilot](../user-guide/snowflake-copilot.md) |
|  | Generic error messages from ML functions | [ML Functions](../guides-overview-ml-functions.md) |
|  | Semantic models and semantic views | [Cortex Analyst semantic models](../user-guide/views-semantic/sql.md)  [Semantic views](../user-guide/views-semantic/overview.md) |
|  | Experiment run parameters, metrics, and artifacts | [Snowflake Experiments](../developer-guide/snowflake-ml/experiments.md) |

---
title: Metadata functions
source: https://docs.snowflake.com/en/sql-reference/functions-metadata.md
section: SQL General Reference
---

# Metadata functions

Snowflake provides functions that return metadata information, such as descriptions of the statements used to create database
objects (e.g. tables).

| Function Name | Notes |
| --- | --- |
| [GENERATE_COLUMN_DESCRIPTION](functions/generate_column_description.md) | Generate a list of columns from a set of staged files that contain [semi-structured data](../user-guide/semistructured-intro.md). |
| [GET_DDL](functions/get_ddl.md) | Get DDL to [re-]create a database object. |

---
title: Model monitor functions
source: https://docs.snowflake.com/en/sql-reference/functions-model-monitors.md
section: SQL General Reference
---

# Model monitor functions

Model monitors allow you to track the performance of your machine learning models in production. You can use the
following functions to retrieve metrics from the model monitors.

> * MODEL_MONITOR_DRIFT_METRIC
> * MODEL_MONITOR_PERFORMANCE_METRIC
> * MODEL_MONITOR_STAT_METRIC

Each function requires the name of a model monitor and the name of a metric to be retrieved from that model.

## List of functions

| Function name | Notes |
| --- | --- |
| [MODEL_MONITOR_DRIFT_METRIC](functions/model-monitor-drift-metric.md) |  |
| [MODEL_MONITOR_PERFORMANCE_METRIC](functions/model-monitor-performance-metric.md) |  |
| [MODEL_MONITOR_STAT_METRIC](functions/model-monitor-stat-metric.md) |  |

---
title: Modifying constraints
source: https://docs.snowflake.com/en/sql-reference/constraints-alter.md
section: SQL General Reference
---

# Modifying constraints

After a constraint is created, you can modify it in the following ways:

* The constraint can be renamed.
* Some properties can be modified; for example, RELY.
* Some properties can’t be modified; for example, DEFERRABLE. To modify these properties, the constraint must
  be dropped and recreated.
* The column definition for a constraint can’t be modified; for example, add new columns, drop existing columns,
  or change the order of columns. To make these types of changes, the constraint must be dropped and recreated.

When modifying a constraint, identify the constraint using either the constraint name or the columns in the constraint
definition along with the type of the constraint. Primary keys can also be identified using the PRIMARY KEY keyword, because
each table can have only a single PRIMARY KEY.

If a table with constraints is modified, for example by renaming the table or swapping the table with another table, the
constraints are updated to reflect the changes.

## Renaming a constraint

Use the following syntax for the [ALTER TABLE](sql/alter-table.md) command to rename a constraint:

```sqlsyntax
ALTER TABLE <table_name> RENAME CONSTRAINT <old_name> TO <new_name>;
```

## Modifying the properties of a constraint

Use the following syntax for the [ALTER TABLE](sql/alter-table.md) command to modify the properties of a constraint:

```sqlsyntax
ALTER TABLE <table_name>
  { ALTER | MODIFY } {
      CONSTRAINT <name>
    | PRIMARY KEY
    | { UNIQUE | FOREIGN KEY } (<column_name>, [ ... ] )
  }
  { [ [ NOT ] ENFORCED ] [ VALIDATE | NOVALIDATE ] [ RELY | NORELY ] };
```

For CHECK constraints, the `constraint_name` is required. Also, you can’t modify the `expr` associated
with a CHECK constraint. To modify the `expr`, the CHECK constraint must be dropped and re-created.

Currently, Snowflake only supports setting the following constraint properties:

* [ NOT ] ENFORCED
* NOVALIDATE and VALIDATE
* RELY and NORELY

Snowflake doesn’t support setting ENFORCED. Snowflake only supports setting NOVALIDATE for CHECK constraints.
See also [Non-default values for ENABLE and VALIDATE properties](sql/create-table-constraint.md).

For descriptions of the constraint properties, see [Constraint properties](sql/create-table-constraint.md).

## Modifying a table with constraints

If a table with constraints is renamed, the constraints for the table, and any FOREIGN KEY constraints that reference
the table, are updated to reference the new name.

Likewise, if a table is swapped with another table, the existing table, all the constraints on the table, are maintained
on the swapped table.

For more details about renaming or swapping tables, see [ALTER TABLE](sql/alter-table.md).

---
title: Notational conventions
source: https://docs.snowflake.com/en/sql-reference/conventions.md
section: SQL General Reference
---

# Notational conventions

The following notational conventions are used in Snowflake documentation.

> **Important:**
>
> In syntax and code descriptions, angle brackets (`< >`), square brackets (`[ ]`), curly braces (`{ }`), and vertical bars (`|`) are used for notational purposes only. To
> avoid syntax errors, do not include them when entering a command or writing code.
>
> However, brackets and braces have specific meanings in JSON and XML, and therefore must be included when working with JSON or XML documents/data.

## Syntax, examples, and text

| Notation | Description |
| --- | --- |
| ITEM , `ITEM` | All-uppercase indicates a Snowflake SQL command, keyword, parameter name, or function name. |
| item , `item` | All-lowercase indicates a user-supplied value for an identifier, parameter, or argument. |
| *<item>* , `item` | Angle brackets and italics indicate identifiers, parameters, or arguments that are provided by users. |
| `( item1 item2 ... )` | Parentheses are used in SQL to group parameters or arguments.  They are required when entering a command (i.e. they must be typed exactly as they appear). |
| `{ item1 item2 ... }` | Curly braces indicate groupings of identifiers, parameters, or arguments.  Curly braces are also used with vertical bars to delimit choices when more than one choice is available.  In both of those cases, the curly braces should not be entered. |
| `[ ITEM ]` , `[ item1 item2 ... ]` | Square brackets indicate optional parts of a statement. They should not be entered.  In many cases, items in the square brackets are optional because default values are provided. |
| `|` | A vertical bar indicates a choice between two or more items or values, usually within square brackets or curly braces. The square brackets or curly braces should not be entered. |
| `...` (ellipsis) | The previous item can be repeated an indefinite number of times. |

### Examples

In the following, the keyword `WORK` is optional:

```sqlsyntax
BEGIN [ WORK ]
```

Therefore, either of the following are valid:

```sqlexample
BEGIN;
BEGIN WORK;
```

In the following, you can use either the keyword `WORK` or the keyword `TRANSACTION`. You must not use both. You can omit both.

```sqlsyntax
BEGIN [ { WORK | TRANSACTION } ]
```

Therefore, any of the following are valid:

```sqlexample
BEGIN;
BEGIN WORK;
BEGIN TRANSACTION;
```

The following shows the syntax of a function call that accepts one argument. The parentheses are required.
The `<function_name>`, `<argument_name>`, and `<data_type>` should be replaced with the actual names:

```sqlsyntax
create function <function_name>( <argument_name> <data_type> )
```

Therefore, the following is valid:

```sqlexample
create function my_function(my_argument integer)
```

The following shows a function that requires at least one argument and accepts optional additional arguments.

```sqlsyntax
<function_name>( <argument_name> <data_type> [ , <argument_name> data_type ] ... )
```

Therefore, the following are valid:

```sqlexample
my_function(argument_1 integer)
my_function(argument_1 integer, argument_2 integer)
my_function(argument_1 integer, argument_2 integer, argument_3 varchar)
```

In this case, additional arguments are also allowed.

## JSON data

| Notation | Description |
| --- | --- |
| `[ item1 ... ]` | Square brackets are JSON array delimiters. |
| `{ item1 item2 ... }` | Curly braces are JSON object delimiters. |

## XML data

| Notation | Description |
| --- | --- |
| `<item> ... </item>` | Angle brackets indicate the start or end of an XML element. |

---
title: Notification functions
source: https://docs.snowflake.com/en/sql-reference/functions-notification.md
section: SQL General Reference
---

# Notification functions

Notification functions are helper functions that you can call when using the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION](stored-procedures/system_send_snowflake_notification.md) stored procedure to
[send a notification](../user-guide/notifications/snowflake-notifications.md).

The integration configuration and message construction functions return JSON-formatted strings that you pass to the
SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure.

| Sub-category | Function | Notes |
| --- | --- | --- |
| Integration Configuration | [EMAIL_INTEGRATION_CONFIG](functions/email_integration_config.md) |  |
|  | [INTEGRATION](functions/integration.md) |  |
| Message Construction | [APPLICATION_JSON](functions/application_json.md) |  |
|  | [TEXT_HTML](functions/text_html.md) |  |
|  | [TEXT_PLAIN](functions/text_plain.md) |  |
| Message Sanitization | [SANITIZE_WEBHOOK_CONTENT](functions/sanitize_webhook_content.md) |  |

---
title: NULL (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/null.md
section: SQL General Reference
---

# NULL (Snowflake Scripting)

NULL can be used as a “no-op” (no operation) statement.

> **Note:**
>
> Using NULL as a statement is uncommon. NULL is usually used as a *value*, rather than as a *statement*.
>
> As a value, NULL means “no value.” For more information, see
> [the Wikipedia article on SQL NULL](https://en.wikipedia.org/wiki/Null_(SQL)).
>
> When working with [semi-structured data types](../data-types-semistructured.md),
> such as [JSON](../../user-guide/tutorials/json-basics-tutorial.md), you might need to
> [distinguish between NULL as an SQL value and NULL as a JSON value (also called “VARIANT NULL”)](../../user-guide/semistructured-considerations.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

## Syntax

```sqlsyntax
NULL;
```

## Usage notes

* The NULL statement can be executed only inside [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) code.
* A NULL statement in an exception handler ensures that the code continues executing rather than aborting if there is no
  higher-level handler.
* A NULL statement in a branch does nothing; however, it communicates to the reader that the author of the code explicitly
  considered the condition for which the branch would execute. In other words, the NULL shows that the branch condition wasn’t
  overlooked or accidentally omitted.
* Before using the NULL statement, consider alternatives.

  For example, suppose you are writing a stored procedure with an exception handler. In most stored procedures, if each
  non-exception code path should return a value, then each code path involving an exception handler should also return a value.
  In that case, avoid executing a NULL statement. Instead, consider explicitly returning NULL, an empty result set, or an
  error indicator.

  You can also use a CONTINUE handler to run statements in the exception block and continues with the statement
  immediately following the one that caused the error. For more information, see [Handling an exception in Snowflake Scripting](../../developer-guide/snowflake-scripting/exceptions.md).

## Examples

The following code uses a NULL statement in an exception handler to ensure that the exception is caught (rather than passed
up to the caller), but no specific action is taken:

```sqlexample
CREATE PROCEDURE null_as_statement()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  SELECT 1 / 0;
  RETURN 'If you see this, the exception was not thrown/caught properly.';
EXCEPTION
  WHEN OTHER THEN
      NULL;
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL null_as_statement();
```

```output
+-------------------+
| NULL_AS_STATEMENT |
|-------------------|
| NULL              |
+-------------------+
```

> **Note:**
>
> The NULL value returned by the CALL statement isn’t directly due to the NULL statement in the exception. Instead, the return
> value is NULL because the stored procedure didn’t execute an explicit RETURN statement.
>
> Snowflake recommends that stored procedures explicitly return a value, including in each branch of the exception handler.

---
title: Numeric data types
source: https://docs.snowflake.com/en/sql-reference/data-types-numeric.md
section: SQL General Reference
---

# Numeric data types

This topic describes the numeric data types supported in Snowflake, along with the supported formats for numeric constants
and literals.

## Data types for fixed-point numbers

Snowflake supports the following data types for fixed-point numbers.

### NUMBER

Numbers up to 38 digits, with an optional precision and scale:

Precision:
:   Total number of digits allowed.

Scale:
:   Number of digits allowed to the right of the decimal point.

By default, precision is 38, and scale is 0; that is, NUMBER(38, 0). Precision limits the range
of values that can be inserted into or cast to columns of a given type. For example, the value `999` fits into
NUMBER(38,0) but not into NUMBER(2,0).

Because precision is the total number of digits allowed, you can’t load a value into a NUMBER column if the number
of digits to the left of the decimal point exceeds the precision of the column minus its scale. For example,
NUMBER(20, 2) allows 18 digits on the left side of the decimal point and two digits on the right side of the decimal
point, for a total of 20 digits.

The *maximum scale*, which is the number of digits to the right of the decimal point, is 37. Numbers that have fewer than 38
significant digits, but whose least significant digit is past the 37th decimal place — for example,
0.0000000000000000000000000000000000000012 (1.2e-39) — can’t be represented without losing some digits of precision.

> **Note:**
>
> If data is converted to another data type with lower precision, and then converted back to the higher-precision data
> type, the data can lose precision. For example, precision is lost if you convert a NUMBER(38,37) value to a DOUBLE value
> — which has a precision of approximately 15 decimal digits — and then back to NUMBER.

Snowflake also supports the FLOAT data type, which allows a wider range of values,
although with less precision.

### DECIMAL , DEC , NUMERIC

Synonymous with NUMBER.

### INT , INTEGER , BIGINT , SMALLINT , TINYINT , BYTEINT

Synonymous with NUMBER, except that precision and scale can’t be specified (that is, it always defaults to NUMBER(38, 0)).
Therefore, for all INTEGER data types, the range of values is all integer values from
-99999999999999999999999999999999999999 to +99999999999999999999999999999999999999 (inclusive).

The various names — for example, TINYINT, BYTEINT, and so on —are to simplify porting from other systems and to suggest
the expected range of values for a column of the specified type.

### Impact of precision and scale on storage size

Precision — the total number of digits — doesn’t affect storage. The storage requirements for the same number in columns with
different precisions, such as NUMBER(2,0) and NUMBER(38,0), are the same. For each micro-partition, Snowflake determines
the minimum and maximum values for a given column and uses that information to determine the storage size for all values
for that column in the partition. For example:

* If a column contains only values between `-128` and `+127`, each of the values consumes 1 byte (uncompressed).
* If the largest value in the column is `10000000`, each of the values consumes 4 bytes (uncompressed).

However, scale — the number of digits following the decimal point — affects storage. For example, the same value stored in
a column of type NUMBER(10,5) consumes more space than NUMBER(5,0). Also, processing values with a larger scale might be
slightly slower and consume more memory.

To save space, Snowflake compresses values before writing them to storage. The amount of compression depends on the data
values and other factors.

### Examples of fixed-point data types in a table

The following statement creates a table with columns of various fixed-point data types:

```sqlexample
CREATE OR REPLACE TABLE test_fixed(
  num0 NUMBER,
  num10 NUMBER(10,1),
  dec20 DECIMAL(20,2),
  numeric30 NUMERIC(30,3),
  int1 INT,
  int2 INTEGER);

DESC TABLE test_fixed;
```

```output
+-----------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name      | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|-----------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| NUM0      | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| NUM10     | NUMBER(10,1) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| DEC20     | NUMBER(20,2) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| NUMERIC30 | NUMBER(30,3) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| INT1      | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| INT2      | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+-----------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

## Data types for floating-point numbers

Snowflake supports the following data types for floating-point numbers.

### FLOAT , FLOAT4 , FLOAT8

The names FLOAT, FLOAT4, and FLOAT8 are for compatibility with other systems. Snowflake treats all three as 64-bit
floating-point numbers.

#### Precision

Snowflake uses double-precision (64 bit) IEEE 754 floating-point numbers.

Precision is approximately 15 digits. For example, for integers, the range is from -9007199254740991 to +9007199254740991
(-253 + 1 to +253 - 1). Floating-point values can range from approximately
10-308 to 10+308. Snowflake can represent more extreme values between approximately 10-324
and 10-308 with less precision. For more details, see the
[Wikipedia article on double-precision numbers](https://en.wikipedia.org/wiki/Double-precision_floating-point_format).

Snowflake supports the fixed-point data type NUMBER, which allows greater precision,
although a smaller range of exponents.

#### Special values

Snowflake supports the following special values for FLOAT:

* `'NaN'` (not a number)
* `'inf'` (infinity)
* `'-inf'` (negative infinity)

The symbols `'NaN'`, `'inf'`, and `'-inf'` must be in single quotes, and are case-insensitive.

Comparison semantics for `'NaN'` differ from the IEEE 754 standard in the following ways:

| Condition | Snowflake | IEEE 754 | Comment |
| --- | --- | --- | --- |
| `'NaN' = 'NaN'` | `TRUE` | `FALSE` | In Snowflake, `'NaN'` values are all equal. |
| `'NaN' > X` . where `X` is any FLOAT value, including . infinity, other than `NaN` itself. | `TRUE` | `FALSE` | In Snowflake, `'NaN'` is greater . than any other FLOAT value, . including infinity. |

#### Rounding errors

Floating point operations can have small rounding errors in the least significant digits. Rounding errors can occur in any type
of floating-point processing, including trigonometric, statistical, and geospatial functions.

The following list shows considerations for rounding errors:

* Errors can vary each time the query is executed.
* Errors can be larger when operands have different precision or scale.
* Errors can accumulate, especially when aggregate functions —for example, [SUM](functions/sum.md)
  or [AVG](functions/avg.md) — process large numbers of rows. Casting to a fixed-point data type before
  aggregating can reduce or eliminate these errors.
* Rounding errors can occur not only when working with SQL, but also when working with other code — for example, Java,
  JavaScript, or Python — that runs inside Snowflake — for example, in
  [UDFs](../developer-guide/udf/udf-overview.md) and
  [stored procedures](../developer-guide/stored-procedure/stored-procedures-overview.md).
* When comparing two floating-point numbers, Snowflake recommends comparing for approximate equality rather than exact equality.

It might be possible to avoid these types of approximation errors by using the exact DECFLOAT data type.

### DOUBLE , DOUBLE PRECISION , REAL

Synonymous with FLOAT.

### Examples of floating-point data types in a table

The following statement creates a table with columns of various floating-point data types:

```sqlexample
CREATE OR REPLACE TABLE test_float(
  double1 DOUBLE,
  float1 FLOAT,
  dp1 DOUBLE PRECISION,
  real1 REAL);

DESC TABLE test_float;
```

```output
+---------+-------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name    | type  | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|---------+-------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| DOUBLE1 | FLOAT | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| FLOAT1  | FLOAT | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| DP1     | FLOAT | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| REAL1   | FLOAT | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+---------+-------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

> **Note:**
>
> The DESC TABLE command’s `type` column displays the data type FLOAT not only for FLOAT, but also for synonyms
> of FLOAT; for example, DOUBLE, DOUBLE PRECISION, and REAL.

### DECFLOAT

The decimal float (DECFLOAT) data type stores numbers exactly, with up to 38 significant digits of precision,
and uses a dynamic base-10 exponent to represent very large or small values. The exponent range is from -16383
to 16384, allowing values approximately between -10^(16384) and 10^(16384). The DECFLOAT data type supports a variable
scale so that the scale varies depending on the specific value being stored. In contrast to the FLOAT data type,
which represents values as approximations, the DECFLOAT data type represents exact values in the specified precision.

The DECFLOAT data type doesn’t support the following special values
that are supported by the FLOAT data type: `'NaN'` (not a number), `'inf'` (infinity),
and `'-inf'` (negative infinity).

#### Use cases for the DECFLOAT data type

Use the DECFLOAT data type when you need exact decimal results and a wide, variable scale in the same column.

The DECFLOAT data type is appropriate for the following general use cases:

* You are ingesting data, and the scale of incoming numeric values is unknown or highly variable.
* You require exact numeric values; for example, ledgers, taxes, or compliance.
* You are migrating from systems that rely on the IEEE 754-decimal representation or 128-bit decimals. These
  migrations might be blocked by the precision or range limitations of other Snowflake data types.
* You want to avoid `Number out of representable range` errors when you sum, multiply, or divide high-precision
  numeric values.

For example, you can use the DECFLOAT data type for the following specific use cases:

* You are ingesting heterogeneously scaled data from Oracle DECIMAL or DB2 DECFLOAT columns.
* You are performing financial modeling that involves computations with scales of results that are hard to predict.
* You are running scientific measurements that swing from nano units to astronomical units.

You can continue to use the NUMBER data type for fixed-scale numeric columns or the FLOAT data type for high-throughput
analytics where imprecise results are acceptable.

#### Usage notes for the DECFLOAT data type

* If an operation produces a result with more than 38 digits, the DECFLOAT value is rounded to 38-digit precision,
  with the least-significant digits rounded off according to the current rounding mode. Snowflake uses
  the [half up rounding mode](https://en.wikipedia.org/wiki/Rounding#Rounding_half_up) for DECFLOAT
  values.
* When you specify a DECFLOAT value or you cast to a DECFLOAT value, avoid using numeric literals in SQL. If you
  use numeric literals in SQL, the values are interpreted as NUMBER or FLOAT values before being cast to a DECFLOAT value,
  which can result in range errors or loss of exactness. Instead, use either string literals — such as `SELECT '<value>'::DECFLOAT`
  — or the DECFLOAT literal — such as `SELECT DECFLOAT '<value>'`.
* When operations mix DECFLOAT values and values of other numeric types, coercion prefers the DECFLOAT values.
  For example, when you add a value of NUMBER type and DECFLOAT type, the result is a DECFLOAT value.
* Use of the DECFLOAT type might cause storage consumption to increase.

#### Drivers and driver versions that support the DECFLOAT data type

The following Snowflake drivers and driver versions support the DECFLOAT data type. You might need to update your drivers
to the versions that support DECFLOAT:

| Driver | Minimum supported version | Notes |
| --- | --- | --- |
| Snowflake Connector for Python | 3.14.1 | pandas DataFrames don’t support the DECFLOAT type. |
| ODBC | 3.12.0 | None. |
| JDBC | 3.27.0 | None. |
| Go Snowflake Driver | 1.17.0 | None. |
| SQL API | 2.0.0 | None. |

Unsupported drivers treat DECFLOAT values as TEXT values. For some drivers, a driver parameter must be set to map
the DECFLOAT type to a language-native type. For more information, see [Drivers](../developer-guide/drivers.md).

#### Limitations for the DECFLOAT data type

The following limitations apply to the DECFLOAT type:

* A DECFLOAT value can’t be stored as a
  [semi-structured data type](data-types-semistructured.md) or
  [structured data type](data-types-structured.md).

  To store a DECFLOAT value as a string for one of these types, you can [cast](data-type-conversion.md)
  the DECFLOAT value to a VARCHAR value.
* DECFLOAT values aren’t supported in the following types of tables:

  + Tables in external formats, such as Iceberg
  + Hybrid tables
* The DECFLOAT data type isn’t supported in stored procedures or user-defined functions (UDFs) written in a
  language other than SQL, such as Python or Java.
* The DECFLOAT data type isn’t supported in Snowpark.
* Snowsight has limited support for the DECFLOAT data type.
* The following features don’t support the DECFLOAT data type:

  + [Clustering keys](../user-guide/tables-clustering-keys.md)
  + [Differential privacy](../user-guide/diff-privacy/differential-privacy-sql-reference.md)
  + [Sensitive data classification](../user-guide/classify-intro.md)
  + [Search optimization service](../user-guide/search-optimization-service.md)
* The NUMBER and FLOAT types might provide better performance than the DECFLOAT type.

#### Examples for the DECFLOAT data type

The following examples use the DECFLOAT data type:

* Show the differences between DECFLOAT and FLOAT
* Use DECFLOAT values with aggregate functions

##### Show the differences between DECFLOAT and FLOAT

The following example shows the differences between the DECFLOAT and FLOAT data types:

1. Create a table with a DECFLOAT column and a FLOAT column, and then insert the same values for both types into the table:

   ```sqlexample
   CREATE OR REPLACE TABLE decfloat_sample (
     id INT,
     decfloat_val DECFLOAT,
     float_val FLOAT);

   INSERT INTO decfloat_sample VALUES
     (
       1,
       DECFLOAT '123e7000',
       FLOAT '123e7000'
     ),
     (
       2,
       12345678901234567890123456789::DECFLOAT,
       12345678901234567890123456789::FLOAT
     ),
     (
       3,
       '-4.2e-5432'::DECFLOAT,
       '-4.2e-5432'::FLOAT
     ),
     (
       4,
       '1.00000000000000000000000000000000000014'::DECFLOAT,
       '1.00000000000000000000000000000000000014'::FLOAT
     ),
     (
       5,
       '1.00000000000000000000000000000000000015'::DECFLOAT,
       '1.00000000000000000000000000000000000015'::FLOAT
     );
   ```

   The statement inserts DECFLOAT values in the following ways:

   * The first value is inserted by using the DECFLOAT literal.
   * The second value is inserted by casting an INTEGER value to a DECFLOAT value.
   * The third, fourth, and fifth values are inserted by casting a VARCHAR value to a DECFLOAT value.
2. To show the types, describe the table by using the DESC TABLE command.

   The precision wasn’t specified in the table definition for either column, but the output shows that the DECFLOAT data type supports up to 38 significant digits of precision:

   ```sqlexample
   DESC TABLE decfloat_sample;
   ```

   ```output
   +--------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
   | name         | type         | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
   |--------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
   | ID           | NUMBER(38,0) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
   | DECFLOAT_VAL | DECFLOAT(38) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
   | FLOAT_VAL    | FLOAT        | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
   +--------------+--------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
   ```
3. To show the differences in the values, query the table by using the SELECT statement:

   ```sqlexample
   SELECT * FROM decfloat_sample;
   ```

   ```output
   +----+-----------------------------------------+------------------------+
   | ID | DECFLOAT_VAL                            |              FLOAT_VAL |
   |----+-----------------------------------------+------------------------|
   |  1 | 1.23e7002                               | inf                    |
   |  2 | 12345678901234567890123456789           |   1.23456789012346e+28 |
   |  3 | -4.2e-5432                              |  -0                    |
   |  4 | 1.0000000000000000000000000000000000001 |   1                    |
   |  5 | 1.0000000000000000000000000000000000002 |   1                    |
   +----+-----------------------------------------+------------------------+
   ```

   The output shows the following differences:

   * The first row shows that the DECFLOAT type supports a wider range of values than the FLOAT type.
     The DECFLOAT value is very large (`1.23e7002`). The FLOAT value is `inf`, which means that the
     value is larger than any value that the FLOAT type can represent.
   * The second row shows that the DECFLOAT type retains the specified value exactly. The FLOAT
     value is an approximation that is stored in scientific notation.
   * The third row shows that the DECFLOAT type supports very small values (`-4.2e-5432`). The FLOAT
     value is approximated to `-0`.
   * The fourth and fifth rows show that the DECFLOAT type supports up to 38 digits of precision and uses
     rounding rules for values beyond the limit. The FLOAT value is approximated to `1` in both rows.

##### Use DECFLOAT values with aggregate functions

The following example uses DECFLOAT values with aggregate functions:

1. Create a table, and then insert DECFLOAT values into the table:

   ```sqlexample
   CREATE OR REPLACE TABLE decfloat_agg_sample (decfloat_val DECFLOAT);

   INSERT INTO decfloat_agg_sample VALUES
     (DECFLOAT '1e1000'),
     (DECFLOAT '-2.47e999'),
     (DECFLOAT '22e-75');
   ```
2. Query the table by using some aggregate functions:

   ```sqlexample
   SELECT SUM(decfloat_val),
          AVG(decfloat_val),
          MAX(decfloat_val),
          MIN(decfloat_val)
     FROM decfloat_agg_sample;
   ```

   ```output
   +-------------------+-------------------+-------------------+-------------------+
   | SUM(DECFLOAT_VAL) | AVG(DECFLOAT_VAL) | MAX(DECFLOAT_VAL) | MIN(DECFLOAT_VAL) |
   |-------------------+-------------------+-------------------+-------------------|
   | 7.53e999          | 2.51e999          | 1e1000            | -2.47e999         |
   +-------------------+-------------------+-------------------+-------------------+
   ```

## Numeric constants

The term *constants* — also known as *literals* — refers to fixed data values. The following formats are
supported for numeric constants:

> `[+-][digits][.digits][e[+-]digits]`

Where:

* `+` or `-` indicates a positive or negative value. The default is positive.
* `digits` is one or more digits from 0 to 9.
* `e` (or `E`) indicates an exponent in scientific notation. At least one digit must follow the exponent marker if present.

The following numbers are all examples of supported numeric constants:

```output
15
+1.34
0.2
15e-03
1.234E2
1.234E+2
-1
```

---
title: Numeric functions
source: https://docs.snowflake.com/en/sql-reference/functions-numeric.md
section: SQL General Reference
---

# Numeric functions

Numeric functions operate on numeric values and perform operations such as
rounding and exponentiation.

| Sub-category | Function | Notes |
| --- | --- | --- |
| **Arithmetic** | [DIV0](functions/div0.md) |  |
| [DIV0NULL](functions/div0null.md) |  |
| **Rounding and Truncation** | [ABS](functions/abs.md) |  |
| [CEIL](functions/ceil.md) |  |
| [FLOOR](functions/floor.md) |  |
| [MOD](functions/mod.md) |  |
| [ROUND](functions/round.md) |  |
| [SIGN](functions/sign.md) |  |
| [TRUNCATE , TRUNC](functions/trunc.md) |  |
| **Exponent and Root** | [CBRT](functions/cbrt.md) |  |
| [EXP](functions/exp.md) |  |
| [FACTORIAL](functions/factorial.md) |  |
| [POW, POWER](functions/pow.md) |  |
| [SQRT](functions/sqrt.md) |  |
| [SQUARE](functions/square.md) |  |
| **Logarithmic** | [LN](functions/ln.md) |  |
| [LOG](functions/log.md) |  |
| **Trigonometric** | [ACOS](functions/acos.md) |  |
| [ACOSH](functions/acosh.md) |  |
| [ASIN](functions/asin.md) |  |
| [ASINH](functions/asinh.md) |  |
| [ATAN](functions/atan.md) |  |
| [ATAN2](functions/atan2.md) |  |
| [ATANH](functions/atanh.md) |  |
| [COS](functions/cos.md) |  |
| [COSH](functions/cosh.md) |  |
| [COT](functions/cot.md) |  |
| [DEGREES](functions/degrees.md) |  |
| [PI](functions/pi.md) |  |
| [RADIANS](functions/radians.md) |  |
| [SIN](functions/sin.md) |  |
| [SINH](functions/sinh.md) |  |
| [TAN](functions/tan.md) |  |
| [TANH](functions/tanh.md) |  |
| **Other** | [WIDTH_BUCKET](functions/width_bucket.md) |  |

---
title: Object identifiers
source: https://docs.snowflake.com/en/sql-reference/identifiers.md
section: SQL General Reference
---

# Object identifiers

An identifier is a string of characters (up to 255 characters in length) used to identify first-class Snowflake “named” objects, including table columns:

* Identifiers are specified at object creation time and then are referenced in queries and DDL/DML statements.
* Identifiers can also be defined in queries as aliases (e.g. `SELECT a+b AS "the sum";`).

Object identifiers, often simply referred to as object *names*, must be unique within the context of the object type and the “parent” object:

Account:
:   Identifiers for account objects (users, roles, warehouses, databases, etc.) must be unique across the entire account.

Databases:
:   Identifiers for schemas must be unique within the database. To enable resolving schemas that have the same identifiers across databases,
    Snowflake supports fully-qualifying the schema identifiers in the form of:

    `<database_name>.<schema_name>`

Schemas:
:   Identifiers for schema objects (tables, views, file formats, stages, etc.) must be unique within the schema. To enable resolving objects
    that have the same identifiers in different databases/schemas, Snowflake supports fully-qualifying the object identifiers in the form of:

    `<database_name>.<schema_name>.<object_name>`

Tables:
:   Identifiers for columns must be unique within the table.

> **Note:**
>
> UDFs and stored procedures are schema objects; however Snowflake supports UDFs/stored procedures with the same identifier within the same schema
> (also referred to as “overloading”). For more details, see [Naming and overloading procedures and UDFs](../developer-guide/udf-stored-procedure-naming-conventions.md).

**Next Topics:**

* [Identifier requirements](identifiers-syntax.md)
* [Literals and variables as identifiers with IDENTIFIER() syntax](identifier-literal.md)
* [Object name resolution](name-resolution.md)

---
title: Object name resolution
source: https://docs.snowflake.com/en/sql-reference/name-resolution.md
section: SQL General Reference
---

# Object name resolution

A fully-qualified schema object (table, view, file format etc.) has the form:

> `<database_name>.<schema_name>.<object_name>`

However, because this can be tedious to write, the user is allowed to omit qualifications, from left to right. This topic describes how schema object names are resolved.

## Resolution when database omitted

> `(''<schema_name>.<object_name>'')`

The object name is augmented with the current database. The current database is set to a default value, depending on the account’s settings, when a session is initiated. Afterwards, it can be changed using the
[USE DATABASE](sql/use-database.md) command. The [CREATE DATABASE](sql/create-database.md) command also implicitly changes the current database to the newly created one. The name of the current database is returned by the
[CURRENT_DATABASE](functions/current_database.md) function.

For example:

```sqlexample
SELECT CURRENT_DATABASE();
```

```output
+--------------------+
| CURRENT_DATABASE() |
+--------------------+
| TESTDB             |
+--------------------+
```

```sqlexample
CREATE DATABASE db1;
```

```output
+------------------------------------+
|               status               |
+------------------------------------+
| Database DB1 successfully created. |
+------------------------------------+
```

```sqlexample
SELECT CURRENT_DATABASE();
```

```output
+--------------------+
| CURRENT_DATABASE() |
+--------------------+
| DB1                |
+--------------------+
```

```sqlexample
USE DATABASE testdb;
```

```output
+----------------------------------+
|              status              |
+----------------------------------+
| Statement executed successfully. |
+----------------------------------+
```

```sqlexample
SELECT CURRENT_DATABASE();
```

```output
+--------------------+
| CURRENT_DATABASE() |
+--------------------+
| TESTDB             |
+--------------------+
```

## Resolution when schema omitted (double-dot notation)

> `(''<database_name>..<object_name>'')`

The two dots indicate that the schema name is not specified. The PUBLIC default schema is always referenced.

Note that this notational format is provided mostly for compatibility with other systems, such as Microsoft SQL Server and IBM Netezza. Using this notation in new queries is discouraged.

## Unqualified objects

Unqualified objects (single identifiers) are resolved in two different ways, depending on whether they appear in a DDL or DML statement or in a query.

### DDL and DML statements

In DDL and DML statements, unqualified objects are augmented with the current database and schema. The current schema is maintained similarly to the current database. The current schema always belongs to the current database.

When a session is initiated, the current schema is initialized based on the connection’s settings. When the current database is changed, the current schema defaults to the value of an internal property (normally set to PUBLIC).
The current schema can be changed (always within the current database) by using the [USE SCHEMA](sql/use-schema.md) command. It is also implicitly changed by the [CREATE SCHEMA](sql/create-schema.md) command. The name of the
current schema is returned by the [CURRENT_SCHEMA](functions/current_schema.md) function.

For example:

```sqlexample
SELECT CURRENT_SCHEMA();
```

```output
+------------------+
| CURRENT_SCHEMA() |
+------------------+
| TESTSCHEMA       |
+------------------+
```

```sqlexample
CREATE DATABASE db1;
```

```output
+------------------------------------+
|               status               |
+------------------------------------+
| Database DB1 successfully created. |
+------------------------------------+
```

```sqlexample
SELECT CURRENT_SCHEMA();
```

```output
+------------------+
| CURRENT_SCHEMA() |
+------------------+
| PUBLIC           |
+------------------+
```

```sqlexample
CREATE SCHEMA sch1;
```

```output
+-----------------------------------+
|              status               |
+-----------------------------------+
| Schema SCH1 successfully created. |
+-----------------------------------+
```

```sqlexample
SELECT current_schema();
```

```output
+------------------+
| CURRENT_SCHEMA() |
|------------------+
| SCH1             |
|------------------+
```

### Name resolution in queries

In queries, unqualified object names are resolved through a search path.

The search path usually contains the current schema, but can also contain other schemas.

The search path is stored in the session-level parameter SEARCH_PATH. Similar to any other parameter, it can be
changed using the [ALTER SESSION](sql/alter-session.md) command.

The value of the search path is a comma-separated list of identifiers. The list can contain
fully- or partially-qualified schema names. Each schema name can be a [Double-quoted identifiers](identifiers-syntax.md).

The search path can also contain the following pseudo-variables:

> $current
> :   Specifies the current schema (see above).
>
> $public
> :   Specifies the public schema of the current database. The public schema’s name is determined by an
>     internal property, maintained by Snowflake, that is typically set to PUBLIC (for the PUBLIC schema
>     automatically created for each database).

These pseudo-variable names are case-insensitive.

The default value of the search path is `$current, $public`.

If the user specifies a new value for the search path, the new value will be validated. Every schema identifier specified in the new value must correspond to an existing schema. (In particular, every unqualified schema must
correspond to an existing schema in the current database). Otherwise an error will be raised and search_path will retain its previous value. However, the pseudo-variables can be used freely. For example, *$public* can be used even
if the current database has no public schema.

The value of the SEARCH_PATH parameter is reinterpreted every time it is used. Therefore, changing the current schema changes the meaning of `$current`, and changing the current database changes the meaning of `$public`, as
well as the meaning of any unqualified schemas.

If a schema in the search path is dropped, or if the current database is changed and some unqualified schemas in the search path don’t exist in the new database, no error is raised.

The SEARCH_PATH is not used inside [views](../user-guide/views-introduction.md) or [UDFs](../developer-guide/udf/udf-overview.md).
All unqualified objects in a view or UDF definition will be resolved in the view’s or UDF’s schema only.

The literal value of the search path can be examined through the command [SHOW PARAMETERS](sql/show-parameters.md).

To see the schemas that will be searched for unqualified objects in queries, use the [CURRENT_SCHEMAS](functions/current_schemas.md) function. The return value for the function contains a series of fully-qualified schemas in the
search path, separated by commas.

For example:

```sqlexample
SELECT CURRENT_SCHEMAS();
```

```output
+-------------------+
| CURRENT_SCHEMAS() |
+-------------------+
| []                |
+-------------------+
```

```sqlexample
USE DATABASE mytestdb;

SELECT current_schemas();
```

```output
+---------------------+
| CURRENT_SCHEMAS()   |
+---------------------+
| ["MYTESTDB.PUBLIC"] |
+---------------------+
```

```sqlexample
CREATE SCHEMA private;

SELECT current_schemas();
```

```output
+-----------------------------------------+
| CURRENT_SCHEMAS()                       |
+-----------------------------------------+
| ["MYTESTDB.PRIVATE", "MYTESTDB.PUBLIC"] |
+-----------------------------------------+
```

The pseudo-variables are expanded to their current value, unqualified schemas are fully qualified, and schemas that don’t exist or aren’t visible are omitted.

```sqlexample
SHOW PARAMETERS LIKE 'search_path';
```

```output
+-------------+--------------------+--------------------+------------------------------------------------+
| key         | value              | default            | description                                    |
+-------------+--------------------+--------------------+------------------------------------------------+
| SEARCH_PATH | $current, $public, | $current, $public, | Search path for unqualified object references. |
+-------------+--------------------+--------------------+------------------------------------------------+
```

```sqlexample
SELECT current_schemas();
```

```output
+---------------------------------------------------------------------------+
|                       CURRENT_SCHEMAS()                                   |
+---------------------------------------------------------------------------+
| [XY12345.TESTDB.TESTSCHEMA, XY12345.TESTDB.PUBLIC, SAMPLES.COMMON.PUBLIC] |
+---------------------------------------------------------------------------+
```

```sqlexample
CREATE DATABASE db1;
```

```output
+------------------------------------+
|               status               |
+------------------------------------+
| Database DB1 successfully created. |
+------------------------------------+
```

```sqlexample
USE SCHEMA public;
```

```output
+----------------------------------+
|              status              |
+----------------------------------+
| Statement executed successfully. |
+----------------------------------+
```

```sqlexample
SELECT current_schemas();
```

```output
+---------------------------------------------+
|                CURRENT_SCHEMAS()            |
+---------------------------------------------+
| [XY12345.DB1.PUBLIC, SAMPLES.COMMON.PUBLIC] |
+---------------------------------------------+
```

```sqlexample
ALTER SESSION SET search_path='$current, $public, testdb.public';
```

```output
+----------------------------------+
|              status              |
+----------------------------------+
| Statement executed successfully. |
+----------------------------------+
```

```sqlexample
SHOW PARAMETERS LIKE 'search_path';
```

```output
+-------------+----------------------------------+--------------------+------------------------------------------------+
| key         | value                            | default            | description                                    |
+-------------+----------------------------------+--------------------+------------------------------------------------+
| SEARCH_PATH | $current, $public, testdb.public | $current, $public, | Search path for unqualified object references. |
+-------------+----------------------------------+--------------------+------------------------------------------------+
```

```sqlexample
SELECT current_schemas();
```

```output
+---------------------------------------------+
|                CURRENT_SCHEMAS()            |
+---------------------------------------------+
| [XY12345.DB1.PUBLIC, XY12345.TESTDB.PUBLIC] |
+---------------------------------------------+
```

#### Precedence when a column name and an alias match

It is possible (but usually not recommended) to create a query that contains an alias that matches a column name:

```sqlexample
SELECT x, some_expression AS x
  FROM ...
```

If a clause contains a name that matches both a column name and an alias, then the clause uses the column name. The following example demonstrates this behavior using a GROUP BY clause:

Create a table and insert rows:

```sqlexample
CREATE TABLE employees (salary FLOAT, state VARCHAR, employment_state VARCHAR);
INSERT INTO employees (salary, state, employment_state) VALUES
  (60000, 'California', 'Active'),
  (70000, 'California', 'On leave'),
  (80000, 'Oregon', 'Active');
```

The following query returns the sum of the salaries of the employees who are active and the sum of the salaries of the employees who
are on leave:

```sqlexample
SELECT SUM(salary), ANY_VALUE(employment_state)
  FROM employees
  GROUP BY employment_state;
```

```output
+-------------+-----------------------------+
| SUM(SALARY) | ANY_VALUE(EMPLOYMENT_STATE) |
|-------------+-----------------------------|
|      140000 | Active                      |
|       70000 | On leave                    |
+-------------+-----------------------------+
```

The next query uses the alias `state`, which matches the name of a column of the table in the query. When `state` is used in
the GROUP BY clause, Snowflake interprets it as a reference to the column name, not the alias. This query therefore returns the sum of
the salaries of the employees in the state of California and the sum of the salaries of the employees in the state of Oregon,
yet displays `employment_state` information, such as `Active`, rather than the names of states or provinces:

```sqlexample
SELECT SUM(salary), ANY_VALUE(employment_state) AS state
  FROM employees
  GROUP BY state;
```

```output
+-------------+--------+
| SUM(SALARY) | STATE  |
|-------------+--------|
|      130000 | Active |
|       80000 | Active |
+-------------+--------+
```

---
title: OPEN (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/open.md
section: SQL General Reference
---

# OPEN (Snowflake Scripting)

Opens a cursor.

For more information on cursors, see [Working with cursors](../../developer-guide/snowflake-scripting/cursors.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [DECLARE](declare.md), [FETCH](fetch.md), [CLOSE](close.md)

## Syntax

```sqlsyntax
OPEN <cursor_name> [ USING (bind_variable_1 [, bind_variable_2 ...] ) ] ;
```

Where:

> `cursor_name`
> :   The name of the cursor.
>
> `bind_variable`
> :   A bind variable holds a value to be used in the cursor’s query definition (e.g. in a `WHERE` clause).
>
>     An example of binding is included in the examples later in this section.

## Usage notes

* The result set of a query can be thought of as a set of rows. Internally, opening a cursor executes the query,
  reads the rows, and positions an internal pointer to the first of the rows.
* As with any SQL query, if the query definition does not contain an
  [ORDER BY](../constructs/order-by.md) at the outermost level, then the result
  set has no defined order. When the result set for the cursor is created, its order persists until the cursor is
  closed. However, re-declaring or re-opening the cursor might produce the rows in a different order.
* Similarly, if a cursor is closed, and then the underlying table(s) are updated before it is re-opened, the
  result set can also change.

## Examples

```sqlexample
DECLARE
    c1 CURSOR FOR SELECT price FROM invoices;
BEGIN
    OPEN c1;
    ...
```

The following shows how to bind a variable when opening a [cursor](../../developer-guide/snowflake-scripting/cursors.md):

```sqlexample
DECLARE
    price_to_search_for FLOAT;
    price_count INTEGER;
    c2 CURSOR FOR SELECT COUNT(*) FROM invoices WHERE price = ?;
BEGIN
    price_to_search_for := 11.11;
    OPEN c2 USING (price_to_search_for);
```

For a more complete example of using a cursor, see
[the introductory cursor example](../../developer-guide/snowflake-scripting/cursors.md).

---
title: ORDER BY
source: https://docs.snowflake.com/en/sql-reference/constructs/order-by.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# ORDER BY

Specifies an ordering of the rows of the result table from a [SELECT](../sql/select.md) list.

## Syntax

**Sorting by specific columns**

```sqlsyntax
SELECT ...
  FROM ...
  ORDER BY orderItem [ , orderItem , ... ]
  [ ... ]
```

Where:

```sqlsyntax
orderItem ::= { <column_alias> | <position> | <expr> } [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ]
```

**Sorting by all columns**

```sqlsyntax
SELECT ...
  FROM ...
  ORDER BY ALL [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ]
  [ ... ]
```

## Parameters

`column_alias`
:   Column alias appearing in the query block’s [SELECT](../sql/select.md) list.

`position`
:   Position of an expression in the [SELECT](../sql/select.md) list.

`expr`
:   Any expression on tables in the current scope.

`{ ASC | DESC }`
:   Optionally returns the values of the sort key in ascending (lowest to highest) or descending (highest to lowest) order.

    Default: ASC

`NULLS { FIRST | LAST }`
:   Optionally specifies whether NULL values are returned before/after non-NULL values, based on the sort order (ASC or DESC).

    Default: Depends on the sort order (ASC or DESC); see the usage notes below for details

`ALL`
:   Sorts the results by all of the columns specified in the SELECT list. The results are sorted by the columns in the order in
    which they appear.

    For example, suppose that the SELECT list contains:

    ```sqlexample
    SELECT col_1, col_2, col_3
      FROM my_table
      ORDER BY ALL;
    ```

    The results are sorted first by `col_1`, then by `col_2`, and then by `col_3`.

    > **Note:**
    >
    > You cannot specify ORDER BY ALL if a column in the SELECT list uses an aggregate function.

## Usage notes

* All data is sorted according to the numeric byte value of each character in the ASCII table. UTF-8 encoding is supported.
* For numeric values, leading zeros before the decimal point and trailing zeros (`0`) after the decimal point have no effect on sort order.

* When NULLS FIRST or NULLS LAST isn’t specified, the ordering of NULL values depends on the setting of the
  [DEFAULT_NULL_ORDERING](../parameters.md) parameter and the sort order:

  + When the sort order is ASC (the default) and the DEFAULT_NULL_ORDERING parameter is set to `LAST`
    (the default), NULL values are returned last. Therefore, unless specified otherwise, NULL values are considered to be higher than
    any non-NULL values.
  + When the sort order is ASC and the DEFAULT_NULL_ORDERING parameter is set to `FIRST`, NULL values are returned first.
  + When the sort order is DESC and the DEFAULT_NULL_ORDERING parameter is set to `FIRST`, NULL values are returned last.
  + When the sort order is DESC and the DEFAULT_NULL_ORDERING parameter is set to `LAST`, NULL values are returned first.
* The sort order isn’t guaranteed to be consistent for values of different data types in
  [semi-structured](../data-types-semistructured.md) data, such as an array that contains elements of
  different data types.
* Top-K pruning can improve the performance of queries that include both [LIMIT](limit.md) and ORDER BY clauses. For more
  information, see [Top-K pruning for improved query performance](../../user-guide/querying-top-k-pruning-optimization.md).
* An ORDER BY clause can be used at different levels in a query, such as in a subquery or inside an OVER() clause for a window function.
  An ORDER BY clause inside a subquery or an OVER() clause applies only in that context. For example, the ORDER BY clause
  in the following query orders results only within the subquery, not the outermost level of the query:

  ```sqlexample
  SELECT *
    FROM (
      SELECT branch_name
        FROM branch_offices
        ORDER BY monthly_sales DESC
        LIMIT 3
    );
  ```

  In this example, the ORDER BY clause is specified in the subquery, so the subquery returns the names in order of monthly
  sales. The ORDER BY clause in the subquery does not apply to the outer query. This query returns the names of the three
  branches that had the highest monthly sales, but not necessarily in order by monthly sales.

  Sorting can be expensive. If you want the results of the outer query sorted, use an ORDER BY clause only at the
  top level of the query, and avoid using ORDER BY clauses in subqueries unless necessary.

  Similarly, when ORDER BY and [LIMIT](limit.md) (or FETCH) clauses are at different nesting levels, results
  can be unpredictable. For details and examples, see the [LIMIT / FETCH usage notes](limit.md).

## Examples

The following examples demonstrate how to use ORDER BY to sort the results:

* Sorting by string values
* Sorting by numeric values
* Sorting NULLS first or last

### Sorting by string values

The following example sorts the results by string values:

```sqlexample
SELECT column1
  FROM VALUES
    ('a'), ('1'), ('B'), (null), ('2'), ('01'), ('05'),
    (' this'), ('this'), ('this and that'), ('&'), ('%')
  ORDER BY column1;
```

```output
+---------------+
| COLUMN1       |
|---------------|
|  this         |
| %             |
| &             |
| 01            |
| 05            |
| 1             |
| 2             |
| B             |
| a             |
| this          |
| this and that |
| NULL          |
+---------------+
```

### Sorting by numeric values

The following example sorts the results by numeric values:

```sqlexample
SELECT column1
  FROM VALUES
    (3), (4), (null), (1), (2), (6),
    (5), (0005), (.05), (.5), (.5000)
  ORDER BY column1;
```

```output
+---------+
| COLUMN1 |
|---------|
|    0.05 |
|    0.50 |
|    0.50 |
|    1.00 |
|    2.00 |
|    3.00 |
|    4.00 |
|    5.00 |
|    5.00 |
|    6.00 |
|    NULL |
+---------+
```

### Sorting NULLS first or last

The following example configures all queries in the session to sort NULLS last by setting the [DEFAULT_NULL_ORDERING](../parameters.md)
parameter to `LAST`.

```sqlexample
ALTER SESSION SET DEFAULT_NULL_ORDERING = 'LAST';
```

```sqlexample
SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1;
```

```output
+---------+
| COLUMN1 |
|---------|
|       1 |
|       2 |
|       3 |
|    NULL |
|    NULL |
+---------+
```

```sqlexample
SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1 DESC;
```

```output
+---------+
| COLUMN1 |
|---------|
|    NULL |
|    NULL |
|       3 |
|       2 |
|       1 |
+---------+
```

The following example overrides the DEFAULT_NULL_ORDERING parameter by specifying NULLS FIRST in a query:

```sqlexample
SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1 NULLS FIRST;
```

```output
+---------+
| COLUMN1 |
|---------|
|    NULL |
|    NULL |
|       1 |
|       2 |
|       3 |
+---------+
```

The following example sets the DEFAULT_NULL_ORDERING parameter to `FIRST` to sort NULLS first:

```sqlexample
ALTER SESSION SET DEFAULT_NULL_ORDERING = 'FIRST';

SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1;
```

```output
+---------+
| COLUMN1 |
|---------|
|    NULL |
|    NULL |
|       1 |
|       2 |
|       3 |
+---------+
```

```sqlexample
SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1 DESC;
```

```output
+---------+
| COLUMN1 |
|---------|
|       3 |
|       2 |
|       1 |
|    NULL |
|    NULL |
+---------+
```

The following example overrides the DEFAULT_NULL_ORDERING parameter by specifying NULLS LAST in a query:

```sqlexample
SELECT column1
  FROM VALUES (1), (null), (2), (null), (3)
  ORDER BY column1 NULLS LAST;
```

```output
+---------+
| COLUMN1 |
|---------|
|       1 |
|       2 |
|       3 |
|    NULL |
|    NULL |
+---------+
```

### Sorting by all columns in the SELECT list

To run the examples in this section, create the following table:

```sqlexample
CREATE OR REPLACE TABLE my_sort_example(a NUMBER, s VARCHAR, b BOOLEAN);

INSERT INTO my_sort_example VALUES
  (0, 'abc', TRUE),
  (0, 'abc', FALSE),
  (0, 'abc', NULL),
  (0, 'xyz', FALSE),
  (0, NULL, FALSE),
  (1, 'xyz', TRUE),
  (NULL, 'xyz', FALSE);
```

The following example sorts the results by all columns in the table:

```sqlexample
SELECT * FROM my_sort_example
  ORDER BY ALL;
```

As shown below, the results are sorted first by the `a` column, then by the `s` column, and then by the `b` column (the
order in which the columns were defined in the table).

```output
+------+------+-------+
| A    | S    | B     |
|------+------+-------|
| 0    | abc  | False |
| 0    | abc  | True  |
| 0    | abc  | NULL  |
| 0    | xyz  | False |
| 0    | NULL | False |
| 1    | xyz  | True  |
| NULL | xyz  | False |
+------+------+-------+
```

The following example sorts the results in ascending order:

```sqlexample
SELECT * FROM my_sort_example
  ORDER BY ALL ASC;
```

```output
+------+------+-------+
| A    | S    | B     |
|------+------+-------|
| 0    | abc  | False |
| 0    | abc  | True  |
| 0    | abc  | NULL  |
| 0    | xyz  | False |
| 0    | NULL | False |
| 1    | xyz  | True  |
| NULL | xyz  | False |
+------+------+-------+
```

The following example sets the DEFAULT_NULL_ORDERING parameter to sort NULL values last for all queries executed during the
session:

```sqlexample
ALTER SESSION SET DEFAULT_NULL_ORDERING = 'LAST';

SELECT * FROM my_sort_example
  ORDER BY ALL;
```

```output
+------+------+-------+
| A    | S    | B     |
|------+------+-------|
| NULL | xyz  | False |
| 0    | NULL | False |
| 0    | abc  | NULL  |
| 0    | abc  | False |
| 0    | abc  | True  |
| 0    | xyz  | False |
| 1    | xyz  | True  |
+------+------+-------+
```

The following example specifies NULLS FIRST in a query to override that setting:

```sqlexample
SELECT * FROM my_sort_example
  ORDER BY ALL NULLS FIRST;
```

```output
+------+------+-------+
| A    | S    | B     |
|------+------+-------|
| NULL | xyz  | False |
| 0    | NULL | False |
| 0    | abc  | NULL  |
| 0    | abc  | False |
| 0    | abc  | True  |
| 0    | xyz  | False |
| 1    | xyz  | True  |
+------+------+-------+
```

The following example returns the columns in the order `b`, `s`, and `a`. The results are sorted first by `b`, then by
`s`, and then by `a`:

```sqlexample
SELECT b, s, a FROM my_sort_example
  ORDER BY ALL NULLS LAST;
```

```output
+-------+------+------+
| B     | S    | A    |
|-------+------+------|
| False | abc  | 0    |
| False | xyz  | 0    |
| False | xyz  | NULL |
| False | NULL | 0    |
| True  | abc  | 0    |
| True  | xyz  | 1    |
| NULL  | abc  | 0    |
+-------+------+------+
```

---
title: Organization user and organization user group functions
source: https://docs.snowflake.com/en/sql-reference/functions-organization-users.md
section: SQL General Reference
---

# Organization user and organization user group functions

The following functions help you work with [organization users and organization user groups](../user-guide/organization-users.md).

| Function | Description |
| --- | --- |
| [CURRENT_ORGANIZATION_USER](functions/current_organization_user.md) | Indicates whether the current user in the session was imported from an organization user. |
| [IS_ORGANIZATION_USER](functions/is_organization_user.md), . [IS_USER_IMPORTED (SYS_CONTEXT function)](functions/is_user_imported.md) | Tests whether a specific user was imported from an organization user. |
| [IS_ORGANIZATION_USER_GROUP](functions/is_organization_user_group.md), . [IS_GROUP_IMPORTED (SYS_CONTEXT function)](functions/is_group_imported.md) | Tests whether a specific role was imported from an organization user group. |
| [IS_ORGANIZATION_USER_GROUP_IN_SESSION](functions/is_organization_user_group_in_session.md), . [IS_GROUP_ACTIVATED (SYS_CONTEXT function)](functions/is_group_activated.md) | Tests whether a specific imported role is in the role hierarchy of the user’s current session. |
| [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION namespace)](functions/sys_context_snowflake_organization.md) | Returns information about organization users and organization user groups. |
| [SYS_CONTEXT (SNOWFLAKE$ORGANIZATION_SESSION namespace)](functions/sys_context_snowflake_organization_session.md) | Returns information about the current session and the current organization user in the session. |
| [SYSTEM$LINK_ORGANIZATION_USER](functions/system_link_organization_user.md) | Links an organization user with an existing user object so it can be managed as an organization user going forward. |
| [SYSTEM$LINK_ORGANIZATION_USER_GROUP](functions/system_link_organization_user_group.md) | Links an organization user group with an existing access control role so it can be managed as a organization user group going forward. |
| [SYSTEM$UNLINK_ORGANIZATION_USER](functions/system_unlink_organization_user.md) | Unlinks a user object from an organization user so it can be managed as a local user going forward. |
| [SYSTEM$UNLINK_ORGANIZATION_USER_GROUP](functions/system_unlink_organization_user_group.md) | Unlinks an access control role from an organization user group so it can be managed as a local role going forward. |

---
title: Overview of constraints
source: https://docs.snowflake.com/en/sql-reference/constraints-overview.md
section: SQL General Reference
---

# Overview of constraints

Snowflake provides the following constraint functionality:

* Constraint types from the ANSI SQL standard. For more information, see
  Supported constraint types.
* Named constraints.
* Single-column and multi-column constraints.
* Creation of constraints inline and out-of-line.
* Creation, modification, and deletion of constraints.

For more information, see [CREATE | ALTER TABLE … CONSTRAINT](sql/create-table-constraint.md).

## Supported constraint types

Snowflake supports the following constraint types from the ANSI SQL standard:

* **PRIMARY KEY**: Guarantees that all of the values in a column are distinct and that the column
  can’t store NULL values. The primary key uniquely identifies each row in a table.
* **UNIQUE**: Guarantees that all of the values in a column are distinct. Unlike a PRIMARY KEY constraint,
  a column with a UNIQUE constraint can have NULL values.
* **FOREIGN KEY**: Enforces referential integrity by requiring values in a column or set of columns
  to match values in another table or the same table.
* **NOT NULL**: Ensures that a column can’t store a NULL value.
* **CHECK**: Enforces a SQL expression as a condition on the values that can be inserted into or updated in one or more
  columns of a table. For more information, see CHECK constraints.

A table can have multiple unique keys and foreign keys, but only one primary key. A PRIMARY KEY constraint implies that the
column is both NOT NULL and UNIQUE.

All foreign keys must reference a corresponding primary or unique key that matches the column types of each column in the
foreign key. The primary key for a foreign key can be on a different table or the same table as the foreign key. When you
define FOREIGN KEY constraints across [hybrid tables](../user-guide/tables-hybrid.md), the tables must be in the same database.

The following table summarizes the differences in behavior between standard tables and hybrid tables,
with respect to the enforcement of constraints and whether constraints are required:

* A constraint is *enforced* when it protects a column from being updated in certain ways. For example, a column that is
  declared NOT NULL can’t contain a NULL value. An attempt to copy or insert a NULL value into a NOT NULL column results in an
  error. For hybrid tables, you can’t set the NOT ENFORCED property on PRIMARY KEY, FOREIGN KEY, and UNIQUE constraints. Setting
  this property results in an `invalid constraint property` error.
* A constraint is *required* when one or more columns in a table must have such a constraint, which is only true for
  PRIMARY KEY constraints on hybrid tables.

| Feature | Hybrid tables | Standard tables |
| --- | --- | --- |
| PRIMARY KEY constraints | Required, enforced | Optional, not enforced |
| FOREIGN KEY constraints | Optional, enforced (referential integrity) | Optional, not enforced |
| UNIQUE constraints | Optional, enforced | Optional, not enforced |
| NOT NULL constraints | Optional, enforced | Optional, enforced |
| CHECK constraints | Not supported | Optional, enforced |

## Table constraints

Snowflake supports constraints on permanent, transient, temporary, and hybrid
tables. You can define constraints on columns of all data types, and you can
include any number of columns in a single constraint.

The following are considerations for constraints:

* When you copy a table by using CREATE TABLE … LIKE or CREATE TABLE … CLONE,
  all existing constraints on the table, including foreign keys, are copied to the
  new table. CREATE TABLE … CLONE isn’t supported for hybrid tables.
* Additional commands and functions, such as DROP, UNDROP, and GET_DDL are
  supported for tables with constraints. They are also supported for schemas
  and databases.

  For [Snowflake Time Travel](../user-guide/data-time-travel.md), when previous versions of a table
  are copied, the current version of the constraints on the table are used because Snowflake
  doesn’t store previous versions of constraints in table metadata.

## Single-column and multi-column constraints

You can define constraints on a single column or on multiple columns in the same
table.

For multi-column constraints (composite primary keys or unique keys), the
columns are ordered, and each column has a corresponding key sequence.

## Inline and out-of-line constraints

Constraints are defined either inline or out-of-line during table creation or
modification:

* Inline constraints are created as part of the column definition and can only
  be used for single-column constraints.
* Out-of-line constraints are defined using a separate clause that specifies the
  column or columns on which the constraint is created. They can be used for creating
  either single-column or multi-column constraints, as well as for creating
  constraints on existing columns.

## Constraints in GET_DDL

The SQL statements that [GET_DDL](functions/get_ddl.md) returns include the
clauses that define constraints; however, note the following:

* Single-column constraints, such as `NOT NULL` and `DEFAULT`, are
  reconstructed inline with the definition of the column.
* Table constraints, such as unique, primary, and foreign keys, are always reconstructed as
  out-of-line constraints, even if they consist of a single column.
* For unnamed constraints — that is, constraints with a system-generated name —
  [GET_DDL](functions/get_ddl.md) doesn’t return the system-generated name.

## CHECK constraints

A CHECK constraint enforces a SQL expression as a condition on the values that can be inserted into or updated in one or more
columns of a table. For example, a CHECK constraint might
ensure that the `quantity` column in a table only contains values that are greater than zero or that the
`salary` column in a table only contains values in a specific range.

You can specify a CHECK constraint by using [CONSTRAINT clause](sql/create-table-constraint.md)
in the following SQL commands:

* [CREATE TABLE](sql/create-table.md)
* [ALTER TABLE](sql/alter-table.md)
* [CREATE ICEBERG TABLE](sql/create-iceberg-table.md)
* [ALTER ICEBERG TABLE](sql/alter-iceberg-table.md)

You can show information about existing CHECK constraints by querying the
[CHECK_CONSTRAINTS view](info-schema/check_constraints.md).

Check constraints are enforced during the following DML operations:

* [INSERT](sql/insert.md)
* [UPDATE](sql/update.md)
* [MERGE](sql/merge.md)
* [CREATE TABLE … AS SELECT (CTAS)](sql/create-table.md)

If the condition evaluates to TRUE or NULL, the DML operation proceeds. If the condition
evaluates to FALSE, the CHECK constraint fails.

For examples of CHECK constraints, see [Examples of constraints with standard tables](sql/create-table-constraint.md).

### Usage notes

* Check constraints are always enforced.
* You can use the following [ALTER TABLE](sql/alter-table.md) commands and the Iceberg equivalents to work
  with CHECK constraints:

  + ALTER TABLE … RENAME CONSTRAINT
  + ALTER TABLE … ADD [ CONSTRAINT <constraint_name> ] CHECK ( <expr> ) ENABLE [ VALIDATE | NOVALIDATE ]

    - ENABLE VALIDATE, the default for CHECK constraints, enforces the constraint for all existing rows and for
      all rows that are inserted or updated after you run the command. ENABLE VALIDATE is only supported for
      new tables, not for existing tables.
    - ENABLE NOVALIDATE enforces the constraint for all rows that are inserted or updated after you run the
      command, but doesn’t enforce the constraint for existing rows.
  + ALTER TABLE … ALTER CONSTRAINT <constraint_name> ENABLE [ VALIDATE | NOVALIDATE ]

    If you change a CHECK constraint from NOVALIDATE to VALIDATE, the constraint is enforced on all existing
    rows before it is changed to VALIDATE.
  + ALTER TABLE … DROP CONSTRAINT
* The following [ALTER TABLE](sql/alter-table.md) commands and Iceberg equivalents can operate on
  a column with a CHECK constraint defined on it:

  + ALTER TABLE … ALTER COLUMN

    Only operations that don’t modify a CHECK constraint are supported.
  + ALTER TABLE … RENAME COLUMN

    Check constraints that reference the renamed column are implicitly updated to use the new
    column name.
  + ALTER TABLE … DROP COLUMN

    The operation fails if the column being dropped is used by an existing CHECK constraint that
    also references another column. In this case, delete the constraint before deleting the column.
* If records violate a CHECK constraint during ingestion, the entire batch operation fails the first time
  it encounters a record that isn’t valid.

### Limitations

* Only standard tables and Snowflake-managed Iceberg tables support CHECK constraints. Other types of tables,
  such as hybrid tables, don’t support CHECK constraints.
* The expression associated with an existing CHECK constraint can’t be modified using an ALTER TABLE command.
  To modify the expression, drop and re-create the CHECK constraint.
* CHECK constraints can’t be specified in CREATE OR ALTER TABLE commands.
* The following operations don’t support CHECK constraints:

  + If you attempt to COPY INTO a table with CHECK constraints, the operation fails.
  + If you attempt to create a pipe with a target table that has CHECK constraints, the operation fails.
  + If you attempt streaming ingestion into a table that has CHECK constraints, the operation fails.
  + If you attempt external writes on Iceberg tables that have CHECK constraints, the operation fails.

---
title: PAID_LISTING_ACCESS_AND_CHANGE_LOG view
source: https://docs.snowflake.com/en/sql-reference/data-sharing-usage/paid-listing-access-change-log.md
section: SQL General Reference
---

Schema:
:   [DATA_SHARING_USAGE](../data-sharing-usage.md)

# PAID_LISTING_ACCESS_AND_CHANGE_LOG view

> **Note:**
>
> The CURRENT_PRICING_PLAN column includes new fields, indicated with an asterisk (\*), that are available only for the Offers preview.

Providers can use this view in the [Data Sharing Usage](../data-sharing-usage.md) schema to query a consumer’s paid listing change log to determine the status of a pricing plan and when consumers will lose access to paid or trial listings.

This view is updated when a consumer changes their pricing plan, starts or ends a trial listing, or when they cancel a subscription.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| EVENT_DATE | DATETIME | The date and time the row was created. |
| LISTING_NAME | VARCHAR | The name of the listing associated with the pricing plan. |
| LISTING_DISPLAY_NAME | VARCHAR | The listing display name. |
| LISTING_GLOBAL_NAME | VARCHAR | The Unique Listing Locator (ULL) for the listing. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | The consumer account name. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | The Snowflake consumer account locator. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | The consumer organization name. |
| CONSUMER_SNOWFLAKE_REGION | VARCHAR | The Snowflake region that corresponds to the consumer’s organization billing and shipping addresses. |
| CURRENT_PRICING_PLAN | VARIANT | Information in JSON format about the pricing plan that was active on the date and time specified in the EVENT_DATE column. The following information can be returned:   * PLAN_TYPE: Type of Pricing Plan. Values can be: (SUBSCRIPTION, USAGE) * PLAN_SUB_TYPE: One of:  + If PLAN_TYPE is SUBSCRIPTION: One of INSTALLMENTS or FIXED.   + If PLAN_TYPE is USAGE: One of PER_QUERY, COMPUTE_POOL_SURCHARGE, or CUSTOM_BILLING. * FREE_QUERIES_LIMIT: The number of free queries a consumer is allowed before a usage price applies. * PER_QUERY_PRICE: The cost per query for usage based plans. * BASE_PRICE: The base monthly usage fee. * MAX_MONTHLY_PRICE_LIMIT: The maximum monthly usage fee. * MAX_TRIAL_DAYS_LIMIT: The time limit for consumer trials. * MAX_TRIAL_QUERIES_LIMIT: The usage limit for consumer trials. * BILLING_DURATION_MONTHS: The contract duration in months. * CUSTOM_BILLING_EVENTS: A list of existing custom billing events. One of:    + CLASS_NAME: Billing event class name.   + DISPLAY_NAME: The name of the billing event.   + UNIT_PRICE: Price per billable unit. * DESCRIPTION: The pricing plan description. * IS_AUTO_RENEWAL_ALLOWED: Indicates if the consumer can automatically renew the listing: `true` or `false`. * INSTALLMENT_SCHEDULE: The installment plan schedule.    + INSTALLMENT_DURATION_MONTHS: The number of months installments are due.   + DEFAULT_INSTALLMENT_PRICE: The usual dollar amount of each installment payment, excluding exceptions like prorated or upfront payments.   + INSTALLMENT_OVERRIDES: A list of overridden installments. One of:      - INSTALLMENT_NUMBER: Installment number being overridden.     - INSTALLMENT_PRICE: The installment price. * IS_EARLY_ACCESS_ALLOWED: Consumers can access data before payment. * COMPUTE_POOL_SURCHARGES: A list of the defined Snowpark Container Services (SPCS) compute pool surcharges. One of:    + COMPUTE_POOL_NAME: Compute pool name.   + UNIT_PRICE: Price per credit or per hour for compute pool usage. * CURRENCY: Pricing plan currency in United States dollars. * \*CONTRACT_TYPE: For a flat-fee plan, this can be for a limited time or recurring (subscription). For a usage-based plan, this is always “pay as you go.” * \*COMPUTE_POOL_SURCHARGE_TYPE: The unit on which the surcharge would be applicable. This can be `HOUR` or `CREDIT`. * \*CONTRACT_DURATION_MONTHS: The length of the contract in months. * \*INVOICE_START_PREFERENCE: Indicates whether the first invoice is sent when the offer is accepted or on a specified date. Can be one of `OFFER_ACCEPTED_DATE`, `SPECIFIED_DATE`, or `FIRST_DAY_NEXT_MONTH`. If `SPECIFIED_DATE` is selected, then see `INVOICE_START_TIME`. * \*INVOICE_START_TIME: The date of the first invoice. * \*IS_DEFAULT: This field is always present. If `true`, it indicates that this is a default offer attached to a pricing plan. If `false`, it indicates that this is a private offer targeted to a specific consumer. * \*OFFER_DISPLAY_NAME: The name of the offer as shown to consumers. * \*PRICING_PLAN_DISPLAY_NAME: The name of the pricing plan as shown to consumers. * \*PAYMENT_TERMS: The payment terms. * \*PAYMENT_TYPE: The payment type. This can be `FULL` for paid in full or `INSTALLMENT` for a payment made in installments. * \*ALLOWED_PAYMENT_METHODS: The allowed payment methods. * \*ACCESS_START_PREFERENCE: The preference of access start. This can be one of `OFFER_ACCEPTED_DATE` or `SPECIFIC_DATE`. * \*ACCESS_START_TIME: The time when access starts. * \*ACCESS_END_TIME: The date and time when access ends. * \*TARGET_CONSUMER: The consumer targeted by this offer. * \*STATE: The offer state. * \*PRODUCT_SKU: The SKU linked to a pricing plan that’s tied to an offer. * \*PRICING_MODEL: Specifies whether the pricing plan tied to the offer is a flat fee or a usage model. |
| NEXT_PRICING_PLAN | VARIANT | Information in JSON format about the pricing plan that becomes active on the date and time specified in the `CURRENT_PRICING_PLAN_END_ON` column. The JSON format for this column is identical to that of `CURRENT_PRICING_PLAN`. |
| IS_CONSUMER_AUTO_RENEWAL_ENABLED | BOOLEAN | The consumer enabled auto-renewal. This is applicable only to subscription listings. |
| PURCHASE_STATE | VARCHAR | The listing state. The state can be one of:   * TRIAL * EXPIRED * DELETED * PURCHASED |
| CURRENT_PRICING_PLAN_START_ON | DATETIME | The date and time a pricing plan became active. |
| CURRENT_PRICING_PLAN_END_ON | DATETIME | The date and time the current pricing plan ends. |
| TRIAL_END_ON | DATETIME | The end date of the listing trial. |
| ACCESS_END_ON | DATETIME | The date and time the current subscription term ends. NULL indicates that the current plan is not a subscription, but a usage based plan instead. |

## Usage notes

* Latency for the view may be up to two days.
* The data is retained up to one year.
* A row is created when any column changes. For example, when a consumer changes their pricing plan, starts or ends a trial, cancels a subscription, or deletes the data.
* The data includes all consumers who have accessed the listing at least once, including those who have canceled their listing subscription or trial.
* The view contains data for paid Snowflake listings that have one or more consumers.
* This report contains one row per listing per consumer. For example, if a consumer purchased two listings from a provider and each purchase was updated three times, then the view contains six entries. An individual column represents the state of a single, specific purchase.
* The following data is not included in the view:

  + Limited trial listings
  + Free listings - provided without charge on or off the Snowflake platform
  + Free listings - provided without charge on-platform, but paid off-platform directly to the provider
  + Listings that have never been accessed by consumers

## Examples

Show a change log for a specified listing and consumer:

```sqlexample
SELECT
  event_date,
  listing_name,
  listing_global_name,
  consumer_account_name,
  consumer_account_locator,
  consumer_organization_name,
  current_pricing_plan,
  next_pricing_plan,
  is_consumer_auto_renewal_enabled,
  purchase_state,
  current_pricing_plan_start_on,
  current_pricing_plan_end_on,
  trial_end_on,
  access_end_on
FROM snowflake.data_sharing_usage.paid_listing_access_and_change_log
WHERE TRUE
  AND consumer_organization_name = 'specific_organization_name'
  AND listing_display_name = 'specific_listing_display_name'
ORDER BY event_date DESC;
```

Show listings and consumers with pricing plans ending in the next billing period:

```sqlexample
SELECT
  event_date,
  listing_name,
  listing_global_name,
  consumer_account_name,
  consumer_account_locator,
  consumer_organization_name,
  current_pricing_plan,
  next_pricing_plan,
  is_consumer_auto_renewal_enabled,
  purchase_state,
  current_pricing_plan_start_on,
  current_pricing_plan_end_on,
  trial_end_on,
  access_end_on
FROM snowflake.data_sharing_usage.paid_listing_access_and_change_log
WHERE TRUE
  AND consumer_organization_name = 'specific_organization_name'
  AND listing_display_name = 'specific_listing_display_name'
QUALIFY TRUE
  AND ROW_NUMBER() OVER (
   PARTITION BY
    consumer_organization_name,
    consumer_snowflake_region,
    consumer_account_name,
    listing_display_name
ORDER BY event_date DESC ) = 1;
```

---
title: Parameters
source: https://docs.snowflake.com/en/sql-reference/parameters.md
section: SQL General Reference
---

# Parameters

Snowflake provides parameters that let you control the behavior of your account, individual user sessions, and objects. All
parameters have default values. You can set these parameters and override them at different levels, depending on the parameter
type (account, session, or object).

## Parameter hierarchy and types

This section describes the different types of parameters and the levels at which each type can be set. There are three types of
parameters:

* Account parameters
* Session parameters
* Object parameters

The following diagram illustrates the hierarchical relationship between the different parameter types and how individual
parameters can be overridden at each level:

### Account parameters

You can only set account parameters at the account level, if you are using a role that has been granted the privilege to set the
parameter. To set an account parameter, you run the [ALTER ACCOUNT](sql/alter-account.md) command.

Snowflake provides the following account parameters:

| Parameter | Notes |
| --- | --- |
| ACCOUNT_LEVEL_FILE_EXTENSIONS_ALLOW_LIST_FOR_PRIVATE_WORKSPACES | Used to specify the file extensions allowed in private workspaces for the account. |
| ACCOUNT_LEVEL_FILE_EXTENSIONS_ALLOW_LIST_FOR_SHARED_WORKSPACES | Used to specify the file extensions allowed in shared workspaces for the account. |
| ALLOW_BIND_VALUES_ACCESS | Used to allow clients to access bind variable values. |
| ALLOW_CLIENT_MFA_CACHING |  |
| ALLOW_ID_TOKEN | Used to enable connection caching in browser-based single sign-on (SSO) for Snowflake-provided clients. |
| ALLOWED_SPCS_WORKLOAD_TYPES | Used to specify the workload types that are allowed in your account to deploy to Snowpark Container Services. |
| CLIENT_ENCRYPTION_KEY_SIZE | Used for encryption of files staged for data loading or unloading; might require additional installation and configuration (see description for details). |
| CORTEX_ENABLED_CROSS_REGION | Used to enable cross-region processing of Snowflake Cortex calls in a different region if the call cannot be processed in your account region. |
| DEFAULT_DBT_VERSION | Used to set the default version for all future dbt project objects created in an account. |
| DISABLE_USER_PRIVILEGE_GRANTS | Used to disable granting of privileges directly to users. For more information, see [GRANT privileges to USERS Usage notes](sql/grant-privilege-user.md). |
| DISALLOWED_SPCS_WORKLOAD_TYPES | Used to specify the workload types that are disallowed in your account to deploy to Snowpark Container Services. |
| ENABLE_AUTOMATIC_SENSITIVE_DATA_CLASSIFICATION_LOG | Controls whether events from sensitive data classification are logged to the user event table. |
| ENABLE_BUDGET_EVENT_LOGGING | Controls whether events from budgets are logged to the event table. |
| ENABLE_EGRESS_COST_OPTIMIZER | Used to enable or disable listing auto-fulfillment egress cost egress optimization. |
| ENABLE_IDENTIFIER_FIRST_LOGIN |  |
| ENABLE_INTERNAL_STAGES_PRIVATELINK | Allows the [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) function to return the `private-internal-stages` key in the query result. |
| ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK | Allows the [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) function to return the `privatelink-snowflake-managed-storage-volume-nfs` and `privatelink-snowflake-managed-storage-volume-fs` keys in the query result on Azure deployments. |
| ENABLE_NOTEBOOK_CREATION_IN_PERSONAL_DB | Used to enable or disable private notebooks on a Snowflake account. |
| ENABLE_SPCS_BLOCK_STORAGE_SNOWFLAKE_FULL_ENCRYPTION_ENFORCEMENT | Used to enable enforcement of SNOWFLAKE_FULL encryption for Snowpark Container Services [block-storage volumes and snapshots](../developer-guide/snowpark-container-services/block-storage-volume.md). |
| ENABLE_TAG_PROPAGATION_EVENT_LOGGING | Controls whether Snowflake collects telemetry data for tag propagation. |
| ENABLE_TRI_SECRET_AND_REKEY_OPT_OUT_FOR_IMAGE_REPOSITORY | Used to specify an image Repository’s choice to opt out of Tri-Secret Secure and [Periodic rekeying](../user-guide/security-encryption-manage.md). |
| ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES |  |
| ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME |  |
| EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST |  |
| INITIAL_REPLICATION_SIZE_LIMIT_IN_TB |  |
| LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE | Used to set the refresh schedule for all listings in an account. |
| MIN_DATA_RETENTION_TIME_IN_DAYS | Used to set the minimum data retention period for retaining historical data for Time Travel operations. |
| NETWORK_POLICY | This is the only account parameter that can be set by either account administrators (i.e users with the ACCOUNTADMIN system role) or security administrators (i.e users with the SECURITYADMIN system role). . For more information, see Object parameters. |
| OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST |  |
| PERIODIC_DATA_REKEYING |  |
| READ_CONSISTENCY_MODE |  |
| REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_CREATION |  |
| REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_OPERATION |  |
| SQL_TRACE_QUERY_TEXT | Used to specify whether to capture the SQL text of a traced SQL statement. |
| SSO_LOGIN_PAGE |  |
| USE_WORKSPACES_FOR_SQL | Used to enable or disable [Workspaces](../user-guide/ui-snowsight/workspaces.md) as the default SQL editor for the account. |

> **Note:**
>
> By default, account parameters are not displayed in the output of [SHOW PARAMETERS](sql/show-parameters.md). For
> information about viewing account parameters, see Viewing the Parameters and Their Values (in this topic).

### Session parameters

Most parameters are session parameters, which you can set at the following levels:

Account:
:   Account administrators can run the [ALTER ACCOUNT](sql/alter-account.md) command to set session parameters for the
    account.

    The values that you set at this level become the default values for individual users and their sessions.

User:
:   Administrators with the appropriate privileges (typically, a user who has been granted the SECURITYADMIN role) can run
    the [ALTER USER](sql/alter-user.md) command to override session parameters for individual users. In addition, individual
    users can run the ALTER USER command to override default sessions parameters for themselves.

    The values set that you set for a user become the default values in any session started by that user.

Session:
:   Users can run the [ALTER SESSION](sql/alter-session.md) command to override session parameters for the current
    session.

> **Note:**
>
> By default, only session parameters are displayed in the output of [SHOW PARAMETERS](sql/show-parameters.md). For information
> about viewing account and object parameters, see Viewing the Parameters and Their Values (in this topic).

### Object parameters

You can set object parameters at the following levels:

Account:
:   Account administrators can run the [ALTER ACCOUNT](sql/alter-account.md) command to set object parameters for objects
    in the account.

    The values that you set at this level become the default values for individual objects created in the account.

Object:
:   Users with the appropriate privileges can run the [CREATE <object>](sql/create.md) or [ALTER <object>](sql/alter.md)
    commands to override object parameters for an individual object.

Snowflake provides the following object parameters:

| Parameter | Object Type | Notes |
| --- | --- | --- |
| AUTO_EVENT_LOGGING | Snowflake Scripting stored procedure |  |
| BASE_LOCATION_PREFIX | Database, Schema | Specifies a prefix to use in the write path for Apache Iceberg™ table files. |
| CATALOG | Database, Schema, Apache Iceberg™ table |  |
| CATALOG_SYNC | Account, Database, Schema, Apache Iceberg™ table | This parameter is only supported for Snowflake-managed Iceberg tables that you sync with Open Catalog. |
| CORTEX_MODELS_ALLOWLIST | Cortex AI Functions and models | Comma-separated names of allowed Cortex language models, `'All'`, or `'None'`. |
| DATA_METRIC_SCHEDULE | Table | Specifies the schedule to run the data metric functions associated to the table. All data metric functions on the table or view follow the same schedule. |
| DATA_RETENTION_TIME_IN_DAYS | Database, Schema, Table |  |
| DEFAULT_DDL_COLLATION | Database, Schema, Table |  |
| DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU | Database, Schema | [System compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md) |
| DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU | Database, Schema | [System compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md) |
| DEFAULT_STREAMLIT_COMPUTE_POOL | Account | [Configuring your own preferred compute pools for Streamlit apps](../developer-guide/snowpark-container-services/working-with-compute-pool.md) |
| DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE | Account, Database, Schema |  |
| DISABLE_UI_DOWNLOAD_BUTTON | Account, User |  |
| ENABLE_DATA_COMPACTION | Account, Database, Schema, Apache Iceberg™ table | This parameter is only supported for Snowflake-managed Iceberg tables. |
| ENABLE_ICEBERG_MERGE_ON_READ | Account, Database, Schema, Apache Iceberg™ table |  |
| ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR | User | Affects the query history for queries that fail because of syntax or parsing errors. |
| ENABLE_UNREDACTED_SECURE_OBJECT_ERROR | User | Affects redaction of error messages related to secure objects in metadata. |
| EVENT_TABLE | Database, Account |  |
| EXTERNAL_VOLUME | Database, Schema, Apache Iceberg™ table |  |
| ICEBERG_VERSION | Apache Iceberg™ table |  |
| ICEBERG_VERSION_DEFAULT | Account, Database, Schema |  |
| LOG_LEVEL | Account, Database, Schema, DCM project, Stored Procedure, Function, Dynamic Table, Iceberg table, Task, Service. | Log messages from logging APIs. |
| LOG_EVENT_LEVEL | Account, Database, Schema, DCM project, Stored Procedure, Function, Dynamic Table, Iceberg table, Task, Service. | Log events (record type EVENT) written to the event table. |
| MAX_CONCURRENCY_LEVEL | Warehouse |  |
| MAX_DATA_EXTENSION_TIME_IN_DAYS | Database, Schema, Table |  |
| METRIC_LEVEL | Account, Database, Schema, Stored Procedure, Function |  |
| NETWORK_POLICY | User | This is the only user parameter that can be set by either account administrators (users with the ACCOUNTADMIN system role) or security administrators (users with the SECURITYADMIN system role).  If this parameter is set on the account and a user in the same account, the user-level network policy overrides the account-level network policy. |
| PATH_LAYOUT | Apache Iceberg™ table | Specifies the path layout for Parquet data files written to partitioned Iceberg tables. |
| PIPE_EXECUTION_PAUSED | Schema, Pipe |  |
| PREVENT_UNLOAD_TO_INLINE_URL | User |  |
| PREVENT_UNLOAD_TO_INTERNAL_STAGES | User |  |
| REPLACE_INVALID_CHARACTERS | Database, Schema, file format, Apache Iceberg™ table | Can only be set for Iceberg tables that use an external Iceberg catalog. |
| `ROW_TIMESTAMP` | Database, Schema, Table | Use this parameter to enable row timestamps on your tables. For more information, see [Use row timestamps to measure latency in your pipelines](../user-guide/data-engineering/row-timestamps.md). |
| `ROW_TIMESTAMP_DEFAULT` | Database, Schema, Table | Use this parameter to set row timestamps by default for new tables in a container. For more information, see [Use row timestamps to measure latency in your pipelines](../user-guide/data-engineering/row-timestamps.md). |
| SERVERLESS_TASK_MAX_STATEMENT_SIZE | Database, Schema, Task, Account |  |
| SERVERLESS_TASK_MIN_STATEMENT_SIZE | Database, Schema, Task, Account |  |
| STATEMENT_QUEUED_TIMEOUT_IN_SECONDS | Warehouse | Also a session parameter (can be set at both the object and session levels). For inheritance and override details, see the parameter description. |
| STATEMENT_TIMEOUT_IN_SECONDS | Warehouse | Also a session parameter (can be set at both the object and session levels). For inheritance and override details, see the parameter description. |
| STORAGE_SERIALIZATION_POLICY | Database, Schema, Apache Iceberg™ table | This parameter is only supported for Iceberg tables that use Snowflake as the catalog. |
| SUSPEND_TASK_AFTER_NUM_FAILURES | Database, Schema, Task |  |
| TASK_AUTO_RETRY_ATTEMPTS | Database, Schema, Task |  |
| TRACE_LEVEL | Account, Database, Schema, Stored Procedure, Function |  |
| USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE | Database, Schema, Task |  |
| USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS | Database, Schema, Task |  |
| USER_TASK_TIMEOUT_MS | Database, Schema, Task |  |

> **Note:**
>
> By default, object parameters are not displayed in the output of [SHOW PARAMETERS](sql/show-parameters.md). For
> information about viewing object parameters, see Viewing the Parameters and Their Values (in this topic).

## Viewing the parameters and their values

To view the parameters that are set and their default values, run the [SHOW PARAMETERS](sql/show-parameters.md) command. You can
run the command with different command parameters to display different types of parameter:

* Viewing session parameters
* Viewing object parameters
* Viewing all parameters (including account and object parameters)
* Limiting the list of parameters by name

### Viewing session parameters

By default, the command displays only session parameters:

```sqlexample
SHOW PARAMETERS;
```

### Viewing object parameters

To display the object parameters for a specific object, include the IN clause with the object type and name. For example:

```sqlexample
SHOW PARAMETERS IN DATABASE mydb;
```

```sqlexample
SHOW PARAMETERS IN WAREHOUSE mywh;
```

### Viewing all parameters (including account and object parameters)

To display all parameters, including account and object parameters, include the IN ACCOUNT clause:

```sqlexample
SHOW PARAMETERS IN ACCOUNT;
```

### Limiting the list of parameters by name

You can specify the LIKE clause to limit the list of parameters by name. For example:

* To display the session parameters with names containing “time”:

  ```sqlexample
  SHOW PARAMETERS LIKE '%time%';
  ```
* To display all the parameters with names starting with “time”:

  ```sqlexample
  SHOW PARAMETERS LIKE 'time%' IN ACCOUNT;
  ```

> **Note:**
>
> You must specify the LIKE clause before the IN clause.

## ABORT_DETACHED_QUERY

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies the action that Snowflake performs for in-progress queries if connectivity is lost due to abrupt termination of a session (e.g. network outage, browser termination, service
    interruption).

Values:
:   `TRUE`: In-progress queries are aborted 5 minutes after connectivity is lost.

    `FALSE`: In-progress queries are completed.

Default:
:   `FALSE`

> **Note:**
>
> * For client drivers, closing the connection from the client side (such as calling `connection.close()`) is different from actually logging out from the Snowflake session. Closing the connection can be associated with cleaning up resources owned by the connection, including but not limited to performing a session logout. Performing a session logout also implies that any queries still running in the same session (for example, queries submitted asynchronously) are canceled after a couple of minutes when the session is logged out, even if the ABORT_DETACHED_QUERY parameter is set to `false` (the default value).
>
>   Therefore, some Snowflake drivers implement their own business logic to decide whether session logout is performed when the connection is closed.
>
>   Currently, this functionality is implemented in the following drivers:
>
>   + [JDBC Driver](../developer-guide/jdbc/jdbc-using.md)
>   + [Snowflake Connector for Python](../developer-guide/python-connector/python-connector-example.md)
>   + [Go Snowflake Driver](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#hdr-Asynchronous_Queries)
> * Most queries require compute resources in order to be executed. These resources are provided by virtual warehouses, which consume credits while running. If the Snowflake session is not terminated when the connection closes, warehouses might continue running and consuming credits to complete any queries that were in progress at the time the connection was closed, up to the value of the STATEMENT_TIMEOUT_IN_SECONDS parameter, which has a default of two days.

## ACTIVE_PYTHON_PROFILER

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   Sets the profiler to use for the session when [profiling Python handler code](../developer-guide/stored-procedure/python/procedure-python-profiler.md).

Values:
:   `'LINE'`: To have the profile focus on line use activity.

    `'MEMORY'`: To have the profile focus on memory use activity.

Default:
:   None.

## ACCOUNT_LEVEL_FILE_EXTENSIONS_ALLOW_LIST_FOR_PRIVATE_WORKSPACES

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the file extensions that are allowed in private workspaces for the account. The value is a comma-separated list of extensions,
    for example: `.ipynb,.sql,.txt`. If the parameter is empty (default), all file extensions are allowed.

    When the allow list is non-empty:

    * Only the listed extensions are permitted; all others are blocked.
    * Files uploaded through Workspaces with a non-allowed extension will immediately fail to upload.
    * If a file is renamed to use a non-allowed extension, the file becomes inaccessible within the workspace.
    * Pre-existing files with disallowed extensions will not appear in Workspaces.
    * Users can still use the Snowflake CLI `PUT` command to upload files with non-allowed extensions to a workspace’s virtual stage or a Notebook
      Project Object’s virtual stage. However, these files are inaccessible and cannot be used, viewed, downloaded (via `GET`), or listed (via `LIST`)
      from within the workspace or Notebook Project Object environment.
    * To maintain core workspace functionality, include `.ipynb` and `.sql` in the allow list.
    * Files without an extension (for example, `Makefile`) are not allowed once the list is non-empty.
    * Dotfiles (for example, `.gitignore` or `.venv`) must be explicitly added to the list.
    * Extension matching is case-sensitive. For example, if `.txt` is in the list, `.TXT` is not allowed.

Default:
:   Empty string (all extensions allowed)

## ACCOUNT_LEVEL_FILE_EXTENSIONS_ALLOW_LIST_FOR_SHARED_WORKSPACES

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the file extensions that are allowed in shared workspaces for the account. The value is a comma-separated list of extensions,
    for example: `.ipynb,.sql,.txt`. If the parameter is empty (default), all file extensions are allowed.

    When the allow list is non-empty:

    * Only the listed extensions are permitted; all others are blocked.
    * Files uploaded through Workspaces with a non-allowed extension will immediately fail to upload.
    * If a file is renamed to use a non-allowed extension, the file becomes inaccessible within the workspace.
    * Pre-existing files with disallowed extensions will not appear in Workspaces.
    * Users can still use the Snowflake CLI `PUT` command to upload files with non-allowed extensions to a workspace’s virtual stage or a Notebook
      Project Object’s virtual stage. However, these files are inaccessible and cannot be used, viewed, downloaded (via `GET`), or listed (via `LIST`)
      from within the workspace or Notebook Project Object environment.
    * To maintain core workspace functionality, include `.ipynb` and `.sql` in the allow list.
    * Files without an extension (for example, `Makefile`) are not allowed once the list is non-empty.
    * Dotfiles (for example, `.gitignore` or `.venv`) must be explicitly added to the list.
    * Extension matching is case-sensitive. For example, if `.txt` is in the list, `.TXT` is not allowed.
    * Files with disallowed extensions that originate from a connected Git repository are visible to the user but remain inaccessible within the workspace.

Default:
:   Empty string (all extensions allowed)

## ALLOW_BIND_VALUES_ACCESS

Type:
:   Account — Can only be set for Account

Data Type:
:   Boolean

Description:
:   Specifies whether clients can access [bind variable](bind-variables.md) values by using the [BIND_VALUES](functions/bind_values.md) table function, the [QUERY_HISTORY Account Usage view](account-usage/query_history.md), the [QUERY_HISTORY Organization Usage view](organization-usage/query_history.md), or the [QUERY_HISTORY function](functions/query_history.md). For more information, see [Retrieve bind variable values](bind-variables.md).

Values:
:   `TRUE`: Allows the retrieval of bind variable values.

    `FALSE`: Doesn’t allow retrieval of bind variable values.

Default:
:   `TRUE`

## ALLOW_CLIENT_MFA_CACHING

Type:
:   Account — Can only be set for Account

Data Type:
:   Boolean

Description:
:   Specifies whether an MFA token can be saved in the client-side operating system keystore to promote continuous, secure connectivity without users needing to respond to an MFA prompt at the start of each connection attempt to Snowflake. For details and the list of supported Snowflake-provided clients, see [Using MFA token caching to minimize the number of prompts during authentication — optional](../user-guide/security-mfa.md).

Values:
:   `TRUE`: Stores an MFA token in the client-side operating system keystore to enable the client application to use the MFA token whenever a new connection is established. While true, users are not prompted to respond to additional MFA prompts.

    `FALSE`: Does not store an MFA token. Users must respond to an MFA prompt whenever the client application establishes a new connection with Snowflake.

Default:
:   `FALSE`

## ALLOW_ID_TOKEN

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether a connection token can be saved in the client-side operating system keystore to promote continuous, secure connectivity without users needing to enter login credentials at the start of each connection attempt to Snowflake. For details and the list of supported Snowflake-provided clients, see [Using connection caching to minimize the number of prompts for authentication — Optional](../user-guide/admin-security-fed-auth-use.md).

Values:
:   `TRUE`: Stores a connection token in the client-side operating system keystore to enable the client application to perform browser-based SSO without prompting users to authenticate whenever a new connection is established.

    `FALSE`: Does not store a connection token. Users are prompted to authenticate whenever the client application establishes a new connection with Snowflake. SSO to Snowflake is still possible if this parameter is set to false.

Default:
:   `FALSE`

## ALLOWED_SPCS_WORKLOAD_TYPES

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the workload types that are allowed in your account to deploy to Snowpark Container Services. Also see DISALLOWED_SPCS_WORKLOAD_TYPES.

Values:
:   The value is a comma-separated list of the following supported workload types:

    * `USER`: Any workloads directly deployed by users.
    * `NOTEBOOK`: Snowflake Notebooks.
    * `STREAMLIT`: Streamlit in Snowflake.
    * `MODEL_SERVING`: ML Model Serving.
    * `ML_JOB`: Snowflake ML Jobs.
    * `ALL`: All workloads.

Default:
:   `ALL`

> **Note:**
>
> If you configure both ALLOWED_SPCS_WORKLOAD_TYPES and DISALLOWED_SPCS_WORKLOAD_TYPES, DISALLOWED_SPCS_WORKLOAD_TYPES takes precedence. For example, if you configure both these parameters and specify the `NOTEBOOK` workload, `NOTEBOOK` workloads aren’t allowed to run on Snowpark Container Services.

## AUTO_EVENT_LOGGING

Type:
:   Object (for Snowflake Scripting stored procedures)

Data Type:
:   String (Constant)

Description:
:   Controls whether Snowflake Scripting log messages and trace events are ingested automatically into the
    [event table](../developer-guide/logging-tracing/event-table-setting-up.md). To set this parameter, run the
    [ALTER PROCEDURE](sql/alter-procedure.md) command.

Values:
:   * `LOGGING`: Automatically adds the following additional logging information to the event table when a
      procedure is executed:

      + BEGIN/END of a Snowflake Scripting block.
      + BEGIN/END of a child job request.

      This information is added to the event table only if the effective LOG_LEVEL is set
      to `TRACE` for the stored procedure.
    * `TRACING`: Automatically adds the following additional trace information to the event table when a
      stored procedure is executed:

      + Exception catching.
      + Information about child job execution.
      + Child job statistics.
      + Stored procedure statistics, including execution time and input values.

      This information is added to the event table only if the effective TRACE_LEVEL is set
      to `ALWAYS` or `ON_EVENT` for the stored procedure.
    * `ALL`: Automatically adds both the logging information added for the `LOGGING` value
      and the trace information added for the `TRACING` value.
    * `OFF`: Does not automatically add logging information or trace information to the event table.

Default:
:   `OFF`

For more information about using this parameter, see [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md),
[Automatically add log messages about blocks and child jobs](../developer-guide/logging-tracing/logging-snowflake-scripting.md),
and [Automatically emit trace events for child jobs and exceptions](../developer-guide/logging-tracing/tracing-snowflake-scripting.md).

## AUTOCOMMIT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether autocommit is enabled for the session. Autocommit determines whether a DML statement, when executed without an active transaction, is automatically committed after the
    statement successfully completes. For more information, see [Transactions](transactions.md).

    > **Note:**
    >
    > Setting this parameter to `FALSE` stops usage data from being saved to the ORGANIZATION_USAGE schema of an
    > [organization account](../user-guide/organization-accounts.md).

Values:
:   `TRUE`: Autocommit is enabled.

    `FALSE`: Autocommit is disabled, meaning DML statements must be explicitly committed or rolled back.

Default:
:   `TRUE`

> **Note:**
>
> The `FALSE` value isn’t supported for [tasks](sql/create-task.md).

## AUTOCOMMIT_API_SUPPORTED (view-only)

Type:
:   N/A

Data Type:
:   Boolean

Description:
:   For Snowflake internal use only. View-only parameter that indicates whether API support for autocommit is enabled for your account. If the value is `TRUE`, you can enable or disable
    autocommit through the APIs for the following drivers/connectors:

    * [JDBC driver](../developer-guide/jdbc/jdbc.md)
    * [ODBC driver](../developer-guide/odbc/odbc.md)
    * [Snowflake Connector for Python](../developer-guide/python-connector/python-connector.md)

## BASE_LOCATION_PREFIX

Type:
:   Object (for databases and schemas) — Can be set for Account » Database » Schema

Data Type:
:   String

Description:
:   Specifies a prefix for Snowflake to use in the write path for Snowflake-managed Apache Iceberg™ tables.
    For more information, see [data and metadata directories for Iceberg tables](../user-guide/tables-iceberg-storage.md).

Values:
:   Any valid string prefix that complies with the storage naming conventions of your cloud provider.

Default:
:   None

## BINARY_INPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   The format of VARCHAR values passed as input to VARCHAR-to-BINARY conversion functions. For more information, see
    [Binary input and output](binary-input-output.md).

Values:
:   `HEX` , `BASE64` , or `UTF8` / `UTF-8`

Default:
:   `HEX`

## BINARY_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   The format for VARCHAR values returned as output by BINARY-to-VARCHAR conversion functions. For more information, see
    [Binary input and output](binary-input-output.md).

Values:
:   `HEX` or `BASE64`

Default:
:   `HEX`

## CATALOG

Type:
:   Object (for databases, schemas, and Apache Iceberg™ tables) — Can be set for Account » Database » Schema » Iceberg table

Data Type:
:   String

Description:
:   Specifies the catalog for Apache Iceberg™ tables.
    For more information, see the [Iceberg table documentation](../user-guide/tables-iceberg.md).

Values:
:   `SNOWFLAKE` or any valid [catalog integration](../user-guide/tables-iceberg.md) identifier.

Default:
:   None

## CATALOG_SYNC

Type:
:   Object (for databases, schemas, and Iceberg tables) — Can be set for Account » Database » Schema » Iceberg Table

Data Type:
:   String

Description:
:   Specifies the name of your catalog integration for [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).
    Snowflake syncs tables that use the specified catalog integration with your Snowflake Open Catalog account. For more information, see [Sync a Snowflake-managed table with Snowflake Open Catalog](../user-guide/tables-iceberg-open-catalog-sync.md).

Values:
:   The name of any existing catalog integration for Open Catalog.

Default:
:   None

## CLIENT_ENABLE_LOG_INFO_STATEMENT_PARAMETERS

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Clients:
:   JDBC

Description:
:   Enables users to log the data values bound to
    [PreparedStatements](../developer-guide/jdbc/jdbc-api.md).

    To see the values, you must not only set this session-level parameter to `TRUE`, but also set the
    connection parameter named `TRACING` to either `INFO` or `ALL`.

    * Set `TRACING` to `ALL` to see all debugging information and all binding information.
    * Set `TRACING` to `INFO` to see the binding parameter values and less other debug information.

    > **Caution:**
    >
    > If you bind confidential information, such as medical diagnoses or passwords, that information is
    > logged. Snowflake recommends making sure that the log file is secure, or only using test data, when you set
    > this parameter to `TRUE`.

Values:
:   `TRUE` or `FALSE`.

Default:
:   `FALSE`

## CLIENT_ENCRYPTION_KEY_SIZE

Type:
:   Account — Can be set only for Account

Data Type:
:   Integer

Clients:
:   Any

Description:
:   Specifies the AES encryption key size, in bits, used by Snowflake to encrypt/decrypt files stored on internal stages (for loading/unloading data) when you use the `SNOWFLAKE_FULL` encryption type.

Values:
:   `128` or `256`

Default:
:   `128`

> **Note:**
>
> * This parameter is not used for encrypting/decrypting files stored in external stages (that is, S3 buckets or Azure containers). Encryption/decryption of these files is accomplished using an external
>   encryption key explicitly specified in the COPY command or in the named external stage referenced in the command.
> * If you are using the JDBC driver and you wish to set this parameter to 256 (for strong encryption), additional JCE policy files must be installed on each client machine from which
>   data is loaded/unloaded. For more information about installing the required files, see [Java requirements for the JDBC Driver](../developer-guide/jdbc/java-install.md).
> * If you are using the Python connector (or SnowSQL) and you wish to set this parameter to 256 (for strong encryption), no additional installation or configuration tasks are required.

## CLIENT_MEMORY_LIMIT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Clients:
:   JDBC, ODBC

Description:
:   Parameter that specifies the maximum amount of memory the JDBC driver or ODBC driver should use for the result set from queries (in MB).

    For the JDBC driver:

    * To simplify JVM memory management, the parameter sets a global maximum memory usage limit for all queries.
    * CLIENT_RESULT_CHUNK_SIZE specifies the maximum size of each set (or *chunk*) of query results to download (in MB).
      The driver might require additional memory to process a chunk; if so, it will adjust memory usage during runtime to process
      at least one thread/query. Verify that CLIENT_MEMORY_LIMIT is set significantly higher than CLIENT_RESULT_CHUNK_SIZE to
      ensure sufficient memory is available.

    For the ODBC driver:

    * This parameter is supported in version 2.22.0 and higher.
    * `CLIENT_RESULT_CHUNK_SIZE` is not supported.

> **Note:**
>
> * The driver will attempt to honor the parameter value, but will cap usage at 80% of your system memory.
> * The memory usage limit set in this parameter does not apply to any other JDBC or ODBC driver operations
>   (e.g. connecting to the database, preparing a query, or PUT and GET statements).

Values:
:   Any valid number of megabytes.

Default:
:   `1536` (effectively 1.5 GB)

    Most users should not need to set this parameter. If this parameter is not set by the user, the driver starts
    with the default specified above.

    In addition, the JDBC driver actively manages its memory conservatively to avoid using up all available memory.

## CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Clients:
:   JDBC, ODBC

Description:
:   For specific ODBC functions and JDBC methods, this parameter can change the default search scope from all
    databases/schemas to the current database/schema. The narrower search typically returns fewer rows and executes
    more quickly.

    For example, the `getTables()` JDBC method accepts a database name and schema name as arguments, and returns the
    names of the tables in the database and schema. If the database and schema arguments are `null`, then by default, the
    method searches all databases and all schemas in the account. Setting CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX to
    `TRUE` narrows the search to the current database and schema specified by the
    connection context.

    In essence, setting this parameter to `TRUE` creates the following precedence for database and schema:

    > 1. Values passed as arguments to the functions/methods.
    > 2. Values specified in the connection context (if any).
    > 3. Default (all databases and all schemas).

    For more details, see the information below.

    This parameter applies to the following:

    * JDBC driver methods (for the `DatabaseMetaData` class):

      + `getColumns`
      + `getCrossReference`
      + `getExportedKeys`
      + `getForeignKeys`
      + `getFunctions`
      + `getImportedKeys`
      + `getPrimaryKeys`
      + `getSchemas`
      + `getTables`
    * ODBC driver functions:

      + `SQLTables`
      + `SQLColumns`
      + `SQLPrimaryKeys`
      + `SQLForeignKeys`
      + `SQLGetFunctions`
      + `SQLProcedures`

Values:
:   `TRUE`: If the database and schema arguments are `null`, then the driver retrieves metadata for only
    the database and schema specified by the connection context.

    The interaction is described in more detail in the table below.

    `FALSE`: If the database and schema arguments are `null`, then the driver retrieves
    metadata for all databases and schemas in the account.

Default:
:   `FALSE`

Additional Notes:
:   The *connection context* refers to the current database and schema for the session, which can be set using
    any of the following options:

    1. Specify the default namespace for the user who connects to Snowflake (and initiates the session). This can be
       set for the user through the [CREATE USER](sql/create-user.md) or [ALTER USER](sql/alter-user.md)
       command, but must be set before the user connects.
    2. Specify the database and schema when connecting to Snowflake through the driver.
    3. Issue a [USE DATABASE](sql/use-database.md) or [USE SCHEMA](sql/use-schema.md) command within the session.

    If the database or schema was specified by more than one of these, then the most recent one applies.

    When CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX is set to `TRUE`:

    | database argument | schema argument | Database used | Schema used |
    | --- | --- | --- | --- |
    | Non-null | Non-null | Argument | Argument |
    | Non-null | Null | Argument | All schemas |
    | Null | Non-null | Connection context | Argument |
    | Null | Null | Connection context | Session context |

> **Note:**
>
> For the JDBC driver, this behavior applies to version 3.6.27 (and higher).
> For the ODBC driver, this behavior applies to version 2.12.96 (and higher).

If you want to search only the connection context database, but want to search all schemas within that database,
see CLIENT_METADATA_USE_SESSION_DATABASE.

## CLIENT_METADATA_USE_SESSION_DATABASE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Clients:
:   JDBC

Description:
:   This parameter applies to only the methods affected by CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX.

    This parameter applies only when both of the following conditions are met:

    * CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX is `FALSE` or unset.
    * No database or schema is passed to the relevant ODBC function or JDBC method.

    For specific ODBC functions and JDBC methods, this parameter can change the default search scope from all
    databases to the current database. The narrower search typically returns fewer rows and executes
    more quickly.

    For more details, see the information below.

Values:
:   `TRUE`:

    > The driver searches all schemas in the connection context’s database. (For more details about the
    > connection context, see the documentation for
    > CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX.)

    `FALSE`:

    > The driver searches all schemas in all databases.

Default:
:   `FALSE`

Additional Notes:

When the database is `null` and the schema is `null` and CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX is FALSE:

> | CLIENT_METADATA_USE_SESSION_DATABASE | Behavior |
> | --- | --- |
> | FALSE | All schemas in all databases are searched. |
> | TRUE | All schemas in the current database are searched. |

## CLIENT_PREFETCH_THREADS

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Clients:
:   JDBC, ODBC, Python, .NET

Description:
:   Parameter that specifies the number of threads used by the client to pre-fetch large result sets. The driver will attempt to honor the parameter value, but defines the
    minimum and maximum values (depending on your system’s resources) to improve performance.

Values:
:   `1` to `10`

Default:
:   `4`

    Most users should not need to set this parameter. If this parameter is not set by the user, the driver starts
    with the default specified above, but also actively manages its thread count conservatively to avoid using up all
    available memory.

## CLIENT_RESULT_CHUNK_SIZE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Clients:
:   JDBC, Node.js, SQL API, Go

Description:
:   Parameter that specifies the maximum size of each set (or *chunk*) of query results to download (in MB). The JDBC driver downloads query results in chunks.

    Also see CLIENT_MEMORY_LIMIT.

Values:
:   `16` to `160`

Default:
:   `160`

    Most users should not need to set this parameter. If this parameter is not set by the user, the driver starts
    with the default specified above, but also actively manages its memory conservatively to avoid using up all
    available memory.

## CLIENT_RESULT_COLUMN_CASE_INSENSITIVE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Clients:
:   JDBC

Description:
:   Parameter that indicates whether to match column name case-insensitively in `ResultSet.get*` methods in JDBC.

Values:
:   `TRUE`: matches column names case-insensitively.

    `FALSE`: matches column names case-sensitively.

Default:
:   `FALSE`

## CLIENT_SESSION_KEEP_ALIVE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Clients:
:   .NET, Golang, JDBC, Node.js, ODBC, Python,

Description:
:   Parameter that indicates whether to force a user to log in again after a period of inactivity in the session.

Values:
:   `TRUE`: Snowflake keeps the session active indefinitely as long as the connection is active, even if there is no activity from the user.

    `FALSE`: The user must log in again after four hours of inactivity.

Default:
:   `FALSE`

> **Note:**
>
> Currently, the parameter only takes effect while initiating the session. You can modify the parameter value
> within the session level by executing an ALTER SESSION command, but it does not affect the session
> keep-alive functionality, such as extending the session. For information about setting the parameter at
> the session level, see the client documentation:
>
> * [.NET](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/Connecting.md)
> * [Golang](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#section-readme)
> * [JDBC](../developer-guide/jdbc/jdbc-configure.md)
> * [Node.js](../developer-guide/node-js/nodejs-driver-connect.md)
> * [ODBC](../developer-guide/odbc/odbc-parameters.md)
> * [Python](../developer-guide/python-connector/python-connector-api.md)

## CLIENT_SESSION_KEEP_ALIVE_HEARTBEAT_FREQUENCY

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Clients:
:   SnowSQL, JDBC, Python, Node.js

Description:
:   Number of seconds in-between client attempts to update the token for the session.

Values:
:   `900` to `3600`

Default:
:   `3600`

## CLIENT_TIMESTAMP_TYPE_MAPPING

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Clients:
:   Any

Description:
:   Specifies the [TIMESTAMP_\* variation](data-types-datetime.md) to use when binding timestamp variables for JDBC or ODBC applications that use the bind API to load data.

Values:
:   `TIMESTAMP_LTZ` or `TIMESTAMP_NTZ`

Default:
:   `TIMESTAMP_LTZ`

## CORTEX_MODELS_ALLOWLIST

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the models that users in the account can access. Use this parameter to allowlist models for all users in the account. If you need to provide specific users with access beyond what you’ve specified in the allowlist, use role-based access control instead. For more information, see [Account-level allowlist parameter](../user-guide/snowflake-cortex/aisql.md).

When users make a request, Snowflake Cortex evaluates the parameter to determine whether the user can access the model.

Values:
:   * `'All'`: Provides access to all models, including fine-tuned models.

      Example:

      ```sqlexample
      ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'All';
      ```
    * `'model1,model2,...'`: Provides access to the models specified in a comma-separated list.

      Example:

      ```sqlexample
      ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'mistral-large2,llama3.1-70b';
      ```
    * `'None'`: Prevents access to any model.

      Example:

      ```sqlexample
      ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST = 'None';
      ```

Default:
:   `'All'`

## CORTEX_ENABLED_CROSS_REGION

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the regions where an inference request may be processed in case the request cannot be processed in the region
    where request is originally placed. Specifying `DISABLED` disables cross-region inferencing. For examples and details,
    see [Cross-region inference](../user-guide/snowflake-cortex/cross-region-inference.md).

Values:
:   This parameter can be set to one of the following:

    * `DISABLED`
    * `ANY_REGION`
    * Comma-separated list including one or more of the following values:

      + `AWS_APJ`
      + `AWS_AU`
      + `AWS_EU`
      + `AWS_US`
      + `AWS_GLOBAL`
      + `AZURE_EU`
      + `AZURE_US`
      + `AZURE_GLOBAL`
      + `GCP_US`
      + `GCP_GLOBAL`

    Explanation of each parameter value

    | Value | Behavior |
    | --- | --- |
    | `DISABLED` | Inference requests will be handled in:   * The region where the request is placed. |
    | `ANY_REGION` | Inference requests may be routed to:   * Any region that supports cross-region inference (listed in this table) and that has availability, including the region where the request is placed. |
    | `AWS_APJ` | Inference requests will be handled in the region where the request is placed and in the following AWS regions   * AWS Asia Pacific (Tokyo) ap-northeast-1 * AWS Asia Pacific (Seoul) ap-northeast-2 * AWS Asia Pacific (Osaka) ap-northeast-3 * AWS Asia Pacific (Mumbai) ap-south-1 * AWS Asia Pacific (Hyderabad) ap-south-2 * AWS Asia Pacific (Singapore) ap-southeast-1 * AWS Asia Pacific (Sydney) ap-southeast-2 * AWS Asia Pacific (Melbourne) ap-southeast-4 |
    | `AWS_AU` | Inference requests will be handled in the region where the request is placed and in the following AWS regions   * AWS Asia Pacific (Sydney) ap-southeast-2 * AWS Asia Pacific (Melbourne) ap-southeast-4 |
    | `AWS_EU` | Inference requests will be handled in the region where the request is placed and in the following AWS regions, which are (and will be) located within the European Union:   * AWS Europe (Frankfurt) eu-central-1 * AWS Europe (Stockholm) eu-north-1 * AWS Europe (Milan) eu-south-1 * AWS Europe (Spain) eu-south-2 * AWS Europe (Ireland) eu-west-1 * AWS Europe (Paris) eu-west-3 |
    | `AWS_US` | Inference requests will be handled in the region where the request is placed and in the following AWS regions, which are (and will be) located within the United States:   * AWS US East (N. Virginia) us-east-1 * AWS US East (Ohio) us-east-2 * AWS US West (Oregon) us-west-2 |
    | `AWS_Global` | Inference requests will be handled in the region where the request is placed and in any AWS commercial region. |
    | `AZURE_EU` | Inference requests will be handled in the region where the request is placed and in the following Azure regions, which are (and will be) located within the European Union:   * Azure Europe (Netherlands) westeurope * Azure Europe (France) francecentral * Azure Europe (Germany) germanywestcentral * Azure Europe (Italy) italynorth * Azure Europe (Poland) polandcentral * Azure Europe (Spain) spaincentral * Azure Europe (Sweden) swedencentral |
    | `AZURE_US` | Inference requests will be handled in the region where the request is placed and in the following Azure regions, which are (and will be) located within the United States:   * Azure US (Virginia) eastus2 * Azure US (Virginia) eastus * Azure US (California) westus * Azure US (Phoenix) westus3 * Azure US (Illinois) northcentralus * Azure US (Texas) southcentralus |
    | `AZURE_Global` | Inference requests will be handled in the region where the request is placed and in any Azure commercial region. |
    | `GCP_US` | Inference requests will be handled in the region where the request is placed and in the following GCP regions, which are (and will be) located within the United States:   * GCP US (Iowa) us-central1 * GCP US (Oregon) us-west1 * GCP US (Las Vegas) us-west4 * GCP US (N. Virginia) us-east4 |
    | `GCP_Global` | Inference requests will be handled in the region where the request is placed and in any GCP commercial region. |

Default:
:   The default value depends on when and where the account was created:

    * `ANY_REGION` for new accounts in new organizations within commercial regions created after March 9, 2026.
    * `DISABLED` for all other accounts, including government regions.

## CSV_TIMESTAMP_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the format for TIMESTAMP values in CSV files downloaded from Snowsight.

    If this parameter is not set, TIMESTAMP_LTZ_OUTPUT_FORMAT will be used for TIMESTAMP_LTZ values, TIMESTAMP_TZ_OUTPUT_FORMAT will be used for TIMESTAMP_TZ and TIMESTAMP_NTZ_OUTPUT_FORMAT for TIMESTAMP_NTZ values.

    For more information, see [Date and time input and output formats](date-time-input-output.md) or [Download your query results](../user-guide/ui-snowsight-query.md).

Values:
:   Any valid, supported timestamp format.

Default:
:   No value.

## DATA_METRIC_SCHEDULE

Type:
:   Object (for tables)

Data type:
:   String

Description:
:   Specifies the schedule to run the data metric functions associated to the table.

Values:
:   The schedule can be based on a defined number of minutes, a cron expression, or a DML event on the table that does not involve
    reclustering. For details, see:

    * [Data metric function actions (dataMetricFunctionAction)](sql/alter-table.md).
    * [Adjust the schedule for DMFs](../user-guide/data-quality-working.md).

Default:
:   `60 MINUTE`

## DATA_RETENTION_TIME_IN_DAYS

Type:
:   Object (for databases, schemas, and tables) — Can be set for Account » Database » Schema » Table

Data Type:
:   Integer

Description:
:   Number of days for which Snowflake retains historical data for performing Time Travel actions (SELECT, CLONE, UNDROP) on the object. A value of `0` effectively disables
    Time Travel for the specified database, schema, or table. For more information, see [Understanding & using Time Travel](../user-guide/data-time-travel.md).

Values:
:   `0` or `1` (for [Standard Edition](../user-guide/intro-editions.md))

    `0` to `90` (for [Enterprise Edition or higher](../user-guide/intro-editions.md))

Default:
:   `1`

## DATE_INPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the input format for the DATE data type. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported date format or `AUTO`

    (`AUTO` specifies that Snowflake attempts to automatically detect the format of dates stored in the system during the session)

Default:
:   `AUTO`

## DATE_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the DATE data type. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported date format

Default:
:   `YYYY-MM-DD`

## DEFAULT_DBT_VERSION

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the default version for all future dbt project objects created in an account. Setting this value on the account enables organization administrators to opt-in to newer versions (for example, changing the default to `1.10.15`) without requiring users to manually update CREATE DBT PROJECT DDL statements for every individual project. For more information, see [Versions for dbt project objects and files](../user-guide/data-engineering/dbt-projects-on-snowflake-versions.md).

Values:
:   `1.9.4`, or `1.10.15`

Default:
:   `1.9.4`

## DEFAULT_DDL_COLLATION

Type:
:   Object (for databases, schemas, and tables) — Can be set for Account » Database » Schema » Table

Data Type:
:   String

Description:
:   Sets the default collation used for the following DDL operations:

    * [CREATE TABLE](sql/create-table.md)
    * [ALTER TABLE](sql/alter-table.md) … ADD COLUMN

    Setting this parameter forces all subsequently created columns in the affected objects (table, schema, database, or account) to have
    the specified collation as the default, unless the collation for the column is explicitly defined in the DDL.

    For example, if `DEFAULT_DDL_COLLATION = 'en-ci'`, then the following two statements are equivalent:

    ```sqlexample
    CREATE TABLE test(c1 INTEGER, c2 STRING, c3 STRING COLLATE 'en-cs');

    CREATE TABLE test(c1 INTEGER, c2 STRING COLLATE 'en-ci', c3 STRING COLLATE 'en-cs');
    ```

    > **Note:**
    >
    > This parameter isn’t supported for [dynamic tables](../user-guide/dynamic-tables-about.md) and [Apache Iceberg™ tables](../user-guide/tables-iceberg.md).
    > This parameter isn’t supported on indexed columns for hybrid tables.

Values:
:   Any valid, supported [collation specification](collation.md).

Default:
:   Empty string

> **Note:**
>
> To set the default collation for the account, use the following command:
>
> * [ALTER ACCOUNT](sql/alter-account.md)
>
> The default collation for table columns can be set at the table, schema, or database level during creation or any time afterwards:
>
> * [CREATE TABLE](sql/create-table.md) or [ALTER TABLE](sql/alter-table.md)
> * [CREATE SCHEMA](sql/create-schema.md) or [ALTER SCHEMA](sql/alter-schema.md)
> * [CREATE DATABASE](sql/create-database.md) or [ALTER DATABASE](sql/alter-database.md)

## DEFAULT_NOTEBOOK_COMPUTE_POOL_CPU

Type:
:   Object (for databases and schemas) — Can be set for Account » Database » Schema

Data Type:
:   String

Description:
:   Sets the preferred CPU compute pool used for [Notebooks on CPU Container Runtime](../developer-guide/snowflake-ml/notebooks-on-spcs.md).

Values:
:   Name of a compute pool in your account.

Default:
:   SYSTEM_COMPUTE_POOL_CPU (see [System compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md)).

## DEFAULT_NOTEBOOK_COMPUTE_POOL_GPU

Type:
:   Object (for databases and schemas) — Can be set for Account » Database » Schema

Data Type:
:   String

Description:
:   Sets the preferred GPU compute pool used for [Notebooks on GPU Container Runtime](../developer-guide/snowflake-ml/notebooks-on-spcs.md).

Values:
:   Name of a compute pool in your account.

Default:
:   SYSTEM_COMPUTE_POOL_GPU (see [System compute pools](../developer-guide/snowpark-container-services/working-with-compute-pool.md)).

## DEFAULT_NULL_ORDERING

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the default ordering of NULL values in a result set.

The ordering of NULL values in rows depend on the [ORDER BY](constructs/order-by.md) clause:

* When the sort order is ASC (the default) and this parameter is set to `LAST` (the default), NULL
  values are returned last. Therefore, unless specified otherwise, NULL values are considered to be higher than
  any non-NULL values.
* When the sort order is ASC and this parameter is set to `FIRST`, NULL values are returned first.
* When the sort order is DESC and this parameter is set to `FIRST`, NULL values are returned last.
* When the sort order is DESC and this parameter is set to `LAST`, NULL values are returned first.

If a NULL ordering is specified in the ORDER BY clause with NULLS FIRST or NULLS LAST, then the
specified ordering takes precedence over any value of DEFAULT_NULL_ORDERING.

Values:
:   `FIRST`: NULL values are lower than any non-NULL values.

    `LAST`: NULL values are higher than any non-NULL values.

Default:
:   `LAST`

## DEFAULT_STREAMLIT_COMPUTE_POOL

Type:
:   Account — Can only be set for Account

Data Type:
:   String

Description:
:   Specifies the default compute pool to use for container-runtime
    [Streamlit apps](../developer-guide/streamlit/getting-started/overview.md).

    When you run CREATE STREAMLIT, if you specify a container runtime in the RUNTIME_NAME property and don’t
    specify the COMPUTE_POOL property, Snowflake uses the compute pool specified the DEFAULT_STREAMLIT_COMPUTE_POOL
    parameter. This default compute pool is resolved at creation time. Updating DEFAULT_STREAMLIT_COMPUTE_POOL won’t
    update the COMPUTE_POOL property on existing Streamlit apps. For more information, see
    [Configuring your own preferred compute pools for Streamlit apps](../developer-guide/snowpark-container-services/working-with-compute-pool.md).

Values:
:   Name of a compute pool in your account.

Default:
:   SYSTEM_COMPUTE_POOL_CPU

## DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE

Type:
:   Object (for databases and schemas) — Can be set for Account » Database » Schema

Data Type:
:   String

Description:
:   Specifies the name of the default warehouse to use when creating a notebook.

    For more information, see [ALTER ACCOUNT](sql/alter-account.md), [ALTER DATABASE](sql/alter-database.md), and [ALTER SCHEMA](sql/alter-schema.md).

Values:
:   The name of any existing warehouse.

Default:
:   `SYSTEM$STREAMLIT_NOTEBOOK_WH`

## DISABLE_UI_DOWNLOAD_BUTTON

Type:
:   Object (for users) — Can be set for Account > User

Data Type:
:   Boolean

Description:
:   Controls whether users in an account see a button to download data in Snowsight, such as a table
    returned from running a query in a worksheet.

    If the button to download is hidden in Snowsight, users can still download or export data using
    [third-party software](../user-guide/ecosystem.md).

Values:
:   `TRUE`: Users in the account don’t see a button to download data in Snowsight.

    `FALSE`: Users in the account see a button to download data in Snowsight.

Default:
:   `FALSE`

## DISABLE_USER_PRIVILEGE_GRANTS

Type:
:   Object (for users) — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Controls whether users in an account can grant privileges directly to other users.

    Disabling user privilege grants (that is, setting DISABLE_USER_PRIVILEGE_GRANTS to `TRUE`) doesn’t affect existing grants to users.
    Existing grants to users continue to confer privileges to those users. For more information, see [GRANT <privileges> … TO USER](sql/grant-privilege-user.md).

Values:
:   `TRUE`: Users in the account cannot grant privileges to another user.

    `FALSE`: Users in the account can grant privileges to another user.

Default:
:   `FALSE`

## DISALLOWED_SPCS_WORKLOAD_TYPES

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Specifies the workload types that are disallowed in your account to deploy to Snowpark Container Services. Also see ALLOWED_SPCS_WORKLOAD_TYPES.

Values:
:   The value is a comma-separated list of the following supported workload types:

    * `USER`: Any workloads directly deployed by users.
    * `NOTEBOOK`: Snowflake Notebooks.
    * `STREAMLIT`: Streamlit in Snowflake.
    * `MODEL_SERVING`: ML Model Serving.
    * `ML_JOB`: Snowflake ML Jobs.
    * `ALL`: All workloads.

Default:
:   Empty string

> **Note:**
>
> If you configure both DISALLOWED_SPCS_WORKLOAD_TYPES and ALLOWED_SPCS_WORKLOAD_TYPES parameters, Snowflake first applies DISALLOWED_SPCS_WORKLOAD_TYPES. For example, if you configure both these parameters and specify the `NOTEBOOK` workload, `NOTEBOOK` workloads are not allowed to run on Snowpark Container Services.

## ENABLE_AUTOMATIC_SENSITIVE_DATA_CLASSIFICATION_LOG

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Controls whether events from [sensitive data classification](../user-guide/classify-auto.md) are logged in the user event table.

Values:
:   `TRUE`: Snowflake logs events for sensitive data classification in the user event table.

    `FALSE`: Events for sensitive data classification are not logged.

Default:
:   `TRUE`

## ENABLE_BUDGET_EVENT_LOGGING

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Controls whether telemetry data is collected for [budgets](../user-guide/budgets.md).

Values:
:   `TRUE`: Snowflake logs telemetry data that is related to budgets to an event table.

    `FALSE`: Snowflake doesn’t log telemetry data that is related to budgets.

Default:
:   `TRUE`

## ENABLE_DATA_COMPACTION

Type:
:   Object (for databases, schemas, and Iceberg tables) — Can be set for Account » Database » Schema » Iceberg Table

Data Type:
:   Boolean

Description:
:   Specifies whether Snowflake should enable data compaction on Snowflake-managed [Apache Iceberg™ tables](../user-guide/tables-iceberg.md).

Values:
:   `TRUE`: Snowflake performs data compaction on the tables.

    `FALSE`: Snowflake doesn’t perform data compaction on the tables.

Default:
:   `TRUE`

## ENABLE_EGRESS_COST_OPTIMIZER

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Enables or disables the Listing Cross-cloud auto-fulfillment Egress cost optimizer.

Values:
:   `TRUE`: Enable the Egress cost optimizer.

    `FALSE`: Disable the Egress cost optimizer.

Default:
:   `FALSE`

For more information see [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md).

## ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether the output returned by the [GET_DDL](functions/get_ddl.md) function contains data type synonyms specified in the original DDL statement. Data type synonyms are also called *data type aliases*.

Values:
:   `TRUE`: Show the data type aliases specified in the original DDL statement.

    `FALSE`: Replace the data type aliases specified in the original DDL statement with standard
    Snowflake data type names.

You can set this parameter to TRUE to generate DDL statements using the GET_DDL function that specify
data type aliases as defined in the original SQL statements, which might be required to preserve data
model integrity during migrations.

The following are examples of data type aliases:

* CHAR is an alias for the [VARCHAR](data-types-text.md) data type.
* BIGINT is an alias for the [NUMBER](data-types-numeric.md) data type.
* DATETIME is an alias for the [TIMESTAMP_NTZ](data-types-datetime.md) data type.

The following statement creates a table using the aliases for the data types:

```sqlexample
CREATE TABLE test_get_ddl_aliases(x CHAR, y BIGINT, z DATETIME);
```

When this parameter is set to FALSE, the GET_DDL function returns the following output:

```sqlexample
ALTER SESSION SET ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS = FALSE;

SELECT GET_DDL('TABLE', 'test_get_ddl_aliases');
```

```output
+------------------------------------------------+
| GET_DDL('TABLE', 'TEST_GET_DDL_ALIASES')       |
|------------------------------------------------|
| create or replace TABLE TEST_GET_DDL_ALIASES ( |
|     X VARCHAR(1),                              |
|     Y NUMBER(38,0),                            |
|     Z TIMESTAMP_NTZ(9)                         |
| );                                             |
+------------------------------------------------+
```

When this parameter is set to TRUE, the GET_DDL function returns the following output:

```sqlexample
ALTER SESSION SET ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS = TRUE;

SELECT GET_DDL('TABLE', 'test_get_ddl_aliases');
```

```output
+------------------------------------------------+
| GET_DDL('TABLE', 'TEST_GET_DDL_ALIASES')       |
|------------------------------------------------|
| create or replace TABLE TEST_GET_DDL_ALIASES ( |
|     X CHAR,                                    |
|     Y BIGINT,                                  |
|     Z DATETIME                                 |
| );                                             |
+------------------------------------------------+
```

Default:
:   `FALSE`

## ENABLE_ICEBERG_MERGE_ON_READ

Type:
:   Object (for databases, schemas, and Apache Iceberg™ tables) — Can be set for Account » Database » Schema » Iceberg table

Data Type:
:   Boolean

Description:
:   Specifies whether to enable merge-on-read behavior for Snowflake-managed [Apache Iceberg™ tables](../user-guide/tables-iceberg.md).
    For more information, see [Use row-level deletes](../user-guide/tables-iceberg-manage.md).

Values:
:   `TRUE`: Enable merge-on-read behavior:

    * If you use the Iceberg v2 format with Iceberg tables, enables using row-level deletes through positional delete files.
    * If you use the Iceberg v3 format with Iceberg tables, enables using row-level deletes through deletion vectors.

    For more information about merge-on-read and copy-on-write behavior, see [Use row-level deletes](../user-guide/tables-iceberg-manage.md).

    > **Note:**
    >
    > To specify the Iceberg version for tables, use the ICEBERG_VERSION_DEFAULT parameter or ICEBERG_VERSION parameter.

    `FALSE`: Enables copy-on-write behavior for DML operations.

Default:
:   `TRUE`

## ENABLE_IDENTIFIER_FIRST_LOGIN

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Determines the login flow for users. When enabled, Snowflake prompts users for their username or email address before presenting
    authentication methods. For details, see [Identifier-first login](../user-guide/identifier-first-login.md).

Values:
:   `TRUE`: Snowflake uses an identifier-first login flow to authenticate users.

    `FALSE`: Snowflake presents all possible login options, even if those options don’t apply to a particular user.

Default:
:   `FALSE`

## ENABLE_INTERNAL_STAGES_PRIVATELINK

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether the [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) function returns the `private-internal-stages` key in the query
    result. The corresponding value in the query result is used during the configuration process for private connectivity to internal stages.
    The value of this parameter also affects the behavior of system functions related to private connectivity. For example, `TRUE` enables
    [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](functions/system_revoke_stage_privatelink_access.md) and `FALSE` turns off [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](functions/system_revoke_stage_privatelink_access.md).

Values:
:   `TRUE`: Returns the `private-internal-stages` key and value in the query result.

    `FALSE`: Doesn’t return the `private-internal-stages` key and value in the query result.

Default:
:   `FALSE`

## ENABLE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether the [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) function returns the
    `privatelink-snowflake-managed-storage-volume-nfs` and `privatelink-snowflake-managed-storage-volume-fs` keys in the query
    result on Azure deployments. The corresponding values in the query result are used during the configuration process for private connectivity to
    Snowflake-managed storage volumes. The value of this parameter also affects the behavior of system functions related to private connectivity. For example, `TRUE` enables
    [SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](functions/system_revoke_snowflake_managed_storage_volume_privatelink_access.md) and `FALSE` turns off [SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](functions/system_revoke_snowflake_managed_storage_volume_privatelink_access.md).

Values:
:   `TRUE`: Returns the `privatelink-snowflake-managed-storage-volume-nfs` and
    `privatelink-snowflake-managed-storage-volume-fs` keys and values in the query result for Azure deployments.

    `FALSE`: Doesn’t return these keys and values in the query result.

Default:
:   `FALSE`

## ENABLE_NOTEBOOK_CREATION_IN_PERSONAL_DB

Type:
:   User — Can be set for Account > User

Data Type:
:   Boolean

Description:
:   Specifies whether users can create private notebooks (stored in their personal databases). When TRUE, users in the account can
    create private notebooks (assuming other necessary privileges are granted).

Values:
:   `TRUE`: Enables users to create private notebooks.

    `FALSE`: Prevents users from creating private notebooks.

Default:
:   `FALSE`

## ENABLE_SPCS_BLOCK_STORAGE_SNOWFLAKE_FULL_ENCRYPTION_ENFORCEMENT

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Enables enforcement of SNOWFLAKE_FULL encryption type for Snowpark Container Services
    [block-storage volumes and snapshots](../developer-guide/snowpark-container-services/block-storage-volume.md).

Values:
:   `TRUE`: Enforces creation of SPCS block-storage volumes and snapshots only with the SNOWFLAKE_FULL
    encryption type. The SNOWFLAKE_SSE encryption type isn’t permitted. All existing block-storage
    volumes and snapshots with the SNOWFLAKE_SSE encryption type must be migrated to SNOWFLAKE_FULL before
    enabling this parameter. Setting the parameter value to TRUE with existing SNOWFLAKE_FULL encrypted
    volumes or snapshots results in an error.

    `FALSE`: Both SNOWFLAKE_SSE and SNOWFLAKE_FULL encryption types are permitted for SPCS
    block-storage volumes and snapshots in the account.

Default:
:   `FALSE`

## ENABLE_TAG_PROPAGATION_EVENT_LOGGING

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Controls whether telemetry data is collected for [automatic tag propagation](../user-guide/object-tagging/propagation.md).

Values:
:   `TRUE`: Snowflake logs telemetry data that is related to tag propagation to an event table.

    `FALSE`: Snowflake doesn’t log telemetry data that is related to tag propagation.

Default:
:   `FALSE`

## ENABLE_TRI_SECRET_AND_REKEY_OPT_OUT_FOR_IMAGE_REPOSITORY

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies the choice for the [image repository](../developer-guide/snowpark-container-services/working-with-registry-repository.md) to opt out of Tri-Secret Secure and [Periodic rekeying](../user-guide/security-encryption-manage.md).

Values:
:   `TRUE`: Opts out Tri-Secret Secure and periodic rekeying for the image repository.

    `FALSE`: Disallows the creation of an image repository for Tri-Secret Secure and periodic rekeying for accounts. Similarly, disallows
    enabling Tri-Secret Secure and periodic rekeying for accounts that have enabled image repository.

Default:
:   `FALSE`

## ENABLE_UNHANDLED_EXCEPTIONS_REPORTING

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether Snowflake may capture – in an event table – log messages or trace event data for unhandled exceptions
    in procedure or UDF handler code. For more information, see [Capturing messages from unhandled exceptions](../developer-guide/logging-tracing/unhandled-exception-messages.md).

Values:
:   `TRUE`: Data about unhandled exceptions is captured as log or trace data if logging and tracing are enabled.

    `FALSE`: Data about unhandled exceptions is not captured.

Default:
:   `TRUE`

## ENABLE_UNLOAD_PHYSICAL_TYPE_OPTIMIZATION

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether to set the schema for unloaded Parquet files based on the logical column data types (that is, the types in the unload SQL query or source table) or on the
    unloaded column values (that is, the smallest data types and precision that support the values in the output columns of the unload SQL statement or source table).

Values:
:   `TRUE`: The schema of unloaded Parquet data files is determined by the column values in the unload SQL query or source table. Snowflake optimizes table columns by setting the smallest precision that accepts all of the values. The unloader follows this pattern when writing values to Parquet files. The data type and precision of an output column are set to the smallest data type and precision that support its values in the unload SQL statement or source table. Accept this setting for better performance and smaller data files.

    `FALSE`: The schema is determined by the logical column data types. Set this value for a consistent output file schema.

Default:
:   `TRUE`

## ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR

Type:
:   User — Can be set for Account » User

Data Type:
:   Boolean

Description:
:   Controls whether query text is redacted if a SQL query fails due to a syntax or parsing error. If `FALSE`, the content of a
    failed query is redacted in the views, pages, and functions that provide a query history.

    Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_QUERY_SYNTAX_ERROR parameter.

    When using the ALTER USER command to set the parameter to `TRUE` for a particular user, modify the user that you want to see the query
    text, not the user who executed the query (if those are different users).

Values:
:   `TRUE`: Disables the redaction of query text for queries that fail due to a syntax or parsing error.

    `FALSE`: Redacts the contents of a query from the views, pages, and functions that provide a query history when a query fails due to a
    syntax or parsing error.

Default:
:   `FALSE`

## ENABLE_UNREDACTED_SECURE_OBJECT_ERROR

Type:
:   User — Can be set for Account » User

Data Type:
:   Boolean

Description:
:   Controls whether error messages related to secure objects are redacted in metadata. For more information,
    see [Secure objects: Redaction of information in error messages](../release-notes/bcr-bundles/un-bundled/bcr-1858.md).

    Only users with a role that is granted or inherits the AUDIT privilege can set the ENABLE_UNREDACTED_SECURE_OBJECT_ERROR parameter.

    When using the ALTER USER command to set the parameter to `TRUE` for a particular user, modify the user that you want to see the
    redacted error messages in metadata, not the user who caused the error.

Values:
:   `TRUE`: Disables the redaction of error messages related to secure objects in metadata.

    `FALSE`: Redacts the contents of error messages related to secure objects in metadata.

Default:
:   `FALSE`

## ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether a network policy that uses network rules can restrict access to AWS internal stages.

    This parameter has no effect on network policies that do not use network rules.

    This account-level parameter affects both account-level and user-level network policies.

    For details about using network policies and network rules to restrict access to AWS internal stages, including the use of this parameter,
    see [Protecting internal stages on AWS](../user-guide/network-policies.md).

Values:
:   `TRUE`: Allows network policies that use network rules to restrict access to AWS internal stages. The network rule must
    also use the appropriate `MODE` and `TYPE` to restrict access to the internal stage.

    `FALSE`: Network policies never restrict access to internal stages.

Default:
:   `FALSE`

## ENFORCE_NETWORK_RULES_FOR_SNOWFLAKE_MANAGED_STORAGE_VOLUME

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether a network policy that uses network rules can restrict access to AWS Snowflake-managed storage volumes.

    This parameter has no effect on network policies that do not use network rules.

    This account-level parameter affects only account-level network policies.

    For details about using network policies and network rules to restrict access to Snowflake-managed storage volumes, see
    [Protecting Snowflake-managed storage volumes on AWS](../user-guide/network-policies.md).

Values:
:   `TRUE`: Allows network policies that use network rules to restrict access to Snowflake-managed storage volumes. The
    network rule must also use the appropriate `MODE` and `TYPE` to restrict access to the volume.

    `FALSE`: Network policies never restrict access to Snowflake-managed storage volumes.

Default:
:   `FALSE`

## ERROR_ON_NONDETERMINISTIC_MERGE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether to return an error when the [MERGE](sql/merge.md) command is used to update or delete a target row that joins multiple source rows and the system cannot
    determine the action to perform on the target row.

Values:
:   `TRUE`: An error is returned that includes values from one of the target rows that caused the error.

    `FALSE`: No error is returned and the merge completes successfully, but the results of the merge are nondeterministic.

Default:
:   `TRUE`

## ERROR_ON_NONDETERMINISTIC_UPDATE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether to return an error when the [UPDATE](sql/update.md) command is used to update a target row that joins multiple source rows and the system cannot determine the
    action to perform on the target row.

Values:
:   `TRUE`: An error is returned that includes values from one of the target rows that caused the error.

    `FALSE`: No error is returned and the update completes, but the results of the update are nondeterministic.

Default:
:   `FALSE`

## EVENT_TABLE

Type:
:   Object — Can be set for Account » Database

Data Type:
:   String

Description:
:   Specifies the name of the event table for logging messages from stored procedures and UDFs contained by the object with which
    the event table is associated.

    Associating an event table with a database is available in [Enterprise Edition or higher](../user-guide/intro-editions.md).

Values:
:   Any existing event table created by executing the [CREATE EVENT TABLE](sql/create-event-table.md) command.

Default:
:   None

## EXTERNAL_OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Determines whether the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles can be used as the primary role when creating a
    Snowflake session based on the access token from the External OAuth authorization server.

Values:
:   `TRUE`: Adds the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles to the `EXTERNAL_OAUTH_BLOCKED_ROLES_LIST` property of the
    External OAuth security integration, which means these roles cannot be used as the primary role when creating a Snowflake session using
    External OAuth authentication.

    `FALSE`: Removes the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN from the list of blocked roles defined by the
    `EXTERNAL_OAUTH_BLOCKED_ROLES_LIST` property of the External OAuth security integration.

Default:
:   `TRUE`

## EXTERNAL_VOLUME

Object (for databases, schemas, and Apache Iceberg™ tables) — Can be set for Account » Database » Schema » Iceberg table

Data Type:
:   String

Description:
:   Specifies the external volume for Apache Iceberg™ tables. For more information,
    see the [Iceberg table documentation](../user-guide/tables-iceberg.md).

Values:
:   Any valid external volume identifier.

Default:
:   None

## GEOGRAPHY_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   Display format for [GEOGRAPHY values](data-types-geospatial.md).

    For EWKT and EWKB, the SRID is always 4326 in the output.
    Refer to the [note on EWKT and EWKB handling](data-types-geospatial.md).

Values:
:   `GeoJSON`, `WKT`, `WKB`, `EWKT`, or `EWKB`

Default:
:   `GeoJSON`

## GEOMETRY_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   Display format for [GEOMETRY values](data-types-geospatial.md).

Values:
:   `GeoJSON`, `WKT`, `WKB`, `EWKT`, or `EWKB`

Default:
:   `GeoJSON`

## HYBRID_TABLE_LOCK_TIMEOUT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Description:
:   Number of seconds to wait while trying to acquire row-level locks on a hybrid table, before timing out and aborting the statement.

Values:
:   `0` to any integer (no limit). A value of `0` disables lock waiting (that is, the statement must acquire the lock
    immediately or abort). This value specifies how long the statement will wait for all of the row-level locks it needs to acquire after each
    execution attempt (1 hour by default). If the statement cannot acquire all of the locks, it can be retried, and the same waiting period is applied.

Default:
:   `3600` (1 hour)

See also LOCK_TIMEOUT.

## ICEBERG_VERSION

Type:
:   Object (for Apache Iceberg™ tables) — Can be set only for Apache Iceberg™ tables

Data Type:
:   Integer

Description:
:   Specifies the version of the Apache Iceberg™ specification that the table conforms to. If you use the ICEBERG_VERSION_DEFAULT
    parameter to specify the default Iceberg version at a higher level, this parameter overrides the default. You can specify an Iceberg
    version for Snowflake-managed Iceberg tables and externally managed Iceberg tables that you create in a catalog-linked database.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    > **Note:**
    >
    > You can set this parameter when creating an Iceberg table using the [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](sql/create-iceberg-table-snowflake.md)
    > or [CREATE ICEBERG TABLE (Iceberg REST catalog)](sql/create-iceberg-table-rest.md) command.
    > You can’t use the ALTER ICEBERG TABLE command to change this configuration for an existing table.

Values:
:   `2`: The table conforms with Iceberg version 2.

    `3`: The table conforms with Iceberg version 3.

Default:
:   `2`

## ICEBERG_VERSION_DEFAULT

Type:
:   Object (for databases and schemas) — Can be set for Account » Database » Schema

Data Type:
:   Integer

Description:
:   Specifies the version of the Apache Iceberg™ specification to conform to when creating new Snowflake-managed Iceberg tables.

    > **Caution:**
    >
    > Before you use other engines to upgrade an Iceberg tables format-version in table properties to v3, ensure that the table isn’t used by
    > engines or applications that don’t yet support v3. Downgrading format versions isn’t supported in the Apache Iceberg specification. Therefore, all
    > readers and writers must support v3. The default version for Iceberg tables in Snowflake is v2, which can be configured to v3 if
    > needed. Using Snowflake to perform in-place version upgrades isn’t supported at this time.

    > **Note:**
    >
    > To set the version for a specific table, set the ICEBERG_VERSION parameter instead. See ICEBERG_VERSION.

Values:
:   `2`: The table conforms with Iceberg version 2.

    `3`: The table conforms with Iceberg version 3.

Default:
:   `2`

## INITIAL_REPLICATION_SIZE_LIMIT_IN_TB

Type:
:   Account — Can be set only for Account

Data Type:
:   Number.

Description:
:   Sets the maximum estimated size limit for the initial replication of a primary database to a secondary database (in TB). Set this parameter on any account that stores a secondary database. This size limit helps prevent accounts from accidentally incurring large database replication charges.

    To remove the size limit, set the value to `0.0`.

    Note that there is currently no default size limit applied to subsequent refreshes of a secondary database.

Values:
:   `0.0` and above with a scale of at least 1 (e.g. `20.5`, `32.25`, `33.333`, etc.).

Default:
:   `10.0`

## JDBC_ENABLE_PUT_GET

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether to allow PUT and GET commands access to local file systems.

Values:
:   `TRUE`: JDBC enables PUT and GET commands.

    `FALSE`: JDBC disables PUT and GET commands.

Default:
:   `TRUE`

## JDBC_TREAT_DECIMAL_AS_INT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies how JDBC processes columns that have a scale of zero (`0`).

Values:
:   `TRUE`: JDBC processes a column whose scale is zero as BIGINT.

    `FALSE`: JDBC processes a column whose scale is zero as DECIMAL.

Default:
:   `TRUE`

## JDBC_TREAT_TIMESTAMP_NTZ_AS_UTC

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies how JDBC processes TIMESTAMP_NTZ values.

    By default, when the JDBC driver fetches a value of type TIMESTAMP_NTZ from Snowflake, it converts the value to
    “wallclock” time using the client JVM timezone.

    Users who want to keep UTC timezone for the conversion can set this parameter to `TRUE`.

    This parameter applies only to the JDBC driver.

Values:
:   `TRUE`: The driver uses UTC to get the TIMESTAMP_NTZ value in “wallclock” time.

    `FALSE`: The driver uses the client JVM’s current timezone to get the TIMESTAMP_NTZ value in “wallclock” time.

Default:
:   `FALSE`

## JDBC_USE_SESSION_TIMEZONE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether the JDBC Driver uses the time zone of the JVM or the time zone of the session (specified by the
    TIMEZONE parameter) for the `getDate()`, `getTime()`, and `getTimestamp()` methods of the
    `ResultSet` class.

Values:
:   `TRUE`: The JDBC Driver uses the time zone of the session.

    `FALSE`: The JDBC Driver uses the time zone of the JVM.

Default:
:   `TRUE`

## JSON_INDENT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Description:
:   Specifies the number of blank spaces to indent each new element in JSON output in the session. Also specifies whether to insert newline characters after each element.

Values:
:   `0` to `16`

    (a value of `0` returns compact output by removing all blank spaces and newline characters from the output)

Default:
:   `2`

> **Note:**
>
> This parameter does not affect JSON unloaded from a table into a file using the [COPY INTO <location>](sql/copy-into-location.md) command. The command always unloads JSON data in the NDJSON format:
>
> * Each record from the table separated by a newline character.
> * Within each record, compact formatting (that is, no spaces or newline characters).

## JS_TREAT_INTEGER_AS_BIGINT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies how the Snowflake Node.js Driver processes numeric columns that have a scale of zero (`0`), for example INTEGER or NUMBER(p, 0).

Values:
:   `TRUE`: JavaScript processes a column whose scale is zero as Bigint.

    `FALSE`: JavaScript processes a column whose scale is zero as Number.

Default:
:   `FALSE`

> **Note:**
>
> By default, Snowflake INTEGER columns (including BIGINT, NUMBER(p, 0), etc.) are converted to JavaScript’s Number
> data type. However, the largest legal Snowflake integer values are larger than the largest legal JavaScript
> Number values. To convert Snowflake INTEGER columns to JavaScript Bigint, which can store larger values than
> JavaScript Number, set the session parameter JS_TREAT_INTEGER_AS_BIGINT.
>
> For examples of how to use this parameter, see [Fetching integer data types as Bigint](../developer-guide/node-js/nodejs-driver-consume.md).

## LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Sets the time interval used to refresh the application package based data products to other regions.

Values:
:   * `num MINUTES`:

      A value between `1` and `11520`. Must include the unit MINUTES.
    * `USING CRON expr time_zone`:

      Specifies a cron expression and time zone for the refresh. Supports a subset of standard cron utility syntax.

      For a list of time zones, see the Wikipedia topic [list of tz database time zones](https://en.wikipedia.org/wiki/List_of_tz_database_time_zones).
      The cron expression consists of the following fields:

      ```output
      # __________ minute (0-59)
      # | ________ hour (0-23)
      # | | ______ day of month (1-31, or L)
      # | | | ____ month (1-12, JAN-DEC)
      # | | | | __ day of week (0-6, SUN-SAT, or L)
      # | | | | |
      # | | | | |
        * * * * *
      ```

      The following special characters are supported:

      `*`
      :   Wildcard. Specifies any occurrence of the field.

      `L`
      :   Stands for “last”. When used in the day-of-week field, it allows you to specify constructs such as “the last Friday” (“5L”) of a
          given month. In the day-of-month field, it specifies the last day of the month.

      `/n`
      :   Indicates the *nth* instance of a given unit of time. Each quanta of time is computed independently. For example, if `4/3` is
          specified in the month field, then the refresh is scheduled for April, July, and October. For example, every three months, starting with the fourth
          month of the year. The same schedule is maintained in subsequent years. That is, the refresh is not scheduled to run in
          January (3 months after the October run).

      > **Note:**
      > + The cron expression currently evaluates against the specified time zone only. Altering the TIMEZONE parameter value
      >   for the account (or setting the value at the user or session level) does not change the time zone for the refresh.
      > + The cron expression defines all valid run times for the refresh. Snowflake attempts to refresh listings based on
      >   this schedule; however, any valid run time is skipped if a previous run has not completed before the next valid run time starts.
      > + When both a specific day of month and day of week are included in the cron expression, then the refresh is scheduled on days
      >   satisfying either the day of month or the day of week. For example, `SCHEDULE = 'USING CRON 0 0 10-20 * TUE,THU UTC'`
      >   schedules a refresh at 0 a.m. on the tenth to twentieth day of any month and also on any Tuesday or Thursday outside of those dates.

Default:
:   None

## LOCK_TIMEOUT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer

Description:
:   Number of seconds to wait while trying to lock a resource, before timing out and aborting the statement.

Values:
:   `0` to any integer (no limit). A value of `0` disables lock waiting (the statement must acquire the lock
    immediately or abort). If multiple resources need to be locked by the statement, the timeout applies separately
    to each lock attempt.

Default:
:   `43200` (12 hours)

See also HYBRID_TABLE_LOCK_TIMEOUT.

## LOG_LEVEL

Type:
:   Session — Can be set for Account » User » Session

    Object (for databases, schemas, DCM projects, stored procedures, UDFs, dynamic tables, Iceberg tables, tasks, services) — Can be set for:

    * Account » Database » Schema » DCM project
    * Account » Database » Schema » Procedure
    * Account » Database » Schema » Function
    * Account » Database » Schema » Dynamic table
    * Account » Database » Schema » Iceberg table (externally managed)
    * Account » Database » Schema » Task
    * Account » Database » Schema » Service

Data Type:
:   String (Constant)

Description:
:   Specifies the severity level of log messages produced through logging APIs that should be ingested and made available in the
    active event table. Log messages at the specified level (and at more severe levels) are ingested. For more information about log levels,
    see [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).

Values:
:   * `TRACE`
    * `DEBUG`
    * `INFO`
    * `WARN`
    * `ERROR`
    * `FATAL`
    * `OFF`

Default:
:   `OFF`

Additional Notes:
:   The following table lists the levels of log messages ingested when you set the `LOG_LEVEL` parameter to a level.

    | LOG_LEVEL Parameter Setting | Levels of Log Messages Ingested |
    | --- | --- |
    | `TRACE` | * `TRACE` * `DEBUG` * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `DEBUG` | * `DEBUG` * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `INFO` | * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `WARN` | * `WARN` * `ERROR` * `FATAL` |
    | `ERROR` | * `ERROR` * `FATAL` |
    | `FATAL` | * `ERROR` (Only for Java UDFs, Java UDTFs, and Java and Scala stored procedures. For more information, see   [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).) * `FATAL` |

    If this parameter is set in both the session and the object (or schema, database, or account), the more verbose value is used.
    See [How Snowflake determines the level in effect](../developer-guide/logging-tracing/telemetry-levels.md).

## LOG_EVENT_LEVEL

Type:
:   Session — Can be set for Account » User » Session

    Object (for databases, schemas, DCM projects, stored procedures, UDFs, dynamic tables, Iceberg tables, tasks, services) — Can be set for:

    * Account » Database » Schema » DCM project
    * Account » Database » Schema » Procedure
    * Account » Database » Schema » Function
    * Account » Database » Schema » Dynamic table
    * Account » Database » Schema » Iceberg table (externally managed)
    * Account » Database » Schema » Task
    * Account » Database » Schema » Service

Data Type:
:   String (Constant)

Description:
:   Specifies the severity level of log events (rows with record type EVENT) that should be ingested and made available in the
    active event table. Log events at the specified level (and at more severe levels) are ingested. For the supported severity values
    and how levels combine, see [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).

Values:
:   * `TRACE`
    * `DEBUG`
    * `INFO`
    * `WARN`
    * `ERROR`
    * `FATAL`
    * `OFF`

Default:
:   `OFF`

Additional Notes:
:   The following table lists the levels of log events ingested when you set the `LOG_EVENT_LEVEL` parameter to a level.

    | LOG_EVENT_LEVEL Parameter Setting | Levels of Log Events Ingested |
    | --- | --- |
    | `TRACE` | * `TRACE` * `DEBUG` * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `DEBUG` | * `DEBUG` * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `INFO` | * `INFO` * `WARN` * `ERROR` * `FATAL` |
    | `WARN` | * `WARN` * `ERROR` * `FATAL` |
    | `ERROR` | * `ERROR` * `FATAL` |
    | `FATAL` | * `ERROR` * `FATAL` |

    If this parameter is set in both the session and the object (or schema, database, or account), the more verbose value is used.
    See [How Snowflake determines the level in effect](../developer-guide/logging-tracing/telemetry-levels.md).

## LOGIN_IDP_REDIRECT (view-only)

Type:
:   Account

Data type:
:   VARCHAR

Description:
:   View-only parameter that contains a JSON object summarizing the values that someone set for the `LOGIN_IDP_REDIRECT`
    account property.

    The JSON object contains a mapping between Snowflake interfaces and
    [SAML security integrations](../user-guide/admin-security-fed-auth-security-integration.md). SAML security integrations are used to
    implement single sign-on (SSO) authentication. If an interface is mapped to a SAML security integration, then users who access the
    interface are redirected to the third-party identity provider (IdP) to authenticate; they never see the Snowflake login screen.

    For more information about setting the `LOGIN_IDP_REDIRECT` account property, see
    [ALTER ACCOUNT](sql/alter-account.md).

## MAX_CONCURRENCY_LEVEL

Type:
:   Object (for warehouses) — Can be set for Account » Warehouse

Data Type:
:   Number

Description:
:   Specifies the concurrency level for SQL statements (that is, queries and DML) executed by a warehouse. When the level is reached, the operation performed depends on whether
    the warehouse is a single-cluster or multi-cluster warehouse:

    * **Single-cluster or multi-cluster (in Maximized mode):** Statements are queued until already-allocated resources are freed or additional resources are provisioned, which can be accomplished by
      increasing the size of the warehouse.
    * **Multi-cluster (in Auto-scale mode):** Additional clusters are started.

    MAX_CONCURRENCY_LEVEL can be used in conjunction with the STATEMENT_QUEUED_TIMEOUT_IN_SECONDS parameter to ensure a warehouse is never backlogged.

    In general, it limits the number of statements
    that can be executed concurrently by a warehouse cluster, but there are exceptions. In the following cases, the actual number of
    statements executed concurrently by a warehouse might be more or less than the specified level:

    * **Smaller, more basic statements:** More statements might execute concurrently because small statements generally execute on a subset of the available compute resources in a warehouse. This means they
      only count as a fraction towards the concurrency level.
    * **Larger, more complex statements:** Fewer statements might execute concurrently.

Default:
:   `8`

> **Tip:**
>
> This value is a default only and can be changed at any time:
>
> * Lowering the concurrency level for a warehouse can limit the number of concurrent queries running in a warehouse.
>   When fewer queries are competing for the warehouse’s resources at a given time, a query can potentially be given more resources, which
>   might result in faster query performance, particularly for a large/complex and multi-statement query.
> * Raising the concurrency level for a warehouse might decrease the compute resources that are available for a statement; however, it does
>   not always limit the total number of concurrent queries that can be executed by the warehouse, nor does it necessarily impact total
>   warehouse performance, which depends on the nature of the queries being executed.
>
> Note that, as described earlier, this parameter impacts multi-cluster warehouses (in Auto-scale mode) because Snowflake automatically
> starts a new cluster within the multi-cluster warehouse to avoid queuing. Thus, lowering the concurrency level for a multi-cluster
> warehouse (in Auto-scale mode) potentially increases the number of active clusters at any time.
>
> Also, remember that Snowflake automatically allocates resources for each statement when it is submitted and the allocated amount is
> dictated by the individual requirements of the statement. Based on this, and through observations of user query patterns over time, we’ve
> selected a default that balances performance and resource usage.
>
> As such, before changing the default, we recommend that you test the change by adjusting the parameter in small increments and
> observing the impact against a representative set of your queries.

## MAX_DATA_EXTENSION_TIME_IN_DAYS

Type:
:   Object (for databases, schemas, and tables) — Can be set for Account » Database » Schema » Table

Data Type:
:   Integer

Description:
:   Maximum number of days Snowflake can extend the data retention period for tables to prevent streams on the tables from becoming stale. By default, if the DATA_RETENTION_TIME_IN_DAYS setting for a source table is less than 14 days, and a stream has not been consumed, Snowflake temporarily extends this period to the stream’s offset, up to a maximum of 14 days, regardless of the [Snowflake Edition](../user-guide/intro-editions.md) for your account. The MAX_DATA_EXTENSION_TIME_IN_DAYS parameter enables you to limit this automatic extension period to control storage costs for data retention or for compliance reasons.

This parameter can be set at the account, database, schema, and table levels. Note that setting the parameter at the account or schema level only affects tables for which the parameter has not already been explicitly set at a lower level (e.g. at the table level by the table owner). A value of `0` effectively disables the automatic extension for the specified database, schema, or table. For more information about streams and staleness, see [Introduction to streams](../user-guide/streams-intro.md).

Values:
:   `0` to `90` (90 days) — a value of `0` disables the automatic extension of the data retention period. To increase the maximum value for tables in your account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Default:
:   `14`

> **Note:**
>
> * This parameter can cause data to be retained longer than the default data retention.
>   Before increasing it, confirm that the new value fits your compliance requirements.
> * Table retention is not extended for streams on shared tables. If you share a table,
>   ensure that you set the table retention time long enough for your data consumer to
>   consume the stream. If a provider shares a table with, for example, 7 days’
>   retention and keeps the 14-day default extension, the stream will be stale after 14
>   days in the provider account and after 7 days in the consumer account.

## METRIC_LEVEL

Type:
:   Session — Can be set for Account » User » Session

    Object (for databases, schemas, stored procedures, and UDFs) — Can be set for Account » Database » Schema » Procedure and Account » Database » Schema » Function

Data Type:
:   String (Constant)

Description:
:   Controls how metrics data is ingested into the event table. For more information about metric levels, see
    [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).

Values:
:   `ALL`: All metrics data will be recorded in the event table.

    `NONE`: No metrics data will be recorded in the event table.

Default:
:   `NONE`

## MULTI_STATEMENT_COUNT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Integer (Constant)

Clients:
:   SQL API, JDBC, .NET, ODBC

Description:
:   Number of statements to execute when using the multi-statement capability.

Values:
:   `0`: Variable number of statements.

    `1`: One statement.

    More than `1`: When MULTI_STATEMENT_COUNT is set as a session parameter, you can specify the exact number of statements to
    execute.

    Negative numbers are not permitted.

Default:
:   `1`

## MIN_DATA_RETENTION_TIME_IN_DAYS

Type:
:   Account — Can be set only for Account

Data Type:
:   Integer

Description:
:   Minimum number of days for which Snowflake retains historical data for performing Time Travel actions (SELECT, CLONE, UNDROP)
    on an object. If a minimum number of days for data retention is set on an account, the data retention period for an object is determined by
    MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

    For more information, see [Understanding & using Time Travel](../user-guide/data-time-travel.md).

Values:
:   `0` or `1` (for [Standard Edition](../user-guide/intro-editions.md))

    `0` to `90` (for [Enterprise Edition or higher](../user-guide/intro-editions.md))

Default:
:   `0`

> **Note:**
>
> * This parameter only applies to permanent tables and does not apply to the following objects:
>
>   + Transient tables
>   + Temporary tables
>   + External tables
>   + Materialized views
>   + Streams
> * This parameter can only be set and unset by account administrators (that is, users with the ACCOUNTADMIN role or other role that is granted
>   the ACCOUNTADMIN role).
> * Setting the minimum data retention time does not alter any existing DATA_RETENTION_TIME_IN_DAYS parameter value set on databases,
>   schemas, or tables. The effective retention time of a database, schema, or table is MAX(DATA_RETENTION_TIME_IN_DAYS,
>   MIN_DATA_RETENTION_TIME_IN_DAYS).

## NETWORK_POLICY

Type:
:   Account — Can be set only for Account (can be set by account administrators and security administrators)

Type:
:   Object (for users) — Can be set for Account » User

Data Type:
:   String

Description:
:   Specifies the network policy to enforce for your account. Network policies enable restricting access to your account based on
    users’ IP address. For more details, see [Controlling network traffic with network policies](../user-guide/network-policies.md).

Values:
:   Any existing network policy (created using [CREATE NETWORK POLICY](sql/create-network-policy.md))

Default:
:   None

> **Note:**
>
> This is the only account parameter that can be set by security administrators (i.e users with the SECURITYADMIN system role) or higher.

## NOORDER_SEQUENCE_AS_DEFAULT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether the ORDER or NOORDER property is set by default when you create a new sequence or add a new table
    column.

    The ORDER and NOORDER properties determine whether or not the values are generated for the sequence or auto-incremented column
    in [increasing or decreasing order](../user-guide/querying-sequences.md).

Values:
:   * `TRUE`: When you create a new sequence or add a new table column, the NOORDER property is set by default.

      NOORDER specifies that the values are not guaranteed to be in increasing order.

      For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.

      NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
      clients are executing multiple INSERT statements).
    * `FALSE`: When you create a new sequence or add a new table column, the ORDER property is set by default.

      ORDER specifies that the values generated for a sequence or auto-incremented column are in increasing order (or, if the interval
      is a negative value, in decreasing order).

      For example, if a sequence or auto-incremented column has `START 1 INCREMENT 2`, the generated values might be
      `1`, `3`, `5`, `7`, `9`, etc.

    If you set this parameter, the value that you set overrides the value in the 2024_01 behavior change bundle.

Default:
:   `TRUE`

## ODBC_TREAT_DECIMAL_AS_INT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies how ODBC processes columns that have a scale of zero (`0`).

Values:
:   `TRUE`: ODBC processes a column whose scale is zero as BIGINT.

    `FALSE`: ODBC processes a column whose scale is zero as DECIMAL.

Default:
:   `FALSE`

## OAUTH_ADD_PRIVILEGED_ROLES_TO_BLOCKED_LIST

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Determines whether the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles can be used as the primary role when creating a
    Snowflake session based on the access token from Snowflake’s authorization server.

Values:
:   `TRUE`: Adds the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN roles to the `BLOCKED_ROLES_LIST` property of the Snowflake OAuth
    security integration, which means these roles cannot be used as the primary role when creating a Snowflake session using Snowflake
    OAuth.

    `FALSE`: Removes the ACCOUNTADMIN, ORGADMIN, GLOBALORGADMIN, and SECURITYADMIN from the list of blocked roles defined by the
    `BLOCKED_ROLES_LIST` property of the Snowflake OAuth security integration.

Default:
:   `TRUE`

## OPT_OUT_ERROR_LOGGING

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether DML error logging is allowed.

Values:
:   `TRUE`: DML error logging isn’t turned on, regardless of whether it is turned on
    for specific tables.

    `FALSE`: DML error logging behaves normally:

    > * DML error logging is turned on for tables that have the ERROR_LOGGING table-level parameter set
    >   to `TRUE`.
    > * DML error logging is turned off for tables that have the ERROR_LOGGING table-level parameter set
    >   to `FALSE`.

Default:
:   `FALSE`

For more information, see [DML error logging](../user-guide/data-load-overview.md).

## PERIODIC_DATA_REKEYING

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   This parameter only applies to [Enterprise Edition](../user-guide/intro-editions.md) (or higher). It enables/disables re-encryption of table data with new keys on a yearly basis to provide
    additional levels of data protection.

    You can enable and disable rekeying at any time. Enabling/disabling rekeying does not result in gaps in your encrypted data:

    > * If rekeying is enabled for a period of time and then disabled, all data already tagged for rekeying is rekeyed, but no further data is rekeyed until you re-enable it again.
    > * If rekeying is re-enabled, Snowflake automatically rekeys all data that has keys which meet the criteria (that is, keys that are older than one year).

    For more information about rekeying of encrypted data, see [Understanding Encryption Key Management in Snowflake](../user-guide/security-encryption-manage.md).

Values:
:   `TRUE`: Data is rekeyed after one year has passed since the data was last encrypted. Rekeying occurs in the background so no down-time is experienced and the affected data/table is always
    available.

    `FALSE`: Data is not rekeyed.

Default:
:   `FALSE`

> **Note:**
>
> There are charges associated with data rekeying because, after data is rekeyed, the old data (with the previous key encryption) is maintained in Fail-safe for the standard time period (7 days). For
> this reason, periodic rekeying is disabled by default. To enable periodic rekeying, you must explicitly enable it.
>
> Also, Fail-safe charges for rekeying are not listed individually in your monthly statement; they are included in the Fail-safe total for your account each month.
>
> For more information about Fail-safe, see [Understanding and viewing Fail-safe](../user-guide/data-failsafe.md).

## PIPE_EXECUTION_PAUSED

Type:
:   Object — Can be set for Account » Schema » Pipe

Data Type:
:   Boolean

Description:
:   Specifies whether to pause a running pipe, primarily in preparation for transferring ownership of the pipe to a different role:

    * An account administrator (user with the ACCOUNTADMIN role) can set this parameter at the account level, effectively pausing or resuming all pipes in the account.
    * A user with the MODIFY privilege on a schema can pause or resume all pipes in the schema.
    * The pipe owner can set this parameter for a pipe.

    Note that setting the parameter at the account or schema level only affects pipes for which the parameter has not already been explicitly set at a lower level
    (e.g. at the pipe level by the pipe owner).

    This enables the practical use case in which an account administrator can pause all pipes at the account level, while a pipe owner can still have an individual pipe
    running.

Values:
:   `TRUE`: Pauses the pipe. When the parameter is set to this value, the [SYSTEM$PIPE_STATUS](functions/system_pipe_status.md) function shows the `executionState`
    as `PAUSED`. Note that the pipe owner can continue to submit files to a paused pipe; however, the files are not processed until the pipe is resumed.

    `FALSE`: Resumes the pipe, but only if ownership of the pipe has not been transferred while it was paused. When the parameter is set to this value, the
    [SYSTEM$PIPE_STATUS](functions/system_pipe_status.md) function shows the `executionState` as `RUNNING`.

    If ownership of the pipe was transferred to another role after the pipe was paused, this parameter cannot be used to resume the pipe. Instead, use the
    [SYSTEM$PIPE_FORCE_RESUME](functions/system_pipe_force_resume.md) function to explicitly force the pipe to resume.

    This enables the new owner to use [SYSTEM$PIPE_STATUS](functions/system_pipe_status.md) to evaluate the pipe status (e.g. determine how many files are waiting to be loaded)
    before resuming the pipe.

Default:
:   `FALSE` (pipes are running by default)

> **Note:**
>
> In general, pipes do not need to paused, except for transferring ownership.

## PATH_LAYOUT

Type:
:   Object (for Apache Iceberg™ tables) — Can be set for Iceberg table

Data Type:
:   String (Constant)

Description:
:   Specifies the path layout that Snowflake uses when writing Parquet data files to [Iceberg tables](../user-guide/tables-iceberg.md).
    You can specify this parameter when you create a table but you can’t specify this parameter when you modify a table.

    > **Note:**
    >
    > For externally managed tables that you create in a standard Snowflake database, Snowflake infers and honors the partitoning scheme
    > that is specified by the remote table.

Values:
:   * `FLAT`: Snowflake writes all Parquet data files under the `data/` directory for the table.
    * `HIERARCHICAL`: Snowflake writes partitioned data under the `data/` directory for tha table by using a hierarchical
      path layout. With this layout, each partition column is represented
      as a directory level in the path. To define these partition
      columns, use the PARTITION BY parameter. This layout is also called “Hive-style” partitioning.

      If you specify the hierarchical layout but don’t specify a PARTITION BY clause with the command, Snowflake stores the Parquet data
      files by using a flat layout path. You can’t
      use the ALTER ICEBERG TABLE command to later enable a hierarchical path layout for the table. You might set this
      parameter to
      HIERARCHICAL without specifying a PARTITION BY clause if you don’t want to use partitioning with hierarchical paths now but you
      might in the future.

Default:
:   `FLAT`

For more information, see [Data and metadata directories](../user-guide/tables-iceberg-storage.md).

## PREVENT_UNLOAD_TO_INLINE_URL

Type:
:   Object (for users) — Can be set for Account » User

Data Type:
:   Boolean

Description:
:   Specifies whether to prevent ad hoc data unload operations to external cloud storage locations (that is, [COPY INTO <location>](sql/copy-into-location.md) statements that specify the cloud storage URL and access settings directly in the statement). For an example, see [Unloading data from a table directly to files in an external location](sql/copy-into-location.md).

Values:
:   `TRUE`: COPY INTO *<location>* statements must reference either a named internal (Snowflake) or external stage or an internal user or table stage. A named external stage must store the cloud storage URL and access settings in its definition.

    `FALSE`: Ad hoc data unload operations to external cloud storage locations are permitted.

Default:
:   `FALSE`

## PREVENT_UNLOAD_TO_INTERNAL_STAGES

Type:
:   User — Can be set for Account » User

Data Type:
:   Boolean

Description:
:   Specifies whether to prevent data unload operations to internal (Snowflake) stages using [COPY INTO <location>](sql/copy-into-location.md) statements.

Values:
:   `TRUE`: Unloading data from Snowflake tables to any internal stage, including user stages, table stages, or named internal stages is prevented.

    `FALSE`: Unloading data to internal stages is permitted, limited only by the default restrictions of the stage type:

    > * The current user can only unload data to their own user stage.
    > * Users can only unload data to table stages when their active role has the OWNERSHIP privilege on the table.
    > * Users can only unload data to named internal stages when their active role has the WRITE privilege on the stage.

Default:
:   `FALSE`

## PYTHON_PROFILER_TARGET_STAGE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the fully-qualified name of the stage in which to save a report when
    [profiling Python handler code](../developer-guide/stored-procedure/python/procedure-python-profiler.md).

Values:
:   Fully-qualified name of the stage in which to save the report.

    * Use a temporary stage to store output only for the duration of the session.
    * Use a permanent stage to preserve the profiler output outside of the scope of a session.

    For more information, see [Specify the Snowflake stage where profile output should be written](../developer-guide/stored-procedure/python/procedure-python-profiler.md).

Default:
:   `''`

## PYTHON_PROFILER_MODULES

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the list of Python modules to include in a report when [profiling Python handler code](../developer-guide/stored-procedure/python/procedure-python-profiler.md).

    Use this parameter to specify modules that are contained in staged handlers or that contain dependencies that you want to include
    in the profile.

Values:
:   A comma-separated list of Python module names.

    For examples, see [Including modules with the PYTHON_PROFILER_MODULES parameter](../developer-guide/stored-procedure/python/procedure-python-profiler.md) and [Profiling staged handler code](../developer-guide/stored-procedure/python/procedure-python-profiler.md).

Default:
:   `''`

## QUERY_TAG

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (up to 2000 characters)

Description:
:   Optional string that can be used to tag queries and other SQL statements executed within a session. The tags are displayed in the output of the [QUERY_HISTORY , QUERY_HISTORY_BY_\*](functions/query_history.md)
    functions.

Default:
:   None

## QUOTED_IDENTIFIERS_IGNORE_CASE

Type:
:   Session — Can be set for Account » User » Session

    Object — Can be set for Account » Database » Schema » Table

Data Type:
:   Boolean

Description:
:   Specifies whether letters in double-quoted object identifiers are stored and resolved as uppercase letters. By default,
    Snowflake preserves the case of alphabetic characters when storing and resolving double-quoted identifiers. (see
    [Identifier resolution](identifiers-syntax.md).) You can use this parameter in situations in which
    [third-party applications always use double quotes around identifiers](identifiers-syntax.md).

    > **Note:**
    >
    > Changing this parameter from the default value can affect your ability to find objects that were previously created with
    > double-quoted mixed case identifiers. Refer to [Impact of changing the parameter](identifiers-syntax.md).

    When set on a table, schema, or database, the setting only affects the evaluation of table names in the bodies of views and
    user-defined functions (UDFs). If your account uses double-quoted identifiers that should be treated as case-insensitive
    and you plan to share a view or UDF with an account that treats double-quoted identifiers as case-sensitive, you can set
    this on the view or UDF that you plan to share. This allows the other account to resolve the table names in the view or UDF
    correctly.

Values:
:   `TRUE`: Letters in double-quoted identifiers are stored and resolved as uppercase letters.

    `FALSE`: The case of letters in double-quoted identifiers is preserved. Snowflake resolves and stores the identifiers in the specified case.

    For more information, see [Identifier resolution](identifiers-syntax.md).

Default:
:   `FALSE`

For example:

| Identifier |  | Param set to `FALSE` (default) | Param set to `TRUE` |
| --- | --- | --- | --- |
| `"columnname"` | resolves to: | `columnname` | `COLUMNNAME` |
| `"columnName"` | resolves to: | `columnName` | `COLUMNNAME` |
| `"ColumnName"` | resolves to: | `ColumnName` | `COLUMNNAME` |
| `"COLUMNNAME"` | resolves to: | `COLUMNNAME` | `COLUMNNAME` |

## READ_CONSISTENCY_MODE

Type:
:   Account — Can be set only for Account

Data Type:
:   String

Description:
:   Defines the level of consistency guarantees that are required for sessions with near-concurrent changes.

Values:
:   `SESSION`: Changes are immediately visible to subsequent queries within the same session but not always immediately across sessions.

    `GLOBAL`: Changes are immediately visible to subsequent queries across concurrently running sessions, but with a small impact on query response times
    (usually milliseconds).

Default:
:   `SESSION`

For more information, see [Read consistency across sessions](transactions.md).

## REPLACE_INVALID_CHARACTERS

Type:
:   Object — Can be set for Account » Database » Schema » Iceberg table

Data Type:
:   Boolean

Description:
:   Specifies whether to replace invalid UTF-8 characters with the Unicode replacement character (�) in query results
    for [Apache Iceberg™ tables](sql/create-iceberg-table.md) that use an external catalog.

Values:
:   `TRUE`: Snowflake replaces invalid UTF-8 characters with the Unicode replacement character.

    `FALSE`: Snowflake leaves invalid UTF-8 characters unchanged. Snowflake returns a user error message if it encounters an invalid UTF-8
    character.

Default:
:   `FALSE`

## REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_CREATION

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether to require a storage integration object as cloud credentials when creating a named external stage (using [CREATE STAGE](sql/create-stage.md)) to access a private cloud storage location.

Values:
:   `TRUE`: Creating an external stage to access a private cloud storage location requires referencing a storage integration object as cloud credentials.

    `FALSE`: Creating an external stage does not require referencing a storage integration object. Users can instead reference explicit cloud provider credentials, such as secret keys or access tokens, if they have been configured for the storage location.

Default:
:   `FALSE`

## REQUIRE_STORAGE_INTEGRATION_FOR_STAGE_OPERATION

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   Specifies whether to require using a named external stage that references a storage integration object as cloud credentials when loading data from or unloading data to a private cloud storage location.

Values:
:   `TRUE`: Loading data from or unloading data to a private cloud storage location requires using a named external stage that references a storage integration object; specifying a named external stage that references explicit cloud provider credentials, such as secret keys or access tokens, produces a user error.

    `FALSE`: Users can load data from or unload data to a private cloud storage location using a named external stage that references explicit cloud provider credentials.

    If PREVENT_UNLOAD_TO_INLINE_URL is FALSE, then users can specify the explicit cloud provider credentials directly in the COPY statement.

Default:
:   `FALSE`

## ROWS_PER_RESULTSET

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Number

Clients:
:   SQL API

Description:
:   Specifies the maximum number of rows returned in a result set.

Values:
:   `0` to any number (no limit) — a value of `0` specifies no maximum.

Default:
:   `0`

## S3_STAGE_VPCE_DNS_NAME

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the DNS name of an Amazon S3 interface endpoint. Requests sent to the internal stage of an account via
    [AWS PrivateLink for Amazon S3](https://docs.aws.amazon.com/AmazonS3/latest/userguide/privatelink-interface-endpoints.html) use this
    endpoint to connect.

    For more information, see [Accessing Internal stages with dedicated interface endpoints](../user-guide/private-internal-stages-aws.md).

Values:
:   Valid region-scoped DNS Name of an S3 interface endpoint.

    The standard format begins with an asterisk (`*`) and ends with `vpce.amazonaws.com`
    (e.g. `*.vpce-sd98fs0d9f8g.s3.us-west-2.vpce.amazonaws.com`). For more details about obtaining this value, refer to
    [AWS configuration](../user-guide/private-internal-stages-aws.md).

    Alternative formats include `bucket.vpce-xxxxxxxx.s3.<region>.vpce.amazonaws.com` and `vpce-xxxxxxxx.s3.<region>.vpce.amazonaws.com`.

Default:
:   Empty string

## SAML_IDENTITY_PROVIDER

Type:
:   Account — Can be set only for Account

Data Type:
:   JSON

Description:
:   Enables federated authentication. This deprecated parameter enables federated authentication. This parameter accepts a JSON
    object, enclosed in single quotes, with the following fields:

    ```sqljson
    {
      "certificate": "",
      "issuer": "",
      "ssoUrl": "",
      "type"  : "",
      "label" : ""
    }
    ```

    Where:

    `certificate`
    :   Specifies the certificate (generated by the IdP) that verifies communication between the IdP and Snowflake.

    `issuer`
    :   Indicates the Issuer/EntityID of the IdP.

        Optional.

        For information on how to obtain this value in Okta and AD FS, see [Migrating to a SAML2 security integration](../user-guide/admin-security-fed-auth-configure-snowflake.md).

    `ssoUrl`
    :   Specifies the URL endpoint (provided by the IdP) where Snowflake sends the SAML requests.

    `type`
    :   Specifies the type of IdP used for federated authentication (`"OKTA"` , `"ADFS"` , `"Custom"`).

    `label`
    :   Specifies the button text for the IdP in the Snowflake login page. The default label is `Single Sign On`. If you change the default label, the label you specify can only contain alphanumeric
        characters (special characters and blank spaces are not currently supported).

        Note that, if the `"type"` field is `"Okta"`, a value for the `label` field does not need to be specified because Snowflake displays the Okta logo in the button.

    For more information, including examples of setting the parameter, see [Migrating to a SAML2 security integration](../user-guide/admin-security-fed-auth-configure-snowflake.md).

Default:
:   None

## SEARCH_PATH

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the path to search to resolve unqualified object names in queries. For more information, see
    [Name resolution in queries](name-resolution.md).

Values:
:   Comma-separated list of identifiers. An identifier can be a fully or partially qualified schema name.

Default:
:   `$current, $public`

    For more information about the default settings, see [default search path](name-resolution.md).

> **Note:**
>
> * You cannot set this parameter within a client connection string, such as a JDBC or ODBC connection string. You must
>   establish a session before setting a search path.
> * This parameter isn’t supported for [tasks](sql/create-task.md).

## SERVERLESS_TASK_MAX_STATEMENT_SIZE

Type:
:   Object — Can be set for Account » Database » Schema » Task

Data Type:
:   String

Description:
:   Specifies the maximum allowed warehouse size for [Serverless tasks](../user-guide/tasks-intro.md).

Values:
:   Any traditional [warehouse size](../user-guide/warehouses-overview.md): `XSMALL`, `SMALL`, `MEDIUM`, `LARGE`, `XLARGE`, `X2LARGE`. The maximum size is `X2LARGE`.

    Also supports the syntax: `XXLARGE`.

Default:
:   `X2LARGE`

## SERVERLESS_TASK_MIN_STATEMENT_SIZE

Type:
:   Object — Can be set for Account » Database » Schema » Task

Data Type:
:   String

Description:
:   Specifies the minimum allowed warehouse size for [Serverless tasks](../user-guide/tasks-intro.md).

Values:
:   Any traditional [warehouse size](../user-guide/warehouses-overview.md): `XSMALL`, `SMALL`, `MEDIUM`, `LARGE`, `XLARGE`, `X2LARGE`. The maximum size is `X2LARGE`.

    Also supports the syntax: `XXLARGE`.

Default:
:   `XSMALL`

## SIMULATED_DATA_SHARING_CONSUMER

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the name of a consumer account to simulate for testing/validating shared data, particularly shared secure views. When this parameter is set in a session, shared views return rows as if executed in the specified consumer account rather than the provider account.

    > **Note:**
    >
    > Simulations only succeed when the current role is the owner of the view.
    > If the current role does not own the view, simulations fail with the error:

    ```none
    Shared view consumer simulation requires that the executing role owns the view.
    ```

    For more information, see [About Secure Data Sharing](../user-guide/data-sharing-intro.md) and [Create and configure shares](../user-guide/data-sharing-provider.md).

Default:
:   None

> **Important:**
>
> This is a session parameter, which means it can be set at the account level; however, it only applies to testing queries on shared views. Because the parameter affects all queries in a session, it should
> never be set at the account level.

## SQL_TRACE_QUERY_TEXT

Type:
:   Account — Can be set only for Account

Data Type:
:   String (Constant)

Description:
:   Specifies whether to capture the SQL text of a traced SQL statement.

Values:
:   `ON`: Traces that follow a SQL statement will capture text of the SQL and store it in the event table.

    `OFF`: Traces do not capture SQL text in the event table.

    For more information, see [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md) and [SQL statement tracing](../developer-guide/logging-tracing/tracing.md).

Default:
:   `OFF`

## SSO_LOGIN_PAGE

Type:
:   Account — Can be set only for Account

Data Type:
:   Boolean

Description:
:   This deprecated parameter disables preview mode for testing SSO (after enabling federated authentication) before rolling it out to users:

Values:
:   `TRUE`: Preview mode is disabled and users will see the button for Snowflake-initiated SSO for your identity provider (as specified in SAML_IDENTITY_PROVIDER) in the Snowflake main login page.

    `FALSE`: Preview mode is enabled and SSO can be tested using the following URL:

    > * If your account is in US West: `https://<account_identifier>.snowflakecomputing.com/console/login?fedpreview=true`
    > * If your account is in any other region:
    >   `https://<account_identifier>.<region_id>.snowflakecomputing.com/console/login?fedpreview=true`

    For more information, see:

    * [Migrating to a SAML2 security integration](../user-guide/admin-security-fed-auth-configure-snowflake.md)
    * [Account identifiers](../user-guide/admin-account-identifier.md)

Default:
:   `FALSE`

## STATEMENT_QUEUED_TIMEOUT_IN_SECONDS

Type:
:   Session and Object (for warehouses)

    Can be set for Account » User » Session; can also be set for individual warehouses

Data Type:
:   Number

Description:
:   Amount of time, in seconds, a SQL statement (query, DDL, DML, and so on) remains queued for a warehouse before it is canceled by the system. This parameter can be used in conjunction with the MAX_CONCURRENCY_LEVEL parameter to ensure a warehouse is never backlogged.

    The parameter can be set at different levels in the session hierarchy (on the account, user, and session). If the parameter is set at more than one level, the rules described in Session parameters determine which value is used.

    The parameter can also be set for an individual warehouse to control the runtime for all SQL statements processed by the warehouse. When the parameter is set in both the session hierarchy and the warehouse, the timeout is the lowest non-zero value of the two parameters.

    For example, assume the parameter is set to the following values at different levels:

    * User - 10
    * Session - 20
    * Warehouse - 15

    In this case, the value of the parameter set on the warehouse is used (15) because it is less than the value set in the session hierarchy (20). The parameter set on the user (10) isn’t considered because it is overridden by the parameter set in the session.

    > **Note:**
    >
    > When both STATEMENT_QUEUED_TIMEOUT_IN_SECONDS and USER_TASK_TIMEOUT_MS are set, the value of USER_TASK_TIMEOUT_MS takes precedence.
    >
    > When comparing the values of these two parameters, note that
    > STATEMENT_QUEUED_TIMEOUT_IN_SECONDS is set in units of seconds, while USER_TASK_TIMEOUT_MS
    > uses units of milliseconds.

Values:
:   `0` to any number (no limit) — a value of `0` specifies that no timeout is enforced. A statement will remained queued as long as the queue persists.

Default:
:   `0` (no timeout)

## STATEMENT_TIMEOUT_IN_SECONDS

Type:
:   Session and Object (for warehouses)

    Can be set for Account » User » Session; can also be set for individual warehouses

Data Type:
:   Number

Description:
:   Amount of time, in seconds, after which a running SQL statement (query, DDL, DML, and so on) is canceled by the system.

    The parameter can be set at different levels in the session hierarchy (on the account, user, and session). If the parameter is set at more than one level, the rules described in Session parameters determine which value is used.

    The parameter can also be set for an individual warehouse to control the runtime for all SQL statements processed by the warehouse. When the parameter is set in both the session hierarchy and the warehouse, the timeout is the lowest non-zero value of the two parameters.

    For example, assume the parameter is set to the following values at different levels:

    * User - 10
    * Session - 20
    * Warehouse - 15

    In this case, the value of the parameter set on the warehouse is used (15) because it is less than the value set in the session hierarchy (20). The parameter set on the user (10) isn’t considered because it is overridden by the parameter set in the session.

    When both USER_TASK_TIMEOUT_MS and STATEMENT_TIMEOUT_IN_SECONDS are set, the timeout is the lowest non-zero value of the two parameters. When comparing the values of these two parameters, note that STATEMENT_TIMEOUT_IN_SECONDS is set in units of seconds, while USER_TASK_TIMEOUT_MS uses units of milliseconds.

    The parameter setting applies to all of the time taken by the statement, including queue time, locked time, execution time, compilation time, and
    so on. It applies to the overall time taken by the statement, not just the warehouse execution time.

Values:
:   `0` to `604800` (7 days) — a value of `0` specifies that the maximum timeout value is enforced.

Default:
:   `172800` (2 days)

## STORAGE_SERIALIZATION_POLICY

Type:
:   Object (for databases, schemas, and Apache Iceberg™ tables) — Can be set for Account » Database » Schema » Iceberg table

Data Type:
:   String (Constant)

Description:
:   Specifies the storage serialization policy for Snowflake-managed [Apache Iceberg™ tables](../user-guide/tables-iceberg.md).

Values:
:   `COMPATIBLE`: Snowflake performs encoding and compression that ensures interoperability with third-party compute engines.

    `OPTIMIZED`: Snowflake performs encoding and compression that ensures the best table performance within Snowflake.

Default:
:   `OPTIMIZED`

## STRICT_JSON_OUTPUT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   This parameter specifies whether JSON output in a session is compatible with the general standard (as described by <http://json.org>).

    By design, Snowflake allows JSON input that contains non-standard values; however, these non-standard values might result in Snowflake outputting JSON that is incompatible with other platforms and
    languages. This parameter, when enabled, ensures that Snowflake outputs valid/compatible JSON.

Values:
:   `TRUE`: Strict JSON output is enabled, enforcing the following behavior:

    > * Missing and undefined values in input mapped to JSON NULL.
    > * Non-finite numeric values in input (Infinity, -Infinity, NaN, etc.) mapped to strings with valid JavaScript representations. This enables compatibility with JavaScript and also allows conversion of
    >   these values back to numeric values.

    `FALSE`: Strict JSON output is not enabled.

Default:
:   `FALSE`

For example:

| Non-standard JSON Input |  | Param set to `FALSE` (default) | Param set to `TRUE` |
| --- | --- | --- | --- |
| `[289, 2188,]` | outputs: | `[ 289, 2188, undefined ]` | `[ 289, 2188, null ]` |
| `[undefined, undefined]` | outputs: | `[ undefined, undefined ]` | `[ null, null ]` |
| `[Infinity,inf,-Infinity,-inf]` | outputs: | `[ Infinity, Infinity, -Infinity, -Infinity ]` | `[ "Infinity", "Infinity", "-Infinity", "-Infinity" ]` |
| `[NaN,nan]` | outputs: | `[ NaN, NaN ]` | `[ "NaN", "NaN" ]` |

## SUSPEND_TASK_AFTER_NUM_FAILURES

Type:
:   Object (for databases, schemas, and tasks) — Can be set for Account » Database » Schema » Task

Data Type:
:   Integer

Description:
:   Number of consecutive failed task runs after which a standalone task or
    [task graph](../user-guide/tasks-graphs.md) root task is suspended automatically. Failed task runs include
    runs in which the SQL code in the task body either produces a user error or times out. Task
    runs that are skipped, canceled, or that fail due to a system error are considered indeterminate
    and are not included in the count of failed task runs.

    When the parameter is set to `0`, the failed task is not automatically suspended.

    When the parameter is set to a value greater than `0`, the following behavior applies to
    runs of standalone tasks or task graph root tasks:

    * A standalone task is automatically suspended after the specified number of consecutive task
      runs either fail or time out.
    * A root task is automatically suspended after the specified number of times in consecutive runs
      after any single task in a task graph fails or times out, after all
      `TASK_AUTO_RETRY_ATTEMPTS` for that task.

      For example, if a root task has `SUSPEND_TASK_AFTER_NUM_FAILURES` set to 3, and
      it has a child task with `TASK_AUTO_RETRY_ATTEMPTS` set to 3, then after that child task
      fails 9 consecutive times, the root task is suspended.

    The default value for the parameter is set to `10`, which means that the task is automatically suspended after 10 consecutive failed task runs.

    When you explicitly set the parameter value at the account, database, or schema level, the
    change is applied to tasks contained in the modified object during their next scheduled run
    (including any child task in a task graph run in progress).

    Suspending a standalone task resets its count of failed task runs. Suspending the root task of a task graph resets the count for each
    task in the task graph.

Values:
:   `0` - No upper limit.

Default:
:   `10`

## TASK_AUTO_RETRY_ATTEMPTS

Type:
:   Object (for databases, schemas, and tasks) — Can be set for Account » Database » Schema » Task

Data Type:
:   Integer

Description:
:   Specifies the number of automatic task graph retry attempts. If any task graphs complete in a `FAILED` state, Snowflake
    can automatically retry the task graphs from the last task in the graph that failed. Failed task runs include runs in which the SQL code in
    the task body either produces a user error or times out. Task runs that are skipped or canceled are considered indeterminate and are not included in the count of failed task runs.

    The automatic task graph retry is disabled by default. To enable this feature, set `TASK_AUTO_RETRY_ATTEMPTS` to a value greater than
    `0`.

    When you set the parameter value at the account, database, or schema level, the change is applied to tasks contained in the modified object
    during their next scheduled run.

Values:
:   `0` - No upper limit.

Default:
:   `0`

## TIMESTAMP_DAY_IS_ALWAYS_24H

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether the [DATEADD](functions/dateadd.md) function (and its aliases) always consider a day to be exactly 24 hours for expressions that span multiple days.

Values:
:   `TRUE`: A day is always exactly 24 hours.

    `FALSE`: A day is not always 24 hours.

Default:
:   `FALSE`

> **Important:**
>
> If set to `TRUE`, the actual time of day might not be preserved when daylight saving time (DST) is in effect. For example:
>
> ```sqlexample
> alter session set TIMESTAMP_DAY_IS_ALWAYS_24H = true;
>
> -- With DST beginning on 2018-03-11 at 2 AM, America/Los_Angeles time zone
> select dateadd(day, 1, '2018-03-10 09:00:00'::TIMESTAMP_LTZ), dateadd(day, 1, '2018-11-03 09:00:00'::TIMESTAMP_LTZ);
>
> +-------------------------------------------------------+-------------------------------------------------------+
> | DATEADD(DAY, 1, '2018-03-10 09:00:00'::TIMESTAMP_LTZ) | DATEADD(DAY, 1, '2018-11-03 09:00:00'::TIMESTAMP_LTZ) |
> |-------------------------------------------------------+-------------------------------------------------------|
> | 2018-03-11 10:00:00.000 -0700                         | 2018-11-04 08:00:00.000 -0800                         |
> +-------------------------------------------------------+-------------------------------------------------------+
>
> alter session set TIMESTAMP_DAY_IS_ALWAYS_24H = false;
>
> select dateadd(day, 1, '2018-03-10 09:00:00'::TIMESTAMP_LTZ), dateadd(day, 1, '2018-11-03 09:00:00'::TIMESTAMP_LTZ);
>
> +-------------------------------------------------------+-------------------------------------------------------+
> | DATEADD(DAY, 1, '2018-03-10 09:00:00'::TIMESTAMP_LTZ) | DATEADD(DAY, 1, '2018-11-03 09:00:00'::TIMESTAMP_LTZ) |
> |-------------------------------------------------------+-------------------------------------------------------|
> | 2018-03-11 09:00:00.000 -0700                         | 2018-11-04 09:00:00.000 -0800                         |
> +-------------------------------------------------------+-------------------------------------------------------+
> ```

## TIMESTAMP_INPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the input format for the TIMESTAMP data type alias. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported timestamp format or `AUTO`

    (`AUTO` specifies that Snowflake attempts to automatically detect the format of timestamps stored in the system during the session)

Default:
:   `AUTO`

## TIMESTAMP_LTZ_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the TIMESTAMP_LTZ data type. If CSV_TIMESTAMP_FORMAT is not set, TIMESTAMP_LTZ_OUTPUT_FORMAT is used when downloading CSV files. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported timestamp format

Default:
:   None

## TIMESTAMP_NTZ_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the TIMESTAMP_NTZ data type. If CSV_TIMESTAMP_FORMAT is not set, TIMESTAMP_NTZ_OUTPUT_FORMAT is used when downloading CSV files. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported timestamp format

Default:
:   `YYYY-MM-DD HH24:MI:SS.FF3`

## TIMESTAMP_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the TIMESTAMP data type alias. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported timestamp format

Default:
:   `YYYY-MM-DD HH24:MI:SS.FF3 TZHTZM`

## TIMESTAMP_TYPE_MAPPING

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the TIMESTAMP_\* variation that the TIMESTAMP data type alias maps to.

Values:
:   `TIMESTAMP_LTZ` , `TIMESTAMP_NTZ` , or `TIMESTAMP_TZ`

Default:
:   `TIMESTAMP_NTZ`

## TIMESTAMP_TZ_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the TIMESTAMP_TZ data type. If CSV_TIMESTAMP_FORMAT is not set, TIMESTAMP_TZ_OUTPUT_FORMAT is used when downloading CSV files. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported timestamp format

Default:
:   None

## TIMEZONE

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   Specifies the time zone for the session.

Values:
:   You can specify a [time zone name](https://data.iana.org/time-zones/tzdb-2025b/zone1970.tab) or a [link name](https://data.iana.org/time-zones/tzdb-2025b/backward) from release 2025b of the [IANA Time Zone Database](https://www.iana.org/time-zones) (e.g.
    `America/Los_Angeles`, `Europe/London`, `UTC`, `Etc/GMT`, etc.).

Default:
:   `America/Los_Angeles`

> **Note:**
>
> * Time zone names are case-sensitive and must be enclosed in single quotes (e.g. `'UTC'`).
> * Snowflake does not support the majority of timezone [abbreviations](https://en.wikipedia.org/wiki/List_of_time_zone_abbreviations) (e.g. `PDT`, `EST`, etc.) because a
>   given abbreviation might refer to one of several different time zones. For example, `CST` might refer to Central
>   Standard Time in North America (UTC-6), Cuba Standard Time (UTC-5), and China Standard Time (UTC+8).

## TIME_INPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the input format for the TIME data type. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported time format or `AUTO`

    (`AUTO` specifies that Snowflake attempts to automatically detect the format of times stored in the system during the session)

Default:
:   `AUTO`

## TIME_OUTPUT_FORMAT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the display format for the TIME data type. For more information, see [Date and time input and output formats](date-time-input-output.md).

Values:
:   Any valid, supported time format

Default:
:   `HH24:MI:SS`

## TRACE_LEVEL

Type:
:   Session — Can be set for Account » User » Session

    Object (for databases, schemas, stored procedures, and UDFs) — Can be set for Account » Database » Schema » Procedure and Account » Database » Schema » Function

Data Type:
:   String (Constant)

Description:
:   Controls how trace events are ingested into the event table. For more information about trace levels, see
    [Setting levels for logging, metrics, and tracing](../developer-guide/logging-tracing/telemetry-levels.md).

Values:
:   `ALWAYS`: All spans and trace events will be recorded in the event table.

    `ON_EVENT`: Trace events will be recorded in the event table only when your stored procedures or UDFs explicitly add events.

    `OFF`: No spans or trace events will be recorded in the event table.

Default:
:   `OFF`

> **Note:**
>
> When tracing events, you must also set the LOG_LEVEL parameter to one of its supported values.

## TRANSACTION_ABORT_ON_ERROR

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   BOOLEAN

Description:
:   Specifies the action to perform when a statement issued within a non-autocommit transaction returns with an error.

Values:
:   `TRUE`: The non-autocommit transaction is aborted. All statements issued inside that transaction will fail until a commit or rollback statement is executed to close that transaction.

    `FALSE`: The non-autocommit transaction is not aborted.

Default:
:   `FALSE`

## TRANSACTION_DEFAULT_ISOLATION_LEVEL

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String

Description:
:   Specifies the isolation level for transactions in the user session.

Values:
:   `READ COMMITTED` (only currently-supported value)

Default:
:   `READ COMMITTED`

## TWO_DIGIT_CENTURY_START

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Number

Description:
:   Specifies the “century start” year for 2-digit years (that is, the earliest year such dates can represent). This parameter prevents ambiguous dates when importing or converting data with
    the `YY` date format component (years represented as 2 digits).

Values:
:   `1900` to `2100` (any value outside of this range returns an error)

Default:
:   `1970`

For example:

| Year |  | Param set to `1900` | Param set to `1970` (default) | Param set to `1980` | Param set to `1990` | Param set to `2000` |
| --- | --- | --- | --- | --- | --- | --- |
| `00` | becomes: | `1900` | `2000` | `2000` | `2000` | `2000` |
| `79` | becomes: | `1979` | `1979` | `2079` | `2079` | `2079` |
| `89` | becomes: | `1989` | `1989` | `1989` | `2089` | `2089` |
| `99` | becomes: | `1999` | `1999` | `1999` | `1999` | `2099` |

## UNSUPPORTED_DDL_ACTION

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   String (Constant)

Description:
:   Determines if an unsupported (non-default) value specified for a constraint property returns an error.

Values:
:   `IGNORE`: Snowflake does not return an error for unsupported values.

    `FAIL`: Snowflake returns an error for unsupported values.

Default:
:   `IGNORE`

> **Important:**
>
> This parameter does not determine whether the constraint is created. Snowflake does not create constraints using unsupported values, regardless of how this parameter is set.
>
> For more information, see [Constraint properties](sql/create-table-constraint.md).

## USE_CACHED_RESULT

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Boolean

Description:
:   Specifies whether to reuse persisted query results, if available, when a matching query is submitted.

Values:
:   `TRUE`: When a query is submitted, Snowflake checks for matching query results for previously-executed queries and, if a matching result exists, uses the result instead of executing the
    query. This can help reduce query time because Snowflake retrieves the result directly from the cache.

    `FALSE`: Snowflake executes each query when submitted, regardless of whether a matching query result exists.

Default:
:   `TRUE`

## USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE

Type:
:   Object (for databases, schemas, and tasks) — Can be set for Account » Database » Schema » Task

Data Type:
:   String

Description:
:   Specifies the size of the compute resources to provision for the first run of the task, before a task history is available for
    Snowflake to determine an ideal size. Once a task has successfully completed a few runs, Snowflake ignores this parameter setting. If the
    task history is unavailable for a given task, the compute resources revert to this initial size.

    > **Note:**
    >
    > This parameter applies only to [serverless tasks](../user-guide/tasks-intro.md).

    The size is equivalent to the compute resources available when creating a warehouse. If the parameter is omitted, the first runs of the
    task are executed using a medium-sized (`MEDIUM`) warehouse.

    You can change the initial size for individual tasks (using [ALTER TASK](sql/alter-task.md)) after the task is created but
    before it has run successfully once. Changing the parameter after the first run of this task starts has no effect on the
    compute resources for current or future task runs.

    Note that suspending and resuming a task does not remove the task history used to size the compute resources. The task history is
    only removed if the task is recreated (using the [CREATE OR REPLACE TASK](sql/create-task.md) syntax).

Values:
:   Any traditional [warehouse size](../user-guide/warehouses-overview.md): `XSMALL`, `SMALL`, `MEDIUM`, `LARGE`, `XLARGE`, `X2LARGE`. The maximum size is `X2LARGE`.

    Also supports the syntax: `XXLARGE`.

Default:
:   `MEDIUM`

## USER_TASK_MINIMUM_TRIGGER_INTERVAL_IN_SECONDS

Type:
:   Object (for databases, schemas, and tasks) — Can be set for Account » Database » Schema » Task

Data Type:
:   Number

Description:
:   Defines how frequently a [triggered task](../user-guide/tasks-triggered.md) can execute in seconds.
    If a task is triggered again while it’s running,
    Snowflake waits the specified number of seconds (after the previous run was scheduled) before starting the next run.

    If you set this parameter to more than 12 hours for a task, the task runs every 12 hours.

Values:
:   `10` - `604800` (1 week).

Default:
:   `30`

## USER_TASK_TIMEOUT_MS

Type:
:   Object (for databases, schemas, and tasks) — Can be set for Account » Database » Schema » Task

Data Type:
:   Number

Description:
:   Specifies the time limit on a single run of the task before it times out (in milliseconds).

    > **Note:**
    >
    > * Before you increase the time limit for tasks significantly, consider whether the SQL statements in the task definitions could be
    >   optimized (either by rewriting the statements or using stored procedures) or whether the warehouse size for tasks with user-managed
    >   compute resources should be increased.
    > * When both STATEMENT_TIMEOUT_IN_SECONDS and USER_TASK_TIMEOUT_MS are set, the timeout is the lowest non-zero value of the two parameters.
    > * When both STATEMENT_QUEUED_TIMEOUT_IN_SECONDS and USER_TASK_TIMEOUT_MS are set, the value of USER_TASK_TIMEOUT_MS takes precedence.
    >
    > For more information about USER_TASK_TIMEOUT_MS, see [CREATE TASK…USER_TASK_TIMEOUT](sql/create-task.md).

Values:
:   `0` - `604800000` (7 days). A value of `0` specifies that the maximum timeout value is enforced.

Default:
:   `3600000` (1 hour)

## USE_WORKSPACES_FOR_SQL

Type:
:   Account — Can be set only for Account

Data Type:
:   String (Constant)

Description:
:   Controls whether the Workspaces editor is the default SQL editing experience for the account.

Values:
:   `always`: Set the account-wide default editor to be Workspaces for all users.

    `never`: Revert to the previous editor and temporarily ignore any Snowflake-managed BCR that makes Workspaces the default.

    For more information, see [Workspaces](../user-guide/ui-snowsight/workspaces.md).

## WEEK_OF_YEAR_POLICY

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Number

Description:
:   Specifies how the weeks in a given year are computed.

Values:
:   `0`: The semantics used are equivalent to the ISO semantics, in which a week belongs to a given year if at least 4 days of that week are in that year.

    `1`: January 1 is included in the first week of the year and December 31 is included in the last week of the year.

Default:
:   `0` (ISO-like behavior)

> **Tip:**
>
> `1` is the most common value, based on feedback we’ve received. For more information, including examples, see [Calendar weeks and weekdays](functions-date-time.md).

## WEEK_START

Type:
:   Session — Can be set for Account » User » Session

Data Type:
:   Number

Description:
:   Specifies the first day of the week (used by week-related date functions).

Values:
:   `0`: Legacy Snowflake behavior is used (ISO-like semantics).

    `1` (Monday) to `7` (Sunday): All the week-related functions use weeks that start on the specified day of the week.

Default:
:   `0` (legacy Snowflake behavior)

> **Tip:**
>
> `1` is the most common value, based on feedback we’ve received. For more information, including examples, see [Calendar weeks and weekdays](functions-date-time.md).

---
title: PARTNER_CONTRACT_ITEMS view
source: https://docs.snowflake.com/en/sql-reference/billing/partner_contract_items.md
section: SQL General Reference
---

Schema:
:   [BILLING](../billing.md)

# PARTNER_CONTRACT_ITEMS view

The PARTNER_CONTRACT_ITEMS view in the BILLING schema provides contract information for the reseller’s customers.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the reseller’s organization. |
| SOLD_TO_ORGANIZATION_NAME | VARCHAR | Name of the organization of the reseller’s customer. |
| SOLD_TO_CUSTOMER_NAME | VARCHAR | Name of the reseller’s customer. |
| SOLD_TO_PO_NUMBER | VARCHAR | Purchase order number associated with the reseller’s sale to the customer. |
| SOLD_TO_CONTRACT_NUMBER | VARCHAR | Number associated with the customer’s contract with the reseller. |
| START_DATE | DATE | Start date for the customer’s contract with the reseller, or the date the CONTRACT_ITEM goes into effect. |
| END_DATE | DATE | End date of the customer’s contract with the reseller. |
| EXPIRATION_DATE | DATE | Expiration date of the customer’s contract with the reseller. |
| CONTRACT_ITEM | VARCHAR | One of capacity, additional capacity, or free usage. |
| CURRENCY | VARCHAR | Currency for the CONTRACT_ITEM. |
| AMOUNT | NUMBER(26,4) | Amount for the CONTRACT_ITEM. |
| CONTRACT_MODIFIED_DATE | DATE | Date (in UTC) the CONTRACT_ITEM was last modified. |

## Usage notes

* Latency for the view can be up to 24 hours.

---
title: PARTNER_RATE_SHEET_DAILY view
source: https://docs.snowflake.com/en/sql-reference/billing/partner_rate_sheet_daily.md
section: SQL General Reference
---

Schema:
:   [BILLING](../billing.md)

# PARTNER_RATE_SHEET_DAILY view

The PARTNER_RATE_SHEET_DAILY view in the BILLING schema returns the effective rates used for calculating usage in the organization currency. This usage is based on credits used for all Snowflake accounts in the organization of a reseller’s customer.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the reseller’s organization. |
| SOLD_TO_ORGANIZATION_NAME | VARCHAR | Name of the organization of the reseller’s customer. |
| SOLD_TO_CUSTOMER_NAME | VARCHAR | Name of the reseller’s customer. |
| SOLD_TO_PO_NUMBER | VARCHAR | Purchase order number associated with the reseller’s sale to the customer. |
| SOLD_TO_CONTRACT_NUMBER | VARCHAR | Number associated with the customer’s contract with the reseller. |
| DATE | DATE | Date (in UTC) for the effective price. |
| ACCOUNT_NAME | VARCHAR | Name of the customer’s account. |
| ACCOUNT_LOCATOR | VARCHAR | Locator of the customer’s account, which is used in the [legacy account identifier](../../user-guide/admin-account-identifier.md). |
| REGION | VARCHAR | Name of the region where the customer’s account is located. |
| SERVICE_LEVEL | VARCHAR | Service level of the customer’s Snowflake account (Standard, Enterprise, Business Critical, etc.). |
| USAGE_TYPE | VARCHAR | Type of usage, which can be one of Compute, Storage, Data Transfer, Materialized Views, etc. |
| BILLING_TYPE | VARCHAR | Indicates what is being charged or credited. Possible billing types include:   * `consumption` — Usage associated with compute credits, storage costs, and data transfer costs. * `rebate` — Usage covered by the credits awarded to the organization when it shared data with another organization. * `priority support` — Charges for priority support services. This charge is associated with a stipulation in a contract, not with an account. * `vps_deployment_fee` — Charges for a [Virtual Private Snowflake](../../user-guide/intro-editions.md) deployment. * `support_credit` — Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake. |
| RATING_TYPE | VARCHAR | Indicates how the usage in the record is rated, or priced. Possible values include:   * `compute` * `storage` * `other` |
| SERVICE_TYPE | VARCHAR | Type of usage, for example, `snowpipe` for usage related to the Snowpipe feature. |
| IS_ADJUSTMENT | BOOLEAN | Indicates whether the record is an adjustment to usage. |
| CURRENCY | VARCHAR | Currency of the EFFECTIVE_RATE. |
| EFFECTIVE_RATE | NUMBER(38, 2) | Rate after applying any applicable discounts. |

## Usage notes

* Latency for the view can be up to 24 hours.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments, mid-month contract amendments, or Snowflake account transfers from one organization to another.

---
title: PARTNER_REMAINING_BALANCE_DAILY view
source: https://docs.snowflake.com/en/sql-reference/billing/partner_remaining_balance_daily.md
section: SQL General Reference
---

Schema:
:   [BILLING](../billing.md)

# PARTNER_REMAINING_BALANCE_DAILY view

The PARTNER_REMAINING_BALANCE_DAILY view in the BILLING schema provides the daily remaining balance and the on-demand consumption daily for a reseller’s customers.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the reseller’s organization. |
| SOLD_TO_ORGANIZATION_NAME | VARCHAR | Name of the organization of the reseller’s customer. |
| SOLD_TO_CUSTOMER_NAME | VARCHAR | Name of the reseller’s customer. |
| SOLD_TO_PO_NUMBER | VARCHAR | Purchase order number associated with the reseller’s sale to the customer. |
| SOLD_TO_CONTRACT_NUMBER | VARCHAR | Number associated with the customer’s contract with the reseller. |
| DATE | DATE | Date of the FREE_USAGE_BALANCE or CAPACITY_BALANCE in UTC. |
| CURRENCY | VARCHAR | Currency of the FREE_USAGE_BALANCE, or CAPACITY_BALANCE, or ON_DEMAND_CONSUMPTION_BALANCE. |
| FREE_USAGE_BALANCE | NUMBER (38,2) | Amount of free usage in currency that is available for use as of the date. This is the end of day balance. |
| CAPACITY_BALANCE | NUMBER (38,2) | Amount of capacity in currency that is available for use as of the date. This is the end of day balance. |
| ON_DEMAND_CONSUMPTION_BALANCE | NUMBER (38,2) | Amount of consumption at on demand prices that will be invoiced given that all the free usage and capacity balances have been exhausted. This is a negative value (e.g. -250) until the invoice is paid. This is the end of day balance. |
| ROLLOVER_BALANCE | NUMBER (38,2) | Amount of rollover balance in currency that is available for use at the end of the date. At the end of a contract term, it is calculated as sum(AMOUNT) from the CONTRACT_ITEMS view - sum(USAGE_IN_CURRENCY) from the PARTNER_USAGE_IN_CURRENCY_DAILY view. |
| MARKETPLACE_CAPACITY_DRAWDOWN_BALANCE | NUMBER (38,2) | Amount of CAPACITY_BALANCE that is available for purchases in the Snowflake Marketplace. |

## Usage notes

* Latency for the view may be up to 24 hours.
* On demand consumption balance resets after month close (typically on the 3rd or 4th day of the next month) after it is invoiced and paid.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments or contract amendments between Snowflake organizations.

## Example query

To query the remaining balance for all your customers’ organizations on the last day of February 2022:

```sqlexample
SELECT date AS balancedate,
  *,
  capacity_balance + free_usage_balance + rollover_balance AS total_balance
  FROM snowflake.billing.partner_remaining_balance_daily
  WHERE date = '2022-02-28';
```

---
title: PARTNER_USAGE_IN_CURRENCY_DAILY view
source: https://docs.snowflake.com/en/sql-reference/billing/partner_usage_in_currency_daily.md
section: SQL General Reference
---

Schema:
:   [BILLING](../billing.md)

# PARTNER_USAGE_IN_CURRENCY_DAILY view

The PARTNER_USAGE_IN_CURRENCY_DAILY view in the BILLING schema provides the daily credit usage and daily currency usage for all of a
reseller’s customers.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| ORGANIZATION_NAME | VARCHAR | Name of the reseller’s organization. |
| SOLD_TO_ORGANIZATION_NAME | VARCHAR | Name of the organization of the reseller’s customer. |
| SOLD_TO_CUSTOMER_NAME | VARCHAR | Name of the reseller’s customer. |
| SOLD_TO_PO_NUMBER | VARCHAR | Purchase order number associated with the reseller’s sale to the customer (if available). |
| SOLD_TO_CONTRACT_NUMBER | VARCHAR | Number associated with the customer’s contract with the reseller. |
| ACCOUNT_NAME | VARCHAR | Name of the account where the usage was consumed. |
| ACCOUNT_LOCATOR | VARCHAR | Locator for the account where the usage was consumed. The locator is used in the [legacy account identifier](../../user-guide/admin-account-identifier.md). |
| REGION | VARCHAR | Name of the region where the account is located. |
| SERVICE_LEVEL | VARCHAR | Service level of the Snowflake account (Standard, Enterprise, Business Critical, etc.). |
| USAGE_DATE | DATE | Date (in UTC) in which the usage took place. |
| USAGE_TYPE | VARCHAR | Type of usage. For each usage type, `overage` is prepended when the usage was billed at on-demand pricing because it exceeded the capacity of the contract. Possible usage types include:  * `adj for incl cloud services` — Refer to [Understanding billing for cloud services usage](../../user-guide/cost-understanding-compute.md). * `automatic clustering` — Refer to [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `cloud services` — Refer to [Cloud service credit usage](../../user-guide/cost-understanding-compute.md). * `compute` — Refer to [Virtual warehouse credit usage](../../user-guide/cost-understanding-compute.md). Does not indicate usage of serverless or cloud services compute. * `data transfer` — Refer to [Understanding data transfer cost](../../user-guide/cost-understanding-data-transfer.md). * `materialized views` — Refer to [Working with Materialized Views](../../user-guide/views-materialized.md). * `priority support` — Indicates how much was charged for priority support services in a given month. This charge is associated with a stipulation in a contract, not with an account. * `serverless tasks` — Refer to [Introduction to tasks](../../user-guide/tasks-intro.md). * `snowpipe` — Refer to [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `snowpipe streaming` — Refer to [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `storage` — Refer to [Understanding storage cost](../../user-guide/cost-understanding-data-storage.md). * `support credit` — Indicates that Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake. Represents credits applied to the account for a given month. |
| CURRENCY | VARCHAR | Currency associated with the usage. |
| USAGE | NUMBER (38,6) | Total number of credits charged for the USAGE_TYPE for usage on the USAGE_DATE. |
| USAGE_IN_CURRENCY | NUMBER (38,6) | Total amount charged for the USAGE_TYPE for USAGE on the USAGE_DATE. |
| BALANCE_SOURCE | VARCHAR | Source of the funds used to pay for the daily usage. Can be one of the following:   * `capacity` — Usage paid with credits remaining on an organization’s capacity contract. * `rollover` — Usage paid with rollover credits. When an organization renews a capacity contract, unused credits are added to the   balance of the new contract as rollover credits. * `free usage` — Usage covered by the free credits provided to the organization. * `overage` — Usage that was paid at on-demand pricing, which occurs when an organization has exhausted its capacity, rollover,   and free credits. * `rebate` — Usage covered by the credits awarded to the organization of the reseller’s customer when it shared data with another   organization. |
| BILLING_TYPE | VARCHAR | Indicates what is being charged or credited. Possible billing types include:   * `consumption` — Usage associated with compute credits, storage costs, and data transfer costs. * `rebate` — Usage covered by the credits awarded to the organization when it shared data with another organization. * `priority support` — Charges for priority support services. This charge is associated with a stipulation in a contract, not with an account. * `vps_deployment_fee` — Charges for a [Virtual Private Snowflake](../../user-guide/intro-editions.md) deployment. * `support_credit` — Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake. |
| RATING_TYPE | VARCHAR | Indicates how the usage in the record is rated, or priced. Possible values include:   * `compute` * `data_transfer` * `storage` * `other` |
| SERVICE_TYPE | VARCHAR | Type of usage. The following list includes many, but not all, of the possible service types:   * `ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `ARCHIVE_STORAGE_WRITE` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `AUTOMATIC_CLUSTERING` — See [Automatic Clustering](../../user-guide/tables-auto-reclustering.md). * `CLOUD_SERVICES` — See [Cloud service credit usage](../../user-guide/cost-understanding-compute.md). * `COPY_FILES` — See [COPY FILES](../sql/copy-files.md). * `DATA_TRANSFER` — See [Understanding data transfer cost](../../user-guide/cost-understanding-data-transfer.md). * `EGRESS_COST_OPTIMIZER` — See [Optimizing data transfer costs with Egress Cost Optimizer](../../collaboration/provider-listings-auto-fulfillment-eco.md). * `INTERNAL_DATA_TRANSFER` — See costs associated with [Snowpark Container Services](../../developer-guide/snowpark-container-services/accounts-orgs-usage-views.md). * `LOGGING` — See [Logging, tracing, and metrics](../../developer-guide/logging-tracing/logging-tracing-overview.md). * `MATERIALIZED_VIEW` — See [Working with Materialized Views](../../user-guide/views-materialized.md). * `OUTBOUND_PRIVATELINK_DATA_PROCESSED` — See [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md). * `OUTBOUND_PRIVATELINK_ENDPOINTS` — See [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md). * `REPLICATION` — See [Introduction to replication and failover across multiple accounts](../../user-guide/account-replication-intro.md). * `QUERY_ACCELERATION` — See [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md) * `SEARCH_OPTIMIZATION` — See [Search optimization service](../../user-guide/search-optimization-service.md) * `SENSITIVE_DATA_CLASSIFICATION` — See [Introduction to sensitive data classification](../../user-guide/classify-intro.md). * `SERVERLESS_ALERTS` — See [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md). * `SERVERLESS_TASK` — See [Introduction to tasks](../../user-guide/tasks-intro.md). * `SNOWPIPE` — See [Snowpipe](../../user-guide/data-load-snowpipe-intro.md). * `SNOWPIPE_STREAMING` — See [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). * `STORAGE` — See [Understanding storage cost](../../user-guide/cost-understanding-data-storage.md). * `STORAGE_LIFECYCLE_POLICY_EXECUTION` — See [Billing for storage lifecycle policies](../../user-guide/storage-management/storage-lifecycle-policies-billing.md). * `TRUST_CENTER` — See [Trust Center](../../user-guide/trust-center/overview.md). * `WAREHOUSE_METERING` — See [Virtual warehouse credit usage](../../user-guide/cost-understanding-compute.md). Does not indicate usage of serverless or cloud services compute. |
| IS_ADJUSTMENT | BOOLEAN | Indicates whether the record is an adjustment to usage. |

## Usage notes

* Latency for the view can be up to 24 hours.
* Until month close, data for a given day in a month can change to account for any end-of-month adjustments, contract amendments, or Snowflake account transfers between organizations.

## Example query

To query the usage in credits and currency for all Snowflake accounts under your customers’ organizations for the month of January 2022:

```sqlexample
SELECT * FROM snowflake.billing.partner_usage_in_currency_daily
  WHERE MONTH(usage_date) = 01
    AND YEAR(usage_date) = 2022
  ORDER BY sold_to_contract_number, usage_date ASC;
```

---
title: PIVOT
source: https://docs.snowflake.com/en/sql-reference/constructs/pivot.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# PIVOT

Rotates a table by turning the unique values from one column in the input expression into multiple columns and aggregating results
where required on any remaining column values. In a query, it is specified in the [FROM](from.md) clause after
the table name or subquery.

PIVOT supports the following [built-in aggregate functions](../functions-aggregation.md):

* [AVG](../functions/avg.md)
* [COUNT](../functions/count.md)
* [MAX](../functions/max.md)
* [MIN](../functions/min.md)
* [SUM](../functions/sum.md)

PIVOT can be used to transform a narrow table (for example, `empid`, `month`, `sales`) into a wider table
(for example, `empid`, `jan_sales`, `feb_sales`, `mar_sales`).

See also:
:   [UNPIVOT](unpivot.md)

## Syntax

```sqlsyntax
SELECT ...
FROM ...
   PIVOT ( <aggregate_function> ( <pivot_column> ) [ [ AS ] <alias> ]
            FOR <value_column> IN (
              <pivot_value_1> [ [ AS ] <alias> ] [ , <pivot_value_2> [ [ AS ] <alias> ] ... ]
              | ANY [ ORDER BY ... ]
              | <subquery>
            )
            [ DEFAULT ON NULL (<value>) ]
         )

[ ... ]
```

## Parameters

`aggregate_function`
:   The aggregate function for combining the grouped values from `pivot_column`.

`pivot_column [ [ AS ] alias ]`
:   The column from the source table or subquery that will be aggregated.

    The optional `[ AS ] alias` clause specifies the alias to use for the aggregate in the result of
    the PIVOT operation. An underscore and then the alias is appended to each pivot column name. For example,
    if the `alias` is `total`, then the pivot operation appends `_TOTAL` to the pivot column names.
    The AS keyword is optional.

`value_column`
:   The column from the source table or subquery that contains the values from which column names will be generated.

`pivot_value_N [ [ AS ] alias ]`
:   A list of values for the pivot column to pivot into headings in the query results.

    The optional `[ AS ] alias` clause specifies the alias to use for the value in the result of
    the PIVOT operation. The alias replaces the value.

`ANY [ ORDER BY ... ]`
:   Pivot on all distinct values of the pivot column. To control the order of the pivot columns in the output,
    specify an [ORDER BY](order-by.md) clause after the ANY keyword. If the pivot column contains NULLs,
    then NULL is also treated as a pivot value.

`subquery`
:   Pivot on all values found in the subquery. The DISTINCT keyword is required if the subquery includes an
    ORDER BY clause. The subquery must be an uncorrelated subquery that returns a single column. Pivoting is
    performed on all distinct values returned by the subquery. For information about uncorrelated subqueries,
    see [Working with Subqueries](../../user-guide/querying-subqueries.md).

`DEFAULT ON NULL` (`value`)
:   Replace all NULL values in the pivot result with the specified default value. The default value can be any scalar
    expression that does not depend on the pivot and aggregation column.

## Usage notes

* Snowflake supports *dynamic pivot*. A dynamic pivot query uses the ANY keyword or a subquery in the PIVOT
  subclause instead of specifying the pivot values explicitly.
* When dynamic pivot is used in a [view](../../user-guide/views-introduction.md) definition, queries on the view
  might fail if the underlying data changes so that the pivot output columns are changed.
* Dynamic pivot isn’t supported in the body of a stored procedure or user-defined function (UDF).
* A pivot query that doesn’t use dynamic pivot can return output with duplicate columns. We recommend avoiding
  output with duplicate columns. A dynamic pivot query deduplicates duplicate columns.
* A pivot query that doesn’t use dynamic pivot might fail if it attempts to
  [CAST](../functions/cast.md) a [VARIANT](../data-types-semistructured.md) column to a different
  data type. Dynamic pivot queries don’t have this limitation.
* Currently, the PIVOT semantic doesn’t allow multiple aggregations, but you can achieve similar results by using
  PIVOT with the [UNION operator](../operators-query.md). For an example, see
  Dynamic pivot with multiple aggregations using UNION.

## Examples

The PIVOT examples use the following `quarterly_sales` table:

```sqlexample
CREATE OR REPLACE TABLE quarterly_sales(
  empid INT,
  amount INT,
  quarter TEXT)
  AS SELECT * FROM VALUES
    (1, 10000, '2023_Q1'),
    (1, 400, '2023_Q1'),
    (2, 4500, '2023_Q1'),
    (2, 35000, '2023_Q1'),
    (1, 5000, '2023_Q2'),
    (1, 3000, '2023_Q2'),
    (2, 200, '2023_Q2'),
    (2, 90500, '2023_Q2'),
    (1, 6000, '2023_Q3'),
    (1, 5000, '2023_Q3'),
    (2, 2500, '2023_Q3'),
    (2, 9500, '2023_Q3'),
    (3, 2700, '2023_Q3'),
    (1, 8000, '2023_Q4'),
    (1, 10000, '2023_Q4'),
    (2, 800, '2023_Q4'),
    (2, 4500, '2023_Q4'),
    (3, 2700, '2023_Q4'),
    (3, 16000, '2023_Q4'),
    (3, 10200, '2023_Q4');
```

The following examples use PIVOT:

* Dynamic pivot on all distinct column values automatically
* Dynamic pivot on column values using a subquery
* Dynamic pivot with multiple aggregations using UNION
* Dynamic pivot with a join query
* Pivot on a specified list of column values for the pivot column
* Pivot with a default value for NULL values
* Pivot examples that involve multiple columns

### Dynamic pivot on all distinct column values automatically

Given the table `quarterly_sales`, pivot on the `amount` column using the ANY keyword to sum the
total sales per employee for all of the distinct quarters, and specify ORDER BY so that the pivot columns
are in order:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (ANY ORDER BY quarter))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-------+-----------+-----------+-----------+-----------|
|     1 |     10400 |      8000 |     11000 |     18000 |
|     2 |     39500 |     90700 |     12000 |      5300 |
|     3 |      NULL |      NULL |      2700 |     28900 |
+-------+-----------+-----------+-----------+-----------+
```

The following example is the same as the previous example, but it appends the alias `_TOTAL` to
each pivot column name:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) AS total FOR quarter IN (ANY ORDER BY quarter))
  ORDER BY empid;
```

```output
+-------+-----------------+-----------------+-----------------+-----------------+
| EMPID | '2023_Q1_TOTAL' | '2023_Q2_TOTAL' | '2023_Q3_TOTAL' | '2023_Q4_TOTAL' |
|-------+-----------------+-----------------+-----------------+-----------------|
|     1 |           10400 |            8000 |           11000 |           18000 |
|     2 |           39500 |           90700 |           12000 |            5300 |
|     3 |            NULL |            NULL |            2700 |           28900 |
+-------+-----------------+-----------------+-----------------+-----------------+
```

### Dynamic pivot on column values using a subquery

Assume that in addition to the `quarterly_sales` table, an `ad_campaign_types_by_quarter`
table tracks the types of advertisements run during particular quarters. This table has the following
structure and data:

```sqlexample
CREATE OR REPLACE TABLE ad_campaign_types_by_quarter(
  quarter VARCHAR,
  television BOOLEAN,
  radio BOOLEAN,
  print BOOLEAN)
  AS SELECT * FROM VALUES
    ('2023_Q1', TRUE, FALSE, FALSE),
    ('2023_Q2', FALSE, TRUE, TRUE),
    ('2023_Q3', FALSE, TRUE, FALSE),
    ('2023_Q4', TRUE, FALSE, TRUE);
```

You can use a subquery in a pivot query to determine the sum of the sales in the quarters that had
specific ad campaigns. For example, the following pivot query returns data only for quarters with
television ad campaigns:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (
      SELECT DISTINCT quarter
        FROM ad_campaign_types_by_quarter
        WHERE television = TRUE
        ORDER BY quarter))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q4' |
|-------+-----------+-----------|
|     1 |     10400 |     18000 |
|     2 |     39500 |      5300 |
|     3 |      NULL |     28900 |
+-------+-----------+-----------+
```

### Dynamic pivot with multiple aggregations using UNION

You can use the [UNION operator](../operators-query.md) to show multiple aggregations in
a single result set. This example uses dynamic pivot and the UNION operator to show the following
information for each employee in each quarter:

* The average amount of a sale, using the [AVG](../functions/avg.md) function.
* The sale with the highest value, using the [MAX](../functions/max.md) function.
* The sale with the lowest value, using the [MIN](../functions/min.md) function.
* The number of sales, using the [COUNT](../functions/count.md) function.
* The total amount for all sales, using the [SUM](../functions/sum.md) function.

Run the query:

```sqlexample
SELECT 'Average sale amount' AS aggregate, *
  FROM quarterly_sales
    PIVOT(AVG(amount) FOR quarter IN (ANY ORDER BY quarter))
UNION
SELECT 'Highest value sale' AS aggregate, *
  FROM quarterly_sales
    PIVOT(MAX(amount) FOR quarter IN (ANY ORDER BY quarter))
UNION
SELECT 'Lowest value sale' AS aggregate, *
  FROM quarterly_sales
    PIVOT(MIN(amount) FOR quarter IN (ANY ORDER BY quarter))
UNION
SELECT 'Number of sales' AS aggregate, *
  FROM quarterly_sales
    PIVOT(COUNT(amount) FOR quarter IN (ANY ORDER BY quarter))
UNION
SELECT 'Total amount' AS aggregate, *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (ANY ORDER BY quarter))
ORDER BY aggregate, empid;
```

```output
+---------------------+-------+--------------+--------------+--------------+--------------+
| AGGREGATE           | EMPID |    '2023_Q1' |    '2023_Q2' |    '2023_Q3' |    '2023_Q4' |
|---------------------+-------+--------------+--------------+--------------+--------------|
| Average sale amount |     1 |  5200.000000 |  4000.000000 |  5500.000000 |  9000.000000 |
| Average sale amount |     2 | 19750.000000 | 45350.000000 |  6000.000000 |  2650.000000 |
| Average sale amount |     3 |         NULL |         NULL |  2700.000000 |  9633.333333 |
| Highest value sale  |     1 | 10000.000000 |  5000.000000 |  6000.000000 | 10000.000000 |
| Highest value sale  |     2 | 35000.000000 | 90500.000000 |  9500.000000 |  4500.000000 |
| Highest value sale  |     3 |         NULL |         NULL |  2700.000000 | 16000.000000 |
| Lowest value sale   |     1 |   400.000000 |  3000.000000 |  5000.000000 |  8000.000000 |
| Lowest value sale   |     2 |  4500.000000 |   200.000000 |  2500.000000 |   800.000000 |
| Lowest value sale   |     3 |         NULL |         NULL |  2700.000000 |  2700.000000 |
| Number of sales     |     1 |     2.000000 |     2.000000 |     2.000000 |     2.000000 |
| Number of sales     |     2 |     2.000000 |     2.000000 |     2.000000 |     2.000000 |
| Number of sales     |     3 |     0.000000 |     0.000000 |     1.000000 |     3.000000 |
| Total amount        |     1 | 10400.000000 |  8000.000000 | 11000.000000 | 18000.000000 |
| Total amount        |     2 | 39500.000000 | 90700.000000 | 12000.000000 |  5300.000000 |
| Total amount        |     3 |         NULL |         NULL |  2700.000000 | 28900.000000 |
+---------------------+-------+--------------+--------------+--------------+--------------+
```

### Dynamic pivot with a join query

To pivot in a query with a join, you can use a [common table expression (CTE)](../../user-guide/queries-cte.md)
for the pivot query.

For example, assume a simple table maps employees to managers:

```sqlexample
CREATE OR REPLACE TABLE emp_manager(
    empid INT,
    managerid INT)
  AS SELECT * FROM VALUES
    (1, 7),
    (2, 8),
    (3, 9);

SELECT * from emp_manager;
```

```output
+-------+-----------+
| EMPID | MANAGERID |
|-------+-----------|
|     1 |         7 |
|     2 |         8 |
|     3 |         9 |
+-------+-----------+
```

Run a query that joins the `emp_manager` table and the `quarterly_sales` table and pivots on the
`amount` column in the `quarterly_sales` table:

```sqlexample
WITH
  src AS
  (
    SELECT *
      FROM quarterly_sales
        PIVOT(SUM(amount) FOR quarter IN (ANY ORDER BY quarter))
  )
SELECT em.managerid, src.*
  FROM emp_manager em
  JOIN src ON em.empid = src.empid
  ORDER BY empid;
```

```output
+-----------+-------+-----------+-----------+-----------+-----------+
| MANAGERID | EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-----------+-------+-----------+-----------+-----------+-----------|
|         7 |     1 |     10400 |      8000 |     11000 |     18000 |
|         8 |     2 |     39500 |     90700 |     12000 |      5300 |
|         9 |     3 |      NULL |      NULL |      2700 |     28900 |
+-----------+-------+-----------+-----------+-----------+-----------+
```

### Pivot on a specified list of column values for the pivot column

Given the table `quarterly_sales`, pivot on the `amount` column to sum the
total sales per employee for the specified quarters:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (
      '2023_Q1',
      '2023_Q2',
      '2023_Q3'))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' |
|-------+-----------+-----------+-----------|
|     1 |     10400 |      8000 |     11000 |
|     2 |     39500 |     90700 |     12000 |
|     3 |      NULL |      NULL |      2700 |
+-------+-----------+-----------+-----------+
```

You can pivot on all of the quarters in the `amount` column by running the following
query:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (
      '2023_Q1',
      '2023_Q2',
      '2023_Q3',
      '2023_Q4'))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-------+-----------+-----------+-----------+-----------|
|     1 |     10400 |      8000 |     11000 |     18000 |
|     2 |     39500 |     90700 |     12000 |      5300 |
|     3 |      NULL |      NULL |      2700 |     28900 |
+-------+-----------+-----------+-----------+-----------+
```

You can modify the column names in the output with the AS clause. For example, to shorten the column names and
show them without quotes, run the following query:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (
      '2023_Q1' AS q1,
      '2023_Q2' AS q2,
      '2023_Q3' AS q3,
      '2023_Q4' AS q4))
  ORDER BY empid;
```

```output
+-------+-------+-------+-------+-------+
| EMPID |    Q1 |    Q2 |    Q3 |    Q4 |
|-------+-------+-------+-------+-------|
|     1 | 10400 |  8000 | 11000 | 18000 |
|     2 | 39500 | 90700 | 12000 |  5300 |
|     3 |  NULL |  NULL |  2700 | 28900 |
+-------+-------+-------+-------+-------+
```

### Pivot with a default value for NULL values

If the query returns NULL values, you can replace them with a default value by using DEFAULT ON NULL.
For example, you can use dynamic pivot and replace the NULL values with a default value of `0` by
running the following query:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount) FOR quarter IN (ANY ORDER BY quarter)
      DEFAULT ON NULL (0))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-------+-----------+-----------+-----------+-----------|
|     1 |     10400 |      8000 |     11000 |     18000 |
|     2 |     39500 |     90700 |     12000 |      5300 |
|     3 |         0 |         0 |      2700 |     28900 |
+-------+-----------+-----------+-----------+-----------+
```

You can also use DEFAULT ON NULL with a specified list of columns:

```sqlexample
SELECT *
  FROM quarterly_sales
    PIVOT(SUM(amount)
      FOR quarter IN (
        '2023_Q1',
        '2023_Q2')
      DEFAULT ON NULL (0))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' |
|-------+-----------+-----------|
|     1 |     10400 |      8000 |
|     2 |     39500 |     90700 |
|     3 |         0 |         0 |
+-------+-----------+-----------+
```

### Pivot examples that involve multiple columns

Pivot queries can work with multiple columns. Before running these examples, add a column to the `quarterly_sales`
table and populate the column with random values.

First, add a column that tracks the discount applied to each sale to the `quarterly_sales` table:

```sqlexample
ALTER TABLE quarterly_sales ADD COLUMN discount_percent INT DEFAULT 0;
```

Populate the new column with random values between `0` and `5`, which specify the discount
percentage for each sale:

```sqlexample
UPDATE quarterly_sales SET discount_percent = UNIFORM(0, 5, RANDOM());
```

Query the table to show the new column with the random values added:

```sqlexample
SELECT * FROM quarterly_sales;
```

```output
+-------+--------+---------+------------------+
| EMPID | AMOUNT | QUARTER | DISCOUNT_PERCENT |
|-------+--------+---------+------------------|
|     1 |  10000 | 2023_Q1 |                0 |
|     1 |    400 | 2023_Q1 |                1 |
|     2 |   4500 | 2023_Q1 |                4 |
|     2 |  35000 | 2023_Q1 |                2 |
|     1 |   5000 | 2023_Q2 |                2 |
|     1 |   3000 | 2023_Q2 |                1 |
|     2 |    200 | 2023_Q2 |                2 |
|     2 |  90500 | 2023_Q2 |                1 |
|     1 |   6000 | 2023_Q3 |                1 |
|     1 |   5000 | 2023_Q3 |                3 |
|     2 |   2500 | 2023_Q3 |                1 |
|     2 |   9500 | 2023_Q3 |                3 |
|     3 |   2700 | 2023_Q3 |                1 |
|     1 |   8000 | 2023_Q4 |                1 |
|     1 |  10000 | 2023_Q4 |                4 |
|     2 |    800 | 2023_Q4 |                3 |
|     2 |   4500 | 2023_Q4 |                5 |
|     3 |   2700 | 2023_Q4 |                3 |
|     3 |  16000 | 2023_Q4 |                0 |
|     3 |  10200 | 2023_Q4 |                1 |
+-------+--------+---------+------------------+
```

Now that the new column is added and populated, run the following examples:

* Exclude columns from a pivot query with a CTE
* Run a multidimensional pivot query

#### Exclude columns from a pivot query with a CTE

You can use a [common table expression (CTE)](../../user-guide/queries-cte.md) to exclude columns
from a pivot query.

The following example uses a CTE to exclude the `discount_percent` column from a pivot query:

```sqlexample
WITH
  sales_without_discount AS
    (SELECT * EXCLUDE(discount_percent) FROM quarterly_sales)
SELECT *
  FROM sales_without_discount
    PIVOT(SUM(amount) FOR quarter IN (ANY ORDER BY quarter))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-------+-----------+-----------+-----------+-----------|
|     1 |     10400 |      8000 |     11000 |     18000 |
|     2 |     39500 |     90700 |     12000 |      5300 |
|     3 |      NULL |      NULL |      2700 |     28900 |
+-------+-----------+-----------+-----------+-----------+
```

You can use a CTE to exclude the `amount` column and show the average discount that
each employee gave in each quarter:

```sqlexample
WITH
  sales_without_amount AS
    (SELECT * EXCLUDE(amount) FROM quarterly_sales)
SELECT *
  FROM sales_without_amount
    PIVOT(AVG(discount_percent) FOR quarter IN (ANY ORDER BY quarter))
  ORDER BY empid;
```

```output
+-------+-----------+-----------+-----------+-----------+
| EMPID | '2023_Q1' | '2023_Q2' | '2023_Q3' | '2023_Q4' |
|-------+-----------+-----------+-----------+-----------|
|     1 |  0.500000 |  1.500000 |  2.000000 |  2.500000 |
|     2 |  3.000000 |  1.500000 |  2.000000 |  4.000000 |
|     3 |      NULL |      NULL |  1.000000 |  1.333333 |
+-------+-----------+-----------+-----------+-----------+
```

#### Run a multidimensional pivot query

A multidimensional pivot query pivots on more than one column. This example pivots on the `amount`
column and the `discount_percentage` column. The query returns the sum of all sales by all employees
each quarter and the maximum discount percentage for all sales each quarter.

In the query, the SELECT list uses `$col_position` parameters to run [SUM](../functions/sum.md)
and [MAX](../functions/max.md) functions on the returned columns in order, and to name the returned
columns. A subquery in the FROM clause supplies the data for the pivot operations. Because the output shows sales
results for all employees, the subquery doesn’t include the `empid` column.

```sqlexample
SELECT SUM($1) AS q1_sales_total,
       SUM($2) AS q2_sales_total,
       SUM($3) AS q3_sales_total,
       SUM($4) AS q4_sales_total,
       MAX($5) AS q1_maximum_discount,
       MAX($6) AS q2_maximum_discount,
       MAX($7) AS q3_maximum_discount,
       MAX($8) AS q4_maximum_discount
  FROM
    (SELECT amount,
            quarter AS quarter_amount,
            quarter AS quarter_discount,
            discount_percent
      FROM quarterly_sales)
  PIVOT (
    SUM(amount)
    FOR quarter_amount IN (
      '2023_Q1',
      '2023_Q2',
      '2023_Q3',
      '2023_Q4'))
  PIVOT (
    MAX(discount_percent)
    FOR quarter_discount IN (
      '2023_Q1',
      '2023_Q2',
      '2023_Q3',
      '2023_Q4'));
```

```output
+----------------+----------------+----------------+----------------+---------------------+---------------------+---------------------+---------------------+
| Q1_SALES_TOTAL | Q2_SALES_TOTAL | Q3_SALES_TOTAL | Q4_SALES_TOTAL | Q1_MAXIMUM_DISCOUNT | Q2_MAXIMUM_DISCOUNT | Q3_MAXIMUM_DISCOUNT | Q4_MAXIMUM_DISCOUNT |
|----------------+----------------+----------------+----------------+---------------------+---------------------+---------------------+---------------------|
|          49900 |          98700 |          25700 |          52200 |                   4 |                   2 |                   3 |                   5 |
+----------------+----------------+----------------+----------------+---------------------+---------------------+---------------------+---------------------+
```

---
title: Planning an external function for AWS
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-planning.md
section: SQL General Reference
---

# Planning an external function for AWS

This topic helps you prepare to create an external function for AWS (Amazon Web Services) using either the AWS Management Console
or an AWS CloudFormation template provided by Snowflake.

## Prerequisites

These instructions assume that you are an experienced AWS Management Console user.

You need:

* An account with AWS, including privileges to:

  + Create AWS roles via IAM (identity and access management).
  + Create AWS Lambda Functions.
  + Create an API Gateway endpoint.
* A Snowflake account in which you have ACCOUNTADMIN privileges or a role with the CREATE INTEGRATION privilege.
* If you plan to use a private endpoint, you need your Virtual Private Cloud (VPC) ID.
  (You must use a VPC ID, not a VPC Endpoint ID. VPC Endpoint IDs can change over time.)

  If you do not already have your VPC ID, you can find it by executing the following command in the Snowflake web interface:

  > ```sqlexample
  > select system$get_snowflake_platform_info();
  > ```
  >
  > The output will resemble:
  >
  > ```output
  > {
  >   "snowflake-vpc-id": ["vpc-12345678"],
  >   "snowflake-egress-vpc-ids": [
  >     ...
  >    {
  >      "id": "vpc-12345678",
  >      "expires": "2025-03-01T00:00:00",
  >      "purpose": "generic"
  >    },
  >    ...
  >    ]
  >  }
  > ```

  From the function output, for each property identified with “purpose”: “generic”, record the corresponding VPC ID(s).

  After you decide whether to create your external function by using the AWS Management Console or an
  AWS CloudFormation template, copy the VPC IDs to the appropriate tracking worksheet:

  + Management Console worksheet.
  + CloudFormation template worksheet.

## Choosing your endpoint type: Regional endpoint vs. Private endpoint

You access a proxy service (such as Amazon API Gateway) via a URI, often referred to as an *endpoint*.
The instructions for creating your Amazon API Gateway ask you to choose one of the following types of endpoints:

* A regional endpoint.
* A private endpoint.

The following information can help you choose the type of endpoint.

A regional endpoint can be accessed across AWS regions, or even across cloud platforms.
Your Snowflake instance, your proxy service, and your remote service can all be in different regions or even on
different cloud platforms. For example, a Snowflake instance running on Azure could send requests to an Amazon API Gateway
regional endpoint, which in turn could forward data to a remote service running on GCP.

A private endpoint can be configured to allow access only within a region. For example, you can configure a private endpoint
to allow access from only a Snowflake VPC (Virtual Private Cloud) in the same AWS region. Communication between a Snowflake VPC
and a private endpoint uses AWS PrivateLink.

For more details about the types of endpoints on AWS, see:

* [Amazon API Gateway concepts](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-basic-concept.html)
* [Amazon API Gateway endpoint types](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-api-endpoint-types.html)

If you want to use a private endpoint, and you are not sure which region you are using, you can look up your
region by doing either of the following:

* Call the SQL function `CURRENT_REGION()` (e.g. `SELECT CURRENT_REGION()`).
* Check your Snowflake account hostname, which normally indicates the cloud provider and region. For more
  information about account hostnames, regions, and cloud providers, see [Supported cloud regions](../user-guide/intro-regions.md).

To use a private endpoint, your account must meet the following requirements:

* Business Critical (or higher) edition of Snowflake.

## Choosing the method for creating the external function

Snowflake provides instructions for two ways to create an external function on AWS:

* AWS Management Console web interface
* AWS CloudFormation template provided by Snowflake

### AWS Management Console

You can use the [AWS Management Console](https://aws.amazon.com/console/) to create a Lambda Function (as the remote service)
and an Amazon API Gateway instance (as the proxy service). If you choose this method, you also use the AWS Management Console to
configure security-related settings.

The instructions for creating an external function using the AWS Management Console include a sample Lambda Function and details
for creating a basic API Gateway:

* First-time users can use the instructions with little or no modification.
* Experienced users can use the instructions and sample Lambda Function as a starting point for creating a custom Lambda Function
  and a custom-configured API Gateway.

### AWS CloudFormation template

The CloudFormation template performs both of the following steps in creating an external function:

* Creating the remote service (an AWS Lambda Function).
* Creating and configuring the proxy service (an Amazon API Gateway).

The template also:

* Creates two IAM roles (one for the Lambda Function and one for the API Gateway).
* Configures a resource policy for the API Gateway.

## Preparing to use the AWS Management Console

### Create a worksheet for tracking required information

As you create your external function, you should record specific information that you enter (e.g. the Resource Invocation URL)
so that you can use that information in subsequent steps. The worksheet below helps you track this information.

```none
===========================================================================
================ Tracking Worksheet: AWS Management Console ===============
===========================================================================

****** Step 1: Information about the Lambda Function (remote service) *****

Your AWS Account ID: ______________________________________________________

Lambda Function Name: _____________________________________________________

******** Step 2: Information about the API Gateway (proxy Service) ********

New IAM Role Name: ________________________________________________________

New IAM Role ARN: _________________________________________________________

Snowflake VPC ID (optional): ______________________________________________

New API Name: _____________________________________________________________

API Gateway Resource Name: ________________________________________________

Resource Invocation URL: __________________________________________________

Method Request ARN: _______________________________________________________

*** Step 3: Information about the API Integration and External Function ***

API Integration Name: _____________________________________________________

API_AWS_IAM_USER_ARN: _____________________________________________________

API_AWS_EXTERNAL_ID: ______________________________________________________

External Function Name: ___________________________________________________
```

## Preparing to use an AWS CloudFormation template

### Download the template

The template is available for download from the
[deployment templates directory](https://github.com/Snowflake-Labs/sfguide-external-functions-examples/tree/main/DeploymentTemplates/aws/BasicSetup.yaml)
in the Snowflake repository in GitHub.

### Create a worksheet for tracking required information

As you create your external function, you should record specific information that you enter (e.g. the Resource Invocation URL)
so that you can use that information in subsequent steps. The worksheet below helps you track this information.

```none
===========================================================================
================== Tracking Worksheet: CloudFormation Template ============
===========================================================================

New IAM Role Name: ________________________________________________________

New IAM Role ARN: _________________________________________________________

Resource Invocation URL: __________________________________________________

API_AWS_IAM_USER_ARN: _____________________________________________________

API_AWS_EXTERNAL_ID: ______________________________________________________
```

## Additional resources for building external functions on AWS

When you are ready to create your own remote service for your own external function, you might want to look at the
examples of remote services based on Lambda Functions that are available in
[The Snowflake Labs.](https://github.com/Snowflake-Labs/sfguide-external-functions-examples)

## Next step

AWS Management Console:
:   [Step 1: Create the remote service (AWS Lambda function) in the Management Console](external-functions-creating-aws-ui-remote-service.md)

AWS CloudFormation template:
:   [Step 1: Use the template to create the remote service (AWS Lambda function) and proxy service (API Gateway)](external-functions-creating-aws-template-services.md)

---
title: Planning an external function for Azure
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-planning.md
section: SQL General Reference
---

# Planning an external function for Azure

This topic helps you prepare to create an external function for Microsoft Azure using either the Azure Portal or an ARM (Azure Resource
Manager) template provided by Snowflake.

## Prerequisites

These instructions assume that you are an experienced Azure Portal user.

To create an external function for Azure, you must have the following:

* An Azure AD (Active Directory) tenant.
* An account in that Azure AD tenant. The account must have privileges to:

  + Create an Azure Functions app.
  + Create a service endpoint using Azure API Management service.
  + Register an Azure AD Application.
* A Snowflake account in which you have the ACCOUNTADMIN role or a role with the CREATE INTEGRATION privilege.

In addition, you should already have an Azure AD Tenant ID.

The Azure AD Tenant ID is a [UUID](https://en.wikipedia.org/wiki/Universally_unique_identifier) , which typically is formatted to look
similar to `12345678-abcd-1234-efab-123456789012`, where each non-dash character is a hexadecimal digit.

If you do not already know your Azure AD tenant ID, you can find it using the following procedure:

1. Log into the Azure Portal (<http://portal.azure.com>).
2. In the Azure services icons near the top of the page, click on Azure Active Directory.
3. In the menu on the left-hand side, look for the section titled Manage, then click on Properties under that.

The Azure AD tenant ID is displayed in the Tenant ID field.

## Public Internet or private connectivity

When you call an external function, the connectivity from Snowflake to the external service can go through the public Internet or use
[Azure Private Link](https://learn.microsoft.com/en-us/azure/private-link/private-link-overview) (Microsoft documentation). The choice to
use Azure Private Link depends on your security requirements in terms of how you need to connect to the external service. Using Azure
Private Link can help you meet your security requirements.

If you choose to use the public Internet, finish the remainder of this topic and follow the numbered topics for creating an external
function on Azure using the Azure Portal or the ARM template.

If you choose to use Azure Private Link, the configuration process requires using the ACCOUNTADMIN role and a Snowflake account that is
Business Critical Edition (or higher). There is an additional billing charge to use Azure Private Link. Finish the remainder of this topic and review these topics for more information:

* [Private connectivity for outbound network traffic](../user-guide/private-connectivity-outbound.md)
* [Manage private connectivity endpoints: Azure](../user-guide/private-manage-endpoints-azure.md)
* [Private connectivity with external functions: Azure ARM template](external-functions-creating-azure-template-private-connect.md) (includes billing section)
* [Private connectivity with external functions: Azure Portal](external-functions-creating-azure-ui-private-connect.md) (includes billing section)

## Choosing the method for creating the external function

Snowflake provides instructions for two ways to create an external function on Azure:

* Azure Portal web interface
* ARM (Azure Resource Manager) template provided by Snowflake

### Azure portal

You can use the [Azure Portal](https://azure.microsoft.com/en-us/features/azure-portal/) to create an Azure Function (as the remote
service) and an API Management service instance (as the proxy service). If you choose this method, you also use the Azure Portal to
configure security-related settings.

The instructions for creating an external function using the Azure Portal include a sample Azure Function and details for creating a basic
API Management service instance:

* First-time users can use the instructions and sample Azure Function with little or no modification.
* Experienced users can use the instructions and sample Azure Function as a starting point for creating a custom Azure Function and a
  custom-configured API Management service instance.

### ARM (Azure Resource Manager) template

An [ARM template](https://docs.microsoft.com/en-us/azure/azure-resource-manager/templates/overview) uses [JSON](https://www.json.org/) to describe
configuration information about an Azure Function (as the remote service) and an Azure API Management service instance (as the proxy
service).

Snowflake provides a sample ARM template that includes the following:

* Sample Azure Function.
* Most of the configuration information for a sample API Management service. You must enter some additional information if you wish to
  customize the sample API Management service.
* Code to create a storage account needed by the Azure Functions service.
* Code to add a validate-JWT (JSON Web Token) Policy to the API Management instance in order to increase security of the Azure API
  Management service. However, you must manually update the validate-JWT policy before using it.

ARM templates can be useful for both first-time and experienced users:

* First-time users might want to start with the Snowflake sample template because it reduces the number of steps required to create the
  Azure Function and the API Management service instance.

  Note that although the template-based instructions help you create your first external function quickly, they skip steps that
  most users need when creating customized external functions.
* Experienced users might want to use ARM templates because templates can be used to automate deployment. This can be useful if you are
  developing an Azure Function and API Management service iteratively.

For more information about configuring Azure Functions using ARM templates, see the Microsoft documentation:
[resource deployment](https://docs.microsoft.com/en-us/azure/azure-functions/functions-infrastructure-as-code) .

## Preparing to use the Azure portal

These sections help you prepare to use the Azure Portal to create an external function on Microsoft Azure.

### Choose the pricing plan for your Azure function

In Microsoft Azure, an Azure Function (remote service) can run on a Linux host or Windows host. At this time, Azure offers different
combinations of pricing and authentication options for Linux and Windows hosts.

If you plan to run your Azure Function on Linux, then you must choose a valid combination of Azure pricing plan and authentication:

* If you use the Premium or App Service pricing plan:

  + Create the Azure AD (Active Directory) application from the Authentication/Authorization tab on the Azure Functions
    screen in the Azure Portal.
  + Use Azure AD for authentication with the Azure Functions service.

  Additional details and links are provided later in the
  [instructions for creating a remote service](external-functions-creating-azure-ui-remote-service.md).
* If you use the Consumption pricing plan:

  + Manually create the Azure AD application in the Azure Portal. Additional details are provided later in the
    [instructions for creating a remote service](external-functions-creating-azure-ui-remote-service.md).
  + Set a validate-JWT policy for the API Management instance. For details, see [Step 6: Create the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-ui-security-policy.md).
  + Use IP address restrictions to limit the remote service to accept connections only from the API Management service instance. For
    details, see [Restrict the IP addresses that accept Azure functions calls (optional)](external-functions-creating-azure-ui-security-policy.md).

### Create a worksheet for tracking required information

As you complete the tasks to create an external function in the Azure Portal, you are required to enter specific values
(e.g. API Management service name) during each step in the process. Often, the values you enter are required in subsequent steps.

To facilitate recording/tracking this information, we’ve provided a worksheet with fields for each of the required values:

```none
================================================================================
======================= Tracking Worksheet: Azure Portal =======================
================================================================================

****** Step 1: Azure Function (Remote Service) Info ****************************

Azure Function app name: _______________________________________________________
HTTP-Triggered Function name: __________________________________________________
Azure Function AD app registration name: _______________________________________
Azure Function App AD Application ID: __________________________________________

    (The value for the Azure Function App AD Application ID above is the
    "Application (client) ID" of the Azure AD app registration for the
    Azure Function. The value is used to fill in the "azure_ad_application_id"
    field in the CREATE API INTEGRATION command. This value is in the form of a
    UUID (universally unique identifier), which contains hexadecimal digits and
    dashes.)

****** Step 2: Azure API Management Service (Proxy Service) Info ***************

API Management service name: ___________________________________________________
API Management API URL suffix: _________________________________________________

****** Steps 3-5: API Integration & External Function Info *********************

API Integration Name: __________________________________________________________
AZURE_MULTI_TENANT_APP_NAME: ___________________________________________________
AZURE_CONSENT_URL: _____________________________________________________________

External Function Name: ________________________________________________________
```

## Preparing to use the ARM template

These sections help you prepare to use the ARM template provided by Snowflake to create an external function on Microsoft Azure.

### Download the template

The template is available to download from the
[Snowflake repository in GitHub](https://github.com/Snowflake-Labs/sfguide-external-functions-examples/tree/main/DeploymentTemplates/azure/BasicSetup.json).

Before you can use the template, you must import it into the Azure Portal. Details for importing the template are included later in the
topic that describes using the template.

### Choose the pricing plan for your Azure function

In Microsoft Azure, an Azure Function (remote service) can run on a Linux host or Windows host. At this time, Azure offers different
combinations of pricing and authentication options for Linux and Windows hosts.

The Snowflake-provided ARM template defaults to using the following pricing plan and authentication information:

* Defaults to using a Windows host for the Azure Function.
* Defaults to the “Consumption” pricing tier.
* Creates an Azure Functions app, and configures that app to require AD (Active Directory) authentication.
* Creates a security policy to validate a JWT (JSON Web Token) that authorizes Snowflake to call your
  Azure Function.

  Note that this security policy is missing one field; instructions provided later tell you how to fill in this field.

If you plan to run your Azure API management instance or Azure Function with a different configuration, you must update the
template. For information about updating the template, see the Microsoft documentation:

* [Automating resource deployment](https://docs.microsoft.com/en-us/azure/azure-functions/functions-infrastructure-as-code)
  (for your function app in Azure Functions)

### Create a worksheet for tracking required information

As you complete the tasks to create an external function using the ARM template provided by Snowflake, you are required to enter specific
values (e.g. API Management service name) during each step in the process. Often, the values you enter are required in subsequent steps.

To facilitate recording/tracking this information, we’ve provided a worksheet with fields for each of the required values:

> **Note:**
>
> For information hard-coded in the ARM template, the values have already been filled in.

```none
================================================================================
======================= Tracking Worksheet: ARM Template =======================
================================================================================

****** Step 1: Azure Function (Remote Service) Info ****************************

HTTP-Triggered Function name: __________________ echo __________________________
Azure Function AD Application ID: ______________________________________________

    (The value for the Azure Function AD Application ID above is the
    "Application (client) ID" of the Azure AD app registration for the
    Azure Function. The value is used to fill in the "azure_ad_application_id"
    field in the CREATE API INTEGRATION command. This value is in the form of a
    UUID (universally unique identifier), which contains hexadecimal digits and
    dashes.)

****** Step 2: Azure API Management Service (Proxy Service) Info ***************

API Management service name: ___________________________________________________
API Management URL: ____________________________________________________________
Azure Function HTTP Trigger URL: _______________________________________________
API Management API URL suffix: _________________________________________________

****** Steps 3-5: API Integration & External Function Info *********************

API Integration Name: __________________________________________________________
AZURE_MULTI_TENANT_APP_NAME: ___________________________________________________
AZURE_CONSENT_URL: _____________________________________________________________

External Function Name: ________________________________________________________
```

## Next step

Azure Portal:
:   [Step 1: Create the remote service (Azure function) in the Portal](external-functions-creating-azure-ui-remote-service.md)

ARM template:
:   [Step 1: Create an Azure AD app for the Azure functions app in the Portal](external-functions-creating-azure-template-apps.md)

---
title: Planning an external function for GCP
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-planning.md
section: SQL General Reference
---

# Planning an external function for GCP

This topic helps you prepare to create an external function for GCP (Google Cloud Platform) using the Google Cloud Console
user interface.

## Prerequisites

These instructions assume that you are an experienced Google Cloud Console user.

To create an external function for GCP, you must have the following:

* A Google Cloud project ID.
* The correct services enabled for your Google Cloud Project. For detailed requirements, see the
  [Quickstart for Deploying an API/API Gateway using gcloud](https://cloud.google.com/api-gateway/docs/quickstart).

## Create a worksheet to track required information

As you complete the tasks to create an external function in the Google Cloud Console, you are required to enter specific values
(e.g. Cloud Function Trigger URL) during each step in the process. Often, the values you enter are required in subsequent steps.

To facilitate recording/tracking this information, we’ve provided a worksheet with fields for each of the required values:

```none
================================================================================
=================== Tracking Worksheet: Google Cloud Console ===================
================================================================================

****** Step 1: Cloud Function (Remote Service) Info ****************************

Cloud Function Trigger URL: ____________________________________________________

****** Step 2: API Config File Info ********************************************

Path Suffix: ___________________________________________________________________
Configuration File Name: _______________________________________________________

****** Step 2: API Gateway (Proxy Service) Info ********************************

Managed Service Identifier: ____________________________________________________
Gateway Base URL : _____________________________________________________________

****** Steps 3-4: API Integration & External Function Info *********************

API Integration Name: __________________________________________________________
API_GCP_SERVICE_ACCOUNT: _______________________________________________________

External Function Name: ________________________________________________________

****** Step 5: Security Info ***************************************************

Security Definition Name: ______________________________________________________
```

## Next step

[Step 1: Create the remote service (Google Cloud Function) in the console](external-functions-creating-gcp-ui-remote-service.md)

---
title: Private connectivity with external functions: Azure ARM template
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-template-private-connect.md
section: SQL General Reference
---

# Private connectivity with external functions: Azure ARM template

This topic provides configuration details to set up private connectivity to an external service by calling an external function for
Snowflake accounts on Microsoft Azure. You can use the ARM template to configure resources in Microsoft Azure. Afterward, you can create an API integration
and external function in Snowflake. Finally, you can call the external function to validate private connectivity to the external service.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Process overview

The following is a general overview of the configuration process. Steps in Snowflake must be done by a user with the ACCOUNTADMIN role.
Steps in Azure are done by a user with rights to use the corresponding resources unless otherwise specified.

The following steps are the same as using external functions with the public Internet:

1. Complete the prerequisite steps for external functions on Microsoft Azure.
2. In the Azure Portal, create an application.
3. In the Azure Portal, create the remote service.

However, you might want to create new resources to fully differentiate your private connectivity needs from your public Internet needs.
Consult with your internal security administrators to determine the best approach for your needs.

These steps are unique to external functions that use private connectivity for an external service:

1. In Snowflake, create a private endpoint.
2. In the Azure Portal, approve the private endpoint.

   This action is done by the owner of the Azure API Management resource.
3. In Snowflake, create a new API integration.

   You need a dedicated API integration to support private connectivity to the external service.
4. In Snowflake, create an external function. Use the private connectivity URL as the invocation URL in the external function.
5. In Snowflake, call the external function to enable Snowflake to connect to the external service using private connectivity.

## Configuration

Complete these steps in the Azure Portal:

1. If you already have an ARM template set up and you want to reuse the remote service and proxy service, skip to the private
   connectivity steps. Otherwise, complete these steps:
2. Complete the [prerequisites](external-functions-creating-azure-planning.md) for external functions on Microsoft
   Azure.
3. In the Azure Portal, [create an application](external-functions-creating-azure-template-apps.md).
4. Create the remote service as follows:

   1. In the Azure Portal, search for `Deploy a custom template`.
   2. In the Select a template tab, select Build your own template.
   3. Select Load file.
   4. Navigate to the directory on the machine where you downloaded the template and select that template.
   5. Select Save. This takes you to the Custom deployment screen.
5. Continue with these steps:

   1. [Create the Azure function and API Management service](external-functions-creating-azure-template-services.md).
   2. [Obtain the required URLs for the API integration and external function](external-functions-creating-azure-template-services.md).

Complete these steps to configure private connectivity:

1. In Snowflake, run the [CREATE API INTEGRATION](sql/create-api-integration.md) command to create a new API integration to support private
   connectivity to the external service. Update the property values to align with your Microsoft Azure subscription:

   ```sqlexample
   CREATE API INTEGRATION external_api_integration_azure_private
     API_PROVIDER = azure_private_api_management
     AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
     AZURE_AD_APPLICATION_ID = 'dv3421nq-1g4s-4ap4-x89c-xrf28hna7m2o'
     API_ALLOWED_PREFIXES = ('https://aztest1-external-function-api.azure.net')
     ENABLED = TRUE
     COMMENT = 'API integration for private connectivity to an external service with external functions on Azure.';
   ```
2. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](functions/system_provision_privatelink_endpoint.md) system function to create the private
   endpoint. Update the argument values to align with your Microsoft Azure subscription:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
     'aztest1-external-function-api.azure.net',
     'Gateway'
     );
   ```
3. In the Azure Portal and as the owner of the Azure API Management resource, approve the private endpoint. For details, see the [approval
   process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
4. [Link the API Integration for Azure to the proxy service](external-functions-creating-azure-common-api-integration-proxy-link.md) to enable Snowflake to send API requests to the Azure API Management
   service.
5. You can choose to block public access to the Azure API Management resource. For more information, see
   Secure access to the Azure API Management resource (in this topic).
6. In Snowflake, if you already have a database and schema to store the external function and want to use these objects, be sure these
   objects are [in use](sql/use.md) or select them in Snowsight. Otherwise, create a database and schema to
   store the external the external function for use with private connectivity to an external service:

   ```sqlexample
   CREATE DATABASE private_external_service_db;
   CREATE SCHEMA private_ext_functions;
   ```
7. In Snowflake, run the [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) command to create the external function to use with
   private connectivity to the external service. Be sure to update the invocation URL with the external service private connectivity URL:

   ```sqlexample
   CREATE OR REPLACE SECURE EXTERNAL FUNCTION private_ext_function_azure_portal(
     a INTEGER , b VARCHAR)
     RETURNS VARIANT
     API_INTEGRATION = external_api_integration_azure_private
     AS 'https://aztest1-external-function-api.azure.net/my-api-url-suffix/http-function-name';
   ```

   The URL format depends on whether you are creating an external function using the Azure Portal or the Azure ARM template. For
   details, see [invocation URL format](external-functions-creating-azure-common-ext-function.md).
8. In Snowflake, call the external function to test private connectivity to the external service:

   ```sqlexample
   SELECT private_ext_function_azure(66, 'Mario');
   ```

   ```output
   [0, 66, 'Mario']
   ```

If the output of the function returns the result that matches the configuration of the remote service at the beginning of the procedure,
then you confirmed that private connectivity to the external service is working as expected.

## Secure access to the Azure API Management resource

You can secure the access to the Azure API Management resource that is associated with the private endpoint for use with external functions.
From the perspective of the Azure API Management resource, Snowflake is an inbound connection. By securing the access, you reduce the
likelihood of attacks that might compromise your use of external functions.

For example, you might want to run this Azure CLI
[apim command](https://learn.microsoft.com/en-us/cli/azure/apim?view=azure-cli-latest#az-apim-update) to block public access:

```none
az apim update --name <api-name> --resource-group <resource group name> --public-network-access false
```

Update the placeholder values with the values that correspond to the name of the API Management resource and the name of the resource group.

For details and options, see these topics:

* [Use a virtual network to secure inbound and outbound traffic for Azure API Management](https://learn.microsoft.com/en-us/azure/api-management/virtual-network-concepts?tabs=stv2).
* [Connect privately to API Management using an inbound private endpoint](https://learn.microsoft.com/en-us/azure/api-management/private-endpoint).

---
title: Private connectivity with external functions: Azure Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-ui-private-connect.md
section: SQL General Reference
---

# Private connectivity with external functions: Azure Portal

This topic provides configuration details to set up outbound private connectivity to an external service by calling an external function
for Snowflake accounts on Microsoft Azure as follows:

* Use the Azure Portal user interface to configure resources in Microsoft Azure.
* Create an API integration and external function in Snowflake.
* Call the external function in Snowflake to validate private connectivity to the external service.

## Outbound private connectivity costs

You pay for each private connectivity endpoint along with total data processed. For pricing of these items, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

You can explore the cost of these items by filtering on the following service types when querying billing views in the ACCOUNT_USAGE and ORGANIZATION_USAGE schemas:

* OUTBOUND_PRIVATELINK_ENDPOINT
* OUTBOUND_PRIVATELINK_DATA_PROCESSED

For example, you can query the [USAGE_IN_CURRENCY_DAILY](organization-usage/usage_in_currency_daily.md) view and filter on these service types.

## Process overview

The following is a general overview of the configuration process. Steps in Snowflake must be done by a user with the ACCOUNTADMIN role.
Steps in the Azure Portal are done by a user with rights to use the corresponding resources unless otherwise specified.

The following steps are the same as using external functions with the public Internet:

1. Complete the prerequisite steps for external functions on Microsoft Azure.
2. In the Azure Portal, create the remote service.
3. In the Azure Portal, create the proxy service.

However, you might want to create new resources to fully differentiate your private connectivity needs from your public Internet needs.
Consult with your internal security administrators to determine the best approach for your needs.

These steps are unique to external functions that use private connectivity for an external service:

1. In Snowflake, create a private endpoint.

   Snowflake stores the private IP address for the private endpoint internally.
2. In the Azure Portal, approve the private endpoint.

   This action is done by the owner of the Azure API Management resource (external service).
3. In Snowflake, create a new API integration.

   You need a dedicated API integration to support private connectivity to the external service.
4. In Snowflake, create an external function. The private connectivity URL is the value for the invocation URL in the external function.
5. In Snowflake, call the external function to enable Snowflake to connect to the external service using private connectivity.
6. Deprovision any private connectivity endpoints that are not necessary.

## Configuration

Complete these steps in the Azure Portal:

1. If you already have the Azure API Management resource set up and you want to reuse the remote service and proxy service, skip to the
   private connectivity steps. Otherwise, complete these steps:
2. Complete the [prerequisites](external-functions-creating-azure-planning.md) for external functions on Microsoft Azure.
3. In the Azure Portal, [create the remote service](external-functions-creating-azure-ui-remote-service.md).
4. In the Azure Portal, [create the proxy service](external-functions-creating-azure-ui-proxy-service.md).

Complete these steps to configure private connectivity:

1. In Snowflake, run the [CREATE API INTEGRATION](sql/create-api-integration.md) command to create a new API integration to support private
   connectivity to the external service. Update the property values to align with your Microsoft Azure subscription:

   ```sqlexample
   CREATE API INTEGRATION external_api_integration_azure_private
     API_PROVIDER = azure_private_api_management
     AZURE_TENANT_ID = 'a123b4c5-1234-123a-a12b-1a23b45678c9'
     AZURE_AD_APPLICATION_ID = 'dv3421nq-1g4s-4ap4-x89c-xrf28hna7m2o'
     API_ALLOWED_PREFIXES = ('https://aztest1-external-function-api.azure.net')
     ENABLED = TRUE
     COMMENT = 'API integration for private connectivity to an external service with external functions on Azure.';
   ```
2. In Snowflake, call the [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](functions/system_provision_privatelink_endpoint.md) system function to create the private
   endpoint. Update the argument values to align with your Microsoft Azure subscription:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SELECT SYSTEM$PROVISION_PRIVATELINK_ENDPOINT(
     '/subscriptions/f4b00c5f-f6bf-41d6-806b-e1cac4f1f36f/resourceGroups/aztest1-external-function-rg/providers/Microsoft.ApiManagement/service/aztest1-external-function-api',
     'aztest1-external-function-api.azure.net',
     'Gateway'
     );
   ```
3. In the Azure Portal and as the owner of the Azure API Management resource, approve the private endpoint. For details, see the [approval
   process](https://learn.microsoft.com/en-us/azure/private-link/manage-private-endpoint?tabs=manage-private-link-powershell#private-endpoint-connections).
4. [Link the API Integration for Azure to the proxy service](external-functions-creating-azure-common-api-integration-proxy-link.md) to enable Snowflake to send API requests to the Azure API Management
   service.
5. You can choose to block public access to the Azure API Management resource. For more information, see
   Secure access to the Azure API Management resource (in this topic).
6. In Snowflake, if you already have a database and schema to store the external function and want to use these objects, be sure these
   objects are [in use](sql/use.md) or select them in Snowsight. Otherwise, create a database and schema to
   store the external the external function for use with private connectivity to an external service:

   ```sqlexample
   CREATE DATABASE private_external_service_db;
   CREATE SCHEMA private_ext_functions;
   ```
7. In Snowflake, run the [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) command to create the external function to use with
   private connectivity to the external service. Be sure to update the invocation URL with the external service private connectivity URL:

   ```sqlexample
   CREATE OR REPLACE SECURE EXTERNAL FUNCTION private_ext_function_azure_portal(
     a INTEGER , b VARCHAR)
     RETURNS VARIANT
     API_INTEGRATION = external_api_integration_azure_private
     AS 'https://aztest1-external-function-api.azure.net/my-api-url-suffix/http-function-name';
   ```

   The URL format depends on whether you are creating an external function using the Azure Portal or the Azure ARM template. For
   details, see [invocation URL format](external-functions-creating-azure-common-ext-function.md).
8. In Snowflake, call the external function to test private connectivity to the external service:

   ```sqlexample
   SELECT private_ext_function_azure(66, 'Mario');
   ```

   ```output
   [0, 66, 'Mario']
   ```

If the output of the function returns the result that matches the configuration of the remote service at the beginning of the procedure,
then you confirmed that private connectivity to the external service is working as expected.

## Secure access to the Azure API Management resource

You can secure the access to the Azure API Management resource that is associated with the private endpoint for use with external functions.
From the perspective of the Azure API Management resource, Snowflake is an inbound connection. By securing the access, you reduce the
likelihood of attacks that might compromise your use of external functions.

For example, you might want to run this Azure CLI
[apim command](https://learn.microsoft.com/en-us/cli/azure/apim?view=azure-cli-latest#az-apim-update) to block public access:

```none
az apim update --name <api-name> --resource-group <resource group name> --public-network-access false
```

Update the placeholder values with the values that correspond to the name of the API Management resource and the name of the resource group.

For details and options, see these topics:

* [Use a virtual network to secure inbound and outbound traffic for Azure API Management](https://learn.microsoft.com/en-us/azure/api-management/virtual-network-concepts?tabs=stv2).
* [Connect privately to API Management using an inbound private endpoint](https://learn.microsoft.com/en-us/azure/api-management/private-endpoint).

---
title: QUALIFY
source: https://docs.snowflake.com/en/sql-reference/constructs/qualify.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# QUALIFY

In a SELECT statement, the QUALIFY clause filters the results of window functions.

QUALIFY does with window functions what HAVING does with aggregate functions and GROUP BY clauses.

In the execution order of a query, QUALIFY is therefore evaluated after window functions are computed. Typically,
a SELECT statement’s clauses are evaluated in the order shown below:

> 1. FROM
> 2. WHERE
> 3. GROUP BY
> 4. HAVING
> 5. WINDOW
> 6. QUALIFY
> 7. DISTINCT
> 8. ORDER BY
> 9. LIMIT

## Syntax

```sqlsyntax
QUALIFY <predicate>
```

The general form of a statement with QUALIFY is similar to the following
(some variations in order are allowed, but are not shown):

```sqlsyntax
SELECT <column_list>
  FROM <data_source>
  [GROUP BY ...]
  [HAVING ...]
  QUALIFY <predicate>
  [ ... ]
```

## Parameters

`column_list`
:   This generally follows the rules for the projection clause of a [SELECT](../sql/select.md) statement.

`data_source`
:   The data source is usually a table, but can be another table-like data source, such as a view, UDTF (user-defined table function),
    etc.

`predicate`
:   The predicate is an expression that filters the result after aggregates and window functions are computed.
    The predicate should look similar to a [HAVING](having.md) clause, but without the
    keyword HAVING. In addition, the predicate can also contain window functions.

    See the Examples section (in this topic) for predicate examples.

## Usage notes

* The QUALIFY clause requires at least one window function to be specified in at least one of the following clauses
  of the SELECT statement:

  + The SELECT column list.
  + The filter predicate of the QUALIFY clause.

  Examples of each of these are shown in the Examples section below.
* Expressions in the SELECT list, including window functions, can be referred to by the column alias defined in the
  SELECT list.
* QUALIFY supports aggregates and subqueries in the predicate. For aggregates, the same rules as for the HAVING clause
  apply.
* The word QUALIFY is a reserved word.
* The Snowflake syntax for QUALIFY is not part of the ANSI standard.

## Examples

The QUALIFY clause simplifies queries that require filtering on the result of window functions. Without QUALIFY,
filtering requires nesting. The example below uses the ROW_NUMBER() function to return only the first row in each
partition.

Create and load a table:

```sqlexample
CREATE TABLE qt (i INTEGER, p CHAR(1), o INTEGER);
INSERT INTO qt (i, p, o) VALUES
  (1, 'A', 1),
  (2, 'A', 2),
  (3, 'B', 1),
  (4, 'B', 2);
```

```output
+-------------------------+
| number of rows inserted |
|-------------------------|
|                       4 |
+-------------------------+
```

This query uses nesting rather than QUALIFY:

```sqlexample
SELECT *
  FROM (
    SELECT i, p, o, ROW_NUMBER() OVER (PARTITION BY p ORDER BY o) AS row_num
      FROM qt)
  WHERE row_num = 1;
```

```output
+---+---+---+---------+
| I | P | O | ROW_NUM |
|---+---+---+---------|
| 1 | A | 1 |       1 |
| 3 | B | 1 |       1 |
+---+---+---+---------+
```

This query uses QUALIFY:

```sqlexample
SELECT i, p, o
  FROM qt
  QUALIFY ROW_NUMBER() OVER (PARTITION BY p ORDER BY o) = 1;
```

```output
+---+---+---+
| I | P | O |
|---+---+---|
| 1 | A | 1 |
| 3 | B | 1 |
+---+---+---+
```

You can also use QUALIFY to reference window functions that are in the SELECT column list:

```sqlexample
SELECT i, p, o, ROW_NUMBER() OVER (PARTITION BY p ORDER BY o) AS row_num
  FROM qt
  QUALIFY row_num = 1;
```

```output
+---+---+---+---------+
| I | P | O | ROW_NUM |
|---+---+---+---------|
| 1 | A | 1 |       1 |
| 3 | B | 1 |       1 |
+---+---+---+---------+
```

You can see how QUALIFY acts as a filter by removing it from the previous query and comparing the output:

```sqlexample
SELECT i, p, o, ROW_NUMBER() OVER (PARTITION BY p ORDER BY o) AS row_num
  FROM qt;
```

```output
+---+---+---+---------+
| I | P | O | ROW_NUM |
|---+---+---+---------|
| 1 | A | 1 |       1 |
| 2 | A | 2 |       2 |
| 3 | B | 1 |       1 |
| 4 | B | 2 |       2 |
+---+---+---+---------+
```

The QUALIFY clause can also be combined with aggregate functions and subqueries in the predicate. In such a query,
HAVING filters rows after GROUP BY aggregation, while QUALIFY filters rows after window functions are computed.
Both clauses can appear together when a query requires both kinds of filtering. For example:

```sqlexample
SELECT p, SUM(o) OVER (PARTITION BY p) AS r
  FROM qt
  WHERE o < 4
  GROUP BY p, o
  HAVING SUM(i) > 3
  QUALIFY r IN (
    SELECT MIN(i)
      FROM qt
      GROUP BY p
      HAVING MIN(i) > 3);
```

---
title: Query operators
source: https://docs.snowflake.com/en/sql-reference/operators.md
section: SQL General Reference
---

# Query operators

Snowflake supports most of the standard operators defined in SQL:1999.

These operators include arithmetic operators (such as `+` and `-`),
set operators (such as UNION), subquery operators (such as ANY), and so on.

| Category | Operators |
| --- | --- |
| [Arithmetic operators](operators-arithmetic.md) | `+` , `-` , `*` , `/` , `%` |
| [Comparison operators](operators-comparison.md) | `=` , `!=` , `<>` , `<` , `<=` , `>` , `>=` |
| [Expansion operators](operators-expansion.md) | `**` |
| [Flow operators](operators-flow.md) | `->>` |
| [Logical operators](operators-logical.md) | AND , NOT , OR |
| [Set operators](operators-query.md) | INTERSECT, MINUS, EXCEPT, UNION |
| [Subquery operators](operators-subquery.md) | [NOT] EXISTS, ANY / ALL, [NOT] IN |

See also [Bitwise expression functions](expressions-byte-bit.md).

---
title: RAISE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/raise.md
section: SQL General Reference
---

# RAISE (Snowflake Scripting)

Raises an exception.

For more information about exceptions, see [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [EXCEPTION](exception.md)

## Syntax

```sqlsyntax
RAISE <exception_name> ;
```

Where:

> `exception_name`
> :   The name of the exception to raise.
>
>     If you are handling an exception in an exception handler and you want to raise the same exception again, omit this
>     argument. See [Raising the same exception again in an exception handler in Snowflake Scripting](../../developer-guide/snowflake-scripting/exceptions.md).

## Examples

This creates and raises (but does not catch) a simple exception:

```sqlexample
CREATE PROCEDURE thrower()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    DECLARE
        MY_EXCEPTION EXCEPTION;
    BEGIN
        RAISE MY_EXCEPTION;
    END;
$$
;
```

Here is the call to the stored procedure that raises the exception:

```sqlexample
CALL thrower();
```

Here is the output of executing the stored procedure that raises the exception:

```output
-20000 (P0001): Uncaught exception of type 'MY_EXCEPTION' on line 5 at position 8
```

The next example is similar to the preceding example, but uses an exception for which the user defined a custom
exception number and exception message:

```sqlexample
    DECLARE
        MY_EXCEPTION EXCEPTION (-20002, 'Raised MY_EXCEPTION.');
```

Here is the output of executing the stored procedure that raises the exception:

```output
-20002 (P0001): Uncaught exception of type 'MY_EXCEPTION' on line 7 at position 8 : Raised MY_EXCEPTION.
```

For more examples, see the examples for
[handling an exception](exception.md).

---
title: RECOMMEND_NETWORK_POLICY
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/recommend_network_policy.md
section: SQL General Reference
---

# RECOMMEND_NETWORK_POLICY

Generates a recommended allow-list for an ingress network policy based on successful access within a
specified lookback window.

This stored procedure is intended as a starting point if you don’t currently have a network policy or want to redesign an existing one.

The procedure analyzes successful ingress requests, optimizes individual IPs into CIDR blocks, and returns
human-readable SQL that administrators can review, refine, and execute.

See also:
:   [EVALUATE_CANDIDATE_NETWORK_POLICY](evaluate_candidate_network_policy.md)

## Syntax

```sqlsyntax
SNOWFLAKE.NETWORK_SECURITY.RECOMMEND_NETWORK_POLICY(
  LOOKBACK_DAYS => <integer>
  [, USER_NAME => '<string>' ]
  )
```

## Arguments

**Required:**

`LOOKBACK_DAYS => 'integer'`
:   The number of days of successful ingress access to analyze.

**Optional:**

`USER_NAME => 'string'`
:   Filters the recommendation to include only traffic from the specified user.

    Default: None (includes all users in an account).

## Returns

Returns human-readable text that contains example SQL statements. The output includes the following information:

* A summary of the number of distinct IP addresses analyzed and the number of CIDR blocks produced.
* An example CREATE OR REPLACE NETWORK RULE statement for an ingress network rule.
* An example CREATE OR REPLACE NETWORK POLICY statement for a network policy that references the rule.

# Access control requirements

A user must have the SECURITYADMIN role at a minimum to run this stored procedure.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

* The procedure is read-only with respect to account configuration. It does not create or modify
  any network rules or policies.
* Recommendations are based only on historical **successful** ingress. Blocked or failed access is
  not recommended for allow-listing.
* This procedure can’t determine which IP addresses are correct or safe for your organization.
  You must validate results with your IT and security teams before executing the generated SQL.
* SQL is provided as text to support copy-paste workflows.
* Output might vary depending on traffic volume and lookback window.
* The USER_NAME filter is optional. When omitted, the recommendation covers all users in an account.
* The procedure enforces a hard limit of **1,000 CIDR blocks**. If the recommendation exceeds
  this limit, the procedure returns an error. To stay within the limit, try a shorter lookback window
  or filter by user.

## Examples

Generate a recommended network policy based on the last 1 day of traffic for a specific user:

```sqlexample
USE ROLE SECURITYADMIN;

CALL SNOWFLAKE.NETWORK_SECURITY.RECOMMEND_NETWORK_POLICY(
  LOOKBACK_DAYS => 1,
  USER_NAME => 'user1'
  );
```

Generate a recommended network policy based on the last 30 days of traffic for all users:

```sqlexample
USE ROLE SECURITYADMIN;

CALL SNOWFLAKE.NETWORK_SECURITY.RECOMMEND_NETWORK_POLICY(
  LOOKBACK_DAYS => 30
  );
```

```output
Recommended candidate network policy based on 1,000 distinct IP addresses,
optimized to 99 CIDR blocks from the last 30 days.

You can execute the following statements with appropriate privileges:

-- Create a network rule

CREATE OR REPLACE NETWORK RULE my_ingress_rule
  MODE = INGRESS
  TYPE = IPV4
  VALUE_LIST = ('203.0.113.0/24', ...);

-- Create a network policy

CREATE OR REPLACE NETWORK POLICY my_ingress_policy
  ALLOWED_NETWORK_RULE_LIST = ('my_ingress_rule');
```

---
title: REGISTER_EXTENSION
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/register_extension.md
section: SQL General Reference
---

# REGISTER_EXTENSION

Registers an extension with the Trust Center.

For more information, see [Using Trust Center extensions](../../user-guide/trust-center/trust-center-extensions.md).

## Syntax

```sqlsyntax
SNOWFLAKE.TRUST_CENTER.REGISTER_EXTENSION(
  '<source_type>',
  '<source>',
  '<extension_name>')
```

## Arguments

`'source_type'`
:   The source type of the extension. Possible values are `LISTING` and
    `APPLICATION PACKAGE`.

`'source'`
:   The listing ID or the name of the application package.

    You can run the [SHOW APPLICATIONS](../sql/show-applications.md) SQL command to see all of the
    Snowflake Native Apps that are installed in your account, including the extensions. The listing ID
    or application package for an extension is shown in the `source` column in the output.

`'extension_name'`
:   Name of the extension.

    In the output for the SHOW APPLICATIONS SQL command, the extension names are shown in the
    `name` column.

## Returns

Returns a VARCHAR value:

* If registration is successful, the VARCHAR value contains the following message:

  ```output
  Extension <name> is successfully registered.
  ```
* If registration fails, the VARCHAR value contains an error message. Registration can fail for the following reasons:

  + The specified `source_type` is invalid.
  + The specified `source` is invalid.
  + The specified `extension_name` is invalid.
  + The combination of the `source_type`, `source`, and `extension_name` is invalid.
  + The `trust_center_integration_role` role in the namespace of the extension isn’t granted to the SNOWFLAKE application.

## Examples

The following example registers an extension that is named `tc_extension` that was installed from the private listing
`GZ13Z1VEWNG`:

```sqlexample
CALL SNOWFLAKE.TRUST_CENTER.REGISTER_EXTENSION(
  'LISTING',
  'GZ13Z1VEWNG',
  'tc_extension');
```

---
title: Remote service input and output data formats
source: https://docs.snowflake.com/en/sql-reference/external-functions-data-format.md
section: SQL General Reference
---

# Remote service input and output data formats

When Snowflake sends data to a remote service, or receives data from a remote
service, the data must be formatted correctly. This topic provides information
about the proper data formats. Data received from and returned to Snowflake must
also be of an [appropriate data type](external-functions-best-practices.md).

When executing an external function, for example, Snowflake sends and expects data in the
format described here. It sends the data to a proxy service, not directly to the
remote service (for more, see [Introduction to external functions](external-functions-introduction.md)). Therefore, the proxy service must receive
(and return) data in a Snowflake-compatible format. Although
typically the proxy service passes data through unchanged, the proxy can reformat data (both
sending and receiving) to meet the needs of both the remote service and Snowflake.

For simplicity, and to help illustrate the formats that Snowflake expects to send and receive, most of the examples
in this section assume that the remote service reads and writes data in the same format as Snowflake expects, and
the proxy service passes data through unchanged in both directions.

## Data format sent by Snowflake

Each HTTP request from Snowflake is a POST or a GET.

* A POST request contains headers and a request body. The request body
  includes a [batch](external-functions-introduction.md) of rows.
* A GET contains only headers, and is used only for polling when the remote
  service returns results [asynchronously](external-functions-implementation.md).

### Body format

The body of the POST request contains the data, serialized in JSON format.

The schema for the JSON is:

* The top-level is a JSON object (a set of name/value pairs, also called a “dictionary”).
* Currently, there is exactly one item in that object; the key for that item is named “data”.
* That “data” item’s value is a JSON array, in which:

  + Each element is one row of data.
  + Each row of data is a JSON array of one or more columns.
  + The first column is always the row number (i.e. the 0-based index of the row within the batch).
  + The remaining columns contain the arguments to the function.
* Data types are serialized as follows:

  + Numbers are serialized as JSON numbers.
  + Booleans are serialized as JSON booleans.
  + Strings are serialized as JSON strings.
  + Objects are serialized as JSON objects.
  + All other supported data types are serialized as JSON strings.

    - Dates, times, and timestamps are serialized as strings. For details about formatting these data types as
      strings, see [Date and time input and output formats](date-time-input-output.md) and [Date and time formats in conversion functions](functions-conversion.md).
    - Binary columns are serialized as strings. For details, see
      [Overview of supported binary formats](binary-input-output.md).
  + NULL is serialized as JSON null.

For examples of extracting data in a remote service on each platform, see:

* AWS: [Create the Remote Service (Lambda Function on AWS)](external-functions-creating-aws-ui-remote-service.md) .
* Azure: [Create the Remote Service (Azure Function)](external-functions-creating-azure-ui-remote-service.md) .

Optionally, the JSON can be compressed for transmission over the network. Compression is
documented in [CREATE EXTERNAL FUNCTION](sql/create-external-function.md).

#### Body example

Here’s an example of a serialized request for an external function with the signature
`f(integer, varchar, timestamp)`. The first column is the row number within the batch, and the
next three values are the arguments to the external function.

> ```sqljson
> {
>     "data": [
>                 [0, 10, "Alex", "2014-01-01 16:00:00"],
>                 [1, 20, "Steve", "2015-01-01 16:00:00"],
>                 [2, 30, "Alice", "2016-01-01 16:00:00"],
>                 [3, 40, "Adrian", "2017-01-01 16:00:00"]
>             ]
> }
> ```

### Header format

The header information is generally available to the remote service as a set of
key/value pairs. The header information includes:

* The following HTTP headers:

  + Headers that describe how data is serialized in the request body:

    - “sf-external-function-format”: This is currently always set to “json”.
    - “sf-external-function-format-version”: This is currently always set to “1.0”.
  + “sf-external-function-current-query-id”: This contains the query ID of the query that called this external
    function. You can use this to correlate Snowflake queries to calls of the remote service, for example to help
    debug issues.
  + “sf-external-function-query-batch-id”: The batch ID uniquely identifies the specific batch of
    rows processed with this request. The remote service can use this ID to track the status of a batch that is being
    processed. The ID can also be used as an idempotency token if requests are retried due to an error.
    The ID can also be used for logging/tracing of requests by the remote service.

    The batch ID in a GET is the same as the batch ID in the corresponding POST.

    The batch ID is an opaque value generated by Snowflake. The format could change in future releases, so remote
    services should not rely on a specific format or try to interpret the value.
  + Headers that describe the signature (name and argument types) and return type of the external function that
    was called in the SQL query. These values can have characters that are not standard characters for Snowflake
    [identifiers](identifiers-syntax.md), so base64 versions of the information are included,
    and non-standard characters are replaced with a blank in the non-base64 versions.

    The specific headers are:

    - sf-external-function-name
    - sf-external-function-name-base64
    - sf-external-function-signature
    - sf-external-function-signature-base64
    - sf-external-function-return-type
    - sf-external-function-return-type-base64

    For example, the headers sent for the function `ext_func(n integer)  returns varchar` are:

    - sf-external-function-name: ext_func
    - sf-external-function-name-base64: <base64 value>
    - sf-external-function-signature: (N NUMBER)
    - sf-external-function-signature-base64: <base64 value>
    - sf-external-function-return-type: VARCHAR(134217728)
    - sf-external-function-return-type-base64: <base64 value>

    Because SQL INTEGER values are treated as SQL NUMBER, the SQL argument declared as type `INTEGER` is
    described as type `NUMBER`.
* Additional optional metadata described in the “headers” and “context_headers” properties of
  [CREATE EXTERNAL FUNCTION](sql/create-external-function.md).

#### Header access example

To extract the “sf-external-function-signature” header from inside an AWS Lambda function written in
Python, which receives the headers as a Python dictionary, execute the following:

> ```python
> def handler(event, context):
>
>     request_headers = event["headers"]
>     signature = request_headers["sf-external-function-signature"]
> ```

The details will be different for other languages and on other cloud platforms.

For remote services developed on AWS, more information about headers and lambda proxy integration is available in
the [AWS API Gateway documentation](https://docs.aws.amazon.com/apigateway/latest/developerguide/set-up-lambda-proxy-integrations.html#api-gateway-simple-proxy-for-lambda-input-format) .

## Data format received by Snowflake

### Body format

When a remote service finishes processing a batch, the remote service should
send data back to Snowflake in a JSON format similar to the format of the data
sent by Snowflake.

The JSON response returned to Snowflake should contain one row for each row sent by Snowflake. Each returned row
contains two values:

* The row number (i.e. the 0-based index of the row within the batch).
* The value returned from the function for that row. The value can be a compound value (for example, an OBJECT), but
  it must be exactly one value because all scalar Snowflake functions (external or otherwise) return a single value.

So that Snowflake can correlate the response with the request, the row numbers in the returned data
must correspond to the row numbers in the data that Snowflake sent and must be
returned in the same order as they were received.

#### Body access example

The following JSON example shows two rows containing an OBJECT value, each
preceded by a row number:

```sqljson
{
    "data":
        [
            [ 0, { "City" : "Warsaw",  "latitude" : 52.23, "longitude" :  21.01 } ],
            [ 1, { "City" : "Toronto", "latitude" : 43.65, "longitude" : -79.38 } ]
        ]
}
```

To compose one of these returned rows with Python, you might use the following code:

```python
...
row_number = 0
output_value = {}

output_value["city"] = "Warsaw"
output_value["latitude"] = 21.01
output_value["longitude"] = 52.23
row_to_return = [row_number, output_value]
...
```

To access the OBJECT value of a returned row with SQL, use the notation described in
[Traversing Semi-structured Data](../user-guide/querying-semistructured.md). For example:

```sqlexample
select val:city, val:latitude, val:longitude
    from (select ext_func_city_lat_long(city_name) as val from table_of_city_names);
```

### Header format

The response can also contain the following optional HTTP headers:

* Content-MD5: Snowflake uses the optional Content-MD5 header to check the integrity of the response. If this header
  is included in the response, Snowflake computes an MD5 checksum on the response body to ensure that it matches
  the corresponding checksum in the returned header. If the values do not match, the SQL query fails. The checksum
  should be encoded in a base64 representation before being returned in the header. See the example code below.

Optionally, the JSON can be compressed for transmission over the network. Compression is
documented in [CREATE EXTERNAL FUNCTION](sql/create-external-function.md).

For information about timeouts and retries, see [Account for timeout errors](external-functions-best-practices.md) and
[Do not assume that the remote service is passed each row exactly once](external-functions-best-practices.md).

### Status code

The response also contains an HTTP status code. Snowflake recognizes the following HTTP status codes:

| Code | Description |
| --- | --- |
| 200 | Batch processed successfully. |
| 202 | Batch received and still being processed. |

Other values are treated as errors.

### Response creation example

The example Python code below returns a proper response, including the HTTP response code, the processed data, and an
MD5 header (which is optional).

This example is based on an AWS Lambda function. Some code might need customization for different platforms.

```python
import json
import hashlib
import base64

def handler(event, context):

    # The return value should contain an array of arrays (one inner array
    # per input row for a scalar function).
    array_of_rows_to_return = [ ]

    ...

    json_compatible_string_to_return = json.dumps({"data" : array_of_rows_to_return})

    # Calculate MD5 checksum for the response
    md5digest = hashlib.md5(json_compatible_string_to_return.encode('utf-8')).digest()
    response_headers = {
        'Content-MD5' : base64.b64encode(md5digest)
    }

    # Return the HTTP status code, the processed data, and the headers
    # (including the Content-MD5 header).
    return {
        'statusCode': 200,
        'body': json_compatible_string_to_return,
        'headers': response_headers
    }
```

---
title: REPEAT (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/repeat.md
section: SQL General Reference
---

# REPEAT (Snowflake Scripting)

A `REPEAT` loop iterates until a specified condition is true. A `REPEAT` loop tests the condition at
the end of the loop. This means that the body of a `REPEAT` loop always executes at least once.

For more information on loops, see [Working with loops](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [BREAK](break.md), [CONTINUE](continue.md)

## Syntax

```sqlsyntax
REPEAT
    <statement>;
    [ <statement>; ... ]
UNTIL ( <condition> )
END REPEAT [ <label> ] ;
```

Where:

> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).
>
> `condition`
> :   An expression that evaluates to a BOOLEAN.
>
> `label`
> :   An optional label. Such a label can be a jump target for a [BREAK](break.md) or
>     [CONTINUE](continue.md) statement. A label must follow the naming rules for
>     [Object identifiers](../identifiers.md).

## Usage notes

* Put parentheses around the condition in the `REPEAT`. For example: `REPEAT ( <condition> )`.
* If the `condition` never evaluates to TRUE, and the loop does not contain a
  [BREAK](break.md) command (or equivalent), then the loop will run and consume credits
  indefinitely.
* If the `condition` is NULL, then it is treated as FALSE.
* A loop can contain multiple statements. You can use, but are not required to use, a [BEGIN … END](begin.md)
  [block](../../developer-guide/snowflake-scripting/blocks.md) to contain those statements.

## Examples

This example uses a loop to calculate a power of 2. (This is an inefficient solution, but it does
demonstrate looping.)

```sqlexample
CREATE PROCEDURE power_of_2()
RETURNS NUMBER(8, 0)
LANGUAGE SQL
AS
$$
DECLARE
    counter NUMBER(8, 0);      -- Loop counter.
    power_of_2 NUMBER(8, 0);   -- Stores the most recent power of 2 that we calculated.
BEGIN
    counter := 1;
    power_of_2 := 1;
    REPEAT
        power_of_2 := power_of_2 * 2;
        counter := counter + 1;
    UNTIL (counter > 8)
    END REPEAT;
    RETURN power_of_2;
END;
$$;
```

Here is the output of executing the stored procedure:

```sqlexample
CALL power_of_2();
+------------+
| POWER_OF_2 |
|------------|
|        256 |
+------------+
```

For more examples, see [REPEAT loop](../../developer-guide/snowflake-scripting/loops.md).

---
title: RESAMPLE
source: https://docs.snowflake.com/en/sql-reference/constructs/resample.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# RESAMPLE

Returns a data set that includes both input rows and generated rows for missing data points, given a user-defined time-based granularity.

## Syntax

```sqlsyntax
FROM <object_reference> [ [ AS ] <alias_name> ]
  RESAMPLE(
    USING <time_series_column>
    INCREMENT BY <time_series_constant>
    [ PARTITION BY <partition_column> [ , ... ] ]
    [ METADATA_COLUMNS
        { IS_GENERATED() | BUCKET_START() } [ [ AS ] <alias_name> ] [ , ... ] ]
    )
```

## Required parameters

`FROM object_reference`
:   Specifies the name of a table or another object reference that contains the input data set, such as a subquery.
    For more information about object references, see [FROM](from.md).

`USING time_series_column`
:   Specifies the column that contains time-based values in the time series. The column must be a
    [date and time data type](../data-types-datetime.md) or a
    [numeric data type](../data-types-numeric.md). For example, UNIX timestamp values might be stored in NUMBER(38,0)
    columns, where `1743447600` is equivalent to `2025-3-31 12:00:00`.

`INCREMENT BY time_series_constant`
:   Specifies an INTERVAL constant or a numeric constant, depending on the data type of the USING column. This constant represents the width of each time interval.
    The slices are aligned relative to midnight on January 1, 1970 (`1970-01-01 00:00:00`). The TIME_SLICE function uses the same alignment; for
    more information, see the TIME_SLICE [usage notes](../functions/time_slice.md).

    * When the USING parameter specifies a date or time column, the INCREMENT BY expression must be an
      [INTERVAL constant](../data-types-datetime.md).
    * When the USING parameter specifies a numeric column, the INCREMENT BY expression must also be numeric.

    The starting point for a generated time series is based on the minimum time of the `time_series_constant`.

    If this constant is a numeric constant, it must be positive (greater than 0).

## Optional parameters

`[ AS ] alias_name`
:   Specifies an alternative name for the object reference. The alias can be used in any other subclause within the FROM clause.
    Alias names must follow the rules for [Object identifiers](../identifiers.md).

`PARTITION BY partition_column`
:   Partitions the result set on one or more input columns and generates new rows within each partition.

`METADATA_COLUMNS {function} [ [ AS ] {alias_name} ]`
:   Adds one or more metadata columns to the resampled result set. To add the columns, call one or both of the following functions:

    `IS_GENERATED()`
    :   Adds an `is_generated` column to the result set that marks which rows are new (generated by the RESAMPLE operation) and which rows already existed.

    `BUCKET_START()`
    :   Adds a `bucket_start` column to the result set. This column returns the value that marks the beginning of the current bucket or interval
        that the RESAMPLE operation produces, based on the values in the column specified in the USING clause. You can use the BUCKET_START column to
        identify which interval a particular row belongs to after resampling.

    If you specify both metadata columns, separate them with a comma.

    Generated columns can have aliases. Alias names must follow the rules for [Object identifiers](../identifiers.md).

## Usage notes

* An INTERVAL constant in the INCREMENT BY clause has the following requirements:

  + The constant must be equal to or greater than one `second`. Smaller units (`millisecond`, `microsecond`, `nanosecond`) aren’t supported.
  + When the USING column is a DATE data type, you can’t specify a unit in the interval that is more granular than `day` (`hour`, `minute`, `second`).
    For example, the constants `'INTERVAL 1 day, 2 hours'` and `'INTERVAL 25 hours'` aren’t allowed.
  + To avoid ambiguity, certain date parts can’t be mixed. The supported date parts fall into three discrete groups:

    - `year`, `quarter`, `month`
    - `week`
    - `day`, `hour`, `minute`, `second`

    For example, the following intervals, which cross these group boundaries, aren’t allowed:

    - `INTERVAL '1 week, 3 days'`
    - `INTERVAL '2 weeks, 12 hours'`
    - `INTERVAL '3 months, 1 week'`

* With respect to joins, the RESAMPLE construct works in a similar way to the [SAMPLE / TABLESAMPLE](sample.md) construct. Resampling applies to only one table, not all preceding tables or the entire expression prior to the RESAMPLE clause. To resample the result of a join, use a subquery for the join, then resample the resulting table. See [Sampling with joins](sample.md).
* The RESAMPLE clause is evaluated before WHERE clause conditions are applied. If you want to resample a filtered data set, filter it first (for example, by creating a new table that you can resample or by using a subquery that is computed first inside the main RESAMPLE query). The following query resamples the whole table, then discards everything but the rows for `Atlanta` and `Boston`.

  ```sqlexample
  SELECT *
    FROM heavy_weather
      RESAMPLE(
        USING start_time
        INCREMENT BY INTERVAL '1 day')
    WHERE city IN('Atlanta','Boston')
    ORDER BY start_time, city, county;
  ```

  A potential rewrite with a subquery would be:

  ```sqlexample
  SELECT *
    FROM (SELECT * FROM heavy_weather WHERE city IN('Atlanta','Boston'))
      RESAMPLE(
        USING start_time
        INCREMENT BY INTERVAL '1 day')
    ORDER BY start_time, city, county;
  ```
* When resampled rows are generated, they contain NULL values for columns that aren’t partitioned. If you use a WHERE clause to filter columns that aren’t partitioned, your filter might be applied to generated rows in unexpected ways.

  For example, because the following query does not have a PARTITION BY clause, it does not work as expected. In the generated rows, `city` and `county` are NULL. The WHERE clause then filters out all generated rows; only the original rows that match the filter criteria are preserved.

  ```sqlexample
  SELECT *
    FROM march_temps
      RESAMPLE(
        USING observed
        INCREMENT BY INTERVAL '5 minutes')
    WHERE city = 'Big Bear City' AND county = 'San Bernardino'
    ORDER BY observed;
  ```

  To solve the problem, the following query partitions by `city` and `county`. The RESAMPLE clause generates rows for combinations of `city` and `county` that exist in the source data. The WHERE clause condition preserves the appropriate generated rows because `city` and `county` have non-NULL values in those rows.

  ```sqlexample
  SELECT *
    FROM march_temps
      RESAMPLE(
        USING observed
        INCREMENT BY INTERVAL '5 minutes'
        PARTITION BY city, county)
    WHERE city = 'Big Bear City' AND county = 'San Bernardino'
    ORDER BY observed;
  ```

  In comparison,

## Examples

The following examples show how to use the RESAMPLE construct in queries.

### RESAMPLE example that uses a numeric column

The following example has a UNIX timestamp in the source table. This numeric column is specified in the RESAMPLE clause as the
USING column. Create and load the following table:

```sqlexample
CREATE OR REPLACE TABLE sensor_data_unixtime (device_id VARCHAR(10), unixtime NUMBER(38,0), avg_temp NUMBER(6,4), vibration NUMBER (5,4), motor_rpm INT);

INSERT INTO sensor_data_unixtime VALUES
  ('DEVICE3', 1696150802, 36.1103, 0.4226, 1560),
  ('DEVICE3', 1696150803, 35.2987, 0.4326, 1561),
  ('DEVICE3', 1696150804, 40.0001, 0.3221, 1562),
  ('DEVICE3', 1696150805, 38.0422, 0.3333, 1589),
  ('DEVICE3', 1696150807, 33.1524, 0.4865, 1499),
  ('DEVICE3', 1696150808, 32.0422, 0.4221, 1498),
  ('DEVICE3', 1696150809, 31.1519, 0.4751, 1600),
  ('DEVICE3', 1696150810, 29.1524, 0.4639, 1605),
  ('DEVICE3', 1696150812, 35.2987, 0.4336, 1585),
  ('DEVICE3', 1696150813, 40.0000, 0.4226, 1560)
;
```

Now run the following RESAMPLE query:

```sqlexample
SELECT * FROM sensor_data_unixtime
  RESAMPLE(USING unixtime INCREMENT BY 1) ORDER BY unixtime;
```

```output
+-----------+------------+----------+-----------+-----------+
| DEVICE_ID |   UNIXTIME | AVG_TEMP | VIBRATION | MOTOR_RPM |
|-----------+------------+----------+-----------+-----------|
| DEVICE3   | 1696150802 |  36.1103 |    0.4226 |      1560 |
| DEVICE3   | 1696150803 |  35.2987 |    0.4326 |      1561 |
| DEVICE3   | 1696150804 |  40.0001 |    0.3221 |      1562 |
| DEVICE3   | 1696150805 |  38.0422 |    0.3333 |      1589 |
| DEVICE3   | 1696150806 |     NULL |      NULL |      NULL |
| DEVICE3   | 1696150807 |  33.1524 |    0.4865 |      1499 |
| DEVICE3   | 1696150808 |  32.0422 |    0.4221 |      1498 |
| DEVICE3   | 1696150809 |  31.1519 |    0.4751 |      1600 |
| DEVICE3   | 1696150810 |  29.1524 |    0.4639 |      1605 |
| DEVICE3   | 1696150811 |     NULL |      NULL |      NULL |
| DEVICE3   | 1696150812 |  35.2987 |    0.4336 |      1585 |
| DEVICE3   | 1696150813 |  40.0000 |    0.4226 |      1560 |
+-----------+------------+----------+-----------+-----------+
```

The following query fails because the INCREMENT BY expression must be a positive numeric constant when the USING column is numeric:

```sqlexample
SELECT * FROM sensor_data_unixtime
  RESAMPLE(USING unixtime INCREMENT BY INTERVAL '1 second') ORDER BY unixtime;
```

```output
009954 (42601): SQL compilation error:
RESAMPLE INCREMENT BY has to be numeric type when USING parameter is numeric.
```

### RESAMPLE example that returns generated rows only

The following example resamples the `march_temps` table (as created in [Using the RESAMPLE clause](../../user-guide/querying-time-series-data.md)) and includes
metadata columns named `generated_row` and `bucket_start` in the result:

```sqlexample
CREATE OR REPLACE TABLE march_temps_every_five_mins AS
  SELECT * FROM march_temps
    RESAMPLE(
      USING observed
      INCREMENT BY INTERVAL '5 minutes'
      PARTITION BY city, county
      METADATA_COLUMNS IS_GENERATED() AS generated_row, BUCKET_START()
      )
  ORDER BY observed;
```

The following query returns only the generated rows from the `march_temps_every_five_mins` table:

```sqlexample
SELECT * FROM march_temps_every_five_mins
  WHERE generated_row = 'True';
```

```output
+-------------------------+-------------+------------------+----------------+---------------+-------------------------+
| OBSERVED                | TEMPERATURE | CITY             | COUNTY         | GENERATED_ROW | BUCKET_START            |
|-------------------------+-------------+------------------+----------------+---------------+-------------------------|
| 2025-03-15 09:45:00.000 |        NULL | Big Bear City    | San Bernardino | True          | 2025-03-15 09:45:00.000 |
| 2025-03-15 09:50:00.000 |        NULL | Big Bear City    | San Bernardino | True          | 2025-03-15 09:50:00.000 |
| 2025-03-15 10:00:00.000 |        NULL | South Lake Tahoe | El Dorado      | True          | 2025-03-15 10:00:00.000 |
| 2025-03-15 10:00:00.000 |        NULL | Big Bear City    | San Bernardino | True          | 2025-03-15 10:00:00.000 |
| 2025-03-15 10:05:00.000 |        NULL | South Lake Tahoe | El Dorado      | True          | 2025-03-15 10:05:00.000 |
| 2025-03-15 10:05:00.000 |        NULL | Big Bear City    | San Bernardino | True          | 2025-03-15 10:05:00.000 |
| 2025-03-15 10:15:00.000 |        NULL | Big Bear City    | San Bernardino | True          | 2025-03-15 10:15:00.000 |
+-------------------------+-------------+------------------+----------------+---------------+-------------------------+
```

### RESAMPLE example that uses BUCKET_START() to aggregate resampled rows

The following example uses the `bucket_start` metadata column to aggregate resampled rows. The query counts the number of observations
per city that have the same bucket start time, given a resampled result set that is incremented by a 1-day interval. To run this
example, create the `march_temps` table, as described in [Using the RESAMPLE clause](../../user-guide/querying-time-series-data.md).

```sqlexample
SELECT bucket_start, county, COUNT(*)
  FROM march_temps
    RESAMPLE(
      USING observed
      INCREMENT BY INTERVAL '1 day'
      METADATA_COLUMNS IS_GENERATED(), BUCKET_START()
      )
  WHERE IS_GENERATED = 'False'
  GROUP BY bucket_start, county;
```

```output
+-------------------------+----------------+----------+
| BUCKET_START            | COUNTY         | COUNT(*) |
|-------------------------+----------------+----------|
| 2025-03-15 00:00:00.000 | El Dorado      |        4 |
| 2025-03-15 00:00:00.000 | San Bernardino |        4 |
+-------------------------+----------------+----------+
```

### RESAMPLE example that uses BUCKET_START() to filter out non-uniform rows

You can use the `bucket_start` metadata column to filter out non-uniform data from a resampled result set. For example:

```sqlexample
SELECT *
  FROM march_temps
    RESAMPLE(
      USING observed
      INCREMENT BY INTERVAL '5 minutes'
      METADATA_COLUMNS BUCKET_START() AS bucket_first_row
      )
  WHERE observed = bucket_first_row
  ORDER BY observed;
```

This query resamples the table, then removes two original rows that don’t conform to the 5-minute interval (those with
values `09:49:00` and `10:18:00`).

---
title: Reserved & limited keywords
source: https://docs.snowflake.com/en/sql-reference/reserved-keywords.md
section: SQL General Reference
---

# Reserved & limited keywords

Snowflake SQL reserves all ANSI keywords (with the exception of type keywords such as CHAR, DATE, DECIMAL, etc.), as well as some additional keywords (ASC, DESC, MINUS, etc.) that are reserved by
other popular databases. Additionally, Snowflake reserves keywords REGEXP and RLIKE (which function like the ANSI reserved keyword LIKE) and SOME (which is a synonym for the ANSI reserved keyword ANY).

To avoid parsing ambiguities, Snowflake SQL also prohibits the use of keywords such as LEFT, OUTER, JOIN, etc. as table names or aliases in the FROM list, and the use of keywords such as TRUE, FALSE, CASE,
etc. as column references in scalar expressions.

The following table provides the list of reserved keywords in Snowflake and keywords that are not strictly reserved, but have usage limitations:

| Keyword | Comment |
| --- | --- |
| **A** |  |
| ACCOUNT | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| ALL | Reserved by ANSI. |
| ALTER | Reserved by ANSI. |
| AND | Reserved by ANSI. |
| ANY | Reserved by ANSI. |
| AS | Reserved by ANSI. |
| **B** |  |
| BETWEEN | Reserved by ANSI. |
| BY | Reserved by ANSI. |
| **C** |  |
| CASE | Cannot be used as column reference in a scalar expression. |
| CAST | Cannot be used as column reference in a scalar expression. |
| CHECK | Reserved by ANSI. |
| COLUMN | Reserved by ANSI. |
| CONNECT | Reserved by ANSI. |
| CONNECTION | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| CONSTRAINT | Cannot be used as a column name in CREATE TABLE DDL. |
| CREATE | Reserved by ANSI. |
| CROSS | Cannot be used as table name or alias in a FROM clause. |
| CURRENT | Reserved by ANSI. |
| CURRENT_DATE | Cannot be used as column name (reserved by ANSI). |
| CURRENT_TIME | Cannot be used as column name (reserved by ANSI). |
| CURRENT_TIMESTAMP | Cannot be used as column name (reserved by ANSI). |
| CURRENT_USER | Cannot be used as column name (reserved by ANSI). |
| **D** |  |
| DATABASE | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| DELETE | Reserved by ANSI. |
| DISTINCT | Reserved by ANSI. |
| DROP | Reserved by ANSI. |
| **E** |  |
| ELSE | Reserved by ANSI. |
| EXISTS | Reserved by ANSI. |
| **F** |  |
| FALSE | Cannot be used as column reference in a scalar expression. |
| FOLLOWING | Reserved by ANSI. |
| FOR | Reserved by ANSI. |
| FROM | Reserved by ANSI. |
| FULL | Cannot be used as table name or alias in a FROM clause. |
| **G** |  |
| GRANT | Reserved by ANSI. |
| GROUP | Reserved by ANSI. |
| GSCLUSTER | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| **H** |  |
| HAVING | Reserved by ANSI. |
| **I** |  |
| ILIKE | Reserved by Snowflake. |
| IN | Reserved by ANSI. |
| INCREMENT | Reserved by Snowflake and others. |
| INNER | Cannot be used as table name or alias in a FROM clause. |
| INSERT | Reserved by ANSI. |
| INTERSECT | Reserved by ANSI. |
| INTO | Reserved by ANSI. |
| IS | Reserved by ANSI. |
| ISSUE | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| **J** |  |
| JOIN | Cannot be used as table name or alias in a FROM clause. |
| **L** |  |
| LATERAL | Cannot be used as table name or alias in a FROM clause. |
| LEFT | Cannot be used as table name or alias in a FROM clause. |
| LIKE | Reserved by ANSI. |
| LOCALTIME | Cannot be used as column name (reserved by ANSI). |
| LOCALTIMESTAMP | Cannot be used as column name (reserved by ANSI). |
| **M** |  |
| MINUS | Reserved by Snowflake and others. |
| **N** |  |
| NATURAL | Cannot be used as table name or alias in a FROM clause. |
| NOT | Reserved by ANSI. |
| NULL | Reserved by ANSI. |
| **O** |  |
| OF | Reserved by ANSI. |
| ON | Reserved by ANSI. |
| OR | Reserved by ANSI. |
| ORDER | Reserved by ANSI. |
| ORGANIZATION | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| **Q** |  |
| QUALIFY | Reserved by Snowflake. |
| **R** |  |
| REGEXP | Reserved by Snowflake. |
| REVOKE | Reserved by ANSI. |
| RIGHT | Cannot be used as table name or alias in a FROM clause. |
| RLIKE | Reserved by Snowflake. |
| ROW | Reserved by ANSI. |
| ROWS | Reserved by ANSI. |
| **S** |  |
| SAMPLE | Reserved by ANSI. |
| SCHEMA | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| SELECT | Reserved by ANSI. |
| SET | Reserved by ANSI. |
| SOME | Reserved by Snowflake. |
| START | Reserved by ANSI. |
| **T** |  |
| TABLE | Reserved by ANSI. |
| TABLESAMPLE | Reserved by ANSI. |
| THEN | Reserved by ANSI. |
| TO | Reserved by ANSI. |
| TRIGGER | Reserved by ANSI. |
| TRUE | Cannot be used as column reference in a scalar expression. |
| TRY_CAST | Cannot be used as column reference in a scalar expression. |
| **U** |  |
| UNION | Reserved by ANSI. |
| UNIQUE | Reserved by ANSI. |
| UPDATE | Reserved by ANSI. |
| USING | Cannot be used as table name or alias in a FROM clause. |
| **V** |  |
| VALUES | Reserved by ANSI. |
| VIEW | Cannot be used as an identifier in a SHOW command (e.g. ‘SHOW … IN <identifier>’). |
| **W** |  |
| WHEN | Cannot be used as column reference in a scalar expression. |
| WHENEVER | Reserved by ANSI. |
| WHERE | Reserved by ANSI. |
| WINDOW | Reserved by ANSI. |
| WITH | Reserved by ANSI. |

---
title: RESET_PRIVACY_BUDGET
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/reset_privacy_budget.md
section: SQL General Reference
---

# RESET_PRIVACY_BUDGET

Resets the cumulative privacy loss of a [privacy budget](../../user-guide/diff-privacy/differential-privacy-overview.md) to 0.

## Syntax

```sqlsyntax
SNOWFLAKE.DATA_PRIVACY.RESET_PRIVACY_BUDGET(
  '<privacy_policy_name>',
  '<budget_name>',
  '<organization_name>',
  '<account_name>')
```

## Arguments

`'privacy_policy_name'`
:   Name of the [privacy policy](../../user-guide/diff-privacy/differential-privacy-admin-privacy-policies.md) that specifies the privacy
    budget. Must be a fully qualified name that includes the database and schema.

`'budget_name'`
:   Name of a privacy budget.

`'organization_name'`
:   Name of the organization that contains the account in which the analyst is incurring privacy loss.

`'account_name'`
:   Name of the account in which the analyst is incurring privacy loss, specified using the [account name format](../../user-guide/admin-account-identifier.md)
    of the account identifier.

## Usage notes

* Globally defined stored procedures utilize caller’s rights. For more details, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* Cumulative privacy loss is reset the next time a query incurs privacy loss. If you view the privacy budget after calling
  RESET_PRIVACY_BUDGET but before the first query incurs privacy loss, the cumulative privacy loss will not be 0.

## Examples

Suppose the `my_policy` privacy policy includes the `analyst_budget` privacy budget. To reset the cumulative privacy loss incurred by
users associated with the `analysts_budget` privacy budget who are executing their queries in the `companyorg.account_123` account:

```sqlexample
CALL SNOWFLAKE.DATA_PRIVACY.RESET_PRIVACY_BUDGET(
  'my_policy_db.my_policy_schema.my_policy',
  'analyst_budget',
  'companyorg',
  'account_123');
```

---
title: RETURN (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/return.md
section: SQL General Reference
---

# RETURN (Snowflake Scripting)

Returns the value of a specified expression.

For more information about returning values, see [Returning a value](../../developer-guide/snowflake-scripting/return.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

## Syntax

```sqlsyntax
RETURN <expression>;
```

Where:

> `expression`
> :   An expression that evaluates to the value to return.

## Usage notes

* A RETURN statement can be run in:

  + A block in a [stored procedure](../../developer-guide/stored-procedure/stored-procedures-overview.md) or
    [Snowflake Scripting user-defined function (UDF)](../../developer-guide/udf/sql/udf-sql-procedural-functions.md).
  + An [anonymous block](../../developer-guide/snowflake-scripting/blocks.md).
* A RETURN statement returns one of the following types:

  + A [SQL data type](../../sql-reference-data-types.md)
  + A table. Use `TABLE(...)` in the `RETURN` statement.

    If your block is in a stored procedure, you must also specify the `RETURNS TABLE...` clause in the
    [CREATE PROCEDURE](../sql/create-procedure.md) statement.

    > **Note:**
    >
    > Currently, in the `RETURNS TABLE(...)` clause, you can’t specify GEOGRAPHY as a column type. This
    > applies whether you are creating a stored or anonymous procedure.
    >
    > ```sqlexample
    > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
    >   RETURNS TABLE(g GEOGRAPHY)
    >   ...
    > ```
    >
    > ```sqlexample
    > WITH test_return_geography_table_1() AS PROCEDURE
    >   RETURNS TABLE(g GEOGRAPHY)
    >   ...
    > CALL test_return_geography_table_1();
    > ```
    >
    > If you attempt to specify GEOGRAPHY as a column type, calling the stored procedure results in the error:
    >
    > ```none
    > Stored procedure execution error: data type of returned table does not match expected returned table type
    > ```
    >
    > To work around this issue, you can omit the column arguments and types in `RETURNS TABLE()`.
    >
    > ```sqlexample
    > CREATE OR REPLACE PROCEDURE test_return_geography_table_1()
    >   RETURNS TABLE()
    >   ...
    > ```
    >
    > ```sqlexample
    > WITH test_return_geography_table_1() AS PROCEDURE
    >   RETURNS TABLE()
    >   ...
    > CALL test_return_geography_table_1();
    > ```

    If you want to return the data that a RESULTSET points to, pass the RESULTSET to TABLE(…), as shown in the example below:

    ```sqlexample
    CREATE PROCEDURE ...
    RETURNS TABLE(...)
    ...
        RETURN TABLE(my_result_set);
    ...
    ```

    See [Returning a RESULTSET as a table](../../developer-guide/snowflake-scripting/resultsets.md).
* You can set a variable to the return value of a stored procedure. For more information, see
  [Using the value returned from a stored procedure call](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

## Examples

This example declares a variable named `my_var` for use in a Snowflake Scripting anonymous block and
then returns the value of the variable:

```sqlexample
DECLARE
  my_var VARCHAR;
BEGIN
  my_var := 'Snowflake';
  RETURN my_var;
END;
```

Note: If you use [Snowflake CLI](../../developer-guide/snowflake-cli/index.md), [SnowSQL](../../user-guide/snowsql.md), the Classic Console, or the
`execute_stream` or `execute_string` method in [Python Connector](../../developer-guide/python-connector/python-connector.md)
code, use this example instead (see [Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE
$$
DECLARE
  my_var VARCHAR;
BEGIN
  my_var := 'Snowflake';
  RETURN my_var;
END;
$$;
```

---
title: SAMPLE / TABLESAMPLE
source: https://docs.snowflake.com/en/sql-reference/constructs/sample.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# SAMPLE / TABLESAMPLE

Returns a subset of rows sampled randomly from the specified table. You can specify different types of sampling methods, and
you can sample a fraction of a table or a fixed number of rows:

* When you sample a fraction of a table, with a specified probability for including a given row, the number of rows returned depends
  on the size of the table and the requested probability. You can specify a seed to make the sampling deterministic.
* When you sample a fixed, specified number of rows, the query returns the exact number of specified rows unless the table
  contains fewer rows.

SAMPLE and TABLESAMPLE are synonymous and can be used interchangeably.

## Syntax

```sqlsyntax
SELECT ...
FROM ...
  { SAMPLE | TABLESAMPLE } [ samplingMethod ]
[ ... ]
```

Where:

> ```sqlsyntax
> samplingMethod ::= { { BERNOULLI | ROW } ( { <probability> | <num> ROWS } ) |
>                      { SYSTEM | BLOCK } ( <probability> ) [ { REPEATABLE | SEED } ( <seed> ) ] }
> ```

## Parameters

`{ BERNOULLI | ROW }` or . `{ SYSTEM | BLOCK }`
:   Specifies the sampling method to use:

    * `BERNOULLI` (or `ROW`): Includes each row with a `probability` of `p/100`.
      This method is similar to flipping a weighted coin *for each row*.
    * `SYSTEM` (or `BLOCK`): Includes each block of rows with a `probability` of `p/100`.
      This method is similar to flipping a weighted coin *for each block of rows*. This method doesn’t support fixed-size sampling.

    The sampling method is optional. If no method is specified, the default is `BERNOULLI`.

`probability` or . `num ROWS`
:   Specifies whether to sample based on a fraction of the table or a fixed number of rows in the table, where:

    * `probability` specifies the percentage probability to use for selecting the sample. Can be any decimal number
      between `0` (no rows selected) and `100` (all rows selected) inclusive.
    * `num` specifies the number of rows (up to 1,000,000) to sample from the table. Can be any integer between
      `0` (no rows selected) and `1000000` inclusive.

    In addition to using literals to specify `probability` or `num ROWS`, you can also use session or bind variables.

`{ REPEATABLE | SEED ( seed ) }`
:   Specifies a seed value to make the sampling deterministic. Can be any integer between `0` and `2147483647` inclusive.
    This parameter only applies to `SYSTEM` and `BLOCK` sampling.

    In addition to using literals to specify `seed`, you can also use session or bind variables.

## Usage notes

* The following keywords can be used interchangeably:

  > + `SAMPLE` and `TABLESAMPLE`
  > + `BERNOULLI` and `ROW`
  > + `SYSTEM` and `BLOCK`
  > + `REPEATABLE` and `SEED`
* The number of rows returned depends on the sampling method specified and whether the sample is based on a fraction of the table or
  a fixed number of rows in the table:

  Fraction-based:
  :   + For `BERNOULLI` or `ROW` sampling, the expected number of returned rows is `(p/100)*n`. For `SYSTEM`
        or `BLOCK` sampling, the sample might be biased, in particular for small tables.

        > **Note:**
        >
        > For very large tables, the difference between the two methods should be negligible.
        >
        > Also, because sampling is a probabilistic process, the number of rows returned isn’t exactly equal to `(p/100)*n` rows, but it is close to this value.
      + If no `seed` is specified, SAMPLE generates different results when the same query is repeated.
      + If a table doesn’t change, and the same `seed` and `probability` are specified, SAMPLE generates the same result. However,
        sampling on a copy of a table might not return the same result as sampling on the original table, even if the same `probability` and
        `seed` are specified.

  Fixed-size:
  :   + If the table is larger than the requested number of rows, the number of requested rows is always returned.
      + If the table is smaller than the requested number of rows, the entire table is returned.
      + When you use `SYSTEM` or `BLOCK` sampling, Snowflake samples data in storage-level units rather than individual rows. In most cases,
        a block corresponds to a single micro-partition.

        For very small tables with relatively few micro-partitions, sampling operates at a finer granularity than a full micro-partition.
        In these cases, a block represents a subset of rows that might span micro-partitions. As a result, the returned sample more
        closely reflects the requested percentage, even when the table contains only a small number of micro-partitions.
      + `SYSTEM`, `BLOCK`, and `SEED (seed)` aren’t supported for fixed-size sampling. For example, the following queries produce errors:

        ```sqlexample
        SELECT * FROM example_table SAMPLE SYSTEM (10 ROWS);

        SELECT * FROM example_table SAMPLE ROW (10 ROWS) SEED (99);
        ```
* Sampling with `SEED (seed)` isn’t supported on views or subqueries. For example, the following query produces an error:

  ```sqlexample
  SELECT * FROM (SELECT * FROM example_table) SAMPLE (1) SEED (99);
  ```
* Sampling the result of a join is allowed, but only when both of the following are true:

  + The sampling is row-based (Bernoulli).
  + The sampling doesn’t use a seed.

  The sampling is done after the join has been fully processed. Therefore, sampling doesn’t reduce the number of
  rows joined and doesn’t reduce the cost of the join. The Examples section includes an example of
  sampling the result of a join.
* Both the [LIMIT](limit.md) clause and the SAMPLE clause return a subset of rows from a table. When you use the
  LIMIT clause, Snowflake returns the specified number of rows in the fastest way possible. When you use the SAMPLE
  clause, Snowflake returns rows based on the sampling method specified in the clause.

## Performance considerations

* `SYSTEM` or `BLOCK` sampling is often faster than `BERNOULLI` or `ROW` sampling.
* Sampling without a `seed` is often faster than sampling with a `seed`.
* Fixed-size sampling might be slower than equivalent fraction-based sampling because fixed-size sampling prevents some query optimization.

## Examples

The following examples use the SAMPLE clause.

### Fraction-based row sampling

Return a sample of a table in which each row has a 10% probability of being included in the sample:

```sqlexample
SELECT * FROM testtable SAMPLE (10);
```

Return a sample of a table in which each row has a 20.3% probability of being included in the sample:

```sqlexample
SELECT * FROM testtable TABLESAMPLE BERNOULLI (20.3);
```

Return an entire table, including all rows in the table:

```sqlexample
SELECT * FROM testtable TABLESAMPLE (100);
```

Return an empty sample:

```sqlexample
SELECT * FROM testtable SAMPLE ROW (0);
```

### Sampling with joins

This example shows how to sample multiple tables in a join. It samples 25% of the rows in `table1` and
50% of the rows in `table2`:

```sqlexample
SELECT i, j
  FROM
    table1 AS t1 SAMPLE (25)
      INNER JOIN
    table2 AS t2 SAMPLE (50)
  WHERE t2.j = t1.i;
```

The `SAMPLE` clause applies to only one table, not all preceding tables or the entire expression prior to the
`SAMPLE` clause. The following `JOIN` operation joins all rows of `table1` to a sample of 50% of the rows in `table2`.
It doesn’t sample 50% of the rows that result from joining all rows in both tables:

```sqlexample
SELECT i, j
  FROM table1 AS t1 INNER JOIN table2 AS t2 SAMPLE (50)
  WHERE t2.j = t1.i;
```

To apply the `SAMPLE` clause to the result of a join, rather than to the individual tables in the join,
apply the join to an inline view that contains the result of the join. For example, perform
the join as a subquery, and then apply the SAMPLE to the result of the subquery. The example below samples
approximately 1% of the rows returned by the join:

```sqlexample
SELECT *
  FROM (
       SELECT *
         FROM t1 JOIN t2
           ON t1.a = t2.c
       ) SAMPLE (1);
```

### Fraction-based block sampling with seeds

Return a sample of a table in which each block of rows has a 3% probability of being included in the sample, and set the seed to 82:

```sqlexample
SELECT * FROM testtable SAMPLE SYSTEM (3) SEED (82);
```

Return a sample of a table in which each block of rows has a 0.012% probability of being included in the sample, and set the seed to 99992:

```sqlexample
SELECT * FROM testtable SAMPLE BLOCK (0.012) REPEATABLE (99992);
```

> **Note:**
>
> If either of these queries is run again without making any changes to the table, they return the same sample set.

### Fixed-size row sampling

Return a fixed-size sample of 10 rows in which each row has a `min(1, 10/n)` probability of being included in the sample, where `n` is the number of rows in the table:

```sqlexample
SELECT * FROM testtable SAMPLE (10 ROWS);
```

---
title: Sample asynchronous remote service for AWS
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-sample-asynchronous.md
section: SQL General Reference
---

# Sample asynchronous remote service for AWS

This topic contains a sample asynchronous AWS Lambda Function (remote service). You can create this sample function by following
the same steps described in [Step 1: Create the remote service (AWS Lambda function) in the Management Console](external-functions-creating-aws-ui-remote-service.md).

## Overview of the code

This section of the documentation provides information about creating an asynchronous external function on AWS.
(Before implementing your first asynchronous external function, you might want to read the
[conceptual overview](external-functions-implementation.md) of asynchronous
external functions.)

On AWS, asynchronous remote services must overcome the following restrictions:

* Because the HTTP POST and GET are separate requests, the remote service must keep information about the workflow
  launched by the POST request so that the state can later be queried by the GET request.

  Typically, each HTTP POST and HTTP GET invokes a separate instance of the handler function(s) in a separate
  process or thread. The separate instances do not share memory. In order for the GET handler to read the
  status or the processed data, the GET handler must access a shared storage resource that
  is available on AWS.
* The only way for the POST handler to send the initial HTTP 202 response code is via a `return` statement (or
  equivalent), which terminates the execution of the handler. Therefore, prior to returning HTTP 202, the POST handler
  must launch an independent process (or thread) to do the actual data
  processing work of the remote service. This independent process typically needs access to the storage that is visible
  to the GET handler.

One way for an asynchronous remote service to overcome these restrictions is to use 3 processes (or threads) and shared storage:

In this model, the processes have the following responsibilities:

* The HTTP POST handler:

  + Reads the input data. In a Lambda Function, this is read from the body of the handler function’s `event` input
    parameter.
  + Reads the batch ID. In a Lambda Function, this is read from the header of the `event` input parameter.
  + Starts the data processing process, and passes it the data and the batch ID. The data is usually passed
    during the call, but could be passed by writing it to external storage.
  + Records the batch ID in shared storage that both the data processing process and
    the HTTP GET handler process can access.
  + If needed, records that the processing of this batch has not yet finished.
  + Returns HTTP 202 if no error was detected.
* The data processing code:

  + Reads the input data.
  + Processes the data.
  + Makes the result available to the GET handler (either by writing the result data to shared storage, or by
    providing an API through which to query the results).
  + Typically, updates this batch’s status (e.g. from `IN_PROGRESS` to `SUCCESS`) to indicate that the
    results are ready to be read.
  + Exits. Optionally, this process can return an error indicator. Snowflake does not see this directly
    (Snowflake sees only the HTTP return codes from the POST handler and GET handler), but returning an
    error indicator from the data processing process might help during debugging.
* The GET handler:

  + Reads the batch ID. In a Lambda Function, this is read from the header of the `event` input parameter.
  + Reads the storage to get the current status of this batch (e.g. `IN_PROGRESS` or `SUCCESS`).
  + If the processing is still in progress, then return 202.
  + If the processing has finished successfully, then:

    - Read the results.
    - Clean up storage.
    - Return the results along with HTTP code 200.
  + If the stored status indicates an error, then:

    - Clean up storage.
    - Return an error code.

  Note that the GET handler might be called multiple times for a batch if the processing takes long
  enough that multiple HTTP GET requests are sent.

There are many possible variations on this model. For example:

* The batch ID and status could be written at the start of the data processing process rather than at the end of
  the POST process.
* The data processing could be done in a separate function (e.g. a separate Lambda function) or even as a
  completely separate service.
* The data processing code does not necessarily need to write to shared storage. Instead, the processed data could
  be made available another way. For example, an API could accept the batch ID as a parameter and return the data.

The implementation code should take into account the possibility that the processing will take too long
or will fail, and therefore any partial results must be cleaned up to avoid wasting storage space.

The storage mechanism must be sharable across multiple processes (or threads). Possible storage mechanisms include:

* Storage mechanisms provided by AWS, such as:

  + Disk space (e.g. [Amazon Elastic File System (EFS)](https://aws.amazon.com/efs/) ).
  + A local database server available through AWS (e.g. [Amazon DynamoDB](https://aws.amazon.com/dynamodb/) ).
* Storage that is outside AWS but accessible from AWS.

The code for each of the 3 processes above can be written as 3 separate Lambda Functions (one for the POST handler,
one for the data processing function, and one for the GET handler), or as a single function that can be invoked in
different ways.

The sample Python code below is a single Lambda Function that can be called separately for the POST, the
data processing, and the GET processes.

## Sample code

This code shows a sample query with output.
The focus in this example is on the three processes and how they interact, not on the shared storage mechanism
(DynamoDB) or data transformation (sentiment analysis). The code is structured to make it easy to replace the
example storage mechanism and data transformation with different ones.

For simplicity, this example:

* Hard-codes some important values (e.g. the AWS region).
* Assumes the existence of some resources (e.g. the Jobs table in Dynamo).

```python
import json
import time
import boto3

HTTP_METHOD_STRING = "httpMethod"
HEADERS_STRING = "headers"
BATCH_ID_STRING = "sf-external-function-query-batch-id"
DATA_STRING = "data"
REGION_NAME = "us-east-2"

TABLE_NAME = "Jobs"
IN_PROGRESS_STATUS = "IN_PROGRESS"
SUCCESS_STATUS = "SUCCESS"

def lambda_handler(event, context):
    # this is called from either the GET or POST
    if (HTTP_METHOD_STRING in event):
        method = event[HTTP_METHOD_STRING]
        if method == "POST":
            return initiate(event, context)
        elif method == "GET":
            return poll(event, context)
        else:
            return create_response(400, "Function called from invalid method")

    # if not called from GET or POST, then this lambda was called to
    # process data
    else:
        return process_data(event, context)

# Reads batch_ID and data from the request, marks the batch_ID as being processed, and
# starts the processing service.
def initiate(event, context):
    batch_id = event[HEADERS_STRING][BATCH_ID_STRING]
    data = json.loads(event["body"])[DATA_STRING]

    lambda_name = context.function_name

    write_to_storage(batch_id, IN_PROGRESS_STATUS, "NULL")
    lambda_response = invoke_process_lambda(batch_id, data, lambda_name)

    # lambda response returns 202, because we are invoking it with
    # InvocationType = 'Event'
    if lambda_response["StatusCode"] != 202:
        response = create_response(400, "Error in initiate: processing lambda not started")
    else:
        response = {
            'statusCode': lambda_response["StatusCode"]
        }

    return response

# Processes the data passed to it from the POST handler. In this example,
# the processing is to perform sentiment analysis on text.
def process_data(event, context):
    data = event[DATA_STRING]
    batch_id = event[BATCH_ID_STRING]

    def process_data_impl(data):
        comprehend = boto3.client(service_name='comprehend', region_name=REGION_NAME)
        # create return rows
        ret = []
        for i in range(len(data)):
            text = data[i][1]
            sentiment_response = comprehend.detect_sentiment(Text=text, LanguageCode='en')
            sentiment_score = json.dumps(sentiment_response['SentimentScore'])
            ret.append([i, sentiment_score])
        return ret

    processed_data = process_data_impl(data)
    write_to_storage(batch_id, SUCCESS_STATUS, processed_data)

    return create_response(200, "No errors in process")

# Repeatedly checks on the status of the batch_ID, and returns the result after the
# processing has been completed.
def poll(event, context):
    batch_id = event[HEADERS_STRING][BATCH_ID_STRING]
    processed_data = read_data_from_storage(batch_id)

    def parse_processed_data(response):
        # in this case, the response is the response from DynamoDB
        response_metadata = response['ResponseMetadata']
        status_code = response_metadata['HTTPStatusCode']

        # Take action depending on item status
        item = response['Item']
        job_status = item['status']
        if job_status == SUCCESS_STATUS:
            # the row number is stored at index 0 as a Decimal object,
            # we need to convert it into a normal int to be serialized to JSON
            data = [[int(row[0]), row[1]] for row in item['data']]
            return {
                'statusCode': 200,
                'body': json.dumps({
                    'data': data
                })
            }
        elif job_status == IN_PROGRESS_STATUS:
            return {
                'statusCode': 202,
                "body": "{}"
            }
        else:
            return create_response(500, "Error in poll: Unknown item status.")

    return parse_processed_data(processed_data)

def create_response(code, msg):
    return {
        'statusCode': code,
        'body': msg
    }

def invoke_process_lambda(batch_id, data, lambda_name):
    # Create payload to be sent to processing lambda
    invoke_payload = json.dumps({
        BATCH_ID_STRING: batch_id,
        DATA_STRING: data
    })

    # Invoke processing lambda asynchronously by using InvocationType='Event'.
    # This allows the processing to continue while the POST handler returns HTTP 202.
    lambda_client = boto3.client('lambda', region_name=REGION_NAME,)
    lambda_response = lambda_client.invoke(
        FunctionName=lambda_name,
        InvocationType='Event',
        Payload=invoke_payload
    )
    # returns 202 on success if InvocationType = 'Event'
    return lambda_response

def write_to_storage(batch_id, status, data):
    # we assume that the table has already been created
    client = boto3.resource('dynamodb')
    table = client.Table(TABLE_NAME)

    # Put in progress item in table
    item_to_store = {
        'batch_id': batch_id,
        'status': status,
        'data': data,
        'timestamp': "{}".format(time.time())
    }
    db_response = table.put_item(
        Item=item_to_store
    )

def read_data_from_storage(batch_id):
    # we assume that the table has already been created
    client = boto3.resource('dynamodb')
    table = client.Table(TABLE_NAME)

    response = table.get_item(Key={'batch_id': batch_id},
                          ConsistentRead=True)
    return response
```

## Sample call and output

Here is a sample call to the asynchronous external function, along with sample output, including the sentiment
analysis results:

```sqlexample
create table test_tb(a string);
insert into test_tb values
    ('hello world'),
    ('I am happy');
select ext_func_async(a) from test_tb;

Row | EXT_FUNC_ASYNC(A)
0   | {"Positive": 0.47589144110679626, "Negative": 0.07314028590917587, "Neutral": 0.4493273198604584, "Mixed": 0.0016409909585490823}
1   | {"Positive": 0.9954453706741333, "Negative": 0.00039307220140472054, "Neutral": 0.002452891319990158, "Mixed": 0.0017087293090298772}
```

## Notes about the sample code

* The data processing function is invoked by calling:

  ```python
  lambda_response = lambda_client.invoke(
      ...
      InvocationType='Event',
      ...
  )
  ```

  The InvocationType should be ‘Event’, as shown above, because the 2nd process (or thread) must be asynchronous and
  `Event` is the only type of non-blocking call available through the `invoke()` method.
* The data processing function returns an HTTP 200 code. However, this HTTP 200 code is not returned directly to
  Snowflake. Snowflake does not see any HTTP 200 until a GET polls the status and sees that the data processing
  function finished processing this batch successfully.

---
title: Sample synchronous Lambda function
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-sample-synchronous.md
section: SQL General Reference
---

# Sample synchronous Lambda function

This topic includes code for a sample Lambda Function that you can use as-is to create your first external function, or that
you can use as a starting point for a custom Lambda Function.

This function is [synchronous](external-functions-implementation.md).

(A separate [asynchronous](external-functions-implementation.md)
[example](external-functions-creating-aws-sample-asynchronous.md) is also available.)

This example is written in Python.

This sample synchronous Lambda Function extracts each row, processes it, and returns a value for that row. Each output value is
simply an array that contains a copy of each of the values in the input row. The returned array is treated as a SQL VARIANT by
Snowflake.

> ```python
> import json
>
> def lambda_handler(event, context):
>
>     # 200 is the HTTP status code for "ok".
>     status_code = 200
>
>     # The return value will contain an array of arrays (one inner array per input row).
>     array_of_rows_to_return = [ ]
>
>     try:
>         # From the input parameter named "event", get the body, which contains
>         # the input rows.
>         event_body = event["body"]
>
>         # Convert the input from a JSON string into a JSON object.
>         payload = json.loads(event_body)
>         # This is basically an array of arrays. The inner array contains the
>         # row number, and a value for each parameter passed to the function.
>         rows = payload["data"]
>
>         # For each input row in the JSON object...
>         for row in rows:
>             # Read the input row number (the output row number will be the same).
>             row_number = row[0]
>
>             # Read the first input parameter's value. For example, this can be a
>             # numeric value or a string, or it can be a compound value such as
>             # a JSON structure.
>             input_value_1 = row[1]
>
>             # Read the second input parameter's value.
>             input_value_2 = row[2]
>
>             # Compose the output based on the input. This simple example
>             # merely echoes the input by collecting the values into an array that
>             # will be treated as a single VARIANT value.
>             output_value = ["Echoing inputs:", input_value_1, input_value_2]
>
>             # Put the returned row number and the returned value into an array.
>             row_to_return = [row_number, output_value]
>
>             # ... and add that array to the main array.
>             array_of_rows_to_return.append(row_to_return)
>
>         json_compatible_string_to_return = json.dumps({"data" : array_of_rows_to_return})
>
>     except Exception as err:
>         # 400 implies some type of error.
>         status_code = 400
>         # Tell caller what this function could not handle.
>         json_compatible_string_to_return = event_body
>
>     # Return the return value and HTTP status code.
>     return {
>         'statusCode': status_code,
>         'body': json_compatible_string_to_return
>     }
> ```

> **Note:**
>
> This sample code assumes that you are using Lambda proxy integration, as Snowflake recommends in the instructions to
> [create the API Gateway endpoint](external-functions-creating-aws-ui-proxy-service.md).

---
title: Securing an external function
source: https://docs.snowflake.com/en/sql-reference/external-functions-security.md
section: SQL General Reference
---

# Securing an external function

This topic describes platform-independent details related to securing external functions.

## Access control

### External functions

External functions, like any user-defined functions (UDFs), follow [access control](../user-guide/security-access-control-overview.md)
rules:

* External functions have an owner.
* The owner must grant callers (other than the owner) appropriate privilege(s) on the function.

However, external functions have some additional privilege requirement(s):

* Because an external function requires an API integration, the author of the external function must be
  granted USAGE privilege on the API integration.

For more information about UDFs and access control, see [Access control privileges](../user-guide/security-access-control-privileges.md).

### API integrations

#### Privileges and API integrations

An API integration is a database object. To create an API integration, you need ACCOUNTADMIN privileges or
a Snowflake role with the CREATE INTEGRATION privilege. Account administrators can grant and revoke ownership
and usage privileges on each API integration.

#### Using the API_KEY option in CREATE API INTEGRATION

Some proxy services (API Gateways) require users to provide subscription information (or other product-related information) when calling
the proxy service. The subscription information can be used to authenticate that the user is a paying customer, enforce usage quotas, etc.

Snowflake now supports *API keys*, also called *subscription keys* (Microsoft Azure’s term), which are alphanumeric string values that a
developer can distribute to users who need to provide subscription information.

Users can provide these keys to Snowflake by using the API_KEY clause of the CREATE API INTEGRATION statement or the ALTER API INTEGRATION
statement. The API_KEY clause is optional; you can omit it if the service does not need a key.

An API_KEY is in addition to, not a substitute for, IAM (Identity and Access Management).

API keys are sensitive. They are not displayed in:

* Query history commands.
* DESCRIBE INTEGRATION commands.
* DESCRIBE API INTEGRATION commands.

The developer of the service chooses how to format the key. The key is opaque to Snowflake, and Snowflake does not validate it.

You can read more about API keys on specific platforms by following the links below:

* [AWS API keys](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-api-usage-plans.html)
* [Microsoft Azure subscription keys](https://docs.microsoft.com/en-us/azure/api-management/api-management-subscriptions#what-are-subscriptions)

## Secure the proxy service

Unless your external function is intended to be publicly accessible,
Snowflake strongly recommends securing your proxy service endpoints.

Snowflake uses credential-less API integration objects to authenticate to the proxy service endpoint.
Credential-less API integrations separate responsibilities between administrators and
users. An API integration allows an administrator to create a trust policy between Snowflake
and the cloud provider using the cloud provider’s native authentication and authorization mechanism.
When Snowflake connects to the cloud provider, the cloud provider authenticates and authorizes
access through this trust policy. Using a specific API integration, the administrator can also
specify an allowed list of endpoints that the API integration object can access; this restricts which
proxy services and resources Snowflake can use, enabling the administrator to enforce organizational policies for
data egress and ingress.

More detailed instructions for securing specific proxy service endpoints, such as an Amazon API Gateway, are in
the platform-specific instructions.

## Secure the remote service

If you created your own remote service, don’t forget to secure that.

The details depend upon the implementation of the remote service and are outside the scope of this document.

In most cases, the remote service should use HTTPS, not HTTP.

## Additional security information

* Communications between Snowflake and the proxy server are encrypted using HTTPS.

### Platform-specific security information

#### AWS

* For AWS, all Snowflake HTTP requests (going to the API Gateway) are signed using AWS sigv4 authentication.
  For more information, see
  [AWS sig4 authentication](https://docs.aws.amazon.com/AmazonS3/latest/API/sig-v4-authenticating-requests.html) .
* Restrict access to your API Gateway endpoints by adding a resource policy. For more information, see
  [Secure your Amazon API Gateway endpoint](external-functions-creating-aws-ui-proxy-service.md).
* If you use [private endpoints](external-functions-creating-aws-planning.md), you might want to read about
  [PrivateLink](../user-guide/admin-security-privatelink.md).

---
title: SEMANTIC_VIEW
source: https://docs.snowflake.com/en/sql-reference/constructs/semantic_view.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# SEMANTIC_VIEW

Specifies the [semantic view](../../user-guide/views-semantic/overview.md) to [query](../../user-guide/views-semantic/querying.md).
You specify SEMANTIC_VIEW(…) in a [FROM](from.md) clause in a [SELECT](../sql/select.md) statement.

> **Note:**
>
> You can’t query [private facts or metrics](../../user-guide/views-semantic/sql.md) or use them in the WHERE condition.

See also:
:   [FROM](from.md), [Querying semantic views](../../user-guide/views-semantic/querying.md)

## Syntax

```sqlsyntax
SEMANTIC_VIEW(
  [<namespace>.]<semantic_view_name>
  [
    {
      METRICS <metric_expr> [ [ AS ] <alias> ] [ , ... ] |
      FACTS <fact_expr>  [ , ... ]
    }
  ]
  [ DIMENSIONS <dimension_expr>  [ [ AS ] <alias> ] [ , ... ] ]
  [ WHERE <predicate> ]
)
```

## Parameters

`[namespace.]semantic_view_name`
:   Specifies the identifier for the semantic view to query.

    If the identifier contains spaces or special characters, the entire string must be enclosed in double quotes.
    Identifiers enclosed in double quotes are also case-sensitive.

    For more information, see [Identifier requirements](../identifiers-syntax.md).

`METRICS metric_expr [ [ AS ] alias ] [ , ... ]`
:   Specifies the metrics that you want to return in the results. You can also specify an expression that includes:

    * Scalar expressions that refer to metrics in the semantic view.
    * Aggregations of dimensions or facts in the semantic view.

    > **Note:**
    >
    > * You can’t specify [private metrics](../../user-guide/views-semantic/sql.md).

    For the names of the metrics:

    * You can qualify the name of the metric (for example, `my_logical_table.my_metric`).

      Using the unqualified name works only if there are no other identifiers with the same unqualified name in the semantic view.
      For example, if a metric and a dimension use the same unqualified name, you must qualify the name of the metric in the query.
    * To specify all metrics in a logical table, use an asterisk as a wildcard, qualified by the logical table name (for example,
      `my_logical_table.*`).

      You can’t specify an asterisk without qualifying it with a table name.

    You can specify an alias for a metric after the name of the metric. You can use the optional AS keyword before the alias.

    Specify the metrics in the order in which they should appear in the results.

`FACTS fact_expr [ , ... ]`
:   Specifies the facts that you want to return in the results. You can also specify scalar expressions that refer to facts or
    dimensions in the semantic view. If you specify a scalar expression, the dimensions and facts in that expression must belong to
    the same logical table.

    > **Note:**
    >
    > You can’t specify [private facts](../../user-guide/views-semantic/sql.md).

    Unlike dimensions specified in the DIMENSIONS clause, the query does not group the facts specified in the FACTS clause.
    Different rows can include the same value for a fact.

    Specify the facts in the order in which they should appear in the results.

`DIMENSIONS dimension_expr [ [ AS ] alias ] [ , ... ]`
:   Specifies the dimensions that you want to return in the results. You can also specify scalar expressions that refer to
    dimensions or facts in the semantic view. If you specify a scalar expression, the dimensions and facts in that expression must
    belong to the same logical table.

    The query groups the results by the dimensions that you specify here. For example, if a logical table includes five distinct
    values for a dimension, specifying that dimension in the DIMENSIONS clause returns five rows.

    For the names of the dimensions:

    * You can qualify the name of the dimension (for example, `my_logical_table.my_dimension`). Using the unqualified name works
      only if there are no other identifiers with the same unqualified name in the semantic view. For example, if a metric and a
      dimension use the same unqualified name, you must qualify the name of the dimension in the query.
    * To specify all dimensions in a logical table, use an asterisk as a wildcard, qualified by the logical table name (for example,
      `my_logical_table.*`).

      You can’t specify an asterisk without qualifying it with a table name.

    You can specify an alias for a dimension after the expression for the dimension. You can use the optional AS keyword before the
    alias.

    If you specify a scalar expression, you can’t refer to dimensions in other semantic views or metrics.

    Specify the dimensions in the order in which they should appear in the results.

    > **Note:**
    >
    > If you are returning a window function metric, you must also return the dimensions that are specified in
    > PARTITION BY `dimension`, PARTITION BY EXCLUDING `dimension`, and ORDER BY `dimension` clauses
    > in the [CREATE SEMANTIC VIEW](../sql/create-semantic-view.md) statement for that semantic view.
    >
    > See [Defining and querying window function metrics](../../user-guide/views-semantic/querying.md).

`WHERE predicate`
:   A boolean expression. The expression can include [logical operators](../operators-logical.md),
    [built-in functions](../../sql-reference-functions.md), and
    [user-defined functions (UDFs)](../../developer-guide/udf/udf-overview.md).

    In the condition, you can only refer to dimensions, facts, and expressions that use dimensions and facts.

    If you specify facts from different entities, the RELATIONSHIPS clause in the semantic view definition must define a
    relationship between these entities.

    This filter condition is applied before the metrics are computed.

## Usage notes

* In the SEMANTIC_VIEW clause, you must specify at least one of the following clauses:

  + METRICS
  + DIMENSIONS
  + FACTS

  You cannot omit all of these clauses from the SEMANTIC_VIEW clause.
* When specifying a combination of these clauses, note the following:

  + You cannot specify FACTS and METRICS in the same SEMANTIC_VIEW clause.
  + Although you can specify both FACTS and DIMENSIONS in a query, you should do so only if the dimensions can uniquely determine
    the facts.

    The query groups the results by dimensions. if the facts do not depend on the dimensions, the results can be
    non-deterministic.
  + If you specify both FACTS and DIMENSIONS, all facts and dimensions used in the query (including those specified in the WHERE
    clause) must be defined in the same logical table.
  + If you specify a dimension and a metric, the logical table for the dimension must be related to the logical table for the
    metric.

    In addition, the logical table for the dimension must have an equal or lower level of granularity than the logical table for
    the metric.

    To determine which dimensions meet this criteria, you can run the
    [SHOW SEMANTIC DIMENSIONS FOR METRIC](../sql/show-semantic-dimensions-for-metric.md) command.

    For details, see [Choosing the dimensions that you can return for a given metric](../../user-guide/views-semantic/querying.md).
* In the DIMENSIONS clause, you can specify an expression that refers to a fact. Similarly, in the FACTS clause, you can specify
  an expression that refers to a dimension. For example:

  ```sqlexample
  -- Dimension expression that refers to a fact
  DIMENSIONS my_table.my_fact

  -- Fact expression that refers to a dimension
  FACTS my_table.my_dimension
  ```

  One of the main differences between using DIMENSIONS and FACTS is that the query groups the results by the dimensions and
  expressions specified in the DIMENSIONS clause.
* In the METRICS clause, you can specify an expression that includes:

  + A scalar expression referring to metrics.
  + An aggregation of dimensions or facts.
* Specify the METRICS, DIMENSIONS, and FACTS clauses in the order in which you want them to appear in the results.

  If you want the dimensions to appear first in the results, specify DIMENSIONS before METRICS. Otherwise, specify METRICS first.

  For example, suppose that you specify the METRICS clause first:

  ```sqlexample
  SELECT * FROM SEMANTIC_VIEW(
      tpch_analysis
      METRICS customer.customer_order_count
      DIMENSIONS customer.customer_name
    )
    ORDER BY customer_name
    LIMIT 5;
  ```

  In the output, the first column is the metric column (`customer_order_count`) and the second column is the dimension column
  (`customer_name`):

  ```output
  +----------------------+--------------------+
  | CUSTOMER_ORDER_COUNT | CUSTOMER_NAME      |
  |----------------------+--------------------|
  |                    6 | Customer#000000001 |
  |                    7 | Customer#000000002 |
  |                    0 | Customer#000000003 |
  |                   20 | Customer#000000004 |
  |                    4 | Customer#000000005 |
  +----------------------+--------------------+
  ```

  If you instead specify the DIMENSIONS clause first:

  ```sqlexample
  SELECT * FROM SEMANTIC_VIEW(
      tpch_analysis
      DIMENSIONS customer.customer_name
      METRICS customer.customer_order_count
    )
    ORDER BY customer_name
    LIMIT 5;
  ```

  In the output, the first column is the dimension column (`customer_name`) and the second column is the metric column
  (`customer_order_count`):

  ```output
  +--------------------+----------------------+
  | CUSTOMER_NAME      | CUSTOMER_ORDER_COUNT |
  |--------------------+----------------------|
  | Customer#000000001 |                    6 |
  | Customer#000000002 |                    7 |
  | Customer#000000003 |                    0 |
  | Customer#000000004 |                   20 |
  | Customer#000000005 |                    4 |
  +--------------------+----------------------+
  ```
* You can use the relation defined by a SEMANTIC_VIEW clause in other SQL constructs, including
  [JOIN](join.md), [PIVOT](pivot.md), [UNPIVOT](unpivot.md),
  [GROUP BY](group-by.md), and [common table expressions (CTEs)](../../user-guide/queries-cte.md).
* The output column headers use the unqualified names of the metrics and dimensions.

  If you have multiple metrics and dimensions with the same names, use a table alias to assign different names to the column
  headers. See [Handling duplicate column names in the output](../../user-guide/views-semantic/querying.md).

## Examples

See [Querying semantic views](../../user-guide/views-semantic/querying.md).

---
title: Semi-structured and structured data functions
source: https://docs.snowflake.com/en/sql-reference/functions-semistructured.md
section: SQL General Reference
---

# Semi-structured and structured data functions

These functions are used with:

* [Semi-structured data formats](../user-guide/semistructured-data-formats.md) (including JSON, Avro, and XML)
* [Semi-structured data types](data-types-semistructured.md) (including VARIANT, OBJECT, and ARRAY)
* [Structured data types](data-types-structured.md) (including structured OBJECT, structured ARRAY, and MAP)

## List of semi-structured and structured data functions

The functions are grouped by type of operation performed:

* Parsing JSON and XML data.
* Creating and manipulating [ARRAYs](data-types-semistructured.md) and [OBJECTs](data-types-semistructured.md).
* Extracting values from semi-structured and structured data (e.g. from an ARRAY, OBJECT, or MAP).
* Converting/casting semi-structured data types and structured data types to/from other data types.
* Determining the data type for values in semi-structured data (i.e. type predicates).

| Sub-category | Function | Notes |
| --- | --- | --- |
| **JSON and XML Parsing** | [CHECK_JSON](functions/check_json.md) |  |
|  | [CHECK_XML](functions/check_xml.md) |  |
|  | [JSON_EXTRACT_PATH_TEXT](functions/json_extract_path_text.md) |  |
|  | [PARSE_JSON](functions/parse_json.md) |  |
|  | [PARSE_XML](functions/parse_xml.md) |  |
|  | [STRIP_NULL_VALUE](functions/strip_null_value.md) |  |
| **Array/Object Creation and Manipulation** | [ARRAY_AGG](functions/array_agg.md) | See also [Aggregate functions](functions-aggregation.md). |
|  | [ARRAY_APPEND](functions/array_append.md) |  |
|  | [ARRAY_CAT](functions/array_cat.md) |  |
|  | [ARRAY_COMPACT](functions/array_compact.md) |  |
|  | [ARRAY_CONSTRUCT](functions/array_construct.md) |  |
|  | [ARRAY_CONSTRUCT_COMPACT](functions/array_construct_compact.md) |  |
|  | [ARRAY_CONTAINS](functions/array_contains.md) |  |
|  | [ARRAY_DISTINCT](functions/array_distinct.md) |  |
|  | [ARRAY_EXCEPT](functions/array_except.md) |  |
|  | [ARRAY_FLATTEN](functions/array_flatten.md) |  |
|  | [ARRAY_GENERATE_RANGE](functions/array_generate_range.md) |  |
|  | [ARRAY_INSERT](functions/array_insert.md) |  |
|  | [ARRAY_INTERSECTION](functions/array_intersection.md) |  |
|  | [ARRAY_MAX](functions/array_max.md) |  |
|  | [ARRAY_MIN](functions/array_min.md) |  |
|  | [ARRAY_POSITION](functions/array_position.md) |  |
|  | [ARRAY_PREPEND](functions/array_prepend.md) |  |
|  | [ARRAY_REMOVE](functions/array_remove.md) |  |
|  | [ARRAY_REMOVE_AT](functions/array_remove_at.md) |  |
|  | [ARRAY_REPEAT](functions/array_repeat.md) |  |
|  | [ARRAY_REVERSE](functions/array_reverse.md) |  |
|  | [ARRAY_SIZE](functions/array_size.md) |  |
|  | [ARRAY_SLICE](functions/array_slice.md) |  |
|  | [ARRAY_SORT](functions/array_sort.md) |  |
|  | [ARRAY_TO_STRING](functions/array_to_string.md) |  |
|  | [ARRAY_UNION_AGG](functions/array_union_agg.md) | See also [Aggregate functions](functions-aggregation.md). |
|  | [ARRAY_UNIQUE_AGG](functions/array_unique_agg.md) | See also [Aggregate functions](functions-aggregation.md). |
|  | [ARRAYS_OVERLAP](functions/arrays_overlap.md) |  |
|  | [ARRAYS_TO_OBJECT](functions/arrays_to_object.md) |  |
|  | [ARRAYS_ZIP](functions/arrays_zip.md) |  |
|  | [OBJECT_AGG](functions/object_agg.md) | See also [Aggregate functions](functions-aggregation.md). |
|  | [OBJECT_CONSTRUCT](functions/object_construct.md) |  |
|  | [OBJECT_CONSTRUCT_KEEP_NULL](functions/object_construct_keep_null.md) |  |
|  | [OBJECT_DELETE](functions/object_delete.md) |  |
|  | [OBJECT_INSERT](functions/object_insert.md) |  |
|  | [OBJECT_PICK](functions/object_pick.md) |  |
|  | [PROMPT](functions/prompt.md) |  |
| **Higher-order** | [FILTER](functions/filter.md) | See also [Use lambda functions on data with Snowflake higher-order functions](../user-guide/querying-semistructured.md). |
|  | [REDUCE](functions/reduce.md) | See also [Use lambda functions on data with Snowflake higher-order functions](../user-guide/querying-semistructured.md). |
|  | [TRANSFORM](functions/transform.md) | See also [Use lambda functions on data with Snowflake higher-order functions](../user-guide/querying-semistructured.md). |
| **Map Creation and Manipulation** | [MAP_CAT](functions/map_cat.md) |  |
|  | [MAP_CONTAINS_KEY](functions/map_contains_key.md) |  |
|  | [MAP_DELETE](functions/map_delete.md) |  |
|  | [MAP_ENTRIES](functions/map_entries.md) |  |
|  | [MAP_INSERT](functions/map_insert.md) |  |
|  | [MAP_KEYS](functions/map_keys.md) |  |
|  | [MAP_PICK](functions/map_pick.md) |  |
|  | [MAP_SIZE](functions/map_size.md) |  |
| **Extraction** | [FLATTEN](functions/flatten.md) | [Table function](functions-table.md). |
|  | [GET](functions/get.md) |  |
|  | [GET_IGNORE_CASE](functions/get_ignore_case.md) |  |
|  | [GET_PATH , :](functions/get_path.md) | Variation of GET. |
|  | [OBJECT_KEYS](functions/object_keys.md) | Extracts keys from key/value pairs in [OBJECT](data-types-semistructured.md). |
|  | [XMLGET](functions/xmlget.md) |  |
| **Conversion/Casting** | [AS_<object_type>](functions/as.md) |  |
|  | [AS_ARRAY](functions/as_array.md) |  |
|  | [AS_BINARY](functions/as_binary.md) |  |
|  | [AS_CHAR , AS_VARCHAR](functions/as_char-varchar.md) |  |
|  | [AS_DATE](functions/as_date.md) |  |
|  | [AS_DECIMAL , AS_NUMBER](functions/as_decimal-number.md) |  |
|  | [AS_DOUBLE , AS_REAL](functions/as_double-real.md) |  |
|  | [AS_INTEGER](functions/as_integer.md) |  |
|  | [AS_OBJECT](functions/as_object.md) |  |
|  | [AS_TIME](functions/as_time.md) |  |
|  | [AS_TIMESTAMP_\*](functions/as_timestamp.md) |  |
|  | [STRTOK_TO_ARRAY](functions/strtok_to_array.md) |  |
|  | [TO_ARRAY](functions/to_array.md) |  |
|  | [TO_JSON](functions/to_json.md) |  |
|  | [TO_OBJECT](functions/to_object.md) |  |
|  | [TO_VARIANT](functions/to_variant.md) |  |
|  | [TO_XML](functions/to_xml.md) |  |
| **Type Predicates** | [IS_<object_type>](functions/is.md) |  |
|  | [IS_ARRAY](functions/is_array.md) |  |
|  | [IS_BOOLEAN](functions/is_boolean.md) |  |
|  | [IS_BINARY](functions/is_binary.md) |  |
|  | [IS_CHAR , IS_VARCHAR](functions/is_char-varchar.md) |  |
|  | [IS_DATE , IS_DATE_VALUE](functions/is_date-value.md) |  |
|  | [IS_DECIMAL](functions/is_decimal.md) |  |
|  | [IS_DOUBLE , IS_REAL](functions/is_double-real.md) |  |
|  | [IS_INTEGER](functions/is_integer.md) |  |
|  | [IS_NULL_VALUE](functions/is_null_value.md) |  |
|  | [IS_OBJECT](functions/is_object.md) |  |
|  | [IS_TIME](functions/is_time.md) |  |
|  | [IS_TIMESTAMP_\*](functions/is_timestamp.md) |  |
|  | [TYPEOF](functions/typeof.md) |  |

---
title: Semi-structured data types
source: https://docs.snowflake.com/en/sql-reference/data-types-semistructured.md
section: SQL General Reference
---

# Semi-structured data types

The following Snowflake data types can contain other data types:

* VARIANT (can contain a value of any other data type).
* OBJECT (can directly contain a VARIANT value, and thus indirectly contain a value of any
  other data type, including itself).
* ARRAY (can directly contain a VARIANT value, and thus indirectly contain a value of any
  other data type, including itself).

We often refer to these data types as *semi-structured* data types. Strictly speaking, OBJECT is the only one of these
data types that, by itself, has all of the characteristics of a true
[semi-structured data type](https://en.wikipedia.org/wiki/Semi-structured_data). However, combining these data types allows you to
explicitly represent arbitrary [hierarchical data structures](../user-guide/semistructured-intro.md),
which can be used to load and operate on data in semi-structured formats (such as [JSON](../user-guide/semistructured-data-formats.md),
[Avro](../user-guide/semistructured-data-formats.md), [ORC](../user-guide/semistructured-data-formats.md), [Parquet](../user-guide/semistructured-data-formats.md), or
[XML](../user-guide/semistructured-data-formats.md)).

> **Note:**
>
> For information about *structured data types* (for example, ARRAY(INTEGER), OBJECT(city VARCHAR), or MAP(VARCHAR, VARCHAR),
> see [Structured data types](data-types-structured.md).

## VARIANT

A VARIANT value can store a value of any other type, including OBJECT and ARRAY values.

### Characteristics of a VARIANT value

A VARIANT value can have a maximum size of up to 128 MB of uncompressed data. However, in practice,
the maximum size is usually smaller because of internal overhead. The maximum size is also dependent
on the object being stored.

### Inserting VARIANT data

To insert VARIANT data directly, use `INSERT INTO ... SELECT`. The following example shows how to insert JSON-formatted
data into a VARIANT value:

```sqlexample
CREATE OR REPLACE TABLE variant_insert (v VARIANT);
INSERT INTO variant_insert (v)
  SELECT PARSE_JSON('{"key3": "value3", "key4": "value4"}');
SELECT * FROM variant_insert;
```

```output
+---------------------+
| V                   |
|---------------------|
| {                   |
|   "key3": "value3", |
|   "key4": "value4"  |
| }                   |
+---------------------+
```

### Using VARIANT values

To convert a value to or from the VARIANT data type, you can explicitly cast using the [CAST](functions/cast.md)
function, the [TO_VARIANT](functions/to_variant.md) function, or the `::` operator (for example, `expression::VARIANT`).

In some situations, a value can be implicitly cast to a VARIANT value. For details, see [Data type conversion](data-type-conversion.md).

The following example shows how to use a VARIANT value, including how to convert from a VARIANT value and to a VARIANT value.

Create a table and insert a value:

```sqlexample
CREATE OR REPLACE TABLE varia (float1 FLOAT, v VARIANT, float2 FLOAT);
INSERT INTO varia (float1, v, float2) VALUES (1.23, NULL, NULL);
```

The first UPDATE converts a FLOAT value to a VARIANT value. The second UPDATE converts a VARIANT
value to a FLOAT value.

```sqlexample
UPDATE varia SET v = TO_VARIANT(float1);  -- converts from a FLOAT value to a VARIANT value.
UPDATE varia SET float2 = v::FLOAT;       -- converts from a VARIANT value to a FLOAT value.
```

SELECT all the values:

```sqlexample
SELECT * FROM varia;
```

```output
+--------+-----------------------+--------+
| FLOAT1 | V                     | FLOAT2 |
|--------+-----------------------+--------|
|   1.23 | 1.230000000000000e+00 |   1.23 |
+--------+-----------------------+--------+
```

As shown in the previous example, to convert a value from the VARIANT data type, cast the VARIANT value to the
target data type. For example, the following statement uses the `::` operator to convert the VARIANT
to a FLOAT:

```sqlexample
SELECT my_variant_column::FLOAT * 3.14 FROM ...;
```

VARIANT data stores both the value and the data type of the value. Therefore, you can use VARIANT values in expressions where the
value’s data type is valid without first casting the VARIANT. For example, if VARIANT column `my_variant_column` contains a
numeric value, then you can directly multiply `my_variant_column` by another numeric value:

```sqlexample
SELECT my_variant_column * 3.14 FROM ...;
```

You can retrieve the value’s native data type by using the [TYPEOF](functions/typeof.md) function.

By default, when VARCHAR, DATE, TIME, and TIMESTAMP values are retrieved from a VARIANT column, the values are surrounded by double
quotes. You can eliminate the double quotes by explicitly casting the values to the underlying data types (for example, from VARIANT to
VARCHAR). For example:

```sqlexample
SELECT 'Sample', 'Sample'::VARIANT, 'Sample'::VARIANT::VARCHAR;
```

```output
+----------+-------------------+----------------------------+
| 'SAMPLE' | 'SAMPLE'::VARIANT | 'SAMPLE'::VARIANT::VARCHAR |
|----------+-------------------+----------------------------|
| Sample   | "Sample"          | Sample                     |
+----------+-------------------+----------------------------+
```

A VARIANT value can be missing (contain SQL NULL), which is different from a VARIANT **null** value, which is a real value used to
represent a null value in semi-structured data. VARIANT **null** is a true value that compares as equal to itself.
For more information, see [NULL values](../user-guide/semistructured-considerations.md).

If data was loaded from JSON format and stored in a VARIANT column, then the following considerations apply:

* For data that is mostly regular and uses only native JSON types (such as strings and numbers), the performance is very similar for
  storage and query operations on relational data and data in a VARIANT column.
* For non-native data (such as dates and timestamps), the values are stored as strings when loaded into a VARIANT column. Therefore,
  operations on these values might be slower and also consume more space than when stored in a relational column with the corresponding
  data type.

For more information about using the VARIANT data type, see [Considerations for semi-structured data stored in VARIANT](../user-guide/semistructured-considerations.md).

For more information about querying semi-structured data stored in a VARIANT column, see [Querying Semi-structured Data](../user-guide/querying-semistructured.md).

### Common uses for VARIANT data

VARIANT data is typically used when:

* You want to create [hierarchical data](../user-guide/semistructured-intro.md) by explicitly defining a hierarchy that contains two or
  more ARRAY values or OBJECT values.
* You want to load JSON, Avro, ORC, or Parquet data directly, without explicitly describing the hierarchical structure of the data.

  Snowflake can convert data from JSON, Avro, ORC, or Parquet format to an internal hierarchy of
  ARRAY, OBJECT, and VARIANT data and store that hierarchical data directly in a VARIANT value. Although you can manually construct
  the data hierarchy yourself, it is usually easier to let Snowflake do it for you.

  For more information about loading and converting semi-structured data, see [Load semi-structured data](../user-guide/semistructured-intro.md).

## OBJECT

A Snowflake OBJECT value is analogous to a [JSON “object”](http://json.org). In other programming
languages, the corresponding data type is often called a “dictionary,” “hash,” or “map.”

An OBJECT value contains key-value pairs.

### Characteristics of an OBJECT value

In Snowflake semi-structured OBJECT data, each key is a [VARCHAR](data-types-text.md) value, and each
value is a VARIANT value.

Because a VARIANT value can store a value of any other data type, different VARIANT values (in different key-value pairs) can have
different underlying data types. For example, an OBJECT value can hold a person’s name as a VARCHAR value and a person’s age as an INTEGER
value. In the following example, both the name and the age are cast to VARIANT values.

```sqlexample
SELECT OBJECT_CONSTRUCT(
  'name', 'Jones'::VARIANT,
  'age',  42::VARIANT);
```

The following considerations apply to OBJECT data:

* Currently, Snowflake doesn’t support explicitly-typed objects.
* In a key-value pair, the key shouldn’t be an empty string, and neither the key nor the value should be NULL.
* The maximum length of an OBJECT value is 128 MB.
* An OBJECT value can contain [semi-structured data](https://en.wikipedia.org/wiki/Semi-structured_data).
* An OBJECT value can be used to create [hierarchical data structures](../user-guide/semistructured-intro.md).

> **Note:**
>
> Snowflake also supports the structured OBJECT data type, which allows for values other than VARIANT values. A structured OBJECT type also defines
> the keys that must be present in an OBJECT value of that type. For more information, see [Structured data types](data-types-structured.md).

### Inserting OBJECT data

To insert OBJECT data directly, use `INSERT INTO ... SELECT`.

The following example uses the [OBJECT_CONSTRUCT](functions/object_construct.md) function to construct the OBJECT value that it inserts.

```sqlexample
CREATE OR REPLACE TABLE object_example (object_column OBJECT);
INSERT INTO object_example (object_column)
  SELECT OBJECT_CONSTRUCT('thirteen', 13::VARIANT, 'zero', 0::VARIANT);
SELECT * FROM object_example;
```

```output
+-------------------+
| OBJECT_COLUMN     |
|-------------------|
| {                 |
|   "thirteen": 13, |
|   "zero": 0       |
| }                 |
+-------------------+
```

In each key-value pair, the value was explicitly cast to VARIANT. Explicit casting wasn’t required in these cases.
Snowflake can implicitly cast to VARIANT. For information about implicit casting, see
[Data type conversion](data-type-conversion.md).

You can also use an OBJECT constant to specify the OBJECT value to insert. For more information, see
OBJECT constants.

### OBJECT constants

A *constant* (also known as a *literal*) refers to a fixed data value. Snowflake supports using constants to specify OBJECT values.
OBJECT constants are delimited with curly braces (`{` and `}`).

OBJECT constants have the following syntax:

```sqlsyntax
{ [<key>: <value> [, <key>: <value> , ...]] }
```

Where:

`key`
:   The key in a key-value pair. The `key` must be a string literal.

`value`
:   The value that is associated with the key. The `value` can be a literal or an expression.
    The `value` can be any data type.

The following are examples that specify OBJECT constants:

* `{}` is an empty OBJECT value.
* `{ 'key1': 'value1' , 'key2': 'value2' }` contains the specified key-value pairs for the OBJECT value using
  literals for the values.
* `{ 'key1': c1+1 , 'key2': c1+2 }` contains the specified key-value pairs for the OBJECT value using
  expressions for the values.

* `{*}` is a wildcard that constructs the OBJECT value from the specified data using the attribute names
  as keys and the associated values as values.

  When it is specified in an object constant, the wildcard can be unqualified or qualified with a table name or alias.
  For example, both of these wildcard specifications are valid:

  ```sqlexample
  SELECT {*} FROM my_table;

  SELECT {my_table1.*}
    FROM my_table1 INNER JOIN my_table2
      ON my_table2.col1 = my_table1.col1;
  ```

  You can use the ILIKE and EXCLUDE keywords in an object constant. To select specific columns, use the
  ILIKE keyword. For example, the following query selects columns that match the pattern `col1%` in
  the table `my_table`:

  ```sqlexample
  SELECT {* ILIKE 'col1%'} FROM my_table;
  ```

  To exclude specific columns, use the EXCLUDE keyword. For example, the following query excludes `col1` in
  the table `my_table`:

  ```sqlexample
  SELECT {* EXCLUDE col1} FROM my_table;
  ```

  The following query excludes `col1` and `col2` in the table `my_table`:

  ```sqlexample
  SELECT {* EXCLUDE (col1, col2)} FROM my_table;
  ```

  Wildcards can’t be mixed with key-value pairs. For example, the following wildcard specification isn’t allowed:

  ```sqlexample
  SELECT {*, 'k': 'v'} FROM my_table;
  ```

  More than one wildcard can’t be used in one object constant. For example, the following
  wildcard specification isn’t allowed:

  ```sqlexample
  SELECT {t1.*, t2.*} FROM t1, t2;
  ```

The following statements use an OBJECT constant and the [OBJECT_CONSTRUCT](functions/object_construct.md) function to perform
an insert of OBJECT data into a table. The OBJECT values contain the names and capital cities of two Canadian
provinces.

```sqlexample
CREATE OR REPLACE TABLE my_object_table (my_object OBJECT);

INSERT INTO my_object_table (my_object)
  SELECT { 'PROVINCE': 'Alberta'::VARIANT , 'CAPITAL': 'Edmonton'::VARIANT };

INSERT INTO my_object_table (my_object)
  SELECT OBJECT_CONSTRUCT('PROVINCE', 'Manitoba'::VARIANT , 'CAPITAL', 'Winnipeg'::VARIANT );

SELECT * FROM my_object_table;
```

```output
+--------------------------+
| MY_OBJECT                |
|--------------------------|
| {                        |
|   "CAPITAL": "Edmonton", |
|   "PROVINCE": "Alberta"  |
| }                        |
| {                        |
|   "CAPITAL": "Winnipeg", |
|   "PROVINCE": "Manitoba" |
| }                        |
+--------------------------+
```

The following example uses a wildcard (`{*}`) to insert OBJECT data by getting the attribute names and
values from the FROM clause. First, create a table named `demo_ca_provinces` with VARCHAR
values that contain the province and capital names:

```sqlexample
CREATE OR REPLACE TABLE demo_ca_provinces (province VARCHAR, capital VARCHAR);
INSERT INTO demo_ca_provinces (province, capital) VALUES
  ('Ontario', 'Toronto'),
  ('British Columbia', 'Victoria');

SELECT province, capital
  FROM demo_ca_provinces
  ORDER BY province;
```

```output
+------------------+----------+
| PROVINCE         | CAPITAL  |
|------------------+----------|
| British Columbia | Victoria |
| Ontario          | Toronto  |
+------------------+----------+
```

Insert object data into the `my_object_table` using the data in the `demo_ca_provinces`
table:

```sqlexample
INSERT INTO my_object_table (my_object)
  SELECT {*} FROM demo_ca_provinces;

SELECT * FROM my_object_table;
```

```output
+----------------------------------+
| MY_OBJECT                        |
|----------------------------------|
| {                                |
|   "CAPITAL": "Edmonton",         |
|   "PROVINCE": "Alberta"          |
| }                                |
| {                                |
|   "CAPITAL": "Winnipeg",         |
|   "PROVINCE": "Manitoba"         |
| }                                |
| {                                |
|   "CAPITAL": "Toronto",          |
|   "PROVINCE": "Ontario"          |
| }                                |
| {                                |
|   "CAPITAL": "Victoria",         |
|   "PROVINCE": "British Columbia" |
| }                                |
+----------------------------------+
```

The following example uses expressions for the values in an OBJECT constant:

```sqlexample
SET my_variable = 10;
SELECT {'key1': $my_variable+1, 'key2': $my_variable+2};
```

```output
+--------------------------------------------------+
| {'KEY1': $MY_VARIABLE+1, 'KEY2': $MY_VARIABLE+2} |
|--------------------------------------------------|
| {                                                |
|   "key1": 11,                                    |
|   "key2": 12                                     |
| }                                                |
+--------------------------------------------------+
```

SQL statements specify string literals inside an OBJECT value with single quotes (as elsewhere in Snowflake SQL), but string
literals inside an OBJECT value are displayed with double quotes:

```sqlexample
SELECT { 'Manitoba': 'Winnipeg' } AS province_capital;
```

```output
+--------------------------+
| PROVINCE_CAPITAL         |
|--------------------------|
| {                        |
|   "Manitoba": "Winnipeg" |
| }                        |
+--------------------------+
```

### Accessing elements of an OBJECT value by key

To retrieve the value in an OBJECT value, specify the key in
[square brackets](../user-guide/querying-semistructured.md), as shown below:

```sqlexample
SELECT object_column['thirteen'] FROM object_example;
```

You can also use the colon operator. The following command shows that the results are the same whether you use
the square brackets or the colon:

```sqlexample
SELECT object_column['thirteen'],
       object_column:thirteen
  FROM object_example;
```

```output
+---------------------------+------------------------+
| OBJECT_COLUMN['THIRTEEN'] | OBJECT_COLUMN:THIRTEEN |
|---------------------------+------------------------|
| 13                        | 13                     |
+---------------------------+------------------------+
```

For more information about the colon operator, see [Dot Notation](../user-guide/querying-semistructured.md), which describes
the use of the `:` and `.` operators to access nested data.

### Common uses for OBJECT data

OBJECT data is typically used when one or more of the following are true:

* You have multiple pieces of data that are identified by strings. For example, if
  you want to look up information by province name, you might want to use an OBJECT value.
* You want to store information about the data with the data. The names (keys) aren’t merely distinct identifiers, but are
  meaningful.
* The information has no natural order, or the order can be inferred solely from the keys.
* The structure of the data varies, or the data can be incomplete. For example, if you want to create a catalog of books that
  usually contains the title, author name, and publication date, but in some cases the publication date is unknown, then you might
  want to use an OBJECT value.

## ARRAY

A Snowflake array is similar to an array in many other programming languages. An array contains 0 or more pieces of data.
Each element is accessed by specifying its position in the array.

### Characteristics of an array

Each value in a semi-structured array is of type VARIANT. A VARIANT value can contain a value of any
other data type.

Values of other data types can be cast to VARIANT values and then stored in an array. Some functions for arrays, including
[ARRAY_CONSTRUCT](functions/array_construct.md), can [implicitly cast](data-type-conversion.md) values to VARIANT
values.

Because arrays store VARIANT values, and because VARIANT values can store other data types within them, the underlying data types
of the values in an array can be different. However, in most cases, the data elements are of the same or compatible
types, so they can all be processed the same way.

The following considerations apply to arrays:

* Snowflake doesn’t support arrays of elements of a specific non-VARIANT type.
* A Snowflake array is declared without specifying the number of elements. An array can grow dynamically based on operations such
  as [ARRAY_APPEND](functions/array_append.md). Snowflake doesn’t currently support fixed-size arrays.
* An array can contain both SQL NULL values and JSON null values. For more information, see [NULL values](../user-guide/semistructured-considerations.md).
* The theoretical maximum combined size of all values in an array is 128 MB. However, arrays have internal overhead.
  The practical maximum data size is usually smaller, depending upon the number and values of the elements.

> **Note:**
>
> Snowflake also supports structured arrays, which allow for elements of types other than VARIANT.
> For more information, see [Structured data types](data-types-structured.md).

### Inserting ARRAY data

To insert ARRAY data directly, use `INSERT INTO ... SELECT`.

The following code uses the [ARRAY_CONSTRUCT](functions/array_construct.md) function to construct the array that it inserts.

```sqlexample
CREATE OR REPLACE TABLE array_example (array_column ARRAY);
INSERT INTO array_example (array_column)
  SELECT ARRAY_CONSTRUCT(12, 'twelve', NULL);
```

You can also use an ARRAY constant to specify the array to insert. For more information, see
ARRAY constants.

### ARRAY constants

A *constant* (also known as a *literal*) refers to a fixed data value. Snowflake supports using constants to specify ARRAY values.
ARRAY constants are delimited with square brackets (`[` and `]`).

ARRAY constants have the following syntax:

```sqlsyntax
[<value> [, <value> , ...]]
```

Where:

`value`
:   The value that is associated with an array element. The `value` can be a literal or an expression.
    The `value` can be any data type.

The following are examples that specify ARRAY constants:

* `[]` is an empty ARRAY value.
* `[ 1 , 'value1' ]` contains the specified values for the ARRAY constant using
  literals for the values.
* `[ c1+1 , c1+2 ]` contains the specified values for the ARRAY constant using
  expressions for the values.

The following example uses an ARRAY constant to specify the array to insert.

```sqlexample
INSERT INTO array_example (array_column)
  SELECT [ 12, 'twelve', NULL ];
```

The following statements use an ARRAY constant and the [ARRAY_CONSTRUCT](functions/array_construct.md) function to perform the same task:

```sqlexample
UPDATE my_table SET my_array = [ 1, 2 ];

UPDATE my_table SET my_array = ARRAY_CONSTRUCT(1, 2);
```

The following example uses expressions for the values in an ARRAY constant:

```sqlexample
SET my_variable = 10;
SELECT [$my_variable+1, $my_variable+2];
```

```output
+----------------------------------+
| [$MY_VARIABLE+1, $MY_VARIABLE+2] |
|----------------------------------|
| [                                |
|   11,                            |
|   12                             |
| ]                                |
+----------------------------------+
```

SQL statements specify string literals inside an array with single quotes (as elsewhere in Snowflake SQL), but string
literals inside an array are displayed with double quotes:

```sqlexample
SELECT [ 'Alberta', 'Manitoba' ] AS province;
```

```output
+--------------+
| PROVINCE     |
|--------------|
| [            |
|   "Alberta", |
|   "Manitoba" |
| ]            |
+--------------+
```

### Accessing elements of an array by index or by slice

Array indexes are 0-based, so the first element in an array is element 0.

Values in an array are accessed by specifying an array element’s index number in square brackets. For example, the following query
reads the value at index position `2` in the array stored in `my_array_column`.

```sqlexample
SELECT my_array_column[2] FROM my_table;
```

Arrays can be nested. The following query reads the zeroth element of the zeroth element of a nested array:

```sqlexample
SELECT my_array_column[0][0] FROM my_table;
```

Attempting to access an element beyond the end of an array returns NULL.

A *slice* of an array is a sequence of adjacent elements (that is, a contiguous subset of the array).

You can access a slice of an array by calling the [ARRAY_SLICE](functions/array_slice.md) function. For example:

```sqlexample
SELECT ARRAY_SLICE(my_array_column, 5, 10) FROM my_table;
```

The ARRAY_SLICE function returns elements from the specified starting element (5 in the example above) up to
but not including the specified ending element (10 in the example above).

An empty array or an empty slice is often denoted by a pair of square braces with nothing between them (`[]`).

### Dense and sparse arrays

An array can be *dense* or *sparse*.

In a dense array, the index values of the elements start at zero and are sequential (0, 1, 2, and so on). However,
in a sparse array, the index values can be non-sequential (for example, 0, 2, 5). The values don’t need to start at 0.

If an index has no corresponding element, then the value corresponding to that index is said to be *undefined*. For example, if a
sparse array has three elements, and those elements are at indexes 0, 2, and 5, then the elements at indexes 1, 3, and 4 are
`undefined`.

An undefined element is treated as an element. For example, consider the earlier example of a sparse array that contains elements
at indexes 0, 2, and 5 (and doesn’t have any elements after index 5). If you read the slice containing elements at indexes 3 and 4,
then the output is similar to the following:

```sqlexample
[ undefined, undefined ]
```

Attempting to access a slice beyond the end of an array results in an empty array, not an array of `undefined` values.
The following SELECT statement attempts to read beyond the last element in the sample sparse array:

```sqlexample
SELECT ARRAY_SLICE(array_column, 6, 8) FROM table_1;
```

The output is an empty array:

```output
+---------------------------------+
| array_slice(array_column, 6, 8) |
+---------------------------------+
| [ ]                             |
+---------------------------------+
```

`undefined` is different from NULL. A NULL value in an array is a defined element.

In a dense array, each element consumes storage space, even if the value of the element is NULL.
In a sparse array, `undefined` elements don’t directly consume storage space.

In a dense array, the theoretical range of index values is from 0 to 134217727. (The maximum theoretical number of elements
is 134217728 because the upper limit on size is 128 MB, or 134217728 bytes, and the smallest possible value is one byte.)

In a sparse array, the theoretical range of index values is from 0 to 231 - 1. However, because of the 128 MB limitation, a
sparse array can’t hold 231 values. The maximum theoretical number of values is still limited to 134217728.

Because of internal overhead, the practical size limit in both dense and sparse arrays is at least slightly less than
the theoretical maximum of 128 MB.

You can create a sparse array by using the [ARRAY_INSERT](functions/array_insert.md) function to insert values at specific
index points in an array (leaving other array elements `undefined`). Because ARRAY_INSERT pushes elements to the right, which
changes the index values required to access them, it is normally best to fill a sparse array from left to right
(that is, from 0 up, increasing the index value for each new value inserted).

### Common uses for ARRAY data

ARRAY data is typically used when one or more of the following are true:

* There is a collection of data, and each piece in the collection is structured the same or similarly.
* Each piece of data is processed similarly. For example, you might loop through the data, processing each piece the same way.
* The data has a natural order, for example, chronological.

## Examples

The following example shows the output of a DESC TABLE command on a table with VARIANT, ARRAY, and OBJECT data.

```sqlexample
CREATE OR REPLACE TABLE test_semi_structured(
  var VARIANT,
  arr ARRAY,
  obj OBJECT);

DESC TABLE test_semi_structured;
```

```output
+------+---------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type    | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+---------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| VAR  | VARIANT | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| ARR  | ARRAY   | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| OBJ  | OBJECT  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+---------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

This example shows how to load simple values into a table, and what those values look like when you query the table.

Create a table and load the data:

```sqlexample
CREATE OR REPLACE TABLE demonstration1 (
  ID INTEGER,
  array1 ARRAY,
  variant1 VARIANT,
  object1 OBJECT);

INSERT INTO demonstration1 (id, array1, variant1, object1)
  SELECT
    1,
    ARRAY_CONSTRUCT(1, 2, 3),
    PARSE_JSON(' { "key1": "value1", "key2": "value2" } '),
    PARSE_JSON(' { "outer_key1": { "inner_key1A": "1a", "inner_key1B": "1b" }, '
              ||
               '   "outer_key2": { "inner_key2": 2 } } ');

INSERT INTO demonstration1 (id, array1, variant1, object1)
  SELECT
    2,
    ARRAY_CONSTRUCT(1, 2, 3, NULL),
    PARSE_JSON(' { "key1": "value1", "key2": NULL } '),
    PARSE_JSON(' { "outer_key1": { "inner_key1A": "1a", "inner_key1B": NULL }, '
              ||
                '   "outer_key2": { "inner_key2": 2 } '
              ||
               ' } ');
```

Now show the data in the table.

```sqlexample
SELECT *
  FROM demonstration1
  ORDER BY id;
```

```output
+----+-------------+---------------------+--------------------------+
| ID | ARRAY1      | VARIANT1            | OBJECT1                  |
|----+-------------+---------------------+--------------------------|
|  1 | [           | {                   | {                        |
|    |   1,        |   "key1": "value1", |   "outer_key1": {        |
|    |   2,        |   "key2": "value2"  |     "inner_key1A": "1a", |
|    |   3         | }                   |     "inner_key1B": "1b"  |
|    | ]           |                     |   },                     |
|    |             |                     |   "outer_key2": {        |
|    |             |                     |     "inner_key2": 2      |
|    |             |                     |   }                      |
|    |             |                     | }                        |
|  2 | [           | {                   | {                        |
|    |   1,        |   "key1": "value1", |   "outer_key1": {        |
|    |   2,        |   "key2": null      |     "inner_key1A": "1a", |
|    |   3,        | }                   |     "inner_key1B": null  |
|    |   undefined |                     |   },                     |
|    | ]           |                     |   "outer_key2": {        |
|    |             |                     |     "inner_key2": 2      |
|    |             |                     |   }                      |
|    |             |                     | }                        |
+----+-------------+---------------------+--------------------------+
```

For additional examples, see [Querying Semi-structured Data](../user-guide/querying-semistructured.md).

---
title: Service types
source: https://docs.snowflake.com/en/sql-reference/service-types.md
section: SQL General Reference
---

# Service types

This topic contains the types of usage that can incur costs in Snowflake.

| Service type | Usage statement name | Description | Category | Unit |
| --- | --- | --- | --- | --- |
| AI_INFERENCE | AI INFERENCE | Compute credits for running AI model inference workloads. | AI and Machine Learning | Million Tokens |
| AI_INFERENCE_TOOLS | AI INFERENCE (TOOLS) | Compute credits for running AI model inference tool calls. | AI and Machine Learning | N/A |
| AI_SECURITY_GUARDRAILS | AI SECURITY GUARDRAILS | AI-powered security guardrails for content safety and policy enforcement. | AI and Machine Learning | AI Credits |
| AI_SERVICES | AI SERVICES | Usage of Snowflake AI and ML services including Cortex functions. | AI and Machine Learning | Credits |
| ARCHIVE_STORAGE_COLD | ARCHIVE STORAGE COLD | Long-term cold archive storage for infrequently accessed data. | Storage | TiB-Months |
| ARCHIVE_STORAGE_COOL | ARCHIVE STORAGE COOL | Mid-tier archive storage for occasionally accessed data. | Storage | TiB-Months |
| ARCHIVE_STORAGE_DATA_RETRIEVAL | ARCHIVE STORAGE DATA RETRIEVAL | Charges for retrieving data from archive storage tiers. | Databases | TiB |
| ARCHIVE_STORAGE_RETRIEVAL_FILE_PROCESSING | ARCHIVE STORAGE RETRIEVAL FILE PROCESSING | Processing charges for file retrieval operations from archive storage. | Databases | Credits |
| ARCHIVE_STORAGE_WRITE | ARCHIVE STORAGE WRITE | Charges for writing data to archive storage tiers. | Storage | Credits |
| AUTOMATED_REFRESH_AND_DATA_REGISTRATION | AUTOMATED REFRESH AND DATA REGISTRATION | Compute for automated refresh and data registration in Iceberg tables. | Databases | Credits |
| AUTO_CLUSTERING | AUTOMATIC CLUSTERING | Serverless compute for automatic reclustering of tables. | Databases | Credits |
| BACKUP | BACKUP | Compute and storage for backup snapshots. | Storage | Credits |
| BLOCK_STORAGE | BLOCK STORAGE | Block storage used by Snowpark Container Services and Hybrid Tables. | Storage | TiB-Months |
| BLOCK_STORAGE_ADDITIONAL_IOPS | BLOCK STORAGE ADDITIONAL IOPS | Additional IOPS provisioned beyond baseline for block storage. | Storage | Thousand IOPS-Months |
| BLOCK_STORAGE_ADDITIONAL_THROUGHPUT | BLOCK STORAGE ADDITIONAL THROUGHPUT | Additional throughput provisioned beyond baseline for block storage. | Storage | GB/Second-Months |
| CLOUD_SERVICES | CLOUD SERVICES | Always-on services including authentication, metadata, and query optimization. | Compute | Credits |
| COPY_FILES | COPY FILES | Serverless compute for COPY FILES operations between stages. | Migration | Credits |
| CORTEX_AGENTS | CORTEX AGENTS | Agent API that orchestrates tools, queries data, and executes multi-step tasks in Snowflake and via MCP powered by Cortex. | AI and Machine Learning | AI Credits |
| CORTEX_CODE_CLI | CORTEX CODE: CLI | AI-powered coding assistant accessed via the command-line interface for local development. | AI and Machine Learning | AI Credits |
| CORTEX_CODE_CLI_SUBSCRIPTION | CORTEX CODE: CLI (SUBSCRIPTION) | Self-service Cortex Code command-line interface subscription. | AI and Machine Learning | N/A |
| CORTEX_CODE_SNOWSIGHT | CORTEX CODE: SNOWSIGHT | AI-powered coding assistant accessed through the browser-based IDE in Snowsight. | AI and Machine Learning | AI Credits |
| DATA_QUALITY_MONITORING | DATA QUALITY MONITORING | Serverless compute for data quality monitoring and metrics. | Management and Governance | Credits |
| DATA_TRANSFER | DATA TRANSFER | Network egress charges for data transferred out of Snowflake. | Networking | TiB |
| EGRESS_COST_OPTIMIZER | EGRESS COST OPTIMIZER | Service that optimizes and reduces data egress costs. | Management and Governance | TiB-Months |
| FAILSAFE_RECOVERY | FAILSAFE RECOVERY | Charges for recovering data from Fail-safe storage. | Databases | Credits |
| HYBRID_TABLE_DEDICATED_STORAGE_MODE | HYBRID TABLE DEDICATED STORAGE MODE | Dedicated storage mode for Hybrid Tables. | Storage | Days |
| HYBRID_TABLE_STORAGE | HYBRID TABLE STORAGE | Storage used by Hybrid Tables for transactional data. | Storage | TiB-Months |
| INTERNAL_DATA_TRANSFER | INTERNAL DATA TRANSFER | Data transfer between Snowflake regions or cloud providers. | Networking | TiB |
| LOGGING | LOGGING | Storage and compute for event logging and audit trails. | Management and Governance | Credits |
| MATERIALIZED_VIEW | MATERIALIZED VIEWS | Serverless compute for maintaining materialized views. | Compute | Credits |
| OPENFLOW_COMPUTE_BYOC | OPENFLOW COMPUTE BYOC | Openflow compute using Bring Your Own Cloud resources. | Compute | Credits |
| OPENFLOW_COMPUTE_SNOWFLAKE | OPENFLOW COMPUTE SNOWFLAKE | Openflow compute managed by Snowflake infrastructure. | Compute | Credits |
| ORGANIZATION_USAGE | ORGANIZATION USAGE | Compute for organization-level usage tracking and reporting. | Management and Governance | Credits |
| OUTBOUND_PRIVATELINK_DATA_PROCESSED | OUTBOUND PRIVATELINK DATA PROCESSED | Data processed through outbound private connectivity connections. | Networking | TiB |
| OUTBOUND_PRIVATELINK_ENDPOINT | OUTBOUND PRIVATELINK ENDPOINT | Endpoint charges for outbound private connectivity connections. | Networking | Thousand Hours |
| PIPE | SNOWPIPE | Serverless compute for continuous data ingestion via Snowpipe. | Migration | Credits |
| POSTGRES_COMPUTE | POSTGRES COMPUTE | Compute for PostgreSQL-compatible interface workloads. | Compute | Credits |
| POSTGRES_COMPUTE_HA | POSTGRES COMPUTE HA | High-availability compute for PostgreSQL-compatible workloads. | Compute | Credits |
| POSTGRES_STORAGE | POSTGRES STORAGE | Storage for PostgreSQL-compatible interface data. | Storage | TiB-Months |
| POSTGRES_STORAGE_HA | POSTGRES STORAGE HA | High-availability storage for PostgreSQL-compatible data. | Storage | TiB-Months |
| QUERY_ACCELERATION | QUERY ACCELERATION | Serverless compute for accelerating eligible queries. | Compute | Credits |
| REPLICATION | REPLICATION | Compute and transfer for database and account replication. | Networking | Credits |
| SEARCH_OPTIMIZATION | SEARCH OPTIMIZATION | Serverless compute for search optimization service maintenance. | Compute | Credits |
| SENSITIVE_DATA_CLASSIFICATION | SENSITIVE DATA CLASSIFICATION | Serverless compute for classifying sensitive data. | Management and Governance | Credits |
| SERVERLESS_ALERTS | SERVERLESS ALERTS | Serverless compute for alert condition evaluation. | Management and Governance | Credits |
| SERVERLESS_TASK | SERVERLESS TASKS | Serverless compute for scheduled task execution. | Compute | Credits |
| SERVERLESS_TASKS_FLEX | SERVERLESS TASKS FLEX | Flexible serverless compute for scheduled tasks. | Compute | Credits |
| SNOWFLAKE_INTELLIGENCE | SNOWFLAKE INTELLIGENCE | Natural language interface for asking questions about your Snowflake and MCP data. | AI and Machine Learning | AI Credits |
| SNOWPARK_CONTAINER_SERVICES | SNOWPARK CONTAINER SERVICES | Compute and resources for Snowpark Container Services workloads. | Compute | Credits |
| SNOWPIPE_STREAMING | SNOWPIPE STREAMING | Serverless compute for low-latency streaming ingestion. | Migration | Credits |
| SNOWWORK | SNOWWORK | AI workspace that combines coding, data exploration, and task automation in a unified Snowflake environment. | AI and Machine Learning | AI Credits |
| STORAGE | STORAGE | Compressed data storage including Time Travel and Fail-safe. | Storage | TiB-Months |
| STORAGE_LIFECYCLE_POLICY_EXECUTION | STORAGE LIFECYCLE POLICY EXECUTION | Compute for executing storage lifecycle policies. | Management and Governance | Credits |
| STORAGE_REQUEST | STORAGE REQUEST | Request-based charges for storage operations. | Databases | Million Requests |
| TABLE_OPTIMIZATION | TABLE OPTIMIZATION | Serverless compute for automatic table optimization. | Compute | Credits |
| TELEMETRY_DATA_INGEST | TELEMETRY DATA INGEST | Ingestion of telemetry and observability data. | Management and Governance | Credits |
| TRUST_CENTER | TRUST CENTER | Serverless compute for Trust Center security monitoring. | Management and Governance | Credits |
| WAREHOUSE_METERING | COMPUTE | Virtual warehouse compute credits for query execution. | Compute | Credits |

---
title: Set operators
source: https://docs.snowflake.com/en/sql-reference/operators-query.md
section: SQL General Reference
---

# Set operators

Set operators combine the intermediate results of multiple query blocks into a single result set.

## General syntax

```sqlsyntax
[ ( ] <query> [ ) ]
{
  INTERSECT |
  { MINUS | EXCEPT } |
  UNION [ { DISTINCT | ALL } ] [ BY NAME ]
}
[ ( ] <query> [ ) ]
[ ORDER BY ... ]
[ LIMIT ... ]
```

## General usage notes

* Each query can itself contain query operators, so that you can combine multiple query expressions with set operators.
* You can apply the [ORDER BY](constructs/order-by.md) and [LIMIT / FETCH](constructs/limit.md) clauses to the result
  of the set operator.
* When using these operators:

  + Make sure that each query selects the same number of columns, with the exception of queries that include UNION BY NAME
    or UNION ALL BY NAME.
  + Make sure that the data type of each column is consistent across the rows from different sources.
    One of the examples in the Use the UNION operator and cast mismatched data types section
    illustrates the potential problem and solution when data types don’t match.
  + In general, make sure the “meanings,” as well as the data types, of the columns match. The following query with the
    UNION ALL operator won’t produce the desired results:

    ```sqlexample
    SELECT LastName, FirstName FROM employees
    UNION ALL
    SELECT FirstName, LastName FROM contractors;
    ```

    The risk of error increases when you use an asterisk to specify all columns of a table, for example:

    ```sqlexample
    SELECT * FROM table1
    UNION ALL
    SELECT * FROM table2;
    ```

    If the tables have the same number of columns, but the columns aren’t in the same order, the query results will
    probably be incorrect when you use these operators.

    The UNION BY NAME and UNION ALL BY NAME operators are exceptions for this scenario. For example, the following
    query returns the correct results:

    ```sqlexample
    SELECT LastName, FirstName FROM employees
    UNION ALL BY NAME
    SELECT FirstName, LastName FROM contractors;
    ```
  + The names of the output columns are based on the names of the columns of the first query. For example,
    consider the following query:

    ```sqlexample
    SELECT LastName, FirstName FROM employees
    UNION ALL
    SELECT FirstName, LastName FROM contractors;
    ```

    This query behaves as though the query were the following:

    ```sqlexample
    SELECT LastName, FirstName FROM employees
    UNION ALL
    SELECT FirstName AS LastName, LastName AS FirstName FROM contractors;
    ```
* The precedence of the set operators matches the ANSI and ISO SQL standards:

  + The UNION [ALL] and MINUS (EXCEPT) operators have equal precedence.
  + The INTERSECT operator has higher precedence than UNION [ALL] and MINUS (EXCEPT).

  Snowflake processes operators of equal precedence from left to right.

  You can use parentheses to force the expressions to be evaluated in a different order.

  Not all database vendors follow the ANSI/ISO standard for precedence of set operators. Snowflake recommends using parentheses to
  specify the order of evaluation, especially if you are porting code from another vendor to Snowflake, or writing code that you
  might execute on other databases as well as on Snowflake.

## Sample tables for examples

Some of the examples in this topic use the following sample tables. Both tables have a postal code column. One table records the postal code of
each sales office, and the other records the postal code of each customer.

```sqlexample
CREATE OR REPLACE TABLE sales_office_postal_example(
  office_name VARCHAR,
  postal_code VARCHAR);

INSERT INTO sales_office_postal_example VALUES ('sales1', '94061');
INSERT INTO sales_office_postal_example VALUES ('sales2', '94070');
INSERT INTO sales_office_postal_example VALUES ('sales3', '98116');
INSERT INTO sales_office_postal_example VALUES ('sales4', '98005');

CREATE OR REPLACE TABLE customer_postal_example(
  customer VARCHAR,
  postal_code VARCHAR);

INSERT INTO customer_postal_example VALUES ('customer1', '94066');
INSERT INTO customer_postal_example VALUES ('customer2', '94061');
INSERT INTO customer_postal_example VALUES ('customer3', '98444');
INSERT INTO customer_postal_example VALUES ('customer4', '98005');
```

## INTERSECT

Returns rows from one query’s result set which also appear in another query’s result set, with duplicate elimination.

### Syntax

```sqlsyntax
[ ( ] <query> [ ) ]
INTERSECT
[ ( ] <query> [ ) ]
```

### INTERSECT operator examples

To find the postal codes that are in both the `sales_office_postal_example` table and the `customer_postal_example`
table, query the sample tables:

```sqlexample
SELECT postal_code FROM sales_office_postal_example
INTERSECT
SELECT postal_code FROM customer_postal_example
ORDER BY postal_code;
```

```output
+-------------+
| POSTAL_CODE |
|-------------|
| 94061       |
| 98005       |
+-------------+
```

## MINUS , EXCEPT

Returns the rows returned by the first query that aren’t also returned by the second query.

The MINUS and EXCEPT keywords have the same meaning and can be used interchangeably.

### Syntax

```sqlsyntax
[ ( ] <query> [ ) ]
MINUS
[ ( ] <query> [ ) ]

[ ( ] <query> [ ) ]
EXCEPT
[ ( ] <query> [ ) ]
```

### MINUS operator examples

Query the sample tables to find the postal codes in the
`sales_office_postal_example` table that aren’t also in the `customer_postal_example` table:

```sqlexample
SELECT postal_code FROM sales_office_postal_example
MINUS
SELECT postal_code FROM customer_postal_example
ORDER BY postal_code;
```

```output
+-------------+
| POSTAL_CODE |
|-------------|
| 94070       |
| 98116       |
+-------------+
```

Query the sample tables to find the postal codes in the
`customer_postal_example` table that aren’t also in the `sales_office_postal_example` table:

```sqlexample
SELECT postal_code FROM customer_postal_example
MINUS
SELECT postal_code FROM sales_office_postal_example
ORDER BY postal_code;
```

```output
+-------------+
| POSTAL_CODE |
|-------------|
| 94066       |
| 98444       |
+-------------+
```

## UNION [ { DISTINCT | ALL } ] [ BY NAME ]

Combines the result sets from two queries:

* UNION [ DISTINCT ] combines rows by column position with duplicate elimination.
* UNION ALL combines rows by column position without duplicate elimination.
* UNION [ DISTINCT ] BY NAME combines rows by column name with duplicate elimination.
* UNION ALL BY NAME combines rows by column name without duplicate elimination.

The default is UNION DISTINCT (that is, combine rows by column position with duplicate elimination).
The DISTINCT keyword is optional. The DISTINCT keyword and the ALL keyword are mutually
exclusive.

Use UNION or UNION ALL when the column positions match in the tables that you are combining. Use
UNION BY NAME or UNION ALL BY NAME for the following use cases:

* The tables that you are combining have varying column orders.
* The tables that you are combining have evolving schemas, where columns are added or reordered.
* You want to combine subsets of columns that have different positions in the tables.

### Syntax

```sqlsyntax
[ ( ] <query> [ ) ]
UNION [ { DISTINCT | ALL } ] [ BY NAME ]
[ ( ] <query> [ ) ]
```

### Usage notes for the BY NAME clause

In addition to the general usage notes, the following usage notes apply to
UNION BY NAME and UNION ALL BY NAME:

* Columns with the same identifiers are matched and combined. Matching of unquoted identifiers is case-insensitive,
  and matching of quoted identifiers is case-sensitive.
* The inputs aren’t required to have the same number of columns. If a column exists in one input but not the other, it
  is filled with NULL values in the combined result set for each row where it’s missing.
* The order of columns in the combined result set is determined by the order of unique columns from
  left to right, as they are first encountered.

### UNION operator examples

The following examples use the UNION operator:

* Combine the results from two queries by column position
* Combine the results from two queries by column name
* Use an alias to combine the results from two queries with different column names
* Use the UNION operator and cast mismatched data types

#### Combine the results from two queries by column position

To combine the result sets by column position from two queries on the
sample tables, use the UNION operator:

```sqlexample
SELECT office_name office_or_customer, postal_code FROM sales_office_postal_example
UNION
SELECT customer, postal_code FROM customer_postal_example
ORDER BY postal_code;
```

```output
+--------------------+-------------+
| OFFICE_OR_CUSTOMER | POSTAL_CODE |
|--------------------+-------------|
| sales1             | 94061       |
| customer2          | 94061       |
| customer1          | 94066       |
| sales2             | 94070       |
| sales4             | 98005       |
| customer4          | 98005       |
| sales3             | 98116       |
| customer3          | 98444       |
+--------------------+-------------+
```

#### Combine the results from two queries by column name

Create two tables with differing column order and insert data:

```sqlexample
CREATE OR REPLACE TABLE union_demo_column_order1 (
  a INTEGER,
  b VARCHAR);

INSERT INTO union_demo_column_order1 VALUES
  (1, 'one'),
  (2, 'two'),
  (3, 'three');

CREATE OR REPLACE TABLE union_demo_column_order2 (
  B VARCHAR,
  A INTEGER);

INSERT INTO union_demo_column_order2 VALUES
  ('three', 3),
  ('four', 4);
```

To combine the result sets by column name from two queries, use the UNION BY NAME operator:

```sqlexample
SELECT * FROM union_demo_column_order1
UNION BY NAME
SELECT * FROM union_demo_column_order2
ORDER BY a;
```

```output
+---+-------+
| A | B     |
|---+-------|
| 1 | one   |
| 2 | two   |
| 3 | three |
| 4 | four  |
+---+-------+
```

The output shows that the query eliminated the duplicate row (with `3` in column `A` and `three`
in column `B`).

To combine the tables without duplicate elimination, use the UNION ALL BY NAME operator:

```sqlexample
SELECT * FROM union_demo_column_order1
UNION ALL BY NAME
SELECT * FROM union_demo_column_order2
ORDER BY a;
```

```output
+---+-------+
| A | B     |
|---+-------|
| 1 | one   |
| 2 | two   |
| 3 | three |
| 3 | three |
| 4 | four  |
+---+-------+
```

Notice that the cases of the column names don’t match in the two tables. The column names are lowercase in
the `union_demo_column_order1` table and uppercase in the `union_demo_column_order2` table. If you run
a query with quotation marks around the column names, an error is returned because the matching of quoted
identifiers is case-sensitive. For example, the following query places quotation marks around the column names:

```sqlexample
SELECT 'a', 'b' FROM union_demo_column_order1
UNION ALL BY NAME
SELECT 'B', 'A' FROM union_demo_column_order2
ORDER BY a;
```

```output
000904 (42000): SQL compilation error: error line 4 at position 9
invalid identifier 'A'
```

#### Use an alias to combine the results from two queries with different column names

When you use the UNION BY NAME operator to combine the result sets by column name from two queries on the
sample tables, the rows in the result set have NULL values because
the column names don’t match:

```sqlexample
SELECT office_name, postal_code FROM sales_office_postal_example
UNION BY NAME
SELECT customer, postal_code FROM customer_postal_example
ORDER BY postal_code;
```

```output
+-------------+-------------+-----------+
| OFFICE_NAME | POSTAL_CODE | CUSTOMER  |
|-------------+-------------+-----------|
| sales1      | 94061       | NULL      |
| NULL        | 94061       | customer2 |
| NULL        | 94066       | customer1 |
| sales2      | 94070       | NULL      |
| sales4      | 98005       | NULL      |
| NULL        | 98005       | customer4 |
| sales3      | 98116       | NULL      |
| NULL        | 98444       | customer3 |
+-------------+-------------+-----------+
```

The output shows that columns with different identifiers aren’t combined and that rows have NULL
values for columns that are in one table but not the other. The `postal_code` column is in both tables,
so there are no NULL values in the output for the `postal_code` column.

The following query uses the alias `office_or_customer` so that columns with different names
have the same name for the duration of the query:

```sqlexample
SELECT office_name AS office_or_customer, postal_code FROM sales_office_postal_example
UNION BY NAME
SELECT customer AS office_or_customer, postal_code FROM customer_postal_example
ORDER BY postal_code;
```

```output
+--------------------+-------------+
| OFFICE_OR_CUSTOMER | POSTAL_CODE |
|--------------------+-------------|
| sales1             | 94061       |
| customer2          | 94061       |
| customer1          | 94066       |
| sales2             | 94070       |
| sales4             | 98005       |
| customer4          | 98005       |
| sales3             | 98116       |
| customer3          | 98444       |
+--------------------+-------------+
```

#### Use the UNION operator and cast mismatched data types

This example demonstrates a potential issue with using the UNION operator when data types don’t match,
then provides the solution.

Start by creating the tables and inserting some data:

```sqlexample
CREATE OR REPLACE TABLE union_test1 (v VARCHAR);
CREATE OR REPLACE TABLE union_test2 (i INTEGER);

INSERT INTO union_test1 (v) VALUES ('Smith, Jane');
INSERT INTO union_test2 (i) VALUES (42);
```

Execute a UNION by column position operation with different data types (a VARCHAR value in `union_test1` and an
INTEGER value in `union_test2`):

```sqlexample
SELECT v FROM union_test1
UNION
SELECT i FROM union_test2;
```

This query returns an error:

```output
100038 (22018): Numeric value 'Smith, Jane' is not recognized
```

Now use explicit casting to convert the inputs to a compatible type:

```sqlexample
SELECT v::VARCHAR FROM union_test1
UNION
SELECT i::VARCHAR FROM union_test2;
```

```output
+-------------+
| V::VARCHAR  |
|-------------|
| Smith, Jane |
| 42          |
+-------------+
```

---
title: Snowflake classes
source: https://docs.snowflake.com/en/sql-reference/snowflake-db-classes.md
section: SQL General Reference
---

# Snowflake classes

The SNOWFLAKE database also includes Classes provided by Snowflake.

## Concepts

A *Class* is similar to a class in object oriented programming and serves as a blueprint for creating instances. An *Instance* is an
object created from a Class. Classes and instances are schema-level objects in Snowflake. You can think of a class as an extensible
Snowflake object type and an instance as a Snowflake object.

A class provides a public API through stored procedures and functions. Collectively they are referred to as *class methods*. A
class also provides *class roles* that enable fine-grained privileges on class methods. In addition to its public API, a class
includes private state and private procedures and functions, similar to private properties and methods in object oriented programming.
The implementation of a class can evolve over time through new *class versions*. Instances are upgraded to the latest class version
automatically by Snowflake.

For example, Snowflake provides the [ANOMALY_DETECTION class](classes/anomaly_detection.md) in the SNOWFLAKE.ML
schema. You can create an instance of a class using a CREATE command just as you would create an object of a specific object type.

The example below creates an instance of a class and calls an instance method.

1. Update your search path to include `SNOWFLAKE.ML`:

   ```sqlexample
   ALTER SESSION SET SEARCH_PATH = '$current, $public, snowflake.ml';
   ```
2. Create an instance of ANOMALY_DETECTION class:

   ```sqlexample
   CREATE ANOMALY_DETECTION mydatabase.myschema.my_anomaly_detector(...);
   ```
3. After you create an instance of the ANOMALY_DETECTION class, you can call instance methods:

   ```sqlexample
   mydatabase.myschema.my_anomaly_detector!DETECT_ANOMALIES(...);
   ```

> **Note:**
>
> Currently, classes are only provided by Snowflake and cannot be created by users.

## List available classes

You can find available classes and learn more about each class using SHOW commands. These commands allow you to:

* Find all available classes in the SNOWFLAKE database.
* List class methods.
* List class roles.

### Find all classes

List all the available Snowflake classes by executing the [SHOW CLASSES](sql/show-classes.md) command:

```sqlexample
SHOW CLASSES IN DATABASE SNOWFLAKE;
```

The results of this statement include the database and schema name for each class.

### Update your search path

Classes are objects in a schema in the SNOWFLAKE database. You must use the fully qualified class name (for example,
SNOWFLAKE.ML.ANOMALY_DETECTION) to execute the SQL commands that follow in this topic. Alternatively, you can update the
[search path](name-resolution.md) to include the database and schema for a class, then refer to
the class by its unqualified name (for example, ANOMALY_DETECTION).

> **Note:**
>
> If you update the search path for a particular class, functions that have the same name but that are part of
> a different class will no longer be accessible. For example, if you add `SNOWFLAKE.CORTEX` to your search path,
> the string function [TRANSLATE](functions/translate.md) won’t be accessible since the
> [SNOWFLAKE.CORTEX.TRANSLATE](functions/translate-snowflake-cortex.md) function exists.

You can modify the search path using ALTER SESSION, ALTER USER, or ALTER ACCOUNT.

| Command | Notes |
| --- | --- |
| [ALTER SESSION](sql/alter-session.md) | Modifies the search path for the current session only. You can modify your own search path at the session level. A session-level change overrides the account-level or user-level setting. |
| [ALTER USER](sql/alter-user.md) | Modifies the search path persistently for the current or specified user. You can modify your own search path at the user level. An administrator can modify another user’s search path. A user-level change overrides the account-level or session-level setting. |
| [ALTER ACCOUNT](sql/alter-account.md) | Modifies the search path persistently for all users in the account. An administrator must modify the search path at the account level. |

1. Execute the following statement and copy your current search path from the `value` column:

   ```sqlexample
   SHOW PARAMETERS LIKE 'search_path';
   ```
2. Update your search path.

   > **Note:**
   >
   > The examples below use the default search path, `$current, $public`. If your search path in the `value`
   > column from the previous step does not match the default value, edit the example statements below to include your actual
   > search path.

   For example, to add SNOWFLAKE.ML to your search path for your current session, execute the following statement:

   ```sqlexample
   ALTER SESSION SET SEARCH_PATH = '$current, $public, SNOWFLAKE.ML';
   ```

   To add SNOWFLAKE.ML to your own search path at the user level, execute the following statement:

   ```sqlexample
   ALTER USER SET SEARCH_PATH = '$current, $public, SNOWFLAKE.ML';
   ```

   A user with the ACCOUNTADMIN role can update the search path for the account by executing the following statement:

   ```sqlexample
   ALTER ACCOUNT SET SEARCH_PATH = '$current, $public, SNOWFLAKE.ML';
   ```

For more information on how Snowflake resolves names, see [Object name resolution](name-resolution.md).

### Class methods

A class provides a public API through stored procedures and functions. Collectively they are referred to as class *methods*. To list
all the methods for a class, including the arguments required for each method, execute the
[SHOW FUNCTIONS IN CLASS](sql/show-functions.md) and
[SHOW PROCEDURES IN CLASS](sql/show-procedures.md) commands. A class might include multiple methods with the same name
but different signatures (that is to say, a different number of arguments or argument data types).

> **Note:**
>
> The example statements in this topic use the non-qualified class name ANOMALY_DETECTION. If you have not
> updated your search path to include SNOWFLAKE.ML, use the fully qualified
> name for the SNOWFLAKE.ML.ANOMALY_DETECTION class.

For example, to list all the functions available in the SNOWFLAKE.ML.ANOMALY_DETECTION class, execute the following statement:

```sqlexample
SHOW FUNCTIONS IN CLASS ANOMALY_DETECTION;
```

```output
+-----------------------+-------------------+-------------------+--------------------------------------------------------------------------+--------------+----------+
| name                  | min_num_arguments | max_num_arguments | arguments                                                                | descriptions | language |
|-----------------------+-------------------+-------------------+--------------------------------------------------------------------------+--------------+----------|
| _DETECT_ANOMALIES_1_1 |                 5 |                 5 | (MODEL BINARY, TS TIMESTAMP_NTZ, Y FLOAT, FEATURES ARRAY, CONFIG OBJECT) | NULL         | Python   |
| _FIT                  |                 3 |                 3 | (TS TIMESTAMP_NTZ, Y FLOAT, FEATURES ARRAY)                              | NULL         | Python   |
| _FIT                  |                 4 |                 4 | (TS TIMESTAMP_NTZ, Y FLOAT, LABEL BOOLEAN, FEATURES ARRAY)               | NULL         | Python   |
+-----------------------+-------------------+-------------------+--------------------------------------------------------------------------+--------------+----------+
```

To list all the stored procedures in the SNOWFLAKE.ML.ANOMALY_DETECTION class, execute the following statement:

```sqlexample
SHOW PROCEDURES IN CLASS ANOMALY_DETECTION;
```

The results below include the stored procedures in the class for which the current role in the session has been granted
access privileges:

```output
+---------------------------------+-------------------+-------------------+------------------------------------------------------------------------------------------------------------------------------------------+--------------+------------+
| name                            | min_num_arguments | max_num_arguments | arguments                                                                                                                                | descriptions | language   |
|---------------------------------+-------------------+-------------------+------------------------------------------------------------------------------------------------------------------------------------------+--------------+------------|
| __CONSTRUCT                     |                 4 |                 4 | (INPUT_DATA VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR, LABEL_COLNAME VARCHAR)                                           | NULL         | Javascript |
| __CONSTRUCT                     |                 5 |                 5 | (INPUT_DATA VARCHAR, SERIES_COLNAME VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR, LABEL_COLNAME VARCHAR)                   | NULL         | Javascript |
| DETECT_ANOMALIES                |                 4 |                 4 | (INPUT_DATA VARCHAR, SERIES_COLNAME VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR)                                          | NULL         | SQL        |
| DETECT_ANOMALIES                |                 5 |                 5 | (INPUT_DATA VARCHAR, SERIES_COLNAME VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR, CONFIG_OBJECT OBJECT)                    | NULL         | SQL        |
| DETECT_ANOMALIES                |                 3 |                 3 | (INPUT_DATA VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR)                                                                  | NULL         | SQL        |
| DETECT_ANOMALIES                |                 4 |                 4 | (INPUT_DATA VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR, CONFIG_OBJECT OBJECT)                                            | NULL         | SQL        |
| EXPLAIN_FEATURE_IMPORTANCE      |                 0 |                 0 | ()                                                                                                                                       | NULL         | SQL        |
| _CONSTRUCTFEATUREINPUT          |                 6 |                 6 | (INPUT_REF VARCHAR, SERIES_COLNAME VARCHAR, TIMESTAMP_COLNAME VARCHAR, TARGET_COLNAME VARCHAR, LABEL_COLNAME VARCHAR, REF_ALIAS VARCHAR) | NULL         | Javascript |
| _CONSTRUCTINFERENCEFUNCTIONNAME |                 0 |                 0 | ()                                                                                                                                       | NULL         | SQL        |
| _CONSTRUCTINFERENCERESULTAPI    |                 0 |                 0 | ()                                                                                                                                       | NULL         | SQL        |
| _SETTRAININGINFO                |                 0 |                 0 | ()                                                                                                                                       | NULL         | SQL        |
+---------------------------------+-------------------+-------------------+------------------------------------------------------------------------------------------------------------------------------------------+--------------+------------+
```

### Class roles

A class might have one or more roles that are granted the USAGE privilege on some or all class methods. You can list the available roles in
a class using the [SHOW ROLES IN CLASS](sql/show-roles.md) command.

List all the roles in the SNOWFLAKE.ML.ANOMALY_DETECTION class:

```sqlexample
SHOW ROLES IN CLASS ANOMALY_DETECTION;
```

```output
+-------------------------------+------+---------+
| created_on                    | name | comment |
|-------------------------------+------+---------|
| 2023-06-06 01:06:42.808 +0000 | USER | NULL    |
+-------------------------------+------+---------+
```

#### Instance roles

Roles are defined in the class and instantiated in the instance as an *instance role*. An instance role can be granted to a role in your
account to enable access to instance methods.

For example, if you have an ANOMALY_DETECTION instance `my_anomaly_detector` in schema `my_db.my_schema`, you can view
the privileges granted to the instance role USER using the following statement:

```sqlexample
SHOW GRANTS TO SNOWFLAKE.ML.ANOMALY_DETECTION ROLE my_db.my_schema.my_anomaly_detector!USER;
```

To grant the instance role to role `my_role` in your account, execute the following statement:

```sqlexample
GRANT SNOWFLAKE.ML.ANOMALY_DETECTION ROLE my_db.my_schema.my_anomaly_detector!USER
  TO ROLE my_role;
```

The above statement enables the role `my_role` to execute methods of the ANOMALY_DETECTOR instance `my_anomaly_detector`.

> **Note:**
>
> The role `my_role` must also have the USAGE privilege on database `my_db` and schema `my_schema`.
> Role `my_role` must also have the appropriate privileges on objects passed to instance methods.

## Grant the privilege to create class instances

In order to create an instance of a class, a role must be granted the CREATE *<class_name>* privilege.

For example, to enable the `ml_admin` role to create SNOWFLAKE.ML.ANOMALY_DETECTION instances in the `mydb.myschema`
schema, execute the following statement:

```sqlexample
GRANT CREATE ANOMALY_DETECTION ON SCHEMA mydb.myschema TO ROLE ml_admin;
```

## Create an instance

You can create an instance of a class using the CREATE *<object>* command and the class constructor method.

> **Note:**
>
> Instance names in a schema must be unique irrespective of the class they were created from. For example, if you have
> an instance of the [BUDGET (SNOWFLAKE.CORE)](classes/budget.md) class named `foo`, you can’t create an instance of the
> [ANOMALY_DETECTION (SNOWFLAKE.ML)](classes/anomaly_detection.md) class named `foo` in the same schema.

For example, to create an anomaly detector `my_anomaly_detector` instance, execute the following statement:

```sqlexample
CREATE ANOMALY_DETECTION my_anomaly_detector(
  INPUT_DATA => SYSTEM$REFERENCE('VIEW', 'my_view'),
  TIMESTAMP_COLUMN => 'my_timestamp_column'
  TARGET_COLNAME => 'my_target_column',
  LABEL_COLNAME => ''
);
```

## Use an instance

After you create an instance of a class, you can call the instance methods that the class
provides. Calling a method requires the exclamation point (`!`) character. The `!` character is used to dereference
the instance.

For example, to call the DETECT_ANOMALIES method of the anomaly detector `my_anomaly_detector`, execute the following
statement:

```sqlexample
CALL my_anomaly_detector!DETECT_ANOMALIES(
  INPUT_DATA => SYSTEM$REFERENCE('VIEW', 'my_view'),
  TIMESTAMP_COLNAME =>'my_timestamp_column',
  TARGET_COLNAME => 'my_target_column'
);
```

## Selecting columns from SQL class instance methods that return tabular data

Some methods return tabular data (for example, methods in the [ANOMALY_DETECTION](classes/anomaly_detection.md)
and [FORECAST](classes/forecast.md) classes). To select and manipulate this tabular data, you can call these
methods in the [FROM](constructs/from.md) clause of a SELECT statement.

When calling the method, omit the [CALL](sql/call.md) command. Instead, put the call in parentheses, preceded by the
TABLE keyword:

```sqlsyntax
SELECT ... FROM TABLE( <method_name>( <arg> [ , ... <arg> ] ) );
```

For example, to select the `ts`, `forecast`, and `is_anomaly` columns from the tabular data returned by the
[DETECT_ANOMALIES](classes/anomaly-detection/methods/detect_anomalies.md) method of the anomaly detector
`my_anomaly_detector`:

```sqlexample
SELECT ts, forecast, is_anomaly FROM TABLE(
  my_anomaly_detector!DETECT_ANOMALIES(
    INPUT_DATA => TABLE('my_view'),
    TIMESTAMP_COLNAME =>'my_timestamp_column',
    TARGET_COLNAME => 'my_target_column'
  )
);
```

If you pass in a reference to a query, the query cannot refer to any [common table expressions](../user-guide/queries-cte.md)
defined outside of the reference. For example, executing the following statement results in an error because the query reference
refers to `my_data`, which is defined in the outer [WITH](constructs/with.md) clause:

```sqlexample
WITH my_data AS (
  SELECT * FROM my_view
)
SELECT ts, forecast FROM TABLE(
  my_anomaly_detector!DETECT_ANOMALIES(
    INPUT_DATA => TABLE('SELECT * FROM my_data'),
    TIMESTAMP_COLNAME =>'my_timestamp_column',
    TARGET_COLNAME => 'my_target_column'
  )
);
```

To work around this limitation, move the WITH clause inside the query reference:

```sqlexample
SELECT ts, forecast FROM TABLE(
  my_anomaly_detector!DETECT_ANOMALIES(
    INPUT_DATA => TABLE('
      WITH my_data AS (
        SELECT * FROM my_view
      )
      SELECT * FROM my_data
    '),
    TIMESTAMP_COLNAME =>'my_timestamp_column',
    TARGET_COLNAME => 'my_target_column'
  )
);
```

## Available classes

For a list of available Snowflake classes, see [SQL class reference](../sql-reference-classes.md).

## Limitations

[Replication](../user-guide/account-replication-intro.md) is supported only for instances
of the [CUSTOM_CLASSIFIER](classes/custom_classifier.md) class.

---
title: SNOWFLAKE database
source: https://docs.snowflake.com/en/sql-reference/snowflake-db.md
section: SQL General Reference
---

# SNOWFLAKE database

Snowflake provides a system-defined, read-only shared database named SNOWFLAKE that contains metadata and historical usage data
about the objects in your organization and accounts.
The SNOWFLAKE database is an example of [Secure Data Sharing](../guides-overview-sharing.md), and provides object metadata and other usage metrics for your organization and accounts.

In each account, the SNOWFLAKE database contains the following schemas (also read-only):

ACCOUNT_USAGE:
:   Views that display object metadata and usage metrics for your account.

ALERT:
:   Functions that are intended for use in [alert objects](../user-guide/alerts.md).

BILLING:
:   Views that contains billing information for the customers of Snowflake resellers and distributors. Only resellers and distributors
    can access the views in the BILLING schema.

CORE:
:   Contains views and other schema objects to support select Snowflake features, such as the
    [system tags](../user-guide/classify-intro.md) used with classifying data and the
    [system data metric functions](../user-guide/data-quality-system-dmfs.md) used to measure data quality.

DATA_PRIVACY:
:   Contains functions and stored procedures related to data privacy. Also contains the
    [custom_classifier class](../user-guide/classify-custom.md).

DATA_SHARING_USAGE:
:   Views that display object metadata and usage metrics related to listings published in the Snowflake Marketplace or
    a data exchange.

EXTERNAL_ACCESS:
:   Schema that contains built-in network rules specific to connections for network traffic outbound from Snowflake.
    For information about egress network rules, see [Snowflake-managed egress network rules](../user-guide/network-rules.md).

INFORMATION_SCHEMA:
:   This schema is automatically created in all databases. In a shared database, such as SNOWFLAKE, this schema doesn’t
    serve a purpose and can be disregarded.

LOCAL:
:   This schema is used by some account-level Snowflake features for logging to [telemetry event tables](../developer-guide/logging-tracing/event-table-setting-up.md).
    For more information about this schema, see [LOCAL](local.md).

ML:
:   Contains [ML functions](../guides-overview-ml-functions.md), which is a suite of analysis tools built by Snowflake.

MONITORING:
:   Views that provide historical information for objects in your account. In the
    [Information Schema](info-schema.md), the views and table functions that return historical information will eventually be
    migrated to the MONITORING schema in the future.

NETWORK_SECURITY:
:   Schema that contains built-in network rules that define the set of allowed IP addresses that a frequently used, third-party
    partner application uses to connect with Snowflake. For more information about Snowflake-managed network rules, see [Snowflake-managed
    network rules](../user-guide/network-rules.md). This schema also contains stored procedures for the
    [Network Policy Advisor](../user-guide/network-policy-advisor.md), including
    [RECOMMEND_NETWORK_POLICY](stored-procedures/recommend_network_policy.md) and
    [EVALUATE_CANDIDATE_NETWORK_POLICY](stored-procedures/evaluate_candidate_network_policy.md).

NOTIFICATION:
:   Stored procedures and functions for [sending notifications](../user-guide/notifications/snowflake-notifications.md).

ORGANIZATION_USAGE:
:   Views that display historical usage data across all the accounts in your organization.

READER_ACCOUNT_USAGE:
:   Similar to ACCOUNT_USAGE, but only contains views relevant to the reader accounts (if any) provisioned for the
    account.

SPCS:
:   Functions for use with [Snowpark Container Services](../developer-guide/snowpark-container-services/working-with-services.md).

TELEMETRY:
:   Tables, views, and stored procedures to support [collecting telemetry data](../developer-guide/logging-tracing/logging-tracing-overview.md)
    such as log messages, trace event data, and metrics data.

TRUST_CENTER:
:   Views that display data about the [Trust Center extensions](../user-guide/trust-center/trust-center-extensions.md).

Some SNOWFLAKE schemas include classes. A class is an extensible object type that encapsulates object data and code. For more information,
see [Snowflake classes](snowflake-db-classes.md).

> **Important:**
>
> By default, the SNOWFLAKE database is visible to all users. This does not mean all objects within the SNOWFLAKE database are accessible
> to all users.
>
> Objects that are not meant to be accessible by default remain inaccessible unless access is explicitly granted by a user with the
> ACCOUNTADMIN role, including access to the ACCOUNT_USAGE, READER_ACCOUNT_USAGE, ORGANIZATION_USAGE, and DATA_SHARING_USAGE schemas.
>
> Privileges to perform other actions on these views can be granted to other roles in your account. For more information, see
> [Enabling other roles to use schemas in the SNOWFLAKE database](account-usage.md).

---
title: SNOWFLAKE database roles
source: https://docs.snowflake.com/en/sql-reference/snowflake-db-roles.md
section: SQL General Reference
---

# SNOWFLAKE database roles

When an account is provisioned, the SNOWFLAKE database is automatically imported.
The database is an example of Snowflake using [Secure Data Sharing](../user-guide/data-sharing-gs.md) to provide object metadata and other usage metrics for your organization and accounts.

Access to schema objects in the SNOWFLAKE database is controlled by different [database roles](../user-guide/security-access-control-considerations.md).
The following sections describe each SNOWFLAKE database role, its associated privileges, and the associated schema objects the role is granted access to.

## ACCOUNT_USAGE schema

[ACCOUNT_USAGE](account-usage.md) schemas have four defined SNOWFLAKE database roles, each granted the SELECT privilege on specific views.

| Role | Purpose and Description |
| --- | --- |
| OBJECT_VIEWER | The OBJECT_VIEWER role provides visibility into object metadata. |
| USAGE_VIEWER | The USAGE_VIEWER role provides visibility into historical usage information. |
| GOVERNANCE_VIEWER | The GOVERNANCE_VIEWER role provides visibility into data governance related information. |
| SECURITY_VIEWER | The SECURITY_VIEWER role provides visibility into security based information. |

### Database role required to access ACCOUNT_USAGE views

The OBJECT_VIEWER, USAGE_VIEWER, GOVERNANCE_VIEWER, and SECURITY_VIEWER roles have the SELECT privilege to query Account Usage
views in the shared SNOWFLAKE database. Use the following table to determine which database role has access to a view.

| View | Database Role |
| --- | --- |
| [ACCESS_HISTORY view](account-usage/access_history.md) | GOVERNANCE_VIEWER |
| [APPLICATION_CONFIGURATIONS view](account-usage/application_configurations.md) | SECURITY_VIEWER |
| [AGGREGATE_ACCESS_HISTORY view](account-usage/aggregate_access_history.md) | GOVERNANCE_VIEWER |
| [AGGREGATE_QUERY_HISTORY view](account-usage/aggregate_query_history.md) | GOVERNANCE_VIEWER |
| [AGGREGATION_POLICIES view](account-usage/aggregation_policies.md) | GOVERNANCE_VIEWER |
| [ANOMALIES_DAILY view](account-usage/anomalies_daily.md) | USAGE_VIEWER |
| [APPLICATION_CALLBACK_HISTORY view](account-usage/application_callback_history.md) | SECURITY_VIEWER |
| [APPLICATION_CONFIGURATION_VALUE_HISTORY view](account-usage/application_configuration_value_history.md) | SECURITY_VIEWER |
| [APPLICATION_DAILY_USAGE_HISTORY view](account-usage/application_daily_usage_history.md) | USAGE_VIEWER |
| [APPLICATION_SPECIFICATION_STATUS_HISTORY view](account-usage/application_specification_status_history.md) | SECURITY_VIEWER |
| [APPLICATION_SPECIFICATIONS view](account-usage/application_specifications.md) | SECURITY_VIEWER |
| [ARCHIVE_STORAGE_DATA_RETRIEVAL_USAGE_HISTORY view](account-usage/archive_storage_data_retrieval_usage_history.md) | USAGE_VIEWER |
| [AUTOMATIC_CLUSTERING_HISTORY view](account-usage/automatic_clustering_history.md) | USAGE_VIEWER |
| [BLOCK_STORAGE_HISTORY view](account-usage/block_storage_history.md) | USAGE_VIEWER |
| [BLOCK_STORAGE_SNAPSHOTS view](account-usage/block_storage_snapshots.md) | OBJECT_VIEWER |
| [CATALOG_LINKED_DATABASE_USAGE_HISTORY view](account-usage/catalog_linked_database_usage_history.md) | USAGE_VIEWER |
| [CLASS_INSTANCES view](account-usage/class_instances.md) | USAGE_VIEWER |
| [CLASSES view](account-usage/classes.md) | USAGE_VIEWER |
| [COLUMN_QUERY_PRUNING_HISTORY view](account-usage/column_query_pruning_history.md) | USAGE_VIEWER |
| [COLUMNS view](account-usage/columns.md) | OBJECT_VIEWER |
| [COMPLETE_TASK_GRAPHS view](account-usage/complete_task_graphs.md) | OBJECT_VIEWER |
| [CONTACT_REFERENCES view](account-usage/contact_references.md) | GOVERNANCE_VIEWER |
| [CONTACTS view](account-usage/contacts.md) | GOVERNANCE_VIEWER |
| [COPY_FILES_HISTORY view](account-usage/copy_files_history.md) | USAGE_VIEWER |
| [COPY_HISTORY view](account-usage/copy_history.md) | USAGE_VIEWER |
| [CORTEX_AI_FUNCTIONS_USAGE_HISTORY view](account-usage/cortex_ai_functions_usage_history.md) | USAGE_VIEWER |
| [CORTEX_AGENT_USAGE_HISTORY view](account-usage/cortex_agent_usage_history.md) | USAGE_VIEWER |
| [CORTEX_AISQL_USAGE_HISTORY view](account-usage/cortex_aisql_usage_history.md) | USAGE_VIEWER |
| [CORTEX_ANALYST_USAGE_HISTORY view](account-usage/cortex_analyst_usage_history.md) | USAGE_VIEWER |
| [CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view](account-usage/cortex_document_processing_usage_history.md) | USAGE_VIEWER |
| [CORTEX_FINE_TUNING_USAGE_HISTORY view](account-usage/cortex_fine_tuning_usage_history.md) | USAGE_VIEWER |
| [CORTEX_FUNCTIONS_QUERY_USAGE_HISTORY view](account-usage/cortex_functions_query_usage_history.md) | USAGE_VIEWER |
| [CORTEX_FUNCTIONS_USAGE_HISTORY view](account-usage/cortex_functions_usage_history.md) | USAGE_VIEWER |
| [CORTEX_SEARCH_BATCH_QUERY_USAGE_HISTORY view](account-usage/cortex_search_batch_query_usage_history.md) | USAGE_VIEWER |
| [CORTEX_SEARCH_DAILY_USAGE_HISTORY view](account-usage/cortex_search_daily_usage_history.md) | USAGE_VIEWER |
| [CORTEX_PROVISIONED_THROUGHPUT_USAGE_HISTORY view](account-usage/cortex_provisioned_throughput_usage_history.md) | USAGE_VIEWER |
| [CORTEX_REST_API_USAGE_HISTORY view](account-usage/cortex_rest_api_usage_history.md) | USAGE_VIEWER |
| [CORTEX_SEARCH_SERVING_USAGE_HISTORY view](account-usage/cortex_search_serving_usage_history.md) | USAGE_VIEWER |
| [CREDENTIALS view](account-usage/credentials.md) | SECURITY_VIEWER |
| [DATA_CLASSIFICATION_HISTORY view](account-usage/data_classification_history.md) | GOVERNANCE_VIEWER |
| [DATA_CLASSIFICATION_LATEST view](account-usage/data_classification_latest.md) | GOVERNANCE_VIEWER |
| [DATA_METRIC_FUNCTION_EXPECTATIONS view](account-usage/data_metric_function_expectations.md) | USAGE_VIEWER or GOVERNANCE_VIEWER |
| [DATA_METRIC_FUNCTION_REFERENCES view](account-usage/data_metric_function_references.md) | USAGE_VIEWER or GOVERNANCE_VIEWER |
| [DATA_QUALITY_MONITORING_USAGE_HISTORY view](account-usage/data_quality_monitoring_usage_history.md) | USAGE_VIEWER |
| [DATA_TRANSFER_HISTORY view](account-usage/data_transfer_history.md) | USAGE_VIEWER |
| [DATABASE_STORAGE_USAGE_HISTORY view](account-usage/database_storage_usage_history.md) | USAGE_VIEWER |
| [DATABASES view](account-usage/databases.md) | OBJECT_VIEWER |
| [DOCUMENT_AI_USAGE_HISTORY view](account-usage/document_ai_usage_history.md) | USAGE_VIEWER |
| [DYNAMIC_TABLE_REFRESH_HISTORY view](account-usage/dynamic_table_refresh_history.md) | USAGE_VIEWER |
| [ELEMENT_TYPES view](account-usage/element_types.md) | OBJECT_VIEWER |
| [EVENT_USAGE_HISTORY view](account-usage/event_usage_history.md) | USAGE_VIEWER |
| [EXTERNAL_ACCESS_HISTORY view](account-usage/external_access_history.md) | USAGE_VIEWER |
| [FIELDS view](account-usage/fields.md) | OBJECT_VIEWER |
| [FILE_FORMATS view](account-usage/file_formats.md) | OBJECT_VIEWER |
| [FUNCTIONS view](account-usage/functions.md) | OBJECT_VIEWER |
| [GRANTS_TO_ROLES view](account-usage/grants_to_roles.md) | SECURITY_VIEWER |
| [GRANTS_TO_SHARES view](account-usage/grants_to_shares.md) | SECURITY_VIEWER |
| [GRANTS_TO_USERS view](account-usage/grants_to_users.md) | SECURITY_VIEWER |
| [HYBRID_TABLE_USAGE_HISTORY view](account-usage/hybrid_table_usage_history.md) | USAGE_VIEWER |
| [HYBRID_TABLES view](account-usage/hybrid_tables.md) | OBJECT_VIEWER |
| [ICEBERG_STORAGE_OPTIMIZATION_HISTORY view](account-usage/iceberg_storage_optimization_history.md) | USAGE_VIEWER |
| [INDEX_COLUMNS view](account-usage/index_columns.md) | OBJECT_VIEWER |
| [INDEXES view](account-usage/indexes.md) | OBJECT_VIEWER |
| [INGRESS_NETWORK_ACCESS_HISTORY view](account-usage/ingress_network_access_history.md) | SECURITY_VIEWER |
| [INTERNAL_DATA_TRANSFER_HISTORY view](account-usage/internal_data_transfer_history.md) | USAGE_VIEWER |
| [INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](account-usage/internal_stage_network_access_history.md) | SECURITY_VIEWER |
| [JOIN_POLICIES view](account-usage/join_policies.md) | GOVERNANCE_VIEWER |
| [LISTINGS view](account-usage/listings.md) | SECURITY_VIEWER |
| [LOAD_HISTORY view](account-usage/load_history.md) | USAGE_VIEWER |
| [LOGIN_HISTORY view](account-usage/login_history.md) | SECURITY_VIEWER |
| [MASKING_POLICIES view](account-usage/masking_policies.md) | GOVERNANCE_VIEWER |
| [MATERIALIZED_VIEW_REFRESH_HISTORY view](account-usage/materialized_view_refresh_history.md) | USAGE_VIEWER |
| [METERING_DAILY_HISTORY view](account-usage/metering_daily_history.md) | USAGE_VIEWER |
| [METERING_HISTORY view](account-usage/metering_history.md) | USAGE_VIEWER |
| [NETWORK_POLICIES view](account-usage/network_policies.md) | SECURITY_VIEWER |
| [NETWORK_RULE_REFERENCES view](account-usage/network_rule_references.md) | SECURITY_VIEWER |
| [NETWORK_RULES view](account-usage/network_rules.md) | SECURITY_VIEWER |
| [NOTEBOOKS_CONTAINER_RUNTIME_HISTORY view](account-usage/notebooks_container_runtime_history.md) | USAGE_VIEWER |
| [OBJECT_ACCESS_REQUEST_HISTORY view](account-usage/object_access_request_history.md) | OBJECT_VIEWER |
| [OBJECT_DEPENDENCIES view](account-usage/object_dependencies.md) | OBJECT_VIEWER |
| [ACCOUNT_USAGE.ONLINE_FEATURE_TABLE_REFRESH_HISTORY](account-usage/online_feature_table_refresh_history.md) | USAGE_VIEWER |
| [OPENFLOW_USAGE_HISTORY view](account-usage/openflow_usage_history.md) | USAGE_VIEWER |
| [OUTBOUND_PRIVATELINK_ENDPOINTS view](account-usage/outbound_privatelink_endpoints.md) | SECURITY_VIEWER |
| [PASSWORD_POLICIES view](account-usage/password_policies.md) | SECURITY_VIEWER |
| [PIPE_USAGE_HISTORY view](account-usage/pipe_usage_history.md) | USAGE_VIEWER |
| [PIPES view](account-usage/pipes.md) | OBJECT_VIEWER |
| [POLICY_REFERENCES view](account-usage/policy_references.md) | GOVERNANCE_VIEWER, SECURITY_VIEWER |
| [POSTGRES_STORAGE_USAGE_HISTORY view](account-usage/postgres_storage_usage_history.md) | USAGE_VIEWER |
| [PRIVACY_BUDGETS view](account-usage/privacy_budgets.md) | GOVERNANCE_VIEWER |
| [PRIVACY_POLICIES view](account-usage/privacy_policies.md) | GOVERNANCE_VIEWER |
| [PROCEDURES view](account-usage/procedures.md) | OBJECT_VIEWER |
| [PROJECTION_POLICIES view](account-usage/projection_policies.md) | GOVERNANCE_VIEWER |
| [QUERY_ACCELERATION_ELIGIBLE view](account-usage/query_acceleration_eligible.md) | GOVERNANCE_VIEWER |
| [QUERY_ATTRIBUTION_HISTORY view](account-usage/query_attribution_history.md) | USAGE_VIEWER, GOVERNANCE_VIEWER |
| [QUERY_HISTORY view](account-usage/query_history.md) | GOVERNANCE_VIEWER |
| [QUERY_INSIGHTS view](account-usage/query_insights.md) | GOVERNANCE_VIEWER |
| [REFERENTIAL_CONSTRAINTS view](account-usage/referential_constraints.md) | OBJECT_VIEWER |
| [REPLICATION_GROUP_REFRESH_HISTORY view](account-usage/replication_group_refresh_history.md) | USAGE_VIEWER |
| [REPLICATION_GROUP_USAGE_HISTORY view](account-usage/replication_group_usage_history.md) | USAGE_VIEWER |
| [REPLICATION_GROUPS view](account-usage/replication_groups.md) | OBJECT_VIEWER |
| [REPLICATION_USAGE_HISTORY view](account-usage/replication_usage_history.md) | USAGE_VIEWER |
| [RESOURCE_MONITORS view](account-usage/resource_monitors.md) | OBJECT_VIEWER |
| [ROLES view](account-usage/roles.md) | SECURITY_VIEWER |
| [ROW_ACCESS_POLICIES view](account-usage/row_access_policies.md) | GOVERNANCE_VIEWER |
| [SCHEMATA view](account-usage/schemata.md) | OBJECT_VIEWER |
| [SEARCH_OPTIMIZATION_BENEFITS view](account-usage/search_optimization_benefits.md) | USAGE_VIEWER |
| [SEARCH_OPTIMIZATION_HISTORY view](account-usage/search_optimization_history.md) | USAGE_VIEWER |
| [SECRETS view](account-usage/secrets.md) | SECURITY_VIEWER |
| [SEMANTIC_DIMENSIONS view](account-usage/semantic_dimensions.md) | OBJECT_VIEWER |
| [SEMANTIC_FACTS view](account-usage/semantic_facts.md) | OBJECT_VIEWER |
| [SEMANTIC_METRICS view](account-usage/semantic_metrics.md) | OBJECT_VIEWER |
| [SEMANTIC_RELATIONSHIPS view](account-usage/semantic_relationships.md) | OBJECT_VIEWER |
| [SEMANTIC_TABLES view](account-usage/semantic_tables.md) | OBJECT_VIEWER |
| [SEMANTIC_VIEWS view](account-usage/semantic_views.md) | OBJECT_VIEWER |
| [SEQUENCES view](account-usage/sequences.md) | OBJECT_VIEWER |
| [SERVERLESS_ALERT_HISTORY view](account-usage/serverless_alert_history.md) | USAGE_VIEWER |
| [SERVERLESS_TASK_HISTORY view](account-usage/serverless_task_history.md) | USAGE_VIEWER |
| [SERVICES view](account-usage/services.md) | OBJECT_VIEWER |
| [SESSION_POLICIES view](account-usage/session_policies.md) | SECURITY_VIEWER |
| [SESSIONS view](account-usage/sessions.md) | SECURITY_VIEWER |
| [SHARES view](account-usage/shares.md) | SECURITY_VIEWER |
| [SNAPSHOT_OPERATION_HISTORY view — Deprecated](account-usage/snapshot_operation_history.md) | OBJECT_VIEWER |
| [SNAPSHOT_POLICIES view — Deprecated](account-usage/snapshot_policies.md) | OBJECT_VIEWER |
| [SNAPSHOT_SETS view — Deprecated](account-usage/snapshot_sets.md) | OBJECT_VIEWER |
| [SNAPSHOT_STORAGE_USAGE view — Deprecated](account-usage/snapshot_storage_usage.md) | OBJECT_VIEWER |
| [SNAPSHOTS view — Deprecated](account-usage/snapshots.md) | OBJECT_VIEWER |
| [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view](account-usage/snowflake_intelligence_usage_history_view.md) | USAGE_VIEWER |
| [SNOWPARK_CONTAINER_SERVICES_HISTORY view](account-usage/snowpark_container_services_history.md) | USAGE_VIEWER |
| [SNOWPIPE_STREAMING_CHANNEL_HISTORY view](account-usage/snowpipe_streaming_channel_history.md) | USAGE_VIEWER |
| [STAGE_STORAGE_USAGE_HISTORY view](account-usage/stage_storage_usage_history.md) | USAGE_VIEWER |
| [STAGES view](account-usage/stages.md) | OBJECT_VIEWER |
| [STORAGE_LIFECYCLE_POLICIES view](account-usage/storage_lifecycle_policies.md) | GOVERNANCE_VIEWER |
| [STORAGE_LIFECYCLE_POLICY_HISTORY view](account-usage/storage_lifecycle_policy_history.md) | GOVERNANCE_VIEWER |
| [STORAGE_REQUEST_HISTORY view](account-usage/storage_request_history.md) | USAGE_VIEWER |
| [STORAGE_USAGE view](account-usage/storage_usage.md) | USAGE_VIEWER |
| [TABLE_CONSTRAINTS view](account-usage/table_constraints.md) | OBJECT_VIEWER |
| [TABLE_DML_HISTORY view](account-usage/table_dml_history.md) | USAGE_VIEWER |
| [TABLE_PRUNING_HISTORY view](account-usage/table_pruning_history.md) | USAGE_VIEWER |
| [TABLE_QUERY_PRUNING_HISTORY view](account-usage/table_query_pruning_history.md) | USAGE_VIEWER |
| [TABLE_STORAGE_METRICS view](account-usage/table_storage_metrics.md) | USAGE_VIEWER |
| [TABLES view](account-usage/tables.md) | OBJECT_VIEWER |
| [TAG_REFERENCES view](account-usage/tag_references.md) | GOVERNANCE_VIEWER |
| [TAGS view](account-usage/tags.md) | OBJECT_VIEWER or GOVERNANCE_VIEWER |
| [TASK_HISTORY view](account-usage/task_history.md) | USAGE_VIEWER |
| [TRUST_CENTER_FINDINGS view](account-usage/trust_center_findings.md) | SECURITY_VIEWER |
| [USERS view](account-usage/users.md) | SECURITY_VIEWER |
| [VIEWS view](account-usage/views.md) | OBJECT_VIEWER |
| [WAREHOUSE_EVENTS_HISTORY view](account-usage/warehouse_events_history.md) | USAGE_VIEWER |
| [WAREHOUSE_LOAD_HISTORY view](account-usage/warehouse_load_history.md) | USAGE_VIEWER |
| [WAREHOUSE_METERING_HISTORY view](account-usage/warehouse_metering_history.md) | USAGE_VIEWER |

## READER_ACCOUNT_USAGE schema

The READER_USAGE_VIEWER SNOWFLAKE database role is granted SELECT privilege on all READER_ACCOUNT_USAGE views.
As reader accounts are created by clients, the READER_USAGE_VIEWER role is expected to be granted to those roles used to monitor reader account use.

| View |
| --- |
| [LOGIN_HISTORY view](account-usage/login_history.md) |
| [QUERY_HISTORY view](account-usage/query_history.md) |
| [RESOURCE_MONITORS view](account-usage/resource_monitors.md) |
| [STORAGE_USAGE view](account-usage/storage_usage.md) |
| [WAREHOUSE_METERING_HISTORY view](account-usage/warehouse_metering_history.md) |

## ORGANIZATION_USAGE schema

The ORGANIZATION_USAGE_VIEWER, ORGANIZATION_BILLING_VIEWER, and ORGANIZATION_ACCOUNTS_VIEWER SNOWFLAKE database roles are granted the SELECT privilege on Organization Usage views in the shared SNOWFLAKE database.

| View | ORGANIZATION_BILLING_VIEWER Role | ORGANIZATION_USAGE_VIEWER Role | ORGANIZATION_ACCOUNTS_VIEWER Role |
| --- | --- | --- | --- |
| [ACCOUNTS view](organization-usage/accounts.md) |  |  | ✔ |
| [ANOMALIES_IN_CURRENCY_DAILY view](organization-usage/anomalies_in_currency_daily.md) | ✔ |  |  |
| [CONTRACT_ITEMS view](organization-usage/contract_items.md) | ✔ |  |  |
| [LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view](organization-usage/listing_auto_fulfillment_usage_history.md) | ✔ |  |  |
| [RATE_SHEET_DAILY view](organization-usage/rate_sheet_daily.md) | ✔ |  |  |
| [REMAINING_BALANCE_DAILY view](organization-usage/remaining_balance_daily.md) | ✔ |  |  |
| [USAGE_IN_CURRENCY_DAILY view](organization-usage/usage_in_currency_daily.md) | ✔ |  |  |
| [MARKETPLACE_DISBURSEMENT_REPORT View](../collaboration/views/marketplace-disbursement-report-org.md) | ✔ |  |  |
| [DATA_TRANSFER_DAILY_HISTORY view](organization-usage/data_transfer_daily_history.md) |  | ✔ |  |
| [DATA_TRANSFER_HISTORY view](organization-usage/data_transfer_history.md) |  | ✔ |  |
| [DATABASE_STORAGE_USAGE_HISTORY view](organization-usage/database_storage_usage_history.md) |  | ✔ |  |
| [AUTOMATIC_CLUSTERING_HISTORY view](organization-usage/automatic_clustering_history.md) |  | ✔ |  |
| [MARKETPLACE_PAID_USAGE_DAILY View](../collaboration/views/marketplace-paid-usage-daily-org.md) |  | ✔ |  |
| [MATERIALIZED_VIEW_REFRESH_HISTORY view](account-usage/materialized_view_refresh_history.md) |  | ✔ |  |
| [METERING_DAILY_HISTORY view](organization-usage/metering_daily_history.md) |  | ✔ |  |
| [MONETIZED_USAGE_DAILY View](../collaboration/views/monetized-usage-daily-org.md) |  | ✔ |  |
| [PIPE_USAGE_HISTORY view](organization-usage/pipe_usage_history.md) |  | ✔ |  |
| [QUERY_ACCELERATION_HISTORY view](organization-usage/query_acceleration_history.md) |  | ✔ |  |
| [REPLICATION_GROUP_USAGE_HISTORY view](organization-usage/replication_group_usage_history.md) |  | ✔ |  |
| [REPLICATION_USAGE_HISTORY view](organization-usage/replication_usage_history.md) |  | ✔ |  |
| [SEARCH_OPTIMIZATION_HISTORY view](organization-usage/search_optimization_history.md) |  | ✔ |  |
| [STAGE_STORAGE_USAGE_HISTORY view](organization-usage/stage_storage_usage_history.md) |  | ✔ |  |
| [STORAGE_DAILY_HISTORY view](organization-usage/storage_daily_history.md) |  | ✔ |  |
| [WAREHOUSE_METERING_HISTORY view](organization-usage/warehouse_metering_history.md) |  | ✔ |  |

## CORE schema

The CORE_VIEWER SNOWFLAKE database role is granted to the PUBLIC role in all Snowflake accounts containing a shared SNOWFLAKE database.
The USAGE privilege is granted to all Snowflake-defined functions and bundles in the CORE schema.

### Budget class

The BUDGET_CREATOR Snowflake database role is granted the USAGE privilege on the SNOWFLAKE.CORE schema and the BUDGET class
in the schema. This grant allows users with the BUDGET_CREATOR role to create instances of the BUDGET class.

For more information, see [Create a custom role to create budgets](../user-guide/budgets/custom-budget.md).

### Tag objects

The CORE_VIEWER database role is granted the APPLY privilege on the
[classification system tags](../user-guide/classify-intro.md) SNOWFLAKE.CORE.PRIVACY_CATEGORY and
SNOWFLAKE.CORE.SEMANTIC_CATEGORY. These grants allow users with a role that is granted the CORE_VIEWER database role to assign these system
tags to columns.

## ALERT schema

The ALERT_VIEWER SNOWFLAKE database role is granted the USAGE privilege on the functions defined in this schema.

## ML schema

The ML_USER SNOWFLAKE database role is granted to the PUBLIC role in all Snowflake accounts that contain a shared
SNOWFLAKE database and allows customers to access and use [ML functions](../guides-overview-ml-functions.md).
Users must also have the USAGE privilege on the ML schema to call these functions.

## MONITORING schema

The MONITORING_VIEWER database role has the SELECT privilege on all views in the MONITORING schema.

The MONITORING_VIEWER database role is granted to the PUBLIC role in all Snowflake accounts containing a shared SNOWFLAKE
database.

## SNOWFLAKE.CLASSIFICATION_ADMIN database role

The SNOWFLAKE.CLASSIFICATION_ADMIN database role allows a data engineer or steward to create an instance of the [CLASSIFICATION_PROFILE](classes/classification_profile.md) class.
A classification profile is used to implement [sensitive data classification](../user-guide/classify-auto.md).

## SNOWFLAKE.CORTEX_AGENT_USER database role

You can use the SNOWFLAKE.CORTEX_AGENT_USER database role to grant your users access to Snowflake Cortex Agents API without granting access to other Cortex
features. Using the Cortex Agents API requires *either* the SNOWFLAKE.CORTEX_USER database role *or* the
SNOWFLAKE.CORTEX_AGENT_USER database role.

By default, the SNOWFLAKE.CORTEX_USER database role is granted to the PUBLIC role. For fine-grained access control, revoke access from the PUBLIC role and grant access to the SNOWFLAKE.CORTEX_AGENT_USER database role.
For more information, see [Set up access to the agent](../user-guide/snowflake-cortex/cortex-agents-manage.md).

## SNOWFLAKE.AI_FUNCTIONS_USER database role

The SNOWFLAKE.AI_FUNCTIONS_USER database role is used to grant customers access to Snowflake Cortex scalar AI
functions (all Cortex AI functions except the aggregate functions AI_AGG and AI_SUMMARIZE_AGG) without granting
access to Cortex services such as Cortex Agent, Cortex Analyst, Cortex Fine-tuning, or Cortex Search. Calling
scalar AI functions requires *either* the SNOWFLAKE.CORTEX_USER database role *or* the
SNOWFLAKE.AI_FUNCTIONS_USER database role.

By default, this role is not granted to any roles. If you want users to have access to scalar AI functions,
grant this database role to appropriate roles. For details, see [Cortex LLM Functions required privileges](../user-guide/snowflake-cortex/aisql.md).

## SNOWFLAKE.CORTEX_EMBED_USER database role

The SNOWFLAKE.CORTEX_EMBED_USER database role is used to grant customers access to Snowflake Cortex embedding functions AI_EMBED,
SNOWFLAKE.CORTEX.EMBED_768, and SNOWFLAKE.CORTEX_EMBED_TEXT_1024 without granting access to other Cortex
features. Calling these embedding functions requires *either* the SNOWFLAKE.CORTEX_USER database role *or* the
SNOWFLAKE.CORTEX_EMBED_USER database role. This role is not granted to any roles by default.

By default, this role is not granted to any roles. If you want users to have access to the embedding functions,
grant this database role to appropriate roles. For details, see [Cortex LLM Functions required privileges](../user-guide/snowflake-cortex/aisql.md)

## SNOWFLAKE.CORTEX_USER database role

This SNOWFLAKE.CORTEX_USER database role is used to grant customers access to Snowflake Cortex features.
By default, this role is granted to the PUBLIC role. The PUBLIC role is automatically granted
to all users and roles, so this allows all users in your account to use Snowflake Cortex LLM functions.

If you don’t want all users to have this privilege, you can revoke access from the PUBLIC role and grant access to specific roles.
For details, see [Cortex LLM Functions required privileges](../user-guide/snowflake-cortex/aisql.md).

## SNOWFLAKE.COPILOT_USER database role

The SNOWFLAKE.COPILOT_USER database role allows customers to access Cortex Code features in Snowsight. Initially, this database role
is granted to the PUBLIC role. The PUBLIC role is automatically granted to all users and roles, so this allows all users in your account
to use Cortex Code. If you want to limit access to Cortex Code features in Snowsight, you can revoke access from the PUBLIC role and grant access to
specific roles. For details, see [Access control requirements](../user-guide/cortex-code/cortex-code-snowsight.md).

## Using SNOWFLAKE database roles

Administrators can use the [GRANT DATABASE ROLE](sql/grant-database-role.md) to assign a SNOWFLAKE database role to another role,
which can then be granted to a user. This would allow the user to access a specific subset of views in the SNOWFLAKE database.

In the following example a role is created which can be used to view SNOWFLAKE database object metadata, and does the following:

1. Creates a custom role.
2. Grants the OBJECT_VIEWER role to the custom role.
3. Grants the custom role to a user.

To create and grant the custom role, do the following:

1. Create the `CAN_VIEWMD` role, using [CREATE ROLE](sql/create-role.md) that will be used to grant access to object metadata.

   Only users with the USERADMIN system role or higher, or another role with the CREATE ROLE privilege on the
   account, can create roles.

   ```sqlexample
   CREATE ROLE CAN_VIEWMD COMMENT = 'This role can view metadata per SNOWFLAKE database role definitions';
   ```
2. Grant the OBJECT_VIEWER role to the CAN_VIEWMD role.

   Only users with the OWNERSHIP role can grant SNOWFLAKE database roles. For additional information, refer to [GRANT DATABASE ROLE](sql/grant-database-role.md).

   ```sqlexample
   GRANT DATABASE ROLE OBJECT_VIEWER TO ROLE CAN_VIEWMD;
   ```
3. Assign `CAN_VIEWMD` role to user `smith`.

   Only users with the SECURITYADMIN role can grant roles to users. For additional options, refer to [GRANT ROLE](sql/grant-role.md).

   ```sqlexample
   GRANT ROLE CAN_VIEWMD TO USER smith;
   ```

---
title: Snowpark Container Services functions
source: https://docs.snowflake.com/en/sql-reference/functions-spcs.md
section: SQL General Reference
---

# Snowpark Container Services functions

Snowpark Container Services provides the following functions for use with services:

| Function Name | Notes |
| --- | --- |
| [SPCS_CANCEL_JOB](functions/spcs_cancel_job.md) | Cancels a Snowpark Container Services job. |
| [SPCS_WAIT_FOR](functions/spcs_wait_for.md) | Waits for the service to reach the specified state, with a timeout. |

---
title: SQL format models
source: https://docs.snowflake.com/en/sql-reference/sql-format-models.md
section: SQL General Reference
---

# SQL format models

In Snowflake, SQL format models (that is, literals containing format strings) are used to specify how numeric values are converted to text strings and vice versa. As such, they can be specified as arguments in
the [TO_CHAR , TO_VARCHAR](functions/to_char.md) and [TO_DECIMAL , TO_NUMBER , TO_NUMERIC](functions/to_decimal.md) conversion functions.

> **Note:**
>
> Snowflake also provides some limited SQL format model support for dates, times, and timestamps (see [Date & time functions](functions-date-time.md) and [Conversion functions](functions-conversion.md)). Full
> support for using SQL format models to format dates, times, and timestamps will be added in a future release.

## Components of a format model

A format model consists of a string of format elements and literals.

### Format elements

Format elements are sequences of digits and/or letters (mostly case-insensitive), and, in some cases, symbols. Format elements can be directly concatenated to each other.

Some format elements are used commonly across all format models for controlling printing and matching input text. Other format elements have specific uses based on the type of values they are used to cast
to/from. For more information, see the following sections in this topic:

* Format Modifiers and Generic Space Handling
* Fixed-position Format Elements
* Text-minimal Format Elements

### Format literals

Format literals are sequences that can consist of combinations of:

* Strings of arbitrary characters delimited by double quotes (a double quote is represented as two adjacent double quotes).
* One or more of the following symbols:

  | Symbol/Character | Notes |
  | --- | --- |
  | `.` (period) | In fixed numeric models, treated as a format element when following `0`, `9`, or `X`; otherwise preserved as-is. |
  | `,` (comma) | In numeric models, treated as a format element when following `0`, `9`, or `X`; otherwise preserved as-is. |
  | `;` (semi-colon) | Always preserved as-is. |
  | `:` (colon) | Always preserved as-is. |
  | `-` (minus sign) | Always preserved as-is. |
  | `=` (equal sign) | Always preserved as-is. |
  | `/` (forward slash) | Always preserved as-is. |
  | `(` (left parenthesis) | Always preserved as-is. |
  | `)` (right parenthesis) | Always preserved as-is. |

A literal is always printed as-is, exactly where it was located in the format model.

Here is a brief example of using a SQL format model to print the minus
sign after a number rather than before it. The `MI`
indicates where to put the minus sign if the number is a negative number.

> ```sqlexample
> select to_varchar(-123.45, '999.99MI') as EXAMPLE;
> ```

The output would look similar to `123.45-` rather than the default `-123.45`.

More examples are included at the end of this topic.

## Format modifiers and generic space handling

The following table lists special format elements that control printing and matching input text, and are common to all format models:

| Element | Description |
| --- | --- |
| `_` (underscore) | Nothing printed; optional space on input. |
| `FM` | Fill mode modifier; toggles between *compact* and *fill* modes for any elements following the modifier in the model. |
| `FX` | Exact match modifier; toggles between *lax* and *exact* match modes for any elements following the modifier in the model. |

> **Note:**
>
> The fill mode modifier has no effect on the text-minimal numeric format elements (`TM`, `TM9`, and `TME`).

### Printing output strings using the fill mode modifier

By default, the fill mode is set to *fill* and the `FM` fill mode modifier toggles it to *compact*; repeated use toggles it back to *fill*, etc.

In most cases, using *fill* mode on printing guarantees that format elements produce output of a fixed width by padding numbers on the left with leading zeros or spaces, and padding text with spaces on
the right. This guarantees that columnar output in fixed-width fonts will be aligned.

In *compact* mode, most format elements produce only minimum-width output (that is, leading zeros and spaces and trailing spaces are suppressed).

The format elements that don’t adhere to these rules are explicitly noted below.

The exact match modifier, `FX` does not affect printing; the underscore format element prints nothing.

### Parsing input strings using the modifiers

Parsing of input strings is affected by both the fill mode modifier, `FM`, and the exact match modifier `FX`. Initially:

* Fill mode is set to *fill* and `FM` toggles it to *compact* and back.
* Exact match mode is set to *lax* and `FX` toggles it to *exact* and back.

All string matching against format elements and literals during parsing is case-insensitive.

In *lax* mode, the first step of input parsing is skipping leading white space (a sequence of spaces, tabs, LF, CR, FF, and VT characters); the mode at the beginning input is strict if the first format
element is `FX`, and *lax* otherwise.

> **Note:**
>
> Only normal space characters are allowed within values to be parsed (that is, components cannot be on different lines, separated by tabs, etc.).

In the *lax* match mode, spaces within literals are matched against any non-empty input sequence of spaces; non-space characters are matched one-to-one. In the *exact* mode, all characters in a literal
must match the input characters one-to-one.

The numeric format elements are matched against the corresponding digit sequences:

* If both *fill* and *exact* modes are in effect, the number of digits must exactly correspond to the width of the corresponding numeric format elements (leading zeros are expected).
* If *compact* or *lax* mode is in effect, a matching input number must have, at most, the number of digits equal to the maximal width of the format element, and at least one digit; leading zeros are
  ignored.

The textual format elements are matched case-insensitively:

* If both *fill* and *exact* modes are in effect, the number of trailing spaces, up to the max width of the element, is expected.
* Otherwise, spaces after the variable-length textual elements are ignored in *lax* mode, and exact match to the actual word (without padding spaces) is expected in *exact* mode.

Finally, the trailing white space until the end of the input string is ignored if the current mode is *lax*.

Normally, both *lax* and *exact* modes do not allow matching spaces where spaces are not present in the format model or could not be generated by printing the content of format elements in *fill* mode.

> **Note:**
>
> This behavior differs from Oracle lax match semantics, where spaces can be inserted in between any two format elements — Snowflake uses stricter matching semantics to avoid excessive false matches
> during automatic data type recognition.

Places where spaces should be ignored if present in both *lax* and *exact* modes can be explicitly marked using the `_` (underscore) format element.

As a rule of thumb, a format in *exact* mode recognizes only input strings printed by the same format, while a format in *lax* mode recognizes input strings which were printed by the similar format with
any fill mode modifiers added or removed.

## Numeric format models

Numeric format models supports two types:

* Fixed-position (with explicit placement of digits where the `0`, `9`, or `X` format elements are placed)
* Text-minimal (`TM`, `TME`, and `TM9` format elements)

> **Note:**
>
> These two types cannot be intermingled within the same model.

### Fixed-position numeric formats

> **Note:**
>
> This section discusses non-negative fixed-position numbers; for more information about positioning of a number’s sign in the output for fixed-position numeric formats, see
> Sign Position for Fixed-Position Formats.

Fixed-position numbers are represented using digit elements, `0` or `9`. For example, `999` holds numbers from 1 to 3 decimal digits. The fractional part of the numbers is delimited using separator
elements, `.` (period) or `D`:

* `.` is always rendered as a period.
* To use a different character for the `D` elements, modify the input string to replace all periods with commas and all commas with periods before applying the cast function.

Normally, the leading zeros in the integer part and trailing zeros in the fractional part are replaced with spaces (except when the value of the integer part is zero, in which case it is rendered as a
single `0` character). To suppress this behavior use the `0` format element in place of `9`; the corresponding positions have `0` characters preserved. The format element `B`, when used before
the number, suppresses preserving the last `0` in the integer value (that is, if you use `B` and the value of the integer part of the number is zero, all digits are rendered as spaces).

The digit group separator `,` (comma) or `G` results in the corresponding group separator character being printed if the number is big enough so the digits are on the both sides of group separator.
An example of a format model useful for printing currency sums would be `999,999.00`.

When there are more digits in the integer part of the number than there are digit positions in the format, all digits are printed as `#` to indicate overflow.

The exponent element causes fixed-position numbers to be normalized so that the first digit in the integer part is 1 to 9 (unless the value of the number is zero, in which case the value of the exponent
is also zero). The `EE` element automatically picks the right number of digits in the exponent, and does not print the `+` sign, while `EEE`, `EEEE`, and `EEEEE` always print the `+` or `-`
sign for the exponent and the requested number of digits (leading zeros are not suppressed). Exponent overflow is indicated by `#` in place of digits.

The exponent indicators print either capital `E` or lowercase `e` depending on the case of the first letter in the format element.

The `X` format element works like `9`, except that hexadecimal digits `0-9A-F` are printed. Currently, hexadecimal fractions are not supported. Similar to `9`, `X` replaces leading zeros with
spaces. The `0` element, when used together with `X` prints hexadecimal digits without leading zero suppression (thus use `000X` to print hex numbers that always contain 4 digits).

Note that `X` prints hexadecimal digits with uppercase Latin letters, and lowercase `x` prints lowercase Latin letters. The hexadecimal `0` format element uses the case of the subsequent `X`
format element.

Normally, hexadecimal numbers are printed as unsigned, that is, negative numbers have all `1`’s in the most significant bit(s), but using the `X` element together with an explicit sign (`S` or `MI`)
causes the `-` sign to be printed along with the absolute value of the number.

Fixed-position numeric format models report overflow on special values (infinity or not-a-number) of floating point numbers.

#### Fixed-position format elements

The following table lists the supported elements for fixed-position formats. Note the following:

* The **Repeatable** column indicates whether an element can be repeated in a format model, otherwise the element can only be used once per format model.
* The **Case-sensitive** column indicates elements where the case of the element affects the format. For example:

  + `EE` processes exponents with an uppercase `E`.
  + `ee` processes exponents with a lowercase `e`.

  All the other elements are case-insensitive.

| Element | Repeatable | Case-sensitive | Description |
| --- | --- | --- | --- |
| `$` |  |  | Dollar sign printed before digits in the number (usually after the sign). |
| `%` |  |  | Percentage; the value is multiplied by 100 before formatting, and a literal `%` symbol is added at the location where the format element is in the input format. For example, `TO_CHAR(0.25, 'TM9%')` produces `25%`. |
| `.` (period) |  |  | Decimal fraction separator; always printed as a period. |
| `,` (comma) | ✔ |  | Digit group separator; printed as a comma or blank space. |
| `0` | ✔ |  | Position for a digit; leading/trailing zeros are explicitly printed. |
| `9` | ✔ |  | Position for a digit; leading/trailing zeros are replaced with blank spaces. |
| `B` |  |  | Forces representing a zero value as a space in the subsequent number. |
| `D` |  |  | Decimal fraction separator; alternative for `.` element (see description above). |
| `EE` |  | ✔ | Variable-width exponent, from 2 to 7 characters, with no `+` sign for integers (for example, `E0`, `E21`, `E200`, `E-200`). |
| `EEE` |  | ✔ | Fixed-width exponent (3 characters); range covers from `E-9` to `E+9`. |
| `EEEE` |  | ✔ | Fixed-width exponent (4 characters); range covers from `E-99` to `E+99`. |
| `EEEEE` |  | ✔ | Fixed-width exponent (5 characters); range covers from `E-999` to `E+999`. |
| `EEEEEE` |  | ✔ | Fixed-width exponent (6 characters); range covers from `E-9999` to `E+9999`. |
| `EEEEEEE` |  | ✔ | Fixed-width exponent (7 characters); range covers from `E-16383` to `E+16384`. |
| `G` | ✔ |  | Digit group separator; alternative for `,` (see description above). |
| `MI` |  |  | Explicit numeric sign place holder; prints a space for positive numbers or a `-` sign for negative numbers. |
| `S` |  |  | Explicit numeric sign place holder; prints a `+` sign for positive numbers or a `-` sign for negative numbers.. |
| `X` | ✔ | ✔ | Hexadecimal digit. |

#### Sign position for fixed-position formats

By default, fixed-position formats always reserve a space for the number’s sign:

* For non-negative numbers, the default blank space is printed before the first digit.
* For negative numbers, the default blank space and `-` sign are printed before the first digit (or decimal, when the `B` format element is used for fractional numbers).

However, the `S`, `MI`, and `$` format elements can be used to explicitly specify where the sign and/or blank space for the number are located.

For example (underscores, `_`, are used in these examples to indicate where blank spaces are inserted):

| Format Model | `12` prints as: | `-7` prints as: |
| --- | --- | --- |
| `99` | `_12` | `_-7` |
| `S99` | `+12` | `_-7` |
| `99S` | `12+` | `_7-` |
| `MI99` | `_12` | `-_7` |
| `99MI` | `12_` | `_7-` |
| `$99` | `_$12` | `_-$7` |

#### Printing numbers using fixed-position formats and the fill mode modifier

In *fill* mode, the variable-length format elements, such as `EE` and `MI`, are space-padded on the right.

In *compact* mode, all spaces resulting from numeric format elements, including the variable-length elements, are removed, so the resulting strings are shorter and no longer aligned. For
example (note the lack of blank spaces):

| Format Model | `12` prints as: | `-7` prints as: |
| --- | --- | --- |
| `FM99` | `12` | `-7` |

#### Parsing numbers using fixed-position formats and the modifiers

Parsing strings containing numbers is affected by both the `FX` and `FM` modifiers:

* In *lax* mode:

  + Digit group separators are optional (that is, numbers with or without group separators match — though numbers of digits between respective group separators must match); it also permits `+` as a valid
    match for the `MI` format element.
  + The *lax* mode does not disable requirement that digits (even leading or trailing zeros) must be present to match `0` format elements.
  + Spaces between the leading sign and the first digit are allowed in *lax* mode.
  + Also, in *lax* mode, all the exponent format elements (`EE`, `EEE`, `EEEE`, and `EEEEE`) are treated as `EE`, and match an exponent specification with 1 to 3 digits and optional `+` or `-`
    sign.
  + Use `B` to allow matching numbers with no digits in the integer part. The decimal dot before an empty fractional part is optional in *lax* mode.
* In *exact* mode:

  + The number must have a proper number of spaces in place of omitted digits to match the format (that is, in *fill* mode, it is spaces and, in *compact* mode, it is a lack of spaces).
  + Omitting group separators is not allowed under *exact* mode, and `MI` won’t match the `+` sign.
  + The exponent format elements other than `EE` must match the sign place and the exact number of digits required by the format element.
  + The decimal dot in the place specified by the format model is mandatory.

### Text-minimal numeric formats

While fixed-position numeric format models always explicitly specify the number of digits, the text-minimal format elements use a minimal number of digits based on the value of the number. The `TM*` format
elements always produce variable-length output with no spaces, regardless of the fill mode modifier (*fill* or *compact*).

* `TM9` prints the number as an integer or decimal fraction, based on the value of the number. Any decimal fixed-point number value is printed precisely with the number of digits in the fractional part
  determined by the scale of the number (trailing zeros are preserved in *fill* mode).
* For floating-point numbers, `TM9` picks the number of fractional digits based on the number’s exponent (note that precise binary to decimal fraction conversion is not possible). If the floating-point
  number’s magnitude is too large, causing the positional notation to be too long, it switches to scientific notation (see `TME` below). If the floating-point number is too small, `TM9` prints zero.
* `TME` prints the number in scientific notation, that is, with exponent (same as `EE`) and one digit in the integer position of the fractional part. The case of the exponent indicator (`E` or `e`)
  matches the case of the first letter (`T` or `t`) in the format element.
* `TM` chooses either `TM9` or `TME` depending on the magnitude of the number, to minimize the length of the text while preserving precision.
* `TM9` supports parameters with the following syntax:

  ```sqlsyntax
  TM9(<number_of_decimal_digits>,<group_size>)
  ```

  Where:

  + `number_of_decimal_digits` is the maximum number of digits. Specify an integer to limit the number of
    digits or `ALL` for an unlimited number of digits.
  + `group_size` is the number of digits in a group.

  For example, the following `TM9` format produces a number with up to six decimal digits and the integral part
  formatted in groups of three digits:

  ```sqlexample
  TM9(6,3)
  ```

  The following `TM9` format produces a number with an unlimited number of digits and the integral part
  formatted in groups of three digits:

  ```sqlexample
  TM9(ALL,3)
  ```

  > **Note:**
  >
  > Currently, no spaces are allowed in the parameter specification. For example, `TM9(6,3)` is allowed, but
  > `TM9( 6, 3 )` or `TM9(6, 3)` returns an error.

#### Text-minimal format elements

The following table lists the supported elements for text-minimal formats. Note the following:

* No elements can be repeated within a text-minimal format string.
* The **Case-sensitive** column indicates elements where the case of the element affects the format. For example:

  + `TME` processes exponents with an uppercase `E`.
  + `tme` processes exponents with a lowercase `e`.

  All the other elements are case-insensitive.

| Element | Repeatable | Case-sensitive | Description |
| --- | --- | --- | --- |
| `$` |  |  | Dollar sign is inserted before digits in the number (usually after sign). |
| `TM` |  | ✔ | Text-minimal number, either `TM9` or `TME`, whichever is shorter. |
| `TM9` |  | ✔ | Text-minimal number in positional notation. |
| `TME` |  | ✔ | Text-minimal number in scientific notation (with exponent). |
| `B` |  |  | Forces representing a zero value as a space in the subsequent number. |
| `MI` |  |  | Explicit numeric sign place holder; becomes either `-` or a space. |
| `S` |  |  | Explicit numeric sign place holder; becomes either `-` or `+`. |

#### Sign position for text-minimal formats

By default, the sign for text-minimal formats is either:

* `-` for negative numbers, prepended to the number.
* Omitted for non-negative numbers.

The `$`, `S`, and `MI` elements have the same effect as with fixed-position format models. Note that floating-point numbers have two distinct zero values (`+0.` and `-0.`) which represent
infinitesimal positive and negative values, respectively.

#### Parsing numbers using text-minimal formats and the modifiers

Parsing with the text-minimal format models is not affected by the `FX` or `FM` modifiers; however, the explicit sign elements, `S` and `MI` are affected, as described above.

`TM9` matches any decimal number (integer or fractional) in positional notation; it does not match numbers in scientific notation (that is, with exponent). Conversely:

* `TME` matches only scientific notation.
* `TM` matches both.

Numbers matched by text-minimal elements cannot have spaces or digit group separators within them.

Letters within exponent elements and hexadecimal digits are always matched without regard to case (lower or upper).

## Alternate, automatic, and default formats

| Element | Description |
| --- | --- |
| `|` (pipe) | Separates alternative formats. |
| `AUTO` | Automatic format(s). |

When parsing strings, it is possible to specify multiple alternative formats by separating format strings with the `|` character. The string is successfully parsed if it matches any one format. If the
input string matches multiple formats, any format will be used for the conversion.

An entire format used for parsing can be replaced with the keyword `AUTO`; this inserts one or more alternative automatic formats depending on the type of the source or result value. Adding a custom format
to the automatic format(s) can be done using `AUTO` as one of the alternatives.

Default formats are used when formats are not explicitly specified in cast functions, for parsing input values (that is, in CSV files), and for printing results.

### Default formats for printing

The following table lists the default formats for printing:

| SQL Data Type | Parameter | Default Format |
| --- | --- | --- |
| DECIMAL | *none* | `TM9` |
| DOUBLE | *none* | `TME` |

### Default formats for parsing

The following table lists the default formats for parsing:

| SQL Data Type | Parameter | Default `AUTO` Format |
| --- | --- | --- |
| DECIMAL | *None* | `TM9` |
| DOUBLE | *None* | `TME` |

The list of formats used for automatic optimistic string conversion (that is, for strings which are automatically recognized as numeric) is the union of all the formats in the above table of default input
formats.

## Examples

### Output examples

This example shows how to display numbers with leading zeros:

> ```sqlexample
> create table sample_numbers (f float);
> insert into sample_numbers (f) values (1.2);
> insert into sample_numbers (f) values (123.456);
> insert into sample_numbers (f) values (1234.56);
> insert into sample_numbers (f) values (-123456.789);
> select to_varchar(f, '999,999.999'), to_varchar(f, 'S000,000.000') from sample_numbers;
> ```

The output will look similar to:

> ```sqlexample
> +------------------------------+-------------------------------+
> | TO_VARCHAR(F, '999,999.999') | TO_VARCHAR(F, 'S000,000.000') |
> +==============================+===============================+
> |        1.2                   | +000,001.200                  |
> +------------------------------+-------------------------------+
> |      123.456                 | +000,123.456                  |
> +------------------------------+-------------------------------+
> |    1,234.56                  | +001,234.560                  |
> +------------------------------+-------------------------------+
> | -123,456.789                 | -123,456.789                  |
> +------------------------------+-------------------------------+
> ```

You don’t need leading zeros in order to align numbers. The default fill mode
is “fill”, which means that leading blanks are used to align numbers based
on the positions of the decimal points.

> ```sqlexample
> select to_varchar(f, '999,999.999'), to_varchar(f, 'S999,999.999') from sample_numbers;
> ```

The output will look similar to:

> ```sqlexample
> +------------------------------+-------------------------------+
> | TO_VARCHAR(F, '999,999.999') | TO_VARCHAR(F, 'S999,999.999') |
> +==============================+===============================+
> |        1.2                   |       +1.2                    |
> +------------------------------+-------------------------------+
> |      123.456                 |     +123.456                  |
> +------------------------------+-------------------------------+
> |    1,234.56                  |   +1,234.56                   |
> +------------------------------+-------------------------------+
> | -123,456.789                 | -123,456.789                  |
> +------------------------------+-------------------------------+
> ```

This example shows what happens if you use the FM (Fill Mode) modifier to
switch from “fill” mode to “compact” mode, that is, to remove leading characters
that would align the numbers:

> ```sqlexample
> select  to_varchar(f, '999,999.999'), to_varchar(f, 'FM999,999.999') from sample_numbers;
> ```

The output will look similar to:

> ```sqlexample
> +------------------------------+--------------------------------+
> | TO_VARCHAR(F, '999,999.999') | TO_VARCHAR(F, 'FM999,999.999') |
> +==============================+================================+
> |        1.2                   | 1.2                            |
> +------------------------------+--------------------------------+
> |      123.456                 | 123.456                        |
> +------------------------------+--------------------------------+
> |    1,234.56                  | 1,234.56                       |
> +------------------------------+--------------------------------+
> | -123,456.789                 | -123,456.789                   |
> +------------------------------+--------------------------------+
> ```

This example shows how to display numbers in exponential notation:

> ```sqlexample
> select to_char(1234, '9d999EE'), 'will look like', '1.234E3';
> ```

The output will look similar to:

> ```sqlexample
> +--------------------------+------------------+-----------+
> | TO_CHAR(1234, '9D999EE') | 'WILL LOOK LIKE' | '1.234E3' |
> +==========================+==================+===========+
> | 1.234E3                  |  will look like  |  1.234E3  |
> +--------------------------+------------------+-----------+
> ```

This shows how to include literals in the output. The literal portions
are enclosed within double quotes (which, in turn, are inside the
single quotes that delimit the string).

> ```sqlexample
> select to_char(12, '">"99"<"');
> ```

The output will look similar to:

> ```sqlexample
> +-------+
> | > 12< |
> +-------+
> ```

### Input examples

These examples demonstrate the use of format models for inputs.

> The following example shows some simple input operations, with an emphasis
> on showing the difference between using “0” and “9” to specify format of digits.
>
> The digit “9” as a formatter will accept blanks or “missing” leading digits.
> The digit “0” as a formatter will not accept blanks or missing leading zeros.
>
> ```sqlexample
> -- All of the following convert the input to the number 12,345.67.
> SELECT TO_NUMBER('012,345.67', '999,999.99', 8, 2);
> SELECT TO_NUMBER('12,345.67', '999,999.99', 8, 2);
> SELECT TO_NUMBER(' 12,345.67', '999,999.99', 8, 2);
> -- The first of the following works, but the others will not convert.
> -- (They are not supposed to convert, so "failure" is correct.)
> SELECT TO_NUMBER('012,345.67', '000,000.00', 8, 2);
> SELECT TO_NUMBER('12,345.67', '000,000.00', 8, 2);
> SELECT TO_NUMBER(' 12,345.67', '000,000.00', 8, 2);
> ```
>
> This shows how to accept either of two numeric formats
> (`-###` or `###-`).
>
> ```sqlexample
> -- Create the table and insert data.
> create table format1 (v varchar, i integer);
> insert into format1 (v) values ('-101');
> insert into format1 (v) values ('102-');
> insert into format1 (v) values ('103');
>
> -- Try to convert varchar to integer without a
> -- format model.  This fails (as expected)
> -- with a message similar to:
> --    "Numeric value '102-' is not recognized"
> update format1 set i = TO_NUMBER(v);
>
> -- Now try again with a format specifier that allows the minus sign
> -- to be at either the beginning or the end of the number.
> -- Note the use of the vertical bar ("|") to indicate that
> -- either format is acceptable.
> update format1 set i = TO_NUMBER(v, 'MI999|999MI');
> select i from format1;
> ```

---
title: SQL variables
source: https://docs.snowflake.com/en/sql-reference/session-variables.md
section: SQL General Reference
---

# SQL variables

You can define and use SQL variables in sessions in Snowflake.

## Overview

Snowflake supports SQL variables declared by the user. They have many uses, such as storing application-specific environment settings.

### Variable identifiers

SQL variables are globally identified using case-insensitive names.

### Variable DDL

Snowflake provides the following DDL commands for using SQL variables:

* [SET](sql/set.md)
* [UNSET](sql/unset.md)
* [SHOW VARIABLES](sql/show-variables.md)

## Initializing variables

You can set variables by executing the SQL statement [SET](sql/set.md) or by setting the variables in the connection
string when you connect to Snowflake.

The size of string or binary variables is limited to 256 bytes.

### Using SQL to initialize variables in a session

You can initialize variables in SQL using the [SET](sql/set.md) command. The data type of the variable is derived from the
data type of the result of the evaluated expression. The following examples initialize variables:

```sqlexample
SET my_variable1 = 10;
SET my_variable2 = 'example';
```

You can initialize variables by using queries that return a single result. The following examples initialize variables by
using queries:

```sqlexample
SET cust_last_name = (SELECT lname FROM customers WHERE customer_id=100);
SET timestamp_variable = (SELECT CURRENT_TIMESTAMP());
```

You can initialize multiple variables in the same statement, thereby reducing the number of round-trip communications with the server.
The following examples initialize multiple variables:

```sqlexample
SET (var1, var2, var3) = (10, 20, 30);
SET (current_user, current_warehouse) = ((SELECT CURRENT_USER()), (SELECT CURRENT_WAREHOUSE()));
```

### Setting variables on connection

In addition to using [SET](sql/set.md) to set variables within a session, you can pass variables as arguments in the connection
string used to initialize a session in Snowflake. This option is especially useful when using tools where the specification of the connection string
is the only customization possible.

For example, using the Snowflake JDBC driver, you can set additional connection properties that are interpreted as parameters.
The JDBC API requires SQL variables to be strings.

```java
// Build connection properties
Properties properties = new Properties();

// Required connection properties
properties.put("user"    ,  "jsmith"      );
properties.put("password",  "mypassword");
properties.put("account" ,  "myaccount");

// Set some additional variables.
properties.put("$variable_1", "some example");
properties.put("$variable_2", "1"           );

// Create a new connection
String connectStr = "jdbc:snowflake://localhost:8080";

// Open a connection under the snowflake account and enable variable support
Connection con = DriverManager.getConnection(connectStr, properties);
```

## Using variables in SQL

Variables can be used in Snowflake anywhere a literal constant is allowed, except where noted in the documentation. To distinguish them
from bind values and column names, all variables must be prefixed with a `$` sign.

For example:

```sqlexample
SET (min, max)=(40, 70);

SELECT $min;

SELECT AVG(salary) FROM emp WHERE age BETWEEN $min AND $max;
```

> **Note:**
>
> Because the `$` sign is the prefix used to identify variables in SQL statements, it is treated as a special character when used
> in identifiers. Identifiers (database names, table names, column names, and so on) can’t start with special characters unless the entire
> name is enclosed in double quotes. For more information, see [Object identifiers](identifiers.md).

Variables can also contain identifier names, such as table names. To use a variable as an identifier, you must
wrap it inside `IDENTIFIER()` (for example, `IDENTIFIER($my_variable)`). Some examples are below:

```sqlexample
SET my_table_name='table1';
```

```sqlexample
CREATE TABLE IDENTIFIER($my_table_name) (i INTEGER);
INSERT INTO IDENTIFIER($my_table_name) (i) VALUES (42);
```

```sqlexample
SELECT * FROM IDENTIFIER($my_table_name);
```

```output
+----+
|  I |
|----|
| 42 |
+----+
```

In the context of a FROM clause, you can wrap the variable name in `TABLE()`, as shown below:

```sqlexample
SELECT * FROM TABLE($my_table_name);
```

```output
+----+
|  I |
|----|
| 42 |
+----+
```

```sqlexample
DROP TABLE IDENTIFIER($my_table_name);
```

For more information about `IDENTIFIER()`, see [Literals and variables as identifiers with IDENTIFIER() syntax](identifier-literal.md).

### Viewing variables for the session

To see all the variables defined in the current session, use the [SHOW VARIABLES](sql/show-variables.md) command:

```sqlexample
SET (min, max)=(40, 70);
```

```output
+----------------------------------+
| status                           |
|----------------------------------|
| Statement executed successfully. |
+----------------------------------+
```

```sqlexample
SHOW VARIABLES;
```

```output
+----------------+-------------------------------+-------------------------------+------+-------+-------+---------+
|     session_id | created_on                    | updated_on                    | name | value | type  | comment |
|----------------+-------------------------------+-------------------------------+------+-------+-------+---------|
| 10363773891062 | 2024-06-28 10:09:57.990 -0700 | 2024-06-28 10:09:58.032 -0700 | MAX  | 70    | fixed |         |
| 10363773891062 | 2024-06-28 10:09:57.990 -0700 | 2024-06-28 10:09:58.021 -0700 | MIN  | 40    | fixed |         |
+----------------+-------------------------------+-------------------------------+------+-------+-------+---------+
```

### Session variable functions

The following convenience functions are provided for manipulating session variables to support compatibility with other database systems
and to issue SQL through tools that do not support the `$` syntax for accessing variables. All of these functions accept and
return session variable values as strings:

> * SYS_CONTEXT and SET_SYS_CONTEXT
> * SESSION_CONTEXT and SET_SESSION_CONTEXT
> * [GETVARIABLE](functions/getvariable.md) and SETVARIABLE

Here are examples of using GETVARIABLE. First, define a variable using SET:

```sqlexample
SET var_artist_name = 'Jackson Browne';
```

```output
+----------------------------------+
| status                           |
+----------------------------------+
| Statement executed successfully. |
+----------------------------------+
```

Return the variable value:

```sqlexample
SELECT GETVARIABLE('var_artist_name');
```

In this example, the output is NULL because Snowflake stores variables with all uppercase letters.

Update the casing:

```sqlexample
SELECT GETVARIABLE('VAR_ARTIST_NAME');
```

```output
+--------------------------------+
| GETVARIABLE('VAR_ARTIST_NAME') |
+--------------------------------+
| Jackson Browne                 |
+--------------------------------+
```

You can use the variable name in a WHERE clause, for example:

```sqlexample
SELECT album_title
  FROM albums
  WHERE artist = $var_artist_name;
```

## Removing variables

SQL variables are private to a session. When a Snowflake session is closed, all variables created during the session are dropped. This
means that no one can access user-defined variables that have been set in another session, and when the session is closed, these variables
expire.

In addition, variables can be explicitly dropped using the [UNSET](sql/unset.md) command.

For example:

```sqlexample
UNSET my_variable;
```

---
title: Step 1: Create an Azure AD app for the Azure functions app in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-template-apps.md
section: SQL General Reference
---

# Step 1: Create an Azure AD app for the Azure functions app in the Portal

This topic provides detailed instructions for creating an Azure AD app for the Azure Functions app.

## Previous step

[Planning an external function for Azure](external-functions-creating-azure-planning.md)

## Create an Azure AD app

1. If you haven’t already, log into the Azure Portal.
2. Search for the App registrations page.
3. Click on New registration, which takes you to the Register an application screen.
4. Enter a unique name for your Azure AD app.
5. Record the name of the Azure AD app in the `Azure Function AD app registration name` field in your tracking worksheet.
6. Under Supported account types, choose
   Accounts in this organizational directory only (Default Directory only - Single tenant).
7. Click on Register.

   This takes you to the Home » App registrations screen and shows the newly created Azure AD
   app.
8. Record the Application (client) ID from the Azure AD app you just created in the `Azure Function AD Application ID` field in
   your tracking worksheet. This ID should be in the form of a UUID.

## Next step

[Step 2: Use the template to create the remote service (Azure function) and proxy service (API Management service)](external-functions-creating-azure-template-services.md)

---
title: Step 1: Create the remote service (AWS Lambda function) in the Management Console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-ui-remote-service.md
section: SQL General Reference
---

# Step 1: Create the remote service (AWS Lambda function) in the Management Console

This topic provides detailed instructions for creating an AWS Lambda Function for use as the remote service for your external
function.

This topic includes code for a sample Lambda Function that you can use as-is to create your first external function, or that
you can use as a starting point for a custom Lambda Function.

## Previous step

[Planning an external function for AWS](external-functions-creating-aws-planning.md)

## Introduction

There are multiple ways to create a remote service on AWS. This topic shows one way, which is to create a Lambda Function
to be used as the remote service.

This tutorial describes two sample Lambda Functions, each of which is written in Python.

More details are provided in Snowflake Sample Functions (in this topic).

## Understanding Lambda function input and output

In order for Snowflake to send data to and receive data from your remote service, your remote service must accept
and return data in JSON format.

Platform-independent information about remote service input and output is in
[Remote Service Input and Output Data Formats](external-functions-data-format.md) .

This section provides details that are specific to AWS Lambda Functions.

### Language-independent input and output via JSON

The information in this section applies to all Lambda Functions used as remote services for Snowflake external functions.
For platform-specific information about external function input and output, see the sub-section(s) below.

On AWS, the convention for an HTTP-compatible service is to return the body inside a JSON object that also includes the HTTP status
code. The JSON for a typical return value from an AWS Lambda function looks like the following:

> ```sqljson
> {
> "statusCode": <http_status_code>,
> "body":
>         {
>             "data":
>                   [
>                       [ 0, <value> ],
>                       [ 1, <value> ]
>                       ...
>                   ]
>         }
> }
> ```

The structure of the JSON input is similar to the preceding, but includes additional key-value pairs that you are unlikely to
need, and excludes the `statusCode`.

### Python-specific Lambda function input

The following material applies to Snowflake-compatible Python-language Lambda Functions, including the sample Lambda Functions
in this tutorial. This information is in addition to the
language-independent rules for input and output.

A Snowflake-compatible Python-Language Lambda Function receives two parameters, `event` and `context`. Simple
external functions typically need only the `event` parameter.

The `event` parameter includes many sub-fields, one of which is `body`. The body is a JSON-compatible string that
contains a dictionary.

The dictionary includes a key named `data`; the corresponding value for `data` is an array. That array contains
the rows passed by Snowflake.

Each row is represented by an array that is nested inside the `data` array.

(Because AWS Lambda conveniently processes the HTTP POST request sent by Snowflake, extracts the body, and passes the body inside
the event parameter, the example functions provided by Snowflake do not need to parse the entire HTTP POST request.)

The [Sample synchronous Lambda function](external-functions-creating-aws-sample-synchronous.md) includes code showing how to read the `event` parameter.

## Choose the code for the Lambda function

### Snowflake sample functions

Snowflake supplies two sample functions:

* The shorter example is [synchronous](external-functions-implementation.md). If you are new to
  external functions or Lambda Functions, Snowflake recommends that you use this to create your first sample external function.

  Experienced users can also copy and modify it to use as a starting point for custom remote services.

  The code is available in [sample synchronous Lambda Function](external-functions-creating-aws-sample-synchronous.md).
* The other example is [asynchronous](external-functions-implementation.md).

  This sample is intended primarily as a sample for building customized asynchronous remote services.

  The code is available in [sample asynchronous Lambda Function](external-functions-creating-aws-sample-asynchronous.md).

### Custom Lambda function

You can write your own Lambda Function from scratch, or you can use one of the functions described in
Snowflake sample functions (in this topic) as a starting point.

If you have an existing remote service that you want to use, then you can skip most of the instructions in this step of the
tutorial. Instead, do the following:

1. Record your AWS Account ID in the `Your AWS Account ID` field in the tracking worksheet.
2. Record the remote service’s Lambda Function name in the `Lambda Function Name` field in the tracking worksheet.
3. Go to [Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console](external-functions-creating-aws-ui-proxy-service.md).

## Create a Lambda function

To create an AWS Lambda Function, follow the steps below.

> **Note:**
>
> Although these steps show you how to create the sample remote services provided by Snowflake, you can use these steps as a model
> for creating your own customized remote service. If you create a custom Lambda Function, then modify the steps below as
> appropriate (e.g. choose the appropriate programming language for your remote service’s code).

1. Log into the AWS Management Console, if you haven’t already.
2. If you have not already recorded your AWS account ID in the worksheet field named `Your AWS Account ID`, record it now.

   If you need to look up your AWS account ID, follow the
   [AWS instructions](https://docs.aws.amazon.com/IAM/latest/UserGuide/console_account-alias.html#FindingYourAWSId).
3. Select Lambda.
4. Select Create function.
5. Enter a function name.

   Record this name in the `Lambda Function Name` field in the worksheet.
6. Select the programming language to use. If you are using one of the sample Python functions provided by Snowflake,
   then choose Python 3.10.
7. Choose or create an execution role for this function.

   Select the appropriate option(s), typically Create a new role with basic Lambda permissions.

   (This role is separate from your cloud account role and separate from your Snowflake role(s).)
8. Click on the Create Function button.
9. In the lambda_function tab, enter the code for the function.

   If you have not already written your own function, you can use the following example provided by Snowflake:

   > [sample synchronous code](external-functions-creating-aws-sample-synchronous.md) or the

   > **Tip:**
   >
   > If you cannot paste into the edit window, try double-clicking on the function’s file name to enable editing.
10. Click on the Deploy button to deploy the function.

## Test the Lambda function

Click the down-arrow beside the Test button and select Configure test event. In the Event name field, type test.

For the sample synchronous Python function provided by Snowflake, use the following test data (replace any default
data with the data below):

> ```sqljson
> {
>   "body":
>     "{ \"data\": [ [ 0, 43, \"page\" ], [ 1, 42, \"life, the universe, and everything\" ] ] }"
> }
> ```

Click Create, and then click Test.

The execution results should be similar to:

> ```sqljson
> {
>   "statusCode": 200,
>   "body": "{\"data\": [[0, [\"Echoing inputs:\", 43, \"page\"]], [1, [\"Echoing inputs:\", 42, \"life, the universe, and everything\"]]]}"
> }
> ```

You now have an AWS Lambda function that you can use as the remote service for your external function.

## Next step

[Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console](external-functions-creating-aws-ui-proxy-service.md)

---
title: Step 1: Create the remote service (Azure function) in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-ui-remote-service.md
section: SQL General Reference
---

# Step 1: Create the remote service (Azure function) in the Portal

This topic provides detailed instructions for creating an Azure Function for use as the remote service for your external function.

## Previous step

[Planning an external function for Azure](external-functions-creating-azure-planning.md)

## Create the Azure functions app

There are multiple possible ways to create a remote service. This section shows how to create a remote service that is implemented
as a JavaScript function.

This external function is [synchronous](external-functions-implementation.md).
For information about creating an
[asynchronous](external-functions-implementation.md) external function, see
[Creating an Asynchronous Function on Azure](external-functions-creating-azure-asynchronous.md).

Create an Azure Functions app to serve as a container for the function(s) that you create later:

1. If you haven’t already, log into the Azure Portal.
2. Create the Azure Functions app by following the instructions in the Microsoft documentation:
   [Azure Functions App](https://docs.microsoft.com/en-us/azure/azure-functions/functions-create-function-app-portal).

   As you follow the instructions, remember the following:

   > * When you enter a name the Function App Name field, also record the name in the `Azure Function app name` field in
   >   your tracking worksheet.
   > * When asked to choose how to Publish, choose Code.
   > * Some restrictions apply when creating multiple apps in the same resource group. For details, see the Microsoft documentation:
   >   [Azure app service](https://docs.microsoft.com/en-us/azure/app-service/containers/app-service-linux-intro#limitations).

   Snowflake provides a sample “echo” function in Node.js. To use this sample function to get started:

   > * When asked for the `Runtime stack`, select Node.js.
   > * When asked for the version of Node.js, select version 12.
   > * When asked which OS to run the function on, choose “Windows” or “Linux”.
   >
   >   + If you are only creating a demo function, Snowflake recommends selecting “Windows”.
   >
   >     Linux Function Apps cannot be edited in the Azure Portal. Users must publish the code through the Visual Studio Code interface.
   >   + If you want to run your Azure Function on Linux rather than Microsoft Windows, see the Microsoft documentation:
   >     [Azure Functions](https://docs.microsoft.com/en-us/azure/azure-functions/functions-create-first-function-vs-code?pivots=programming-language-javascript).
   >
   >     Azure AD authentication is not available on Linux when using the “Consumption” pricing plan for Azure Functions.
   >     You must use an “App Service” pricing plan or “Premium” pricing plan in order to authenticate with Azure AD.
   >
   >     For more details, see the Microsoft documentation:
   >     [Azure AD](https://docs.microsoft.com/en-us/azure/app-service/configure-authentication-provider-aad).

## Create an HTTP-triggered Azure function

After you create your Azure Functions app (container), you need to create an Azure Function in the container. This function acts as the
remote service.

Microsoft allows Azure Functions to be called (“triggered”) different ways. A Snowflake external function invokes a remote service via an
HTTP POST command, so the Azure Function you create must be an “HTTP-triggered function”.

> **Tip:**
>
> You can use the instructions provided by Microsoft to create the HTTP-triggered function:
>
> > * [Create an app portal](https://docs.microsoft.com/en-us/azure/azure-functions/functions-create-function-app-portal)
> > * [Create an Azure function](https://docs.microsoft.com/en-us/azure/azure-functions/functions-create-first-azure-function#create-function)
>
> However, Snowflake provides custom instructions that include additional details and sample code, and suggest a different authorization
> level than Microsoft. We suggest using the custom instructions in place of Microsoft’s instructions.

### Create the function

To perform the tasks described in this section, you should be in the Function App screen in the Azure Portal. The name of your
Azure Functions app should be displayed, typically near the upper left corner of the screen.

To create the HTTP-triggered function:

1. In the left-hand side menu tree, look for the section titled Functions. In that section, click on the item
   labeled Functions to add a function.
2. Click on the + Add button.
3. Select HTTP trigger from the list of potential triggers on the right.
4. Enter the name to use for your HTTP-triggered function.

   Record this name in the `HTTP-Triggered Function name` field in your tracking worksheet.
5. Enter the Authorization level.

   Snowflake recommends choosing Function as the authorization level.

   For more information about possible authorization levels, see the Microsoft documentation:
   [HTTP-triggered functions](https://docs.microsoft.com/en-us/azure/azure-functions/functions-bindings-http-webhook-trigger?tabs=csharp#configuration).
6. Click on the button titled Add.

   This takes you to a screen that shows the function name and, below that, the word Function.
7. In the tree menu on the left-hand side, click on Code + Test.
8. Replace the default code with your own code.

   Sample code for a JavaScript “echo” function is provided below.

   The function reads each row, then copies the row to the output (results). The row number is also included in the output. The output is
   returned as part of a multi-level dictionary.

   This function accepts and returns data in the same format (JSON) that Snowflake sends and reads. For more details about data
   formats, see [Remote Service Input and Output Data Formats](external-functions-data-format.md) .

   Normally, the function returns HTTP code 200. If no rows are passed to the function (i.e. if the request body is empty), the function
   returns error code 400.

   > ```javascript
   > module.exports = async function(context, request) {
   >     context.log('JavaScript HTTP trigger function processed a request.');
   >
   >     if (request.body) {
   >         var rows = request.body.data;
   >         var results = [];
   >         rows.forEach(row => {
   >             results.push([row[0], row]);
   >         });
   >
   >         results = {data: results}
   >         context.res = {
   >             status: 200,
   >             body: JSON.stringify(results)
   >         };
   >    }
   >    else {
   >        context.res = {
   >            status: 400,
   >            body: "Please pass data in the request body."
   >        };
   >    }
   > };
   > ```
9. Click on the Save button above the code.

### Test the function

To test the HTTP-triggered Azure Function you just created, paste the following sample data into the Body field and click on
the Test/Run button:

> ```none
> {
>     "data": [ [ 0, 43, "page" ], [ 1, 42, "life, the universe, and everything" ] ]
> }
> ```

The content of the output should be similar to the following:

> ```none
> { "data":
>     [
>         [ 0, [ 0, 43, "page" ] ],
>         [ 1, [ 1, 42, "life, the universe, and everything" ]  ]
>     ]
> }
> ```

Note that the formatting might be different from what is shown above.

## Set the authorization requirements for the Azure functions app

When an external function is called, Snowflake sends an HTTP POST command to the proxy service (e.g. the Azure API Management service),
which relays the POST to the remote service (e.g. the Azure Function).

Each of these two steps should have authorization requirements, so you typically specify:

* The authorization needed to call the API Management service.
* The authorization needed to call functions in the Azure Functions app that contains your Azure Function.

This section describes how to require authorization for your Azure Functions app. The API Management service is created later, so its
authorization requirements are also specified later.

When Snowflake authenticates with your Azure Functions app, Snowflake uses OAuth client credential grant flow with Azure AD.

For more details about the client credential grant flow, see the Microsoft documentation:
[client credential](https://docs.microsoft.com/en-us/azure/active-directory/azuread-dev/v1-oauth2-client-creds-grant-flow).

This client credential flow requires an Azure AD app registration that represents the Azure Functions app.

This section includes instructions for creating the Azure AD app registration for the Azure Functions app. For example, you can set your
Azure Functions app to require Azure AD authentication. To configure authorization via Azure AD, you must:

* Create an Azure AD app registration, which is an Azure AD-based entity that represents
  an identity or resource identifier (i.e. what you want to protect).
* Associate the Azure AD app registration with the Azure Functions app for which you want to require authentication.

> **Note:**
>
> For Azure Functions, the fastest way to create an Azure AD app registration is by enabling Azure AD Authentication for the service, as
> documented below. If you are using a remote service other than an Azure Function, use the App registrations page to create
> a new Azure AD app registration for your remote service.
>
> For more details about app registration, see the Microsoft documentation:
>
> > [app registration documentation](https://docs.microsoft.com/en-us/azure/active-directory/develop/quickstart-register-app)

### Enable app service authentication for the Azure functions app

Before you execute the steps below, you should be on the Function App screen for your Azure Functions app.

1. In the left-hand menu pane, look for the section named Settings and click on Authentication.

   If the left-hand margin shows the Developer menu (with Code + Test, Integration, etc.), if you have a scroll
   bar at the bottom of your screen, try sliding the scroll bar to the left to return to the Function App
   or App Service section, and then look for Settings.
2. Click the Add identity provider button.
3. In the Identity provider drop-down menu, select Microsoft.
4. For App registration type, select Create new app registration.
5. In the Name field, type the name of your app.
6. For Supported account types, select Current tenant - Single tenant.
7. For Restrict access, select Require authentication.
8. For Unauthenticated requests, select HTTP 401 Unauthorized.
9. Click Next: Permissions. Review the permissions.
10. Click Add. A new Azure AD application is created and the application page is displayed.
11. Click the link that shows your application’s name to go to your Azure AD application’s page.
12. Find the Application (client) ID field.

    Record this ID in the Azure Function App AD Application ID field in your tracking worksheet.

    > **Important:**
    >
    > Make sure you copy the ID, not the Azure AD application name. The ID should contain a UUID.

## Next step

[Step 2: Create the proxy service (Azure API Management service) in the Portal](external-functions-creating-azure-ui-proxy-service.md)

---
title: Step 1: Create the remote service (Google Cloud Function) in the console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-ui-remote-service.md
section: SQL General Reference
---

# Step 1: Create the remote service (Google Cloud Function) in the console

This topic provides detailed instructions for creating a Google Cloud Function for use as the remote service for your external function.

## Previous step

[Planning an external function for GCP](external-functions-creating-gcp-planning.md)

## Create the Google Cloud Function

Create the function by following Google’s
[instructions to create a Cloud Function](https://cloud.google.com/functions/docs/quickstart-console).

If you are creating the function using the sample Python-language function provided by Snowflake, then choose Python Quickstart;
otherwise, choose the appropriate QuickStart based on the language you are using.

As you follow Google’s instructions, make sure to do the following:

1. Specify that the trigger for the function is HTTP.
2. Copy the trigger URL to the `Cloud Function Trigger URL` field in your tracking worksheet.
3. In the Authentication section, select Require authentication.

   The GCP instructions say to select Allow unauthenticated invocations. That is acceptable for sample
   functions, including the sample function provided by Snowflake, but most production systems should require authentication.
4. If Require HTTPS is not already enabled, then enable it.
5. Click Save.
6. Select an appropriate Runtime. If you are creating the sample Python function supplied by Snowflake,
   then choose the Python 3.7 runtime.

   > **Important:**
   >
   > Select the Runtime value before you paste in the code.
7. Replace the default code with either the Snowflake sample code or your own custom code. The Snowflake sample code is provided in
   Sample synchronous Google Cloud Function (in this topic).
8. Make sure that the Entry point matches the name of the function (in this case, `echo`).

## Test the Google Cloud Function

After you finish creating the Google Cloud Function, use the Testing tab in the console to call the function to make sure that
it works as expected.

For the sample Python function provided by Snowflake, use the following test data (replace any default
data in the Testing tab with the data below):

> ```sqljson
> { "data":
>   [
>     [ 0, 43, "page" ],
>     [ 1, 42, "life, the universe, and everything" ]
>   ]
> }
> ```

The execution results should be similar to:

> ```sqljson
> {"data":
>   [
>     [0, [43, "page"] ],
>     [1, [42, "life, the universe, and everything"] ]
>   ]
> }
> ```

The results might be displayed in a different format from the example shown above.

If the test succeeded, you now have a Google Cloud Function that you can use as the remote service for your external function.

## Sample synchronous Google Cloud Function

This sample code combines the input parameter values into a single list (array) and returns that list as a single
value of SQL type VARIANT. The code is written in Python 3.7.

This function accepts and returns data in the same format (JSON) that Snowflake sends and reads.

> ```python
> import json
>
> HTTP_SUCCESS = 200
> HTTP_FAILURE = 400
>
> def echo(request):
>     try:
>         # The list of rows to return.
>         return_value = []
>
>         payload = request.get_json()
>         rows = payload["data"]
>
>         # For each input row
>         for row in rows:
>             # Include the row number.
>             row_number = row[0]
>             # Combine the value(s) in the row into a Python list that will be treated as an SQL VARIANT.
>             row_value = row[1:]
>             row_to_return = [row_number, row_value]
>             return_value.append(row_to_return)
>
>         json_compatible_string_to_return = json.dumps( { "data" : return_value } )
>         return (json_compatible_string_to_return, HTTP_SUCCESS)
>
>     except:
>         return(request.data, HTTP_FAILURE)
> ```

For more information about data formats, see [Remote Service Input and Output Data Formats](external-functions-data-format.md) .

## Next step

[Step 2: Create the proxy service (Google Cloud API Gateway) in the console](external-functions-creating-gcp-ui-proxy-service.md)

---
title: Step 1: Use the template to create the remote service (AWS Lambda function) and proxy service (API Gateway)
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-template-services.md
section: SQL General Reference
---

# Step 1: Use the template to create the remote service (AWS Lambda function) and proxy service (API Gateway)

This topic provides detailed instructions for using the AWS CloudFormation template provided by Snowflake. The template simplifies
the tasks for creating the AWS Lambda Function (to use as the remote service) and the Amazon API Gateway (to use as the proxy
service) for your external function.

This document shows how to create a sample external function on AWS by using a CloudFormation template.

Snowflake provides a template you can start with. This template hides some details of the creation process and
hard-codes some names (e.g. the stage name) and functionality. When you are ready to create your own custom external
function, you can either customize a copy of the template, or you can follow the more flexible instructions at
[Creating external functions on AWS](external-functions-creating-aws.md).

If you would like to customize the template, you can read more about
[AWS CloudFormation](https://docs.aws.amazon.com/AWSCloudFormation/latest/UserGuide/cfn-whatis-concepts.html) .

> **Note:**
>
> These instructions assume that you are already familiar with AWS administration. These instructions
> describe the general steps that you need to execute, but do not describe the user interface in
> detail because the interface could change.

## Previous step

[Planning an external function for AWS](external-functions-creating-aws-planning.md)

## Upload the template

1. Go to the AWS Management Console.
2. In the top search bar, search for CloudFormation.
3. Under Services, click on CloudFormation.
4. Click on Create stack.

   If given a choice between With new resources (standard) or
   With existing resources (import resources), then choose With new resources (standard).
5. On the Create stack page, under Prepare template, select Template is ready.
6. Select Upload a template file.
7. Select Choose file.
8. Navigate to the directory that contains your copy of the template, then select that template.
9. Click Next to reach the page on which you enter names for roles, etc.

   > **Note:**
   >
   > The template uses default names for some resources. You can change the names.

## Configure your options

The template contains default values for most fields. However, you need to enter a few values, such as whether you want a
regional endpoint or a private endpoint.

1. Enter a name for the stack.
2. Enter the type of endpoint that you want to use: “REGIONAL” or “PRIVATE”.

   If you are unsure which type to use, choose “REGIONAL”.

   If you choose “PRIVATE”, then update the VPC ID (labeled “sourceVpcId” in the template).
   (For instructions on finding your VPC ID, see [Planning an external function for AWS](external-functions-creating-aws-planning.md).)

   For more information about endpoints, including a description of the different types of endpoints, see
   [AWS endpoints](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-basic-concept.html)
   and [Choosing your endpoint type: Regional endpoint vs. Private endpoint](external-functions-creating-aws-planning.md).
3. Enter a name for the API Gateway IAM role (parameter apiGatewayIAMRoleName). This is the role assumed
   by Snowflake for authorizing with the API Gateway.
   Make sure this role does not already exist because the template will try to update the role if it exists.

   Record the role name in the tracking worksheet field titled `New IAM Role Name`.
4. Enter a name for the Lambda Execution role (parameter lambdaExecutionRoleName). This role is used by the
   Lambda service for adding CloudWatch logs.
   Make sure this role does not already exist because the template will try to update the role if it exists.
5. Click Next.

   This page has some advanced options for template deployment.

   1. Optionally, set advanced options, such as stack policy. (These are not needed when creating the sample function
      using the template supplied by Snowflake. However, if you use template-based deployment for functions that you have
      customized, then you might need to customize the advanced options at this point.)
   2. Click Next.
6. On the review page, scroll down to the end and acknowledge that the CloudFormation template might create IAM
   resources with custom names. This is needed because the template creates two IAM roles as part of the deployment.
7. Click on Create stack.

The deployment will take a few seconds. After the deployment is complete, you should be on the Events tab for
the newly created stack. The created resources will be listed under the Resources tab.

## Next step

[Step 2: Record the Amazon API Gateway URL and the new IAM role ARN](external-functions-creating-aws-template-gateway-url.md)

---
title: Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-ui-proxy-service.md
section: SQL General Reference
---

# Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console

Snowflake does not send data (HTTP POST requests) directly to a remote service. Instead, Snowflake sends the data to a proxy
service that relays the data to the remote service (e.g. an AWS Lambda Function) and back again.

This topic provides instructions for creating and configuring an Amazon API Gateway for use as the proxy service for your
external function.

Configuring an Amazon API Gateway as the proxy service requires several steps, including:

* Creating a new IAM (identity and access management) role in your AWS account.
* Creating an Amazon API Gateway endpoint and configuring it.
* Securing your Amazon API Gateway endpoint.
* Creating an API Integration object in Snowflake.
* Setting up a trust relationship between Snowflake and the new IAM role.

The steps to create these are interleaved because:

* The API integration needs information from the API Gateway, such as the role’s ARN (Amazon Resource Name).
* The API Gateway needs information from the API integration, such as the API_AWS_EXTERNAL_ID and API_AWS_IAM_USER_ARN.

## Previous step

[Step 1: Create the remote service (AWS Lambda function) in the Management Console](external-functions-creating-aws-ui-remote-service.md)

## Create a new IAM role in your AWS account

For Snowflake to authenticate to your AWS account, a Snowflake-owned IAM (identity and access management) user must be
granted permission to assume an IAM role in your AWS account.

The steps to create an IAM role are:

1. Create a new IAM role: In the AWS console, search for IAM, click Roles, and then click Create Role.
2. When asked to select the type of trusted entity, choose Another AWS account.
3. When asked to Specify accounts that can use this role, paste the value from the worksheet field named
   `Your AWS Account ID`.

   (Use your AWS Account ID, not Snowflake’s. Snowflake’s ARN will be associated with this IAM role later.)
4. Click Next: Permissions.
5. Optionally, set permissions (Attach permissions policies).
6. Click Next: Tags.
7. Optionally, add tags.
8. Click Next: Review.
9. Enter a role name.

   * Record the role name in the `New IAM Role Name` field in the worksheet.
10. Click on the Create role button. After you create the role:

    * Record the Role ARN in the `New IAM Role ARN` field in the worksheet.

## Create the API Gateway endpoint

Before you create and configure your API Gateway, choose whether to use a regional endpoint or a
private endpoint. For more information, see [Choosing your endpoint type: Regional endpoint vs. Private endpoint](external-functions-creating-aws-planning.md).

If you plan to use a private endpoint, you need the VPC (Virtual Private Cloud) ID that you recorded in the tracking worksheet.

The steps to create an API Gateway endpoint are below:

1. In the AWS management console, select API Gateway.
2. Select Create API.
3. Select the type of endpoint (regional or private).

   * If you want a regional endpoint, then:

     > + Find REST API and click on its Build button.
   * If you want a private endpoint, then:

     > + Find REST API private and click on its Build button.
   > **Important:**
   >
   > Make sure that you choose REST API or REST API private. Do not select HTTP API
   > or another option.
4. Select the New API option.
5. Enter a name for the new API.

   Record this name in the `New API Name` field in the worksheet.
6. If asked to select an Endpoint Type, select either Regional or Private.
7. Leave the `VPC Endpoint IDs` field blank.
8. Click on the Create API button.
9. To create a resource, click Actions, and then click Create Resource.

   Record the resource name in the `API Gateway Resource Name` field of the worksheet.

   Click the Create Resource button. The screen displays
   No methods defined for the resource.
10. To create a new method, click Actions and select Create Method.

    In the small drop-down menu box under the resource name, select POST and then click the grey checkmark beside it.
11. The Integration type should be Lambda Function. If that is not already selected, then select it.
12. Select the checkbox Use Lambda Proxy integration.

    It is important to select Lambda proxy integration because
    the JSON without Lambda proxy integration would be different from the JSON with Lambda proxy integration.
    For more information about Lambda proxy integration, see the AWS documentation for:

    * [Lambda integration](https://docs.aws.amazon.com/apigateway/latest/developerguide/getting-started-with-lambda-integration.html)
    * [API development](https://docs.aws.amazon.com/apigateway/latest/developerguide/http-api-develop-integrations-lambda.html)
13. In the Lambda Function field, paste the `Lambda Function Name` that you recorded in the worksheet.
14. Click on the Save button.
15. Click on the Actions button, and select the Deploy API action.
16. Select or create a stage. Click Deploy.
17. Underneath the resource name, you should see POST.

    If you do not see this, you might need to expand the resource tree by clicking on the triangle that is to the
    left of the resource name.
18. Click on POST, and then record the Invoke URL for the POST request in the
    `Resource Invocation URL` field in the worksheet.

    Make sure that the invocation URL includes the name of the resource that you created; if it doesn’t, you might
    have clicked on the invocation URL for the stage rather than the resource.
19. Click on Save Changes.

## Test the API Gateway

Check that the API Gateway can call your Lambda Function.

1. Follow [AWS’s instructions for testing](https://docs.aws.amazon.com/apigateway/latest/developerguide/api-gateway-request-validation-test.html#api-gateway-request-validation-test-in-console) .
2. At the appropriate step in the AWS instructions, paste the following text into the Request Body:

   ```sqljson
   {
       "data":
           [
               [0, 43, "page"],
               [1, 42, "life, the universe, and everything"]
           ]
   }
   ```

After you execute the test, you should see the Request, Status, Latency, and Response Body appear on
the right (you might need to scroll to see it).

If the returned status is 200, your API Gateway invoked the correct Lambda function.

(This verification step skips authentication, and therefore does not uncover issues with permissions.)

## Secure your Amazon API Gateway endpoint

For an overview of securing proxy service endpoints, such as Amazon API Gateway endpoints,
see [Secure the proxy service](external-functions-security.md).

To secure an Amazon API Gateway endpoint:

1. At this point, you should be on the screen that displays your API Gateway information, and you should see
   your resource and POST method.

   If you are not already there, do the following:

   1. In the AWS Management Console, go to the API Gateway page.
   2. Select your API Gateway.
   3. In the left-hand pane, click on Resources.
   4. Click on the POST method. (If you don’t see this, expand the resource tree by
      clicking on the triangle to the left of the resource in the Resources pane,
      which is usually the second pane from the left.)
2. Copy the Method Request ARN from the Method Request box to the `Method Request ARN` field in the
   worksheet.
3. Click on the title Method Request.
4. Click the edit symbol beside Authorization and select `AWS_IAM` to specify that the method request requires AWS_IAM authorization.

   Click on the small checkmark next to the menu to confirm your choice.
5. To set the resource policy for the API Gateway to specify who is authorized to invoke the gateway endpoint, click on
   Resource Policy in the left-hand column of the window for the API.

   * Regional Endpoint:

     Paste the JSON-formatted resource policy template below into the resource policy editor, then replace the
     placeholders with the appropriate values from the worksheet, as described below.

     ```sqljson
     {
         "Version": "2012-10-17",
         "Statement":
         [
             {
             "Effect": "Allow",
             "Principal":
                 {
                 "AWS": "arn:aws:sts::<12-digit-number>:assumed-role/<external_function_role>/snowflake"
                 },
             "Action": "execute-api:Invoke",
             "Resource": "<method_request_ARN>"
             }
         ]
     }
     ```

     Replace the following portions of the resource policy:

     + Replace the `<12-digit-number>` with the value in the field `Your AWS Account ID`, which you recorded in the worksheet.
     + Replace the `<external_function_role>` with the role name from the `New IAM Role Name` field in the
       worksheet.

       > For example, if your AWS Role Name is:
       >
       > ```none
       > arn:aws:iam::987654321098:role/MyNewIAMRole
       > ```
       >
       > then the result should be:
       >
       > ```none
       > "AWS": "arn:aws:sts::987654321098:assumed-role/MyNewIAMRole/snowflake"
       > ```
     + Replace the `<method_request_ARN>` with the value in the `Method Request ARN` field of the worksheet.
       This is the ARN of the resource’s POST method.

       > > **Note:**
       > >
       > > Setting the Resource to the Method Request ARN specifies that the API Gateway should allow calls to only the
       > > specified resource.
       > > It is possible to specify a subset of the Method Request ARN as a prefix, which allows multiple resources to
       > > be called from the same API Gateway.
       > >
       > > For example, if the Method Request ARN is:
       > >
       > > ```none
       > > arn:aws:execute-api:us-west-1:123456789012:a1b2c3d4e5/*/POST/MyResource
       > > ```
       > >
       > > then you could specify just the following prefix:
       > >
       > > ```none
       > > arn:aws:execute-api:us-west-1:123456789012:a1b2c3d4e5/*
       > > ```
     + U.S. government GovCloud users only:

       - Update the Method Request ARN to use `aws-us-gov`, e.g.:

         > ```none
         > arn:aws-us-gov:execute-api:us-gov-west-1:123456789012:a1b2c3d4e5/*
         > ```
       - Make sure that you use a GovCloud region, e.g. `us-gov-west-1`.
   * Private Endpoint:

     > Paste the resource policy template below into the resource policy editor, then replace the placeholders
     > with the appropriate values from the worksheet, as described below.
     >
     > ```sqljson
     > {
     >     "Version": "2012-10-17",
     >     "Statement": [
     >         {
     >             "Effect": "Allow",
     >             "Principal": {
     >                 "AWS": "arn:aws:sts::<12-digit-number>:assumed-role/<external_function_role>/snowflake"
     >             },
     >             "Action": "execute-api:Invoke",
     >             "Resource": "<method_request_ARN>",
     >             "Condition": {
     >                 "StringEquals": {
     >                     "aws:sourceVpc": "<VPC_ID>"
     >                 }
     >             }
     >         }
     >     ]
     > }
     > ```
     >
     > Replace the following portions of the resource policy:
     >
     > + Replace the <12-digit-number>, <external_function_role> and <method_request_ARN>
     >   as described above for a regional endpoint.
     > + Replace the <VPC_ID> with the Snowflake VPC ID for your region, which should be recorded in
     >   the `Snowflake VPC ID` field of the worksheet.
     > + U.S. government GovCloud users only:
     >
     >   - Update the Method Request ARN to use `aws-us-gov`, e.g.:
     >
     >     > ```none
     >     > arn:aws-us-gov:execute-api:us-gov-west-1:123456789012:a1b2c3d4e5/*
     >     > ```
     >   - Make sure that you use a GovCloud region, e.g. `us-gov-west-1`.
6. Click Save to save the resource policy.
7. Deploy the updated API. To do this, click the API name in the breadcrumb trail at the top of the page. Click Actions and then click Deploy API. Select your deployment stage and click Deploy.

In the next step, you create a Snowflake API integration object. Do not close your AWS Management Console
window now; you must return to it later.

## Next step

[Step 3: Create the API integration for AWS in Snowflake](external-functions-creating-aws-common-api-integration.md)

---
title: Step 2: Create the proxy service (Azure API Management service) in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-ui-proxy-service.md
section: SQL General Reference
---

# Step 2: Create the proxy service (Azure API Management service) in the Portal

Snowflake does not send data (HTTP POST requests) directly to a remote service. Instead, Snowflake sends the data to a proxy service that
relays the data from Snowflake to the remote service (i.e. Azure Function) and back again.

This topic provides instructions for creating and configuring an Azure API Management service for use as the proxy service for your
external function.

## Previous step

[Step 1: Create the remote service (Azure function) in the Portal](external-functions-creating-azure-ui-remote-service.md)

## Create the API Management service

The first step is to create the API Management service in the Azure Portal:

1. If you haven’t already, log into the Portal.
2. To create the API Management service, follow the instructions provided in the Microsoft documentation:
   [Create an API Management service](https://docs.microsoft.com/en-us/azure/api-management/get-started-create-service-instance).

   As you perform the tasks described in the instructions, remember to record the API Management service name (which might be titled
   Resource name) in the `API Management service name` field in your tracking worksheet.

   > **Note:**
   >
   > Deploying the API Management service can take 30-40 minutes or more. When deployment completes, you should see a message similar
   > to Your deployment is complete.
3. After the deployment completes, click the Go to resource button.

## Import the API containing the Azure function

After you create the API Management service, the next step is to import and publish the Azure Functions app that contains the APIs
(functions) to call through that API Management service:

1. To import and publish an Azure Function, follow the instructions provided in the Microsoft documentation:
   [Import a function app](https://docs.microsoft.com/en-us/azure/api-management/import-function-app-as-api).

   This document includes instructions for other tasks, as well as importing APIs. For this demonstration, you typically need only the
   instructions for importing an Azure Functions app as a new API.

   As you perform the tasks described in the instructions, remember the following:

   * One of the steps requires that you specify an option for Product. For this demonstration, choose Starter rather
     than Unlimited. For a production system, you might choose differently.
   * Record the API URL suffix in the `API Management API URL suffix` field in your tracking worksheet.

   After completing the tasks to import an Azure Functions app, you should be back on the API Management service page.
2. Find and click on the Settings tab, which is next to the Design tab on the panel of the screen
   below your API’s revision number (e.g. REVISION 1).
3. If the Subscription Required checkbox has a checkmark, then uncheck it unless you want to require a subscription.

   If you do not see the Subscription section, scroll down.
4. Click the Save button.

> **Note:**
>
> Snowflake strongly recommends
> [creating a security policy on the Azure API Management service](external-functions-creating-azure-ui-security-policy.md).
>
> You can create the security policy now or you can finish creating the external function first and test the external function before
> creating the security policy. To simplify debugging, this topic finishes creating and testing the external function before creating
> the security policy.

## Next step

[Step 3: Create the API integration for Azure in Snowflake](external-functions-creating-azure-common-api-integration.md)

---
title: Step 2: Create the proxy service (Google Cloud API Gateway) in the console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-ui-proxy-service.md
section: SQL General Reference
---

# Step 2: Create the proxy service (Google Cloud API Gateway) in the console

Snowflake does not send data (HTTP POST requests) directly to a remote service. Instead, Snowflake sends the data to a proxy service that
relays the data from Snowflake to the remote service (i.e. GCP Cloud Function) and back again.

This topic provides instructions for creating and configuring a Google Cloud API Gateway for use as the proxy service for your
external function.

## Previous step

[Step 1: Create the remote service (Google Cloud Function) in the console](external-functions-creating-gcp-ui-remote-service.md)

## Links to related Google documentation

For more detailed information about using the Google Cloud Console to perform the tasks described in this topic, see the following sections
in the [Quickstart for Deploying an API/API Gateway using the Google Cloud Console](https://cloud.google.com/api-gateway/docs/quickstart-console):

* [Creating an API definition](https://cloud.google.com/api-gateway/docs/quickstart-console#creating_an_api_definition)
* [Creating a gateway](https://cloud.google.com/api-gateway/docs/quickstart-console#creating_a_gateway)

If you prefer to use the command line instead of the Google Cloud Console, see the following sections in the
[Quickstart for Deploying an API/API Gateway using gcloud](https://cloud.google.com/api-gateway/docs/quickstart):

* [Creating an API](https://cloud.google.com/api-gateway/docs/creating-api#creating-an-api)
* [Creating an API config](https://cloud.google.com/api-gateway/docs/creating-api-config)
* [Deploy an API to a gateway](https://cloud.google.com/api-gateway/docs/deploying-api)

If you use the Google documentation, remember to copy the required information (e.g. Gateway URL) to your tracking worksheet.

## Create an API definition

On your local file system, create and customize a YAML-formatted configuration file that specifies the API you are creating. The file
should have the `.yaml` or `.yml` extension.

Configuration file template:

```none
swagger: '2.0'
info:
  title: API Gateway config for Snowflake external function.
  description: This configuration file connects the API Gateway resource to the remote service (Cloud Function).
  version: 1.0.0
schemes:
  - https
produces:
  - application/json
paths:
  /<PATH>:
    post:
      summary: Echo the input.
      operationId: echo
      x-google-backend:
        address: <HTTP ENDPOINT TO ROUTE REQUEST TO>
        protocol: h2
      responses:
        '200':
          description: <DESCRIPTION>
          schema:
            type: string
```

Fill in or update the following fields:

1. Replace `<PATH>` with a unique name. This will be incorporated into URLs, so use only characters that are valid in URLs. For
   example, enter `demo-func-resource`.

   Note that, unlike the other fields in this configuration file, enter the `<PATH>` value before the colon, rather than
   after the colon. For example, the following is correct:

   ```none
   paths:
     /demo-func-resource:
   ```

   The path name should not contain any
   [path parameters](https://swagger.io/docs/specification/2-0/describing-parameters/#path-parameters).
   Google supports path parameters when
   [setting the path to a URL](https://cloud.google.com/api-gateway/docs/passing-data#setting_the_backend_service_address_and_path_in_the_openapi_spec).
   However, Snowflake does not support path parameters in the corresponding URL specified in the CREATE EXTERNAL FUNCTION statement.
2. Copy the path (e.g. `demo-func-resource`) from the immediately preceding step to the `Path Suffix` field
   in your tracking worksheet.
3. Find the `address` field under the `x-google-backend` field, and replace `<HTTP ENDPOINT TO ROUTE REQUEST TO>` with the
   value from the `Cloud Function Trigger URL` field in your tracking worksheet. The result should look similar to:

   ```none
   x-google-backend:
     address: https:// ...
   ```

   The URL should not be enclosed in quotation marks.

   The URL does not need to be an endpoint hosted by Google; it can be the path to any HTTP endpoint.

   If you selected Require HTTPS in [Step 1: Create the remote service (Google Cloud Function) in the console](external-functions-creating-gcp-ui-remote-service.md), then ensure
   that the URL you enter into the `address` field starts with `https`.
4. Optionally, you can update any of the following values:

   > * `title` in the `info` section.
   > * `description` in the `info` section.
   > * `operationId` in the `post` subsection of the `paths` section.
   > * `summary` in the `post` subsection of the `paths` section.
5. Review your sample configuration file. It should look similar to the following:

   ```none
   swagger: '2.0'
   info:
     title: "API Gateway config for Snowflake external function"
     description: "This configuration file connects the API Gateway resource to the remote service (Cloud Function)."
     version: 1.0.0
   schemes:
     - https
   produces:
     - application/json
   paths:
     /demo-func-resource:
       post:
         summary: "echo the input"
         operationId: echo
         x-google-backend:
           address: https://my_dev.cloudfunctions.net/demo-cloud-function-01
           protocol: h2
         responses:
           '200':
             description: echo result
             schema:
               type: string
   ```

   > **Note:**
   >
   > This configuration will leave your gateway open to the public until you secure it in [Step 5: Create a GCP security policy for the proxy service in the console](external-functions-creating-gcp-ui-security-policy.md) of this tutorial.
6. Optionally, to make sure that no one can use your gateway in the meantime, add a security definition to the configuration file that uses a temporary,
   invalid service account name (`google_service_account`) as described in this optional step. Adding this security definition in this step means that you cannot test your
   external function until you finish configuring security in [Step 5: Create a GCP security policy for the proxy service in the console](external-functions-creating-gcp-ui-security-policy.md). Specifically, the instruction
   to test your external function in [Step 4: Create the external function for GCP in Snowflake](external-functions-creating-gcp-common-ext-function.md) will not work yet.

   1. Add the following `securityDefinitions` section immediately above the `schemes` section of the configuration file and at the same indentation level.

      > ```none
      > securityDefinitions:
      >   <security-def-name>:
      >     authorizationUrl: ""
      >     flow: "implicit"
      >     type: "oauth2"
      >     x-google-issuer: "google_service_account"
      >     x-google-jwks_uri: "https://www.googleapis.com/robot/v1/metadata/x509/google_service_account"
      > ```
      >
      > * Replace `<security-def-name>` with a unique security definition name (e.g. `snowflakeAccess01`).
      > * Record this name in the `Security Definition Name` field in your tracking worksheet.
   2. Update the `post:` section of the configuration file to reference the security definition that you created above. Below the `operationId` field, add:

      ```none
      security:
        - <security-def-name>: []
      ```

      * Make sure it is indented at the same level as the `operationId` field.
      * Replace `<security-def-name>` with the value from the `Security Definition Name` field in your tracking worksheet.
      * Make sure to include a hyphen and a blank prior to the security definition name, as shown above.
      * Make sure to include the empty square braces (`[]`) after the colon.

      For example:

      ```none
      paths:
        /demo-func-resource:
          post:
            summary: "echo the input"
            operationId: echo
            security:
              - snowflakeAccess01: []
            x-google-backend:
              address: https://my_dev.cloudfunctions.net/demo-cloud-function-01
              protocol: h2
      ```
7. Save the configuration file.
8. Record the file path and name in the `Configuration File Name` field in your tracking worksheet.

To learn more about the API configuration file, see the following GCP documentation:

* [OpenAPI overview](https://cloud.google.com/api-gateway/docs/openapi-overview) .
* [Create an API definition](https://cloud.google.com/api-gateway/docs/quickstart-console#creating_an_api_definition) .

## Create an API Gateway

To create an API Gateway:

1. Create a GCP API.
2. Create an API Config.
3. Create a Gateway with the API Config.

### Create a GCP API

This step creates a *GCP API*, which is a container that can contain one or more API Gateways and one or more configuration files:

1. If you have not already done so, go to the Google Cloud API Gateway screen by clicking on the GCP menu and selecting
   API Gateway.
2. Click on CREATE GATEWAY.
3. Enter the Display Name and the API ID (e.g. `demo-api-display-name-for-external-function1` and
   `demo-api-id-for-external-function1`).

   You do not need to record these values in your tracking worksheet because you do not need to enter these later to create your
   external function. However, you might want to record the API ID so that you can delete it when you are done with it.

### Create an API config

Upload your configuration file to the console, which creates an *API Config*.

1. Scroll to the API Config section of the screen.
2. Search for the field that contains Upload an API Spec.

   Click on BROWSE and select your configuration file. The name of your configuration file was recorded in
   the `Configuration File Name` field in your tracking worksheet.
3. Enter a display name into the field that contains Display Name.
4. Select a service account.

   If you created the sample function, then in the field that contains Select a Service Account, select
   App Engine default service account.

   If you are creating a function to use in production (rather than as a sample), you might choose a different service account.

   The selected service account must have appropriate privileges, including privileges to call the Cloud Function.

### Create a gateway with the API config

1. Scroll to the Gateway details section of the screen.
2. Enter the Display Name of the new API Gateway.
3. Click in the Location field and select the appropriate region (e.g. `us-central1`).
4. Click on CREATE GATEWAY.

   This takes you to the APIs screen and shows you a list of your APIs.

   If your new API is not visible immediately, wait a few minutes, then click the Refresh button.
5. Copy the value of the API’s Managed Service to the `Managed Service Identifier` field in your tracking worksheet.
6. At this point, you should still see a list of your APIs. Click on the name of the API.

   You should see 4 tabs: OVERVIEW, DETAILS, CONFIGS, and GATEWAYS.
7. Click on the GATEWAYS tab.
8. Copy the Gateway URL to the `Gateway Base URL` field in your tracking worksheet.

## Next step

[Step 3: Create the API integration for GCP in Snowflake](external-functions-creating-gcp-common-api-integration.md)

---
title: Step 2: Record the Amazon API Gateway URL and the new IAM role ARN
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-template-gateway-url.md
section: SQL General Reference
---

# Step 2: Record the Amazon API Gateway URL and the new IAM role ARN

## Previous step

[Step 1: Use the template to create the remote service (AWS Lambda function) and proxy service (API Gateway)](external-functions-creating-aws-template-services.md)

## Get the API Gateway URL and the new IAM role ARN

In the next few steps, you create the API integration and the external function.
In order to create these, you need the API Gateway URL and the New IAM Role ARN, which you can find by following the steps below.

1. You should be in the AWS Management Console. You should be on the Events tab for the stack you created in the
   previous step.
2. Click on the Outputs tab.
3. Copy the value for resourceInvocationUrl to the tracking worksheet field titled `Resource Invocation URL`.
4. Copy the value for awsRoleArn to the tracking worksheet field titled `New IAM Role ARN`.

## Next step

[Step 3: Create the API integration for AWS in Snowflake](external-functions-creating-aws-common-api-integration.md)

---
title: Step 2: Use the template to create the remote service (Azure function) and proxy service (API Management service)
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-template-services.md
section: SQL General Reference
---

# Step 2: Use the template to create the remote service (Azure function) and proxy service (API Management service)

This topic provides detailed instructions for using the ARM template provided by Snowflake. The template simplifies the tasks for
creating the Azure Function (to use as the remote service) and API Management service (to use as the proxy service) for your
external function.

## Previous step

[Step 1: Create an Azure AD app for the Azure functions app in the Portal](external-functions-creating-azure-template-apps.md)

## Import the template

Before you can use the template, you have to import it into the Azure Portal:

1. If you haven’t already, log into the Azure Portal.
2. In the Azure search bar, search for Template.
3. Under Services, click on Deploy a custom template.
4. Select Build your own template in the editor.
5. Select Load file.
6. Navigate to the directory on the machine where you downloaded the template, then select that template.
7. Click Save.

This takes you to the Custom deployment screen.

## Create the Azure function and API Management service

In the Custom deployment screen:

1. Select an existing (or create a new) Resource group.

   > **Tip:**
   >
   > If you create a new resource group solely for this demonstration, then you might want to record
   > the name so that you can delete it later when you are done with it.
2. Select the appropriate Region.
3. Enter an API Management Service Name.
4. Record the API Management Service name in the `API Management service name` field in your tracking worksheet.
5. In the Function App Name field, enter a unique name.
6. Record the Function App Name in the `Azure Function app name` field in your tracking worksheet.
7. In the Publisher email field, enter your email address. Microsoft uses this email to notify you after the API Management
   service has been created.
8. In the Azuread Application Id field, enter the ID of the Azure AD application you created earlier. This is the value in the
   `Azure Function AD Application ID` field in your tracking worksheet.
9. Click on Review + create.
10. Click on Create.

Creating the Azure Functions app and API Management service typically takes approximately half an hour.

## Obtain the required URLs for the API integration and external function

To create the API integration and external function in Snowflake, you need the API Management service’s URL, which you can find by
following the steps below after Azure has finished creating the API Management service.

At this point, the Azure Portal should show the message Your deployment is complete and should show the Deployment name.

1. Click on Outputs in the left-hand column.
2. Record the api Management URL in the `API Management URL` field in your tracking worksheet.
3. Record the azure Function Http Trigger URL in the `Azure Function HTTP Trigger URL` field in your tracking worksheet.

## Next step

[Step 3: Create the API integration for Azure in Snowflake](external-functions-creating-azure-common-api-integration.md)

---
title: Step 3: Create the API integration for AWS in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-common-api-integration.md
section: SQL General Reference
---

# Step 3: Create the API integration for AWS in Snowflake

This topic provides instructions for creating an API integration object in Snowflake to work with your proxy service (i.e.
Amazon API Gateway). The instructions are the same regardless of whether you are using the Management Console or the
CloudFormation template.

## Previous step

AWS Management Console:
:   [Step 2: Create the proxy service (Amazon API Gateway) in the AWS Management Console](external-functions-creating-aws-ui-proxy-service.md)

AWS CloudFormation template:
:   [Step 2: Record the Amazon API Gateway URL and the new IAM role ARN](external-functions-creating-aws-template-gateway-url.md)

## Prerequisites

You need the following information to create the API integration for AWS in Snowflake:

> * The `New IAM Role ARN` (from your tracking worksheet).
> * The `Resource Invocation URL` (from your tracking worksheet).

## Create the API integration object

1. Open a Snowflake session, typically a Snowflake web interface session.
2. Use a Snowflake role with ACCOUNTADMIN privileges or the CREATE INTEGRATION privilege, for example:

   ```sqlexample
   use role <has_accountadmin_privileges>;
   ```
3. Type the [CREATE API INTEGRATION](sql/create-api-integration.md) command to create an API integration. The command should
   look similar to the following:

   ```sqlexample
   CREATE OR REPLACE API INTEGRATION my_api_integration_01
     api_provider = aws_api_gateway
     api_aws_role_arn = '<new_IAM_role_ARN>'
     api_allowed_prefixes = ('https://')
     enabled = true;
   ```

   Customize the command:

   * The `api_provider` clause should be set based on the type of endpoint:

     + If you are using a private endpoint, the api_provider clause should be set to
       `aws_private_api_gateway`.
     + If you are using a U.S. government GovCloud endpoint, the api_provider clause should be set to
       `aws_gov_api_gateway` or `aws_gov_private_api_gateway`.
     + For most other users, the api_provider clause should be set to `aws_api_gateway`.
   * The `<new_IAM_role_ARN>` should be the value in the `New IAM Role ARN` field in the tracking worksheet.
   * The api_allowed_prefixes field should contain the resource invocation URL that you recorded earlier.

   Below is an example of a complete CREATE API INTEGRATION statement:

   ```sqlexample
   create or replace api integration demonstration_external_api_integration_01
       api_provider=aws_api_gateway
       api_aws_role_arn='arn:aws:iam::123456789012:role/my_cloud_account_role'
       api_allowed_prefixes=('https://xyz.execute-api.us-west-2.amazonaws.com/production/')
       enabled=true;
   ```
4. In the tracking worksheet field titled `API Integration Name`, record the name of the API integration
   that you created. You need the API integration name when you execute the
   CREATE EXTERNAL FUNCTION command later.
5. Execute the CREATE API INTEGRATION command you typed above.

## Record the API_AWS_IAM_USER_ARN and API_AWS_EXTERNAL_ID

1. Execute the DESCRIBE INTEGRATION command.

   ```sqlexample
   DESCRIBE INTEGRATION <my_integration_name>;
   ```

   For example:

   ```sqlexample
   DESCRIBE INTEGRATION my_api_integration_01;
   ```
2. Look for the property named API_AWS_IAM_USER_ARN and then record that property’s property_value in the
   tracking worksheet.
3. Find the property named API_AWS_EXTERNAL_ID and record that property’s property_value in the tracking worksheet.

   Note that the property_value of the API_AWS_EXTERNAL_ID often ends with an equals sign (“=”). That equals sign is
   part of the value; make sure that you cut and paste it along with the rest of the property_value.

For the next few steps, you return to your AWS administration window. Do not close your Snowflake
administration window now; you must return to it later.

## Next step

[Step 4: Link the API integration for AWS to the proxy service in the Management Console](external-functions-creating-aws-common-api-integration-proxy-link.md)

---
title: Step 3: Create the API integration for Azure in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-common-api-integration.md
section: SQL General Reference
---

# Step 3: Create the API integration for Azure in Snowflake

This topic provides instructions for creating an API integration object in Snowflake to work with your proxy service (i.e. Azure API
Management service). The instructions are the same regardless of whether you are using the Azure Portal or ARM template.

## Previous step

Azure Portal:
:   [Step 2: Create the proxy service (Azure API Management service) in the Portal](external-functions-creating-azure-ui-proxy-service.md)

ARM template:
:   [Step 2: Use the template to create the remote service (Azure function) and proxy service (API Management service)](external-functions-creating-azure-template-services.md)

## Required information

You need the following information to create the API integration for Azure in Snowflake:

* `Azure Function App AD Application ID` (from your tracking worksheet)
* Azure AD Tenant ID (as described in the [Prerequisites](external-functions-creating-azure-planning.md) section for planning an external
  function)

## Create the API integration object

Use the [CREATE API INTEGRATION](sql/create-api-integration.md) command to create the API integration object:

1. Open a Snowflake session, typically a Snowflake web interface session.
2. Execute the USE ROLE command to use the ACCOUNTADMIN role or a role with the CREATE INTEGRATION privilege. For example:

   > ```sqlexample
   > use role has_accountadmin_privileges;
   > ```
3. Enter a CREATE API INTEGRATION statement. The statement should look similar to the following:

   > ```sqlexample
   > create or replace api integration <integration_name>
   >     api_provider = azure_api_management
   >     azure_tenant_id = '<tenant_id>'
   >     azure_ad_application_id = '<azure_application_id>'
   >     api_allowed_prefixes = ('<url>')
   >     enabled = true;
   > ```

   In the statement:

   > 1. Replace `<integration_name>` with a unique integration name (e.g. `my_api_integration_name`). The name must follow the rules for
   >    [Object identifiers](identifiers.md).
   >
   >    In addition, record the integration name in the `API Integration Name` field in your tracking worksheet. You will need the name when
   >    you execute the CREATE EXTERNAL FUNCTION command later in the creation process.
   > 2. Replace `<tenant_id>` with your Azure AD Tenant ID.
   >
   >    As an alternative, you can use your domain (e.g. `my_company.onmicrosoft.com`).
   > 3. Replace `<azure_application_id>` with the value from the `Azure Function App AD Application ID` field in your tracking worksheet.
   > 4. For `api_allowed_prefixes`, replace `<url>` with the appropriate URL.
   >
   >    Usually, this is the URL of the proxy service (i.e. Azure API Management service), in the following format:
   >
   >    > ```sqlexample
   >    > https://<api_management_service_name>.azure-api.net
   >    > ```
   >
   >    However, you can restrict the URLs to which this API integration can be applied by appending an appropriate suffix, in which case
   >    the URL has the following format:
   >
   >    > ```sqlexample
   >    > https://<api_management_service_name>.azure-api.net/<api_url_suffix>
   >    > ```
   >
   >    The URL you enter depends on whether you are using the Azure Portal or ARM template to create your external function:
   >
   >    Azure Portal:
   >    :   Use the values from the `API Management service name` and `API Management API URL suffix` fields in your
   >        tracking worksheet. For example, your URL should look similar to:
   >
   >        ```sqlexample
   >        https://my-api-management-svc.azure-api.net/my-api-url-suffix
   >        ```
   >
   >        This should match the base URL and suffix from the API Management service Settings tab for your imported API. If
   >        convenient, you can copy the value from the tab instead.
   >
   >    ARM template:
   >    :   Use the value from the `API Management URL` field in your tracking worksheet.
4. If you haven’t already, execute the CREATE API INTEGRATION statement you entered.

## Next step

[Step 4: Link the API integration for Azure to the proxy service in the Portal](external-functions-creating-azure-common-api-integration-proxy-link.md)

---
title: Step 3: Create the API integration for GCP in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-common-api-integration.md
section: SQL General Reference
---

# Step 3: Create the API integration for GCP in Snowflake

This topic provides instructions for creating an API integration object in Snowflake to work with your proxy service (i.e. Google Cloud API
Gateway).

## Previous step

[Step 2: Create the proxy service (Google Cloud API Gateway) in the console](external-functions-creating-gcp-ui-proxy-service.md)

## Create the API integration object

Use the [CREATE API INTEGRATION](sql/create-api-integration.md) command to create the API integration object:

1. Open a Snowflake session, typically a Snowflake web interface session.
2. Execute the USE ROLE command to use the ACCOUNTADMIN role or a role with the CREATE INTEGRATION privilege. For example:

   > ```sqlexample
   > use role has_accountadmin_privileges;
   > ```
3. Enter a CREATE API INTEGRATION statement. The statement should look similar to the following:

   > ```sqlexample
   > create or replace api integration <integration_name>
   >     api_provider = google_api_gateway
   >     google_audience = '<google_audience_claim>'
   >     api_allowed_prefixes = ('<url>')
   >     enabled = true;
   > ```

   In the statement:

   > 1. Replace `<integration_name>` with a unique integration name (e.g. `my_api_integration_name`. The name must follow the rules for
   >    [Object identifiers](identifiers.md).
   >
   >    In addition, record the integration name in the `API Integration Name` field in your tracking worksheet. You will need the name when
   >    you execute the CREATE EXTERNAL FUNCTION command later in the creation process.
   > 2. For `google_audience`, replace `<google_audience_claim>` with the value from the `Managed Service Identifier` field in your
   >    tracking worksheet.
   >
   >    During authentication, Snowflake passes a JWT (JSON Web Token) to Google. The JWT contains an “aud” (“audience”) claim, which
   >    Snowflake sets to the value for `google_audience`.
   >
   >    For more information about authenticating with Google, see the Google service account
   >    [authentication documentation](https://cloud.google.com/api-gateway/docs/authenticate-service-account#configure_auth).
   > 3. For `api_allowed_prefixes`, replace `<url>` with the value from the `Gateway Base URL` field in your tracking worksheet.
   >
   >    This field allows you to restrict the URLs to which this API integration can be applied. You can use a value that is more restrictive
   >    than the Gateway Base URL.
4. If you haven’t already, execute the CREATE API INTEGRATION statement you entered.

## Record the API_GCP_SERVICE_ACCOUNT information for the API integration

1. Execute the [DESCRIBE INTEGRATION](sql/desc-integration.md) command. For example:

   > ```sqlexample
   > describe integration my_api_integration_name;
   > ```
2. Record the value for `API_GCP_SERVICE_ACCOUNT` in the `API_GCP_SERVICE_ACCOUNT` field in your tracking worksheet.

## Next step

[Step 4: Create the external function for GCP in Snowflake](external-functions-creating-gcp-common-ext-function.md)

---
title: Step 4: Create the external function for GCP in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-common-ext-function.md
section: SQL General Reference
---

# Step 4: Create the external function for GCP in Snowflake

This topic provides instructions for creating an external function object in Snowflake. This object stores information about the remote
service, such as the parameters that the remote service accepts.

> **Note:**
>
> External functions in Snowflake are database objects, meaning they must be created in a schema in a database. To create an external
> function, you must have the appropriate privileges on the database and schema where you are creating the function.
>
> For more details, see [Access control privileges](../user-guide/security-access-control-privileges.md).

## Previous step

[Step 3: Create the API integration for GCP in Snowflake](external-functions-creating-gcp-common-api-integration.md)

## Create the external function object

This task assumes you are in the Worksheets  page in the Classic Console:

1. Enter a [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) statement. The statement should look similar to the following:

   > ```sqlexample
   > create or replace external function <external_function_name>(<parameters>)
   >     returns variant
   >     api_integration = <api_integration_name>
   >     as '<function_url>';
   > ```
2. Replace `<external_function_name>` with a unique function name (e.g. `echo`). This name must follow the rules for
   [Object identifiers](identifiers.md).

   In addition, record the function name in the “External Function Name” field in your tracking worksheet.
3. Replace `<parameters>` with the names and SQL data types of the parameters for the function, if any. For example:

   > ```sqlexample
   > a integer, b varchar
   > ```

   The parameters must correspond to the parameters expected by the remote service. The parameter names do not need to match, but the
   data types need to be compatible.

   In addition, record the parameter names and data types in the “External Function Name” field in your tracking worksheet.
4. Replace `<api_integration_name>` with the value from the “API Integration Name” field in your tracking worksheet.

1. Replace `<function_URL>` with the values from the `Gateway Base URL` and `Path Suffix` fields, separated by a forward slash (`/`).

   The URL should look similar to:

   > ```sqlexample
   > https://<gateway-base-url>/<path-suffix>
   > ```
2. If you haven’t already, execute the CREATE EXTERNAL FUNCTION command that you entered.

## Test your external function

You should now be able to call your external function to verify that it works correctly.

> **Note:**
>
> If you added a security definition to the configuration file to secure your gateway in [Step 2: Create the proxy service (Google Cloud API Gateway) in the console](external-functions-creating-gcp-ui-proxy-service.md)
> of this tutorial, you will not be able to test your external function until you update the security definitions in the configuration file
> in [Step 5: Create a GCP security policy for the proxy service in the console](external-functions-creating-gcp-ui-security-policy.md) of this tutorial.

For details, see [Calling an external function for GCP](external-functions-creating-gcp-call.md).

## Next step

[Step 5: Create a GCP security policy for the proxy service in the console](external-functions-creating-gcp-ui-security-policy.md)

---
title: Step 4: Link the API integration for AWS to the proxy service in the Management Console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-common-api-integration-proxy-link.md
section: SQL General Reference
---

# Step 4: Link the API integration for AWS to the proxy service in the Management Console

This topic provides instructions for linking the API integration object in Snowflake to your proxy service (i.e.
Amazon API Gateway). You do this by creating a trust relationship between Snowflake and the IAM (identity and access
management) role you created earlier.

The instructions are the same regardless of whether you are using the Management Console or the
CloudFormation template.

## Previous step

[Step 3: Create the API integration for AWS in Snowflake](external-functions-creating-aws-common-api-integration.md)

## Set up the trust relationship(s) between Snowflake and the new IAM role

In the AWS Management Console:

1. Select IAM.
2. Select Roles.
3. In the worksheet, look up the value in the `New IAM Role Name` field, then
   look for the same value (role name) in the AWS Management Console.
4. Click on the Trust relationships tab, then click on the button Edit trust relationship.

   This should open the Policy Document into which you can add authentication information.
5. In the Policy Document, find the Statement.Principal.AWS field and replace the value (not the
   key) with the value in the `API_AWS_IAM_USER_ARN` field of the worksheet.
6. Find the Statement.Condition field. Initially, this should contain only curly braces (“{}”).
7. Paste the following between the curly braces:

   > `"StringEquals": { "sts:ExternalId": "xxx" }`
8. Replace the `xxx` with the value for the `API_AWS_EXTERNAL_ID` field in the worksheet.
9. After you are done editing the Policy Document for the trust relationship, it should look similar to the
   following:

   > ```sqljson
   > {
   >   "Version": "2012-10-17",
   >   "Statement": [
   >     {
   >       "Effect": "Allow",
   >       "Principal": {
   >         "AWS": "arn:aws:iam::1234567898012:user/development/development_user"
   >       },
   >       "Action": "sts:AssumeRole",
   >       "Condition": {"StringEquals": { "sts:ExternalId": "EXTERNAL_FUNCTIONS_SFCRole=3_8Hcmbi9halFOkt+MdilPi7rdgOv=" }}
   >     }
   >   ]
   > }
   > ```
10. Click on Update Trust Policy.

## Next step

[Step 5: Create the external function for AWS in Snowflake](external-functions-creating-aws-common-ext-function.md)

---
title: Step 4: Link the API integration for Azure to the proxy service in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-common-api-integration-proxy-link.md
section: SQL General Reference
---

# Step 4: Link the API integration for Azure to the proxy service in the Portal

When an external function is called, Snowflake sends an HTTP POST command to the proxy service (i.e. Azure API Management service), which
relays the POST to the remote service (i.e. Azure Functions). A service principal in your Azure AD tenant allows Snowflake to
authenticate with Azure AD when calling the API Management service in your tenant.

This topic provides instructions for creating a service principal to link the API integration you created in the previous step with
your Azure API Management service. The instructions are the same regardless of whether you are using the Azure Portal or ARM template.

For more information about service principals, see the Microsoft documentation:
[Applications and service principals](https://docs.microsoft.com/en-us/azure/active-directory/develop/app-objects-and-service-principals#service-principal-object).

## Previous step

[Step 3: Create the API integration for Azure in Snowflake](external-functions-creating-azure-common-api-integration.md)

## Obtain the app name and consent URL for the API integration

Before you create a service principal, you need some information about the API integration:

1. If you haven’t already, log into the Snowflake web interface.
2. Execute the [DESCRIBE INTEGRATION](sql/desc-integration.md) command for the API integration you created in the previous step:

   > ```sqlexample
   > describe api integration <integration_name>;
   > ```
3. From the DESCRIBE results:

   * Record the app name (from the AZURE_MULTI_TENANT_APP_NAME column) in the corresponding field in your tracking worksheet.
   * Record the consent URL (from the AZURE_CONSENT_URL column) in the corresponding field in your tracking worksheet.

     The URL looks similar to the following:

     > ```sqlexample
     > https://login.microsoftonline.com/<tenant_id>/oauth2/authorize?client_id=<snowflake_application_id>&response_type=code
     > ```

## Grant Snowflake access to your Azure tenancy

To grant Snowflake access to your Azure tenancy, you need the AZURE_CONSENT_URL that you recorded earlier:

1. Paste the URL into your browser. When your browser resolves this URL, Azure automatically creates a service principal that represents
   Snowflake in the tenant.

   Note that you only need to create a service principal for Snowflake once per tenancy. After Snowflake has been granted access, access
   does not need to be granted again. In other words, you do not need to grant access again for each new external function you create for
   Azure.

   If Snowflake has already been granted access to your Azure tenancy, you should see the Snowflake web site, which should show something
   similar to SNOWFLAKE THE CLOUD DATA PLATFORM. You can then skip the remaining tasks and proceed to
   [Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md).

   If Snowflake has not yet been granted access, you should see a Microsoft Permissions requested page, and you should continue
   to the next task.
2. Click the Accept button. This allows the Azure service principal created for your Snowflake account to obtain an access token
   on any resource inside your Azure AD tenant.

At this point, you have finished creating a service principal in your tenant to represent Snowflake.

However, to enhance security, you should ensure that only authorized clients can access your Azure Function. Instructions for controlling
access are provided in the final step of the creation process.

## Next step

[Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md)

---
title: Step 5: Create a GCP security policy for the proxy service in the console
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-ui-security-policy.md
section: SQL General Reference
---

# Step 5: Create a GCP security policy for the proxy service in the console

In the previous steps, you created a Google Cloud Function that can be called by anyone who has the correct Google Cloud API Gateway
endpoint. Unless you want the endpoint to be open to the public, you should secure it by creating a security policy on the Google Cloud
API Gateway.

This topic provides instructions for creating a security policy for the API Gateway by adding a customized `securityDefinitions`
section to the configuration file for the API definition.

> **Important:**
>
> Snowflake strongly recommends creating a security policy for the API Gateway. After completing this step, only Snowflake is allowed to
> call your Cloud Function through the API Gateway.

## Previous step

[Step 4: Create the external function for GCP in Snowflake](external-functions-creating-gcp-common-ext-function.md)

## Links to related Google documentation

For more detailed information about using the Google Cloud Console to perform the tasks described in this topic, see the following topics
in the [Google Cloud API Gateway documentation](https://cloud.google.com/api-gateway/docs/quickstart-console):

* [Securing Access by Using an API Key](https://cloud.google.com/api-gateway/docs/quickstart-console#securing_access_by_using_an_api_key)
* [Testing Your API Key](https://cloud.google.com/api-gateway/docs/quickstart-console#testing_your_api_key)

## Update the configuration file

> **Note:**
>
> The name of the configuration file is recorded in the `Configuration File Name` field in your tracking worksheet.

1. Add or update the following `securityDefinitions` section in the configuration file. Add this immediately above the
   `schemes` section of the configuration file and at the same indentation level.

   > ```none
   > securityDefinitions:
   >   <security-def-name>:
   >     authorizationUrl: ""
   >     flow: "implicit"
   >     type: "oauth2"
   >     x-google-issuer: "<gmail service account>"
   >     x-google-jwks_uri: "https://www.googleapis.com/robot/v1/metadata/x509/<gmail service account>"
   > ```

   In the section:

   > * Replace `<security-def-name>` with a unique security definition name (e.g. `snowflakeAccess01`). If you added the temporary security definition in step 2 of the tutorial ([Step 2: Create the proxy service (Google Cloud API Gateway) in the console](external-functions-creating-gcp-ui-proxy-service.md)), this is already done.
   > * In addition, record this name in the `Security Definition Name` field in your tracking worksheet.
   > * Replace `<gmail service account>` with the value from the `API_GCP_SERVICE_ACCOUNT` field in your tracking worksheet.
   >   Add the value in two places in the configuration file:
   >
   >   > + `x-google-issuer` field
   >   > + `x-google-jwks_uri` field (appended to the end of the field)
2. Update the `post:` section of the configuration file to reference the security definition that you created
   above.

   Below the `operationId` field, add:

   > ```none
   > security:
   >   - <security-def-name>: []
   > ```

   This should be indented at the same level as the `operationId` field.

   > * Replace `<security-def-name>` with the value from the `Security Definition Name` field in your tracking worksheet if you have not already done so.
   > * Make sure to include a hyphen and a blank prior to the security definition name, as shown above.
   > * Make sure to include the empty square braces (`[]`) after the colon.

   For example:

   > ```none
   > security:
   >   - snowflakeAccess01: []
   > ```
3. Save the configuration file.

Your updated configuration file should look similar to the following:

```none
swagger: '2.0'
info:
  title: API Gateway config for Snowflake external function
  description: This configuration file connects the API Gateway resource to the remote service (Cloud Function).
  version: 1.0.0
securityDefinitions:
  snowflakeAccess01:
    authorizationUrl: ""
    flow: "implicit"
    type: "oauth2"
    x-google-issuer: "<API_GCP_SERVICE_ACCOUNT>"
    x-google-jwks_uri: "https://www.googleapis.com/robot/v1/metadata/x509/<API_GCP_SERVICE_ACCOUNT>"
schemes:
  - https
produces:
  - application/json
paths:
  /demo-func-resource:
    post:
      summary: Echo the input
      operationId: operationID
      security:
        - snowflakeAccess01: []
      x-google-backend:
        address: <Cloud Function Trigger URL>
        protocol: h2
      responses:
        '200':
          description: <DESCRIPTION>
          schema:
            type: string
```

## Reload the updated file

After updating the configuration file, you must reload the file in the Google Cloud Console:

1. On the Gateways page, click on the name of your gateway.
2. Click on EDIT.
3. Under API Config, click in the box titled Select a Config.
4. Select the option Create new API config.
5. In the box that contains Upload an API Spec, click on the BROWSE button.
6. Select the desired YAML file, which you created previously. Check that it has the extension `.yaml` or `.yml`.
7. Enter the Display Name. Use a new, unique name, not the name that you used previously.
8. If you are asked to Select a Service Account, then select App Engine default service account.

   If you are creating a function to use in production (rather than as a sample), you might choose a different service account.

   The selected service account must have appropriate privileges, including privileges to call the Google Cloud Function.
9. You should now be back on the page for your API gateway. If the Config field shows the old API config file’s display name:

   > 1. Click on EDIT.
   > 2. Under API Config, find the Select a Config box again, and click in the box.
   > 3. Select the new API config.
   > 4. Click the UPDATE button. This takes you back to the list of API gateways.

   You might need to wait a few minutes while the API Gateway is updated. An icon may appear to the left of the API gateway name,
   indicating that the gateway is being refreshed.

To check whether the refresh is still in progress, click on the REFRESH button above the gateway name. After the icon to the left
of the gateway name disappears, the gateway should be fully refreshed, and you can continue to the next step.

## Test your external function

To make sure that your external function works correctly with the new security configuration file, call your external function again.

For details, see [Calling an external function for GCP](external-functions-creating-gcp-call.md).

## Next step

None. You’ve successfully created an external function for GCP.

---
title: Step 5: Create the external function for AWS in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-common-ext-function.md
section: SQL General Reference
---

# Step 5: Create the external function for AWS in Snowflake

This topic provides instructions for creating an external function object in Snowflake. This object stores information about the remote
service, such as the parameters that the remote service accepts. The instructions are the same regardless of whether you are using the
AWS Management Console or the AWS CloudFormation template.

> **Note:**
>
> External functions in Snowflake are database objects, meaning they must be created in a schema in a database. To create an external
> function, you must have the appropriate privileges on the database and schema where you are creating the function.
>
> For more details, see [Access control privileges](../user-guide/security-access-control-privileges.md).

## Previous step

[Step 4: Link the API integration for AWS to the proxy service in the Management Console](external-functions-creating-aws-common-api-integration-proxy-link.md)

## Create the external function

Return to the Snowflake web interface (where you earlier typed the `CREATE API INTEGRATION` command).

1. Type the `CREATE EXTERNAL FUNCTION` command. It should look similar to the following:

   ```sqlexample
   CREATE EXTERNAL FUNCTION my_external_function(n INTEGER, v VARCHAR)
       RETURNS VARIANT
       API_INTEGRATION = <api_integration_name>
       AS '<resource_invocation_url>';
   ```

   Customize the command:

   * The `<api_integration_name>` value should contain the name of the API integration that you created earlier.
   * The `<resource_invocation_url>` value should be the `Resource Invocation URL` you recorded in the worksheet.
     Make sure that this URL includes the API Gateway resource name, not just the stage name.
   * You might also want to customize the function name.

   This example passes two arguments (an INTEGER and a VARCHAR ) because those are the arguments that the
   Lambda function expects. When you create your own Lambda function, you must pass
   appropriate arguments for your Lambda function.
2. Record the name of the external function in the `External Function Name` field in your tracking worksheet.
3. If you have not already executed the CREATE EXTERNAL FUNCTION command that you typed above, execute it now.

## Test your external function

You should now be able to call your external function to verify that it works correctly.

For details, see [Calling an external function for AWS](external-functions-creating-aws-call.md).

## Next step

None. If you were able to call the function, then you’ve successfully created an external function for AWS.

---
title: Step 5: Create the external function for Azure in Snowflake
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-common-ext-function.md
section: SQL General Reference
---

# Step 5: Create the external function for Azure in Snowflake

This topic provides instructions for creating an external function object in Snowflake. This object stores information about the remote
service, such as the parameters that the remote service accepts. The instructions are the same regardless of whether you are using the
Azure Portal or ARM template.

> **Note:**
>
> External functions in Snowflake are database objects, meaning they must be created in a schema in a database. To create an external
> function, you must have the appropriate privileges on the database and schema where you are creating the function.
>
> For more details, see [Access control privileges](../user-guide/security-access-control-privileges.md).

## Previous step

[Step 4: Link the API integration for Azure to the proxy service in the Portal](external-functions-creating-azure-common-api-integration-proxy-link.md)

## Create the external function

This task assumes you are in the Worksheets  page in Snowsight:

1. Enter a [CREATE EXTERNAL FUNCTION](sql/create-external-function.md) statement. The statement should look similar to the following:

   ```sqlexample
   create or replace external function <external_function_name>(<parameters>)
       returns variant
       api_integration = <api_integration_name>
       as '<invocation_url>';
   ```
2. Replace `<external_function_name>` with a unique function name (e.g. `echo`). This name must follow the rules for
   [Object identifiers](identifiers.md).

   In addition, record the function name in the `External Function Name` field in your tracking worksheet.
3. Replace `<parameters>` with the names and SQL data types of the parameters for the function, if any.

   The parameters must correspond to the parameters expected by the remote service. The parameter names do not need to match, but the
   data types need to be compatible.

   If your Azure Function uses the sample JavaScript code provided in Step 1, then the parameters are an INTEGER and a VARCHAR. For
   example:

   > ```sqlexample
   > a integer, b varchar
   > ```

   In addition, record the parameter names and data types in the `External Function Name` field in your tracking worksheet.
4. Replace `<api_integration_name>` with the value from the `API Integration Name` field in your tracking worksheet.
5. Replace `<invocation_url>` with the appropriate URL. This is the URL to which Snowflake sends the HTTP POST command in order to
   call the remote service and has the following format:

   > ```none
   > https://<api_management_service_name>.azure-api.net/<api_url_suffix>/<http_triggered_function_name>
   > ```

   The URL you use depends on whether you are using the Azure Portal or ARM template to create your external function:

   > Azure Portal:
   > :   Use the values from the `API Management service name`, `API Management API URL suffix`, and
   >     `HTTP-Triggered Function name` fields in your tracking worksheet. For example, your URL should look similar to:
   >
   >     > ```none
   >     > https://my-api-management-svc.azure-api.net/my-api-url-suffix/my_http_function
   >     > ```
   >
   > ARM template:
   > :   Use the value from the `Azure Function HTTP Trigger URL` field in your tracking worksheet.
6. If you haven’t already, execute the CREATE EXTERNAL FUNCTION command that you entered.

## Test your external function

You should now be able to call your external function to verify that it works correctly.

For details, see [Calling an external function for Azure](external-functions-creating-azure-call.md).

## Next step

Azure Portal:
:   [Step 6: Create the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-ui-security-policy.md)

ARM template:
:   [Step 6: Update the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-template-security-policy.md)

---
title: Step 6: Create the Azure security policy for the proxy service in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-ui-security-policy.md
section: SQL General Reference
---

# Step 6: Create the Azure security policy for the proxy service in the Portal

The previous steps allow your imported APIs (and Azure Function) to be called by Snowflake, as well as other authenticated clients, such
as applications that are in your Azure AD tenant or have a service principal in your Azure AD tenant.

If you want to allow only Snowflake to call the Azure Function, you must implement token validation. With token validation, when Snowflake
tries to access the API Management service, Snowflake presents a JWT (JSON Web Token) access token obtained from Azure AD. The API Management service can
either validate the JWT or pass it through without validation.

This topic provides instructions for creating a security policy for the API Management service by adding a validate-JWT policy that
defines the rules for validating the token.

> **Important:**
>
> Snowflake strongly recommends creating a security policy for the API Management service. After completing this step, only Snowflake
> is allowed to call your Azure Function through the API Management service.
>
> If you prefer to use role-based validation in your validate-JWT policy, see the Microsoft Service Principal documentation for assigning
> a role to a service principal:
> [New-AzureADServiceAppRoleAssignment](https://docs.microsoft.com/en-us/powershell/module/azuread/new-azureadserviceapproleassignment).

## Previous step

[Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md)

## Create a validate-JWT policy that allows Snowflake to call the Azure function

This section shows how to specify a policy for validating a JSON Web Token (JWT) that authorizes Snowflake to call your Azure Function.
The validation policy (“validate-JWT policy”) validates the following two claims in the JWT:

* The Snowflake service principal application ID.
* The target application App ID (the “audience ID” or just “aud”) of the Azure Function.

For more information about claims in JSON Web Tokens (JWTs) issued by Azure Active Directory, see the Microsoft documentation:
[access tokens](https://docs.microsoft.com/en-us/azure/active-directory/develop/access-tokens#claims-in-access-tokens).

The following steps configure the imported API to use a JSON Web Token:

1. If you haven’t already, log into the Azure Portal.
2. Go to the API Management service screen.
3. Select your API Management service.
4. Find the APIs section in the left-hand column, then click on the APIs option under that.
5. In the column that contains All APIs, click on the name of the API for which you want to add a security
   policy.
6. Look for In-bound Processing:

   > 1. Click on + Add policy.
   > 2. Click on validate-jwt.
   > 3. Fill in the Header name with the value `Authorization`.
   > 4. Add validation for the JWT (JSON Web Token) provided by Snowflake for accessing the Azure Function:
   >
   >    > 1. Look for Required claims and click on + Add claim.
   >    > 2. Fill in the Name field with `aud` (short for “audience”).
   >    > 3. Within the required claim, Look for Values and click on +Add value.
   >    >
   >    >    Add the UUID that you copied to the azure_ad_application_id field in the CREATE API INTEGRATION command. This is recorded in
   >    >    the `Azure Function App AD Application ID` field of your tracking worksheet.
   > 5. Add a separate “claim” for Snowflake:
   >
   >    > 1. Click on + Add claim again:
   >    > 2. Fill in the Name field with the literal string `appid`.
   >    > 3. Within the claim, click on + Add value and add the Snowflake Application ID in the Values field.
   >    >
   >    >    If you do not already have the Snowflake Application ID, you can get it by performing the following steps
   >    >    (the Snowflake Application ID is in the Application ID field):
   >    >
   >    >    > 1. In the worksheet, find the AZURE_MULTI_TENANT_APP_NAME that you filled in earlier.
   >    >    > 2. In the Azure Portal search box, look for Enterprise Applications.
   >    >    >
   >    >    >    This takes you to the Enterprise applications | All applications screen.
   >    >    > 3. In that screen, search for the AZURE_MULTI_TENANT_APP_NAME.
   >    >    >
   >    >    >    The enterprise applications search box does not have a label. Look for a wide field immediately
   >    >    >    above the list of enterprise applications. The box might say something similar to
   >    >    >    First 50 shown, to search all of your applications, enter a display name or the application ID.
   >    >    >
   >    >    >    If you do not find an exact match for the AZURE_MULTI_TENANT_APP_NAME, then search again using only
   >    >    >    the first several characters of this name (if the name contains an underscore, then do not include the
   >    >    >    underscore or any characters after the underscore).
   >    >    > 4. Find the Application ID value for the AZURE_MULTI_TENANT_APP_NAME.
7. Paste the following into Open ID URLs:

   > `https://login.microsoftonline.com/<tenant_id>/.well-known/openid-configuration`

   Replace the `<tenant_id>` with your Azure AD Tenant ID (as described in the
   [Prerequisites](external-functions-creating-azure-planning.md) section for planning an external function).
8. Click on Save.

## Test your external function

To make sure that your external function works correctly with the new security policy, call your external function again.

For details, see [Calling an external function for Azure](external-functions-creating-azure-call.md).

## Restrict the IP addresses that accept Azure functions calls (optional)

In addition to specifying a validate-JWT policy (or using role-based validation), you can implement additional security by restricting IP
addresses. This ensures that only the API Management service’s IP address is allowed to access the Azure Functions app containing your Azure
Function.

For more information about restricting IP addresses, see the Microsoft documentation:
[In-bound IP address restrictions](https://docs.microsoft.com/en-us/azure/azure-functions/functions-networking-options#inbound-ip-restrictions).

## Next step

None. You’ve successfully created an external function for Azure.

---
title: Step 6: Update the Azure security policy for the proxy service in the Portal
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-template-security-policy.md
section: SQL General Reference
---

# Step 6: Update the Azure security policy for the proxy service in the Portal

The ARM template provided by Snowflake creates a security policy to validate a JWT (JSON Web Token) that authorizes Snowflake to
call your Azure Function.

However, the security policy is missing one field, which you need to fill in to ensure the policy provides the necessary level of security.

> **Important:**
>
> Snowflake strongly recommends updating the security policy for the API Management service. After completing this step, only Snowflake
> is allowed to call your Azure Function through the API Management service.

## Previous step

[Step 5: Create the external function for Azure in Snowflake](external-functions-creating-azure-common-ext-function.md)

## Update the security policy in the Azure API Management service

To update the policy:

1. If you haven’t already, log into the Azure Portal.
2. Select API Management Services.
3. Find the API Management service instance that you created. The name of this instance is recorded in the `API Management service name`
   field in your tracking worksheet.
4. Click on the API Management service instance name.
5. Select APIs » APIs.
6. Under All APIs, select ext-func-api.
7. Select POST echo.
8. Click on the validate-JWT button, which is in the Inbound processing box.

   If you do not see this button, please scroll down.
9. Search for “SNOWFLAKE_SERVICE_PRINCIPAL_ID”, and replace it with the Snowflake app ID.

   If you do not already have the Snowflake app ID, you can get it by performing the following steps:

   > 1. In the worksheet, find the AZURE_MULTI_TENANT_APP_NAME that you filled in earlier.
   > 2. In the Azure Portal search box, look for Enterprise Applications.
   >
   >    This takes you to the Enterprise applications | All applications screen.
   > 3. In that screen, search for the AZURE_MULTI_TENANT_APP_NAME.
   >
   >    The enterprise applications search box does not have a label. Look for a wide field immediately
   >    above the list of enterprise applications. The box might say something similar to
   >    First 50 shown, to search all of your applications, enter a display name or the application ID.
   >
   >    If you do not find an exact match for the AZURE_MULTI_TENANT_APP_NAME, then search again using only
   >    the first several characters of this name (if the name contains an underscore, then do not include the
   >    underscore or any characters after the underscore).
   > 4. Find the Application ID value for the AZURE_MULTI_TENANT_APP_NAME.
10. Click Save.

## Test your external function

To make sure that your external function works correctly with the updated security policy, call your external function again.

For details, see [Calling an external function for Azure](external-functions-creating-azure-call.md).

## Next step

None. You’ve successfully created an external function for Azure.

---
title: String & binary data types
source: https://docs.snowflake.com/en/sql-reference/data-types-text.md
section: SQL General Reference
---

# String & binary data types

This topic describes the string/text data types, including binary strings, supported in Snowflake, along with the supported formats for string constants/literals.

## Data types for text strings

Snowflake supports the following data types for text (that is, character) strings.

### VARCHAR

VARCHAR values hold Unicode UTF-8 characters.

> **Note:**
>
> In some systems outside of Snowflake, data types such as CHAR and VARCHAR store ASCII, while data types such as NCHAR and
> NVARCHAR store Unicode.
>
> In Snowflake, VARCHAR and all other string data types store Unicode UTF-8 characters. There is no difference with respect to
> Unicode handling between CHAR and NCHAR data types. Synonyms such as NCHAR are primarily for syntax compatibility when porting
> DDL commands to Snowflake.

When you declare a column of type VARCHAR, you can specify an optional parameter `(N)`, which is the maximum number of
characters to store. For example:

```sqlexample
CREATE TABLE t1 (v VARCHAR(134217728));
```

If no length is specified, the default is 16777216.

Although a VARCHAR value’s maximum length is specified in characters, a VARCHAR value is also limited to a maximum of
134217728 bytes (128 MB). The maximum number of Unicode characters that can be stored in a VARCHAR column is as follows:

Single-byte:
:   134217728

Multi-byte:
:   Between 67108864 (2 bytes per character) and 33554432 (4 bytes per character)

For example, if you declare a column as `VARCHAR(134217728)`, the column can hold a maximum of 67,108,864 2-byte Unicode characters,
even though you specified a maximum length of `134217728`.

When choosing the maximum length for a VARCHAR column, consider the following:

* **Storage:** A column consumes storage for only the amount of actual data stored. For example, a 1-character string in a
  `VARCHAR(134217728)` column only consumes a single character.
* **Performance:** There is no performance difference between using the full-length VARCHAR declaration `VARCHAR(134217728)` and a
  smaller length.

  In any relational database, SELECT statements in which a WHERE clause references VARCHAR columns or string columns aren’t
  as fast as SELECT statements filtered using a date or numeric column condition.
* **Tools for working with data:** Some BI/ETL tools define the maximum size of the VARCHAR data in storage or in memory. If you
  know the maximum size for a column, you can limit the size when you add the column.
* **Collation:** When you specify a [collation](collation.md) for a VARCHAR column, the number of characters
  that are allowed varies, depending on the number of bytes each character takes and the collation specification of the column.

  When comparing values in a collated column,
  [Snowflake follows the Unicode Collation Algorithm (UCA)](collation.md). This algorithm affects the
  maximum number of characters allowed. Currently, around 1.5 million to 8 million characters are allowed in a VARCHAR column
  that is defined with a maximum size and a collation specification.

  As an example, the following table shows how the maximum number of characters can vary for a `VARCHAR(134217728)` column, depending
  on the number of bytes per character and the collation specification used:

  | Number of bytes per character | Collation specification | Maximum number of characters allowed (approximate) |
  | --- | --- | --- |
  | 1 byte | `en-ci` or `en-ci-pi-ai` | Around 56 million characters |
  | 1 byte | `en` | Around 32 million characters |
  | 2 byte | `en-ci-pi-ai` | Around 64 million characters |
  | 2 byte | `en-ci` or `en-ci-pi` | Around 21.6 million characters |
  | 2 byte | `en` | Around 12 million characters |

### CHAR, CHARACTER, NCHAR

Synonymous with VARCHAR, except that if the length is not specified, `CHAR(1)` is the default.

> **Note:**
>
> Snowflake currently deviates from common CHAR semantics in that strings shorter than the maximum length are not space-padded at the end.

### STRING, TEXT, VARCHAR2, NVARCHAR, NVARCHAR2, CHAR VARYING, NCHAR VARYING

Synonymous with VARCHAR.

### String examples in table columns

```sqlexample
CREATE OR REPLACE TABLE test_text(
  vm VARCHAR(134217728),
  vd VARCHAR,
  v50 VARCHAR(50),
  cm CHAR(134217728),
  cd CHAR,
  c10 CHAR(10),
  sm STRING(134217728),
  sd STRING,
  s20 STRING(20),
  tm TEXT(134217728),
  td TEXT,
  t30 TEXT(30));

DESC TABLE test_text;
```

```output
+------+--------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type               | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+--------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| VM   | VARCHAR(134217728) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| VD   | VARCHAR(16777216)  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| V50  | VARCHAR(50)        | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| CM   | VARCHAR(134217728) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| CD   | VARCHAR(1)         | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| C10  | VARCHAR(10)        | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| SM   | VARCHAR(134217728) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| SD   | VARCHAR(16777216)  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| S20  | VARCHAR(20)        | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| TM   | VARCHAR(134217728) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| TD   | VARCHAR(16777216)  | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| T30  | VARCHAR(30)        | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+--------------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

## Data types for binary strings

Snowflake supports the following data types for binary strings.

### BINARY

The maximum length is 64 MB (67,108,864 bytes). Unlike VARCHAR, the BINARY data type has no notion of Unicode characters,
so the length is always measured in terms of bytes.

BINARY values are limited to 64 MB so that they fit within 128 MB when converted to hexadecimal strings, for example using
`TO_CHAR(<binary_expression>, 'HEX')`.

When you declare a column of type BINARY, you can specify an optional parameter `(N)`, which is the maximum number of
bytes to store. For example:

```sqlexample
CREATE TABLE b1 (b BINARY(33554432));
```

If no length is specified, the default is 8388608.

### VARBINARY

VARBINARY is synonymous with BINARY.

### Internal representation

The BINARY data type holds a sequence of 8-bit bytes.

When Snowflake displays BINARY data values, Snowflake often represents each
byte as two hexadecimal characters. For example, the word `HELP` might be
displayed as `48454C50`, where `48` is the hexadecimal equivalent of
the ASCII (Unicode) letter `H`, `45` is the hexadecimal representation of
the letter `E`, and so on.

For more information about entering and displaying BINARY data, see
[Binary input and output](binary-input-output.md).

### Binary examples in table columns

```sqlexample
CREATE OR REPLACE TABLE test_binary(
  bd BINARY,
  b100 BINARY(100),
  vbd VARBINARY);

DESC TABLE test_binary;
```

```output
+------+-----------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
| name | type            | kind   | null? | default | primary key | unique key | check | expression | comment | policy name | privacy domain |
|------+-----------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------|
| BD   | BINARY(8388608) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| B100 | BINARY(100)     | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
| VBD  | BINARY(8388608) | COLUMN | Y     | NULL    | N           | N          | NULL  | NULL       | NULL    | NULL        | NULL           |
+------+-----------------+--------+-------+---------+-------------+------------+-------+------------+---------+-------------+----------------+
```

## String constants

*Constants* (also known as *literals*) refer to fixed data values. String constants in Snowflake must always be enclosed between
delimiter characters. Snowflake supports using either of the following to delimit string constants:

* Single quotes
* Pairs of dollar signs

### Single-quoted string constants

A string constant can be enclosed between single quote delimiters (for example, `'This is a string'`). To include
a single quote character within a string constant, type two adjacent single quotes (for example, `''`).

For example:

```sqlexample
SELECT 'Today''s sales projections', '-''''-';
```

```output
+------------------------------+----------+
| 'TODAY''S SALES PROJECTIONS' | '-''''-' |
|------------------------------+----------|
| Today's sales projections    | -''-     |
+------------------------------+----------+
```

> **Note:**
>
> Two single quotes is not the same as the double quote character (`"`), which is used (as needed) for delimiting object identifiers. For more information, see
> [Identifier requirements](identifiers-syntax.md).

#### Escape sequences in single-quoted string constants

To include a single quote or other special characters (for example, newlines) in a single-quoted string constant, you must escape these
characters by using *backslash escape sequences*. A backslash escape sequence is a sequence of characters that begins with a
backslash (`\`).

> **Note:**
>
> If the string contains many single quotes, backslashes, or other special characters, you can use a
> dollar-quoted string constant instead to avoid escaping these characters.

You can also use escape sequences to insert ASCII characters by specifying their
[code points](https://en.wikipedia.org/wiki/Code_point) (the numeric values that correspond to those characters) in octal or
hexadecimal. For example, in ASCII, the code point for the space character is 32, which is 20 in hexadecimal. To specify a space,
you can use the hexadecimal escape sequence `\x20`.

You can also use escape sequences to insert Unicode characters, for example `\u26c4`.

The following table lists the supported escape sequences in four categories: simple, octal, hexadecimal, and Unicode:

> | Escape sequence | Character represented |
> | --- | --- |
> | **Simple escape sequences** |  |
> | `\'` | A single quote (`'`) character. |
> | `\"` | A double quote (`"`) character. |
> | `\\` | A backslash (`\`) character. |
> | `\b` | A backspace character. |
> | `\f` | A formfeed character. |
> | `\n` | A newline (linefeed) character. |
> | `\r` | A carriage return character. |
> | `\t` | A tab character. |
> | `\0` | An ASCII NUL character. |
> | **Octal escape sequences** |  |
> | `\ooo` | ASCII character in octal notation (that is, where each `o` represents an octal digit). |
> | **Hexadecimal escape sequences** |  |
> | `\xhh` | ASCII character in hexadecimal notation (that is, where each `h` represents a hexadecimal digit). |
> | **Unicode escape sequences** |  |
> | `\uhhhh` | Unicode character in hexadecimal notation (that is, where each `h` represents a hexadecimal digit). The number of hexadecimal digits must be exactly four. |

As shown in the table above, if a string constant must include a backslash character (for example, `C:\` in a Windows path or `\d` in
a [regular expression](functions-regexp.md)), you must escape the backslash with a second backslash. For
example, to include `\d` in a regular expression in a string constant, use `\\d`.

If a backslash is used in sequences other than the ones listed above, the backslash is ignored. For example, the
sequence of characters `'\z'` is interpreted as `'z'`.

The following example demonstrates how to use backslash escape sequences. This includes examples of specifying:

* A tab character.
* A newline.
* A backslash.
* The octal and hexadecimal escape sequences for an exclamation mark (code point 33, which is `\041` in octal and `\x21` in
  hexadecimal).
* The Unicode escape sequence for a small image of a snowman.
* Something that is not a valid escape sequence.

```sqlexample
SELECT $1, $2 FROM
  VALUES
    ('Tab','Hello\tWorld'),
    ('Newline','Hello\nWorld'),
    ('Backslash','C:\\user'),
    ('Octal','-\041-'),
    ('Hexadecimal','-\x21-'),
    ('Unicode','-\u26c4-'),
    ('Not an escape sequence', '\z');
```

```output
+------------------------+---------------+
| $1                     | $2            |
|------------------------+---------------|
| Tab                    | Hello   World |
| Newline                | Hello         |
|                        | World         |
| Backslash              | C:\user       |
| Octal                  | -!-           |
| Hexadecimal            | -!-           |
| Unicode                | -⛄-          |
| Not an escape sequence | z             |
+------------------------+---------------+
```

### Dollar-quoted string constants

In some cases, you might need to specify a string constant that contains:

* Single quote characters.
* Backslash characters (for example, in a [regular expression](functions-regexp.md)).
* Newline characters (for example, in the body of a stored procedure or function that you specify in
  [CREATE PROCEDURE](sql/create-procedure.md) or [CREATE FUNCTION](sql/create-function.md)).

In these cases, you can avoid escaping these characters by using
a pair of dollar signs (`$$`) rather than a single quote (`'`) to delimit the beginning and ending of the string.

In a dollar-quoted string constant, you can include quotes, backslashes, newlines and any other special character (except for
double-dollar signs) without escaping those characters. The content of a dollar-quoted string constant is always interpreted
literally.

The following examples are equivalent ways of specifying string constants:

| Example using single quote delimiters | Example using double dollar sign delimiters |
| --- | --- |
| ```none 'string with a \' character' ``` | ```none $$string with a ' character$$ ``` |
| ```none 'regular expression with \\ characters: \\d{2}-\\d{3}-\\d{4}' ``` | ```none $$regular expression with \ characters: \d{2}-\d{3}-\d{4}$$ ``` |
| ```none 'string with a newline\ncharacter' ``` | ```none $$string with a newline character$$ ``` |

The following example uses a dollar-quoted string constant that contains newlines and several
escape sequences:

```sqlexample
SELECT $1, $2 FROM VALUES (
  'row1',
  $$a
                                  ' \ \t
                                  \x21 z $ $$);
```

```output
+------+---------------------------------------------+
| $1   | $2                                          |
|------+---------------------------------------------|
| row1 | a                                           |
|      |                                   ' \ \t    |
|      |                                   \x21 z $  |
+------+---------------------------------------------+
```

In this example, the escape sequences are interpreted as their individual characters
(for example, a backslash followed by a `t`), rather than as escape sequences.

---
title: String & binary functions
source: https://docs.snowflake.com/en/sql-reference/functions-string.md
section: SQL General Reference
---

# String & binary functions

This family of functions perform operations on a string input value, or binary input value (for certain functions), and return a string or numeric value.

The functions are grouped by type of operation performed.

| Function Name | Binary Input Supported | Collation Supported | Notes |
| --- | --- | --- | --- |
| **General Manipulation** |  |  |  |
| [ASCII](functions/ascii.md) |  |  |  |
| [BIT_LENGTH](functions/bit_length.md) | ✔ |  |  |
| [CHR , CHAR](functions/chr.md) |  |  |  |
| [CONCAT , ||](functions/concat.md) | ✔ | ✔ |  |
| [CONCAT_WS](functions/concat_ws.md) | ✔ | ✔ |  |
| [INSERT](functions/insert.md) | ✔ |  |  |
| [LENGTH, LEN](functions/length.md) | ✔ |  |  |
| [LPAD](functions/lpad.md) | ✔ |  |  |
| [LTRIM](functions/ltrim.md) |  |  |  |
| [OCTET_LENGTH](functions/octet_length.md) | ✔ |  |  |
| [PARSE_IP](functions/parse_ip.md) |  |  |  |
| [PARSE_URL](functions/parse_url.md) |  |  |  |
| [REPEAT](functions/repeat.md) |  |  |  |
| [REVERSE](functions/reverse.md) | ✔ |  |  |
| [RPAD](functions/rpad.md) | ✔ |  |  |
| [RTRIM](functions/rtrim.md) |  |  |  |
| [RTRIMMED_LENGTH](functions/rtrimmed_length.md) |  |  |  |
| [SOUNDEX](functions/soundex.md) |  |  |  |
| [SOUNDEX_P123](functions/soundex_p123.md) |  |  |  |
| [SPACE](functions/space.md) |  |  |  |
| [SPLIT](functions/split.md) |  | ✔ | Provides partial support for collation. For details, see the documentation of the function. |
| [SPLIT_PART](functions/split_part.md) |  |  |  |
| [SPLIT_TO_TABLE](functions/split_to_table.md) |  |  |  |
| [STRTOK](functions/strtok.md) |  |  |  |
| [STRTOK_TO_ARRAY](functions/strtok_to_array.md) |  |  |  |
| [STRTOK_SPLIT_TO_TABLE](functions/strtok_split_to_table.md) |  |  |  |
| [TRANSLATE](functions/translate.md) |  |  |  |
| [TRIM](functions/trim.md) |  |  |  |
| [UNICODE](functions/unicode.md) |  |  |  |
| [UUID_STRING](functions/uuid_string.md) |  |  |  |
| **Full-Text Search** |  |  |  |
| [SEARCH](functions/search.md) |  |  |  |
| [SEARCH_IP](functions/search_ip.md) |  |  |  |
| **Case Conversion** |  |  |  |
| [INITCAP](functions/initcap.md) |  |  |  |
| [LOWER](functions/lower.md) |  |  |  |
| [UPPER](functions/upper.md) |  |  |  |
| **Regular Expression Matching** |  |  |  |
| [[ NOT ] REGEXP](functions/regexp.md) |  |  | Alias for RLIKE. |
| [REGEXP_COUNT](functions/regexp_count.md) |  |  |  |
| [REGEXP_EXTRACT_ALL](functions/regexp_substr_all.md) |  |  | Alias for REGEXP_SUBSTR_ALL. |
| [REGEXP_INSTR](functions/regexp_instr.md) |  |  |  |
| [REGEXP_LIKE](functions/regexp_like.md) |  |  | Alias for RLIKE. |
| [REGEXP_REPLACE](functions/regexp_replace.md) |  |  |  |
| [REGEXP_SUBSTR](functions/regexp_substr.md) |  |  |  |
| [REGEXP_SUBSTR_ALL](functions/regexp_substr_all.md) |  |  |  |
| [[ NOT ] RLIKE](functions/rlike.md) |  |  |  |
| **Other Matching/Comparison** |  |  |  |
| [CHARINDEX](functions/charindex.md) | ✔ | ✔ | Alias for POSITION. Provides partial support for collation. For details, see the documentation of the POSITION function. |
| [CONTAINS](functions/contains.md) | ✔ | ✔ | Provides partial support for collation. For details, see the documentation of the function. |
| [EDITDISTANCE](functions/editdistance.md) |  |  |  |
| [ENDSWITH](functions/endswith.md) | ✔ | ✔ | Provides partial support for collation. For details, see the documentation of the function. |
| [[ NOT ] ILIKE](functions/ilike.md) |  |  | Case-insensitive alternative for LIKE. |
| [ILIKE ANY](functions/ilike_any.md) |  |  | Case-insensitive alternative for LIKE ANY. |
| [JAROWINKLER_SIMILARITY](functions/jarowinkler_similarity.md) |  |  |  |
| [LEFT](functions/left.md) | ✔ | ✔ |  |
| [[ NOT ] LIKE](functions/like.md) |  |  |  |
| [LIKE ALL](functions/like_all.md) |  |  |  |
| [LIKE ANY](functions/like_any.md) |  |  |  |
| [POSITION](functions/position.md) | ✔ | ✔ | Provides partial support for collation. For details, see the documentation of the function. |
| [REPLACE](functions/replace.md) |  |  |  |
| [RIGHT](functions/right.md) | ✔ | ✔ |  |
| [STARTSWITH](functions/startswith.md) | ✔ | ✔ | Provides partial support for collation. For details, see the documentation of the function. |
| [SUBSTR , SUBSTRING](functions/substr.md) | ✔ | ✔ |  |
| **Compression/Decompression** |  |  |  |
| [COMPRESS](functions/compress.md) | ✔ |  |  |
| [DECOMPRESS_BINARY](functions/decompress_binary.md) | ✔ |  |  |
| [DECOMPRESS_STRING](functions/decompress_string.md) | ✔ |  |  |
| **Encoding/Decoding** |  |  |  |
| [BASE64_DECODE_BINARY](functions/base64_decode_binary.md) |  |  |  |
| [BASE64_DECODE_STRING](functions/base64_decode_string.md) |  |  |  |
| [BASE64_ENCODE](functions/base64_encode.md) | ✔ |  |  |
| [HEX_DECODE_BINARY](functions/hex_decode_binary.md) |  |  |  |
| [HEX_DECODE_STRING](functions/hex_decode_string.md) |  |  |  |
| [HEX_ENCODE](functions/hex_encode.md) | ✔ |  |  |
| [TRY_BASE64_DECODE_BINARY](functions/try_base64_decode_binary.md) |  |  | Error-handling version of BASE64_DECODE_BINARY. |
| [TRY_BASE64_DECODE_STRING](functions/try_base64_decode_string.md) |  |  | Error-handling version of BASE64_DECODE_STRING. |
| [TRY_HEX_DECODE_BINARY](functions/try_hex_decode_binary.md) |  |  | Error-handling version of HEX_DECODE_BINARY. |
| [TRY_HEX_DECODE_STRING](functions/try_hex_decode_string.md) |  |  | Error-handling version of HEX_DECODE_STRING. |
| **Cryptographic/Checksum** |  |  |  |
| [MD5 , MD5_HEX](functions/md5.md) |  |  | Intended primarily for checksum operations. Not recommended for cryptography. |
| [MD5_BINARY](functions/md5_binary.md) |  |  | Intended primarily for checksum operations. Not recommended for cryptography. |
| [MD5_NUMBER_LOWER64](functions/md5_number_lower64.md) |  |  | Intended primarily for checksum operations. Not recommended for cryptography. |
| [MD5_NUMBER_UPPER64](functions/md5_number_upper64.md) |  |  | Intended primarily for checksum operations. Not recommended for cryptography. |
| [SHA1 , SHA1_HEX](functions/sha1.md) |  |  |  |
| [SHA1_BINARY](functions/sha1_binary.md) |  |  |  |
| [SHA2 , SHA2_HEX](functions/sha2.md) |  |  |  |
| [SHA2_BINARY](functions/sha2_binary.md) |  |  |  |
| **Hash (Non-cryptographic)** |  |  |  |
| [HASH](functions/hash.md) | ✔ |  | Allows data types other than string and binary. Not intended for cryptography. |
| [HASH_AGG](functions/hash_agg.md) | ✔ |  | Allows data types other than string and binary. Not intended for cryptography. |
| **Collation** |  |  |  |
| [COLLATE](functions/collate.md) |  |  |  |
| [COLLATION](functions/collation.md) |  |  |  |
| **AI Functions** |  |  |  |
| [AGENT_RUN (SNOWFLAKE.CORTEX)](functions/agent_run-snowflake-cortex.md) |  |  |  |
| [AI_AGG](functions/ai_agg.md) |  |  |  |
| [AI_CLASSIFY](functions/ai_classify.md) |  |  |  |
| [AI_COMPLETE](functions/ai_complete.md) |  |  |  |
| [AI_COUNT_TOKENS](functions/ai_count_tokens.md) |  |  |  |
| [AI_EMBED](functions/ai_embed.md) |  |  |  |
| [AI_FILTER](functions/ai_filter.md) |  |  |  |
| [AI_REDACT](functions/ai_redact.md) |  |  |  |
| [AI_SENTIMENT](functions/ai_sentiment.md) |  |  |  |
| [AI_SIMILARITY](functions/ai_similarity.md) |  |  |  |
| [AI_SUMMARIZE_AGG](functions/ai_summarize_agg.md) |  |  |  |
| [AI_TRANSLATE](functions/ai_translate.md) |  |  |  |
| [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](functions/classify_text-snowflake-cortex.md) |  |  |  |
| [COMPLETE (SNOWFLAKE.CORTEX)](functions/complete-snowflake-cortex.md) |  |  |  |
| [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](functions/data_agent_run-snowflake-cortex.md) |  |  |  |
| [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](functions/embed_text-snowflake-cortex.md) |  |  |  |
| [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](functions/embed_text_1024-snowflake-cortex.md) |  |  |  |
| [ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)](functions/entity_sentiment-snowflake-cortex.md) |  |  |  |
| [EXTRACT_ANSWER (SNOWFLAKE.CORTEX)](functions/extract_answer-snowflake-cortex.md) |  |  |  |
| [FINETUNE (SNOWFLAKE.CORTEX)](functions/finetune-snowflake-cortex.md) |  |  |  |
| [PARSE_DOCUMENT (SNOWFLAKE.CORTEX)](functions/parse_document-snowflake-cortex.md) |  |  |  |
| [SPLIT_TEXT_MARKDOWN_HEADER (SNOWFLAKE.CORTEX)](functions/split_text_markdown_header-snowflake-cortex.md) |  |  |  |
| [SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)](functions/split_text_recursive_character-snowflake-cortex.md) |  |  |  |
| [SENTIMENT (SNOWFLAKE.CORTEX)](functions/sentiment-snowflake-cortex.md) |  |  |  |
| [SUMMARIZE (SNOWFLAKE.CORTEX)](functions/summarize-snowflake-cortex.md) |  |  |  |
| [TRANSLATE (SNOWFLAKE.CORTEX)](functions/translate-snowflake-cortex.md) |  |  |  |
| [COUNT_TOKENS (SNOWFLAKE.CORTEX)](functions/count_tokens-snowflake-cortex.md) |  |  |  |
| [TRY_COMPLETE (SNOWFLAKE.CORTEX)](functions/try_complete-snowflake-cortex.md) |  |  |  |
| [SEARCH_PREVIEW (SNOWFLAKE.CORTEX)](functions/search_preview-snowflake-cortex.md) |  |  |  |

---
title: String functions (regular expressions)
source: https://docs.snowflake.com/en/sql-reference/functions-regexp.md
section: SQL General Reference
---

# String functions (regular expressions)

These string functions perform operations that match a regular expression (often referred to as a “regex”).

## List of regex functions

| Function | Notes |
| --- | --- |
| [[ NOT ] REGEXP](functions/regexp.md) | Alias for RLIKE. |
| [REGEXP_COUNT](functions/regexp_count.md) |  |
| [REGEXP_EXTRACT_ALL](functions/regexp_substr_all.md) | Alias for REGEXP_SUBSTR_ALL. |
| [REGEXP_INSTR](functions/regexp_instr.md) |  |
| [REGEXP_LIKE](functions/regexp_like.md) | Alias for RLIKE. |
| [REGEXP_REPLACE](functions/regexp_replace.md) |  |
| [REGEXP_SUBSTR](functions/regexp_substr.md) |  |
| [REGEXP_SUBSTR_ALL](functions/regexp_substr_all.md) |  |
| [[ NOT ] RLIKE](functions/rlike.md) |  |

## General usage notes

In these notes, “subject” refers to the string to operate on and “pattern” refers to the regular expression:

* The subject is typically a variable column, while the pattern is typically a constant, but this is not required; every argument
  to a regular expression function can be either a constant or variable.
* Patterns support the most of the POSIX ERE (Extended Regular Expression) syntax. For details, see the
  [POSIX basic and extended](http://en.wikipedia.org/wiki/Regular_expression#POSIX_basic_and_extended) section (in Wikipedia).

  One exception is that the regular expression functions don’t support non-greedy quantifiers, for example `*?`, `??`,
  and `+?`.
* Patterns also support the following Perl backslash-sequences:

  + `\d`: decimal digit (0-9).
  + `\D`: not a decimal digit.
  + `\s`: whitespace character.
  + `\S`: not a whitespace character.
  + `\w`: “word” character (a-z, A-Z, underscore (“_”), or decimal digit).
  + `\W`: not a word character.
  + `\b`: word boundary.
  + `\B`: not a word boundary.

  For details, see the [Character classes](http://en.wikipedia.org/wiki/Regular_expression#Character_classes) section (in Wikipedia) or the
  [Backslash sequences](http://perldoc.perl.org/perlrecharclass.html#Backslash-sequences) section (in the Perl documentation).

  > **Note:**
  >
  > In [single-quoted string constants](data-types-text.md), you must escape the backslash character in
  > the backslash-sequence. For example, to specify `\d`, use `\\d`. For details, see
  > Specifying regular expressions in single-quoted string constants (in this topic).
  >
  > You do not need to escape backslashes if you are delimiting the string with
  > [pairs of dollar signs ($$)](data-types-text.md) (rather than single quotes).
* By default, the POSIX wildcard character `.` (in the pattern) does not include newline characters `\n` (in the subject) as matches.

  To also match newline characters, either replace `.` with `(.|\n)` in the `pattern` argument, or use the `s` parameter in the `parameters` argument (described
  below).
* All the regular expression functions support Unicode. A single Unicode character always counts as one character (that is, the POSIX meta-character `.` matches exactly one Unicode character),
  regardless of the byte-length of the corresponding binary representation of that character. Also, for functions that take or return subject offsets, a single Unicode character counts as 1.

## Specifying the parameters for the regular expression

Most regular expression functions support an optional `parameters` argument. The `parameters` argument is a VARCHAR string that specifies the matching
behavior of the regular expression function. The following parameters are supported:

| Parameter | Description |
| --- | --- |
| `c` | Enables case-sensitive matching. |
| `i` | Enables case-insensitive matching. |
| `m` | Enables multi-line mode (that is, meta-characters `^` and `$` mark the beginning and end of any line of the subject). By default, multi-line mode is disabled (that is, `^` and `$` mark the beginning and end of the entire subject). |
| `e` | Extracts submatches; applies only to [REGEXP_INSTR](functions/regexp_instr.md), [REGEXP_SUBSTR](functions/regexp_substr.md), [REGEXP_SUBSTR_ALL](functions/regexp_substr_all.md), and the aliases for these functions. |
| `s` | Enables the POSIX wildcard character `.` to match `\n`. By default, wildcard character matching is disabled. |

The default string is `c`, which specifies:

* Case-sensitive matching.
* Single-line mode.
* No submatch extraction, except for [REGEXP_REPLACE](functions/regexp_replace.md), which always uses submatch extraction.
* POSIX wildcard character `.` does not match `\n` newline characters.

When specifying multiple parameters, enter the string with no spaces or delimiters.
For example, `ims` specifies case-insensitive matching in multi-line mode with POSIX wildcard matching.

If both `c` and `i` are included in the `parameters` string, the one that occurs last in the string dictates whether the function performs case-sensitive or case-insensitive
matching. For example, `ci` specifies case-insensitive matching because the `i` occurs last in the string.

The following example shows how the results can be different for case-sensitive and case-insensitive matching.
The [REGEXP_COUNT](functions/regexp_count.md) function returns no matches for `snow` and `SNOW` for case-sensitive matching (`c` parameter,
the default) and one match for case-insensitive matching (`i` parameter):

```sqlexample
SELECT REGEXP_COUNT('snow', 'SNOW', 1, 'c') AS case_sensitive_matching,
       REGEXP_COUNT('snow', 'SNOW', 1, 'i') AS case_insensitive_matching;
```

```output
+-------------------------+---------------------------+
| CASE_SENSITIVE_MATCHING | CASE_INSENSITIVE_MATCHING |
|-------------------------+---------------------------|
|                       0 |                         1 |
+-------------------------+---------------------------+
```

Use the [REGEXP_SUBSTR](functions/regexp_substr.md) function with the `e` parameter to look for the word
`Release`, followed by one or more non-word characters, followed by one or more digits, and then return
the substring that matches the digits:

```sqlexample
SELECT REGEXP_SUBSTR('Release 24', 'Release\\W+(\\d+)', 1, 1, 'e') AS release_number;
```

```output
+----------------+
| RELEASE_NUMBER |
|----------------|
| 24             |
+----------------+
```

For more examples that use parameters, see [REGEXP_INSTR](functions/regexp_instr.md), [REGEXP_LIKE](functions/regexp_like.md),
[REGEXP_SUBSTR](functions/regexp_substr.md), [REGEXP_SUBSTR_ALL](functions/regexp_substr_all.md), and [[ NOT ] RLIKE](functions/rlike.md).

## Matching characters that are metacharacters

In regular expressions, some characters are treated as metacharacters that have a specific meaning. For example:

* `.` is a
  [metacharacter that matches any single character](https://en.wikipedia.org/wiki/Regular_expression#POSIX_basic_and_extended).
* `*` is a [quantifier](https://en.wikipedia.org/wiki/Regular_expression#Basic_concepts) that matches zero or more instances
  of the preceding element. For example, `BA*` matches `B`, `BA`, `BAA`, and so on.
* `?` is a quantifier that matches zero or one instance of the preceding element.

To match the actual character (for example, an actual period, asterisk, or question mark), you must escape the metacharacter with a
backslash (for example, `\.`, `\*`, `\?`, and so on).

> **Note:**
>
> If you are using the regular expression in a [single-quoted string constant](data-types-text.md),
> you must escape the backslash with a second backslash (for example, `\\.`, `\\*`, `\\?`, and so on). For details, see
> Specifying regular expressions in single-quoted string constants

For example, suppose that you need to find an open parenthesis (`(`) in a string. One way to specify this is to use a backslash
to escape the character in the pattern (for example, `\(`).

If you are specifying the pattern as a [single-quoted string constant](data-types-text.md), you must also
[escape that backslash with a second backslash](data-types-text.md).

The following pattern matches a sequence of alphanumeric characters that appear inside parentheses (for example, `(NY)`):

```sqlexample
SELECT REGEXP_SUBSTR('Customers - (NY)','\\([[:alnum:]]+\\)') AS location;
```

```output
+----------+
| LOCATION |
|----------|
| (NY)     |
+----------+
```

For additional examples, see Example of using metacharacters in a single-quoted string constant.

Note that you do not need to escape the backslash character if you are using a
[dollar-quoted string constant](data-types-text.md):

```sqlexample
SELECT REGEXP_SUBSTR('Customers - (NY)',$$\([[:alnum:]]+\)$$) AS location;
```

```output
+----------+
| LOCATION |
|----------|
| (NY)     |
+----------+
```

## Using backreferences

Snowflake does not support backreferences in regular expression patterns (known as “squares” in formal language theory); however, backreferences are supported in the replacement string of the
[REGEXP_REPLACE](functions/regexp_replace.md) function.

## Specifying an empty pattern

In most regexp functions, an empty pattern (that is, `''`) matches nothing, not even an empty subject.

The exceptions are [REGEXP_LIKE](functions/regexp_like.md) and its aliases [[ NOT ] REGEXP](functions/regexp.md) and [[ NOT ] RLIKE](functions/rlike.md),
in which the empty pattern matches the empty subject because the pattern is implicitly anchored at both ends
(that is, `''` automatically becomes `'^$'`).

An empty group (that is, subexpression `()`), matches the space in between characters, including the beginning and end of the subject.

## Specifying regular expressions in dollar-quoted string constants

If you are using a string constant to specify the regular expression for a function, you can use a
[dollar-quoted string constant](data-types-text.md) to avoid
escaping the backslash characters in the regular expression. (If you are using
[single-quoted string constants](data-types-text.md), you need to escape the backslashes.)

The content of a dollar-quoted string constant is always interpreted literally.

For example, when escaping a metacharacter, you only need to use a single backslash:

```sqlexample
SELECT w2
  FROM wildcards
  WHERE REGEXP_LIKE(w2, $$\?$$);
```

When using a backreference, you only need to use a single backslash:

```sqlexample
SELECT w2, REGEXP_REPLACE(w2, '(.old)', $$very \1$$)
  FROM wildcards
  ORDER BY w2;
```

## Specifying regular expressions in single-quoted string constants

If you are using a regular expression in a [single-quoted string constant](data-types-text.md), you must
escape any backslashes in backslash-sequences with a second backslash.

> **Note:**
>
> To avoid escaping backslashes in a regular expression, you can use a
> dollar-quoted string constant, rather than a single-quoted string constant.

For example:

* If you are escaping a metacharacter with a backslash, you must escape the backslash with
  a second backslash. See Example of using metacharacters in a single-quoted string constant.
* If you are using a backslash-sequence, you must escape the backslash in the sequence.
* If you are using a backreference, you must escape the backslash in the backreference.
  See Example of using backreferences in a single-quoted string constant.

### Example of using metacharacters in a single-quoted string constant

This example uses the backslash as part of an escape sequence in a regular expression that searches for a question mark (`?`).

Create a table and insert a row that contains a single backslash in one column and a question mark in another column:

```sqlexample
CREATE OR REPLACE TABLE wildcards (w VARCHAR, w2 VARCHAR);
INSERT INTO wildcards (w, w2) VALUES ('\\', '?');
```

The following query searches for the question mark literal. The search uses a regular expression, and the question mark is a
meta-character in regular expressions, so the search must escape the question mark to treat it as a literal. Because the
backslash appears in a string literal, the backslash itself must also be escaped:

```sqlexample
SELECT w2
  FROM wildcards
  WHERE REGEXP_LIKE(w2, '\\?');
```

```output
+----+
| W2 |
|----|
| ?  |
+----+
```

The following query makes it easier to see that the regular expression is composed of two characters (the backslash escape
character and the question mark):

```sqlexample
SELECT w2
  FROM wildcards
  WHERE REGEXP_LIKE(w2, '\\' || '?');
```

```output
+----+
| W2 |
|----|
| ?  |
+----+
```

In the previous example, the extra backslash was needed only because the escape character was part of a string literal.
It was not needed for the regular expression itself. The following SELECT statement does not need to parse a string literal as
part of the SQL command string, and therefore does not need the extra escape character that the string literal needed:

```sqlexample
SELECT w, w2, w || w2 AS escape_sequence, w2
  FROM wildcards
  WHERE REGEXP_LIKE(w2, w || w2);
```

```output
+---+----+-----------------+----+
| W | W2 | ESCAPE_SEQUENCE | W2 |
|---+----+-----------------+----|
| \ | ?  | \?              | ?  |
+---+----+-----------------+----+
```

### Example of using backreferences in a single-quoted string constant

If you use a backreference (for example, `\1`) in a string literal, you must escape the backslash
that is a part of that backreference. For example, to specify the backreference `\1` in a replacement string literal of
[REGEXP_REPLACE](functions/regexp_replace.md), use `\\1`.

The following example uses the table created earlier. The SELECT uses a backreference to replace each occurrence of the regular
expression `.old` with a copy of the matched string preceded by the word “very”:

```sqlexample
INSERT INTO wildcards (w, w2) VALUES (NULL, 'When I am cold, I am bold.');
```

```sqlexample
SELECT w2, REGEXP_REPLACE(w2, '(.old)', 'very \\1')
  FROM wildcards
  ORDER BY w2;
```

```output
+----------------------------+------------------------------------------+
| W2                         | REGEXP_REPLACE(W2, '(.OLD)', 'VERY \\1') |
|----------------------------+------------------------------------------|
| ?                          | ?                                        |
| When I am cold, I am bold. | When I am very cold, I am very bold.     |
+----------------------------+------------------------------------------+
```

---
title: Structured data types
source: https://docs.snowflake.com/en/sql-reference/data-types-structured.md
section: SQL General Reference
---

# Structured data types

The Snowflake structured types are ARRAY, OBJECT, and MAP. Structured types contain elements or key-value pairs with specific
[Snowflake data types](../sql-reference-data-types.md). The following are examples of structured types:

* An ARRAY of INTEGER elements.
* An OBJECT with VARCHAR and NUMBER key-value pairs.
* A MAP that associates a VARCHAR key with a DOUBLE value.

You can use structured types in the following ways:

* You can define a structured type column in a table.

  A structured type column supports a maximum of 1000 sub-columns.

  In an [Apache Iceberg™ table](../user-guide/tables-iceberg.md), the
  [Apache Iceberg™ data types](../user-guide/tables-iceberg-data-types.md) `list`, `struct`, and `map` correspond
  to the structured ARRAY, structured OBJECT, and MAP types in Snowflake.
* You use structured types when accessing data from a structured type column in a table.
* You can cast a semi-structured [ARRAY](data-types-semistructured.md), [OBJECT](data-types-semistructured.md), or [VARIANT](data-types-semistructured.md)
  value to a corresponding structured type (for example, an ARRAY value to an ARRAY value of INTEGER elements). You can also cast a
  structured type of a semi-structured type.

This topic explains how to use structured types in Snowflake.

> **Note:**
>
> Structured types aren’t supported for dynamic, hybrid, or external tables.

## Specifying a structured type

When defining a structured type column or casting a value to a structured type, use the syntax described in the following
sections:

* Specifying a structured ARRAY type
* Specifying a structured OBJECT type
* Specifying a MAP type

### Specifying a structured ARRAY type

To specify a structured ARRAY type, use the following syntax:

```sqlsyntax
ARRAY( <element_type> [ NOT NULL ] )
```

Where:

* `element_type` is the [Snowflake data type](../sql-reference-data-types.md) of the elements in this ARRAY.

  You can also specify a structured ARRAY, a structured OBJECT, or a MAP as the type of the element.

  > **Note:**
  >
  > In the definition of a standard Snowflake table (non-Iceberg) column, you can’t specify GEOGRAPHY as the type of the ARRAY element.
  >
  > In the definition of an Iceberg table column, you can’t specify VARIANT, semi-structured ARRAY, or semi-structured OBJECT
  > as the type of the ARRAY element.
* NOT NULL specifies that the ARRAY can’t contain any elements that are NULL.

For example, compare the types returned by the [SYSTEM$TYPEOF](functions/system_typeof.md) function in the following statement:

* The first column expression casts a semi-structured ARRAY value to a structured ARRAY value (an ARRAY of NUMBER elements).
* The second column expression specifies a semi-structured ARRAY value.

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    [1, 2, 3]::ARRAY(NUMBER)
  ) AS structured_array,
  SYSTEM$TYPEOF(
    [1, 2, 3]
  ) AS semi_structured_array;
```

```output
+-------------------------------+-----------------------+
| STRUCTURED_ARRAY              | SEMI_STRUCTURED_ARRAY |
|-------------------------------+-----------------------|
| ARRAY(NUMBER(38,0))[LOB]      | ARRAY[LOB]            |
+-------------------------------+-----------------------+
```

### Specifying a structured OBJECT type

To specify a structured OBJECT type, use the following syntax:

```sqlsyntax
OBJECT(
  [
    <key> <value_type> [ NOT NULL ]
    [ , <key> <value_type> [ NOT NULL ] ]
    [ , ... ]
  ]
)
```

Where:

* `key` specifies a key for the OBJECT type.

  + Each `key` in an OBJECT definition must be unique.
  + The order of the keys is part of the OBJECT definition. Comparing two OBJECT values that have the same keys in a different order
    isn’t allowed. (A compile time error occurs.)
  + If you don’t specify any key but specify the parentheses (that is, if you use `OBJECT()`), the resulting type is a structured
    OBJECT that contains no keys. A structured OBJECT with no keys is different from a semi-structured OBJECT.
* `value_type` is the [Snowflake data type](../sql-reference-data-types.md) of the value corresponding to the key.

  You can also specify a structured ARRAY, a structured OBJECT, or a MAP as the type of the value.

  > **Note:**
  >
  > In the definition of a standard Snowflake table (non-Iceberg) column, you can’t specify GEOGRAPHY as the type of the
  > value corresponding to the OBJECT key.
  >
  > In the definition of an Iceberg table column, you can’t specify VARIANT, semi-structured ARRAY, or semi-structured OBJECT
  > as the type of the value corresponding to the OBJECT key.
* NOT NULL specifies that the value corresponding to the key can’t be NULL.

For example, compare the types returned by the [SYSTEM$TYPEOF](functions/system_typeof.md) function in the following statement:

* The first column expression casts a semi-structured OBJECT value to a structured OBJECT value that contains the following keys and values:

  + A key named `str` with a VARCHAR value that is not NULL.
  + A key named `num` with a NUMBER value.
* The second column expression specifies a semi-structured OBJECT value.

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    {
      'str': 'test',
      'num': 1
    }::OBJECT(
      str VARCHAR NOT NULL,
      num NUMBER
    )
  ) AS structured_object,
  SYSTEM$TYPEOF(
    {
      'str': 'test',
      'num': 1
    }
  ) AS semi_structured_object;
```

```output
+-----------------------------------------------------+------------------------+
| STRUCTURED_OBJECT                                   | SEMI_STRUCTURED_OBJECT |
|-----------------------------------------------------+------------------------|
| OBJECT(str VARCHAR NOT NULL, num NUMBER(38,0))[LOB] | OBJECT[LOB]            |
+-----------------------------------------------------+------------------------+
```

### Specifying a MAP type

To specify a MAP type, use the following syntax:

```sqlsyntax
MAP( <key_type> , <value_type> [ NOT NULL ] )
```

Where:

* `key_type` is the [Snowflake data type](../sql-reference-data-types.md) of the key for the map. You must use one of
  the following types for keys:

  + VARCHAR
  + NUMBER with the scale 0

  You can’t use a floating point data type as the type for the key.

  Map keys can’t be NULL.
* `value_type` is the [Snowflake data type](../sql-reference-data-types.md) of the values in the map.

  You can also specify a structured ARRAY, a structured OBJECT, or a MAP as the type of the values.

  > **Note:**
  >
  > In the definition of a standard Snowflake table (non-Iceberg) column, you can’t specify GEOGRAPHY as the type of the
  > value in the MAP.
  >
  > In the definition of an Iceberg table column, you can’t specify VARIANT, semi-structured ARRAY, or semi-structured OBJECT
  > as the type of the value in the MAP.
* NOT NULL specifies that the value corresponding to the key can’t be NULL.

The following example casts a semi-structured OBJECT value to a MAP value and uses the [SYSTEM$TYPEOF](functions/system_typeof.md)
function to print the resulting type of the value. The MAP associates VARCHAR keys with VARCHAR values.

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    {
      'a_key': 'a_val',
      'b_key': 'b_val'
    }::MAP(VARCHAR, VARCHAR)
  ) AS map_example;
```

```output
+----------------------------+
| MAP_EXAMPLE                |
|----------------------------|
| MAP(VARCHAR, VARCHAR)[LOB] |
+----------------------------+
```

## Creating a table with a structured type column

When you use the [CREATE TABLE](sql/create-table.md) command to create a table, you can use the syntax described in
Specifying a structured type to define a column that contains a structured type.

The following examples demonstrate how to specify a structured type column:

* Example of creating a table with a structured ARRAY column
* Example of creating a table with a structured OBJECT column
* Example of creating a table with a MAP column

### Example of creating a table with a structured ARRAY column

The following statement creates a table with a column for a structured ARRAY:

```sqlexample
CREATE TABLE my_table_with_structured_array_column (
  numeric_array ARRAY(NUMBER)
);
```

The following statement inserts a row into the table:

```sqlexample
INSERT INTO my_table_with_structured_array_column SELECT
  [10, 20, 30]::ARRAY(NUMBER);
```

Note the following:

* Because the example uses an [ARRAY constant](data-types-semistructured.md) for the value to insert, the example uses a query
  (SELECT) rather than the VALUES clause.

  The VALUES clause [does not support](sql/insert.md) OBJECT constants, ARRAY constants, and some
  functions like [OBJECT_CONSTRUCT](functions/object_construct.md) and [ARRAY_CONSTRUCT](functions/array_construct.md).
* Because
  an ARRAY constant specifies a semi-structured ARRAY (not a structured ARRAY),
  you must cast the resulting semi-structured ARRAY to a structured ARRAY.

### Example of creating a table with a structured OBJECT column

The following statement creates a table with a column for a structured OBJECT:

```sqlexample
CREATE TABLE customer (
  c_id VARCHAR,
  c_name VARCHAR,
  c_address OBJECT(
    state VARCHAR,
    city VARCHAR,
    street VARCHAR,
    zip_code NUMBER
  )
);
```

The following statement inserts a row into the table:

```sqlexample
INSERT INTO customer SELECT
  '1',
  'customer_name',
  {
    'state': 'CA',
    'city': 'San Mateo',
    'street': '450 Concar Drive',
    'zip_code': 94402
  }::OBJECT(
    state VARCHAR,
    city VARCHAR,
    street VARCHAR,
    zip_code NUMBER
  );
```

Note the following:

* Because the example uses an [OBJECT constant](data-types-semistructured.md) for the value to insert, the example uses a query
  (SELECT) rather than the VALUES clause.

  The VALUES clause [does not support](sql/insert.md) OBJECT constants, ARRAY constants, and some
  functions like [OBJECT_CONSTRUCT](functions/object_construct.md) and [ARRAY_CONSTRUCT](functions/array_construct.md).
* Because
  an OBJECT constant specifies a semi-structured OBJECT (not a structured OBJECT),
  you must cast the resulting semi-structured OBJECT to a structured OBJECT.

### Example of creating a table with a MAP column

The following statement creates a table with a column for a MAP:

```sqlexample
CREATE OR REPLACE TABLE my_table_with_map_column(my_map MAP(VARCHAR, VARCHAR));
```

The following statement inserts a row into the table:

```sqlexample
INSERT INTO my_table_with_map_column SELECT
  {'key123': 'value123'}::MAP(VARCHAR, VARCHAR);
```

Note the following:

* Because the example uses an [OBJECT constant](data-types-semistructured.md) for the value to insert, the example uses a query
  (SELECT) rather than the VALUES clause.

  The VALUES clause [does not support](sql/insert.md) OBJECT constants, ARRAY constants, and some
  functions like [OBJECT_CONSTRUCT](functions/object_construct.md) and [ARRAY_CONSTRUCT](functions/array_construct.md).
* Because
  an OBJECT constant specifies a semi-structured OBJECT (not a MAP),
  you must cast the resulting semi-structured OBJECT to a MAP.

## Adding a structured type column

To add a column containing a structured type, use [ALTER TABLE … ADD COLUMN](sql/alter-table.md) with the
syntax described in Specifying a structured type. For example:

```sqlexample
ALTER TABLE customer ADD COLUMN phone ARRAY(VARCHAR);
```

## Dropping and renaming structured type columns

To drop or rename a structured type column, you can use [ALTER TABLE … DROP COLUMN](sql/alter-table.md) and
[ALTER TABLE … RENAME COLUMN](sql/alter-table.md) (as you would with a column with a semi-structured object).

## Using structured types in semi-structured types

You can’t use a MAP, structured OBJECT, or structured ARRAY value in a VARIANT, semi-structured OBJECT, or semi-structured ARRAY value. An
error occurs in the following situations:

* You use a MAP, structured OBJECT, or structured ARRAY value in an [OBJECT constant](data-types-semistructured.md) or
  [ARRAY constant](data-types-semistructured.md).
* You pass a MAP, structured OBJECT, or structured ARRAY value to an OBJECT or ARRAY
  [constructor function](functions-semistructured.md).

## Converting structured and semi-structured types

The following table summarizes rules for [converting](data-type-conversion.md) structured OBJECT, structured
ARRAY, and MAP values to semi-structured OBJECT, ARRAY, and VARIANT values (and vice versa).

| Source data type | Target data type | [Castable](data-type-conversion.md) | [Coercible](data-type-conversion.md) |
| --- | --- | --- | --- |
| Semi-structured ARRAY | Structured ARRAY | ✔ | ❌ |
| Semi-structured OBJECT | * Structured OBJECT * MAP | ✔ | ❌ |
| Semi-structured VARIANT | * Structured ARRAY * Structured OBJECT * MAP | ✔ | ❌ |
| Structured ARRAY | Semi-structured ARRAY | ✔ | ❌ |
| * Structured OBJECT * MAP | Semi-structured OBJECT | ✔ | ❌ |
| * Structured ARRAY * Structured OBJECT * MAP | Semi-structured VARIANT | ✔ | ❌ |

The following sections explain these rules in more detail.

* Explicitly casting a semi-structured type to a structured type
* Explicitly casting a structured type to a semi-structured type
* Implicit casting a value (coercion)
* Casting from one structured type to another

### Explicitly casting a semi-structured type to a structured type

To explicitly cast a value of a semi-structured type to a value of a structured type, you can
[call the CAST function or use the :: operator](functions/cast.md).

> **Note:**
>
> TRY_CAST isn’t supported for structured types.

You can only cast values of the following semi-structured types to values of the corresponding structured type;
otherwise, a runtime error occurs.

| Semi-structured type | Structured type that you can cast to |
| --- | --- |
| ARRAY | Structured ARRAY |
| OBJECT | MAP or structured OBJECT |
| VARIANT | MAP or structured ARRAY or OBJECT |

The next sections provide more detail about how the types are cast:

* Casting semi-structured ARRAY and VARIANT values to structured ARRAY values
* Casting semi-structured OBJECT and VARIANT values to structured OBJECT values
* Casting semi-structured OBJECT and VARIANT values to MAP values

#### Casting semi-structured ARRAY and VARIANT values to structured ARRAY values

The following steps demonstrate how to cast a semi-structured ARRAY or VARIANT value to an ARRAY value
of NUMBER elements:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    CAST ([1,2,3] AS ARRAY(NUMBER))
  ) AS array_cast_type,
  SYSTEM$TYPEOF(
    CAST ([1,2,3]::VARIANT AS ARRAY(NUMBER))
  ) AS variant_cast_type;
```

Or:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    [1,2,3]::ARRAY(NUMBER)
  ) AS array_cast_type,
  SYSTEM$TYPEOF(
    [1,2,3]::VARIANT::ARRAY(NUMBER)
  ) AS variant_cast_type;
```

```output
+--------------------------+--------------------------+
| ARRAY_CAST_TYPE          | VARIANT_CAST_TYPE        |
|--------------------------+--------------------------|
| ARRAY(NUMBER(38,0))[LOB] | ARRAY(NUMBER(38,0))[LOB] |
+--------------------------+--------------------------+
```

When you cast a semi-structured ARRAY or VARIANT value to a structured ARRAY value, note the following:

* Each element of the ARRAY value is cast to the specified type of the ARRAY.

  Casting the ARRAY column to ARRAY(VARCHAR) converts each value to a VARCHAR value:

  ```sqlexample
  SELECT
    CAST ([1,2,3] AS ARRAY(VARCHAR)) AS varchar_array,
    SYSTEM$TYPEOF(varchar_array) AS array_cast_type;
  ```

  ```output
  +---------------+---------------------+
  | VARCHAR_ARRAY | ARRAY_CAST_TYPE     |
  |---------------+---------------------|
  | [             | ARRAY(VARCHAR)[LOB] |
  |   "1",        |                     |
  |   "2",        |                     |
  |   "3"         |                     |
  | ]             |                     |
  +---------------+---------------------+
  ```
* If the element can’t be cast to the specified type (for example, casting `['a', 'b', 'c']` to ARRAY(NUMBER)), the cast fails.
* If the ARRAY value contains NULL elements and the ARRAY type specifies NOT NULL (for example, casting `[1, NULL, 3]` to
  ARRAY(NUMBER NOT NULL), the cast fails.
* Elements that are [JSON null values](../user-guide/semistructured-considerations.md) are converted to NULL, if the target element type doesn’t
  support JSON nulls (that is, the target type isn’t a semi-structured ARRAY, OBJECT, or VARIANT).

  For example, if you are casting to ARRAY(NUMBER), JSON null values are converted to NULL because NUMBER doesn’t support JSON
  nulls.

  On the other hand, if you are casting to ARRAY(VARIANT), JSON null values aren’t converted to NULL because VARIANT supports
  JSON nulls.

#### Casting semi-structured OBJECT and VARIANT values to structured OBJECT values

The following steps demonstrate how to cast a semi-structured OBJECT or VARIANT value to a structured OBJECT value
containing the `city` and `state` key-value pairs (which are VARCHAR values):

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    CAST ({'city':'San Mateo','state':'CA'} AS OBJECT(city VARCHAR, state VARCHAR))
  ) AS object_cast_type,
  SYSTEM$TYPEOF(
    CAST ({'city':'San Mateo','state':'CA'}::VARIANT AS OBJECT(city VARCHAR, state VARCHAR))
  ) AS variant_cast_type;
```

Or:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
     {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR, state VARCHAR)
  ) AS object_cast_type,
  SYSTEM$TYPEOF(
     {'city':'San Mateo','state':'CA'}::VARIANT::OBJECT(city VARCHAR, state VARCHAR)
  ) AS variant_cast_type;
```

```output
+------------------------------------------+------------------------------------------+
| OBJECT_CAST_TYPE                         | VARIANT_CAST_TYPE                        |
|------------------------------------------+------------------------------------------|
| OBJECT(city VARCHAR, state VARCHAR)[LOB] | OBJECT(city VARCHAR, state VARCHAR)[LOB] |
+------------------------------------------+------------------------------------------+
```

When you cast a semi-structured OBJECT or VARIANT value to a structured OBJECT value, note the following:

* The OBJECT value can’t contain any additional keys that aren’t specified in the OBJECT type.

  If there are additional keys, the cast fails.
* If the OBJECT value is missing a key that is specified in the OBJECT type, the cast fails.
* The value of each key in the OBJECT value is converted to the specified type for that key.

  If a value can’t be cast to the specified type, the cast fails.
* If the value for a key is a [JSON null value](../user-guide/semistructured-considerations.md), the value is converted to NULL when the target value
  type doesn’t support JSON nulls (that is, the target type is not a semi-structured ARRAY, OBJECT, or VARIANT).

  For example, if you are casting to OBJECT(city VARCHAR), JSON null values are converted to NULL because VARCHAR doesn’t
  support JSON nulls.

  On the other hand, if you are casting to OBJECT(city VARIANT), JSON null values aren’t converted to NULL because VARIANT
  supports JSON nulls.

#### Casting semi-structured OBJECT and VARIANT values to MAP values

The following statements demonstrate how to cast a semi-structured OBJECT or VARIANT value to a MAP value
that associates a VARCHAR key with a VARCHAR value:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    CAST ({'my_key':'my_value'} AS MAP(VARCHAR, VARCHAR))
  ) AS map_cast_type,
  SYSTEM$TYPEOF(
    CAST ({'my_key':'my_value'} AS MAP(VARCHAR, VARCHAR))
  ) AS variant_cast_type;
```

Or:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    {'my_key':'my_value'}::MAP(VARCHAR, VARCHAR)
  ) AS map_cast_type,
  SYSTEM$TYPEOF(
    {'my_key':'my_value'}::VARIANT::MAP(VARCHAR, VARCHAR)
  ) AS variant_cast_type;
```

```output
+----------------------------+----------------------------+
| MAP_CAST_TYPE              | VARIANT_CAST_TYPE          |
|----------------------------+----------------------------|
| MAP(VARCHAR, VARCHAR)[LOB] | MAP(VARCHAR, VARCHAR)[LOB] |
+----------------------------+----------------------------+
```

When you cast a semi-structured OBJECT or VARIANT value to a MAP value, note the following:

* If the keys and values do not match the specified types, the keys and values are converted to the specified types.
* If the keys and values can’t be cast to the specified types, the cast fails.
* If the value for a key is a [JSON null value](../user-guide/semistructured-considerations.md), the value is converted to NULL when the target value
  type doesn’t support JSON nulls (that is, the target type is not a semi-structured ARRAY, OBJECT, or VARIANT).

  For example, if you are casting to MAP(VARCHAR, VARCHAR), JSON null values are converted to NULL because VARCHAR doesn’t
  support JSON nulls.

  On the other hand, if you are casting to MAP(VARCHAR, VARIANT), JSON null values aren’t converted to NULL because VARIANT
  supports JSON nulls.

### Explicitly casting a structured type to a semi-structured type

To explicitly cast a value of a structured type to a value of a semi-structured type, you can
[call the CAST function, use the :: operator](functions/cast.md), or call one of the conversion functions
(for example, [TO_ARRAY](functions/to_array.md), [TO_OBJECT](functions/to_object.md),
or [TO_VARIANT](functions/to_variant.md)).

> **Note:**
>
> TRY_CAST isn’t supported with structured types.

| Structured type | Semi-structured type that you can cast to |
| --- | --- |
| Structured ARRAY | ARRAY |
| MAP or structured OBJECT | OBJECT |
| MAP, structured ARRAY, or structured OBJECT | VARIANT |

For example:

* If `col_structured_array` is ARRAY(VARCHAR) type:

  + CAST(col_structured_array AS ARRAY) returns a semi-structured ARRAY value.
  + CAST(col_structured_array AS VARIANT) returns a VARIANT value that holds a semi-structured ARRAY value.
* If `col_structured_object` is OBJECT(name VARCHAR, state VARCHAR) type:

  + CAST(col_structured_object AS OBJECT) returns a semi-structured OBJECT value.
  + CAST(col_structured_object AS VARIANT) returns a VARIANT value that holds a semi-structured OBJECT value.
* If `col_map` is MAP(VARCHAR, VARCHAR) type:

  + CAST(col_map AS OBJECT) returns a semi-structured OBJECT value.
  + CAST(col_map AS VARIANT) returns a VARIANT value that holds a semi-structured OBJECT value.

Note the following:

* When you are casting to a semi-structured OBJECT value, the order of keys in the structured OBJECT value isn’t preserved.
* When you are casting a structured OBJECT or MAP value to a semi-structured OBJECT or VARIANT value, any NULL values are converted to
  [JSON null values](../user-guide/semistructured-considerations.md).

  If you are casting a structured ARRAY value to a VARIANT value, NULL values are preserved as is.

  ```sqlexample
  SELECT [1,2,NULL,3]::ARRAY(INTEGER)::VARIANT;
  ```

  ```output
  +---------------------------------------+
  | [1,2,NULL,3]::ARRAY(INTEGER)::VARIANT |
  |---------------------------------------|
  | [                                     |
  |   1,                                  |
  |   2,                                  |
  |   undefined,                          |
  |   3                                   |
  | ]                                     |
  +---------------------------------------+
  ```
* If you are casting a MAP value that uses a NUMBER type for keys, the MAP keys are converted to strings in the returned OBJECT value.

### Implicit casting a value (coercion)

The following rules apply to [implicitly casting (coercion)](data-type-conversion.md) from a value of one structured type to
a value of another structured type:

* A structured type value can be coerced to another structured type value if the two basic types are the same:

  + An ARRAY value of one type can be coerced to an ARRAY value of another type, provided that the first element type is coercible to the
    second element type.

    An element type can be coerced to another element type in either of the following cases:

    - Both types are numeric. The following cases are supported:

      * Both use the same numeric type but possibly differ in precision and/or scale.
      * Coercing NUMBER to FLOAT (and vice versa).
    - Both types are timestamps. The following cases are supported:

      * Both use the same type but possibly differ in precision.
      * Coercing TIMESTAMP_LTZ to TIMESTAMP_TZ (and vice versa).

    For example:

    - An ARRAY(NUMBER) value can be coerced to an ARRAY(DOUBLE) value.
    - An ARRAY(DATE) value can’t be coerced to an ARRAY(NUMBER) value.
  + An OBJECT value with one type definition can be coerced to an OBJECT value of with another type definition only if all of the following
    are true:

    - Both OBJECT types have the same number of keys.
    - Both OBJECT types use the same names for keys.
    - The keys in both OBJECT types are in the same order.
    - The type of each value in one OBJECT type can be coerced to the type of the corresponding value in the other OBJECT type.

      As is the case with element types in structured ARRAY values, you can coerce the type of one value to another type only if:

      * Both types are numeric. The following cases are supported:

        + Both use the same numeric type but possibly differ in precision and/or scale.
        + Coercing NUMBER to FLOAT (and vice versa).
      * Both types are timestamps. The following cases are supported:

        + Both use the same type but possibly differ in precision.
        + Coercing TIMESTAMP_LTZ to TIMESTAMP_TZ (and vice versa).

    For example:

    - An OBJECT(city VARCHAR, zipcode NUMBER) value can be coerced to an OBJECT(city VARCHAR, zipcode DOUBLE) value.
    - An OBJECT(city VARCHAR, zipcode NUMBER) value can’t be coerced to an OBJECT(city VARCHAR, zipcode DATE) value.
  + A MAP value with one value type can be coerced to a MAP value with a different value type if:

    - Both value types are numeric. The following cases are supported:

      * Both use the same numeric type but possibly differ in precision and/or scale.
      * Coercing NUMBER to FLOAT (and vice versa).
    - Both value types are timestamps. The following cases are supported:

      * Both use the same type but possibly differ in precision.
      * Coercing TIMESTAMP_LTZ to TIMESTAMP_TZ (and vice versa).

    For example, a MAP(VARCHAR, NUMBER) value can be coerced to a MAP(VARCHAR, DOUBLE) value.
  + A MAP value with one key type can be coerced to a MAP value with a different key type if both key types use the same integer NUMERIC
    type that differ only in precision.

    For example, a MAP(VARCHAR, NUMBER) value can’t be coerced to a MAP(NUMBER, NUMBER) value.
* A structured type value can’t be coerced to a semi-structured value (and vice versa).
* A VARCHAR value can’t be coerced to a structured type value.

### Casting from one structured type to another

You can [call the CAST function or use the :: operator](functions/cast.md) to cast from a value of
one structured type to a value of another structured type. You can cast values from and to the following structured types:

* For structured ARRAYs:

  You can cast an ARRAY value of one type to an ARRAY value of another type.
* For structured OBJECTs:

  + You can use a cast to
    change the order of key-value pairs in an OBJECT value.
  + You can use a cast to
    change the names of the keys in an OBJECT value.
  + You can use a cast to
    add keys to an OBJECT value.
  + You can cast a structured OBJECT value to a MAP value.
* For MAP values:

  + You can cast a MAP value with keys and values of a specific type to a MAP value with keys and values of a different type.
  + You can cast a MAP value to a structured OBJECT value.

> **Note:**
>
> TRY_CAST isn’t supported with structured types.

If it isn’t possible to cast the values from one type to the other, the cast fails. For example, attempting to cast an
ARRAY(BOOLEAN) value to an ARRAY(DATE) value fails.

#### Example: Casting from one type of ARRAY value to another

The following example casts an ARRAY(NUMBER) value to an ARRAY(VARCHAR) value:

```sqlexample
SELECT CAST(
  CAST([1,2,3] AS ARRAY(NUMBER))
  AS ARRAY(VARCHAR)) AS cast_array;
```

```output
+------------+
| CAST_ARRAY |
|------------|
| [          |
|   "1",     |
|   "2",     |
|   "3"      |
| ]          |
+------------+
```

#### Example: Changing the order of key-value pairs in an OBJECT value

The following example changes the order of key-value pairs in a structured OBJECT value:

```sqlexample
SELECT CAST(
  {'city': 'San Mateo','state': 'CA'}::OBJECT(city VARCHAR, state VARCHAR)
  AS OBJECT(state VARCHAR, city VARCHAR)) AS object_value_order;
```

```output
+-----------------------+
| OBJECT_VALUE_ORDER    |
|-----------------------|
| {                     |
|   "state": "CA",      |
|   "city": "San Mateo" |
| }                     |
+-----------------------+
```

#### Example: Changing the key names in an OBJECT value

To change the key names in a structured OBJECT value, specify the RENAME FIELDS keywords at the end of CAST. For example:

```sqlexample
SELECT CAST({'city':'San Mateo','state': 'CA'}::OBJECT(city VARCHAR, state VARCHAR)
  AS OBJECT(city_name VARCHAR, state_name VARCHAR) RENAME FIELDS) AS object_value_key_names;
```

```output
+-----------------------------+
| OBJECT_VALUE_KEY_NAMES      |
|-----------------------------|
| {                           |
|   "city_name": "San Mateo", |
|   "state_name": "CA"        |
| }                           |
+-----------------------------+
```

#### Example: Adding keys to an OBJECT value

If the type that you are casting to has additional key-value pairs that aren’t present in the original structured OBJECT value,
specify the ADD FIELDS keywords at the end of CAST. For example:

```sqlexample
SELECT CAST({'city':'San Mateo','state': 'CA'}::OBJECT(city VARCHAR, state VARCHAR)
  AS OBJECT(city VARCHAR, state VARCHAR, zipcode NUMBER) ADD FIELDS) AS add_fields;
```

```output
+------------------------+
| ADD_FIELDS             |
|------------------------|
| {                      |
|   "city": "San Mateo", |
|   "state": "CA",       |
|   "zipcode": null      |
| }                      |
+------------------------+
```

The values for the newly added keys are set to NULL. If you want to assign a value to these keys, call the
OBJECT_INSERT function instead.

## Constructing structured ARRAY, structured OBJECT, and MAP values

The following sections explain how to construct structured ARRAY, structured OBJECT, and MAP values.

* Using SQL functions to construct structured ARRAY and OBJECT values
* Using ARRAY and OBJECT constants to construct structured ARRAY and OBJECT values
* Constructing a MAP value

### Using SQL functions to construct structured ARRAY and OBJECT values

The following functions construct semi-structured ARRAY values:

* [ARRAY_CONSTRUCT](functions/array_construct.md)
* [ARRAY_CONSTRUCT_COMPACT](functions/array_construct_compact.md)
* [ARRAY_AGG](functions/array_agg.md)
* [TO_ARRAY](functions/to_array.md)

The following functions construct semi-structured OBJECT values:

* [OBJECT_CONSTRUCT](functions/object_construct.md)
* [OBJECT_CONSTRUCT_KEEP_NULL](functions/object_construct_keep_null.md)
* [OBJECT_AGG](functions/object_agg.md)
* [TO_OBJECT](functions/to_object.md)

To construct a structured ARRAY or OBJECT value, use these functions and explicitly cast the return value of the function. For example:

```sqlexample
SELECT ARRAY_CONSTRUCT(10, 20, 30)::ARRAY(NUMBER);
```

```sqlexample
SELECT OBJECT_CONSTRUCT(
  'oname', 'abc',
  'created_date', '2020-01-18'::DATE
)::OBJECT(
  oname VARCHAR,
  created_date DATE
);
```

For details, refer to Explicitly casting a semi-structured type to a structured type.

> **Note:**
>
> You can’t pass structured ARRAY, structured OBJECT, or MAP values to these functions. Doing so would result in a structured
> type being implicitly cast to a semi-structured type, which isn’t allowed, as noted in
> Implicit casting a value (coercion).

### Using ARRAY and OBJECT constants to construct structured ARRAY and OBJECT values

When you specify an [ARRAY constant](data-types-semistructured.md) or an [OBJECT constant](data-types-semistructured.md), you are
specifying a semi-structured ARRAY or OBJECT value.

To construct a structured ARRAY or OBJECT value, you must explicitly cast the expression. For example:

```sqlexample
SELECT [10, 20, 30]::ARRAY(NUMBER);
```

```sqlexample
SELECT {
  'oname': 'abc',
  'created_date': '2020-01-18'::DATE
}::OBJECT(
  oname VARCHAR,
  created_date DATE
);
```

For details, refer to Explicitly casting a semi-structured type to a structured type.

### Constructing a MAP value

To construct a MAP value, construct a semi-structured OBJECT value, and cast the OBJECT value to a MAP value.

For example, the following statements both produce the MAP value `{'city'->'San Mateo','state'->'CA'}`:

```sqlexample
SELECT OBJECT_CONSTRUCT(
  'city', 'San Mateo',
  'state', 'CA'
)::MAP(
  VARCHAR,
  VARCHAR
);
```

```sqlexample
SELECT {
  'city': 'San Mateo',
  'state': 'CA'
}::MAP(
  VARCHAR,
  VARCHAR
);
```

The following statement produces the MAP value `{-10->'CA',-20->'OR'}`:

```sqlexample
SELECT {
  '-10': 'CA',
  '-20': 'OR'
}::MAP(
  NUMBER,
  VARCHAR
);
```

For details, refer to Casting semi-structured OBJECT and VARIANT values to MAP values.

## Working with keys, values, and elements in values of structured types

The following sections explain how to use keys, values, and elements in values of structured types.

* Getting the list of keys from a structured OBJECT value
* Getting the list of keys from a MAP value
* Accessing values and elements from values of structured types
* Determining the size of a structured ARRAY value
* Determining the size of a MAP value
* Looking up elements in a structured ARRAY value
* Determining if a MAP value contains a key

### Getting the list of keys from a structured OBJECT value

To get the list of keys in a structured OBJECT value, call the [OBJECT_KEYS](functions/object_keys.md) function:

```sqlexample
SELECT OBJECT_KEYS({'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR, state VARCHAR));
```

If the input is a structured OBJECT value, the function returns an ARRAY(VARCHAR) value containing the keys. If the input is a
semi-structured OBJECT value, the function returns an ARRAY value.

### Getting the list of keys from a MAP value

To get the list of keys in a MAP value, call the [MAP_KEYS](functions/map_keys.md) function:

```sqlexample
SELECT MAP_KEYS({'my_key':'my_value'}::MAP(VARCHAR,VARCHAR));
```

### Accessing values and elements from values of structured types

You can use the following methods to access values and elements from structured ARRAY, structured OBJECT, and MAP
values:

* The [GET](functions/get.md) function
* The [GET_IGNORE_CASE](functions/get_ignore_case.md) function
* The [GET_PATH](functions/get_path.md) function
* [Dot Notation](../user-guide/querying-semistructured.md)
* [Bracket Notation](../user-guide/querying-semistructured.md)

The returned values and elements have the type specified for the structured value, rather than VARIANT.

The following example passes the first element of a semi-structured ARRAY value and an ARRAY(VARCHAR) value to the
[SYSTEM$TYPEOF](functions/system_typeof.md) function to return the data type of that element:

```sqlexample
SELECT
  SYSTEM$TYPEOF(
    ARRAY_CONSTRUCT('San Mateo')[0]
  ) AS semi_structured_array_element,
  SYSTEM$TYPEOF(
    CAST(
      ARRAY_CONSTRUCT('San Mateo') AS ARRAY(VARCHAR)
    )[0]
  ) AS structured_array_element;
```

```output
+-------------------------------+--------------------------+
| SEMI_STRUCTURED_ARRAY_ELEMENT | STRUCTURED_ARRAY_ELEMENT |
|-------------------------------+--------------------------|
| VARIANT[LOB]                  | VARCHAR[LOB]             |
+-------------------------------+--------------------------+
```

Note the following:

* When you pass a structured OBJECT value to the GET or GET_IGNORE_CASE function, you must specify a constant for the key.

  You don’t need to specify a constant if you are passing a MAP or structured ARRAY value to the GET function.

  You also don’t need to specify a constant if you are passing a MAP value to the GET_IGNORE_CASE function.
* When you pass a structured OBJECT, structured ARRAY, or MAP value to the GET_PATH function, you must specify a constant for the path
  name.
* For a structured OBJECT value, if you use an OBJECT key or a path that doesn’t exist, a compile-time error occurs.

  In contrast, when you use an index, key, or path that doesn’t exist with a semi-structured OBJECT value, the function returns NULL.

### Determining the size of a structured ARRAY value

To determine the size of a structured ARRAY value, pass the ARRAY value to the [ARRAY_SIZE](functions/array_size.md) function:

```sqlexample
SELECT ARRAY_SIZE([1,2,3]::ARRAY(NUMBER));
```

### Determining the size of a MAP value

To determine the size of a MAP value, pass the MAP value to [MAP_SIZE](functions/map_size.md) function:

```sqlexample
SELECT MAP_SIZE({'my_key':'my_value'}::MAP(VARCHAR,VARCHAR));
```

### Looking up elements in a structured ARRAY value

To determine if an element is present in a structured ARRAY value, call the [ARRAY_CONTAINS](functions/array_contains.md) function.
For example:

```sqlexample
SELECT ARRAY_CONTAINS(10, [1, 10, 100]::ARRAY(NUMBER));
```

To determine the position of an element in a structured ARRAY value, call the [ARRAY_POSITION](functions/array_position.md) function.
For example:

```sqlexample
SELECT ARRAY_POSITION(10, [1, 10, 100]::ARRAY(NUMBER));
```

> **Note:**
>
> For both functions, use an element of a type that is comparable to the type of the
> ARRAY value.
>
> Don’t cast the expression for the element to a VARIANT value.

### Determining if a MAP value contains a key

To determine if a MAP value contains a key, call the [MAP_CONTAINS_KEY](functions/map_contains_key.md) function:

For example:

```sqlexample
SELECT MAP_CONTAINS_KEY('key_to_find', my_map);
```

```sqlexample
SELECT MAP_CONTAINS_KEY(10, my_map);
```

## Comparing values

The following sections explain how to compare values:

* Comparing structured values with semi-structured values
* Comparing structured values with other structured values
* Determining if two ARRAY values overlap

### Comparing structured values with semi-structured values

You can’t compare a structured ARRAY, structured OBJECT, or MAP value with a semi-structured ARRAY, OBJECT, or VARIANT value.

### Comparing structured values with other structured values

You can compare two values of the same type (for example, two structured ARRAY values, two structured OBJECT values, or two MAP values).

Currently, the following comparison operators are supported for comparing values of structured types:

* `=`
* `!=`
* `<`
* `<=`
* `>=`
* `>`

When you compare two structured values for equality, note the following:

* If one type can’t be coerced to the other type, the comparison fails.
* When you compare MAP values that have numeric keys, the keys are compared as numbers (not as VARCHAR values).

When you compare two structured values using `<`, `<=`, `>=`, or `>`, the structured value fields are compared in
alphabetical order. For example, the following value:

```sqlexample
{'a':2,'b':1}::OBJECT(b INTEGER,a INTEGER)
```

is greater than:

```sqlexample
{'a':1,'b':2}::OBJECT(b INTEGER,a INTEGER)
```

### Determining if two ARRAY values overlap

To determine if the elements of two structured ARRAY values overlap, call the
[ARRAYS_OVERLAP](functions/arrays_overlap.md) function. For example:

```sqlexample
SELECT ARRAYS_OVERLAP(numeric_array, other_numeric_array);
```

The ARRAY values must be of comparable types.

You can’t pass a semi-structured ARRAY value and a structured ARRAY value to this function. Both ARRAY values must either be structured
or semi-structured.

## Transforming values of structured types

The following sections explain how to transform structured ARRAY, structured OBJECT, and MAP values:

* Transforming structured ARRAY values
* Transforming structured OBJECT values
* Transforming MAP values

### Transforming structured ARRAY values

When you pass a structured ARRAY value to these functions, the functions return a structured ARRAY value of the same type:

* [ARRAY_APPEND](functions/array_append.md)
* [ARRAY_CAT](functions/array_cat.md)
* [ARRAY_COMPACT](functions/array_compact.md)
* [ARRAY_EXCEPT](functions/array_except.md)
* [ARRAY_INSERT](functions/array_insert.md)
* [ARRAY_INTERSECTION](functions/array_intersection.md)
* [ARRAY_PREPEND](functions/array_prepend.md)
* [ARRAY_SLICE](functions/array_slice.md)
* [ARRAY_UNION_AGG](functions/array_union_agg.md)

The next sections explain how these functions work with structured ARRAY values.

* Functions that add elements to ARRAY values
* Functions that accept multiple ARRAY values as input

#### Functions that add elements to ARRAY values

The following functions add elements to an ARRAY values:

* [ARRAY_APPEND](functions/array_append.md)
* [ARRAY_INSERT](functions/array_insert.md)
* [ARRAY_PREPEND](functions/array_prepend.md)

For these functions, the type of the element must be coercible to the type of
the ARRAY value.

For example, the following call succeeds because a NUMBER value can be coerced to a DOUBLE value (the type of the ARRAY value):

```sqlexample
SELECT ARRAY_APPEND( [1,2]::ARRAY(DOUBLE), 3::NUMBER );
```

The following call succeeds because VARCHAR values can be coerced to DOUBLE values:

```sqlexample
SELECT ARRAY_APPEND( [1,2]::ARRAY(DOUBLE), '3' );
```

The following call fails because DATE values can’t be coerced to NUMBER values:

```sqlexample
SELECT ARRAY_APPEND( [1,2]::ARRAY(NUMBER), '2022-02-02'::DATE );
```

#### Functions that accept multiple ARRAY values as input

The following functions accept multiple ARRAY values as input arguments:

* [ARRAY_CAT](functions/array_cat.md)
* [ARRAY_EXCEPT](functions/array_except.md)
* [ARRAY_INTERSECTION](functions/array_intersection.md)

When you call these functions, both arguments must either be structured ARRAY values or semi-structured ARRAY values.
For example, the following calls fail because one argument is a structured ARRAY value and the other argument is a
semi-structured ARRAY value:

```sqlexample
SELECT ARRAY_CAT( [1,2]::ARRAY(NUMBER), ['3','4'] );
```

```sqlexample
SELECT ARRAY_CAT( [1,2], ['3','4']::ARRAY(VARCHAR) );
```

The ARRAY_EXCEPT function returns an ARRAY value of the same type as the ARRAY value in the first argument.

The ARRAY_CAT and ARRAY_INTERSECTION functions return an ARRAY value of a type that can accommodate the types of both input values.

For example, the following call to ARRAY_CAT passes in two structured ARRAY values:

* The first structured ARRAY value doesn’t allow NULLs and contains NUMBER values with the scale of 0 (NUMBER(38, 0)).
* The second structured ARRAY value contains a NULL and a NUMBER value that has the scale of 1.

The ARRAY value returned by ARRAY_CAT allows NULLs and contains NUMBER values with the scale of 1.

```sqlexample
SELECT
  ARRAY_CAT(
    [1, 2, 3]::ARRAY(NUMBER NOT NULL),
    [5.5, NULL]::ARRAY(NUMBER(2, 1))
  ) AS concatenated_array,
  SYSTEM$TYPEOF(concatenated_array);
```

```output
+--------------------+-----------------------------------+
| CONCATENATED_ARRAY | SYSTEM$TYPEOF(CONCATENATED_ARRAY) |
|--------------------+-----------------------------------|
| [                  | ARRAY(NUMBER(38,1))[LOB]          |
|   1,               |                                   |
|   2,               |                                   |
|   3,               |                                   |
|   5.5,             |                                   |
|   undefined        |                                   |
| ]                  |                                   |
+--------------------+-----------------------------------+
```

For the ARRAY_CAT function, the ARRAY value in the second argument must be coercible
to the type in the first argument.

For the ARRAY_EXCEPT and ARRAY_INTERSECTION functions, the ARRAY value in the second argument must be
comparable to the ARRAY value in the first argument.

For example, the following call succeeds because an ARRAY(NUMBER) value is comparable to an ARRAY(DOUBLE) value:

```sqlexample
SELECT ARRAY_EXCEPT( [1,2]::ARRAY(NUMBER), [2,3]::ARRAY(DOUBLE) );
```

The following call fails because an ARRAY(NUMBER) value isn’t comparable to an ARRAY(VARCHAR) value:

```sqlexample
SELECT ARRAY_EXCEPT( [1,2]::ARRAY(NUMBER), ['2','3']::ARRAY(VARCHAR) );
```

### Transforming structured OBJECT values

The following sections explain how to return a structured OBJECT value that has been transformed from another OBJECT value:

* Removing key-value pairs
* Inserting key-value pairs and updating values
* Selecting key-value pairs from an existing OBJECT

To change the order of key-value pairs, rename keys, or add keys without specifying values, use the
[CAST function or :: operator](functions/cast.md). For details, see
Casting from one structured type to another.

#### Removing key-value pairs

To return a new OBJECT value that contains the key-value pairs from an existing OBJECT value with specific key-value pairs removed,
call the [OBJECT_DELETE](functions/object_delete.md) function.

When calling this function, note the following:

* For the arguments that are keys, you must specify constants.
* If the specified key isn’t part of the OBJECT type definition, the call fails. For example, the following call fails because
  the OBJECT value doesn’t contain the specified key `zip_code`:

  ```sqlexample
  SELECT OBJECT_DELETE( {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR), 'zip_code' );
  ```

  ```output
  093201 (23001): Function OBJECT_DELETE: expected structured object to contain field zip_code but it did not.
  ```
* The function returns a structured OBJECT value. The type of the OBJECT value excludes the deleted key. For example, suppose that you
  remove the `city` key:

  ```sqlexample
  SELECT
    OBJECT_DELETE(
      {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
      'city'
    ) AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  The function returns an OBJECT value of the type `OBJECT(state VARCHAR)`, which doesn’t include the `city` key.

  ```output
  +-----------------+----------------------------+
  | NEW_OBJECT      | SYSTEM$TYPEOF(NEW_OBJECT)  |
  |-----------------+----------------------------|
  | {               | OBJECT(state VARCHAR)[LOB] |
  |   "state": "CA" |                            |
  | }               |                            |
  +-----------------+----------------------------+
  ```
* If the function removes all keys from the OBJECT value, the function returns an empty structured OBJECT value of the type `OBJECT()`.

  ```sqlexample
  SELECT
    OBJECT_DELETE(
      {'state':'CA'}::OBJECT(state VARCHAR),
      'state'
    ) AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  ```output
  +------------+---------------------------+
  | NEW_OBJECT | SYSTEM$TYPEOF(NEW_OBJECT) |
  |------------+---------------------------|
  | {}         | OBJECT()[LOB]             |
  +------------+---------------------------+
  ```

  When the type of a structured OBJECT value includes key-value pairs, the names and types of those pairs are included in parentheses
  in the type (for example, OBJECT(city VARCHAR)). Because an empty structured OBJECT value contains no key-value pairs, the
  parentheses are empty.

#### Inserting key-value pairs and updating values

To return a new OBJECT value that contains the key-value pairs from an existing OBJECT value with additional key-value pairs or new values for
keys, call the [OBJECT_INSERT](functions/object_insert.md) function.

When calling this function, note the following:

* For the arguments that are keys, you must specify constants.
* When the `updateFlag` argument is FALSE (when you are inserting a new key-value pair):

  + If you specify a key that already exists in the OBJECT value, an error occurs.

    ```sqlexample
    SELECT OBJECT_INSERT(
      {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
      'city',
      'San Jose',
      false
    );
    ```

    ```output
    093202 (23001): Function OBJECT_INSERT:
      expected structured object to not contain field city but it did.
    ```
  + The function returns a structured OBJECT value. The type of the OBJECT value includes the newly inserted key. For example, suppose that
    you add the `zipcode` key with the VARCHAR value `94402`:

    ```sqlexample
    SELECT
      OBJECT_INSERT(
        {'city':'San Mateo','state':'CA'}::OBJECT(city VARCHAR,state VARCHAR),
        'zip_code',
        94402::VARCHAR,
        false
      ) AS new_object,
      SYSTEM$TYPEOF(new_object) AS type;
    ```

    ```output
    +------------------------+---------------------------------------------------------------------+
    | NEW_OBJECT             | TYPE                                                                |
    |------------------------+---------------------------------------------------------------------|
    | {                      | OBJECT(city VARCHAR, state VARCHAR, zip_code VARCHAR NOT NULL)[LOB] |
    |   "city": "San Mateo", |                                                                     |
    |   "state": "CA",       |                                                                     |
    |   "zip_code": "94402"  |                                                                     |
    | }                      |                                                                     |
    +------------------------+---------------------------------------------------------------------+
    ```

    The type of the inserted value determines the type added to the OBJECT type definition. In this case, the value for
    `zipcode` is a value cast to a VARCHAR, so the type of `zipcode` is VARCHAR.
* When the `updateFlag` argument is TRUE (when you are replacing an existing key-value pair):

  + If you specify a key that doesn’t exist in the OBJECT value, an error occurs.
  + The function returns a structured OBJECT value of the same type.
  + The type of the inserted value is coerced to the type of the existing key.

#### Selecting key-value pairs from an existing OBJECT

To return a new OBJECT value that contains selected key-value pairs from an existing OBJECT value,
call the [OBJECT_PICK](functions/object_pick.md) function.

When calling this function, note the following:

* For the arguments that are keys, you must specify constants.
* You can’t pass in an ARRAY of keys as the second argument. You must specify each key as a separate argument.
* The function returns a structured OBJECT value. The type of the OBJECT value includes the keys in the order in which they are specified.

  For example, suppose that you select the `state` and `city` keys in that order:

  ```sqlexample
  SELECT
    OBJECT_PICK(
      {'city':'San Mateo','state':'CA','zip_code':94402}::OBJECT(city VARCHAR,state VARCHAR,zip_code DOUBLE),
      'state',
      'city') AS new_object,
    SYSTEM$TYPEOF(new_object);
  ```

  The function returns an OBJECT value of the type `OBJECT(state VARCHAR, city VARCHAR)`.

  ```output
  +-----------------------+------------------------------------------+
  | NEW_OBJECT            | SYSTEM$TYPEOF(NEW_OBJECT)                |
  |-----------------------+------------------------------------------|
  | {                     | OBJECT(state VARCHAR, city VARCHAR)[LOB] |
  |   "state": "CA",      |                                          |
  |   "city": "San Mateo" |                                          |
  | }                     |                                          |
  +-----------------------+------------------------------------------+
  ```

### Transforming MAP values

To transform MAP values, use the following functions:

* [MAP_CAT](functions/map_cat.md)
* [MAP_DELETE](functions/map_delete.md)
* [MAP_INSERT](functions/map_insert.md)
* [MAP_PICK](functions/map_pick.md)

## Working with structured types

The following sections explain how to use different SQL functions and set operators with values of structured types:

* Using the FLATTEN function with values of structured types
* Using the PARSE_JSON function
* Using structured types with set operators and CASE expressions
* Working with other semi-structured functions

### Using the FLATTEN function with values of structured types

You can pass structured ARRAY, structured OBJECT, and MAP values to the FLATTEN function. As is the case with semi-structured data
types, you can use the PATH argument to specify the value being flattened.

* If the value being flattened is a structured ARRAY value and the RECURSIVE argument is FALSE, the `value` column contains a value of
  the same type as the ARRAY value.

  For example:

  ```sqlexample
  SELECT value, SYSTEM$TYPEOF(value)
    FROM TABLE(FLATTEN(INPUT => [1.08, 2.13, 3.14]::ARRAY(DOUBLE)));
  ```

  ```output
  +-------+----------------------+
  | VALUE | SYSTEM$TYPEOF(VALUE) |
  |-------+----------------------|
  |  1.08 | FLOAT[DOUBLE]        |
  |  2.13 | FLOAT[DOUBLE]        |
  |  3.14 | FLOAT[DOUBLE]        |
  +-------+----------------------+
  ```
* If the value being flattened is a MAP value and the RECURSIVE argument is FALSE, the `key` column contains a key of the same type as
  the MAP key, and the `value` column contains a value of the same type as the MAP value.

  For example:

  ```sqlexample
  SELECT key, SYSTEM$TYPEOF(key), value, SYSTEM$TYPEOF(value)
    FROM TABLE(FLATTEN(INPUT => {'my_key': 'my_value'}::MAP(VARCHAR, VARCHAR)));
  ```

  ```output
  +--------+--------------------+----------+----------------------+
  | KEY    | SYSTEM$TYPEOF(KEY) | VALUE    | SYSTEM$TYPEOF(VALUE) |
  |--------+--------------------+----------+----------------------|
  | my_key | VARCHAR[LOB]       | my_value | VARCHAR[LOB]         |
  +--------+--------------------+----------+----------------------+
  ```
* Otherwise, the `key` and `value` columns have the type VARIANT.

For MAP values, the order of keys and values returned is indeterminate.

### Using the PARSE_JSON function

The [PARSE_JSON](functions/parse_json.md) function doesn’t return structured types.

### Using structured types with set operators and CASE expressions

You can use structured ARRAY, structured OBJECT, and MAP values in:

* [Query expressions combined by set operators (e.g. UNION ALL)](operators-query.md).
* [CASE expressions](functions/case.md).

For set operators, if different types are used in the different expressions (for example, if one type is ARRAY(NUMBER) and the other is
ARRAY(DOUBLE)), one type is coerced to the other.

### Working with other semi-structured functions

The following functions don’t accept a structured ARRAY, structured OBJECT, or MAP values as an input argument:

* [AS_ARRAY](functions/as_array.md)
* [AS_OBJECT](functions/as_object.md)
* [IS_ARRAY](functions/is_array.md)
* [IS_OBJECT](functions/is_object.md)
* [TYPEOF](functions/typeof.md)

Passing a structured type value as input results in an error.

## Accessing structured types in applications using drivers

In applications that use drivers (for example, the ODBC or JDBC driver, the Snowflake Connector for Python, etc.), structured type
values are returned as semi-structured type values. For example:

* The values in a structured ARRAY column are returned as semi-structured ARRAY values to the client application.
* The values in a structured OBJECT or MAP column are returned as semi-structured OBJECT values to the client application.

> **Note:**
>
> For client applications that use the JDBC driver, the `ResultSet.getArray()` method returns an error
> if the query results you want to retrieve contain a structured ARRAY value with NULL values.
>
> To retrieve a string representation instead, use the `ResultSet.getString()` method:
>
> ```java
> String result = resultSet.getString(1);
> ```

## Using structured types with user-defined functions (UDFs) and stored procedures

When you create a user-defined function (UDF), user-defined table function (UDTF), or stored procedure in
[SQL](../developer-guide/udf/sql/udf-sql-introduction.md),
[Snowflake Scripting](../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md),
[Java](../developer-guide/udf/java/udf-java-introduction.md),
[Python](../developer-guide/udf/python/udf-python-introduction.md), or
[Scala](../developer-guide/udf/scala/udf-scala-introduction.md), you can use structured
types in the arguments and return values. For example:

```sqlexample
CREATE OR REPLACE FUNCTION my_udf(
    location OBJECT(city VARCHAR, zipcode NUMBER, val ARRAY(BOOLEAN)))
  RETURNS VARCHAR
  AS
  $$
    ...
  $$;
```

```sqlexample
CREATE OR REPLACE FUNCTION my_udtf(check BOOLEAN)
  RETURNS TABLE(col1 ARRAY(VARCHAR))
  AS
  $$
  ...
  $$;
```

```sqlexample
CREATE OR REPLACE PROCEDURE my_procedure(values ARRAY(INTEGER))
  RETURNS ARRAY(INTEGER)
  LANGUAGE SQL
  AS
  $$
    ...
  $$;
```

```sqlexample
CREATE OR REPLACE FUNCTION my_function(values ARRAY(INTEGER))
  RETURNS ARRAY(INTEGER)
  LANGUAGE PYTHON
  RUNTIME_VERSION=3.10
  AS
  $$
    ...
  $$;
```

> **Note:**
>
> Structured types aren’t yet supported in UDFs, UDTFs, and stored procedures in JavaScript.

## Viewing information about structured types

The following sections describe the views and commands that you can use to view information about structured types:

* Using the SHOW COLUMNS command to view structured type information
* Using the DESCRIBE and other SHOW commands to view structured type information
* Viewing information about the structured types used in a database

### Using the SHOW COLUMNS command to view structured type information

In the output of the [SHOW COLUMNS](sql/show-columns.md) command, the `data_type` column includes information about the
types of elements, keys, and values.

### Using the DESCRIBE and other SHOW commands to view structured type information

The output of the following commands includes information about structured types:

* [DESCRIBE TABLE](sql/desc-table.md)
* [DESCRIBE RESULT](sql/desc-result.md)
* [DESCRIBE FUNCTION](sql/desc-function.md)
* [DESCRIBE PROCEDURE](sql/desc-procedure.md)
* [SHOW FUNCTIONS](sql/show-functions.md)
* [SHOW PROCEDURES](sql/show-procedures.md)

For example, in the DESCRIBE RESULT output, the row for a MAP(VARCHAR, VARCHAR) column contains the following value in the
`type` column:

```sqlexample
map(VARCHAR, VARCHAR)
```

The row for an ARRAY(NUMBER) column contains the following value in the `type` column:

```sqlexample
ARRAY(NUMBER(38,0))
```

### Viewing information about the structured types used in a database

For columns of structured types, the INFORMATION_SCHEMA [COLUMNS view](info-schema/columns.md) only provides information about
the basic data type of the column (ARRAY, OBJECT, or MAP).

For example, the `data_type` column just contains `ARRAY`, `OBJECT`, or `MAP`. The column doesn’t include the types of the
elements, keys, or values.

To view information about the types of elements, keys, and values, use the following views:

* For information about the types of elements in structured ARRAY types, query the
  [ELEMENT_TYPES view in INFORMATION_SCHEMA](info-schema/element_types.md) or the
  [ELEMENT_TYPES view in ACCOUNT_USAGE](account-usage/element_types.md).
* For information about the types of keys and values in structured OBJECT and MAP types, query the
  [FIELDS view in INFORMATION_SCHEMA](info-schema/fields.md) or the
  [FIELDS view in ACCOUNT_USAGE](account-usage/fields.md).

---
title: Subquery operators
source: https://docs.snowflake.com/en/sql-reference/operators-subquery.md
section: SQL General Reference
---

# Subquery operators

A [subquery](../user-guide/querying-subqueries.md) is a query within another query. Subquery operators
perform operations on the values produced by subqueries.

Snowflake supports the following subquery operators:

* ALL / ANY
* [ NOT ] EXISTS
* [ NOT ] IN

## ALL / ANY

The ALL and ANY keywords can be used to apply a comparison operator to the values produced by a subquery (which can return more than one row).

### Syntax

```sqlsyntax
<expr> comparisonOperator { ALL | ANY } ( <query> )
```

Where:

```sqlsyntax
comparisonOperator ::=
  { = | != | > | >= | < | <= }
```

### Usage notes

* The expression is compared with the operator for each value that the subquery returns:

  + If ALL is specified, then the result is TRUE if every row of the subquery satisfies the condition; otherwise, it returns FALSE.
  + If ANY is specified, then the result is TRUE if any row of the subquery satisfies the condition; otherwise, it returns FALSE.
* ANY/ALL subqueries are currently supported only in a [WHERE](constructs/where.md) clause.
* ANY/ALL subqueries can’t appear as an argument to an [OR](operators-logical.md) operator.
* The subquery must contain only one item in its [SELECT](sql/select.md) list.

### Examples

Use a `!= ALL` subquery to find the departments that have no employees:

```sqlexample
SELECT department_id
  FROM departments d
  WHERE d.department_id != ALL (
    SELECT e.department_id
      FROM employees e);
```

## [ NOT ] EXISTS

An EXISTS subquery is a Boolean expression that can appear in a [WHERE](constructs/where.md) or [HAVING](constructs/having.md) clause,
or in any function that operates on a Boolean expression:

* An EXISTS expression evaluates to TRUE if any rows are produced by the subquery.
* A NOT EXISTS expression evaluates to TRUE if no rows are produced by the subquery.

### Syntax

```sqlsyntax
[ NOT ] EXISTS ( <query> )
```

### Usage notes

* [Correlated](../user-guide/querying-subqueries.md) EXISTS subqueries are currently supported only in a
  [WHERE](constructs/where.md) clause.
* Correlated EXISTS subqueries cannot appear as an argument to an [OR](operators-logical.md) operator.
* Uncorrelated EXISTS subqueries are supported anywhere that a Boolean expression is allowed.

### Examples

Use a correlated NOT EXISTS subquery to find the departments that have no employees:

```sqlexample
SELECT department_id
  FROM departments d
  WHERE NOT EXISTS (
    SELECT 1
      FROM employees e
      WHERE e.department_id = d.department_id);
```

## [ NOT ] IN

The IN and NOT IN operators check whether an expression is included in the values produced by a subquery.

### Syntax

```sqlsyntax
<expr> [ NOT ] IN ( <query> )
```

### Usage notes

* IN is shorthand for `= ANY`, and is subject to the same restrictions as ANY subqueries.
* NOT IN is shorthand for `!= ALL`, and is subject to the same restrictions as ALL subqueries.
* [NOT] IN can also be used as an operator in expressions that don’t involve a subquery. For details, see
  [[ NOT ] IN](functions/in.md).

### Examples

Use a NOT IN subquery that is equivalent to the `!= ALL` subquery example (earlier in this topic)
to find the departments that have no employees:

```sqlexample
SELECT department_id
  FROM departments d
  WHERE d.department_id NOT IN (
    SELECT e.department_id
      FROM employees e);
```

---
title: Summary of data types
source: https://docs.snowflake.com/en/sql-reference/intro-summary-data-types.md
section: SQL General Reference
---

# Summary of data types

Snowflake supports most SQL data types. The following table provides a summary of the supported data types:

| Category | Type | Notes |
| --- | --- | --- |
| [Numeric data types](data-types-numeric.md) | NUMBER | Default precision and scale are (38,0). |
|  | DECIMAL, NUMERIC | Synonymous with NUMBER. |
|  | INT, INTEGER, BIGINT, SMALLINT, TINYINT, BYTEINT | Synonymous with NUMBER, except precision and scale can’t be specified. |
|  | FLOAT, FLOAT4, FLOAT8 | [1] |
|  | DOUBLE, DOUBLE PRECISION, REAL | Synonymous with FLOAT. [1] |
|  | DECFLOAT | Stores numbers exactly, with up to 38 significant digits of precision, and uses a dynamic base-10 exponent. |
| [String & binary data types](data-types-text.md) | VARCHAR | Default length is 16777216 bytes. Maximum length is 134217728 bytes. |
|  | CHAR, CHARACTER | Synonymous with VARCHAR, except the default length is VARCHAR(1). |
|  | STRING, TEXT | Synonymous with VARCHAR. |
|  | BINARY |  |
|  | VARBINARY | Synonymous with BINARY. |
| [Logical data types](data-types-logical.md) | BOOLEAN | Currently only supported for accounts provisioned after January 25, 2016. |
| [Date & time data types](data-types-datetime.md) | DATE |  |
|  | DATETIME | Synonymous with TIMESTAMP_NTZ. |
|  | TIME |  |
|  | TIMESTAMP | Alias for one of the TIMESTAMP variations (TIMESTAMP_NTZ by default). |
|  | TIMESTAMP_LTZ | TIMESTAMP with local time zone; time zone, if provided, isn’t stored. |
|  | TIMESTAMP_NTZ | TIMESTAMP with no time zone; time zone, if provided, isn’t stored. |
|  | TIMESTAMP_TZ | TIMESTAMP with time zone. |
| [Semi-structured data types](data-types-semistructured.md) | VARIANT |  |
|  | OBJECT |  |
|  | ARRAY |  |
| [Structured data types](data-types-structured.md) | ARRAY |  |
|  | OBJECT |  |
|  | MAP |  |
| [Unstructured data types](data-types-unstructured.md) | FILE | See [Introduction to unstructured data](../user-guide/unstructured-intro.md). |
| [Geospatial data types](data-types-geospatial.md) | GEOGRAPHY |  |
|  | GEOMETRY |  |
| [UUID data type](data-types-uuid.md) | UUID |  |
| [Vector data types](data-types-vector.md) | VECTOR |  |
| [User-defined types](data-types-user-defined.md) | Not applicable | Defined by the user based on existing Snowflake data types. |

[1] A known issue in Snowflake displays FLOAT, FLOAT4, FLOAT8, REAL, DOUBLE, and DOUBLE PRECISION as FLOAT, even though they are stored as DOUBLE.

---
title: Summary of functions
source: https://docs.snowflake.com/en/sql-reference/intro-summary-operators-functions.md
section: SQL General Reference
---

# Summary of functions

Snowflake supports most of the standard functions defined in SQL:1999, as well as parts of the SQL:2003 analytic extensions.

## Scalar functions

A scalar function is a function that returns one value per invocation; in most cases, you can think of this as returning
one value per row. This contrasts with [Aggregate functions](functions-aggregation.md), which return one value per group of rows.

For a complete list of scalar function categories, see [Scalar functions](functions.md).

## Aggregate functions

Snowflake supports aggregate functions to operate on values across rows to perform mathematical calculations such as sum, average,
counting, minimum/maximum values, standard deviation, and estimation, as well as some non-mathematical operations.

For a complete list, see [Aggregate functions](functions-aggregation.md).

## Window functions

Window functions are [aggregate functions](functions-aggregation.md) that can operate on a subset of rows within the set of input rows.

## Table functions

Snowflake supports many [Table functions](functions-table.md) to obtain information about Snowflake features and services.

For a complete summary, see [List of system-defined table functions](functions-table.md).

## System functions

For a complete list of system functions, see [System functions](functions-system.md).

## User-defined functions (UDFs)

In addition to the system-defined functions provided by Snowflake, you can create user-defined functions (UDFs). See
[User-defined functions overview](../developer-guide/udf/udf-overview.md) for more information.

## External functions

Snowflake also supports [Writing external functions](external-functions.md), which are stored and executed outside Snowflake.

---
title: System functions
source: https://docs.snowflake.com/en/sql-reference/functions-system.md
section: SQL General Reference
---

# System functions

Snowflake provides the following types of system functions:

* Control functions that allow you to execute actions in the system (e.g. aborting a query).
* Information functions that return information about the system (e.g. calculating the clustering depth of a table).
* Information functions that return information about queries (e.g. information about EXPLAIN plans).

Many of these system functions have the prefix `SYSTEM$` (e.g. `SYSTEM$TYPEOF`). For the system functions that use
this prefix, you must specify the prefix when calling the function. For example:

```sqlexample
SELECT SYSTEM$TYPEOF('a');
```

| Function Name | Notes |
| --- | --- |
| **Control** |  |
| [EXECUTE_AI_EVALUATION](functions/execute_ai_evaluation.md) |  |
| [SYSTEM$ABORT_SESSION](functions/system_abort_session.md) |  |
| [SYSTEM$ABORT_TRANSACTION](functions/system_abort_transaction.md) |  |
| [SYSTEM$ACTIVATE_CMK_INFO](functions/system_activate_cmk_info.md) |  |
| [SYSTEM$ACTIVATE_CMK_INFO_POSTGRES](functions/system_activate_cmk_info_postgres.md) |  |
| [SYSTEM$ADD_EVENT (for Snowflake Scripting)](functions/system_add_event.md) |  |
| [SYSTEM$ADD_REFERENCE](functions/system_add_reference.md) |  |
| [SYSTEM$AUTHORIZE_PRIVATELINK](functions/system_authorize_privatelink.md) |  |
| [SYSTEM$AUTHORIZE_STAGE_PRIVATELINK_ACCESS](functions/system_authorize_stage_privatelink_access.md) |  |
| [SYSTEM$AUTHORIZE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](functions/system_authorize_snowflake_managed_storage_volume_privatelink_access.md) |  |
| [SYSTEM$BEGIN_DEBUG_APPLICATION](functions/system_begin_debug_application.md) |  |
| [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](functions/system_block_internal_stages_public_access.md) |  |
| [SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION](functions/system_block_internal_stages_public_access_with_exception.md) |  |
| [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](functions/system_block_snowflake_managed_storage_volume_public_access.md) |  |
| [SYSTEM$BLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_WITH_EXCEPTION](functions/system_block_snowflake_managed_storage_volume_public_access_with_exception.md) |  |
| [SYSTEM$CANCEL_ALL_QUERIES](functions/system_cancel_all_queries.md) |  |
| [SYSTEM$CANCEL_QUERY](functions/system_cancel_query.md) |  |
| [SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS](functions/system_cleanup_database_role_grants.md) |  |
| [SYSTEM$COMMIT_MOVE_ORGANIZATION_ACCOUNT](functions/system_commit_move_organization_account.md) |  |
| [SYSTEM$CONVERT_PIPES_SQS_TO_SNS](functions/system_convert_pipes_sqs_to_sns.md) |  |
| [SYSTEM$CREATE_BILLING_EVENT](functions/system_create_billing_event.md) |  |
| [SYSTEM$CREATE_BILLING_EVENTS](functions/system_create_billing_events.md) |  |
| [SYSTEM$DEACTIVATE_CMK_INFO](functions/system_deactivate_cmk_info.md) |  |
| [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT](functions/system_deprovision_privatelink_endpoint.md) |  |
| [SYSTEM$DEPROVISION_PRIVATELINK_ENDPOINT_TSS](functions/system_deprovision_privatelink_endpoint_tss.md) |  |
| [SYSTEM$DEREGISTER_CMK_INFO](functions/system_deregister_cmk_info.md) |  |
| [SYSTEM$DEREGISTER_CMK_INFO_POSTGRES](functions/system_deregister_cmk_info_postgres.md) |  |
| [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](functions/system_disable_behavior_change_bundle.md) |  |
| [SYSTEM$DISABLE_DATABASE_REPLICATION](functions/system_disable_database_replication.md) |  |
| [SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](functions/system_disable_global_data_sharing_for_account.md) |  |
| [SYSTEM$DISABLE_PREVIEW_ACCESS](functions/system_disable_preview_access.md) |  |
| [SYSTEM$DISABLE_PRIVATELINK_ACCESS_ONLY](functions/system_disable_privatelink_access_only.md) |  |
| [SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](functions/system_enable_behavior_change_bundle.md) |  |
| [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](functions/system_enable_global_data_sharing_for_account.md) |  |
| [SYSTEM$ENABLE_PREVIEW_ACCESS](functions/system_enable_preview_access.md) |  |
| [SYSTEM$END_DEBUG_APPLICATION](functions/system_end_debug_application.md) |  |
| [SYSTEM$ENFORCE_PRIVATELINK_ACCESS_ONLY](functions/system_enforce_privatelink_access_only.md) |  |
| [SYSTEM$FINISH_OAUTH_FLOW](functions/system_finish_oauth_flow.md) |  |
| [SYSTEM$GLOBAL_ACCOUNT_SET_PARAMETER](functions/system_global_account_set_parameter.md) |  |
| [SYSTEM$INITIATE_MOVE_ORGANIZATION_ACCOUNT](functions/system_initiate_move_organization_account.md) |  |
| [SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME](functions/system_link_account_objects_by_name.md) |  |
| [SYSTEM$LINK_ORGANIZATION_USER](functions/system_link_organization_user.md) |  |
| [SYSTEM$LINK_ORGANIZATION_USER_GROUP](functions/system_link_organization_user_group.md) |  |
| [SYSTEM$MIGRATE_SAML_IDP_REGISTRATION](functions/system_migrate_saml_idp_registration.md) |  |
| [SYSTEM$OPT_IN_INTERNAL_STAGE_NETWORK_LOGS](functions/system_opt_in_internal_stage_network_logs.md) |  |
| [SYSTEM$OPT_OUT_INTERNAL_STAGE_NETWORK_LOGS](functions/system_opt_out_internal_stage_network_logs.md) |  |
| [SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY](functions/system_opt_out_malicious_ip_protection_by_category.md) |  |
| [SYSTEM$PIPE_FORCE_RESUME](functions/system_pipe_force_resume.md) |  |
| [SYSTEM$PIPE_REBINDING_WITH_NOTIFICATION_CHANNEL](functions/system_pipe_rebinding_with_notification_channel.md) |  |
| [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT](functions/system_provision_privatelink_endpoint.md) |  |
| [SYSTEM$PROVISION_PRIVATELINK_ENDPOINT_TSS](functions/system_provision_privatelink_endpoint_tss.md) |  |
| [SYSTEM$REGISTER_CMK_INFO](functions/system_register_cmk_info.md) |  |
| [SYSTEM$REGISTER_CMK_INFO_POSTGRES](functions/system_register_cmk_info_postgres.md) |  |
| [SYSTEM$REGISTER_PRIVATELINK_ENDPOINT](functions/system_register_privatelink_endpoint.md) |  |
| [SYSTEM$REMOVE_ALL_REFERENCES](functions/system_remove_all_references.md) |  |
| [SYSTEM$REMOVE_REFERENCE](functions/system_remove_reference.md) |  |
| [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT](functions/system_restore_privatelink_endpoint.md) |  |
| [SYSTEM$RESTORE_PRIVATELINK_ENDPOINT_TSS](functions/system_restore_privatelink_endpoint_tss.md) |  |
| [SYSTEM$REVOKE_PRIVATELINK](functions/system_revoke_privatelink.md) |  |
| [SYSTEM$REVOKE_STAGE_PRIVATELINK_ACCESS](functions/system_revoke_stage_privatelink_access.md) |  |
| [SYSTEM$REVOKE_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PRIVATELINK_ACCESS](functions/system_revoke_snowflake_managed_storage_volume_privatelink_access.md) |  |
| [SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH](functions/system_schedule_async_replication_group_refresh.md) |  |
| [SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG](functions/system_send_notifications_to_catalog.md) |  |
| [SYSTEM$SET_APPLICATION_RESTRICTED_FEATURE_ACCESS](functions/system_set_application_restricted_feature_access.md) |  |
| [SYSTEM$SET_CATALOG_INTEGRATION](functions/system_set_catalog_integration.md) |  |
| [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_set_default_columns_override_for_show_command.md) |  |
| [SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_set_default_columns_override_for_system_object.md) |  |
| [SYSTEM$SET_EVENT_SHARING_ACCOUNT_FOR_REGION](functions/system_set_event_sharing_account_for_region.md) |  |
| [SYSTEM$SET_PRIVATELINK_ENDPOINT_HOSTNAME](functions/system_set_privatelink_endpoint_hostname.md) |  |
| [SYSTEM$SET_REFERENCE](functions/system_set_reference.md) |  |
| [SYSTEM$SET_ROW_TIMESTAMP_ON_ALL_SUPPORTED_TABLES](functions/system_set_row_timestamp_on_all_supported_tables.md) |  |
| [SYSTEM$SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS_STATUS](functions/system_snowflake_managed_storage_volume_public_access_status.md) |  |
| [SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN](functions/system_snowpipe_streaming_update_channel_offset_token.md) |  |
| [SYSTEM$START_OAUTH_FLOW](functions/system_start_oauth_flow.md) |  |
| [SYSTEM$START_USER_EMAIL_VERIFICATION](functions/system_start_user_email_verification.md) |  |
| [SYSTEM$TASK_DEPENDENTS_ENABLE](functions/system_task_dependents_enable.md) |  |
| [SYSTEM$TRIGGER_LISTING_REFRESH](functions/system_trigger_listing_refresh.md) |  |
| [SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS](functions/system_unblock_internal_stages_public_access.md) |  |
| [SYSTEM$UNBLOCK_SNOWFLAKE_MANAGED_STORAGE_VOLUME_PUBLIC_ACCESS](functions/system_unblock_snowflake_managed_storage_volume_public_access.md) |  |
| [SYSTEM$UNLINK_ORGANIZATION_USER](functions/system_unlink_organization_user.md) |  |
| [SYSTEM$UNLINK_ORGANIZATION_USER_GROUP](functions/system_unlink_organization_user_group.md) |  |
| [SYSTEM$UNREGISTER_PRIVATELINK_ENDPOINT](functions/system_unregister_privatelink_endpoint.md) |  |
| [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_unset_default_columns_override_for_show_command.md) |  |
| [SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_unset_default_columns_override_for_system_object.md) |  |
| [SYSTEM$UNSET_EVENT_SHARING_ACCOUNT_FOR_REGION](functions/system_unset_event_sharing_account_for_region.md) |  |
| [SYSTEM$USER_TASK_CANCEL_ONGOING_EXECUTIONS](functions/system_user_task_cancel_ongoing_executions.md) |  |
| [SYSTEM$WAIT](functions/system_wait.md) |  |
| **Information** |  |
| [EXTRACT_SEMANTIC_CATEGORIES](functions/extract_semantic_categories.md) |  |
| [GET_ANACONDA_PACKAGES_REPODATA](functions/get_anaconda_packages_repodata.md) |  |
| [SHOW_PYTHON_PACKAGES_DEPENDENCIES](functions/show_python_packages_dependencies.md) |  |
| [SYSTEM$ALLOWLIST](functions/system_allowlist.md) |  |
| [SYSTEM$ALLOWLIST_PRIVATELINK](functions/system_allowlist_privatelink.md) |  |
| [SYSTEM$APP_COMPATIBILITY_CHECK](functions/system_app_compatibility_check.md) |  |
| [SYSTEM$APPLICATION_GET_LOG_LEVEL](functions/system_application_get_log_level.md) |  |
| [SYSTEM$APPLICATION_GET_METRIC_LEVEL](functions/system_application_get_metric_level.md) |  |
| [SYSTEM$APPLICATION_GET_TRACE_LEVEL](functions/system_application_get_trace_level.md) |  |
| [SYSTEM$AUTO_REFRESH_STATUS](functions/system_auto_refresh_status.md) |  |
| [SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](functions/system_behavior_change_bundle_status.md) |  |
| [SYSTEM$CATALOG_LINK_STATUS](functions/system_catalog_link_status.md) |  |
| [SYSTEM$CKE_HASH_FUNCTION](functions/system_cke_hash_function.md) |  |
| [SYSTEM$CLIENT_VERSION_INFO](functions/system_client_version_info.md) |  |
| [SYSTEM$CLIENT_VULNERABILITY_INFO](functions/system_client_vulnerability_info.md) |  |
| [SYSTEM$CLUSTERING_DEPTH](functions/system_clustering_depth.md) |  |
| [SYSTEM$CLUSTERING_INFORMATION](functions/system_clustering_information.md) |  |
| [SYSTEM$CLUSTERING_RATIO](functions/system_clustering_ratio.md) | Deprecated; use the other clustering functions instead. |
| [SYSTEM$CURRENT_USER_TASK_NAME](functions/system_current_user_task_name.md) |  |
| [SYSTEM$DATA_METRIC_SCAN](functions/system_data_metric_scan.md) |  |
| [SYSTEM$DATABASE_REFRESH_HISTORY](functions/system_database_refresh_history.md) | Deprecated; use [DATABASE_REFRESH_HISTORY](functions/database_refresh_history.md) instead. |
| [SYSTEM$DATABASE_REFRESH_PROGRESS , SYSTEM$DATABASE_REFRESH_PROGRESS_BY_JOB](functions/system_database_refresh_progress.md) | Deprecated; use [DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB](functions/database_refresh_progress.md) instead. |
| [SYSTEM$DECODE_PAT](functions/system_decode_pat.md) |  |
| [SYSTEM$DESC_ICEBERG_ACCESS_IDENTITY](functions/system_desc_iceberg_access_identity.md) |  |
| [SYSTEM$ENCODE_CKE_PRIMARY_KEY](functions/system_encode_cke_primary_key.md) |  |
| [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](functions/system_estimate_automatic_clustering_costs.md) |  |
| [SYSTEM$ESTIMATE_SEARCH_OPTIMIZATION_COSTS](functions/system_estimate_search_optimization_costs.md) |  |
| [SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS](functions/system_evaluate_data_quality_expectations.md) |  |
| [EXPLAIN_PRIVILEGES](functions/explain_privileges.md) |  |
| [SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW](functions/system_export_tds_from_semantic_view.md) |  |
| [SYSTEM$EXTERNAL_TABLE_PIPE_STATUS](functions/system_external_table_pipe_status.md) |  |
| [SYSTEM$GENERATE_SAML_CSR](functions/system_generate_saml_csr.md) |  |
| [SYSTEM$GENERATE_SCIM_ACCESS_TOKEN](functions/system_generate_scim_access_token.md) |  |
| [SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](functions/system_get_all_default_columns_overrides.md) |  |
| [SYSTEM$GET_ALL_REFERENCES](functions/system_get_all_references.md) |  |
| [SYSTEM$GET_AWS_SNS_IAM_POLICY](functions/system_get_aws_sns_iam_policy.md) |  |
| [SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG](functions/system_get_catalog_linked_database_config.md) |  |
| [SYSTEM$GET_CLASSIFICATION_RESULT](functions/system_get_classification_result.md) |  |
| [SYSTEM$GET_CMK_AKV_CONSENT_URL](functions/system_get_cmk_akv_consent_url.md) |  |
| [SYSTEM$GET_CMK_CONFIG](functions/system_get_cmk_config.md) |  |
| [SYSTEM$GET_CMK_CONFIG_POSTGRES](functions/system_get_cmk_config_postgres.md) |  |
| [SYSTEM$GET_CMK_INFO](functions/system_get_cmk_info.md) |  |
| [SYSTEM$GET_CMK_INFO_POSTGRES](functions/system_get_cmk_info_postgres.md) |  |
| [SYSTEM$GET_CMK_KMS_KEY_POLICY](functions/system_get_cmk_kms_key_policy.md) |  |
| [SYSTEM$GET_COMPUTE_POOL_PENDING_MAINTENANCE](functions/system_get_compute_pool_pending_maintenance.md) |  |
| [SYSTEM$GET_DBT_LOG](functions/system_get_dbt_log.md) |  |
| [SYSTEM$GET_DEBUG_STATUS](functions/system_get_debug_status.md) |  |
| [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](functions/system_get_default_columns_override_for_show_command.md) |  |
| [SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](functions/system_get_default_columns_override_for_system_object.md) |  |
| [SYSTEM$GET_DIRECTORY_TABLE_STATUS](functions/system_get_directory_table_status.md) |  |
| [SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD](functions/system_get_gcp_kms_cmk_grant_access_cmd.md) |  |
| [SYSTEM$GET_HASH_FOR_APPLICATION](functions/system_get_hash_for_application.md) |  |
| [SYSTEM$GET_ICEBERG_TABLE_INFORMATION](functions/system_get_iceberg_table_information.md) |  |
| [SYSTEM$GET_INSTANCE_FAMILY_PLACEMENT_GROUPS](functions/system_get_instance_family_placement_groups.md) |  |
| [SYSTEM$GET_LOGIN_FAILURE_DETAILS](functions/system_get_login_failure_details.md) |  |
| [SYSTEM$GET_PREDECESSOR_RETURN_VALUE](functions/system_get_predecessor_return_value.md) |  |
| [SYSTEM$GET_PREVIEW_ACCESS_STATUS](functions/system_get_preview_access_status.md) |  |
| [SYSTEM$GET_PRIVATELINK](functions/system_get_privatelink.md) |  |
| [SYSTEM$GET_PRIVATELINK_AUTHORIZED_ENDPOINTS](functions/system_get_privatelink_authorized_endpoints.md) |  |
| [SYSTEM$GET_PRIVATELINK_CONFIG](functions/system_get_privatelink_config.md) |  |
| [SYSTEM$GET_PRIVATELINK_ENDPOINTS_INFO](functions/system_get_privatelink_endpoints_info.md) |  |
| [SYSTEM$GET_PRIVATELINK_ENDPOINT_REGISTRATIONS](functions/system_get_privatelink_endpoint_registrations.md) |  |
| [SYSTEM$GET_PURCHASE_ATTRIBUTES](functions/system_get_purchase_attributes.md) |  |
| [SYSTEM$GET_REFERENCED_OBJECT_ID_HASH](functions/system_get_referenced_object_id_hash.md) |  |
| [SYSTEM$GET_SERVICE_DNS_DOMAIN](functions/system_get_service_dns_domain.md) |  |
| [SYSTEM$GET_SERVICE_LOGS](functions/system_get_service_logs.md) |  |
| [SYSTEM$GET_SERVICE_STATUS — Deprecated](functions/system_get_service_status.md) | Deprecated; use the [SHOW SERVICE CONTAINERS IN SERVICE](sql/show-service-containers-in-service.md) command instead. |
| [SYSTEM$GET_SNOWFLAKE_EGRESS_IP_RANGES](functions/system_get_snowflake_egress_ip_ranges.md) |  |
| [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](functions/system_get_snowflake_platform_info.md) |  |
| [SYSTEM$GET_TABLE_ARCHIVE_METADATA](functions/system_get_table_archive_metadata.md) |  |
| [SYSTEM$GET_TAG](functions/system_get_tag.md) |  |
| [SYSTEM$GET_TAG_ALLOWED_VALUES](functions/system_get_tag_allowed_values.md) |  |
| [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](functions/system_get_tag_on_current_column.md) |  |
| [SYSTEM$GET_TAG_ON_CURRENT_TABLE](functions/system_get_tag_on_current_table.md) |  |
| [SYSTEM$GET_TASK_GRAPH_CONFIG](functions/system_get_task_graph_config.md) |  |
| [SYSTEM$HOLD_PRIVILEGE_ON_ACCOUNT](functions/system_hold_privilege_on_account.md) |  |
| [SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS](functions/system_internal_stages_public_access_status.md) |  |
| [SYSTEM$IS_APPLICATION_ALL_MANDATORY_TELEMETRY_EVENT_DEFINITIONS_ENABLED](functions/system_is_application_all_mandatory_telemetry_event_definitions_enabled.md) |  |
| [SYSTEM$IS_APPLICATION_AUTHORIZED_FOR_TELEMETRY_EVENT_SHARING](functions/system_is_application_authorized_for_telemetry_event_sharing.md) |  |
| [SYSTEM$IS_APPLICATION_INSTALLED_FROM_SAME_ACCOUNT](functions/system_is_application_installed_from_same_account.md) |  |
| [SYSTEM$IS_APPLICATION_SHARING_EVENTS_WITH_PROVIDER](functions/system_is_application_sharing_events_with_provider.md) |  |
| [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](functions/system_is_global_data_sharing_enabled_for_account.md) |  |
| [SYSTEM$IS_LISTING_PURCHASED](functions/system_is_listing_purchased.md) |  |
| [SYSTEM$IS_LISTING_TRIAL](functions/system_is_listing_trial.md) |  |
| [SYSTEM$LAST_CHANGE_COMMIT_TIME](functions/system_last_change_commit_time.md) |  |
| [SYSTEM$LIST_APPLICATION_RESTRICTED_FEATURES](functions/system_list_application_restricted_features.md) |  |
| [SYSTEM$LIST_ICEBERG_TABLES_FROM_CATALOG](functions/system_list_iceberg_tables_from_catalog.md) |  |
| [SYSTEM$LIST_NAMESPACES_FROM_CATALOG](functions/system_list_namespaces_from_catalog.md) |  |
| [SYSTEM$LOCATE_DBT_ARCHIVE](functions/system_locate_dbt_archive.md) |  |
| [SYSTEM$LOCATE_DBT_ARTIFACTS](functions/system_locate_dbt_artifacts.md) |  |
| [SYSTEM$LOG, SYSTEM$LOG_<level> (for Snowflake Scripting)](functions/system_log.md) |  |
| [SYSTEM$PIPE_STATUS](functions/system_pipe_status.md) |  |
| [SYSTEM$QUERY_REFERENCE](functions/system_query_reference.md) |  |
| [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](functions/system_read_yaml_from_semantic_view.md) |  |
| [SYSTEM$REFERENCE](functions/system_reference.md) |  |
| [SYSTEM$REGISTRY_LIST_IMAGES](functions/system_registry_list_images.md) | Deprecated; use the [SHOW IMAGES IN IMAGE REPOSITORY](sql/show-images-in-image-repository.md) command instead. |
| [SYSTEM$REPORT_HEALTH_STATUS](functions/system_report_health_status.md) |  |
| [SYSTEM$SAP_BDC_LIST_SHARES](functions/system_sap_bdc_list_shares.md) |  |
| [SYSTEM$SET_RETURN_VALUE](functions/system_set_return_value.md) |  |
| [SYSTEM$SET_SPAN_ATTRIBUTES (for Snowflake Scripting)](functions/system_set_span_attributes.md) |  |
| [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](functions/system_show_active_behavior_change_bundles.md) |  |
| [SYSTEM$SHOW_BUDGETS_FOR_RESOURCE](functions/system_show_budgets_for_resource.md) |  |
| [SYSTEM$SHOW_BUDGETS_IN_ACCOUNT](functions/system_show_budgets_in_account.md) |  |
| [SYSTEM$SHOW_EVENT_SHARING_ACCOUNTS](functions/system_show_event_sharing_accounts.md) |  |
| [SYSTEM$SHOW_MOVE_ORGANIZATION_ACCOUNT_STATUS](functions/system_show_move_organization_account_status.md) |  |
| [SYSTEM$SHOW_OAUTH_CLIENT_SECRETS](functions/system_show_oauth_client_secrets.md) |  |
| [SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES](functions/system_show_sensitive_data_monitored_entities.md) |  |
| [SYSTEM$STREAM_BACKLOG](functions/system_stream_backlog.md) | This function is a [table function](functions-table.md). |
| [SYSTEM$STREAM_GET_TABLE_TIMESTAMP](functions/system_stream_get_table_timestamp.md) |  |
| [SYSTEM$STREAM_HAS_DATA](functions/system_stream_has_data.md) |  |
| [SYSTEM$SUPPORTED_DBT_VERSIONS](functions/system_supported_dbt_versions.md) |  |
| [SYSTEM$TASK_RUNTIME_INFO](functions/system_task_runtime_info.md) |  |
| [SYSTEM$TYPEOF](functions/system_typeof.md) |  |
| [SYSTEM$VALIDATE_STORAGE_INTEGRATION](functions/system_validate_storage_integration.md) |  |
| [SYSTEM$VERIFY_CATALOG_INTEGRATION](functions/system_verify_catalog_integration.md) |  |
| [SYSTEM$VERIFY_CMK_INFO](functions/system_verify_cmk_info.md) |  |
| [SYSTEM$VERIFY_CMK_INFO_POSTGRES](functions/system_verify_cmk_info_postgres.md) |  |
| [SYSTEM$VERIFY_EXTERNAL_OAUTH_TOKEN](functions/system_verify_ext_oauth_token.md) |  |
| [SYSTEM$VERIFY_EXTERNAL_VOLUME](functions/system_verify_external_volume.md) |  |
| [SYSTEM$WHITELIST](functions/system_whitelist.md) | Deprecated; use [SYSTEM$ALLOWLIST](functions/system_allowlist.md) instead. |
| [SYSTEM$WAIT_FOR_SERVICES](functions/system_wait_for_services.md) |  |
| [SYSTEM$WHITELIST_PRIVATELINK](functions/system_whitelist_privatelink.md) | Deprecated; use [SYSTEM$ALLOWLIST_PRIVATELINK](functions/system_allowlist_privatelink.md) instead. |
| **Query Information** |  |
| [EXPLAIN_GRANTABLE_PRIVILEGES](functions/explain_grantable_privileges.md) |  |
| [EXPLAIN_JSON](functions/explain_json.md) |  |
| [GET_QUERY_OPERATOR_STATS](functions/get_query_operator_stats.md) |  |
| [GET_PYTHON_PROFILER_OUTPUT (SNOWFLAKE.CORE)](functions/get_python_profiler_output.md) |  |
| [SYSTEM$ESTIMATE_QUERY_ACCELERATION](functions/system_estimate_query_acceleration.md) |  |
| [SYSTEM$EXPLAIN_PLAN_JSON](functions/system_explain_plan_json.md) |  |
| [SYSTEM$EXPLAIN_JSON_TO_TEXT](functions/system_explain_json_to_text.md) |  |
| [SYSTEM$GET_RESULTSET_STATUS](functions/system_get_resultset_status.md) |  |

---
title: SYSTEM$CANCEL_CLASSIFY_SCHEMA
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_cancel_classify_schema.md
section: SQL General Reference
---

# SYSTEM$CANCEL_CLASSIFY_SCHEMA

Schedules the cancellation of the classification process for the tables in the specified schema. You can cancel the classification process
for tables that the role used to call this stored procedure can access.

A table that is staged to have the classification process canceled is not classified until you classify the table again.

## Syntax

```sqlsyntax
SYSTEM$CANCEL_CLASSIFY_SCHEMA( '<object_name>' )
```

## Arguments

`object_name`
:   The name of the schema containing the tables to have the classification process cancelled. If a database and schema are not in use in the
    current session, the name must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

## Returns

The stored procedure returns a JSON object in the following formats depending on the specified schema name:

* If you call [SYSTEM$CLASSIFY_SCHEMA](system_classify_schema.md) to stage classification and then call SYSTEM$CANCEL_CLASSIFY_SCHEMA with the same
  schema name to cancel the classification process, the output is as follows:

  ```sqljson
  {
    "failed": [],
    "succeeded": [
      {
        "message": "Classification Cancelled for table [T1].",
        "table_name": "T1"
      },
      {
        "message": "Classification Cancelled for table [T2].",
        "table_name": "T2"
      },
      ...
      }
    ]
  }
  ```
* If you call SYSTEM$CANCEL_CLASSIFY_SCHEMA and the specified schema is not staged for classification, the output is as follows:

  ```sqljson
  {
    "failed": [
      {
        "message": "Unable to cancel classification for table [T1] since its already complete.",
        "table_name": "T1"
      },
      {
        "message": "Unable to cancel classification for table [T2] since its already complete.",
        "table_name": "T2"
      },
      ...
    ],
    "succeeded": []
  }
  ```

Where:

`failed`
:   Specifies a reason why the cancellation process cannot be performed for the specified table.

`succeeded`
:   Confirms the cancellation process is scheduled for the specified table.

## Usage notes

* The cancellation process can take a short time (seconds) to complete. This is analogous to
  [canceling a query](../../user-guide/querying-cancel-statements.md).
* The specified schema name can contain up to 1000 table objects. If the schema contains more than 1000 table objects, Snowflake returns an
  error message.
* Snowflake-provided stored procedures utilize caller’s rights. For more details, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

## Examples

Cancels the classification of tables in the schema:

> ```sqlexample
> CALL SYSTEM$CANCEL_CLASSIFY_SCHEMA('hr.tables');
> ```

---
title: SYSTEM$CLASSIFY
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_classify.md
section: SQL General Reference
---

# SYSTEM$CLASSIFY

Classifies the specified object with the option to specify the number of rows to sample and assign the recommended
[classification tag](../../user-guide/classify-intro.md) to each column in the specified object.

## Syntax

```sqlsyntax
SYSTEM$CLASSIFY( '<object_name>' ,
  { '<classification_profile>' | <options> } )
```

## Arguments

`'object_name'`
:   The name of the table, external table, view, or materialized view containing the columns to be classified. If a database and schema are
    not in use in the current session, the name must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

`'classification_profile'`
:   Specifies a [classification profile](../../user-guide/classify-auto.md) in order to classify based on the criteria specified in the profile.

`options`
:   Specifies a JSON [OBJECT](../data-types-semistructured.md) that determines how the classification process works. One of the following:

    `NULL`
    :   Snowflake uses its default configuration based on the number of rows in the specified object. System tags are not set on any columns
        in the specified object.

    `{}`
    :   An empty object, which is functionally equivalent to specifying `NULL`.

    `{'sample_count': integer}`
    :   Specifies the number of rows to sample in the specified object. Any number from `1` to `10000`, inclusive.

    `{'auto_tag': true}`
    :   Sets the recommended classification system tags on the columns in the specified object when the classification process is complete.

        When you use this argument, call the stored procedure with the role that has the OWNERSHIP privilege on the schema.

    `{'sample_count': integer, 'auto_tag': true}`
    :   Classify the specified object while specifying the number of rows to sample and set the recommended system tag on each column in the
        specified object when the classification process is complete.

        When you use this argument, call the stored procedure with the role that has the OWNERSHIP privilege on the schema.

    `{'use_all_custom_classifiers': true}`
    :   Snowflake evaluates all custom classification instances and recommends the tag associated with a custom classification instance based
        on the classification result.

        This option uses the custom classifiers that are accessible to the role in use that calls the stored procedure
        (current role, caller’s rights). For information, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

    `{'custom_classifiers': ['instance_name1' [ , 'instance_name2' ... ] ]}`
    :   Specifies the custom classification instance to evaluate as a source for the recommended tag to be set on the column.

        You can specify multiple instances in the list and separate each instance with a comma.

## Returns

Returns a JSON object in the following format. For example:

```sqljson
{
  "classification_profile_config": {
    "classification_profile_name": "db1.sch.sensitive_data_detection_profile"
  },
  "classification_result": {
    "col1_name": {
      "alternates": [],
      "recommendation": {
        "confidence": "HIGH",
        "coverage": 1,
        "details": [],
        "privacy_category": "QUASI_IDENTIFIER",
        "semantic_category": "DATE_OF_BIRTH",
        "tags": [
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.semantic_category",
            "tag_value": "DATE_OF_BIRTH"
          },
          {
            "tag_applied": true,
            "tag_name": "snowflake.core.privacy_category",
            "tag_value": "QUASI_IDENTIFIER"
          }
        ]
      },
      "valid_value_ratio": 1
    }
  }
}
```

**Possible fields**:

`classification_profile_config`
:   If automatic classification is configured, contains the fully qualified name of the configuration profile that was used to generate the
    classification results.

`classification_result`
:   Provides details about each column that was classified.

`object_path_results`
:   When a column contains semi-structured data with sensitive fields, the `object_path_results` key lists the fields that were
    classified into a native or custom semantic category. For more information, see [View classification results for JSON columns](../../user-guide/classify-results.md).

`alternates`
:   Provides information about each tag and value to consider other than the recommended tag.

`recommendation`
:   Provides information about each tag and value as the primary choice based on the classification process.

These values can appear in both the alternates and recommendation:

> `classifier_name`
> :   The fully-qualified name of the custom classification instance that was used to tag the classified column.
>
>     This field only appears when using a custom classification instance as the source of the tag to set on a column.
>
> `confidence`
> :   Provides one of the following values: `HIGH`, `MEDIUM`, or `LOW`. This value indicates the relative confidence that Snowflake
>     has based upon the column sampling process and how the column data aligns with how Snowflake classifies data.
>
> `coverage`
> :   Provides the percent of sampled cell values that match the rules for a particular category.
>
> `details`
> :   Provides fields and values related to geography-specific classification. The `semantic_category` field contains the
>     [semantic subcategory](../../user-guide/classify-native.md) for a locale.
>
> `privacy_category`
> :   Provides the privacy category.
>
>     The possible values are `IDENTIFIER`, `QUASI-IDENTIFIER` and `SENSITIVE`.
>
> `semantic_category`
> :   Provides the semantic category. For a list of native semantic categories, see [Native semantic categories of sensitive data classification](../../user-guide/classify-native.md).
>
>     If the value is `MULTIPLE`, then sensitive data was found in semi-structured data. Inspect the `object_path_results` field
>     of the results object for a detailed breakdown of which native and custom semantic categories were found during classification. For more information, see [View classification results for JSON columns](../../user-guide/classify-results.md).
>
> `tags`
> :   Provides information about the tags that were applied to the column as a result of the classification process.
>
> `valid_value_ratio`
> :   Provides the ratio of how many values in the sample size are valid.
>
>     * For structured data, invalid values include NULL, an empty string, and a string with more than 256 characters.
>     * For semi-structured data, invalid values include NULL and an empty string.

## Usage notes

* Snowflake-provided stored procedures utilize caller’s rights. For more details, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* If you want to apply alternate system tag values, use an
  [ALTER TABLE … MODIFY COLUMN … SET TAG](../sql/alter-table-column.md) statement to update the tag value.
* To unset a Classification system tag from a column, use an ALTER TABLE … MODIFY COLUMN … UNSET TAG statement.

## Examples

Classify a table:

> ```sqlexample
> CALL SYSTEM$CLASSIFY('hr.tables.empl_info', null);
> ```

Classify a table and specify the number of rows to sample:

> ```sqlexample
> CALL SYSTEM$CLASSIFY('hr.tables.empl_info', {'sample_count': 1000});
> ```

Classify a table and set the system tags to the columns:

> ```sqlexample
> CALL SYSTEM$CLASSIFY('hr.tables.empl_info', {'auto_tag': true});
> ```

Classify a table, and specify the number of rows to sample and set the recommended system tag to each column in the table:

> ```sqlexample
> CALL SYSTEM$CLASSIFY('hr.tables.empl_info', {'sample_count': 1000, 'auto_tag': true});
> ```

Classify a table based on the criteria specified in the `my_config_profile` classification profile:

> ```sqlexample
> CALL SYSTEM$CLASSIFY('hr.tables.empl_info, 'my_config_profile');
> ```

---
title: SYSTEM$CLASSIFY_SCHEMA
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_classify_schema.md
section: SQL General Reference
---

# SYSTEM$CLASSIFY_SCHEMA

Schedules the tables in the specified schema to be classified with the option to specify the number of rows to sample in each table and
assign the recommended sensitive data classification [system tag](../../user-guide/classify-intro.md) to each column in the tables
stored in the specified schema.

## Syntax

```sqlsyntax
SYSTEM$CLASSIFY_SCHEMA( '<schema_name>' , <object> )
```

## Arguments

`schema_name`
:   The name of the schema containing the tables to be classified. If a database and schema are not in use in the current session, the name
    must be fully-qualified.

    The name must be specified exactly as it is stored in the database. If the name contains special characters, capitalization, or blank
    spaces, the name must be enclosed first in double-quotes and then in single quotes.

`object`
:   Specifies a JSON [OBJECT](../data-types-semistructured.md) that determines how the classification process works. One of the following:

    `NULL`
    :   Snowflake uses its default configuration based on the number of rows in the specified object. System tags are not set on any columns
        in the specified object.

    `{}`
    :   An empty object, which is functionally equivalent to specifying `NULL`.

    `{'sample_count': integer}`
    :   Specifies the number of rows to sample in the specified object. Any number from `1` to `10000`, inclusive.

    `{'auto_tag': true}`
    :   Sets the recommended classification system tags on the columns in the specified object when the classification process is complete.

        When you use this argument, call the stored procedure with the role that has the OWNERSHIP privilege on the schema.

    `{'sample_count': integer, 'auto_tag': true}`
    :   Classify the specified object while specifying the number of rows to sample and set the recommended system tag on each column in the
        specified object when the classification process is complete.

        When you use this argument, call the stored procedure with the role that has the OWNERSHIP privilege on the schema.

    `{'use_all_custom_classifiers': true}`
    :   Snowflake evaluates all custom classification instances and recommends the tag associated with a custom classification instance based
        on the classification result.

        This option uses the custom classifiers that are accessible to the role in use that calls the stored procedure
        (current role, caller’s rights). For information, see [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

    `{'custom_classifiers': ['instance_name1' [ , 'instance_name2' ... ] ]}`
    :   Specifies the custom classification instance to evaluate as a source for the recommended tag to be set on the column.

        You can specify multiple instances in the list and separate each instance with a comma.

## Returns

The stored procedure returns a JSON object in the following format. For example:

```sqljson
{
  "failed": [
    {
      "message": "Insufficient privileges.",
      "table_name": "t4"
    }
  ],
  "succeeded": [
    {
      "table_name": "t1"
    },
    {
      "table_name": "t2"
    },
    {
      "table_name": "t3"
    }
  ]
}
```

Where:

`failed`
:   Specifies a message that provides a reason why the table was not scheduled to be classified.

`succeeded`
:   Specifies each table that is scheduled for Data Classification.

## Usage notes

* The specified schema name can contain up to 1000 table objects. If the schema contains more than 1000 table objects, Snowflake returns an
  error message.
* Snowflake-provided stored procedures utilize caller’s rights. For more details, see
  [Understanding caller’s rights and owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).
* If you want to apply alternate system tag values, use an
  [ALTER TABLE … MODIFY COLUMN … SET TAG](../sql/alter-table-column.md) statement to update the tag value.
* To unset a Classification system tag from a column, use an ALTER TABLE … MODIFY COLUMN … UNSET TAG statement.

> **Caution:**
>
> When you call this stored procedure, the classification process for each table in the schema runs in parallel and consumes warehouse
> resources. If you call this stored procedure many times in a short period to classify tables in schemas simultaneously,
> those processes also run in parallel. Many parallel classification processes can exceed the warehouse capability, which causes the
> classification process for some tables to fail. Consequently, a schema might have some of its tables classified and others not classified.
>
> Prior to calling SYSTEM$CLASSIFY_SCHEMA, evaluate the number of columns in each table, the number of tables in a schema, the number of
> schemas that you want to classify, and the warehouse size that is in use for the session.

## Examples

Stage the classification of tables in the schema:

> ```sqlexample
> CALL SYSTEM$CLASSIFY_SCHEMA('hr.tables', null);
> ```

Stage the classification of the tables in the schema and specify the number of rows to sample:

> ```sqlexample
> CALL SYSTEM$CLASSIFY_SCHEMA('hr.tables', {'sample_count': 1000});
> ```

Stage the classification of the tables in the schema and set the system tags to the columns:

> ```sqlexample
> CALL SYSTEM$CLASSIFY_SCHEMA('hr.tables', {'auto_tag': true});
> ```

Stage the classification of the tables in the schema, specify the number of rows to sample, and set the recommended system tag to each
column in the table:

> ```sqlexample
> CALL SYSTEM$CLASSIFY_SCHEMA('hr.tables', {'sample_count': 1000, 'auto_tag': true});
> ```

---
title: SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md
section: SQL General Reference
---

# SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML

Creates a [semantic view](../../user-guide/views-semantic/overview.md) from a
[semantic model specification in YAML format](../../user-guide/views-semantic/sql.md), or verifies
that you can use a semantic model specification to create a semantic view.

The stored procedure uses the name from the YAML specification for the name of the semantic view.

If a semantic view with the same name already exists, the stored procedure attempts to replace that semantic view and copy the
grants from that semantic view. This has the same effect as running
[CREATE OR REPLACE SEMANTIC VIEW … COPY GRANTS](../sql/create-semantic-view.md).

See also:
:   [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../functions/system_read_yaml_from_semantic_view.md) , [CREATE SEMANTIC VIEW](../sql/create-semantic-view.md)

## Syntax

```sqlsyntax
SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  '<fully_qualified_schema_name>' ,
  '<yaml_specification>' ,
  [ <verify_only> ]
)
```

## Arguments

**Required:**

`'fully_qualified_schema_name'`
:   [Fully qualified name](../name-resolution.md) of the schema where you want to create the semantic view.

    You must qualify the schema name with the database name (for example, `my_db.my_schema`). Otherwise, an error occurs.

`'yaml_specification'`
:   [Semantic model specification in YAML format](../../user-guide/views-semantic/sql.md).

    If the specification contains quotes, backslashes, or newlines, you can use a
    [dollar-quoted string constant](../data-types-text.md) for this argument.

**Optional:**

`verify_only`
:   If TRUE, verifies that you can use the semantic model specified by `'yaml_specification'` to create a semantic view.

    You can specify this to verify that you can create a semantic view from the model before you attempt to create the semantic
    view.

    Default: FALSE

## Returns

Returns a VARCHAR value containing the status of the operation to create the semantic view or verify that the semantic view can
be created.

If the stored procedure fails to create the semantic view or verify that the semantic view can be created, the stored procedure
throws an exception.

## Access control requirements

A [role](../../user-guide/security-access-control-overview.md) used to execute this operation must have the following
[privileges](../../user-guide/security-access-control-overview.md) at a minimum:

| Privilege | Object | Notes |
| --- | --- | --- |
| CREATE SEMANTIC VIEW | Schema | Required to create a new semantic view. |
| SELECT | Table, view | Required on any tables and/or views used in the semantic view definition. |
| OWNERSHIP | Existing semantic view with the same name. | If a semantic view with the same name already exists, the stored procedure attempts to replace that semantic view. To replace an existing semantic view, you must use a role that has been granted the OWNERSHIP privilege.  OWNERSHIP is a special privilege on an object that is automatically granted to the role that created the object, but can also be transferred using the [GRANT OWNERSHIP](../sql/grant-ownership.md) command to a different role by the owning role (or any role with the MANAGE GRANTS privilege). |

Operating on an object in a schema requires at least one privilege on the parent database and at least one privilege on the parent schema.

For instructions on creating a custom role with a specified set of privileges, see [Creating custom roles](../../user-guide/security-access-control-configure.md).

For general information about roles and privilege grants for performing SQL actions on
[securable objects](../../user-guide/security-access-control-overview.md), see [Overview of Access Control](../../user-guide/security-access-control-overview.md).

## Usage notes

If the name of the database or schema is a [double-quoted identifier](../identifiers-syntax.md) (for example, if the name
contains spaces), you must include double quotes around the name. For example:

```sqlexample
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  '"my database"."my schema"',
  ...
);
```

## Examples

The following example verifies that you can use a given semantic model specification in YAML to create a semantic view named
`tpch_analysis` in the database `my_db` and schema `my_schema`:

```sqlexample-yaml
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  'my_db.my_schema',
  $$
  name: TPCH_REV_ANALYSIS
  description: Semantic view for revenue analysis
  tables:
    - name: CUSTOMERS
      description: Main table for customer data
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: CUSTOMER
      primary_key:
        columns:
          - C_CUSTKEY
      dimensions:
        - name: CUSTOMER_NAME
          synonyms:
            - customer name
          description: Name of the customer
          expr: customers.c_name
          data_type: VARCHAR(25)
        - name: C_CUSTKEY
          expr: C_CUSTKEY
          data_type: VARCHAR(134217728)
      metrics:
        - name: CUSTOMER_COUNT
          description: Count of number of customers
          expr: COUNT(c_custkey)
    - name: LINE_ITEMS
      description: Line items in orders
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: LINEITEM
      primary_key:
        columns:
          - L_ORDERKEY
          - L_LINENUMBER
      dimensions:
        - name: L_ORDERKEY
          expr: L_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: L_LINENUMBER
          expr: L_LINENUMBER
          data_type: VARCHAR(134217728)
      facts:
        - name: DISCOUNTED_PRICE
          description: Extended price after discount
          expr: l_extendedprice * (1 - l_discount)
          data_type: "NUMBER(25,4)"
        - name: LINE_ITEM_ID
          expr: "CONCAT(l_orderkey, '-', l_linenumber)"
          data_type: VARCHAR(134217728)
    - name: ORDERS
      synonyms:
        - sales orders
      description: All orders table for the sales domain
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: ORDERS
      primary_key:
        columns:
          - O_ORDERKEY
      dimensions:
        - name: ORDER_DATE
          description: Date when the order was placed
          expr: o_orderdate
          data_type: DATE
        - name: ORDER_YEAR
          description: Year when the order was placed
          expr: YEAR(o_orderdate)
          data_type: "NUMBER(4,0)"
        - name: O_ORDERKEY
          expr: O_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: O_CUSTKEY
          expr: O_CUSTKEY
          data_type: VARCHAR(134217728)
      facts:
        - name: COUNT_LINE_ITEMS
          expr: COUNT(line_items.line_item_id)
          data_type: "NUMBER(18,0)"
      metrics:
        - name: AVERAGE_LINE_ITEMS_PER_ORDER
          description: Average number of line items per order
          expr: AVG(orders.count_line_items)
        - name: ORDER_AVERAGE_VALUE
          description: Average order value across all orders
          expr: AVG(orders.o_totalprice)
  relationships:
    - name: LINE_ITEM_TO_ORDERS
      left_table: LINE_ITEMS
      right_table: ORDERS
      relationship_columns:
        - left_column: L_ORDERKEY
          right_column: O_ORDERKEY
      relationship_type: many_to_one
    - name: ORDERS_TO_CUSTOMERS
      left_table: ORDERS
      right_table: CUSTOMERS
      relationship_columns:
        - left_column: O_CUSTKEY
          right_column: C_CUSTKEY
      relationship_type: many_to_one
  $$,
TRUE);
```

If the specification is valid, the stored procedure returns the following message:

```output
+----------------------------------------------------------------------------------+
| SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML                                            |
|----------------------------------------------------------------------------------|
| YAML file is valid for creating a semantic view. No object has been created yet. |
+----------------------------------------------------------------------------------+
```

If the YAML syntax is invalid, the stored procedure throw an exception. For example, if a colon is missing:

```yaml
relationships
  - name: LINE_ITEM_TO_ORDERS
```

the stored procedure throws an exception, indicating that the YAML syntax is invalid:

```output
392400 (22023): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  Invalid semantic model YAML: while scanning a simple key
   in 'reader', line 90, column 3:
        relationships
        ^
  could not find expected ':'
   in 'reader', line 91, column 11:
          - name: LINE_ITEM_TO_ORDERS
                ^
```

If the specification refers to a physical table that does not exist, the stored procedure throws an exception:

```yaml
base_table:
  database: SNOWFLAKE_SAMPLE_DATA
  schema: TPCH_SF1
  table: NONEXISTENT
```

```output
002003 (42S02): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  SQL compilation error:
  Table 'SNOWFLAKE_SAMPLE_DATA.TPCH_SF1.NONEXISTENT' does not exist or not authorized.
```

Similarly, if the specification refers to a primary key column that does not exist, the stored procedure throws an exception:

```yaml
primary_key:
  columns:
    - NONEXISTENT
```

```output
000904 (42000): Uncaught exception of type 'EXPRESSION_ERROR' on line 3 at position 23 :
  SQL compilation error: error line 0 at position -1
  invalid identifier 'NONEXISTENT'
```

The following example creates a semantic view named `tpch_analysis` in the database `my_db` and schema `my_schema`:

```sqlexample-yaml
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  'my_db.my_schema',
  $$
  name: TPCH_REV_ANALYSIS
  description: Semantic view for revenue analysis
  tables:
    - name: CUSTOMERS
      description: Main table for customer data
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: CUSTOMER
      primary_key:
        columns:
          - C_CUSTKEY
      dimensions:
        - name: CUSTOMER_NAME
          synonyms:
            - customer name
          description: Name of the customer
          expr: customers.c_name
          data_type: VARCHAR(25)
        - name: C_CUSTKEY
          expr: C_CUSTKEY
          data_type: VARCHAR(134217728)
      metrics:
        - name: CUSTOMER_COUNT
          description: Count of number of customers
          expr: COUNT(c_custkey)
    - name: LINE_ITEMS
      description: Line items in orders
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: LINEITEM
      primary_key:
        columns:
          - L_ORDERKEY
          - L_LINENUMBER
      dimensions:
        - name: L_ORDERKEY
          expr: L_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: L_LINENUMBER
          expr: L_LINENUMBER
          data_type: VARCHAR(134217728)
      facts:
        - name: DISCOUNTED_PRICE
          description: Extended price after discount
          expr: l_extendedprice * (1 - l_discount)
          data_type: "NUMBER(25,4)"
        - name: LINE_ITEM_ID
          expr: "CONCAT(l_orderkey, '-', l_linenumber)"
          data_type: VARCHAR(134217728)
    - name: ORDERS
      synonyms:
        - sales orders
      description: All orders table for the sales domain
      base_table:
        database: SNOWFLAKE_SAMPLE_DATA
        schema: TPCH_SF1
        table: ORDERS
      primary_key:
        columns:
          - O_ORDERKEY
      dimensions:
        - name: ORDER_DATE
          description: Date when the order was placed
          expr: o_orderdate
          data_type: DATE
        - name: ORDER_YEAR
          description: Year when the order was placed
          expr: YEAR(o_orderdate)
          data_type: "NUMBER(4,0)"
        - name: O_ORDERKEY
          expr: O_ORDERKEY
          data_type: VARCHAR(134217728)
        - name: O_CUSTKEY
          expr: O_CUSTKEY
          data_type: VARCHAR(134217728)
      facts:
        - name: COUNT_LINE_ITEMS
          expr: COUNT(line_items.line_item_id)
          data_type: "NUMBER(18,0)"
      metrics:
        - name: AVERAGE_LINE_ITEMS_PER_ORDER
          description: Average number of line items per order
          expr: AVG(orders.count_line_items)
        - name: ORDER_AVERAGE_VALUE
          description: Average order value across all orders
          expr: AVG(orders.o_totalprice)
  relationships:
    - name: LINE_ITEM_TO_ORDERS
      left_table: LINE_ITEMS
      right_table: ORDERS
      relationship_columns:
        - left_column: L_ORDERKEY
          right_column: O_ORDERKEY
      relationship_type: many_to_one
    - name: ORDERS_TO_CUSTOMERS
      left_table: ORDERS
      right_table: CUSTOMERS
      relationship_columns:
        - left_column: O_CUSTKEY
          right_column: C_CUSTKEY
      relationship_type: many_to_one
  $$
);
```

```output
+-----------------------------------------+
| SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML   |
|-----------------------------------------|
| Semantic view was successfully created. |
+-----------------------------------------+
```

---
title: SYSTEM$REQUEST_LISTING_AND_WAIT
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_request_listing_and_wait.md
section: SQL General Reference
---

Categories:
:   [System functions](../functions-system.md) (System Control)

# SYSTEM$REQUEST_LISTING_AND_WAIT

Requests a listing and automatically polls for listing availability. To learn more about listings, see [About sharing with listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about).

## Syntax

```sqlsyntax
SYSTEM$REQUEST_LISTING_AND_WAIT( '<listing_global_name>' [ , <timeout_mins> ] )
```

## Arguments

`listing_global_name`
:   The global name of the listing being requested.

`timeout_mins`
:   The time to wait for listing request fulfillment in minutes before timing out. The default is 240 minutes or 4 hours.

## Returns

* Returns `Success: Listing <listing_global_name> is ready to be imported` when a requested listing becomes available or is already available.
* Returns `Error: Timed out waiting for the listing to be available after <timeout_mins> min(s)` when the specified timeout period is exceeded.

## Usage notes

To request a listing without waiting for listing fulfillment, enter `0` (zero) for the `timeout_mins` value. When the value is `0` and the request is successful, the message `Success: Listing <listing_global_name> requested successfully, but not waiting to confirm fulfillment` is returned.

## Examples

```sqlexample
CALL SYSTEM$REQUEST_LISTING_AND_WAIT('GZ13Z1VEWIJ', 60);
```

---
title: SYSTEM$SEND_EMAIL
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_send_email.md
section: SQL General Reference
---

# SYSTEM$SEND_EMAIL

Sends an [email notification](../../user-guide/notifications/email-stored-procedures.md) to the specified recipients from
`no-reply@snowflake.net`.

> **Note:**
>
> Email notifications are processed through Snowflake’s Amazon Web Services (AWS) deployments, using AWS Simple Email Service
> (SES). The content of an email message sent using AWS may be retained by Snowflake for up to thirty days to manage the delivery
> of the message. After this period, the message content is deleted.

## Syntax

```sqlsyntax
SYSTEM$SEND_EMAIL(
  '<integration_name>',
  '<email_address_1> [ , ... <email_address_N> ]',
  '<email_subject>',
  '<email_content>',
  [ '<mime_type>' ] )
```

## Arguments

### Required

`integration_name`
:   Name of the [notification integration](../../user-guide/notifications/email-notifications.md) that you want to use to send the
    email message.

`email_address_1 [ , ... email_address_N ]`
:   List of email addresses that should receive the email notification.

    Specify one or more unquoted email addresses in a comma-separated string.

    If the `ALLOWED_RECIPIENTS` property of the
    [notification integration](../../user-guide/notifications/email-notifications.md) is set and any of the email addresses is
    not in that list, no email notifications are sent.

`email_subject`
:   Subject line of the email notification. You cannot specify an empty string.

`email_content`
:   Body of the email. You cannot specify an empty string.

### Optional

`mime_type`
:   The MIME type of the `email_content` value, the email’s content. Default is `text/plain`.

    The following types are supported:

    * `text/plain` – Specify this when `email_content` is plain text. This is the default value.
    * `text/html` – Specify this when `email_content` is HTML.

      Note that the content of a message of the `text/html` type is not validated as well-formed HTML.

## Returns

Returns TRUE if the stored procedure executes successfully.

## Examples

See [Using SYSTEM$SEND_EMAIL to send email notifications](../../user-guide/notifications/email-stored-procedures.md).

---
title: SYSTEM$SEND_SNOWFLAKE_NOTIFICATION
source: https://docs.snowflake.com/en/sql-reference/stored-procedures/system_send_snowflake_notification.md
section: SQL General Reference
---

# SYSTEM$SEND_SNOWFLAKE_NOTIFICATION

Sends a notification message to an email address, webhook, or queue provided by a Cloud service (Amazon SNS, Google Cloud PubSub,
or Azure Event Grid).

See also:
:   [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md)

## Syntax

```sqlsyntax
SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  <message>,
  <integration_configuration> )

SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  ( <message>, [ <message>, ... ] ),
  <integration_configuration> )

SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  <message>,
  ( <integration_configuration> [ , <integration_configuration> , ... ] ) )

SYSTEM$SEND_SNOWFLAKE_NOTIFICATION(
  ( <message> [ , <message> , ... ] ),
  ( <integration_configuration> [ , <integration_configuration> , ... ] ) )
```

## Arguments

`message`
:   JSON-formatted string that specifies the type and content of the message. The string must be in the following format:

    ```json
    { "<content_type>": "<message_contents>" }
    ```

    Where:

    * `"content_type"` is one of the following:

      + `"text/plain"` for plain text messages.
      + `"text/html"` for HTML messages.
      + `"application/json"` for JSON messages.
    * `"<message_contents>"` is the content of the message.

    For example:

    ```json
    { "text/html": "<p>A message</p>" }
    ```

    To construct this string, you can call one of the following functions:

    * To send an HTML email message, call the [TEXT_HTML](../functions/text_html.md) function.
    * To send a plain text email message, call the [TEXT_PLAIN](../functions/text_plain.md) function.
    * To send a JSON message to a queue, call the [APPLICATION_JSON](../functions/application_json.md) function.

`integration_configuration`
:   JSON-formatted string that specifies the notification integration or the email configuration to use to send the notification.
    The string must be one of the following formats:

    ```json
    { "<integration_name>": {} }
    ```

    ```json
    { "<integration_name>": { <options> } }
    ```

    Where:

    * `"integration_name"` is the name of the notification integration.
    * `options` is a comma-delimited list of properties (in JSON format) that specify values that override the defaults
      in the integration. You can specify the following properties:

      | Property Name | Description |
      | --- | --- |
      | `subject` | Subject line of the email notification. For example:  ```json { "subject" : "Service status update" } ```  The subject cannot exceed 256 characters in length.  If you do not set this property, the default subject line from the integration is used.  If the integration does not specify a default subject line, `"Snowflake Email Notification"` is used. |
      | `toAddress` | List of email addresses of the recipients to include in the “To:” line of the email notification.  Format this list as a JSON array. For example:  ```json { "toAddress" : ["person_1@example.com", "person_2@example.com"] } ```  If you do not set this property, the stored procedure uses the list of email addresses from the DEFAULT_RECIPIENTS property of the [email notification integration](../../user-guide/notifications/email-notifications.md). |
      | `ccAddress` | List of email addresses of the recipients to include in the “Cc:” line of the email notification.  Format this list as a JSON array. For example:  ```json { "ccAddress" : ["person_to_cc1@example.com", "person_to_cc2@example.com"] } ``` |
      | `bccAddress` | List of email addresses of the recipients to include in the “Bcc:” line of the email notification.  Format this list as a JSON array. For example:  ```json { "bccAddress" : ["person_to_bcc1@example.com", "person_to_bcc2@example.com"] } ``` |

      For example:

      ```json
      { "my_queue_int": {} }
      ```

      ```json
      { "my_email_int": { "subject" : "Different subject" } }
      ```

      ```json
      { "my_email_int": { "subject" : "Different subject" }, { "toAddress": ["person_a@example.com"] }
      ```

    To construct the JSON-formatted strings for the integration configuration, call one of the following functions:

    * If you are sending the notification to a queue, or if you are sending an email notification and want to use the default values
      specified in the email notification integration, call the [INTEGRATION](../functions/integration.md) function.
    * if you are sending an email notification and want to override the default values specified in the email notification
      integration, call the [EMAIL_INTEGRATION_CONFIG](../functions/email_integration_config.md) function.

`( message [ , message , ... ] )`
:   ARRAY of JSON-formatted strings, each of which specify a message type and content. Specify this argument if you want to send a
    message in multiple formats.

    Each message should use the format described above.

    To construct the ARRAY, call the [ARRAY_CONSTRUCT](../functions/array_construct.md) function.

    > **Note:**
    >
    > The ARRAY cannot contain more than one object for the same message content type.

`( integration_configuration [ , integration_configuration , ... ] )`
:   ARRAY of JSON-formatted strings, each of which specifies a notification integration and configuration to use. Specify this
    argument if you want to use multiple notification integrations or email configurations to send a message.

    Each integration configuration should use
    the format described above.

    To construct the ARRAY, call the [ARRAY_CONSTRUCT](../functions/array_construct.md) function.

    > **Note:**
    >
    > The ARRAY cannot contain more than one object for the same notification integration.

## Returns

If the stored procedure executes successfully, it returns the string “Enqueued notifications”.

## Usage notes

* For email notifications, if the DEFAULT_RECIPIENTS property is not set in the notification integration and you do not set the
  `toAddress:` property in the SYSTEM$SEND_SNOWFLAKE_NOTIFICATION call, the call fails.
* For webhook notifications, call [SANITIZE_WEBHOOK_CONTENT](../functions/sanitize_webhook_content.md) to sanitize the message before passing
  the message to SYSTEM$SEND_SNOWFLAKE_NOTIFICATION.

## Examples

See [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../user-guide/notifications/snowflake-notifications.md).

---
title: Table functions
source: https://docs.snowflake.com/en/sql-reference/functions-table.md
section: SQL General Reference
---

# Table functions

A table function returns a set of rows for each input row. The returned set can contain zero, one, or more rows. Each row can
contain one or more columns.

Table functions are sometimes called “tabular functions”.

## What are table functions?

Table functions are typically used when a function returns multiple rows for each individual input.

Each time that a table function is called, it can return a different number of rows. For example, a function
`record_high_temperatures_for_date()`, which returns a list of record high temperatures for a specified date, might
return 0 rows on April 10, 1 row on June 10, and 40 rows on August 20.

### Simple examples of table functions

The following are appropriate as table functions:

* A function that accepts an account number and a date, and returns all charges billed to that account on that date.
  (More than one charge might have been billed on a particular date.)
* A function that accepts a user ID and returns the database roles assigned to that user.
  (A user might have multiple roles, including “sysadmin” and “useradmin”.)

### Functions in which each output row depends upon multiple input rows

Table functions can be grouped into two categories based on the number of input rows that affect each output row:

* 1-to-N
* M-to-N

The functions described earlier are 1-to-N table functions: each output row depends upon only one input row. For example, a
function `record_high_temperatures_for_date()` might produce multiple output rows (one for each city that hit a record on
that date). Each output row for a specific input date depends only on that date; each output row is independent of the rows for
every other date.

Snowflake also supports M-to-N table functions: each output row can depend upon multiple input rows. For example, if a function
generates a moving average of stock prices, that function uses stock prices from multiple input rows (multiple dates) to generate
each output row.

More generally, in an M-to-N function, a group of M input rows produces a group of N output rows. M can be one or more rows.
N can be zero, one, or more rows.

For example, in a 10-day moving average, M is 10. N is 1 because each group of 10 input rows produces one average price.

### Built-in table functions vs user-defined table functions

Snowflake provides hundreds of built-in functions, many of which are table functions. Built-in table functions are listed in
System-Defined Table Functions.

Users can also write their own functions, called user-defined functions or “UDFs”. Some UDFs are scalar; some are tabular.
User-defined table functions are called “UDTFs”. For information about UDFs (including UDTFs), see
[User-defined functions overview](../developer-guide/udf/udf-overview.md).

Built-in table functions and user-defined table functions generally follow the same rules; for example, they are called the same way
from SQL statements.

## Using a table function

### Using a table function in the FROM clause

A table contains a set of rows. Similarly, a table function returns a set of rows. Both tables and table functions are used in
contexts that expect a set of rows. Specifically, table functions are used in the [FROM](constructs/from.md) clause of a
SQL statement.

To help the SQL compiler recognize a table function as a source of rows, Snowflake requires that the table function call be
wrapped by the `TABLE()` keyword.

For example, the following statement calls a table function named `record_high_temperatures_for_date()`, which takes a DATE
value as an argument:

> ```sqlexample
> SELECT city_name, temperature
>     FROM TABLE(record_high_temperatures_for_date('2021-06-27'::DATE))
>     ORDER BY city_name;
> ```

For more information about the syntax of `TABLE()`, see [Table literals](literals-table.md).

Table functions, like functions in general, can accept zero, one, or multiple input arguments in each invocation. Each argument
must be a scalar expression.

For more details about the syntax of table function calls, see Syntax (in this topic).

### Using a table as input to a table function

The argument to a table function can be a literal or an expression, such as a column of a table.
For example, the SELECT statement below passes values from a table as arguments to a table function:

```sqlexample
CREATE OR REPLACE table dates_of_interest (event_date DATE);
INSERT INTO dates_of_interest (event_date) VALUES
    ('2021-06-21'::DATE),
    ('2022-06-21'::DATE);

CREATE OR REPLACE FUNCTION record_high_temperatures_for_date(d DATE)
    RETURNS TABLE (event_date DATE, city VARCHAR, temperature NUMBER)
    as
    $$
    SELECT d, 'New York', 65.0
    UNION ALL
    SELECT d, 'Los Angeles', 69.0
    $$;
```

```sqlexample
SELECT
        doi.event_date as "Date",
        record_temperatures.city,
        record_temperatures.temperature
    FROM dates_of_interest AS doi,
         TABLE(record_high_temperatures_for_date(doi.event_date)) AS record_temperatures
      ORDER BY doi.event_date, city;
+------------+-------------+-------------+
| Date       | CITY        | TEMPERATURE |
|------------+-------------+-------------|
| 2021-06-21 | Los Angeles |          69 |
| 2021-06-21 | New York    |          65 |
| 2022-06-21 | Los Angeles |          69 |
| 2022-06-21 | New York    |          65 |
+------------+-------------+-------------+
```

The arguments to a table function can come from other table-like sources, including views and other table functions.

## List of system-defined table functions

Snowflake provides the following system-defined (i.e. built-in) table functions:

| Sub-category | Function | Notes |
| --- | --- | --- |
| Data Loading | [INFER_SCHEMA](functions/infer_schema.md) | For more information, see [Load data into Snowflake](../guides-overview-loading-data.md). |
|  | [VALIDATE](functions/validate.md) |  |
| Data Generation | [GENERATOR](functions/generator.md) |  |
| Data Conversion | [SPLIT_TO_TABLE](functions/split_to_table.md) |  |
|  | [STRTOK_SPLIT_TO_TABLE](functions/strtok_split_to_table.md) |  |
| Differential Privacy | [CUMULATIVE_PRIVACY_LOSSES](functions/cumulative_privacy_losses.md) |  |
| Object Modeling | [GET_OBJECT_REFERENCES](functions/get_object_references.md) |  |
| Parameterized Queries | [TO_QUERY](functions/to_query.md) |  |
| Semi-structured Queries | [FLATTEN](functions/flatten.md) | For more information, see [Querying Semi-structured Data](../user-guide/querying-semistructured.md). |
| Query Results | [RESULT_SCAN](functions/result_scan.md) | Can be used to perform SQL operations on the output from another SQL operation (e.g. SHOW). |
| Query Profile | [GET_QUERY_OPERATOR_STATS](functions/get_query_operator_stats.md) |  |
| Historical & Usage Information |  | Includes:   * [Snowflake Information Schema](info-schema.md) * [Account Usage](account-usage.md) * [LOCAL schema](local.md) |
| User Login | [LOGIN_HISTORY , LOGIN_HISTORY_BY_USER](functions/login_history.md) |  |
| Queries | [QUERY_HISTORY , QUERY_HISTORY_BY_\*](functions/query_history.md) |  |
|  | [QUERY_ACCELERATION_HISTORY](functions/query_acceleration_history.md) | For more information, see [Using the Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md). |
| Warehouse & Storage Usage | [DATABASE_STORAGE_USAGE_HISTORY](functions/database_storage_usage_history.md) |  |
|  | [WAREHOUSE_LOAD_HISTORY](functions/warehouse_load_history.md) |  |
|  | [WAREHOUSE_METERING_HISTORY](functions/warehouse_metering_history.md) |  |
|  | [STAGE_STORAGE_USAGE_HISTORY](functions/stage_storage_usage_history.md) |  |
| Storage Lifecycle Policies | [STORAGE_LIFECYCLE_POLICY_HISTORY](functions/storage_lifecycle_policy_history.md) | Information Schema table function. For more information, see [Storage lifecycle policies](../user-guide/storage-management/storage-lifecycle-policies.md). |
| Column-level & Row-level Security | [POLICY_REFERENCES](functions/policy_references.md) |  |
| Object Tagging | [TAG_REFERENCES](functions/tag_references.md) | Information Schema table function. |
|  | [TAG_REFERENCES_ALL_COLUMNS](functions/tag_references_all_columns.md) | Information Schema table function. |
|  | [TAG_REFERENCES_WITH_LINEAGE](functions/tag_references_with_lineage.md) | Account Usage table function. |
| Account Replication | [REPLICATION_GROUP_DANGLING_REFERENCES](functions/replication_group_dangling_references.md) | For more information, see [Introduction to replication and failover across multiple accounts](../user-guide/account-replication-intro.md) |
|  | [REPLICATION_GROUP_REFRESH_HISTORY, REPLICATION_GROUP_REFRESH_HISTORY_ALL](functions/replication_group_refresh_history.md) |  |
|  | [REPLICATION_GROUP_REFRESH_PROGRESS, REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB, REPLICATION_GROUP_REFRESH_PROGRESS_ALL](functions/replication_group_refresh_progress.md) |  |
|  | [REPLICATION_GROUP_USAGE_HISTORY](functions/replication_group_usage_history.md) |  |
| Alerts | [ALERT_HISTORY](functions/alert_history.md) | For more information, see [Setting up alerts based on data in Snowflake](../user-guide/alerts.md). |
|  | [SERVERLESS_ALERT_HISTORY](functions/serverless_alert_history.md) |  |
| Bind variables | [BIND_VALUES](functions/bind_values.md) | For more information, see [Retrieve bind variable values](bind-variables.md). |
| Database Replication | [DATABASE_REFRESH_HISTORY](functions/database_refresh_history.md) | For more information, see [Replicating databases across multiple accounts](../user-guide/db-replication-config.md). |
|  | [DATABASE_REFRESH_PROGRESS , DATABASE_REFRESH_PROGRESS_BY_JOB](functions/database_refresh_progress.md) |  |
|  | [DATABASE_REPLICATION_USAGE_HISTORY](functions/database_replication_usage_history.md) |  |
| Data Loading & Transfer | [COPY_HISTORY](functions/copy_history.md) |  |
|  | [DATA_TRANSFER_HISTORY](functions/data_transfer_history.md) |  |
|  | [PIPE_USAGE_HISTORY](functions/pipe_usage_history.md) |  |
|  | [STAGE_DIRECTORY_FILE_REGISTRATION_HISTORY](functions/stage_directory_file_registration_history.md) |  |
|  | [VALIDATE_PIPE_LOAD](functions/validate_pipe_load.md) |  |
| Data Clustering (within Tables) | [AUTOMATIC_CLUSTERING_HISTORY](functions/automatic_clustering_history.md) | For more information, see [Automatic Clustering](../user-guide/tables-auto-reclustering.md). |
| dbt Projects on Snowflake | [DBT_PROJECT_EXECUTION_HISTORY](functions/dbt_project_execution_history.md) | For more information, see [dbt Projects on Snowflake](../user-guide/data-engineering/dbt-projects-on-snowflake.md). |
| Dynamic Tables | [DYNAMIC_TABLES](functions/dynamic_tables.md) | For more information, see [Create dynamic tables](../user-guide/dynamic-tables-create.md). |
|  | [DYNAMIC_TABLE_GRAPH_HISTORY](functions/dynamic_table_graph_history.md) |  |
|  | [DYNAMIC_TABLE_REFRESH_HISTORY](functions/dynamic_table_refresh_history.md) |  |
| External Functions | [EXTERNAL_FUNCTIONS_HISTORY](functions/external_functions_history.md) | For more information, see [Writing external functions](external-functions.md). |
| External Tables | [AUTO_REFRESH_REGISTRATION_HISTORY](functions/auto_refresh_registration_history.md) | For more information, see [Introduction to external tables](../user-guide/tables-external-intro.md). |
|  | [EXTERNAL_TABLE_FILES](functions/external_table_files.md) |  |
|  | [EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY](functions/external_table_registration_history.md) |  |
| Iceberg Tables | [ICEBERG_TABLE_FILES](functions/iceberg_table_files.md) | Information Schema table function. |
|  | [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](functions/iceberg_table_snapshot_refresh_history.md) | Information Schema table function. |
| Listings | [AVAILABLE_LISTINGS](functions/available_listings.md) |  |
|  | [AVAILABLE_LISTING_REFRESH_HISTORY](functions/available_listing_refresh_history.md) |  |
|  | [LISTING_REFRESH_HISTORY](functions/listing_refresh_history.md) |  |
| Materialized Views Maintenance | [MATERIALIZED_VIEW_REFRESH_HISTORY](functions/materialized_view_refresh_history.md) | For more information, see [Working with Materialized Views](../user-guide/views-materialized.md). |
| Machine learning | [ONLINE_FEATURE_TABLE_REFRESH_HISTORY](functions/online-feature-table-refresh-history.md) | For more information, see [Feature store commands](commands-feature-store.md). |
| Notifications | [NOTIFICATION_HISTORY](functions/notification_history.md) | For more information, see [Using SYSTEM$SEND_EMAIL to send email notifications](../user-guide/notifications/email-stored-procedures.md). |
| SCIM Maintenance | [REST_EVENT_HISTORY](functions/rest_event_history.md) | For more information, see [Auditing SCIM API requests](../user-guide/scim-intro.md) |
| Search Optimization Maintenance | [SEARCH_OPTIMIZATION_HISTORY](functions/search_optimization_history.md) | For more information, see [Search optimization service](../user-guide/search-optimization-service.md). |
| Streams | [SYSTEM$STREAM_BACKLOG](functions/system_stream_backlog.md) | For more information, see [Introduction to streams](../user-guide/streams-intro.md). |
| Tasks | [COMPLETE_TASK_GRAPHS](functions/complete_task_graphs.md) | For more information, see [Introduction to tasks](../user-guide/tasks-intro.md). |
|  | [CURRENT_TASK_GRAPHS](functions/current_task_graphs.md) |  |
|  | [SERVERLESS_TASK_HISTORY](functions/serverless_task_history.md) |  |
|  | [TASK_DEPENDENTS](functions/task_dependents.md) |  |
|  | [TASK_HISTORY](functions/task_history.md) |  |
| Network rules | [NETWORK_RULE_REFERENCES](functions/network_rule_references.md) | Information Schema table function. For details, see [Network rules](../user-guide/network-rules.md). |
| Data Quality | [DATA_METRIC_FUNCTION_EXPECTATIONS](functions/data_metric_function_expectations.md) |  |
|  | [DATA_METRIC_FUNCTION_REFERENCES](functions/data_metric_function_references.md) |  |
|  | [DATA_QUALITY_MONITORING_EXPECTATION_STATUS](functions/data_quality_monitoring_expectation_status.md) |  |
|  | [DATA_QUALITY_MONITORING_RESULTS](functions/data_quality_monitoring_results.md) |  |
|  | [SYSTEM$DATA_METRIC_SCAN](functions/system_data_metric_scan.md) |  |
|  | [SYSTEM$EVALUATE_DATA_QUALITY_EXPECTATIONS](functions/system_evaluate_data_quality_expectations.md) |  |
| Data Lineage | [GET_LINEAGE (SNOWFLAKE.CORE)](functions/get_lineage-snowflake-core.md) | For more information, see [Data Lineage](../user-guide/ui-snowsight-lineage.md). |
| Cortex Search | [CORTEX_SEARCH_DATA_SCAN](functions/cortex_search_data_scan.md) | For more information, see [Cortex Search](../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). |
|  | [CORTEX_SEARCH_REFRESH_HISTORY](functions/cortex_search_refresh_history.md) |  |
| Contacts | [GET_CONTACTS](functions/get_contacts.md) |  |
| Snowpark Container Services | [GET_JOB_HISTORY](functions/get_job_history.md) | For more information, see [Snowpark Container Services: Monitoring Services](../developer-guide/snowpark-container-services/monitoring-services.md). |
|  | [<service_name>!SPCS_GET_EVENTS](functions/spcs_get_events.md) |  |
|  | [<service_name>!SPCS_GET_LOGS](functions/spcs_get_logs.md) |  |
|  | [<service_name>!SPCS_GET_METRICS](functions/spcs_get_metrics.md) |  |
| Snowflake Native Apps | [APPLICATION_CALLBACK_HISTORY](functions/application_callback_history.md) | For more information, see [Callbacks](../developer-guide/native-apps/callbacks.md). |
|  | [APPLICATION_SPECIFICATION_STATUS_HISTORY](functions/application_specification_status_history.md) | For more information, see [Overview of app specifications](../developer-guide/native-apps/requesting-app-specs.md). |
|  | [APPLICATION_CONFIGURATION_VALUE_HISTORY](functions/application_configuration_value_history.md) | For more information, see [Application configuration](../developer-guide/native-apps/app-configuration.md). |
| Cortex Agents | [GET_AI_RECORD_TRACE (SNOWFLAKE.LOCAL)](functions/get_ai_record_trace-snowflake-local.md) | For more information, see [Cortex Agent evaluations](../user-guide/snowflake-cortex/cortex-agents-evaluations.md). |
|  | [GET_AI_OBSERVABILITY_LOGS (SNOWFLAKE.LOCAL)](functions/get_ai_observability_logs-snowflake-local.md) | For more information, see [Cortex Agent evaluations](../user-guide/snowflake-cortex/cortex-agents-evaluations.md). |
|  | [GET_AI_EVALUATION_DATA (SNOWFLAKE.LOCAL)](functions/get_ai_evaluation_data-snowflake-local.md) | For more information, see [Cortex Agent evaluations](../user-guide/snowflake-cortex/cortex-agents-evaluations.md). |

## Syntax

```sqlsyntax
SELECT ...
  FROM [ <input_table> [ [AS] <alias_1> ] ,
         [ LATERAL ]
       ]
       TABLE( <table_function>( [ <arg_1> [, ... ] ] ) ) [ [ AS ] <alias_2> ];
```

For function-specific syntax, see the documentation for the individual system-defined table functions.

## Usage notes

* Table functions can also be applied to a set of rows using the LATERAL construct.
* To enable using table expressions, Snowflake supports ANSI/ISO standard syntax for table expressions in the [FROM](constructs/from.md) clause of queries and subqueries. This syntax is used to
  indicate that an expression returns a collection of rows instead of a single row.
* This ANSI/ISO syntax is valid only in the [FROM](constructs/from.md) clause of the [SELECT](sql/select.md) list. You cannot omit these keywords and parentheses from a
  collection subquery specification in any other context.

---
title: Table literals
source: https://docs.snowflake.com/en/sql-reference/literals-table.md
section: SQL General Reference
---

# Table literals

Table literals are used to pass the name of a table or a placeholder value (instead of a table name) to a query. Table literals appear in the [FROM](constructs/from.md) clause of a SQL
statement and consist of either the table name, or a SQL variable or API bind variable in place of the table name.

Informally, when using `TABLE(...)` to construct a table literal, you can think of `TABLE()` as like a
[table function](functions-table.md). Syntactically, `TABLE()` looks like a function. Semantically,
`TABLE()` behaves similarly to a table function because it:

* Accepts a scalar value as input.
* Returns a set of 0 or more rows.
* Can be used as a source of rows in a [FROM](constructs/from.md) clause.

## Syntax

```sqlsyntax
TABLE( { <string_literal> | <session_variable> | <bind_variable> } )
```

`string_literal`
:   A string literal that contains an [identifier](identifiers.md) for a table:

    * The identifier can be fully-qualified in the form of:

      `db_name.schema_name.table_name`

      `schema_name.table_name`
    * Double quotes are supported for individual object identifiers that are case-sensitive or contain spaces and special characters.
    * The entire identifier string must be enclosed in single quotes or `$$`. For example:

      `'mytable'` or `$$mytable$$`

      `'mydb.myschema.mytable'` or `$$mydb.myschema.mytable$$`

      `'"DB 1"."Schema 1".mytable'` or `$$"DB 1"."Schema 1".mytable$$`

`session_variable`
:   A [SQL variable](session-variables.md) that has been set for the session.

`bind_variable`
:   A bind variable, in the standard form of `?` or `:name`, for use with APIs that support bindings (Java, Python, etc.).

## Usage notes

* Table literals are supported in the [FROM](constructs/from.md) clause only.
* Where `TABLE()` is supported, it is equivalent to using [IDENTIFIER()](identifier-literal.md).
* When a bind variable is used to prepare a statement, table metadata is not available after preparing the statement.

## Examples

Query the table `mytable` using a table literal (note that the following two examples are syntactically equivalent):

> ```sqlexample
> SELECT * FROM TABLE('mytable');
>
> SELECT * FROM TABLE($$mytable$$);
> ```

Query the table `mytable` in the schema `myschema` and the database `mydb` using a table literal (note that the following two examples are syntactically equivalent):

> ```sqlexample
> SELECT * FROM TABLE('mydb."myschema"."mytable"');
>
> SELECT * FROM TABLE($$mydb."myschema"."mytable"$$);
> ```

Set a session variable that references a table name and then query the table using the variable passed as a table literal:

> ```sqlexample
> SET myvar = 'mytable';
>
> SELECT * FROM TABLE($myvar);
> ```

Prepare a statement with a binding that represents a table (note that the following two examples are syntactically equivalent):

> ```sqlexample
> SELECT * FROM TABLE(?);
>
> SELECT * FROM TABLE(:binding);
> ```

---
title: Table, view, & sequence DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-table.md
section: SQL General Reference
---

# Table, view, & sequence DDL

Tables and views are the primary objects created and maintained in database schemas:

* All data in Snowflake is stored in tables.
* Views can be used to display selected rows and columns in one or more tables.

Sequences are also schema-level objects. Sequences can be used to generate unique numbers across sessions and statements or to generate
values for a primary key or any column that requires a unique value.

## Table management

* [CREATE TABLE](sql/create-table.md)
* [CREATE TABLE … CLONE](sql/create-clone.md)
* [CREATE TABLE … CONSTRAINT](sql/create-table-constraint.md)
* [ALTER TABLE](sql/alter-table.md)
* [ALTER TABLE … ALTER COLUMN](sql/alter-table-column.md)
* [ALTER TABLE … CONSTRAINT](sql/create-table-constraint.md)
* [DROP TABLE](sql/drop-table.md)
* [UNDROP TABLE](sql/undrop-table.md)
* [SHOW TABLES](sql/show-tables.md) (also [SHOW OBJECTS](sql/show-objects.md))
* [SHOW COLUMNS](sql/show-columns.md)
* [DESCRIBE TABLE](sql/desc-table.md)
* [DESCRIBE SEARCH OPTIMIZATION](sql/desc-search-optimization.md)

## Event table management

* [CREATE EVENT TABLE](sql/create-event-table.md)
* [ALTER TABLE (event tables)](sql/alter-table-event-table.md)
* [DROP TABLE](sql/drop-table.md)
* [SHOW EVENT TABLES](sql/show-event-tables.md)
* [DESCRIBE EVENT TABLE](sql/desc-event-table.md)

## External table management

* [CREATE EXTERNAL TABLE](sql/create-external-table.md)
* [ALTER EXTERNAL TABLE](sql/alter-external-table.md)
* [DROP EXTERNAL TABLE](sql/drop-external-table.md)
* [SHOW EXTERNAL TABLES](sql/show-external-tables.md) (also [SHOW OBJECTS](sql/show-objects.md))
* [DESCRIBE EXTERNAL TABLE](sql/desc-external-table.md)

## Standard view management

* [CREATE VIEW](sql/create-view.md)
* [ALTER VIEW](sql/alter-view.md)
* [DROP VIEW](sql/drop-view.md)
* [SHOW VIEWS](sql/show-views.md) (also [SHOW OBJECTS](sql/show-objects.md))
* [SHOW COLUMNS](sql/show-columns.md)
* [DESCRIBE VIEW](sql/desc-view.md)

## Materialized view management

* [CREATE MATERIALIZED VIEW](sql/create-materialized-view.md)
* [ALTER MATERIALIZED VIEW](sql/alter-materialized-view.md)
* [DROP MATERIALIZED VIEW](sql/drop-materialized-view.md)
* [SHOW MATERIALIZED VIEWS](sql/show-materialized-views.md)
* [DESCRIBE MATERIALIZED VIEW](sql/desc-materialized-view.md)

## Sequence management

* [CREATE SEQUENCE](sql/create-sequence.md)
* [CREATE SEQUENCE … CLONE](sql/create-clone.md)
* [ALTER SEQUENCE](sql/alter-sequence.md)
* [DROP SEQUENCE](sql/drop-sequence.md)
* [SHOW SEQUENCES](sql/show-sequences.md)
* [DESCRIBE SEQUENCE](sql/desc-sequence.md)

## Column-level security management

Use these commands for Dynamic Data Masking and External Tokenization.

* [CREATE MASKING POLICY](sql/create-masking-policy.md)
* [ALTER MASKING POLICY](sql/alter-masking-policy.md) (see also: [ALTER TABLE](sql/alter-table.md), [ALTER TABLE … ALTER COLUMN](sql/alter-table-column.md), and [ALTER VIEW](sql/alter-view.md))
* [DROP MASKING POLICY](sql/drop-masking-policy.md)
* [SHOW MASKING POLICIES](sql/show-masking-policies.md)
* [DESCRIBE MASKING POLICY](sql/desc-masking-policy.md)

## Row access policy management

Snowflake supports the following DDL commands and operations to manage row access policies:

* [CREATE ROW ACCESS POLICY](sql/create-row-access-policy.md)
* [ALTER ROW ACCESS POLICY](sql/alter-row-access-policy.md)
* [DROP ROW ACCESS POLICY](sql/drop-row-access-policy.md)
* [SHOW ROW ACCESS POLICIES](sql/show-row-access-policies.md)
* [DESCRIBE ROW ACCESS POLICY](sql/desc-row-access-policy.md)
* [ALTER TABLE](sql/alter-table.md), [ALTER EXTERNAL TABLE](sql/alter-external-table.md), and [ALTER VIEW](sql/alter-view.md) (to add/drop a policy on a table or view)

---
title: Ternary logic
source: https://docs.snowflake.com/en/sql-reference/ternary-logic.md
section: SQL General Reference
---

# Ternary logic

As specified in the SQL standard, ternary logic, or three-valued logic (3VL), is a logic system with three truth values: TRUE, FALSE, and UNKNOWN. In Snowflake, UNKNOWN is represented by NULL. Ternary logic
applies to the evaluation of Boolean expressions, as well as predicates, and affects the results of logical operations such as AND, OR, and NOT:

* When used in expressions (for example, in a [SELECT](sql/select.md) list), UNKNOWN results are returned as NULL values.
* When used as a predicate (for example, in a [WHERE](constructs/where.md) clause), UNKNOWN results evaluate to FALSE.

## Truth tables

This section describes the truth tables for the [comparison](operators-comparison.md) and [logical](operators-logical.md) operators.

### Comparison operators

If any operand for a comparison operator is NULL, the result is NULL. The comparison operators and functions are:

* `=` , `!=` , `<>`
* `<` , `<=` , `>` , `>=`
* [GREATEST](functions/greatest.md), [LEAST](functions/least.md)

### Logical operators

Given a BOOLEAN column `C`:

| If `C` is: | `C AND NULL` evaluates to: | `C OR NULL` evaluates to: | `NOT C` evaluates to: |
| --- | --- | --- | --- |
| TRUE | NULL | TRUE | FALSE |
| FALSE | FALSE | NULL | TRUE |
| NULL | NULL | NULL | NULL |

In addition:

| If `C` is: | `C AND (NOT C)` evaluates to: | `C OR (NOT C)` evaluates to: | `NOT (C OR NULL)` evaluates to: |
| --- | --- | --- | --- |
| TRUE | FALSE | TRUE | FALSE |
| FALSE | FALSE | TRUE | NULL |
| NULL | NULL | NULL | NULL |

## Usage notes for conditional expressions

This section describes behavior specific to conditional expressions.

### IFF behavior

[IFF](functions/iff.md) returns the following results for ternary logic. Given a BOOLEAN column `C`:

| If `C` is: | `IFF(C, e1, e2)` evaluates to: |
| --- | --- |
| TRUE | `e1` |
| FALSE | `e2` |
| NULL | `e2` |

### [ NOT ] IN behavior

[[ NOT ] IN](functions/in.md) returns the following results for ternary logic. Given 3 numeric columns `c1`, `c2`, and `c3`:

* `c1 IN (c2, c3, ...)` is syntactically equivalent to `(c1 = c2 OR c1 = c3 OR ...)`.

  As a result, when the value of `c1` is NULL, the expression `c1 IN (c2, c3, NULL)` always evaluates to NULL.
* `c1 NOT IN (c2, c3, ... )` is syntactically equivalent to `(c1 <> c2 AND c1 <> c3 AND ...)`.

  Therefore, even if `c1 NOT IN (c2, c3)` is TRUE, `c1 NOT IN (c2, c3, NULL)` evaluates to NULL.

---
title: TOP <n>
source: https://docs.snowflake.com/en/sql-reference/constructs/top_n.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# TOP *<n>*

Constrains the maximum number of rows returned by a statement or subquery.

See also:
:   [LIMIT / FETCH](limit.md)

## Syntax

```sqlsyntax
SELECT
  [ TOP <n> ]
    ...
FROM ...
[ ORDER BY ... ]
[ ... ]
```

## Parameters

`n`
:   The maximum number of rows to return in the result set.

## Usage notes

* An [ORDER BY](order-by.md) clause is not required; however, without an [ORDER BY](order-by.md) clause, the results are non-deterministic because results within a result set are not necessarily in any particular order. To control the results returned, use an [ORDER BY](order-by.md) clause.
* When TOP *<n>* and ORDER BY are at different nesting levels in a query, results can be unpredictable.
  For details and examples, see the [LIMIT / FETCH usage notes](limit.md).
* `n` must be a non-negative integer constant.
* TOP `n` and LIMIT `count` are equivalent.

## Examples

The following example shows the effect of TOP N. For simplicity, these
queries omit the ORDER BY clause and assume that the output order is
always the same as shown by the first query. **Real-world queries should
include an ORDER BY clause.**

```sqlexample
SELECT c1 FROM testtable;
```

```output
+------+
|   C1 |
|------|
|    1 |
|    2 |
|    3 |
|   20 |
|   19 |
|   18 |
|    1 |
|    2 |
|    3 |
|    4 |
| NULL |
|   30 |
| NULL |
+------+
```

```sqlexample
SELECT TOP 4 c1 FROM testtable;
```

```output
+----+
| C1 |
|----|
|  1 |
|  2 |
|  3 |
| 20 |
+----+
```

---
title: Transactions
source: https://docs.snowflake.com/en/sql-reference/transactions.md
section: SQL General Reference
---

# Transactions

A transaction is a sequence of SQL statements that are committed or rolled back as a unit.

## Introduction

### What is a transaction?

A transaction is a sequence of SQL statements that are processed as an atomic unit. All statements in the transaction
are either applied (committed) or undone (rolled back) together.
Snowflake transactions guarantee [ACID properties](http://en.wikipedia.org/wiki/ACID).

A transaction can include both reads and writes.

Transactions follow these rules:

* Transactions are never *nested*. For example, you cannot create an *outer* transaction that would roll back an
  *inner* transaction that was committed, or create an *outer* transaction that would commit an *inner* transaction
  that had been rolled back.
* A transaction is associated with a single session. Multiple sessions cannot share the same transaction. For
  information about handling transactions with overlapping threads in the same session, see
  Transactions and multi-threading.

### Terminology

In this topic:

* The term *DDL* includes CTAS statements (CREATE TABLE AS SELECT) as well as other DDL statements that define database objects.
* The term *DML* refers to INSERT, UPDATE, DELETE, MERGE, and TRUNCATE statements.
* The term *query statement* refers to SELECT and [CALL](sql/call.md) statements.

Although a CALL statement (which calls a [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md)) is a
single statement, the stored procedure it calls can contain multiple statements. There are
special rules for stored procedures and transactions.

### Explicit transactions

A transaction can be started explicitly by executing a [BEGIN](sql/begin.md) statement. Snowflake
supports the synonyms BEGIN WORK and BEGIN TRANSACTION. Snowflake recommends using BEGIN TRANSACTION.

A transaction can be ended explicitly by executing [COMMIT](sql/commit.md) or
[ROLLBACK](sql/rollback.md). Snowflake supports the synonym COMMIT WORK for COMMIT, and the synonym
ROLLBACK WORK for ROLLBACK.

In general, if a transaction is already active, any BEGIN TRANSACTION statements are ignored. Users should avoid
extra BEGIN TRANSACTION statements, however, because they make it much more difficult for human readers to pair up a COMMIT (or ROLLBACK)
statement with the corresponding BEGIN TRANSACTION statement.

One exception to this rule involves a nested stored procedure call. For details, see
Scoped transactions.

> **Note:**
>
> Explicit transactions should contain only DML statements and query statements. DDL statements implicitly commit
> active transactions (for details, see the DDL section).

### Implicit transactions

Transactions can be started and ended implicitly, without an explicit BEGIN TRANSACTION or COMMIT/ROLLBACK.
Implicit transactions behave the same way as explicit transactions. However, the rules that determine when the
implicit transaction starts are different from the rules that determine when an explicit transaction starts.

The rules for stopping and starting depend upon whether the statement is a DDL statement, a DML statement, or a
query statement. If the statement is a DML or query statement, the rules depend upon whether AUTOCOMMIT is enabled.

#### DDL

Each DDL statement executes as a separate transaction.

If a DDL statement is executed while a transaction is active, the DDL statement:

1. Implicitly commits the active transaction.
2. Executes the DDL statement as a separate transaction.

Because a DDL statement is its own transaction, you cannot roll back a DDL statement; the transaction containing the
DDL completes before you can execute an explicit ROLLBACK.

If a DDL statement is followed immediately by a DML statement, that DML statement implicitly starts a new transaction.

#### AUTOCOMMIT

Snowflake supports an [AUTOCOMMIT](parameters.md) parameter. The default setting for AUTOCOMMIT is TRUE (enabled).

While AUTOCOMMIT is enabled:

* Each statement outside an explicit transaction is treated as though it is inside its own implicit
  single-statement transaction. In other words, that statement is automatically committed if it succeeds, and
  automatically rolled back if it fails.

  Statements inside an explicit transaction are not affected by AUTOCOMMIT. For example,
  statements inside an explicit BEGIN TRANSACTION … ROLLBACK are rolled back even if AUTOCOMMIT is TRUE.

While AUTOCOMMIT is disabled:

* An implicit BEGIN TRANSACTION is executed at:

  + The first DML statement after a transaction ends. This is true regardless of what ended the
    preceding transaction (for example, a DDL statement, or an explicit COMMIT or ROLLBACK).
  + The first DML statement after disabling AUTOCOMMIT.
* An implicit COMMIT is executed as follows (if a transaction is already active):

  + When a DDL statement is executed.
  + When an `ALTER SESSION SET AUTOCOMMIT` statement is executed, regardless of whether the new value is
    TRUE or FALSE, and whether the new value is different from the previous value.
    For example, even if you set AUTOCOMMIT to FALSE when it is already FALSE, an implicit COMMIT is executed.
* An implicit ROLLBACK is executed as follows (if a transaction is already active):

  + At the end of a session.
  + At the end of a stored procedure.

    Regardless of whether the stored procedure’s active transaction was started explicitly or implicitly,
    Snowflake rolls back the active transaction and issues an error message.

> **Caution:**
>
> Do not change AUTOCOMMIT settings inside a [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md).
> You will get an error message.

### Mixing implicit and explicit starts and ends of a transaction

To avoid writing confusing code, you should avoid mixing implicit and explicit starts and ends in the same
transaction. The following are legal, but discouraged:

* An implicitly started transaction can be ended by an explicit COMMIT or ROLLBACK.
* An explicitly started transaction can be ended by an implicit COMMIT or ROLLBACK.

### Failed statements within a transaction

Although a transaction is committed or rolled back as a unit, that is not quite the same as saying that
it succeeds or fails as a unit. If a statement fails within a transaction, you can still commit the transaction, rather than roll
it back.

When a DML statement or CALL statement in a transaction fails, the changes made by that failed statement are rolled back. However, the
transaction stays active until the entire transaction is committed or rolled back. If the transaction is committed,
the changes made by the successful statements are applied.

For example, consider the following code, which inserts two valid values and one invalid value into a table:

```sqlexample
CREATE TABLE table1 (i int);
BEGIN TRANSACTION;
INSERT INTO table1 (i) VALUES (1);
INSERT INTO table1 (i) VALUES ('This is not a valid integer.');    -- FAILS!
INSERT INTO table1 (i) VALUES (2);
COMMIT;
SELECT i FROM table1 ORDER BY i;
```

If the statements after the failed INSERT statement are executed, the output of the final SELECT statement includes the rows with
integer values 1 and 2, even though one of the other statements in the transaction failed.

> **Note:**
>
> The statements after the failed INSERT statement might or might not be executed. The behavior depends on how the statements are run and
> how errors are handled.
>
> For example:
>
> * If these statements are inside a stored procedure written in Snowflake Scripting language, the failed INSERT statement
>   throws an exception.
>
>   + If the exception is not handled, the stored procedure never completes, and the COMMIT is never executed, so the open
>     transaction is implicitly rolled back. In that case, the table does not contain the values `1` or `2`.
>   + If the stored procedure handles the exception and commits the statements prior to the failed INSERT statement, but does not
>     execute the statements after the failed INSERT statement, only the row with the value `1` is stored in the table.
> * If these statements are not inside a stored procedure, the behavior depends on how the statements are executed. For example:
>
>   + If the statements are executed through Snowsight, execution halts at the first error.
>   + If the statements are executed by SnowSQL using the `-f` (filename) option, execution does not halt at the first error,
>     and the statements after the error are executed.

### Transactions and multi-threading

Although multiple sessions cannot share the same transaction, multiple *threads* that use a single connection
share the same session, and thus share the same transaction. This behavior can lead to unexpected results, such
as one thread rolling back work that was done in another thread.

This situation can occur when a client application using a Snowflake driver (such as the
Snowflake JDBC Driver) or a connector (such as the Snowflake Connector for Python) is multi-threaded. If two
or more threads share the same connection, those threads also share the current transaction in that
connection. A BEGIN TRANSACTION, COMMIT, or ROLLBACK by one thread affects all threads using that shared connection.
If the threads are running asynchronously, the results can be unpredictable.

Similarly, changing the AUTOCOMMIT setting in one thread affects the AUTOCOMMIT setting in all other threads
that use the same connection.

Snowflake recommends that multi-threaded client programs do at least one of the following:

* Use a separate connection for each thread.

  Note that even with separate connections, your code can still hit race conditions that generate unpredictable
  output; for example, one thread might delete data before another thread tries to update it.
* Execute the threads synchronously rather than asynchronously, to control the order in which steps are performed.

## Stored procedures and transactions

In general, the rules described in the previous sections also apply to stored procedures.
This section provides additional information specific to stored procedures.

A transaction can be inside a stored procedure, or a stored procedure can be inside a transaction; however, a
transaction cannot be partly inside and partly outside a stored procedure, or started in one stored procedure and
finished in a different stored procedure.

For example:

* You cannot start a transaction before calling the stored procedure, then complete the transaction inside the
  stored procedure. If you try to do this, Snowflake reports an error like this:

  ```output
  Modifying a transaction that has started at a different scope is not allowed.
  ```
* You cannot start a transaction inside the stored procedure, then complete the transaction after returning from the
  procedure. If a transaction is started inside a stored procedure and is still active when the stored procedure
  finishes, an error occurs and the transaction is rolled back.

These rules also apply to nested stored procedures. If procedure `A` calls procedure `B`, procedure `B`
cannot complete a transaction that was started in procedure `A` or vice versa. Each BEGIN TRANSACTION in `A` must
have a corresponding COMMIT (or ROLLBACK) in `A`, and each BEGIN TRANSACTION in `B` must have a corresponding
COMMIT (or ROLLBACK) in `B`.

If a stored procedure contains an explicit transaction, that transaction can contain either part or all of the body of the
stored procedure. For example, in the following stored procedure, only some of the statements are inside the explicit
transaction. (This example, and several subsequent examples, use pseudo-code for simplicity.)

```sqlexample
CREATE PROCEDURE ...
  AS
  $$
    ...
    statement1;

    BEGIN TRANSACTION;
    statement2;
    COMMIT;

    statement3;
    ...

  $$;
```

### Non-overlapping transactions

The sections below describe:

* Using a stored procedure inside a transaction.
* Using a transaction inside a stored procedure.

#### Using a stored procedure inside a transaction

In the simplest case, a stored procedure is considered to be inside of a transaction if the following conditions are
met:

* A BEGIN TRANSACTION is executed before the stored procedure is called.
* The corresponding COMMIT (or ROLLBACK) is executed after the stored procedure completes.
* The body of the stored procedure does not contain an explicit or implicit BEGIN TRANSACTION or COMMIT
  (or ROLLBACK).

The stored procedure inside the transaction follows the rules of the enclosing transaction:

* If the transaction is committed, all the statements inside the procedure are committed.
* If the transaction is rolled back, all statements inside the procedure are rolled back.

The following pseudo-code shows a stored procedure called entirely inside an explicit transaction:

```sqlexample
CREATE PROCEDURE my_procedure()
  ...
  AS
  $$
    statement X;
    statement Y;
  $$;

BEGIN TRANSACTION;
  statement W;
  CALL my_procedure();
  statement Z;
COMMIT;
```

This is equivalent to executing the following sequence of statements:

```sqlexample
BEGIN TRANSACTION;
statement W;
statement X;
statement Y;
statement Z;
COMMIT;
```

#### Using a transaction in a stored procedure

You can execute zero, one, or more transactions inside a stored procedure. The following pseudo-code shows an example
of two transactions in one stored procedure:

```sqlexample
CREATE PROCEDURE p1()
...
$$
  BEGIN TRANSACTION;
  statement C;
  statement D;
  COMMIT;

  BEGIN TRANSACTION;
  statement E;
  statement F;
  COMMIT;
$$;
```

The stored procedure could be called as shown here:

```sqlexample
BEGIN TRANSACTION;
statement A;
statement B;
COMMIT;

CALL p1();

BEGIN TRANSACTION;
statement G;
statement H;
COMMIT;
```

This is equivalent to executing the following sequence:

```sqlexample
BEGIN TRANSACTION;
statement A;
statement B;
COMMIT;

BEGIN TRANSACTION;
statement C;
statement D;
COMMIT;

BEGIN TRANSACTION;
statement E;
statement F;
COMMIT;

BEGIN TRANSACTION;
statement G;
statement H;
COMMIT;
```

In this code, four separate transactions are executed. Each transaction either starts and completes outside the
procedure, or starts and completes inside the procedure. No transaction is split across a procedure boundary (partly
inside and partly outside the stored procedure). No transaction is nested in another transaction.

### Scoped transactions

A [stored procedure](../developer-guide/stored-procedure/stored-procedures-overview.md) that contains a transaction can be called from within another
transaction. For example, a transaction inside a stored procedure can include a call to another stored procedure that contains a
transaction.

Snowflake does not treat the inner transaction as nested; instead, the inner transaction is
a separate transaction. Snowflake calls these “autonomous scoped transactions” (or simply “scoped
transactions”).

The starting point and ending point of each scoped transaction determine which statements are included in the transaction. The start and
end can be explicit or implicit. Each SQL statement is part of only one transaction. An enclosing ROLLBACK or COMMIT does not undo an
enclosed COMMIT or ROLLBACK.

> **Note:**
>
> The terms “inner” and “outer” are commonly used when describing nested operations, such as nested stored procedure
> calls. However, transactions in Snowflake are not truly “nested”; therefore, to reduce confusion when referring to
> transactions, this document frequently uses the terms “enclosed” and “enclosing”, rather than “inner” and “outer”.

The diagram below shows two stored procedures and two scoped transactions. In this example, each stored
procedure contains its own independent transaction. The first stored procedure calls the second stored procedure,
so the procedures overlap in time; however, they do not overlap in content. All the statements inside the shaded
inner box are in one transaction; all the other statements are in another transaction.

In the next example, the transaction boundaries are different from the stored procedure boundaries; the transaction
that starts in the enclosing stored procedure includes some but not all of the statements in the enclosed stored procedure.

In the code above, the second stored procedure contains some statements (`SP2_T1_S2` and `SP2_T1_S3`) that are in the
scope of the first transaction. Only statement `SP2_T2_S1`, inside the shaded inner box, is in the scope of the second
transaction.

The next example demonstrates the problems that occur if a transaction does not begin and end within the same stored
procedure. The example contains the same number of COMMIT statements as BEGIN statements. However, the
BEGIN and COMMIT statements are not paired properly, so this example contains two errors:

* The enclosing stored procedure starts a scoped transaction, but doesn’t explicitly complete it. Therefore
  that scoped transaction causes an error at the end of that stored procedure, and the active transaction is
  implicitly rolled back.
* The second stored procedure contains a COMMIT, but there is no corresponding BEGIN in that stored procedure.
  This COMMIT does *not* commit the open transaction started in the first stored procedure. Instead, the
  improperly paired COMMIT causes an error.

The next example shows three scoped transactions that overlap in time. In this example,
stored procedure `p1()` calls another stored procedure `p2()` from inside a transaction, and `p2()`
contains its own transaction, so the transaction started in `p2()` also runs independently.
(This example uses pseudo-code.)

```sqlexample
CREATE PROCEDURE p2()
...
$$
  BEGIN TRANSACTION;
  statement C;
  COMMIT;
$$;

CREATE PROCEDURE p1()
...
$$
  BEGIN TRANSACTION;
  statement B;
  CALL p2();
  statement D;
  COMMIT;
$$;

BEGIN TRANSACTION;
statement A;
CALL p1();
statement E;
COMMIT;
```

In these three scoped transactions:

* The transaction that is outside any stored procedure contains statements `A` and `E`.
* The transaction in stored procedure `p1()` contains statements `B` and `D`
* The transaction in `p2()` contains statement `C`.

The rules for scoped transactions also apply to recursive stored procedure calls. A recursive call is just a specific
type of nested call, and follows the same transaction rules as a nested call.

> **Caution:**
>
> Overlapping scoped transactions can cause a deadlock if they manipulate the
> same database object (such as a table). Scoped transactions should be used only when necessary.

#### Implicit commits for transactions inside stored procedures

Some commands, including most DDL statements, implicitly commit any active transaction. If an outer stored procedure
opens a transaction and an inner stored procedure executes such a command, the command returns the following error message:

```output
Modifying a transaction that has started at a different scope is not allowed.
```

For example, the following code fails because the DROP TAG statement in the inner procedure attempts to implicitly
commit the transaction that was started in the outer procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE test_scoped_outer()
  RETURNS VARIANT
  LANGUAGE JAVASCRIPT
  EXECUTE AS CALLER
AS $$
  snowflake.execute({sqlText: `BEGIN TRANSACTION;`});
  snowflake.execute({sqlText: `CALL test_scoped();`});
  snowflake.execute({sqlText: `COMMIT;`});
$$;

CREATE OR REPLACE PROCEDURE test_scoped()
  RETURNS VARIANT
  LANGUAGE JAVASCRIPT
  EXECUTE AS CALLER
AS $$
  snowflake.execute({sqlText: `CREATE OR REPLACE TAG test;`}); -- works
  snowflake.execute({sqlText: `DROP TAG IF EXISTS test;`});    -- fails
$$;

CALL test_scoped_outer();
```

To avoid this error, do not execute DDL statements (or other commands that implicitly commit transactions) inside
a stored procedure that might be called from within an active transaction started in another scope.

#### Implicit ROLLBACK for transactions at the end of stored procedures

When AUTOCOMMIT is disabled, be especially careful with combining implicit transactions and stored procedures. If you
accidentally leave a transaction active at the end of a stored procedure, the transaction is rolled back.

For example, the following pseudo-code example causes an implicit ROLLBACK at the end of the stored procedure:

```sqlexample
CREATE PROCEDURE p1() ...
$$
  INSERT INTO parent_table ...;
  INSERT INTO child_table ...;
$$;

ALTER SESSION SET AUTOCOMMIT = FALSE;
CALL p1;
COMMIT WORK;
```

In this example, the command to set AUTOCOMMIT commits any active transaction. A new transaction is not started
immediately. The stored procedure contains a DML statement, which implicitly begins a new transaction. That
implicit BEGIN TRANSACTION does not have a matching COMMIT or ROLLBACK in the stored procedure. Because there is an
active transaction at the end of the stored procedure, that active transaction is implicitly rolled back.

If you want to run the entire stored procedure in a single transaction, start the transaction before you call
the stored procedure, and commit the transaction after the call:

```sqlexample
CREATE PROCEDURE p1() ...
$$
  INSERT INTO parent_table ...;
  INSERT INTO child_table ...;
$$;

ALTER SESSION SET AUTOCOMMIT = FALSE;
BEGIN TRANSACTION;
CALL p1;
COMMIT WORK;
```

In this case, the BEGIN and COMMIT are properly paired, and the code executes without error.

As an alternative, put both the BEGIN TRANSACTION and the COMMIT inside the stored procedure, as shown in the
following pseudo-code example:

```sqlexample
CREATE PROCEDURE p1() ...
$$
  BEGIN TRANSACTION;
  INSERT INTO parent_table ...;
  INSERT INTO child_table ...;
  COMMIT WORK;
$$;

ALTER SESSION SET AUTOCOMMIT = FALSE;
CALL p1;
```

#### Improperly paired BEGIN/COMMIT blocks in scoped transactions

If you do not pair your BEGIN/COMMIT blocks properly in a scoped transaction, Snowflake reports an error. That error can have further
impacts, such as preventing a stored procedure from being completed or preventing an enclosing transaction from being committed. For
example, in the following pseudo-code example, some statements in the enclosing stored procedure, as well as the enclosed stored
procedure, are rolled back:

```sqlexample
CREATE OR REPLACE PROCEDURE outer_sp1()
...
AS
$$
  INSERT 'osp1_alpha';
  BEGIN WORK;
  INSERT 'osp1_beta';
  CALL inner_sp2();
  INSERT 'osp1_delta';
  COMMIT WORK;
  INSERT 'osp1_omega';
$$;

CREATE OR REPLACE PROCEDURE inner_sp2()
...
AS
$$
  BEGIN WORK;
  INSERT 'isp2';
  -- Missing COMMIT, so implicitly rolls back!
$$;

CALL outer_sp1();

SELECT * FROM st;
```

In this example, the only value that is inserted is `osp1_alpha`. None of the other values are inserted because a COMMIT is not correctly
paired with a BEGIN. The error is handled as follows:

1. When procedure `inner_sp2()` finishes, Snowflake detects that the BEGIN in `inner_sp2()` has no corresponding COMMIT (or ROLLBACK).

   1. Snowflake implicitly rolls back the scoped transaction that started in `inner_sp2()`.
   2. Snowflake also returns an error because the CALL to `inner_sp2()` failed.
2. Because the CALL to `inner_sp2()` failed, and because that CALL statement was in `outer_sp1()`, the stored procedure `outer_sp1()`
   itself also fails and returns an error, rather than continuing.
3. Because `outer_sp1()` does not finish executing:

   * The INSERT statements for values `osp1_delta` and `osp1_omega` never execute.
   * The open transaction in `outer_sp1()` is implicitly rolled back rather than committed, so the insert of value `osp1_beta` is never
     committed.

## Apache Iceberg™ tables and transactions

The Snowflake transaction principles generally apply to Apache Iceberg™ tables. For more information
about transactions specific to Iceberg tables, see the [Iceberg topic on transactions](../user-guide/tables-iceberg-transactions.md).

## READ COMMITTED isolation level

READ COMMITTED is the only isolation level currently supported for tables. With READ COMMITTED isolation, a statement sees only data that was
committed before the statement began. It never sees uncommitted data.

When a statement is executed inside a multi-statement transaction:

* A statement sees only data that was committed before the *statement* began.
  *Two successive statements in the same transaction can see different data if another transaction is committed
  between the execution of the first and the second statements.*
* A statement *does* see the changes made by previous statements executed *within* the same transaction,
  even though those changes are not yet committed.

## Read consistency across sessions

In general, Snowflake maintains read consistency for all changes that occur *within any given session*, such as changes introduced by DDL and DML operations.
When a user starts a new session, all changes that were committed before the session started, and all changes that are committed within the session, are
immediately visible to subsequent queries in that session. This is standard behavior and matches the requirements for most workloads.

If you want to extend read consistency to be guaranteed *across sessions* that are running queries in a near-concurrent fashion, and you are willing to accept a
small delay in query response times (usually milliseconds), set the [READ_CONSISTENCY_MODE](parameters.md) parameter to `'GLOBAL'`. By setting
this parameter, you change the default behavior such that queries read any near-concurrent changes that occur in concurrently running sessions. An alternative way to
guarantee this level of consistency is to run all queries in the same session.

For example, using the default `'SESSION'` value:

* Session 1 starts.
* Session 2 starts.
* Session 1 inserts a row into table `t`.
* Session 1 selects data from table `t` and sees the new row immediately.
* Session 2 runs the same query. Session 2 might not see the new row.

To guarantee that Sessions 1 and 2 get the same result for the same query in this scenario, follow one of these three steps, which are presented
in order, from most recommended to least recommended:

1. Use a single session for all of the queries that depend on each other. In this case:

   * Session 1 starts.
   * Session 1 inserts a row into table `t`.
   * Session 1 selects data from table `t` and sees the new row immediately.
2. Start Session 2 after the changes are committed in Session 1. In this case:

   * Session 1 starts.
   * Session 1 inserts a row into table `t`.
   * Session 2 starts.
   * Session 1 selects data from table `t` and sees the new row immediately.
   * Session 2 runs the same query and is guaranteed to see the new row.
3. Use the [ALTER ACCOUNT](sql/alter-account.md) command to set READ_CONSISTENCY_MODE to `'GLOBAL'`:

   ```sqlexample
   ALTER ACCOUNT SET READ_CONSISTENCY_MODE = 'GLOBAL';
   ```

   This parameter can only be set at the account level by a user with ACCOUNTADMIN privileges.

## Resource locking

Transactional operations acquire locks on a resource, such as a table, while that resource is being modified. Locks
block other statements from modifying the resource until the lock is released.

The following guidelines apply in most situations:

* COMMIT operations (including both AUTOCOMMIT and explicit COMMIT) lock resources, but usually only briefly.
* CREATE TABLE, CREATE DYNAMIC TABLE, CREATE STREAM, and ALTER TABLE operations all lock their underlying resources when setting CHANGE_TRACKING = TRUE, but usually only briefly.
  Only UPDATE and DELETE DML operations are blocked when a table is locked. INSERT operations are *not* blocked.
* UPDATE, DELETE, and MERGE statements hold locks that generally prevent them from running in parallel with other UPDATE, DELETE, and MERGE statements.

  For [hybrid tables](../user-guide/tables-hybrid.md), locks are held on individual rows. Locks on UPDATE, DELETE, and MERGE statements only prevent parallel
  UPDATE, DELETE, and MERGE statements that operate on the same row or rows. UPDATE, DELETE, and MERGE on different rows in the same table can progress.
* Most INSERT and COPY statements write only new partitions. Those statements often can run in parallel with other
  INSERT and COPY operations, and sometimes can run in parallel with an UPDATE, DELETE, or MERGE statement.

  Avoid executing INSERT and COPY statements concurrently with DDL statements on the same object in different sessions, because doing so can result in
  inconsistencies. When INSERT or COPY statements are executed on an object in an explicit transaction,
  avoid DDL statements on the same object in different sessions for the duration of the transaction. For example, don’t run INSERT statements on a table
  in one session while simultaneously running a DDL statement that changes the data type of a column in the table in a different session.

Locks held by a statement are released on [COMMIT](sql/commit.md) or [ROLLBACK](sql/rollback.md) of the transaction.

### Lock timeout parameters

Two parameters control timeout for locks: [LOCK_TIMEOUT](parameters.md) and [HYBRID_TABLE_LOCK_TIMEOUT](parameters.md).

#### LOCK_TIMEOUT parameter

A blocked statement either acquires a lock on the resource it is waiting for or times out, while waiting for the resource to become available. You can set the
length of time (in seconds) that a statement should block by setting the LOCK_TIMEOUT parameter.

For example, to change the lock timeout to 2 hours (7200 seconds) for the current session:

```sqlexample
ALTER SESSION SET LOCK_TIMEOUT=7200;
SHOW PARAMETERS LIKE 'lock_timeout';
```

```output
+--------------+-------+---------+---------+-------------------------------------------------------------------------------+--------+
| key          | value | default | level   | description                                                                   | type   |
|--------------+-------+---------+---------+-------------------------------------------------------------------------------+--------|
| LOCK_TIMEOUT | 7200  | 43200   | SESSION | Number of seconds to wait while trying to lock a resource, before timing out  | NUMBER |
|              |       |         |         | and aborting the statement. A value of 0 turns off lock waiting i.e. the      |        |
|              |       |         |         | statement must acquire the lock immediately or abort. If multiple resources   |        |
|              |       |         |         | need to be locked by the statement, the timeout applies separately to each    |        |
|              |       |         |         | lock attempt.                                                                 |        |
+--------------+-------+---------+---------+-------------------------------------------------------------------------------+--------+
```

#### HYBRID_TABLE_LOCK_TIMEOUT parameter

A blocked statement on a hybrid table either acquires a row-level lock on the table it is waiting for or times out, while waiting for the table to become available.
You can set the length of time (in seconds) that a statement should block by setting the HYBRID_TABLE_LOCK_TIMEOUT parameter.

For example, to change the hybrid table lock timeout to 10 minutes (600 seconds) for the current session:

```sqlexample
ALTER SESSION SET HYBRID_TABLE_LOCK_TIMEOUT=600;
SHOW PARAMETERS LIKE 'hybrid_table_lock_timeout';
```

```output
+---------------------------+-------+---------+---------+--------------------------------------------------------------------------------+--------|
| key                       | value | default | level   | description                                                                    | type   |
|---------------------------+-------+---------+---------+--------------------------------------------------------------------------------+--------+
| HYBRID_TABLE_LOCK_TIMEOUT | 600   | 3600    | SESSION | Number of seconds to wait while trying to acquire locks, before timing out and | NUMBER |
|                           |       |         |         | aborting the statement. A value of 0 turns off lock waiting i.e. the statement |        |
|                           |       |         |         | must acquire the lock immediately or abort.                                    |        |
+---------------------------+-------+---------+---------+--------------------------------------------------------------------------------+--------+
```

### Deadlocks

Deadlocks may occur when concurrent transactions are waiting on resources that are locked by each other.

Note the following rules:

* Deadlocks cannot occur while autocommit query statements are being executed concurrently. This is true for both standard tables and hybrid tables because
  SELECT statements are always read-only.
* Deadlocks cannot occur with autocommit DML operations on standard tables, but they can occur with autocommit DML operations on hybrid tables.
* Deadlocks can occur when transactions are started explicitly and multiple statements are executed in each transaction. Snowflake detects deadlocks and
  chooses the most recent statement that is part of the deadlock as the victim. The statement is rolled back, but the transaction itself remains active
  and must be committed or rolled back.

Deadlock detection can take time.

## Managing transactions and locks

Snowflake provides the following SQL commands to help you monitor and manage transactions and locks:

* [DESCRIBE TRANSACTION](sql/desc-transaction.md)
* [ROLLBACK](sql/rollback.md)
* [SHOW LOCKS](sql/show-locks.md)
* [SHOW TRANSACTIONS](sql/show-transactions.md)

The [LOCK_WAIT_HISTORY view](account-usage/lock_wait_history.md) logs a detailed history of transactions with respect
to locking, showing when specific locks were requested and acquired.

In addition, Snowflake provides the following context functions for obtaining information about transactions within a session:

* [CURRENT_STATEMENT](functions/current_statement.md)
* [CURRENT_TRANSACTION](functions/current_transaction.md)
* [LAST_QUERY_ID](functions/last_query_id.md)
* [LAST_TRANSACTION](functions/last_transaction.md)

You can call the following function to abort a transaction: [SYSTEM$ABORT_TRANSACTION](functions/system_abort_transaction.md).

### Aborting transactions

If a transaction is running in a session and the session disconnects abruptly, preventing the transaction from committing or rolling back, the transaction is left in a
detached state, including any locks that the transaction is holding on resources. If this happens, you might need to abort the transaction.

To abort a running transaction, the user who started the transaction or an account administrator can call the system function, [SYSTEM$ABORT_TRANSACTION](functions/system_abort_transaction.md).

If the transaction is not aborted by the user:

* And it blocks another transaction from acquiring a lock on the same table *and* is idle for 5 minutes, it is automatically aborted and rolled back.
* And it does *not* block other transactions from modifying the same table and is older than 4 hours, it is automatically aborted and rolled back.
* And it reads from or writes to hybrid tables, and is idle for 5 minutes, it is automatically aborted and rolled back, regardless of whether it blocks
  other transactions from modifying the same table.

To allow a statement error within a transaction to abort the transaction, set the [TRANSACTION_ABORT_ON_ERROR](parameters.md) parameter at the session or account level.

### Analyzing blocked transactions with the LOCK_WAIT_HISTORY view

The [LOCK_WAIT_HISTORY view](account-usage/lock_wait_history.md) returns transaction details that can be useful in analyzing blocked transactions.
Each row in the output includes the details for a transaction that is waiting on a lock and the details of transactions that are holding
that lock or waiting ahead for that lock.

For example, see the scenario below:

In this scenario, the following data is returned:

* Transaction B is the transaction that is waiting for a lock.
* Transaction B requested the lock at timestamp T1.
* Transaction A is the transaction that holds the lock.
* Query 2 in Transaction A is the blocker query.

Query 2 is the blocker query because it is the first statement in Transaction A (the transaction holding the lock) that
Transaction B (the transaction waiting for the lock) started waiting on.

However, note that a later query in Transaction A (Query 5) also acquired the lock. It is possible that subsequent concurrent executions of these
transactions could cause Transaction B to block on a different query that acquires the lock in Transaction A. Therefore, you must investigate all queries in
the first blocker transaction.

See also Transaction and lock visibility for hybrid tables.

#### Examining a long-running statement

1. Query the Account Usage [QUERY_HISTORY view](account-usage/query_history.md) for transactions that waited for locks in the last 24 hours:

   ```sqlexample
   SELECT query_id, query_text, start_time, session_id, execution_status, total_elapsed_time,
          compilation_time, execution_time, transaction_blocked_time
     FROM snowflake.account_usage.query_history
     WHERE start_time >= dateadd('hours', -24, current_timestamp())
     AND transaction_blocked_time > 0
     ORDER BY transaction_blocked_time DESC;
   ```

   Review the results of the query and note the query IDs of the queries with high TRANSACTION_BLOCKED_TIME values.
2. To find blocker transactions for the queries identified from the previous step, query the LOCK_WAIT_HISTORY view for rows with
   those query IDs:

   ```sqlexample
   SELECT object_name, lock_type, transaction_id, blocker_queries
     FROM snowflake.account_usage.lock_wait_history
     WHERE query_id = '<query_id>';
   ```

   There may be multiple queries in the `blocker_queries` column in the results. Note the `transaction_id` of each blocker query
   in the output.
3. Query the QUERY_HISTORY view for each transaction in the `blocker_queries` output:

   ```sqlexample
   SELECT query_id, query_text, start_time, session_id, execution_status, total_elapsed_time, compilation_time, execution_time
     FROM snowflake.account_usage.query_history
     WHERE transaction_id = '<transaction_id>';
   ```

   Investigate the query results. If a statement in the transaction was a DML statement and operated on the locked resource, it may
   have acquired the lock at some point during the transaction.

### Monitoring transactions and locks

You can use the [SHOW TRANSACTIONS](sql/show-transactions.md) command to return a list of transactions being run by the current user (in all of that user’s sessions) or by all users in all sessions in the account. The following example is for the current user’s sessions.

```sqlexample
SHOW TRANSACTIONS;
```

```output
+---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------+
|                  id | user    |         session | name                                 | started_on                    | state   | scope |
|---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------|
| 1721165674582000000 | CALIBAN | 186457423713330 | 551f494d-90ed-438d-b32b-1161396c3a22 | 2024-07-16 14:34:34.582 -0700 | running |     0 |
| 1721165584820000000 | CALIBAN | 186457423749354 | a092aa44-9a0a-4955-9659-123b35c0efeb | 2024-07-16 14:33:04.820 -0700 | running |     0 |
+---------------------+---------+-----------------+--------------------------------------+-------------------------------+---------+-------+
```

Every Snowflake transaction is assigned a unique transaction ID. The `id` value is a signed 64-bit (long) integer. The range of
values is -9,223,372,036,854,775,808 (-2 63) to 9,223,372,036,854,775,807 (2 63 - 1).

You can also use the [CURRENT_TRANSACTION](functions/current_transaction.md) function to return the transaction ID of the transaction currently running in the session.

```sqlexample
SELECT CURRENT_TRANSACTION();
```

```output
+-----------------------+
| CURRENT_TRANSACTION() |
|-----------------------|
| 1721161383427000000   |
+-----------------------+
```

If you know the transaction ID you want to monitor, you can use the [DESCRIBE TRANSACTION](sql/desc-transaction.md) command to return details about the transaction,
while it is still running or after it has committed or aborted. For example:

```sqlexample
DESCRIBE TRANSACTION 1721161383427000000;
```

```output
+---------------------+---------+----------------+--------------------------------------+-------------------------------+-----------+-------------------------------+
|                  id | user    |        session | name                                 | started_on                    | state     | ended_on                      |
|---------------------+---------+----------------+--------------------------------------+-------------------------------+-----------+-------------------------------|
| 1721161383427000000 | CALIBAN | 10363774361222 | 7db0ec5c-2e5d-47be-ac37-66cbf905668b | 2024-07-16 13:23:03.427 -0700 | committed | 2024-07-16 13:24:14.402 -0700 |
+---------------------+---------+----------------+--------------------------------------+-------------------------------+-----------+-------------------------------+
```

### Transaction and lock visibility for hybrid tables

When you are looking at the output of commands and views for transactions that access hybrid tables, or locks on hybrid table rows,
note the following behavior:

* Transactions are listed only if they are blocking other transactions, or if they are blocked.
* Keep in mind that transactions that access hybrid tables hold row-level locks (`ROW` type). If two transactions access different rows in the same table, they do not
  block each other.
* Transactions are listed only if a blocked transaction has been blocked for more than 5 seconds.
* When a transaction is no longer blocked, it might still appear in the output, but for no more than 15 seconds.

Similarly, in the SHOW LOCKS output, the following rules apply:

* A lock is listed only if one transaction holds the lock and the other transaction is blocked on that particular lock.
* In the `type` column, hybrid table locks show `ROW`.
* The `resource` column always shows the blocking transaction ID. (The blocked transaction is blocked by the transaction with this ID.)
* In many cases, queries against hybrid tables do not generate query IDs. See [Usage notes](account-usage/query_history.md).

For example:

```sqlexample
SHOW LOCKS;
```

```output
+---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------+
| resource            | type |         transaction | transaction_started_on        | status  | acquired_on | query_id                             |
|---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------|
| 1721165584820000000 | ROW  | 1721165584820000000 | 2024-07-16 14:33:04.820 -0700 | HOLDING | NULL        |                                      |
| 1721165584820000000 | ROW  | 1721165674582000000 | 2024-07-16 14:34:34.582 -0700 | WAITING | NULL        | 01b5b715-0002-852b-0000-a99500665352 |
+---------------------+------+---------------------+-------------------------------+---------+-------------+--------------------------------------+
```

In the [LOCK_WAIT_HISTORY view](account-usage/lock_wait_history.md), the output behaves as follows:

* The `requested_at` and `acquired_at` columns define when row-level locks were requested and acquired, subject to the general
  rules for reporting transaction activity with hybrid tables.
* The `lock_type` and `object_name` columns both show the value `Row`.
* The `schema_id` and `schema_name` columns are always empty (`0` and NULL, respectively).
* The `object_id` column always shows the blocking object’s ID.
* The `blocker_queries` column is a JSON array with exactly one element, which shows the blocking transaction.
* Even if multiple transactions are blocked on the same row, they are shown as multiple rows in the output.

For example:

```sqlexample
SELECT query_id, object_name, transaction_id, blocker_queries
  FROM SNOWFLAKE.ACCOUNT_USAGE.LOCK_WAIT_HISTORY
  WHERE requested_at >= DATEADD('hours', -48, CURRENT_TIMESTAMP()) LIMIT 1;
```

```output
+--------------------------------------+-------------+---------------------+---------------------------------------------------------+
| QUERY_ID                             | OBJECT_NAME |      TRANSACTION_ID | BLOCKER_QUERIES                                         |
|--------------------------------------+-------------+---------------------+---------------------------------------------------------|
| 01b5b715-0002-852b-0000-a99500665352 | Row         | 1721165674582000000 | [                                                       |
|                                      |             |                     |   {                                                     |
|                                      |             |                     |     "is_snowflake": false,                              |
|                                      |             |                     |     "query_id": "01b5b70d-0002-8513-0000-a9950065d43e", |
|                                      |             |                     |     "transaction_id": 1721165584820000000               |
|                                      |             |                     |   }                                                     |
|                                      |             |                     | ]                                                       |
+--------------------------------------+-------------+---------------------+---------------------------------------------------------+
```

## Best practices

* A transaction should contain statements that are related and should succeed or fail together, for example,
  withdrawing money from one account and depositing that same money to another account. If a rollback occurs, either
  the payer or the recipient ends up with the money; the money never “disappears” (withdrawn from one account but
  never deposited to the other account).

  In general, one transaction should contain only related statements. Making a statement less granular means that
  when a transaction is rolled back, it might roll back useful work that didn’t actually need to be rolled back.
* Larger transactions can improve performance in some cases for standard tables, but not typically for hybrid tables.

  Although the preceding bullet point emphasized the importance of grouping only statements that truly need to be
  committed or rolled back as a group, larger transactions can sometimes be useful.
  In Snowflake, as in most databases, managing transactions consumes resources. For example, inserting 10 rows in
  one transaction is generally faster and cheaper than inserting one row each in 10 separate transactions.
  Combining multiple statements into a single transaction can improve performance.
* Overly large transactions can reduce parallelism or increase deadlocks. If you do decide to group unrelated
  statements to improve performance (as described in the previous bullet point), keep in mind that a transaction
  can acquire locks on resources, which can delay other queries or lead to
  deadlocks.
* For hybrid tables:

  + AUTOCOMMIT DML statements in general run much faster than non-AUTOCOMMIT DML statements.
  + Relatively small AUTOCOMMIT DML statements run much faster than non-AUTOCOMMIT DML statements.
    DML statements that run in under 5 seconds or access no more than 1 MB of data take advantage of a fast mode
    that is not available to longer-running or larger DML statements.
* Snowflake recommends keeping AUTOCOMMIT enabled and using explicit transactions as much as possible. Using
  explicit transactions makes it easier for human readers to see where transactions begin and end. This, combined with
  AUTOCOMMIT, makes your code less likely to experience unintended rollbacks, for example at the end of a
  stored procedure.
* Avoid changing AUTOCOMMIT merely to start a new transaction implicitly. Instead, use BEGIN TRANSACTION
  to make it more obvious where a new transaction starts.
* Avoid executing more than one BEGIN TRANSACTION statement in a row. Extra BEGIN TRANSACTION statements make it harder to see where
  a transaction actually begins, and make it harder to pair COMMIT/ROLLBACK commands with the corresponding BEGIN TRANSACTION.

## Examples

### Simple example of scoped transaction and stored procedure

This is a simple example of scoped transactions. The stored procedure contains a transaction that inserts a
row with the value 12 and then rolls back. The outer transaction commits. The output shows that all rows
in the scope of the outer transaction are kept, while the row in the scope of the inner transaction
is not kept.

Note that because only part of the stored procedure is inside its own transaction, values inserted by the INSERT statements that are
in the stored procedure, but outside the stored procedure’s transaction, are kept.

Create two tables:

```none
create table tracker_1 (id integer, name varchar);
create table tracker_2 (id integer, name varchar);
```

Create the stored procedure:

```none
create procedure sp1()
returns varchar
language javascript
AS
$$
    // This is part of the outer transaction that started before this
    // stored procedure was called. This is committed or rolled back
    // as part of that outer transaction.
    snowflake.execute (
        {sqlText: "insert into tracker_1 values (11, 'p1_alpha')"}
        );

    // This is an independent transaction. Anything inserted as part of this
    // transaction is committed or rolled back based on this transaction.
    snowflake.execute (
        {sqlText: "begin transaction"}
        );
    snowflake.execute (
        {sqlText: "insert into tracker_2 values (12, 'p1_bravo')"}
        );
    snowflake.execute (
        {sqlText: "rollback"}
        );

    // This is part of the outer transaction started before this
    // stored procedure was called. This is committed or rolled back
    // as part of that outer transaction.
    snowflake.execute (
        {sqlText: "insert into tracker_1 values (13, 'p1_charlie')"}
        );

    // Dummy value.
    return "";
$$;
```

Call the stored procedure:

```none
begin transaction;
insert into tracker_1 values (00, 'outer_alpha');
call sp1();
insert into tracker_1 values (09, 'outer_zulu');
commit;
```

The results should include 00, 11, 13, and 09. The row with ID = 12 should not be included. This row was in the scope
of the enclosed transaction, which was rolled back. All other rows were in the scope of the outer transaction, and
were committed. Note in particular that the rows with IDs 11 and 13 were inside the stored procedure, but outside the
innermost transaction; they are in the scope of the enclosing transaction, and were committed with that.

```none
select id, name FROM tracker_1
union all
select id, name FROM tracker_2
order by id;
+----+-------------+
| ID | NAME        |
|----+-------------|
|  0 | outer_alpha |
|  9 | outer_zulu  |
| 11 | p1_alpha    |
| 13 | p1_charlie  |
+----+-------------+
```

### Logging information independently of a transaction’s success

This is a simple, practical example of how to use a scoped transaction. In this example, a transaction
logs certain information; that logged information is preserved whether the transaction itself succeeds or fails.
This technique can be used to track all attempted actions, whether or not each succeeded.

Create two tables:

```none
create table data_table (id integer);
create table log_table (message varchar);
```

Create the stored procedure:

```none
create procedure log_message(MESSAGE VARCHAR)
returns varchar
language javascript
AS
$$
    // This is an independent transaction. Anything inserted as part of this
    // transaction is committed or rolled back based on this transaction.
    snowflake.execute (
        {sqlText: "begin transaction"}
        );
    snowflake.execute (
        {sqlText: "insert into log_table values ('" + MESSAGE + "')"}
        );
    snowflake.execute (
        {sqlText: "commit"}
        );

    // Dummy value.
    return "";
$$;

create procedure update_data()
returns varchar
language javascript
AS
$$
    snowflake.execute (
        {sqlText: "begin transaction"}
        );
    snowflake.execute (
        {sqlText: "insert into data_table (id) values (17)"}
        );
    snowflake.execute (
        {sqlText: "call log_message('You should see this saved.')"}
        );
    snowflake.execute (
        {sqlText: "rollback"}
        );

    // Dummy value.
    return "";
$$;
```

Call the stored procedure:

```sqlexample
begin transaction;
call update_data();
rollback;
```

The data table is empty because the transaction was rolled back:

```sqlexample
select * from data_table;
+----+
| ID |
|----|
+----+
```

However, the logging table is not empty; the insert into the logging table was done in a separate transaction from
the insert into data_table.

```sqlexample
select * from log_table;
+----------------------------+
| MESSAGE                    |
|----------------------------|
| You should see this saved. |
+----------------------------+
```

### Examples of scoped transactions and stored procedures

The next few examples use the tables and stored procedures shown below. By passing appropriate parameters, the caller
can control where BEGIN TRANSACTION, COMMIT, and ROLLBACK statements are executed inside the stored procedures.

Create the tables:

```none
create table tracker_1 (id integer, name varchar);
create table tracker_2 (id integer, name varchar);
create table tracker_3 (id integer, name varchar);
```

This procedure is the enclosing stored procedure, and depending upon the parameters passed to it, can create an
enclosing transaction.

```none
create procedure sp1_outer(
    USE_BEGIN varchar,
    USE_INNER_BEGIN varchar,
    USE_INNER_COMMIT_OR_ROLLBACK varchar,
    USE_COMMIT_OR_ROLLBACK varchar
    )
returns varchar
language javascript
AS
$$
    // This should be part of the outer transaction started before this
    // stored procedure was called. This should be committed or rolled back
    // as part of that outer transaction.
    snowflake.execute (
        {sqlText: "insert into tracker_1 values (11, 'p1_alpha')"}
        );

    // This is an independent transaction. Anything inserted as part of this
    // transaction is committed or rolled back based on this transaction.
    if (USE_BEGIN != '')  {
        snowflake.execute (
            {sqlText: USE_BEGIN}
            );
        }
    snowflake.execute (
        {sqlText: "insert into tracker_2 values (12, 'p1_bravo')"}
        );
    // Call (and optionally begin/commit-or-rollback) an inner stored proc...
    var command = "call sp2_inner('";
    command = command.concat(USE_INNER_BEGIN);
    command = command.concat("', '");
    command = command.concat(USE_INNER_COMMIT_OR_ROLLBACK);
    command = command.concat( "')" );
    snowflake.execute (
        {sqlText: command}
        );
    if (USE_COMMIT_OR_ROLLBACK != '') {
        snowflake.execute (
            {sqlText: USE_COMMIT_OR_ROLLBACK}
            );
        }

    // This is part of the outer transaction started before this
    // stored procedure was called. This is committed or rolled back
    // as part of that outer transaction.
    snowflake.execute (
        {sqlText: "insert into tracker_1 values (13, 'p1_charlie')"}
        );

    // Dummy value.
    return "";
$$;
```

This procedure is the inner stored procedure, and depending upon the parameters passed to it, can create an
enclosed transaction.

```none
create procedure sp2_inner(
    USE_BEGIN varchar,
    USE_COMMIT_OR_ROLLBACK varchar)
returns varchar
language javascript
AS
$$
    snowflake.execute (
        {sqlText: "insert into tracker_2 values (21, 'p2_alpha')"}
        );

    if (USE_BEGIN != '')  {
        snowflake.execute (
            {sqlText: USE_BEGIN}
            );
        }
    snowflake.execute (
        {sqlText: "insert into tracker_3 values (22, 'p2_bravo')"}
        );
    if (USE_COMMIT_OR_ROLLBACK != '')  {
        snowflake.execute (
            {sqlText: USE_COMMIT_OR_ROLLBACK}
            );
        }

    snowflake.execute (
        {sqlText: "insert into tracker_2 values (23, 'p2_charlie')"}
        );

    // Dummy value.
    return "";
$$;
```

#### Commit the middle level of three levels

This example contains 3 transactions. This example commits the “middle” level (the transaction enclosed by the
outer-most transaction and enclosing the inner-most transaction). This rolls back the outer-most and
inner-most transactions.

```none
begin transaction;
insert into tracker_1 values (00, 'outer_alpha');
call sp1_outer('begin transaction', 'begin transaction', 'rollback', 'commit');
insert into tracker_1 values (09, 'outer_charlie');
rollback;
```

The result is that only the rows in the middle transaction (12, 21, and 23) are committed. The rows in the outer
transaction and the inner transaction are not committed.

```none
-- Should return only 12, 21, 23.
select id, name from tracker_1
union all
select id, name from tracker_2
union all
select id, name from tracker_3
order by id;
+----+------------+
| ID | NAME       |
|----+------------|
| 12 | p1_bravo   |
| 21 | p2_alpha   |
| 23 | p2_charlie |
+----+------------+
```

#### Roll back the middle level of three levels

This example contains 3 transactions. This example rolls back the “middle” level (the transaction enclosed by the
outer-most transaction and enclosing the inner-most transaction). This commits the outer-most and inner-most
transactions.

```none
begin transaction;
insert into tracker_1 values (00, 'outer_alpha');
call sp1_outer('begin transaction', 'begin transaction', 'commit', 'rollback');
insert into tracker_1 values (09, 'outer_charlie');
commit;
```

The result is that all rows except the rows in the middle transaction (12, 21, and 23) are committed.

```none
select id, name from tracker_1
union all
select id, name from tracker_2
union all
select id, name from tracker_3
order by id;
+----+---------------+
| ID | NAME          |
|----+---------------|
|  0 | outer_alpha   |
|  9 | outer_charlie |
| 11 | p1_alpha      |
| 13 | p1_charlie    |
| 22 | p2_bravo      |
+----+---------------+
```

### Using error handling with transactions in stored procedures

The following code shows simple error handling for a transaction in a stored procedure. If the parameter value ‘fail’
is passed, the stored procedure tries to delete from two tables that exist and one table that doesn’t exist, and the
stored procedure catches the error and returns an error message. If the parameter value ‘fail’ is not passed, the
procedure tries to delete from two tables that do exist, and succeeds.

Create the tables and stored procedure:

```none
begin transaction;

create table parent(id integer);
create table child (child_id integer, parent_ID integer);

-- ----------------------------------------------------- --
-- Wrap multiple related statements in a transaction,
-- and use try/catch to commit or roll back.
-- ----------------------------------------------------- --
-- Create the procedure
create or replace procedure cleanup(FORCE_FAILURE varchar)
  returns varchar not null
  language javascript
  as
  $$
  var result = "";
  snowflake.execute( {sqlText: "begin transaction;"} );
  try {
      snowflake.execute( {sqlText: "delete from child where parent_id = 1;"} );
      snowflake.execute( {sqlText: "delete from parent where id = 1;"} );
      if (FORCE_FAILURE === "fail")  {
          // To see what happens if there is a failure/rollback,
          snowflake.execute( {sqlText: "delete from no_such_table;"} );
          }
      snowflake.execute( {sqlText: "commit;"} );
      result = "Succeeded";
      }
  catch (err)  {
      snowflake.execute( {sqlText: "rollback;"} );
      return "Failed: " + err;   // Return a success/error indicator.
      }
  return result;
  $$
  ;

commit;
```

Call the stored procedure and force an error:

```none
call cleanup('fail');
+----------------------------------------------------------+
| CLEANUP                                                  |
|----------------------------------------------------------|
| Failed: SQL compilation error:                           |
| Object 'NO_SUCH_TABLE' does not exist or not authorized. |
+----------------------------------------------------------+
```

Call the stored procedure without forcing an error:

```none
call cleanup('do not fail');
+-----------+
| CLEANUP   |
|-----------|
| Succeeded |
+-----------+
```

---
title: Troubleshooting external functions for AWS
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-aws-troubleshooting.md
section: SQL General Reference
---

# Troubleshooting external functions for AWS

This topic provides troubleshooting information for external functions for AWS.

## Platform-independent Runtime Issues

### Data Type Return Values Do Not Match Expected Return Values

When passing arguments to or from an external function, ensure that the data types are appropriate. If the value
sent can’t fit into the data type being received, the value might be truncated or corrupted in some other way.

For more details, see [Ensure that arguments to the external function correspond to arguments parsed by the remote service](external-functions-best-practices.md).

### Error: Row numbers out of order

Possible Causes:
:   The row numbers you return within each batch should be monotonically ascending integers starting at 0. The input row numbers must also
    follow this rule, and each output row should match the corresponding input row. For example, the output in output row 0 should
    correspond to the input in input row 0.

Possible Solutions:
:   Ensure that the row numbers you return are the same as the row numbers you received, and that each output value uses the row number of
    the corresponding input. If this doesn’t work, then the input row numbers may not be correct or you did not return the rows in the
    correct order.

    Next, ensure that the output row numbers start from 0, increase by 1, and are in order.

For more information about data input and output formats, see [Remote service input and output data formats](external-functions-data-format.md).

### Error: “Error parsing JSON: Invalid response”

Possible Causes:
:   The most likely cause is that the JSON returned by the remote service (e.g. AWS Lambda function) is not constructed correctly.

Possible Solutions:
:   Ensure that the external function returns an array of arrays, with one inner array returned for each input row received. Review the
    description of the output format at [Data format received by Snowflake](external-functions-data-format.md).

### Error: Format of the returned value is not JSON

Possible Causes:
:   Your return value includes double quotes inside the value.

Possible Solutions:
:   Although JSON strings are delimited by double quotes, the string itself should not start and end with a quotation mark in most cases.
    If the embedded double quotes are incorrect, remove them.

### Error: Function received the wrong number of rows

Possible Causes:
:   The remote service tried to return more or fewer rows than it received. Even though the function is nominally scalar, it might receive
    multiple rows in the `body` field of the `event` parameter, and should return exactly as many rows as it received.

Possible Solution(s):
:   Ensure that the remote service returns one row for each row that it receives.

## AWS-specific issues

### API Gateway returns error 502 while the endpoint is using Lambda proxy integration

Possible Cause:
:   The Lambda function might have:

    * Timed out.
    * Thrown an exception.
    * Failed in some other way.

Possible Solution:
:   If the Lambda or API Gateway logs, are available to you, examine them.

    If the source code of the Lambda function is available to you, then analyze and debug the code in the
    Lambda function. In some cases, you might be able to execute a copy of that code in a simpler context
    (outside AWS) to help debug it.

    Verify that the data sent to the Lambda function is in the format that Lambda function expects.
    You might want to try sending a smaller, simpler data set to see whether that succeeds.

    Verify that you are not sending too much data at a time.

    In some cases, increasing the timeout might solve the problem, especially if the Lambda function requires a
    lot of CPU resources, or if the Lambda function itself calls other remote services and thus requires more time.

### Unable to read the requests body in the HTTP POST method inside the Amazon AWS Lambda function

Possible Cause:
:   You might not have enabled Lambda proxy integration.

Possible Solution:
:   Enable Lambda proxy integration.

    For more details, see the steps in [Create the API Gateway endpoint](external-functions-creating-aws-ui-proxy-service.md).

### Error assuming AWS_ROLE

The full text of the message is:

> ```none
> SQL execution error: Error assuming AWS_ROLE. Please verify the role and externalId are
> configured correctly in your AWS policy.
> ```

Possible Cause:
:   * In the AWS Trust Relationship Policy for your role, the AWS ARN is incorrect. Possible causes of that include:

      + You did not set it.
      + You set it, but you used the ARN of the AWS role (incorrect) instead of the user ARN, which you
        can see from the DESCRIBE INTEGRATION command in Snowflake. Make sure that you use the value from the
        `API_AWS_IAM_USER_ARN` field of the worksheet rather than the value from the “API_AWS_ROLE_ARN” field.
    * In your AWS Trust Relationship Policy, the std:ExternalId is incorrect. Possible causes of that
      include:

      + You did not set it.
      + You re-created the API integration object. Re-creating the API object changes its external ID.

### Error: 403 ‘{“Message”:”User: <ARN> is not authorized to perform: execute-api:Invoke”}’

The full text of the message is:

> ```none
> Request failed for external function <function_name>.
> Error: 403 '{"Message":"User: <ARN> is not authorized to perform: execute-api:Invoke on resource: <MethodRequestARN>"}'
> ```

Possible Cause:
:   * The API Gateway resource policy has:

      + The wrong IAM Role ARN.
      + The wrong assumed role.
      + The wrong Method Request ARN.
    * The IAM role doesn’t have the right policy attached.

Possible Solution:
:   * Make sure that you followed the resource policy template in
      [Secure your Amazon API Gateway endpoint](external-functions-creating-aws-ui-proxy-service.md). Specifically, make sure that your resource policy:

      + Replaced the <12-digit number> with the value in the worksheet field named `Your AWS account ID`.
      + Replaced the <external_function_role> with the value in the `New IAM Role Name` field of the
        worksheet.
      + Replaced the method_request_ARN in the Resource field with the value in the
        `Method Request ARN` field in the worksheet. Make sure there is no slash at the end.
    * If you need to make sure that the IAM role has the correct permissions policy attached, you can find the
      role’s permissions policy list by following the steps below:

      1. In AWS, go to Identity and Access Management (IAM) and select the role.
      2. View the Summary for the role.
      3. Click on the Permissions tab.
      4. Verify that the required policy is in the Permissions policies list.
    * Make sure the endpoint being called is the resource, not the stage, that is set up on the API Gateway.

### Error: 403 ‘{“Message”:”User: anonymous is not authorized to perform: execute-api:Invoke”}’

The full text of the message is:

> ```none
> Request failed for external function <function_name>.
> Error: 403 '{"Message":"User: anonymous is not authorized to perform: execute-api:Invoke on resource: <MethodRequestARN>"}'
> ```

Possible Cause:
:   One possible cause is that when you were configuring authorization for the API Gateway, you might not have
    specified that the Method Request requires AWS_IAM authorization for the resource.

Possible Solution:
:   If you did not follow the instructions in
    [secure the Amazon API Gateway](external-functions-creating-aws-ui-proxy-service.md), then please follow them
    now to specify AWS_IAM authorization.

### Error parsing JSON response … Error: top-level JSON object must contain “data” JSON array element

The full text of the message is:

> ```none
> Error parsing JSON response for external function ... Error: top-level JSON object must contain "data" JSON array element
> ```

Possible Cause:
:   * You might not have specified Lambda proxy integration for the POST command in your API Gateway resource.

Possible Solution:
:   * Specify Lambda proxy integration for your API Gateway resource.

      For more details about Lambda proxy integration, see the steps in
      [Create the API Gateway endpoint](external-functions-creating-aws-ui-proxy-service.md).

### Request failed for external function EXT_FUNC with remote service error: 403 ‘{“message”:”Forbidden”}’;

Possible Cause:
:   The proxy service required an [API key](external-functions-security.md), typically for
    authentication or billing. The API key is missing or incorrect.

Possible Solution:
:   Use the ALTER API INTEGRATION command to specify the correct API key.

### CloudFormation stack creation fails

This error can occur if you are using an AWS CloudFormation template to create an external function.

Possible cause:
:   You do not have required permissions for creating the resources specified in the CloudFormation template.

Possible Solution:
:   Check the Events tab for the stack to see the error details.

    Also look at the AWS external functions
    troubleshooting page for additional
    troubleshooting tips.

---
title: Troubleshooting external functions for Azure
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-azure-troubleshooting.md
section: SQL General Reference
---

# Troubleshooting external functions for Azure

This topic provides troubleshooting information for external functions for Azure.

## Platform-independent Runtime Issues

### Data Type Return Values Do Not Match Expected Return Values

When passing arguments to or from an external function, ensure that the data types are appropriate. If the value
sent can’t fit into the data type being received, the value might be truncated or corrupted in some other way.

For more details, see [Ensure that arguments to the external function correspond to arguments parsed by the remote service](external-functions-best-practices.md).

### Error: Row numbers out of order

Possible Causes:
:   The row numbers you return within each batch should be monotonically ascending integers starting at 0. The input row numbers must also
    follow this rule, and each output row should match the corresponding input row. For example, the output in output row 0 should
    correspond to the input in input row 0.

Possible Solutions:
:   Ensure that the row numbers you return are the same as the row numbers you received, and that each output value uses the row number of
    the corresponding input. If this doesn’t work, then the input row numbers may not be correct or you did not return the rows in the
    correct order.

    Next, ensure that the output row numbers start from 0, increase by 1, and are in order.

For more information about data input and output formats, see [Remote service input and output data formats](external-functions-data-format.md).

### Error: “Error parsing JSON: Invalid response”

Possible Causes:
:   The most likely cause is that the JSON returned by the remote service (e.g. AWS Lambda function) is not constructed correctly.

Possible Solutions:
:   Ensure that the external function returns an array of arrays, with one inner array returned for each input row received. Review the
    description of the output format at [Data format received by Snowflake](external-functions-data-format.md).

### Error: Format of the returned value is not JSON

Possible Causes:
:   Your return value includes double quotes inside the value.

Possible Solutions:
:   Although JSON strings are delimited by double quotes, the string itself should not start and end with a quotation mark in most cases.
    If the embedded double quotes are incorrect, remove them.

### Error: Function received the wrong number of rows

Possible Causes:
:   The remote service tried to return more or fewer rows than it received. Even though the function is nominally scalar, it might receive
    multiple rows in the `body` field of the `event` parameter, and should return exactly as many rows as it received.

Possible Solution(s):
:   Ensure that the remote service returns one row for each row that it receives.

## Azure-specific issues

### Unable to modify settings during creation of the Azure function

Possible Causes:
:   When creating your Azure Function, you may not be able to modify settings for the function under the
    Authentication/Authorization menu.

    This problem can occur if all of the following are true:

    * Your Azure Function is running on Linux rather than Microsoft Windows.
    * You plan to use Azure AD authentication/authorization for your Azure Function.
    * You are using Azure’s “consumption” pricing tier rather than the “premium” pricing tier.

    Azure AD authentication is not available on the Linux Consumption plan for Azure Functions. You must use an App Service plan or
    Premium plan in order to authenticate with Azure AD.

Possible Solutions:
:   * Recreate the Azure Function and specify that it will run on Microsoft Windows rather than Linux.
    * Skip Azure AD authentication/authorization for the Azure Function; instead, perform the following tasks:

      + Set a validate-JWT (JSON Web Token) Policy for the API Management instance as documented in
        [Step 6: Create the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-ui-security-policy.md).
      + Use IP address restrictions to limit the remote service to accept connections only from the API Management
        service instance.

      If you choose this solution, you must create the Azure AD application manually. For details, see the Microsoft documentation:

      > [app registration](https://docs.microsoft.com/en-us/azure/active-directory/develop/quickstart-register-app) .

      If you create the Azure AD application manually, record the `Azure Function AD app registration name` and
      the `Azure Function App AD Application ID` in your tracking worksheet.
    * Switch from consumption pricing to premium pricing or use an App Service plan. For more details, see the Microsoft documentation:

      > [configuring an authentication provider](https://docs.microsoft.com/en-us/azure/app-service/configure-authentication-provider-aad)

### External function times out

Possible Causes:
:   There are many possible causes of timeouts. On Azure, one of the possible causes is that the Azure Functions app was not written to scale
    properly.

Possible Solutions:
:   Ensure that you are following the
    [Azure guidelines for writing scalable functions](https://docs.microsoft.com/en-us/azure/azure-functions/functions-best-practices#scalability-best-practices) .

For more information about troubleshooting scalability and performance issues, see
[Troubleshooting scalability and performance issues](external-functions-implementation.md) .

### Error: Failed to obtain Azure active directory access token.

Possible Solutions:
:   Try the following steps:

    * Verify that the Snowflake service principal has access to your Azure AD tenant.
    * Verify that the tenant ID and the Azure AD application ID are correct.

      Note that whitespace, including leading and trailing whitespace (e.g. blanks), is significant in ID fields. Check for incorrect
      leading or trailing whitespace.

### Error: 401 ‘{ “statusCode”: 401, “message”: “Access denied due to missing subscription key…” }’

Full error message text:

```none
Request failed for external function <function_name>. Error: 401 '{ "statusCode": 401, "message":
"Access denied due to missing subscription key. Make sure to include subscription key when making requests to an API." }'
```

Possible Causes:
:   The API Management service’s subscription requirement might be on.

Possible Solutions:
:   You might need to turn off the subscription requirement for the API Management service.

### Error: 401 ‘{ “statusCode”: 401, “message”: “Access denied due to missing subscription key.” }

Possible Causes:
:   The proxy service requires an [API key](external-functions-security.md) (aka “subscription key”), typically for authentication
    or billing. However, no API key was supplied in the API_KEY clause of the CREATE API INTEGRATION command.

Possible Solutions:
:   Use the ALTER API INTEGRATION command to update the API integration with a valid API key.

### Error: 401 ‘{ “statusCode”: 401, “message”: “Access denied due to invalid subscription key.” }’

Possible Causes:
:   The proxy service requires an [API key](external-functions-security.md) (aka “subscription key”), typically for authentication
    or billing. However, the API key supplied in the API_KEY clause of the CREATE API INTEGRATION command was not valid.

Possible Solutions:
:   Use the ALTER API INTEGRATION command to update the API integration with a valid API key.

### Error: 401 ‘{ “statusCode”: 401, “message”: “Invalid JWT.” }’

Full error message text:

```none
Request failed for external function <function_name>. Error: 401 '{ "statusCode": 401, "message": "Invalid JWT." }'
```

Possible Causes:
:   * You might not have finished setting the security policy on the Azure API Management service. For example, you might have:

      + Created, but not edited, the JWT (JSON Web Token).
      + Omitted one or more of the required claims/values. For example, you might have specified the claim for Snowflake but not the
      + remote service (Azure Function), or vice versa.
    * You might have used an invalid open ID URL.

Possible Solutions:
:   * Finish setting the security policy on the Azure API Management service. For example, review the JWT and verify that you included the
      required claims/values, including the claim for Snowflake and the claim for the remote service (Azure Function).
    * Verify that you used a valid open ID URL.

### Error (remote service): 401 ‘{ “statusCode”: 401, “message”: “Invalid JWT.” }’

Full error message text:

```none
Request failed for external function <function_name> with remote service error: 401 '{ "statusCode": 401, "message": "Invalid JWT." }'
```

Possible Causes:
:   If you used the ARM template, you might not have updated the JWT (JSON Web Token) that the template created for you.

Possible Solutions:
:   Update the JWT as documented in [Step 6: Update the Azure security policy for the proxy service in the Portal](external-functions-creating-azure-template-security-policy.md).

### Error: 500 …

Possible Causes:
:   You might have chosen the wrong option for your Azure AD app:

    * Incorrect option:
      Accounts in any organizational directory (Any Azure AD directory - Multitenant) and personal Microsoft accounts (e.g. Skype, Xbox)
    * Correct option: Accounts in this organizational directory only (Default Directory only - Single tenant)

---
title: Troubleshooting external functions for GCP
source: https://docs.snowflake.com/en/sql-reference/external-functions-creating-gcp-troubleshooting.md
section: SQL General Reference
---

# Troubleshooting external functions for GCP

This topic provides troubleshooting information for external functions for GCP.

## Platform-independent Runtime Issues

### Data Type Return Values Do Not Match Expected Return Values

When passing arguments to or from an external function, ensure that the data types are appropriate. If the value
sent can’t fit into the data type being received, the value might be truncated or corrupted in some other way.

For more details, see [Ensure that arguments to the external function correspond to arguments parsed by the remote service](external-functions-best-practices.md).

### Error: Row numbers out of order

Possible Causes:
:   The row numbers you return within each batch should be monotonically ascending integers starting at 0. The input row numbers must also
    follow this rule, and each output row should match the corresponding input row. For example, the output in output row 0 should
    correspond to the input in input row 0.

Possible Solutions:
:   Ensure that the row numbers you return are the same as the row numbers you received, and that each output value uses the row number of
    the corresponding input. If this doesn’t work, then the input row numbers may not be correct or you did not return the rows in the
    correct order.

    Next, ensure that the output row numbers start from 0, increase by 1, and are in order.

For more information about data input and output formats, see [Remote service input and output data formats](external-functions-data-format.md).

### Error: “Error parsing JSON: Invalid response”

Possible Causes:
:   The most likely cause is that the JSON returned by the remote service (e.g. AWS Lambda function) is not constructed correctly.

Possible Solutions:
:   Ensure that the external function returns an array of arrays, with one inner array returned for each input row received. Review the
    description of the output format at [Data format received by Snowflake](external-functions-data-format.md).

### Error: Format of the returned value is not JSON

Possible Causes:
:   Your return value includes double quotes inside the value.

Possible Solutions:
:   Although JSON strings are delimited by double quotes, the string itself should not start and end with a quotation mark in most cases.
    If the embedded double quotes are incorrect, remove them.

### Error: Function received the wrong number of rows

Possible Causes:
:   The remote service tried to return more or fewer rows than it received. Even though the function is nominally scalar, it might receive
    multiple rows in the `body` field of the `event` parameter, and should return exactly as many rows as it received.

Possible Solution(s):
:   Ensure that the remote service returns one row for each row that it receives.

## GCP-specific issues

### Error: Request fails with ‘{“message”:”Audiences in jwt are not allowed”,”code”:403}’

Possible Causes:
:   The value in the API integration’s `google_audience` field is not allowed.

Possible Solutions:
:   * Verify that the API Integration’s `google_audience` value matches the managed service name of your API, which should be
      recorded in the `Managed Service Identifier` field in your tracking worksheet.
    * If you added an x-google-audiences field to the securityDefinitions section of your API config file, make sure that
      the value in x-google-audiences matches the value in the `google_audience` field of the API integration.

For more information about authenticating with Google, see the Google service account
[authentication documentation](https://cloud.google.com/api-gateway/docs/authenticate-service-account#configure_auth).

### Error: Request fails with ‘{“message”:”Jwt is missing”,”code”:401}’

Possible Causes:
:   * The value of the x-google-issuer field in the securityDefinitions field in the configuration file
      might not match the value of the API_GCP_SERVICE_ACCOUNT for the API integration, as recorded in your tracking worksheet.
    * The value in x-google-issuer might contain extra whitespace.

Possible Solutions:
:   * Update the x-google-issuer to match the API_GCP_SERVICE_ACCOUNT.
    * Remove unneeded whitespace.

### Error: Request fails with ‘403 forbidden’

Possible Causes:
:   The service account using the config does not have the appropriate permissions on the backend.

Possible Solutions:
:   Update the service account’s permissions.

---
title: UNPIVOT
source: https://docs.snowflake.com/en/sql-reference/constructs/unpivot.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# UNPIVOT

Rotates a table by transforming columns into rows. UNPIVOT is a relational operator that accepts
two columns (from a table or subquery), along with a list of columns, and generates a row for
each column specified in the list. In a query, it is specified in the [FROM](from.md) clause after
the table name or subquery.

UNPIVOT is not exactly the reverse of PIVOT because it can’t undo aggregations made by PIVOT.

This operator can be used to transform a wide table (e.g. `empid`, `jan_sales`,
`feb_sales`, `mar_sales`) into a narrower table (e.g. `empid`, `month`,
`sales`).

See also:
:   [PIVOT](pivot.md)

## Syntax

```sqlsyntax
SELECT ...
FROM ...
    UNPIVOT [ { INCLUDE | EXCLUDE } NULLS ]
      ( <value_column>
        FOR <name_column> IN (
          <col> [ [ AS ] <col_alias> ],
          ...
        )
      )

[ ... ]
```

## Parameters

`{ INCLUDE | EXCLUDE } NULLS`
:   Specifies whether to include or exclude rows with NULLs in the `name_column`:

    * `INCLUDE NULLS` includes rows with NULLs.
    * `EXCLUDE NULLS` excludes rows with NULLs.

    Default: `EXCLUDE NULLS`

`value_column`
:   The name to assign to the generated column that will be populated with the values from the columns in the column list.

`name_column`
:   The name to assign to the generated column that will be populated with the names of the columns in the column list.

`column_list`
:   The names of the columns in the source table or subquery that will be rotated into a single pivot column.
    The column names will populate `name_column`, and the column values will populate `value_column`.

    The `column_list` can only contain literal column names, not a subquery.

    The columns in `column_list` must have exactly the same data type, with the following exceptions:

    * The [data types for text strings](../data-types-text.md) can be different lengths.
    * If the columns contain text strings, different columns can use different data types for text. For example,
      the list can include a VARCHAR column and a CHAR column.

`[ AS ] col_alias`
:   Specifies the column alias to use in the result of the UNPIVOT operation instead of the original column names.
    You can’t use different aliases for the same column name. However, you can’t use the same alias for multiple
    column names. The AS keyword is optional.

## Usage notes

* You can’t use a [LATERAL join](join-lateral.md) to directly reference the
  result set of an UNPIVOT operation. Attempting to do so returns an error. As a workaround, materialize the UNPIVOT
  result into a temporary table first, then reference that table in the LATERAL join. To create and load the
  `monthly_sales` table that is selected from in this example, see the examples section.

  The following query doesn’t work because LATERAL can’t reference an UNPIVOT result set directly:

  ```sqlexample
  SELECT *
    FROM monthly_sales
      UNPIVOT (sales FOR month IN (jan, feb, mar, apr)) unpvt
      JOIN LATERAL (SELECT unpvt.sales AS sales_value) jl;
  ```

  The following CREATE TEMPORARY TABLE statement creates a temporary table to materialize the UNPIVOT result.
  The query that follows that statement references the temporary table in the LATERAL join:

  ```sqlexample
  CREATE OR REPLACE TEMPORARY TABLE unpivot_result AS
    SELECT *
      FROM monthly_sales
        UNPIVOT (sales FOR month IN (jan, feb, mar, apr));

  SELECT *
    FROM unpivot_result
      JOIN LATERAL (SELECT unpivot_result.sales AS sales_value) jl;
  ```

## Examples

Create a table, `monthly_sales`, with the following structure and data:

```sqlexample
CREATE OR REPLACE TABLE monthly_sales(
  empid INT,
  dept TEXT,
  jan INT,
  feb INT,
  mar INT,
  apr INT);

INSERT INTO monthly_sales VALUES
  (1, 'electronics', 100, 200, 300, 100),
  (2, 'clothes', 100, 300, 150, 200),
  (3, 'cars', 200, 400, 100, 50),
  (4, 'appliances', 100, NULL, 100, 50);

SELECT * FROM monthly_sales;
```

```output
+-------+-------------+-----+------+------+-----+
| EMPID | DEPT        | JAN | FEB  | MAR  | APR |
|-------+-------------+-----+------+------+-----|
|     1 | electronics | 100 | 200  | 300  | 100 |
|     2 | clothes     | 100 | 300  | 150  | 200 |
|     3 | cars        | 200 | 400  | 100  |  50 |
|     4 | appliances  | 100 | NULL | 100  |  50 |
+-------+-------------+-----+------+------+-----+
```

Unpivot the individual month columns to return a single `sales` value by `month` for each employee.

```sqlexample
SELECT *
  FROM monthly_sales
    UNPIVOT (sales FOR month IN (jan, feb, mar, apr))
  ORDER BY empid;
```

```output
+-------+-------------+-------+-------+
| EMPID | DEPT        | MONTH | SALES |
|-------+-------------+-------+-------|
|     1 | electronics | JAN   |   100 |
|     1 | electronics | FEB   |   200 |
|     1 | electronics | MAR   |   300 |
|     1 | electronics | APR   |   100 |
|     2 | clothes     | JAN   |   100 |
|     2 | clothes     | FEB   |   300 |
|     2 | clothes     | MAR   |   150 |
|     2 | clothes     | APR   |   200 |
|     3 | cars        | JAN   |   200 |
|     3 | cars        | FEB   |   400 |
|     3 | cars        | MAR   |   100 |
|     3 | cars        | APR   |    50 |
|     4 | appliances  | JAN   |   100 |
|     4 | appliances  | MAR   |   100 |
|     4 | appliances  | APR   |    50 |
+-------+-------------+-------+-------+
```

The following example is the same as the previous example, but it uses aliases for the column names:

```sqlexample
SELECT *
  FROM monthly_sales
    UNPIVOT (sales FOR month IN (
      jan AS january,
      feb AS february,
      mar AS march,
      apr AS april)
    )
  ORDER BY empid;
```

```output
+-------+-------------+----------+-------+
| EMPID | DEPT        | MONTH    | SALES |
|-------+-------------+----------+-------|
|     1 | electronics | JANUARY  |   100 |
|     1 | electronics | FEBRUARY |   200 |
|     1 | electronics | MARCH    |   300 |
|     1 | electronics | APRIL    |   100 |
|     2 | clothes     | JANUARY  |   100 |
|     2 | clothes     | FEBRUARY |   300 |
|     2 | clothes     | MARCH    |   150 |
|     2 | clothes     | APRIL    |   200 |
|     3 | cars        | JANUARY  |   200 |
|     3 | cars        | FEBRUARY |   400 |
|     3 | cars        | MARCH    |   100 |
|     3 | cars        | APRIL    |    50 |
|     4 | appliances  | JANUARY  |   100 |
|     4 | appliances  | MARCH    |   100 |
|     4 | appliances  | APRIL    |    50 |
+-------+-------------+----------+-------+
```

The previous SELECT statements exclude NULLs by default. So, they don’t include a row for appliances in February
in the results. To include NULLs in the results, run the following SQL statement:

```sqlexample
SELECT *
  FROM monthly_sales
    UNPIVOT INCLUDE NULLS (sales FOR month IN (jan, feb, mar, apr))
  ORDER BY empid;
```

```output
+-------+-------------+-------+-------+
| EMPID | DEPT        | MONTH | SALES |
|-------+-------------+-------+-------|
|     1 | electronics | JAN   |   100 |
|     1 | electronics | FEB   |   200 |
|     1 | electronics | MAR   |   300 |
|     1 | electronics | APR   |   100 |
|     2 | clothes     | JAN   |   100 |
|     2 | clothes     | FEB   |   300 |
|     2 | clothes     | MAR   |   150 |
|     2 | clothes     | APR   |   200 |
|     3 | cars        | JAN   |   200 |
|     3 | cars        | FEB   |   400 |
|     3 | cars        | MAR   |   100 |
|     3 | cars        | APR   |    50 |
|     4 | appliances  | JAN   |   100 |
|     4 | appliances  | FEB   |  NULL |
|     4 | appliances  | MAR   |   100 |
|     4 | appliances  | APR   |    50 |
+-------+-------------+-------+-------+
```

This output includes a row for appliances in February.

Instead of selecting all columns with `*`, you can include specific columns in the SELECT list and reference
the UNPIVOT `value_column` and `name_column`. The following example is similar to the previous
example, but it specifies the `value_column` `sales` and the `name_column` `month` in the
SELECT list. The query excludes the `empid` column:

```sqlexample
SELECT dept, month, sales
  FROM monthly_sales
    UNPIVOT INCLUDE NULLS (sales FOR month IN (jan, feb, mar, apr))
  ORDER BY dept;
```

```output
+-------------+-------+-------+
| DEPT        | MONTH | SALES |
|-------------+-------+-------|
| appliances  | JAN   |   100 |
| appliances  | FEB   |  NULL |
| appliances  | MAR   |   100 |
| appliances  | APR   |    50 |
| cars        | JAN   |   200 |
| cars        | FEB   |   400 |
| cars        | MAR   |   100 |
| cars        | APR   |    50 |
| clothes     | JAN   |   100 |
| clothes     | FEB   |   300 |
| clothes     | MAR   |   150 |
| clothes     | APR   |   200 |
| electronics | JAN   |   100 |
| electronics | FEB   |   200 |
| electronics | MAR   |   300 |
| electronics | APR   |   100 |
+-------------+-------+-------+
```

---
title: Unstructured data types
source: https://docs.snowflake.com/en/sql-reference/data-types-unstructured.md
section: SQL General Reference
---

# Unstructured data types

Snowflake supports three different kinds of data:

* *Structured data* (such as a CSV file) follows a strict tabular schema. Structured data can be easily loaded into SQL tables.
* *Semi-structured data* (such as a JSON or XML file) has a flexible schema. Snowflake can access fields in semi-structured data using
  special functions, but the data is not as easily queried as structured data. Semi-structured data can be loaded into SQL tables
  using VARIANT columns.
* *Unstructured data* (such as a document, image, or audio file) has no inherent schema. Unstructured data might still
  have an internal structure (for example, PNG image files must follow a documented format) but such technical details do not generally
  relate to the information in the file.

Snowflake provides ways of accessing and processing data in unstructured files, such as
the [AI COMPLETE function](functions/ai_complete.md).

To use unstructured data in Snowflake, it must first be stored on an internal or external stage. The Snowflake function
that processes the unstructured data reads it from there. Depending on the function, you specify the file in
one or more of the following ways:

* By passing a stage name and a relative path to the file as two separate arguments to the function that will use it.
* By passing a [staged](functions/build_stage_file_url.md) or [scoped](functions/build_scoped_file_url.md) URL as a string.
* By passing a FILE object created using the [TO_FILE](functions/to_file.md) or [TRY_TO_FILE](functions/try_to_file.md) function.

## FILE data type

Snowflake provides the FILE data type for unstructured data. A FILE value represents a file stored in an internal or
external stage, but does not store the file’s data, only a reference to it. It includes the following metadata:

* STAGE: The name of the stage on which the file resides.
* RELATIVE_PATH: The relative path of the file in its stage.
* STAGE_FILE_URL: The stage file URL.
* SCOPED_FILE_URL: A scoped file URL.
* CONTENT_TYPE: The MIME type of the file.
* SIZE: The size, in bytes, of the file.
* ETAG: A unique hash of the file contents.
* LAST_MODIFIED: The timestamp at which the file was last modified.

Not all of these fields are required. A FILE must have CONTENT_TYPE, SIZE, ETAG, and LAST_MODIFIED fields, and also the
file’s location specified by STAGE plus RELATIVE_PATH, STAGE_FILE_URL, or SCOPED_FILE_URL.

You can create a file by passing a scoped file URL, a stage and path, or a metadata object to the
[TO_FILE](functions/to_file.md) or [TRY_TO_FILE](functions/try_to_file.md) function.

### FILE functions

| Sub-category | Function |
| --- | --- |
| Constructors | [TO_FILE](functions/to_file.md) |
|  | [TRY_TO_FILE](functions/try_to_file.md) |
| Accessors | [FL_GET_CONTENT_TYPE](functions/fl_get_content_type.md) |
|  | [FL_GET_ETAG](functions/fl_get_etag.md) |
|  | [FL_GET_FILE_TYPE](functions/fl_get_file_type.md) |
|  | [FL_GET_LAST_MODIFIED](functions/fl_get_last_modified.md) |
|  | [FL_GET_RELATIVE_PATH](functions/fl_get_relative_path.md) |
|  | [FL_GET_SCOPED_FILE_URL](functions/fl_get_scoped_file_url.md) |
|  | [FL_GET_SIZE](functions/fl_get_size.md) |
|  | [FL_GET_STAGE](functions/fl_get_stage.md) |
|  | [FL_GET_STAGE_FILE_URL](functions/fl_get_stage_file_url.md) |
| Utility Functions | [FL_IS_AUDIO](functions/fl_is_audio.md) |
|  | [FL_IS_COMPRESSED](functions/fl_is_compressed.md) |
|  | [FL_IS_DOCUMENT](functions/fl_is_document.md) |
|  | [FL_IS_IMAGE](functions/fl_is_image.md) |
|  | [FL_IS_VIDEO](functions/fl_is_video.md) |

## Usage notes

* FILE values may become inconsistent with the underlying staged files. FILE values are not updated when you modify or
  delete the underlying file. Conversely, if a FILE value is deleted from a table, the underlying file is not affected.
* Permissions on the underlying files are governed by the type of URL that was specified when creating the FILE. Stage
  file URLs and stage/path combinations give permanent permission to callers that have access to the associated stage.
  Scoped URLs give temporary user-based access to the underlying file for a 24-hour period.

## Using unstructured data in Snowflake via SQL

Create a table with a FILE column.

```sqlexample
CREATE TABLE images_table(img FILE);
```

Load data into the table from an external stage `my_images` that contains image files. `mpy_images` can be an internal or external
stage.

> **Note:**
>
> This process requires directory table support on the stage. Enable it, if necessary, using the SQL below:

```sqlexample
ALTER STAGE my_images DIRECTORY=(ENABLE=true);
```

Load data into the Snowflake table.

```sqlexample
INSERT INTO images_table
    SELECT TO_FILE(file_url) FROM DIRECTORY(@my_images);
```

Run SQL statements against `images_table`. For example, the following query returns the relative path of each file in
the table that was last modified between January 1, 2021 and January 1, 2023.

```sqlexample
SELECT FL_GET_RELATIVE_PATH(f)
    FROM images_table
    WHERE FL_GET_LAST_MODIFIED(f) BETWEEN '2021-01-01' and '2023-01-01';
```

## Known limitations

The FILE data type currently cannot be used in:

* CLUSTER BY, GROUP BY, and ORDER BY clauses
* Hybrid tables, Iceberg tables, and external tables
* SnowScript
* Secured views
* Binds
* Search optimization
* Clients and connectors except Snowpark Python

---
title: Unsupported data types
source: https://docs.snowflake.com/en/sql-reference/data-types-unsupported.md
section: SQL General Reference
---

# Unsupported data types

Snowflake doesn’t support the following data types:

| Category | Type | Notes |
| --- | --- | --- |
| LOB (Large Object) | BLOB | BINARY can be used instead; maximum of 67108864 bytes. For more information, see [String & binary data types](data-types-text.md). |
| CLOB | VARCHAR can be used instead; maximum of 134217728 bytes (for singlebyte). For more information, see [String & binary data types](data-types-text.md). |
| Other | ENUM |  |

---
title: User & security DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-user-security.md
section: SQL General Reference
---

# User & security DDL

Snowflake provides a full set of SQL commands for managing users and security. These commands can only be executed by users who are granted roles that have the OWNERSHIP privilege on the managed object. This is usually restricted to the
ACCOUNTADMIN and SECURITYADMIN roles.

However, individual users are able to perform the following tasks for themselves:

* Change their password (only through the web interface).
* View their user information (via [DESCRIBE USER](sql/desc-user.md)).
* Change their default role, virtual warehouse, or namespace (via [ALTER USER](sql/alter-user.md)).
* Change their session parameters (via [ALTER SESSION](sql/alter-session.md)).

## User management

Each user with access to Snowflake is represented by a user object. A user object stores all of the information about the user, including their login name, password, and defaults (role, virtual warehouse, and namespace). Use the following
DDL commands to manage users in the system:

* [CREATE USER](sql/create-user.md)
* [ALTER USER](sql/alter-user.md)
* [DROP USER](sql/drop-user.md)
* [DESCRIBE USER](sql/desc-user.md)
* [SHOW USERS](sql/show-users.md)

## Role management

Snowflake uses roles to control access to objects in the system:

* Roles are granted access privileges for objects in the system (databases, tables, etc.).
* Roles are granted to users to enable them to create, modify, and use the objects for which the roles have privileges.
* Roles can be granted to other roles to support defining hierarchical access privileges.

Use the following DDL commands to manage roles in the system:

* [CREATE ROLE](sql/create-role.md)
* [ALTER ROLE](sql/alter-role.md)
* [DROP ROLE](sql/drop-role.md)
* [SHOW ROLES](sql/show-roles.md)

Use the following DDL commands to manage database roles in the system:

* [CREATE DATABASE ROLE](sql/create-database-role.md)
* [ALTER DATABASE ROLE](sql/alter-database-role.md)
* [DROP DATABASE ROLE](sql/drop-database-role.md)
* [SHOW DATABASE ROLES](sql/show-database-roles.md)

Use the following command to activate a primary role or secondary roles within a user session:

* [USE ROLE](sql/use-role.md)
* [USE SECONDARY ROLES](sql/use-secondary-roles.md)

## Object tagging management

Snowflake supports the following DDL to create and manage tags:

* [CREATE TAG](sql/create-tag.md)
* [ALTER TAG](sql/alter-tag.md)
* [ALTER <object>](sql/alter.md) (to set a tag on a Snowflake object)
* [SHOW TAGS](sql/show-tags.md)
* [DROP TAG](sql/drop-tag.md)
* [UNDROP TAG](sql/undrop-tag.md)

Note that Snowflake does not support the [describe](sql/desc.md) operation for the tag object.

## Access control management

Use the following commands to manage access control for objects by granting (and revoking) object privileges to roles and granting roles to users and other roles:

* [GRANT <privileges> … TO ROLE](sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](sql/revoke-privilege.md)
* [GRANT <privilege> … TO SHARE](sql/grant-privilege-share.md)
* [REVOKE <privilege> … FROM SHARE](sql/revoke-privilege-share.md)
* [GRANT DATABASE ROLE … TO SHARE](sql/grant-database-role-share.md)
* [REVOKE DATABASE ROLE … FROM SHARE](sql/revoke-database-role-share.md)
* [GRANT OWNERSHIP](sql/grant-ownership.md)
* [GRANT ROLE](sql/grant-role.md)
* [GRANT DATABASE ROLE](sql/grant-database-role.md)
* [REVOKE ROLE](sql/revoke-role.md)
* [REVOKE DATABASE ROLE](sql/revoke-database-role.md)
* [SHOW GRANTS](sql/show-grants.md)

## Network policy management

A network policy supports restricting access to your account based on user IP address. Use the following commands to create, alter, or drop network policies:

* [ALTER NETWORK POLICY](sql/alter-network-policy.md)
* [CREATE NETWORK POLICY](sql/create-network-policy.md)
* [DESCRIBE NETWORK POLICY](sql/desc-network-policy.md)
* [DROP NETWORK POLICY](sql/drop-network-policy.md)
* [SHOW NETWORK POLICIES](sql/show-network-policies.md)

## Secret management

Snowflake supports the following DDL commands and operations to manage secrets:

* [CREATE SECRET](sql/create-secret.md)
* [ALTER SECRET](sql/alter-secret.md)
* [DROP SECRET](sql/drop-secret.md)
* [SHOW SECRETS](sql/show-secrets.md)
* [DESCRIBE SECRET](sql/desc-secret.md)

## Password policy management

Snowflake provides the following DDL commands to manage password policy objects:

* [CREATE PASSWORD POLICY](sql/create-password-policy.md)
* [ALTER PASSWORD POLICY](sql/alter-password-policy.md)
* [DROP PASSWORD POLICY](sql/drop-password-policy.md)
* [SHOW PASSWORD POLICIES](sql/show-password-policies.md)
* [DESCRIBE PASSWORD POLICY](sql/desc-password-policy.md)

## Session policy management

Snowflake provides the following DDL commands to manage session policy objects:

* [CREATE SESSION POLICY](sql/create-session-policy.md)
* [ALTER SESSION POLICY](sql/alter-session-policy.md)
* [DROP SESSION POLICY](sql/drop-session-policy.md)
* [SHOW SESSION POLICIES](sql/show-session-policies.md)
* [DESCRIBE SESSION POLICY](sql/desc-session-policy.md)

## Third-party integrations

An integration is a Snowflake object that provides an interface between Snowflake and third-party services. Use the following commands to create, alter, or drop integrations:

### API integrations

* [ALTER API INTEGRATION](sql/alter-api-integration.md)
* [CREATE API INTEGRATION](sql/create-api-integration.md)
* [DESCRIBE INTEGRATION](sql/desc-integration.md)
* [DROP INTEGRATION](sql/drop-integration.md)
* [SHOW INTEGRATIONS](sql/show-integrations.md)

### Notification integrations

* [ALTER NOTIFICATION INTEGRATION](sql/alter-notification-integration.md)
* [CREATE NOTIFICATION INTEGRATION](sql/create-notification-integration.md)
* [DESCRIBE NOTIFICATION INTEGRATION](sql/desc-notification-integration.md)
* [DROP INTEGRATION](sql/drop-integration.md)
* [SHOW NOTIFICATION INTEGRATIONS](sql/show-notification-integrations.md)

### Security integrations

* [ALTER SECURITY INTEGRATION](sql/alter-security-integration.md)
* [CREATE SECURITY INTEGRATION](sql/create-security-integration.md)
* [DESCRIBE INTEGRATION](sql/desc-integration.md)
* [DROP INTEGRATION](sql/drop-integration.md)
* [SHOW INTEGRATIONS](sql/show-integrations.md)
* [SHOW DELEGATED AUTHORIZATIONS](sql/show-delegated-authorizations.md)

### Storage integrations

* [ALTER STORAGE INTEGRATION](sql/alter-storage-integration.md)
* [CREATE STORAGE INTEGRATION](sql/create-storage-integration.md)
* [DESCRIBE INTEGRATION](sql/desc-integration.md)
* [DROP INTEGRATION](sql/drop-integration.md)
* [SHOW INTEGRATIONS](sql/show-integrations.md)

---
title: User-defined types
source: https://docs.snowflake.com/en/sql-reference/data-types-user-defined.md
section: SQL General Reference
---

# User-defined types

You can define *user-defined types*, which are new data types that are based on existing
[Snowflake data types](../sql-reference-data-types.md). For example, suppose that you want to
define a column for the age of a person, and you want to restrict the values to include numbers with at
most three digits and no digits after the decimal point. You can define a data type named `age` that
corresponds to `NUMBER(3,0)`.

A user-defined type is a schema-level object that can be used in all of the places that types can be used, including
column definitions, function and procedure definitions, and cast expressions.

User-defined types can simplify schema maintenance and improve data quality. You can define a user-defined type
once, and then use it in multiple objects.

You can also use user-defined types to group related data fields into a single, logical column, instead of using
multiple columns or tables for the fields. For example, you can define a data type for addresses that is a
[structured OBJECT type](data-types-structured.md) with fields for the street address,
city, state, and ZIP Code.

## Privileges required for user-defined types

To create a user-defined type in a schema, you must use a role that has been granted the CREATE TYPE privilege on that schema.

For more information, see the [access control requirements for user-defined types](../user-guide/security-access-control-privileges.md).

## General usage notes for user-defined types

* To change the definition of a user-defined type, drop it and re-create it.

  If you change the definition of a user-defined type:

  + SQL statements that directly operate on table columns that use the type might
    return errors, including SELECT statements and DML statements. However, SQL statements that don’t directly operate on
    table columns that use the type run normally. For example, if a table contains a user-defined type column
    named `typed_column`, and a SELECT statement specifies other columns in its SELECT list, the SELECT statement runs
    normally. To correct the problem, you can revise the SQL statements to use the underlying Snowflake types.
  + Calls to functions and stored procedures that use the type return errors. To correct the problem,
    drop and re-create the functions and stored procedures.
* The [ALTER TABLE … ALTER COLUMN](sql/alter-table-column.md) command can change the data type of a column from a user-defined
  type to a compatible [Snowflake data type](../sql-reference-data-types.md), or from a Snowflake data type to a
  user-defined type.
* When you are constructing an object to insert into a column of a user-defined type by using the
  [OBJECT_CONSTRUCT](functions/object_construct.md) function or an [OBJECT constant](data-types-semistructured.md),
  cast the result to the user-defined type.

  For examples, see Using a user-defined type for a table column.
* When [set operators](operators-query.md) (for example, UNION, INTERSECT, EXCEPT) or
  [conditional expression functions](expressions-conditional.md) (for example, CASE, IFF, COALESCE, NVL, and so on)
  evaluate an expression that resolves to a value of a user-defined type, Snowflake determines a common type using the underlying
  base types of the operands. By default, the data type of the result is this base type. If you want the result to be a value of a
  user-defined type, explicitly cast the final expression to the user-defined type.

  The following rules apply when user-defined types are used in set operations or conditional expression functions:

  + User-defined types are distinct from their base types, but, in expression type resolution, they coerce to their base
    types to find a common type.
  + If the branches or operands resolve to a single Snowflake type (for example, VARCHAR or NUMBER), that is the result type.
  + To preserve a user-defined type or produce a result that is a value of a user-defined type,
    cast the overall expression by
    using `CAST(expr AS user-defined type)` or `expr::user-defined type`.
  + Incompatible base types (for example, VARCHAR and NUMBER) follow normal
    [coercion rules](data-type-conversion.md). If no common base type exists, an error is returned.

  For examples, see Using set operators and conditional expression functions with user-defined types.
* Using user-defined types and compatible Snowflake data types for function [overloading](../developer-guide/udf-stored-procedure-naming-conventions.md)
  is allowed. That is, you can specify a user-defined type for a function argument type, and you can specify a compatible Snowflake
  data type for an argument type of a function with the same name.
* If a user-defined type is specified as the RETURN type of a SQL user-defined function (UDF) or Snowflake Scripting stored procedure,
  the return value must be explicitly cast to the user-defined type in the body of the UDF or stored procedure.
* When a user-defined type is used as an argument or return value for a UDF or procedure written in a language other
  than SQL (such as Python or Java), the user-defined type is treated the same as its base type.
* [Schema evolution](../user-guide/data-load-schema-evolution.md) isn’t supported for user-defined types.

## Casting user-defined types

User-defined types support [data type conversion](data-type-conversion.md), including
both explicit casting and implicit casting (coercion):

* Explicit casting to and from user-defined types
* Coercion of user-defined types

### Explicit casting to and from user-defined types

A user-defined type value can be cast to the same data types as values of its base type. For example,
create a user-defined type named `age` that is based on the NUMBER type:

```sqlexample
CREATE TYPE age AS NUMBER(3,0);
```

A value can be cast to a user-defined type if the value can be cast to the base type of the user-defined type.
For example, the value `10` can be cast to the NUMBER type, so you can cast the value to the `age` type:

```sqlexample
SELECT 10::age;
```

A user-defined type value can be cast to a different data type if the base type of the user-defined type can be
cast to that data type. For example, a NUMBER value can be cast to the VARCHAR type, so the value `10` of
user-defined type `age` can be cast to the VARCHAR type:

```sqlexample
SELECT 10::age::VARCHAR;
```

### Coercion of user-defined types

A user-defined type value coerces to its base type. Therefore, in all operations, it behaves the same as
its base type. For example, create a user-defined type named `age` that is based on the NUMBER type and
a table with two columns of type `age`:

```sqlexample
CREATE TYPE age AS NUMBER(3,0);

CREATE TABLE test_age_udf(a age, b age);
```

Insert values into the table:

```sqlexample
INSERT INTO test_age_udf VALUES (10, 20);
```

The following example performs an addition operation on the table values, and Snowflake coerces the `age` values to values
of type NUMBER to complete the operation. The example uses the [SYSTEM$TYPEOF](functions/system_typeof.md) function to show
the data type of the result:

```sqlexample
SELECT a + b AS result,
       SYSTEM$TYPEOF(a + b) AS type
  FROM test_age_udf;
```

```output
+--------+------------------+
| RESULT | TYPE             |
|--------+------------------|
|     30 | NUMBER(4,0)[SB1] |
+--------+------------------+
```

## Examples for user-defined data types

The following examples show you how to use user-defined types:

### Using a user-defined type for a table column

In the following example, you create a user-defined type named `address`, and then use the type in a table:

1. Create a user-defined type that is based on a [structured OBJECT type](data-types-structured.md)
   to store address information:

   ```sqlexample
   CREATE TYPE address AS OBJECT(
     street VARCHAR(100),
     city VARCHAR(50),
     state_abbr CHAR(2),
     zip_code CHAR(10)
   );
   ```
2. Create a table that stores customer information, including the address:

   ```sqlexample
   CREATE TABLE customers_udt_test (
     cust_id VARCHAR NOT NULL,
     cust_name VARCHAR(100),
     cust_address address
   );
   ```
3. Insert a row into the table, and specify the value for the `cust_address` column by casting an
   [OBJECT constant](data-types-semistructured.md) to the `address` type:

   ```sqlexample
   INSERT INTO customers_udt_test (cust_id, cust_name, cust_address)
     SELECT
       '1000',
       'Example1 Inc',
       {
         'street': '101 Snow Street',
         'city': 'San Francisco',
         'state_abbr': 'CA',
         'zip_code': '94102'
       }::address;
   ```
4. Insert a row into the table, and specify the value for the `cust_address` column by calling the
   [OBJECT_CONSTRUCT](functions/object_construct.md) function and casting the return value to the `address`
   type:

   ```sqlexample
   INSERT INTO customers_udt_test (cust_id, cust_name, cust_address)
     SELECT
       '1001',
       'Example2 Inc',
       OBJECT_CONSTRUCT(
         'street', '555 Polar Bear Street',
         'city', 'New York',
         'state_abbr', 'NY',
         'zip_code', '10001'
       )::address;
   ```
5. Insert a row into the table, and specify the value for the `cust_address` column by casting an OBJECT constant
   to the OBJECT type, which is the base type of the `address` type. It is usually easier to cast an OBJECT
   constant to the user-defined type, but this example shows that the OBJECT constant is coerced to the user-defined
   type:

   ```sqlexample
   INSERT INTO customers_udt_test (cust_id, cust_name, cust_address)
     SELECT
       '1002',
       'Example3 Inc',
       {
         'street': '909 Flake Street',
         'city': 'Seattle',
         'state_abbr': 'WA',
         'zip_code': '98109'
       }::OBJECT(
            street VARCHAR(100),
            city VARCHAR(50),
            state_abbr CHAR(2),
            zip_code CHAR(10));
   ```
6. To show the inserted rows, query the table:

   ```sqlexample
   SELECT * FROM customers_udt_test;
   ```

   ```output
   +---------+--------------+--------------------------------------+
   | CUST_ID | CUST_NAME    | CUST_ADDRESS                         |
   |---------+--------------+--------------------------------------|
   | 1000    | Example1 Inc | {                                    |
   |         |              |   "street": "101 Snow Street",       |
   |         |              |   "city": "San Francisco",           |
   |         |              |   "state_abbr": "CA",                |
   |         |              |   "zip_code": "94102"                |
   |         |              | }                                    |
   | 1001    | Example2 Inc | {                                    |
   |         |              |   "street": "555 Polar Bear Street", |
   |         |              |   "city": "New York",                |
   |         |              |   "state_abbr": "NY",                |
   |         |              |   "zip_code": "10001"                |
   |         |              | }                                    |
   | 1002    | Example3 Inc | {                                    |
   |         |              |   "street": "909 Flake Street",      |
   |         |              |   "city": "Seattle",                 |
   |         |              |   "state_abbr": "WA",                |
   |         |              |   "zip_code": "98109"                |
   |         |              | }                                    |
   +---------+--------------+--------------------------------------+
   ```
7. Query the table, and use the colon operator to show only the `zip_code` values in the `address` data:

   ```sqlexample
   SELECT cust_id,
          cust_name,
          cust_address:zip_code
     FROM customers_udt_test;
   ```

   ```output
   +---------+--------------+-----------------------+
   | CUST_ID | CUST_NAME    | CUST_ADDRESS:ZIP_CODE |
   |---------+--------------+-----------------------|
   | 1000    | Example1 Inc | 94102                 |
   | 1001    | Example2 Inc | 10001                 |
   | 1002    | Example3 Inc | 98109                 |
   +---------+--------------+-----------------------+
   ```

### Using set operators and conditional expression functions with user-defined types

When [set operators](operators-query.md) or [conditional expression functions](expressions-conditional.md)
evaluate values of Snowflake types and user-defined types, the types must be compatible and coercible into a single type. The resulting
output is of the Snowflake base type, unless it is explicitly cast to a user-defined type. For more information, see
General usage notes for user-defined types.

The examples in this section use set operators and conditional expressions with user-defined types.
First, create several user-defined types with various base types:

```sqlexample
CREATE TYPE us_zipcode AS VARCHAR;
CREATE TYPE uk_postcode AS VARCHAR;
CREATE TYPE positive_integer AS INTEGER;
CREATE TYPE positive_number AS NUMBER;
```

The following query calls the IFF function. The call evaluates a value of the `us_zipcode` user-defined
type and a value of a compatible Snowflake type. The query uses the [SYSTEM$TYPEOF](functions/system_typeof.md) function to
show that the result is Snowflake base type VARCHAR:

```sqlexample
SELECT IFF(TRUE, '90210'::us_zipcode, '10006') AS result,
       SYSTEM$TYPEOF(IFF(TRUE, '90210'::us_zipcode, '10006')) AS type;
```

```output
+--------+-------------------------+
| RESULT | TYPE                    |
|--------+-------------------------|
| 90210  | VARCHAR(134217728)[LOB] |
+--------+-------------------------+
```

The following query is the same as the previous query, but it casts the result to the `us_zipcode` user-defined type:

```sqlexample
SELECT IFF(TRUE, '90210'::us_zipcode, '10006')::us_zipcode AS result,
       SYSTEM$TYPEOF(IFF(TRUE, '90210'::us_zipcode, '10006')::us_zipcode) AS type;
```

```output
+--------+-------------------------------+
| RESULT | TYPE                          |
|--------+-------------------------------|
| 90210  | MYDB.MYSCHEMA.US_ZIPCODE[LOB] |
+--------+-------------------------------+
```

The following query contains a CASE expression that evaluates different but compatible user-defined
types and returns a value of a Snowflake base type:

```sqlexample
SELECT CASE
         WHEN TRUE THEN 'SW1A 0AA'::uk_postcode
           ELSE '90210'::us_zipcode
         END AS result,
       SYSTEM$TYPEOF(CASE
         WHEN TRUE THEN 'SW1A 0AA'::uk_postcode
           ELSE '90210'::us_zipcode
         END) AS type;
```

```output
+----------+-------------------------+
| RESULT   | TYPE                    |
|----------+-------------------------|
| SW1A 0AA | VARCHAR(134217728)[LOB] |
+----------+-------------------------+
```

The following query is the same as the previous query, but it casts the result to the `uk_postcode` user-defined type:

```sqlexample
SELECT CAST(CASE
         WHEN TRUE THEN 'SW1A 0AA'::uk_postcode
           ELSE '90210'::us_zipcode
         END AS uk_postcode) AS result,
       SYSTEM$TYPEOF(CAST(CASE
         WHEN TRUE THEN 'SW1A 0AA'::uk_postcode
           ELSE '90210'::us_zipcode
         END AS uk_postcode)) AS type;
```

```output
+----------+--------------------------------------------+
| RESULT   | TYPE                                       |
|----------+--------------------------------------------|
| SW1A 0AA | MYDB.MYSCHEMA.UK_POSTCODE[LOB]             |
+----------+--------------------------------------------+
```

The following query contains a COALESCE expression that evaluates different but compatible user-defined
types and returns a value of a Snowflake base type:

```sqlexample
SELECT COALESCE(
         5::positive_integer,
         10::positive_number) AS result,
       SYSTEM$TYPEOF(COALESCE(
         5::positive_integer,
         10::positive_number)) AS type;
```

```output
+--------+-------------------+
| RESULT | TYPE              |
|--------+-------------------|
|      5 | NUMBER(38,0)[SB1] |
+--------+-------------------+
```

The following query is the same as the previous query, but it casts the result to the `positive_number` user-defined type:

```sqlexample
SELECT CAST(COALESCE(
         5::positive_integer,
         10::positive_number
       ) AS positive_number) AS result,
       SYSTEM$TYPEOF(CAST(COALESCE(
         5::positive_integer,
         10::positive_number
       ) AS positive_number)) AS type;
```

```output
+--------+------------------------------------+
| RESULT | TYPE                               |
|--------+------------------------------------|
|      5 | MYDB.MYSCHEMA.POSITIVE_NUMBER[SB1] |
+--------+------------------------------------+
```

---
title: Using binary data
source: https://docs.snowflake.com/en/sql-reference/binary-examples.md
section: SQL General Reference
---

# Using binary data

The usefulness and flexibility of the BINARY data type is best demonstrated by example. This topic provides
practical examples of tasks that involve the BINARY data type and its three encoding schemes.

## Converting between hex and base64

The BINARY data type can be used as an intermediate step when converting between hex and base64 strings.

Convert from hex to base64 using [TO_CHAR](functions/to_char.md):

```sqlexample
SELECT c1, TO_CHAR(TO_BINARY(c1, 'hex'), 'base64') FROM hex_strings;
```

```output
+----------------------+-----------------------------------------+
| C1                   | TO_CHAR(TO_BINARY(C1, 'HEX'), 'BASE64') |
|----------------------+-----------------------------------------|
| df32ede209ed5a4e3c25 | 3zLt4gntWk48JQ==                        |
| AB4F3C421B           | q088Qhs=                                |
| 9324df2ecc54         | kyTfLsxU                                |
+----------------------+-----------------------------------------+
```

Convert from base64 to hex:

```sqlexample
SELECT c1, TO_CHAR(TO_BINARY(c1, 'base64'), 'hex') FROM base64_strings;
```

```output
+------------------+-----------------------------------------+
| C1               | TO_CHAR(TO_BINARY(C1, 'BASE64'), 'HEX') |
|------------------+-----------------------------------------|
| 3zLt4gntWk48JQ== | DF32EDE209ED5A4E3C25                    |
| q088Qhs=         | AB4F3C421B                              |
| kyTfLsxU         | 9324DF2ECC54                            |
+------------------+-----------------------------------------+
```

## Converting between text and UTF-8 bytes

Strings in Snowflake are composed of Unicode characters, while binary values are composed of bytes. By converting a
string to a binary value with the UTF-8 format, you can directly manipulate the bytes that make up the Unicode characters.

Convert single-character strings to their UTF-8 representation in bytes using [TO_BINARY](functions/to_binary.md):

```sqlexample
SELECT c1, TO_BINARY(c1, 'utf-8') FROM characters;
```

```output
+----+------------------------+
| C1 | TO_BINARY(C1, 'UTF-8') |
|----+------------------------|
| a  | 61                     |
| é  | C3A9                   |
| ❄  | E29D84                 |
| π  | CF80                   |
+----+------------------------+
```

Convert a UTF-8 byte sequence to a string using [TO_CHAR , TO_VARCHAR](functions/to_char.md):

```sqlexample
SELECT TO_CHAR(X'41424320E29D84', 'utf-8');
```

```output
+-------------------------------------+
| TO_CHAR(X'41424320E29D84', 'UTF-8') |
|-------------------------------------|
| ABC ❄                               |
+-------------------------------------+
```

## Getting the MD5 digest in base64

Convert the binary MD5 digest to a base64 string using [TO_CHAR](functions/to_char.md):

```sqlexample
SELECT TO_CHAR(MD5_BINARY(c1), 'base64') FROM variants;
```

```output
+----------+-----------------------------------+
| C1       | TO_CHAR(MD5_BINARY(C1), 'BASE64') |
|----------+-----------------------------------|
| 3        | 7MvIfktc4v4oMI/Z8qe68w==          |
| 45       | bINJzHJgrmLjsTloMag5jw==          |
| "abcdef" | 6AtQFwmJUPxYqtg8jBSXjg==          |
| "côté"   | H6G3w1nEJsUY4Do1BFp2tw==          |
+----------+-----------------------------------+
```

## Convert to binary with variable format

Convert strings to binary values using a binary format extracted from the string. The statement includes
the [TRY_TO_BINARY](functions/try_to_binary.md) and [SPLIT_PART](functions/split_part.md)
functions:

```sqlexample
SELECT c1,
       TRY_TO_BINARY(SPLIT_PART(c1, ':', 2), SPLIT_PART(c1, ':', 1)) AS binary_value
  FROM strings;
```

```output
+-------------------------+----------------------+
| C1                      | BINARY_VALUE         |
|-------------------------+----------------------|
| hex:AB4F3C421B          | AB4F3C421B           |
| base64:c25vd2ZsYWtlCg== | 736E6F77666C616B650A |
| utf8:côté               | 63C3B474C3A9         |
| ???:abc                 | NULL                 |
+-------------------------+----------------------+
```

Try multiple formats for the conversion:

```sqlexample
SELECT c1,
       COALESCE(
         x'00' || TRY_TO_BINARY(c1, 'hex'),
         x'01' || TRY_TO_BINARY(c1, 'base64'),
         x'02' || TRY_TO_BINARY(c1, 'utf-8')) AS binary_value
  FROM strings;
```

```output
+------------------+------------------------+
| C1               | BINARY_VALUE           |
|------------------+------------------------|
| ab4f3c421b       | 00AB4F3C421B           |
| c25vd2ZsYWtlCg== | 01736E6F77666C616B650A |
| côté             | 0263C3B474C3A9         |
| 1100             | 001100                 |
+------------------+------------------------+
```

> **Note:**
>
> Since the above queries use [TRY_TO_BINARY](functions/try_to_binary.md), the result is NULL if the format
> isn’t recognized or if the string can’t be parsed with the given format.

Convert the results from the previous example back to strings using [SUBSTR](functions/substr.md)
and [DECODE](functions/decode.md):

```sqlexample
SELECT c1,
       TO_CHAR(
       SUBSTR(c1, 2),
       DECODE(SUBSTR(c1, 1, 1), x'00', 'hex', x'01', 'base64', x'02', 'utf-8')) AS string_value
  FROM bin;
```

```output
+------------------------+------------------+
| C1                     | STRING_VALUE     |
|------------------------+------------------|
| 00AB4F3C421B           | AB4F3C421B       |
| 01736E6F77666C616B650A | c25vd2ZsYWtlCg== |
| 0263C3B474C3A9         | côté             |
| 001100                 | 1100             |
+------------------------+------------------+
```

## Custom decoding with JavaScript UDF

The BINARY data type allows the storage of arbitrary data. Since JavaScript UDFs support the data type via `Uint8Array`
(see [Introduction to JavaScript UDFs](../developer-guide/udf/javascript/udf-javascript-introduction.md)), it is possible to implement custom decoding
logic in JavaScript. This isn’t the most efficient way to work, but it is very powerful.

Create a function that decodes based on the first byte:

```sqlexample
CREATE OR REPLACE FUNCTION my_decoder (b BINARY)
  RETURNS VARIANT
  LANGUAGE JAVASCRIPT
AS '
  IF (B[0] == 0) {
      var number = 0;
      FOR (var i = 1; i < B.length; i++) {
          number = number * 256 + B[i];
      }
      RETURN number;
  }
  IF (B[0] == 1) {
      var str = "";
      FOR (var i = 1; i < B.length; i++) {
          str += String.fromCharCode(B[i]);
      }
      RETURN str;
  }
  RETURN NULL;';
```

```sqlexample
SELECT c1, my_decoder(c1) FROM bin;
```

```output
+----------------+----------------+
| C1             | MY_DECODER(C1) |
|----------------+----------------|
| 002A           | 42             |
| 0148656C6C6F21 | "Hello!"       |
| 00FFFF         | 65535          |
| 020B1701       | null           |
+----------------+----------------+
```

---
title: Using references to authorize access on objects
source: https://docs.snowflake.com/en/sql-reference/references.md
section: SQL General Reference
---

# Using references to authorize access on objects

A reference can be used to authorize access on objects to a stored procedure, Snowflake Native App, or class instance
that does not have access to those objects by default.

## Introduction

A reference is a string that can be used as an identifier. The identifier resolves to the object being referenced.

A reference encapsulates the following:

* The object name.
* The active role used to create the object reference and any active secondary role(s) if applicable.
* The privilege(s) on the object that are specified when the reference is created.

Some scenarios where a reference might be required include:

* An [owner’s rights stored procedure](../developer-guide/stored-procedure/stored-procedures-rights.md) requires access to insert data in a table
  owned by a different role.
* An application performs data analytics and requires read access to data in tables.
* An instance of the SNOWFLAKE.ML.ANOMALY_DETECTION class requires read access to a view for training the anomaly detection ML model.

### Objects identified by name

A reference identifies an object *by name*. This means if an object is renamed after a reference is created, the reference is
invalid. However, if a new object with the same name is created, the reference *might* be valid. For example, a role
`my_role` creates a reference `my_ref1` for table `my_table1` with the SELECT
privilege. After the reference is created, table `my_table1` is dropped and a *new* table named `my_table1` is created.
The reference `my_ref1` identifies a table with the name `my_table1`. In this case, it identifies the *new*
table `my_table1`.

If the role used to create the reference, and the privilege(s) granted on `my_table1` are still valid, access to
the new `my_table1` is authorized when using the reference.

If the role and privilege(s) encapsulated in the reference are no longer valid, access to table `my_table1` cannot
be authorized and a new reference must be created for the new table.

### Privileges verified at execution time

The privileges granted to the role that created the reference are verified at the time the reference is used. For example, a role
`my_role` creates a reference to a table `t1` with the SELECT privilege. If `my_role` is dropped or
the SELECT privilege on table `t1` is revoked from `my_role`, the privileges encapsulated in the reference are no
longer valid. When the reference is passed to a stored procedure that requires the SELECT privilege on
the table, the stored procedure fails with a permissions error.

### Types of references and reference lifespan

The lifespan of a reference can be specified at creation time.

* A *transient reference* has a limited lifespan, either for the duration of the call in which the reference is passed, or
  for the duration of the session.

* A *persistent reference* has an unlimited lifespan. The reference remains valid until the object it references is dropped,
  the reference is unset, or the reference becomes invalid.

  For examples of unsetting references, see Unset a persistent reference for an application.

  A reference can become invalid for any of the following reasons:

  + The object it references is renamed.
  + The role that created the reference is dropped.
  + The role that created the reference no longer has privileges on the object.

  For more information, see Objects identified by name and Privileges verified at execution time.

### References for owner’s rights stored procedures

An [owner’s rights stored procedure](../developer-guide/stored-procedure/stored-procedures-rights.md) executes with the privileges of the *owner*
rather than the privileges of the *caller* who executes the stored procedure. In order to perform actions on a table, view, or
function that the caller has privileges to access, the caller must pass a reference to the table, view, or function. The reference
enables the stored procedure to perform actions on the object that the reference identifies with the privileges of the
creator of the reference (in this case, the caller).

### References for applications and classes

By design, applications and classes *do not* have access to objects in the account where the application is installed or
an instance of a class is created. Users can authorize access on objects to an application or class instance by creating a reference.

#### Providers and consumers of applications and classes

A *provider* creates an application and a *consumer* installs and uses an application in the consumer account. In the case of
[Snowflake classes](snowflake-db-classes.md), Snowflake is the *provider*, and a user with a Snowflake account who creates an instance
of a class is the *consumer*.

Providers can create applications and classes that request and use references in their code. For more information, see
References for providers.

Consumers can create and pass references to applications that they install in their account or to instances
of Snowflake classes. For more information, see References for consumers.

## Supported targets for references

The targets for a reference can be an object or a query. If the target of the reference is an object, privilege(s) on the object
are required for the reference.

### Supported object types and privileges for references

The following table lists the object types that a reference can include, the type of reference that can be created,
and the privileges allowed for each object:

| Object type | Transient | Persistent | Privileges allowed | Default privilege |
| --- | --- | --- | --- | --- |
| API INTEGRATION |  | ✔ | USAGE | USAGE |
| CATALOG INTEGRATION |  | ✔ | USAGE | USAGE |
| COMPUTE POOL | ✔ |  | APPLYBUDGET |  |
| DATABASE | ✔ |  | APPLYBUDGET |  |
| EXTERNAL ACCESS INTEGRATION |  | ✔ | USAGE |  |
| EXTERNAL VOLUME |  | ✔ | USAGE |  |
| EXTERNAL TABLE |  | ✔ | SELECT, REFERENCES | SELECT |
| FUNCTION | ✔ | ✔ | USAGE | USAGE |
| GIT REPOSITORY |  | ✔ | READ | READ |
| MATERIALIZED VIEW | ✔ |  | APPLYBUDGET |  |
| PIPE | ✔ |  | APPLYBUDGET | APPLYBUDGET |
| POLICY | ✔ |  | MANAGE POLICY |  |
| PROCEDURE | ✔ | ✔ | USAGE | USAGE |
| ROW ACCESS POLICY | ✔ |  | APPLY |  |
| SCHEMA | ✔ |  | APPLYBUDGET |  |
| SECRET |  | ✔ | USAGE, READ |  |
|  | ✔ |  | READ |  |
| STAGE |  | ✔ | READ, WRITE | READ |
| TABLE | ✔ |  | APPLYBUDGET, REBUILD, EVOLVESCHEMA |  |
|  | ✔ | ✔ | SELECT, INSERT, UPDATE, DELETE, TRUNCATE, REFERENCES | SELECT |
| TAG | ✔ |  | APPLYBUDGET |  |
| TASK | ✔ |  | APPLYBUDGET | APPLYBUDGET |
| VIEW | ✔ | ✔ | SELECT, REFERENCES | SELECT |
| WAREHOUSE | ✔ |  | APPLYBUDGET |  |
|  |  | ✔ | MODIFY, MONITOR, OPERATE, USAGE | USAGE |

### Query references

A *query reference* is a type of transient reference. It references a SELECT statement that can be used in the FROM clause of
another SQL statement in a stored procedure. You can create a query reference by using the
[SYSTEM$QUERY_REFERENCE](functions/system_query_reference.md) function or the TABLE keyword.

For more information, see [Using query references](../developer-guide/stored-procedure/stored-procedures-calling-references.md) and [Using the TABLE keyword to create a reference to a table, view, or query](../developer-guide/stored-procedure/stored-procedures-calling-references.md).

## References for providers

You can create applications as a provider by using the [Snowflake Native App Framework](../developer-guide/native-apps/native-apps-about.md).
For detailed information on requesting references from a consumer of your application, see
[Request references and object-level privileges from consumers](../developer-guide/native-apps/requesting-refs.md).

## References for consumers

You can create a reference by using the [SYSTEM$REFERENCE](functions/system_reference.md) function. You can pass the string identifier
that the function returns to a stored procedure, application, or class instance. Alternatively, you can pass in the statement that
creates the reference in place of the string identifier.

> **Note:**
>
> If you are passing a reference to a stored procedure, you can use the TABLE keyword (rather than calling the SYSTEM$REFERENCE
> function) to create the reference. See [Using the TABLE keyword to create a reference to a table, view, or query](../developer-guide/stored-procedure/stored-procedures-calling-references.md).

### Examples

Create a transient reference with session scope to table `t1` with the SELECT privilege:

```sqlexample
SELECT SYSTEM$REFERENCE('TABLE', 't1', 'SESSION', 'SELECT');
```

To create a reference to the same table for the lifetime of the scope in which it is referenced (for example, if you pass it to
a stored procedure, its lifetime would be that of the outermost block of the stored procedure), execute the following statement:

```sqlexample
SELECT SYSTEM$REFERENCE('TABLE', 't1', 'CALL', 'SELECT');
```

Create a persistent reference to a table `t1` with the INSERT privilege to pass to an application:

```sqlexample
SELECT SYSTEM$REFERENCE('TABLE', 't1', 'PERSISTENT', 'INSERT');
```

Create a query reference to pass to a stored procedure. The lifetime of this transient reference is for the outermost block of
the stored procedure to which you pass the reference to:

```sqlexample
SELECT SYSTEM$QUERY_REFERENCE('SELECT id FROM my_table', FALSE);
```

For additional examples:

* Stored procedure example, see [Background: The problem with passing objects and queries to stored procedures](../developer-guide/stored-procedure/stored-procedures-calling-references.md).
* Native App Framework application example, see
  [Associating the Reference to the Application](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs#associating-the-reference-to-the-application).
* Class instance example, see [Training an Anomaly Detection Model with Labeled Data](../user-guide/ml-functions/anomaly-detection.md).

#### Unset a persistent reference for an application

An application or a class that requires a persistent reference will also provide a method to unset the reference. Method names can vary
by implementation.

Alternatively, you can unset references by using an [ALTER APPLICATION … UNSET REFERENCES](sql/alter-application.md)
statement.

1. You can use the [SHOW REFERENCES](sql/show-references.md) command to view all references, including references that have been
   set for an application.

   For example, to view references for application `my_app`:

   ```sqlexample
   SHOW REFERENCES IN APPLICATION my_app;
   ```
2. You can unset any references that have been set for the application by using the ALTER APPLICATION command.

   For example, to unset the reference named `table_to_read` in application `my_app`:

   ```sqlexample
   ALTER APPLICATION my_app UNSET REFERENCES('table_to_read');
   ```

   For example, to unset all references in application `my_app`:

   ```sqlexample
   ALTER APPLICATION my_app UNSET REFERENCES;
   ```

## Considerations when using references

* If an object that is associated with a reference is renamed (or reparented), the reference is no longer valid.

  If a new object is created with the same name, and if the roles encoded in the reference association have the relevant
  privileges on the new object, the reference remains valid. Otherwise it fails with a permissions error.
* If an object is swapped, and the roles encoded in the reference association have the relevant privileges on the new object
  that now has the swapped name, the reference remains valid. Otherwise it fails with a permissions error.
* Object drop and undrop:

  + If an object that is associated with a reference is dropped, the reference association becomes invalid.
  + If the object is undropped, the reference association becomes valid again.
* Cloning

  You can clone a class instance, or its parent database or schema, that uses references to objects in your account.

  > + If the reference object is referenced with a fully qualified name, the instance clone refers to the original object.
  > + If the referenced object is referenced with a partially qualified or unqualified name, the instance clone might refer to
  >   the clone object, the original object, or to no real object depending on the cloning boundary.
* Replication is supported for an application or a database containing a class instance that uses a reference to objects
  in the consumer account.

  References function correctly in the target account as long as the following objects are replicated:

  + Application or class instance.
  + Referenced object.
  + Role that created the reference.

  These objects can be replicated in different replication or failover groups. Once all objects are replicated, the reference
  is usable.

For providers of Snowflake Native App Framework applications, see also [Considerations when using references](../developer-guide/native-apps/requesting-refs.md).

## Monitoring usage of references

You can view references requested by an application by using the [SHOW REFERENCES](sql/show-references.md) command. If you have set
any references for an application, the output will include information about the object, database, schema, and the identifier for each
reference.

For example, to view references in application `my_app`:

```sqlexample
SHOW REFERENCES IN APPLICATION my_app;
```

---
title: Using request and response translators with data for a remote service
source: https://docs.snowflake.com/en/sql-reference/external-functions-translators.md
section: SQL General Reference
---

# Using request and response translators with data for a remote service

With request and response translators, you can change the format of data sent to, and received from, remote services used by
external functions.

## Purpose

When Snowflake sends data to a remote service, Snowflake formats the data according to
[these rules](external-functions-data-format.md). Similarly, when Snowflake receives data from a remote service,
Snowflake expects the data to be formatted according to the same rules.

Many remote services expect to handle data in a different format. With request and response translators, you can conveniently:

* Convert data from Snowflake’s format to the remote service’s native input format (*request translator*).
* Convert data from the remote service’s native output format to Snowflake’s format (*response translator*).

## SQL implementation

To translate data between Snowflake’s format and the remote service’s native input format, you use JavaScript [UDFs](../developer-guide/udf/udf-overview.md)
(user-defined functions). You almost always write a pair of UDFs: one to translate the request and one to translate the response.

Snowflake calls these functions as part of each external function call. For example, for a request to a remote service, Snowflake calls the request translator function,
passes it the Snowflake-formatted data, then takes the returned data and sends it to the remote service. When the
remote service returns data, Snowflake calls the response translator function to convert the data back to the format that Snowflake
understands.

From the user perspective, calling an external function when a translator is converting is the same as calling an
external function without a translator. After you specify translators as part of the
[CREATE EXTERNAL FUNCTION](sql/create-external-function.md) statement, they are called automatically.

An external function can have a maximum of one request translator and one response translator at a time.

The request and response translator UDFs can be [secure UDFs](../developer-guide/secure-udf-procedure.md).

### Assigning a translator function to an external function

To specify which user-defined function to use as a translator, include `REQUEST_TRANSLATOR` and `RESPONSE_TRANSLATOR`
clauses in the `CREATE EXTERNAL FUNCTION` statement. Each takes the name of the
translator function to use at run time.

For example:

> ```sqlexample
> CREATE EXTERNAL FUNCTION f(...)
>     RETURNS OBJECT
>     ...
>     REQUEST_TRANSLATOR = my_request_translator_udf
>     RESPONSE_TRANSLATOR = my_response_translator_udf
>     ...
>     AS <url_of_proxy_and_resource>;
> ```

The syntax for specifying translators as part of a `CREATE EXTERNAL FUNCTION` statement is shown below:

> ```sqlsyntax
> CREATE EXTERNAL FUNCTION f(...)
>     RETURNS OBJECT
>     ...
>     [ REQUEST_TRANSLATOR = <request_translator_udf_name> ]
>     [ RESPONSE_TRANSLATOR = <response_translator_udf_name> ]
>     ...
> ```
>
> where:
>
> > `request_translator_udf_name`
> > :   The name of the request translator function.
> >
> > `response_translator_udf_name`
> > :   The name of the response translator function.

The `REQUEST_TRANSLATOR` and `RESPONSE_TRANSLATOR` parameters each take one parameter of type [OBJECT](data-types-semistructured.md).

You can also specify a request or response translator in an [ALTER FUNCTION](sql/alter-function.md) command. You can:

* Add a translator if the external function does not already have one.
* Replace an existing translator.
* Remove a translator.

Use the `SET` keyword to add a new translator or to replace an existing translator.

To add or replace a translator:

> ```sqlsyntax
> ALTER FUNCTION ...
>     SET [REQUEST_TRANSLATOR | RESPONSE_TRANSLATOR] = <udf_name>;
> ```
>
> where
>
> > `udf_name`
> > :   The name of a previously-created JavaScript UDF.

To remove a translator:

> ```sqlsyntax
> ALTER FUNCTION ...
>     UNSET [REQUEST_TRANSLATOR | RESPONSE_TRANSLATOR];
> ```

### Requirements for the SQL

* The name of the translator function in the `CREATE EXTERNAL FUNCTION` or `ALTER FUNCTION` statement should
  be either:

  + A qualified name (e.g. MyDatabase.MySchema.MyJavaScriptUDF).
  + Defined in the same database and schema as the external function that uses them.
* When the translator is specified in a `CREATE EXTERNAL FUNCTION` or `ALTER FUNCTION` statement, the
  translator UDF must already exist. You can’t specify the name first and create the UDF later – even if you
  don’t call the external function before you create the UDF.
* A UDF used as a translator should not be dropped without first removing it from all external
  functions that use it. (At the time the external function is called, Snowflake fails with an error if the translator does not exist.)
* If the translator UDF is modified (via `ALTER FUNCTION`), it must retain the same interface requirements.
  If it does not retain the interface requirements, an exception is raised before running the external function.

## JavaScript implementation

At run time, SQL passes an [OBJECT](data-types-semistructured.md) to the translator UDF. The JavaScript code receives this as a
JavaScript object.

### Implementing a request translator

#### Request translator input properties

A translator UDF receives a JavaScript object named `event`. The object contains the following properties:

* `body`: The format of the `data` field is the same as the existing Snowflake rowset batch (i.e. an array of rows).

  For example,

  ```none
  {
    "body": {
            "data": [
                      [0,"cat"],
                      [1,"dog"]
                    ]
            }
  }
  ```

  The existing data is nested under the outer body.
* `serviceUrl`: The external function’s defined URL to call.
* `contextHeaders`: An object that contains all the context-related headers, where the names
  are the field names. For example, the object could contain the field name “SF_CONTEXT_CURRENT_DATABASE”, and the corresponding
  value would be a string containing the current database name.

#### Request translator output properties

The request translator returns an object with fields used to communicate with the external service API gateway. That object has three
optional fields:

* `body`: Defines the actual body to be passed to the service. If this is not defined, there is no body.
  The `body` value should be a string or a JSON object in the format that the remote service expects. If the value is a
  string, that string can contain internal structure (e.g. be JSON-compatible). If the value is a JSON object, that object is
  converted to a string so that it can be included as part of the HTTP POST command string.
* `urlSuffix`: Sets the suffix of the service URL, which is added to the end of the `serviceUrl` value. This suffix
  is also allowed to contain query parameters. Parameter names and values must be URL encoded. For example, if you want to set a parameter named `a`
  to value `my param` you need to do URL encoding of the space character, so the parameter would be `?a=my%20param`.
* `translatorData`: Passed from the request translator to the response translator. This field can pass context information, such as
  the input body, the service URL or suffix, or context headers.

All three fields are optional. However, as a practical matter, most request translators return at least the body data.

### Implementing a response translator

#### Response translator input properties

The input parameter for the response translator function is an object. The example below uses `EVENT`, which contains two properties:

* `body`: The response to be decoded from the external service response.
* `translatorData`: If this field is returned by the request translator, then Snowflake passes it to the response translator.

#### Response translator output properties

The response translator response is returned as an object under the `body` element; the format is the existing
external function format (array of rows). For example:

> ```none
> {
>   "body": {
>           "data": [
>                     [0, "Life"],
>                     [1, "the universe"],
>                     [2, "and everything"]
>                   ]
>            }
> }
> ```

### Requirements for the translator function

Each translator UDF must meet the following requirements:

* It must be a [JavaScript UDF](../developer-guide/udf/javascript/udf-javascript-introduction.md).
* It must take exactly one parameter of type [OBJECT](data-types-semistructured.md), which represents a batch of rows.
* It must return one value of type OBJECT, which also represents a batch of rows.
* It must be a scalar UDF (returning one row for each row (OBJECT) passed in).

  > **Note:**
  >
  > Although the translator is scalar, the OBJECT passed to the translator can (and
  > usually does) have multiple rows embedded inside the JSON in the OBJECT.
* The number and order of the rows (inside the OBJECT) returned by the response translator UDF must be the same as the number and
  order of the rows passed to the request translator UDF (inside the OBJECT).

## Example request translator and response translator

The following example shows a request translator and response translator being used to
convert data into the format required by an external service that does
sentiment analysis, [Amazon Comprehend
BatchDetectSentiment](https://docs.aws.amazon.com/comprehend/latest/dg/API_BatchDetectSentiment.html).
The request translator shapes the HTTP request to match the format that
the backend service expects.

To use translators, you’ll need an API gateway. This example uses
an API gateway that is already configured to talk to the sentiment
analysis service. For more information about how to integrate with an Amazon Web Services (AWS) service as the backend, see [Set up an API integration request using
the API Gateway
console](https://docs.aws.amazon.com/apigateway/latest/developerguide/how-to-method-settings-console.html)
in the AWS documentation.

It is helpful to get your API integration working successfully before
adding translators.

### Setup

Set up a database to hold demo data.

Choose a role that has permission to create external functions:

```sqlexample
USE ROLE ACCOUNTADMIN;
```

Specify which warehouse, database and schema to use:

```sqlexample
USE WAREHOUSE w;
USE DATABASE a;
USE SCHEMA b;
```

Create a table to hold your test sentences:

```sqlexample
CREATE TABLE demo(vc varchar);
INSERT INTO demo VALUES('Today is a good day'),('I am feeling mopey');
```

### Request body before translation

This external function doesn’t have a request translator or response translator:

```sqlexample
CREATE OR REPLACE EXTERNAL FUNCTION ComprehendSentiment(thought varchar)
RETURNS VARIANT
API_INTEGRATION = aws_comprehend_gateway
AS 'https://<MY_GATEWAY>.execute-api.us-east-1.amazonaws.com/test/comprehend_proxy';
```

You can call the external function with your test data from the demo table:

```sqlexample
SELECT ComprehendSentiment(vc), vc FROM demo;
```

The generated request body uses the Snowflake external function
data format:

```sqljson
{"body":{"data:" [[0, "Today is a good day"],[1,"I am feeling mopey"]]}}
```

However, the external sentiment analysis service expects a different
format that specifies the language and an array of strings:

```sqljson
{"body": { Language: "en", TextList: [ "Today is a good day", "I am feeling mopey"]}}
```

The next section describes how you can add a request translator to change the request body to the required format.

### Convert the request body

By using a request translator, you can convert the default input described above
(in the Snowflake data format) to the format that the external service requires.

The following SQL creates an `awscomprehendrequest_translator` translator function.

```sqlexample
CREATE OR REPLACE FUNCTION AWSComprehendrequest_translator(EVENT OBJECT)
RETURNS OBJECT
LANGUAGE JAVASCRIPT AS
'
var textlist = []
for(i = 0; i < EVENT.body.data.length; i++) {
   let row = EVENT.body.data[i];
   // row[0] is the row number and row[1] is the input text.
   textlist.push(row[1]); //put text into the textlist
}
// create the request for the service. Also pass the input request as part of the output.
return { "body": { "LanguageCode": "en", "TextList" : textlist }, "translatorData": EVENT.body }
';
```

In the request translator function, the code:

* Loops through each of the input rows. For each row, it adds the string,
  which is in `row[1]`, to the `textlist` array. The
  value at `row[0]` is the row number and it can be ignored.
* Returns a JSON body that has the language code and text
  list that matches the requirements of the external service.
* Returns data via the `translatorData` field. This is
  used by the response translator. In this example, you are sending the
  original input data. You will use the length of the input data in the
  response translator to know how many input requests there were.

You can test the request translator by calling it directly.

```sqlexample
SELECT AWSComprehendrequest_translator(parse_json('{"body":{"data": [[0, "I am so happy we got a sunny day for my birthday."], [1, "$$$$$."], [2, "Today is my last day in the old house."]]}}'));
```

The request translator puts the body into the shape expected by the external
service.

```sqljson
{"body":{
   "LanguageCode": "en",
   "TextList": [
      "I am so happy we got a sunny day for my birthday.",
      "$$$$$.",
      "Today is my last day in the old house."
               ]
         },
   "translatorData": {
      "data": [[0, "I am so happy we got a sunny day for my birthday."],
               [1, "$$$$$."],
               [2, "Today is my last day in the old house."]]
                     }
}
```

### Response body before adding a response translator

A response body from the external service looks something like this.

```sqljson
{"body":{
   "ErrorList": [ { "ErrorCode": 57, "ErrorMessage": "Language unknown", "Index": 1} ],
   "ResultList":[ { "Index": 0, "Sentiment": "POSITIVE",
                    "SentimentScore": { "Mixed": 25, "Negative": 5, "Neutral": 1, "Positive": 90 }},
                  { "Index": 2, "Sentiment": "NEGATIVE",
                    "SentimentScore": { "Mixed": 25, "Negative": 75, "Neutral": 30, "Positive": 20 }}
                ]
         }
}
```

### Convert the response body

The response translator processes the results that you get back from the
external service. The results contain a combination of errors in the
`ErrorList` and results in the `ResultList`.

The response translator code combines these results together to make a complete
set that matches the order of the rows that were passed to the external service. The
response translator returns the results in the Snowflake format.

The following SQL creates an `awscomprehendresponse_translator` translator function.

```sqlexample
CREATE OR REPLACE FUNCTION AWSComprehendresponse_translator(EVENT OBJECT)
RETURNS OBJECT
LANGUAGE JAVASCRIPT AS
'
// Combine the scored results and the errors into a single list.
var responses = new Array(EVENT.translatorData.data.length);
// output format: array of {
// "Sentiment": (POSITIVE, NEUTRAL, MIXED, NEGATIVE, or ERROR),
// "SentimentScore": <score>, "ErrorMessage": ErrorMessage }.
// If error, set ErrorMessage; otherwise, set SentimentScore.
// Insert good results into proper position.
for(i = 0; i < EVENT.body.ResultList.length; i++) {
   let row = EVENT.body.ResultList[i];
   let result = [row.Index, {"Sentiment": row.Sentiment, "SentimentScore": row.SentimentScore}]
   responses[row.Index] = result
}
// Insert errors.
for(i = 0; i < EVENT.body.ErrorList.length; i++) {
   let row = EVENT.body.ErrorList[i];
   let result = [row.Index, {"Sentiment": "Error", "ErrorMessage": row.ErrorMessage}]
   responses[row.Index] = result
}
return { "body": { "data" : responses } };
';
```

In the response translator function, the code:

* Initializes an array called `responses` with the size of the input from the
  `translatorData` array length. You sent `translatorData` from the request
  translator to the response translator to pass the original list of test strings.
* Loops through each of the non-error results and puts them into the result list.
* Loops through the error results and puts them into the result list. The result
  list has an index position which tells you what entry it is. The order of the
  produced results must match the input order. The result list also contains
  the sentiment information.

After all of the responses have been gathered, they are returned in a
JSON body in the format that Snowflake expects.

The following direct test will return a JSON body with the correct format.

```sqlexample
SELECT AWSComprehendresponse_translator(
    parse_json('{
        "translatorData": {
            "data": [[0, "I am so happy we got a sunny day for my birthday."],
                    [1, "$$$$$."],
                    [2, "Today is my last day in the old house."]]
                          }
        "body": {
            "ErrorList":  [ { "ErrorCode": 57,  "ErrorMessage": "Language unknown",  "Index": 1 } ],
            "ResultList": [
                            { "Index": 0,  "Sentiment": "POSITIVE",
                              "SentimentScore": { "Mixed": 25,  "Negative": 5,  "Neutral": 1,  "Positive": 90 }
                            },
                            { "Index": 2, "Sentiment": "NEGATIVE",
                              "SentimentScore": { "Mixed": 25,  "Negative": 75,  "Neutral": 30,  "Positive": 20 }
                            }
                          ]
            },
        }'
    )
);
```

### Assign the translators to the external function

To the external function, add the request and response translator functions by
assigning the function names as values to the `request_translator` and `response_translator`
parameters. This way, they’ll be called automatically when the external function runs.

```sqlexample
CREATE OR REPLACE EXTERNAL FUNCTION ComprehendSentiment(thought varchar)
RETURNS VARIANT
API_INTEGRATION = aws_comprehend_gateway
request_translator = db_name.schema_name.AWSComprehendrequest_translator
response_translator = db_name.schema_name.AWSComprehendresponse_translator
AS 'https://<MY_GATEWAY>.execute-api.us-east-1.amazonaws.com/test/comprehend_proxy';
```

You can describe the function to get information about it.

```sqlexample
DESCRIBE FUNCTION ComprehendSentiment(VARCHAR);
```

### Call the external function

Test the external function by calling it with a single sentence.

```sqlexample
SELECT ComprehendSentiment('Today is a good day');
```

You see the sentiment analysis results.

```sqljson
{"Sentiment": "POSITIVE",
 "SentimentScore":{"Mixed":0.002436627633869648,
                   "Negative":0.0014803812373429537,
                   "Neutral":0.015923455357551575,
                   "Positive": 0.9801595211029053}}
```

Test the external function by calling it with multiple sentences. Use the same `demo` table that you created earlier.

```sqlexample
SELECT ComprehendSentiment(vc), vc FROM demo;
```

The sentiment analysis results are displayed.

When the external function was called, the request translator automatically
converted data into the format required by the external service. Then,
the response translator automatically converted the response from the external
service back into the format required by Snowflake.

## Tips for testing request and response translators

* Test case values are typically OBJECT values (collections of key-value pairs). These should be formatted to meet the
  requirements in [these rules](external-functions-data-format.md).
* You can start testing your request translator or response translator by passing in an example input converted to a string. For example:

  ```sqlexample
  select my_request_translator_function(parse_json('{"body": {"data": [ [0,"cat",867], [1,"dog",5309] ] } }'));
  ```

  (The input to `PARSE_JSON()` must be a JSON-formatted string.)
* Test with `NULL` values if appropriate.

  + Include at least one SQL `NULL` value in your test cases.
  + Include at least one [JSON NULL](../user-guide/semistructured-considerations.md) value in your test cases.
* Translating a request and translating a response are often converse processes. Conceptually:

  > ```none
  > my_response_translator_udf(my_request_translator_udf(x)) = x
  > ```

  You can use this characteristic to help test your request translator and response translator if the data formats match. Create a table with
  good test values, then execute a command similar to:

  > ```none
  > SELECT test_case_column
  >     FROM test_table
  >     WHERE my_response_translator_udf(my_request_translator_udf(x)) != x;
  > ```

  The query should not return any rows.

  Note that translating a request and translating a response are not always exactly converse. For an example of where they might not be
  converse, see the discussion of converse functions in the “Usage Notes” section of the documentation for the
  [TO_JSON() function](functions/to_json.md).

---
title: UUID data type
source: https://docs.snowflake.com/en/sql-reference/data-types-uuid.md
section: SQL General Reference
---

# UUID data type

The UUID data type stores universally unique identifiers (UUIDs). A UUID is a 128-bit binary value that
uniquely identifies information. Each UUID value is designed to be globally unique, which means that
there is a very low probability that two different systems will generate the exact same UUID independently.
However, uniqueness depends on how the UUID values are generated, and the Snowflake UUID data type
itself doesn’t guarantee uniqueness. For example, a user can insert the same UUID value multiple
times without errors.

UUID values are in UUID format, which is a 36-character string of hexadecimal digits, separated by
hyphens, in the pattern 8-4-4-4-12. For example, `f353ca91-4fc5-49f2-9b9e-304f83d11914` is a string
in UUID format.

For more information about the UUID data type, see the
[Universally unique identifier](https://en.wikipedia.org/wiki/Universally_unique_identifier) Wikipedia
article.

The following considerations apply to the UUID data type:

* [Snowflake drivers](../developer-guide/drivers.md) treat UUID values as text strings.
* The ANSI literal form of UUID is supported as input.
* UUID values of any version can be inserted into tables.
* UUID values are case-insensitive.

## Specify a UUID data type

* To specify a UUID type, use the following syntax:

  ```sqlsyntax
  <column_name> UUID
  ```

  Where:

  + `column_name` is the name of a column in a table.

## Limitations for the UUID data type

The following limitations apply to the UUID data type:

* A UUID value can’t be stored in a value of a
  [semi-structured data type](data-types-semistructured.md) or
  [structured data type](data-types-structured.md).

  To store a UUID value as a string in a value of one of these types, you can [cast](data-type-conversion.md)
  the UUID value to a VARCHAR value.
* The UUID data type isn’t supported in stored procedures or user-defined functions (UDFs) written in a
  language other than SQL, such as Python or Java.
* The UUID data type isn’t supported in [hybrid tables](../user-guide/tables-hybrid.md).
* The UUID data type isn’t supported in Snowpark.
* The following features don’t support the UUID data type:

  + [Differential privacy](../user-guide/diff-privacy/differential-privacy-sql-reference.md)
  + [Sensitive data classification](../user-guide/classify-intro.md)

## Examples for the UUID data type

The following examples insert UUID values into tables:

* Insert a UUID value into a table
* Automatically generate UUID values when you insert rows into a table

### Insert a UUID value into a table

* Create a table with a column of UUID type and insert a UUID value:

  ```sqlexample
  CREATE TABLE sample_uuid_table(uuid_col UUID);

  INSERT INTO sample_uuid_table VALUES ('c73d9175-0a1d-48c6-8d30-df165461328b');
  ```

### Automatically generate UUID values when you insert rows into a table

The following example shows you how to automatically generate UUID values when you insert rows into a table:

1. Create a table that uses the [UUID_STRING](functions/uuid_string.md) function to
   generate a UUID value for each row inserted into the table:

   ```sqlexample
   CREATE OR REPLACE TABLE sample_generate_uuid (
     id UUID DEFAULT UUID_STRING() NOT NULL,
     sample_column VARCHAR);
   ```
2. Insert values into the table and omit the `id` column so that a UUID value is generated and
   inserted automatically:

   ```sqlexample
   INSERT INTO sample_generate_uuid (sample_column) VALUES
     ('value_a'),
     ('value_b');
   ```
3. Query the table to view the generated UUID values:

   ```sqlexample
   SELECT * FROM sample_generate_uuid;
   ```

   ```output
   +--------------------------------------+---------------+
   | ID                                   | SAMPLE_COLUMN |
   |--------------------------------------+---------------|
   | f353ca91-4fc5-49f2-9b9e-304f83d11914 | value_a       |
   | da563283-e201-4744-b158-221dd204a61f | value_b       |
   +--------------------------------------+---------------+
   ```

---
title: VALUES
source: https://docs.snowflake.com/en/sql-reference/constructs/values.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# VALUES

In the SELECT statement, the VALUES subclause of the FROM clause lets you
specify a set of constants to form a finite set of rows.

For information about the VALUES clause in the INSERT statement, see
the documentation for the [INSERT](../sql/insert.md) statement.

## Syntax

```sqlsyntax
SELECT ...
FROM ( VALUES ( <expr> [ , <expr> [ , ... ] ] ) [ , ( ... ) ] )
  [ [ AS ] <table_alias> [ ( <column_alias> [ , ... ] ) ] ]
[ ... ]
```

## Parameters

`expr`
:   Each expression must be a constant, or an expression that can be evaluated as a constant during compilation of the
    SQL statement.

    Most simple arithmetic expressions and string functions can be evaluated at compile time, but most other expressions
    can’t.

`table_alias`
:   An optional alias to give the set of rows a name, as though the set of rows were a table.

`column_alias`
:   Optional column aliases can specify the columns names.

## Usage notes

* Inside a [FROM](from.md) clause, a VALUES clause can’t contain the `DEFAULT` keyword. This limitation is in contrast
  to a VALUES clause in an [INSERT](../sql/insert.md) statement, which supports the use
  of `DEFAULT`; for example, `INSERT INTO table VALUES (10, DEFAULT, 'Name') ...`.
* When a VALUES clause includes multiple values of different data types for the same column, Snowflake determines a
  common data type that can encompass all values and implicitly converts each value to that common type. This conversion
  can produce unexpected results or errors if you aren’t aware of it.

  To avoid unexpected coercion, explicitly CAST each value to the desired type, separate the values into multiple SQL
  statements, or ensure that all values in a column share the same type.

  **Numeric example**

  When numeric values in the same column differ significantly in scale or precision, Snowflake might return an
  `out of range` error because a value doesn’t fit in the determined common numeric type.

  ```sqlexample
  SELECT column1 FROM VALUES
    (3.469446951953614e-18),
    (115898.73);
  ```

  ```output
  100039 (22003): Numeric value '115898.73' is out of range
  ```

  For numeric values specifically, you can also specify values as text strings in quotation marks and then convert them
  to numeric values as needed.

  **Timestamp example**

  When timestamp values of different types appear in the same column, Snowflake converts all values to a common
  timestamp type. In the following example, a TIMESTAMP_NTZ value is coerced to TIMESTAMP_LTZ:

  ```sqlexample
  SELECT $1 AS a, SYSTEM$TYPEOF(a) FROM VALUES
    (TO_TIMESTAMP_LTZ('2025-03-24 01:37:00 -0700')),
    (TO_TIMESTAMP_NTZ('2025-03-24 08:37:00'));
  ```

  ```output
  +-------------------------------+------------------+
  | A                             | SYSTEM$TYPEOF(A) |
  |-------------------------------+------------------|
  | 2025-03-24 01:37:00.000 -0700 | TIMESTAMP_LTZ(9) |
  | 2025-03-24 08:37:00.000 -0700 | TIMESTAMP_LTZ(9) |
  +-------------------------------+------------------+
  ```
* The VALUES clause is limited to 200,000 rows.

## Examples

The following examples use the VALUES clause to generate a fixed, known set of rows:

```sqlexample
SELECT * FROM (VALUES (1, 'one'), (2, 'two'), (3, 'three'));
```

```output
+---------+---------+
| COLUMN1 | COLUMN2 |
|---------+---------|
|       1 | one     |
|       2 | two     |
|       3 | three   |
+---------+---------+
```

You can reference values either by column name (implicit) or column position. The following
example references the second column by column position:

```sqlexample
SELECT column1, $2 FROM (VALUES (1, 'one'), (2, 'two'), (3, 'three'));
```

```output
+---------+-------+
| COLUMN1 | $2    |
|---------+-------|
|       1 | one   |
|       2 | two   |
|       3 | three |
+---------+-------+
```

The following example distinguishes multiple VALUES clauses by using aliases:

```sqlexample
SELECT v1.$2, v2.$2
  FROM (VALUES (1, 'one'), (2, 'two')) AS v1
        INNER JOIN (VALUES (1, 'One'), (3, 'three')) AS v2
  WHERE v2.$1 = v1.$1;
```

You can also specify aliases for the column names, as shown in the following example:

```sqlexample
SELECT c1, c2
  FROM (VALUES (1, 'one'), (2, 'two')) AS v1 (c1, c2);
```

---
title: Vector data types
source: https://docs.snowflake.com/en/sql-reference/data-types-vector.md
section: SQL General Reference
---

# Vector data types

This topic describes the vector data types.

## Data types

Snowflake supports a single vector data type, VECTOR.

> **Note:**
>
> The VECTOR data type is only supported in SQL, the [Python connector](../developer-guide/python-connector/python-connector-example.md), and
> the Snowpark Python library. No other languages are supported.

### VECTOR

With the VECTOR data type, Snowflake encodes and processes vectors efficiently. This data type supports semantic vector
search and retrieval applications, such as RAG-based applications, and common operations on vectors in vector-processing applications.

To specify a VECTOR type, use the following syntax:

```sqlsyntax
VECTOR( <type>, <dimension> )
```

Where:

* `type` is the Snowflake data type of the elements, which can be 32-bit integers or 32-bit floating-point numbers.

  You can specify one of the following types:

  > + INT
  > + FLOAT

  > **Note:**
  >
  > These types are distinct from the types with the same names described in [Numeric data types](data-types-numeric.md),
  > which represent `NUMBER(38, 0)` and double-precision IEEE 754 floating-point numbers, respectively.
* `dimension` is the dimension (length) of the vector. This must be a positive integer value with a maximum value of 4096.

> **Note:**
>
> Direct vector comparisons (for example, v1 < v2) are byte-wise lexicographic and, although deterministic, won’t produce the results
> that you might expect from number comparisons. So although you can use VECTOR columns in ORDER BY clauses, for vector comparisons, use the
> [vector similarity functions](functions-vector.md) provided.

The following definitions are examples of valid vector definitions:

* Define a vector of 256 32-bit floating-point values:

  ```sqlexample
  VECTOR(FLOAT, 256)
  ```
* Define a vector of 16 32-bit integer values:

  ```sqlexample
  VECTOR(INT, 16)
  ```

The following definitions are examples of invalid vector definitions:

* A vector definition using an invalid value type:

  ```sqlexample
  VECTOR(STRING, 256)
  ```
* A vector definition using an invalid vector size:

  ```sqlexample
  VECTOR(INT, -1)
  ```

## Vector conversion

This section describes how to convert to and from a VECTOR value. For details about casting, see [Data type conversion](data-type-conversion.md).

### Converting a value to a VECTOR value

VECTOR values can be explicitly cast from the following types:

* [ARRAY](data-types-semistructured.md)
* [Structured ARRAY](data-types-structured.md)
* [Variant containing an ARRAY](data-types-semistructured.md)

### Converting a value from a VECTOR value

VECTOR values can be explicitly cast to the following types:

* [ARRAY](data-types-semistructured.md)
* [Structured ARRAY](data-types-structured.md)

When converting between VECTOR and ARRAY types, the number of elements in the array must match the dimension of the vector, and the data type of the array elements must match the data type of the vector. For example, you can convert a VECTOR(FLOAT, 3) value to an ARRAY of three FLOAT values, but not to an ARRAY of two FLOAT values or an ARRAY of three INT values.

## Loading and unloading vector data

Directly loading and unloading a VECTOR column isn’t supported. For VECTOR columns, you must load and unload
data as an ARRAY and then cast it to a VECTOR when you use it. To learn how to load and unload ARRAY data types, see
[Introduction to loading semi-structured data](../user-guide/semistructured-intro.md). A common use case for vectors is to generate a
[vector embedding](../user-guide/snowflake-cortex/vector-embeddings.md).

The following example shows how to unload a table with a VECTOR column to an internal stage named `mystage`:

```sqlexample
CREATE OR REPLACE TABLE myvectortable (a VECTOR(float, 3), b VECTOR(float, 3));
INSERT INTO myvectortable SELECT [1.1,2.2,3]::VECTOR(FLOAT,3), [1,1,1]::VECTOR(FLOAT,3);
INSERT INTO myvectortable SELECT [1,2.2,3]::VECTOR(FLOAT,3), [4,6,8]::VECTOR(FLOAT,3);

COPY INTO @mystage/unload/
  FROM (SELECT TO_ARRAY(a), TO_ARRAY(b) FROM myvectortable);
```

The following example shows how to load a table from a stage and then cast the ARRAY columns as VECTOR columns:

```sqlexample
CREATE OR REPLACE TABLE arraytable (a ARRAY, b ARRAY);

COPY INTO arraytable
  FROM @mystage/unload/mydata.csv.gz;

SELECT a::VECTOR(FLOAT, 3), b::VECTOR(FLOAT, 3)
  FROM arraytable;
```

## Examples

Construct a VECTOR by casting a constant ARRAY:

```sqlexample
SELECT [1, 2, 3]::VECTOR(FLOAT, 3) AS vec;
```

Add a column with the VECTOR data type:

```sqlexample
ALTER TABLE myissues ADD COLUMN issue_vec VECTOR(FLOAT, 768);

UPDATE TABLE myissues
  SET issue_vec = SNOWFLAKE.CORTEX.EMBED_TEXT_768('e5-base-v2', issue_text);
```

## Limitations

The following limitations apply to VECTOR data:

* The VECTOR data type has limited language support. Languages not represented in this table aren’t supported.

  | Snowflake feature | Python | SQL |
  | --- | --- | --- |
  | UDFs | ✔ | ✔ |
  | UDTFs | ✔ | ✔ |
  | Drivers/Connectors | ✔ | ✔ |
  | Snowpark API | ✔ |  |
* Vectors aren’t supported in VARIANT columns.
* Vectors aren’t supported as [clustering keys](../user-guide/tables-clustering-keys.md).
* Server-side binding isn’t supported. This means that when writing to a VECTOR column through a Snowflake driver, you must cast the
  VECTOR values in the query before running the query.
* Vectors are allowed in [hybrid tables](../user-guide/tables-hybrid.md), but not as primary keys or secondary index keys.
* The VECTOR data type isn’t supported for use with the following Snowflake features:

  + [Snowflake Scripting](../developer-guide/snowflake-scripting/index.md)
  + [Apache Iceberg™ tables](../user-guide/tables-iceberg.md)
  + [Search optimization service](../user-guide/search-optimization-service.md)
  + [Snowpipe](../user-guide/data-load-snowpipe-intro.md)
  + [Bind variables](bind-variables.md)

---
title: Vector functions
source: https://docs.snowflake.com/en/sql-reference/functions-vector.md
section: SQL General Reference
---

# Vector functions

Snowflake provides both similarity and element-wise aggregation functions for the [VECTOR](data-types-vector.md) data type. These functions allow for finding vectors nearest to a source vector, used for semantic search and fine-tuning generative responses from LLMs and generative AI.

Similarity functions operate on two VECTOR arguments of equal element type and dimension, computing the specified metric. Snowflake provides the following vector similarity functions:

> * [VECTOR_INNER_PRODUCT](functions/vector_inner_product.md)
> * [VECTOR_L1_DISTANCE](functions/vector_l1_distance.md)
> * [VECTOR_L2_DISTANCE](functions/vector_l2_distance.md)
> * [VECTOR_COSINE_SIMILARITY](functions/vector_cosine_similarity.md)

Vector manipulation functions take an existing vector and return a new vector with different properties, such as truncation or normalization. Snowflake provides the following vector manipulation functions:

> * [VECTOR_TRUNCATE](functions/vector_truncate.md)
> * [VECTOR_NORMALIZE](functions/vector_normalize.md)

Vector aggregate functions operate on columns of VECTOR values to perform element-wise mathematical operations such as sum, average, minimum, and maximum across all vectors in a group. Snowflake provides the following vector aggregation functions:

> * [VECTOR_SUM](functions/vector_sum.md)
> * [VECTOR_MIN](functions/vector_min.md)
> * [VECTOR_MAX](functions/vector_max.md)
> * [VECTOR_AVG](functions/vector_avg.md)

> **Note:**
>
> Vector functions on Snowflake are optimized in a way that can reduce floating point precision. These functions have a margin of error up to `1e-4`.

## List of functions

| Function Name | Notes |
| --- | --- |
| [VECTOR_INNER_PRODUCT](functions/vector_inner_product.md) |  |
| [VECTOR_L1_DISTANCE](functions/vector_l1_distance.md) |  |
| [VECTOR_L2_DISTANCE](functions/vector_l2_distance.md) |  |
| [VECTOR_COSINE_SIMILARITY](functions/vector_cosine_similarity.md) | Not supported in Snowpark API. |
| [VECTOR_TRUNCATE](functions/vector_truncate.md) |  |
| [VECTOR_NORMALIZE](functions/vector_normalize.md) |  |
| [VECTOR_SUM](functions/vector_sum.md) |  |
| [VECTOR_MIN](functions/vector_min.md) |  |
| [VECTOR_MAX](functions/vector_max.md) |  |
| [VECTOR_AVG](functions/vector_avg.md) |  |

---
title: Warehouse & resource monitor DDL
source: https://docs.snowflake.com/en/sql-reference/ddl-virtual-warehouse.md
section: SQL General Reference
---

# Warehouse & resource monitor DDL

A virtual warehouse is a cluster of compute resources. A warehouse is needed to execute certain types of SQL statements because it provides resources such as CPU, memory, and local storage.

Resource monitors can be used to control credit usage for warehouses. A resource monitor specifies a monthly credit quota, one or more credit usage thresholds, and actions to perform when the thresholds are
reached. Each resource monitor can be associated with one or more warehouses.

## Virtual warehouses

* [CREATE WAREHOUSE](sql/create-warehouse.md)
* [ALTER WAREHOUSE](sql/alter-warehouse.md)
* [DESCRIBE WAREHOUSE](sql/desc-warehouse.md)
* [DROP WAREHOUSE](sql/drop-warehouse.md)
* [USE WAREHOUSE](sql/use-warehouse.md)
* [SHOW WAREHOUSES](sql/show-warehouses.md)

## Resource monitors

* [CREATE RESOURCE MONITOR](sql/create-resource-monitor.md)
* [ALTER RESOURCE MONITOR](sql/alter-resource-monitor.md)
* [DROP RESOURCE MONITOR](sql/drop-resource-monitor.md)
* [SHOW RESOURCE MONITORS](sql/show-resource-monitors.md)

---
title: WHERE
source: https://docs.snowflake.com/en/sql-reference/constructs/where.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# WHERE

The `WHERE` clause specifies a condition that acts as a filter. You can use the `WHERE` clause to:

* Filter the result of the [FROM](from.md) clause in a [SELECT](../sql/select.md) statement.
* Specify which rows to operate on in an [UPDATE](../sql/update.md),
  [MERGE](../sql/merge.md), or [DELETE](../sql/delete.md) .

## Syntax

```sqlsyntax
...
WHERE <predicate>
[ ... ]
```

## Parameters

`predicate`
:   A [Boolean expression](../data-types-logical.md). The expression can include
    [logical operators](../operators-logical.md),
    such as `AND`, `OR`, and `NOT`.

## Usage notes

* Predicates in the WHERE clause behave as if they are evaluated after the [FROM](from.md) clause (though the optimizer
  can reorder predicates if it does not impact the results). For example, if a predicate in the WHERE clause
  references columns of a table participating in an outer join in the FROM clause, the filter operates on the rows
  returned from the join (which might be padded with NULLs).
* Use care when creating expressions that might evaluate NULLs.

  + In most contexts, the Boolean expression `NULL = NULL` returns NULL, not TRUE. Consider using
    [IS [ NOT ] NULL](../functions/is-null.md) to compare NULL values.
  + In a `WHERE` clause, if an expression evaluates to NULL, the row for that expression is removed from the result
    set (that is, it is filtered out).
* The maximum number of expressions in a list is 200,000. For example, the limit applies to the number of expressions
  in the following SELECT statement:

  ```sqlexample
  SELECT column_x
     FROM mytable
     WHERE column_y IN (<expr1>, <expr2>, <expr3> ...);
  ```

  To avoid reaching the limit, perform a join with a lookup table that contains the expression values, rather than specifying
  the values using the IN clause. For example, when the expression values in the previous example are added to a lookup table
  named `mylookuptable`, you can run the following query successfully even if the lookup table has more than 200,000 rows:

  ```sqlexample
  SELECT column_x
    FROM mytable t
    JOIN mylookuptable l
    ON t.column_y = l.values_for_comparison;
  ```

## Joins in the WHERE clause

Although the `WHERE` clause is primarily for filtering, the `WHERE` clause can also be used to express many types
of joins. For conceptual information about joins, see [Working with joins](../../user-guide/querying-joins.md).

A `WHERE` clause can specify a join by including join conditions, which are Boolean expressions that define which row(s) from one
side of the JOIN match row(s) from the other side of the join.

The following two equivalent queries show how to express an inner join in either the `WHERE` or [FROM](from.md) clause:

> ```sqlexample
> SELECT t1.c1, t2.c2
>     FROM t1, t2
>     WHERE t1.c1 = t2.c2;
>
> SELECT t1.c1, t2.c2
>     FROM t1 INNER JOIN t2
>         ON t1.c1 = t2.c2;
> ```

Outer joins can be specified by using either the `(+)` syntax in the `WHERE` clause or
the `OUTER JOIN` keywords in the [FROM](from.md) clause.

When you specify an outer join with `(+)`, the WHERE clause applies `(+)` to each join column of the table that is
“inner” (defined below).

> **Note:**
>
> The result of an outer join contains a copy of all rows from one table. In this topic, the table whose rows are preserved is
> called the “outer” table, and the other table is called the “inner” table.
>
> * In a LEFT OUTER JOIN, the left-hand table is the outer table and the right-hand table is the inner table.
> * In a RIGHT OUTER JOIN, the right-hand table is the outer table and the left-hand table is the inner table.

The following queries show equivalent left outer joins, one of which specifies the join in the `FROM` clause and one of which
specifies the join in the `WHERE` clause:

> ```sqlexample
> SELECT t1.c1, t2.c2
> FROM t1 LEFT OUTER JOIN t2
>         ON t1.c1 = t2.c2;
>
> SELECT t1.c1, t2.c2
> FROM t1, t2
> WHERE t1.c1 = t2.c2(+);
> ```

In the second query, the `(+)` is on the right hand side and identifies the inner table.

Sample output for both queries is below:

> ```sqlexample
> +-------+-------+
> | T1.C1 | T2.C2 |
> |-------+-------|
> |     1 |     1 |
> |     2 |  NULL |
> |     3 |     3 |
> |     4 |  NULL |
> +-------+-------+
> ```

If you are joining a table on multiple columns, use the `(+)` notation
on each column in the inner table (`t2` in the example below):

> > ```sqlexample
> > SELECT t1.c1, t2.c2
> > FROM t1, t2
> > WHERE t1.c1 = t2.c2 (+)
> >   AND t1.c3 = t2.c4 (+);
> > ```
>
> > **Note:**
> >
> > There are many restrictions on where the `(+)` annotation can appear; [FROM](from.md) clause outer joins are more expressive. Snowflake suggests using the
> > `(+)` notation only when porting code that already uses that notation.
> > New code should avoid that notation.
> >
> > The restrictions include:
> >
> > * You cannot use the `(+)` notation to create `FULL OUTER JOIN`; you
> >   can only create `LEFT OUTER JOIN` and `RIGHT OUTER JOIN`.
> >   The following is not valid. The statement causes the following error message:
> >   `SQL compilation error: Outer join predicates form a cycle between 'T1' and 'T2'.`
> >
> >   ```sqlexample
> >   -- NOT VALID
> >   select t1.c1
> >       from t1, t2
> >       where t1.c1 (+) = t2.c2 (+);
> >   ```
> > * If a table participates in more than one join in a query, the `(+)` notation can specify the table as the inner table in only
> >   one of those joins. The following is not valid because `t1` serves as the inner table in two joins.
> >   The statement causes the following error message:
> >   `SQL compilation error: Table 'T1' is outer joined to multiple tables: 'T3' and 'T2'.`
> >
> >   ```sqlexample
> >   -- NOT VALID
> >   select t1.c1
> >       from t1, t2, t3
> >       where t1.c1 (+) = t2.c2
> >         and t1.c1 (+) = t3.c3;
> >   ```
> >
> >   Note, however, that you can use `(+)` to identify different tables as
> >   inner tables in different joins in the same SQL statement. The following
> >   example joins three tables: `t1`, `t2`, and `t3`, two of which are
> >   inner tables (in different joins). This statement performs:
> >
> >   + A LEFT OUTER JOIN between `t1` and `t2` (where `t2` is the inner table).
> >   + A LEFT OUTER JOIN between `t2` and `t3` (where `t3` is the inner table).
> >
> >   ```sqlexample
> >   select t1.c1
> >       from t1, t2, t3
> >       where t1.c1 = t2.c2 (+)
> >         and t2.c2 = t3.c3 (+);
> >   ```

The `(+)` may be immediately adjacent to the table and column name, or it may be separated by whitespace. Both of the following
are valid:

> ```sqlexample
> where t1.c1 = t2.c2(+)
>
> where t1.c1 = t2.c2 (+)
> ```

A query can contain joins specified in both the `FROM ... ON ...` clause and the `WHERE` clause. However, specifying
joins in different clauses of the same query can make that query more difficult to read.

Support for joins in the `WHERE` clause is primarily for backwards compatibility with older queries that do not use
the `FROM ... ON ...` syntax. Snowflake recommends using `FROM ... ON ...` when writing new queries with joins.
For details, see [JOIN](join.md).

## Examples

### Simple examples of filtering

The following show some simple uses of the WHERE clause:

> ```sqlexample
> SELECT * FROM invoices
>   WHERE invoice_date < '2018-01-01';
>
> SELECT * FROM invoices
>   WHERE invoice_date < '2018-01-01'
>     AND paid = FALSE;
> ```

This example uses a subquery and shows all the invoices that have
smaller-than-average billing amounts:

> ```sqlexample
> SELECT * FROM invoices
>     WHERE amount < (
>                    SELECT AVG(amount)
>                        FROM invoices
>                    )
>     ;
> ```

### Performing joins in the WHERE clause

To specify a join in the `WHERE` clause, list the tables to be joined in the `FROM clause`, separating the tables
with a comma. Specify the join condition as a filter in the `WHERE` clause, as shown in the following example:

> ```sqlexample
> SELECT t1.col1, t2.col1
>     FROM t1, t2
>     WHERE t2.col1 = t1.col1
>     ORDER BY 1, 2;
> +------+------+
> | COL1 | COL1 |
> |------+------|
> |    2 |    2 |
> |    2 |    2 |
> |    3 |    3 |
> +------+------+
> ```

> **Note:**
>
> The comma operator is older syntax for `INNER JOIN`. The following statement shows the recommended way to
> perform a join using newer syntax. The query below is equivalent to the query above:
>
> > ```sqlexample
> > SELECT t1.col1, t2.col1
> >     FROM t1 JOIN t2
> >         ON t2.col1 = t1.col1
> >     ORDER BY 1, 2;
> > +------+------+
> > | COL1 | COL1 |
> > |------+------|
> > |    2 |    2 |
> > |    2 |    2 |
> > |    3 |    3 |
> > +------+------+
> > ```

This next section shows 3-table joins and shows the difference in behavior with 0, 1, or 2 `(+)` outer join
operators.

> Before executing the queries, create and load the tables to use in the joins:
>
> > ```sqlexample
> > create table departments (
> >     department_ID INTEGER,
> >     department_name VARCHAR,
> >     location VARCHAR
> >     );
> > insert into departments (department_id, department_name, location) values
> >     (10, 'CUSTOMER SUPPORT', 'CHICAGO'),
> >     (40, 'RESEARCH', 'BOSTON'),
> >     (80, 'Department with no employees yet', 'CHICAGO'),
> >     (90, 'Department with no projects or employees yet', 'EREHWON')
> >     ;
> >
> > create table projects (
> >     project_id integer,
> >     project_name varchar,
> >     department_id integer
> >     );
> > insert into projects (project_id, project_name, department_id) values
> >     (4000, 'Detect fake product reviews', 40),
> >     (4001, 'Detect false insurance claims', 10),
> >     (9000, 'Project with no employees yet', 80),
> >     (9099, 'Project with no department or employees yet', NULL)
> >     ;
> >
> > create table employees (
> >     employee_ID INTEGER,
> >     employee_name VARCHAR,
> >     department_id INTEGER,
> >     project_id INTEGER
> >     );
> > insert into employees (employee_id, employee_name, department_id, project_id)
> >   values
> >     (1012, 'May Aidez', 10, NULL),
> >     (1040, 'Devi Nobel', 40, 4000),
> >     (1041, 'Alfred Mendeleev', 40, 4001)
> >     ;
> > ```
>
> Execute a 3-way inner join. This does not use `(+)` (or the OUTER keyword) and is therefore an inner join. The
> output includes only rows for which there is a department, project, and employee:
>
> > ```sqlexample
> > SELECT d.department_name, p.project_name, e.employee_name
> >     FROM  departments d, projects p, employees e
> >     WHERE
> >             p.department_id = d.department_id
> >         AND
> >             e.project_id = p.project_id
> >     ORDER BY d.department_id, p.project_id, e.employee_id;
> > +------------------+-------------------------------+------------------+
> > | DEPARTMENT_NAME  | PROJECT_NAME                  | EMPLOYEE_NAME    |
> > |------------------+-------------------------------+------------------|
> > | CUSTOMER SUPPORT | Detect false insurance claims | Alfred Mendeleev |
> > | RESEARCH         | Detect fake product reviews   | Devi Nobel       |
> > +------------------+-------------------------------+------------------+
> > ```
>
> Perform an outer join. This is similar to the preceding statement except that this uses `(+)` to make the
> second join a right outer join. The effect is that if a department is included in the output, then all of that
> department’s projects are included, even if those projects have no employees:
>
> > ```sqlexample
> > SELECT d.department_name, p.project_name, e.employee_name
> >     FROM  departments d, projects p, employees e
> >     WHERE
> >             p.department_id = d.department_id
> >         AND
> >             e.project_id(+) = p.project_id
> >     ORDER BY d.department_id, p.project_id, e.employee_id;
> > +----------------------------------+-------------------------------+------------------+
> > | DEPARTMENT_NAME                  | PROJECT_NAME                  | EMPLOYEE_NAME    |
> > |----------------------------------+-------------------------------+------------------|
> > | CUSTOMER SUPPORT                 | Detect false insurance claims | Alfred Mendeleev |
> > | RESEARCH                         | Detect fake product reviews   | Devi Nobel       |
> > | Department with no employees yet | Project with no employees yet | NULL             |
> > +----------------------------------+-------------------------------+------------------+
> > ```
>
> Perform two outer joins.
> This is the same as the preceding statement except that this uses `(+)` to make both joins into
> outer joins. The effect is that all departments are included (even if they have no projects or employees yet) and
> all projects associated with departments are included (even if they have no employees yet). Note that the output
> excludes projects that have no department.
>
> > ```sqlexample
> > SELECT d.department_name, p.project_name, e.employee_name
> >     FROM  departments d, projects p, employees e
> >     WHERE
> >             p.department_id(+) = d.department_id
> >         AND
> >             e.project_id(+) = p.project_id
> >     ORDER BY d.department_id, p.project_id, e.employee_id;
> > +----------------------------------------------+-------------------------------+------------------+
> > | DEPARTMENT_NAME                              | PROJECT_NAME                  | EMPLOYEE_NAME    |
> > |----------------------------------------------+-------------------------------+------------------|
> > | CUSTOMER SUPPORT                             | Detect false insurance claims | Alfred Mendeleev |
> > | RESEARCH                                     | Detect fake product reviews   | Devi Nobel       |
> > | Department with no employees yet             | Project with no employees yet | NULL             |
> > | Department with no projects or employees yet | NULL                          | NULL             |
> > +----------------------------------------------+-------------------------------+------------------+
> > ```

(Remember, however, that Snowflake recommends using the `OUTER` keyword in the `FROM` clause rather than using
the `(+)` operator in the `WHERE` clause.)

---
title: WHILE (Snowflake Scripting)
source: https://docs.snowflake.com/en/sql-reference/snowflake-scripting/while.md
section: SQL General Reference
---

# WHILE (Snowflake Scripting)

A `WHILE` loop iterates while a specified condition is true.

For more information on loops, see [Working with loops](../../developer-guide/snowflake-scripting/loops.md).

> **Note:**
>
> This [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) construct is valid only within a
> [Snowflake Scripting block](../../developer-guide/snowflake-scripting/blocks.md).

See also:
:   [BREAK](break.md), [CONTINUE](continue.md)

## Syntax

```sqlsyntax
WHILE ( <condition> ) { DO | LOOP }
    <statement>;
    [ <statement>; ... ]
END { WHILE | LOOP } [ <label> ] ;
```

Where:

> `condition`
> :   An expression that evaluates to a BOOLEAN.
>
> `statement`
> :   A statement can be any of the following:
>
>     * A single SQL statement (including CALL).
>     * A control-flow statement (for example, a [looping](../../developer-guide/snowflake-scripting/loops.md) or
>       [branching](../../developer-guide/snowflake-scripting/branch.md) statement).
>     * A nested [block](../../developer-guide/snowflake-scripting/blocks.md).
>
> `label`
> :   An optional label. Such a label can be a jump target for a [BREAK](break.md) or
>     [CONTINUE](continue.md) statement. A label must follow the naming rules for
>     [Object identifiers](../identifiers.md).

## Usage notes

* Put parentheses around the condition in the `WHILE`. For example: `WHILE ( <condition> )`.
* If the `condition` never evaluates to FALSE, and the loop doesn’t contain a
  [BREAK (Snowflake Scripting)](break.md) command (or equivalent), then the loop will run and consume credits
  indefinitely.
* If the `condition` is NULL, then it is treated as FALSE.
* A loop can contain multiple statements. You can use, but are not required to use, a [BEGIN … END](begin.md)
  [block](../../developer-guide/snowflake-scripting/blocks.md) to contain those statements.
* Pair the keyword `DO` with `END WHILE`, and pair the keyword `LOOP` with `END LOOP`.
  For example:

  ```sqlexample
  WHILE (...) DO
      ...
  END WHILE;

  WHILE (...) LOOP
      ...
  END LOOP;
  ```

## Examples

This example uses a loop to calculate a power of 2. The `counter` variable is the loop counter. The
`power_of_2` variable stores the most recent power of 2 that was calculated. (This is an inefficient
solution, but it demonstrates looping.)

```sqlexample
CREATE PROCEDURE power_of_2()
RETURNS NUMBER(8, 0)
LANGUAGE SQL
AS
$$
DECLARE
  counter NUMBER(8, 0);      -- Loop counter.
  power_of_2 NUMBER(8, 0);   -- Stores the most recent power of 2 that we calculated.
BEGIN
  counter := 1;
  power_of_2 := 1;
  WHILE (counter <= 8) DO
    power_of_2 := power_of_2 * 2;
    counter := counter + 1;
  END WHILE;
  RETURN power_of_2;
END;
$$
;
```

Call the stored procedure:

```sqlexample
CALL power_of_2();
```

```output
+------------+
| POWER_OF_2 |
|------------|
|        256 |
+------------+
```

This example uses a loop and the [DATEADD](../functions/dateadd.md) function to add a day to a date
until the condition is met.

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET mydate := '2024-05-08';
  WHILE (mydate < '2024-05-20') DO
    mydate := DATEADD(day, 1, mydate);
  END WHILE;
  RETURN mydate;
END;
$$
;
```

```output
+-------------------------+
| anonymous block         |
|-------------------------|
| 2024-05-20 00:00:00.000 |
+-------------------------+
```

For more examples, see [WHILE loop](../../developer-guide/snowflake-scripting/loops.md).

---
title: Window function syntax and usage
source: https://docs.snowflake.com/en/sql-reference/functions-window-syntax.md
section: SQL General Reference
---

# Window function syntax and usage

Snowflake supports a large number of analytic SQL functions known as *window functions*. The details for each function are documented on individual
reference pages. The purpose of this section is to provide general reference information that applies to some or all window functions, including
detailed syntax for the main components of the OVER clause:

* PARTITION BY clause
* ORDER BY clause
* Window frame syntax

Users who are not familiar with window functions might want to read the conceptual material in [Analyzing data with window functions](../user-guide/functions-window-using.md).

## Syntax

```sqlsyntax
<function> ( [ <arguments> ] ) OVER ( [ <windowDefinition> ] )
```

Where:

```sqlsyntax
windowDefinition ::=

[ PARTITION BY <expr1> [, ...] ]
[ ORDER BY <expr2> [ { ASC | DESC } ] [ NULLS { FIRST | LAST } ] [, ...] ]
[ <windowFrameClause> ]
```

Where:

```sqlsyntax
windowFrameClause ::=

{
    { ROWS | RANGE } UNBOUNDED PRECEDING
  | { ROWS | RANGE } <n> PRECEDING
  | { ROWS | RANGE } CURRENT ROW
  | { ROWS | RANGE } BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
  | { ROWS | RANGE } BETWEEN CURRENT ROW AND UNBOUNDED FOLLOWING
  | { ROWS | RANGE } BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING
  | { ROWS | RANGE } BETWEEN <n> { PRECEDING | FOLLOWING } AND <n> { PRECEDING | FOLLOWING }
  | { ROWS | RANGE } BETWEEN UNBOUNDED PRECEDING AND <n> { PRECEDING | FOLLOWING }
  | { ROWS | RANGE } BETWEEN <n> { PRECEDING | FOLLOWING } AND UNBOUNDED FOLLOWING
}
```

## Parameters

`OVER( [ windowDefinition ] )`
:   Specifies that the function is being used as a window function and specifies the window over which
    the function operates.
    The OVER clause must contain parentheses, but they may be empty, depending on the requirements of the
    function in question. An empty OVER clause has no partitions and an implied default window frame.

`PARTITION BY expr1`
:   Groups rows into partitions, by product, city, or year, for example. Input rows are grouped by partitions, then the function is
    computed over each partition. The PARTITION BY clause is optional; you can analyze a set of rows as a single partition.

`ORDER BY expr2`
:   Orders rows within each partition, or within the entire set of rows if no partition is specified.
    This ORDER BY clause is distinct from the ORDER BY clause that controls the order of all the rows that are
    returned in the final result of a query. Although the ORDER BY clause is optional for some window functions, it is required for others.
    For example, ranking window functions such as RANK and NTILE require their input data to be in a meaningful order.

    The ORDER BY clause for a window function follows rules similar to those for the main ORDER BY clause in a query,
    with respect to ASC/DESC (ascending/descending) order and NULL handling. For details, see [ORDER BY](constructs/order-by.md).

    > **Note:**
    >
    > The ORDER BY clause for window functions does not support the use of an ordinal position, such as `OVER (PARTITION BY 1 ORDER BY 2)`.
    > In this context, `2` is interpreted as the constant `2`; it does not refer to the second column in the query.

    Different functions handle the ORDER BY clause in different ways:

    * Some window functions require an ORDER BY clause.
    * Some window functions prohibit an ORDER BY clause.
    * Some window functions use an ORDER BY clause if one is present, but do not require it.
    * Some window functions apply an implicit window frame to the ORDER BY clause. (For more information, see
      Usage notes for window frames.)

`{ ROWS | RANGE }`
:   Specifies the type or mode of window frame, which defines either a physical number of rows (ROWS) or a logically computed set of rows (RANGE).
    See [Range-based versus row-based window frames](../user-guide/functions-window-using.md).

    Both types of frame specify starting and ending points, using either implicit named boundaries or explicit offset values.
    A named boundary is defined with the keywords CURRENT ROW, UNBOUNDED PRECEDING, and UNBOUNDED FOLLOWING. Explicit offsets are
    defined with numbers or intervals (`n PRECEDING` or `n FOLLOWING`).

`{ RANGE BETWEEN n PRECEDING | n FOLLOWING }`
:   Specifies a range-based window frame with explicit offsets.

    RANGE BETWEEN window frames with explicit offsets must have only one ORDER BY expression.
    The following data types are supported for that expression:

    * DATE, TIMESTAMP, TIMESTAMP_LTZ , TIMESTAMP_NTZ (DATETIME) , TIMESTAMP_TZ
    * NUMBER, including INT, FLOAT, and so on

    TIME and other Snowflake data types are not supported when this type of window frame is used. For other window frames, other data types,
    such as VARCHAR, can be used in the ORDER BY expression.

    For RANGE BETWEEN window frames, *n* must be an unsigned constant (a positive numeric value, including 0) or a positive INTERVAL constant:

    * If `expr2` is a numeric data type, `n` must be an unsigned constant.
    * If `expr2` is a TIMESTAMP data type, `n` must be an [INTERVAL constant](data-types-datetime.md).
      For example: `INTERVAL '12 hours'` or `INTERVAL '3 days'`.
    * If `expr2` is a DATE data type, `n` can be an unsigned constant or an INTERVAL constant, but the start and end of the frame must use the same data type for the `n` value.

    When the ORDER BY expression is ascending (ASC), the syntax `n FOLLOWING` means “rows with values greater than (or later than) *x*,” and
    `n PRECEDING` means “rows with values less than (or earlier than) *x*,” where *x* is the ORDER BY value for the current row. When the ORDER BY expression is descending (DESC), the opposite is true. (The offsets `0 PRECEDING` and `0 FOLLOWING` are equivalent to CURRENT ROW.)

### RANGE BETWEEN limitations

The following subset of window functions support the RANGE BETWEEN syntax with explicit offsets:

> * [COUNT](functions/count.md), [SUM](functions/sum.md), [MIN](functions/min.md),
>   [MAX](functions/max.md), [AVG](functions/avg.md)
> * [STDDEV, STDDEV_SAMP](functions/stddev.md), [STDDEV_POP](functions/stddev_pop.md) (and aliases)
> * [VARIANCE , VARIANCE_SAMP](functions/variance.md), [VARIANCE_POP](functions/variance_pop.md) (and aliases)
> * [COUNT_IF](functions/count_if.md)
> * [FIRST_VALUE](functions/first_value.md), [LAST_VALUE](functions/last_value.md)
> * [ARRAY_AGG](functions/array_agg.md)

In addition, note that:

* DISTINCT versions of these functions do not support this syntax.
* The following limitations apply when the COUNT window function is used with this syntax.

  + Only one input argument is supported.
  + `COUNT(table.*)` wildcard queries are not supported. For example, you cannot specify:

    ```sqlsyntax
    COUNT(t1.*) OVER(ORDER BY col1 RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING)
    ```
* You cannot specify a frame that results in a logical reversal of the frame start and end positions. For example, the following frames return
  errors because the ending row of the frame precedes the starting row:

  ```sqlsyntax
  ORDER BY col1 ASC RANGE BETWEEN 2 PRECEDING AND 4 PRECEDING
  ORDER BY col1 ASC RANGE BETWEEN 2 FOLLOWING AND 2 PRECEDING
  ```

### RANGE BETWEEN behavior when the ORDER BY expression contains NULL values

Note the following behavior when a RANGE BETWEEN window frame is used and the ORDER BY column contains NULL values:

* When the ORDER BY clause specifies NULLS FIRST, rows with NULL in the ORDER BY column are included in UNBOUNDED PRECEDING frames.
* When the ORDER BY clause specifies NULLS LAST, rows with NULL in the ORDER BY column are included in UNBOUNDED FOLLOWING frames.
* Rows with NULL in the ORDER BY column are included in an explicit-offset frame boundary only when the ORDER BY value of the current row is NULL.

See RANGE BETWEEN examples with NULL values in the ORDER BY clause.

### Correlated columns in window functions

Using correlated columns inside window functions (such as in the PARTITION BY or ORDER BY clauses) is not supported.
This limitation applies when window functions are used in [LATERAL](constructs/join-lateral.md)
joins or correlated subqueries, where the window function attempts to reference columns from outer query blocks.

The following example demonstrates this limitation. The query uses [FLATTEN](functions/flatten.md) in a lateral join
with the [ROW_NUMBER](functions/row_number.md) window function to process JSON data. When the `completed_on` column (which is
explicitly correlated with the lateral join) is used in the ORDER BY clause of the function, the query returns an error:

```sqlexample
WITH data AS (
  SELECT
    PARSE_JSON('[
                  {"completed_on": "2024-06-03T20:17:08.621001019Z"},
                  {"completed_on": "2024-06-03T18:26:08.691858742Z"},
                  {"completed_on": "2024-06-03T14:43:40.215726239Z"}
                 ]'
               ) d
  )
SELECT
    fields.*
  FROM data,
    LATERAL FLATTEN(d) AS f,
    LATERAL (SELECT
             f.value:completed_on AS completed_on,
             ROW_NUMBER() OVER(ORDER BY f.value:completed_on DESC) rn
            )fields;
```

```output
SQL compilation error: Window function [ROW_NUMBER() OVER (ORDER BY GET(F.VALUE, 'completed_on') DESC NULLS FIRST)] contains a correlation.
```

To solve this problem, don’t use correlated columns inside window functions. Move the window function to the outer
query where the correlation no longer exists. For example:

```sqlexample
WITH data AS (
  SELECT
    PARSE_JSON('[
                  {"completed_on": "2024-06-03T20:17:08.621001019Z"},
                  {"completed_on": "2024-06-03T18:26:08.691858742Z"},
                  {"completed_on": "2024-06-03T14:43:40.215726239Z"}
                 ]') d
  )
SELECT
    fields.*,
    ROW_NUMBER() OVER(ORDER BY completed_on DESC) rn
  FROM
    data,
    LATERAL FLATTEN(d) AS f,
    LATERAL (SELECT
             f.value:completed_on AS completed_on
            )fields;
```

This query executes successfully and returns:

```output
+--------------------------------------+----+
| COMPLETED_ON                         | RN |
|--------------------------------------+----|
| "2024-06-03T20:17:08.621001019Z"     |  1 |
| "2024-06-03T18:26:08.691858742Z"     |  2 |
| "2024-06-03T14:43:40.215726239Z"     |  3 |
+--------------------------------------+----+
```

When you move the ROW_NUMBER() function to the outer query, the `completed_on` column is no longer correlated,
and the window function can process it correctly.

## Usage notes for window frames

* All window functions support window frames. However, support for window frame syntax
  varies by function. If no window frame is specified, the default depends on the function:

  + For non-ranking functions (such as [COUNT](functions/count.md), [MAX](functions/max.md),
    [MIN](functions/min.md), and [SUM](functions/sum.md)), the
    default is the following window frame (in accordance with the ANSI standard):

    ```sqlsyntax
    RANGE BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
    ```
  + For ranking functions (such as [FIRST_VALUE](functions/first_value.md), [LAST_VALUE](functions/last_value.md),
    [NTH_VALUE](functions/nth_value.md)), the default is the entire window:

    ```sqlsyntax
    ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING
    ```

    Note that this behavior *does not comply* with the ANSI standard.

    > **Note:**
    >
    > For clarity, Snowflake recommends avoiding implicit window frames. If your query uses a window frame, define an explicit window frame.
* Window frames require the data in the window to be in a known order. Therefore, the ORDER BY clause inside the OVER
  clause is *required* for window frame syntax, even though that ORDER BY clause is generally optional.

## Examples

This section contains examples that show how to use window functions in different ways. For additional examples, see
[Analyzing data with window functions](../user-guide/functions-window-using.md) and the pages for individual functions.

### Introductory example

Suppose that you own a chain of stores. The following query shows the percentage of
the chain’s total profit that is generated by each store. The query uses the RATIO_TO_REPORT function, which takes a
value (`net_profit`) from the current row and divides it by the sum of the corresponding values from all the other rows:

Create and load the table:

```sqlexample
CREATE TRANSIENT TABLE store_sales (
    branch_ID    INTEGER,
    city        VARCHAR,
    gross_sales NUMERIC(9, 2),
    gross_costs NUMERIC(9, 2),
    net_profit  NUMERIC(9, 2)
    );

INSERT INTO store_sales (branch_ID, city, gross_sales, gross_costs)
    VALUES
    (1, 'Vancouver', 110000, 100000),
    (2, 'Vancouver', 140000, 125000),
    (3, 'Montreal', 150000, 140000),
    (4, 'Montreal', 155000, 146000);

UPDATE store_sales SET net_profit = gross_sales - gross_costs;
```

Query the table:

```sqlexample
SELECT branch_ID,
       net_profit,
       100 * RATIO_TO_REPORT(net_profit) OVER () AS percent_of_chain_profit
    FROM store_sales AS s1
    ORDER BY branch_ID;
+-----------+------------+-------------------------+
| BRANCH_ID | NET_PROFIT | PERCENT_OF_CHAIN_PROFIT |
|-----------+------------+-------------------------|
|         1 |   10000.00 |             22.72727300 |
|         2 |   15000.00 |             34.09090900 |
|         3 |   10000.00 |             22.72727300 |
|         4 |    9000.00 |             20.45454500 |
+-----------+------------+-------------------------+
```

### Window frame with an unbounded starting position

Create and populate a table with values:

```sqlexample
CREATE OR REPLACE TABLE example_cumulative (p INT, o INT, i INT);

INSERT INTO example_cumulative VALUES
    (  0, 1, 10), (0, 2, 20), (0, 3, 30),
    (100, 1, 10),(100, 2, 30),(100, 2, 5),(100, 3, 11),(100, 3, 120),
    (200, 1, 10000),(200, 1, 200),(200, 1, 808080),(200, 2, 33333),(200, 3, null), (200, 3, 4),
    (300, 1, null), (300, 1, null);
```

Run a query that uses a window frame with an unbounded starting position and show the output.
Return cumulative COUNT, SUM, AVG, MIN, and MAX values for each row in each partition:

```sqlexample
SELECT
    p, o, i,
    COUNT(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) count_i_Rows_Pre,
    SUM(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) sum_i_Rows_Pre,
    AVG(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) avg_i_Rows_Pre,
    MIN(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) min_i_Rows_Pre,
    MAX(i)   OVER (PARTITION BY p ORDER BY o ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW) max_i_Rows_Pre
  FROM example_cumulative
  ORDER BY p,o;
+-----+---+--------+------------------+----------------+----------------+----------------+----------------+
|   P | O |      I | COUNT_I_ROWS_PRE | SUM_I_ROWS_PRE | AVG_I_ROWS_PRE | MIN_I_ROWS_PRE | MAX_I_ROWS_PRE |
|-----+---+--------+------------------+----------------+----------------+----------------+----------------|
|   0 | 1 |     10 |                1 |             10 |         10.000 |             10 |             10 |
|   0 | 2 |     20 |                2 |             30 |         15.000 |             10 |             20 |
|   0 | 3 |     30 |                3 |             60 |         20.000 |             10 |             30 |
| 100 | 1 |     10 |                1 |             10 |         10.000 |             10 |             10 |
| 100 | 2 |     30 |                2 |             40 |         20.000 |             10 |             30 |
| 100 | 2 |      5 |                3 |             45 |         15.000 |              5 |             30 |
| 100 | 3 |     11 |                4 |             56 |         14.000 |              5 |             30 |
| 100 | 3 |    120 |                5 |            176 |         35.200 |              5 |            120 |
| 200 | 1 |  10000 |                1 |          10000 |      10000.000 |          10000 |          10000 |
| 200 | 1 |    200 |                2 |          10200 |       5100.000 |            200 |          10000 |
| 200 | 1 | 808080 |                3 |         818280 |     272760.000 |            200 |         808080 |
| 200 | 2 |  33333 |                4 |         851613 |     212903.250 |            200 |         808080 |
| 200 | 3 |   NULL |                4 |         851613 |     212903.250 |            200 |         808080 |
| 200 | 3 |      4 |                5 |         851617 |     170323.400 |              4 |         808080 |
| 300 | 1 |   NULL |                0 |           NULL |           NULL |           NULL |           NULL |
| 300 | 1 |   NULL |                0 |           NULL |           NULL |           NULL |           NULL |
+-----+---+--------+------------------+----------------+----------------+----------------+----------------+
```

Return the same results as the above query by using the default window frame (that is,
`ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW`):

```sqlexample
SELECT
    p, o, i,
    COUNT(i) OVER (PARTITION BY p ORDER BY o) count_i_Range_Pre,
    SUM(i)   OVER (PARTITION BY p ORDER BY o) sum_i_Range_Pre,
    AVG(i)   OVER (PARTITION BY p ORDER BY o) avg_i_Range_Pre,
    MIN(i)   OVER (PARTITION BY p ORDER BY o) min_i_Range_Pre,
    MAX(i)   OVER (PARTITION BY p ORDER BY o) max_i_Range_Pre
  FROM example_cumulative
  ORDER BY p,o;
+-----+---+--------+-------------------+-----------------+-----------------+-----------------+-----------------+
|   P | O |      I | COUNT_I_RANGE_PRE | SUM_I_RANGE_PRE | AVG_I_RANGE_PRE | MIN_I_RANGE_PRE | MAX_I_RANGE_PRE |
|-----+---+--------+-------------------+-----------------+-----------------+-----------------+-----------------|
|   0 | 1 |     10 |                 1 |              10 |       10.000000 |              10 |              10 |
|   0 | 2 |     20 |                 2 |              30 |       15.000000 |              10 |              20 |
|   0 | 3 |     30 |                 3 |              60 |       20.000000 |              10 |              30 |
| 100 | 1 |     10 |                 1 |              10 |       10.000000 |              10 |              10 |
| 100 | 2 |     30 |                 3 |              45 |       15.000000 |               5 |              30 |
| 100 | 2 |      5 |                 3 |              45 |       15.000000 |               5 |              30 |
| 100 | 3 |     11 |                 5 |             176 |       35.200000 |               5 |             120 |
| 100 | 3 |    120 |                 5 |             176 |       35.200000 |               5 |             120 |
| 200 | 1 |  10000 |                 3 |          818280 |   272760.000000 |             200 |          808080 |
| 200 | 1 |    200 |                 3 |          818280 |   272760.000000 |             200 |          808080 |
| 200 | 1 | 808080 |                 3 |          818280 |   272760.000000 |             200 |          808080 |
| 200 | 2 |  33333 |                 4 |          851613 |   212903.250000 |             200 |          808080 |
| 200 | 3 |   NULL |                 5 |          851617 |   170323.400000 |               4 |          808080 |
| 200 | 3 |      4 |                 5 |          851617 |   170323.400000 |               4 |          808080 |
| 300 | 1 |   NULL |                 0 |            NULL |            NULL |            NULL |            NULL |
| 300 | 1 |   NULL |                 0 |            NULL |            NULL |            NULL |            NULL |
+-----+---+--------+-------------------+-----------------+-----------------+-----------------+-----------------+
```

### Window frames with explicit offsets

Create and populate a table with values:

```sqlexample
CREATE TABLE example_sliding
  (p INT, o INT, i INT, r INT, s VARCHAR(100));

INSERT INTO example_sliding VALUES
  (100,1,1,70,'seventy'),(100,2,2,30, 'thirty'),(100,3,3,40,'forty'),(100,4,NULL,90,'ninety'),
  (100,5,5,50,'fifty'),(100,6,6,30,'thirty'),
  (200,7,7,40,'forty'),(200,8,NULL,NULL,'n_u_l_l'),(200,9,NULL,NULL,'n_u_l_l'),(200,10,10,20,'twenty'),
  (200,11,NULL,90,'ninety'),
  (300,12,12,30,'thirty'),
  (400,13,NULL,20,'twenty');
```

Return MIN function results for two columns (numeric and string) over sliding windows before, after, and including the current row:

```sqlexample
select p, o, i AS i_col,
    MIN(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 3 PRECEDING AND 1 PRECEDING) min_i_3P_1P,
    MIN(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 FOLLOWING AND 3 FOLLOWING) min_i_1F_3F,
    MIN(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 PRECEDING AND 3 FOLLOWING) min_i_1P_3F,
    s,
    MIN(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 3 PRECEDING AND 1 PRECEDING) min_s_3P_1P,
    MIN(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 FOLLOWING AND 3 FOLLOWING) min_s_1F_3F,
    MIN(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 PRECEDING AND 3 FOLLOWING) min_s_1P_3F
  FROM example_sliding
  ORDER BY p, o;
+-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------+
|   P |  O | I_COL | MIN_I_3P_1P | MIN_I_1F_3F | MIN_I_1P_3F | S       | MIN_S_3P_1P | MIN_S_1F_3F | MIN_S_1P_3F |
|-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------|
| 100 |  1 |     1 |        NULL |           2 |           1 | seventy | NULL        | forty       | forty       |
| 100 |  2 |     2 |           1 |           3 |           1 | thirty  | seventy     | fifty       | fifty       |
| 100 |  3 |     3 |           1 |           5 |           2 | forty   | seventy     | fifty       | fifty       |
| 100 |  4 |  NULL |           1 |           5 |           3 | ninety  | forty       | fifty       | fifty       |
| 100 |  5 |     5 |           2 |           6 |           5 | fifty   | forty       | thirty      | fifty       |
| 100 |  6 |     6 |           3 |        NULL |           5 | thirty  | fifty       | NULL        | fifty       |
| 200 |  7 |     7 |        NULL |          10 |           7 | forty   | NULL        | n_u_l_l     | forty       |
| 200 |  8 |  NULL |           7 |          10 |           7 | n_u_l_l | forty       | n_u_l_l     | forty       |
| 200 |  9 |  NULL |           7 |          10 |          10 | n_u_l_l | forty       | ninety      | n_u_l_l     |
| 200 | 10 |    10 |           7 |        NULL |          10 | twenty  | forty       | ninety      | n_u_l_l     |
| 200 | 11 |  NULL |          10 |        NULL |          10 | ninety  | n_u_l_l     | NULL        | ninety      |
| 300 | 12 |    12 |        NULL |        NULL |          12 | thirty  | NULL        | NULL        | thirty      |
| 400 | 13 |  NULL |        NULL |        NULL |        NULL | twenty  | NULL        | NULL        | twenty      |
+-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------+
```

Return MAX function results for two columns (numeric and string) over sliding windows before, after, and including the current row:

```sqlexample
SELECT p, o, i AS i_col,
    MAX(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 3 PRECEDING AND 1 PRECEDING) max_i_3P_1P,
    MAX(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 FOLLOWING AND 3 FOLLOWING) max_i_1F_3F,
    MAX(i) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 PRECEDING AND 3 FOLLOWING) max_i_1P_3F,
    s,
    MAX(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 3 PRECEDING AND 1 PRECEDING) max_s_3P_1P,
    MAX(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 FOLLOWING AND 3 FOLLOWING) max_s_1F_3F,
    MAX(s) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 1 PRECEDING AND 3 FOLLOWING) max_s_1P_3F
  FROM example_sliding
  ORDER BY p, o;
+-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------+
|   P |  O | I_COL | MAX_I_3P_1P | MAX_I_1F_3F | MAX_I_1P_3F | S       | MAX_S_3P_1P | MAX_S_1F_3F | MAX_S_1P_3F |
|-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------|
| 100 |  1 |     1 |        NULL |           3 |           3 | seventy | NULL        | thirty      | thirty      |
| 100 |  2 |     2 |           1 |           5 |           5 | thirty  | seventy     | ninety      | thirty      |
| 100 |  3 |     3 |           2 |           6 |           6 | forty   | thirty      | thirty      | thirty      |
| 100 |  4 |  NULL |           3 |           6 |           6 | ninety  | thirty      | thirty      | thirty      |
| 100 |  5 |     5 |           3 |           6 |           6 | fifty   | thirty      | thirty      | thirty      |
| 100 |  6 |     6 |           5 |        NULL |           6 | thirty  | ninety      | NULL        | thirty      |
| 200 |  7 |     7 |        NULL |          10 |          10 | forty   | NULL        | twenty      | twenty      |
| 200 |  8 |  NULL |           7 |          10 |          10 | n_u_l_l | forty       | twenty      | twenty      |
| 200 |  9 |  NULL |           7 |          10 |          10 | n_u_l_l | n_u_l_l     | twenty      | twenty      |
| 200 | 10 |    10 |           7 |        NULL |          10 | twenty  | n_u_l_l     | ninety      | twenty      |
| 200 | 11 |  NULL |          10 |        NULL |          10 | ninety  | twenty      | NULL        | twenty      |
| 300 | 12 |    12 |        NULL |        NULL |          12 | thirty  | NULL        | NULL        | thirty      |
| 400 | 13 |  NULL |        NULL |        NULL |        NULL | twenty  | NULL        | NULL        | twenty      |
+-----+----+-------+-------------+-------------+-------------+---------+-------------+-------------+-------------+
```

Return the sum of a number column across sliding windows before, after, and encompassing the current row:

```sqlexample
SELECT p, o, r AS r_col,
    SUM(r) OVER (PARTITION BY p ORDER BY o ROWS BETWEEN 4 PRECEDING AND 2 PRECEDING) sum_r_4P_2P,
    sum(r) over (partition by p ORDER BY o ROWS BETWEEN 2 FOLLOWING AND 4 FOLLOWING) sum_r_2F_4F,
    sum(r) over (partition by p ORDER BY o ROWS BETWEEN 2 PRECEDING AND 4 FOLLOWING) sum_r_2P_4F
  FROM example_sliding
  ORDER BY p, o;
+-----+----+-------+-------------+-------------+-------------+
|   P |  O | R_COL | SUM_R_4P_2P | SUM_R_2F_4F | SUM_R_2P_4F |
|-----+----+-------+-------------+-------------+-------------|
| 100 |  1 |    70 |        NULL |         180 |         280 |
| 100 |  2 |    30 |        NULL |         170 |         310 |
| 100 |  3 |    40 |          70 |          80 |         310 |
| 100 |  4 |    90 |         100 |          30 |         240 |
| 100 |  5 |    50 |         140 |        NULL |         210 |
| 100 |  6 |    30 |         160 |        NULL |         170 |
| 200 |  7 |    40 |        NULL |         110 |         150 |
| 200 |  8 |  NULL |        NULL |         110 |         150 |
| 200 |  9 |  NULL |          40 |          90 |         150 |
| 200 | 10 |    20 |          40 |        NULL |         110 |
| 200 | 11 |    90 |          40 |        NULL |         110 |
| 300 | 12 |    30 |        NULL |        NULL |          30 |
| 400 | 13 |    20 |        NULL |        NULL |          20 |
+-----+----+-------+-------------+-------------+-------------+
```

### Ranking function examples

The following example shows how to rank sales based on the total amount (in dollars) that each salesperson has sold. The ORDER BY clause within the
OVER clause sorts the totals in descending order (highest to lowest). The query calculates the rank of each salesperson relative to all other salespeople.

Create the table and insert the data:

```sqlexample
CREATE TABLE sales_table (salesperson_name VARCHAR, sales_in_dollars INTEGER);
INSERT INTO sales_table (salesperson_name, sales_in_dollars) VALUES
    ('Smith', 600),
    ('Jones', 1000),
    ('Torkelson', 700),
    ('Dolenz', 800);
```

Now query the data:

```sqlexample
SELECT
    salesperson_name,
    sales_in_dollars,
    RANK() OVER (ORDER BY sales_in_dollars DESC) AS sales_rank
  FROM sales_table;
+------------------+------------------+------------+
| SALESPERSON_NAME | SALES_IN_DOLLARS | SALES_RANK |
|------------------+------------------+------------|
| Jones            |             1000 |          1 |
| Dolenz           |              800 |          2 |
| Torkelson        |              700 |          3 |
| Smith            |              600 |          4 |
+------------------+------------------+------------+
```

The output is not necessarily ordered by rank. To display results ordered by rank, specify an ORDER BY clause for the query itself (in addition
to the ORDER BY clause for the window function), as shown here:

```sqlexample
SELECT
    salesperson_name,
    sales_in_dollars,
    RANK() OVER (ORDER BY sales_in_dollars DESC) AS sales_rank
  FROM sales_table
  ORDER BY 3;
+------------------+------------------+------------+
| SALESPERSON_NAME | SALES_IN_DOLLARS | SALES_RANK |
|------------------+------------------+------------|
| Jones            |             1000 |          1 |
| Dolenz           |              800 |          2 |
| Torkelson        |              700 |          3 |
| Smith            |              600 |          4 |
+------------------+------------------+------------+
```

The preceding example has *two* ORDER BY clauses:

* One controls the order of the ranking.
* One controls the order of the output.

These clauses are independent. For example, you could order the rankings based on total sales (as shown above), but
order the output rows based on the salesperson’s last name:

```sqlexample
SELECT
    salesperson_name,
    sales_in_dollars,
    RANK() OVER (ORDER BY sales_in_dollars DESC) AS sales_rank
  FROM sales_table
  ORDER BY salesperson_name;
+------------------+------------------+------------+
| SALESPERSON_NAME | SALES_IN_DOLLARS | SALES_RANK |
|------------------+------------------+------------|
| Dolenz           |              800 |          2 |
| Jones            |             1000 |          1 |
| Smith            |              600 |          4 |
| Torkelson        |              700 |          3 |
+------------------+------------------+------------+
```

### RANGE BETWEEN example with explicit numeric offsets

The following example uses RANGE BETWEEN syntax with explicit numeric offsets.
To run this example, follow these instructions: [Create and load the menu_items table](functions/stddev.md).
For similar examples that use INTERVAL offsets, see [Using windowed aggregations for rolling calculations](../user-guide/querying-time-series-data.md).

The following query computes the average cost of goods sold for categories of menu items available from a food truck.
The window function does not partition the results; therefore, the averages are computed across the complete result set,
subject to a range-based frame.

The boundary of the frame is the cost of goods value in the current row, plus two (the first row = 0.50 + 2.00, for example).
Rows qualify for the frame when they fall within this two-dollar range.

```sqlexample
SELECT menu_category, menu_cogs_usd,
    AVG(menu_cogs_usd)
      OVER(ORDER BY menu_cogs_usd RANGE BETWEEN CURRENT ROW AND 2 FOLLOWING) avg_cogs
  FROM menu_items
  WHERE menu_category IN('Beverage','Dessert','Snack')
  GROUP BY menu_category, menu_cogs_usd
  ORDER BY menu_category, menu_cogs_usd;
```

```output
+---------------+---------------+----------+
| MENU_CATEGORY | MENU_COGS_USD | AVG_COGS |
|---------------+---------------+----------|
| Beverage      |          0.50 |  1.18333 |
| Beverage      |          0.65 |  1.37857 |
| Beverage      |          0.75 |  1.50000 |
| Dessert       |          0.50 |  1.18333 |
| Dessert       |          1.00 |  1.87500 |
| Dessert       |          1.25 |  2.05000 |
| Dessert       |          2.50 |  3.16666 |
| Dessert       |          3.00 |  3.50000 |
| Snack         |          1.25 |  2.05000 |
| Snack         |          2.25 |  2.93750 |
| Snack         |          4.00 |  4.00000 |
+---------------+---------------+----------+
```

For example, the `avg_cogs` value for the first row is 1.1833. This is computed as the sum of all the `menu_cogs_usd` values that fall
between 0.50 and 2.50, divided by the count of those rows:

`(0.50 + 0.65 + 0.75 + 0.50 + 1.00 + 1.25 + 2.50 + 1.25 + 2.25) / 9 = 1.18333`

The second to last row has an avg_cogs value of 2.93750. This is computed as the sum of all the `menu_cogs_usd` values that fall between 2.25 and 4.25,
divided by the count of those rows:

`(2.25 + 2.50 + 3.00 + 4.00) / 4 = 2.93750`

The last row returns 4.0 for both the `avg_cogs` and `menu_cogs_usd`. This result is accurate because only this row belongs to the frame; 4.0 is the
maximum `menu_cogs_usd` value in the entire result, so it becomes a single-row frame. It has no “following” rows.

Note that this query has an ORDER BY clause for the window function and an ORDER BY clause for the final results of the query. The final ORDER BY output
does not influence the calculation of the window function results. The ordered result set for computing the function is an intermediate result set that the
final query does not display.

#### RANGE BETWEEN examples with NULL values in the ORDER BY clause

The `nulls` table contains five rows, and two have NULL values in the `c1` column. Create and
load the table as follows:

```sqlexample
CREATE OR REPLACE TABLE nulls(c1 int, c2 int);

INSERT INTO nulls VALUES
  (1,10),
  (2,20),
  (3,30),
  (NULL,20),
  (NULL,50);
```

When NULLS LAST is specified, and the window frame uses explicit offsets, rows with NULL in `c1`
are included in the frame only when the ORDER BY value of the current row is NULL.
The following query returns a sum of `50` when row `3` is the current row.
The following NULL row is not included in the frame.

```sqlexample
SELECT c1 c1_nulls_last, c2,
    SUM(c2) OVER(ORDER BY c1 NULLS LAST RANGE BETWEEN 1 PRECEDING AND 1 FOLLOWING) sum_c2
  FROM nulls;
```

```output
+---------------+----+--------+
| C1_NULLS_LAST | C2 | SUM_C2 |
|---------------+----+--------|
|             1 | 10 |     30 |
|             2 | 20 |     60 |
|             3 | 30 |     50 |
|          NULL | 20 |     70 |
|          NULL | 50 |     70 |
+---------------+----+--------+
```

When NULLS LAST is specified, and the window frame uses UNBOUNDED FOLLOWING, rows with NULL in `c1`
are included in the frame. The following query returns a sum of `120` when row `3` is the current row.
Both following NULL rows are included in the frame.

```sqlexample
SELECT c1 c1_nulls_last, c2,
    SUM(c2) OVER(ORDER BY c1 NULLS LAST RANGE BETWEEN 1 PRECEDING AND UNBOUNDED FOLLOWING) sum_c2
  FROM nulls;
```

```output
+---------------+----+--------+
| C1_NULLS_LAST | C2 | SUM_C2 |
|---------------+----+--------|
|             1 | 10 |    130 |
|             2 | 20 |    130 |
|             3 | 30 |    120 |
|          NULL | 20 |     70 |
|          NULL | 50 |     70 |
+---------------+----+--------+
```

### Create and load the heavy_weather table

To create and insert rows into the `heavy_weather` table, which is used in some window function
examples, run this script.

```sqlexample
CREATE OR REPLACE TABLE heavy_weather
  (start_time TIMESTAMP, precip NUMBER(3,2), city VARCHAR(20), county VARCHAR(20));

INSERT INTO heavy_weather VALUES
('2021-12-23 06:56:00.000',0.08,'Mount Shasta','Siskiyou'),
('2021-12-23 07:51:00.000',0.09,'Mount Shasta','Siskiyou'),
('2021-12-23 16:23:00.000',0.56,'South Lake Tahoe','El Dorado'),
('2021-12-23 17:24:00.000',0.38,'South Lake Tahoe','El Dorado'),
('2021-12-23 18:30:00.000',0.28,'South Lake Tahoe','El Dorado'),
('2021-12-23 19:35:00.000',0.37,'Mammoth Lakes','Mono'),
('2021-12-23 19:36:00.000',0.80,'South Lake Tahoe','El Dorado'),
('2021-12-24 04:43:00.000',0.25,'Alta','Placer'),
('2021-12-24 05:26:00.000',0.34,'Alta','Placer'),
('2021-12-24 05:35:00.000',0.42,'Big Bear City','San Bernardino'),
('2021-12-24 06:49:00.000',0.17,'South Lake Tahoe','El Dorado'),
('2021-12-24 07:40:00.000',0.07,'Alta','Placer'),
('2021-12-24 08:36:00.000',0.07,'Alta','Placer'),
('2021-12-24 11:52:00.000',0.08,'Alta','Placer'),
('2021-12-24 12:52:00.000',0.38,'Alta','Placer'),
('2021-12-24 15:44:00.000',0.13,'Alta','Placer'),
('2021-12-24 15:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
('2021-12-24 16:55:00.000',0.09,'Big Bear City','San Bernardino'),
('2021-12-24 21:53:00.000',0.07,'Montague','Siskiyou'),
('2021-12-25 02:52:00.000',0.07,'Alta','Placer'),
('2021-12-25 07:52:00.000',0.07,'Alta','Placer'),
('2021-12-25 08:52:00.000',0.08,'Alta','Placer'),
('2021-12-25 09:48:00.000',0.18,'Alta','Placer'),
('2021-12-25 12:52:00.000',0.10,'Alta','Placer'),
('2021-12-25 17:21:00.000',0.23,'Alturas','Modoc'),
('2021-12-25 17:52:00.000',1.54,'Alta','Placer'),
('2021-12-26 01:52:00.000',0.61,'Alta','Placer'),
('2021-12-26 05:43:00.000',0.16,'South Lake Tahoe','El Dorado'),
('2021-12-26 05:56:00.000',0.08,'Bishop','Inyo'),
('2021-12-26 06:52:00.000',0.75,'Bishop','Inyo'),
('2021-12-26 06:53:00.000',0.08,'Lebec','Los Angeles'),
('2021-12-26 07:52:00.000',0.65,'Alta','Placer'),
('2021-12-26 09:52:00.000',2.78,'Alta','Placer'),
('2021-12-26 09:55:00.000',0.07,'Big Bear City','San Bernardino'),
('2021-12-26 14:22:00.000',0.32,'Alta','Placer'),
('2021-12-26 14:52:00.000',0.34,'Alta','Placer'),
('2021-12-26 15:43:00.000',0.35,'Alta','Placer'),
('2021-12-26 17:31:00.000',5.24,'Alta','Placer'),
('2021-12-26 22:52:00.000',0.07,'Alta','Placer'),
('2021-12-26 23:15:00.000',0.52,'Alta','Placer'),
('2021-12-27 02:52:00.000',0.08,'Alta','Placer'),
('2021-12-27 03:52:00.000',0.14,'Alta','Placer'),
('2021-12-27 04:52:00.000',1.52,'Alta','Placer'),
('2021-12-27 14:37:00.000',0.89,'Alta','Placer'),
('2021-12-27 14:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
('2021-12-27 17:53:00.000',0.07,'South Lake Tahoe','El Dorado'),
('2021-12-30 11:23:00.000',0.12,'Lebec','Los Angeles'),
('2021-12-30 11:43:00.000',0.98,'Lebec','Los Angeles'),
('2021-12-30 13:53:00.000',0.23,'Lebec','Los Angeles'),
('2021-12-30 14:53:00.000',0.13,'Lebec','Los Angeles'),
('2021-12-30 15:15:00.000',0.29,'Lebec','Los Angeles'),
('2021-12-30 17:53:00.000',0.10,'Lebec','Los Angeles'),
('2021-12-30 18:53:00.000',0.09,'Lebec','Los Angeles'),
('2021-12-30 19:53:00.000',0.07,'Lebec','Los Angeles'),
('2021-12-30 20:53:00.000',0.07,'Lebec','Los Angeles')
;
```

---
title: Window functions
source: https://docs.snowflake.com/en/sql-reference/functions-window.md
section: SQL General Reference
---

# Window functions

Window functions are analytic functions that you can use for various calculations such as running totals,
moving averages, and rankings.

For general syntax rules, see [Window function syntax and usage](functions-window-syntax.md). For syntax specific to
individual functions, go to the links in the following table.

| Sub-category | Notes |
| --- | --- |
| **General window** |  |
| [ANY_VALUE](functions/any_value.md) |  |
| [AVG](functions/avg.md) |  |
| [CONDITIONAL_CHANGE_EVENT](functions/conditional_change_event.md) |  |
| [CONDITIONAL_TRUE_EVENT](functions/conditional_true_event.md) |  |
| [CORR](functions/corr.md) |  |
| [COUNT](functions/count.md) |  |
| [COUNT_IF](functions/count_if.md) |  |
| [COVAR_POP](functions/covar_pop.md) |  |
| [COVAR_SAMP](functions/covar_samp.md) |  |
| [INTERPOLATE_BFILL, INTERPOLATE_FFILL, INTERPOLATE_LINEAR](functions/interpolate_bfill.md) |  |
| [LISTAGG](functions/listagg.md) | Uses WITHIN GROUP syntax. |
| [MAX](functions/max.md) |  |
| [MEDIAN](functions/median.md) |  |
| [MIN](functions/min.md) |  |
| [MODE](functions/mode.md) |  |
| [PERCENTILE_CONT](functions/percentile_cont.md) | Uses WITHIN GROUP syntax. |
| [PERCENTILE_DISC](functions/percentile_disc.md) | Uses WITHIN GROUP syntax. |
| [RATIO_TO_REPORT](functions/ratio_to_report.md) |  |
| [STDDEV, STDDEV_SAMP](functions/stddev.md) | STDDEV and STDDEV_SAMP are aliases. |
| [STDDEV_POP](functions/stddev_pop.md) |  |
| [SUM](functions/sum.md) |  |
| [VAR_POP](functions/var_pop.md) |  |
| [VAR_SAMP](functions/var_samp.md) |  |
| [VARIANCE_POP](functions/variance_pop.md) | Alias for [VAR_POP](functions/var_pop.md). |
| [VARIANCE , VARIANCE_SAMP](functions/variance.md) | Alias for [VAR_SAMP](functions/var_samp.md). |
| **Ranking** |  |
| [CUME_DIST](functions/cume_dist.md) |  |
| [DENSE_RANK](functions/dense_rank.md) |  |
| [FIRST_VALUE](functions/first_value.md) |  |
| [LAG](functions/lag.md) |  |
| [LAST_VALUE](functions/last_value.md) |  |
| [LEAD](functions/lead.md) |  |
| [NTH_VALUE](functions/nth_value.md) |  |
| [NTILE](functions/ntile.md) |  |
| [PERCENT_RANK](functions/percent_rank.md) | Supports only RANGE BETWEEN window frames without explicit offsets. |
| [RANK](functions/rank.md) |  |
| [ROW_NUMBER](functions/row_number.md) |  |
| **Bitwise aggregation** |  |
| [BITAND_AGG](functions/bitand_agg.md) |  |
| [BITOR_AGG](functions/bitor_agg.md) |  |
| [BITXOR_AGG](functions/bitxor_agg.md) |  |
| **Boolean aggregation** |  |
| [BOOLAND_AGG](functions/booland_agg.md) |  |
| [BOOLOR_AGG](functions/boolor_agg.md) |  |
| [BOOLXOR_AGG](functions/boolxor_agg.md) |  |
| **Hash** |  |
| [HASH_AGG](functions/hash_agg.md) |  |
| **Semi-structured data aggregation** |  |
| [ARRAY_AGG](functions/array_agg.md) |  |
| [OBJECT_AGG](functions/object_agg.md) |  |
| **Counting distinct values** |  |
| [ARRAY_UNION_AGG](functions/array_union_agg.md) |  |
| [ARRAY_UNIQUE_AGG](functions/array_unique_agg.md) |  |
| **Linear regression** |  |
| [REGR_AVGX](functions/regr_avgx.md) |  |
| [REGR_AVGY](functions/regr_avgy.md) |  |
| [REGR_COUNT](functions/regr_count.md) |  |
| [REGR_INTERCEPT](functions/regr_intercept.md) |  |
| [REGR_R2](functions/regr_r2.md) |  |
| [REGR_SLOPE](functions/regr_slope.md) |  |
| [REGR_SXX](functions/regr_sxx.md) |  |
| [REGR_SXY](functions/regr_sxy.md) |  |
| [REGR_SYY](functions/regr_syy.md) |  |
| **Statistics and probability** |  |
| [KURTOSIS](functions/kurtosis.md) |  |
| **Cardinality estimation** . (**using** [HyperLogLog](../user-guide/querying-approximate-cardinality.md)) |  |
| [APPROX_COUNT_DISTINCT](functions/approx_count_distinct.md) | Alias for [HLL](functions/hll.md). |
| [HLL](functions/hll.md) |  |
| [HLL_ACCUMULATE](functions/hll_accumulate.md) |  |
| [HLL_COMBINE](functions/hll_combine.md) |  |
| [HLL_ESTIMATE](functions/hll_estimate.md) | Not an aggregate function; uses scalar input from [HLL_ACCUMULATE](functions/hll_accumulate.md) or [HLL_COMBINE](functions/hll_combine.md). |
| [HLL_EXPORT](functions/hll_export.md) |  |
| [HLL_IMPORT](functions/hll_import.md) |  |
| **Similarity estimation** . (**using** [MinHash](../user-guide/querying-approximate-similarity.md)) |  |
| [APPROXIMATE_JACCARD_INDEX](functions/approximate_jaccard_index.md) | Alias for [APPROXIMATE_SIMILARITY](functions/approximate_similarity.md). |
| [APPROXIMATE_SIMILARITY](functions/approximate_similarity.md) |  |
| [MINHASH](functions/minhash.md) |  |
| [MINHASH_COMBINE](functions/minhash_combine.md) |  |
| **Frequency estimation** . (**using** [Space-Saving](../user-guide/querying-approximate-frequent-values.md)) |  |
| [APPROX_TOP_K](functions/approx_top_k.md) |  |
| [APPROX_TOP_K_ACCUMULATE](functions/approx_top_k_accumulate.md) |  |
| [APPROX_TOP_K_COMBINE](functions/approx_top_k_combine.md) |  |
| [APPROX_TOP_K_ESTIMATE](functions/approx_top_k_estimate.md) | Not an aggregate function; uses scalar input from [APPROX_TOP_K_ACCUMULATE](functions/approx_top_k_accumulate.md) or [APPROX_TOP_K_COMBINE](functions/approx_top_k_combine.md). |
| **Percentile estimation** . (**using** [t-Digest](../user-guide/querying-approximate-percentile-values.md)) |  |
| [APPROX_PERCENTILE](functions/approx_percentile.md) |  |
| [APPROX_PERCENTILE_ACCUMULATE](functions/approx_percentile_accumulate.md) |  |
| [APPROX_PERCENTILE_COMBINE](functions/approx_percentile_combine.md) |  |
| [APPROX_PERCENTILE_ESTIMATE](functions/approx_percentile_estimate.md) | Not an aggregate function; uses scalar input from [APPROX_PERCENTILE_ACCUMULATE](functions/approx_percentile_accumulate.md) or [APPROX_PERCENTILE_COMBINE](functions/approx_percentile_combine.md). |

---
title: WITH
source: https://docs.snowflake.com/en/sql-reference/constructs/with.md
section: SQL General Reference
---

Categories:
:   [Query syntax](../constructs.md)

# WITH

The WITH clause is an optional clause that precedes the body of the [SELECT](../sql/select.md) statement, and defines one
or more [CTEs (common table expressions)](../../user-guide/queries-cte.md) that can be used later in the statement. For example,
CTEs can be referenced in the [FROM](from.md) clause.

> **Note:**
>
> You can use a WITH clause when creating and calling an anonymous procedure similar to a stored procedure. That clause modifies
> a CALL command rather than a SELECT command. For more information, see [CALL (with anonymous procedure)](../sql/call-with.md).

The WITH clause is used with machine learning model objects to create an alias to a specific version of the model,
which can then be used to call the methods of that version. See [Calling model methods](../commands-model-function.md).

See also:
:   [CONNECT BY](connect-by.md), [Model commands](../commands-model.md)

## Syntax

Subquery:

```sqlsyntax
[ WITH
       <cte_name1> [ ( <cte_column_list> ) ] AS ( SELECT ...  )
   [ , <cte_name2> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
   [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
]
SELECT ...
```

Recursive CTE:

```sqlsyntax
[ WITH [ RECURSIVE ]
       <cte_name1> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause )
   [ , <cte_name2> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause ) ]
   [ , <cte_nameN> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause ) ]
]
SELECT ...
```

Where:

> ```sqlsyntax
> anchorClause ::=
>     SELECT <anchor_column_list> FROM ...
>
> recursiveClause ::=
>     SELECT <recursive_column_list> FROM ... [ JOIN ... ]
> ```

## Parameters

`cte_name1` , `cte_nameN`
:   The CTE name must follow the rules for views and similar [object identifiers](../identifiers.md).

`cte_column_list`
:   The names of the columns in the CTE (common table expression).

`anchor_column_list`
:   The columns used in the anchor clause for the recursive CTE. The columns in this list must
    correspond to the columns defined in `cte_column_list`.

`recursive_column_list`
:   The columns used in the recursive clause for the recursive CTE. The columns in this list must
    correspond to the columns defined in `cte_column_list`.

For more details, see Anchor Clause and Recursive Clause (in this topic). For a detailed
explanation of how the anchor clause and recursive clause work together, see
[Working with CTEs (Common Table Expressions)](../../user-guide/queries-cte.md).

## Usage notes

### General usage

* A WITH clause can refer recursively to itself, and to other CTEs that appear earlier in the same clause. For instance,
  `cte_name2` can refer to `cte_name1` and itself, while `cte_name1` can refer to itself, but not to
  `cte_name2`.
* You can mix recursive and non-recursive (iterative and non-iterative) CTE clauses in the WITH clause. The CTE clauses should
  be ordered such that, if a CTE needs to reference another CTE, the CTE to be referenced should be defined earlier in the
  statement (e.g. the second CTE can refer to the first CTE, but not vice versa).

  The CTEs do not need to be listed in order based on whether they are recursive or not. For example, a non-recursive CTE can
  be listed immediately after the keyword `RECURSIVE`, and a recursive CTE can come after that non-recursive CTE.

  Within a recursive CTE, either the anchor clause or the recursive clause (or both) can refer to another CTE(s).
* For recursive CTEs, the `cte_column_list` is required.
* For non-recursive CTEs, the `cte_column_list` is optional.
* Make sure to use `UNION ALL`, not `UNION`, in a recursive CTE.
* The keyword `RECURSIVE` is optional.

  > + CTEs can be recursive whether or not `RECURSIVE` was specified.
  > + You can use the keyword `RECURSIVE` even if no CTEs are recursive.
  > + If `RECURSIVE` is used, it must be used only once, even if more than one CTE is recursive.
  >
  > Although SQL statements work properly with or without the keyword `RECURSIVE`, using the keyword properly makes the
  > code easier to understand and maintain. Snowflake recommends using the keyword `RECURSIVE` if one or more CTEs are
  > recursive, and Snowflake strongly recommends omitting the keyword if none of the CTEs are recursive.

> **Attention:**
>
> When using a recursive CTE, it is possible to create a query that goes into an infinite loop and consumes credits until the
> query succeeds, the query times out (e.g. exceeds the number of seconds specified by the
> [STATEMENT_TIMEOUT_IN_SECONDS](../parameters.md) parameter), or you [cancel the query](../../user-guide/querying-cancel-statements.md).
>
> For information on how infinite loops can occur and for guidelines on how to avoid this problem, see
> [Troubleshooting a Recursive CTE](../../user-guide/queries-cte.md).
>
> For example, to limit the number of iterations to less than 10:
>
> ```sqlexample
> WITH cte AS (
>   SELECT ..., 1 as level ...
>
>   UNION ALL
>
>   SELECT ..., cte.level + 1 as level
>    FROM cte ...
>    WHERE ... level < 10
> ) ...
> ```

### Limitations

* The Snowflake implementation of recursive CTEs does not support the following keywords that some other systems support:

  + `SEARCH DEPTH FIRST BY ...`
  + `CYCLE ... SET ...`

### Anchor clause

The anchor clause in a recursive CTE is a [SELECT](../sql/select.md) statement.

The anchor clause is executed once during the execution of the statement in which it is embedded; it runs before the
recursive clause and generates the first set of rows from the recursive CTE. These rows are not only included in the output
of the query, but also referenced by the recursive clause.

The anchor clause can contain any SQL construct allowed in a SELECT clause. However, the anchor clause cannot reference
`cte_name1`; only the recursive clause can reference `cte_name1`.

Although the anchor clause usually selects from the same table as the recursive clause, this is not required. The anchor
clause can select from any table-like data source, including another table, a view, a UDTF, or a constant value.

The anchor clause selects a single “level” of the hierarchy, typically the top level, or the highest level of interest. For
example, if the query is intended to show the “parts explosion” of a car, the anchor clause returns the highest level component,
which is the car itself.

The output from the anchor clause represents one layer of the hierarchy, and this layer is stored as the content of the “view”
that is accessed in the first iteration of the recursive clause.

### Recursive clause

The recursive clause is a [SELECT](../sql/select.md) statement. This SELECT is restricted to projections, filters, and
joins (inner joins and outer joins in which the recursive reference is on the preserved side of the outer join). The recursive
clause cannot contain:

* Aggregate or window functions,
* `GROUP BY`, `ORDER BY`, `LIMIT`, or `DISTINCT`.

The recursive clause can (and usually does) reference the `cte_name1` as though the CTE were a table or view.

The recursive clause usually includes a JOIN that joins the table that was used in the anchor clause to the CTE. However, the
JOIN can join more than one table or table-like data source (view, etc.).

The first iteration of the recursive clause starts with the data from the anchor clause. That data is then joined to the other
table(s) in the FROM clause of the recursive clause.

Each subsequent iteration starts with the data from the previous iteration.

You can think of the CTE clause or “view” as holding the contents from the previous iteration, so that those contents are available
to be joined. Note that during any one iteration, the CTE contains only the contents from the previous iteration, not the results accumulated
from all previous iterations. The accumulated results (including from the anchor clause) are
stored in a separate place.

### Column lists in a recursive CTE

There are three column lists in a recursive CTE:

* `cte_column_list`
* `anchor_column_list` (in the anchor clause)
* `recursive_column_list` (in the recursive clause)

A recursive CTE can contain other column lists (e.g. in a subquery), but these three column lists must be present.

These three column lists must all correspond to each other.

In pseudo-code, this looks similar to:

```sqlsyntax
WITH RECURSIVE cte_name (X, Y) AS
(
  SELECT related_to_X, related_to_Y FROM table1
  UNION ALL
  SELECT also_related_to_X, also_related_to_Y
    FROM table1 JOIN cte_name ON <join_condition>
)
SELECT ... FROM ...
```

Columns `X` and `related_to_X` must correspond; the anchor clause generates the initial “contents” of the “view” that the
CTE represents, so each column from the anchor clause (e.g. column `related_to_x`) must generate output that will belong in
the corresponding column of the CTE (e.g. column `X`).

Columns `also_related_to_X` and `X` must correspond; on each iteration of the recursive clause, the output of that clause
becomes the new content of the CTE/view for the next iteration.

Also, columns `related_to_X` and `also_related_to_X` must correspond because they are each on one side of the `UNION ALL`
operator, and the columns on each side of a `UNION ALL` operator must correspond.

## Examples

### Non-recursive examples

This section provides sample queries and sample output. To keep the examples short, the code omits the statements to create
and load the tables.

This first example uses a simple WITH clause as a view to extract a subset of data, in this case the music albums that were
released in 1976. For this small database, the query output is the albums “Amigos” and “Look Into The Future”, both from the
year 1976:

> ```sqlexample
> with
>   albums_1976 as (select * from music_albums where album_year = 1976)
> select album_name from albums_1976 order by album_name;
> +----------------------+
> | ALBUM_NAME           |
> |----------------------|
> | Amigos               |
> | Look Into The Future |
> +----------------------+
> ```

This next example uses a WITH clause with an earlier WITH clause; the CTE named `journey_album_info_1976` uses the CTE named
`album_info_1976`. The output is the album “Look Into The Future”, with the name of the band:

> ```sqlexample
> with
>    album_info_1976 as (select m.album_ID, m.album_name, b.band_name
>       from music_albums as m inner join music_bands as b
>       where m.band_id = b.band_id and album_year = 1976),
>    Journey_album_info_1976 as (select *
>       from album_info_1976
>       where band_name = 'Journey')
> select album_name, band_name
>    from Journey_album_info_1976;
> +----------------------+-----------+
> | ALBUM_NAME           | BAND_NAME |
> |----------------------+-----------|
> | Look Into The Future | Journey   |
> +----------------------+-----------+
> ```

This example lists musicians who played on Santana albums and Journey albums. This example does not use the WITH clause.
For this query (and the next few queries, all of which are equivalent ways of running the same query), the output is the IDs and
names of musicians who played on Santana albums and Journey albums:

> ```sqlexample
> select distinct musicians.musician_id, musician_name
>  from musicians inner join musicians_and_albums inner join music_albums inner join music_bands
>  where musicians.musician_ID = musicians_and_albums.musician_ID
>    and musicians_and_albums.album_ID = music_albums.album_ID
>    and music_albums.band_ID = music_bands.band_ID
>    and music_bands.band_name = 'Santana'
> intersect
> select distinct musicians.musician_id, musician_name
>  from musicians inner join musicians_and_albums inner join music_albums inner join music_bands
>  where musicians.musician_ID = musicians_and_albums.musician_ID
>    and musicians_and_albums.album_ID = music_albums.album_ID
>    and music_albums.band_ID = music_bands.band_ID
>    and music_bands.band_name = 'Journey'
> order by musician_ID;
> +-------------+---------------+
> | MUSICIAN_ID | MUSICIAN_NAME |
> |-------------+---------------|
> |         305 | Gregg Rolie   |
> |         306 | Neal Schon    |
> +-------------+---------------+
> ```

As you can see, the previous query contains duplicate code. The next few examples show how to simplify this query by using
one or more explicit views, and then how to simplify it by using CTEs.

This query shows how to use views to reduce the duplication and complexity of the previous example (as in the previous example,
this does not use a WITH clause):

> ```sqlexample
> create or replace view view_musicians_in_bands AS
>   select distinct musicians.musician_id, musician_name, band_name
>     from musicians inner join musicians_and_albums inner join music_albums inner join music_bands
>     where musicians.musician_ID = musicians_and_albums.musician_ID
>       and musicians_and_albums.album_ID = music_albums.album_ID
>       and music_albums.band_ID = music_bands.band_ID;
> ```
>
> With this view, you can re-write the original query as:
>
> ```sqlexample
> select musician_id, musician_name from view_musicians_in_bands where band_name = 'Santana'
> intersect
> select musician_id, musician_name from view_musicians_in_bands where band_name = 'Journey'
> order by musician_ID;
> +-------------+---------------+
> | MUSICIAN_ID | MUSICIAN_NAME |
> |-------------+---------------|
> |         305 | Gregg Rolie   |
> |         306 | Neal Schon    |
> +-------------+---------------+
> ```

This example uses a WITH clause to do the equivalent of what the preceding query did:

> ```sqlexample
> with
>   musicians_in_bands as (
>      select distinct musicians.musician_id, musician_name, band_name
>       from musicians inner join musicians_and_albums inner join music_albums inner join music_bands
>       where musicians.musician_ID = musicians_and_albums.musician_ID
>         and musicians_and_albums.album_ID = music_albums.album_ID
>         and music_albums.band_ID = music_bands.band_ID)
> select musician_ID, musician_name from musicians_in_bands where band_name = 'Santana'
> intersect
> select musician_ID, musician_name from musicians_in_bands where band_name = 'Journey'
> order by musician_ID
>   ;
> +-------------+---------------+
> | MUSICIAN_ID | MUSICIAN_NAME |
> |-------------+---------------|
> |         305 | Gregg Rolie   |
> |         306 | Neal Schon    |
> +-------------+---------------+
> ```

These statements create more granular views (this example does not use a WITH clause):

> List the albums by a particular band:
>
> ```sqlexample
> create or replace view view_album_IDs_by_bands AS
>  select album_ID, music_bands.band_id, band_name
>   from music_albums inner join music_bands
>   where music_albums.band_id = music_bands.band_ID;
> ```
>
> List the musicians who played on albums:
>
> ```sqlexample
> create or replace view view_musicians_in_bands AS
>  select distinct musicians.musician_id, musician_name, band_name
>   from musicians inner join musicians_and_albums inner join view_album_IDs_by_bands
>   where musicians.musician_ID = musicians_and_albums.musician_ID
>     and musicians_and_albums.album_ID = view_album_IDS_by_bands.album_ID;
> ```
>
> Now use those views to query musicians who played on both Santana and Journey albums:
>
> ```sqlexample
> select musician_id, musician_name from view_musicians_in_bands where band_name = 'Santana'
> intersect
> select musician_id, musician_name from view_musicians_in_bands where band_name = 'Journey'
> order by musician_ID;
> +-------------+---------------+
> | MUSICIAN_ID | MUSICIAN_NAME |
> |-------------+---------------|
> |         305 | Gregg Rolie   |
> |         306 | Neal Schon    |
> +-------------+---------------+
> ```

These statements create more granular implicit views (this example uses a WITH clause):

> ```sqlexample
> with
>   album_IDs_by_bands as (select album_ID, music_bands.band_id, band_name
>                           from music_albums inner join music_bands
>                           where music_albums.band_id = music_bands.band_ID),
>   musicians_in_bands as (select distinct musicians.musician_id, musician_name, band_name
>                           from musicians inner join musicians_and_albums inner join album_IDs_by_bands
>                           where musicians.musician_ID = musicians_and_albums.musician_ID
>                             and musicians_and_albums.album_ID = album_IDS_by_bands.album_ID)
> select musician_ID, musician_name from musicians_in_bands where band_name = 'Santana'
> intersect
> select musician_ID, musician_name from musicians_in_bands where band_name = 'Journey'
> order by musician_ID
>   ;
> +-------------+---------------+
> | MUSICIAN_ID | MUSICIAN_NAME |
> |-------------+---------------|
> |         305 | Gregg Rolie   |
> |         306 | Neal Schon    |
> +-------------+---------------+
> ```

### Recursive examples

This is a basic example of using a recursive CTE to generate a Fibonacci series:

> ```sqlexample
> WITH RECURSIVE current_f (current_val, previous_val) AS
>     (
>     SELECT 0, 1
>     UNION ALL
>     SELECT current_val + previous_val, current_val FROM current_f
>       WHERE current_val + previous_val < 100
>     )
>   SELECT current_val FROM current_f ORDER BY current_val;
> +-------------+
> | CURRENT_VAL |
> |-------------|
> |           0 |
> |           1 |
> |           1 |
> |           2 |
> |           3 |
> |           5 |
> |           8 |
> |          13 |
> |          21 |
> |          34 |
> |          55 |
> |          89 |
> +-------------+
> ```

This example is a query with a recursive CTE that shows a “parts explosion” for an automobile:

> ```sqlexample
> -- The components of a car.
> CREATE TABLE components (
>     description VARCHAR,
>     component_ID INTEGER,
>     quantity INTEGER,
>     parent_component_ID INTEGER
>     );
>
> INSERT INTO components (description, quantity, component_ID, parent_component_ID) VALUES
>     ('car', 1, 1, 0),
>        ('wheel', 4, 11, 1),
>           ('tire', 1, 111, 11),
>           ('#112 bolt', 5, 112, 11),
>           ('brake', 1, 113, 11),
>              ('brake pad', 1, 1131, 113),
>        ('engine', 1, 12, 1),
>           ('piston', 4, 121, 12),
>           ('cylinder block', 1, 122, 12),
>           ('#112 bolt', 16, 112, 12)   -- Can use same type of bolt in multiple places
>     ;
> ```
>
> ```sqlexample
> WITH RECURSIVE current_layer (indent, layer_ID, parent_component_ID, component_id, description, sort_key) AS (
>   SELECT
>       '...',
>       1,
>       parent_component_ID,
>       component_id,
>       description,
>       '0001'
>     FROM components WHERE component_id = 1
>   UNION ALL
>   SELECT indent || '...',
>       layer_ID + 1,
>       components.parent_component_ID,
>       components.component_id,
>       components.description,
>       sort_key || SUBSTRING('000' || components.component_ID, -4)
>     FROM current_layer JOIN components
>       ON (components.parent_component_id = current_layer.component_id)
>   )
> SELECT
>   -- The indentation gives us a sort of "side-ways tree" view, with
>   -- sub-components indented under their respective components.
>   indent || description AS description,
>   component_id,
>   parent_component_ID
>   -- The layer_ID and sort_key are useful for debugging, but not
>   -- needed in the report.
> --  , layer_ID, sort_key
>   FROM current_layer
>   ORDER BY sort_key;
> +-------------------------+--------------+---------------------+
> | DESCRIPTION             | COMPONENT_ID | PARENT_COMPONENT_ID |
> |-------------------------+--------------+---------------------|
> | ...car                  |            1 |                   0 |
> | ......wheel             |           11 |                   1 |
> | .........tire           |          111 |                  11 |
> | .........#112 bolt      |          112 |                  11 |
> | .........brake          |          113 |                  11 |
> | ............brake pad   |         1131 |                 113 |
> | ......engine            |           12 |                   1 |
> | .........#112 bolt      |          112 |                  12 |
> | .........piston         |          121 |                  12 |
> | .........cylinder block |          122 |                  12 |
> +-------------------------+--------------+---------------------+
> ```

For more examples, see [Working with CTEs (Common Table Expressions)](../../user-guide/queries-cte.md).

---
title: Working with date and time values
source: https://docs.snowflake.com/en/sql-reference/date-time-examples.md
section: SQL General Reference
---

# Working with date and time values

Date and time calculations are among the most widely used and most critical computations in analytics and data mining. This topic provides practical examples of common date and time queries
and calculations.

## Loading dates and timestamps

This section provides examples for loading date and timestamp values, and describes considerations related to
time zones when loading these values.

### Loading timestamps with no time zone attached

In the following example, the [TIMESTAMP_TYPE_MAPPING](parameters.md) parameter is set to `TIMESTAMP_LTZ` (local time zone).
The [TIMEZONE](parameters.md) parameter is set to `America/Chicago` time. If some incoming timestamps don’t have a specified time zone,
then Snowflake loads those strings assuming the timestamps represent local time in the time zone set for the TIMEZONE parameter.

```sqlexample
ALTER SESSION SET TIMESTAMP_TYPE_MAPPING = 'TIMESTAMP_LTZ';
ALTER SESSION SET TIMEZONE = 'America/Chicago';

CREATE OR REPLACE TABLE time (ltz TIMESTAMP);
INSERT INTO time VALUES ('2024-05-01 00:00:00.000');

SELECT * FROM time;
```

```output
+-------------------------------+
| LTZ                           |
|-------------------------------|
| 2024-05-01 00:00:00.000 -0500 |
+-------------------------------+
```

### Loading timestamps with a time zone attached

In the following example, the [TIMESTAMP_TYPE_MAPPING](parameters.md) parameter is set to `TIMESTAMP_LTZ` (local time zone).
The [TIMEZONE](parameters.md) parameter is set to `America/Chicago` time. If some incoming timestamps have a different
time zone specified, Snowflake loads the string in `America/Chicago` time.

```sqlexample
ALTER SESSION SET TIMESTAMP_TYPE_MAPPING = 'TIMESTAMP_LTZ';
ALTER SESSION SET TIMEZONE = 'America/Chicago';

CREATE OR REPLACE TABLE time (ltz TIMESTAMP);
INSERT INTO time VALUES ('2024-04-30 19:00:00.000 -0800');

SELECT * FROM time;
```

```output
+-------------------------------+
| LTZ                           |
|-------------------------------|
| 2024-04-30 22:00:00.000 -0500 |
+-------------------------------+
```

### Converting timestamps to alternative time zones

In the following example, a set of timestamp values is stored with no time zone data. The timestamps are loaded in UTC time and converted to other time zones:

```sqlexample
ALTER SESSION SET TIMEZONE = 'UTC';
ALTER SESSION SET TIMESTAMP_LTZ_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS TZH:TZM';

CREATE OR REPLACE TABLE utctime (ntz TIMESTAMP_NTZ);
INSERT INTO utctime VALUES ('2024-05-01 00:00:00.000');
```

```sqlexample
SELECT * FROM utctime;
```

```output
+-------------------------+
| NTZ                     |
|-------------------------|
| 2024-05-01 00:00:00.000 |
+-------------------------+
```

```sqlexample
SELECT CONVERT_TIMEZONE('UTC','America/Chicago', ntz)::TIMESTAMP_LTZ AS ChicagoTime
  FROM utctime;
```

```output
+---------------------------+
| CHICAGOTIME               |
|---------------------------|
| 2024-04-30 19:00:00 +0000 |
+---------------------------+
```

```sqlexample
SELECT CONVERT_TIMEZONE('UTC','America/Los_Angeles', ntz)::TIMESTAMP_LTZ AS LATime
  FROM utctime;
```

```output
+---------------------------+
| LATIME                    |
|---------------------------|
| 2024-04-30 17:00:00 +0000 |
+---------------------------+
```

## Inserting valid date strings into date columns in a table

This example inserts values into a DATE column.

```sqlexample
CREATE OR REPLACE TABLE my_table(id INTEGER, date1 DATE);
INSERT INTO my_table(id, date1) VALUES (1, TO_DATE('2024.07.23', 'YYYY.MM.DD'));
INSERT INTO my_table(id) VALUES (2);
```

```sqlexample
SELECT id, date1
  FROM my_table
  ORDER BY id;
```

```output
+----+------------+
| ID | DATE1      |
|----+------------|
|  1 | 2024-07-23 |
|  2 | NULL       |
+----+------------+
```

The TO_DATE function accepts TIMESTAMP values and strings in TIMESTAMP format, but discards the time information
(hours, minutes, and so on).

```sqlexample
INSERT INTO my_table(id, date1) VALUES
  (3, TO_DATE('2024.02.20 11:15:00', 'YYYY.MM.DD HH:MI:SS')),
  (4, TO_TIMESTAMP('2024.02.24 04:00:00', 'YYYY.MM.DD HH:MI:SS'));
```

```sqlexample
SELECT id, date1
  FROM my_table
  WHERE id >= 3;
```

```output
+----+------------+
| ID | DATE1      |
|----+------------|
|  3 | 2024-02-20 |
|  4 | 2024-02-24 |
+----+------------+
```

If you insert a DATE that was defined with only a time, then the default date is January 1, 1970.

```sqlexample
INSERT INTO my_table(id, date1) VALUES
  (5, TO_DATE('11:20:30', 'hh:mi:ss'));
```

```sqlexample
SELECT id, date1
  FROM my_table
  WHERE id = 5;
```

```output
+----+------------+
| ID | DATE1      |
|----+------------|
|  5 | 1970-01-01 |
+----+------------+
```

When you retrieve DATE values, you can format them as TIMESTAMP values:

```sqlexample
SELECT id,
       TO_VARCHAR(date1, 'dd-mon-yyyy hh:mi:ss') AS date1
  FROM my_table
  ORDER BY id;
```

```output
+----+----------------------+
| ID | DATE1                |
|----+----------------------|
|  1 | 23-Jul-2024 00:00:00 |
|  2 | NULL                 |
|  3 | 20-Feb-2024 00:00:00 |
|  4 | 24-Feb-2024 00:00:00 |
|  5 | 01-Jan-1970 00:00:00 |
+----+----------------------+
```

## Retrieving the current date and time

Get the current date as a DATE value:

```sqlexample
SELECT CURRENT_DATE();
```

Get the current date and time as a TIMESTAMP value:

```sqlexample
SELECT CURRENT_TIMESTAMP();
```

## Retrieving dates and days of the week

Get the current day of the week as a number using the [EXTRACT](functions/extract.md) function:

```sqlexample
SELECT EXTRACT('dayofweek', CURRENT_DATE());
```

> **Note:**
>
> * The `dayofweek_iso` part follows the ISO-8601 data elements and interchange formats standard.
>   The function returns the day of the week as an integer value in the range 1-7, where 1 represents Monday.
> * For compatibility with some other systems, the `dayofweek` part follows the UNIX standard.
>   The function returns the day of the week as an integer value in the range 0-6, where 0 represents Sunday.

You can get the current day of the week as a string using the [TO_VARCHAR](functions/to_char.md)
or [DECODE](functions/decode.md) function.

Run a query that returns the short English name (for example, `Sun`, `Mon`, and so on) for the current date:

```sqlexample
SELECT TO_VARCHAR(CURRENT_DATE(), 'dy');
```

Run a query that returns the explicitly-provided weekday names for the current date:

```sqlexample
SELECT DECODE(EXTRACT('dayofweek_iso', CURRENT_DATE()),
  1, 'Monday',
  2, 'Tuesday',
  3, 'Wednesday',
  4, 'Thursday',
  5, 'Friday',
  6, 'Saturday',
  7, 'Sunday') AS weekday_name;
```

## Retrieving date and time parts

You can get various date and time parts for the current date and time using the [DATE_PART](functions/date_part.md) function.

Query for the current day of the month:

```sqlexample
SELECT DATE_PART(day, CURRENT_TIMESTAMP());
```

Query for the current year:

```sqlexample
SELECT DATE_PART(year, CURRENT_TIMESTAMP());
```

Query for the current month:

```sqlexample
SELECT DATE_PART(month, CURRENT_TIMESTAMP());
```

Query for the current hour:

```sqlexample
SELECT DATE_PART(hour, CURRENT_TIMESTAMP());
```

Query for the current minute:

```sqlexample
SELECT DATE_PART(minute, CURRENT_TIMESTAMP());
```

Query for the current second:

```sqlexample
SELECT DATE_PART(second, CURRENT_TIMESTAMP());
```

You can also use the [EXTRACT](functions/extract.md) function to get various date and time parts for the current date and time.

Query for the current day of the month:

```sqlexample
SELECT EXTRACT('day', CURRENT_TIMESTAMP());
```

Query for the current year:

```sqlexample
SELECT EXTRACT('year', CURRENT_TIMESTAMP());
```

Query for the current month:

```sqlexample
SELECT EXTRACT('month', CURRENT_TIMESTAMP());
```

Query for the current hour:

```sqlexample
SELECT EXTRACT('hour', CURRENT_TIMESTAMP());
```

Query for the current minute:

```sqlexample
SELECT EXTRACT('minute', CURRENT_TIMESTAMP());
```

Query for the current second:

```sqlexample
SELECT EXTRACT('second', CURRENT_TIMESTAMP());
```

This query returns tabular output with various date and time parts for the current date and time:

```sqlexample
SELECT month(CURRENT_TIMESTAMP()) AS month,
       day(CURRENT_TIMESTAMP()) AS day,
       hour(CURRENT_TIMESTAMP()) AS hour,
       minute(CURRENT_TIMESTAMP()) AS minute,
       second(CURRENT_TIMESTAMP()) AS second;
```

```output
+-------+-----+------+--------+--------+
| MONTH | DAY | HOUR | MINUTE | SECOND |
|-------+-----+------+--------+--------|
|     8 |  28 |    7 |     59 |     28 |
+-------+-----+------+--------+--------+
```

## Calculating business calendar dates and times

Get the first day of the month as a DATE value using the [DATE_TRUNC](functions/date_trunc.md) function. For example, get the first day of the current month:

```sqlexample
SELECT DATE_TRUNC('month', CURRENT_DATE());
```

Get the last day of the current month as a DATE value using the [DATEADD](functions/dateadd.md) and DATE_TRUNC functions:

```sqlexample
SELECT DATEADD('day',
               -1,
               DATE_TRUNC('month', DATEADD(day, 31, DATE_TRUNC('month',CURRENT_DATE()))));
```

For an alternative option, the following example uses DATE_TRUNC to retrieve the beginning of the current month, adds one month to retrieve
the beginning of the next month, and then subtracts one day to determine the last day of the current month.

```sqlexample
SELECT DATEADD('day',
               -1,
               DATEADD('month', 1, DATE_TRUNC('month', CURRENT_DATE())));
```

Get the last day of the previous month as a DATE value:

```sqlexample
SELECT DATEADD(day,
               -1,
               DATE_TRUNC('month', CURRENT_DATE()));
```

Get the short English name (for example, `Jan`, `Dec`, and so on) for the current month:

```sqlexample
SELECT TO_VARCHAR(CURRENT_DATE(), 'Mon');
```

Get the current month name using explicitly-provided month names:

```sqlexample
SELECT DECODE(EXTRACT('month', CURRENT_DATE()),
         1, 'January',
         2, 'February',
         3, 'March',
         4, 'April',
         5, 'May',
         6, 'June',
         7, 'July',
         8, 'August',
         9, 'September',
         10, 'October',
         11, 'November',
         12, 'December') AS current_month;
```

Get the date for Monday in the current week:

```sqlexample
SELECT DATEADD('day',
               (EXTRACT('dayofweek_iso', CURRENT_DATE()) * -1) + 1,
               CURRENT_DATE());
```

Get the date for Friday in the current week:

```sqlexample
SELECT DATEADD('day',
               (5 - EXTRACT('dayofweek_iso', CURRENT_DATE())),
               CURRENT_DATE());
```

Get the date for the first Monday in the current month using the [DATE_PART](functions/date_part.md) function:

```sqlexample
SELECT DATEADD(day,
               MOD( 7 + 1 - DATE_PART('dayofweek_iso', DATE_TRUNC('month', CURRENT_DATE())), 7),
               DATE_TRUNC('month', CURRENT_DATE()));
```

> **Note:**
>
> In the above query, the `1` value in `7 + 1` translates to Monday. To retrieve the date for the first Tuesday,
> Wednesday, and so on, substitute `2`, `3`, and so on, respectively, through `7` for `Sunday`.

Get the first day of the current year as a DATE value:

```sqlexample
SELECT DATE_TRUNC('year', CURRENT_DATE());
```

Get the last day of the current year as a DATE value:

```sqlexample
SELECT DATEADD('day',
               -1,
               DATEADD('year', 1, DATE_TRUNC('year', CURRENT_DATE())));
```

Get the last day of the previous year as a DATE value:

```sqlexample
SELECT DATEADD('day',
               -1,
               DATE_TRUNC('year', CURRENT_DATE()));
```

Get the first day of the current quarter as a DATE value:

```sqlexample
SELECT DATE_TRUNC('quarter', CURRENT_DATE());
```

Get the last day of the current quarter as a DATE value:

```sqlexample
SELECT DATEADD('day',
               -1,
               DATEADD('month', 3, DATE_TRUNC('quarter', CURRENT_DATE())));
```

Get the date and timestamp for midnight in the current day:

```sqlexample
SELECT DATE_TRUNC('day', CURRENT_TIMESTAMP());
```

```output
+----------------------------------------+
| DATE_TRUNC('DAY', CURRENT_TIMESTAMP()) |
|----------------------------------------|
| Wed, 07 Sep 2016 00:00:00 -0700        |
+----------------------------------------+
```

## Incrementing date and time values

Use the [DATEADD](functions/dateadd.md) function to increment date and time values.

Add two years to the current date:

```sqlexample
SELECT DATEADD(year, 2, CURRENT_DATE());
```

Add two days to the current date:

```sqlexample
SELECT DATEADD(day, 2, CURRENT_DATE());
```

Add two hours to the current date and time:

```sqlexample
SELECT DATEADD(hour, 2, CURRENT_TIMESTAMP());
```

Add two minutes to the current date and time:

```sqlexample
SELECT DATEADD(minute, 2, CURRENT_TIMESTAMP());
```

Add two seconds to the current date and time:

```sqlexample
SELECT DATEADD(second, 2, CURRENT_TIMESTAMP());
```

## Converting valid character strings to dates, times, or timestamps

In most use cases, Snowflake correctly handles date and timestamp values formatted as strings. In certain cases,
such as string-based comparisons or when a result depends on a different timestamp format from the format set in
the session parameters, we recommend explicitly converting values to the desired format to avoid unexpected results.

For example, without explicit casting, comparing string values produces string-based results:

```sqlexample
CREATE OR REPLACE TABLE timestamps(timestamp1 STRING);

INSERT INTO timestamps VALUES
  ('Fri, 05 Apr 2013 00:00:00 -0700'),
  ('Sat, 06 Apr 2013 00:00:00 -0700'),
  ('Sat, 01 Jan 2000 00:00:00 -0800'),
  ('Wed, 01 Jan 2020 00:00:00 -0800');
```

The following query performs a comparison without explicit casting:

```sqlexample
SELECT * FROM timestamps WHERE timestamp1 < '2014-01-01';
```

```output
+------------+
| TIMESTAMP1 |
|------------|
+------------+
```

The following query performs a comparison with explicit casting to DATE:

```sqlexample
SELECT * FROM timestamps WHERE timestamp1 < '2014-01-01'::DATE;
```

```output
+---------------------------------+
| DATE1                           |
|---------------------------------|
| Fri, 05 Apr 2013 00:00:00 -0700 |
| Sat, 06 Apr 2013 00:00:00 -0700 |
| Sat, 01 Jan 2000 00:00:00 -0800 |
+---------------------------------+
```

For more information about conversion functions, see [Date and time formats in conversion functions](functions-conversion.md).

## Applying date arithmetic to date strings

Add five days to the date expressed in a string:

```sqlexample
SELECT DATEADD('day',
               5,
               TO_TIMESTAMP('12-jan-2024 00:00:00','dd-mon-yyyy hh:mi:ss'))
  AS add_five_days;
```

```output
+-------------------------+
| ADD_FIVE_DAYS           |
|-------------------------|
| 2024-01-17 00:00:00.000 |
+-------------------------+
```

You can calculate the difference in days between the current date and the date expressed in a string using the
[DATEDIFF](functions/datediff.md) function.

Calculate the difference in days using the [TO_TIMESTAMP](functions/to_timestamp.md) function:

```sqlexample
SELECT DATEDIFF('day',
                TO_TIMESTAMP ('12-jan-2024 00:00:00','dd-mon-yyyy hh:mi:ss'),
                CURRENT_DATE())
  AS to_timestamp_difference;
```

```output
+-------------------------+
| TO_TIMESTAMP_DIFFERENCE |
|-------------------------|
|                     229 |
+-------------------------+
```

Calculate the difference in days using the [TO_DATE](functions/to_date.md) function:

```sqlexample
SELECT DATEDIFF('day',
                TO_DATE ('12-jan-2024 00:00:00','dd-mon-yyyy hh:mi:ss'),
                CURRENT_DATE())
  AS to_date_difference;
```

```output
+--------------------+
| TO_DATE_DIFFERENCE |
|--------------------|
|                229 |
+--------------------+
```

Add one day to a specified date:

```sqlexample
SELECT TO_DATE('2024-01-15') + 1 AS date_plus_one;
```

```output
+---------------+
| DATE_PLUS_ONE |
|---------------|
| 2024-01-16    |
+---------------+
```

Subtract nine days from the current date (for example, Aug 28, 2024):

```sqlexample
SELECT CURRENT_DATE() - 9 AS date_minus_nine;
```

```output
+-----------------+
| DATE_MINUS_NINE |
|-----------------|
| 2024-08-19      |
+-----------------+
```

## Calculating differences between dates or times

Calculate the difference between the current date and the date in three years:

```sqlexample
SELECT DATEDIFF(year, CURRENT_DATE(),
       DATEADD(year, 3, CURRENT_DATE()));
```

Calculate the difference between the current date and the date in three months:

```sqlexample
SELECT DATEDIFF(month, CURRENT_DATE(),
       DATEADD(month, 3, CURRENT_DATE()));
```

Calculate the difference between the current date and the date in three days:

```sqlexample
SELECT DATEDIFF(day, CURRENT_DATE(),
       DATEADD(day, 3, CURRENT_DATE()));
```

Calculate the difference between the current time and the time in three hours:

```sqlexample
SELECT DATEDIFF(hour, CURRENT_TIMESTAMP(),
       DATEADD(hour, 3, CURRENT_TIMESTAMP()));
```

Calculate the difference between the current time and the time in three minutes:

```sqlexample
SELECT DATEDIFF(minute, CURRENT_TIMESTAMP(),
       DATEADD(minute, 3, CURRENT_TIMESTAMP()));
```

Calculate the difference between the current time and the time in three seconds:

```sqlexample
SELECT DATEDIFF(second, CURRENT_TIMESTAMP(),
       DATEADD(second, 3, CURRENT_TIMESTAMP()));
```

## Creating yearly calendar views

```sqlexample
CREATE OR REPLACE VIEW calendar_2016 AS
  SELECT n,
         theDate,
         DECODE (EXTRACT('dayofweek',theDate),
           1 , 'Monday',
           2 , 'Tuesday',
           3 , 'Wednesday',
           4 , 'Thursday',
           5 , 'Friday',
           6 , 'Saturday',
           0 , 'Sunday') theDayOfTheWeek,
         DECODE (EXTRACT(month FROM theDate),
           1 , 'January',
           2 , 'February',
           3 , 'March',
           4 , 'April',
           5 , 'May',
           6 , 'June',
           7 , 'July',
           8 , 'August',
           9 , 'september',
           10, 'October',
           11, 'November',
           12, 'December') theMonth,
         EXTRACT(year FROM theDate) theYear
  FROM
    (SELECT ROW_NUMBER() OVER (ORDER BY seq4()) AS n,
            DATEADD(day, ROW_NUMBER() OVER (ORDER BY seq4())-1, TO_DATE('2016-01-01')) AS theDate
      FROM table(generator(rowCount => 365)))
  ORDER BY n ASC;

SELECT * from CALENDAR_2016;
```

```output
+-----+------------+-----------------+-----------+---------+
|   N | THEDATE    | THEDAYOFTHEWEEK | THEMONTH  | THEYEAR |
|-----+------------+-----------------+-----------+---------|
|   1 | 2016-01-01 | Friday          | January   |    2016 |
|   2 | 2016-01-02 | Saturday        | January   |    2016 |
  ...
| 364 | 2016-12-29 | Thursday        | December  |    2016 |
| 365 | 2016-12-30 | Friday          | December  |    2016 |
+-----+------------+-----------------+-----------+---------+
```

---
title: Writing external functions
source: https://docs.snowflake.com/en/sql-reference/external-functions.md
section: SQL General Reference
---

# Writing external functions

External functions are user-defined functions that are stored and executed outside of Snowflake.

External functions make it easier to access external API services such as geocoders, machine learning models, and other custom code
running outside of Snowflake. This feature eliminates the need to export and reimport data when using third-party services, significantly
simplifying your data pipelines.

> **Note:**
>
> When using external functions in China, use the [syntax and workflow described for AWS](external-functions-creating-aws.md).

[Introduction to external functions](external-functions-introduction.md)
:   Learn about external functions, which call executable code that is developed, maintained, stored, and executed outside Snowflake.

[Remote service input and output data formats](external-functions-data-format.md)
:   Understand the data formats sent and received by Snowflake.

[Using request and response translators with data for a remote service](external-functions-translators.md)
:   Change the format of data sent to and received from remote services.

[Designing high-performance external functions](external-functions-implementation.md)
:   Design high-performance functions with these tips on asynchronous services, scalability, concurrency, and reliability.

[External functions best practices](external-functions-best-practices.md)
:   Improve efficiency and prevent unexpected results with these best practices.

[Securing an external function](external-functions-security.md)
:   Create secure external functions.

## Remote services

[Creating external functions on AWS](external-functions-creating-aws.md)
:   Create an external function from functionality on AWS.

[Creating external functions on GCP](external-functions-creating-gcp.md)
:   Create an external function from functionality on GCP.

[Creating external functions on Microsoft Azure](external-functions-creating-azure.md)
:   Create an external function from functionality on Azure.

## Connectors & Drivers

Snowflake connectors for Kafka, Google, Microsoft, and other third-party platforms.

---
title: About the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for Google Analytics Aggregate Data

Google Analytics is a cloud-based tool that provides insight into how users interact with your website. You can use it to analyze user actions, track the number of visitors and page views, and analyze bounce rates for a page.

The Snowflake Connector for Google Analytics Aggregate Data enables you to automatically ingest aggregated data from Google Analytics 4 (GA4) reports into your Snowflake account. The connector extracts aggregated data using the [GA4 Reporting API](https://developers.google.com/analytics/devguides/reporting/data/v1).

Data ingestion relies on v1 of the [Google Analytics Data API](https://developers.google.com/analytics/devguides/reporting/data/v1). For more information about ingestion model, see [Snowflake Connector for Google Analytics Aggregate Data ingestion model](gaad-ingestion-model.md).

> **Note:**
>
> * The connector can only ingest Google Analytics 4 (GA4) report data.
> * The connector requires the `date` dimension to be present in a report definition.

For release note information, see [Snowflake Connector for Google Analytics Aggregate Data release notes](../../../release-notes/connectors/gaad.md).

## Limitations

The Snowflake Connector for Google Analytics Aggregate Data has the following limitations:

* Accounts in government regions are not supported.
* The connector can only retrieve data for Google Analytics 4 (GA4) properties. Universal Analytics (UA) are not supported.
* The data in Google Analytics might change up to 72 hours after it is recorded. Currently, the connector cannot reflect the changes in real time.
* The Snowflake Connector for Google Analytics Aggregate Data is not supported with Snowflake trial accounts due to external access security concerns.
* The Snowflake Connector for Google Analytics Aggregate Data creates tables and views for the ingested data in a database and schema chosen by the user. Currently, the connector must have ownership of those tables and views. There must be no future ownership grants on the database or schema, and the schema must not have managed access enabled.
* Users can configure at most 40 reports. Currently, this limit cannot be increased.
* [AUTOCOMMIT](../../../sql-reference/parameters.md) must be enabled to configure and use the connector.
* Currently, the connector requires the [TIMESTAMP_INPUT_FORMAT](../../../sql-reference/parameters.md) to be set to AUTO.
* If the warehouse used by the connector has [STATEMENT_TIMEOUT_IN_SECONDS](../../../sql-reference/parameters.md) set, it must be set to a minimum of 4 hours.

---
title: About the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for Google Analytics Raw Data

Google Analytics is a cloud-based tool that provides insight into how users interact with your website. You can use it to analyze user actions, track the number of visitors and page views, and analyze bounce rates for a page.

The Snowflake Connector for Google Analytics Raw Data enables you to automatically ingest event-level Google Analytics 4 (GA4) data into your Snowflake account. If you want to extract aggregated report data, see [About the Snowflake Connector for Google Analytics Aggregate Data](../gaad/gaad-connector-about.md) and the [GA4 Reporting API](https://developers.google.com/analytics/devguides/reporting/data/v1).

To extract Snowflake Connector for Google Analytics Raw Data – the granular, event-level details – you must set up a manual link between a GA4 property and a Google Cloud Platform (GCP) project. This enables the export of raw data into BigQuery. The Snowflake Connector for Google Analytics Raw Data then connects to the [BigQuery Storage API](https://cloud.google.com/bigquery/docs/reference/storage/), and downloads the data into your Snowflake account.

The Snowflake Connector for Google Analytics Raw Data ingests data to the selected destination database and schema. Tables and views containing your Google Analytics 4 data within that schema are temporarily owned by the connector, for as long as the connector is installed.
If you want to uninstall, but do not want to lose your data, please see the [Uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data](gard-connector-uninstalling-and-reinstalling.md) section and read about the data ownership transfer during uninstallation.

For release note information, see [Snowflake Connector for Google Analytics Raw Data release notes](../../../release-notes/connectors/gard.md).

# Known limitations

The Snowflake Connector for Google Analytics Raw Data has the following limitations:

* Accounts in government regions are currently not supported.
* The Snowflake Connector for Google Analytics Raw Data does not work on Snowflake trial accounts due to external access security concerns. This is not expected to change in the future.
* The connector can retrieve data for Google Analytics 4 (GA4) properties only. Universal Analytics (UA) is not, and will not be supported.
* The Snowflake Connector for Google Analytics Raw Data assumes that the application is the owner (has OWNERSHIP privilege) of all tables and views in [destination schema](gard-connector-setting-up-data.md). Granting the FUTURE OWNERSHIP privilege on tables or views in this SCHEMA/DATABASE, or using a managed schema, will result in connector not working correctly.
* The AUTOCOMMIT parameter has to be enabled in the session interacting with the connector.
* The connector will not work correctly if custom date formats are set in the account.
* Emojis are not supported as parts of the application name set during connector installation.
* Switching Google analytics export to a different project is not supported.

---
title: About the Snowflake Connector for Microsoft Power Platform
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for Microsoft Power Platform

This topic describes how to connect to Snowflake from Microsoft Power Platform by adding Snowflake as a data connection.

When connected, you can use your Snowflake data from the following platforms:

* Power Apps: Build applications that can read from and write to Snowflake .
* Power Automate: Build flows and add actions that enable executing custom SQL and get back the results.
* Copilot Studio: Build custom agents using your Snowflake data as a knowledge source.
* Logic Apps: Build and run automated workflows in, across, and outside the software ecosystems in your enterprise or organization.

The Microsoft Power Platform helps you create flows and add actions to execute and get back results of custom SQL statements with the Snowflake connection.

## Supported capabilities for Power Apps

* Users should first create virtual tables and then load them into apps with the Snowflake connection.

  To learn how to create virtual tables, see
  [Create and edit virtual tables with Microsoft Dataverse - Power Apps](https://learn.microsoft.com/power-apps/maker/data-platform/create-edit-virtual-entities).

## Virtual Network support

With Azure Virtual Network support for Power Platform, users can integrate Power Platform with resources
inside their virtual network without exposing them over the public internet.

To connect to Virtual Network, please make sure to follow both steps mentioned below.

1. Learn how to setup [Azure Private Link and Snowflake](../../../user-guide/privatelink-azure.md).
2. Learn how to setup [Virtual Network support for Power Platform](https://learn.microsoft.com/power-platform/admin/vnet-support-setup-configure).
   To learn more about Azure Virtual Network, see [Virtual Network support overview](https://learn.microsoft.com/power-platform/admin/vnet-support-overview).

## Prerequisites

* Users must have a Snowflake account.
* Users must have Microsoft Entra ID for the external authorization.
  The authorization flow for PowerApps supports Service-Principal; however, Power Automate supports both Service-Principal and on-behalf-of-user flows.
* Users must have a premium Power Apps license.

## Known issues and limitations

1. A Snowflake table needs to have a Primary or Unique Key (integer data type only), and at least one additional column.
2. We currently do not support duplicate columns when the `join` command is executed. A workaround would be to add aliases to the duplicated columns.
3. Other limitations with Virtual Tables are listed [here](https://learn.microsoft.com/power-apps/maker/data-platform/create-edit-virtual-entities#considerations-when-you-use-virtual-tables) .
4. Virtual tables are only supported with connections created with ‘Service Principal’ authentication.
5. When using Service Principle authentication, the user needs to have read access to the **information_schema.columns** table.
6. Snowflake connections cannot be created directly in Canvas apps. Error information and steps needed to resolve the issue are as follows:

   1. An error will show if the Snowflake connection is created directly in a Canvas app as shown below.
   2. Rather than adding the connector directly in the Canvas app,
      create a service principal connection (not delegated) from outside of the Canvas app.
      Use the Snowflake connection created above and create a virtual table.
   3. Afterwards, the virtual table can be loaded in the Canvas app, and builds out of the Canvas app can proceed as normal.

      > **Note:**
      >
      > The `ANIMALS` table above is a virtual table, created using the Snowflake Connection as previously described in [Install and configure Snowflake Connector for Snowflake Connector for Microsoft Power Platform](tasks.md).

## Considerations

Consider the following when using the Snowflake connector with Microsoft Power Platform:

* The authorization server can grant the OAuth client an access token on behalf of the user, referred to as `DELEGATED BASED AUTH`.
* The authorization server can grant the OAuth client an access token for the OAuth client itself, referred to as `SP BASED AUTH`.
* When creating a security integration, describe the integration created and
  determine if the role given to the Snowflake user is in the blocked list.

  If in the blocked list, than either change or remove the role of the user in the blocked list.
* Ensure that the `login_name` and roles are correctly set in Snowflake.

  To verify login names, open a browser to Snowsight. In the navigation menu, select Governance & security » Users & roles.
  Select a user and edit as required.
* Snowflake account details (warehouse, role, schema, database) are case-sensitive and must match exactly as they are in the Snowflake account when configuring the connection.
* For Delegated and Service Principal based connections, please create a Power Automate flow to validate the connection.

## Customers using Snowflake Connector [DEPRECATED]

* Applicable: All regions
* This option is only for older connections without an explicit authentication type and is only provided for backward compatibility.
* To migrate from an older Snowflake connector to the new one, please follow the steps below.

  1. Power Automate flows and Power Apps using older connections will need to be updated by changing to the new connection.
  2. Power Automate flow action “Convert result set rows from array to objects” would also need to be dropped as that functionality is now wrapped in “Check the Status and Get Results”.

### Next steps

After reviewing this page, review the current set of [installation tasks](tasks.md).

---
title: About the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The Snowflake Connector for MySQL allows you to:

* Load data into Snowflake from a MySQL database.
* Configure replication so that changes in your MySQL database are replicated to Snowflake.

To handle connections between Snowflake and MySQL, the connector uses an agent. The agent is
distributed as a Docker image. The agent is run within your network and is used to push data into your Snowflake
account.

> **Note:**
>
> The Snowflake Connector for MySQL requires exactly one instance of the agent application to be running at all times.

The ongoing incremental updates use the Change Data Capture (CDC) technique that captures changes performed on the source database.
The changes include INSERT, UPDATE, and DELETE operations, which are automatically replicated on the
destination database in Snowflake.

## Multiple appplication instances

You can install multiple instances of Snowflake Connector for MySQL on your Snowflake account.
For more information, see [Optional: Installing multiple instances of Snowflake Connector for MySQL](install-snowsight.md).

## Private links

The Snowflake connector for MySQL supports private links. For more information, see:

* [AWS PrivateLink and Snowflake](../../user-guide/admin-security-privatelink.md)
* [Azure Private Link and Snowflake](../../user-guide/privatelink-azure.md)
* [Google Cloud Private Service Connect and Snowflake](../../user-guide/private-service-connect-google.md)

## Agent & Connector App compatibilities

The Snowflake Connector for MySQL is being released against a specific version, described as **x.y.z version** where x is major, y is minor and z is patch. Agents on dockerhub are also
released with the X.Y.Z version. Each x.y.z version of Snowflake Connector for MySQL supports all agents with the same major version X=x and not greater
minor version of the agent. Moreover each x.0.0 version of the Snowflake Connector for MySQL supports all (x-1).Y.Z versions of the agent for all Y and Z.

## Known limitations

The following sections describe the known limitations for the connector.

### Transaction size

The connector is subject to the [same limitations as MySQL’s group replication](https://dev.mysql.com/doc/refman/8.4/en/group-replication-limitations.html#group-replication-limitations-transaction-size).
This means that a single transaction must fit into a binary log message of no more than 4GB.
Transactions exceeding this size will cause the source table to be marked as permanently failed, and
require a full snapshot reload of the associated table.

### Connector availability

When installing the connector note the following limitations:

* Accounts in government regions are not supported.
* To install and configure the connector, you must be logged in as a user with the ACCOUNTADMIN role. Other roles are not supported at this time.

### Types compatibility

Differences between the source database and Snowflake column types prevent some values from being converted and written into Snowflake because of the maximum column capacity or allowed ranges. For example:

* Snowflake BINARY type has a maximum length of 64 MB (67108864 bytes)
* Snowflake date types, like DATE, DATETIME, and TIMESTAMP, have a maximum year of 9999
* Snowflake VARCHAR type has a maximum length of 128 MB (134217728 bytes)

If such incompatibility happens, the replication of a table is stopped with a failure.

### Source table schema changes

The following table shows different types of changes to the source table schema and whether they are supported, and if so how.

New column names are subject to the same limitations as described in the Identifiers limitations section.

| Type of schema change | Supported? | Notes |
| --- | --- | --- |
| Adding a new column | Yes | The new column will be visible in the destination table just like any other column that existed at the start of the replication.  It is not possible to add a new column with the same name as a previously deleted or renamed column.  For example, if columns `A` and `B` existed initially, but `A` was deleted and `B` was renamed to `B2` - it is not possible to add a column named `A` or `B`. |
| Deleting an existing column | Yes | If a column is deleted in the source table, it will not be deleted in the destination table. Instead, a soft-delete approach is followed and the column will be renamed to `<previous name>__SNOWFLAKE_DELETED` so that historical values can still be queried. All rows replicated after the column is deleted will have a NULL value in this column.  For example, if a column `A` is deleted, it will be renamed to `A__SNOWFLAKE_DELETED` in the destination table and the contents of the column remain unchanged. |
| Renaming a column | Yes | Renaming a column is equal to deleting the column and creating a new one with the new name. The deletion follows the soft-delete approach explained in the previous row.  For example, if column `A` was renamed to `B` - in the destination table `A` was renamed to `A__SNOWFLAKE_DELETED`, and a new column `B` was added. All rows existing before the change keep the column’s values in the `A__SNOWFLAKE_DELETED` column, while new rows added after the change have the values in the `B` column.  It is not possible to rename a column to the same name as a previously deleted or renamed column. For example, if columns `A`, `B` and `C` existed initially, but `A` was deleted and `B` was renamed to `B2` - it is not possible to rename the column `C` to `A` or `B`. |
| Changing the type of column | Partially | Changing the type of source table column is only possible if both the previous and the new type are mapped to the same type in Snowflake.  In any other case, the replication will fail permanently. |
| Changing the precision of a numeric column | No | Changing the precision of a source table column will result in replication failing permanently. |
| Changing the scale of a numeric column | No | Changing the scale of a source table column will result in replication failing permanently. |
| Changing the primary key definition | No | Changing the primary key definition of the source table column will result in replication failing permanently. |

### High-capacity columns

An active agent is continuously reading all events from the binary log, even if some events refer to source tables that
were not added for replication. If the binary log contains very large events, like updates of the BLOB-like columns,
the agent might crash due to the lack of available memory.

### Primary keys

Tables without primary keys are not supported.

### Identifiers limitations

Currently, the connector does not support the `"` character in replicated schema, table or column names.
Additionally, the following keywords are not supported:

For schema names:
:   * `INFORMATION_SCHEMA`

For column names:
:   * `_SNOWFLAKE_INSERTED_AT`
    * `_SNOWFLAKE_UPDATED_AT`
    * `_SNOWFLAKE_DELETED`
    * names with suffix `__SNOWFLAKE_DELETED`
    * Column names marked as `Cannot be used as column name` in [Reserved & limited keywords](../../sql-reference/reserved-keywords.md)

### MySQL version >= 8.0.0

Currently, the connector depends on `binlog_row_metadata = full` configuration property that was introduced in MySQL, version 8.

### Source database authorization

Private key authorization to the source database is not supported. Only authorization via user and password is supported.

---
title: About the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The Snowflake Connector for PostgreSQL allows you to:

* Load data into Snowflake from a PostgreSQL database.
* Configure replication so that changes in your PostgreSQL database are replicated to Snowflake.

To handle connections between Snowflake and PostgreSQL, the connector uses an agent. The agent is
distributed as a Docker image. The agent is run within your network and is used to push data into your Snowflake
account.

> **Note:**
>
> The Snowflake Connector for PostgreSQL requires exactly one instance of the agent application to be running at
> all times.

The ongoing incremental updates use the Change Data Capture (CDC) technique that captures changes performed on the source database.
The changes include INSERT, UPDATE, and DELETE operations, which are automatically replicated on the
destination database in Snowflake.

## Multiple appplication instances

You can install multiple instances of the Snowflake Connector for PostgreSQL on your Snowflake account.
For more information, see [Optional: Installing multiple instances of Snowflake Connector for PostgreSQL](install-snowsight.md).

## Private links

The Snowflake connector for PostgreSQL supports private links. For more information, see:

* [AWS PrivateLink and Snowflake](../../user-guide/admin-security-privatelink.md)
* [Azure Private Link and Snowflake](../../user-guide/privatelink-azure.md)
* [Google Cloud Private Service Connect and Snowflake](../../user-guide/private-service-connect-google.md)

## Agent and connector application compatibilities

The Snowflake Connector for PostgreSQL is being released against a specific version, described as **x.y.z version** where x is major, y is minor and z is patch. Agents on dockerhub are also
released with the X.Y.Z version. Each x.y.z version of Snowflake Connector for PostgreSQL supports all agents with the same major version X=x and not greater
minor version of the agent. Moreover each x.0.0 version of the Snowflake Connector for PostgreSQL supports all (x-1).Y.Z versions of the agent for all Y and Z.

## Known limitations

The following sections describe the known limitations for the connector.

### Read replicas are not supported

Because of PostgreSQL limitations, logical replication is not supported on replicas therefore the Snowflake Connector for PostgreSQL must be connected to primary database only.

### Connector availability

When installing the connector note the following limitations:

* Accounts in government regions are not supported.
* To install and configure the connector, you must be logged in as a user with the ACCOUNTADMIN role. Other roles are not supported at this time.

### Types compatibility

Differences between the source database and Snowflake column types prevent some values from being converted and written into Snowflake because of the maximum column capacity or allowed ranges. For example:

* Snowflake BINARY type has a maximum length of 64 MB (67108864 bytes)
* Snowflake date types, like DATE, DATETIME, and TIMESTAMP, have a maximum year of 9999
* Snowflake VARCHAR type has a maximum length of 128 MB (134217728 bytes)

If such incompatibility happens, the replication of a table is stopped with a failure.

### Source table schema changes

The following table shows different types of changes to the source table schema and whether they are supported, and if so how.

New column names are subject to the same limitations as described in the Identifiers limitations section.

| Type of schema change | Supported? | Notes |
| --- | --- | --- |
| Adding a new column | Yes | The new column will be visible in the destination table just like any other column that existed at the start of the replication.  It is not possible to add a new column with the same name as a previously deleted or renamed column.  For example, if columns `A` and `B` existed initially, but `A` was deleted and `B` was renamed to `B2` - it is not possible to add a column named `A` or `B`. |
| Deleting an existing column | Yes | If a column is deleted in the source table, it will not be deleted in the destination table. Instead, a soft-delete approach is followed and the column will be renamed to `<previous name>__SNOWFLAKE_DELETED` so that historical values can still be queried. All rows replicated after the column is deleted will have a NULL value in this column.  For example, if a column `A` is deleted, it will be renamed to `A__SNOWFLAKE_DELETED` in the destination table and the contents of the column remain unchanged. |
| Renaming a column | Yes | Renaming a column is equal to deleting the column and creating a new one with the new name. The deletion follows the soft-delete approach explained in the previous row.  For example, if column `A` was renamed to `B` - in the destination table `A` was renamed to `A__SNOWFLAKE_DELETED`, and a new column `B` was added. All rows existing before the change keep the column’s values in the `A__SNOWFLAKE_DELETED` column, while new rows added after the change have the values in the `B` column.  It is not possible to rename a column to the same name as a previously deleted or renamed column. For example, if columns `A`, `B` and `C` existed initially, but `A` was deleted and `B` was renamed to `B2` - it is not possible to rename the column `C` to `A` or `B`. |
| Changing the type of column | Partially | Changing the type of source table column is only possible if both the previous and the new type are mapped to the same type in Snowflake.  In any other case, the replication will fail permanently. |
| Changing the precision of a numeric column | No | Changing the precision of a source table column will result in replication failing permanently. |
| Changing the scale of a numeric column | No | Changing the scale of a source table column will result in replication failing permanently. |
| Changing the primary key definition | No | Changing the primary key definition of the source table column will result in replication failing permanently. |

### High-capacity columns

An active agent is continuously reading all events using logical replication mechanism, even if some events refer to source tables that
were not added for replication. If the logical replication contains very large events, like updates of the TEXT-like columns,
the agent might crash because of the lack of available memory.

### Primary keys

Tables without primary keys are not supported.

### Identifiers limitations

Currently, the connector does not support the `"` character in replicated schema, table or column names.
Additionally, the following keywords are not supported:

For schema names:
:   * `INFORMATION_SCHEMA`

For column names:
:   * `_SNOWFLAKE_INSERTED_AT`
    * `_SNOWFLAKE_UPDATED_AT`
    * `_SNOWFLAKE_DELETED`
    * names with suffix `__SNOWFLAKE_DELETED`
    * Column names marked as `Cannot be used as column name` in [Reserved & limited keywords](../../sql-reference/reserved-keywords.md)

### PostgreSQL version >= 11

Currently, the connector depends on `wal_level = logical` configuration property that was introduced in PostgreSQL, version 11.

### Replica identity setting

The [Replica identity](https://www.postgresql.org/docs/current/sql-altertable.html#SQL-ALTERTABLE-REPLICA-IDENTITY) of replicated tables must be set to `DEFAULT`.

### TOAST values

The replication of tables with TOAST values is not currently supported. This includes adding TOAST-able columns to the source schema when replication is already running.

### Source database authorization

Private key authorization to the source database is not supported. Only authorization via user and password is supported.

### Replica identity

The replica identity of a given table must be the same as the primary key, otherwise the replication will fail.

---
title: About the Snowflake Connector for ServiceNow®
source: https://docs.snowflake.com/en/connectors/servicenow/about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for ServiceNow®

The Snowflake Connector for ServiceNow® enables you to ingest ServiceNow® data into your Snowflake account.

ServiceNow® is a cloud-based platform that delivers workflows for service management including incident management, change management, asset management, configuration management, service catalog, request fulfillment, etc.

The Snowflake Connector for ServiceNow® allows you to ingest data from ServiceNow® into Snowflake automatically. The connector supports both the initial load of historical data as well as incremental updates. The latest data is regularly pulled from ServiceNow®. You control how frequently it is refreshed.

> **Note:**
>
> Data ingestion relies on `v2` of the ServiceNow® [table API](https://developer.servicenow.com/dev.do#!/reference/api/latest/rest/c_TableAPI).

The connector lets you replicate key dimensions and metrics from ServiceNow®, including:

* Incidents
* Changes
* Users
* Service catalog items
* Configuration items
* Company assets

## Known Limitations

The Snowflake Connector for ServiceNow® has the following limitations:

* The connector can only ingest ServiceNow® tables with the `sys_id` column present.
* Changes to ServiceNow® table schema are not reflected in already ingested rows, unless they are updated.
* ServiceNow® views are not supported.
* Archived records in ServiceNow® are not ingested into Snowflake. See [the documentation](ingestion.md) for more details.
* The connector does not work with ServiceNow® instances where IP address access control has been configured to deny access from an outside network.
* The connector does not work with a ServiceNow® instance that is hidden behind a VPN.
* Replication of the connector to failover region is not automatic and requires additional manual steps.
* The connector does not support [MANAGED ACCESS](../../sql-reference/sql/create-versioned-schema.md) destination schemas.
* The connector requires the [AUTOCOMMIT parameter](../../sql-reference/parameters.md) to be enabled.
* The connector requires a virtual warehouse with AUTO_RESUME enabled. Serverless is not supported. Connector procedures are not guaranteed to work when called using serverless compute.
* Executing certain procedures via external tasks is not supported, that is:

  > + `CHECK_IF_AUDIT_ENABLED`
  > + `CHECK_RECORD_HISTORY`
  > + `TEST_CONNECTION`
  > + `TEST_TABLE_ACCESS`
  > + `GET_AVAILABLE_TABLES`
  > + `ENABLE_TABLE` (with custom configuration parameters)
  > + `SET_CONNECTION_CONFIGURATION`
  > + `FINALIZE_CONNECTOR_CONFIGURATION`
  > + `CONFIGURE_QUERY_CATEGORY`
  > + `CONFIGURE_CUSTOM_API_PATH`

---
title: About the Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/about.md
section: Connectors & Drivers
---

# About the Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes the basic concepts of Snowflake Connector for SharePoint, its use cases and benefits, key features,
how it works, and limitations.

The Snowflake Connector for SharePoint connector connects a Microsoft 365 SharePoint site and Snowflake to ingest
files and user permissions and keeps them up to date.
Snowflake Connector for SharePoint also supports the Cortex Search service and can make ingested files ready for conversational
analysis for use in AI Assistants using SQL, Python or REST APIs.

## Benefits

* Frictionless ingestion: The connector is easy to set up and configure. You can use files from SharePoint
  with Cortex Search in your chat interface of choice.
* Secure by default: The connector adheres to end-user access controls in SharePoint through Cortex Search filters.
* Scalable by design: Built on the Snowflake Native App framework, the connector leverages Snowflake’s built-in security,
  scalability and reliability capabilities.
* Saves costs: The connector saves you cost by eliminating the need to manually
  transfer files or integrate against API endpoints or manage third-party solutions.

## Use cases

Use this connector if you’re looking to do the following:

* Create AI assistants for public documents within your organization’s SharePoint site
* Enable your AI assistants to adhere to access controls specified in your organization’s SharePoint site

## How it works

This section describes how this connector works with respect to the two use cases mentioned earlier.

### Create AI assistants for public documents within your organization’s SharePoint site

Working with Snowflake Connector for SharePoint for this use case can be broadly divided into four phases,
each associated with a specific user persona. The following workflow describes these phases,
the associated user journey, and how this connector works:

1. An [Azure or Office 365 account administrator](https://learn.microsoft.com/en-us/microsoft-365/admin/add-users/about-admin-roles) in your organization configures [Microsoft Graph](https://learn.microsoft.com/en-us/graph/overview)
   to enable OAuth authentication as described in [Get access without a user](https://learn.microsoft.com/en-us/graph/auth-v2-service?tabs=http).
   They then share the required credentials with the organization’s data engineer.
2. A **data engineer or data scientist** in your organization installs the SharePoint Connector for Snowflake
   from the Snowflake marketplace into their Snowflake account. They then configure the connector with the following information:

   * Specifying the SharePoint OAuth credentials (ClientID, Client Secret and TenantID) obtained from step 1.
   * Specifying the URL of their SharePoint site. Typically, this is a specific subsite within your SharePoint site.
   * Choosing whether to ingest files from all folders or a specific folder in the SharePoint URL.
     Note that the files from subfolders are always included.

   After the connector validates the configuration, it does the following:

   1. Ingests supported files and user permissions from the specified source.
   2. Uses the [PARSE_DOCUMENT function](../../../user-guide/snowflake-cortex/parse-document.md) of Cortex AI to parse and chunk the ingested files.
   3. Creates a [Cortex Search service](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) to serve as a RAG engine for your own AI assistants with your parsed and chunked data.
3. An **IT developer** in the organization creates a chatbot in their chat interface of choice,
   such as bot extensions in Slack, Teams or a web page, and hosts it as appropriate within their environment.
   The IT Developer configures roles, permissions and authentication in Snowflake to use the Cortex Search
   REST API endpoint available in the suite of [Snowflake REST APIs](../../../developer-guide/snowflake-rest-api/snowflake-rest-api.md).
4. After your AI assistant is up and running, **business users** in your organization can interact with
   it to ask questions and see responses based on the files ingested from your SharePoint site into your
   Snowflake account. All responses have citations that are links to the source documents from your SharePoint site.

### Enable your AI assistants to adhere to access controls specified in your organization’s SharePoint site

Working with Snowflake Connector for SharePoint for this use case can be broadly divided into four phases,
each associated with a specific user persona. The following workflow describes these phases, the associated user journey,
and how this connector works:

1. An [Azure or Office 365 account administrator](https://learn.microsoft.com/en-us/microsoft-365/admin/add-users/about-admin-roles) in your organization configures [Microsoft Graph](https://learn.microsoft.com/en-us/graph/overview)
   to enable OAuth authentication as described in [Get access without a user](https://learn.microsoft.com/en-us/graph/auth-v2-service?tabs=http).
   They then share the required credentials with the organization’s data engineer or data scientist.
2. A **data engineer or data scientist** in your organization installs the SharePoint Connector for Snowflake
   from the Snowflake marketplace into their Snowflake account. They then configure the connector by:

   * Specifying the SharePoint OAuth credentials (client ID, client secret and tenant ID) obtained in step 1.
   * Specifying the URL of their SharePoint site. Typically, this is a specific subsite within your SharePoint site.
   * Choosing whether to ingest files from all folders or a specific folder in the SharePoint URL.
     Note that the files from subfolders are always included.

   After the connector validates the configuration, it does the following:

   1. Ingests supported files and user permissions from the specified source.
   2. Uses the PARSE_DOCUMENT function of Cortex AI to parse and chunk the ingested files.
   3. Creates a Cortex Search service to serve as a RAG engine for your own AI assistants with your parsed and chunked data.
3. An **IT developer** in the organization creates a chatbot in their chat interface of choice,
   such as bot extensions in Slack, Teams or a web page, and hosts it as appropriate within their environment.

   1. They configure roles, permissions, and authentication in Snowflake to use the Cortex Search REST API endpoint available in the suite of Snowflake REST APIs.
   2. They specify a filter containing the email ID of the SharePoint user when the AI assistant queries the Cortex Search REST API, for example `filter.@contains.user_ids` or `filter.@contains.user_emails`. This restricts responses from Cortex Search to documents that the specified business user has access to in your organization’s SharePoint.
4. After your AI assistant is up and running, when **business users** in your organization interact with it to
   ask questions, they will only see information from files in your SharePoint thay have access to because of the filter specified in Step 3(b).
   All responses have citations that are links to the source documents from your SharePoint site.

## Limitations

* [Cortex Parse Document limitations and requirements](../../../user-guide/snowflake-cortex/parse-document.md)
* [Cortex Search limitations](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md)
* Changes caused by moving or renaming folders aren’t captured during incremental ingestion.
* The connector supports only Microsoft 365 groups.
* The connector ingests only the supported file types and ignores others.

## Regional availability

The Snowflake Connector for SharePoint depends on
[Cortex Parse document](../../../user-guide/snowflake-cortex/parse-document.md) and
[Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

The Snowflake Connector for SharePoint is currently available in the regions listed in [Cortex Parse Document regional availability](../../../user-guide/snowflake-cortex/parse-document.md).

## Next step

[Set up the Snowflake Connector for SharePoint](setup.md).

---
title: Access the ServiceNow® data in Snowflake
source: https://docs.snowflake.com/en/connectors/servicenow/accessing-data.md
section: Connectors & Drivers
---

# Access the ServiceNow® data in Snowflake

This topic describes how to access ServiceNow® data from your Snowflake account.

For each table in ServiceNow® that is configured for synchronization, the connector creates the following tables
and views:

* A table with the same name that contains the data in raw form, where each record is contained in a single
  VARIANT column.
* A table named `table_name__event_log` that contains the history of changes made to ServiceNow® records.
* A view named `table_name__view` that contains the data in flattened form, where the view contains a
  column for each column in the original table and a row for each record that is present in the original table.
* A view named `table_name__view_with_deleted` that contains the same data as
  `table_name__view` as well as rows for records that have been deleted in ServiceNow®.

> **Note:**
>
> After starting the connector, it may take some time for the views to be created.
>
> View creation depends on data in the ServiceNow® `sys_db_object`, `sys_dictionary` and `sys_glide_object` tables.
> The connector loads metadata from these tables after a business table is enabled for synchronization.
> When the metadata tables are ingested, a background task will create flattened views of the enabled tables.
> The task is run as often as the schedule of the most frequent table ingestion. After the metadata tables are synced,
> the task also captures any table schema changes and updates the already created views accordingly (only the views
> with the suffixes `__view` and `__view_with_deleted`, but not with `__view_with_display_values`).
>
> As it’s not an immediate process, status of view creation process is available under the `CONFIGURED_TABLES` view.
> If the view creation takes too long, the `CONNECTOR_ERRORS` view can also be checked for any related errors.

> **Warning:**
>
> If you plan to set [ROW ACCESS POLICIES](../../user-guide/security-row-using.md) on the tables
> and views created by the connector, make sure they do not block access to the role with the same name as the connector application.
> For example, if your connector application instance is called `MY_CONNECTOR_SERVICENOW`, then your policies cannot
> block a role named `MY_CONNECTOR_SERVICENOW`. Otherwise, the policies will interfere with the data ingestion process.

The following sections explain how to grant the privileges to access this data and how to access the data from
these tables and views.

## Grant privileges for accessing the ServiceNow® data in Snowflake

After the Snowflake Connector for ServiceNow® synchronizes the data with Snowflake, to access the ServiceNow® data a role needs:

* USAGE privilege on the database and schema that contain the ServiceNow® data in Snowflake, and
* a [DATA_READER application role](https://other-docs.snowflake.com/en/connectors/servicenow/v2/application-roles#data-reader-application-role).

Snowflake recommends creating a dedicated role with these privileges that can be granted to users who need
access to the ingested ServiceNow® data. If the connector has been [installed with Snowsight](installing-snowsight.md)
then the role provided during [Configure](installing-snowsight.md) step already has the necessary privileges.

For example, if you configured the connector application called `my_connector_servicenow` to store the ServiceNow®
data in the `dest_db` database and `dest_schema` schema, you can create a role named
`servicenow_data_reader_role` and grant the privileges for accessing the data to that role.

The following example shows how to grant these privileges:

> ```sqlexample
> CREATE ROLE servicenow_data_reader_role;
> GRANT USAGE ON DATABASE dest_db TO ROLE servicenow_data_reader_role;
> GRANT USAGE ON SCHEMA dest_db.dest_schema TO ROLE servicenow_data_reader_role;
> GRANT APPLICATION ROLE my_connector_servicenow.DATA_READER to role servicenow_data_reader_role;
> ```

> **Note:**
>
> * Do not run `GRANT OWNERSHIP ON FUTURE TABLES IN SCHEMA` on the schema that contains the ServiceNow® data
>   in Snowflake. Also, do not change the ownership of the tables that are already created by the connector.
>   Changing the ownership prevents the connector from ingesting the data to the table.
> * Do not change the ownership of the views in the schema that contains the ServiceNow® data in Snowflake.
>   Changing the ownership prevents the connector from updating the views when changes occur in the
>   ServiceNow® table schema.

## Access the raw data

For each ServiceNow® table that you synchronize, the Snowflake Connector for ServiceNow® creates a new table with
the same name in the database and schema for the ServiceNow® data in Snowflake.

For example, if you configured the connector to store the ServiceNow® data in the `dest_db` database and
`dest_schema` schema, and if you configured the connector to synchronize the `incident` table in
ServiceNow®, the connector creates the table named `dest_db.dest_schema.incident`.

This table contains raw data ingested from ServiceNow®. This table contains the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `sys_id` | VARCHAR | The value of the `sys_id` of the record in ServiceNow®. |
| `raw` | VARIANT | The data for the record in raw form. |
| `is_deleted` | BOOLEAN | Specifies whether or not the record was deleted in ServiceNow®. |
| `last_update_date` | TIMESTAMP_NTZ | The last time the record was updated in Snowflake. Note that the displayed timestamp is provided in the UTC timezone with no offset, which may differ from the timezone of dates displayed in the ServiceNow instance. |

The following is an example of the output for a SELECT statement that retrieves the data for the
`dest_db.dest_schema.incident` table:

> ```sqlexample
> SELECT * FROM DEST_DB.DEST_SCHEMA.INCIDENT LIMIT 5;
>
> +----------------------------------+-------------------------+-------------+--------------------------+
> | SYS_ID                           | RAW:ACTIVE              |  IS_DELETED | LAST_UPDATE_DATE         |
> +----------------------------------+-------------------------+-------------+--------------------------+
> | caa04d36db8ba0106e9643c81396197b | {"active": "true", ...} |  FALSE      |  2021-08-24 12:59:23.932 |
> | cea045be1b03e010eac562c4bd4bcbb2 | {"active": "true", ...} |  FALSE      |  2021-08-24 12:59:23.932 |
> | caa0c9bedb8be010f9f19c41ba961934 | {"active": "true", ...} |  FALSE      |  2021-08-24 12:59:23.932 |
> | caa0c9bedb8be010f9f19c41ba961969 | {"active": "true", ...} |  FALSE      |  2021-08-24 12:59:23.932 |
> | b9a0c53adb436410d6fa2b691396190a | {"active": "true", ...} |  FALSE      |  2021-08-24 12:59:23.932 |
> +----------------------------------+-------------------------+-------------+--------------------------+
> ```

## Access the flattened data

For each table that contains data, the connector creates two flattened views over the raw data.
The names of the views are the names of the table with the suffixes `__view` and
`__view_with_deleted`. For example, for the ServiceNow® table named `incident`, the connector creates
the following views:

* `dest_db.dest_schema.incident__view`
* `dest_db.dest_schema.incident__view_with_deleted`

The view with the `__view` suffix contains the records that are in the ServiceNow® table. The view with
the `__view_with_deleted` suffix includes these same records as well as the records that were deleted
from the ServiceNow® table.

Note the following:

* The names of the columns in these views are in uppercase. You cannot use lowercase names to access these
  columns.
* Columns with time and timestamps are always saved using the UTC timezone, regardless of the timezone set in the
  ServiceNow instance. As a result, depending on the ServiceNow instance configuration, their displayed values may
  differ from the values displayed in the ServiceNow instance. The difference relates only to displayed values, timestamps both
  in ServiceNow and Snowflake are referring to the same point in time.
* There are no views for empty tables. After data appears in the table in ServiceNow®, the view is created.
* Although the connector handles changes to the schema, the connector does not reload the data.

  As a result, in the case of schema changes, records from the old schema are not updated.

The following is an example of the output for a SELECT statement that retrieves the data from the
`dest_db.dest_schema.incident_view` view. In this example, the `incident` table in ServiceNow® has columns
named `ACTIVE`, `APPROVAL`, `CATEGORY`, and `ESCALATION`.

> ```sqlexample
> SELECT ACTIVE, APPROVAL, CATEGORY, ESCALATION
> FROM DEST_DB.DEST_SCHEMA.INCIDENT__VIEW LIMIT 5;
>
> +--------+----------------+------------------+------------+
> | ACTIVE | APPROVAL       | CATEGORY         | ESCALATION |
> +--------+----------------+------------------+------------+
> | TRUE   | not requested  | software         | 0          |
> | TRUE   | not requested  | Cloud Management | 0          |
> | TRUE   | not requested  | software         | 0          |
> | TRUE   | not requested  | network          | 0          |
> | TRUE   | not requested  | database         | 0          |
> +--------+----------------+------------------+------------+
> ```

## View the event logs for a table

The Snowflake Connector for ServiceNow® can track the changes made to records in ServiceNow®. This tracking
information is stored in tables called event logs.

For every ServiceNow® table enabled for synchronization, the connector creates an event log table within
Snowflake named `<destination_db>.<destination_schema>.<table_name>__event_log`.

Each event log table has the following columns:

| Column | Data Type | Description |
| --- | --- | --- |
| `sys_id` | VARCHAR | The value of the `sys_id` of the record in ServiceNow®. |
| `sys_updated_on` | VARCHAR | The date the record was last updated in ServiceNow®. If there is no `sys_updated_on` field in the ServiceNow® table, this column contains null values. Note that the displayed timestamp is provided in the UTC timezone with no offset, which may differ from the timezone of dates displayed in the ServiceNow instance. |
| `event_date` | TIMESTAMP_NTZ | The date the event was inserted in the event log. Note that the displayed timestamp is provided in the UTC timezone with no offset, which may differ from the timezone of the dates displayed in the ServiceNow instance. |
| `raw` | VARIANT | The current data of the record event. For DELETE events, this is the data of the record at the time of deletion. |
| `event_type` | VARCHAR | Specifies if the record was inserted, updated, or deleted from ServiceNow®. |

The event log reflects the history of data changes in the corresponding ServiceNow® table. For example, if a
new record is inserted into the `u_ip_port` table in ServiceNow®, a record with `event_type` set to
`INSERT` event type is added to the `dest_db.dest_schema.u_ip_port__event_log` table in Snowflake.

Similarly, if a record is updated or deleted in a table in ServiceNow®, a record with `event_type` set to
`UPDATE` or `DELETE` is added to the `dest_db.dest_schema.u_ip_port__event_log` table.

The tables in Snowflake that contain the raw data (`dest_db.dest_schema.table_name`) are derived
from the corresponding event log tables (`dest_db.dest_schema.table_name__event_log`). For example:

* If a record for an `INSERT` event is added to `table_name__event_log`, the connector adds a
  corresponding record to the `table_name` table.
* If an `UPDATE` event for the given `sys_id` is added to the event log table, the connector
  updates the corresponding record with the `sys_id` in the `table_name` table with new data.
* If a `DELETE` event occurs, the `is_deleted` flag of the corresponding record in
  `table_name` is set to `true`.

## Get the display value of a reference field

In ServiceNow® tables, some fields are [reference fields](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/field-administration/concept/c_ReferenceField.html), which contain references to records in other tables.

In the example below, the field `opened_by` in the `incident` table is a reference field that
contains a reference to the record with the `sys_id` `<sys_id>` in another
table (`sys_user`):

> ```sqljson
> {
>   "link": "https://myinstance.service-now.com/api/now/table/sys_user/<sys_id>",
>   "value": "<sys_id>"
> }
> ```

To show the reference fields in the table, call the `SHOW_REFERENCES_OF_TABLE` stored procedure with the following
argument:

> ```sqlsyntax
> CALL SHOW_REFERENCES_OF_TABLE('<table_name>');
> ```

Where:

`table_name`
:   Specifies the name of the table you want to show the reference fields for.

This stored procedure inspects the schema of the table and returns a JSON list of objects containing the following properties:

| Property | Description |
| --- | --- |
| `columnName` | Name of the reference field. |
| `referencedColumnName` | Name of the field that the reference points to. |
| `referencedTableName` | Name of the referenced table. |

### Enable data synchronization for referred tables

If a table contains references to other tables, you can enable data synchronization of the referred tables.
To synchronize data for referred tables, call the `ENABLE_REFERENCED_TABLES` stored procedure with the following argument:

> ```sqlsyntax
> CALL ENABLE_REFERENCED_TABLES('<table_name>');
> ```

Where:

`table_name`
:   Specifies the name of the table (with the table reference fields) for which you want to enable data synchronization.

### Create a view containing reference fields

If the table containing the reference fields and the tables referenced by the those fields have been processed, you can
create a view that replaces the references with display values.

To create this view, call the `CREATE_VIEW_WITH_DISPLAY_VALUES` stored procedure.

```sqlsyntax
CALL CREATE_VIEW_WITH_DISPLAY_VALUES('<table_name>');
```

Where:

`table_name`
:   Specifies the name of the table containing the table reference fields for which you want to create a view with display value.

> **Note:**
>
> Only [reference fields](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/field-administration/concept/c_ReferenceField.html) with the `sys_id` as [reference key](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/field-administration/task/t_DefineTheReferenceKey.html) are supported.

> **Important:**
>
> This procedure is only run manually, therefore each time the table schema is changed the view must be recreated manually
> to reflect the schema change.

After the view is created successfully, the stored procedure returns the name of the newly created view.
The view name is the table name with the `__view_with_references` suffix added.
For example, for a ServiceNow® table named `incident`, the stored procedure creates the view `incident__view_with_references`.
Reference fields are replaced with display values and a new metadata column is added for each reference field.

The display value column has the same name as the reference column being replaced and may be null when if the display value is null or
the reference is not resolved. The metadata column name is the name of the reference column with the `__metadata` suffix.
For example, for a reference column named `user`, the procedure creates a column named `user__metadata`.
The content of this column is a JSON object with a field named `reference_field` with the following properties:

| Property | Description |
| --- | --- |
| `key` | `sys_id` of the referred row. If the reference column or reference column field `value` is null, this property is also null. |
| `reference_table` | Name of the referenced table. If the reference is not resolved this property is null. |
| `link` | ServiceNow® link to the referred row. If the reference column or reference column field `link` is null, this property is also null. |
| `display_value` | Display value. If the reference is not resolved this property is null. |
| `resolved` | `true` if display value is resolved. `false` when the connector cannot resolve the reference. |
| `reason` | Reason the reference failed to resolve. For example `Display value is not ingested yet`. If the reference is resolved this property is not displayed. |

The following example shows how a pair of display value and metadata columns in a view created by the stored procedure
`CREATE_VIEW_WITH_DISPLAY_VALUES` looks like.
The example table `incident` has `opened_by` column which references (by `sys_id` as reference key) to the `sys_user` table.

The `incident__view_with_references` view created by the stored procedure resolves the reference, so the displayed values can be obtained with a simple `SELECT`.

```sqlexample
SELECT OPENED_BY, OPENED_BY__METADATA
  FROM DEST_DB.DEST_SCHEMA.INCIDENT__VIEW_WITH_REFERENCES;
```

This command displays information in the following format:

```output
+-----------+------------------------------------+
| OPENED_BY | OPENED_BY__METADATA                |
+-----------+------------------------------------+
| "JOHN"    | {                                  |
|           |   "reference_field": {             |
|           |     "display_value": "JOHN",       |
|           |     "key": "b177...",              |
|           |     "link": "https://...",         |
|           |     "reference_table": "sys_user", |
|           |     "resolved": true               |
|           |   }                                |
|           | }                                  |
+-----------+------------------------------------+
```

---
title: Accessing data ingested by Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-accessing-data.md
section: Connectors & Drivers
---

# Accessing data ingested by Snowflake Connector for Google Analytics Raw Data

This topic describes how to access raw data in Google Analytics from your Snowflake account.

For each property in BigQuery that is configured for synchronization, the Snowflake Connector for Google Analytics Raw Data creates:

* The `ANALYTICS_propertyId` table with the same name as the property name. This table contains the raw daily data. Each record in the table is stored in a separate row and the Google Analytics event data is saved into a single column of type VARIANT.
* The `ANALYTICS_propertyId__VIEW` view that maps the event data from the table above into separate columns.
* The `ANALYTICS_INTRADAY_propertyId` table with the same name as the property name. This table contains the raw intraday data.
* The `ANALYTICS_INTRADAY_propertyId__VIEW` view that maps the intraday event data from the table above into separate columns.
* The `USERS_propertyId` table with the same name as the property name. This table contains the raw users data.
* The `USERS_propertyId__VIEW` view that maps the users data from the table above into separate columns.
* The `PSEUDONYMOUS_USERS_propertyId` table with the same name as the property name. This table contains the raw pseudonymous users data.
* The `PSEUDONYMOUS_USERS_propertyId__VIEW` view that maps the users data from the table above into separate columns.

The temporary owner of the tables and views above is the Snowflake Connector for Google Analytics Raw Data. The ownership should be transferred during the Connector uninstallation, for details see [Uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data](gard-connector-uninstalling-and-reinstalling.md).

The following sections explain how to grant the privileges to access this data and how to access the data from these tables and views.

## Granting privileges for accessing the Google Analytics data in Snowflake

After the Snowflake Connector for Google Analytics Raw Data synchronizes the data with Snowflake, you can use `data_owner_role` to access the data or any other role if it meets both of the following conditions:

* Has USAGE privilege on the database and schema that contain the data ingested by Snowflake Connector for Google Analytics Raw Data.
* Is granted with the DATA_READER application role, which has SELECT privilege on tables or views within this schema.

For example, if you configured the Snowflake Connector for Google Analytics Raw Data to store the data in the `dest_db` database and
`dest_schema` schema, you can create the `google_analytics_raw_data_reader_role` role and grant the privileges
for accessing the data to that role.

The following example shows how to grant these privileges:

> ```sqlexample
> CREATE ROLE google_analytics_raw_data_reader_role;
> GRANT USAGE ON DATABASE dest_db TO ROLE google_analytics_raw_data_reader_role;
> GRANT USAGE ON SCHEMA dest_db.dest_schema TO ROLE google_analytics_raw_data_reader_role;
> GRANT APPLICATION ROLE SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_RAW_DATA.DATA_READER TO ROLE google_analytics_raw_data_reader_role;
> ```

## Accessing the raw data

For each BigQuery table that you synchronize, the Snowflake Connector for Google Analytics Raw Data creates a new table with
the same name in the Snowflake database and schema used to store the Snowflake Connector for Google Analytics Raw Data.

For example, if you configured the connector to store data in the `dest_db` database and
`dest_schema` schema and access data via role `my_role`,
and if you configured the connector to synchronize the `analytics_12345` table in
BigQuery, the connector creates the table named `dest_db.dest_schema.analytics_12345`.

This table contains raw data ingested from BigQuery. The table contains the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| `raw` | VARIANT | The data for the record in raw form. |
| `run_id` | VARIANT | The id of the asynchronous process that ingested the data. |
| `source_table_date` | DATE | The name of the daily table from which the connector ingested the data to the table. |
| `ingestion_complete` | BOOLEAN | True if the connector ingested all the data from the daily table, false if some of the data is still being downloaded. |

The following is an example of the output for a SELECT statement that retrieves the data for the
`dest_db.dest_schema.analytics_12345` table:

> ```sqlexample
> SELECT * FROM DEST_DB.DEST_SCHEMA.ANALYTICS_12345 LIMIT 5;
>
> +---------------------------+--------------------------------------+--------------------+--------------------+
> | RAW                       | RUN_ID                               |  SOURCE_TABLE_DATE | INGESTION_COMPLETE |
> +---------------------------+--------------------------------------+--------------------+--------------------+
> | { "app_info": null, ... } | f8edbf0e-1d0d-4ff5-9e5c-0e114b1fc44a |  2023-06-13        |  TRUE              |
> | { "app_info": null, ... } | f8edbf0e-1d0d-4ff5-9e5c-0e114b1fc44a |  2023-06-13        |  TRUE              |
> | { "app_info": null, ... } | f8edbf0e-1d0d-4ff5-9e5c-0e114b1fc44a |  2023-06-13        |  TRUE              |
> | { "app_info": null, ... } | d949ab70-6a7e-47a5-b876-d7e33d701b0d |  2023-06-14        |  FALSE             |
> | { "app_info": null, ... } | d949ab70-6a7e-47a5-b876-d7e33d701b0d |  2023-06-14        |  FALSE             |
> +---------------------------+--------------------------------------+--------------------+--------------------+
> ```

## Accessing the flattened data

For each table that contains data, the connector asynchronously creates a flattened view of the raw data, and refreshes it daily.
The name of the view is the name of the table with the suffix `__view`.
For example, for the table named `analytics_12345`, the connector creates
the `dest_db.dest_schema.analytics_12345__view` view.

> **Note:**
>
> * There are no views for rows where `ingestion_complete` is `FALSE` .
> * If the BigQuery column type changes, for the previously existing view, the view column is changed to type `VARIANT`.

The following is an example of the output for a SELECT statement that retrieves the data from the
`dest_db.dest_schema.analytics_12345__view` view. In this example, the `analytics_12345` table has `VARIANT` column `raw`
with values named `EVENT_DATE`, `EVENT_TIMESTAMP`, `EVENT_NAME`, and `EVENT_PREVIOUS_TIMESTAMP`.

> ```sqlexample
> USE ROLE MY_ROLE;
> SELECT EVENT_DATE, EVENT_TIMESTAMP, EVENT_NAME, EVENT_PREVIOUS_TIMESTAMP
> FROM DEST_DB.DEST_SCHEMA.ANALYTICS_12345__VIEW LIMIT 5;
>
> +------------+--------------------------+-------------------+--------------------------+
> | EVENT_DATE | EVENT_TIMESTAMP          | EVENT_NAME        | EVENT_PREVIOUS_TIMESTAMP |
> +------------+--------------------------+-------------------+--------------------------+
> | 2023-06-13 | 2023-06-13 18:27:20.775  | "page_view"       | null                     |
> | 2023-06-13 | 2023-06-13 18:27:25.960  | "user_engagement" | null                     |
> | 2023-06-13 | 2023-06-13 19:26:49.130  | "scroll"          | null                     |
> | 2023-06-13 | 2023-06-13 18:27:51.135  | "page_view"       | null                     |
> | 2023-06-13 | 2023-06-13 18:27:56.343  | "user_engagement" | null                     |
> +------------+--------------------------+-------------------+--------------------------+
> ```

---
title: Accessing fetched Google Analytics data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-accessing-data.md
section: Connectors & Drivers
---

# Accessing fetched Google Analytics data

## Overview

For each report that is configured for synchronization, the connector creates the following table and view
in the destination database and destination schema:

* `report_name__RAW`: A table that contains data in a raw form, where each row contains a Google Analytics
  record in a single VARIANT column.
* `report_name`: A view that contains flattened data, where each row contains a Google Analytics
  dimension or metric in a separate column.

## Accessing the raw data

For each synchronized Google Analytics report, the connector creates a new table
with the `report_name__RAW` name in the Snowflake database and schema used to store the Google Analytics data.

For example, if you configure the connector to store the Google Analytics data in the `dest_db` database
and `dest_schema` schema and to synchronize the `my_report` report, the connector
creates the table `dest_db.dest_schema.my_report__raw`.

This table contains raw data ingested from Google Analytics in the following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `DATE` | DATE | The value of the `date` dimension for a record from Google Analytics |
| `RAW` | VARIANT | The data for a record from Google Analytics in raw form |
| `LAST_UPDATE_DATE` | TIMESTAMP_NTZ | The last time a record was updated in Snowflake |

The following example SELECT statement retrieves data from the `dest_db.dest_schema.my_report__raw` table:

> ```sqlexample
> SELECT * FROM DEST_DB.DEST_SCHEMA.MY_REPORT__RAW;
> ```

## Accessing the flattened data

In addition, for each table that contains data, the connector creates a flattened view of the raw data.
The name of the view is the name of the table without the `__RAW` suffix. For example, for the table
named `dest_db.dest_schema.my_report__raw`, the connector creates the view named `dest_db.dest_schema.my_report`.

The view contains flattened records from Google Analytics, where each dimension and metric is stored in a separate column.

The following is an example of a SELECT statement that retrieves data from the `dest_db.dest_schema.my_report` view:

> ```sqlexample
> SELECT * FROM DEST_DB.DEST_SCHEMA.MY_REPORT;
> ```

> **Note:**
>
> The flattened view is created only after an entire dataset is fetched from the GA API.
> Larger reports might take more time.

---
title: Application roles in the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/roles.md
section: Connectors & Drivers
---

# Application roles in the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The following sections describe application roles available in the connector application:

> * ADMIN
> * AGENT
> * VIEWER
> * DATA_READER

All the application roles are automatically assigned to the account level role responsible for installing the application on the account.
They can be then reassigned for easier control over the connector application access.
For more information, see [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md).

## ADMIN application role

The `ADMIN` application role can be used to view connector configuration and state.
It also allows to execute procedures contained in the application.

## AGENT application role

The `AGENT` application role is used by the agent in order to be able to perform replication process. Should not be used manually.

## VIEWER application role

The `VIEWER` application role provides access to view basic configuration of the connector.

## DATA_READER application role

The `DATA_READER` application role can be used to give read privileges on replicated data without access to the connector application itself.

In order to view replicated data, a user needs to have following privileges:

> * `USAGE` grant on destination database
> * `USAGE` grant on destination schema
> * `SELECT` grant on destination table

The connector grants `USAGE` / `SELECT` privileges to this role on all destination databases, schemas and tables created by the application.

> **Attention:**
>
> Be aware, that the `DATA_READER` application role is provided with privileges only on objects created by the application.
> If the destination database or destination schema already exists and is not owned by the connector application,
> the connector won’t be able to grant proper privileges to the `DATA_READER` role on these objects.
> In such case, account level roles with the `DATA_READER` application role need to be manually supplemented with the `USAGE` grant on these objects.

---
title: Application roles in the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/roles.md
section: Connectors & Drivers
---

# Application roles in the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The following sections describe application roles available in the connector application:

> * ADMIN
> * AGENT
> * VIEWER
> * DATA_READER

All the application roles are automatically assigned to the account level role responsible for installing the application on the account.
They can be then reassigned for easier control over the connector application access.
For more information, see [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md).

## ADMIN application role

The `ADMIN` application role can be used to view connector configuration and state.
It also allows to execute procedures contained in the application.

## AGENT application role

The `AGENT` application role is used by the agent in order to be able to perform replication process. Should not be used manually.

## VIEWER application role

The `VIEWER` application role provides access to view basic configuration of the connector.

## DATA_READER application role

The `DATA_READER` application role can be used to give read privileges on replicated data without access to the connector application itself.

In order to view replicated data, a user needs to have following privileges:

> * `USAGE` grant on destination database
> * `USAGE` grant on destination schema
> * `SELECT` grant on destination table

The connector grants `USAGE` / `SELECT` privileges to this role on all destination databases, schemas and tables created by the application.

> **Attention:**
>
> Be aware, that the `DATA_READER` application role is provided with privileges only on objects created by the application.
> If the destination database or destination schema already exists and is not owned by the connector application,
> the connector won’t be able to grant proper privileges to the `DATA_READER` role on these objects.
> In such case, account level roles with the `DATA_READER` application role need to be manually supplemented with the `USAGE` grant on these objects.

---
title: Configure disaster recovery
source: https://docs.snowflake.com/en/connectors/servicenow/disaster-recovery.md
section: Connectors & Drivers
---

# Configure disaster recovery

The Snowflake Connector for ServiceNow® can be configured to use a second instance to support disaster recovery.

## About Snowflake Connector for ServiceNow® disaster recovery support

The Snowflake Connector for ServiceNow® stores metadata about configured tables and its own configuration within the application instance.
When the application is dropped or becomes corrupted, this internal state is lost.
To prevent this, the connector exports the metadata to the destination database alongside the ingested data during specific events, such as:

* Scheduling a new ingestion
* Finalizing the reload
* Cancelling the reload

The export process creates several tables in the destination schema to store the connector’s internal state.
These tables do not contain the ingested data but are essential for recovering the connector’s state after the application
is dropped or becomes corrupted. When replicated, these tables can also be used to recover the state of the connector on a different Snowflake account.
The following tables are created by the export process:

* `APP_CONFIG_SFSDKEXPORT_V1`
* `APP_STATE_SFSDKEXPORT_V1`
* `CONNECTOR_ERRORS_LOG_SFSDKEXPORT_V1`
* `INGESTION_PROCESS_SFSDKEXPORT_V1`
* `INGESTION_RUN_SFSDKEXPORT_V1`
* `NOTIFICATIONS_STATE_SFSDKEXPORT_V1`
* `RESOURCE_INGESTION_DEFINITION_SFSDKEXPORT_V1`
* `__CONNECTOR_STATE_EXPORT`

## Importing existing data and reports to a new instance of the connector

If the Snowflake Connector for ServiceNow® has been uninstalled or corrupted, it is possible to resume ingestion of previously configured tables, provided that the destination
database was not dropped. The metadata for tables configured in the connector is saved in the destination database alongside the ingested data.

To continue ingesting data after installing a new connector instance, perform the following:

1. Configure the connector

   Configure the connector by following the instructions in [Install and configure the connector with Snowsight](installing-snowsight.md) or [Install and configure the connector with SQL commands](installing-sql.md).
   When choosing the destination database and schema, select the existing schema that contains data ingested by the previous instance of the connector.
2. Grant required privileges to the connector

   > **Note:**
   >
   > This step is only required if you installed and configured the connector using SQL commands.
   > If you installed the connector using Snowsight you can skip this step.

   Execute the following command to ensure that the newly installed connector becomes the owner of all objects in the existing schema:

   ```sqlexample
   system$grant_ownership_to_application('your_application_instance', true, '<database>', '<schema>');
   ```

   Where `<database>` and `<schema>` are the names of the existing database and schema, respectively.
3. Pause the connector

   ```sqlexample
   call pause_connector();
   ```
4. Import the existing data and table configuration

   Import the existing data and table configuration by executing the following command from the context of installed application:

   ```sqlexample
   call import_state(force => true);
   ```

   The **force** parameter is set to **true** to ensure that any changes that might have been made to the freshly installed connector
   are overwritten with the table configuration and internal data from the old installation.
5. Resume the Connector

   ```sqlexample
   call resume_connector();
   ```

At this point, the new instance of the Snowflake Connector for ServiceNow® connector should resume ingestion of the existing tables.

## Replicating the destination database and connector state to another snowflake deployment

This section describes the steps to replicate the content of the destination database.
The destination database contains the ingested data and the metadata for the tables configured in the connector.
If the connector or the data downloaded by the connector is critical for your business, consider setting up a secondary Snowflake account in a different region
and replicating the destination database to the secondary account.

### Terms and definitions

The following terms and definitions are used during the disaster recover configuration process.

Destination Database
:   The database configured as the target for the data ingested by the connector. This is also the database where the connector’s internal state is exported to.

Destination Schema
:   The schema configured as the target for the data ingested by the connector.

Internal State
:   The internal data and configuration of the connector, for example table configurations, ingestion state, and error logs.

Connector instance
:   The Snowflake Connector for ServiceNow® connector instance installed on the Snowflake account.

ACCOUNT_PRIM
:   Example name of primary account

ACCOUNT_SEC
:   Example name of secondary (replica) account

APP_PRIM
:   Example Snowflake Connector for ServiceNow® connector instance name installed on the primary account

APP_SEC
:   Example Snowflake Connector for ServiceNow® connector instance name installed on the secondary account

DST_DB.DST_SCHEMA
:   Example destination schema name for the connector instance (where data is ingested and the connector’s internal state is saved)

DST_DB
:   Example destination database name configured for the connector

MYORG
:   Example name of your organization (both accounts must be in the same organization)

### Introduction

When installed on your account, the Snowflake Connector for ServiceNow® connector (connector instance) appears as a normal database that contain data, procedures etc.
However, it cannot be replicated to a secondary account in the same way as a normal database.
Currently, there is no native mechanism to replicate the connector instance with its internal state to a replica account.
Specifically, the installed application cannot be added to a replication group.

Instead of replicating the connector instance directly, the connector exports the metadata of configured tables to the destination schema
configured during the connector setup process. The state is saved there and can be replicated alongside the ingested data.

For example, if you configured the connector to ingest data into the destination schema DST_DB.DST_SCHEMA,
the connector automatically saves its internal state to this schema.
You can then replicate both the ingested data and the internal state using the following command:

```sqlexample
create replication group connector_dest_database_group
  object_types = databases
  allowed_databases = dst_db
  allowed_accounts = ...;
```

### Setting up replication of ingested data and configured reports

> **Caution:**
>
> Always test your disaster recovery procedures to verify that data and state replication are functioning as expected.
>
> Before proceeding, familiarize yourself with [Snowflake Replication](../../user-guide/account-replication-intro.md).

The following sections contain instructions applicable to all versions of Snowflake.

1. Installing the connector on the primary account

   Install and configure Snowflake Connector for ServiceNow® on the primary account. For detailed instructions, see [Install and configure the connector with Snowsight](installing-snowsight.md) or [Install and configure the connector with SQL commands](installing-sql.md).

   On the primary account, create a replication group and add DST_DB as an allowed database:

   ```sqlexample
   -- on primary account
   create replication group connector_rep_group_prim
     object_types = databases
     allowed_databases = dst_db
     allowed_accounts = myorg.account_sec
     replication_schedule = '10 minute';
   ```
2. Setting up replication on the secondary account

   To replicate DST_DB from the primary account to the secondary account, create a new replication group on the secondary account:

   ```sqlexample
   -- on secondary account
   create replication group connector_rep_group_sec
     as replica of myorg.account_prim.connector_rep_group_prim;

   alter replication group connector_rep_group_sec refresh;
   ```

   At this point, a read-only DST_DB database should be created on the secondary account, and data from the primary account
   will be replicated according to the configured schedule.
3. Install the connector on the secondary account

   Install and configure Snowflake Connector for ServiceNow® on the secondary account in the same way as on the primary account.
   Point the instance to ingest data into the replicated database and schema.
   While replication is ongoing (until the replication group on the secondary account is dropped),
   the database is in read-only mode. The connector can be configured to use a read-only database as the ingestion target;
   however, it cannot ingest data until the database transitions to read-write mode.

   After configuring the connector on the secondary account, pause the connector by executing:

   ```sqlexample
   -- on secondary account
   call pause_connector();
   ```

   At this point, the connector is installed and ready to take over if the primary account fails.

### Recovery procedure

When the primary deployment becomes unavailable, configure the connector instance on the secondary account to continue ingestion.

> **Important:**
>
> All steps must be executed on the secondary account.

1. Drop the replication group

   Drop the replication group on the secondary account to transition the replicated database to read-write mode:

   ```sqlexample
   drop replication group connector_rep_group_sec;
   ```
2. Grant ownership of existing database objects to the connector

   Grant ownership of all objects in the replicated schema to the connector by executing:

   ```sqlexample
   call system$grant_ownership_to_application('app_sec', true, 'dst_db', 'dst_schema');
   ```
3. Import the state

   Initialize the connector with the state replicated from the primary account:

   ```sqlexample
   call import_state(true);
   ```
4. Resume the connector

   Resume the connector by executing:

   ```sqlexample
   call resume_connector();
   ```

   At this point, the connector on the secondary account should resume data ingestion, continuing from where the connector on primary account left off.

   > **Note:**
   >
   > Ensure that both the primary and secondary accounts are part of the same organization. The replication schedule can be adjusted based on your requirements.

---
title: Configure OAuth authentication for Google Cloud
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-create-client-id.md
section: Connectors & Drivers
---

# Configure OAuth authentication for Google Cloud

## About customer-provided OAuth client authentication

An application that authenticates to Google using OAuth 2.0 must provide two objects in Google Cloud:

* An *OAuth consent screen* that tells users who is requesting access to their data and what kind of data users are allowing your application to access.
* An *OAuth Client ID* that is used to authenticate an application to Google. This is necessary when you want to access resources owned by your end user.

You must provide your own OAuth consent screen and client ID to authenticate.

## Prerequisites

To provide the OAuth consent screen and OAuth client ID, you must first create a Google Cloud project. For information about creating Google Cloud projects, see the Google Cloud documentation.

> **Note:**
>
> If possible, create an OAuth consent screen in a Google Cloud project that belongs to an organization. Ensure that the connector users are members of the same organization.
>
> If your project does not belong to an organization, you must renew authentication every seven days.

## Configure the OAuth consent screen

1. To open the OAuth consent screen creator, in your Google Cloud project, select APIs & Services » OAuth consent screen.
2. Select one of the following user types:

   * Internal: Select this user type only if the Google Cloud project belongs to an organization and the connector users are members of the same organization.
   * External: If you select this user type, you must renew authentication weekly.
3. Select Create.
4. Provide the following information:

   > * App name: Snowflake Connector for Google Analytics Aggregate Data
   > * User support email: your email address
   > * Developer contact information: your email address
5. Select Save and continue.
6. Select Add or remove scopes » Manually add scopes.
7. Copy the following address:

   > ```none
   > https://www.googleapis.com/auth/analytics.readonly
   > ```
8. Paste the address in the dialog, and then select Add to table.
9. Select Update.
10. If you selected the External user type, follow these steps:

    > 1. Select Test users » Add users.
    > 2. Enter the email addresses of users who are allowed to use the connector.
    > 3. Select Add.
11. To finish the configuration, select Save and continue » Back to dashboard.

## Configure the OAuth client ID

In this procedure, you acquire a redirect URL from Snowsight and paste it into your Google Cloud project.

1. In Snowsight, start the Snowflake Connector for Google Analytics Aggregate Data configuration wizard.
2. In the third step of the connector configuration, Authenticate Google Cloud Platform, copy the value from the Redirect URL section.
3. In your Google Cloud project, to open the OAuth consent screen creator, select APIs & Services » Credentials.
4. Select Create credentials » OAuth client ID.
5. In the Application type dropdown list, select Web application.
6. In the Name box, enter the following name: Snowflake Connector for Google Analytics Aggregate Data ID
7. Select Authorized redirect URIs » Add URI.
8. Select Create.
9. Copy the Your Client ID and Your Client Secret values.
10. Return to the Snowflake Connector for Google Analytics Aggregate Data interface, and paste the values into the corresponding boxes.
11. Select Sign in.

## Prevent session expiration for the OAuth consent screen

1. In the Google Admin Console menu, select Security » Access and data control » Google Cloud session control.
2. In the Reauthentication policy section, select the Exempt Trusted apps checkbox.
3. In the Google Admin Console menu, select Security » API Controls » App Access Control.
4. In the Configured apps section, select Add app » OAuth App Name Or Client ID.
5. Copy the client ID created in Configure the OAuth client ID, and paste it in the box.
6. Select Search.
7. Select the Snowflake Connector for Google Analytics Aggregate Data application name.
8. Select the created OAuth Client ID checkbox, and then click Select.
9. In the Scope section, select All users.
10. Select Continue.
11. In the Access to Google Data section, select Trusted.
12. Select Continue.
13. On the Review screen, select Finish.

---
title: Configure service account authentication for Google Cloud
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-create-service-account-key.md
section: Connectors & Drivers
---

# Configure service account authentication for Google Cloud

## Prerequisites

To provide the service account file, you must first create a Google Cloud project. For information about creating Google Cloud projects, see the Google Cloud documentation.

## Create a service account key

1. To open the service account creator, in your Google Cloud project, select APIs & Services » Credentials.
2. Select Create credentials » service account.
3. In the Service account details enter any service account name you choose.
4. To create the service account, select Done.
5. To manage the new service account, in the Credentials section, select the service account name.
6. Select Keys » Add key » Create a new key.
7. To save the service account key file, in the key type selection view, select the recommended JSON type, and then select Create.

   This file is needed during the connector configuration.

## Formatting the service account key

The service account key you downloaded in the previous procedure can be used to automatically complete the form when configuring the connector, using the drag-and-drop functionality in the configuration wizard.

If the private key is entered manually, it must be properly formatted first.

Example service account key in JSON format:

> ```sqljson
> {
>   "type": "service_account",
>   "project_id": "your-project-id0809",
>   "private_key_id": "7a7df777f88...f7f7s8d7f7s",
>   "private_key": "-----BEGIN PRIVATE KEY-----\nMIIEvgIBADC9ON1OA4JjRidj\n/7O5Ioq+L2112946/CsXsfiHFwIQQedWt\nQ75sl7M5lHTsVQtIdtBcGJXvk5/7CHOmtkn6w\n2dRoyCWv2bknmogZIy3fssMolwVaZ15cmsuB0\nwTI81dojSVwrzPshiYY9lfugdVZ2uiFcw4haWo8o\nUhg2tHOWyveoFN2RF03kUfdnEfhAAmXKZai\nWkd49r+jAgMBAAECggEAIP/5TIE9LJ4QAZcXG2sEQl7GldrQho0nuAOVkEtzQsuP\ndmgbFYU39qinuLc83GF/Ghr3PdswzQTKeKCvZZXhQ4FpYk9VhyQr6iTKv6bBD8du\nMrF2LKknax1eCFG81o0A+zOvo\npMrJl/9EOOVJKnifhH7kdS/JRqHXEzQUGkpOWSs6ep7MGN4+vLv+GlZqIIgEGwmW\nJN/72+5bLiaL9T7If1+/T/sa\n-----END PRIVATE KEY-----\n",
>   "client_email": "testclientemail.gserviceaccount.com",
>   "client_id": "2345345634546456",
>   "auth_uri": "https://accounts.google.com/o/oauth2/auth",
>   "token_uri": "https://oauth2.googleapis.com/token",
>   "auth_provider_x509_cert_url": "https://www.googleapis.com/oauth2/v1/certs",
>   "client_x509_cert_url": "https://www.googleapis.com/robot/v1/metadata/x509/you-project.....",
>   "universe_domain": "googleapis.com"
> }
> ```

For the private key, only the text between —–BEGIN PRIVATE KEY—– and —–END PRIVATE KEY—– is relevant.

To transform a private key to the format acceptable by the connector, follow this procedure:

1. In a text editor, open the key downloaded in the previous procedure.
2. Copy the content of the private_key field.
3. Delete both the —–BEGIN PRIVATE KEY—– and —–END PRIVATE KEY—– markers.
4. Delete all \n (newline) characters from the file. A key usually contains at least 10 occurrences.
5. Save the file for later use.

After edits, your key should resemble this code:

> ```none
> MIIEvgIBADC9ON1OA4JjRidj/7O5Ioq+L2112946/CsXsfiHFwIQQedWtQ75sl7M5lHTsVQtIdtBcGJXvk5/7CHOmtkn6w2dRoyCWv2bknmogZIy3fssMolwVaZ15cmsuB0wTI81dojSVwrzPshiYY9lfugdVZ2uiFcw4haWo8oUhg2tHOWyveoFN2RF03kUfdnEfhAAmXKZaiWkd49r+jAgMBAAECggEAIP/5TIE9LJ4QAZcXG2sEQl7GldrQho0nuAOVkEtzQsuPdmgbFYU39qinuLc83GF/Ghr3PdswzQTKeKCvZZXhQ4FpYk9VhyQr6iTKv6bBD8duMrF2LKknax1eCFG81o0A+zOvopMrJl/9EOOVJKnifhH7kdS/JRqHXEzQUGkpOWSs6ep7MGN4+vLv+GlZqIIgEGwmWJN/72+5bLiaL9T7If1+/T/sa
> ```

## Grant the service account access to Google Analytics

The service account requires access to all Google Analytics properties that the connector will use.

1. In the Google Analytics console, choose a property that will be used by the connector.
2. Select the Property access management tab.
3. Add the service account email as a Viewer.
4. Repeat this process for all properties that will be used in the connector.

---
title: Configure the Snowflake Connector for Google Analytics Aggregate Data using SQL
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-configuring-sql.md
section: Connectors & Drivers
---

# Configure the Snowflake Connector for Google Analytics Aggregate Data using SQL

This topic provides information about using SQL to configure the Snowflake Connector for Google Analytics Aggregate Data.

> **Note:**
>
> The Snowflake Connector for Google Analytics Aggregate Data is typically configured using Snowsight. SQL configuration is
> considered an advanced configuration method and should only be used by
> those familiar with the underlying details of connector configuration.
>
> Installation using SQL statements is not supported and must be done via Snowsight.

To configure the connector using SQL statements, complete these tasks:

* Prepare a warehouse, data owner role, and destination database.
* Configure the connector.
* Create Snowflake objects required for connecting to GA4.
* Set the connection configuration.
* Finalize the connector configuration.

> **Note:**
>
> In order to configure the connector, you must use stored procedures that are defined
> in the PUBLIC schema of the connector’s installation database.
>
> Before calling these stored procedures, select that database for the session.
>
> For example, if that database is named `snowflake_connector_for_google_analytics_aggregate_data`, run the following command:
>
> ```sqlexample
> USE DATABASE snowflake_connector_for_google_analytics_aggregate_data;
> ```

## Prepare a warehouse, data owner role, and destination database

1. Grant usage on a specified warehouse and task execution permissions to the connector application:

   > ```sqlexample
   > USE ROLE accountadmin;
   > CREATE WAREHOUSE google_analytics_aggregate_data_warehouse WITH WAREHOUSE_SIZE = 'X-Small';
   > GRANT USAGE ON WAREHOUSE google_analytics_aggregate_data_warehouse TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT EXECUTE TASK, EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > ```
   >
   > The connector needs these grants to perform ingestion.
2. Create a destination database and schema:

   > ```sqlexample
   > CREATE DATABASE google_analytics_aggregate_data_dest_db;
   > CREATE SCHEMA google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema;
   > ```
   >
   > Ingested data is stored in the destination schema. You can also use an existing database and schema.
3. Add required grants on the destination database to the application:

   > ```sqlexample
   > USE ROLE accountadmin;
   > GRANT USAGE ON DATABASE google_analytics_aggregate_data_dest_db TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT USAGE ON SCHEMA google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT CREATE TABLE ON SCHEMA google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT CREATE VIEW ON SCHEMA google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > ```
   >
   > The application needs the grants to create tables for reports data and to create the reports views.
4. Create the data owner role and add required grants:

   > ```sqlexample
   > USE ROLE accountadmin;
   > CREATE OR REPLACE ROLE google_analytics_aggregate_data_resources_provider;
   > GRANT USAGE ON DATABASE google_analytics_aggregate_data_dest_db TO ROLE google_analytics_aggregate_data_resources_provider;
   > GRANT USAGE ON SCHEMA google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema TO ROLE google_analytics_aggregate_data_resources_provider;
   > GRANT USAGE ON WAREHOUSE google_analytics_aggregate_data_warehouse TO ROLE google_analytics_aggregate_data_resources_provider;
   > GRANT APPLICATION ROLE snowflake_connector_for_google_analytics_aggregate_data.data_reader TO ROLE google_analytics_aggregate_data_resources_provider;
   > ```

## Configure the connector

* Call the `CONFIGURE_CONNECTOR` procedure, passing the name of the warehouse, destination database and schema, and data owner role:

  > ```sqlexample
  > USE ROLE accountadmin;
  > CALL CONFIGURE_CONNECTOR(
  >    PARSE_JSON('{"warehouse": "GOOGLE_ANALYTICS_AGGREGATE_DATA_WAREHOUSE", "destination_database": "GOOGLE_ANALYTICS_AGGREGATE_DATA_DEST_DB", "destination_schema": "GOOGLE_ANALYTICS_AGGREGATE_DATA_DEST_SCHEMA", "data_owner_role": "GOOGLE_ANALYTICS_AGGREGATE_DATA_RESOURCES_PROVIDER"}')
  > );
  > ```
  >
  > > **Note:**
  > >
  > > Values passed to CONFIGURE_CONNECTOR are case-sensitive and should be passed as seen in the UI (for example, as seen in the SHOW command).

## Create Snowflake objects required for connecting to GA4

1. To create a security integration for your connection, follow one of these options:

   > > **Note:**
   > >
   > > Using a service account is a recommended option.
   >
   > If you are using a service account, then you need key file. For details on how to create one see [Configure service account authentication for Google Cloud](gaad-connector-create-service-account-key.md).
   > Create a security integration using the details from the key file:
   >
   > ```sqlexample
   > CREATE SECURITY INTEGRATION
   > snowflake_connector_for_google_analytics_aggregate_data_security_integration
   > type = api_authentication
   > auth_type = oauth2
   > oauth_client_id = '000000000000000000000'
   > oauth_token_endpoint = 'https://oauth2.googleapis.com/token'
   > enabled = true
   > oauth_allowed_scopes = ('https://www.googleapis.com/auth/analytics.readonly')
   > oauth_assertion_issuer = '<value of client_email from the JSON key file>'
   > oauth_grant='JWT_BEARER'
   > oauth_client_secret = '<value of private_key from the JSON key file with no delimiters or newlines>';
   > ```
   >
   > If you are using OAuth2, you need to configure a consent screen and client credentials. For details on how to do that, see [Configure OAuth authentication for Google Cloud](gaad-connector-create-client-id.md).
   > Then you need to create security integration:
   >
   > ```sqlexample
   > CREATE OR REPLACE SECURITY INTEGRATION
   > snowflake_connector_for_google_analytics_aggregate_data_security_integration
   > type = api_authentication
   > auth_type = oauth2
   > oauth_client_id = '<value of gcp oauth client_id>'
   > oauth_client_secret = '<value of gcp oauth secret>'
   > oauth_token_endpoint = 'https://oauth2.googleapis.com/token'
   > OAUTH_AUTHORIZATION_ENDPOINT = 'https://accounts.google.com/o/oauth2/auth?access_type=offline&prompt=consent'
   > OAUTH_ALLOWED_SCOPES = ('https://www.googleapis.com/auth/analytics.readonly')
   > enabled = true;
   > ```
2. Create a secret using the security integration:

   > ```sqlexample
   > USE ROLE accountadmin;
   >
   > CREATE DATABASE connectors_secret;
   > CREATE SCHEMA connectors_secret.snowflake_connector_for_google_analytics_aggregate_data;
   >
   > USE SCHEMA connectors_secret.snowflake_connector_for_google_analytics_aggregate_data;
   >
   > CREATE OR REPLACE SECRET secret
   > type = oauth2
   > api_authentication = snowflake_connector_for_google_analytics_aggregate_data_security_integration;
   > ```
   >
   > > **Note:**
   > >
   > > The secret will securely store the access token generated using the credentials from the security integration.
3. Provide secret-related grants to the connector application:

   > ```sqlexample
   > USE ROLE accountadmin;
   >
   > GRANT USAGE ON DATABASE connectors_secret TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT USAGE ON SCHEMA connectors_secret.snowflake_connector_for_google_analytics_aggregate_data TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > GRANT READ ON SECRET connectors_secret.snowflake_connector_for_google_analytics_aggregate_data.secret TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > ```
4. If you are using oauth2 authorization, generate a token. Use the link generated by the following code:

   > ```sqlexample
   > SELECT SYSTEM$START_OAUTH_FLOW('connectors_secret.snowflake_connector_for_google_analytics_aggregate_data.secret');
   > ```
   >
   > You will be redirected to the oauth2 screen. After you accept the required grants, you will be redirected to the endpoint, which completes the oauth2 flow.
5. Configure external access:

   > ```sqlexample
   > USE ROLE accountadmin;
   >
   > USE SCHEMA connectors_secret.snowflake_connector_for_google_analytics_aggregate_data;
   >
   > CREATE NETWORK RULE network_rule
   > mode = EGRESS
   > type = HOST_PORT
   > value_list = (
   >     'analyticsadmin.googleapis.com:443',
   >     'analyticsdata.googleapis.com:443'
   > );
   >
   > CREATE EXTERNAL ACCESS INTEGRATION google_analytics_aggregate_data_external_access_integration
   > allowed_network_rules = (connectors_secret.snowflake_connector_for_google_analytics_aggregate_data.network_rule)
   > allowed_authentication_secrets = ('CONNECTORS_SECRET.OAUTH.SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA')
   > enabled = true;
   >
   > GRANT USAGE ON INTEGRATION snowflake_connector_for_google_analytics_aggregate_data_external_access_integration TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
   > ```
   >
   > > **Note:**
   > >
   > > The connector uses the external access integration to communicate with Google Analytics APIs. The network rule controls the list of allowed hosts.

## Set the connection configuration

* Call the `SET_CONNECTION_CONFIGURATION` procedure, passing the external access integration, the full path to the secret, and the security integration:

  > ```sqlexample
  > USE ROLE accountadmin;
  > CALL SET_CONNECTION_CONFIGURATION(
  >     PARSE_JSON('{"external_access_integration": "SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA_EXTERNAL_ACCESS_INTEGRATION", "secret": "CONNECTORS_SECRET.SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA.SECRET", "security_integration": "SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA_SECURITY_INTEGRATION"}')
  > );
  > ```
  >
  > > **Note:**
  > >
  > > Values passed to SET_CONNECTION_CONFIGURATION should be unqualified, uppercase identifiers.

## Finalize the connector configuration

* Call the `FINALIZE_CONNECTOR_CONFIGURATION` procedure:

  > ```sqlexample
  > USE ROLE accountadmin;
  > CALL FINALIZE_CONNECTOR_CONFIGURATION(
  >      PARSE_JSON('{}')
  > );
  > ```

After the process is completed successfully, ingestion configuration can begin. For more information, see [Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance](gaad-connector-setting-up-data.md).

---
title: Configuring BigQuery Link for Google Analytics 4 property
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-create-link.md
section: Connectors & Drivers
---

# Configuring BigQuery Link for Google Analytics 4 property

The topic provides information on how to configure the BigQuery link for Google Analytics 4 (GA4) property.

> **Note:**
>
> The Google account you use to access Google Analytics must have access to a Google Cloud Platform (GCP) project where the GA4 raw data can be extracted to. To learn how to create a GCP project, refer to the GCP documentation.

To set up the GA4 raw data extraction, do the following:

1. Sign in to Google Analytics.
2. From the dropdown list in the top navigation bar, select a GA4 property.
3. Enter the Admin panel.
4. Under the Product links column, select the BigQuery Links option.
5. Select Link » Choose a BigQuery project. From the available list, select the GCP project where you want to extract the GA4 raw data to.
6. Select Daily, Fresh Daily, Streaming export type.
7. Select Save.

To learn more about how to set up the BigQuery link for GA4 property, see [Google Support](https://support.google.com/analytics/topic/9359001).

---
title: Configuring OAuth authentication for Google Cloud Platform (GCP)
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-create-client-id.md
section: Connectors & Drivers
---

# Configuring OAuth authentication for Google Cloud Platform (GCP)

## About customer-provided OAuth client authentication

An application that authenticates to Google using OAuth 2.0 must provide two objects in GCP:

* OAuth consent screen that tells users who is requesting access to their data and what kind of data users are allowing your application to access.
* OAuth Client ID used to authenticate an application to Google. It is necessary when you want to access resources owned by your end user.

You must provide your own OAuth consent screen and client ID to authenticate. In a future release, the consent screen will be provided.

## Prerequisites

To provide the OAuth consent screen and OAuth client ID, you must create a Google Cloud Platform (GCP) project first. Refer to the GCP documentation to learn how to create a GCP project.

> **Note:**
>
> If possible, create an OAuth consent screen in a GCP project that belongs to an organization. Make sure that the connector users are members of the same organization.
>
> If your project does not belong to an organization, you must renew authentication every seven days.

## Configuring the OAuth consent screen

1. To open the OAuth consent screen creator, select APIs & Services » OAuth consent screen in your GCP project.
2. Select the user type.

   > You can select the Internal user type only if the GCP project belongs to an organization and the connector users are members of the same organization.
   >
   > The External user type causes the authentication to expire in seven days. If you choose this type, you need to renew authentication weekly.
3. Select Create.
4. Provide the following information:

   > * App name: Snowflake Connector for Google Analytics Raw Data
   > * User support email: your email address
   > * Developer contact information: your email address
5. Select Save and continue.
6. Select Add or remove scopes » Manually add scopes. Copy the following addresses:

   > ```none
   > https://www.googleapis.com/auth/bigquery.readonly
   > https://www.googleapis.com/auth/cloudplatformprojects.readonly
   > ```
7. To add the scopes, paste each address in a dialog and select Add to table.
8. Select Update.

For External user type:

> 1. Select Test users » Add users.
> 2. Enter the email addresses of users that are allowed to use the connector.
> 3. Select Add.

To finish configuration, select Save and continue » Back to dashboard.

## Configuring the OAuth client ID

The following procedure describes how to configure the OAuth Client ID:

1. To open the OAuth consent screen creator, select APIs & Services » Credentials in your GCP project.
2. Select Create credentials » OAuth client ID.
3. In the Application type dropdown list, select Web application.
4. In the Name box, enter the following name: Snowflake Connector for Google Analytics Raw Data ID.
5. Select Authorized redirect URIs » Add URI.
6. In the Snowflake Connector for Google Analytics Raw Data interface, go to the third step of the connector configuration: Authentication. Choose OAuth2 and copy the value from the Redirect URL box.
7. Go back to the GCP interface, and paste the value to the URI box.
8. Select Create.
9. Copy the Your Client ID and Your Client Secret values.
10. Paste the values into the corresponding boxes in the Snowflake Connector for Google Analytics Raw Data interface.
11. Select Sign in.

## Preventing session expiration for OAuth consent screen

The following procedure describes how to prevent session expiration for OAuth Consent Screen:

1. In the Google Admin Console menu, select Security » Access and data control » Google Cloud session control.
2. In the Reauthentication policy section, select the Exempt Trusted apps checkbox.
3. In the Google Admin Console menu, select Security » API Controls » App Access Control.
4. In the Configured apps section, select Add app » OAuth App Name Or Client ID.
5. Copy the client ID created in Configuring the OAuth Client ID, and paste it into the box.
6. Select Search.
7. Select Snowflake Connector for Google Analytics Raw Data application name.
8. Select the created OAuth Client ID checkbox, and click Select.
9. In the Scope section, select All users.
10. Select Continue.
11. In the Access to Google Data section, select Trusted.
12. Select Continue.
13. On the Review screen, select Finish.

---
title: Configuring replication for the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/configure-replication.md
section: Connectors & Drivers
---

# Configuring replication for the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The process of configuring replication for the Snowflake Connector for MySQL the following steps:

* Adding a data source

And optionally:

* Add a source table for replication
* Remove a table from replication

## Adding a data source

A data source is a representation of a single MySQL server.
The Snowflake Connector for MySQL can replicate data from multiple data sources.
Before you start replication, you need to add at least one data source.

The Snowflake Connector for MySQL replicates data from each data source to a distinct destination
database in Snowflake. The same destination database cannot be used by multiple data sources.

To add a data source, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_DATA_SOURCE('<data_source_name>', '<dest_db>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the unique name of the data source. The name should correspond to the name of a datasource defined in the agent configuration. Please ensure that the chosen name complies with the following requirements:
> >
> >     * The name contains only uppercase letters (A-Z), and decimal digits (0-9).
> >     * The name cannot be longer than 50 characters.
> >
> > `dest_db`
> > :   Specifies the name of the destination database in Snowflake. If the database does not exist, the procedure automatically creates it.
> >     Otherwise, the connector uses an existing database. In that case, you must grant privileges on the database to the connector before adding a data source.
>
> > **Note:**
> >
> > Once added, a data source cannot be renamed or dropped.

### (Optional) Granting privileges on the destination database

To use an existing database as a destination database, the Snowflake Connector for MySQL requires the [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) permission on that database. The connector is the owner of the schemas and tables containing ingested MySQL data.

To grant the [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) permission, run the following command:

> ```sqlsyntax
> GRANT CREATE SCHEMA ON DATABASE <dest_db> TO APPLICATION <app_db_name>;
> ```
>
> Where:
>
> > `dest_db`
> > :   Specifies the name of the destination database for the data from a data source.
> >
> > `app_db_name`
> > :   Specifies the name of the connector database.

## Adding other data sources

You can add new data sources at any time. To add a new data source while the agent is already running, do the following:

1. Add a data source.
2. Ensure the agent is stopped.
3. [Configure the agent connection to the new data source](install-agent.md).
4. [Run the Docker container of the agent](install-agent.md).

## Add a source table for replication

To add source tables for replication, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_TABLES('<data_source_name>', '<schema_name>', <table_names_array>);
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source that contains the source table.
> >
> > `schema_name`
> > :   Specifies the name of the schema of the source table.
> >
> > `table_names_array`
> > :   Specifies the array of table names:
> >
> >     `ARRAY_CONSTRUCT('<table_name>', '<other_table_name>', ...)`

Adding a source table has the following effects:

* `schema_name` and `table_name` are used as the schema name and table name respectively for replicating source data from the source database.

> **Note:**
>
> In one procedure call you can add many tables from the same datasource and schema.

> **Note:**
>
> **Schema and table names must match**
>
> You must use the exact table name and schema name, including case, as defined in the source database. The names you
> provide are used verbatim to generate the SELECT query in the source database. MySQL server names can be
> case-sensitive and using a different case could result in a “table does not exist” exception.
>
> **Recently removed tables**
>
> If tables were recently removed (Remove a table from replication), it might not be possible to add them back at this point in configuration.
> If an error with a message `Tables are not ready to be re-added` appears, wait several minutes before trying again.

## Add a source table with column filters

To add a source table with filtered columns, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_TABLE_WITH_COLUMNS('<data_source_name>', '<schema_name>', '<table_name>', <included_columns_array>, <excluded_columns_array>);
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source that contains the source table.
> >
> > `schema_name`
> > :   Specifies the name of the schema of the source table.
> >
> > `table_name`
> > :   Specifies the name of the source table.
> >
> > `included_columns`
> > :   Specifies the array of column names that should be replicated:
> >
> >     `ARRAY_CONSTRUCT('<column_name>', '<other_column_name>', ...)`
> >
> > `excluded_columns`
> > :   Specifies the array of column names that should be ignored:
> >
> >     `ARRAY_CONSTRUCT('<column_name>', '<other_column_name>', ...)`

> **Attention:**
>
> Column names passed to the procedure must be case-sensitive, exactly as they are represented in source database.

Following rules apply to the above procedure:

* Filtering occurs before the data is ingested to Snowflake - only data from the chosen columns is streamed to Snowflake in both snapshot and incremental loads.
* `included_columns` and `excluded_columns` are just masks. This way the connector will not throw an error if specified column does not exist. Mask for the non-existent column will simply get ignored.
* You shouldn’t provide both `included_columns` and `excluded_columns`. If you want to list `included_columns`, you should leave the `excluded_columns` empty, and vice versa.
* If both arrays are not empty and there aren’t any conflicting columns, `included_columns` takes precedence over `excluded_columns`.
* If a column appears in both `included_columns` and `excluded_columns`, the procedure throws an error.
* If both `included_columns` and `excluded_columns` are empty arrays, all available columns will be ingested.
* Regardless of configuration, primary key columns always get replicated.

For example, let’s assume we have a source table with given columns: A, B, C, D, where A is a primary key column, then:

| Included columns | Excluded columns | Expected result |
| --- | --- | --- |
| [] | [] | [A, B, C, D] |
| [A, B] | [] | [A, B] |
| [B] | [] | [A, B] |
| [] | [C, D] | [A, B] |
| [] | [A, B] | [A, C, D] |
| [A, B, Z] | [] | [A, B] |
| [A] | [A] | Error |

## Remove a table from replication

To remove a **single source table** from replication, run the following command:

```sqlsyntax
CALL PUBLIC.REMOVE_TABLE('<data_source_name>', '<schema_name>', '<table_name>');
```

Where:

> `data_source_name`
> :   Specifies the name of the data source that contains the source table.
>
> `schema_name`
> :   Specifies the name of the schema of the source table.
>
> `table_name`
> :   Specifies the name of the source table.

To remove **multiple source tables** from the same data source and schema with one procedure call, run the following command:

```sqlsyntax
CALL PUBLIC.REMOVE_TABLES('<data_source_name>', '<schema_name>', '<table_names_array>');
```

Where:

> `data_source_name`
> :   Specifies the name of the data source that contains the source table.
>
> `schema_name`
> :   Specifies the name of the schema of the source table.
>
> `table_names_array`
> :   Specifies the array of table names:
>
>     `ARRAY_CONSTRUCT('<table_name>', '<other_table_name>', ...)`

> **Note:**
>
> The process of removing a table from replication takes a few minutes. Once complete, the table will disappear from the `PUBLIC.REPLICATION_STATE` view in the connector (see [Monitoring the Snowflake Connector for MySQL](monitor.md)). Only then can it be enabled for replication again.

At this point the destination table is still **owned by the connector application**. If you wish to drop or otherwise modify the destination table, you need to first transfer its ownership to a role in your account. Execute the following query as `ACCOUNTADMIN`:

```sqlsyntax
GRANT OWNERSHIP ON TABLE <destination_database_name>.<schema_name>.<table_name>
  TO ROLE <role_name>
  REVOKE CURRENT GRANTS;
```

> **Note:**
>
> If you’re removing a table from replication fix its `FAILED` state, you will also need to rename or drop the destination table manually before enabling its replication again.

## Configuring scheduled replication

The connector can replicate data in two modes: continuous or scheduled. The default is a continuous mode.

Continuous mode replicates data as fast as possible. It requires running an operational warehouse 24/7, which might generate unnecessary costs, even without an ongoing replication.

Scheduled mode replicates data according to a configured schedule. It aims to reduce replication costs when there is no need to replicate data continuously, or the data volume is small (causing the connector to be in idle state most of the time).

Scheduled mode introduces the concept of replication completion. The snapshot replication begins when the `SELECT <columns> FROM <TABLE>` query execution starts, and it ends when data gets replicated into the destination table.
The incremental replication begins from the previously stored change data capture (CDC) pointer, but it does not have an ending, as the data is ingested continuously.
Therefore, the connector replicates data from previously stored CDC pointer until the latest CDC pointer (determined at the start of the replication). This way, the connector provides the completion of replication in a scheduled mode.

Scheduled mode reduces replication costs by suspending the operational warehouse. The warehouse can be suspended if the replication of each source table is completed. The warehouse remains suspended until the next run of the replication, according to the schedule.

> **Note:**
>
> Only one replication can run at a given time. If a replication is still running when the next scheduled run time occurs, then that scheduled time is skipped.

To enable scheduled mode, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ENABLE_SCHEDULED_REPLICATION('<data_source_name>', '<schedule>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source.
> >
> > `schedule`
> > :   Specifies the schedule or frequency at which the connector runs the replication of the data source. The minimum allowed
> >     frequency is 15 minutes. For details on specifying the schedule or frequency, see
> >     [SCHEDULE parameter](../../sql-reference/sql/create-task.md).

Schedule examples:

* `60 MINUTE`
  :   Schedules replication to every 60 minutes.
* `USING CRON 0 2 * * * UTC`
  :   Schedules replication to 2 a.m. UTC daily.

To disable scheduled mode, run the following command:

> ```sqlsyntax
> CALL PUBLIC.DISABLE_SCHEDULED_REPLICATION('<data_source_name>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source.

To check current schedule, see [Viewing data sources](monitor.md).

> **Note:**
>
> The operational warehouse handles replications from all data sources. The warehouse can only be suspended if the replication of each source table from every data source is completed. In other words, scheduled mode must be enabled for all data sources to ensure the auto-suspension works properly.

## Next steps

After completing these procedures, follow the steps in [Viewing MySQL data in Snowflake](view-data.md)

---
title: Configuring replication for the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/configure-replication.md
section: Connectors & Drivers
---

# Configuring replication for the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The process of configuring replication for the Snowflake Connector for PostgreSQL the following steps:

* Adding a data source

And optionally:

* Add a source table for replication
* Remove a table from replication

## Adding a data source

A data source is a representation of a single PostgreSQL server.
The Snowflake Connector for PostgreSQL can replicate data from multiple data sources.
Before you start replication, you need to add at least one data source.

The Snowflake Connector for PostgreSQL replicates data from each data source to a distinct destination
database in Snowflake. The same destination database cannot be used by multiple data sources.

To add a data source, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_DATA_SOURCE('<data_source_name>', '<dest_db>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the unique name of the data source. The name should correspond to the name of a datasource defined in the agent configuration. Please ensure that the chosen name complies with the following requirements:
> >
> >     * The name contains only uppercase letters (A-Z), and decimal digits (0-9).
> >     * The name cannot be longer than 50 characters.
> >
> > `dest_db`
> > :   Specifies the name of the destination database in Snowflake. If the database does not exist, the procedure automatically creates it.
> >     Otherwise, the connector uses an existing database. In that case, you must grant privileges on the database to the connector before adding a data source.
>
> > **Note:**
> >
> > Once added, a data source cannot be renamed or dropped.

### (Optional) Granting privileges on the destination database

To use an existing database as a destination database, the Snowflake Connector for PostgreSQL requires the [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) permission on that database. The connector is the owner of the schemas and tables containing ingested PostgreSQL data.

To grant the [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) permission, run the following command:

> ```sqlsyntax
> GRANT CREATE SCHEMA ON DATABASE <dest_db> TO APPLICATION <app_db_name>;
> ```
>
> Where:
>
> > `dest_db`
> > :   Specifies the name of the destination database for the data from a data source.
> >
> > `app_db_name`
> > :   Specifies the name of the connector database.

## Adding other data sources

You can add new data sources at any time. To add a new data source while the agent is already running, do the following:

1. Add a data source.
2. Ensure the agent is stopped.
3. [Configure the agent connection to the new data source](install-agent.md).
4. [Run the Docker container of the agent](install-agent.md).

## Add a source table for replication

To add source tables for replication, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_TABLES('<data_source_name>', '<schema_name>', <table_names_array>);
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source that contains the source table.
> >
> > `schema_name`
> > :   Specifies the name of the schema of the source table.
> >
> > `table_names_array`
> > :   Specifies the array of table names:
> >
> >     `ARRAY_CONSTRUCT('<table_name>', '<other_table_name>', ...)`

Adding a source table has the following effects:

* `schema_name` and `table_name` are used as the schema name and table name respectively for replicating source data from the source database.

> **Note:**
>
> In one procedure call you can add many tables from the same datasource and schema.

> **Note:**
>
> **Schema and table names must match**
>
> You must use the exact table name and schema name, including case, as defined in the source database. The names you
> provide are used verbatim to generate the SELECT query in the source database. PostgreSQL server names are
> case-sensitive and using a different case could result in a “table does not exist” exception.
>
> **Recently removed tables**
>
> If some tables were recently removed (Remove a table from replication), it might not be possible to add them yet.
> If an error with a message `Tables are not ready to be re-added` appears, wait several minutes before trying again.

## Add a source table with column filters

To add a source table with filtered columns, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ADD_TABLE_WITH_COLUMNS('<data_source_name>', '<schema_name>', '<table_name>', <included_columns_array>, <excluded_columns_array>);
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source that contains the source table.
> >
> > `schema_name`
> > :   Specifies the name of the schema of the source table.
> >
> > `table_name`
> > :   Specifies the name of the source table.
> >
> > `included_columns`
> > :   Specifies the array of column names that should be replicated:
> >
> >     `ARRAY_CONSTRUCT('<column_name>', '<other_column_name>', ...)`
> >
> > `excluded_columns`
> > :   Specifies the array of column names that should be ignored:
> >
> >     `ARRAY_CONSTRUCT('<column_name>', '<other_column_name>', ...)`

> **Attention:**
>
> Column names passed to the procedure must be case-sensitive, exactly as they are represented in source database.

Following rules apply to the above procedure:

* Filtering occurs before the data is ingested to Snowflake - only data from the chosen columns is streamed to Snowflake in both snapshot and incremental loads.
* `included_columns` and `excluded_columns` are just masks. This way the connector will not throw an error if specified column does not exist. Mask for the non-existent column will simply get ignored.
* You shouldn’t provide both `included_columns` and `excluded_columns`. If you want to list `included_columns`, you should leave the `excluded_columns` empty, and vice versa.
* If both arrays are not empty and there aren’t any conflicting columns, `included_columns` takes precedence over `excluded_columns`.
* If a column appears in both `included_columns` and `excluded_columns`, the procedure throws an error.
* If both `included_columns` and `excluded_columns` are empty arrays, all available columns will be ingested.
* Regardless of configuration, primary key columns always get replicated.

For example, let’s assume we have a source table with given columns: A, B, C, D, where A is a primary key column, then:

| Included columns | Excluded columns | Expected result |
| --- | --- | --- |
| [] | [] | [A, B, C, D] |
| [A, B] | [] | [A, B] |
| [B] | [] | [A, B] |
| [] | [C, D] | [A, B] |
| [] | [A, B] | [A, C, D] |
| [A, B, Z] | [] | [A, B] |
| [A] | [A] | Error |

## Remove a table from replication

To remove a **single source table** from replication, run the following command:

```sqlsyntax
CALL PUBLIC.REMOVE_TABLE('<data_source_name>', '<schema_name>', '<table_name>');
```

Where:

> `data_source_name`
> :   Specifies the name of the data source that contains the source table.
>
> `schema_name`
> :   Specifies the name of the schema of the source table.
>
> `table_name`
> :   Specifies the name of the source table.

To remove **multiple source tables** from the same data source and schema with one procedure call, run the following command:

```sqlsyntax
CALL PUBLIC.REMOVE_TABLES('<data_source_name>', '<schema_name>', '<table_names_array>');
```

Where:

> `data_source_name`
> :   Specifies the name of the data source that contains the source table.
>
> `schema_name`
> :   Specifies the name of the schema of the source table.
>
> `table_names_array`
> :   Specifies the array of table names:
>
>     `ARRAY_CONSTRUCT('<table_name>', '<other_table_name>', ...)`

> **Note:**
>
> The process of removing a table from replication takes a few minutes. Once complete, the table will disappear from the `PUBLIC.REPLICATION_STATE` view in the connector (see [Monitoring the Snowflake Connector for PostgreSQL](monitor.md)). Only then can it be enabled for replication again.

At this point the destination table is still **owned by the connector application**. If you wish to drop or otherwise modify the destination table, you need to first transfer its ownership to a role in your account. Execute the following query as `ACCOUNTADMIN`:

```sqlsyntax
GRANT OWNERSHIP ON TABLE <destination_database_name>.<schema_name>.<table_name>
  TO ROLE <role_name>
  REVOKE CURRENT GRANTS;
```

> **Note:**
>
> If you’re removing a table from replication fix its `FAILED` state, you will also need to rename or drop the destination table manually before enabling its replication again.

## Configuring scheduled replication

The connector can replicate data in two modes: continuous or scheduled. The default is a continuous mode.

Continuous mode replicates data as fast as possible. It requires running an operational warehouse 24/7, which might generate unnecessary costs, even without an ongoing replication.

Scheduled mode replicates data according to a configured schedule. It aims to reduce replication costs when there is no need to replicate data continuously, or the data volume is small (causing the connector to be in idle state most of the time).

Scheduled mode introduces the concept of replication completion. The snapshot replication begins when the `SELECT <columns> FROM <TABLE>` query execution starts, and it ends when data gets replicated into the destination table.
The incremental replication begins from the previously stored change data capture (CDC) pointer, but it does not have an ending, as the data is ingested continuously.
Therefore, the connector replicates data from previously stored CDC pointer until the latest CDC pointer (determined at the start of the replication). This way, the connector provides the completion of replication in a scheduled mode.

Scheduled mode reduces replication costs by suspending the operational warehouse. The warehouse can be suspended if the replication of each source table is completed. The warehouse remains suspended until the next run of the replication, according to the schedule.

> **Note:**
>
> Only one replication can run at a given time. If a replication is still running when the next scheduled run time occurs, then that scheduled time is skipped.

To enable scheduled mode, run the following command:

> ```sqlsyntax
> CALL PUBLIC.ENABLE_SCHEDULED_REPLICATION('<data_source_name>', '<schedule>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source.
> >
> > `schedule`
> > :   Specifies the schedule or frequency at which the connector runs the replication of the data source. The minimum allowed
> >     frequency is 15 minutes. For details on specifying the schedule or frequency, see
> >     [SCHEDULE parameter](../../sql-reference/sql/create-task.md).

Schedule examples:

* `60 MINUTE`
  :   Schedules replication to every 60 minutes.
* `USING CRON 0 2 * * * UTC`
  :   Schedules replication to 2 a.m. UTC daily.

To disable scheduled mode, run the following command:

> ```sqlsyntax
> CALL PUBLIC.DISABLE_SCHEDULED_REPLICATION('<data_source_name>');
> ```
>
> Where:
>
> > `data_source_name`
> > :   Specifies the name of the data source.

To check current schedule, see [Viewing data sources](monitor.md).

> **Note:**
>
> The operational warehouse handles replications from all data sources. The warehouse can only be suspended if the replication of each source table from every data source is completed. In other words, scheduled mode must be enabled for all data sources to ensure the auto-suspension works properly.

> **Warning:**
>
> Replication performed with reduced frequency may result in a significant grow of the Write-Ahead Logging (WAL) files. Ensure there is enough free space on the disk for the WAL files, taking into account the frequency of changes and their size in your data source.

## Next steps

After completing these procedures, follow the steps in [Viewing PostgreSQL data in Snowflake](view-data.md)

---
title: Configuring service account authentication for Google Cloud Platform (GCP)
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-create-service-account-key.md
section: Connectors & Drivers
---

# Configuring service account authentication for Google Cloud Platform (GCP)

## Prerequisites

An application that authenticates to Google using a service account must provide a service account key file with correct roles set.

To provide the service account key file, you must create a Google Cloud Platform (GCP) project first. Refer to the GCP documentation to learn how to create a GCP project.

## Creating a service account key

The following procedure describes how to create a service account:

1. To open the service account creator, select APIs & Services » Credentials in your GCP project.
2. Select Create credentials » Service account.
3. In the Service account details form type in a service account name of your choice.
4. In the Grant this service account access to project section you need to grant this service account at least the following set of roles: BigQuery Data Viewer, BigQuery Read Session User and BigQuery Job User.
5. After creating a service account find it on the list in the Credentials section and press on its name in order to manage the service account.
6. Select Keys » Add key » Create a new key.
7. In the key type selection view choose the recommended JSON type and press Create in order to save the service account key, which will be needed during the connector configuration.

## Setting up access to multiple GCP projects

You may have multiple Google Analytics properties exported to separate GCP projects. To ingest data for all of them with a single Snowflake Connector for Google Analytics Raw Data instance, you will need to allow access for the service account to each of the GCP projects.

The following procedure describes how to allow access for the previously created service account to an additional GCP project.

1. Note the Email value of the service account you created earlier.
2. In the selected GCP project, go to the IAM & Admin » IAM section.
3. Above the list of principals, select Grant Access.
4. In the New principals form type the Email of your service account.
5. In the Select a role form choose all the following roles: BigQuery Data Viewer, BigQuery Read Session User and BigQuery Job User.
6. Press Save and confirm that the service account’s Email appears in the list of principals.

---
title: Configuring the Snowflake Connector for Google Analytics Raw Data using SQL
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-configuring-sql.md
section: Connectors & Drivers
---

# Configuring the Snowflake Connector for Google Analytics Raw Data using SQL

This topic provides information on configuring the Snowflake Connector for Google Analytics Raw Data
through SQL.

> **Note:**
>
> Snowflake Connector for Google Analytics Raw Data configuration is typically done using Snowsight. SQL configuration is
> considered an advanced configuration method and should only be used by
> those familiar with the underlying details of connector configuration.

To configure the connector using SQL statements, do the following:

* Prepare a warehouse, data owner role and destination database.
* Provision the connector.
* Create Snowflake objects required for connecting to the GCP.
* Configure connection with the GCP.

> **Note:**
>
> In order to provision the connector and configure connection you will have to use stored procedures that are defined
> in the PUBLIC schema of the database that serves as an instance of the connector installation database.
>
> Before calling these stored procedures, select that database as the database to use for the session.
>
> For example, if that database is named `snowflake_connector_for_google_analytics_raw_data`, run the following command:
>
> ```sqlexample
> USE DATABASE snowflake_connector_for_google_analytics_raw_data;
> ```

## Prepare a warehouse, data owner role and destination database

1. Grant usage on specified warehouse and task execution permissions to the connector application.

   > ```sqlexample
   > USE ROLE accountadmin;
   > CREATE WAREHOUSE google_analytics_raw_data_warehouse with warehouse_size = 'X-Small';
   > GRANT USAGE ON WAREHOUSE google_analytics_raw_data_warehouse TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > ```
2. Create the data owner role.

   > ```sqlexample
   > USE ROLE accountadmin;
   > CREATE OR REPLACE ROLE google_analytics_raw_data_resources_provider;
   > GRANT CREATE DATABASE ON ACCOUNT TO ROLE google_analytics_raw_data_resources_provider;
   > GRANT USAGE ON WAREHOUSE google_analytics_raw_data_warehouse TO ROLE google_analytics_raw_data_resources_provider;
   > GRANT ROLE google_analytics_raw_data_resources_provider TO USER ADMIN;
   > ```
3. Create a destination database and schema.

   > You may also use an existing destination database and schema – especially if you’re re-installing the connector.
   >
   > ```sqlexample
   > USE ROLE google_analytics_raw_data_resources_provider;
   > CREATE DATABASE google_analytics_raw_data_dest_db;
   > CREATE SCHEMA google_analytics_raw_data_dest_db.google_analytics_raw_data_dest_schema;
   > ```
4. Add required grants on the destination database to the application.

   > ```sqlexample
   > USE ROLE accountadmin;
   > GRANT USAGE ON DATABASE google_analytics_raw_data_dest_db TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT USAGE ON SCHEMA google_analytics_raw_data_dest_db.google_analytics_raw_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   >
   > GRANT CREATE TABLE ON SCHEMA google_analytics_raw_data_dest_db.google_analytics_raw_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT CREATE VIEW ON SCHEMA google_analytics_raw_data_dest_db.google_analytics_raw_data_dest_schema TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > ```
5. (Optional) Transfer ownership of tables and views in the destination schema

   If the connector was reinstalled and a previous destination schema is reused, ownership of all tables and views in
   destination schema must be transferred to the connector. The connector requires ownership privilege to manage
   grants on objects in schema and to recreate flattened views when schema of ingested table is changed.

   To transfer the ownership call the `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` function.

   ```sqlexample
   USE ROLE accountadmin;
   CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<connector_app>, true, <destination_database>, <destination_schema>);
   ```

   The `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` is a system function provided by Snowflake that allows the transfer of
   ownership of tables and views in a specified database or schema to the application. Only the ownership of regular tables and
   regular views is transferred, e.g. ownership of dynamic tables, external tables, materialized views, etc. won’t be
   transferred.

   The function has the following signature:

   ```sqlexample
   SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<to_app>, <should_copy_grants>, <from_database>, <from_schema>)
   ```

   Where:

   > `to_app`
   > :   Specifies the name of the application to which the ownership of objects should be transferred.
   >
   > `should_copy_grants`
   > :   If `TRUE` then copy existing grants, otherwise revoke. Copying grants requires `MANAGE GRANTS`
   >     permission on the caller.
   >
   > `from_database`
   > :   Name of the database containing objects whose ownership should be changed.
   >
   > `from_schema`
   > :   (Optional) name of the schema containing objects whose ownership should be changed. If no schema is specified,
   >     ownership is transferred on tables and views in all schemas in the provided database. Objects in managed schemas
   >     are omitted during ownership transfer.

   To execute the function the caller must meet one of the following conditions:

   * It has `MANAGE GRANTS` permission (e.g. ACCOUNTADMIN or SECURITYADMIN role), or
   * It contains role owning the application instance and role owning all objects to transfer the ownership. Objects on
     which the ownership is missing are omitted by the function.

   For example, to transfer ownership to the connector that:

   * Was installed as `snowflake_connector_for_google_analytics_raw_data`
   * Uses the schema named `dest_db.dest_schema` for the Google Analytics data in Snowflake

   Run the following command:

   ```sqlexample
   USE ROLE accountadmin;
   CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION('snowflake_connector_for_google_analytics_raw_data', true, 'dest_db', 'dest_schema');
   ```

   If needed, grant `DATA_READER` application role to the role previously owning the data to prevent
   disruptions of existing pipelines using the data:

   ```sqlexample
   GRANT APPLICATION ROLE <connector_app>.DATA_READER TO ROLE <previous_data_owner_role>;
   ```

   Note that `DATA_READER` application role won’t have any grants on tables and views in destination schema until
   `PROVISION_CONNECTOR` procedure is run.

## Provision the connector

1. Call the `PROVISION_CONNECTOR` procedure.

   > Pass the name of the warehouse, destination database and schema, and data owner role. These values are case-sensitive.
   >
   > ```sqlexample
   > CALL PROVISION_CONNECTOR(
   >     'GOOGLE_ANALYTICS_RAW_DATA_WAREHOUSE',
   >     'GOOGLE_ANALYTICS_RAW_DATA_DEST_DB.GOOGLE_ANALYTICS_RAW_DATA_DEST_SCHEMA',
   >     'GOOGLE_ANALYTICS_RAW_DATA_RESOURCES_PROVIDER'
   > );
   > ```

## Create Snowflake objects required for connecting to the GCP

1. Create a security integration for your service account.

   > First, you need a service account key file. For details on how to create one see [Configuring service account authentication for Google Cloud Platform (GCP)](gard-connector-create-service-account-key.md)
   >
   > ```sqlexample
   > CREATE SECURITY INTEGRATION
   > google_analytics_raw_data_security_integration
   > type = api_authentication
   > auth_type = oauth2
   > oauth_client_id = '<value of client_id from the JSON key file>'
   > oauth_token_endpoint = 'https://oauth2.googleapis.com/token'
   > enabled = true
   > oauth_allowed_scopes = (
   >     'https://www.googleapis.com/auth/bigquery.readonly',
   >     'https://www.googleapis.com/auth/cloudplatformprojects.readonly'
   > )
   > oauth_assertion_issuer = '<value of client_email from the JSON key file>'
   > oauth_grant='JWT_BEARER'
   > oauth_client_secret = '<value of private_key from the JSON key file with no delimiters or newlines>';
   > ```
2. Create a secret using the security integration.

   > ```sqlexample
   > CREATE DATABASE google_analytics_raw_data_connector_secret;
   > CREATE SCHEMA google_analytics_raw_data_connector_secret.oauth;
   >
   > USE SCHEMA google_analytics_raw_data_connector_secret.oauth;
   >
   > CREATE OR REPLACE SECRET google_analytics_raw_data
   > type = oauth2
   > api_authentication = google_analytics_raw_data_security_integration;
   > ```
3. Provide secret-related grants to the connector application.

   > ```sqlexample
   > GRANT USAGE ON DATABASE google_analytics_raw_data_connector_secret TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT USAGE ON SCHEMA google_analytics_raw_data_connector_secret.oauth TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > GRANT READ ON SECRET google_analytics_raw_data_connector_secret.oauth.google_analytics_raw_data TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > ```
4. Configure external access.

   > Keep in mind, that the path to the secret passed to `allowed_authentication_secrets` is case-sensitive.
   >
   > ```sqlexample
   > USE SCHEMA google_analytics_raw_data_connector_secret.oauth;
   >
   > CREATE NETWORK RULE
   > google_analytics_raw_data_allow_rule
   > mode = EGRESS
   > type = HOST_PORT
   > value_list = (
   >     'www.googleapis.com',
   >     'bigquery.googleapis.com',
   >     'bigquerystorage.googleapis.com',
   >     'cloudresourcemanager.googleapis.com',
   >     'oauth2.googleapis.com'
   > );
   >
   > CREATE EXTERNAL ACCESS INTEGRATION
   > google_analytics_raw_data_external_access_integration
   > allowed_network_rules = (google_analytics_raw_data_allow_rule)
   > allowed_authentication_secrets = ('GOOGLE_ANALYTICS_RAW_DATA_CONNECTOR_SECRET.OAUTH.GOOGLE_ANALYTICS_RAW_DATA')
   > enabled = true;
   >
   > GRANT USAGE ON INTEGRATION google_analytics_raw_data_external_access_integration TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
   > ```

## Configure connection with the GCP

1. Call the `CONFIGURE_CONNECTION` procedure.

   > Pass the name of the external access integration, the full path to the secret, and the name of the security integration. These values are case sensitive.
   >
   > > ```sqlexample
   > > CALL CONFIGURE_CONNECTION(
   > >     'GOOGLE_ANALYTICS_RAW_DATA_EXTERNAL_ACCESS_INTEGRATION',
   > >     'GOOGLE_ANALYTICS_RAW_DATA_CONNECTOR_SECRET.OAUTH.GOOGLE_ANALYTICS_RAW_DATA',
   > >     'GOOGLE_ANALYTICS_RAW_DATA_SECURITY_INTEGRATION'
   > > );
   > > ```
2. Check the connection status.

   > ```sqlexample
   > CALL CONNECTION_STATUS();
   > ```
   >
   > If there are no errors, you can follow [Setting up data ingestion for your Snowflake Connector for Google Analytics Raw Data](gard-connector-setting-up-data.md) to enable your Google Analytics properties.

---
title: Cost Governance of Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/cost-governance.md
section: Connectors & Drivers
---

# Cost Governance of Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for SharePoint.

## Measuring Cost of the Connector

If the connector has a separate account only for data ingestion and storage, and the account shows no
other activity (such as executing queries by users using the ingested data), you can read the overall cost on the
account level. To learn more, refer to [Exploring overall cost](../../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector or you need to investigate the costs further, you should
analyze the charged costs for the three components separately:

* Compute Cost
* Storage Cost
* Cortex Search service cost
* Data Transfer Cost

For an introduction to these three components of cost, refer to
[Understanding overall cost](../../../user-guide/cost-understanding-overall.md).

### General Recommendations

To obtain cost generated by the connector, we recommend that you create a separate account solely for
using the connector. This way you can track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, try the following:

* Create a separate database for storing ingested data to track storage cost easier.
* Allocate a warehouse only for the connector to get the exact compute cost.
* Use [object tags](../../../user-guide/object-tagging/introduction.md) on databases and a warehouse to build custom cost reports.

### Compute Cost

We recommend that you create a separate warehouse only for the connector. This setup allows you to
create [resource monitors](../../../user-guide/resource-monitors.md) on the warehouse. You can use the monitors to send email alerts and suspend the warehouse,
stopping the connector when the set credit quota is exceeded. The connector automatically resumes after the credit quota is
renewed. Note that setting credit quota too low in configurations where large volumes of data are ingested may cause the connector to not ingest all data.

For information on how to check credits consumed by the warehouse, refer to
[Exploring compute cost](../../../user-guide/cost-exploring-compute.md). You can
also assign [object tags](../../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

If the warehouse used by the connector is used by other workflows, you can split the cost by roles.
To split usage by roles, see [Attributing cost](../../../user-guide/cost-attributing.md) and add the following `WHERE` clause on the QUERY_HISTORY view:

```sqlsyntax
warehouse_name = '<connector warehouse name>' AND role_name = 'APP_PRIMARY'
```

The query gives only an approximation of the cost.

> **Note:**
>
> Only one native app may use the warehouse, otherwise costs of different applications are inseparable
> because each native app uses the same role name (APP_PRIMARY).

#### Parse document function cost

For cost considerations related to the Parse document function, see
[Cost considerations](../../../user-guide/snowflake-cortex/parse-document.md).

### Cortex Search service cost

For cost considerations related to the Cortex Search service, see
[cost considerations](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

### Storage Cost

The Snowflake Connector for SharePoint stores data in two places:

* The connector database, which is created from the listing and which holds the connector internal state.
* The user-specified schema where the ingested data is stored.

Data storage is also used by the Snowflake [Fail-safe](../../../user-guide/data-failsafe.md) feature. The amount of data stored in Fail-safe
depends on the table updates done by the connector. The amount of data increases if the table rows ingested
from SharePoint are updated frequently or the whole table is reloaded. Typically, seven to ten days after
the connector is set up, the amount of Fail-safe data stabilizes (assuming that no reloads are performed and
that the flow of ingested data is at a steady rate).

If you want to check storage usage in Snowsight, we recommend that you have a separate database for storing
ingested data. This way you can filter the graphs for storage usage by object, which shows usage by separate
databases. You can also do it by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/account-usage/database_storage_usage_history.md) and filtering by both databases used by the connector.

If the database contains other schemas not related to the connector, you can query storage usage of a specific
schema that is dedicated to the data ingested from the connector. You can get the information from [TABLE_STORAGE_METRICS view](../../../sql-reference/account-usage/table_storage_metrics.md)
after filtering by database and schema names and aggregating columns with storage usage.

### Data Transfer Cost

The connector uses external access to retrieve data from SharePoint. Snowflake charges only for egress
traffic generated by the connector, based on the size of the requests from the connector to SharePoint.
The responses from SharePoint do not generate cost on Snowflake side.

Information on data transfer usage is available only in the aggregated form for all external access
integrations on the account level. To access the number of transferred bytes, use the [DATA_TRANSFER_HISTORY view](../../../sql-reference/account-usage/data_transfer_history.md)
and filter by the EXTERNAL_ACCESS transfer type.

### Healthcheck Task Cost

The connector creates a serverless task that will regularly check health status of your app instance and
send **only** the summarized result (if it’s healthy or not) to Snowflake.
The task is created after completing the installation wizard (or calling `FINALIZE_CONNECTOR_CONFIGURATION` in worksheets).
It runs in the background and generates a fixed cost of up to 0.5 credit/day
even if no SharePoint folder is enabled for replication.

The task cannot be manually stopped or dropped. However, to reduce this cost you can call `PAUSE_CONNECTOR` procedure
which will disable the task and not generate any cost when the connector is unused.

## Cost Optimization

### Determining the Optimal Warehouse Size for the Connector Instance

To find the optimal warehouse size for the connector, you should consider the factors that affect
the performance of the connector, such as the size of the instance, the number of enabled tables,
and the schedule for synchronizing each table. For example, if only a few tables are enabled the connector
might not benefit from increased parallelization.

We recommend that you define a set of measurable expectations, such as time intervals in which all tables should
be synchronized, and pick the smallest warehouse size that meets these expectations. For large amounts of ingested data
with tens of synchronized tables, the default recommendation is Large warehouse. On the other hand, when you just want to
try out the connector and enable a single table for ingestion, an X-Small warehouse should be sufficient. To find out if
you can downsize the warehouse, refer to [Monitoring warehouse load](../../../user-guide/warehouses-load-monitoring.md).

---
title: Cost governance of the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-pricing.md
section: Connectors & Drivers
---

# Cost governance of the Snowflake Connector for Google Analytics Aggregate Data

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for Google Analytics Aggregate Data.

## Measuring the cost of the connector

If the connector has a separate account only for data ingestion and storage,
and the account shows no other activity (such as executing queries by users using the ingested data),
you can read the overall cost on the account level. For more information, see [Exploring overall cost](../../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector, or if you want to investigate costs further, you can analyze the charged costs for the components separately:

* Compute cost
* Storage cost

For an introduction to these components of cost, see [Understanding overall cost](../../../user-guide/cost-understanding-overall.md).

### General recommendations

To determine the costs generated by the connector, you can create a separate account solely for the connector.
Using a specific account lets you track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, consider the following options:

* To track storage costs more easily, create a separate database for storing ingested data.
* To determine exact compute costs, allocate a warehouse only for the connector.
* To build custom cost reports, use [object tags](../../../user-guide/object-tagging/introduction.md) on databases and the warehouse.

### Compute cost

Snowflake recommends that you create a dedicated warehouse only for the connector.
This configuration allows you to create [resource monitors](../../../user-guide/resource-monitors.md) on the warehouse.
You can use the monitors to send email alerts and suspend the warehouse, stopping the connector when the set credit quota is exceeded.
The connector automatically resumes after the credit quota is renewed. Note that setting the credit quota too low in configurations where large volumes of data are ingested can prevent the connector from ingesting all the data.
A major benefit is that the warehouse size can be adjusted to the data volume.

For information about how to check credits consumed by the warehouse, see [Exploring compute cost](../../../user-guide/cost-exploring-compute.md).
You can also assign [object tags](../../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

If the warehouse used by the connector is used by other workflows, you can split the cost by roles.
To split usage by roles, use the [query for splitting warehouse usage](../../../user-guide/cost-attributing.md), and add the following `WHERE` clause on the QUERY_HISTORY view:

```sqlexample
WAREHOUSE_NAME = '<connector warehouse name>' AND
ROLE_NAME = '<role created for the connector to ingest data>'
```

Note that the role is the name created when the connector was installed, for example SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_RAW_DATA.

The query gives only an approximation of the cost.

### Storage cost

The Snowflake Connector for Google Analytics Aggregate Data stores data in two places:

* The connector database, which is created from the public share and holds the connector internal state
* The user-specified schema where the ingested data is stored

Data storage is also used by the Snowflake [Understanding and viewing Fail-safe](../../../user-guide/data-failsafe.md) feature.
The amount of data stored in Fail-safe depends on the table updates performed by the connector.

To check storage usage using Snowsight, you can use a separate database for storing ingested data.
This lets you filter the graphs for storage usage by object, which shows usage by individual database.
You can also view storage use by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/account-usage/database_storage_usage_history.md) view and filtering by databases used by the connector.

If the database contains other schemas not related to the connector, you can query
storage usage of a specific schema that is dedicated to the data ingested from the connector.
You can get the information from the [TABLE_STORAGE_METRICS view](../../../sql-reference/account-usage/table_storage_metrics.md) view after filtering by database and schema names and aggregating columns with storage usage.

## Determining the optimal warehouse size for the connector instance

For the Snowflake Connector for Google Analytics Aggregate Data, we recommend starting using an XSMALL warehouse and then experimenting with larger warehouses to possibly improve performance.

To find the optimal warehouse size for the connector, consider these factors:

* Number of configured reports
* Amount of data produced by each report
* Schedule of synchronizing reports

We recommend that you define a set of measurable expectations, such as time intervals
in which all reports should be synchronized, and pick the smallest warehouse size that meets these expectations.
To determine whether you need a larger warehouse, see [Monitoring warehouse load](../../../user-guide/warehouses-load-monitoring.md).

### Healthcheck Task Cost

The connector creates a serverless task that will regularly check health status of your app instance and send **only** the summarized result (if it’s healthy or not) to Snowflake.
The task is created after completing the installation wizard (or calling `FINALIZE_CONNECTOR_CONFIGURATION` in worksheets). It runs in the background and generates a fixed cost of up to 0.5 credit/day
even if no report is configured.

The task cannot be manually stopped or dropped. However, to reduce this cost you can call `PAUSE_CONNECTOR` procedure which will disable the task and not generate any cost when the connector is unused.

---
title: Cost governance of the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-pricing.md
section: Connectors & Drivers
---

# Cost governance of the Snowflake Connector for Google Analytics Raw Data

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for Google Analytics Raw Data.

## Measuring cost of the connector

If the connector has a separate account only for data ingestion and storage,
and the account shows no other activity (such as executing queries by users using the ingested data),
you can read the overall cost on the account level. To learn more, refer to [Exploring overall cost](../../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector or you need to investigate the costs further, you should analyze the charged costs for the components separately:

* Compute Cost
* Storage Cost
* Data Transfer Cost

For an introduction to these three components of cost, refer to [Understanding overall cost](../../../user-guide/cost-understanding-overall.md).

### General recommendations

To determine the costs generated by the connector, you can create a separate account solely for the connector.
Using a specific account lets you track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, consider the following options:

* To track storage costs more easily, create a separate database for storing ingested data.
* To determine exact compute costs, allocate a warehouse only for the connector.
* To build custom cost reports, use [object tags](../../../user-guide/object-tagging/introduction.md) on databases and the warehouse.

### Compute cost

Snowflake recommends that you create a dedicated warehouse only for the connector.
This configuration allows you to create [resource monitors](../../../user-guide/resource-monitors.md) on the warehouse.
You can use the monitors to send email alerts and suspend the warehouse, stopping the connector when the set credit quota is exceeded.
The connector automatically resumes after the credit quota is renewed. Note that setting the credit quota too low in configurations where large volumes of data are ingested can prevent the connector from ingesting all the data.
A major benefit is that the warehouse size can be adjusted to the data volume.

For information about how to check credits consumed by the warehouse, see [Exploring compute cost](../../../user-guide/cost-exploring-compute.md).
You can also assign [object tags](../../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

If the warehouse used by the connector is used by other workflows, you can split the cost by roles.
To split usage by roles, use the [query for splitting warehouse usage](../../../user-guide/cost-attributing.md), and add the following `WHERE` clause on the QUERY_HISTORY view:

```sqlexample
WAREHOUSE_NAME = '<connector warehouse name>' AND
ROLE_NAME = '<role created for the connector to ingest data>'
```

Note that the role is the name created when the connector was installed, for example SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_RAW_DATA.

The query gives only an approximation of the cost.

### Storage cost

The Snowflake Connector for Google Analytics Raw Data stores data in two places:

* The connector database, which is created from the public share and holds the connector internal state
* The user-specified schema where the ingested data is stored

Data storage is also used by the Snowflake [Understanding and viewing Fail-safe](../../../user-guide/data-failsafe.md) feature.
The amount of data stored in Fail-safe depends on the table updates performed by the connector.

To check storage usage using Snowsight, you can use a separate database for storing ingested data.
This lets you filter the graphs for storage usage by object, which shows usage by individual database.
You can also view storage use by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/account-usage/database_storage_usage_history.md) view and filtering by databases used by the connector.

If the database contains other schemas not related to the connector, you can query
storage usage of a specific schema that is dedicated to the data ingested from the connector.
You can get the information from the [TABLE_STORAGE_METRICS view](../../../sql-reference/account-usage/table_storage_metrics.md) view after filtering by database and schema names and aggregating columns with storage usage.

### Data transfer cost

Snowflake charges only for egress traffic generated by the connector, based on the size of the requests from the connector to Google Analytics Raw Data.
The responses from Google Analytics Raw Data do not generate cost on Snowflake side.

Information on data transfer usage is available only in the aggregated form for all external functions on the account level.
To access the number of transferred bytes, use the [DATA_TRANSFER_HISTORY view](../../../sql-reference/account-usage/data_transfer_history.md) and filter by the EXTERNAL_ACCESS transfer type.

There can be additional fees related to data transfer on the BigQuery side:
[data storage](https://cloud.google.com/bigquery/pricing#storage)
+
[egress traffic](https://cloud.google.com/bigquery/pricing#data_extraction_pricing).
Specifically the connector uses whats referred to as Streaming reads (Storage Read API).

Please review the associated documentation for details.

### Healthcheck task cost

The connector creates an internal, serverless task that regularly inspects the health of the instance, and sends a summary to Snowflake for monitoring purposes. The task is created after completing the installation wizard, or calling `CONFIGURE_CONNECTION` in a worksheet. It generates a fixed compute cost of up to 0.5 credits per day, even when no properties are enabled for ingestion.

The task cannot be explicitly suspended or dropped, however pausing the connector will also disable the healthcheck.

### Fresh Daily export type cost

Google Analytics 360 customers can use the Fresh Daily export type. This export type is more expensive than the regular
Daily export type, as it requires more frequent data refreshes. The connector will create reloads for the Fresh Daily tables
no more often than every hour for the next four days of the life of a new table record, which can increase the cost of the connector.

## Determining optimal warehouse size for the connector instance

To find the optimal warehouse size for the connector, you should consider the
factors that affect the performance of the connector, such as:

* Number of Google Analytics properties
* Amount of data produced by each of the properties
* Schedule of synchronizing properties

We recommend that you define a set of measurable expectations, such as time intervals
in which all tables should be synchronized, and pick the smallest warehouse size that meets these expectations.
To determine if you can downsize the warehouse, see [Monitoring warehouse load](../../../user-guide/warehouses-load-monitoring.md).

For Snowflake Connector for Google Analytics Raw Data, we recommend starting using an XSMALL warehouse and than experimenting with larger warehouse to possibly improve performance.

In addition, there can be a large difference in warehouse size requirement during different ingestion stages. For example, consider:

* Initial ingestion where the connector is loading historical data, possibly years worth of data, a larger warehouse can be beneficial.
* Normal daily ingestion, when loading only current daily increments of data, the smallest warehouses will suffice.

In addition, if a large property set is enabled for ingestion, consider a larger warehouse so that the connector can keep up with the data flow.

---
title: Cost governance of the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/cost-governance.md
section: Connectors & Drivers
---

# Cost governance of the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for MySQL.

## Measuring Cost of the Connector

If the connector has a separate account only for data ingestion and storage, and the account shows no other activity
(such as executing queries by users using the ingested data), you can read the overall cost on the account level.
To learn more, see [Exploring overall cost](../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector or you need to investigate the costs further,
you should analyze the charged costs for the three components separately:

* Compute Cost
* Storage Cost
* Data Transfer Cost

For an introduction to these three components of cost, see [Understanding overall cost](../../user-guide/cost-understanding-overall.md).

### General Recommendations

To obtain cost generated by the connector, we recommend that you create a separate account solely for using the connector.
Using a specific account you track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, consider the following:

* Create a separate database for storing ingested data to track storage cost easier.
* Allocate a warehouse only for the connector to get the exact compute cost.
* Use [object tags](../../user-guide/object-tagging/introduction.md) on databases and a warehouse to build custom cost reports.

### Compute Cost

We recommend that you use a pair of dedicated operations and compute warehouses only for the connector. This
configuration allows you to create [resource monitors](../../user-guide/resource-monitors.md) on these two warehouses. You can use the monitors to send
email alerts and suspend both warehouses, stopping the connector when the set credit quota is exceeded.

> **Note:**
>
> Setting the credit quota too low in configurations where large volumes
> of data are ingested may cause the connector to not ingest all data.

For information on how to check credits consumed by the warehouse, see
[Exploring compute cost](../../user-guide/cost-exploring-compute.md).
You can also assign [object tags](../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

### Storage Cost

The MySQL 6.0.0 connector stores data in:

* The connector database, which is created when installing the connector and which holds the connector internal state.
* One or many other databases, which are created when configuring data sources and where the ingested data is stored.

Data storage is also used by the Snowflake [Fail-safe](../../user-guide/data-failsafe.md) feature. The amount of data stored in Fail-safe depends
on table updates done by the connector. Hence, the amount of data increases if table rows ingested from a source
database are updated frequently or a whole table is reloaded. Typically, seven to ten days after the connector
is set up, the amount of Fail-safe data stabilizes (assuming that no reloads are performed and that the flow
of ingested data is at a steady rate).

If you want to check the storage usage using Snowsight, we recommend that you use separate databases
for storing ingested data. This way you can filter the graphs for the storage usage by an object, which shows usage
by individual databases. You can also view the storage usage by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../sql-reference/account-usage/database_storage_usage_history.md)
and filtering by databases used by the connector.

If a database contains other schemas not related to the connector, you can query the storage usage of a specific schema
that is dedicated to data ingested by the connector. You can get this information from the [TABLE_STORAGE_METRICS view](../../sql-reference/account-usage/table_storage_metrics.md)
after filtering by database and schema names and aggregating columns with storage usage.

### Data Transfer Cost

The connector uses the Snowflake [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) feature to transfer data from a source database
to a destination database in your Snowflake account.

For information on how to check credits consumed by the Snowpipe Streaming, refer to [Costs for Snowpipe Streaming Classic](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-billing.md) and [Snowpipe Streaming high-performance architecture: Understand your costs](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-cost.md).

## Determining optimal warehouse size for the connector instance

A major benefit is that the compute warehouse size can be adjusted to the data volume. The connector typically requires a XSMALL ops warehouse and a XSMALL compute
warehouse, and do not take advantage of larger warehouses during data ingestion.

To find the optimal warehouse size for the connector, you should consider the factors that affect the performance of the connector, such as the size of the source databases, the number of changes, the number of enabled datasources and tables.

We recommend that you define a set of measurable expectations, such as replication lag, and pick the smallest warehouse size that meets these expectations.
Alternatively, when you just want to try out the connector and enable a single table for ingestion, an X-Small warehouse should be sufficient.

To determine if you can downsize the warehouse, see [Monitoring warehouse load](../../user-guide/warehouses-load-monitoring.md).

---
title: Cost governance of the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/cost-governance.md
section: Connectors & Drivers
---

# Cost governance of the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for PostgreSQL.

## Measuring Cost of the Connector

If the connector has a separate account only for data ingestion and storage, and the account shows no other activity
(such as executing queries by users using the ingested data), you can read the overall cost on the account level.
To learn more, see [Exploring overall cost](../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector or you need to investigate the costs further,
you should analyze the charged costs for the three components separately:

* Compute Cost
* Storage Cost
* Data Transfer Cost

For an introduction to these three components of cost, see [Understanding overall cost](../../user-guide/cost-understanding-overall.md).

### General Recommendations

To obtain cost generated by the connector, we recommend that you create a separate account solely for using the connector.
Using a specific account you track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, consider the following:

* Create a separate database for storing ingested data to track storage cost easier.
* Allocate a warehouse only for the connector to get the exact compute cost.
* Use [object tags](../../user-guide/object-tagging/introduction.md) on databases and a warehouse to build custom cost reports.

### Compute Cost

We recommend that you use a pair of dedicated operations and compute warehouses only for the connector. This
configuration allows you to create [resource monitors](../../user-guide/resource-monitors.md) on these two warehouses. You can use the monitors to send
email alerts and suspend both warehouses, stopping the connector when the set credit quota is exceeded.

> **Note:**
>
> Setting the credit quota too low in configurations where large volumes
> of data are ingested may cause the connector to not ingest all data.

For information on how to check credits consumed by the warehouse, see
[Exploring compute cost](../../user-guide/cost-exploring-compute.md).
You can also assign [object tags](../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

### Storage Cost

The PostgreSQL 6.0.0 connector stores data in:

* The connector database, which is created when installing the connector and which holds the connector internal state.
* One or many other databases, which are created when configuring data sources and where the ingested data is stored.

Data storage is also used by the Snowflake [Fail-safe](../../user-guide/data-failsafe.md) feature. The amount of data stored in Fail-safe depends
on table updates done by the connector. Hence, the amount of data increases if table rows ingested from a source
database are updated frequently or a whole table is reloaded. Typically, seven to ten days after the connector
is set up, the amount of Fail-safe data stabilizes (assuming that no reloads are performed and that the flow
of ingested data is at a steady rate).

If you want to check the storage usage using Snowsight, we recommend that you use separate databases
for storing ingested data. This way you can filter the graphs for the storage usage by an object, which shows usage
by individual databases. You can also view the storage usage by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../sql-reference/account-usage/database_storage_usage_history.md)
and filtering by databases used by the connector.

If a database contains other schemas not related to the connector, you can query the storage usage of a specific schema
that is dedicated to data ingested by the connector. You can get this information from the [TABLE_STORAGE_METRICS view](../../sql-reference/account-usage/table_storage_metrics.md)
after filtering by database and schema names and aggregating columns with storage usage.

### Data Transfer Cost

The connector uses the Snowflake [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) feature to transfer data from a source database
to a destination database in your Snowflake account.

For information on how to check credits consumed by the Snowpipe Streaming, refer to [Costs for Snowpipe Streaming Classic](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-billing.md) and [Snowpipe Streaming high-performance architecture: Understand your costs](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-cost.md).

## Determining optimal warehouse size for the connector instance

A major benefit is that the compute warehouse size can be adjusted to the data volume. The connector typically requires a XSMALL ops warehouse and a XSMALL compute
warehouse, and do not take advantage of larger warehouses during data ingestion.

To find the optimal warehouse size for the connector, you should consider the factors that affect the performance of the connector, such as the size of the source databases, the number of changes, the number of enabled datasources and tables.

We recommend that you define a set of measurable expectations, such as replication lag, and pick the smallest warehouse size that meets these expectations.
Alternatively, when you just want to try out the connector and enable a single table for ingestion, an X-Small warehouse should be sufficient.

To determine if you can downsize the warehouse, see [Monitoring warehouse load](../../user-guide/warehouses-load-monitoring.md).

---
title: Cost governance of the Snowflake Connector for ServiceNow®
source: https://docs.snowflake.com/en/connectors/servicenow/cost-governance.md
section: Connectors & Drivers
---

# Cost governance of the Snowflake Connector for ServiceNow®

This topic provides best practices for cost governance and finding the optimal warehouse size for the Snowflake Connector for ServiceNow®.

## Measuring Cost of the Connector

If the connector has a separate account only for data ingestion and storage, and the account shows no other activity (such as executing queries by users using the ingested data), you can read the overall cost on the account level. To learn more, refer to [Exploring overall cost](../../user-guide/cost-exploring-overall.md).

If the account is not dedicated only to the connector or you need to investigate the costs further, you should analyze the charged costs for the three components separately:

* Compute Cost
* Storage Cost
* Data Transfer Cost

For an introduction to these three components of cost, refer to [Understanding overall cost](../../user-guide/cost-understanding-overall.md).

### General Recommendations

To obtain cost generated by the connector, we recommend that you create a separate account solely for using the connector. This way you can track the exact data transfer generated by the connector.

If you cannot use a separate account for the connector, try the following:

* Create a separate database for storing ingested data to track storage cost easier.
* Allocate a warehouse only for the connector to get the exact compute cost.
* Use [object tags](../../user-guide/object-tagging/introduction.md) on databases and a warehouse to build custom cost reports.

### Compute Cost

We recommend that you create a separate warehouse only for the connector. This setup allows you to create [resource monitors](../../user-guide/resource-monitors.md) on the warehouse. You can use the monitors to send email alerts and suspend the warehouse, stopping the connector when the set credit quota is exceeded. The connector automatically resumes after the credit quota is renewed. Note that setting credit quota too low in configurations where large volumes of data are ingested may cause the connector to not ingest all data.

For information on how to check credits consumed by the warehouse, refer to [Exploring compute cost](../../user-guide/cost-exploring-compute.md). You can also assign [object tags](../../user-guide/object-tagging/introduction.md) to the warehouse and use the tags to create cost reports.

If the warehouse used by the connector is used by other workflows, you can split the cost by roles. To split usage by roles, use the [query for splitting warehouse usage](../../user-guide/cost-attributing.md) and add the following `WHERE` clause on the QUERY_HISTORY view:

```sqlsyntax
WAREHOUSE_NAME = '<connector warehouse name>' AND ROLE_NAME = 'APP_PRIMARY'
```

The query gives only an approximation of the cost.

> **Note:**
>
> Only one native app may use the warehouse, otherwise costs of different applications are inseparable because each native app uses the same role name (APP_PRIMARY).

### Storage Cost

The Snowflake Connector for ServiceNow® stores data in two places:

* The connector database, which is created from the listing and which holds the connector internal state.
* The user-specified schema where the ingested data is stored.

Data storage is also used by the Snowflake [Fail-safe](../../user-guide/data-failsafe.md) feature. The amount of data stored in Fail-safe depends on the table updates done by the connector. The amount of data increases if the table rows ingested from ServiceNow® are updated frequently or the whole table is reloaded. Typically, seven to ten days after the connector is set up, the amount of Fail-safe data stabilizes (assuming that no reloads are performed and that the flow of ingested data is at a steady rate).

If you want to check storage usage in Snowsight, we recommend that you have a separate database for storing ingested data. This way you can filter the graphs for storage usage by object, which shows usage by separate databases. You can also do it by querying the [DATABASE_STORAGE_USAGE_HISTORY view](../../sql-reference/account-usage/database_storage_usage_history.md) and filtering by both databases used by the connector.

If the database contains other schemas not related to the connector, you can query storage usage of a specific schema that is dedicated to the data ingested from the connector. You can get the information from [TABLE_STORAGE_METRICS view](../../sql-reference/account-usage/table_storage_metrics.md) after filtering by database and schema names and aggregating columns with storage usage.

### Data Transfer Cost

The connector uses external access to retrieve data from ServiceNow®. Snowflake charges only for egress traffic generated by the connector, based on the size of the requests from the connector to ServiceNow®.
The responses from ServiceNow® do not generate cost on Snowflake side.

Information on data transfer usage is available only in the aggregated form for all external access integrations on the account level. To access the number of transferred bytes, use the [DATA_TRANSFER_HISTORY view](../../sql-reference/account-usage/data_transfer_history.md) and filter by the EXTERNAL_ACCESS transfer type.

### Healthcheck Task Cost

The connector creates a serverless task that will regularly check health status of your app instance and send **only** the summarized result (if it’s healthy or not) to Snowflake.
The task is created after completing the installation wizard (or calling `FINALIZE_CONNECTOR_CONFIGURATION` in worksheets). It runs in the background and generates a fixed cost of up to 0.5 credit/day
even if no ServiceNow® table is enabled for replication.

The task cannot be manually stopped or dropped. However, to reduce this cost you can call `PAUSE_CONNECTOR` procedure which will disable the task and not generate any cost when the connector is unused.

## Cost Optimization

### Determining the Optimal Warehouse Size for the Connector Instance

To find the optimal warehouse size for the connector, you should consider the factors that affect the performance of the connector, such as the size of ServiceNow® instance, the number of enabled tables, and the schedule for synchronizing each table. For example, if only a few tables are enabled the connector might not benefit from increased parallelization.

We recommend that you define a set of measurable expectations, such as time intervals in which all tables should be synchronized, and pick the smallest warehouse size that meets these expectations. For large amounts of ingested data with tens of synchronized tables, the default recommendation is Large warehouse. On the other hand, when you just want to try out the connector and enable a single table for ingestion, an X-Small warehouse should be sufficient. To find out if you can downsize the warehouse, refer to [Monitoring warehouse load](../../user-guide/warehouses-load-monitoring.md).

### Starting and Stopping the Connector Automatically Within a Specified Timeframe

To save on cost, you can run the connector only during a specified timeframe (for example, outside business hours), by calling the `PAUSE_CONNECTOR` and `RESUME_CONNECTOR` procedures.

You can automate pausing and resuming the connector with tasks. For example, to run the connector outside of UTC business hours, you might use the following query:

```sqlexample
CREATE TASK start_connector_after_business_hours
   WAREHOUSE = <my_warehouse>
   SCHEDULE USING CRON 0 17 * * MON-FRI Europe/London
   AS CALL <my_connector_servicenow>.PUBLIC.RESUME_CONNECTOR();

CREATE TASK stop_connector_before_business_hours
   WAREHOUSE = <my_warehouse>
   SCHEDULE USING CRON 0 9 * * MON-FRI Europe/London
   AS CALL <my_connector_servicenow>.PUBLIC.PAUSE_CONNECTOR();
```

### Setting Up Custom Data Ingestion Schedule

It’s possible to configure specific ingestion schedules for any table. This can be used to reduce warehouse usage and load on ServiceNow® instance
during the business hours. For example, to ingest a table `example_table` only once a week during a weekend run:

```sqlexample
CALL <my_connector_servicenow>.PUBLIC.CONFIGURE_TABLES_SCHEDULE(['example_table'], { 'type': 'custom', 'value': { 'hour': 11, 'dayOfWeek': '6' } });
```

This will make the connector ingest data for table `example_table` on 11 AM UTC every Saturday.

---
title: Data ingestion model for the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-data-ingestion-model.md
section: Connectors & Drivers
---

# Data ingestion model for the Snowflake Connector for Google Analytics Raw Data

This topic provides information on the data ingestion models supported by the Snowflake Connector for Google Analytics Raw Data.

## Google Analytics to BigQuery export

Google Analytics supports three types of BigQuery exports:
:   * Daily: Google Analytics exports data to tables named in the `events_XXXXXX` format. Tables are created once daily, after the end of the day, when all the events for the given day are collected.
    * Fresh Daily: Google Analytics exports data to tables named in the `events_fresh_XXXXXX` format. Tables are created and refreshed according to the scheduler, with a maximum frequency of once per hour. This feature is available only for the Google Analytics 360 customers.
    * Streaming: Google Analytics continuously exports data throughout the day, and stores it in a table named in the `events_intraday_XXXXXX` format.
    * Users: Google Analytics export containing user data related to the collected events. Tables are stored in BigQuery and named in the `users_XXXXXX` and `pseudonymous_users_XXXXXX` formats.

The connector supports these three types of exports and automatically downloads all the tables it finds in BigQuery, without requiring any additional configuration.

## Sink tables

For each property, the connector saves the events into property-specific tables, which are created in a database and a schema provided during connector configuration.

For each of the properties, there might be up to four sink tables created, depending on which export types were enabled. The tables are named as follows:

> * `ANALYTICS_<propertyId>`
> * `ANALYTICS_INTRADAY_<propertyId>`
> * `USERS_<propertyId>`
> * `PSEUDONYMOUS_USERS_<propertyId>`

## Daily table ingestion

The connector downloads the entire table in a single run when it recognizes the table is present in BigQuery. Google cautions the daily tables can be updated up to 72 hours after the table was created.
To ensure data consistency, the connector reloads tables after 72 hours, (Note that the exact reload time is dependent on the connector ingestion schedule).
Updates in BigQuery made after 72 hours since table creation, won’t be reflected in Snowflake. Such tables can be reloaded manually, using one of the [RELOAD_PROPERTY](gard-connector-setting-up-data.md) procedures.

## Fresh Daily table ingestion

After every successful ingestion run of the connector, reloads are continuously created to reload the table for up to 96 hours, 24 hours on the day the
table is created and 72 hours when data updates can occur). Reloads will follow each successful ingestion run, triggered after every dispatcher run,
with a maximum frequency of once per hour. The last reload date is calculated based on the table name and the allocated 96-hour period.

If a fresh daily ingestion needs to catch up, for instance due to a connector pause, the connector will ingest all the tables sequentially.
Reloads will not be created if they are unnecessary, that is, if more than 96 hours have passed since the table was created.

This feature is available only for the Google Analytics 360 customers. Fresh daily exports can be enabled manually by using the `ENABLE_PROPERTIES` or `UPDATE_INGESTION_OPTIONS` procedures.

## Intraday ingestion

The connector supports downloading historical intraday tables (if they are present in BigQuery) and ongoing ingestion of intraday tables still receiving updates.

For past days, the connector downloads intraday tables the same way it foes daily ones – each table is downloaded in whole, one table at a time, until the process reaches the present day’s data.

When the connector recognizes that an intraday table is the last one in BigQuery, it starts processing the table incrementally. This means it downloads incoming batches of data from the table throughout the day, at a constant interval, which is 8 hours by default.

When any of the following conditions are met:

> * A next-day table appeared in the BigQuery dataset
> * 24 hours passed since the first load for the given table

the connector does a final ingestion for the given intraday table and switches to the next one.

> **Note:**
>
> A small number of events may not be ingested if events are delayed more than 10 minutes. Immediately after the incremental load of a intraday table is finished, the connector verifies whether there are any lost events, and if so schedules a table reload to ensure data consistency between Snowflake and BigQuery.

## User data tables ingestion

User data table ingestion is based on the same mechanism as daily tables ingestion.

## Scheduling

Connector checks whether new tables exist in BigQuery and then schedules ingestions of them (or its parts in case of incremental intraday ingestions) into Snowflake when:

> * Task is triggered according to configured schedule
>   :   + By default it is every 8 hours
>       + Using [CONFIGURE_INGESTION_INTERVAL](gard-connector-managing.md) you can change the default interval value if you need more/less frequent updates.
> * Connector finished ingestion of last scheduled table
>   :   + In consequence, this means that schedules are more frequent than it stems from the configuration, since there should be at least one ingestion per day, which means at least one extra check.
>       + In particular, when there is initial load ongoing, and there are a lot of tables to ingest, after ingesting each of the tables, the scheduling mechanism is triggered.

---
title: Database connector concepts
source: https://docs.snowflake.com/en/connectors/db-connector-concepts.md
section: Connectors & Drivers
---

# Database connector concepts

## Connector components

The Snowflake Database connectors consist of a Snowflake Native App, installed from the Marketplace into your Snowflake account,
and an **Agent** application running inside your infrastucture, either on-premise, or in the cloud.

* **Agents** connect directly to your source databases, track updates on tables that you chose to replicate, and upload changes into your Snowflake account.
  Agents requires one-time configuration for connecting to Snowflake and the data sources, and afterwards only upgrading when new versions are released.
  Beyond that, it’s entirely controlled and configured via the Native App.
* **Native Apps** control the process of replicating data. They instruct the agents on which tables to track, receive the changes,
  and merge them into your destination databases. Most of your interaction will be with the Native App. It is upgraded automatically, when a new version becomes available.

This model in which an agent runs locally is necessary so that the connector can securely access source databases in networks that are closed to external connections.

One Agent instance always connects to a single Native App instance, and one Native App always works with one Agent.
If you need to run multiple Agent instances, perhaps to replicate source databases from disconnected networks,
you will need to install and configure multiple instances of the Native App. For assistance, [contact Snowflake Support](../user-guide/contacting-support.md).

> **Note:**
>
> For optimal performance, keep the Agent at the same version as the Native App, and upgrade it regularly.
> Currently Snowflake ensures compatibility between all publicly available Agent versions and the Native App.

Internally, the connector relies on an asynchronous, event-driven exchange of commands.
The Native App must also communicate and coordinate with the Agent. This is why you can notice a delay between the execution of a command, and seeing the effect of that command.

## Data sources, tables, journals and destinations

When talking about the data flow through the connector, we distinguish the following stages:

Data Source
:   The database that holds the tables that the connector is replicating. Depending on the database engine,
    this can either be the whole database *server* or one of the databases hosted *inside* the server.

    A single connector instance can replicate from multiple Data Sources, as long as the Agent can directly connect to all of them.

Source Table
:   A specific table in the source database that is tracked by the connector for changes which are then replicated into Snowflake.
    Each Data Source may contain multiple Source Tables that are replicated simultaneously by the same connector instance.

    The immediate parent of the Source Table in the Data Source becomes the schema of the corresponding Destination Table in Snowflake.

Journal
:   A Snowflake table, owned and managed by the connector’s Native App, that receives and stores every change applied to the Source Table:
    inserts, updates, and deletes. It’s a de-facto changelog of the Source Table’s data, and
    its structure reflects how database engines typically broadcast changes to their replicas.

    Every Source Table has a separate Journal table.

Destination Table
:   The Snowflake table in your account where the connector replicates data into.
    There’s a separate Destination Table for every Source Table.
    Its column names reflect the names in the Source Table, and their types are corresponding Snowflake types for the source columns.

    Each Destination Table also includes columns with replication information:
    `_SNOWFLAKE_INSERTED_AT`, `_SNOWFLAKE_UPDATED_AT`, `_SNOWFLAKE_DELETED`,
    holding the timestamps of the original insertion, last update, and deletion of the given row, respectively.

    The Destination Table has change tracking pre-enabled to allow for creating streams. The connector’s Native App keeps the `OWNERSHIP` grant on the table.

## Snapshot and incremental load

Replicating data from a newly added table begins with a **Snapshot Load**.
The Agent performs a single `SELECT <columns>` statement on the source table, t
hen streams all the records into an interim table in Snowflake, and afterwards copies them into the destination table.
This operation can be resource-intensive on the source database, and will typically take a long time for large tables.
You may need to wait until you see first records appear in the destination table.

A Snapshot Load can also be repeated, replacing previously-replicated data, to synchronize the source table with the destination, in the following scenarios:

* When the table’s replication fails permanently, due to unsupported data types, sizes, connector bugs, or other issues.
* When replication was paused, and after resuming, the source database’s changelog no longer contains entries since the last time the table was replicated.

After the initial snapshot is complete, the table’s replication turns to **Incremental Load**.
The Agent tracks the source database’s changelog, and streams these changes into the corresponding journal table,
from where they are later merged into the destination table. This cycle of reading, streaming, and merging can either be performed
continuously, or on a schedule. For more information about these modes,
see [Next steps](postgres6/configure-replication.md) and [Next steps](postgres6/configure-replication.md).

## Table replication lifecycle

A newly added source table’s replication cycle starts with **Schema Introspection**.
This is where the connector discovers the columns in the source table, their names, types,
then validates them against Snowflake’s and the connector’s limitations. Validation failures will cause this stage to fail,
and the cycle completes. After successful completion of Schema Introspection, the connector creates an empty destination table.

The second stage is **Snapshot Load** where the connector copies all data available in the source table into the destination table.
Failure of this stage will also finish the cycle, and no more data will be replicated. After successful completion,
the whole set of data from the source table will be available in the destination table.

Finally, the table moves on to **Incremental Load**, where the connector keeps tracking changes in the source table, and copying them into the destination table.
This continues until the table is removed from replication. Failure at this stage will permanently stop replication of the source table, until the issue is removed.

For instructions on how to determine which replication phase your tables are currently in, see [Monitoring the Snowflake Connector for MySQL](mysql6/monitor.md) and [Monitoring the Snowflake Connector for PostgreSQL](postgres6/monitor.md).

> **Note:**
>
> To resume replication for a failed table, once the issue that caused failure is fixed, remove the table from replication, and then add it again.
> For more information, see [Configuring replication for the Snowflake Connector for MySQL](mysql6/configure-replication.md) and [Monitoring the Snowflake Connector for PostgreSQL](postgres6/monitor.md).

## Flow of data from source to destination

The connector moves data differently from the source table into the destination,
depending on whether the its performing a Snapshot or Incremental Load.
For more information see Snapshot and incremental load.

### Snapshot load flow

1. The Agent performs a `SELECT <columns> FROM <source table>` on the source table, and inserts those records,
   using Snowflake’s Snowpipe Streaming, into an interim table called the Snapshot Table, stored inside the connector’s Native App.
2. Once all of the available rows are present in the Snapshot Table, the Native App runs a task that copies them into the
   destination table via `INSERT INTO <destination table> (SELECT <columns> FROM <snapshot table>)`.
3. After all the rows were copied into the destination table, the Snapshot Table is dropped.
   The replicated table is ready to move on to Incremental Load.

### Incremental load flow

1. The source database publishes changes on the source table into its changelog.
   The specific mechanism depends on the type of source database, but generally these list inserts, updates and deletes row by row.
2. The agent reads these changelogs in real time, and inserts these row-by-row changes into the corresponding journal tables.
3. A merge task detects the new entries in the journal in real time, and merges them into the destination table: inserting new records,
   updating or soft-deleting existing records. The merge task also adds timestamps for these changes into columns
   described in Data sources, tables, journals and destinations.

In Scheduled replication mode, the reading of the source database’s changelog, moving these changes into Snowflake,
and merging them into the source database are *not* performed continuously,
but on a fixed schedule instead. See Continuous vs. scheduled replication for details.

## Continuous vs. scheduled replication

The default replication mode for newly added data sources is **Continuous**. In this mode, the connector aims to replicate data changes as quickly as possible.
It’s the optimal mode for data sources that change often, where you need that data to be available in Snowflake at low latencies.

You can change the data source to replicate in **Scheduled** mode, where data is copied from the source and into destination tables in batches,
on a fixed schedule. This is the optimal mode for data sources that change infrequently, or when you intend to reduce credit consumption, and don’t require the data to be available in Snowflake at low latencies.

> **Note:**
>
> The replication mode can be set *per data source*, and will uniformly affect all tables configured for that data source. Setting a different mode or schedule per table is not supported.

When running Continuous replication, incremental load will never be reported as “completed”.
In Scheduled mode, incremental load is reported as completed after every batch of data is merged into the destination table.
A “batch” in Scheduled mode consists of all the changelog entries between the previous scheduled run, and the moment the next scheduled run is started.

## Warehouse types

Connectors requires two warehouses to operate:

* An **Operational Warehouse**, sometimes refered to as an Ops Warehouse, is used to execute the connector’s command & control operations.
  This warehouse is automatically created by the setup wizard, and its optimal size is XS.
* A **Compute Warehouse** is used to execute the merge tasks that move data from journals into destination tables.
  This warehouse may be created by the setup wizard, or manually. Its optimal size and type depend on the scale of your replication.

The above distinction is required to ensure that operational queries are executed in a timely fashion,
without being queued together with the queries that move large quantities of data.
This also means that the Operational Warehouse *cannot* be reused between connector instances, and should not be shared with other workloads in the account.

The Compute Warehouse, in turn, *can* be shared with other connector instances, and workloads.
Keep in mind, though, that sharing this warehouse may cause delays in data appearing in your destination tables.

> **Important:**
>
> The Operational Warehouse will *never* suspend when working in continuous mode which will cause it to consume credits even if no data is being replicated.
> To enable auto-suspend, change the replication mode for *all* data sources to scheduled. See Continuous vs. scheduled replication for details.

---
title: Deprecated — Snowflake Connector 1.x for Informatica Cloud
source: https://docs.snowflake.com/en/connectors/informatica-cloud-connector.md
section: Connectors & Drivers
---

# *Deprecated* — Snowflake Connector 1.x for Informatica Cloud

This topic contains information about how to set up and use version 1.x of the Snowflake Connector. It explains how Informatica Cloud organization administrators and business users can use the Snowflake
Connector to publish data to Snowflake.

The connector implements the Informatica Cloud Connector SDK. It can be deployed on both Informatica Cloud and Informatica PowerCenter 9.6.1. For assistance deploying the connector on PowerCenter, please
contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Note:**
>
> Snowflake also provides an ODBC library that can be used for data integration with Informatica’s products; currently, this library only supports read functionality.

## Introduction to Snowflake Connector

### Snowflake Connector Overview

Snowflake provides programmatic APIs for querying and modifying data in the form of industry-standard ODBC and JDBC libraries. The ODBC library can be used with Informatica products using standard ODBC
connectors. See the Informatica documentation for configuring the ODBC connector. The ODBC library can be downloaded from your Snowflake account. However, writing or updating large volumes of data into
Snowflake using ODBC is often not the most efficient or effective way to perform these operations.

The Snowflake Connector is designed to improve throughput of bulk insertion, modification, and deletion of large numbers of rows in Snowflake. It works by caching row-by-row data it receives through
Informatica, uploading it asynchronously to cloud storage in the form of compressed character-delimited files, and importing data from the files using the Snowflake COPY command.

### Snowflake Connector Implementation

Data submitted for processing is staged within the internal stage for the configured connection user (identified by the `~` character).

Subdirectories are created in the stage for each job. Multiple batches can be processed, with a corresponding subdirectory within the user stage for each batch. Each subdirectory
includes the following information:

* Name of the target table.
* Name of the operation (INSERT, DELETE, UPSERT, MODIFY).
* Timestamp and a unique identifier (consecutive number).

In the Query History page in Snowsight, the following commands are displayed for the user configured to run the process:

> * SQL statements configured to be run before a job.
> * Sequence of PUT commands to upload data files to the stage.
> * Creation of a temporary table to stage data.
> * COPY command to import data into the staging table, optionally in validation mode to first identify/retrieve data conversion errors.
> * DELETE, MERGE or INSERT command to process the data.
> * RM command to clean up staged files from the stage.

This sequence may be modified by the connector to optimize performance.

Data errors are reported to Informatica to be written to the error file session log, and may terminate the job if it is so configured. On its own, the Snowflake loading process skips all data conversion
errors.

## Snowflake Connections

### Snowflake Connection Overview

The Snowflake Connector uses the Snowflake JDBC driver to connect. The driver library is included in the connector distribution.

### Snowflake Connection Properties

The Snowflake Connector uses the following properties for connecting to Snowflake:

* USER and PASSWORD
* Snowflake URL
* Start transaction for jobs
* Abort on data errors
* Propagate data stream

#### USER and PASSWORD

User name and password for the account that will be used for the loading process. Snowflake recommends using a dedicated user with the appropriate write privileges for the table where data will be loaded.

#### Snowflake URL

JDBC URL for connecting to the Snowflake database and schema in your account. For example:

> `jdbc:snowflake://xy12345.snowflakecomputing.com/?db=load&schema=etl`

Where:

* `xy12345` is the name of your account (provided by Snowflake).
* If your account is located in a region other than US West, the JDBC connection string must also include the [region ID](../user-guide/intro-regions.md) after your account name in the form of
  `<account_id>.<region_id>.snowflakecomputing.com`.
* `load` is the name of the default database to use for loading data.
* `etl` is the name of the schema (in the `load` database) containing the tables to be loaded.

> **Note:**
>
> During design time, metadata browsing is limited to the Snowflake schema and database specified in the connection or in the search path of the user.

> **Tip:**
>
> If you run a job that includes a large set of data and very complex transformations, it may take a long time to complete. If the job takes over 4 hours, the Snowflake connection token may expire. To
> avoid this situation, you can specify the `client_session_keep_alive` parameter in the JDBC connection string, which prevents the connection token from expiring. For example:
>
> > `jdbc:snowflake://xy12345.snowflakecomputing.com/?...&client_session_keep_alive=true`

#### Start transaction for jobs

If set, the connector will initiate a transaction before the start of every job, and commit or rollback upon the completion or failure of the job.

> **Note:**
>
> Informatica does not support operation rollback or disconnect in the connector API. Terminating a job may leave hanging table locks and an uncommitted transaction
> that may be needed to be released manually from the Snowflake command line.

#### Abort on data errors

When this property is selected, every job will stop processing if any data conversion errors are encountered during data import. To rollback partial changes when errors
are encountered, also set **Start transaction for jobs**.

> **Note:**
>
> Because data is loaded asynchronously, some data may already be committed if this property is used and more than one batch of data was generated.

#### Propagate data stream

The connector implements midstream write interface that allows chaining of data processing. If this property is selected, the connector will pass data for further
processing.

For better performance, do not select this property.

## Snowflake Data Synchronization Tasks

The connector provides advanced target properties for specifying Snowflake-specific actions and properties to use when a data synchronization task is performed.

### Snowflake Advanced Target Properties

The following table describes the advanced target properties that can be specified for a data synchronization task:

| Advanced Target Property | Description |
| --- | --- |
| **Update key columns** | Semicolon separated list of column names in the target table that should be used as a composite key for DELETE or MODIFY operations. |
| **Execute before** | SQL statement that will be executed prior to start of a job. |
| **Truncate table** | Delete all data from the target table prior to execution of the job. This statement is completed after execution of the **Execute before** statement. |
| **Execute after** | SQL statement that will be executed after completion of a job. |
| **Process data in one batch** | When this property is checked, the connector will upload all data from the job prior to processing it. |
| **Preserve stage file on Error** | Preserve staged data file when an error occurs in loading data. This property is valid only if Abort on data error is enabled. |
| **Use Local Timezone** | Use agent local timezone to convert TIMESTAMP/datetime data. By default, UTC is used in conversions. |
| **Success File Directory** | Not currently used. |
| **Error File Directory** | Not currently used. |
| **Database Override** | Name of the database to update; overrides the target database defined for the data synchronization task. Do not specify values for database override, schema override, or table override in a Data Synchronization task. You can specify the values in a PowerCenter session. |
| **Schema Override** | Name of the schema to update; overrides the target schema defined for the data synchronization task. Do not specify values for database override, schema override, or table override in a Data Synchronization task. You can specify the values in a PowerCenter session. |
| **Table Override** | Name of the table to update; overrides the target table defined for the data synchronization task. Do not specify values for database override, schema override, or table override in a Data Synchronization task. You can specify the values in a PowerCenter session. |

**Usage Notes:**

* Snowflake does not enforce primary or foreign key constraints, and does not preserve metadata for keys. You must specify the **Update key columns**
  property even if corresponding columns are marked as key in the Informatica environment.
* The **Process data in one batch** property may delay completion of the job, but guarantees that no data will be persisted in case of a failure,
  and without the overall transaction. Processing maximum amount of data at a time also maximizes utilization of Snowflake warehouse parallelism.
* The **Database Override**, **Schema Override**, and **Table Override** attributes are used by PowerCenter to provide values at runtime that
  override the target database, schema, and/or table for the data synchronization task. This enables using the same data synchronization task
  to update tables in multiple databases and schemas. The fields are blank by default and should be left blank because the values for the attributes
  are provided at runtime.

---
title: Disaster recovery
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-disaster-recovery.md
section: Connectors & Drivers
---

# Disaster recovery

The GAAD connector stores metadata about configured reports and its own configuration within the application instance.
When the application is dropped or becomes corrupted, this internal state is lost.
To prevent this, the connector exports the metadata to the destination database alongside the ingested data during specific events, such as the following:

> * Configuring a new report
> * Deleting a report
> * Fetching new batches of data from Google Analytics
> * Changing the fetch page size for a report

The export process creates several tables in the destination schema to store the connector’s internal state.
These tables do not contain the ingested data but are essential for recovering the connector’s state after the application
is dropped or becomes corrupted. When replicated, these tables can also be used to recover the state of the connector on a different Snowflake account.
The following tables are created by the export process:

> * `APP_CONFIG_SFSDKEXPORT_V1`
> * `APP_STATE_SFSDKEXPORT_V1`
> * `CONNECTOR_ERRORS_LOG_SFSDKEXPORT_V1`
> * `INGESTION_PROCESS_SFSDKEXPORT_V1`
> * `INGESTION_RUN_SFSDKEXPORT_V1`
> * `NOTIFICATIONS_STATE_SFSDKEXPORT_V1`
> * `RESOURCE_INGESTION_DEFINITION_SFSDKEXPORT_V1`

## Importing existing data and reports to a new instance of the connector.

If the GAAD connector has been uninstalled or corrupted, it is possible to resume ingestion of previously configured reports, provided that the destination
database was not dropped. The metadata for reports configured in the connector is saved in the destination database alongside the ingested data.
To continue ingesting data after installing a new connector instance, follow these steps:

1. Configure the connector.

   Configure the connector by following the instructions in [Install and configure the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-installing.md).
   When choosing the destination database and schema, select the existing schema that contains data ingested by the previous instance of the connector.
2. Grant required privileges to the connector.

   > **Note:**
   >
   > This step is not required if you installed and configured the new connector using Snowsight. Execute it only if you installed the connector using SQL commands.

   Execute the following command to ensure that the newly installed GAAD connector becomes the owner of all objects in the existing schema:

   ```sqlexample
   system$grant_ownership_to_application('your_application_instance', true, 'database', 'schema');
   ```

   Where **database** and **schema** are the names of the existing database and schema, respectively.
3. Pause the connector.

   > ```sqlexample
   > call pause_connector();
   > ```
4. Import the existing data and reports.

   Import the existing data and reports by executing the following command from the context of installed application:

   ```sqlexample
   call import_state(force => true);
   ```

   The **force** parameter is set to **true** to ensure that any changes that might have been made to the freshly installed connector
   are overwritten with the reports and internal data from the old installation.
5. Resume the Connector

   ```sqlexample
   call resume_connector();
   ```

At this point, the new instance of the Snowflake Connector for Google Analytics Aggregate Data connector should resume ingestion of the existing reports.

## Replicating the destination database and connector state to another snowflake deployment

This section describes the steps to replicate the content of the destination database.
The destination database contains the ingested data and the metadata for the reports configured in the connector.
If the connector or the data downloaded by the GAAD connector is critical for your business, consider setting up a secondary Snowflake account in a different region
and replicating the destination database to the secondary account.

### Terms and definitions

> * **Destination Database** - the database configured as the target for the data ingested by the GAAD connector. This is also the database where the connector’s internal state is exported to.
> * **Sink Database** - the schema configured as the target for the data ingested by the GAAD connector.
> * **Internal State** - the internal data and configuration of the GAAD connector, for example report configurations, ingestion state, and error logs.
> * **GAAD instance** - the Snowflake Connector for Google Analytics Aggregate Data connector instance installed on the Snowflake account.
> * **GAAD** - Snowflake Connector for Google Analytics Aggregate Data
> * **ACCOUNT_PRIM** - example name of primary account
> * **ACCOUNT_SEC** - example name of secondary (replica) account
> * **APP_PRIM** - example Snowflake Connector for Google Analytics Aggregate Data connector instance name installed on the primary account
> * **APP_SEC** - example Snowflake Connector for Google Analytics Aggregate Data connector instance name installed on the secondary account
> * **DST_DB.DST_SCHEMA** - example destination schema name for the GAAD instance (where data is ingested and the connector’s internal state is saved)
> * **DST_DB** - example destination database name configured for the GAAD connector
> * **MYORG** - example name of your organization (both accounts must be in the same organization)

### Introduction

When installed on your account, the Snowflake Connector for Google Analytics Aggregate Data connector (GAAD instance) appears as a regular database that contain data, procedures etc.
However, it cannot be replicated to a secondary account in the same way as a regular database.
Currently, there is no native mechanism to replicate the GAAD instance with its internal state to a replica account.
Specifically, the installed application cannot be added to a replication group.

Instead of replicating the GAAD instance directly, the connector exports the metadata for configured reports to the destination schema configured during the connector setup process.
The state is saved there and can be replicated alongside the ingested data.

For example, if you configured the connector to ingest data into the destination schema DEST_DATABASE.PUBLIC,
the connector automatically saves its internal state to this schema.
You can then replicate both the ingested data and the internal state using the following command:

> ```sqlexample
> create replication group gaad_dest_database_group
>   object_types = databases
>   allowed_databases = dst_db
>   allowed_accounts = ...;
> ```

### Setting up replication of ingested data and configured reports

> > **Note:**
> >
> > Always test your disaster recovery procedures to verify that data and state replication are functioning as expected.
>
> > **Note:**
> >
> > The following sections contain instructions applicable to all versions of Snowflake.
>
> > **Note:**
> >
> > Before proceeding, familiarize yourself with Snowflake Replication <https://docs.snowflake.com/en/user-guide/account-replication-intro>

1. **Installing GAAD on the primary account**

   Install and configure Snowflake Connector for Google Analytics Aggregate Data on the primary account. For detailed instructions, see [Install and configure the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-installing.md).

   On the primary account, create a replication group and add DST_DB as an allowed database:

   > ```sqlexample
   > -- on primary account
   > create replication group gaad_rep_group_prim
   >   object_types = databases
   >   allowed_databases = dst_db
   >   allowed_accounts = myorg.account_sec
   >   replication_schedule = '10 minute';
   > ```
2. **Setting up replication on the secondary account**

   To replicate DST_DB from the primary account to the secondary account, create a new replication group on the secondary account:

   ```sqlexample
   -- on secondary account
   create replication group gaad_rep_group_sec
     as replica of myorg.account_prim.gaad_rep_group_prim;

   alter replication group gaad_rep_group_sec refresh;
   ```

   At this point, a read-only DST_DB database should be created on the secondary account, and data from the primary account
   will be replicated according to the configured schedule.
3. **Installing GAAD on the secondary account**

   Install and configure Snowflake Connector for Google Analytics Aggregate Data on the secondary account in the same way as on the primary account.
   Point the instance to ingest data into the replicated database and schema.
   While replication is ongoing (until the replication group on the secondary account is dropped),
   the database is in read-only mode. GAAD can be configured to use a read-only database as the ingestion target;
   however, it cannot ingest data until the database transitions to read-write mode.

   After configuring the connector on the secondary account, pause the connector by executing:

   > ```sqlexample
   > -- on secondary account
   > call pause_connector();
   > ```

   At this point, the GAAD connector is installed and ready to take over if the primary account fails.

### Recovery procedure

When the primary deployment becomes unavailable, configure GAAD instance on the secondary account to continue ingestion.

> > **Note:**
> >
> > All steps must be executed on the secondary account.

1. **Drop the replication group**

   Drop the replication group on the secondary account to transition the replicated database to read-write mode:

   ```sqlexample
   drop replication group gaad_rep_group_sec;
   ```
2. **Grant ownership of existing database objects to the connector**

   Grant ownership of all objects in the replicated schema to the GAAD connector by executing:

   ```sqlexample
   call system$grant_ownership_to_application('app_sec', true, 'dst_db', 'dst_schema');
   ```
3. **Import the state**

   Initialize the connector with the state replicated from the primary account:

   > ```sqlexample
   > call import_state(false);
   > ```
4. **Resume the connector**

   Resume the connector by executing:

   ```sqlexample
   call resume_connector();
   ```

   At this point, the GAAD connector on the secondary account should resume data ingestion, continuing from where the GAAD on primary account left off.

   > > **Note:**
   > >
   > > Ensure that both the primary and secondary accounts are part of the same organization. The replication schedule can be adjusted based on your requirements.

---
title: Install and configure Snowflake Connector for Snowflake Connector for Microsoft Power Platform
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/tasks.md
section: Connectors & Drivers
---

# Install and configure Snowflake Connector for Snowflake Connector for Microsoft Power Platform

Working with the Snowflake Connector for Microsoft Power Platform involves several key tasks.

Before enabling data replication, it’s important to carefully review each of the installation and configuration steps.

## Installation and configuration steps

These steps are designed to guide you through the initial setup of the Connector. The installation process involves various stages and contributions from different departments.

| Order | Task | Description |
| --- | --- | --- |
| 1 | [Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID](configure-oauth.md) | Configure OAuth in Microsoft Entra |
| 2 | [Snowflake Connector for Microsoft Power Platform: Create OAuth client in Microsoft Entra ID](create-oauth-client.md) | Create a Microsoft Entra OAuth client |
| 3 | [Snowflake Connector for Microsoft Power Platform: Collect Azure AD information for Snowflake](collect-azure-ad-info.md) | Collect the required Azure AD information used in later steps |
| 4 | [Snowflake Connector for Microsoft Power Platform: Create a security integration](create-security-integration.md) | Create the required security integration in Snowflake |
| 5 | [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup](validate-entra-auth.md) | Use cURL or a similar tool to validate the configuration |
| 6 | [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Snowflake access](validate-sf-access.md) | Validate Snowflake access permissions and connectivity |

---
title: Install and configure the connector with Snowsight
source: https://docs.snowflake.com/en/connectors/servicenow/installing-snowsight.md
section: Connectors & Drivers
---

# Install and configure the connector with Snowsight

This topic provides information on installing and configuring the Snowflake Connector for ServiceNow® through
Snowsight.

## Install the Snowflake Connector for ServiceNow®

The following procedure describes how to install the connector:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for ServiceNow®, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select Get.

   This displays a dialog that you use to begin the initial part of the installation process.

   In the dialog configure the following:

   1. In the Warehouse used for installation field, select the warehouse that you want to use for
      installing the connector.

      > **Note:**
      >
      > This is not the same warehouse that is used by the connector to synchronize data from ServiceNow®. In a
      > later step, you will create a separate warehouse for this purpose.
   2. Optionally, under Options » Application name you can change the name of the application.
   3. Select Get.
5. A dialog appears with the notification: `Successfully Installed`. To continue configuration, select Configure.

   The dialog closes, and the Snowflake Connector for ServiceNow® page displays the UI for configuring
   and managing the connector.

## Configure the Snowflake Connector for ServiceNow®

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with either the ACCOUNTADMIN role or any other role that meets the following requirements:

   * You must have these account-level privileges:

     + EXECUTE TASK WITH GRANT OPTION
     + EXECUTE MANAGED TASK WITH GRANT OPTION
   * EVENT_TABLE must be enabled on the account.
   * For warehouse access, you must have at least one of the following privileges:

     + The CREATE WAREHOUSE
     + OWNERSHIP
     + USAGE WITH GRANT OPTION
   * For database access, you must have at least one of the following privileges:

     + The CREATE DATABASE
     + OWNERSHIP
     + USAGE WITH GRANT OPTION
   * For schema access, you must have at least one of the following privileges:

     + CREATE DATABASE
     + OWNERSHIP
     + USAGE WITH GRANT OPTION
     + CREATE SCHEMA
     + USAGE, CREATE TABLE, CREATE VIEW WITH GRANT OPTION
   * Optional: For role access, you can create a new or select an existing role that will be
     assigned the DATA_READER application role. If you want to create a new role, then you need the CREATE ROLE privilege on your account.
     However, this is not necessary to complete the configuration.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow®, then select it. You will be now moved to the installation wizard page, that will take you through the configuration process.

Below are listed application’s configuration steps:

### Configure

In this dialog, fill in the following fields:

| Field | Description |
| --- | --- |
| Warehouse | Identifier for a dedicated virtual warehouse for the connector.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new `Large` warehouse or reuses a warehouse with the specified name. |
| Destination Database | Identifier for database that will contain the schema with the tables for the ServiceNow® data in Snowflake.  Specify a name that is unique for your account. The name of the database must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates or reuses a database with the specified name. |
| Destination Schema | Identifier for a schema that will contain the ServiceNow® data in Snowflake.  The Snowflake Connector for ServiceNow® ingests ServiceNow® data into tables in this schema.  Specify a name that is unique within the selected database. The name of the schema must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates or reuses a schema with the specified name. |
| Role | Identifier for a new custom role for the connector.  This role will be granted the DATA_READER application role as well as USAGE privilege on Destination Database and Destination Schema.  Specify a name that is unique for your account. The name of the role must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new role with the specified name. |

> **Note:**
>
> By default, the fields are set to the names of objects that are created when you configure the connector.
> Snowflake recommends using new objects for these fields. However if needed, you can specify the names of existing objects,
> (for example if reinstalling the connector).

> **Attention:**
>
> Make sure the warehouse is able to execute a query for at least 3 hours. It’s affected by a parameter value that can be set both
> on the warehouse used by the connector and on the account (account’s value takes precedence). To check the current values run:
>
> ```sqlexample
> SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' FOR ACCOUNT;
> SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' FOR WAREHOUSE <connector_warehouse>;
> ```
>
> If both values are at least `10800` (i.e. 3 hours), then no change is needed. Otherwise, run as necessary:
>
> ```sqlexample
> ALTER ACCOUNT SET STATEMENT_TIMEOUT_IN_SECONDS = 10800;
> ALTER WAREHOUSE <connector_warehouse> SET STATEMENT_TIMEOUT_IN_SECONDS = 10800;
> ```
>
> If the proper timeout is not provided, then data ingestion failures will occur.

Select Configure.

### Authentication (Connect to ServiceNow)

If you’re not signed in as a user with the ACCOUNTADMIN role, ensure that you meet the following requirements:

* You must have the CREATE INTEGRATION privilege.
* If integrations were previously created by other roles, then the ownership of those integrations must to be transferred to your role.
* If the CONNECTORS_SECRET database doesn’t exist, then you need the CREATE DATABASE privilege.
* If CONNECTORS_SECRET database exists but was created by another role, then you need these privileges:

  + USAGE WITH GRANT OPTION
  + CREATE SCHEMA WITH GRANT OPTION
* If CONNECTORS_SECRET.APP_NAME schema exists but was created by another role, then you need these privileges:

  + USAGE WITH GRANT OPTION
  + CREATE SECRET
  + CREATE NETWORK RULE
* If CONNECTORS_SECRET.APP_NAME.SECRET exists but was created by another role, then its ownership needs to be transferred to your role.
* If CONNECTORS_SECRET.APP_NAME.NETWORK_RULE exists but was created by another role, then its ownership needs to be transferred to your role.

The following procedure describes how to set up a connection to ServiceNow.
You can select either basic authentication (username and password) or OAuth.

1. Select one of possible authentication methods: Basic authentication, OAuth2 or OAuth Client Credentials (recommended).
2. In the ServiceNow Instance field, enter the name of the ServiceNow® instance.

   This is the first part of the hostname of your ServiceNow® instance. For example, if the URL to your
   ServiceNow® instance is:

   ```none
   https://myinstance.service-now.com
   ```

   The name of your instance would be `myinstance`.

> **Note:**
>
> When using a custom domain, for example anything other than `service-now.com`, you must provide the full URL to the ServiceNow® instance.

#### Basic authentication flow

1. In ServiceNow username and ServiceNow password fields enter the credentials for your ServiceNow® account.
2. Select Connect.

#### OAuth2 authorization code flow

[Create an endpoint for clients to access the instance](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_CreateEndpointforExternalClients.html) and use it to configure the connector:

1. Login to your ServiceNow® instance, then select Homepage.
2. Search for System OAuth, then select Application Registry.
3. Select New, then select Create an OAuth API endpoint for external clients.

   This displays a configuration page for the application registry as shown in the following image:
4. In ServiceNow, enter a name for the OAuth application registry in the Name field.
5. In Snowsight, copy the value in the Redirect URL field.
   In ServiceNow, paste this value in the Redirect URL field.

   This value was generated by the connector.
6. If required, in ServiceNow, update the values in the Refresh Token Lifespan and Access Token Lifespan fields.

   * Snowflake recommends setting the lifespan of the access token to at least 600 seconds.
   * For the lifespan of the refresh token, specify a value that is 7776000 (90 days).

     > > **Attention:**
     > >
     > > When Snowsight is opened via Private Link URL, the redirect URL is different than when Snowsight was opened
     > > via public URL. If you configured redirect URL using value provided by Private Link Snowsight,
     > > all subsequent updates to refresh token must also be done with Private Link Snowsight. If you are accessing
     > > Snowsight via publicly available URL, all subsequent updates to refresh token must also be done with
     > > Snowsight available at this URL.
7. In ServiceNow, select Submit.

   The OAuth application registry appears in the list of application registries.
8. In ServiceNow, select the application registry you just created.

   Note that ServiceNow® created values for the Client ID and Client Secret fields.
9. In ServiceNow, copy the value for Client ID. In Snowsight, paste this value into the Client ID field.
10. In ServiceNow, copy the value for Client Secret. In Snowsight, paste this value into the Client Secret field.

    > **Note:**
    >
    > The connector uses a [secret](../../user-guide/api-authentication.md) (a type of schema-level object) to store the access tokens used to authenticate
    > to the ServiceNow® instance. The connector uses this secret object with a security integration and
    > an external access integration to connect to the ServiceNow® instance.
    >
    > The secret, security integration, and external access integration are created automatically when you install the connector.
11. In Snowsight, select Connect.

    A dialog appears asking you to login to your ServiceNow® instance with User name and Password. Provide the credentials
    of the user you want the connector to authenticate with - it needs to have the privileges listed in [Prepare your ServiceNow® instance](prereqs.md).

    > **Important:**
    >
    > If you were redirected directly to this dialog without needing to provide credentials,
    > then you are already logged in to your ServiceNow® instance. Ensure you are logged in as the same user
    > the connector should use and that user has the necessary privileges.
    >
    > Note: The current logged-in user is shown in upper right corner of the dialog.
12. After logging in, to confirm that you want to allow the connector to connect to your ServiceNow® account, select Allow.

#### OAuth Client Credentials flow

[Create an endpoint for clients to access the instance](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_CreateEndpointforExternalClients.html) and use it to configure the connector.

> **Warning:**
>
> To use this authentication method, your ServiceNow® instance must be upgraded to at least [Washington DC release](https://www.servicenow.com/docs/bundle/washingtondc-release-notes/page/release-notes/family-release-notes.html).

1. Log in to your ServiceNow® instance, then select Homepage.
2. Search for sys_properties.list.
3. Search for property with `glide.oauth.inbound.client.credential.grant_type.enabled` name in the table and make sure that it is
   set to `true`.

   > **Note:**
   >
   > If the property doesn’t exist, create it. Click New button and fill the following fields of new
   > property:
   >
   > * Set Name to `glide.oauth.inbound.client.credential.grant_type.enabled`,
   > * Set Type to `true | false`,
   > * Set Value to `true`.
4. Search for System OAuth and then select Application Registry.
5. Select New and then select Create an OAuth API endpoint for external clients.
6. Enter a name for the OAuth application registry in the Name field.
7. Select the user that you want the connector to authenticate with in the OAuth Application User field.
   The user needs to have the privileges listed in [Prepare your ServiceNow® instance](prereqs.md).

   > **Note:**
   >
   > If the OAuth Application User field isn’t available in the form, open Additional actions menu in the
   > upper left corner of the screen. Select Configure > Form builder. Then, add the missing
   > OAuth Application User field to the `Default` view of the form. Save the form and refresh
   > the page to continue.
8. Select Submit.

   The OAuth application registry appears in the list of application registries.
9. Select the application registry you just created.

   Note that ServiceNow® created values for the Client ID and Client Secret fields.
10. In ServiceNow®, copy the value for Client ID. In Snowsight, paste this value into the Client ID field.
11. In ServiceNow®, copy the value for Client Secret. In Snowsight, paste this value into the Client Secret field.

    > **Note:**
    >
    > The connector uses a [secret](../../user-guide/api-authentication.md) (a type of schema-level object) to store the access tokens used to authenticate
    > to the ServiceNow® instance. The connector uses this secret object with a security integration and
    > an external access integration to connect to the ServiceNow® instance.
    >
    > The secret, security integration, and external access integration are created automatically when you install the connector.
12. In Snowsight, select Connect.

### Validate source

This section will check connection to your ServiceNow® instance and, optionally, allow to setup Journal Table.

To enable the propagation of deleted records, set Journal Table that serves as the source of information about deleted records.

Use the `sys_audit_delete` table as the source table.

If you do not want to ingest deleted records from ServiceNow® into Snowflake, leave this field empty.

> **Note:**
>
> Ensure that the ServiceNow® user for the connector has access to the specified journal table. If not all rows in the
> table are visible to the user the connector may fail to fetch entries from the journal table during access validation.
> In such a case perform this step by calling the [FINALIZE_CONNECTOR_CONFIGURATION](installing-sql.md)
> procedure from SQL and provide it either the `table_name` or `sys_id` argument, together with `journal_table`.

> **Warning:**
>
> It is **not** possible to setup journal table after configuring the application. To enable the deleted records propagation
> after configuration, you will need to reinstall the connector.

Select Validate to finish the configuration process.

During source validation the connector will attempt to check if a previously exported connector state is present in the
destination schema. If the `__CONNECTOR_STATE_EXPORT` table is present and accessible to the connector, the connector
will try to import the state. When the import finishes successfully, the export table will be deleted. If an error occurs
during import, it’s possible to run the source validation again after fixing the error. If you don’t want to import the
state or you don’t want to fix the import error, transfer ownership of the table from the connector and drop the table.

### Side effects

As a result of the configuration steps, the wizard creates the following objects residing outside of the connector’s database
that are needed by the connector to work:

* Database `CONNECTORS_SECRET` with schema `SNOWFLAKE_CONNECTOR_FOR_SERVICENOW` used to store secret object,
* Secret object in `CONNECTORS_SECRET.SNOWFLAKE_CONNECTOR_FOR_SERVICENOW` named `SECRET` with your ServiceNow® credentials,
* Network rule object in `CONNECTORS_SECRET.SNOWFLAKE_CONNECTOR_FOR_SERVICENOW` named `NETWORK_RULE` used to allow outbound traffic from your account,
* Security integration named `SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_SECURITY_INTEGRATION`, which is used to integrate between Snowflake
  and a third-party OAuth 2.0 service,
* External access integration `SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_EXTERNAL_ACCESS_INTEGRATION` is used for communication with ServiceNow.

> **Note:**
>
> The above objects are called like this if default `SNOWFLAKE_CONNECTOR_FOR_SERVICENOW` name was chosen as the application name.
> If the chosen application name is changed during installation, these objects’ names will differ accordingly.

> **Important:**
>
> Names of these objects, the warehouse and the role used during configuration **must not** be changed. The connector
> references them by name. Changing their names or dropping them breaks references and will make the connector unusable.
>
> If necessary, instead of renaming the warehouse, use the [UPDATE_WAREHOUSE](managing.md)
> stored procedure to change the warehouse used by the connector.

## Configure logging for the connector

The Snowflake Connector for ServiceNow® uses event tables to store error logs for the connector. To set
up an event table manually follow [Setting up an Event Table](../../developer-guide/logging-tracing/event-table-setting-up.md) guide.

If the event table isn’t set up, and if you choose to install the connector using the UI wizard, the connector will set up the event table automatically.

> **Note:**
>
> This app will collect logs for debugging purposes and write them to an event table in your account and to an event table in the app provider account.
> Only logs for this app will be included, and these logs are “Connector Usage Data”.

The event table will be created in the following location.

> | Object | Name |
> | --- | --- |
> | Database | EVENTS_DB |
> | Schema | PUBLIC |
> | Table | EVENTS |

## Connector application roles

As a Native Application, Snowflake Connector for ServiceNow® defines [application roles](../../developer-guide/native-apps/creating-setup-script.md).
They can be reviewed in [Role-based access control for connectors (ServiceNow)](application-roles.md).

## Next steps

After installing and configuring the connector, perform the steps described in [Set up data ingestion for your ServiceNow® data](ingestion.md).

---
title: Install and configure the connector with SQL commands
source: https://docs.snowflake.com/en/connectors/servicenow/installing-sql.md
section: Connectors & Drivers
---

# Install and configure the connector with SQL commands

This topic describes how to use SQL commands to install and configure the connector. It assumes that
you have already performed the procedures outlined in [Prepare your ServiceNow® instance](prereqs.md).

## Install the Snowflake Connector for ServiceNow®

The following procedure describes how to install the connector:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for ServiceNow®, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select Get.

   This displays a dialog that you use to begin the initial part of the installation process.

   In the dialog configure the following:

   1. In the Application name field, enter the name of the database to be used as the database for the connector
      instance. This database is created for you automatically.
   2. In the Warehouse used for installation field, select the warehouse that you want to use for
      installing the connector.

      > **Note:**
      >
      > This is not the same warehouse that is used by the connector to synchronize data from ServiceNow®. In a
      > later step, you will create a separate warehouse for this purpose.
   3. Select Get.
5. A dialog appears with the notification: `Installing App After installation, an email will be sent to <user_email>`. You can now close the dialog.
   To continue configuration using SQL, wait until you receive an email stating that `'Snowflake Connector for ServiceNow' installed and ready for use` then go to the Worksheets.

## Set up OAuth

> **Note:**
>
> If you plan to use basic authentication instead of OAuth, you can skip this section and continue
> to Create a secret object

You can configure the Snowflake Connector for ServiceNow® to use OAuth for authenticating to the ServiceNow® instance. There are two supported OAuth flows:

* Client credentials grant (recommended): Available since [Washington DC release](https://www.servicenow.com/docs/bundle/washingtondc-release-notes/page/release-notes/family-release-notes.html). Client credentials are
  a widely accepted authorization standard for machine to machine integration and don’t require manual refresh tokens
  maintenance.
* Authorization code grant flow: This authentication method is available on all supported ServiceNow® releases, but
  with this method OAuth tokens must be manually refreshed before their expiration date, typically every 3 months.

### Set up OAuth with client credentials grant flow

To configure the Snowflake Connector for ServiceNow® to use OAuth with client credentials grant flow for authenticating to the ServiceNow® instance, do the following:

* In ServiceNow®, you must set up the instance to support using OAuth with the [client credentials grant flow](https://www.servicenow.com/docs/bundle/washingtondc-platform-security/page/integrate/authentication/concept/client-credentials.html).
* In the Snowflake Connector for ServiceNow®:

  + The connector uses a security integration with `TYPE = API_AUTHENTICATION` to connect
    Snowflake to the ServiceNow® instance.

    The security integration specifies the ServiceNow® OAuth client ID, client secret, and the endpoint
    URL for authenticating to the ServiceNow® instance.
  + The connector uses a Snowflake secret object to manage sensitive information, including the
    authentication credentials.

    In the case of using OAuth for authentication, the connector stores the ServiceNow® OAuth scope and the name of
    the security integration in the Snowflake secret object.

If your ServiceNow® instance already uses the OAuth client credentials grant flow and you would like to use that
instance with the Snowflake Connector for ServiceNow®, note the client ID, client secret, and endpoint URL that corresponds to the OAuth token.
For more information, see [Manage OAuth tokens](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_ManageTokens.html). After noting this information, proceed to create essential objects

#### Configure ServiceNow® instance to use the OAuth with client credentials grant flow

1. Configure your instance to use OAuth with the authorization code grant flow as shown in [Set up OAuth](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_SettingUpOAuth.html).
2. [Create an endpoint for clients to access the instance](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_CreateEndpointforExternalClients.html) and use it to configure the connector:

   1. Log in to your ServiceNow® instance, then select Homepage.
   2. Search for sys_properties.list.
   3. Search for property with `glide.oauth.inbound.client.credential.grant_type.enabled` name in the table and make sure it is
      set to `true`.

      > **Note:**
      >
      > If the property doesn’t exist, create it. Click New button and fill the following fields of new
      > property:
      >
      > * Set Name to `glide.oauth.inbound.client.credential.grant_type.enabled`,
      > * Set Type to `true | false`,
      > * Set Value to `true`.
   4. Search for System OAuth, then select Application Registry.
   5. Select New, then select Create an OAuth API endpoint for external clients.
   6. In ServiceNow®, enter a name for the OAuth application registry in the Name field.
   7. In ServiceNow®, select user you want the connector to authenticate with in the OAuth Application User field.
      The user needs to have the privileges listed in [Prepare your ServiceNow® instance](prereqs.md).

      > **Note:**
      >
      > If OAuth Application User field isn’t available in the form, open Additional actions menu in the
      > upper left corner of the screen. Select from menu Configure > Form builder. Then add missing
      > OAuth Application User field to the `Default` view of the form. Save the form and refresh
      > the page to continue.
   8. In ServiceNow®, select Submit.

      The OAuth application registry appears in the list of application registries.
   9. In ServiceNow®, select the application registry you just created.

      Note that ServiceNow® generated values for the Client ID and Client Secret fields.
      You will use these values when
      creating a security integration.

### Set up OAuth with authorization code grant flow

To configure the Snowflake Connector for ServiceNow® to use OAuth with authorization code grant flow for authenticating to the ServiceNow® instance:

* In ServiceNow, you must set up the instance to support using OAuth with the [authorization code grant flow](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/concept/c_OAuthAuthorizationCodeFlow.html).
* In the Snowflake Connector for ServiceNow®:

  + The connector uses a security integration with `TYPE = API_AUTHENTICATION` to connect
    Snowflake to the ServiceNow® instance.

    The security integration specifies the ServiceNow® OAuth client ID, client secret, and the endpoint
    URL for authenticating to the ServiceNow® instance.
  + The connector uses a Snowflake secret object to manage sensitive information, including the
    authentication credentials.

    In the case of using OAuth for authentication, the connector stores the ServiceNow® OAuth refresh
    token, the refresh token expiration time, and the name of the security integration in the Snowflake
    secret object.

If your ServiceNow® instance already uses the OAuth authorization code grant flow and you would like to use that
instance with the Snowflake Connector for ServiceNow®, note the client ID, client secret, and endpoint URL that corresponds to the OAuth token.
For more information, see [Manage OAuth tokens](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_ManageTokens.html). After noting this information proceed to generate OAuth refresh token

#### Configure ServiceNow® instance to use the OAuth with authorization code grant flow

1. Configure your instance to use OAuth with the authorization code grant flow as shown in [Set up OAuth](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_SettingUpOAuth.html).
2. [Create an endpoint for clients to access the instance](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/security/task/t_CreateEndpointforExternalClients.html) and use it to configure the connector:

   1. Login to your ServiceNow® instance, then select Homepage.
   2. Search for OAuth, then select Application Registry.
   3. Select New, then select Create an OAuth API endpoint for external clients.

      This displays a configuration page for the application registry as shown in the following image:
   4. In ServiceNow, enter a name for the OAuth application registry in the Name field.
   5. If required, in ServiceNow, update the values in the Refresh Token Lifespan and
      Access Token Lifespan fields.

      * Snowflake recommends setting the lifespan of the access token to at least 600 seconds.
      * For the lifespan of the refresh token, specify a value that is 7776000 (90 days).
   6. In ServiceNow, select Submit.

      The OAuth application registry appears in the list of application registries.
   7. In ServiceNow, select the application registry you just created.

      Note that ServiceNow® generated values for the Client ID and Client Secret fields.
      You will use these values when
      creating a security integration.

#### Generate OAuth refresh token

To generate the OAuth refresh token:

1. Send an HTTP request to the `/oauth_token.do` endpoint of your ServiceNow® instance, as explained in the
   [REST OAuth Example](https://docs.servicenow.com/bundle/washingtondc-api-reference/page/integrate/inbound-rest/reference/r_RESTOAuthExample.html) in the ServiceNow® documentation.

   For example, if you are using curl to send the HTTP request:

   > ```bash
   > curl -d "grant_type=password" --data-urlencode "client_id=<client_id>" --data-urlencode "client_secret=<client_secret>" --data-urlencode "username=<username>" --data-urlencode "password=<password>" -X POST https://<servicenow_instance>.service-now.com/oauth_token.do
   > ```

   Where

   > `<servicenow_instance>`
   > :   Specifies the name of your ServiceNow® instance.
   >
   > `client_id` and `client_secret`
   > :   Specify the values you obtained when setting up the ServiceNow® endpoint.
   >
   > `username` and `password`
   > :   Specify the credentials for your ServiceNow® instance.

   > **Note:**
   >
   > The example above uses the `data-urlencode` command-line flag in curl to [URL-encode](https://en.wikipedia.org/wiki/Percent-encoding) the client secret, username, and password in the HTTP request sent to ServiceNow®.
   >
   > If you are using a different tool to send the HTTP request, make sure that you URL-encode these values in the request.

   The body of the HTTP response contains a JSON object. Get the refresh token from the `refresh_token` field in this
   object:

   > ```sqljson
   > {"access_token":"abcd1234","refresh_token":"cdef567","scope":"useraccount","token_type":"Bearer","expires_in":1799}
   > ```

## Create essential objects

### Create a security integration (optional)

> **Note:**
>
> If you plan to use basic authentication instead of OAuth, you can skip this section and continue
> to Create a secret object

A security integration is a Snowflake object that provides an interface between Snowflake and a third-party
OAuth 2.0 service.

#### Create a security integration for OAuth with client credentials grant flow

Use the [CREATE SECURITY INTEGRATION](../../sql-reference/sql/create-security-integration-api-auth.md) command to create a security integration as shown in the
following example:

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
 TYPE = API_AUTHENTICATION
 AUTH_TYPE = OAUTH2
 OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
 OAUTH_CLIENT_ID = '<client_id>'
 OAUTH_CLIENT_SECRET = '<client_secret>'
 OAUTH_TOKEN_ENDPOINT = 'https://<servicenow_instance>.service-now.com/oauth_token.do'
 OAUTH_GRANT = 'CLIENT_CREDENTIALS'
 OAUTH_ALLOWED_SCOPES = ('useraccount')
 ENABLED = TRUE;
```

Where:

> `name`
> :   Specifies the name of security integration. The name must be unique among integrations in your account.
>
> `client_id`
> :   Specifies the value of the Client ID field you obtained when setting up the ServiceNow® endpoint.
>
> `client_secret`
> :   Specifies the value of the Client Secret field you obtained when setting up the ServiceNow® endpoint.
>
> `servicenow_instance_name`
> :   Specifies the name of your ServiceNow® instance. This is the first part of the hostname of your ServiceNow®
>     instance. For example, if the URL to your ServiceNow® instance is:
>
>     ```none
>     https://myinstance.service-now.com
>     ```
>
>     The name of your instance would be `myinstance`.

#### Create a security integration for OAuth with authorization code grant flow

Use the [CREATE SECURITY INTEGRATION](../../sql-reference/sql/create-security-integration-api-auth.md) command to create a security integration as shown in the
following example:

```sqlsyntax
CREATE SECURITY INTEGRATION <name>
 TYPE = API_AUTHENTICATION
 AUTH_TYPE = OAUTH2
 OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
 OAUTH_CLIENT_ID = '<client_id>'
 OAUTH_CLIENT_SECRET = '<client_secret>'
 OAUTH_TOKEN_ENDPOINT = 'https://<servicenow_instance>.service-now.com/oauth_token.do'
 ENABLED = TRUE;
```

Where:

> `name`
> :   Specifies the name of security integration. The name must be unique among integrations in your account.
>
> `client_id`
> :   Specifies the value of the Client ID field you obtained when setting up the ServiceNow® endpoint.
>
> `client_secret`
> :   Specifies the value of the Client Secret field you obtained when setting up the ServiceNow® endpoint.
>
> `servicenow_instance_name`
> :   Specifies the name of your ServiceNow® instance. This is the first part of the hostname of your ServiceNow®
>     instance. For example, if the URL to your ServiceNow® instance is:
>
>     ```none
>     https://myinstance.service-now.com
>     ```
>
>     The name of your instance would be `myinstance`.

### Create a secret object

Create the Snowflake secret object that the Snowflake Connector for ServiceNow® uses for authentication.

Snowflake recommends storing the secret object in a dedicated database and schema.
Note that you can choose any role to manage the secret, and you can choose any database and schema to store the secret.

To create a custom role to manage the secret, use the [CREATE ROLE](../../sql-reference/sql/create-role.md) command. For information on the privileges
that you can grant to a role, see [Access control privileges](../../user-guide/security-access-control-privileges.md).

The next sections explain how to create a secret object that is stored in a separate database and schema and that
is managed by a custom role.

#### Create a schema for the secret objects

First, create a database and schema to store the secret object by running the [CREATE DATABASE](../../sql-reference/sql/create-database.md) and
[CREATE SCHEMA](../../sql-reference/sql/create-schema.md) commands. The names of the schema and database must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).

For example, to create the database `secretsdb` and the schema `apiauth` for the secret object, run the following commands:

> ```sqlexample
> USE ROLE accountadmin;
> CREATE DATABASE secretsdb;
> CREATE SCHEMA apiauth;
> ```

#### Create a custom role to manage the secret (optional)

Next, create a custom role to manage the secret (assuming that you do not want to use an existing role) and grant
the role the privileges needed to create the secret.

1. Using the USERADMIN system role, run the [CREATE ROLE](../../sql-reference/sql/create-role.md) command to create a custom role to manage the secret.
   For example, to create the custom role `secretadmin` for managing the secret, run the following commands:

   > ```sqlexample
   > USE ROLE useradmin;
   > CREATE ROLE secretadmin;
   > ```
2. Using the SECURITYADMIN system role, run the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command to grant the following
   privileges to the custom role:

   * USAGE on the database that you created for the secret
   * USAGE and CREATE SECRET on
     the schema that you created for the secret

   For example:

   > ```sqlexample
   > USE ROLE securityadmin;
   > GRANT USAGE ON DATABASE secretsdb TO ROLE secretadmin;
   > GRANT USAGE ON SCHEMA secretsdb.apiauth TO role secretadmin;
   > GRANT CREATE SECRET ON SCHEMA secretsdb.apiauth TO role secretadmin;
   > ```
3. (optional) If you are setting up the connector with OAuth authentication, then also grant USAGE privilege on
   the security integration that you created earlier to the custom role.

   For example:

   > ```sqlexample
   > USE ROLE securityadmin;
   > GRANT USAGE ON INTEGRATION servicenow_oauth TO role secretadmin;
   > ```
4. Using the USERADMIN system role, run the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command to grant the custom role to the
   user who creates the secret. For example, to grant the role to the user `servicenow_secret_owner`, run
   the following commands:

   ```sqlexample
   USE ROLE useradmin;
   GRANT ROLE secretadmin TO user servicenow_secret_owner;
   ```

#### Create a secret

Next, create a secret to enable Snowflake to authenticate to the ServiceNow® instance using OAuth with the authorization code grant flow.

> **Note:**
>
> If you plan to use basic authentication instead of OAuth, see
> the note below instead.

To create a secret object for OAuth client credentials grant flow, run the [CREATE SECRET](../../sql-reference/sql/create-secret.md) command with the following parameters:

> * Set `TYPE` to `OAUTH2`.
> * Set `API_AUTHENTICATION` to the name of the security integration that you created in Create essential objects:
> * Set `OAUTH_SCOPES` to `useraccount`.
>
> For example, to create a secret named `service_now_creds_oauth_code` that uses the security integration named
> `servicenow_oauth`, run these commands:
>
> ```sqlexample
> USE ROLE secretadmin;
> USE SCHEMA secretsdb.apiauth;
> CREATE SECRET servicenow_creds_oauth_code
>   TYPE = OAUTH2
>   API_AUTHENTICATION = servicenow_oauth
>   OAUTH_SCOPES=('useraccount');
> ```

To create a secret object for OAuth authorization code grant flow, run the [CREATE SECRET](../../sql-reference/sql/create-secret.md) command with the following parameters:

> * Set `TYPE` to `OAUTH2`.
> * Set `OAUTH_REFRESH_TOKEN` to the OAuth refresh token that you retrieved in Generate OAuth refresh token.
> * Set `OAUTH_REFRESH_TOKEN_EXPIRY_TIME` to the refresh token expiration timestamp in UTC timezone. You can calculate
>   this by adding the token lifespan from ServiceNow® to the date when the token was issued. By default, the
>   token expires in 100 days.
> * Set `API_AUTHENTICATION` to the name of the security integration that you created in Create essential objects:
>
> For example, to create a secret named `service_now_creds_oauth_code` that uses the security integration named
> `servicenow_oauth`, run these commands:
>
> ```sqlexample
> USE ROLE secretadmin;
> USE SCHEMA secretsdb.apiauth;
> CREATE SECRET servicenow_creds_oauth_code
>   TYPE = OAUTH2
>   OAUTH_REFRESH_TOKEN = 'cdef567'
>   OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '2022-01-06 20:00:00'
>   API_AUTHENTICATION = servicenow_oauth;
> ```

To modify the properties of an existing secret (e.g. to update the OAuth refresh token), use the [ALTER SECRET](../../sql-reference/sql/alter-secret.md)
command.

> **Note:**
> > If you plan to use basic authentication (rather than OAuth), run the [CREATE SECRET](../../sql-reference/sql/create-secret.md) command to create a
> > secret with `TYPE` set to `PASSWORD`. Set `USERNAME` and `PASSWORD` to the username and
> > password of the ServiceNow® user that you plan to use to authenticate to the ServiceNow® instance. For example:
> >
> > ```sqlexample
> > USE ROLE secretadmin;
> > USE SCHEMA secretsdb.apiauth;
> > CREATE SECRET servicenow_creds_pw
> >   TYPE = PASSWORD
> >   USERNAME = 'jsmith1'
> >   PASSWORD = 'W3dr@fg*7B1c4j';
> > ```
>
> If multi-factor authentication is enabled for this user, you must provide the MFA token together with password
> as described in [REST API](https://docs.servicenow.com/bundle/washingtondc-api-reference/page/integrate/inbound-rest/concept/c_RESTAPI.html) in the ServiceNow® documentation.

### Create a warehouse

Snowflake recommends [creating a warehouse](../../user-guide/warehouses-tasks.md) dedicated
for the connector. A dedicated warehouse allows for better cost management and resource tracking. To facilitate
resource tracking, you can optionally [add one or more tags](../../user-guide/object-tagging/introduction.md) to the dedicated warehouse.

For the connector warehouse, Snowflake recommends using a large-sized warehouse.

To create a large-sized warehouse named `servicenow_conn_warehouse`, run the following command:

```sqlexample
USE ROLE accountadmin;
CREATE WAREHOUSE servicenow_conn_warehouse WAREHOUSE_SIZE = LARGE;
```

> **Attention:**
>
> Make sure the warehouse is able to execute a query for at least 8 hours. It’s affected by a parameter value that can be set both
> on the warehouse used by the connector and on the account (account’s value takes precedence). To check the current values run:
>
> ```sqlexample
> SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' FOR ACCOUNT;
> SHOW PARAMETERS LIKE 'STATEMENT_TIMEOUT_IN_SECONDS' FOR WAREHOUSE <connector_warehouse>;
> ```
>
> If both values are at least `28800` (i.e. 8 hours), then no change is needed. Otherwise, run one of the following as necessary:
>
> ```sqlexample
> ALTER ACCOUNT SET STATEMENT_TIMEOUT_IN_SECONDS = 28800;
> ALTER WAREHOUSE <connector_warehouse> SET STATEMENT_TIMEOUT_IN_SECONDS = 28800;
> ```
>
> If the proper timeout is not provided, then data ingestion failures will occur.

### Create a database and schema for the ServiceNow® data

Next, create a database and schema for the ServiceNow® data. The Snowflake Connector for ServiceNow® ingests ServiceNow® data
into this database and schema.

When creating the database and schema, note the following:

* The names of the schema and database must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).
* To control access to the ingested ServiceNow® data in Snowflake, you can
  [grant the privileges on the schema to the roles that should be allowed to access the data](accessing-data.md).

To create the database and schema, run the [CREATE DATABASE](../../sql-reference/sql/create-database.md) and [CREATE SCHEMA](../../sql-reference/sql/create-schema.md) commands.

For example, to create the database `dest_db` and the schema `dest_schema` for the ServiceNow® data, run the following
commands:

```sqlexample
USE ROLE accountadmin;
CREATE DATABASE dest_db;
CREATE SCHEMA dest_schema;
```

> **Note:**
>
> If you are reinstalling the connector, you can reuse the schema that you created for the previous installation of the
> connector. This is possible if the previous installation of the connector has already loaded data and you want to
> continue loading data into the same tables.
>
> To continue loading data, do not modify the schema before [reinstalling the connector](managing.md).
> Do not change the definitions of the tables created by the previous installation of the connector.
>
> The connector periodically exports connector configuration and state to a `__CONNECTOR_STATE_EXPORT` table in the schema,
> which can later be used to recover connector configuration during reinstallation. Alternatively, if the export table isn’t present or was
> dropped manually, you can still later call the [the ENABLE_TABLES stored procedure](ingestion.md) to reenable the previously ingested tables.
> The stored procedure verifies that all required objects already exist and does not attempt to recreate them, thus
> there is no risk of losing already ingested data.

### Create a network rule for communicating with the ServiceNow® instance

Next, to allow outbound traffic from your account to your ServiceNow® instance, please create a network rule.
As an accountadmin, run the [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md) command with the following syntax:

```sqlsyntax
CREATE NETWORK RULE <name>
  MODE = 'EGRESS'
  TYPE = 'HOST_PORT'
  VALUE_LIST = ('<servicenow_instance>.service-now.com');
```

Where:

`name`
:   Specifies the name of the Network Rule. The name must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).

`VALUE_LIST = ('servicenow_instance_name.service-now.com')`
:   Specifies list of allowed ServiceNow® instances to which a request can be sent.

For example, to create the network rule named `servicenow_network_rule` inside `apiauth` schema of database `secretsdb` run the following command:

```sqlsyntax
USE ROLE accountadmin;
CREATE NETWORK RULE secretsdb.apiauth.servicenow_network_rule
  MODE = 'EGRESS'
  TYPE = 'HOST_PORT'
  VALUE_LIST = ('myinstance.service-now.com');
```

> **Note:**
>
> If you created the secret with a custom role, you need to additionally grant USAGE privilege on it to `ACCOUNTADMIN`
> before creating the network rule:
>
> ```sqlsyntax
> USE ROLE secretadmin;
> GRANT USAGE ON SECRET secretsdb.apiauth.<secret_name> TO ROLE ACCOUNTADMIN;
> ```

### Create an external access integration for communicating with the ServiceNow® instance

Next, create an external access integration for communicating with the ServiceNow® instance. Run the [CREATE EXTERNAL ACCESS INTEGRATION](../../sql-reference/sql/create-external-access-integration.md)
command with the following syntax:

```sqlsyntax
CREATE EXTERNAL ACCESS INTEGRATION <integration_name>
  ALLOWED_NETWORK_RULES = (<network_rule_name>)
  ALLOWED_AUTHENTICATION_SECRETS = (<secret_name>)
  ENABLED = TRUE;
```

Where:

`integration_name`
:   Specifies the name of the external access integration. The name must be a valid [object identifier](../../sql-reference/identifiers-syntax.md). The name must be unique among integrations in your account.

`ALLOWED_NETWORK_RULES = (network_rule_name)`
:   Specifies the network rule allowing access to your ServiceNow® instance. This limits the use of this integration to the instances with the URLs specified in the network rule.

    Set this to the name of the network rule that you created in Create a network rule for communicating with the ServiceNow® instance.

`ALLOWED_AUTHENTICATION_SECRETS = (secret_name)`
:   Specifies the list of the names of the secrets that are allowed for use in the scope of the API integration.

    Set this to the name of the secret object that you created in Create a secret object.

`ENABLED = TRUE`
:   Specifies whether this API integration is enabled or disabled. If the API integration is disabled, any
    external function that relies on it does not work.

    > `TRUE`
    > :   Allows the integration to run based on the parameters specified in the integration definition.
    >
    > `FALSE`
    > :   Suspends the integration for maintenance. Any integration between Snowflake and a third-party service fails to work.

For example, to create the external access integration named `servicenow_external_access_integration` run the following command:

> ```sqlexample
> USE ROLE accountadmin;
> CREATE EXTERNAL ACCESS INTEGRATION servicenow_external_access_integration
>   ALLOWED_NETWORK_RULES = (secretsdb.apiauth.servicenow_network_rule)
>   ALLOWED_AUTHENTICATION_SECRETS = (secretsdb.apiauth.servicenow_creds_pw)
>   ENABLED = TRUE
> ```

## Configure logging for the connector

The Snowflake Connector for ServiceNow® uses event tables to store error logs for the connector. To set up an event table follow [Setting up an Event Table](../../developer-guide/logging-tracing/event-table-setting-up.md) guide.

> **Important:**
>
> Snowflake recommends that you
> [set up event tracing](http://docs.snowflake.com/developer-guide/native-apps/ui-consumer-enable-logging) to help troubleshoot problems.

## Set up the installed connector

To set up the connector:

1. Create a database for the connector instance using Snowsight. For more information on how to create the database, see [Installing and Configuring the Connector with Snowsight](installing-snowsight.md).
2. Navigate to the SQL worksheet.
3. Log in as a user with the ACCOUNTADMIN role. For example:

   > ```sqlexample
   > USE ROLE ACCOUNTADMIN;
   > ```
4. Grant all the required privileges to the connector
   the database that serves as an instance of the connector.

   * EXECUTE TASK on the account
   * EXECUTE MANAGED TASK on the account
   * USAGE on
     the warehouse that you created for the connector.
   * USAGE on
     the database that you created for the ServiceNow® data
   * USAGE, CREATE TABLE, and CREATE VIEW on
     the schema that you created for the ServiceNow® data
   * USAGE on
     the external access integration that you created for ServiceNow®
   * USAGE on the database that you created for the secret
   * USAGE on
     the schema that you created for the secret
   * READ on the secret that you created

   For example, to grant the following privileges to the connector named `my_connector_servicenow`:

   * EXECUTE TASK on the account
   * EXECUTE MANAGED TASK on the account
   * USAGE on the warehouse `servicenow_conn_warehouse`
   * USAGE on the `dest_db` database
   * USAGE, CREATE TABLE, and CREATE VIEW on the `dest_db.dest_schema` schema
   * USAGE on the `servicenow_external_access_integration` integration
   * USAGE on the `secretsdb` database
   * USAGE on the `secretsdb.apiauth` schema
   * READ on the `secretsdb.apiauth.servicenow_creds_oauth_code secret` secret

   Run the following commands:

   > ```sqlexample
   > USE ROLE accountadmin;
   >
   > GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION my_connector_servicenow;
   > GRANT EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION my_connector_servicenow;
   >
   > GRANT USAGE ON WAREHOUSE servicenow_conn_warehouse TO APPLICATION my_connector_servicenow;
   >
   > GRANT USAGE ON DATABASE dest_db TO APPLICATION my_connector_servicenow;
   > GRANT USAGE ON SCHEMA dest_db.dest_schema TO APPLICATION my_connector_servicenow;
   > GRANT CREATE TABLE ON SCHEMA dest_db.dest_schema TO APPLICATION my_connector_servicenow;
   > GRANT CREATE VIEW ON SCHEMA dest_db.dest_schema TO APPLICATION my_connector_servicenow;
   >
   > GRANT USAGE ON INTEGRATION servicenow_external_access_integration TO APPLICATION my_connector_servicenow;
   >
   > GRANT USAGE ON DATABASE secretsdb TO APPLICATION my_connector_servicenow;
   > GRANT USAGE ON SCHEMA secretsdb.apiauth TO APPLICATION my_connector_servicenow;
   > GRANT READ ON SECRET secretsdb.apiauth.servicenow_creds_oauth_code TO APPLICATION my_connector_servicenow;
   > ```
5. Transfer ownership of tables and views in destination schema (optional)

   If the connector was reinstalled and previous destination schema is reused, ownership of all tables and views in
   destination schema must be transferred to the connector. The connector requires ownership privilege to manage
   grants on objects in schema and to recreate flattened views when schema of ingested table is changed.

   To transfer the ownership call `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` function.

   ```sqlexample
   USE ROLE accountadmin;
   CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<connector_app>, true, <destination_database>, <destination_schema>);
   ```

   The `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` is a system function provided by Snowflake that allows the transfer of
   ownership of tables and views in a specified database or schema to the application. Only the ownership of regular tables and
   regular views is transferred, e.g. ownership of dynamic tables, external tables, materialized views, etc. won’t be
   transferred.

   The function has the following signature:

   ```sqlexample
   SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<to_app>, <should_copy_grants>, <from_database>, <from_schema>)
   ```

   Where:

   > `to_app`
   > :   Specifies the name of the application to which the ownership of objects should be transferred.
   >
   > `should_copy_grants`
   > :   If `TRUE` then copy existing grants, otherwise revoke. Copying grants requires `MANAGE GRANTS`
   >     permission on the caller.
   >
   > `from_database`
   > :   Name of the database containing objects whose ownership should be changed.
   >
   > `from_schema`
   > :   (Optional) name of the schema containing objects whose ownership should be changed. If no schema is specified,
   >     ownership is transferred on tables and views in all schemas in the provided database. Objects in managed schemas
   >     are omitted during ownership transfer.

   To execute the function the caller must meet one of the following conditions:

   * It has `MANAGE GRANTS` permission (e.g. ACCOUNTADMIN or SECURITYADMIN role), or
   * It contains role owning the application instance and role owning all objects to transfer the ownership. Objects on
     which the ownership is missing are omitted by the function.

   For example, to transfer ownership the connector that:

   * Was installed as application named `my_connector_servicenow`
   * Uses the schema named `dest_db.dest_schema` for the ServiceNow® data in Snowflake

   Run the following command:

   ```sqlexample
   USE ROLE accountadmin;
   CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION('my_connector_servicenow', true, 'dest_db', 'dest_schema');
   ```

   If needed, grant `DATA_READER` application role to the role previously owning the data to prevent
   disruptions of existing pipelines using the data:

   ```sqlexample
   GRANT APPLICATION ROLE <connector_app>.DATA_READER TO ROLE <previous_data_owner_role>;
   ```

   Note that `DATA_READER` application role won’t have any grants on tables and views in destination schema until
   `CONFIGURE_CONNECTOR` procedure is run.
6. Run the [USE DATABASE](../../sql-reference/sql/use-database.md) command to use the database for the connector. For example:

   ```sqlexample
   USE DATABASE my_connector_servicenow;
   ```
7. Configure the connector by using the [CALL](../../sql-reference/sql/call.md) command to call the stored procedure named `CONFIGURE_CONNECTOR`:

   ```sqlsyntax
   CALL CONFIGURE_CONNECTOR({
     'warehouse': '<warehouse_name>',
     'destination_database': '<dest_db>',
     'destination_schema': '<dest_schema>'
   })
   ```

   Where:

   > `warehouse_name`
   > :   Specifies the name of the warehouse for the connector.
   >
   >     The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).
   >
   > `dest_db`
   > :   Specifies the name of the
   >     database for the ServiceNow® data in Snowflake (the database that you created earlier).
   >
   >     The name of the database must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).
   >
   > `dest_schema`
   > :   Specifies the name of the
   >     schema for the ServiceNow® data in Snowflake (the schema that you created earlier).
   >
   >     The name of the schema must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).

   For example, to configure the connector that:

   * Uses warehouse `servicenow_conn_warehouse`.
   * Uses the schema named `dest_db.dest_schema` for the ServiceNow® data in Snowflake

   Run the following command:

   > ```sqlexample
   > CALL CONFIGURE_CONNECTOR({
   >   'warehouse': 'servicenow_conn_warehouse',
   >   'destination_database': 'dest_db',
   >   'destination_schema': 'dest_schema'
   > });
   > ```

   If the connector was started successfully, this stored procedure returns the following response:

   > ```json
   > {
   >   "responseCode": "OK",
   >   "message": "Connector successfully configured.",
   > }
   > ```
   >
   > > **Note:**
   > >
   > > Once the connector is started, it’s not possible to rename passed warehouse, destination database and destination
   > > schema for the connector. The connector references them by name. As a result, an attempt
   > > to drop or alter the name of these objects breaks the connector and stops it from working.
   > >
   > > Instead of renaming the warehouse, use [UPDATE_WAREHOUSE](managing.md)
   > > stored procedure to change the warehouse used by the connector.
8. Configure the connection to ServiceNow® instance by using the [CALL](../../sql-reference/sql/call.md) command to call the stored procedure
   named `SET_CONNECTION_CONFIGURATION`:

   ```sqlsyntax
   CALL SET_CONNECTION_CONFIGURATION({
     'service_now_url': '<servicenow_base_url>',
     'secret': '<secret_name>',
     'external_access_integration': '<external_access_integration_name>'
   })
   ```

   Where:

   > `servicenow_base_url`
   > :   Specifies the URL of the ServiceNow® instance that the connector should use. The URL should be in the following format:
   >
   >     > ```none
   >     > https://<servicenow_instance>.service-now.com
   >     > ```
   >
   > `secret_name`
   > :   Specifies the fully qualified name of the
   >     secret object containing the credentials for authenticating to ServiceNow® (the secret that you created earlier).
   >
   >     You must specify the fully qualified name of the secret object in the following format:
   >
   >     > ```none
   >     > <database_name>.<schema_name>.<secret_name>
   >     > ```
   >
   >     The names of the database, schema, and secret must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).
   >
   > `external_access_integration_name`
   > :   Specifies the name of the
   >     external access integration for ServiceNow® (the external access integration that you created earlier).
   >
   >     The name of the integration must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).

   For example, to configure the connection to a ServiceNow® instance that:

   * Has the URL `https://myinstance.service-now.com`.
   * Uses the secret stored in `secretsdb.apiauth.servicenow_creds_oauth_code`.
   * Uses the external access integration named `servicenow_external_access_integration`.

   Run the following command:

   > ```sqlexample
   > CALL SET_CONNECTION_CONFIGURATION({
   >   'service_now_url': 'https://myinstance.service-now.com',
   >   'secret': 'SECRETSDB.APIAUTH.SERVICENOW_CREDS_OAUTH_CODE',
   >   'external_access_integration': 'SERVICENOW_API_INTEGRATION'
   > });
   > ```

   If the connection was configured successfully, this stored procedure returns the following response:

   > ```json
   > {
   >   "responseCode": "OK",
   >   "message": "Test request to ServiceNow® succeeded.",
   > }
   > ```

   > **Note:**
   >
   > Once the connection is configured, it’s not possible to change name of the passed secret and external access integration. The
   > connector references them by name. As a result, an attempt to drop or alter the name of these objects breaks
   > the connector and stops it from working.
9. Finalize the configuration of the connector using the [CALL](../../sql-reference/sql/call.md) command to call the stored procedure
   named `FINALIZE_CONNECTOR_CONFIGURATION`:

   ```sqlsyntax
   CALL FINALIZE_CONNECTOR_CONFIGURATION({
     'journal_table': '<name_of_journal_table>',
     'table_name': '<name_of_audited_table>',
     'sys_id': '<sys_id_of_audited_entry>'
   })
   ```

   Where:

   > `name_of_journal_table`
   > :   Specifies the name of the table that contains information about deleted records. Refer to [Prepare your ServiceNow® instance](prereqs.md) for more information.
   >
   >     Note that information on deleted records is available only for tables that you set up to propagate deleted records.
   >
   >     To prevent the propagation of deleted records, specify the `null` for this argument.
   >
   > `name_of_audited_table`
   > :   (optional) Specifies the name of the audited table that should be present in the journal table and to which the connector
   >     should have access. During validation of access to the journal table, the connector looks for audit entries related to
   >     this table. Provide this option when a query to ServiceNow® succeeds, but gives no result, causing the procedure to
   >     fail. Ensure that the ServiceNow® user for the connector has access to all entries for the specified table.
   >
   >     This option can’t be used together with `sys_id` parameter.
   >
   > `sys_id_of_audited_entry`
   > :   (optional) Specifies the `sys_id` of entry from some audited table that should be present in the journal table and
   >     to which the connector should have access. During validation of access to the journal table, the connector looks for
   >     audit entries related to this `sys_id`. Provide this option when a query to ServiceNow® succeeds, but gives no
   >     result, causing the procedure to fail. Ensure that the ServiceNow® user for the connector has access to specified
   >     entry.
   >
   >     This option can’t be used together with `table_name` parameter.

   If the connector was started successfully, this stored procedure returns the following response:

   ```json
   {
       "responseCode": "OK",
   }
   ```

   During finalization of the connector configuration, the connector will attempt to check if a previously exported connector
   state is present in the destination schema. If the `__CONNECTOR_STATE_EXPORT` table is present and accessible to
   the connector, the connector will try to import the state. When import finishes successfully, the export table will be
   deleted. If an error occurs during import, it’s possible to run the `FINALIZE_CONNECTOR_CONFIGURATION` procedure again
   after fixing the error. If you don’t want to import the state or you don’t want to fix the import error, transfer ownership
   of the table from the connector and drop the table.

The newly created database is an instance of the connector and contains the following:

* Stored procedures that you use to configure the connector. See
  [Set up data ingestion using SQL statements](ingestion.md) for more information.
* Views containing the logged messages and statistics for the connector. See
  [About Monitoring the Connector](monitoring.md) for more information.

## Connector application roles

As a Native Application, Snowflake Connector for ServiceNow® defines [application roles](../../developer-guide/native-apps/creating-setup-script.md).
They can be reviewed in [Role-based access control for connectors (ServiceNow)](application-roles.md).

## Sample installation scripts

The following example scripts demonstrate how to configure the Snowflake Connector for ServiceNow® using SQL worksheets.
This can help you quickly set up the connector in your environment and start using it.
Simply copy and paste the commands into the worksheet and fulfill the placeholders with your values.

> **Important:**
>
> It’s assumed that the application is already installed in the account as described here.

Before executing the commands, review the script and adjust to your needs:

Basic AuthOAuth Authorization CodeOAuth Client Credentials

```sqlexample
-- Specify values as required by your installation
SET application_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW'; -- use the same name as provided in the installation
SET connector_warehouse = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_WAREHOUSE';
SET servicenow_instance_domain = '<servicenow_instance>.service-now.com';

SET destination_database = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_DB';
SET destination_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_SCHEMA';

SET secret_database = 'CONNECTORS_SECRET';
SET secret_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW';
SET secret_database_schema = $secret_database || '.' || $secret_schema;
SET secret_fqn = $secret_database_schema || '.' || 'SECRET';

SET network_rule_fqn = $secret_database_schema || '.' || 'NETWORK_RULE';
SET external_access_integration_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_EXTERNAL_ACCESS_INTEGRATION';

SET destination_database_schema = $destination_database || '.' || $destination_schema;
SET servicenow_instance_url = 'https://' || $servicenow_instance_domain || '/';

-- Create essential objects
USE ROLE accountadmin;

CREATE WAREHOUSE IF NOT EXISTS IDENTIFIER($connector_warehouse)
   WAREHOUSE_SIZE = 'MEDIUM'
   AUTO_RESUME = TRUE;
CREATE DATABASE IF NOT EXISTS IDENTIFIER($secret_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($secret_database_schema);
CREATE DATABASE IF NOT EXISTS IDENTIFIER($destination_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($destination_database_schema);

-- Populate with your credentials
CREATE SECRET IDENTIFIER($secret_fqn)
   TYPE = PASSWORD
   USERNAME = '<servicenow_login>'
   PASSWORD = '<servicenow_password>';

-- None of the following commands should require any changes
CREATE NETWORK RULE IDENTIFIER($network_rule_fqn)
   MODE = 'EGRESS'
   TYPE = 'HOST_PORT'
   VALUE_LIST = ($servicenow_instance_domain);

CREATE PROCEDURE execute_immediate_create_ea_integration()
RETURNS VARIANT
EXECUTE AS caller
AS
BEGIN
   EXECUTE IMMEDIATE '
      CREATE EXTERNAL ACCESS INTEGRATION IDENTIFIER($external_access_integration_name)
      ALLOWED_NETWORK_RULES = ($network_rule_fqn)
      ALLOWED_AUTHENTICATION_SECRETS = ('  ||  $secret_fqn  ||  ') ENABLED = TRUE
   ';
END;
CALL execute_immediate_create_ea_integration();
DROP PROCEDURE IF EXISTS execute_immediate_create_ea_integration();

GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);
GRANT EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON WAREHOUSE IDENTIFIER($connector_warehouse) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($destination_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT CREATE TABLE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT CREATE VIEW ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON INTEGRATION IDENTIFIER($external_access_integration_name) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($secret_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($secret_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT READ ON SECRET IDENTIFIER($secret_fqn) TO APPLICATION IDENTIFIER($application_name);

CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION($application_name, true, $destination_database, $destination_schema);

USE APPLICATION IDENTIFIER($application_name);

-- Recommended to call one by one as the response might contain an error
CALL CONFIGURE_CONNECTOR({
   'warehouse': $connector_warehouse,
   'destination_database': $destination_database,
   'destination_schema': $destination_schema
});

CALL SET_CONNECTION_CONFIGURATION({
   'service_now_url': $servicenow_instance_url,
   'secret': $secret_fqn,
   'external_access_integration': $external_access_integration_name
});

-- Remove the 'journal_table' parameter if you don't want to track deleted records
CALL FINALIZE_CONNECTOR_CONFIGURATION({
   'journal_table': 'sys_audit_delete'
});
```

```sqlexample
-- Specify values as required by your installation
SET application_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW'; -- use the same name as provided in the installation
SET connector_warehouse = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_WAREHOUSE';
SET servicenow_instance_domain = '<servicenow_instance>.service-now.com';

SET destination_database = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_DB';
SET destination_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_SCHEMA';

SET secret_database = 'CONNECTORS_SECRET';
SET secret_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW';
SET secret_database_schema = $secret_database || '.' || $secret_schema;
SET secret_fqn = $secret_database_schema || '.' || 'SECRET';

SET network_rule_fqn = $secret_database_schema || '.' || 'NETWORK_RULE';
SET external_access_integration_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_EXTERNAL_ACCESS_INTEGRATION';
SET security_integration_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_SECURITY_INTEGRATION';

SET destination_database_schema = $destination_database || '.' || $destination_schema;
SET servicenow_instance_url = 'https://' || $servicenow_instance_domain || '/';
SET oauth_token_endpoint = $servicenow_instance_url || 'oauth_token.do';

-- Create essential objects
USE ROlE accountadmin;

CREATE WAREHOUSE IF NOT EXISTS IDENTIFIER($connector_warehouse)
   WAREHOUSE_SIZE = 'MEDIUM'
   AUTO_RESUME = TRUE;
CREATE DATABASE IF NOT EXISTS IDENTIFIER($secret_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($secret_database_schema);
CREATE DATABASE IF NOT EXISTS IDENTIFIER($destination_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($destination_database_schema);

-- Populate with your credentials
CREATE SECURITY INTEGRATION IDENTIFIER($security_integration_name)
   TYPE = API_AUTHENTICATION
   AUTH_TYPE = OAUTH2
   OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
   OAUTH_CLIENT_ID = '<client_id>'
   OAUTH_CLIENT_SECRET = '<client_secret>'
   OAUTH_TOKEN_ENDPOINT = $oauth_token_endpoint
   ENABLED = TRUE;

CREATE SECRET IDENTIFIER($secret_fqn)
   TYPE = OAUTH2
   OAUTH_REFRESH_TOKEN = '<refresh_token>'
   OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '<expiry time>'
   API_AUTHENTICATION = $security_integration_name;

-- None of the following commands should require any changes
CREATE NETWORK RULE IDENTIFIER($network_rule_fqn)
   MODE = 'EGRESS'
   TYPE = 'HOST_PORT'
   VALUE_LIST = ($servicenow_instance_domain);

CREATE PROCEDURE execute_immediate_create_ea_integration()
RETURNS VARIANT
EXECUTE AS caller
AS
BEGIN
   EXECUTE IMMEDIATE '
      CREATE EXTERNAL ACCESS INTEGRATION IDENTIFIER($external_access_integration_name)
      ALLOWED_NETWORK_RULES = ($network_rule_fqn)
      ALLOWED_AUTHENTICATION_SECRETS = ('  ||  $secret_fqn  ||  ') ENABLED = TRUE
   ';
END;
CALL execute_immediate_create_ea_integration();
DROP PROCEDURE IF EXISTS execute_immediate_create_ea_integration();

GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);
GRANT EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON WAREHOUSE IDENTIFIER($connector_warehouse) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($destination_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT CREATE TABLE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT CREATE VIEW ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON INTEGRATION IDENTIFIER($external_access_integration_name) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($secret_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($secret_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT READ ON SECRET IDENTIFIER($secret_fqn) TO APPLICATION IDENTIFIER($application_name);

CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION($application_name, true, $destination_database, $destination_schema);

USE APPLICATION IDENTIFIER($application_name);

-- Recommended to call one by one as the response might contain an error
CALL CONFIGURE_CONNECTOR({
   'warehouse': $connector_warehouse,
   'destination_database': $destination_database,
   'destination_schema': $destination_schema
});

CALL SET_CONNECTION_CONFIGURATION({
   'service_now_url': $servicenow_instance_url,
   'secret': $secret_fqn,
   'external_access_integration': $external_access_integration_name
});

-- Remove the 'journal_table' parameter if you don't want to track deleted records
CALL FINALIZE_CONNECTOR_CONFIGURATION({
   'journal_table': 'sys_audit_delete'
});
```

```sqlexample
-- Specify values as required by your installation
SET application_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW'; -- use the same name as provided in the installation
SET connector_warehouse = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_WAREHOUSE';
SET servicenow_instance_domain = '<servicenow_instance>.service-now.com';

SET destination_database = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_DB';
SET destination_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_DEST_SCHEMA';

SET secret_database = 'CONNECTORS_SECRET';
SET secret_schema = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW';
SET secret_database_schema = $secret_database || '.' || $secret_schema;
SET secret_fqn = $secret_database_schema || '.' || 'SECRET';

SET network_rule_fqn = $secret_database_schema || '.' || 'NETWORK_RULE';
SET external_access_integration_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_EXTERNAL_ACCESS_INTEGRATION';
SET security_integration_name = 'SNOWFLAKE_CONNECTOR_FOR_SERVICENOW_SECURITY_INTEGRATION';

SET destination_database_schema = $destination_database || '.' || $destination_schema;
SET servicenow_instance_url = 'https://' || $servicenow_instance_domain || '/';
SET oauth_token_endpoint = $servicenow_instance_url || 'oauth_token.do';

-- Create essential objects
USE ROlE accountadmin;

CREATE WAREHOUSE IF NOT EXISTS IDENTIFIER($connector_warehouse)
   WAREHOUSE_SIZE = 'MEDIUM'
   AUTO_RESUME = TRUE;
CREATE DATABASE IF NOT EXISTS IDENTIFIER($secret_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($secret_database_schema);
CREATE DATABASE IF NOT EXISTS IDENTIFIER($destination_database);
CREATE SCHEMA IF NOT EXISTS IDENTIFIER($destination_database_schema);

-- Populate with your credentials
CREATE SECURITY INTEGRATION IDENTIFIER($security_integration_name)
   TYPE = API_AUTHENTICATION
   AUTH_TYPE = OAUTH2
   OAUTH_CLIENT_AUTH_METHOD = CLIENT_SECRET_POST
   OAUTH_CLIENT_ID = '<client_id>'
   OAUTH_CLIENT_SECRET = '<client_secret>'
   OAUTH_TOKEN_ENDPOINT = $oauth_token_endpoint
   OAUTH_GRANT = 'CLIENT_CREDENTIALS'
   OAUTH_ALLOWED_SCOPES = ('useraccount')
   ENABLED = TRUE;

CREATE SECRET IDENTIFIER($secret_fqn)
   TYPE = OAUTH2
   API_AUTHENTICATION = $security_integration_name
   OAUTH_SCOPES=('useraccount');

-- None of the following commands should require any changes
CREATE NETWORK RULE IDENTIFIER($network_rule_fqn)
   MODE = 'EGRESS'
   TYPE = 'HOST_PORT'
   VALUE_LIST = ($servicenow_instance_domain);

CREATE PROCEDURE execute_immediate_create_ea_integration()
RETURNS VARIANT
EXECUTE AS caller
AS
BEGIN
   EXECUTE IMMEDIATE '
      CREATE EXTERNAL ACCESS INTEGRATION IDENTIFIER($external_access_integration_name)
      ALLOWED_NETWORK_RULES = ($network_rule_fqn)
      ALLOWED_AUTHENTICATION_SECRETS = ('  ||  $secret_fqn  ||  ') ENABLED = TRUE
   ';
END;
CALL execute_immediate_create_ea_integration();
DROP PROCEDURE IF EXISTS execute_immediate_create_ea_integration();

GRANT EXECUTE TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);
GRANT EXECUTE MANAGED TASK ON ACCOUNT TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON WAREHOUSE IDENTIFIER($connector_warehouse) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($destination_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT CREATE TABLE ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT CREATE VIEW ON SCHEMA IDENTIFIER($destination_database_schema) TO APPLICATION IDENTIFIER($application_name);

GRANT USAGE ON INTEGRATION IDENTIFIER($external_access_integration_name) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON DATABASE IDENTIFIER($secret_database) TO APPLICATION IDENTIFIER($application_name);
GRANT USAGE ON SCHEMA IDENTIFIER($secret_database_schema) TO APPLICATION IDENTIFIER($application_name);
GRANT READ ON SECRET IDENTIFIER($secret_fqn) TO APPLICATION IDENTIFIER($application_name);

CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION($application_name, true, $destination_database, $destination_schema);

USE APPLICATION IDENTIFIER($application_name);

-- Recommended to call one by one as the response might contain an error
CALL CONFIGURE_CONNECTOR({
   'warehouse': $connector_warehouse,
   'destination_database': $destination_database,
   'destination_schema': $destination_schema
});

CALL SET_CONNECTION_CONFIGURATION({
   'service_now_url': $servicenow_instance_url,
   'secret': $secret_fqn,
   'external_access_integration': $external_access_integration_name
});

-- Remove the 'journal_table' parameter if you don't want to track deleted records
CALL FINALIZE_CONNECTOR_CONFIGURATION({
   'journal_table': 'sys_audit_delete'
});
```

## Next steps

After installing and configuring the connector, perform the steps described in [Set up data ingestion for your ServiceNow® data](ingestion.md).

---
title: Install and configure the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-installing.md
section: Connectors & Drivers
---

# Install and configure the Snowflake Connector for Google Analytics Aggregate Data

This topic provides information about installing and configuring the Snowflake Connector for Google Analytics Aggregate Data
through Snowsight.

## Install the Snowflake Connector for Google Analytics Aggregate Data

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for Google Analytics Aggregate Data, and then select the tile for the connector.
4. On the page for the Snowflake Connector for Google Analytics Aggregate Data, select Get.

   A dialog box is displayed.
5. Under Options, for Application name, enter a name for the database to use for the connector instance.

   This database is created for you automatically.
6. For Warehouse used for installation, select the warehouse to use for installing the connector.

   > > **Note:**
   > >
   > > This is not the same warehouse that is used by the connector to synchronize data from Google Analytics.
   > > In a later procedure, you create a separate warehouse for this purpose.

## Configure the Snowflake Connector for Google Analytics Aggregate Data

> **Note:**
>
> Snowflake Connector for Google Analytics Aggregate Data can also be configured using SQL. Configuration using SQL is considered
> an advanced topic. For more information see [Configure the Snowflake Connector for Google Analytics Aggregate Data using SQL](gaad-connector-configuring-sql.md).

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with either the ACCOUNTADMIN role or any other role that meets the following requirements:

   * You must have these account-level privileges:

     + EXECUTE TASK with the grant option
     + EXECUTE MANAGED TASK with the grant option
   * EVENT_TABLE must be enabled on the account.
   * For warehouse access, you must have at least one of the following privileges:

     + The CREATE WAREHOUSE privilege on the account
     + The OWNERSHIP privilege on the warehouse
     + The USAGE privilege on the warehouse (with the grant option)
   * For database access, you must have at least one of the following privileges:

     + The CREATE DATABASE privilege on the account
     + The OWNERSHIP privilege on the database
     + The USAGE privilege on the database (with the grant option)
   * For schema access, you must have at least one of the following privileges:

     + The CREATE DATABASE privilege on the account
     + The OWNERSHIP privilege on the database
     + The USAGE privilege on the database (with the grant option)
     + The CREATE SCHEMA privilege on the database
     + The USAGE, CREATE TABLE, CREATE VIEW privileges on the schema (with the grant option)
   * Optional: For role access, you can create a new or select an existing role that will be
     assigned the DATA_READER application role. If you want to create a new role, then you need the CREATE ROLE privilege on your account.
     However, this is not necessary to complete the configuration.
2. In the navigation menu, select Catalog » Apps.
3. Select the Snowflake Connector for Google Analytics Aggregate Data.

   The configuration wizard starts.
4. Ensure that all prerequisites on the list are met, and mark them as done.
5. Click Start configuration.
6. Populate the following fields:

   > **Note:**
   >
   > By default, the fields are set to the names of objects that are created when you configure the connector.
   > Snowflake recommends using new objects for these fields. However, you can specify the names of existing objects (for example, if you are reinstalling the connector).

   | Field | Description |
   | --- | --- |
   | Warehouse | Enter the identifier for a new, dedicated virtual warehouse for the connector.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new `X-Small` warehouse with the specified name. |
   | Destination Database | Enter the identifier for a new database that will contain the schema with the tables for the Google Analytics data in Snowflake. Data downloaded from Google Analytics will be stored here.  Specify a name that is unique for your account. The name of the database must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new database with the specified name. |
   | Destination Schema | Enter the identifier for a new schema that will contain the Google Analytics data in Snowflake.  The Snowflake Connector for Google Analytics Aggregate Data ingests Google Analytics data into tables in this schema.  The name of the schema must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new schema with the specified name. |
   | Role | Enter the identifier for a new custom role for the connector.  This role is granted read access to tables and views that contain the Google Analytics data ingested by the connector.  The name of the role must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new role with the specified name. |
7. Select Configure at the bottom of the screen.

   The configuration process can take several minutes. When it is finished, the wizard advances to **Authentication**.
8. To specify authentication, follow one of these options:

   > **Note:**
   >
   > The Snowflake Connector for Google Analytics Aggregate Data supports two methods of authenticating in Google Analytics: **service accounts** and **OAuth2**. Each method requires
   > additional configuration in your Google Cloud project. For more information, see [Configure service account authentication for Google Cloud](gaad-connector-create-service-account-key.md) and [Configure OAuth authentication for Google Cloud](gaad-connector-create-client-id.md).

   * For a **service account**, populate the following fields:

   | Field | Description |
   | --- | --- |
   | Client email | Google service account email that was generated during service account creation in your Google Cloud project |
   | Private key | Private key that was generated during service account creation in your Google Cloud project  Ensure that the **—–BEGIN PRIVATE KEY—–**, **—–END PRIVATE KEY—–**, and **\n** symbols are removed. |

   * For **Oauth2**, populate the following fields:

   | Field | Description |
   | --- | --- |
   | Client id | Client ID that was generated in your Google Cloud project |
   | Client secret | Client secret that was generated for the client ID |

   If you’re not signed in as a user with the ACCOUNTADMIN role, ensure that you meet the following requirements:

   * You must have the CREATE INTEGRATION privilege.
   * If integrations were previously created by other roles, then the ownership of those integrations must to be transferred to your role.
   * If the CONNECTORS_SECRET database doesn’t exist, then you need the CREATE DATABASE privilege.
   * If CONNECTORS_SECRET database exists but was created by another role, then you need these privileges:

     + USAGE WITH GRANT OPTION
     + CREATE SCHEMA WITH GRANT OPTION
   * If CONNECTORS_SECRET.APP_NAME schema exists but was created by another role, then you need these privileges:

     + USAGE WITH GRANT OPTION
     + CREATE SECRET
     + CREATE NETWORK RULE
   * If CONNECTORS_SECRET.APP_NAME.SECRET exists but was created by another role, then its ownership needs to be transferred to your role.
   * If CONNECTORS_SECRET.APP_NAME.NETWORK_RULE exists but was created by another role, then its ownership needs to be transferred to your role.
9. Select Connect.

   If you selected **Oauth2** authentication, the Google OAuth authentication dialog will open.
10. Optional: Complete the Google OAuth authentication dialog.

    After successfully connecting, the connector verifies whether it can access Google Analytics data. If there are any errors, you will be provided with additional instructions.

After the process is completed successfully, ingestion configuration can begin. For more information, see [Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance](gaad-connector-setting-up-data.md).

---
title: Install multi application instances for connectors (GAAD)
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-multi-app-instances.md
section: Connectors & Drivers
---

# Install multi application instances for connectors (GAAD)

You can install multiple instances of the same connector application on your Snowflake account.

To install an additional application instance, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the application for which you want to install another instance. The application details page appears.
4. Click Add instance. The installation dialog appears.
5. Provide the instance name and select the warehouse to be used during the installation.
6. Select Get to begin the installation process.

Adding connector instances can take several minutes. When the installation process completes, you get an email notification.

> **Attention:**
>
> To avoid ingested data corruption, during connector configuration, always use a database schema that is
> different from all other native applications.

To access your installed connector application instances, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select your application instance to access it.

---
title: Install multi application instances for connectors (GARD)
source: https://docs.snowflake.com/en/connectors/google/gard/gard-multi-app-instances.md
section: Connectors & Drivers
---

# Install multi application instances for connectors (GARD)

You can install multiple instances of the same connector application on your Snowflake account.

To install an additional application instance, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the application for which you want to install another instance. The application details page appears.
4. Click Add instance. The installation dialog appears.
5. Provide the instance name and select the warehouse to be used during the installation.
6. Select Get to begin the installation process.

Adding connector instances can take several minutes. When the installation process completes, you get an email notification.

> **Attention:**
>
> To avoid ingested data corruption, during connector configuration, always use a database schema that is
> different from all other native applications.

To access your installed connector application instances, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select your application instance to access it.

---
title: Install multi application instances for connectors (ServiceNow)
source: https://docs.snowflake.com/en/connectors/servicenow/multi-app-instances.md
section: Connectors & Drivers
---

# Install multi application instances for connectors (ServiceNow)

You can install multiple instances of the same connector application on your Snowflake account.

To install an additional application instance, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the application for which you want to install another instance. The application details page appears.
4. Click Add instance. The installation dialog appears.
5. Provide the instance name and select the warehouse to be used during the installation.
6. Select Get to begin the installation process.

Adding connector instances can take several minutes. When the installation process completes, you get an email notification.

> **Attention:**
>
> To avoid ingested data corruption, during connector configuration, always use a database schema that is
> different from all other native applications.

To access your installed connector application instances, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select your application instance to access it.

---
title: Installation and configuration tasks for the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-tasks.md
section: Connectors & Drivers
---

# Installation and configuration tasks for the Snowflake Connector for Google Analytics Aggregate Data

Working with the Snowflake Connector for Google Analytics Aggregate Data includes the following common tasks:

* Installing and configuring the connector
* Setting up data ingestion and reviewing data
* Monitoring and managing the connector
* Troubleshooting

Review each task before installing and configuring an instance of the Snowflake Connector for Google Analytics Aggregate Data.

## Installing and configuring the connector

Perform the following tasks to install and configure the Snowflake Connector for Google Analytics Aggregate Data:

| Task | Description |
| --- | --- |
| [Preparing your Google Analytics and Google Cloud accounts](gaad-connector-prereqs.md) | Before installing the Snowflake Connector for Google Analytics Aggregate Data, set up your Google accounts and meet any common prerequisites. |
| [Configure service account authentication for Google Cloud](gaad-connector-create-service-account-key.md) | To provide service account credentials, first create them in your Google Cloud project. |
| [Configure OAuth authentication for Google Cloud](gaad-connector-create-client-id.md) | To provide the OAuth client ID and secret, create them in your Google Cloud project. |
| [Install and configure the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-installing.md) | Install the connector. |
| [Configure the Snowflake Connector for Google Analytics Aggregate Data using SQL](gaad-connector-configuring-sql.md) | Configure the connector using SQL. |

After installing and configuring the Snowflake Connector for Google Analytics Aggregate Data you must set up basic management as described in [Manage the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-managing.md).

## Monitoring and managing the connector

Review and perform the following tasks to provide routine management and monitoring of the connector:

| Task | Description |
| --- | --- |
| [Manage the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-managing.md) | This topic describes typical tasks you might need to perform after installing and configuring the connector. |
| [Monitoring the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-monitoring.md) | This topic describes how to monitor the state of the Snowflake Connector for Google Analytics Aggregate Data. |

## Setting up data ingestion and reviewing data

After installing and configuring the Snowflake Connector for Google Analytics Aggregate Data, you must configure data ingestion and can then begin accessing data.

Perform the following tasks to configure data ingestion and begin accessing ServiceNow data:

| Task | Description |
| --- | --- |
| [Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance](gaad-connector-setting-up-data.md) | Explains how the Snowflake Connector for Google Analytics Aggregate Data ingests data. |
| [Accessing fetched Google Analytics data](gaad-connector-accessing-data.md) | Explains how to access ingested data. |

## Troubleshooting

For information about troubleshooting, see [Troubleshooting the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-troubleshooting.md).

---
title: Installing and configuring the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-installing.md
section: Connectors & Drivers
---

# Installing and configuring the Snowflake Connector for Google Analytics Raw Data

This topic provides information on installing and configuring the Snowflake Connector for Google Analytics Raw Data
through Snowsight.

## Installing the Snowflake Connector for Google Analytics Raw Data

To install the connector, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for Google Analytics Raw Data, then select the tile for the connector.
4. In the page for the Snowflake Connector for Google Analytics Raw Data, select Get.

   This displays a dialog box that you use to begin the initial part of the installation process.

   In the dialog box configure the following:

   1. In the Options->Application name field, enter the database to use as the database for the connector
      instance. This database is created for you automatically.
   2. In the Warehouse used for installation field, select the warehouse that you want to use for
      installing the connector.

      > **Note:**
      >
      > This is not the same warehouse that is used by the connector to synchronize data from Google Analytics.
      > In a later step, you will create a separate warehouse for this purpose.
   3. Select Get.
5. Select Open.

   The dialog box closes, and the Snowflake Connector for Google Analytics Raw Data page displays the UI
   for configuring and managing the connector.

## Configuring the Snowflake Connector for Google Analytics Raw Data

> **Note:**
>
> Snowflake Connector for Google Analytics Raw Data can also be configured using SQL. Configuration using SQL is considered
> an advanced topic. For more information see [Configuring the Snowflake Connector for Google Analytics Raw Data using SQL](gard-connector-configuring-sql.md).

To configure the connector, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with either the ACCOUNTADMIN role or any other role that meets the following requirements:

   * You must have these account-level privileges:

     + EXECUTE TASK with the grant option
     + EXECUTE MANAGED TASK with the grant option
   * EVENT_TABLE must be enabled on the account.
   * For warehouse access, you must have at least one of the following privileges:

     + The CREATE WAREHOUSE privilege on the account
     + The OWNERSHIP privilege on the warehouse
     + The USAGE privilege on the warehouse (with the grant option)
   * For database access, you must have at least one of the following privileges:

     + The CREATE DATABASE privilege on the account
     + The OWNERSHIP privilege on the database
     + The USAGE privilege on the database (with the grant option)
   * For schema access, you must have at least one of the following privileges:

     + The CREATE DATABASE privilege on the account
     + The OWNERSHIP privilege on the database
     + The USAGE privilege on the database (with the grant option)
     + The CREATE SCHEMA privilege on the database
     + The USAGE, CREATE TABLE, CREATE VIEW privileges on the schema (with the grant option)
   * Optional: For role access, you can create a new or select an existing role that will be
     assigned the DATA_READER application role. If you want to create a new role, then you need the CREATE ROLE privilege on your account.
     However, this is not necessary to complete the configuration.
2. In the navigation menu, select Catalog » Apps.
3. Select the Snowflake Connector for Google Analytics Raw Data.

   The configuration wizard starts.
4. Prerequisites

   1. Make sure all prerequisites from the list are met and mark them done.
   2. Click Start configuration
5. Configure warehouse, database, schema and role

   > **Note:**
   >
   > By default, the fields are set to the names of objects that are created when you configure the connector.
   > Snowflake recommends using new objects for these fields. However, you can specify the names of existing objects,
   > if needed (e.g. if you are reinstalling the connector).

   Populate the following fields and select Configure at the bottom of the screen:

   | Field | Description |
   | --- | --- |
   | Warehouse | Enter the identifier for a new, dedicated virtual warehouse for the connector or select an existing one.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation.  The configuration process creates a new `X-Small` warehouse with the specified name.  Alternatively you can select an existing warehouse. |
   | Destination Database | Identifier for a new database that will contain the schema with the tables for the Google Analytics data in Snowflake. Data downloaded from Google Analytics will land here.  Specify a name that is unique for your account. The name of the database must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new database with the specified name.  Alternatively you can select an existing database. |
   | Destination Schema | Identifier for a new schema that will contain the Google Analytics data in Snowflake.  The Snowflake Connector for Google Analytics Raw Data ingests Google Analytics data into tables in this schema.  The name of the schema must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The configuration process creates a new schema with the specified name.  Alternatively you can select an existing schema. |
   | Role | Identifier for a new custom role for the connector.  Specify a name that is unique for your account. The name of the role must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  The role is an account-level role that will have read access to the ingested data.  Alternatively you can select an existing role. |

   If existing destination database and schema were provided, ownership of the
   existing regular tables and views will be transferred to the Snowflake Connector for Google Analytics Raw Data. That excludes, for example,
   external tables and materialized views. Moreover **nothing** will be transferred in managed schemas.

   It can take some time for the configuration process to complete. When the configuration process finishes successfully,
   the configuration wizard advances to `Authentication`.
6. Configure authentication

   The Snowflake Connector for Google Analytics Raw Data support two authentication methods - **OAuth** and **Service Accounts**.
   Each of methods requires additional configuration in your GCP project.

   For more information on how to configure each authentication see:

   * [Configuring service account authentication for Google Cloud Platform (GCP)](gard-connector-create-service-account-key.md)
   * [Configuring OAuth authentication for Google Cloud Platform (GCP)](gard-connector-create-client-id.md)

   If using authentication method **Service Account**, provide a JSON file with Service Account credentials.

   Alternatively you can populate the following fields:

   | Field | Description |
   | --- | --- |
   | Client email | Google service account email which was generated during service account creation process in Google Cloud Platform project. |
   | Private key | Private key which was generated during service account creation process in Google Cloud Platform project. |

   Ensure that you have removed `-----BEGIN PRIVATE KEY-----`, `-----END PRIVATE KEY-----`, and `\\n`.

   If using authentication method **Oauth2**, populate the following fields:

   | Field | Description |
   | --- | --- |
   | Client id | Client ID generated in Google Cloud Platform project. |
   | Client secret | Client secret ID generated in Google Cloud Platform project. |

   If you’re not signed in as a user with the ACCOUNTADMIN role, ensure that you meet the following requirements:

   * You must have the CREATE INTEGRATION privilege.
   * If integrations were previously created by other roles, then the ownership of those integrations must to be transferred to your role.
   * If the CONNECTORS_SECRET database doesn’t exist, then you need the CREATE DATABASE privilege.
   * If CONNECTORS_SECRET database exists but was created by another role, then you need these privileges:

     + USAGE WITH GRANT OPTION
     + CREATE SCHEMA WITH GRANT OPTION
   * If CONNECTORS_SECRET.APP_NAME schema exists but was created by another role, then you need these privileges:

     + USAGE WITH GRANT OPTION
     + CREATE SECRET
     + CREATE NETWORK RULE
   * If CONNECTORS_SECRET.APP_NAME.SECRET exists but was created by another role, then its ownership needs to be transferred to your role.
   * If CONNECTORS_SECRET.APP_NAME.NETWORK_RULE exists but was created by another role, then its ownership needs to be transferred to your role.

   Select Connect

   If you have selected **Oauth2** authentication, you will be presented with the Google OAuth2 authentication dialog flow.

   In the dialog, log in to Google to complete the Google OAuth2 authentication flow.

   It can take some time for the authentication process to complete.
7. Validate source

> After successfully connection, the conenctor will verify that it can access the Google Analytics data. On error, the connector will guide you with additional instruction.
>
> If the process completes successfully you can start configuring ingestion.
> For more information see [Setting up data ingestion for your Snowflake Connector for Google Analytics Raw Data](gard-connector-setting-up-data.md)

---
title: Manage the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-managing.md
section: Connectors & Drivers
---

# Manage the Snowflake Connector for Google Analytics Aggregate Data

This topic describes typical tasks you might need to perform after installing and configuring the connector.

## Setting up alerts

To set up alerts, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Aggregate Data, then select the tile for the connector.
4. Go to the Settings section and then select Email alerts from the menu on the left.
5. In the Email Address field, provide a Snowflake verified email address.

   > **Note:**
   >
   > You must specify an email address that is associated with the Snowflake account.
6. In the Email Frequency field, select how often you would like to receive alerts:

   * Immediately - you will receive notifications after each ingestion failure but at most every 30 minutes.
   * Once per day - you will receive notifications once a day at 12PM UTC.
   > **Note:**
   >
   > Alerts are sent only when an ingestion failure occurs.
7. Select Save changes to start receiving email alerts.

### Disabling alerts

To stop receiving alerts, select Stop receiving alerts in the email alerts configuration page.

## Upgrading the connector

The connector upgrades are managed automatically by the provider of the application.

## Re-authentication of the Connector

In order to change the secret, external access integration, or the security integration used by the connector without re-installation,
you need to execute the `UPDATE_CONNECTION_CONFIGURATION` procedure defined in the `PUBLIC` schema.
Ensure that all of the new objects are defined as described in [Configure the Snowflake Connector for Google Analytics Aggregate Data using SQL](gaad-connector-configuring-sql.md) and that the connector has all of the required grants.

```sqlexample
USE ROLE accountadmin;
CALL UPDATE_CONNECTION_CONFIGURATION(
    PARSE_JSON('{"external_access_integration": "<external access integration name>", "secret": "<full path to the secret>", "security_integration": "<security integration name>"}')
);
```

Replace the placeholders with actual values, as in the following example.

```sqlexample
USE ROLE accountadmin;
CALL UPDATE_CONNECTION_CONFIGURATION(
    PARSE_JSON('{"external_access_integration": "SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA_EXTERNAL_ACCESS_INTEGRATION", "secret": "CONNECTORS_SECRET.SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA.SECRET", "security_integration": "SNOWFLAKE_CONNECTOR_FOR_GOOGLE_ANALYTICS_AGGREGATE_DATA_SECURITY_INTEGRATION"}')
);
```

> **Note:**
>
> Values passed to UPDATE_CONNECTION_CONFIGURATION should be unqualified, uppercase identifiers.

## Changing the warehouse for the Connector

It is possible to change the warehouse that the Snowflake Connector for Google Analytics Aggregate Data uses for its internal tasks without reinstalling the connector.
First, make sure that the connector is paused. It can be done either via UI or using the `PAUSE_CONNECTOR` procedure.
Then, you need to grant the connector access to the new warehouse:

```sqlsyntax
GRANT USAGE ON WAREHOUSE <new_warehouse_name> TO APPLICATION snowflake_connector_for_google_analytics_aggregate_data;
```

After the access is granted, execute the `UPDATE_WAREHOUSE` procedure defined in the `PUBLIC` schema:

```sqlexample
CALL UPDATE_WAREHOUSE('<new_warehouse_name>');
```

Replace the placeholder with the actual value, as in the following example.

```sqlexample
CALL UPDATE_WAREHOUSE('NEW_WH');
```

> **Note:**
>
> Values passed to UPDATE_WAREHOUSE should be unqualified, uppercase identifiers.

## Uninstalling the connector

Removing the connector database does not delete the ingested data that is stored in a separate database or the
objects that were created during the installation performed using Snowsight.

> **Note:**
>
> To see objects created during the installation, select Snowflake Connector for Google Analytics Aggregate Data » Settings » Objects.

To uninstall the connector, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Aggregate Data.
4. Select Uninstall.

---
title: Manage the Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/manage.md
section: Connectors & Drivers
---

# Manage the Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes how to manage your Snowflake Connector for SharePoint after you have [installed and configured it](setup.md).

You can perform the following tasks to manage the connector:

* Modify the refresh frequency
* Pause or resume the connector
* View the list of source folders

## Modify the refresh frequency

You can modify the refresh frequency to refresh content, metadata and permissions every day, every week, or every month.

To modify the refresh frequency, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for SharePoint and select it.
4. In the connector navigation menu, select Data sync » Manage Content Refresh.
5. Select Edit.
6. In the Refresh drop-down list, select either every day, every week, or every month.

You can perform refresh on-demand by running the following SQL command:

```sqlexample
CALL PUBLIC.REFRESH_SHAREPOINT_CONTENT();
```

> **Note:**
>
> You must have been assigned the role ACCOUNTADMIN to call the PUBLIC.REFRESH_SHAREPOINT_CONTENT procedure.

## Pause or resume the connector

Pausing the connector only pauses the data ingestion from SharePoint and
the processing with the document parsing function of Cortex, and not the Cortex Search service.
The Cortex Search service continues to process previously ingested data. Pausing the connector may
still result in credits consumed by Cortex Search.

Once you resume the connector, the data ingestion and processing restarts and the connector
fetches all changes to files, metadata and permissions since the previous refresh.
The connector also refreshes Cortex Search to use the latest content and permissions.

To pause or resume the connector, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for SharePoint and select it.
4. In the connector navigation menu, select Data sync » Manage Content Refresh.
5. Select Pause or Resume.

## View source folders list

To view the list of folders from which the connector ingests your data, use the following command:

```sqlexample
SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_SHAREPOINT.PUBLIC.AVAILABLE_FOLDERS;
```

## Next step

[Monitor the Snowflake Connector for SharePoint](monitor.md).

---
title: Managing the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-managing.md
section: Connectors & Drivers
---

# Managing the Snowflake Connector for Google Analytics Raw Data

This topic describes typical tasks you might need to perform after installing and configuring the connector.

## Changing the ingestion interval for the connector

The connector periodically checks and downloads data from BigQuery. The check is done every 8 hours by default, but it can be changed. If you want to set the new interval
for checking and downloading data, please use the `CONFIGURE_INGESTION_INTERVAL` procedure defined in the `PUBLIC` schema:

```sqlsyntax
CALL CONFIGURE_INGESTION_INTERVAL(<interval_configuration_name>)
```

Possible interval configurations along with cron definitions which are used under the hood:

```none
EVERY_15_MINUTES   -   */15 * * * * UTC
EVERY_30_MINUTES   -   */30 * * * * UTC
EVERY_1_HOUR       -   0 * * * * UTC
EVERY_4_HOURS      -   0 3/4 * * * UTC
EVERY_8_HOURS      -   0 3/8 * * * UTC
EVERY_1_DAY        -   0 3 * * * UTC
```

> **Note:**
>
> It is not possible to set custom cron expression.

Example usage:

```sqlsyntax
CALL CONFIGURE_INGESTION_INTERVAL('EVERY_1_HOUR')
```

The list of supported intervals can be also printed using the `LIST_SUPPORTED_INGESTION_INTERVALS` procedure defined in the `PUBLIC` schema:

```sqlsyntax
CALL LIST_SUPPORTED_INGESTION_INTERVALS()
```

## Setting up alerts

To set up alerts, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Raw Data, then select the tile for the connector.
4. In the page for the Snowflake Connector for Google Analytics Raw Data, go to the Settings section and then select Email alerts from the menu on the left.

   This displays a page for the email alerts configuration.
5. In the Email Address field, provide a Snowflake verified email address.

> **Note:**
>
> You must specify an email address that is associated with the Snowflake account.

1. In the Email Frequency field, select how often you would like to receive alerts:

* Immediately - you will receive notifications according to the values set in table synchronization.
* Once per day - you will receive notifications once a day at 12PM UTC.

> **Note:**
>
> Alerts are sent only when an invalid action (such as an error) occurs.

1. Select Save changes to start receiving email alerts.

### Disabling alerts

To stop receiving alerts, select Stop receiving alerts in the email alerts configuration page.

## Upgrading the connector

The connector upgrades are managed automatically by the provider of the application.

## Scaling the Connector

You should start your work with the Connector using a `X-Small` as it will most likely give you a sufficient performance.
However, if you are experiencing any Connector slowdowns, you may want to try gradually increasing the warehouse size and evaluating
whether you see any performance boosts at each step. Whether the Connector gains anything from scaling the warehouse depends on
a few factors, such as the number of properties or the amount of data each of them has.

For insights on how to resize the warehouse see Resizing a warehouse in [Working with warehouses](../../../user-guide/warehouses-tasks.md).

> **Note:**
>
> If you are constantly experiencing ingestion errors related to insufficient memory and are already using a `LARGE` or `X-LARGE`
> warehouse, then you can try to resolve this issue by decreasing the `MAX_CONCURRENCY_LEVEL` parameter on a warehouse from 8 (default) to 4.

## Changing the warehouse for the Connector

It is possible to change the warehouse that the Snowflake Connector for Google Analytics Raw Data uses for its internal tasks without reinstalling the connector.
First, make sure that the Connector is paused. It can be done eiter via UI or using the `PAUSE_CONNECTOR` procedure.
Then, you need to grant the Connector access to the new warehouse:

```sqlsyntax
GRANT USAGE ON WAREHOUSE <new_warehouse_name> TO APPLICATION snowflake_connector_for_google_analytics_raw_data;
```

After the access is granted, execute the `UPDATE_WAREHOUSE` procedure defined in the `PUBLIC` schema:

```sqlsyntax
CALL UPDATE_WAREHOUSE('<new_warehouse_name');
```

## Re-authentication of the Connector

In order to change the secret, external access integration or the security integration used by the connector without re-installation,
you need to execute the `UPDATE_CONNECTION` procedure defined in the `PUBLIC` schema.
Ensure, that all of the new objects are defined as described in [Configuring the Snowflake Connector for Google Analytics Raw Data using SQL](gard-connector-configuring-sql.md) and that the connector has all of the required grants.

```sqlsyntax
CALL UPDATE_CONNECTION('<new external access integration>', '<new secret>', '<new security integration>');
```

## Automatically disabling inaccessible Google Analytics properties

The Connector has a mechanism to automatically disable inaccessible Google Analytics properties in order to prevent unnecessary
costs caused by attempting ingestions for data which does not exist indefinitely and alarm you that data is not being ingested anymore.
The property is considered inaccessible and might be automatically disabled if data ingestions have been failing for the last 7 days.

## Proceeding during disaster recovery and failover

If you want to ensure that the connector will be able to continue data ingestion during a deployment outage, you need to
set up the sink database failover to a replica account. For details, see [Failing over databases across multiple accounts](../../../user-guide/database-failover-config.md).

Moreover, after an outage you need to manually install the Snowflake Connector for Google Analytics Raw Data on your replica account, because the connector itself can not be replicated.
After the installation it will synchronize itself with the replicated sink database.

> **Note:**
>
> In order to prevent data corruption you can not have two connector instances, one on a primary account and one on a replica account
> ingesting data to the sink database at the same time.

When a deployment outage occurs and your sink database fails over to a replica account, perform the following steps:

1. Sign in to your secondary account, where the sink database is replicated.
2. Install the Snowflake Connector for Google Analytics Raw Data on your secondary account. The connector will synchronize itself with the replicated sink database. The instance
   on your primary account goes into a read-only state after an outage, so data will not be corrupted at this point.
3. If you want to go back to the primary account after the deployment is available again, you need to first drop both connectors. It’s necessary to ensure a consistent connector state.
4. Replicate the data back from the secondary account to the primary one using the replication mechanism.
5. Reinstall the connector on a primary account once the data in sink table synchronizes with the sink table on your secondary account.

## Updating data ingestion options

You can use the `UPDATE_INGESTION_OPTIONS` procedure defined in the `PUBLIC` schema to modify default ingestion options
for certain properties. This procedure allows you to change the following:

> * `EXCLUDE_NULLS` - Remove fields containing null values from the ingested data. Setting this value to `TRUE`
>   can improve the data ingestion throughput. The default value is `FALSE`.
> * `DISABLE_AUTO_RELOADS` - Disables auto reloading data. For more details about auto reload see [Data ingestion model for the Snowflake Connector for Google Analytics Raw Data](gard-connector-data-ingestion-model.md).
>   Setting this value to `TRUE` can reduce credit consumption, but late data won’t be ingested into Snowflake. This property
>   cannot be set to `true` for the `FRESH_DAILY` export type. The default value is `FALSE`.
> * `ENABLED_EXPORT_TYPES` - A list of export types, which connector will try to ingest data for. Possible values are: `DAILY`, `FRESH_DAILY`, `INTRADAY`, `USERS` and `PSEUDONYMOUS_USERS`.

```sqlsyntax
CALL UPDATE_INGESTION_OPTIONS(
    PROPERTY_IDS => ['<property_1>', '<property_2>'],
    EXCLUDE_NULLS => <boolean>,
    ENABLED_EXPORT_TYPES => ['DAILY', 'FRESH_DAILY' 'INTRADAY', 'USERS', 'PSEUDONYMOUS_USERS']
 );
```

> **Note:**
>
> To leave an ingestion option unchanged, omit the argument from the
> `UPDATE_INGESTION_OPTIONS` procedure call.

## Refreshing flattened views on demand

You can use the `REFRESH_VIEWS` procedure defined in the `PUBLIC` schema to trigger an on-demand refresh of the flattened views.
The flattened views are refreshed automatically daily by default.
For more details about views see [Accessing data ingested by Snowflake Connector for Google Analytics Raw Data](gard-connector-accessing-data.md).

```sqlsyntax
CALL REFRESH_VIEWS();
```

---
title: Managing, updating, and uninstalling the Snowflake Connector for ServiceNow®
source: https://docs.snowflake.com/en/connectors/servicenow/managing.md
section: Connectors & Drivers
---

# Managing, updating, and uninstalling the Snowflake Connector for ServiceNow®

This topic and its sections describe typical tasks you might need to perform after installing and configuring the connector.

## Pausing and resuming the Snowflake Connector for ServiceNow®

The following sections describe how to pause and resume the connector.

### Pausing the Snowflake Connector for ServiceNow®

To stop all tasks started by the connector, call the `PAUSE_CONNECTOR` stored procedure:

```sqlsyntax
CALL PAUSE_CONNECTOR();
```

Pausing the connector disables interaction with it (e.g. enabling/disabling tables or configuring the connector) until the connector is resumed by calling the `RESUME_CONNECTOR` stored procedure.

Pausing the connector also stops any cost generation for the connector.

### Resuming the Snowflake Connector for ServiceNow®

To resume all tasks stopped by `PAUSE_CONNECTOR` stored procedure, call `RESUME_CONNECTOR` stored procedure:

```sqlsyntax
CALL RESUME_CONNECTOR();
```

## Changing the warehouse used by the connector

If you want to change the warehouse used by the connector or add a dedicated warehouse, do this by calling:

> ```sqlsyntax
> CALL UPDATE_WAREHOUSE('<warehouse_name>');
> ```

Where:

`warehouse_name`
:   Specifies the name of the warehouse that the connector should use.

> **Note:**
>
> Before configuring the connector to use a different warehouse, verify that the
> [connector application](installing-sql.md)
> has the USAGE privilege for the new warehouse.
>
> Additionally, the connector has to be in the `paused` state. See pausing the connector.

## Deleting tables

To delete table state data (including configuration, statistics, internal connector data, related tasks) and not display the table in the
[views for monitoring the connector](monitoring.md), use the following procedure:

> ```sqlsyntax
> CALL DELETE_TABLE('<table_name>', <drop_related_objects>);
> ```

Where:

`table_name`
:   Specifies the name of the table to be deleted. This table must be [disabled](ingestion.md)
    and not in the process of [reloading](ingestion.md).

`drop_related_objects` *(optional)*
:   Specifies whether to drop the related objects. If set to `true`, the procedure also drops all objects created for this table in the destination database,
    including views, raw data and event log tables. If set to `false`, the table state is dropped, but the related objects remain intact.

> **Note:**
>
> By default, the `DELETE_TABLE` procedure does not remove the objects created for this table in the destination database that contain the ServiceNow® data in Snowflake
> (such as [raw data table](accessing-data.md), [event logs table](accessing-data.md),
> and [flattened views](accessing-data.md)). You can either provide the `drop_related_objects` parameter or drop these objects manually.
>
> To drop these elements manually, you must first transfer the ownership of them from the connector using a role with `MANAGE GRANTS` privilege. For example:
>
> ```sqlsyntax
> USE ROLE ACCOUNTADMIN;
> GRANT OWNERSHIP ON TABLE <destination_database>.<destination_schema>.<table_name> TO ROLE ACCOUNTADMIN REVOKE CURRENT GRANTS;
> DROP TABLE <destination_database>.<destination_schema>.<table_name>;
> ```

## Updating the refresh token used by the connector

If you set up the connector with OAuth authentication, you must update the refresh token regularly. Otherwise, once the token expires,
the connector cannot access ServiceNow® anymore. By default, the token expires 90 days after its generation.

If you configure [email alerts](monitoring.md) for the connector,
you get a reminder to update the refresh token on the first day of each month. If the token expires, you get an email
once the connector encounters issues accessing ServiceNow®.

### Updating the refresh token for the connector installed using Snowsight

To update the refresh token if the connector was [installed using Snowsight](installing-snowsight.md), do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.

   > **Note:**
   >
   > Make sure that the Snowsight URL you are using matches Snowsight URL that was used when OAuth redirect URL was
   > configured. That is, if OAuth redirect URL set in ServiceNow® was provided by Snowsight accessed via Private Link,
   > you should be signed in to Snowsight via Private Link to refresh the token. Similarly, when redirect URL was
   > configured with publicly accessible Snowsight URL, the refresh should be done by Snowsight accessible with public
   > URL.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow, then select the tile for the connector.
4. In the top navigation menu select Settings » Authentication » Reauthenticate.

   > > **Note:**
   > >
   > > Make sure you are logged in to ServiceNow® as the same user the connector was initially configured with.
   > > You can check the currently logged in user in the upper right corner of the dialog.
5. To confirm that you allow the connector to connect to your ServiceNow® account, select Allow in the dialog.
   The refresh token is now updated.

To learn how to update the refresh token using SQL commands, refer to Updating the refresh token using SQL commands.

### Updating the refresh token using SQL commands

To update the refresh token using SQL commands do the following:

1. Get a new [OAuth refresh token](installing-sql.md).
   Make sure you use the same `client_id`, `client_secret` and user credentials that the connector is using at the moment.
2. Find out the fully qualified name of the secret object by querying the [CONNECTOR_CONFIGURATION view](monitoring.md):

   > ```sqlexample
   > SELECT value FROM connector_configuration WHERE config_key = 'secret';
   > ```
3. Update the secret object by running the [ALTER SECRET](../../sql-reference/sql/alter-secret.md) commands, changing the following parameters:

   * Set `OAUTH_REFRESH_TOKEN` to the OAuth refresh token that you retrieved in the first step.
   * Set `OAUTH_REFRESH_TOKEN_EXPIRY_TIME` to the refresh token expiration timestamp in UTC timezone. You can calculate
     this by adding the refresh token lifespan from ServiceNow® to the date when the token was issued. By default, the
     token expires in 100 days.

   For example, to update the `secretsdb.apiauth.servicenow_creds_oauth_code` secret, run the following command:

   ```sqlexample
   ALTER SECRET secretsdb.apiauth.servicenow_creds_oauth_code SET OAUTH_REFRESH_TOKEN = '34n;vods4nQsdg09wee4qnfvadH', OAUTH_REFRESH_TOKEN_EXPIRY_TIME = '2022-01-06 20:00:00';
   ```

   > **Note:**
   >
   > To update the secret, you must use the role with OWNERSHIP privilege.
   >
   > * If you [installed the connector using Snowsight](installing-snowsight.md), the role is ACCOUNTADMIN.
   > * If you [installed the connector using SQL commands](installing-sql.md), the role is secretadmin.

## Updating the ServiceNow® password for basic authentication

To update the password you need to find an existing secret and modify it using the [ALTER SECRET](../../sql-reference/sql/alter-secret.md) command.

1. Determine the fully qualified name of the secret object using either Snowsight or SQL command.

   > 1. To get a secret using Snowsight, do the following:
   >
   >    > 1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the `ACCOUNTADMIN` role.
   >    > 2. In the navigation menu, select Catalog » Apps.
   >    > 3. Search for the Snowflake Connector for ServiceNow, then select the tile for the connector.
   >    > 4. In the top navigation menu select Settings » Authentication.
   >    >
   >    >    The Authentication section shows the secret object, for example: `CONNECTORS_UI.SERVICENOW_GZSTZTP0KHD.SECRET`.
   > 2. To get the fully qualified name of the secret object using SQL command, query the [CONNECTOR_CONFIGURATION view](monitoring.md):
   >
   >    > ```sqlexample
   >    > SELECT value FROM connector_configuration WHERE config_key = 'secret';
   >    > ```
2. Pause the connector.
3. Update the secret object by running the [ALTER SECRET](../../sql-reference/sql/alter-secret.md) command, changing the `PASSWORD` parameter.
4. Resume the connector.

   > The password is now updated and used by the connector.

> **Note:**
>
> Similar to changing the password, you have the option to update the username using [ALTER SECRET](../../sql-reference/sql/alter-secret.md) command. Simply set the `USERNAME` parameter to the new username.
> Before changing the username, ensure that the new username has, at the very least, the same privileges as the previous one, otherwise, the connector may not function properly.

## Updating the connection to ServiceNow® instance

It’s possible to update the connection to ServiceNow® instance. It allows to change External Access Integration and Secret
used by the connector. It also allows to fix the issue when the Secret was detached from the External Access UDF in the
connector.

The connection configuration can be updated with the following procedure:

```sqlsyntax
CALL UPDATE_CONNECTION_CONFIGURATION({
  'service_now_url': '<servicenow_base_url>',
  'secret': '<secret_name>',
  'external_access_integration': '<external_access_integration_name>'
});
```

Where:

> `servicenow_base_url`
> :   Specifies the URL of the ServiceNow® instance that the connector should use. The URL must be set to the same value
>     as during connector installation and should be in the following format:
>
>     ```none
>     https://<servicenow_instance_name>.service-now.com
>     ```
>
>     Change of the ServiceNow® instance URL is not supported.
>
> `secret_name`
> :   Specifies the fully qualified name of the
>     [secret object containing the credentials for authenticating to ServiceNow®](installing-sql.md).
>
>     You must specify the fully qualified name of the secret object in the following format:
>
>     ```none
>     <database_name>.<schema_name>.<secret_name>
>     ```
>
>     The names of the database, schema, and secret must be valid [object identifiers](../../sql-reference/identifiers-syntax.md).
>
> `external_access_integration_name`
> :   Specifies the name of the
>     [external access integration for ServiceNow®](installing-sql.md).
>
>     The name of the integration must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).

For example, to update the connection to a ServiceNow® instance that:

* Has the URL `https://myinstance.service-now.com`.
* Uses the secret stored in `secretsdb.apiauth.servicenow_creds_oauth_code`.
* Uses the external access integration named `servicenow_external_access_integration`.

Run the following command:

```sqlsyntax
CALL UPDATE_CONNECTION_CONFIGURATION({
  'service_now_url': 'https://myinstance.service-now.com',
  'secret': 'SECRETSDB.APIAUTH.SERVICENOW_CREDS_OAUTH_CODE',
  'external_access_integration': 'SERVICENOW_API_INTEGRATION'
});
```

The update of the configuration can be performed only by a user with a grant to the `ADMIN` application role. Additionally,
to run the procedure the connector has to be in the `paused` state. See pausing the connector.

## Exporting the connector state

It’s possible to export a snapshot of the current connector state and configuration. The snapshot with the exported connector state is useful when
reinstalling the connector to preserve already enabled tables and the ingestion state, or when replication of the
destination schema to the failover region is configured to aid with disaster recovery.

The state can be exported with the following procedure:

> ```sqlsyntax
> CALL EXPORT_CONNECTOR_STATE();
> ```

The procedure creates a new `__CONNECTOR_STATE_EXPORT` table in the destination schema with an exported state. To perform an
export the following conditions must be met:

* The export can be performed only by a user with a grant to the `ADMIN` application role.
* There is no ongoing table [reload](ingestion.md).

> **Note:**
>
> The `__CONNECTOR_STATE_EXPORT` table contains all information necessary to restore the connector state during reinstallation,
> but it’s worth noting some information is missing:
>
> * The destination database and destination schema, warehouse, Data Reader role, ServiceNow URL, Secret object, External Access
>   Integration and name of journal table (if configured) aren’t exported. This information must be provided again when reinstalling the connector.
>   This can be used as an opportunity to e.g. change Secret object or name of journal table used by the Connector, provided that
>   after the reinstallation the same ServiceNow instance and destination schema will be used.
> * For each ingested table and ingestion mode only the newest ingestion state is exported. As a result after the connector state import,
>   historical data and statistics won’t be available.

Configuration is also exported automatically each time the connector triggers the ingestion according to [configured schedule](ingestion.md),
provided the following conditions are met:

* At least one table is enabled for ingestion.
* There is no ongoing table reload.

## Uninstalling the application

This section explains how to uninstall the application with Snowsight and with worksheets and how to remove
the objects created by the connector, but which need to be intentionally removed by the user.

### Uninstalling the application using Snowsight

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow, then select the three-dot menu to open the contextual view and select Uninstall.
4. If all the ingested data staying in the destination database should be preserved, choose Transfer object ownership to another role
   and choose the role which should be granted ownership on all the objects owned by the application. Otherwise, select
   Delete all objects to remove all the data.

   > **Note:**
   >
   > Too see what objects would be transferred to the selected role (or removed), expand Show objects menu.
5. Select Uninstall to confirm the changes. The connector’s application is now uninstalled.

### Uninstalling the application using worksheets

Data ingested by the connector remains in the selected destination database and schema, which are owned by the role
used for connector’s installation (usually it will be ACCOUNTADMIN). However, all sink tables and views containing your ServiceNow® data within the destination schema
are owned by the Snowflake Connector for ServiceNow® application. Therefore if you uninstall the connector before transferring the ownership of these tables and views
to an account role, they will be deleted as well.

> **Note:**
>
> If you do not want data to be deleted along with the connector, transfer the ownership of all tables and views in the destination schema to an account role and revoke current grants from the application.
>
> To prevent disruption in existing pipelines using ingested data, we recommend that all pipelines use dedicated Data Owner role to access the data, to which the ownership should be temporarily transferred.
>
> If you have granted additional privileges on the tables and views in the destination schema that your pipelines rely on, you can run the ownership transfer query below with COPY CURRENT GRANTS clause instead of REVOKE CURRENT GRANTS to keep these grants.

To transfer ownership of all tables and views in the destination schema to an account role, run the following queries:

> ```sqlsyntax
> USE ROLE ACCOUNTADMIN;
>
> GRANT OWNERSHIP ON ALL TABLES IN SCHEMA <destination_database>.<destination_schema>
> TO ROLE <account_role>
> REVOKE CURRENT GRANTS;
>
> GRANT OWNERSHIP ON ALL VIEWS IN SCHEMA <destination_database>.<destination_schema>
> TO ROLE <account_role>
> REVOKE CURRENT GRANTS;
> ```

To ensure the connector does not own any objects you do not want removed, run the following query:

> ```sqlsyntax
> SHOW OBJECTS OWNED BY APPLICATION <application_name>;
> ```

Finally, to drop the connector application, run the following query:

> ```sqlsyntax
> DROP APPLICATION <application_name>;
> ```

> **Warning:**
>
> If you have decided not to transfer the ownership of tables and views in the destination schema away from the connector, you can run this query to drop them alongside the connector instead:
>
> > ```sqlsyntax
> > DROP APPLICATION <application_name> CASCADE;
> > ```

### Deleting the objects created during the installation

Removing the connector database does not delete the ingested data that is stored in a separate database or the
objects that were created during the installation performed using Snowsight.

To see objects created during the installation, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow, then select it.
4. In the top navigation menu select Settings » Authentication.

   Secret, External Access Integration and Security Integration are the objects that you need to remove manually.
   In addition to these objects, there might be also a Network Rule object staying in the same schema as the secret (if the application was installed using Snowsight).

   > **Warning:**
   >
   > Secret’s and network rule’s database and schema can also be dropped. However, be careful if you are also using other Snowflake’s connectors,
   > for example Snowflake Connector for Google Analytics Raw Data. Objects of such application might also be located in the same database.

To delete those objects run the [DROP <object>](../../sql-reference/sql/drop.md) command.

For example, to delete the secret, run the [DROP SECRET](../../sql-reference/sql/drop-secret.md) statement.

## Upgrading the connector

The connector is upgraded automatically meaning that the user does not need to perform any action in order to have an up to date application.

## Scaling the connector

If there’s many ServiceNow® tables to be ingested and you want to increase the number of concurrently ingested tables, you can change the parameter by:

> ```sqlsyntax
> CALL CONFIGURE_CONCURRENCY(<number>);
> ```

Where:

`number`
:   Specifies the maximum number of workers able to ingest tables concurrently. By default this value is set to 10.

    Tables with continuous schedule use separate pool of dedicated workers and are not counted towards the concurrency limit.

Increasing the concurrency should be considered along with changing the size of the warehouse used for data ingestion.
If you are experiencing any slowdowns, try resizing the warehouse. See [Working with warehouses](../../user-guide/warehouses-tasks.md) for more information.

> **Warning:**
>
> Increasing the concurrency may result in an overloaded ServiceNow® instance, which will result in overall lower performance
> and ingestion errors. Compare connector’s performance and stability before and after any scaling changes to find the best parameters.

## Reinstalling the connector with the same database and schema for ServiceNow® data

To reinstall the connector, follow the process below:

1. Query the `TABLES_STATE` view and verify that none of the tables is currently in `RELOADING` status:

   ```sqlexample
   SELECT TABLE_NAME FROM TABLES_STATE WHERE STATUS = 'RELOADING';
   ```
2. If any tables are currently reloading, wait for reloads to complete or [cancel them](ingestion.md).
3. Stop the connector by calling the following stored procedure:

   ```sqlexample
   CALL PAUSE_CONNECTOR();
   ```
4. Export connector’s state and configuration. See this section for details.

   > **Important:**
   >
   > It’s recommended to export a connector’s state and configuration **before** dropping the connector application.
   > This will allow the preservation of all custom options (for example, tables enabled for synchronization and their schedules) and state
   > with their most fresh changes in the new installation.
   >
   > If you have already removed the connector but left the database and schema containing the ingested data intact, you can still reinstall the connector
   > relying on the automatically exported state but the reinstalled connector might repeat ingestion of some records.
5. Remove the connector, transferring the ownership of tables and views in the destination schema.
6. Reinstall the connector by:

   > * [reinstalling the connector using Snowsight](installing-snowsight.md) (preferred).
   > * [reinstalling the connector using SQL](installing-sql.md).

During the installation process:

* Provide the previously used database and schema.

  After installation the connector will detect that database and schema contain ingested data and will continue the ingestion
  from the place it was left before reinstallation. If you exported connector state and it’s successfully imported during
  installation, previously ingested tables will be automatically enabled with the same schedules and configuration. Otherwise,
  you will have to manually enable all tables and e.g. configure their schedules to restore the ingestion.

  When reinstalling with SQL commands, remember to transfer the ownership of views and tables in the destination schema
  to the connector, as described in the SQL installation guide. Otherwise the connector will not have access to these tables
  and views, preventing it from resuming the ingestion.
* Provide the same ServiceNow® instance name.
* For the other arguments, you can reuse [the objects that you created when installing the connector](installing-sql.md), or you can use new objects.

---
title: Monitor the Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/monitor.md
section: Connectors & Drivers
---

# Monitor the Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes how to monitor the state of the Snowflake Connector for SharePoint.

## Views

To monitor the state of the Snowflake Connector for SharePoint and troubleshoot problems, view
the connector configuration, error messages and statistics detailed the following views, which are defined in
the PUBLIC schema of the connector application.

| View | Description |
| --- | --- |
| `APP_PROPERTIES` | Provides information to the user interface about properties supported by the Snowflake Connector for SharePoint |
| `CONNECTOR_CONFIGURATION` | Provides a list of the values for configuration settings used by the connector. |
| `CONNECTOR_ERRORS` | Provides access to errors that have occurred during data ingestion. |
| `SYNC_STATUS` | Provides the general status of the connector and the ingestion process, and includes states:   * `PAUSED`: The connector is currently paused or resuming and no ingestion of any data is currently ongoing. * `SYNCING_DATA`: The connector is actively ingesting data. * `LAST_SYNCED`: Ingestion for at least one table has completed.   The timestamp of the most recently completed ingestion is provided in the `LAST_SYNCED_AT` column. |

### Examples

> ```sqlexample
> SELECT * FROM <APPLICATION_INSTANCE_DATABASE>.PUBLIC.CONNECTOR_CONFIGURATION;
> SELECT * FROM <APPLICATION_INSTANCE_DATABASE>.PUBLIC.CONNECTOR_ERRORS;
> SELECT * FROM <APPLICATION_INSTANCE_DATABASE>.PUBLIC.SYNC_STATUS;
> ```

Note that all the timestamps displayed in these views are provided in the UTC timezone with no offset.

### Required roles

The following roles have access to the described views:

* The owner of the connector application (typically the ACCOUNTADMIN system role).
* Any role with ADMIN or VIEWER application role granted.

---
title: Monitoring the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-monitoring.md
section: Connectors & Drivers
---

# Monitoring the Snowflake Connector for Google Analytics Aggregate Data

This section describes how to monitor the state of the Snowflake Connector for Google Analytics Aggregate Data and troubleshoot problems.

## About monitoring the connector

To monitor the state of the Snowflake Connector for Google Analytics Aggregate Data and troubleshoot problems, you can use the following database objects to access
the connector’s configuration, error messages, and statistics:

| Name | Description |
| --- | --- |
| `PUBLIC.REPORT_LIST_VIEW` | The list of currently configured reports, including information about the most recent ingestion:   * Report unique ID and name * Google Analytics property ID * Refresh interval * Information about the most recent ingestion |
| `PUBLIC.CONNECTOR_STATS` | Data about finished ingestion runs:   * Resource ingestion definition id: unique report ID * Ingestion configuration id: always `STANDARD` * Ingestion process id: always `NULL` * Resource name: report name * Started at: start of a single ingestion run * Updated at: last update time * Completed at: end of a single ingestion run * Status: status of ingestion run; can be `COMPLETED`, `FAILED` or `CANCELED` * Ingested rows: how many rows were fetched during the ingestion * Duration in seconds: duration of ingestion (difference between *started at* and *completed at*, in seconds) * Throughput in rows per seconds: number of ingested rows divided by duration |
| `PUBLIC.CONNECTOR_ERRORS` | Stored application error logs:   * Code: error code * Message: readable message as a string * Created_at: timestamp when log was created * context: payload that defines the error context |

These objects can be accessed by the `ADMIN` and `VIEWER` application roles defined by the connector.

---
title: Monitoring the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-monitoring.md
section: Connectors & Drivers
---

# Monitoring the Snowflake Connector for Google Analytics Raw Data

This topic describes how to monitor the state of the Snowflake Connector for Google Analytics Raw Data and troubleshoot problems.

## About monitoring the connector

To monitor the state of the Snowflake Connector for Google Analytics Raw Data and troubleshoot problems, you can access
the connector configuration, error messages and statistics through the following views, which are defined
in the `PUBLIC` schema in the database that serves an instance of the connector:

| View Name | Description |
| --- | --- |
| `CONNECTOR_CONFIGURATION` | The parameters used to configure the connector such as:   * Destination database and schema for ingested data. * Data owner role. * Warehouse used by the connector. * Secret used for authentication. * Dispatcher schedule. * Number of worker tasks. * Object names of the external access integration, security integration and secret used by the Connector. |
| `CONNECTOR_ERRORS` | Log of errors that occurred during the connector’s work. |
| `CONNECTOR_STATS` | Log of all the connector’s attempt to retrieve data, detailing:   * Source BigQuery table and Snowflake destination table. * Start and end time of the attempt. * Status of the attempt. Possible values include:  + `IN_PROGRESS` - data ingestion is currently running.   + `COMPLETED` - data ingestion has successfully finished.   + `FAILED` - data ingestion failed and is being retried.   + `CANCELLED` - data ingestion was terminated and is being retried.   + `DATA_NOT_FOUND` - at the time of the ingestion attempt, the related Google Analytics daily table was not visible in BigQuery. * For successful attempts, the total number of records retrieved, the time it took to retrieve them, and the average throughput. * Errors encountered during the attempt, if any. |
| `ENABLED_PROPERTIES` | List of currently enabled Google Analytics properties. |

To see the full list of columns in each view, use the [DESCRIBE VIEW](../../../sql-reference/sql/desc-view.md) command.

After the connector installation, only the ACCOUNTADMIN system role has access to these views. To change it, grant the appropriate permission to the other roles.

---
title: Monitoring the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/monitor.md
section: Connectors & Drivers
---

# Monitoring the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The following sections describe how to monitor the connector by querying views and examining log files:

* Viewing general information about the connector
* Viewing data sources
* Viewing the replication state of data sources
* Viewing the replication state of source tables
* Viewing table schema version history
* Viewing connector metrics
* Viewing aggregated connector metrics
* Viewing experimental views
* Viewing the connector audit log view
* Viewing the agent audit log view
* Viewing the connector logs
* Viewing the agent logs

## Viewing general information about the connector

To view general information about the connector, run [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command:

> ```sqlsyntax
> DESCRIBE APPLICATION <app_db_name>;
> ```
>
> Where:
>
> > `app_db_name`
> > :   Specifies the name of the connector database.

To view more specific information about the connector, query the `PUBLIC.CONNECTOR_CONFIGURATION` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.CONNECTOR_CONFIGURATION;
> ```

The `PUBLIC.CONNECTOR_CONFIGURATION` view displays a row for each parameter configured for the connector.

The following table describes these parameters:

| Parameter | Description |
| --- | --- |
| alertingLogsView | If you [enabled email notifications](email-notifications.md), this specifies the name of [the view that provides access to the event table](email-notifications.md). |
| alertingNotificationIntegration | If you [enabled email notifications](email-notifications.md), this specifies the name of the notification integration object used for email notifications. |
| alertingRecipients | If you [enabled email notifications](email-notifications.md), this specifies the list of email addresses (separated by commas) that can receive email notifications from the connector. |
| alertingSchedule | If you [enabled email notifications](email-notifications.md), this specifies the schedule or frequency at which the connector should check for errors and send a notification. |
| operational_warehouse | Name of the operational warehouse used by the connector. |
| warehouse | Name of the compute warehouse for merging data. |

## Viewing data sources

To view information about data sources, query the `PUBLIC.DATA_SOURCES` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.DATA_SOURCES;
> ```

The `PUBLIC.DATA_SOURCES` view displays a row for each data source configured for the connector. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the data source. |
| SCHEDULE | VARCHAR | Schedule for running the replication. Displays NULL if scheduled replication of that data source is disabled. |
| DESTINATION_DB_NAME | VARCHAR | Name of the destination database. |

## Viewing the replication state of data sources

To view the current replication state of data sources, query the `PUBLIC.DATA_SOURCE_REPLICATION_STATE` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.DATA_SOURCE_REPLICATION_STATE;
> ```

The `PUBLIC.DATA_SOURCE_REPLICATION_STATE` view displays a row for each data source configured in the connector. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the data source. |
| TABLES_ADDED_COUNT | NUMBER | Numbers of tables actively replicated in this data source. This number does not include tables for which the replication failed permanently. |
| CONNECTED_AGENT_ID | VARCHAR | ID of the agent application assigned to the data source. |
| SCHEDULE | VARCHAR | Schedule for running the replication. Displays NULL if scheduled replication of that data source is disabled. |
| REPLICATION_STATUS | VARCHAR | Replication status of the data source. Possible values:   * `WAITING` * `ONGOING` |
| PREVIOUS_SCHEDULED_RUN_STATUS | VARCHAR | Status of previous scheduled replication. Displays NULL if scheduled replication of that data source is disabled. Possible values:   * `DONE` * `WARNING` |
| PREVIOUS_RUN_FINISHED_AT | TIMESTAMP_NTZ | Timestamp of the end of last scheduled replication. Displays NULL if scheduled replication of that data source is disabled. |

## Viewing the replication state of source tables

To view the current replication state of each source table, query the `PUBLIC.REPLICATION_STATE` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.REPLICATION_STATE;
> ```

The `PUBLIC.REPLICATION_STATE` view displays a row for each source table. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source that contains the source table |
| SCHEMA_NAME | VARCHAR | Name of the schema of the source table |
| TABLE_NAME | VARCHAR | Name of the source table |
| REPLICATION_PHASE | VARCHAR | Current replication phase. Possible values:   * `SCHEMA_INTROSPECTION` * `INITIAL_LOAD` * `INCREMENTAL_LOAD`   For descriptions of each status, see Understanding replication phases. |
| SCHEMA_INTROSPECTION_STATUS | VARCHAR | Current schema introspection status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |
| SNAPSHOT_REPLICATION_STATUS | VARCHAR | Current snapshot replication status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |
| INCREMENTAL_REPLICATION_STATUS | VARCHAR | Current incremental replication status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |

### Understanding replication phases

Replication of each of the source tables can be in the following replication phases:

| Replication Phase | Description |
| --- | --- |
| `SCHEMA_INTROSPECTION` | Schema of the source table is being checked. Once this phase is done the destination table is created. |
| `INITIAL_LOAD` | The connector is processing the snapshot load for the source table. |
| `INCREMENTAL_LOAD` | Initial load is done, data is being replicated using change data capture process. |

> **Note:**
>
> You can start FAILED replications from the beginning by removing table from replication and adding it again as described in [Configuring replication for the Snowflake Connector for MySQL](configure-replication.md).

## Viewing table schema version history

To view the history of table schema changes, query the `PUBLIC.SCHEMA_CHANGE_HISTORY` view using a command similar to:

> ```sqlsyntax
> SELECT * FROM PUBLIC.SCHEMA_CHANGE_HISTORY;
> ```

The `PUBLIC.SCHEMA_CHANGE_HISTORY` view displays one or two rows for each table’s valid schema version.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATA_SOURCE_NAME | VARCHAR | Source table data source name. |
| SCHEMA_NAME | VARCHAR | Source table schema name. |
| TABLE_NAME | VARCHAR | Source table name. |
| VERSION | INTEGER | Schema version identifier, initially 0, and incremented by 1 with each schema change. Numbering restarts at zero if the table is removed and later re-added. |
| STATE | VARCHAR | one of:  * ACCEPTED: schema change is valid, but has yet to be applied to the destination table. * APPLIED: schema change has already been applied to the destination table.  Initially, at the start of the replication, contains only a single row with the value APPLIED. After subsequent valid schema changes will include two rows - one with state=ACCEPTED and one with state=APPLIED. |
| SOURCE_SCHEMA | VARIANT | JSON describing the schema of the source table. |
| DESTINATION_TABLE_SCHEMA | VARIANT | JSON describing the schema of the destination table after this schema version is applied. |
| INSERTED_AT | TIMESTAMP_NTZ | UTC timestamp when this record was inserted. |

## Viewing connector metrics

To view the connector replication metrics, query the `PUBLIC.CONNECTOR_STATS` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.CONNECTOR_STATS;
> ```

The `PUBLIC.CONNECTOR_STATS` view displays a row for each periodic merge of data into destination table during incremental load replication phase.

> **Note:**
>
> The first run for a given table in this view will be longer and larger than a typical later run. This is due to the fact that the connector gathers incremental updates to tables during the initial load phase, but processes them only after the whole table has been replicated.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RESOURCE_INGESTION_DEFINITION_ID | VARCHAR | Identifier of a replicated table constructed from data source name, schema name and table name. |
| INGESTION_CONFIGURATION_ID | VARCHAR | Internal column for future integrations. |
| INGESTION_PROCESS_ID | VARCHAR | ID of the merge process. |
| INGESTION_DEFINITION_NAME | VARCHAR | Internal column for future integrations. |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source to which the table belongs. |
| SCHEMA_NAME | VARCHAR | Name of the table’s schema. |
| RESOURCE_NAME | VARCHAR | Table name. |
| STARTED_AT | TIMESTAMP_NTZ | Time when the first record of the batch of records merged to the destination table was read from source database. |
| STATUS | VARCHAR | Merge process status. Possible values:   * `FINISHED` * `FAILED` |
| INGESTED_ROWS | NUMBER | Number of rows merged in the batch |
| INGESTION_DURATION_S | NUMBER | Batch processing time in seconds calculated as difference between first record being observed and the batch of records being merged into the destination table. |
| NATIVE_APP_PROCESSING_DURATION_S | NUMBER | Duration in seconds of data processing on Snowflake side. |
| AGENT_PROCESSING_DURATION_S | NUMBER | Duration in seconds of data processing on agent side. |
| THROUGHPUT_RPS | NUMBER | Connector throughput in records per second (RPS). Takes into account the overall processing time. |
| NATIVE_APP_THROUGHPUT_RPS | NUMBER | Throughput of the data processing on Snowflake side in records per second (RPS). |

## Viewing aggregated connector metrics

To view the connector replication metrics, query the `PUBLIC.AGGREGATED_CONNECTOR_STATS` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AGGREGATED_CONNECTOR_STATS;
> ```

The `PUBLIC.AGGREGATED_CONNECTOR_STATS` view shows the metrics of the connector aggregated hourly. Additional columns with data source name, schema name and table name are provided for further aggregations and analysis.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATE | DATE | Date of the aggregate, hourly. |
| PROCESSED_ROWS_COUNT | NUMBER | Sum of rows ingested for the table during the aggregate time. |
| THROUGHPUT_RPS | NUMBER | Throughput for the table for the aggregate time in records per second (RPS). |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source to which the table belongs. |
| SCHEMA_NAME | VARCHAR | Name of the table’s schema. |
| SOURCE_TABLE_NAME | VARCHAR | Table name. |

## Viewing experimental views

The connector comes with a several additional views containing low-level information about the state of the connector and support state
change history tracking. These views are found in the `PUBLIC` schema with names that begin with the prefix `EXPERIMENTAL`.

The following table summarizes the currently available experimental views:

| View Name | Description |
| --- | --- |
| **EXPERIMENTAL_TABLE_REPLICATION_HISTORY** | A history of state changes for all enabled source tables in the connector. |
| **EXPERIMENTAL_DATA_SOURCE_REPLICATION_HISTORY** | A history of state changes for all configured data sources in the connector. |
| **EXPERIMENTAL_EVENTS_HISTORY** | A history of all events that occurred in the connector. |

> **Note:**
>
> Experimental views are subject to change and can be modified or removed in future connector releases.

## Viewing the connector audit log view

To view the audit log of user actions in the connector, query the `PUBLIC.AUDIT_LOG` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AUDIT_LOG;
> ```

The `PUBLIC.AUDIT_LOG` view displays a row for each user-initiated action recorded by the connector.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACTION_TIME | TIMESTAMP_NTZ | Time when the action happened. |
| ACTION_TYPE | VARCHAR | Action type. |
| PARAMETERS | VARIANT | Additional parameters of the action. |

Actions recorded in this view are:

> * Data source added
> * Table replication enabled
> * Table replication disabled
> * Scheduled replication enabled for data source
> * Scheduled replication disabled for data source

## Viewing the agent audit log view

To view the audit log of agent actions in the connector, query the `PUBLIC.AGENT_AUDIT_LOG` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AGENT_AUDIT_LOG;
> ```

The `PUBLIC.AGENT_AUDIT_LOG` view displays a row for each agent-reported action registered by the connector.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACTION_TIME | TIMESTAMP_NTZ | Time when the action happened. |
| ACTION_TYPE | VARCHAR | Action type. |
| PARAMETERS | VARIANT | Additional parameters of the action. |

Actions shown in this view are:

> * Agent assigned to data source
> * Agent unassigned from data source
> * Agent registered
> * Agent unregistered
> * Snapshot load started
> * Snapshot load finished
> * Snapshot load failed
> * Snapshot load terminated
> * Schema introspection succeeded
> * Schema introspection failed
> * Incremental load started
> * Incremental load stopped
> * Incremental load failed
> * Incremental load terminated
> * Schema change reported

## Viewing the connector logs

To view the connector logs, query the event table that you created while setting up the connector [log view](install-snowsight.md).

To view the audit log of agent actions in the connector, query the `PUBLIC.AGENT_AUDIT_LOG` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AGENT_AUDIT_LOG;
> ```

The `PUBLIC.AGENT_AUDIT_LOG` view displays a row for each agent-reported action registered by the connector.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACTION_TIME | TIMESTAMP_NTZ | Time when the action happened. |
| ACTION_TYPE | VARCHAR | Action type. |
| PARAMETERS | VARIANT | Additional parameters of the action. |

Actions shown in this view are:

> * Agent assigned to data source
> * Agent unassigned from data source
> * Agent registered
> * Agent unregistered
> * Snapshot load started
> * Snapshot load finished
> * Snapshot load failed
> * Snapshot load terminated
> * Schema introspection succeeded
> * Schema introspection failed
> * Incremental load started
> * Incremental load stopped
> * Incremental load failed
> * Incremental load terminated
> * Schema change reported

## Viewing the agent logs

> When the agent is running, it periodically sends logs to Snowflake. These logs are available in the `AGENT_LOGS` view
> and can be retrieved using the following query:
>
> > ```sqlsyntax
> > SELECT * FROM PUBLIC.AGENT_LOGS;
> > ```

## Next steps

If required, and after completing these procedures, review the steps in [Troubleshooting the Snowflake Connector for MySQL](troubleshoot.md).

---
title: Monitoring the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/monitor.md
section: Connectors & Drivers
---

# Monitoring the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The following sections describe how to monitor the connector by querying views and examining log files:

* Viewing general information about the connector
* Viewing data sources
* Viewing the replication state of data sources
* Viewing the replication state of source tables
* Viewing table schema version history
* Viewing connector metrics
* Viewing aggregated connector metrics
* Viewing experimental views
* Viewing the connector audit log view
* Viewing the agent audit log view
* Viewing the connector logs
* Viewing the agent logs

## Viewing general information about the connector

To view general information about the connector, run [DESCRIBE APPLICATION](../../sql-reference/sql/desc-application.md) command:

> ```sqlsyntax
> DESCRIBE APPLICATION <app_db_name>;
> ```
>
> Where:
>
> > `app_db_name`
> > :   Specifies the name of the connector database.

To view more specific information about the connector, query the `PUBLIC.CONNECTOR_CONFIGURATION` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.CONNECTOR_CONFIGURATION;
> ```

The `PUBLIC.CONNECTOR_CONFIGURATION` view displays a row for each parameter configured for the connector.

The following table describes these parameters:

| Parameter | Description |
| --- | --- |
| alertingLogsView | If you [enabled email notifications](email-notifications.md), this specifies the name of [the view that provides access to the event table](email-notifications.md). |
| alertingNotificationIntegration | If you [enabled email notifications](email-notifications.md), this specifies the name of the notification integration object used for email notifications. |
| alertingRecipients | If you [enabled email notifications](email-notifications.md), this specifies the list of email addresses (separated by commas) that can receive email notifications from the connector. |
| alertingSchedule | If you [enabled email notifications](email-notifications.md), this specifies the schedule or frequency at which the connector should check for errors and send a notification. |
| operational_warehouse | Name of the operational warehouse used by the connector. |
| warehouse | Name of the compute warehouse for merging data. |

## Viewing data sources

To view information about data sources, query the `PUBLIC.DATA_SOURCES` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.DATA_SOURCES;
> ```

The `PUBLIC.DATA_SOURCES` view displays a row for each data source configured for the connector. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the data source. |
| SCHEDULE | VARCHAR | Schedule for running the replication. Displays NULL if scheduled replication of that data source is disabled. |
| DESTINATION_DB_NAME | VARCHAR | Name of the destination database. |

## Viewing the replication state of data sources

To view the current replication state of data sources, query the `PUBLIC.DATA_SOURCE_REPLICATION_STATE` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.DATA_SOURCE_REPLICATION_STATE;
> ```

The `PUBLIC.DATA_SOURCE_REPLICATION_STATE` view displays a row for each data source configured in the connector. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| NAME | VARCHAR | Name of the data source. |
| TABLES_ADDED_COUNT | NUMBER | Numbers of tables actively replicated in this data source. This number does not include tables for which the replication failed permanently. |
| CONNECTED_AGENT_ID | VARCHAR | ID of the agent application assigned to the data source. |
| SCHEDULE | VARCHAR | Schedule for running the replication. Displays NULL if scheduled replication of that data source is disabled. |
| REPLICATION_STATUS | VARCHAR | Replication status of the data source. Possible values:   * `WAITING` * `ONGOING` |
| PREVIOUS_SCHEDULED_RUN_STATUS | VARCHAR | Status of previous scheduled replication. Displays NULL if scheduled replication of that data source is disabled. Possible values:   * `DONE` * `WARNING` |
| PREVIOUS_RUN_FINISHED_AT | TIMESTAMP_NTZ | Timestamp of the end of last scheduled replication. Displays NULL if scheduled replication of that data source is disabled. |

## Viewing the replication state of source tables

To view the current replication state of each source table, query the `PUBLIC.REPLICATION_STATE` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.REPLICATION_STATE;
> ```

The `PUBLIC.REPLICATION_STATE` view displays a row for each source table. The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source that contains the source table |
| SCHEMA_NAME | VARCHAR | Name of the schema of the source table |
| TABLE_NAME | VARCHAR | Name of the source table |
| REPLICATION_PHASE | VARCHAR | Current replication phase. Possible values:   * `SCHEMA_INTROSPECTION` * `INITIAL_LOAD` * `INCREMENTAL_LOAD`   For descriptions of each status, see Understanding replication phases. |
| SCHEMA_INTROSPECTION_STATUS | VARCHAR | Current schema introspection status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |
| SNAPSHOT_REPLICATION_STATUS | VARCHAR | Current snapshot replication status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |
| INCREMENTAL_REPLICATION_STATUS | VARCHAR | Current incremental replication status. Possible values:   * `WAITING` * `IN_PROGRESS` * `DONE` * `RETRYING` * `FAILED` |

### Understanding replication phases

Replication of each of the source tables can be in the following replication phases:

| Replication Phase | Description |
| --- | --- |
| `SCHEMA_INTROSPECTION` | Schema of the source table is being checked. Once this phase is done the destination table is created. |
| `INITIAL_LOAD` | The connector is processing the snapshot load for the source table. |
| `INCREMENTAL_LOAD` | Initial load is done, data is being replicated using change data capture process. |

> **Note:**
>
> You can start FAILED replications from the beginning by removing table from replication and adding it again as described in [Configuring replication for the Snowflake Connector for PostgreSQL](configure-replication.md).

## Viewing table schema version history

To view the history of table schema changes, query the `PUBLIC.SCHEMA_CHANGE_HISTORY` view using a command similar to:

> ```sqlsyntax
> SELECT * FROM PUBLIC.SCHEMA_CHANGE_HISTORY;
> ```

The `PUBLIC.SCHEMA_CHANGE_HISTORY` view displays one or two rows for each table’s valid schema version.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATA_SOURCE_NAME | VARCHAR | Source table data source name. |
| SCHEMA_NAME | VARCHAR | Source table schema name. |
| TABLE_NAME | VARCHAR | Source table name. |
| VERSION | INTEGER | Schema version identifier, initially 0, and incremented by 1 with each schema change. Numbering restarts at zero if the table is removed and later re-added. |
| STATE | VARCHAR | one of:  * ACCEPTED: schema change is valid, but has yet to be applied to the destination table. * APPLIED: schema change has already been applied to the destination table.  Initially, at the start of the replication, contains only a single row with the value APPLIED. After subsequent valid schema changes will include two rows - one with state=ACCEPTED and one with state=APPLIED. |
| SOURCE_SCHEMA | VARIANT | JSON describing the schema of the source table. |
| DESTINATION_TABLE_SCHEMA | VARIANT | JSON describing the schema of the destination table after this schema version is applied. |
| INSERTED_AT | TIMESTAMP_NTZ | UTC timestamp when this record was inserted. |

## Viewing connector metrics

To view the connector replication metrics, query the `PUBLIC.CONNECTOR_STATS` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.CONNECTOR_STATS;
> ```

The `PUBLIC.CONNECTOR_STATS` view displays a row for each periodic merge of data into destination table during incremental load replication phase.

> **Note:**
>
> The first run for a given table in this view will be longer and larger than a typical later run. This is due to the fact that the connector gathers incremental updates to tables during the initial load phase, but processes them only after the whole table has been replicated.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RESOURCE_INGESTION_DEFINITION_ID | VARCHAR | Identifier of a replicated table constructed from data source name, schema name and table name. |
| INGESTION_CONFIGURATION_ID | VARCHAR | Internal column for future integrations. |
| INGESTION_PROCESS_ID | VARCHAR | ID of the merge process. |
| INGESTION_DEFINITION_NAME | VARCHAR | Internal column for future integrations. |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source to which the table belongs. |
| SCHEMA_NAME | VARCHAR | Name of the table’s schema. |
| RESOURCE_NAME | VARCHAR | Table name. |
| STARTED_AT | TIMESTAMP_NTZ | Time when the first record of the batch of records merged to the destination table was read from source database. |
| STATUS | VARCHAR | Merge process status. Possible values:   * `FINISHED` * `FAILED` |
| INGESTED_ROWS | NUMBER | Number of rows merged in the batch |
| INGESTION_DURATION_S | NUMBER | Batch processing time in seconds calculated as difference between first record being observed and the batch of records being merged into the destination table. |
| NATIVE_APP_PROCESSING_DURATION_S | NUMBER | Duration in seconds of data processing on Snowflake side. |
| AGENT_PROCESSING_DURATION_S | NUMBER | Duration in seconds of data processing on agent side. |
| THROUGHPUT_RPS | NUMBER | Connector throughput in records per second (RPS). Takes into account the overall processing time. |
| NATIVE_APP_THROUGHPUT_RPS | NUMBER | Throughput of the data processing on Snowflake side in records per second (RPS). |

## Viewing aggregated connector metrics

To view the connector replication metrics, query the `PUBLIC.AGGREGATED_CONNECTOR_STATS` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AGGREGATED_CONNECTOR_STATS;
> ```

The `PUBLIC.AGGREGATED_CONNECTOR_STATS` view shows the metrics of the connector aggregated hourly. Additional columns with data source name, schema name and table name are provided for further aggregations and analysis.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATE | DATE | Date of the aggregate, hourly. |
| PROCESSED_ROWS_COUNT | NUMBER | Sum of rows ingested for the table during the aggregate time. |
| THROUGHPUT_RPS | NUMBER | Throughput for the table for the aggregate time in records per second (RPS). |
| DATA_SOURCE_NAME | VARCHAR | Name of the data source to which the table belongs. |
| SCHEMA_NAME | VARCHAR | Name of the table’s schema. |
| SOURCE_TABLE_NAME | VARCHAR | Table name. |

## Viewing experimental views

The connector comes with a several additional views containing low-level information about the state of the connector and support state
change history tracking. These views are found in the `PUBLIC` schema with names that begin with the prefix `EXPERIMENTAL`.

The following table summarizes the currently available experimental views:

| View Name | Description |
| --- | --- |
| **EXPERIMENTAL_TABLE_REPLICATION_HISTORY** | A history of state changes for all enabled source tables in the connector. |
| **EXPERIMENTAL_DATA_SOURCE_REPLICATION_HISTORY** | A history of state changes for all configured data sources in the connector. |
| **EXPERIMENTAL_EVENTS_HISTORY** | A history of all events that occurred in the connector. |

> **Note:**
>
> Experimental views are subject to change and can be modified or removed in future connector releases.

## Viewing the connector audit log view

To view the audit log of user actions in the connector, query the `PUBLIC.AUDIT_LOG` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AUDIT_LOG;
> ```

The `PUBLIC.AUDIT_LOG` view displays a row for each user-initiated action recorded by the connector.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACTION_TIME | TIMESTAMP_NTZ | Time when the action happened. |
| ACTION_TYPE | VARCHAR | Action type. |
| PARAMETERS | VARIANT | Additional parameters of the action. |

Actions recorded in this view are:

> * Data source added
> * Table replication enabled
> * Table replication disabled
> * Scheduled replication enabled for data source
> * Scheduled replication disabled for data source

## Viewing the agent audit log view

To view the audit log of agent actions in the connector, query the `PUBLIC.AGENT_AUDIT_LOG` view:

> ```sqlsyntax
> SELECT * FROM PUBLIC.AGENT_AUDIT_LOG;
> ```

The `PUBLIC.AGENT_AUDIT_LOG` view displays a row for each agent-reported action registered by the connector.

The view consists of the following columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| ACTION_TIME | TIMESTAMP_NTZ | Time when the action happened. |
| ACTION_TYPE | VARCHAR | Action type. |
| PARAMETERS | VARIANT | Additional parameters of the action. |

Actions shown in this view are:

> * Agent assigned to data source
> * Agent unassigned from data source
> * Agent registered
> * Agent unregistered
> * Snapshot load started
> * Snapshot load finished
> * Snapshot load failed
> * Snapshot load terminated
> * Schema introspection succeeded
> * Schema introspection failed
> * Incremental load started
> * Incremental load stopped
> * Incremental load failed
> * Incremental load terminated
> * Schema change reported

## Viewing the connector logs

To view the connector logs, query the event table that you created while setting up the connector [log view](install-snowsight.md).

```sqlsyntax
SELECT * FROM <fully_qualified_event_table_name>
   WHERE RECORD_TYPE = 'LOG'
   AND RESOURCE_ATTRIBUTES:"snow.database.name" = '<app_db_name>';
```

Where:

`fully_qualified_event_table_name`
:   Specifies the fully qualified name of the event table.

`app_db_name`
:   Specifies the name of the connector database.

## Viewing the agent logs

> When the agent is running, it periodically sends logs to Snowflake. These logs are available in the `AGENT_LOGS` view
> and can be retrieved using the following query:
>
> > ```sqlsyntax
> > SELECT * FROM PUBLIC.AGENT_LOGS;
> > ```

## Next steps

If required, and after completing these procedures, review the steps in [Troubleshooting the Snowflake Connector for PostgreSQL](troubleshoot.md).

---
title: Monitoring the Snowflake Connector for ServiceNow®
source: https://docs.snowflake.com/en/connectors/servicenow/monitoring.md
section: Connectors & Drivers
---

# Monitoring the Snowflake Connector for ServiceNow®

This topic describes how to monitor the state of the Snowflake Connector for ServiceNow® and troubleshoot problems.

## About Monitoring the Connector

To monitor the state of the Snowflake Connector for ServiceNow® and troubleshoot problems, you can access
the connector configuration, error messages and statistics through the following views, which are defined
in the `PUBLIC` schema of [the connector application](installing-sql.md):

| View Name | Description |
| --- | --- |
| `AGGREGATED_CONNECTOR_STATS` | Provides access to information about a total number of rows updated by the connector (records inserted, modified and deleted) in each full hour. |
| `APP_PROPERTIES` | Provides information to the User Interface about properties supported by the Snowflake Connector for ServiceNow® |
| `CONFIGURED_TABLES` | Provides the list of ServiceNow® tables that have been configured. You can use this view to determine which tables are enabled for synchronization, their ingestion strategy, schedule and other ingestion options. |
| `CONNECTOR_CONFIGURATION` | Provides a list of the values of the configuration settings used by the connector. |
| `CONNECTOR_ERRORS` | Provides access to the errors that occurred during data ingestion. |
| `CONNECTOR_OVERVIEW` | Provides general information about the connector. |
| `CONNECTOR_STATS` | Provides statistics about the ongoing data ingestion process and the amount of data collected by the connector in each ingestion run. |
| `RELOADED_TABLES` | Provides information about the tables that are currently being reloaded. It combines the configuration values from the `CONFIGURED_TABLES` view with the manually provided reload options and gives the reload process overview. |
| `SYNC_STATUS` | Provides the general status of the connector and the ingestion process:   * `PAUSED` - the connector is currently paused or in the middle of resuming and no ingestion of any table is currently ongoing. * `NOT_SYNCING` - the connector is ready to ingest data but has not ingested any data yet. * `SYNCING_DATA` - the connector is ingesting data but there is no table for which ingestion has finished yet. * `LAST_SYNCED` - ingestion for at least one table has finished. The timestamp of the last finished ingestion is provided in LAST_SYNCED_AT column. |
| `TABLES_STATE` | Provides access to information about the tables that have ever been enabled for synchronization. This information includes:   * the status of the table - whether it is enabled, disabled or in the middle of reload. * the status of the last ingestion:    + `DONE` means that the fetched data is available in the sync table.   + `RUNNING` means that the download is in progress or the data is already fetched into the event log table but the sync table was not updated yet.   + `FAILED` means that the ingestion run was interrupted because of an error. This may result in only part of the data being downloaded. This won’t cause any data discrepancy, and depending on ingestion strategy some batches might be collected again.   + `DISABLED` means that given table was disabled in the middle of this ingestion run. * the timestamp of the last scheduled synchronization. * the page size used in the requests collecting data for the table. * the status of flattened views creation. * the timestamp of the last time the connector checked if flattened views for the table need to be recreated. |
| `WORKERS_STATE` | Provides access to information about currently ingested tables and when were the worker tasks assigned to them. |

Please note that all the timestamps displayed in the above views are provided in the UTC timezone with no offset, which may
differ from the timezone of the dates displayed by the ServiceNow instance.

The following roles have access to these views:

* The owner of [the connector application](installing-sql.md) (usually the ACCOUNTADMIN system role).
* Any role with ADMIN or VIEWER application role granted.

## Configuring Email Alerts

You can enable email alerts for the connector. The connector uses the
[Notification System Stored Procedure](../../user-guide/notifications/email-stored-procedures.md)
to send the email notifications. In order to configure alerts, [the connector must be installed first](installing-snowsight.md).
These email notifications include the number of errors encountered and the type of each error.

### Enabling Email Notifications Using Snowsight

To configure email alerts, navigate to the Snowflake Connector for ServiceNow® application in the Marketplace:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow®, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select the Settings tab in the upper bar then switch to the
   Email Alerts section from the list on the left.
5. Enter the following information in the dialog box:

   | Field | Description |
   | --- | --- |
   | Email Address | Single email address where alerts should be sent. You must specify an email address that is associated with the Snowflake account. |
   | Frequency | There are two possible values:  * Immediately - Errors are summarized and the report is sent as often as the lowest configured ingestion schedule. * Once per day - An email message with a summary of all errors is sent once a day at 12PM UTC. |

### Disabling Email Notifications Using Snowsight

To disable email alerts, navigate to the Snowflake Connector for ServiceNow® application in Marketplace:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow®, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select the Settings tab in the upper bar then switch to the
   Email Alerts section from the list on the left.
5. Select Stop receiving alerts, then confirm by selecting Stop receiving alerts again.

Under the hood, a [notification integration](../../user-guide/notifications/email-notifications.md) object, used to send email
alerts, is created. The name of this integration is the same as the name of
[the connector application](installing-sql.md)
with an added suffix `_NOTIFICATION_INTEGRATION`. The connector references this object by name.
Changing name of this object or dropping it causes the email alerts functionality to break.

### Enabling Email Notifications Using SQL

To configure email alerts, you must create a [notification integration](../../user-guide/notifications/email-notifications.md).

After creating the notification integration, you must grant USAGE on this integration to [the connector application](installing-sql.md).
For example, to grant the following privileges to the connector named `my_connector_servicenow`:

> ```sqlsyntax
> GRANT USAGE ON INTEGRATION <notification_integration_name> TO APPLICATION <connector_application>;
> ```

To configure and enable email alerts, call the `CONFIGURE_ALERTS` procedure:

> ```sqlsyntax
> CALL CONFIGURE_ALERTS({
>   'notification_integration_name': '<notification_integration_name>',
>   'email_addresses': ['<email_address>'],
>   'schedule_type': '<schedule>'
> });
> ```

Where:

`notification_integration_name`
:   Identifier for the [notification integration](../../user-guide/notifications/email-notifications.md) that you created for sending the email alerts.

`email_address`
:   Email address where the email notifications should be sent.

    * You can specify only one email address.
    * The email address must be specified in the ALLOWED_RECIPIENTS clause of the notification integration.

`schedule`
:   The frequency with which notifications should be sent. Specify one of the following values:

    * ONCE_PER_DAY: Send email notifications once a day at 12PM UTC.
    * LOWEST_INGESTION_SCHEDULE: Send email notifications immediately after an error occurs.

For example, if you named your connector application MY_CONNECTOR_SERVICENOW, use the notification integration `SN_EMAILS`
to send daily email notifications to `john.doe@snowflake.com` email, run the following commands:

> ```sqlexample
> GRANT USAGE ON INTEGRATION SN_EMAILS TO APPLICATION MY_CONNECTOR_SERVICENOW;
>
> CALL CONFIGURE_ALERTS({
>   'notification_integration_name': 'SN_EMAILS',
>   'email_addresses': ['john.doe@snowflake.com'],
>   'schedule_type': 'ONCE_PER_DAY'
> });
> ```

The connector references [notification integration](../../user-guide/notifications/email-notifications.md) object by name. Changing the name of this object or dropping it
causes the email alerts functionality to break.

### Disabling Email Notifications Using SQL

To disable email notifications, call the `DISABLE_ALERTS()` stored procedure:

> ```sqlsyntax
> CALL DISABLE_ALERTS();
> ```

If you need to enable email notifications again, see Enabling Email Notifications Using Snowsight.

---
title: Prepare your ServiceNow® instance
source: https://docs.snowflake.com/en/connectors/servicenow/prereqs.md
section: Connectors & Drivers
---

# Prepare your ServiceNow® instance

Before installing the Snowflake Connector for ServiceNow®, you must set up your ServiceNow® instance. Complete the following steps:

* ServiceNow® instance access - ensure your ServiceNow® instance is ready for use
* ServiceNow® user - Ensure the required user is properly configured
* Set up column indexes for optimized performance - Configure column indices for best performance
* Optional steps - review and perform optional configuration, if required

## ServiceNow® instance access

* Ensure the ServiceNow® instance is publicly available. The connector does not work with instances hidden behind a VPN.
* If you are using [IP Address Access Control](https://docs.servicenow.com/bundle/washingtondc-platform-security/page/administer/login/task/t_AccessControl.html) for your ServiceNow® instance, you won’t be able to successfully install the connector.
  For more information see [the community article](https://community.snowflake.com/s/article/Why-Snowflake-doesn-t-share-static-IP-address-with-customer).

## ServiceNow® user

Identify or create the ServiceNow® user for the connector.

To connect to the ServiceNow® instance, the connector must authenticate to the instance as a ServiceNow®
user. Choose a ServiceNow® user that meets the following requirements:

* The username cannot contain a colon (`:`).
* The user must have `read`, `query_match` and `query_range` access to all records in the ServiceNow® tables that
  you plan to ingest. Access control lists (ACLs) should not hide any records in these tables from this user.
* The user must have `read`, `query_match` and `query_range` access to all rows in the following tables in order
  to enable schema detection:

  + `sys_db_object` (with the fields `name`, `super_class`, `sys_id`),
  + `sys_glide_object` (with the fields `name`, `scalar_type`, `sys_id`),
  + `sys_dictionary` (with the fields `element`, `internal_type`, `name`, `sys_id`).
* The user must have `read`, `query_match` and `query_range` access to all rows in the following table in order to
  use the proper ingestion strategy:

  + `sys_table_rotation` (with the `name` and `sys_id` fields).
* The user must have `read`, `query_match` and `query_range` access to the `sys_updated_on` field in the
  below tables in order to not to use less cost-effective “truncate and load” ingestion mode:

  + `sys_db_object`,
  + `sys_glide_object`,
  + `sys_dictionary`,
  + `sys_table_rotation`,
  + journal table (usually `sys_audit_delete`).

> **Note:**
>
> Configuring the connection in Snowsight using the OAuth authentication to ServiceNow® is possible only
> with [interactive user](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/users-and-groups/concept/c_NonInteractiveSessions.html). The ServiceNow® user is interactive if the Web service access only
> setting is disabled for the user.
>
> You can use the OAuth authentication with non-interactive users only if you configure the connection with SQL commands.
> In this case, you cannot log in to ServiceNow® or get the OAuth refresh token using Snowsight.

## Set up column indexes for optimized performance

If you plan to ingest and synchronize a ServiceNow® table that has a `sys_updated_on` field, we recommend setting up
an index on that column. For information on setting up the indexes, see the [Create a Table Index](https://docs.servicenow.com/bundle/washingtondc-application-development/page/administer/table-administration/task/t_CreateCustomIndex.html) in the ServiceNow® documentation.

After you create the index through the user interface, it may take some time for the index to be constructed.
The indexing process runs as a background task.

If your instance has large tables, Snowflake recommends contacting ServiceNow® customer support to ask
about the best approach to indexing large tables.

## Optional steps

* If you plan to use the OAuth authentication method, and you have the [read-only role](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/user-administration/concept/c_ReadOnlyRole.html)
  assigned to your ServiceNow® user, make sure the `glide.security.snc_read_only_role.tables.exempt_create`
  system property has the `oauth_credential` table in its value list.

  Create or edit the `glide.security.snc_read_only_role.tables.exempt_create`
  property in the `sys_properties` table. For more details on editing this property, see [ServiceNow Knowledge Base](https://support.servicenow.com/kb?id=kb_article_view&sysparm_article=KB0783404).

  To learn how to add a new system property, see [Add a system property](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/reference-pages/reference/r_AvailableSystemProperties.html#t_AddAPropertyUsingSysPropsList) in the ServiceNow® documentation.
* To enable deleted records to be propagated, use the `sys_audit_delete` table as the source of information about deleted records.

  > **Note:**
  >
  > Please note that the connector must have access to all journal table records or the installation may fail.
  > Otherwise record deletions in other tables may not be correct.
  >
  > If journal table rows are hidden by ACLs, connector behavior is unpredictable.
  > Even if the installation is successful, some deletions may not be correctly synchronized at later points in the process.

  + To use `sys_audit_delete`:

    1. Set the `no_audit_delete` [dictionary attribute](https://docs.servicenow.com/bundle/washingtondc-application-development/page/administer/reference-pages/concept/c_DictionaryAttributes.html) to `false`.
    2. Make sure that the ServiceNow® user for the connector has access to the `sys_audit_delete` table and
       the `documentkey`, `tablename`, `sys_id`, and `sys_created_on` fields in this table.
    > **Note:**
    >
    > The connector is only able to synchronize deleted records if they are audited.
    > Delete operations that do not call `DBDelete.setWorkflow()` are not ingested in Snowflake.
    >
    > Refer to your ServiceNow® product documentation for more information on using `DBDelete.setWorkflow()`.
    >
    > Also, note the following about deleted records:
    >
    > - Record deletions are not tracked for tables with the `no_audit_delete=true` dictionary attribute.
    > - Record deletions from tables with a `sys` prefix are not tracked by default.
    > - The connector can only ingest records deleted with cascade record deletion if the reference field is on an
    >   audited table. Refer to your ServiceNow® product documentation for more information on cascade record deletion.

## Next steps

After completing these procedures, follow the steps in [Install and configure the connector with Snowsight](installing-snowsight.md) or [Install and configure the connector with SQL commands](installing-sql.md).

---
title: Preparing your Google Analytics and Google Cloud accounts
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-prereqs.md
section: Connectors & Drivers
---

# Preparing your Google Analytics and Google Cloud accounts

Before installing Snowflake Connector for Google Analytics Aggregate Data:

* Ensure that your Google Analytics properties are migrated to Google Analytics 4 (GA4). The Snowflake Connector for Google Analytics Aggregate Data does not support Universal Analytics.
* Ensure that the Google Analytics Admin API and Google Analytics Data API are enabled for your Google Cloud project.
* Do one of the following:

  > + Create a service account key for your Google Cloud project. The Snowflake Connector for Google Analytics Aggregate Data uses the service account to authenticate against the GA4 API. For more information, see [Configure service account authentication for Google Cloud](gaad-connector-create-service-account-key.md).
  > + Alternatively, configure the OAuth consent screen and client ID in your Google Cloud project. The Snowflake Connector for Google Analytics Aggregate Data uses the OAuth consent screen and the client ID to authenticate against the GA4 API. For more information, see [Configure OAuth authentication for Google Cloud](gaad-connector-create-client-id.md).

---
title: Preparing your Google Analytics and Google Cloud accounts
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-prereqs.md
section: Connectors & Drivers
---

# Preparing your Google Analytics and Google Cloud accounts

Before installing the Snowflake connector for Snowflake Connector for Google Analytics Raw Data:

* Ensure that your Google account has access to both Google Analytics and Google Cloud.
* Ensure that your Google Analytics properties are migrated to Google Analytics 4 (GA4). The Snowflake connector for Snowflake Connector for Google Analytics Raw Data does not support Universal Analytics.
* Configure the BigQuery link for each GA4 property you want to load to Snowflake. The link enables the GA4 raw data extraction to the Google Cloud project. For details, see [Configuring BigQuery Link for Google Analytics 4 property](gard-connector-create-link.md).
* Configure authentication for your Google Cloud project. Choose one of the two authentication methods supported by Snowflake Connector for Google Analytics Raw Data:

  + **Service account authentication**: Create a service account key for your Google Cloud project. The connector uses the service account to authenticate to the Google Cloud project and to read the GA4 data from the BigQuery storage. For details, see [Configuring service account authentication for Google Cloud Platform (GCP)](gard-connector-create-service-account-key.md).
  + **OAuth authentication**: Configure the OAuth consent screen and client ID for your Google Cloud project. The connector uses the OAuth consent screen and the client ID to authenticate to the Google Cloud project and to read the GA4 data from the BigQuery storage. For details, see [Configuring OAuth authentication for Google Cloud Platform (GCP)](gard-connector-create-client-id.md).
* Ensure that the Cloud Resource Manager API is enabled for your Google Cloud project. This allows the connector to list the GA4 properties available in your project.

---
title: Prerequisites for Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/prereqs.md
section: Connectors & Drivers
---

# Prerequisites for Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Before installing the Snowflake Connector for MySQL, you must ensure that the following prerequisites are met in your
MySQL and Snowflake environments.

## Setting up the prerequisites for MySQL

Before installing the Snowflake Connector for MySQL, do the following in your MySQL environment:

* Ensure that you have a MySQL 8 server that includes data you want to synchronize with Snowflake.
* Set the following options for your MySQL server:

  ```ini
  log_bin = on
  binlog_format = row
  binlog_row_metadata = full
  binlog_row_image = full
  binlog_row_value_options =
  ```

  > **Note:**
  >
  > Be cautious about the binary log expiration period (`binlog_expire_logs_seconds`). After it ends, binary log
  > files might be automatically removed. If the agent is paused for a long period of time (for example due to
  > maintenance work) and the expired binary log files are deleted during this time, the agent is not able to
  > replicate the data from these files. Set the binary log expiration period to at least a few hours to ensure stable
  > work of the connector.
  >
  > For more information about the automatic purging of binary log files, see
  > [MySQL Reference Manual](https://dev.mysql.com/doc/refman/8.0/en/replication-options-binary-log.html).

## Setting up the prerequisites for running the agent

Before installing the connector, you must set up the environment where the agent runs.

### Configuring your firewall to access to Snowflake

If you are using a firewall, add the Snowflake hostnames and port numbers to the allowed list.
For more information, see [Allowing Host names](../../user-guide/hostname-allowlist.md).

After adding the hostnames and port numbers to the allowed list, use
[SnowCD](../../user-guide/snowcd.md) to verify the Snowflake connection from the host where
you run the agent.

### Installing an orchestration tool

The agent is distributed as a Docker image that you can run using orchestration tools and services like Docker,
Kubernetes, or OpenShift.

To run the agent, you must have one of these tools installed. Your environment must have:

* At least 6 GB of RAM available to the container running the agent. The agent is a memory-intensive application.
* 4 CPUs available to handle the throughput requirements of the agent. Decreasing the number of CPUs decreases the
  throughput linearly. Having additional CPUs does not provide significant gains.

The Snowflake Connector for MySQL requires exactly one instance of the agent application to be running at all times.

## Next steps

After completing these procedures, follow the steps in [Prerequisites for Snowflake Connector for MySQL datasources](prereqs-datasource.md).

---
title: Prerequisites for Snowflake Connector for MySQL datasources
source: https://docs.snowflake.com/en/connectors/mysql6/prereqs-datasource.md
section: Connectors & Drivers
---

# Prerequisites for Snowflake Connector for MySQL datasources

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Before installing the Snowflake Connector for MySQL, do the following in your MySQL environment:

* Configure associated datasource
* Create required user

## Configure associated datasource

* Ensure that you have a MySQL version 8 or higher server that includes data you want to synchronize with Snowflake.
* Set the following options for your MySQL server:

  ```ini
  log_bin = on
  binlog_format = row
  binlog_row_metadata = full
  binlog_row_image = full
  binlog_row_value_options =
  ```

  > **Note:**
  >
  > Be cautious about the binary log expiration period (`binlog_expire_logs_seconds`). After it ends, binary log
  > files might be automatically removed. If the agent is paused for a long period of time (for example due to
  > maintenance work) and the expired binary log files are deleted during this time, the agent is not able to
  > replicate the data from these files. Set the binary log expiration period to at least a few hours to ensure stable
  > work of the connector.
  >
  > For more information about the automatic purging of binary log files, see
  > [MySQL Reference Manual](https://dev.mysql.com/doc/refman/8.0/en/replication-options-binary-log.html).

## Create required user

Create a user for the Snowflake Connector for MySQL with the following permissions:

> * `REPLICATION SLAVE` and `REPLICATION CLIENT` to be able to read from `binlog`.
>
>   For example:
>
>   > ```sql
>   > GRANT REPLICATION SLAVE ON *.* TO '<username>'@'%'
>   > GRANT REPLICATION CLIENT ON *.* TO '<username>'@'%'
>   > ```
> * `SELECT` permission to all tables that are replicated.
>
>   For example:
>
>   > ```sql
>   > GRANT SELECT ON <schema>.* TO '<username>'@'%'
>   > GRANT SELECT ON <schema>.<table> TO '<username>'@'%'
>   > ```
>
>   Where `<schema>.<table>` is the unique identifier of a table to be replicated.

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for MySQL using Snowsight](install-snowsight.md).

---
title: Prerequisites for Snowflake Connector for PostgreSQL datasources
source: https://docs.snowflake.com/en/connectors/postgres6/prereqs-datasource.md
section: Connectors & Drivers
---

# Prerequisites for Snowflake Connector for PostgreSQL datasources

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Before installing the Snowflake Connector for PostgreSQL, prepare the associated datasource by performing
the following tasks:

* Configure associated datasource
* Create required user

## Configure associated datasource

Ensure that you have a PostgreSQL version 11 or higher server that includes data you want to synchronize with Snowflake.
Before installing the Snowflake Connector for PostgreSQL, perform the following in your PostgreSQL environment:

* Configure wal_level
* Configure publication
* Create replication slot

### Configure wal_level

Snowflake Connector for PostgreSQL requires [wal_level](https://www.postgresql.org/docs/current/runtime-config-wal.html#GUC-WAL-LEVEL) set to `logical`.

Depending on where PostgreSQL server is hosted it can be done in different ways

|  |  |
| --- | --- |
| On premise | Execute following query with superuser or user with `ALTER SYSTEM` privilege:  ```ini ALTER SYSTEM SET wal_level = logical; ``` |
| RDS | User used by the agent needs to have the `rds_superuser` or `rds_replication` roles assigned.  You also need to set:  * `rds.logical_replication` static parameter to 1. * `max_replication_slots`, `max_connections` and `max_wal_senders` parameters according to your database and replication setup. |
| AWS Aurora | Set the `rds.logical_replication` static parameter to 1. |
| GCP | Set the following flags:  * `cloudsql.logical_decoding=on`. * `cloudsql.enable_pglogical=on`.   For more information, see [Google Cloud documentation](https://cloud.google.com/sql/docs/postgres/replication/configure-logical-replication#set-up-logical-replication-with-pglogical). |
| Azure | Set the replication support to `Logical`. For more information, see [Azure documentation](https://learn.microsoft.com/en-us/azure/postgresql/single-server/concepts-logical#set-up-your-server). |

### Configure publication

Snowflake Connector for PostgreSQL requires [Publication](https://www.postgresql.org/docs/current/logical-replication-publication.html#LOGICAL-REPLICATION-PUBLICATION) to be created and configured.

Login as user with `CREATE` privilege in the database and execute following query:

> ```ini
> CREATE PUBLICATION <publication name>;
> ```

Then define tables that the Snowflake Connector for PostgreSQL agent will be able to see using:

> ```sqlsyntax
> ALTER PUBLICATION <publication name> ADD TABLE <table name>;
> ```

> **Attention:**
>
> **For Postgres v15 and later**
>
> In case of publications created for subset of table’s columns, please add tables for replication
> using [ADD_TABLE_WITH_COLUMNS](configure-replication.md) procedure, specifying exactly
> the same set of columns.
>
> If `ADD_TABLES` will be used, the connector will work, but following non-obvious side effects will occur:
>
> > * in the destination database, columns that are not included in filter will be suffixed with `_DELETED`. All data replicated during snapshot phase will still be there.
> > * in case of adding more columns to the publication, table will result in `Permanently Failed` state, requiring restarting the replication.

For more information see [ALTER PUBLICATION documentation](https://www.postgresql.org/docs/current/sql-alterpublication.html).

### Create replication slot

Snowflake Connector for PostgreSQL will create [Replication Slot](https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-REPLICATION-SLOTS)
in PostgreSQL server with name having pattern `sf_db_conn_rs_kbmd_<DATASOURCE NAME>`, where `<DATASOURCE NAME>` is
the one specified in [ADD_DATA_SOURCE](configure-replication.md) procedure.

If the connector is not used anymore, Replication Slot must be removed to not accumulate data in PostgreSQL server.

> ```sql
> select pg_drop_replication_slot(<slot_name>)
> ```

## Create required user

Create user for Snowflake Connector for PostgreSQL with the `REPLICATION` attribute. For more information on replication security, see [PostgreSQL documentation](https://www.postgresql.org/docs/current/logical-replication-security.html).

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for PostgreSQL using Snowsight](install-snowsight.md).

---
title: Query the Cortex Search service with Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/cortex.md
section: Connectors & Drivers
---

# Query the Cortex Search service with Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

You can use the [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) service to build chat
and search applications to chat with or query your documents in SharePoint.

After you install and configure the Snowflake Connector for SharePoint and it begins
ingesting content from Sharepoint, you can query the Cortex Search service.
For more information about using Cortex Search, see [Query a Cortex Search service](../../../user-guide/snowflake-cortex/cortex-search/query-cortex-search-service.md).

## Filter responses

To restrict responses from the Cortex Search service to documents that a specific user
has access to in SharePoint, you can specify a filter containing the user ID or email address of the user
when you query Cortex Search. For example, `filter.@contains.user_ids` or `filter.@contains.user_emails`.
The name of the Cortex Search service created by the connector is `search_service` in the schema `Cortex`.

Run the following SQL code in a SQL worksheet to query
the Cortex Search service with files ingested from your SharePoint site.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.
* `your_question`: The question that you want to get responses for.
* `number_of_results`: Maximum number of results to return in the response. The maximum value is 1000 and the default value is 10.

```sqlexample
SELECT PARSE_JSON(
  SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
    '<application_instance_name>.cortex.search_service',
      '{
        "query": "<your_question>",
         "columns": ["chunk", "web_url"],
         "filter": {"@contains": {"user_emails": "<user_emailID>"} },
         "limit": <number_of_results>
       }'
   )
)['results'] AS results
```

Here’s a complete list of values that you can enter for `columns`:

| Column name | Type | Description |
| --- | --- | --- |
| `full_name` | String | A full path to the file from the Sharepoint site documents root. Example: `folder_1/folder_2/file_name.pdf`. |
| `web_url` | String | A URL that displays an original Sharepoint file in a browser. |
| `last_modified_date_time` | String | Date and time when the item was most recently modified. |
| `chunk` | String | A piece of text from the document that matched the Cortex Search query. |
| `user_ids` | Array | An array of Microsoft 365 user IDs that have access to the document. It also includes user IDs from all the Microsoft 365 groups that are assigned to the document. To find a specific user ID, see [Get a user](https://learn.microsoft.com/en-us/graph/api/user-get?view=graph-rest-1.0&tabs=http). |
| `user_emails` | Array | An array of Microsoft 365 user email IDs that have access to the document. It also includes user email IDs from all the Microsoft 365 groups that are assigned to the document. |

## Example: Query an AI assistant for human resources (HR) information

You can use Cortex Search to query an AI assistant for employees to chat with the latest versions of
HR information, such as onboarding, code of conduct, team processes, and organization policies.
Using response filters, you can also allow HR team members to query employee contracts while adhering to access controls configured in SharePoint.

SQLPythonREST API

Run the following in a [SQL worksheet](../../../user-guide/ui-snowsight-worksheets-gs.md) to query the Cortex Search service with files ingested from SharePoint.
Select the database as your application instance name and schema as **Cortex**.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```sqlexample
SELECT PARSE_JSON(
     SNOWFLAKE.CORTEX.SEARCH_PREVIEW(
          '<application_instance_name>.cortex.search_service',
          '{
             "query": "What is my vacation carry over policy?",
             "columns": ["chunk", "web_url"],
             "filter": {"@contains": {"user_emails": "<user_emailID>"} },
             "limit": 1
          }'
     )
 )['results'] AS results
```

Run the following code in a [Python worksheet](../../../user-guide/ui-snowsight-worksheets-gs.md) to query the
Cortex Search service with files ingested from SharePoint.
Ensure that you add the `snowflake.core` package to your database.

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `user_emailID`: Email ID of the user who you want to filter the responses for.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.core import Root

def main(session: snowpark.Session):

   root = Root(session)

   # fetch service
   my_service = (root
     .databases["<application_instance_name>"]
     .schemas["cortex"]
     .cortex_search_services["search_service"]
   )

   # query service
   resp = my_service.search(
     query="What is my vacation carry over policy?",
     columns = ["chunk", "web_url"],
     filter = {"@contains": {"user_emails": "<user_emailID>"} },
     limit=1
   )
   return (resp.to_json())
```

Execute the following code in a command-line interface to query the Cortex Search
service with files ingested from your SharePoint.
You will need to authentication through key pair authentication and OAuth to access the
Snowflake REST APIs. For more information,
see [REST API](../../../user-guide/snowflake-cortex/cortex-search/query-cortex-search-service.md)
and [Authenticating Snowflake REST APIs with Snowflake](../../../developer-guide/snowflake-rest-api/authentication.md).

Replace the following:

* `application_instance_name`: Name of your database and connector application instance.
* `account_url`: Your Snowflake account URL. For instructions on finding your account URL, see [Finding the organization and account name for an account](../../../user-guide/admin-account-identifier.md).

```bash
curl --location "https://<account_url>/api/v2/databases/<application_instance_name>/schemas/cortex/cortex-search-services/search_service" \
     --header 'Content-Type: application/json' \
     --header 'Accept: application/json' \
     --header "Authorization: Bearer <CORTEX_SEARCH_JWT>" \
     --data '{
         "query": "What is my vacation carry over policy?",
         "columns": ["chunk", "web_url"],
         "limit": 1
     }'
```

Sample response:

```output
{
  "results" : [ {
  "web_url" : "https://<domain>.sharepoint.com/sites/<site_name>/<path_to_file>",
  "chunk" : "Answer to the question asked."
  } ]
}
```

## Next steps

[Manage the Snowflake Connector for SharePoint](manage.md).

---
title: Reinstall the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/reinstall.md
section: Connectors & Drivers
---

# Reinstall the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

To upgrade or reinstall the Snowflake Connector for MySQL, do the following:

1. Drop the connector database using Snowsight. You will find instructions [here.](https://other-docs.snowflake.com/en/native-apps/consumer-managing-applications#uninstall-an-app-using-snowsight)
2. Install and run the new version as described in [Snowflake Connector for MySQL installation and configuration tasks](tasks.md).

   > > **Note:**
   > >
   > > The reinstalled connector will need to be set up again and will pull all the data from the source system like a fresh installation.
   > > Destination database can be reused, but data in existing tables will be reloaded instead of updated.

---
title: Reinstall the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/reinstall.md
section: Connectors & Drivers
---

# Reinstall the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

To upgrade or reinstall the Snowflake Connector for PostgreSQL, do the following:

1. Shut down the connector.
2. Install and run the new version as described in [Snowflake Connector for PostgreSQL installation and configuration tasks](tasks.md).

   > > **Note:**
   > >
   > > The reinstalled connector will need to be set up again and will pull all the data from the source system like a fresh installation.
   > > Destination database can be reused, but data in existing tables will be reloaded instead of updated.

---
title: Role-based access control for connectors
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-rbac-access-control.md
section: Connectors & Drivers
---

# Role-based access control for connectors

The following sections describe application roles used in the connector application:

* ADMIN
* VIEWER
* DATA_READER

These application roles are automatically assigned to the account level role responsible for installing the application on the account.
They can be reassigned to others to grant control and data access to connector data and to the connector itself.
See also [GRANT APPLICATION ROLE](../../../sql-reference/sql/grant-application-role.md).

## ADMIN application role

You must use Snowflake Role ACCOUNTADMIN role paired with Application Role `ADMIN` to perform initial configuration of the connector, including the installation.

You can pair the `ADMIN` application role with any other Snowflake role after initial configuration to manage connector data synchronization.
The `ADMIN` application role grants access to all public views and procedures, which when paired with granted account level privileges can be used to:

* View Home Tab and ingestion statistics.
* View and manage data synchronization.
* View settings and connector configuration and manage alerts.

> **Attention:**
>
> To manage connector alerts, grant either the ACCOUNTADMIN role or the CREATE INTEGRATION privilege to the role that the ADMIN application is assigned to.
> To grant these rights, execute the following SQL code:
> `GRANT CREATE INTEGRATION ON ACCOUNT TO ROLE <replace-with-your-role-name>;`

## VIEWER application role

The `VIEWER` application role can be assigned to any role and is used to:

> * View the connector home tab and ingestion statistics.
> * View connector data synchronization.
> * View connector settings and configuration.

## DATA_READER application role

Users who want to access the ingested data should use only the `DATA_READER` role.
The `DATA_READER` application role *must* be used to grant read privileges on replicated data.

This role is used to grant access to ingested data. To assign the `DATA_READER` role,
you can either use Manage access in Snowsight or execute the following SQL statement:

```sqlexample
GRANT APPLICATION ROLE DATA_READER to ROLE <replace-with-your-role-name>;
```

Do not attempt to access replicated data by changing ownership to the destination database;
instead grant the `DATA_READER` application role.

To view replicated data, a user must have the following privileges:

* `USAGE` on the destination database
* `USAGE` on the destination schema
* `SELECT` on the destination table

The connector grants `USAGE`/`SELECT` privileges to this role on all tables and views created by the application.

> **Attention:**
>
> The `DATA_READER` application role is granted privileges only on objects created by the application.
> If the destination database or destination schema already exists and is not owned by the connector application,
> the connector won’t be able to grant proper privileges to the `DATA_READER` role on these objects.
> In such situations, account level roles with the `DATA_READER` application role must be manually updated with a `USAGE` grant on these objects.

---
title: Role-based access control for connectors (GARD)
source: https://docs.snowflake.com/en/connectors/google/gard/gard-rbac-access-control.md
section: Connectors & Drivers
---

# Role-based access control for connectors (GARD)

The following sections describe application roles used in the connector application:

* ADMIN
* VIEWER
* DATA_READER

These application roles are automatically assigned to the account level role responsible for installing the application on the account.
They can be reassigned to others to grant control and data access to connector data and to the connector itself.
See also [GRANT APPLICATION ROLE](../../../sql-reference/sql/grant-application-role.md).

## ADMIN application role

You must use Snowflake Role ACCOUNTADMIN role paired with Application Role `ADMIN` to perform initial configuration of the connector, including the installation.

You can pair the `ADMIN` application role with any other Snowflake role after initial configuration to manage connector data synchronization.
The `ADMIN` application role grants access to all public views and procedures, which when paired with granted account level privileges can be used to:

* View Home Tab and ingestion statistics.
* View and manage data synchronization.
* View settings and connector configuration and manage alerts.

> **Attention:**
>
> To manage connector alerts, grant either the ACCOUNTADMIN role or the CREATE INTEGRATION privilege to the role that the ADMIN application is assigned to.
> To grant these rights, execute the following SQL code:
> `GRANT CREATE INTEGRATION ON ACCOUNT TO ROLE <replace-with-your-role-name>;`

## VIEWER application role

The `VIEWER` application role can be assigned to any role and is used to:

> * View the connector home tab and ingestion statistics.
> * View connector data synchronization.
> * View connector settings and configuration.

## DATA_READER application role

Users who want to access the ingested data should use only the `DATA_READER` role.
The `DATA_READER` application role *must* be used to grant read privileges on replicated data.

This role is used to grant access to ingested data. To assign the `DATA_READER` role,
you can either use Manage access in Snowsight or execute the following SQL statement:

```sqlexample
GRANT APPLICATION ROLE DATA_READER to ROLE <replace-with-your-role-name>;
```

Do not attempt to access replicated data by changing ownership to the destination database;
instead grant the `DATA_READER` application role.

To view replicated data, a user must have the following privileges:

* `USAGE` on the destination database
* `USAGE` on the destination schema
* `SELECT` on the destination table

The connector grants `USAGE`/`SELECT` privileges to this role on all tables and views created by the application.

> **Attention:**
>
> The `DATA_READER` application role is granted privileges only on objects created by the application.
> If the destination database or destination schema already exists and is not owned by the connector application,
> the connector won’t be able to grant proper privileges to the `DATA_READER` role on these objects.
> In such situations, account level roles with the `DATA_READER` application role must be manually updated with a `USAGE` grant on these objects.

---
title: Role-based access control for connectors (ServiceNow)
source: https://docs.snowflake.com/en/connectors/servicenow/application-roles.md
section: Connectors & Drivers
---

# Role-based access control for connectors (ServiceNow)

The following sections describe application roles used in the connector application:

* ADMIN
* VIEWER
* DATA_READER

These application roles are automatically assigned to the account level role responsible for installing the application on the account.
They can be reassigned to others to grant control and data access to connector data and to the connector itself.
See also [GRANT APPLICATION ROLE](../../sql-reference/sql/grant-application-role.md).

## ADMIN application role

You must use Snowflake Role ACCOUNTADMIN role paired with Application Role `ADMIN` to perform initial configuration of the connector, including the installation.

You can pair the `ADMIN` application role with any other Snowflake role after initial configuration to manage connector data synchronization.
The `ADMIN` application role grants access to all public views and procedures, which when paired with granted account level privileges can be used to:

* View Home Tab and ingestion statistics.
* View and manage data synchronization.
* View settings and connector configuration and manage alerts.

> **Attention:**
>
> To manage connector alerts, grant either the ACCOUNTADMIN role or the CREATE INTEGRATION privilege to the role that the ADMIN application is assigned to.
> To grant these rights, execute the following SQL code:
> `GRANT CREATE INTEGRATION ON ACCOUNT TO ROLE <replace-with-your-role-name>;`

## VIEWER application role

The `VIEWER` application role can be assigned to any role and is used to:

> * View the connector home tab and ingestion statistics.
> * View connector data synchronization.
> * View connector settings and configuration.

## DATA_READER application role

Users who want to access the ingested data should use only the `DATA_READER` role.
The `DATA_READER` application role *must* be used to grant read privileges on replicated data.

This role is used to grant access to ingested data. To assign the `DATA_READER` role,
you can either use Manage access in Snowsight or execute the following SQL statement:

```sqlexample
GRANT APPLICATION ROLE DATA_READER to ROLE <replace-with-your-role-name>;
```

Do not attempt to access replicated data by changing ownership to the destination database;
instead grant the `DATA_READER` application role.

To view replicated data, a user must have the following privileges:

* `USAGE` on the destination database
* `USAGE` on the destination schema
* `SELECT` on the destination table

The connector grants `USAGE`/`SELECT` privileges to this role on all tables and views created by the application.

> **Attention:**
>
> The `DATA_READER` application role is granted privileges only on objects created by the application.
> If the destination database or destination schema already exists and is not owned by the connector application,
> the connector won’t be able to grant proper privileges to the `DATA_READER` role on these objects.
> In such situations, account level roles with the `DATA_READER` application role must be manually updated with a `USAGE` grant on these objects.

---
title: Set up data ingestion for your ServiceNow® data
source: https://docs.snowflake.com/en/connectors/servicenow/ingestion.md
section: Connectors & Drivers
---

# Set up data ingestion for your ServiceNow® data

This topic describes how to set up data ingestion for the Snowflake Connector for ServiceNow®.

> **Note:**
>
> The Snowflake Connector for ServiceNow® ingests data from ServiceNow® tables into Snowflake. Data ingestion
> depends on `v2` of the ServiceNow® [table API](https://developer.servicenow.com/dev.do#!/reference/api/latest/rest/c_TableAPI).

## Strategies for ingesting ServiceNow® tables

> **Note:**
>
> * The connector can only ingest tables with `sys_id` columns present.
> * [ServiceNow views](https://docs.servicenow.com/bundle/washingtondc-application-development/page/use/reporting/concept/c_DatabaseViews.html) are not supported. Instead of ingesting these views, you should synchronize all tables
>   for the underlying view and join the synchronized tables in Snowflake.

The connector uses different ingestion strategies, depending on the table schema. The connector uses three
ingestion modes:

* The initial load of data occurs for each table when the table is enabled for synchronization.

  In this mode, the table is ingested by iterating through the records identified by the IDs in the `sys_id` column. When all records are ingested,
  the initial load phase is complete. For certain tables, you can also set the
  data range start time value which can restrict which records are ingested.
* Incremental updates occur only for tables with `sys_updated_on` or `sys_created_on` columns.

  Incremental updates begin after the initial load is done and occur on a regular schedule that you can configure.
  In this mode, the connector ingests only the records that were added, updated, or deleted since the last
  synchronization. Information about deletions comes from the journal table provided during connector configuration.
* For tables that don’t have `sys_updated_on` or `sys_created_on` columns, the connector uses the
  truncate and load mode.

  In this mode, the table is always ingested using the initial load approach, and newly ingested data replaces
  the old data. The connector replaces the data by running the `INSERT OVERWRITE` command.

> **Note:**
>
> * In the “incremental updates” mode, the connector uses the `sys_updated_on` column, if that column is
>   present. If the column is not present, the connector uses the `sys_created_on` column instead.
> * For [rotated tables](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/platform-performance/concept/c_TableRotation.html), the connector always uses the `sys_created_on` column. If the table is rotated using
>   a different column than `sys_created_on`, the ingestion of that table might cause performance issues.
> * If the `sys_updated_on` or `sys_created_on` fields are not updated when the record is modified
>   in ServiceNow, those modifications won’t be propagated to Snowflake, which results in data
>   inconsistency. Snowflake recommends that you avoid [disabling the update of system fields](https://developer.servicenow.com/dev.do#!/reference/api/washingtondc/server_legacy/c_GlideRecordAPI%23r_GlideRecord-autoSysFields_Boolean).
> * If a record deletion is [not audited](https://developer.servicenow.com/dev.do#!/reference/api/washingtondc/server_legacy/c_GlideRecordAPI#r_GlideRecord-setWorkFlow_Boolean), information about deleted records won’t be propagated to
>   Snowflake, resulting in a data inconsistency.

> **Note:**
>
> Because of restrictions on the Snowflake and ServiceNow® REST APIs, the connector cannot ingest data
> into a table if a single row exceeds 128 MB of data. In that case, the connector tries to ingest data
> with the frequency defined in the table schedule. If a row exceeds the limit, the connector
> generates an error message and continues ingesting other tables. To overcome this limitation,
> you can configure column filtering
> to exclude large columns from ingestion.

### Archived records

The connector does not actively reflect the [records archived in ServiceNow](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/database-rotation/concept/c_ArchiveData.html) on the Snowflake side for the ingested tables.
Assuming that you archive inactive records older than a certain date, the following apply:

* Any record archived before the connector ingested it (for example, before the initial load of the table) will not be present in the table on the Snowflake side at all.
* Any record archived after it was already ingested by the connector remains on the Snowflake side with no indication of archive action occurring.
* Any archived record restored for a table that is already operating in incremental updates mode will not be ingested on the Snowflake side unless that record is also modified afterwards (with its `sys_updated_on` value being updated to current time).
* An archived record restored during the initial load of the table may be ingested on the Snowflake side depending on its ID in the `sys_id` column.

If you want to bring the table with an active archive rule up to date, you can reload the entire table
but any record archived or restored after the reload is finished will follow the same principles listed above.

ServiceNow archive tables `ar_[table_name]` can be enabled for synchronization. However, the first incremental update
that follows the initial load of such table are searched for records created/updated past the date
the initial load of the archive table has started. Because neither `sys_updated_on` nor `sys_created_on`
are modified when the record is archived, records archived after the initial load of the archive
table up to a certain point in time are missing in it on the Snowflake side. For example, if you archive records older
than a year, then any record archived for a year after the initial load of the archive table is not ingested
to the archive table on the Snowflake side. The archived records that were restored or deleted by a [destroy rule](https://docs.servicenow.com/bundle/washingtondc-platform-administration/page/administer/database-rotation/task/t_CreateADestructionRule.html)
following initial load of an archive table is never removed from it on the Snowflake side.

## Parallel ingestion of ServiceNow® tables

The connector ingests a few tables in parallel, but the ingestion of each individual table is a synchronous
process. This means that ingesting large tables might block the connector from updating other tables. This issue
is more likely to occur during the initial load phase than in other phases.
By default the connector uses 10 worker threads, which is considered an optimal value to not overload the ServiceNow® instance.
If you are sure that your instance can support additional concurrency, you can increase this value to a maximum of 30 by calling
[CONFIGURE_CONCURRENCY procedure](managing.md).

## Set up data ingestion using Snowsight

To set up data ingestion using Snowsight, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow® app, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select Data Sync tab.

   This displays a list of all the ServiceNow® tables.

   > **Note:**
   >
   > The connector can only ingest tables with `sys_id` columns present.
5. Select the tables you want to ingest:

   1. Search for the table you want to ingest.
   2. Select the checkbox in the Status column from the left to the table you want to select.
   3. Under Sync Schedule, select how frequently you want to synchronize the table between Snowflake and ServiceNow®.
   4. Repeat these steps for each table you want to ingest into Snowflake.
6. Select the heading of the Status column to see the tables you have currently selected.
7. Select Start sync to begin ingesting data into your Snowflake account.

The connector status changes to Syncing data. When at least one of the tables is ingested successfully, the
connector status changes to Last Sync: just now.

Refer to [Monitoring the Snowflake Connector for ServiceNow®](monitoring.md) for information on how to view the contents of the tables in Snowflake.

### Modify data ingestion using Snowsight

To modify the ServiceNow® tables to be ingested or the synchronization schedule for the tables, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for ServiceNow® app, then select the tile for the connector.
4. In the page for the Snowflake Connector for ServiceNow®, select Data Sync.
5. Select Edit tables button to enter into the editing mode.
6. Modify the tables you want to ingest:

   1. Search for the table you want to ingest.
   2. Select the checkbox in the Status column from the left to the table you want to select or deselect.
   3. Under Sync Schedule, select how frequently you want to synchronize the table between Snowflake
      and ServiceNow®.
7. Select Update data sync.

## Set up data ingestion using SQL statements

To set up data ingestion using SQL statements, do the following:

* The schedule for synchronizing the tables.
* The list of tables that should be synchronized.

> **Note:**
>
> To configure these settings, you use stored procedures that are defined in the PUBLIC schema of
> [the database that serves as an instance of the connector](installing-sql.md).
>
> Before calling these stored procedures, select that database as the database to use for the session.
>
> For example, if that database is named `my_connector_servicenow`, run the following command:
>
> ```sqlexample
> USE DATABASE my_connector_servicenow;
> ```

### Enable or disable the table synchronization

This section describes how to enable or disable the synchronization of a table in ServiceNow®.
Synchronization enablement can be done with both default and custom configuration.

#### Enable multiple tables using the default configuration

To enable the synchronization of data for at least one table in ServiceNow®, call the `ENABLE_TABLES` stored procedure with the following arguments:

```sqlsyntax
CALL ENABLE_TABLES(<tables_to_enable>);
```

Where:

`tables_to_enable`
:   Specifies an array of ServiceNow® table names.

    Use the table name, not the label displayed in the ServiceNow® UI. You can find the table name in the
    [data dictionary tables in ServiceNow](https://docs.servicenow.com/bundle/washingtondc-application-development/page/administer/managing-data/concept/c_DataDictionaryTables.html). In the ServiceNow® UI, go to
    System Definition » Tables. The Name column displays the name of the table.

For example, to enable the synchronization of the tables named `table1`, `table2`, and `table3`, run
the following command:

```sqlexample
CALL ENABLE_TABLES(['table1', 'table2', 'table3']);
```

#### Disable multiple tables

To disable table data synchronization for a specific table in ServiceNow®, call the `DISABLE_TABLES`
stored procedure with the following arguments:

```sqlsyntax
CALL DISABLE_TABLES(<tables_to_disable>);
```

Where:

`tables_to_disable`
:   Specifies an array of ServiceNow® table names.

    Use the table name, not the label displayed in the ServiceNow® UI. You can find the table name in the
    [data dictionary tables in ServiceNow](https://docs.servicenow.com/bundle/washingtondc-application-development/page/administer/managing-data/concept/c_DataDictionaryTables.html). In the ServiceNow® UI, go to
    System Definition » Tables. The Name column displays the name of the table.

For example, to disable the synchronization of the tables named `table1` and `table2`, run the following command:

```sqlexample
CALL DISABLE_TABLES(['table1', 'table2']);
```

Disabling the table stops synchronization gracefully, as soon as it’s possible.
When the table synchronization is re-enabled, ingestion resumes from where it was paused.

> **Note:**
>
> Disabling all tables from synchronization does not mean that the Snowflake Connector for ServiceNow® stops incurring cost.
> Background tasks, such as those related to notifications, can continue to execute.

The `ENABLE_TABLES` and `DISABLE_TABLES` procedures add the specified table names to the `CONFIGURED_TABLES` view.

> **Note:**
>
> The connector does not support [roll backs or delete recoveries](https://docs.servicenow.com/bundle/washingtondc-application-development/page/administer/table-administration/concept/rollback-delete-recovery.html) in ServiceNow®.
>
> Using the roll back and delete recovery features may result in data inconsistency. Records that are recovered in
> ServiceNow® may still be marked as deleted in Snowflake. To resolve it you can reload
> the table.

#### Enable a single table by using custom configuration

* To enable the synchronization of data with custom configuration for a specific table in ServiceNow®, call the `ENABLE_TABLE` stored procedure with the following arguments:

  ```sqlsyntax
  CALL ENABLE_TABLE('<table_to_enable>', <table_config>);
  ```

  Where:

  `table_to_enable`
  :   Specifies a ServiceNow® table name.

  `table_config`
  :   Optional: Specifies an object with table ingestion configuration. If not specified, table ingestion uses the default configuration.

      Currently supported configurations are:

      > + column filtering: Provide `include_columns` or `exclude_columns` properties with a list of column names.
      > + row filtering: Provide `filter` property with a filter expression.
      > + synchronization schedule: Provide the `schedule` property with custom ingestion schedule.
      > + deletions synchronization enablement: Provide the `sync_deletions` boolean property.
      > + display values fetching: Provide the `fetch_display_values` boolean property.

      > **Note:**
      >
      > All of the custom configurations can be combined in a single object and used simultaneously for a single table ingestion.
      >
      > **Example**:
      >
      > The table `sys_audit` has the following configuration:
      >
      > > + The table should be synchronized every Saturday at 10:00 AM UTC.
      > > + Only the columns `newvalue` and `reason` should be ingested.
      > > + Only the rows that have the `newvalue` column starting with the string `privacy` should be ingested.
      > > + If a journal table is configured, deletions shouldn’t be synchronized.
      > > + Display values should be fetched for all fields.
      > >
      > > To enable ingestion of the table, run the following command:
      > >
      > > ```sqlsyntax
      > > CALL ENABLE_TABLE('sys_audit', {
      > >   'schedule': { 'type': 'custom', 'value': { 'hour': 10, 'day_of_week': '6' } },
      > >   'include_columns': ['newvalue', 'reason'],
      > >   'row_filter': 'newvalue STARTSWITH "privacy"',
      > >   'sync_deletions': false,
      > >   'fetch_display_values': true
      > > });
      > > ```

### Enable a single table by using column filtering

If you don’t need all columns from a ServiceNow® table in Snowflake, the connector can ignore them. For example, skip columns if a single row exceeds the maximum row size of 128 MB.

To enable table ingestion with specified columns, run the following command:

```sqlsyntax
CALL ENABLE_TABLE('<table_to_enable>', <table_config>);
```

Where:

`table_to_enable`
:   Specifies a ServiceNow® table name.

`table_config`
:   Object including `include_columns` or `exclude_columns` properties with a list of column names.
    If `sys_id`, `sys_created_on`, and `sys_updated_on` exist, they are always included.
    You don’t have to add them to `included_columns` array and cannot exclude them using `excluded_columns` as the connector uses them in the ingestion process.

> **Note:**
>
> Since columns in ServiceNow® are written in lowercase and the API that the connector uses is case-sensitive, the values provided for specified columns must also be in lowercase.

> **Note:**
>
> You shouldn’t provide both `include_columns` and `exclude_columns`. If you want to list `include_columns`, you should skip the `exclude_columns` property, and vice versa.
> If both arrays are not empty and there aren’t any conflicting columns, `include_columns` takes precedence over `exclude_columns`.
>
> If both `include_columns` and `exclude_columns` are empty arrays, all the available columns will be ingested.

For example with a ServiceNow® table named `u_table` with columns `sys_id`, `sys_updated_on`, `col_1` and `col_2` and executing:

```sqlsyntax
CALL ENABLE_TABLE('u_table', { 'include_columns': ['sys_id', 'sys_updated_on'] });
```

will ingest only `sys_id` and `sys_updated_on` columns for the given table, but calling:

```sqlsyntax
CALL ENABLE_TABLE('u_table', { 'exclude_columns': ['col_1'] });
```

will ingest `sys_id`, `sys_updated_on` and also `col_2`.

The connector validates the provided columns and rejects the enablement request if any of the columns are not available in ServiceNow®.
ServiceNow® API supports only include mode. As a result the connector transforms provided column arrays into included columns list and sends them with each request to ServiceNow®.
The URL with included columns could possibly be too long to be handled by ServiceNow®. The connector validates this limitation when the `ENABLE_TABLE` is invoked.

Columns configuration for each table can be found in the `INCLUDED_COLUMNS` column of the `CONFIGURED_TABLES` view.
To modify the list of ingested columns, you need to disable the specific table first.
If column filtering is configured for a table, you can enable the table only using the `ENABLE_TABLE` procedure. You cannot use the `ENABLE_TABLES`, which accepts a list of tables as an argument.

Flattened views only include the columns specified when the table was enabled. They are updated every time the list of included columns changes.
If column filtering is not configured, views contain all the available columns.

> **Note:**
>
> Configuration change does not affect the previously ingested data. Column filtering applies only to the newly ingested records.
> To apply the filter to the previously ingested data, the table needs to be reloaded.

### Enable a single table by using row filtering

You can exclude data ingestion for select rows from a ServiceNow® table by specifying a filter condition.
For example, to exclude the rows which include sensitive data that you don’t want in Snowflake,
or exclude the rows which include unnecessary data in order to reduce cost.

To enable table ingestion with specified row filter run the following command:

```sqlsyntax
CALL ENABLE_TABLE('<table_to_enable>', <table_config>);
```

Where:

`table_to_enable`
:   Specifies a ServiceNow® table name.

`table_config`
:   Object including `row_filter` property with a filter expression, which is a valid string.

    Currently supported filter operators are:

    | Operator | Description | Example |
    | --- | --- | --- |
    | `AND` | Logical operator to join conditions, where both must be fulfilled. | `active = "true" AND impact = "2"` |
    | `OR` | Logical operator to join conditions, where at least one of them must be fulfilled.  **Important:** Takes precedence over `AND` operator. See the examples below. | `tablename = "incident" OR tablename = "problem"` |
    | `=` | Returns `true` if the values are equal. | `priority = "1"` |
    | `!=` | Returns `true` if the values are not equal. | `state != "7"` |
    | `LIKE` | Returns `true` if the value contains the specified character sequence. [1] | `newvalue LIKE "privacy"` |
    | `NOT LIKE` | Returns `true` if the value doesn’t contain a specified character sequence. [1] | `description NOT LIKE "test"` |
    | `STARTSWITH` | Returns `true` if the value starts with the specified character sequence. [1] | `description STARTSWITH "important"` |
    | `ENDSWITH` | Returns `true` if value ends with specified character sequence. [1] | `description ENDSWITH "important"` |
    | `IN` | Returns `true` if the value is equal to any of the list of values. [2] | `tablename IN ("incident", "task", "cmdb_ci")` |
    | `NOT IN` | Returns `true` if the value is not equal to any of the list values. [2] | `status NOT IN ("in progress", "on hold", "cancelled")` |

[1] - fields must be of `string` data type.

[2] - choice fields must contain strings.

> Filter expression rules and limitations:
>
> > * any two filter expressions must be joined with the `AND` or the `OR` operator.
> > * Operators must be separated by space and be in uppercase.
> > * Value expressions must be enclosed in double quotes.
> > * expressions are case-sensitive.
> > * the expression cannot operate on `sys_id`, `sys_updated_on`, or `sys_created_on` columns.

> **Note:**
>
> Configuration changes do not affect the previously ingested data. Row filtering applies only to the newly ingested records.
> To apply the filter to the already ingested data, the table must be reloaded.

#### Examples

* To enable ingestion of table `sys_audit`, but synchronize only the rows that are related to the privacy incidents in the `INCIDENT` table, execute:

```sqlsyntax
CALL ENABLE_TABLE('sys_audit', {
  'row_filter': 'tablename = "incident" AND fieldname = "cause" AND newvalue LIKE "privacy"'
});
```

* To enable ingestion of table `incident`, but synchronize only the rows under such conditions that:

  + `active` field is equal to `true`,
  + `sys_created_by` field starts with `support` or ends with `admin`,
  + `category` field is one of `Network`, `Cloud Management`,

  execute:

```sqlsyntax
CALL ENABLE_TABLE('incident', {
  'row_filter': 'active = "true" AND sys_created_by STARTSWITH "support" OR sys_created_by ENDSWITH "admin" AND category IN ("Network", "Cloud Management")'
});
```

* To enable ingestion of table `incident`, but ingest only the rows in the specified incident state and only the given columns, execute:

```sqlsyntax
CALL ENABLE_TABLE('incident', {
  'row_filter': 'incident_state IN ("1", "2", "3")', -- "New", "In Progress", "On Hold"
  'include_columns': ['incident_state', 'description']
});
```

### Specify the synchronization schedule

The Snowflake Connector for ServiceNow® synchronizes data from all ServiceNow® tables to Snowflake on a specified
schedule. By default, all of the tables are synchronized once every hour (1h).

To change the default synchronization schedule for all tables, call the `CONFIGURE_DATA_INGESTION_SCHEDULE` stored procedure
with the following arguments:

```sqlsyntax
CALL CONFIGURE_DATA_INGESTION_SCHEDULE(<schedule>);
```

Where:

> `schedule`
> :   Specifies the frequency of the synchronization. You can specify one of the following JSON values:
>
>     * `{ 'type': 'continuous' }`, which is near real-time ingestion schedule. A table with this synchronization schedule
>       uses dedicated worker to ingest data and doesn’t count towards the maximum number of tables that can be synchronized in
>       parallel. For more information, see [Scaling the connector](managing.md). You can configure up to
>       20 tables with continuous schedule.
>
>       > **Warning:**
>       >
>       > Tables with continuous schedule cause increased load on a ServiceNow® instance. Additionally, it causes the connector
>       > warehouse to be constantly utilised, which ramps up warehouse credit consumption. Snowflake recommends using continuous
>       > schedules carefully and only for tables that require near real-time data in Snowflake. To prevent overloading of a
>       > ServiceNow® instance, the connector implements a detection mechanism that is able to automatically disable
>       > failing tables with continuous schedule. See [Table with continuous schedule disabled by the connector](troubleshooting.md) for more
>       > information.
>     * `{ 'type': 'interval', 'value': '<interval_value>' }`, where `interval_value` is one of the following
>       string values:
>
>       + `'30m'`
>       + `'1h'`
>       + `'3h'`
>       + `'6h'`
>       + `'12h'`
>       + `'1d'`
>     * `{ 'type': 'custom', 'value': { 'hour': <hour>, 'day_of_week': '<day_of_week>' } }`, where `hour` specifies the
>       hour in UTC timezone at which the ingestion should start, and `day_of_week` specifies day of the week when the ingestion
>       should be performed. It is possible to use special expressions as a day of week:
>
>       + `'*'` to run the ingestion everyday.
>       + `'1-3'` to run the ingestion from Monday to Wednesday.
>       + `'0,5,6'` to run the ingestion on Friday, Saturday and Sunday.
>
>       Possible values that can be used in the expression for `day_of_week` configuration are:
>
>       + `'0'` - Sunday
>       + `'1'` - Monday
>       + `'2'` - Tuesday
>       + `'3'` - Wednesday
>       + `'4'` - Thursday
>       + `'5'` - Friday
>       + `'6'` - Saturday
>
>       Other non-digit values like `'5L'` indicating the last Friday of a month or `'FRI-SUN'` indicating
>       the range from Friday to Sunday are not supported.

It’s possible to configure ingestion schedule for a specific table during its enablement.
To enable a single table and set its ingestion schedule, call the `ENABLE_TABLE` stored procedure with the following arguments:

```sqlsyntax
CALL ENABLE_TABLE('<table_name>', <table_config>);
```

Where:

> `table_name`
> :   Specifies a ServiceNow® table name to enable.
>
> `table_config`
> :   Object including `schedule` property, which specifies the configuration of the table synchronization.
>     Check `schedule` of `CONFIGURE_DATA_INGESTION_SCHEDULE` stored procedure for details.

For example to enable ingestion of table `table_1` and synchronize data every 3h call the following stored procedure:

```sqlsyntax
CALL ENABLE_TABLE('table_1', { 'schedule': { 'type': 'interval', 'value': '3h' } });
```

The connector also allows you to specify a different schedule for each table that is enabled for
synchronization. To change the synchronization schedule for a selected set of tables, call the
`CONFIGURE_TABLES_SCHEDULE` stored procedure with the following arguments:

```sqlsyntax
CALL CONFIGURE_TABLES_SCHEDULE(<table_names>, <schedule>);
```

Where:

> `table_names`
> :   Specifies an array of table names for which you want to configure the synchronization schedule.
>
> `schedule`
> :   Specifies the frequency of the synchronization. Check `schedule` of `CONFIGURE_DATA_INGESTION_SCHEDULE` stored procedure for details.

For example to ingest tables `table_1` and `table_2` each Saturday and Sunday at 11:00 PM UTC call the following stored
procedure:

```sqlsyntax
CALL CONFIGURE_TABLES_SCHEDULE(['table_1', 'table_2'], { 'type': 'custom', 'value': { 'hour': 23, 'day_of_week': '0,6' } });
```

By default the connector tries to start the ingestion in 3 hour time window from scheduled start time. If it
is not possible to start the ingestion within that time frame, for example, when the connector is ingesting other
tables, the current scheduled run is not executed. The connector attempts to run the ingestion at the next
scheduled time frame. It is possible to change the duration of the time frame by calling `CONFIGURE_CUSTOM_SCHEDULE_START_INGESTION_WINDOW`
stored procedure:

```sqlsyntax
CALL CONFIGURE_CUSTOM_SCHEDULE_START_INGESTION_WINDOW(<window_length>);
```

where `window_length` is the window length in ISO 8601 duration format. The duration must be rounded up to
the next whole hour, and must last for at least 1 hour. For example, value `'PT12H'` specifies a window that lasts for
12 hours, and `'P2D'` specifies a window that lasts for 2 days.

If you only enable tables with custom schedules, this configuration only affects time it takes to create and
refresh flattened views for the configured tables. The flattened views are created in the first ingestion cycle after
the following conditions are met:

* Ingestion of metadata tables is finished
* Ingestion of the configured table has started.

If email alerts are enabled, Snowflake recommends changing the alert frequency to Once per day when using
custom scheduling.

### Specify whether deletions should be synchronized

You can specify if the connector should synchronize deletions from ServiceNow® to Snowflake. By default, the connector
synchronizes deletions if a journal table is configured. However, you might want to disable deletions synchronization
of a specific table and not change the global configuration.

To enable table ingestion with specified deletions synchronization setting, run the following command:

```sqlsyntax
CALL ENABLE_TABLE('<table_to_enable>', <table_config>);
```

Where:

`table_to_enable`
:   Specifies a ServiceNow® table name.

`table_config`
:   Object including `sync_deletions` boolean property. If the value is set to `true`, the connector synchronizes deletions for the table;
    if the value is set to `false`, the connector does not synchronize deletions for the table.

For example, to enable ingestion of the table `incident` but not synchronize the deletions, run the following command:

```sqlsyntax
CALL ENABLE_TABLE('incident', { 'sync_deletions': false });
```

> **Note:**
>
> If you want to use the default configuration, don’t provide the `sync_deletions` property in the configuration object.
> If the journal table is not configured, the connector does not synchronize deletions regardless of the provided configuration.

### Specify whether display values should be fetched

The connector can fetch display values for any supported types of fields in ServiceNow®.
Display values are readable values that correspond to the actual values stored in the database, for example, a field with a value of `1` might have a display value of `High`.
To learn more about display values, see [the ServiceNow® documentation](https://www.servicenow.com/docs/bundle/xanadu-platform-administration/page/administer/field-administration/concept/c_DisplayValues.html).

The resolved value is displayed in a flattened view in a separate column with the suffix `__DISPLAY_VALUE`.
The connector creates text and boolean columns with the Snowflake types, however for other types, for example,
different possible formats of number or date values, the display values are stored as variants.

> **Warning:**
>
> Metadata tables are not supported for display values fetching.

> **Note:**
>
> Configuration changes do not affect the previously ingested data. Display values fetching applies only to the newly ingested records.
> To fetch display values for the already ingested data, the table must be reloaded.

#### Display values fetching per table

To enable fetching display values for a specific table, call the `ENABLE_TABLE` stored procedure with the following arguments:

```sqlsyntax
CALL ENABLE_TABLE('<table_to_enable>', <table_config>);
```

Where:

`table_to_enable`
:   Specifies a ServiceNow® table name.

`table_config`
:   Object including `fetch_display_values` boolean property. If the value is set to `true`, the connector fetches display values for the table;
    if the value is set to `false` (default), the connector does not fetch display values for the table.

For example, to enable ingestion of the table `incident` and fetch display values for it, run the following command:

```sqlsyntax
CALL ENABLE_TABLE('incident', { 'fetch_display_values': true });
```

> **Note:**
>
> Per table configuration is not affected by the global configuration.

#### Configure default display values fetching setting for all tables

To enable fetching display values for all tables, call the `CONFIGURE_DISPLAY_VALUE_FETCHING` stored procedure with the following arguments:

```sqlsyntax
CALL CONFIGURE_DISPLAY_VALUE_FETCHING(<fetch_display_values>);
```

Where:

`fetch_display_values`
:   Specifies a boolean value. If the value is set to `true`, the connector fetches display values for all tables;
    if the value is set to `false` (default), the connector does not fetch display values for any table by default.

For example, to enable fetching display values for all tables, run the following command:

```sqlsyntax
CALL CONFIGURE_DISPLAY_VALUE_FETCHING(true);
```

### Specify the data range start time

By default, the Snowflake Connector for ServiceNow® synchronizes all the records in the corresponding ServiceNow® tables. For the tables with: `sys_updated_on` or `sys_created_on`
columns (from now on here called *time columns*) present, it is possible to restrict the range of synchronized
data by setting a *data range start time* - i.e. lower bound for the corresponding *time column* value of the records.

With such a configuration, records with the corresponding *time column* value older than the *data range start timestamp* are **not** ingested.
The corresponding *time column* used by this procedure is determined in the same way as for the incremental updates .

To change the *data range start time* value, call the `CONFIGURE_TABLES_RANGE_START` stored procedure with the following arguments:

> ```sqlsyntax
> CALL CONFIGURE_TABLES_RANGE_START(<table_names>, <range_start>);
> ```

Where:

> `table_names`
> :   Specifies an array of table names for which you want to configure the *data range start time*.
>
> `range_start`
> :   Timestamp specifying the *data range start time* in TIMESTAMP_TZ format or NULL to unset the current value.

> **Note:**
>
> You cannot set the data range start time for the tables with neither `sys_updated_on` nor `sys_created_on` column present.

* If the ingestion of the table has not been started yet, the *data range start time* value is taken into account with the first ingestion.
* If the ingestion of the table has already been started (e.g. a reload is in progress), the *data range start time* value is ignored
  and (another) reload of the table(s)
  is required to filter out the records with too old corresponding *time column* value.

It is therefore recommended to set the *data range start time* before starting the first ingestion of a table (hence also before enabling it).

For example, if tables `table1` and `table2` have the required *time column(s)*, in order to set the *data range start time* to 2022-11-23 07:00:00 UTC for theses two tables,
run the following command:

> ```sqlexample
> CALL CONFIGURE_TABLES_RANGE_START(['table1', 'table2'], TO_TIMESTAMP_TZ('2022-11-23 07:00:00 +00:00'));
> ```

Then:

* for table `table1`, for example, if its ingestion has not started yet, all records with a corresponding *time column* value before 2022-11-23 07:00:00 are **not** ingested.
* for table `table2`, for example, if its ingestion has already started, the *data range time start* value is ignored in all data synchronizations until reloading this table.
  During the reload, all records with a corresponding *time column* value before 2022-11-23 07:00:00 are not ingested.

It is also possible to unset the *data range start time*. For example, in order to unset it for table `table1`, run the following command:

> ```sqlexample
> CALL CONFIGURE_TABLES_RANGE_START(['table1'], NULL);
> ```

Again, if an ingestion of table `table1` has already been started, reloading this table is required to ingest all the records back from ServiceNow®.

> **Note:**
>
> Loading data with the *data range start time* may take longer than loading all historical data because of lower performance of incremental updates.

## Reload data in a table

The connector allows you to reload data in a table. It’s useful when you want to apply the changes in the configuration to the
already ingested data or when you want to make sure that the data is up to date with the source.

There are two types of reloads, full for complete data replacement and filtered for affecting only part of the data by
specifying conditions for the reload.

> > **Note:**
> >
> > Every reload takes the current reloaded table configuration into account. For example, this can restrict which records are ingested.
> >
> > To see the configuration of the main table, check the `CONFIGURED_TABLES` view.
> >
> > To see the result configuration of the reloaded table, check the `RELOADED_TABLES` view.

### Full reload

To reload data in particular table, call the `RELOAD_TABLE` stored procedure:

```sqlsyntax
CALL RELOAD_TABLE('<table_name>');
```

Where:

`table_name`
:   Specifies the name of the table to reload.

When you call the `RELOAD_TABLE` stored procedure, the connector performs the following:

1. The connector suspends the original table for ingestion temporarily.

   > **Note:**
   >
   > While the table is being reloaded, you cannot re-enable the table for ingestion.
2. The connector creates a separate temporary table for ingestion.
3. The connector ingests the data into this new temporary table. This table is visible in
   the [CONNECTOR_STATS](monitoring.md) view as a table named with a
   `__tmp` suffix).
4. After the data is ingested, the connector replaces the data in the original table with the data in the
   temporary table.
5. The connector deletes the temporary table.
6. The connector re-enables the original table for ingestion.

During this process, you can continue to query the existing data in the original table. However,
changes to the data in the ServiceNow® table won’t be reflected in the Snowflake table
until the ingestion process completes.

### Filtered reload

To reload only part of data in particular table, call the `RELOAD_TABLE` stored procedure with a configuration object parameter:

```sqlsyntax
CALL RELOAD_TABLE('<table_name>', <config>);
```

Where:

`table_name`
:   Specifies the name of the table to reload.

`config`
:   Specifies the configuration of the reload. The configuration object can include the following properties:

    * `sys_ids`: An array of ServiceNow® record identifiers (`sys_id`) to be reloaded.
    * `data_reload_range_start_time` and `data_reload_range_end_time`: Timestamp values specifying the data range in TIMESTAMP_TZ format.
      Depending on the given table ingestion type, only records with `sys_updated_on` or `sys_created_on` within the specified time frame are reloaded.
    * `conditions`: A string expression that specifies the conditions on the fields in a ServiceNow® table.
      Only the records that meet the conditions are reloaded.

      The syntax of the expression is the same as for the row filtering.
      If row filtering is configured on the regular table, it is applied to the conditions as well.

In contrast to the full reload, the filtered reload does not replace the data in the original table, but only changes the selected records.

> **Tip:**
>
> Right after enabling a large table for ingestion for the first time, you can quickly ingest a small subset of records
> that you are interested in without waiting for the initial load to complete using filtered reload.

> **Note:**
>
> `data_reload_range_start_time` and `data_reload_range_end_time` time ranges and `conditions` filter
> can be used simultaneously. The records that meet both conditions are reloaded.
>
> `sys_ids` is exclusive with other configuration properties.

For example, to reload only the records with the `sys_id` values of `1`, `2`, and `3` in the table `incident`, run the following command:

```sqlsyntax
CALL RELOAD_TABLE('incident', { 'sys_ids': ['1', '2', '3'] });
```

To reload only the records with the `sys_updated_on` values between 2022-11-23 07:00:00 and 2022-11-23 08:00:00 UTC,
and are still active in the table `incident`, run the following command:

```sqlsyntax
CALL RELOAD_TABLE('incident', {
  'data_reload_range_start_time': TO_TIMESTAMP_TZ('2022-11-23 07:00:00 +00:00'),
  'data_reload_range_end_time': TO_TIMESTAMP_TZ('2022-11-23 08:00:00 +00:00'),
  'conditions': 'active = "true"'
});
```

### Cancel table reload

To cancel the process of reloading the data in a table, use the `CANCEL_RELOAD_TABLE` stored procedure as
shown in the following example:

```sqlsyntax
CALL CANCEL_RELOAD_TABLE('<table_name>');
```

Where:

`table_name`
:   Specifies the name of the table whose reload you want to cancel.

When you cancel the reload, the connector drops all temporary objects created during the reload. The table is
then available for ingestion as part of the normal synchronization schedule.

## Configure the use of read replicas

To configure the connector to use read replicas in your ServiceNow® environment, you can set up a custom query category.
This configuration allows the connector to direct API requests to read replicas instead of the primary instance, which
can help distribute load and improve performance.

To configure a custom query category for read replica usage, call the `CONFIGURE_QUERY_CATEGORY` stored procedure with
the following argument:

```sqlsyntax
CALL CONFIGURE_QUERY_CATEGORY('<query_category>');
```

Where:

`query_category`
:   Specifies the query category identifier that will be added to ServiceNow® API requests.

When configured, the connector will add the `sysparm_query_category=<query_category>` parameter to all ServiceNow® API requests, allowing ServiceNow® to route these requests to the appropriate read replicas based on your instance configuration.

Default query category value set during connector installation is `list`.

For example, to configure the connector to use a query category named `connector_replica`, run the following command:

```sqlsyntax
CALL CONFIGURE_QUERY_CATEGORY('connector_replica');
```

## Configure the size of a single page fetch for a table

The connector fetches data from a table by dividing it into smaller chunks called pages.
Each API request to ServiceNow® fetches one page.

To account for this, the connector limits the number of rows fetched within a single API request. This limit is the page size.

The connector uses the following process to determine the page size:

1. Initially, the default page size is set to 10,000 rows.
2. If the fetch request fails during ingestion because the response size is exceeded, the page size is
   gradually decreased by 1000, 100, 10 and 1 until the request succeeds or the final page size
   is set to 1.
3. The successful page size is saved in the connector state and this value is used by subsequent requests.

The current page size for a table is available in the `TABLES_STATE` view. To see the page size, run
the following command:

```sqlsyntax
SELECT PAGE_SIZE FROM TABLES_STATE WHERE TABLE_NAME = '<table_name>';
```

Where:

`table_name`
:   Specifies the name of the ServiceNow® table being ingested.

The process the connector uses for determining the page size may lead to inefficiencies. This process only
reduces the page size. It does not increase the page size. This can happen in situations where a table has a
single large row that causes the page size to be set to a lower value.

To avoid this situation, you can manually set the page size by calling the `RESET_PAGE_SIZE` stored
procedure as shown in the following examples:

```sqlsyntax
CALL RESET_PAGE_SIZE('<table_name>');
```

or

```sqlsyntax
CALL RESET_PAGE_SIZE('<table_name>', <page_size>);
```

Where:

`table_name`
:   Specifies the name of the ServiceNow® table being ingested.

`page_size`
:   (Optional) Specifies the number of rows to fetch in a single page. If not provided, the default value provided in the connector configuration is used. The default and recommended value is 10000. The minimum value is 1 and the maximum value is 25000.

> **Note:**
>
> The page size can be also set for a configured journal table, usually `sys_audit_delete`. If failures occur
> during the deletions ingestion from an underperforming journal table, you can lower the page size to avoid further failures.
>
> Note that the journal table does not need to be explicitly enabled for ingestion to make the connector synchronize deleted rows.

## Ingestion run

Ingestion runs for a given table are triggered according to the configured schedule.
A run downloads all the relevant rows divided into pages mentioned in the previous paragraph from the source table in a loop.

**Initial load and updates**

As soon as a page of data is fetched, it is inserted into the corresponding event log table.
At this stage the newly fetched changes are not yet available in the sync table or through flattened views.
When it is done the next request with updated criteria is issued as long as any data is returned.
When the ingestion run is complete, and there is no more data to fetch in the source table, an asynchronous merge task is triggered,
that takes all the changes from the event log inserted since the last merge and applies them to the sync table.
When it is complete, the data becomes available in sync table and flattened views.

**Truncate and load**

In truncate and load mode a temporary table is created for each ingestion run.
Each fetched page of rows is first inserted into this temporary table (this table exists in the internal connector schema and is not available to connector users).
At this stage the newly fetched changes are not yet available in the sync table or through flattened views, they still show data fetched in the previous run.
When the ingestion run is completed, and there is no more data available in the source table, data from the temporary table replaces existing data in the sync table.
All the fetched rows are also added to the event log.
At the end the temporary table is dropped.

**Monitoring progress**

To check the status of a current or past ingestion run, you can query the `CONNECTOR_STATS` view. It’s visible in the `STATUS` column.
It’s set to `DONE` only if data was successfully fetched and all the changes were applied to the sync table.
When the ingestion is running or the merge to the sync table/replace of rows in the sync table has not been completed yet, the status is `RUNNING`.

## Next steps

After configuring ingestion, perform the steps described in [Access the ServiceNow® data in Snowflake](accessing-data.md) to view and otherwise access ServiceNow® data.

---
title: Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-setting-up-data.md
section: Connectors & Drivers
---

# Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance

This topic describes how to access the Snowflake Connector for Google Analytics Aggregate Data in your Snowflake account.

## Add reports in the connector

To set up data ingestion using Snowsight, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Snowflake Connector for Google Analytics Aggregate Data, and then select the tile for the connector.
4. In the Data sync section, select Add report.
5. In the new dialog, complete the following fields:

   | Field | Description |
   | --- | --- |
   | Report name | Identifier for the new report  Specify a name that is unique for your destination schema. The name of the report must follow the naming rules for [unquoted object identifiers](../../../sql-reference/identifiers-syntax.md). |
   | Property | A Google Analytics property that holds the data you want to ingest  Choose one of the available Google Analytics properties.  **Note:** If a Google Analytics property that you want to use is not available, check whether the credentials used by the connector have access to it. For more information, see [Preparing your Google Analytics and Google Cloud accounts](gaad-connector-prereqs.md). |
   | Dimensions | Google Analytics 4 dimensions to be included in your report  Dimensions are attributes of your data. For example, the dimension `city` indicates the city from which an event originates. The connector includes the `date` dimension in all reports.  **Note:** The dimensions field appears after you select the Google Analytics property. At most nine dimensions can be configured.  For more details about the available dimensions, see [API Dimensions & Metrics](https://developers.google.com/analytics/devguides/reporting/data/v1/api-schema). |
   | Metrics | Google Analytics 4 metrics to be included in your report  Metrics are quantitative measurements of a report. For example, the metric `active1DayUsers` is the number of distinct active users on your site or app within a one-day period. You must select at least one metric.  **Note:** The metrics field appears after you select the Google Analytics property. At most 10 metrics can be configured.  For more details about the available metrics, see [API Dimensions & Metrics](https://developers.google.com/analytics/devguides/reporting/data/v1/api-schema). |
   | Keep empty rows | If this is selected, the ingested data should contain records with dimension combinations for which all the metrics are zero (indicating that there were no events correlated with those dimensions).  **Note:** Some dimension combinations might not be present in the ingested data. |
   | Avoid sampling | If selected, the connector may cancel some ongoing ingestion runs and retry with a shorter interval length to download unsampled data, see [Snowflake Connector for Google Analytics Aggregate Data ingestion model](gaad-ingestion-model.md). |
   | Sync data from | Start date for the initial load of data |
   | Sync schedule | Sync frequency for the ongoing load of data |
6. Select Start Sync.

It can take a few minutes for the ingestion process to be complete. The table and view with your report data will not be visible in the destination
database until the data from GA is fully fetched.

## Delete reports from the connector

> **Note:**
>
> Deleting the report does not delete the ingested data for that report.

1. In the Data sync section, next to the report that you want to delete, select the trash bin button .

   A confirmation dialog asks you to confirm that you want to delete the selected report.
2. Select Delete Report.

The delete report process may take several minutes.

---
title: Set up tasks for the Snowflake High Performance connector for Kafka
source: https://docs.snowflake.com/en/connectors/kafkahp/setup-tasks.md
section: Connectors & Drivers
---

# Set up tasks for the Snowflake High Performance connector for Kafka

This topic describes the overall tasks required to set up, configure, and run the Snowflake High Performance connector for Kafka.

## Prerequisites

Ensure the following prerequisites are met:

1. Ensure that you have reviewed [Snowflake High Performance connector for Kafka](about.md).
2. Ensure that you have reviewed [how the connector works](how-the-connector-works.md).
3. You have configured Kafka with the desired data retention time and/or storage limit.
   See [how the connector works](how-the-connector-works.md) for all supported properties.
4. You have installed and configured the Kafka Connect cluster. .
   Snowflake recommends using the same versions on Kafka Broker and Kafka Connect Runtime.
5. You have configured the Kafka Connect cluster to run in the same cloud provider
   [region](../../user-guide/intro-regions.md) as your Snowflake account. .
   Snowflake strongly recommends running your Kafka Connect instance in the same cloud provider region
   as your Snowflake account. .
   While not strictly required, running in the same region typically improves throughput. .
   As such, Snowflake strongly recommend running your Kafka Connect instance in the same cloud
   provider region as your Snowflake account.

## Tasks

Perform the following tasks to set up, configure, and run the Snowflake High Performance connector for Kafka.

| Order | Task | Description |
| --- | --- | --- |
| 0 | Review Prerequisites | Review and confirm all required prerequisites. |
| 1 | [Configure Snowflake](setup-snowflake.md) | Configure Snowflake for the Snowflake High Performance connector for Kafka. |
| 2 | [Configure Kafka](setup-kafka.md) | Configure Kafka for the Snowflake High Performance connector for Kafka. |
| 3 | [Test the connector](test-connector.md) | Test the connector with a small amount of data. |

---
title: Set up the Snowflake Connector for SharePoint
source: https://docs.snowflake.com/en/connectors/unstructured-data-connectors/sharepoint/setup.md
section: Connectors & Drivers
---

# Set up the Snowflake Connector for SharePoint

> **Note:**
>
> The Snowflake Connector for SharePoint is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms/).

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for SharePoint.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements is not guaranteed. The new solution is available as [Openflow Connector for SharePoint](../../../user-guide/data-integration/openflow/connectors/sharepoint/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes the steps to set up your Snowflake Connector for SharePoint.

## Prerequisites

Before you begin installing and configuring the connector, you must do the following:

1. Ensure that you have a [Microsoft Graph](https://learn.microsoft.com/en-us/graph/overview) application with the following permissions:

   * [Sites.Selected](https://learn.microsoft.com/en-us/graph/permissions-reference#sitesselected): limits access only to specified sites.
   * [Files.SelectedOperations.Selected](https://learn.microsoft.com/en-us/graph/permissions-reference#filesselectedoperationsselected): limits access only to files in specified sites.
   * [GroupMember.Read.All](https://learn.microsoft.com/en-us/graph/permissions-reference#groupmemberreadall): used for resolving SharePoint group permissions.
2. Configure SharePoint to enable OAuth authentication as described
   in [Get access without a user](https://learn.microsoft.com/en-us/graph/auth-v2-service?tabs=http#authentication-and-authorization-steps).
   The connector uses the following Microsoft Graph APIs to fetch data from SharePoint:

   * [Download driveItem content](https://learn.microsoft.com/en-us/graph/api/driveitem-get-content?view=graph-rest-1.0&tabs=http)
   * [driveItem: delta](https://learn.microsoft.com/en-us/graph/api/driveitem-delta?view=graph-rest-1.0&tabs=http)
   * [List who has access to a file](https://learn.microsoft.com/en-us/graph/api/driveitem-list-permissions?view=graph-rest-1.0&tabs=http)
   * [group: delta](https://learn.microsoft.com/en-us/graph/api/group-delta?view=graph-rest-1.0&tabs=http)
   * [List group members](https://learn.microsoft.com/en-us/graph/api/group-list-members?view=graph-rest-1.0&tabs=http)
3. Get the site URL of your Microsoft 365 SharePoint site with files or folders that you want to ingest into Snowflake
   and the credentials from your Azure or Office 365 account administrator.

## Install the Snowflake Connector for SharePoint

Connectors are instances of Snowflake native applications.
To install the Snowflake Connector for SharePoint, do the following:

1. Sign in to Snowflake as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for SharePoint and select Get.
4. In the dialog box, expand Options and enter the following information:

   * In Application name, enter a name for your connector application.
   * In Warehouse used for installation, select the warehouse that you want to use for installing the connector.

     > **Note:**
     >
     > This is not the same warehouse that is used by the connector to synchronize data from SharePoint. In a later step,
     > you will create a separate warehouse for this purpose.
5. Select Get to begin the installation process. This can take a few minutes to complete.
6. After the connector is successfully installed, either select Configure to proceed with
   the configuration or select Done to close the dialog box and complete the installation.

### Optional: Install multiple instances of Snowflake Connector for SharePoint

You can install multiple instances of the Snowflake Connector for SharePoint on your Snowflake account.
To install an additional instance, do the following:

1. Navigate to Snowflake Marketplace and select Snowflake Connector for SharePoint. The application details page appears.
2. Click Add instance. The installation dialog appears.
3. Provide the instance name and select the warehouse to be used during the installation.
4. Select Get to begin the installation process.

> **Note:**
>
> * Adding connector instances can take several minutes. When the installation process completes, you get an email notification.
> * To avoid ingested data corruption, during connector configuration, always use a database schema that is different from all other native applications.

## Configure the Snowflake Connector for SharePoint

Each connector application instance must be configured to communicate with its associated Sharepoint instance.
After completing the installation process, proceed with the following steps.

1. Ensure that all the prerequisites are completed. For more information see Prerequisites.
2. If required, open the configuration wizard as follows:

   1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
   2. In the navigation menu, select Catalog » Apps.
   3. Search for the Snowflake Connector for SharePoint and select it.

### Configure

3. In the Configure step of the wizard, enter information in the following fields:

> > **Note:**
> >
> > By default, the fields are set to the names of objects created when you configure the connector.
> > Snowflake recommends using new objects for these fields.
> > However, if required, you can specify the names of existing objects, for example if reinstalling the connector.
>
> | Field | Description |
> | --- | --- |
> | Warehouse for Ingestion Data | Identifier for a new dedicated virtual warehouse for the connector. This warehouse is used for computing the data ingestion and document processing tasks.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  Alternatively, you can select an existing warehouse.  **Note:** Do not specify the warehouse used during the initial creation of the connector. |
> | Warehouse for Cortex Search: | Identifier for a new, dedicated Cortex search virtual warehouse. This warehouse is used to process and serve Cortex Search queries.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation. The configuration process creates a new X-Small warehouse with the specified name. |
> | Role for Cortex Search | Identifier for a new custom role for the connector. Specify a name that is unique for your account. The name of the role must be a valid [object identifier](../../../sql-reference/identifiers-syntax.md).  Users who have granted the role can use their account to query Cortex REST API about the data ingested by the application. By default, only the account that you used to install the connector has permission to query Cortex. |

4. Click Configure to continue.

### Authenticate and connect to Sharepoint

> **Important:**
>
> Ensure that pop-ups are enabled in your browser.

5. In the Authentication step of the wizard, enter the following information and credentials to complete the OAuth2 authentication and connect to SharePoint.

   Contact your Azure or Office 365 account administrator for this information.

   | Field | Description |
   | --- | --- |
   | SharePoint site URL | URL or Sharepoint site from which the connector will ingest content.  For top-level sites, use domain name only, for example, `sitename.sharepoint.com`. For sub-sites,use a domain name with the site path, for example, `sitename.sharepoint.com/sites/SubSite`. |
   | Client ID | Enter your client ID. To learn about client ID and how to find it in Microsoft Entra, see [Application ID (client ID)](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#application-id-client-id). |
   | Client secret | Enter your client secret. To learn about a client secret and how to find it in Microsoft Entra, see [Certificates & secrets](https://learn.microsoft.com/en-us/azure/healthcare-apis/register-application#certificates--secrets). |
   | Tenant ID | Enter your tenant ID. To learn about tenant ID and how to find it in Microsoft Entra, see [Find your Microsoft 365 tenant ID](https://learn.microsoft.com/en-us/sharepoint/find-your-office-365-tenant-id). |
6. Click Next to begin the connection process, which can take several minutes to complete.

### Validate source

In the Validate source step of the wizard, do the following:

7. Select the source from which you want to fetch the files:

   * Select All folders if you want to fetch files from all the folders that are accessible through
     the credentials you provided in Authenticate and connect to Sharepoint.
   * Select Specific folder if you want to fetch files from a specific folder that is accessible
     through the credentials you provided in Authenticate and connect to Sharepoint.

     > **Note:**
     >
     > This path is relative to the Shared Documents folder. For example, to ingest
     > files from the folder `Shared%20Documents/user_manuals/cars`, enter `user_manuals/cars`.
   > **Note:**
   >
   > To change the fetch file source at a later time, you must reinstall the connector.
8. Click Validate to start the process of validating source, which can take several minutes.
9. After your connector is successfully configured, click Ingest files to begin data ingestion.

## Next step

Once your connector is set up, continue on to [Query the Cortex Search service with Snowflake Connector for SharePoint](cortex.md).

---
title: Setting up data ingestion for your Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-setting-up-data.md
section: Connectors & Drivers
---

# Setting up data ingestion for your Snowflake Connector for Google Analytics Raw Data

This topic describes how to access Snowflake Connector for Google Analytics Raw Data in your Snowflake account.

> **Note:**
>
> Any single property can only be ingested from one GCP project at a time. Changing the project for a previously-configured property currently requires reinstalling the connector. This limitation will be removed in the future.
>
> If you change the export settings for a property, and start exporting it into a different GCP project, you should also manually move data from the previous BigQuery instance, and consolidate it in the newly-configured one.

## Setting up data ingestion using Snowsight

To set up data ingestion using Snowsight, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Raw Data, then select the tile for the connector.
4. In the page for the Snowflake Connector for Google Analytics Raw Data, navigate to the Data Sync section.

   This displays a list of all the Google Analytics properties.
5. Select the properties you want to ingest:

   1. Search for the property you want to ingest.
   2. Select the checkbox in the Status column next to the property you want to select.
   3. Repeat these steps for each property you want to ingest into Snowflake.
6. Select the heading of the Status column to see the properties you have currently selected.
7. Select Start sync to begin ingesting data into your Snowflake account.

Selected properties appear in the properties list.

Data Ingestion status will be displayed in the right top corner of the Manage data synchronization section.

Data sync for each property will create two loads:

* Initial load, which ingests historical data. It starts with the current day and runs backward until the first day, for which data is available is reached.
* Present load, which ingests data from the current day and runs forward.

If you wish to only sync current data, you can do so via a worksheet.

Enabling a property using Snowsight will cause the connector to attempt ingestion for all possible export types. If you want to
ingest only specific export types, for example if you only have `events_` tables in BigQuery, you can do so by using SQL statements.

> **Note:**
>
> Once a property **with** an initial load is enabled, initial load can be disabled. On the other hand,
> when property is enabled **without** an initial load, initial load cannot later be enabled.

### Modifying data ingestion using Snowsight

To modify the Google Analytics tables to be ingested or the synchronization schedule for the tables, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Raw Data, then select the tile for the connector.
4. In the page for the Snowflake Connector for Google Analytics Raw Data, navigate to the Data Sync section.
5. Select Edit properties.
6. Modify the tables you want to ingest:

   1. Search for the table you want to ingest.
   2. Select the checkbox in the Status column next to the table you want to select or deselect.
7. Select Update data sync.

## Setting up data ingestion using SQL statements

To set up data ingestion using SQL statements, do the following:

* List the properties available for ingestion.
* Prepare destination database.
* Enable ingestion of a property.

> **Note:**
>
> To configure these settings, use stored procedures that are defined in the PUBLIC schema of
> the database that serves as an instance of the connector installation database.
>
> Before calling these stored procedures, select that database as the database to use for the session.
>
> For example, if that database is named `snowflake_connector_for_google_analytics_raw_data`, run the following command:
>
> ```sqlexample
> USE DATABASE snowflake_connector_for_google_analytics_raw_data;
> ```

### Listing the properties available for ingestion

To list all the available properties in a given GCP project, call the following stored procedure:

> ```sqlsyntax
> CALL LIST_GA_PROPERTIES();
> ```

The result displays all the available projects and properties to ingest by an authorized account. If no results are returned please check:

* If the data export from Google Analytics to BigQuery is configured.
* If exported data is visible in BigQuery.
* If proper roles are assigned to the used Service Account / authenticated user.

Please be advised that it can take up to 24 hours between setting up the data export and storing data in BigQuery.
This delay can be a cause for the `LIST_GA_PROPERTIES` procedure producing no results.

Turning the Google Analytics export off does not mean the property is ommited by `LIST_GA_PROPERTIES`.
Even though the export was turned off, data can still persist in BigQuery and can be synchronized by the connector.

### Preparing destination database

Before enabling the ingestion, you need to grant the connector access to creating tables and views inside your destination database and schema.

> ```sqlsyntax
> GRANT USAGE ON DATABASE <destination database> TO APPLICATION <application name>;
>
> GRANT USAGE ON SCHEMA <destination database>.<destination schema> TO APPLICATION <application name>;
>
> GRANT CREATE TABLE ON SCHEMA <destination database>.<destination schema> TO APPLICATION <application name>;
>
> GRANT CREATE VIEW ON SCHEMA <destination database>.<destination schema> TO APPLICATION <application name>;
> ```

### Enabling or disabling the ingestion of a property

To enable or disable the synchronization of data for a specific property in Google Analytics, call the `ENABLE_PROPERTIES`
stored procedure with the following arguments:

> ```sqlsyntax
> CALL ENABLE_PROPERTIES('<gcp_project>', ['<properties_to_configure>'], <enable_initial_load>, <exclude_nulls>, <disable_auto_reloads>, <enabled_export_types>);
> ```

Where:

`gcp_project`
:   Specifies the GCP project of the enabled properties.

`properties_to_configure`
:   Specifies a comma-delimited list of Google Analytics properties names in single quotation marks.

    Use the property name without the `analytics_` prefix.

`enable_initial_load`
:   A boolean indicating whether to enable or disable the initial data load, which ingests all historical data for a property in parallel to the current sync.

    This is an optional argument and the default value for it is `true`.

    When a property was previously enabled, this flag is ignored, and ingestion will continue from the point when it stopped when the property was disabled.

`exclude_nulls`
:   Optional boolean indicating whether to exclude fields containing null values from the ingested data. Setting this parameter to `true`
    can improve the data ingestion throughput. Default value is `false`.

`disable_auto_reloads`
:   An optional boolean indicating whether to disable automatic reloads. For more details about auto reload see [Data ingestion model for the Snowflake Connector for Google Analytics Raw Data](gard-connector-data-ingestion-model.md).
    Setting this value to `true` can reduce credit consumption, but late data won’t be ingested into Snowflake. This property cannot be set to `true` for the `FRESH_DAILY` export type.
    Default value is `false`.

`enabled_export_types`
:   An optional list of export types, which connector will try to ingest data for. Possible values are: `DAILY`, `FRESH_DAILY`, `INTRADAY`, `USERS` and `PSEUDONYMOUS_USERS`.
    By default, all export types, except `FRESH_DAILY`, will be enabled.

For example, to enable the synchronization of the properties named `property1`, `property2`, and `property3` in the project `gcp_example_project`, run
the following query:

> ```sqlexample
> CALL ENABLE_PROPERTIES('gcp_example_project', ['property1','property2','property3']);
> ```

To enable properties without the initial data loading, use an ENABLE_PROPERTIES query similar to:

> ```sqlexample
> CALL ENABLE_PROPERTIES('gcp_example_project', ['property1','property2','property3'], false);
> ```

If only have daily and user data in BigQuery, you can explicitly omit the intraday export by running the following query:

> ```sqlexample
> CALL ENABLE_PROPERTIES(PROJECT_ID => 'gcp_example_project', PROPERTY_IDS => ['property1'], ENABLED_EXPORT_TYPES => ['DAILY', 'FRESH_DAILY', 'USERS', 'PSEUDONYMOUS_USERS']);
> ```

You can use named arguments to specify specific arguments and leave the remainder unchanged.
For example, to enable properties with the initial load and exclude fields containing null values, run the following query:

> ```sqlexample
>  CALL ENABLE_PROPERTIES(
>     PROJECT_ID => 'gcp_example_project',
>     PROPERTY_IDS => ['property1', 'property2', 'property3'],
>     INITIAL_LOAD => TRUE,
>     EXCLUDE_NULLS => TRUE
> );
> ```

To prevent these properties from being ingested, run the following command:

> ```sqlexample
> CALL DISABLE_PROPERTIES('gcp_example_project', ['property1','property2','property3']);
> ```

Disabling the property stops its synchronization. When the property synchronization is disabled, the whole ingestion that started, but not finished yet is removed from the destination database.

The `ENABLE_PROPERTIES` procedure adds the specified property names to the `ENABLED_PROPERTIES` view.

## Initial load

After enabling a new property, the connector starts ingesting all historical data found in BigQuery in parallel to the current sync responsible for collecting new events.
The initial load runs backwards, starting from the current day until the first day for which data is available is reached.

## Reloading already ingested data

To reload already ingested data, or to load data that has not been ingested at all (e.g. because you enabled property without initial load, or data was absent in BigQuery and now it’s available) you can call one of the following procedures:

> ```sqlexample
> CALL RELOAD_PROPERTY('<property id>');
> ```
>
> This procedure triggers reload of all data (`DAILY`, `FRESH_DAILY`, `INTRADAY`, `USERS` and `PSEUDONYMOUS_USERS`) of a given property, between the earliest table it can find in BigQuery and the last ingested (or terminally marked as `DATA_NOT_FOUND`) table date between the connector.
>
> ```sqlexample
> CALL RELOAD_PROPERTY('<property id>', <first date>, <last date>);
> ```
>
> Triggers reload of all data (`DAILY`, `FRESH_DAILY`, `INTRADAY`, `USERS` and `PSEUDONYMOUS_USERS`) of a given property, between provided dates.
>
> ```sqlexample
> CALL RELOAD_PROPERTY('<property id>', '<export type>', <first date>, <last date>);
> ```
>
> Triggers reload of `DAILY`, `FRESH_DAILY`, `INTRADAY`, `USERS` or `PSEUDONYMOUS_USERS` data of a given property, between provided dates.

> **Note:**
>
> * Reload is processed in parallel to main load.
> * You can trigger as many reloads of a property, as you want, as long date ranges do not overlap.
> * Data is swapped after downloading each table from BigQuery.
> * Reload swaps data only if there is data in BigQuery for particular day.

Ongoing reloads can be observed via dedicated view:

> ```sqlexample
> SELECT * FROM PUBLIC.ONOGOING_RELOADS;
> ```

To cancel ongoing reload execute following query:

> ```sqlexample
> CALL CANCEL_RELOAD_PROPERTY('<load id>');
> ```

---
title: Setting up Email Notifications for the MySQL connector
source: https://docs.snowflake.com/en/connectors/mysql6/email-notifications.md
section: Connectors & Drivers
---

# Setting up Email Notifications for the MySQL connector

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

You can enable email notifications for the connector. The connector uses the
[Notification System Stored Procedure](../../user-guide/notifications/email-stored-procedures.md)
to send email notifications. Setting up email notifications is an optional but recommended action.

## Configuring Email Notifications

You can configure the connector to send email notifications when errors occur.

On a given schedule, the connector checks whether new errors have occurred. If so, an email containing the number of
errors is sent to specified recipients. Email notifications are sent on an incremental basis, meaning that only new errors
trigger notification. For security reasons, emails contains only information about the number of errors (not the errors
themselves).

To receive email notifications about errors, you must have already created and set up an event table for the account
(to capture the logged errors), and that event table must have CHANGE_TRACKING set to TRUE.

To configure email notifications do the following:

1. Create a notification integration.
2. Create a log view for the connector.
3. Enable email notifications.

### Create a notification integration

To send email notifications, the connector uses the notification integration object, which is a Snowflake object that provides
an interface between Snowflake and email services.

To create notification integration, run the following command:

> ```sqlsyntax
> CREATE NOTIFICATION INTEGRATION <integration_name>
>     TYPE=EMAIL
>     ENABLED=TRUE
>     ALLOWED_RECIPIENTS=('first.last@example.com','first2.last2@example.com');
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.

The connector requires the `USAGE` privilege on the notification integration that is used to send the email.
To grant this privilege, run the following command:

> ```sqlsyntax
> GRANT USAGE ON INTEGRATION <integration_name> TO APPLICATION <app_db_name>;
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.
>
> `app_db_name`
> :   Specifies the name of the connector database.

More information about creating notification integration can be
found [here](../../user-guide/notifications/email-notifications.md).

### Create a log View for the connector

To configure email notifications you must create a log view for the event table that stores the logged messages from the
connector. You can create the log view in any database and schema, except the database that serves as the connector instance.

Run the following command to create a log view on the event table:

> ```sqlsyntax
> CREATE SECURE VIEW <logs_view> CHANGE_TRACKING = TRUE AS
>   SELECT *
>   FROM <fully_qualified_event_table_name>
>   WHERE RECORD_TYPE = 'LOG' AND
>   RESOURCE_ATTRIBUTES:"snow.database.name" = '<app_db_name>';
> ```

Where:

> `logs_view`
> :   Specifies the name of the view that you want to create.
>
> `fully_qualified_event_table_name`
> :   Specifies the fully-qualified name of the event table.
>
> `app_db_name`
> :   Specifies the name of the connector database.

The connector requires the `SELECT` privilege on the view. It also requires `USAGE` privilege both on the database
and the schema that contains the view. To grant these privileges, run the following commands:

> ```sqlsyntax
> GRANT USAGE ON DATABASE <logs_db> TO APPLICATION <app_db_name>;
> GRANT USAGE ON SCHEMA <logs_db>.<logs_schema> TO APPLICATION <app_db_name>;
> GRANT SELECT ON VIEW <logs_db>.<logs_schema>.<logs_view> TO APPLICATION <app_db_name>;
> ```

Where:

> `logs_db`
> :   Specifies the name of the database that contains the view that you just created.
>
> `logs_schema`
> :   Specifies the name of the schema that contains the view that you just created.
>
> `logs_view`
> :   Specifies the name of the view that you just created.
>
> `app_db_name`
> :   Specifies the name of the connector database.

### Enable email notifications

After creating the email notification integration and the log view, run the following command to enable email notifications
from the connector:

> ```sqlsyntax
> CALL PUBLIC.CONFIGURE_ALERTS('<integration_name>', '<logs_db>.<logs_schema>.<logs_view>', '<schedule>', ['<email_address_1>' [, ... '<email_address_2>']]);
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.
>
> `logs_db`
> :   Specifies the name of the database that contains the view that you created in the previous step.
>
> `logs_schema`
> :   Specifies the name of the schema that contains the view that you created in the previous step.
>
> `logs_view`
> :   Specifies the name of the view that you created in the previous step.
>
> `schedule`
> :   Specifies the schedule or frequency at which the connector should check for errors and send a notification. For details
>     on specifying the schedule or frequency, see [SCHEDULE parameter](../../sql-reference/sql/create-task.md).
>
> `['email_address_1' [, ... 'email_address_2']]`
> :   Specifies the array of one or more quoted email addresses that can receive email notifications from the connector. The email
>     addresses in this array must be in the ALLOWED_RECIPIENTS parameter specified in the [email notification integration](../../user-guide/notifications/email-notifications.md).

To change the configuration of an email notifications, use the above command providing the revised parameters.

### Disabling Email Notifications

To disable email notifications, run the following command:

> ```sqlsyntax
> CALL PUBLIC.DISABLE_ALERTS();
> ```

This command removes all email addresses that were added during the initial configuration.

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md)

---
title: Setting up Email Notifications for the PostgreSQL connector
source: https://docs.snowflake.com/en/connectors/postgres6/email-notifications.md
section: Connectors & Drivers
---

# Setting up Email Notifications for the PostgreSQL connector

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

You can enable email notifications for the connector. The connector uses the
[Notification System Stored Procedure](../../user-guide/notifications/email-stored-procedures.md)
to send email notifications. Setting up email notifications is an optional but recommended action.

## Configuring email notifications

You can configure the connector to send email notifications when errors occur.

On a given schedule, the connector checks whether new errors have occurred. If so, an email containing the number of
errors is sent to specified recipients. Email notifications are sent on an incremental basis, meaning that only new errors
trigger notification. For security reasons, emails contains only information about the number of errors (not the errors
themselves).

To receive email notifications about errors, you must have already created and set up an event table for the account
(to capture the logged errors), and that event table must have CHANGE_TRACKING set to TRUE.

To configure email notifications do the following:

1. Create a notification integration
2. Create a log view for the connector
3. Enable email notifications

### Create a notification integration

To send email notifications, the connector uses the notification integration object, which is a Snowflake object that provides
an interface between Snowflake and email services.

To create notification integration, run the following command:

> ```sqlsyntax
> CREATE NOTIFICATION INTEGRATION <integration_name>
>     TYPE=EMAIL
>     ENABLED=TRUE
>     ALLOWED_RECIPIENTS=('first.last@example.com','first2.last2@example.com');
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.

The connector requires the `USAGE` privilege on the notification integration that is used to send the email.
To grant this privilege, run the following command:

> ```sqlsyntax
> GRANT USAGE ON INTEGRATION <integration_name> TO APPLICATION <app_db_name>;
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.
>
> `app_db_name`
> :   Specifies the name of the connector database.

More information about creating notification integration can be
found [here](../../user-guide/notifications/email-notifications.md).

### Create a log view for the connector

To configure email notifications you must create a log view for the event table that stores the logged messages from the
connector. You can create the log view in any database and schema, except the database that serves as the connector instance.

Run the following command to create a log view on the event table:

> ```sqlsyntax
> CREATE SECURE VIEW <logs_view> CHANGE_TRACKING = TRUE AS
>   SELECT *
>   FROM <fully_qualified_event_table_name>
>   WHERE RECORD_TYPE = 'LOG' AND
>   RESOURCE_ATTRIBUTES:"snow.database.name" = '<app_db_name>';
> ```

Where:

> `logs_view`
> :   Specifies the name of the view that you want to create.
>
> `fully_qualified_event_table_name`
> :   Specifies the fully-qualified name of the event table.
>
> `app_db_name`
> :   Specifies the name of the connector database.

The connector requires the `SELECT` privilege on the view. It also requires `USAGE` privilege both on the database
and the schema that contains the view. To grant these privileges, run the following commands:

> ```sqlsyntax
> GRANT USAGE ON DATABASE <logs_db> TO APPLICATION <app_db_name>;
> GRANT USAGE ON SCHEMA <logs_db>.<logs_schema> TO APPLICATION <app_db_name>;
> GRANT SELECT ON VIEW <logs_db>.<logs_schema>.<logs_view> TO APPLICATION <app_db_name>;
> ```

Where:

> `logs_db`
> :   Specifies the name of the database that contains the view that you just created.
>
> `logs_schema`
> :   Specifies the name of the schema that contains the view that you just created.
>
> `logs_view`
> :   Specifies the name of the view that you just created.
>
> `app_db_name`
> :   Specifies the name of the connector database.

### Enable email notifications

After creating the email notification integration and the log view, run the following command to enable email notifications
from the connector:

> ```sqlsyntax
> CALL PUBLIC.CONFIGURE_ALERTS('<integration_name>', '<logs_db>.<logs_schema>.<logs_view>', '<schedule>', ['<email_address_1>' [, ... '<email_address_2>']]);
> ```

Where:

> `integration_name`
> :   Specifies the name of the notification integration.
>
> `logs_db`
> :   Specifies the name of the database that contains the view that you created in the previous step.
>
> `logs_schema`
> :   Specifies the name of the schema that contains the view that you created in the previous step.
>
> `logs_view`
> :   Specifies the name of the view that you created in the previous step.
>
> `schedule`
> :   Specifies the schedule or frequency at which the connector should check for errors and send a notification. For details
>     on specifying the schedule or frequency, see [SCHEDULE parameter](../../sql-reference/sql/create-task.md).
>
> `['email_address_1' [, ... 'email_address_2']]`
> :   Specifies the array of one or more quoted email addresses that can receive email notifications from the connector. The email
>     addresses in this array must be in the ALLOWED_RECIPIENTS parameter specified in the [email notification integration](../../user-guide/notifications/email-notifications.md).

To change the configuration of an email notifications, use the above command providing the revised parameters.

### Disabling Email Notifications

To disable email notifications, run the following command:

> ```sqlsyntax
> CALL PUBLIC.DISABLE_ALERTS();
> ```

This command removes all email addresses that were added during the initial configuration.

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md)

---
title: Setting up the Snowflake Connector for MySQL Agent container
source: https://docs.snowflake.com/en/connectors/mysql6/install-agent.md
section: Connectors & Drivers
---

# Setting up the Snowflake Connector for MySQL Agent container

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes the procedure to set up the Snowflake Connector for MySQL agent container.
A database connector agent is a containerized application that runs inside your infrastructure,
connecting directly to your databases and to Snowflake.

The process of configuring the Snowflake Connector for MySQL agent includes the following tasks:

1. Review prerequisites and choose an orchestration system
2. Configure and run the agent
3. [Optionally] Configure orchestration using Kubernetes
4. Monitor the agent

## Review prerequisites and choose an orchestration system

Review and complete all prerequisites and proceed to Configure and run the agent.

### Choose a container orchestration system

The agent is packaged as a standard Docker container image, and can be run on any orchestration system
that supports the standard.
This can be a stand-alone [Docker](https://www.docker.com/) instance, [Kubernetes](https://kubernetes.io/),
[RedHat OpenShift](https://www.redhat.com/en/technologies/cloud-computing/openshift),
a cloud-managed solution, such as [AWS Fargate](https://aws.amazon.com/fargate/), and others. Your organization will often have a preferred, existing system for you to use.

Pay attention to the agent configuration section of this document, because different orchestration systems come with
different constraints. Your system, or specific setup, may not permit you to mount writable volumes
(as is required with the agent’s primary configuration option).

Later examples will focus on Kubernetes as the most popular orchestration system.
The approach will often be similar in other systems, and you will need to adjust the examples for your setup.

### Confirm required system resources

* The agent is a memory-intensive application, and requires a minimum 6GB of RAM to operate correctly.
* The optimal number of CPUs is 4. Fewer CPUs can decrease performance. More CPUs will not improve performance.

### Set up connectivity

The agent needs to connect to every source database that you intend to read data from.
Configure your network and firewalls, so that direct connections are possible, and MySQL’s classic client port is reachable.
Typically that’s port `3306`.
For more information see [MySQL’s Port Reference Tables](https://dev.mysql.com/doc/mysql-port-reference/en/mysql-port-reference-tables.html).

If your source databases reside in isolated networks, and connecting from a single agent won’t be possible,
you will need to install additional instances of the connector’s native application, each one with its own agent.
This feature is currently in private preview. Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request access.

The agent also connects directly to your Snowflake deployment. For information on which hostnames need to be available
see [Allowing Host names](../../user-guide/hostname-allowlist.md).

If any of the agent’s connections pass through a proxy, you will need to pass additional configuration to the agent.
See Review optional configuration environment variables.

## Configure and run the agent

Configuring and running the agent is composed of the following steps:

1. Download the MariaDB JDBC driver
2. [Optional] Obtain and prepare SSL certificates for source databases
3. Prepare agent configuration files or environment variables for the agent and start the agent
4. Review optional configuration environment variables
5. Set up PrivateLink connectivity where required

### Download the MariaDB JDBC driver

The agent uses this driver to connect to, and interact with MySQL databases.
Despite it nominally being a driver for MariaDB, they’re compatible.

Due to licensing limitations, the JDBC driver cannot be distributed together with the agent, and you’ll have to provide it.
Download the [MariaDB JDBC Connector 3.4.1](https://repo1.maven.org/maven2/org/mariadb/jdbc/mariadb-java-client/3.4.1/mariadb-java-client-3.4.1.jar) and
save it in a directory from which you can mount it to the agent’s container.

### [Optional] Obtain and prepare SSL certificates for source databases

When the agent connects to source databases via SSL, it requires their certificates to validate the connections. These certificates must be available in the Java Truststore inside the agent’s container, under the path `/opt/java/openjdk/lib/security/cacerts`.

The simplest way to supply certificates to the agent is to add them to the host machine’s existing `cacerts` file, and then mount that file to the running container.

```bash
openssl x509 -outform der -in ca-root.pem -out ca-root.der
keytool -import -alias server-root \
    -keystore $JAVA_HOME/jre/lib/security/cacerrts -file ca-root.der
```

### Prepare agent configuration

The agent can be configured via container-mounted JSON files, or environment variables, or a mix of both.
The access keys required to connect to Snowflake can be mounted from the host’s file system, supplied as container secrets,
or as environment variables.

The following sections describe different configuration options, from the most straightforward, to the most comprehensive.
Choose an approach based on the specifics of your infrastructure.

JSONEnvironment variables

The simplest way to configure the agent is to mount two JSON files into the container at runtime:

* `snowflake.json` contains configuration for the agent to connect to your Snowflake account.
  :   Download this file at the end of the connector’s setup process via the wizard available in Snowsight.
* `datasources.json` contains the list of source databases for the agent to connect to.
  :   You will need to prepare this file yourself.

Right after downloading, `snowflake.json` includes a temporary private key for the Snowflake service account that represents the agent. When starting the agent for the first time, the agent will automatically replace that temporary key with a new, permanent set of keys, and output them to the path `/home/agent/.ssh/` inside the container. Both `snowflake.json` and the path under `/home/agent/.ssh/` must be mounted as writable for the agent to start.

Alternatively, you can provide your own private key for the agent’s service account.
See Review optional configuration environment variables for the required environment variables to pass.

> **Caution:**
>
> If the agent finds an existing private key, either as a mounted file or as
> an environment variable, it will ignore any temporary key that might still be
> present in `snowflake.json`.

Prepare the `datasources.json` file by using the following template:

```json
{
    "<data_source_name_1>": {
        "url": "jdbc:mariadb://<host>:<port>/[?<key1>=<value1>[&<key2>=<value2>]]",
        "username": "<mysql_db_username>",
        "password": "<mysql_db_password>"
    },
    "<data_source_name_2>": {
        "url": "jdbc:mariadb://<host>:<port>/[?<key1>=<value1>[&<key2>=<value2>]]",
        "username": "<mysql_db_username>",
        "password": "<mysql_db_password>"
    }
}
```

When creating the file:

* You have to add at least one data source with a URL, otherwise the agent will not start.
* You can add as many data sources, as you need, as long as the agent can connect directly to all of them.
* The names you enter become the identifiers that you will need later to call the connector’s native app and enable replication. They must be unique for each data source.
* The names of data sources can contain only letters and numbers. All lowercase letters are automatically uppercased by the agent.

Once you have both JSON files in place, the JAR file with JDBC drivers, and a directory to output the new set of keys, you can run the container:

```bash
docker run -d \
   --restart unless-stopped \
   --name database-connector-agent \
   --volume </path/to/ssh/keys/directory>:/home/agent/.ssh \
   --volume </path/to/mariadb/jdbc/jar>:/home/agent/libs/mariadb-jdbc-driver \
   --volume </path/to/snowflake/json/file>:/home/agent/snowflake.json \
   --volume </path/to/datasources/json/file>:/home/agent/datasources.json \
   -m 6g \
   snowflakedb/database-connector-agent:latest
```

Configuration options passed through `snowflake.json` and `datasources.json` can be supplied through environment variables.

> **Important:**
>
> Environment variables take precedence over settings supplied through either of the JSON files.

The environment variable names follow the same structure as the paths in both JSON files.
Nested keys must be separated with underscores `_`, every variable must be prefixed with `SNOWFLAKE_`, and array entries prefixed with integer indexes.

```bash
docker run \
  -e SNOWFLAKE_USERNAME="MYSQL_AGENT_USER" \
  -e SNOWFLAKE_APPLICATION_NAME="SNOWFLAKE_CONNECTOR_FOR_MYSQL" \
  -e SNOWFLAKE_ALLOWLIST_0_HOST="example_account.us-west-2.aws.snowflakecomputing.com" \
  -e SNOWFLAKE_ALLOWLIST_0_PORT=443 \
  -e SNOWFLAKE_ALLOWLIST_0_TYPE="SNOWFLAKE_DEPLOYMENT" \
  ...
```

Is equivalent to:

```json
{
  "userName": "MYSQL_AGENT_USER",
  "applicationName": "SNOWFLAKE_CONNECTOR_FOR_MYSQL",
  "allowlist": [
  {
    "host": "example_account.us-west-2.aws.snowflakecomputing.com",
    "port": 443,
    "type": "SNOWFLAKE_DEPLOYMENT"
  }
  ]
}
```

You don’t need to copy all the entries of the `allowList` or `allowlistPrivatelink` arrays. Instead, find the `allowList` entry with the `type` of `SNOWFLAKE_DEPLOYMENT` and use this URL to set the variable `SNOWFLAKE_ENFORCEDURL`, as in:

```bash
docker run \
  -e SNOWFLAKE_USERNAME="CONNECTOR_MYSQL_AGENT" \
  -e SNOWFLAKE_APPLICATION_NAME="CONNECTOR_MYSQL_INSTANCE" \
  -e SNOWFLAKE_ENFORCEDURL="example_account.us-west-2.aws.snowflakecomputing.com:443" \
  ...
```

Data sources follow a similar structure and are prefixed with `SNOWFLAKE_DATASOURCES_`.

For example:

```bash
docker run \
  -e SNOWFLAKE_DATASOURCES_MYSQLDS1_URL="jdbc:mariadb://example.internal:3306/" \
  -e SNOWFLAKE_DATASOURCES_MYSQLDS1_USERNAME="example_user" \
  -e SNOWFLAKE_DATASOURCES_MYSQLDS1_PASSWORD="example_password" \
  ...
```

Is equivalent to:

```json
{
    "MYSQLDS1": {
        "url": "jdbc:mariadb://example.internal:3306/",
        "username": "example_user",
        "password": "example_password"
    }
}
```

### Review optional configuration environment variables

The agent supports the following, optional settings, available by setting additional environment variables on the container:

`SNOWFLAKE_PRIVATEKEYPATH`
:   Specifies the absolute path to the file with the agent user’s private key. This is used when mounting your own private key to the container, usually via an orchestration system’s secret.

`SNOWFLAKE_PRIVATEKEYPASSWORD`
:   Specifies the password for agent user’s the private key. If you let the agent generate the keys, this password will be set on the private key. If you reuse existing keys, this password will be used to access the existing private key.

`SNOWFLAKE_PRIVATEKEY`
:   Specifies the content of the agent user’s private key. This can be set, when mounting the private key as a file in the container is not an option.

`SNOWFLAKE_ENFORCEDURL`
:   Specifies the URL to connect to Snowflake, overriding the agent’s own discovery mechanism. This is primarily used to connect to PrivateLink deployments.

`JAVA_OPTS`
:   Specifies additional Java settings or properties that will be passed to the agent’s process.

    The most commonly used are:

    * The `-Xmx` option to set the maximum Java heap size. Snowflake recommends setting this value to the amount of memory available to the container, minus 1GB.

      For example, if the container has 6GB of RAM available, set the following:

      ```bash
      JAVA_OPTS=-Xmx5g
      ```
    * When the connection from agent to Snowflake requires a proxy, set the following:

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
      ```
    * To bypass the proxy for some hosts or IP addresses, for instance, source databases, set the additonal `http.nonProxyHosts` property. Use a pipe symbol (`|`) to separate multiple host names. Use an asterisk (`*`) as a wildcard character.

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
        -Dhttp.nonProxyHosts='*.example.com|localhost|myorganization-myaccount.snowflakecomputing.com|192.168.91.*'
      ```
    * To pass credentials for the proxy, set the `http.proxyUser` and `http.proxyPassword` system properties.

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
        -Dhttp.proxyUser=<proxy-user> -Dhttp.proxyPassword=<proxy-pass>
      ```

### Set up PrivateLink connectivity

If you’re connecting to a PrivateLink deployment, you must provide the URL for the agent to
connect to explicitly by setting the `SNOWFLAKE_ENFORCEDURL` environment variable.

To determine the PrivateLink URL of your account, you can either:

* Open the `snowflake.json` file that you obtained during the configuration process. Find the array named `allowlistPrivatelink`, and then the entry with the `type` of `SNOWFLAKE_DEPLOYMENT`.
* Use the [SYSTEM$GET_PRIVATELINK_CONFIG](../../sql-reference/functions/system_get_privatelink_config.md) function.

### Understanding Snowflake access keys

The agent authenticates with Snowflake as a service account,
created by the connector’s setup wizard in Snowsight, using a set of access keys.
The setup wizard creates temporary access keys, and adds the *private* key
to the `snowflake.json` file in a field named `temporaryPrivateKey`.

During its initial startup, the agent replaces these temporary keys by:

1. Generating a new set of access keys, and storing them under `/home/agent/.ssh`
   as `database-connector-agent-app-private-key.p8` and `database-connector-agent-app-public-key.pub` inside the container.
   This directory should be mounted as an external, writable volume to the container,
   so that the keys persist when the container shuts down.
2. Altering its service account to use the new keys.
3. Removing the `temporaryPrivateKey` field from the `snowflake.json` file.

After the initial key replacement, the agent never rotates access keys.

You can use the keys generated by the agent.
Or you can create your own set, alter the service account,
and provide the private key to the agent.

The agent’s private key discovery order is:

1. Any key passed using the `SNOWFLAKE_PRIVATEKEY` environment variable. If this value is found, the connector will ignore the temporary key from `snowflake.json`.
2. Keys found on mounted volumes in `/home/agent/.ssh/database-connector-agent-app-private-key.p8`.
   If this file is found, the connector will ignore the temporary key from `snowflake.json`.
3. The value of the `temporaryPrivateKey` field from the `snowflake.json` file.

## Configure orchestration using Kubernetes

> **Note:**
>
> While this section concentrates on Kubernetes, the connector can be launched in any container orchestration system.
> The configuration syntaxes are often similar. For details, refer to your system’s documentation.

When using Kubernetes, mounting writable volumes is typically not an option.
As a result, the agent will not be able to automatically replace the keys for its Snowflake user account.
You will have to create a set of keys manually, alter the user, and then provide the private key to the container running the agent,
typically as a mounted secret. For details on setting key-pairs for Snowflake users see [Configuring key-pair authentication](../../user-guide/key-pair-auth.md).

We recommend that you store the secrets in a secure store, such as HashiCorp Vault.
These stores usually have existing integrations with Kubernetes, for instance,
[Vault offers a specialized operator](https://developer.hashicorp.com/vault/tutorials/kubernetes/vault-secrets-operator).
The integration details will be specific to your container orchestration system and secure store. Refer to their respective documentation for details.

Kubernetes normally runs in multi-node clusters, with no way to mount files from the host machines.
To supply the agent’s container with the configuration JSON files,
you can create a [Kubernetes ConfigMap](https://kubernetes.io/docs/concepts/configuration/configmap/) storing all three of the files.

The following shows a basic example for configuring the agent in Kubernetes.

1. Create a ConfigMap that will store the JDBC driver and `snowflake.json`:

   ```bash
   kubectl create configmap database-connector-config \
     --from-file=jdbc-driver.jar=</path/to/mariadb/jdbc/jar> \
     --from-file=snowflake.json=</path/to/snowflake/json/file>
   ```

   > **Tip:**
   >
   > The JDBC driver JAR is around 650KB in size, as of this writing, and well under the ConfigMap’s limit of 1MB imposed by Kubernetes. If you prefer not to put this much data into a ConfigMap, a common pattern is to build a custom Docker image, based on the agent’s official one, with the addition of the JDBC JAR.
2. Create a secret that will store the content of the agent user’s private key and `datasources.json`:

   ```bash
   kubectl create secret generic database-connector-secrets \
     --from-file=private-key=</path/to/private/key/file> \
     --from-file=datasources.json=</path/to/datasources.json>
   ```
3. Configure the agent’s Pod, mounting the configuration files and private key as volumes:

   ```yaml
   apiVersion: v1
   kind: Pod
   metadata:
     name: database-connector-agent
   spec:
     restartPolicy: Always
     containers:
       - name: database-connector-agent
         image: snowflakedb/database-connector-agent:latest
         resources:
           requests:
             memory: "6Gi"
           limits:
             memory: "8Gi"
         volumeMounts:
           - name: config
             mountPath: /home/agent/libs/jdbc-driver.jar
             subPath: jdbc-driver.jar
           - name: config
             mountPath: /home/agent/snowflake.json
             subPath: snowflake.json
           - name: secrets
             mountPath: /home/agent/datasources.json
             subPath: datasources.json
           - name: secrets
             mountPath: /etc/private-key/private-key
             subPath: private-key
         env:
           - name: MYSQL_DATASOURCE_DRIVERPATH
             value: /home/agent/libs/jdbc-driver.jar
           - name: SNOWFLAKE_PRIVATEKEYPATH
             value: /etc/private-key/private-key
     volumes:
       - name: config
         configMap:
           name: database-connector-config
       - name: secrets
         secret:
          secretName: database-connector-secrets
   ```
4. Save the Pod’s configuration as a YAML file, for instance, `database-connector.yaml` and start:

   ```bash
   kubectl apply -f database-connector.yaml
   ```

## Monitor the agent

The agent’s container outputs logs to `stdout` which can be accessed using Docker.
For example, if your container’s name is `database-connector-agent`, then the command to view logs would be:

```bash
docker container logs database-connector-agent
```

These logs are also streamed into Snowflake. See [Monitoring the Snowflake Connector for MySQL](monitor.md) for information about how to access these.

### Accessing the health check endpoint

The agent exposes a HTTP endpoint with health information. You can use this endpoint when running the agent in an
orchestration system to determine when the agent has fully launched and whether it is healthy.
The endpoint is available under port `8080` and path `/actuator/health`.

To use the endpoint as a liveness probe in Kubernetes, add the following to your Pod configuration:

```yaml
apiVersion: v1
kind: Pod
spec:
  containers:
  - ...
    livenessProbe:
      httpGet:
        path: /actuator/health
        port: 8080
      initialDelaySeconds: 5
      periodSeconds: 10
```

## Next steps

After completing these procedures, follow the steps in [Configuring replication for the Snowflake Connector for MySQL](configure-replication.md).

---
title: Setting up the Snowflake Connector for MySQL using Snowsight
source: https://docs.snowflake.com/en/connectors/mysql6/install-snowsight.md
section: Connectors & Drivers
---

# Setting up the Snowflake Connector for MySQL using Snowsight

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

> **Note:**
>
> For accounts where the AUTOCOMMIT parameter set to false, it should be set at the sessions level during configuration to true using the SQL command ALTER SESSION SET AUTOCOMMIT=TRUE.

The process of configuring up the Snowflake Connector for MySQL using Snowsight includes the following steps:

## Configuring Logging for the Connector

The Snowflake Connector for MySQL uses event table to store events and logs generated by the connector code. Setting up an event table is a mandatory step.

> **Note:**
>
> If the event table is already configured for the account used for the connector, skip this step.

To create an event table, do the following:

> ```sqlsyntax
> CREATE EVENT TABLE IF NOT EXISTS <fully_qualified_event_table_name> CHANGE_TRACKING = TRUE;
> ALTER ACCOUNT SET EVENT_TABLE = <fully_qualified_event_table_name>;
> ```
>
> Where:
>
> > `fully_qualified_event_table_name`
> > :   Specifies the name of the event table.

More information about an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) can be found [here](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

## Installing the Snowflake Connector for MySQL

The following procedure describes how to install the connector:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for MySQL, then select the tile for the connector.
4. In the page for the Snowflake Connector for MySQL, select Get.

   This displays a dialog that you use to begin the initial part of the installation process.

   In the dialog configure the following:

   1. In the Warehouse used for installation field, select the warehouse that you want to use for installing the connector.

      > **Note:**
      >
      > This is not the same warehouse that is used by the connector to synchronize data from MySQL database. In a
      > later step, you will create a separate warehouse for this purpose.
   2. Optionally, under Options » Application name you can change the name of the application.
   3. Select Get.
5. A dialog appears with the notification: `Successfully Installed`. To continue configuration, select Configure.

   The dialog closes, and the Snowflake Connector for MySQL page displays the UI for configuring
   and managing the connector.

### Optional: Installing multiple instances of Snowflake Connector for MySQL

You can install multiple instances of the same connector application on your Snowflake account.

To install an additional application instance, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the application for which you want to install another instance. The application details page appears.
4. Click Add instance. The installation dialog appears.
5. Provide the instance name and select the warehouse to be used during the installation.
6. Select Get to begin the installation process.

Adding connector instances can take several minutes. When the installation process completes, you get an email notification.

> **Attention:**
>
> To avoid ingested data corruption, during connector configuration, always use a database schema that is
> different from all other native applications.

To access your installed connector application instances, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select your application instance to access it.

## Configuring the Snowflake Connector for MySQL

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for MySQL, then select it. You will be now moved to the installation wizard page, that will take you through the configuration process.

Configure the application as follows:

### Step 1: Complete prerequisites

Complete the following prerequisite steps to set up your database and agent:

| Step | Description |
| --- | --- |
| Provide access to the source database | [Prerequisites for Snowflake Connector for MySQL datasources](prereqs-datasource.md) |
| Download and install the Agent | [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md) |

Select Mark as done for each completed step.

Select Start configuration.

### Step 2: Configure

In the configuration dialog, enter values for the following fields:

| Field | Description |
| --- | --- |
| Compute Warehouse | Identifier for a new, dedicated virtual warehouse for the connector. This warehouse will be used to process data gained from the agent and put them into target table.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation.  The configuration process creates a new `X-Small` warehouse with the specified name. |
| Operational Warehouse | Identifier for a new, dedicated virtual warehouse for the connector. This warehouse will be used to manage the activities of connector and its agents.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation.  The configuration process creates a new `X-Small` warehouse with the specified name. |
| Role | Identifier for a new custom role for the agent.  Specify a name that is unique for your account. The name of the role must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new role with the specified name. |
| User | Identifier for a new user that agent will use to authenticate to Snowflake.  Specify a name that is unique within the selected database. The name of the user must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new user with the specified name and of type `SERVICE`. |

> **Note:**
>
> By default, the fields are set to the names of objects that are created when you configure the connector.
> Snowflake recommends using new objects for these fields. However, you can specify the names of existing objects,
> if needed (e.g. if you are reinstalling the connector).

Select Configure.

### Step 3: Verify Agent Connection

Check the connection of the agent to Snowflake as follows:

1. Select Generate file to generate initial configuration file for the agent.

   > **Caution:**
   >
   > Every time you click Generate file a new file is generated, with a new set of temporary access keys for the agent’s user. The user is automatically altered to use these new keys for authentication. If you already have the agent running with another set of keys, it will disconnect from Snowflake and stop working.
2. Using the generated `snowflake.json` file, proceed to configure the agent, as described in [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md). Then return to Snowsight.
3. Select Refresh to check connectivity with the agent.
   The application will confirm that the agent is successfully connected, and a confirmation dialog displays.
4. Select Define data to sync to continue.

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md)

---
title: Setting up the Snowflake Connector for PostgreSQL Agent container
source: https://docs.snowflake.com/en/connectors/postgres6/install-agent.md
section: Connectors & Drivers
---

# Setting up the Snowflake Connector for PostgreSQL Agent container

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

This topic describes the procedure to set up the Snowflake Connector for PostgreSQL agent container.
A database connector agent is a containerized application that runs inside your infrastructure,
connecting directly to your databases and to Snowflake.

The process of configuring the Snowflake Connector for PostgreSQL agent includes the following steps:

1. Review prerequisites and choose an orchestration system
2. Configure and run the agent
3. [Optionally] Configure orchestration using Kubernetes
4. Monitor the agent

## Review prerequisites and choose an orchestration system

Review and complete all prerequisites and proceed to Configure and run the agent.

### Choose a container orchestration system

The agent is packaged as a standard Docker container image, and can be run on any orchestration system
that supports the standard.
This can be a stand-alone [Docker](https://www.docker.com/) instance, [Kubernetes](https://kubernetes.io/),
[RedHat OpenShift](https://www.redhat.com/en/technologies/cloud-computing/openshift),
a cloud-managed solution, such as [AWS Fargate](https://aws.amazon.com/fargate/), and others. Your organization will often have a preferred, existing system for you to use.

Pay attention to the agent configuration section of this document, because different orchestration systems come with
different constraints. Your system, or specific setup, may not permit you to mount writable volumes
(as is required with the agent’s primary configuration option).

Later examples will focus on Kubernetes as the most popular orchestration system.
The approach will often be similar in other systems, and you will need to adjust the examples for your setup.

### Confirm required system resources

* The agent is a memory-intensive application, and requires a minimum 6GB of RAM to operate correctly.
* The optimal number of CPUs is 4. Fewer CPUs can decrease performance. More CPUs will not improve performance.

### Set up connectivity

The agent needs to connect to every source database where data will be read. Configure your network and firewalls, so that direct connections are possible, and PostgreSQL’s client port is reachable. Typically port `5432`. For more information see [PostgreSQL’s Connection Settings documentation](https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-PORT).

If your source databases reside in isolated networks, and connecting from a single agent isn’t possible,
you will need to install additional instances of the connector’s native application, each one with its own agent.
This feature is currently in private preview. Please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request access.

The agent also connects directly to your Snowflake deployment. For information on which hostnames need to be available see [Allowing Host names](../../user-guide/hostname-allowlist.md).

If any of the agent’s connections pass through a proxy, you will need to pass additional configuration to the agent.
See Review optional configuration environment variables.

## Configure and run the agent

Configuring and running the agent is composed of the following steps:

1. [Optional] Obtain and prepare SSL certificates for source databases
2. Prepare agent configuration files or environment variables for the agent and start the agent
3. Review optional configuration environment variables
4. Where required Set up PrivateLink connectivity

Optionally Configure orchestration using Kubernetes.

### [Optional] Obtain and prepare SSL certificates for source databases

If you intend to use SSL connections between the agent and the source databases,
you need to acquire the root certificate for your PostgreSQL instances,
and mount it into the agent’s container under `/home/agent/.postgresql/root.crt`.

### Prepare agent configuration

The agent can be configured via container-mounted JSON files, or environment variables, or a mix of both.
The access keys required to connect to Snowflake can be mounted from the host’s file system, supplied as container secrets,
or as environment variables.

The following sections describe different configuration options, from the most straightforward, to the most comprehensive.
Choose an approach based on the specifics of your infrastructure.

JSONEnvironment variables

The simplest way to configure the agent is to mount two JSON files into the container at runtime:

* `snowflake.json` contains configuration for the agent to connect to your Snowflake account.
  :   Download this file at the end of the connector’s setup process via the wizard available in Snowsight.
* `datasources.json` contains the list of source databases for the agent to connect to.
  :   You will need to prepare this file yourself.

Right after downloading, `snowflake.json` includes a temporary private key for the Snowflake service account that represents the agent. When starting the agent for the first time, the agent will automatically replace that temporary key with a new, permanent set of keys, and output them to the path `/home/agent/.ssh/` inside the container. Both `snowflake.json` and the path under `/home/agent/.ssh/` must be mounted as writable for the agent to start.

Alternatively, you can provide your own private key for the agent’s service account.
See Review optional configuration environment variables for the required environment variables to pass.

> **Caution:**
>
> If the agent finds an existing private key, either as mounted file or as
> an environment variable, it will ignore any temporary key that might still be
> present in `snowflake.json`.

Prepare the `datasources.json` file by copying and filling in the following template:

```json
 {
   "<data_source_name_1>": {
       "url": "jdbc:postgresql://<host>:<port>/<databaseName>[?<key1>=<value1>[&<key2>=<value2>]]",
       "username": "<postgresql_db_username>",
       "password": "<postgresql_db_password>",
       "publication": "<postgresql_db_publication>",
       "ssl": false
   },
   "<data_source_name_2>": {
       "url": "jdbc:postgresql://<host>:<port>/<databaseName>[?<key1>=<value1>[&<key2>=<value2>]]",
       "username": "<postgresql_db_username>",
       "password": "<postgresql_db_password>",
       "publication": "<postgresql_db_publication>",
       "ssl": false
   }
}
```

When creating the file:

* You have to add at least one data source, with a URL, otherwise the agent will not start.
* You can add as many data sources, as you need, as long as the agent can connect directly to all of them.
* The names you enter become the identifiers you will need later to call the connector’s native app and enable replication. They must be unique per data source.
* The names of data sources can contain only letters and numbers. Any lowercase letters will be automatically uppercased by the agent.
* If you enable SSL connections by setting the `ssl` parameter to `true`, you will also need to mount the root certificate of the source database into the container. See [Optional] Obtain and prepare SSL certificates for source databases.

Once you have both JSON files in place, a directory to output the new set of keys, and optionally the root SSL certificate, start the container.

```bash
docker run -d \
   --restart unless-stopped \
   --name database-connector-agent \
   --volume </path/to/ssh/keys/directory>:/home/agent/.ssh \
   --volume </path/to/snowflake.json>:/home/agent/snowflake.json \
   --volume </path/to/datasources.json>:/home/agent/datasources.json \
   --volume </path/to/root.crt>:/home/agent/.postgresql/root.crt \
   -m 6g \
   snowflakedb/database-connector-agent:latest
```

Configuration options passed via `snowflake.json` and `datasources.json` can be supplied via environment variables.

> **Important:**
>
> Environment variables take precedence over settings supplied via either of the JSON files.

The environment variable names follow the same structure as the paths in both JSON files.
Nested keys must be separated with underscores `_`, every variable must be prefixed with `SNOWFLAKE_`, and array entries prefixed with integer indexes.

For example the following script:

```bash
docker run \
  -e SNOWFLAKE_USERNAME="POSTGRESQL_AGENT_USER" \
  -e SNOWFLAKE_APPLICATION_NAME="SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL" \
  -e SNOWFLAKE_ALLOWLIST_0_HOST="example_account.us-west-2.aws.snowflakecomputing.com" \
  -e SNOWFLAKE_ALLOWLIST_0_PORT=443 \
  -e SNOWFLAKE_ALLOWLIST_0_TYPE="SNOWFLAKE_DEPLOYMENT" \
  ...
```

Is equivalent to:

```json
{
  "userName": "POSTGRESQL_AGENT_USER",
  "applicationName": "SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL",
  "allowlist": [
    {
      "host": "example_account.us-west-2.aws.snowflakecomputing.com",
      "port": 443,
      "type": "SNOWFLAKE_DEPLOYMENT"
    }
  ]
}
```

You don’t need to copy all the entries of the `allowList` or `allowlistPrivatelink` arrays.
Instead, find the `allowList` entry with the `type` of `SNOWFLAKE_DEPLOYMENT` and use this URL to set the variable `SNOWFLAKE_ENFORCEDURL`, as in:

```bash
docker run \
    -e SNOWFLAKE_USERNAME="POSTGRESQL_AGENT_USER" \
    -e SNOWFLAKE_APPLICATION_NAME="SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL" \
    -e SNOWFLAKE_ENFORCEDURL="example_account.us-west-2.aws.snowflakecomputing.com:443" \
    ...
```

Data sources follow a similar structure and are prefixed with `SNOWFLAKE_DATASOURCES_`.

For example:

```bash
docker run \
  -e SNOWFLAKE_DATASOURCES_POSTGRESQLDS1_URL="jdbc:postgresql://example.internal:5432/", \
  -e SNOWFLAKE_DATASOURCES_POSTGRESQLDS1_USERNAME="example_user" \
  -e SNOWFLAKE_DATASOURCES_POSTGRESQLDS1_PASSWORD="example_password" \
  -e SNOWFLAKE_DATASOURCES_POSTGRESQLDS1_PUBLICATION="example_publication" \
  -e SNOWFLAKE_DATASOURCES_POSTGRESQLDS1_SSL="false" \
  ...
```

Is equivalent to:

```json
{
    "POSTGRESQLDS1": {
        "url": "jdbc:postgresql://example.internal:5432/",
        "username": "example_user",
        "password": "example_password",
        "publication": "example_publication",
        "ssl": false
    }
}
```

### Review optional configuration environment variables

The agent supports the following, optional settings, available by setting additional environment variables on the container:

`SNOWFLAKE_PRIVATEKEYPATH`
:   Specifies the absolute path to the file with the agent user’s private key. This is used when mounting your own private key to the container, usually via an orchestration system’s secret.

`SNOWFLAKE_PRIVATEKEYPASSWORD`
:   Specifies the password for agent user’s the private key. If you let the agent generate the keys, this password will be set on the private key. If you reuse existing keys, this password will be used to access the existing private key.

`SNOWFLAKE_PRIVATEKEY`
:   Specifies the content of the agent user’s private key. This can be set, when mounting the private key as a file in the container is not an option.

`SNOWFLAKE_ENFORCEDURL`
:   Specifies the URL to connect to Snowflake, overriding the agent’s own discovery mechanism. This is primarily used to connect to PrivateLink deployments.

`JAVA_OPTS`
:   Specifies additional Java settings or properties that will be passed to the agent’s process.

    The most commonly used are:

    * The `-Xmx` option to set the maximum Java heap size. Snowflake recommends setting this value to the amount of memory available to the container, minus 1GB.

      For example, if the container has 6GB of RAM available, set the following:

      ```bash
      JAVA_OPTS=-Xmx5g
      ```
    * When the connection from agent to Snowflake requires a proxy, set the following:

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
      ```
    * To bypass the proxy for some hosts or IP addresses, for instance, source databases, set the additonal `http.nonProxyHosts` property. Use a pipe symbol (`|`) to separate multiple host names. Use an asterisk (`*`) as a wildcard character.

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
        -Dhttp.nonProxyHosts='*.example.com|localhost|myorganization-myaccount.snowflakecomputing.com|192.168.91.*'
      ```
    * To pass credentials for the proxy, set the `http.proxyUser` and `http.proxyPassword` system properties.

      ```bash
      JAVA_OPTS=-Dhttp.useProxy=true -Dhttp.proxyHost=<proxy-host> -Dhttp.proxyPort=<proxy-port>
        -Dhttp.proxyUser=<proxy-user> -Dhttp.proxyPassword=<proxy-pass>
      ```

### Set up PrivateLink connectivity

If you’re connecting to a PrivateLink deployment, you must provide the URL for the agent to
connect to explicitly by setting the `SNOWFLAKE_ENFORCEDURL` environment variable.

To determine the PrivateLink URL of your account, you can either:

* Open the `snowflake.json` file that you obtained during the configuration process. Find the array named `allowlistPrivatelink`, and then the entry with the `type` of `SNOWFLAKE_DEPLOYMENT`.
* Use the [SYSTEM$GET_PRIVATELINK_CONFIG](../../sql-reference/functions/system_get_privatelink_config.md) function.

### Understanding Snowflake access keys

The agent authenticates with Snowflake as a service account,
created by the connector’s setup wizard in Snowsight, using a set of access keys.
The setup wizard creates temporary access keys, and adds the *private* key
to the `snowflake.json` file in a field named `temporaryPrivateKey`.

During its initial startup, the agent replaces these temporary keys by:

1. Generating a new set of access keys, and storing them under `/home/agent/.ssh`
   as `database-connector-agent-app-private-key.p8` and `database-connector-agent-app-public-key.pub` inside the container.
   This directory should be mounted as an external, writable volume to the container,
   so that the keys persist when the container shuts down.
2. Altering its service account to use the new keys.
3. Removing the `temporaryPrivateKey` field from the `snowflake.json` file.

After the initial key replacement, the agent never rotates access keys.

You can use the keys generated by the agent.
Or you can create your own set, alter the service account,
and provide the private key to the agent.

The agent’s private key discovery order is:

1. Any key passed using the `SNOWFLAKE_PRIVATEKEY` environment variable. If this value is found, the connector will ignore the temporary key from `snowflake.json`.
2. Keys found on mounted volumes in `/home/agent/.ssh/database-connector-agent-app-private-key.p8`.
   If this file is found, the connector will ignore the temporary key from `snowflake.json`.
3. The value of the `temporaryPrivateKey` field from the `snowflake.json` file.

## Configure orchestration using Kubernetes

> **Note:**
>
> While this section concentrates on Kubernetes, the connector can be launched in any container orchestration system.
> The configuration syntaxes are often similar. For details, refer to your system’s documentation.

When using Kubernetes, mounting writable volumes is typically not an option.
As a result, the agent will not be able to automatically replace the keys for its Snowflake user account.
You will have to create a set of keys manually, alter the user, and then provide the private key to the container running the agent,
typically as a mounted secret. For details on setting key-pairs for Snowflake users see [Configuring key-pair authentication](../../user-guide/key-pair-auth.md).

We recommend that you store the secrets in a secure store, such as HashiCorp Vault.
These stores usually have existing integrations with Kubernetes, for instance,
[Vault offers a specialized operator](https://developer.hashicorp.com/vault/tutorials/kubernetes/vault-secrets-operator).
The integration details will be specific to your container orchestration system and secure store. Refer to their respective documentation for details.

Kubernetes normally runs in multi-node clusters, with no way to mount files from the host machines.
To supply the agent’s container with the configuration JSON files,
you can create a [Kubernetes ConfigMap](https://kubernetes.io/docs/concepts/configuration/configmap/) storing all three of the files.

The following shows a basic example for configuring the agent in Kubernetes.

1. Create a ConfigMap that will store `snowflake.json` and optionally the SSL root certificate:

   ```bash
   kubectl create configmap database-connector-config \
     --from-file=snowflake.json=</path/to/snowflake.json> \
     --from-file=root.crt=</path/to/root.crt>
   ```
2. Create a secret that will store the content of the agent user’s private key and `datasources.json`:

   ```bash
   kubectl create secret generic database-connector-secrets \
     --from-file=private-key=</path/to/private/key/file> \
     --from-file=datasources.json=</path/to/datasources.json>
   ```
3. Configure the agent’s Pod, mounting the configuration files and private key as volumes:

   ```yaml
   apiVersion: v1
   kind: Pod
   metadata:
     name: database-connector-agent
   spec:
     restartPolicy: Always
     containers:
       - name: database-connector-agent
         image: snowflakedb/database-connector-agent:latest
         resources:
           requests:
             memory: "6Gi"
           limits:
             memory: "8Gi"
         volumeMounts:
           - name: config
             mountPath: /home/agent/snowflake.json
             subPath: snowflake.json
           - name: config
             mountPath: /home/agent/.postgresql/root.crt
             subPath: root.crt
           - name: secrets
             mountPath: /home/agent/datasources.json
             subPath: datasources.json
           - name: secrets
             mountPath: /etc/private-key/private-key
             subPath: private-key
         env:
           - name: SNOWFLAKE_PRIVATEKEYPATH
             value: /etc/private-key/private-key
     volumes:
       - name: config
         configMap:
           name: database-connector-config
       - name: secrets
         secret:
           secretName: database-connector-secrets
   ```
4. Save the Pod’s configuration as a YAML file, for instance, `database-connector.yaml` and start:

   ```bash
   kubectl apply -f database-connector.yaml
   ```

## Monitor the agent

The agent’s container outputs logs to `stdout` which can be accessed using Docker.
For example, if your container’s name is `database-connector-agent`, then the command to view logs would be:

```bash
docker container logs database-connector-agent
```

These logs are also streamed into Snowflake. See [Monitoring the Snowflake Connector for PostgreSQL](monitor.md) for information about how to access these.

### Accessing the health check endpoint

The agent exposes a HTTP endpoint with health information. You can use this endpoint when running the agent in an
orchestration system to determine when the agent has fully launched and whether it is healthy.
The endpoint is available under port `8080` and path `/actuator/health`.

To use the endpoint as a liveness probe in Kubernetes, add the following to your Pod configuration:

```yaml
apiVersion: v1
kind: Pod
spec:
  containers:
  - ...
    livenessProbe:
      httpGet:
        path: /actuator/health
        port: 8080
      initialDelaySeconds: 5
      periodSeconds: 10
```

## Next steps

After completing these procedures, follow the steps in [Configuring replication for the Snowflake Connector for PostgreSQL](configure-replication.md).

---
title: Setting up the Snowflake Connector for PostgreSQL using Snowsight
source: https://docs.snowflake.com/en/connectors/postgres6/install-snowsight.md
section: Connectors & Drivers
---

# Setting up the Snowflake Connector for PostgreSQL using Snowsight

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

> **Note:**
>
> For accounts where the AUTOCOMMIT parameter set to false, it should be set at the sessions level during configuration to true using the SQL command ALTER SESSION SET AUTOCOMMIT=TRUE.

The process of configuring up the Snowflake Connector for PostgreSQL using Snowsight includes the following steps:

## Configuring logging for the connector

The Snowflake Connector for PostgreSQL uses event table to store events and logs generated by the connector code. Setting up an event table is a mandatory step.

> **Note:**
>
> If the event table is already configured for the account used for the connector, skip this step.

To create an event table, do the following:

> ```sqlsyntax
> CREATE EVENT TABLE IF NOT EXISTS <fully_qualified_event_table_name> CHANGE_TRACKING = TRUE;
> ALTER ACCOUNT SET EVENT_TABLE = <fully_qualified_event_table_name>;
> ```
>
> Where:
>
> > `fully_qualified_event_table_name`
> > :   Specifies the name of the event table.

More information about an [event table](../../developer-guide/logging-tracing/event-table-setting-up.md) can be found [here](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

## Installing the Snowflake Connector for PostgreSQL

The following procedure describes how to install the connector:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the Snowflake Connector for PostgreSQL, then select the tile for the connector.
4. In the page for the Snowflake Connector for PostgreSQL, select Get.

   This displays a dialog that you use to begin the initial part of the installation process.

   In the dialog configure the following:

   1. In the Warehouse used for installation field, select the warehouse that you want to use for installing the connector.

      > **Note:**
      >
      > This is not the same warehouse that is used by the connector to synchronize data from MySQL database. In a
      > later step, you will create a separate warehouse for this purpose.
   2. Optionally, under Options » Application name you can change the name of the application.
   3. Select Get.
5. A dialog appears with the notification: `Successfully Installed`. To continue configuration, select Configure.

   The dialog closes, and the Snowflake Connector for PostgreSQL page displays the UI for configuring
   and managing the connector.

### Optional: Installing multiple instances of Snowflake Connector for PostgreSQL

You can install multiple instances of the same connector application on your Snowflake account.

To install an additional application instance, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the application for which you want to install another instance. The application details page appears.
4. Click Add instance. The installation dialog appears.
5. Provide the instance name and select the warehouse to be used during the installation.
6. Select Get to begin the installation process.

Adding connector instances can take several minutes. When the installation process completes, you get an email notification.

> **Attention:**
>
> To avoid ingested data corruption, during connector configuration, always use a database schema that is
> different from all other native applications.

To access your installed connector application instances, do the following:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Select your application instance to access it.

## Configuring the Snowflake Connector for PostgreSQL

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for PostgreSQL, then select it. You will be now moved to the installation wizard page, that will take you through the configuration process.

Configure the application as follows:

### Step 1: Complete prerequisites

Complete the following prerequisite steps to set up your database and agent:

| Step | Description |
| --- | --- |
| Provide access to the source database | [Prerequisites for Snowflake Connector for PostgreSQL datasources](prereqs-datasource.md) |
| Download and install the Agent | [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md) |

Select Mark as done for each completed step.

Select Start configuration.

### Step 2: Configure

In the configuration dialog, enter values for the following fields:

| Field | Description |
| --- | --- |
| Compute Warehouse | Identifier for a new, dedicated virtual warehouse for the connector. This warehouse will be used to process data gained from the agent and put them into target table.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation.  The configuration process creates a new `X-Small` warehouse with the specified name. |
| Operational Warehouse | Identifier for a new, dedicated virtual warehouse for the connector. This warehouse will be used to manage the activities of connector and its agents.  Specify a name that is unique for your account. The name of the warehouse must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  **Note:** Do not specify the same warehouse that you selected at the beginning of the connector installation.  The configuration process creates a new `X-Small` warehouse with the specified name. |
| Role | Identifier for a new custom role for the agent.  Specify a name that is unique for your account. The name of the role must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new role with the specified name. |
| User | Identifier for a new user that agent will use to authenticate to Snowflake.  Specify a name that is unique within the selected database. The name of the user must be a valid [object identifier](../../sql-reference/identifiers-syntax.md).  The configuration process creates a new user with the specified name and of type `SERVICE`. |

> **Note:**
>
> By default, the fields are set to the names of objects that are created when you configure the connector.
> Snowflake recommends using new objects for these fields. However, you can specify the names of existing objects,
> if needed (e.g. if you are reinstalling the connector).

Select Configure.

### Step 3: Verify agent connection

Check the connection of the agent to Snowflake as follows:

1. Select Generate file to generate initial configuration file for the agent.

   > **Caution:**
   >
   > Every time you click Generate file a new file is generated, with a new set of temporary access keys for the agent’s user. The user is automatically altered to use these new keys for authentication. If you already have the agent running with another set of keys, it will disconnect from Snowflake and stop working.
2. Using the generated `snowflake.json` file, proceed to configure the agent, as described in [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md). Then return to Snowsight.
3. Select Refresh to check connectivity with the agent.
   The application will confirm that the agent is successfully connected, and a confirmation dialog displays.
4. Select Define data to sync to continue.

## Next steps

After completing these procedures, follow the steps in [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md)

---
title: Snowflake Connector for Google Analytics Aggregate Data ingestion model
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-ingestion-model.md
section: Connectors & Drivers
---

# Snowflake Connector for Google Analytics Aggregate Data ingestion model

This topic describes how the Snowflake Connector for Google Analytics Aggregate Data ingests data from the [Google Analytics Data API](https://developers.google.com/analytics/devguides/reporting/data/v1) and how sampling may affect ingested data.

## Ingestion strategy

The connector uses two ingestion modes:

* The *initial load* of data occurs directly after configuring the report. Successful *initial load* finishes with data ingested from a chosen start data up to today.
* The *ongoing load* of data begins after completing the *initial load*. Incremental updates occur on a chosen, regular schedule.

The ingestion of each report is an independent process. Ingestion processes may be performed in parallel.

See [Set up data ingestion for your Snowflake Connector for Google Analytics Aggregate Data instance](gaad-connector-setting-up-data.md) to learn how to configure a report or choose a *sync schedule* and a *start date*.

### Choosing interval length

The [Google Analytics Data API](https://developers.google.com/analytics/devguides/reporting/data/v1) requires specifying each request’s date range (*startDate* and *endDate*). The connector may make multiple requests during one ingestion load and adjust an interval length as required.
The default interval is 31 days. The interval may be shortened automatically in the following situations:

* The API responded with an error, which the connector may mitigate by retrying the request with a shorter interval.
* The API responded with sampled data (only if the *avoid sampling* option was chosen during report configuration).
* The report contains a large amount of data. In this case, the interval is shortened to reduce the risk of an API error when retrieving subsequent result pages.

The user cannot set the interval length.

## Monitoring ingestion

Ingestion metadata is available in the `CONNECTOR_STATS` view. See more: [Monitoring the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-monitoring.md).

```sqlsyntax
SELECT * FROM PUBLIC.CONNECTOR_STATS ORDER BY COMPLETED_AT DESC;
```

The `METADATA` column contains, among other things, the request body that was sent in a request to the [Google Analytics Data API](https://developers.google.com/analytics/devguides/reporting/data/v1). The request body contains information about *startDate* and *endDate*.

The `STATUS` column may be equal to one of the following values:
:   * `COMPLETED` - a successful ingestion.
    * `CANCELED` - the interval length was shortened and the ingestion will continue with adjusted date ranges.
    * `FAILED` - ingestion failed and was not continued.

> **Note:**
>
> `FAILED` ingestion doesn’t necessarily mean that the data was lost. The connector may recover from some errors by attempting to download all missing data during the next scheduled report update.
> If succeeding ingestion runs were successful, the connector ingested all missing data.

To receive email notifications about failed ingestion runs, set up alerting. See more: [Manage the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-managing.md).

## About sampling

Sampling is the process of selecting and analyzing a subset of data from a larger dataset in order to extrapolate the result. This means that sampling lowers data quality.
Data quality depends on number of samples used in the process. For more information see [Google Analytics sampling](https://support.google.com/analytics/answer/13331292?hl=en).

> **Note:**
>
> By default, the connector doesn’t try to avoid sampling. This setting can be changed only during the initial report configuration.

### Obtaining sampling metadata

The `METADATA` column from the `CONNECTOR_STATS` view contains also sampling metadata. It can be joined with the data saved in a destination table.

Use the following statement to obtain information about the data that is sampled:

```sqlsyntax
SELECT d.date, d.raw, d.last_update_date, cs.metadata:samplingMetadata:samplesReadCount::INTEGER as samplesReadCount, cs.metadata:samplingMetadata:samplingSpaceSize::INTEGER as samplingSpaceSize, samplesReadCount/samplingSpaceSize as ratio
FROM <destination_table> as d
LEFT JOIN <connector_stats_view> as cs
ON d.ingestion_run_id = cs.run_id
WHERE cs.metadata:samplingMetadata:samplingOccurred::BOOLEAN = true;
```

Replace the placeholders with the actual values, as in the following example for a report named `REPORT_1`.

```sqlsyntax
SELECT d.date, d.raw, d.last_update_date, cs.metadata:samplingMetadata:samplesReadCount::INTEGER as samplesReadCount, cs.metadata:samplingMetadata:samplingSpaceSize::INTEGER as samplingSpaceSize, samplesReadCount/samplingSpaceSize as ratio
FROM google_analytics_aggregate_data_dest_db.google_analytics_aggregate_data_dest_schema.report_1__raw as d
LEFT JOIN snowflake_connector_for_google_analytics_aggregate_data.public.connector_stats as cs
ON d.ingestion_run_id = cs.run_id
WHERE cs.metadata:samplingMetadata:samplingOccurred::BOOLEAN = true;
```

The result contains the following information related to sampling.

| Name | Description |
| --- | --- |
| `samplesReadCount` | The total number of events read in this sampled report for a date range. |
| `samplingSpaceSize` | The total number of events present in this property’s data that could have been analyzed in this report for a date range. |
| `ratio` | The number of analyzed events to the number of events that could have been analyzed. |

The [Google Analytics sampling metadata documentation](https://developers.google.com/analytics/devguides/reporting/data/v1/rest/v1beta/ResponseMetaData#SamplingMetadata) provides more information about the meaning of the sampling metadata values.

> **Note:**
>
> Metadata about ingestion performed before the upgrade to the 1.4.0 version doesn’t contain information about the occurrence of sampling. It is certain that the data is not sampled only if the *samplingOccurred* flag is equal to false.

---
title: Snowflake Connector for Google Analytics Raw Data installation and configuration tasks
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-tasks.md
section: Connectors & Drivers
---

# Snowflake Connector for Google Analytics Raw Data installation and configuration tasks

Working with the Snowflake Connector for Google Analytics Raw Data includes the following common task areas:

* Installing and configuring the connector
* Setting up data ingestion and reviewing data
* Monitoring and managing the connector
* Troubleshooting

Review each before installing and configuring a Snowflake Connector for Google Analytics Raw Data instance.

## Installing and configuring the connector

Perform the following tasks to install and configure the Snowflake Connector for Google Analytics Raw Data.

After all prerequisites are satisfied you can install and configure your Snowflake Connector for Google Analytics Raw Data.

| Task | Description |
| --- | --- |
| [Preparing your Google Analytics and Google Cloud accounts](gard-connector-prereqs.md) | Before installing the Snowflake Connector for Google Analytics Raw Data, you must set up your Snowflake Connector for Google Analytics Raw Data instance and meet any common prerequisites. |
| [Configuring service account authentication for Google Cloud Platform (GCP)](gard-connector-create-service-account-key.md) | An application that authenticates to Google using a service account must provide a service account key file with correct roles set. |
| [Configuring OAuth authentication for Google Cloud Platform (GCP)](gard-connector-create-client-id.md) | For an application that authenticates to Google using OAuth 2.0. |
| [Configuring BigQuery Link for Google Analytics 4 property](gard-connector-create-link.md) | Information on how to configure the BigQuery link for Google Analytics 4 (GA4) property. |
| [Installing and configuring the Snowflake Connector for Google Analytics Raw Data](gard-connector-installing.md) | This topic provides information on installing and configuring the Snowflake Connector for Google Analytics Raw Data. |
| [Configuring the Snowflake Connector for Google Analytics Raw Data using SQL](gard-connector-configuring-sql.md) | Information on how to configure the connector using SQL. |
| [Uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data](gard-connector-uninstalling-and-reinstalling.md) | This topic provides information on uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data. |

After installing and configuring the Snowflake Connector for Google Analytics Raw Data you must set up basic management as described in [Managing the Snowflake Connector for Google Analytics Raw Data](gard-connector-managing.md).

## Monitoring and managing the connector

Review and perform the following tasks to provide routine management and monitoring of the connector.

| Task | Description |
| --- | --- |
| [Managing the Snowflake Connector for Google Analytics Raw Data](gard-connector-managing.md) | This topic describes typical tasks you might need to perform after installing and configuring the connector. |
| [Monitoring the Snowflake Connector for Google Analytics Raw Data](gard-connector-monitoring.md) | Monitor the state of the Snowflake Connector for Google Analytics Raw Data. |

## Setting up data ingestion and reviewing data

After installing and configuring the Snowflake Connector for Google Analytics Raw Data you must configure data ingestion and can then begin accessing data.

Perform the following tasks to configure data ingestion and begin accessing data.

| Task | Description |
| --- | --- |
| [Data ingestion model for the Snowflake Connector for Google Analytics Raw Data](gard-connector-data-ingestion-model.md) | This topic provides information on the data ingestion models supported by the Snowflake Connector for Google Analytics Raw Data. |
| [Setting up data ingestion for your Snowflake Connector for Google Analytics Raw Data](gard-connector-setting-up-data.md) | This topic describes how to access Snowflake Connector for Google Analytics Raw Data in your Snowflake account. |
| [Accessing data ingested by Snowflake Connector for Google Analytics Raw Data](gard-connector-accessing-data.md) | Access ingested data. |

## Troubleshooting

Review and perform the following tasks to troubleshoot errors with the Snowflake Connector for Google Analytics Raw Data installation of normal day-to-day activities.

| Task | Description |
| --- | --- |
| [Troubleshooting the Snowflake Connector for Google Analytics Raw Data](gard-connector-troubleshooting.md) | This topic provides guidelines for troubleshooting issues with the Snowflake Connector for Google Analytics Raw Data. |

---
title: Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/validate-entra-auth.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup

Snowflake recommends the configuration be tested and
suggests using the cURL commands below to determine if Entra is correctly issuing a token.

## Delegated auth validation

The prior steps must be executed to get the required authorization code value.
To obtain the required code, follow the steps outlined in [request an authorization code](https://learn.microsoft.com/en-us/entra/identity-platform/v2-oauth2-auth-code-flow#request-an-authorization-code).

1. Get authorization code:

   In a browser enter the following URL, replacing the placeholders with your values:

   ```bash
   https://login.microsoftonline.com/<tenant_id>/oauth2/v2.0/authorize?client_id=<client_id>&response_type=code&redirect_uri=https%3A%2F%2Flocalhost&response_mode=query&scope=api://<app_resource_id>/session:role-any&state=12345
   ```
2. Get access token:

   Use the authorization code from the previous step to get an access token.
   Replace the placeholders with your values in the following cURL command:

   ```bash
   curl -X  POST \
        -H "Content-Type: application/x-www-form-urlencoded;charset=UTF-8" \
        --data-urlencode "client_id=<your client id>" \
        --data-urlencode "client_secret=<your client secret>"  \
        --data-urlencode "grant_type=authorization_code" \
        --data-urlencode "code=<use auth code from 1>" \
        --data-urlencode "scope=api://7bd09dd9-a6ef-4461-b014-c3226df74ed0/.default"  \
        --data-urlencode "redirect_uri=http://localhost" \
        https://login.microsoftonline.com/9a2d78cb-73e9-40ee-a558-fc1ac5ef57a7/oauth2/v2.0/token
   ```

   > **Note:**
   >
   > You must add `localhost` as an additional redirect URI in the AAD client application.

## Service principal auth validation

```bash
curl -X POST \
     -H "Content-Type: application/x-www-form-urlencoded;charset=UTF-8" \
     --data-urlencode "client_id=<CLIENT_ID>" \
     --data-urlencode "client_secret=<CLIENT_SECRET>" \
     --data-urlencode "grant_type=client_credentials" \
     --data-urlencode "scope=api://<Appl_URI_ID from Oauth Server>/.default" \
     https://login.microsoftonline.com/<TENANT_ID>/oauth2/v2.0/token
```

Where:

> * `CLIENT_ID` = Client ID from [Oauth Client setup](create-oauth-client.md).
> * `CLIENT_SECRET` = Client secret from Oauth Client setup.
> * `TENANT_ID` = Tenant ID from Oauth Client setup.

To validate the token in Snowflake, execute the SQL in the steps below with token from above :

1. Navigate to Snowsight.
2. Open a worksheet.
3. Execute the following code:

   ```sqlexample
   system$verify_external_oauth_token({token});
   ```

## Next steps

After completing these procedures, follow the steps in [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Snowflake access](validate-sf-access.md).

---
title: Snowflake Connector for Microsoft Power Platform: [Optional] Validate Snowflake access
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/validate-sf-access.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: [Optional] Validate Snowflake access

Validate Snowflake access using either [SnowCLI](../../../developer-guide/snowflake-cli/index.md) or [SnowSQL](../../../user-guide/snowsql.md).

Open a terminal and execute the following commands:

* For *Delegated Auth*

  ```bash
  snowsql -a <organization-locator> -u 'user@sandbox.onmicrosoft.com' --rolename <snowflake-role> --authenticator oauth --token "<token-value>"
  ```
* For Service Principal Auth

  ```bash
  snowsql -a <organization-locator> -u 'sub-value' -r <snowflake-role> --authenticator oauth --token "<token-value>"
  ```

Where:

> * `snowflake-accountname` from [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup](validate-entra-auth.md).
> * `snowflake-role` from [Snowflake Connector for Microsoft Power Platform: Create a security integration](create-security-integration.md).
> * `token-value` from the output from cURL in step [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup](validate-entra-auth.md).

---
title: Snowflake Connector for Microsoft Power Platform: Collect Azure AD information for Snowflake
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/collect-azure-ad-info.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: Collect Azure AD information for Snowflake

To collect Azure AD information for Snowflake, follow these steps:

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com/) and authenticate.
2. Navigate to Azure Active Directory.
3. Select App Registrations.
4. Select the **Snowflake OAuth Resource** that was created in [Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID](configure-oauth.md).
5. In the Overview interface select Endpoints
6. On the right-hand side, copy the **OAuth 2.0 token endpoint (v2)** and note the URLs for **OpenID Connect metadata** and **Federation Connect metadata**.

   * The OAuth 2.0 token endpoint (v2) will be known as the `<AZURE_AD_OAUTH_TOKEN_ENDPOINT>` in the following configuration steps.
     The endpoint should be similar to `https://login.microsoftonline.com/<tenant-id>/oauth2/v2.0/token`.
   * For the **OpenID Connect metadata**, open in a new browser window.

     + Locate the `jwks_uri` parameter and copy its value.
     + This parameter value will be known as the `<AZURE_AD_JWS_KEY_ENDPOINT>` in the following configuration steps.
       :   The endpoint should be similar to `https://login.microsoftonline.com/<tenant-id>/discovery/v2.0/keys`.
   * For the Federation metadata document, open the URL in a new browser window.
   * Locate the `"entityID"` parameter in the `XML Root Element` and copy its value.
   * This parameter value will be known as the `<AZURE_AD_ISSUER>` in the following configuration steps. The entityID value should be similar to `https://sts.windows.net/<tenant-id>/`.

## Next steps

After completing these procedures, follow the steps in [Snowflake Connector for Microsoft Power Platform: Create a security integration](create-security-integration.md).

---
title: Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/configure-oauth.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID

The process for configuring OAuth in Microsoft Entra includes the following steps:

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com/) and authenticate.
2. Navigate to Microsoft Entra ID.
3. Select App Registrations.
4. Select New Registration.
5. Enter ‘Snowflake OAuth Resource’, or similar value as the **Name**.
6. Verify the Supported account types are set to **Single Tenant**.
7. Select Register.
8. Select Expose an API.
9. Select the link next to Application ID URI to add the Application ID URI. The Application ID URI will be in the format `Application ID URI <api://9xxxxxxxxxxxxxxxxxx>`
10. **For Delegated Auth** or **For Service Principal Auth**

    > 1. **For Delegated Auth Only**
    >
    >    1. Select Add a Scope to add a scope representing the Snowflake role.
    >    2. Select who can consent.
    >    3. Add a description.
    >    4. Click Add Scope to save.
    >
    >       > Example: `session:scope:analyst` to restrict users having specific roles, and `session:role-any` to allow users of any role.
    > 2. **For Service Principal Auth Only**
    >
    >    To add a Snowflake role as a role for OAuth flows where the programmatic client requests an access token for itself, follow these steps:
    >
    >    1. Select App Roles.
    >    2. Select +Create app role.
    >    3. Check Applications as “Allowed member types”.
    >    4. For value enter
    >
    >       Example: `session:role:analyst` to connect to a specific role, or `session:role-any` for any role which the service user is mapped to.
    >
    >       Avoid high-privilege roles such as `ACCOUNTADMIN`, `SECURITYADMIN` or `ORGADMIN`.
11. [Optional] If a security integration is already being used in Snowflake with another Microsoft product
    such as PowerBI and with a different claim mapping, the manifest will need to be modified.
    The manifest will need to emit tokens using a different issuer, so that a separate
    security integration in Snowflake with the unique claim mapping can be created.

    > 1. Select Manifest.
    > 2. Find the attribute `requestedAccessTokenVersion` and set the value to “2”.
    >
    >    * When `requestedAccessTokenVersion` is set to “2”, the Access Token is going to have an issuer of format: `https://login.microsoftonline.com/<Tenant-ID>/v2.0`
    >    * When `requestedAccessTokenVersion` is set to “1”, the Access Token is going to have an issuer of format: `https://sts.windows.net/<tenant-ID>/`
    > 3. Select Save.

## Next steps

After completing these procedures, follow the steps in [Snowflake Connector for Microsoft Power Platform: Create OAuth client in Microsoft Entra ID](create-oauth-client.md).

---
title: Snowflake Connector for Microsoft Power Platform: Create a security integration
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/create-security-integration.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: Create a security integration

The `external_oauth_audience_list` parameter of the security integration must
exactly match the Application ID URI that was specified while configuring Microsoft Entra ID.

Create either a Delegated Auth or Service Principal based security integration.

1. Navigate to Snowsight.
2. Open a worksheet.
3. Execute either of the following:

   1. Delegated Auth:

      Using the [CREATE SECURITY INTEGRATION (External OAuth)](../../../sql-reference/sql/create-security-integration-oauth-external.md) command,
      create a security integration with the following parameters:

      ```sqlexample
      CREATE SECURITY INTEGRATION IF NOT EXISTS external_oauth_azure_1
         TYPE = EXTERNAL_OAUTH
         ENABLED = TRUE
         EXTERNAL_OAUTH_TYPE = AZURE
         EXTERNAL_OAUTH_ISSUER = '{AZURE_AD_ISSUER}'
         EXTERNAL_OAUTH_JWS_KEYS_URL = '{AZURE_AD_JWS_KEY_ENDPOINT}'
         EXTERNAL_OAUTH_AUDIENCE_LIST = ('{SNOWFLAKE_APPLICATION_ID_URI}')
         EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = 'upn'
         EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = { 'LOGIN_NAME' | 'EMAIL_ADDRESS' }
      ```

   When using Delegated Authentication, the Snowflake user’s `login_name` or `email_address`
   MUST match the Entra email of the user who will run the Power Automate flow.

   For example:

   ```sqlexample
   ALTER USER SNOWSQL_DELEGATE_USER
   LOGIN_NAME = '{ENTRA-USERID}' or EMAIL_ADDRESS = 'ENTRA-USERID'
   DISPLAY_NAME = 'SnowSQL Delegated User'
   COMMENT = 'A delegate user for SnowSQL client to be used for OAuth based connectivity';
   ```

   **OR**

   * Service Principal Auth:

     ```sqlexample
     CREATE SECURITY INTEGRATION external_oauth_azure_2
        TYPE = EXTERNAL_OAUTH
        ENABLED = TRUE
        EXTERNAL_OAUTH_TYPE = AZURE
        EXTERNAL_OAUTH_ISSUER = '{AZURE_AD_ISSUER}'
        EXTERNAL_OAUTH_JWS_KEYS_URL = '{AZURE_AD_JWS_KEY_ENDPOINT}'
        EXTERNAL_OAUTH_AUDIENCE_LIST = ('{SNOWFLAKE_APPLICATION_ID_URI}')
        EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = 'sub'
        EXTERNAL_OAUTH_SNOWFLAKE_USER_MAPPING_ATTRIBUTE = 'login_name';
     ```
4. Create a user for the Service Principal-based connection:

   * The subvalue should be mapped to a user in Snowflake,
     avoiding using high privilege accounts `ACCOUNTADMIN`, `ORGADMIN`, or `SECURITYADMIN`.

   ```sqlexample
   CREATE OR REPLACE USER SNOWSQL_OAUTH_USER
   LOGIN_NAME = '<subvalue from decoded token>'
   DISPLAY_NAME = 'SnowSQL OAuth User'
   COMMENT = 'A system user for SnowSQL client to be used for OAuth based connectivity';

   CREATE ROLE ANALYST;

   GRANT ROLE ANALYST TO USER SNOWSQL_OAUTH_USER;
   ```

> **Note:**
>
> If a Security Integration for Azure AD was previously configured, execute
> the [ALTER SECURITY INTEGRATION](../../../sql-reference/sql/alter-security-integration-oauth-external.md) as described below:
>
> > ```sqlexample
> > ALTER SECURITY INTEGRATION external_oauth_azure_1 SET EXTERNAL_OAUTH_TOKEN_USER_MAPPING_CLAIM = ('sub','upn');
> > ```

## Next steps

After completing these procedures, follow the steps in [Snowflake Connector for Microsoft Power Platform: [Optional] Validate Entra authorization setup](validate-entra-auth.md).

---
title: Snowflake Connector for Microsoft Power Platform: Create OAuth client in Microsoft Entra ID
source: https://docs.snowflake.com/en/connectors/microsoft/powerapps/create-oauth-client.md
section: Connectors & Drivers
---

# Snowflake Connector for Microsoft Power Platform: Create OAuth client in Microsoft Entra ID

To create an OAuth client in Microsoft Entra ID, follow these steps:

1. Navigate to the [Microsoft Azure Portal](https://portal.azure.com/) and authenticate.
2. Navigate to Azure Active Directory.
3. Select App Registrations.
4. Select New Registration.
5. Enter a name for the client such as `Snowflake OAuth Client`.
6. Verify the Supported account types are set to `Single Tenant`.
7. Click Register.
8. In the Overview section, copy the `ClientID` from the Application (client) ID field.

   This will be known as the `<OAUTH_CLIENT_ID>` in the following steps.
9. Select Certificates & secrets » New client secret.
10. Add a description of the secret.
11. For testing purposes, select `long-living secrets`.

    For Production environments, follow necessary security policies.
12. Select Add and copy the secret. This will be known as the `<OAUTH_CLIENT_SECRET>` in the following steps.
13. For **Delegated Auth** or **Service Principal Auth**

    > 1. For **Delegated Auth**:
    >
    >    1. Select Manage » API Permissions.
    >    2. Select Add Permission.
    >    3. Select My APIs.
    >    4. Select the Snowflake OAuth Resource that was created in [Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID](configure-oauth.md).
    >    5. Select the Delegated Permissions box.
    >    6. Confirm the Permission related to the Scopes manually defined in the Application that are to be granted to this client.
    >    7. Click Add Permissions.
    >    8. Click Grant Admin Consent to grant the permissions to the client.
    >
    >       > **Note:**
    >       >
    >       > This method should only be used for testing purposes.
    >       > In production environments, granting permissions in this manner is not recommended.
    >    9. Click Yes.
    >    10. Click Manage » Authentication,
    >        add a platform » Web and enter Redirect URI’s
    >        `https://global.consent.azure-apim.net/redirect/snowflakev2`

    2. For **Service Principal Auth**:

       1. Select Manage » API Permissions.
       2. Select Add Permission.
       3. Select My APIs.
       4. Select the Snowflake OAuth Resource that was created in [Snowflake Connector for Microsoft Power Platform: Configure the OAuth resource in Microsoft Entra ID](configure-oauth.md)
       5. Select the Application Permissions box.
       6. Confirm the Permission related to the Roles manually defined in the Manifest of the Application that are to be granted to this client.
       7. Select Add Permissions.
       8. Click Grant Admin Consent to grant the permissions to the client.
          Note that for testing purposes, permissions are configured this way.
          However, in a production environment, granting permissions in this manner is not advisable.
       9. Click Yes.

## Next steps

After completing these procedures, follow the steps in [Snowflake Connector for Microsoft Power Platform: Collect Azure AD information for Snowflake](collect-azure-ad-info.md).

---
title: Snowflake Connector for MySQL characteristics
source: https://docs.snowflake.com/en/connectors/mysql6/mysql-characteristics.md
section: Connectors & Drivers
---

# Snowflake Connector for MySQL characteristics

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

## Version support

Our general policy is that the Snowflake Connector for MySQL supports any officially supported MySQL Long-Term Support (LTS) version. We will be phasing out support for older versions as our users move onto newer ones, and will be announcing support for new versions as they get released.

While the connector supports a number of MySQL cloud flavors, some of them require additional settings. See [Prerequisites for Snowflake Connector for MySQL datasources](prereqs-datasource.md).

The following table lists tested and officially supported versions.

List of officially supported PostgreSQL versions

|  | 8.0 | 8.4 |
| --- | --- | --- |
| [Standard](https://www.mysql.com/) | Yes | Yes |
| [AWS RDS](https://docs.aws.amazon.com/AmazonRDS/latest/UserGuide/CHAP_MySQL.html) | Yes |  |
| [Amazon Aurora](https://docs.aws.amazon.com/AmazonRDS/latest/AuroraMySQLReleaseNotes/Welcome.html) | Yes, as Version 3 |  |
| [GCP Cloud SQL](https://cloud.google.com/sql/mysql?hl=en) | Yes | Yes |
| [Azure Database](https://azure.microsoft.com/en-us/products/mysql/) | No |  |

## Server settings

For the connector to work correctly, review and adjust the following settings on your MySQL server.

|  |  |
| --- | --- |
| `log_bin` | Set to `on`.  This enables the binary log that records structural and data changes. |
| `binlog_format` | Set to `row`.  The connector supports only row-based replication. MySQL 8.x versions may be the last ones to support this setting, and future versions will only support row-based replication.  Not applicable in GCP Cloud SQL, where it is fixed at the right value. |
| `binlog_row_metadata` | Set to `full`.  The connector requires all row metadata to operate, most importantly, column names and primary key information. |
| `binlog_row_image` | Set to `full`.  The connector requires that all columns be written into the binary log.  Not applicable in Amazon Aurora, where it is fixed at the right value. |
| `binlog_row_value_options` | Leave empty.  This option ony affects JSON columns, where it can be set to include only the modified parts of JSON documents for `UPDATE` statements. The connector requires that full documents are written into the binary log. |
| `binlog_expire_logs_seconds` | Set to *at least* a few hours, or longer to ensure that the database agent can continue incremental replication after extended pauses or downtime.  If you’re using scheduled replication, the value needs to be longer than the configured schedule. |

## The binary log

MySQL’s binary log, once enabled, collects changes from *all* tables in a given instance. There is no way to exclude tables or columns. The connector will therefore receive changes from all tables in the database, and he database agent will process changes from tables that you configure for replication, but discard changes to all other tables.

Every change needs to be first loaded by the database agent, and for some **particularly large changes**, like updates to `BLOB` columns, even if they’re made on tables not configured for replication, these may exhaust the database agent’s memory and cause it to crash. If you store particularly large values anywhere in your database, make sure to configure sufficient memory for the database agent and its container.

**Transaction size** is limited by [MySQL’s replication limits](https://dev.mysql.com/doc/refman/8.4/en/group-replication-limitations.html#group-replication-limitations-transaction-size) to under 4 GB. Transactions crossing the limit will cause replication for the affected table to fail permanently.

## Agent authentication

The only authentication method currently supported is username and password. Every data source entry in the database agent’s configuration includes its own set of credentials, and these can be different for each data source.

The database agent’s users must have the following grants:

* `REPLICATION SLAVE` on all schemas and tables
* `REPLICATION CLIENT` on all schemas and tables
* `SELECT` on all schemas and on all tables

For instructions on how to create a user for the database agent, see [Create required user](prereqs-datasource.md).

---
title: Snowflake Connector for MySQL installation and configuration tasks
source: https://docs.snowflake.com/en/connectors/mysql6/tasks.md
section: Connectors & Drivers
---

# Snowflake Connector for MySQL installation and configuration tasks

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Working with the Snowflake Connector for MySQL involves several key tasks:

* Installation and configuration of the connector
* Configuring data replication and examining the ingested data

It’s important to carefully review each of the installation and configuration steps, before enabling data replication.

## Installation and configuration of the connector

These steps are designed to guide you through the initial setup of the Connector. The installation process involves various stages and contributions from different departments.

| Task | Description | Area Involved |
| --- | --- | --- |
| [Prerequisites for Snowflake Connector for MySQL datasources](prereqs-datasource.md) | The configuration of the source database instance to enable data replication. | Source Database Administrator |
| [Setting up the Snowflake Connector for MySQL using Snowsight](install-snowsight.md) | Installation of the Connector using Snowsight. | Snowflake Administrator |
| [Set up connectivity](install-agent.md) | The configuration of the network access to Snowflake. A sub-task of [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md). | Network Administrator |
| [Setting up the Snowflake Connector for MySQL Agent container](install-agent.md) | Installation and configuring the Snowflake Connector for MySQL agent. | Developer and Operations |
| [Setting up Email Notifications for the MySQL connector](email-notifications.md) | The configuration of email notifications for the connector. | Snowflake Administrator |
| [Configuring replication for the Snowflake Connector for MySQL](configure-replication.md) | Configure connector replication | Developer and Operations |

## Configuring data replication and examining the ingested data

This installation step enables you to configure the data source and activate table replication.

| Task | Description |
| --- | --- |
| [Configuring replication for the Snowflake Connector for MySQL](configure-replication.md) | Configure connector replication |
| [Viewing MySQL data in Snowflake](view-data.md) | Viewing data ingested using the connector. |

---
title: Snowflake Connector for MySQL ongoing tasks
source: https://docs.snowflake.com/en/connectors/mysql6/ongoing.md
section: Connectors & Drivers
---

# Snowflake Connector for MySQL ongoing tasks

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Working with the Snowflake Connector for MySQL includes the following ongoing task areas:

* Monitoring and managing the connector

## Monitoring and managing the connector

Review and perform the following tasks to provide routine management and monitoring of the connector.

| Task | Description |
| --- | --- |
| [Monitoring the Snowflake Connector for MySQL](monitor.md) | Monitor the state of the Snowflake Connector for MySQL. |
| [Troubleshooting the Snowflake Connector for MySQL](troubleshoot.md) | Troubleshooting the state of the Snowflake Connector for MySQL. |
| [Reinstall the Snowflake Connector for MySQL](reinstall.md) | Reinstalling the Snowflake Connector for MySQL. |

---
title: Snowflake Connector for PostgreSQL characteristics
source: https://docs.snowflake.com/en/connectors/postgres6/postgresql-characteristics.md
section: Connectors & Drivers
---

# Snowflake Connector for PostgreSQL characteristics

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

## Version support

The Snowflake Connector for PostgreSQL supports any officially supported PostgreSQL version. Snowflake drops support for older versions as customers move to newer versions. Snowflake announces support for new versions as they are released.

While the connector supports a number of PostgreSQL cloud versions, some require additional settings. See [Prerequisites for Snowflake Connector for PostgreSQL datasources](prereqs-datasource.md) for more information.

The following are the supported PostgresSQL versions.

Supported PostgreSQL versions

|  | 11 | 12 | 13 | 14 | 15 | 16 | 17 |
| --- | --- | --- | --- | --- | --- | --- | --- |
| [Standard](https://www.postgresql.org/) | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| [AWS RDS](https://docs.aws.amazon.com/AmazonRDS/latest/PostgreSQLReleaseNotes/Welcome.html) | Yes | Yes | Yes | Yes | Yes | Yes | Yes |
| [Amazon Aurora](https://docs.aws.amazon.com/AmazonRDS/latest/AuroraPostgreSQLReleaseNotes/Welcome.html) | Yes | Yes | Yes | Yes | Yes | Yes |  |
| [GCP Cloud SQL](https://cloud.google.com/sql/docs/postgres/) | Yes | Yes | Yes | Yes | Yes | Yes |  |
| [Azure Database](https://learn.microsoft.com/en-us/azure/postgresql/) | Yes | Yes | Yes | Yes | Yes | Yes |  |

## Server settings

Review and adjust the following settings on your PostgreSQL server as required:

|  |  |
| --- | --- |
| `wal_level` | Set to `logical`.  The connector relies on primary keys to merge changes into destination tables. The following settings ensure that Write-Ahead Log (WAL) records include primary key information: |
| `max_replication_slots` | Add 1 for every data source configuration entry for this database in the database agent. |
| `max_connections` | Add 1 for every data source configuration entry for this database in the database agent. |
| `max_wal_senders` | Add 1 for every data source configuration entry for this database in the database agent. |

## Publications

The connector requires a [PostgreSQL publication](https://www.postgresql.org/docs/current/logical-replication-publication.html) to access tables for replication.

* The database agent supports exactly one publication per data source. If you need to use multiple publications in a single PostgreSQL server, you can configure that server multiple times as separate data sources, each one with its own publication.
* The publication should include all changes made to data, including `INSERT`, `UPDATE`, `DELETE`, and `TRUNCATE`.
* The publication can be set up for `ALL TABLES` or a subset of tables, but for optimal performance, only add those tables that should be replicated. The connector will only receive changes from the tables included in the publication.
* Tables can be added to the publication with all their columns, or a subset of columns. When adding with a subset of columns, use the [ADD_TABLE_WITH_COLUMNS procedure](configure-replication.md).

> **Warning:**
>
> When a table is added to a publication with a subset of columns, but then enabled for replication using the [ADD_TABLES](configure-replication.md) procedure, columns missing from the publication will be marked in the destination table as deleted. Adding any additional columns to the publication later will result in the table being marked as permanently failed.

For more information on publication configuration, see [Configure publication](prereqs-datasource.md).

## Replication slots

To replicate data and schema changes, the connector creates a [replication slot](https://www.postgresql.org/docs/current/logicaldecoding-explanation.html#LOGICALDECODING-REPLICATION-SLOTS). The slot is created when the first table in a given data source is added to replication, and used for all tables from that data source.

The slot’s name is structured as `sf_db_conn_rs_kbmd_<data-source-name>`, where `<data-source-name>` is the identifier of the data source in the database agent’s configuration.

* If you configure the database agent to connect to the same database multiple times, by adding several data sources, the connector will create *multiple* replication slots.
* If you configure multiple database agents to connect to the same PostgreSQL server, you must provide unique data source names to each database agent.

> **Caution:**
>
> The database agent *does not remove* unused replication slots. If you disconnect the database agent from a PostgreSQL instance or remove all of its tables from replication, then you *must* also manually drop the replication slot to prevent it from holding up WAL trimming.

### WAL growth and replication slot position

Once created, a replication slot will cause PostgreSQL to retain the WAL data from the position held by the replication slot, until the connector confirms and advances that position. The connector periodically confirms the position after records have been stored in its journal tables, even if they were not yet merged into destination tables.

* In **continuous mode**, the connector confirms the position every minute.
* In **scheduled mode**, the connector confirms the position based on the configured schedule. Keep in mind that longer schedules *will cause the WAL to grow larger*.

You must ensure there is enough disk space on your PostgreSQL server for the WAL. If you detect the WAL growing continuously, check the following:

* Is the database agent connected, and the connector actively replicating data? If not, the replication slot is not being advanced, and blocks WAL trimming.
* Is the replication keeping up with the data changes in replicated tables? If not, meaning that the lag between a data change in the source and its appearance in the Snowflake destination table keeps growing, then the replication slot is being advanced too slowly. You need to remove some tables from replication, or increase the compute warehouse size.

The `max_wal_size` setting in PostgreSQL will have no effect on WAL growth when it is caused by a replication slot not advancing.

> **Tip:**
>
> In critical situations, you can manually drop the replication slot used by the connector. This will break any replication running in the connector, but enable PostgreSQL to trim the WAL and reclaim disk space.

## Primary keys and table replica identity

The connector relies on primary keys to merge changes into the destination tables. As a result:

* Every table enabled for replication must have a primary key. The key can be a single column or composite.
* Tables must also have their [REPLICA IDENTITY](https://www.postgresql.org/docs/current/sql-altertable.html#SQL-ALTERTABLE-REPLICA-IDENTITY) set to `DEFAULT`. This ensures primary keys are represented in the WAL, and the connector can read them.

## Agent authentication

The only authentication method currently supported is username and password. Every data source entry in the database agent’s configuration includes its own set of credentials, and these can vary between data sources.

The database agent’s users must have a role with the `REPLICATION` attribute, or `SUPERUSER` if the former cannot be applied.

For instructions on how to create a user for the database agent, see [Create required user](prereqs-datasource.md).

For more information on securing the database agent’s access to the source databases, see [PostgreSQL documentation](https://www.postgresql.org/docs/current/logical-replication-security.html).

---
title: Snowflake Connector for PostgreSQL installation and configuration tasks
source: https://docs.snowflake.com/en/connectors/postgres6/tasks.md
section: Connectors & Drivers
---

# Snowflake Connector for PostgreSQL installation and configuration tasks

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Working with the Snowflake Connector for PostgreSQL includes the following common task areas:

* Installing and configuring the connector
* Reviewing data

It’s important to carefully review each of the installation and configuration steps,
before enabling data replication.

## Installing and configuring the connector

Perform the following tasks to install and configure the Snowflake Connector for PostgreSQL.

After all prerequisites are satisfied you can install and configure your Snowflake Connector for PostgreSQL using either SQL or Snowsight.

| Task | Description | Area Involved |
| --- | --- | --- |
| [Prerequisites for Snowflake Connector for PostgreSQL datasources](prereqs-datasource.md) | The configuration of the source database instance to enable data replication. | Source Database Administrator |
| [Setting up the Snowflake Connector for PostgreSQL using Snowsight](install-snowsight.md) | Installation of the Connector using Snowsight. | Snowflake Administrator |
| [Set up connectivity](install-agent.md) | The configuration of the network access to Snowflake. A sub-task of [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md). | Network Administrator |
| [Setting up the Snowflake Connector for PostgreSQL Agent container](install-agent.md) | Installation and configuring the Snowflake Connector for PostgreSQL agent. | Developer and Operations |
| [Setting up Email Notifications for the PostgreSQL connector](email-notifications.md) | The configuration of email notifications for the connector. | Snowflake Administrator |
| [Configuring replication for the Snowflake Connector for PostgreSQL](configure-replication.md) | Configure connector replication | Developer and Operations |

## Reviewing data

Review the following to examine Snowflake Connector for PostgreSQL data

| Task | Description |
| --- | --- |
| [Viewing PostgreSQL data in Snowflake](view-data.md) | Viewing data ingested using the connector. |

---
title: Snowflake Connector for PostgreSQL ongoing tasks
source: https://docs.snowflake.com/en/connectors/postgres6/ongoing.md
section: Connectors & Drivers
---

# Snowflake Connector for PostgreSQL ongoing tasks

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

Working with the Snowflake Connector for PostgreSQL includes the following ongoing task areas:

* Monitoring and managing the connector

## Monitoring and managing the connector

Review and perform the following tasks to provide routine management and monitoring of the connector.

| Task | Description |
| --- | --- |
| [Monitoring the Snowflake Connector for PostgreSQL](monitor.md) | Monitor the state of the Snowflake Connector for PostgreSQL. |
| [Troubleshooting the Snowflake Connector for PostgreSQL](troubleshoot.md) | Troubleshooting the state of the Snowflake Connector for PostgreSQL. |
| [Reinstall the Snowflake Connector for PostgreSQL](reinstall.md) | Reinstalling the Snowflake Connector for PostgreSQL. |

---
title: Snowflake Connector for ServiceNow® installation and configuration tasks
source: https://docs.snowflake.com/en/connectors/servicenow/tasks.md
section: Connectors & Drivers
---

# Snowflake Connector for ServiceNow® installation and configuration tasks

Working with the Snowflake Connector for ServiceNow® includes the following common task areas:

* Installing and configuring the connector
* Setting up data ingestion and reviewing data
* Monitoring and managing the connector
* Troubleshooting

Review each before installing and configuring a Snowflake Connector for ServiceNow® instance.

## Installing and configuring the connector

Perform the following tasks to install and configure the Snowflake Connector for ServiceNow®.

After all prerequisites are satisfied you can install and configure your Snowflake Connector for ServiceNow® using either SQL or Snowsight.

| Task | Description |
| --- | --- |
| [Prepare your ServiceNow® instance](prereqs.md) | Before installing the Snowflake Connector for ServiceNow®, you must set up your ServiceNow® instance and meet any common prerequisites. |
| [Install and configure the connector with Snowsight](installing-snowsight.md) | installing and configuring the Snowflake Connector for ServiceNow® using Snowsight. |
| [Install and configure the connector with SQL commands](installing-sql.md) | installing and configuring the Snowflake Connector for ServiceNow® using SQL. |

## Monitoring and managing the connector

Review the following tasks that allow to customize default settings, provide routine management and set up monitoring of the connector.

| Task | Description |
| --- | --- |
| [Managing, updating, and uninstalling the Snowflake Connector for ServiceNow®](managing.md) | Manage the connector instance, refreshing tokens, configuring basic authentication, removing the connector and related activities. |
| [Monitoring the Snowflake Connector for ServiceNow®](monitoring.md) | Monitor the state of the Snowflake Connector for ServiceNow®. |

## Setting up data ingestion and reviewing data

After installing and configuring the Snowflake Connector for ServiceNow® you must configure data ingestion and can then begin accessing data.

Perform the following tasks to configure data ingestion and begin accessing ServiceNow® data.

| Task | Description |
| --- | --- |
| [Set up data ingestion for your ServiceNow® data](ingestion.md) | Data ingestion defines how the Snowflake Connector for ServiceNow® ingests data. |
| [Access the ServiceNow® data in Snowflake](accessing-data.md) | Access ingested ServiceNow® data. |

## Troubleshooting

Review and perform the following tasks to troubleshoot errors with the Snowflake Connector for ServiceNow® installation of normal day-to-day activities.

| Task | Description |
| --- | --- |
| [Troubleshooting the connector](troubleshooting.md) | Resolve common problems. |

---
title: Snowflake High Performance connector for Kafka
source: https://docs.snowflake.com/en/connectors/kafkahp/about.md
section: Connectors & Drivers
---

# Snowflake High Performance connector for Kafka

This topic describes the basic concepts of the Snowflake High Performance connector for Kafka, its use cases, benefits, key features, and limitations.

> **Note:**
>
> The Snowflake High Performance connector for Kafka is a sink connector that reads data from Kafka topics and loads that data into Snowflake tables.
> For more information about Kafka Connect and its framework, see [The Apache Kafka and Kafka connect framework](aboutkafkaconnect.md).

## Benefits

The Snowflake High Performance connector for Kafka leverages Snowflake’s [high-performance Snowpipe Streaming architecture](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md), which is engineered for modern, data-intensive organizations requiring near real-time insights. This next-generation architecture significantly advances throughput, efficiency, and flexibility for real-time ingestion into Snowflake.

The high-performance architecture offers several key advantages:

* **Superior throughput and latency**: Designed to support ingest speeds of up to 10 GB/s per table with end-to-end ingest to query latencies within 5 to 10 seconds, enabling near-real-time analytics.
* **Simplified billing**: Provides transparent, throughput-based billing that makes costs more predictable and easier to understand.
* **Enhanced performance**: Uses a Rust-based client core that delivers improved client-side performance and lower resource usage compared to previous implementations.
* **In-flight transformations**: Supports data cleansing and reshaping during ingestion using COPY command syntax within the PIPE object, allowing you to transform data before it reaches the target table.
* **Server-side schema validation**: Moves schema validation from the client side to the server side through the PIPE object, ensuring data quality and reducing client complexity.
* **Pre-clustering capability**: Can cluster data during ingestion when the target table has clustering keys defined, improving query performance without requiring post-ingestion maintenance.

The connector uses Snowflake [PIPE](../../user-guide/data-load-snowpipe-intro.md) objects as the central component for managing ingestion.
The PIPE object acts as the entry point and definition layer for all streaming data, defining how data is processed, transformed, and validated before being committed to the target table. For more information about how the connector works with tables and pipes, see [How the connector works with tables and pipes](how-the-connector-works.md).

## Choosing a connector version

The Kafka connector runs in a Kafka Connect cluster, reading data from the Kafka topics and writing into Snowflake tables.

Snowflake provides two versions of the connector. Both versions of the connector provide the same core functionality for streaming data from Kafka to Snowflake.

* Confluent version of the connector

  High Performance Snowflake Connector for Kafka is not yet available on Confluent Cloud.
  If you are using Confluent Cloud, you must install the connector manually as a custom plugin connector.

  The Confluent version is packaged as a zip file for installation through Confluent Hub or Confluent Control Center and includes all external libraries required to run the connector.

  Choose this version if you’re using the Confluent Platform or Confluent Cloud.

  Please contact Snowflake support to obtain and install Confluent version of the connector.

  For more information, see [Kafka Connect](https://docs.confluent.io/current/connect/).
* OSS Apache Kafka version of the connector

  Available from [open source software (OSS) Apache Kafka package](https://mvnrepository.com/artifact/com.snowflake/snowflake-kafka-connector/).

  The Apache version is distributed as a standard fat JAR file and requires manual installation into your Apache Kafka Connect cluster.
  This version requires [Bouncy Castle](https://www.bouncycastle.org/) cryptography libraries that must be downloaded separately.

  For more information, see [Apache Kafka](https://kafka.apache.org/).

## Using the connector with Apache Iceberg™ tables

The connector can ingest data into a Snowflake-managed [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).
Before you configure the Kafka connector for Iceberg table ingestion, you must create an Iceberg table.
See [Create an Apache Iceberg™ table for ingestion](how-the-connector-works.md) for more information.

## Limitations

The Snowflake High Performance connector for Kafka has the following limitations.

Apache Iceberg™ tables and schema evolution
:   The connector does not support schema evolution for Apache Iceberg™ tables.

Migration of existing pipelines from version 3.x and below
:   The connector does not support migration of the existing pipelines from version 3.x and earlier. You must manually migrate the existing pipelines to the new connector. Ensure that existing pipelines don’t rely on any features that are not yet available with this connector.

Single Message Transformations (SMTs):
:   Most Single Message Transformations (SMTs) are supported when using community converters, with the exception of `regex.router` which is currently not supported.

Not all broken records are sent to Dead Letter Queue (DLQ) by the connector
:   With `errors.tolerance=all` and `errors.deadletterqueue.topic.name` configured, the connector guarantees **at most once** delivery. Only non-convertible records are sent to the DLQ by Kafka Connect. Records that fail Snowflake ingestion are not routed there; Snowpipe Streaming can detect that records failed, but not which specific records.

Broken records which failed to be ingested need to be manually retried
:   When `errors.tolerance=none` and `rows_error_count` increases, the connector task fails.
    To retry broken records, review the channel history to find the broken records.
    For more information about troubleshooting broken records and ingestion errors
    see [error handling](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-error-handling.md).
    You can also use gap finding technique described in
    [Detect and recover from errors using metadata offsets](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-best-practices.md).
    Kafka offset information needed to use this technique is available in the `RECORD_METADATA` column.

## Limitations of fault tolerance with the connector

Kafka Topics can be configured with a limit on storage space or retention time.

* If the system is offline for more than the retention time, then expired records will
  not be loaded. Similarly, if Kafka’s storage space limit is exceeded, some messages will not be delivered.
* If messages in the Kafka topic are deleted, these changes will not be reflected in the Snowflake table.

For more information about SMTs, see [Kafka Connect Single Message Transform Reference for Confluent Cloud or Confluent Platform](https://docs.confluent.io/current/connect/transforms/index.html).

### Snowflake support for the connector

The following table describes the supported versions and information about pre-release and release candidates.

| Release Series | Status | Notes |
| --- | --- | --- |
| 4.x.x | Public Preview | Early access. Currently the migration from 3.x and 2.x is not supported. |
| 3.x.x | Officially supported | Latest version and strongly recommended. |
| 2.x.x | Officially supported | Upgrade recommended. |
| 1.x.x | Not supported |  |

The following features are not supported:

## Breaking changes in the Preview version

See the release notes for the Preview versions for a list of breaking changes

* [4.0.0-rc1](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc1)
* [4.0.0-rc2](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc2)
* [4.0.0-rc3](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc3)
* [4.0.0-rc4](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc4)
* [4.0.0-rc5](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc5)
* [4.0.0-rc6](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc6)
* [4.0.0-rc7](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc7)

## Next steps

Review [how the connector works](how-the-connector-works.md) topic for more information about how the connector works with tables and pipes. .
Review [Set up tasks for the Snowflake High Performance connector for Kafka](setup-tasks.md) topic for the steps to set up the Snowflake High Performance connector for Kafka.

---
title: Snowflake High Performance connector for Kafka: Configure Snowflake
source: https://docs.snowflake.com/en/connectors/kafkahp/setup-snowflake.md
section: Connectors & Drivers
---

# Snowflake High Performance connector for Kafka: Configure Snowflake

This topic describes the steps to configure Snowflake for Snowflake High Performance connector for Kafka.

Snowflake recommends that you create a separate user, using [CREATE USER](../../sql-reference/sql/create-user.md) and role using [CREATE ROLE](../../sql-reference/sql/create-role.md) for each Kafka instance so that the access privileges can be individually revoked as required.

## Creating a role to use the Kafka connector

The following creates a custom role for use by the Kafka connector, for example KAFKA_CONNECTOR_ROLE.
The script references a specific existing database and schema (`kafka_db.kafka_schema`)
and user (`kafka_connector_user_1`):

```sqlexample
-- Use a role that can create and manage roles and privileges.
USE ROLE securityadmin;

-- Create a Snowflake role with the privileges to work with the connector.
CREATE ROLE kafka_connector_role;

-- Grant privileges on the database.
GRANT USAGE ON DATABASE kafka_db TO ROLE kafka_connector_role;

-- Grant privileges on the schema.
GRANT USAGE ON SCHEMA kafka_schema TO ROLE kafka_connector_role;

-- Grant OPERATE on pipes only if you manually created them (user-defined pipe mode).
-- GRANT OPERATE ON PIPE existing_pipe1 TO ROLE kafka_connector_role;

-- Grant INSERT on the table to insert data into.
GRANT INSERT ON TABLE kafka_schema.existing_table TO ROLE kafka_connector_role;

-- Grant the custom role to the user configured in the Kafka connector configuration properties.
GRANT ROLE kafka_connector_role TO USER kafka_connector_user;
```

Note that any privileges must be granted directly to the role used by the connector. Grants cannot be inherited from role hierarchy.

For more information on creating custom roles and role hierarchies, see [Configuring access control](../../user-guide/security-access-control-configure.md).

## Required privileges

The connector requires the following privileges to create and manage Snowflake objects:

| Object | Privilege | When Required |
| --- | --- | --- |
| Database | USAGE | Always required |
| Schema | USAGE | Always required |
| Pipe | OPERATE | If using user-defined pipes |
| Destination table | INSERT | Always required |

## Next steps

[Set up Kafka](setup-kafka.md).

---
title: Snowflake High Performance connector for Kafka: Install and configure
source: https://docs.snowflake.com/en/connectors/kafkahp/setup-kafka.md
section: Connectors & Drivers
---

# Snowflake High Performance connector for Kafka: Install and configure

This topic describes the steps to install and configure the Snowflake High Performance connector for Kafka.

## Installing the Kafka connector

The Kafka connector is provided as a JAR (Java executable) file.

Snowflake provides two versions of the connector:

* A version for the [Confluent Kafka installation](https://www.confluent.io/hub/snowflakeinc/snowflake-kafka-connector/).
* A version for the open source software (OSS) Apache Kafka <https://mvnrepository.com/artifact/com.snowflake/snowflake-kafka-connector/> ecosystem.

The instructions in this topic specify which steps apply only to either version of the connector.

## Installation prerequisites

* The Kafka connector supports the following package versions:

  | Package | Snowflake Kafka Connector Version | Package Support (Tested by Snowflake) |
  | --- | --- | --- |
  | Apache Kafka | 2.0.0 (or later) | Apache Kafka 2.8.2, 3.7.2 |
  | Confluent | 2.0.0 (or later) | Confluent 6.2.15, 7.8.2 |
* The Kafka connector is built for use with Kafka Connect API 3.9.0. Later versions of the Kafka Connect API are untested.
  Versions prior to 3.9.0 are compatible with the connector.
  For more information, see [Kafka Compatibility](https://kafka.apache.org/protocol.html#protocol_compatibility).
* When you have both the Kafka connector and the JDBC driver jar files in your environment,
  ensure your JDBC version matches the `snowflake-jdbc` version specified in the `pom.xml` file of your intended Kafka connector version.
  You can go to your preferred Kafka connector release version, for example, [v4.0.0-rc4](https://github.com/snowflakedb/snowflake-kafka-connector/releases/tag/v4.0.0-rc4). Then browse `pom.xml` file to find out the version of `snowflake-jdbc`.
* If you are using Avro format for ingesting data:

  > + Use the Avro parser, version 1.8.2 (or higher), available from <https://mvnrepository.com/artifact/org.apache.avro>.
  > + If you use the schema registry feature with Avro, use version 5.0.0 (or higher) of the Kafka Connect Avro Converter available at <https://mvnrepository.com/artifact/io.confluent>.
  >
  >   Note that the schema registry feature is not available in the OSS Apache Kafka package.
* Configure Kafka with the desired data retention time and/or storage limit.
* Install and configure the Kafka Connect cluster.

  Each Kafka Connect cluster node should include enough RAM for the Kafka connector. The minimum recommended amount is 5 MB per Kafka partition. This is in addition to the RAM required for any other work that Kafka Connect is doing.
* We recommend using the same versions on Kafka Broker and Kafka Connect Runtime.
* We strongly recommend running your Kafka Connect instance in the same cloud
  provider [region](../../user-guide/intro-regions.md) as your Snowflake account. This is not strictly required, but typically improves throughput.

For a list of the operating systems supported by Snowflake clients, see [Operating system support](../../release-notes/requirements.md).

## Installing the connector

This section provides instructions for installing and configuring the Kafka connector for Confluent.
The following table describes the supported versions and information about pre-release and release candidates.

| Release Series | Status | Notes |
| --- | --- | --- |
| 4.x.x | Public Preview | Early access. **Supporting** Snowpipe Streaming High Performance Architecture <https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview> Currently the migration from 3.x and 2.x versions has to be done manually. It can not be used as a drop in replacement for earlier versions. It has a different feature set than version 3.x, 2.x, 1.x |
| 3.x.x | Officially supported | **Not supporting** Snowpipe Streaming High Performance Architecture <https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview>. |
| 2.x.x | Officially supported | Upgrade recommended. **Not supporting** Snowpipe Streaming High Performance Architecture <https://docs.snowflake.com/en/user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview>. |
| 1.x.x | Not supported | Do not use this release series. |

### Installing the connector for Confluent

#### Download the Kafka connector files

Download the Kafka connector JAR file from either of the following locations:

Confluent Hub:
:   <https://www.confluent.io/hub/>

    The package includes all dependencies required to use either an encrypted or unencrypted private key for key pair authentication.
    For more information, see Using key pair authentication and key rotation later in this topic.

Maven Central Repository:
:   <https://mvnrepository.com/artifact/com.snowflake>

    When using this version you need to download the [Bouncy Castle](https://www.bouncycastle.org/) cryptography libraries (a JAR files):

    > * <https://mvnrepository.com/artifact/org.bouncycastle/bc-fips/2.1.0>
    > * <https://mvnrepository.com/artifact/org.bouncycastle/bcpkix-fips/2.1.8>

    Download these files to the same local folder as the Kafka connector JAR file.

    The source code for the connector is available at <https://github.com/snowflakedb/snowflake-kafka-connector>.

#### Install the Kafka connector

Install the Kafka connector using the instructions provided for installing other connectors:

> <https://docs.confluent.io/current/connect/userguide.html>

### Installing the connector for open source Apache Kafka

This section provides instructions for installing and configuring the Kafka connector for open source Apache Kafka.

#### Install Apache Kafka

1. Download the Kafka package from the [Kafka official website](https://kafka.apache.org/downloads).
2. In a terminal window, change to the directory where you downloaded the package file.
3. Execute the following command to decompress the `kafka_<scala_version>-<kafka_version>.tgz` file:

   ```bash
   tar xzvf kafka_<scala_version>-<kafka_version>.tgz
   ```

#### Install the JDK

Install and configure the Java Development Kit (JDK) version 11 or higher.
Snowflake tests with the Standard Edition (SE) of the JDK. The Enterprise Edition (EE) is expected to be compatible but has not been tested.

If you have previously installed the JDK, you can skip this section.

1. Download the JDK from the [Oracle JDK website](https://www.oracle.com/technetwork/java/javase/downloads/index.html).
2. Install or decompress the JDK.
3. Following the instructions for your operating system, set the environment variable JAVA_HOME to point to the directory containing the JDK.

#### Download the Kafka connector JAR files

1. Download the Kafka connector JAR file from the Maven Central Repository:

   <https://mvnrepository.com/artifact/com.snowflake>
2. Download the [Bouncy Castle](https://www.bouncycastle.org/) cryptography library jar files:

   * <https://mvnrepository.com/artifact/org.bouncycastle/bc-fips/2.1.0>
   * <https://mvnrepository.com/artifact/org.bouncycastle/bcpkix-fips/2.1.8>
3. If your Kafka data is streamed in [Apache Avro](https://avro.apache.org/) format, download the Avro JAR file (1.11.4):

   * <https://mvnrepository.com/artifact/org.apache.avro/avro>

The source code for the connector is available at <https://github.com/snowflakedb/snowflake-kafka-connector>.

#### Install the Kafka connector

Copy the JAR files you downloaded in Installing the connector for open source Apache Kafka to the `<kafka_dir>/libs` folder.

## Configuring the Kafka connector

When deployed in standalone mode, the connector is configured by creating a file that
specifies parameters such as the Snowflake login credentials, topic name(s), Snowflake table name(s), and others.
When deployed in distributed mode the connector is configured by calling REST API endpoint of the kafka connect cluster.

> **Important:**
>
> The Kafka Connect framework broadcasts the configuration settings for the Kafka connector from the master node to worker nodes.
> Configuration settings include sensitive information, specifically, the Snowflake username and private key.
> Make sure to secure the communication channel between Kafka Connect nodes.
> For more information, see the documentation for your Apache Kafka software.

Each configuration specifies the topics and corresponding tables for one database and one schema in that database.
Note that a connector can ingest messages from
any number of topics, but the corresponding tables must all be stored in a single database and schema.

This section provides instructions for both the distributed and standalone modes.

For descriptions of the configuration fields, see Connector configuration properties.

> **Important:**
>
> Because the configuration file typically contains security related information,
> such as the private key, set read/write privileges appropriately on the file to limit access.
>
> In addition, consider storing the configuration file in a secure external
> location or a key management service. For more information, see Externalizing Secrets (in this topic).

### Distributed mode

Create the Kafka configuration file, e.g. `<path>/<config_file>.json`.
Populate the file with all connector configuration information.
The file must be in JSON format.

**Sample configuration file**

```json
{
  "name":"XYZCompanySensorData",
  "config":{
      "connector.class": "com.snowflake.kafka.connector.SnowflakeStreamingSinkConnector",
      "tasks.max": "1",
      "snowflake.topic2table.map": "topic1:table_1,topic2:table_2",
      "snowflake.url.name": "myorganization-myaccount.snowflakecomputing.com:443",
      "snowflake.warehouse.name": "WH",
      "snowflake.private.key": "-----BEGIN PRIVATE KEY-----\n .... \n-----END PRIVATE KEY-----\n",
      "snowflake.schema.name": "MY_SCHEMA",
      "snowflake.database.name": "MY_DATABASE",
      "snowflake.role.name": "MY_ROLE",
      "snowflake.user.name": "MY_USER",
      "value.converter": "org.apache.kafka.connect.json.JsonConverter",
      "key.converter": "org.apache.kafka.connect.storage.StringConverter",
      "errors.log.enable": "true",
      "topics": "topic1,topic2",
      "value.converter.schemas.enable": "false",
      "errors.tolerance": "none"
      }
}
```

### Standalone mode

Create a configuration file, for example `<kafka_dir>/config/SF_connect.properties`.
Populate the file with all connector
configuration information.

**Sample configuration file**

```properties
connector.class=com.snowflake.kafka.connector.SnowflakeStreamingSinkConnector
tasks.max=1
snowflake.topic2table.map=topic1:table_1,topic2:table_2
snowflake.url.name=myorganization-myaccount.snowflakecomputing.com:443
snowflake.warehouse.name=WH
snowflake.private.key=-----BEGIN PRIVATE KEY-----\n .... \n-----END PRIVATE KEY-----\n
snowflake.schema.name=MY_SCHEMA
snowflake.database.name=MY_DATABASE
snowflake.role.name=MY_ROLE
snowflake.user.name=MY_USER
value.converter=org.apache.kafka.connect.json.JsonConverter
key.converter=org.apache.kafka.connect.storage.StringConverter
errors.log.enable=true
topics=topic1,topic2
name=XYZCompanySensorData
value.converter.schemas.enable=false
errors.tolerance=none
```

## Cache considerations for testing and prototyping

The connector caches table and pipe lookup checks to improve performance during partition rebalances.
However, during testing and prototyping, this caching behavior can cause the connector
to not immediately detect manually created tables or pipes.

**Issue:** When you manually create a table or pipe while the connector is running,
the connector may continue to use cached existence check results (which may indicate the object doesn’t exist) for up to 5 minutes by default. This can lead to unexpected errors or behavior during testing.

**Recommendation for testing:** To avoid cache-related issues during testing and prototyping,
configure both cache expiration parameters to their minimum value of `1` millisecond or disable the caching:

```properties
snowflake.cache.table.exists.expire.ms=1
snowflake.cache.pipe.exists.expire.ms=1
```

This configuration ensures that the connector performs fresh existence checks on every partition rebalance, allowing you to see the effects of manually created tables and pipes immediately.

> **Important:**
>
> These minimal cache settings are recommended **only for testing and prototyping**. In production environments, use the default cache expiration values (5 minutes or greater) to minimize metadata queries to Snowflake and improve rebalance performance, especially when handling many partitions.

## Connector configuration properties

### Required properties

`name`
:   Application name. This must be unique across all Kafka connectors used by the customer. This name must be a valid Snowflake unquoted identifier. For information about valid identifiers, see [Identifier requirements](../../sql-reference/identifiers-syntax.md).

`connector.class`
:   `com.snowflake.kafka.connector.SnowflakeStreamingSinkConnector`

`topics`
:   Comma-separated list of topics. By default, Snowflake assumes that the table name is the same as the topic name. If the table name is not the same as the topic name, then use the optional `topic2table.map` parameter (below) to specify the mapping from topic name to table name. This table name must be a valid Snowflake unquoted identifier. For information about valid table names, see [Identifier requirements](../../sql-reference/identifiers-syntax.md).

    > **Note:**
    >
    > Either `topics` or `topics.regex` is required; not both.

`topics.regex`
:   This is a regular expression (“regex”) that specifies the topics that contain the messages to load into Snowflake tables. The connector loads data from any topic name that matches the regex. The regex must follow the rules for Java regular expressions (i.e. be compatible with java.util.regex.Pattern). The configuration file should contain either `topics` or `topics.regex`, not both.

`snowflake.url.name`
:   The URL for accessing your Snowflake account. This URL must include your [account identifier](../../user-guide/admin-account-identifier.md). Note that the protocol (`https://`) and port number are optional.

`snowflake.user.name`
:   User login name for the Snowflake account.

`snowflake.role.name`
:   The name of the role that the connector will use to insert data into the table.

`snowflake.private.key`
:   The private key to authenticate the user. Include only the key, not the header or footer. If the key is split across multiple lines, remove the line breaks. You can provide an unencrypted key, or you can provide an encrypted key and provide the `snowflake.private.key.passphrase` parameter to enable Snowflake to decrypt the key. Use this parameter if and only if the `snowflake.private.key` parameter value is encrypted. This decrypts private keys that were encrypted according to the instructions in [Key-pair authentication and key-pair rotation](../../user-guide/key-pair-auth.md).

    > **Note:**
    >
    > Also see `snowflake.private.key.passphrase` in Optional properties.

`snowflake.database.name`
:   The name of the database that contains the table to insert rows into.

`snowflake.schema.name`
:   The name of the schema that contains the table to insert rows into.

`header.converter`
:   Required only if the records are formatted in Avro and include a header. The value is `"org.apache.kafka.connect.storage.StringConverter"`.

`key.converter`
:   This is the Kafka record’s key converter (e.g. `"org.apache.kafka.connect.storage.StringConverter"`). This is not used by the Kafka connector, but is required by the Kafka Connect Platform.

    See [Kafka connector limitations](../../user-guide/kafka-connector-overview.md) for current limitations.

`value.converter`
:   The connector supports standard Kafka community converters. Choose the appropriate converter based on your data format:

    * For JSON records: `"org.apache.kafka.connect.json.JsonConverter"`
    * For Avro records with Schema Registry: `"io.confluent.connect.avro.AvroConverter"`

    See [Kafka connector limitations](../../user-guide/kafka-connector-overview.md) for current limitations.

### Optional properties

`snowflake.private.key.passphrase`
:   If the value of this parameter is not empty, the connector uses this phrase to try to decrypt the private key.

`tasks.max`
:   Number of tasks, usually the same as the number of CPU cores across the worker nodes in the Kafka Connect cluster. To achieve best performance, Snowflake recommends setting the number of tasks equal to the total number of Kafka partitions, but not exceeding the number of CPU cores. High number of tasks may result in an increased memory consumption and frequent rebalances.

`snowflake.topic2table.map`
:   This optional parameter lets a user specify which topics should be mapped to which tables. Each topic and its table name should be separated by a colon (see example below). This table name must be a valid Snowflake unquoted identifier. For information about valid table names, see [Identifier requirements](../../sql-reference/identifiers-syntax.md). The topic configuration allows use of regular expressions to define topics, just as the use of `topics.regex` does. The regular expressions cannot be ambiguous — any matched topic must match only a single target table.

    Example:

    ```none
    topics="topic1,topic2,topic5,topic6"
    snowflake.topic2table.map="topic1:low_range,topic2:low_range,topic5:high_range,topic6:high_range"
    ```

    could be written as:

    ```none
    topics.regex="topic[0-9]"
    snowflake.topic2table.map="topic[0-4]:low_range,topic[5-9]:high_range"
    ```

`value.converter.schema.registry.url`
:   If the format is Avro and you are using a Schema Registry Service, this should be the URL of the Schema Registry Service. Otherwise this field should be empty.

`value.converter.break.on.schema.registry.error`
:   If loading Avro data from the Schema Registry Service, this property determines if the Kafka connector should stop consuming records if it encounters an error while fetching the schema id. The default value is `false`. Set the value to `true` to enable this behavior.

`jvm.proxy.host`
:   To enable the Snowflake Kafka Connector to access Snowflake through a proxy server, set this parameter to specify the host of that proxy server.

`jvm.proxy.port`
:   To enable the Snowflake Kafka Connector to access Snowflake through a proxy server, set this parameter to specify the port of that proxy server.

`snowflake.streaming.max.client.lag`
:   Specifies how often the connector flushes the data to Snowflake, in seconds.

    Values:
    :   * Minimum: `1` second
        * Maximum: `600` seconds

    Default:
    :   `1` second

`jvm.proxy.username`
:   Username that authenticates with the proxy server.

`jvm.proxy.password`
:   Password for the username that authenticates with the proxy server.

`snowflake.jdbc.map`
:   Example: `"snowflake.jdbc.map": "networkTimeout:20,tracing:WARNING"`

    Additional JDBC properties (see [JDBC Driver connection parameter reference](../../developer-guide/jdbc/jdbc-parameters.md)) are not validated. These additional properties
    are not validated, and must not override nor be used instead of required properties such as: `jvm.proxy.xxx`,
    `snowflake.user.name`, `snowflake.private.key`, `snowflake.schema.name` etc.

    Specifying either of the following combinations:
    :   * `tracing` property along with `JDBC_TRACE` env variable
        * `database` property along with `snowflake.database.name`

    Will result in an ambiguous behavior and the behavior will be determined by the JDBC Driver.

`value.converter.basic.auth.credentials.source`
:   If you are using the Avro data format and require secure access to the Kafka schema registry, set this parameter to the string “USER_INFO”, and set the `value.converter.basic.auth.user.info` parameter described below. Otherwise, omit this parameter.

`value.converter.basic.auth.user.info`
:   If you are using the Avro data format and require secure access to the Kafka schema registry, set this parameter to the string “<user_ID>:<password>”, and set the value.converter.basic.auth.credentials.source parameter described above. Otherwise, omit this parameter.

`snowflake.metadata.createtime`
:   If value is set to FALSE, the `CreateTime` property value is omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

`snowflake.metadata.topic`
:   If value is set to FALSE, the `topic` property value is omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

`snowflake.metadata.offset.and.partition`
:   If value is set to FALSE, the `Offset` and `Partition` property values are omitted from the metadata in the RECORD_METADATA column. The default value is TRUE.

`snowflake.metadata.all`
:   If value is set to FALSE, the metadata in the RECORD_METADATA column is completely empty. The default value is TRUE.

`transforms`
:   Specify to skip tombstone records encountered by the Kafka connector and not load them into the target table. A tombstone record is
    defined as a record where the entire value field is null.

    Set the property value to `"tombstoneHandlerExample"`.

    > **Note:**
    >
    > Use this property with the Kafka community converters (i.e. `value.converter` property value) only (e.g.
    > `org.apache.kafka.connect.json.JsonConverter` or `org.apache.kafka.connect.json.AvroConverter`). To manage tombstone record
    > handling with the Snowflake converters, use the `behavior.on.null.values` property instead.

`transforms.tombstoneHandlerExample.type`
:   Required when setting the `transforms` property.

    Set the property value to `"io.confluent.connect.transforms.TombstoneHandler"`

`behavior.on.null.values`
:   Specify how the Kafka connector should handle tombstone records. A tombstone record is defined as a record where the entire value field
    is null. For [Snowpipe](../../user-guide/data-load-snowpipe-intro.md), this property is supported by the Kafka connector version 1.5.5 and later. For [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md), this property is supported by the Kafka connector version 2.1.0 and later.

    This property supports the following values:

    `DEFAULT`
    :   When the Kafka connector encounters a tombstone record, it inserts an empty JSON string in the content column.

    `IGNORE`
    :   The Kafka connector skips tombstone records and does not insert rows for these records.

    The default value is `DEFAULT`.

    > **Note:**
    >
    > Tombstone records ingestion varies by the ingestion methods:
    >
    > * For Snowpipe, the Kafka connector uses Snowflake converters only. To manage tombstone record handling with the Kafka community converters, use the `transform` and `transforms.tombstoneHandlerExample.type` properties instead.
    > * For Snowpipe Streaming, the Kafka connector uses community converters only.
    >
    > Records sent to Kafka brokers must not be NULL because these records will be dropped by the Kafka connector resulting in missing offsets. The missing offsets will break the Kafka connector in specific use cases. It is recommended that you use tombstone records instead of NULL records.

### Using key pair authentication and key rotation

The Kafka connector relies on key pair authentication instead of username and password authentication.
This authentication method requires a 2048-bit (minimum) RSA key pair.
Generate the public-private key pair using OpenSSL.
The public key is assigned to the Snowflake user defined in the configuration file.

After completing the key pair authentication tasks on this page and the tasks for [key pair rotation](../../user-guide/key-pair-auth.md),
evaluate the recommendation for Externalizing secrets, later in this topic.

To configure the public/private key pair:

1. From the command line in a terminal window, generate a private key.

   You can generate either an encrypted version or unencrypted version of the private key.

   > **Note:**
   >
   > The Kafka connector supports encryption algorithms that are validated to meet the Federal Information Processing Standard (140-2) (i.e. FIPS 140-2) requirements. For more information, see [FIPS 140-2](https://csrc.nist.gov/publications/detail/fips/140/2/final).

   To generate an unencrypted version, use the following command:

   > ```bash
   > $ openssl genrsa -out rsa_key.pem 2048
   > ```

   To generate an encrypted version, use the following command:

   > ```bash
   > $ openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 <algorithm> -inform PEM -out rsa_key.p8
   > ```
   >
   > Where `<algorithm>` is a FIPS 140-2 compliant encryption algorithm.
   >
   > For example, to specify AES 256 as the encryption algorithm:
   >
   > ```bash
   > $ openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 aes256 -inform PEM -out rsa_key.p8
   > ```
   >
   > If you generate an encrypted version of the private key, record the passphrase.
   > Later, you will specify the passphrase in the `snowflake.private.key.passphrase` property in the Kafka configuration file.

   **Sample PEM private key**

   ```bash
   -----BEGIN ENCRYPTED PRIVATE KEY-----
   MIIE6TAbBgkqhkiG9w0BBQMwDgQILYPyCppzOwECAggABIIEyLiGSpeeGSe3xHP1
   wHLjfCYycUPennlX2bd8yX8xOxGSGfvB+99+PmSlex0FmY9ov1J8H1H9Y3lMWXbL
   ...
   -----END ENCRYPTED PRIVATE KEY-----
   ```
2. From the command line, generate the public key by referencing the private key:

   Assuming the private key is encrypted and contained in the file named `rsa_key.p8`, use the following command:

   ```bash
   $ openssl rsa -in rsa_key.p8 -pubout -out rsa_key.pub
   ```

   **Sample PEM public key**

   ```bash
   -----BEGIN PUBLIC KEY-----
   MIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAy+Fw2qv4Roud3l6tjPH4
   zxybHjmZ5rhtCz9jppCV8UTWvEXxa88IGRIHbJ/PwKW/mR8LXdfI7l/9vCMXX4mk
   ...
   -----END PUBLIC KEY-----
   ```
3. Copy the public and private key files to a local directory for storage.
   Note the path to the files.
   The private key is stored using the PKCS#8 (Public Key Cryptography Standards) format
   and is encrypted using the passphrase you specified in the previous step; however,
   the file should still be protected from unauthorized access using the file permission mechanism provided by your
   operating system. It is the users responsibility to secure the file when it is not in use.
4. Log into Snowflake. Assign the public key to the Snowflake user using [ALTER USER](../../sql-reference/sql/alter-user.md).

   For example:

   > ```sqlexample
   > ALTER USER jsmith SET RSA_PUBLIC_KEY='MIIBIjANBgkqh...';
   > ```

   > **Note:**
   > * Only security administrators (i.e. users with the SECURITYADMIN role) or higher can alter a user.
   > * Exclude the public key header and footer in the SQL statement.

   Verify the user’s public key fingerprint using [DESCRIBE USER](../../sql-reference/sql/desc-user.md):

   ```sqlexample
   DESC USER jsmith;
   +-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------+
   | property                      | value                                               | default | description                                                                   |
   |-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------|
   | NAME                          | JSMITH                                              | null    | Name                                                                          |
   ...
   ...
   | RSA_PUBLIC_KEY_FP             | SHA256:nvnONUsfiuycCLMXIEWG4eTp4FjhVUZQUQbNpbSHXiA= | null    | Fingerprint of user's RSA public key.                                         |
   | RSA_PUBLIC_KEY_2_FP           | null                                                | null    | Fingerprint of user's second RSA public key.                                  |
   +-------------------------------+-----------------------------------------------------+---------+-------------------------------------------------------------------------------+
   ```

   > **Note:**
   >
   > The `RSA_PUBLIC_KEY_2_FP` property is described in [Configuring key-pair rotation](../../user-guide/key-pair-auth.md).
5. Copy and paste the entire private key into the `snowflake.private.key` field in the configuration file. Save the file.

#### Externalizing secrets

Snowflake strongly recommends externalizing secrets such as the private key and storing
them in an encrypted form or in a key management service such as AWS Key Management Service (KMS),
Microsoft Azure Key Vault, or HashiCorp Vault.
This can be accomplished by using a `ConfigProvider` implementation on your Kafka Connect cluster.

For more information, see the Confluent description of this [service](https://docs.confluent.io/current/connect/security.html#externalizing-secrets).

## Starting the connector

Start Kafka using the instructions provided in the third-party Confluent or Apache Kafka documentation.
You can start the Kafka connector in either distributed mode or standalone mode. Instructions for each are shown below:

### Distributed mode

In a terminal window, execute the following command:

```bash
curl -X POST -H "Content-Type: application/json" --data @<path>/<config_file>.json http://localhost:8083/connectors
```

### Standalone mode

In a terminal window, execute the following command:

```bash
<kafka_dir>/bin/connect-standalone.sh <kafka_dir>/<path>/connect-standalone.properties <kafka_dir>/config/SF_connect.properties
```

> **Note:**
>
> A default installation of Apache Kafka or Confluent Kafka should already include the file `connect-standalone.properties`.)

## Next steps

[test the connector](test-connector.md).

---
title: Snowflake High Performance connector for Kafka: Test the connector
source: https://docs.snowflake.com/en/connectors/kafkahp/test-connector.md
section: Connectors & Drivers
---

# Snowflake High Performance connector for Kafka: Test the connector

This topic describes the steps to test the Snowflake High Performance connector for Kafka connector itself.

## Testing and using the Kafka connector

Snowflake recommends testing the Kafka connector with a small amount of data before using the connector in a production system.

1. Verify that Kafka and Kafka Connect are running.
2. Verify that you have created the appropriate Kafka topic.
3. Create (or use an existing) message publisher. Make sure that the messages published to the topic have the right format (JSON, Avro, Protobuf).
4. Create a configuration that specifies the topic to subscribe to and the Snowflake table to write to.
5. Grant the minimum privileges required on the Snowflake objects (database, schema) to the role that will be used to ingest data.
6. Publish a sample set of data to the configured Kafka topic.
7. Wait a few seconds for data to propagate through the system, and then check the Snowflake table to verify that the records were inserted.

---
title: Stored procedures for daily maintenance
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-worksheet-daily-use.md
section: Connectors & Drivers
---

# Stored procedures for daily maintenance

## Introduction

The Snowflake Connector for Google Analytics Aggregate Data provides several stored procedures that allow you to manage your data ingestion and connector configuration programmatically.
Below are detailed descriptions of each stored procedure, including their usage, parameters, and examples.

### Configuring a new report

The `CONFIGURE_REPORT` procedure configures a new report for data ingestion from Google Analytics 4 (GA4) into Snowflake.
This procedure takes the report’s parameters as an input, including dimensions, metrics, and ingestion schedule.

> ```sqlsyntax
> CALL CONFIGURE_REPORT( <report_name>, <property_id>, <dimensions>, <metrics>, <start_date>, <refresh_interval>[, <keep_empty_rows>][, <avoid_sampling>]);
> ```

Where:

`report_name`
:   A string that specifies the name of the report to configure. This name will be used as a prefix for the tables with a raw data created in the destination database.
    After initial ingestion, the report name will be used as a name for the views created in the destination database. **Required.**

    The report name must:
    :   * start with either a letter (uppercase or lowercase) or an underscore.
        * continue with one or more characters that can be letters (uppercase or lowercase), digits, or underscores.
        * be 150 characters or less.

`property_id`
:   A string that specifies the Google Analytics 4 property id to use for the report. The property id has a form of a number obtainable from GA4 account.
    Ensure that the PROPERTY_ID corresponds to a GA4 property accessible by the connectors authentication method (oauth2 or service account). **Required.**

`dimensions`
:   A comma-separated list of GA4 dimensions to include in the report. The dimensions must be separated by commas.
    If `date` dimension is not explicitly specified, it will be added automatically.
    At most nine dimensions can be specified (including `date`). **Required.**

    Example: `'country,city,deviceCategory,sessions'`

`metrics`
:   A comma-separated list of GA4 metrics to include in the report.
    At least one metric is required, with a maximum of ten metrics. **Required.**

    Example: `'sessions,pageViews'.`

`start_date`
:   A string that specifies the start date for the report. The date must be in the format `YYYY-MM-DD`.
    Data from this date onwards will be ingested. **Required.**

`refresh_interval`
:   A string that specifies the refresh interval for the report. **Required.** The interval must have one of the following formats:

    > * `'EVERY <number of minutes> MINUTE'`
    > * `'EVERY <number of hours> HOUR'`
    > * `'EVERY <number of days> DAY'`

`keep_empty_rows`
:   Optional. Default value is `false`. If `true`, the output table includes records with dimension combinations where all metrics are zero.
    Useful for analyzing dimension combinations with no events.

`avoid_sampling`
:   Optional. Default value is `false`. If `true`, the connector may adjust the way it fetches the data from GA4 API to try to avoid data sampling.
    Can improve data accuracy but may increase API call frequency.

    > **Note:**
    >
    > There is no guarantee that the data will be unsampled. The connector will try to avoid sampling, but it may not be possible in all cases.
    > This is due to the limitations of the GA4 API.

    **Example:**

    ```sqlsyntax
    CALL CONFIGURE_REPORT(
        REPORT_NAME => 'USER_ENGAGEMENT_REPORT',
        PROPERTY_ID => '123456789',
        DIMENSIONS => 'country,deviceCategory',
        METRICS => 'activeUsers,newUsers',
        START_DATE => '2023-01-01',
        REFRESH_INTERVAL => 'EVERY 1 DAY',
        KEEP_EMPTY_ROWS => TRUE,
        AVOID_SAMPLING => TRUE
    );
    ```

### Removing existing report

The `DELETE_REPORT` procedure deletes an existing report configuration from the connector, stopping any further data ingestion for that report. Data that has already been ingested will not be removed.

> ```sqlsyntax
> CALL DELETE_REPORT( <report_name> );
> ```

Where:

`report_name`
:   A string that specifies the name of the report to delete.
    Must match the REPORT_NAME used in CONFIGURE_REPORT. **Required.**

    **Example:**

    ```sqlsyntax
    CALL DELETE_REPORT('USER_ENGAGEMENT_REPORT');
    ```

### Listing properties from Google Analytics 4 account

The `GET_PROPERTIES` procedure returns a list of all the Google Analytics 4 properties that are available for ingestion by the connector.

> ```sqlsyntax
> CALL GET_PROPERTIES();
> ```

Example output from the procedure:

> ```json
> {[
>   { "propertyName": "test1", "propertyId": "1" },
>   { "propertyName": "test2", "propertyId": "2" },
>   { "propertyName": "test3", "propertyId": "3" }
> ]}
> ```
>
> > **Note:**
> >
> > The connector must have the necessary permissions to access the properties. If a result is empty, verify access rights in GA4.

### Fetching dimensions and metric for GA4 property

The `GET_DIMENSIONS_AND_METRICS` procedure retrieves the list of available dimensions and metrics for a specified GA4 property.

> ```sqlsyntax
> CALL GET_DIMENSIONS_AND_METRICS( <property_id> );
> ```

Where:

`property_id`

> A string that specifies the Google Analytics 4 property id to use for the report. The property id has a form of a number obtainable from GA4 account.
> Ensure that the `property_id` corresponds to a GA4 property accessible by the connectors authentication method (oauth2 or service account). **Required.**
>
> **Example:**
>
> > ```sqlsyntax
> > CALL GET_DIMENSIONS_AND_METRICS('123456789');
> > ```
>
> **Example output from the procedure:**
>
> ```json
> {
>   "dimensions": [
>     {
>       "dimension": "achievementId", "category": "Other", "description": "Some description."
>     }
>   ],
>   "metrics": [
>     {
>       "metric": "active1DayUsers", "category": "User", "description": "Some description."
>     },
>     {
>       "metric": "active28DayUsers", "category": "User", "description": "Some description."
>     }
>   ]
> }
> ```
>
> > **Note:**
> >
> > The available dimensions and metrics may vary between properties.

### Pausing the connector

The `PAUSE_CONNECTOR` procedure pauses the connector, stopping all data ingestion and processing.

> ```sqlsyntax
> CALL PAUSE_CONNECTOR();
> ```
>
> > **Note:**
> >
> > * Pausing the connector halts data ingestion for all configured reports.
> > * Data ingestion can be resumed using RESUME_CONNECTOR.
> > * Existing data remains accessible during the pause.

### Resuming the connector

The `RESUME_CONNECTOR` procedure resumes the connector, starting all data ingestion and processing that was previously paused.
Data ingestion will continue from the point where it was paused.

> ```sqlsyntax
> CALL RESUME_CONNECTOR();
> ```

### On demand ingestion

The `INGEST_NOW` procedure schedules data ingestion for the specified report in the connector.
This procedure can be used to manually initiate data ingestion for a specific report outside of the scheduled intervals.

> > **Note:**
> >
> > The procedure schedules immediate ingestion for the specified report using `EXECUTE TASK ...`.
> > That means that the ingestion will start as soon as possible, but it may not be instantaneous
> > especially if ingestion for the same report is already in progress. Ensure that the connector is not paused before calling this procedure.
>
> ```sqlsyntax
> CALL INGEST_NOW('<report_name>');
> ```

Where:

`report_name`
:   A string that specifies the name of the report to ingest.
    Must match the REPORT_NAME used in CONFIGURE_REPORT. **Required.**

    **Example:**

    ```sqlsyntax
    CALL INGEST_NOW('USER_ENGAGEMENT_REPORT');
    ```

### Getting the current status of the connector

> > **Note:**
> >
> > Connector `state` and connector `status` are used interchangeably in the context of this document.
> > The status/state of the connector can be retrieved using the `GET_CONNECTOR_STATUS` procedure.
>
> ```sqlsyntax
> CALL GET_CONNECTOR_STATUS();
> ```
>
> Example output from the procedure:
>
> ```json
> {
>   "response_code": "OK",
>   "status": "STARTED",
>   "configurationStatus": "PREREQUISITES_DONE"
> }
> ```

The procedure returns a JSON object with the following fields:

* `response_code` - If the procedure has been executed successfully **OK** value is returned. Response code other than **OK** indicates an error.
* `status` - The current status of the connector. This status can change only when you re/install, pause, resume connector or finalize the configuration process.
  It can have one of the following values:

  + `CONFIGURING` - This is the default state set after the connector is installed from the listing or application package.
    The connector remains in this state until the configuration process is finalized. After the configuration is finalized,
    the connector transitions to the STARTED state.
  + `STARTED` - Once the connector is fully configured or resumed it transitions to the STARTED state.
  + `PAUSED` - When the connector is successfully paused it transitions to the PAUSED state.
  + `ERROR` - If the connector encounters an irreversible error, it transitions to the ERROR state, indicating it cannot be actively used.
    When in this state, `RECOVER_CONNECTOR_STATE` procedure can be used in order to transition to a valid state.
* `configurationStatus` - This is a sub-status of the main `CONFIGURING` status. The connector configuration process is divided into few steps
  which are being tracked by this sub-status. It can have one of the following values:

  + `INSTALLED` - The configuration starts in the INSTALLED state after the connector instance has been created.
  + `PREREQUISITES_DONE` - After the user completes the prerequisites and calls `MARK_ALL_PREREQUISITES_AS_DONE` procedure
    the configuration transitions to the PREREQUISITES_DONE state.
    Prerequisites are manual steps that needs to be executed by the user, e.g. configuring the connection to third party data source
    or creating destination database.
  + `CONFIGURED` - The `CONFIGURE_CONNECTOR(VARIANT)` procedure has been called.
  + `CONNECTED` - The `SET_CONNECTION_CONFIGURATION(VARIANT)` procedure has been called.
  + `FINALIZED` - Finally, after completing the configuration, the configuration transitions to the FINALIZED state
    (the `FINALIZE_CONNECTOR_CONFIGURATION(VARIANT)` procedure has been called).

### Restarting configuration process

The `RESET_CONFIGURATION` stored procedure resets the connector’s configuration to its default state.
This procedure can be used to reset the connector’s configuration before the configuration has been finalized.
In order for the procedure to work, the connector must be in `CONFIGURING` status.
To know more about the connector main statuses and configuration sub-statuses refer to Getting the current status of the connector.

If configuration phase is completed (FINALIZED) this procedure will return an error.

> ```sqlsyntax
> CALL RESET_CONFIGURATION();
> ```

### Recovering from intermediate or erroneous state

The `RECOVER_CONNECTOR_STATE` procedure is intended to recover the connector when it is stuck in an intermediate or error state (`ERROR`, `STARTING`, `PAUSING`)
by manually setting its status to either `STARTED` or `PAUSED`.
Some operations may leave the connector in an inconsistent state and it may happen for various reasons.
For example when the user will drop permissions to certain database objects the connector needs.

The procedure will return an error if the new state is not valid or if the connector is in not in one of predetermined states.
The following transitions are allowed:

> * ERROR -> PAUSED
> * ERROR -> STARTED
> * PAUSING -> PAUSED
> * PAUSING -> STARTED
> * STARTING -> PAUSED
> * STARTING -> STARTED
>
> ```sqlsyntax
> CALL RECOVER_CONNECTOR_STATE('<new_state>');
> ```

Where:

`new_state`
:   A string that specifies the new state for the connector. The state must be either `STARTED` or `PAUSED`. **Required.**

    **Example:**

    ```sqlsyntax
    CALL RECOVER_CONNECTOR_STATE('STARTED');
    ```

### Recovering reports after a connector has been dropped

The `IMPORT_STATE` procedure is used to recover configured reports and ingestion history after the connector has been uninstalled.
This procedure is intended to be used after the connector has been reinstalled and the new connector has been configured to use the same
database that was used by the uninstalled one. The state that is being imported is located in the destination database used by the previous instance of the connector in the form of tables
with `SFSDKEXPORT` suffix. To read more about the process read the [Disaster recovery](gaad-connector-disaster-recovery.md) guide. The procedure will not overwrite
the existing state in the connector if it detects the state is not pristine unless the `force` parameter is set to `true`. Pristine state is a state right after configuration process
is finalized and no reports are configured. If reports were configured but later deleted the state is also assumed to be not pristine.

> > **Note:**
> >
> > When the connector was uninstalled (dropped) with the `CASCADE` options this procedure will not work.
>
> ```sqlsyntax
> CALL IMPORT_STATE([force]);
> ```

Where:

`force`
:   Optional. Default value is `false`. If `true`, the procedure will overwrite the existing state in the connector. Including any reports that are already configured.
    If `false`, the procedure will return an error if it detects that the state is not pristine.

    **Example:**

    ```sqlsyntax
    CALL IMPORT_STATE();
    ```

---
title: The Apache Kafka and Kafka connect framework
source: https://docs.snowflake.com/en/connectors/kafkahp/aboutkafkaconnect.md
section: Connectors & Drivers
---

# The Apache Kafka and Kafka connect framework

This topic describes the basic concepts of Apache Kafka and Kafka Connect Framework.

Apache Kafka software uses a publish and subscribe model to write and read streams of records,
similar to a message queue or enterprise messaging system.
Kafka allows processes to read and write messages asynchronously.
A subscriber does not need to be connected directly to a publisher; a publisher can queue a message in Kafka for the subscriber to receive later.

An application publishes messages to a *topic*, and an application subscribes to a topic to receive those messages.

Kafka Connect is a framework for connecting Kafka with external systems, including databases.
A Kafka Connect cluster is a separate cluster from the Kafka cluster.
The Kafka Connect cluster supports running and scaling out connectors (components that support reading and/or writing to external systems).

Kafka Connect can be used with two types of connectors:

* **Source connectors**: Import data from external systems into Kafka topics.
* **Sink connectors**: Export data from Kafka topics to external systems.

The High Performance Snowflake Connector for Kafka is a sink connector that reads data from Kafka topics and loads it into Snowflake tables.

Kafka Connect handles common operational concerns such as:

* Scalability: Kafka Connect can scale horizontally by adding more worker nodes to the cluster.
* Fault tolerance: If a worker node fails, Kafka Connect automatically redistributes the work to other available nodes.
* Offset management: Kafka Connect tracks which records have been processed, ensuring that data is not lost or duplicated in case of failures.
* Configuration management: Connectors can be configured and managed through a REST API, making it easier to deploy and monitor data pipelines.

---
title: Troubleshooting the connector
source: https://docs.snowflake.com/en/connectors/servicenow/troubleshooting.md
section: Connectors & Drivers
---

# Troubleshooting the connector

This topic provides guidelines for troubleshooting issues with the Snowflake Connector for ServiceNow®.

> **Note:**
>
> The following sections describe stored procedures that are defined in the PUBLIC schema of
> [the connector application](installing-sql.md). Before
> calling these stored procedures, select that application as the database to use for the session.
>
> For example, if that application is named `my_connector_servicenow` and you would call the `TEST_CONNECTION`
> connector procedure by running the following commands:
>
> ```sqlexample
> USE APPLICATION my_connector_servicenow;
> CALL TEST_CONNECTION();
> ```

## Resolving problems during connector installation

Most common issues during the installation of the connector are related to the ACLs set on the metadata tables such as
`sys_db_object`, `sys_dictionary` and `sys_glide_object`. Additionally, the connector requires access to the
`sys_table_rotation` table to determine the correct ingestion strategy and optionally to the journal table (usually
`sys_audit_delete`) to propagate data deletion.

### Authentication step errors

Issues can occur when [connecting to ServiceNow](installing-snowsight.md) in the installation wizard or
running manually the [SET_CONNECTION_CONFIGURATION](installing-sql.md) procedure.
If encountered errors during this step, please make sure that the user used to install the connector has access to the `sys_db_object` table.

The error status codes that might be related to ACL issues in the returned JSON object from the `SET_CONNECTION_CONFIGURATION` procedure are as follows:

* `REQUEST_FAILED`

You can perform below query similar to the connector’s to verify the access. Until the request doesn’t return the expected result,
it won’t be possible to install the connector. For example, if you are using curl to send the HTTP request:

```bash
# checking access to the sys_db_object table
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/sys_db_object?sysparm_limit=1"
```

Where:
:   `servicenow_instance`
    :   Specifies the name of your ServiceNow® instance.

    `username` and `password`
    :   Specify the credentials for your ServiceNow® instance.

Example responses:

* At least some of the fields are returned - the user has the necessary permissions to access the table.
* The response is empty - the user has the permission to access the table, but not to the processed record. It might cause
  issues at a later point.
* The response contains an error - the user does not have the necessary permissions to access the table.

### Validate source step errors

Issues might occur when [validating source](installing-snowsight.md) in the installation wizard or
running manually the [FINALIZE_CONNECTOR_CONFIGURATION](installing-sql.md) procedure.
If encountered errors during this step, please make sure that the user used to install the connector has the necessary permissions to access the metadata tables.

The error status codes that might be related to ACL issues in the returned JSON object from the `FINALIZE_CONNECTOR_CONFIGURATION` procedure are as follows:

* `METADATA_TABLE_ACCESS_VALIDATION_ERROR`
* `JOURNAL_TABLE_ACCESS_VALIDATION_ERROR`

You can perform below queries similar to the connector’s to verify the access. Until the requests don’t return the expected results,
it won’t be possible to install the connector. For example, if you are using curl to send the HTTP request:

```bash
# checking access to the sys_db_object table
# expected fields in the result object: sys_id, super_class, name
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/sys_db_object?sysparm_fields=sys_id,super_class,name&sysparm_limit=1&sysparm_query=name=sys_db_object"

# checking access to the sys_dictionary table
# expected fields in the result object: sys_id, name, element, internal_type
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/sys_dictionary?sysparm_fields=sys_id,name,element,internal_type&sysparm_limit=1&sysparm_query=name=sys_dictionary"

# checking access to the sys_glide_object table
# expected fields in the result object: sys_id, name, scalar_type
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/sys_glide_object?sysparm_fields=sys_id,name,scalar_type&sysparm_limit=1&sysparm_query=name=datetime"

# checking access to the sys_table_rotation table
# expected fields in the result object: sys_id, name
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/sys_table_rotation?sysparm_fields=sys_id,name&sysparm_limit=1&sysparm_query=name=syslog"

# (optional) - check only if deletions auditing is going to be used
# checking access to the journal table
# if known, "&sysparm_query=tablename=<table_name>" or "&sysparm_query=documentkey=<sys_id>" can be appended to the request
# expected fields in the result object: sys_id, sys_created_on, documentkey, tablename
curl -u '<username>:<password>' "https://<servicenow_instance>.service-now.com/api/now/table/<journal_table>?sysparm_fields=sys_id,sys_created_on,documentkey,tablename&sysparm_limit=1"
```

Where:
:   `servicenow_instance`
    :   Specifies the name of your ServiceNow® instance.

    `username` and `password`
    :   Specify the credentials for your ServiceNow® instance.

    `journal_table`
    :   Specifies the name of your ServiceNow® table used for deletions audit. Usually this has value of `sys_audit_delete`.

Example responses:

* All of the expected fields are present in the response - the user has the necessary permissions.
* Some of the expected fields are missing - the user does not have the necessary permissions to all of the columns.
* The response is empty - the user does not have the necessary permissions to all of the rows.
* The response contains an error - the user does not have the necessary permissions to the table.

## Verifying the connection to the ServiceNow® instance

To verify that the Snowflake Connector for ServiceNow® can access the ServiceNow® instance, call the
`TEST_CONNECTION` stored procedure:

```sqlsyntax
CALL TEST_CONNECTION();
```

If the connector is set up correctly, the stored procedure returns the following response:

```json
{
  "responseCode": "OK",
  "message": "Test request to ServiceNow succeeded."
}
```

## Verifying access to the specific table in the ServiceNow® instance

To verify that the Snowflake Connector for ServiceNow® can acces data from the specific table in the ServiceNow® instance, call the
`TEST_TABLE_ACCESS` stored procedure:

```sqlsyntax
CALL TEST_TABLE_ACCESS('<table_name>');
```

Where:

`table_name`
:   Specifies the name of a table in the ServiceNow® instance.

If the connector is set up correctly and data is available to the user used by the connector, the stored procedure returns the following response:

```json
{
  "responseCode": "OK",
  "message": "Test request to ServiceNow® succeeded."
}
```

> **Note:**
>
> If table is empty or all the rows are hidden from the connector because of ACLs, the message will say:
> `Test request to ServiceNow® succeeded but it didn't return any record.`
> In this situation, make sure that the table is really empty. If any rows are visible from the UI, it means that the connector is not able to ingest them.

## Comparing table row counts in ServiceNow® and Snowflake

To compare the current row count for a table in both ServiceNow® and Snowflake, call the `CHECK_ROW_COUNT` procedure:

```sqlsyntax
CALL CHECK_ROW_COUNT('<table_name>');
```

or

```sqlsyntax
CALL CHECK_ROW_COUNT('<table_name>', <max_sys_created_on>);
```

Where:

`table_name`
:   Specifies the name of a table in the ServiceNow® instance.

`max_sys_created_on`
:   Specifies additional optional filter on maximal value of `sys_created_on` column. Only rows matching this filter
    will be counted. Default value of this parameter is `NULL` which means the filter won’t be applied. This parameter
    helps to compare only counts of records already ingested to Snowflake, without taking into account records recently
    created in ServiceNow® but not yet ingested into Snowflake.

The following example shows how to call `CHECK_ROW_COUNT` stored procedure with `max_sys_created_on` parameter:

```sqlexample
CALL CHECK_ROW_COUNT('sys_db_object', '2021-09-10 12:34:56');
```

If the procedure times out, the procedure was unable to use the `stats` API
to determine the row count of the table in ServiceNow®. This may mean that the number of
rows in this table is too large to be counted by this API.

> **Note:**
>
> The number of rows returned may vary. A ServiceNow® table may contain more rows that the equivalent Snowflake
> table. This may be caused by the access control list rules (ACLs) set for a given table in ServiceNow®.
>
> The connector uses different endpoints for retrieving information about the number of rows in a ServiceNow®
> table. The connector uses `stats` for information about a table, including the number of rows. It uses
> `table` to ingest data into Snowflake.

## Checking the status of the ingestion of a row

To check the status of the ingestion of a row in all possible places in ServiceNow® and Snowflake, call the `CHECK_RECORD_HISTORY` procedure:

```sqlsyntax
CALL CHECK_RECORD_HISTORY('<table_name>', '<sys_id>');
```

Where:

`table_name`
:   Specifies the name of a table in the ServiceNow® instance.

`sys_id`
:   Specifies the `sys_id` of the row to check.

The procedure returns a JSON object containing the following properties:

| Property | Description |
| --- | --- |
| `table_name` | Name of the table. |
| `sys_id` | Unique identifier for the row in ServiceNow®. |
| `status` | Status of the ingestion of the row. |
| `is_present_in_servicenow` | `true` if the row is present in the table in ServiceNow®; `false` otherwise. |
| `is_present_in_servicenow_audit_table` | `true` if the row is tracked in the audit table in ServiceNow®; `false` otherwise. |
| `is_present_in_snowflake_destination_table` | `true` if the row has already been ingested and is available in the `dest_db` database in Snowflake; `false` otherwise. |
| `event_log_records` | Array of JSON objects that represent [entries in the event log](accessing-data.md) for the row with this `sys_id`.  Each object contains the following properties, which correspond to the columns in the event log table that specify the timestamps and event types of the data change:   * `sys_updated_on` * `event_date` * `event_type` |

## Determining if a table is audited for deletion

The Snowflake Connector for ServiceNow® relies on auditing to propagate the deletion of records to Snowflake.

To verify that a given table in ServiceNow® is configured to audit the deletion of records, call the `CHECK_IF_AUDIT_ENABLED` stored procedure:

```sqlsyntax
CALL CHECK_IF_AUDIT_ENABLED('<table_name>');
```

Where:

`table_name`
:   Specifies the name of a table in the ServiceNow® instance.

The procedure returns a JSON object containing the following properties:

| Property | Description |
| --- | --- |
| `response_code` | `OK` value if the procedure succeeded or a code of the error in case of a failure. |
| `audit` | Value of the `audit` attribute for the checked table. If set to true then audit is enabled on the table. |
| `no_audit_delete` | Value of the `no_audit_delete` attribute for the checked table. If it’s set, then it overrides value from the `audit` field for delete events. |
| `summary` | Human-readable explanation at to whether audit is enabled on the table based on values of `audit` and `no_audit_delete` fields. Audit is enabled on the table when either:   * `audit` field is set to true and `no_audit_delete` isn’t set to true. * `no_audit_delete` is set to false. |

### Obtaining troubleshooting data

To obtain troubleshooting data, call the `GET_TROUBLESHOOTING_DATA` stored procedure:

```sqlsyntax
CALL GET_TROUBLESHOOTING_DATA(<from_timestamp>, <to_timestamp>);
```

Where:

`from_timestamp`
:   Specifies the start of dates range (in UTC timezone) for which data should be fetched.

`to_timestamp`
:   Specifies the end of dates range (in UTC timezone) for which data should be fetched.

This stored procedure returns the following data in tabular format:

* Configuration information
* Errors experienced by connector
* Ingestion history

The following example shows how to call this stored procedure:

```sqlexample
CALL GET_TROUBLESHOOTING_DATA('2024-02-05 10:00:00', '2024-02-10 22:30:00');
```

You can save the returned data in CSV format to send to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Recovering an inaccessible object

The connector requires multiple database objects that are external to the connector application. If these objects are unavailable, the connector will fail or stop working correctly. Situations where objects can become unavailable
include:

* Dropping an object.
* Recreating an object without keeping or restoring the required grants.
* Revoking the grants necessary for the connector to use these objects.

In most cases you can restore these objects manually by recreating them and granting the necessary privileges. The
following sections describe how to restore different objects.

### Restoring the connector warehouse

If the connector loses access to its warehouse, configure a new one by calling the
[UPDATE_WAREHOUSE](managing.md) stored procedure.

### Restoring the database and schema for the ServiceNow® data

If the [database or schema for the ServiceNow® data](installing-sql.md) is
dropped, the only way to recover is by running the
[UNDROP <object>](../../sql-reference/sql/undrop.md) command. If this command is unavailable, you
must reinstall the connector and ingest the ServiceNow® data again.

If a [view containing the ServiceNow® data](accessing-data.md) is dropped,
it should be recreated automatically the next time the background task responsible for creating them runs.

If one of the tables containing the ServiceNow® data (either the [event logs](accessing-data.md)
or [raw data](accessing-data.md) tables) is dropped and you cannot use the
[UNDROP TABLE](../../sql-reference/sql/undrop-table.md) command to recover it, do the following to start the ingestion of the ServiceNow® table again:

* Ensure both event logs and raw data tables for this ServiceNow® table are dropped.
* [Disable the ServiceNow® table](ingestion.md).
* Use the [DELETE_TABLE procedure](managing.md).
* [Enable the ServiceNow® table](ingestion.md).

### Restoring the notification integration for the connector

If the connector loses access to the notification integration object, do the procedures for
[configuring alerts](monitoring.md) again, recreating the
notification integration object if necessary.

If email notifications are configured via Snowsight then you can just disable and re-enable them to restore
the necessary external objects.

## Error when ingesting data from table. Request to ServiceNow® failed after 2 attempts.

The error occurs because of table ingestion failure. You can check for it in the `CONNECTOR_ERRORS` view. The error message
can include the following sentences:

* Error when ingesting data from table
* Minimal page size of 1 was reached
* Request to ServiceNow® failed after 2 attempts
* Request to ServiceNow® timed out

The error means that the connector tried to perform requests to ServiceNow® API. ServiceNow® API couldn’t correctly
respond to any of these requests. This usually indicates performance problem with the API. There are several possible
solutions to this issue:

* Increase API timeout on the ServiceNow® side:

  1. Log in to the ServiceNow® instance.
  2. Navigate to Transaction Quota Rules panel.
  3. Find and open the REST Table API request timeout rule.
  4. Increase the value of Maximum Duration (seconds). The maximum duration the connector can handle is 120 seconds. Higher duration values aren’t supported and will result in timeouts on the connector side.
* Ensure that there are no unnecessary ACLs on the table. The ACLs heavily impact performance of the API. Ideally, the connector user shouldn’t have any ACLs set on the table. If there is a need to omit some rows from the ingested table, consider using [row filtering](ingestion.md).
* In the ServiceNow® table, create a composite index on either sys_updated_on and sys_id columns or sys_created_on and sys_id columns, if the sys_updated_on column isn’t present.
* Investigate ServiceNow® logs to find out why the API was slow to respond. A good starting point is the Transaction Log in ServiceNow®. To see connector requests:

  1. Log in to the ServiceNow® instance.
  2. Navigate to System Logs > Transactions (all user) panel.
  3. Filter the table by Created by column set to connector user, and URL column containing ingested table name.
  4. Check Response time column for unusually high response time, more than REST Table API request timeout set in the previous step.
  5. Investigate further suspicious transactions for potential bottlenecks.

## Determining the reason for missing columns in flattened views

The connector creates flattened views in the destination schema based on the ServiceNow® metadata.
There are several reasons why a column can be missing on the Snowflake side.

### Checking if column metadata is present in Snowflake

To check if column metadata is present in the `sys_dictionary` table on Snowflake, execute the following query:

```sqlsyntax
SELECT * FROM <dest_db>.<dest_schema>.sys_dictionary__view WHERE name = '<table_name>' AND element = '<column_name>';
```

If the table you’re investigating has parent tables (inherited from another table in ServiceNow®)
and the column you are looking for was added to the parent table, you should use the parent table name instead.

To list all the tables from which the table you’re interested in inherits, please use the following query:

```sqlsyntax
SELECT
    sys_id,
    name,
    PARSE_JSON(super_class):value::string AS super_class_sys_id
FROM <dest_db>.<dest_schema>.sys_db_object__view
START WITH name = '<table_name>'
CONNECT BY sys_id = PRIOR super_class_sys_id;
```

If rows are returned, metadata for the column was correctly ingested into Snowflake but the view has not yet been refreshed.
Check the status and if:

* the view was refreshed recently but the column is still not present, please contact support.
* the view was not refreshed yet, wait for the next ingestion schedule.

If an empty result is returned, it means that the connector didn’t ingest metadata for this column yet.
You need to validate on the ServiceNow® side if the record is visible to the connector and has correct timestamp.

### View refresh status

To validate when the views for a given table were last refreshed and if the operation was successful, execute the following query:

```sqlsyntax
SELECT flattened_views_status, flattened_views_last_updated FROM tables_state WHERE table_name = '<table_name>';
```

If the last refresh failed, you may want to query event table and look for errors reported by the connector.

### View ServiceNow® column metadata availability

It’s possible, that the reason for missing columns in the flattened view is that column metadata cannot be ingested by the connector.
This may be caused by ACLs preventing the row in the `sys_dictionary` table from being returned by the Table API.
Another possible reason is a past timestamp value in the `sys_updated_on` column.
It can also be the case that the column/table definition was imported from a different ServiceNow® instance.
To determine if the connector can access column metadata execute GET request to the following endpoint:

```none
https://<servicenow_instance>.service-now.com/api/now/table/sys_dictionary?sysparm_query=name=<table_name>^element=<column_name>
```

If an empty result is returned the connector cannot access the column. The column may be protected by an ACL or not present.

If a column definition was returned, examine the value of the `sys_updated_on` field.
Confirm the date matches the expected time when the column was added to the table.
If it was imported from another instance it may show the point in time when the column was created.
The CDC (incremental updates) mechanism in the connector may not notice that the record dated in the past was added. In this case, trigger a reload of `sys_dictionary` table.
After reload is completed wait for the next scheduled ingestion to recreate the view with correct list of columns.

## Table with continuous schedule disabled by the connector

The connector automatically disables tables with continuous schedule when it detects that ingestion on such a table failed
for 10 consecutive times, and the cause of all failed ingestion runs is related to the ServiceNow® instance. This mechanism
prevents overloading of the ServiceNow® instance when too many tables with continuous schedule are enabled and allows the
ServiceNow® instance to recover after the table is disabled. The connector does not enable the table again automatically,
it must be enabled manually by the user.

Information that a table is automatically disabled is visible in the `CONNECTOR_ERRORS` view when filtering by
`TABLE_INGESTION_DISABLED` code. When the error occurs, investigate why ingestion runs on the table failed.
You can find detailed information about errors in the `CONNECTOR_ERRORS` view after filtering by `INGESTION_FAILED`
code.

After the cause of error is investigated and resolved, the table can be [enabled](ingestion.md)
again by calling the `ENABLE_TABLE` or `ENABLE_TABLES` procedure.

## Connector is unavailable

The connector can enter an `ERROR` state, which could happen for a variety of reasons.
For instance internal connector error, which cannot be recovered.
In such situations the `Connector unavailable` error message will be displayed when examining connector state.

Currently, there is no automatic recovery mechanism for this state. However you can still execute several connector functions,
including the [EXPORT_CONNECTOR_STATE](managing.md) procedure.

To examine if the connector is in an `ERROR` state, execute the query:

```sqlsyntax
CALL GET_CONNECTOR_STATUS();
```

In addition please contact the [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to help us better understand the problem or determine how it can be possibly avoided. You should also execute
[manual connector reinstallation](managing.md) in order to restore the connector to a working state.
Note that previously ingested data isn’t lost and the connector can continue the ingestion from where it previously stopped.

---
title: Troubleshooting the Snowflake Connector for Google Analytics Aggregate Data
source: https://docs.snowflake.com/en/connectors/google/gaad/gaad-connector-troubleshooting.md
section: Connectors & Drivers
---

# Troubleshooting the Snowflake Connector for Google Analytics Aggregate Data

This topic provides guidelines for troubleshooting issues with the Snowflake Connector for Google Analytics Aggregate Data.

## Calling the get_troubleshooting_data procedure

The `GET_TROUBLESHOOTING_DATA` procedure returns information about the configuration of a connector, ingestion history, errors,
and additional information that can help you determine the root cause of an issue. This procedure may be called on
the connector in any state (configured, not configured, running, paused, and so on).

> **Note:**
>
> To report an issue with the connector to snowflake Support, attach the output from this procedure.

`GET_TROUBLESHOOTING_DATA` takes two parameters: a ‘from’ timestamp and a ‘to’ timestamp. They limit the returned rows
to the relevant time frame. For example, to get troubleshooting data with an ingestion history for the last week, you can call:

```sqlsyntax
CALL GET_TROUBLESHOOTING_DATA(DATEADD(day, -7, SYSDATE()), SYSDATE());
```

## Verifying the connection to Google Analytics

To verify that the connector can access Google Analytics data, call the
`TEST_CONNECTION` stored procedure, which is defined in the PUBLIC schema of the connector’s installation database:

```sqlsyntax
CALL TEST_CONNECTION();
```

## Checking the connector stats and connector errors views

If you encounter problems with data ingestion, you can check the `CONNECTOR_STATS` view and the `CONNECTOR_ERRORS` view
from the `PUBLIC` schema in the connector’s installation database:

```sqlsyntax
SELECT * FROM PUBLIC.CONNECTOR_STATS;
SELECT * FROM PUBLIC.CONNECTOR_ERRORS;
```

For information about returned content, see [Monitoring the Snowflake Connector for Google Analytics Aggregate Data](gaad-connector-monitoring.md).

## Transfering ownership of tables and views in the destination schema

The connector must own all associated report tables and views. If ownership is transferred to another role,
it can be returned to the connector using the `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` function.

```sqlexample
USE ROLE accountadmin;
CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<connector_app>, true, <destination_database>, <destination_schema>);
```

The `SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION` is a system function provided by Snowflake that allows the transfer of
ownership of tables and views in a specified database or schema to the application. Only the ownership of regular tables and
regular views is transferred, e.g. ownership of dynamic tables, external tables, materialized views, etc. won’t be
transferred.

The function has the following signature:

```sqlexample
SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION(<to_app>, <should_copy_grants>, <from_database>, <from_schema>)
```

Where:

> `to_app`
> :   Specifies the name of the application to which the ownership of objects should be transferred.
>
> `should_copy_grants`
> :   If `TRUE` then copy existing grants, otherwise revoke. Copying grants requires `MANAGE GRANTS`
>     permission on the caller.
>
> `from_database`
> :   Name of the database containing objects whose ownership should be changed.
>
> `from_schema`
> :   (Optional) name of the schema containing objects whose ownership should be changed. If no schema is specified,
>     ownership is transferred on tables and views in all schemas in the provided database. Objects in managed schemas
>     are omitted during ownership transfer.

To execute the function the caller must meet one of the following conditions:

* It has `MANAGE GRANTS` permission (e.g. ACCOUNTADMIN or SECURITYADMIN role), or
* It contains role owning the application instance and role owning all objects to transfer the ownership. Objects on
  which the ownership is missing are omitted by the function.

For example, to return ownership to the connector that:

* Was installed as `snowflake_connector_for_google_analytics_aggregate_data`
* Uses the schema named `dest_db.dest_schema` for the Google Analytics data in Snowflake

Run the following command:

```sqlexample
USE ROLE accountadmin;
CALL SYSTEM$GRANT_OWNERSHIP_TO_APPLICATION('snowflake_connector_for_google_analytics_aggregate_data', true, 'dest_db', 'dest_schema');
```

---
title: Troubleshooting the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-troubleshooting.md
section: Connectors & Drivers
---

# Troubleshooting the Snowflake Connector for Google Analytics Raw Data

This topic provides guidelines for troubleshooting issues with the Snowflake Connector for Google Analytics Raw Data.

## Verifying a connection to the Google Cloud Platform (GCP) instance

To verify that the Snowflake Connector for Google Analytics Raw Data can access the Google Cloud Platform (GCP) instance, call the
`CONNECTION_STATUS` stored procedure, which is defined in the PUBLIC schema of the connector installation database:

```sqlsyntax
CALL CONNECTION_STATUS();
```

To check the connection status in Snowsight, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Select the Snowflake Connector for Google Analytics Raw Data.

The color of the icon in the Authenticate Google Cloud Platform section shows if the connection to GCP was successful. If the icon is red,
the attempt to connect to GCP failed. To try reconnecting, select Reauthenticate.

If the icon is green, the connector is ready to ingest data.

## Checking connector status

To examine connector status use the `GET_CONNECTOR_STATUS` stored procedure, as shown:

```sqlsyntax
CALL PUBLIC.GET_CONNECTOR_STATUS()
```

## Checking current ingestion status

If you’re missing data from a particular day, you can query the `CONNECTOR_STATS` view to see whether there have been any errors when trying to ingest that day’s table from BigQuery:

```sqlsyntax
SELECT * FROM CONNECTOR_STATS WHERE PROPERTY_ID = '<property_name>' AND BIG_QUERY_TABLE = 'events_<date>' ORDER BY RUN_START_TIME DESC;
```

The result will show all attempts to download a particular table from BigQuery’s dataset for a particular property, with the latest one at the top. The `STATUS` column will show the outcome, and for any failed attempt, the `ERROR_MESSAGES` column will detail what happened.

## Downloading connector logs

If you encounter problems with the connector, you can call the
`GET_TROUBLESHOOTING_DATA` stored procedure, which is defined in the PUBLIC schema of the connector installation database:

```sqlsyntax
CALL GET_TROUBLESHOOTING_DATA(7);
```

The parameter defines how many days in the past since now should be included in the logs. Please use 7 as a default unless support asks you to use a different value.

As a result, you get the full connector logs. You can download the logs, filter, and share the logs with the application provider.

## Sharing connector logs from an event table with an application provider

It is possible to share connector logs stored in the event table with the application provider. This could be used by the provider to investigate encountered problems with the connector. Click this link to read more about [consumer enable logging](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging/).

> **Note:**
>
> This feature would not work without [enabling event tables](../../../developer-guide/logging-tracing/event-table-setting-up.md) on the account.

To enable sharing application events from the event table run:

```sqlexample
ALTER APPLICATION <GARD APPLICATION NAME> SET SHARE_EVENTS_WITH_PROVIDER = TRUE;
```

To stop sharing run:

```sqlexample
ALTER APPLICATION <GARD APPLICATION NAME> SET SHARE_EVENTS_WITH_PROVIDER = FALSE;
```

Current status could be checked by running:

```sqlexample
DESC APPLICATION <GARD APPLICATION NAME>;
```

## Comparing row counts in Google Cloud Platform (GCP) and Snowflake

To check if the ingestion was correct, you can compare the row counts in Snowflake and Google Cloud Platform (GCP).

To check the row count in Snowflake, run the following query:

```sqlexample
SELECT COUNT(*) FROM analytics_<property_name> WHERE source_table_date = '<date>' WHERE INGESTION_COMPLETE = true;
```

To check the row count in GCP, run the following query:

```sqlexample
SELECT COUNT(*) FROM '<project_id>.analytics_<property_name>.events_<date>';
```

---
title: Troubleshooting the Snowflake Connector for MySQL
source: https://docs.snowflake.com/en/connectors/mysql6/troubleshoot.md
section: Connectors & Drivers
---

# Troubleshooting the Snowflake Connector for MySQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

## Contact Snowflake Support

If you encounter an issue while using the connector, [submit a support case](../../user-guide/ui-support.md).

Snowflake usually analyzes the logs from your connector to offer a resolution. Logs from the connector, both the native app and the database agent logs, are stored in the event table of your account. However, there are different mechanisms for sharing these logs with Snowflake.

### Sharing the native app logs with Snowflake

By default, the native app logs are accessible to Snowflake. To find out more about the sharing mechanism, see [Set up event tracing for an app](http://docs.snowflake.com/native-apps/consumer-enable-logging).

> **Note:**
>
> If you disable log sharing, then you need to attach the logs to any support case you submit. Re-enabling log sharing does not include historical records, but only entries from the time you re-enable it.

### Sharing the database agent logs with Snowflake

> To share the database agent logs with Snowflake, you must extract them and attach them to the support case manually, as described in the following steps:

1. Query the logs as described in [Viewing the agent logs](monitor.md).

2. Select Download or View Results.
3. Click Export.
4. Attach the exported file to your support case.

---
title: Troubleshooting the Snowflake Connector for PostgreSQL
source: https://docs.snowflake.com/en/connectors/postgres6/troubleshoot.md
section: Connectors & Drivers
---

# Troubleshooting the Snowflake Connector for PostgreSQL

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

## Contact Snowflake Support

If you encounter an issue while using the connector, [submit a support case](../../user-guide/ui-support.md).

Snowflake usually analyzes the logs from your connector to offer a resolution. Logs from the connector, both the native app and the database agent logs, are stored in the event table of your account. However, there are different mechanisms for sharing these logs with Snowflake.

### Sharing the Native App logs

By default, the native app logs are accessible to Snowflake. To find out more about the sharing mechanism, see [Set up event tracing for an app](http://docs.snowflake.com/native-apps/consumer-enable-logging).

> **Note:**
>
> If you disable log sharing, then you need to attach the logs to any support case you submit. Re-enabling log sharing does not include historical records, but only entries from the time you re-enable it.

### Sharing the database agent logs

The agent replicates its Native App counterpart. To send the agent logs back to Snowflake:

1. Access the agent logs table as described in [Viewing the agent logs](monitor.md).

2. Select Download or View Results.
3. Click Export.
4. Attach the exported file to your support case.

---
title: Tutorial: Get started with the MySQL and PostgreSQL connectors for Snowflake
source: https://docs.snowflake.com/en/connectors/tutorials/dbtutorial.md
section: Connectors & Drivers
---

Snowflake

PostgreSQL

MySQL

Connector

> **Note:**
>
> The Snowflake Connector for PostgreSQL and Snowflake Connector for MySQL are subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms).

# Tutorial: Get started with the MySQL and PostgreSQL connectors for Snowflake

## Introduction

Welcome to our tutorial on using the Snowflake Database Connectors. This guide will help you seamlessly transfer data from relational databases into Snowflake.

In this tutorial, you’ll gain the skills to:

* Set up MySQL and PostgreSQL in Docker, complete with sample data for ingestion.
* Install and configure two native applications, one for each database.
* Set up and fine-tune two agents, again one for each database.
* Initiate and manage data ingestion processes.
* Monitor the data ingestion workflow.

Let’s get started!

### Prerequisites

Before beginning this tutorial, ensure you meet the following requirements:

* Docker is installed and operational on your local machine.
* You have a tool available for connecting to the database. This can be a database-specific tool or a general-purpose tool such as IntelliJ or Visual Studio Code.

## Creating MySQL and PostgreSQL Source Databases

In this section, we will guide you through the following steps:

* Starting the Database Instances - Learn how to launch your MySQL and PostgreSQL instances using Docker.
* Connecting to the Database - Instructions on how to establish a connection to your databases.
* Loading Sample Data - A walkthrough on how to populate your databases with sample data.

### Starting the database instances

To begin the MySQL and PostgreSQL database configuration process using Docker, create the file `docker-compose.yaml`.
The content of the file should resemble:

```yaml
services:
  mysql:
    container_name: mysql8
    restart: always
    image: mysql:8.0.28-oracle
    command: --log-bin=/var/lib/mysql/mysql-bin
      --max-binlog-size=4096
      --binlog-format=ROW
      --binlog-row-image=FULL
      --binlog-row-metadata=FULL
      --sql_mode="ONLY_FULL_GROUP_BY,STRICT_TRANS_TABLES,NO_ZERO_IN_DATE,NO_ZERO_DATE,ERROR_FOR_DIVISION_BY_ZERO,NO_ENGINE_SUBSTITUTION,PAD_CHAR_TO_FULL_LENGTH"
    environment:
      MYSQL_ROOT_PASSWORD: 'mysql'
    volumes:
      - ./mysql-data:/var/lib/mysql
    ports:
      - "3306:3306"
  postgres:
    image: "postgres:11"
    container_name: "postgres11"
    environment:
      POSTGRES_USER: 'postgres'
      POSTGRES_PASSWORD: 'postgres'
    ports:
      - "5432:5432"
    command:
      - "postgres"
      - "-c"
      - "wal_level=logical"
    volumes:
      - ./postgres-data:/var/lib/postgresql/data
```

Once your `docker-compose.yaml` is ready, follow these steps:

1. Open a terminal.
2. Navigate to the directory containing the `docker-compose.yaml` file.
3. Execute the following command to start source databases in containers:

   ```bash
   docker compose up -d
   ```

After running this command, you should see two containers actively running the source databases.

### Connecting to the Database

To connect to the pre-configured databases using IntelliJ’s or Visual Studio Code database connections,
perform the following steps with the provided credentials:

MySQLPostgreSQL

1. Open your tool of choice for connecting to the MySQL.
2. Click the ‘+’ sign or similar to add data source.
3. Fill in the connection details:

   * **User**: `root`
   * **Password**: `mysql`
   * **URL**: `jdbc:mysql://localhost:3306`
4. Test the connection and save.

1. Open your tool of choice for connecting to the PostgreSQL.
2. Click the ‘+’ sign or similar to add data source.
3. Fill in the connection details:

   * **User**: `postgres`
   * **Password**: `postgres`
   * **Database**: `postgres`
   * **URL**: `jdbc:postgresql://localhost:5432`
4. Test the connection and save.

### Loading Sample Data

To initialize and load sample please execute those scripts in those connections.

MySQLPostgreSQL

Execute the script to generate sample data

```mysql
CREATE DATABASE mysql_ingest_database;
USE mysql_ingest_database;

CREATE TABLE mysql_rows(
    id INT AUTO_INCREMENT PRIMARY KEY,
    random_string VARCHAR(255),
    random_number INT);

INSERT INTO mysql_rows (random_string, random_number) VALUES
    ('fukjxyiteb', 100),
    ('ndhbbipodi', 37),
    ('laebpztxzh', 83);

SELECT * FROM mysql_ingest_database.mysql_rows;
```

Execute the script to generate sample data

```postgresql
CREATE SCHEMA psql_rows_schema;
SET search_path TO psql_rows_schema;

CREATE TABLE psql_rows_schema.postgres_rows (
  id SERIAL PRIMARY KEY,
  a_text TEXT,
  a_boolean BOOLEAN,
  a_number INTEGER,
  a_decimal DOUBLE PRECISION);

INSERT INTO psql_rows_schema.postgres_rows (a_text, a_boolean, a_number, a_decimal) VALUES
  ('QfJhyWwFuC', True, 37, 15.46),
  ('GwmIFgwvFy', True, 14, 13.21),
  ('jYvqOSEtam', True, 25, 20.85);

-- The publication is required to start the replication progress as the Connector is based on PostgreSQL Logical Replication
CREATE PUBLICATION agent_postgres_publication FOR ALL TABLES;

SELECT * FROM psql_rows_schema.postgres_rows;
```

You should see three rows in each populated database.

## Install and configure the Native App

During this step you will:

* Install the Native Applications
* Configuring the Native Applications

### Install the Native Applications

Follow these steps to install the Application from the Snowflake Native Apps Marketplace:

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Install the **Snowflake Connector for MySQL** and **Snowflake Connector for PostgreSQL** applications.
4. Install both applications.

After installation, you will see the new applications listed in Catalog » Apps.

### Configuring the Native Applications

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Apps.
3. Open each application and do the following:

MySQLPostgreSQL

1. Select Download Driver and save the file. The file name will resemble `mariadb-java-client-3.4.1.jar` or with newer version when available. Save this file for use during agent configuration.
2. Select Mark all as done as we will create and populate source databases from scratch.

   > **Note:**
   >
   > No addition additional network configuration is required at this point as we’ll configure the agent later in the tutorial.
3. Click Start configuration.
4. On the Configure Connector screen, select Configure. The Verify Agent Connection page will display.
5. Select Generate file to generate an agent configuration file. The file name should resemble `snowflake.json`. Save this file for later use in the Agent Configuration section.

1. Select Mark all as done as we will create and populate source databases from scratch.

   > **Note:**
   >
   > No addition additional network configuration is required at this point as we’ll configure the agent later in the tutorial.
2. Click Start configuration
3. On the **Configure Connector** screen, select **Configure**.
4. In the Verify Agent Connection page select Generate file to generate the agent configuration file. The file name should resemble `snowflake.json`. Save this file for use during the Agent Configuration section.

## Configure the agents

In this section, we’ll configure the agent that will operate with your source databases.

The first step is to create directories `agent-mysql` and `agent-postgresql`.

Within each directory, create subdirectories `agent-keys` and `configuration`. Your directory structure should resemble:

```output

├── agent-mysql
│   ├── agent-keys
│   └── configuration
└── agent-postgresql
    ├── agent-keys
    └── configuration
```

### Creating configuration files

In this section, we’ll add content to the configuration files for each agent to operate correctly. The configuration files include:

* `snowflake.json` file to connect to the Snowflake.
* `datasources.json` file to connect to the source databases.
* `postgresql.conf/mysql.conf` files with additional agent environment variables.
* JDBC Driver file for MySQL agent.

MySQLPostgreSQL

1. In a terminal, navigate to the `agent-mysql` directory.
2. Create the Docker Compose file `docker-compose.yaml` with the following content:

   ```yaml
   services:
     mysql-agent:
       container_name: mysql-agent
       image: snowflakedb/database-connector-agent:latest
       volumes:
         - ./agent-keys:/home/agent/.ssh
         - ./configuration/snowflake.json:/home/agent/snowflake.json
         - ./configuration/datasources.json:/home/agent/datasources.json
         - ./configuration/mariadb-java-client-3.4.1.jar:/home/agent/libs/mariadb-java-client-3.4.1.jar
       env_file:
         - configuration/mysql.conf
       mem_limit: 6g
   ```
3. Move the previously downloaded `snowflake.json` file into the `configuration` directory.
4. Move the previously downloaded `mariadb-java-client-3.4.1.jar` file into the `configuration` directory.
5. In the `configuration` directory create `datasources.json` with content:

   ```json
   {
     "MYSQLDS1": {
       "url": "jdbc:mariadb://host.docker.internal:3306/?allowPublicKeyRetrieval=true&useSSL=false",
       "username": "root",
       "password": "mysql",
       "ssl": false
     }
   }
   ```
6. In the `configuration` directory create `mysql.conf` with content:

   ```bash
   JAVA_OPTS=-Xmx5g
   MYSQL_DATASOURCE_DRIVERPATH=/home/agent/libs/mariadb-java-client-3.4.1.jar
   ```
7. Start the agent using the following command. There shouldn’t be any error message and the agent should generate a public and private key pair for authentication to Snowflake.

   ```bash
   docker compose stop  # stops the previous container in case you've launched it before
   docker compose rm -f # removes the agent container to recreate it with the latest image in case you had one before
   docker compose pull  # refresh remote latest tag in case you have cached previous version
   docker compose up -d # run the agent
   ```
8. Please note that the **driver jar file** name should be **identical** to the one downloaded and used in the `docker-compose.yaml` and `mysql.conf` files.

1. On the command line, navigate to the `agent-postgresql` directory.
2. Create the Docker Compose file `docker-compose.yaml` with the following content:

   ```yaml
   services:
     postgresql-agent:
       container_name: postgresql-agent
       image: snowflakedb/database-connector-agent:latest
       volumes:
         - ./agent-keys:/home/agent/.ssh
         - ./configuration/snowflake.json:/home/agent/snowflake.json
         - ./configuration/datasources.json:/home/agent/datasources.json
       env_file:
         - configuration/postgresql.conf
       mem_limit: 6g
   ```
3. Move the previously downloaded `snowflake.json` file into the `configuration` directory.
4. In the `configuration` directory create `datasources.json` with content:

   ```json
   {
     "PSQLDS1": {
       "url": "jdbc:postgresql://host.docker.internal:5432/postgres",
       "username": "postgres",
       "password": "postgres",
       "publication": "agent_postgres_publication",
       "ssl": false
     }
   }
   ```
5. In the `configuration` directory, create `postgresql.conf` with the following content:

   ```bash
   JAVA_OPTS=-Xmx5g
   ```
6. Start the agent using the following command. There shouldn’t be any error message and the agent should generate a public and private key pair for authentication to Snowflake.

   ```bash
   docker compose up -d
   ```

When complete, your directory structure should resemble the following. Please note the inclusion of the automatically generated private and public keys within the agent-keys directories.

```output

├── agent-mysql
│   ├── agent-keys
│   │   ├── database-connector-agent-app-private-key.p8
│   │   └── database-connector-agent-app-public-key.pub
│   ├── configuration
│   │   ├── datasources.json
│   │   ├── mariadb-java-client-3.4.1.jar
│   │   ├── mysql.conf
│   │   └── snowflake.json
│   └── docker-compose.yaml
└── agent-postgresql
    ├── agent-keys
    │   ├── database-connector-agent-app-private-key.p8
    │   └── database-connector-agent-app-public-key.pub
    ├── configuration
    │   ├── datasources.json
    │   ├── postgresql.conf
    │   └── snowflake.json
    └── docker-compose.yaml
```

### Verifying connection with Snowflake

Go back to your previously created native apps. Click on the **Refresh** button in the Agent Connection section.

When successfully Configured you should see:

```text
Agent is fully set up and connected. To select data to ingest Open Worksheet.
```

## Configure and monitor the data ingestion process

In this step, we will instruct the Connector to begin replicating the selected tables. First, let’s create a shared sink database in Snowflake.

```sqlexample
CREATE DATABASE CONNECTORS_DEST_DB;
GRANT CREATE SCHEMA ON DATABASE CONNECTORS_DEST_DB TO APPLICATION SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL;
GRANT CREATE SCHEMA ON DATABASE CONNECTORS_DEST_DB TO APPLICATION SNOWFLAKE_CONNECTOR_FOR_MYSQL;
```

Once the database is ready, we can move on to the configuration process.

MySQLPostgreSQL

1. To begin table replication, you must first add a datasource from which to replicate and then specify the table to be replicated.

   ```sqlexample
   CALL SNOWFLAKE_CONNECTOR_FOR_MYSQL.PUBLIC.ADD_DATA_SOURCE('MYSQLDS1', 'CONNECTORS_DEST_DB');
   CALL SNOWFLAKE_CONNECTOR_FOR_MYSQL.PUBLIC.ADD_TABLES('MYSQLDS1', 'mysql_ingest_database', ARRAY_CONSTRUCT('mysql_rows'));
   ```
2. To monitor the replication, execute the following queries:

   ```sqlexample
   SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_MYSQL.PUBLIC.REPLICATION_STATE;
   SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_MYSQL.PUBLIC.CONNECTOR_STATS;
   ```

1. To begin table replication, you must first add a data source from which to replicate and then specify the table to be replicated.

   ```sqlexample
   CALL SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL.PUBLIC.ADD_DATA_SOURCE('PSQLDS1', 'CONNECTORS_DEST_DB');
   CALL SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL.PUBLIC.ADD_TABLES('PSQLDS1', 'psql_rows_schema', ARRAY_CONSTRUCT('postgres_rows'));
   ```
2. To monitor the replication you can execute the following queries

   ```sqlexample
   SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL.PUBLIC.REPLICATION_STATE;
   SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL.PUBLIC.CONNECTOR_STATS;
   ```

### Understanding connector status

The `REPLICATION_STATE` view is crucial for monitoring the status of table replication. This process encompasses three distinct phases:

1. `SCHEMA_INTROSPECTION`: Ensures that the schema of the source table is accurately replicated.
2. `INITIAL_LOAD`: Transfers the existing data from the source table to the destination.
3. `INCREMENTAL_LOAD`: Continuously replicates ongoing changes from the source.

Upon successful replication, the status display will resemble the following:

> |  |  |  |  |
> | --- | --- | --- | --- |
> | REPLICATION_PHASE | SCHEMA_INTROSPECTION_STATUS | SNAPSHOT_REPLICATION_STATUS | INCREMENTAL_REPLICATION_STATUS |
> | INCREMENTAL_LOAD | DONE | DONE | IN PROGRESS |

You can read more about it in the [official Connector Documentation](../postgres6/monitor.md).

## View data

Execute the following commands to view data, which should include roughly 3 rows per database.

```sqlexample
SELECT * FROM CONNECTORS_DEST_DB."psql_rows_schema"."postgres_rows";
SELECT * FROM CONNECTORS_DEST_DB."mysql_ingest_database"."mysql_rows";
```

## Clean up and additional resources

Congratulations! You have successfully completed this tutorial.

To clean up your environment, execute the commands listed below. Failing to do so will leave the connector running and generating costs.

### Remove the native app

```sqlexample
DROP APPLICATION SNOWFLAKE_CONNECTOR_FOR_POSTGRESQL CASCADE;
DROP APPLICATION SNOWFLAKE_CONNECTOR_FOR_MYSQL CASCADE;
```

### Remove warehouses, roles and users

During the installation multiple warehouses, roles and users were created. Execute the following queries to drop those objects.

MySQLPostgreSQL

```sqlexample
DROP ROLE MYSQL_ADMINISTRATIVE_AGENT_ROLE;
DROP ROLE MYSQL_AGENT_ROLE;

DROP USER MYSQL_AGENT_USER;

DROP WAREHOUSE MYSQL_COMPUTE_WH;
DROP WAREHOUSE MYSQL_OPS_WH;
```

```sqlexample
DROP ROLE POSTGRESQL_ADMINISTRATIVE_AGENT_ROLE;
DROP ROLE POSTGRESQL_AGENT_ROLE;

DROP USER POSTGRESQL_AGENT_USER;

DROP WAREHOUSE POSTGRESQL_COMPUTE_WH;
DROP WAREHOUSE POSTGRESQL_OPS_WH;
```

### Stop database containers

To stop the running containers with MySQL and PostgreSQL, navigate to the directory containing the `docker-compose.yaml` files, then execute the `docker compose down -v`.

### Additional resources

Continue learning about connectors using the following resources:

* [About the Snowflake Connector for MySQL](../mysql6/about.md)
* [About the Snowflake Connector for PostgreSQL](../postgres6/about.md)

---
title: Tutorial: Snowflake ServiceNow® data ingestion connector installation
source: https://docs.snowflake.com/en/connectors/servicenow/tutorials/servicenow-to-snowflake-connector.md
section: Connectors & Drivers
---

Getting Started

Connectors

Data Engineering

> **Note:**
>
> The Snowflake Connector for ServiceNow® is subject to the [Snowflake Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms).

# Tutorial: Snowflake ServiceNow® data ingestion connector installation

## Introduction

Use this tutorial to configure and understand the Snowflake Connector for
ServiceNow® using the Snowsight wizard, select some tables, ingest data, and run an example query.

This tutorial is not meant to be exhaustive. Please review
[About the Snowflake Connector for ServiceNow®](../about.md) for full functionality and limitations.

> **Note:**
>
> This tutorial assumes you do not have a ServiceNow® account, so it guides you through
> the steps of creating a developer account. If you do have a Servicenow® account,
> feel free to try it out, with the caveat that the Snowflake connector for ServiceNow®
> is subject to the [Connector Terms](https://www.snowflake.com/legal/snowflake-connector-terms).

### Prerequisites

Before beginning this tutorial please ensure that you have met the following requirements:

* `ORGADMIN` rights to Accept the Terms of Service in the Snowflake Marketplace.
* `ACCOUNTADMIN` rights on the Snowflake account where you want to install the connector.

### What you’ll learn

In this tutorial you’ll learn how to:

* How to set up the Snowflake Connector for ServiceNow®.
* How to ingest ServiceNow® data into Snowflake
* How to stop the connector to avoid unnecessary costs in a development environment.

### What you’ll need

* [A Snowflake account](https://snowflake.com/)
* [A ServiceNow® developer account](https://developer.servicenow.com/dev.do/)

### What you’ll build

A ServiceNow® to Snowflake ingestion data flow.

## Set up the ServiceNow® developer instance

If you do not want to test this connector on your ServiceNow® account, you can use a
developer instance. This section describes how to set up a developer instance.

1. Go to the [ServiceNow® developer website](https://developer.servicenow.com), and create a developer user.
2. Log on to the developer website with your newly created user and select Create an Instance.
3. Choose an instance type. You’ll receive an email with your instance URL, and your user and password.

Deployment is usually pretty quick, around five minutes. But, while you wait, let’s go to the next step and configure Snowflake!

## Create and set up the Snowflake account

### Create a Snowflake account

If you do not have a Snowflake account, you can get a free trial at [snowflake.com](https://www.snowflake.com/en/).
Select Start for Free and follow the instructions.

### Accept the Terms & Conditions

1. Log on to your Snowflake account through the Snowsight web interface and change to the `ORGADMIN` role.
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, review the Consumer Terms of Service.
4. If you agree to the terms, select Accept Terms & Conditions.

### Set up a virtual warehouse

Connectors require a virtual warehouse. To create the required warehouse perform the following:

Change to the `ACCOUNTADMIN` role.

1. Navigate to Admin -> Warehouses and select + Warehouse.
2. Specify `CONNECTOR_UI_WH` as warehouse name, size XS, and, and leaving all other the defaults.
3. Select Create Warehouse.

### Install the ServiceNow® connector

The connector is delivered through the Snowflake Marketplace, and is available to all Snowflake customers.
Once chosen, it is installed into your account as an application with several views and stored procedures.

1. In the navigation menu, select Marketplace » Snowflake Marketplace.
2. In the search window, enter `ServiceNow` and select the tile.
3. Review the business needs and usage samples.
4. Select Get.
5. Select the warehouse previously created, `CONNECTOR_UI_WH`.
6. Select Options.
7. For this tutorial, accept the default name for the installation database, `Snowflake_Connector_for_ServiceNow`. Do not select any additional roles.
8. Select Get. `Snowflake Connector for ServiceNow` will display indicating the connector is now ready to use.
9. Select Done. Manage options will be specified it in the next section.

Next, check that the connector was installed. From Snowsight, go to Data Products -> Apps. You should see a new installed application with the name Snowflake_Connector_for_ServiceNow.

Navigate to the public schema in Data -> Databases, and examine the newly available views and procedures.

## Complete all the prerequisites

Launch the Snowflake Connector for ServiceNow® from the Data Products -> Apps -> Snowflake Connector for ServiceNow.
You will be presented with a list of tasks that need to be completed before the connector can start data ingestion.
Please read the following descriptions carefully and complete each.

One of the final steps asks you to create an application registry if you want to
enable OAuth2 authentication. The next several steps will focus on this.

For the next section, we suggest you open two browser tabs so that you can copy certain data from Snowflake to ServiceNow®:

* From the Snowflake, use the connector to generate the redirect URL which will be pasted into the Application Registry.
* From the ServiceNow®, you’ll need the Application Registry to provide the Client ID and secret, which you then paste into Snowflake.

### On Snowflake

1. Copy the redirect URL. You will need it in the next section.
2. Open a new tab in your browser (without closing the above) and follow the steps in the next section.

### On ServiceNow®

1. Log in to your ServiceNow® developer instance.
2. From the main page, select All and search Application Registry.
3. Select New in the upper right-hand side of the window.
4. Select Create an OAuth API endpoint for external clients.
5. Give the endpoint a name, such as Snowflake_connector. Leave the client secret blank, as the value populates automatically later in the procedure.
6. Paste in the redirect URL that was generated on the Snowflake side.
7. Select Submit. The window closes.
8. Select the registry you just created to re-open it.
   Note that the **Client ID** and **Client secret** are auto-generated.

   Don’t close the ServiceNow® browser tab or store the **Client ID** and **Client secret** in some safe place, they will be needed later.

   Return to the Snowflake configuration tab.

## Configure the connector

1. Select Start configuration.
   This Configure screen displays. By default, the fields are set to the names of objects that were created when you configured the connector.
   You can also use existing objects. The virtual warehouse selected will be used by the connector for background data ingestion.
2. Review [Configure the Snowflake Connector for ServiceNow®](../installing-snowsight.md) for more information.
3. Select Configure.

Note that it can take a few minutes for the configuration process to complete.

> **Note:**
>
> This step created a Large warehouse with its auto suspension set to ten minutes.
> If you set to refresh every hour, the Large warehouse (8 credits/hour) will
> wake up for a minimum of 10 minutes every hour. For this tutorial, this is not
> needed. In the navigation menu, select Compute » Warehouses » SERVICENOW_WAREHOUSE » … » Edit,
> and change this to an XSMALL, and the auto timeout to one minute. In a real-life
> use case, a Large warehouse size is often needed.

> **Note:**
>
> You should attach a resource monitor to the `SERVICENOW_WAREHOUSE`. To attach
> a resource monitor, In the navigation menu, select Admin » Cost management, and then select Resource Monitors.
> Create a warehouse resource monitor.

## Set up the Snowflake to ServiceNow® OAuth2 hand-shake

1. Select OAuth2 as an authentication method.
2. Fill in the ServiceNow® instance details. This is the first part of the ServiceNow® URL for your ServiceNow® account, **without** `https://*` protocol and the trailing `service-now.com`.
3. Paste the Client id and the Client secret from ServiceNow® into the Snowflake wizard.
4. Select Connect. Your ServiceNow® accounts pops up and requests to connect to Snowflake.
5. Select Allow. The connection is established between the two systems.

To verify the connection, select the three dots […] and View Details. At the top of the pop-up you will see the date ServiceNow authenticated.

> **Note:**
>
> If you are having issues, perhaps the Client secret wasn’t copied. Unlock the password field and copy and paste the text.

## Configure deletions sync

If you want not only inserts and updates, but also deletes to be synchronized to Snowflake, you have to provide name of the journal table.
By default ServiceNow® uses `sys_audit_delete` table to store information about deleted records so feel free to provide this name.
If you don’t care about deletes, you can leave this field empty.

Select Validate to check if the connector is able to connect to the source system and has access to all the required tables.
It can take a few minutes for the process to complete.
When it’s done, select Define data to sync to select tables for the ingestion.

## Select ServiceNow® tables

> **Note:**
>
> Be aware that:
>
> * The connector can only ingest tables with `sys_id` columns present.
> * ServiceNow® views are not supported. Instead of ingesting these views, you should synchronize all tables for the underlying view and join the synchronized tables in Snowflake.
> * Incremental updates occur only for tables with `sys_updated_on` or `sys_created_on` columns.
> * For tables that do not have `sys_updated_on` or `sys_created_on` columns, the connector uses `truncate and load` mode. In this mode, the table is always ingested using the initial load approach, and newly ingested data replaces the old data.

1. In the Snowflake Connector for ServiceNow window, on the top bar, select Data Sync.
2. To be able to run our test query later, we need to ingest a couple of tables. From the search window enter incident and check the box next to it and choose a 30 minute sync time.
3. To choose other tables, clear the search, put the table name and select the checkbox. Do this at least for the `task` table.

> > **Note:**
> >
> > Hint: Clear the search fields, and then select the title Status to sort and show all the tables you selected.

4. Select Start Sync. The select window closes and you get the message Syncing Data from the main Connector window. In addition to the tables you choose, three system tables will also be loaded. These are necessary to build the views on the raw data: `sys_dictionary`, `sys_db_object`, and `sys_glide_object`.

You receive a message indicating success. It appears once at least one table has been fully ingested.

> **Note:**
>
> Don’t stop the ingest prematurely. Ensure that views are built in the destination database first.

## Connector Monitoring

Open a worksheet to examine the connector status.
Here are some examples of SQL queries you can execute to get monitoring
information:

```sqlexample
// Return general information about all ingestions
SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.public.connector_stats;
// Search for information about particular table ingestions
SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.public.connector_stats WHERE table_name = '<table_name>';
// Examine connector configuration
SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.public.connector_configuration;
// Calculate ingested data volume
SELECT
    table_name,
    sum(ingested_rows) AS row_count
FROM SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.public.connector_stats
GROUP BY table_name
ORDER BY table_name;
// General connector statistics
SELECT * FROM SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.public.connector_overview;
```

## Configuring access to the ingested data

The connector exposes the `DATA_READER` application role. This role has read access to all the ingested data in the destination schema.
It’s automatically granted to the role provided during the **Configure** step of the installation process.
It was named `SERVICE_NOW_RESOURCES_PROVIDER` in the screenshot earlier in this guide.
You can grant either application role or account role further if needed.

## Query the data

Examine the tables that the connector has created under the destination schema of the destination database.
For each table in ServiceNow® that is configured for synchronization, the connector creates the following table and views:

* A table with the same name that contains the data in raw form, where each record is contained in a single `VARIANT` column.
* A view named `table_name__view` that contains the data in flattened form, where the view
  contains a column for each column in the original table and a row for each record that is present in the original table.

> **Note:**
>
> After you start the connector, it takes some time for the views to be created. The creation
> of the views relies on data in the ServiceNow® `sys_db_object`, `sys_dictionary`
> and `sys_glide_object` tables. The connector loads metadata from these ServiceNow®
> tables after you enable any table for synchronization. It can take some time for the connector
> to load this metadata. Do not stop the warehouse while views are being created.

* A view named `table_name__view_with_deleted` that contains the same
  data as `table_name__view` as well as rows for records that have been deleted in ServiceNow®.
* A table `table_name__event_log` that contains the history of changes fetched by the connector from ServiceNow®.

> To query from the raw data, review [Access the raw data](../accessing-data.md). To
> query the views (recommended), review [Access the flattened data](../accessing-data.md).

### Query to identify number of incidents raised by month and priority

Here’s a test query to identify the number of incidents raised by
month and priority. Other example queries are provided on the Snowflake Connector
for ServiceNow® page in the Marketplace.

```sqlexample
USE ROLE SERVICE_NOW_RESOURCES_PROVIDER;
USE DATABASE SERVICENOW_DEST_DB;
USE SCHEMA DEST_SCHEMA;
WITH T1 AS (
    SELECT
    DISTINCT
        T.NUMBER AS TICKET_NUMBER,
        T.SHORT_DESCRIPTION,
        T.DESCRIPTION,
        T.PRIORITY,
        T.SYS_CREATED_ON AS CREATED_ON,
        T.SYS_UPDATED_ON AS UPDATED_ON,
        T.CLOSED_AT
    FROM TASK__VIEW T
    LEFT JOIN INCIDENT__VIEW I
        ON I.SYS_ID = T.SYS_ID -- ADDITIONAL INCIDENT DETAIL
   WHERE I.SYS_ID IS NOT NULL -- THIS CONDITION HELPS KEEP JUST THE INCIDENT TICKETS
)
SELECT
 YEAR(CREATED_ON) AS YEAR_CREATED,
 MONTH(CREATED_ON) AS MONTH_CREATED,
 PRIORITY,
 COUNT(DISTINCT TICKET_NUMBER) AS NUM_INCIDENTS
FROM T1
GROUP BY
    YEAR_CREATED,
    MONTH_CREATED,
    PRIORITY
ORDER BY
    YEAR_CREATED,
    MONTH_CREATED,
    PRIORITY
;
```

## Granting access to the connector

The connector exposes two application roles beyond the one used to access the data in destination database:

* The `VIEWER` role has read only access to the connector configuration and state
* The `ADMIN` role that can modify connector configuration and enable/disable ingestion

To monitor errors, run stats, examine connector stats, and examine enabled tables, you
can set up a ServiceNow® monitoring role that allows access to the views and read
only procedures in the connector database.

For example, run the following in a worksheet (and then use the role):

```sqlexample
USE ROLE accountadmin;
CREATE ROLE IF NOT EXISTS servicenow_monitor_role;
GRANT APPLICATION ROLE SNOWFLAKE_CONNECTOR_FOR_SERVICENOW.viewer TO ROLE servicenow_monitor_role;
GRANT USAGE ON WAREHOUSE SERVICENOW_WAREHOUSE TO ROLE servicenow_monitor_role;
```

## Stop the Ingestion

During this tutorial, we’re only ingesting the data, so it makes sense to stop the ingestion
after that initial load. However, in a production environment, you would not stop the connector.

> **Note:**
>
> If you do not stop the connector, it will wake up the virtual warehouse at the specified time interval and consume credits.

1. In Snowsight, select the Snowflake Connector for ServiceNow tile.
2. In the Snowflake Connector for ServiceNow window, select Pause Connector.

## Uninstall the connector (but not the data)

If you have completed the tutorial or for any reason no longer need the connector you can easily uninstall it via the Snowflake Marketplace.

1. Select **Data Products** and then **Apps**.
2. Select three dots icon in the item on the list representing the connector app.
3. Select **Uninstall**.
4. Decide if you want to delete the objects owned by the application (tables and views with ingested data in the destination schema) or transfer ownership of them to another role.
5. Select **Uninstall**.

## Conclusion And Resources

Congratulations! You’ve successfully installed and configured the Snowflake Connector
for ServiceNow®, ingested data and ran a query to gather insights on incidents and priority.

What you learned

* How to set up the Snowflake Connector for ServiceNow®.
* How to ingest ServiceNow® data into Snowflake.
* How to stop the connector to avoid unnecessary costs in a development environment.

### Related Resources

* [Introducing the Snowflake Native Application Framework](https://www.snowflake.com/blog/introducing-snowflake-native-application-framework/)
* [About the Snowflake Connector for ServiceNow®](../about.md)

---
title: Uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data
source: https://docs.snowflake.com/en/connectors/google/gard/gard-connector-uninstalling-and-reinstalling.md
section: Connectors & Drivers
---

# Uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data

This topic provides information on uninstalling and reinstalling the Snowflake Connector for Google Analytics Raw Data.

## Uninstalling the Snowflake Connector for Google Analytics Raw Data

Data ingested by the connector remains in the selected destination database and schema, which are owned by the
customer’s role. However, all sink tables and views containing your Google Analytics data within the destination schema
are owned by the Snowflake Connector for Google Analytics Raw Data application. Therefore if you uninstall the connector before transferring the ownership of these tables and views
to an account role, they will be deleted as well.

> **Note:**
>
> If you do not want data to be deleted along with the connector, transfer the ownership of all tables and views in the destination schema to an account and revoke current grants from the application.

To transfer ownership of all tables and views in the destination schema to an account role, run the following queries:

> ```sqlsyntax
> GRANT OWNERSHIP ON ALL TABLES IN SCHEMA <destination database>.<destination schema>
> TO ROLE <account role>
> REVOKE CURRENT GRANTS;
>
> GRANT OWNERSHIP ON ALL VIEWS IN SCHEMA <destination database>.<destination schema>
> TO ROLE <account role>
> REVOKE CURRENT GRANTS;
> ```

To ensure the connector does not own any objects you do not want removed, run the following query:
:   ```sqlsyntax
    SHOW OBJECTS OWNED BY APPLICATION <application name>;
    ```

During the connector configuration some Snowflake objects which are not owned by the application are created and they will not
be removed automatically during the uninstallation. If you want to remove them as well, you are able to do so by dropping them manually using SQL queries.
Objects are:

* A network rule inside the `CONNECTORS_SECRET.<application name>` schema.
* A secret listed in the Settings » Authentication tab.
* An external access integration listed in the Settings » Authentication tab.
* A security integration listed in the Settings » Authentication tab.

To drop these objects, run the following queries:

> ```sqlsyntax
> DROP SECRET CONNECTORS_SECRET.<application name>.SECRET;
>
> DROP NETWORK RULE CONNECTORS_SECRET.<application name>.NETWORK_RULE;
>
> DROP EXTERNAL ACCESS INTEGRATION <external access integration name>;
>
> DROP SECURITY INTEGRATION <security integration name>;
> ```

For the secret and network rule, you may also want to drop their enclosing database and/or schema.

In order to uninstall the Snowflake Connector for Google Analytics Raw Data follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as a user with the ACCOUNTADMIN role.
2. In the navigation menu, select Catalog » Apps.
3. Search for the Snowflake Connector for Google Analytics Raw Data.
4. Select Uninstall.

## Reinstalling the connector with the same database and schema

If you removed the connector but left the database and schema containing the ingested data intact, you can later reinstall the connector and resume data ingestion from the point where the connector was last run.

> **Note:**
>
> To ensure data consistency, ensure that the current ingestion has completed and stopped before uninstalling the connector.

After uninstalling the connector, you can reinstall the connector by selecting Catalog » Apps in the navigation menu.

During the installation process:

* Provide the previously used database and schema.
* Provide the connector configuration.
* Provide the same ingestion configuration.

By providing the same configuration information, the connector can resume the ingestion process instead of starting over.
However, you need to ensure that the previously ingested data is available in the destination table before finalizing the configuration.
If you manually ingest the data once the connector is running, the ingestion process will start from the beginning.

> **Note:**
>
> If you uninstalled the connector during an ongoing ingestion, incomplete data from the last daily table whose ingestion was interrupted is deleted and re-ingested.

---
title: Use the Snowflake Connector for Google Looker Studio
source: https://docs.snowflake.com/en/connectors/google-looker-studio-connector.md
section: Connectors & Drivers
---

# Use the Snowflake Connector for Google Looker Studio

This topic describes how to use the Snowflake Connector for Google Looker Studio.

The Snowflake Connector for Google Looker Studio provides an interface to [Google Looker Studio](https://cloud.google.com/looker-studio), a data visualization software that you can use to
transform your raw data into the metrics and dimensions needed to create reports and dashboards.
This connector is available to users with a Google account as a Partner Connector within Google Looker Studio.

## Authentication methods

The Snowflake Connector for Google Looker Studio supports the following authentication methods for connecting to Snowflake:

* Username and password
* Key-pair authentication

With the username and password authentication method, users can authenticate the connection by providing their Snowflake credentials.
The key-pair method enables a more secure connection by using a private key for authentication instead of a password. To learn more about configuring
key-pair authentication in a Snowflake database, see [Key-pair authentication and key-pair rotation](../user-guide/key-pair-auth.md).

When configuring the public key for a user in the Snowflake database, ensure that you meet the following requirements:

* The key does not include the strings `-----BEGIN PUBLIC KEY-----` and `-----END PUBLIC KEY-----`.
* All newline characters are removed from the public key. This is required for proper authentication.

> **Note:**
>
> Because of its design for system-to-system communication, the connector is not compatible with interactive authentication methods,
> such as multi-factor authentication (MFA) with Duo Push.

## Connect your Snowflake account to Google Looker Studio

1. Sign in to [Google Looker Studio](https://cloud.google.com/looker-studio).
2. Click +, and then select Data Source.
3. Under the Partner Connectors section, select the Snowflake connector (the connector with the Snowflake logo).
4. If required, authorize Google Looker Studio to use this community connector.
5. Enter the following Snowflake user credentials to connect to Snowflake:

   * Username
   * Password or private key
6. Click Submit.
7. Provide the following parameters required to connect to your Snowflake account:

   * Account URL
   * Role
   * Warehouse
   * Database
   * Schema
   * SQL query
   > **Note:**
   >
   > The SQL query cannot end with a semicolon.
8. Click Connect.

   A page containing data source fields is displayed.
9. To visualize your data, click Create Report or Explore.

> **Note:**
>
> If you have trouble connecting to your Snowflake account, use the following procedure to revoke access, and then try to connect again.

### Revoke access

1. Sign in to [Google Looker Studio](https://cloud.google.com/looker-studio).
2. Select Data Sources.
3. Browse or search for the Snowflake connector, and then click More options.
4. Click Revoke access.

## Mapping Snowflake data types to Looker Studio data types

The connector maps your Snowflake database data types to a [unified set of data types](https://support.google.com/datastudio/answer/9514333) as follows:

| Snowflake data type | Google Looker Studio data type |
| --- | --- |
| `BOOLEAN` | `BOOLEAN` |
| `FIXED` | `NUMBER` |
| `REAL` | `NUMBER` |
| `BINARY` | `TEXT` |
| `TEXT` | `TEXT` |
| `GEOGRAPHY` | `TEXT` \* |
| `DATE` | `YEAR_MONTH_DAY` |
| `TIMESTAMP_LTZ` | `YEAR_MONTH_DAY_SECOND` |
| `TIMESTAMP_NTZ` | `YEAR_MONTH_DAY_SECOND` |
| `TIMESTAMP_TZ` | `YEAR_MONTH_DAY_SECOND` |
| `TIME` | `TEXT` |
| `OBJECT` | `TEXT` \* |
| `VARIANT` | `TEXT` \* |
| `ARRAY` | `TEXT` \* |

[\*]

Google Looker Studio does not support complex spatial types, so they are represented as text. The text format allows you to freely process data in custom visualizations.

> **Note:**
>
> If Google Looker Studio encounters a column in a table or query of an unsupported type, it does not create a field for that
> column.

For more information about Snowflake data types, see [SQL data types reference](../sql-reference-data-types.md).

## Network policy access

Connections from Google Looker Studio to Snowflake come from ephemeral Google servers with no fixed IP addresses. If your
network uses [network policies](../user-guide/network-policies.md), you may need to open up the policy for the Looker Studio user to either allow *all* IP addresses
(0.0.0.0/0) or use [this shell script](https://gist.github.com/n0531m/f3714f6ad6ef738a3b0a) to get a list of possible Google Cloud IP addresses with subnets.

## Identifying Connector queries in your query history

The Snowflake Connector for Google Looker Studio uses user-provided SQL statements as an inner SELECT statement for each
generated query to a database. Therefore, your query history may contain optimized queries that differ from
the queries you entered when configuring a data source.

In your query history, the queries from the connector will include this inner SELECT statement.

## Supported SQL queries

Only the `SELECT`, `SHOW`, and `DESCRIBE` SQL statements are supported. The connector only supports specifying a single SQL statement as the query; it does not support selecting tables and views from a list.

## Limitations

* The connector does not support the use of encrypted private keys for key-pair authentication.
* Because of its design for system-to-system communication, the connector is not compatible with interactive authentication methods, such as MFA with Duo Push.
* The current sign-in flow only supports a single sign-in (username and password or private key), which only works for different accounts if
  all accounts use the same username and password or private key. The connector does not support using multiple sign-ins to the same or different Snowflake accounts.
* Google limits the returned data set to 1 million rows and 50 MB of data. Unexpected errors may occur when you try to
  return more data.
* Column headers (field names) must use ASCII characters only; non-ASCII characters are not supported.
* Reports containing `REGEXP_PARTIAL_MATCH` and `REGEXP_EXACT_MATCH` operators are not optimized by [pushdown filters](https://developers.google.com/datastudio/connector/filters)
  because Snowflake and Google Looker Studio support different regexp types.
* [Pushdown filters](https://developers.google.com/datastudio/connector/filters) are not supported for the `SHOW` and `DESCRIBE` statements and for `DATE`, `TIME`, and `TIMESTAMP` columns.

> **Note:**
>
> If MFA is enabled for the Snowflake username used in the connector, it can lead to excessive Duo Push notifications, which can cause inconvenience to users.
> This behavior arises because the connector may trigger multiple authentication requests during connection attempts.
> To mitigate this issue, consider using the key-pair authentication method instead of username and password.

---
title: Viewing MySQL data in Snowflake
source: https://docs.snowflake.com/en/connectors/mysql6/view-data.md
section: Connectors & Drivers
---

# Viewing MySQL data in Snowflake

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for MySQL.
> We’re now focused on a next-generation solution that will offer a significantly
> improved experience; therefore, moving this connector to the general availability
> status is currently not on our product roadmap.
> You may continue to use this connector as preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for MySQL](../../user-guide/data-integration/openflow/connectors/mysql/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The connector replicates data to the destination database, which was defined while setting up the connector and calling `PUBLIC.ADD_DATA_SOURCE('<data_source_name>', '<dest_db>')`.

Data tables contain the replicated data and are available under identifier `dest_db.schema_name.table_name` where:

* `dest_db` is the name of the destination database.
* `schema_name` is the schema name in which the original MySQL table resides.
* `table_name` is the name of the original MySQL table.

> **Note:**
>
> `dest_db`, `schema_name` and `table_name` needs to be double quoted in case their names are mixed-case.

The replicated tables contain the additional metadata columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | Timestamp of when the row was inserted into the destination table, in UTC. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | Timestamp of when the row was last updated in the destination table, in UTC. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Value is `true` if the row has been deleted from the source table. |

The replicated data types are mapped to match the Snowflake types. For more information, see MySQL to Snowflake data type mapping.

## Replicated data access control

To control access to replicated data use `DATA_READER` application role. More on connector application roles: [Application roles in the Snowflake Connector for MySQL](roles.md)
For more granular control over specific destination objects, use `ACCOUNTADMIN` role to grant proper privileges or create database roles.

## MySQL to Snowflake data type mapping

In Snowflake, column names of replicated tables are capitalized and types are mapped to match the Snowflake types.

The following table shows how connector data types are mapped to Snowflake types.

| MySQL Type | Snowflake Type | Notes |
| --- | --- | --- |
| DECIMAL / NUMERIC | NUMBER | The maximum number of digits in DECIMAL format for MySQL is 65. For Snowflake, the maximum is 38.  Supported up to the maximum allowed digits in Snowflake. When exceeded, precision is lost. For more information, see [Numeric data types](../../sql-reference/data-types-numeric.md). |
| INT / INTEGER | INT |  |
| TINYINT / BOOL | INT |  |
| SMALLINT | INT |  |
| MEDIUMINT | INT |  |
| BIGINT | INT |  |
| YEAR | INT |  |
| FLOAT | FLOAT |  |
| DOUBLE | FLOAT |  |
| VARCHAR | VARCHAR |  |
| TINYTEXT | VARCHAR |  |
| TEXT | VARCHAR |  |
| ENUM | VARCHAR | Stored as a string. For example, for ENUM(‘one’, ‘two’) the possible values are: ‘one’, ‘two’. |
| SET | VARCHAR | Stored as a comma-joined string in column declaration order. For example, for SET(‘one’, ‘two’) the possible values are: ‘ ‘, ‘one’, ‘two’, ‘one,two’. |
| MEDIUMTEXT | VARCHAR | Supported up to the maximum entry size in Snowflake (16MB). |
| LONGTEXT | VARCHAR | Supported up to the maximum entry size in Snowflake (16MB). |
| CHAR | VARCHAR | Sent to Snowflake without the trailing spaces. |
| BIT | VARCHAR | Represented in hexadecimal, for example: ‘83060c183060c183’. |
| DATE | DATE | Stored in target tables as strings, for example ‘1971-01-31’. In flattened views, date is converted to DATE. |
| DATETIME | DATETIME / TIMESTAMP_NTZ |  |
| TIMESTAMP | TIMESTAMP_TZ | Stored in target tables as strings in UTC, for example ‘2000-12-30 23:59:59.001009+00:00’. In flattened views, timestamps are converted to TIMESTAMP_TZ. |
| TIME | TIME | Stored in target tables as strings, for example ‘23:59:59’. In flattened views, time values are converted to TIME. |
| BINARY | BINARY |  |
| MEDIUMBLOB | BINARY | Supported up to the maximum entry size in Snowflake, which is 16MB. |
| LONGBLOB | BINARY | Supported up to the maximum entry size in Snowflake, which is 16MB. |
| BLOB | BINARY |  |
| VARBINARY | BINARY |  |
| TINYBLOB | BINARY |  |
| JSON | VARIANT | JSON can be stored in the MySQL BinLog as a complete document or as a partial update. By default, it is stored as a complete document. Partial updates are currently not supported.  JSONs are sent to Snowflake as strings, but Snowpipe Streaming converts them to a VARIANT data type and stores them internally as ARRAY, OBJECT, etc.  Supported up to the maximum entry size in Snowflake, which is 16MB. |

## Resuming snapshot load after failures

If the connection between the database agent and the connector is lost during snapshot load, because of time and cost optimisation,
the connector will continue to load the snapshot from the point where it was stopped before. This happens regardless of whether the agent was
restarted or if there was an issue with the connections between the source database and the database agent, and the database agent and the connector.

This feature works for primary key columns of the following types:

* TINYINT
* SMALLINT
* MEDIUMINT
* INT
* BIGINT
* ENUM
* CHAR
* TINYTEXT
* VARCHAR
* TEXT

If the primary key is of any other type, the snapshot load after the connection failure for a particular column will start from the beginning.

## Viewing data from deleted columns

If a column is deleted in the source table, it will not be deleted in the destination table.
Instead, a soft-delete approach is followed, and the column will be renamed to `<previous name>__SNOWFLAKE_DELETED` so that historical values can still be queried.

For example, if a column `A` is deleted, it will be renamed to `A__SNOWFLAKE_DELETED` in the destination table and can be queried as

```sqlsyntax
SELECT A__SNOWFLAKE_DELETED FROM <TABLE_NAME>;
```

## Viewing data from renamed columns

Renaming a column is equal to deleting the column and creating a new one with the new name.
The deletion follows the soft-delete approach explained in the previous section.

For example, if column `A` was renamed to `B` - in the destination table `A` was renamed to `A__SNOWFLAKE_DELETED` and a new column `B` is added.
All rows existing before the change keep the values of the column in the `A__SNOWFLAKE_DELETED` column while new rows added after the change have the values in the `B` column.
Values from the renamed column can be viewed as a single column with a simple query:

```sqlsyntax
SELECT
     CASE WHEN B IS NULL THEN A__SNOWFLAKE_DELETED ELSE B END AS A_RENAMED_TO_B
FROM <TABLE_WITH_RENAMED_COLUMN>;
```

A view can be created to simplify the usage after a column is renamed.

## Next steps

After completing these procedures, review the processes in [Snowflake Connector for MySQL ongoing tasks](ongoing.md)

---
title: Viewing PostgreSQL data in Snowflake
source: https://docs.snowflake.com/en/connectors/postgres6/view-data.md
section: Connectors & Drivers
---

# Viewing PostgreSQL data in Snowflake

> **Important:**
>
> Thank you for your interest in the Snowflake Connector for PostgreSQL.
> Note that we’re now focused on a next-generation solution that will offer a significantly improved experience.
> Hence, moving this connector to the general availability status is currently not on our product roadmap.
> You may continue to use this connector as a preview feature, but please note that support for future bug
> fixes and improvements are not guaranteed. The new solution is available as [Openflow Connector for PostgreSQL](../../user-guide/data-integration/openflow/connectors/postgres/about.md) and
> includes better performance, customizability, and enhanced deployment options.

The connector replicates data to the destination database, which was defined while setting up the connector and calling `PUBLIC.ADD_DATA_SOURCE('<data_source_name>', '<dest_db>')`.

Data tables contain the replicated data and are available under identifier `dest_db.schema_name.table_name` where:

* `dest_db` is the name of the destination database.
* `schema_name` is the schema name in which the original PostgreSQL table resides.
* `table_name` is the name of the original PostgreSQL table.

> **Note:**
>
> `dest_db`, `schema_name` and `table_name` needs to be double quoted in case their names are mixed-case.

The replicated tables contain the additional metadata columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `_SNOWFLAKE_INSERTED_AT` | TIMESTAMP_NTZ | Timestamp of when the row was inserted into the destination table, in UTC. |
| `_SNOWFLAKE_UPDATED_AT` | TIMESTAMP_NTZ | Timestamp of when the row was last updated in the destination table, in UTC. |
| `_SNOWFLAKE_DELETED` | BOOLEAN | Value is `true` if the row has been deleted from the source table. |

The replicated data types are mapped to match the Snowflake types. For more information, see PostgreSQL to Snowflake data type mapping.

## Replicated data access control

To control access to replicated data use `DATA_READER` application role. More on connector application roles: [Application roles in the Snowflake Connector for PostgreSQL](roles.md)
For more granular control over specific destination objects, use `ACCOUNTADMIN` role to grant proper privileges or create database roles.

## PostgreSQL to Snowflake data type mapping

In Snowflake, column names of replicated tables are capitalized and types are mapped to match the Snowflake types.

The following table shows the PostgreSQL to Snowflake types mapping.

| PostgreSQL Type | Snowflake Type | Notes |
| --- | --- | --- |
| BIGINT / INT8 | INT |  |
| BIGSERIAL / SERIAL8 | INT |  |
| BIT [(N)] | VARCHAR |  |
| BIT VARYING [(N)] / VARBIT [(n)] | VARCHAR |  |
| BOOLEAN / BOOL | BOOLEAN |  |
| BOX | VARCHAR |  |
| BYTEA | BINARY(N) | Supported up to the max datapoint size in Snowflake (16MB). Max length 1 GB. |
| CHARACTER [(N)] / CHAR [(N)] | VARCHAR [N] | Max length 10485760 ~= 10 MB |
| CHARACTER VARYING [(N)] / VARCHAR [(N)] | VARCHAR [N] | Max length 10485760 ~= 10 MB |
| CIDR | VARCHAR |  |
| CIRCLE | VARCHAR |  |
| DATE | DATE |  |
| DOUBLE PRECISION / FLOAT8 | FLOAT |  |
| INET | VARCHAR |  |
| INTEGER / INT / INT4 | INT |  |
| INTERVAL [FIELDS][(P)] | VARCHAR |  |
| JSON | VARIANT | Supported up to the max datapoint size in Snowflake (16MB). |
| JSONB | VARIANT | Supported up to the max datapoint size in Snowflake (16MB). |
| LINE | VARCHAR |  |
| LSEG | VARCHAR |  |
| MACADDR | VARCHAR |  |
| MACADDR8 | VARCHAR |  |
| MONEY | VARIANT |  |
| NUMERIC [(P, S)] / DECIMAL [(P, S)] | DECIMAL(P, S) | Scale and precision are also recreated on the Snowflake side preserving Snowflake limitations. |
| PATH | VARCHAR |  |
| PG_LNS | VARCHAR |  |
| POINT | VARCHAR |  |
| POLYGON | VARCHAR |  |
| REAL / FLOAT4 | FLOAT |  |
| SMALLINT / INT2 | INT |  |
| SMALLSERIAL / SERIAL2 | INT |  |
| SERIAL / SERIAL4 | INT |  |
| TEXT | VARCHAR |  |
| TIME [(P)] [ without time zone ] | TIME |  |
| TIME [(P)] with time zone | TIME |  |
| TIMESTAMP [(P)] [ without time zone ] | DATETIME / TIMESTAMP_NTZ |  |
| TIMESTAMP [(P)] with time zone | TIMESTAMP_TZ |  |
| TSQUERY | VARCHAR |  |
| TSVECTOR | VARCHAR |  |
| UUID | VARCHAR |  |
| XML | VARCHAR |  |

All other types, including arrays, ENUMs, custom types and ranges are mapped to VARCHAR values in Snowflake.
The following table illustrates how types not explicitly mentioned in the table above are handled.

| PostgreSQL Type | Data in PostgreSQL | Column in Snowflake |
| --- | --- | --- |
| ENUM | monday | “monday” |
| array of INTEGER | {1,2,3,5} | “{1,2,3,5}” |
| intrange | [6,31) | “[6,31)” |
| custom type (2 fields, INT4 and TEXT) | (text value,5432) | “(text value,5432)” |

## Resuming snapshot load after failures

If the connection between the database agent and the connector is lost during snapshot load, because of time and cost optimisation,
the connector will continue to load the snapshot from the point where it was stopped before. This happens regardless of whether the agent was
restarted or if there was an issue with the connections between the source database and the database agent, and the database agent and the connector.

This feature works for primary key columns of the following types:

* SMALLINT/INT2
* INTEGER/INT/INT4
* BIGINT/INT8
* UUID
* NUMERIC
* TEXT
* VARCHAR
* BOOL

If the primary key is of any other type, the snapshot load after the connection failure for a particular column will start from the beginning.

## Viewing data from deleted columns

If a column is deleted in the source table, it will not be deleted in the destination table.
Instead, a soft-delete approach is followed, and the column will be renamed to `<previous name>__SNOWFLAKE_DELETED` so that historical values can still be queried.

For example, if a column `A` is deleted, it will be renamed to `A__SNOWFLAKE_DELETED` in the destination table and can be queried as

```sqlsyntax
SELECT A__SNOWFLAKE_DELETED FROM <TABLE_NAME>;
```

## Viewing data from renamed columns

Renaming a column is equal to deleting the column and creating a new one with the new name.
The deletion follows the soft-delete approach explained in the previous section.

For example, if column `A` was renamed to `B` - in the destination table `A` was renamed to `A__SNOWFLAKE_DELETED` and a new column `B` is added.
All rows existing before the change keep the values of the column in the `A__SNOWFLAKE_DELETED` column while new rows added after the change have the values in the `B` column.
Values from the renamed column can be viewed as a single column with a simple query:

```sqlsyntax
SELECT
     CASE WHEN B IS NULL THEN A__SNOWFLAKE_DELETED ELSE B END AS A_RENAMED_TO_B
FROM <TABLE_WITH_RENAMED_COLUMN>;
```

A view can be created to simplify the usage after a column is renamed.

## Next steps

After completing these procedures, review the processes in [Snowflake Connector for PostgreSQL ongoing tasks](ongoing.md)

---
title: Working with the Snowflake High Performance connector for Kafka
source: https://docs.snowflake.com/en/connectors/kafkahp/how-the-connector-works.md
section: Connectors & Drivers
---

# Working with the Snowflake High Performance connector for Kafka

This topic describes how the connector works with tables and pipes, and how to configure the connector with these elements.

## How the connector works with tables and pipes

The connector treats each Kafka record as a row to be inserted into a Snowflake table. For example,
if you have a Kafka topic with the content of the message structured like the following JSON:

```json
{
  "order_id": 12345,
  "customer_name": "John",
  "order_total": 100.00,
  "isPaid": true
}
```

By default you don’t have to create a table or pipe before ingestion is begins.
The connector creates a table with columns matching the JSON keys, and relies on the default pipe named `{tableName}-STREAMING`
which will automatically map the record content’s first-level keys to table columns matching by name (case-insensitive).
You can also create your own table with columns matching the JSON keys.
The connector tries to match the record content’s first-level keys to the table columns by name.
If keys from the JSON do not match the table columns, the connector ignores the keys.

```sqlexample
CREATE TABLE ORDERS (
  record_metadata VARIANT,
  order_id NUMBER,
  customer_name VARCHAR,
  order_total NUMBER,
  ispaid BOOLEAN
);
```

If you choose to create your own pipe, you can define the data transformation logic in the pipe’s [COPY INTO](../../sql-reference/sql/copy-into-table.md) statement. You can rename columns as required and cast the data types as needed. For example:

```sqlexample
CREATE TABLE ORDERS (
  order_id VARCHAR,
  customer_name VARCHAR,
  order_total VARCHAR,
  ispaid VARCHAR
);
```

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS
FROM (
  SELECT
  $1:order_id::STRING,
  $1:customer_name,
  $1:order_total::STRING,
  $1:isPaid::STRING
FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

or

```sqlexample
CREATE TABLE ORDERS (
 topic VARCHAR,
 partition VARCHAR,
 order_id VARCHAR,
 customer_name VARCHAR,
 order_total VARCHAR,
 ispaid VARCHAR
);
```

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS
FROM (
  SELECT
  $1:RECORD_METADATA.topic::STRING AS topic,
  $1:RECORD_METADATA.partition::STRING AS partition,
  $1['order_id']::STRING AS order_id,
  $1['customer_name']::STRING as customer_name,
  CONCAT($1['order_total']::STRING, ' USD') AS order_total,
  $1['isPaid']::STRING AS ispaid
FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

When you define your own pipe your destination table columns need not need match the JSON keys.
You can rename the columns to your desired names and cast the data types if required.

### Topic names, table names, and pipe names

Depending on the configuration settings, the connector will use different names for the destination table.
The destination table name is always derived from the topic name.

#### How the connector maps topic names to the destination table

The Kafka connector provides two modes for mapping Kafka topic names to Snowflake table names:

* **Static mapping**: The connector derives destination table names using only Kafka topic name.
* **Explicit topic-to-table mapping mode**: You specify custom mappings between topics and tables using the `snowflake.topic2table.map` configuration parameter

#### Static mapping

If you do not configure the `snowflake.topic2table.map` parameter, the connector always derives the table names from the topic name.

**Table name generation:**

The connector derives the destination table name from the topic name using the following rules:

1. If the topic name is a valid [Snowflake identifier](../../sql-reference/identifiers-syntax.md)
   the connector uses the topic name as the destination table name, converted to uppercase).
2. If the topic name contains invalid characters, the connector:

   * Replaces invalid characters with underscores
   * Appends an underscore followed by a hash code to ensure uniqueness
   * For example, the topic `my-topic.data` becomes `MY_TOPIC_DATA_<hash>`

**Pipe name determination:**

The connector determines which pipe to use based on the following logic:

1. The connector checks if a pipe exists with the same name as the destination table name.
2. If a user-created pipe with that name exists, the connector uses that pipe (user-defined pipe mode).
3. If not, the connector uses the default pipe named `{tableName}-STREAMING`

> **Note:**
>
> Snowflake recommends choosing topic names that follow the rules for Snowflake identifier names to ensure predictable table names.

### Understanding RECORD_METADATA

The connector populates the `RECORD_METADATA` structure with metadata about the Kafka record. This metadata is sent through the Snowpipe Streaming data source to Snowflake, where it becomes available in pipe transformations using the `$1:RECORD_METADATA` accessor. `RECORD_METADATA` structure is available in both user-defined pipe and default pipe modes. Its content can be saved to the column of type VARIANT, or individual fields can be extracted and saved to separate columns.

**Example pipe with transformations and metadata:**

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS_TABLE
FROM (
  SELECT
    $1:order_id::NUMBER,
    $1:customer_name,
    $1:order_total,
    $1:RECORD_METADATA.topic AS source_topic,
    $1:RECORD_METADATA.offset::NUMBER AS kafka_offset,
    $1:RECORD_METADATA.SnowflakeConnectorPushTime::BIGINT AS ingestion_time
  FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

In this example:

* The pipe extracts specific fields from the Kafka message (order_id, customer_name, order_total)
* It also captures metadata fields (topic, offset, and ingestion timestamp)
* The values can be cast and/or transformed as needed

### How metadata fields are populated

The connector automatically populates metadata fields based on the Kafka record properties and connector configuration. You can control which metadata fields are included using these configuration parameters:

* `snowflake.metadata.topic` (default: true) - Includes the topic name
* `snowflake.metadata.offset.and.partition` (default: true) - Includes offset and partition
* `snowflake.metadata.createtime` (default: true) - Includes the Kafka record timestamp
* `snowflake.metadata.all` (default: true) - Includes all available metadata

When `snowflake.metadata.all=true` (the default), all metadata fields are populated. Setting individual metadata flags to `false` excludes those specific fields from the RECORD_METADATA structure.

> **Note:**
>
> The `SnowflakeConnectorPushTime` field is always available and represents the time when the connector pushed the record into the ingestion buffer. This is useful for calculating end-to-end ingestion latency.

The RECORD_METADATA structure contains the following information by default:

| Field | Data Type | Description |
| --- | --- | --- |
| topic | String | The name of the Kafka topic that the record came from. |
| partition | String | The number of the partition within the topic. (Note that this is the Kafka partition, not the Snowflake micro-partition.) |
| offset | number | The offset in that partition. |
| CreateTime / . LogAppendTime | number | This is the timestamp associated with the message in the Kafka topic. The value is milliseconds since midnight January 1, 1970, UTC. For more information, see: [Kafka ProducerRecord documentation](https://kafka.apache.org/0100/javadoc/org/apache/kafka/clients/producer/ProducerRecord.html). |
| SnowflakeConnectorPushTime | number | A timestamp when a record was pushed into an Ingest SDK buffer. The value is the number of milliseconds since midnight January 1, 1970, UTC. For more information, see [Estimating ingestion latency](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka.md). |
| key | String | If the message is a Kafka KeyedMessage, this is the key for that message. In order for the connector to store the key in the RECORD_METADATA, the `key.converter` parameter in the [Kafka configuration properties](../../user-guide/kafka-connector-install.md) must be set to `org.apache.kafka.connect.storage.StringConverter`; otherwise, the connector ignores keys. |
| headers | Object | A header is a user-defined key-value pair associated with the record. Each record can have 0, 1, or multiple headers. |

The amount of metadata recorded in the RECORD_METADATA column is configurable using optional Kafka configuration properties.

The field names and values are case-sensitive.

### How Kafka records are converted before ingestion

Before each row is handed over to Snowpipe Streaming, the connector converts the Kafka Connect record value into a `Map<String, Object>` whose keys must match your target column names (or can be transformed inside a user-defined pipe). Primitive strings, byte arrays, or numbers must be wrapped (for example by using the HoistField SMT) so that the connector receives a structured object. The converter applies the following rules:

* Null values are treated as tombstones. They are skipped when `behavior.on.null.values=IGNORE` or ingested as empty JSON objects otherwise.
* Numeric and boolean fields are passed through as-is. Decimal values whose precision is greater than 38 are serialized as strings to stay within Snowflake’s `NUMBER` limits.
* `byte[]` and `ByteBuffer` payloads are Base64-encoded strings, so store them in `VARIANT` or `VARCHAR` columns.
* Arrays remain arrays, and nested objects remain nested maps. Declare `VARIANT` columns when you rely on the default pipe to land nested data as-is.
* Maps with non-string keys are emitted as arrays of `[key, value]` pairs because Snowflake column names must be text.
* Record headers and keys are copied into `RECORD_METADATA` whenever the relevant metadata flags are enabled.

If you need the entire message body preserved as a single column, wrap it into a new top-level field using SMTs. See Legacy RECORD_CONTENT column for the transformation pattern.

## User-defined pipe mode vs default pipe mode

The connector supports two modes for managing data ingestion:

* User-defined pipe mode
* Default pipe mode

### User-defined pipe mode

In this mode, you have full control over data transformation and column mapping.

**When to use this mode:**

* You need custom column names that differ from JSON field names
* You need to apply data transformations (type casting, masking, filtering)
* You want full control over how data is mapped to columns

### Default pipe mode

In this mode, the connector uses a default pipe named `{tableName}-STREAMING` and maps kafka record fields to table columns matching by name (case-insensitive).

**When to use this mode:**

* Your kafka record key names match your desired column names
* You don’t need custom data transformations
* You want a simple configuration

**Mapping kafka record keys to table columns with default pipe mode**

When using default pipe mode, the connector uses default pipe named `{tableName}-STREAMING` and maps content’s first-level keys directly to table columns using case-insensitive matching.

### Using default pipe mode - example

#### Example 1:

Consider the following kafka record content payload:

```json
{
  "city": "New York",
  "age": 30,
  "married": true,
  "has cat": true,
  "@&$#* includes special characters": true,
  "skills": ["sitting", "standing", "eating"],
  "family": {"son": "Jack", "daughter": "Anna"}
}
```

You create a table with columns matching the JSON keys (case-insensitive, including special characters):

```sqlexample
CREATE TABLE PERSON_DATA (
  record_metadata VARIANT,
  city VARCHAR,
  age NUMBER,
  married BOOLEAN,
  "has cat" BOOLEAN,
  "!@&$#* includes special characters" BOOLEAN,
  skills VARIANT,
  family VARIANT
);
```

**Matching behavior:**

* `"city"` (kafka) → `city` or `CITY` or `City` (column) - case insensitive
* `"has cat"` (kafka) → `"has cat"` (column) - must be quoted due to space
* `"!@&$#* includes special characters"` (kafka) → `"!@&$#* includes special characters"` (column) - special characters preserved
* Nested objects like `skills` and `family` map to VARIANT columns automatically

### Using user-defined pipe mode - examples

This example shows how to configure and use user-defined pipes with custom data transformations.

#### Example 1:

Create a table with your desired schema:

```sqlexample
CREATE TABLE ORDERS (
  order_id NUMBER,
  customer_name VARCHAR,
  order_total NUMBER,
  order_date TIMESTAMP_NTZ,
  source_topic VARCHAR
);
```

Create a pipe that transforms the incoming Kafka records to match your table schema:

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS
FROM (
  SELECT
    $1:order_id::NUMBER,
    $1:customer_name,
    $1:order_total::NUMBER,
    $1:order_date::TIMESTAMP_NTZ,
    $1:RECORD_METADATA.topic
  FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

Note that the pipe name (`ORDERS`) matches the table name (`ORDERS`). The pipe definition extracts fields from the JSON payload using `$1:field_name` syntax and maps them to the table columns.

> **Note:**
>
> You can access nested JSON fields and fields with special characters using bracket notation, such as `$1['field name']` or `$1['has cat']`.

Configure topic to table mapping:

```properties
snowflake.topic2table.map=kafka-orders-topic:ORDERS
```

This configuration maps the Kafka topic `kafka-orders-topic` to the pre-existing table and pipe named `ORDERS`.

#### Example 2:

When you need to access keys in the content that do not have conventional names use the following syntax:

* Simple fields: `$1:field_name`
* Fields with spaces or special characters: `$1['field name']` or `$1['has cat']`
* Fields with unicode characters: `$1[' @&$#* has Łułósżź']`
* Nested fields: `$1:parent.child` or `$1:parent['child field']`

Consider this JSON payload from Kafka:

```json
{
  "city": "New York",
  "age": 30,
  "married": true,
  "has cat": true,
  " @&$#* has Łułósżź": true,
  "skills": ["sitting", "standing", "eating"],
  "family": {"son": "Jack", "daughter": "Anna"}
}
```

You create a destination table with your chosen column names:

```sqlexample
CREATE TABLE PERSON_DATA (
  city VARCHAR,
  age NUMBER,
  married BOOLEAN,
  has_cat BOOLEAN,
  weird_field_name BOOLEAN,
  skills VARIANT,
  family VARIANT
);
```

Then create a pipe with the same name that defines the mapping:

```sqlexample
CREATE PIPE PERSON_DATA AS
COPY INTO PERSON_DATA
FROM (
  SELECT
    $1:city,
    $1:age,
    $1:married,
    $1['has cat'] AS has_cat,
    $1[' @&$#* has Łułósżź'] AS weird_field_name,
    $1:skills,
    $1:family
  FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

**Key points:**

* You control column names (e.g., renaming `"has cat"` to `has_cat`)
* You can cast data types as needed (e.g., `$1:age::NUMBER`)
* You can include or exclude fields as desired
* You can add metadata fields (e.g., `$1:RECORD_METADATA.topic`)
* VARIANT columns automatically handle nested JSON structures

#### Example 3: With interactive tables

Interactive tables are a special type of Snowflake table optimized for low-latency, high-concurrency queries. You can find out more about interactive tables in the [interactive tables documentation](../../user-guide/interactive.md).

1. Create an interactive table:

   ```sqlexample
   CREATE INTERACTIVE TABLE REALTIME_METRICS (
     metric_name VARCHAR,
     metric_value NUMBER,
     source_topic VARCHAR,
     timestamp TIMESTAMP_NTZ
   ) AS (SELECT
         $1:M_NAME::VARCHAR,
         $1:M_VALUE::NUMBER,
         $1:RECORD_METADATA.topic::VARCHAR,
         $1:RECORD_METADATA.timestamp::TIMESTAMP_NTZ
         from TABLE(DATA_SOURCE(TYPE => 'STREAMING')));
   ```
2. Configure topic to table mapping:

   ```properties
   snowflake.topic2table.map=metrics-topic:REALTIME_METRICS
   ```

**Important considerations:**

* Interactive tables have specific limitations and query restrictions. Review the
  [interactive tables documentation](../../user-guide/interactive.md) before using them with the connector.
* For interactive tables, any required transformations must be handled in the table definition.
* Interactive warehouses are required to query interactive tables efficiently.

### Explicit topic-to-table mapping

When you configure the `snowflake.topic2table.map` parameter, the connector operates in explicit mapping mode. This mode allows you to:

* Map multiple Kafka topics to a single Snowflake table
* Use custom table names that differ from topic names
* Apply regex patterns to match multiple topics

**Configuration format:**

The `snowflake.topic2table.map` parameter accepts a comma-separated list of topic-to-table mappings in the format:

```none
topic1:table1,topic2:table2,topic3:table3
```

**Example configurations:**

Direct topic mapping

```properties
snowflake.topic2table.map=orders:ORDER_TABLE,customers:CUSTOMER_TABLE
```

Regex pattern matching

```properties
snowflake.topic2table.map=.*_cat:CAT_TABLE,.*_dog:DOG_TABLE
```

This configuration maps all topics ending with `_cat` (such as `orange_cat`, `calico_cat`) to the `CAT_TABLE` table, and all topics ending with `_dog` to the `DOG_TABLE` table.

Many topics to one table

```properties
snowflake.topic2table.map=topic1:shared_table,topic2:shared_table,topic3:other_table
```

This configuration maps both `topic1` and `topic2` to `shared_table`, while `topic3` maps to `other_table`.

> **Important:**
>
> * Regex patterns in the mapping cannot overlap. Each topic must match at most one pattern.
> * Table names in the mapping must be valid Snowflake identifiers with at least 2 characters, starting with a letter or underscore.
> * You can map multiple topics to a single table.

### Legacy RECORD_CONTENT column

In prior versions of the connector (3.x and earlier), when the schematization feature was disabled, the connector created a destination table
with two columns: RECORD_CONTENT and RECORD_METADATA.
The RECORD_CONTENT column contained the entire Kafka message content in a column of type VARIANT.
The RECORD_METADATA column continues to be supported but the RECORD_CONTENT column is no longer created by the connector.
The same functionality can be achieved using SMT transformations (see examples later in this section).
The RECORD_CONTENT key is also no longer available in PIPE transformations. For example, this PIPE definition will not work by default:

> **Note:**
>
> This pipe definition will not work without additional SMT transformations.

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS
FROM (
  SELECT
    $1:RECORD_CONTENT
FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

If you need entire Kafka message content saved to a single column, or you need a handle to the entire Kafka message content in a PIPE transformation, you can use the following SMT transformation that wraps the entire Kafka message content into your desired custom field:

```properties
transforms=wrapKafkaMessageContent
transforms.wrapKafkaMessageContent.type=org.apache.kafka.connect.transforms.HoistField$Value
transforms.wrapKafkaMessageContent.field=your_top_level_field_name
```

This transformation will wrap the entire Kafka message content into a custom field named `your_top_level_field_name`. You can then access the entire Kafka message content using the `$1:your_top_level_field_name` accessor in your PIPE transformation.

```sqlexample
CREATE PIPE ORDERS AS
COPY INTO ORDERS
FROM (
  SELECT
    $1:your_top_level_field_name
FROM TABLE(DATA_SOURCE(TYPE => 'STREAMING'))
);
```

Alternatively, if you want to save both the entire metadata and content to a single table using the default pipe, do not create a custom pipe; instead, create only a table with two columns: `RECORD_CONTENT` and `your_top_level_field_name`.

```sqlexample
CREATE TABLE ORDERS (
  record_metadata VARIANT,
  your_top_level_field_name VARIANT
);
```

To read more about the HoistField$Value transformation, see the [Kafka documentation](https://kafka.apache.org/39/documentation.html#connect_transforms).

> **Warning:**
>
> Saving the entire Kafka message content and metadata to a table can negatively impact your ingestion cost, pipeline speed and latency. If you need the best possible performace, consider saving only the data you need if it is accessible from the top-level of the Kafka record content, or use SMT transformations to extract the data from deeply nested fields to top-level fields.

### Handling streaming channel errors and dead-letter queues

The connector inspects the Snowpipe Streaming channel status before committing offsets in Kafka. If the connector detects that the `rowsErrorCount` property on channel has increased since the connector was started, it raises a fatal error (`ERROR_5030`) when `errors.tolerance=none` so that data issues don’t go unnoticed. To allow ingestion to continue while triaging bad rows, set `errors.tolerance=all`

```properties
errors.tolerance=all
```

## Schema evolution

For tables with `ENABLE_SCHEMA_EVOLUTION=TRUE`, the connector automatically evolves their schema, based on the incoming Kafka records. All connector created tables default to `ENABLE_SCHEMA_EVOLUTION=TRUE`.

Schema evolution is limited to the following operations:

* Adding new columns. The connector will add new columns to the table if the incoming Kafka records contain new fields that are not present in the table.
* Dropping the NOT NULL constraint from columns that are missing data in the inserted records

## Using the connector with Apache Iceberg™ tables

The connector can ingest data into a Snowflake-managed Apache Iceberg™ tables but must meeting the following requirements:

* You must have been granted the USAGE privilege on the external volume associated with your Apache Iceberg™ table.
* You must create an Apache Iceberg™ table before running the connector.

### Grant usage on an external volume

To grant USAGE privilege on the external volume associated with your Apache Iceberg™ table to your role for the Kafka connector, run the following statement:

For example, if your Iceberg table uses the `kafka_external_volume` external volume
and the connector uses the role `kafka_connector_role`, run the following statement:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT USAGE ON EXTERNAL VOLUME kafka_external_volume TO ROLE kafka_connector_role;
```

### Create an Apache Iceberg™ table for ingestion

The connector does not create Iceberg tables automatically and does not support schema evolution.
Before you run the connector, you must create an Iceberg table manually.

When you create an Iceberg table, you can use Iceberg data types (including VARIANT) or [compatible Snowflake types](../../user-guide/tables-iceberg-data-types.md).

For example, consider the following message:

```sqljson
{
    "id": 1,
    "name": "Steve",
    "body_temperature": 36.6,
    "approved_coffee_types": ["Espresso", "Doppio", "Ristretto", "Lungo"],
    "animals_possessed":
    {
        "dogs": true,
        "cats": false
    },
    "options":
    {
        "can_walk": true,
        "can_talk": false
    },
    "date_added": "2024-10-15"
}
```

To create an Iceberg table for the example message, use one of the following statements:

> ```sqlexample
> CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
>     id number(38,0),
>     name varchar,
>     body_temperature number(4,2),
>     approved_coffee_types array(varchar),
>     animals_possessed variant,
>     options object(can_walk boolean, can_talk boolean),
>     date_added date
>   )
>   EXTERNAL_VOLUME = 'my_volume'
>   CATALOG = 'SNOWFLAKE'
>   BASE_LOCATION = 'my_location/my_iceberg_table'
>   ICEBERG_VERSION = 3;
> ```
>
> ```sqlexample
> CREATE OR REPLACE ICEBERG TABLE my_iceberg_table (
>     id INT,
>     name string,
>     body_temperature float,
>     approved_coffee_types array(string),
>     animals_possessed variant,
>     date_added date,
>     options object(can_walk boolean, can_talk boolean),
>     )
>   EXTERNAL_VOLUME = 'my_volume'
>   CATALOG = 'SNOWFLAKE'
>   BASE_LOCATION = 'my_location/my_iceberg_table'
>   ICEBERG_VERSION = 3;
> ```

> **Note:**
>
> Field names inside nested structures such as `dogs` or `cats` are case sensitive.

## Next steps

[Set up tasks](setup-tasks.md).

## Collaboration & Marketplace

Snowflake Marketplace, data sharing, listings, and collaboration features.

---
title: About accessing and consuming listings in VPS
source: https://docs.snowflake.com/en/collaboration/virtual-private-snowflake/vps-collaboration-for-consumers.md
section: Collaboration & Marketplace
---

# About accessing and consuming listings in VPS

By using private listings in your Virtual Private Snowflake (VPS) environment,
you are unlocking access to valuable data tailored to your specific needs. Here,
we’ll explore how to efficiently locate relevant listings that match your interests,
request access from providers, and discover how to find these listings once they’ve
been shared with you in the **Private Sharing** section of Snowsight. By understanding
these processes, you’ll be equipped to access exclusive data and collaborate seamlessly
with trusted partners.

Be sure to read [About collaboration in VPS environments](about-vps-collaboration.md)
to understand and enable collaborating with private listings before proceeding
with this topic.

## Enable the consumption of VPS private listings

If your organization uses Virtual Private Snowflake (VPS) and wants access to
a data product provided as a private listing, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to request
that the provider be allowed to share private listings with your VPS environment.

The organization administrator needs to sign any terms and disclaimers that apply,
as described in [Enable VPS collaboration with other organizations](vps-enable-collaboration.md).
If this isn’t done, you won’t be able to access a listing.

### Access the data in a private listing

After your organization administrator allows your selected provider to share private listings
with your VPS environment, you should be able access private listings from the provider by
using the Private Sharing area of Data Products in Snowsight.

1. Open [Private Sharing](https://app.snowflake.com/#/data/shared). Or, if you are already signed in to Snowsight,
   in the navigation menu, select Data sharing » Internal sharing.
2. If you are asked to sign any terms and disclaimers, refer your organization administrator
   to [Enable VPS collaboration with other organizations](vps-enable-collaboration.md).
3. Select the Shared with You tab and locate the listing you want to access.
4. If you want to view the listing page, select the listing title. This page provides information
   about the listing, including the data objects that you can use to query the data and contact
   information for the provider. You can go back in the browser to return to the search results.
5. To open a listing and begin using the data, click the button near the listing. The text on buttons
   that open listings can vary. Here are a couple of examples:

   > * Use the **Get** button to make the data available to your Snowflake region.
   > * Use the **Open in Worksheet** icon to make the data open as a query in a worksheet.

At this point you should be ready to start using the data from the VPS Private listing in your
environment. If you ever have questions, just open the listing page again for more information
or contact the provider for help.

After you’ve opened the listing one time and you know the names of the objects, you can query the
data without returning to the listing page.

If you encounter any difficulty locating or accessing a listing, first check with your organization
administrator to ensure that the provider is already allowed to share listings with your VPS. Ask
the administrator if you have any necessary privileges. If you seem to have everything you need,
contact the provider to confirm that they have shared the listing with you and to ask them for more
help. If all seems to be in order, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) for further assistance.

### Discovering private listings

For listings that aren’t publicly visible, finding a private listing might seem daunting. Here
are some ways you might find a private listing that suits your needs:

* Use your business relationships, networks, or partnerships to find trusted entities who provide data
  as a service on Snowflake.
* Look for advertisements from providers who use targeted marketing. They might invite you to webinars
  or private meetings where you can ask questions and learn more about what they can provide.
* When in VPS, you might not have access to the Snowflake Marketplace. However, you can browse data products
  if you have access to the marketplace website at <https://app.snowflake.com/marketplace>. Some providers
  might advertise their data products there that can be customized for you as VPS Private Listings.

Keep in mind that providers can create a private listing as part of a customized proposal just for you,
bundling it with other services or products tailored to your specific needs. If you learn of a
listing that’s of interest to them, you can formally request access to that individual provider’s listing.
Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you are unable to find the provider’s contact information.

---
title: About collaboration in VPS environments
source: https://docs.snowflake.com/en/collaboration/virtual-private-snowflake/about-vps-collaboration.md
section: Collaboration & Marketplace
---

# About collaboration in VPS environments

Virtual Private Snowflake (VPS) offers the highest level of security by completely isolating your data and
resources. VPS was specifically created for organizations requiring the highest levels of security,
such as financial institutions and enterprises handling sensitive data. With such focus on security
and isolation, VPS deployments handle information collaboration differently from other Snowflake deployments.
Despite the restraints required by this isolation, VPS customers can still share data with other Snowflake users,
allowing both cooperation and data security.

## How collaboration works with VPS

Collaborative data, offered by a *data provider* and accessed by a *data consumer*, can be made
available for collaboration even if one or both parties use VPS. Snowflake Support is always
involved in this process to ensure that all security protocols are strictly followed.
The VPS environment provides strong and flexible options for securely sharing data,
maintaining the highest levels of security and compliance across all data-sharing activities.

## About VPS private listings

Private listings are shared from one provider to one consumer only. Either the provider or the
consumer, or both, use VPS. Discovery of listings can be restricted to specific users or organizations chosen by
the data provider. The VPS consumer also restricts which providers (and listings) are visible to them.
After all the processes described in this guide
are complete and reviewed by Snowflake Support, the end result is that the VPS user can
query the data through a private listing.

If you’re a data consumer in search of listings, see [Finding consumers for private listings](vps-collaboration-for-providers.md).

If you’re a data provider, find more information on locating consumers who might be interested in
your listing in [Finding consumers for private listings](vps-collaboration-for-providers.md).

## Data Collaboration: Access without Duplication

Snowflake uses a process called cross-cloud auto-fulfillment to make the data locally available
without requiring any inbound data transfer into the consumer’s Snowflake account.
Powered by Snowgrid™, Snowflake’s cross-cloud technology, auto-fulfillment connects data across
different regions and cloud providers. Snowgrid creates the seamless global network that keeps data
securely within the provider’s environment while allowing authorized users to access it from anywhere,
without duplication or movement.

For private listings, Snowgrid isolates the data, making it available exclusively to specific
consumers through a dedicated connection. This process ensures that the data remains secure and
isolated while still providing immediate, controlled access.

These private listings only become available after the provider and consumer have officially
arranged to share the data, a process described in this guide. The provider’s catalog of VPS-eligible
listings then becomes discoverable to the VPS consumer. However, these listings are not
accessible unless there is at least one active consumer.

Just as VPS environments are isolated and have globally unique identifiers, each private listing is
also isolated and has a globally unique identifier (called a *Universal Listing Locator* or *ULL*). The
ULL is often casually referred to as a *listing name*, but for private listings the listing name
is unique to the collaboration between one provider and one consumer.

### Provision of data products

VPS users can publish data to a listing if and only if this action is
explicitly allowed and correctly configured.

> **Note:**
>
> **Snowflake Support is always involved in this process to
> ensure that all security protocols are strictly followed.**

Private listings are visible within a dedicated, secure user interface, accessible exclusively
by other participating users. If those users are also in a VPS environment, they must also have
authorized this interaction.

These measures ensure that data remains isolated and secure within VPS environments and that
no data leaves or enters the VPS without strict procedures to ensure compliance and security.

> **Note:**
>
> Currently in preview, VPS providers can publish free and limited trial listings on the Snowflake Marketplace. For more information, see [As a VPS provider, create a listing in Snowflake Marketplace](../provider-listings-creating-publishing.md)

### Consumption of data products

VPS users can see and consume listings that are shared by a controlled list
of vetted data providers. The providers and their data products must be specifically
enabled for the consumers VPS, regardless of whether the provider is in a VPS or not.
To find privately shared data products, VPS users use a separate, isolated user interface
called *Private Sharing*.

This separate interface is designed to uphold the strict isolation requirements of VPS environments,
ensuring that all data interactions comply with VPS compliance and security standards. View and install
listings that are shared with you by visiting <https://app.snowflake.com/pm/pm_aws_us_west_2/#/data/shared>.
Or, if you are already signed in, in the navigation menu, select Data sharing » Internal sharing.

VPS environments have tightly controlled interactions with any external connection,
utilizing a dedicated isolated user interface (UI) that ensures the strict security and isolation
of the VPS environment. While VPS users can publish and consume data in “listings”, these actions
require specific configurations; and only listings that are explicitly enabled (allowed) for
VPS access are publishable or accessible from within this secure environment.

## Limitations

## Limitations on collaborating with Virtual Private Snowflake (VPS)

The following limitations apply to collaboration support for Virtual Private Snowflake (VPS):

* Listings that use manual fulfillment are not supported.
* Listings that use Snowflake connectors are not supported.

---
title: About committed capacity and Snowflake Marketplace Capacity Drawdown
source: https://docs.snowflake.com/en/collaboration/marketplace-capacity-drawdown.md
section: Collaboration & Marketplace
---

# About committed capacity and Snowflake Marketplace Capacity Drawdown

Capacity commitment is part of a contract that lets customers prepay for expected use of Snowflake resources.
With the Snowflake Marketplace Capacity Drawdown (MCD) program, a portion of your committed capacity is reserved for
Snowflake Marketplace purchases. The MCD program makes it easier to access data products and services from providers, and it centralizes billing through the Snowflake Marketplace.

## MCD enrollment

To enroll in the Snowflake Marketplace Capacity Drawdown (MCD) program, your organization must have or create a
committed capacity contract with Snowflake and your business (based on billing and shipping addresses) must be in an area where MCD is available. For a list of supported locations, see MCD limitations.

Consumers must be enrolled in the MCD program to use MCD resources for purchases. However, the provider of the
listing doesn’t need to be enrolled.

If you’re interested in enrolling in the MCD program, contact your Snowflake Account Executive. If your organization is not in a location where the MCD program is supported, contact your Snowflake Account Executive and let them know that you’re interested in joining the program. Although they cannot make exceptions to these restrictions, Snowflake Marketplace Operations maintains a list of requests for future areas of development.

## MCD limitations

The following limitations apply to MCD program listings and payments:

* Listings paid off-platform are not MCD-compatible.
* If MCD is available for a listing, the MCD payment method displays on the purchasing page.
* A provider does not need to enroll in the MCD program, but they must be located in one of the following geographic areas where MCD is available to accept MCD payments:

> * Japan (providers and consumers must be located in Japan and have Japanese billing and shipping addresses)
> * Mexico (by Private Preview only)
> * Switzerland (by Private Preview only)
> * United Kingdom (by Private Preview only)
> * United States (excluding Florida)

The following limitations apply to MCD program enrollment:

* Not available to international consumers outside Japan, Mexico, the United Kingdom, or Switzerland, where it’s in private preview.
* Not available to consumers in Florida.
* Not available to commercial resellers or consumers who purchase Snowflake Capacity through a commercial reseller.
* Not supported in Snowflake Government Regions.
* Not available for Snowflake contracts made through the Google Cloud Marketplace.
* Not compatible with Snowflake Priority Support.
* Not available for consumers with monthly billing frequency.
* Not available to consumers using On Demand accounts

### Commercial resellers

Snowflake supports commercial resellers through the Snowflake Partner Network (SPN) Reseller Program. However, the
MCD program is not available to resellers. If this restriction applies to you, you can still purchase listings
using other payment methods. Committed capacity will still cover the applicable consumption of data products.

---
title: About listings
source: https://docs.snowflake.com/en/collaboration/collaboration-listings-about.md
section: Collaboration & Marketplace
---

# About listings

With listings, you can provide data and other information to other Snowflake users, and you can access data and other information shared by Snowflake providers.

You can explore, access, and provide listings to consumers privately and on the Snowflake Marketplace. To learn more about the Snowflake Marketplace, see [About Snowflake Marketplace](collaboration-marketplace-about.md).

## What is a listing?

A listing is an enhanced method of [Secure Data Sharing](../user-guide/data-sharing-intro.md) and uses the same
[provider and consumer model](../user-guide/data-sharing-intro.md).

As a provider, you can share a Snowflake Native App or data in your Snowflake account by creating and publishing a listing to specific Snowflake
accounts or on the Snowflake Marketplace. To get started, see [Use listings as a provider](provider-becoming.md).

As a consumer, you can access a Snowflake Native App or data shared by other Snowflake accounts on the Snowflake Marketplace or privately with your
account using a listing. To get started, see [Use listings as a consumer](consumer-becoming.md).

Listings add capabilities to Secure Data Sharing such as the following:

* Offer a share publicly on the Snowflake Marketplace.
* Charge consumers for access to the data in the share.
* Monitor interest in your listing and usage of the data in the share.
* Provide metadata about the share, such as a title, description, sample SQL queries, and information about the data provider.

For more details about listings compared with other types of sharing at Snowflake, see
[Overview of Data Sharing at Snowflake](../guides-overview-sharing.md).

You can explore listings and providers on the Snowflake Marketplace through [Snowsight](../user-guide/ui-snowsight-gs.md). See [About Snowflake Marketplace](collaboration-marketplace-about.md).

> **Note:**
>
> To use listings and the Snowflake Marketplace, you need to agree to additional terms. See [Legal requirements for providers and consumers of listings](collaboration-listings-legal.md).

When you offer data and apps to consumers, you choose how to make your data product available to consumers and how consumers can access your data product. A data product is the share or the app attached to your listing.

## Listing availability options

When you offer a listing, you choose how to make your data product available to consumers:

* **Privately**, available only to specific consumers. Private listings let you take advantage of the capabilities of
  listings to share data and other information directly with other Snowflake accounts in any Snowflake region.
* **Publicly**, visible on the Snowflake Marketplace. You can offer listings on the Snowflake Marketplace to market
  your data product across the Snowflake Data Cloud. Offering a listing on the Snowflake Marketplace lets you share curated data offerings with
  many consumers simultaneously, rather than maintaining sharing relationships with each individual consumer.

  See [About Snowflake Marketplace](collaboration-marketplace-about.md) for more about publishing on the Snowflake Marketplace.

## Listing access options

When you offer a listing, you choose how consumers can access your data product:

* Free access to your full data product, with no payment required.
* Limited trial access to your data product, with unlimited access to the full data product available upon request.
* Paid access to your data product, using the pricing models offered by Snowflake.

### Free listings

A free listing is available privately to specific consumers, or publicly on the Snowflake Marketplace, and provides instant access to a
full published dataset.

When published on the Snowflake Marketplace, this type of listing is best for providing generic, aggregated, or non-customer-specific data. When
shared privately with specific consumers, you can use this type of listing to provide data products to existing business partners at no
cost or according to negotiated payment terms.

For more information about creating free listings, see [Create and publish a listing](provider-listings-creating-publishing.md).

### Limited trial listings

A limited trial listing is available on the Snowflake Marketplace and provides instant limited access to a data product.

A provider can choose whether to offer a subset of data as part of the trial data product, or make the full product available for a short
period of time, or something else. Providers can set the availability period for limited trial listings from 1 to 90 days.

Consumers can trial the data product attached to the limited trial listing and request unlimited access to your data product.
A provider can then choose who to offer the full data product to and whether (or how much) to charge for the data product.
For example, in response to a request you might offer:

* A free private listing to a consumer with whom you have an existing business relationship or with whom you have negotiated payment terms.
* A paid private listing to a consumer, using one of the [pricing models](provider-listings-pricing-model.md) offered
  by Snowflake.

Limited trial listings let providers make a data product visible to and free to try by anyone on the Snowflake Marketplace, but fully available
only to consumers that they choose to do business with. This type of listing is best for providing customer-specific data, or for cases
when you want to allow only certain consumers to purchase your data product due to licensing agreements, regulatory requirements, or other
commercial reasons.

For guidance preparing to offer your data product as a limited trial, see [Prepare to offer a limited trial listing](provider-listings-preparing.md).

### Paid listings

A paid listing is available privately or on the Snowflake Marketplace. As a provider, you can create paid listings to charge consumers to access
or use your listing.

Paid listings are only available to consumers in specific regions, and from providers in specific regions.

* For more information about becoming a provider of paid listings, see [Provide paid listings](provider-becoming.md).
* For more information about paying for listings as a consumer, see [Pay for listings](consumer-listings-paying.md).
* For more information about the pricing models you can use as a provider, see [Paid listings pricing models](provider-listings-pricing-model.md).

Paid listings are best for data products that offer proprietary or industry-specific data, or insights and analytics performed on
freely available data. This type of listing also offers consumers the ability to try and buy a data product with unified procurement
through Snowflake.

## Pricing plans and offers

### Pricing plans

Pricing plans allow providers to offer multiple stock keeping units (SKUs) for a single paid listing. With pricing plans, providers don’t have to create a listing for every SKU that they offer to consumers. Instead, after creating a pricing plan, providers create offers that are extended to consumers.

Pricing plans and offers simplify listing monetization and management. An offer provides consumers with individualized billing, payment terms, payment schedules, and contract start and end dates. Consumers can review an offer before committing, and an offer can be quickly accepted or rejected.

> **Note:**
>
> Pricing plans and offers are not available for organizational listings. Organizational listings focus on secure data sharing within an organization, allowing teams to access and utilize internal data products without the complexities of pricing models or offers.

### Offers

Offers define the purchase terms for a listing. Offers are specific to each consumer and provide individualized billing, payment terms, payment schedules, and contract start and end dates. After a consumer receives an offer from a listing provider, the consumer can review the terms and then accept or reject the offer.

Consumers can review offers in Snowsight on the Data sharing » External sharing page.

### Limitations for listings that include pricing plans and offers

* Providers can’t convert a listing to a new type (for example, from a limited trial listing to a paid listing).
* Consumers can’t convert a Snowflake Native App from one listing type to another (for example, from a private listing to a paid listing).

## V1 vs. V2 listings

When working with listings in Snowflake, it’s important to understand the distinctions between Version 1 (V1) and Version 2 (V2) listings. These versions differ significantly in their manifest formats, targeting capabilities, feature sets, and compatibility requirements.

### V1 listings

V1 listings are the original format for listings in Snowflake and are compatible with all Snowflake accounts that support listings. They support basic listing functionalities, including private and public sharing, but lack advanced features such as pricing plans and offers. In the [listing manifest](../progaccess/listing-manifest-reference.md), V1 listings use a `targets` field, and the listing targets are specified by individual account names. For example:

```yaml
...
targets:
  accounts: ["Org1.Account1", "Org2.Account2"]
...
```

### V2 listings

V2 listings introduce a new manifest format that provides enhanced targeting capabilities, allowing providers to specify a wider range of targeting options, including organizations, accounts with specific roles, locations, and organization-level groups.

In the [listing manifest](../progaccess/listing-manifest-reference.md), V2 listings allow users to specify `external_targets` and `locations`. For example:

```yaml
...
external_targets:
  access:
    - organization: OrgName2
      accounts: [acc1, acc2]
    - account: acc2
      roles: [role1, role2]
locations:
  access_regions:
    - name: "PUBLIC.AWS_US_WEST_2"
...
```

---
title: About providing VPS Private Listings
source: https://docs.snowflake.com/en/collaboration/virtual-private-snowflake/vps-collaboration-for-providers.md
section: Collaboration & Marketplace
---

# About providing VPS Private Listings

As a provider of VPS private listings on Snowflake, it’s essential to understand how
to efficiently manage and share your data with consumers. This section guides
you through steps to create a private listing, locate and respond to consumer
requests for access, and securely share your data with them. This process not only
ensures that your data remains protected and accessible only to trusted partners
but also streamlines the collaboration experience by leveraging Snowflake’s powerful
sharing capabilities. Whether you’re sharing data with a few trusted organizations
or managing multiple requests, you’ll find the tools and strategies you need to
successfully manage private listings and enhance your data-sharing workflows.

## Enable provisioning of private listings for your consumers

When you are ready to share private listings with a consumer, and you or your
new consumer uses Virtual Private Snowflake (VPS), contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support)
to enable the provider/consumer relationship through private listings
as described in this section.

1. Contact the consumer to collect the consumer’s Organization Name and Account Identifier.
   For details on how to locate this information, see [Finding the organization and account name for an account](../../user-guide/admin-account-identifier.md).
2. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) and ask for VPS Provider Sharing to be enabled between you
   and your new consumer. Include the following information:

   * Your VPS deployment name and account identifier.
   * The consumer’s account identifiers.

### Create or Manage a VPS Private Listing by using Provider Studio

When you are set up to provide listings, create a private listing for the consumer.
Provider Studio is the web interface that you use to create and otherwise work with
private listing. This section describes how to use it to create a private listing.

> **Note:**
>
> Before you create the listing, your data product must already exist.
>
> If you have a direct share that needs to be converted to a listing,
> see [Create a new listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).

The organization administrator should have already signed any terms and disclaimers that apply,
as described in [Enable VPS collaboration with other organizations](vps-enable-collaboration.md).
If this isn’t done, you won’t be able to create a listing.

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. To add a listing, select Create Listing » Specified consumers.
4. Enter a descriptive title for your listing. It doesn’t have to be a unique title.
5. Review and/or edit the SQL listing name.

   > **Note:**
   >
   > You can’t change the SQL listing name after the listing is published.
6. Select the Product type button, then select + Select to select the objects to attach to the listing.
7. In the Access type drop-down, select Free.
8. (Optional) If you have multiple provider profiles, select which provider profile to publish this listing as.
   If you don’t select a provider profile, your organization and account name are used.
9. In the Who can access section, add the [organization and account names](../../user-guide/admin-account-identifier.md) for the consumers that you
   want to share the listing with.
10. Enter a description for your listing.
11. (Optional) Provide data dictionary information for your listing. For more information, see [Set up a data dictionary for your listing](../provider-listings-reference.md).
12. (Optional) Provide up to six business needs for your listing. For more information, see [Business needs](../provider-listings-reference.md).
13. (Optional) Provide a sample SQL query or a notebook that demonstrates how to use the data product. For more information, see
    [Attach a notebook to a Snowflake Marketplace listing](../provider-listings-creating-publishing.md).
14. Add legal terms for your listing.
15. (Optional) In the Attributes section, add custom attributes to your listing. For more information, see [Data product - attributes](../provider-listings-reference.md).
16. Select Publish to publish the listing to the selected consumers, or select Save Draft to save it as a draft.
    If you exit without saving, a draft is saved automatically.

When you publish the listing to the consumer, the consumer is notified that you have shared the listing with them.

To manage a listing you’ve already shared, you can use either [Private Sharing](https://app.snowflake.com/#/data/shared) or [Provider Studio](https://app.snowflake.com/#/provider-studio).

## Limitations

## Limitations on collaborating with Virtual Private Snowflake (VPS)

The following limitations apply to collaboration support for Virtual Private Snowflake (VPS):

* Listings that use manual fulfillment are not supported.
* Listings that use Snowflake connectors are not supported.

## Finding consumers for private listings

For listings that aren’t publicly visible, attracting a consumer for your data products,
especially those in a VPS environment, typically involves a more direct and targeted approach.

Here are some ways providers might bring a private listing to the attention of potential consumers:

* The provider can directly invite specific customers or partners to access the listing. This is common
  in scenarios where the provider has identified potential customers who would benefit from the data.
* The provider can leverage existing business relationships, networks, or partnerships to offer the
  private listing to trusted entities.
* Some providers use targeted marketing efforts, contacting potential customers to offer webinars or
  private meetings.
* Satisfied customers or partners often refer other businesses or contacts to the provider, who can
  then extend the offer to these new prospects.
* Many providers might offer the private listing as part of a customized proposal for clients, bundling
  it with other services or products tailored to the client’s specific needs.
* For providers who are able to do so, they can create a listing in the public marketplace just
  to advertise the availability of data. Anyone can browse data products available in Snowflake
  Marketplace, if they have access to this website: <https://app.snowflake.com/marketplace>.

In essence, the visibility of the private listing is managed through more controlled and direct
communication, ensuring that only the preferred audience is aware of it. If a VPS user learns of a
listing that’s of interest to them, they can formally request access to
that individual provider’s listing if they can see it. Otherwise, they can contact their account
representative to inquire about listings that meet their needs or request access to them.

---
title: About Snowflake Marketplace
source: https://docs.snowflake.com/en/collaboration/collaboration-marketplace-about.md
section: Collaboration & Marketplace
---

# About Snowflake Marketplace

The [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace) is where you can explore, access, and provide listings to consumers. You can also use the Snowflake Marketplace to discover and access third-party data and services, as well as market your own data products across the Snowflake Data Cloud.

As a data provider, you can use listings on the Snowflake Marketplace to share curated data offerings with many consumers simultaneously, rather than maintain sharing relationships with each individual consumer. With [Paid listings](collaboration-listings-about.md), you can also charge for your data products.

As a consumer, you might use the data provided on the Snowflake Marketplace to explore and access the following:

* Historical data for research, forecasting, and machine learning.
* Up-to-date streaming data, such as current weather and traffic conditions.
* Specialized identity data for understanding subscribers and audience targets.
* New insights from unexpected sources of data.

The Snowflake Marketplace is available globally to all Snowflake accounts hosted on Amazon Web Services, Google Cloud, and Microsoft Azure, with the exception of Microsoft Azure Government. Support for Microsoft Azure Government is planned.

> **Note:**
>
> If you’re using private connectivity to access the Snowflake Marketplace through [Snowsight](../user-guide/ui-snowsight-gs.md), you must first create a CNAME
> record, as described in the Snowflake documentation:
>
> * [AWS PrivateLink and Snowflake](../user-guide/admin-security-privatelink.md)
> * [Azure Private Link and Snowflake](../user-guide/privatelink-azure.md)
> * [Google Cloud Private Service Connect and Snowflake](../user-guide/private-service-connect-google.md)

## What can I do in the Snowflake Marketplace?

After you join the Snowflake Marketplace, you can do the following:

* As a provider, you can do the following:

  + Publish listings for free-to-use datasets to generate interest and new opportunities among the Snowflake customer base.
  + Publish listings with samples of datasets that can be provided on request or customized for a specific consumer.
  + Share live datasets securely and in real-time without creating copies of the data or imposing data integration tasks on the consumer.
  + (Preview) Share public listings in Virtual Private Snowflake (VPS) deployments.
  + Eliminate the costs of building and maintaining APIs and data pipelines to deliver data to customers.

  For more information, see [Use listings as a provider](provider-becoming.md) and [Create and publish a listing](provider-listings-creating-publishing.md).
* As a consumer, you can do the following:

  + Discover and test third-party data sources.
  + Receive frictionless access to raw data products from vendors.
  + Combine new datasets with your existing data in Snowflake to derive new business insights.
  + Have datasets available instantly and updated continually for users.
  + Eliminate the costs of building and maintaining various APIs and data pipelines to load and update data.
  + Use the business intelligence (BI) tools of your choice.

  For more information, see [Use listings as a consumer](consumer-becoming.md) and [Explore listings](consumer-listings-exploring.md).

## Snowflake Marketplace version 2 listings in VPS deployments

Providers can create [version 2 (V2) listings](collaboration-listings-about.md) (preview) in Snowflake Marketplace and offer those to specified consumers in VPS deployments using region groupings.

Available region groupings for VPS deployments include the following:

* AWS_US_EAST_1 (“US East (N. Virginia)”)
* AWS_US_EAST_2 (“US East (Ohio)”)
* AWS_US_WEST_2 (“US West (Oregon)”)
* AWS_EU_WEST_1 (“EU (Ireland)”)
* AWS_EU_WEST_2 (“EU (London)”)
* AZURE_EASTUS2 (“East US 2 (Virginia)”)
* AZURE_CENTRALUS (“Central US (Iowa)”)

> **Note:**
>
> Providers must add support to handle V2 listings (currently in preview) in any of their scripts before targeting region groups in a listing.

For more information on creating Snowflake Marketplace listings in VPS deployments, refer to the following topics:

* [As a Snowflake Marketplace provider, create a listing in a Virtual Private Snowflake (VPS) deployment](provider-listings-creating-publishing.md)
* [As a VPS provider, create a listing in Snowflake Marketplace](provider-listings-creating-publishing.md)

### Opt in to consume Snowflake Marketplace listings in a VPS deployment

To consume Snowflake Marketplace listings in a VPS deployment, consumers must first opt in to the feature. To opt in, contact [Create Support cases](../user-guide/ui-support.md). Enabling this feature can take from 1 to 3 business days.

After this feature is enabled on consumer accounts, consumers can access listings that have been shared with them. For more information, see [About accessing and consuming listings in VPS](virtual-private-snowflake/vps-collaboration-for-consumers.md).

### Limitations

* VPS providers can’t create monetized listings.
* VPS customers (providers and consumers) are only identified in the [LISTING_TELEMETRY_DAILY view](../sql-reference/data-sharing-usage/listing-telemetry-daily.md) when the EVENT_TYPE is GET, REQUEST, or UNINSTALL.

  In these cases, the `region_group` field will be populated.

  Otherwise, such as when EVENT_TYPE is LISTING_CLICK or LISTING_VIEW, the `region_group` field will be NULL.
* VPS customers (providers and consumers) can only consume app listings for allowlisted connector listings. The list of allowedlisted connectors includes the following:

  + [Snowflake Connector for Google Analytics Aggregate Data](../connectors/google/gaad/gaad-connector-about.md)
  + [Snowflake Connector for Google Analytics Raw Data](../connectors/google/gard/gard-connector-about.md)
  + [Snowflake Connector for ServiceNow®](../connectors/servicenow/about.md)

---
title: Access and install listings as a consumer
source: https://docs.snowflake.com/en/collaboration/consumer-listings-access.md
section: Collaboration & Marketplace
---

# Access and install listings as a consumer

Whether you explored listings on the Snowflake Marketplace and found a listing that you want to access or purchase, or a provider shared a
private listing with you, this topic guides you through accessing and installing listings as a consumer.

## Access a private listing

To access a private listing that was shared with you, do the following:

> **Note:**
>
> You must use the ACCOUNTADMIN role or another role with the CREATE DATABASE and IMPORT SHARE privileges to access a listing.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. On the Shared with you page under Privately shared listings, select the listing you want to access.
4. Select Get.

## Accessing listings on the Snowflake Marketplace

After you explore listings on the Snowflake Marketplace, access the listings you’re interested in.

> **Note:**
>
> You must use the ACCOUNTADMIN role or another role with the CREATE DATABASE and IMPORT SHARE privileges to access and install a listing.

You can access free and paid listings on the Snowflake Marketplace. To access a free listing on the Snowflake Marketplace, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search or browse to the listing you want to access.
4. Select Get to access a listing already available in your region. A dialog opens with details about the listing. If you have to
   request the listing to be replicated to your region, select Request.
5. (Optional) Specify a database name for the data in the listing.
6. (Optional) Add roles to grant access to the database created from the listing.
7. Select Get.
8. In the confirmation dialog that appears, select Open to open a [Snowsight worksheet](../user-guide/ui-snowsight-worksheets.md) with an example query in a new tab, or
   select Done.

To access a paid listing on the Snowflake Marketplace, see Accessing paid listings.

## Accessing paid listings

You can access paid listings that are privately shared with you, called private listings, or paid listings on the Snowflake Marketplace.
Before you access a paid listing, you can trial it first. See [Trial a listing](consumer-listings-exploring.md).

> **Note:**
>
> You must use the ACCOUNTADMIN role or another role with the CREATE DATABASE and IMPORT SHARE privileges to access a listing.
> To purchase a paid listing, your role must also have the PURCHASE DATA EXCHANGE LISTING privilege.
>
> You must also set up your account to pay for listings. See [Pay for listings](consumer-listings-paying.md).

After setting up your account to pay for listings, do the following to access a paid listing:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. Search or browse to the listing you want to access.
3. Select Get.
4. Select the Paid option and review details about the price.
5. Select Next.
6. (Optional) Add a purchase order (PO) number to be associated with this listing for billing purposes.
7. (Optional) Specify a database name for the data in the listing. If you trialed the listing, Snowflake uses the database name that you
   specified for the trial.
8. (Optional) Add roles to grant access to the database created from the listing.
9. If you are eligible to [use your Capacity commitment to pay](marketplace-capacity-drawdown.md),
   Pay with Marketplace Capacity Drawdown is selected by default. Optionally disable this selection to pay with a credit card,
   invoice, ACH, wire transfer, or another supported method.
10. Select Buy to purchase the listing.

    > **Note:**
    >
    > If your organization administrator (i.e. an account with or granted the ORGADMIN role) has not previously
    > accepted the [Provider and Consumer Terms](https://www.snowflake.com/legal/snowflake-provider-and-consumer-terms/), the Setup incomplete dialog appears and a reminder email is sent requesting that
    > they review and accept these terms. You cannot get access to the listing until these terms are accepted.
    >
    > If an organization administrator has previously accepted the terms, then you can continue.
11. In the confirmation dialog that appears, select Open to open a [Snowsight worksheet](../user-guide/ui-snowsight-worksheets.md)
    with an example query in a new tab, or select Done.

> **Note:**
>
> If you are not yet eligible to use your Capacity commitment, you can see the date after which your Capacity balance will be used to
> pay for the listing. Snowflake Marketplace invoices received after that date can be paid with your Capacity balance.

### Modify your listing payment method

Changes made to your listing payment method are saved automatically and take effect in the next billing cycle.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select a paid listing, and then select Manage Purchase.
4. Optional. If you participate in the [Marketplace Capacity Drawdown Program](marketplace-capacity-drawdown.md), select Pay with Marketplace Capacity Drawdown.
5. Optional. In the Payment method area, select Show more and then select an alternate payment method.

## Access a limited trial listing on the Snowflake Marketplace

Limited trial listings let providers offer a time-limited or functionality-limited trial of a data product, with unlimited access to the
full data product available on request.

The workflow of a limited trial listing works as follows:

* [Trial the data product](consumer-listings-exploring.md).
* If you want, request unlimited access to the full data product.
* Access the private listing shared by the provider containing the full data product.

### Request a limited trial listing

At any time after you start to [trial a listing](consumer-listings-exploring.md), you can request unlimited access to the full data product.

To request unlimited access to the full data product of a limited trial listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search or browse to the listing you want to request.
4. Select Request.
5. Complete the form that appears with your contact information. If you use an email address from a free email provider, you need to provide
   additional details.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

After you request a limited trial listing, the listing provider is notified and contacts you. You might need to agree to additional
commercial terms or pricing agreements with the provider before the provider fulfills your request for unlimited access to the full data
product.

After the provider fulfills your request, you can access the private listing shared with you. See Access a private listing.

## Limitations for accessing listings from accounts in U.S. government regions

If you access listings from an account in US government region, the following limitations apply:

* You cannot get paid listings.
* You cannot get listings for a Snowflake Native App.
* You cannot get Snowflake connectors.
* You cannot get listings that use manual fulfillment.

Some other listings on the Snowflake Marketplace might be unavailable in your region, but you can contact the provider for more details:

Cause:
:   You cannot access a listing in your region because it uses manual fulfillment instead of auto-fulfillment.

Solution:
:   Contact the provider to determine whether the data product attached to the listing can be offered to you privately.
    Some data products include objects other than the [objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md)
    and therefore cannot be made available in your region.

Cause:
:   The provider chose not to offer the listing in your region.

Solution:
:   Contact the provider to request that they offer the listing in your US government region.

Cause:
:   The listing is paid.

Solution:
:   You cannot get the listing. Customers with accounts in US government regions cannot pay for listings at this time.

---
title: Access Provider Studio
source: https://docs.snowflake.com/en/collaboration/provider-studio-accessing.md
section: Collaboration & Marketplace
---

# Access Provider Studio

Use Provider Studio to work with all aspects of your Marketplace listings.

1. Open [Provider Studio](https://app.snowflake.com/#/provider-studio). If you are already
   signed in to [Snowsight](../user-guide/ui-snowsight-gs.md), in the navigation menu, select Marketplace » Provider Studio.
2. To add a listing, select Create Listing.
3. To view listings, open the Listing tab.

For more information on creating listings, see [Create and publish a listing](provider-listings-creating-publishing.md).

## The Provider Studio landing page

The following table lists the options that are available on the Provider Studio landing page.

|  |  |
| --- | --- |
| Analytics | View detailed insights into how your data products are performing. Explore both overview metrics and detailed analytics to help you track engagement, usage, and the reach of your listings. For more information, see [Create and publish a listing](provider-listings-creating-publishing.md), [Modify published listings](provider-listings-modifying.md), and [Configure listings](provider-listings-reference.md). |
| Listings | View or create listings. Select a listing to view the associated details. For more information, see [Create and publish a listing](provider-listings-creating-publishing.md), [Modify published listings](provider-listings-modifying.md), and [Configure listings](provider-listings-reference.md). |
| Profiles | View or create profiles. To manage the details of a profile, including who can edit it, select a profile from the list. For more information, see [Manage your provider profile](provider-profiles-managing.md). |
| Learn | View resources to help you explore Marketplace capabilities and requirements. To explore Marketplace documentation, see [About listings](collaboration-listings-about.md). |
| Consumer Requests | The Consumer Requests pane displays consumer requests, profile requests, listing requests, fulfillment requests, products awaiting publication, and more. For more information, see [Manage listing requests as a provider](provider-listings-managing.md). |

---
title: Auto-fulfillment costs
source: https://docs.snowflake.com/en/collaboration/provider-understand-cost-auto-fulfillment.md
section: Collaboration & Marketplace
---

# Auto-fulfillment costs

As a provider, you can enable Cross-Cloud Auto-Fulfillment (auto-fulfillment) for a listing to automatically make your data product available in other Snowflake regions.

When you configure auto-fulfillment for your listing, you don’t have to manage
replicating the data. However, you still incur costs associated with transferring and storing your data product in other Snowflake regions
to support consumers of your listing.

Unlike traditional manual database replication, auto-fulfillment doesn’t require a separate account in each region that you
support. Instead, Snowflake creates one secure share area for an organization to manage auto-fulfillment to a region and
associates billing costs with that area. Because of that, the costs associated with auto-fulfillment are
attributed differently when compared to manual
[database replication costs](../user-guide/account-replication-cost.md).

In addition, egress cost optimization can reduce the costs of auto-fulfillment.
For an introduction to egress const optimization, see [Optimizing data transfer costs with Egress Cost Optimizer](provider-listings-auto-fulfillment-eco.md).

## How auto-fulfillment incurs costs

Auto-fulfillment incurs usage costs in the same way that regular usage of Snowflake does:

Compute resources
:   Auto-fulfillment operations use compute resources to copy data and manage the status of the data in the secure share areas in other regions.

    Snowflake Marketplace calculates compute costs for listing auto-fulfillment to VPS regions by using VPS rates. For details on VPS rates, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

Storage resources
:   Databases transferred to secure share areas in other regions incur storage costs.

Data transfer resources (egress)
:   The initial database auto-fulfillment and the subsequent synchronization operations transfer data between regions.
    Cloud providers charge for data transferred from one region to another within their own network or a region in another cloud.

    The data transfer rate is determined by the location of the source account (i.e. the account that stores the primary database)
    and the destination region and cloud. For data transfer pricing, see the [pricing guide](https://www.snowflake.com/resource/the-simple-guide-to-snowflake-pricing/) (on the Snowflake website).

    For more information about data transfer billing, refer to [Understanding data transfer cost](../user-guide/cost-understanding-data-transfer.md).

    Egress costs can often be reduced by enabling Egress Cost Optimization (ECO). For more information see [Optimizing data transfer costs with Egress Cost Optimizer](provider-listings-auto-fulfillment-eco.md).

Attribution to secure share area
:   When you use auto-fulfillment, these usage costs are attributed to one Snowflake-managed secure share area for each region that contains active consumers of your listings. For details about attributing costs, see [View actual costs](provider-listings-auto-fulfillment-monitor-view-costs.md). For details about the components of costs in Snowflake, see [Understanding overall cost](../user-guide/cost-understanding-overall.md).

## Factors that affect auto-fulfillment costs

When you configure auto-fulfillment for your listing, the following factors can affect the cost of fulfilling your
listing to other regions:

Compute Resource Factors
:   Queries run by Snowflake to fulfill your listing contributes to compute resources.
    The refresh frequency that you set affects how frequently these queries run.

Storage Resource Factors
:   The size of the database, the rate at which data is appended and updated, and the rate of change in the database affect
    how much data is auto-fulfilled and stored initially and continuously.

Data Transfer Resource Factors
:   The cloud region that the listing is auto-fulfilled to, and the cloud provider of that region affect the cost of data transfer.
    The more regions that consumers request your listing in, the higher the cost to fulfill those listings, due to the data transfer cost.
    For data transfer pricing, see the [pricing guide](https://www.snowflake.com/resource/the-simple-guide-to-snowflake-pricing/) (on the Snowflake website).

---
title: Auto-fulfillment for listings
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment.md
section: Collaboration & Marketplace
---

# Auto-fulfillment for listings

If you’re a provider, you can use Cross-Cloud Auto-Fulfillment (auto-fulfillment) for a listing to automatically replicate your data product to other Snowflake regions without having to manually replicate data.

When auto-fulfillment is enabled for a listing, Snowflake automatically fulfills your data product to consumer regions as
needed. A data product is any share or application package that is attached to your listing.

By using auto-fulfillment, you can avoid manually replicating your data products and approving requests for your listings,
helping consumers access your listings faster.

> **Note:**
>
> Using Cross-Cloud Auto-Fulfillment in a Snowflake Native App with Snowpark Container Services is only supported on Amazon Web Services (AWS)
> and Microsoft Azure. See
> [Understand limitations in the Snowflake Native App Framework](../developer-guide/native-apps/limitations.md)
> for more information.

## Understanding auto-fulfillment

> **Note:**
>
> Auto-fulfillment isn’t available on trial accounts. Auto-fulfillment is configured on listings, and to offer listings, you must use a full account.

Auto-fulfillment lets you offer a data product in any supported Snowflake region, based on the availability and access options
you select for your listing, without having to manually replicate data.

You can configure and enable auto-fulfillment when a listing is in either draft or published state. When auto-fulfillment
is enabled for a listing, Snowflake automatically fulfills your listing’s product to regions as needed.

How you make your data product available in other regions depends on your data product and how consumers access your listing:

* If your data product is an application package, use auto-fulfillment to make your data product available in other regions.
* If your data product is a share, use auto-fulfillment in most cases:

  + For free or limited trial listings on the Snowflake Marketplace, you can use Cross-Cloud Auto-Fulfillment or
    [manually replicate the data](https://other-docs.snowflake.com/en/collaboration/provider-listings-managing#label-manually-replicate-listing).
  + For paid listings, you use auto-fulfillment.
  + For all listings shared with specific consumer accounts, Snowsight automatically detects whether or not the target account
    is in a different region and enables auto-fulfillment. You cannot manually replicate private listings to other regions.

When you make a data product available in other regions, you incur additional costs.
See [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).

## How auto-fulfillment works

As a provider, when you set up Cross-Cloud Auto-Fulfillment for your listing, Snowflake manages provisioning for a *secure share area* (SSA) and the
auto-fulfillment of your data product to remote regions. The SSA is managed by Snowflake.
If your data product already exists in the remote region, consumers in that region can get the data product instantly.

Each listing has a data product associated with it, whether a share or an application package. That data product contains objects from
one or more databases, as well as application logic for an application package. Exactly when your data product is auto-fulfilled to a remote region depends on how you make your listing available:

* Private listings are auto-fulfilled after the specified consumers get your listing.
* Public listing shared on Snowflake Marketplace are auto-fulfilled after a consumer in the specific region
  gets the listing.

When your data product is auto-fulfilled to a new region for the first time, it’s transferred to an SSA in that region Auto-fulfillment can be configured with SUB_DATABASE or SUB_DATABASE_WITH_REFERENCE_USAGE settings.

* SUB_DATABASE allows selected objects to be available on-demand.
* SUB_DATABASE_WITH_REFERENCE_USAGE provides account-level scheduling for application packages.

> **Note:**
>
> Specifying FULL_DATABASE for the auto-fulfillment refresh type is deprecated.

Multiple listings can use the same database, but the database is only auto-fulfilled once to a new region.

> **Note:**
>
> For Business Critical Edition (BCE), the handling of shared data differs from high-security deployments like VPS.
> While BCE does not require creating a separate SSA for the region, it enforces strict data security and compliance with
> features like Tri-Secret Secure encryption.
>
> For deployments such as Virtual Private Snowflake (VPS) and government-specific Snowflake environments, there is a separate
> secure share area (SSA) for each deployment. This ensures that auto-fulfillment remains compliant with strict security and
> data isolation requirements unique to those environments.

## How auto-fulfillment refreshes data

When you set up auto-fulfillment for your listing, you can configure a refresh interval for your data product.

After the initial auto-fulfillment of your data product to the SSA in a region, changes to your data product are synced from
your account based on the configured data refresh:

| Data refresh type | Description |
| --- | --- |
| Trigger-based data refresh | Providers can use [SYSTEM$TRIGGER_LISTING_REFRESH](../sql-reference/functions/system_trigger_listing_refresh.md) to trigger an on-demand data refresh, ensuring that consumers receive the most current information.  Snowflake recommends using trigger-based data refresh when an upstream extract-transform-load (ETL) pipeline process completes and you want to trigger a replication when the data is ready. For example, if you are a data provider who delivers stock analysis to financial institutions, you can trigger an update to all the analysts with new datasets as soon as they are updated in your upstream ETL pipeline.  **Note:** This feature is only available using SQL. |
| Trigger-based refresh of an application package | If the data product of a listing is an application package, providers can set the [SYSTEM$TRIGGER_LISTING_REFRESH](../sql-reference/functions/system_trigger_listing_refresh.md) to trigger an on-demand refresh of the application package. However, providers must run this function each time the application package needs to be refreshed.  To configure the application package to refresh each time the release directive is modified, use the LISTING_AUTO_REFRESH clause of the [ALTER APPLICATION PACKAGE](../sql-reference/sql/alter-application-package.md) command. |
| Interval-based data refresh | Providers can establish an interval-based data refresh for all consumers of a listing, with time periods ranging from one minute to eight days. Each listing associated with a database operates on the same refresh interval.  Interval-based data refresh configuration is recommended when you require updates at a predefined cadence. For example, providers who refresh datasets weekly can use interval-based refresh to update their database on the same schedule. Each refresh completion triggers the next refresh according to the cadence. See [Set the account-level refresh interval](provider-listings-auto-fulfillment-set-refresh-interval.md) for details.  **Note:** This feature is available using SQL or Provider Studio in Snowsight. |
| Schedule-based data refresh | Providers can establish a timestamp and schedule for data refreshes across all consumers of a listing. Every listing that utilizes a database will adhere to the same refresh schedule.  Scheduled-based data refresh is recommended for use cases where listing updates need to occur at a specific timestamp and schedule. For example, data providers who need to offer a predictable timestamp for when refreshes are available to all consumers.  Interval-based and scheduled-based data refreshes cannot be used simultaneously. If both are set up, one will override the other. For example, if a cron expression is set up for a scheduled refresh that already has a refresh interval, it will be overridden to support scheduled refresh. See [auto_fulfillment](../progaccess/listing-manifest-reference.md) for details.  **Note:** This feature is available using SQL or Provider Studio in Snowsight. |

### Data products as shares vs application packages

When you set up auto-fulfillment for your listing, the data product you offer determines how you set up the data refresh.

* If your data product is a share, set a data refresh when you configure auto-fulfillment for a listing. The data refresh applies to the database associated with the listing. If multiple listings share objects from that database, they share the same data refresh type and schedule/interval.
* If your data product is an application package, set a data refresh at the account level that applies to every application package available from your account.

## Considerations for auto-fulfillment

When you use auto-fulfillment for your listings, consider the following:

* Snowflake supports having multiple databases with the same name. Auto-fulfillment creates a single secure share area (SSA) account in the target region, and the SSA can’t have two databases with the same name. As a result, if you have two or more databases with the same name in the source account, auto-fulfillment will append a unique prefix to the database name to avoid conflicts in the SSA account. For example, imagine the following scenario:

  > + An organization has two accounts, production and dev.
  > + Production and dev each have a database named `AnyCompanyData`.

  Because the destination will always have one SSA account with two databases, auto-fulfillment will append a prefix to the duplicate database name, resulting in two databases: `AnyCompanyData` and `PrefixXXXXX_AnyCompanyData`.
* If you signed up for Snowflake using AWS Marketplace, Google Cloud Marketplace, or Azure Marketplace, you can only create accounts and
  SSAs in those clouds. Fulfilling listings to regions outside of your current cloud service region will fail.
* Depending on the size of your data product, it can take some time for the data product to be available to the consumer.
  The size of your data product can also affect the cost of auto-fulfillment.
  See [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md) for details about cost.
* Object-level mode (SUB_DATABASE) is used by default.
* (Deprecated) If a listing uses objects that are located in a database that’s already in full database mode (FULL_DATABASE), a warning displays in Snowsight and the database remains in full database mode.
* Snowflake compiles the listing auto-fulfillment refresh history and sends emails for failed listing refreshes daily. These messages are sent to the email address specified on the listing.
* If the provider has a tag that includes a masking policy at the *account* level, auto-fulfillment doesn’t take that masking policy into account when auto-fulfilling the data product. For auto-fulfillment, the scope of sharing is at the database, schema, and table level, but not at the account level.
* Auto-fulfillment enforces a 10TB limit on the size of the data product. For more information, refer to the
  [The database is larger than 10 terabytes](provider-listings-auto-fulfillment-troubleshoot-setup.md) troubleshooting topic. After assessing the cost implications, you can contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to increase the size limit.

* If you use [Tri-Secret Secure](../user-guide/security-encryption-tss.md), you must contact
  [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to enable Tri-Secret Secure for the secure share areas used for auto-fulfillment.

  + With Tri-Secret Secure, query results are encrypted using one key from the provider, one from Snowflake, and one from the consumer. Each key independently governs access. If a key is revoked, only its owner loses access. For example, revoking the provider key does not prevent the consumer from accessing data that has already been retrieved.

---
title: Auto-fulfillment objects
source: https://docs.snowflake.com/en/collaboration/provider-understand-auto-fulfillment-objects.md
section: Collaboration & Marketplace
---

# Auto-fulfillment objects

Before continuing, be sure that you understand the objects that are supported for Cross-Cloud Auto-Fulfillment (auto-fulfillment), how objects may depend on account roles, the internal objects that Snowflake creates for auto-fulfillment, and what exactly gets fulfilled by object-level auto-fulfillment.

## Objects supported for auto-fulfillment

The database objects included in *or referenced by* your
listing must contain only objects supported for auto-fulfillment.

Depending on your data product, different objects are supported:

| Object | Share (Database) | Application package |
| --- | --- | --- |
| Table | ✔ | ✔ |
| Open table (Apache Iceberg™, Delta Lake) | ✔ | ✔ |
| View (Regular, aka Non-Secure) | ✔ | ✔ |
| View (Materialized) | ✔ | ✔ |
| View (Secure) | ✔ | ✔ |
| View (Semantic) | ✔ | ✔ |
| Secure view that references data stored in other databases using the REFERENCE_USAGE privilege. | ✔ | ✔ |
| Secure view that references a directory table of an internal stage (not external). For more information, see [Share unstructured data with a secure view](../user-guide/unstructured-data-sharing.md). | ✔ | ✔ |
| Cortex Knowledge Extensions (CKEs) | ✔ |  |
| Cortex Agents | ✔ |  |
| Dynamic Table | ✔ | ✔ (only from the application package) |
| Database Roles | ✔ | ✔ |
| SQL UDF/UDTF (Regular, also known as non-secure) | ✔ | ✔ (when called from shared views in referenced databases) |
| SQL UDF/UDTF (Secure) | ✔ | ✔ (when called from shared views in referenced databases) |
| Stored Procedure (not used by sharing) | ✔ | ✔ |
| Masking and Row Access Policies | ✔ | ✔ |
| Tags | ✔ | ✔ |
| Policies | ✔ | ✔ |
| Tasks (not used by sharing) | ✔ | ✔ |
| Alerts (not used by sharing) | ✔ | ✔ |
| Secrets (not used by sharing) | ✔ | ✔ |

If an object on this list is designated as part of a replication or failover group, then it’s not supported for auto-fulfillment.
See [Introduction to replication and failover across multiple accounts](../user-guide/account-replication-intro.md) for details. If a primary database contains a hybrid table, the refresh operation fails.
For details, see the [Snowflake Community forum](https://community.snowflake.com/s/article/Auto-fulfillment-error-SQL-execution-error-Primary-database-contains-an-entity-of-type-Table-Replication-of-a-database-with-this-entity-type-is-not-supported).

If your data product contains or references objects other than the listed supported objects, you must update your data product.

## Auto-fulfillment for objects that depend on account roles

Auto-fulfillment does not replicate account roles. Instead, objects in SSAs are owned by the ACCOUNTADMIN role.

If your share or application package contains objects that depend on an account role, the object might work differently than you
expect when shared with consumers. For example:

* If you share a secure view that includes data protected by a policy using the [INVOKER_ROLE](../sql-reference/functions/invoker_role.md) context function, the policy
  might evaluate to a different value than in the provider account region because the view owner role is different.
* If you share a secure view where the objects referenced by the view are restricted to an account role, such as a table where
  only the SECURITYADMIN role has SELECT privileges, the view might fail to expand when queried by a user without the SECURITYADMIN role
  in the provider account, but return results when queried by a user without the SECURITYADMIN role in the consumer account.

Instead of using account roles, use database roles. For more information, see [Share data protected by a policy](../user-guide/data-sharing-policy-protected-data.md)
and [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md).

Snowflake Marketplace calculates compute costs for listing auto-fulfillment to VPS regions by using VPS rates. For details on VPS rates, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Internal Snowflake objects created for auto-fulfillment

Snowflake creates the following internal objects to support Cross-Cloud Auto-Fulfillment:

| Object Type | Name |
| --- | --- |
| Roles | SNOWFLAKE$GDS_RL  AUTO_FULFILLMENT_EXECUTOR |
| Database | SNOWFLAKE$GDS |
| Replication groups | Prefixed with `SNOWFLAKE$GDS` |

These internal objects are used to perform tasks for auto-fulfillment, such as to create a secure share area in another region, and
create a database to store objects used for auto-fulfillment, such as fulfillment tasks.

These internal objects appear when you run [SHOW DATABASES](../sql-reference/sql/show-databases.md), [SHOW ROLES](../sql-reference/sql/show-roles.md), or [SHOW REPLICATION GROUPS](../sql-reference/sql/show-replication-groups.md) respectively.
Do not modify these objects or grant them to other users or roles.

### Object-level auto-fulfillment

When you configure object-level auto-fulfillment, SUB_DATABASE is used for supported objects. Objects that are referenced by these
objects must also be supported. For a list of supported objects,
refer to the Objects supported for auto-fulfillment topic on this page.

1. The first consumer in a region gets the listing.
2. Auto-fulfillment transfers the objects in the share to the secure share area.
3. Any consumer that gets the listing gets the data product from the secure share area in their Snowflake region.

## What gets fulfilled by object-level auto-fulfillment

When you use SUB_DATABASE (object-level) auto-fulfillment for your data product, only the objects granted directly to the share or
app, or referenced by an object in your share or app, are auto-fulfilled.

For example:

| Object in data product | What is transferred |
| --- | --- |
| Table in a database and schema | Table |
| Secure view created from a table in the same database | Secure view and table |
| (Deprecated) Table in a database using FULL_DATABASE auto-fulfillment | Entire database |
| Table in a database using SUB_DATABASE auto-fulfillment | Table |
| Application package using SUB_DATABASE_WITH_REFERENCE_USAGE auto-fulfillment | The application package |

---
title: Automatic Data Agents for listings and shares
source: https://docs.snowflake.com/en/collaboration/auto-generated-data-agents.md
section: Collaboration & Marketplace
---

# Automatic Data Agents for listings and shares

Automatic Data Agents instantly generate AI-powered agents and semantic views for your data listings and shares, transforming static data
into intelligent, conversational experiences that enable end users to query your data using natural language, with no technical expertise
required.

## Understand Automatic Data Agents

Traditionally, Snowflake listings and shares require consumers to understand the underlying schema and to write SQL queries to
extract data. Automatic Data Agents address this barrier by analyzing your listing metadata and data schemas to
automatically construct the
following objects:

* [A semantic view](../user-guide/views-semantic/overview.md): A business-friendly data representation compatible with Cortex Analyst.
* [A Cortex Agent](../user-guide/snowflake-cortex/cortex-agents.md): An AI orchestration layer that understands the specific domain and context of your data.

As a provider, this automation significantly reduces the time required to make a listing or share “Cortex AI-ready,” allowing you to offer
conversational data experiences without manual engineering. After these objects are created, all you need to do is attach them to your
listing or share, and then customers and end consumers can easily interact with your data using Cortex AI products and features.

> **Tip:**
>
> When creating Cortex AI-ready listings, add the Cortex AI ready category to your listing. This category makes it easier for consumers to find your listing.

### Key features of Automatic Data Agents

* **One-click generation** automatically creates both the agent and the semantic view objects based on existing metadata and table structures.
* **Table and view selection** lets you choose which tables and views to include when generating the semantic view, giving you control over which data is exposed through the agent.
* **AI-powered semantic modeling** uses [Semantic View Autopilot](../user-guide/views-semantic/autopilot.md) to identify table relationships, metrics, and dimensions.
* **Dynamic agent instructions** generate context-aware personas and orchestration instructions derived from the listing metadata. (For direct shares, static instructions are used.)
* **Integrated testing** allows providers to validate the agent’s responses before publishing to consumers.
* **Seamless publishing** attaches the generated assets directly to the existing secure share, making them instantly available to consumers.

### Considerations

Use Automatic Data Agents when you want to quickly enable AI capabilities for new or existing listings or shares that contain tables or views.

This feature is best suited for listings and shares that meet the following criteria:

* The data structure is well-defined in tables or views.
* For listings, the listing description clearly explains the data domain. (This improves the AI-generated instructions.)
* You don’t have existing semantic views or agents manually attached to the share.

### Limitations

* **Regeneration:** Regenerating an agent replaces the existing agent and semantic view objects; previous versions are not preserved.
* **Object location:** Generated agents and semantic views must be stored in the same database as the shared content.
* **Exclusive generation:** You can’t use this feature if the share already contains agents, semantic views, or Cortex Search Services.
* **Generation time:** The process may take up to 10 minutes depending on the complexity and size of the shared schemas.

## Work with Automatic Data Agents as a provider

Automatic Data Agents allow you to configure, test, and manage AI agents for your listings and shares directly within Provider Studio (for public
and private [Snowflake Marketplace](collaboration-marketplace-about.md) listings), within Internal Sharing (for
[Internal Marketplace](../user-guide/collaboration/listings/organizational/org-listing-about.md) listings), or from the External sharing page (for
direct shares without a listing).

### Required privileges

To create, edit, and manage Automatic Data Agents, you need the following privileges:

#### Privileges required to create objects (agent generation)

| Privilege | Object | Purpose |
| --- | --- | --- |
| CORTEX_USER | Database | Includes the privileges that allow users to call Snowflake AI functions and to use LLMs to generate semantic views. By default, the CORTEX_USER role is granted to the PUBLIC role. |
| CREATE SEMANTIC VIEW | Schema | Required to create a new semantic view |
| CREATE AGENT | Schema | Required to create the Cortex Agent |
| SELECT | Tables/Views in share | Required on any tables or views used in the semantic view definition |
| USAGE | Database | Required to access the database containing your shared objects |
| USAGE | Schema | Required to access the target schema where objects will be created |

> **Note:**
>
> The SELECT privilege on tables is needed during semantic view creation. However, to query a semantic view afterward, you only need the SELECT privilege on the semantic view itself.

#### Privileges required for adding objects to a share (publishing)

| Privilege | Object | Purpose |
| --- | --- | --- |
| OWNERSHIP | Share | Required to grant privileges on objects to the share |
| OWNERSHIP or MODIFY | Listing | Required to modify the listing and submit for approval (only applicable when using listings) |

When you add objects to a share, the following grants are made automatically:

* `GRANT USAGE ON AGENT ... TO SHARE`
* `GRANT SELECT ON SEMANTIC VIEW ... TO SHARE`
* `GRANT REFERENCES ON SEMANTIC VIEW ... TO SHARE`

#### Privileges required to manage objects (regenerate/delete)

| Privilege | Object | Purpose |
| --- | --- | --- |
| OWNERSHIP | Agent | Required to drop or replace the agent (automatically granted to creator) |
| OWNERSHIP | Semantic view | Required to drop or replace the semantic view (automatically granted to creator) |

### Automatic Data Agents workflow

1. Start Automatic Data Agents.
2. Use SQL to verify the created objects.
3. Test the data agent.
4. Optional: Manage the data agent.
5. Attach the Automatic Data Agent to your listing or share.

### Start Automatic Data Agents

For providers, the Automatic Data Agents setup process analyzes your listing or share and generates the necessary Cortex AI objects. You can use Automatic Data Agents with Snowflake Marketplace listings, Internal Marketplace (organizational) listings, or direct shares. You must provide *all required information* before you can get started with Automatic Data Agents.

The examples below describe how to configure Automatic Data Agents on a Snowflake Marketplace listing, an Internal Marketplace listing, or a direct share. Select the
appropriate option.

> **Note:**
>
> The automatic generation wizard is available only in Snowsight.

#### Option 1. Start Automatic Data Agents on a Snowflake Marketplace listing

The steps below assume that you’ve already created a Snowflake Marketplace listing and attached a data product to it. For more information, see [Create and publish a listing](provider-listings-creating-publishing.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. On the Listings tab, select the public listing that you want to configure.
4. On the Secure share tab for the listing, in the Add an Agent to your listing banner, select Get started.

   > **Note:**
   >
   > The listing must have an attached share. Otherwise, the Secure share tab won’t be available.
   > The listing must also include all required information. Otherwise, the Get started button is disabled.
5. In the configuration dialog, enter the following values:

   * Agent Display Name: Enter a name for the agent (defaults to the listing title).
   * Location: Select the target schema for the generated objects.
   * Tables/Views: Select the tables and views to include in the semantic view. You can choose a subset of the available tables and views in the share to control which data the agent can access.

     > **Note:**
     >
     > This schema must be in the same database as the shared data.
6. Select Create.

The generation process begins immediately. You can view the status of each step, including metadata retrieval, semantic view generation, and
agent creation. This process may take several minutes.

#### Option 2. Start Automatic Data Agents on an Internal Marketplace listing

The steps below assume that you’ve already created an Internal Marketplace listing and attached a data product to it. For more information, see [Create an organizational listing](../user-guide/collaboration/listings/organizational/org-listing-create.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » Internal sharing.
3. On the Internal sharing page, select the listing that you want to configure.
4. On the Secure share tab for the listing, in the Add an Agent to your listing banner, select Get started.

   > **Note:**
   >
   > The listing must have an attached share. Otherwise, the Secure share tab won’t be available.
   > The listing must also include all required information. Otherwise, the Get started button is disabled.
5. In the configuration dialog, enter the following values:

   * Agent Display Name: Enter a name for the agent (defaults to the listing title).
   * Location: Select the target schema for the generated objects.
   * Tables/Views: Select the tables and views to include in the semantic view. You can choose a subset of the available tables and views in the share to control which data the agent can access.

     > **Note:**
     >
     > This schema must be in the same database as the shared data.
6. Select Create.

The generation process begins immediately. You can view the status of each step, including metadata retrieval, semantic view generation, and
agent creation. This process may take several minutes.

#### Option 3. Start Automatic Data Agents on a direct share

You can also generate an Automatic Data Agent for a direct share that isn’t associated with a listing.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. On the Shared by your account tab, select the share that you want to configure.
4. In the Add an Agent to your share banner, select Get started.
5. In the configuration dialog, enter the following values:

   * Agent Display Name: Enter a name for the agent.
   * Location: Select the target schema for the generated objects.
   * Tables/Views: Select the tables and views to include in the semantic view. You can choose a subset of the available tables and views in the share to control which data the agent can access.

     > **Note:**
     >
     > This schema must be in the same database as the shared data.
6. Select Create.

The generation process begins immediately. You can view the status of each step, including metadata retrieval, semantic view generation, and
agent creation. This process may take several minutes.

### Verify created objects using SQL

You can use SQL to verify the created objects.

> ```sqlexample
> -- Verify the agent was created
> SHOW AGENTS IN SCHEMA my_database.my_schema;
>
> -- Verify the semantic view was created
> SHOW SEMANTIC VIEWS IN SCHEMA my_database.my_schema;
> ```

### Test the data agent

Before publishing, verify that the agent accurately answers questions about your data.

1. In the Agent section of your listing or share, locate the generated agent.
2. Select one of the available Try buttons to open Cortex Studio.

   You can test the agent response or validate the semantic view.
3. Enter natural language queries related to your data, for example, “What was the average sales volume last month?”
4. Review the generated SQL and the textual response for accuracy.
5. If adjustments are needed, edit the semantic view manually or update your listing description, and then regenerate the agent.

### Manage data agents

#### Regenerate an agent

If your data schema changes or you update your listing description to improve the agent’s context, you can regenerate the agent.

> **Caution:**
>
> Regeneration drops the existing agent and semantic view and creates new versions. Any manual edits made to the previous semantic view will be lost.

1. In the Agent section, on the More actions (…) menu, select Regenerate agent.
2. Confirm the action to start the process.

#### Drop an agent

You can drop agents that aren’t attached to shares. If the agent you want to drop is attached to a share, then you need to remove it from
the share before you can drop it.

1. In the Agent section, select the More actions (…) menu.
2. Select Drop agent.
3. Confirm to remove both the agent and the semantic view from your account.

### Attach the Automatic Data Agent to your listing or share

To make the agent available to consumers, attach it to the secure share.

1. Navigate to the Secure share tab of your listing, or the share details page for a direct share.
2. In the Agent section, select Add to secure share.
3. Review the confirmation dialog, which indicates that the agent and semantic view will be granted to the share.
4. Click Add.

   After the agent is added, any updates to these objects in your account are instantly available to consumers who have access to the listing or share.

## Use Automatic Data Agents as a consumer

As a consumer, you can use the Automatic Data Agent to query your data using natural language.

For Snowflake Marketplace listings, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Select the Cortex AI-ready listing that you want to access and Get the listing if you don’t already have it.
4. Select Open, then select the Agent name to test the agent.

For Internal Marketplace listings, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Select the Cortex AI-ready listing that you want to access and Get the listing if you don’t already have it.
4. Select Open, then select the Agent name to test the agent.

For privately shared listings, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Data sharing » External sharing.
3. On the Shared with you tab, select the Cortex AI-ready listing or share that you want to access and Get it if you don’t already have it.
4. Select Open, then select the agent name to test the agent.

---
title: Configure a cron refresh schedule
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-configure-cron-refresh-schedule.md
section: Collaboration & Marketplace
---

# Configure a cron refresh schedule

If you have the MANAGE LISTING AUTO FULFILLMENT privilege, you can use Snowsight or SQL to configure a [cron](https://en.wikipedia.org/wiki/Cron) refresh schedule for an account or for a database.

## Account-level refresh schedules

If your data product is an application package that is auto-fulfilled to remote regions, updates to your product occur based on a schedule that you set at the account level. This is important for providers who need to offer a predictable timestamp for when refreshes are available to all consumers.

When you create a refresh schedule for an account, you update the auto-fulfillment refresh schedule for every application package published by your account. This refresh schedule doesn’t affect listings with shares attached.

> **Note:**
>
> Account-level schedules are used by Snowflake Native Apps. For other shares, the schedule is per database. Listings that use different databases can have different schedules.

## Database-level refresh schedules

If you’re a provider with multiple listings in a database, you can create a refresh schedule for that database. All listings within that database will refresh based on that schedule.

If your listings are in different databases, you can create different schedules for each database.

## Set the refresh schedule for a listing

SnowsightSQL

To set a cron refresh schedule using Snowsight, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. On the Listings tab, select the row for the listing that you want to manage.
4. On the listing details page, scroll down to the Cloud region availability section.

   > The current refresh schedule is displayed here.
5. Select Cloud region availability » Update refresh frequency.
6. In the Data product refresh menu, select Scheduled time.
7. Specify the frequency and time for this schedule; for example, Daily at 1:00 AM (UTC-7:00) (Local time) Pacific time.
8. To save the updated refresh schedule, select Update.

You can set up cron refresh schedules when you [create](../sql-reference/sql/create-listing.md) or [alter](../sql-reference/sql/alter-listing.md) a listing. The cron expression for configuring a cron refresh schedule consists of the following fields:

```output
# __________ minute (0-59)
# | ________ hour (0-23)
# | | ______ day of month (1-31, or L)
# | | | ____ month (1-12, JAN-DEC)
# | | | | __ day of week (0-6, SUN-SAT, or L)
# | | | | |
# | | | | |
  * * * * *
```

For more information about using SQL to manage data refreshing for auto-fulfillment, see [auto_fulfillment](../progaccess/listing-manifest-reference.md).

The following example sets the cron refresh schedule for a listing to occur Monday through Friday at 5:00 p.m. London (UTC) time:

```sqlexample
ALTER LISTING shared_listing
  $$
    auto-fulfillment:
      refresh_schedule: "USING CRON  0 17 * * MON-FRI Europe/London"
      refresh_type: "SUB_DATABASE"
  $$
PUBLISH=TRUE;
```

---
title: Configure Egress Cost Optimizer
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-eco-configure.md
section: Collaboration & Marketplace
---

# Configure Egress Cost Optimizer

If you’re a provider, you can use Cross-Cloud Auto-Fulfillment (auto-fulfillment) for a listing to automatically replicate your data product to other Snowflake regions without having to manually replicate data.

This section describes how to authorize multi-region sharing and enable and disable Egress Cost Optimizer (ECO) for your organization.

## Authorize multi-region sharing egress cost optimization

ECO must be authorized before it can be used. You can enable ECO initially from the [Snowsight](../user-guide/ui-snowsight-gs.md) Home page, or later using the Settings.

To initially authorize ECO, do the following:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md) as a user who has been granted the ORGADMIN privilege.
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Home tab.
4. Click Get Started.
5. Click Authorize.

   **Why this is important**

   Egress Cost Optimizer is a feature of Cross-Cloud Auto-Fulfillment that helps you reduce
   egress costs by up to 80% when sharing the same data to multiple regions.

   **Enable Egress Cost Optimizer**
   By opting to use Egress Cost Optimizer, you enable Snowflake to intelligently
   route your data to minimize egress costs. You also authorize the associated Snowflake Third-party

   [Sub-processors](https://www.snowflake.com/en/legal/privacy/snowflake-sub-processors/)
   to process your data in the cloud regions described in our Documentation.
   Your data is always secure with end-to-end encryption in transit and rest with no impact to query latency when processed through
   Egress Cost Optimizer.

   **Once Authorized**
   The optimizations will be enabled for all accounts in your organization.
   The account administrators can disable it.

## Enabling Egress Cost Optimizer

You can enable or disable ECO at the account level. Where an auto-fulfillment schedule is set, ECO will be enabled on all the listings in a database that follow the account schedule.

### Enable or disable ECO for an entire account

SnowsightSQL

After ECO is authorized at the organization level, enable or disable ECO for an account by doing the following:

> 1. In the navigation menu, select Marketplace » Provider Studio.
> 2. Select the Settings tab.
> 3. In the Cross-Cloud Auto-Fulfillment pane, click the toggle next to Egress Cost Optimizer to enable or disable ECO.

You can enable egress cost optimization by executing the [ALTER ACCOUNT](../sql-reference/sql/alter-account.md) command to set the ENABLE_EGRESS_COST_OPTIMIZER parameter to TRUE:

```sqlexample
ALTER ACCOUNT SET ENABLE_EGRESS_COST_OPTIMIZER=TRUE;
```

To disable egress cost optimization, set the ENABLE_EGRESS_COST_OPTIMIZER parameter to FALSE:

```sqlexample
ALTER ACCOUNT SET ENABLE_EGRESS_COST_OPTIMIZER=FALSE;
```

For more information see [ALTER ACCOUNT](../sql-reference/sql/alter-account.md).

## Limitations of ECO

* Incremental data ingestion is required for the cloud cache to be fully used by the egress cost optimizer.
* The cloud cache is only used by the egress cost optimizer for refreshes made by auto-fulfillment.
* Egress cost optimizer will only use the cloud cache if the overall egress costs for all listings on the same database are getting optimized. The optimizer algorithm measures the size of the listings at a database level and not at a table level.
* ECO is not supported for listings that include a [Cortex Knowledge Extension (CKE)](../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md).

  Providers should be aware of the cost implications for replication with listings that have a CKE.

  If a CKE is added to a listing that has ECO enabled, ECO will be automatically turned off, and the provider will be notified by email. With ECO turned off, costs associated with the listing can increase.

  Similarly, if a CKE is added to a listing that’s part of a replication group, then ECO will be turned off for all listings within that replication group. An email notification will be sent to the provider indicating that the ECO was turned off.

---
title: Configure listings
source: https://docs.snowflake.com/en/collaboration/provider-listings-reference.md
section: Collaboration & Marketplace
---

# Configure listings

When you [create a listing](provider-listings-creating-publishing.md), you must complete additional fields for your
listing before making it available for consumers.

Depending on how you make your listing available, and how consumers access your data product, some sections or fields might be optional.

After you configure your listing, publish it to consumers. See [Publish a listing](provider-listings-creating-publishing.md).

## Basic information

Complete basic information about your listing. The following table describes the fields in the Basic Information section:

| Field Name | Description | Example |
| --- | --- | --- |
| Title | The title of the listing. When consumers view your listing, the title appears below your company name.  The title must have the following characteristics to be approved:   * Should be between 40–60 characters, up to 110 characters * All major words are capitalized (use title case) * Must be unique * Must be different from any other listings offered by your provider profile | Historical Weather by Postcode. |
| Subtitle | Provides a short, informative explanation of your data product that is visible to consumers.  The subtitle cannot exceed 100 characters. Use sentence case for the subtitle and do not repeat the title.  This option is not available for private listings. | Historical weather data by location. |
| Category | Categories help consumers find your data or app on the Snowflake Marketplace. Select the desired category from the drop-down list of available values. You can select up to three categories.  This option is not available for private listings. | Environment |
| Terms of Service | Specifies a link to the listing terms - the service agreement for the listing. Consumers must accept the listing terms before they can access the listing. Listing terms are required for all listings.  Select one of the following:   * Standard Agreement for Marketplace Products: Snowflake provides standard listing terms for   Snowflake Marketplace products, called the Standard Agreement for Marketplace Products. This agreement is available   at: <https://www.snowflake.com/marketplace/standard-agreement/>. You can choose to use the Standard Agreement as the   listing terms. You can learn more about the Standard Agreement by reviewing the FAQs   available at: <https://www.snowflake.com/standard-agreement-for-marketplace-products/>.  By selecting the Standard Agreement for Marketplace Products, you’re confirming that you’ve reviewed it and the   [Disclaimer](https://www.snowflake.com/standard-agreement-for-marketplace-products/) with your legal counsel. * Custom: Specify a URL to the listing terms. The URL must be publicly accessible and   not require authentication to access. The listing terms can be a PDF or other document type that is accessible from a URL. * Listing terms will be provided offline: Only available for private listings offered directly to specific consumers.   This option lets you provide listing terms to your consumers that isn’t available at a URL. | Custom, `https://www.example.com/en/legal` |

## Details

Complete additional details for your listing.

> **Note:**
>
> This section is optional for private listings.

The following table describes the available fields in the Details section:

| Field Name | Description | Example |
| --- | --- | --- |
| Description | Description of the data product shared in the listing. The description helps consumers understand what is in your data product.  Enter a description between 250 and 6000 characters, with line breaks between paragraphs.  Use dashes instead of bullet points. The description must include an introductory paragraph with information about the data product, such as the scope of the dataset.  For listings that include services or secure functions, the description must include the expected workflow for consumers to access your services or secure functions.  The description can also include the data sources for your listing, or additional information not covered in other fields. | ACME is the number one supplier of customized, pinpoint weather warnings to large enterprises, as well as a vital information source for worldwide weather forecasts, data and meteorological consulting services. This listing provides historical weather data for US zip codes that can be used to further enhance your existing data to provide deeper analytics.  Expected Workflow:   * Consumer shares list of zip codes with provider using a private listing. * Provider enriches the zip codes. * Provider shares enriched data back with the consumer using a private listing. |
| Link to Documentation | A link to a page on your website with more detailed documentation for the listing. Documentation must be clear, and reference the correct schema objects present in the data share or Snowflake Native App associated with the listing. The link must be accessible on the internet, and not require authentication to access. | `https://www.example.com/documentation/` |
| Link to Video | A link to an unlisted or public YouTube video for the listing. Private videos are not supported. Video thumbnails are displayed on the listing details page and videos do not play automatically.  **Tip:** When making a video to display on a listing details page, keep the following in mind:   * Video final frames are displayed after videos play and should include a call to action. * Videos should be short, and show actual product usage, with the first 5 seconds being the most important. | <https://www.youtube.com/watch?v=MEFlT3dc3uc> |

## Data product

Configure the data product for your listing. This can be a secure share, a Snowflake Native App, or a Connected App.

You can select objects and have Snowflake create a secure share, or add a share that you already created.
See [Prepare the shares for your listing](provider-listings-preparing.md) for guidance creating shares for paid listings.

When adding a data product to your listing, consider the following:

* Secure shares can only be attached to one listing.
* After the listing is published, you cannot attach a different share.
* You can only see shares that your current role owns.
* The data product must be legally shareable (i.e. you must own the data or have the right to share it).
* Until a listing is published, it can only be associated with a share in the local/primary account. After the
  listing is published, it can be associated with a share in additional regions that you have selected.

The following table describes the available fields in the Data Product section:

| Field Name | Description |
| --- | --- |
| Database Objects or Secure Share | Data that you want to share as part of the listing. |

## Data product - data dictionary

After adding a data product to your listing, you can add a data dictionary. A data dictionary provides consumers insight into the contents
and structure of a free or paid listing offered on the Snowflake Marketplace before installing the data product into their account.

> **Note:**
>
> This section is optional for private listings.

### About data dictionaries

You can use a data dictionary to make the contents of your listing visible to consumers. A data dictionary is generated for tables and
views within a listing. Listings can also include a preview of data, referred to as a Data Dictionary Data Preview.

Your data is visible in two ways:

* Featured objects: Allow the consumer to quickly view the contents of the object. You can select up to five of the most important
  database objects within the listing.
* All objects: Allows the consumer to view all of the objects within a listing. It is auto-generated when you publish a listing.

Data dictionary Data Preview allow both providers and consumers to preview data for tables and views associated with listings.

Previews provide a representative sample of the data, allowing:

* Providers to see exactly what data will be available in a preview.
* Consumers to determine if a listing contains the data they are looking for.

> **Note:**
>
> Data in a listing is automatically made available for preview.
> Providers needn’t do anything special to enable preview.

### Set up a data dictionary for your listing

Before you can add a data dictionary, you must add a data product to the listing. All listings offered on the Snowflake Marketplace must include
a data dictionary.

To set up a data dictionary, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Find and select the listing where you want to add a data dictionary.
4. Select Save and Create Data Dictionary.

   > **Note:**
   >
   > By saving this share, you agree that Snowflake is permitted to create a data dictionary and associated preview for the share and
   > display it to consumers when the listing is published.
   >
   > Please note, Data Dictionary data previews are automatically updated when underlying data changes.
   >
   > If you data product contains PII or other personal data, please mask those columns as such.
   > For more information, including instructions on how to mask PII and other personal data see Mask PII and other data in data previews.

   After saving the listing the data dictionary displays, listing all of the tables, views, and functions within the
   listing.
5. Search for or select an object that you want to include as a featured object, then select Add to featured.

   Optionally, repeat this step to add additional featured objects. You can have up to five featured objects in a listing.
6. Select Save.

You can edit column descriptions for tables in the Provider Studio, or you can use SQL.
Use the COMMENT parameter in the [CREATE <object>](../sql-reference/sql/create.md) and [ALTER <object>](../sql-reference/sql/alter.md) commands or the [COMMENT](../sql-reference/sql/comment.md)
command to add a comment describing an object or [individual table columns](../sql-reference/sql/alter-table-column.md).

### Mask PII and other data in data previews

Snowflake periodically runs [Data Classification](../user-guide/classify-intro.md) on Data Previews to identify and mask any column with a high likelihood of containing PII
(Personally identifiable information) or other personal data. Personal data includes information relating to an identified or identifiable
person, such as:

* Name, age, email, or mailing address
* Educational or employment information
* Location data or device activity
* Customer records, or account information

Once Snowflake identifies and masks a PII column, an email is sent to the technical contact listed in the provider profile to review the
details. At any time, you can manually select or deselect PII columns in the Data Preview.

To view or modify PII classification results, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings from the top navigation. To create a listing, see [Create and publish a listing](provider-listings-creating-publishing.md).
4. Select the listing to review.
5. Under Data Dictionary, select Edit.
6. Select the table or view to review.
7. Review the classification results.
   Columns containing PII or any other personal data are classified as “Contains PII”.
8. Select the Data Preview tab to preview the object’s content, which may include masked data.
9. If a column is not identified correctly:

   * If a column is mistakenly identified as containing PII, deselect the checkbox to the left of the Name column to ensure that the
     data in that column is unmasked in the Data Preview.
   * If a column contains PII but has not been identified as such, select the checkbox to the left of the Name column to ensure that
     the data in that column is masked in the Data Preview.
10. Select Save.
11. If you agree with the Data Classification, select Save.

> **Note:**
>
> Data Preview content is generated when a data dictionary is enabled for a listing. Preview data for individual tables and views may not
> be immediately available while data is generated.

> **Note:**
>
> It can take up to 3 to 4 hours for columns selected as PII in Provider Studio to display as masked on the consumer-facing listing.

### Data Preview refresh

If you add or remove objects in your listing, or change the schema of an object, the associated data preview will be refreshed, if enabled.
Updating the object data will not result in a data preview refresh.

> **Note:**
>
> Data Previews are refreshed automatically approximately every few hours. You may not see the refreshed preview immediately after adding
> or removing objects, or updating schemas. If the data within the existing objects (for example, rows in a table) is updated, but there
> are no schema changes or new objects added, the data preview will not be refreshed immediately. In this case, the data preview will only
> be updated during the next biweekly refresh.

## Data product - attributes

After specifying a data product, you can define additional attributes for a listing.

> **Note:**
>
> This section is optional for private listings.

The following table describes the available fields in the Attributes section for a data product:

| Field Name | Description |
| --- | --- |
| Update Frequency | How often your data product is updated in Snowflake. If your data product is updated at different frequencies, choose the highest frequency of updates for your data product. |
| Geographic Coverage | Select one or more geographic regions for which your data product has coverage. If applicable, choose specific countries or U.S. states. |
| Geographic Granularity | If you specify global or multiple states or countries as the geographic coverage of your dataset, select a granularity for the data product.  You can only choose one option, so select the most granular option available in your data product. |
| Time Range | Specify the time period that your data product covers. You can specify custom dates as a fixed time range (2020-01-01 - 2021-01-01) or a dynamic time range (Next/Last X days, weeks, months, or years). |
| Timestamp Granularity | If you specify a time range, select a timestamp granularity for the data product.  You can only choose one option, so select the most granular timestamp type in your data product. |
| Additional attributes (optional) | Any additional information that you want to communicate to your consumers. You can include up to 4 additional attributes of the data. Use 2-5 words for each attribute to maximize readability. Each attribute must be fewer than 80 characters. |

## Access and pricing - listing access

For listings offered on the Snowflake Marketplace, you can view the Listing Access for a listing, and modify it if the listing is still
in draft.

Listing access controls how consumers can access your data product. See [Listing access options](collaboration-listings-about.md) for more details.

## Access and pricing - trial

Add a trial for a limited trial listing offered on the Snowflake Marketplace. To add a trial for a paid listing,
see Access and pricing - pricing & trial.

> **Note:**
>
> This option is required for limited trial listings.

The following table describes the available fields in the Trial section:

| Field | Description |
| --- | --- |
| Trial Type | Choose the type of trial to offer:   * Limited time. Consumers can trial your data product for a limited period of time. Choose this option if you offer your entire   data product to consumers for a short period of time. * Unlimited time. Consumers can trial your data product indefinitely. Choose this option if you offer a sample of your full data   product. For example, weather data for just one city, while your full data product includes weather data for an entire country.   If your listing has an application package as the data product, you can also choose from two other trial types:   * Limited functionality. Consumers have access to limited functionality of your application package. * Limited functionality and time. Consumers have access to limited functionality of your application package for a limited time.   **Caution:** You must limit functionality to your app by using the [SYSTEM$IS_LISTING_TRIAL](../sql-reference/functions/system_is_listing_trial.md) system function.  If you select a limited functionality trial and your application package is not set up to limit functionality in the shared data content or application logic, your app will provide full functionality to trial customers.  See [Limit functionality of your Snowflake Native App for trial consumers](provider-listings-preparing.md) for details on fully configuring limited functionality trials. |

## Access and pricing - pricing & trial

Add the pricing plan and trial for a paid listing in this section.

This section is required for paid listings offered on the Snowflake Marketplace, but trials are optional for paid private listings.

> **Note:**
>
> Only account administrators (users with the ACCOUNTADMIN role) or the listing owner (a role with OWNERSHIP privilege on the listing) can complete this section.

The following table describes the available fields in the Trial & Pricing section:

| Field | Description |
| --- | --- |
| Pricing Plan | Choose the pricing plan for the listing. See [Paid listings pricing models](provider-listings-pricing-model.md). Prices are in US dollars only. |
| Free Trial | Choose the trial type for the listing:   * Limited time: This option lets consumers participate in a free trial for a time period that you specify (1, 7, 30, 60, or 90 days).   The maximum trial period is 90 days. When the trial ends, the consumer account loses access to the data product.   Other accounts in the same organization can perform a new trial of your listing. * Limited functionality: This option lets consumers participate in a free trial with limited functionality of the paid data product.   This trial mode doesn’t expire until the consumer upgrades to the paid listing. * Limited functionality and time: This option lets consumers participate in a free trial for a period that you specify   (1, 7, 30, 60, or 90 days). The trial version offers limited functionality of your data product, and when the trial period ends,   the account performing the trial loses access to the data product. |

## Business needs

Add the business needs that your data product can help consumers with.

> **Note:**
>
> This section is optional for private listings.

The following table describes the available fields in the Business Needs section:

| Field Name | Description | Example |
| --- | --- | --- |
| Business Need | Help consumers find your listing on the Snowflake Marketplace by specifying relevant business needs addressed by your data product. You can select up to six relevant business needs.  If you do not see a relevant business need in the drop-down list, you can create a custom need using 2-4 words. However, consumers cannot filter by custom business needs on the Snowflake Marketplace.  You can edit the list of business needs at any time without resubmitting the listing for approval. | Location Data Enrichment |
| Description | Description of how your listing addresses the selected business needs, using an example specific to a customer use case or business need.  Add a unique description for each business need. | Location Data Enrichment: Identify all of the zip codes associated with a given county, census tract, or core-based statistical area. |

## Sample SQL queries

You can specify valid sample SQL queries that consumers can use to get value out of your data product, or at least verify that your data
product was successfully installed in their Snowflake account.

> **Note:**
>
> At least one valid SQL query is required in order for you to publish a listing on the Snowflake Marketplace.
> It’s recommended to include 3–4 sample queries.
>
> SQL queries are optional when publishing private listings.

The sample SQL has the following requirements:

* The query must return at least one row.
* The query must reference objects that are explicitly in the share.
* Objects must be qualified using `SCHEMA.OBJECT`. Do not include the database name. For example,
  `EXAMPLE_SCHEMA.TABLE_A`.

Select Add to add one SQL query. The following table describes the available fields in the Sample SQL Queries section:

| Field Name | Description | Example |
| --- | --- | --- |
| Title | Descriptive title for the query to help consumers understand how they can use the data product. | Determine if an outdoor event could be affected by rain. |
| Description (Optional) | Description of the example that ties the title to a specific use case for the data product. The description is automatically loaded as a comment when consumers run the sample query after installing your data product. You can also include additional instructions, such as the name of the schema, sample tables, or fields.  Use <*schema*>.<*table*> format when referencing tables and views in your SQL.  Do not include the database name in the query, because consumers create custom database names when they get your listing. | If you are hosting an outdoor event in the next 7 days, use our forecast data to determine if the event might be affected by rain. |
| SQL Query | Code for your sample SQL queries. The queries should directly answer the title and description.  Snowflake automatically validates your sample queries. To be valid, a sample query must return at least one row.  If a query fails to validate, you can save the listing but the listing cannot be published until all sample queries are successfully validated. You must select a warehouse to use to validate the SQL query. |  |

## Region availability (Marketplace listings only)

The following table describes the available fields in the Region Availability section.

| Field Name | Description |
| --- | --- |
| Region Availability | By default, your listing is available in All regions. Choosing all regions ensures the availability of your listing in any future regions added by Snowflake. For paid listings, selecting this option makes the listing available in [supported regions](consumer-listings-paying.md) and any future supported regions added by Snowflake.  If your listing has specific regional limitations, select All regions to change the region availability to Custom regions and select the regions in which you want to offer your data product. When you choose custom regions, your listing is still visible in all Snowflake Marketplace regions, but consumers can only get your data product in the regions you specify. |
| Fulfillment method | Automatic fulfillment is selected by default. With Cross-Cloud Auto-fulfillment, your data product is automatically fulfilled to a region and you incur costs only when there is consumer demand in that region.  When you use auto-fulfillment, you must also select a refresh frequency at which to update the data product shared with consumers. You must select a refresh frequency of a maximum of 8 days. If your data product is a Snowflake Native App, you can only set a refresh frequency on the account level.  For more details on auto-fulfillment, see [Auto-fulfillment for listings](provider-listings-auto-fulfillment.md).  If you can’t use auto-fulfillment, select Manual to manually replicate your data product. To fulfill requests, you must set up accounts in regions with consumer demand, manually replicate the product to each account, create secure shares in each account, and attach those shares to this listing. See [Manually replicate data to fulfill a listing request](provider-listings-managing.md) |

## Connection string identifiers (Connected Apps only)

For Connected Apps, add one or more valid connection string identifiers (CSIDs). Submit the same CSIDs that you submitted when you registered on the Snowflake Partner Network portal.

## Consumer accounts (private listings only)

To publish a listing to specific consumers, you must specify the account identifiers for the accounts that you want to share with:

| Field Name | Description | Example |
| --- | --- | --- |
| Consumer Accounts | Specifies the Snowflake accounts that you want to share your private listing with. You can use Snowflake account identifiers or URLs. See [Finding the organization and account name for an account](../user-guide/admin-account-identifier.md) for details. | `ORGABC.ACCOUNT123` `https://<organization_name>-<account_name>.snowflakecomputing.com` |

If you’re sharing with a consumer account that is in a different region than your account, you must also set up auto-fulfillment:

| Field Name | Description |
| --- | --- |
| Auto-fulfillment | Select the replication interval and frequency for your data product. For example, you can configure replication to occur every two hours. If your data product is an application package, you can only set the refresh frequency and interval on the account level. |

See [Auto-fulfillment for listings](provider-listings-auto-fulfillment.md) for more information.

---
title: Consumer fulfillment behavior
source: https://docs.snowflake.com/en/collaboration/listings-bcdr-consumers.md
section: Collaboration & Marketplace
---

# Consumer fulfillment behavior

Traditionally, Snowflake uses two primary fulfillment patterns:

* **Same-region access**: Consumers in the same region as the provider access data directly from the provider’s account without additional replication.
* **Cross-region access**: Providers use [auto-fulfillment](provider-listings-auto-fulfillment.md) to replicate data and metadata to a Secure Share Area (SSA) in the consumer’s region.

## The impact of failover groups for listings

With the introduction of failover groups for listings, Snowflake ensures that metadata and relationships remain intact in a
secondary data recovery (DR) account. This capability provides a specialized access pattern designed to prevent downtime for your consumers,
regardless of the region that is currently the primary.

## Consumer access patterns

After providers configure [Business Continuity and Disaster Recovery (BCDR)](listings-bcdr.md) for listings, the fulfillment path depends on the consumer’s location relative to the listing’s original primary region.

### The original primary region

In the region where the listing was originally created, sometimes called the *Home* region, consumers access data directly from the original provider account.

* **Failover status**: Regardless of the failover status, even if the listing fails over to a secondary region, consumers in the original region don’t switch to an SSA.
* **Data updates**: These consumers continue to receive fresh data through the failover group, which replicates data back from the new primary to the old primary.

### Secondary and remote regions

For consumers located in any other region — including the region where the DR secondary account resides — fulfillment follows the SSA pattern.

* **Unified mount point**: To ensure a seamless experience, Snowflake maintains a single *mount point* per region. In these regions, that mount point is the SSA.
* **Failover resiliency**: If a failover occurs, the SSA begins sourcing its updates from the new primary account. The consumer’s connection to the SSA remains unchanged, resulting in zero downtime.

## Comparison of fulfillment paths

The following table summarizes how fulfillment works based on the consumer location.

| Consumer location | Fulfillment source | Access method |
| --- | --- | --- |
| Original primary region | Original provider account | Direct share |
| Secondary (DR) region | SSA | Auto-fulfillment |
| All other remote regions | SSA | Auto-fulfillment |

---
title: Create and publish a listing
source: https://docs.snowflake.com/en/collaboration/provider-listings-creating-publishing.md
section: Collaboration & Marketplace
---

# Create and publish a listing

This topic contains procedures for creating and publishing a listing privately or on the Snowflake Marketplace.

## Prerequisites for listing creation

* Agree to the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms). Acceptance of the Snowflake Provider and Consumer Terms is not required when creating free or paid off-platform private listings, but you must review and accept the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/).
* Review the [Provider Policies](https://www.snowflake.com/provider-policies/).
* [Create a provider profile](provider-becoming.md) to offer paid listings or listings on the Snowflake Marketplace.
* If you want to charge for your data product, [set up your account to provide paid listings](provider-becoming.md).
* Get access to a role with provider privileges.
* Prepare the data for your listing. See [Prepare data for a listing](provider-listings-preparing.md).

To learn more about the requirements for becoming a provider, see [Use listings as a provider](provider-becoming.md).

## Considerations for sharing listings to accounts in US government regions

Non-government providers who want to share listings with consumer accounts in US government regions must consider the following details:

* The account in the US government region must enable data sharing and collaboration. See [Prepare to access listings from accounts in U.S. government regions](consumer-becoming.md).
* You must use Cross-Cloud Auto-Fulfillment, and your data product can only contain or reference
  [objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md).
* If you offer a listing to US government regions on the Snowflake Marketplace or directly to a consumer account in a
  [US government region](../user-guide/intro-regions.md), the secure share area (SSA)
  created to auto-fulfill the listing to that region incurs costs at the rate specific to that region. See the consumption table
  available from [Snowflake Legal](https://www.snowflake.com/legal/), the [pricing guide](https://www.snowflake.com/resource/the-simple-guide-to-snowflake-pricing/) and
  [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).

## Share data or apps with specific consumers using a private listing

You can create free or paid listings to share directly with specific consumers. You might create a private listing to fulfill a request
from a limited trial listing, or to share data or apps with a consumer with whom you already have a business relationship.

You must know a consumer’s account identifier to share a listing with them. See [Finding the organization and account name for an account](../user-guide/admin-account-identifier.md).

> **Note:**
>
> Your role must have the required privileges to create a listing.
> See [Privileges required for working with listings](provider-becoming.md).

### Create a free (or paid off-platform) private listing

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Specified Consumers.
4. In the Edit listing title dialog, enter a name for your listing.
5. Select the Add data product button, then click + Select to select the objects to attach to the listing.

   * If you select one or more database objects, Snowflake creates a secure share with those objects.
     You can change the name of the secure share.
   * If you select an existing secure share, the name of that share appears.
6. In the Access type dropdown, select Free.
7. In the Who can access section, add the [organization and account names](../user-guide/admin-account-identifier.md) for the consumers that you want to share the listing with.

   1. If you add a consumer account in a region that isn’t your local region, Snowflake enables [auto-fulfillment](provider-listings-auto-fulfillment.md) to replicate data to the remote region after a consumer gets your listing. Complete the following additional steps:

      1. In the Auto-fulfillment section, enter a value and select an interval to specify how often to replicate your data product from your region to the remote region.
      2. If you don’t have a default warehouse set, select a warehouse to use for auto-fulfillment.
8. Enter a description for your listing.
9. In the Legal Terms section, select the legal terms that apply to your listing.

   If you don’t see any legal terms, you must first accept the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).
10. (Optional) In the Attributes section, add custom attributes to your listing. For more information, see [Data product - attributes](provider-listings-reference.md).
11. (Optional) Click in the Data dictionary section to add featured objects from the listing’s data dictionary. For more information, see [Set up a data dictionary for your listing](provider-listings-reference.md).
12. (Optional) Click in the Business needs section to add tags that describe the business needs that your data product addresses. For more information, see [Business needs](provider-listings-reference.md).
13. (Optional) Click in the Quick Start Examples section to add sample SQL queries or a notebook that demonstrate how to use the data product. For more information, see Attach a notebook to a Snowflake Marketplace listing.
14. Select Publish to publish the listing to the selected consumers. Snowflake saves your listing if you don’t publish it immediately.

### Create a paid private listing

The following example shows how to create a private listing that includes a pricing plan and an offer.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Specified Consumers.
4. In the Edit listing title dialog, enter a name for your listing.
5. Select the Add data product button, then click + Select to select the objects to attach to the listing.

   * If you select one or more database objects, Snowflake creates a secure share with those objects.
     You can change the name of the secure share.
   * If you select an existing secure share, the name of that share appears.
6. In the Access type dropdown, select Paid.
7. In the Who can access section, add the [organization and account names](../user-guide/admin-account-identifier.md) for the consumers that you want to share the listing with.

   * If you add a consumer account in a region that isn’t your local region, Snowflake enables [auto-fulfillment](provider-listings-auto-fulfillment.md) to replicate data to the remote region after a consumer gets your listing. Complete the following additional steps:

     > 1. In the Auto-fulfillment section, enter a value and select an interval to specify how often to replicate your data product from your region to the remote region.
     > 2. If you don’t have a default warehouse set, select a warehouse to use for auto-fulfillment.
8. Enter a description for your listing.
9. In the Legal Terms section, select the legal terms that apply to your listing.

   If you don’t see any legal terms, you must first accept the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).
10. (Optional) In the Attributes section, add custom attributes to your listing. For more information, see [Data product - attributes](provider-listings-reference.md).
11. (Optional) Click in the Data dictionary section to add featured objects from the listing’s data dictionary. For more information, see [Set up a data dictionary for your listing](provider-listings-reference.md).
12. (Optional) Click in the Business needs section to add tags that describe the business needs that your data product addresses. For more information, see [Business needs](provider-listings-reference.md).
13. (Optional) Click in the Quick Start Examples section to add sample SQL queries or a notebook that demonstrate how to use the data product. For more information, see Attach a notebook to a Snowflake Marketplace listing.
14. Click in the Pricing section to set up pricing information for your listing. For more information about pricing plans and offers, see [Pricing plans and offers](../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md).

    1. In the Pricing plans tab, select Create pricing plan.
    2. In the Settings dialog, specify a display name for your pricing plan, then click Next.
    3. In the Pricing details dialog, specify a pricing model. You can choose either Usage-based or Flat fee.

       * If you select Usage-based, specify the following details:

         + The usage-based access fee (monthly fee).
         + The cost per query and the number of included queries (optional).
         + The maximum monthly charge (optional).
       * If you select Flat fee, specify the flat fee amount and the billing frequency.
    4. Click Next.
    5. In the Summary dialog, review the pricing details, then click Done.
15. Navigate to the Offers tab.

    1. In the Offers tab, select + Create offer.
    2. In the Offer details dialog, specify details for the offer.

       1. Select Standard offer.
       2. In the Purchase type dropdown, select Self-serve to allow consumers to see the price and purchase the listing directly, or select Sales-led to require consumers to contact you to purchase the listing.
       3. Specify a name for the offer.
       4. Select Next.
    3. In the Billing and payments dialog, select the pricing plan to attach to this offer.

       1. In the Select a pricing plan dropdown, select the pricing plan that you created earlier.
       2. Select a contract type of either Limited-time or Recurring (Subscription).
       3. Specify a contract duration.
       4. In the Payment options dropdown, select whether to charge customers based on the pricing plan or to allow payment in installments.

          * If you select Accept installments, specify the number of installments and the frequency of the installments.
       5. Specify the date of the first invoice or to invoice when the offer is accepted.
       6. (Optional) Specify whether to require consumers to include a credit card on file to purchase the listing.
       7. Select Next.
    4. In the Description dialog, enter information about the offer that users will see.

       1. Specify an offer name to display to consumers.
       2. Specify the price to display to consumers.
       3. (Optional) Specify a tagline to display to consumers.
       4. Specify the text for the button that consumers click to purchase the listing.
       5. (Optional) Specify any value propositions for the offer.
    5. Select Next.
16. Return to the Listing details tab. You will see that the offer you created is now attached to the listing.
17. Select Publish to publish the listing to the selected consumers.

    If you exit without publishing, the listing is saved as a draft.

### Create a trial

In a usage-based trial, you can offer a number of free queries that consumers can run against your data product. After all free queries have been used, the consumer must purchase the data product to run additional queries.

To add a trial to a listing, the listing must have a data product attached and you must have the ACCOUNTADMIN role or the OWNERSHIP privilege on the listing. To learn more about the privileges required to manage listings, see [Prepare data for a listing](provider-listings-preparing.md).

1. Create a listing that includes a pricing plan and offer, as created in the previous example.
2. On the Listing details tab, click in the Trial (optional) area, and select one of the following usage trial types:

   * Limited usage (available for usage-based pricing plans only)

     Enter a value in the Number of Free Queries During Trial field.

     This value indicates the number of free queries that consumers can run against your data product. After all free queries have been used, the consumer must purchase the data product to run additional queries.
   * Limited time

     Enter a value in the Length of Trial field.
   * Limited functionality

     Limit the listing’s data product to only the objects and features that you want to include in the trial.
   * Limited functionality and time

     Enter a value in the Length of Trial field, and limit the listing’s data product to only the objects and features that you want to include in the trial.
3. Select Save.
4. Select Publish to publish the listing to the selected consumers.

   If you exit without publishing, the listing is saved as a draft.

### Convert a direct share to a free (or paid off-platform) private listing

You can convert a direct share to a free (or paid off-platform) private listing or to a listing published on Snowflake Marketplace. When you do so:

* Existing consumers retain access to the share.
* You gain access to usage analytics starting from the date the listing is published. Historical usage data is not available.
* You can use auto-fulfillment to share with consumers in remote regions if you are not already using replication for the objects in your
  share and if your share only contains objects supported by auto-fulfillment. For more information, see [Objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md).
* You cannot convert your share to a paid listing if your share has active consumers.

To convert a direct share to a free private listing, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Snowflake Marketplace.
4. In the Create Listing window, enter a name for your listing.
5. Enter a subtitle and select a profile for your listing.
6. Select the Product type drop-down, then select an existing secure share that backs the direct share, instead of picking individual
   database objects.

   When you select an existing share, its name appears as the attached data product for the listing.
7. In the Access type drop-down, select Free to offer a data product that is freely available to consumers.

   > **Note:**
   >
   > To convert a direct share to a paid private listing, select Paid in the Access type drop-down.
8. In the Who can access section, add the [data sharing account identifier](../user-guide/admin-account-identifier.md) for the consumer that you want to share the listing with.
9. Configure the remaining listing fields to prepare it for publishing.
10. Submit the listing for approval and publishing.

If you decide to use auto-fulfillment to support remote consumers of your share, coordinate the following workflow with the remote consumers of your data:

1. After you publish the listing, let consumers in remote regions know how to access the listing. See [Access a private listing](consumer-listings-access.md).
2. After the consumers in remote regions get your listing, auto-fulfillment replicates the data to the remote region. See [Objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md).
3. When auto-fulfillment completes, the consumer receives an email that the data is available. At that point, the consumer must do the
   following:

   1. Drop the existing imported database created from the direct share.
   2. Get the listing and create a database, using the same name as the database imported from the direct share.

After setting up your direct share as a listing, you can use Provider Studio to manage and modify your listing.
For more information, see [Modify published listings](provider-listings-modifying.md) and [Monitor listing use](provider-listings-monitor-studio.md).

> **Tip:**
>
> To ensure a seamless transition for your high-priority clients, consider implementing a dedicated validation window following the data migration. This allows sensitive accounts an allocated period to verify that their information has successfully moved from the initial share to the final listing.

## Share data or apps publicly on Snowflake Marketplace

> **Note:**
>
> Before you create and publish a paid listing on Snowflake Marketplace, contact your business development partner at Snowflake.
> If you do not have a business development partner, [submit a case with Marketplace Operations](https://snowforce.my.site.com/s/provider-onboarding-case).
> This step is required for listing approval.

To publish data, Snowflake Native Apps, or Connected Apps on Snowflake Marketplace, your role must have the required privileges to create a listing. See [Privileges required for working with listings](provider-becoming.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Snowflake Marketplace.
4. In the Create Listing window, enter a name for your listing.
5. Enter a subtitle and select a profile for your listing.
6. Select the Product type drop-down, then select the product type associated with the listing. The following product types are available:

   * Secure share: Attach data or other objects from your account.
   * Native App: Attach applications that run directly and securely in a consumer’s account.
   * Connected App: List an application that connects to the consumer’s Snowflake account to process data.

     Before attaching a Connected App to a listing, be sure to review the [Connected Apps guidelines](guidelines-reqs-for-listing-apps.md).
7. In the Access type drop-down, select one of the following options:

   * Free to offer a data product that is freely available to consumers.
   * Limited trial to offer a trial of your data product, with unlimited access to the data product available on request.
   * Paid to charge for your data product on Snowflake Marketplace.
   > > **Note:**
   > >
   > > If you select Paid for the Access type, and you want to change it, you have to delete the current draft and create a new one.
8. Configure the remaining listing fields to prepare it for publishing.
9. Submit the listing for approval and publishing.

### As a Snowflake Marketplace provider, create a listing in a Virtual Private Snowflake (VPS) deployment

If you’re a Snowflake Marketplace provider, follow these steps to create a [V2 listing](collaboration-listings-about.md) in a VPS deployment:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Snowflake Marketplace.
4. In the Create Listing window, enter a name for your listing.
5. Select Add data product.
6. In the Data product, click + Select to select the objects to attach to the listing.
7. In the Access type dropdown, select one of the following options:

   * Free to offer a data product that is freely available to consumers.
   * Limited trial to offer a trial of your data product, with unlimited access to the data product available on request.
   * Paid to charge for your data product on Snowflake Marketplace.
   > > **Note:**
   > >
   > > If you select Paid for the Access type, and you want to change it, you have to delete the current draft and create a new one.
8. Scroll to the Region Availability section and select Set region availability.

   1. By default, the region availability is set to All Regions. To change this setting, select the All regions edit button, then select Custom regions.
   2. Click Select regions, and then select the [region groups](collaboration-marketplace-about.md) where you want your listing to be available.

      Review the region groups and regions. Region groups and regions that have deployments in VPS are indicated with an info icon.

      Hover over that icon to see information about the deployment.

      Additional fulfillment costs may incur for listings offered in regions that have deployments in VPS.

      For more information about how auto-fulfillment incurs costs, see [How auto-fulfillment incurs costs](provider-understand-cost-auto-fulfillment.md).

      > **Note:**
      >
      > If providers don’t want to target VPS regions, they can open a Worksheet and replace region grouping names within individual region names in the listing manifest.
   3. Select Done.
9. Configure the listing to prepare it for publishing.
10. Submit the listing for approval and publishing.

### As a VPS provider, create a listing in Snowflake Marketplace

If you’re a VPS provider, follow these steps to create a [V2 listing](collaboration-listings-about.md) that’s available in Snowflake Marketplace:

> **Note:**
>
> VPS providers can’t create paid listings.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Create Listing » Snowflake Marketplace.
4. In the Create Listing window, enter a name for your listing.
5. Select Add data product.
6. In the Data product, click + Select to select the objects to attach to the listing.
7. In the Access type dropdown, select one of the following options:

   * Free to offer a data product that is freely available to consumers.
   * Limited trial to offer a trial of your data product, with unlimited access to the data product available on request.
8. Scroll to the Region Availability section and select Set region availability.

   1. By default, the region availability is set to All Regions. To change this setting, select the All regions edit button, then select Custom regions.
   2. Click Select regions, and then select the [region groups](collaboration-marketplace-about.md) where you want your listing to be available.

      Review the region groups and regions. Region groups and regions that have deployments in VPS are indicated with an info icon.

      Hover over that icon to see information about the deployment.

      Additional fulfillment costs may incur for listings offered in regions that have deployments in VPS.

      For more information about how auto-fulfillment incurs costs, see [How auto-fulfillment incurs costs](provider-understand-cost-auto-fulfillment.md).

      > **Note:**
      >
      > If providers don’t want to target VPS regions, they can open a Worksheet and replace region grouping names within individual region names in the listing manifest.
   3. Configure the fulfillment method. By default, the fulfillment method is set to Automatic.
   4. Select Done.
9. Configure the listing to prepare it for publishing.
10. Submit the listing for approval and publishing.

### Create a listing on the Snowflake Marketplace that includes a compliance badge

SnowsightSQL

To create a listing that includes [compliance certifications](provider-becoming.md) by using Snowsight, follow these steps:

1. Sign in to Snowsight.
2. In the navigation menu, select Marketplace ‣ Provider Studio.
3. Select the Listings tab, then create a new listing.
4. Set the profile and choose a product and access type.
5. In the optional Certifications section, add the certification for your listing. You can upload the supporting compliance documentation and set the expiration date for each certification.
6. Submit your listing for approval.

To create a listing that includes [compliance badges](provider-becoming.md), follow these steps:

1. Using an approved [profile](provider-listings-preparing.md), create your [listing manifest.yml](../progaccess/listing-manifest-reference.md).
2. In the manifest file, add the `compliance_badges` field and include a line for each certification type; for example :

   ```yaml
   title: "My listing title"
   subtitle: "My listing subtitle"
   description: "My listing description"
   profile: "MyProfile"
   …
   compliance_badges:
   - type: SOC2
     expiry: 12-25-2026
     files:
       - soc2_compliance_verification.pdf
   - type: HIPAA
     expiry: 06-07-2026
     files:
       - hipaa_compliance_verification.pdf
   ```
3. Install and configure [SnowSQL](../user-guide/snowsql.md).
4. To connect to SnowSQL, run the following command:

   ```bash
   snowsql -c my_example_connection
   ```
5. To create a database, schema, and stage, run the following commands:

   ```sqlexample
   CREATE DATABASE <db name>;
   CREATE SCHEMA <schema name>;
   CREATE STAGE <stage_name>;
   ```
6. To upload your listing manifest file from local to stage, run the following command:

   ```sqlexample
   PUT file:///<local_path>/manifest.yml @<stage_name>/<prefix>
     SOURCE_COMPRESSION=None
     AUTO_COMPRESSION=False
     OVERWRITE=True;
   ```

   > **Note:**
   >
   > To use Snowsight to upload files to a stage, follow the steps in [Staging files using Snowsight](../user-guide/data-load-local-file-system-stage-ui.md).
7. To upload the compliance documents that are listed in manifest to stage, run the following commands:

   ```sqlexample
   PUT file:///<local_path>/soc2_compliance_verification.pdf @<stage_name>/<prefix>
   PUT file:///<local_path>/hipaa_compliance_verification.pdf @<stage_name>/<prefix>
   PUT file:///<local_path>/sample.pdf @<stage_name>/<prefix>
     SOURCE_COMPRESSION=None
     AUTO_COMPRESSION=False
     OVERWRITE=True;
   ```
8. To verify that the files uploaded successfully and with the correct names, run the following command:

   ```sqlexample
   LIST @<stage_name>/<prefix>;
   ```
9. To create a listing by using the manifest file you uploaded to the stage, use [CREATE LISTING](../sql-reference/sql/create-listing.md); for example:

   ```sqlexample
   CREATE EXTERNAL LISTING <listing_name>
     APPLICATION PACKAGE <app package name>
     FROM @<staging_name>/<prefix>
     REVIEW = True
     PUBLISH = True;
   ```

After you add a certification to a listing, you can verify that it was added correctly:

1. Run the following command:

   ```sqlexample
   DESCRIBE LISTING <listing_name> REVISION = DRAFT;
   ```
2. In the output, check the manifest.yml column for the `compliance_badges` section.

## Configure a listing

You must provide additional details for paid private listings and any listing offered on the Snowflake Marketplace before you can submit your
listing for approval or publish it to specific consumers.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the draft listing you want to configure.
4. Select Add next to each section that appears on the page and provide the required information.

   As you provide information for each section, refer to [Configure listings](provider-listings-reference.md) for information on each
   field. The specific properties available to edit depend on the type of listing that you create.

## Publish a listing

After creating and configuring a listing, you can publish a listing.

The specific procedures for publishing a listing depend on whether you’re publishing a free (or paid off-platform) private listing, offering a paid listing
privately, or offering any listing on the Snowflake Marketplace:

* Publish a listing to specific consumers
* Publish a listing on the Snowflake Marketplace

To publish a listing, you must use the ACCOUNTADMIN role or another role with the OWNERSHIP privilege for the listing that you want to publish.

When you publish a listing, it is visible to consumers in all current and future Marketplace regions, but consumers can only get, purchase,
or request your product in regions you select.

### Publish a listing to specific consumers

To share a private listing with specific consumer accounts, you must publish the listing to those accounts. Private listings
do not appear on the Snowflake Marketplace.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the draft listing you want to publish.
4. Select Publish.

After you publish the listing, it’s available for the selected consumers to access on the Data sharing » External sharing page.

For more information, see [Access and install listings as a consumer](consumer-listings-access.md).

> **Note:**
>
> After you publish a private listing, you cannot change the share associated with the listing.

### Publish a listing on the Snowflake Marketplace

Every listing in the Snowflake Marketplace must go through the review and approval process. After a listing is approved, it can be published in
the Snowflake Marketplace. If a listing is rejected, review the feedback comments, update the listing, and resubmit it for approval.

#### Submit your listing for approval

Before you can publish a listing to the Snowflake Marketplace, you must submit the listing to Snowflake for approval.

If you want to submit your listing for approval but the option to Submit for Approval is disabled, check the following:

* You completed the steps to configure the listing. See Configure a listing.
* You are the ACCOUNTADMIN or have the OWNERSHIP privilege for the data product attached to the listing.
* All sample SQL queries attached to the listing pass validation.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the draft listing you want to submit for approval.
4. Select Submit for Approval.
5. After the listing is reviewed by Snowflake, the state changes to Approved or Denied.

   If the listing has been denied, update the listing based on the feedback provided in comments, and resubmit it for
   approval.

   When a listing is approved or denied, an email notification is sent to both the Business Contact and Technical Contact email
   addresses in the provider profile associated with the listing.

#### Publish your listing

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the approved listing you want to publish.
4. Select Publish.

After you publish your Snowflake Marketplace listing for the first time, subsequent changes to the listing that require approval from Snowflake
are published automatically after approval. To prevent your listing from being automatically published,
see Deactivate automatic publishing.

When you publish a listing, it is visible to consumers in all current and future Snowflake Marketplace regions. Consumers can only get, purchase,
or request your product in regions you select. See [Auto-fulfillment for listings](provider-listings-auto-fulfillment.md) for more about region availability.

After publishing your Snowflake Marketplace listing, you can define a [referral link](provider-listings-referral-link.md) for the listing.
Referral links let you give consumers a direct link to your listing.

### Deactivate automatic publishing

After a listing is published, you can deactivate automatic publishing for future changes to the listing.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the approved listing for which you want to deactivate automatic publishing.
4. On the listing details page, select Settings.
5. In the Publishing section of the Listing Settings, select Edit Publishing.
6. In the Publish Setting window, select Manual.
7. Select Save.

The listing is no longer automatically published. Now, when you make changes to your listing, you must manually publish the listing.
See Publish your listing.

## Attach a notebook to a Snowflake Marketplace listing

Providers can add a notebook to a listing to show potential consumers how they can leverage and benefit from a data product. The listings
can be available on the Snowflake Public Marketplace, Internal Marketplace, or as a private listing to a select set of consumers.

A provider can attach a notebook that was fully run with results to a listing. The results can include tabular outputs or visualization to describe
the value of the data products within a listing. Providers can include both Python-based and SQL-based examples in the notebook and add clear
Markdown explanations to guide consumers.

> **Note:**
>
> Notebooks attached to a listing are view-only and can’t be cloned, downloaded, or interacted with by the consumer.

To attach a notebook to a listing, follow these steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Notebooks.
3. [Create a notebook](../user-guide/ui-snowsight/notebooks-create.md).
4. In each cell of the notebook, select Run all or Run to generate results. Ensure that you run in each cell.
5. To verify that the notebook runs successfully, locate the green run-status indicator.
6. To end your Notebooks session, select the Active drop-down and select End session.
7. In the navigation menu, select Marketplace » Provider Studio. You can attach a notebook to all listing types.
8. Create a new listing or choose an existing listing.
9. Select + Add data product.
10. Choose + Select.
11. In the Quick Start Examples section, select Add Notebook.
12. Select the notebook to attach. You can use the search feature to find a specific notebook.
13. Select Save.
14. To publish the listing to the selected consumers, select Publish.

> > **Note:**
> >
> > To update the contents of the notebook after attaching it to a listing, you must remove the notebook from the listing and attach it again.

## Remove a notebook from a listing

1. In the navigation menu, select Marketplace » Provider Studio.
2. In the Quick Start Examples section, select the notebook to remove.
3. Select Remove Notebook.

> > **Note:**
> >
> > If you lose ownership of the notebook, or if you delete it or remove it from your shared resources, a copy remains with the listing.

## Limitations when attaching notebooks to listings

* Providers can only attach one notebook to a listing, and the provider must have OWNERSHIP privileges on the notebook.
* Consumers can only view the notebook and its results in the listings.
* Changes in the notebook aren’t automatically updated in the listing. To reflect the latest changes, you must remove the notebook and add it back again.

---
title: Define the referral link for a Marketplace listing
source: https://docs.snowflake.com/en/collaboration/provider-listings-referral-link.md
section: Collaboration & Marketplace
---

# Define the referral link for a Marketplace listing

This topic explains how to define the referral link for a listing offered on the Snowflake Marketplace.

Referral links let you send consumers a link directly to your listing on the Snowflake Marketplace. When a consumer accesses
the referral link, they are prompted to either sign in to their existing Snowflake account or sign up for a
Snowflake trial account. In both cases, the consumer is then redirected to your listing in the Snowflake Marketplace.

> **Note:**
>
> Referral links are currently not available for private listings.

To define the referral link for a listing on the Snowflake Marketplace, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for your listing.
4. Note the listing ID in the URL:

   For example, if your listing URL is:

   > `https://app.snowflake.com/marketplace/listing/ABC123XYZ789`

   The listing ID is `ABC123XYZ789`
5. Define the referral link by appending the listing ID to the following URL:

   `https://signup.snowflake.com/?listing=`

   For example, the referral link for the earlier example would be:

   `https://signup.snowflake.com/?listing=ABC123XYZ789`

You can share this link with consumers to provide direct access to your listing on the Snowflake Marketplace.

---
title: Dispute resolution, enforcement, and appeals
source: https://docs.snowflake.com/en/collaboration/dispute-resolution-enforcement-appeal.md
section: Collaboration & Marketplace
---

# Dispute resolution, enforcement, and appeals

## Overview

Snowflake maintains a fair and transparent enforcement process to uphold Marketplace integrity and help protect both Providers and Consumers.

This section outlines how disputes and refunds are handled, how enforcement actions are applied for policy violations, and how Providers may appeal such actions. All Providers and Consumers are expected to comply with the Marketplace Policies and App Listing Guidelines.

Failure to do so may result in enforcement actions, up to and including suspension or removal from the Snowflake Marketplace.

## Dispute resolution process

* The Provider is the Seller of Record for their Product(s) (not Snowflake) and is fully responsible for their Product(s).
* Disputes between Providers and Consumers should be settled between those parties. Snowflake requires that Providers and Consumers reach out to each other directly. Providers are expected to reply to Consumer inquiries within 3 business days per the Provider Policies, and Consumers are expected to respond to Provider outreach within 7 business days.
* In the event that a Provider or Consumer does not respond to a complaint or inquiry within the appropriate time frames above, Snowflake may reach out to the Provider or Consumer to make them aware of the complaint or inquiry. In the event that the Provider or Consumer does not provide any response to the outreach, Snowflake will take appropriate enforcement actions based on the nature of the inquiry or complaint.
* If a Provider provides Snowflake with written confirmation by a Consumer that they agree to no longer use the Provider’s listing, Snowflake may remove the Consumer from the Provider’s listing.

## Refund policies

* Refunds must be authorized and initiated by the Provider.
* Snowflake will not get involved in cases where a chargeback was initiated with the card issuer.

Providers can initiate a refund using [this form](https://snowforce.my.site.com/s/provider-onboarding-case).

## Filing a dispute with Snowflake

If Providers and Consumers are unable to come to a resolution on a dispute, they may file a Case with Snowflake Marketplace Operations.

The Case must include:

* A description of the issue.
* All relevant information and evidence including: account information, listing information, billing periods, and documentation of the policy or terms violation.
* A record of your communications with the other party.

Use the below forms to file your dispute:

* [Provider dispute against a Consumer](https://snowforce.my.site.com/s/provider-onboarding-case)
* [Consumer dispute against a Provider](https://snowforce.my.site.com/s/consumer-reporting)

## Enforcement actions

Snowflake reserves the right to take appropriate action when a Provider or Consumer fails to comply with Marketplace policies or obligations.

Possible enforcement actions include:

* Listing removal from the Marketplace.
* Provider removal from the Marketplace.
* Revocation of ability to monetize on-platform.

All enforcement actions are applied in accordance with the Provider and Consumer Terms and based on the severity and nature of the violation.

## Appeals process

Providers who believe that an enforcement action (for example, suspension, or removal) was made in error or warrants reconsideration may submit an
appeal by filing a Case via the [Snowflake Marketplace Case Form](https://snowforce.my.site.com/s/provider-onboarding-case).

The appeal must include:

* A reference to the original enforcement decision.
* Supporting evidence or clarification.
* A detailed rationale for reconsideration.

Snowflake will review the appeal and respond with a final determination. Appeals that do not include adequate context or documentation may not be reviewed.

---
title: Enable VPS collaboration with other organizations
source: https://docs.snowflake.com/en/collaboration/virtual-private-snowflake/vps-enable-collaboration.md
section: Collaboration & Marketplace
---

# Enable VPS collaboration with other organizations

VPS collaboration with private listings must be enabled through Snowflake
Support. First, however, your organization must agree to the terms and disclaimers.
After that, you can start working with support to setup participation in private listings.

## How to sign the terms and disclaimers

Before any Snowflake customer can begin to use any type of listings, their organization administrator
must accept some terms and disclaimers – this is required one time for the entire organization.
Signing waivers must be done through the Snowflake web app. For more information about Snowflake legal terms and conditions, see
[Legal requirements for providers and consumers of listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-legal).

1. Sign in to [Snowsight](../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select your username, select Primary Role, and then scroll down and select ORGADMIN.
3. In the navigation menu, select Admin » Terms.
4. In the Snowflake Marketplace pane, click View terms for Standard Agreement for Marketplace products.
5. [Review and accept the Snowflake Provider and Consumer Terms](../provider-becoming.md) and save a copy for your records.
6. To sign the terms, select Review & Enable.
7. Review the cross-region disclaimer and save a copy for your records.
8. Select Acknowledge & Continue.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

## How to enable VPS collaboration

When ready to publish or obtain a private listing, both the provider and the
future consumer of the listing need to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to authorize
the new connection with a specific organization.

## Privileges required

When you create a listing, you create it from the account that has the data or application package in it. The role that attaches a data
product to a listing and publishes the listing must be the same role that created, and therefore owns, the application package or share.
You cannot transfer the OWNERSHIP privilege for a share.

If you use a different role to create and manage the listing, grant the MODIFY privilege on the listing to the role
that owns the application package or share. For example:

Share or application package owner role:
:   OWNERSHIP privilege on the share or application package.
    MODIFY privilege on the listing.

Listing owner role:
:   OWNERSHIP privilege on the listing.

    Global CREATE LISTING privilege.

Within the provider account, you can use one of the following to create and manage listings:

ACCOUNTADMIN:
:   If you use the ACCOUNTADMIN role to create and manage listings, the ORGADMIN role must first
    [Delegate privileges to set up auto-fulfillment](../provider-listings-auto-fulfillment-manage-privileges.md).

Custom role:
:   If you use a custom role, the ORGADMIN role must first [Delegate privileges to set up auto-fulfillment](../provider-listings-auto-fulfillment-manage-privileges.md)
    to the ACCOUNTADMIN role, which can then be used to grant the relevant privileges to the custom role.

For more information about granting sharing privileges, see [Granting Privileges to Other Roles:](../../user-guide/data-exchange-marketplace-privileges.md).

## How to disable VPS collaboration

If your organization no longer wants to offer or access private listings,
follow these steps:

**For consumers:**

1. [Delete all of the listings](../provider-listings-removing.md) that you are a consumer of, consistent with the
   applicable requirements in the Provider and Consumer Terms.
2. Delete the data that you got through listings.
3. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to have collaboration disabled for your organization.

**For providers:**

1. [Delete all of the listings](../provider-listings-removing.md) shared from your account, consistent with the applicable
   requirements in the Provider and Consumer Terms.
2. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to have collaboration disabled for your organization.

For more information, see [Removing listings as a provider](https://other-docs.snowflake.com/en/collaboration/provider-listings-removing).

---
title: Explore listings
source: https://docs.snowflake.com/en/collaboration/consumer-listings-exploring.md
section: Collaboration & Marketplace
---

# Explore listings

Explore and try out paid and limited trial listings on the Snowflake Marketplace before you purchase or request them.
If there are listings or providers you need that are currently unavailable, you can request new data or providers
on the Snowflake Marketplace.

You must be a Snowflake listing consumer to access or purchase a listing. See [Use listings as a consumer](consumer-becoming.md).

## Browse listings on the Snowflake Marketplace

You can browse listings on the Snowflake Marketplace whether you have a Snowflake account or not.

* You can see listings on the web by browsing to the [Snowflake Marketplace](collaboration-marketplace-about.md).
* You can see listings in Snowsight by doing the following steps:

  1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
  2. In the navigation menu, select Marketplace » Snowflake Marketplace.

After you find a listing relevant to your use cases, explore it further.

* Get a free listing. See [Access and install listings as a consumer](consumer-listings-access.md).
* Review the data dictionary. See Preview the contents of a listing with data dictionaries.
* Trial a paid or limited trial listing. See Trial a listing.

## View listing information

You can use the following SQL commands to view listing information:

* Run [DESCRIBE AVAILABLE LISTING](../sql-reference/sql/desc-available-listing.md) to view detailed information of listings available to you.
* Run [SHOW AVAILABLE LISTINGS](../sql-reference/sql/show-available-listings.md) to display all the listings available to you.

## Preview the contents of a listing with data dictionaries

Data dictionaries allow you to preview the contents of a listing on the Snowflake Marketplace before installing it in your
account. A data dictionary displays the tables, views, and functions within a listing.

There are two ways you can view the contents of a listing as a consumer:

* Featured objects: allow you to view the most important objects within the listing. Featured objects are selected by the
  listing provider.
* All objects: allow you to view all of the objects within a listing.

To view the contents of a listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace.
3. Search for the listing, then select it.
4. Review the items that appear under Data Dictionary.

   * To view the contents of a featured object, select the object’s name.
   * To view all of the objects within a listing, select View all objects. When viewing all objects you can browse
     through the contents of a listing or search for specific items.
   * To view the data dictionary associated with an object, select its Data Preview tab.

     If Data Preview is enabled for the selected listing, then up to 10 rows of data are displayed.

## Trial a listing

You can trial a limited trial or paid listing to explore the data available in the listing.

After you explore the data, you can [purchase the listing](consumer-listings-access.md) or
[request the full data product](consumer-listings-access.md).

> **Note:**
>
> You must use the ACCOUNTADMIN role or a custom role with the CREATE DATABASE and IMPORT SHARE privileges to trial a listing.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Snowflake Marketplace, and then search for the listing that you want to trial.

   The pricing status of each listing is indicated on the listing.

   To view only listings available to trial:

   1. Select the Data products tab.
   2. In the Pricing filter, select Free to Try.
3. Select a paid or limited trial listing. The listing description page opens and you can read about the listing.
4. Select Get. You can read the description of the trial available for the listing.
5. Enter your preferred name for the database created from the data product in the listing.
6. Select Get Data.
7. If you want to query the data in the listing immediately, select Open to open a [Snowsight worksheet](../user-guide/ui-snowsight-worksheets.md)
   containing example SQL queries for the listing’s dataset. Otherwise, select Done.

### About trialing listings

When you trial a listing, you might have any of the following types of access:

* Indefinite access to a limited set of data.
* Indefinite access to limited functionality of the Snowflake Native App.
* Time-limited access to all data or a limited set of data.
* Time-limited access to all functionality or limited functionality of the Snowflake Native App.

If you are trialing a paid listing, you are not charged for the listing when your trial ends. You are only charged if you choose to purchase
the paid listing.

Using the data product in a listing that you’re trialing works similarly to using a database that you have full access to:

* An account administrator, the role that created the database (if different than the account admin), or any role with the global
  MANAGE GRANTS privilege can grant other roles access to the database and database objects (e.g. tables, views).
* Querying data while you trial a listing incurs compute charges in your Snowflake account, but you do not incur any charges for the listing.
* If the listing provides access to a limited set of data or functionality and you attempt to perform actions that are
  accessible only to consumers with full access to the listing, you see no results.

When your time-limited trial ends, or when you have finished exploring the limited trial data product, decide what to do with the listing:

* If you decide to purchase the data product of a paid listing, see [Accessing paid listings](consumer-listings-access.md).
* If you want to request unlimited access to the full data product of a limited trial listing, see [Request a limited trial listing](consumer-listings-access.md).
* If you don’t want to purchase the data product, an account administrator can drop the database created or Snowflake Native App installed for your trial.
* If you want to continue trialing the data product but the time period of a time-limited trial has expired, other accounts in your
  organization can start a new trial of the listing.

---
title: Government providers
source: https://docs.snowflake.com/en/collaboration/provider-listings-government-providers.md
section: Collaboration & Marketplace
---

# Government providers

If you’re a government provider, this section provides information on how you can prepare to provide listings.

## Prepare to provide listings from accounts in U.S. government regions

If your account is in a [U.S. government region](../user-guide/intro-regions.md) and you want to install data products offered privately or on the Snowflake Marketplace, or
offer listings either privately or on the Snowflake Marketplace, you must review and acknowledge the following cross-region disclaimer for your
organization.

> **Important:**
>
> To get data products and share listings with Snowflake customers outside your region, Snowflake shares organization and account metadata
> and usage analytics with the customers you collaborate with outside of your region.
>
> Compliance standards, such as [FedRAMP](../user-guide/cert-fedramp.md), and support for different regulated workloads, such as [ITAR](../user-guide/cert-itar.md), might be different or unavailable
> outside of your U.S. Government Region. Consider your compliance requirements before choosing to move or share data between Snowflake regions.

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept terms once for your Snowflake account. If you do not have
> the ORGADMIN role, see [Enabling the ORGADMIN role in an account](https:/docs.snowflake.com/en/user-guide/organization-administrators).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> * Providers can enable [Egress Cost Optimizer (ECO)](provider-listings-auto-fulfillment-eco.md) in a primary account in any commercial region and create listings targeted to any other region, including government regions.
> * By default, ECO is unavailable to customers on a government cloud. If you’re a Gov customer, you can reach out to your Snowflake account executive for more information about ECO enablement.

You must use the ORGADMIN role and you only need to complete this step once for your organization:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

### Stop sharing and collaboration from an account in a US government region

If you no longer want to offer or access listings from your account in a US government region, do the following:

1. [Delete all of your listings](provider-listings-removing.md) shared from your account, consistent with the applicable
   requirements in the Provider and Consumer Terms.
2. Stop consuming listings by dropping the databases imported when you
   [accessed listings](consumer-listings-access.md).
3. [Contact Snowflake Support](../user-guide/contacting-support.md) to have data sharing and collaboration disabled for your organization.

### Limitations for providing listings from accounts in U.S. government regions

If you provide listings from an account in a U.S. government region, the following limitations apply:

* You cannot offer paid or personalized listings.
* You must use Cross-Cloud Auto-Fulfillment, and your data product can only contain
  [objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md).

Additional considerations apply to providers in non-US-government regions who want to offer listings to accounts in US government regions.
See [Considerations for sharing listings to accounts in US government regions](provider-listings-creating-publishing.md).

## Prepare to provide listings from accounts in the Kingdom of Saudi Arabia (KSA) region

If your account is in a [Europe and Middle East region](../user-guide/intro-regions.md), specifically Dammam (me-central2), and you want to install data products offered privately or on the Snowflake Marketplace, or
offer listings either privately or on the Snowflake Marketplace, you must review and acknowledge the following cross-region disclaimer for your
organization.

> **Important:**
>
> To get data products and share listings with Snowflake customers outside your region, Snowflake shares organization and account metadata
> and usage analytics with the customers you collaborate with outside of your region. Compliance standards and support for different
> regulated workloads might be different or unavailable outside of your region.
> Consider your compliance requirements before choosing to move or share data between Snowflake regions.

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept terms once for your Snowflake account. If you do not have
> the ORGADMIN role, see [Enabling the ORGADMIN role in an account](https:/docs.snowflake.com/en/user-guide/organization-administrators).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

### Stop sharing and collaboration from an account in a KSA region

If you no longer want to offer or access listings from your account in a KSA region, do the following:

1. [Delete all of your listings](provider-listings-removing.md) shared from your account, consistent with the applicable
   requirements in the Provider and Consumer Terms.
2. Stop consuming listings by dropping the databases imported when you
   [accessed listings](consumer-listings-access.md).
3. [Contact Snowflake Support](../user-guide/contacting-support.md) to have data sharing and collaboration disabled for your organization.

---
title: Guidelines and requirements for listing Apps on Snowflake Marketplace
source: https://docs.snowflake.com/en/collaboration/guidelines-reqs-for-listing-apps.md
section: Collaboration & Marketplace
---

# Guidelines and requirements for listing Apps on Snowflake Marketplace

## Overview

These guidelines define the enforced standards for publishing applications—both Snowflake Native Apps and Connected Apps—on Snowflake Marketplace.

## Native Applications

### Publish on Snowflake Marketplace

When your application package is ready to be published on the Snowflake Marketplace, you must submit it to Snowflake for review and approval.

> **Note:**
>
> The approval process required to publish an app on the Snowflake Marketplace is in addition to the [automated security scan](../developer-guide/native-apps/security-overview.md) that is run when the DISTRIBUTION property of an
> application package is set to EXTERNAL. See [Snowflake Native App listing approval flow](provider-listings-workflows.md) for details.

Before creating a listing, verify that you understand the enforced requirements and ensure that your application package follows each requirement. If an application package does not follow these requirements, your submission may be rejected.

If you receive a rejection notification for the application package you submitted, make the recommended changes and resubmit your application package for approval.

### Standards for Snowflake Native Apps on Snowflake Marketplace

The Snowflake Native App functional review process ensures the quality of apps published on Snowflake Marketplace. To provide clarity into what is evaluated during this process, the following standards apply to all Snowflake Native Apps distributed through Snowflake Marketplace.

**Immediate utility**

The app functionality must be provided within the consumer account and the app must be operational once installed.

**Standalone**

Apps must deliver product experience on Snowflake and facilitate external requirements through Snowflake functionality.

**Data-centric**

Apps should be based on data-centric use cases that leverage data stored in Snowflake.

**Transparent, simple, and secure**

Apps must use Snowflake features to disclose the app’s resource and access requirements and simplify the configuration process for the consumer.

### Enforced standards

Snowflake uses the following requirements to determine if a Snowflake Native App meets the standards for publication on Snowflake Marketplace. These requirements are verified when you submit a listing with an attached application package to Snowflake Marketplace.

1. Immediate utility

   1. Apps must not be shell apps that advertise functionality. Apps must deliver the advertised functionality.
   2. Apps must include a clear framework and instruction for utilizing app functionality.
   3. Apps should not crash, freeze, or otherwise function abnormally.
   4. Apps must list all required credentials and providers must share required credentials with Snowflake at submission for testing.
   5. If apps are not immediately actionable, they must document the expected workflow for a consumer, allowing consumers to fully install and configure the app.
2. Standalone

   1. Apps must not be pass-through. For example, they must not redirect consumers to an external service to enable the app’s core
      functionality.
   2. App interfaces must be accessible after installation directly from Snowflake.
   3. Apps cannot use the Snowflake Marketplace as a distribution platform for cross-selling external applications or services.
   4. Apps that access external services and leverage user authentication should comply with the following standards:

      1. Apps may ask consumers to create a service user in their Snowflake account only to enable access to an external service.

         > 1. Acceptable authentication methods are Programmatic Access Tokens (PAT), OAuth, or key pair. The service user must be granted only the minimum permissions necessary for the app to function.
      2. Apps that require user authentication should never require the consumer to do the following for authentication:

         > 1. Input consumer’s Snowflake username and password.
         > 2. Create a private / public key and share the private key.
3. Data-centric

   1. Apps must leverage Snowflake data in one of the following ways:

      1. Share data from the app provider’s account.
      2. Use datasets from the Snowflake Marketplace.
      3. Access data in the consumer account.
4. Transparent and simple

   1. All account-level privileges and references that the app requires must be listed in the application package manifest file.
   2. All resource requirements for the Snowflake Native App must be listed in the [marketplace.yml](../developer-guide/native-apps/marketplace-file.md) file of the app. The app must create these resources as part of installation and setup.
   3. All account-level privileges and references listed in the application package manifest file must be requested from the consumer through Snowsight or the Python Permission SDK.
   4. Apps must provide a readme file. Apps that do not include a Streamlit or custom user interface must include the following information in the readme file:

      1. A description of what the app does.
      2. The steps the consumer must perform to configure the app after it is installed.
      3. The stored procedures and user-defined functions the app uses.
      4. The privileges the app requires.
      5. Example SQL commands that show consumers how to use the app.
   5. All required SQL commands must be delivered using Snowflake and formatted as code blocks.
   6. If the app provides sample data, you must include procedures on how to use the sample data.
   7. If an application package contains a Streamlit app but does not contain a readme file, you must configure a default [Streamlit
      app](../developer-guide/native-apps/adding-streamlit.md).

### Best practices when publishing a Snowflake Native App

In addition to the requirements for submitting an application package to Snowflake Marketplace, Snowflake also recommends the following best practices when publishing a Snowflake Native App:

* Ensure that all required files are uploaded to the named stage for the version of the app you are submitting, including:

  + The manifest file.
  + The setup script.
  + The README file.
  + Any external stored procedures or user-defined functions required by the application package.
  + Any Streamlit files required by the application package.
  + Any external source code, including Python, Java, etc.
* Ensure that the version of the app you are developing passes the [automated security scan](../developer-guide/native-apps/security-overview.md).
* Test the new version of your application package by creating the application object locally by using the [CREATE APPLICATION](../sql-reference/sql/create-application.md) command.

  + Do not add a new version to your application package or set the DISTRIBUTION property to EXTERNAL while you are developing and testing
    an app. These actions trigger the [automated security scan](../developer-guide/native-apps/security-overview.md). Instead, create the
    application object using [files on a named stage](../developer-guide/native-apps/installing-testing-application.md).
  + If your app includes a Streamlit app, test the application in Snowsight to ensure the Streamlit app works as expected.
  + Verify that interactions between the Streamlit app and Snowflake Worksheets are seamless and that the consumer does not have to navigate excessively between the two.
* Review all parts of a listing before submitting it for approval.
* Ensure that there are no typos or other textual errors in the listing, readme file, and Streamlit app.

### Recommendations for trial listings

* When an app trial listing expires, Snowflake automatically suspends the app to avoid consumers incurring extra compute costs to the consumer. Snowflake only suspends the objects owned by the app that are currently active. Snowflake does not modify the status of objects that are already suspended.
* When a trial listing is converted to a full or paid listing, Snowflake attempts to re-enable the app by resuming tasks, containers, and compute pools. Snowflake only resumes services and compute pools that have the `auto_resume` property set to false.

### Recommendations for apps with containers

* Compute pools should be set to automatically suspend in combination with Snowpark Container Services jobs to avoid idle compute nodes.
* For higher availability during upgrades and to reduce cold start latency, Snowflake recommends that you set the `MIN_NODES` parameter greater than 1.
* If connections across different services are required in the same app, use the DNS name of the service instead of configuring an external access integration.

### Recommendations for event sharing

* Providers should configure an app to emit log messages and trace events that conform to
  [supported event definitions](../developer-guide/native-apps/ui-consumer-enable-logging.md) to ensure that consumers understand what information is collected.
* Mandatory event definitions should be limited to the log messages and trace events required by the app. Excessive or unnecessary mandatory event definitions should be avoided.
* Adding new mandatory event definitions in a version upgrade must require the consumer re-enable event definitions for the app.
* Use the Python Permission SDK to allow consumers to share optional events.

## Connected Apps

Snowflake allows SaaS providers to list their Connected Apps on Snowflake Marketplace. Connected Apps are integrated SaaS applications that securely connect to a Snowflake customer’s account to read or ingest specified data as part of their workflow. Connected Apps enable consumers to interact with their Snowflake data directly through an external UI.

### Requirements to publish a Connected App on Snowflake Marketplace

* **Partner network tier:** Providers must be a member of the [Snowflake Partner Network (SPN)](https://www.snowflake.com/en/why-snowflake/partners/) and hold a **Select**, **Premier**, or **Elite** tier designation.
* **CSID requirement:** Each Connected App must use a Connection String Identifier (CSID) to enable full telemetry and usage tracking. Providers are encouraged to consolidate to a single CSID per application; however, multiple CSIDs are also supported where necessary. The CSID(s) must be initially submitted through SPN and will subsequently be required in your listing submission and verified during the review process.
* **Security transparency:** Providers must complete a short **Security & Data Handling Attestation** as part of the listing process.
* **Listing type:** All connected app listings must leverage a public, paid listing and fulfill deals using standard or private offers.

### Ongoing standards for Connected Apps on Snowflake Marketplace

1. **Ecosystem contribution:** Connected Apps should meaningfully contribute to the **Snowflake Data Cloud ecosystem**, helping drive data collaboration, consumption, or workload adoption.
2. **Active partnership:** Providers must be **active contributors** to the Snowflake ecosystem. To remain listed on the Marketplace, providers must maintain their standing within the Partner Network at the Select tier or above, and their application must continue to benefit the ecosystem. Snowflake may remove a listing if the provider is no longer contributing to the ecosystem (per Snowflake’s discretion) or no longer meets partner eligibility standards.

---
title: Legal requirements for providers and consumers of listings
source: https://docs.snowflake.com/en/collaboration/collaboration-listings-legal.md
section: Collaboration & Marketplace
---

# Legal requirements for providers and consumers of listings

To start using Snowflake Marketplace, including on-platform monetization, as either
a provider or consumer, Snowflake customers must agree to additional Snowflake terms.
All providers and consumers of listings must also abide by Snowflake policies.
This page describes the legal requirements for becoming a provider or consumer.

> **Note:**
>
> Data Sharing and listings are part of the Snowflake Service, subject to your
> Service terms with Snowflake, including the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/)
> and [Snowflake Acceptable Use Policy](https://www.snowflake.com/legal/acceptable-use-policy/).
> Snowflake Marketplace, including our on-platform monetization offering,
> is not part of the Service, and is subject to the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).

## Legal requirements for consumers

To access listings as a consumer, the organization administrator (the user with the ORGADMIN role),
or another individual authorized to get listings for your Snowflake account, must agree to the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).

These Provider and Consumer Terms govern our customers’ use of Snowflake Marketplace, and address the relationship, rights,
and obligations of Snowflake and our customers in connection with this optional offering.

Consumers of listings must also comply with the applicable [Provider and Consumer Policies](https://www.snowflake.com/provider-and-consumer-policies/).
These policies help ensure a safe, reliable, and respectful experience for providers and consumers.

For more information about becoming a listing consumer, see [Use listings as a consumer](consumer-becoming.md).

## Legal requirements for providers

To provide publicly available and paid listings as a provider, the organization
administrator (user with the ORGADMIN role) must agree to the [Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms).

These Provider and Consumer Terms govern our customers’ use of the Snowflake Marketplace,
and address the relationship, rights, and obligations of Snowflake and our customers
in connection with this optional offering.

Providers of listings must also comply with the applicable [Provider and Consumer Policies](https://www.snowflake.com/provider-and-consumer-policies/).
These policies help ensure a safe, reliable, and respectful experience for providers and consumers.

For more information about becoming a listing provider, see [Use listings as a provider](provider-becoming.md).

## Legal notice on tax compliance

Where required by law, Snowflake Inc. calculates and then issues tax compliant invoices
(as applicable) to Snowflake Marketplace providers and consumers. Snowflake also collects US taxes.

To confirm tax obligations for domestic and international sales, we recommend that
both Snowflake Marketplace providers and Snowflake Marketplace consumers consult their advisors.

---
title: Listing support in Business Continuity and Disaster Recovery
source: https://docs.snowflake.com/en/collaboration/listings-bcdr.md
section: Collaboration & Marketplace
---

# Listing support in Business Continuity and Disaster Recovery

Providers can include listings and their dependencies — such as shares and databases — in [account replication and failover groups](../user-guide/account-replication-intro.md). With failover groups, if a service degradation or outage occurs, the listing relies on failover groups for data replication and disaster recovery.

> **Note:**
>
> * This feature is only available for listings that have enabled [auto-fulfillment](provider-listings-auto-fulfillment.md).
> * Review the Behavior, considerations, and constraints section before using this feature.

## Terms

* Primary listings: Listings in the primary failover group
* Secondary listings: Listings in the secondary failover group

## Understanding the need for Business Continuity and Disaster Recovery

In the event of an outage in the primary region, Business Continuity and Disaster Recovery (BCDR) becomes crucial for providers.

* Providers must continue to support their data products with minimal interruptions during an outage.
* Providers must meet service level agreements (SLAs) with respect to RTO (Recovery Time Objective) and RPO (Recovery Point Objective) to avoid financial penalties.
* Providers must maintain replicas of their data in secondary regions in the event of an outage.

### Manual configuration for failover and recovery

Providers who don’t add listings to their failover groups have higher recovery times and stale information for their consumers. Without
BCDR, providers must re-create listings in the secondary regions after failover. And then consumers must remount to new listing URLs. This
manual replication causes massive disruptions on ETL pipelines and applications, leading to extended downtime for consumers and added data
transfer costs for providers.

### Automated failover and recovery

BCDR for listings improves enterprise-readiness and decreases disruption from a failure.

* BCDR for listings eliminates the requirement for providers to re-create listings after a failover.
* The new primary region doesn’t re-replicate to the Secure Share Area (SSA) accounts, which saves data transfer costs because only incremental changes are replicated to the SSA accounts.

With BCDR support for listings, after a failover:

* Consumers still have access to provider data without downtime.
* Providers can fulfill new consumer regions from the new primary region.
* Providers can still meet consumer data freshness requirements because the listing refreshes from the new primary.

## BCDR workflow for listings

A typical BCDR workflow for listings is as follows:

1. An outage hits the region, affecting the primary region.

   * While the primary region is down, listings in consumer regions can’t refresh. As a result, consumers are operating with stale data.
2. The data recovery administrator initiates the organization’s runbook.
3. The administrator gets approval to fail over to the secondary region.

   * This secondary region becomes the new primary region.
   * The replica in the failover group becomes the new source of information for all objects.
4. The administrator refreshes the new primary with the latest updates from the data sources, such as external tables and ETL pipelines.

   * The administrator gets a snapshot of objects in the new primary to verify it has the most up-to-date data.
   * The administrator audits the new primary region to confirm whether it is ready for production.
   * After the failover is complete, auto-fulfillment begins working again at the next refresh interval from the new primary.

> **Note:**
>
> The administrator can use the [SHOW LISTINGS IN FAILOVER GROUP](../sql-reference/sql/show-listings-in-failover-group.md) command to confirm that the listings are ready for production.

## BCDR selection criteria

BCDR is not supported when:

* The listing is in draft state.
* The listing is backed by a stage.
* The listing is a paid listing.
* The listing doesn’t have [auto-fulfillment](provider-listings-auto-fulfillment.md) enabled.
* The listing is a Snowflake Native App listing.

## Behavior, considerations, and constraints

Review the sections below to understand the behavior, considerations, and constraints of BCDR for listings.

### Behavior

* If a secondary failover group is dropped, the secondary listings in the failover group are automatically dropped.

### Considerations

* Failover of externally managed Iceberg tables isn’t supported at this time, even though they are supported for auto-fulfillment. Failover
  of managed Iceberg tables is currently in [Public Preview](../user-guide/tables-iceberg-replication.md).
* Some features might not be supported for failover under the database but could be supported for failover under the listing. The unsupported objects will be ignored during replication.

### Constraints

Ensure that you understand the following constraints before configuring BCDR for listings:

* Complete subset constraints (all-or-nothing rule)
* Failover group object type requirements
* Listing auto-fulfillment setup constraints
* Internal Marketplace listings constraints
* Profile replication constraints
* Read-only secondary listing constraints

#### Complete subset constraints (all-or-nothing rule)

* When adding an object to a failover group, if any object is referenced by a listing that has auto-fulfillment enabled, all objects referenced by that same listing must be included in the same failover group.
* When removing an object from a failover group, if any object is referenced by a listing that has auto-fulfillment enabled, all objects referenced by that same listing must be removed together.

#### Failover group object type requirements

When a failover group contains databases or shares that are referenced by listings that have auto-fulfillment enabled, the failover group must include `LISTINGS` in its `OBJECT_TYPES` parameter. For example:

```sqlexample
CREATE FAILOVER GROUP provider_dr_fg
  OBJECT_TYPES = DATABASES, LISTINGS
  ...
```

#### Listing auto-fulfillment setup constraints

* Before enabling auto-fulfillment on a listing or publishing a listing that has auto-fulfillment enabled, all listing dependencies — including shares and databases — must be configured in a failover group that includes the `LISTINGS` object type.
* Auto-fulfillment must be enabled on the secondary account manually (if it’s not already enabled). For more information, see [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../sql-reference/functions/system_enable_global_data_sharing_for_account.md).
* When the [REFERENCE_USAGE](../sql-reference/sql/grant-privilege-share.md) privilege is used by listings, there can be objects that aren’t
  directly related to the listing but that also counted as part of the subset in the complete subset constraint.

#### Internal Marketplace listings constraints

The following constraints apply to listings on the Internal Marketplace (organizational listings):

##### Request approval workflow

* If a consumer submits a request for an organizational listing that hasn’t been approved, and the system fails over to a secondary
  deployment, the replica provider won’t see the consumer’s request. This is because requests are tied to the deployment where the requests
  were originally made. The consumer needs to re-request the listing.
* Attempting to unpublish the listing after failing back to the primary region will fail in the following scenario:

  + A consumer requests a listing that’s in a primary region and in its failover regions, and
  + The consumer re-requests the listing, which was approved while in the secondary region.

  This failure occurs because the original pending request remains. To successfully unpublish, the provider must explicitly reject the
  original request and then re-attempt the unpublish operation.

##### Data Dictionary

* Featured objects aren’t part of the failover replication process. As a result, any featured objects selected on the primary instance will
  not be reflected on the secondary instance after a failover. The provider must manually reset these objects after a failover. If the
  provider doesn’t reset the featured objects, the consumer would see a stale Data Dictionary, even if they add new tables. This is because the background job skips this listing. The background job will pick up this listing after featured objects are set.
* If any modifications are made to featured objects while the system is operating as a replica, those changes won’t be synchronized back to the original primary instance after failback.

##### Data Preview

* Data preview information is not replicated to secondary regions. As a result, after a failover, consumers won’t see any Data Preview
  files. On the secondary region, the provider must regenerate Data Preview files.
* Similar to Data Dictionary, any changes made to Data Preview during a failover state won’t be synchronized back to the original primary after failback. The provider can reset the data preview information on the original primary after failback.

##### Organization profiles

* Both the primary provider and the secondary provider must use a [profile](../user-guide/collaboration/organization-profiles/org-profiles-create-manage.md) that can publish the listing.

#### Profile replication constraints

* If profiles aren’t replicated by a failover group, then listings on the secondary account continue to work with no profiles attached.
* If profiles aren’t replicated by a failover group, the original primary account’s profiles will remain unchanged after a failover and failback refresh.
* The secondary account’s profiles are read-only until failover happens. After failover, the new primary account can create, alter, or drop profiles.
* If the secondary account has an existing local profile, the initial failover group refresh will fail intentionally to avoid potential data loss. Follow the steps in the query result message to proceed.
* Profile approval requests aren’t replicated. If there is any pending approval request in the original primary account, then after failover, the new primary account can re-request approval.

#### Read-only secondary listing constraints

Secondary listings can’t be modified directly. All write operations, such as ALTER and DROP, must be performed on the primary listing.

---
title: Manage listing requests as a provider
source: https://docs.snowflake.com/en/collaboration/provider-listings-managing.md
section: Collaboration & Marketplace
---

# Manage listing requests as a provider

You might get requests for a listing depending on how you offer your data product:

* You offer a limited trial of a paid listing and consumers request the full data product after a trial.
* You offer a free listing and choose to manually fulfill your data product in remote regions.

## Review and respond to listing requests

To view requests for the data product attached to a listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the Listings section, locate the listing for which you want to view requests.
4. Select Consumer Requests to view requests from consumers.

   You can review details about the consumer requesting the data product, such as their Snowflake region, company, contact information,
   and a brief message from the consumer.

Depending on the type of listing you offer, take next steps:

| Listing | Next steps |
| --- | --- |
| Free listing with manual fulfillment | To fulfill a consumer request for a free listing in a remote region, manually replicate the listing. See Manually replicate data to fulfill a listing request. |
| Limited trial of a paid listing | To fulfill a consumer request for the full data product after evaluating your limited trial, contact the consumer and share a data product with them privately. See Fulfill a limited trial request for a full data product. |

Email notifications are sent to providers to notify them of requests. In Snowsight, you can view them on the Requests
tab in External sharing or [Provider Studio](https://app.snowflake.com/#/provider-studio).

## Fulfill a limited trial request for a full data product

When you publish a paid listing, consumers can get the trial and request the full data product.

After a consumer requests the full data product, fulfill their request by following these steps:

1. Optionally contact the consumer to gather more details about their data product request and if relevant, negotiate payment terms.
2. Prepare the data product for the consumer. See [Prepare to offer a limited trial listing](provider-listings-preparing.md).
3. Fulfill the data product request by publishing a private listing to the consumer. See [Share data or apps with specific consumers using a private listing](provider-listings-creating-publishing.md).

## Fulfill a listing request from a remote region

If you cannot use Cross-Cloud Auto-Fulfillment, such as if your data product contains objects other than the [objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md),
and a consumer in a remote region wants to get your listing, you must manually replicate the data product to fulfill the consumer request
in that region.

### Manually replicate data to fulfill a listing request

If you offer free listings with manual data product fulfillment, you must manually replicate the data product to other regions when
consumers request your listing.

To manually replicate the data product to other regions, you must do the following:

1. Set up accounts in the regions where you make your listing available. The remote accounts must be part of the same organization as
   the account you published the listing from.
2. Replicate the data product to each account. You do not need to replicate the data to a region until a consumer in that region requests it.

See [Share data securely across regions and cloud platforms](../user-guide/secure-data-sharing-across-regions-platforms.md) for details on creating accounts in the relevant remote regions and
replicating the data shares used by your listings.

After completing those steps, you can approve listing requests.

### Approve a listing request

To approve and fulfill listing requests, you must use a role that has been granted or inherits the OWNERSHIP or MODIFY privilege on the listing.

To approve a request for a data product by a specific consumer:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the listing for which you want to view requests.
4. For the listing, select Consumer Requests.
5. Select the consumer request that you want to fulfill.
6. Sign in to the account in the remote region where the consumer is located.
7. Select the role in that account that has the OWNERSHIP privilege on the share and the shared database objects, or has the necessary
   privileges on the database objects to be able to add them to a share.
8. Choose Select Data.
9. If a secure share exists, navigate to the share, and select it. If a share does not exist, navigate to the desired database, and select the
   database objects you want to add to the share.

   > **Note:**
   >
   > If you do not see a share, it is either already attached to another listing, or has been previously shared with consumers.
10. Select Associate Selected Data to associate the share in the remote region to this listing.
11. Select Done.
12. Select Fulfill Request.

The consumer request for that region shows as approved and subsequent consumers in that region can instantly get your data product.

To approve all requests for a data product in a specific region:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the listing for which you want to view requests.
4. In the message noting that Data unavailable in some regions, select associate shares.
5. For the Region, select the region that you want to fulfill the data product to.
6. For Configuration:

   1. Sign in to an account or choose an account that you are already signed in to.
   2. Select the role that owns the data that you want to share.
   3. Select the data objects or secure share that you want to attach to the listing in the remote region. If you choose data objects,
      a secure share is created for you.
   4. Select Associate Selected Data.
7. Select Done.

The requests for any consumers in that region show as approved.

### Viewing shares in a remote region

If you manually replicated data for a listing in a remote region and want to view the shares attached to the listing, you must sign in to
the remote account you used to attach the share to the listing.

---
title: Manage listings with SQL as a consumer - examples
source: https://docs.snowflake.com/en/collaboration/consumer-listings-progaccess-examples.md
section: Collaboration & Marketplace
---

# Manage listings with SQL as a consumer - examples

The following are examples of the common tasks that consumers can complete programmatically with SQL commands:

* Show available listings
* [DESCRIBE AVAILABLE LISTING](../sql-reference/sql/desc-available-listing.md)
* Request a listing and automatically poll for availability
* Create a database from a listing
* End-to-end example

## Show available listings

Shows the listings available to the consumer running the command. For more information about the SHOW AVAILABLE LISTINGS command, see SHOW AVAILABLE LISTINGS.

| Description | Notes |
| --- | --- |
| Show the available listings. | Use `IS_SHARED_WITH_ME = TRUE` to show only the listings shared privately with the consumer running the command. Use `IS_IMPORTED = TRUE` to show only imported listings. |

```sqlexample
SHOW AVAILABLE LISTINGS
```

## Describe available listings

After running SHOW AVAILABLE LISTINGS to identify the available listings and the global listing names, a consumer can run DESCRIBE AVAILABLE LISTING to return descriptions of the columns in the listings that are available to them. For more information about the DESCRIBE AVAILABLE LISTING command, see [DESCRIBE AVAILABLE LISTING](../sql-reference/sql/desc-available-listing.md).

| Description | Notes |
| --- | --- |
| Describe the listing columns. | Use `listing_global_name` to identify the specific global listing to describe. When the `is_ready_for_import` column is `TRUE`, the data is already present in the region and can be imported by the consumer immediately. |

```sqlexample
DESCRIBE AVAILABLE LISTING < listing_global_name >
```

## Request a listing and automatically poll for availability

After running SHOW AVAILABLE LISTINGS to identify the available listings, a consumer can use the `SYSTEM$REQUEST_LISTING_AND_WAIT` stored procedure to request a listing and automatically poll for availability. A consumer can also use this stored procedure when the `is_ready_for_import` column is `FALSE`. For more information about the `SYSTEM$REQUEST_LISTING_AND_WAIT` stored procedure, see [SYSTEM$REQUEST_LISTING_AND_WAIT](../sql-reference/stored-procedures/system_request_listing_and_wait.md).

| Description | Notes |
| --- | --- |
| Request a specific listing and poll for availability. | `<timeout_mins>` specifies the listing fulfillment waiting period in minutes. The default is 240 minutes or 4 hours.  When a requested listing becomes available or is already available, the message `Success: Listing <listing_global_name> is ready to be imported` is returned.  If the timeout period is exceeded, the message `Error: Timed out waiting for the listing to be available after <timeout_mins> min(s)` is returned.  To request a listing without waiting for listing fulfillment, enter 0 (zero) for the `<timeout_mins>` value. When the value is 0, the message `Success: Listing <listing_global_name> requested successfully, but not waiting to confirm fulfillment` is returned. |

```sqlexample
CALL SYSTEM$REQUEST_LISTING_AND_WAIT( ' <listing_global_name> ' [ , <timeout_mins>. ] );
```

## Create a database from a listing

After requesting a listing, a consumer can use the CREATE DATABASE … FROM LISTING … command to create a database from a listing. For more information about the CREATE DATABASE … FROM LISTING … command, see [CREATE DATABASE … FROM LISTING …](../sql-reference/sql/create-database.md).

| Description | Notes |
| --- | --- |
| Create a database from a listing. | `<name>` specifies the database identifier. It must be unique for your account. The identifier must start with an alphabetic character and cannot contain spaces or special characters unless the entire identifier string is enclosed in double quotes. For example `"My object"`. Identifiers enclosed in double quotes are also case-sensitive. |

```sqlexample
CREATE DATABASE <name> FROM LISTING '<listing_global_name>';
```

## End-to-end example

The following example shows how to use the SQL commands described above to manage listings as a consumer. The example assumes that the consumer has already been granted access to a listing for COVID-19 data named `GZ1MXZFTF1` and that the listing is available in the consumer’s region. The example also assumes that the consumer has been granted the `sysadmin` role, which is required to create a database from a listing.

```sqlexample
-- Switch to sysadmin role
USE ROLE sysadmin;

-- Show available listings with a filter for shared listings
-- Note that you can optionally filter for private shared listings using IS_SHARED_WITH_ME = TRUE
-- The example assumes that the response returns a listing with a listing_global_name of GZ1MXZFTF1
SHOW AVAILABLE LISTINGS;

-- Get the global name and title of listings and filter on the title
SELECT "global_name", "title"
  FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
  WHERE "is_imported" = false
    AND "title" LIKE '%COVID-19%';

-- Request the listing returned in `SHOW AVAILABLE LISTINGS` and wait for completion
CALL SYSTEM$REQUEST_LISTING_AND_WAIT('GZ1MXZFTF1');

-- Accept legal terms for the listing. Email verification is required to create the database from listing GZ1MXZFTF1
CALL SYSTEM$ACCEPT_LEGAL_TERMS('DATA_EXCHANGE_LISTING', 'GZ1MXZFTF1');

-- Create database from the listing
CREATE DATABASE test_california_covid_import
  FROM LISTING 'GZ1MXZFTF1';

-- Use the new database
USE DATABASE test_california_covid_import;

-- Query the 'COVID.CASES' table and limit the results to 100 rows
SELECT * FROM COVID.CASES LIMIT 100;
```

---
title: Manage privileges for auto-fulfillment
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-manage-privileges.md
section: Collaboration & Marketplace
---

# Manage privileges for auto-fulfillment

After auto-fulfillment is enabled on an account, the ACCOUNTADMIN role can delegate the MANAGE LISTING AUTO FULFILLMENT privilege to other roles in the account, revoke the privileges, and determine whether the privileges have been delegated to a specific account within their organization.

## Delegate privileges to set up auto-fulfillment

SQL

After enabling auto-fulfillment on an account, the ACCOUNTADMIN role can grant the MANAGE LISTING AUTO FULFILLMENT privilege to other roles in the account.

```sqlsyntax
USE ROLE ACCOUNTADMIN;
GRANT MANAGE LISTING AUTO FULFILLMENT ON ACCOUNT TO ROLE <role_name>;
```

The ACCOUNTADMIN role can also revoke the MANAGE LISTING AUTO FULFILLMENT privilege.

```sqlsyntax
USE ROLE ACCOUNTADMIN;
REVOKE MANAGE LISTING AUTO FULFILLMENT ON ACCOUNT FROM ROLE <my_role>;
```

## Verify whether auto-fulfillment is enabled for an account

SQL

To determine whether auto-fulfillment is enabled on an account, call the [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](../sql-reference/functions/system_is_global_data_sharing_enabled_for_account.md) system function. The arguments for this system function are described below.

Calling this system function requires the ORGADMIN role.

```sqlsyntax
SELECT SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT(
  '<account_name>'
  );
```

Where:

`account_name`
:   Specifies the name of the account for which you want to check if users with the ACCOUNTADMIN role can manage auto-fulfillment. See [Finding the organization and account name for an account](../user-guide/admin-account-identifier.md).

---
title: Manage regions and replication
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-manage-regions.md
section: Collaboration & Marketplace
---

# Manage regions and replication

To manage or monitor additional auto-fulfillment settings for your listing, do the following:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the row for the listing that you want to manage.
4. From the listing details page, access the auto-fulfillment settings:

   1. For a listing offered on the Snowflake Marketplace, in the Region Availability section, select Manage.
   2. For a listing offered to specific consumers, in the Consumer Accounts section, select ….
5. Select Manage Regions & Replication to see the regions where the listing is fulfilled and the status of auto-fulfillment. You can add or remove availability for a particular region or check the status. Select a region to see the timestamp of the last sync and how many consumers are accessing the data.

   > * If no consumers have accessed your listing’s product in a region, you can select Remove Region.
   > * If a consumer has accessed your listing’s product in a region, you cannot remove the region. Instead, if you want to remove your data product from that region, all consumers using the product must drop the database or application first, or you must delete the listing.

For more details about modifying listings, see [Modify published listings](https://other-docs.snowflake.com/collaboration/provider-listings-modifying).

---
title: Manage your provider profile
source: https://docs.snowflake.com/en/collaboration/provider-profiles-managing.md
section: Collaboration & Marketplace
---

# Manage your provider profile

This topic provides information on managing your provider profile.

## Provider profile fields

When you create or modify your provider profile, you can change a number of fields. The following table describes the fields required for
creating and configuring your provider profile in the Snowflake Marketplace.

| Field Name | Description | Example |
| --- | --- | --- |
| Company Icon | A high-resolution image of your logo in the JPG or PNG format. The file size cannot exceed 2 MB. Square or circle 256px by 256px version of your company logo is recommended. | image.jpg |
| Company Name | Name of your company, which is displayed below the logo image on your listing tile. This is not the name of your Snowflake account. The company name is used as the name of the provider profile. As a provider, you can have more than one provider profile (the provider nickname must be unique for each profile). When you publish a listing, you associate it with a provider profile. | Example Company |
| Company Description | A short introduction (2-3 sentences) about your company, the provider. | Example Company, recognized and documented as the most accurate source of weather forecasts and warnings in the world, has saved tens of thousands of lives, prevented hundreds of thousands of injuries and tens of billions of dollars in property damage. With global headquarters in Palo Alto, CA and other offices around the world, Example Company serves more than 1.5 billion people daily to help them plan their lives. |
| Consumer Contact Email | An email that receives email notifications when a data consumer requests access to your data. The email also appears under Contact Provider on your listing. Providers often create an email alias so several people within their organization can respond to inquiries. Per the Snowflake Provider terms, requests should be responded to within 24 hours, ideally within hours. | `sales@example.com` |
| Support Link or Email | A link (URL) or an email for consumers to contact you for technical support related to the data you are providing. Please make sure to similarly respond to consumer requests quickly. | `support@example.com` |
| Privacy Policy Link | A link (URL) to the provider’s privacy policy. The link is not required for personalized shares. The URL should not be locked behind any login screens or walls. See the Snowflake [Provider Policies](https://www.snowflake.com/provider-policies/) for more information. | `https://www.example.com/privacy` |
| Business Contact Email | An email address for Snowflake to contact the provider with questions about listings. This email address is also used to notify providers when a listing associated with the profile is approved or denied. | `admin@example.com` |
| Technical Contact Email | An email address for Snowflake to contact the provider about shared data. This email address is also used to notify providers when a listing associated with the profile is approved or denied. | `operations@example.com` |

## Edit your provider profile

You can edit your provider profile at any time. Most updates to your profile must be reviewed and approved by Snowflake before they become
visible in the Snowflake Marketplace. Updating the Business Contact and Technical Contact fields in your provider profile does not
require approval from Snowflake.

After your updated profile is approved, the changes are visible for all listings associated with your provider profile.

To modify a provider profile, you must be the owner of the provider profile or you must use a role that has the MODIFY privilege on the
profile. For more information, see [Granting provider privileges to other roles in the Snowflake Marketplace or a Data Exchange](../user-guide/data-exchange-marketplace-privileges.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. Switch to a role that has the [MODIFY privilege on the profile](../user-guide/data-exchange-marketplace-privileges.md).
3. In the navigation menu, select Marketplace » Provider Studio » Profiles.
4. Select the profile you want to update.
5. In the Manage drop-down menu, select Update Profile.
6. Edit the profile and then click Submit for Approval.

## Delete your provider profile

You can delete your provider profile as long as your profile is not associated with any listings, either published or unpublished.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. Switch to a role that has the [MODIFY privilege on the profile](../user-guide/data-exchange-marketplace-privileges.md).
3. In the navigation menu, select Marketplace » Provider Studio » Profiles.
4. Select the profile you want to delete.
5. In the Manage drop-down menu, select Delete Profile.

   > **Note:**
   >
   > If the Delete Profile option is inactive, make sure that no listings are associated with the profile.
6. Click Delete.

---
title: Managing and monitoring auto-fulfillment settings
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-monitor.md
section: Collaboration & Marketplace
---

# Managing and monitoring auto-fulfillment settings

Manage your listing to review the regions where consumers are using your listing, make changes to the refresh interval for your listing, and monitor the cost of auto-fulfillment.

> **Note:**
>
> You must use a role with the [Required privileges to perform auto-fulfillment tasks](provider-listings-auto-fulfillment-setup.md) for configuring auto-fulfillment.

---
title: MARKETPLACE_DISBURSEMENT_REPORT View
source: https://docs.snowflake.com/en/collaboration/views/marketplace-disbursement-report-ds.md
section: Collaboration & Marketplace
---

Schema:
:   [Data Sharing Usage](../../sql-reference/data-sharing-usage.md)

# MARKETPLACE_DISBURSEMENT_REPORT View

The MARKETPLACE_DISBURSEMENT_REPORT view in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema lets you query the history of your earnings from
paid listings in the Snowflake Marketplace.

The view includes the history for a specific listing. Only visible to providers of paid listings, this view includes the history of payment statuses per invoice for purchased listings.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| STRIPE_DISPLAY_NUMBER | VARCHAR | The Stripe invoice or display number. |
| EVENT_DATE | DATE | Date when the payment event occurred. |
| EVENT_TYPE | VARCHAR | Type of event (payment). |
| INVOICE_DATE | DATE | Date of the invoice. |
| LISTING_NAME | VARCHAR | Identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name for the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model). Possible values: `FIXED`: Per-month charges. Also includes per-query charges if included by the provider in the pricing plan for the listing. `VARIABLE`: Per-query charges only. |
| GROSS | DECIMAL | Gross amount billed to the consumer. |
| FEES | DECIMAL | Pre-tax fees, owed to Snowflake by the provider. Snowflake subtracts the fees from the gross amount. |
| TAXES | DECIMAL | Sales tax (on the fees), owed to Snowflake by the provider. Snowflake subtracts the taxes from the gross amount. |
| NET_AMOUNT | DECIMAL | Actual amount to be paid to the provider. The equation for this is: `NET_AMOUNT` = `GROSS` - `FEES` - `TAXES`. |
| CURRENCY | VARCHAR | USD |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator for the consumer account. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Name of the consumer organization. |

## Usage Notes

* Latency for the view can be up to 48 hours (2 days).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total amount disbursed to a provider’s bank account for each month for each listing:

```sqlexample
SELECT
  event_date
, listing_name
, listing_display_name
, listing_global_name
, currency
, SUM(net_amount) AS net_amount
FROM snowflake.data_sharing_usage.marketplace_disbursement_report
WHERE event_type = 'payment'
GROUP BY 1,2,3,4,5;
```

Retrieve the total amount that has been disbursed for each invoice period, grouped by listing and charge type. Note that the invoice period
could be spread out over multiple report dates:

```sqlexample
SELECT
  invoice_date
, listing_name
, listing_display_name
, listing_global_name
, charge_type
, currency
, SUM(gross) AS gross
, SUM(fees) AS fees
, SUM(taxes) AS taxes
, SUM(net_amount) AS net_amount
FROM snowflake.data_sharing_usage.marketplace_disbursement_report
WHERE event_type = 'payment'
GROUP BY 1,2,3,4,5,6;
```

---
title: MARKETPLACE_DISBURSEMENT_REPORT View
source: https://docs.snowflake.com/en/collaboration/views/marketplace-disbursement-report-org.md
section: Collaboration & Marketplace
---

Schema:
:   [Organization Usage](../../sql-reference/organization-usage.md)

# MARKETPLACE_DISBURSEMENT_REPORT View

The MARKETPLACE_DISBURSEMENT_REPORT view in the [Organization Usage](../../sql-reference/organization-usage.md) schema lets you query the history of your earnings from paid
listings in the Snowflake Marketplace.

The view includes the history for all accounts in your Snowflake organization. Only visible to providers of paid listings, this view includes the history of payment statuses per invoice for purchased listings.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| STRIPE_DISPLAY_NUMBER | VARCHAR | The Stripe invoice or display number. |
| EVENT_DATE | DATE | Date when the payment event occurred. |
| EVENT_TYPE | VARCHAR | Type of event (payment). |
| INVOICE_DATE | DATE | Date of the invoice. |
| LISTING_OWNER_ACCOUNT_NAME | VARCHAR | Name of the provider account that owns the listing. |
| LISTING_OWNER_ACCOUNT_LOCATOR | VARCHAR | Account locator for the provider account that owns the listing. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| LISTING_NAME | VARCHAR | Identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid listings pricing models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model). Possible values: `FIXED`: Per-month charges. Also includes per-query charges if included by the provider in the pricing plan for the listing. `VARIABLE`: Per-query charges only. |
| GROSS | DECIMAL | Gross amount billed to the consumer. |
| FEES | DECIMAL | Pre-tax fees, owed to Snowflake by the provider. Snowflake subtracts the fees from the gross amount. |
| TAXES | DECIMAL | Sales tax (on the fees), owed to Snowflake by the provider. Snowflake subtracts the taxes from the gross amount. |
| NET_AMOUNT | DECIMAL | Actual amount to be paid to the provider. The equation for this is: `NET_AMOUNT` = `GROSS` - `FEES` - `TAXES`. |
| CURRENCY | VARCHAR | USD |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator for the consumer account. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Name of the consumer organization. |

## Usage Notes

* Latency for the view can be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total amount disbursed to a provider’s bank account for each month for each listing:

```sqlexample
SELECT
  event_date
, listing_name
, listing_display_name
, listing_global_name
, currency
, SUM(net_amount) AS net_amount
FROM snowflake.organization_usage.marketplace_disbursement_report
WHERE event_type = 'payment'
GROUP BY 1,2,3,4,5;
```

Retrieve the total amount that has been disbursed for each invoice period, grouped by listing and charge type. Note that the invoice period
could be spread out over multiple report dates:

```sqlexample
SELECT
  invoice_date
, listing_name
, listing_display_name
, listing_global_name
, charge_type
, currency
, SUM(gross) AS gross
, SUM(fees) AS fees
, SUM(taxes) AS taxes
, SUM(net_amount) AS net_amount
FROM snowflake.organization_usage.marketplace_disbursement_report
WHERE event_type = 'payment'
GROUP BY 1,2,3,4,5,6;
```

---
title: MARKETPLACE_LISTING_INVOICE_STATUS view
source: https://docs.snowflake.com/en/collaboration/views/marketplace_listing_invoice_status.md
section: Collaboration & Marketplace
---

Schema:
:   [Data Sharing Usage](../../sql-reference/data-sharing-usage.md)

# MARKETPLACE_LISTING_INVOICE_STATUS view

The MARKETPLACE_LISTING_INVOICE_STATUS view in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema lets you query the history of invoices related to paid listings in the Snowflake Marketplace.

Only visible to providers of paid listings, this view includes the history of payment statuses per invoice for purchased listings.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| STRIPE_DISPLAY_NUMBER | VARCHAR | The Stripe invoice or display number. |
| INVOICE_DATE | DATE | Date of invoice. |
| USAGE_MONTH | VARCHAR | The first month when an invoice is generated, in `YYYY-MM-01` format. For example, if the consumer purchases the listing on 12-MAR-2024, then the date in this field is `2024-03-01`. |
| INVOICE_STATUS | VARCHAR | Status of the invoice. Possible values: `closed` Paid to Snowflake; paid to providers within 30 days. `open` Not yet paid. `void` Canceled. `rebilled` Indicates that a voided invoice was rebilled to make an adjustment. If an invoice is canceled and rebilled, there are two rows for that invoice number: one `void` and one `rebilled`; the invoice created to bill the consumer again has a new number and is `open`. |
| PO_NUMBER | VARCHAR | Purchase order (PO) number specified by the consumer to buy a listing. The PO number is manually entered by the consumer. |
| CURRENCY | VARCHAR | Always USD (alternate currencies are not supported in this view). |
| TOTAL_BILLED_AMOUNT | DECIMAL | Total amount billed to the consumer in USD. This amount includes consumer’s taxes. that apply to the consumer and provider fees. |
| SALES_TAX_AMOUNT | DECIMAL | The sales tax in USD payable by the consumer. This amount is included in the `TOTAL_BILLED_AMOUNT` column amount. |
| FEES | DECIMAL | Provider fees. This amount is included in the `TOTAL_BILLED_AMOUNT` column amount. |
| EXPECTED_PAYOUT_AMOUNT | DECIMAL | The total expected payout to the provider in USD. This value is calculated by subtracting `SALES_TAX_AMOUNT` and `FEES` from `TOTAL_BILLED_AMOUNT`. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Organization name of the consumer. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Account name of the consumer. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer. |
| CONSUMER_COMPANY_NAME | VARCHAR | Company name of the consumer. |
| CONSUMER_BILLING_EMAIL_ADDRESS | VARCHAR | Email address associated with billing for the consumer. |

## Usage Notes

* Latency for the view can be up to 48 hours (2 days).
* The data is retained for 365 days (1 year).

## Examples

Retrieve billing information for export.

```sqlexample
SELECT
  stripe_display_number AS snowflake_mp_invoice_number,
  invoice_date,
  usage_month AS first_billing_month,
  invoice_status,
  po_number,
  currency,
  total_billed_amount,
  listing_display_name,
  listing_global_name,
  consumer_organization_name,
  consumer_account_name,
  consumer_account_locator,
  consumer_company_name,
  consumer_billing_email_address
FROM snowflake.data_sharing_usage.marketplace_listing_invoice_status;
```

Retrieve details of unpaid invoices by consumer.

```sqlexample
SELECT
  consumer_account_name,
  consumer_account_locator,
  SUM( total_billed_amount ) AS total_outstanding
FROM snowflake.data_sharing_usage.marketplace_listing_invoice_status
WHERE invoice_status IN ('open')
GROUP BY ALL;
```

---
title: MARKETPLACE_PAID_USAGE_DAILY View
source: https://docs.snowflake.com/en/collaboration/views/marketplace-paid-usage-daily-ds.md
section: Collaboration & Marketplace
---

Schema:
:   [Data Sharing Usage](../../sql-reference/data-sharing-usage.md)

# MARKETPLACE_PAID_USAGE_DAILY View

As a consumer, you can use the MARKETPLACE_PAID_USAGE_DAILY view in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema to query the daily history of your
usage of a specific paid listing. Retrieve the charges for the usage and the count of queries executed by your users on specific listings.

This view includes the history of consumer payments for a specific listing.

> **Note:**
>
> * As part of the Offers preview, the value in the UNIT_PRICE column varies for current and preview functionality.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPORT_DATE | DATETIME | Date when the report was run. |
| USAGE_DATE | DATETIME | Usage date. |
| PROVIDER_NAME | VARCHAR | Provider display name from listing. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | Account name of the provider. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the provider account. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| PROVIDER_ORGANIZATION_NAME | VARCHAR | Organization name for the provider. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| DATABASE_NAME | VARCHAR | Name of the database associated with this listing. |
| PO_NUMBER | VARCHAR | Purchase order number associated with this listing. |
| PRICING_PLAN | VARIANT | JSON value that includes the specifics of the pricing plan. Only included in the output for paid usage. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).  Possible values:   * `SAMPLE`: No charge. The queries were executed within the trial period for the listing. * `FIXED`: Per-month charges. * `GRACE`: No charge. The queries were counted among the free queries allowed in the calendar month (after the first query)   before the per-query charge is applied. * `VARIABLE`: Per-query charges. * `MAX_VARIABLE_USAGE_REACHED`: No charge. The queries were executed after the maximum total monthly cost for this listing   was reached. * `NON_MONETIZABLE_BILLING_EVENTS`: No charge. These billable events were emitted during trial usage of a data product,   or for billable events not part of a pricing plan on the listing. * `MONETIZABLE_BILLING_EVENTS`: Custom event billing charges. * `MAX_BILLING_EVENT_USAGE_REACHED`: No charge. These billable events were emitted after the maximum total monthly cost for   the listing was reached.   Additional values are part of preview functionality:   * SPCS_COMPUTE_POOL_SURCHARGE: The amount of the SPCS compute pool surcharge. * MAX_SPCS_COMPUTE_POOL_SURCHARGE_REACHED: No further charge. When the consumer ran additional   queries, they had already reached the maximum total SPCS compute pool surcharge for this listing. |
| UNITS | VARCHAR | Number of queries included in the charge. For a `FIXED` charge, this value is `1`. |
| UNIT_PRICE | DECIMAL | Current functionality: The per-month or per-query fee. For sample data, the value is `0`.  Preview functionality: The discounted price of the listing. |
| CHARGE | DECIMAL | Total charge for this line item on this day (without tax). |
| CURRENCY | VARCHAR | USD |

## Usage Notes

* Latency for the view can be up to 48 hours (2 days).
* The data is retained for 365 days (1 year).
* You can only see the usage for your account, which must be the consumer account that generated the charges.

## Examples

Retrieve the total amount charged per month and listing:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, listing_display_name
, listing_global_name
, SUM(charge) AS charge
FROM snowflake.data_sharing_usage.marketplace_paid_usage_daily
GROUP BY 1,2,3;
```

Retrieve the total amount charged per month, listing, and type of charge:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, listing_display_name
, listing_global_name
, SUM(charge) AS charge
, charge_type
FROM snowflake.data_sharing_usage.marketplace_paid_usage_daily
GROUP BY 1,2,3,4;
```

Retrieve the total amount charged for usage of an application that uses the Custom Event Billing pricing plan:

```sqlexample
SELECT listing_global_name,
   listing_display_name,
   charge_type,
   charge
FROM SNOWFLAKE.DATA_SHARING_USAGE.MARKETPLACE_PAID_USAGE_DAILY
WHERE charge_type='MONETIZABLE_BILLING_EVENTS';
```

---
title: MARKETPLACE_PAID_USAGE_DAILY View
source: https://docs.snowflake.com/en/collaboration/views/marketplace-paid-usage-daily-org.md
section: Collaboration & Marketplace
---

Schema:
:   [Organization Usage](../../sql-reference/organization-usage.md)

# MARKETPLACE_PAID_USAGE_DAILY View

You can use the MARKETPLACE_PAID_USAGE_DAILY view in the [Organization Usage](../../sql-reference/organization-usage.md) schema to query the daily history of your usage of paid
listings. Retrieve the count of queries executed by users in your account on individual listings, with the charges for the usage.

The view includes this history for all accounts in your Snowflake organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPORT_DATE | DATETIME | Date when the report was run. |
| USAGE_DATE | DATETIME | Usage date. |
| PROVIDER_NAME | VARCHAR | Provider display name from listing. |
| PROVIDER_ACCOUNT_NAME | VARCHAR | Account name of the provider. |
| PROVIDER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the provider account. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| PROVIDER_ORGANIZATION_NAME | VARCHAR | Organization name for the provider. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer account. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| DATABASE_NAME | VARCHAR | Name of the database associated with this listing. |
| PO_NUMBER | VARCHAR | Purchase order number associated with this listing. |
| PRICING_PLAN | VARIANT | JSON value that includes the specifics of the pricing plan. Only included in the output for paid usage. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).  Possible values:   * `SAMPLE`: No charge. The queries were executed within the trial period for the listing. * `FIXED`: Per-month charges. * `GRACE`: No charge. The queries were counted among the free queries allowed in the calendar month (after the first query)   before the per-query charge is applied. * `VARIABLE`: Per-query charges. * `MAX_VARIABLE_USAGE_REACHED`: No charge. The queries were executed after the maximum total monthly cost for this listing   was reached. * `NON_MONETIZABLE_BILLING_EVENTS`: No charge. These billable events were emitted during trial usage of a data product,   or for billable events not part of a pricing plan on the listing. * `MONETIZABLE_BILLING_EVENTS`: Custom event billing charges. * `MAX_BILLING_EVENT_USAGE_REACHED`: No charge. These billable events were emitted after the maximum total monthly cost for   the listing was reached.   Additional values are part of preview functionality:   * SPCS_COMPUTE_POOL_SURCHARGE: The amount of the SPCS compute pool surcharge. * MAX_SPCS_COMPUTE_POOL_SURCHARGE_REACHED: No further charge. When the consumer ran additional   queries, they had already reached the maximum total SPCS compute pool surcharge for this listing. |
| UNITS | VARCHAR | Number of queries included in the charge. For a `FIXED` charge, this value is `1`. |
| UNIT_PRICE | DECIMAL | Per-month or per-query fee. For sample data, the value is `0`. |
| CHARGE | DECIMAL | Total charge for this line item on this day (without tax). |
| CURRENCY | VARCHAR | USD |

## Usage Notes

* Latency for the view can be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total amount charged per month and listing:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, listing_display_name
, listing_global_name
, SUM(charge) AS charge
FROM snowflake.organization_usage.marketplace_paid_usage_daily
GROUP BY 1,2,3;
```

Retrieve the total amount charged per month, listing, and consumer account:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, consumer_account_name
, consumer_account_locator
, listing_display_name
, listing_global_name
, SUM(charge) AS charge
FROM snowflake.organization_usage.marketplace_paid_usage_daily
GROUP BY 1,2,3,4,5;
```

---
title: MARKETPLACE_PROVIDER_SPCS_USAGE View
source: https://docs.snowflake.com/en/collaboration/views/marketplace-provider-spcs-usage-ds.md
section: Collaboration & Marketplace
---

Schema:
:   [Data Sharing Usage](../../sql-reference/data-sharing-usage.md)

# MARKETPLACE_PROVIDER_SPCS_USAGE View

The MARKETPLACE_PROVIDER_SPCS_USAGE view in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema lets providers review their daily [Snowpark Container Services (SPCS) usage](../../developer-guide/snowpark-container-services/provider-pricing-surcharges.md). In this view, providers can see the number of compute pool hours and credits consumed by applications that the consumers purchased from the provider.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| START_TIME | DATETIME | The date and beginning of the hour (in the local time zone) in which the usage took place. |
| END_TIME | DATETIME | The date and end of the hour (in the local time zone) in which the usage took place. |
| LISTING_NAME | VARCHAR | Identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name of the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name of the listing. |
| IDENTIFIER | VARCHAR | The compute pool name. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Account locator of the consumer account. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer account. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Organization name for the consumer. |
| CREDITS | VARCHAR | Credits consumed by the compute pool. |
| COMPUTE_HOURS | VARCHAR | The number of hours consumed by the compute pool. |

## Usage notes

* Latency for the view can be up to 48 hours (2 days).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total number of SPCS compute pool hours consumed by each of your consumers.

```sqlexample
SELECT
  start_time,
  end_time,
  listing_name,
  listing_display_name,
  listing_global_name,
  identifier,
  consumer_account_name,
  consumer_account_locator,
  consumer_organization_name,
  credits,
  compute_hours,
FROM snowflake.data_sharing_usage.marketplace_provider_spcs_usage;
```

---
title: Modify published listings
source: https://docs.snowflake.com/en/collaboration/provider-listings-modifying.md
section: Collaboration & Marketplace
---

# Modify published listings

This topic describes how to modify listings after they have been published to the Snowflake Marketplace or shared with consumers as a private listing.

## Privileges required to edit listings

To modify listings, you must be the listing owner or have the MODIFY privilege on a listing. See [MODIFY privilege on a listing](../user-guide/data-exchange-marketplace-privileges.md).

## Edit a listing published on the Snowflake Marketplace

When editing a listing published on the Snowflake Marketplace, consider the following:

* When you edit a listing published on the Snowflake Marketplace, a new draft listing is created. To make those changes available to
  consumers, you must resubmit the draft listing for approval and publishing.
* Editing the available regions and business needs fields do not require approval. You can make these changes at any time.
* If you remove a region that was previously available, consumers in that region no longer have access to the shared
  dataset.
* When a new version of a listing is published, the previous version is replaced and cannot be recovered.

If you want to update the data product associated with a listing, see Update a data share.

To edit a listing published on the Snowflake Marketplace, complete the following steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then select the listing you want to edit.

   * To add or remove regions where the listing is available, click Edit in the Region Availability section. You can skip
     the rest of the steps as no administrator approval is required.
   * To change other fields, such as the listing description, click Edit in the applicable section and select Continue when
     prompted. This creates a new draft listing that is not visible to consumers until submitted, approved, and published.
   * If you have existing changes in progress, select the New Draft toggle next to the listing title to continue working on
     an existing draft. You can discard this draft by selecting the Delete button at the top right of the page.
4. Select Submit for Approval when you are ready to submit your new draft listing for review.

## Edit a private listing

You can edit draft or published private listings in Provider Studio. If you edit a published private listing, any changes that you
make are immediately available to consumers after you save those changes.

If you want to update the data product associated with a listing, see Update a data share for guidance.

To edit a private listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then choose the listing you want to edit.
4. Make the changes that you want, then click Save.

## As a Snowflake Marketplace provider, edit an existing Snowflake Marketplace listing to be available in a VPS deployment

> **Note:**
>
> This feature isn’t enabled by default. Providers must reach out to Snowflake Support to enable this feature. For more information, see [Snowflake Marketplace version 2 listings in VPS deployments](collaboration-marketplace-about.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then select the listing to be edited.
4. Scroll to the Region Availability section and select Set region availability.
5. Click Select regions, then select the VPS region or region groups where you want your listing to be available.

   > Region groups and regions that have deployments in VPS are indicated with an info icon.
   >
   > Hover over that icon to see information about the deployment.
   >
   > Additional fulfillment costs may incur for listings offered in regions that have deployments in VPS.
   >
   > For more information about how auto-fulfillment incurs costs, see [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).
6. Select Save when you’re done.

> **Note:**
>
> Changing the region availability or the business needs doesn’t require approval from the Snowflake Marketplace team.
>
> Any other changes that you make will require the listing to be re-reviewed and approved by the Snowflake Marketplace team.

## As a VPS provider, edit a VPS listing to be available in Snowflake Marketplace

> **Note:**
>
> This feature isn’t enabled by default. Providers must reach out to Snowflake Support to enable this feature. For more information, see [Snowflake Marketplace version 2 listings in VPS deployments](collaboration-marketplace-about.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then select the listing you want to edit.
4. Scroll to the Region Availability section and select Set region availability.
5. Click Select regions, then select the VPS region or region groups where you want your listing to be available.

   > Region groups and regions that have deployments in VPS are indicated with an info icon.
   >
   > Hover over that icon to see information about the deployment.
   >
   > Additional fulfillment costs may incur for listings offered in regions that have deployments in VPS.
   >
   > For more information about how auto-fulfillment incurs costs, see [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).
6. Select Save when you’re done.

> **Note:**
>
> Changing the region availability or the business needs doesn’t require approval from the Snowflake Marketplace team.
>
> Any other changes that you make will require the listing to be re-reviewed and approved by the Snowflake Marketplace team.

## Add compliance badges to a listing

You can add [compliance certifications](provider-becoming.md) to listings by using Snowsight or SQL.

### Add compliance certifications by using Snowsight

To add compliance certifications to an existing listing by using Snowsight, follow these steps:

1. Sign in to Snowsight.
2. In the navigation menu, select Marketplace ‣ Provider Studio.
3. Select the Listings tab, then select the listing you want to update.
4. In the optional Certifications section, add the compliance certifications for your listing. You can upload the supporting compliance documentation and set the expiration date for each certification.
5. Select Save to save your changes.
6. Submit your listing for approval.

### Add compliance certifications by using SQL

Snowflake provides two SQL-based methods for adding certification badges to listings:

* Update a listing by using a stage.
* Update a listing by using local files.

#### Update a listing by using a stage

To update a listing that includes compliance certifications by using a stage, complete the following steps:

SQL

1. To find the listing name, use [SHOW LISTINGS](../sql-reference/sql/show-listings.md); for example:

   ```sqlexample
   SHOW LISTINGS IN DATA EXCHANGE snowflake_data_marketplace;
   ```
2. To review the listing’s [manifest.yml](../progaccess/listing-manifest-reference.md) file, use [DESCRIBE LISTING](../sql-reference/sql/desc-listing.md) on a listing; for example:

   ```sqlexample
   DESCRIBE LISTING <listing_name>;
   ```
3. In the output of the DESCRIBE LISTING command, copy the contents of the manifest.yml column into a new manifest file.
4. In the new manifest file, add the `compliance_badges` field and include a line for each certification type; for example:

   ```yaml
   title: "My listing title"
   subtitle: "My listing subtitle"
   description: "My listing description"
   profile: "MyProfile"
   …
   compliance_badges:
   - type: SOC2
     expiry: 12-25-2026
     files:
       - soc2_compliance_verification.pdf
   - type: HIPAA
     expiry: 06-07-2026
     files:
       - hipaa_compliance_verification.pdf
   ```
5. To upload your new listing manifest file to a Snowflake stage, run the following command:

   ```sqlexample
   PUT file:///<path_to_new_manifest_file> @<stage_name>
     SOURCE_COMPRESSION=None
     AUTO_COMPRESSION=False
     OVERWRITE=True;
   ```
6. To upload your supporting documentation to the same Snowflake stage, run the following command:

   ```sqlexample
   PUT file:///<path_to_soc2_compliance_report> @<stage_name>
   PUT file:///<path_to_hipaa_compliance_report> @<stage_name>
     SOURCE_COMPRESSION=None
     AUTO_COMPRESSION=False
     OVERWRITE=True;
   ```
7. To upload a new version of the listing from stage, use [ALTER LISTING](../sql-reference/sql/alter-listing.md) ; for example:

   ```sqlexample
   ALTER LISTING <listing_name>
     ADD VERSION FROM @<stage_name>;
   ```
8. To submit the listing for review, run the following command:

   ```sqlexample
   ALTER LISTING <listing_name> REVIEW;
   ```
9. To publish the updated listing after it’s approved, run the following command:

   ```sqlexample
   ALTER LISTING <listing_name> PUBLISH;
   ```

#### Update a listing by using local files

To update a listing that includes compliance certifications by using local files, complete the following steps:

SQL

1. To find the listing name, use [SHOW LISTINGS](../sql-reference/sql/show-listings.md); for example:

   ```sqlexample
   SHOW LISTINGS IN DATA EXCHANGE snowflake_data_marketplace;
   ```
2. To review the listing’s [manifest.yml](../progaccess/listing-manifest-reference.md) file, use [DESCRIBE LISTING](../sql-reference/sql/desc-listing.md) on a listing; for example:

   ```sqlexample
   DESCRIBE LISTING <listing_name>;
   ```
3. In the output of DESCRIBE LISTING, copy the contents of the manifest.yml column into a new manifest file.
4. In the new manifest file, add a `compliance_badges` section and include a line for each certification type; for example:

   ```yaml
   title: "My listing title"
   subtitle: "My listing subtitle"
   description: "My listing description"
   profile: "MyProfile"
   …
   compliance_badges:
   - type: SOC2
     expiry: 12-25-2026
     files:
       - soc2_compliance_verification.pdf
   ```
5. To add an editable, live version of the listing, use [ALTER LISTING](../sql-reference/sql/alter-listing.md) ; for example:

   ```sqlexample
   ALTER LISTING <listing_name> ADD LIVE VERSION FROM LAST;
   ```
6. To add the badge files and the updated manifest file to the live version of the listing, run the following commands:

   ```sqlexample
   PUT file:///<path_to_soc2_compliance_report> snow://listing/<name>/versions/live
     AUTO_COMPRESS=False
     OVERWRITE=True;

   PUT file:///<path_to_new_manifest_file> snow://listing/<name>/versions/live
     SOURCE_COMPRESSION=None
    AUTO_COMPRESS=False
     OVERWRITE=True;
   ```
7. To commit the live version of the listing, use ALTER LISTING.

   This will add the recent changes to the approval request.

   ```sqlexample
   ALTER LISTING <listing_name> COMMIT;
   ```
8. To submit the listing for review, run the following command:

   ```sqlexample
   ALTER LISTING <listing_name> REVIEW;
   ```
9. To publish the updated listing after it’s approved, run the following command:

   ```sqlexample
   ALTER LISTING <listing_name> PUBLISH;
   ```

#### Confirm that the compliance badge was added to the listing (SQL)

After you add a compliance badge to a listing by using SQL, you can confirm that the badge was added correctly.

SQL

To confirm that the compliance badge was added to a listing, complete the following steps:

1. Run the following command:

   ```sqlexample
   DESCRIBE LISTING <listing_name> REVISION = DRAFT;
   ```
2. In the output, verify that the `manifest.yml` column includes a `compliance_badges` section.

## Unpublish a listing

To hide a listing from the Snowflake Marketplace without deleting it, you can unpublish the listing.

> **Note:**
>
> When you unpublish a listing, existing consumers can still access the data product associated with the listing unless you also remove them
> from the share. See Update a data share. To remove a listing and access to the listing for all consumers using the listing,
> delete the listing. See [Remove listings as a provider](provider-listings-removing.md).

To unpublish a listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings.
4. Select the name of the listing you wish to unpublish.
5. In the top-right corner of the listing page, select the vertical ellipsis (), and then select Unpublish to
   begin unpublishing the listing.

   A confirmation message displays, reminding you that unpublishing a listing removes the listing from Snowflake Marketplace, but existing
   consumers will continue to have access to the listing.
6. Select Unpublish to complete the unpublish process.

   The status at the top of the page changes from Live to Unpublished. You can select that status to view the listing status summary.

> **Note:**
>
> If the listing was automatically replicated to other regions using auto-fulfillment, the listing remains replicated to the remote regions.
> To remove the replicated data product from other regions, change the region availability of the listing. For more information, see
> [Region availability (Marketplace listings only)](provider-listings-reference.md).

## Republish a listing

When you republish a listing on the Snowflake Marketplace, you do not need to submit the listing for approval unless you made changes to the listing.

To republish a listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings.
4. Select the name of the listing you want to republish.
5. In the top-right corner, select Publish to republish the listing.

   * A message displays, confirming that the listing is available on Snowflake Marketplace. From this message, you can select to view the listing
     on Snowflake Marketplace, or you can select Done to return to the listing page.
   * The status at the top of the listing page changes from Unpublished to Live.

## Update a data share

In addition to modifying a listing, you can modify the data share attached as the data product a specific listing. You cannot remove or
replace the data product for a published listing.

For example, you might want to add a data column to a secure view, or rename objects to follow the [Identifier requirements](../sql-reference/identifiers-syntax.md).

> **Important:**
>
> Every time you modify the share associated with a listing, you must notify the consumers to make sure that you do not break their processes.
> Examples of breaking changes to a data share include the following:
>
> * Adding/removing a column
> * Renaming objects
> * Removing objects

To update the objects in a data share, see [Working with Shares](../user-guide/data-sharing-provider.md).

## Modify paid listings

You can modify the price and pricing plan for paid listings, with some restrictions.

### Change the price of a paid listing

If you want to change the price of a paid listing in the Snowflake Marketplace, you must resubmit the listing for approval.
The approval is a technical part of the process of republishing a modified listing.
Snowflake does not provide feedback about the price change.

You cannot change the price of a listing to zero dollars. To make a paid listing free, you must create a new listing.

After the newly priced listing is approved and published, Snowflake automatically notifies current consumers of the listing about the
price change using the billing contact email address associated with each consumer’s account.

After you change the price of your pricing plan:

* New consumers see and are billed according to the new pricing plan immediately.
* Existing consumers are billed the previous rate until the end of their current billing cycle.

  + If you change the price less than 30 days before the next billing cycle begins, customers are billed the previous rate for the
    next billing cycle and the new rate for the following billing cycle.
  + If you change the price more than 30 days before the next billing cycle begins, customers are billed the new rate for the next billing
    cycle.

For example, for a usage-based pricing plan that bills monthly, if you change the price on October 15th, existing consumers are billed
the previous rate for their October invoice and November invoice, but charged the new rate for the December invoice.

For specific scenarios, refer to this example table:

| Pricing plan | Billing cycle | Plan start date | Price change date | Invoice where new price is reflected |
| --- | --- | --- | --- | --- |
| Usage-based | 1 month | Jan 1, 2023 | Jun 15, 2023 | Aug 1, 2023 |
| Usage-based | 1 month | Jan 1, 2023 | Jun 2, 2023 | Aug 1, 2023 |
| Usage-based | 1 month | Jan 1, 2023 | May 30, 2023 | Jul 1, 2023 |
| Subscription-based | 3 months, recurring | Jan 1, 2023 | Feb 15, 2023 | Apr 1, 2023 |
| Subscription-based | 3 months, recurring | Jan 1, 2023 | Mar 15, 2023 | Jul 1, 2023 |

### Change the pricing plan of a paid listing

You can change the pricing plan for a paid listing when you edit the listing. If you want to change the pricing plan, consider the following:

* You cannot remove a pricing plan from a paid listing to make it a free listing. See Change Existing Listings to Paid Listings.
* You cannot change the type of pricing plan. If your listing currently has a
  [usage-based pricing plan](provider-listings-pricing-model.md), you cannot change the plan to a
  [subscription-based pricing plan](provider-listings-pricing-model.md), and vice versa.
* If your paid listing is published in the Snowflake Marketplace, you must resubmit the listing for approval after changing the pricing plan.
  After the updated pricing plan is approved and the updated listing is published, Snowflake automatically notifies current consumers of
  the listing about the pricing plan change using the billing contact email address associated with each consumer’s account.

When you change the pricing plan, existing consumers are charged based on the new pricing plan after the end of their
next billing cycle. New consumers see the new pricing plan immediately.

### Change existing listings to paid listings

You cannot convert a free listing into a paid listing. If you published a listing without a pricing plan, one cannot be added later.

If you want to offer a paid listing, you must attach a pricing plan to the listing before it is first published.

Similarly, you cannot convert a paid listing into a free listing. If you published a listing with a pricing plan, you cannot change the
pricing plan to null, or change the price to zero. To change the price to some other amount, see Change the price of a paid listing.

If you want to change the type of listing that you offer, create the new listing that you want to offer and unpublish the existing listing.
For example, if you want to replace a free listing with a paid listing, unpublish the free listing and create a paid listing with the same
contents. See Unpublish a listing.

---
title: Monetization usage views
source: https://docs.snowflake.com/en/collaboration/provider-monetization-usage.md
section: Collaboration & Marketplace
---

# Monetization usage views

Snowflake provides historical usage data for paid listings as a set of views in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and [DATA_SHARING_USAGE](../sql-reference/data-sharing-usage.md)
schemas in the shared SNOWFLAKE database.

You can view historical usage data for other listings using the same [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) and [DATA_SHARING_USAGE](../sql-reference/data-sharing-usage.md)
schemas, or view aggregated usage analytics in the Provider Studio. Refer to [Monitor listing use](provider-listings-monitor-studio.md).

## Monetization usage views in the ORGANIZATION_USAGE schema

* [MARKETPLACE_DISBURSEMENT_REPORT (ORGANIZATION_USAGE) View](views/marketplace-disbursement-report-org.md)
* [MONETIZED_USAGE_DAILY (ORGANIZATION_USAGE) View](views/monetized-usage-daily-org.md)

## Monetization usage views in the DATA_SHARING_USAGE schema

* [LISTING_EVENTS_DAILY view](../sql-reference/data-sharing-usage/listing-events-daily.md)
* [MARKETPLACE_DISBURSEMENT_REPORT (DATA_SHARING_USAGE) View](views/marketplace-disbursement-report-ds.md)
* [MONETIZED_USAGE_DAILY (DATA_SHARING_USAGE) View](views/monetized-usage-daily-ds.md)

---
title: MONETIZED_USAGE_DAILY View
source: https://docs.snowflake.com/en/collaboration/views/monetized-usage-daily-ds.md
section: Collaboration & Marketplace
---

Schema:
:   [Data Sharing Usage](../../sql-reference/data-sharing-usage.md)

# MONETIZED_USAGE_DAILY View

The MONETIZED_USAGE_DAILY view in the [Data Sharing Usage](../../sql-reference/data-sharing-usage.md) schema lets you query the history of daily consumer queries per
listing, including charges accumulated for the usage. To retrieve consumer payment information, query the MARKETPLACE_DISBURSEMENT_REPORT
view in the ORGANIZATION_USAGE or DATA_SHARING_USAGE schema.

The view includes the history of consumer queries for a specific listing.

> **Note:**
>
> As part of the Offers preview, the value in the UNIT_PRICE column varies for current and preview functionality.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPORT_DATE | DATETIME | Date when the report was run. |
| USAGE_DATE | DATE | Usage date. |
| LISTING_NAME | VARCHAR | SQL identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name for the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name for the listing. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer account. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Organization name of the consumer account. |
| CONSUMER_SNOWFLAKE_REGION | VARCHAR | Cloud service [region](../../user-guide/intro-regions.md) where the consumer account is hosted. |
| PRICING_PLAN | JSON | JSON value that includes the specifics of the pricing plan. Only included in the output for paid usage. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).  Possible values:   * `SAMPLE`: No charge. The queries were executed within the trial period for the listing. * `FIXED`: Per-month charges. * `GRACE`: No charge. The queries were counted among the free queries allowed in the calendar month (after the first query)   before the per-query charge is applied. * `VARIABLE`: Per-query charges. * `MAX_VARIABLE_USAGE_REACHED`: No charge. The queries were executed after the maximum total monthly cost for this listing   was reached. * `NON_MONETIZABLE_BILLING_EVENTS`: No charge. These billable events were emitted during trial usage of a data product,   or for billable events not part of a pricing plan on the listing. * `MONETIZABLE_BILLING_EVENTS`: Custom event billing charges. * `MAX_BILLING_EVENT_USAGE_REACHED`: No charge. These billable events were emitted after the maximum total monthly cost for   the listing was reached.   Additional values are part of preview functionality:   * SPCS_COMPUTE_POOL_SURCHARGE: The amount of the SPCS compute pool surcharge. * MAX_SPCS_COMPUTE_POOL_SURCHARGE_REACHED: No further charge. When the consumer ran additional   queries, they had already reached the maximum total SPCS compute pool surcharge for this listing. |
| UNITS | VARCHAR | Number of queries included in the charge. For a `FIXED` charge, this value is `1`. |
| UNIT_PRICE | DECIMAL | Current functionality: The per-month or per-query fee. For sample data, the value is `0`.  Preview functionality: The discounted price of the listing. |
| GROSS_CHARGE | DECIMAL | Total charge for this line item on this day. |
| CURRENCY | VARCHAR | USD |

## Usage Notes

* Latency for the view can be up to 48 hours (2 days).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total number of queries run and the total gross charges by customer and month. Queries are returned as number of units:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, consumer_organization_name
, consumer_snowflake_region
, consumer_account_locator
, consumer_account_name
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.data_sharing_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5,6;
```

Retrieve the total number of queries run and the total gross charges by listing and month:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, listing_name
, listing_display_name
, listing_global_name
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.data_sharing_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5;
```

Retrieve the total number of queries run and the total gross charges by charge type, consumer, and month:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, consumer_organization_name
, consumer_snowflake_region
, consumer_account_locator
, consumer_account_name
, charge_type
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.data_sharing_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5,6,7;
```

---
title: MONETIZED_USAGE_DAILY View
source: https://docs.snowflake.com/en/collaboration/views/monetized-usage-daily-org.md
section: Collaboration & Marketplace
---

Schema:
:   [Organization Usage](../../sql-reference/organization-usage.md)

# MONETIZED_USAGE_DAILY View

As a provider of listings, the MONETIZED_USAGE_DAILY view in the [Organization Usage](../../sql-reference/organization-usage.md) schema lets you query the history of daily
consumer usage for each listing, including charges accumulated for the usage. For consumer payment information,
query the MARKETPLACE_DISBURSEMENT_REPORT view in the ORGANIZATION_USAGE or DATA_SHARING_USAGE schema.

The view includes the history of consumer usage for all accounts in your Snowflake organization.

## Columns

| Column Name | Data Type | Description |
| --- | --- | --- |
| REPORT_DATE | DATETIME | Date when the report was run. |
| USAGE_DATE | DATE | Usage date. |
| LISTING_OWNER_ACCOUNT_NAME | VARCHAR | Name of the provider account that owns the listing. |
| LISTING_OWNER_ACCOUNT_LOCATOR | VARCHAR | Account locator for the provider account that owns the listing. For more information about account identifiers, see [Account identifiers](../../user-guide/admin-account-identifier.md). |
| LISTING_NAME | VARCHAR | Identifier for the listing. |
| LISTING_DISPLAY_NAME | VARCHAR | Display name for the listing. |
| LISTING_GLOBAL_NAME | VARCHAR | Global name for the listing. |
| CONSUMER_ACCOUNT_LOCATOR | VARCHAR | Account locator of the consumer account. |
| CONSUMER_ACCOUNT_NAME | VARCHAR | Name of the consumer account. |
| CONSUMER_ORGANIZATION_NAME | VARCHAR | Organization name of the consumer account. |
| CONSUMER_SNOWFLAKE_REGION | VARCHAR | Cloud service [region](../../user-guide/intro-regions.md) where the consumer account is hosted. |
| PRICING_PLAN | JSON | JSON value that includes the specifics of the pricing plan. Only included in the output for paid usage. |
| CHARGE_TYPE | VARCHAR | Type of charge assessed. For more information about the components of the pricing model for paid listings, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).  Possible values:   * `SAMPLE`: No charge. The queries were executed within the trial period for the listing. * `FIXED`: Per-month charges. * `GRACE`: No charge. The queries were counted among the free queries allowed in the calendar month (after the first query)   before the per-query charge is applied. * `VARIABLE`: Per-query charges. * `MAX_VARIABLE_USAGE_REACHED`: No charge. The queries were executed after the maximum total monthly cost for this listing   was reached. * `NON_MONETIZABLE_BILLING_EVENTS`: No charge. These billable events were emitted during trial usage of a data product,   or for billable events not part of a pricing plan on the listing. * `MONETIZABLE_BILLING_EVENTS`: Custom event billing charges. * `MAX_BILLING_EVENT_USAGE_REACHED`: No charge. These billable events were emitted after the maximum total monthly cost for   the listing was reached.   Additional values are part of preview functionality:   * SPCS_COMPUTE_POOL_SURCHARGE: The amount of the SPCS compute pool surcharge. * MAX_SPCS_COMPUTE_POOL_SURCHARGE_REACHED: No further charge. When the consumer ran additional   queries, they had already reached the maximum total SPCS compute pool surcharge for this listing. |
| UNITS | VARCHAR | Number of queries included in the charge. For a `FIXED` charge, this value is `1`. |
| UNIT_PRICE | DECIMAL | Per-month or per-query fee. For free queries or usage after the maximum total charge for the month is reached, the value is `0`. |
| GROSS_CHARGE | DECIMAL | Total charge for this line item on this day. |
| CURRENCY | VARCHAR | USD |

## Usage Notes

* Latency for the view can be up to 24 hours (1 day).
* The data is retained for 365 days (1 year).

## Examples

Retrieve the total number of queries run and the total gross charges by customer and month. Queries are returned as number of units:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, consumer_organization_name
, consumer_snowflake_region
, consumer_account_locator
, consumer_account_name
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.organization_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5,6;
```

Retrieve the total number of queries run and the total gross charges by listing and month:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, listing_name
, listing_display_name
, listing_global_name
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.organization_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5;
```

Retrieve the total number of queries run and the total gross charges by charge type, consumer, and month:

```sqlexample
SELECT
  DATE_TRUNC(MONTH, usage_date) AS usage_month
, consumer_organization_name
, consumer_snowflake_region
, consumer_account_locator
, consumer_account_name
, charge_type
, currency
, SUM(units) AS units
, SUM(gross_charge) AS gross_charge
FROM snowflake.organization_usage.monetized_usage_daily
GROUP BY 1,2,3,4,5,6,7;
```

---
title: Monitor listing use
source: https://docs.snowflake.com/en/collaboration/provider-listings-monitor-studio.md
section: Collaboration & Marketplace
---

# Monitor listing use

This topic explains how to monitor the performance of your listing in terms of usage and best practices.

## Which metrics are tracked?

Depending on whether you offer your listing privately or on the Snowflake Marketplace, you see different usage analytics.
To see who is using your listings, you can use Provider Studio or the database views provided by Snowflake.

Snowflake tracks many metrics for listings, including the following:

* Daily telemetry usage for your listing, such as the daily consumer query history.
* Events when consumers get or request your listing.
* Events when consumers view or click your listing detail page on the Snowflake Marketplace.
* Use of your listing, such as number of jobs run on the data product in your listing.
* Access details for your listing, such as viewing the tables in your listing.
* The company name and account name of consumers accessing your listing.
* Information consumers submit when requesting unlimited access to a limited trial listing.
* And more.

See [Data Sharing Usage](../sql-reference/data-sharing-usage.md) and [Organization Usage](../sql-reference/organization-usage.md) for more details about viewing this information in SQL.

For a full list of metrics tracked for listings, and details for viewing this data using SQL, see [Data Sharing Usage](../sql-reference/data-sharing-usage.md) and [Organization Usage](../sql-reference/organization-usage.md).

For paid listings, you can see additional data on a per-listing basis or for all paid listings in your organization:

* Earnings history for your listings.
* Charges accumulated by type of charge.
* Number of queries included in the charge to a consumer.
* Number of consumers trialing your listing.
* Number of purchases of your listing.
* And more.

See [Monetization usage views](provider-monetization-usage.md) for more details about viewing this information in SQL.

## Monitor consumer usage metrics in Provider Studio

To help you manage the performance and usage of your listings, Provider Studio provides overview analytics on the Home tab and
aggregated and detailed analytics on the Analytics tab.

> **Note:**
>
> Providers receive usage data and other details as described in
> Monitor listing use. Consumers accessing
> a data product containing a Snowflake Native App will receive consumer-related monitoring data
> only if the app emits events specifically for the consumer.

### Prerequisites for viewing usage data in Provider Studio

Before you can view usage data, make sure that you meet the prerequisites.

* To view Provider Studio and the data on the Home and Analytics tabs, you must use the ACCOUNTADMIN role
  or a custom role granted the CREATE LISTING privilege and IMPORTED PRIVILEGES on the SNOWFLAKE database.
  See [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md).
* You must select a warehouse that Snowflake can use to bill your account for the queries that generate these usage analytics.
  You are billed at the normal rate for these queries. For more information, see [Understanding compute cost](../user-guide/cost-understanding-compute.md).

### View usage data in Provider Studio

To view usage data in Provider Studio, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select a warehouse. In the upper-right corner of the page, click Select Warehouse.
4. Queries to show analytics data on the Home tab and Analytics tab run successfully and display results.

You can see trends from the last 28 days on the Home tab, including the following:

* The number of queries executed by consumers against your data products, including how those numbers are trending compared to the previous
  28 day period.
* The number of unique consumers that have queried your listings, including how those numbers are trending compared to the previous
  28 day period.
* The name of your most-queried listing.
* The consumer who has run the most queries against your listings.

On the Analytics tab, you can see additional metrics, both aggregated across all your listings or a detailed view for a specific listing.

#### View overview analytics for your listings

You can see the following overview metrics for your listings:

* The reach of your listings on the Snowflake Marketplace, such as views of all of your listings, and a list of the most-viewed listings.
* Engagement with your listings, based on the number of queries executed. You can see the following engagement overview metrics:

  + The number of queries executed across all your listings.
  + Your listings ordered by usage, determined by number of queries executed.
  + The number of consumer accounts actively using your listings per day.
  + A ranking of the most active consumers, based on query execution.
  + A list of the regions in which consumers execute the most queries.
  + For your free listings on the Snowflake Marketplace, you can see consumer conversion from viewing, mounting the share, and querying
    your listings. You can only see consumer conversion over the last 28 days.

#### View detailed metrics for your listings

On the Analytics tab, if you select More for a specific metric or select Detailed Metrics, you can see the following
analytics:

| Metric | Details |
| --- | --- |
| Active consumers | The number of unique consumers that have queried your data products. |
| Consumer requests | The number of consumers that have requested replication of your free listings to their region or requested unlimited access to limited trial listings. |
| Listings installed | The number of listings that consumers have installed in their accounts. |
| Listing views | The number of times that your listings offered on the Snowflake Marketplace have been viewed. |
| Queries executed | The number of queries executed against your listings. |

Using the detailed view, you can filter the data by time, exchange, consumer, listing name, or region.

#### View consumer details for your listings

You can see consumer details, including the company name and Snowflake account name for the consumer, in several places:

* In the Usage Trends on the Home tab that lists the most active consumer.
* On the Overview tab in the Most active consumers tile.
* In the detailed metrics for Listings Installed, Consumer Requests, and Queries Executed when viewing a specific listing.

To see which consumers are installing and using your listings, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. If you do not have a default warehouse set, select a warehouse. In the upper-right corner of the page, choose Select Warehouse.
4. Select Analytics » Detailed Metrics.
5. Select the Queries Executed drop-down and select Listings Installed.
6. Review the table of listings and select a specific listing that you want to see details for.
7. In the table of All consumers for the selected time period, review the company name, account name, first name, last name,
   email address, and region of the consumer that installed the selected listing. If the company name is not available for a consumer,
   you instead see the Snowflake organization and account names.

To see other details about consumers using your listings, review the Most active consumers tile on the Overview tab. It shows
a list of consumer company names ordered by how many queries they executed against your data product in the selected time frame.
Select a consumer to see detailed metrics such as the dates of usage and Snowflake region of the consumer account.

### View usage data by using SQL

The overview metrics on the Analytics tab are derived from the metrics in the DATA_SHARING_USAGE schema. Some metrics, including
most active consumers, conversion rate, most viewed listings, and most used listings, are derived from the metrics and might not exactly
match what you can see in the schema.

The detailed metrics on the Analytics tab are obtained by querying the views in the [Data Sharing Usage](../sql-reference/data-sharing-usage.md) schema.

If you want, you can query the views directly. See [Data Sharing Usage](../sql-reference/data-sharing-usage.md).

Because the views are part of the SNOWFLAKE database, only account administrators (users with the ACCOUNTADMIN role) can perform queries on the data sharing usage for the listings published from that account. Privileges can be granted to other roles in your account to allow other users access. For more details, see [Enabling other roles to use schemas in the SNOWFLAKE database](../sql-reference/account-usage.md).

> **Note:**
>
> The DATA_SHARING_USAGE schema is not updated immediately. There can be up to two days of latency between an event occurring and updates to the DATA_SHARING_USAGE schema.

## Improve listing performance in the Snowflake Marketplace

If you publish your listing on the Snowflake Marketplace and want to improve the listing performance with consumers, review the
[Snowflake Marketplace Provider Best Practices](https://www.snowflake.com/provider-best-practices/).
For a video on how to use Provider Studio to review the performance of your listings,
see [Provider Studio Analytics - Understanding Listing Performance](https://www.snowflake.com/wp-content/uploads/2024/11/Provider_Studio_Analytics_2024-11.mp4).

---
title: Monitor replication costs
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-monitor-replication-costs.md
section: Collaboration & Marketplace
---

# Monitor replication costs

To monitor the cost of auto-fulfillment, do the following:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the row for the listing that you want to manage.
4. From the listing details page, access the auto-fulfillment settings:

   1. For a listing offered on the Snowflake Marketplace, in the Region Availability section, select Manage.
   2. For a listing offered to specific consumers, in the Consumer Accounts section, select ….
5. Select Monitor Replication Cost to monitor the costs related to fulfilling the data product to other regions. See [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).

For more details about modifying listings, see [Modify published listings](https://other-docs.snowflake.com/collaboration/provider-listings-modifying).

---
title: Monitor resources and view costs
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-monitor-view-costs.md
section: Collaboration & Marketplace
---

# Monitor resources and view costs

This section describes how to monitor auto-fulfillment resources and view estimated and actual costs associated with auto-fulfillment.

## Monitor resources

If you want to minimize costs associated with auto-fulfillment, review the usage of your listings and learn more
about preparing your data for auto-fulfillment:

Monitor Compute Resources
:   Identify the queries run by Snowflake and review the refresh frequency interval for your listings.

    Refer to the [LISTING_AUTO_FULFILLMENT_REFRESH_DAILY view](../sql-reference/data-sharing-usage/listing-auto-fulfillment-refresh-daily.md) to identify the listings and databases contributing to compute cost.

    To identify the queries run by Snowflake to support auto-fulfillment, review the Query History and filter on
    Client generated statements. Refer to the [Query History Page](../user-guide/ui-snowsight-activity.md).

    Review the refresh frequency interval that you set for the listing. Refer to [Set the account-level refresh interval](provider-listings-auto-fulfillment-set-refresh-interval.md).

Monitor Storage Resources
:   Determine what data to put in your listing and how to structure your data to minimize the amount that needs to be auto-fulfilled.
    Refer to [Prepare data for a listing](provider-listings-preparing.md).
    Cross-Cloud Auto-Fulfillment does not support secure views that reference data stored in other databases.

    Refer to the [LISTING_AUTO_FULFILLMENT_DATABASE_STORAGE_DAILY view](../sql-reference/data-sharing-usage/listing-auto-fulfillment-database-storage-daily.md) to identify listings and databases contributing to storage cost.

Monitor Data Transfer Resources
:   Identify the regions in which secure share areas have been created. Run the [SHOW REPLICATION ACCOUNTS](../sql-reference/sql/show-replication-accounts.md) command.

Monitor ECO costs
:   Monitor the ECO costs across your organization. Run the [USAGE_IN_CURRENCY_DAILY view](../sql-reference/organization-usage/usage_in_currency_daily.md) in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema of the SNOWFLAKE database. In the SERVICE_TYPE column, review the EGRESS_COST_OPTIMIZER value.

## View estimated costs

SQL

To view estimated costs for all secure share areas associated with the provider accounts in your organization, use the
[LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view](../sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md) in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema of the SNOWFLAKE database.

To view actual costs for accounts in your organization, use other views in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema of the SNOWFLAKE database.

## View actual costs

You can use the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) view or the Snowsight Usage dashboard to view costs associated with Cross-Cloud
Auto-Fulfillment and attribute costs associated with fulfilling listings to specific regions. Use the accounts prefixed with
SNOWFLAKE_MANAGED$ and AUTO_FULFILLMENT_AREA$ to attribute cost to specific regions.

You must be an account administrator (use the ACCOUNTADMIN role) or use the [ORGANIZATION_USAGE_VIEWER](../sql-reference/snowflake-db-roles.md) database role to view usage data for Snowflake.

SnowsightSQL

To view actual costs in Snowsight, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Cost management, and then select Consumption.
3. Select a warehouse to use to view the usage data.
4. Using the accounts filter, select the accounts titled SNOWFLAKE_MANAGED$PUBLIC_<region_name> or AUTO_FULFILLMENT_AREA$-<region_name> to filter on the secure share areas used by auto-fulfillment.

   > For example, select SNOWFLAKE_MANAGED$PUBLIC_AWS_EU_WEST_2 to view the costs associated with using auto-fulfilling data to the AWS region eu_west_2.
5. Use the filters to view all usage types, or focus on compute, storage, or data transfer costs.

To view estimated costs using SQL, you can query the [LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view](../sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md) in the [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema. To view actual costs, refer to the other views in the ORGANIZATION_USAGE schema. For more details on viewing costs, see [Exploring overall cost](../user-guide/cost-exploring-overall.md).

The costs that you see reflect all listings shared to a particular region by any account in your organization. To identify which listings
are being consumed in which regions and contributing to the costs in a specific region, see [Monitor listing use](provider-listings-monitor-studio.md).

---
title: Optimizing data transfer costs with Egress Cost Optimizer
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-eco.md
section: Collaboration & Marketplace
---

# Optimizing data transfer costs with Egress Cost Optimizer

Egress Cost Optimizer (ECO) is a capability of auto-fulfillment that minimizes egress costs when sharing data or apps to multiple regions,
helping providers on Snowflake (of both public and private listings) to reduce costs of sharing and cost of service, and as a result, maximize their return on investment (ROI).

> **Note:**
>
> * By default, Egress Cost Optimizer is unavailable to customers using [Virtual Private Snowflake (VPS)](../user-guide/intro-editions.md), using [Business Critical Edition](../user-guide/intro-editions.md), or on a [government cloud](../user-guide/intro-regions.md). If you’re a BCE, VPS or Gov customer, you can reach out to your Snowflake account executive for more information about ECO enablement.
> * Providers can enable ECO in a primary account in any commercial region and create listings targeted to any other region, including VPS, BCE, and Gov.

## How Egress Cost Optimizer works

Egress Cost Optimizer analyzes your listing configuration in terms of the number
of regions and cloud providers where the listing is available, and delivers the most cost-efficient auto-fulfillment for database replication.
For example, if you’re replicating data to multiple cloud regions and incurring repeated egress costs on the same dataset,
it intelligently routes the data through a Snowflake-managed ECO cache.
In this way, customers end up paying zero additional egress costs to expand to new regions, reducing the data transfer costs.

In another example, if you’re only replicating to 1-2 regions within the same cloud provider, the ECO doesn’t use the ECO cache because your data transfer costs are already optimized.
As a result, by turning on ECO, you’re ensuring minimum data transfer costs under any data sharing scenario.
For more information on costs, benefits, and limits of ECO see Benefits and costs of egress cost optimization and Limitations of ECO.

Still another example to consider is whether you’re replicating tables rather than an entire database. ECO only uses the cloud cache if the overall costs are getting optimized at the database level. So if you have one table in a database, and that table is being replicated to 10 regions while the database is only getting replicated to a single region, then replication won’t use the cache.

ECO doesn’t impact existing security, features, and performance commitments of listings,
such as support for data encryption in transit and rest through Snowflake Tri-Secret Secure (TSS), or
existing cross-cloud auto fulfillment features (for example object-level replication, listing refresh cron schedule, and listing refresh history).

You can learn more about the Snowflake supported third-party sub-processors that are leveraged in connection with Cloud Cache by visiting our
[Sub-processor](https://www.snowflake.com/en/legal/privacy/snowflake-sub-processors/) site.

When using ECO, your data will be hosted in the following regions,
in addition to the regions where you make the data available to your consumers:

North and South America

| Local region | Local cloud | Local region ID | Snowflake-managed ECO cache region |
| --- | --- | --- | --- |
| Canada (Central) | AWS | `ca-central-1` | Eastern North America |
| South America (Sao Paulo) | AWS | `sa-east-1` | Eastern North America |
| US West (Oregon) | AWS | `us-west-2` | Western North America |
| US East (Ohio) | AWS | `us-east-2` | Eastern North America |
| US East (N. Virginia) | AWS | `us-east-1` | Eastern North America |
| US Central1 (Iowa) | GCP | `us-central1` | Eastern North America |
| US East4 (N. Virginia) | GCP | `us-east4` | Eastern North America |
| Canada Central (Toronto) | Azure | `canadacentral` | Eastern North America |
| Central US (Iowa) | Azure | `centralus` | Eastern North America |
| East US 2 (Virginia) | Azure | `eastus2` | Eastern North America |
| South Central US (Texas) | Azure | `southcentralus` | Eastern North America |
| West US 2 (Washington) | Azure | `westus2` | Western North America |

Europe and Middle East

| Local region | Local cloud | Local region ID | Snowflake-managed ECO cache region |
| --- | --- | --- | --- |
| EU (Frankfurt) | AWS | `eu-central-1` | European Union |
| EU (Zurich) | AWS | `eu-central-2` | European Union |
| EU (Stockholm) | AWS | `eu-north-1` | European Union |
| EU (Ireland) | AWS | `eu-west-1` | European Union |
| Europe (London) | AWS | `eu-west-2` | European Union |
| EU (Paris) | AWS | `eu-west-3` | European Union |
| Middle East Central2 (Dammam) | GCP | `me-central2` | European Union |
| Europe West2 (London) | GCP | `europe-west-2` | European Union |
| Europe West3 (Frankfurt) | GCP | `europe-west-3` | European Union |
| Europe West4 (Netherlands) | GCP | `europe-west-4` | European Union |
| North Europe (Ireland) | Azure | `northeurope` | European Union |
| Switzerland North (Zurich) | Azure | `switzerlandnorth` | European Union |
| West Europe (Netherlands) | Azure | `westeurope` | European Union |
| UAE North (Dubai) | Azure | `uaenorth` | European Union |
| UK South (London) | Azure | `uksouth` | European Union |

Asia Pacific and China

| Local region | Local cloud | Local region ID | Snowflake-managed ECO cache region |
| --- | --- | --- | --- |
| Asia Pacific (Tokyo) | AWS | `ap-northeast-1` | Asia-Pacific |
| Asia Pacific (Seoul) | AWS | `ap-northeast-2` | Asia-Pacific |
| Asia Pacific (Osaka) | AWS | `ap-northeast-3` | Asia-Pacific |
| Asia Pacific (Mumbai) | AWS | `ap-south-1` | Asia-Pacific |
| Asia Pacific (Singapore) | AWS | `ap-southeast-1` | Asia-Pacific |
| Asia Pacific (Sydney) | AWS | `ap-southeast-2` | Asia-Pacific |
| Asia Pacific (Jakarta) | AWS | `ap-southeast-3` | Asia-Pacific |
| Australia East (New South Wales) | Azure | `australiaeast` | Oceania |
| Central India (Pune) | Azure | `centralindia` | Asia-Pacific |
| Japan East (Tokyo) | Azure | `japaneast` | Asia-Pacific |
| Southeast Asia (Singapore) | Azure | `southeastasia` | Asia-Pacific |

ECO ensures that under any circumstance, you’re only paying cross-cloud egress cost once.
As a result, the more cloud regions that you replicate to, the more the potential egress cost savings.

> **Note:**
>
> This feature is only available for Cross-Cloud Auto-Fulfillment and not for manual replication.

## Benefits and costs of egress cost optimization

Egress cost optimization can be used to reduce and control listing auto-fulfillment costs.

Initial costs:
:   The first time data is auto-fulfilled using the egress cost
    optimizer, the data is cached in Snowflake-managed S3-compatible storage with zero-egress costs,
    and you are charged for the initial egress of all the data in each listing to this storage location.
    Thereafter, egress is charged only for data updates.

Incremental data loading vs full data reloading:
:   If you regularly replace tables, or truncate and reload tables,
    be aware that this fresh data will be treated as a new table. Using
    these processes causes those tables to be re-cached, which incurs a
    higher cost than modifying the data by using less resource-intensive methods.

Greater savings with many regions or clouds:
:   Sharing data across more regions increases your savings on total egress costs.
    The more regions where data is shared, the greater the savings with the egress cost optimizer.

Database level, not listing level:
:   Where an auto-fulfillment schedule is set on the account level,
    rather than on the listing level, the egress cost optimizer will be
    enabled on all the listings that follow the account schedule. After the
    cost optimizer is enabled on a database, all subsequent auto-fulfillment
    involving that database will use it.

Cache storage costs:
:   Cache storage costs are incurred only while the listing is active. For example, if you have a listing that’s cached, and you drop that listing after 10 days, you are only charged for 10 days of cache storage.

For more information about pricing for egress between source and target regions or clouds, see the Snowflake [pricing guide](https://www.snowflake.com/resource/the-simple-guide-to-snowflake-pricing/) and the [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## ECO FAQs

When does ECO use the zero-egress cache?
:   ECO uses a heuristic-based algorithm to decide when to use the ECO cache. For example, if you’re replicating to only one or two regions within the same cloud provider, ECO doesn’t use the zero-egress cost cache because your data transfer costs are already optimized. The algorithm calculates the effective data transfer cost at the listing level.

How do I measure changes in data transfer?
:   When your listing uses the ECO cloud cache, the cache updates the `bytesSkipped` parameter in the [LISTING_REFRESH_HISTORY](../sql-reference/functions/listing_refresh_history.md). If you don’t see the cache being used, then your data transfer is already optimized. Please reach out to Snowflake support for any questions.

How much does it cost to use ECO?
:   The cost to use the ECO cache is described in Table 3(d) of the [Snowflake service consumption table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) (on the Snowflake website). ECO stores the data for 15 days, and you’re charged only for the number of days that the cache is used. For example, you create a listing on Day 1 and enable ECO. The listing uses the cache for cross-cloud replication to target customers. Then you delete the listing on Day 10. In this case, you’re charged for 10 days of ECO cache storage.

## Limitations of ECO

* Incremental data ingestion is required for the cloud cache to be fully used by the egress cost optimizer.
* The cloud cache is only used by the egress cost optimizer for refreshes made by auto-fulfillment.
* Egress cost optimizer will only use the cloud cache if the overall egress costs for all listings on the same database are getting optimized. The optimizer algorithm measures the size of the listings at a database level and not at a table level.
* ECO is not supported for listings that include a [Cortex Knowledge Extension (CKE)](../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-overview.md).

  Providers should be aware of the cost implications for replication with listings that have a CKE.

  If a CKE is added to a listing that has ECO enabled, ECO will be automatically turned off, and the provider will be notified by email. With ECO turned off, costs associated with the listing can increase.

  Similarly, if a CKE is added to a listing that’s part of a replication group, then ECO will be turned off for all listings within that replication group. An email notification will be sent to the provider indicating that the ECO was turned off.

---
title: Paid listings pricing models
source: https://docs.snowflake.com/en/collaboration/provider-listings-pricing-model.md
section: Collaboration & Marketplace
---

# Paid listings pricing models

For a consumer, the pricing model of a listing depends on what the provider of a listing selects from the options provided by Snowflake.
A provider can attach a single pricing plan to a listing.

This topic outlines the pricing models available for providers to choose for their listings.
You can choose from a usage-based plan or a subscription-based plan:

| Pricing model | Charge components | Billing timing |
| --- | --- | --- |
| Usage-based | Providers charge for any combination:   * For billable events * Per query * Monthly fee | Consumers are billed in arrears in months where usage occurs. |
| Subscription-based | Providers charge for a specified term, with optional recurring billing. | Consumers are billed upfront. |

As a provider, you cannot remove a pricing plan from a listing. Any update to a pricing plan for a listing offered publicly on the
Snowflake Marketplace is subject to approval. See [Modify published listings](provider-listings-modifying.md) for more.

After receiving payment from consumers, Stripe pays providers. If consumers use their Capacity commitment to purchase listings,
Snowflake pays providers.

## Usage-based pricing models

You can add a usage-based pricing plan to your paid listing. The options available to you depend on what you choose to share with
a listing and how you choose to bill.

| Content being shared | Pricing model options |
| --- | --- |
| Application | Usage-based models where providers charge for any combination:   * For billable events * Per query * Monthly fee |
| Data | Usage-based models where providers charge for any combination:   * Per query * Monthly fee |

### Components of usage-based pricing models

Usage-based plans charge consumers on a monthly basis according to their usage of your data product. If you choose a usage-based plan,
you can charge for any combination of the following options:

Billable events:
:   Only listings that share an application can use Custom Event Billing, which charges based on billable events.

    With Custom Event Billing, you can charge a price for specific types of usage of your application. For example, you can charge:

    * Per row of data modified by the application
    * Per procedure call made by the application
    * Per row of data used by the application
    * Per unique row of data updated in a month by your application (monthly active rows)

    You can also charge for other events that you define in your application code.

Per-query charge:
:   Pay a fixed price for each query run that accesses paid data.

    If the pricing model includes a per-month charge, the per-query amount is charged in addition to the per-month charge.

Monthly fee:
:   Pay a fixed price per calendar month in which at least one executed query references a Snowflake Native App, or a paid listing data share. For Snowflake Native App with Snowpark Container Services, a one-time monthly fee is charged when the compute pool runs. Your organization isn’t charged a monthly fee if the Snowflake Native App with Snowpark Container Services compute pool isn’t run, or a query isn’t executed on the Snowflake Native App or the data share within the calendar month.

    A billing cycle is the period of time that starts on the first day of the calendar month and ends on the last day of the month. The per-month price is charged regardless of when during the month the first query was run against paid data. The per-month charge is a fixed price and is not prorated.

For usage-based plans with dynamic charges, such as per-query plans or Custom Event Billing, your pricing plan must include additional
components:

Maximum total charge per month:
:   The maximum total monthly cost that can be charged for a listing as defined by the listing provider. This maximum total charge includes
    all usage-based charges included in the pricing plan for the listing. When this maximum monthly charge is reached, subsequent usage, such
    as queries, is free.

Number of free queries:
:   For pricing plans that include per-query charges, the first query in a calendar month is always charged.
    You can also specify a number of free queries allowed after the first query, and then resume charging a per-query price.

    The first query in each calendar month incurs the per-month charge, the per-query charge, or both, depending on the pricing plan for the listing.

### Configure your listing for custom event billing

After you [create a listing](provider-listings-creating-publishing.md), you can configure your listing to add a Custom Event Billing
usage-based pricing plan.

#### Add Custom Event Billing to your listing

> **Note:**
>
> Before you can add Custom Event Billing to your listing, you must configure your application to emit billable events. You must
> know each `class` and corresponding `billing_quantity` used to calculate the `base_charge` in the application to add
> Custom Event Billing to your application.
> Refer to [Add billable events to an application package](../developer-guide/native-apps/adding-custom-event-billing.md).

After you [create a listing](provider-listings-creating-publishing.md), do the following to add a Custom Event Billing pricing plan to your listing:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, and then select the draft listing you want to configure.
4. In the Data Product section, for Pricing & Trial, select Add.

   If you do not see a Pricing & Trial section, you must add your application package to the listing.
5. Select the Usage-based pricing plan.
6. (Optional) To charge for billable events and a monthly fee, select + Monthly Fee, and then add a monthly fee
   in US dollars.
7. For Billable Events, select + Billable Event to add a new billable event.

   > **Note:**
   >
   > You are paid only for billable events that you add to your listing, even if additional types of events are emitted by your application.
   > The billable event details that you specify in the listing must exactly match the billable events emitted by your application.
8. For each billable event that you add, do the following:

   * Enter a Class that exactly matches the `class` defined in the system function for your application.
   * Enter an Event Display Name to describe the billable event. For example, Row Modified.
   * Enter a Billing Quantity to define how much you want to charge for each billable event. For example, 0.01 to charge $1.00
     for 100 modified rows. This value must match the `billing_quantity` variable used to calculate the `base_charge` in your
     application code.
   * Enter a Unit Name to describe the units of the billable event being charged for. For example, row.
9. If desired, add another billable event. You can charge for up to eight (8) billable events.
10. Enter a Description to describe how your application bills consumers. For example, “Charges one cent for each row of data modified
    as a result of actions performed in the application.”
11. Optionally select + Per Query Charge to add charges for each query performed in addition to the charges associated with billable events.

    1. If you add per-query charges, add a Cost per Query in US dollars.
    2. Enter a number of free Included Queries for the pricing plan. For example, enter 200 to start
       charging consumers when they run the 202nd query against the application database, because the first query is always charged.
12. For Charging Limit, specify a Maximum Monthly Charge in US dollars.
13. Select whether to offer a free trial, and if so, select the length of the trial.
    Trials are required for listings offered publicly on the Snowflake Marketplace.
14. Select Save.

Before you publish your application to specific consumers or publicly on the Snowflake Marketplace, test your application to make sure the
charges are being made as you expect.
See [Add billable events to an application package](../developer-guide/native-apps/adding-custom-event-billing.md).

> **Note:**
>
> If you share your application with other consumer accounts in your organization and want to charge for their usage,
> [contact Snowflake Support](../user-guide/contacting-support.md). By default, usage within your organization is not billed to allow for testing.

### Configure your listing for a per-query usage plan

After you [create a listing](provider-listings-creating-publishing.md), you can configure your listing to add a per-query usage-based pricing plan.

With a per-query usage-based plan for your application, you charge a price for each query performed against the shared content.

As with any usage-based plan, you must set a maximum monthly charge for usage to avoid unexpected overages for consumers.

To add a per-query usage-based plan to your listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, and then select the draft listing you want to configure.
4. In the Data Product section, for Pricing & Trial, select Add.

   If you do not see a Pricing & Trial section, you must add a data product to the listing.
5. Select the Usage-based pricing plan.
6. Optionally select + Monthly Fee to also charge a monthly fee for months in which a consumer uses your listing.
7. For Queries, select + Per Query Charge.
8. Add a Cost per Query in US dollars.
9. Optionally enter a number of queries included in the pricing plan for free. For example, enter 200 to start
   charging consumers when they run the 202nd query against the application database, because the first query is always charged.
10. For Charging Limit, specify a Maximum Monthly Charge in US dollars.
11. Select whether to offer a free trial, and if so, select the length and type of the trial.
    Trials are required for listings offered publicly on the Snowflake Marketplace.
12. Select Save.

### Configure your listing for a monthly fee usage plan

After you [create a listing](provider-listings-creating-publishing.md), you can configure it to add a monthly fee usage-based pricing plan.

With a monthly fee usage-based plan, you charge a fixed price for each month in which a consumer
ran a query against a database included in the data product. For listings with applications, you can combine monthly fee usage charges
with Custom Event Billing.

If you want to charge a monthly fee whether or not consumers use your data product, add a subscription-based plan instead.
See Subscription-based pricing models.

To add a monthly fee usage-based plan to your listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, and then select the draft listing you want to configure.
4. In the Data Product section, for Pricing & Trial, select Add.

   If you do not see a Pricing & Trial section, you must add a data product to the listing.
5. Select the Usage-based pricing plan.
6. For Monthly Fee, select + Monthly Fee.
7. Enter a Monthly fee in US dollars.
8. Select whether to offer a free trial, and if so, select the length and type of the trial.
   Trials are required for listings offered publicly on the Snowflake Marketplace.
9. Select Save.

### Usage-based pricing plan examples

The following examples describe possible usage-based pricing plans that a provider might set up for a listing that shares a data product
privately or publicly on the Snowflake Marketplace.

#### Monthly fee only plan

The following diagram shows the costs associated with a pricing plan composed only of a monthly fee charge.

For each month in which users in an account query paid data in the listing, the provider charges only the fixed price of $100 USD.

#### Per-query only plan

The following diagram shows example costs associated with a pricing plan composed only of a per-query charge.

For each month in which users query paid data from a listing, the provider charges a per-query fee of $0.01 USD. The plan includes 1,000 free queries against paid data (after the first query) per billing cycle. This example plan also includes a maximum monthly charge of $200.

In this diagram, the January invoice bills the consumer $20 for a total of 3,000 queries that were run against the listing.

In the February billing cycle, the fixed maximum monthly price of $200 was reached partway through the month. Queries run against the paid
data during the remainder of the billing cycle were free.

#### Per-month plus per-query plan

The following diagram shows example costs associated with a pricing plan composed of combined per-month and per-query charges.

For each month in which users in this account query paid data in the listing, the provider charges a fixed price of $100
in addition to a $0.01 fee per query.

This example pricing plan includes 1,000 free queries against paid data (after the first query) per billing cycle.
This example plan also includes a maximum monthly charge of $200.

In this diagram, the January invoice bills the consumer $20 for a total of 3,000 queries that were run against the listing,
as well as the fixed monthly charge of $100.

In the February billing cycle, the fixed maximum monthly price of $200 was reached partway through the month. The consumer paid for
10,000 queries against the data in addition to the $100 fixed monthly charge. An additional 1,000 queries were free as part of the free
monthly queries, and any queries made after the first 11,000 were also free because the maximum charge was reached.

## Subscription-based pricing models

Choose a subscription-based pricing model to charge an upfront fee for a specified term, with optional recurring billing for your listing.

### Configure your listing for a subscription-based plan

After you [create a listing](provider-listings-creating-publishing.md), you can configure it to add a subscription-based plan.

For this pricing plan, consumers are charged upfront for access to the data product for a specified term.
You can choose to offer the listing with recurring billing for a subscription that auto-renews, or non-recurring billing for
access for a fixed term.

### Add a recurring subscription-based plan

To add a recurring subscription-based plan to your listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then select the draft listing you want to configure.
4. For the Data Product section, for Pricing & Trial, select Add.

   If you do not see a Pricing & Trial section, you must add a data product to the listing.
5. Select Subscription-based.
6. For Billing and access, select Recurring to charge consumers upfront at the beginning of the recurring term.
7. Specify a term from 1–36 months for the Billing period for the listing.
8. Specify the total price to be paid upfront, in US dollars.
9. Select whether to offer a free trial, and if so, select the type and length of the trial.
   Trials are required for listings offered publicly on the Snowflake Marketplace.
10. Select Save.

### Add a non-recurring subscription-based plan

You can add a non-recurring subscription-based plan to charge consumers once upfront for access to your listing. Listings with this plan
cannot be repurchased by consumers. If you want consumers to be able to repurchase your listing, choose a recurring subscription-based plan.

To add a non-recurring subscription-based plan to your listing, do the following:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select Listings, then select the draft listing you want to configure.
4. For the Data Product section, for Pricing & Trial, select Add.

   If you do not see a Pricing & Trial section, you must add a data product to the listing.
5. Select Subscription-based.
6. If Billing and access is shown, select One time to charge consumers once upfront with no option to renew or repurchase the listing.
7. Specify an Access period from 1–36 months for the listing.
8. Specify the total price to be paid upfront, in US dollars.
9. Select whether to offer a free trial, and if so, select the type and length of the trial.
   Trials are required for listings offered publicly on the Snowflake Marketplace.
10. Select Save.

## Installment plan pricing models

Choose an installment plan pricing model to divide the total price into portions
so the consumer can pay one portion at a time rather than pay the entire amount
in one lump sum.

* Provide consumers the flexibility to make periodic payments by splitting total
  cost into multiple smaller payments.
* Allow providers to define installment payments, including the ability to
  specify larger or smaller payment amounts earlier or later in the
  subscription term. Providers may also define zero value payments, allowing
  consumers to skip payments.

### Configure listings to bill consumers using installments

You must have OWNERSHIP or MODIFY privilege on the listing to configure
installments. For a complete set of privileges for working with listings
see [Privileges required for working with listings](provider-becoming.md).

To configure listings for installment payments:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Find and select the listing where you want to configure for installments.
4. In the Access & Pricing section, select the pencil icon
   next to Listing Access.
5. Select Subscription-based.
6. Select Consumer must pay in installments.
7. Configure the installment for the listing:

   1. Select Access period.
   2. Specify Total price.
   3. Select Installment Type, one of:

      * Use equal amounts for all installments.
      * Set custom amount per installment.

        When using a custom amount you can specify multiple payments and amount
        per payment, including zero payments.
8. Select Save to save the changes.

### Limitations

Price changes to existing subscription pricing, such as by changing
non-recurring subscriptions to recurring, and/or by adding installments to
an existing plan, is not supported. Instead, make a new listing.

### Grant consumers early access to your listing

Early access:

* Grants consumers access to subscription-based listings without requiring
  upfront payment.
* Allows providers to specify early access and set terms such as NET 30, or
  to specify sharing an invoice directly with the consumer.

> **Note:**
>
> The provider is expected to inform the consumer of the desired payment terms.

You must have OWNERSHIP or MODIFY privilege on the listing to configure early
access.

To enable early access:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Find and select the listing that you want to configure for early access.
4. Select Subscription-based pricing plan for paid listings.
5. Select Allow early access in the
   Access the listing without payment section.
6. Select Save to save the changes.

> **Note:**
>
> Snowflake only recommends early access for paid private listings.
>
> Consumers are able to access the paid listing without payment, even if
> they are late or delinquent.

---
title: Pay for listings
source: https://docs.snowflake.com/en/collaboration/consumer-listings-paying.md
section: Collaboration & Marketplace
---

# Pay for listings

Before making a purchase on Snowflake Marketplace, use the topics on this page to learn how to make and manage your purchases on Snowflake Marketplace.
If you want to request trial access before making a purchase,
see [Explore listings](consumer-listings-exploring.md).

> **Note:**
>
> If you are a value-added reseller (VAR) that wants to purchase paid listings, use this form to [submit a case with Marketplace Operations](https://snowforce.my.site.com/s/provider-onboarding-case). You only need to file one case to cover both purchasing and offering listings.

## Supported consumer locations

To access paid listings as a consumer, the billing address registered to your account must be in one of the following countries:

* Australia
* Austria
* Belgium
* Bermuda
* Canada
* Cayman Islands
* Colombia
* Czech Republic
* Denmark
* Finland
* France
* Germany
* India
* Ireland
* Israel
* Italy
* Japan
* Kingdom of Saudi Arabia
* Luxembourg
* Mexico
* Netherlands
* New Zealand
* Norway
* Poland
* Portugal
* Singapore
* South Korea
* Sweden
* Switzerland
* United Arab Emirates
* United Kingdom
* United States

## Usage rules for all consumers

The following statements apply to all organizations that make one or more purchases:

* Any organization can pay for listings by using any of the accepted payment methods.
* All purchases are billed in US dollars.
* Taxes are calculated based on your organization’s shipping and billing addresses.
  This applies even if your organization has multiple locations or is international.

> **Note:**
>
> Snowflake does not support multiple billing entities for a single organization.

## Accept the combined Snowflake Provider and Consumer Terms

Before you purchase anything in the Snowflake Marketplace, an organization administrator needs to accept the combined
[Snowflake Provider and Consumer Terms](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-collaboration-consumer-terms). To learn more about organizations, see [Working with organizations and accounts](../guides-overview-manage.md).

> **Note:**
>
> If your organization intends to access only free listings, and you’ve accepted the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/), you do not need to accept the Snowflake Provider and Consumer Terms.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. To switch to the organization administrator role, in the lower-left corner, select your name » Switch role » ORGADMIN.
3. In the navigation menu, select Admin » Terms.
4. In the Snowflake Marketplace section, review your existing terms.
5. Select Review. The terms and conditions dialog opens.
6. If you agree to the terms, select Accept Terms & Conditions.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

## About billing for listings and data products

To understand how and when you’re billed, it can be helpful to know how Snowflake bills, what usage is billable, and how pricing models
can change your bill. When you purchase a listing, you receive a Snowflake Marketplace invoice. Snowflake Marketplace invoices are
separate from invoices for other Snowflake services, storage, or usage.
For Snowflake Marketplace billing, Snowflake uses an online payment processing service called [Stripe](https://stripe.com/).
As part of enabling your account to purchase listings, a payment account is automatically generated by Stripe, specifically to process
the payment. This is not the same as a Stripe account you might use to conduct your own business; instead, this account is
managed by Snowflake.

Snowflake Marketplace generates an invoice that enables you to pay using any of the supported payment methods. If your automatic monthly
payment fails, an email notification is sent notifying you of the payment failure. The email notification includes instructions for resolving the payment failure.

> **Note:**
>
> Snowflake Marketplace charges fall within a minimum and maximum amount as defined by the online payment processing service.
> This is explained in [Minimum and maximum charge amounts in the Stripe documentation](https://docs.stripe.com/currencies#minimum-and-maximum-charge-amounts). Only the information for US Dollars (USD)
> applies.

### Billing by pricing model

Each listing can have different pricing plans and each pricing plan bills in a different way:

* For [usage-based pricing plans](provider-listings-pricing-model.md),
  Snowflake invoices your account only for the months when you actually use a paid
  listing. If there is no billable event activity or user queries made on paid data,
  no invoices are generated.
* For [subscription-based pricing plans](provider-listings-pricing-model.md),
  billing can vary based on the plan.
  Snowflake invoices your account at the beginning of each billing term or access period.

For more information about pricing models, see [Paid listings pricing models](provider-listings-pricing-model.md).

### Billing by usage

For a usage-based plan, you’re billed for queries that access paid data within a share, even if the query returns no results.
For example, if a query scans a set of paid data but filters out every row in the results, it’s still counted as usage.

You incur charges when you interact with paid data, for example by using SELECT statements or DML statements (such as INSERT, MERGE).
You don’t incur charges running DDL statements unless you interact with data in a DDL statement,
for example by using CREATE TABLE AS SELECT.

While usage for listings is tracked daily — allowing you to monitor your consumption in real time — billing is processed monthly.
You can use the Snowflake dashboards to view your usage at any time, but usage charges are summed up
and reflected in the monthly invoice.

Some listings might include serverless features, which are billed based on compute resources consumed.
For more information about serverless feature billing, please refer to your contract or the Snowflake documentation.

## Pay for monetized listings

Invoices for monetized listings are sent to the billing email address that was provided to Snowflake. Invoices include information about the paid listing and the amount owed including all applicable taxes and fees.

When Snowflake adjusts the amount owed in an invoice, and it benefits a consumer, Snowflake issues a credit note to the consumer’s organization. When a consumer requests the cancelation and rebill of an invoice, Snowflake issues a credit note to the consumer’s organization and then reissues the invoice.

### Payment methods

Snowflake supports a variety of payment methods, including credit card, bank transfer, and Marketplace Capacity Drawdown Program funds.

The first payment method you set up is used as your default, unless you choose another supported payment method at the time of purchase. You can choose a different method for each purchase.

To set up a payment method, use the organization administrator (ORGADMIN) role, and follow the instructions for the appropriate billing method.

### Pay for listings with Snowflake Marketplace Capacity Drawdown Program funds

When you purchase a listing, the payment method defaults to MCD if this payment method is available for that
listing. If the listing is not MCD-compatible or if you don’t want to use MCD for this purchase, you can
complete your purchase by using a different payment method.

For more information about MCD, see [About committed capacity and Snowflake Marketplace Capacity Drawdown](marketplace-capacity-drawdown.md).

### Pay for listings with a credit card or a bank transfer

You can use one of the following payment methods to purchase listings:

* Credit card
* Bank transfer

If your organization selected a credit card as the payment method for paid listings when the listing was purchased, the credit card is automatically charged at the interval defined in the payment schedule.

When you first pay for a listing, the payment method you select is used for the period you access the listing. To change a listing payment method, see [Modify your listing payment method](consumer-listings-access.md).

### Activate an account

Before you can add a payment method, you need to activate your consumer account.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Click the Marketplace billing tab.
4. Click Consumer billing.
5. Click Activate account.

### Add a new credit card

Snowflake uses [Stripe](https://stripe.com/) to process Snowflake Marketplace payments. If you haven’t previously activated your consumer account on Stripe, you’ll be asked to activate your consumer account before the new credit card payment method is available.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Select the Marketplace billing tab.
4. Expand Payment methods and click Add credit card.
5. Complete the following fields:

   * Payment method display name: Optional. Enter a name for the payment method.
   * Card number: Enter the credit card number.
   * Expiration date: Enter the credit card expiration date.
   * CVC: Enter the credit card Card Verification Code (CVC). This is the three or four-digit security verification code printed on your credit card.
   * Full name: Enter the name of the credit card owner as it appears on the credit card.
   * Country or region: Select the credit card billing country.
   * Address line 1: Enter the address where credit card billing information should be sent.
   * Address line 2: Enter additional address information for credit card billing.
   * City: Enter the credit card billing city.
   * State/Province: Enter the credit card billing state or province.
   * ZIP/Postal code: Enter the credit card ZIP or postal code.
6. Click Add card.

   If there are issues with the credit card you added, a status messages appears within the tile for the credit card on the Marketplace billing tab.

### Delete a payment method

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Select the Marketplace billing tab.
4. Click Consumer billing.
5. Expand Payment methods and then select a payment method.
6. Click Delete.
7. Click Delete.

### Virtual bank account number confirmation

Snowflake accounts receivable includes a virtual bank account number (VBAN) confirmation as an attachment with your Snowflake Marketplace invoice. The VBAN is provisioned by Stripe and allows Snowflake to accept bank transfers from your organization.

## Manage your invoices

Consumers can view, download, and pay invoices in [Snowsight](../user-guide/ui-snowsight-gs.md). To access invoice information, the ACCOUNTADMIN role or the PURCHASE DATA EXCHANGE LISTING and IMPORTED PRIVILEGES privileges are required.

To grant privileges to a role as an ACCOUNTADMIN, see [Granting the IMPORTED PRIVILEGES privilege to other roles](../user-guide/data-exchange-marketplace-privileges.md).

To learn more about the PURCHASE DATA EXCHANGE LISTING privilege, see [PURCHASE DATA EXCHANGE LISTING privilege](../user-guide/data-exchange-marketplace-privileges.md).

### View all Snowflake Marketplace invoices

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Select the Marketplace billing tab.
4. Click Consumer billing.

   All invoices appear in the Marketplace invoices section. You can optionally sort your invoices by status, amount, invoice date, and due date.

### View Snowflake Marketplace invoice information

> 1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
> 2. In the navigation menu, select Admin » Billing.
> 3. Select the Marketplace billing tab.
> 4. Click Consumer billing.
> 5. Select an invoice in the Marketplace invoices list.
> 6. Optional. To download a PDF version of the invoice, select Download PDF .

### Pay a Snowflake Marketplace invoice

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Select the Marketplace billing tab.
4. Click Consumer billing.
5. Select an invoice in the Marketplace invoices list.
6. Select Make a payment.

## Manage your purchases

Learn how to manage or cancel payments, control access, or get support from the provider.

To monitor your usage of paid listings, see the following usage views:

* [MONETIZED_USAGE_DAILY (DATA_SHARING_USAGE) View](views/monetized-usage-daily-ds.md) in the
  [DATA_SHARING_USAGE](../sql-reference/data-sharing-usage.md) schema. (Usage at the account level)
* [MONETIZED_USAGE_DAILY (ORGANIZATION_USAGE) View](views/monetized-usage-daily-org.md) in the
  [ORGANIZATION_USAGE](../sql-reference/organization-usage.md) schema. (Usage at the organization level)

### Manage access to your purchases

Use the following access control privilege to control access to paid listings.

| Privilege | Object | Description |
| --- | --- | --- |
| PURCHASE DATA EXCHANGE LISTING | Account (that is, global privilege) | Grants ability to create a database from a paid listing that enables querying all data (paid and trial) in the database or application. Must be granted by the ACCOUNTADMIN role. |

### Contact the provider for support

You should always contact the provider of the listing directly, before contacting Snowflake. Use the support email identified in the
listing. Examples include cases where you need to do one of the following:

* Request a refund
* Report an issue with a product listing

If your issue remains unresolved, you can report it by filing a case with Marketplace Operations:
[Report an issue with a Data Marketplace Listing or Provider](https://snowflakecommunity.force.com/s/consumer-reporting).

### Cancel a purchase

To cancel access to a purchase, you need to use one of the following roles:

> * The account administrator role (ACCOUNTADMIN)
> * A role with the OWNERSHIP privilege granted on the database created from a listing

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. To switch to the account administrator role, in the lower-left corner, select your name » Switch role » ACCOUNTADMIN.
   You can use a custom role if the role has the requisite privileges.
3. In the navigation menu, select Marketplace » Snowflake Marketplace.
4. Select the purchase that you want to cancel.
5. On the page that opens, select Manage Purchase » Cancel Purchase.
6. Review the cancellation date so that you can verify when your access ends.
7. Confirm your choice to cancel.

---
title: Policies, guidelines, and enforcement of listings on Snowflake Marketplace
source: https://docs.snowflake.com/en/collaboration/policies-guidelines-enforcement.md
section: Collaboration & Marketplace
---

# Policies, guidelines, and enforcement of listings on Snowflake Marketplace

Marketplace currently permits the listing of data shares, Native Apps, and Connected Apps. Snowflake Marketplace policies establish the standards and expectations for Providers and Consumers to ensure a secure, transparent, and trustworthy experience. All Providers and listed products must comply with the policies and requirements outlined below.

The following sections describe:

[Policies](provider-consumer-policies.md): Provider and Consumer policies, listing and profile requirements, product standards, and monetization policies.

[Requirements for listing apps](guidelines-reqs-for-listing-apps.md): Enforced requirements and review standards for publishing applications on Snowflake Marketplace.

[Enforcement and appeals:](dispute-resolution-enforcement-appeal.md) Snowflake’s approach to dispute resolution, enforcement actions, and appeal procedures.

[Transactions, invoicing, and collections:](provider-transactions-invoicing-collections.md) Provider and Snowflake responsibilities for invoicing, taxes, collections, payouts, and transaction reporting.

---
title: Prepare data for a listing
source: https://docs.snowflake.com/en/collaboration/provider-listings-preparing.md
section: Collaboration & Marketplace
---

# Prepare data for a listing

This topic contains guidance for preparing to create a listing, including how to prepare a data product for different types of listings.

## Prepare to create a listing

Before you create a listing, do the following:

1. Decide how to offer your data product. See [Listing availability options](collaboration-listings-about.md) and [Listing access options](collaboration-listings-about.md).
2. Set up roles and privileges to simplify creating listings. See Set up roles and privileges for listings.
3. Identify the objects that you want to share. See Decide what to put in a listing.
4. Prepare the objects to be shared with others. See Prepare the shares for your listing.
5. Determine how you want to manage access to your data product:

   * Provide access for free, with no restrictions.
   * Charge for your listing by creating a paid listing. See Prepare to offer a paid listing.
   * Offer limited access to your data product as a free trial, then offer unlimited access to your data product by request.
     See Prepare to offer a limited trial listing.
6. Choose which cloud region(s) you want to offer your listing in. See Prepare your listing to be shared in other regions.

The listing and data share must be in compliance with the Snowflake [Provider Policies](https://www.snowflake.com/provider-policies/).

## Set up roles and privileges for listings

When you create a listing, you create it from the account that has the data or application package in it. The role that attaches a data
product to a listing and publishes the listing must be the same role that created, and therefore owns, the application package or share.
You cannot transfer the OWNERSHIP privilege for a share.

If you use a different role to create and manage the listing, grant the MODIFY privilege on the listing to the role
that owns the application package or share. For example:

Share or application package owner role:
:   OWNERSHIP privilege on the share or application package.
    MODIFY privilege on the listing.

Listing owner role:
:   OWNERSHIP privilege on the listing.

    Global CREATE LISTING privilege.

Within the provider account, you can use one of the following to create and manage listings:

ACCOUNTADMIN:
:   If you use the ACCOUNTADMIN role to create and manage listings, the ORGADMIN role must first
    [Delegate privileges to set up auto-fulfillment](provider-listings-auto-fulfillment-manage-privileges.md).

Custom role:
:   If you use a custom role, the ORGADMIN role must first [Delegate privileges to set up auto-fulfillment](provider-listings-auto-fulfillment-manage-privileges.md)
    to the ACCOUNTADMIN role, which can then be used to grant the relevant privileges to the custom role.

For more information about granting sharing privileges, see [Granting Privileges to Other Roles:](../user-guide/data-exchange-marketplace-privileges.md).

## Decide what to put in a listing

As you prepare to share data from your account with a listing, decide what to put in the listing.

First, make sure that the data you want to share is in Snowflake, and that you have the legal and contractual rights to share the data.
If needed, load the data that you want to share into Snowflake. See [Overview of data loading](../user-guide/data-load-overview.md).

> **Note:**
>
> To the extent any data in your listing or data set is governed by any laws or contractual obligations, you must ensure that you have the
> legal and contractual rights to share such data. For example, you can only share protected health information (PHI) through a personalized
> listing and, to do so, you must: (1) have signed a business associate agreement (BAA) with Snowflake and the Consumer receiving the PHI,
> and; (2) ensure that the Consumer has also signed a BAA with Snowflake. Also, while you can share personal data through both a free or
> personalized listing, to do so you must have the applicable legal and contractual rights if the data is not publicly available.

Next, decide how to offer the data that you have as a listing. If you plan to offer listings on the Snowflake Marketplace or only as
private listings directly with specific customers, you might make different decisions about what to place inside the listing.

* Consider the availability of your data.
* Consider the consumers that you expect to access your listings.
* Consider the formats of the data that you select for the share, such as a table, view, secure view, or other database object.

For example, if you want to provide listings about dog grooming, you might make decisions like the following:

* Offer a publicly available free listing on the Snowflake Marketplace with information about dog breeds and fur length.
* Offer a limited trial listing on the Snowflake Marketplace with a sample data product that contains data about the time it takes to
  groom a standard poodle, with the option for consumers to request a full data product about grooming insights for more dog breeds.
* Offer a limited trial listing on the Snowflake Marketplace with a data product that contains data about the time it takes to groom any
  breed of dog, with the option for consumers to request unlimited access to your data product.
* Offer a private listing to a partner organization with insights about the length of time it takes to groom various dogs, and the
  typical frequency of grooming appointments for different dog breeds.

In this example, you offer valuable data on the Snowflake Marketplace, but offer more specific insights to an organization that you already
have a trusted business relationship with.

## Prepare the shares for your listing

You can create a share before creating a listing, or select the database, tables, and views to comprise your data product when you create
the listing. See [Create and configure shares](../user-guide/data-sharing-provider.md).

If you plan to offer many listings, create shares separately from listings so that you can more easily manage your data product. You cannot
provide multiple listings from the same share.

### Consider how to keep shares updated

Consider the maintenance of the data in your share. Over time, you might need to make changes to your data shares as the information that
you want to provide in listings changes.

You also need to consider how to keep the data in shares updated, and make sure that the contents of the share are useful to consumers.

If objects in a share are dropped and later recreated, you need to add the recreated objects to the share so that they remain available to
consumers. For example, if you refresh some data in the share by dropping and recreating a table in the database, you need to update the
share to include the recreated table.

### Prepare the data to be shared

Prepare the data that you want to share in your listing to be shared with others.

* Use unquoted object identifiers for tables, columns, and share names. Use only upper case and alphanumeric characters for object names to
  let listing consumers use the shared data objects without having to double-quote identifiers. See [Identifier requirements](../sql-reference/identifiers-syntax.md).
* Protect sensitive data in shared databases. Create secure views and use secure objects to control access to data. See
  [Use secure objects to control data access](../user-guide/data-sharing-secure-views.md)
* You can add shares that are already shared with a consumer account, such as with a direct share, to a listing.
* A share can only be attached to one listing. If a share has already been attached to a listing, you cannot attach it to another listing,
  even if the listing has been deleted.
* Do not use account-level roles to protect data, such as with a policy or a secure view definition. Auto-fulfillment does
  not replicate account-level roles. For more information about this restriction, see [Auto-fulfillment for objects that depend on account roles](provider-understand-auto-fulfillment-objects.md).
  Instead, use database roles and the [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md) system function.
  For more information, see [Share data protected by a policy](../user-guide/data-sharing-policy-protected-data.md).

## Prepare to offer a limited trial listing

A limited trial listing lets you offer either a sample of your data product as a free trial, giving consumers insight into what might be
available from a full data product or limited time access to your full data product. Providers can set the availability period for limited trial listings from 1 to 90 days. For more information about limited trial listings, see [Limited trial listings](collaboration-listings-about.md).

If you choose to offer a sample of your full data product, the sample data product ideally provides a subset of the real data
included in your full data product and is representative of the full data product in the following ways:

* Contains the same columns.
* Contains the same or similar ranges and distributions of values in the data.

Limited trial listings include a data dictionary, so the general shape of the data in the full data product should be clear from the sample data product that you offer.

For example, if you are a dog training and grooming company, you might consider offering one of the following sample data products with a
limited trial listing:

| Sample data product recommendation | Sample data product example | Full data product example |
| --- | --- | --- |
| Contains a complete dataset for a specific complete attribute of the data. | Contains up-to-date grooming insights for a Standard Poodle. | Contains up-to-date grooming insights for all dog breeds. |
| Contains the full dataset for a specific, outdated time period. | Contains grooming insights and prices for all dog breeds from May, 2021. | Contains up-to-date grooming insights and prices for all dog breeds. |
| Contains synthetic data that is representative of the full data product. | Contains up-to-date insights and prices about training the fictional Acadian Hound dog breed. | Contains up-to-date insights and prices about training all dog breeds. |

Offering a relevant and complete subset of your full data product as the sample data product for your limited trial listing helps consumers
understand the value of your full data product and makes them more likely to request the full data product.

### Limit functionality of your Snowflake Native App for trial consumers

If you offer your Snowflake Native App on the Snowflake Marketplace as a limited trial listing and want to limit the functionality available to trial
consumers, use the [SYSTEM$IS_LISTING_TRIAL](../sql-reference/functions/system_is_listing_trial.md) system function when creating secure views, secure UDFs, or
Streamlit apps included in your Snowflake Native App.

Using the system function to control the visibility of data and UDF output means that you don’t have to maintain a separate
application package to limit functionality to trial consumers.

You can limit the functionality of the following:

* Secure view
* Secure user-defined function (UDF)
* Application logic, such as the setup script or a Streamlit app.

For more details about adding data content or UDFs to your application package, see:

* [Share data content in a Snowflake Native App](../developer-guide/native-apps/preparing-data-content.md).
* [Add application logic to an application package](../developer-guide/native-apps/adding-application-logic.md).

#### Example 1: Return different data in a view to consumers in a trial

To define a secure view that returns data only to consumers with access to the full version of your Snowflake Native App, you could use the
following example code:

```sqlexample
CREATE OR REPLACE SECURE VIEW limited_functionality_view
  AS
  SELECT *
    FROM db_name.schema_name.table_name
    WHERE SYSTEM$IS_LISTING_TRIAL() = false;
```

If a consumer that is trialing your Snowflake Native App attempts to query the view, they see no results.

#### Example 2: Show the output of a secure SQL UDF only to non-trial consumers

To define a secure SQL UDF `shared_function()` that returns results only to consumers with access to the full version of your Snowflake Native App,
you could use the following example code:

```sqlexample
CREATE OR REPLACE SECURE FUNCTION schema_name.shared_function()
  RETURNS VARCHAR
  AS
  $$
    CASE
      WHEN SYSTEM$IS_LISTING_TRIAL() = FALSE
        THEN 'full product'
      ELSE 'trial'
    END
  $$;
```

In this example, if a consumer is trialing your Snowflake Native App, when they call the secure UDF they see the output `trial`.

#### Example 3: Show a different Streamlit UI to trial consumers

You can also call the system function inside of a Streamlit app to limit the functionality of your Streamlit app in a Snowflake Native App.
For example, you can display one title in the UI to consumers that trial your Snowflake Native App, and another title to consumers with
full access to your Snowflake Native App.

```python
# Import python packages
import streamlit as st
from snowflake.snowpark.context import get_active_session

session = get_active_session()
# Here we assign result of our function to a variable
result = session.sql("SELECT SYSTEM$IS_LISTING__TRIAL()")

# Write directly to the app
if result:
  st.title("Enjoy your limited trial of this application!")
else:
  st.title("Welcome to the full version of this application!")
```

## Prepare to offer a paid listing

If you want to charge for your listing, you must do the following:

1. Determine if you can offer paid listings. See [Who can provide paid listings](provider-becoming.md).
2. Prepare the data to offer a trial of the data. See Prepare shares for a paid listing.
3. Decide on the pricing plan that best fits your listing. See [Paid listings pricing models](provider-listings-pricing-model.md) to review the available pricing plans.

### Where you can publish paid listings

Only providers in certain regions can publish paid listings. See [Who can provide paid listings](provider-becoming.md).

In addition, paid listings can only be published to certain regions. See [Supported consumer locations](consumer-listings-paying.md) to see to which
regions you can publish paid listings.

### Prepare shares for a paid listing

When you offer a paid listing on the Snowflake Marketplace, you must offer consumers the ability to trial the listing before they purchase it.
Trials are optional for paid private listings. As part of the trial, you can limit consumers to specific data and functionality, a specific
time period, or a combination.

If you choose to limit trial consumers to specific data and functionality, create a single share for your paid listing and use secure views and
a system function provided by Snowflake, [SYSTEM$IS_LISTING_PURCHASED](../sql-reference/functions/system_is_listing_purchased.md), to control which data is visible to
trial consumers and which data is available only to paying consumers.

> **Note:**
>
> If your listing includes a secure user-defined function (UDF), you cannot limit visibility of the UDF. Both paying customers and trial
> customers of your listing can view the secure UDF.

Refer to the following examples to create your own secure views to display different data to paying consumers and trial consumers.

If you want to allow trial consumers to use all data in your listing for a limited period of time, do not use the SYSTEM$IS_LISTING_PURCHASED
function in your view definitions for your share.

#### Example 1: Return data based on the purchase status of the account

Create a secure view that selects all columns in a table. The view returns rows only when queried within a consumer account that has
purchased your paid listing.

```sqlexample
CREATE SECURE VIEW paid_v
  AS
  SELECT
    *
  FROM
    paid_t
  WHERE
    SYSTEM$IS_LISTING_PURCHASED() = TRUE;
```

#### Example 2: Return a subset of rows based on the purchase status of the account

Create a secure view that returns a subset of rows based on the boolean value of a specific column in the data. In this example, the
underlying table contains a column named `is_free` that is used to determine which data to show to which consumers.

Some rows have `is_free` set to `TRUE`, indicating that the data in those rows can be shown to trial consumers. Other rows have
`is_free` set to `FALSE`, indicating that the data in those rows should be shown only to paying consumers.

This example view is set up to return all rows only when it is queried by a consumer account that has purchased the paid listing, otherwise
it returns only the rows where `is_free` is set to `TRUE`.

```sqlexample
CREATE SECURE VIEW paid_v
  AS
  SELECT
    *
  FROM
    paid_t
  WHERE
    is_free
    OR
    SYSTEM$IS_LISTING_PURCHASED() = TRUE;
```

#### Example 3: Return only the most recent rows based on the purchase status of the account

Create a secure view that returns only rows from the previous 7 days to a consumer account that is trialing, but has not yet purchased,
your paid listing.

This example uses a column with a timestamp data type to filter the data, but you can use other column data types in your secure view
definition.

```sqlexample
CREATE SECURE VIEW paid_v
  AS
  SELECT *
  FROM
    paid_t
  WHERE
    (timestamp > current_timestamp() - interval '7 days')
    OR
    SYSTEM$IS_LISTING_PURCHASED() = TRUE;
```

#### Validate secure views for paid and trial data

After you prepare your secure views, validate that you set them up correctly by simulating the experiences of paid and trial consumer
accounts. Run queries against the secure views to confirm that each type of consumer has access to the expected data.

> **Important:**
>
> This method does not validate whether consumers can securely access your data. This method only validates whether the share works as
> expected for your consumers.

To validate your shares, execute a query against a secure view using `SHARE_CONTEXT(SYSTEM$IS_LISTING_PURCHASED)`:

```sqlsyntax
EXECUTE USING SHARE_CONTEXT(SYSTEM$IS_LISTING_PURCHASED=>{ 'TRUE' | 'FALSE' })
  AS <query>
```

Where:

* `SYSTEM$IS_LISTING_PURCHASED` specifies whether you want to validate as a paid consumer, or as a trial or unpaid consumer. The valid
  values are:

  + `TRUE`, to validate the share as a paid consumer.
  + `FALSE`, to validate the share as a trial or unpaid consumer.
* `<query>` is the SQL query that you want to run against the secure view.

When you use the command to run your query, the query is executed against the share as though you are a consumer.

For example, suppose you have a share that you want to validate. Your share includes a secure view named `PURCHASED_VIEW`, which
protects all data from a table named `SHARE_TABLE`. You want to validate that the data can be accessed only by a consumer that
purchased the listing.

To confirm that trial consumers cannot access any data in the secure view, run the following query:

```sqlexample
EXECUTE USING share_context(system$is_listing_purchased=>'FALSE')
  AS
    SELECT
      *
    FROM
      example_database.example_schema.PURCHASED_VIEW
```

If the secure view works as expected and no data is accessible to trial consumers, your query returns the following response:

```sqlexample
Query produced no results
```

To confirm that your paid consumers have access to the data, run the following query:

```sqlexample
EXECUTE USING share_context(system$is_listing_purchased=>'TRUE')
  AS
    SELECT
      *
    FROM
      example_database.example_schema.PURCHASED_VIEW
```

If the secure view works as expected, your query returns all of the columns and rows in `SHARE_TABLE`, the desired outcome for paid
consumers.

## Prepare your listing to be shared in other regions

When you configure your listing, you can choose to offer it in different regions. Offering listings in other regions requires replicating data.

Consider the time it takes to replicate data and the costs involved in replication.

* For a listing in the Snowflake Marketplace, you can choose which regions to make your listing available in. When you do, you can manually
  replicate the data or use auto-fulfillment to make your product available to consumers that get your listing. See
  [Manually replicate data to fulfill a listing request](provider-listings-managing.md) or [Auto-fulfillment for listings](provider-listings-auto-fulfillment.md) for more details.
* For a private listing, you need to share your listing to the regions where your consumer’s accounts are. You use auto-fulfillment to
  replicate the product to the consumers that get your listing. See [Auto-fulfillment for listings](provider-listings-auto-fulfillment.md) for more details.

All cross-region data sharing at Snowflake uses Snowflake’s data replication functionality.
See [Share data securely across regions and cloud platforms](../user-guide/secure-data-sharing-across-regions-platforms.md).

---
title: Provider and consumer policies
source: https://docs.snowflake.com/en/collaboration/provider-consumer-policies.md
section: Collaboration & Marketplace
---

# Provider and consumer policies

## Overview

Our policies are intended to help ensure a safe, reliable, and respectful experience for Providers and Consumers.

This page outlines Snowflake’s official policies governing participation in the Marketplace – including Provider and Consumer expectations, listing and profile requirements, product standards, and monetization guidelines.

Together, these policies define how Providers and Consumers can responsibly engage in the Marketplace while maintaining compliance with Snowflake’s legal and operational standards.

> **Note:**
>
> Capitalized terms have the meaning given them in the Snowflake [Provider and Consumer Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/snowflake-marketplace/provider-and-consumer-terms/).

## Provider and consumer expectations

We expect Providers and Consumers to accurately represent themselves and be responsive to Product-related requests.

To report an issue with a Product or a Provider, please fill out [this form](https://snowforce.my.site.com/s/consumer-reporting?_fsi=VpAObqDf&_fsi=VpAObqDf).

### Misrepresentation and impersonation

Providers and Consumers must not mislead other Snowflake customers by impersonating anyone, or implying that their organizations are related to or authorized by someone that they aren’t, including through the use of inappropriate or incorrect logos, descriptions, titles, or other elements.

### Responsiveness

Providers and Consumers are expected to respond promptly, and respectfully, to Product and Marketplace-related inquiries and requests from Snowflake and other Snowflake customers, within three (3) business days.

### No infringing or unlawful provider materials

Providers must not provide any Provider Materials that are infringing, unlawful, or in violation of third-party rights, including third-party terms of service.

### Use of usage metrics

Any usage metrics that we share with a Provider about a Consumer’s use of the Provider’s Products are Confidential Information and must only be used for internal business purposes. Providers may not use usage metrics for publishing benchmarks or revealing adoption of listings. Providers may only use Personal Data supplied by us for marketing if the marketing communications are limited to their Products, such use is in accordance with their public-facing privacy notice, they have obtained all required consents, and they otherwise comply with applicable terms of the Snowflake Provider and Consumer Terms.

## Listing and profile content policies

Provider profiles must accurately describe Providers’ organizations, and Listing Information must accurately describe Providers’ Products and contain any information required by the Snowflake Provider and Consumer Terms and as described below.

### Profile requirements

The intent of the Provider profile is to clearly convey information about Providers and the Products that they offer. We require the following for all Provider profiles:

* **Profile name.** Your profile name should be the same as your company name and/or a name your company publicly does business as.
  Consumers should be able to identify your company through your profile name. In situations where two or more companies demonstrate the
  rights to the same profile name, Snowflake will review the profile submissions on a first come, first serve basis.
* **Single profile.** You may only have one profile per distinct legal entity. If you have multiple legal entities within your organization,
  you may create one profile for each distinct legal entity.
* **Description.** You must include an accurate description of your organization, highlighting its relevance to the Products you offer, and
  identifying your business entity. If your profile name does not match the organization name in the Product’s documentation, Listing Terms,
  or privacy notice, you must also clarify the relationship in the organization description.
* **Eligible entities.** Your business entity may be a C corporation or LLC in the United States, a registered nonprofit, or the equivalent
  entities in jurisdictions outside of the United States.
* **Logo.** You must include a clear and high-quality logo.
* **Publishing on behalf of another entity.** You may publish listings on behalf of other entities provided you have the necessary rights.
  When publishing listings on behalf of another entity under that entity’s name or brand, you may use that entity’s logo if you have the
  necessary rights. You should also include your entity name in your profile name (e.g., if Company A wanted to offer products on behalf of
  Company B, their company name could be “Company B by Company A”).
* **Contact information.** You must include up-to-date contact information for Snowflake and Consumers to contact you, with a business
  domain for all contact emails.
* **Links.** You may include links to information regarding your organization. You must include a link to your public-facing privacy notice
  applicable to all Consumer Personal Data collected by you or on your behalf.

### Listing practices

Providers’ Listing Information should accurately describe their Products and provide information regarding the applicable costs and Listing Terms, including any use restrictions, license grants, and other terms and conditions covering a Consumer’s use of their Products. Specifically:

* **Description.** The Product “Description” must include an introduction, details about the nature of the Product (for example, for data
  Products, details on the Product’s tables and fields and, for application Products, details about the Product’s functionality), and use
  cases for the Product.
* **Category.** The Product “Category” must reflect the Product listed.
* **Geographic coverage.** The Product “Geographic Coverage” selections must reflect the specific countries and regions that your Product
  addresses or represents.
* **Business needs.** The Product descriptions set out for each “Business Need” must be unique.
* **Documentation.** The “Documentation” link must provide additional information about the Product (e.g., sources and methodology utilized
  for compiling).
* **Links.** Links provided for the Provider’s “Documentation,” “Terms of Service,” and “Privacy Notice” must include or otherwise point to
  the applicable Product documentation, Listing Terms, and privacy notice, respectively. Links may only be included in the “Product
  Description” field if they help Consumers understand the Product.
* **Translations.** For Providers that wish to target Consumers in their local language, translated descriptions may be included in the
  Listing Description underneath the English version.
* **SQL examples.** Any SQL queries provided in any Provider Materials must work for all Consumers and produce the results advertised in any
  examples included in the Provider Materials.

### Listing videos and images

If a Provider chooses to include any Video or Image Content in the Listing Information, additional requirements include:

* **Enhance understanding.** The Content must enhance Consumers’ understanding of the Product and the Provider.
* **Rights to content.** The Provider must have the necessary rights for all the Content, including any music, graphics, artwork, and logos.
* **Accessible.** The Content must be immediately accessible for all Consumers (i.e. videos shouldn’t have embedding restrictions, videos
  should be set to the “public” or “unlisted” privacy setting, and images should be in accessible formats, such as jpg, png, webp).
* **Concise.** Video Content must be short (i.e., no longer than ten minutes). Images should highlight the essence of the Provider’s Product
  (e.g., UI screenshots or data valuations).

### Product promotion practices

Providers may not engage in practices to manipulate their position in the Marketplace discovery experience. For example, Providers may not create multiple undistinguished listings, manipulate keyword searching, or create multiple listings for the same Product.

### Links

Links must be relevant, functional, and available to all Consumers. The URLs for links may not be shortened. Providers may not make material changes to the content of their links after publicly available Listing Information has been approved. Changing the requirements to access linked web pages that did not exist at the time of approval (e.g., by adding a login requirement) is an example of a material change not allowed under these Policies.

### Personal data

Within the Listing Information, Providers may not include any actual or synthetic Personal Data, but Providers must describe any types of Personal Data included in their Products. If a Provider includes Personal Data in their Products, they must be legally authorized to share such Personal Data (including, if applicable, by registering the database with relevant authorities), have any required consents, and otherwise comply with the applicable terms of the Snowflake Provider and Consumer Terms. In addition, Products offered to Consumers publicly via the Marketplace may not disclose or reveal any Sensitive Personal Data.

### Product misrepresentation

Providers must accurately describe the Products they are offering. This includes, for Products that are datasets offered publicly via the Marketplace, accurate descriptions of the update frequency, the geographical scope, region availability, and the completeness of the fields in the tables/views.

### Inciting harmful or malicious use of products

Providers’ Listing Information may not include or advertise illegal content or suggest or encourage illegal, threatening, or violent uses.

### Off-platform promotion

Providers may not include in their Listing Information or Products any advertisements, promotions, or opportunities to access or use products or services outside of the Service or Marketplace.

### Total listings

Providers may not publish more than 100 individual listings on the Marketplace.

## Product policies

We expect Providers to deliver the Products they advertise in their Listing Information and to provide Consumers notice when they make changes.

### Product Requirements

* **Responsibility.** Providers are solely responsible for their Products and Listing Information, including any open-source materials.
  Providers must have all the necessary rights to share or sell their Products. If Providers include Personal Data within their Products,
  they must be legally authorized to share such Personal Data (including, if applicable, by registering the database with relevant
  authorities and obtaining any required consents).
* **Data aggregation.** If Providers share anonymized or aggregated data, they are responsible for structuring the data in such a way that
  it remains anonymous, even when combined with additional information.
* **Delivery.** Delivery of the Product must occur via a Share, Native App or Connected App.
* **Scope.** Products should be logically grouped and published as one listing; Providers cannot have multiple listings for the same
  Product.
* **Trial data.** Trial data should be a meaningful representation of the Product. For data shares, trial data should include sufficient
  rows, columns, or sample content to demonstrate core value. For Native Apps, trials may be feature- or duration-limited but must allow
  Consumers to adequately evaluate the app’s primary functionality.
* **On-platform.** Products must drive material on-platform consumption by Consumers and cannot be exclusively delivered off-platform.
* **Operational readiness:** Products that are installed or accessed must be fully functional and deliver the core advertised utility
  immediately upon the Consumer receiving access to the listing.

### Native Application requirements

* **Publishing requirements.** We require that applications meet the [Enforced standards](guidelines-reqs-for-listing-apps.md).
* **Security requirements.** Providers must comply with our [Native Apps Framework security requirements](../developer-guide/native-apps/security-overview.md).

### AI product requirements

Providers offering AI-enabled Products, including **Semantic Views**, **Cortex Knowledge Extensions**, and **Agents**, must ensure that such Products function as advertised and are listed under the appropriate AI Product category. AI Products must include 2–3 representative example prompts demonstrating expected behavior.

If an AI Product uses any type of LLM, the Provider must disclose:

* The specific model and version utilized.
* A plain-language summary of the underlying logic.
* Any safety guardrails applied to the final output.
* All applicable transparency legal requirements.

AI Product functionality will be evaluated as part of Snowflake’s listing review process, and Providers must ensure that the Product reliably performs its core advertised functionality.

### Product Access

Generally, Providers may delist their Products at any time so that new Consumers do not discover them. To delete a listing, Providers must allow Consumers who are accessing or using the applicable Product to continue to access and use the Product for the time periods specified in Snowflake’s “listing retirement” requirements in the Documentation.

### Product continuity

Providers may incorporate improvements to their Products, but they may not otherwise materially change their Products, including by removing core fields or significantly reducing the update frequency.

## Monetization

### Monetization eligibility

Monetized listings on Snowflake Marketplace are available to qualified partners who demonstrate clear go-to-market readiness.

* Before submitting a paid listing, partners must engage with their Snowflake Partner Manager to review and validate their monetization
  strategy.
* On-Demand customers who do not have a Snowflake Partner Manager may request approval by submitting [this case form](https://snowforce.my.site.com/s/provider-onboarding-case) to Snowflake for review. The review may include a vetting call to evaluate
  the provider’s go-to-market readiness for offering paid listings on Snowflake Marketplace. Approval for monetization is determined by
  Snowflake based on the outcome of this review.

Refer to the documentation pages below for additional details and requirements to pay for listings as a consumer and offer paid listings as
a provider:

* [Pay for listings](consumer-listings-paying.md)
* [Who can provide paid listings](provider-becoming.md)

### Product delivery requirements

Providers must ensure that the final Product delivered to and used by Consumers materially matches the Product advertised in the listing, including the data product type.

Providers are thereby strictly prohibited from engaging in practices including, but not limited to:

* Fulfilling a paid transaction by delivering a materially different Product than the one described in the listing.
* Using an approved or private listing as a mechanism to transact for unrelated or substitute products.
* Bundling additional products, services, or support besides what’s advertised in the listing.

If the actual Product delivered fails to match the Product described in the listing, or if delivery occurs off-platform in a manner inconsistent with these requirements, Snowflake may take remediation actions, including requiring corrective updates, unpublishing the listing, restricting monetization privileges, or other enforcement actions.

### Marketplace capacity drawdown restrictions

Providers may not offer any Product that gives Consumers the ability to convert a Consumer’s Snowflake capacity commitment into cash, cash equivalents, or other forms of monetary value for the Consumer. This policy applies to transactions related to the Snowflake Marketplace Capacity Drawdown Program or any other similar program.

For more details on MCD, refer to [About committed capacity and Snowflake Marketplace Capacity Drawdown](marketplace-capacity-drawdown.md).

---
title: Provider workflows
source: https://docs.snowflake.com/en/collaboration/provider-listings-workflows.md
section: Collaboration & Marketplace
---

# Provider workflows

This section describes the workflows that providers follow to become a Snowflake Marketplace provider and to offer data/share listings and Snowflake Native App listings on Snowflake Marketplace.

## Provider approval process

The image below shows the approval process for becoming a Snowflake Marketplace provider. The steps below the image describe the actions that providers take when following the workflow.

1. If you don’t have one already, [create a Snowflake account](https://signup.snowflake.com).
2. Configure your provider profile.

   1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
   2. In the navigation menu, select Marketplace » Provider Studio.
   3. Select the Profiles tab.
   4. Select + Create profile » External profile.

   For more information, see [Manage your provider profile](provider-profiles-managing.md).
3. Submit your profile for approval.

Snowflake will review your profile and respond to you within approximately 1 business day.

> * If your submission is approved, you can then begin publishing listings on Snowflake Marketplace.
> * If your submission is rejected because of a policy violation, Snowflake will provide instructions via email on what needs to be corrected. You can then revise your profile and resubmit it.
>
>   For more information on details for provider profile requirements, see the Snowflake [Provider and Consumer Policies](https://www.snowflake.com/en/legal/provider-and-consumer-policies/).

> **Note:**
>
> Provider profiles won’t be visible on Snowflake Marketplace until a public listing is published.

## Data/share listing approval flow

The following image shows the approval process for publishing listings on Snowflake Marketplace. The steps below the image describe the actions that providers take when following the workflow.

1. Identify the objects that you want to share in a listing.

   For more information, see [Prepare to create a listing](provider-listings-preparing.md).
2. Create a listing and submit it for publishing.

   1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
   2. In the navigation menu, select Marketplace » Provider Studio.
   3. To create your listing, select Create Listing » Snowflake Marketplace.

      For more information on how to create a Snowflake Marketplace listing, see [Share data or apps publicly on Snowflake Marketplace](provider-listings-creating-publishing.md).
   4. Submit your listing for approval.
3. Snowflake reviews the listing metadata and will respond within approximately 1 business day.

   * If your listing is approved, then it will be published and available on Snowflake Marketplace.

     > **Note:**
     >
     > If you submitted your listing using manual publishing, the listing will not be published. The listing will remain approved until you manually publish the listing. For more information, see [Submit your listing for approval](provider-listings-creating-publishing.md).
   * If your listing is rejected, Snowflake will provide instructions via email on what needs to be corrected. You can then revise your listing and resubmit it.
   > For more information on details for provider profile requirements, see the Snowflake [Provider and Consumer Policies](https://www.snowflake.com/en/legal/provider-and-consumer-policies/).

## Snowflake Native App listing approval flow

The image below shows the approval process for a Snowflake Native App listing on Snowflake Marketplace. The steps below the image describe the actions that providers take when following the workflow.

> **Note:**
>
> Snowflake recommends that you test your application by privately sharing it with another account prior to submitting for publishing. This may expedite the review process.

1. Create a Snowflake Native App package.

   For more information, see [Tutorial 1: Create a basic Snowflake Native App](../developer-guide/native-apps/tutorials/getting-started-tutorial.md).
2. To initiate an automated security scan, [set the DISTRIBUTION property for the application package](../developer-guide/native-apps/security-run-scan.md) to `EXTERNAL`.

   * If the automated security scan fails, Snowflake will perform a manual security review that can take approximately 3 business days.
   * If the Snowflake Native App uses Snowpark Container Services (SPCS), then you must complete a [security questionnaire](https://docs.google.com/forms/d/1XLjbcSrp689kXEvVELa6KbEUOPfsJIirSTG5pGQDMZE/viewform?ts=65fb4866&edit_requested=true). After the questionnaire is approved, the automated security scan starts.
3. Create a listing and submit it for approval.

   For more information, see [Publish an app to consumers](../developer-guide/native-apps/ui-provider-publishing-app-package.md).
4. Snowflake reviews the listing metadata and conducts a functional review of the Native App, ensuring that it meets all Snowflake Marketplace [enforced requirements](../developer-guide/native-apps/publish-guidelines.md).

   * If your listing is approved, it will then be published and available on Snowflake Marketplace.

     > **Note:**
     >
     > If you submitted your listing using manual publishing, the listing will not be published. The listing will remain approved until you manually publish the listing. For more information, see [Submit your listing for approval](provider-listings-creating-publishing.md).
   * If your listing is rejected, Snowflake will reach out using the emails listed in the profile contacts (business and technical) with feedback on the application. Reviews may take up to 14 days.

---
title: Remove listings as a provider
source: https://docs.snowflake.com/en/collaboration/provider-listings-removing.md
section: Collaboration & Marketplace
---

# Remove listings as a provider

When you delete a listing, you permanently remove the listing. A deleted listing cannot be recovered
or republished to the Snowflake Marketplace.

You must have MODIFY or OWNERSHIP privileges on a listing to delete a listing.

## About removing listings, remote data, and data access

After a provider deletes a listing, no one can access the listing page. However, the listing might appear in search results until the change propagates throughout the system.

If there are existing consumers, they are immediately notified by email that the listing is being retired. The notification includes the date when listing will become unavailable. That date is based on the type of listing and the date when the provider initiated the removal.

Snowflake Marketplace consumers get to keep access to the listing for a period of time, called a retirement window, between the date when the notification is sent and when the listing becomes inaccessible. This window gives consumers time to plan for the change, reducing their risk of data loss. During the retirement window, consumers still retain access to the listing. The data auto-fulfilled to remote regions for the deleted listing remains in place and available to existing consumers throughout the retirement window.

> **Note:**
>
> When a listing is retired, the consumer billing contact is notified by email of the listing retirement.

**Paid listings**

* Paid listings are those that are paid for through Snowflake Marketplace.
* The retirement window for a paid listing always contains one full calendar month, regardless of the number of days in the month.
* Providers can unpublish advance payment listings from the Snowflake Marketplace immediately, but they must fulfill all existing consumer subscription terms for the unpublished listing.
* If you delete a listing on the first day of the month (March 1, for example),
  the retirement window continues through the last day of the month (March 31). The effective date for the removal of the listing is the first day of the next month (April).
* If you delete a listing after first day of the month (March 2, for example),
  the retirement window continues until the first and last day of a complete month pass. The effective date for the removal of the listing is the first day of the next month (May).

**Free and limited trial listings**

* Free listings include listings that are provided at no charge through the marketplace.
  Free listings also include listings that are paid for on a platform outside Snowflake.
* The retirement window for free and limited trial listings is exactly 30 days from the
  date of deletion. For example, let’s say you delete the listing on March 10. The effective date for the removal of the listing is April 9.

When the retirement window closes on the date of removal, the listing is no longer accessible to consumers. If the data was replicated to other regions using Cross-Cloud Auto-Fulfillment, it is removed from those regions on the effective date of removal.

> **Warning:**
>
> Deleted listings cannot be recovered.

## Delete a published listing

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Listings tab, then select the published listing you want to delete.
4. In the top-right corner of the listing page, select the vertical ellipsis (), and then select Unpublish to
   begin the unpublish process.

   A confirmation message displays, reminding you that unpublishing a listing removes the listing from Snowflake Marketplace, but existing
   consumers will continue to have access to the listing.
5. Select Unpublish to complete the unpublish process.

   The status at the top of the page changes from Live to Unpublished. You can select that status to view the listing status summary.
6. After the listing is unpublished, select the vertical ellipsis () again, and then select Initiate removal to
   begin the listing removal process.
7. Review the Delete Listing summary, and then select Delete to confirm that you want to delete the listing and complete the removal process.

   > **Warning:**
   >
   > This removal process cannot be reversed.

   When the listing is removed, the status at the top of the page changes from Unpublished to Pending retirement.

   Listings that are pending retirement are no longer accessible to new consumers, and existing consumers will lose access after a period of
   about 30 days.

   Select the Pending retirement status at the top of the listing page to see the exact date when existing consumers will lose access
   to the listing.

---
title: Reshare incoming data as a resharer
source: https://docs.snowflake.com/en/collaboration/resharing-as-resharer.md
section: Collaboration & Marketplace
---

# Reshare incoming data as a resharer

As a resharer, you can take data from a provider’s listing and share it with other accounts, either in its original state or transformed
with your own data. This topic describes how to reshare incoming data.

## Prerequisites

* The provider’s listing must have `resharing.enabled` set to `true`.
* You must create secure views in your own database. You can’t modify the imported database directly.
* The same role must create the share and grant it to the listing.

## Limitations

* Resharing is only enabled via listings. You can’t reshare direct shares or apps.
* You can’t attach data objects from imported databases or Uniform Listing Locators (ULLs) directly to another share. To reshare data
  objects from an incoming listing, you must create a secure view in your database.
* Resharers can only reshare tables, dynamic tables, and views from the incoming data products allowed for resharing.
* Reshared listings don’t support disaster recovery.

## Resharing workflow

1. Create an imported database from the provider’s listing.
2. Create a secure view in your own database that references data from the imported database.
3. Create a share and grant SELECT on the secure view to the share.
4. Create a new listing using the share.

```sqlexample
CREATE DATABASE imported_db FROM LISTING provider_listing;
CREATE DATABASE reshared_db;
CREATE SECURE VIEW reshared_db.public.reshared_view
  AS SELECT * FROM imported_db.public.provider_table;

CREATE SHARE my_reshare;
GRANT USAGE ON DATABASE reshared_db TO SHARE my_reshare;
GRANT USAGE ON SCHEMA reshared_db.public TO SHARE my_reshare;
GRANT SELECT ON VIEW reshared_db.public.reshared_view TO SHARE my_reshare;
```

> **Note:**
>
> A REFERENCE_USAGE grant isn’t required on imported databases created from reshared listings.

## Cross-region resharing

> **Note:**
>
> Be sure that you understand [auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md) before you enable
> auto-fulfillment for your reshared listings. Snowflake also provides several views to monitor auto-fulfillment costs and usage. For more
> information, see [Monitor resources and view costs](provider-listings-auto-fulfillment-monitor-view-costs.md).

To reshare data to consumers in other regions, listing auto-fulfillment must be enabled. The `auto-fulfillment` property
includes a `warehouse` field that you must specify when resharing across regions. This field can be omitted when resharing within the
same region.

Resharing data cross-region requires a local copy of the data for further replication downstream. Snowflake automatically creates dynamic
tables to manage this. The warehouse you specify is used to create and refresh these dynamic tables.

```yaml
auto_fulfillment:
  warehouse: my_wh
```

You can use the [SYSTEM$SHOW_DYNAMIC_TABLES_CREATED_FOR_RESHARING](../sql-reference/functions/system_show_dynamic_tables_created_for_resharing.md) system function to view the dynamic tables created for
resharing.

## Enabling further resharing by your consumers

If you want your consumers to further reshare the listing you created, enable resharing on your own listing by setting
`resharing.enabled` to `true`. For details on configuring this as a provider, see
[Using resharing as a provider](resharing-as-provider.md).

## Troubleshooting

If consumers receive the error “The listing has resharing restrictions that prevent access to the underlying data,” work with the
provider to resolve the issue. This error can occur when:

* The provider disables resharing by setting `enabled` to `false`.
* The provider adds or changes governance policies or context functions on the base tables that aren’t compatible with resharing.

---
title: Resharing listings
source: https://docs.snowflake.com/en/collaboration/reshare-listings.md
section: Collaboration & Marketplace
---

# Resharing listings

With resharing, providers can allow consumers to share their incoming listings with other Snowflake accounts, either within the same region
or across regions. Resharing extends the reach of your data products without requiring you to manage individual sharing relationships with
every downstream consumer.

There are three roles involved in resharing, each representing a different Snowflake account:

* **Provider**: The account that owns the original data and publishes the listing. The provider can turn on or off resharing on their
  listings to control downstream resharing of their data.
* **Resharer**: A consumer of the provider’s listing who creates new views from the incoming listing and shares them with other accounts.
* **Consumer**: The account that accesses the reshared listing from the resharer.

> **Note:**
>
> Multi-hop resharing is possible, so a resharer may reshare data that itself has been reshared with them.

Resharing supports several scenarios, including:

* **Resharing external listings within an organization**: A company might acquire weather data on the Snowflake Marketplace or privately
  from an external vendor. The central data team might then want to reshare the incoming listing after applying some entitlements to the
  rest of the organization on the internal marketplace or as private listings.
* **Resharing internal listings within an organization or externally**: Within an internal marketplace, a marketing team creates a
  reshareable listing and provides that listing to the sales team. The sales team accesses that data and uses parts of it, such as a view,
  and incorporates that into their own data product. The sales team then shares that transformed data product with the finance team. The
  sales team might also take some elements of this incoming dataset, apply policies, and share externally with their partners.
* **Resharing cross-region**: A company has Snowflake accounts throughout the world, with a central company-wide warehouse in Germany. An
  account in Malaysia shares data with a Snowflake account in Germany. That data can then be reshared to a second account in the same
  region without creating any copies. For cross-region resharing, Snowflake automatically replicates the data to the target region.

## Key features

* **Expand reach**: Providers can share data publicly or privately and enable that data to be reshared, making their data products more
  valuable to their customers.
* **Low operational and storage costs**: Consumers don’t have to create and maintain physical copies of data.
* **Ability to transform on reshare**: Consumers can optionally manage or transform the data product and then reshare it within their
  internal marketplace with colleagues, or privately share it with business partners.
* **Value-added reseller bundling**: Bundle incoming listings from other vendors as part of your Native App or data products.
* **Cross-region auto-fulfillment**: When a resharer shares listing data to another region, listing auto-fulfillment replicates the data to
  the target region. The provider doesn’t incur additional costs for this replication. Replication costs are attributed to the resharer.

## Resharing listings workflow

A typical workflow for resharing includes a minimum of three parties.

1. The provider who owns the original data product shares their data product publicly or privately with a consumer (Consumer A).
2. Consumer A creates a view that references the shared data and then shares these *outgoing* views publicly or privately with a
   second-level account (Consumer B).
3. Consumer B retrieves and uses the reshared data product.

## Resharing access control requirements

The roles and privileges for resharing a listing are the same as those for [Creating a listing](../user-guide/collaboration/listings/organizational/org-listing-create.md).

For provider-specific information, see [Using resharing as a provider](resharing-as-provider.md).

For resharer-specific information, see [Reshare incoming data as a resharer](resharing-as-resharer.md).

---
title: Set the account-level refresh interval
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-set-refresh-interval.md
section: Collaboration & Marketplace
---

# Set the account-level refresh interval

If your data product is an application package that is auto-fulfilled to remote regions, updates to your product occur following a refresh
frequency that you set at the account level.

If you have the ACCOUNTADMIN role, you can change the refresh interval for the account using Snowsight or a SQL command.
When you do this, you update the auto-fulfillment refresh interval for every application package published by your account.
This refresh interval does not affect listings with shares attached.

SnowsightSQL

To set the refresh frequency for your application using Snowsight, you must use the ACCOUNTADMIN role and complete the following steps:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the row for the listing that you want to manage.
4. From the listing details page, access the Auto-fulfillment settings:

   1. For a listing offered on the Snowflake Marketplace, in the Region Availability section, select Manage.
   2. For a listing offered to specific consumers, in the Consumer Accounts section, select ….
5. Select Update Refresh Frequency to update the refresh interval and frequency of your data product.
6. Select a frequency at which to refresh your data product, such as every minute or up to once every 8 days.

   The refresh frequency you select affects all application packages published by your account. You can show
   all listings affected by the refresh frequency change before you make the change.

   You can specify the refresh frequency, but the scheduled time when the refresh occurs in a region is based on the date and time that a
   consumer in that region first requests your data product.
7. Select Update to save the updated refresh frequency.

To set the refresh frequency for your application using SQL, you must use the ACCOUNTADMIN role and run the following command:

```sqlsyntax
ALTER ACCOUNT SET LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE = '<schedule>'
```

Where:

`schedule`:
:   The time interval at which to refresh the data product to other regions. Specify a time period in minutes, including the unit
    `MINUTES`.

For example, to set the Auto-fulfillment refresh frequency for every application package published by your account to every hour,
run the following:

```sqlexample
ALTER ACCOUNT SET LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE = '60 MINUTES'
```

> **Note:**
>
> The refresh schedule for a data product in a region is based on the date and time that a consumer in that region first requests your
> data product. You can also use cron expressions to set listing schedules. For more information, see [LISTING_AUTO_FULFILLMENT_REPLICATION_REFRESH_SCHEDULE](../sql-reference/parameters.md).

---
title: Set up auto-fulfillment
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-setup-steps.md
section: Collaboration & Marketplace
---

# Set up auto-fulfillment

This section describes how to set up Cross-Cloud Auto-Fulfillment (auto-fulfillment) for secure share data products and application package data products. It also describes how to set up object-level auto-fulfillment for a listing.

You must add a data product to your listing before you can set up auto-fulfillment. Also, the steps to set up auto-fulfillment differ depending on the data product you offer and how you make your listing available.

## Set up auto-fulfillment for a secure share data product shared on the Snowflake Marketplace

If your data product is a secure share that you publish to the Snowflake Marketplace using a listing, use the following steps to
set up auto-fulfillment:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the listing for which you want to set up auto-fulfillment.
4. Select Region Availability » Edit.
5. For Region availability, choose your desired availability.

   * By default, All regions is selected. This ensures the availability of your listing in any future regions added by Snowflake.
   * If your listing has specific regional limitations, change the region availability to Custom regions and select the regions in which you want to offer your data product. When you choose custom regions, your listing is visible in all current Snowflake Marketplace regions, but consumers can only get your data product in the regions you specify. Your listing will not be available in any new regions automatically.
   * For paid listings, Custom regions is selected by default. Paid listings are only available in [supported regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-paying#label-monetization-consumer-region-support) and any future supported regions added by Snowflake.
6. For Fulfillment method, Automatic fulfillment is the default selection. With Cross-Cloud Auto-Fulfillment, your data product is automatically fulfilled to a region and you incur costs only when there is consumer demand in that region.

   > If you can’t use auto-fulfillment and the option is available, select Manual to manually replicate your data product. See [Manually replicate data to fulfill a listing request](https://other-docs.snowflake.com/en/collaboration/provider-listings-managing#label-manually-replicate-listing).
7. If you select Automatic for auto-fulfillment:

   1. Select a refresh interval from the drop-down list, then enter a value. You must select a refresh interval of at least 8 days.
   2. If you don’t have a default warehouse set, select a warehouse to use for auto-fulfillment.
   3. When you add a data product to your listing, Snowflake performs a compatibility check to validate that your data product can be auto-fulfilled to other regions. If the check returns any incompatibilities, you might need to update your data product. See [Troubleshooting auto-fulfillment](provider-listings-auto-fulfillment-troubleshooting.md).
   4. Select Save and Enable Fulfillment.

      Auto-fulfillment for the listing is now enabled, but the data product attached to the listing is not fulfilled to any regions
      until the listing is published and a consumer requests the data product. See [How auto-fulfillment works](provider-listings-auto-fulfillment.md).
8. If you chose to manually fulfill the listing, select Save. Before publishing the listing, you must replicate data to each of the available regions you select. See [Manually replicate data to fulfill a listing request](https://other-docs.snowflake.com/en/collaboration/provider-listings-managing#label-manually-replicate-listing).

## Set up auto-fulfillment for an application package data product shared on the Snowflake Marketplace

If your data product is an application package that you publish to the Snowflake Marketplace with a listing, use the following steps to
set up auto-fulfillment:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the listing for which you want to set up auto-fulfillment.
4. Select Region Availability » Edit.
5. For Region availability, choose your desired availability.

   * By default, All regions is selected. Choosing all regions ensures the availability of your listing in any future regions added by Snowflake.
   * If your listing has specific regional limitations, change the region availability to Custom regions and select the regions in which you want to offer your data product. When you choose custom regions, your listing is visible in all current Snowflake Marketplace regions, but consumers can only get your data product in the regions you specify. Your listing will also not become automatically available in any new regions.
   * For paid listings, Custom regions is selected by default. Paid listings are only available in [supported regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-paying#label-monetization-consumer-region-support) and any future supported regions added by Snowflake.
6. Review the refresh interval configured at the account level. If you need to use a different refresh interval, see [Set the account-level refresh interval](provider-listings-auto-fulfillment-set-refresh-interval.md).
7. If you don’t have a default warehouse set, select a warehouse to use for auto-fulfillment.
8. Select Save and Enable Fulfillment.

   Auto-fulfillment for the listing is now enabled, but the data product attached to the listing is not fulfilled to any regions until the listing is published and a consumer requests the data product. See [How auto-fulfillment works](provider-listings-auto-fulfillment.md).

## Set up object-level auto-fulfillment

You can configure auto-fulfillment to automatically transfer the data product associated with your listing to other Snowflake regions. You also can use SUB_DATABASE auto-fulfillment and choose to fulfill only the tables and views in a data product to a remote region using auto-fulfillment. This can help reduce costs and ease the manageability burden of your auto-fulfilled data product.

The steps below describe how to set up object-level auto-fulfillment for a listing. As part of a typical workflow, you set up object-level auto-fulfillment when you set up the region availability (for a listing published on the Snowflake Marketplace) or when you add a consumer located in another region (for a listing shared privately).

Snowsight

1. Create a listing. See [Create a new listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).
2. Add a data product that contains only supported objects.
3. Set up regions or accounts to share with to start setting up auto-fulfillment:

   For a listing published to the Snowflake Marketplace:

   > 1. Locate the Region Availability section and select Add.
   > 2. For Region availability, keep the default of All regions or select Custom regions for your listing.

   For a listing shared privately, add a consumer account in a remote region.
4. Select your preferred refresh interval for updating the data product in remote regions.
5. Publish your listing or save it as a draft.

## Set up auto-fulfillment for a listing that spans databases

Providers can create a single listing that spans databases, eliminating the need to create one combined database per listing. In this case, all listings associated with a database are auto-fulfilled together.

### Workflow

1. A provider has a database (main database) that they want to share. They also have views in that database that reference objects in another database (referenced database).
2. The provider creates a share in the main database.
3. Using [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md), the provider grants the following required privileges to the share:

   1. The provider grants the USAGE privilege on the main database that contains the view to the share.
   2. The provider grants the REFERENCE_USAGE privilege on the referenced database to the share.

      > **Note:**
      >
      > Setting the auto-fulfillment refresh type to `FULL_DATABASE` is deprecated and isn’t supported for reference usage grants.
   3. The provider grants the USAGE privilege on the schema that contains the view to the share.
   4. The provider grants the SELECT privilege on the view to the share.
4. The provider creates a listing with the share and enables [auto-fulfillment](provider-listings-auto-fulfillment.md) for cross-region cross-cloud consumers.

For more information, see [Share data from multiple databases](../user-guide/data-sharing-multiple-db.md).

### Supported reference types

When REFERENCE_USAGE is granted on a database to a share, the following reference types are supported:

* A view referencing a table or view in another database.
* Tables or views with policies when these policies are stored in another database.
* Tables or views with tags when these tags are stored in another database.

  > **Note:**
  >
  > A tag without an attached policy in a different database will only be replicated if reference usage is granted. Otherwise, replication will be skipped. See [GRANT <privilege> … TO SHARE](../sql-reference/sql/grant-privilege-share.md) for more information. If the tag is used in tag-based masking, then the share is treated as a table or view with row-access policies.

### Limitations

* Snowflake groups listings together when refreshing the data. Setting up listings that span multiple databases can change the way listings are grouped. As a result, the following might be affected:

  + The listing refresh history can be missing or incorrect after update the auto-fulfillment schedule.
  + Setting the `refresh_schedule_override` option may be required. When this option is missing, a resulting error message will include the list of listings that were affected by the change in the order that the listings were grouped.
* Setting the auto-fulfillment refresh type to `FULL_DATABASE` is deprecated and isn’t supported for reference usage grants.

### Usage notes

When setting up auto-fulfillment, if the selected and referenced databases include existing listings, then the values in the Data product refresh section default to the existing refresh schedule. As a result, changes to the auto-fulfillment refresh schedule apply to all other listings associated with this database and with the referenced database.

### Examples

For examples on how to create a secure view that references objects and other views in one or more databases, see the [Share data from multiple databases examples](../user-guide/data-sharing-multiple-db.md).

After you create a secure view, you can create a listing that includes the secure view and set up auto-fulfillment on the listing. For examples on how to create listings on the Snowflake Marketplace, see [Create and publish a listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).

---
title: Snowflake Marketplace Provider Expectations for Transactions, Invoicing, and Collections
source: https://docs.snowflake.com/en/collaboration/provider-transactions-invoicing-collections.md
section: Collaboration & Marketplace
---

# Snowflake Marketplace Provider Expectations for Transactions, Invoicing, and Collections

This page explains the respective responsibilities of Marketplace Providers and Snowflake for invoicing,
taxes, collections, payouts, and transaction reporting for monetized Snowflake Marketplace transactions.

## Seller of record

The **Provider is the Seller of Record** for their Product(s) (not Snowflake) and is fully responsible for their
Product(s), the applicable listing terms, and the commercial terms offered to the customer.

The Provider remains responsible for dispute and refund discussions, as well as customer communication related to
its products, invoices, and listing terms. Providers may communicate directly with customers regarding their
products, transactions, and invoices, subject to the Snowflake Provider and Consumer Terms and the Provider’s
privacy policy.

For more information on disputes and enforcement, refer to
[Dispute resolution, enforcement, and appeals](dispute-resolution-enforcement-appeal.md).

## Invoicing and taxes

For monetized transactions, **Snowflake** invoices customers on the Provider’s behalf in USD.

**Snowflake Marketplace acts as a marketplace facilitator** and, where required by law, calculates,
collects, and remits applicable taxes and issues tax-compliant invoices as applicable.

In jurisdictions where Snowflake is not registered or authorized to collect a transaction tax, the **provider**
may need to assess, invoice, collect, remit, and report that tax directly. Providers should consult their own
tax advisors.

For additional tax information, refer to the
[Snowflake Marketplace taxes overview for consumers](https://www.snowflake.com/marketplace-taxes-overview-for-consumers/).

## Collections

**Snowflake** makes a best-effort attempt to collect payment on the Provider’s behalf for monetized
Marketplace transactions, including automated payment reminders for overdue invoices. Snowflake
provides support to customers with billing-related questions.

**Providers** are ultimately responsible for collecting payment and taking any necessary action in the event
of consumer nonpayment.

Snowflake provides reporting tools to help Providers monitor invoices, payments, payouts, and usage, including:

1. UI path: Marketplace ‣ Provider Studio ‣ Invoices
2. SQL views: [DATA_SHARING_USAGE schema](../sql-reference/data-sharing-usage.md)

## Payouts, Net Terms and fees

**Stripe** is the payment processor for Snowflake Marketplace payouts. To receive payouts, Providers must set up their
payout method through Snowflake Marketplace. During setup, Providers may either create a new
Stripe Connect account or connect an existing Stripe account. There are no
additional costs to Providers for using the Snowflake Stripe Connect account. As part of the setup process,
Providers must accept the [Stripe Connected Account Agreement](https://stripe.com/legal/connect-account).
Those terms are between the Provider and Stripe, and Snowflake is not a party to that agreement.

**Net terms,** the amount of time a customer has to pay an invoice, are determined by the Provider and should be
specified in the order form and Marketplace Offer.

**Provider payouts** occur only after Snowflake collects the applicable product charges from the customer, net of
Snowflake fees, applicable taxes, and any permitted adjustments. If a customer pays with Marketplace Capacity
Drawdown (MCD), the related capacity invoice must be paid in full before payout is issued. Provider payouts are
made 30 days after payment is received in full.

Snowflake Marketplace charges a **transaction fee**, which is deducted from the Provider’s payout. Find the fee
schedule by navigating in Snowsight: Admin ‣ Billing & Terms ‣ Fee.

---
title: Snowflake Marketplace trust and safety review process
source: https://docs.snowflake.com/en/collaboration/trust-safety-review-process.md
section: Collaboration & Marketplace
---

# Snowflake Marketplace trust and safety review process

At Snowflake, maintaining **trust and safety across the Marketplace** is a top priority. All public profiles, listings, and underlying data products are subject to review processes designed to uphold security, integrity, and compliance standards.

1. **Content metadata**

   1. **Profiles**

      1. Snowflake evaluates each provider profile submission to verify a provider across a set of dimensions to evaluate its legitimacy
         as a business. For more information, see the [Profile requirements](provider-consumer-policies.md) in Provider and Consumer Policies.
   2. **Listings**

      1. Snowflake evaluates each listing to ensure that the listing metadata aligns with the [Listing practices](provider-consumer-policies.md) in Provider and
         Consumer Policies.
2. **Underlying Data Products**

   1. **Native Applications**

      1. **Security scan**: Every externally sharable native app must pass a [security review](../developer-guide/native-apps/security-overview.md). If the automated security scan fails, a manual review is conducted. This process ensures that applications meet our [Security requirements](../developer-guide/native-apps/security-app-requirements.md).
      2. **Functional Review**: Following the security scan, the Marketplace Operations team performs a Functional Review of public native
         applications. This step validates that the application aligns with Snowflake’s
         [Guidelines and requirements for listing Apps on Snowflake Marketplace](guidelines-reqs-for-listing-apps.md), ensuring applications perform the advertised functionality and are
         compliant with Marketplace standards.
   2. **Datasets**

      1. Providers are responsible for their data assets. As such, Snowflake does not conduct reviews of underlying datasets in the normal
         course consistent with the [Provider and Consumer Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/snowflake-marketplace/provider-and-consumer-terms/).
      2. If a credible consumer report is submitted regarding a dataset listing, Snowflake may review the underlying data objects to support the investigation consistent with the [Provider and Consumer Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/snowflake-marketplace/provider-and-consumer-terms/).
   3. **Cortex AI-Ready products (Semantic views, CKE, etc.)**

      1. **Functional review**: All public AI-Ready products go through a functionality review where the Marketplace Operations team reviews for basic functionality — installing, configuring, and running examples — to ensure a high-quality experience for consumers.

## Ongoing oversight

If Providers and Consumers are unable to come to a resolution on a dispute regarding a product, they may file a case with Snowflake Marketplace Operations.

Use these forms to file your dispute:

* Provider dispute against a Consumer: <https://snowforce.my.site.com/s/provider-onboarding-case>
* Consumer dispute against a Provider: <https://snowforce.my.site.com/s/consumer-reporting>

---
title: Troubleshoot auto-fulfillment setup issues
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-troubleshoot-setup.md
section: Collaboration & Marketplace
---

# Troubleshoot auto-fulfillment setup issues

When you set up your listing, underlying issues with your data product can prevent auto-fulfillment.

* A direct share with the same name already exists in the remote account
* A role missing privileges on a share
* Auto-fulfillment failed during snapshot generation for multiple listings
* The database is larger than 10 terabytes
* The data product contains a reference database
* The data product contains unsupported objects
* The listing database is a primary database
* The listing database is a secondary database
* Receiving an error that my account is not set up for auto-fulfillment
* The user is unable to share to accounts in other regions

## A direct share with the same name already exists in the remote account

Error:
:   Two or more providers within an organization can’t create a direct share with the same name.

Cause:
:   A direct share with the same name already exists in the secure share area used by Cross-Cloud Auto-Fulfillment. This can happen if a different account in your organization is using Cross-Cloud Auto-Fulfillment and has a direct share with the same name auto-fulfilled to that cloud region. The secure share area in a cloud region is shared across all provider accounts in your organization.

Solution:
:   Rename the direct share that contains the share attached to the listing that will be auto-fulfilled. Renaming the direct share doesn’t affect any downstream consumers.

## A role missing privileges on a share

Error:
:   OWNERSHIP on the selected share is required to enable auto-fulfillment.

Cause:
:   Only the ACCOUNTADMIN role can set up auto-fulfillment. This error can occur when the ACCOUNTADMIN
    role is not granted and does not inherit the role that owns the share attached to the listing.

Solution:
:   Grant the role that owns the share to the ACCOUNTADMIN role. For example, run the following:

    ```sqlexample
    GRANT ROLE SHARE_OWNER TO ROLE ACCOUNTADMIN;
    ```

## Auto-fulfillment failed during snapshot generation for multiple listings

Error:
:   Internal error occurs during auto-fulfillment for multiple listings.

Cause:
:   The error can occur if multiple listings use the same database for cross-region sharing and one of the listings contains or
    references an unsupported object type. This can impact the auto-fulfillment process for all listings that use that database.
    For example, let’s say a provider adds a new listing to be transferred across clouds or regions. The new listing shares objects from
    a database that other listings also use. The new listing includes a VIEW using a BUILD_SCOPED_FILE_URL, a function that calls
    GET_STAGE_FILE to retrieve data from an external stage in S3. Because external stages are not supported for auto-fulfillment,
    and the objects in that database are transferred together as a group, the other listings get the same error. If no action is taken,
    existing consumers in remote regions will not get updates, and new customers will not be able to get the data product.

    Similar-looking errors can occur for other issues like network issues, authentication problems, or unsupported object types in
    certain operations (like replication).

Solution:
:   Starting with listings that were most recently added or updated, verify the following:

    * Verify that the listings in the group of listings that have errors include only
      supported object types for cross-region auto-fulfillment,
    * Verify that none of the objects make reference to unsupported object types. You might have to check multiple levels of
      dependencies to identify the root cause of the issue, for example, a view calling BUILD_SCOPED_FILE_URL which itself calls
      GET_STAGE_FILE to retrieve data from an external stage.
    * Use separate databases for listings that require different object types to avoid cross-impact.
    * Remove or replace any unsupported objects to avoid auto-fulfillment errors.
    * Check for any potential network, authentication, or missing GRANTS issue.
    * Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if the issue persists or you need assistance.

## The database is larger than 10 terabytes

Error:
:   Auto-fulfillment is unavailable because the share is associated with a database larger than 10TB.

    Auto-fulfillment is unavailable because the data product is associated with a database larger than 10TB.

Cause:
:   The database that contains the objects in your share is larger than the 10TB limit for database replication and auto-fulfillment.
    The limit exists to prevent unexpectedly high costs resulting from auto-fulfillment or replication, but can be changed.

Solution:
:   Explore the cost ramifications for auto-fulfilling a database larger than 10TB to one or more regions. See [Auto-fulfillment costs](provider-understand-cost-auto-fulfillment.md).

    If you accept the potential added cost, you can contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to have the limit adjusted for your entire account.

> **Note:**
>
> If you configured [object-level (SUB_DATABASE) auto-fulfillment](provider-understand-auto-fulfillment-objects.md), then the auto-fulfillment size safety check will only include shared dependencies and not the entire database.

## The data product contains a reference database

Error:
:   The reference database in the share is not supported for auto-fulfillment.

    The shared object references below are incompatible.

    The references below in the shared database are incompatible.

Cause:
:   The share attached to the listing contains a reference database or contains objects that reference a different database. Referencing objects
    in a different database is not supported for auto-fulfillment.

Solution:
:   You can do one of the following:

    > * Remove the reference database, and objects referring to the reference database, from the share.
    > * Use a different database that has all of the objects required for the share. You might need to recreate tables in the new database
    >   and view & function definitions updated.
    > * Use manual fulfillment instead. Only some listings can be manually fulfilled. See [Manually replicate data to fulfill a listing request](https://other-docs.snowflake.com/en/collaboration/provider-listings-managing#label-manually-replicate-listing).

## The data product contains unsupported objects

Error:
:   The data product contains objects incompatible with cross-region sharing. Update the data product to share with accounts in other regions.

    The shared objects below are incompatible.

    The objects below in shared database are incompatible.

Cause:
:   The database that contains the share contains objects unsupported by auto-fulfillment. Because the entire database gets auto-fulfilled,
    even if the share does not contain the objects, you might still encounter this issue.

    For an application package, you might see this issue if the data content included in the application or the referenced database contains
    objects unsupported by auto-fulfillment.

Solution:
:   Review the full list of supported objects for auto-fulfillment. See [Objects supported for auto-fulfillment](provider-understand-auto-fulfillment-objects.md).

    If the database contains objects that are not supported, you can do one of the following:

    > * Remove the unsupported objects from the database or application package to be shared.
    > * Use a different database that has all the objects required for the share, and no unsupported objects.

## The listing database is a primary database

Error:
:   The primary database in the share is not supported for auto-fulfillment.

    The primary database in the data product is not supported for auto-fulfillment.

    Cannot auto-fulfill listing: listing database is a global database, which is not supported.

Cause:
:   The share contains objects from a database that was previously used for database replication.

Solution:
:   You can do one of the following:

    * Convert the secondary and primary databases to use replication groups and set up a manual replication group if desired. See
      [Transitioning from database replication to group-based replication](../user-guide/account-replication-config.md)
    * Use a different database that has all of the objects required for the share, and has not been previously replicated.

## The listing database is a secondary database

Error:
:   The secondary database in the share is not supported for auto-fulfillment. You will need to manually set up accounts in available regions,
    replicate the database to each account, create a secure share in each account, and attach those shares to this listing.

    The secondary database in the data product is not supported for auto-fulfillment. Please choose another data product.

Cause:
:   The database that contains the share is a secondary database, which is read-only and cannot be replicated or auto-fulfilled.

Solution:
:   You can do one of the following:

    > * Create your listing from the account where the database is the primary database.
    > * Stop replicating the database manually to other regions.

## Receiving an error that my account is not set up for auto-fulfillment

Error:
:   Cannot set replication schedule for listing <my_listing>: account not set up for auto-fulfillment.

Cause:
:   Auto-fulfillment hasn’t been enabled for your account, or you’re using a trial account.

Solution:
:   * If you’re using a full (non-trial) account, you can enable auto-fulfillment on your account using the [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../sql-reference/functions/system_enable_global_data_sharing_for_account.md) system function.

      You must use the [ORGADMIN](../user-guide/organization-administrators.md) role to call this system function. If you aren’t the organization administrator, contact your organization administrator to enable auto-fulfillment for your account.

      For more information, see [Enable auto-fulfillment for your account](provider-listings-auto-fulfillment-setup.md).
    * If you’re using a trial account, upgrade to a full account to enable auto-fulfillment.

## The user is unable to share to accounts in other regions

Error:
:   To share to accounts in other regions, please contact your organization administrator to delegate privileges to the ACCOUNTADMIN role in
    this account.

Cause:
:   Your role does not have permission to set up auto-fulfillment.

Solution:
:   Contact your organization administrator to [Manage privileges for auto-fulfillment](provider-listings-auto-fulfillment-manage-privileges.md).

---
title: Troubleshoot problems with auto-fulfilled data products
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-troubleshoot-data-products.md
section: Collaboration & Marketplace
---

# Troubleshoot problems with auto-fulfilled data products

The following issues might occur for auto-fulfilled data products that are improperly configured.

* Data is missing or out of sync for consumers
* Long delay getting data after requesting a listing

## Data is missing or out of sync for consumers

Error:
:   Consumer reports that views from an auto-fulfillment listing are no longer visible.

Cause:
:   You re-created objects, such as tables or views, associated with your listing and either:

    * The objects were not re-granted to the share after being re-created
    * Or, the objects were re-granted but it has been less than 10 minutes. Changes to objects granted to shares are checked every 10 minutes,
      so if it has been less than 10 minutes, the updated objects have not yet been auto-fulfilled to the consumer’s region.

Solution:
:   Verify that the objects were re-granted to the share, and determine how much time has passed since the grant query was run.

    To confirm that all objects are granted to the share in your primary account, run the following:

    ```sqlexample
    SHOW GRANTS to SHARE <share_name>;
    ```

    If needed, re-grant objects to the share:

    ```sqlexample
    GRANT USAGE on DATABASE <db_name> to SHARE <share_name>;
    GRANT USAGE on SCHEMA <schema_name> to SHARE <share_name>;
    GRANT SELECT on TABLE <table_name> to SHARE <share_name>;
    GRANT SELECT on VIEW <view_name> to SHARE <share_name>;
    GRANT USAGE on FUNCTION <function_name(parameters)> to SHARE <share_name>;
    ```

    Allow up to 10 minutes after grants have been updated in the primary region, or after a database has been refreshed with new objects,
    for grants to apply in all remote regions.

## Long delay getting data after requesting a listing

Consumer reports that they requested a listing in their region, but after several days, they still don’t have access to the data product.

Error:
:   Data is replicating to your region…

Cause:
:   If the error message appears for several days with no change in status, it’s likely that an auto-fulfillment error occurred.

Solution:
:   As a provider, view the listing details to identify a specific error preventing auto-fulfillment of the data product, and refer to this
    troubleshooting guide to address the error.

    As a consumer, contact the provider to let them know that there is a problem auto-fulfilling their data product to your region.

---
title: Troubleshooting auto-fulfillment
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-troubleshooting.md
section: Collaboration & Marketplace
---

# Troubleshooting auto-fulfillment

When you use Cross-Cloud Auto-Fulfillment, either by sharing a listing with a consumer account in another region or by setting up the
region availability of your listing on the Snowflake Marketplace, various checks run to determine whether your data product can be auto-fulfilled.

You can use the sections that follow to troubleshoot common issues with auto-fulfillment. Contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) if you encounter an issue not listed here.

> **Note:**
>
> Some issues described in these sections appear when a compatibility check runs for your data product when you [Set up auto-fulfillment](provider-listings-auto-fulfillment-setup-steps.md). For private listings, the compatibility check only runs if you save your listing as a draft before adding consumer accounts, so you might not see the issues when you first publish a private listing.

---
title: Tutorial
source: https://docs.snowflake.com/en/collaboration/tutorial-resharing.md
section: Collaboration & Marketplace
---

# Tutorial

## Use case: A provider shares a listing on the internal marketplace. It’s then reshared with a consumer.

In this use case, a provider shares a reshareable listing on the internal marketplace. Consumer A retrieves the listing and then reshares it
with Consumer B.

> **Note:**
>
> The steps for resharing listings on the Snowflake Marketplace and for resharing private listings are similar to the steps provided in this use case.

### Step 1. The provider creates a reshareable listing on the internal marketplace

> **Note:**
>
> To enable resharing across regions, the provider must enable `change_tracking` on their tables. This can only be done
> programmatically using [CREATE TABLE](../sql-reference/sql/create-table.md) or [ALTER TABLE](../sql-reference/sql/alter-table.md). For more information, see
> [Enable change tracking](../user-guide/dynamic-tables-create.md).

1. Follow the steps for [creating an organization listing](../user-guide/collaboration/listings/organizational/org-listing-create.md) on the internal
   marketplace in Snowsight.

   This use case creates a listing named *Daily revenue reshare*. The listing contains a table named *daily_revenue_table*.
2. Review the Resharing section in the lower-right corner.

   Listings can be reshared by default.
3. Add Consumer A to the targeting of the listing, and then publish the listing.

   The listing will be discoverable in the organization’s internal marketplace after it’s published.

### Step 2. Consumer A retrieves and reshares the listing

In this example, Consumer A retrieves the shared listing from the internal marketplace and then reshares it with a second-level consumer
(Consumer B).

#### Verify that you can see the listing

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md) as Consumer A.
2. In the navigation menu, select Catalog » Internal Marketplace.
3. Verify that the shared listing is available.

   In this use case, the shared listing is named *Daily revenue reshare*.
4. On the Internal Marketplace page, select the shared listing, and then copy the listing ULL.

   You’ll include this ULL when you create a view.

   In this use case, the copied ULL is ORGDATACLOUD$INTERNAL$DAILY_REVENUE_RESHARE.

#### Create a new view

Create a view in a new or existing database that references objects from the shared listing.

The view references the listing as shown in the following example. Include the listing ULL that you copied earlier. This ULL includes the
listing name, the schema, and the table name. This view becomes the *outgoing* view.

```sqlexample
CREATE SECURE VIEW drt_secure_view
  COMMENT = '<comment>'
  AS SELECT * FROM ORGDATACLOUD$INTERNAL$DAILY_REVENUE_RESHARE.public.daily_revenue_table;
```

The new view is listed in the database’s public views.

#### Reshare the listing with Consumer B

To reshare the listing with Consumer B, follow these steps:

1. In the navigation menu, select Marketplace » Provider Studio.
2. On the Listings page, select Create listing » Specified consumer.
3. Specify a name for the listing.

   For this example, the listing is named *Resharing Daily Revenue Table*.
4. Select Add data product.

   1. Select the secure view you created above.

      In this use case, the secure view is named *DRT_SECURE_VIEW*.
   > 1. To add the data product, select Done, and then select Save.
5. Continue updating the required listing fields.

   > For this use case, edit the resharing section so that this listing can’t be reshared. This is optional. You can configure a reshared
   > listing so that it can continue to be reshared.
   >
   > > **Note:**
   > >
   > > If you enable auto-fulfillment for a reshared listing that crosses databases, you must specify a warehouse. This can be done in the UI
   > > in the listing’s auto-fulfillment settings, or it can be done programmatically by specifying the `warehouse` in the listing
   > > manifest’s `auto-fulfillment` property.
6. Publish the listing.

   The listing is now available to the business partner.
7. To see the listings that you’re sharing, follow these steps:

   1. In the navigation menu, select Data sharing » External sharing.
   2. On the External sharing page, select the Shared by your account tab.

### Step 3. Consumer B retrieves the reshared listing

In this example, Consumer B retrieves the listing that was reshared in the previous step.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md) as Consumer B.
2. In the navigation menu, select Data sharing » External sharing.
3. On the External sharing page, select the Shared with you tab.
4. Select Get to retrieve the listing, and then select Get once more to confirm.

   At this point, the reshared listing is ready to use. To test the listing, run the following command:

   ```sqlexample
   SELECT * FROM resharing_daily_revenue_table.public.drt_secure_view;
   ```

---
title: Update the refresh frequency
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-update-refresh-frequency.md
section: Collaboration & Marketplace
---

# Update the refresh frequency

To update the refresh interval and frequency of your data product, do the following:

Snowsight

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio » Listings.
3. Select the row for the listing that you want to manage.
4. From the listing details page, access the auto-fulfillment settings:

   * For a listing offered on the Snowflake Marketplace, in the Cloud region availability section, select the  menu, and then select Update Refresh Frequency.
   * For a listing offered to specific consumers, in the Who can access section, select the  menu, and then select Update Refresh Frequency.
5. In the Update Refresh Frequency pane, you can configure a trigger-based, interval-based, or schedule-based refresh. See [How auto-fulfillment refreshes data](provider-listings-auto-fulfillment.md) for details.

> * The refresh interval of an application package must be set at the account level. See [Set the account-level refresh interval](provider-listings-auto-fulfillment-set-refresh-interval.md).
> * The refresh interval for a share is set at the listing level, but you can only specify one schedule for each database. If you have multiple shares attached to multiple listings that contain objects from the same database, updating the refresh interval for one of the listings updates the refresh interval for all other listings that use the same database.

For more details about modifying listings, see [Modify published listings](https://other-docs.snowflake.com/collaboration/provider-listings-modifying).

---
title: Use BCDR for listings as a provider
source: https://docs.snowflake.com/en/collaboration/listings-bcdr-providers.md
section: Collaboration & Marketplace
---

# Use BCDR for listings as a provider

## Key provider responsibilities

To maintain this seamless experience for your consumers, providers must ensure:

* **Failover group configuration**: All listings, shares, and linked databases must be part of a single failover group.
* **Metadata integrity**: You must regularly refresh the failover group to ensure the secondary account is a faithful replica of the primary.
* **Operational continuity**: In the event of a disaster, when you promote the secondary account to a primary, Snowflake automatically
  manages the redirection of the [auto-fulfillment](provider-listings-auto-fulfillment.md) pipelines. Providers must refresh the failover group in the original primary (when available) to serve consumers in that region.

> **Note:**
>
> The “one mount point per region” constraint remains strictly enforced. This prevents data fragmentation and ensures that your consumers always have a clear, singular path to your data listings.

## Configure failover groups for listings and their dependencies

This section describes how to configure failover groups for your listings so that your listings and their dependencies are better protected during an outage.

### Access control requirements

To review the roles that are required to perform replication and failover on group objects in the system, see [Replication privileges](../user-guide/account-replication-considerations.md).

### Step 1: Create a failover group on a listing

To create a new failover group that includes your listings, use [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md). To add listings to an existing failover group, use [ALTER FAILOVER GROUP](../sql-reference/sql/alter-failover-group.md).

> **Note:**
>
> You must include dependencies along with listings when adding listings to a failover group. If your listing includes dependencies that
> aren’t part of the failover group, such as dangling references, then Snowflake returns an error during the create or alter process.
>
> Adding shares to listings is optional. Snowflake automatically selects all of the eligible listings and their shares for replication and failover.

The following example uses [CREATE FAILOVER GROUP](../sql-reference/sql/create-failover-group.md) to create a new failover group for databases and listings. In this example, the failover group is named `provider_dr_fg`. The object types in the failover group include a database named `provider_dr_db` and an allowed account named `myorg.myaccount2`.

```sqlexample
CREATE FAILOVER GROUP provider_dr_fg
  OBJECT_TYPES = DATABASES, LISTINGS
  ALLOWED_DATABASES = provider_dr_db
  ALLOWED_ACCOUNTS = myorg.myaccount2;
```

### Step 2: Create a secondary failover group

To create a replica of the initial failover group on the allowed account, run the following commands:

```sqlexample
CREATE FAILOVER GROUP provider_dr_fg
  AS REPLICA OF myorg.myaccount1.provider_dr_fg;
ALTER FAILOVER GROUP provider_dr_fg REFRESH;
```

### Step 3: Validate the secondary failover group

1. To validate that the listing resolves, run the [SHOW LISTINGS IN FAILOVER GROUP](../sql-reference/sql/show-listings-in-failover-group.md) command followed by the
   [SHOW LISTINGS](../sql-reference/sql/show-listings.md) command.

   ```sqlexample
   SHOW LISTINGS IN FAILOVER GROUP provider_dr_fg;
   SHOW LISTINGS LIKE 'provider_dr_listing_2';
   ```
2. To confirm that all shares are correctly associated with the listings in the secondary account, run the SHOW SHARES query.

   The response will include a non-NULL value in the `listing_global_name` field.

   ```sqlexample
   SHOW SHARES LIKE 'provider_dr_listing_share';
   ```

   > **Note:**
   >
   > A NULL value in the `listing_global_name` field indicates an issue with attaching the share to the listing in the secondary account. Review your failover group configuration, or reach out to the Snowflake team for assistance.

## Limitations for providers after a failover

* **Listing Analytics**: Information in [Data Sharing Usage](../sql-reference/data-sharing-usage.md) is only available for listings in the account where the
  listings were originally created. This information may not be available in the failover account.

---
title: Use listings as a consumer
source: https://docs.snowflake.com/en/collaboration/consumer-becoming.md
section: Collaboration & Marketplace
---

# Use listings as a consumer

To access listings shared privately or on the Snowflake Marketplace, become a Snowflake consumer. You can also access data shared
as part of direct shares or data exchanges, which offer more limited data sharing capabilities.

As a consumer of listings, you can do the following:

* Access data in listings shared from other cloud platforms and Snowflake regions.
* [Access data in a private listing](virtual-private-snowflake/vps-collaboration-for-consumers.md).
* Pay for listings inside Snowflake instead of negotiating billing with each listing provider.
* Get more information about data in a listing, such as example SQL queries.

To become a consumer of listings, you must meet the following requirements:

* Your organization must agree to the legal terms. See Accept the Snowflake Provider and Consumer Terms.
* Your account must be granted the relevant privileges for working with listings. See Set up required privileges.
* To consume paid listings, you must set up payment information and be eligible to pay for listings. See [Pay for listings](consumer-listings-paying.md).
* If your account is located in a U.S. government region, you must accept the cross-region disclaimer. See Prepare to access listings from accounts in U.S. government regions.

For more information, see [Pay for listings](consumer-listings-paying.md).

## Accept the Snowflake Provider and Consumer Terms

The organization administrator only needs to accept the Snowflake Provider and Consumer Terms once for your organization.
After the terms have been accepted, anyone in your organization that has a role with the necessary privileges can become a consumer of listings. For details about the terms of service, see [Legal requirements for providers and consumers of listings](collaboration-listings-legal.md).

> **Note:**
>
> You must be an organization administrator (a user granted the ORGADMIN role) to accept the terms. You do not need to accept the Snowflake Provider and Consumer Terms if your organization intends to access only free listings, or the listing terms are offline. Custom or standard listing terms must be accepted in Snowsight.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, next to Snowflake Provider and Consumer Terms, select Review.
4. If you agree to the terms, select the checkbox for I accept Snowflake Provider and Consumer Terms.
5. You can also accept additional terms to allow users in your organization to get listings that use the [Standard Agreement for Marketplace Products](https://www.snowflake.com/marketplace/standard-agreement/):

   1. Review the [Standard Agreement for Marketplace Products](https://www.snowflake.com/marketplace/standard-agreement/)
   2. Select I authorize my organization’s user to accept Standard Agreement for Marketplace products.
6. Select Save to accept to finalize the selection or Cancel to cancel.

## Set up required privileges

To access a listing, you must use the ACCOUNTADMIN role or another role with the CREATE DATABASE and IMPORT SHARE privileges.
To pay for a paid listing, your role must also have the PURCHASE DATA EXCHANGE LISTING privilege.

If you don’t have a role with these privileges, you can automatically request access from the account administrator when attempting to access a listing.

To gain access, you can ask your account administrator to do one of the following:

* Grant the CREATE DATABASE and IMPORT SHARE privileges to a role on your account so that you can get access to listings.
* Get a listing for your account, then grant the IMPORTED PRIVILEGES privilege on the database created from the listing to a role on
  your account. This lets you access the data in the listing without having access to get any listing on the Snowflake Marketplace or privately.
* Install the listing for you.

See [Assigning IMPORTED PRIVILEGES to other roles](../user-guide/data-share-consumers.md) for more details about the privileges associated with listings.

## Prepare to access listings from accounts in U.S. government regions

If your account is in a [U.S. government region](../user-guide/intro-regions.md) and you want to install data products offered privately or on the Snowflake Marketplace, or
offer listings either privately or on the Snowflake Marketplace, you must review and acknowledge the following cross-region disclaimer for your
organization.

> **Important:**
>
> To get data products and share listings with Snowflake customers outside your region, Snowflake shares organization and account metadata
> and usage analytics with the customers you collaborate with outside of your region.
>
> Compliance standards, such as [FedRAMP](../user-guide/cert-fedramp.md), and support for different regulated workloads, such as [ITAR](../user-guide/cert-itar.md), might be different or unavailable
> outside of your U.S. Government Region. Consider your compliance requirements before choosing to move or share data between Snowflake regions.

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept terms once for your Snowflake account. If you do not have
> the ORGADMIN role, see [Enabling the ORGADMIN role in an account](https:/docs.snowflake.com/en/user-guide/organization-administrators).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> * Providers can enable [Egress Cost Optimizer (ECO)](provider-listings-auto-fulfillment-eco.md) in a primary account in any commercial region and create listings targeted to any other region, including government regions.
> * By default, ECO is unavailable to customers on a government cloud. If you’re a Gov customer, you can reach out to your Snowflake account executive for more information about ECO enablement.

You must use the ORGADMIN role and you only need to complete this step once for your organization:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

### Stop sharing and collaboration from an account in a US government region

If you no longer want to offer or access listings from your account in a US government region, do the following:

1. [Delete all of your listings](provider-listings-removing.md) shared from your account, consistent with the applicable
   requirements in the Provider and Consumer Terms.
2. Stop consuming listings by dropping the databases imported when you
   [accessed listings](consumer-listings-access.md).
3. [Contact Snowflake Support](../user-guide/contacting-support.md) to have data sharing and collaboration disabled for your organization.

The types of listings and data products that you can access are limited. See [Limitations for accessing listings from accounts in U.S. government regions](consumer-listings-access.md).

## Prepare to access listings from accounts in the Kingdom of Saudi Arabia (KSA) region

If your account is in a [Europe and Middle East region](../user-guide/intro-regions.md), specifically Dammam (me-central2), and you want to install data products offered privately or on the Snowflake Marketplace, or
offer listings either privately or on the Snowflake Marketplace, you must review and acknowledge the following cross-region disclaimer for your
organization.

> **Important:**
>
> To get data products and share listings with Snowflake customers outside your region, Snowflake shares organization and account metadata
> and usage analytics with the customers you collaborate with outside of your region. Compliance standards and support for different
> regulated workloads might be different or unavailable outside of your region.
> Consider your compliance requirements before choosing to move or share data between Snowflake regions.

> **Note:**
>
> You must use the ORGADMIN role to accept the terms. You only need to accept terms once for your Snowflake account. If you do not have
> the ORGADMIN role, see [Enabling the ORGADMIN role in an account](https:/docs.snowflake.com/en/user-guide/organization-administrators).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, for Sharing & Collaboration, select Review & Enable.
4. Review the cross-region disclaimer and select Acknowledge & Continue.
5. Select Done.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

### Stop sharing and collaboration from an account in a KSA region

If you no longer want to offer or access listings from your account in a KSA region, do the following:

1. [Delete all of your listings](provider-listings-removing.md) shared from your account, consistent with the applicable
   requirements in the Provider and Consumer Terms.
2. Stop consuming listings by dropping the databases imported when you
   [accessed listings](consumer-listings-access.md).
3. [Contact Snowflake Support](../user-guide/contacting-support.md) to have data sharing and collaboration disabled for your organization.

---
title: Use listings as a provider
source: https://docs.snowflake.com/en/collaboration/provider-becoming.md
section: Collaboration & Marketplace
---

# Use listings as a provider

Becoming a listing provider allows you to offer listings to consumers privately or on the Snowflake Marketplace. Being a Snowflake listing provider
makes it easier to manage sharing from your account to other Snowflake accounts.

When you share data as a provider, you can do the following:

* Monitor usage of the listings and associated data shares and products. See [Monitor listing use](provider-listings-monitor-studio.md).
* Create one or more provider profiles to manage your professional presence with consumers.
  See Set up a provider profile.
* Charge consumers for access to listings within Snowflake. See Set up Stripe to get paid for listings.

Every private paid listing can have a price per consumer. If the trial and purchase price
for a listing differ, Snowflake recommends changing the price of the existing listing
so the consumer doesn’t need to reinstall the listing.
To learn more about listing monetization, see [Paid listings pricing models](provider-listings-pricing-model.md).

## Requirements to become a provider

To offer listings to consumers privately or on the Snowflake Marketplace, you must meet the following requirements:

* You must use a full Snowflake account. Trial accounts can share data with specified consumers, but not on the Snowflake Marketplace.
* You must not be using a Reader Account.
* You must have the ACCOUNTADMIN role or be assigned a role with provider privileges.
  See Privileges required for working with listings.
* You must meet the [Legal requirements for providers and consumers of listings](collaboration-listings-legal.md). See Review and accept the Snowflake Provider and Consumer Terms for instructions.
* If your account is in a U.S. government region, you must also accept the cross-region disclaimer. See [Prepare to provide listings from accounts in U.S. government regions](provider-listings-government-providers.md).

To offer specific types of listings, you must also do the following:

* To offer paid listings or any listings on the Snowflake Marketplace, you must create a provider profile.
  See Set up a provider profile.
* To offer paid listings, you must set up configure your account to get paid for listings. See Set up Stripe to get paid for listings on this page.

### Privileges required for working with listings

When you create a listing, you create it from the account that has the data or application package in it. The role that attaches a data
product to a listing and publishes the listing must be the same role that created, and therefore owns, the application package or share.
You cannot transfer the OWNERSHIP privilege for a share.

If you use a different role to create and manage the listing, grant the MODIFY privilege on the listing to the role
that owns the application package or share. For example:

Share or application package owner role:
:   OWNERSHIP privilege on the share or application package.
    MODIFY privilege on the listing.

Listing owner role:
:   OWNERSHIP privilege on the listing.

    Global CREATE LISTING privilege.

Within the provider account, you can use one of the following to create and manage listings:

ACCOUNTADMIN:
:   If you use the ACCOUNTADMIN role to create and manage listings, the ORGADMIN role must first
    [Delegate privileges to set up auto-fulfillment](provider-listings-auto-fulfillment-manage-privileges.md).

Custom role:
:   If you use a custom role, the ORGADMIN role must first [Delegate privileges to set up auto-fulfillment](provider-listings-auto-fulfillment-manage-privileges.md)
    to the ACCOUNTADMIN role, which can then be used to grant the relevant privileges to the custom role.

For more information about granting sharing privileges, see [Granting Privileges to Other Roles:](../user-guide/data-exchange-marketplace-privileges.md).

### Review and accept the Snowflake Provider and Consumer Terms

Before you can become a Snowflake Marketplace provider, an organization administrator (ORGADMIN) needs to
review and accept the combined Snowflake Provider and Consumer Terms.

> **Note:**
>
> You do not need to accept the Snowflake Provider and Consumer Terms if you’re only creating free private listings and you’ve accepted the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Terms.
3. In the Snowflake Marketplace section, review the Snowflake Provider and Consumer Terms of service.
4. If you agree to the terms, select Accept Terms & Conditions.

> **Note:**
>
> If you see an error, your user profile might be missing some contact information. If you have an administrator role, see
> [Add user details to your user profile](../user-guide/ui-snowsight-profile.md) to update your profile using Snowsight. Otherwise, contact an
> account administrator to update your user details.

See [Legal requirements for providers and consumers of listings](collaboration-listings-legal.md) for more details.

## Set up a provider profile

To offer listings to consumers privately, or on the Snowflake Marketplace, set up a provider profile in [Provider Studio](https://app.snowflake.com/#/provider-studio).
You do not need a provider profile to offer free private listings.

You only need to create a provider profile one time. You can create multiple provider profiles for one account.

Before you can create a provider profile, someone in your Snowflake account must review and accept the Snowflake Provider and Consumer Terms. Acceptance of the Snowflake Provider and Consumer Terms is not required when creating free private listings if you’ve accepted the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/). For more information about the Snowflake Provider and Consumer Terms, see Review and accept the Snowflake Provider and Consumer Terms.

> **Note:**
>
> You must use a role that has been granted the MODIFY privilege on the profile. For more information, see [Granting provider privileges to other roles in the Snowflake Marketplace or a Data Exchange](../user-guide/data-exchange-marketplace-privileges.md).

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In Provider Studio, select Profiles.
4. Select + Create profile to create a profile. A dialog box appears.
5. In the Create Profiles dialog box, complete the fields. All fields are required. For a description
   of the fields, see [Provider profile fields](provider-profiles-managing.md).
6. Select Next, then verify that your profile details are correct.
7. Select Submit for Approval, or click Save Draft if you want to review your profile details
   before submitting it for approval.

Your provider profile must be approved before you can offer paid listings or marketplace listings. For your profile to be approved,
Snowflake verifies the following:

* You have reviewed and accepted the Snowflake Provider and Consumer Terms. Acceptance of the Snowflake Provider and Consumer Terms is not required when creating free private listings, but you must review and accept the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/).
* Your profile abides by the Snowflake [Provider Policies](https://www.snowflake.com/provider-policies/).

See [Legal requirements for providers and consumers of listings](collaboration-listings-legal.md).

## Provide paid listings

To publish paid listings to consumers privately or on the Snowflake Marketplace, do the following:

1. Make sure that your account is eligible to provide paid listings. See Who can provide paid listings.
2. Before creating a paid listing that you want to publish on the Snowflake Marketplace, contact your business development partner at Snowflake.
   If you do not have a business development partner, [submit a case with Marketplace Operations](https://snowforce.my.site.com/s/provider-onboarding-case). This step is required for listing
   approval.
3. Set up a payout method to get paid for listings. See Set up Stripe to get paid for listings.

> **Note:**
>
> If you are a commercial reseller (VAR) that wants to offer paid listings, use this form to [submit a case with Marketplace Operations](https://snowforce.my.site.com/s/provider-onboarding-case).
> You only need to file one case to cover both purchasing and offering listings.

### Who can provide paid listings

As a provider, you can create paid listings if the billing address on your account is in one of the following countries:

* Australia
* Canada
* Colombia
* Finland
* France
* Germany
* Ireland
* Israel
* Italy
* Japan
* Kingdom of Saudi Arabia
* Mexico
* Netherlands
* New Zealand
* Norway
* Singapore
* Sweden
* Switzerland
* United Kingdom
* United States

See [Supported consumer locations](consumer-listings-paying.md) for information on region availability for consumers.

### Set up Stripe to get paid for listings

Stripe is used to send payments to providers for Snowflake Marketplace purchases.
As defined in the [Provider and Consumer Terms](https://www.snowflake.com/legal/snowflake-provider-and-consumer-terms/),
providers appoint Snowflake as their agent for receiving consumer payments.

To receive payments for your listings, you must set up a Stripe Express account associated with Snowflake.

[Stripe](https://stripe.com/) is the online payment processing system used by Snowflake to process payments from consumers who purchase
your paid listings. Payments collected from consumers are disbursed to your Stripe account for Snowflake Marketplace following Stripe
receiving payment from the consumer.

When you set up a Stripe Express account, you need to provide information about your business so that Stripe can verify your business details.
The person that sets up the Stripe account must also set up multi-factor authentication to set up and manage the Stripe account.

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Click the Marketplace billing tab.
4. Click Provider billing tab.
5. Click Activate account.
6. In the Provider payouts section, click Activate.
7. Complete the required information to create and set up your Stripe account. You get payouts in the official currency of the country as
   specified in your Snowflake billing entity. To get payouts in USD, your Snowflake billing address must be in the United States. For
   more information, see
   [Supported accounts and settlement currencies](https://stripe.com/docs/payouts?account-country=TR#supported-accounts-and-settlement-currencies)
   in the Stripe documentation.

After you set up your Stripe account and provide a payout method, the Provider payouts section of the Marketplace billing page reports the current status of the method.
The following table describes the different statuses:

| Status | Description |
| --- | --- |
| Pending verification | Stripe is in the process of verifying your payout method. |
| Completed & verified | Your payout method has been verified by Stripe. If you have already accepted the Marketplace terms, you are ready to sell products and collect payments. |
| Manage Payout account | There is an issue with your Stripe account. The Snowsight interface provides additional details about the exact issue and how to resolve the problem. |
| Rejected | Stripe has rejected your payout method. A valid payout method must be provided. |

If you encounter issues with setting up Stripe or receiving payments, [contact Snowflake Support](../user-guide/contacting-support.md).

## Respond to access requests as an administrator

If you’re an account administrator or a database owner, you can provide access to requesting roles.
You receive an email about the type of request, whether it’s an installation or usage request.
For each request, you receive specific instructions on how to proceed and fulfill the access request effectively.
See [Use listings as a consumer](consumer-becoming.md) for more information.

## The Snowflake Marketplace Capacity Drawdown program

The Snowflake Marketplace Capacity Drawdown (MCD) program allows Snowflake consumers to use a percentage of their Snowflake Capacity commitment
as an additional payment method for Snowflake Marketplace [paid listings](consumer-listings-paying.md).

The MCD program is now generally available to all US-based consumers purchasing from US-based providers (excluding Florida for both
providers and consumers). The MCD program is also available as Private Preview in the United Kingdom, Switzerland, and Mexico.

Eligible providers using on-platform monetization, such as paid listings, who are not outside the US or who are using a Florida address for billing or shipping can accept MCD program payments. For more information about paid listings, see [Paid listings pricing models](provider-listings-pricing-model.md). See [Pay for listings](consumer-listings-paying.md) to learn more about how consumers pay for listings.

The following consumers are excluded from enrollment in the MCD Program:

* Consumers purchasing Snowflake through a reseller
* Priority Support consumers

To enroll in the MCD program, consumers can opt-in by submitting an MCD program order form at the start of a new contract, when they renew a contract, or when they amend an existing MCD program contract. To enroll in the MCD program, consumers must agree to the [Marketplace Capacity Drawdown Program Terms](https://www.snowflake.com/legal/snowflake-marketplace-capacity-drawdown-terms/) and the [Provider and Consumer Terms](https://www.snowflake.com/legal/snowflake-provider-and-consumer-terms/).

A consumer can apply an unused MCD program balance to their service consumption payment. Any invoice that *exceeds* the consumer’s MCD program balance must be paid in
full using other payment methods such as a credit card, ACH transfer, wire transfer or a SWIFT transfer. The consumer’s MCD program balance is applied first and then one of their alternate listed payment methods is used to pay any remaining balance.

Every private paid listing can have a price per consumer. If the trial and purchase price for a listing differ, Snowflake recommends that you change the price of the existing listing so that the consumer doesn’t need to reinstall the listing. For more information about listing monetization, see [Paid listings pricing models](provider-listings-pricing-model.md).

## Listing compliance badges

If you’re a provider who has completed compliance certification by a third-party auditor, you can configure your listings to include this certification. You can add, edit, and remove compliance certifications directly in Provider Studio or through the listing manifest. When you provide the supporting compliance reports, Snowflake’s compliance team will review your submission. Upon approval, marketplace consumers can see your certifications in the Snowflake Marketplace, helping you build trust and transparency with potential consumers.

Consumers can filter Snowflake Marketplace listings by compliance certification, so adding certifications to your listings can increase their visibility to potential buyers.

Snowflake supports the following compliance certifications:

* FedRAMP
* GDPR
* HIPAA
* ISO 27001
* PCI DSS
* SOC 2

> **Note:**
>
> Certifications are tied to listings and not to providers. Providers that have undergone compliance certification must submit proof of compliance for each of their listings.

For information about how to include certification badges on new or existing listings, see [Create a listing on the Snowflake Marketplace that includes a compliance badge](provider-listings-creating-publishing.md).

For information about how to modify existing listings to include certification badges, see [Add compliance badges to a listing](provider-listings-modifying.md).

---
title: Using auto-fulfillment
source: https://docs.snowflake.com/en/collaboration/provider-listings-auto-fulfillment-setup.md
section: Collaboration & Marketplace
---

# Using auto-fulfillment

When you configure a listing and make it available in a region other than your local region, or when you share a private listing with consumer
accounts in another region, you can enable auto-fulfillment. See [Region availability (Marketplace listings only)](provider-listings-reference.md).

## Enable auto-fulfillment for your account

> **Note:**
>
> Auto-fulfillment isn’t available on trial accounts.

To enable auto-fulfillment for your account, use the [SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../sql-reference/functions/system_enable_global_data_sharing_for_account.md) system function.

You must use the [ORGADMIN](../user-guide/organization-administrators.md) role to call this system function.

```sqlsyntax
SELECT SYSTEM$ENABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT(
  '<account_name>'
  );
```

Where:

`account_name`
:   Specifies the name of the account in which to enable users with the ACCOUNTADMIN role to manage Cross-Cloud Auto-Fulfillment. For more information, see [Finding the organization and account name for an account](../user-guide/admin-account-identifier.md).

> **Note:**
>
> To disable auto-fulfillment for an account, use the [SYSTEM$DISABLE_GLOBAL_DATA_SHARING_FOR_ACCOUNT](../sql-reference/functions/system_disable_global_data_sharing_for_account.md) system function. To check whether auto-fulfillment is enabled for an account, use the [SYSTEM$IS_GLOBAL_DATA_SHARING_ENABLED_FOR_ACCOUNT](../sql-reference/functions/system_is_global_data_sharing_enabled_for_account.md) system function.

## Required privileges to perform auto-fulfillment tasks

Before continuing with either Snowsight or SQL, ensure that you have the required privileges to set up auto-fulfillment.

To perform auto-fulfillment tasks, use one of the following roles:

* The ORGADMIN role.
* The ACCOUNTADMIN role after auto-fulfillment is enabled on an account.
* A custom role that has been granted the MANAGE LISTING AUTO FULFILLMENT privilege by a user with the
  [ACCOUNTADMIN role with delegated privileges](provider-listings-auto-fulfillment-manage-privileges.md).

Any role that you use must also have OWNERSHIP or MODIFY privileges on the listing.

Now that you understand the required privileges, you can configure auto-fulfillment for your listing. See [Set up auto-fulfillment](provider-listings-auto-fulfillment-setup-steps.md) for more information. Keep in mind that you must add a data product to your listing before you can set up auto-fulfillment. Also, the steps to set up auto-fulfillment differ depending on the data product you offer and how you make your listing available.

---
title: Using auto-fulfillment with open table formats
source: https://docs.snowflake.com/en/collaboration/use-auto-fulfillment-with-open-table-formats.md
section: Collaboration & Marketplace
---

# Using auto-fulfillment with open table formats

Cross-Cloud Auto-Fulfillment for listings enables you to share open table formats — including [Apache Iceberg™ tables](../user-guide/tables-iceberg.md) and Delta
Lake tables — with internal and external consumers across cloud providers and regions. The tables can be managed by Snowflake or any other
catalog provider. Cross-Cloud Auto-Fulfillment optimizes data transfer costs and ensures data availability across all regions, without
requiring you to maintain extract, transform, and load (ETL) jobs.

Cross-Cloud Auto-Fulfillment reads data directly from the external volume and replicates all data as a Snowflake-managed Iceberg table
within the target regions. For this process, providers are charged for the consumption of the Snowflake-managed data, including egress,
storage, and compute. Virtual Private Snowflake (VPS) rates for compute will only apply if you or your consumers are using VPS. Data
transfer is charged at the same replication rate listed in the serverless feature table in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

> **Note:**
>
> This feature is supported for private listings, public listings in the Snowflake Marketplace, and listings on the Internal Marketplace.

## Tutorials

Snowflake provides the following tutorial for creating an Iceberg table using SQL:
[Tutorial: Create your first Apache Iceberg™ table](../user-guide/tutorials/create-your-first-iceberg-table.md)

## Getting started with Cross-Cloud Auto-Fulfillment for Iceberg tables

[Tutorial: Create your first Apache Iceberg™ table](../user-guide/tutorials/create-your-first-iceberg-table.md) describes how to create an Iceberg table. After the table is created, you can
create a Snowflake listing that includes this table and then provide it to consumers across any region or cloud. Consumers can access the
listing, and Snowflake will manage the data egress, replication, and storage. For more information on how to create a listing, see [Create a
new listing](https://other-docs.snowflake.com/en/collaboration/provider-listings-creating-publishing).

## Accessing Iceberg tables as a consumer

As a consumer, you can access and query shared Iceberg tables. For more information, see [Access and install listings as a consumer](https://other-docs.snowflake.com/collaboration/consumer-listings-access). New
changes to the Iceberg will be synched from the provider account to your account based on your configured auto-fulfillment refresh
frequency. For more information, see [How auto-fulfillment works](provider-listings-auto-fulfillment.md).

## Limitations

Cross-Cloud Auto-Fulfillment for listings is subject to the following limitations:

* You cannot replicate CATALOG or any CATALOG-related information.
* Catalog-linked databases (CLDs) are not supported.
* You cannot access the secure share area created in consumer regions by Cross-Cloud Auto-Fulfillment.
* [Egress cost optimizer (ECO)](provider-listings-auto-fulfillment-eco.md) replication support isn’t available for
  individual files that are larger than 5 GB.
* The following objects are not supported:

  + Streams and dynamic tables on shared Iceberg tables.
  + Streams and dynamic tables on views with a shared Iceberg table base.
  + Streams and dynamic tables on shared views with Iceberg base tables.
  + V3 Iceberg tables.

---
title: Using resharing as a provider
source: https://docs.snowflake.com/en/collaboration/resharing-as-provider.md
section: Collaboration & Marketplace
---

# Using resharing as a provider

As a provider, you can enable resharing on your listings so that consumers can reshare your data product with other accounts. This topic
describes how to set up and manage resharing as a provider.

## Enabling resharing on a listing

Before resharing listings, the provider must enable the `resharing` property in the
[listing manifest reference](../user-guide/collaboration/listings/organizational/org-listing-manifest-reference.md) or in Snowsight
when creating the listing.

To allow consumers to reshare your data product, set the `resharing.enabled` property to `true` in the listing manifest:

```yaml
resharing:
  enabled: true
```

For the full listing manifest reference, see [Listing manifest reference](../progaccess/listing-manifest-reference.md).

## Disabling resharing

You can disable resharing at any time by setting `resharing.enabled` to `false` and republishing the listing. When you disable resharing,
downstream consumption breaks for all consumers of any reshared listings created from your listing.

## Supported context functions

If your shared data uses context functions in governance policies or secure view definitions, only the following context functions are
supported for resharing. These functions return values as if they were executed by the account doing the resharing, not by the downstream
consumer:

* [CURRENT_ACCOUNT](../sql-reference/functions/current_account.md)
* [CURRENT_ACCOUNT_NAME](../sql-reference/functions/current_account_name.md)
* [IS_DATABASE_ROLE_IN_SESSION](../sql-reference/functions/is_database_role_in_session.md)
* [CURRENT_ORGANIZATION_NAME](../sql-reference/functions/current_organization_name.md)
* [CURRENT_DATE](../sql-reference/functions/current_date.md)
* [CURRENT_TIMESTAMP](../sql-reference/functions/current_timestamp.md)

If you add or change governance policies on your base tables that use unsupported context functions, resharers won’t be able to reshare
your data even if `resharing.enabled` is set to `true`. Downstream consumers of reshared listings will also lose access.

## Enabling cross-region resharing for your resharers

To support cross-region resharing, enable `change_tracking` on your tables. For more information, see
[Enable change tracking](../user-guide/dynamic-tables-create.md).

---
title: View consumer invoices for your listings
source: https://docs.snowflake.com/en/collaboration/provider-listings-invoices.md
section: Collaboration & Marketplace
---

# View consumer invoices for your listings

Snowflake issues invoices to consumers for Snowflake Marketplace provider paid listings. A consumer invoice lists the purchases and the billed amount for the invoicing period.

To access consumer invoices in Snowsight, you must have been granted the ACCOUNTADMIN role.

To view consumer invoices in Provider Studio across all your listings:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. Select the Invoices tab.

To view consumer invoices for a specific listing:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Marketplace » Provider Studio.
3. In the right pane, select the Listings tab.
4. Select a paid listing.
5. Select the Invoices tab.

To view consumer invoices from the Billing module:

1. Sign in to [Snowsight](../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Admin » Billing.
3. Select Marketplace Billing.
4. Select the Provider tab.

## Migrations

Guides for migrating workloads to Snowflake from other platforms.

---
title: AI assessment
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/ai-assessment.md
section: Migrations
---

# AI assessment

This topic helps you use the Cortex Code CLI agent to assess the source code to be migrated to the target Snowflake database.

Refer to [Cortex Code CLI](https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code-cli) for more information on how to use the Cortex Code CLI agent.

The Cortex Code CLI agent uses the `snowconvert-assessment` skill to analyze the source database code. It invokes a structured, multi-dimensional analysis of the source database code and generates a detailed actionable report. The generated report helps create a clear, dependency-aware migration plan by identifying potential road blocks and risks associated with the migration.

> **Note:**
>
> The `snowconvert-assessment` skill is currently supported for SQL Server and Redshift database platforms.

## Core functionality of AI assessment

You can perform AI assessment on source database code by invoking the `snowconvert-assessment` skill of the Cortex Code CLI agent. This skill enables the Cortex Code CLI agent to perform assessments by:

* Categorizing the source database objects
* Evaluating the conversion complexity of the source database code
* Establishing a logical migration sequence

The agent then generates a unified comprehensive report in an `html` format at the specified file location.

## Prerequisites to run AI assessment

* Install Cortex Code CLI. Refer to [Cortex Code CLI](https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code-cli) for more information on how to install the Cortex Code CLI agent.
* Reports generated by SnowConvert AI during a prior run of the same migration project.

## Steps to run AI assessment

Follow these steps to run AI assessment using the `snowconvert-assessment` skill of the Cortex Code CLI agent. A comprehensive unified report will be generated in the file location specified in the prompt.

1. Launch Cortex Code CLI in a terminal window.
2. Prompt the agent to use the `snowconvert-assessment` skill. The prompt should also include the file location for the reports generated by SnowConvert AI and the file location for the unified report to be generated.

   > **Example prompt**
   >
   > > ```none
   > > Use skill snowconvert-assessment, create a comprehensive assessment report for <filepath location of SCAI reports>, and put the results in <filepath location of the assessment report>.
   > > ```
3. You may encounter questions asking for read access to files. Selecting **always allow this operation in the future** can prevent these questions from appearing in future prompts.
4. Review the unified report generated at the file location specified in the prompt.

## Output of AI assessment

On completion of the AI assessment, you will find a comprehensive analysis report `multi_report.html` located in the file location you specified in the prompt.
This is an interactive report that allows you to view objects and apply filters based on the object status. You can export the object lists into `csv` files.
The four sections of the unified report consist of:

* Overview
* Exclusion Report
* Dynamic SQL Report
* Waves Report
* SSIS Package Analysis

Each section of this report covers the following critical aspects of risk management during the migration process.

1. **Identifying migration scope**: The Exclusion report helps identify deprecated or unused objects that would have no impact if they were not migrated, preventing unnecessary migration efforts.
2. **Targeting code conversion pain points**: The Dynamic SQL report helps target SQL code containing dynamic SQL statements, which are often the most complex to convert and need to be evaluated for complexity.
3. **Establishing a migration sequence**: The Waves report helps organize the objects into logical execution groups called “waves”, ensuring that all prerequisites for **Wave Two** are fully converted and deployed during **Wave One**. This structured approach guarantees “bottom-up” deployment, eliminating the risk of “missing dependency” errors that can stall complex migration projects.
4. **Replicating the exact workflow of SSIS packages**: The SSIS package analysis report helps classify SSIS packages (for SQL Server Integration Services) into the categories of ingestion, transformation, and configuration. It also includes an assessment of the complexity of the package, and Directed Acyclic Graphs (DAGs) reflecting workflows inside the packages.

### Overview

This section contains a summarized view of the assessment results for the migration workload. It includes the total workload inventory, anticipated manual effort, and quick access tiles to the detailed exclusion report, dynamic SQL report, waves report and SSIS report. Select any tile to view the detailed reports. These reports can also be accessed from the left navigation bar.

### Exclusion report

This section contains a list of objects that can be potentially excluded from migration, based on additional review.
The Cortex Code CLI agent intelligently flags deprecated files, temporary staging objects, test objects, and duplicate objects found in source database code. These objects can be excluded after further review with the subject matter experts (SMEs) before the code conversion kickoff.

Under the **Detailed Objects Analysis** section, you will find a list of objects that are flagged as:

* Temporary/Staging
* Deprecated/Legacy
* Testing objects
* Duplicate objects

Excluding these objects from the migration may help in reducing the migration effort and time.

### Dynamic SQL report

This section contains a list of all source code files that were found to contain dynamic SQL. Select any file to view the dynamic SQL code. The detailed view shows the sections of the SQL file containing dynamic SQL and the corresponding line numbers. Select **Complexity** to view the assessed migration complexity of the dynamic SQL code in the file.

> > **Note:**
> >
> > This section is available for migration from SQL Server databases only.

### Waves report

By default, each wave contains 40 objects. The Cortex Code CLI agent analyzes the relationships between the code objects and ensures that all prerequisites for **Wave Two** are fully converted and deployed during **Wave One**. This structured approach guarantees “bottom-up” deployment, eliminating the risk of “missing dependency” errors.

You can prompt the Cortex Code CLI agent to customize the waves report by specifying the number of objects per wave, or changing the order of migration to align with business needs.

### SSIS report

The SSIS Report helps determine the feasibility and effort required to migrate data workloads consisting of SQL Server Integration Services (SSIS) packages to Snowflake. The core components of this report are:

1. **AI summary**: This section contains an overview of the migration readiness of the SSIS packages analyzed by the Cortex Code CLI agent. It consists of migration workload overview, source and destination data flows, connection managers, and an overview of tasks that contribute to the complexity of migration. The AI summary section is further subdivided into three parts:

   * **Recommended Migration Approach** containing details on methods of migration and the number of packages it can be applied to.
   * **Key risks** containing details on connectivity gaps, process dependencies on external services like email-based data delivery, manual scripting effort required rewrite custom scripts that cannot be converted automatically, compliance requirements (such as HIPAA, FERPA) that may require data masking and role-based access control (RBAC) implementation.
   * **Consolidation opportunities** containing suggestions for applying the same migration approach to similar packages.
2. **Key metrics**: This section contains a quantified technical footprint of the source SSIS environment. The total component counts, number of transformation pipelines required for data flows, number of workflow and orchestration tasks for control flows and the number of databases with corresponding file connections are all summarized here.
3. **Package classification**: This section contains charts to show the classification of package categories and complexity. The interactive table contains a complete list of packages sorted by package name. Enter the package name to search on a specific package. Select options from the **classification** and **complexity** dropdowns to view a filtered list of packages. Select the package name in the list to view the complete AI analysis for the package. Select **Control Flow DAG** to open the Directed Acyclic Graph (DAG) depicting the workflow inside the package.
4. **Component conversion breakdown**: This section shows the number of SSIS package compnents that can be automatically converted to Snowflake and the number of components that require manual intervention.

## Sample prompts

Use the following sample prompts to customize your assessment.

The Cortex Code CLI agent can be scoped to target specific assessments, such as exclusion candidates, dynamic SQL objects, wave reports, or SSIS packages.

**To run selected assessment sections**

```none
Create an assessment for [wave report|dynamic SQL|object exclusion|ssis package analysis] using SnowConvert assessment files and generate an html report in <filepath location of the assessment report>.
```

**To perform SSIS package assessment**

```none
Perform SSIS package analysis and assessment for this workload: [PATH]. Place results in [filepath location of the assessment report]. Follow instructions of skill snowconvert-assessment and respect rules. Every workflow step must be followed in the order of execution.
```

## Billing and cost considerations

AI assessment consumes Snowflake credits based on the token consumption for interacting with Snowflake Cortex LLM functions.
Refer to the current rates in [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Legal notices

Your use of Snowflake AI features is subject to all agreements, terms or policies that apply to such usage, including but not limited to those documented in the [Snowflake AI & ML Documentation](https://docs.snowflake.com/en/guides-overview-ai-features).

Where your configuration of Cortex Code uses a model provided on the [Model and Service Pass-Through Terms](https://docs.snowflake.com/en/guides-overview-ai-features), your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Cortex Code CLI: Covered AI Features |

---
title: AI Assistant setup
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-ewi-assistant-walkthrough/sma-ai-assistant-usage/ai-assistant-setup.md
section: Migrations
---

# AI Assistant setup

1. Install the Snowflake VS Code extension.
2. Open the extension’s settings.
3. Search for and enable the **Snowpark Migration Accelerator AI Assistant**.

---
title: AI code conversion
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/snowconvert-ai-verification.md
section: Migrations
---

# AI code conversion

AI code conversion strengthens the migration process by using AI agents to convert more objects through automated functional validation of converted database code. It uses synthetic data generation, AI-driven unit testing, and AI-driven resolution of errors identified in the deterministic code conversion step, where error warnings and issues (EWIs) and functional difference messages (FDMs) flag conversion issues—along with an intelligent layer in the Snowflake Service that proactively converts code, verifies correctness, resolves errors, and accelerates confidence.

During migration, deterministic logic is first used to translate the source code, surfacing EWIs and FDMs when it cannot automatically resolve certain patterns. Following this, AI code conversion is then used to reduce the manual remediation effort by identifying and resolve issues earlier in the process, and provide assurance to users that the converted objects behave as expected. Users must review and confirm AI suggestions to ensure they align with the functionality and standards.

| AI code conversion is currently available for SQL Server, Redshift, BigQuery, and PostgreSQL databases. |
| --- |

## Key features of AI code conversion

* **Accelerated AI validation**: Reduces the time and resources spent on manual testing.
* **Automated test generation**: Automatically generates test cases with test data based on your existing queries and business logic.
* **Repair suggestions**: Generates suggestions to produce consistent results between the source database system and Snowflake.

## Prerequisites for AI code conversion

Before you get started with AI code conversion, complete the following steps:

1. Download and install [SnowConvert AI](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/download-and-access).
2. [Recommended] Convert your legacy SQL Server code by using [SnowConvert AI](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/download-and-access).
3. Connect an account specifically designated for testing and development and avoid using a production account.

   Some objects will be created as part of the AI code conversion process.
4. Ensure the PUBLIC role in the account you connect doesn’t have access to any production data and doesn’t have privileges to execute any sensitive operations, such as CREATE USER commands.
5. Ensure that the role used for AI code conversion has the following privileges on the account:

   * CREATE DATABASE
   * CREATE MIGRATION
6. Enable Cortex AI SQL functions in the account, specifically for model `claude-4-sonnet`.

   * To enable the model if it’s not available in your region, see [Cross-region inference](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cross-region-inference#any-region).

## Getting started with AI code conversion

To begin a migration validation project, complete the following steps:

1. Execute the [code conversion of SnowConvert AI](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/README) on your SQL Server database.
2. After the code conversion is complete, select **Go to AI code conversion** from the **Results** page.

   | All AI processing happens in the Snowflake account you connect to and consumes Snowflake charges. |
   | --- |
3. You will be redirected to the **Connect to Snowflake** page to enter the connection parameters of a testing account. This is necessary to ensure that the AI code conversion process creates objects and executes queries in the test database and avoids unintended changes to the production database. Select **Continue**.
4. Acknowledge and confirm the **AI disclaimers** and select **Continue**.
5. The **Select objects** page displays the current conversion status of each database object under the **Conversion** column. Select the required objects for AI code conversion. You can also run an [AI code conversion process with source-system verification](https://docs.snowflake.com/en/migrations/snowconvert-docs/ai-verification/snowconvert-ai-twosided-verification) by selecting **Upload custom instructions**.

   SnowConvert automatically performs the following actions:

   1. Automatically selects and validates dependent objects when they are associated with your chosen objects.
   2. Reviews a summary of the selected objects, their dependencies, and the estimated time and Snowflake credit cost.
   3. Confirms the selection to proceed with code conversion.
6. Select **AI Convert**. SnowConvert AI connects to your Snowflake account, where it relies on [Cortex AI Functions](https://docs.snowflake.com/en/user-guide/snowflake-cortex/aisql) to review your code and suggest resolutions to any problems. AI code conversion might take a few minutes to start, and it might run for several minutes or hours depending on the complexity of the code being verified.
7. The **AI Results** page shows the status for the AI code conversion of selected objects. The **Status** column indicates the AI code conversion outcomes. Select **Details** to review the test code and test results, source code, and converted code.

   | Review the code generated by AI before deploying it. Code generated by AI might not be correct. |
   | --- |

   * Status of the AI code conversion:

     + **Converted successfully**: Indicates that the object was successfully converted by deterministic conversion (not by AI code conversion). It is ready to be deployed to Snowflake.
     + **Has issues**: Indicates that the object conversion was not successful and still has issues from either deterministic or AI conversion. It needs manual fixes or another AI code conversion run.
     + **Suggested fixes**: Indicates that AI code conversion proposed code fixes for the object. The fixes need user review before being considered ready for deployment to Snowflake.
     + **Verified by AI**: Indicates that AI code conversion successfully converted the object. It can be considered ready for deployment to Snowflake after review.
     + **Verified by User**: Indicates that a user explicity reviewed the object and marked it as valid for deployment. This is the highest trust level and objects in this state are excluded from subsequent AI code conversion runs.
   * **Open Code**:

     + By default, this option opens and compares your original source code and the code generated by the AI code conversion process in VS Code.
     + If you click the arrow next to **Open Code**, you also have the option to open and compare in VS Code:

       - The converted code from SnowConvert and the code converted by AI.
8. Select **Verified by user** for all objects for which you have accepted the AI code conversion. Only objects that are verified by the user can be deployed.

## Billing and cost considerations with SnowConvert AI code conversion

AI code conversion consumes Snowflake credits based on the compute resources it uses in your Snowflake account. The following features contribute to the cost:

* AI SQL - AI code conversion uses Cortex AI SQL.
* Warehouse - Test queries are executed in a warehouse.
* Snowflake stages - Input and outputs for AI code conversion are stored in a stage, which incurs storage costs.
* SPCS - AI code conversion might consume a small amount of credits to use Snowpark Container
  Services. To find the costs associated with AI code conversion, look for compute pools with names that start with
  `AI_MIGRATOR`. For more information, see [Snowpark Container Services costs](https://docs.snowflake.com/en//developer-guide/snowpark-container-services/accounts-orgs-usage-views).

For more information, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Limitations of AI powered code conversion

The initial version is optimized for standard SQL Server migrations. While migration process can handle many query types, all the changes generated by SnowConvert AI code conversion must be reviewed by the customer before they can be deployed to any account.

## Legal notices for AI features

Your use of Snowflake AI features is subject to all agreements, terms or policies that apply to such usage, including but not limited to those documented in the [Snowflake AI & ML Documentation](https://docs.snowflake.com/en/guides-overview-ai-features).

---
title: AI code conversion with source-system verification
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/ai-verification/snowconvert-ai-twosided-verification.md
section: Migrations
---

# AI code conversion with source-system verification

AI code conversion with source-system verification improves the accuracy of the conversion process. It runs the generated test case against both the converted code in Snowflake and the original source code in the source database. It then checks that both the source code and the converted code produce equivalent results.

> **Warning:**
>
> Never use your production account for AI code conversion with source-system verification. Use a testing account instead, as AI code conversion may introduce unpredictable modifications.

Compared to the default [AI code conversion](https://docs.snowflake.com/en/migrations/snowconvert-docs/snowconvert-ai-verification) where SnowConvert only verifies that the converted code is running successfully on Snowflake, AI code conversion with source-system verification ensures functional parity, higher confidence, and overall better conversion quality.

| AI code conversion with source-system verification is currently available for SQL Server and Redshift databases. |
| --- |

## Prerequisites

Select your source database platform to learn more about the prerequisites for running AI code conversion with source-system verification.

SQL ServerRedshift

AI code conversion with source-system verification currently requires an instance of SQL Server hosted in Snowpark Container Services (SPCS). This SQL Server instance acts as a host for the test cases that will be executed by the source-system verification process. The test results from this instance will be used as a baseline to compare against the test results generated by executing the test cases on the converted Snowflake SQL code.

The prerequisites are:

* An instance of SQL Server should be running in your SPCS environment. Download and run this [shell script](https://snowconvert.snowflake.com/storage/linux/prod/scripts/push_mssql_server.sh) to deploy an instance of SQL Server in the SPCS environment.

  + To obtain your `dns_name` you can run on your account `SHOW SERVICES LIKE 'mssql_server_demo_service';`
* Understand and agree to the legal responsibilities of using AI code conversion with source-system verification on your source data platform code inside the SPCS environment.
* Create a custom specification file (for example, `spec.yaml`). This file contains the connection parameters for the SQL Server instance running inside SPCS.

  Example:

  ```yaml
  mode: "TWO_SIDED"
  n_tests: 3
  repair: true
  num_workers: 2

  source_test_database:
    connection_params:
      hostname: <dns_name>  # Example: "mssql-server-demo-service.n4yw.svc.spcs.internal"
      port: 1433
      username: "user_name"
      password: "password"
    connection_metadata:
      type: "SPCS"
      spcs_service:
        name: "MSSQL_SERVER_DEMO_SERVICE"
        database: "SNOWCONVERT_AI"
        schema: "PUBLIC"

  project:
    custom_instructions:
      - "Preserve NULL handling semantics from source"
      - "Use ANSI SQL where possible"
      exclude: "test/.*\\.sql|backup/.*"
      extra_target_prerequisites: CREATE SCHEMA IF NOT EXISTS test_schema;
      verified objects:
        - "[dbo].[AlreadyVerifiedProc]"
        - "[dbo].[VerifiedView]"
      use_custom_database: true
      extra_file_dependencies:
        salesdb/summary.sql:
        - salesdb/customers.sql
        - salesdb/orders.sql

  additional_options:
    --n-tests: 5
    --project.custom-instructions:
      - "Additional instruction via additional_options"
      --project.extra-file-dependencies:
        additional_salesdb/summary.sql:
        - additional_salesdb/customers.sql
        - additional_salesdb/orders.sql
    --project.use-custom-database: false
    --project.verified-objects:
      - "[dbo].[additional_AlreadyVerifiedProc]"
      - "[dbo].[additional_VerifiedView]"
  ```

  + The `mode` “TWO_SIDED” indicates that the AI code conversion with source-system verification process will run on both source database code and target database code.
  + The host name, port number, and credentials for the SQL Server database running in SPCS are specified under `source_test_database`.
  + The name of the container service, database name, and schema name where the test cases for the source will be executed are specified under `connection_metadata`.

AI code conversion with source-system verification requires a Redshift instance running in your AWS environment. The instance should be accessible from the Snowpark Container Services (SPCS) container, which acts as a host for the test cases that will be executed by the source-system verification process. The test results from this instance will be used as a baseline to compare against the test results generated by executing the same test cases on the converted Snowflake SQL code.

The prerequisites are:

* Create a network rule and an External Access Integration (EAI) to allow incoming traffic on port 5439 (the default Redshift port) from the IP addresses used by your Snowflake environment and establish a connection with the Redshift instance.

  Example:

  ```sql
  -- Create a network rule allowing egress to Redshift
  CREATE OR REPLACE NETWORK RULE SNOWCONVERT_AI.PUBLIC.AI_MIGRATIONS_REDSHIFT_NETWORK_RULE
    MODE = EGRESS
    TYPE = HOST_PORT
    VALUE_LIST = ('your-testing-redshift-account.us-west-2.redshift-serverless.amazonaws.com:5439');

  -- Create the External Access Integration
  CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION AI_MIGRATIONS_REDSHIFT_EAI
    ALLOWED_NETWORK_RULES = (SNOWCONVERT_AI.PUBLIC.AI_MIGRATIONS_REDSHIFT_NETWORK_RULE)
    ENABLED = TRUE;
  ```
* Create a custom specification file (for example, `spec.yaml`). This file contains the connection parameters for the Redshift instance to be used for source-system verification.

  Example:

  ```yaml
  mode: "TWO_SIDED"
  n_tests: 3
  repair: true
  num_workers: 2

  source_test_database:
    connection_params:
      hostname: <dns_name>  # Example: your-testing-redshift-account.us-west-2.redshift-serverless.amazonaws.com
      port: 5439
      database: "database_name"
      username: "user_name"
      password: "password"
    connection_metadata:
      type: "EXTERNAL"
      spcs_service:
      eai: "AI_MIGRATIONS_REDSHIFT_EAI"

  project:
  custom_instructions:
    - "Preserve NULL handling semantics from source"
    - "Use ANSI SQL where possible"
    exclude: "test/.*\\.sql|backup/.*"
    extra_target_prerequisites: CREATE SCHEMA IF NOT EXISTS test_schema;
    verified objects:
      - "[dbo].[AlreadyVerifiedProc]"
      - "[dbo].[VerifiedView]"
    use_custom_database: true
    extra_file_dependencies:
      salesdb/summary.sql:
      - salesdb/customers.sql
      - salesdb/orders.sql

  additional_options:
  --n-tests: 5
  --project.custom-instructions:
    - "Additional instruction via additional_options"
    --project.extra-file-dependencies:
      additional_salesdb/summary.sql:
      - additional_salesdb/customers.sql
      - additional_salesdb/orders.sql
  --project.use-custom-database: false
  --project.verified-objects:
    - "[dbo].[additional_AlreadyVerifiedProc]"
    - "[dbo].[additional_VerifiedView]"
  ```

  + The mode “TWO_SIDED” indicates that the AI code conversion with source-system verification process will run on both source database code and target database code.
  + The host name, port number, and credentials for the Redshift database are specified under `source_test_database`.
  + The name of the container service, database name, and schema name where the test cases for the source will be executed are specified under `connection_metadata`.

### Configuration parameters

This section describes the components of the YAML file used to control the AI code conversion with source-system verification process.

#### Project options

The `project` section configures how the conversion and verification process handles your source objects. The following options are available:

| Option | CLI flag | Description |
| --- | --- | --- |
| `custom_instructions` | `--project.custom-instructions` | A list of custom instructions that guide the AI conversion process. |
| `exclude` | `--project.exclude` | A glob pattern to exclude specific objects from the conversion. |
| `extra_target_prerequisites` | `--project.extra-target-prerequisites` | SQL statements to run on the target database before verification. |
| `path_replacements` | `--project.path-replacements` | A dictionary that maps old file paths to new file paths. |
| `verified_objects` | `--project.verified-objects` | A list of fully qualified object names to include in verification. |
| `use_custom_database` | `--project.use-custom-database` | A boolean flag that enables the use of a custom database for the conversion. |
| `extra_file_dependencies` | `--project.extra-file-dependencies` | A dictionary that maps a filename to a list of dependency filenames that it requires. |

#### Additional options

The `additional_options` section accepts arbitrary key-value pairs that are passed directly to the CLI as flags. This means new CLI flags can be used immediately through the spec file without requiring code changes.

For example, the following spec entries:

```none
```yaml
additional_options:
  --some-new-flag: 42
  --experimental.feature: true
```
```

Are passed through to the API as:

```none
```json
{
  "--some-new-flag": 42,
  "--experimental.feature": ""
}
```
```

Boolean values set to `true` are passed as flag-only options (empty string value). Non-boolean values are passed with their specified value.

#### YAML to API payload mapping

The project options and additional options from the YAML spec are converted into a flat dictionary of CLI flags in the API payload. The following example shows how the full spec maps:

```none
```json
{
  "--project.custom-instructions": "'[\"Use MERGE instead of INSERT for upserts\",\"Preserve original column aliases\"]'",
  "--project.exclude": "'\"staging_*\"'",
  "--project.extra-target-prerequisites": "'\"CREATE SCHEMA IF NOT EXISTS analytics;\"'",
  "--project.path-replacements": "'{\"//old/source/path\":\"//new/source/path\",\"//legacy/scripts\":\"//migrated/scripts\"}'",
  "--project.verified-objects": "'[\"DBO.CUSTOMERS\",\"DBO.ORDERS\"]'",
  "--project.use-custom-database": "",
  "--project.extra-file-dependencies": "'{\"main_procedure.sql\":[\"helper_functions.sql\",\"common_types.sql\"],\"etl_load.sql\":[\"staging_tables.sql\"]}'",
  "--n-tests": 5,
  "--some-new-flag": 42,
  "--experimental.feature": ""
}
```
```

Note the following mapping behaviors:

* Project options are prefixed with `--project.` and use kebab-case (for example, `use_custom_database` becomes `--project.use-custom-database`).
* List values are serialized as JSON arrays.
* Dictionary values are serialized as JSON objects.
* Boolean flags (such as `use_custom_database: true`) are passed with an empty string value, indicating a flag-only option.
* Top-level spec fields like `n_tests` are mapped as `--n-tests`.
* Entries in `additional_options` are passed through as-is.

## Steps to run AI code conversion with source-system verification

1. From the AI code conversion page, connect to Snowflake using a valid connection string. SnowConvert currently supports [programmatic access tokens](https://docs.snowflake.com/en/user-guide/programmatic-access-tokens), in addition to the standard authentication methods for AI verification.
2. Accept the disclaimers and confirm your account identifier. Select **Continue** to proceed to the **Select objects to verify with AI**.
3. Select **Upload custom instructions** on the top left of the **Select objects** page.
4. Select the latest version of the `yaml` file you created as a prerequisite and select **Save**.

   You have now configured the AI code conversion process to run in two-sided mode. This will compare test results from the source SQL Server database to the converted Snowflake SQL code.
5. Proceed with the [AI code conversion steps](https://docs.snowflake.com/en/migrations/snowconvert-docs/snowconvert-ai-verification#getting-started-with-snowconvert-ai-verification) to complete the source-system verification process.

## Impact on costs and dependencies

The costs for running AI code conversion with source-system verification may be different from the default AI code conversion, depending on your Snowflake credits consumption for SPCS and Cortex AI. A summary of the estimated costs and number of objects affected by the source-system verification process can be found in the **Selection Summary**. We recommend reviewing the **Selection Summary** before proceeding to **AI convert**.

---
title: Amazon Redshift Commands Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/redshift_commands.md
section: Migrations
---

# Amazon Redshift Commands Reference

## Overview

This page provides comprehensive reference documentation for Amazon Redshift-specific commands in the Snowflake Data Validation CLI. For SQL Server commands, see [SQL Server Commands Reference](sqlserver_commands.md). For Teradata commands, see [Teradata Commands Reference](teradata_commands.md). For Snowflake-to-Snowflake commands, see [Snowflake Commands Reference](snowflake_commands.md).

---

## Command Structure

All Amazon Redshift commands follow this consistent structure:

```bash
snowflake-data-validation redshift <command> [options]

# Or use the shorter alias
sdv redshift <command> [options]
```

Where `<command>` is one of:

* `run-validation` - Run synchronous validation
* `run-async-validation` - Run asynchronous validation
* `generate-validation-scripts` - Generate validation scripts
* `get-configuration-files` - Get configuration templates
* `auto-generated-configuration-file` - Interactive config generation
* `row-partitioning-helper` - Interactive row partitioning configuration
* `column-partitioning-helper` - Interactive column partitioning configuration

---

## Run Synchronous Validation

Validates data between Amazon Redshift and Snowflake in real-time.

### Syntax

```bash
snowflake-data-validation redshift run-validation \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file containing validation settings
* **Example:** `--data-validation-config-file ./configs/redshift_validation.yaml`

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Basic validation
sdv redshift run-validation \
  --data-validation-config-file ./configs/redshift_validation.yaml

# Validation with debug logging
sdv redshift run-validation \
  --data-validation-config-file ./configs/redshift_validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation redshift run-validation \
  -dvf /opt/validations/prod_config.yaml \
  -ll INFO
```

### Use Cases

* Real-time validation during Redshift migration
* Pre-cutover validation checks
* Post-migration verification
* Continuous validation in CI/CD pipelines
* Data lake migration validation

---

## Run Asynchronous Validation

Performs validation using pre-generated metadata files without connecting to databases.

### Syntax

```bash
snowflake-data-validation redshift run-async-validation \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file
* **Note:** Configuration must specify paths to pre-generated metadata files

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Run async validation
sdv redshift run-async-validation \
  --data-validation-config-file ./configs/async_validation.yaml

# Async validation with debug logging
sdv redshift run-async-validation \
  --data-validation-config-file ./configs/async_validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation redshift run-async-validation \
  -dvf /data/validations/async_config.yaml \
  -ll INFO
```

### Prerequisites

Before running async validation:

1. Generate validation scripts using `generate-validation-scripts`
2. Execute the generated scripts on Redshift and Snowflake databases
3. Save results to CSV/metadata files
4. Ensure metadata files are available in the configured paths

### Use Cases

* Validating in environments with restricted database access
* Separating metadata extraction from validation
* Batch validation workflows
* Scheduled validation jobs
* When database connections are intermittent

---

## Generate Validation Scripts

Generates SQL scripts for Redshift and Snowflake metadata extraction.

### Syntax

```bash
snowflake-data-validation redshift generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml \
  --log-level DEBUG
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for script generation
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Generate scripts
sdv redshift generate-validation-scripts \
  --data-validation-config-file ./configs/validation.yaml

# Generate scripts with debug logging
sdv redshift generate-validation-scripts \
  --data-validation-config-file ./configs/validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation redshift generate-validation-scripts \
  -dvf /opt/configs/script_generation.yaml \
  -ll INFO
```

### Output

The command generates SQL scripts in the output directory configured in your YAML file:

```text
<output_directory>/
├── redshift_schema_queries.sql
├── redshift_metrics_queries.sql
├── redshift_row_queries.sql
├── snowflake_schema_queries.sql
├── snowflake_metrics_queries.sql
└── snowflake_row_queries.sql
```

### Use Cases

* Generating scripts for execution by DBAs
* Compliance requirements for query review
* Environments where direct CLI database access is restricted
* Manual execution and validation workflows
* Separating metadata extraction from validation

---

## Get Configuration Templates

Retrieves Redshift configuration templates.

### Syntax

```bash
snowflake-data-validation redshift get-configuration-files \
  --templates-directory ./redshift-templates \
  --query-templates
```

### Options

**`--templates-directory, -td`** (optional)

* **Type:** String (path)
* **Default:** Current directory
* **Description:** Directory to save template files
* **Example:** `--templates-directory ./templates`

**`--query-templates`** (optional)

* **Type:** Flag (no value required)
* **Description:** Include J2 (Jinja2) query template files for advanced customization
* **Example:** `--query-templates`

### Example Usage

```bash
# Get basic templates in current directory
sdv redshift get-configuration-files

# Save templates to specific directory
sdv redshift get-configuration-files \
  --templates-directory ./my-project/redshift-templates

# Include query templates for customization
sdv redshift get-configuration-files \
  --templates-directory ./templates \
  --query-templates

# Using short flags
sdv redshift get-configuration-files -td ./templates --query-templates
```

### Output Files

**Without `--query-templates` flag:**

```text
<templates_directory>/
└── redshift_validation_template.yaml
```

**With `--query-templates` flag:**

```text
<templates_directory>/
├── redshift_validation_template.yaml
└── query_templates/
    ├── redshift_columns_metrics_query.sql.j2
    ├── redshift_row_count_query.sql.j2
    ├── redshift_compute_md5_sql.j2
    └── snowflake_columns_metrics_query.sql.j2
```

### Use Cases

* Starting a new Redshift validation project
* Learning Redshift-specific configuration options
* Customizing validation queries for Redshift
* Creating organization-specific templates

---

## Auto-Generate Configuration File

Interactive command for Redshift configuration generation.

### Syntax

```bash
snowflake-data-validation redshift auto-generated-configuration-file
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### Interactive Prompts

The command will prompt for the following information:

1. **Redshift host**

   * Hostname/endpoint of Redshift cluster
   * Example: `redshift-cluster.region.redshift.amazonaws.com`
2. **Redshift port** (default: 5439)

   * Port number for Redshift connection
   * Press Enter to accept default
3. **Redshift username**

   * Authentication username
   * Example: `migration_user`
4. **Redshift password**

   * Authentication password (hidden input)
   * Not displayed on screen for security
5. **Redshift database**

   * Name of the database to validate
   * Example: `analytics_db`
6. **Redshift schema**

   * Schema name within the database
   * Example: `public`
7. **Output directory path**

   * Where to save validation results
   * Example: `./validation_results`

### Example Session

```bash
$ sdv redshift auto-generated-configuration-file

Redshift host: redshift-cluster.us-east-1.redshift.amazonaws.com
Redshift port [5439]:
Redshift username: migration_user
Redshift password: ********
Redshift database: analytics_db
Redshift schema: public
Output directory path: ./validation_results

Configuration file generated successfully: ./redshift_validation_config.yaml
```

### Generated Configuration

The command generates a basic YAML configuration file:

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./validation_results

source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: migration_user
  password: "<hidden>"
  database: analytics_db

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables: []
```

### Next Steps After Generation

1. **Edit the configuration file** to add:

   * Target connection details (if not using default)
   * Tables to validate
   * Validation options
   * Column selections and mappings
2. **Review security settings:**

   * Consider using environment variables for passwords
   * Verify IAM authentication if applicable
3. **Add table configurations:**

   * Specify fully qualified table names
   * Configure column selections
   * Set up filtering where clauses
4. **Test the configuration:**

   ```bash
   sdv redshift run-validation \
     --data-validation-config-file ./redshift_validation_config.yaml
   ```

### Use Cases

* Quick setup for new Redshift users
* Generating baseline configurations
* Testing connectivity during setup
* Creating template configurations for teams

---

## Row Partitioning Helper

Interactive command to generate partitioned table configurations for large tables. This helper divides tables into smaller row partitions based on a specified column, enabling more efficient validation of large datasets.

### Syntax

```bash
snowflake-data-validation redshift row-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The table partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply partitioning
3. If partitioning is enabled, collects partition parameters
4. Queries the source Redshift database to determine partition boundaries
5. Generates new table configurations with `WHERE` clauses for each partition
6. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/redshift_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply partitioning?** (yes/no)

   * Whether to partition this specific table
   * Default: yes

   b. **Partition column** (if partitioning)

   * Column name used to divide the table
   * Should be indexed for performance
   * Example: `transaction_id`, `created_date`

   c. **Is partition column a string type?** (yes/no)

   * Determines quoting in generated WHERE clauses
   * Default: no (numeric)

   d. **Number of partitions**

   * How many partitions to create
   * Example: `10`, `50`, `100`

### Example Session

```bash
$ sdv redshift row-partitioning-helper

Generate a configuration file for Redshift table partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip partitioning or specify partitioning parameters for each table.

Configuration file path: ./configs/redshift_validation.yaml

Apply partitioning for public.fact_sales? [Y/n]: y
Write the partition column for public.fact_sales: sale_id
Is 'sale_id' column a string type? [y/N]: n
Write the number of partitions for public.fact_sales: 10

Apply partitioning for public.dim_customer? [Y/n]: n

Apply partitioning for public.transactions? [Y/n]: y
Write the partition column for public.transactions: transaction_date
Is 'transaction_date' column a string type? [y/N]: n
Write the number of partitions for public.transactions: 5

Table partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with WHERE clauses:

```yaml
tables:
  # Original table partitioned into 10 segments
  - fully_qualified_name: public.fact_sales
    where_clause: "sale_id >= 1 AND sale_id < 100000"
    target_where_clause: "sale_id >= 1 AND sale_id < 100000"
    # ... other table settings preserved

  - fully_qualified_name: public.fact_sales
    where_clause: "sale_id >= 100000 AND sale_id < 200000"
    target_where_clause: "sale_id >= 100000 AND sale_id < 200000"
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: public.dim_customer
    # ... original configuration
```

### Use Cases

* **Large table validation**: Break multi-billion row tables into manageable chunks
* **Parallel processing**: Enable concurrent validation of different partitions
* **Memory optimization**: Reduce memory footprint by processing smaller data segments
* **Incremental validation**: Validate specific data ranges independently
* **Performance tuning**: Optimize validation for tables with uneven data distribution

### Best Practices

1. **Choose appropriate partition columns:**

   * Use indexed columns for better query performance
   * Prefer columns with sequential values (IDs, timestamps)
   * Avoid columns with highly skewed distributions
2. **Determine optimal partition count:**

   * Consider table size and available resources
   * Start with 10-20 partitions for tables with 10M+ rows
   * Increase partitions for very large tables (100M+ rows)
3. **String vs numeric columns:**

   * Numeric columns are generally more efficient
   * String columns work but may have uneven distribution
4. **After partitioning:**

   * Review generated WHERE clauses
   * Adjust partition boundaries if needed
   * Test with a subset before full validation

---

## Column Partitioning Helper

Interactive command to generate partitioned table configurations for wide tables with many columns. This helper divides tables into smaller column partitions, enabling more efficient validation of tables with a large number of columns.

### Syntax

```bash
snowflake-data-validation redshift column-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The column partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply column partitioning
3. If partitioning is enabled, collects the number of partitions
4. Queries the source Redshift database to retrieve all column names for the table
5. Divides the columns into the specified number of partitions
6. Generates new table configurations where each partition validates only a subset of columns
7. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/redshift_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply column partitioning?** (yes/no)

   * Whether to partition this specific table by columns
   * Default: yes

   b. **Number of partitions** (if partitioning)

   * How many column partitions to create
   * Example: `3`, `5`, `10`

### Example Session

```bash
$ sdv redshift column-partitioning-helper

Generate a configuration file for Redshift column partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip column partitioning or specify column partitioning parameters for each table.

Configuration file path: ./configs/redshift_validation.yaml

Apply column partitioning for public.wide_table? [Y/n]: y
Write the number of partitions for public.wide_table: 5

Apply column partitioning for public.small_table? [Y/n]: n

Apply column partitioning for public.report_table? [Y/n]: y
Write the number of partitions for public.report_table: 3

Column partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with column subsets:

```yaml
tables:
  # Original table with 100 columns partitioned into 5 segments (20 columns each)
  - fully_qualified_name: public.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_a
      - column_b
      - column_c
      # ... first 20 columns alphabetically

  - fully_qualified_name: public.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_d
      - column_e
      - column_f
      # ... next 20 columns alphabetically
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: public.small_table
    # ... original configuration
```

### Use Cases

* **Wide table validation**: Break tables with hundreds of columns into manageable chunks
* **Memory optimization**: Reduce memory footprint by validating fewer columns at a time
* **Parallel processing**: Enable concurrent validation of different column groups
* **Targeted validation**: Validate specific column groups independently
* **Performance tuning**: Optimize validation for tables with many LOB or complex columns

### Best Practices

1. **Determine optimal partition count:**

   * Consider the total number of columns in the table
   * For tables with 50+ columns, start with 3-5 partitions
   * For tables with 100+ columns, consider 5-10 partitions
2. **Column ordering:**

   * Columns are divided alphabetically
   * Related columns may end up in different partitions
3. **After partitioning:**

   * Review generated column lists
   * Verify all required columns are included
   * Test with a subset before full validation
4. **Combine with row partitioning:**

   * For very large, wide tables, consider using both row and column partitioning
   * First partition by columns, then apply row partitioning to each column partition if needed

---

## Amazon Redshift Connection Configuration

Redshift connections require specific configuration in the YAML file.

### Connection Example

```yaml
source_connection:
  mode: credentials
  host: "redshift-cluster.region.redshift.amazonaws.com"
  port: 5439
  username: "redshift_user"
  password: "secure_password"
  database: "source_database"
```

### Connection Fields

**`mode`** (required)

* **Type:** String
* **Valid Values:** `credentials`
* **Description:** Connection mode for Redshift

**`host`** (required)

* **Type:** String
* **Description:** Redshift cluster endpoint
* **Format:** `<cluster-name>.<cluster-id>.<region>.redshift.amazonaws.com`
* **Examples:**

  + `"redshift-cluster-1.abc123.us-east-1.redshift.amazonaws.com"`
  + `"analytics-cluster.xyz789.eu-west-1.redshift.amazonaws.com"`
  + `"data-warehouse.def456.ap-southeast-1.redshift.amazonaws.com"`

**`port`** (required)

* **Type:** Integer
* **Default:** 5439
* **Description:** Redshift port number
* **Note:** Use the port configured for your Redshift cluster

**`username`** (required)

* **Type:** String
* **Description:** Redshift authentication username
* **Example:** `"migration_admin"`

**`password`** (required)

* **Type:** String
* **Description:** Redshift authentication password
* **Security Note:** Consider using environment variables or IAM authentication

**`database`** (required)

* **Type:** String
* **Description:** Redshift database name
* **Example:** `"analytics_database"`

### Connection Examples

**Production Connection:**

```yaml
source_connection:
  mode: credentials
  host: "prod-cluster.abc123.us-east-1.redshift.amazonaws.com"
  port: 5439
  username: "prod_reader"
  password: "${REDSHIFT_PASSWORD}"  # From environment variable
  database: "production_db"
```

**Development Connection:**

```yaml
source_connection:
  mode: credentials
  host: "dev-cluster.xyz789.us-west-2.redshift.amazonaws.com"
  port: 5439
  username: "dev_user"
  password: "dev_password"
  database: "dev_database"
```

**Data Lake Migration Connection:**

```yaml
source_connection:
  mode: credentials
  host: "datalake-cluster.def456.us-east-1.redshift.amazonaws.com"
  port: 5439
  username: "migration_user"
  password: "${AWS_REDSHIFT_PASSWORD}"
  database: "datalake_db"
```

---

## Complete Amazon Redshift Examples

### Example 1: Basic Redshift Configuration

```yaml
# Global configuration
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./validation_results
max_threads: auto

# Source connection
source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: redshift_user
  password: redshift_password
  database: analytics_db

# Target connection
target_connection:
  mode: name
  name: snowflake_analytics

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Tables to validate
tables:
  - fully_qualified_name: public.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: public.orders
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_notes
      - audit_log
```

### Example 2: Redshift Data Lake Migration

```yaml
# Global configuration
source_platform: Redshift
target_platform: Snowflake
output_directory_path: /data/validation/redshift_migration
max_threads: 16

# Source connection
source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: redshift_admin
  password: redshift_secure_password
  database: analytics_db

# Target connection
target_connection:
  mode: name
  name: snowflake_analytics

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200
  exclude_metrics: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.02

# Logging configuration
logging_configuration:
  level: INFO
  console_level: ERROR
  file_level: DEBUG

# Database mappings
database_mappings:
  analytics_db: ANALYTICS_PROD

# Schema mappings
schema_mappings:
  public: PUBLIC
  staging: STAGING

# Tables configuration
tables:
  # Large fact table with chunking
  - fully_qualified_name: public.fact_sales
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - sale_id
    chunk_number: 50
    max_failed_rows_number: 500

  # Dimension table with column mappings
  - fully_qualified_name: public.dim_customer
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - customer_key
      - customer_name
      - email
      - phone
      - address
    column_mappings:
      customer_key: cust_key
      customer_name: name
    is_case_sensitive: false

  # Filtered validation
  - fully_qualified_name: staging.incremental_load
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - load_timestamp
      - etl_batch_id
    where_clause: "load_date >= CURRENT_DATE - 7"
    target_where_clause: "load_date >= CURRENT_DATE - 7"
    chunk_number: 10
```

### Example 3: Redshift with Complex Filtering

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: /opt/validation/redshift
max_threads: 24

source_connection:
  mode: credentials
  host: complex-cluster.us-west-2.redshift.amazonaws.com
  port: 5439
  username: validation_user
  password: secure_password
  database: enterprise_db

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 150

comparison_configuration:
  tolerance: 0.01

logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

schema_mappings:
  public: PUBLIC

tables:
  # Time-based filtering
  - fully_qualified_name: public.transactions
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - transaction_id
      - customer_id
      - amount
      - transaction_date
      - status
    index_column_list:
      - transaction_id
    where_clause: "transaction_date >= '2024-01-01' AND status IN ('COMPLETED', 'PENDING')"
    target_where_clause: "transaction_date >= '2024-01-01' AND status IN ('COMPLETED', 'PENDING')"
    chunk_number: 30

  # Complex filtering with multiple conditions
  - fully_qualified_name: public.customer_activity
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_score
      - risk_assessment
    where_clause: "last_activity_date >= DATEADD(month, -6, CURRENT_DATE) AND account_status = 'ACTIVE' AND total_purchases > 100"
    target_where_clause: "last_activity_date >= DATEADD(month, -6, CURRENT_DATE) AND account_status = 'ACTIVE' AND total_purchases > 100"
    index_column_list:
      - customer_id
    chunk_number: 20
    max_failed_rows_number: 100

  # Regional filtering
  - fully_qualified_name: public.sales_by_region
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "region IN ('US-EAST', 'US-WEST', 'EU') AND sale_date >= '2024-01-01'"
    target_where_clause: "region IN ('US-EAST', 'US-WEST', 'EU') AND sale_date >= '2024-01-01'"
    index_column_list:
      - sale_id
      - region
```

### Example 4: Redshift View Validation

Validate Amazon Redshift views alongside tables for comprehensive migration verification.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./redshift_view_validation
max_threads: 12

source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: rs_validator
  password: RedshiftPass123!
  database: analytics

target_connection:
  mode: name
  name: snowflake_analytics

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 50

comparison_configuration:
  tolerance: 0.01

database_mappings:
  analytics: ANALYTICS_PROD

schema_mappings:
  public: PUBLIC
  reports: REPORTS

# Tables to validate
tables:
  - fully_qualified_name: public.CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

# Views to validate
views:
  # View with column mappings
  - fully_qualified_name: reports.V_USER_ACTIVITY
    target_name: V_USER_ACTIVITY
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - USER_ID
      - LAST_LOGIN
      - SESSION_COUNT
      - TOTAL_DURATION
    index_column_list: [USER_ID]
    target_index_column_list: [USER_ID]
    column_mappings:
      USER_ID: USER_ID
      LAST_LOGIN: LAST_LOGIN_DATE
      SESSION_COUNT: SESSIONS
      TOTAL_DURATION: DURATION_MINUTES

  # View with date filtering
  - fully_qualified_name: reports.V_CONVERSION_FUNNEL
    target_name: V_CONVERSION_FUNNEL
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [EVENT_ID]
    target_index_column_list: [EVENT_ID]
    where_clause: "event_date >= CURRENT_DATE - 30"
    target_where_clause: "event_date >= CURRENT_DATE - 30"

  # View with composite index columns for row validation
  - fully_qualified_name: public.V_DAILY_METRICS
    target_name: V_DAILY_METRICS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [METRIC_DATE, METRIC_TYPE]
    target_index_column_list: [METRIC_DATE, METRIC_TYPE]

  # View with different target name
  - fully_qualified_name: reports.V_LEGACY_DASHBOARD
    target_database: MODERN_ANALYTICS
    target_schema: DASHBOARDS
    target_name: V_MODERN_DASHBOARD
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [DASHBOARD_ID]
    target_index_column_list: [DASHBOARD_ID]
```

**Note:** View validation creates temporary tables internally to materialize view data for comparison between Amazon Redshift and Snowflake.

---

## Troubleshooting Redshift Connections

### Issue: Connection Timeout

**Symptom:**

```sql
Connection timeout: Unable to connect to Redshift cluster
```

**Solutions:**

1. Verify the cluster endpoint and port:

   ```bash
   telnet redshift-cluster.region.redshift.amazonaws.com 5439
   ```
2. Check VPC security groups allow inbound connections on port 5439
3. Verify the cluster is publicly accessible (if connecting from outside VPC)
4. Check route tables and network ACLs
5. Verify the cluster is in “available” state in AWS console

### Issue: Authentication Failed

**Symptom:**

```sql
Authentication failed for user 'username'
```

**Solutions:**

1. Verify credentials are correct
2. Check user has necessary permissions:

   ```sql
   -- Grant read permissions
   GRANT SELECT ON ALL TABLES IN SCHEMA public TO migration_user;
   ```
3. Verify user account exists:

   ```sql
   SELECT usename FROM pg_user WHERE usename = 'migration_user';
   ```
4. Check if password has expired or needs to be reset

### Issue: Database Not Found

**Symptom:**

```sql
Database 'database_name' does not exist
```

**Solutions:**

1. Verify database name is correct (case-sensitive)
2. List available databases:

   ```sql
   SELECT datname FROM pg_database;
   ```
3. Ensure user has access to the database

### Issue: SSL/TLS Certificate Errors

**Symptom:**

```sql
SSL certificate verification failed
```

**Solutions:**

1. Verify SSL is required for the cluster
2. Check AWS Redshift SSL/TLS settings
3. Ensure you’re using the correct endpoint (not VPC endpoint)

### Issue: Network/VPC Configuration

**Symptom:**

```sql
Connection refused or network unreachable
```

**Solutions:**

1. **Check cluster publicly accessible setting:**

   * In AWS Console, verify “Publicly accessible” is enabled if connecting externally
2. **Verify VPC security group rules:**

   * Inbound rule: Type = Custom TCP, Port = 5439, Source = Your IP
3. **Check VPC route table:**

   * Ensure proper routing to internet gateway (for public access)
4. **Verify VPC Network ACLs:**

   * Allow inbound/outbound traffic on port 5439

---

## Best Practices for Amazon Redshift

### Security

1. **Use IAM authentication when possible:**

   ```yaml
   # Note: IAM authentication setup requires additional AWS configuration
   source_connection:
     mode: credentials
     host: "cluster.region.redshift.amazonaws.com"
     # Use temporary credentials from IAM
   ```
2. **Store passwords securely:**

   ```yaml
   source_connection:
     password: "${REDSHIFT_PASSWORD}"  # From environment variable
   ```
3. **Use read-only accounts:**

   ```sql
   CREATE USER migration_reader WITH PASSWORD 'secure_password';
   GRANT USAGE ON SCHEMA public TO migration_reader;
   GRANT SELECT ON ALL TABLES IN SCHEMA public TO migration_reader;
   ```
4. **Restrict VPC access:**

   * Configure security groups to allow access only from specific IPs
   * Use VPC endpoints for internal AWS connectivity

### Performance

1. **Enable chunking for large tables:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 50
   ```
2. **Use WHERE clauses to filter data:**

   ```yaml
   tables:
     - fully_qualified_name: transactions
       where_clause: "transaction_date >= CURRENT_DATE - 30"
   ```
3. **Optimize thread count:**

   ```yaml
   max_threads: 16  # Adjust based on cluster size
   ```
4. **Consider cluster size and workload:**

   * Run validations during off-peak hours
   * Monitor cluster performance during validation

### Data Quality

1. **Handle distribution and sort keys:**

   * Be aware that Redshift distribution/sort keys may affect data ordering
   * Use appropriate index columns that match distribution keys
2. **Start with schema validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: false
     row_validation: false
   ```
3. **Progress to metrics validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: true
     row_validation: false
   ```
4. **Enable row validation selectively:**

   ```yaml
   validation_configuration:
     row_validation: true

   tables:
     - fully_qualified_name: critical_fact_table
       # Row validation enabled for this table
   ```

### AWS-Specific Considerations

1. **Monitor cluster performance:**

   * Use AWS CloudWatch metrics during validation
   * Monitor query performance and WLM queues
2. **Consider cluster maintenance windows:**

   * Avoid running validations during maintenance windows
   * Check cluster status before starting validation
3. **Use appropriate cluster endpoints:**

   * Use cluster endpoint for direct connections
   * Use VPC endpoint for internal AWS connectivity
4. **Handle AWS region-specific configurations:**

   ```yaml
   source_connection:
     host: "cluster.us-east-1.redshift.amazonaws.com"  # Specify correct region
   ```

---

## See Also

* [Main CLI Usage Guide](CLI_USAGE_GUIDE.md)
* [SQL Server Commands Reference](sqlserver_commands.md)
* [Teradata Commands Reference](teradata_commands.md)
* [Snowflake Commands Reference](snowflake_commands.md)
* [Configuration Examples](CONFIGURATION_EXAMPLES.md)
* [Quick Reference Guide](CLI_QUICK_REFERENCE.md)

---
title: Amazon Redshift to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/redshift.md
section: Migrations
---

# **Amazon Redshift to Snowflake Migration Guide**

## **Snowflake Migration Framework**

A typical Amazon Redshift-to-Snowflake migration can be broken down into nine key phases. This guide provides a comprehensive framework to navigate the technical and strategic challenges involved, ensuring a smooth transition to Snowflake’s cloud data platform.

## **Migration Phases**

### **Phase 1: Planning and Design**

This initial phase is critical for establishing the foundation of a successful migration. Rushing this step often leads to scope creep, budget overruns, and missed deadlines. A thorough plan ensures all stakeholders are aligned and the project’s goals are clearly defined.

**Your Actionable Steps:**

* **Conduct a Thorough Assessment of Your Redshift Environment:**

  + **Inventory & Analyze:** Catalog all databases, schemas, tables, views, stored procedures, and user-defined functions (UDFs) in your Redshift cluster. Use Redshift system tables (SVV_TABLE_INFO, PG_PROC, etc.) to gather metadata.
  + **Analyze Workloads:** Use Redshift’s STL_QUERY and SVL_QUERY_SUMMARY views to identify query patterns, user concurrency, and performance bottlenecks. This data is crucial for designing your Snowflake virtual warehouse strategy.
  + **Identify Dependencies:** Map all upstream data sources (ETL/ELT jobs) and downstream consumers (BI tools, applications, data science notebooks).
* **Define the Migration Scope and Strategy:**

  + **Prioritize Workloads:** Classify workloads by business impact and technical complexity. Start with a high-impact, low-complexity workload for a quick win and to build momentum.
  + **Choose a Migration Approach:** Decide between a “lift and shift” approach for a faster migration or a re-architecture approach to modernize and optimize data models and pipelines.
* **Develop the Project Plan:**

  + **Establish a Team:** Create a migration team with clear roles and responsibilities (e.g., Project Manager, Data Engineer, DBA, Security Admin, Business Analyst).
  + **Create a Timeline:** Define realistic timelines and milestones for each of the nine phases.
  + **Define Success Metrics:** Establish clear KPIs to measure the success of the migration, such as cost reduction, query performance improvement, and user satisfaction.

### **Phase 2: Environment and Security**

With a solid plan in place, the next step is to prepare the Snowflake environment and replicate your security posture. A key advantage of migrating from Redshift is that both platforms typically run on the same cloud provider (AWS), which simplifies data transfer.

**Your Actionable Steps:**

* **Set Up Your Snowflake Account:**

  + **Choose Edition and Cloud Provider:** Select the Snowflake edition (e.g., Standard, Enterprise, Business Critical) that meets your needs. Choose AWS as the cloud provider and select the same region as your current S3 buckets to minimize data transfer costs and latency.
  + **Design a Warehouse Strategy:** Based on the workload analysis from Phase 1, create an initial set of virtual warehouses. Isolate different workloads (e.g., WH_LOADING, WH_TRANSFORM, WH_BI_ANALYTICS) to prevent resource contention. Start with T-shirt sizes (e.g., X-Small, Small) and plan to resize them based on performance testing.
* **Implement the Security Model:**

  + **Map Redshift Users/Groups to Snowflake Roles:** Translate Redshift’s user and group permissions into Snowflake’s Role-Based Access Control (RBAC) model. Create a hierarchy of functional roles (e.g., SYSADMIN, SECURITYADMIN) and access roles (e.g., BI_READ_ONLY, ETL_READ_WRITE).
  + **Configure Network Policies and Authentication:** Set up network policies to restrict access to trusted IP addresses. Configure authentication methods, such as federated authentication (SSO) using an identity provider like Okta or Azure AD.

### **Phase 3: Database Code Conversion**

This phase involves converting Redshift’s DDL, DML, and procedural code to be compatible with Snowflake. Automation tools can accelerate this process, but manual review and adjustment are essential due to differences in SQL dialects and platform architecture.

**Your Actionable Steps:**

* **Convert DDL (Data Definition Language):**

  + **Tables and Views:** Extract CREATE TABLE and CREATE VIEW statements from Redshift. Convert Redshift-specific data types to their Snowflake equivalents (see Appendix 2).
  + **Remove Redshift-Specific Clauses:** Eliminate Redshift-specific physical design clauses like DISTSTYLE, DISTKEY, and SORTKEY. Snowflake’s architecture handles data distribution and clustering automatically or through logical clustering keys on very large tables.
* **Convert DML (Data Manipulation Language) and Procedural Code:**

  + **Rewrite Stored Procedures:** Redshift uses PL/pgSQL for stored procedures. These must be manually rewritten into a language supported by Snowflake, such as Snowflake Scripting (SQL), JavaScript, Python, or Java. This is often the most time-consuming part of the code conversion process.
  + **Translate SQL Functions:** Map Redshift-specific functions to their Snowflake counterparts. For example, Redshift’s GETDATE() becomes Snowflake’s CURRENT_TIMESTAMP(). See Appendix 3 for common function mappings.
  + **Replace Maintenance Commands:** Scripts containing Redshift-specific commands like VACUUM, ANALYZE, and REINDEX should be removed, as Snowflake handles these maintenance tasks automatically.

### **Phase 4: Data Migration**

This phase focuses on the physical movement of historical data from your Redshift cluster to Snowflake tables. The most efficient method leverages Amazon S3 as an intermediate staging area.

**Your Actionable Steps:**

* **Unload Data from Redshift to S3:**

  + Use the Redshift UNLOAD command to export data from tables into a designated S3 bucket. This is highly parallelized and significantly faster than a SELECT query via a client tool.
  + Format data as Parquet or compressed CSV for optimal loading performance into Snowflake. Use the PARALLEL ON option to write multiple files.
* **Load Data from S3 into Snowflake:**

  + **Create External Stages:** In Snowflake, create an external stage object that points to the S3 bucket containing your unloaded data.
  + **Use the COPY INTO Command:** Use Snowflake’s COPY INTO <table> command to load the data from the S3 stage into the target Snowflake tables. This command is highly performant and scalable.
  + **Leverage a Sized-Up Warehouse:** Use a dedicated, larger virtual warehouse for the initial data load to accelerate the process, and then scale it down or suspend it afterward to manage costs.

### **Phase 5: Data Ingestion**

Once the historical data is migrated, you must re-engineer your ongoing data ingestion pipelines to feed data directly into Snowflake instead of Redshift.

**Your Actionable Steps:**

* **Migrate Batch ETL/ELT Jobs:**

  + Update existing ETL jobs (in tools like AWS Glue, Talend, or Informatica) to target Snowflake as the destination. This typically involves changing the connection details and updating any SQL overrides to use Snowflake’s dialect.
* **Implement Continuous Ingestion with Snowpipe:**

  + For continuous data streams (e.g., from Kinesis or application logs landing in S3), configure Snowpipe. Snowpipe automatically and efficiently loads new data files from S3 into Snowflake tables as they arrive, providing a near-real-time ingestion solution.
* **Utilize the Snowflake Ecosystem:**

  + Explore Snowflake’s native connectors for platforms like Kafka and Spark to simplify direct data streaming.

### **Phase 6: Reporting and Analytics**

This phase involves redirecting all downstream applications, particularly BI and reporting tools, to query data from Snowflake.

**Your Actionable Steps:**

* **Update Connection Drivers:** Install and configure Snowflake’s ODBC/JDBC drivers on servers hosting your BI tools (e.g., Tableau Server, Power BI Gateway).
* **Redirect Reports and Dashboards:**

  + In your BI tools, change the data source connection from Redshift to Snowflake.
  + Test all critical reports and dashboards to ensure they function correctly.
* **Review and Optimize Queries:**

  + Some dashboards may contain custom SQL or database-specific functions. Review and refactor these queries to use Snowflake’s SQL dialect and take advantage of its performance features. Use the Query Profile tool in Snowflake to analyze and optimize slow-running reports.

### **Phase 7: Data Validation and Testing**

Rigorous testing is essential to build business confidence in the new platform and ensure data integrity and performance meet expectations.

**Your Actionable Steps:**

* **Perform Data Validation:**

  + **Row Counts:** Compare row counts between source tables in Redshift and target tables in Snowflake.
  + **Cell-Level Validation:** For critical tables, perform a deeper validation by comparing aggregated values (e.g., SUM(), AVG(), MIN(), MAX()) or using checksums on key columns.
* **Conduct Query and Performance Testing:**

  + **Benchmark Queries:** Execute a representative set of queries against both Redshift and Snowflake and compare results and performance.
  + **BI Tool Performance:** Test the load times and interactivity of key dashboards connected to Snowflake.
* **User Acceptance Testing (UAT):**

  + Involve business users to validate their reports and perform their daily tasks using the new Snowflake environment. Gather feedback and address any issues.

### **Phase 8: Deployment**

Deployment is the final cutover from Redshift to Snowflake. This process should be carefully managed to minimize disruption to business operations.

**Your Actionable Steps:**

* **Develop a Cutover Plan:**

  + Define the sequence of events for the cutover weekend or evening. This includes stopping ETL jobs pointing to Redshift, performing a final data sync, redirecting all connections, and validating system health.
* **Execute the Final Data Sync:**

  + Perform one last incremental data load to capture any data changes that occurred during the testing phase.
* **Go Live:**

  + Switch all production data pipelines and user connections from Redshift to Snowflake.
  + Keep the Redshift environment in a read-only state for a short period as a fallback before decommissioning it.
* **Decommission Redshift:**

  + Once the Snowflake environment is stable and validated in production, you can decommission your Redshift cluster to stop incurring costs.

### **Phase 9: Optimize and Run**

This final phase is an ongoing process of managing performance, cost, and governance in your new Snowflake environment. The goal is to continuously refine your setup to maximize value.

**Your Actionable Steps:**

* **Implement Performance and Cost Optimization:**

  + **Right-Size Warehouses:** Continuously monitor workload performance and adjust virtual warehouse sizes up or down to meet SLAs at the lowest possible cost.
  + **Set Aggressive Auto-Suspend Policies:** Set the auto-suspend timeout for all warehouses to 60 seconds to avoid paying for idle compute time.
  + **Use Clustering Keys:** For very large tables (multi-terabyte), analyze query patterns and define clustering keys to improve the performance of highly filtered queries.
* **Establish Long-Term FinOps and Governance:**

  + **Monitor Costs:** Use Snowflake’s ACCOUNT_USAGE schema and resource monitors to track credit consumption and prevent budget overruns.
  + **Refine Security:** Regularly audit roles and permissions to ensure the principle of least privilege is maintained. Implement advanced security features like Dynamic Data Masking and Row-Access Policies for sensitive data.

## **Appendix**

### **Appendix 1: Snowflake vs. Redshift Architecture**

| Feature | Amazon Redshift | Snowflake |
| --- | --- | --- |
| **Architecture** | Tightly coupled compute and storage (MPP) | Decoupled compute, storage, and cloud services (Multi-cluster, Shared Data) |
| **Storage** | Managed columnar storage on local SSDs attached to nodes | Centralized object storage (e.g., S3) with automatic micro-partitioning |
| **Compute** | Fixed-size cluster of nodes (Leader + Compute Nodes) | Elastic, on-demand virtual warehouses (compute clusters) |
| **Concurrency** | Limited by cluster size; queries can queue | High concurrency via multi-cluster warehouses that spin up automatically |
| **Scaling** | Scale by adding nodes (takes minutes to hours, involves data redistribution) | Instantly scale compute up/down/out (seconds); storage scales automatically |
| **Maintenance** | Requires manual VACUUM and ANALYZE commands | Fully managed; maintenance tasks are automated and run in the background |

### **Appendix 2: Data Type Mappings**

| Amazon Redshift | Snowflake | Notes |
| --- | --- | --- |
| SMALLINT | SMALLINT / NUMBER(5,0) |  |
| INTEGER | INTEGER / NUMBER(10,0) |  |
| BIGINT | BIGINT / NUMBER(19,0) |  |
| DECIMAL(p,s) / NUMERIC(p,s) | NUMBER(p,s) |  |
| REAL / FLOAT4 | FLOAT |  |
| DOUBLE PRECISION / FLOAT8 | FLOAT |  |
| BOOLEAN | BOOLEAN |  |
| CHAR(n) | CHAR(n) / VARCHAR(n) | Snowflake pads CHAR with spaces; VARCHAR is often preferred. |
| VARCHAR(n) | VARCHAR(n) | Max length in Snowflake is 16MB. |
| DATE | DATE |  |
| TIMESTAMP | TIMESTAMP_NTZ | Snowflake separates timestamps with and without time zones. |
| TIMESTAMPTZ | TIMESTAMP_TZ |  |
| GEOMETRY | GEOGRAPHY / GEOMETRY | Snowflake has native support for geospatial data. |
| SUPER | VARIANT | For semi-structured data (JSON). |

### **Appendix 3: SQL & Function Differences**

| Amazon Redshift | Snowflake | Notes |
| --- | --- | --- |
| GETDATE() | CURRENT_TIMESTAMP() | Snowflake has several functions for current date/time. |
| SYSDATE | CURRENT_TIMESTAMP() | SYSDATE is an alias for GETDATE in Redshift. |
| LISTAGG(expr, delim) | LISTAGG(expr, delim) | Syntax is similar but ordering behavior can differ. |
| NVL(expr1, expr2) | NVL(expr1, expr2) / IFNULL(expr1, expr2) | Functionality is identical. |
| DECODE(expr, search, result…) | DECODE(expr, search, result…) | Supported in both. CASE statements are more standard. |
| DATEDIFF(part, start, end) | DATEDIFF(part, start, end) | Supported, but date/time parts may have different names (e.g., yr vs year). |
| DATEADD(part, num, date) | DATEADD(part, num, date) | Supported, but date/time parts may have different names. |
| **Stored Procedures** | PL/pgSQL | Snowflake Scripting (SQL), JavaScript, Python, Java |
| **DDL Clauses** | DISTKEY, SORTKEY, ENCODE | None. Replaced by automatic micro-partitioning and optional Clustering Keys. |
| **Maintenance** | VACUUM, ANALYZE | None. Automated background services handle maintenance. |
| **Data Loading** | UNLOAD, COPY | COPY INTO, Snowpipe |

---
title: ANALYTIC
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/analytic.md
section: Migrations
---

# ANALYTIC

In this section, you will find the documentation for the translation reference of Analytic Language Elements.

## EXPLAIN

Translation specification for the EXPLAIN clause.

As per Teradata’s [documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/QueryGridTM-Installation-and-User-Guide-3.08/Configuring-and-Using-Links/Using-Links/Using-a-Teradata-to-TargetConnector-Link/SQL-Command-Reference-for-the-Teradata-Initiator-Connector/EXPLAIN), the EXPLAIN clause produces a step-by-step execution plan, which is a textual report that breaks down the query’s execution into a series of steps.

The syntax for this statement is as follows:

```sql
 EXPLAIN [ <SQL_statement> ];
```

### Query

```xml
 EXPLAIN SELECT * FROM table_1
```

#### Result

| Explanation |
| --- |
| 1. First, we lock DEMO_USER.table_3 in TD_MAP1 for read on a reserved    RowHash to prevent global deadlock. 2. Next, we lock DEMO_USER.table_3 in TD_MAP1 for read. 3. We do an all-AMPs RETRIEVE step in TD_MAP1 from DEMO_USER.table_3    by way of an all-rows scan with no residual conditions into Spool    1 (group_amps), which is built locally on the AMPs. The size of    Spool 1 is estimated with high confidence to be 1 row (32 bytes).    The estimated time for this step is 0.01 seconds. 4. Finally, we send out an END TRANSACTION step to all AMPs involved    in processing the request. |

**Snowflake**

##### Query

```python
    EXPLAIN SELECT * FROM table_1
```

##### Result

| ID | OPERATION | OBJECTS | SCHEDULE | PROJECTION | EXPRESSIONS |
| --- | --- | --- | --- | --- | --- |
| 0 | ResultFinalize |  | 3 | [1] |  | |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | | 1 | Exchange (SINGLE) |  |  |  |  | |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  |  | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | | 2 | ResultWorker |  | 2 | [1] |  | |  |  |  |  |  |  |  |  |  |  |  |  |  |  | | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | | 3 | Projection |  | 1 | [1] |  | |  |  |  |  |  |  |  | | --- | --- | --- | --- | --- | --- | --- | | 4 | RowGenerator |  | 0 | [] |  |  | | | | |

As you can see from the results, EXPLAIN in Teradata and Snowflake have the same goal: to provide an explanation of the steps that will be performed when a query is executed. However, Teradata uses a more verbose explanation compared to Snowflake, which only shows the name of each step to be executed.

### Related EWIs

No related EWIs.

---
title: Azure Synapse to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/azuresynapse.md
section: Migrations
---

# **Azure Synapse to Snowflake Migration Guide**

## **Snowflake Migration Framework**

A typical Azure Synapse-to-Snowflake migration can be broken down into nine key phases. This guide provides a comprehensive framework to navigate the technical and strategic challenges involved, ensuring a smooth transition from Azure’s analytics platform to Snowflake’s cloud data platform.

## **Migration Phases**

### **Phase 1: Planning and Design**

This initial phase is critical for establishing the foundation of a successful migration. Migrating from Azure Synapse requires a clear understanding of its integrated components and a thorough plan to align stakeholders, define scope, and prevent budget overruns.

**Your Actionable Steps:**

* **Conduct a Thorough Assessment of Your Synapse Environment:**

  + **Inventory & Analyze:** Catalog all objects within your Synapse workspace, including dedicated SQL pool tables, serverless SQL pool views, schemas, T-SQL stored procedures, functions, and views. Use Synapse’s system views (e.g., sys.tables, sys.procedures) to gather metadata.
  + **Analyze Workloads:** Use Azure Monitor and Synapse’s Dynamic Management Views (DMVs) to identify query patterns, user concurrency, resource utilization (DWUs), and performance bottlenecks. This data is crucial for designing your Snowflake virtual warehouse strategy.
  + **Identify Dependencies:** Map all upstream data sources, especially Azure Data Factory (ADF) pipelines, and downstream consumers like Power BI reports, Azure Machine Learning models, and other applications.
* **Define the Migration Scope and Strategy:**

  + **Prioritize Workloads:** Classify workloads by business impact and technical complexity. Start with a high-impact, low-complexity workload (e.g., a specific data mart) to demonstrate value and build momentum.
  + **Choose a Migration Approach:** Decide between a “lift and shift” for a faster migration or a re-architecture approach to modernize data models and pipelines.
* **Develop the Project Plan:**

  + **Establish a Team:** Create a migration team with clear roles (Project Manager, Data Engineer, Synapse/SQL DBA, Snowflake Architect, Security Admin, Business Analyst).
  + **Create a Timeline:** Define realistic timelines and milestones for each of the nine phases.
  + **Define Success Metrics:** Establish clear KPIs to measure success, such as cost reduction, query performance improvement, and user satisfaction.

### **Phase 2: Environment and Security**

With a solid plan, the next step is to prepare the Snowflake environment and translate Azure’s security model. Hosting Snowflake on Azure is highly recommended to simplify data transfer and network integration.

**Your Actionable Steps:**

* **Set Up Your Snowflake Account:**

  + **Choose Edition and Cloud Provider:** Select the Snowflake edition (e.g., Standard, Enterprise, Business Critical) that meets your needs. **Choose Azure as the cloud provider** and select the same region as your Azure Data Lake Storage (ADLS Gen2) to minimize data transfer costs and latency.
  + **Design a Warehouse Strategy:** Based on the workload analysis from Phase 1, create an initial set of virtual warehouses. Isolate different workloads (e.g., WH_LOADING, WH_TRANSFORM, WH_BI_ANALYTICS) to prevent resource contention. Start with T-shirt sizes (e.g., X-Small, Small) and plan to resize them based on performance testing.
* **Implement the Security Model:**

  + **Map Azure AD Principals to Snowflake Roles:** Translate Azure Active Directory (AAD) users and groups into Snowflake’s hierarchical Role-Based Access Control (RBAC) model. Create a hierarchy of functional roles (SYSADMIN, SECURITYADMIN) and access roles (BI_READ_ONLY, ETL_READ_WRITE).
  + **Configure Network Policies and Authentication:** Set up network policies to restrict access to trusted IP addresses via Azure Private Link for a secure connection. Configure SSO by setting up Snowflake as an Enterprise Application in Azure AD.

### **Phase 3: Database Code Conversion**

This phase involves converting Synapse’s T-SQL based DDL, DML, and procedural code to be compatible with Snowflake. Automation tools can accelerate this process, but manual review is essential.

**Your Actionable Steps:**

* **Convert DDL (Data Definition Language):**

  + **Tables and Views:** Extract CREATE TABLE and CREATE VIEW statements from Synapse. Convert Synapse-specific data types to their Snowflake equivalents (see Appendix 2).
  + **Remove Synapse-Specific Clauses:** Eliminate Synapse-specific physical distribution clauses like DISTRIBUTION (e.g., ROUND_ROBIN, HASH) and indexing strategies like CLUSTERED COLUMNSTORE INDEX. Snowflake manages data distribution and storage automatically.
  + **Re-implement Constraints:** Snowflake only enforces NOT NULL constraints. PRIMARY KEY and UNIQUE constraints are informational. All other data integrity logic must be moved into your ETL/ELT processes.
* **Convert DML (Data Manipulation Language) and Procedural Code:**

  + **Rewrite T-SQL Stored Procedures:** Synapse’s T-SQL stored procedures must be rewritten into a language supported by Snowflake, such as Snowflake Scripting (SQL), JavaScript, or Python.
  + **Translate SQL Functions:** Map Synapse/T-SQL specific functions to their Snowflake counterparts (e.g., GETDATE() becomes CURRENT_TIMESTAMP(), ISNULL() becomes IFNULL()). See Appendix 3 for common mappings.

### **Phase 4: Data Migration**

This phase focuses on the physical movement of historical data from your Synapse SQL pools to Snowflake tables. The most efficient method leverages Azure Data Lake Storage (ADLS Gen2) as an intermediate staging area.

**Your Actionable Steps:**

* **Unload Data from Synapse to ADLS Gen2:**

  + Use the CREATE EXTERNAL TABLE AS SELECT (CETAS) command in Synapse to export data from tables into a designated container in your ADLS Gen2 account.
  + Format data as Parquet or compressed CSV for optimal loading performance into Snowflake.
* **Load Data from ADLS Gen2 into Snowflake:**

  + **Create an External Stage:** In Snowflake, create a storage integration object to securely connect to ADLS Gen2, then create an external stage that points to the container with your unloaded data.
  + **Use the COPY INTO Command:** Use Snowflake’s COPY INTO <table> command to load the data from the ADLS stage into the target Snowflake tables.
  + **Leverage a Sized-Up Warehouse:** Use a dedicated, larger virtual warehouse for the initial data load to accelerate the process, then scale it down or suspend it afterward.

### **Phase 5: Data Ingestion**

Once the historical data is migrated, you must re-engineer your ongoing data ingestion pipelines, most commonly in Azure Data Factory, to feed data into Snowflake.

**Your Actionable Steps:**

* **Migrate Azure Data Factory (ADF) Pipelines:**

  + In your ADF pipelines, replace Synapse datasets and activities with their Snowflake equivalents. Use Snowflake’s native connector in ADF for both source and sink activities.
  + Update any Lookup or Script activities to use Snowflake’s SQL dialect.
* **Implement Continuous Ingestion with Snowpipe:**

  + For continuous data streams landing in ADLS Gen2, configure Snowpipe. Snowpipe automatically and efficiently loads new data files into Snowflake tables as they arrive, providing a near-real-time ingestion solution. This can be triggered by Azure Event Grid notifications.
* **Utilize the Snowflake Ecosystem:**

  + Explore Snowflake’s native connectors for platforms like Kafka and Spark to simplify direct data streaming.

### **Phase 6: Reporting and Analytics**

This phase involves redirecting all downstream applications, particularly Power BI, to query data from Snowflake.

**Your Actionable Steps:**

* **Update Connection Drivers:** Ensure Power BI Desktop and the On-premises data gateway have the latest Snowflake drivers.
* **Redirect Power BI Reports:**

  + In Power BI, edit the data source for each report, switching the connection from Azure Synapse to Snowflake. Snowflake’s native Power BI connector is certified and highly recommended.
  + Test all critical reports and dashboards. Pay close attention to reports using DirectQuery, as performance characteristics will change.
* **Review and Optimize Queries:**

  + Some reports may contain native T-SQL queries. These must be refactored to use Snowflake’s SQL dialect. Use the Query Profile tool in Snowflake and the Performance Analyzer in Power BI to optimize slow-running reports.

### **Phase 7: Data Validation and Testing**

Rigorous testing is essential to build business confidence in the new platform and ensure data integrity and performance meet expectations.

**Your Actionable Steps:**

* **Perform Data Validation:**

  + **Row Counts:** Compare row counts between source tables in Synapse and target tables in Snowflake.
  + **Cell-Level Validation:** For critical tables, perform a deeper validation by comparing aggregated values (SUM, AVG, MIN, MAX) on key columns.
* **Conduct Query and Performance Testing:**

  + **Benchmark Queries:** Execute a representative set of queries against both Synapse and Snowflake and compare results and performance.
  + **BI Tool Performance:** Test the load times and interactivity of key Power BI dashboards connected to Snowflake.
* **User Acceptance Testing (UAT):**

  + Involve business users to validate their reports and perform their daily tasks using the new Snowflake environment.

### **Phase 8: Deployment**

Deployment is the final cutover from Azure Synapse to Snowflake. This process should be carefully managed to minimize disruption to business operations.

**Your Actionable Steps:**

* **Develop a Cutover Plan:**

  + Define the sequence of events for the cutover. This includes pausing ADF pipelines pointing to Synapse, performing a final data sync, redirecting all connections, and validating system health.
* **Execute the Final Data Sync:**

  + Perform one last incremental data load to capture any data changes that occurred during the testing phase.
* **Go Live:**

  + Switch all production data pipelines and user connections from Synapse to Snowflake.
  + Keep the Synapse environment available (but paused, if possible) for a short period as a fallback before decommissioning.
* **Decommission Synapse:**

  + Once the Snowflake environment is stable and validated in production, you can decommission your Synapse SQL pools to stop incurring costs.

### **Phase 9: Optimize and Run**

This final phase is an ongoing process of managing performance, cost, and governance in your new Snowflake environment.

**Your Actionable Steps:**

* **Implement Performance and Cost Optimization:**

  + **Right-Size Warehouses:** Continuously monitor workload performance and adjust virtual warehouse sizes. This replaces the concept of scaling Synapse DWUs.
  + **Set Aggressive Auto-Suspend Policies:** Set the auto-suspend timeout for all warehouses to 60 seconds to avoid paying for idle compute time.
  + **Use Clustering Keys:** For very large tables (multi-terabyte), define clustering keys to improve the performance of highly filtered queries.
* **Establish Long-Term FinOps and Governance:**

  + **Monitor Costs:** Use Snowflake’s ACCOUNT_USAGE schema and resource monitors to track credit consumption.
  + **Refine Security:** Regularly audit roles and permissions. Implement advanced security features like Dynamic Data Masking and Row-Access Policies for sensitive data.

## **Appendix**

### **Appendix 1: Snowflake vs. Azure Synapse Architecture**

| Feature | Azure Synapse Analytics | Snowflake |
| --- | --- | --- |
| **Architecture** | Control Node + Compute Nodes (MPP for Dedicated Pools). Decoupled storage but coupled compute within a pool. | Decoupled compute, storage, and cloud services (Multi-cluster, Shared Data). |
| **Storage** | Data stored in Azure Data Lake Storage, managed by the SQL pool. | Centralized object storage (Azure Blob) with automatic micro-partitioning. |
| **Compute** | Provisioned Dedicated SQL Pools (scaled by DWUs) or Serverless SQL Pools (pay-per-query). | Elastic, on-demand virtual warehouses (compute clusters). |
| **Concurrency** | Limited by DWU size and max concurrent query slots (128) in a dedicated pool. | High concurrency via multi-cluster warehouses that spin up automatically. |
| **Scaling** | Scale dedicated pools by changing DWUs (can take several minutes). Can be paused. | Instantly scale compute up/down/out (seconds); storage scales automatically. |
| **Maintenance** | Requires manual maintenance of statistics. Indexing strategies need management. | Fully managed; maintenance tasks like statistics and compaction are automated. |

### **Appendix 2: Data Type Mappings**

| Azure Synapse (T-SQL) | Snowflake | Notes |
| --- | --- | --- |
| bigint | BIGINT / NUMBER(19,0) |  |
| int | INT / NUMBER(10,0) |  |
| smallint | SMALLINT / NUMBER(5,0) |  |
| tinyint | TINYINT / NUMBER(3,0) |  |
| bit | BOOLEAN |  |
| decimal(p,s) / numeric(p,s) | NUMBER(p,s) |  |
| money / smallmoney | NUMBER(19,4) / NUMBER(10,4) | Best practice is to map to NUMBER. |
| float / real | FLOAT |  |
| date | DATE |  |
| datetime / datetime2 | DATETIME / TIMESTAMP_NTZ | TIMESTAMP_NTZ is often the preferred target. |
| datetimeoffset | TIMESTAMP_TZ |  |
| smalldatetime | DATETIME / TIMESTAMP_NTZ |  |
| time | TIME |  |
| char(n) / varchar(n) | VARCHAR(n) |  |
| nchar(n) / nvarchar(n) | VARCHAR(n) | Snowflake uses UTF-8 by default, so N prefix types are not needed. |
| text / ntext | VARCHAR | Deprecated types; map to VARCHAR. |
| binary(n) / varbinary(n) | BINARY(n) |  |
| uniqueidentifier | VARCHAR(36) | Store as a string and use UUID_STRING() if needed. |

### **Appendix 3: SQL & Function Differences**

| Azure Synapse (T-SQL) | Snowflake | Notes |
| --- | --- | --- |
| GETDATE() | CURRENT_TIMESTAMP() | Snowflake has several functions for current date/time. |
| ISNULL(expr1, expr2) | IFNULL(expr1, expr2) | COALESCE is the ANSI standard and works in both. |
| TOP (n) | LIMIT n | Snowflake uses LIMIT clause at the end of the query. |
| IIF(bool, true, false) | IFF(bool, true, false) | Functionality is identical, name is slightly different. |
| DATEADD(part, num, date) | DATEADD(part, num, date) | Supported, but date/time parts may have different names (e.g., dd vs day). |
| DATEDIFF(part, start, end) | DATEDIFF(part, start, end) | Supported, but date/time parts may have different names. |
| STRING_SPLIT | SPLIT_TO_TABLE / SPLIT | Snowflake has more powerful functions for splitting strings. |
| **Procedural Language** | T-SQL (Stored Procedures) | Snowflake Scripting, JavaScript, Java, Python |
| **DDL Clauses** | DISTRIBUTION, CLUSTERED COLUMNSTORE INDEX | None. Replaced by automatic micro-partitioning and optional Clustering Keys. |
| **Temp Tables** | #temptable | CREATE TEMPORARY TABLE |
| **Transactions** | BEGIN TRAN, COMMIT, ROLLBACK | BEGIN, COMMIT, ROLLBACK |
| **Error Handling** | TRY…CATCH | BEGIN…EXCEPTION…END |

---
title: Configuration File Examples
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/CONFIGURATION_EXAMPLES.md
section: Migrations
---

# Configuration File Examples

This document provides ready-to-use configuration examples for various validation scenarios. Copy and adapt these examples for your specific use case.

## Table of Contents

* SQL Server Examples
* Teradata Examples
* Redshift Examples
* Snowflake Examples
* Scenario-Based Examples
* View Validation Examples

---

## SQL Server Examples

### Example 1: Minimal SQL Server Configuration

Perfect for quick testing or simple validations.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./validation_output

source_connection:
  mode: credentials
  host: localhost
  port: 1433
  username: sa
  password: YourPassword123
  database: TestDB

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables:
  - fully_qualified_name: TestDB.dbo.Customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 2: Production SQL Server with SSL/TLS

Secure production setup with proper encryption settings.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: /data/validation/production
max_threads: 16

source_connection:
  mode: credentials
  host: sqlserver-prod.company.com
  port: 1433
  username: validation_user
  password: SecurePassword123!
  database: PRODUCTION_DB
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_production

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 250

comparison_configuration:
  tolerance: 0.01

logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

database_mappings:
  PRODUCTION_DB: PROD_SNOWFLAKE

schema_mappings:
  dbo: PUBLIC

tables:
  - fully_qualified_name: PRODUCTION_DB.dbo.Orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    chunk_number: 20

  - fully_qualified_name: PRODUCTION_DB.dbo.Customers
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - credit_card_number
    index_column_list:
      - customer_id

  - fully_qualified_name: PRODUCTION_DB.dbo.Products
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - product_id
      - product_name
      - price
      - category
    where_clause: "is_active = 1"
    target_where_clause: "is_active = 1"
```

### Example 3: SQL Server Incremental Validation

Validate only recent changes using date filters.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./incremental_validation
max_threads: auto

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: etl_user
  password: EtlPassword123
  database: DataWarehouse

target_connection:
  mode: name
  name: snowflake_dw

validation_configuration:
  schema_validation: false
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.001

tables:
  - fully_qualified_name: DataWarehouse.dbo.FactSales
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_id
    where_clause: "transaction_date >= DATEADD(day, -7, GETDATE())"
    target_where_clause: "transaction_date >= DATEADD(day, -7, CURRENT_TIMESTAMP)"
    chunk_number: 10

  - fully_qualified_name: DataWarehouse.dbo.DimCustomer
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "modified_date >= DATEADD(day, -7, GETDATE())"
    target_where_clause: "modified_date >= DATEADD(day, -7, CURRENT_TIMESTAMP)"
```

### Example 4: SQL Server with Column Mappings

Handle renamed columns during migration.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./validation_with_mappings
max_threads: 8

source_connection:
  mode: credentials
  host: legacy-sql.company.com
  port: 1433
  username: migration_user
  password: MigrationPass123
  database: LegacyDB

target_connection:
  mode: name
  name: snowflake_modernized

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true

tables:
  - fully_qualified_name: LegacyDB.dbo.CustomerMaster
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - cust_id
      - cust_name
      - cust_email
      - cust_phone
      - addr_line1
      - addr_line2
      - addr_city
      - addr_state
      - addr_zip
    index_column_list:
      - cust_id
    column_mappings:
      cust_id: customer_id
      cust_name: customer_name
      cust_email: email_address
      cust_phone: phone_number
      addr_line1: address_line_1
      addr_line2: address_line_2
      addr_city: city
      addr_state: state
      addr_zip: postal_code
```

---

## Teradata Examples

### Example 5: Basic Teradata Configuration

Simple Teradata to Snowflake validation.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./teradata_validation
target_database: SNOWFLAKE_DB
max_threads: auto

source_connection:
  mode: credentials
  host: teradata.company.com
  username: td_user
  password: TeradataPass123
  database: PROD_DB

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables:
  - fully_qualified_name: PROD_DB.sales_data
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 6: Teradata Large-Scale Migration

Enterprise-scale Teradata migration validation.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: /opt/validation/teradata_migration
target_database: ENTERPRISE_DW
max_threads: 32

source_connection:
  mode: credentials
  host: teradata-prod.company.com
  username: validation_service
  password: SecureTdPassword!123
  database: ENTERPRISE_TD

target_connection:
  mode: name
  name: snowflake_enterprise

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 500
  exclude_metrics: false

comparison_configuration:
  tolerance: 0.005

logging_configuration:
  level: INFO
  console_level: ERROR
  file_level: DEBUG

schema_mappings:
  ENTERPRISE_TD: PUBLIC

tables:
  # Large fact table - high chunking
  - fully_qualified_name: ENTERPRISE_TD.fact_transactions
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_key
    chunk_number: 100
    max_failed_rows_number: 1000

  # Dimension table with exclusions
  - fully_qualified_name: ENTERPRISE_TD.dim_customer
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - tax_id
      - bank_account
    index_column_list:
      - customer_key
    chunk_number: 20

  # Filtered validation for current year only
  - fully_qualified_name: ENTERPRISE_TD.fact_sales
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - sale_key
    where_clause: "sale_date >= DATE '2024-01-01'"
    target_where_clause: "sale_date >= DATE '2024-01-01'"
    chunk_number: 50
```

### Example 7: Teradata Multi-Schema Validation

Validate multiple schemas with different settings.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./multi_schema_validation
target_database: MULTI_SCHEMA_DW
max_threads: 16

source_connection:
  mode: credentials
  host: teradata.company.com
  username: schema_validator
  password: ValidatorPass123
  database: DBC

target_connection:
  mode: name
  name: snowflake_multi_schema

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200

comparison_configuration:
  tolerance: 0.01

schema_mappings:
  SALES_SCHEMA: SALES
  FINANCE_SCHEMA: FINANCE
  HR_SCHEMA: HUMAN_RESOURCES

tables:
  # Sales schema tables
  - fully_qualified_name: SALES_SCHEMA.orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id

  - fully_qualified_name: SALES_SCHEMA.order_details
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
      - line_number

  # Finance schema tables
  - fully_qualified_name: FINANCE_SCHEMA.invoices
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - invoice_id
    chunk_number: 30

  # HR schema tables - exclude sensitive data
  - fully_qualified_name: HR_SCHEMA.employees
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - salary
      - bank_account
      - emergency_contact
    index_column_list:
      - employee_id
```

---

## Redshift Examples

### Example 8: Basic Redshift Configuration

Simple Redshift to Snowflake validation.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./redshift_validation
max_threads: auto

source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: redshift_user
  password: RedshiftPass123
  database: analytics

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables:
  - fully_qualified_name: public.events
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 9: Redshift Data Lake Migration

Validate Redshift data lake migration to Snowflake.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: /data/validation/redshift_datalake
max_threads: 24

source_connection:
  mode: credentials
  host: datalake-cluster.us-west-2.redshift.amazonaws.com
  port: 5439
  username: datalake_validator
  password: SecureRedshiftPass!123
  database: datalake

target_connection:
  mode: name
  name: snowflake_datalake

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 500

comparison_configuration:
  tolerance: 0.02

logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

database_mappings:
  datalake: DATALAKE_PROD

schema_mappings:
  public: PUBLIC
  staging: STAGING
  analytics: ANALYTICS

tables:
  # Raw data staging
  - fully_qualified_name: staging.raw_events
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - event_id
    chunk_number: 80
    max_failed_rows_number: 1000

  # Analytics tables
  - fully_qualified_name: analytics.user_sessions
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - session_id
    where_clause: "session_date >= CURRENT_DATE - 30"
    target_where_clause: "session_date >= CURRENT_DATE - 30"
    chunk_number: 40

  - fully_qualified_name: analytics.aggregated_metrics
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - metric_id
      - date_key
    chunk_number: 20

  # Public schema - exclude system columns
  - fully_qualified_name: public.customer_360
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - _sys_created_at
      - _sys_modified_at
      - _sys_user_id
    index_column_list:
      - customer_id
    chunk_number: 50
```

### Example 10: Redshift with Complex Filtering

Advanced filtering and column selection for Redshift.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./complex_validation
max_threads: 16

source_connection:
  mode: credentials
  host: analytics-cluster.region.redshift.amazonaws.com
  port: 5439
  username: validator
  password: ComplexPass123!
  database: analytics_db

target_connection:
  mode: name
  name: snowflake_analytics

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.01

tables:
  # Complex WHERE clause with multiple conditions
  - fully_qualified_name: public.transactions
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - transaction_id
      - customer_id
      - amount
      - transaction_date
      - status
      - payment_method
    index_column_list:
      - transaction_id
    where_clause: "status IN ('completed', 'settled') AND amount > 100 AND transaction_date >= '2024-01-01' AND payment_method != 'test'"
    target_where_clause: "status IN ('completed', 'settled') AND amount > 100 AND transaction_date >= '2024-01-01' AND payment_method != 'test'"
    chunk_number: 30

  # Date-based partitioned validation
  - fully_qualified_name: public.daily_metrics
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - metric_date
      - metric_id
    where_clause: "metric_date >= DATE_TRUNC('month', CURRENT_DATE)"
    target_where_clause: "metric_date >= DATE_TRUNC('month', CURRENT_DATE)"
    chunk_number: 10

  # Selective column validation with mappings
  - fully_qualified_name: public.legacy_customers
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - cust_no
      - full_name
      - email_addr
      - phone_num
      - signup_dt
    index_column_list:
      - cust_no
    column_mappings:
      cust_no: customer_number
      full_name: customer_name
      email_addr: email
      phone_num: phone
      signup_dt: signup_date
```

---

## Snowflake Examples

### Example 10.1: Basic Snowflake-to-Snowflake Configuration

Simple Snowflake-to-Snowflake validation for cross-account or cross-database migration.

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./snowflake_validation
max_threads: auto

source_connection:
  mode: name
  name: source_connection

target_connection:
  mode: name
  name: target_connection

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables:
  - fully_qualified_name: SOURCE_DB.PUBLIC.CUSTOMERS
    target_database: TARGET_DB
    target_schema: PUBLIC
    target_name: CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - CUSTOMER_ID
```

### Example 10.2: Cross-Account Migration Validation

Enterprise-scale Snowflake cross-account migration validation.

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: /opt/validation/cross_account
max_threads: 24

source_connection:
  mode: name
  name: account_a_connection

target_connection:
  mode: name
  name: account_b_connection

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 300

comparison_configuration:
  tolerance: 0.01

logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

database_mappings:
  ANALYTICS_A: ANALYTICS_B

schema_mappings:
  RAW: RAW_DATA
  STAGING: STAGING_DATA

tables:
  # Large fact table with chunking
  - fully_qualified_name: ANALYTICS_A.RAW.FACT_TRANSACTIONS
    target_database: ANALYTICS_B
    target_schema: RAW_DATA
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - TRANSACTION_ID
    chunk_number: 50
    max_failed_rows_number: 500

  # Dimension table with exclusions
  - fully_qualified_name: ANALYTICS_A.RAW.DIM_CUSTOMER
    target_database: ANALYTICS_B
    target_schema: RAW_DATA
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - INTERNAL_SCORE
      - RISK_RATING
    where_clause: "STATUS = 'ACTIVE'"
    target_where_clause: "STATUS = 'ACTIVE'"
    column_mappings:
      CUST_KEY: CUSTOMER_KEY
      CUST_NAME: CUSTOMER_NAME
```

### Example 10.3: Cross-Region Replication Validation

Validate data replication between Snowflake regions.

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: /data/validation/region_replication
max_threads: 16

source_connection:
  mode: name
  name: us_east_connection

target_connection:
  mode: name
  name: eu_west_connection

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 150

comparison_configuration:
  tolerance: 0.005

tables:
  # Recent transactions
  - fully_qualified_name: GLOBAL_DB.REPLICATION.TRANSACTIONS
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - TRANSACTION_ID
      - CUSTOMER_ID
      - AMOUNT
      - TRANSACTION_DATE
      - STATUS
    index_column_list:
      - TRANSACTION_ID
    where_clause: "TRANSACTION_DATE >= DATEADD(day, -7, CURRENT_DATE())"
    target_where_clause: "TRANSACTION_DATE >= DATEADD(day, -7, CURRENT_DATE())"
    chunk_number: 30

  # Reference table
  - fully_qualified_name: GLOBAL_DB.REPLICATION.CURRENCIES
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - CURRENCY_CODE
```

### Example 10.4: Database Copy Validation

Validate a database copy within the same Snowflake account.

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./db_copy_validation
max_threads: auto

source_connection:
  mode: name
  name: prod_connection

target_connection:
  mode: name
  name: prod_connection

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

comparison_configuration:
  tolerance: 0.001

tables:
  - fully_qualified_name: ORIGINAL_DB.PUBLIC.USERS
    target_database: COPIED_DB
    target_schema: PUBLIC
    target_name: USERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - USER_ID

  - fully_qualified_name: ORIGINAL_DB.PUBLIC.EVENTS
    target_database: COPIED_DB
    target_schema: PUBLIC
    target_name: EVENTS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - EVENT_ID
    chunk_number: 20
```

---

## Scenario-Based Examples

### Example 11: Development Environment - Fast Validation

Quick validation for development with minimal overhead.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./dev_validation
max_threads: 4

source_connection:
  mode: credentials
  host: localhost
  port: 1433
  username: dev_user
  password: DevPass123
  database: DevDB
  trust_server_certificate: "yes"
  encrypt: "no"

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false  # Skip for speed

comparison_configuration:
  tolerance: 0.05  # More lenient

logging_configuration:
  level: WARNING  # Less verbose

tables:
  - fully_qualified_name: DevDB.dbo.TestTable1
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  - fully_qualified_name: DevDB.dbo.TestTable2
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "id <= 1000"  # Limit rows for speed
    target_where_clause: "id <= 1000"
```

### Example 12: Staging Environment - Comprehensive Testing

Thorough validation for staging environment.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: /staging/validation
max_threads: 12

source_connection:
  mode: credentials
  host: sqlserver-staging.company.com
  port: 1433
  username: staging_validator
  password: StagingPass123!
  database: STAGING_DB
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_staging

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200

comparison_configuration:
  tolerance: 0.01

logging_configuration:
  level: INFO
  console_level: INFO
  file_level: DEBUG

tables:
  - fully_qualified_name: STAGING_DB.dbo.Orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    chunk_number: 15

  - fully_qualified_name: STAGING_DB.dbo.Customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id
    chunk_number: 10
```

### Example 13: Production - Maximum Performance

Optimized for large-scale production validation.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: /prod/validation
target_database: PROD_SNOWFLAKE
max_threads: 32  # Maximum parallelization

source_connection:
  mode: credentials
  host: teradata-prod.company.com
  username: prod_validator
  password: SecureProdPass!123
  database: PROD_TD

target_connection:
  mode: name
  name: snowflake_prod

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 1000
  exclude_metrics: false

comparison_configuration:
  tolerance: 0.001  # Strict tolerance

logging_configuration:
  level: INFO
  console_level: ERROR  # Minimal console output
  file_level: DEBUG  # Detailed file logging

tables:
  # Massive fact table - heavy chunking
  - fully_qualified_name: PROD_TD.fact_transactions
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_key
    chunk_number: 200  # Maximum chunking
    max_failed_rows_number: 5000

  # Other tables...
  - fully_qualified_name: PROD_TD.fact_sales
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - sale_key
    chunk_number: 150
```

### Example 14: PII-Compliant Validation

Exclude sensitive personally identifiable information.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./pii_compliant_validation
max_threads: auto

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: compliance_validator
  password: CompliancePass123!
  database: CustomerDB

target_connection:
  mode: name
  name: snowflake_customer

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.01

tables:
  - fully_qualified_name: CustomerDB.dbo.Customers
    use_column_selection_as_exclude_list: true
    column_selection_list:
      # Exclude all PII columns
      - ssn
      - tax_id
      - date_of_birth
      - drivers_license
      - passport_number
      - credit_card_number
      - bank_account_number
      - email_address
      - phone_number
      - home_address
      - mailing_address
    index_column_list:
      - customer_id

  - fully_qualified_name: CustomerDB.dbo.Transactions
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - credit_card_last4
      - account_number
      - routing_number
    index_column_list:
      - transaction_id
```

### Example 15: Migration Cutover Validation

Final validation before production cutover.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: /cutover/validation
max_threads: 32

source_connection:
  mode: credentials
  host: redshift-prod.amazonaws.com
  port: 5439
  username: cutover_validator
  password: CutoverPass123!
  database: production

target_connection:
  mode: name
  name: snowflake_production_new

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 0  # Zero tolerance for cutover

comparison_configuration:
  tolerance: 0.0001  # Extremely strict

logging_configuration:
  level: DEBUG  # Maximum detail
  console_level: INFO
  file_level: DEBUG

# Validate ALL tables
tables:
  - fully_qualified_name: public.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id
    chunk_number: 50

  - fully_qualified_name: public.orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    chunk_number: 100

  - fully_qualified_name: public.order_items
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
      - item_id
    chunk_number: 150

  - fully_qualified_name: public.products
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - product_id
    chunk_number: 20

  - fully_qualified_name: public.inventory
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - inventory_id
    chunk_number: 30
```

### Example 16: Continuous Validation - Daily Incremental

Daily validation of incremental loads.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: /daily/validation
max_threads: 16

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: daily_validator
  password: DailyPass123!
  database: ETL_DB

target_connection:
  mode: name
  name: snowflake_daily

validation_configuration:
  schema_validation: false  # Skip schema check for daily runs
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.01

logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: INFO

tables:
  # Validate only yesterday's data
  - fully_qualified_name: ETL_DB.dbo.DailyTransactions
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_id
    where_clause: "CAST(created_date AS DATE) = CAST(DATEADD(day, -1, GETDATE()) AS DATE)"
    target_where_clause: "CAST(created_date AS DATE) = CAST(DATEADD(day, -1, CURRENT_TIMESTAMP) AS DATE)"
    chunk_number: 10

  - fully_qualified_name: ETL_DB.dbo.DailyOrders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    where_clause: "CAST(order_date AS DATE) = CAST(DATEADD(day, -1, GETDATE()) AS DATE)"
    target_where_clause: "CAST(order_date AS DATE) = CAST(DATEADD(day, -1, CURRENT_TIMESTAMP) AS DATE)"
    chunk_number: 5
```

---

## View Validation Examples

### Example 17: Basic View Validation

Validate database views alongside tables for comprehensive migration verification.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./view_validation
max_threads: auto

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: view_validator
  password: ViewPass123!
  database: ReportingDB
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_reporting

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

views:
  - fully_qualified_name: ReportingDB.dbo.customer_summary
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  - fully_qualified_name: ReportingDB.dbo.sales_by_region
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  - fully_qualified_name: ReportingDB.dbo.monthly_revenue
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 18: Combined Tables and Views Validation

Validate both tables and views in a single configuration.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./combined_validation
max_threads: 16

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: migration_user
  password: MigrationPass123!
  database: AnalyticsDB
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_analytics

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.01

# Base tables
tables:
  - fully_qualified_name: AnalyticsDB.dbo.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: AnalyticsDB.dbo.orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    chunk_number: 20

  - fully_qualified_name: AnalyticsDB.dbo.products
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - product_id

# Derived views
views:
  - fully_qualified_name: AnalyticsDB.dbo.vw_customer_orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id
      - order_id

  - fully_qualified_name: AnalyticsDB.dbo.vw_product_sales
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - product_id
      - product_name
      - total_quantity
      - total_revenue
    index_column_list:
      - product_id

  - fully_qualified_name: AnalyticsDB.dbo.vw_monthly_summary
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "year >= 2024"
    target_where_clause: "year >= 2024"
```

### Example 19: Teradata View Validation

Validate Teradata views migrated to Snowflake.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./teradata_view_validation
target_database: SNOWFLAKE_DW
max_threads: auto

source_connection:
  mode: credentials
  host: teradata.company.com
  username: td_validator
  password: TeradataPass123!
  database: DW_DB

target_connection:
  mode: name
  name: snowflake_dw

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

schema_mappings:
  DW_DB: PUBLIC

views:
  - fully_qualified_name: DW_DB.v_customer_360
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - credit_score
    # Exclude sensitive columns

  - fully_qualified_name: DW_DB.v_sales_dashboard
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - region
      - quarter
      - total_sales
      - order_count
      - avg_order_value

  - fully_qualified_name: DW_DB.v_inventory_status
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 20: Redshift View Validation with Column Mappings

Validate Redshift views with column name changes.

```yaml
source_platform: Redshift
target_platform: Snowflake
output_directory_path: ./redshift_view_validation
max_threads: 12

source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: rs_validator
  password: RedshiftPass123!
  database: analytics

target_connection:
  mode: name
  name: snowflake_analytics

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 50

comparison_configuration:
  tolerance: 0.01

database_mappings:
  analytics: ANALYTICS_PROD

schema_mappings:
  public: PUBLIC
  reports: REPORTS

views:
  - fully_qualified_name: reports.v_user_activity
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - user_id
      - last_login
      - session_count
      - total_duration
    index_column_list:
      - user_id
    column_mappings:
      user_id: USER_ID
      last_login: LAST_LOGIN_DATE
      session_count: SESSIONS
      total_duration: DURATION_MINUTES

  - fully_qualified_name: reports.v_conversion_funnel
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "event_date >= CURRENT_DATE - 30"
    target_where_clause: "event_date >= CURRENT_DATE - 30"

  - fully_qualified_name: public.v_daily_metrics
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - metric_date
      - metric_type
```

### Example 21: View Validation with Different Target Names

Validate views when source and target view names differ.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./renamed_view_validation
max_threads: auto

source_connection:
  mode: credentials
  host: legacy-sql.company.com
  port: 1433
  username: migration_user
  password: MigrationPass123!
  database: LegacyDB
  trust_server_certificate: "yes"
  encrypt: "optional"

target_connection:
  mode: name
  name: snowflake_modernized

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

views:
  # Legacy view renamed in Snowflake
  - fully_qualified_name: LegacyDB.dbo.vw_cust_master
    target_database: MODERN_DB
    target_schema: ANALYTICS
    target_name: CUSTOMER_MASTER_VIEW
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - cust_id
      - cust_name
      - cust_type
      - region
    column_mappings:
      cust_id: CUSTOMER_ID
      cust_name: CUSTOMER_NAME
      cust_type: CUSTOMER_TYPE

  # View with schema change only
  - fully_qualified_name: LegacyDB.dbo.vw_sales_summary
    target_schema: SALES
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  # View with database change only
  - fully_qualified_name: LegacyDB.reports.vw_quarterly_report
    target_database: REPORTING_DB
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 22: Large View Validation with Chunking

Validate large views with chunking for better performance.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./large_view_validation
max_threads: 32

source_connection:
  mode: credentials
  host: sqlserver-prod.company.com
  port: 1433
  username: prod_validator
  password: ProdPass123!
  database: DataWarehouse
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_dw

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 500

comparison_configuration:
  tolerance: 0.005

views:
  # Large aggregated view with chunking
  - fully_qualified_name: DataWarehouse.dbo.vw_transaction_history
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_id
    chunk_number: 50
    max_failed_rows_number: 1000
    where_clause: "transaction_date >= '2024-01-01'"
    target_where_clause: "transaction_date >= '2024-01-01'"

  # Customer analytics view
  - fully_qualified_name: DataWarehouse.dbo.vw_customer_analytics
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_score
      - risk_flag
    index_column_list:
      - customer_id
    chunk_number: 25

  # Time-series metrics view
  - fully_qualified_name: DataWarehouse.dbo.vw_hourly_metrics
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - metric_hour
      - metric_type
      - value
      - count
    index_column_list:
      - metric_hour
      - metric_type
    chunk_number: 100
    where_clause: "metric_hour >= DATEADD(month, -3, GETDATE())"
    target_where_clause: "metric_hour >= DATEADD(month, -3, CURRENT_TIMESTAMP)"
```

---

## Tips for Adapting These Examples

1. **Replace connection details** with your actual database credentials
2. **Update table names** to match your schema
3. **Adjust `max_threads`** based on your system resources
4. **Modify `chunk_number`** based on table sizes
5. **Set appropriate `tolerance`** based on your data characteristics
6. **Customize `where_clause`** for your filtering needs
7. **Add/remove columns** in `column_selection_list` as needed
8. **Update `column_mappings`** if column names differ

## Security Best Practices

* **Never commit** configuration files with real passwords to version control
* Use **environment variables** for sensitive data
* Consider **secret management** tools (AWS Secrets Manager, Azure Key Vault, etc.)
* Use **least privilege** database accounts for validation
* **Encrypt** configuration files containing sensitive information

---
title: Databricks to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/databricks.md
section: Migrations
---

# **Databricks to Snowflake Migration Guide**

## **Snowflake Migration Framework**

A typical Databricks-to-Snowflake migration can be broken into five key steps:

1. **Planning and design** are often overlooked steps in the migration process. The main reason is that companies typically want to show progress quickly, even if they haven’t fully understood the scope of the project. That is why, this phase is critical to understand and prioritize the migration project.
2. **Environment and security** with a plan, a clear timeline, a RACI matrix, and buy-in from all stakeholders, it’s time to move into execution mode.
   Setting up the necessary environments and security measures to begin the migration is very important before starting the migration phase given that there are many moving parts, and will be more impactful for the migration project if all your setup is ready before moving forward.
3. **Database code conversion** process involves extracting code directly from the source systems’ database catalog, such as table definitions, views, stored procedures and functions. Once extracted, you migrate all this code to equivalent data definition languages (DDLs) in Snowflake. This step also includes migrating data manipulation language (DML) scripts, which may be used by business analysts to build reports or dashboards.
   All this code needs to be migrated and adjusted to work in Snowflake. The adjustments can range from simple changes, such as naming conventions and data type mappings, to more complex differences in syntax, platform semantics and other factors. To assist with this, Snowflake offers a powerful solution called SnowConvert AI, which automates much of the database code conversion process.
4. **Data migration** Data migration involves transferring data between different storage systems, formats, or computer systems. In the context of a Databricks to Snowflake migration, it specifically refers to moving data from your Databricks environment to your new Snowflake environment.

   There are two main types discussed in this guide:

* **Historical data migration:** Taking a snapshot of your Databricks data at a specific point in time and transferring it to Snowflake. This is often done as an initial, bulk transfer.
* **Incremental data migration:** Moving new or changed data from Databricks to Snowflake on an ongoing basis after the initial historical migration. This ensures that your Snowflake environment stays up-to-date with your source systems.

5. **Data ingestion:** After migrating the historical data, the next step is migrating the data ingestion process, bringing in live data from various sources. Typically, this process follows an extract, transform, load (ETL) or extract, load, transform (ELT) model, depending on when and where the data transformation occurs before it becomes available to business users.
6. **Reporting and analytics,** now that the database has both historical data and live pipelines continuously importing new data, the next step is to extract value from this data through BI. Reporting can be done using standard BI tools or custom queries. In both cases, the SQL sent to the database may need to be adjusted to meet Snowflake’s requirements. These adjustments can range from simple name changes (common during migration) to syntax and more complex semantic differences. All these need to be identified and addressed.
7. **Data validation and testing:** The goal is to have the data as clean as possible before entering this phase.
   Every organization has its own testing methodologies and requirements for moving data into production. These must be fully understood from the start of the project.
8. **Deployment.** At this stage, the data is validated, an equivalent system is set up, all the ETLs have been migrated, and reports have been verified. Are you ready to go live?
   Not so fast — there are still a few critical considerations before final promotion to production. First, your legacy application may consist of multiple components or services. Ideally, you should migrate these applications one by one (although parallel migration is possible) and promote them to production in the same order. During this process, ensure your bridging strategy is in place so business users don’t have to query both Snowflake and the legacy system. Data synchronization for applications that haven’t been migrated yet should happen behind the scenes through the bridging mechanism. If this isn’t done, business users will have to work in a hybrid environment, and they must understand the implications of this setup.
9. **Optimize and run** once a system has been migrated to Snowflake, it enters normal maintenance mode. All software systems are living organisms requiring ongoing maintenance. This phase, after migration, is referred to as optimize and run, and it is not part of the migration itself.

#

## **Key Phases**

A typical Databricks-to-Snowflake migration can be broken down into several key phases, each with distinct objectives and considerations. Following these phases will help ensure a structured and successful transition.

### **Phase 1: Planning and Design**

This initial phase is crucial for a successful migration. It lays the groundwork by defining your project’s scope, objectives, and requirements. It involves a deep understanding of your current Databricks environment and a clear vision for the future state in Snowflake.

#### **Your Actionable Steps:**

* **Conduct a Thorough Assessment of your Databricks Environment:** This involves more than just a technical inventory; it is a strategic exercise to identify “technical debt” and uncover opportunities for modernizing and simplifying the data estate.

  + **Inventory Existing Data Assets:** Meticulously identify and document all Databricks assets, including databases, tables (especially Delta Lake tables), views, notebooks (categorized by language: Python, Scala, SQL), jobs, workflows, User-Defined Functions (UDFs), and external integrations.
  + **Analyze Query Workloads:** Utilize Databricks’ monitoring tools and logs to pinpoint frequently executed and resource-intensive queries. These queries will be critical for performance validation post-migration.
  + **Categorize Data Assets:** Distinguish between production and non-production data, identify active versus deprecated objects, and pinpoint any redundant assets that can be excluded from migration. This significantly reduces the volume of data and code to be migrated, saving effort, time, and costs.
  + **Assess Security and Compliance Requirements:** Identify sensitive data, regulatory obligations (e.g. GDPR, HIPAA), and potential vulnerabilities within the existing Databricks environment. This information is critical for designing a robust security setup in Snowflake.
* **Define Clear Migration Objectives and Success Metrics:** Overlooking the precise definition of these objectives can lead to “moving goalposts” and project failure.

  + **Articulate Strategic Drivers:** Clearly state the business drivers (e.g. cost reduction, improved BI performance, simplified operations, enhanced governance) and technical objectives for migrating to Snowflake.
  + **Establish Measurable Success Metrics:** Define quantifiable metrics to track progress and demonstrate ROI, such as improvements in query performance (e.g. average query latency reduced by X%), demonstrable cost savings (e.g. Y% reduction in monthly cloud spend), a measurable decrease in operational incidents, increased user satisfaction scores, and verified data accuracy.
* **Choose Your Migration Approach: Phased vs. Big Bang Cutover:** The selection of a migration strategy is fundamentally a risk management decision.

  + **Phased Migration:** This approach involves moving data and workloads in smaller, manageable segments (by subject area, data mart, business unit, or application). It is highly recommended for maintaining zero or minimal downtime, allowing for continuous testing, iterative learning, and gradual workload shifting. This approach facilitates parallel runs for thorough validation.
  + **Big Bang Cutover:** This approach involves migrating all data and workloads at once, followed by an immediate switch. While potentially faster for very simple systems, it carries a high risk of unforeseen issues and is generally less safe for maintaining zero downtime.
* **Establish a Robust Migration Readiness Framework:** Early and continuous involvement of all stakeholders is paramount.

  + **Conduct a Formal Migration Readiness Assessment (MRA):** Involve a cross-functional team of experts (code conversion, data migration, data ingestion, data validation, reporting & analytics) and representatives from both business and technical sides.
  + **Develop a Detailed Project Timeline and RACI Matrix:** Ensure clarity of roles and responsibilities for all migration tasks.
  + **Secure Explicit Buy-in:** Obtain buy-in from all key stakeholders, including executive leadership and business users, from the outset. A technically flawless migration can still fail if business users are not adequately prepared, trained, or involved.

### **Phase 2: Environment and Security**

Setting up the necessary environments and security measures is a critical early step before you begin the migration. Snowflake operates under a shared security model between the platform and administrators.

**Your Actionable Steps:**

* **Set Up Environments:** Decide on the number of Snowflake accounts needed. At minimum, set up a production and development environment. Based on your strategy, consider additional environments for different testing stages.
* **Implement Security Measures:**

  + Start with network policies to ensure only authorized users within your VPN can access the Snowflake system.
  + Define roles based on business needs, as Snowflake’s user access control is role-based.
  + Create user accounts and enforce Multi-Factor Authentication (MFA) and/or Single Sign-On (SSO) for all users.
  + Set up service accounts without relying on traditional username/password authentication.
* **Define Roles During Migration:** Define specific roles for your migration team. Even in non-production environments, where the team may have more freedom, remember that you will be dealing with real data, so maintain robust security.
* **Rethink Your Access Model:** Use this migration to clean up and optimize your access hierarchy, ensuring only necessary users have access to specific resources.
* **Coordinate with Finance:** Align with your finance team to track Snowflake usage by department, utilizing Snowflake’s consumption-based pricing model and object tagging for cost allocation.

### **Phase 3: Database Code Conversion**

This phase focuses on converting your Databricks database code (DDL, SQL, Spark code) to Snowflake-compatible SQL and Snowpark.

**Your Actionable Steps:**

* **Map Databricks Spark Data Types to Snowflake Data Types:**

  + Meticulously identify and map Databricks (Spark) data types to their most appropriate Snowflake equivalents. Pay close attention to precision, scale, and time zone handling for complex types (e.g. TimestampType to TIMESTAMP_NTZ, TIMESTAMP_LTZ, or TIMESTAMP_TZ).
  + Be aware that ByteType maps to Snowflake’s INTEGER, and LongType (64-bit) to INTEGER (32-bit) may require range checks to prevent truncation.
  + ArrayType and MapType commonly map to Snowflake’s VARIANT data type.
* **Translate Data Definition Language (DDL) for Tables and Views:**

  + Extract existing DDL scripts from your Databricks environment, typically from Delta Lake tables.
  + Adjust the extracted DDL for full compatibility with Snowflake’s SQL dialect, removing or re-engineering Databricks-specific features (e.g., Delta Lake table properties, specific partitioning schemes beyond clustering keys).
  + Consider opportunities for schema reorganization, such as breaking down large schemas into multiple Snowflake databases or schemas for better logical separation and access control.
* **Convert Databricks SQL and Spark Code to Snowflake SQL and Snowpark:**

  + **Databricks SQL to Snowflake SQL:** Snowconvert AI now supports Spark SQL and Databricks SQL assessment and translation for TABLES and VIEWS.
  + **Spark Code (PySpark/Scala) to Snowpark:** Convert PySpark or Scala code from Databricks notebooks and jobs to Snowflake’s Snowpark API (Python, Java, Scala). Snowpark DataFrames offer similar functionalities to Spark DataFrames (filter, select, join, groupBy, agg), aiming to bring processing logic directly to data within Snowflake.
  + **User-Defined Functions (UDFs):** Re-implement Databricks UDFs (Python, Scala) as Snowflake UDFs (SQL, JavaScript, Python, Java, Scala). Complex Spark UDFs may require significant re-engineering to leverage Snowpark effectively.
  + **Orchestration Logic:** Re-design and re-implement Databricks Jobs, Workflows, and Delta Live Tables (DLT) orchestration logic in Snowflake using native features like Streams and Tasks for incremental transformations and scheduling. Alternatively, repoint external orchestrators (e.g., Airflow) to Snowflake, rewriting any embedded Databricks-specific code.

### **Phase 4: Data Migration**

Data migration is the process of transferring existing datasets from the Databricks environment to Snowflake. This phase typically involves both historical bulk data transfer and ongoing incremental data ingestion.

**Your Actionable Steps:**

* **Extract Data from Databricks:**

  + For Delta Lake tables, generate manifest files using Apache Spark, which point to the underlying Parquet data files that Snowflake can directly read.
  + For large tables, partition data exports for efficient parallel processing.
  + Leverage Databricks’ native Snowflake connector to directly read data from Databricks and write it to cloud storage (e.g. AWS S3, Azure Blob Storage) as a staging area for Snowflake.
  + Add a timestamp column for ingestion time and a source system name column to maintain lineage and control in Snowflake.
* **Load Data into Snowflake:**

  + Use Snowflake’s COPY INTO command for bulk loading data from external stages (cloud storage locations) into Snowflake tables.
  + For optimal performance with Parquet files, use Snowflake’s vectorized scanner (set USE_VECTORIZED_SCANNER in COPY command, or expect it to be default in future).
  + **Best Practices for Loading:**

    - **File Size Optimization:** Create files in the range of 100-250MB with compression (e.g., Snappy for Parquet) for optimal throughput.
    - **Purging Staged Files:** Use PURGE=TRUE in the COPY command to remove files from the stage after successful loading, optimizing performance and managing storage costs.
    - **Error Handling:** Use ON_ERROR=’CONTINUE’ in the COPY command for large files with potential bad data, allowing good data to load while ignoring problematic rows.
    - **Internal Stages:** Consider using Snowflake’s internal stages for faster loading compared to external stages, but compare storage costs.
  + For incremental data loading, implement Change Data Capture (CDC) pipelines to replicate new or changed data from Databricks to Snowflake. Tools like Fivetran or Matillion can automate these syncs.

### **Phase 5: Data Ingestion**

This phase focuses on migrating the ongoing data ingestion processes and ETL/ELT pipelines from Databricks to Snowflake, ensuring a continuous flow of live data.

**Your Actionable Steps:**

* **Re-engineer Databricks ETL/ELT Workflows:**

  + Re-engineer Databricks ETL/ELT workflows (often built using PySpark, Scala, or SQL with Delta Live Tables (DLT) or Databricks Jobs) for Snowflake.
  + For complex ETL/ELT, convert Spark code to Snowpark DataFrames and UDFs (as discussed in Phase 1). For SQL-based transformations, consider dbt (data build tool) for transformations within Snowflake.
  + **Leverage Snowflake Native Features:**

    - **Streams and Tasks:** Use Streams to record DML changes for incremental processing and Tasks to schedule SQL statements or stored procedures for incremental transformations and orchestration directly within Snowflake.
    - **Snowpipe:** For real-time, continuous loading of new data, use Snowpipe for trickle feeds. For batch loading, the COPY command remains a powerful option.
    - **Snowpipe Streaming:** Ideal for low-latency streaming use cases.
* **Realign Data Sources and Sinks:**

  + Redirect multiple inbound data sources currently landing in Databricks to Snowflake ingestion patterns by configuring connectors or custom ingestion processes to point directly to Snowflake stages or tables.
  + Develop a plan to repoint downstream systems (e.g., BI tools, other applications) that currently read from Databricks to Snowflake once data pipelines have stabilized and data validation is complete.

### **Phase 6: Reporting and Analytics Transition**

This phase focuses on ensuring that Business Intelligence (BI) and analytical tools continue to function correctly and optimally with Snowflake as the new data source.

**Your Actionable Steps:**

* **Adjust BI Tools and Custom Queries:**

  + Repoint or refactor existing reporting tools (e.g., Tableau, Power BI, Looker) and adjust custom queries that previously ran against Databricks.
  + Adjust SQL queries sent to the database for Snowflake’s requirements, which can range from simple name changes to more complex syntax and semantic differences.
* **Engage Business Users and Provide Training:**

  + Include business users as key stakeholders in the migration process (e.g., in the RACI matrix during planning). Their acceptance is crucial for a full transition away from the legacy platform.
  + Train business users on how Snowflake operates and ensure they clearly understand the platform differences. This will enable them to modify their custom queries and reports as needed.
  + Consider a parallel training track for business users, followed by office hours with migration experts, to help address platform differences and guide users through necessary adjustments.

### **Phase 7: Data Validation and Testing**

Data validation and testing are often underestimated steps in the migration planning process, yet they are critical to ensuring data integrity and accuracy in the new Snowflake environment. The goal is to have the data as clean as possible before entering this phase.

**Your Actionable Steps:**

* **Conduct Comprehensive Testing Strategies:** Every organization has its own testing methodologies and requirements for moving data into production, which must be fully understood from the start of the project.

  + **Functional Testing:** Verify that all migrated applications and functionalities work as expected within the new environment, ensuring data integrity and accuracy. This includes verifying that migrated ETLs and reports produce correct results.
  + **Performance Testing:** Evaluate query performance, data loading speed, and overall system responsiveness. This helps identify and address any performance bottlenecks in Snowflake, ensuring the new platform meets or exceeds performance expectations.
  + **User Acceptance Testing (UAT):** Involve end-users in the testing process to ensure the migrated system meets their business requirements and gather feedback for potential improvements. This is crucial for gaining user confidence and adoption.
  + **Data Validation Techniques:** Compare row counts, calculate sums, maximums, minimums, and averages of columns, and hash row values for one-on-one association between source (Databricks) and target (Snowflake) systems. Running parallel systems for a period allows for real-time comparison.
* **Provide Training and Documentation:**

  + Offer comprehensive training to end-users on Snowflake’s features, functionalities, and best practices, covering topics like data access, query optimization, and security.
  + Create comprehensive documentation, including system architecture diagrams, data flow diagrams, operational procedures, user guides, troubleshooting guides, and FAQs for easy reference and ongoing support.

### **Phase 8: Deployment - Going Live**

This stage involves critical considerations before final promotion to production, ensuring a smooth and coordinated cutover.

**Your Actionable Steps:**

* **Plan Phased Rollout and Bridging Strategy:**

  + Ideally, migrate legacy applications one by one and promote them to production in the same order.
  + Ensure a bridging strategy is in place so business users do not have to query both Snowflake and the legacy Databricks system. Data synchronization for applications not yet migrated should happen behind the scenes through this mechanism.
* **Ensure Stakeholder Alignment and Formal Sign-offs:**

  + When ready for cutover, ensure all stakeholders are aligned and understand that Snowflake will be the system of record, not the legacy Databricks platform.
  + Obtain final and formal sign-offs from all stakeholders before proceeding.
  + Emphasize that any reports not migrated are now the responsibility of business users, highlighting the importance of early user involvement.
  + Verify that all permissions have been properly granted in Snowflake, including any Active Directory-based roles.
* **Address Critical Considerations for Cutover:**

  + **Surrogate Keys:** If using surrogate keys, be aware that their lifecycle may differ between legacy and Snowflake systems; these keys need to be synchronized during cutover.
  + **Cutover Timing:** Consider the optimal timing for cutover based on your industry to minimize business impact.
  + **Legacy Platform Decommissioning:** Plan for the decommissioning of the legacy Databricks environment, including considerations for legacy platform licensing and data retention policies.

### **Phase 9: Optimize and Run - Continuous Improvement**

Once a system has been migrated to Snowflake, it enters normal maintenance mode. This phase, referred to as “Optimize and Run,” is not part of the migration itself but focuses on ongoing optimization and continuous improvement.

**Your Actionable Steps:**

* **Focus on Ongoing Optimization and Cost Management:**

  + The team takes full ownership of the system in Snowflake, with optimization driven by usage patterns.
  + While jobs in Snowflake generally run faster, if performance doesn’t meet expectations, optimizations may be needed to fully leverage Snowflake’s unique architecture.
  + Utilize Snowflake’s query analysis tools to identify bottlenecks and optimize specific parts of the workflow.
  + Address only critical performance issues during migration, treating broader optimization as a post-migration effort.
  + **Implement Continuous Cost Management:**

    - Set auto-suspend timeouts for virtual warehouses to 60 seconds to significantly reduce costs, as Snowflake charges for every second a warehouse is running with a minimum of 60 seconds per resume.
    - Reduce virtual warehouse sizes based on workload requirements, as compute resources and costs scale exponentially with warehouse size.
    - Continuously monitor usage patterns and coordinate with finance to track departmental usage for cost allocation.
* **Enhance Governance and Security:**

  + Refine role-based access control, implement dynamic data masking and row access policies for sensitive data, and regularly audit access patterns.
  + Rethink the access model to clean up the hierarchy of users and ensure only necessary users have access to specific resources.

---
title: Databricks to Snowflake notebook transformation
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/notebooks/databricks/sample.md
section: Migrations
---

# Databricks to Snowflake notebook transformation

This document describes the transformation process from Databricks notebooks to Snowflake notebooks (vNext).

The transformation tool converts Databricks notebooks (`.py` format with `# COMMAND ----------` markers) to Snowflake notebooks (`.ipynb` format), adapting Databricks-specific APIs to functional equivalents in Snowflake.

---

## Input and output files

### Input

| File | Description |
| --- | --- |
| `input/dbx_with_dbutis_run.py` | Databricks notebook with `dbutils` commands |

### Output

| File | Description |
| --- | --- |
| `output/Output/dbx_with_dbutis_run.ipynb` | Transformed notebook for Snowflake |

---

## Transformation example

### Input file: `dbx_with_dbutis_run.py`

```python
# Databricks notebook source
dbutils.notebook.help("run")

# COMMAND ----------

dbutils.notebook.run("./my_second_notebook", timeout_seconds=1000)

# COMMAND ----------

print(myVar)

# COMMAND ----------

# MAGIC %r
# MAGIC names <- c("Product A", "Product B", "Product C", "Product D")
# MAGIC sales <- c(120, 450, 300, 780)
# MAGIC df <- data.frame(names, sales)
# MAGIC df$total_with_tax <- df$sales * 1.15
# MAGIC print(df)
# MAGIC barplot(df$sales, names.arg=df$names, col="steelblue", main="Sales Overview")
```

### Output file: `dbx_with_dbutis_run.ipynb`

The transformed notebook contains the following cells:

#### Cell 0 - Connection configuration

```sql
-- To configure the connection in vNext notebook, uncomment the following code and update the values accordingly.
--     use role <ROLE>;
--     use database <DATABASE>;
--     USE SCHEMA <SCHEMA>;
--     USE WAREHOUSE <WAREHOUSE>;
```

#### Cell 1 - Utility imports

```python
import sfutils
from snowflake.snowpark.session import Session

spark = Session.getActiveSession() or Session.builder.configs(connection_parameter).getOrCreate()
```

#### Cell 2 - dbutils help (unchanged)

```python
dbutils.notebook.help("run")
```

#### Cell 3 - Notebook execution (transformed)

```python
sfutils.notebook.run("./my_second_notebook", timeout_seconds = 1000)
```

#### Cell 4 - Python code (unchanged)

```python
print(myVar)
```

#### Cell 5 - R code (with warning)

SPRKDBX1003 R cells code are not supported in Snowsight. You must rewrite the R code in Python.

For more information, see EWI codes in this topic.

```r
names <- c("Product A", "Product B", "Product C", "Product D")
sales <- c(120, 450, 300, 780)
df <- data.frame(names, sales)
df$total_with_tax <- df$sales * 1.15
print(df)
barplot(df$sales, names.arg=df$names, col="steelblue", main="Sales Overview")
```

## Applied transformations

### 1. Addition of initialization cells

Cells are automatically added at the beginning of the notebook for:

* Snowflake connection configuration (commented for customization)
* Import of `sfutils` and Snowpark session creation

### 2. Conversion of `dbutils.notebook.run()`

| Databricks | Snowflake |
| --- | --- |
| `dbutils.notebook.run("./my_second_notebook", timeout_seconds=1000)` | `sfutils.notebook.run("./my_second_notebook", timeout_seconds = 1000)` |

### 3. Handling of unsupported language cells

Cells with `# MAGIC %r` (R) or `# MAGIC %scala` (Scala) are marked with an EWI (Early Warning Issue) comment:

```python
#EWI: SPRKDBX1003 => R cells code are not supported in Snowsight. It is necessary to rewrite the R code in Python.
```

For more information, see EWI codes in this topic.

## EWI codes (early warning issues)

During transformation, warnings may be generated for code that requires manual review:

| Code | Description |
| --- | --- |
| `SPRKDBX1003` | R cell code is not supported in Snowsight. Requires rewriting in Python |

## Important considerations

1. **Snowflake Session**: The transformed notebook automatically initializes a Snowpark session.
2. **R/Scala Cells**: Require manual migration to Python.
3. **Notebook Execution**: `dbutils.notebook.run()` is converted to `sfutils.notebook.run()`.

## Recommended workflow

1. Databricks Notebook (.py with COMMAND)
2. Transformation Tool
3. Snowflake Notebook (.ipynb)

## Complete migration example

### Before (Databricks)

```python
# Databricks notebook source

# Get widget value
env = dbutils.widgets.get("env")
print(f"Running in environment: {env}")

# COMMAND ----------

# Execute child notebook
result = dbutils.notebook.run("./process_data", timeout_seconds=3600, arguments={"env": env})

# COMMAND ----------

# Read file
content = dbutils.fs.head("/mnt/data/config.json")
print(content)
```

### After (Snowflake)

```python
# Cell 0 - Configuration
import sfutils
from snowflake.snowpark.session import Session

spark = Session.getActiveSession() or Session.builder.configs(connection_parameter).getOrCreate()
```

```python
# Cell 1 - Get widget
env = sfutils.widgets.get("env")
print(f"Running in environment: {env}")
```

```python
# Cell 2 - Execute notebook
result = sfutils.notebook.run("./process_data", timeout_seconds=3600, arguments={"env": env})
```

```python
# Cell 3 - Read file
content = sfutils.fs.head("/mnt/data/config.json")
print(content)
```

---
title: ETL Migration
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/etl-migration-replatform.md
section: Migrations
---

# ETL Migration

The Replatform feature of SnowConvert AI can be used to migrate your legacy ETL workflows to cloud-native architectures on Snowflake.
This feature converts the legacy ETL packages into modern data transformation frameworks like [dbt Core](../../../../user-guide/data-engineering/dbt-projects-on-snowflake.md), while preserving the original orchestration logic using Snowflake’s native [TASKs](../../../../sql-reference/sql/create-task.md) and [stored procedures](../../../../sql-reference/sql/create-procedure.md).

This topic describes the process to migrate [SSIS (SQL Server Integration Services)](https://learn.microsoft.com/en-us/sql/integration-services/sql-server-integration-services?view=sql-server-ver17) packages to dbt projects on Snowflake, using the Replatform feature.

## Migration strategy

The process involves separating the transformation logic from the orchestration logic in the existing SSIS packages. Using the Replatform feature, SnowConvert AI decomposes the SSIS packages into two primary components:

* dbt projects for data transformation logic
* Snowflake TASKS or stored procedures for orchestration of the ETL workflows

The generated dbt projects and TASKS/stored procedures can then be deployed to Snowflake and executed.

## Prerequisites

The prerequisites to use the Replatform feature for ETL migration are:

* The latest version of SnowConvert AI is installed.
* Source dependencies are accessible in Snowflake. The source dependencies are required for running the migrated dbt projects.
* DTSX (Data Transformation Services XML) package files should be extracted from ISPAC (Integration Services Project Archive) files as the ISPAC files are not directly supported.
* SSIS package version 8 or later is installed. If you have an earlier version, [upgrade your packages](https://learn.microsoft.com/en-us/sql/integration-services/install-windows/upgrade-integration-services-packages-using-the-ssis-package-upgrade-wizard?view=sql-server-ver17).

## Migration Steps

Follow these steps to migrate your SSIS project:

### Create a project

1. From the SnowConvert AI home page, select **New project**. Enter a **Project name** and **Select source** (for example, SQL Server for SSIS migrations). Optionally, enable **AI features** to verify and fix your converted code using Snowflake Cortex AI. Select **Continue**.

### Add code to the project

2. On the **Add code to your project** page, select **Already have code** to provide your source files directly.

### Set up code and ETL projects

3. On the **Set up code and ETL/BI projects** page:

   a. Select **Browse** next to **Where is your source code?** to specify the path to your DDL scripts and source files. Include DDL scripts for all dependencies to ensure high-quality migrated code.

   b. Select **Browse** next to **Where should the project be saved?** to specify the output path for the migrated code.

   c. Under **Have ETL projects or BI reports?**, check **SSIS or Informatica** and select **Browse** to navigate to your SSIS project folder containing DTSX files.

   d. Select **Save project**.

### Run the conversion

4. From the project dashboard, under **Conversion**, select **Convert code and ETL/BI projects**.

5. Optionally, configure conversion settings. For example, under **Prepare code**, you can enable **Arrange** to let the tool arrange the code before translation. This will help with structuring the input and trying to upgrade package versions. Select **Save settings** to confirm, then select **Continue** to start the conversion.

6. SnowConvert AI migrates your SSIS project and any scripts in the specified paths. After migration completes, review and fix any EWIs (Errors, Warnings, and Informational messages) identified in the reports.

7. Fill placeholders in `sources.yml`, `profiles.yml`, and `dbt_project.yml`.

8. Review the generated output. It should include:

* **ETL/**: Main folder containing all converted SSIS packages

  + **etl_configuration/**: Infrastructure components (control_variables table, UDFs, procedures)
  + **{PackageName}/**: Folder for each SSIS package containing:

    - **{PackageName}.sql**: Orchestration file (TASK or PROCEDURE)
    - **{DataFlowTaskName}/**: dbt project for each Data Flow Task
* **script.sql**: Migrated SQL scripts (if applicable)
  For a detailed description of the output structure, see Output Structure.

9. Upload your dbt project using any one of the following options:

   * **Option A**: Upload using Snowflake CLI

     Run this command in your dbt project directory (replace values in italics with your schema, database, and package names):

     ```bash
     snow dbt deploy --schema schema_name --database database_name --force package_name
     ```

     If successful, continue to step 10.
   * **Option B**: Upload via Snowflake Workspace

     Navigate to **Workspaces > Add new > Upload Folder** and select your dbt project folder.

     Deploy the dbt project to make it accessible for orchestration:

     a. Select **Connect > Deploy dbt project** at the top right corner.

     b. Use the project name that matches your dbt project folder name.
     For example: For `Process_Sales_Files_Load_Sales_Data/`, use “Process_Sales_Files_Load_Sales_Data”. This name is referenced in the orchestration file via `EXECUTE DBT PROJECT` commands.

     If your orchestration uses `public.Package`, use the following:

     ```sql
     EXECUTE DBT PROJECT public.Package ARGS='build --select tag:package_dataflowtask --target dev';
     ```

     Use `Package` as your project name when deploying.

     **Note**: Deploy all dbt projects in your migration.
10. Run your dbt project by selecting the correct database and schema.

    **For single dataflow projects:**

    Run the dbt project directly if you have only one data flow.

    **For multi-dataflow projects:**

    a. **Run the orchestration SQL file** to create all TASK objects. This will create:

    * Initialization TASK and all its dependent TASKs.
    * Stored procedures corresponding to the reusable SSIS packages.

    b. **Execute the orchestration** for TASK-based orchestration (standard packages) and PROCEDURE-based orchestration (reusable packages) as shown below:

    **For TASK-based orchestration**

    ```sql
    -- Execute the root task
    EXECUTE TASK public.Package;
    ```

    **For PROCEDURE-based orchestration**

    ```sql
    -- Call the stored procedure
    CALL public.PackageName();
    ```

    **Note**: Check your generated SQL file to determine whether your package uses the TASK or PROCEDURE pattern.

## Output Structure

SnowConvert AI organizes all migration output under the `Output/ETL/` folder. Here’s the complete folder structure:

```text
Output/
└── ETL/
    ├── etl_configuration/
    │   ├── tables/
    │   │   └── control_variables_table.sql
    │   ├── functions/
    │   │   └── GetControlVariableUDF.sql
    │   └── procedures/
    │       └── UpdateControlVariable.sql
    ├── {PackageName}/
    │   ├── {PackageName}.sql                          # Main orchestration TASK
    │   └── {DataFlowTaskName}/                        # One dbt project per Data Flow Task
    │       ├── dbt_project.yml
    │       ├── profiles.yml
    │       ├── models/
    │       │   ├── sources.yml
    │       │   ├── staging/
    │       │   │   └── stg_raw__{component_name}.sql
    │       │   ├── intermediate/
    │       │   │   └── int_{component_name}.sql
    │       │   └── marts/
    │       │       └── {destination_component_name}.sql
    │       ├── macros/
    │       │   ├── m_update_control_variable.sql
    │       │   └── m_update_row_count_variable.sql
    │       ├── seeds/
    │       └── tests/
    └── (additional packages...)/
```

**SSIS to SnowConvert AI conversion mapping:**

* **SSIS Data Flow Tasks** → dbt projects (one per Data Flow Task)
* **SSIS Control Flow** → Snowflake TASK objects or stored procedures
* **SSIS Variables** → control_variables table + UDFs + DBT variables
* **SSIS Containers** → Inline conversion within parent TASK/procedure

### ETL configuration components

The `etl_configuration/` folder contains shared infrastructure components required by all ETL orchestrations. These components work together to manage variables across package executions:

* **control_variables_table.sql**: Creates a transient table to store package variables, parameters, and their values across orchestration executions
* **GetControlVariableUDF.sql**: User-defined function to retrieve variable values from the control variables table
* **UpdateControlVariable.sql**: Stored procedure to update variable values during orchestration execution

**Schema Dependencies**: The UDFs and stored procedures in the `etl_configuration/` folder are generated with hardcoded schema references (default: `public`). If you deploy these objects to a different schema, you must update the schema references within:

* The `GetControlVariableUDF.sql` function (references `public.control_variables` in the SELECT statement)
* The `UpdateControlVariable.sql` procedure (references `public.control_variables` in the MERGE statement)
* Any orchestration scripts that call these objects

## Common naming and sanitization rules

SnowConvert AI applies consistent sanitization rules to all SSIS object names to ensure dbt and Snowflake compatibility. This includes packages, tasks, components, and variables.

| Rule | Description | Example |
| --- | --- | --- |
| **Convert to lowercase** | All names converted to lowercase | `MyPackage` → `mypackage` |
| **Replace invalid characters** | Spaces, hyphens, and special characters become underscores | `My-Package Name` → `my_package_name` |
| **Remove consecutive underscores** | Avoids `__` sequences (except `stg_raw__` prefix) | `my___package` → `my_package` |
| **Prefix with `t_`** | Adds prefix if name starts with a number | `123package` → `t_123package` |
| **Remove quotes and brackets** | Strips surrounding quotes and brackets | `[Package]` → `package` |

These rules apply uniformly across all generated artifacts: dbt model names, Snowflake TASK names, procedure names, and variable identifiers.

### Data flow task output (dbt Projects)

Each [SSIS Data Flow Task](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/data-flow?view=sql-server-ver17) is converted into a standalone dbt project with a three-tier architecture. These dbt projects contain all the data transformation logic from your original SSIS packages.

**Supported Data Flow Components**: For a complete list of supported sources, transformations, and destinations, see the [SSIS Translation Reference](../../translation-references/ssis/README.md).

#### Layer organization

Each dbt project follows a three-tier architecture that separates data extraction, transformation, and loading:

| Layer | Materialization | Purpose |
| --- | --- | --- |
| **models/staging/** | View | Provides clean, type-safe access to source data referenced in `sources.yml` |
| **models/intermediate/** | Ephemeral | Contains transformation logic from source ETL (not persisted to database for performance) |
| **models/marts/** | Incremental or Table | Final, business-ready data models corresponding to target tables. If the target overrides data in the table or re-creates the table, it will be materialized as a table; otherwise it will be materialized as an incremental model. |

**Materialization configuration:**

Default materializations are defined in `dbt_project.yml`. However, individual models can override these defaults when needed:

* Use `{{ config(materialized='view') }}` to change a specific model’s materialization
* Use `{{ config(alias='...') }}` in mart models to customize the final table name in Snowflake

#### dbt model naming conventions

SnowConvert AI uses prefixes to indicate each model’s layer in the dbt project:

| Model Type | Naming Pattern | Examples |
| --- | --- | --- |
| **Staging** | `stg_raw__{component_name}` | `stg_raw__flat_file_source`, `stg_raw__ole_db_source` |
| **Intermediate** | `int_{component_name}` | `int_derived_column`, `int_union_all` |
| **Mart** | `{destination_component_name}` | `ole_db_destination`, `stgdimgroup` |

The `stg_raw__` prefix indicates a staging model that selects from a raw source, while the `int_` prefix marks intermediate transformation models. Mart models use the destination table name directly or can specify a custom alias.

**Important notes:**

* All component names are sanitized according to the naming rules above
* Mart models become the final table names in Snowflake
* You can customize mart table names using `{{ config(alias='TableName') }}`

#### dbt project organization

**Organization structure:**

* **One dbt project per Data Flow Task** (for example, `Process_Sales_Files_Load_Sales_Data/`)
* **Package-level folder** contains the orchestration SQL file and all dbt project folders
* **Models organized by layer** (staging, intermediate, marts) within each dbt project
* **Orchestration execution** uses `EXECUTE DBT PROJECT` commands

#### sources.yml configuration

The `sources.yml` file, located in the `models/` directory, declares all source tables used by the dbt project. This file serves three key purposes:

* **Connection**: Links dbt models to raw data tables in Snowflake
* **Documentation**: Provides metadata and descriptions for source systems
* **Lineage**: Enables tracking data flow from sources through transformations

**Important**: Before deploying your dbt project, replace the `YOUR_SCHEMA` and `YOUR_DB` placeholders with your actual Snowflake schema and database names.

#### dbt macros

Each dbt project includes these macros:

| Macro | Purpose |
| --- | --- |
| **m_update_control_variable.sql** | Updates control variables from dbt models and syncs changes to orchestration |
| **m_update_row_count_variable.sql** | Captures row counts from transformations (similar to SSIS row count updates) |

### Control flow task output (orchestration)

[SSIS control flow logic](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/control-flow?view=sql-server-ver17) is converted into Snowflake orchestration using [TASK](../../../../sql-reference/sql/create-task.md) objects or [stored procedures](../../../../sql-reference/sql/create-procedure.md). This orchestration layer manages the execution sequence of your dbt projects and handles variables, containers, and package execution.

**Supported control flow elements**: For a complete list of supported tasks and containers, see the [SSIS Translation Reference](../../translation-references/ssis/README.md).

#### Naming conventions for orchestration objects

Orchestration objects follow consistent naming patterns based on the SSIS package and task names:

| Object Type | Naming Pattern | Example |
| --- | --- | --- |
| **Orchestration files** | `{PackageName}.sql` | `Package.sql`, `StgDimGroup.sql` |
| **Package initialization TASK** | `{schema}.{PackageName}` | `public.Package` |
| **Data Flow TASK** | `{schema}.{package_name}_{dataflow_name}` | `public.package_process_sales_files` |
| **Stored Procedure** (reusable) | `{schema}.{PackageName}` | `public.ReusableETLPackage` |

**Notes:**

* All names are sanitized according to the naming rules described earlier
* Stored procedures are used when packages are called by at least one ExecutePackage task from another control flow

#### Orchestration approach

Each SSIS package generates an orchestration SQL file. The conversion pattern depends on whether the package is reused:

##### Standard packages (not called by ExecutePackage tasks)

Standard packages that are not called by ExecutePackage tasks from other control flows are converted to Snowflake TASK objects. Each package typically generates two types of TASKs:

* **Initialization TASK**: Creates and refreshes control variables for the package

  + Deletes existing package variables from the `control_variables` table
  + Inserts all variables and parameters with their default values using `TO_VARIANT()`
* **Main orchestration TASKs**: Contain the core control flow logic

  + Declared with `WAREHOUSE=DUMMY_WAREHOUSE` (update this to your actual warehouse name)
  + Uses the `AFTER` clause to establish task dependencies
  + Executes converted control flow and data flow tasks

##### Reusable packages (called by ExecutePackage tasks)

Packages that are called by at least one ExecutePackage task from another control flow are converted to stored procedures instead of TASK objects. This is necessary because Snowflake TASK objects can’t be called synchronously from other tasks.

**Key characteristics:**

* FDM generated: [SSC-FDM-SSIS0005](../technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md)
* Invocation: `CALL schema.ProcedureName(params)` from parent orchestration
* Benefits: Enables synchronous execution and can be called from multiple parent packages with different parameter values

**Example orchestration structure:**

```sql
CREATE OR REPLACE TASK public.Package AS
BEGIN
   -- Initialize control variables
   DELETE FROM public.control_variables WHERE variable_scope = 'Package';
   INSERT INTO public.control_variables ...
END;

CREATE OR REPLACE TASK public.package_data_flow_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   -- Declare LET variables from control table
   LET User_Variable VARCHAR := public.GetControlVariableUDF('User_Variable', 'Package') :: VARCHAR;

   -- Execute dbt project
   EXECUTE DBT PROJECT public.My_DataFlow_Project ARGS='build --target dev';

   -- Update control variables
   CALL public.UpdateControlVariable('User_Variable', 'Package', TO_VARIANT(:User_Variable));
END;
```

#### Variable management

SSIS variables are converted into a comprehensive management system using four interconnected mechanisms:

##### 1. Control variables table

The `control_variables` table serves as the central storage for all package variables and parameters. Each variable is stored with the following metadata:

| Field | Type | Description |
| --- | --- | --- |
| `variable_name` | VARCHAR | Variable name |
| `variable_value` | VARIANT | Value (accommodates any data type) |
| `variable_type` | VARCHAR | Original SSIS data type |
| `variable_scope` | VARCHAR | Package or container name |
| `is_parameter` | BOOLEAN | Distinguishes parameters from variables |
| `is_persistent` | BOOLEAN | Reserved for future use |
| `last_updated_at` | TIMESTAMP | Last update time |

##### 2. getControlVariableUDF function

This user-defined function retrieves variable values within TASK logic. Use it to read variable values from the control variables table:

```sql
LET MyVar VARCHAR := public.GetControlVariableUDF('MyVar', 'Package') :: VARCHAR;
```

##### 3. updateControlVariable procedure

This stored procedure updates variable values during orchestration execution. Use it to write variable changes back to the control variables table:

```sql
CALL public.UpdateControlVariable('MyVar', 'Package', TO_VARIANT(:MyVar));
```

##### 4. dbt macros

Each dbt project includes macros that enable variable operations from within dbt models:

* `m_update_control_variable.sql`: Updates control variables and syncs changes back to the orchestration layer
* `m_update_row_count_variable.sql`: Captures row counts from transformations, similar to SSIS row count variable updates

#### Migrating SSIS containers

SnowConvert AI uses an **inline conversion approach** for [SSIS containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/control-flow?view=sql-server-ver17) rather than creating separate procedures. This architectural decision preserves execution context and simplifies the migration.

**More on inline conversion**

Migrating SSIS extends beyond the task of “translate this component to that component.” It involves translating the entire ETL context consisting of control flow, variables, and data movement. Our inline approach preserves that context:

* **One place to debug**: Containers and branches are converted inline inside parent Snowflake procedures or tasks.
* **Deterministic orchestration**: Standalone packages are migrated as Snowflake TASKs with explicit dependencies. Packages called by ExecutePackage tasks are migrated as procedures for clean and synchronous reuse.
* **Fewer naming conflicts**: Object names are sanitized across dbt models, tasks, procedures, and variables, so deployments remain predictable in shared environments.
* **Encapsulation of data movement logic and business logic**: Data movement and business logic land in dbt with layered models and macros, while orchestration runs natively on Snowflake.

**What gets converted inline:**

* [Sequence Containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/sequence-container?view=sql-server-ver17) - Sequential task execution with marked boundaries
* [For Loop Containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/for-loop-container?view=sql-server-ver17) - Container structure preserved, iteration logic requires manual implementation
* [ForEach Loop Containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/foreach-loop-container?view=sql-server-ver17) - File enumerators converted to Snowflake stage operations, other types require manual work
* [Event Handlers](https://learn.microsoft.com/en-us/sql/integration-services/integration-services-ssis-event-handlers?view=sql-server-ver17) - Not supported; implement using Snowflake exception handling

For detailed conversion specifications, examples, and EWI/FDM references for all control flow elements and task conversions, see the [SSIS Translation Reference](../../translation-references/ssis/README.md).

---
title: Interactive Assessment Application
source: https://docs.snowflake.com/en/migrations/sma-docs/interactive-assessment-application/overview.md
section: Migrations
---

# Interactive Assessment Application

The Interactive Assessment Application (IAA) is a Streamlit in Snowflake (SiS) app that provides insights into the Snowpark Migration Accelerator (SMA) output. These insights highlight key findings to help you understand and address migration challenges.

To use the IAA, see the [IAA-Support repository](https://github.com/Snowflake-Labs/IAA-Support).

---
title: Mapping for dbutils and sfutils
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/notebooks/databricks/dbutils-mapping.md
section: Migrations
---

# Mapping for dbutils and sfutils

This document provides a reference mapping between Databricks `dbutils` utilities and their Snowflake `sfutils` equivalents. When migrating notebooks from Databricks to Snowflake using the Snowpark Migration Accelerator (SMA), these utility functions are automatically translated to their corresponding Snowflake implementations.

The `dbutils` library in Databricks provides a set of utility functions for working with files, notebooks, and widgets. In Snowflake, the equivalent functionality is provided through the `sfutils` library, which offers compatible methods designed to work seamlessly within the Snowflake environment.

## File System Utilities

The file system utilities allow you to interact with cloud storage and manage files. These functions enable common file operations such as copying, moving, listing, and deleting files.

| Databricks | Snowflake |
| --- | --- |
| dbutils.fs | sfutils.fs |
| dbutils.fs.cp | sfutils.fs.cp |
| dbutils.fs.head | sfutils.fs.head |
| dbutils.fs.ls | sfutils.fs.ls |
| dbutils.fs.mkdirs | sfutils.fs.mkdirs |
| dbutils.fs.mv | sfutils.fs.mv |
| dbutils.fs.put | sfutils.fs.put |
| dbutils.fs.rm | sfutils.fs.rm |

## Notebook Utilities

The notebook utilities provide functionality to run notebooks programmatically and control notebook execution flow. These are essential for orchestrating multi-notebook workflows and building modular data pipelines.

| Databricks | Snowflake |
| --- | --- |
| dbutils.notebook | sfutils.notebook |
| dbutils.notebook.run | sfutils.notebook.run |
| dbutils.notebook.exit | sfutils.notebook.exit |

## Widget Utilities

Widget utilities enable you to create interactive input controls in notebooks. These are useful for parameterizing notebooks and allowing users to provide input values at runtime without modifying the code.

| Databricks | Snowflake |
| --- | --- |
| dbutils.widgets | sfutils.widgets |
| dbutils.widgets.combobox | sfutils.widgets.combobox |
| dbutils.widgets.dropdown | sfutils.widgets.dropdown |
| dbutils.widgets.get | sfutils.widgets.get |
| dbutils.widgets.getAll | sfutils.widgets.getAll |
| dbutils.widgets.getArgument | sfutils.widgets.getArgument |
| dbutils.widgets.multiselect | sfutils.widgets.multiselect |
| dbutils.widgets.remove | sfutils.widgets.remove |
| dbutils.widgets.removeAll | sfutils.widgets.removeAll |
| dbutils.widgets.text | sfutils.widgets.text |

## Usage Notes

* All `dbutils` calls in your Databricks notebooks are automatically translated to their `sfutils` equivalents during the migration process.
* The function signatures and parameters remain consistent between the two implementations to ensure a smooth transition.
* If you encounter any unsupported functionality, refer to the Snowflake documentation for alternative approaches.

---
title: Markdown magic cell transformation
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/notebooks/databricks/magic-md.md
section: Migrations
---

# Markdown magic cell transformation

This document describes how the Snowpark Migration Accelerator (SMA) handles the transformation of Markdown magic cells during notebook migration.

## Magic Markdown cell transformation

When the SMA processes a notebook and detects a magic cell that begins with `%md`, it automatically transforms the cell into a standard Jupyter notebook (`.ipynb`) markdown cell.

### How it works

In Databricks notebooks, Markdown content is commonly written using magic commands:

```python
%md
# My Documentation
This is a **markdown** cell with formatted text.
```

During migration, the SMA recognizes this pattern and converts it into a native notebook cell with the cell metadata set to `"markdown"`. This ensures that:

* The content is properly recognized as documentation/markdown in the target environment.
* Markdown rendering is correctly applied.
* The notebook maintains its intended documentation structure.

### Before migration (Databricks)

A cell with the `%md` magic command in the notebook JSON structure:

```json
{
  "cell_type": "code",
  "source": [
    "%md\n",
    "# Customer Analysis\n",
    "This notebook performs **customer segmentation** analysis."
  ],
  "metadata": {},
  "outputs": []
}
```

### After migration (Snowflake)

The same content is converted to a notebook cell with the language metadata set to `markdown`:

```json
{
  "cell_type": "code",
  "source": [
    "# Customer Analysis\n",
    "This notebook performs **customer segmentation** analysis."
  ],
  "metadata": {
    "language": "markdown"
  },
  "outputs": []
}
```

Note that the `%md` magic command is removed from the source, and the cell metadata now includes `"language": "markdown"` to indicate the cell contains documentation content.

### Benefits

* **Native markdown support**: The migrated notebook uses native markdown cell types instead of magic commands.
* **Better rendering**: Markdown cells are properly rendered in notebook environments without requiring code execution.
* **Cleaner structure**: Removal of magic command prefixes results in cleaner, more portable documentation cells.

---
title: Migrating with Cortex Code
source: https://docs.snowflake.com/en/migrations/sma-docs/migrating-with-cortex-code/README.md
section: Migrations
---

# Migrating with Cortex Code

You can use Cortex Code skills alongside the Snowpark Migration Accelerator to migrate your code to Snowflake.

* [Spark to Snowpark Connect](spark-to-snowpark-connect.md):
  Migrate PySpark code to Snowpark Connect using the Cortex Code CLI.

---
title: Oracle to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/oracle.md
section: Migrations
---

# **Oracle to Snowflake Migration Guide**

## **Snowflake Migration Framework**

A typical Oracle-to-Snowflake migration can be broken down into nine key phases. This guide provides a comprehensive framework to navigate the technical and strategic challenges involved, ensuring a smooth transition from a traditional database architecture to Snowflake’s cloud data platform.

## **Migration Phases**

### **Phase 1: Planning and Design**

This initial phase is critical for establishing the foundation of a successful migration. Migrating from Oracle involves significant architectural shifts, and a thorough plan is essential to align stakeholders, define scope, and prevent budget overruns and missed deadlines.

**Your Actionable Steps:**

* **Conduct a Thorough Assessment of Your Oracle Environment:**

  + **Inventory & Analyze:** Catalog all database objects, including schemas, tables, views, materialized views, indexes, packages, procedures, functions, and triggers. Use Oracle’s data dictionary views (DBA_OBJECTS, DBA_SOURCE, DBA_TABLES, etc.) to gather this metadata.
  + **Analyze Workloads:** Use Oracle’s Automatic Workload Repository (AWR) reports and dynamic performance views (V$SQL, V$SQLAREA) to identify query patterns, user concurrency, performance bottlenecks, and resource utilization. This data is crucial for designing your Snowflake virtual warehouse strategy.
  + **Identify Dependencies:** Map all upstream data sources (ETL/ELT jobs, data streams) and downstream consumers (BI tools, applications, reporting services). Pay special attention to applications that rely heavily on PL/SQL packages.
* **Define the Migration Scope and Strategy:**

  + **Prioritize Workloads:** Classify workloads by business impact and technical complexity. Start with a high-impact, low-complexity workload (e.g., a specific data mart) to demonstrate value and build momentum.
  + **Choose a Migration Approach:** Decide between a “lift and shift” approach for a faster migration or a re-architecture approach to modernize and optimize data models, ETL/ELT pipelines, and procedural logic.
* **Develop the Project Plan:**

  + **Establish a Team:** Create a migration team with clear roles (Project Manager, Data Engineer, Oracle DBA, Snowflake Architect, Security Admin, Business Analyst).
  + **Create a Timeline:** Define realistic timelines and milestones for each of the nine phases.
  + **Define Success Metrics:** Establish clear KPIs to measure success, such as cost reduction, query performance improvement, increased concurrency, and user satisfaction.

### **Phase 2: Environment and Security**

With a solid plan, the next step is to prepare the Snowflake environment and translate Oracle’s security model. This involves setting up accounts, networking, and a new role-based access control (RBAC) structure.

**Your Actionable Steps:**

* **Set Up Your Snowflake Account:**

  + **Choose Edition and Cloud Provider:** Select the Snowflake edition (e.g., Standard, Enterprise, Business Critical) that meets your security and feature requirements. Choose a cloud provider (AWS, Azure, or GCP) and region that aligns with your cloud strategy and minimizes latency to your users and other cloud services.
  + **Design a Warehouse Strategy:** Based on the workload analysis from Phase 1, create an initial set of virtual warehouses. Isolate different workloads (e.g., WH_LOADING, WH_TRANSFORM, WH_BI_ANALYTICS) to prevent resource contention. Start with T-shirt sizes (e.g., X-Small, Small) and plan to resize them based on performance testing.
* **Implement the Security Model:**

  + **Map Oracle Users/Roles to Snowflake Roles:** Translate Oracle’s user, role, and privilege model into Snowflake’s hierarchical RBAC model. This is a significant shift, as Oracle’s granular system-level and object-level privileges do not map directly. Create a hierarchy of functional roles (SYSADMIN, SECURITYADMIN) and access roles (BI_READ_ONLY, ETL_READ_WRITE).
  + **Configure Network Policies and Authentication:** Set up network policies to restrict access to trusted IP addresses (e.g., your corporate network or VPN). Configure authentication methods, such as federated authentication (SSO) with an identity provider like Okta or Azure AD.

### **Phase 3: Database Code Conversion**

This phase involves converting Oracle’s DDL, DML, and extensive PL/SQL codebase to be compatible with Snowflake. This is often the most complex and time-consuming phase of the migration.

**Your Actionable Steps:**

* **Convert DDL (Data Definition Language):**

  + **Tables and Views:** Extract CREATE TABLE and CREATE VIEW statements from Oracle. Convert Oracle-specific data types to their Snowflake equivalents (see Appendix 2).
  + **Remove Oracle-Specific Clauses:** Eliminate Oracle-specific physical storage clauses like TABLESPACE, PCTFREE, INITRANS, STORAGE, and complex partitioning/indexing schemes. Snowflake manages storage and data layout automatically.
  + **Re-implement Constraints:** Snowflake enforces only NOT NULL constraints. PRIMARY KEY and UNIQUE constraints can be defined but are not enforced; they serve primarily as metadata for BI tools and optimizers. FOREIGN KEY constraints are not supported. All data integrity logic must be moved into your ETL/ELT processes.
* **Convert DML (Data Manipulation Language) and Procedural Code:**

  + **Rewrite PL/SQL:** Oracle’s PL/SQL (packages, procedures, functions, triggers) must be completely rewritten. Common targets include Snowflake Scripting (SQL), JavaScript UDFs/UDTFs/Procs, or externalizing the logic into a transformation tool like dbt or an orchestration service like Airflow.
  + **Translate SQL Functions:** Map Oracle-specific functions to their Snowflake counterparts (e.g., SYSDATE becomes CURRENT_TIMESTAMP(), NVL becomes IFNULL, VARCHAR2 becomes VARCHAR). See Appendix 3 for common mappings.
  + **Replace Sequences:** Re-create Oracle sequences using Snowflake’s SEQUENCE object.
  + **Handle MERGE Statements:** Review and test MERGE statements carefully, as the syntax and behavior can differ slightly between Oracle and Snowflake.

### **Phase 4: Data Migration**

This phase focuses on the physical movement of historical data from your Oracle database to Snowflake tables. The most common approach involves extracting data to files and loading them via a cloud storage stage.

**Your Actionable Steps:**

* **Extract Data from Oracle to Files:**

  + Use methods like Oracle Data Pump, SQL\*Plus spooling, or UTL_FILE to extract table data to a structured file format (e.g., Parquet, compressed CSV).
  + For very large databases, consider using third-party data integration tools (e.g., Fivetran, Matillion, Talend, Informatica) that can efficiently extract data from Oracle.
* **Upload Data to a Cloud Storage Stage:**

  + Transfer the extracted files to a cloud storage location (Amazon S3, Azure Blob Storage, or Google Cloud Storage) that will serve as an external stage for Snowflake.
* **Load Data from Stage into Snowflake:**

  + **Create External Stages:** In Snowflake, create an external stage object that points to the cloud storage location containing your data files.
  + **Use the COPY INTO Command:** Use Snowflake’s COPY INTO <table> command to load the data from the stage into the target Snowflake tables. This command is highly performant and scalable.
  + **Leverage a Sized-Up Warehouse:** Use a dedicated, larger virtual warehouse for the initial data load to accelerate the process, then scale it down or suspend it afterward to manage costs.

### **Phase 5: Data Ingestion**

Once the historical data is migrated, you must re-engineer your ongoing data ingestion pipelines to feed data directly into Snowflake.

**Your Actionable Steps:**

* **Migrate Batch ETL/ELT Jobs:**

  + Update existing ETL jobs (in tools like Oracle Data Integrator, Informatica, or Talend) to target Snowflake as the destination. This involves changing the connection details and rewriting Oracle-specific SQL overrides to use Snowflake’s dialect.
* **Implement Continuous Ingestion:**

  + For continuous data loading, configure Snowpipe to automatically ingest files as they arrive in your cloud storage stage. This is an ideal replacement for micro-batch jobs.
* **Utilize the Snowflake Ecosystem:**

  + Explore Snowflake’s native connectors for platforms like Kafka and Spark, or leverage partner tools to simplify direct data streaming and change data capture (CDC) from Oracle.

### **Phase 6: Reporting and Analytics**

This phase involves redirecting all downstream applications, particularly BI and reporting tools, to query data from Snowflake.

**Your Actionable Steps:**

* **Update Connection Drivers:** Install and configure Snowflake’s ODBC/JDBC drivers on servers hosting your BI tools (e.g., Tableau Server, Power BI Gateway, Oracle Analytics Server).
* **Redirect Reports and Dashboards:**

  + In your BI tools, change the data source connection from Oracle to Snowflake.
  + Test all critical reports and dashboards to ensure they function correctly.
* **Review and Optimize Queries:**

  + Many dashboards contain custom SQL with Oracle-specific hints or functions. Review and refactor these queries to use standard SQL and leverage Snowflake’s performance features. Use the Query Profile tool in Snowflake to analyze and optimize slow-running reports.

### **Phase 7: Data Validation and Testing**

Rigorous testing is essential to build business confidence in the new platform and ensure data integrity and performance meet expectations.

**Your Actionable Steps:**

* **Perform Data Validation:**

  + **Row Counts:** Compare row counts between source tables in Oracle and target tables in Snowflake.
  + **Cell-Level Validation:** For critical tables, perform a deeper validation by comparing aggregated values (SUM, AVG, MIN, MAX) or using checksums on key columns.
* **Conduct Query and Performance Testing:**

  + **Benchmark Queries:** Execute a representative set of queries against both Oracle and Snowflake and compare results and performance.
  + **BI Tool Performance:** Test the load times and interactivity of key dashboards connected to Snowflake.
* **User Acceptance Testing (UAT):**

  + Involve business users to validate their reports and perform their daily tasks using the new Snowflake environment. Gather feedback and address any issues.

### **Phase 8: Deployment**

Deployment is the final cutover from Oracle to Snowflake. This process should be carefully managed to minimize disruption to business operations.

**Your Actionable Steps:**

* **Develop a Cutover Plan:**

  + Define the sequence of events for the cutover. This includes stopping ETL jobs pointing to Oracle, performing a final data sync, redirecting all connections, and validating system health.
* **Execute the Final Data Sync:**

  + Perform one last incremental data load to capture any data changes that occurred during the testing phase.
* **Go Live:**

  + Switch all production data pipelines and user connections from Oracle to Snowflake.
  + Keep the Oracle environment in a read-only state for a short period as a fallback before decommissioning it.
* **Decommission Oracle:**

  + Once the Snowflake environment is stable and validated in production, you can decommission your Oracle database servers to stop incurring license and maintenance costs.

### **Phase 9: Optimize and Run**

This final phase is an ongoing process of managing performance, cost, and governance in your new Snowflake environment. The goal is to continuously refine your setup to maximize value.

**Your Actionable Steps:**

* **Implement Performance and Cost Optimization:**

  + **Right-Size Warehouses:** Continuously monitor workload performance and adjust virtual warehouse sizes up or down to meet SLAs at the lowest possible cost.
  + **Set Aggressive Auto-Suspend Policies:** Set the auto-suspend timeout for all warehouses to 60 seconds to avoid paying for idle compute time.
  + **Use Clustering Keys:** For very large tables (multi-terabyte), analyze query patterns and define clustering keys to improve the performance of highly filtered queries.
* **Establish Long-Term FinOps and Governance:**

  + **Monitor Costs:** Use Snowflake’s ACCOUNT_USAGE schema and resource monitors to track credit consumption and prevent budget overruns.
  + **Refine Security:** Regularly audit roles and permissions to ensure the principle of least privilege is maintained. Implement advanced security features like Dynamic Data Masking and Row-Access Policies for sensitive data.

## **Appendix**

### **Appendix 1: Snowflake vs. Oracle Architecture**

| Feature | Oracle | Snowflake |
| --- | --- | --- |
| **Architecture** | Monolithic or shared-disk (RAC). Tightly coupled compute and storage. | Decoupled compute, storage, and cloud services (Multi-cluster, Shared Data). |
| **Storage** | Managed by the database on local disks, SAN, or NAS (filesystems/ASM). | Centralized object storage (S3, Blob, GCS) with automatic micro-partitioning. |
| **Compute** | Fixed server resources (CPU, Memory, I/O). | Elastic, on-demand virtual warehouses (compute clusters). |
| **Concurrency** | Limited by server hardware and session/process limits. | High concurrency via multi-cluster warehouses that spin up automatically. |
| **Scaling** | Vertical (more powerful server) or Horizontal (RAC nodes). Often requires downtime and significant effort. | Instantly scale compute up/down/out (seconds); storage scales automatically. |
| **Maintenance** | Requires DBAs to perform tasks like index rebuilds, statistics gathering, and tablespace management. | Fully managed; maintenance tasks are automated and run in the background. |

### **Appendix 2: Data Type Mappings**

| Oracle | Snowflake | Notes |
| --- | --- | --- |
| NUMBER(p,s) | NUMBER(p,s) | Direct mapping. |
| NUMBER | NUMBER(38,0) | Unspecified Oracle NUMBER maps to Snowflake’s max precision integer. |
| FLOAT, BINARY_FLOAT, BINARY_DOUBLE | FLOAT |  |
| VARCHAR2(n) | VARCHAR(n) | VARCHAR2 and VARCHAR are functionally equivalent. |
| CHAR(n) | CHAR(n) |  |
| NVARCHAR2(n), NCHAR(n) | VARCHAR(n), CHAR(n) | Snowflake’s default character set is UTF-8, making special national character types unnecessary. |
| CLOB, NCLOB | VARCHAR / STRING | Snowflake’s VARCHAR can hold up to 16MB. |
| BLOB | BINARY | Snowflake’s BINARY can hold up to 8MB. For larger objects, consider storing in external stages. |
| RAW(n) | BINARY(n) |  |
| DATE | TIMESTAMP_NTZ | Oracle DATE stores both date and time. TIMESTAMP_NTZ is the closest equivalent. |
| TIMESTAMP(p) | TIMESTAMP_NTZ(p) |  |
| TIMESTAMP(p) WITH TIME ZONE | TIMESTAMP_TZ(p) |  |
| TIMESTAMP(p) WITH LOCAL TIME ZONE | TIMESTAMP_LTZ(p) |  |
| INTERVAL YEAR TO MONTH / DAY TO SECOND | VARCHAR or rewrite logic | Snowflake does not have an INTERVAL data type. Use date/time functions for calculations. |
| XMLTYPE | VARIANT | Load XML data into a VARIANT column for semi-structured querying. |

### **Appendix 3: SQL & Function Differences**

| Oracle | Snowflake | Notes |
| --- | --- | --- |
| SYSDATE | CURRENT_TIMESTAMP() | CURRENT_DATE() and CURRENT_TIME() are also available. |
| DUAL table | None | Not required. SELECT 1; is valid syntax in Snowflake. |
| NVL(expr1, expr2) | IFNULL(expr1, expr2) or NVL(expr1, expr2) | Both are supported in Snowflake. COALESCE is the ANSI standard. |
| DECODE(expr, search, result…) | DECODE(expr, search, result…) or CASE | CASE statements are more standard and flexible. |
| ROWNUM | ROW_NUMBER() window function | ROWNUM is applied before ORDER BY. ROW_NUMBER() is more explicit and standard. |
| LISTAGG(expr, delim) | LISTAGG(expr, delim) | Syntax is similar. |
| Outer Join (+) | LEFT/RIGHT/FULL OUTER JOIN | Snowflake requires the standard ANSI join syntax. |
| MINUS operator | MINUS / EXCEPT | Both are supported in Snowflake. |
| **Procedural Language** | PL/SQL (Packages, Procedures, Triggers) | Snowflake Scripting, JavaScript, Java, Python |
| **Sequences** | CREATE SEQUENCE | CREATE SEQUENCE |
| **Transactions** | COMMIT, ROLLBACK | COMMIT, ROLLBACK |
| **Hints** | /\*+ … \*/ | None |

---
title: PNDSPY1001
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1001.md
section: Migrations
---

# PNDSPY1001

**Message** < **element** > is not supported, pandas element is not supported yet.

**Category** Conversion Error

## Description

This issue appears when the SMA detects the use of a pandas element that isn’t supported in Snowpark pandas and doesn’t have its own error code. This is the generic error code that the SMA uses for an unsupported element.

## Scenario

A pandas element that isn’t supported by Snowpark.

### Input

The following example shows a pandas element that isn’t supported by Snowpark.

```python
import pandas as pd

pd.not_supported_function()
```

### Output

The SMA adds the EWI `PNDSPY1001` to the output code to let you know that this element isn’t supported by Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1001 => pandas.not_supported_function is not supported
pd.not_supported_function()
```

## Recommended fix

Since this is a generic error code that applies to a range of unsupported functions, no single fix applies to all cases. The appropriate action depends on the particular element in use.

Even though the element isn’t supported, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

## Additional recommendations

If you believe that Snowpark pandas already supports this element or that a workaround exists, report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.

---
title: PNDSPY1002
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1002.md
section: Migrations
---

# PNDSPY1002

**Message** Pandas < **element** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

pd.melt(df, id_vars=['A'], value_vars=['B'])
```

### Output

The SMA adds the EWI `PNDSPY1002` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1002 => pandas.core.reshape.melt.melt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
pd.melt(df, id_vars=['A'], value_vars=['B'])
```

## Recommended fix

Since this is a generic error code that applies to a range of partially supported functions, no single fix applies to all cases. The appropriate action depends on the particular element in use.

Even though the element isn’t supported in some scenarios, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1003
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1003.md
section: Migrations
---

# PNDSPY1003

**Message** < **element** > is not recognized, pandas element is not recognized yet.

**Category** Conversion Error

## Description

This issue appears when the SMA encounters a pandas element that it doesn’t yet recognize.

This issue can occur for different reasons, such as:

* An element that doesn’t exist in pandas.
* An element that was added in a pandas version that the SMA doesn’t yet support.
* An internal error of the SMA when processing the element.

## Scenarios

The following scenarios illustrate different reasons why this issue might occur.

### Scenario 1

An element that doesn’t exist in pandas.

#### Input

The following example shows an element that doesn’t exist in pandas.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

df.non_existent_function()
```

#### Output

Since the element doesn’t exist in pandas, the tool adds the EWI to the output code.

```python
import snowflake.snowpark.modin.pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

#EWI: PNDSPY1003 => pandas.core.frame.DataFrame.non_existent_function is not yet recognized
df.non_existent_function()
```

#### Recommended fix

Check the [pandas documentation](https://pandas.pydata.org/docs/reference/index.html) to verify whether the element exists in pandas.

If it’s a valid pandas element, report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.

If it isn’t a valid pandas element, remove it and use a valid pandas function.

```python
import snowflake.snowpark.modin.pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

df.valid_existent_function()
```

### Scenario 2

An element that was added in a pandas version that the SMA doesn’t yet support.

#### Input

The following example shows an element that was added in a pandas version that the SMA doesn’t yet support.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

df.valid_function_since_x.x.x_version()
```

#### Output

Since the element was added in a pandas version that the tool doesn’t yet support, the tool adds the EWI to the output code.

```python
import snowflake.snowpark.modin.pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

#EWI: PNDSPY1003 => pandas.core.frame.DataFrame.valid_function_since_x.x.x_version is not yet recognized
df.valid_function_since_x.x.x_version()
```

#### Recommended fix

Verify the [Snowpark pandas documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/modin/index). If it’s a valid pandas element, report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.

### Scenario 3

An internal error of the SMA when processing an element.

#### Input

The following example shows an internal error of the SMA when processing an element.

```python
import pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

df.valid_function()
```

#### Output

If an error occurred while processing the element and the tool can’t recognize it, the tool adds the EWI to the output code.

```python
import snowflake.snowpark.modin.pandas as pd

df = pd.DataFrame(
    {
        "Name": ["Alice", "Bob", "Charlie"],
        "Age": [25, 30, 35],
        "City": ["New York", "Los Angeles", "Chicago"],
    }
)

#EWI: PNDSPY1003 => pandas.core.frame.DataFrame.valid_function is not yet recognized
df.valid_function()
```

#### Recommended fix

Verify whether the element exists in the [pandas documentation](https://pandas.pydata.org/docs/reference/index.html) and also check the [Snowpark pandas documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/modin/index).
If it’s a valid pandas element, report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.

---
title: PNDSPY1004
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1004.md
section: Migrations
---

# PNDSPY1004

**Message** Pandas < **element** > has a direct mapping, but it’s restricted to running on a single node. As a result, performance may be impacted, especially with large datasets.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but it’s restricted to running on a single node, so its performance may be impacted.

## Scenario

A method that runs on a single node.

### Input

The following example shows a method that runs on a single node.

```python
import pandas as pd

ser = pd.Series([1, 2, 3, 3])
ser.plot(kind='hist', title="My plot")
```

### Output

The SMA adds the EWI `PNDSPY1004` to the output code to let you know that this element runs on a single node, and its performance may be impacted.

```python
import snowflake.snowpark.modin.pandas as pd

ser = pd.Series([1, 2, 3, 3])
#EWI: PNDSPY1004 => pandas.core.series.Series.plot has a direct mapping, but it's restricted to running on a single node. As a result, performance may be impacted,
especially with large datasets.
ser.plot(kind='hist', title="My plot")
```

## Recommended fix

Since this is a generic error code that applies to a range of partially supported functions, no single recommended fix exists.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify the pandas elements that run on a single node.

---
title: PNDSPY1005
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1005.md
section: Migrations
---

# PNDSPY1005

**Message** `pandas.core.series.Series.str.get` has a partial mapping because in one scenario it has a different behavior in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.str.get](https://pandas.pydata.org/docs/reference/api/pandas.Series.str.get.html) usage.
Snowpark pandas offers a partial equivalent, but when it comes to columns with mixed data types the method may not behave as expected. All values within a column must be of the same type.

## Scenario

An unsupported use of `pandas.core.series.Series.str.get`.

### Input

The following example shows an unsupported use of `pandas.core.series.Series.str.get`.

```python
import pandas as pd

s = pd.Series(["String", (1, 2, 3), ["a", "b", "c"], 123, -456, {1: "Hello", "2": "World"}])
print(s.str.get(1))
```

### Output

The SMA adds the EWI `PNDSPY1005` to the output code to indicate that in one scenario it has a different behavior in Snowpark.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

s = pd.Series(["String", (1, 2, 3), ["a", "b", "c"], 123, -456, {1: "Hello", "2": "World"}])
#EWI: PNDSPY1005 => pandas.core.series.Series.str.get has a partial mapping, because in one scenario it has a different behavior in Snowpark.
print(s.str.get(1))
```

## Recommended fix

Ensure that the Series contains only one type of data (all strings, all lists, or all dicts).
No code change is strictly required, but be aware that this operation might not work as expected in Snowpark pandas.

---
title: PNDSPY1006
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1006.md
section: Migrations
---

# PNDSPY1006

**Message** `pandas.core.series.Series.apply` has a partial mapping because Snowpark pandas does not support a non-callable function as parameter.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.apply](https://pandas.pydata.org/docs/reference/api/pandas.Series.apply.html) usage.
Snowpark pandas offers a partial equivalent, but it doesn’t support non-callable values as a parameter.

## Scenario

An unsupported use of `pandas.core.series.Series.apply`.

### Input

The following example shows an unsupported use of `pandas.core.series.Series.apply`.

```python
import pandas as pd

ser = pd.Series([20, 21, 12], index=['London', 'New York', 'Helsinki'])
print(ser.apply(5))
```

### Output

The SMA adds the EWI `PNDSPY1006` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

ser = pd.Series([20, 21, 12], index=['London', 'New York', 'Helsinki'])
#EWI: PNDSPY1006 => pandas.core.series.Series.apply has a partial mapping, because Snowpark pandas does not support callable functions as parameter.
print(ser.apply(5))
```

## Recommended fix

Ensure that the function used within the apply method is callable.

```python
import pandas as pd

def my_function(x):
    return x * 5

ser = pd.Series([20, 21, 12], index=['London', 'New York', 'Helsinki'])
print(ser.apply(my_function))
```

---
title: PNDSPY1007
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1007.md
section: Migrations
---

# PNDSPY1007

**Message** `pandas.core.series.Series.str.slice` has a partial mapping because it has several scenarios not supported in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.str.slice](https://pandas.pydata.org/docs/reference/api/pandas.Series.str.slice.html) usage.
Snowpark pandas offers a partial equivalent, but the current implementation has two unsupported scenarios.

## Scenarios

The following scenarios illustrate the two unsupported use cases.

### Scenario 1

The first scenario is when the method receives mixed data type columns, which causes it to behave unexpectedly.
All values within a column must be of the same type.

#### Input

The following example shows an unsupported use of `pandas.core.series.Series.str.slice`.

```python
import pandas as pd

s = pd.Series(["String", (1, 2, 3), ["a", "b", "c"], 123, -456, {1: "Hello", "2": "World"}])
print(s.str.slice(1))
```

#### Output

The SMA adds the EWI `PNDSPY1007` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

s = pd.Series(["String", (1, 2, 3), ["a", "b", "c"], 123, -456, {1: "Hello", "2": "World"}])
#EWI: PNDSPY1007 => pandas.core.series.Series.str.slice has a partial mapping,because has several scenarios not supported in Snowpark pandas.
print(s.str.slice(1))
```

#### Recommended fix

Ensure that the Series contains only one type of data (all strings, all lists, or all dicts).
No code change is strictly required, but be aware that this operation might not work as expected in Snowpark pandas.

### Scenario 2

The second scenario is when a column contains list values and the `step` parameter is set to a value other than one.

#### Input

The following example shows an unsupported use of `pandas.core.series.Series.str.slice`.

```python
import pandas as pd

ser = pd.Series(["koala", "dog", "chameleon","cat", "mouse", "elephant","lion", "tiger", "bear"])
print(ser.str.slice(step=3))
```

#### Output

The SMA adds the EWI `PNDSPY1007` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

ser = pd.Series(["koala", "dog", "chameleon","cat", "mouse", "elephant","lion", "tiger", "bear"])
#EWI: PNDSPY1007 => pandas.core.series.Series.str.slice has a partial mapping, because has several scenarios not supported in Snowpark pandas.
print(ser.str.slice(step=3))
```

#### Recommended fix

This requires a manual change. Use the [apply](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.Series.apply) function to achieve the same behavior with a lambda function.
Here is the previous output code with the recommended fix:

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

ser = pd.Series(["koala", "dog", "chameleon","cat", "mouse", "elephant","lion", "tiger", "bear"])
print(ser.apply(lambda lst: lst[::3]))
```

---
title: PNDSPY1008
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1008.md
section: Migrations
---

# PNDSPY1008

**Message** `pandas.core.series.Series.hist` has a partial mapping, Snowpark pandas doesn’t yet support the `bins` parameter with types other than `int`.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.hist](https://pandas.pydata.org/docs/reference/api/pandas.Series.hist.html) usage.
Snowpark pandas doesn’t yet support the `bins` parameter with types other than `int`.

## Scenario

An unsupported use of `pandas.core.series.Series.hist`.

### Input

The following example shows an unsupported use of `pandas.core.series.Series.hist`.

```python
import pandas as pd

data = pd.Series([[1.2, -0.5, 0.3, 2.1, -2.2, 1.7, 0.0, -1.1, 2.5, -2.8]])
custom_bins = [-3, -2, -1, 0, 1, 2, 3]
data.hist(bins=custom_bins)
plt.xlabel('Value')
plt.ylabel('Frequency')
plt.show()
```

### Output

The SMA adds the EWI `PNDSPY1008` to the output code to indicate that in one scenario it isn’t supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

data = pd.Series([[1.2, -0.5, 0.3, 2.1, -2.2, 1.7, 0.0, -1.1, 2.5, -2.8]])
custom_bins = [-3, -2, -1, 0, 1, 2, 3]
#EWI: PNDSPY1008 => pandas.core.series.Series.hist has a partial mapping, Snowpark pandas doesn't yet support the `bins` parameter with types other than `int`.
data.hist(bins=custom_bins)
plt.xlabel('Value')
plt.ylabel('Frequency')
plt.show()
```

## Recommended fix

This requires a manual change using the numpy [digitize](https://numpy.org/doc/stable/reference/generated/numpy.digitize.html) function. To use `digitize`, import numpy and replace `pd.Series` with `np.array`. Then count the frequencies for each bin and create labels for the custom bins. Finally, use `plt.bar` to plot the histogram with custom labels.
Here is the previous output code with the fix:

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

data = np.array([[1.2, -0.5, 0.3, 2.1, -2.2, 1.7, 0.0, -1.1, 2.5, -2.8]])
custom_bins = [-3, -2, -1, 0, 1, 2, 3]
bin_indices = np.digitize(data, custom_bins, right=False)

counts = [np.sum(bin_indices == i) for i in range(1, len(custom_bins))]

bin_labels = [f"({custom_bins[i - 1]}, {custom_bins[i]})" for i in range(1, len(custom_bins))]

plt.bar(bin_labels, counts, edgecolor='black', alpha=0.7)
plt.xlabel('Value')
plt.ylabel('Frequency')
plt.show()
```

---
title: PNDSPY1009
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1009.md
section: Migrations
---

# PNDSPY1009

**Message** `pandas.core.frame.DataFrame.apply` has a partial mapping because it has several scenarios not supported in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.frame.DataFrame.apply](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.apply.html) usage.
Snowpark pandas offers a partial equivalent, but the current implementation has two unsupported scenarios.

## Scenarios

The following scenarios illustrate the two unsupported use cases.

### Scenario 1

Snowpark pandas `DataFrame.apply` API doesn’t yet support the `result_type` parameter.

#### Input

The following example shows an unsupported use of `pandas.core.frame.DataFrame.apply`.

```python
import pandas as pd

df = pd.DataFrame({"A": [1, 2], "B": [3, 4], "C": [5, 6]})
df.apply(np.mean, axis=1, result_type="expand")
```

#### Output

The SMA adds the EWI `PNDSPY1009` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({"A": [1, 2], "B": [3, 4], "C": [5, 6]})
#EWI: PNDSPY1009 => pandas.core.frame.DataFrame.apply has a partial mapping, because has several scenarios not supported in Snowpark pandas.
df.apply(np.mean, axis=1, result_type="expand")
```

#### Recommended fix

Ensure that the `apply` method doesn’t contain the `result_type` parameter.

### Scenario 2

Snowpark pandas `DataFrame.apply` API doesn’t yet support `DataFrame` or `Series` as `args` or `kwargs` parameters.

#### Input

The following example shows an unsupported use of `pandas.core.frame.DataFrame.apply`.

```python
import pandas as pd

df = pd.DataFrame({"A": [1, 2], "B": [3, 4], "C": [5, 6]})
ser = pd.Series([10, 20])

def custom_func(row, ser):
    return row["A"] + ser[row.name]

df.apply(custom_func, axis=1, args=(ser, ))
```

#### Output

The SMA adds the EWI `PNDSPY1009` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({"A": [1, 2], "B": [3, 4], "C": [5, 6]})
ser = pd.Series([10, 20])

def custom_func(row, ser):
    return row["A"] + ser[row.name]

#EWI: PNDSPY1009 => pandas.core.frame.DataFrame.apply has a partial mapping, because has several scenarios not supported in Snowpark pandas.
df.apply(custom_func, axis=1, args=(ser, ))
```

#### Recommended fix

For this scenario, use the `values` attribute of the Series and create a new column in the DataFrame to hold the Series values, then use the `apply` method without passing the `Series` as an argument.

```python
import pandas as pd

ser = pd.Series([10, 20])

df = pd.DataFrame({"A": [1, 2], "B":[3, 4], "C": [5, 6]})
df["extra"] = ser.values

def custom_func(row):
    return row["A"] + row["extra"]

df.apply(custom_func, axis=1)
```

---
title: PNDSPY1010
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1010.md
section: Migrations
---

# PNDSPY1010

**Message** `pandas.core.groupby.grouper.Grouper` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.groupby.grouper.Grouper](https://pandas.pydata.org/docs/reference/api/pandas.Grouper.html) usage.
Snowpark pandas currently has limitations with `Grouper` parameters. It doesn’t support `origin`, `offset`, `dropna`, or `closed`.

## Scenario

An unsupported use of `pandas.core.groupby.grouper.Grouper`.

### Input

The following example shows an unsupported use of `pandas.core.groupby.grouper.Grouper`.

```python
import pandas as pd

df = pd.DataFrame({
        "date": pd.to_datetime([
            "2023-01-01", "2023-01-02", "2023-01-03", None, "2023-01-05", "2023-01-06", None
        ]),
        "value": [0, 1, 2, 3, 4, 5, 6]
    })

df.groupby(pd.Grouper(key="date", freq="3D", origin="epoch" offset="1D", dropna=True)).sum()
```

### Output

The SMA adds the EWI `PNDSPY1010` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({
        "date": pd.to_datetime([
            "2023-01-01", "2023-01-02", "2023-01-03", None, "2023-01-05", "2023-01-06", None
        ]),
        "value": [0, 1, 2, 3, 4, 5, 6]
    })

#EWI: PNDSPY1010 => pandas.core.groupby.grouper.Grouper has a partial mapping, because there is a not supported scenario in Snowpark pandas.
df.groupby(pd.Grouper(key="date", freq="3D", origin="epoch" offset="1D", dropna=True)).sum()
```

## Recommended fix

This requires a manual adjustment based on the parameters used in the `Grouper` method, essentially mimicking its behavior:

* Sort and Dropna: These parameters can be replaced with the ones in the [groupby](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/pandas_api/modin.pandas.DataFrame.groupby) method.
* Offset and Origin: You can use `pd.Timedelta` to represent these values and manually adjust the datetime column by subtracting the `offset` or `origin` before using `groupby`.

The `groupby` doesn’t have a frequency parameter, so you can use the `pd.Timedelta` to create a new column that represents the period you want to group by.

To illustrate the recommended fix, here is the output code with the changes applied:

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({
        "date": pd.to_datetime([
            "2023-01-01", "2023-01-02", "2023-01-03", None, "2023-01-05", "2023-01-06", None
        ]),
        "value": [0, 1, 2, 3, 4, 5, 6]
    })

freq = pd.Timedelta("3D")

origin = pd.Timestamp("1970-01-01")

origin += pd.Timedelta("1D")

df["period"] = origin + ((df["date"] - origin) // freq) * freq
result = df.groupby("period", dropna=True)["value"].sum()
```

---
title: PNDSPY1011
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1011.md
section: Migrations
---

# PNDSPY1011

**Message** `pandas.core.groupby.generic.DataFrameGroupBy.resample` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.groupby.generic.DataFrameGroupBy.resample](https://pandas.pydata.org/docs/reference/api/pandas.core.groupby.DataFrameGroupBy.resample.html) usage.
Snowpark pandas currently has limitations with `DataFrameGroupBy.resample`. The `rule` parameter only supports `s`, `min`, `h`, and `D` as frequency values.

## Scenario

An unsupported use of `pandas.core.groupby.generic.DataFrameGroupBy.resample`.

### Input

The following example shows an unsupported use of `pandas.core.groupby.generic.DataFrameGroupBy.resample`.

```python
import pandas as pd

df = pd.DataFrame({
        "category": ["A", "A", "B", "B"],
        "date": pd.to_datetime(["2023-01-01", "2023-01-15", "2023-01-01", "2023-01-20"]),
        "value": [10, 20, 30, 40]
    })

df = df.set_index("date")
df.groupby("category").resample("ME").sum()
```

### Output

The SMA adds the EWI `PNDSPY1011` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({
        "category": ["A", "A", "B", "B"],
        "date": pd.to_datetime(["2023-01-01", "2023-01-15", "2023-01-01", "2023-01-20"]),
        "value": [10, 20, 30, 40]
    })

df = df.set_index("date")
#EWI: PNDSPY1011 => pandas.core.groupby.generic.DataFrameGroupBy.resample has a partial mapping because there is a not supported scenario in Snowpark pandas.
df.groupby("category").resample("ME").sum()
```

## Recommended fix

This requires a manual adjustment. You can use the `pd.Grouper` method to create a new column that represents the period you want to group by, and then use the `groupby` method.
To illustrate the recommended fix, here is the output code with the changes applied:

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({
        "category": ["A", "A", "B", "B"],
        "date": pd.to_datetime(["2023-01-01", "2023-01-15", "2023-01-01", "2023-01-20"]),
        "value": [10, 20, 30, 40]
    })

df = df.set_index("date")
df.groupby(["category", pd.Grouper(freq="ME")]).sum()
```

---
title: PNDSPY1012
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1012.md
section: Migrations
---

# PNDSPY1012

**Message** `pandas.core.frame.DataFrame.query` has a partial mapping because there is an unsupported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA detects the use of [pandas.core.frame.DataFrame.query](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.query.html).
This method is commonly used for filtering data in pandas DataFrames, but Snowpark pandas currently has limitations in supporting it.
Specifically, it doesn’t support DataFrames that have a row MultiIndex, which can lead to compatibility issues during migration or execution.

## Scenario

Using `query()` with a row MultiIndex.

### Input

The following example shows how `query()` behaves with a row MultiIndex.

```python
import modin.pandas as pd

data = {
    'name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve', 'Frank'],
    'age': [25, 30, 35, 28, 32, 45],
    'salary': [50000, 60000, 75000, 55000, 80000, 90000],
    'department': ['Sales', 'IT', 'HR', 'Sales', 'IT', 'HR']
}

df = pd.DataFrame(data)

df = df.set_index('name')

print("DataFrame with single-level index:")
print(df)

result = df.query("age > 30 and salary < 85000")

data = {
    'A': [1, 2, 3, 4, 5, 6],
    'B': [10, 20, 30, 40, 50, 60],
    'C': ['x', 'y', 'x', 'y', 'x', 'y']
}

df = pd.DataFrame(data)

df = df.set_index([
    pd.Index(['group1', 'group1', 'group2', 'group2', 'group3', 'group3']),
    pd.Index(['a', 'b', 'a', 'b', 'a', 'b'])
])
df.index.names = ['group', 'subgroup']

result = df.query("A > 2 and B < 55")
```

### Output

The SMA adds the EWI `PNDSPY1012` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

data = {
    'name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve', 'Frank'],
    'age': [25, 30, 35, 28, 32, 45],
    'salary': [50000, 60000, 75000, 55000, 80000, 90000],
    'department': ['Sales', 'IT', 'HR', 'Sales', 'IT', 'HR']
}

df = pd.DataFrame(data)

df = df.set_index('name')

print("DataFrame with single-level index:")
print(df)

#EWI: PNDSPY1012 => pandas.core.frame.DataFrame.query does not support DataFrames that have a row MultiIndex. Check Snowpark pandas documentation for more detail.
result = df.query("age > 30 and salary < 85000")

data = {
    'A': [1, 2, 3, 4, 5, 6],
    'B': [10, 20, 30, 40, 50, 60],
    'C': ['x', 'y', 'x', 'y', 'x', 'y']
}

df = pd.DataFrame(data)

df = df.set_index([
    pd.Index(['group1', 'group1', 'group2', 'group2', 'group3', 'group3']),
    pd.Index(['a', 'b', 'a', 'b', 'a', 'b'])
])
df.index.names = ['group', 'subgroup']

#EWI: PNDSPY1012 => pandas.core.frame.DataFrame.query does not support DataFrames that have a row MultiIndex. Check Snowpark pandas documentation for more detail.
result = df.query("A > 2 and B < 55")
```

## Recommended fix

If the DataFrame contains a MultiIndex, validate the behavior of the `query()` method in Snowpark pandas. Ensure that the DataFrame structure is compatible with Snowpark pandas’ limitations, as MultiIndex rows aren’t supported. Consider restructuring the DataFrame to use a single-level index or alternative filtering methods.

---
title: PNDSPY1013
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1013.md
section: Migrations
---

# PNDSPY1013

**Message** `pandas.core.frame.DataFrame.aggregate` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.frame.DataFrame.aggregate](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.aggregate.html) usage.
Snowpark pandas currently has limitations with `DataFrame.aggregate`. Check [Supported Aggregation Functions](https://github.com/snowflakedb/snowpark-python/blob/main/docs/source/modin/supported/agg_supp.rst) for a list of supported functions.

## Scenario

An unsupported use of `pandas.core.frame.DataFrame.aggregate`.

### Input

The following example shows an unsupported use of `pandas.core.frame.DataFrame.aggregate`.

```python
import pandas as pd
import numpy as np

df = pd.DataFrame([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9],
                       [np.nan, np.nan, np.nan]],
                      columns=['A', 'B', 'C'])
df.aggregate(['sum', 'min'])
```

### Output

The SMA adds the EWI `PNDSPY1013` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

df = pd.DataFrame([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9],
                       [np.nan, np.nan, np.nan]],
                      columns=['A', 'B', 'C'])
#EWI: PNDSPY1013 => pandas.core.frame.DataFrame.aggregate does not support some combinations of parameters for specific aggregate functions. Check Snowpark pandas documentation for more detail.
df.aggregate(['sum', 'min'])
```

## Recommended fix

Since this is an error that applies to a range of partially supported aggregate functions, no specific fix applies to all cases. The appropriate action depends on the particular aggregate function in use.

Even though the element isn’t supported in some scenarios, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

---
title: PNDSPY1014
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1014.md
section: Migrations
---

# PNDSPY1014

**Message** `pandas.core.series.Series.aggregate` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.aggregate](https://pandas.pydata.org/docs/reference/api/pandas.Series.aggregate.html) usage.
Snowpark pandas currently has limitations with `Series.aggregate`. Check [Supported Aggregation Functions](https://github.com/snowflakedb/snowpark-python/blob/main/docs/source/modin/supported/agg_supp.rst) for a list of supported functions.

## Scenario

An unsupported use of `pandas.core.series.Series.aggregate`.

### Input

The following example shows an unsupported use of `pandas.core.series.Series.aggregate`.

```python
import pandas as pd
import numpy as np

s = pd.Series([1, 2, 3, 4])
s.aggregate(['min', 'max'])
```

### Output

The SMA adds the EWI `PNDSPY1014` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

df = pd.DataFrame([[1, 2, 3],
                       [4, 5, 6],
                       [7, 8, 9],
                       [np.nan, np.nan, np.nan]],
                      columns=['A', 'B', 'C'])
#EWI: PNDSPY1014 => pandas.core.series.Series.aggregate does not support some combinations of parameters for specific aggregate functions. Check Snowpark pandas documentation for more detail.
df.aggregate(['sum', 'min'])
```

## Recommended fix

Since this is an error that applies to a range of partially supported aggregate functions, no specific fix applies to all cases. The appropriate action depends on the particular aggregate function in use.

Even though the element isn’t supported in some scenarios, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

---
title: PNDSPY1015
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1015.md
section: Migrations
---

# PNDSPY1015

**Message** `pandas.core.frame.DataFrame.interpolate` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.frame.DataFrame.interpolate](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.interpolate.html) usage.
Snowpark pandas currently has limitations with `DataFrame.interpolate`. It isn’t supported if axis == 1, limit is set, limit_area is “outside”, or method isn’t “linear”, “bfill”, “backfill”, “ffill”, or “pad”. Also, limit_area=”inside” is supported only when method is linear.

## Scenario

An unsupported use of `pandas.core.frame.DataFrame.interpolate`.

### Input

The following example shows an unsupported use of `pandas.core.frame.DataFrame.interpolate`.

```python
import pandas as pd
import numpy as np

df = pd.DataFrame([(0.0, np.nan, -1.0, 1.0),
                   (np.nan, 2.0, np.nan, np.nan),
                   (2.0, 3.0, np.nan, 9.0),
                   (np.nan, 4.0, -4.0, 16.0)],
                  columns=list('abcd'))
df['d'].interpolate(method='polynomial', order=2)
```

### Output

The SMA adds the EWI `PNDSPY1015` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

df = pd.DataFrame([(0.0, np.nan, -1.0, 1.0),
                   (np.nan, 2.0, np.nan, np.nan),
                   (2.0, 3.0, np.nan, 9.0),
                   (np.nan, 4.0, -4.0, 16.0)],
                  columns=list('abcd'))
#EWI: PNDSPY1015 => pandas.core.frame.DataFrame.interpolate is not support if axis == 1, limit is set, limit_area is "outside", or method is not "linear", "bfill", "backfill", "ffill", or "pad". And limit_area="inside" is supported only when method is linear. Check Snowpark pandas documentation for more detail.
df['d'].interpolate(method='polynomial', order=2)
```

## Recommended fix

Since this is an error that applies to a range of partially supported parameters, no specific fix applies to all cases. The appropriate action depends on the particular parameter combination in use.

Even though the element isn’t supported in some scenarios, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

---
title: PNDSPY1016
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1016.md
section: Migrations
---

# PNDSPY1016

**Message** `pandas.core.series.Series.interpolate` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.series.Series.interpolate](https://pandas.pydata.org/docs/reference/api/pandas.Series.interpolate.html) usage.
Snowpark pandas currently has limitations with `Series.interpolate`. It isn’t supported if axis == 1, limit is set, limit_area is “outside”, or method isn’t “linear”, “bfill”, “backfill”, “ffill”, or “pad”. Also, limit_area=”inside” is supported only when method is linear.

## Scenario

An unsupported use of `pandas.core.series.Series.interpolate`.

### Input

The following example shows an unsupported use of `pandas.core.series.Series.interpolate`.

```python
import pandas as pd
import numpy as np

s = pd.Series([0, 2, np.nan, 8])
s.interpolate(method='polynomial', order=2)
```

### Output

The SMA adds the EWI `PNDSPY1016` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

df = pd.DataFrame([(0.0, np.nan, -1.0, 1.0),
                   (np.nan, 2.0, np.nan, np.nan),
                   (2.0, 3.0, np.nan, 9.0),
                   (np.nan, 4.0, -4.0, 16.0)],
                  columns=list('abcd'))
#EWI: PNDSPY1016 => pandas.core.series.Series.interpolate is not support if axis == 1, limit is set, limit_area is "outside", or method is not "linear", "bfill", "backfill", "ffill", or "pad". And limit_area="inside" is supported only when method is linear. Check Snowpark pandas documentation for more detail.
df['d'].interpolate(method='polynomial', order=2)
```

## Recommended fix

Since this is an error that applies to a range of partially supported parameters, no specific fix applies to all cases. The appropriate action depends on the particular parameter combination in use.

Even though the element isn’t supported in some scenarios, you might still find a solution or workaround. This issue code only means that the SMA can’t convert the element automatically.

---
title: PNDSPY1017
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1017.md
section: Migrations
---

# PNDSPY1017

**Message** `pandas.core.reshape.encoding.get_dummies` has a partial mapping because there is a not supported scenario in Snowpark pandas.

**Category** Warning

## Description

This issue appears when the SMA identifies a [pandas.core.reshape.encoding.get_dummies](https://pandas.pydata.org/docs/reference/api/pandas.get_dummies.html) usage.
Snowpark pandas currently has limitations with `pandas.get_dummies`. It’s supported if parameters “dummy_na” and “drop_first” are both false; otherwise it isn’t supported.

## Scenario

An unsupported use of `pandas.core.reshape.encoding.get_dummies`.

### Input

The following example shows an unsupported use of `pandas.core.reshape.encoding.get_dummies`.

```python
import pandas as pd
import numpy as np

s1 = ['a', 'b', np.nan]
pd.get_dummies(s1, dummy_na=True)

s2 = list('abcaa')
pd.get_dummies(s2, drop_first=True)
```

### Output

The SMA adds the EWI `PNDSPY1017` to the output code to indicate that it has a scenario not supported in Snowpark pandas.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd
import numpy as np

s1 = ['a', 'b', np.nan]
#EWI: PNDSPY1017 => pandas.core.reshape.encoding.get_dummies is supported if parameters "dummy_na" and "drop_first" are both false, otherwise it is not supported. Check Snowpark pandas documentation for more detail.
pd.get_dummies(s1, dummy_na=True)

s2 = list('abcaa')
#EWI: PNDSPY1017 => pandas.core.reshape.encoding.get_dummies is supported if parameters "dummy_na" and "drop_first" are both false, otherwise it is not supported. Check Snowpark pandas documentation for more detail.
pd.get_dummies(s2, drop_first=True)
```

## Recommended fix

### For the `dummy_na` parameter

This requires a manual adjustment:

1. Replace the `np.nan` value with an acceptable value such as `'np.nan'`.
2. Remove the use of the parameter `dummy_na`.
3. Rename the column `'np.nan'` to the original `np.nan` value.

To illustrate the recommended fix, here is the output code with the changes applied:

```python
s1 = s1.replace(np.nan, 'np.nan') if isinstance(s1, (pd.DataFrame, pd.Series)) else ['np.nan' if pd.isna(item) else item for item in s1]
pd.get_dummies(s1).rename(columns={'np.nan': np.nan})
```

### For the `drop_first` parameter

This requires a manual adjustment:

1. Remove the use of the parameter `drop_first`.
2. Remove the first column of the result (you can use the `iloc` indexer for it).

To illustrate the recommended fix, here is the output code with the changes applied:

```python
pd.get_dummies(s2).iloc[:, 1:]
```

---
title: PNDSPY1018
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1018.md
section: Migrations
---

# PNDSPY1018

**Message** < **element** > defaults to single node pandas execution via UDF/Sproc.

**Category** Warning

## Description

This issue appears when the SMA identifies a pandas element that is supported in Snowpark pandas but defaults to single node pandas execution via UDF/Sproc instead of distributed execution.

This means the operation will work correctly, but it may have performance implications for large datasets as it will be executed locally on a single node rather than being distributed across Snowflake’s compute resources.

## Scenario

A pandas element that defaults to single node pandas execution.

### Input

The following example shows a pandas element that defaults to single node execution.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.some_method()  # This method defaults to UDF/Sproc execution
```

### Output

The SMA adds the EWI `PNDSPY1018` to the output code to let you know that this element defaults to single node pandas execution.

```python
from snowflake.snowpark.modin import plugin
import modin.pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
#EWI: PNDSPY1018 => Element defaults to single node pandas execution via UDF/Sproc.
result = df.some_method()
```

## Recommended fix

No immediate fix is required. The code will execute correctly. However, be aware that:

* **Performance impact**: Operations may be slower for large datasets since they run on a single node instead of being distributed across Snowflake’s compute cluster.
* **Memory limitations**: Single node execution is subject to memory constraints of a single worker.
* **Scalability**: For very large datasets, consider alternative approaches that leverage distributed execution.

If performance is critical for this operation, consider:

1. Breaking down the operation into smaller, distributable steps
2. Using native Snowpark functions where available
3. Pre-filtering data to reduce the dataset size before applying the operation

---
title: PNDSPY1019
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1019.md
section: Migrations
---

# PNDSPY1019

**Message** Pandas < **pandas.core.arrays.datetimelike.DatelikeOps.strftime** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if date_format contains directives other than (%d, %m, %Y, %H, %M, %S, %f, %j, %X, %%).

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.strftime
```

### Output

The SMA adds the EWI `PNDSPY1019` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1019 => pandas.core.arrays.datetimelike.DatelikeOps.strftime has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.strftime
```

## Recommended fix

**Behavioral note**: N if date_format contains directives other than (%d, %m, %Y, %H, %M, %S, %f, %j, %X, %%).

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1020
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1020.md
section: Migrations
---

# PNDSPY1020

**Message** Pandas < **pandas.core.arrays.datetimelike.TimelikeOps.ceil** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ambiguous or nonexistent are set to a non-default value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.ceil
```

### Output

The SMA adds the EWI `PNDSPY1020` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1020 => pandas.core.arrays.datetimelike.TimelikeOps.ceil has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.ceil
```

## Recommended fix

**Behavioral note**: N if ambiguous or nonexistent are set to a non-default value.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1021
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1021.md
section: Migrations
---

# PNDSPY1021

**Message** Pandas < **pandas.core.arrays.datetimelike.TimelikeOps.floor** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ambiguous or nonexistent are set to a non-default value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.floor
```

### Output

The SMA adds the EWI `PNDSPY1021` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1021 => pandas.core.arrays.datetimelike.TimelikeOps.floor has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.floor
```

## Recommended fix

**Behavioral note**: N if ambiguous or nonexistent are set to a non-default value.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1022
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1022.md
section: Migrations
---

# PNDSPY1022

**Message** Pandas < **pandas.core.arrays.datetimelike.TimelikeOps.round** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ambiguous or nonexistent are set to a non-default value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.round
```

### Output

The SMA adds the EWI `PNDSPY1022` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1022 => pandas.core.arrays.datetimelike.TimelikeOps.round has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.round
```

## Recommended fix

**Behavioral note**: N if ambiguous or nonexistent are set to a non-default value.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1023
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1023.md
section: Migrations
---

# PNDSPY1023

**Message** Pandas < **pandas.core.arrays.datetimes.DatetimeArray.day_name** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if locale is set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.day_name
```

### Output

The SMA adds the EWI `PNDSPY1023` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1023 => pandas.core.arrays.datetimes.DatetimeArray.day_name has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.day_name
```

## Recommended fix

**Behavioral note**: N if locale is set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1024
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1024.md
section: Migrations
---

# PNDSPY1024

**Message** Pandas < **pandas.core.arrays.datetimes.DatetimeArray.month_name** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if locale is set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.month_name
```

### Output

The SMA adds the EWI `PNDSPY1024` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1024 => pandas.core.arrays.datetimes.DatetimeArray.month_name has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.month_name
```

## Recommended fix

**Behavioral note**: N if locale is set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1025
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1025.md
section: Migrations
---

# PNDSPY1025

**Message** Pandas < **pandas.core.arrays.datetimes.DatetimeArray.tz_convert** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.tz_convert
```

### Output

The SMA adds the EWI `PNDSPY1025` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1025 => pandas.core.arrays.datetimes.DatetimeArray.tz_convert has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.tz_convert
```

## Recommended fix

**Timezone handling difference**: N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1026
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1026.md
section: Migrations
---

# PNDSPY1026

**Message** Pandas < **pandas.core.arrays.datetimes.DatetimeArray.tz_localize** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ambiguous or nonexistent are set to a non-default value. N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.tz_localize
```

### Output

The SMA adds the EWI `PNDSPY1026` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1026 => pandas.core.arrays.datetimes.DatetimeArray.tz_localize has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(pd.date_range('2023-01-01', periods=3))
result = s.dt.tz_localize
```

## Recommended fix

**Timezone handling difference**: N if ambiguous or nonexistent are set to a non-default value. N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1027
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1027.md
section: Migrations
---

# PNDSPY1027

**Message** Pandas < **pandas.core.base.IndexOpsMixin.argmax** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.base.IndexOpsMixin.argmax`
* `pandas.core.series.Series.argmax`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the Series has a MultiIndex index.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.argmax()
```

### Output

The SMA adds the EWI `PNDSPY1027` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1027 => pandas.core.base.IndexOpsMixin.argmax has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.argmax()
```

## Recommended fix

**Behavioral note**: N if the Series has a MultiIndex index.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1028
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1028.md
section: Migrations
---

# PNDSPY1028

**Message** Pandas < **pandas.core.base.IndexOpsMixin.argmin** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.base.IndexOpsMixin.argmin`
* `pandas.core.series.Series.argmin`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the Series has a MultiIndex index.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.argmin()
```

### Output

The SMA adds the EWI `PNDSPY1028` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1028 => pandas.core.base.IndexOpsMixin.argmin has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.argmin()
```

## Recommended fix

**Behavioral note**: N if the Series has a MultiIndex index.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1029
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1029.md
section: Migrations
---

# PNDSPY1029

**Message** Pandas < **pandas.core.base.IndexOpsMixin.value_counts** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.base.IndexOpsMixin.value_counts`
* `pandas.core.indexes.base.Index.value_counts`
* `pandas.core.series.Series.value_counts`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `bins`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.value_counts()
```

### Output

The SMA adds the EWI `PNDSPY1029` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1029 => pandas.core.base.IndexOpsMixin.value_counts has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.value_counts()
```

## Recommended fix

The parameter `bins` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().value_counts(bins=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1030
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1030.md
section: Migrations
---

# PNDSPY1030

**Message** Pandas < **pandas.core.frame.DataFrame.T** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** D if any column name is not str or tuple of str.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.T
```

### Output

The SMA adds the EWI `PNDSPY1030` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1030 => pandas.core.frame.DataFrame.T has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.T
```

## Recommended fix

**NULL/NaN handling difference**: D if any column name is not str or tuple of str.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1031
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1031.md
section: Migrations
---

# PNDSPY1031

**Message** Pandas < **pandas.core.frame.DataFrame.__dataframe__** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for columns of type Timedelta and columns containing list objects.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.__dataframe__(df)
```

### Output

The SMA adds the EWI `PNDSPY1031` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1031 => pandas.core.frame.DataFrame.__dataframe__ has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.__dataframe__(df)
```

## Recommended fix

**Data type consideration**: N for columns of type Timedelta and columns containing list objects.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1032
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1032.md
section: Migrations
---

# PNDSPY1032

**Message** Pandas < **pandas.core.frame.DataFrame.add** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.add()
```

### Output

The SMA adds the EWI `PNDSPY1032` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1032 => pandas.core.frame.DataFrame.add has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.add()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().add(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1033
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1033.md
section: Migrations
---

# PNDSPY1033

**Message** Pandas < **pandas.core.frame.DataFrame.align** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.align`
* `pandas.core.generic.NDFrame.align`
* `pandas.core.series.Series.align`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `copy`, `level`, `fill_value`

**Reason:** N for MultiIndex, for deprecated parameters method, limit, fill_axis, broadcast_axis, or if fill_value is not default of np.nan.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.align()
```

### Output

The SMA adds the EWI `PNDSPY1033` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1033 => pandas.core.frame.DataFrame.align has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.align()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `copy`, `level`, `fill_value`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.align(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**NULL/NaN handling difference**: N for MultiIndex, for deprecated parameters method, limit, fill_axis, broadcast_axis, or if fill_value is not default of np.nan.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1034
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1034.md
section: Migrations
---

# PNDSPY1034

**Message** Pandas < **pandas.core.frame.DataFrame.all** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.all()
```

### Output

The SMA adds the EWI `PNDSPY1034` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1034 => pandas.core.frame.DataFrame.all has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.all()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1035
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1035.md
section: Migrations
---

# PNDSPY1035

**Message** Pandas < **pandas.core.frame.DataFrame.any** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.any()
```

### Output

The SMA adds the EWI `PNDSPY1035` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1035 => pandas.core.frame.DataFrame.any has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.any()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1036
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1036.md
section: Migrations
---

# PNDSPY1036

**Message** Pandas < **pandas.core.frame.DataFrame.applymap** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if na_action == “ignore”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.applymap()
```

### Output

The SMA adds the EWI `PNDSPY1036` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1036 => pandas.core.frame.DataFrame.applymap has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.applymap()
```

## Recommended fix

**NULL/NaN handling difference**: N if na_action == “ignore”.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1037
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1037.md
section: Migrations
---

# PNDSPY1037

**Message** Pandas < **pandas.core.frame.DataFrame.asfreq** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.asfreq`
* `pandas.core.generic.NDFrame.asfreq`
* `pandas.core.series.Series.asfreq`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `how`, `normalize`, `fill_value`

**Reason:** Only DatetimeIndex is supported and its freq will be lost. Only rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.asfreq()
```

### Output

The SMA adds the EWI `PNDSPY1037` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1037 => pandas.core.frame.DataFrame.asfreq has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.asfreq()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `how`, `normalize`, `fill_value`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.asfreq(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: Only DatetimeIndex is supported and its freq will be lost. Only rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1038
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1038.md
section: Migrations
---

# PNDSPY1038

**Message** Pandas < **pandas.core.frame.DataFrame.astype** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.astype`
* `pandas.core.generic.NDFrame.astype`
* `pandas.core.series.Series.astype`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if from string to datetime/timedelta or errors == “ignore”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.astype()
```

### Output

The SMA adds the EWI `PNDSPY1038` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1038 => pandas.core.frame.DataFrame.astype has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.astype()
```

## Recommended fix

**Behavioral note**: N if from string to datetime/timedelta or errors == “ignore”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1039
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1039.md
section: Migrations
---

# PNDSPY1039

**Message** Pandas < **pandas.core.frame.DataFrame.at** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.at`
* `pandas.core.indexing.IndexingMixin.at`
* `pandas.core.series.Series.at`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for set with MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.at
```

### Output

The SMA adds the EWI `PNDSPY1039` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1039 => pandas.core.frame.DataFrame.at has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.at
```

## Recommended fix

**Behavioral note**: N for set with MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1040
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1040.md
section: Migrations
---

# PNDSPY1040

**Message** Pandas < **pandas.core.frame.DataFrame.backfill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.backfill`
* `pandas.core.generic.NDFrame.backfill`
* `pandas.core.series.Series.backfill`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if param downcast is set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.backfill()
```

### Output

The SMA adds the EWI `PNDSPY1040` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1040 => pandas.core.frame.DataFrame.backfill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.backfill()
```

## Recommended fix

**Behavioral note**: N if param downcast is set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1041
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1041.md
section: Migrations
---

# PNDSPY1041

**Message** Pandas < **pandas.core.frame.DataFrame.bfill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.bfill`
* `pandas.core.generic.NDFrame.bfill`
* `pandas.core.series.Series.bfill`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if param downcast is set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.bfill()
```

### Output

The SMA adds the EWI `PNDSPY1041` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1041 => pandas.core.frame.DataFrame.bfill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.bfill()
```

## Recommended fix

**Behavioral note**: N if param downcast is set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1042
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1042.md
section: Migrations
---

# PNDSPY1042

**Message** Pandas < **pandas.core.frame.DataFrame.compare** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `align_axis`, `keep_shape`, `keep_equal`, `result_names`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.compare()
```

### Output

The SMA adds the EWI `PNDSPY1042` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1042 => pandas.core.frame.DataFrame.compare has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.compare()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `align_axis`, `keep_shape`, `keep_equal`, `result_names`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.compare(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1043
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1043.md
section: Migrations
---

# PNDSPY1043

**Message** Pandas < **pandas.core.frame.DataFrame.corr** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if method is not ‘pearson’.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.corr()
```

### Output

The SMA adds the EWI `PNDSPY1043` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1043 => pandas.core.frame.DataFrame.corr has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.corr()
```

## Recommended fix

**Behavioral note**: N if method is not ‘pearson’.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1044
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1044.md
section: Migrations
---

# PNDSPY1044

**Message** Pandas < **pandas.core.frame.DataFrame.cumsum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if values are numeric.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.cumsum()
```

### Output

The SMA adds the EWI `PNDSPY1044` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1044 => pandas.core.frame.DataFrame.cumsum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.cumsum()
```

## Recommended fix

**Behavioral note**: Y if values are numeric.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1045
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1045.md
section: Migrations
---

# PNDSPY1045

**Message** Pandas < **pandas.core.frame.DataFrame.div** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.div()
```

### Output

The SMA adds the EWI `PNDSPY1045` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1045 => pandas.core.frame.DataFrame.div has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.div()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().div(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1046
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1046.md
section: Migrations
---

# PNDSPY1046

**Message** Pandas < **pandas.core.frame.DataFrame.divide** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.divide()
```

### Output

The SMA adds the EWI `PNDSPY1046` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1046 => pandas.core.frame.DataFrame.divide has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.divide()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().divide(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1047
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1047.md
section: Migrations
---

# PNDSPY1047

**Message** Pandas < **pandas.core.frame.DataFrame.dropna** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis == 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.dropna()
```

### Output

The SMA adds the EWI `PNDSPY1047` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1047 => pandas.core.frame.DataFrame.dropna has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.dropna()
```

## Recommended fix

**Behavioral note**: N if axis == 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1048
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1048.md
section: Migrations
---

# PNDSPY1048

**Message** Pandas < **pandas.core.frame.DataFrame.eq** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.eq()
```

### Output

The SMA adds the EWI `PNDSPY1048` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1048 => pandas.core.frame.DataFrame.eq has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.eq()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().eq(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1049
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1049.md
section: Migrations
---

# PNDSPY1049

**Message** Pandas < **pandas.core.frame.DataFrame.eval** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** No support for dataframes with a row MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.eval()
```

### Output

The SMA adds the EWI `PNDSPY1049` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1049 => pandas.core.frame.DataFrame.eval has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.eval()
```

## Recommended fix

**Behavioral note**: No support for dataframes with a row MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1050
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1050.md
section: Migrations
---

# PNDSPY1050

**Message** Pandas < **pandas.core.frame.DataFrame.expanding** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.expanding`
* `pandas.core.generic.NDFrame.expanding`
* `pandas.core.series.Series.expanding`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `method is ignored`

**Reason:** N if axis = 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.expanding()
```

### Output

The SMA adds the EWI `PNDSPY1050` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1050 => pandas.core.frame.DataFrame.expanding has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.expanding()
```

## Recommended fix

The parameter `method is ignored` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().expanding(method is ignored=value)

**Behavioral note**: N if axis = 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1051
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1051.md
section: Migrations
---

# PNDSPY1051

**Message** Pandas < **pandas.core.frame.DataFrame.ffill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.ffill`
* `pandas.core.generic.NDFrame.ffill`
* `pandas.core.series.Series.ffill`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if parameter downcast is set. limit parameter only supported if method parameter is used.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ffill()
```

### Output

The SMA adds the EWI `PNDSPY1051` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1051 => pandas.core.frame.DataFrame.ffill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ffill()
```

## Recommended fix

**Behavioral note**: N if parameter downcast is set. limit parameter only supported if method parameter is used.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1052
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1052.md
section: Migrations
---

# PNDSPY1052

**Message** Pandas < **pandas.core.frame.DataFrame.fillna** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.fillna`
* `pandas.core.generic.NDFrame.fillna`
* `pandas.core.series.Series.fillna`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See ffill.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.fillna()
```

### Output

The SMA adds the EWI `PNDSPY1052` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1052 => pandas.core.frame.DataFrame.fillna has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.fillna()
```

## Recommended fix

**Behavioral note**: See ffill.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1053
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1053.md
section: Migrations
---

# PNDSPY1053

**Message** Pandas < **pandas.core.frame.DataFrame.floordiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.floordiv()
```

### Output

The SMA adds the EWI `PNDSPY1053` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1053 => pandas.core.frame.DataFrame.floordiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.floordiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().floordiv(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1054
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1054.md
section: Migrations
---

# PNDSPY1054

**Message** Pandas < **pandas.core.frame.DataFrame.from_records** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if parameter data is set to a DataFrame.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.from_records()
```

### Output

The SMA adds the EWI `PNDSPY1054` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1054 => pandas.core.frame.DataFrame.from_records has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.from_records()
```

## Recommended fix

**Behavioral note**: N if parameter data is set to a DataFrame.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1055
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1055.md
section: Migrations
---

# PNDSPY1055

**Message** Pandas < **pandas.core.frame.DataFrame.ge** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ge()
```

### Output

The SMA adds the EWI `PNDSPY1055` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1055 => pandas.core.frame.DataFrame.ge has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ge()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().ge(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1056
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1056.md
section: Migrations
---

# PNDSPY1056

**Message** Pandas < **pandas.core.frame.DataFrame.groupby** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `observed is ignored since Categoricals are not implemented yet`

**Reason:** Y, support axis == 0 and by is column label or Series from the current DataFrame, or a pd.Grouper object; otherwise N. If a pd.Grouper object is passed, then only the default values of the sort, closed, label, and convention arguments are supported. The origin argument currently supports “start_day” and “start”. Note that supported functions are agg, count, cumcount, cummax, cummin, cumsum, first, last, max, mean, median, min, quantile, shift, size, std, sum, and var. Otherwise N.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.groupby()
```

### Output

The SMA adds the EWI `PNDSPY1056` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1056 => pandas.core.frame.DataFrame.groupby has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.groupby()
```

## Recommended fix

The parameter `observed is ignored since Categoricals are not implemented yet` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().groupby(observed is ignored since Categoricals are not implemented yet=value)

**Behavioral note**: Y, support axis == 0 and by is column label or Series from the current DataFrame, or a pd.Grouper object; otherwise N. If a pd.Grouper object is passed, then only the default values of the sort, closed, label, and convention arguments are supported. The origin argument currently supports “start_day” and “start”. Note that supported functions are agg, count, cumcount, cummax, cummin, cumsum, first, last, max, mean, median, min, quantile, shift, size, std, sum, and var. Otherwise N.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1057
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1057.md
section: Migrations
---

# PNDSPY1057

**Message** Pandas < **pandas.core.frame.DataFrame.gt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.gt()
```

### Output

The SMA adds the EWI `PNDSPY1057` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1057 => pandas.core.frame.DataFrame.gt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.gt()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().gt(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1058
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1058.md
section: Migrations
---

# PNDSPY1058

**Message** Pandas < **pandas.core.frame.DataFrame.idxmax** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for MultiIndex dataframes.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.idxmax()
```

### Output

The SMA adds the EWI `PNDSPY1058` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1058 => pandas.core.frame.DataFrame.idxmax has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.idxmax()
```

## Recommended fix

**Behavioral note**: N for MultiIndex dataframes.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1059
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1059.md
section: Migrations
---

# PNDSPY1059

**Message** Pandas < **pandas.core.frame.DataFrame.idxmin** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for MultiIndex dataframes.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.idxmin()
```

### Output

The SMA adds the EWI `PNDSPY1059` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1059 => pandas.core.frame.DataFrame.idxmin has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.idxmin()
```

## Recommended fix

**Behavioral note**: N for MultiIndex dataframes.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1060
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1060.md
section: Migrations
---

# PNDSPY1060

**Message** Pandas < **pandas.core.frame.DataFrame.info** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Index is different, zero bytes reported for memory.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.info()
```

### Output

The SMA adds the EWI `PNDSPY1060` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1060 => pandas.core.frame.DataFrame.info has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.info()
```

## Recommended fix

**Behavioral note**: Index is different, zero bytes reported for memory.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1061
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1061.md
section: Migrations
---

# PNDSPY1061

**Message** Pandas < **pandas.core.frame.DataFrame.join** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if given the validate param.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.join()
```

### Output

The SMA adds the EWI `PNDSPY1061` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1061 => pandas.core.frame.DataFrame.join has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.join()
```

## Recommended fix

**Behavioral note**: N if given the validate param.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1062
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1062.md
section: Migrations
---

# PNDSPY1062

**Message** Pandas < **pandas.core.frame.DataFrame.le** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.le()
```

### Output

The SMA adds the EWI `PNDSPY1062` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1062 => pandas.core.frame.DataFrame.le has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.le()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().le(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1063
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1063.md
section: Migrations
---

# PNDSPY1063

**Message** Pandas < **pandas.core.frame.DataFrame.loc** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.loc`
* `pandas.core.indexing.IndexingMixin.loc`
* `pandas.core.series.Series.loc`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for set with MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.loc
```

### Output

The SMA adds the EWI `PNDSPY1063` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1063 => pandas.core.frame.DataFrame.loc has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.loc
```

## Recommended fix

**Behavioral note**: N for set with MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1064
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1064.md
section: Migrations
---

# PNDSPY1064

**Message** Pandas < **pandas.core.frame.DataFrame.lt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.lt()
```

### Output

The SMA adds the EWI `PNDSPY1064` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1064 => pandas.core.frame.DataFrame.lt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.lt()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().lt(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1065
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1065.md
section: Migrations
---

# PNDSPY1065

**Message** Pandas < **pandas.core.frame.DataFrame.map** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if na_action == “ignore”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.map()
```

### Output

The SMA adds the EWI `PNDSPY1065` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1065 => pandas.core.frame.DataFrame.map has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.map()
```

## Recommended fix

**NULL/NaN handling difference**: N if na_action == “ignore”.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1066
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1066.md
section: Migrations
---

# PNDSPY1066

**Message** Pandas < **pandas.core.frame.DataFrame.mask** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.mask`
* `pandas.core.generic.NDFrame.mask`
* `pandas.core.series.Series.mask`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if given axis when other is a DataFrame or level parameters; N if cond or other is Callable.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mask()
```

### Output

The SMA adds the EWI `PNDSPY1066` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1066 => pandas.core.frame.DataFrame.mask has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mask()
```

## Recommended fix

**Behavioral note**: N if given axis when other is a DataFrame or level parameters; N if cond or other is Callable.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1067
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1067.md
section: Migrations
---

# PNDSPY1067

**Message** Pandas < **pandas.core.frame.DataFrame.melt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `col_level`

**Reason:** N when columns are MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.melt()
```

### Output

The SMA adds the EWI `PNDSPY1067` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1067 => pandas.core.frame.DataFrame.melt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.melt()
```

## Recommended fix

The parameter `col_level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().melt(col_level=value)

**Behavioral note**: N when columns are MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1068
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1068.md
section: Migrations
---

# PNDSPY1068

**Message** Pandas < **pandas.core.frame.DataFrame.merge** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if param validate is given.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.merge()
```

### Output

The SMA adds the EWI `PNDSPY1068` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1068 => pandas.core.frame.DataFrame.merge has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.merge()
```

## Recommended fix

**Behavioral note**: N if param validate is given.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1069
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1069.md
section: Migrations
---

# PNDSPY1069

**Message** Pandas < **pandas.core.frame.DataFrame.mod** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mod()
```

### Output

The SMA adds the EWI `PNDSPY1069` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1069 => pandas.core.frame.DataFrame.mod has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mod()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().mod(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1070
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1070.md
section: Migrations
---

# PNDSPY1070

**Message** Pandas < **pandas.core.frame.DataFrame.mul** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mul()
```

### Output

The SMA adds the EWI `PNDSPY1070` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1070 => pandas.core.frame.DataFrame.mul has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.mul()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().mul(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1071
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1071.md
section: Migrations
---

# PNDSPY1071

**Message** Pandas < **pandas.core.frame.DataFrame.multiply** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.multiply()
```

### Output

The SMA adds the EWI `PNDSPY1071` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1071 => pandas.core.frame.DataFrame.multiply has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.multiply()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().multiply(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1072
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1072.md
section: Migrations
---

# PNDSPY1072

**Message** Pandas < **pandas.core.frame.DataFrame.ne** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ne()
```

### Output

The SMA adds the EWI `PNDSPY1072` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1072 => pandas.core.frame.DataFrame.ne has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.ne()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().ne(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1073
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1073.md
section: Migrations
---

# PNDSPY1073

**Message** Pandas < **pandas.core.frame.DataFrame.nlargest** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if keep == “all”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nlargest()
```

### Output

The SMA adds the EWI `PNDSPY1073` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1073 => pandas.core.frame.DataFrame.nlargest has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nlargest()
```

## Recommended fix

**Behavioral note**: N if keep == “all”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1074
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1074.md
section: Migrations
---

# PNDSPY1074

**Message** Pandas < **pandas.core.frame.DataFrame.nsmallest** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if keep == “all”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nsmallest()
```

### Output

The SMA adds the EWI `PNDSPY1074` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1074 => pandas.core.frame.DataFrame.nsmallest has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nsmallest()
```

## Recommended fix

**Behavioral note**: N if keep == “all”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1075
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1075.md
section: Migrations
---

# PNDSPY1075

**Message** Pandas < **pandas.core.frame.DataFrame.nunique** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis == 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nunique()
```

### Output

The SMA adds the EWI `PNDSPY1075` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1075 => pandas.core.frame.DataFrame.nunique has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.nunique()
```

## Recommended fix

**Behavioral note**: N if axis == 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1076
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1076.md
section: Migrations
---

# PNDSPY1076

**Message** Pandas < **pandas.core.frame.DataFrame.pad** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.pad`
* `pandas.core.generic.NDFrame.pad`
* `pandas.core.series.Series.pad`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See ffill.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pad()
```

### Output

The SMA adds the EWI `PNDSPY1076` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1076 => pandas.core.frame.DataFrame.pad has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pad()
```

## Recommended fix

**Behavioral note**: See ffill.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1077
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1077.md
section: Migrations
---

# PNDSPY1077

**Message** Pandas < **pandas.core.frame.DataFrame.pct_change** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.pct_change`
* `pandas.core.generic.NDFrame.pct_change`
* `pandas.core.series.Series.pct_change`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `limit`, `freq`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pct_change()
```

### Output

The SMA adds the EWI `PNDSPY1077` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1077 => pandas.core.frame.DataFrame.pct_change has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pct_change()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `limit`, `freq`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.pct_change(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1078
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1078.md
section: Migrations
---

# PNDSPY1078

**Message** Pandas < **pandas.core.frame.DataFrame.pivot** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See pivot_table.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pivot()
```

### Output

The SMA adds the EWI `PNDSPY1078` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1078 => pandas.core.frame.DataFrame.pivot has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pivot()
```

## Recommended fix

**Behavioral note**: See pivot_table.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1079
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1079.md
section: Migrations
---

# PNDSPY1079

**Message** Pandas < **pandas.core.frame.DataFrame.pivot_table** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `observed`, `sort`

**Reason:** N if index, columns, or values is not str, list of str, or None; or MultiIndex; or any argfunc is not “count”, “mean”, “min”, “max”, or “sum”. N if index is None, margins is True and aggfunc is “count” or “mean” or a dictionary. N if index is None and aggfunc is a dictionary containing lists of aggfuncs to apply. N if aggfunc is an unsupported aggregation function <agg_supp.html>_ for pivot.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pivot_table()
```

### Output

The SMA adds the EWI `PNDSPY1079` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1079 => pandas.core.frame.DataFrame.pivot_table has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pivot_table()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `observed`, `sort`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.pivot_table(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**NULL/NaN handling difference**: N if index, columns, or values is not str, list of str, or None; or MultiIndex; or any argfunc is not “count”, “mean”, “min”, “max”, or “sum”. N if index is None, margins is True and aggfunc is “count” or “mean” or a dictionary. N if index is None and aggfunc is a dictionary containing lists of aggfuncs to apply. N if aggfunc is an unsupported aggregation function <agg_supp.html>_ for pivot.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1080
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1080.md
section: Migrations
---

# PNDSPY1080

**Message** Pandas < **pandas.core.frame.DataFrame.pow** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pow()
```

### Output

The SMA adds the EWI `PNDSPY1080` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1080 => pandas.core.frame.DataFrame.pow has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.pow()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().pow(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1081
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1081.md
section: Migrations
---

# PNDSPY1081

**Message** Pandas < **pandas.core.frame.DataFrame.quantile** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if axis == 0, and interpolation is “linear” or “nearest”, and method is “single”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.quantile()
```

### Output

The SMA adds the EWI `PNDSPY1081` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1081 => pandas.core.frame.DataFrame.quantile has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.quantile()
```

## Recommended fix

**Behavioral note**: Y if axis == 0, and interpolation is “linear” or “nearest”, and method is “single”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1082
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1082.md
section: Migrations
---

# PNDSPY1082

**Message** Pandas < **pandas.core.frame.DataFrame.radd** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.radd()
```

### Output

The SMA adds the EWI `PNDSPY1082` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1082 => pandas.core.frame.DataFrame.radd has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.radd()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().radd(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1083
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1083.md
section: Migrations
---

# PNDSPY1083

**Message** Pandas < **pandas.core.frame.DataFrame.rank** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.rank`
* `pandas.core.generic.NDFrame.rank`
* `pandas.core.series.Series.rank`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis == 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rank()
```

### Output

The SMA adds the EWI `PNDSPY1083` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1083 => pandas.core.frame.DataFrame.rank has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rank()
```

## Recommended fix

**Behavioral note**: N if axis == 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1084
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1084.md
section: Migrations
---

# PNDSPY1084

**Message** Pandas < **pandas.core.frame.DataFrame.rdiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rdiv()
```

### Output

The SMA adds the EWI `PNDSPY1084` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1084 => pandas.core.frame.DataFrame.rdiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rdiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rdiv(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1085
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1085.md
section: Migrations
---

# PNDSPY1085

**Message** Pandas < **pandas.core.frame.DataFrame.reindex** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis is MultiIndex or method is nearest.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.reindex()
```

### Output

The SMA adds the EWI `PNDSPY1085` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1085 => pandas.core.frame.DataFrame.reindex has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.reindex()
```

## Recommended fix

**Behavioral note**: N if axis is MultiIndex or method is nearest.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1086
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1086.md
section: Migrations
---

# PNDSPY1086

**Message** Pandas < **pandas.core.frame.DataFrame.rename** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if mapper is callable or the series has multiindex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rename()
```

### Output

The SMA adds the EWI `PNDSPY1086` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1086 => pandas.core.frame.DataFrame.rename has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rename()
```

## Recommended fix

**Behavioral note**: N if mapper is callable or the series has multiindex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1087
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1087.md
section: Migrations
---

# PNDSPY1087

**Message** Pandas < **pandas.core.frame.DataFrame.replace** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.replace`
* `pandas.core.generic.NDFrame.replace`
* `pandas.core.series.Series.replace`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `copy is ignored`, `method`, `limit`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.replace()
```

### Output

The SMA adds the EWI `PNDSPY1087` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1087 => pandas.core.frame.DataFrame.replace has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.replace()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `copy is ignored`, `method`, `limit`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.replace(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1088
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1088.md
section: Migrations
---

# PNDSPY1088

**Message** Pandas < **pandas.core.frame.DataFrame.resample** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.resample`
* `pandas.core.generic.NDFrame.resample`
* `pandas.core.series.Series.resample`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis`, `label`, `convention`, `kind`, , `level`, `origin`, , `offset`, `group_keys`

**Reason:** Only DatetimeIndex is supported and its freq will be lost. rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported. rule frequencies ‘W’, ‘ME’, and ‘YE’ are supported with closed = “left”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.resample()
```

### Output

The SMA adds the EWI `PNDSPY1088` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1088 => pandas.core.frame.DataFrame.resample has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.resample()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `axis`, `label`, `convention`, `kind`, `level`, `origin`, `offset`, `group_keys`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.resample(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: Only DatetimeIndex is supported and its freq will be lost. rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported. rule frequencies ‘W’, ‘ME’, and ‘YE’ are supported with closed = “left”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1089
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1089.md
section: Migrations
---

# PNDSPY1089

**Message** Pandas < **pandas.core.frame.DataFrame.rfloordiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rfloordiv()
```

### Output

The SMA adds the EWI `PNDSPY1089` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1089 => pandas.core.frame.DataFrame.rfloordiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rfloordiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rfloordiv(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1090
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1090.md
section: Migrations
---

# PNDSPY1090

**Message** Pandas < **pandas.core.frame.DataFrame.rmod** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rmod()
```

### Output

The SMA adds the EWI `PNDSPY1090` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1090 => pandas.core.frame.DataFrame.rmod has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rmod()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rmod(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1091
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1091.md
section: Migrations
---

# PNDSPY1091

**Message** Pandas < **pandas.core.frame.DataFrame.rmul** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rmul()
```

### Output

The SMA adds the EWI `PNDSPY1091` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1091 => pandas.core.frame.DataFrame.rmul has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rmul()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rmul(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1092
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1092.md
section: Migrations
---

# PNDSPY1092

**Message** Pandas < **pandas.core.frame.DataFrame.rolling** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.rolling`
* `pandas.core.generic.NDFrame.rolling`
* `pandas.core.series.Series.rolling`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `method is ignored`, `step`, `win_type`, `closed`, `on`

**Reason:** N for non-integer window, axis = 1, or min_periods = 0.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rolling()
```

### Output

The SMA adds the EWI `PNDSPY1092` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1092 => pandas.core.frame.DataFrame.rolling has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rolling()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `method is ignored`, `step`, `win_type`, `closed`, `on`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.rolling(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N for non-integer window, axis = 1, or min_periods = 0.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1093
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1093.md
section: Migrations
---

# PNDSPY1093

**Message** Pandas < **pandas.core.frame.DataFrame.round** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if decimals is Series.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.round()
```

### Output

The SMA adds the EWI `PNDSPY1093` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1093 => pandas.core.frame.DataFrame.round has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.round()
```

## Recommended fix

**Behavioral note**: N if decimals is Series.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1094
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1094.md
section: Migrations
---

# PNDSPY1094

**Message** Pandas < **pandas.core.frame.DataFrame.rpow** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rpow()
```

### Output

The SMA adds the EWI `PNDSPY1094` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1094 => pandas.core.frame.DataFrame.rpow has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rpow()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rpow(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1095
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1095.md
section: Migrations
---

# PNDSPY1095

**Message** Pandas < **pandas.core.frame.DataFrame.rsub** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rsub()
```

### Output

The SMA adds the EWI `PNDSPY1095` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1095 => pandas.core.frame.DataFrame.rsub has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rsub()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rsub(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1096
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1096.md
section: Migrations
---

# PNDSPY1096

**Message** Pandas < **pandas.core.frame.DataFrame.rtruediv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rtruediv()
```

### Output

The SMA adds the EWI `PNDSPY1096` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1096 => pandas.core.frame.DataFrame.rtruediv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.rtruediv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rtruediv(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1097
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1097.md
section: Migrations
---

# PNDSPY1097

**Message** Pandas < **pandas.core.frame.DataFrame.sample** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.sample`
* `pandas.core.generic.NDFrame.sample`
* `pandas.core.series.Series.sample`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if weights is specified when axis = 0, or if random_state is not either an integer or None. Setting random_state to a value other than None may slow down this method because the sample implementation will use a sort instead of the Snowflake warehouse’s built-in SAMPLE construct.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sample()
```

### Output

The SMA adds the EWI `PNDSPY1097` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1097 => pandas.core.frame.DataFrame.sample has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sample()
```

## Recommended fix

**Performance consideration**: N if weights is specified when axis = 0, or if random_state is not either an integer or None. Setting random_state to a value other than None may slow down this method because the sample implementation will use a sort instead of the Snowflake warehouse’s built-in SAMPLE construct.

For better performance:

* Filter data before applying this operation to reduce data volume
* Consider breaking the operation into smaller chunks
* Use Snowflake’s native SQL functions via `session.sql()` for large datasets

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1098
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1098.md
section: Migrations
---

# PNDSPY1098

**Message** Pandas < **pandas.core.frame.DataFrame.shift** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `freq`

**Reason:** No support for freq != None.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.shift()
```

### Output

The SMA adds the EWI `PNDSPY1098` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1098 => pandas.core.frame.DataFrame.shift has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.shift()
```

## Recommended fix

The parameter `freq` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().shift(freq=value)

**Behavioral note**: No support for freq != None.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1099
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1099.md
section: Migrations
---

# PNDSPY1099

**Message** Pandas < **pandas.core.frame.DataFrame.skew** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis == 1 or skipna == False or numeric_only=False.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.skew()
```

### Output

The SMA adds the EWI `PNDSPY1099` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1099 => pandas.core.frame.DataFrame.skew has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.skew()
```

## Recommended fix

**NULL/NaN handling difference**: N if axis == 1 or skipna == False or numeric_only=False.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1100
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1100.md
section: Migrations
---

# PNDSPY1100

**Message** Pandas < **pandas.core.frame.DataFrame.sort_index** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `key`

**Reason:** N if given the key param. N if axis == 1, or MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sort_index()
```

### Output

The SMA adds the EWI `PNDSPY1100` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1100 => pandas.core.frame.DataFrame.sort_index has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sort_index()
```

## Recommended fix

The parameter `key` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().sort_index(key=value)

**Behavioral note**: N if given the key param. N if axis == 1, or MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1101
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1101.md
section: Migrations
---

# PNDSPY1101

**Message** Pandas < **pandas.core.frame.DataFrame.sort_values** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `key`, `kind is ignored`

**Reason:** N if given the key param or axis == 1. The kind parameter has no effect. Snowpark pandas always uses a stable sort algorithm, while pandas by default does not.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sort_values()
```

### Output

The SMA adds the EWI `PNDSPY1101` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1101 => pandas.core.frame.DataFrame.sort_values has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sort_values()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `key`, `kind is ignored`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.sort_values(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N if given the key param or axis == 1. The kind parameter has no effect. Snowpark pandas always uses a stable sort algorithm, while pandas by default does not.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1102
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1102.md
section: Migrations
---

# PNDSPY1102

**Message** Pandas < **pandas.core.frame.DataFrame.stack** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`, `future_stack is ignored`

**Reason:** N for MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.stack()
```

### Output

The SMA adds the EWI `PNDSPY1102` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1102 => pandas.core.frame.DataFrame.stack has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.stack()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `level`, `future_stack is ignored`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.stack(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N for MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1103
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1103.md
section: Migrations
---

# PNDSPY1103

**Message** Pandas < **pandas.core.frame.DataFrame.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ddof is not 0 or 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.std()
```

### Output

The SMA adds the EWI `PNDSPY1103` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1103 => pandas.core.frame.DataFrame.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.std()
```

## Recommended fix

**Behavioral note**: N if ddof is not 0 or 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1104
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1104.md
section: Migrations
---

# PNDSPY1104

**Message** Pandas < **pandas.core.frame.DataFrame.sub** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sub()
```

### Output

The SMA adds the EWI `PNDSPY1104` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1104 => pandas.core.frame.DataFrame.sub has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.sub()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().sub(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1105
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1105.md
section: Migrations
---

# PNDSPY1105

**Message** Pandas < **pandas.core.frame.DataFrame.subtract** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.subtract()
```

### Output

The SMA adds the EWI `PNDSPY1105` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1105 => pandas.core.frame.DataFrame.subtract has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.subtract()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().subtract(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1106
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1106.md
section: Migrations
---

# PNDSPY1106

**Message** Pandas < **pandas.core.frame.DataFrame.to_csv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.to_csv`
* `pandas.core.generic.NDFrame.to_csv`
* `pandas.core.series.Series.to_csv`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Supports writing to both local and snowflake stage. Filepath starting with @ is treated as snowflake stage location. Writing to local file supports all parameters. Writing to snowflake state does not support float_format, mode, encoding, quoting, quotechar, lineterminator, doublequote and decimal parameters.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.to_csv()
```

### Output

The SMA adds the EWI `PNDSPY1106` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1106 => pandas.core.frame.DataFrame.to_csv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.to_csv()
```

## Recommended fix

**NULL/NaN handling difference**: Supports writing to both local and snowflake stage. Filepath starting with @ is treated as snowflake stage location. Writing to local file supports all parameters. Writing to snowflake state does not support float_format, mode, encoding, quoting, quotechar, lineterminator, doublequote and decimal parameters.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1107
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1107.md
section: Migrations
---

# PNDSPY1107

**Message** Pandas < **pandas.core.frame.DataFrame.transform** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if func is callable.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.transform()
```

### Output

The SMA adds the EWI `PNDSPY1107` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1107 => pandas.core.frame.DataFrame.transform has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.transform()
```

## Recommended fix

**Behavioral note**: Y if func is callable.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1108
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1108.md
section: Migrations
---

# PNDSPY1108

**Message** Pandas < **pandas.core.frame.DataFrame.transpose** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See T.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.transpose()
```

### Output

The SMA adds the EWI `PNDSPY1108` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1108 => pandas.core.frame.DataFrame.transpose has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.transpose()
```

## Recommended fix

**Behavioral note**: See T.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1109
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1109.md
section: Migrations
---

# PNDSPY1109

**Message** Pandas < **pandas.core.frame.DataFrame.truediv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.truediv()
```

### Output

The SMA adds the EWI `PNDSPY1109` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1109 => pandas.core.frame.DataFrame.truediv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.truediv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().truediv(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1110
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1110.md
section: Migrations
---

# PNDSPY1110

**Message** Pandas < **pandas.core.frame.DataFrame.tz_convert** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.tz_convert`
* `pandas.core.generic.NDFrame.tz_convert`
* `pandas.core.series.Series.tz_convert`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis`, `level`, `copy`

**Reason:** N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.tz_convert()
```

### Output

The SMA adds the EWI `PNDSPY1110` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1110 => pandas.core.frame.DataFrame.tz_convert has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.tz_convert()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `axis`, `level`, `copy`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.tz_convert(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Timezone handling difference**: N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1111
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1111.md
section: Migrations
---

# PNDSPY1111

**Message** Pandas < **pandas.core.frame.DataFrame.tz_localize** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.tz_localize`
* `pandas.core.generic.NDFrame.tz_localize`
* `pandas.core.series.Series.tz_localize`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis`, `level`, `copy ambiguous`, `nonexistent`

**Reason:** N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.tz_localize()
```

### Output

The SMA adds the EWI `PNDSPY1111` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1111 => pandas.core.frame.DataFrame.tz_localize has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.tz_localize()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `axis`, `level`, `copy ambiguous`, `nonexistent`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.tz_localize(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Timezone handling difference**: N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as UTC+09:00, is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1112
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1112.md
section: Migrations
---

# PNDSPY1112

**Message** Pandas < **pandas.core.frame.DataFrame.unstack** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `sort`

**Reason:** N for non-integer level.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.unstack()
```

### Output

The SMA adds the EWI `PNDSPY1112` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1112 => pandas.core.frame.DataFrame.unstack has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.unstack()
```

## Recommended fix

The parameter `sort` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().unstack(sort=value)

**Behavioral note**: N for non-integer level.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1113
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1113.md
section: Migrations
---

# PNDSPY1113

**Message** Pandas < **pandas.core.frame.DataFrame.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See std.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.var()
```

### Output

The SMA adds the EWI `PNDSPY1113` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1113 => pandas.core.frame.DataFrame.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.var()
```

## Recommended fix

**Behavioral note**: See std.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1114
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1114.md
section: Migrations
---

# PNDSPY1114

**Message** Pandas < **pandas.core.frame.DataFrame.where** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.frame.DataFrame.where`
* `pandas.core.generic.NDFrame.where`
* `pandas.core.series.Series.where`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See mask.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.where()
```

### Output

The SMA adds the EWI `PNDSPY1114` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1114 => pandas.core.frame.DataFrame.where has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = df.where()
```

## Recommended fix

**Behavioral note**: See mask.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1115
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1115.md
section: Migrations
---

# PNDSPY1115

**Message** Pandas < **pandas.core.generic.NDFrame.shift** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Applies to

This EWI applies to the following elements (same implementation):

* `pandas.core.generic.NDFrame.shift`
* `pandas.core.series.Series.shift`

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `freq`

**Reason:** No support for freq != None.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.shift()
```

### Output

The SMA adds the EWI `PNDSPY1115` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1115 => pandas.core.generic.NDFrame.shift has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.shift()
```

## Recommended fix

The parameter `freq` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().shift(freq=value)

**Behavioral note**: No support for freq != None.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1116
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1116.md
section: Migrations
---

# PNDSPY1116

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.agg** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis other than 0 is not implemented.`

**Reason:** Check Supported Aggregation Functions <agg_supp.html>_ for a list of supported functions.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.agg()
```

### Output

The SMA adds the EWI `PNDSPY1116` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1116 => pandas.core.groupby.generic.DataFrameGroupBy.agg has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.agg()
```

## Recommended fix

The parameter `axis other than 0 is not implemented.` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().agg(axis other than 0 is not implemented.=value)

**Behavioral note**: Check Supported Aggregation Functions <agg_supp.html>_ for a list of supported functions.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1117
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1117.md
section: Migrations
---

# PNDSPY1117

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.aggregate** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis other than 0 is not implemented.`

**Reason:** See agg.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.aggregate()
```

### Output

The SMA adds the EWI `PNDSPY1117` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1117 => pandas.core.groupby.generic.DataFrameGroupBy.aggregate has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.aggregate()
```

## Recommended fix

The parameter `axis other than 0 is not implemented.` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().aggregate(axis other than 0 is not implemented.=value)

**Behavioral note**: See agg.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1118
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1118.md
section: Migrations
---

# PNDSPY1118

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.fillna** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** GroupBy axis = 0 is supported. Does not support downcast parameter.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.fillna
```

### Output

The SMA adds the EWI `PNDSPY1118` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1118 => pandas.core.groupby.generic.DataFrameGroupBy.fillna has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.fillna
```

## Recommended fix

**Behavioral note**: GroupBy axis = 0 is supported. Does not support downcast parameter.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1119
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1119.md
section: Migrations
---

# PNDSPY1119

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.idxmax** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.idxmax
```

### Output

The SMA adds the EWI `PNDSPY1119` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1119 => pandas.core.groupby.generic.DataFrameGroupBy.idxmax has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.idxmax
```

## Recommended fix

**Behavioral note**: When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1120
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1120.md
section: Migrations
---

# PNDSPY1120

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.idxmin** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See idxmax.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.idxmin
```

### Output

The SMA adds the EWI `PNDSPY1120` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1120 => pandas.core.groupby.generic.DataFrameGroupBy.idxmin has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.idxmin
```

## Recommended fix

**Behavioral note**: See idxmax.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1121
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1121.md
section: Migrations
---

# PNDSPY1121

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.transform** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `SeriesGroupBy.transform is not implemented.`

**Reason:** Y when func is a string or callable. A UDTF is created to run transform on every group via apply. transform has the same limitations as apply except for string func also being valid for transform.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.transform()
```

### Output

The SMA adds the EWI `PNDSPY1121` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1121 => pandas.core.groupby.generic.DataFrameGroupBy.transform has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.transform()
```

## Recommended fix

The parameter `SeriesGroupBy.transform is not implemented.` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().transform(SeriesGroupBy.transform is not implemented.=value)

**Behavioral note**: Y when func is a string or callable. A UDTF is created to run transform on every group via apply. transform has the same limitations as apply except for string func also being valid for transform.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1122
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1122.md
section: Migrations
---

# PNDSPY1122

**Message** Pandas < **pandas.core.groupby.generic.DataFrameGroupBy.value_counts** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if bins is given for SeriesGroupBy.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.value_counts
```

### Output

The SMA adds the EWI `PNDSPY1122` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1122 => pandas.core.groupby.generic.DataFrameGroupBy.value_counts has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.value_counts
```

## Recommended fix

**Behavioral note**: N if bins is given for SeriesGroupBy.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1123
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1123.md
section: Migrations
---

# PNDSPY1123

**Message** Pandas < **pandas.core.groupby.groupby.BaseGroupBy.get_group** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Implemented for DataFrameGroupBy objects only.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.get_group
```

### Output

The SMA adds the EWI `PNDSPY1123` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1123 => pandas.core.groupby.groupby.BaseGroupBy.get_group has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.get_group
```

## Recommended fix

**Behavioral note**: Implemented for DataFrameGroupBy objects only.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1124
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1124.md
section: Migrations
---

# PNDSPY1124

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.all** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.all
```

### Output

The SMA adds the EWI `PNDSPY1124` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1124 => pandas.core.groupby.groupby.GroupBy.all has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.all
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1125
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1125.md
section: Migrations
---

# PNDSPY1125

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.any** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.any
```

### Output

The SMA adds the EWI `PNDSPY1125` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1125 => pandas.core.groupby.groupby.GroupBy.any has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.any
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1126
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1126.md
section: Migrations
---

# PNDSPY1126

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.apply** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `axis other than 0 is not implemented.`

**Reason:** Y if the following are true, otherwise N: - func is a callable that always returns either a pandas DataFrame, a pandas Series, or objects that are neither DataFrame nor Series. - grouping on axis=0 - Not applying transform to a dataframe with a non-unique index - Not applying func that returns two dataframes that have different labels for the column at a given position - Not applying func that returns two dataframes that have different names for a given index label - Not applying func that returns two Series that have different labels for the row at a given position - Not applying func that returns two Series that have different names - Not grouping by an “external” by, i.e. an object that is not a label for a column or level of the dataframe.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.apply()
```

### Output

The SMA adds the EWI `PNDSPY1126` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1126 => pandas.core.groupby.groupby.GroupBy.apply has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.apply()
```

## Recommended fix

The parameter `axis other than 0 is not implemented.` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().apply(axis other than 0 is not implemented.=value)

**NULL/NaN handling difference**: Y if the following are true, otherwise N: - func is a callable that always returns either a pandas DataFrame, a pandas Series, or objects that are neither DataFrame nor Series. - grouping on axis=0 - Not applying transform to a dataframe with a non-unique index - Not applying func that returns two dataframes that have different labels for the column at a given position - Not applying func that returns two dataframes that have different names for a given index label - Not applying func that returns two Series that have different labels for the row at a given position - Not applying func that returns two Series that have different names - Not grouping by an “external” by, i.e. an object that is not a label for a column or level of the dataframe.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1127
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1127.md
section: Migrations
---

# PNDSPY1127

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.bfill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.bfill
```

### Output

The SMA adds the EWI `PNDSPY1127` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1127 => pandas.core.groupby.groupby.GroupBy.bfill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.bfill
```

## Recommended fix

**Behavioral note**: When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1128
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1128.md
section: Migrations
---

# PNDSPY1128

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.ffill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.ffill
```

### Output

The SMA adds the EWI `PNDSPY1128` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1128 => pandas.core.groupby.groupby.GroupBy.ffill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.ffill
```

## Recommended fix

**Behavioral note**: When GroupBy axis is 1,N; GroupBy axis = 0 is fully supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1129
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1129.md
section: Migrations
---

# PNDSPY1129

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.first** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Does not support min_count parameter.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.first
```

### Output

The SMA adds the EWI `PNDSPY1129` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1129 => pandas.core.groupby.groupby.GroupBy.first has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.first
```

## Recommended fix

**Behavioral note**: Does not support min_count parameter.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1130
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1130.md
section: Migrations
---

# PNDSPY1130

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.last** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Does not support min_count parameter.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.last
```

### Output

The SMA adds the EWI `PNDSPY1130` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1130 => pandas.core.groupby.groupby.GroupBy.last has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.last
```

## Recommended fix

**Behavioral note**: Does not support min_count parameter.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1131
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1131.md
section: Migrations
---

# PNDSPY1131

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.pct_change** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if axis = 0, freq is None, and limit is None.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.pct_change
```

### Output

The SMA adds the EWI `PNDSPY1131` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1131 => pandas.core.groupby.groupby.GroupBy.pct_change has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.pct_change
```

## Recommended fix

**Behavioral note**: Y if axis = 0, freq is None, and limit is None.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1132
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1132.md
section: Migrations
---

# PNDSPY1132

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.quantile** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for list-like q.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.quantile
```

### Output

The SMA adds the EWI `PNDSPY1132` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1132 => pandas.core.groupby.groupby.GroupBy.quantile has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.quantile
```

## Recommended fix

**Behavioral note**: N for list-like q.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1133
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1133.md
section: Migrations
---

# PNDSPY1133

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.resample** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Implemented for DataFrameGroupBy objects. Only DatetimeIndex is supported and its freq will be lost. rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.resample
```

### Output

The SMA adds the EWI `PNDSPY1133` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1133 => pandas.core.groupby.groupby.GroupBy.resample has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.resample
```

## Recommended fix

**Behavioral note**: Implemented for DataFrameGroupBy objects. Only DatetimeIndex is supported and its freq will be lost. rule frequencies ‘s’, ‘min’, ‘h’, and ‘D’ are supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1134
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1134.md
section: Migrations
---

# PNDSPY1134

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.rolling** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Implemented for DataframeGroupby objects. N for on, non-integer window, axis = 1, method != single, min_periods = 0, or closed != None.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.rolling
```

### Output

The SMA adds the EWI `PNDSPY1134` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1134 => pandas.core.groupby.groupby.GroupBy.rolling has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.rolling
```

## Recommended fix

**Behavioral note**: Implemented for DataframeGroupby objects. N for on, non-integer window, axis = 1, method != single, min_periods = 0, or closed != None.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1135
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1135.md
section: Migrations
---

# PNDSPY1135

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.shift** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if axis = 0, freq is None, level is None, and by is in the columns.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.shift
```

### Output

The SMA adds the EWI `PNDSPY1135` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1135 => pandas.core.groupby.groupby.GroupBy.shift has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.shift
```

## Recommended fix

**Behavioral note**: Y if axis = 0, freq is None, level is None, and by is in the columns.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1136
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1136.md
section: Migrations
---

# PNDSPY1136

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ddof is not 0 or 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.std
```

### Output

The SMA adds the EWI `PNDSPY1136` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1136 => pandas.core.groupby.groupby.GroupBy.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.std
```

## Recommended fix

**Behavioral note**: N if ddof is not 0 or 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1137
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1137.md
section: Migrations
---

# PNDSPY1137

**Message** Pandas < **pandas.core.groupby.groupby.GroupBy.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See std.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.var
```

### Output

The SMA adds the EWI `PNDSPY1137` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1137 => pandas.core.groupby.groupby.GroupBy.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': ['foo', 'bar', 'foo', 'bar'], 'B': [1, 2, 3, 4]})
grouped = df.groupby('A')
result = grouped.var
```

## Recommended fix

**Behavioral note**: See std.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1138
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1138.md
section: Migrations
---

# PNDSPY1138

**Message** Pandas < **pandas.core.indexes.base.Index.all** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.all()
```

### Output

The SMA adds the EWI `PNDSPY1138` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1138 => pandas.core.indexes.base.Index.all has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.all()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1139
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1139.md
section: Migrations
---

# PNDSPY1139

**Message** Pandas < **pandas.core.indexes.base.Index.any** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.any()
```

### Output

The SMA adds the EWI `PNDSPY1139` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1139 => pandas.core.indexes.base.Index.any has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.any()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1140
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1140.md
section: Migrations
---

# PNDSPY1140

**Message** Pandas < **pandas.core.indexes.base.Index.nlevels** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Only single Index supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.nlevels
```

### Output

The SMA adds the EWI `PNDSPY1140` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1140 => pandas.core.indexes.base.Index.nlevels has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.nlevels
```

## Recommended fix

**Behavioral note**: Only single Index supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1141
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1141.md
section: Migrations
---

# PNDSPY1141

**Message** Pandas < **pandas.core.indexes.base.Index.reindex** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the method is nearest.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.reindex()
```

### Output

The SMA adds the EWI `PNDSPY1141` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1141 => pandas.core.indexes.base.Index.reindex has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.reindex()
```

## Recommended fix

**Behavioral note**: N if the method is nearest.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1142
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1142.md
section: Migrations
---

# PNDSPY1142

**Message** Pandas < **pandas.core.indexes.base.Index.sort_values** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `key`

**Reason:** Snowpark pandas currently uses stable sort when sorting the index values. pandas uses quicksort.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.Index([1, 2, 3, 4, 5])
result = idx.sort_values()
```

### Output

The SMA adds the EWI `PNDSPY1142` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1142 => pandas.core.indexes.base.Index.sort_values has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.Index([1, 2, 3, 4, 5])
result = idx.sort_values()
```

## Recommended fix

The parameter `key` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().sort_values(key=value)

**Behavioral note**: Snowpark pandas currently uses stable sort when sorting the index values. pandas uses quicksort.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1143
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1143.md
section: Migrations
---

# PNDSPY1143

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.ceil** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `ambiguous`, `nonexistent`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.ceil()
```

### Output

The SMA adds the EWI `PNDSPY1143` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1143 => pandas.core.indexes.datetimes.DatetimeIndex.ceil has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.ceil()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `ambiguous`, `nonexistent`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.ceil(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1144
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1144.md
section: Migrations
---

# PNDSPY1144

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.day_name** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `locale`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.day_name()
```

### Output

The SMA adds the EWI `PNDSPY1144` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1144 => pandas.core.indexes.datetimes.DatetimeIndex.day_name has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.day_name()
```

## Recommended fix

The parameter `locale` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().day_name(locale=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1145
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1145.md
section: Migrations
---

# PNDSPY1145

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.floor** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `ambiguous`, `nonexistent`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.floor()
```

### Output

The SMA adds the EWI `PNDSPY1145` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1145 => pandas.core.indexes.datetimes.DatetimeIndex.floor has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.floor()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `ambiguous`, `nonexistent`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.floor(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1146
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1146.md
section: Migrations
---

# PNDSPY1146

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.month_name** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `locale`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.month_name()
```

### Output

The SMA adds the EWI `PNDSPY1146` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1146 => pandas.core.indexes.datetimes.DatetimeIndex.month_name has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.month_name()
```

## Recommended fix

The parameter `locale` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().month_name(locale=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1147
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1147.md
section: Migrations
---

# PNDSPY1147

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.round** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `ambiguous`, `nonexistent`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.round()
```

### Output

The SMA adds the EWI `PNDSPY1147` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1147 => pandas.core.indexes.datetimes.DatetimeIndex.round has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.round()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `ambiguous`, `nonexistent`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.round(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1148
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1148.md
section: Migrations
---

# PNDSPY1148

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `ddof`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.std()
```

### Output

The SMA adds the EWI `PNDSPY1148` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1148 => pandas.core.indexes.datetimes.DatetimeIndex.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.std()
```

## Recommended fix

The parameter `ddof` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().std(ddof=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1149
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1149.md
section: Migrations
---

# PNDSPY1149

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.tz_convert** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as, UTC+09:00 is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.tz_convert()
```

### Output

The SMA adds the EWI `PNDSPY1149` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1149 => pandas.core.indexes.datetimes.DatetimeIndex.tz_convert has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.tz_convert()
```

## Recommended fix

**Timezone handling difference**: N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as, UTC+09:00 is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1150
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1150.md
section: Migrations
---

# PNDSPY1150

**Message** Pandas < **pandas.core.indexes.datetimes.DatetimeIndex.tz_localize** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `ambiguous`, `nonexistent`

**Reason:** N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as, UTC+09:00 is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.tz_localize()
```

### Output

The SMA adds the EWI `PNDSPY1150` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1150 => pandas.core.indexes.datetimes.DatetimeIndex.tz_localize has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
idx = pd.DatetimeIndex(['2023-01-01', '2023-02-01', '2023-03-01'])
result = idx.tz_localize()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `ambiguous`, `nonexistent`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.tz_localize(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Timezone handling difference**: N if timezone format is not supported. Only timezones listed in pytz.all_timezones are supported. For example, UTC is supported but UTC+/-<offset>, such as, UTC+09:00 is not supported.

When working with timezones in Snowpark pandas:

* Ensure your timezone strings are valid IANA timezone names (e.g., ‘UTC’, ‘America/New_York’)
* Test timezone conversions with sample data before running on full dataset
* Consider using `.to_pandas()` for complex timezone operations if results differ

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1151
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1151.md
section: Migrations
---

# PNDSPY1151

**Message** Pandas < **pandas.core.indexes.datetimes.bdate_range** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for custom frequencies.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.bdate_range(df)
```

### Output

The SMA adds the EWI `PNDSPY1151` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1151 => pandas.core.indexes.datetimes.bdate_range has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.bdate_range(df)
```

## Recommended fix

**Behavioral note**: N for custom frequencies.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1152
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1152.md
section: Migrations
---

# PNDSPY1152

**Message** Pandas < **pandas.core.indexes.datetimes.date_range** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for custom frequencies.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.date_range(df)
```

### Output

The SMA adds the EWI `PNDSPY1152` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1152 => pandas.core.indexes.datetimes.date_range has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.date_range(df)
```

## Recommended fix

**Behavioral note**: N for custom frequencies.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1153
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1153.md
section: Migrations
---

# PNDSPY1153

**Message** Pandas < **pandas.core.resample.Resampler.asfreq** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `fill_value`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.asfreq()
```

### Output

The SMA adds the EWI `PNDSPY1153` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1153 => pandas.core.resample.Resampler.asfreq has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.asfreq()
```

## Recommended fix

The parameter `fill_value` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().asfreq(fill_value=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1154
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1154.md
section: Migrations
---

# PNDSPY1154

**Message** Pandas < **pandas.core.resample.Resampler.bfill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `limit`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.bfill()
```

### Output

The SMA adds the EWI `PNDSPY1154` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1154 => pandas.core.resample.Resampler.bfill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.bfill()
```

## Recommended fix

The parameter `limit` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().bfill(limit=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1155
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1155.md
section: Migrations
---

# PNDSPY1155

**Message** Pandas < **pandas.core.resample.Resampler.ffill** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `limit`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.ffill()
```

### Output

The SMA adds the EWI `PNDSPY1155` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1155 => pandas.core.resample.Resampler.ffill has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.ffill()
```

## Recommended fix

The parameter `limit` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().ffill(limit=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1156
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1156.md
section: Migrations
---

# PNDSPY1156

**Message** Pandas < **pandas.core.resample.Resampler.fillna** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `limit`

**Reason:** Method nearest is not supported.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.fillna()
```

### Output

The SMA adds the EWI `PNDSPY1156` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1156 => pandas.core.resample.Resampler.fillna has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.fillna()
```

## Recommended fix

The parameter `limit` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().fillna(limit=value)

**Behavioral note**: Method nearest is not supported.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1157
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1157.md
section: Migrations
---

# PNDSPY1157

**Message** Pandas < **pandas.core.resample.Resampler.first** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Does not support min_count parameter.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.first
```

### Output

The SMA adds the EWI `PNDSPY1157` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1157 => pandas.core.resample.Resampler.first has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.first
```

## Recommended fix

**Behavioral note**: Does not support min_count parameter.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1158
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1158.md
section: Migrations
---

# PNDSPY1158

**Message** Pandas < **pandas.core.resample.Resampler.last** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Does not support min_count parameter.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.last
```

### Output

The SMA adds the EWI `PNDSPY1158` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1158 => pandas.core.resample.Resampler.last has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.last
```

## Recommended fix

**Behavioral note**: Does not support min_count parameter.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1159
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1159.md
section: Migrations
---

# PNDSPY1159

**Message** Pandas < **pandas.core.resample.Resampler.quantile** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for list-like q.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.quantile
```

### Output

The SMA adds the EWI `PNDSPY1159` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1159 => pandas.core.resample.Resampler.quantile has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.quantile
```

## Recommended fix

**Behavioral note**: N for list-like q.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1160
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1160.md
section: Migrations
---

# PNDSPY1160

**Message** Pandas < **pandas.core.resample.Resampler.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ddof is not 0 or 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.std
```

### Output

The SMA adds the EWI `PNDSPY1160` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1160 => pandas.core.resample.Resampler.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.std
```

## Recommended fix

**Behavioral note**: N if ddof is not 0 or 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1161
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1161.md
section: Migrations
---

# PNDSPY1161

**Message** Pandas < **pandas.core.resample.Resampler.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ddof is not 0 or 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.var
```

### Output

The SMA adds the EWI `PNDSPY1161` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1161 => pandas.core.resample.Resampler.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3]}, index=pd.date_range('2023-01-01', periods=3, freq='D'))
resampled = df.resample('D')
result = resampled.var
```

## Recommended fix

**Behavioral note**: N if ddof is not 0 or 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1162
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1162.md
section: Migrations
---

# PNDSPY1162

**Message** Pandas < **pandas.core.reshape.concat.concat** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `levels is not supported`, `copy is ignored`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.concat(df)
```

### Output

The SMA adds the EWI `PNDSPY1162` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1162 => pandas.core.reshape.concat.concat has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.concat(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `levels is not supported`, `copy is ignored`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.concat(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1163
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1163.md
section: Migrations
---

# PNDSPY1163

**Message** Pandas < **pandas.core.reshape.melt.melt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `col_level`, `ignore_index`

**Reason:** N if df.columns is a MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.melt(df)
```

### Output

The SMA adds the EWI `PNDSPY1163` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1163 => pandas.core.reshape.melt.melt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.melt(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `col_level`, `ignore_index`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.melt(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N if df.columns is a MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1164
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1164.md
section: Migrations
---

# PNDSPY1164

**Message** Pandas < **pandas.core.reshape.merge.merge** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `validate`

**Reason:** N if param validate is given.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.merge(df)
```

### Output

The SMA adds the EWI `PNDSPY1164` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1164 => pandas.core.reshape.merge.merge has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.merge(df)
```

## Recommended fix

The parameter `validate` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().merge(validate=value)

**Behavioral note**: N if param validate is given.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1165
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1165.md
section: Migrations
---

# PNDSPY1165

**Message** Pandas < **pandas.core.reshape.merge.merge_asof** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `suffixes`, `tolerance`

**Reason:** N if param direction is nearest.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.merge_asof(df)
```

### Output

The SMA adds the EWI `PNDSPY1165` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1165 => pandas.core.reshape.merge.merge_asof has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.merge_asof(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `suffixes`, `tolerance`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.merge_asof(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N if param direction is nearest.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1166
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1166.md
section: Migrations
---

# PNDSPY1166

**Message** Pandas < **pandas.core.reshape.pivot.crosstab** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if aggfunc is not a supported aggregation function <agg_supp.html>_, margins is True, normalize is “all” or True, and values is passed.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.crosstab(df)
```

### Output

The SMA adds the EWI `PNDSPY1166` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1166 => pandas.core.reshape.pivot.crosstab has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.crosstab(df)
```

## Recommended fix

**Behavioral note**: N if aggfunc is not a supported aggregation function <agg_supp.html>_, margins is True, normalize is “all” or True, and values is passed.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1167
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1167.md
section: Migrations
---

# PNDSPY1167

**Message** Pandas < **pandas.core.reshape.pivot.pivot** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See pivot_table.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.pivot(df)
```

### Output

The SMA adds the EWI `PNDSPY1167` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1167 => pandas.core.reshape.pivot.pivot has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.pivot(df)
```

## Recommended fix

**Behavioral note**: See pivot_table.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1168
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1168.md
section: Migrations
---

# PNDSPY1168

**Message** Pandas < **pandas.core.reshape.pivot.pivot_table** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `observed`, `margins`, `sort`

**Reason:** N if index, columns, or values is not str; or MultiIndex; or any aggfunc is not a supported aggregation function <agg_supp.html>_.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.pivot_table(df)
```

### Output

The SMA adds the EWI `PNDSPY1168` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1168 => pandas.core.reshape.pivot.pivot_table has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.pivot_table(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `observed`, `margins`, `sort`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.pivot_table(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N if index, columns, or values is not str; or MultiIndex; or any aggfunc is not a supported aggregation function <agg_supp.html>_.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1169
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1169.md
section: Migrations
---

# PNDSPY1169

**Message** Pandas < **pandas.core.reshape.tile.cut** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `retbins`, `labels`

**Reason:** N if retbins=Trueor labels!=False.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.cut(df)
```

### Output

The SMA adds the EWI `PNDSPY1169` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1169 => pandas.core.reshape.tile.cut has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.cut(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `retbins`, `labels`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.cut(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: N if retbins=Trueor labels!=False.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1170
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1170.md
section: Migrations
---

# PNDSPY1170

**Message** Pandas < **pandas.core.reshape.tile.qcut** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if labels!=False or retbins=True.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.qcut(df)
```

### Output

The SMA adds the EWI `PNDSPY1170` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1170 => pandas.core.reshape.tile.qcut has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.qcut(df)
```

## Recommended fix

**Behavioral note**: N if labels!=False or retbins=True.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1171
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1171.md
section: Migrations
---

# PNDSPY1171

**Message** Pandas < **pandas.core.series.Series.add** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.add()
```

### Output

The SMA adds the EWI `PNDSPY1171` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1171 => pandas.core.series.Series.add has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.add()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().add(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1172
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1172.md
section: Migrations
---

# PNDSPY1172

**Message** Pandas < **pandas.core.series.Series.all** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.all()
```

### Output

The SMA adds the EWI `PNDSPY1172` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1172 => pandas.core.series.Series.all has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.all()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1173
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1173.md
section: Migrations
---

# PNDSPY1173

**Message** Pandas < **pandas.core.series.Series.any** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer/boolean types.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.any()
```

### Output

The SMA adds the EWI `PNDSPY1173` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1173 => pandas.core.series.Series.any has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.any()
```

## Recommended fix

**Data type consideration**: N for non-integer/boolean types.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1174
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1174.md
section: Migrations
---

# PNDSPY1174

**Message** Pandas < **pandas.core.series.Series.case_when** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if condition or replacement is a callable.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.case_when()
```

### Output

The SMA adds the EWI `PNDSPY1174` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1174 => pandas.core.series.Series.case_when has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.case_when()
```

## Recommended fix

**Behavioral note**: N if condition or replacement is a callable.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1175
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1175.md
section: Migrations
---

# PNDSPY1175

**Message** Pandas < **pandas.core.series.Series.compare** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `align_axis`, `keep_shape`, `keep_equal`, `result_names`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.compare()
```

### Output

The SMA adds the EWI `PNDSPY1175` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1175 => pandas.core.series.Series.compare has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.compare()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `align_axis`, `keep_shape`, `keep_equal`, `result_names`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.compare(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1176
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1176.md
section: Migrations
---

# PNDSPY1176

**Message** Pandas < **pandas.core.series.Series.cumsum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if values are numeric, otherwise fails.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.cumsum()
```

### Output

The SMA adds the EWI `PNDSPY1176` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1176 => pandas.core.series.Series.cumsum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.cumsum()
```

## Recommended fix

**Behavioral note**: Y if values are numeric, otherwise fails.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1177
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1177.md
section: Migrations
---

# PNDSPY1177

**Message** Pandas < **pandas.core.series.Series.div** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** See truediv.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.div()
```

### Output

The SMA adds the EWI `PNDSPY1177` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1177 => pandas.core.series.Series.div has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.div()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().div(level=value)

**Behavioral note**: See truediv.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1178
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1178.md
section: Migrations
---

# PNDSPY1178

**Message** Pandas < **pandas.core.series.Series.divide** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** See truediv.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.divide()
```

### Output

The SMA adds the EWI `PNDSPY1178` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1178 => pandas.core.series.Series.divide has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.divide()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().divide(level=value)

**Behavioral note**: See truediv.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1179
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1179.md
section: Migrations
---

# PNDSPY1179

**Message** Pandas < **pandas.core.series.Series.dropna** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.dropna()
```

### Output

The SMA adds the EWI `PNDSPY1179` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1179 => pandas.core.series.Series.dropna has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.dropna()
```

## Recommended fix

This element has partial support in Snowpark pandas. General recommendations:

1. **Test with sample data**: Verify the operation works as expected with a subset of your data
2. **Check parameters**: Review which parameters are supported in the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index)
3. **Fallback option**: If exact pandas behavior is required:
   .. code-block:: python

   > # Convert to native pandas for full compatibility
   > result = df.to_pandas().dropna(…)
4. **Consider SQL alternative**: For complex operations, Snowflake SQL may offer better performance

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1180
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1180.md
section: Migrations
---

# PNDSPY1180

**Message** Pandas < **pandas.core.series.Series.eq** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.eq()
```

### Output

The SMA adds the EWI `PNDSPY1180` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1180 => pandas.core.series.Series.eq has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.eq()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().eq(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1181
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1181.md
section: Migrations
---

# PNDSPY1181

**Message** Pandas < **pandas.core.series.Series.flags** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** flags can only be read, and not set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.flags
```

### Output

The SMA adds the EWI `PNDSPY1181` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1181 => pandas.core.series.Series.flags has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.flags
```

## Recommended fix

**Behavioral note**: flags can only be read, and not set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1182
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1182.md
section: Migrations
---

# PNDSPY1182

**Message** Pandas < **pandas.core.series.Series.floordiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** Raises division by zero exception when the right hand side contains at least one zero. pandas allows division by zero for non-object type Series and returns +/-inf.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.floordiv()
```

### Output

The SMA adds the EWI `PNDSPY1182` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1182 => pandas.core.series.Series.floordiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.floordiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().floordiv(level=value)

**Data type consideration**: Raises division by zero exception when the right hand side contains at least one zero. pandas allows division by zero for non-object type Series and returns +/-inf.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1183
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1183.md
section: Migrations
---

# PNDSPY1183

**Message** Pandas < **pandas.core.series.Series.ge** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.ge()
```

### Output

The SMA adds the EWI `PNDSPY1183` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1183 => pandas.core.series.Series.ge has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.ge()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().ge(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1184
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1184.md
section: Migrations
---

# PNDSPY1184

**Message** Pandas < **pandas.core.series.Series.groupby** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `observed is ignored since Categoricals are not implemented yet`

**Reason:** Y, support axis == 0 and by is column label or Series from the current DataFrame, or a pd.Grouper object; otherwise N. If a pd.Grouper object is passed, then only the default values of the sort, closed, label, and convention arguments are supported. The origin argument currently supports “start_day” and “start”. Note that supported functions are agg, count, cumcount, cummax, cummin, cumsum, first, last, max, mean, median, min, quantile, shift, size, std, sum, and var. Otherwise N.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.groupby()
```

### Output

The SMA adds the EWI `PNDSPY1184` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1184 => pandas.core.series.Series.groupby has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.groupby()
```

## Recommended fix

The parameter `observed is ignored since Categoricals are not implemented yet` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().groupby(observed is ignored since Categoricals are not implemented yet=value)

**Behavioral note**: Y, support axis == 0 and by is column label or Series from the current DataFrame, or a pd.Grouper object; otherwise N. If a pd.Grouper object is passed, then only the default values of the sort, closed, label, and convention arguments are supported. The origin argument currently supports “start_day” and “start”. Note that supported functions are agg, count, cumcount, cummax, cummin, cumsum, first, last, max, mean, median, min, quantile, shift, size, std, sum, and var. Otherwise N.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1185
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1185.md
section: Migrations
---

# PNDSPY1185

**Message** Pandas < **pandas.core.series.Series.gt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.gt()
```

### Output

The SMA adds the EWI `PNDSPY1185` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1185 => pandas.core.series.Series.gt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.gt()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().gt(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1186
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1186.md
section: Migrations
---

# PNDSPY1186

**Message** Pandas < **pandas.core.series.Series.le** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.le()
```

### Output

The SMA adds the EWI `PNDSPY1186` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1186 => pandas.core.series.Series.le has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.le()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().le(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1187
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1187.md
section: Migrations
---

# PNDSPY1187

**Message** Pandas < **pandas.core.series.Series.lt** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.lt()
```

### Output

The SMA adds the EWI `PNDSPY1187` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1187 => pandas.core.series.Series.lt has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.lt()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().lt(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1188
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1188.md
section: Migrations
---

# PNDSPY1188

**Message** Pandas < **pandas.core.series.Series.map** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `na_action`

**Reason:** N if arg is an instance of a subclass of dict that is not a subclass of collections.defaultdict but that does define a __missing__ method.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.map()
```

### Output

The SMA adds the EWI `PNDSPY1188` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1188 => pandas.core.series.Series.map has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.map()
```

## Recommended fix

The parameter `na_action` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().map(na_action=value)

**Behavioral note**: N if arg is an instance of a subclass of dict that is not a subclass of collections.defaultdict but that does define a __missing__ method.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1189
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1189.md
section: Migrations
---

# PNDSPY1189

**Message** Pandas < **pandas.core.series.Series.mod** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.mod()
```

### Output

The SMA adds the EWI `PNDSPY1189` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1189 => pandas.core.series.Series.mod has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.mod()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().mod(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1190
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1190.md
section: Migrations
---

# PNDSPY1190

**Message** Pandas < **pandas.core.series.Series.mul** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.mul()
```

### Output

The SMA adds the EWI `PNDSPY1190` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1190 => pandas.core.series.Series.mul has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.mul()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().mul(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1191
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1191.md
section: Migrations
---

# PNDSPY1191

**Message** Pandas < **pandas.core.series.Series.multiply** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.multiply()
```

### Output

The SMA adds the EWI `PNDSPY1191` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1191 => pandas.core.series.Series.multiply has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.multiply()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().multiply(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1192
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1192.md
section: Migrations
---

# PNDSPY1192

**Message** Pandas < **pandas.core.series.Series.ne** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.ne()
```

### Output

The SMA adds the EWI `PNDSPY1192` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1192 => pandas.core.series.Series.ne has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.ne()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().ne(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1193
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1193.md
section: Migrations
---

# PNDSPY1193

**Message** Pandas < **pandas.core.series.Series.nlargest** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if keep == “all”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.nlargest()
```

### Output

The SMA adds the EWI `PNDSPY1193` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1193 => pandas.core.series.Series.nlargest has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.nlargest()
```

## Recommended fix

**Behavioral note**: N if keep == “all”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1194
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1194.md
section: Migrations
---

# PNDSPY1194

**Message** Pandas < **pandas.core.series.Series.nsmallest** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if keep == “all”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.nsmallest()
```

### Output

The SMA adds the EWI `PNDSPY1194` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1194 => pandas.core.series.Series.nsmallest has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.nsmallest()
```

## Recommended fix

**Behavioral note**: N if keep == “all”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1195
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1195.md
section: Migrations
---

# PNDSPY1195

**Message** Pandas < **pandas.core.series.Series.pow** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.pow()
```

### Output

The SMA adds the EWI `PNDSPY1195` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1195 => pandas.core.series.Series.pow has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.pow()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().pow(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1196
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1196.md
section: Migrations
---

# PNDSPY1196

**Message** Pandas < **pandas.core.series.Series.quantile** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Y if values are numeric, and interpolation is “linear” or “nearest”; N if q is a DataFrame or Series.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.quantile()
```

### Output

The SMA adds the EWI `PNDSPY1196` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1196 => pandas.core.series.Series.quantile has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.quantile()
```

## Recommended fix

**Behavioral note**: Y if values are numeric, and interpolation is “linear” or “nearest”; N if q is a DataFrame or Series.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1197
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1197.md
section: Migrations
---

# PNDSPY1197

**Message** Pandas < **pandas.core.series.Series.radd** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.radd()
```

### Output

The SMA adds the EWI `PNDSPY1197` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1197 => pandas.core.series.Series.radd has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.radd()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().radd(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1198
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1198.md
section: Migrations
---

# PNDSPY1198

**Message** Pandas < **pandas.core.series.Series.rdiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** See truediv.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rdiv()
```

### Output

The SMA adds the EWI `PNDSPY1198` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1198 => pandas.core.series.Series.rdiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rdiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rdiv(level=value)

**Behavioral note**: See truediv.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1199
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1199.md
section: Migrations
---

# PNDSPY1199

**Message** Pandas < **pandas.core.series.Series.reindex** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the series has MultiIndex, or method is nearest.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.reindex()
```

### Output

The SMA adds the EWI `PNDSPY1199` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1199 => pandas.core.series.Series.reindex has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.reindex()
```

## Recommended fix

**Behavioral note**: N if the series has MultiIndex, or method is nearest.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1200
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1200.md
section: Migrations
---

# PNDSPY1200

**Message** Pandas < **pandas.core.series.Series.rename** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `copy is ignored`

**Reason:** N if mapper is callable or the series has MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rename()
```

### Output

The SMA adds the EWI `PNDSPY1200` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1200 => pandas.core.series.Series.rename has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rename()
```

## Recommended fix

The parameter `copy is ignored` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rename(copy is ignored=value)

**Behavioral note**: N if mapper is callable or the series has MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1201
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1201.md
section: Migrations
---

# PNDSPY1201

**Message** Pandas < **pandas.core.series.Series.rfloordiv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** See floordiv.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rfloordiv()
```

### Output

The SMA adds the EWI `PNDSPY1201` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1201 => pandas.core.series.Series.rfloordiv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rfloordiv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rfloordiv(level=value)

**Behavioral note**: See floordiv.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1202
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1202.md
section: Migrations
---

# PNDSPY1202

**Message** Pandas < **pandas.core.series.Series.rmod** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rmod()
```

### Output

The SMA adds the EWI `PNDSPY1202` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1202 => pandas.core.series.Series.rmod has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rmod()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rmod(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1203
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1203.md
section: Migrations
---

# PNDSPY1203

**Message** Pandas < **pandas.core.series.Series.rmul** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rmul()
```

### Output

The SMA adds the EWI `PNDSPY1203` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1203 => pandas.core.series.Series.rmul has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rmul()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rmul(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1204
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1204.md
section: Migrations
---

# PNDSPY1204

**Message** Pandas < **pandas.core.series.Series.rpow** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rpow()
```

### Output

The SMA adds the EWI `PNDSPY1204` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1204 => pandas.core.series.Series.rpow has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rpow()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rpow(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1205
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1205.md
section: Migrations
---

# PNDSPY1205

**Message** Pandas < **pandas.core.series.Series.rsub** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rsub()
```

### Output

The SMA adds the EWI `PNDSPY1205` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1205 => pandas.core.series.Series.rsub has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rsub()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rsub(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1206
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1206.md
section: Migrations
---

# PNDSPY1206

**Message** Pandas < **pandas.core.series.Series.rtruediv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** See truediv.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.rtruediv()
```

### Output

The SMA adds the EWI `PNDSPY1206` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1206 => pandas.core.series.Series.rtruediv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.rtruediv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().rtruediv(level=value)

**Behavioral note**: See truediv.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1207
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1207.md
section: Migrations
---

# PNDSPY1207

**Message** Pandas < **pandas.core.series.Series.skew** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if axis == 1 or skipna == False or numeric_only=False.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.skew()
```

### Output

The SMA adds the EWI `PNDSPY1207` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1207 => pandas.core.series.Series.skew has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.skew()
```

## Recommended fix

**NULL/NaN handling difference**: N if axis == 1 or skipna == False or numeric_only=False.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1208
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1208.md
section: Migrations
---

# PNDSPY1208

**Message** Pandas < **pandas.core.series.Series.sort_index** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `key`

**Reason:** N if given the key param, or MultiIndex.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.sort_index()
```

### Output

The SMA adds the EWI `PNDSPY1208` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1208 => pandas.core.series.Series.sort_index has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.sort_index()
```

## Recommended fix

The parameter `key` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().sort_index(key=value)

**Behavioral note**: N if given the key param, or MultiIndex.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1209
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1209.md
section: Migrations
---

# PNDSPY1209

**Message** Pandas < **pandas.core.series.Series.sort_values** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `key`, `kind is ignored`

**Reason:** The kind parameter has no effect. Snowpark pandas always uses a stable sort algorithm, while pandas by default does not.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.sort_values()
```

### Output

The SMA adds the EWI `PNDSPY1209` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1209 => pandas.core.series.Series.sort_values has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.sort_values()
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `key`, `kind is ignored`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.sort_values(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: The kind parameter has no effect. Snowpark pandas always uses a stable sort algorithm, while pandas by default does not.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1210
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1210.md
section: Migrations
---

# PNDSPY1210

**Message** Pandas < **pandas.core.series.Series.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if ddof is not 0 or 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.std()
```

### Output

The SMA adds the EWI `PNDSPY1210` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1210 => pandas.core.series.Series.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.std()
```

## Recommended fix

**Behavioral note**: N if ddof is not 0 or 1.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1211
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1211.md
section: Migrations
---

# PNDSPY1211

**Message** Pandas < **pandas.core.series.Series.sub** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.sub()
```

### Output

The SMA adds the EWI `PNDSPY1211` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1211 => pandas.core.series.Series.sub has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.sub()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().sub(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1212
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1212.md
section: Migrations
---

# PNDSPY1212

**Message** Pandas < **pandas.core.series.Series.subtract** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.subtract()
```

### Output

The SMA adds the EWI `PNDSPY1212` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1212 => pandas.core.series.Series.subtract has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.subtract()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().subtract(level=value)

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1213
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1213.md
section: Migrations
---

# PNDSPY1213

**Message** Pandas < **pandas.core.series.Series.truediv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `level`

**Reason:** Raises division by zero exception when right hand hand side contains at least one zero. pandas allows division by zero for non-object type Series and returns +/-inf.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.truediv()
```

### Output

The SMA adds the EWI `PNDSPY1213` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1213 => pandas.core.series.Series.truediv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.truediv()
```

## Recommended fix

The parameter `level` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().truediv(level=value)

**Data type consideration**: Raises division by zero exception when right hand hand side contains at least one zero. pandas allows division by zero for non-object type Series and returns +/-inf.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1214
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1214.md
section: Migrations
---

# PNDSPY1214

**Message** Pandas < **pandas.core.series.Series.unstack** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `sort`

**Reason:** N for non-integer level.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.unstack()
```

### Output

The SMA adds the EWI `PNDSPY1214` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1214 => pandas.core.series.Series.unstack has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.unstack()
```

## Recommended fix

The parameter `sort` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().unstack(sort=value)

**Behavioral note**: N for non-integer level.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1215
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1215.md
section: Migrations
---

# PNDSPY1215

**Message** Pandas < **pandas.core.series.Series.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** See std.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series([1, 2, 3, 4, 5])
result = s.var()
```

### Output

The SMA adds the EWI `PNDSPY1215` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1215 => pandas.core.series.Series.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series([1, 2, 3, 4, 5])
result = s.var()
```

## Recommended fix

**Behavioral note**: See std.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1216
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1216.md
section: Migrations
---

# PNDSPY1216

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.__getitem__** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.__getitem__
```

### Output

The SMA adds the EWI `PNDSPY1216` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1216 => pandas.core.strings.accessor.StringMethods.__getitem__ has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.__getitem__
```

## Recommended fix

**Data type consideration**: For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1217
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1217.md
section: Migrations
---

# PNDSPY1217

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.contains** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the na parameter is set to a non-bool value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.contains
```

### Output

The SMA adds the EWI `PNDSPY1217` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1217 => pandas.core.strings.accessor.StringMethods.contains has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.contains
```

## Recommended fix

**NULL/NaN handling difference**: N if the na parameter is set to a non-bool value.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1218
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1218.md
section: Migrations
---

# PNDSPY1218

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.endswith** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the na parameter is set to a non-bool value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.endswith
```

### Output

The SMA adds the EWI `PNDSPY1218` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1218 => pandas.core.strings.accessor.StringMethods.endswith has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.endswith
```

## Recommended fix

**NULL/NaN handling difference**: N if the na parameter is set to a non-bool value.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1219
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1219.md
section: Migrations
---

# PNDSPY1219

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.get** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.get
```

### Output

The SMA adds the EWI `PNDSPY1219` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1219 => pandas.core.strings.accessor.StringMethods.get has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.get
```

## Recommended fix

**Data type consideration**: For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1220
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1220.md
section: Migrations
---

# PNDSPY1220

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.isdigit** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Does not check for special digits, like superscripted and subscripted digits in unicode.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.isdigit
```

### Output

The SMA adds the EWI `PNDSPY1220` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1220 => pandas.core.strings.accessor.StringMethods.isdigit has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.isdigit
```

## Recommended fix

**Behavioral note**: Does not check for special digits, like superscripted and subscripted digits in unicode.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1221
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1221.md
section: Migrations
---

# PNDSPY1221

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.len** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.len
```

### Output

The SMA adds the EWI `PNDSPY1221` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1221 => pandas.core.strings.accessor.StringMethods.len has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.len
```

## Recommended fix

**Data type consideration**: For the column data type, only string, list, and dict values are supported. All column values must be of the same type.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1222
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1222.md
section: Migrations
---

# PNDSPY1222

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.lstrip** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if to_strip is non-string.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.lstrip
```

### Output

The SMA adds the EWI `PNDSPY1222` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1222 => pandas.core.strings.accessor.StringMethods.lstrip has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.lstrip
```

## Recommended fix

**Behavioral note**: N if to_strip is non-string.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1223
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1223.md
section: Migrations
---

# PNDSPY1223

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.replace** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if pat is non-string, repl is a non-string, or n is non-numeric or zero.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.replace
```

### Output

The SMA adds the EWI `PNDSPY1223` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1223 => pandas.core.strings.accessor.StringMethods.replace has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.replace
```

## Recommended fix

**Behavioral note**: N if pat is non-string, repl is a non-string, or n is non-numeric or zero.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1224
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1224.md
section: Migrations
---

# PNDSPY1224

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.rstrip** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if to_strip is non-string.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.rstrip
```

### Output

The SMA adds the EWI `PNDSPY1224` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1224 => pandas.core.strings.accessor.StringMethods.rstrip has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.rstrip
```

## Recommended fix

**Behavioral note**: N if to_strip is non-string.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1225
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1225.md
section: Migrations
---

# PNDSPY1225

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.slice** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** For the column data type, only string, list, and dict values are supported. All column values must be of the same type. N if column has list values and step != 1.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.slice
```

### Output

The SMA adds the EWI `PNDSPY1225` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1225 => pandas.core.strings.accessor.StringMethods.slice has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.slice
```

## Recommended fix

**Data type consideration**: For the column data type, only string, list, and dict values are supported. All column values must be of the same type. N if column has list values and step != 1.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1226
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1226.md
section: Migrations
---

# PNDSPY1226

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.split** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if pat is non-string, n is non-numeric, or regex is set.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.split
```

### Output

The SMA adds the EWI `PNDSPY1226` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1226 => pandas.core.strings.accessor.StringMethods.split has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.split
```

## Recommended fix

**Behavioral note**: N if pat is non-string, n is non-numeric, or regex is set.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1227
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1227.md
section: Migrations
---

# PNDSPY1227

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.startswith** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if the na parameter is set to a non-bool value.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.startswith
```

### Output

The SMA adds the EWI `PNDSPY1227` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1227 => pandas.core.strings.accessor.StringMethods.startswith has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.startswith
```

## Recommended fix

**NULL/NaN handling difference**: N if the na parameter is set to a non-bool value.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1228
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1228.md
section: Migrations
---

# PNDSPY1228

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.strip** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if to_strip is non-string.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.strip
```

### Output

The SMA adds the EWI `PNDSPY1228` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1228 => pandas.core.strings.accessor.StringMethods.strip has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.strip
```

## Recommended fix

**Behavioral note**: N if to_strip is non-string.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1229
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1229.md
section: Migrations
---

# PNDSPY1229

**Message** Pandas < **pandas.core.strings.accessor.StringMethods.translate** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N if any value in table has multiple characters.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.translate
```

### Output

The SMA adds the EWI `PNDSPY1229` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1229 => pandas.core.strings.accessor.StringMethods.translate has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
s = pd.Series(['abc', 'def', 'ghi'])
result = s.str.translate
```

## Recommended fix

**Behavioral note**: N if any value in table has multiple characters.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1230
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1230.md
section: Migrations
---

# PNDSPY1230

**Message** Pandas < **pandas.core.tools.datetimes.to_datetime** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `cache is ignored`

**Reason:** N: - if format is None or not supported in Snowflake - or if params exact, infer_datetime_format is given - or origin == “julian” - or arg is DataFrame and data type is not int - or arg is Series and data type is string.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_datetime(df)
```

### Output

The SMA adds the EWI `PNDSPY1230` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1230 => pandas.core.tools.datetimes.to_datetime has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_datetime(df)
```

## Recommended fix

The parameter `cache is ignored` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().to_datetime(cache is ignored=value)

**Data type consideration**: N: - if format is None or not supported in Snowflake - or if params exact, infer_datetime_format is given - or origin == “julian” - or arg is DataFrame and data type is not int - or arg is Series and data type is string.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1231
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1231.md
section: Migrations
---

# PNDSPY1231

**Message** Pandas < **pandas.core.tools.numeric.to_numeric** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `downcast is ignored`

**Reason:** N if error == “ignore”.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_numeric(df)
```

### Output

The SMA adds the EWI `PNDSPY1231` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1231 => pandas.core.tools.numeric.to_numeric has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_numeric(df)
```

## Recommended fix

The parameter `downcast is ignored` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().to_numeric(downcast is ignored=value)

**Behavioral note**: N if error == “ignore”.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1232
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1232.md
section: Migrations
---

# PNDSPY1232

**Message** Pandas < **pandas.core.tools.timedeltas.to_timedelta** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `errors`

**Reason:** N if errors is given or converting from string type.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_timedelta(df)
```

### Output

The SMA adds the EWI `PNDSPY1232` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1232 => pandas.core.tools.timedeltas.to_timedelta has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.to_timedelta(df)
```

## Recommended fix

The parameter `errors` is not supported in Snowpark pandas. If your code uses this parameter, consider one of these approaches:

1. **Remove the parameter**: If the parameter is not essential for your use case, simply remove it from the function call.
2. **Use default behavior**: The function will work with default values for the unsupported parameter.
3. **Post-process with native pandas**: If the parameter is critical, collect the result using `.to_pandas()` and apply the operation with native pandas:
   .. code-block:: python

   > # Convert to native pandas for unsupported parameter
   > result = df.to_pandas().to_timedelta(errors=value)

**Data type consideration**: N if errors is given or converting from string type.

Ensure data types are compatible:

* Check column dtypes with `df.dtypes` before the operation
* Use `.astype()` to convert columns to expected types
* Numeric operations may require explicit casting: `df['col'].astype(float)`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1233
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1233.md
section: Migrations
---

# PNDSPY1233

**Message** Pandas < **pandas.core.window.ewm.ExponentialMovingWindow.corr** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

### Output

The SMA adds the EWI `PNDSPY1233` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1233 => pandas.core.window.ewm.ExponentialMovingWindow.corr has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

## Recommended fix

**Behavioral note**: N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1234
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1234.md
section: Migrations
---

# PNDSPY1234

**Message** Pandas < **pandas.core.window.ewm.ExponentialMovingWindow.mean** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

### Output

The SMA adds the EWI `PNDSPY1234` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1234 => pandas.core.window.ewm.ExponentialMovingWindow.mean has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1235
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1235.md
section: Migrations
---

# PNDSPY1235

**Message** Pandas < **pandas.core.window.ewm.ExponentialMovingWindow.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

### Output

The SMA adds the EWI `PNDSPY1235` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1235 => pandas.core.window.ewm.ExponentialMovingWindow.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1236
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1236.md
section: Migrations
---

# PNDSPY1236

**Message** Pandas < **pandas.core.window.ewm.ExponentialMovingWindow.sum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

### Output

The SMA adds the EWI `PNDSPY1236` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1236 => pandas.core.window.ewm.ExponentialMovingWindow.sum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1237
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1237.md
section: Migrations
---

# PNDSPY1237

**Message** Pandas < **pandas.core.window.ewm.ExponentialMovingWindow.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

### Output

The SMA adds the EWI `PNDSPY1237` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1237 => pandas.core.window.ewm.ExponentialMovingWindow.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1238
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1238.md
section: Migrations
---

# PNDSPY1238

**Message** Pandas < **pandas.core.window.expanding.Expanding.corr** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

### Output

The SMA adds the EWI `PNDSPY1238` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1238 => pandas.core.window.expanding.Expanding.corr has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

## Recommended fix

**Behavioral note**: N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1239
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1239.md
section: Migrations
---

# PNDSPY1239

**Message** Pandas < **pandas.core.window.expanding.Expanding.count** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.count
```

### Output

The SMA adds the EWI `PNDSPY1239` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1239 => pandas.core.window.expanding.Expanding.count has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.count
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1240
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1240.md
section: Migrations
---

# PNDSPY1240

**Message** Pandas < **pandas.core.window.expanding.Expanding.max** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.max
```

### Output

The SMA adds the EWI `PNDSPY1240` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1240 => pandas.core.window.expanding.Expanding.max has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.max
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1241
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1241.md
section: Migrations
---

# PNDSPY1241

**Message** Pandas < **pandas.core.window.expanding.Expanding.mean** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

### Output

The SMA adds the EWI `PNDSPY1241` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1241 => pandas.core.window.expanding.Expanding.mean has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1242
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1242.md
section: Migrations
---

# PNDSPY1242

**Message** Pandas < **pandas.core.window.expanding.Expanding.min** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.min
```

### Output

The SMA adds the EWI `PNDSPY1242` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1242 => pandas.core.window.expanding.Expanding.min has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.min
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1243
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1243.md
section: Migrations
---

# PNDSPY1243

**Message** Pandas < **pandas.core.window.expanding.Expanding.sem** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sem
```

### Output

The SMA adds the EWI `PNDSPY1243` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1243 => pandas.core.window.expanding.Expanding.sem has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sem
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1244
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1244.md
section: Migrations
---

# PNDSPY1244

**Message** Pandas < **pandas.core.window.expanding.Expanding.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

### Output

The SMA adds the EWI `PNDSPY1244` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1244 => pandas.core.window.expanding.Expanding.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1245
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1245.md
section: Migrations
---

# PNDSPY1245

**Message** Pandas < **pandas.core.window.expanding.Expanding.sum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

### Output

The SMA adds the EWI `PNDSPY1245` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1245 => pandas.core.window.expanding.Expanding.sum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1246
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1246.md
section: Migrations
---

# PNDSPY1246

**Message** Pandas < **pandas.core.window.expanding.Expanding.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

### Output

The SMA adds the EWI `PNDSPY1246` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1246 => pandas.core.window.expanding.Expanding.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1247
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1247.md
section: Migrations
---

# PNDSPY1247

**Message** Pandas < **pandas.core.window.rolling.Rolling.corr** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

### Output

The SMA adds the EWI `PNDSPY1247` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1247 => pandas.core.window.rolling.Rolling.corr has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.corr
```

## Recommended fix

**Behavioral note**: N for non-integer window, axis = 1, pairwise = True, other = None, or min_periods != window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1248
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1248.md
section: Migrations
---

# PNDSPY1248

**Message** Pandas < **pandas.core.window.rolling.Rolling.count** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.count
```

### Output

The SMA adds the EWI `PNDSPY1248` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1248 => pandas.core.window.rolling.Rolling.count has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.count
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1249
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1249.md
section: Migrations
---

# PNDSPY1249

**Message** Pandas < **pandas.core.window.rolling.Rolling.max** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.max
```

### Output

The SMA adds the EWI `PNDSPY1249` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1249 => pandas.core.window.rolling.Rolling.max has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.max
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1250
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1250.md
section: Migrations
---

# PNDSPY1250

**Message** Pandas < **pandas.core.window.rolling.Rolling.mean** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

### Output

The SMA adds the EWI `PNDSPY1250` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1250 => pandas.core.window.rolling.Rolling.mean has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1251
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1251.md
section: Migrations
---

# PNDSPY1251

**Message** Pandas < **pandas.core.window.rolling.Rolling.min** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.min
```

### Output

The SMA adds the EWI `PNDSPY1251` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1251 => pandas.core.window.rolling.Rolling.min has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.min
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1252
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1252.md
section: Migrations
---

# PNDSPY1252

**Message** Pandas < **pandas.core.window.rolling.Rolling.sem** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sem
```

### Output

The SMA adds the EWI `PNDSPY1252` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1252 => pandas.core.window.rolling.Rolling.sem has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sem
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1253
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1253.md
section: Migrations
---

# PNDSPY1253

**Message** Pandas < **pandas.core.window.rolling.Rolling.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

### Output

The SMA adds the EWI `PNDSPY1253` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1253 => pandas.core.window.rolling.Rolling.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1254
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1254.md
section: Migrations
---

# PNDSPY1254

**Message** Pandas < **pandas.core.window.rolling.Rolling.sum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

### Output

The SMA adds the EWI `PNDSPY1254` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1254 => pandas.core.window.rolling.Rolling.sum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1255
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1255.md
section: Migrations
---

# PNDSPY1255

**Message** Pandas < **pandas.core.window.rolling.Rolling.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

### Output

The SMA adds the EWI `PNDSPY1255` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1255 => pandas.core.window.rolling.Rolling.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1256
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1256.md
section: Migrations
---

# PNDSPY1256

**Message** Pandas < **pandas.core.window.rolling.Window.mean** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

### Output

The SMA adds the EWI `PNDSPY1256` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1256 => pandas.core.window.rolling.Window.mean has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.mean
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1257
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1257.md
section: Migrations
---

# PNDSPY1257

**Message** Pandas < **pandas.core.window.rolling.Window.std** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

### Output

The SMA adds the EWI `PNDSPY1257` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1257 => pandas.core.window.rolling.Window.std has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.std
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1258
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1258.md
section: Migrations
---

# PNDSPY1258

**Message** Pandas < **pandas.core.window.rolling.Window.sum** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

### Output

The SMA adds the EWI `PNDSPY1258` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1258 => pandas.core.window.rolling.Window.sum has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.sum
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1259
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1259.md
section: Migrations
---

# PNDSPY1259

**Message** Pandas < **pandas.core.window.rolling.Window.var** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

### Output

The SMA adds the EWI `PNDSPY1259` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1259 => pandas.core.window.rolling.Window.var has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3, 4, 5]})
rolling = df.rolling(window=2)
result = rolling.var
```

## Recommended fix

**Behavioral note**: N for axis = 1 or min_periods = 0. N for string window with center = True. N for Timedelta or BaseIndexer window.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1260
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1260.md
section: Migrations
---

# PNDSPY1260

**Message** Pandas < **pandas.io.json._json.read_json** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `orient`, `typ`, `dtype`, `convert_axes`, `lines`, `convert_dates`, `date_unit`, `keep_default_dates`, `encoding_errors`, `nrows`, `and chunksize will raise an error. precise_float`, `engine`, `dtype_backend`, `and storage_options are ignored.`

**Reason:** P: - if ndjson files are passed - Supported parameters are compression and encoding.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_json(df)
```

### Output

The SMA adds the EWI `PNDSPY1260` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1260 => pandas.io.json._json.read_json has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_json(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `orient`, `typ`, `dtype`, `convert_axes`, `lines`, `convert_dates`, `date_unit`, `keep_default_dates`, `encoding_errors`, `nrows`, `and chunksize will raise an error. precise_float`, `engine`, `dtype_backend`, `and storage_options are ignored.`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.read_json(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: P: - if ndjson files are passed - Supported parameters are compression and encoding.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1261
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1261.md
section: Migrations
---

# PNDSPY1261

**Message** Pandas < **pandas.io.parquet.read_parquet** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Missing or Unsupported Parameters:** `use_nullable_dtypes`, `filesystem`, `and filters will raise an error if used. engine`, `storage_options`, `dtype_backend`, `and **kwargs are ignored.`

**Reason:** Supported parameter(s) are: columns.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_parquet(df)
```

### Output

The SMA adds the EWI `PNDSPY1261` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1261 => pandas.io.parquet.read_parquet has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_parquet(df)
```

## Recommended fix

The following parameters are not supported in Snowpark pandas: `use_nullable_dtypes`, `filesystem`, `and filters will raise an error if used. engine`, `storage_options`, `dtype_backend`, `and **kwargs are ignored.`.

**Recommended approaches:**

1. **Avoid unsupported parameters**: Modify your code to not use these parameters if they are not essential.
2. **Use :code:`.to_pandas()` for full compatibility**: If you need these parameters, convert to native pandas first:
   .. code-block:: python

   > # Convert to native pandas when unsupported parameters are needed
   > native_df = df.to_pandas()
   > result = native_df.read_parquet(…) # Use all parameters
3. **Split the operation**: Perform supported operations in Snowpark pandas, then use native pandas only for the unsupported functionality.

**Behavioral note**: Supported parameter(s) are: columns.

This behavior may differ from native pandas. Recommended actions:

* Test with a representative sample of your data
* Compare results with native pandas if precision is critical
* Use `.to_pandas()` if exact pandas behavior is required

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: PNDSPY1262
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/PNDSPY1262.md
section: Migrations
---

# PNDSPY1262

**Message** Pandas < **pandas.io.parsers.readers.read_csv** > has a partial mapping with a few scenarios not supported in Snowpark.

**Category** Warning

## Description

This issue appears when the SMA detects the use of a pandas element that has a direct equivalent in Snowpark pandas, but some scenarios might behave differently than pandas.

**Reason:** Reads both local and staged file(s) into a Snowpark pandas DataFrame. Note, the order of rows in the may differ from the order of rows in the original file(s) if using staged csvs. Local files are parsed with native pandas and thus support most of the parameters supported by pandas itself. The usecols and names parameter are applied after creating a temp table in snowflake. Previously staged files will use the Snowflake COPY FROM parser and schema inference. If you need to use staged files often, it is recommended that you upload these as parquet files to improve performance. You can force the use of the Snowflake parser with engine=snowflake.

## Scenario

A method with a few scenarios that aren’t supported in Snowpark.

### Input

The following example shows a method with a few unsupported scenarios in Snowpark.

```python
import pandas as pd

df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_csv(df)
```

### Output

The SMA adds the EWI `PNDSPY1262` to the output code to let you know that this element has a few scenarios that aren’t supported in Snowpark.

```python
import snowflake.snowpark.modin.pandas as pd

#EWI: PNDSPY1262 => pandas.io.parsers.readers.read_csv has a partial mapping, with few scenarios not supported. Check Snowpark pandas documentation for more detail.
df = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]})
result = pd.read_csv(df)
```

## Recommended fix

**NULL/NaN handling difference**: Reads both local and staged file(s) into a Snowpark pandas DataFrame. Note, the order of rows in the may differ from the order of rows in the original file(s) if using staged csvs. Local files are parsed with native pandas and thus support most of the parameters supported by pandas itself. The usecols and names parameter are applied after creating a temp table in snowflake. Previously staged files will use the Snowflake COPY FROM parser and schema inference. If you need to use staged files often, it is recommended that you upload these as parquet files to improve performance. You can force the use of the Snowflake parser with engine=snowflake.

Snowpark pandas may handle NULL/NaN values differently:

* Pre-filter NULL values using `.dropna()` or `.fillna()` before the operation
* Verify NULL handling behavior with a small sample dataset
* Use explicit NULL checks: `df[df['column'].notna()]`

## Additional recommendations

Check the [Snowpark pandas documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/modin/supported/index) to verify which scenarios aren’t supported for that specific element.

---
title: Prerequisites
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-ewi-assistant-walkthrough/prerequisites.md
section: Migrations
---

# Prerequisites

To use the SMA AI Assistant, you must first complete the following prerequisites.
Minimum requirements
====================

1. Install and execute the Snowpark Migration Accelerator (SMA). For optimal accuracy, use the latest available version.
2. Install the Snowflake VS Code Extension and confirm that its AI Assistant feature is activated.

   This is crucial for using it with the Snowpark Migration Accelerator results.
3. Set up a Snowflake account.

---
title: Processing Databricks files
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/notebooks/databricks/databricks-overview.md
section: Migrations
---

# Processing Databricks files

This document describes how the Snowpark Migration Accelerator (SMA) processes Databricks files based on their file extensions during the inventory and migration phases.

## File processing by extension

The SMA recognizes and processes various Databricks file formats. Each file type is handled according to its structure and origin.

### SQL files

| Extension | Format | Description |
| --- | --- | --- |
| .sql | JSON cells | Inventoried by the SMA. Typically extracted from a `.dbc` file. |
| .sql | First-line-comment | Databricks notebook exported to SQL format. Inventoried by the SMA. |

#### Example: SQL with JSON cells format

```json
{
  "version": "NotebookV1",
  "commands": [
    {
      "command": "CREATE TABLE customers (\n  id INT,\n  name STRING\n)",
      "commandType": "sql"
    },
    {
      "command": "SELECT * FROM customers",
      "commandType": "sql"
    }
  ]
}
```

#### Example: SQL with first-line-comment format

```sql
-- Databricks notebook source
CREATE TABLE customers (
  id INT,
  name STRING
)

-- COMMAND ----------

SELECT * FROM customers
```

### Python files

| Extension | Format | Description |
| --- | --- | --- |
| .python | JSON cells | Inventoried by the SMA. Typically extracted from a `.dbc` file. |
| .py | First-line-comment | Databricks notebook exported to Python format. Inventoried by the SMA. |

#### Example: Python with JSON cells format

```json
{
  "version": "NotebookV1",
  "commands": [
    {
      "command": "df = spark.read.table(\"customers\")",
      "commandType": "python"
    },
    {
      "command": "df.filter(df.status == \"active\").show()",
      "commandType": "python"
    }
  ]
}
```

#### Example: Python with first-line-comment format (.py)

```python
# Databricks notebook source
df = spark.read.table("customers")

# COMMAND ----------

df.filter(df.status == "active").show()
```

### Scala files

| Extension | Format | Description |
| --- | --- | --- |
| .scala | JSON cells | Inventoried by the SMA. Typically extracted from a `.dbc` file. |
| .scala | First-line-comment | Databricks notebook exported to Scala format. Inventoried by the SMA. |

#### Example: Scala with JSON cells format

```json
{
  "version": "NotebookV1",
  "commands": [
    {
      "command": "val df = spark.read.table(\"customers\")",
      "commandType": "scala"
    },
    {
      "command": "df.filter($\"status\" === \"active\").show()",
      "commandType": "scala"
    }
  ]
}
```

#### Example: Scala with first-line-comment format

```scala
// Databricks notebook source
val df = spark.read.table("customers")

// COMMAND ----------

df.filter($"status" === "active").show()
```

### Databricks archive files

| Extension | Description |
| --- | --- |
| .dbc | Databricks compressed archive file. The SMA extracts and analyzes its contents. |

#### Example: DBC file structure

A `.dbc` file is a ZIP archive containing notebook files. When extracted, the structure looks like the following:

```text
my_project.dbc (extracted)
|-- notebook1.python
|-- notebook2.sql
|-- folder/
|   |-- notebook3.python
|   |-- notebook4.scala
|-- utils/
    |-- helpers.python
```

## How it works

* **DBC Files**: When the SMA encounters a `.dbc` file, it automatically extracts the compressed contents and processes each file individually based on its extension.
* **JSON Cells Format**: Files with JSON cell structure are native Databricks notebook formats, typically found inside `.dbc` archives. These contain cell definitions with metadata, source code, and outputs.
* **First-Line-Comment Format**: Files exported from Databricks using the export functionality contain a special comment in the first line that identifies them as Databricks notebooks. The SMA recognizes this pattern and processes them accordingly.

## Inventory process

During the inventory phase, the SMA:

1. Scans all provided files and directories.
2. Identifies file types based on extension and internal structure.
3. Catalogs each notebook with its language, cell count, and dependencies.
4. Prepares the files for the translation phase.

---
title: Prompt guide for AI assessment
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/ai-assessment-promptguide.md
section: Migrations
---

# Prompt guide for AI assessment

Use the suggested prompts on this page to use the `snowconvert-assessment` skill of the Cortex Code CLI agent.
These prompts can be used to direct the Cortex Code CLI agent to customize the assessment of specific sections of the source database code.

You can use the prompts to customize the suggested migration plan in the generated unified report.
Refer to [AI assessment report](ai-assessment.md) for more information on the unified report.

> **Note:**
>
> Snowflake strongly recommends installing Python (3.11 or later) and the [uv package manager](https://docs.astral.sh/uv/) to avoid issues related to environment dependencies.

## Invoke the skill

When you invoke the `snowconvert-assessment` skill, the Cortex Code CLI agent responds by:

1. Showing a welcome message explaining the functionality.
2. Confirming your request before running any analysis.
3. Asking for more details on path, goals, preferences, if needed.

## Example prompt scenarios

1. Start by invoking the `snowconvert-assessment` skill as shown below:

   ```none
   Use skill snowconvert-assessment
   ```

   The Cortex Code CLI agent will ask for the following:

   1. Path to your SnowConvert AI reports.
   2. Path to the file location for the assessment report that will be generated as output.
   3. Any specific goals or preferences for the assessment.
2. Start the assessment with the following prompt:

   ```none
   Run a comprehensive assessment with all the analyses.

   Analyze my SnowConvert AI reports at /path/to/Reports.

   Start fresh analysis (do not reuse previous results).
   ```
3. Set specific goals for the Cortex Code CLI agent. You can limit the number of deployment waves with the following prompt:

   ```none
   I want a maximum of five deployment waves.
   ```

   You can also specify the range of deployment waves as shown below:

   ```none
   Create 3-4 deployment waves.
   ```
4. To change the size of the deployment waves, use the following prompt as shown below:

   ```none
   Waves should have 20-30 objects each.
   ```
5. To force the wave to contain a fixed number of objects as shown below:

   ```none
   I need smaller batches with a maximum of 15 objects per wave.
   ```
6. You can prioritize objects in the deployment waves based on the business functions. For example, to prioritize all payroll-related objects in a specific deployment wave, use the following prompt:

   ```none
   Prioritize all Payroll-related objects in Wave 1.

   Put all customer* objects in the earliest waves.

   I need PKG_PAYROLL, PKG_HR, and PKG_FINANCE deployed first.
   ```
7. Once you have initial results, you can refine the AI assessment report.

   * To reorganize the objects in the suggested deployment waves, use the following prompt:

     ```none
     Move dbo.CriticalTable to Wave 1.

     Relocate all reporting procedures to Wave 5.
     ```
   * To investigate dependencies, use the following prompt:

     ```none
     Show me which objects have circular dependencies

     What objects are blocking the migration?

     Which objects depend on dbo.LegacyTable?
     ```
8. After investigations, you can regenerate the assessment report with the following prompts:

   ```none
   Generate a new HTML report.

   Regenerate with smaller batch sizes.

   Redo the analysis excluding the Staging schema.
   ```

   The Cortex Code CLI agent maintains the context. There is no need to start over.
9. To analyze the results of the assessment report, you can ask targeted questions using the following prompts:

   ```none
   How many objects are flagged for exclusion?

   What is the breakdown by schema?

   Show me a summary of the assessment.
   ```
10. You can drill down into the generated assessment report with the following prompts:

> * For objects categorized under the Exclusion Report:
>
>   ```none
>   Identify temporary and staging objects.
>
>   Find deprecated objects that can be excluded.
>   ```
> * For Dynamic SQL Report review, use the following prompt:
>
>   ```none
>   Analyze Dynamic SQL patterns in my codebase.
>   ```
> * For SSIS/ETL package analysis, use the following prompt:
>
>   ```none
>   Assess my SSIS packages for classification and migration complexity.
>   ```

## Tips for running AI assessments

The `snowconvert-assessment` skill is a powerful tool that can be used to generate actionable migration plans for complex workloads.
This section contains helpful tips that optimize the skill for best results.

1. Use a structured approach with target goals for maximum efficiency.

   **Example of an inefficient prompt sequence:**

   Prompt 1:

   ```none
   Generate waves
   ```

   Prompt 2:

   ```none
   Make the waves smaller
   ```

   Prompt 3:

   ```none
   Prioritize Payroll objects
   ```

   **Example of an efficient prompt:**

   ```none
   Generate waves with 20-30 objects each, prioritizing all Payroll-related objects
   ```
2. Use wildcards to expand the object selection to include all related objects.

   **Example of supported wildcard patterns:**

   > * `*payroll*` matches all objects containing the term “payroll” in the object name.
   > * `PKG_*` matches all objects starting with `PKG_` in the object name.
   > * `dbo.Customer*` matches all objects in the dbo schema starting with “Customer”.
   > * `*_Archive` matches all objects ending with “_Archive”.
3. Select dependency-based ordering or category-based ordering for deployment waves. By default, the Cortex Code CLI agent organizes the objects in the deployment waves based on their category, in the following order:

   > 1. Tables
   > 2. Views
   > 3. Procedures and functions
   > 4. ETL/SSIS packages

> If you prefer a dependency-based ordering:
>
> ```none
> Use dependency-based ordering instead of category-based
> ```

## Troubleshooting

**I want different wave sizes**

Use the following prompt to specify the minimum and maximum number of objects per wave:

```none
Regenerate waves with a minimum of 15 and maximum of 30 objects per wave
```

**Important objects are late in the waves**

Use the following prompt to move important objects to the earlier waves:

```none
Prioritize *CriticalProcess* objects to appear in Wave 1.
```

or reorganize after generation:

```none
Move dbo.CriticalTable to Wave 1.
```

**I have too many waves**

Use the following prompt to reduce the number of waves by increasing the number of objects per wave:

```none
Regenerate with larger waves - 60-100 objects each
```

**There are objects with circular dependencies**

Review the `cycles.txt` file from the SnowConvert AI reports and consider schema refactoring, or deploying circular dependency groups together, or manual intervention for complex cases.

## Frequently asked questions

1. Can I run just one type of analysis/assessment (for example, waves only)?

   Yes, you can run just one type of analysis/assessment by specifying the name of the report in the prompt (for example, deployment waves only).
2. How do I update the analysis/assessment after code changes?

   Re-run the assessment with updated SnowConvert AI reports, or specify “Start fresh analysis” in the prompt.
3. Can I export the data from the assessment report?

   Yes, the interactive `multi_report.html` report allows you to export the data from the assessment report into `csv` files.
4. What if I disagree with the object exclusion recommendations?

   The object exclusions are recommendations only. You can decide what to exclude.
5. How do I handle the objects that the Cortex Code CLI agent cannot categorize?

   Review the objects in the report manually. Such objects are flagged as “Uncertain items” in the report.

---
title: Release Notes - Snowflake Data Validation CLI
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/release_notes.md
section: Migrations
---

# Release Notes - Snowflake Data Validation CLI

## Version 1.2.0 (January 2026)

### What’s New

* **View Validation:** Full support for validating database views alongside tables

  + Available only for SQL SERVER.
  + Configure views in a dedicated `views:` section in your YAML configuration
  + Supports all table configuration options including column selection, filtering, column mappings, and chunking
  + Override target database, schema, or view name with `target_database`, `target_schema`, and `target_name` options
  + Views are validated by creating temporary tables internally to materialize schema for comparison

### Usage Example

```yaml
# Views are configured similarly to tables
views:
  - fully_qualified_name: INVENTORY.dbo.INTEGRATION_TEST_VIEW_1
    target_name: INTEGRATION_TEST_VIEW_1
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [ORDERID]
    target_index_column_list: [ORDERID]
```

* **Snowflake to Snowflake Validation:** Full support for validating data between Snowflake instances.

---

## Version 1.0.2 (December 2025)

### What’s New

* **New Command:** `column-partitioning-helper` - Interactive helper to partition wide tables by columns for more efficient validation of tables with many columns

### Changes

* **Command Renamed:** `table-partitioning-helper` has been renamed to `row-partitioning-helper` for improved clarity on its purpose

---

## Version 1.0.1 (December 2025)

### What’s New

This release introduces an enhancement to improve the CLI user experience whenaccessing help documentation.

### Example

```bash
# These commands no longer create log files
sdv --help
sdv sqlserver --help
sdv sqlserver run-validation --help

# These commands still create log files normally
sdv sqlserver run-validation --data-validation-config-file config.yaml
```

---

## Documentation

For complete information about using the Snowflake Data Validation CLI, refer to:

* **[Documentation Index](index.md)** - Start here for navigation to all documentation
* **[CLI Usage Guide](CLI_USAGE_GUIDE.md)** - Comprehensive CLI documentation
* **[Quick Reference Guide](CLI_QUICK_REFERENCE.md)** - Fast lookup reference
* **[Configuration Examples](CONFIGURATION_EXAMPLES.md)** - Ready-to-use configuration examples
* **[SQL Server Commands](sqlserver_commands.md)** - SQL Server specific commands
* **[Teradata Commands](teradata_commands.md)** - Teradata specific commands
* **[Redshift Commands](redshift_commands.md)** - Redshift specific commands

---

## Support

If you encounter any issues or have questions:

* **Email:** [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)
* **Documentation:** [Full Documentation](https://github.com/snowflake-eng/migrations-data-validation)
* **Issues:** [GitHub Issues](https://github.com/snowflake-eng/migrations-data-validation/issues)

---
title: Selecting objects for AI code conversion
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/ai-verification/select-objects-for-ai-conversion.md
section: Migrations
---

# Selecting objects for AI code conversion

This topic describes the functions available on the **Select Objects** page, when running AI code conversion.

## Tree view

The tree view as shown above displays the structure of databases and schemas in the converted workload, along with the conversion status of each code unit. You can perform the following actions on the database objects shown in a tree view under the **Name** column:

* Select the database objects to run AI code conversion on.
* Mark objects as **Verified by user** to exclude them from subsequent runs of the AI code conversion process. This status indicates that the user has reviewed the code and considers it valid for deployment. It represents the highest level of trust.
* Filter the database objects by type, database, schema and status.

## Selection Summary

The **Selection Summary** indicates the number of selected objects, the associated dependencies, and the estimated time and cost for AI code conversion. Note that the total number of objects includes the object selected and its dependencies. The estimated time and costs depend upon the size of the selected objects and token usage based on historical benchmark data.

**Total objects**: Shows the total number of objects to be processed, including both the user’s selection and any dependent objects.

**Estimated cost in Snowflake credits**: Shows the estimate based on object size and historical benchmark data. It calculates total token usage converted into standard Snowflake credit costs.

## Breakdown

The **Breakdown** section shows all objects grouped by their conversion status as below:

* **Converted successfully**: The object was successfully converted with deterministic conversion (non AI code conversion) and is ready to be deployed to Snowflake.
* **Has issues**: The object has issues from either deterministic or AI code conversion and cannot be deployed without fixes.Try running AI code conversion again or fixing it manually.
* **Suggested fixes**: The AI code conversion process has generated suggested fixes and requires user review.
* **Verified by AI/AI converted**: The AI code conversion process verified that the source code and converted code produced equivalent results. User review is required, additionally.
* **Verified by User**: These objects were reviewed by the user in a previous AI code conversion and are valid for deployment. This status represents the highest level of trust. All objects should be marked as “Verified by User” after the user review and manual fixes.

## Footer actions

After selecting objects and reviewing the **Selection Summary**, the AI code conversion process can be initiated by selecting **AI convert**

The process of running AI code conversion is meant to be iterative. Multiple execution runs can be performed on the same code base until the expected conversion accuracy is achieved. Select **Go to latest conversion** at the bottom of the page to view the results of the last iteration of AI code conversion.

---
title: SMA AI Assistant Usage
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-ewi-assistant-walkthrough/sma-ai-assistant-usage/README.md
section: Migrations
---

# SMA AI Assistant Usage

To ensure correct usage of the SMA AI Assistant, please follow these steps:

1. Open VS Code.
2. Follow the steps in the [AI Assistant Setup](ai-assistant-setup.md) section.
3. Open the migration output folder in VS Code.
4. Open the Snowflake Extension by using the sidebar at the left.

   1. Clicking on an EWI line will directly navigate you to its corresponding source line.
5. Log into your Snowflake account to get explanations and responses.
6. View the SMA EWI Assistant code lens.

   1. Click the SMA AI Assistant code lens to open a window providing explanations and resolutions for the EWI.
7. Log into your Snowflake account to unlock detailed explanations and helpful responses.
8. View the EWI explanation and responses.
9. Follow the assistant responses.

---
title: SMA EWI Assistant walkthrough
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-ewi-assistant-walkthrough/README.md
section: Migrations
---

# SMA EWI Assistant walkthrough

To make migrating to Snowpark faster, you can use a custom AI Assistant to resolve errors, warnings, and issues (EWIs). Integrated with the Snowflake VS Code extension, you can use this tool after running the Snowpark Migration Accelerator tool. For more information on conversion, see the [conversion guide](../../user-guide/snowpark-api-conversion/README.md)).

---
title: SnowConvert AI -  Oracle Conversion Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/oracle-conversion-settings.md
section: Migrations
---

# SnowConvert AI - Oracle Conversion Settings

## General Conversion Settings

### Object Conversion

1. **Transform Synonyms:** Flag to indicate whether or not Synonyms should be transformed. By default, it’s set to true.
2. **Transform Packages to new Schemas:** Flag to indicate whether or not the Packages should be transformed to new Schemas.

   Please check the naming of the procedure enabling and disabling the flag:

**Input**

```sql
CREATE OR REPLACE PACKAGE emp_mgmt AS
PROCEDURE remove_emp (employee_id NUMBER );
END emp_mgmt;

CREATE OR REPLACE PACKAGE BODY emp_mgmt AS
PROCEDURE remove_emp (employee_id NUMBER) IS
   BEGIN
      DELETE FROM employees
      WHERE employees.employee_id = remove_emp.employee_id;
      tot_emps := tot_emps - 1;
   END;
END emp_mgmt;
```

**Output Default**

```none
CREATE SCHEMA IF NOT EXISTS emp_mgmt
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE emp_mgmt.remove_emp (employee_id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      DELETE FROM
         employees
         WHERE employees.employee_id = remove_emp.employee_id;
         tot_emps :=
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
                     tot_emps - 1;
   END;
$$;
```

**Output with param disablePackagesAsSchemas**

```none
-- Additional Params: --disablePackagesAsSchemas
CREATE OR REPLACE PROCEDURE EMP_MGMT_REMOVE_EMP (employee_id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      DELETE FROM
         employees
         WHERE employees.employee_id = remove_emp.employee_id;
         tot_emps :=
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
                     tot_emps - 1;
   END;
$$;
```

3. **Transform Date as Timestamp:**

Flag to indicate whether `SYSDATE` should be transformed into `CURRENT_DATE` *or* `CURRENT_TIMESTAMP`. This will also affect all `DATE` columns that will be transformed to `TIMESTAMP`.

**Input**

```sql
CREATE TABLE DATE_TABLE(
    DATE_COL DATE
);

SELECT SYSDATE FROM DUAL;
```

**Output Default**

```sql
CREATE OR REPLACE TABLE DATE_TABLE (
        DATE_COL TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    SELECT
        CURRENT_TIMESTAMP()
    FROM DUAL;
```

**Output with param disableDateAsTimestamp**

```sql
-- Additional Params: --disableDateAsTimestamp
CREATE OR REPLACE TABLE DATE_TABLE (
        DATE_COL DATE /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    SELECT
        CURRENT_DATE()
    FROM DUAL;
```

4. **Transform OUTER JOINS to ANSI Syntax:** Flag to indicate whether Outer Joins should be transformed to only ANSI syntax.

### Data type mappings

SnowConvert defines default mappings for data type conversions. However, you can point to a JSON file to customize specific data type mappings.

**Customize data types:** You can upload a JSON file to define specific data type transformation rules. This feature allows you to customize how data types are converted during migration.

**Supported transformations include:**

* `NUMBER` to custom `NUMBER` with specific precision and scale
* `NUMBER` to `DECFLOAT` for preserving exact decimal precision

When you upload a data type customization file:

* SnowConvert AI applies your transformation rules during conversion
* Numeric literals in `INSERT` statements targeting customized columns are automatically cast to the appropriate type
* A [TypeMappings Report](../review-results/reports/type-mappings-report.md) is generated showing all data type transformations applied

**JSON Structure:**

The JSON file supports three ways to specify data type changes:

| Method | Scope | Use Case |
| --- | --- | --- |
| `projectTypeChanges.types` | Global | Transform all occurrences of a specific data type |
| `projectTypeChanges.columns` | Global | Transform columns matching a name pattern (case-insensitive substring match) |
| `specificTableTypeChanges.tables` | Table-specific | Transform specific columns in specific tables |

> **Warning:**
>
> **Use column name patterns carefully.** The `projectTypeChanges.columns` rules only apply to columns with `NUMBER` data types, but they match by name pattern without considering the precision or scale of the original `NUMBER` type. This means a pattern like `"MONTH"` will transform **all** matching `NUMBER` columns to the target type, regardless of their original precision (e.g., `NUMBER(10,0)`, `NUMBER(38,18)`, or `NUMBER` without precision). Always review the [TypeMappings Report](../review-results/reports/type-mappings-report.md) after conversion to verify that the transformations were applied correctly.

**Priority order:** When multiple rules apply to the same column, SnowConvert AI uses this priority (highest to lowest):

1. `specificTableTypeChanges` (most specific)
2. `projectTypeChanges.columns` (name pattern)
3. `projectTypeChanges.types` (global type mapping)

**Example JSON configuration:**

```json
{
  "projectTypeChanges": {
    "types": {
      "NUMBER": "DECFLOAT",
      "NUMBER(10, 0)": "NUMBER(18, 0)"
    },
    "columns": [
      {
        "nameExpression": "PRICE",
        "targetType": "DECFLOAT"
      },
      {
        "nameExpression": ".*_AMOUNT$",
        "targetType": "NUMBER(18, 2)"
      }
    ]
  },
  "specificTableTypeChanges": {
    "tables": [
      {
        "tableName": "EMPLOYEES",
        "columns": [
          {
            "columnName": "SALARY",
            "targetType": "NUMBER(15, 2)"
          }
        ]
      }
    ]
  }
}
```

**Download template:** Copy and save the JSON structure above as your starting point.

**Example transformation:**

Given the following Oracle input code:

#### Oracle

```sql
CREATE TABLE employees (
    employee_ID NUMBER,
    manager_YEAR NUMBER(10, 0),
    manager_MONTH NUMBER(10, 0),
    salary NUMBER(12, 2)
);
```

And a JSON customization file with:

* `"NUMBER": "NUMBER(11, 2)"` in `projectTypeChanges.types`
* `"NUMBER(10, 0)": "NUMBER(18, 0)"` in `projectTypeChanges.types`
* `"MONTH"` pattern targeting `NUMBER(2,0)` in `projectTypeChanges.columns`
* `SALARY` column targeting `NUMBER(15, 2)` in `specificTableTypeChanges` for EMPLOYEES table

The output will be:

#### Snowflake

```sql
CREATE OR REPLACE TABLE employees (
    employee_ID NUMBER(11, 2),
    manager_YEAR NUMBER(18, 0),
    manager_MONTH NUMBER(2, 0),
    salary NUMBER(15, 2)
);
```

| Column | Original Type | Transformed To | Rule Applied |
| --- | --- | --- | --- |
| employee_ID | NUMBER | NUMBER(11, 2) | `projectTypeChanges.types` |
| manager_YEAR | NUMBER(10, 0) | NUMBER(18, 0) | `projectTypeChanges.types` |
| manager_MONTH | NUMBER(10, 0) | NUMBER(2, 0) | `projectTypeChanges.columns` (MONTH pattern) |
| salary | NUMBER(12, 2) | NUMBER(15, 2) | `specificTableTypeChanges` (highest priority) |

### General Result Tab

1. **Comment objects with missing dependencies:** This flag indicates whether the user wants to comment on nodes with missing dependencies.
2. **Set encoding of the input files:** Check [General Conversion Settings](general-conversion-settings.md) for more details.

> **Note:**
>
> To review the Settings that apply to all supported languages, go to the following [article](general-conversion-settings.md).

## DB Objects Names Settings

1. **Schema:** The string value specifies the custom schema name to apply. If not specified, the original database name will be used. Example: DB1.**myCustomSchema**.Table1.
2. **Database:** The string value specifies the custom database name to apply. Example: **MyCustomDB**.PUBLIC.Table1.
3. **Default:** None of the above settings will be used in the object names.

## Prepare Code Settings

### **Description**

**Prepare my code:** Flag to indicate whether the input code should be processed before parsing and transformation. This can be useful to improve the parsing process. By default, it’s set to FALSE.

Splits the input code top-level objects into multiple files. The containing folders would be organized as follows:

Copy

```none
└───A new folder named ''[input_folder_name]_Processed''
    └───Top-level object type
        └───Schema name
```

### **Example**

#### **Input**

```none
├───in
│       DDL_Packages.sql
│       DDL_Procedures.sql
│       DDL_Tables.sql
```

#### **Output**

Assume that the name of the files is the name of the top-level objects in the input files.

```none
├───in_Processed
    ├───package
    │   └───MY_SCHEMA
    │           MY_FIRST_PACKAGE.sql
    │           ANOTHER_PACKAGE.sql
    │
    ├───procedure
    │   └───MY_SCHEMA
    │           A_PROCEDURE.sql
    │           ANOTHER_PROCEDURE.sql
    │           YET_ANOTHER_PROCEDURE.sql
    │
    └───table
        └───MY_SCHEMA
                MY_TABLE.sql
                ADDITIONAL_TABLE.sql
                THIRD_TABLE.sql
```

Inside the “schema name” folder, there should be as many files as top-level objects in the input code. Also, it is possible to have copies of some files when multiple same-type top-level objects have the same name. In this case, the file names will be enumerated in ascending order.

### Requirements

To identify top-level objects, a tag must be included in a comment before their declaration. Our [Extraction](../../code-extraction/oracle.md) scripts generate these tags.

The tag should follow the next format:

```none
<sc-top_level_object_type>top_level_object_name</sc-top_level_object_type>
```

You can follow the next example:

```sql
/* <sc-table> MY_SCHEMA.MY_TABLE</sc-table> */
CREATE TABLE "MY_SCHEMA"."MY_TABLE" (
    "MY_COLUMN" VARCHAR2(128)
) ;
```

## Conversion Rate Settings

On this page, you can choose whether the successfully converted code percentage is calculated using lines of code or using the total number of characters. The **character conversion rate** is the default option. You can read the entire rate documentation on the[documentation page](../../../user-guide/snowconvert/README.md).

## Stored Procedures Target Languages Settings

On this page, you can choose whether stored procedures are migrated to JavaScript embedded in Snow SQL, or to Snowflake Scripting. The default option is Snowflake Scripting.

**Reset Settings:** The reset settings option appears on every page. If you’ve made changes, you can reset SnowConvert AI to its original default settings.

---
title: SnowConvert AI -  Renaming Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/renaming-reports.md
section: Migrations
---

# SnowConvert AI - Renaming Report

## What is a renaming object?

It is an object that underwent a name change during the migration, following the changes configured in Redshift Studio.

> **Note:**
>
> The report includes all migrated top-level code units, regardless of whether they underwent renaming or not.

### What information does it contain?

The renaming report is presented in a table format, and contains the following columns:

| Column | Description |
| --- | --- |
| CodeUnit | The type of the Code Unit. |
| SourceDatabase | The source database. |
| SourceSchema | The source schema. |
| SourceName | The source name. |
| SnowflakeDatabase | The Snowflake database. |
| SnowflakeSchema | The Snowflake schema |
| SnowflakeName | The Snowflake name. |

Input Code

```sql
CREATE SCHEMA Renaming_example_schema;

CREATE TABLE Renaming_example_schema.Renaming_example_table_tl (
    id INT,
    name VARCHAR(100)
);

INSERT INTO Renaming_example_schema.Renaming_example_table_tl(id, name) VALUES (1, "tom");

SELECT * FROM Renaming_example_schema.Renaming_example_table_tl;

CREATE TABLE DB_1.MASTER.Renaming_example_table_tl_v2 (
    id INT,
    name VARCHAR(100)
);

INSERT INTO DB_1.MASTER.Renaming_example_table_tl_v2(id, name) VALUES (1, "tom");

SELECT * FROM DB_1.MASTER.Renaming_example_table_tl_v2;

CREATE TABLE NoRenaming_db.NoRenaming_schema.NoRenamingTable_test (
    id INT,
    name VARCHAR(100)
)

INSERT INTO NoRenaming_db.NoRenaming_schema.NoRenamingTable_test(id, name) VALUES (1, "tom");

SELECT * FROM NoRenaming_db.NoRenaming_schema.NoRenamingTable_test;
```

Output code

```sql
CREATE SCHEMA IF NOT EXISTS Target_Renaming_example_schema
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "10/23/2024" }}'
;

CREATE TABLE Target_Renaming_example_schema.Target_Renaming_example_table_tl (
    id INT,
    name VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "10/23/2024" }}';

INSERT INTO Target_Renaming_example_schema.Target_Renaming_example_table_tl (id, name) VALUES (1, "tom");

SELECT * FROM
    Target_Renaming_example_schema.Target_Renaming_example_table_tl;

CREATE TABLE Target_DB_1.MASTER.Renaming_example_table_tl_v2 (
    id INT,
    name VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "10/23/2024" }}';

INSERT INTO Target_DB_1.MASTER.Renaming_example_table_tl_v2 (id, name) VALUES (1, "tom");

SELECT * FROM
    Target_DB_1.MASTER.Renaming_example_table_tl_v2;

CREATE TABLE NoRenaming_db.NoRenaming_schema.NoRenamingTable_test (
    id INT,
    name VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "10/23/2024" }}'

INSERT INTO NoRenaming_db.NoRenaming_schema.NoRenamingTable_test (id, name) VALUES (1, "tom");

SELECT * FROM
    NoRenaming_db.NoRenaming_schema.NoRenamingTable_test;
```

## Embedded Objects

Renaming and reporting are only available for top-level objects. Embedded objects will not appear in the report and renaming will not be applied to these objects.

Input Code

```none
CREATE TABLE Renaming_example_table_tl (
   id INT,
   name VARCHAR(100)
);

CREATE PROCEDURE Renaming_example_procedure()
    LANGUAGE plpgsql
AS $$
BEGIN
CREATE TABLE Renaming_example_table_embedded (
   id INT,
   name VARCHAR(100)
);
SELECT * FROM Renaming_example_table_embedded;
SELECT * FROM Renaming_example_table_tl;
END;
$$;
```

Output Code

```none
CREATE TABLE Target_Renaming_example_table_tl (
   id INT,
   name VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/13/2024",  "domain": "test" }}';

CREATE PROCEDURE Target_Renaming_example_procedure ()
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS $$
BEGIN
CREATE TABLE Renaming_example_table_embedded (
   id INT,
   name VARCHAR(100)
);
SELECT * FROM
   Renaming_example_table_embedded;
SELECT * FROM
   Target_Renaming_example_table_tl;
END;
$$;
```

---
title: SnowConvert AI - About
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/about.md
section: Migrations
---

# SnowConvert AI - About

SnowConvert AI is an AI-powered Snowflake utility that accurately converts source database code from other platforms to Snowflake.

It ingests scripts for the source database objects such as tables, views, stored procedures, and functions. These scripts are then converted into Snowflake SQL scripts to recreate the exact equivalent Snowflake database objects. After reviewing the converted code, you can deploy the Snowflake SQL scripts on an existing instance of Snowflake.

Database platforms differ in syntax, built-in and procedural language features, logical architecture, SQL extensions, data types, and the extent of user defined customizations.
SnowConvert AI intelligently detects these differences and flags them as errors, warnings, issues ([EWIs](technical-documentation/issues-and-troubleshooting/conversion-issues/README.md)), and functional difference messages ([FDMs](technical-documentation/issues-and-troubleshooting/functional-difference/README.md)). These can be resolved manually or by using the built AI Code Conversion feature.
The AI Code Conversion feature also creates test cases to verify that the functionality of the converted Snowflake SQL code is exactly the same as the source database code.

Users can deploy the converted and verified code in an existing Snowflake instance.

## SnowConvert AI capabilities

| Source Technology | Availability | Supported Code Conversion | Source DB Connection | Data Migration | Snowflake Deployment | AI Code Conversion |
| --- | --- | --- | --- | --- | --- | --- |
| Teradata | GA | Tables, views, stored procedures, functions, Basic Teradata Query (BTEQ), Teradata MultiLoad (MLOAD), Teradata Parallel Data Pump (TPUMP) | Extraction script | No | No | No |
| Oracle | GA | Tables, views, stored procedures, functions, packages | Extraction script | No | No | No |
| SQL Server | GA | Tables, views, stored procedures, functions | Extraction script, direct DB connection | Yes | Yes | Yes |
| Redshift | GA | Tables, views, stored procedures, functions | Extraction script, direct DB connection | Yes | Yes | Yes |
| Azure Synapse | GA | Tables, views, stored procedures, functions | Extraction script | No | No | No |
| Sybase IQ | GA | Tables, views, stored procedures, functions | Extraction script | No | No | No |
| Google BigQuery | GA | Tables, views | Extraction script | No | No | Yes |
| Greenplum | GA | Tables, views | No | No | No | No |
| Netezza | GA | Tables, views | No | No | No | No |
| PostgreSQL | GA | Tables, views | No | No | No | Yes |
| Spark SQL | GA | Tables, views | No | No | No | No |
| Databricks SQL | GA | Tables, views | No | No | No | No |
| Vertica | GA | Tables, views | No | No | No | No |
| Hive | GA | Tables, views | No | No | No | No |
| IBM DB2 | GA | Tables, views, stored procedures, functions | No | No | No | No |

For more information, contact [snowconvert-info@snowflake.com](mailto:snowconvert-info%40snowflake.com).

---
title: SnowConvert AI - Ambiguous Comments Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/ambiguous-comments-validations.md
section: Migrations
---

# SnowConvert AI - Ambiguous Comments Validation

## Description

This validation step verifies if the entry code has a sequence of characters that may create ambiguous comments (`/*/`)

If the entry code has an ambiguous comment, the following warning is displayed:

Also, in the ScopeValidation report, you will find information about the failed file(s).

### Why is it ambiguous?

Block comments on SQL start with `/*` and end with `*/` . When the character sequence `/*/` is used, depending on the source language, it can start a nesting inside the block comment, or finish the whole block.

Here is an example of valid statements using `/*/`

```none
select col1,
  /*Some comment/*/ */*/
  col2,
  col3
from
  table1;
```

```none
select col1,
  /*Some comment/*/
  col2,
  col3
from
  table1;
```

```none
select col1,
  /*Some comment/*/ */*/
  col2,
  col3
from
  table1;
```

```none
select col1,
  /*Some comment/*/
  col2,
  col3
from
  table1;
```

As you can see, the comment behaves differently in Teradata and SQL Server than in Oracle and Snowflake. Even on Teradata, there is another treatment for bteq and other scripting languages.

### Solving the ambiguity

In Snowflake, if you encounter the /\*/ sequence in your code, it typically ends a block comment. However, if you’re using it differently in your source code, make sure to adjust it accordingly.

---
title: SnowConvert AI - ANSI SQL - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/general/built-in-functions.md
section: Migrations
---

# SnowConvert AI - ANSI SQL - Built-in functions

> This article provides an alphabetical list of built-in functions shared by the different dialects.

| ANSI SQL | Snowflake |
| --- | --- |
| ABS | [ABS](https://docs.snowflake.com/en/sql-reference/functions/abs) |
| ACOS | [ACOS](https://docs.snowflake.com/en/sql-reference/functions/acos) |
| ACOSH | [ACOSH](https://docs.snowflake.com/en/sql-reference/functions/acosh) |
| ANY_VALUE | [ANY_VALUE](https://docs.snowflake.com/en/sql-reference/functions/any_value) |
| APPROX_COUNT_DISTINCT | [APPROX_COUNT_DISTINCT](https://docs.snowflake.com/en/sql-reference/functions/approx_count_distinct) |
| ARRAY | [ARRAY_CONSTRUCT](https://docs.snowflake.com/en/sql-reference/functions/array_construct) |
| ASCII | [ASCII](https://docs.snowflake.com/en/sql-reference/functions/ascii) |
| ASIN | [ASIN](https://docs.snowflake.com/en/sql-reference/functions/asin) |
| ASINH | [ASINH](https://docs.snowflake.com/en/sql-reference/functions/asinh) |
| ATAN | [ATAN](https://docs.snowflake.com/en/sql-reference/functions/atan) |
| ATAN2 | [ATAN2](https://docs.snowflake.com/en/sql-reference/functions/atan2) |
| ATANH | [ATANH](https://docs.snowflake.com/en/sql-reference/functions/atanh) |
| ATN2 | [ATAN2](https://docs.snowflake.com/en/sql-reference/functions/atan2) |
| AVE | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) |
| AVERAGE | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) |
| AVG | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) |
| BTRIM | [TRIM](https://docs.snowflake.com/en/sql-reference/functions/trim) |
| CBRT | [CBRT](https://docs.snowflake.com/en/sql-reference/functions/cbrt) |
| CEIL | [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil) |
| CEILING | [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil) |
| CHARACTER_LENGTH | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| CHARINDEX | [CHARINDEX](https://docs.snowflake.com/en/sql-reference/functions/charindex) |
| CHAR_LENGTH | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| CHR | [CHR](https://docs.snowflake.com/en/sql-reference/functions/chr) |
| COALESCE | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce) |
| CONCAT | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) |
| CORR | [CORR](https://docs.snowflake.com/en/sql-reference/functions/corr) |
| COS | [COS](https://docs.snowflake.com/en/sql-reference/functions/cos) |
| COSH | [COSH](https://docs.snowflake.com/en/sql-reference/functions/cosh) |
| COT | [COT](https://docs.snowflake.com/en/sql-reference/functions/cot) |
| COUNT | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| COVAR_POP | [COVAR_POP](https://docs.snowflake.com/en/sql-reference/functions/covar_pop) |
| COVAR_SAMP | [COVAR_SAMP](https://docs.snowflake.com/en/sql-reference/functions/covar_samp) |
| CUME_DIST | [CUME_DIST](https://docs.snowflake.com/en/sql-reference/functions/cume_dist) |
| CURDATE | [CURRENT_DATE](https://docs.snowflake.com/en/sql-reference/functions/current_date) |
| CURRENT_DATABASE | [CURRENT_DATABASE](https://docs.snowflake.com/en/sql-reference/functions/current_database) |
| CURRENT_DATE | [CURRENT_DATE](https://docs.snowflake.com/en/sql-reference/functions/current_date) |
| CURRENT_SCHEMA | [CURRENT_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/current_schema) |
| CURRENT_TIMESTAMP | [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp) |
| CURRENT_USER | [CURRENT_USER](https://docs.snowflake.com/en/sql-reference/functions/current_user) |
| DATE | [DATE](https://docs.snowflake.com/en/sql-reference/functions/to_date) |
| DECODE | [DECODE](https://docs.snowflake.com/en/sql-reference/functions/decode) |
| DEGREES | [DEGREES](https://docs.snowflake.com/en/sql-reference/functions/degrees) |
| DENSE_RANK | [DENSE_RANK](https://docs.snowflake.com/en/sql-reference/functions/dense_rank) |
| EXP | [EXP](https://docs.snowflake.com/en/sql-reference/functions/exp) |
| FIRST_VALUE | [FIRST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/first_value) |
| FLOOR | [FLOOR](https://docs.snowflake.com/en/sql-reference/functions/floor) |
| GREATEST | [GREATEST](https://docs.snowflake.com/en/sql-reference/functions/greatest) |
| GROUPING | [GROUPING](https://docs.snowflake.com/en/sql-reference/functions/grouping) |
| IF | [IFF](https://docs.snowflake.com/en/sql-reference/functions/iff) |
| IFF | [IFF](https://docs.snowflake.com/en/sql-reference/functions/iff) |
| IFNULL | [IFNULL](https://docs.snowflake.com/en/sql-reference/functions/ifnull) |
| IIF | [IFF](https://docs.snowflake.com/en/sql-reference/functions/iff) |
| INITCAP | [INITCAP](https://docs.snowflake.com/en/sql-reference/functions/initcap) |
| KURTOSIS | [KURTOSIS](https://docs.snowflake.com/en/sql-reference/functions/kurtosis) |
| LAG | [LAG](https://docs.snowflake.com/en/sql-reference/functions/lag) |
| LAST_DAY | [LAST_DAY](https://docs.snowflake.com/en/sql-reference/functions/last_day) |
| LAST_VALUE | [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value) |
| LEAD | [LEAD](https://docs.snowflake.com/en/sql-reference/functions/lead) |
| LEAST | [LEAST](https://docs.snowflake.com/en/sql-reference/functions/least) |
| LEFT | [LEFT](https://docs.snowflake.com/en/sql-reference/functions/left) |
| LEN | [LEN](https://docs.snowflake.com/en/sql-reference/functions/length) |
| LENGTH | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| LN | [LN](https://docs.snowflake.com/en/sql-reference/functions/ln) |
| LOG | [LOG](https://docs.snowflake.com/en/sql-reference/functions/log) |
| LOWER | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower) |
| LPAD | [LPAD](https://docs.snowflake.com/en/sql-reference/functions/lpad) |
| LTRIM | [LTRIM](https://docs.snowflake.com/en/sql-reference/functions/ltrim) |
| MAX | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) |
| MAXIMUM | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) |
| MEDIAN | [MEDIAN](https://docs.snowflake.com/en/sql-reference/functions/median) |
| MIN | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) |
| MINIMUM | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) |
| MOD | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod) |
| NOW | [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp) |
| NTH_VALUE | [NTH_VALUE](https://docs.snowflake.com/en/sql-reference/functions/nth_value) |
| NTILE | [NTILE](https://docs.snowflake.com/en/sql-reference/functions/ntile) |
| NULLIF | [NULLIF](https://docs.snowflake.com/en/sql-reference/functions/nullif) |
| NULLIFZERO | [NULLIFZERO](https://docs.snowflake.com/en/sql-reference/functions/nullifzero) |
| NVL | [NVL](https://docs.snowflake.com/en/sql-reference/functions/nvl) |
| NVL2 | [NVL2](https://docs.snowflake.com/en/sql-reference/functions/nvl2) |
| OCTET_LENGTH | [OCTET_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/octet_length) |
| PERCENTILE_CONT | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont) |
| PERCENTILE_DISC | [PERCENTILE_DISC](https://docs.snowflake.com/en/sql-reference/functions/percentile_disc) |
| PERCENT_RANK | [PERCENT_RANK](https://docs.snowflake.com/en/sql-reference/functions/percent_rank) |
| PI | [PI](https://docs.snowflake.com/en/sql-reference/functions/pi) |
| POSITION | [POSITION](https://docs.snowflake.com/search?q=POSITION) |
| POW | [POW](https://docs.snowflake.com/en/sql-reference/functions/pow) |
| POWER | [POWER](https://docs.snowflake.com/en/sql-reference/functions/pow) |
| RADIANS | [RADIANS](https://docs.snowflake.com/en/sql-reference/functions/radians) |
| RANDOM | [RANDOM](https://docs.snowflake.com/en/sql-reference/functions/random) |
| RANK | [RANK](https://docs.snowflake.com/en/sql-reference/functions/rank) |
| REGEXP_COUNT | [REGEXP_COUNT](https://docs.snowflake.com/en/sql-reference/functions/regexp_count) |
| REGEXP_SUBSTR | [REGEXP_SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/regexp_substr) |
| REGR_AVGX | [REGR_AVGX](https://docs.snowflake.com/en/sql-reference/functions/regr_avgx) |
| REGR_AVGY | [REGR_AVGY](https://docs.snowflake.com/en/sql-reference/functions/regr_avgy) |
| REGR_COUNT | [REGR_COUNT](https://docs.snowflake.com/en/sql-reference/functions/regr_count) |
| REGR_INTERCEPT | [REGR_INTERCEPT](https://docs.snowflake.com/en/sql-reference/functions/regr_intercept) |
| REGR_SLOPE | [REGR_SLOPE](https://docs.snowflake.com/en/sql-reference/functions/regr_slope) |
| REGR_SXX | [REGR_SXX](https://docs.snowflake.com/en/sql-reference/functions/regr_sxx) |
| REGR_SXY | [REGR_SXY](https://docs.snowflake.com/en/sql-reference/functions/regr_sxy) |
| REGR_SYY | [REGR_SYY](https://docs.snowflake.com/en/sql-reference/functions/regr_syy) |
| REPEAT | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat) |
| REPLACE | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| REPLICATE | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| RIGHT | [RIGHT](https://docs.snowflake.com/en/sql-reference/functions/right) |
| ROLLUP | [ROLLUP](https://docs.snowflake.com/en/sql-reference/constructs/group-by-rollup) |
| ROUND | [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round) |
| ROW_NUMBER | [ROW_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/row_number) |
| RPAD | [RPAD](https://docs.snowflake.com/en/sql-reference/functions/rpad) |
| RTRIM | [RTRIM](https://docs.snowflake.com/en/sql-reference/functions/rtrim) |
| SHA1 | [SHA1](https://docs.snowflake.com/en/sql-reference/functions/sha1) |
| SHA2 | [SHA2](https://docs.snowflake.com/en/sql-reference/functions/sha2) |
| SIGN | [SIGN](https://docs.snowflake.com/en/sql-reference/functions/sign) |
| SIN | [SIN](https://docs.snowflake.com/en/sql-reference/functions/sin) |
| SOUNDEX | [SOUNDEX](https://docs.snowflake.com/en/sql-reference/functions/soundex) |
| SPACE | [SPACE](https://docs.snowflake.com/en/sql-reference/functions/space) |
| SPLIT_PART | [SPLIT_PART](https://docs.snowflake.com/en/sql-reference/functions/split_part) |
| SQRT | [SQRT](https://docs.snowflake.com/en/sql-reference/functions/sqrt) |
| STDDEV_POP | [STDDEV_POP](https://docs.snowflake.com/en/sql-reference/functions/stddev_pop) |
| STDDEV_SAMP | [STDDEV_SAMP](https://docs.snowflake.com/en/sql-reference/functions/stddev) |
| STDDEV | [STDDEV](https://docs.snowflake.com/en/sql-reference/functions/stddev) |
| SUBSTR | [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr) |
| SUBSTRING | [SUBSTRING](https://docs.snowflake.com/en/sql-reference/functions/substr) |
| SUM | [SUM](https://docs.snowflake.com/en/sql-reference/functions/sum) |
| TAN | [TAN](https://docs.snowflake.com/en/sql-reference/functions/tan) |
| TANH | [TANH](https://docs.snowflake.com/en/sql-reference/functions/tanh) |
| TIMESTAMP | [TO_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/to_timestamp) |
| TO_TIMESTAMP | [TO_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/to_timestamp) |
| TRANSLATE | [TRANSLATE](https://docs.snowflake.com/en/sql-reference/functions/translate) |
| TRIM | [TRIM](https://docs.snowflake.com/en/sql-reference/functions/trim) |
| UCASE | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) |
| UPPER | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) |
| USER | [CURRENT_USER](https://docs.snowflake.com/en/sql-reference/functions/current_user) |
| VAR_POP | [VAR_POP](https://docs.snowflake.com/en/sql-reference/functions/var_pop) |
| VAR_SAMP | [VAR_SAMP](https://docs.snowflake.com/en/sql-reference/functions/var_samp) |
| VARIANCE_POP | [VARIANCE_POP](https://docs.snowflake.com/en/sql-reference/functions/variance_pop) |
| VARIANCE_SAMP | [VARIANCE_SAMP](https://docs.snowflake.com/en/sql-reference/functions/variance) |
| VARIANCE | [VARIANCE](https://docs.snowflake.com/en/sql-reference/functions/variance) |
| VARP | [VAR_POP](https://docs.snowflake.com/en/sql-reference/functions/var_pop) |
| WIDTH_BUCKET | [WIDTH_BUCKET](https://docs.snowflake.com/en/sql-reference/functions/width_bucket) |
| ZEROIFNULL | [ZEROIFNULL](https://docs.snowflake.com/en/sql-reference/functions/zeroifnull) |

---
title: SnowConvert AI - ANSI SQL - Interval Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/general/interval-data-types.md
section: Migrations
---

# SnowConvert AI - ANSI SQL - Interval Data Types

## Description

The INTERVAL data type represents a duration or period of time. Many SQL dialects support INTERVAL types in column definitions, literals, and arithmetic expressions. Snowflake supports two families of INTERVAL types: `INTERVAL YEAR TO MONTH` and `INTERVAL DAY TO SECOND`. Learn more from the Snowflake documentation: [INTERVAL Data Type](../../../../sql-reference/data-types-datetime.md).

> **Warning:**
>
> The Snowflake INTERVAL data type is currently in **Public Preview**. The transformations described on this page require the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) to be enabled. Without this flag, INTERVAL types are converted to `VARCHAR` as described in each language’s data type reference.

### Snowflake Interval Types

Snowflake supports two interval qualifier families. Mixed qualifiers (combining year-to-month and day-to-second parts) are not allowed.

| Qualifier Family | Supported Qualifiers |
| --- | --- |
| Year-to-Month | `INTERVAL YEAR`, `INTERVAL MONTH`, `INTERVAL YEAR TO MONTH` |
| Day-to-Second | `INTERVAL DAY`, `INTERVAL HOUR`, `INTERVAL MINUTE`, `INTERVAL SECOND`, `INTERVAL DAY TO HOUR`, `INTERVAL DAY TO MINUTE`, `INTERVAL DAY TO SECOND`, `INTERVAL HOUR TO MINUTE`, `INTERVAL HOUR TO SECOND`, `INTERVAL MINUTE TO SECOND` |

### Language Behavior Summary

| Source Language | Qualifier Handling | Notes |
| --- | --- | --- |
| Oracle, Teradata | Source qualifiers preserved | These languages use explicit INTERVAL qualifiers that map directly to Snowflake |
| Redshift, Spark, Databricks, Vertica | Source qualifiers preserved | Same as above |
| BigQuery, PostgreSQL, Greenplum, Netezza | Normalized to `DAY TO SECOND` | Bare `INTERVAL` and mixed qualifiers become `INTERVAL DAY TO SECOND` ([SSC-FDM-0042](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)) |
| SQL Server, DB2 | Qualifiers preserved where applicable | Limited native INTERVAL support in source language |

## Interval Column Type Transformations

When the `--UseIntervalDatatype` flag is enabled, INTERVAL columns in `CREATE TABLE` statements are preserved as native Snowflake INTERVAL types.

### Languages with explicit qualifiers

For Oracle, Teradata, Redshift, Spark, Databricks, and Vertica, the source INTERVAL qualifier is preserved directly.

#### Source (Oracle)

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE intervals (
    column1 INTERVAL YEAR TO MONTH,
    column2 INTERVAL DAY TO SECOND,
    column3 INTERVAL YEAR,
    column4 INTERVAL DAY,
    column5 INTERVAL HOUR TO SECOND
);
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
CREATE OR REPLACE TABLE intervals (
    column1 INTERVAL YEAR TO MONTH,
    column2 INTERVAL DAY TO SECOND,
    column3 INTERVAL YEAR,
    column4 INTERVAL DAY,
    column5 INTERVAL HOUR TO SECOND
)
;
```

### Languages with unqualified INTERVAL

For BigQuery, PostgreSQL, Greenplum, and Netezza, bare `INTERVAL` is normalized to `INTERVAL DAY TO SECOND` with [SSC-FDM-0042](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md).

#### Source (BigQuery)

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE intervals (
    COL1 INTERVAL
);
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE intervals (
    COL1 INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/
)
;
```

### Without the flag (default behavior)

Without the `--UseIntervalDatatype` flag, INTERVAL columns are converted to `VARCHAR(30)` with [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md).

## Interval Literal Transformations

When the flag is enabled, interval literals are normalized to Snowflake-compatible INTERVAL literal syntax. The normalization depends on the source dialect.

### ANSI standard literals

Standard ANSI interval literals with explicit qualifiers are preserved as-is across all languages.

#### Source (Oracle, Teradata, Redshift)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '5-10' YEAR TO MONTH,
  INTERVAL '10 02:30:15.6554' DAY TO SECOND;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '5-10' YEAR TO MONTH,
  INTERVAL '10 02:30:15.6554' DAY TO SECOND;
```

### Verbose syntax normalization

PostgreSQL, Redshift, Vertica, Greenplum, Netezza, and Spark support verbose interval syntax with named units (for example, `WEEK`, `DAY`, `HOUR`). These are normalized to compact Snowflake INTERVAL literals.

#### Source (PostgreSQL)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '5 WEEK 3 DAY 4 HOUR 30 MINUTE 15 SECOND 233 MILLISECOND';
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '38 04:30:15.233' DAY TO SECOND;
```

### ISO 8601 format normalization

PostgreSQL and related languages support ISO 8601 duration format. These are normalized to compact Snowflake literals.

#### Source (PostgreSQL)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL 'P1Y2M3DT4H5M6S',
  INTERVAL 'PT33M16S',
  INTERVAL 'P22-01-05';
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '428 04:05:06' DAY TO SECOND,
  INTERVAL '0 00:33:16' DAY TO SECOND,
  INTERVAL '8065 00:00:00' DAY TO SECOND;
```

### BigQuery expression intervals

BigQuery supports computed interval expressions (`INTERVAL expr unit`). These are transformed to CAST expressions.

#### Source (BigQuery)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL 3 QUARTER,
  INTERVAL 1 + 1 YEAR,
  INTERVAL 3 * 1 DAY,
  INTERVAL 2 * 3 HOUR;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '270 00:00:00' DAY TO SECOND,
  CAST((1 + 1) * 365 AS INTERVAL DAY),
  CAST(3 * 1 AS INTERVAL DAY),
  CAST(2 * 3 AS INTERVAL HOUR);
```

### Overflow normalization

Teradata interval literals with overflowing values are normalized to valid Snowflake intervals.

#### Source (Teradata)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '10 73:80:10' DAY TO SECOND;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '13 02:20:10' DAY TO SECOND;
```

### Negative sign normalization

Teradata places the negative sign outside the literal string. SnowConvert AI normalizes it inside the string for Snowflake compatibility.

#### Source (Teradata)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL -'10-3' YEAR TO MONTH;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '-10-3' YEAR TO MONTH;
```

### AGO keyword handling

PostgreSQL and related languages support the `AGO` keyword to negate an interval. SnowConvert AI resolves this into a negated Snowflake interval literal.

#### Source (PostgreSQL)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '1 DAY -3 MINUTES AGO' DAY TO SECOND,
  INTERVAL '-1 DAY 5 HOURS AGO' DAY TO SECOND;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '-0 23:57:00' DAY TO SECOND,
  INTERVAL '0 19:00:00' DAY TO SECOND;
```

### Spark multi-unit intervals

Spark supports multi-unit interval literals with mixed positive and negative components. These are normalized to compact Snowflake form.

#### Source (Spark)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL -2 HOUR '3' MINUTE 15 SECOND;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '-01:56:45' HOUR TO SECOND;
```

### Vertica named-unit intervals

Vertica supports named-unit interval syntax that is normalized to compact Snowflake form.

#### Source (Vertica)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '4 years 1 month 4 days 14 hours';
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT INTERVAL '1494 14:00:00' DAY TO SECOND;
```

## Interval Arithmetic

When the flag is enabled, datetime subtraction expressions that produce interval results are transformed to use Snowflake’s native interval types.

### Year-to-Month family

Datetime subtraction with `YEAR`, `MONTH`, or `YEAR TO MONTH` qualifiers uses `TIMESTAMPDIFF` and `CAST` to produce a native interval result.

#### Source (Oracle, Teradata)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT (TIMESTAMP '2025-10-12 10:30:15' - TIMESTAMP '2022-01-07 11:00:15') YEAR TO MONTH FROM dual;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  CAST(CAST(TIMESTAMPDIFF(MONTH, TIMESTAMP '2022-01-07 11:00:15', TIMESTAMP '2025-10-12 10:30:15') AS INTERVAL MONTH) AS INTERVAL YEAR TO MONTH) FROM dual;
```

### Day-to-Second family

Datetime subtraction with `DAY`, `HOUR`, `MINUTE`, `SECOND`, or compound qualifiers uses direct timestamp subtraction, producing a native interval result.

#### Source (Oracle, Teradata)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT (TIMESTAMP '2024-12-31 23:59:59' - TIMESTAMP '2024-01-01 00:00:00') DAY TO SECOND FROM dual;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  TIMESTAMP '2024-12-31 23:59:59' - TIMESTAMP '2024-01-01 00:00:00' INTERVAL DAY TO SECOND FROM dual;
```

### DATE operand handling

When DATE operands are used in interval subtraction, they are wrapped with `TIMESTAMP_NTZ_FROM_PARTS` to produce a timestamp suitable for interval subtraction.

#### Source (Oracle)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT (DATE '2024-12-31' - DATE '2024-01-01') DAY TO SECOND FROM dual;
```

#### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  TIMESTAMP_NTZ_FROM_PARTS(DATE '2024-12-31', '00:00:00') - TIMESTAMP_NTZ_FROM_PARTS(DATE '2024-01-01', '00:00:00') INTERVAL DAY TO SECOND FROM dual;
```

## CAST to Interval

CAST expressions targeting interval types are preserved when the flag is enabled.

### Source (Teradata)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  CAST('10-5' AS INTERVAL YEAR TO MONTH),
  CAST(INTERVAL '5 03:30' DAY TO MINUTE AS INTERVAL HOUR(4) TO SECOND);
```

### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  CAST('10-5' AS INTERVAL YEAR TO MONTH),
  CAST(INTERVAL '5 03:30' DAY TO MINUTE AS INTERVAL HOUR(4) TO SECOND);
```

## Interval Arithmetic with Existing Columns

Standard interval arithmetic with columns and literals (addition, subtraction, multiplication, division) is preserved as-is.

### Source (Oracle, Teradata, Redshift)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  date_column + interval_column,
  date_column - interval_column,
  interval_column * 2,
  interval_column / 3,
  DATE '2024-01-01' + INTERVAL '10' DAY,
  TIMESTAMP '2025-01-01 10:30:00' - INTERVAL '5-3' YEAR TO MONTH
FROM datesTable;
```

### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  date_column + interval_column,
  date_column - interval_column,
  interval_column * 2,
  interval_column / 3,
  DATE '2024-01-01' + INTERVAL '10' DAY,
  TIMESTAMP '2025-01-01 10:30:00' - INTERVAL '5-3' YEAR TO MONTH
FROM
  datesTable;
```

## Oracle Precision Zero Handling

Oracle allows `INTERVAL` types with precision `(0)`, which has no equivalent in Snowflake. SnowConvert AI removes the zero precision qualifier.

### Source (Oracle)

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '0-5' YEAR(0) TO MONTH,
  INTERVAL '0' DAY(0)
FROM DUAL;
```

### Snowflake

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  INTERVAL '0-5' YEAR TO MONTH,
  INTERVAL '0' DAY
FROM DUAL;
```

## Known Limitations

The following scenarios produce warnings when the `--UseIntervalDatatype` flag is enabled:

* **Dynamic Tables** ([SSC-EWI-0118](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support INTERVAL columns in Dynamic Tables. The Dynamic Table is still generated with a warning.
* **UDFs and Snowflake Scripting** ([SSC-EWI-0117](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support the INTERVAL data type in UDF/procedure parameters, return types, or variable declarations.
* **Semi-structured types** ([SSC-EWI-0116](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support INTERVAL values inside VARIANT, ARRAY, MAP, or STRUCT columns.
* **Qualifier normalization** ([SSC-FDM-0042](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)): For languages with unqualified or mixed INTERVAL types, the qualifier is changed to `DAY TO SECOND`.

## Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to other data type (INTERVAL to VARCHAR when flag is off).
2. [SSC-EWI-0107](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Interval literal not supported in current scenario.
3. [SSC-EWI-0116](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Snowflake does not support interval values inside semi-structured type columns.
4. [SSC-EWI-0117](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Snowflake does not support interval data type in UDFs or Snowflake Scripting.
5. [SSC-EWI-0118](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Snowflake does not support interval columns in Dynamic Tables.
6. [SSC-EWI-0119](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Interval type column was converted to VARCHAR.
7. [SSC-FDM-0042](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Interval qualifier changed to DAY TO SECOND.

---
title: SnowConvert AI - ANSI SQL - Subqueries
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/general/subqueries.md
section: Migrations
---

# SnowConvert AI - ANSI SQL - Subqueries

## Description

> A subquery is a query within another query. Subqueries in a [FROM](https://docs.snowflake.com/en/sql-reference/constructs/from) or [WHERE](https://docs.snowflake.com/en/sql-reference/constructs/where) clause are used to provide data that will be used to limit or compare/evaluate the data returned by the containing query. ([Snowflake subqueries documentation](https://docs.snowflake.com/en/user-guide/querying-subqueries)).

Subqueries can be correlated/uncorrelated as well as scalar/non-scalar.

**Correlated subqueries** reference columns from the outer query. In Snowflake, correlated subqueries execute for each row in the query. On the other hand, **Uncorrelated subqueries** do not reference the outer query and are executed once for the entire query.

**Scalar subqueries** return a single value as result, otherwise the subquery is **non-scalar.**

The following patterns are based on these categories.

## Sample Source Patterns

### Setup data

#### Teradata

```sql
CREATE TABLE tableA
(
    col1 INTEGER,
    col2 VARCHAR(20)
);

CREATE TABLE tableB
(
    col3 INTEGER,
    col4 VARCHAR(20)
);

INSERT INTO tableA VALUES (50, 'Hey');
INSERT INTO tableA VALUES (20, 'Example');

INSERT INTO tableB VALUES (50, 'Hey');
INSERT INTO tableB VALUES (20, 'Bye');
```

#### *Snowflake*

```sql
CREATE OR REPLACE TABLE tableA
(
    col1 INTEGER,
    col2 VARCHAR(20)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "12/02/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE tableB
(
    col3 INTEGER,
    col4 VARCHAR(20)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "12/02/2024",  "domain": "test" }}'
;

INSERT INTO tableA
VALUES (50, 'Hey');

INSERT INTO tableA
VALUES (20, 'Example');

INSERT INTO tableB
VALUES (50, 'Hey');

INSERT INTO tableB
VALUES (20, 'Bye');
```

### Correlated Scalar subqueries

Snowflake evaluates correlated subqueries **at compile time** to determine if they are scalar and therefore valid in the context were a single return value is expected. To solve this, the ANY_VALUE aggregate function is added to the returned column when the result is not an aggregate function. This allows the compiler to determine a single value return is expected. Since scalar subqueries are expected to return a single value the function ANY_VALUE will not change the result, it will just return the original value as is.

#### *Teradata*

```sql
SELECT col2
FROM tableA
WHERE col1 = (SELECT col3 FROM tableB WHERE col2 = col4);
```

#### Results

```sql
+------+
| col2 |
+------+
| Hey  |
+------+
```

#### *Snowflake*

```sql
SELECT
    col2
FROM
    tableA
WHERE col1 =
             --** SSC-FDM-0002 - CORRELATED SUBQUERIES MAY HAVE SOME FUNCTIONAL DIFFERENCES. **
             (
                 SELECT
                     ANY_VALUE(col3) FROM
                     tableB
                 WHERE
                     RTRIM( col2) = RTRIM(col4));
```

#### Results

```sql
+------+
| col2 |
+------+
| Hey  |
+------+
```

### Uncorrelated Scalar subqueries

Snowflake fully supports uncorrelated scalar subqueries.

#### *Teradata*

```sql
SELECT col2, (SELECT AVG(col3) FROM tableB) AS avgTableB
FROM tableA
WHERE col1 = (SELECT MAX(col3) FROM tableB);
```

#### Results

```sql
+------+-----------+
| col2 | avgTableB |
+------+-----------+
| Hey  | 35        |
+------+-----------+
```

#### *Snowflake*

```sql
SELECT
    col2,
    (
                 SELECT
                     AVG(col3) FROM
                     tableB
    ) AS avgTableB
            FROM
    tableA
            WHERE col1 = (
                 SELECT
                     MAX(col3) FROM
                     tableB
    );
```

#### Results

```sql
+------+-----------+
| col2 | avgTableB |
+------+-----------+
| Hey  | 35.000000 |
+------+-----------+
```

### Non-scalar subqueries

Non-scalar subqueries specified inside subquery operators (ANY/ALL/IN/EXISTS) are supported.

Non-scalar subqueries used as derived tables are also supported.

#### *Teradata*

```sql
SELECT col2
FROM tableA
WHERE col1 IN (SELECT col3 FROM tableB);

SELECT col2
FROM tableA
WHERE col1 >= ALL(SELECT col3 FROM tableB);

SELECT col2, myDerivedTable.col4
FROM tableA, (SELECT * FROM tableB) AS myDerivedTable
WHERE col1 = myDerivedTable.col3;
```

#### Result

```sql
+---------+
| col2    |
+---------+
| Example |
+---------+
| Hey     |
+---------+

+---------+
| col2    |
+---------+
| Hey     |
+---------+

+---------+------+
| col2    | col4 |
+---------+------+
| Example | Bye  |
+---------+------+
| Hey     | Hey  |
+---------+------+
```

#### *Snowflake*

```sql
SELECT
    col2
            FROM
    tableA
            WHERE col1 IN (
                 SELECT
                     col3 FROM
                     tableB
    );

                     SELECT
    col2
            FROM
    tableA
            WHERE col1 >= ALL(
                 SELECT
                     col3 FROM
                     tableB
    );
                    SELECT
    col2,
    myDerivedTable.col4
            FROM
    tableA, (
                 SELECT
                     * FROM
                     tableB
    ) AS myDerivedTable
            WHERE col1 = myDerivedTable.col3;
```

#### Results

```sql
+---------+
| col2    |
+---------+
| Example |
+---------+
| Hey     |
+---------+

+---------+
| col2    |
+---------+
| Hey     |
+---------+

+---------+------+
| col2    | col4 |
+---------+------+
| Example | Bye  |
+---------+------+
| Hey     | Hey  |
+---------+------+
```

## Known Issues

**1. Subqueries with FETCH first that are not uncorrelated scalar**

Oracle allows using the FETCH clause in subqueries, Snowflake only allows using this clause if the subquery is uncorrelated scalar, otherwise an exception will be generated.

SnowConvert AI will mark any inalid usage of FETCH in subqueries with [SSC-EWI-0108](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)

Oracle:

```sql
-- Correlated scalar
SELECT col2
FROM tableA
WHERE col2 = (SELECT col4 FROM tableB WHERE col3 = col1 FETCH FIRST ROW ONLY);

-- Uncorrelated scalar
SELECT col2
FROM tableA
WHERE col2 = (SELECT col4 FROM tableB FETCH FIRST ROW ONLY);
```

Snowflake:

```sql
-- Correlated scalar
SELECT col2
FROM
    tableA
    WHERE col2 =
                 --** SSC-FDM-0002 - CORRELATED SUBQUERIES MAY HAVE SOME FUNCTIONAL DIFFERENCES. **
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (SELECT
                         ANY_VALUE( col4) FROM
                         tableB
                     WHERE col3 = col1
                     FETCH FIRST 1 ROW ONLY);

 -- Uncorrelated scalar
SELECT col2
FROM
    tableA
    WHERE col2 = (SELECT col4 FROM
                         tableB
                     FETCH FIRST 1 ROW ONLY);
```

## Related EWIs

1. [SSC-FDM-0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Correlated subquery may have functional differences
2. [SSC-EWI-0108](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The following subquery matches at least one of the patterns considered invalid and may produce compilation errors

---
title: SnowConvert AI - Assessment Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/README.md
section: Migrations
---

# SnowConvert AI - Assessment Report

## General Summary

The purpose of this document is to provide guidance for users to understand the summary results from the SnowConvert AI conversion tools. It will guide through the different metrics returned and how these metrics can be used to determine the automation level achieved, and the amount of manual effort needed to make the output code into functionally equivalent Snowflake code.

Most of the concepts presented in this document are already explained on the Report’s main page. But here is some other helpful information about the most important information of the image above.

* **Total Parsing Errors**: The number of times that the conversion tool found text fragments that could not be recognized as syntactically correct elements for the source language under conversion. A parsing error could have a little or large impact. It is important to determine the number of LOC affected by parsing errors and how much they represent of the total workload. Sometimes parsing errors can occur due to encoding issues or because the workload needs some preparation.
* **Code Conversion Rate**: The conversion rate is the percentage of the total source code that was successfully converted by SnowConvert AI into functionally equivalent Snowflake code. Every time that the tool identifies not supported elements, *i.e,* fragments in the input source code that were not converted into Snowflake, this will affect the conversion rate.
* **Identified Objects**: The count of all the Top Level DDL Objects ( Table, View, Procedure, etc.. ) that the SnowConvert AI identified. If there were a parsing error on an object, it wouldn’t be an Identified Object.
  Example: The [first objects](../../../../../technical-documentation/issues-and-troubleshooting/functional-difference/README.md) from line #1 to line #6. There is evidently a parsing error, so the SnowConvert AI cannot identify that as an object.

### Conversion rate modes

As mentioned before, when an element is marked as not supported (due to parsing errors or because there is no support for it in Snowflake) the conversion rate will be punished. How much of the conversion rate is punished for each not-supported element depends on the unit of code selected, two units are available: characters or lines.

#### Conversion rate using code characters

When characters of code are selected, the total amount of characters in the input source will represent the overall units to convert. So, if there are 100 characters total and there is only one not-supported element with 10 characters, the conversion rate will be **90%**. The conversion rate using characters is more precise because only the characters belonging to the not-supported elements are punished but, it is harder to manually calculate and understand.

#### Conversion rate using lines of code

When lines of code are chosen (default option), the number of lines of code in the input source code will represent the overall units to convert, and lines containing not-supported elements will be **entirely** considered as not-supported units of code. So, if the same input code with those 100 characters is split into 5 lines of code, and the not-supported element is in just one line, then the conversion rate will be **80%**; the **entire line** containing the not-supported element is considered not supported as well. The conversion rate using lines is easier to follow however, it is less accurate because entire lines of code containing not-supported elements are punished (even if there are other supported elements in that same line).

The next example shows how the conversion rate is calculated using both metrics.

### Conversion rate example

**Input source code**

```sql
--Comment123
CREATE TABLE Table1(
 Prefix_Employee_Name CHAR(25),
 !ERROR_Col,
 Prefix_Employee_Sal DECIMAL(8))
```

The above code has exactly 100 code characters because whitespaces and line breaks are not considered code characters. The comment above `Table1` belongs to the table and is part of those 100 characters. This is the output code that SnowConvert AI generates for this input.

**Output source code**

```sql
--Comment123
CREATE OR REPLACE TABLE Table1 (
 Prefix_Employee_Name CHAR(25)
--                              ,
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '4' COLUMN '2' OF THE SOURCE CODE STARTING AT '!'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS ',' ON LINE '3' COLUMN '31'. FAILED TOKEN WAS '!' ON LINE '4' COLUMN '2'. CODE '15'. **
-- !ERROR_Col
           ,
 Prefix_Employee_Sal DECIMAL(8))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

The second column of the table has a parsing error and therefore this is a not-supported element. Let’s take a look a how the conversion rate is punished using the two units available.

#### Conversion rate using code characters

Here is the breakdown of the characters in the input code

```sql
--Comment123 /*12 code characters*/
CREATE TABLE Table1( /*18 code characters*/
 Prefix_Employee_Name CHAR(25), /*29 code characters*/
 !ERROR_Col, /*11 code characters*/
 Prefix_Employee_Sal DECIMAL(8)) /*30 code characters*/
```

* Total amount of code characters: 100
* Code characters in not-supported elements: 10
* **Result: 90.00%**

> **Note:**
>
> Notice there are 11 characters in the 4th line but only 10 are marked as not supported. This is because of how the parsing recovery mechanism works. When the parser encounters an error, it will consider all the following characters, until the next delimiter character, in this case the comma (‘,’), as part of the error. That means that the amount of not-supported characters in any input code can greatly depend on the type of parsing errors. In some cases, the parser will be able to recover close to where the actual error is, but sadly in other cases, a lot of code can be swallowed by the error.

#### Conversion rate using lines of code

The conversion rate using lines of code as units is much simpler to calculate.

* Total amount of lines of code: 5
* Lines of code with not-supported elements: 1
* **Result: 80%**

#### LOC conversion rate depends on how the code is formatted

When using lines of code as the unit, the conversion rate will greatly depend on how the input code is formatted. For example, the following two code samples are equivalent but in the first case all the code is put into the same line and in the second case the code is split into 5 lines of code

```sql
SELECT col1, !error_col FROM table1;
```
```none
SELECT
   col1,
   !error_col
 FROM
    table1;
```

Notice that the second column that is being referenced in the SELECT has an error because it starts with an invalid character. In the first case, since the whole code is in the same line, the conversion rate will be 0%. But in the second case, since the code is split, only one line of code is punished and therefore the conversion rate will be 80%.

### Conversion rate differences

Conversion results of a migration may differ depending on the operating system.

This occurs because most of the time, Microsoft Windows uses CRLF line-breaking in their files. This format uses the characters `\r\n`, but UNIX OS only `\n`(LF).
Due to that format difference, when our code processor is reading the input files, it will count the CRLF format as two characters and just one in LF files. These counting differences generate different results in the conversion rates, specifically, in string expressions present in your code.

To avoid this problem, you can use Visual Studio Code or similar tools to change the line-breaking format.

## File and Object-Level Breakdown

### SQL - Files

| File | Conversion Rate | Lines of Code | Total Object Quantity | Parsing Errors |
| --- | --- | --- | --- | --- |
| SQL | 42% | 20 | 2 | 3 |

In this section, you’ll get the overall assessment summary information for all the SQL Files

* **Code Conversion Rate**: This is an estimation of the conversion rate based on the characters of the given SQL Files.
* **Line of Code**: The count for the lines of code of the given SQL Files.
* **Total Object Quantity**: The count of total identified objects of the given SQL Files.
* **Parsing Errors**: The count of total parsing errors of the given SQL Files.

> **Warning:**
>
> The Unrecognized objects will be counted also a parsing errors of the SQL Files section

> **Warning:**
>
> The Code conversion rate may differ from Identified conversion rate because this is also considering the unrecognized objects.

### SQL - Identified Objects

| Object | Conversion Rate | Lines of Code | Total Object Quantity | Parsing Errors |
| --- | --- | --- | --- | --- |
| Tables | 67% | 5 | 1 | 1 |
| Views | 57% | 7 | 1 | 1 |
| Procedures | - | 0 | 0 | 0 |
| Functions | N/A | N/A | N/A | N/A |

> **Note:**
>
> If N/A is listed in the table above, it means that the object type is not supported in Snowflake, most likely due to architectural reasons. These objects are commented out in the generated code, and they do not punish the conversion rate.

> **Note:**
>
> If the Conversion Rate field has a “-”, it means that the current set of files you have migrated didn’t contain any instance of the specified object.

In this section, you’ll get the assessment information for all the identified objects divided by the DDL objects like Tables, Views, Procedures, etc.

> **Warning:**
>
> If there is a code where the parser couldn’t handle it, the entire object will be accounted as *Unrecognized Object,* and therefore it wouldn’t show here

* **Code Conversion Rate**: This is an estimation of the conversion rate based on the characters for the identified objects like Table, View, Procedure, etc.
* **Line of Code**: The count for the lines of code of each type of identified object.
* **Total Object Quantity**: The count for each type of identified object.
* **Parsing Errors**: The count for the parsing errors that occurred inside each type of identified object.

Example: For the 2 tables that we have in the [source code](../../../../../technical-documentation/issues-and-troubleshooting/functional-difference/README.md), one is an unrecognized object and one is successfully identified. The conversion rate of that table of 5 lines of code is 75% due to 1 parsing error.

## Issues Breakdown

In this page, you will get the number of unique issues and the list of issues ordered by severity in descendant sort.‌

For example, for the given source code, we have 2 critical issues related to parsing errors and one medium severity issue related to the *Not supported function.*

> **Note:**
>
> Only errors with Medium/High/Critical severity will affect the current conversion rate. Warnings are just informative.

---
title: SnowConvert AI - Azure Synapse
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/azure-synapse.md
section: Migrations
---

# SnowConvert AI - Azure Synapse

## What is SnowConvert AI for Azure Synapse?

SnowConvert AI is a software tool that understands Azure Synapse scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for Azure Synapse performs the following conversions:

### Azure Synapse to Snowflake SQL

SnowConvert AI understands the Azure Synapse source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake.

### Sample code

Azure Synapse basic input code:

```sql
 CREATE TABLE Persons (
    PersonID int,
    LastName varchar(255),
    FirstName varchar(255),
    Address varchar(255),
    City varchar(255)
);
```

Snowflake SQL output code:

```sql
 CREATE OR REPLACE TABLE Persons (
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255),
    Address VARCHAR(255),
    City VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"azure synapse"}}'
;
```

As you can see, most of the structure remains the same. There are some cases where the datatypes have to be transformed, for example.

#### Azure Synapse Stored Procedures to JavaScript Embedded in Snowflake SQL

SnowConvert AI takes Azure Synapse stored procedures and converts them to JavaScript embedded into Snowflake SQL. Azure Synapse’s CREATE PROCEDURE is replaced by Snowflake’s CREATE OR REPLACE PROCEDURE. JavaScript is called as a scripting language, and all of the inner statements are converted to JavaScript.

##### Sample code

Azure Synapse basic stored procedure:

```sql
 CREATE PROCEDURE SelectAllCustomers
AS
SELECT * FROM Customers
GO;
```

Snowflake SQL output code, with embedded JavaScript:

```sql
 -- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE SelectAllCustomers ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   // REGION SnowConvert AI Helpers Code
   // END REGION

 EXEC(`SELECT
   *
FROM
   Customers`);
$$;
;
```

* When creating the JavaScript code, there is a portion of code added as a *helper*, required for an easier transformation of the contents of the procedure.
* You can expect to see warnings with an associated code to help you find out what is happening in the converted code. (See [issues and troubleshooting](../../../technical-documentation/issues-and-troubleshooting/README.md))

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your Azure Synapse files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

On the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for SQL Server is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Best practices
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/best-practices.md
section: Migrations
---

# SnowConvert AI - Best practices

## 1. Extraction

We highly recommend you use our scripts to extract your workload:

* Teradata: [DDL Export Scripts for Teradata](https://github.com/Snowflake-Labs/SC.DDLExportScripts/blob/main/Teradata/README.md).
* Oracle: [DDL Export Scripts for Oracle](https://github.com/Snowflake-Labs/SC.DDLExportScripts/blob/main/Oracle/README.md).
* SQLServer: [DDL Export Scripts for SQL Server](https://github.com/Snowflake-Labs/SC.DDLExportScripts/blob/main/SQLServer/README.pdf).
* Redshift: [Redshift code extraction guide](code-extraction/redshift.md).

## 2. Preprocess

We highly recommend you use a Preprocess Script that aims to give you better results before starting an assessment or a conversion. This script performs the following tasks:

1. Create a single file for each top-level object
2. Organize each file by a defined folder hierarchy (The default is: Database Name -> Schema Name -> Object Type)
3. Generate an inventory report that provides information on all the objects that are in the workload.

### 2.1 Download

* Download the [binary of the script for macOS](https://sctoolsartifacts.z5.web.core.windows.net/tools/extractorscope/standardize_sql_files)
  and make sure to follow the setup instructions in Step 2.3.
* Download the [binary of the script for Windows](https://sctoolsartifacts.z5.web.core.windows.net/tools/extractorscope/standardize_sql_files.exe).

### 2.2 Description

The following information is needed to run the script:

| **Script Argument** | **Example Value** | **Required** | **Usage** |
| --- | --- | --- | --- |
| Input folder | `/home/user/extracted_ddls` | Yes | `{ -i | ifolder= }` |
| Output folder | `/home/user/processed_extracted_ddls` | Yes | `{ -o | ofolder= }` |
| Database name | `sampleDataBase` | Yes | `{ -d | dname= }` |
| Database engine | `Microsoft SQL Server` | Yes | `{ -e | dengine= }` |
| Output folder structure | `Database name, top level object type and schema` | No | `[ { -s | structure= } ]` |
| Pivot tables generation | `Yes` | No | `[ -p ]` |

> **Note:**
>
> The supported values for the database engine argument (-e) are: oracle, mssql and teradata

> **Note:**
>
> The supported values for the output folder structure argument (-s) are: database_name, schema_name and top_level_object_name_type.
> When specifying this argument, all the previous values need to be separated by a comma. For example: `-s database_name,top_level_object_name_type,schema_name`.
>
> This argument is optional and when it is not specified the default structure is the following: Database name, top-level object type and schema name.

> **Note:**
>
> The pivot tables generation parameter (-p) is optional.

### 2.3 Setup the binary for Mac

1. Set the binary as an executable:
   `chmod +x standardize_sql_files`
2. Run the script by executing the following command:

   `./standardize_sql_files`

   * If this is the first time running the binary the following message will pop-up:
     Click OK.
   * Open Settings -> Privacy & Security -> Click Allow Anyway

### Running the script

1. Running the script using the following format:

   1. Mac format
      `./standardize_sql_files -i "input path" -o "output path" -d Workload1 -e teradata`
   2. Windows format
      `./standardize_sql_files.exe -i "input path" -o "output path" -d Workload1 -e teradata`
2. If the script is successfully executed the following output will be displayed:

   `Splitting process completed successfully!`
   `Report successfully created!`
   `Script successfully executed!`

---
title: SnowConvert AI - BigQuery
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/README.md
section: Migrations
---

# SnowConvert AI - BigQuery

> **Conversion Scope:**
>
> SnowConvert AI for Google BigQuery currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Google BigQuery grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - BigQuery - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-functions.md
section: Migrations
---

# SnowConvert AI - BigQuery - Built-in functions

Translation reference for all the supported built-in functions by SnowConvert AI for BigQuery.

> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Aggregate Functions

| BigQuery | Snowflake |
| --- | --- |
| [ANY_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#any_value) | [ANY_VALUE](https://docs.snowflake.com/en/sql-reference/functions/any_value)  *Note: Unlike BigQuery, Snowflake does not ignore NULLs . Additionally, Snowflake’s `OVER()` clause does not support the use of `ORDER BY` or explicit window frames.* |
| [ANY_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#any_value)( expr1, HAVING MAX expr2)  [ANY_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#any_value)( expr1, HAVING MIN expr2) | [MAX_BY](https://docs.snowflake.com/en/sql-reference/functions/max_by)(expr1, expr1)  [MIN_BY](https://docs.snowflake.com/en/sql-reference/functions/min_by)(expr1, expr2) |
| [AVG](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#avg) | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) |
| [COUNT](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#count) | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| [COUNTIF](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#countif) | [COUNT_IF](https://docs.snowflake.com/en/sql-reference/functions/count_if) |
| [LOGICAL_AND](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#logical_and) | [BOOLAND_AGG](https://docs.snowflake.com/en/sql-reference/functions/booland_agg) |
| [LOGICAL_OR](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#logical_or) | [BOOLOR_AGG](https://docs.snowflake.com/en/sql-reference/functions/boolor_agg) |
| [MAX](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#max) | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) |
| [MIN](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#min) | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) |
| [SUM](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#sum) | [SUM](https://docs.snowflake.com/en/sql-reference/functions/sum) |

## Array Functions

| BigQuery | Snowflake |
| --- | --- |
| [ARRAY_AGG](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#array_agg) | [ARRAY_AGG](https://docs.snowflake.com/en/sql-reference/functions/array_agg) |
| [ARRAY_CONCAT](https://cloud.google.com/bigquery/docs/reference/standard-sql/array_functions#array_concat) | [ARRAY_CAT](https://docs.snowflake.com/en/sql-reference/functions/array_cat) |
| [ARRAY_CONCAT_AGG](https://cloud.google.com/bigquery/docs/reference/standard-sql/aggregate_functions#array_concat_agg) | [ARRAY_FLATTEN](https://docs.snowflake.com/en/sql-reference/functions/array_flatten) |
| [ARRAY_TO_STRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/array_functions#array_to_string)(expr, delimiter) | [ARRAY_TO_STRING](https://docs.snowflake.com/en/sql-reference/functions/array_to_string)(ARRAY_COMPACT(expr), delimiter) |
| [ARRAY_TO_STRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/array_functions#array_to_string)(expr, delimiter, null_text) | ARRAY_TO_STRING_UDF(expr, delimiter, null_text)  *Notes: SnowConvert AI generates a UDF to handle the NULL replacement parameter which is not natively supported in Snowflake’s ARRAY_TO_STRING function.* |
| [SELECT ARRAY](https://cloud.google.com/bigquery/docs/reference/standard-sql/query-syntax#array_subquery) (SELECT query) | SELECT (SELECT ARRAY_AGG(\*) FROM (SELECT query))  *Notes: BigQuery’s ARRAY subquery syntax is transformed to use ARRAY_AGG with a subquery in Snowflake.* |

## Conditional Expressions

| BigQuery | Snowflake |
| --- | --- |
| [COALESCE](https://cloud.google.com/bigquery/docs/reference/standard-sql/conditional_expressions#coalesce) | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce) |
| [IF](https://cloud.google.com/bigquery/docs/reference/standard-sql/conditional_expressions#if) | [IFF](https://docs.snowflake.com/en/sql-reference/functions/iff) |
| [IFNULL](https://cloud.google.com/bigquery/docs/reference/standard-sql/conditional_expressions#ifnull) | [IFNULL](https://docs.snowflake.com/en/sql-reference/functions/ifnull) |
| [NULLIF](https://cloud.google.com/bigquery/docs/reference/standard-sql/conditional_expressions#nullif) | [NULLIF](https://docs.snowflake.com/en/sql-reference/functions/nullif) |

## Conversion Functions

| BigQuery | Snowflake |
| --- | --- |
| [SAFE_CAST](https://cloud.google.com/bigquery/docs/reference/standard-sql/conversion_functions#safe_casting) | [TRY_CAST](https://docs.snowflake.com/en/sql-reference/functions/try_cast) |

## Date Functions

| BigQuery | Snowflake |
| --- | --- |
| [CURRENT_DATE](https://cloud.google.com/bigquery/docs/reference/standard-sql/date_functions#current_date) [CURRENT_DATE](https://cloud.google.com/bigquery/docs/reference/standard-sql/date_functions#current_date)() | [CURRENT_DATE](https://docs.snowflake.com/en/sql-reference/functions/current_date)  [CURRENT_DATE](https://docs.snowflake.com/en/sql-reference/functions/current_date)() |
| [FORMAT_DATE](https://cloud.google.com/bigquery/docs/reference/standard-sql/date_functions#format_date) | [TO_CHAR](https://docs.snowflake.com/en/sql-reference/functions/to_char)  *Note: For further details on this translation, please consult this* [*page*](format_date.md)*.* |

## Datetime Functions

| BigQuery | Snowflake |
| --- | --- |
| [CURRENT_DATETIME](https://cloud.google.com/bigquery/docs/reference/standard-sql/datetime_functions#current_datetime)  [CURRENT_DATETIME](https://cloud.google.com/bigquery/docs/reference/standard-sql/datetime_functions#current_datetime)() | [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp) :: TIMESTAMP_NTZ [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp)() :: TIMESTAMP_NTZ |

## Geography Functions

| BigQuery | Snowflake |
| --- | --- |
| [ST_GEOGFROMTEXT](https://cloud.google.com/bigquery/docs/reference/standard-sql/geography_functions#st_geogfromtext) | [ST_GEOGFROMTEXT](https://docs.snowflake.com/en/sql-reference/functions/st_geographyfromwkt)  *Note: For further details on this translation, please consult this* [*page*](st_geogfromtext.md)*.* |
| [ST_GEOGPOINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/geography_functions#st_geogpoint) | [ST_POINT](https://docs.snowflake.com/en/sql-reference/functions/st_makepoint)  *Note: For further details on this translation, please consult this* [*page*](st_geogpoint.md)*.* |

## JSON Functions

| BigQuery | Snowflake |
| --- | --- |
| [JSON_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#json_value) / [JSON_EXTRACT_SCALAR](https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#json_extract_scalar) | [JSON_EXTRACT_PATH_TEXT](https://docs.snowflake.com/en/sql-reference/functions/json_extract_path_text)  *Notes: SnowConvert AI automatically translates BigQuery JSON paths to their Snowflake equivalents.* |
| [JSON_VALUE_ARRAY](https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#json_value_array) | JSON_VALUE_ARRAY_UDF  *Notes: SnowConvert AI generates a UDF to obtain an equivalent behavior for extracting arrays from JSON.* |
| [LAX_INT64](https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#lax_int64) | PUBLIC.LAX_INT64_UDF  *Notes: SnowConvert AI generates a UDF to obtain an equivalent behavior.* |
| [LAX_BOOL](https://cloud.google.com/bigquery/docs/reference/standard-sql/json_functions#lax_bool) | PUBLIC.LAX_BOOL_UDF  *Notes: SnowConvert AI generates a UDF to obtain an equivalent behavior.* |

## Mathematical Functions

| BigQuery | Snowflake |
| --- | --- |
| [ABS](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#abs) | [ABS](https://docs.snowflake.com/en/sql-reference/functions/abs) |
| [LEAST](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#least) | [LEAST](https://docs.snowflake.com/en/sql-reference/functions/least) |
| [MOD](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#mod) | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod) |
| [ROUND](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#round)(X) [ROUND](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#round)(X, Y) [ROUND](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#round)(X, Y, ‘ROUND_HALF_EVEN’) [ROUND](https://cloud.google.com/bigquery/docs/reference/standard-sql/mathematical_functions#round)(X, Y, ‘ROUND_HALF_AWAY_FROM_ZERO’) | [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round)(X) [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round)(X, Y) [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round)(X, Y, ‘HALF_TO_EVEN’) [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round)(X, Y, ‘HALF_AWAY_FROM_ZERO’) |

## Navigation Functions

| BigQuery | Snowflake |
| --- | --- |
| [FIRST_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/navigation_functions#first_value) | [FIRST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/first_value) |
| [LAG](https://cloud.google.com/bigquery/docs/reference/standard-sql/navigation_functions#lag) | [LAG](https://docs.snowflake.com/en/sql-reference/functions/lag) |
| [LEAD](https://cloud.google.com/bigquery/docs/reference/standard-sql/navigation_functions#lead) | [LEAD](https://docs.snowflake.com/en/sql-reference/functions/lead) |
| [LAST_VALUE](https://cloud.google.com/bigquery/docs/reference/standard-sql/navigation_functions#last_value) | [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value) |

## Numbering Functions

| BigQuery | Snowflake |
| --- | --- |
| [RANK](https://cloud.google.com/bigquery/docs/reference/standard-sql/numbering_functions#rank) | [RANK](https://docs.snowflake.com/en/sql-reference/functions/rank) |
| [ROW_NUMBER](https://cloud.google.com/bigquery/docs/reference/standard-sql/numbering_functions#row_number) | [ROW_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/row_number) |

## String Functions

| BigQuery | Snowflake |
| --- | --- |
| [BYTE_LENGTH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#byte_length)(expr) | LENGTH(TO_BINARY(HEX_ENCODE(expr)))  *Notes: BigQuery’s BYTE_LENGTH returns the number of bytes in an encoded string. Snowflake equivalent converts to binary after hex encoding to get byte length.* |
| [CHARACTER_LENGTH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#character_length) [CHAR_LENGTH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#char_length) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [CONCAT](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#concat) | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) |
| [ENDS_WITH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#ends_with) | [ENDSWITH](https://docs.snowflake.com/en/sql-reference/functions/endswith) |
| [FROM_BASE64](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#from_base64) | [TRY_BASE64_DECODE_BINARY](https://docs.snowflake.com/en/sql-reference/functions/try_base64_decode_binary)  *Notes: BigQuery defaults to BASE64 for binary data output, but Snowflake uses HEX. In Snowflake, you can use the* [*`BASE64_ENCODE`*](https://docs.snowflake.com/en/sql-reference/functions/base64_encode) *function or set* [*`BINARY_OUTPUT_FORMAT`*](https://docs.snowflake.com/en/sql-reference/parameters#binary-output-format) *to `’BASE64’` to view binary data in BASE64.* |
| [FROM_HEX](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#from_hex) | [TRY_HEX_DECODE_BINARY](https://docs.snowflake.com/en/sql-reference/functions/try_hex_decode_binary)  *Notes: BigQuery defaults to BASE64 for binary data output, but Snowflake uses HEX. In Snowflake, you can use the* [*`BASE64_ENCODE`*](https://docs.snowflake.com/en/sql-reference/functions/base64_encode) *function or set* [*`BINARY_OUTPUT_FORMAT`*](https://docs.snowflake.com/en/sql-reference/parameters#binary-output-format) *to `’BASE64’` to view binary data in BASE64.* |
| [LEFT](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#left) | [LEFT](https://docs.snowflake.com/en/sql-reference/functions/left) |
| [LENGTH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#length) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [LOWER](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#lower) | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower) |
| [LPAD](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#lpad) | [LPAD](https://docs.snowflake.com/en/sql-reference/functions/lpad) |
| [LTRIM](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#ltrim) | [LTRIM](https://docs.snowflake.com/en/sql-reference/functions/ltrim) |
| [REGEXP_CONTAINS](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#regexp_contains)(value, regexp) | [REGEXP_INSTR](../../../../sql-reference/functions/regexp_instr.md)(value, regexp) > 0 |
| [REGEXP_EXTRACT_ALL](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#regexp_extract_all) | [REGEXP_SUBSTR_ALL](https://docs.snowflake.com/en/sql-reference/functions/regexp_substr_all) |
| [REPLACE](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#replace) | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| [RIGHT](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#right) | [RIGHT](https://docs.snowflake.com/en/sql-reference/functions/right) |
| [RPAD](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#rpad) | [RPAD](https://docs.snowflake.com/en/sql-reference/functions/rpad) |
| [RTRIM](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#rtrim) | [RTRIM](https://docs.snowflake.com/en/sql-reference/functions/rtrim) |
| [SPLIT](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#split) | [SPLIT](https://docs.snowflake.com/en/sql-reference/functions/split) |
| [STARTS_WITH](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#starts_with) | [STARTSWITH](https://docs.snowflake.com/en/sql-reference/functions/startswith) |
| [SUBSTR](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#substr)(string, position)  [SUBSTRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#substring)(string, position)  [SUBSTR](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#substr)(string, position, length)  [SUBSTRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#substring)(string, position, length) | [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr)(string, IFF(position < -LENGTH(string), 1, position))  [SUBSTRING](https://docs.snowflake.com/en/sql-reference/functions/substr)(string, IFF(position < -LENGTH(string), 1, position))  [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr)(string, IFF(position < -LENGTH(string), 1, position), length)  [SUBSTRING](https://docs.snowflake.com/en/sql-reference/functions/substr)(string, IFF(position < -LENGTH(string), 1, position), length) |
| [TO_HEX](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#to_hex) | [HEX_ENCODE](https://docs.snowflake.com/en/sql-reference/functions/hex_encode) |
| [UPPER](https://cloud.google.com/bigquery/docs/reference/standard-sql/string_functions#upper) | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) |

## Timestamp Functions

| BigQuery | Snowflake |
| --- | --- |
| [CURRENT_TIMESTAMP](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#current_timestamp) [CURRENT_TIMESTAMP](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#current_timestamp)() | [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp)  [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp)() |
| [SAFE.TIMESTAMP_MILLIS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#timestamp_millis) | IFF(expr BETWEEN -62135596800000 AND 253402300799999, TO_TIMESTAMP(expr / 1000), null)  *Notes: Safe version with range validation to prevent overflow errors.* |
| [SAFE.TIMESTAMP_SECONDS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#timestamp_seconds) | SAFE_TIMESTAMP_SECONDS_UDF(expr)  *Notes: SnowConvert AI generates a UDF to provide safe timestamp conversion with error handling.* |
| [TIMESTAMP_MILLIS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#timestamp_millis) | TO_TIMESTAMP(expr / 1000)  *Notes: Converts milliseconds since epoch to timestamp by dividing by 1000.* |
| [TIMESTAMP_SECONDS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#timestamp_seconds)(expr) | DATEADD(‘seconds’, expr, ‘1970-01-01’)  *Notes: Adds seconds to Unix epoch start date.* |
| [UNIX_MICROS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#unix_micros)(timestamp) | DATE_PART(‘epoch_microsecond’, CONVERT_TIMEZONE(‘UTC’, timestamp))  *Notes: Extracts microseconds since Unix epoch from timestamp converted to UTC.* |
| [UNIX_MILLIS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#unix_millis)(timestamp) | DATE_PART(‘epoch_millisecond’, CONVERT_TIMEZONE(‘UTC’, timestamp))  *Notes: Extracts milliseconds since Unix epoch from timestamp converted to UTC.* |
| [UNIX_SECONDS](https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions#unix_seconds)(timestamp) | DATE_PART(‘epoch_seconds’, CONVERT_TIMEZONE(‘UTC’, timestamp))  *Notes: Extracts seconds since Unix epoch from timestamp converted to UTC.* |

## FORMAT_DATE

Format_date function

### Description

Formats a `DATE` value according to a specified format string.

For more information, please refer to [FORMAT_DATE](https://cloud.google.com/bigquery/docs/reference/standard-sql/date_functions#format_date) function.

### Grammar Syntax

```sql
 FORMAT_DATE(format_string, date_expr)
```

#### Sample Source

##### BigQuery

```sql
CREATE TABLE TEST_DATE (col1 DATE);
SELECT FORMAT_DATE('%Y', col1);
```

##### Snowflake

```sql
CREATE TABLE TEST_DATE (col1 DATE);
SELECT
  TO_CHAR(col1, 'YYYY')
FROM
  TEST_DATE;
```

#### BigQuery Formats Equivalents

| BigQuery | Snowflake |
| --- | --- |
| %A | PUBLIC.DAYNAME_LONG_UDF(date_expr)  *Note: Generate UDF in conversion for support.* |
| %a | DY |
| %B | MMMM |
| %b | MON |
| %C | PUBLIC.CENTURY_UDF(date_expr)  *Note: Generate UDF in conversion for support.* |
| %c | DY MON DD HH24:MI:SS YYYY |
| %D | MM/DD/YY |
| %d | DD |
| %e | DD |
| %F | YYYY-MM-DD |
| %G | YEAROFWEEKISO(date_expr) |
| %g | PUBLIC.ISO_YEAR_PART_UDF(date_expr, 2)  *Note: Generate UDF in conversion for support.* |
| %H | HH24 |
| %h | MON |
| %I | HH12 |
| %J | PUBLIC.DAY_OF_YEAR_ISO_UDF(date_expr)  *Note: Generate UDF in conversion for support.* |
| %j | DAYOFYEAR(date_expr) |
| %k | HH24 |
| %l | HH12 |
| %M | MI |
| %m | MM |
| %n | *Not equivalent format* |
| %P | pm |
| %p | AM |
| %Q | QUARTER(date_expr) |
| %R | HH24:MI |
| %S | SS |
| %s | *Not equivalent format* |
| %T | HH24:MI:SS |
| %t | *Not equivalent format* |
| %U | WEEK(date_expr) |
| %u | DAYOFWEEKISO(date_expr) |
| %V | WEEKISO(date_expr) |
| %W | WEEK(date_expr)   *Note: Unlike BigQuery, Snowflake results are dictated by the values set for the WEEK_OF_YEAR_POLICY and/or WEEK_START session parameters. So, results could differ from BigQuery based on those parameters.* |
| %w | DAYOFWEEK(date_expr)  *Note: Unlike BigQuery, Snowflake results are dictated by the values set for the WEEK_OF_YEAR_POLICY and/or WEEK_START session parameters. So, results could differ from BigQuery based on those parameters.* |
| %X | HH24:MI:SS |
| %x | MM/DD/YY |
| %Y | YYYY |
| %y | YY |
| %Z | *Not equivalent format* |
| %z | *Not equivalent format* |
| %Ez | *Not equivalent format* |
| %E<number>S | *Not equivalent format* |
| %E\*S | *Not equivalent format* |
| %EY4 | YYYY |

> **Warning:**
>
> In BigQuery, the format related to time is not applied when the type is DATE, but Snowflake applies the format with values in zero for HH:MI:SS usages.

> **Note:**
>
> For more information, please refer to [BigQuery DateTime formats](https://cloud.google.com/bigquery/docs/reference/standard-sql/format-elements#format_elements_date_time).

## ST_GEOGFROMTEXT

Geography Function.

### Description

> Returns a `GEOGRAPHY` value that corresponds to the input [WKT](https://en.wikipedia.org/wiki/Well-known_text) representation.

For more information, please refer to [ST_GEOGFROMTEXT](https://cloud.google.com/bigquery/docs/reference/standard-sql/geography_functions#st_geogfromtext) function.

> **SuccessPlaceholder:**
>
> ST_GEOGFROMTEXT function is supported in Snowflake.

### Grammar Syntax

```sql
 ST_GEOGFROMTEXT(wkt_string[, oriented])
```

#### Sample Source

The oriented parameter in the ST_GEOGFROMTEXT function is not supported in Snowflake.

##### BigQuery

```sql
 SELECT ST_GEOGFROMTEXT('POINT(-122.35 37.55)');
SELECT ST_GEOGFROMTEXT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))', TRUE);
```

##### Snowflake

```sql
 SELECT ST_GEOGFROMTEXT('POINT(-122.35 37.55)');
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0006 - ORIENTED PARAMETER IN THE ST_GEOGFROMTEXT FUNCTION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
ST_GEOGFROMTEXT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))');
```

Please keep in mind that the default output format for geography data types is **WKT** **(Well-Known Text)** and in Snowflake **WKB (Well-Known Binary)**. You can use the [ST_ASWKT](https://docs.snowflake.com/en/sql-reference/functions/st_aswkt) function or set the [GEOGRAPHY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#geography-output-format) format if you want to view the data in **WKT** format.

#### Using ST_GEOGFROMTEXT function to insert geography data

This function is not allowed in the values clause and is not required in Snowflake.

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType VALUES
    (ST_GEOGFROMTEXT('POINT(-122.35 37.55)')),
    (ST_GEOGFROMTEXT('LINESTRING(-124.20 42.00, -120.01 41.99)'));
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType
VALUES
    (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'POINT(-122.35 37.55)'),
    (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'LINESTRING(-124.20 42.00, -120.01 41.99)');
```

### Related EWIs

1. [SSC-EWI-BQ0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): Oriented parameter in the ST_GEOGFROMTEXT function is not supported in Snowflake.
2. [SSC-FDM-BQ0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Geography function is not required in Snowflake.

## ST_GEOGPOINT

Geography Function.

### Description

> Creates a `GEOGRAPHY` with a single point. `ST_GEOGPOINT` creates a point from the specified `FLOAT64` longitude (in degrees, negative west of the Prime Meridian, positive east) and latitude (in degrees, positive north of the Equator, negative south) parameters and returns that point in a `GEOGRAPHY` value.

For more information, please refer to [ST_GEOGPOINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/geography_functions#st_geogpoint) function.

> **Note:**
>
> The function ST_GEOGPOINT is translated to ST_POINT in Snowflake.

### Grammar Syntax

```sql
 ST_GEOGPOINT(longitude, latitude)
```

#### Sample Source

##### BigQuery

```sql
 SELECT ST_GEOGPOINT(-122.0838, 37.3860);
```

##### Snowflake

```sql
 SELECT ST_POINT(-122.0838, 37.3860);
```

Please keep in mind that the default output format for geography data types is **WKT** **(Well-Known Text)** and in Snowflake **WKB (Well-Known Binary)**. You can use the [ST_ASWKT](https://docs.snowflake.com/en/sql-reference/functions/st_aswkt) function or set the [GEOGRAPHY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#geography-output-format) format if you want to view the data in **WKT** format.

#### Using ST_POINT function to insert geography data

This function is not allowed in the values clause and is not required in Snowflake.

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType
VALUES (ST_GEOGPOINT(-122.0838, 37.3860));
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/03/2025",  "domain": "test" }}';

INSERT INTO test.geographyType
VALUES (
--** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
'POINT(122.0838 37.3860)');
```

### Related EWIs

1. [SSC-FDM-BQ0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Geography function is not required in Snowflake.

---
title: SnowConvert AI - BigQuery - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-create-table.md
section: Migrations
---

# SnowConvert AI - BigQuery - CREATE TABLE

## Grammar syntax

```sql
CREATE [ OR REPLACE ] [ TEMP | TEMPORARY ] TABLE [ IF NOT EXISTS ]
table_name
[(
  column | constraint_definition[, ...]
)]
[DEFAULT COLLATE collate_specification]
[PARTITION BY partition_expression]
[CLUSTER BY clustering_column_list]
[OPTIONS(table_option_list)]
[AS query_statement]
```

### Sample Source Patterns

#### DEFAULT COLLATE

##### BigQuery

```sql
CREATE TABLE table1 (
    col1 STRING
)
DEFAULT COLLATE 'und:ci';
```

##### Snowflake

```sql
CREATE TABLE table1 (
    col1 STRING
)
DEFAULT_DDL_COLLATION='und-ci';
```

#### Labels table option

##### BigQuery

```sql
CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
OPTIONS(
  labels=[("org_unit", "development")]
);
```

##### Snowflake

```sql
CREATE TAG IF NOT EXISTS "org_unit";

CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
WITH TAG( "org_unit" = "development" )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}'
;
```

#### Description table option

##### BigQuery

```sql
CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
OPTIONS(
  description = 'My table comment'
);
```

##### Snowflake

```sql
CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
COMMENT = '{ "description": "My table comment", "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}'
;
```

#### Description table option

##### BigQuery

```sql
CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
OPTIONS(
  friendly_name = 'Some_table'
);
```

##### Snowflake

```sql
CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
COMMENT = '{ "friendly_name": "Some_table", "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}'
;
```

### Known Issues

**1. Unsupported table options**

Not all table options are supported in Snowflake, when an unsupported table option is encountered in the OPTIONS clause, an EWI will be generated to warn about this.

#### BigQuery

```sql
 CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
OPTIONS(
  expiration_timestamp=TIMESTAMP "2025-01-01 00:00:00 UTC",
  partition_expiration_days=1,
  description="a table that expires in 2025, with each partition living for 24 hours",
  labels=[("org_unit", "development")]
);
```

#### Snowflake

```sql
 CREATE TAG IF NOT EXISTS "org_unit";

CREATE TABLE table1
(
  col1 INT,
  col2 DATE
)
WITH TAG( "org_unit" = "development" )
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: EXPIRATION_TIMESTAMP, PARTITION_EXPIRATION_DAYS. ***/!!!
OPTIONS(
  expiration_timestamp=TIMESTAMP "2025-01-01 00:00:00 UTC",
  partition_expiration_days=1
)
COMMENT = '{ "description": "a table that expires in 2025, with each partition living for 24 hours", "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}'
;
```

**2. Micro-partitioning is automatically managed by Snowflake**

Snowflake performs automatic partitioning of data. User defined partitioning is not supported.

##### BigQuery

```sql
 CREATE TABLE table1(
    transaction_id INT,
    transaction_date DATE
)
PARTITION BY transaction_date;
```

##### Snowflake

```sql
 CREATE TABLE table1 (
    transaction_id INT,
    transaction_date DATE
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0002 - MICRO-PARTITIONING IS AUTOMATICALLY PERFORMED ON ALL SNOWFLAKE TABLES. ***/!!!
PARTITION BY transaction_date;
```

## Related EWIs

1. [SSC-EWI-BQ0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): Snowflake does not support the options clause.
2. [SSC-EWI-BQ0002](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): Micro-partitioning is automatically performed on all Snowflake tables.

## COLUMN DEFINITION

### Grammar syntax

```sql
 column :=
  column_name column_schema

column_schema :=
   {
     simple_type
     | STRUCT<field_list>
     | ARRAY<array_element_schema>
   }
   [PRIMARY KEY NOT ENFORCED | REFERENCES table_name(column_name) NOT ENFORCED]
   [DEFAULT default_expression]
   [NOT NULL]
   [OPTIONS(column_option_list)]

simple_type :=
  { data_type | STRING COLLATE collate_specification }

field_list :=
  field_name column_schema [, ...]

array_element_schema :=
  { simple_type | STRUCT<field_list> }
  [NOT NULL]
```

### Sample Source Patterns

#### Description option

##### BigQuery

```sql
CREATE TABLE table1 (
  col1 VARCHAR(20) OPTIONS(description="A repeated STRING field")
);
```

##### Snowflake

```sql
CREATE TABLE table1 (
  col1 VARCHAR(20) COMMENT = 'A repeated STRING field'
);
```

#### COLLATE

##### BigQuery

```sql
CREATE TABLE table1 (
  col1 STRING COLLATE 'und:ci'
);
```

##### Snowflake

```sql
CREATE TABLE table1 (
  col1 STRING COLLATE 'und-ci'
);
```

### Known Issues

**1. Rounding mode not supported**

Snowflake does not support specifying a default rounding mode on columns.

#### BigQuery

```sql
CREATE TABLE table1 (
  col1 STRING OPTIONS(rounding_mode = "ROUND_HALF_EVEN")
);
```

#### Snowflake

```sql
CREATE TABLE table1 (
    col1 STRING
    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: ROUNDING_MODE. ***/!!!
    OPTIONS(
        rounding_mode = "ROUND_HALF_EVEN"
    )
)
```

### Related EWIs

1. [SSC-EWI-BQ0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): Snowflake does not support the options clause.

## CREATE EXTERNAL TABLE

### Description

External tables let BigQuery query data that is stored outside of BigQuery storage. ([BigQuery SQL Language Reference CREATE EXTERNAL TABLE](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language#create_external_table_statement))

Syntax

```sql
CREATE [ OR REPLACE ] EXTERNAL TABLE [ IF NOT EXISTS ] table_name
[(
  column_name column_schema,
  ...
)]
[WITH CONNECTION {connection_name | DEFAULT}]
[WITH PARTITION COLUMNS
  [(
      partition_column_name partition_column_type,
      ...
  )]
]
OPTIONS (
  external_table_option_list,
  ...
);
```

The CREATE EXTERNAL TABLE statement from BigQuery will be transformed to a CREATE EXTERNAL TABLE statement from [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-external-table), however, this transformation requires user intervention.

To complete the transformation performed by SnowConvert AI, it is necessary to define a [Storage Integration](https://docs.snowflake.com/en/sql-reference/sql/create-storage-integration), a [External Stage](https://docs.snowflake.com/en/sql-reference/sql/create-stage) and (optional) [Notification Integration](https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration) that have access to the external source were files are located. Please refer to the following guides on how to set up the connection for each provider:

* [For external tables referencing Amazon S3](https://docs.snowflake.com/en/user-guide/tables-external-s3)
* [For external tables referencing Google Cloud Storage](https://docs.snowflake.com/en/user-guide/tables-external-gcs)
* [For external tables referencing Azure Blob Storage](https://docs.snowflake.com/en/user-guide/tables-external-azure)

Important considerations for the transformations shown in this page:

* The @EXTERNAL_STAGE placeholder must be replaced with the external stage created after following the previous guide.
* It is assumed that the external stage will point to the root of the bucket. This is important to consider because the PATTERN clause generated for each table specifies the file/folder paths starting at the base of the bucket, defining the external stage pointing to a different location in the bucket might produce undesired behavior.
* The `AUTO_REFRESH = FALSE` clause is generated to avoid errors, please note that automatic refresh of external table metadata is only valid if your Snowflake account cloud provider and the bucket provider are the same and a Notification Integration was created.

### Sample Source Patterns

#### CREATE EXTERNAL TABLE with explicit column list

When the column list is provided, SnowConvert AI will automatically generate the AS expression column options for each column to extract the file values.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER,
  Name STRING,
  Mail STRING,
  Position STRING,
  Salary INTEGER
)
OPTIONS(
  FORMAT='CSV',
  SKIP_LEADING_ROWS=1,
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Employees.csv']
);
```

##### Snowflake

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER AS CAST(GET_IGNORE_CASE($1, 'c1') AS INTEGER),
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c2') AS STRING),
  Mail STRING AS CAST(GET_IGNORE_CASE($1, 'c3') AS STRING),
  Position STRING AS CAST(GET_IGNORE_CASE($1, 'c4') AS STRING),
  Salary INTEGER AS CAST(GET_IGNORE_CASE($1, 'c5') AS INTEGER)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Employees.csv'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER =1);
```

#### CREATE EXTERNAL TABLE without explicit column list

When the column list is not provided, BigQuery automatically detects the schema of the columns from the file structure. To replicate this behavior, SnowConvert AI will generate a USING TEMPLATE clause that makes use of the [INFER_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) function to generate the column definitions.

Since the INFER_SCHEMA function requires a file format to work, SnowConvert AI will generate a temporary file format for this purpose, this file format is only required when running the CREATE EXTERNAL TABLE statement and it will be automatically dropped when the session ends.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json
OPTIONS(
  FORMAT='JSON',
  URIS=['gs://sc_external_table_bucket/folder_with_json/Cars.jsonl']
);
```

##### Snowflake

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_JSON_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/folder_with_json/Cars.jsonl', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_JSON_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_json/Cars.jsonl'
FILE_FORMAT = (TYPE = JSON);
```

#### CREATE EXTERNAL TABLE with multiple URIs

When multiple source URIs are specified, they will be joined in the regex of the PATTERN clause in Snowflake, the wildcard `*` characters used will be transformed to its `.*` equivalent in Snowflake.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.multipleFilesTable
(
  Name STRING,
  Code STRING,
  Price NUMERIC,
  Expiration_date DATE
)

OPTIONS(
  format="CSV",
  skip_leading_rows = 1,
  uris=['gs://sc_external_table_bucket/folder_with_csv/Food.csv', 'gs://sc_external_table_bucket/folder_with_csv/other_products/*']
);
```

##### Snowflake

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.multipleFilesTable
(
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c1') AS STRING),
  Code STRING AS CAST(GET_IGNORE_CASE($1, 'c2') AS STRING),
  Price NUMERIC AS CAST(GET_IGNORE_CASE($1, 'c3') AS NUMERIC),
  Expiration_date DATE AS CAST(GET_IGNORE_CASE($1, 'c4') AS DATE)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Food.csv|folder_with_csv/other_products/.*'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1);
```

#### WITH CONNECTION clause

The WITH CONNECTION clause is removed because the connection information is already provided to Snowflake using the Storage Integration.

##### BigQuery

```sql
 CREATE EXTERNAL TABLE test.awsTable
  WITH CONNECTION `aws-us-east-1.s3-read-connection`
  OPTIONS (
    format="JSON",
    uris=["s3://s3-bucket/json_files/example.jsonl"]
);
```

##### Snowflake

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_AWSTABLE_FORMAT
TYPE = JSON;

CREATE EXTERNAL TABLE test.awsTable USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/json_files/example.jsonl', FILE_FORMAT => 'SC_TEST_AWSTABLE_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS s3://s3-bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'json_files/example.jsonl'
FILE_FORMAT = (TYPE = JSON);
```

#### Supported table options

The following external table options are supported in Snowflake and transformed by SnowConvert AI:

* FORMAT
* ENCODING
* SKIP_LEADING_ROWS
* FIELD_DELIMITER
* COMPRESSION

##### BigQuery

```sql
CREATE OR REPLACE EXTERNAL TABLE test.songs_test
(
  Name STRING,
  Release_date INTEGER,
  Songs INT,
  Genre STRING
)
OPTIONS(
  FORMAT='CSV',
  ENCODING='UTF-8',
  SKIP_LEADING_ROWS=1,
  FIELD_DELIMITER='|',
  COMPRESSION='GZIP',
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Albums.csv']
);
```

##### Snowflake

```sql
CREATE OR REPLACE EXTERNAL TABLE test.songs_test
(
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c1') AS STRING),
  Release_date INTEGER AS CAST(GET_IGNORE_CASE($1, 'c2') AS INTEGER),
  Songs INT AS CAST(GET_IGNORE_CASE($1, 'c3') AS INT),
  Genre STRING AS CAST(GET_IGNORE_CASE($1, 'c4') AS STRING)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Albums.csv'
FILE_FORMAT = (TYPE = CSV
  ENCODING= 'UTF8' SKIP_HEADER =1
  FIELD_DELIMITER='|'
  COMPRESSION= GZIP);
```

### Known Issues

**1. CREATE EXTERNAL TABLE without explicit column list and CSV file format**

Currently, Snowflake external tables do not support parsing the header of CSV files. When an external table with no explicit column list and CSV file format is found, SnowConvert AI will produce the SKIP_HEADER file format option to avoid runtime errors, however, this will cause the table column names to have the autogenerated names c1, c2, …, cN.

An FDM is generated to notify that the header can not be parsed and that manually renaming the columns is necessary to preserve the names.

#### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_csv
OPTIONS(
  FORMAT='CSV',
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Employees.csv']
);
```

#### Snowflake

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_CSV_FORMAT
TYPE = CSV
SKIP_HEADER = 1;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_csv
--** SSC-FDM-BQ0005 - PARSING THE CSV HEADER IS NOT SUPPORTED IN EXTERNAL TABLES, COLUMNS MUST BE RENAMED TO MATCH THE ORIGINAL NAMES **
USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/folder_with_csv/Employees.csv', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_CSV_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Employees.csv'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1);
```

**2. External tables referencing Google Drive sources**

Snowflake does not support reading data from files hosted in Google Drive, an FDM will be generated to notify about this and request that the files are uploaded to the bucket and accessed through the external stage.

The PATTERN clause will hold autogenerated placeholders FILE_PATH0, FILE_PATH1, …, FILE_PATHN that should be replaced with the file/folder path after the files were moved to the external location.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_drive_test
OPTIONS(
  FORMAT='JSON',
  URIS=['https://drive.google.com/open?id=someFileId']
);
```

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_DRIVE_TEST_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_drive_test USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-BQ0008 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_DRIVE_TEST_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS A EXTERNAL LOCATION, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
--** SSC-FDM-BQ0006 - READING FROM GOOGLE DRIVE IS NOT SUPPORTED IN SNOWFLAKE, UPLOAD THE FILES TO THE EXTERNAL LOCATION AND REPLACE THE FILE_PATH PLACEHOLDERS **
PATTERN = 'FILE_PATH0'
FILE_FORMAT = (TYPE = JSON);
```

**3. External tables with the GOOGLE_SHEETS file format**

Snowflake does not support Google Sheets as a file format, however, its structure is similar to CSV files, which are supported by Snowflake.

When SnowConvert AI detects an external table using the GOOGLE_SHEETS format, it will produce an external table with the CSV file format instead.

Since Google Sheets are stored in Google Drive, it would be necessary to upload the files as CSV to the external location and specify the file paths in the PATTERN clause, just as mentioned in the previous issue.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.spreadsheetTable
(
  Name STRING,
  Code INTEGER,
  Price INTEGER,
  Expiration_date DATE
)
OPTIONS(
  format="GOOGLE_SHEETS",
  skip_leading_rows = 1,
  uris=['https://docs.google.com/spreadsheets/d/someFileId/edit?usp=sharing']
);
```

##### Snowflake

```sql
 --** SSC-FDM-BQ0007 - THE GOOGLE_SHEETS FORMAT IS NOT SUPPORTED IN SNOWFLAKE. CSV FILE TYPE IS USED AS A WORKAROUND. **
CREATE OR REPLACE EXTERNAL TABLE test.spreadsheetTable
(
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c1') AS STRING),
  Code INTEGER AS CAST(GET_IGNORE_CASE($1, 'c2') AS INTEGER),
  Price INTEGER AS CAST(GET_IGNORE_CASE($1, 'c3') AS INTEGER),
  Expiration_date DATE AS CAST(GET_IGNORE_CASE($1, 'c4') AS DATE)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS A EXTERNAL LOCATION, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
--** SSC-FDM-BQ0006 - READING FROM GOOGLE DRIVE IS NOT SUPPORTED IN SNOWFLAKE, UPLOAD THE FILES TO THE EXTERNAL LOCATION AND REPLACE THE FILE_PATH PLACEHOLDERS **
PATTERN = 'FILE_PATH0'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1);
```

**4. External tables with unsupported file formats**

Snowflake supports the following BigQuery formats:

| BigQuery | Snowflake |
| --- | --- |
| AVRO | AVRO |
| CSV GOOGLE_SHEETS | CSV |
| NEWLINE_DELIMITED_JSON JSON | JSON |
| ORC | ORC |
| PARQUET | PARQUET |

Other formats will be marked as not supported.

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table
OPTIONS (
  format = 'DATASTORE_BACKUP',
  uris = ['gs://backup_bucket/backup_folder/*']
);
```

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0013 - EXTERNAL TABLE DATA FORMAT NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table
OPTIONS (
  format = 'DATASTORE_BACKUP',
  uris = ['gs://backup_bucket/backup_folder/*']
);
```

**5. Hive partitioned external tables**

Snowflake does not support hive partitioned external tables, the WITH PARTITION COLUMNS clause will be marked as not supported.

##### BigQuery

```sql
CREATE EXTERNAL TABLE test.CustomHivePartitionedTable
WITH PARTITION COLUMNS (
  field_1 STRING,
  field_2 INT64)
OPTIONS (
  uris = ['gs://sc_external_table_bucket/folder_with_parquet/*'],
  format = 'PARQUET',
  hive_partition_uri_prefix = 'gs://sc_external_table_bucket/folder_with_parquet',
  require_hive_partition_filter = false);
```

##### Snowflake

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_CUSTOMHIVEPARTITIONEDTABLE_FORMAT
TYPE = PARQUET;

CREATE EXTERNAL TABLE test.CustomHivePartitionedTable USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-BQ0008 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_CUSTOMHIVEPARTITIONEDTABLE_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0014 - HIVE PARTITIONED EXTERNAL TABLES ARE NOT SUPPORTED IN SNOWFLAKE ***/!!!
WITH PARTITION COLUMNS (
  field_1 STRING,
  field_2 INT64)
PATTERN = 'folder_with_parquet/.*'
FILE_FORMAT = (TYPE = PARQUET)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: HIVE_PARTITION_URI_PREFIX, REQUIRE_HIVE_PARTITION_FILTER. ***/!!!
OPTIONS(
  hive_partition_uri_prefix = 'gs://sc_external_table_bucket/folder_with_parquet',
  require_hive_partition_filter = false
);
```

**6. External table without columns list and no valid file URI for the INFER_SCHEMA function**

The INFER_SCHEMA function requires a LOCATION parameter that specifies the path to a file or folder that will be used to construct the table columns, however, this path does not support regex, meaning that the wildcard `*` character is not supported.

When the table has no columns, SnowConvert AI will check all URIS to find one that does not use wildcards and use it in the INFER_SCHEMA function, when no URI meets such criteria an FDM and FILE_PATH placeholder will be generated, the placeholder has to be replaced with the path of one of the files referenced by the external table to generate the table columns.

##### BigQuery

```sql
CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2
OPTIONS(
  FORMAT='JSON',
  URIS=['gs://sc_external_table_bucket/folder_with_json/*']
);
```

##### Snowflake

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2 USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-BQ0008 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_json/.*'
FILE_FORMAT = (TYPE = JSON);
```

**7. Unsupported table options**

Any other table option not mentioned in the Supported table options pattern will be marked as not supported.

##### BigQuery

```sql
CREATE OR REPLACE EXTERNAL TABLE dataset.CsvTable
(
  x INTEGER,
  y STRING
)
OPTIONS (
  format = 'CSV',
  uris = ['gs://bucket/example.csv'],
  field_delimiter = '|',
  max_bad_records = 5
);
```

##### Snowflake

```sql
CREATE OR REPLACE EXTERNAL TABLE dataset.CsvTable
(
  x INTEGER AS CAST(GET_IGNORE_CASE($1, 'c1') AS INTEGER),
  y STRING AS CAST(GET_IGNORE_CASE($1, 'c2') AS STRING)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'example.csv'
FILE_FORMAT = (TYPE = CSV
  field_delimiter = '|')
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: MAX_BAD_RECORDS. ***/!!!
OPTIONS(
  max_bad_records = 5
);
```

### Related EWIs

1. [SSC-EWI-BQ0013](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): External table data format not supported in snowflake
2. [SSC-EWI-BQ0014](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): Hive partitioned external tables are not supported in snowflake
3. [SSC-EWI-BQ0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): External table requires an external stage to access an external location, define and replace the EXTERNAL_STAGE placeholder
4. [SSC-FDM-BQ0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): The INFER_SCHEMA function requires a file path without wildcards to generate the table template, replace the FILE_PATH placeholder with it
5. [SSC-FDM-BQ0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Parsing the CSV header is not supported in external tables, columns must be renamed to match the original names
6. [SSC-FDM-BQ0006](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Reading from Google Drive is not supported in Snowflake, upload the files to the external location and replace the FILE_PATH placeholders
7. [SSC-FDM-BQ0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): The GOOGLE_SHEETS format is not supported in Snowflake. CSV file type is used as a workaround.

## CREATE TABLE CLONE

### Grammar syntax

```sql
CREATE TABLE [ IF NOT EXISTS ]
destination_table_name
CLONE source_table_name [FOR SYSTEM_TIME AS OF time_expression]
...
[OPTIONS(table_option_list)]
```

### Sample Source Patterns

#### FOR SYSTEM TIME AS OF

##### BigQuery

```sql
CREATE TABLE my_clone_table
CLONE some_table_name2
FOR SYSTEM_TIME AS OF TIMESTAMP "2025-01-01 00:00:00 UTC";
```

##### Snowflake

```sql
CREATE TABLE my_clone_table
CLONE some_table_name2 AT (TIMESTAMP => TIMESTAMP "2025-01-01 00:00:00 UTC");
```

::{note}
The LABELS option in CREATE TABLE CLONE statements are not transformed into TAGs because the TAGs of the source table are copied, they cannot be changed during the copy of the table.
Transformation of other table options are the same as specified for the CREATE TABLE statement.

## CREATE TABLE COPY

### Grammar syntax

```sql
CREATE [ OR REPLACE ] TABLE [ IF NOT EXISTS ] table_name
COPY source_table_name
...
[OPTIONS(table_option_list)]
```

### Sample Source Patterns

#### General case

CREATE TABLE CLONE in Snowflake is functionally equivalent to CREATE TABLE COPY.

##### Input Code

##### BigQuery

```sql
CREATE TABLE newtable
COPY sourceTable;
```

##### Snowflake

```sql
CREATE TABLE newtable CLONE sourceTable;
```

> **Note:**
>
> The LABELS option in CREATE TABLE COPY statements are not transformed into TAGs because the TAGs of the source table are copied, they cannot be changed during the copy of the table.
> Transformation of other table options are the same as specified for the CREATE TABLE statement.

## CREATE TABLE LIKE

### Grammar syntax

```sql
CREATE [ OR REPLACE ] TABLE [ IF NOT EXISTS ]
table_name
LIKE [[project_name.]dataset_name.]source_table_name
...
[OPTIONS(table_option_list)]
```

> **Success:**
>
> CREATE TABLE LIKE is fully supported by Snowflake.

> **Note:**
>
> The LABELS option in CREATE TABLE LIKE statements are not transformed into TAGs because the TAGs of the source table are copied, they cannot be changed during the copy of the table.
> Transformation of other table options are the same as specified for the CREATE TABLE statement.

## CREATE TABLE SNAPSHOT

### Grammar syntax

```sql
CREATE SNAPSHOT TABLE [ IF NOT EXISTS ] table_snapshot_name
CLONE source_table_name
[FOR SYSTEM_TIME AS OF time_expression]
[OPTIONS(snapshot_option_list)]
```

### Sample Source Patterns

#### General case

The Snapshot keyword is removed in Snowflake, transforming the table into a CREATE TABLE CLONE.

The two differences between snapshot and clones are that snapshots are not editable and usually have an expiration date. Expiration dates are not supported, this is handled as specified for the CREATE TABLE statement unsupported options.

##### BigQuery

```sql
CREATE SNAPSHOT TABLE mytablesnapshot
CLONE mytable;
```

##### Snowflake

```sql
CREATE TABLE mytablesnapshot CLONE mytable;
```

#### FOR SYSTEM TIME AS OF

##### BigQuery

```sql
CREATE SNAPSHOT TABLE IF NOT EXISTS my_snapshot_table2
CLONE some_table_name2
FOR SYSTEM_TIME AS OF TIMESTAMP "2025-01-01 00:00:00 UTC";
```

##### Snowflake

```sql
CREATE TABLE IF NOT EXISTS my_snapshot_table2
CLONE some_table_name2 AT (TIMESTAMP => TIMESTAMP "2025-01-01 00:00:00 UTC");
```

> **Note:**
>
> The LABELS option in CREATE TABLE COPY statements are not transformed into TAGs because the TAGs of the source table are copied, they cannot be changed during the copy of the table.
>
> Transformation of other table options are the same as specified for the CREATE TABLE statement.

---
title: SnowConvert AI - BigQuery - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-create-view.md
section: Migrations
---

# SnowConvert AI - BigQuery - CREATE VIEW

## Description

Creates a new view. ([BigQuery SQL Language Reference Create view statement](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language?hl=en#create_view_statement))

> **Success:**
>
> This syntax is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-view).

## Grammar Syntax

```sql
CREATE [ OR REPLACE ] VIEW [ IF NOT EXISTS ] view_name
[(view_column_name_list)]
[OPTIONS(view_option_list)]
AS query_expression

view_column_name_list :=
  view_column[, ...]

view_column :=
  column_name [OPTIONS(view_column_option_list)]
```

### Sample Source Patterns

#### BigQuery

```sql
CREATE VIEW myuser
AS
SELECT lastname FROM users;

CREATE OR REPLACE VIEW myuser2
AS
SELECT lastname FROM users2;

CREATE VIEW IF NOT EXISTS myuser2
AS
SELECT lastname FROM users2;
```

#### Snowflake

```sql
CREATE VIEW myuser
AS
SELECT lastname FROM
users;

CREATE OR REPLACE VIEW myuser2
AS
SELECT lastname FROM
users2;

CREATE VIEW myuser3
AS
SELECT lastname FROM
users3;
```

### Known Issues

There are no known Issues.

### Related EWIs

There are no related EWIs.

## View column name list

### Description

The view’s column name list is optional. The names must be unique but do not have to be the same as the column names of the underlying SQL query. ([BigQuery SQL Language Reference View column name list](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language?hl=en#view_column_name_list))

> **Success:**
>
> This syntax is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-view).

### Grammar Syntax

```sql
view_column_name_list :=
  view_column [OPTIONS(view_column_option_list)] [, ...]

view_column_option_list :=
  DESCRIPTION = value
```

### Sample Source Patterns

#### BigQuery

```sql
CREATE VIEW `myproject.mydataset.newview` (
  column_1_new_name OPTIONS (DESCRIPTION='Description of the column 1 contents'),
  column_2_new_name OPTIONS (DESCRIPTION='Description of the column 2 contents'),
  column_3_new_name OPTIONS (DESCRIPTION='Description of the column 3 contents')
)
AS SELECT column_1, column_2, column_3 FROM `myproject.mydataset.mytable`
```

#### Snowflake

```sql
 CREATE VIEW myproject.mydataset.newview
(
  column_1_new_name COMMENT 'Description of the column 1 contents',
  column_2_new_name COMMENT 'Description of the column 2 contents',
  column_3_new_name COMMENT 'Description of the column 3 contents'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "03/25/2025",  "domain": "test" }}'
AS SELECT column_1, column_2, column_3 FROM
  myproject.mydataset.mytable
```

### Known Issues

There are no known Issues.

### Related EWIs

There are no related EWIs.

## View Options

### Description

> The option list allows you to set view options such as a label and an expiration time. You can include multiple options using a comma-separated list. ([BigQuery SQL Language Reference View Options](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language?hl=en#view_option_list))

> **Warning:**
>
> This syntax is partially supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-view).

### Grammar Syntax

```none
OPTIONS(view_option_list [,...])

view_option_list :=
  NAME = value
```

| NAME | Value | Supported |
| --- | --- | --- |
| expiration_timestamp | TIMESTAMP | false |
| friendly_name | STRING | true |
| description | STRING | true |
| labels | ARRAY<STRUCT<STRING, STRING>> | true |
| privacy_policy | JSON-formatted STRING | false |

### Sample Source Patterns

#### Description & Friendly_name:

The description and friendly_name options are included in the Comment Clause generated by SnowConvert AI .

##### BigQuery

```sql
CREATE VIEW my_view
OPTIONS (
  description="This is a view description",
  friendly_name="my_friendly_view") AS
SELECT column1, column2
FROM my_table;
```

##### Snowflake

```sql
CREATE VIEW my_view
COMMENT = '{ "description": "This is a view description", "friendly_name": "my_friendly_view", "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "03/25/2025",  "domain": "test" }}'
AS
SELECT column1, column2
FROM
 my_table;
```

#### Labels:

In BigQuery the labels associated with a view can be used to organize and group tables in the database administrative environment, in Snowflake the Tags can be used for the same functionality. But to ensure that the tag exists, SnowConvert AI will add the corresponding CREATE TAG before the CREATE VIEW if it contains labels. It is important to know that the `CREATE TAG` feature requires Enterprise Edition or higher

##### BigQuery

```sql
CREATE VIEW my_view
OPTIONS(
    labels=[("label1", "value1"), ("label2", "value2")]
)
AS
SELECT column1, column2
FROM table1;
```

##### Snowflake

```sql
CREATE TAG IF NOT EXISTS "label1";
CREATE TAG IF NOT EXISTS "label2";

CREATE VIEW my_view
WITH TAG( "label1" = "value1","label2" = "value2" )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "03/26/2025",  "domain": "test" }}'
AS
SELECT column1, column2
FROM
  table1;
```

#### Unsupported Options:

When an option clause includes elements not supported by Snowflake, An EWI will be added.

##### BigQuery

```sql
CREATE VIEW my_view
OPTIONS (
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
) AS
SELECT column1, column2
FROM my_table;
```

##### Snowflake

```sql
CREATE VIEW my_view10
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: EXPIRATION_TIMESTAMP, PRIVACY_POLICY ***/!!!
OPTIONS(
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "03/26/2025",  "domain": "test" }}'
AS
SELECT column1, column2
FROM
  my_table;
```

#### Known Issues

* The label-to-tag transformation could lead to errors if the Snowflake account is not Enterprise Edition or higher.

#### Related EWIs

1. [SSC-EWI-BQ0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): The OPTIONS clause within View is not supported in Snowflake.

---
title: SnowConvert AI - BigQuery - Data types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-data-types.md
section: Migrations
---

# SnowConvert AI - BigQuery - Data types

Snowflake provides support for the majority of fundamental SQL data types, with specific restrictions, across various SQL constructs including columns, local variables, expressions, and parameters.

## Boolean Data Type

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [BOOL/BOOLEAN](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#boolean_type) | [BOOLEAN](https://docs.snowflake.com/en/sql-reference/data-types-logical#boolean) |  |

## Bytes Data Type

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [BYTES](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#bytes_type) | [BINARY](https://docs.snowflake.com/en/sql-reference/data-types-text#binary) | BYTES data type is **not supported** in Snowflake. BINARY is used instead. For more information, please refer to the BYTES data type documentation. |

## Datetime Data Types

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [DATE](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#date_type) | [DATE](https://docs.snowflake.com/en/sql-reference/data-types-datetime#date) |  |
| [DATETIME](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#date_type) | [DATETIME](https://docs.snowflake.com/en/sql-reference/data-types-datetime#datetime) | DATETIME is an alias for TIMESTAMP_NTZ in Snowflake. |
| [TIMESTAMP](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#timestamp_type) | [TIMESTAMP_TZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp-ltz-timestamp-ntz-timestamp-tz) | TIMESTAMP data type is converted to TIMESTAMP_TZ. For more information, please refer to the TIMESTAMP data type documentation. |
| [TIME](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#time_type) | [TIME](https://docs.snowflake.com/en/sql-reference/data-types-datetime#time) |  |

## Geography Data Type

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [GEOGRAPHY](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#geography_type) | [GEOGRAPHY](https://docs.snowflake.com/en/sql-reference/data-types-geospatial#geography-data-type) |  |

## Interval Data Type

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [INTERVAL](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#interval_type) | [VARCHAR(30)](https://docs.snowflake.com/en/sql-reference/data-types-text#varchar) | INTERVAL data type is **not supported** in Snowflake. VARCHAR is used instead. For more information, please refer to the INTERVAL data type documentation. With the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md), maps to `INTERVAL DAY TO SECOND`. See [Interval Data Types](../general/interval-data-types.md). |

## Json Data Type

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [JSON](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#json_type) | [VARIANT](https://docs.snowflake.com/en/sql-reference/data-types-semistructured#variant) | JSON data type is **not supported** in Snowflake. VARIANT is used instead. For more information, please refer to the JSON data type documentation. |

## Numeric Data Types

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [INT64](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [INT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | INT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [INT](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [INT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | INT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [SMALLINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [SMALLINT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | SMALLINT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [INTEGER](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [INTEGER](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | INTEGER is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [BIGINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [BIGINT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | BIGINT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [TINYINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [TINYINT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | TINYINT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [BYTEINT](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [BYTEINT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) | BYTEINT is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [NUMERIC](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [NUMERIC](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) | NUMERIC is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [DECIMAL](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [DECIMAL](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) | DECIMAL is an alias for the NUMBER data type in Snowflake. The maximum precision and scale is NUMBER(38,37). |
| [BIGNUMERIC](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [NUMERIC](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric)​ | Snowflake does not support the BIGNUMERIC data type. Use NUMERIC instead. BIGNUMERIC’s precision 76,76 exceeds Snowflake’s limit (38), resulting in truncation or rounding, which can introduce significant inaccuracies. |
| [BIGDECIMAL](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#numeric_types) | [DECIMAL](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) | Snowflake does not support the BIGDECIMAL data type. Use NUMERIC instead. BIGDECIMAL’s precision 76,76 exceeds Snowflake’s limit (38), resulting in truncation or rounding, which can introduce significant inaccuracies. |
| [FLOAT64](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#floating_point_types) | [FLOAT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#data-types-for-floating-point-numbers) |  |

## String Data Types

| BigQuery | Snowflake | Notes |
| --- | --- | --- |
| [STRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#string_type) | [STRING](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#string_type) | STRING is an alias for the VARCHAR data type in Snowflake. VARCHAR holds Unicode UTF-8 characters. |

## ANY TYPE

Translation specification for BigQuery’s ANY TYPE data type

### Description

The following is an extract of information about the usage of `ANY TYPE` within `CREATE FUNCTION` statements.

> A parameter with a type equal to `ANY TYPE` can match more than one argument type when the function is called.
>
> * If more than one parameter has type `ANY TYPE`, then BigQuery doesn’t enforce any type relationship between these arguments.
> * The function return type cannot be `ANY TYPE`. It must be either omitted, which means to be automatically determined based on `sql_expression`, or an explicit type.
> * Passing the function arguments of types that are incompatible with the function definition results in an error at call time.

### Sample source patterns

#### Type definition for UDFs

`ANY TYPE` can only be found as the type for a function’s parameter. SnowConvert AI automatically translates `ANY TYPE` to `VARIANT`.

##### BigQuery

```sql
CREATE FUNCTION addFourAndDivideAny(x ANY TYPE, y ANY TYPE)
AS (
  (x + 4) / y
);
```

##### Snowflake

```sql
CREATE FUNCTION addFourAndDivideAny (x VARIANT, y VARIANT)
RETURNS VARIANT
AS
$$
  ((x + 4) / y) :: VARIANT
$$;
```

## ARRAY<T>

Translation specification for the ARRAY datatype from BigQuery to Snowflake

### Description

In BigQuery, an array is an ordered list of zero or more elements of non-array values. Elements in an array must share the same type. ([Array Type. BigQuery](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#array_type))

### Sample Source Patterns

#### BigQuery

```sql
CREATE TABLE test.arrayTable
(
  col1 ARRAY<INT64>
);

CREATE TABLE test.anotherArrayTable
(
  col2 ARRAY<INT64>
);

INSERT INTO test.arrayTable VALUES ([4, 10, 55]);
INSERT INTO test.arrayTable VALUES ([6, 7, 33]);
INSERT INTO test.arrayTable VALUES ([50, 12, 22]);

INSERT INTO test.anotherArrayTable VALUES ([9, 11, 52]);
INSERT INTO test.anotherArrayTable VALUES ([3, 18, 11]);
INSERT INTO test.anotherArrayTable VALUES ([33, 27, 43]);
```

#### Snowflake

```sql
CREATE TABLE test.arrayTable
(
  col1 ARRAY DEFAULT []
);

CREATE TABLE test.anotherArrayTable
(
  col2 ARRAY DEFAULT []
);

INSERT INTO test.arrayTable SELECT [4, 10, 55];
INSERT INTO test.arrayTable SELECT [6, 7, 33];
INSERT INTO test.arrayTable SELECT [50, 12, 22];

INSERT INTO test.anotherArrayTable SELECT [9, 11, 52];
INSERT INTO test.anotherArrayTable SELECT [3, 18, 11];
INSERT INTO test.anotherArrayTable SELECT [33, 27, 43];
```

#### ARRAY access by index

##### BigQuery

```sql
SELECT
col1[0] + 4 AS byIndex,
col1[OFFSET(0)] + 4 AS byOffset,
col1[ORDINAL(1)] + 4 AS byOrdinal
FROM test.arrayTable ORDER BY col1[0];
```

##### Snowflake

```sql
SELECT
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
col1[0] + 4 AS byIndex,
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
col1[0] + 4 AS byOffset,
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
col1[1 - 1] + 4 AS byOrdinal
FROM
test.arrayTable
ORDER BY
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
col1[0];
```

#### Safe ARRAY access by index

##### BigQuery

```sql
SELECT
col1[SAFE_OFFSET(0)] AS byOffsset,
col1[SAFE_OFFSET(-4)] AS byOffsetUnderflow,
col1[SAFE_OFFSET(500)] AS byOffsetOverflow,
col1[SAFE_ORDINAL(1)] AS byOrdinal,
col1[SAFE_ORDINAL(-4)] AS byOrdinalUnderflow,
col1[SAFE_ORDINAL(500)] AS byOrdinalOverflow
FROM test.arrayTable ORDER BY col1[0];
```

##### Snowflake

```sql
SELECT
PUBLIC.SAFE_OFFSET_UDF(col1, 0) AS byOffsset,
PUBLIC.SAFE_OFFSET_UDF(col1, -4) AS byOffsetUnderflow,
PUBLIC.SAFE_OFFSET_UDF(col1, 500) AS byOffsetOverflow,
PUBLIC.SAFE_OFFSET_UDF(col1, 1 - 1) AS byOrdinal,
PUBLIC.SAFE_OFFSET_UDF(col1, -4 - 1) AS byOrdinalUnderflow,
PUBLIC.SAFE_OFFSET_UDF(col1, 500 - 1) AS byOrdinalOverflow
FROM test.arrayTable ORDER BY
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
col1[0];
```

#### INSERT with ARRAY in the VALUES clause

##### BigQuery

```sql
INSERT INTO test.arrayTable VALUES ([4, 10]);

INSERT INTO test.arrayTable (COL1)
VALUES ([1, 2, 3]), ([4, 5, 6]);

SELECT col1 FROM test.arrayTable ORDER BY col1[0], col1[1];
```

##### Snowflake

```sql
INSERT INTO test.arrayTable SELECT [4, 10];

INSERT INTO test.arrayTable (COL1)
SELECT [1, 2, 3]
UNION ALL
SELECT [4, 5, 6];

SELECT col1 FROM
  test.arrayTable
ORDER BY
  --** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
  col1[0],
  --** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
  col1[1];
```

#### MERGE statement

##### BigQuery

```sql
MERGE INTO test.anotherArrayTable
USING test.arrayTable
ON col1[0] = col2[0]
WHEN MATCHED THEN UPDATE SET col2 = col1
WHEN NOT MATCHED THEN INSERT VALUES ([100, 100, 100]);

SELECT col2 FROM test.anotherArrayTable ORDER BY col2[0];
```

##### Snowflake

```sql
MERGE INTO test.anotherArrayTable
USING test.arrayTable
ON col1[0] = col2[0]
WHEN MATCHED THEN UPDATE SET col2 = col1
WHEN NOT MATCHED THEN INSERT VALUES ([100, 100, 100]) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'MergeStatement' NODE ***/!!!;

SELECT col2 FROM
  test.anotherArrayTable
ORDER BY
  --** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
  col2[0];
```

#### ARRAY DEFAULT column value insertion/update

##### BigQuery

```sql
 INSERT INTO test.arrayTable VALUES (DEFAULT);

UPDATE test.arrayTable
SET col1 = DEFAULT
WHERE TRUE;

SELECT col1 FROM test.arrayTable;
```

##### Snowflake

```sql
 INSERT INTO test.arrayTable SELECT [];

UPDATE test.arrayTable
SET col1 = DEFAULT
WHERE TRUE;

SELECT col1 FROM test.arrayTable;
```

#### INSERT/UPDATE with NULL value

##### BigQuery

```sql
 INSERT INTO test.arrayTable
  SELECT
    numbers
  FROM
    (SELECT [6] AS numbers
    UNION ALL
    SELECT CAST(NULL AS ARRAY<INT64>));

UPDATE test.arrayTable
SET col1 = NULL
WHERE ARRAY_LENGTH(col1) > 1;

SELECT col1 FROM test.arrayTable ORDER BY ARRAY_LENGTH(col1);
```

##### Snowflake

```sql
INSERT INTO test.arrayTable
SELECT
  numbers
FROM
  (SELECT [6] AS numbers
  UNION ALL
  SELECT IFNULL(CAST(NULL AS ARRAY), []));

UPDATE test.arrayTable
SET col1 = IFNULL(NULL, [])
WHERE ARRAY_SIZE(col1) > 1;

SELECT col1 FROM test.arrayTable ORDER BY ARRAY_SIZE(col1);
```

#### ARRAY concatenation

##### BigQuery

```sql
SELECT [50, 30, 12] || [22, 33, 44] AS result;
```

##### Snowflake

```sql
SELECT ARRAY_CAT([50, 30, 12], [22, 33, 44]) AS result;
```

#### ARRAY used as parameter/return type

##### BigQuery

```sql
CREATE FUNCTION test.myArrayFunction (valuesArray ARRAY<INT64>, otherValue INTEGER)
RETURNS ARRAY<INT64>
AS
(
  valuesArray || [otherValue]
);

SELECT test.myArrayFunction([5, 20, 10], 55) AS result;
```

##### Snowflake

```sql
CREATE FUNCTION test.myArrayFunction (valuesArray ARRAY, otherValue INTEGER)
RETURNS ARRAY
AS
$$
  ARRAY_CAT(valuesArray, [otherValue])
$$;

SELECT test.myArrayFunction([5, 20, 10], 55) AS result;
```

### Known Issues

**1. Non-safe ARRAY access will not fail for positive out of bounds indexes**

In BigQuery, accessing an array element by index will fail for any index value that is too low (underflow) or too high (overflow) when not using SAFE_OFFSET or SAFE_ORDINAL. However, in Snowflake errors are thrown only for underflow cases, any index that would case an overflow error will generate a NULL value instead.

When non-safe access to elements in an array is detected SnowConvert AI will generate [SSC-FDM-BQ0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md) to warn the user about this.

### Related EWIs

1. [SSC-FDM-BQ0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Accessing arrays produces NULL instead of an error for positive out of bounds indexes in Snowflake.

## BYTES

Bytes data type and usages

### Description

> Sequence of bytes with a maximum of L bytes allowed in the binary string. The maximum length is 8 MB (8,388,608 bytes). For more information please refer to [BigQuery BYTES data type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#bytes_type).

> **Note:**
>
> BYTES data type is not supported in Snowflake, currently transformed to [BINARY](https://docs.snowflake.com/en/sql-reference/data-types-text#binary).

### Sample Source Patterns

#### BYTES output format

The default output format for binary data types in BigQuery is ‘BASE64’ and in Snowflake ‘HEX’. For this reason, when a binary column is selected, the [BASE64_ENCODE](https://docs.snowflake.com/en/sql-reference/functions/base64_encode) function is automatically added. In order to maintain the default formatting of BigQuery.

##### BigQuery

```sql
 CREATE OR REPLACE TABLE bytesTable
(
  COL1 BYTES,
  COL2 BYTES(20)
);

INSERT INTO bytesTable VALUES (B"01020304", B"""AABBCCDD""");
INSERT INTO bytesTable VALUES (B'''\x01\x02\x03''', B"/+A=");

SELECT COL1 FROM bytesTable;
```

##### Snowflake:

```sql
CREATE OR REPLACE TABLE bytesTable
(
  COL1 BINARY,
  COL2 BINARY(20)
);

INSERT INTO bytesTable
SELECT
  TRY_TO_BINARY('01020304', 'utf-8'),
  TRY_TO_BINARY('AABBCCDD', 'utf-8');

INSERT INTO bytesTable
SELECT
  TRY_TO_BINARY('\x01\x02\x03', 'utf-8'),
  TRY_TO_BINARY('/+A=', 'utf-8');

SELECT BASE64_ENCODE( COL1) FROM bytesTable;
```

In case it is not added automatically and you want to see the data in BASE64 format, you can use the [BASE64_ENCODE](https://docs.snowflake.com/en/sql-reference/functions/base64_encode) function or set the [BINARY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#binary-output-format) format.

#### BYTES Literal

The following cases represent the forms that can be used to format byte literals in BigQuery.

```sql
 B"abc"
B'''abc'''
b"""abc"""
```

These literals are not supported in Snowflake, but instead the [TRY_TO_BINARY](https://docs.snowflake.com/en/sql-reference/functions/try_to_binary) function can be used to convert the input expression to a binary value. This function is a special version of [TO_BINARY](https://docs.snowflake.com/en/sql-reference/functions/to_binary) that performs the same operation, but with error handling support.

It is important to take into consideration that the binary format for the conversion can be: HEX, BASE64, or UTF-8. The default is the value of the [BINARY_INPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#binary-input-format) session parameter. If this parameter is not set, the default value is HEX.

#### Observations

* Please keep in mind that the default output format for binary data types in BigQuery is ‘BASE64’ and in Snowflake ‘HEX’. You can use the [BASE64_ENCODE](https://docs.snowflake.com/en/sql-reference/functions/base64_encode) function or set the [BINARY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#binary-output-format) format if you want to view the data in BASE64 format.
* The only formats supported by Snowflake are: HEX, BASE64, or UTF-8. For more information, please refer to[Binary Input and Output](https://docs.snowflake.com/en/user-guide/binary-input-output) in Snowflake.
* Binary functions used to insert data into a values clause are not supported in Snowflake.

## GEOGRAPHY

GEOGRAPHY data type and usages

### Description

A collection of points, linestrings, and polygons, which is represented as a point set, or a subset of the surface of the Earth. For more information please refer to [BigQuery GEOGRAPHY data type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#geography_type).

> **Success:**
>
> Supported data type in Snowflake.

### Sample Source Patterns

#### GEOGRAPHY output format

The default output format for geography data types in BigQuery is **WKT** **(Well-Known Text)** and in Snowflake **WKB (Well-Known Binary)**. For this reason, when geography columns are selected, the [ST_ASWKT](https://docs.snowflake.com/en/sql-reference/functions/st_aswkt) function is automatically added. In addition, when all the columns of a table are selected and it contains a Geography column, the [GEOGRAPHY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#geography-output-format) is set to WKT. This is in order to keep the default BigQuery format.

##### BigQuery

```sql
CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType VALUES
    (ST_GEOGFROMTEXT('POINT(-122.35 37.55)')), (ST_GEOGFROMTEXT('LINESTRING(-124.20 42.00, -120.01 41.99)'));

SELECT COL1 FROM test.geographyType;
SELECT * FROM test.geographyType;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType
VALUES
    (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'POINT(-122.35 37.55)'), (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'LINESTRING(-124.20 42.00, -120.01 41.99)');

SELECT ST_ASWKT( COL1) FROM test.geographyType;

ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
SELECT * FROM test.geographyType;
```

In case it is not added automatically and you want to see the data in WKT format, you can use the [ST_ASWKT](https://docs.snowflake.com/en/sql-reference/functions/st_aswkt) function or set the [GEOGRAPHY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#geography-output-format) format.

#### Insert GEOGRAPHY data

To insert data in geography type columns, no function is needed, because Snowflake automatically detects that the data follows the [WGS 84 standard](https://spatialreference.org/ref/epsg/wgs-84/).

#### Observations

* Please keep in mind that the default output format for geography data types is **WKT** **(Well-Known Text)** and in Snowflake **WKB (Well-Known Binary)**. You can use the [ST_ASWKT](https://docs.snowflake.com/en/sql-reference/functions/st_aswkt) function or set the [GEOGRAPHY_OUTPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/parameters#geography-output-format) format if you want to view the data in **WKT** format.
* Geography functions used to insert data into a values clause are not needed in Snowflake.

### Related EWIs

1. [SSC-FDM-BQ0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Geography function is not required in Snowflake.

## INTERVAL

Interval data type and usages

### Description

An `INTERVAL` object represents duration or amount of time, without referring to any specific point in time. By default, it is transformed to VARCHAR because Snowflake historically did not support a stored INTERVAL type ([BigQuery Language Reference INTERVAL Data Type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#interval_type)).

> **Note:**
>
> **Preview Feature:** When the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled, BigQuery `INTERVAL` columns are preserved as native Snowflake `INTERVAL DAY TO SECOND` types instead of being converted to VARCHAR. Interval literals and expressions are also normalized to Snowflake-compatible syntax. See the [Interval Data Types](../general/interval-data-types.md) translation reference for complete transformation details.

**Syntax**

```sql
INTERVAL int64_expression datetime_part

INTERVAL datetime_parts_string starting_datetime_part TO ending_datetime_part
```

### Sample Source Patterns

#### Interval with a single DateTime part

##### BigQuery

```sql
SELECT INTERVAL 1 YEAR;

SELECT CURRENT_DATE + INTERVAL 1 YEAR,
  CURRENT_DATE + INTERVAL 1 QUARTER,
  CURRENT_DATE + INTERVAL 1 MONTH,
  CURRENT_DATE + INTERVAL 1 WEEK,
  CURRENT_DATE + INTERVAL 1 DAY,
  CURRENT_DATE + INTERVAL 1 HOUR,
  CURRENT_DATE + INTERVAL 1 MINUTE,
  CURRENT_DATE + INTERVAL 1 SECOND;
```

##### Result

```none
1-0 0 0:0:0
```

```none
2024-10-13T00:00:00
2024-01-13T00:00:00
2023-11-13T00:00:00
2023-10-20T00:00:00
2023-10-14T00:00:00
2023-10-13T01:00:00
2023-10-13T00:01:00
2023-10-13T00:00:01
```

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0107 - INTERVAL LITERAL IS NOT SUPPORTED BY SNOWFLAKE IN THIS SCENARIO  ***/!!! INTERVAL 1 YEAR;

SELECT
CURRENT_DATE() + INTERVAL '1 year',
CURRENT_DATE() + INTERVAL '1 quarter',
CURRENT_DATE() + INTERVAL '1 month',
CURRENT_DATE() + INTERVAL '1 week',
CURRENT_DATE() + INTERVAL '1 day',
CURRENT_DATE() + INTERVAL '1 hour',
CURRENT_DATE() + INTERVAL '1 minute',
CURRENT_DATE() + INTERVAL '1 second';
```

##### Result

```none
2024-10-13
2024-01-13
2023-11-13
2023-10-20
2023-10-14
2023-10-13 01:00:00.000
2023-10-13 00:01:00.000
2023-10-13 00:00:01.000
```

Snowflake does not support the scenario where the **Interval** data type is queried directly, on the contrary when it is used as an operator for a given date its translation is done using an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

#### Interval with a DateTime part range

##### BigQuery

```sql
 SELECT INTERVAL '2-1 10' YEAR TO DAY;

SELECT CURRENT_DATE + INTERVAL '2-11' YEAR TO MONTH,
  CURRENT_DATE + INTERVAL '2-11 28' YEAR TO DAY,
  CURRENT_DATE + INTERVAL '2-11 28 16' YEAR TO HOUR,
  CURRENT_DATE + INTERVAL '2-11 28 16:15' YEAR TO MINUTE,
  CURRENT_DATE + INTERVAL '2-11 28 16:15:14' YEAR TO SECOND,
  CURRENT_DATE + INTERVAL '11 28' MONTH TO DAY,
  CURRENT_DATE + INTERVAL '11 28 16' MONTH TO HOUR,
  CURRENT_DATE + INTERVAL '11 28 16:15' MONTH TO MINUTE,
  CURRENT_DATE + INTERVAL '11 28 16:15:14' MONTH TO SECOND,
  CURRENT_DATE + INTERVAL '28 16' DAY TO HOUR,
  CURRENT_DATE + INTERVAL '28 16:15' DAY TO MINUTE,
  CURRENT_DATE + INTERVAL '28 16:15:14' DAY TO SECOND,
  CURRENT_DATE + INTERVAL '16:15' HOUR TO MINUTE,
  CURRENT_DATE + INTERVAL '16:15:14' HOUR TO SECOND,
  CURRENT_DATE + INTERVAL '15:14' MINUTE TO SECOND;
```

##### Result

```none
2-1 10 0:0:0
```

```none
2026-09-13T00:00:00
2026-10-11T00:00:00
2026-10-11T16:00:00
2026-10-11T16:15:00
2026-10-11T16:15:14
2024-10-11T00:00:00
2024-10-11T16:00:00
2024-10-11T16:15:00
2024-10-11T16:15:14
2023-11-10T16:00:00
2023-11-10T16:15:00
2023-11-10T16:15:14
2023-10-13T16:15:00
2023-10-13T16:15:14
2023-10-13T00:15:14
```

##### Snowflake

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0107 - INTERVAL LITERAL IS NOT SUPPORTED BY SNOWFLAKE IN THIS SCENARIO  ***/!!! INTERVAL '2-1 10' YEAR TO DAY;

SELECT
CURRENT_DATE() + INTERVAL '2y, 11mm',
CURRENT_DATE() + INTERVAL '2y, 11mm, 28d',
CURRENT_DATE() + INTERVAL '2y, 11mm, 28d, 16h',
CURRENT_DATE() + INTERVAL '2y, 11mm, 28d, 16h, 15m',
CURRENT_DATE() + INTERVAL '2y, 11mm, 28d, 16h, 15m, 14s',
CURRENT_DATE() + INTERVAL '11mm, 28d',
CURRENT_DATE() + INTERVAL '11mm, 28d, 16h',
CURRENT_DATE() + INTERVAL '11mm, 28d, 16h, 15m',
CURRENT_DATE() + INTERVAL '11mm, 28d, 16h, 15m, 14s',
CURRENT_DATE() + INTERVAL '28d, 16h',
CURRENT_DATE() + INTERVAL '28d, 16h, 15m',
CURRENT_DATE() + INTERVAL '28d, 16h, 15m, 14s',
CURRENT_DATE() + INTERVAL '16h, 15m',
CURRENT_DATE() + INTERVAL '16h, 15m, 14s',
CURRENT_DATE() + INTERVAL '15m, 14s';
```

##### Result

```none
2026-09-13
2026-10-11
2026-10-11 16:00:00.000
2026-10-11 16:15:00.000
2026-10-11 16:15:14.000
2024-10-11
2024-10-11 16:00:00.000
2024-10-11 16:15:00.000
2024-10-11 16:15:14.000
2023-11-10 16:00:00.000
2023-11-10 16:15:00.000
2023-11-10 16:15:14.000
2023-10-13 16:15:00.000
2023-10-13 16:15:14.000
2023-10-13 00:15:14.000
```

The Interval value is transformed to a supported Snowflake format and then inserted as text inside the column. Since Snowflake does not support **Interval** as a data type, it is only supported in arithmetic operations. In order to use the value, it needs to be extracted and used as an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

#### Interval as a Column data type

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.my_table (
  id INT NOT NULL,
  interval_column INTERVAL
);

INSERT INTO test.my_table
VALUES (1, INTERVAL '2-11 28' YEAR TO DAY);

INSERT INTO test.my_table
VALUES (2, INTERVAL '2-11 28 16:15:14' YEAR TO SECOND);

INSERT INTO test.my_table
VALUES (3, INTERVAL '11 28 16:15:14' MONTH TO SECOND);

INSERT INTO test.my_table
VALUES (4, INTERVAL '15:14' MINUTE TO SECOND);

SELECT * FROM test.my_table;
```

##### Result

| ID | interval_column |
| --- | --- |
| 1 | 2-11 28 0:0:0 |
| 2 | 2-11 28 16:15:14 |
| 3 | 0-11 28 16:15:14 |
| 4 | 0-0 0 0:15:14 |

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.my_table (
  id INT NOT NULL,
interval_column VARCHAR(30) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/01/2025",  "domain": "test" }}';

INSERT INTO test.my_table
VALUES (1, '2y, 11mm, 28d');

INSERT INTO test.my_table
VALUES (2, '2y, 11mm, 28d, 16h, 15m, 14s');

INSERT INTO test.my_table
VALUES (3, '11mm, 28d, 16h, 15m, 14s');

INSERT INTO test.my_table
VALUES (4, '15m, 14s');

SELECT * FROM
test.my_table;
```

##### Result

| ID | interval_column |
| --- | --- |
| 1 | 2y, 11mm, 28d |
| 2 | 2y, 11mm, 28d, 16h, 15m, 14s |
| 3 | 11mm, 28d, 16h, 15m, 14s |
| 4 | 15m, 14s |

In BigQuery the datetime_part follows the next canonical format:

```none
[sign]Y-M [sign]D [sign]H:M:S[.F]
```

#### Interval comparison

##### BigQuery

```sql
SELECT INTERVAL 1 YEAR = INTERVAL 1 YEAR;

SELECT CURRENT_DATE + INTERVAL '-2 -16' DAY TO HOUR =  CURRENT_DATE + INTERVAL '-2 -16' DAY TO HOUR;

SELECT INTERVAL '-2 -16' DAY TO HOUR != INTERVAL '-2 16' DAY TO HOUR,
  INTERVAL '-2 -16' DAY TO HOUR <> INTERVAL '-2 16' DAY TO HOUR,
  INTERVAL '2 16:15' DAY TO MINUTE = INTERVAL '2 -16:15' DAY TO MINUTE,
  INTERVAL '2 16:15' DAY TO MINUTE > INTERVAL '2 -16:15' DAY TO MINUTE,
  INTERVAL '2 16:15' DAY TO MINUTE >= INTERVAL '2 -16:15' DAY TO MINUTE,
  INTERVAL '2 16:15' DAY TO MINUTE < INTERVAL '2 -16:15' DAY TO MINUTE,
  INTERVAL '2 16:15' DAY TO MINUTE <= INTERVAL '2 -16:15' DAY TO MINUTE,
  INTERVAL '1-5' YEAR TO MONTH = INTERVAL '1-5' YEAR TO MONTH,
  INTERVAL '1-5' YEAR TO MONTH > INTERVAL '2 16' DAY TO HOUR,
  INTERVAL '2-11 28 16:15:14.222' YEAR TO SECOND = INTERVAL '2-11 28 16:15:14.222' YEAR TO SECOND,
  INTERVAL '1-1 3' YEAR TO DAY = INTERVAL '13 3' MONTH TO DAY,
  INTERVAL '1-5' YEAR TO MONTH > INTERVAL '2 16' DAY TO HOUR;
```

##### Snowflake

```sql
SELECT
'1 year' = '1 year';

SELECT
CURRENT_DATE() + INTERVAL '-2d, -16h' = CURRENT_DATE() + INTERVAL '-2d, -16h';

SELECT
CURRENT_TIMESTAMP + INTERVAL '-2d, -16h' != CURRENT_TIMESTAMP + INTERVAL '-2d, 16h',
CURRENT_TIMESTAMP + INTERVAL '-2d, -16h' <> CURRENT_TIMESTAMP + INTERVAL '-2d, 16h',
CURRENT_TIMESTAMP + INTERVAL '2d, 16h, 15m' = CURRENT_TIMESTAMP + INTERVAL '2d, -16h, -15m',
CURRENT_TIMESTAMP + INTERVAL '2d, 16h, 15m' > CURRENT_TIMESTAMP + INTERVAL '2d, -16h, -15m',
CURRENT_TIMESTAMP + INTERVAL '2d, 16h, 15m' >= CURRENT_TIMESTAMP + INTERVAL '2d, -16h, -15m',
CURRENT_TIMESTAMP + INTERVAL '2d, 16h, 15m' < CURRENT_TIMESTAMP + INTERVAL '2d, -16h, -15m',
CURRENT_TIMESTAMP + INTERVAL '2d, 16h, 15m' <= CURRENT_TIMESTAMP + INTERVAL '2d, -16h, -15m',
CURRENT_TIMESTAMP + INTERVAL '1y, 5mm' = CURRENT_TIMESTAMP + INTERVAL '1y, 5mm',
CURRENT_TIMESTAMP + INTERVAL '1y, 5mm' > CURRENT_TIMESTAMP + INTERVAL '2d, 16h',
CURRENT_TIMESTAMP + INTERVAL '2y, 11mm, 28d, 16h, 15m, 14s, 222ms' = CURRENT_TIMESTAMP + INTERVAL '2y, 11mm, 28d, 16h, 15m, 14s, 222ms',
CURRENT_TIMESTAMP + INTERVAL '1y, 1mm, 3d' = CURRENT_TIMESTAMP + INTERVAL '13mm, 3d',
CURRENT_TIMESTAMP + INTERVAL '1y, 5mm' > CURRENT_TIMESTAMP + INTERVAL '2d, 16h';
```

As is known, Snowflake only supports Interval as a data type in arithmetic operations, which is why the `CURRENT_TIMESTAMP` function is added to each operand to correctly support the comparison.

### Known Issues

#### 1. Only arithmetic operations are supported

Snowflake Intervals have several limitations. Only arithmetic operations between `DATE` or `TIMESTAMP` and [Interval Constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) are supported, every other scenario is not supported.

##### 2. Working with signs in the Interval data type

In BigQuery, when the substring corresponding to the year-month is preceded by a sign (+ -), it affects both the year and the month. In a similar way, it works for the substring corresponding to the time, in this case, the following affects the hour, minute, and second. An example of this is shown below.

##### BigQuery

```sql
SELECT CURRENT_DATE + INTERVAL '-2-11 -28 -16:15:14.222' YEAR TO SECOND;
```

##### Snowflake

```sql
 SELECT CURRENT_DATE + INTERVAL '-2y, -11mm, -28d, -16h, -15m, -14s, -222ms';
```

### Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0107](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Interval Literal Not Supported In Current Scenario.

## JSON

Json data type and usages

### Description

Represents JSON, a lightweight data-interchange format. For more information please refer to [BigQuery JSON data type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#json_type).

> **Danger:**
>
> JSON data type is not supported in Snowflake, currently transformed to [VARIANT](https://docs.snowflake.com/en/sql-reference/data-types-semistructured#variant).

#### JSON Literals

```sql
 JSON 'json_formatted_data'
```

For more information please refer to [JSON Literals in BigQuery](https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical#json_literals).

These literals are not supported in Snowflake, but instead the [PARSE_JSON](https://docs.snowflake.com/en/sql-reference/functions/parse_json) function can be used to convert the input expression to a json type. The only point to take into consideration is that this function cannot be used in the values clause in Snowflake, for this reason it is transformed to a subquery.

### Sample Source Patterns

#### BigQuery

```sql
CREATE OR REPLACE TABLE test.jsonType
(
  COL1 JSON
);

INSERT INTO test.jsonType
VALUES
  (JSON'{"name": "John", "age": 30, "city": "New York"}'),
  (JSON'{"name": "Alice", "age": 28, "city": "San Francisco"}');

SELECT * FROM test.jsonType;

SELECT JSON'{"name": "John", "age": 30, "city": "New York"}';
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE test.jsonType
(
  COL1 VARIANT
);

INSERT INTO test.jsonType
SELECT
  PARSE_JSON('{"name": "John", "age": 30, "city": "New York"}')
UNION ALL
SELECT
  PARSE_JSON('{"name": "Alice", "age": 28, "city": "San Francisco"}');

SELECT * FROM test.jsonType;

SELECT
  PARSE_JSON('{"name": "John", "age": 30, "city": "New York"}');
```

## STRUCT

Translation specification for the STRUCT datatype from BigQuery to Snowflake.

### Description

In BigQuery, a container of ordered fields each with a type (required) and field name (optional). See [Struct Type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#struct_type).

In Snowflake, [`OBJECT_CONSTRUCT`](https://docs.snowflake.com/en/sql-reference/functions/object_construct) can be used to emulate the `STRUCT` behavior, and SnowConvert AI handles most implementation differences.

> **Note:**
>
> Arguments that represent keys within the OBJECT_CONSTRUCT must be the original names of the target STRUCT. Any name specified within a STRUCT expression body will be replaced with the name found in the target STRUCT. Most of the data pattern examples below contain an example of a name that is replaced by the target name.

### Sample Source Patterns

#### BigQuery

```sql
CREATE OR REPLACE TABLE test.structTypes
(
    COL1 STRUCT<sc1 INT64>,
    COL2 STRUCT<sc2 STRING(10)>,
    COL3 STRUCT<sc3 STRUCT<sc31 INT64, sc32 INT64>>,
    COL4 STRUCT<sc4 ARRAY<INT64>>,
    COL5 STRUCT<sc5 INT64, sc51 INT64>,
    COL7 STRUCT<sc7 INT64 OPTIONS(description = "A repeated STRING field"), sc71 BOOL>,
    COL8 STRUCT<sc8 INT64 NOT NULL, sc81 BOOL NOT NULL OPTIONS(description = "A repeated STRING field")>
);

CREATE OR REPLACE TABLE test.tuple_sample (
  COL1 STRUCT<Key1 INT64, Key2 INT64>
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE test.structTypes
(
    COL1 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<INT> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL2 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<STRING(10)> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL3 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<STRUCT<INT64, INT64>> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL4 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL5 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<INT, INT> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL7 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<INT, BOOLEAN> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL8 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<INT, BOOLEAN> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/
);

CREATE OR REPLACE TABLE test.tuple_sample (
  COL1 VARIANT /*** SSC-FDM-BQ0009 - STRUCT<INT, INT> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/
);
```

#### Insert INT Data Type to STRUCT column

##### BigQuery

```sql
INSERT INTO test.structTypes (COL1) VALUES
(STRUCT(1)),
(STRUCT<INT64>(2)),
(STRUCT<a INT64>(3)),
(STRUCT<sc1 INT64>(4)),
(STRUCT<sc1 INT64>(5));
```

##### Snowflake

```sql
INSERT INTO test.structTypes (COL1)
SELECT
    OBJECT_CONSTRUCT('sc1', 1 :: INT)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc1', 2 :: INT)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc1', 3 :: INT)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc1', 4 :: INT)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc1', 5 :: INT);
```

#### Insert STRING Data Type to STRUCT column

##### BigQuery

```sql
INSERT INTO test.structTypes (COL2) VALUES
(STRUCT('t1')),
(STRUCT<STRING>('t2')),
(STRUCT<sc2 STRING>('t3'));
```

##### Snowflake

```sql
INSERT INTO test.structTypes (COL2)
SELECT
    OBJECT_CONSTRUCT('sc2', 't1' :: STRING)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc2', 't2' :: STRING)
UNION ALL
SELECT
    OBJECT_CONSTRUCT('sc2', 't3' :: STRING);
```

#### Insert STRUCT Data Type to STRUCT column

##### BigQuery

```sql
INSERT INTO test.structTypes (COL3) VALUES
(STRUCT(STRUCT(1,2))),
(STRUCT<sc3 STRUCT<sc31 INT64, sc32 INT64>>(STRUCT<INT64, INT64>(3, 4))),
(STRUCT<sc3 STRUCT<sc31 INT64, sc32 INT64>>(STRUCT<sc31 INT64, sc32 INT64>(5, 6))),
(STRUCT<STRUCT<INT64,INT64>>(STRUCT<INT64, INT64>(7, 8))),
(STRUCT<STRUCT<INT64,INT64>>(STRUCT(9, 10)));
```

##### Snowflake

```sql
INSERT INTO test.structTypes (COL3)
SELECT
  OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 1 :: INT, 'sc32', 2 :: INT))
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 3 :: INT, 'sc32', 4 :: INT))
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 5 :: INT, 'sc32', 6 :: INT))
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 7 :: INT, 'sc32', 8 :: INT))
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 9 :: INT, 'sc32', 10 :: INT));
```

#### Insert ARRAY Data Type to STRUCT column

##### BigQuery

```sql
INSERT INTO test.structTypes (COL4) VALUES
(STRUCT([1,2,3,4])),
(STRUCT<sc4 ARRAY<INT64>>(ARRAY[5,6,7])),
(STRUCT<ARRAY<INT64>>([8,9,10,11]));
```

##### Snowflake

```sql
INSERT INTO test.structTypes (COL4)
SELECT
  OBJECT_CONSTRUCT('sc4', [1,2,3,4] :: ARRAY)
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc4', [5,6,7] :: ARRAY)
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc4', [8,9,10,11] :: ARRAY);
```

#### Insert to selected STRUCT columns

##### BigQuery

```sql
INSERT INTO test.structTypes (COL7, COL8) VALUES
(STRUCT(1,true), STRUCT(2,false)),
(STRUCT<INT64, BOOL>(3, false), STRUCT<INT64, BOOL>(4, false)),
(STRUCT<a INT64, b BOOL>(5, true), STRUCT<a INT64, b BOOL>(6, true));
```

##### Snowflake

```sql
INSERT INTO test.structTypes (COL7, COL8)
SELECT
  OBJECT_CONSTRUCT('sc7', 1 :: INT, 'sc71', true),
  OBJECT_CONSTRUCT('sc8', 2 :: INT, 'sc81', false)
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc7', 3 :: INT, 'sc71', false),
  OBJECT_CONSTRUCT('sc8', 4 :: INT, 'sc81', false)
UNION ALL
SELECT
  OBJECT_CONSTRUCT('sc7', 5 :: INT, 'sc71', true),
  OBJECT_CONSTRUCT('sc8', 6 :: INT, 'sc81', true);
```

#### Insert to STRUCT column tuple syntax

> **Warning:**
>
> Translation of tuple syntax values is currently not supported.

##### BigQuery

```sql
INSERT INTO test.tuple_sample
VALUES
  ((12, 34)),
  ((56, 78)),
  ((9, 99)),
  ((12, 35));
```

##### Snowflake

```sql
INSERT INTO test.tuple_sample
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0012 - SNOWCONVERT AI WAS UNABLE TO GENERATE A CORRECT OBJECT_CONSTRUCT PARAMETER. MISSING SYMBOL INFORMATION. ***/!!!
VALUES
  ((12, 34)),
  ((56, 78)),
  ((9, 99)),
  ((12, 35));
```

#### Update STRUCT column

##### BigQuery

```sql
UPDATE test.structTypes
SET col1 = STRUCT(100 AS number)
WHERE col1.sc1 = 4;
```

##### Snowflake

```sql
UPDATE test.structTypes
    SET col1 = OBJECT_CONSTRUCT('sc1', 100 :: INT)
WHERE col1:sc1 = 4;
```

#### Update STRUCT column field

##### BigQuery

```sql
UPDATE test.structTypes
SET col3 = STRUCT(STRUCT(80,90))
WHERE col3.sc3.sc31 = 20;
```

##### Snowflake

```sql
UPDATE test.structTypes
SET col3 = OBJECT_CONSTRUCT('sc3', OBJECT_CONSTRUCT('sc31', 80 :: INT, 'sc32', 90 :: INT))
WHERE col3:sc3:sc31 = 20;
```

#### Select from STRUCT column

##### BigQuery

```sql
SELECT COL3.sc3 FROM test.structTypes;
SELECT COL3.sc3.sc32 FROM test.structTypes;
SELECT COL4.sc4 FROM test.structTypes WHERE COL4.sc4 IS NOT NULL;
```

##### Snowflake

```sql
SELECT COL3:sc3
FROM
test.structTypes;
SELECT COL3:sc3:sc32
FROM
test.structTypes;
SELECT COL4:sc4
FROM
test.structTypes
WHERE COL4:sc4 IS NOT NULL;
```

#### Select from STRUCT column tuple syntax

##### BigQuery

```sql
SELECT *
FROM test.tuple_sample
WHERE (COL1.Key1, COL1.Key2) IN ((12, 34), (56, 78));

SELECT STRUCT<x ARRAY<INT64>, y INT64>(COL4.sc4, COL1.sc1)
FROM test.structTypes
WHERE COL1.sc1 IS NOT NULL;
```

##### Snowflake

```sql
SELECT *
FROM
test.tuple_sample
WHERE (COL1:Key1, COL1:Key2) IN ((12, 34), (56, 78));

SELECT
OBJECT_CONSTRUCT('x', COL4:sc4 :: ARRAY, 'y', COL1:sc1 :: INT)
FROM
test.structTypes
WHERE COL1:sc1 IS NOT  NULL;
```

#### Create a view using an anonymous STRUCT definition

##### BigQuery

```sql
CREATE OR REPLACE TABLE project-test.mydataset.sourcetable (
  id STRING,
  payload JSON
);

CREATE VIEW project-test.mydataset.myview AS
SELECT
  id,
  STRUCT(
    payload.user_id AS user_id,
    STRUCT(
      JSON_VALUE(payload, '$.details.ip_address') AS ip_address,
      JSON_VALUE(payload, '$.details.item_id') AS item_id,
      SAFE_CAST(JSON_VALUE(payload, '$.details.quantity') AS INT64) AS quantity,
      SAFE_CAST(JSON_VALUE(payload, '$.details.price') AS FLOAT64) AS price,
      JSON_VALUE(payload, '$.details.text') AS text
    ) AS details
  ) AS structured_payload
  FROM project-test.mydataset.sourcetable;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE "project-test".mydataset.sourcetable (
  id STRING,
  payload VARIANT
);

CREATE VIEW "project-test".mydataset.myview
AS
SELECT
  id,
  OBJECT_CONSTRUCT('user_id',
  payload:user_id, 'details', OBJECT_CONSTRUCT('ip_address', JSON_EXTRACT_PATH_TEXT(payload, 'details.ip_address'), 'item_id', JSON_EXTRACT_PATH_TEXT(payload, 'details.item_id'), 'quantity', TRY_CAST(JSON_EXTRACT_PATH_TEXT(payload, 'details.quantity') AS INT), 'price', TRY_CAST(JSON_EXTRACT_PATH_TEXT(payload, 'details.price') AS FLOAT), 'text', JSON_EXTRACT_PATH_TEXT(payload, 'details.text'))) AS structured_payload
  FROM
  "project-test".mydataset.sourcetable;
```

#### STRUCT column comparison expressions

BigQuery comparison operations for Structs compare value to value, ignoring the key if it exists, while Snowflake comparison operations for Objects compare both, value and key. This may cause that some comparisons return a different result.

##### BigQuery

```sql
SELECT * FROM test.structTypes WHERE COL1 NOT IN (COL2);
SELECT * FROM test.structTypes WHERE COL1 <> (COL2);
SELECT * FROM test.structTypes WHERE COL1 != (COL2);
```

##### Snowflake

```sql
SELECT * FROM
test.structTypes
--** SSC-FDM-BQ0008 - WHERE CLAUSE REFERENCES A COLUMN OF STRUCT TYPE. COMPARISON OPERATIONS MAY PRODUCE DIFFERENT RESULTS IN SNOWFLAKE. **
WHERE COL1 NOT IN (COL2);
SELECT * FROM
test.structTypes
--** SSC-FDM-BQ0008 - WHERE CLAUSE REFERENCES A COLUMN OF STRUCT TYPE. COMPARISON OPERATIONS MAY PRODUCE DIFFERENT RESULTS IN SNOWFLAKE. **
WHERE COL1 <> (COL2);
SELECT * FROM
test.structTypes
--** SSC-FDM-BQ0008 - WHERE CLAUSE REFERENCES A COLUMN OF STRUCT TYPE. COMPARISON OPERATIONS MAY PRODUCE DIFFERENT RESULTS IN SNOWFLAKE. **
WHERE COL1 != (COL2);
```

### Related EWIs

1. [SSC-FDM-BQ0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Struct converted to VARIANT. Some of its usages might have functional differences.
2. [SSC-EWI-BQ0012](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md): SnowConvert AI was unable to generate a correct OBJECT_CONSTRUCT parameter. Missing symbol information.
3. [SSC-FDM-BQ0008](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md): Where clause references a column of STRUCT type.

## TIMESTAMP

Timestamp data type and usages

### Description

> A timestamp value represents an absolute point in time, independent of any time zone or convention such as daylight saving time (DST), with microsecond precision. For more information please refer to [BigQuery Timestamp data type](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#timestamp_type).

### Grammar syntax

| Name | Range |
| --- | --- |
| TIMESTAMP | 0001-01-01 00:00:00 to 9999-12-31 23:59:59.999999 UTC |

> **Success:**
>
> TIMESTAMP data type currently transformed to [TIMESTAMP_TZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp-ltz-timestamp-ntz-timestamp-tz).

It is important to remark that BigQuery stores TIMESTAMP data in Coordinated Universal Time (UTC).

### Sample Source Patterns

#### TIMESTAMP without time

##### BigQuery

```sql
 CREATE OR REPLACE TABLE timestampTable
(
  COL1 TIMESTAMP
);

INSERT INTO timestampTable VALUES ('2008-12-26 15:30:00');
INSERT INTO timestampTable VALUES (TIMESTAMP'2008-12-27 18:30:00');
SELECT * FROM timestampTable;
```

##### Result

```none
2008-12-26 15:30:00 UTC
2008-12-27 18:30:00 UTC
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE timestampTable
(
  COL1 TIMESTAMP_TZ
);

INSERT INTO timestampTable VALUES ('2008-12-26 15:30:00');
INSERT INTO timestampTable VALUES (TIMESTAMP'2008-12-27 18:30:00');
SELECT * FROM timestampTable;
```

##### Result

```none
2008-12-26 15:30:00.000 -0800
2008-12-27 18:30:00.000 -0800
```

#### TIMESTAMP with time zone

When the time zone is defined you need to use the [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) function to store the data in Coordinated Universal Time (UTC). Also the timezone name inside the timestamp literal is not supported by Snowflake, in that case it is necessary to use this function as well.

##### BigQuery

```sql
CREATE OR REPLACE TABLE test.timestampType
(
  COL1 TIMESTAMP
);

INSERT INTO test.timestampType VALUES ('2008-12-25 15:30:00 America/Chicago');
INSERT INTO test.timestampType VALUES ('2018-04-05 12:00:00+02:00');
INSERT INTO test.timestampType VALUES ('2008-12-26 15:30:00-08:00');
INSERT INTO test.timestampType VALUES (TIMESTAMP'2022-12-25 15:30:00 America/North_Dakota/New_Salem');
INSERT INTO test.timestampType VALUES (TIMESTAMP'2022-04-05 12:00:00+02:00');
INSERT INTO test.timestampType VALUES (TIMESTAMP'2022-12-26 15:30:00-08:00');
SELECT * FROM test.timestampType ORDER BY COL1;
```

##### Result

```sql
2008-12-25 21:30:00 UTC
2008-12-26 23:30:00 UTC
2018-04-05 10:00:00 UTC
2022-04-05 10:00:00 UTC
2022-12-25 21:30:00 UTC
2022-12-26 23:30:00 UTC
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE test.timestampType
(
  COL1 TIMESTAMP_TZ
);

INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('America/Chicago', 'UTC', '2008-12-25 15:30:00'));
INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('UTC','2018-04-05 12:00:00+02:00'));
INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('UTC','2008-12-26 15:30:00-08:00'));

INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('America/North_Dakota/New_Salem', 'UTC', '2022-12-25 15:30:00'));
INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('UTC', '2022-04-05 12:00:00+02:00'));
INSERT INTO test.timestampType
VALUES (CONVERT_TIMEZONE('UTC', '2022-12-26 15:30:00-08:00'));
SELECT * FROM test.timestampType ORDER BY COL1;
```

##### Result

```sql
 2008-12-25 21:30:00.000 -0800
2008-12-26 23:30:00.000 +0000
2018-04-05 10:00:00.000 +0000
2022-04-05 10:00:00.000 +0000
2022-12-25 21:30:00.000 -0800
2022-12-26 23:30:00.000 +0000
```

---
title: SnowConvert AI - BigQuery - Identifier differences between BigQuery and Snowflake
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-identifiers.md
section: Migrations
---

# SnowConvert AI - BigQuery - Identifier differences between BigQuery and Snowflake

## Quoted identifiers

BigQuery quoted identifiers are enclosed by backticks (`) while Snowflake encloses them in double quotes (“).

In BigQuery, quoted identifiers stick to the [case sensitivity rules](https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical#case_sensitivity), which means that, for example, column names are still case insensitive even when quoted:

### BigQuery

```sql
CREATE TABLE test.quotedIdentTable
(
  `col#1` INTEGER
);

SELECT `col#1` FROM test.quotedIdentTable;

SELECT `COL#1` FROM test.quotedIdentTable;
```

In Snowflake, case sensitivity of quoted identifiers depends on the session parameter [QUOTED_IDENTIFIERS_IGNORE_CASE](https://docs.snowflake.com/en/sql-reference/parameters#quoted-identifiers-ignore-case), by default quoted identifiers comparison is case sensitive, this means that the result code from migrating the above example:

### Snowflake

```sql
CREATE TABLE test.quotedIdentTable
(
  "col#1" INTEGER
);

SELECT
  "col#1"
FROM
  test.quotedIdentTable;

SELECT
  "COL#1"
FROM
  test.quotedIdentTable;
```

Will fail when executing the second select unless the session parameter is set to TRUE.

## How SnowConvert AI migrates quoted identifiers

SnowConvert AI will analyze quoted identifiers to determine if they contain non-alphanumeric characters or are reserved words in Snowflake, if they do then it will transform them to quoted identifiers in Snowflake, alphanumeric identifiers will be left unquoted:

### BigQuery

```sql
CREATE TABLE `test.identsTable1`
(
  `col#1` INTEGER,
  `col2` INTEGER
);

-- Group is a reserved word
SELECT
`col#1` AS `group`,
`col2`AS `hello`
FROM
`test.identsTable1`;
```

### Snowflake

```sql
CREATE TABLE test.identsTable1
(
  "col#1" INTEGER,
  col2 INTEGER
);

-- Group is a reserved word
SELECT
  "col#1" AS "group",
  col2 AS hello
FROM
  test.identsTable1;
```

## Known issues

By default, BigQuery considers table and dataset names as case sensitive, unless the [is_case_insensitive](https://cloud.google.com/bigquery/docs/reference/standard-sql/data-definition-language#schema_option_list) option is activated for the dataset, this allows the following tables to coexist without problems:

### BigQuery

```sql
CREATE TABLE test.myTable
(
  col1 INTEGER
);

CREATE TABLE test.MyTable
(
  col1 INTEGER
);
```

However, unquoted identifiers in Snowflake are [always stored and compared in uppercase](https://docs.snowflake.com/en/sql-reference/identifiers-syntax), meaning that `test.MyTable` will raise a duplicated object error when trying to create it. SnowConvert AI also works under the assumption that identifiers are case insensitive, so when one of these scenarios appears during transformation, SSC-FDM-0019 will be generated to warn the user:

### Snowflake

```sql
CREATE TABLE test.myTable
(
  col1 INTEGER
);

--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR test.MyTable. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE TABLE test.MyTable
(
  col1 INTEGER
);
```

## Related EWIs

1. [SSC-FDM-0019](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Semantic information could not be loaded

---
title: SnowConvert AI - BigQuery - Operators
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/bigquery/bigquery-operators.md
section: Migrations
---

# SnowConvert AI - BigQuery - Operators

## IS operators

IS operators return `TRUE` or `FALSE` for the condition they are testing. They never return `NULL`, even for `NULL` inputs. ([BigQuery SQL Language Reference IS operators](https://cloud.google.com/bigquery/docs/reference/standard-sql/operators?hl=en#is_operators))

| BigQuery | Snowflake |
| --- | --- |
| `X IS TRUE` | `NVL(X, FALSE)` |
| `X IS NOT TRUE` | `NVL(NOT X, TRUE)` |
| `X IS FALSE` | `NVL(NOT X, FALSE)` |
| `X IS NOT FALSE` | `NVL(X, TRUE)` |
| `X IS NULL` | `X IS NULL` |
| `X IS NOT NULL` | `X IS NOT NULL` |
| `X IS UNKNOWN` | `X IS NULL` |
| `X IS NOT UNKNOWN` | `X IS NOT NULL` |

## UNNEST operator

The UNNEST operator takes an array and returns a table with one row for each element in the array. ([BigQuery SQL Language Reference UNNEST operator](https://cloud.google.com/bigquery/docs/reference/standard-sql/query-syntax#unnest_operator)).

This operator will be emulated using the [FLATTEN](../../../../sql-reference/functions/flatten.md) function, the `VALUE` and `INDEX` columns returned by the function will be renamed accordingly to match the UNNEST operator aliases

| BigQuery | Snowflake |
| --- | --- |
| `UNNEST(arrayExpr)` | `FLATTEN(INPUT => arrayExpr) AS F0_(SEQ, KEY, PATH, INDEX, F0_, THIS)` |
| `UNNEST(arrayExpr) AS alias` | `FLATTEN(INPUT => arrayExpr) AS alias(SEQ, KEY, PATH, INDEX, alias, THIS)` |
| `UNNEST(arrayExpr) AS alias WITH OFFSET` | `FLATTEN(INPUT => arrayExpr) AS alias(SEQ, KEY, PATH, OFFSET, alias, THIS)` |
| `UNNEST(arrayExpr) AS alias WITH OFFSET AS offsetAlias` | `FLATTEN(INPUT => arrayExpr) AS alias(SEQ, KEY, PATH, offsetAlias, alias, THIS)` |

### SELECT \* with UNNEST

When the UNNEST operator is used inside a SELECT \* statement the `EXCLUDE` keyword will be used to remove the unnecessary FLATTEN columns.

Input:

```sql
SELECT * FROM UNNEST ([10,20,30]) AS numbers WITH OFFSET position;
```

Generated code:

```sql
 SELECT
* EXCLUDE(SEQ, KEY, PATH, THIS)
FROM
TABLE(FLATTEN(INPUT => [10,20,30])) AS numbers (
SEQ,
KEY,
PATH,
position,
numbers,
THIS
);
```

---
title: SnowConvert AI - BigQuery Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/bigqueryFDM.md
section: Migrations
---

# SnowConvert AI - BigQuery Functional Differences

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Google BigQuery currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

## SSC-FDM-BQ0001

Accessing arrays produces NULL instead of an error for positive out of bounds indexes in Snowflake.

### Description

When accessing an ARRAY object by index in Snowflake, specifying an index greater than the size of the array will result in a NULL value, this differs with the behavior of BigQuery, where accessing an ARRAY with an index that is out of bounds will produce an error, unless the functions `SAFE_OFFSET` or `SAFE_ORDINAL` are used.

This FDM is added to any ARRAY access that is not safe.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT ([40, 12, 30])[8];

SELECT ([40, 12, 30])[SAFE_OFFSET(8)];
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
--** SSC-FDM-BQ0001 - ACCESSING ARRAYS PRODUCES NULL INSTEAD OF AN ERROR FOR POSITIVE OUT OF BOUNDS INDEXES IN SNOWFLAKE **
([40, 12, 30])[8];

SELECT
PUBLIC.SAFE_OFFSET_UDF( ([40, 12, 30]), 8);
```

#### Best Practices

* Analyze the uses of array access in the code. If there was never the risk of getting an out of bounds error in the original code, no difference will be observed and this FDM can be safely ignored.
* If the original code relies on out-of-bounds access raising an error (e.g., for flow control), add explicit bounds checking in Snowflake using `ARRAY_SIZE` before accessing the array.

## SSC-FDM-BQ0002

Exception system variables are not supported in Snowflake.

### Description

BigQuery’s [exception system variables](https://cloud.google.com/bigquery/docs/reference/standard-sql/procedural-language#beginexceptionend) (`@@error.message`, `@@error.stack_trace`, `@@error.statement_text`, `@@error.formatted_stack_trace`) have no direct equivalent in Snowflake. SnowConvert AI replaces exception variable references with `OBJECT_CONSTRUCT('SQLERRM', SQLERRM, 'SQLCODE', SQLCODE, 'SQLSTATE', SQLSTATE)` as a workaround. This workaround provides basic error information but does not include stack trace or statement text details available in BigQuery. For more information, see [Handling Exceptions in Snowflake](../../../../../../developer-guide/snowflake-scripting/exceptions.md).

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE PROCEDURE test.proc1()
BEGIN
  SELECT 1/0;
EXCEPTION WHEN ERROR THEN
  SELECT
    @@error.message as message,
    @@error.stack_trace as stack_trace,
    @@error.statement_text as statement_text,
    @@error.formatted_stack_trace as formatted_stack_trace;
END;
```

##### Result

```json
 [{
  "message": "Query error: division by zero: 1 / 0 at [snowflake-snowconvert-team.test.proc1:2:3]",
  "stack_trace": [{
    "line": "2",
    "column": "3",
    "filename": null,
    "location": "snowflake-snowconvert-team.test.proc1"
  }, {
    "line": "1",
    "column": "1",
    "filename": null,
    "location": null
  }],
  "statement_text": "SELECT 1/0",
  "formatted_stack_trace": "At snowflake-snowconvert-team.test.proc1[2:3]\nAt [1:1]\n"
}]
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE test.proc1 ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}'
AS
$$
    BEGIN
    SELECT 1/0;
  EXCEPTION WHEN OTHER THEN
--      --** SSC-FDM-BQ0002 - EXCEPTION SYSTEM VARIABLES ARE NOT SUPPORTED IN SNOWFLAKE. **
--    SELECT
--      @@error.message as message,
--      @@error.stack_trace as stack_trace,
--      @@error.statement_text as statement_text,
--      @@error.formatted_stack_trace as formatted_stack_trace;
      RETURN OBJECT_CONSTRUCT('SQLERRM', SQLERRM, 'SQLCODE', SQLCODE, 'SQLSTATE', SQLSTATE);
    END;
$$;
```

##### Result

```json
 {
  "SQLCODE": 100051,
  "SQLERRM": "Division by zero",
  "SQLSTATE": "22012"
}
```

#### Best Practices

* Snowflake provides three built-in exception variables as an alternative to BigQuery’s `@@error` system variables:

  | BigQuery Variable | Snowflake Equivalent | Notes |
  | --- | --- | --- |
  | `@@error.message` | `SQLERRM` | Error message text |
  | `@@error.statement_text` | N/A | No direct equivalent in Snowflake |
  | `@@error.stack_trace` | N/A | No direct equivalent in Snowflake |
  | `@@error.formatted_stack_trace` | N/A | No direct equivalent in Snowflake |
  | N/A | `SQLSTATE` | 5-character ANSI SQL state code |
  | N/A | `SQLCODE` | 5-digit signed integer error code |
* Review the generated `OBJECT_CONSTRUCT('SQLERRM', SQLERRM, 'SQLCODE', SQLCODE, 'SQLSTATE', SQLSTATE)` workaround and adjust it based on your specific error-handling requirements.
* For more information, see [Handling Exceptions in Snowflake](../../../../../../developer-guide/snowflake-scripting/exceptions.md).

## SSC-FDM-BQ0003

Unable to generate correct return table clause due to missing dependent object information.

> **Note:**
>
> This issue is deprecated and no longer generated by SnowConvert AI. Check [SSC-EWI-BQ0009](../conversion-issues/bigqueryEWI.md) for the issue now generated for this scenario

### Description

Snowflake requires a valid RETURNS TABLE clause for CREATE TABLE FUNCTION statements.

If the original BigQuery source code does not have a RETURNS TABLE clause, SnowConvert AI must build one. To do this, an analysis is made to the CREATE TABLE FUNCTION query to properly infer the types of the columns of the resulting table. When SnowConvert AI cannot gather the required information, this EWI is added.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE TABLE FUNCTION function_name_noreturns_asterisk_join (parameter_name INTEGER)
AS
  SELECT *
  FROM unknownTable1 t1
  JOIN unknownTable2 t2 ON t1.col1 = t2.fk_col1;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "unknownTable1", "unknownTable2" **

CREATE OR REPLACE FUNCTION function_name_noreturns_asterisk_join (parameter_name INTEGER)
----** SSC-FDM-BQ0003 - UNABLE TO GENERATE CORRECT RETURNS TABLE CLAUSE DUE TO MISSING DEPENDENT OBJECT INFORMATION. **
--RETURNS TABLE (
--)
AS
    $$
      SELECT *
      FROM
      unknownTable1 t1
      JOIN
          unknownTable2 t2 ON t1.col1 = t2.fk_col1
    $$;
```

#### Best Practices

* Always try to include any dependent object definitions in the input code, so that SnowConvert AI has access to important information.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-BQ0004

The INFER_SCHEMA function requires a file path without wildcards to generate the table template, replace the FILE_PATH placeholder with it

> **Warning:**
>
> This FDM is deprecated; please refer to [SSC-FDM-0035](generalFDM.md) for the latest version of this FDM.

### Description

The [INFER_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) function is used in Snowflake to generate the columns definition of a table based on the structure of a file, it requires a LOCATION parameter that specifies the path to a file or folder that will be used to construct the table columns, however, this path does not support regex, meaning that the wildcard `*` character is not supported.

When the table has no columns, SnowConvert AI will check all URIS to find one that does not use wildcards and use it in the INFER_SCHEMA function. When no URI meets such criteria, this FDM and a FILE_PATH placeholder is generated, and the placeholder has to be replaced with the path of one of the files referenced by the external table to generate the table columns.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2
OPTIONS(
  FORMAT='JSON',
  URIS=['gs://sc_external_table_bucket/folder_with_json/*']
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2 USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-BQ0004 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_json/.*'
FILE_FORMAT = (TYPE = JSON);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-BQ0005

Parsing the CSV header is not supported in external tables, columns must be renamed to match the original names

### Description

Snowflake external tables do not support parsing the header of CSV files. SKIP_HEADER is used as a workaround to avoid runtime errors, but the resulting table column names will have auto-generated names (`c1`, `c2`, …, `cN`) instead of the original header names.

When SnowConvert AI detects an external table with CSV file format and no explicit column list, it adds the `SKIP_HEADER = 1` file format option. The columns must be manually renamed to match the original names from the CSV header.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_csv
OPTIONS(
  FORMAT='CSV',
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Employees.csv']
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_CSV_FORMAT
TYPE = CSV
SKIP_HEADER = 1;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_csv
--** SSC-FDM-BQ0005 - PARSING THE CSV HEADER IS NOT SUPPORTED IN EXTERNAL TABLES, COLUMNS MUST BE RENAMED TO MATCH THE ORIGINAL NAMES **
USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/folder_with_csv/Employees.csv', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_CSV_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Employees.csv'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1);
```

#### Best Practices

* Rename the auto-generated column names (`c1`, `c2`, …, `cN`) back to the original column names from the CSV file header.
* If the original column names are known, use `ALTER TABLE ... RENAME COLUMN` or recreate the external table with explicit column definitions.
* For non-external-table loading scenarios, consider using `MATCH_BY_COLUMN_NAME` with `PARSE_HEADER = TRUE` in the file format to automatically match columns by header names.

## SSC-FDM-BQ0006

Reading from Google Drive is not supported in Snowflake, upload the files to the external location and replace the FILE_PATH placeholders

### Description

Snowflake does not support reading data from files hosted in Google Drive, this FDM is generated to notify it, please upload the Google Drive files to the external location so they can be accessed through the external stage.

The PATTERN clause will hold autogenerated placeholders FILE_PATH0, FILE_PATH1, …, FILE_PATHN that should be replaced with the file/folder path after the files were moved to the external location.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_drive_test
OPTIONS(
  FORMAT='JSON',
  URIS=['https://drive.google.com/open?id=someFileId']
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_DRIVE_TEST_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_drive_test USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-0035 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_DRIVE_TEST_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS AN EXTERNAL LOCATION, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
--** SSC-FDM-BQ0006 - READING FROM GOOGLE DRIVE IS NOT SUPPORTED IN SNOWFLAKE, UPLOAD THE FILES TO THE EXTERNAL LOCATION AND REPLACE THE FILE_PATH PLACEHOLDERS **
PATTERN = 'FILE_PATH0'
FILE_FORMAT = (TYPE = JSON);
```

#### Best Practices

* Download the files from Google Drive and upload them to a cloud storage location accessible by Snowflake (e.g., Amazon S3, Azure Blob Storage, or Google Cloud Storage).
* Create or configure an external stage in Snowflake pointing to the cloud storage location.
* Replace the `FILE_PATH` placeholders in the `PATTERN` clause with the actual file or folder paths relative to the external stage.

## SSC-FDM-BQ0007

The GOOGLE_SHEETS format is not supported in Snowflake. CSV file type is used as a workaround.

### Description

The GOOGLE_SHEETS format is not supported in Snowflake. CSV file type is used as a workaround because the structure of Google Sheets data is similar to CSV.

When SnowConvert AI detects an external table using the GOOGLE_SHEETS format, it produces an external table with the CSV file format instead. The resulting table expects a CSV file rather than a Google Sheets source.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.spreadsheetTable
(
  Name STRING,
  Code INTEGER,
  Price INTEGER,
  Expiration_date DATE
)
OPTIONS(
  format="GOOGLE_SHEETS",
  skip_leading_rows = 1,
  uris=['https://docs.google.com/spreadsheets/d/someFileId/edit?usp=sharing']
);
```

##### Generated Code:

##### Snowflake

```sql
--** SSC-FDM-BQ0007 - THE GOOGLE_SHEETS FORMAT IS NOT SUPPORTED IN SNOWFLAKE. CSV FILE TYPE IS USED AS A WORKAROUND. **
CREATE OR REPLACE EXTERNAL TABLE test.spreadsheetTable
(
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c1') AS STRING),
  Code INTEGER AS CAST(GET_IGNORE_CASE($1, 'c2') AS INTEGER),
  Price INTEGER AS CAST(GET_IGNORE_CASE($1, 'c3') AS INTEGER),
  Expiration_date DATE AS CAST(GET_IGNORE_CASE($1, 'c4') AS DATE)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS AN EXTERNAL LOCATION, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
--** SSC-FDM-BQ0006 - READING FROM GOOGLE DRIVE IS NOT SUPPORTED IN SNOWFLAKE, UPLOAD THE FILES TO THE EXTERNAL LOCATION AND REPLACE THE FILE_PATH PLACEHOLDERS **
PATTERN = 'FILE_PATH0'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER = 1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* Export the Google Sheets data as CSV files and upload them to a cloud storage location accessible by Snowflake.
* Verify that the CSV export preserves the expected data types and formatting, especially for dates, numbers, and text fields with commas.
* If the external table also references Google Drive URIs, see SSC-FDM-BQ0006 for instructions on migrating the files to an external stage.

## SSC-FDM-BQ0008

Where clause references a column of STRUCT type. Comparison operations may produce different results in Snowflake.

### Description

BigQuery STRUCT types have no direct equivalent in Snowflake. VARIANT is used as a workaround (see [SSC-FDM-0034](generalFDM.md)). When a comparison involves a Snowflake VARIANT created from a BigQuery STRUCT, the results may differ because Snowflake compares both keys and values, whereas BigQuery compares only values regardless of field names.

This FDM is added when a WHERE clause comparison involves a column of STRUCT type that was converted to VARIANT.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.compExprTable
(
  COL1 STRUCT<sc1 INT64>,
  COL2 STRUCT<sc2 INT64>
);

SELECT * FROM test.compExprTable WHERE COL1 <> (COL2);
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.compExprTable
(
  COL1 VARIANT /*** SSC-FDM-0034 - STRUCT<INT64> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
  COL2 VARIANT /*** SSC-FDM-0034 - STRUCT<INT64> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}';

SELECT * FROM
  test.compExprTable
--** SSC-FDM-BQ0008 - WHERE CLAUSE REFERENCES A COLUMN OF STRUCT TYPE. COMPARISON OPERATIONS MAY PRODUCE DIFFERENT RESULTS IN SNOWFLAKE. **
WHERE COL1 <> (COL2);
```

#### Best Practices

* Review WHERE clause comparisons involving STRUCT-derived VARIANT columns. If the original BigQuery query compared STRUCTs by value only, extract and compare individual fields explicitly in Snowflake.
* For example, replace `WHERE col1 <> col2` with `WHERE col1:sc1 <> col2:sc2` to compare specific field values instead of the entire VARIANT object.
* For more information on VARIANT comparison behavior, see the [Snowflake VARIANT documentation](https://docs.snowflake.com/en/sql-reference/data-types-semistructured).

## SSC-FDM-BQ0010

Geography function is not required in Snowflake.

### Description

Snowflake automatically detects GEOGRAPHY data from [WGS 84](https://spatialreference.org/ref/epsg/wgs-84/) formatted strings (WKT, WKB, GeoJSON), so explicit geography conversion functions like `ST_GEOGFROMTEXT` are not required in VALUES clause inserts. SnowConvert AI removes the function call and passes the string literal directly. This FDM is added to notify that the geography function was removed.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType VALUES
(ST_GEOGFROMTEXT('POINT(-122.35 37.55)')),
(ST_GEOGFROMTEXT('LINESTRING(-124.20 42.00, -120.01 41.99)'));

SELECT * FROM test.geographyType;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.geographyType
(
  COL1 GEOGRAPHY
);

INSERT INTO test.geographyType
VALUES
    (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'POINT(-122.35 37.55)'), (
     --** SSC-FDM-BQ0010 - THE FUNCTION 'ST_GEOGFROMTEXT' IS NOT REQUIRED IN SNOWFLAKE. **
     'LINESTRING(-124.20 42.00, -120.01 41.99)');

ALTER SESSION SET GEOGRAPHY_OUTPUT_FORMAT = 'WKT';
SELECT * FROM
test.geographyType;
```

#### Best Practices

* This FDM can be safely ignored in most cases. Snowflake natively supports GEOGRAPHY data from WKT, WKB, and GeoJSON string formats without requiring explicit conversion functions.
* If the removed function performed validation or transformation beyond simple type casting, verify that the inserted data is valid GEOGRAPHY data in Snowflake.
* For more information, see the [Snowflake GEOGRAPHY data type documentation](https://docs.snowflake.com/en/sql-reference/data-types-geospatial).

## SSC-FDM-BQ0011

Named parameters in this script were transformed to Snowflake CLI variables.

### Description

BigQuery supports named parameters using the `@parameter_name` syntax in queries. SnowConvert AI transforms these named parameters to Snowflake CLI variables using the `<% parameter_name %>` syntax.

To execute the transformed `.sql` scripts containing named parameters, use Snowflake CLI with variable substitution.

For more information on how to set up and use Snowflake CLI, see [What is Snowflake CLI?](../../../../../../developer-guide/snowflake-cli/index.md)

#### Code Example

##### Input Code:

##### BigQuery

```sql
SELECT column1 FROM test.parametersExample WHERE column2 = @searchValue;
```

##### Example execution (using the bq query command)

```bash
bq query \
  --use_legacy_sql=false \
  --parameter=searchValue:Int64:80 \
  'SELECT column1 FROM test.parametersExample WHERE column2 = @searchValue'
```

##### Output Code:

##### Snowflake

```sql
--** SSC-FDM-BQ0011 - NAMED PARAMETERS IN THIS SCRIPT WERE TRANSFORMED TO SNOWFLAKE CLI VARIABLES. **
SELECT column1 FROM
test.parametersExample
WHERE column2 = <% searchValue %>;
```

##### Example execution (Snowflake CLI)

```bash
snow sql -f output_file_path -D "searchValue=80"
```

### Best Practices

* Install and configure [Snowflake CLI](../../../../../../developer-guide/snowflake-cli/index.md) to execute the transformed scripts with variable substitution using the `-D` flag (e.g., `snow sql -f script.sql -D "param=value"`).
* Review each transformed `<% parameter_name %>` variable to ensure the parameter name and intended value match the original BigQuery `@parameter_name` usage.
* If the transformed script will be executed outside of Snowflake CLI (e.g., in a Snowflake worksheet), replace `<% parameter_name %>` variables with literal values or session variables as appropriate.

## SSC-FDM-BQ0012

Select \* with multiple UNNEST operators will produce column ambiguity in Snowflake

### Description

As part of the SnowConvert transformation for the UNNEST operator, the [FLATTEN](../../../../../../sql-reference/functions/flatten.md) function is used, this function generates multiple columns not required to emulate the UNNEST operator functionality like the `THIS` or `PATH` columns.

When a SELECT \* with the UNNEST operator is found, SnowConvert will remove the unnecessary columns using the `EXCLUDE` keyword, however, when multiple UNNEST operators are used in the same statement, the columns can not be removed due to ambiguity problems, this FDM will be generated to mark these cases.

It is recommended to expand the SELECT expression list in order to specify only the expected columns and solve this issue.

#### Code Example

##### Input Code:

##### BigQuery

```sql
SELECT * FROM UNNEST ([10,20,30]);

SELECT * FROM UNNEST ([10,20,30]) AS numbers, UNNEST(['Hi', 'Hello', 'Bye']) AS words;
```

##### Generated Code:

##### Snowflake

```sql
SELECT
* EXCLUDE(SEQ, KEY, PATH, THIS, INDEX)
FROM
TABLE(FLATTEN(INPUT => [10,20,30])) AS F0_ (
SEQ,
KEY,
PATH,
INDEX,
F0_,
THIS
);

SELECT
--** SSC-FDM-BQ0012 - SELECT * WITH MULTIPLE UNNEST OPERATORS WILL RESULT IN COLUMN AMBIGUITY IN SNOWFLAKE **
 * FROM
TABLE(FLATTEN(INPUT => [10,20,30])) AS numbers (
SEQ,
KEY,
PATH,
INDEX,
numbers,
THIS
),
TABLE(FLATTEN(INPUT => ['Hi', 'Hello', 'Bye'])) AS words (
SEQ,
KEY,
PATH,
INDEX,
words,
THIS
);
```

#### Recommendations

1. **Expand the SELECT list:** Replace `SELECT *` with an explicit column list specifying only the columns you need from each UNNEST/FLATTEN result. This eliminates the ambiguity caused by duplicate metadata columns.
2. **Use table aliases:** Qualify each column reference with the corresponding table alias to avoid ambiguity between the FLATTEN results.

---
title: SnowConvert AI - BigQuery Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/bigqueryEWI.md
section: Migrations
---

# SnowConvert AI - BigQuery Issues

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Google BigQuery currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Google BigQuery grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

## SSC-EWI-BQ0001

Snowflake does not support the options clause.

> **Warning:**
>
> This EWI is deprecated; please refer to [SSC-EWI-0016](generalEWI.md) for the latest version of this EWI.

### Severity

Medium

#### Description

This EWI is added to DDL statements when the `OPTIONS` has unsupported options by Snowflake.

#### Code Example

**Input Code:**

##### BigQuery

```sql
 CREATE VIEW my_view
OPTIONS (
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
) AS
SELECT column1, column2
FROM my_table;
```

**Output Code:**

##### Snowflake

```sql
 CREATE VIEW my_view
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0001 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: EXPIRATION_TIMESTAMP, PRIVACY_POLICY ***/!!!
OPTIONS(
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "03/26/2025",  "domain": "test" }}'
AS
SELECT column1, column2
FROM
  my_table;
```

##### Recommendations

* Add manual changes to the not-transformed expression.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0002

Micro-partitioning is automatically performed on all Snowflake tables.

> **Note:**
>
> This issue is deprecated and no longer generated by SnowConvert AI

### Severity

Medium

#### Description

This warning is added to the Create table when the partition by clause is present. `PARTITION BY` is an optional clause that controls [table partitioning](https://cloud.google.com/bigquery/docs/partitioned-tables) but is not supported in Snowflake.

All data in Snowflake tables is automatically divided into micro-partitions, which are contiguous units of storage. Each micro-partition contains between 50 MB and 500 MB of uncompressed data. This size and structure allows for extremely granular pruning of very large tables, which can be comprised of millions, or even hundreds of millions, of micro-partitions.

Snowflake stores metadata about all rows stored in a micro-partition, including:

* The range of values for each of the columns in the micro-partition.
* The number of distinct values.
* Additional properties used for both optimization and efficient query processing.

Also the tables are transparently partitioned using the ordering of the data as it is inserted/loaded. For more information please refer to [Benefits of Micro-partitioning](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions#benefits-of-micro-partitioninghttps://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions#benefits-of-micro-partitioning).

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE TABLE table1(
    transaction_id INT,
    transaction_date DATE
)
PARTITION BY transaction_date;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    transaction_id INT,
  transaction_date DATE
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0002 - MICRO-PARTITIONING IS AUTOMATICALLY PERFORMED ON ALL SNOWFLAKE TABLES. ***/!!!
PARTITION BY transaction_date
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/09/2025",  "domain": "test" }}';
```

#### Recommendations

* No additional user actions are required, it is just informative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0003

Pending SnowConvert AI translation for differential privacy.

### Severity

Medium

#### Description

BigQuery allows applying [differential privacy](https://cloud.google.com/bigquery/docs/differential-privacy#what_is_differential_privacy) over some statistical functions to introduce noise in the data, making it difficult to extract information about individuals when analyzing query results.

Snowflake now supports [differential privacy](../../../../../../user-guide/diff-privacy/differential-privacy-overview.md) natively. However, SnowConvert AI has not yet implemented the translation for this feature. Any use of differential privacy in BigQuery will be commented out and this issue will be generated to flag the need for manual conversion.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT
  WITH DIFFERENTIAL_PRIVACY
    OPTIONS(epsilon=10, delta=.01, max_groups_contributed=2, privacy_unit_column=id)
    item,
    COUNT(quantity, contribution_bounds_per_group => (0,100)) total_quantity
FROM professors
GROUP BY item;
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0003 - PENDING SNOWCONVERT AI TRANSLATION FOR DIFFERENTIAL PRIVACY. ***/!!!
  WITH DIFFERENTIAL_PRIVACY
    OPTIONS(epsilon=10, delta=.01, max_groups_contributed=2, privacy_unit_column=id)
    item,
    COUNT(quantity,
                    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0003 - PENDING SNOWCONVERT AI TRANSLATION FOR DIFFERENTIAL PRIVACY. ***/!!! contribution_bounds_per_group => (0,100)) total_quantity
FROM
  professors
GROUP BY item;
```

#### Recommendations

1. **Use native Snowflake support:** Snowflake now supports [differential privacy](../../../../../../user-guide/diff-privacy/differential-privacy-overview.md) natively. Rewrite the BigQuery differential privacy syntax using Snowflake’s privacy policies and privacy budgets.
2. **Key differences:** Snowflake’s differential privacy implementation uses privacy policies assigned to tables/views, privacy budgets to manage analyst queries, and privacy domains for fact and dimension columns. The syntax differs from BigQuery’s inline `WITH DIFFERENTIAL_PRIVACY` clause.
3. **Further reading:** [Snowflake Differential Privacy Overview](../../../../../../user-guide/diff-privacy/differential-privacy-overview.md)

## SSC-EWI-BQ0004

Snowflake does not support named windows.

### Severity

Medium

#### Description

BigQuery allows the definition and usage of named windows in aggregate functions, they are defined in the `WINDOW` clause of the query they are used and can be used inside the `OVER` clause of these functions.

Snowflake does not support declaring named windows, please consider taking the window definition and apply it to all usages of that window directly in the `OVER` clause of the functions.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT
    COUNT(col1) OVER(myWindow)
FROM
    test.exampleTable
WINDOW
    myWindow AS (ORDER BY col2);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
    COUNT(col1)
    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0004 - SNOWFLAKE DOES NOT SUPPORT NAMED WINDOWS. ***/!!! OVER(myWindow)
FROM
    test.exampleTable
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0004 - SNOWFLAKE DOES NOT SUPPORT NAMED WINDOWS. ***/!!!
WINDOW
    myWindow AS (ORDER BY col2);
```

#### Recommendations

* Review your named window definitions, it might be possible to take the definition and apply it to the `OVER` clause of the functions it is used in. However, keep in mind the functional differences between BigQuery and Snowflake window frames still apply, take the following case as an example:

BigQuery:

```sql
 SELECT
    COUNT(col1) OVER(myWindow)
FROM
    test.exampleTable
WINDOW
    myWindow AS (ORDER BY col2);
```

Snowflake:

```sql
 SELECT
    COUNT(col1) OVER(ORDER BY col2)
FROM
    test.exampleTable;
```

These two queries will produce the same rows but the Snowflake results will not be ordered, this is because the `ORDER BY` clause for window frames does **not** impact the entire query ordering as it does in BigQuery.

## SSC-EWI-BQ0005

Javascript code has not been validated by SnowConvert AI.

### Severity

High

### Description

SnowConvert AI does not transform Javascript code. Since the Javascript code extracted from BigQuery’s functions hasn’t been changed at all, this code might need some tweaks to work on Snowflake.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE FUNCTION test.languageJs (x integer, y integer)
RETURNS integer
LANGUAGE js
AS "return x * y;";
```

##### Generated Code:

##### Snowflake

```sql
 CREATE FUNCTION test.languageJs (x integer, y integer)
RETURNS DOUBLE
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0005 - JAVASCRIPT CODE HAS NOT BEEN VALIDATED BY SNOWCONVERT AI. ***/!!!
AS
$$
return x * y;
$$;
```

#### Recommendations

* Review all Javascript code before deployment.
* Javascript parameters in Snowflake must be uppercase.
* For more information, visit Snowflake’s [Introduction to Javascript UDFs](../../../../../../developer-guide/udf/javascript/udf-javascript-introduction.md).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0006

Oriented parameter in the ST_GEOGFROMTEXT function is not supported in Snowflake.

### Severity

Low

#### Description

This warning is added when the oriented parameter is specified in the [`ST_GEOGFROMTEXT`](https://cloud.google.com/bigquery/docs/reference/standard-sql/geography_functions#st_geogfromtext) function, because it is not supported in Snowflake. If this parameter is set to TRUE, any polygon in the input is assumed to be oriented as follows: if someone walks along the polygon boundary in the order of the input vertices, the interior of the polygon is to the left. This allows WKT to represent polygons larger than a hemisphere. If oriented is FALSE or omitted, this function returns the polygon with the smallest area.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT ST_GEOGFROMTEXT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))', TRUE);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0006 - ORIENTED PARAMETER IN THE ST_GEOGFROMTEXT FUNCTION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
 ST_GEOGFROMTEXT('POLYGON((0 0, 1 0, 1 1, 0 1, 0 0))');
```

#### Recommendations

1. **Review polygon orientation:** If the `oriented` parameter was set to `TRUE`, verify that the polygon does not span more than a hemisphere. Snowflake’s `ST_GEOGFROMTEXT` always returns the polygon with the smallest area.
2. **Manual validation:** For polygons larger than a hemisphere, consider splitting them into smaller polygons or using alternative geospatial representations.
3. **Remove the parameter:** After manual review, remove the `oriented` parameter from the function call, as Snowflake’s `ST_GEOGFROMTEXT` accepts only the WKT string argument.

## SSC-EWI-BQ0007

Escape Sequence is not valid in Snowflake.

### Severity

Low

#### Description

Bell character (\a) and Vertical character (\v) are valid escape sequences in BigQuery, but not in Snowflake.

This warning is added when a bell character or vertical character escape sequence is found when translating BigQuery code. For more information, see [BigQuery Escape Sequences](https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical#escape_sequences).

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT "\a";
SELECT "\v";
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0007 - ESCAPE SEQUENCE \a IS NOT VALID IN SNOWFLAKE. ***/!!!
    '\a';
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0007 - ESCAPE SEQUENCE \v IS NOT VALID IN SNOWFLAKE. ***/!!!
    '\v';
```

#### Recommendations

1. **Replace with Unicode escapes:** Replace `\a` (bell character, U+0007) with `\x07` and `\v` (vertical tab, U+000B) with `\x0B`, which are supported by Snowflake.
2. **Review usage:** If the escape sequence was used for formatting purposes, consider whether it is still needed in the Snowflake context.

## SSC-EWI-BQ0008

Eight hex digit Unicode escape sequence is not supported in Snowflake.

### Severity

Low

#### Description

BigQuery supports Unicode sequences of 8 hex digits. Snowflake doesn’t support this kind of Unicode sequences.

This warning is added when an 8 hex digits Unicode sequence is found when translating BigQuery code. More about [BigQuery Escape Sequences](https://cloud.google.com/bigquery/docs/reference/standard-sql/lexical#escape_sequences).

#### Code Example

##### Input Code:

##### BigQuery

```sql
 SELECT "\U00100000";
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0008 - EIGHT HEX DIGIT UNICODE ESCAPE SEQUENCE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
    '\U00100000';
```

#### Recommendations

1. **Use surrogate pairs:** Convert the 8-hex-digit Unicode sequence into two 4-hex-digit surrogate pair sequences. For example, `\U00100000` can be represented using surrogate pairs `\uDBC0\uDC00`.
2. **Use CHR function:** Alternatively, use Snowflake’s `CHR` function with the Unicode code point to generate the character at runtime.

## SSC-EWI-BQ0009

SnowConvert AI was unable to generate the correct return table clause. Missing symbol information.

### Severity

High

#### Description

Snowflake requires a valid RETURNS TABLE clause for CREATE TABLE FUNCTION statements. SnowConvert AI has to build a new one from the ground up. To do this, an analysis is made on the CREATE TABLE FUNCTION query in order to properly infer the types of the columns of the resulting table, however there may be scenarios where SnowConvert AI currently has a limitation to be able to build the return clause properly.

These scenarios will be considered in the future, but in the meantime this error will be added.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE TABLE FUNCTION tableValueFunction2()
AS
SELECT *
REPLACE("John" AS employee_name)
FROM employees;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE FUNCTION tableValueFunction2 ()
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0009 - SNOWCONVERT AI WAS UNABLE TO GENERATE THE CORRECT RETURN TABLE CLAUSE. MISSING SYMBOL INFORMATION. ***/!!!
RETURNS TABLE (
)
AS
  $$
      SELECT
        * REPLACE("John" AS employee_name) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ExceptReplaceOperator' NODE ***/!!!
      FROM
        employees
  $$;
```

#### Recommendations

1. **Manually define the RETURNS TABLE clause:** Inspect the original BigQuery TABLE FUNCTION body to determine the column names and types of the result set, then populate the empty `RETURNS TABLE()` clause with the correct column definitions.
2. **Provide source references:** If the issue is caused by missing references, ensure all referenced tables and views are included in the input provided to SnowConvert AI.

## SSC-EWI-BQ0010

The resulting table has no columns

### Severity

Medium

#### Description

This EWI is added when SnowConvert AI creates an external table whose definition has no columns. External tables in BigQuery can be defined using only OPTIONS (e.g., FORMAT and URIS) without explicit column definitions, relying on schema inference. When the resulting table structure has no columns after conversion, SnowConvert AI emits this EWI to flag that manual definition of the table schema may be required.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE EXTERNAL TABLE my_dataset.sensor_readings
OPTIONS (
  format = 'PARQUET',
  uris = ['gs://my_bucket/sensors/*.parquet']
);
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0010 - THE RESULTING TABLE HAS NO COLUMNS ***/!!!
CREATE EXTERNAL TABLE my_dataset.sensor_readings
OPTIONS (
  format = 'PARQUET',
  uris = ['gs://my_bucket/sensors/*.parquet']
);
```

#### Recommendations

1. **Provide column definitions:** If the source BigQuery external table uses inferred schema, manually add the expected column definitions to the generated Snowflake external table based on the actual file structure.
2. **Use INFER_SCHEMA:** Consider using Snowflake’s [INFER_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) function with a sample file path (without wildcards) to generate the table template.
3. **Include table definitions:** Ensure all referenced table or view definitions are included in the input provided to SnowConvert AI so that symbol information can be collected.

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0011

Session variable assignment of complex datatype is not supported in Snowflake

### Severity

Medium

#### Description

In BigQuery, declaring a variable at script level allows it to be used in the entire script, to replicate this behavior in Snowflake [SQL variables](https://docs.snowflake.com/en/sql-reference/session-variables) are used.

However, declaring variables of datatypes that are complex like ARRAY, GEOGRAPHY, STRUCT or JSON will fail in Snowflake when trying to set the value to the SQL variable. When SnowConvert AI detects one of such cases then this is EWI will be added to the SQL variable declaration.

Variables of these types can be declared without problems inside block statements and other procedural statements, this EWI applies only for variables declared at script level.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE TABLE test.JsonTable
(
    col1 JSON
);

DECLARE myVar1 JSON DEFAULT JSON'{"name": "John", "age": 30}';

INSERT INTO test.JsonTable VALUES (myVar1);

BEGIN
    DECLARE myVar2 JSON DEFAULT JSON'{"name": "Mike", "age": 27}';
    INSERT INTO test.JsonTable VALUES (myVar2);
END;

SELECT col1 FROM test.JsonTable;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE test.JsonTable
(
    col1 VARIANT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}';

!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0011 - SESSION VARIABLE ASSIGNMENT OF COMPLEX DATATYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
SET myVar1 = (
    SELECT
        PARSE_JSON('{"name": "John", "age": 30}')
);

INSERT INTO test.JsonTable
VALUES ($myVar1);

BEGIN
    LET myVar2 VARIANT DEFAULT PARSE_JSON('{"name": "Mike", "age": 27}');
    INSERT INTO test.JsonTable
    VALUES (:myVar2);
END;

SELECT col1 FROM
    test.JsonTable;
```

#### Recommendations

* If the uses of the variable are limited to a single scope or its value is never modified, consider declaring the variable locally in the scopes that use it, that will solve the issue.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0012

SnowConvert AI was unable to generate a correct OBJECT_CONSTRUCT parameter. Missing symbol information.

### Severity

High

#### Description

SnowConvert AI was unable to generate a correct `OBJECT_CONSTRUCT` parameter due to missing symbol information. This typically occurs when the table definition is not included in the input provided to SnowConvert AI, or when the table uses complex types (such as `STRUCT`) whose field names are needed to build the `OBJECT_CONSTRUCT` call.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 INSERT INTO test.tuple_sample
VALUES
  ((12, 34)),
  ((56, 78)),
  ((9, 99)),
  ((12, 35));
```

##### Generated Code:

##### Snowflake

```sql
 INSERT INTO test.tuple_sample
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0012 - SNOWCONVERT AI WAS UNABLE TO GENERATE A CORRECT OBJECT_CONSTRUCT PARAMETER. MISSING SYMBOL INFORMATION. ***/!!!
VALUES
  ((12, 34)),
  ((56, 78)),
  ((9, 99)),
  ((12, 35));
```

#### Recommendations

1. **Provide table definitions:** Ensure all referenced table definitions (CREATE TABLE statements) are included in the input provided to SnowConvert AI so that symbol information can be collected.
2. **Manual replacement:** Inspect the original BigQuery INSERT statement and manually construct the `OBJECT_CONSTRUCT` call with the correct field names and values matching the target table’s schema.

## SSC-EWI-BQ0013

External table data format not supported in snowflake

> **Warning:**
>
> This EWI is deprecated; please refer to [SSC-EWI-0029](generalEWI.md) for the latest version of this EWI.

### Severity

Medium

#### Description

Snowflake supports the following BigQuery formats:

| BigQuery | Snowflake |
| --- | --- |
| AVRO | AVRO |
| CSV GOOGLE_SHEETS | CSV |
| NEWLINE_DELIMITED_JSON JSON | JSON |
| ORC | ORC |
| PARQUET | PARQUET |

When an external table has other FORMAT not specified in the above table, this EWI will be generated to inform the user that the FORMAT is not supported.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table
OPTIONS (
  format = 'DATASTORE_BACKUP',
  uris = ['gs://backup_bucket/backup_folder/*']
);
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0013 - EXTERNAL TABLE DATA FORMAT NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table
OPTIONS (
  format = 'DATASTORE_BACKUP',
  uris = ['gs://backup_bucket/backup_folder/*']
);
```

#### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0014

Hive partitioned external tables are not supported in Snowflake

### Severity

Medium

#### Description

Snowflake does not support hive partitioned external tables, when the WITH PARTITION COLUMNS clause is found in the external table, it will be marked as not supported using this EWI.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE EXTERNAL TABLE test.CustomHivePartitionedTable
WITH PARTITION COLUMNS (
  field_1 STRING,
  field_2 INT64)
OPTIONS (
  uris = ['gs://sc_external_table_bucket/folder_with_parquet/*'],
  format = 'PARQUET',
  hive_partition_uri_prefix = 'gs://sc_external_table_bucket/folder_with_parquet',
  require_hive_partition_filter = false);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_CUSTOMHIVEPARTITIONEDTABLE_FORMAT
TYPE = PARQUET;

CREATE EXTERNAL TABLE test.CustomHivePartitionedTable USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-0035 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_CUSTOMHIVEPARTITIONEDTABLE_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0014 - HIVE PARTITIONED EXTERNAL TABLES ARE NOT SUPPORTED IN SNOWFLAKE ***/!!!
WITH PARTITION COLUMNS (
  field_1 STRING,
  field_2 INT64)
PATTERN = 'folder_with_parquet/.*'
FILE_FORMAT = (TYPE = PARQUET)
!!!RESOLVE EWI!!! /*** SSC-EWI-0016 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: HIVE_PARTITION_URI_PREFIX, REQUIRE_HIVE_PARTITION_FILTER. ***/!!!
OPTIONS(
  hive_partition_uri_prefix = 'gs://sc_external_table_bucket/folder_with_parquet',
  require_hive_partition_filter = false
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}';
```

#### Recommendations

1. **Remove the WITH PARTITION COLUMNS clause:** Snowflake external tables use automatic partitioning based on the file path. Remove the `WITH PARTITION COLUMNS` clause from the generated code.
2. **Use Snowflake partitioning:** Define partition columns using expressions in the external table’s column definitions. Snowflake can automatically infer partition columns from the directory structure.
3. **Hive metastore integration:** If you use a Hive metastore, consider integrating it with Snowflake to synchronize external table metadata automatically.

## SSC-EWI-BQ0015

External table requires an external stage to access an external location, define and replace the EXTERNAL_STAGE placeholder

> **Warning:**
>
> This EWI is deprecated; please refer to [SSC-EWI-0032](generalEWI.md) for the latest version of this EWI.

### Description

When transforming the CREATE EXTERNAL TABLE statement, SnowConvert AI will generate an EXTERNAL_STAGE placeholder that has to be replaced with the external stage created for connecting with the external location from Snowflake.

Please refer to the following guides to set up the necessary Storage Integration and External Stage in your Snowflake account:

* [For external tables referencing Amazon S3](https://docs.snowflake.com/en/user-guide/tables-external-s3)
* [For external tables referencing Google Cloud Storage](https://docs.snowflake.com/en/user-guide/tables-external-gcs)
* [For external tables referencing Azure Blob Storage](https://docs.snowflake.com/en/user-guide/tables-external-azure)

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER,
  Name STRING,
  Mail STRING,
  Position STRING,
  Salary INTEGER
)
OPTIONS(
  FORMAT='CSV',
  SKIP_LEADING_ROWS=1,
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Employees.csv']
);
```

##### Generated Code:

##### Snowflake

```
CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER AS CAST(GET_IGNORE_CASE($1, 'c1') AS INTEGER),
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c2') AS STRING),
  Mail STRING AS CAST(GET_IGNORE_CASE($1, 'c3') AS STRING),
  Position STRING AS CAST(GET_IGNORE_CASE($1, 'c4') AS STRING),
  Salary INTEGER AS CAST(GET_IGNORE_CASE($1, 'c5') AS INTEGER)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0015 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Employees.csv'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER =1);
```

#### Recommendations

* Set up your external connection in the Snowflake account and replace the EXTERNAL_STAGE placeholder to complete the transformation.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-BQ0016

Select \* with multiple UNNEST operators will produce column ambiguity

> **Warning:**
>
> This EWI is deprecated; please refer to [SSC-FDM-0012](../functional-difference/bigqueryFDM.md) for the latest version of this issue.

### Severity

Medium

#### Description

As part of the SnowConvert transformation for the UNNEST operator, the [FLATTEN](../../../../../../sql-reference/functions/flatten.md) function is used, this function generates multiple columns not required to emulate the UNNEST operator functionality like the `THIS` or `PATH` columns.

When a SELECT \* with the UNNEST operator is found, SnowConvert will remove the unnecessary columns using the `EXCLUDE` keyword, however, when multiple UNNEST operators are used in the same statement, the columns can not be removed due to ambiguity problems, this EWI will be generated to mark these cases.

It is recommended to expand the SELECT expression list in order to specify only the expected columns and solve this issue.

#### Code Example

##### Input Code:

##### BigQuery

```sql
SELECT * FROM UNNEST ([10,20,30]);

SELECT * FROM UNNEST ([10,20,30]) AS numbers, UNNEST(['Hi', 'Hello', 'Bye']) AS words;
```

##### Generated Code:

##### Snowflake

```sql
SELECT
* EXCLUDE(SEQ, KEY, PATH, THIS, INDEX)
FROM
TABLE(FLATTEN(INPUT => [10,20,30])) AS F0_ (
SEQ,
KEY,
PATH,
INDEX,
F0_,
THIS
);

SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-BQ0016 - SELECT * WITH MULTIPLE UNNEST OPERATORS WILL RESULT IN COLUMN AMBIGUITY IN SNOWFLAKE ***/!!!
 * FROM
TABLE(FLATTEN(INPUT => [10,20,30])) AS numbers (
SEQ,
KEY,
PATH,
INDEX,
numbers,
THIS
),
TABLE(FLATTEN(INPUT => ['Hi', 'Hello', 'Bye'])) AS words (
SEQ,
KEY,
PATH,
INDEX,
words,
THIS
);
```

## SSC-EWI-BQ0017

Pending SnowConvert AI translation for UNNEST of an array of structs

### Severity

Medium

#### Description

When unnesting an array of structs, BigQuery generates a column for each struct field and splits the struct values into their corresponding columns. SnowConvert AI does not yet support this transformation. Whenever SnowConvert AI detects that the UNNEST operator is applied over an array of structs, this EWI is generated to flag the need for manual conversion.

#### Code Example

##### Input Code:

##### BigQuery

```sql
CREATE TABLE test.myTestTable
(
  column1 ARRAY<STRUCT<x INT64, y STRING, z STRUCT<a INT64, b INT64>>>
);

SELECT structValues FROM test.myTestTable AS someTable, UNNEST(someTable.column1) AS structValues;
```

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE test.myTestTable
(
  column1 ARRAY DEFAULT []
);

SELECT structValues FROM
  test.myTestTable AS someTable,
  !!!RESOLVE EWI!!! /*** SSC-EWI-BQ0017 - PENDING SNOWCONVERT AI TRANSLATION FOR UNNEST OF AN ARRAY OF STRUCTS ***/!!! UNNEST(someTable.column1) AS structValues;
```

#### Recommendations

1. **Use FLATTEN with LATERAL:** Manually flatten the array column using Snowflake’s [FLATTEN](../../../../../../sql-reference/functions/flatten.md) function, then extract individual struct fields using dot notation or `GET` on the `VALUE` column.
2. **Example workaround:**

   ```sql
   SELECT f.VALUE:x::INT64 AS x, f.VALUE:y::STRING AS y
   FROM test.myTestTable AS t, LATERAL FLATTEN(INPUT => t.column1) AS f;
   ```

---
title: SnowConvert AI - Code Completeness Score
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/code-completeness-score.md
section: Migrations
---

# SnowConvert AI - Code Completeness Score

## Code Completeness Score Value

This number represents the percentage of code units whose references to other code units are correctly addressed by SnowConvert AI. If the score is less than one hundred represents that there is at least one code unit referencing one or more code units not included in the source code.

### Formula

```none
((total_CU - impacted_CU) / total_CU ) * 100

total_CU = total number of Code Units
impacted_CU = Code Units with missing references
```

#### Sample

```sql
-- Code Unit with no missing references
CREATE TABLE table1
(
    COL1 VARCHAR
)

-- Code Unit with no missing references
SELECT * from table1;

-- Code Unit with a missing reference
SELECT * from missing_table;
```

```sql
-- Code Unit with no missing references
CREATE OR REPLACE TABLE table1
(
    COL1 VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

-- Code Unit with no missing references
SELECT
* from
table1;

-- Code Unit with a missing reference
SELECT
* from
missing_table;
```

**Expected Code Completeness Score:** 66.67

**Explanation:** In this case, we have 3 code units and only one of them has a missing reference. The `SELECT` in line 11 references another code unit called ‘missing_table’ whose definition is not present in the source code, therefore this `SELECT` is considered a code unit with missing references.

---
title: SnowConvert AI - Code Extraction
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/README.md
section: Migrations
---

# SnowConvert AI - Code Extraction

---
title: SnowConvert AI - Considerations
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/considerations/README.md
section: Migrations
---

# SnowConvert AI - Considerations

---
title: SnowConvert AI - Contact Us
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/contact-us.md
section: Migrations
---

# SnowConvert AI - Contact Us

SnowConvert AI [is now a part of Snowflake](https://investors.snowflake.com/news/news-details/2023/Snowflake-Announces-Intent-to-Acquire-Mobilize.Nets-SnowConvert-to-Accelerate-Legacy-Migrations-to-the-Data-Cloud/default.aspx).

For additional information about SnowConvert AI, please contact us at:

* For additional information, contact: [snowconvert-info@snowflake.com](mailto:snowconvert-info%40snowflake.com)
* For technical support, contact: [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Conversion
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/README.md
section: Migrations
---

# SnowConvert AI - Conversion

## How to Execute a Conversion

To execute a conversion, you need a valid access code. You can request one for free from the app by clicking the link ‘Get an Access Code’:

Then complete the form and click on send:

> **Note:**
>
> Personal emails with domain as gmail, outlook, etc, cannot be used to get an access code.

An email will be sent to your inbox containing the access code.

To execute a conversion fill out the required fields in the **‘Project Creation’** page like:

* Project name
* Source language (Teradata, Oracle, Sql-Server and Redshift)
* Input and output folder
* A valid access code

> **Note:**
>
> You can use the same access code to convert from any available source language; just select it in the source dropdown.

## Conversion Setup

To execute a conversion SnowConvert AI will be using all the information provided in the project creation screen, the values that you can change here are:

1. Output folder path (Changing this is optional): \

SnowConvert AI will always generate the output into a sub-folder with the following format: Conversion-[Timestamp of the conversion], this folder will be always inside your provided output path, which means that SnowConvert AI won’t override any previously created output.

2. Conversion settings (Only for Teradata, Oracle, or SQL Server)\

For a better understanding of how the Conversion settings work please go to the specific article of the supported languages:

1. [Teradata Conversion Settings](teradata-conversion-settings.md)
2. [Oracle Conversion Settings](oracle-conversion-settings.md)
3. [SQL Server Conversion Settings](sql-server-conversion-settings.md)
4. [Azure Synapse Conversion Settings](sql-server-conversion-settings.md)

Once you are done with the setup, you just need to click on **‘Save & Start Conversion’** button to save the project, continue with the conversion and the progress screen will inform you about the execution status.

When this process is completed you will be able to see:

1. **Conversion Reports:** Check the conversion reports by clicking on the “View Results” button.
2. **Conversion Output Code**: On the conversion results screen click on the “View Output” button to open the folder containing the converted code.
3. **Retry Conversion**: On the conversion results screen you can select the **Retry Conversion** button to run again the conversion. That is useful if you change the source code and want to convert the new source code again.

---
title: SnowConvert AI - Conversion Issues (EWIs)
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/README.md
section: Migrations
---

# SnowConvert AI - Conversion Issues (EWIs)

When SnowConvert AI cannot completely convert a piece of code, it generates an Error, Warning, and Issue (EWI). Each EWI negatively affects the conversion rate of a code unit. SnowConvert AI may encounter conversion difficulties for various reasons.

* The feature has not been implemented in the conversion tool yet.
* Required dependent code is missing for the conversion rule to work.
* An equivalent statement is not available in Snowflake, or a User-Defined Function (UDF) has not been created to provide similar functionality.

---
title: SnowConvert AI - Conversion Software Terms of Use
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/terms-and-conditions/README.md
section: Migrations
---

# SnowConvert AI - Conversion Software Terms of Use

For the most current and authoritative version of the Conversion Software Terms of Use, please visit the official Snowflake legal site:

[Conversion Software Terms of Use](https://www.snowflake.com/en/legal/technical-services-and-education/conversion-software-terms/)

---
title: SnowConvert AI - Converting subfolders
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/converting-subfolders.md
section: Migrations
---

# SnowConvert AI - Converting subfolders

SnowConvert AI allows you to run a conversion over a specific portion of your code, ignoring the parts that do not need to be converted.

## How to execute a conversion over a subfolder

The ‘**Project Creation**’ page will show a checkbox called ‘**Convert a subfolder’** below the input folder path field.

Click on it to open the **folder explorer** where a specific folder could be selected for conversion.

> **Note:**
>
> The folders shown on the folder explorer component, are the ones that contain files with allowed extensions (depending on the selected source platform). So, if a folder does not show up on the folder explorer, it means it does not contain files with the allowed extensions.

To select a subfolder, click on the radio button located on the left side of the subfolder list item. You can expand or collapse the subfolder to review the files within it by clicking on the subfolder name or clicking on the expand/collapse icon on each item.

After selecting a subfolder, the selected folder path can be viewed on the “**Convert the following**” section above the folder explorer component.

> **Note:**
>
> Hovering on the path label will show a tooltip with the full path, this applies to any field that contains a shortened path (input folder path, output folder path, etc.).

Then, enter your access code and click on the ‘**Save & Start Conversion’** button. The conversion will be executed using **only** the selected subfolder as the input.

When this process is completed you will be able to see:

1. **Conversion Results:** Conversion reports will be open as soon as your conversion is finished and you click on the ‘**View Results**’ button.

   The selected subfolder will appear below the ‘**Execution Summary**’ section along with other information.\
2. **Conversion Output Code**: To check this you only need to click on ‘**View Output**’ on the Conversion Results page and the folder that contains your converted code will be opened.
3. **Retry Conversion**: After you execute a conversion, on the Conversion Results page you can select the **Retry Conversion** button to run again the conversion. That is useful if you change the source code and want to convert the new source code again, or even if you want to select another subfolder to convert.

---
title: SnowConvert AI - Databases & Schemas
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/databases-and-schemas.md
section: Migrations
---

# SnowConvert AI - Databases & Schemas

> **Note:**
>
> This page of the documentation is for Teradata only.

## Number of Databases Containing Objects

Represents the number of databases that contain identified top-level objects. Each different database name will only count as one single database.

It is important to consider that this number will only be incremented by the names used in the top-level objects, the references to the object names will not be counted in this assessment value.

> **Note:**
>
> The SQL and script files affect this field.

### Sample

```sql
CREATE TABLE database1.table1(COL1 INTEGER);
CREATE TABLE DATABASE1.table1(COL1 INTEGER);

CREATE VIEW "database2"."view2" AS SELECT * FROM table2;
CREATE VIEW "DATABASE2"."view2" AS SELECT * FROM table2;

CREATE VIEW view3 AS SELECT * FROM database3.table3;
```

```none
CREATE OR REPLACE TABLE database1.table1 (
COL1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR DATABASE1.table1. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE OR REPLACE TABLE DATABASE1.table1 (
COL1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table2" **
CREATE OR REPLACE VIEW "database2"."view2"
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
* FROM
table2;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table2" **
--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR "DATABASE2"."view2". CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE OR REPLACE VIEW "DATABASE2"."view2"
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
* FROM
table2;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "database3.table3" **
CREATE OR REPLACE VIEW view3
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
* FROM
database3.table3;
```

**Expected Databases containing objects:** 4

**Explanation:** Only the databases used in the name of a DDL (tables, views, macros, join indexes, procedures, and functions) will count as a database object. In this case, **database1** and **DATABASE1** in CREATE TABLE statements and **“database2”** and **“DATABASE2”** in CREATE VIEW statements will be counted.

---
title: SnowConvert AI - DB2
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/README.md
section: Migrations
---

# SnowConvert AI - DB2

This page provides a comprehensive reference for how SnowConvert AI translates IBM DB2 grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Download and Access
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/download-and-access.md
section: Migrations
---

# SnowConvert AI - Download and Access

## Download

Getting up and running with Snowflake SnowConvert AI is quick and easy.

## System requirements

Before you start, make sure that your system meets the minimum requirements list given here:

* MacOS

  + Catalina 10.15.6 or higher
  + Java JDK 8 or higher
  + 4 GB of RAM or higher
* Windows

  + Windows 10
  + .NET Framework 4.6.2 (runtime)
  + Java JDK 8 or higher
  + 4 GB of RAM or higher

Before you download, if you’re into the legalities of SnowConvert AI, you can view our [End User License Agreement (EULA)](../terms-and-conditions/README.md)

> **Note:**
>
> SnowConvert AI will run using .NET 9 and it’s shipped in a self-contained package so you don’t have to install any dependencies

If you encounter any issues in the download, installation, or setup process, let us know! Send a message to [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com). We’ll get you up and running again.

### Download

Download the SnowConvert AI installer from [SnowConvert AI Migrations](https://www.snowflake.com/en/migrate-to-the-cloud/snowconvert-ai/).

We highly recommend completing the free course “[SnowConvert AI for Conversion](https://tinyurl.com/54uzk9nx).” This course provides both an overview and technical hands-on training on how to use SnowConvert AI for assessments and conversions.

### Sign in

You can sign in to SnowConvert AI with your Snowflake credentials (same as that you use to login to http://app.snowflake.com/).
If you do not have Snowflake credentials, create a free trial account at https://signup.snowflake.com/

### Authentication

Ensure that your VPN is connected and you have access to your MFA device for authentication.
SnowConvert AI does not support authentication via passkeys or time-based one time password (TOTP) authentication methods.

To resolve any sign in or authentication issues, contact snowconvert-info@snowflake.com

### Supported Platforms

Refer to [SnowConvert About](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/about)

If you are looking for a platform that is not supported, let us know at [snowconvert-info@snowflake.com](mailto:snowconvert-info%40snowflake.com).

---
title: SnowConvert AI - Elements Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/elements-report.md
section: Migrations
---

# SnowConvert AI - Elements Report

## What is an “element”?

The term “element” is used in this context to address a grammar element; that is, an element from a grammar that has a name, a syntax and a purpose within a specific language.

Usually, these elements are highlighted and quite important within the documentation of a language.

These are some examples of elements in SQL languages:

* Any DDL, such as `CREATE TABLE` and `CREATE VIEW`
* Important content of DML, such as `PARTITION BY` and `NOT NULL`
* Any DML, like `INSERT` and `DELETE`
* Some important expressions, such as `IN`, `NOT IN`, `BETWEEN` and `LIKE`
* Operators, including conditionals and arithmetic operators
* Some internal parts of queries, such as `ORDER BY`, `WHEN`, `INNER JOIN` and `TOP`.
* Important functions, such as `AVG` and `RANK`

Essentially, anything that is worth keeping track of for assessment purposes can be considered an element.

### Where can I find it?

The elements report can be found in a folder named *“reports”*, in the output folder of your conversion. The name of the file itself starts with *“Elements”* so it can easily be located.

The format of the file is **.CSV**.

### What information does it contain?

The elements report is presented in a table format, and contains the following columns:

| Column | Description |
| --- | --- |
| SessionID | The session ID of the transformation. This is a unique identifier for the transformation session. |
| Category | The element's corresponding category. These can be DDL, DDL Content, DML, Functions & Expressions, Statement, Query, and so on. |
| Grammar Element | The name associated to the element, often the same as found in the official documentation for the language. |
| File Type | The type of the file that contains the element. For example: SQL. |
| Total Count | The total count of that particular element found during the transformation process. |
| Not Converted Count (Self) | The count of that particular element that presented issues severe enough for it to not properly transform. Usually unsupported structures or elements that had a particular transformation error. Keep in mind that "Self" means that some of the inner contents of the element may or may not be not converted, but if the element itself did not present errors, it will not be counted towards this column. |
| Language | The programming language or SQL dialect of the source code unit. |

#### Summarization

Each individual element is summarized using a specific criteria, that may include multiple columns to form a “composite key”. The basic grouping is made using the Category, Grammar Element and File Type columns.

Following this convention, the same `SELECT` element could be summarized differently depending on the type of the file that contains it, or two elements that share the same grammar element (or name) may still be summarized independently if their category is different.

---
title: SnowConvert AI - Embedded Code Units Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/embedded-code-units-report.md
section: Migrations
---

# SnowConvert AI - Embedded Code Units Report

## What is a Embedded Code Unit?

A Code Unit, as the name suggests, is the most atomic, standalone executable element. In most cases, these are statements, but they also include script files as well because those are executed as a single element.

So according to the previous definition, an embedded code unit is when a Code Unit is inside a Top Level Code Unit. For more information please refer to [Top-Level Code Units Report](top-level-code-units-report.md).

## Examples of Embedded Code Units

In the following section, we can see some examples of Embedded Code Units.

### Packages

A package can define multiple elements inside its body. The package body is considered the Top-Level Code Unit because those elements cannot be created individually without creating the entire package body. Elements or code units inside a package will count as Embedded Code Units.

The following statements will be counted as embedded code units in `packages`:

* Functions
* Procedures
* Types
* Cursors
* Constants
* Variables
* Exceptions
* Pragmas

```sql
CREATE OR REPLACE PACKAGE my_package1 IS
    PROCEDURE outer_procedure(input_value NUMBER);
END my_package1;
/

CREATE OR REPLACE PACKAGE BODY my_package1 IS
    FUNCTION outer_function(value NUMBER) RETURN NUMBER IS
        BEGIN
            RETURN value * 2;
        END inner_function;

    PROCEDURE outer_procedure(input_value NUMBER) IS
    BEGIN
        DBMS_OUTPUT.PUT_LINE('Result of inner function: ' || inner_function(input_value));
        DBMS_OUTPUT.PUT_LINE('Input Value: ' || input_value);
    END outer_procedure;
END my_package1;
```

For this case the embedded function `"outer_function(NUMBER)"` and the embedded procedure `"outer_procedure(NUMBER)"` will be counted.

## Information in the Embedded Code Units Report

| Column | Description |
| --- | --- |
| Partition Key | The unique identifier of the conversion. |
| File Type | The type of the file that the Embedded Code Unit is in. (SQL, BTEQ, etc…) |
| ParentCategory | The Category of the Top Level Code Unit in which the code unit is embedded. |
| ParentID | The fully qualified name of the Top Level Code Unit in which the code unit is embedded. |
| Category | The broader class or type each Embedded Code Unit belongs to. |
| Code Unit | The type of Embedded Code Unit that this element belongs to. |
| Code Unit Name | The name of the Embedded Code Unit if it has one such as tables or procedures. It will be N/A for elements without a name. |
| File Name | The name of the file in which the Embedded Code Unit is located. Uses the relative path starting from the input directory. |
| Line Number | The line number inside the file where the Embedded Code Unit is located. |
| Lines of Code | The total lines of code that the Embedded Code Unit has. |
| EWI Count | The amount of EWIs found within the code unit. You can learn more about EWIs [here](../../../../technical-documentation/issues-and-troubleshooting/conversion-issues/README.md). |
| FDM Count | The amount of FDMs found within the code unit. You can learn more about FDMs [here](../../../../technical-documentation/issues-and-troubleshooting/functional-difference/README.md). |
| PRF Count | The amount of PRFs found within the code unit. You can learn more about PRFs [here](../../../../technical-documentation/issues-and-troubleshooting/performance-review/README.md). |
| Highest EWI Severity | The highest EWI severity found within the Embedded Code Unit. The severity order is the following:   * N/A (when there are not any EWIs) * Low * Medium * High * Critical |
| UDFs Used | The names of all the user defined functions found within the Embedded Code Unit. The name of the UDFs used are separated by a pipe if there is more than one. |
| EWI | The codes of all the EWIs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| FDM | The codes of all the FDMs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| PRF | The codes of all the PRFs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| Conversion Status | The final status of the conversion of the code unit.  The possible conversion statuses are:   * **NotSupported:** When the Embedded Code Unit has a 0% conversion rate. * **Partial:** When the conversion rate of the Embedded Code Unit is between 0% and 100%. * **Success:** When the Embedded Code Unit conversion rate is 100%. |
| LoC Conversion Percentage | The conversion percentage is based on Lines of Code. A single line of code may have supported and unsupported fragments depending on how the input code was formatted. In these cases, the entire line is considered as not supported. |
| Deployment Order | The deployment order is the topological level of each code unit based on its dependencies. It shows the right order in which the code units should be deployed to avoid missing dependencies during the deployment phase. |

## Example

Assume that the following `CREATE PACKAGE` in ORACLE SQL is located in its file called Oracle_01.sql.

```none
CREATE OR REPLACE PACKAGE my_package1 IS
    PROCEDURE calculate_salary(emp_id IN NUMBER);
END my_package1;
/

CREATE OR REPLACE PACKAGE BODY my_package1 IS
    PROCEDURE calculate_salary(emp_id IN NUMBER) IS
        emp_name VARCHAR2(100);
        emp_salary NUMBER;
    BEGIN
        SELECT name, salary INTO emp_name, emp_salary FROM employees WHERE employee_id = emp_id;
        DBMS_OUTPUT.PUT_LINE('Employee ID: ' || emp_id);
        DBMS_OUTPUT.PUT_LINE('Employee Name: ' || emp_name);
        DBMS_OUTPUT.PUT_LINE('Employee Salary: ' || emp_salary);
    END calculate_salary;
END my_package1;
```

```sql
CREATE SCHEMA IF NOT EXISTS my_package1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees" **
CREATE OR REPLACE PROCEDURE my_package1.calculate_salary(emp_id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        emp_name VARCHAR(100);
        emp_salary NUMBER(38, 18);
    BEGIN
        SELECT name, salary INTO
            :emp_name,
            :emp_salary
        FROM
            employees
        WHERE employee_id = :emp_id;
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF('Employee ID: ' || NVL(:emp_id :: STRING, ''));
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF('Employee Name: ' || NVL(:emp_name :: STRING, ''));
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF('Employee Salary: ' || NVL(:emp_salary :: STRING, ''));
    END;
$$;
```

The Embedded Code Units report will have only one embedded procedure.

Here are all the values that would be reported in the entry of this embedded procedure inside the package:

* The **Partition Key** value will depend on migration so the value here will vary.
* The **File Type** will be SQL because it was migrated on a file with the .sql extension.
* The **ParentCategory** will be `PACKAGE BODY` because the `PACKAGE BODY` is the top level code unit that contains the embedded procedure.
* The **ParentID** will be `my_package1` because is the top level code unit name that contains the embedded procedure.
* The **Category** for the embedded procedure will be `PROCEDURE` because the `CREATE PROCEDURE` statement is part of the `PROCEDURE` Code Unit Category.
* The **Code Unit** itself will be `CREATE PROCEDURE`.
* The **Code Unit** **Name** will be `calculate_salary(NUMBER)`.
* The **File Name** where this code unit was found would be Oracle_01.sql.
* Assuming that the `CREATE PROCEDURE` statement is in the `PACKAGE BODY DEFINITION`, the **Line Number** will be 8.
* The **Lines of Code** number would be 9.
* The **EWI Count** column will report 0 because the output code does not have EWIs.
* The **FDM Count** column will report 3 because the output code has three FDM related to the UDFs that were added to the output code.
* The **PRF Count** column will report N/A because the output code does not have PRFs.
* The **Highest EWI Severity** in this case would be “N/A” because there are no EWIs.
* The **UDFs Used** column will be `DBMS_OUTPUT.PUT_LINE_UDF` because this custom User Defined Function was added to convert the `DBMS_OUTPUT.PUT_LINE`.
* The **EWI** column will show N/A because there are no EWI issues.
* The **FDM** column will show “`SSC-FDM-OR0035`” in this case.
* The **PRF** column will show N/A because there are no PRF issues.
* The **Conversion Status** will be “`Success`” .
* The **LoC Conversion Percentage** is `100%` because all the lines were converted successfully.

## Deployment Order

The deployment order column represents the correct order to deploy each code unit into Snowflake. For the embedded code units report, the deployment order is only available for `FUNCTIONS` and `STORED PROCEDURES`. Other embedded code units would have `N/A` deployment order. See Deployment Order in [Top-Level Code Units Report](top-level-code-units-report.md) for more details.

---
title: SnowConvert AI - ETL Replatform Component Summary Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/etl-replatform-report.md
section: Migrations
---

# SnowConvert AI - ETL Replatform Component Summary Report

The Component Summary Report provides a comprehensive inventory of all identified SSIS components and their migration outcomes. Use this report to understand the overall migration scope and identify areas requiring attention.

## Report Fields

| Field | Description |
| --- | --- |
| **SessionID** | Unique identifier for the migration run |
| **Technology** | Original ETL technology (SSIS) |
| **Category** | Component category (Component, Package, Data Flow, Control Flow) |
| **Subtype** | Component type (e.g., Microsoft.OLEDBSource, Microsoft.DerivedColumn) |
| **FullName** | Full component name including hierarchy (e.g., Package1/DataFlow1/Component1) |
| **FileName** | Relative path to the DTSX file |
| **Status** | Migration status (Success, NotSupported, Partial) |
| **EWI Count** | Number of EWIs for this component |
| **EWIs** | Unique EWI codes found in the component |
| **FDM Count** | Number of FDMs (functional difference messages) |
| **FDMs** | Unique FDM codes found in the component |
| **PRF Count** | Number of PRFs (performance warnings) |
| **PRFs** | Unique PRF codes found in the component |

## How to Use This Report

Use the Status column to prioritize your post-migration work:

* **NotSupported** status or high EWI counts indicate components that require manual intervention
* **Partial** status means the component was converted but has limitations or warnings that need review
* **Success** status indicates a clean conversion with no known issues

## Example CSV

```text
Technology,Category,Subtype,FullName,FileName,Status,EWI Count,EWIs,FDM Count,FDMs,PRF Count,PRFs
SSIS,Component,Data Flow Task,Customer ETL,Package.dtsx,Success,0,,0,,0,
SSIS,Component,Microsoft.OLEDBSource,OLE DB Source,Package.dtsx,Success,0,,0,,0,
SSIS,Component,Microsoft.DerivedColumn,Derived Column 1,Package.dtsx,Success,0,,0,,0,
SSIS,Component,Microsoft.Lookup,Lookup 1,Package.dtsx,Partial,0,,1,SSC-FDM-SSIS0001,0,
```

---
title: SnowConvert AI - ETL Replatform Issues Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/etl-replatform-issues-report.md
section: Migrations
---

# SnowConvert AI - ETL Replatform Issues Report

The EWIs Report provides a detailed inventory of errors, warnings, and issues encountered during migration. Use this report to identify components that require manual intervention or review.

## Report Fields

| Field | Description |
| --- | --- |
| **SessionID** | Unique identifier for the migration run |
| **Code** | Issue code (e.g., SSC-EWI-SSIS0001, SSC-FDM-SSIS0001) |
| **Name** | Issue type or problematic ETL element name |
| **Description** | Brief description of the issue |
| **Parent File Name** | Relative path to the source DTSX file |
| **Component Full Name** | Full name of the SSIS component with the issue |

## How to Use This Report

* Prioritize addressing “Critical” level EWIs as they indicate components that could not be converted and will prevent successful execution in dbt
* “High” severity issues suggest potential problems that might require manual review or adjustments
* “Medium” and “None” severity issues provide context or suggest best practices for the migrated dbt project

## Example CSV

```text
Code,Name,Description,ParentFileName,ComponentFullName
SSC-EWI-SSIS0001,ScriptComponent1,SSIS COMPONENT IS NOT SUPPORTED BY SNOWCONVERT,Package.dtsx,Package.Data Flow Task.ScriptComponent1
SSC-EWI-SSIS0002,DerivedColumn1,SSIS EXPRESSION CANNOT BE CONVERTED TO SNOWFLAKE SQL,Package.dtsx,Package.Data Flow Task.DerivedColumn1
SSC-FDM-SSIS0001,Lookup1,SSIS COMPONENT BEHAVIOR MAY DIFFER IN SNOWFLAKE ENVIRONMENT,Package.dtsx,Package.Data Flow Task.Lookup1
```

---
title: SnowConvert AI - Extraction Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/extraction-validation.md
section: Migrations
---

# SnowConvert AI - Extraction Validation

## Description

This validation step verifies if the entry code was extracted, which means the Extraction Script tool was used.

> **Warning:**
>
> **IMPORTANT**! There is only an Extraction Script tool for [Oracle](https://github.com/Snowflake-Labs/SC.DDLExportScripts/releases/latest/download/oracle.zip), [Teradata](https://github.com/Snowflake-Labs/SC.DDLExportScripts/releases/latest/download/teradata.zip), and [SQLServer](https://github.com/Snowflake-Labs/SC.DDLExportScripts/releases/latest/download/sql-server.zip) languages. The related link will download a .zip with the extraction script and instructions.

If the entry code is not extracted, the following warning is displayed:

### Exception for SQLServer

In the case of SQLServer as an input language, the extraction script does not generate the file required to validate if this tool is being used or not thus if you follow all the instructions mentioned in the extraction script guide, you could create a .***sc_extracted*** file and locate it at the root folder of the entry code to avoid that the warning of not extraction is displayed.

---
title: SnowConvert AI - File and Object Level Breakdown - SQL Files
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/file-and-object-level-breakdown-sql-files.md
section: Migrations
---

# SnowConvert AI - File and Object Level Breakdown - SQL Files

> **Note:**
>
> In Teradata, this table applies to all the files with the following extensions:
>
> * .sql
> * .dml
> * .ddl

## Code Conversion Rate

This section shows the code conversion rate of the SQL files.

### Formula

```none
(converted_lines / total_lines) * 100
```

#### CSV Associated Field Names

* SqlLoCConversionRate

#### Sample

Consider the following example, even though the second table is not recognized due to a parsing error, the comments inside are considered supported lines of code.

```sql
CREATE TABLE sample_table1  -- converted
(    -- line with error
 -- Comment 1  -- converted
 col1 INTEGER,  -- converted
 -- Comment 2  -- converted
 col2 INTEGER,  -- converted
 -- Comment 3  -- converted
 col3 INTEGER,  -- converted
 -- Comment 4  -- converted
 col4 !INTEGER,  -- line with error
 -- Comment 5  -- converted
 col5 INTEGER!  -- line with error
);

CREATE !TABLE sample_table2 -- line with error
(    -- line with error
 -- Comment 1  -- converted
     col1 INTEGER,  -- line with error
 -- Comment 2  -- converted
 col2 INTEGER  -- line with error
)    -- line with error
```

**Expected Conversion Rate**: 65%

**Explanation:** There is a total of 20 lines of code, and 13 of them were successfully converted by the tool. Using the formula, the conversion rate is (13/20)\*100.

A line with an error is defined as every line of code that contains at least one error message. For more information check the Issues and Troubleshooting section of each language documentation.

## Conversion Rate - Files Generated

> **Note:**
>
> This field applies only to Teradata reports.

It describes the percentage of SQL files that were successfully generated. The files that were not generated in the output are due to unexpected issues during the process of transformation.

### Formulae

```none
(files_generated / total_files) * 100
```

#### CSV Associated Field Names

* SqlFilesConversionRate

#### Sample

```none
input_folder
    input1.sql
    input2.sql
    input3.sql
```

```none
:force:
input_folder
    input1.sql
    input2.sql
```

**Expected Files Generated Conversion Rate**: 66.67%

**Explanation:** Only 2 of the 3 input files of the conversion were successfully generated in the output.

## Conversion Rate - LOC

> **Note:**
>
> This field applies only to Teradata reports.

It describes the same as the Code Conversion Rate common section but applies to all the supported SQL file extensions in Teradata.

## Total File Quantity

> **Note:**
>
> This field applies only to Teradata reports.

It describes the total number of identified SQL files.

### CSV Associated Field Names

* SqlFileCount

#### Sample

```none
input_folder
    input1.sql
    input2.dml
    input3.ddl
    input4.bteq
    input5.fl
```

**Expected Total File Quantity**: 3

**Explanation:** In this sample, 3 of the files have a supported SQL extension.

## Total LOC

> **Note:**
>
> This field applies only to Teradata reports.

It describes the same as the Lines of Code common section but applies to all the supported SQL file extensions in Teradata.

## Lines of Code

It represents the number of lines of code in the SQL extension files. This counting does not consider blank lines, only the ones that contain code, comments, or both.

### CSV Associated Field Names

* SqlLinesCount

#### Sample

```none
:force:
Folder1
    input1.sql            -- 20 lines
    input2.sql            -- 20 lines
Folder2
    input3.sql            -- 10 lines
    input4.sql            -- 5 lines
    input5.txt            -- 15 lines
```

```sql
CREATE TABLE sample_table1
(
 -- Comment 1
 col1 INTEGER,
 -- Comment 2
 col2 INTEGER,
 -- Comment 3
 col3 INTEGER,
 -- Comment 4
 col4 !INTEGER,
 -- Comment 5
 col5 INTEGER!
);

CREATE !TABLE sample_table2
(
 -- Comment 1
     col1 INTEGER,
 -- Comment 2
 col2 INTEGER
)
```

**Expected Lines of code**: 55

**Explanation:** Only the lines in the SQL extension files are considered in this section.

## Total Object Quantity

It describes the number of objects successfully identified in the SQL extension files.

### CSV Associated Field Names

* SqlIdentifiedObjects

#### Sample

```sql
CREATE TABLE sample_table1
(
 -- Comment 1
 col1 INTEGER,
 -- Comment 2
 col2 INTEGER,
 -- Comment 3
 col3 INTEGER,
 -- Comment 4
 col4 !INTEGER,
 -- Comment 5
 col5 INTEGER!
);

CREATE !TABLE sample_table2
(
 -- Comment 1
     col1 INTEGER,
 -- Comment 2
 col2 INTEGER
)
```

**Expected Identified Objects**: 1

**Explanation:** There are two `CREATE TABLE` statements in this example. The first one is fully recognized since it is parsed correctly, but the second one has two misspelled words in the definition so it is not recognized by Snow Convert.

## Parsing Errors

This section shows the total number of unrecognized fragments of code in the SQL files.

### CSV Associated Field Names

* SqlTotalParsingErrors

#### Sample

```sql
CREATE TABLE sample_table1
(
 -- Comment 1
 col1 INTEGER,
 -- Comment 2
 col2 INTEGER,
 col3 INTEGER,
 col4 !INTEGER,

 col5 INTEGER!

);

CREATE !TABLE sample_table2
(
 -- Comment 1
     col1 INTEGER,
 -- Comment 2
 col2 INTEGER
)
```

**Expected Parsing Errors**: 3

**Explanation:** There are two parsing errors inside the first table and the second table is considered a whole parsing error due to the misspelled keyword.

---
title: SnowConvert AI - File and Object Level Breakdown - SQL Identified Objects
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/file-and-object-level-breakdown-sql-identified-objects.md
section: Migrations
---

# SnowConvert AI - File and Object Level Breakdown - SQL Identified Objects

## Conversion Rate - Object

> **Note:**
>
> An object is considered successfully migrated if it does not have issues with medium, high or critical severity.

Represents the percentage of identified objects by SnowConvert AI that were successfully migrated. This will help to determine the number of objects that were successfully migrated and the objects that need manual work in order to complete the migration of the objects to Snowflake. If N/A is listed in the column, it means that the object type is not supported in Snowflake. A “-” could also be listed in this column, this means that the set of files migrated by SnowConvert AI did not contain objects of the specific type that could be identified.

### Formula

```sql
(successfully_converted_objects / total_object_quantity) * 100
```

#### CSV Associated Field Names

* **All Languages**

  + **Tables:** SqlTableObjectConversionRate
  + **Views:** SqlViewObjectConversionRate
  + **Procedures:** SqlProcedureObjectConversionRate
  + **Functions:** SqlFunctionObjectConversionRate
  + **Triggers**: SqlTriggerObjectConversionRate
  + **Indexes:** N/A
* **Teradata**

  + **Macros:** SqlMacroObjectConversionRate
  + **Join Indexes:** SqlJoinIndexObjectConversionRate
* **Oracle**

  + **Packages:** SqlPackageObjectConversionRate
  + **Packages Bodies:** SqlPBodyObjectConversionRate
  + **Sequences:** SqlSequenceObjectConversionRate
  + **Synonyms:** SqlSynonymObjectConversionRate
  + **Types:** SqlTypeObjectConversionRate
  + **DB Link:** N/A
  + **Materialized Views:** SqlMaterializedObjectConversionRate
* **SQLServer**

  + **Materialized Views:** SqlMaterializedObjectConversionRate

#### Sample

```sql
-- Table that is migrated successfully to Snowflake.
CREATE TABLE table1 (
  col1 INTEGER
);

-- Table that is not migrated successfully to Snowflake because of the data type of col1.
CREATE TABLE table2 (
  col1 ANYTYPE
);
```

**Expected Object Conversion Rate:** 50%

**Explanation:** With the previous sample code we will have a 50% Object Conversion Rate because only 1 of the 2 identified tables were successfully migrated to Snowflake.

## Conversion Rate - Code

Represents the percentage of lines or characters of code of the top-level object that were successfully migrated. You can read more about the different conversion rate modes and how they are calculated by SnowConvert AI [here](README.md).

### CSV Associated Field Names

> **Note:**
>
> Each top-level object will have two fields for the code conversion rate in the `Assessment.csv` report. One will be for the conversion rate using lines of code and the other using the characters.

* **All Languages:**

  + **Tables**

    - **Lines of Code:** SqlTableLoCConversionRate
    - **Characters:** SqlTableCharacterConversionRate
  + **Views**

    - **Lines of Code:** SqlViewLoCConversionRate
    - **Characters:** SqlViewCharacterConversionRate
  + **Procedures**

    - **Lines of Code:** SqlProcedureLoCConversionRate
    - **Characters:** SqlProcedureCharacterConversionRate
  + **Functions**

    - **Lines of Code:** SqlFunctionLoCConversionRate
    - **Characters:** SqlFunctionCharacterConversionRate
  + **Indexes**

    - **Lines of Code:** N/A
    - **Characters:** N/A
  + **Triggers**

    - **Lines of Code:** SqlTriggerLoCConversionRate
* **Teradata**

  + **Macros**

    - **Lines of Code:** SqlMacroLoCConversionRate
    - **Characters:** SqlMacroCharacterConversionRate
  + **Join Indexes**

    - **Lines of Code:** SqlJoinIndexLoCConversionRate
    - **Characters:** SqlJoinIndexCharacterConversionRate
* **Oracle**

  + **Materialized Views**

    - **Lines of Code:** SqlMaterializedViewLoCConversionRate
    - **Characters:** SqlMaterializedViewCharacterConversionRate
  + **Packages**

    - **Lines of Code:** SqlPackageLoCConversionRate
    - **Characters:** SqlPackageCharacterConversionRate
  + **Package Bodies**

    - **Lines of Code:** SqlPBodyLoCConversionRate
    - **Characters:** SqlPBodyCharacterConversionRate
  + **Sequences**

    - **Lines of Code:** SqlSequenceLoCConversionRate
    - **Characters:** SqlSequenceCharacterConversionRate
  + **Synonyms**

    - **Lines of Code:** SqlSynonymLoCConversionRate
    - **Characters:** SqlSynonymCharacterConversionRate
  + **Types**

    - **Lines of Code:** SqlTypeLoCConversionRate
    - **Characters:** SqlTypeCharacterConversionRate
* **SQLServer**

  + **Materialized Views**

    - **Lines of Code:** SqlMaterializedViewLoCConversionRate
    - **Characters:** SqlMaterializedViewCharacterConversionRate

#### Sample

```sql
CREATE TABLE table1 (
  col1 INTEGER
);
CREATE TABLE table2 (
  col1 ANYTYPE
);
```

**Expected Code Conversion Rate:** 83.33%

**Explanation:** In the previous sample code, there are two `CREATE TABLE` statements and SnowConvert AI is executed using lines of code to calculate the code conversion rate. `table1` was successfully migrated but `table2` was not migrated completely, in this case, line 5 of the input code could not be migrated and only 5 of the 6 total lines of code were migrated successfully. This calculation will generate a conversion rate for tables of 83.33%.

## Lines of Code

Represents the total amount of lines code used for the identified top-level objects. It is important to take into account that the lines of code of the top-level object as well as the comments are used for this column. On the other hand, empty lines will not be counted in this column.

### CSV Associated Field Names

* **All Languages**

  + **Tables:** SqlTableTotalLinesOfCode
  + **Views:** SqlViewTotalLinesOfCode
  + **Procedures:** SqlProcedureTotalLinesOfCode
  + **Functions:** SqlFunctionTotalLinesOfCode
  + **Indexes:** SqlIndexTotalLinesOfCode
  + **Triggers:** SqlTriggerTotalLinesOfCode
* **Teradata**

  + **Macros:** SqlMacroTotalLinesOfCode
  + **Join Indexes:** SqlJoinIndexTotalLinesOfCode
* **Oracle**

  + **Packages:** SqlPackageTotalLinesOfCode
  + **Packages Bodies:** SqlPBodyTotalLinesOfCode
  + **Sequences:** SqlSequenceTotalLinesOfCode
  + **Synonyms:** SqlSynonymTotalLinesOfCode
  + **Types:** SqlTypeTotalLinesOfCode
  + **DB Link:** SqlDbLinkTotalLinesOfCode
  + **Materialized Views:** SqlMaterializedViewTotalLinesOfCode
* **SQLServer**

  + **Materialized Views:** SqlMaterializedViewTotalLinesOfCode

#### Sample

```sql
-- Hello World
CREATE TABLE table1 (
  col1 INTEGER
);

CREATE TABLE table2 (
-- Hello world 2
  col1 ANYTYPE
);
```

**Expected Lines of Code:** 8

**Explanation:** In this case, we have 6 lines that come from the code used for the `CREATE TABLE` statements and 2 for comments that are inside of the top-level objects.

## Total Object Quantity

Represents the total amount of objects identified by SnowConvert AI during the parsing phase.

### CSV Associated Field Names

* **All Languages**

  + **Tables:** SqlTableTotalOccurrences
  + **Views:** SqlViewTotalOccurrences
  + **Procedures:** SqlProcedureTotalOccurrences
  + **Functions:** SqlFunctionTotalOccurrences
  + **Indexes:** SqlIndexTotalOccurrences
  + **Triggers:** SqlTriggerTotalOccurrences
* **Teradata**

  + **Macros:** SqlMacroTotalOccurrences
  + **Join Indexes:** SqlJoinIndexTotalOccurrences
* **Oracle**

  + **Packages:** SqlPackageTotalOccurrences
  + **Packages Bodies:** SqlPBodyTotalOccurrences
  + **Sequences:** SqlSequenceTotalOccurrences
  + **Synonyms:** SqlSynonymTotalOccurrences
  + **Types:** SqlTypeTotalOccurrences
  + **DB Link:** SqlDbLinkTotalOccurrences
  + **Materialized Views:** SqlMaterializedViewTotalOccurrences
* **SQLServer**

  + **Materialized Views:** SqlMaterializedViewTotalOccurrences

#### Sample

```sql
-- Successfully parsed table.
CREATE TABLE table1 (
  col1 INTEGER
);

-- Table with a parsing error that could not be identified.
CRATE TABLE table2 (
  col1 INTEGER
);
```

**Expected Total Object Quantity:** 1.

**Explanation:** One table was completely parsed by SnowConvert AI during the parsing phase but the other table has a parsing error that causes SnowConvert AI to not identify it as a table object.

## Parsing Errors

Represents the number of parsing errors that are inside of the identified objects of each top-level object type.

### CSV Associated Field Names

* **All Languages**

  + **Tables:** SqlTableTotalParsingErrors
  + **Views:** SqlViewTotalParsingErrors
  + **Materialized Views:** SqlMaterializedViewTotalParsingErrors
  + **Procedures:** SqlProcedureTotalParsingErrors
  + **Functions:** SqlFunctionParsingErrors
  + **Triggers**: SqlTriggerTotalParsingErrors
  + **Indexes**: SqlIndexTotalParsingErrors
* **Teradata**

  + **Macros:** SqlMacroTotalParsingErrors
  + **Join Indexes:** SqlJoinIndexTotalParsingErrors
* **Oracle**

  + **Packages:** SqlPackageTotalParsingErrors
  + **Packages Bodies:** SqlPBodyTotalParsingErrors
  + **Sequences:** SqlSequenceTotalParsingErrors
  + **Synonyms:** SqlSynonymTotalParsingErrors
  + **Types:** SqlTypeTotalParsingErrors
  + **DB Link:** SqlDbLinkTotalParsingErrors
  + **Materialized Views:** SqlMaterializedViewTotalParsingErrors
* **SQLServer**

  + **Materialized Views:** SqlMaterializedViewTotalParsingErrors

#### Sample

```sql
-- Table with parsing error but still was identified by SnowConvert.
CREATE TABLE table1 (
  col3 NUMBER,
);

-- Table with parsing error but was not identified by SnowConvert.
CRATE TABLE table2 (
  col1 INTEGER
);
```

**Expected Parsing Errors:** 1

**Explanation:** Only one parsing error will be reported in the **Parsing Errors** column because SnowConvert AI was able to only identify the first table. Since the second table was not identified, those parsing errors will not be counted in the **Parsing Errors** column.

---
title: SnowConvert AI - File Encoding Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/file-encoding-validation.md
section: Migrations
---

# SnowConvert AI - File Encoding Validation

## Description

This validation step tries to recognize the file’s encoding; if not, it is marked as invalid. If it is recognized as different from the encoding selected in the Assessment or Conversion configuration process, the file will also be marked as invalid.

> **Warning:**
>
> **IMPORTANT**! The entry code files should contain the [BOM](https://en.wikipedia.org/wiki/Byte_order_mark) signature to recognize the file’s encoding.

If this validation step fails, the following warning is displayed:

Also, in the ScopeValidation report, you will find information about the failed file(s).

---
title: SnowConvert AI - File Extension Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/file-extension-validation.md
section: Migrations
---

# SnowConvert AI - File Extension Validation

## Description

This validation step verifies the file extensions. These are the valid file extensions:

* All the languages (.sql)
* Teradata (.ddl, .dml, .bteq, .btq, .fl, .fload, .ml, .mld, .mload, .tp, .tpump, .tpt)
* Hive (.hql)

> **Warning:**
>
> **IMPORTANT**! Uppercase file extensions are also invalid.

If one of the files has an invalid extension, the following warning is displayed:

Also, in the ScopeValidation report, you will find information about the failed file(s).

---
title: SnowConvert AI - File Format Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/file-format-validation.md
section: Migrations
---

# SnowConvert AI - File Format Validation

## Description

This validation step verifies the file’s structure and indentation. If the average number of characters per line across all input code files is greater than the maximum allowed, the following warning is displayed:

```none
CREATE TABLE LongLines(

    COL1                                                                                                                                                                                                                                                                                                                                                                                                                                                    VARCHAR(22331) -- this line has more than 500 characters
);
```

> **Note:**
>
> Please scroll to the right to watch all the sample code

---
title: SnowConvert AI - Frequently Asked Questions (FAQ)
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/frequently-asked-questions-faq.md
section: Migrations
---

# SnowConvert AI - Frequently Asked Questions (FAQ)

## What database platforms does SnowConvert AI translate SQL code from?

SnowConvert AI can translate SQL code from Teradata, Oracle, SQL Server, Amazon Redshift, Sybase IQ, Google BigQuery, Azure Synapse, Greenplum, PostgresSQL, Vertica, Hive, Spark, Databricks, Netezza and IBM DB2.

---

## How do I get SnowConvert AI?

SnowConvert AI can be officially downloaded in the Snowsight Snowflake web page.

However, it is highly recommended to take the free course “[SnowConvert AI for Conversion](https://training.snowflake.com/lmt/!clmsLink.dt?site=sf&amp;region=us&amp;lang=en-us&amp;type=O&amp;id=130596852)”. This course is both an overview and a technical hands on training of how to use SnowConvert AI for assessments and conversions.

If you require additional help, please contact our customer support team at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---

## What are the system requirements for using SnowConvert AI?

### For MacOS

* macOS Ventura 13.3.1 or newer version
* Minimum 4 GB of RAM

### For Windows

* Windows 11 or newer version of Windows operating system
* Minimum of 4 GB RAM (more memory is recommended)

---

## How do I give permission to SnowConvert AI config folder?

Providing access to the SnowConvert AI configuration folder depends on your operating system.

SnowConvert AI requires read, write, and execute permissions for its configuration folder (`.config` on MacOS or `AppData` on Windows). This folder stores temporary files, logs, and license information. To grant SnowConvert AI access to this folder, follow these steps:

### For macOS

1. Open a Terminal window.
2. Navigate to your home directory by typing `cd ~` and pressing Enter.
3. Change the permissions of the .config directory by typing `chmod 777 .config`. If you receive a “Operation not permitted” error, run the command with sudo: `sudo chmod 777 .config`.
4. Close the Terminal window and launch SnowConvert AI.

### For Windows

1. Open the **Run** dialog by pressing the **Windows key + R** on your keyboard.
2. Enter `%AppData%` and press **Enter** or click **OK**.
3. Find the Snowflake Inc folder, right-click on it, and verify that the `Read-only` checkbox under Attributes is unchecked.

---

## How do I make sure that .config is a folder instead of a file?

*This problem only affects macOS systems.*

SnowConvert AI requires read, write, and execute permissions for the configuration folder (`.config` on macOS). This folder is used to store temporary files, log files, and license information.

The `.config` must be a directory (folder). If you find that `.config` exists as a file, you need to convert it to a directory and set the appropriate permissions.

To resolve this issue, follow these steps:

1. Find the `.config` file in your home directory at `'/Users/[Username]/'`.
2. Delete the `.config` file.
3. Create a new folder called `.config` in the same location.
4. Launch Terminal.
5. Navigate to your home directory by typing `cd ~` and pressing Enter.
6. Change folder permissions by typing `chmod 777 .config`. If you see an `Operation not permitted` error, use `sudo chmod 777 .config` instead.
7. Exit Terminal and start SnowConvert AI.

## What is a Top-Level Code Unit?

A Code Unit is the smallest independent piece of code that can be executed. While Code Units typically consist of individual statements, they can also be entire script files since these are executed as one unit. Code Units can be hierarchical, with some units contained within others. When a Code Unit is not nested within any other unit, it is referred to as a Top-Level Code Unit.

---

## Does SnowConvert AI provide resources to understand how it translates SQL code?

You can find the translation reference for each source in the following locations:

* [Teradata](../translation-references/teradata/README.md)
* [Oracle](../translation-references/oracle/README.md)
* [SQL Server](../translation-references/transact/README.md)

---

## What is the code completeness metric?

The Code Completeness score shows whether all necessary code components are present in your codebase. A score below 100 indicates that SnowConvert AI has detected missing object references that may be required for successful migration.

---

### Why my files are not being converted and marked with the code SSC-OOS-001?

Depending on the selected encoding, SnowConvert AI will not be able to parse the input; you should validate the correct encoding in the settings options before starting a conversion. [How to use the setting](getting-started/running-snowconvert/conversion/general-conversion-settings.md).

---

## Are there release notes available for previous versions of SnowConvert AI?

Release notes are available here: [release-notes](release-notes/release-notes/README.md)

---

## Is SnowConvert AI a free tool, or are there paid plans available?

SnowConvert AI is now free for everyone and allows full conversion functionality of your workload.

Besides, if you need additional support you are provided with the option of a Professional Service Engagement.

---

## Why SnowConvert AI is not auto-updating?

### Internet connection

SnowConvert AI automatically checks for new versions when you have an active internet connection. If you receive an error message, first verify that your system is connected to the internet and that the connection is working properly.

If you are still experiencing connection problems, it may be due to a Firewall rule blocking your access.

#### Firewall Blocked

SnowConvert AI checks for updates by connecting to a Snowflake storage service. If your local firewall blocks access to this site, you won’t be able to get updates. If you see a “Destination unreachable” message, ask your network administrator to whitelist the `https://snowconvert.snowflake.com/` website.

---

## How can I remove my licenses ?

To remove all SnowConvert AI licenses, you need to delete the `.profile` file in the config folder. The file location depends on your operating system. Follow the steps specific to your operating system to locate and delete this file.

### Windows

* Exit SnowConvert AI completely.
* Press the Windows key (`⊞ Win`) and ‘R’ key together to open the Run command window. Type `%appdata%Snowflake Inc` and press Enter.
* Find and delete the file named `.profile`.

### MacOS

* Exit SnowConvert AI if it is currently running
* Open Finder and use the keyboard shortcut `⌘ + Shift ⇧ + G` to open “Go to Folder”. Enter `~/.config/Snowflake Inc/` to access the configuration directory
* Look for the “.profile” file. On Mac systems, this is a hidden file. To view hidden files, use the keyboard shortcut `⌘ + Shift ⇧ + .`
* Find and remove the “.profile” file

After deleting the file, when you open SnowConvert AI, you will see an empty license list.

---
title: SnowConvert AI - Function References - Shared
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/function-references/shared/README.md
section: Migrations
---

# SnowConvert AI - Function References - Shared

## INTERVAL_MULTIPLY_UDF (VARCHAR, VARCHAR, INTEGER)

### Definition

This user-defined function (UDF) is used to multiply a time interval by a factor of N.

```sql
INTERVAL_MULTIPLY_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR(), INPUT_MULT INTEGER)
```

### Parameters

`INPUT_PART` VARCHAR

The format of the operation. E.g.: `DAY`, `HOUR TO SECOND,` `YEAR TO MONTH`.

`INPUT_VALUE` VARCHAR

The interval of time to be multiplied.

`INPUT_MULT` INTEGER

The time to multiply the interval of time.

### Returns

Returns a varchar with the result of the multiplication.

### Usage example

Input:

```sql
SELECT INTERVAL_MULTIPLY_UDF('DAY', '2', 100);
```

Output:

```sql
200
```

## TRUNC_UDF (TIMESTAMP_LTZ, VARCHAR)

### Definition

This user-defined function (UDF) reproduces the Teradata and Oracle TRUNC(Date) functionality when the format parameter is specified.

```sql
TRUNC_UDF(DATE_TO_TRUNC TIMESTAMP_LTZ, DATE_FMT VARCHAR(5))
```

### Parameters

`DATE_TO_TRUNC` TIMESTAMP_LTZ

A `timestamp_ltz` value to truncate which must be a date, timestamp, or timestamp with timezone.

`DATE_FMT` VARCHAR

A varchar value that should be one of the date formats supported by the `trunc` function.

### Returns

Returns a date truncated using the format specified.

### Usage example

Input:

```sql
SELECT TRUNC_UDF(TIMESTAMP '2015-08-18 12:30:00', 'Q')
```

Output:

```sql
2015-07-01
```

## INTERVAL_TO_SECONDS_UDF (VARCHAR, VARCHAR)

### Definition

This user-defined function (UDF) is used to determine the quantity of seconds from an interval which is also correlated to the processed time type. This is an auxiliary function.

```sql
INTERVAL_TO_SECONDS_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR())
```

### Parameters

`INPUT_PART` VARCHAR

The related type of the second parameter. E.g. `DAY`, `DAY TO HOUR`, `HOUR`, `MINUTE`.

`INPUT_VALUE` VARCHAR

The value to be converted to seconds.

### Returns

Returns a decimal value type with the number of seconds.

### Usage example

Input:

```sql
SELECT INTERVAL_TO_SECONDS_UDF('DAY', '1');
```

Output:

```sql
86400.000000
```

## DATEDIFF_UDF (DATE, STRING)

### Definition

This user-defined function (UDF) is used to generate the difference between an interval value and a date.

```sql
DATEDIFF_UDF(D DATE, INTERVAL_VALUE STRING)
```

### Parameters

`D` DATE

The date to be used to process the difference with the interval.

`INTERVAL_VALUE` STRING

The interval value that will be used to create the difference from.

### Returns

Returns a date with the resulting value of the subtraction of time.

### Usage example

Input:

```sql
SELECT DATEDIFF_UDF('2024-01-30', 'INTERVAL ''2-1'' YEAR(2) TO MONTH');
```

Output:

```sql
2021-12-30
```

## SECONDS_TO_INTERVAL_UDF (VARCHAR, NUMBER)

### Definition

This user-defined function (UDF) is used to transform seconds into intervals. This is an auxiliary function.

```sql
SECONDS_TO_INTERVAL_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE NUMBER)
```

### Parameters

`INPUT_PART` VARCHAR

The related type of the second parameter. E.g. `DAY`, `DAY TO HOUR`, `HOUR`, `MINUTE`, `MINUTE TO SECOND`.

`INPUT_VALUE` VARCHAR

The seconds to be converted to intervals.

### Returns

Returns

### Usage example

Input:

```sql
SELECT SECONDS_TO_INTERVAL_UDF('DAY TO SECOND', '86400');
```

Output:

```sql
1 000:000:000
```

## DATEADD_UDF (STRING, DATE)

### Definition

This user-defined function (UDF) is used to add a date with an interval of time.

```sql
DATEADD_UDF(INTERVAL_VALUE STRING,D DATE)
```

### Parameters

`INTERVAL_VALUE` STRING

The interval of time to be added.

`D` DATE

The date to be added with the interval of time.

### Returns

Returns a date with the addition of the interval of time and the date.

### Usage example

Input:

```sql
SELECT DATEADD_UDF('INTERVAL ''2-1'' YEAR(2) TO MONTH', '2024-01-30');
```

Output:

```sql
2026-02-28
```

## DATEDIFF_UDF (STRING, DATE)

### Definition

This user-defined function (UDF) is used to generate the difference between an interval value and a date.

```sql
DATEDIFF_UDF(INTERVAL_VALUE STRING,D DATE)
```

### Parameters

`INTERVAL_VALUE` STRING

The interval value that will be used to create the difference from.

`D` DATE

The date to be used to process the difference with the interval.

### Returns

Returns a date with the resulting value of the subtraction of time.

### Usage example

Input:

```sql
SELECT DATEDIFF_UDF('INTERVAL ''2-1'' YEAR(2) TO MONTH', '2024-01-30');
```

Output:

```sql
2021-12-30
```

## DATEADD_UDF (DATE, STRING)

### Definition

This user-defined function (UDF) is used to add a date with an interval of time.

```sql
DATEADD_UDF(D DATE, INTERVAL_VALUE STRING)
```

### Parameters

`D` DATE

The date to be added with the interval of time.

`INTERVAL_VALUE` STRING

The interval of time to be added.

### Returns

Returns a date with the addition of the interval of time and the date.

### Usage example

Input:

```sql
SELECT DATEADD_UDF('2024-01-30', 'INTERVAL ''1-1'' YEAR(2) TO MONTH');
```

Output:

```sql
2025-02-28
```

## TO_INTERVAL_UDF (TIME)

### Definition

This user-defined function (UDF) is used to generate a separate interval of time from the current time.

```sql
TO_INTERVAL_UDF(D2 TIME)
```

### Parameters

`D2` TIME

The input time to converts into a separate interval.

### Returns

Returns a string with the information of the input time separated.

### Usage example

Input:

```sql
SELECT TO_INTERVAL_UDF(CURRENT_TIME);
```

Output:

```sql
INTERVAL '4 HOURS,33 MINUTES,33 SECOND'
```

## INTERVAL_TO_MONTHS_UDF (VARCHAR)

### Definition

This user-defined function (UDF) is used to generate an integer with the quantity of a month from an interval. This is an auxiliary function.

```sql
INTERVAL_TO_MONTHS_UDF
(INPUT_VALUE VARCHAR())
```

### Parameters

`INPUT_VALUE` VARCHAR

The interval value to be transformed into months.

### Returns

Returns an integer with the processed information about months.

### Usage example

Input:

```sql
SELECT PUBLIC.INTERVAL_TO_MONTHS_UDF('1-6');
```

Output:

```sql
18
```

## DATEDIFF_UDF (STRING, TIMESTAMP)

### Definition

This user-defined function (UDF) is used to subtract an interval of time with a timestamp.

```sql
DATEADD_UDF(INTERVAL_VALUE STRING,D TIMESTAMP)
```

### Parameters

`INTERVAL_VALUE` STRING

The interval of time to be subtracted.

`D` TIMESTAMP

The timestamp to be subtracted with the interval of time.

### Returns

Returns a date with the subtraction of the interval of time and the date.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF('INTERVAL ''1-1'' YEAR(2) TO MONTH', TO_TIMESTAMP('2024-01-31 05:09:09.799 -0800'));
```

Output:

```sql
2022-12-31 05:09:09.799
```

## MONTHS_TO_INTERVAL_UDF (VARCHAR, NUMBER)

### Definition

This user-defined function (UDF) is used to transform month values to intervals. This is an auxiliary function.

```sql
MONTHS_TO_INTERVAL_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE NUMBER)
```

### Parameters

`INPUT_PART` VARCHAR

The related type of the second parameter. E.g. `YEAR TO MONTH`, `YEAR`, `MONTH`.

`INPUT_VALUE` VARCHAR

The month to be converted to intervals.

### Returns

Returns a varchar with the input value transform to an interval.

### Usage example

Input:

```sql
SELECT MONTHS_TO_INTERVAL_UDF('YEAR TO MONTH', 2);
```

Output:

```sql
2
```

## DATEDIFF_UDF (TIMESTAMP, STRING)

### Definition

This user-defined function (UDF) is used to subtract a timestamp with an interval of time.

```sql
DATEDIFF_UDF(D TIMESTAMP, INTERVAL_VALUE STRING)
```

### Parameters

`D` TIMESTAMP

The timestamp that will be subtracted with the interval of time.

`INTERVAL_VALUE` STRING

The interval of time to be subtracted.

### Returns

Returns a date with the subtraction of the interval of time and the date.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF(TO_TIMESTAMP('2024-01-31 05:09:09.799 -0800'), 'INTERVAL ''1-1'' YEAR(2) TO MONTH');
```

Output:

```sql
2022-12-31 05:09:09.799
```

## TRUNC_UDF (NUMBER)

### Definition

This user-defined function (UDF) reproduces the Teradata and Oracle `TRUNC(Numeric)` functionality when a scale is **not** specified.

```sql
TRUNC_UDF(INPUT NUMBER)
```

### Parameters

`INPUT` NUMBER

The number to truncate.

### Returns

Returns an int as the input truncated to zero decimal places.

### Usage example

Input:

```sql
SELECT TRUNC_UDF(25122.3368)
```

Output:

```sql
25122
```

## TRUNC_UDF (NUMBER, NUMBER)

### Definition

This user-defined function (UDF) reproduces the Teradata and Oracle `TRUNC(Numeric)` functionality when a scale is specified.

```sql
TRUNC_UDF(INPUT NUMBER, SCALE NUMBER)
```

### Parameters

`INPUT` NUMBER

The number to truncate.

`SCALE` NUMBER

The amount of places to truncate (between -38 and 38).

### Returns

Returns an int as the input truncated to scale places.

### Usage example

Input:

```sql
SELECT TRUNC_UDF(25122.3368, -2);
```

Output:

```sql
25100
```

## INTERVAL_ADD_UDF (VARCHAR, VARCHAR, VARCHAR, VARCHAR, CHAR, VARCHAR)

### Definition

This user-defined function (UDF) is used to add or subtract intervals with a specific time type.

```sql
INTERVAL_ADD_UDF
(INPUT_VALUE1 VARCHAR(), INPUT_PART1 VARCHAR(30), INPUT_VALUE2 VARCHAR(), INPUT_PART2 VARCHAR(30), OP CHAR, OUTPUT_PART VARCHAR())
```

### Parameters

`INPUT_VALUE1` VARCHAR

The quantity referenced to a time type.

`INPUT_PART1` VARCHAR

The time type of the *`INPUT_VALUE1`*. E.g.: `HOUR`.

`INPUT_VALUE2` VARCHAR

The second quantity referenced to a time type.

`INPUT_PART2` VARCHAR

The time type of the *`INPUT_VALUE2`*. E.g.: `HOUR`.

`OP` CHAR

The operation. It can be a ‘+’ or a ‘-‘.

`OUTPUT_PART` VARCHAR

The time type of the output operation.

### Returns

Returns a varchar with the result of the indicated operation and values.

### Usage example

Input:

```sql
SELECT INTERVAL_ADD_UDF('7', 'HOUR', '1', 'HOUR', '+', 'HOUR');
```

Output:

```sql
8
```

## DATEADD_UDF (STRING, TIMESTAMP)

### Definition

This user-defined function (UDF) is used to add a timestamp with an interval of time.

```sql
DATEADD_UDF(INTERVAL_VALUE STRING,D TIMESTAMP)
```

### Parameters

`INTERVAL_VALUE` STRING

The interval of time to be added.

`D` TIMESTAMP

The timestamp to be added with the interval of time.

### Returns

Returns a date with the addition of the interval of time and the date.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEADD_UDF('INTERVAL ''1-1'' YEAR(2) TO MONTH', TO_TIMESTAMP('2024-01-31 05:09:09.799 -0800'));
```

Output:

```sql
2025-02-28 05:09:09.799
```

## TRUNC_UDF (TIMESTAMP_LTZ)

### Definition

This user-defined function (UDF) reproduces the Teradata and Oracle TRUNC(Date) functionality when the format parameter is **not** specified.

```sql
TRUNC_UDF(INPUT TIMESTAMP_LTZ)
```

### Parameters

`DATE_TO_TRUNC` TIMESTAMP_LTZ

A `timestamp_ltz` value to truncate which must be a date, timestamp, or timestamp with timezone.

### Returns

Returns a date part of `DATE_TO_TRUNC`.

### Usage example

Input:

```sql
SELECT TRUNC_UDF(TIMESTAMP '2015-08-18 12:30:00')
```

Output:

```sql
2015-08-18
```

## DATEADD_UDF (TIMESTAMP, STRING)

### Definition

This user-defined function (UDF) is used to add a timestamp with an interval of time.

```sql
DATEADD_UDF(D TIMESTAMP, INTERVAL_VALUE STRING)
```

### Parameters

`D` TIMESTAMP

The timestamp to be added with the interval of time.

`INTERVAL_VALUE` STRING

The interval of time to be added.

### Returns

Returns a date with the addition of the interval of time and the date.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEADD_UDF(TO_TIMESTAMP('2024-01-31 05:09:09.799 -0800'), 'INTERVAL ''1-1'' YEAR(2) TO MONTH');
```

Output:

```sql
2025-02-28 05:09:09.799
```

## LOG_INFO_UDP (VARCHAR)

### Definition

This user-defined store procedure (UDP) is used to log messages using the Snowflake [SYSTEM$LOG](../../../../../../developer-guide/logging-tracing/logging-snowflake-scripting.md) functions.

```sql
DATEADD_UDF(D TIMESTAMP, INTERVAL_VALUE STRING)
```

### Parameters

`MESSAGE` VARCHAR

The message to be logged.

### Returns

A success message indicating the log operation was completed.

### Usage example

Input:

```sql
CALL PUBLIC.LOG_INFO_UDP('My log message');
```

Output:

| RESULT |
| --- |
| ‘Message logged successfully’ |

---
title: SnowConvert AI - Function References for Oracle
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/function-references/oracle/README.md
section: Migrations
---

# SnowConvert AI - Function References for Oracle

## DATEDIFF_UDF(TIMESTAMP, NUMBER)

### Definition

This user-defined function (UDF) is used to subtract a `number` (which is a number of days) from a `timestamp`.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM NUMBER)
```

### Parameters

`FIRST_PARAM` TIMESTAMP

The `timestamp` that represents the minuend.

`SECOND_PARAM` NUMBER

The number of days that represents the subtrahend.

### Returns

Returns a timestamp with the difference between the `timestamp` and the `number`.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF('2024-01-26 22:00:50.708 -0800', 3);
```

Output:

```sql
2024-01-23
```

## DATEDIFF_UDF(TIMESTAMP, DATE)

### Definition

This user-defined function (UDF) is used to subtract a `date` from a `timestamp`.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM DATE)
```

### Parameters

`FIRST_PARAM` TIMESTAMP

The `timestamp` that represents the minuend.

`SECOND_PARAM` DATE

The `date` that represents the subtrahend.

### Returns

Returns an integer with the difference between the `timestamp` and the `date`.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF('2024-01-26 22:00:50.708 -0800', TO_DATE('2023-01-26'));
```

Output:

```sql
365
```

## DATE_TO_JULIAN_DAYS_UDF

### Definition

This user-defined function (UDF) transforms from Gregorian date to Julian date (The number of days since January 1, 4712 BC.).

```sql
PUBLIC.DATE_TO_JULIAN_DAYS_UDF(INPUT_DATE DATE)
```

### Parameters

`INPUT_DATE` DATE

The Gregorian date to transform.

### Returns

Returns the date representation of the Julian date.

### Migration example

Input:

```sql
Select TO_CHAR(SYSDATE, 'J') as A from DUAL;
```

Output:

```sql
Select
PUBLIC.DATE_TO_JULIAN_DAYS_UDF(CURRENT_TIMESTAMP()) as A from DUAL;
```

### Usage example

Input:

```sql
SELECT PUBLIC.DATE_TO_JULIAN_DAYS_UDF(DATE '1998-12-25');
```

Output:

```sql
2451173
```

## UTL_FILE.PUT_LINE_UDF

### Definition

This user-defined function (UDF) is used to replicate the functionality of the Oracle UTL_FILE_PUT_LINE procedure.

```sql
UTL_FILE.PUT_LINE_UDF(FILE VARCHAR,BUFFER VARCHAR)
```

### Parameters

`FILE` VARCHAR

The file to open and save the new buffer.

`BUFFER` VARCHAR

The buffer to be saved on the defined file.

### Returns

Returns a varchar with the result.

### Usage example

> **Warning:**
>
> To review the lines in the file, there are two ways: Downloading the file from the Snowflake CLI or briefly review the information with `SELECT * FROM UTL_FILE.FOPEN_TABLES_LINES;` but only if the file has not been closed.

Input:

```sql
CREATE OR REPLACE PROCEDURE PROC()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
    file_data  VARIANT;
   BEGIN

    CALL UTL_FILE.FOPEN_UDF('test2.csv','a');

    SELECT
      *
    INTO
      file_data
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));

    CALL UTL_FILE.PUT_LINE_UDF(:file_data,'New line');

    CALL UTL_FILE.FCLOSE_UDF(:file_data);

   END
$$;

CALL PROC();
```

Output:

```sql
null
```

## UTL_FILE.FOPEN_UDF (VARCHAR,VARCHAR)

### Definition

This user-defined function (UDF) is used to replicate the functionality of the Oracle `UTL_FILE_FOPEN` procedure.

```sql
UTL_FILE.FOPEN_UDF(FILENAME VARCHAR,OPEN_MODE VARCHAR)
```

### Parameters

`FILENAME` VARCHAR

The file to be opened.

`OPEN_MODE` VARCHAR

Indicates the mode on which the file will be available.

### Returns

Returns a varchar with the result.

### Usage example

> **Warning:**
>
> The `UTL_FILE.FOPEN_UDF` allows to open a .csv file. To access the file it is required to create a `stage` for the file and use the Snowflake CLI to upload it.

Input:

```sql
CREATE OR REPLACE PROCEDURE PROC()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
    file_data  VARIANT;
   BEGIN

    CALL UTL_FILE.FOPEN_UDF('test2.csv','a');

    SELECT
      *
    INTO
      file_data
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));

   END
$$;

CALL PROC();
```

Output:

```sql
null
```

## JSON_VALUE_UDF

### Definition

This user-defined function (UDF) reproduces the JSON_VALUE function to extract a single result out of a JSON variable.

```sql
JSON_VALUE_UDF(JSON_OBJECT VARIANT, JSON_PATH STRING, RETURNING_TYPE STRING, ON_ERROR_MESSAGE VARIANT, ON_EMPTY_MESSAGE VARIANT)
```

### Parameters

`JSON_OBJECT` VARIANT

The JSON variable from which to extract the values.

`JSON_PATH` STRING

The JSON path that indicates where the values are located inside the JSON_OBJECT.

`RETURNING_TYPE` STRING

The type to return.

`ON_ERROR_MESSAGE` VARIANT

The error message to add if needed.

`ON_EMPTY_MESSAGE` VARIANT

The error message to add in case of empty message.

### Returns

Returns a single value specified by the JSON_PATH inside the JSON_OBJECT. If the result is not a single value, returns a default error message or an error message defined in the input parameters.

### Usage example

Input:

```sql
   SELECT
     JSON_VALUE_UDF(

     PARSE_JSON('{
  "iceCreamOrders": [
    {
      "customerID": "CUST001",
      "orderID": "ORD001",
      "productID": "PROD001",
      "quantity": 2
    }
  ]
}'),

JSON_EXTRACT_PATH_TEXT('{
  "iceCreamOrders": [
    {
      "customerID": "CUST001",
      "orderID": "ORD001",
      "productID": "PROD001",
      "quantity": 2
    }
  ]
}', 'iceCreamOrders'), 'VARIANT', TO_VARIANT('There was an error'), TO_VARIANT('Empty message'));
```

Output:

```sql
"Empty message"
```

## DATEADD_UDF (FLOAT, TIMESTAMP)

### Definition

This user-defined function (UDF) is used in cases when there is an addition between a `float` number and a `timestamp`.

```sql
PUBLIC.DATEADD_UDF(FIRST_PARAM FLOAT, SECOND_PARAM TIMESTAMP)
```

### Parameters

`FIRST_PARAM` FLOAT

The timestamp number that is going to be added with the second float parameter.

`SECOND_PARAM` DATE

The float number to be added with the timestamp in the first parameter.

### Returns

Returns a timestamp with the addition between the timestamp and the float number specified.

### Usage example

Input:

```sql
SELECT DATEADD_UDF(1, current_timestamp);
```

Output:

```sql
2024-01-30 18:47:16.988
```

## FETCH_BULK_COLLECTIONS_UDF (OBJECT)

### Definition

This user-defined function (UDF) is used to replicate the functionality of fetching bulk for collections in Oracle. This function version receives the cursor only.

```sql
FETCH_BULK_COLLECTIONS_UDF(CURSOR OBJECT)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk collection`.

### Returns

Returns an object with information related to the logic of fetching bulk collections.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTIONS_UDF(:MY_CURSOR)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [
    [
      "TEST_A"
    ]
  ],
  "ROWCOUNT": 1
}
```

## DATEADD_UDF (DATE, FLOAT)

### Definition

This user-defined function (UDF) is used in cases when there is an addition between a date and a type as `float` or `timestamp`.

```sql
PUBLIC.DATEADD_UDF(FIRST_PARAM DATE, SECOND_PARAM FLOAT)
```

### Parameters

`FIRST_PARAM` DATE

The date to be added with the number in the second parameter.

`SECOND_PARAM` FLOAT

The float number that is going to be added with the first date parameter.

### Returns

Returns the addition between the date and the float number specified.

### Migration example

Input:

```sql
SELECT TO_DATE('05/11/21', 'dd/mm/yy') + 3.4 from dual;
```

Output:

```sql
SELECT
PUBLIC.DATEADD_UDF( TO_DATE('05/11/21', 'dd/mm/yy'), 3.4) from dual;
```

### Usage example

Input:

```sql
SELECT DATEADD_UDF('2022-02-14',6);
```

Output:

```sql
2022-02-20
```

## DATEDIFF_UDF(DATE, TIMESTAMP)

### Definition

This user-defined function (UDF) is used to subtract a `timestamp` from a `date`.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM TIMESTAMP)
```

### Parameters

`FIRST_PARAM` DATE

The date over the subtraction is done.

`SECOND_PARAM` TIMESTAMP

The `timestamp` to subtract from the first parameter.

### Returns

Returns an integer with the days between the first and the second parameter.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF(TO_DATE('2024-01-26'), '2022-02-14 15:31:00');
```

Output:

```sql
711
```

## DBMS_RANDOM.VALUE_UDF

### Definition

This user-defined function (UDF) is to replicate the functionality of the Oracle DBMS_RANDOM.VALUE function.

```sql
DBMS_RANDOM.VALUE_UDF()
```

### Parameters

No input parameters.

### Returns

Returns a `double` number with a random number.

### Usage example

Input:

```sql
SELECT DBMS_RANDOM.VALUE_UDF();
```

Output:

```sql
0.6666235896
```

## DBMS_RANDOM.VALUE_UDF (DOUBLE, DOUBLE)

### Definition

This user-defined function (UDF) is to replicate the functionality of the Oracle DBMS_RANDOM.VALUE function.

```sql
DBMS_RANDOM.VALUE_UDF(low DOUBLE, high DOUBLE)
```

### Parameters

`low` DOUBLE

The initial limit to be considered.

`high` DOUBLE

The delimiting limit that coordinates with the first parameter.

### Returns

Returns a `double` number with a random number between the limits specified.

### Usage example

Input:

```sql
SELECT DBMS_RANDOM.VALUE_UDF(1.1, 2.2);
```

Output:

```sql
1.637802374
```

## FETCH_BULK_RECORD_COLLECTIONS_UDF (OBJECT, ARRAY)

### Definition

This user-defined function (UDF) is used to cover the functionality of `fetch bulk records` with different input parameters that determine the information added or the behavior of the cursor.

```sql
FETCH_BULK_RECORD_COLLECTIONS_UDF(CURSOR OBJECT, COLUMN_NAMES ARRAY)
```

### Parameters

`CURSOR` OBJECT

The cursor that is being processed.

`COLUMN_NAMES` ARRAY

The column names that are associated with the cursor.

### Returns

Returns an object with the processed information.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));
INSERT INTO BULKCOLLECTTABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_RECORD_COLLECTIONS_UDF(:MY_CURSOR, NULL)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "RESULT": {
    "TEST": [
      "TEST_A"
    ]
  },
  "ROWCOUNT": 1
}
```

## FETCH_BULK_COLLECTION_RECORDS_UDF (OBJECT, ARRAY)

### Definition

This user-defined function (UDF) is used to replicate the functionality of FETCH in Oracle. This is the variation where it receives the cursor and the column names.

```sql
FETCH_BULK_COLLECTION_RECORDS_UDF(CURSOR OBJECT, COLUMN_NAMES ARRAY)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk`.

`COLUMN_NAMES` ARRAY

The name associated with the column is not the initial name.

### Returns

Returns an object with the records from the `fetch bulk`.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:MY_CURSOR, NULL)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [
    {
      "TEST": "TEST_A"
    }
  ],
  "ROWCOUNT": 1
}
```

## JULIAN_TO_GREGORIAN_DATE_UDF

### Definition

This user-defined function (UDF) is used to transform a Julian date into the formats: JD Edwards, YYYYDDD (astronomical), and YYYYDDD (ordinal).

```sql
JULIAN_TO_GREGORIAN_DATE_UDF(JULIAN_DATE CHAR(7), FORMAT_SELECTED CHAR(1))
```

### Parameters

`JULIAN_DATE` CHAR

The Julian date to transform.

`FORMAT_SELECTED` CHAR

The format required for the logic. E.g. `'E'`, `'J'`, `'R'`. Astronomy standardized or `'J'` is the default format.

### Returns

Returns a variant with the date representation of the Julian date.

### Usage example

Input:

```sql
SELECT JULIAN_TO_GREGORIAN_DATE_UDF('098185');
```

Output:

```sql
'1998-07-04' --(a.k.a Sat Jul 04 1998)
```

## TIMESTAMP_DIFF_UDF

### Definition

This user-defined function (UDF) is used for the timestamps arithmetic operations and the equivalence functionality in Snowflake.

```sql
TIMESTAMP_DIFF_UDF(LEFT_TS TIMESTAMP, RIGHT_TS TIMESTAMP )
```

### Parameters

LEFT_TS TIMESTAMP

The minuend value.

RIGHT_TS TIMESTAMP

The subtrahend value.

### Returns

Returns a varchar with the resulting difference between timestamps.

### Usage example

Input:

```sql
SELECT TIMESTAMP_DIFF_UDF(TO_TIMESTAMP('2024-01-31 11:47:20.532 -0800'), TO_TIMESTAMP('2024-01-31 11:47:20.532 -0800'));
```

Output:

```sql
-000000000  00:00:00.00000000
```

## REGEXP_LIKE_UDF (STRING, STRING, STRING)

### Definition

This user-defined function (UDF) is

```sql
REGEXP_LIKE_UDF(COL STRING, PATTERN STRING, MATCHPARAM STRING)
```

### Parameters

COL STRING

The string to be evaluated with the pattern.

PATTERN STRING

The pattern to be checked.

MATCHPARAM STRING

The match parameter that will determine whether the case-sensitive or not.

### Returns

Returns

### Usage example

Input:

```sql
SELECT REGEXP_LIKE_UDF('san Francisco', 'San* [fF].*', 'i');
```

Output:

```sql
TRUE
```

## FETCH_BULK_COLLECTIONS_UDF (OBJECT, FLOAT)

### Definition

This user-defined function (UDF) is used to replicate the functionality of fetching bulk for collections in Oracle. This function version receives the cursor and the limit value for the row count.

```sql
FETCH_BULK_COLLECTIONS_UDF(CURSOR OBJECT, LIMIT FLOAT)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk collection`.

`LIMIT` FLOAT

The limit for the records to call.

### Returns

Returns an object with information related to the logic of fetching bulk collections.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTIONS_UDF(:MY_CURSOR, 1.0)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [
    [
      "TEST_A"
    ]
  ],
  "ROWCOUNT": 1
}
```

## INIT_CURSOR_UDF

### Definition

This user-defined function (UDF) is to initialize a cursor object with the equivalent functionality.

```sql
INIT_CURSOR_UDF(NAME VARCHAR, QUERY VARCHAR)
```

### Parameters

`NAME` VARCHAR

The name of the cursor.

`QUERY` VARCHAR

The query that is associated with the cursor.

### Returns

Returns an object with the cursor information.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "ISOPEN": false,
  "NAME": "MY_CURSOR",
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "ROWCOUNT": -1
}
```

## UPDATE_PACKAGE_VARIABLE_STATE_UDF

### Definition

This user-defined function (UDF) updates the given package variable values. It is a wrapper for the Snowflake SETVARIABLE() function.

```sql
UPDATE_PACKAGE_VARIABLE_STATE_UDF (VARIABLE VARCHAR, NEW_VALUE VARCHAR)
```

### Parameters

`VARIABLE` VARCHAR

The variable name to set the value.

`NEW_VALUE` VARCHAR

The value that will be stored.

### Returns

Returns a varchar with the information of the updated variable.

### Usage example

> **Warning:**
>
> Please, review the existence of the variable.

Input:

```sql
CALL PUBLIC.UPDATE_PACKAGE_VARIABLE_STATE_UDF('MY_LOCAL_VARIABLE', '1');
```

Output:

```sql
1
```

## OPEN_BULK_CURSOR_UDF (OBJECT)

### Definition

This user-defined function (UDF) is used to pen a cursor without bindings.

```sql
OPEN_BULK_CURSOR_UDF(CURSOR OBJECT)
```

### Parameters

`CURSOR` OBJECT

The cursor to process as open.

### Returns

Returns an object with the current information of the cursor.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "ROWCOUNT": 0
}
```

## DATEADD_UDF (TIMESTAMP, FLOAT)

### Definition

This user-defined function (UDF) is used in cases when there is an addition between a `timestamp` and a `float` number.

```sql
PUBLIC.DATEADD_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM FLOAT)
```

### Parameters

`FIRST_PARAM` TIMESTAMP

The timestamp number that is going to be added with the second float parameter.

`SECOND_PARAM` FLOAT

The float number to be added with the timestamp in the first parameter.

### Returns

Returns a timestamp with the addition between the timestamp and the float number specified.

### Usage example

Input:

```sql
SELECT DATEADD_UDF(current_timestamp, 1);
```

Output:

```sql
2024-01-26 13:22:49.354
```

## DATEDIFF_UDF(TIMESTAMP, TIMESTAMP)

### Definition

This user-defined function (UDF) subtracts a `timestamp` from another `timestamp`.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM TIMESTAMP)
```

### Parameters

`FIRST_PARAM` TIMESTAMP

The `timestamp` that represents the minuend.

`SECOND_PARAM` TIMESTAMP

The `timestamp` that represents the subtrahend.

### Returns

Returns an integer with the difference of days between the first and the second timestamps.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF('2024-01-26 22:00:50.708 -0800','2023-01-26 22:00:50.708 -0800');
```

Output:

```sql
365
```

## UTL_FILE.FCLOSE_UDF

### Definition

This user-defined function (UDF) is used to replicate the functionality of the Oracle `UTL_FILE_FCLOSE` procedure.

```sql
UTL_FILE.FCLOSE_UDF(FILE VARCHAR)
```

### Parameters

`FILE` VARCHAR

The file to process and close.

### Returns

Returns a varchar with the result.

### Usage example

> **Warning:**
>
> The `UTL_FILE.FCLOSE_UDF` closes the file that is being processed. To review the result or handle files, it is required to use the Snowflake CLI console. The Snowflake CLI console allows the upload or download of a file.

Input:

```sql
CREATE OR REPLACE PROCEDURE PROC()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
    file_data  VARIANT;
   BEGIN

    CALL UTL_FILE.FOPEN_UDF('test2.csv','a');

    SELECT
      *
    INTO
      file_data
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));

    CALL UTL_FILE.PUT_LINE_UDF(:file_data,'New line');

    CALL UTL_FILE.FCLOSE_UDF(:file_data);

   END
$$;

CALL PROC();
```

Output:

```sql
null
```

## FETCH_BULK_RECORD_COLLECTIONS_UDF (OBJECT)

### Definition

This user-defined function (UDF) is used to cover the functionality of `fetch bulk records` with different input parameters that determine the information added or the behavior of the cursor.

```sql
FETCH_BULK_RECORD_COLLECTIONS_UDF(CURSOR OBJECT)
```

### Parameters

`CURSOR` OBJECT

The cursor that is being processed.

### Returns

Returns an object with the processed information.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));
INSERT INTO BULKCOLLECTTABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_RECORD_COLLECTIONS_UDF(:MY_CURSOR)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "RESULT": {
    "TEST": [
      "TEST_A"
    ]
  },
  "ROWCOUNT": 1
}
```

## CAST_DATE_UDF

### Definition

The function processes a timestamp in string format to a date. It returns a date with the specified format.

```sql
PUBLIC.CAST_DATE_UDF(DATESTR STRING)
```

### Parameters

`DATESTR` STRING

The date as a `string` to be formatted. The format should be ‘`YYYY-MM-DD"T"HH24:MI:SS.FF'` e.g. `'2024-01-25T23:25:11.120'`.

Please review the following information about formatting [here](https://docs.snowflake.com/en/sql-reference/date-time-input-output#timestamp-formats).

### Returns

Returns a `date` with the new format applied.

### Usage example

Input:

```sql
SELECT PUBLIC.CAST_DATE_UDF('2024-01-25T23:25:11.120');
```

Output:

```sql
2024-01-25
```

## FETCH_BULK_COLLECTION_RECORDS_UDF (OBJECT, FLOAT, ARRAY)

### Definition

This user-defined function (UDF) is used to replicate the functionality of FETCH in Oracle. This is the variation where it receives the cursor, the limit, and the column names.

```sql
FETCH_BULK_COLLECTION_RECORDS_UDF(CURSOR OBJECT, LIMIT FLOAT, COLUMN_NAMES ARRAY)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk`.

`LIMIT` FLOAT

The limit for the records to call.

`COLUMN_NAMES` ARRAY

The name associated with the column is not the initial name.

### Returns

Returns an object with the records from the `fetch bulk`.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:MY_CURSOR, 1.0, NULL)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [
    {
      "TEST": "TEST_A"
    }
  ],
  "ROWCOUNT": 1
}
```

## DATEDIFF_UDF(DATE, INTEGER)

### Definition

This user-defined function (UDF) applies a subtraction of days over a date.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM INTEGER)
```

### Parameters

`FIRST_PARAM` DATE

The initial date to apply the subtraction.

`SECOND_PARAM` INTEGER

The number of days to be subtracted from the first date parameter.

### Returns

Returns the date after subtracting the indicated number of days.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF(TO_DATE('2024-01-26'), 365);
```

Output:

```sql
2023-01-26
```

## DATE_TO_RR_FORMAT_UDF

### Definition

This user-defined function (UDF) transforms from date to oracle RR datetime format date

```sql
PUBLIC.DATE_TO_RR_FORMAT_UDF(INPUT_DATE DATE)
```

### Parameters

`INPUT_DATE` DATE

The date to transform.

### Returns

The input date with years adjusted to RR format.

### Migration example

Input:

```sql
Select TO_DATE('17-NOV-30','DD-MON-RR') as A from DUAL;
```

Output:

```sql
Select
PUBLIC.DATE_TO_RR_FORMAT_UDF( TO_DATE('17-NOV-30', 'DD-MON-YY')) as A from DUAL;
```

### Usage example

Input:

```sql
PUBLIC.CONVERT_DATE_WITH_RR_FORMAT_UDF(TO_DATE('17-NOV-30','DD-MON-YY')) as A from DUAL;
```

Output:

```sql
2030-11-17
```

## FETCH_BULK_RECORD_COLLECTIONS_UDF (OBJECT, INTEGER)

### Definition

This user-defined function (UDF) is used to cover the functionality of `fetch bulk records` with different input parameters that determine the information added or the behavior of the cursor.

```sql
FETCH_BULK_RECORD_COLLECTIONS_UDF(CURSOR OBJECT, LIMIT INTEGER)
```

### Parameters

`CURSOR` OBJECT

The cursor that is being processed.

`LIMIT` INTEGER

The limit of the row count.

### Returns

Returns an object with the processed information.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));
INSERT INTO BULKCOLLECTTABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_RECORD_COLLECTIONS_UDF(:MY_CURSOR, 0)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": false,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": true,
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "RESULT": {
    "TEST": []
  },
  "ROWCOUNT": 0
}
```

## DBMS_OUTPUT.PUT_LINE_UDF

### Definition

This user-defined function (UDF) is used to replicate the functionality of the Oracle DBMS_OUTPUT_PUT_LINE function.

```sql
DBMS_OUTPUT.PUT_LINE_UDF(LOG VARCHAR)
```

> **Warning:**
>
> Notice that performance may be affected by using this UDF. To start logging information uncomment the implementation inside the function.

### Parameters

`LOG` VARCHAR

The information to be shown in the command line.

### Returns

Returns a `varchar` with the information logged.

### Usage example

Input:

```sql
SELECT DBMS_OUTPUT.PUT_LINE_UDF(to_varchar(123));
```

Output:

```sql
123
```

## DATEDIFF_UDF(DATE, DATE)

### Definition

This user-defined function (UDF) is used when there is a subtraction between two dates.

```sql
PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM DATE)
```

### Parameters

`FIRST_PARAM` DATE

The date that represents the minuend in the subtraction.

`SECOND_PARAM` DATE

The date that represents the subtrahen in the subtraction.

### Returns

Returns an integer with the number of days between the dates.

### Usage example

Input:

```sql
SELECT PUBLIC.DATEDIFF_UDF(TO_DATE('2024-01-26'), TO_DATE('2023-01-26'));
```

Output:

```sql
365
```

## OPEN_BULK_CURSOR_UDF (OBJECT, ARRAY)

### Definition

This user-defined function (UDF) is used to open a cursor with bindings.

```sql
OPEN_BULK_CURSOR_UDF(CURSOR OBJECT, BINDINGS ARRAY)
```

### Parameters

`CURSOR` OBJECT

The cursor to process as open.

`BINDINGS` ARRAY

The binding that is related to the cursor.

### Returns

Returns an object with the current information of the cursor.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR, NULL)
        );
        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "ROWCOUNT": 0
}
```

## CLOSE_BULK_CURSOR_UDF

### Definition

This user-defined function (UDF) deletes the temporary table that stores the result set of the cursor and resets the cursor properties to their initial state.

```sql
CLOSE_BULK_CURSOR_UDF(CURSOR OBJECT)
```

### Parameters

`CURSOR` OBJECT

The cursor that is checked and closed.

### Returns

Returns an object with the cursor properties reset.

### Migration example

Input:

```sql
-- [procedure initial logic]
CLOSE C1;
-- [procedure ending logic]
```

Output:

```sql
C1 := (
            CALL CLOSE_BULK_CURSOR_UDF(:C1)
        );
```

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL CLOSE_BULK_CURSOR_UDF(:MY_CURSOR)
        );

        RETURN MY_CURSOR;
    END;
$$;
```

Output:

```sql
{
  "FOUND": null,
  "ISOPEN": false,
  "NAME": "MY_CURSOR",
  "NOTFOUND": null,
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "ROWCOUNT": -1
}
```

## DATEADD_UDF (FLOAT, DATE)

### Definition

This user-defined function (UDF) is used in cases when there is an addition between a type as `float` or `timestamp` and a `date`.

```sql
PUBLIC.DATEADD_UDF(FIRST_PARAM FLOAT, SECOND_PARAM DATE)
```

### Parameters

`FIRST_PARAM` FLOAT

The float number that is going to be added with the second date parameter.

`SECOND_PARAM` DATE

The date to be added with the number in the first parameter.

### Returns

Returns the addition between the float number and the date specified.

### Usage example

Input:

```sql
SELECT DATEADD_UDF(6, '2022-02-14');
```

Output:

```sql
2022-02-20
```

## BFILENAME_UDF

### Definition

The function takes the directory name and the filename parameter as a `string`. Then, it returns a concatenation using the `'\'.`

> **Warning:**
>
> The character `'\'` must be changed to match the Operating System file concatenation character.

```sql
PUBLIC.BFILENAME_UDF (DIRECTORYNAME STRING, FILENAME STRING);
```

### Parameters

`DIRECTORYNAME` STRING

The directory name to be processed as a `string`.

`FILENAME` STRING

The filename to be concatenated.

### Returns

Returns a `string` that contains the directory name and filename concatenated by a `'\'`.

### Migration example

Input:

```sql
SELECT BFILENAME ('directory', 'filename.jpg') FROM DUAL;
```

Output:

```sql
SELECT
PUBLIC.BFILENAME_UDF('directory', 'filename.jpg') FROM DUAL;
```

### Usage example

Input:

```sql
SELECT PUBLIC.BFILENAME_UDF('directory', 'filename.jpg');
```

Output:

```sql
directory\filename.jpg
```

## REGEXP_LIKE_UDF (STRING, STRING)

### Definition

This user-defined function (UDF) is used to support the Oracle `REGEXP_LIKE` functionality.

```sql
REGEXP_LIKE_UDF(COL STRING, PATTERN STRING)
```

### Parameters

COL STRING

The string to be evaluated with the pattern.

PATTERN STRING

The pattern to be checked.

### Returns

Returns a boolean expression. True if the pattern matches the string; otherwise, false.

### Usage example

Input:

```sql
SELECT REGEXP_LIKE_UDF('San Francisco', 'San* [fF].*');
```

Output:

```sql
TRUE
```

## UTL_FILE.FOPEN_UDF (VARCHAR, VARCHAR, VARCHAR)

### Definition

This user-defined function (UDF) is used to replicate the functionality of the Oracle `UTL_FILE_FOPEN` procedure.

```sql
UTL_FILE.FOPEN_UDF(PACKAGE_VARIABLE VARCHAR, FILENAME VARCHAR, OPEN_MODE VARCHAR)
```

### Parameters

`PACKAGE_VARIABLE` VARCHAR

The variable related to the file opening.

`FILENAME` VARCHAR

The file to be opened.

`OPEN_MODE` VARCHAR

Indicates the mode on which the file will be available.

### Returns

Returns a varchar with the result.

### Usage example

> **Warning:**
>
> The `UTL_FILE.FOPEN_UDF` allows to open a .csv file. To access the file it is required to create a `stage` for the file and use the Snowflake CLI to upload it.

Input:

```sql
CREATE OR REPLACE PROCEDURE PROC()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
    file_data  VARIANT;
   BEGIN

    CALL UTL_FILE.FOPEN_UDF(NULL, 'test2.csv','a');

    SELECT
      *
    INTO
      file_data
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));

    CALL UTL_FILE.PUT_LINE_UDF(:file_data,'New line');

    CALL UTL_FILE.FCLOSE_UDF(:file_data);

   END
$$;

CALL PROC();
```

Output:

```sql
null
```

## FETCH_BULK_COLLECTION_RECORDS_UDF (OBJECT)

### Definition

This user-defined function (UDF) is used to replicate the functionality of FETCH in Oracle. This is the variation where it receives the cursor only.

```sql
FETCH_BULK_COLLECTION_RECORDS_UDF(CURSOR OBJECT)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk`.

### Returns

Returns an object with the records from the `fetch bulk`.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:MY_CURSOR)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [
    {
      "TEST": "TEST_A"
    }
  ],
  "ROWCOUNT": 1
}
```

## FETCH_BULK_RECORD_COLLECTIONS_UDF (OBJECT, FLOAT, ARRAY)

### Definition

This user-defined function (UDF) is used to cover the functionality of `fetch bulk records` with different input parameters that determine the information added or the behavior of the cursor.

```sql
FETCH_BULK_RECORD_COLLECTIONS_UDF(CURSOR OBJECT, LIMIT FLOAT, COLUMN_NAMES ARRAY)
```

### Parameters

`CURSOR` OBJECT

The cursor that is being processed.

`LIMIT` FLOAT

The limit of the row count.

`COLUMN_NAMES` ARRAY

The column names that are associated with the cursor.

### Returns

Returns an object with the processed information.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE BULKCOLLECTTABLE(test VARCHAR(100));
INSERT INTO BULKCOLLECTTABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      BULKCOLLECTTABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_RECORD_COLLECTIONS_UDF(:MY_CURSOR, 1.0, NULL)
        );

        RETURN MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": true,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": false,
  "QUERY": "   SELECT * FROM\n      BULKCOLLECTTABLE",
  "RESULT": {
    "TEST": [
      "TEST_A"
    ]
  },
  "ROWCOUNT": 1
}
```

## FETCH_BULK_COLLECTION_RECORDS_UDF (OBJECT, INTEGER)

### Definition

This user-defined function (UDF) is used to replicate the functionality of FETCH in Oracle. This is the variation where it receives the cursor and the limit.

```sql
FETCH_BULK_COLLECTION_RECORDS_UDF(CURSOR OBJECT, LIMIT INTEGER)
```

### Parameters

`CURSOR` OBJECT

The cursor that is processed and filled with the data in the `fetch bulk`.

`LIMIT` FLOAT

The limit for the records to call.

### Returns

Returns an object with the records from the `fetch bulk`.

### Usage example

Input:

```sql
CREATE OR REPLACE TABLE MY_TABLE (test VARCHAR(100));
INSERT INTO MY_TABLE VALUES ('TEST_A');

CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_CURSOR OBJECT := INIT_CURSOR_UDF('MY_CURSOR', '   SELECT * FROM
      MY_TABLE');

    BEGIN
        MY_CURSOR := (
            CALL OPEN_BULK_CURSOR_UDF(:MY_CURSOR)
        );
        MY_CURSOR := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:MY_CURSOR, 0)
        );

        Return MY_CURSOR;
    END;
$$;

CALL MY_PROCEDURE();
```

Output:

```sql
{
  "FOUND": false,
  "ISOPEN": true,
  "NAME": "MY_CURSOR",
  "NOTFOUND": true,
  "QUERY": "   SELECT * FROM\n      MY_TABLE",
  "RESULT": [],
  "ROWCOUNT": 0
}
```

---
title: SnowConvert AI - Function References for SQL-Server
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/function-references/sql-server/README.md
section: Migrations
---

# SnowConvert AI - Function References for SQL-Server

## ISNUMERIC_UDF

### Definition

This user-defined function (UDF) determines whether an expression is a valid numeric type.

```sql
ISNUMERIC_UDF(EXPR VARCHAR)
```

### Parameters

`EXPR` VARCHAR

The expression to be evaluated.

### Returns

Returns 1 when the input expression evaluates to a valid numeric data type; otherwise, it returns 0.

### Usage example

Input:

```sql
SELECT ISNUMERIC_UDF('5');
```

Output:

```sql
1
```

## PATINDEX_UDF

### Definition

This user-defined function (UDF) returns the starting position of the first occurrence of a pattern in a specified expression or zeros if the pattern is not found.

```sql
PATINDEX_UDF(PATTERN VARCHAR, EXPRESSION VARCHAR)
```

### Parameters

`PATTERN` VARCHAR

The pattern to search for.

`EXPRESSION` VARCHAR

The expression that is being evaluated.

### Returns

Returns an integer with the starting position of the pattern.

### Usage example

Input:

```sql
SELECT PATINDEX_UDF('an', 'banana');
```

Output:

```sql
2
```

## ERROR_SEVERITY_UDF

### Definition

This user-defined function (UDF) gets a value indicating the severity of an error. The default value will always be 16.

```sql
ERROR_SEVERITY_UDF()
```

### Parameters

No input parameters.

### Returns

Returns a `string` with the value associated with the SQL variable name `ERROR_SEVERITY`.

### Usage example

Input:

```sql
SELECT ERROR_SEVERITY_UDF();
```

Output:

```sql
null -- No information set.
```

## TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(STRING, STRING, ARRAY, ARRAY)

### Definition

This user-defined function (UDF) emulates the behavior of embedded parameters (Data Binding) in the SP_EXECUTESQL system procedure by directly replacing their values in the SQL string.

Additionally, it removes the OUTPUT parameters from the string as this is done outside the EXECUTE IMMEDIATE to which the SP_EXECUTESQL will be transformed.

For more information, check the SP_EXECUTESQL translation specification.

```sql
TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(
    _SQL_STRING STRING,
    _PARAMS_DEFINITION STRING,
    _PARAMS_NAMES ARRAY,
    _PARAMS_VALUES ARRAY
)
```

### Parameters

`_SQL_STRING` STRING

The string to be transformed.

`_PARAMS_DEFINITION` STRING

The original parameters definition checks the order in which parameter values must be assigned.

`_PARAMS_NAMES` ARRAY

The array of parameter names to replace the values in the SQL string.

`_PARAMS_VALUES` ARRAY

The array of the parameter values to be replaced in the SQL string.

### Returns

Returns a STRING with the embedded parameters values replaced.

### Usage example

Input:

```sql
SELECT TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(
    'SELECT * FROM PERSONS WHERE NAME LIKE (@NAME) AND ID < @id AND AGE < @age;', '@age INT, @id INT, @name VARCHAR(25)',
    ARRAY_CONSTRUCT('', '', ''),
    ARRAY_CONSTRUCT(30, 100, 'John Smith'));
```

Output:

```sql
SELECT * FROM PERSONS WHERE NAME LIKE ('John Smith') AND ID < 100 AND AGE < 30;
```

## TABLE_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a table with a specific name has been created before.

```sql
TABLE_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The table name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the table.

### Usage example

Input:

```sql
SELECT TABLE_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## ERROR_PROCEDURE_UDF

### Definition

This user-defined function (UDF) returns the value associated with the SQL variable name `ERROR_PROCEDURE`.

```sql
ERROR_PROCEDURE_UDF()
```

### Parameters

No input parameters.

### Returns

Returns a `string` with the value associated with the SQL variable name `ERROR_PROCEDURE`.

### Usage example

Input:

```sql
SELECT ERROR_PROCEDURE_UDF();
```

Output:

```sql
null -- No information set.
```

## DB_ID_UDF(STRING)

### Definition

This user-defined function (UDF) emulates the [DB_ID](https://learn.microsoft.com/en-us/sql/t-sql/functions/db-id-transact-sql?view=sql-server-ver16) functionality.

```sql
DB_ID_UDF(p_database_name STRING)
```

### Parameters

`p_database_name` STRING

The name of the database to obtain the id.

### Returns

Returns an id which corresponds to the number assigned to the database when it is created. This number is assigned consecutively.

### Usage example

Input:

```sql
SELECT DB_ID_UDF('MY_DATABASE')
```

Output:

```sql
6
```

> **Warning:**
>
> If the database does not exist, it returns null.

## ERROR_LINE_UDF

### Definition

This user-defined function (UDF) returns the value associated with the SQL variable name `ERROR_LINE`.

```sql
ERROR_LINE_UDF()
```

### Parameters

No input parameters.

### Returns

Returns a `string` with the value associated with the SQL variable name `ERROR_LINE`.

### Usage example

Input:

```sql
SELECT ERROR_LINE_UDF();
```

Output:

```sql
null -- No information set.
```

## FUNCTION_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a function with a specific name has been created before.

```sql
VIEW_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The function name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the function.

### Usage example

Input:

```sql
SELECT FUNCTION_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## CONSTRAINT_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a constraint with a specific name has been created before.

```sql
CONSTRAINT_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The constraint name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the constraint.

### Usage example

Input:

```sql
SELECT CONSTRAINT_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## FOR_XML_UDF (OBJECT, VARCHAR, VARCHAR)

### Definition

This user-defined function (UDF) converts an object to XML.

```sql
FOR_XML_UDF(OBJ OBJECT, ELEMENT_NAME VARCHAR, ROOT_NAME VARCHAR)
```

### Parameters

`OBJ` OBJECT

Object to be converted.

`ELEMENT_NAME` VARCHAR

Element name to be given the object.

`ROOT_NAME` VARCHAR

The root name for XML.

### Returns

Returns a varchar in the format of XML.

### Usage example

Input:

```sql
SELECT
FOR_XML_UDF(OBJECT_CONSTRUCT('id', 1, 'name', 'David'), 'employee', 'employees');
```

Output:

```xml
<employees>
    <employee type="OBJECT">
        <id type="INTEGER">1</id>
        <name type="VARCHAR">David</name>
    </employee>
<employees>
```

## OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if an object with a specific name has been created before.

```sql
OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The object name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the object.

### Usage example

Input:

```sql
SELECT OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## PROCEDURE_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a procedure with a specific name has been created before.

```sql
PROCEDURE_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The procedure name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the procedure.

### Usage example

Input:

```sql
SELECT PROCEDURE_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## ISDATE_UDF

### Definition

This user-defined function (UDF) determines whether the input value is a valid date.

```sql
ISDATE_UDF(DATE_VALUE STRING)
```

### Parameters

`DATE_VALUE` STRING

The date that is going to be evaluated.

### Returns

Returns 1 when the input expression evaluates to a valid date data type; otherwise, it returns 0.

### Usage example

Input:

```sql
SELECT ISDATE_UDF('2024-01-26');
```

Output:

```sql
1
```

## ERROR_NUMBER_UDF

### Definition

This user-defined function (UDF) returns the value associated with the SQL variable name `ERROR_NUMBER`.

```sql
ERROR_NUMBER_UDF()
```

### Parameters

No input parameters.

### Returns

Returns a `string` with the value associated with the SQL variable name `ERROR_NUMBER`.

### Usage example

Input:

```sql
SELECT ERROR_NUMBER_UDF();
```

Output:

```sql
null -- No information set.
```

## OFFSET_FORMATTER (VARCHAR)

### Definition

This user-defined function (UDF) is an **auxiliary function** to format the offset hour and its prefix operator.

```sql
OFFSET_FORMATTER(offset_hrs VARCHAR)
```

### Parameters

`offset_hrs` VARCHAR

The value to be formatted.

### Returns

Returns a varchar value with the formatted output for the offset.

### Usage example

Input:

```sql
 SELECT OFFSET_FORMATTER('2024-01-26 22:00:50.708 -0800');
```

Output:

```sql
2024-01-26 22:00:50.708 -0800
```

## OPENXML_UDF

### Definition

This user-defined function (UDF) generates a query from an XML reading.

```sql
OPENXML_UDF(XML VARCHAR, PATH VARCHAR)
```

### Parameters

`XML` VARCHAR

The XML content as a `varchar`.

`PATH` VARCHAR

The path of the node to extract.

### Returns

Returns a table with the data generated by the XML reading.

### Usage example

Input:

```sql
SELECT * FROM TABLE(OPENXML_UDF('<iceCreamOrders>
    <order>
        <customer customerID="CUST001" contactName="Test ABC">
            <iceCreamOrder orderID="ORD001" employeeID="101" orderDate="2023-05-15T14:30:00">
                <iceCreamDetail productID="001" quantity="2"/>
                <iceCreamDetail productID="003" quantity="1"/>
            </iceCreamOrder>
        </customer>
    </order>
    <order>
        <customer customerID="CUST002" contactName="Test XYZ">
            <iceCreamOrder orderID="ORD002" employeeID="102" orderDate="2023-06-20T12:45:00">
                <iceCreamDetail productID="005" quantity="3"/>
                <iceCreamDetail productID="007" quantity="2"/>
            </iceCreamOrder>
        </customer>
    </order>
</iceCreamOrders>
', 'iceCreamOrders:order'));
```

Output:

|  | Value |
| --- | --- |
| 1 | { "order": { "$name": "order", "customer": [ { "customer": { "$name": "customer", "@contactName": "Test ABC", "@customerID": "CUST001", "iceCreamOrder": [ { "iceCreamOrder": { "$name": "iceCreamOrder", "@employeeID": 101, "@orderDate": "2023-05-15T14:30:00", "@orderID": "ORD001", "iceCreamDetail": [ { "iceCreamDetail": { "$name": "iceCreamDetail", "@productID": "001", "@quantity": 2 } }, { "iceCreamDetail": { "$name": "iceCreamDetail", "@productID": "003", "@quantity": 1 } } ] } } ] } } ] } } |
| 2 | { "order": { "$name": "order", "customer": [ { "customer": { "$name": "customer", "@contactName": "Test XYZ", "@customerID": "CUST002", "iceCreamOrder": [ { "iceCreamOrder": { "$name": "iceCreamOrder", "@employeeID": 102, "@orderDate": "2023-06-20T12:45:00", "@orderID": "ORD002", "iceCreamDetail": [ { "iceCreamDetail": { "$name": "iceCreamDetail", "@productID": "005", "@quantity": 3 } }, { "iceCreamDetail": { "$name": "iceCreamDetail", "@productID": "007", "@quantity": 2 } } ] } } ] } } ] } } |

## QUOTENAME_UDF (VARCHAR, VARCHAR)

### Definition

This user-defined function (UDF) creates a valid SQL Server delimited identifier by returning a Unicode string with the delimiters added.

```sql
QUOTENAME_UDF(STR VARCHAR, QUOTECHAR VARCHAR)
```

### Parameters

`STR` VARCHAR

The string to be transformed.

`QUOTECHAR` VARCHAR

The delimiter to add to the first parameter.

### Returns

Returns a varchar with the second parameter identifier added as delimiter.

### Usage example

Input:

```sql
SELECT QUOTENAME_UDF('test', '?');
```

Output:

```sql
?test?
```

## UPDATE_ERROR_VARS_UDF (STRING, STRING, STRING)

### Definition

This user-defined function (UDF) updates the error variables in an environment in order to know when the procedure throws an error.

```sql
UPDATE_ERROR_VARS_UDF(MESSAGE STRING, SEVERITY STRING, STATE STRING)
```

### Parameters

`STATE` STRING

The state of the error message.

`MESSAGE` STRING

The message to be shown in the error.

`SEVERITY` STRING

The severity of the error.

### Returns

Returns a `string` value with the new error message information.

### Usage example

Input:

```sql
  SELECT UPDATE_ERROR_VARS_UDF('Message', '1', '1');
```

Output:

```sql
1ABC1
```

## ROUND_MILLISECONDS_UDF (TIMESTAMP_TZ)

### Definition

This user-defined function (UDF) is a function that rounds milliseconds to increments of 0, 3, or 7 milliseconds. Transact automatically rounds the milliseconds of datetime values.

```sql
ROUND_MILLISECONDS_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input time to be rounded.

### Returns

Returns the same input `TIMESTAMP_TZ` value but with the milliseconds rounded.

### Usage example

Input:

```sql
SELECT PUBLIC.ROUND_MILLISECONDS_UDF('1900-01-01 00:00:00.995 +0100')
```

Output:

```sql
'1900-01-01 00:00:00.997 +0100'
```

## CAST_NUMERIC_TO_TIMESTAMP_TZ_UDF (NUMBER)

### Definition

This user-defined function (UDF) is used to cast a numeric value to `timestamp_tz`.

```sql
CAST_NUMERIC_TO_TIMESTAMP_TZ_UDF(INPUT NUMBER)
```

### Parameters

`INPUT` NUMBER

The number to be cast.

### Returns

Returns a `timestamp_tz` with the current timezone.

### Usage example

Input:

```sql
SELECT PUBLIC.CAST_NUMERIC_TO_TIMESTAMP_TZ_UDF(0)
```

Output:

```sql
1900-01-01 01:00:00.000 +0100
```

## IDENTITY_UDF

### Definition

This user-defined function (UDF) determines whether an expression is a valid numeric type.

```sql
IDENTITY_UDF()
```

### Parameters

No input parameters.

### Returns

Returns an integer expression.

### Usage example

> **Warning:**
>
> A sequence is generated to support the logic.

Input:

```sql
IDENTITY_UDF()
```

Output:

```sql
1
```

## FOR_XML_UDF (OBJECT, VARCHAR)

### Definition

This user-defined function (UDF) converts an object to XML.

```sql
FOR_XML_UDF(OBJ OBJECT, ELEMENT_NAME VARCHAR)
```

### Parameters

`OBJ` OBJECT

Object to be converted.

`ELEMENT_NAME` VARCHAR

Element name to be given the object.

### Returns

Returns a varchar in the format of XML.

### Usage example

Input:

```sql
SELECT
FOR_XML_UDF(OBJECT_CONSTRUCT('id', 1, 'name', 'David'), 'employee');
```

Output:

```xml
<employee type="OBJECT">
    <id type="INTEGER">1</id>
    <name type="VARCHAR">David</name>
</employee>
```

## QUOTENAME_UDF (VARCHAR)

### Definition

This user-defined function (UDF) creates a valid SQL Server delimited identifier by returning a Unicode string with the delimiters added.

```sql
QUOTENAME_UDF(STR VARCHAR)
```

### Parameters

`STR` VARCHAR

The string to be transformed.

### Returns

Returns a varchar with the delimited identifier added.

### Usage example

Input:

```sql
SELECT QUOTENAME_UDF('test');
```

Output:

```sql
"test"
```

## VIEW_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a view with a specific name has been created before.

```sql
VIEW_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The view name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the view.

### Usage example

Input:

```sql
SELECT VIEW_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## SUBTRACT_TIMESTAMP_TZ_UDF (TIMESTAMP_TZ, TIMESTAMP_TZ)

### Definition

This user-defined function (UDF) converts both inputs to the system session timezone and subtracts the dates (`FIRST_DATE` - `SECOND_DATE`) taking 1900-01-01 00:00:00.000 as the zero value. If any value does not include the timezone, the current session timezone is used.

```sql
PUBLIC.SUBTRACT_TIMESTAMP_TZ_UDF(FIRST_DATE TIMESTAMP_TZ, SECOND_DATE TIMESTAMP_TZ)
```

### Parameters

`FIRST_DATE` TIMESTAMP_TZ

The first date to be subtracted from.

`SECOND_DATE` TIMESTAMP_TZ

The second date to be subtracted to.

### Returns

Returns the difference between the two input dates.

### Usage example

Input:

```sql
SELECT SUBTRACT_TIMESTAMP_TZ_UDF('1900-01-01 00:00:00.000 +0100', '1900-01-01 00:00:00.003 -0100')
```

Output:

```sql
1899-12-31 13:59:59.997 -0800
```

## STR_UDF (FLOAT, VARCHAR)

### Definition

This user-defined function (UDF) is a template for translating the functionality of SQL Server STR() to Snowflake when it’s used with one or two optional parameters

```sql
STR_UDF(FLOAT_EXPR FLOAT, FORMAT VARCHAR)
```

### Parameters

`FLOAT_EXPR` FLOAT

The expression to be processed.

`FORMAT` VARCHAR

The format to apply.

### Returns

Returns a varchar with the formatted expression.

### Usage example

Input:

```sql
SELECT STR_UDF(1.5, '99');
```

Output:

```sql
2
```

## XML_JSON_SIMPLE

### Definition

This user-defined function (UDF) generates an object with the information from executing a reading from an XML value.

```sql
XML_JSON_SIMPLE(XML VARIANT)
```

### Parameters

`XML` VARIANT

The XML to be read.

### Returns

Returns an object with the processed information from the XML.

### Usage example

Input:

```sql
SELECT XML_JSON_SIMPLE(TO_VARIANT(PARSE_XML('<iceCreamOrders>
    <order>
        <customer customerID="CUST001" contactName="Test ABC">
            <iceCreamOrder orderID="ORD001" employeeID="101" orderDate="2023-05-15T14:30:00">
                <iceCreamDetail productID="001" quantity="2"/>
                <iceCreamDetail productID="003" quantity="1"/>
            </iceCreamOrder>
        </customer>
    </order>
    <order>
        <customer customerID="CUST002" contactName="Test XYZ">
            <iceCreamOrder orderID="ORD002" employeeID="102" orderDate="2023-06-20T12:45:00">
                <iceCreamDetail productID="005" quantity="3"/>
                <iceCreamDetail productID="007" quantity="2"/>
            </iceCreamOrder>
        </customer>
    </order>
</iceCreamOrders>
')));
```

Output:

```sql
{
  "iceCreamOrders": {
    "$name": "iceCreamOrders",
    "order": [
      {
        "order": {
          "$name": "order",
          "customer": [
            {
              "customer": {
                "$name": "customer",
                "@contactName": "Test ABC",
                "@customerID": "CUST001",
                "iceCreamOrder": [
                  {
                    "iceCreamOrder": {
                      "$name": "iceCreamOrder",
                      "@employeeID": 101,
                      "@orderDate": "2023-05-15T14:30:00",
                      "@orderID": "ORD001",
                      "iceCreamDetail": [
                        {
                          "iceCreamDetail": {
                            "$name": "iceCreamDetail",
                            "@productID": "001",
                            "@quantity": 2
                          }
                        },
                        {
                          "iceCreamDetail": {
                            "$name": "iceCreamDetail",
                            "@productID": "003",
                            "@quantity": 1
                          }
                        }
                      ]
                    }
                  }
                ]
              }
            }
          ]
        }
      },
      {
        "order": {
          "$name": "order",
          "customer": [
            {
              "customer": {
                "$name": "customer",
                "@contactName": "Test XYZ",
                "@customerID": "CUST002",
                "iceCreamOrder": [
                  {
                    "iceCreamOrder": {
                      "$name": "iceCreamOrder",
                      "@employeeID": 102,
                      "@orderDate": "2023-06-20T12:45:00",
                      "@orderID": "ORD002",
                      "iceCreamDetail": [
                        {
                          "iceCreamDetail": {
                            "$name": "iceCreamDetail",
                            "@productID": "005",
                            "@quantity": 3
                          }
                        },
                        {
                          "iceCreamDetail": {
                            "$name": "iceCreamDetail",
                            "@productID": "007",
                            "@quantity": 2
                          }
                        }
                      ]
                    }
                  }
                ]
              }
            }
          ]
        }
      }
    ]
  }
}
```

## FORMATMESSAGE_UDF

### Definition

This user-defined function (UDF) provides the functionality of the SQL Server FORMATMESSAGE function. It constructs a message from an existing message from a provided string.

```sql
FORMATMESSAGE_UDF(MESSAGE STRING, ARGS ARRAY)
```

### Parameters

`MESSAGE` STRING

The existing message string.

`ARGS` ARRAY

The arguments to be added on the first message string.

### Returns

Returns a string with the corresponding concatenated message related to the argument’s positions.

### Usage example

Input:

```sql
SELECT FORMATMESSAGE_UDF('Test %s!', TO_ARRAY('a'));
```

Output:

```sql
Test a!
```

## IS_MEMBER_UDF

### Definition

This user-defined function (UDF) determines the windows group membership by examining an access token.

```sql
IS_MEMBER_UDF(ROLE STRING)
```

### Parameters

`ROLE` STRING

The role name to be checked.

### Returns

Returns a boolean expression on true when the current user is a member of the role; otherwise returns false.

### Usage example

Input:

```sql
SELECT IS_MEMBER_UDF('TEST');
```

Output:

```sql
FALSE
```

## RAISERROR_UDF (DOUBLE, DOUBLE, DOUBLE, ARRAY)

### Definition

This user-defined function (UDF) throws an exception with a specific message.

```sql
RAISERROR_UDF(MSG_ID DOUBLE, SEVERITY DOUBLE, STATE DOUBLE, PARAMS ARRAY)
```

### Parameters

`MSG_ID` DOUBLE

The message ID of the error message.

`SEVERITY` DOUBLE

The severity number for the error.

`STATE` DOUBLE

The state number for the error message.

`PARAMS` ARRAY

The additional information of the error message.

### Returns

Returns a varchar with an error message.

### Usage example

Input:

```sql
SELECT RAISERROR_UDF(2.1, 1.6, 1.0, array_construct('More information'));
```

Output:

```sql
MESSAGE: 2.1, LEVEL: 1.6, STATE: 1
```

## STR_UDF(FLOAT)

### Definition

This user-defined function (UDF) is a template for translating the functionality of SQL Server STR() to Snowflake when it’s used with one or two optional parameters

```sql
STR_UDF(FLOAT_EXPR FLOAT, FORMAT VARCHAR)
```

### Parameters

`FLOAT_EXPR` FLOAT

The expression to be processed.

### Returns

Returns a varchar with the formatted expression.

### Usage example

Input:

```sql
SELECT STR_UDF(1.5);
```

Output:

```sql
2
```

## SWITCHOFFSET_UDF (TIMESTAMP_TZ, VARCHAR)

### Definition

This user-defined function (UDF) returns a new timestamp_tz with the adjusted time taken for parameter target_tz.

```sql
SWITCHOFFSET_UDF(source_timestamp TIMESTAMP_TZ, target_tz varchar)
```

### Parameters

`source_timestamp` TIMESTAMP_TZ

The source timestamp to adjust.

`target_tz` varchar

The target time to take.

### Returns

Returns the formatted target time as TIMESTAMP_TZ.

### Usage example

Input:

```sql
SELECT SWITCHOFFSET_UDF(time_in_paris, '-0600') as time_in_costa_rica;
```

Output:

| time_in_paris | time_in_costa_rica |
| --- | --- |
| 2022-10-05 22:00:24.467 +02:00 | 2022-10-05 14:00:24.467 -06:00 |

## GET_CURRENT_TIMEZONE_UDF

### Definition

This user-defined function (UDF) gets the current session or system timezone as a literal.

```sql
GET_CURRENT_TIMEZONE_UDF()
```

### Parameters

No parameters.

### Returns

Returns a literal value with the current session or system timezone as a literal.

### Usage example

Input:

```sql
SELECT PUBLIC.GET_CURRENT_TIMEZONE_UDF();
```

Output:

```sql
'Europe/London'
```

## UPDATE_ERROR_VARS_UDF (STRING, STRING, STRING, STRING, STRING, STRING)

### Definition

This user-defined function (UDF) updates the error variables in an environment in order to know when the procedure throws an error.

```sql
UPDATE_ERROR_VARS_UDF(LINE STRING,CODE STRING, STATE STRING, MESSAGE STRING, PROC_NAME STRING, SEVERITY STRING)
```

### Parameters

`LINE` STRING

The line related to the error.

`CODE` STRING

The error code associated with the error message.

`STATE` STRING

The state of the error message.

`MESSAGE` STRING

The message to be shown in the error.

`PROC_NAME` STRING

The procedure name.

`SEVERITY` STRING

The severity of the error.

### Returns

Returns a `string` value with the new error message information.

### Usage example

Input:

```sql
  SELECT UPDATE_ERROR_VARS_UDF('1', '1', '1', 'ABC', 'TEST', '1');
```

Output:

```sql
111ABCTEST1
```

## SEQUENCE_OBJECT_ID_UDF (VARCHAR)

### Definition

This user-defined function (UDF) checks if a sequence with a specific name has been created before.

```sql
SEQUENCE_OBJECT_ID_UDF(NAME VARCHAR)
```

### Parameters

`NAME` VARCHAR

The sequence name to be evaluated.

### Returns

Returns a boolean expression depending on the existence of the sequence.

### Usage example

Input:

```sql
SELECT SEQUENCE_OBJECT_ID_UDF('Test');
```

Output:

```sql
FALSE
```

## CAST_TIMESTAMP_TZ_TO_NUMERIC_UDF (TIMESTAMP_TZ)

### Definition

This user-defined function (UDF) is used to cast `timestamp_tz` to numeric. It converts the current timezone to UTC because the numeric value cannot save the `timestamp` information.

```sql
CAST_TIMESTAMP_TZ_TO_NUMERIC_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The `timestamp` input that is going to be cast.

### Returns

Returns a numeric with a decimal point. The integer part represents the number of days from 1900-01-01 and the decimal part is the percentage of milliseconds in 24 hours.

### Usage example

Input:

```sql
SELECT PUBLIC.CAST_TIMESTAMP_TZ_TO_NUMERIC_UDF('1900-01-01 01:00:00.000 +0100')
```

Output:

```sql
0
```

## RAISERROR_UDF (VARCHAR, DOUBLE, DOUBLE, ARRAY)

### Definition

This user-defined function (UDF) throws an exception with a specific message.

```sql
RAISERROR_UDF(MSG_TEXT VARCHAR, SEVERITY DOUBLE, STATE DOUBLE, PARAMS ARRAY)
```

### Parameters

`MSG_TEXT` VARCHAR

The message text of the error message.

`SEVERITY` DOUBLE

The severity number for the error.

`STATE` DOUBLE

The state number for the error message.

`PARAMS` ARRAY

The additional information of the error message.

### Returns

Returns a varchar with an error message.

### Usage example

Input:

```sql
SELECT RAISERROR_UDF('<\<%*.*s>> TEST', 1.0, 1, array_construct());
```

Output:

```sql
MESSAGE: <<undefined>> TEST, LEVEL: 1, STATE: 1
```

## PARSENAME_UDF

### Definition

This user-defined function (UDF) gets the PART_NUMBER index of a `string` separated by `'.'`.

```sql
PARSENAME_UDF(STR VARCHAR, PART_NUMBER INT)
```

### Parameters

`STR` VARCHAR

The object name as a `string`.

`PART_NUMBER` INT

The part of the object name to be checked.

### Returns

Returns the specified part of an object name.

### Usage example

Input:

```sql
SELECT PARSENAME_UDF('Test_A.Test_B.Test_C]', 2);
```

Output:

```sql
Test_B
```

## ERROR_STATE_UDF

### Definition

This user-defined function (UDF) gets the error state regardless of how many times it is run, or where it is run within the scope of the `CATCH` block.

```sql
ERROR_STATE_UDF()
```

### Parameters

No input parameters.

### Returns

Returns the `string` with the error state regardless of how many times it is run, or where it is run within the scope of the `CATCH` block.

### Usage example

Input:

```sql
SELECT ERROR_STATE_UDF();
```

Output:

```sql
null -- No information set.
```

## CAST_TIME_TO_TIMESTAMP_TZ_UDF (TIME)

### Definition

This user-defined function (UDF) casts `time` to `timestamp_tz`.

```sql
CAST_TIME_TO_TIMESTAMP_TZ_UDF(INPUT TIME)
```

### Parameters

`INPUT` TIME

The input time to be cast to `timestamp_tz`.

### Returns

Returns a `timestamp_tz` with the date as 1900-01-01 and the same time as the input.

### Usage example

Input:

```sql
SELECT PUBLIC.CAST_TIME_TO_TIMESTAMP_TZ_UDF('00:00:00.995')
```

Output:

```sql
1900-01-01 00:00:00.997
```

## SUM_TIMESTAMP_TZ_UDF (TIMESTAMP_TZ, TIMESTAMP_TZ)

### Definition

This user-defined function (UDF) converts both inputs to the system or session timezone and sums the dates taking 1900-01-01 00:00:00.000 as the zero value. If any value does not include the timezone, the current session timezone is used.

```sql
SUM_TIMESTAMP_TZ_UDF(FIRST_DATE TIMESTAMP_TZ, SECOND_DATE TIMESTAMP_TZ)
```

### Parameters

`FIRST_DATE` TIMESTAMP_TZ

The first date to sum to.

`SECOND_DATE` TIMESTAMP_TZ

The second date to sum to.

### Returns

Returns the sum between the two input dates.

### Usage example

Input:

```sql
SELECT SUM_TIMESTAMP_TZ_UDF('1900-01-01 00:00:00.000 +0100', '1900-01-01 00:00:00.003 -0100')
```

Output:

```sql
1900-01-01 00:00:00.003 +0000
```

## GET_WEEK_START_UDF

### Definition

This user-defined function (UDF) retrieves the WEEK_START configuration, which is equivalent to the @@FIRSTDATE function. To maintain consistency across platforms, ensure the [WEEK_START](https://docs.snowflake.com/en/sql-reference/parameters#week-start) parameter matches the DATEFIRST setting in Transact-SQL.

```sql
GET_WEEK_START_UDF()
```

### Returns

Returns a number representing the first day of the week.

### Usage example

Snowflake’s default value for WEEK_START is `0`. However, this function returns `7` to align with the default DATEFIRST value in Transact-SQL, ensuring consistent behavior.

Input:

```sql
SELECT GET_WEEK_START_UDF();
```

Output:

```sql
7
```

## DATE_PART_WEEK_DAY_UDF

### Definition

This user-defined function (UDF) gets the day of the week as a number (1-7) To ensure the consistency across platforms, please set the [WEEK_START](https://docs.snowflake.com/en/sql-reference/parameters#week-start) parameter to the same value as the DATEFIRST setting in Transact-SQL.

```sql
DATE_PART_WEEK_DAY_UDF(INPUT DATE)
```

### Parameters

`INPUT` DATE

Date to get the day.

### Returns

Returns a number representing the day of the week where Monday=1, Tuesday=2, …, Sunday=7.

### Usage example

The WEEK_START parameter is 0, which causes the DATE_PART_WEEK_DAY_UDF to return a value of 1.

Input:

```sql
SELECT PUBLIC.DATE_PART_WEEK_DAY_UDF('2025-08-17') AS "Sunday";
```

Output:

```sql
1
```

## SCOPE_IDENTITY()

### Definition

The `SCOPE_IDENTITY()` function in SQL Server returns the last identity value inserted into an identity column in the same scope. SnowConvert AI transforms this function into a time-travel query using `AT(STATEMENT =>)` to retrieve the identity value from the most recent INSERT statement.

### Transformation Pattern

**SQL Server:**

```sql
INSERT INTO TableName (Column1) VALUES (Value1);
SET @VariableName = SCOPE_IDENTITY();
```

**Snowflake:**

```sql
INSERT INTO TableName (Column1) VALUES (Value1);
LET _scope_identity_query_id VARCHAR := LAST_QUERY_ID();
VariableName := (SELECT MAX(IdentityColumn) FROM TableName AT(STATEMENT => _scope_identity_query_id));
```

### Requirements

* Only works within **procedural contexts** (stored procedures, functions) that are transformed to SnowScript
* Requires an **identity column** defined on the target table using `IDENTITY(seed, increment)`
* The preceding INSERT statement must target a table with a resolvable identity column in the symbol table

### Usage Example

#### Input (SQL Server):

```sql
CREATE TABLE Orders (OrderID INT IDENTITY(1,1), CustomerID INT);
GO

CREATE PROCEDURE InsertOrder @CustomerID INT
AS
BEGIN
    DECLARE @OrderID INT;
    INSERT INTO Orders (CustomerID) VALUES (@CustomerID);
    SET @OrderID = SCOPE_IDENTITY();
    SELECT @OrderID;
END;
```

#### Output (Snowflake):

```sql
CREATE OR REPLACE TABLE Orders (
  OrderID INT IDENTITY(1, 1) ORDER,
  CustomerID INT
)
;

CREATE OR REPLACE PROCEDURE InsertOrder (CUSTOMERID INT)
RETURNS TABLE()
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ORDERID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    INSERT INTO Orders (CustomerID) VALUES (:CUSTOMERID);
    LET _scope_identity_query_id VARCHAR := LAST_QUERY_ID();
    ORDERID :=
      SELECT
        MAX(OrderID)
      FROM
        Orders AT (STATEMENT => _scope_identity_query_id);
    ProcedureResultSet := (SELECT
      :ORDERID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

### Known Limitations

#### Nested Scope Edge Case

When `SCOPE_IDENTITY()` is used inside a nested `BEGIN...END` block while the INSERT statement is in the outer procedure body, the transformation may not detect the INSERT correctly:

```sql
CREATE PROCEDURE Example
AS
BEGIN
    INSERT INTO Orders (CustomerID) VALUES (@CustomerID);  -- outer scope
    BEGIN
        DECLARE @OrderID INT;
        SET @OrderID = SCOPE_IDENTITY();  -- inner scope
    END;
END;
```

In this case, SnowConvert AI may generate [SSC-EWI-TS0095](../../issues-and-troubleshooting/conversion-issues/sqlServerEWI.md) indicating that no preceding INSERT was found, even though one exists at a different nesting level. This is a known limitation tracked for future enhancement.

**Workaround:** Refactor the code to keep `SCOPE_IDENTITY()` in the same block as the INSERT statement.

#### Batch Context

`SCOPE_IDENTITY()` is **not transformed** in batch contexts (scripts outside of procedures/functions). In such cases, the original function call is preserved with [SSC-EWI-0073](../../issues-and-troubleshooting/conversion-issues/generalEWI.md).

### Related Issues

When `SCOPE_IDENTITY()` cannot be transformed, SnowConvert AI generates one of these EWI codes:

* **[SSC-EWI-TS0095](../../issues-and-troubleshooting/conversion-issues/sqlServerEWI.md)** - No preceding INSERT statement found
* **[SSC-EWI-TS0096](../../issues-and-troubleshooting/conversion-issues/sqlServerEWI.md)** - Target table cannot be resolved
* **[SSC-EWI-TS0097](../../issues-and-troubleshooting/conversion-issues/sqlServerEWI.md)** - Table has no identity column

### Additional Notes

* The `AT(STATEMENT =>)` time-travel clause may return incorrect results under high-concurrency scenarios where multiple sessions insert into the same table simultaneously
* For more information about time-travel queries, see the [Snowflake documentation](https://docs.snowflake.com/en/user-guide/querying-time-travel)

---
title: SnowConvert AI - Function References for Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/function-references/teradata/README.md
section: Migrations
---

# SnowConvert AI - Function References for Teradata

## QUARTERNUMBER_OF_YEAR_UDF

### Definition

UDF (User-Defined Function) that calculates the quarter number of a given date according to the ISO calendar year, similar to Teradata’s QUARTERNUMBER_OF_YEAR_UDF(date, ‘ISO’) function.

```sql
PUBLIC.QUARTERNUMBER_OF_YEAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TimeSTAMP_TZ

The method to extract the quarter number.

### Returns

An integer (1-4) indicating which quarter of the year the date falls into.

### Usage example

Input:

```sql
SELECT PUBLIC.QUARTERNUMBER_OF_YEAR_UDF(DATE '2022-01-01'),
PUBLIC.QUARTERNUMBER_OF_YEAR_UDF(DATE '2025-12-31');
```

Output:

```sql
4, 1
```

## DAYNUMBER_OF_YEAR_UDF

### Definition

Returns the day number within the year for a given timestamp. The day number ranges from 1 to 365 (or 366 in leap years). This function behaves the same way as DAYNUMBER_OF_YEAR(DATE, ‘ISO’).

```sql
PUBLIC.DAYNUMBER_OF_YEAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

To get the day number of the year from a date.

### Returns

A whole number from 1 to 371.

### Example

Input:

```sql
SELECT DAYNUMBER_OF_YEAR(CURRENT_DATE,'ISO');
```

Output:

```sql
SELECT
PUBLIC.DAYNUMBER_OF_YEAR_UDF(CURRENT_DATE());
```

## SUBSTR_UDF (STRING, FLOAT)

> **Warning:**
>
> This user-defined function (UDF) accepts two parameters (overloaded function).

### Definition

Retrieves a portion of text from a specified string by using a starting position and length.

```sql
PUBLIC.SUBSTR_UDF(BASE_EXPRESSION STRING, START_POSITION FLOAT)
```

### Parameters

`BASE_EXPRESSION` is a string parameter that defines the base expression for the operation.

The source text from which you want to extract a portion.

`START_POSITION` - A floating-point number that specifies the starting position in the input string.

The position where you want to begin extracting characters from the string.

### Returns

The substring that must be included.

### Migration example

Input:

```sql
SELECT SUBSTRING('Hello World!' FROM -2);
```

Output:

```sql
SELECT
PUBLIC.SUBSTR_UDF('Hello World!', -2);
```

## CHKNUM_UDF

### Definition

Verify whether a string contains a valid numeric value.

```sql
PUBLIC.CHKNUM_UDF(NUM STRING);
```

### Parameters

`NUM` A string representing a number

The text string that needs to be validated.

### Returns

Returns 1 if the input parameter is a valid numeric value. If the input is not a valid number (for example, text or special characters), returns 0.

### Example

```sql
SELECT CHKNUM('1032');
```

Output:

```sql
SELECT
PUBLIC.CHKNUM_UDF('1032');
```

## TD_YEAR_END_UDF

### Definition

UDF (User-Defined Function) that replicates Teradata’s TD_YEAR_END(DATE) or TD_YEAR_END(DATE, ‘COMPATIBLE’) function, which returns the last day of the year for a given date.

```sql
PUBLIC.TD_YEAR_END_UDF(INPUT date)
```

### Parameters

`INPUT` DATE

Get the last day of the current year.

### Returns

The final day of December (December 31st).

### Usage example

Input:

```sql
SELECT  PUBLIC.TD_YEAR_END_UDF(DATE '2022-01-01'),
PUBLIC.TD_YEAR_END_UDF(DATE '2022-04-12');
```

Output:

```sql
2022-12-31, 2022-12-31
```

## PERIOD_OVERLAPS_UDF

### Definition

A user-defined function (UDF) that implements the OVERLAPS OPERATOR functionality. This function compares two or more time periods and determines whether they have any overlapping time ranges.

```sql
PERIOD_OVERLAPS_UDF(PERIODS ARRAY)
```

### Parameters

`PERIODS` is an array that contains time periods

All period expressions that will be compared.

### Returns

TRUE if all time periods in the set have at least one point in common (overlap), FALSE otherwise.

### Migration example

```sql
SELECT
	PERIOD(DATE '2009-01-01', DATE '2010-09-24')
	OVERLAPS
	PERIOD(DATE '2009-02-01', DATE '2009-06-24');
```

Output:

```sql
SELECT
	PUBLIC.PERIOD_OVERLAPS_UDF(ARRAY_CONSTRUCT(PUBLIC.PERIOD_UDF(DATE '2009-01-01', DATE '2010-09-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!, PUBLIC.PERIOD_UDF(DATE '2009-02-01', DATE '2009-06-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!)) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!;
```

## WEEK_NUMBER_OF_QUARTER_COMPATIBLE_UDF

### Definition

Calculates which week number within the current quarter a specified date falls into.

```sql
PUBLIC.WEEK_NUMBER_OF_QUARTER_COMPATIBLE_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date used to calculate which week of the quarter it falls into.

### Returns

An integer indicating which week of the quarter the date falls in (1-13).

### Usage example

Input:

```sql
SELECT WEEK_NUMBER_OF_QUARTER_COMPATIBLE_UDF(DATE '2022-05-01', 'COMPATIBLE'),
WEEK_NUMBER_OF_QUARTER_COMPATIBLE_UDF(DATE '2022-07-06', 'COMPATIBLE')
```

Output:

```sql
5, 1
```

## ROMAN_NUMERALS_MONTH_UDF

### Definition

Converts a date into its corresponding month in Roman numerals.

```sql
PUBLIC.ROMAN_NUMERALS_MONTH_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input date from which to extract the month.

### Returns

A `varchar` representing the month extracted from a given date.

### Usage example

Input:

```sql
SELECT PUBLIC.ROMAN_NUMERALS_MONTH_UDF(DATE '2021-10-26');
```

Output:

```sql
'X'
```

## TD_YEAR_BEGIN_UDF

### Definition

A user-defined function (UDF) that mimics the behavior of TD_YEAR_BEGIN or TD_YEAR_BEGIN(DATE, ‘COMPATIBLE’) by returning the first day of the year for a given date.

```sql
PUBLIC.TD_YEAR_BEGIN_UDF(INPUT DATE)
```

### Parameters

`INPUT` DATE

Get the first day of the current year.

### Returns

The first day of January.

### Usage example

Input:

```sql
SELECT TD_YEAR_BEGIN(DATE '2022-01-01', 'COMPATIBLE'),
TD_YEAR_BEGIN(DATE '2022-04-12');
```

Output:

```sql
2022-01-01, 2022-01-01
```

## FULL_MONTH_NAME_UDF

### Definition

Returns the full name of a month in your choice of formatting: all uppercase letters, all lowercase letters, or with only the first letter capitalized.

```sql
PUBLIC.FULL_MONTH_NAME_UDF(INPUT TIMESTAMP_TZ, RESULTCASE VARCHAR)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date format should display the month name.

`RESULTCASE` VARCHAR

The format in which the result should be displayed. Valid options are ‘uppercase’, ‘lowercase’, or ‘capitalized’.

### Returns

Returns a `varchar` containing the full name of a month

### Usage example

Input:

```sql
SELECT PUBLIC.FULL_MONTH_NAME_UDF(DATE '2021-10-26', 'uppercase');
SELECT PUBLIC.FULL_MONTH_NAME_UDF(DATE '2021-10-26', 'lowercase');
SELECT PUBLIC.FULL_MONTH_NAME_UDF(DATE '2021-10-26', 'firstOnly');
```

Output:

```sql
OCTOBER
october
October
```

## TO_BYTES_HEX_UDF

### Definition

Converts a decimal (base 10) number into its hexadecimal (base 16) representation.

```sql
TO_BYTES_HEX_UDF(INPUT FLOAT)
```

### Parameters

`INPUT` is a floating-point number parameter.

The number that will be converted into hexadecimal format.

### Returns

A string representing the hexadecimal value.

### Usage example

Input:

```sql
SELECT TO_BYTES_HEX_UDF('448');
```

Output:

```sql
01c0
```

## PERIOD_INTERSECT_UDF

### Definition

A user-defined function (UDF) that replicates the P_INTERSECT operator. This function compares two or more time periods and identifies where they overlap, returning the common time interval between them.

For more details about the source function, please refer to the [documentation](https://docs.teradata.com/r/SQL-Date-and-Time-Functions-and-Expressions/July-2021/Period-Functions-and-Operators/P_INTERSECT/P_INTERSECT-Syntax).

```sql
PERIOD_INTERSECT_UDF(PERIODS ARRAY)
```

### Parameters

`PERIODS` is an array that contains time periods.

All period expressions that need to be compared.

### Returns

The section where two time periods intersect or share common dates.

### Migration example

Input:

```sql
SELECT
	PERIOD(DATE '2009-01-01', DATE '2010-09-24')
	P_INTERSECT
	PERIOD(DATE '2009-02-01', DATE '2009-06-24');
```

Output:

```sql
SELECT
	PUBLIC.PERIOD_INTERSECT_UDF(ARRAY_CONSTRUCT(PUBLIC.PERIOD_UDF(DATE '2009-01-01', DATE '2010-09-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!, PUBLIC.PERIOD_UDF(DATE '2009-02-01', DATE '2009-06-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!)) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!;
```

## INTERVAL_TO_SECONDS_UDF

### Definition

Converts a time interval into seconds.

```sql
PUBLIC.INTERVAL_TO_SECONDS_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR())
```

### Parameters

`INPUT_PART` is a variable of type VARCHAR that stores input data.

The time duration that will be converted into seconds.

`INPUT_VALUE` VARCHAR - The input parameter that accepts text data. - The input parameter that accepts text data.

The time interval type for conversion. Examples include ‘DAY’, ‘DAY TO HOUR’, and other valid interval types.

### Returns

A decimal number representing the time interval in seconds.

## TIMESTAMP_ADD_UDF

### Definition

Combines two timestamps into a single value.

```sql
PUBLIC.TIMESTAMP_ADD_UDF(FIRST_DATE TIMESTAMP_LTZ, SECOND_DATE TIMESTAMP_LTZ)
```

### Parameters

`FIRST_DATE` is a timestamp field that includes both date and time information, with timezone support (TIMESTAMP_LTZ)

The initial date when this was added.

`SECOND_DATE` is a timestamp column that includes timezone information (TIMESTAMP_LTZ) (Timestamp with local time zone)

The date when the item was added for the second time.

### Returns

A timestamp generated by combining the input date parameters.

## INTERVAL_MULTIPLY_UDF

### Definition

A user-defined function (UDF) that performs multiplication operations on time intervals.

```sql
PUBLIC.INTERVAL_MULTIPLY_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR(), INPUT_MULT INTEGER)
```

### Parameters

`INPUT_PART` is a variable of type VARCHAR that stores input data.

The value used for multiplication, specified as ‘YEAR TO MONTH’.

`INPUT_VALUE` VARCHAR

The interval to multiply by.

`INPUT_MULT` is an integer parameter that serves as a multiplier for input values.

The number that will be used in the multiplication operation.

### Returns

The output is calculated by multiplying a time interval by a numeric value.

### Migration example

Input:

```sql
SELECT INTERVAL '6-10' YEAR TO MONTH * 8;
```

Output:

```sql
SELECT
PUBLIC.INTERVAL_MULTIPLY_UDF('YEAR TO MONTH', '6-10', 8);
```

## TD_DAY_OF_WEEK_UDF

### Definition

User-defined function (UDF) that replicates Teradata’s `TD_DAY_OF_WEEK` functionality. For details about the original Teradata function, see [here](https://docs.teradata.com/r/SQL-Date-and-Time-Functions-and-Expressions/July-2021/Calendar-Functions/td_day_of_week/DayOfWeek).

```sql
PUBLIC.TD_DAY_OF_WEEK_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

Date from which to get the day of the week.

### Returns

An integer from 1 to 7 representing the day of the week, where:

* 1 = Sunday
* 2 = Monday
* 3 = Tuesday
* 4 = Wednesday
* 5 = Thursday
* 6 = Friday
* 7 = Saturday

### Migration example

Input:

```sql
SELECT td_day_of_week(DATE '2022-03-02');
```

Output:

```sql
SELECT
PUBLIC.TD_DAY_OF_WEEK_UDF(DATE '2022-03-02');
```

## ISO_YEAR_PART_UDF

### Definition

Calculates the ISO calendar year from a given date. The result can be shortened by specifying the number of digits to keep.

```sql
PUBLIC.ISO_YEAR_PART_UDF(INPUT TIMESTAMP_TZ, DIGITS INTEGER)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date from which to extract the ISO year.

`DIGITS` A whole number that represents the maximum number of digits to display

The number of decimal places desired in the output.

### Returns

Returns a string (varchar) representing the ISO year of a given date.

### Usage example

Input:

```sql
SELECT PUBLIC.ISO_YEAR_PART_UDF(DATE '2021-10-26', 3);
SELECT PUBLIC.ISO_YEAR_PART_UDF(DATE '2021-10-26', 2);
SELECT PUBLIC.ISO_YEAR_PART_UDF(DATE '2021-10-26', 1);
```

Output:

```sql
'021'
'21'
'1'
```

## DIFF_TIME_PERIOD_UDF

### Definition

Computes the time interval between two dates based on the specified time unit parameter.

```sql
PUBLIC.DIFF_TIME_PERIOD_UDF(TIME STRING, PERIOD VARCHAR(50))
```

### Parameters

`TIME` is a data type used to store time values in hours, minutes, seconds, and fractions of seconds. is a data type that represents a time value stored as a text string.

The timestamp that will be used as an anchor point.

`PERIOD` A text field (VARCHAR) that represents a time period

The period column used for expansion.

### Returns

A numerical value indicating the time interval between two dates.

### Usage example

Input:

```sql
SELECT DIFF_TIME_PERIOD_UDF('SECONDS','2022-11-26 10:15:20.000*2022-11-26 10:15:25.000');
```

Output:

```sql
5
```

## WEEK_NUMBER_OF_QUARTER_ISO_UDF

### Definition

Calculates which week number a date falls into within its quarter, using ISO calendar standards. This function behaves identically to Teradata’s `WEEKNUMBER_OF_QUARTER(DATE, 'ISO')` function.

```sql
PUBLIC.WEEK_NUMBER_OF_QUARTER_ISO_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date used to calculate which week of the quarter it falls into.

### Returns

An integer indicating which week of the quarter (1-13) this represents.

### Usage example

Input:

```sql
SELECT WEEKNUMBER_OF_QUARTER(DATE '2022-05-01', 'ISO'),
WEEKNUMBER_OF_QUARTER(DATE '2022-07-06', 'ISO')
```

Output:

```sql
SELECT
PUBLIC.SUBSTR_UDF('Hello World!', -2);
```

## NVP_UDF

### Definition

Performs the same function as Teradata’s [NVP function](https://docs.teradata.com/r/SQL-Functions-Expressions-and-Predicates/June-2020/String-Operators-and-Functions/NVP/NVP-Function-Syntax).

```sql
NVP_UDF(INSTRING VARCHAR, NAME_TO_SEARCH VARCHAR, NAME_DELIMITERS VARCHAR, VALUE_DELIMITERS VARCHAR, OCCURRENCE FLOAT)
```

### Parameters

`INSTRING` VARCHAR

Name-value pairs are data elements that consist of a name and its corresponding value.

`NAME_TO_SEARCH` of type VARCHAR

The name parameter used to search within the Name-Value Pair (NVP) function.

`NAME_DELIMITERS` VARCHAR

The character used to separate names from their corresponding values.

`VALUE_DELIMITERS` VARCHAR

The character used to connect a name with its corresponding value.

`OCCURRENCE` represents a floating-point number that indicates how many times something occurs

The number of matching patterns to search for.

### Returns

A text string (VARCHAR) containing identical data as the input string.

### Usage example

Input:

```sql
SELECT PUBLIC.NVP_UDF('entree=-orange chicken&entree+.honey salmon', 'entree', '&', '=- +.', 1);
```

Output:

```sql
orange chicken
```

## MONTH_SHORT_UDF

### Definition

Returns the abbreviated name of a month (three letters) in your choice of uppercase, lowercase, or capitalized format. For example: “Jan”, “jan”, or “JAN”.

```sql
PUBLIC.MONTH_SHORT_UDF(INPUT TIMESTAMP_TZ, RESULTCASE VARCHAR)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date formatted to display the abbreviated month name.

`RESULTCASE` VARCHAR

The letter case format to be used. Valid options are:

* ‘uppercase’: converts text to all capital letters
* ‘lowercase’: converts text to all small letters
* ‘firstOnly’: capitalizes only the first letter

### Returns

A `varchar` containing the abbreviated name of a month (e.g., “Jan”, “Feb”, etc.).

### Usage example

Input:

```sql
SELECT PUBLIC.MONTH_SHORT_UDF(DATE '2021-10-26', 'uppercase');
SELECT PUBLIC.MONTH_SHORT_UDF(DATE '2021-10-26', 'lowercase');
SELECT PUBLIC.MONTH_SHORT_UDF(DATE '2021-10-26', 'firstOnly');
```

Output:

```sql
OCT
oct
Oct
```

## DATE_TO_INT_UDF

### Definition

UDF (User-Defined Function) that converts a date value to its numeric representation, similar to Teradata’s DATE-TO-NUMERIC function.

```sql
PUBLIC.DATE_TO_INT_UDF(DATE_TO_CONVERT DATE)
```

### Parameters

`DATE_TO_CONVERT` represents a date value that needs to be converted

Convert the date value to an integer format.

### Returns

Returns a date value in numeric format.

### Example

Input:

```sql
SELECT mod(date '2015-11-26', 5890), sin(current_date);

CREATE TABLE SAMPLE_TABLE
(
    VARCHAR_TYPE VARCHAR,
    CHAR_TYPE CHAR(11),
    INTEGER_TYPE INTEGER,
    DATE_TYPE DATE,
    TIMESTAMP_TYPE TIMESTAMP,
    TIME_TYPE TIME,
    PERIOD_TYPE PERIOD(DATE)
);

REPLACE VIEW SAMPLE_VIEW
AS
SELECT
CAST(DATE_TYPE AS SMALLINT),
CAST(DATE_TYPE AS DECIMAL),
CAST(DATE_TYPE AS NUMBER),
CAST(DATE_TYPE AS FLOAT),
CAST(DATE_TYPE AS INTEGER)
FROM SAMPLE_TABLE;
```

Output:

```sql
SELECT
mod(PUBLIC.DATE_TO_INT_UDF(date '2015-11-26'), 5890),
sin(PUBLIC.DATE_TO_INT_UDF(CURRENT_DATE()));

CREATE TABLE PUBLIC.SAMPLE_TABLE
(
    VARCHAR_TYPE VARCHAR,
    CHAR_TYPE CHAR(11),
    INTEGER_TYPE INTEGER,
    DATE_TYPE DATE,
    TIMESTAMP_TYPE TIMESTAMP,
    TIME_TYPE TIME,
    PERIOD_TYPE VARCHAR(24) COMMENT 'PERIOD(DATE)' /*** MSC-WARNING - MSCEWI1036 - PERIOD DATA TYPE "PERIOD(DATE)" CONVERTED TO VARCHAR ***/
);

CREATE OR REPLACE VIEW PUBLIC.SAMPLE_VIEW
AS
SELECT
PUBLIC.DATE_TO_INT_UDF(DATE_TYPE),
PUBLIC.DATE_TO_INT_UDF(DATE_TYPE),
PUBLIC.DATE_TO_INT_UDF(DATE_TYPE),
PUBLIC.DATE_TO_INT_UDF(DATE_TYPE),
PUBLIC.DATE_TO_INT_UDF(DATE_TYPE)
FROM PUBLIC.SAMPLE_TABLE;
```

## PERIOD_UDF

### Definition

A user-defined function (UDF) that replicates the P_INTERSECT operator. This function compares two or more time periods and identifies where they overlap, returning the common time interval between them.

Creates a string representation of a period’s start and end values (for `TIMESTAMP` represents a data type that stores both date and time information., `TIME`, or `DATE` is a data type used to store calendar dates (year, month, and day) without time information. data types). This function emulates Teradata’s period value constructor function. The output string follows Snowflake’s default format for `PERIOD` values. To adjust the precision of the output, you can either:

* Modify the session parameter `timestamp_output_format`
* Use the three-parameter version of this UDF

More details about the source function can be found in the [Teradata documentation](https://docs.teradata.com/r/SQL-External-Routine-Programming/July-2021/SQL-Data-Type-Mapping/C-Data-Types/PERIOD-DATE/PERIOD-TIME/PERIOD-TIMESTAMP).

```sql
PERIOD_UDF(D1 TIMESTAMP_NTZ, D2 TIMESTAMP_NTZ)
PERIOD_UDF(D1 DATE, D2 DATE)
PERIOD_UDF(D1 TIME, D2 TIME)
PERIOD_UDF(D1 TIMESTAMP_NTZ, D2 TIMESTAMP_NTZ, PRECISIONDIGITS INT)
PERIOD_UDF(D1 TIME, D2 TIME, PRECISIONDIGITS INT)
PERIOD_UDF(D1 TIMESTAMP_NTZ)
PERIOD_UDF(D1 DATE)
PERIOD_UDF(D1 TIME)
```

### Parameters

`TIMESTAMP`

The TimeStamp data type represents a specific point in time, including both the date and time components.

`TIME`

The Time data type represents a specific time of day without a date component.

`DATE`

The Date data type represents a calendar date without a time component.

`PRECISIONDIGITS` specifies the number of decimal places to display in numeric values.

The number of digits to display in the time format.

### Returns

Returns a string representation of a `PERIOD` type value

### Usage example

Input:

```sql
SELECT
PERIOD_UDF('2005-02-03'),
PERIOD_UDF(date '2005-02-03'),
PERIOD_UDF(TIMESTAMP '2005-02-03 12:12:12.340000'),
PERIOD_UDF(TIMESTAMP '2005-02-03 12:12:12.340000');
```

Output:

```sql
2005-02-03*2005-02-04,
2005-02-03*2005-02-04,
2005-02-03 12:12:12.340000*2005-02-03 12:12:12.340001,
2005-02-03 12:12:12.340000*2005-02-03 12:12:12.340001
```

## DAYNAME_LONG_UDF (TIMESTAMP_TZ, VARCHAR)

> **Warning:**
>
> This is the user-defined function (UDF) that accepts **two** **different parameter types.**

### Definition

Returns the full name of a weekday in your choice of uppercase, lowercase, or capitalized format (e.g., “MONDAY”, “monday”, or “Monday”).

```sql
PUBLIC.DAYNAME_LONG_UDF(INPUT TIMESTAMP_TZ, RESULTCASE VARCHAR)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input date from which to determine the day of the week.

`RESULTCASE` VARCHAR

The expected outcome or scenario that will be demonstrated.

### Returns

Returns a string containing the full name of a day of the week.

### Usage example

Input:

```sql
SELECT PUBLIC.DAYNAME_LONG_UDF(DATE '2021-10-26', 'uppercase');
SELECT PUBLIC.DAYNAME_LONG_UDF(DATE '2021-10-26', 'lowercase');
SELECT PUBLIC.DAYNAME_LONG_UDF(DATE '2021-10-26', 'firstOnly');
```

Output:

```sql
'TUESDAY'
'tuesday'
'Tuesday'
```

## TD_DAY_OF_WEEK_COMPATIBLE_UDF

### Definition

Process a timestamp to determine which day of the week it falls on. This function behaves identically to `DAYNUMBER_OF_WEEK(DATE, 'COMPATIBLE')`.

```sql
PUBLIC.TD_DAY_OF_WEEK_COMPATIBLE_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input date used to determine the day of the week.

### Returns

Returns a number from 1 to 7 representing the day of the week, where 1 represents the first day of the week. For example, if January 1st falls on a Wednesday, then Wednesday = 1, Thursday = 2, Friday = 3, Saturday = 4, Sunday = 5, Monday = 6, and Tuesday = 7.

### Usage example

Input:

```sql
SELECT PUBLIC.TD_DAY_OF_WEEK_COMPATIBLE_UDF(DATE '2022-01-01'),
PUBLIC.TD_DAY_OF_WEEK_COMPATIBLE_UDF(DATE '2023-05-05');
```

Output:

```sql
1, 6
```

## JAROWINKLER_UDF

### Definition

Calculates how similar two strings are using the Jaro-Winkler algorithm. This algorithm gives a score between 0 (completely different) and 1 (identical).

```sql
PUBLIC.JAROWINKLER_UDF (string1 VARCHAR, string2 VARCHAR)
```

### Parameters

`string1` of type VARCHAR

The text to be processed

`string2` of type VARCHAR

The text to be processed

### Returns

The function returns either 0 or 1.

### Usage example

Input:

```sql
SELECT PUBLIC.JAROWINKLER_UDF('święta', 'swieta')
```

Output:

```sql
0.770000
```

## YEAR_BEGIN_ISO_UDF

### Definition

UDF that calculates the first day of the ISO year for a given date. It works by finding the Monday closest to January 1st of the year, using the `DAYOFWEEKISO` function in combination with `PUBLIC.FIRST_DAY_JANUARY_OF_ISO_UDF`. The function either adds or subtracts days to locate this Monday.

```sql
PUBLIC.YEAR_BEGIN_ISO_UDF(INPUT DATE)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date that represents January 1st of the current year according to the ISO calendar standard.

### Returns

The first day of the year according to the ISO calendar standard.

### Usage example

Input:

```sql
SELECT  PUBLIC.YEAR_BEGIN_ISO_UDF(DATE '2022-01-01'),
PUBLIC.YEAR_BEGIN_ISO_UDF(DATE '2022-04-12');
```

Output:

```sql
2021-01-04, 2022-01-03
```

## YEAR_PART_UDF

### Definition

Extract the year from a date and truncate it to a specified number of digits.

```sql
PUBLIC.YEAR_PART_UDF(INPUT TIMESTAMP_TZ, DIGITS INTEGER)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date from which to extract the year.

`DIGITS` A whole number that represents the maximum number of digits to display

The number of decimal places desired in the output.

### Returns

Extracts the year component from a specified date.

### Usage example

Input:

```sql
SELECT PUBLIC.YEAR_PART_UDF(DATE '2021-10-26', 3);
SELECT PUBLIC.YEAR_PART_UDF(DATE '2021-10-26', 2);
SELECT PUBLIC.YEAR_PART_UDF(DATE '2021-10-26', 1);
```

Output:

```sql
'021'
'21'
'1'
```

## YEAR_WITH_COMMA_UDF

### Definition

Extracts the year from a date and adds a comma between the first and second digits. For example, if the year is 2023, it returns “2,023”.

```sql
PUBLIC.YEAR_WITH_COMMA_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input date from which to extract the year.

### Returns

Returns the year portion of a date value as a varchar (text) with a comma separator.

### Usage example

Input:

```sql
SELECT PUBLIC.YEAR_WITH_COMMA_UDF(DATE '2021-10-26');
```

Output:

```sql
'2,021'
```

## MONTHS_BETWEEN_UDF

### Definition

Calculate the Number of Months Between Two Dates

```sql
MONTHS_BETWEEN_UDF(FIRST_DATE TIMESTAMP_LTZ, SECOND_DATE TIMESTAMP_LTZ)
```

### Parameters

`FIRST_DATE` is a timestamp column that includes both date and time information, with timezone support (TIMESTAMP_LTZ)

The initial date from which the function will begin processing data.

`SECOND_DATE` TIMESTAMP_LTZ

The ending date that defines when to stop counting.

### Returns

The duration in months between two dates.

### Usage example

Input:

```sql
SELECT MONTHS_BETWEEN_UDF('2022-02-14', '2021-02-14');
```

Output:

```sql
12
```

## SECONDS_PAST_MIDNIGHT_UDF

### Definition

Calculate the number of seconds elapsed since midnight for a specified time.

```sql
PUBLIC.SECONDS_PAST_MIDNIGHT_UDF(INPUT TIME)
```

### Parameters

`INPUT` TIME

The function calculates the total number of seconds elapsed since midnight (00:00:00) until the current time.

### Returns

A `varchar` value representing the number of seconds elapsed since midnight.

### Usage example

Input:

```sql
SELECT PUBLIC.SECONDS_PAST_MIDNIGHT_UDF(TIME'10:30:45');
```

Output:

```sql
'37845'
```

## CHAR2HEXINT_UDF

### Definition

Returns a string containing the hexadecimal (base-16) representation of each character in the input string.

```sql
PUBLIC.CHAR2HEXINT_UDF(INPUT_STRING VARCHAR);
```

### Parameters

`INPUT_STRING` is a variable of type VARCHAR that stores text data.

The input string that needs to be converted.

### Returns

Returns a string containing the hexadecimal representation of the input string.

### Example

Input:

```sql
SELECT CHAR2HEXINT('1234') from t1;
```

Output:

```sql
SELECT
PUBLIC.CHAR2HEXINT_UDF('1234') from
t1;
```

### More information from the source function

Function documentation is available in the [Teradata documentation](https://docs.teradata.com/r/SQL-Functions-Expressions-and-Predicates/June-2020/String-Operators-and-Functions/CHAR2HEXINT).

## INTERVAL_ADD_UDF

### Definition

UDFs (User-Defined Functions) that handle subtraction operations between an interval value and a column reference of type interval.

```sql
PUBLIC.INTERVAL_ADD_UDF
(INPUT_VALUE1 VARCHAR(), INPUT_PART1 VARCHAR(30), INPUT_VALUE2 VARCHAR(), INPUT_PART2 VARCHAR(30), OP CHAR, OUTPUT_PART VARCHAR())
```

### Parameters

`INPUT_VALUE1` of type VARCHAR

The input data that will be processed by the system.

`INPUT_PART1` of type VARCHAR

The time unit to be used, such as ‘`HOUR`’.

`INPUT_VALUE2` is a VARCHAR data type parameter.

The name of the referenced column, such as ‘`INTERVAL_HOUR_TYPE`’

`INPUT_PART2` VARCHAR

The data type assigned to the referenced column.

`OP` character

The symbol or operator that is currently being analyzed.

`OUTPUT_PART` VARCHAR

The data type of the returned value.

### Returns

A `varchar` value that represents the result of subtracting two time intervals.

### Migration example

Input:

```sql
CREATE TABLE INTERVAL_TABLE
(
    INTERVAL_YEAR_TYPE INTERVAL YEAR
);

SELECT INTERVAL_YEAR_TYPE - INTERVAL '7' MONTH FROM INTERVAL_TABLE;
```

Output:

```sql
CREATE OR REPLACE TABLE INTERVAL_TABLE
(
    INTERVAL_YEAR_TYPE VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT
    PUBLIC.INTERVAL_ADD_UDF(INTERVAL_YEAR_TYPE, 'YEAR', '7', 'MONTH', '-', 'YEAR TO MONTH')
    FROM
    INTERVAL_TABLE;
```

## DAY_OF_WEEK_LONG_UDF

### Definition

A user-defined function (UDF) that converts a timestamp into the full name of the day (for example, “Monday”, “Tuesday”, etc.).

```sql
PUBLIC.DAY_OF_WEEK_LONG_UDF(INPUT_DATE TIMESTAMP)
```

### Parameters

`INPUT_DATE` represents a timestamp value

The timestamp will be converted into a full day name (for example, “Monday”, “Tuesday”, etc.).

### Returns

The name of the day in English.

## TD_WEEK_OF_CALENDAR_UDF

### Definition

The user-defined function (UDF) serves as a direct replacement for Teradata’s [TD_WEEK_OF_CALENDAR](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Date-and-Time-Functions-and-Expressions/Calendar-Functions/td_week_of_calendar) function, providing the same functionality in Snowflake.

```sql
PUBLIC.TD_WEEK_OF_CALENDAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

Date used to calculate the number of weeks that have elapsed since January 1, 1900.

### Returns

An integer representing the number of complete weeks between January 1, 1900, and the specified date

### Migration example

Input:

```sql
SELECT TD_WEEK_OF_CALENDAR(DATE '2023-11-30')
```

Output:

```sql
SELECT
PUBLIC.TD_WEEK_OF_CALENDAR_UDF(DATE '2023-11-30');
```

## WRAP_NEGATIVE_WITH_ANGLE_BRACKETS_UDF

### Definition

Converts negative numbers to use angle brackets (< >) instead of the minus sign (-). This conversion occurs when the PR (parentheses) format element is present in the original Teradata format string.

```sql
PUBLIC.WRAP_NEGATIVE_WITH_ANGLE_BRACKETS_UDF(INPUT NUMBER, FORMATARG VARCHAR)
```

### Parameters

`INPUT` is a numeric value

The numeric value that will be converted into a text string (varchar).

`FORMATARG` is a parameter of type VARCHAR that specifies the format of the data.

The format parameter specifies how to convert the INPUT value into a text (varchar) representation.

### Returns

A `varchar` containing negative numbers enclosed in angle brackets (< >).

### Usage example

Input:

```sql
SELECT PUBLIC.WRAP_NEGATIVE_WITH_ANGLE_BRACKETS_UDF(8456, '9999');
SELECT PUBLIC.WRAP_NEGATIVE_WITH_ANGLE_BRACKETS_UDF(-8456, '9999');
```

Output:

```sql
'8456'
'<8456>'
```

## INSTR_UDF (STRING, STRING)

> **Warning:**
>
> This is the user-defined function (UDF) that accepts **two** **different parameter sets**.

### Definition

Finds all instances where search_string appears within source_string.

```sql
PUBLIC.INSTR_UDF(SOURCE_STRING STRING, SEARCH_STRING STRING)
```

### Parameters

`SOURCE_STRING` represents the input string that needs to be processed

The text that will be searched.

`SEARCH_STRING` is a parameter of type STRING that specifies the text to search for.

The text pattern that the function will look for and match.

### Returns

The index position where the pattern is found in the source string (starting from position 1).

### Usage example

Input:

```sql
SELECT INSTR_UDF('INSTR FUNCTION','N');
```

Output:

```sql
2
```

## TRANSLATE_CHK_UDF

### Definition

Checks whether the code can be successfully converted without generating any errors.

```sql
PUBLIC.TRANSLATE_CHK_UDF(COL_NAME STRING, SOURCE_REPERTOIRE_NAME STRING)
```

### Parameters

`COL_NAME` is a string variable that represents a column name.

The column that needs to be validated.

`SOURCE_REPERTOIRE_NAME` is a string parameter that specifies the name of the source directory.

The name of the source collection or library.

### Returns

0: The translation was successful and completed without errors.
NULL: No result was returned (null value).

The first character’s position in the string is causing a translation error.

### Usage example

Input:

```sql
SELECT PUBLIC.TRANSLATE_CHK_UDF('ABC', 'UNICODE_TO_LATIN');
```

Output:

```sql
0
```

## EXPAND_ON_UDF

> **Note:**
>
> For better readability, we have simplified some sections of the code in this example.

### Definition

Replicates the behavior of Teradata’s expand-on function.

```sql
PUBLIC.EXPAND_ON_UDF(TIME STRING, SEQ NUMBER, PERIOD STRING)
```

### Parameters

`TIME` is a data type that stores time values as text (STRING).

The time required for the anchor to fully expand.

`SEQ` Sequence Number

The order in which each row’s values are computed.

`PERIOD` A text value representing a time period

The date for the specified time period.

### Returns

A `VARCHAR` value that defines how to calculate the expansion period in the expand-on clause.

### Migration example

Input:

```sql
SELECT bg FROM table1 EXPAND ON pd AS bg BY ANCHOR ANCHOR_SECOND;
```

Output:

```sql
WITH
ExpandOnCTE AS
(
SELECT
PUBLIC.EXPAND_ON_UDF('ANCHOR_SECOND', VALUE, pd) bg
FROM
table1,
TABLE(FLATTEN(PUBLIC.ROW_COUNT_UDF(PUBLIC.DIFF_TIME_PERIOD_UDF('ANCHOR_SECOND', pd))))
)
SELECT
bg
FROM
table1,
ExpandOnCTE;
```

## ROW_COUNT_UDF

### Definition

Returns an array containing sequential numbers from 1 to the value returned by DIFF_TIME_PERIOD_UDF.

```sql
PUBLIC.ROW_COUNT_UDF(NROWS DOUBLE)
```

### Parameters

`NROWS` represents the total number of rows in a dataset as a decimal number (DOUBLE)

The value returned by the DIFF_TIME_PERIOD_UDF function.

### Returns

An array that determines the number of rows required to replicate the functionality of the EXPAND ON clause.

### Usage example

Input:

```sql
SELECT ROW_COUNT_UDF(DIFFTTIME_PERIOD('SECONDS','2022-11-26 10:15:20.000*2022-11-26 10:15:25.000'));
```

Output:

```sql
[1, 2, 3, 4, 5]
```

### Migration example

Input:

```sql
SELECT NORMALIZE emp_id, duration FROM project EXPAND ON duration AS bg BY ANCHOR ANCHOR_SECOND;
```

Output:

```sql
WITH ExpandOnCTE AS
(
SELECT
    PUBLIC.EXPAND_ON_UDF('ANCHOR_SECOND', VALUE, duration) bg
FROM
    project,
TABLE(FLATTEN(PUBLIC.ROW_COUNT_UDF(PUBLIC.DIFF_TIME_PERIOD_UDF('ANCHOR_SECOND', duration))))
)
SELECT NORMALIZE emp_id,
    duration
FROM
    project,
    ExpandOnCTE;
```

## CENTURY_UDF

### Definition

Calculates the century for a given date.

```sql
PUBLIC.CENTURY_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The input date used to determine the century.

### Returns

Returns the century number as a varchar for a given date.

### Usage example

Input:

```sql
SELECT PUBLIC.CENTURY_UDF(DATE '1915-02-23');
```

Output:

```sql
'20'
```

## TIME_DIFFERENCE_UDF

> **Warning:**
>
> This UDF has been deprecated as Snowflake now provides a built-in equivalent function. For more details, please refer to the [TIMEDIFF documentation](../../../../../../sql-reference/functions/timediff.md).

### Definition

Calculates the time interval between two given timestamps.

```sql
PUBLIC.TIME_DIFFERENCE_UDF
(MINUEND TIME, SUBTRAHEND TIME, INPUT_PART VARCHAR)
```

### Parameters

`MINUEND` A timestamp value that will be subtracted from

Time to be subtracted from the original value.

`SUBTRAHEND` The timestamp value to be subtracted from another timestamp

Time has been subtracted.

`INPUT_PART` is a variable of type VARCHAR that stores input data.

`EXTRACT_PART` is a variable of type VARCHAR that stores the extracted portion of a string.

Extract a numeric value from a time interval.

### Returns

A text value (VARCHAR) representing a specific time.

### Example

Input:

```sql
select extract(day from (timestampColumn1 - timestampColumn2 day to hour)) from tableName;
```

Output:

```sql
SELECT
EXTRACT_TIMESTAMP_DIFFERENCE_UDF(timestampColumn1, timestampColumn2, 'DAY TO HOUR', 'DAY')
                                 from
tableName;
```

## INTERVAL_DIVIDE_UDF

### Definition

A custom function (UDF) that performs interval division calculations.

```sql
PUBLIC.INTERVAL_DIVIDE_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR(), INPUT_DIV INTEGER)
```

### Parameters

`INPUT_PART` is a variable of type VARCHAR that represents the input portion of the data.

The value that specifies the interval type, such as ‘YEAR TO MONTH’.

`INPUT_VALUE` VARCHAR

The time interval to be divided.

`INPUT_DIV` is an integer value that represents the input divisor.

The number that will be divided by another number.

### Returns

The output is calculated by dividing a time interval by a numeric value.

### Migration example

Input:

```sql
SELECT INTERVAL '6-10' YEAR TO MONTH / 8;
```

Output:

```sql
SELECT
PUBLIC.INTERVAL_DIVIDE_UDF('YEAR TO MONTH', '6-10', 8);
```

## DAYNUMBER_OF_MONTH_UDF

### Definition

The UDF determines which day of the month a given timestamp falls on. It functions similarly to Teradata’s DAYNUMBER_OF_MONTH(DATE, ‘ISO’) function.

```sql
PUBLIC.DAYNUMBER_OF_MONTH_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

A date value that will be used to determine the corresponding day of the week.

### Returns

A whole number from 1 to 33 (inclusive).

### Example

Input:

```sql
SELECT DAYNUMBER_OF_MONTH (DATE'2022-12-22', 'ISO');
```

Output:

```sql
SELECT
PUBLIC.DAYNUMBER_OF_MONTH_UDF(DATE'2022-12-22');
```

## LAST_DAY_DECEMBER_OF_ISO_UDF

### Definition

UDF (User-Defined Function) that processes December 31st and returns the corresponding ISO year. This function is used as a component of the PUBLIC.YEAR_END_IDO_UDF calculation.

```sql
PUBLIC.LAST_DAY_DECEMBER_OF_ISO_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

To get the last day of December using the ISO year format, use December 31st.

### Returns

A date representing December 31st in ISO year format.

### Usage example

Input:

```sql
SELECT PUBLIC.LAST_DAY_DECEMBER_OF_ISO_UDF(DATE '2022-01-01');
```

Output:

```sql
2021-12-31
```

## DATEADD_UDF

> **Note:**
>
> For better readability, we have simplified some sections of the code in this example.

### Definition

Function to Calculate the Sum of Two Dates

```sql
PUBLIC.DATE_ADD_UDF(FIRST_DATE DATE, SECOND_DATE DATE)
```

### Parameters

`FIRST_DATE` represents a column of type DATE

The initial date value to be included.

`SECOND_DATE` represents a column of type DATE

Add the second date value together with first_date.

###

### Returns

The result is a date calculated by combining both input parameters.

### Example

Input:

```sql
SELECT
    CAST(CAST (COLUMNB AS DATE FORMAT 'MM/DD/YYYY') AS TIMESTAMP(0))
    +
    CAST (COLUMNA AS TIME(0) FORMAT 'HHMISS' )
FROM TIMEDIFF;
```

Output:

```sql
SELECT
    PUBLIC.DATEADD_UDF(CAST(CAST(COLUMNB AS DATE) !!!RESOLVE EWI!!! /*** SSC-EWI-0033 - FORMAT 'MM/DD/YYYY' REMOVED, SEMANTIC INFORMATION NOT FOUND. ***/!!! AS TIMESTAMP(0)), PUBLIC.TO_INTERVAL_UDF(CAST(COLUMNA AS TIME(0)) !!!RESOLVE EWI!!! /*** SSC-EWI-0033 - FORMAT 'HHMISS' REMOVED, SEMANTIC INFORMATION NOT FOUND. ***/!!!))
    FROM
    TIMEDIFF;
```

## JULIAN_TO_DATE_UDF

### Definition

A user-defined function (UDF) that converts a Julian Date format (YYYYDDD) into a standard Gregorian calendar date (YYYY-MM-DD).

```sql
PUBLIC.JULIAN_TO_DATE_UDF(JULIAN_DATE CHAR(7))
```

### Parameters

`JULIAN_DATE` CHAR - A character data type used to store dates in Julian format.

The date to be converted from Julian format.

### Returns

Returns the date representation of the Julian date, or null if the conversion cannot be performed.

### Usage example

Input:

```sql
SELECT JULIAN_TO_DATE_UDF('2022045');
```

Output:

```sql
'2022-02-14'
```

### Migration example

Input:

```sql
SELECT TO_DATE('2020002', 'YYYYDDD');
```

Output:

```sql
SELECT
PUBLIC.JULIAN_TO_DATE_UDF('2020002');
```

## FIRST_DAY_JANUARY_OF_ISO_UDF

### Definition

The first day of January in the ISO calendar year, which is used by the `PUBLIC.YEAR_BEGIN_ISO_UDF` function to calculate its result.

```sql
FUNCTION PUBLIC.FIRST_DAY_JANUARY_OF_ISO_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date that represents January 1st using the ISO calendar year format.

### Returns

A date representing January 1st of the specified ISO calendar year.

### Usage example

Input:

```sql
SELECT PUBLIC.FIRST_DAY_JANUARY_OF_ISO_UDF(DATE '2022-01-01');
```

Output:

```sql
2021-01-01
```

## TIMESTAMP_DIFFERENCE_UDF

### Definition

How to Subtract Two Dates Using a User-Defined Function (UDF)

```sql
PUBLIC.TIMESTAMP_DIFFERENCE_UDF
(MINUEND TIMESTAMP, SUBTRAHEND TIMESTAMP, INPUT_PART VARCHAR)
```

### Differences between Teradata and Snowflake date time subtraction

Teradata and Snowflake use different methods for date and time calculations. They differ in their syntax, output data types, and precision levels.

* **Syntax:** In Teradata, DATE, TIMESTAMP, and TIME subtraction uses a minus sign and interval to specify the result’s format. For more details, see <https://docs.teradata.com/r/w19R4KsuHIiEqyxz0WYfgA/7kLLsWrP0kHxbk3iida0mA>. Snowflake handles these operations differently using three functions:

  + DATEDIFF (works with all date types)
  + TIMESTAMPDIFF
  + TIMEDIFF
    Each function requires the two dates to compare and the date part to return. For DATE types, you can also use the minus sign, which returns the difference in days.
* **Return Type:** Teradata returns various Interval types (see <https://www.docs.teradata.com/r/T5QsmcznbJo1bHmZT2KnFw/z~5iW7rYVstcmNYbd6Dsjg>). Snowflake’s functions return an Integer representing the number of units. For details, see <https://docs.snowflake.com/en/sql-reference/functions/datediff.html>
* **Rounding:** The way DATEDIFF handles date parts may produce different results than Teradata. Check <https://docs.snowflake.com/en/sql-reference/functions/datediff.html#usage-notes> for specific rounding behavior.

> **Warning:**
>
> When performing date calculations, results may differ by one day due to rounding or timezone differences.

### Parameters

`MINUEND` A timestamp value that will be subtracted from represents the timestamp value that will be subtracted from

The date being used as the starting point for subtraction.

`SUBTRAHEND` is a timestamp value that will be subtracted from another timestamp.

The date has been removed.

`INPUT_PART` is a variable of type VARCHAR (variable-length character string)

Parts that need to be returned.

### Returns

Format the string value based on the specified `INPUT_PART` parameter.

### Example

Input:

```sql
select (timestampColumn1 - timestampColumn2 YEAR) from tableName;
```

```sql
SELECT
(
PUBLIC.TIMESTAMP_DIFFERENCE_UDF(timestampColumn1, timestampColumn2, 'YEAR')) from
tableName;
```

## FIRST_DAY_OF_MONTH_ISO_UDF

### Definition

The User-defined function (UDF) returns the first day of a given month in ISO format (YYYY-MM-DD).

```sql
PUBLIC.FIRST_DAY_OF_MONTH_ISO_UDF(YEAR NUMBER, MONTH NUMBER)
```

### Parameters

`YEAR` is a numeric data type used to store a four-digit year value.

A numeric value representing a calendar year (e.g., 2023).

`MONTH` A numeric value representing a month (1-12)

A numeric value (1-12) representing a calendar month.

### Returns

Returns the first day of the current month in ISO format (YYYY-MM-DD).

### Example

> **Note:**
>
> This UDF is a helper function that is used within the **`DAYNUMBER_OF_MONTH_UDF`** function.

## INT_TO_DATE_UDF

### Definition

UDF to Convert Numeric Values to Dates (Teradata Compatibility Function)

```sql
PUBLIC.INT_TO_DATE_UDF(NUMERIC_EXPRESSION INTEGER)
```

### Parameters

`NUMERIC_EXPRESSION` represents a numeric value or expression that evaluates to an integer

A value that represents a date in a specific format, such as YYYY-MM-DD

### Returns

Number converted to a date format.

### Example

Input:

```sql
SELECT * FROM table1
WHERE date_column > 1011219
```

Output:

```sql
SELECT
* FROM
table1
WHERE date_column > PUBLIC.INT_TO_DATE_UDF( 1011219);
```

## NULLIFZERO_UDF

### Definition

Replaces zero values with NULL in the data to prevent division by zero errors.

```sql
PUBLIC.NULLIFZERO_UDF(NUMBER_TO_VALIDATE NUMBER)
```

### Parameters

`NUMBER_TO_VALIDATE` NUMBER

The number that needs to be validated.

### Returns

Returns null if the input number is zero; otherwise, returns the original number.

### Usage example

```sql
SELECT NULLIFZERO_UDF(0);
```

Output:

```sql
NULL
```

## DATE_LONG_UDF

### Definition

Converts a date into the format ‘Day, Month DD, YYYY’ (for example, ‘Monday, January 01, 2024’). This format matches Teradata’s DL date format element.

```sql
PUBLIC.DATE_LONG_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date should be displayed in a long date format (for example: “September 15, 2023”).

### Returns

A `VARCHAR` data type that represents the Teradata DL format element.

### Usage example

Input:

```sql
SELECT PUBLIC.DATE_LONG_UDF(DATE '2021-10-26');
```

Output:

```sql
'Tuesday, October 26, 2021'
```

## TD_MONTH_OF_CALENDAR_UDF

### Definition

The user-defined function (UDF) serves as a replacement for Teradata’s [TD_MONTH_OF_CALENDAR](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Date-and-Time-Functions-and-Expressions/Calendar-Functions/td_month_of_calendar) function, providing the same functionality.

```sql
PUBLIC.TD_MONTH_OF_CALENDAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

Date used to calculate the number of months elapsed since January 1, 1900.

### Returns

An integer representing the number of months between January 1, 1900 and the specified date

### Migration example

Input:

```sql
SELECT TD_MONTH_OF_CALENDAR(DATE '2023-11-30')
```

Output:

```sql
SELECT
PUBLIC.TD_MONTH_OF_CALENDAR_UDF(DATE '2023-11-30');
```

## MONTH_NAME_LONG_UDF

### Definition

A user-defined function (UDF) that converts a timestamp into its corresponding full month name.

```sql
PUBLIC.MONTH_NAME_LONG_UDF(INPUT_DATE TIMESTAMP)
```

### Parameters

`INPUT` DATE

The timestamp should be converted to display the full month name.

### Returns

The name of the month in English.

## TD_DAY_OF_CALENDAR_UDF

### Definition

User-defined function (UDF) that replicates Teradata’s `TO_DAY_OF_CALENDAR` functionality

```sql
PUBLIC.TD_DAY_OF_CALENDAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

Date used to calculate the number of days elapsed since January 1, 1900.

### Returns

An integer representing the number of days between January 1, 1900 and the `INPUT` date

### Migration example

Input:

```sql
SELECT td_day_of_calendar(current_date)
```

Output:

```sql
SELECT
PUBLIC.TD_DAY_OF_CALENDAR_UDF(CURRENT_DATE());
```

## PERIOD_TO_TIME_UDF

### Definition

Function that converts a Teradata PERIOD value to a TIME value, maintaining Teradata’s casting behavior.

```sql
PERIOD_TO_TIME_UDF(PERIOD_VAL VARCHAR(22))
```

### Parameters

`PERIOD_VAL` represents a time period value

The time period that needs to be converted.

### Returns

The function returns a `TIME` value representing the `PERIOD`. If the conversion cannot be completed, it returns null.

### Usage example

Input:

```sql
SELECT PERIOD_TO_TIME_UDF(PERIOD_UDF(CURRENT_TIME()));
```

Output:

```sql
08:42:04
```

## INSTR_UDF (STRING, STRING, DOUBLE, DOUBLE)

> **Warning:**
>
> This user-defined function (UDF) accepts **four** **input parameters**.

### Definition

Finds all instances where search_string appears within source_string.

```sql
PUBLIC.INSTR_UDF(SOURCE_STRING STRING, SEARCH_STRING STRING, POSITION DOUBLE, OCCURRENCE DOUBLE)
```

### Parameters

`SOURCE_STRING` represents the input string that needs to be processed

The text string that will be searched.

`SEARCH_STRING` is a text value that you want to search for.

The text pattern that the function will look for and match.

`POSITION` DOUBLE - A numeric data type that stores decimal numbers with double precision.

The position in the text where the search will begin (starting from position 1).

`OCCURRENCE` DOUBLE - A numeric data type that represents the number of times an event occurs, stored as a double-precision floating-point number.

The position in the text where the search will begin (starting from position 1).

### Returns

The index position where the specified text is found within the source string.

### Usage example

Input:

```sql
SELECT INSTR_UDF('CHOOSE A CHOCOLATE CHIP COOKIE','CH',2,2);
```

Output:

```sql
20
```

## ROUND_DATE_UDF

### Definition

A user-defined function (UDF) that processes a DATE_VALUE by rounding the time portion to a specified unit (UNIT_TO_ROUND_BY). This function is similar to the Teradata ROUND(date) function.

```sql
PUBLIC.ROUND_DATE_UDF(DATE_TO_ROUND TIMESTAMP_LTZ, UNIT_TO_ROUND_BY VARCHAR(5))
```

### Parameters

`DATE_TO_ROUND` TIMESTAMP_TZ (A timestamp value with timezone information that needs to be rounded)

The date value that needs to be rounded.

`UNIT_TO_ROUND_BY` VARCHAR - Specifies the time unit used for rounding

The time unit used for rounding the date.

### Returns

Returns a date rounded to the specified time unit. The UNIT_TO_ROUND_BY parameter determines how the date will be rounded.

### Migration example

Input:

```sql
SELECT ROUND(CURRENT_DATE, 'RM') RND_DATE
```

Output:

```sql
SELECT
PUBLIC.ROUND_DATE_UDF(CURRENT_DATE(), 'RM') RND_DATE;
```

## SUBSTR_UDF (STRING, FLOAT, FLOAT)

> **Warning:**
>
> This is the user-defined function (UDF) that accepts **three** **parameters**.

### Definition

Retrieves a portion of text from a specified string by using starting and ending positions.

```sql
PUBLIC.SUBSTR_UDF(BASE_EXPRESSION STRING, START_POSITION FLOAT, LENGTH FLOAT)
```

### Parameters

`BASE_EXPRESSION` is a string parameter that defines the base expression.

The source text from which you want to extract a portion.

`START_POSITION` is a floating-point number that defines the initial position.

The position where you want to begin extracting characters from the string.

`LENGTH` is a floating-point number that represents the length value.

The position where you want to begin extracting characters from the string.

### Returns

The substring that must be included.

### Usage example

Input:

```sql
SELECT
    PUBLIC.SUBSTR_UDF('ABC', -1, 1),
    PUBLIC.SUBSTR_UDF('ABC', -1, 2),
    PUBLIC.SUBSTR_UDF('ABC', -1, 3),
    PUBLIC.SUBSTR_UDF('ABC', 0, 1),
    PUBLIC.SUBSTR_UDF('ABC', 0, 2);
```

Output:

```sql
'','','A','','A'
```

## GETQUERYBANDVALUE_UDF (VARCHAR)

> **Warning:**
>
> This is the user-defined function (UDF) that accepts **one** **parameter**.

### Definition

Returns a value from a name-value pair stored in the transaction, session, or profile query band.

```sql
GETQUERYBANDVALUE_UDF(SEARCHNAME VARCHAR)
```

### Parameters

`SEARCHNAME` VARCHAR - A variable of type VARCHAR used to store search terms or names. - A variable of type VARCHAR used to store search terms or names.

The name to search for within the key-value pairs.

### Returns

The session query band’s “name” key value, or null if not present.

### Usage example

Input:

```sql
ALTER SESSION SET QUERY_TAG = 'user=Tyrone;role=security';
SELECT GETQUERYBANDVALUE_UDF('role');
```

Output:

```sql
security
```

### Migration example

Input:

```sql
SELECT GETQUERYBANDVALUE(1, 'group');
```

Output:

```sql
/** MSC-ERROR - MSCEWI2084 - TRANSACTION AND PROFILE LEVEL QUERY TAGS NOT SUPPORTED IN SNOWFLAKE, REFERENCING SESSION QUERY TAG INSTEAD **/
SELECT GETQUERYBANDVALUE_UDF('group');
```

## TD_WEEK_OF_YEAR_UDF

### Definition

User-defined function (UDF) that calculates the full week number of a given date within the year. This function provides the same functionality as Teradata’s `TD_WEEK_OF_YEAR` and `WEEKNUMBER_OF_YEAR` functions.

```sql
PUBLIC.TD_WEEK_OF_YEAR_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

Date used to calculate the week number.

### Returns

A numerical value indicating which week of the year the specified date falls into.

### Usage example

Input:

```sql
SELECT PUBLIC.WEEK_OF_YEAR_UDF(DATE '2024-05-10'),
PUBLIC.WEEK_OF_YEAR_UDF(DATE '2020-01-03')
```

Output:

```sql
18, 0
```

## EXTRACT_TIMESTAMP_DIFFERENCE_UDF

> **Note:**
>
> For better readability, we have simplified the code examples by showing only the most relevant parts.

### Definition

Retrieves the ‘Data’ portion from the result of subtracting `SUBTRAHEND` from `MINUEND`

```sql
PUBLIC.EXTRACT_TIMESTAMP_DIFFERENCE_UDF
(MINUEND TIMESTAMP, SUBTRAHEND TIMESTAMP, INPUT_PART VARCHAR, EXTRACT_PART VARCHAR)
```

### Differences between Teradata and Snowflake date-time extraction

Teradata and Snowflake functions may have different parameter requirements and return different data types.

* **Parameters:** The key distinction between Teradata and Snowflake’s EXTRACT functions is that Snowflake only works with dates and times, while Teradata also supports intervals. For more details, refer to [Snowflake’s EXTRACT function documentation](https://docs.snowflake.com/en/sql-reference/functions/extract.html) and [Teradata’s EXTRACT function documentation](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/SIkE2wnHyQBnU4AGWRZSRw).
* **Return type:** The functions return values differently: Teradata’s EXTRACT returns either an integer or decimal(8, 2), while Snowflake’s EXTRACT returns a number representing the requested date-time part.

Teradata and Snowflake functions may have different input parameters and output types.

### Parameters

`MINUEND` TIMESTAMP

The date being used as the starting point for subtraction.

`SUBTRAHEND` The timestamp value to be subtracted from another timestamp

The date has been removed.

`INPUT_PART` VARCHAR

The formatted varchar must match the original requested part (which is the same as `TIMESTAMP_DIFERENCE` `INPUT_PART`) and must be one of the following:

* `'DAY TO HOUR'`
* `'DAY TO MINUTE'`
* `'DAY TO SECOND'`
* `'DAY TO MINUTE'`
* `'HOUR TO MINUTE'`
* `'HOUR TO SECOND'`
* `'MINUTE TO SECOND'`

`EXTRACT_PART` is a VARCHAR data type that represents the extracted portion of a string.

The time unit for extraction must be one of the following values: `'DAY'`, `'HOUR'`, `'MINUTE'`, or `'SECOND'`. The requested time unit should fall within the input time interval.

### Returns

The number of requests included in the extraction process.

### Example

Input:

```sql
select extract(day from (timestampColumn1 - timestampColumn2 day to hour)) from tableName;
```

Output:

```sql
SELECT
EXTRACT_TIMESTAMP_DIFFERENCE_UDF(timestampColumn1, timestampColumn2, 'DAY TO HOUR', 'DAY')
from
tableName;
```

## JSON_EXTRACT_DOT_NOTATION_UDF

### Definition

A user-defined function (UDF) that allows you to query JSON objects using dot notation, similar to how you would access nested properties in JavaScript or Python.

```sql
JSON_EXTRACT_DOT_NOTATION_UDF(JSON_OBJECT VARIANT, JSON_PATH STRING)
```

### Differences between Teradata JSON Entity Reference (dot notation ) and Snowflake JSON query method.

Teradata and Snowflake use different methods to traverse JSON data. Teradata uses a JavaScript-based approach with dot notation, array indexing, and special operators like wildcard access and double dot notation. In contrast, Snowflake has more limited JSON traversal capabilities, only supporting direct member access and array indexing.

### Parameters

`JSON_OBJECT` A data type that represents a JSON object, which can contain nested key-value pairs of varying data types.

The JSON object containing the values you want to extract.

`JSON_PATH` A string parameter that specifies the path to extract data from a JSON document

The location within the JSON_OBJECT where the values can be found, specified using JSON path notation.

### Returns

The data elements within the JSON_OBJECT that match the specified JSON_PATH.

### Migration example

Input:

```sql
SELECT CAST(varcharColumn AS JSON(2000))..name FROM variantTest;
```

Output:

```sql
SELECT
JSON_EXTRACT_DOT_NOTATION_UDF(CAST(varcharColumn AS VARIANT), '$..name')
FROM
variantTest;
```

## WEEK_OF_MONTH_UDF

### Definition

Calculates which week of the month a specific date falls into.

```sql
PUBLIC.WEEK_OF_MONTH_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date used to determine which week of the month it falls into.

### Returns

A VARCHAR column that displays which week of the month a specific date falls in.

### Usage example

Input:

```sql
SELECT PUBLIC.WEEK_OF_MONTH_UDF(DATE '2021-10-26');
```

Output:

```sql
'4'
```

## DAYNAME_LONG_UDF (TIMESTAMP_TZ)

> **Warning:**
>
> This is the user-defined function (UDF) that accepts **one** **parameter**.

### Definition

UDF that creates a variant of the DAYNAME_LONG_UDF function which returns day names with the first letter capitalized (default format).

```sql
PUBLIC.DAYNAME_LONG_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date from which you want to get the day of the week.

### Returns

Returns a string containing the full name of a day of the week.

### Usage example

Input:

```sql
SELECT PUBLIC.DAYNAME_LONG_UDF(DATE '2022-06-30');
```

Output:

```sql
'Thursday'
```

## INTERVAL_TO_MONTHS_UDF

### Definition

Converts a time interval into months.

```sql
PUBLIC.INTERVAL_TO_MONTHS_UDF
(INPUT_VALUE VARCHAR())
```

### Parameters

`INPUT_VALUE` VARCHAR

The time period that will be changed into months.

### Returns

The number of months to be processed, specified as an integer.

## GETQUERYBANDVALUE_UDF (VARCHAR, FLOAT, VARCHAR)

> **Warning:**
>
> This user-defined function (UDF) accepts three parameters.

### Definition

Returns a value from a name-value pair stored in the transaction, session, or profile query band. The value is associated with a specific name in the query band.

```sql
GETQUERYBANDVALUE_UDF(QUERYBAND VARCHAR, SEARCHTYPE FLOAT, SEARCHNAME VARCHAR)
```

### Parameters

`QUERYBAND` is a VARCHAR data type that stores query band information.

The query band combines transaction, session, and profile query bands into a single string.

`SEARCHTYPE` is a floating-point number data type.

The maximum depth at which matching pairs will be searched.

0 represents a wildcard value that matches any input.

A transaction represents a single unit of work in a database.

A Session object represents a connection to Snowflake.

3 = Create a profile.

`SEARCHNAME` VARCHAR

The name to search for within the key-value pairs.

### Returns

Returns the value of the ‘name’ key at the specified level in the hierarchy. If no value is found, returns null.

### Usage example

Input:

```sql
SELECT GETQUERYBANDVALUE_UDF('=T> account=Matt;user=Matt200; =S> account=SaraDB;user=Sara;role=DbAdmin;', 0, 'account');
SELECT GETQUERYBANDVALUE_UDF('=T> account=Matt;user=Matt200; =S> account=SaraDB;user=Sara;role=DbAdmin;', 2, 'account');
SELECT GETQUERYBANDVALUE_UDF('=T> account=Matt;user=Matt200; =S> account=SaraDB;user=Sara;role=DbAdmin;', 0, 'role');
SELECT GETQUERYBANDVALUE_UDF('=T> account=Matt;user=Matt200; =S> account=SaraDB;user=Sara;role=DbAdmin;', 1, 'role');
```

Output:

```sql
      Matt
      SaraDB
      DbAdmin
      NULL
```

### Migration example

Input:

```sql
SELECT GETQUERYBANDVALUE('=T> account=Matt;user=Matt200; =S> account=SaraDB;user=Sara;role=DbAdmin;', 0, 'account')
```

Output:

```sql
WITH
--** MSC-WARNING - MSCEWI2078 - THE EXPAND ON CLAUSE FUNCTIONALITY IS TRANSFORMED INTO A CTE BLOCK **
ExpandOnCTE AS
(
SELECT
PUBLIC.EXPAND_ON_UDF('ANCHOR_SECOND', VALUE, duration) bg
FROM
project,
TABLE(FLATTEN(PUBLIC.ROW_COUNT_UDF(PUBLIC.DIFF_TIME_PERIOD_UDF('ANCHOR_SECOND', duration))))
)
SELECT NORMALIZE emp_id,
duration
FROM
project,
ExpandOnCTE;
```

## JULIAN_DAY_UDF

### Definition

Calculates the Julian day number, which represents the continuous count of days since January 1, 4713 BCE (Before Common Era). The Julian day is used in astronomy and calendar calculations.

```sql
PUBLIC.JULIAN_DAY_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date that will be converted to a Julian day number.

### Returns

A `varchar` value representing the calculated Julian date.

### Usage example

Input:

```sql
SELECT PUBLIC.JULIAN_DAY_UDF(DATE '2021-10-26');
```

Output:

```sql
'2459514'
```

## WEEKNUMBER_OF_MONTH_UDF

### Definition

Identify the month from a given date.

```sql
PUBLIC.WEEKNUMBER_OF_MONTH_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date from which to calculate the month number.

### Returns

A numeric value representing the month (1-12) of a given date.

### Usage example

Input:

```sql
SELECT PUBLIC.WEEKNUMBER_OF_MONTH_UDF(DATE '2022-05-21')
```

Output:

```sql
3
```

## JSON_EXTRACT_UDF

### Definition

A user-defined function (UDF) that mimics the behavior of `JSONExtract`, `JSONExtractValue`, and `JSONExtractLargeValue` functions. This UDF allows you to extract multiple values from a JSON object.

```sql
JSON_EXTRACT_UDF(JSON_OBJECT VARIANT, JSON_PATH STRING, SINGLE_VALUE BOOLEAN)
```

### Parameters

`JSON_OBJECT` is a data type that stores JSON-formatted data in a structured format.

The JSON object containing the values you want to extract.

`JSON_PATH` A string that specifies the path to extract data from a JSON document

The location within the JSON_OBJECT where the desired values can be found, specified using JSON path notation.

`SINGLE_VALUE` A boolean flag that indicates whether to return a single value or multiple values.

BOOLEAN parameter: When set to true, returns a single value (required for JSONExtractValue and JSONExtractLargeValue functions). When set to false, returns an array of values (used with JSONExtract).

### Returns

The data values found at the specified JSON path within the JSON object.

### Migration example

Input:

```sql
SELECT
    Store.JSONExtract('$..author') as AllAuthors
FROM BookStores;
```

Output:

```sql
SELECT
    JSON_EXTRACT_UDF(Store, '$..author', FALSE) as AllAuthors
    FROM
    BookStores;
```

## COMPUTE_EXPAND_ON_UDF

### Definition

Determines how to expand data based on the specified time period type.

```sql
PUBLIC.COMPUTE_EXPAND_ON_UDF(TIME STRING, SEQ NUMBER, PERIOD TIMESTAMP, PERIODTYPE STRING)
```

### Parameters

`TIME` STRING

The timestamp used in the anchor.

`SEQ` sequence number

The order in which each row’s calculations are performed.

`PERIOD` represents a timestamp value that indicates a specific point in time.

The date for the specified time period.

`PERIODTYPE` is a string value that defines the type of time period.

The time period used for the calculation (either ‘`BEGIN`’ or ‘`END`’)

### Returns

A timestamp indicating when each row in the EXPAND-ON operation was processed.

### Example

> **Warning:**
>
> This UDF is a derived function that extends the functionality of EXPAND_ON_UDF.

## WEEK_NUMBER_OF_QUARTER_UDF

### Definition

Returns the week number within the current quarter for a specified date. This function follows the same behavior as Teradata’s `WEEKNUMBER_OF_QUARTER(DATE, 'ISO')` function, using the ISO calendar system.

```sql
PUBLIC.WEEK_NUMBER_OF_QUARTER_UDF(INPUT TIMESTAMP_TZ)
```

### Parameters

`INPUT` TIMESTAMP_TZ

The date used to calculate which week of the quarter it falls into.

### Returns

An integer indicating which week of the quarter (1-13) is being referenced.

### Usage example

Input:

```sql
SELECT WEEK_NUMBER_OF_QUARTER_UDF(DATE '2023-01-01'),
WEEK_NUMBER_OF_QUARTER_UDF(DATE '2022-10-27')
```

Output:

```sql
1, 4
```

## YEAR_END_ISO_UDF

### Definition

User-defined function (UDF) that calculates the last day of the year for a given date using ISO calendar standards, similar to Teradata’s TD_YEAR_END function.

```sql
PUBLIC.YEAR_END_ISO_UDF(INPUT date)
```

### Parameters

`INPUT` DATE

The date that represents the last day of the year according to the ISO calendar standard.

### Returns

The last day of the year according to the ISO calendar system.

### Usage example

Input:

```sql
SELECT  PUBLIC.YEAR_END_ISO_UDF(DATE '2022-01-01'),
PUBLIC.YEAR_END_ISO_UDF(DATE '2022-04-12');
```

Output:

```sql
2022-01-02, 2023-01-01
```

## INSERT_CURRENCY_UDF

### Definition

Insert the currency symbol directly before the first digit of the number to ensure there are no spaces or symbols between the currency symbol and the number.

```sql
PUBLIC.INSERT_CURRENCY_UDF(INPUT VARCHAR, CURRENCYINDEX INTEGER, CURRENCYVALUE VARCHAR)
```

### Parameters

`INPUT` VARCHAR

The output of TO_CHAR when converting a numeric value that requires currency formatting.

`CURRENCYINDEX` is an integer value that represents the index of a currency.

The position in the array where the currency should be inserted.

`CURRENCYVALUE` A VARCHAR field that stores currency values

The text that will be used as the currency value.

### Returns

A `varchar` field containing the currency text at a defined position.

### Usage example

Input:

```sql
SELECT PUBLIC.INSERT_CURRENCY_UDF(to_char(823, 'S999999'), '1', 'CRC');
```

Output:

```sql
'+CRC823'
```

## INSTR_UDF (STRING, STRING, INT)

> **Warning:**
>
> This user-defined function (UDF) accepts three parameters.

### Definition

Finds all instances where search_string appears within source_string.

```sql
PUBLIC.INSTR_UDF(SOURCE_STRING STRING, SEARCH_STRING STRING, POSITION INT)
```

### Parameters

`SOURCE_STRING` represents a string value that will be used as input

The text that will be searched.

`SEARCH_STRING` is a text value that you want to search for.

The text pattern that the function will look for and match.

`POSITION` is an integer data type that represents a position in a sequence.

The position in the text where the search begins (starting from position 1).

### Returns

The location within the original string where the match is found.

### Usage example

Input:

```sql
SELECT INSTR_UDF('FUNCTION','N', 3);
```

Output:

```sql
8
```

---
title: SnowConvert AI - Functional Difference Messages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/README.md
section: Migrations
---

# SnowConvert AI - Functional Difference Messages

An FDM is generated when SnowConvert AI is able to output syntactically correct code but that code may not provide exact functional equivalence to the original legacy code. Reasons for functional in-equivalence may be due to features that are not available within Snowflake and require re-solutioning beyond what can be done with straight code conversion. Many times, business or architectural input will be required to determine further course of action needed (if any) related to FDMs.

---
title: SnowConvert AI - Functions Usage Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/functions-usage-report.md
section: Migrations
---

# SnowConvert AI - Functions Usage Report

## What is an “function usage”?

The term “usage” is used in this context to indicate that a specific function was invoked in the code. This function could be a built-in or user-defined function in a source language.

These are some examples of places where functions can be invoked in SQL languages:

* Any DDL, `CREATE TABLE` default columns value or as part of a `CREATE VIEW` select using a function.
* Any DML, like `INSERT` and `DELETE`
* In procedural language, assign the returned value of a function to a sql variable
* In the `FROM` using table valued functions.

### Where can I find it?

The Functions Usage report can be found in a folder named *“reports”*, in the output folder of your conversion. The name of the file itself starts with *“SqlFunctionsUsage”* so it can easily be located.

The format of the file is **.CSV**.

### What information does it contain?

The function usage report is presented in a table format, and contains the following columns:

| Column | Description |
| --- | --- |
| Function | The name of the function found in code, or its signature in the case of a UDF. |
| Count | The function's usage summarized count by migration status. |
| Category | The function category. These can be User_Defined, Built_In, or Uncategorized. |
| Migration Status | The migration status of the function invocation. These can be Pending (not transformed to Snowflake), PendingSPCall (requires manual intervention because it was converted to a stored procedure), and Transformation (successfully converted to Snowflake). |

#### Summarization

Each individual function usage is summarized using a specific criteria, that may include multiple columns to form a “composite key”. The basic grouping is made using the Category, and Migration Status columns.

---
title: SnowConvert AI - General Conversion Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/general-conversion-settings.md
section: Migrations
---

# SnowConvert AI - General Conversion Settings

## File encoding settings

This setting in SnowConvert AI determines how the tool reads and interprets the text within your source files. Choosing the correct encoding is important to ensure that all characters, especially accented letters, symbols, or text from various languages, are processed correctly during conversion. By default SnowConvert AI uses `UTF-8`.

**Manually Selecting an Encoding**

You can choose to override this automatic process by selecting a specific encoding from the dropdown menu. If you select an encoding manually (even if you select `UTF-8` explicitly), SnowConvert AI will use *only* that chosen encoding to read the files.

**Available Encoding Options**

The dropdown list allows you to force SnowConvert AI to use one of these specific encodings:

| Code Page | Name | Display Name |
| --- | --- | --- |
| 1200 | utf-16 | Unicode |
| 1201D | unicodeFFFE | Unicode (Big endian) |
| 12000 | utf-32 | Unicode (UTF-32) |
| 12001 | utf-32BE | Unicode (UTF-32 Big endian) |
| 20127 | us-ascii | US-ASCII |
| 28591 | iso-8859-1 | Western European (ISO) |
| 65000 | utf-7 | Unicode (UTF-7). *Not available in .NET 5* |
| 65001 | utf-8 | Unicode (UTF-8). ***Default encoding*** |

**Understanding `System Default (Preview)`**

When selecting the **`System Default (Preview)`** , SnowConvert AI uses a flexible approach:

1. It first tries to automatically detect the specific character encoding of each input file.
2. If auto-detection doesn’t identify the encoding, SnowConvert AI proceeds using `UTF-8`, which handles a very wide range of characters and is common for modern files.
3. As a fallback, if the `UTF-8` interpretation fails because it finds characters that aren’t valid in UTF-8, SnowConvert AI will then attempt to use your computer’s default system encoding.

It’s marked “Preview” because this behavior is experimental. System defaults can vary significantly between different computers and operating systems, potentially leading to inconsistent results or unsupported encodings.

**Recommendation**

If you encounter errors related to text interpretation or see garbled characters in your results, manually selecting the correct encoding is the best solution. If you know your files use a specific format (like `Western European`), select that. If you’re unsure but suspect encoding issues, explicitly selecting `UTF-8` is often a good starting point as it’s the most common standard for modern files.

## Materialized views conversion settings

On this page, you will find the necessary options to customize the parameters for translating Materialized Views (or join indexes in Teradata) to Dynamic Tables during your conversion.

To preserve the full functionality of Materialized Views, or Teradata’s Join Indexes, SnowConvert AI generates Dynamic Tables instead of creating a one-to-one Materialized View or transforming a Join Index into a Materialized View. This approach is necessary because Snowflake lacks certain configuration options available in other systems’ Materialized Views.

For further details on the limitations of Snowflake’s Materialized Views, please refer to [Materialized Views Limitations](https://docs.snowflake.com/en/user-guide/views-materialized#label-limitations-on-creating-materialized-views).

### Transformation

The settings defined here will apply to every instance of a Dynamic Table generated during the conversion process.

Dynamic Table Conversion Settings:

* **Target Lag**: This setting specifies the maximum allowable time for the dynamic table’s content to lag behind updates in the base table. For example, setting this to 5 minutes ensures that the data in the dynamic table is no more than 5 minutes behind the base table’s updates.
* **Warehouse**: This setting specifies the name of the Warehouse that supplies the computing resources for refreshing the dynamic table. You must have the USAGE privilege on this warehouse to create the dynamic table. By default, SnowConvert AI will use a placeholder value.

For more information, please refer to the Snowflake Dynamic Table [documentation](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table).

## **Next steps for Amazon Redshift databases**

For Amazon Redshift databases, you can use SnowConvert AI to complete the following tasks after conversion:

* [Deployment](../../../user-guide/deployment.md)
* [Data migration](../../../user-guide/data-migration.md)

---
title: SnowConvert AI - General Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md
section: Migrations
---

# SnowConvert AI - General Functional Differences

## SSC-FDM-0001

Views selecting all columns from a single table are not required in Snowflake

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

Views that only select all columns of a single table and do not have any filtering clauses are not required in Snowflake and may affect performance.

#### Code Example

##### Input Code (Oracle):

```sql
 CREATE OR REPLACE VIEW simpleView1
AS
SELECT
*
FROM
simpleTable;

CREATE OR REPLACE VIEW simpleView2
AS
SELECT
*
FROM
simpleTable GROUP BY col1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE VIEW simpleView1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
*
FROM
simpleTable;

CREATE OR REPLACE VIEW simpleView2
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
*
FROM
simpleTable
GROUP BY col1;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0002

Correlated Subquery May Have Functional Differences

### Description

This message is reported when a `Correlated Subquery` (subquery that refers to a column from the outer query) is located. This type of subqueries can, in some cases, present some functional differences in Snowflake ([Working with Subqueries](https://docs.snowflake.com/en/user-guide/querying-subqueries#correlated-vs-uncorrelated-subqueries)).

#### Code Example

##### Input Code:

```sql
 CREATE TABLE schema1.table1(column1 NVARCHAR(50), column2 NVARCHAR(50));
CREATE TABLE schemaA.tableA(columnA NVARCHAR(50), columnB NVARCHAR(50));

--Correlated Subquery
SELECT columnA FROM schemaA.tableA ta WHERE columnA = (SELECT SUM(column1) FROM schema1.table1 t1 WHERE t1.column1 = ta.columnA);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE schema1.table1 (
column1 VARCHAR(50),
column2 VARCHAR(50))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/11/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE schemaA.tableA (
columnA VARCHAR(50),
columnB VARCHAR(50))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/11/2024",  "domain": "test" }}'
;

--Correlated Subquery
SELECT
columnA
FROM
schemaA.tableA ta
WHERE
columnA =
          --** SSC-FDM-0002 - CORRELATED SUBQUERIES MAY HAVE SOME FUNCTIONAL DIFFERENCES. **
          (SELECT
          SUM(column1) FROM
          schema1.table1 t1
          WHERE
          t1.column1 = ta.columnA
          );
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0003

Conversion Rate Inconsistency

### Description

This message is reported when a conversion rate inconsistency is found on the assessment field specified. These situations are resolved automatically by SnowConvert AI, so this is just an informative warning.

> **Note:**
>
> This Informative warning will only be visible in the assessment documents and not the output code

#### Best Practices

* Despite SnowConvert AI’s ability to automatically fix the issue, you can still notify the SnowConvert AI support team by emailing [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com) and specifying the issue.

## SSC-FDM-0004

External table translated to regular table

### Description

This warning is added to clauses related to external handling. Snowflake recommends that all data should be managed inside the Snowflake data storage. For more information on this subject, see the [Snowflake data storage considerations](https://docs.snowflake.com/en/user-guide/tables-storage-considerations.html#data-storage-considerations).

#### Code Example

##### Input Code:

```sql
 CREATE EXTERNAL TABLE ext_csv_file (
    id INT,
    name TEXT,
    age INT,
    city TEXT
)
LOCATION (
    'gpfdist://192.168.1.100:8080/data/my_data.csv'
)
FORMAT 'CSV' (DELIMITER ',' HEADER);
```

##### Generated Code:

```sql
 --** SSC-FDM-0004 - EXTERNAL TABLE TRANSLATED TO REGULAR TABLE **
CREATE TABLE ext_csv_file (
       id INT,
       name TEXT,
       age INT,
       city TEXT
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "07/09/2025",  "domain": "no-domain-provided" }}'
;
```

#### Best Practices

* The data stored in files of the external tables must be somehow moved into the Snowflake database.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0005

TIME ZONE not supported for time data type

### Description

The Time data type in Snowflake does not store Timezone values

> TIME internally stores “wallclock” time, and all operations on TIME values are performed without taking any time zone into consideration. For more information, see the [Snowflake TIME data type documentation](https://docs.snowflake.com/en/sql-reference/data-types-datetime#time).

#### Example Code

##### Input Code:

```sql
 CREATE TABLE TABLE_TIME_TYPE (
    COLNAME TIME (9) WITH TIME ZONE
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TABLE_TIME_TYPE (
    COLNAME TIME(9) /*** SSC-FDM-0005 - TIME ZONE NOT SUPPORTED FOR TIME DATA TYPE ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0006

Number type column may not behave similarly in Snowflake

### Description

This functional difference message appears when a `NUMBER` Type column is being created within a Table. The reason for this is due to arithmetic differences when performing operations related to the scales of intermediate values in Snowflake which could make some operations fail. For more information please refer to [Snowflake’s post on intermediate numbers in Snowflake](https://community.snowflake.com/s/question/0D50Z00008HhSHCSA3/sql-compilation-error-invalid-intermediate-datatype-number7148) and [Number out of representable range](https://community.snowflake.com/s/article/Number-out-of-representable-range-error-occurs-during-the-multiplication-of-numeric-values).

To avoid these arithmetic issues, you can run data samplings to verify the needed precision and scales for these operations.

#### Example Codes

#### Simple Table with Number Columns

##### Input Code (Oracle):

```sql
 CREATE TABLE table1
(
column1 NUMBER,
column2 NUMBER (20, 4)
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table1
(
column1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
column2 NUMBER(20, 4) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Arithmetic Issue Examples

The next examples show how the arithmetic issues can happen when using Number columns:

##### Snowflake Code with Division Error:

```sql
 CREATE OR REPLACE TABLE number_table( column1 NUMBER(38, 19) );

INSERT INTO number_table VALUES (1);

SELECT column1 / column1 FROM number_table;
```

##### Snowflake Code with Multiplication Error:

```sql
 CREATE OR REPLACE TABLE number_table( column1 NUMBER(38, 20) );

INSERT INTO number_table VALUES (1);

SELECT column1 * column1 FROM number_table;
```

When running either `SELECT` statements Snowflake will return an error:

`Number out of representable range: type FIXEDSB16{nullable}, value 1.0000000000000000000`

This is due to the intermediate operation’s result overflowing Snowflake’s maximum capacity; reducing the number scales by 1 on each example will fix the error and work normally:

##### Snowflake Code with Division:

```sql
 CREATE OR REPLACE TABLE number_table( column1 NUMBER(38, 18) );

INSERT INTO number_table VALUES (1);

SELECT column1 / column1 FROM number_table;
```

##### Snowflake Code with Multiplication:

```sql
 CREATE OR REPLACE TABLE numbertable( column1 NUMBER(38, 19) );

INSERT INTO number_table VALUES (1);

SELECT column1 * column1 FROM number_table;
```

For this reason, SnowConvert AI sets the default scale of Numbers to 18, minimizing the number of errors when migrating.

#### Best Practices

* Verify that your operations’ intermediate values don’t exceed a scale of 37, as that is Snowflake’s maximum.
* Run Data Samplings on your data, to make sure you have the required precision and scales before running any operations.
* In most cases, after doing some data sampling or discussing with the business you might come to the conclusion that the precision can be different. For example, for `MONEY` columns a typical precision is `NUMBER(20,4)`. In snowflake you cannot easily alter a column data type, you can check this [post on our forum](https://www.mobilize.net/blog/how-to-alter-column-datatype-in-snowflake) which provides some guidance on how to alter your columns data types and preserve your data.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0007

Element with missing dependencies

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons

### Description

There is a missing dependency for an object, Snow Convert could not resolve some data types. Also there exists a possibility to have a deployment error if the dependency was not in the source code.

#### Example Code

##### Input Code:

```sql
 CREATE VIEW VIEW01 AS SELECT * FROM TABLE1;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
CREATE OR REPLACE VIEW VIEW01
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
* FROM
TABLE1;
```

> **Note:**
>
> Note that the TABLE1 definition is missing.

#### Best Practices

* Make sure all the dependencies of the objects are in the source code.
* If not, find the references to the object in the code and check if the operations are well managed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0008

On Commit not supported

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

The ON COMMIT clauses in your CREATE TABLE statement have been commented out. Snowflake does not support the ON COMMIT clause, as it’s typically used for temporary tables in other SQL dialects. If you need to manage transaction-specific behavior, consider using Snowflake’s transactions or temporary tables with explicit TRUNCATE or DROP statements instead.

#### Example Code

##### Input Code

```sql
CREATE TEMPORARY TABLE TABLE02 (COLNAME VARCHAR(20)) ON COMMIT DELETE ROWS
```

##### Generated Code

```sql
CREATE OR REPLACE TEMPORARY TABLE TABLE02 (
COLNAME VARCHAR(20))
----** SSC-FDM-0008 - ON COMMIT (DELETE ROWS) IS NOT SUPPORTED **
--ON COMMIT DELETE ROWS
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "09/22/2025",  "domain": "no-domain-provided" }}'
;
```

## SSC-FDM-0009

GLOBAL TEMPORARY TABLE functionality not supported.

### Description

Global temporary tables are considered a complex pattern, due to the fact they can come in several variations, as indicated in [Snowflake’s documentation](https://docs.snowflake.com/en/sql-reference/sql/create-table#variant-syntax).

#### Example Code

##### Input Code

```sql
 CREATE OR REPLACE GLOBAL TEMPORARY TABLE GLOBAL_TEMP_TABLE
(
    col3 INTEGER,
    col4 VARCHAR(50)
);
```

##### Generated Code

```sql
 --** SSC-FDM-0009 - GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED. **
CREATE OR REPLACE TABLE GLOBAL_TEMP_TABLE
    (
        col3 INTEGER,
        col4 VARCHAR(50)
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0010

Type changed to Date.

### Description

This message is shown when SnowConvert AI finds a DEFAULT SYSDATE and the data type is NOT a DATE or TIMESTAMP datatype. In this case, the data type is changed to DATE.

#### Example Code

##### Input Code

```sql
 CREATE TABLE "SYSDATE_DEFAULT_TEST_TABLE_1"(
 "COLUMN1" VARCHAR2(30 BYTE) DEFAULT SYSDATE
);
```

##### Generated Code

```sql
 CREATE OR REPLACE TABLE "SYSDATE_DEFAULT_TEST_TABLE_1" (
  "COLUMN1" TIMESTAMP DEFAULT CURRENT_TIMESTAMP() /*** SSC-FDM-0010 - CONVERTED FROM VARCHAR2 TO DATE FOR CURRENT_DATE DEFAULT ***/
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0011

Column Name Is Snowflake Reserved Keyword.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-0045](../conversion-issues/generalEWI.md) documentation.

### Description

Column names that are valid for the source language but are reserved keywords in Snowflake.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE TABLE T1
(
    LOCALTIME VARCHAR,
    CURRENT_USER VARCHAR
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE T1
    (
    --** SSC-FDM-0011 - COLUMN NAME 'LOCALTIME' IS A SNOWFLAKE RESERVED KEYWORD **
    "LOCALTIME" VARCHAR,
    --** SSC-FDM-0011 - COLUMN NAME 'CURRENT_USER' IS A SNOWFLAKE RESERVED KEYWORD **
    "CURRENT_USER" VARCHAR
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Best Practices

* Consider renaming the columns that use names that are not supported in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0012

Constraint Name in some constraints is not Supported.

### Description

This message is added when a constraint is of type Null, Not Null, or default and was defined with a name. Snowflake does not support the name in those constraints. For that, SnowConvert AI will remove it and add the comment.

#### Example Code

##### Input Code

```sql
 CREATE TABLE TABLE1 (
COL1 VARCHAR (10) CONSTRAINT constraintName DEFAULT ('0') NOT NULL
);
```

##### Generated Code

```sql
 CREATE OR REPLACE TABLE TABLE1 (
COL1 VARCHAR(10) DEFAULT ('0') /*** SSC-FDM-0012 - CONSTRAINT NAME 'constraintName' IN DEFAULT EXPRESSION CONSTRAINT IS NOT SUPPORTED IN SNOWFLAKE ***/ NOT NULL
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0013

Timezone expression could not be mapped

### Description

This FDM message is added to indicate scenarios where the actual value of a timezone expression cannot be determined, and therefore, the translated results might be different. When the timezone value used is a literal string, SnowConvert AI can take it and map it to its corresponding timezone value in Snowflake. However, when this value is specified by an expression, SnowConvert AI cannot get the timezone value that will be used at runtime and, therefore, cannot map this value to its corresponding Snowflake equivalent.

#### Example Code

##### Input Code (Oracle)

```sql
 SELECT TIMESTAMP '1998-12-25 09:26:50.12' AT TIME ZONE SESSIONTIMEZONE FROM DUAL;
SELECT TIMESTAMP '1998-12-25 09:26:50.12' AT TIME ZONE Expression FROM DUAL;
```

##### Generated Code

```sql
 SELECT
--** SSC-FDM-0013 - TIMEZONE EXPRESSION COULD NOT BE MAPPED, RESULTS MAY BE DIFFERENT **
TO_TIMESTAMP_LTZ( TIMESTAMP '1998-12-25 09:26:50.12')
FROM DUAL;

SELECT
--** SSC-FDM-0013 - TIMEZONE EXPRESSION COULD NOT BE MAPPED, RESULTS MAY BE DIFFERENT **
CONVERT_TIMEZONE(Expression, TIMESTAMP '1998-12-25 09:26:50.12')
FROM DUAL;
```

##### Input Code (Teradata)

```sql
 select TIMESTAMP '1998-12-25 09:26:50.12' AT TIME ZONE SESSIONTIMEZONE;
select current_timestamp at time zone CONCAT(' America ', ' Pacific');
select current_timestamp at time zone (SELECT COL1 FROM TABLE1 WHERE COL2 = 2);
```

##### Generated Code

```sql
 SELECT
CONVERT_TIMEZONE(SESSIONTIMEZONE, TIMESTAMP '1998-12-25 09:26:50.12') /*** SSC-FDM-0013 - TIMEZONE EXPRESSION COULD NOT BE MAPPED, RESULTS MAY BE DIFFERENT ***/;

SELECT
CONVERT_TIMEZONE(CONCAT(' America ', ' Pacific'), CURRENT_TIMESTAMP) /*** SSC-FDM-0013 - TIMEZONE EXPRESSION COULD NOT BE MAPPED, RESULTS MAY BE DIFFERENT ***/;

SELECT
CONVERT_TIMEZONE((
SELECT
COL1 FROM
TABLE1
WHERE COL2 = 2), CURRENT_TIMESTAMP) /*** SSC-FDM-0013 - TIMEZONE EXPRESSION COULD NOT BE MAPPED, RESULTS MAY BE DIFFERENT ***/;
```

####

> **Note:**
>
> ##### Timezone Compatibility in Oracle
>
> The majority of timezone name expressions in Oracle are directly supported in Snowflake, when this is the case, the migration will run without issues. Additionally, here is a list of which ones are not supported by Snowflake at the moment, and therefore will include the functional difference message.

* Africa/Doula
* Asia/Ulaanbaator
* Asia/Yetaterinburg
* Canada/East-Saskatchewan
* CST
* PST
* US/Pacific-New

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0014

Check statement not supported.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-0035](../conversion-issues/generalEWI.md) documentation.

### Description

***CHECK*** constraint is not supported by Snowflake but it does not affect functionally.

#### Example Code

##### Input Code Oracle :

```sql
 CREATE TABLE "Schema"."BaseTable"(
  "COLUMN1" VARCHAR2(255),
  CHECK ( COLUMN1 IS NOT NULL )
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE "Schema"."BaseTable" (
    "COLUMN1" VARCHAR(255)
--                          ,
--    --** SSC-FDM-0014 - CHECK STATEMENT NOT SUPPORTED **
--    CHECK ( COLUMN1 IS NOT NULL )
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
  ;
```

##### Input Code Teradata:

```sql
 CREATE TABLE TABLE1,
    NO FALLBACK,
    NO BEFORE JOURNAL,
    NO AFTER JOURNAL
(
    COL0 BYTEINT,
    CONSTRAINT constraint_name CHECK (COL1 < COL2)
)
```

##### Generated Code:

```sql
 CREATE TABLE TABLE1
(
    COL0 BYTEINT
--                ,
--    --** SSC-FDM-0014 - CHECK STATEMENT NOT SUPPORTED **
--    CONSTRAINT constraint_name CHECK (COL1 < COL2)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

##### Input Code SqlServer

```sql
 ALTER TABLE table_name2
ADD column_name VARCHAR(255)
CONSTRAINT constraint_name
CHECK NOT FOR REPLICATION (column_name > 1);
```

##### Generated Code:

```sql
 ALTER TABLE IF EXISTS table_name2
ADD column_name VARCHAR(255)
----** SSC-FDM-0014 - CHECK STATEMENT NOT SUPPORTED **
--CONSTRAINT constraint_name
--CHECK NOT FOR REPLICATION (column_name > 1)
                                           ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0015

​Referenced custom type in query not found.

### Description

This error happens when the definition for a Custom Type was not found or an Oracle built-in data type was not recognized by SnowConvert.

#### Example code

##### Input Code (Oracle):

```sql
 --Type was never defined
--CREATE TYPE type1;

CREATE TABLE table1
(
column1 type1
);
```

##### Generated Code:

```sql
 --Type was never defined
--CREATE TYPE type1;

CREATE OR REPLACE TABLE table1
(
column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE TYPE1 IS NOT SUPPORTED IN SNOWFLAKE ***/!!! NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}}'
;
```

#### Best Practices

* Verify that the type that the referenced data type was defined in the input code.
* Check the Snowflake data types [documentation](https://docs.snowflake.com/en/sql-reference/data-types.html) to find an equivalent for the data type.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0016

Constants are not supported by Snowflake Scripting. It was transformed to a variable.

### Description

Snowflake Scripting does not support constants. Therefore, all constants inside procedures are being transformed into variables when the Snowflake Scripting flag is active.

#### Example code

##### Oracle:

```sql
 CREATE OR REPLACE PROCEDURE p_constants
AS
my_const1 CONSTANT NUMBER := 40;
my_const2 CONSTANT NUMBER NOT NULL := 40;
BEGIN
NULL;
END;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE PROCEDURE p_constants ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
--** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
my_const1 NUMBER(38, 18) := 40;
--** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
--** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE **
my_const2 NUMBER(38, 18) := 40;
BEGIN
NULL;
END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0017

WITH SYSTEM VERSIONING clause is not supported by Snowflake

### Description

The `WITH SYSTEM VERSIONING` clause in ANSI SQL is used to enable system versioning for a table, allowing you to maintain a history of changes to the table’s data over time. This clause is not supported by Snowflake.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE t1 (
    ID INT PRIMARY KEY,
    Name VARCHAR(50),
    SysStartTime TIMESTAMP,
    SysEndTime TIMESTAMP
) WITH SYSTEM VERSIONING;
```

##### Generated Code:

```sql
 CREATE TABLE t1 (
    ID INT PRIMARY KEY,
    Name VARCHAR(50),
    SysStartTime TIMESTAMP,
    SysEndTime TIMESTAMP
)
----** SSC-FDM-0017 - WITH SYSTEM VERSIONING CLAUSE IS NOT SUPPORTED BY SNOWFLAKE. **
--WITH SYSTEM VERSIONING
                      ;
```

#### Best Practices

* You can use [Time Travel](https://docs.snowflake.com/en/user-guide/data-time-travel) in Snowflake, Time Travel enables accessing historical data (that is, data that has been changed or deleted) at any point within a defined period. It serves as a powerful tool for performing the following tasks:

  + Restoring data-related objects (tables, schemas, and databases) that might have been accidentally or intentionally deleted.
  + Duplicating and backing up data from key points in the past.
  + Analyzing data usage/manipulation over specified periods of time.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0018

CHARACTER SET clause is not supported by Snowflake.

### Description

The column option CHARACTER SET determines the allowed set of characters that can be stored in the column, this clause is not supported by Snowflake.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE TABLE01(
    COLNAME VARCHAR(20) CHARACTER SET character_specification
);
```

##### Generated Code:

```sql
 CREATE TABLE TABLE01 (
    COLNAME VARCHAR(20)
--                        --** SSC-FDM-0018 - CHARACTER SET CLAUSE IS NOT SUPPORTED BY SNOWFLAKE. **
--                        CHARACTER SET character_specification
);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0019

Semantic information could not be loaded.

### Description

This warning lets the user know that SnowConvert AI was not able to load semantic information for a specific object. This is most likely caused because if there is a duplicated object with the same name, SnowConvert AI could not load the semantic information of this object and complete the analysis.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE T1
(
    COL1 INTEGER
);

CREATE TABLE T1
(
    COL2 INTEGER
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE T1
(
    COL1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR T1. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE TABLE T1
(
    COL2 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* Check for duplicate objects in the input code since this may affect the loading of semantic information.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0020

Multiple result sets are returned in temporary tables

### Description

Snowflake Scripting procedures only allow one result set to be returned per procedure.

To replicate Teradata behavior, when there are two or more result sets to return, they are stored in temporary tables. The Snowflake Scripting procedure will return an array containing the name of the temporary tables.

#### Example code

##### Input Code (Teradata):

```sql
 REPLACE MACRO sampleMacro AS
(
    SELECT CURRENT_DATE AS DT;
    SELECT CURRENT_DATE AS DT_TWO;
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE sampleMacro ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        return_arr ARRAY := array_construct();
        tbl_nm VARCHAR;
    BEGIN
        tbl_nm := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
        CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_nm) AS
            SELECT
                CURRENT_DATE() AS DT;
        return_arr := array_append(return_arr, :tbl_nm);
        tbl_nm := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
        CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_nm) AS
            SELECT
                CURRENT_DATE() AS DT_TWO;
        return_arr := array_append(return_arr, :tbl_nm);
        --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
        RETURN return_arr;
    END;
$$;
```

#### Best Practices

* To obtain the result sets, it is necessary to run a SELECT query with the name of the temporary tables returned by the procedure.
* As much as possible, avoid procedures that return multiple result sets; instead, make them single-responsibility for more direct results.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0021

Create Index Not Supported

### Description

Due to architectural reasons, Snowflake does not support indexes so, SnowConvert AI will comment out all the code related to the creation of indexes. Snowflake automatically creates micro-partitions for every table that help speed up the performance of DML operations, the user does not have to worry about creating or managing these micro-partitions.

Usually, this is enough to have a very good query performance however, there are ways to improve it by creating data clustering keys. [Snowflake’s official page](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.html) provides more information about micro-partitions and data clustering.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE INDEX index1
ON table1(column1);
```

##### Generated Code:

```sql
 ----** SSC-FDM-0021 - CREATE INDEX IS NOT SUPPORTED BY SNOWFLAKE **
----** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table1" **
--CREATE INDEX index1
--ON table1(column1)
                  ;
```

#### Best Practices

* Data clustering might be a way to speed up query performance on tables.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0022

Window frame unit was changed to Rows

### Severity

Low

### Description

This warning is added when an unsupported Window Frame Unit was changed into Rows, leading to output differences. One example of this is the GROUPS unit, which is not supported by Snowflake.

Please note that this message is also used in cases where a Window Frame Unit is **partially** unsupported leading to it being changed, like the RANGE unit.

#### Example Code

Given the following data as an example to explain it.

| C_NAME | C_BIRTH_DAY |
| --- | --- |
| USA | 1 |
| USA | 4 |
| Poland | 9 |
| Canada | 10 |
| USA | 5 |
| Canada | 12 |
| Costa Rica | 3 |
| Poland | 4 |
| USA | 2 |
| Costa Rica | 7 |
| Costa Rica | 10 |

##### Oracle:

##### Code

```
SELECT
    C_NAME,
    SUM(C_BIRTH_DAY)
    OVER (ORDER BY C_BIRTH_DAY
    RANGE BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING) AS MAX1
FROM WINDOW_TABLE;
```

##### Result

| C_NAME | MAX1 |
| --- | --- |
| USA | - |
| USA | 1 |
| Costa Rica | 3 |
| USA | 6 |
| Poland | 6 |
| USA | 14 |
| Costa Rica | 19 |
| Poland | 26 |
| Canada | 35 |
| Costa Rica | 35 |
| Canada | 55 |

##### Snowflake:

##### Code

```sql
 SELECT
    C_NAME,
    SUM(C_BIRTH_DAY)
    OVER (ORDER BY C_BIRTH_DAY ROWS BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING /*** SSC-FDM-0022 - WINDOW FRAME UNIT 'RANGE' WAS CHANGED TO ROWS ***/) AS MAX1
    FROM
WINDOW_TABLE;
```

##### Result

| C_NAME | MAX1 |
| --- | --- |
| USA | - |
| USA | 1 |
| Costa Rica | 3 |
| USA | 6 |
| Poland | 10 |
| USA | 14 |
| Costa Rica | 19 |
| Poland | 26 |
| Canada | 35 |
| Costa Rica | 45 |
| Canada | 55 |

#### Best Practices

* Ensure deterministic ordering for rows to ensure deterministic outputs when running in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0023

A Global Temporary Table is being referenced.

### Severity

Medium

### Description

SnowConvert AI transforms Global Temporary tables into regular Create Table. References to these tables may behave different than expected.

#### Code example

##### Input

```sql
 create global temporary table t1
    (col1 varchar);
create view view1 as
    select col1 from t1;
```

##### Output

```sql
 --** SSC-FDM-0009 - GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED. **
CREATE OR REPLACE TABLE t1
    (col1 varchar)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE VIEW view1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
select col1 from
    --** SSC-FDM-0023 - A Global Temporary Table is being referenced **
    t1;
```

#### Related Issues

* SSC-FDM-0009: GLOBAL TEMPORARY TABLE functionality not supported.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0024

Functionality is not currently supported by Snowflake Scripting

> **Note:**
>
> This `FDM` is deprecated, please refer to [SSC-EWI-0058](../conversion-issues/generalEWI.md) documentation.

### Description

This error happens when a statement used in a create procedure is not currently supported by Snowflake Scripting.

#### Example code

##### Input Code (Oracle):

```sql
 CREATE OR REPLACE PROCEDURE PROC01
IS
  number_variable INTEGER;
BEGIN
  EXECUTE IMMEDIATE 'SELECT 1 FROM DUAL' INTO number_variable;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC01 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    number_variable INTEGER;
  BEGIN
    EXECUTE IMMEDIATE 'SELECT 1 FROM DUAL'
--                                           --** SSC-FDM-0024 - FUNCTIONALITY FOR 'EXECUTE IMMEDIATE RETURNING CLAUSE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING **
--                                           INTO number_variable
                                                               ;
  END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0025

Unexpected end of statement

### Description

This message is reported when SnowConvert AI encounters an unexpected end of statement during conversion. This typically occurs when the parser reaches the end of the source code while still expecting additional tokens to complete a statement. The EWI includes the line number of the original source code where the issue was detected.

#### Example Code

##### Input Code:

```sql
 UPDATE orders
SET total = 100
WHERE order_id = 1
```

##### Generated Code:

```sql
 --** SSC-FDM-0025 - UNEXPECTED END OF STATEMENT. PLEASE CHECK THE LINE 5 OF ORIGINAL SOURCE CODE. **
UPDATE orders
SET total = 100
WHERE order_id = 1;
```

#### Best Practices

* Check the specified line in the original source code for missing semicolons or incomplete statements.
* Ensure all statements are properly terminated.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0026

Type not supported by Snowflake

> **Note:**
>
> This `FDM` is deprecated, please refer to [SSC-EWI-0028](../conversion-issues/generalEWI.md) documentation.

### Description

This message appears when a type is not supported in Snowflake.

#### Example

##### Input Code (Oracle):

```sql
 CREATE TABLE MYTABLE
(
    COL1 SYS.ANYDATASET
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE MYTABLE
    (
    --** SSC-FDM-0026 - TYPE NOT SUPPORTED BY SNOWFLAKE **
        COL1 SYS.ANYDATASET
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0027

Removed next statement, not applicable in Snowflake.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This message appears when a specific statement is not applicable in Snowflake, it means that there is no Snowflake equivalent for this statement and it is no longer needed, and for that reason, it is removed from the source code. However, SnowConvert AI keeps the original statement as part of the comment at the end.

#### Example Code

##### Input Code:

```sql
 .LOGTABLE tduser.Employee_log;
   .BEGIN MLOAD TABLES Employee_Stg;
      .LAYOUT Employee;
      .FIELD in_EmployeeNo * VARCHAR(10);
      .FIELD in_FirstName * VARCHAR(30);
      .FIELD in_LastName * VARCHAR(30);
      .FIELD in_BirthDate * VARCHAR(10);
      .FIELD in_JoinedDate * VARCHAR(10);
      .FIELD in_DepartmentNo * VARCHAR(02);

      .dml label EmpLabel
  IGNORE DUPLICATE INSERT ROWS;
      INSERT INTO Employee_Stg (
         EmployeeNo,
         FirstName,
         LastName,
         BirthDate,
         JoinedDate,
         DepartmentNo
      )
      VALUES (
         :in_EmployeeNo,
         :in_FirstName,
         :in_Lastname,
         :in_BirthDate,
         :in_JoinedDate,
         :in_DepartmentNo
      );
      .IMPORT INFILE employee.txt
      FORMAT VARTEXT ','
      LAYOUT Employee
      APPLY EmpLabel;
   .END MLOAD;
LOGOFF;
```

##### Generated Code:

```sql
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***
// SnowConvert AI Helpers Code section is omitted.

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #.LOGTABLE tduser.Employee_log

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #.BEGIN MLOAD TABLES Employee_Stg

  Employee_TableName = "Employee_TEMP_TABLE"
  Employee_Columns = """in_EmployeeNo VARCHAR(10),
in_FirstName VARCHAR(30),
in_LastName VARCHAR(30),
in_BirthDate VARCHAR(10),
in_JoinedDate VARCHAR(10),
in_DepartmentNo VARCHAR(02)"""
  Employee_Conditions = """in_EmployeeNo AS in_EmployeeNo, in_FirstName AS in_FirstName, in_LastName AS in_LastName, in_BirthDate AS in_BirthDate, in_JoinedDate AS in_JoinedDate, in_DepartmentNo AS in_DepartmentNo"""
  def EmpLabel(tempTableName, queryConditions = ""):
    exec(f"""INSERT INTO Employee_Stg (EmployeeNo, FirstName, LastName, BirthDate, JoinedDate, DepartmentNo)
SELECT
   SRC.in_EmployeeNo,
   SRC.in_FirstName,
   :in_Lastname,
   SRC.in_BirthDate,
   SRC.in_JoinedDate,
   SRC.in_DepartmentNo
FROM {tempTableName} SRC {queryConditions}""")
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #.IMPORT INFILE employee.txt FORMAT VARTEXT ',' LAYOUT Employee APPLY EmpLabel

  snowconvert.helpers.import_file_to_temptable(fr"employee.txt", Employee_TableName, Employee_Columns, Employee_Conditions, ',')
  EmpLabel(Employee_TableName)
  exec(f"""DROP TABLE {Employee_TableName}""")

  if con is not None:
    con.close()
    con = None
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0028

Not supported.

> **Note:**
>
> This `FDM` is deprecated, please refer to [SSC-EWI-0021](../conversion-issues/generalEWI.md) documentation.

### Description

This message appears when a specific node or statement from the source code is not supported in Snowflake.

#### Example Code

##### Input Code:

```sql
 WITH my_av ANALYTIC VIEW AS
(USING sales_av HIERARCHIES(time_hier) ADD MEASURES(lag_sales AS (LAG(sales) OVER (HIERARCHY time_hier OFFSET 1 ))))
SELECT aValue from my_av;
```

##### Generated Code:

```sql
 ----** SSC-FDM-0028 - SubavFactoring NOT SUPPORTED IN SNOWFLAKE **
--WITH my_av ANALYTIC VIEW AS
--(USING sales_av HIERARCHIES(time_hier) ADD MEASURES(lag_sales AS (LAG(sales) OVER (HIERARCHY time_hier OFFSET 1 ))))
--SELECT aValue from my_av
                        ;
```

#### Best Practices

* If this error happens, it is because there is no Snowflake equivalent for the node that is being converted.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0029

User defined function was transformed to a Snowflake procedure.

> **Warning:**
>
> This EWI is deprecated, please refer to [SSC-EWI-0068](../conversion-issues/generalEWI.md) documentation

### Severity

Low

### Description

Snowflake user defined functions do not support the same features as Oracle or SQL Server. To maintain the functional equivalence the function is transformed to a Snowflake stored procedure. This will affect their usage in queries.

#### Example Code

##### SQL Server:

##### Input Code

```sql
 CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS INT
AS
BEGIN
    DECLARE @i int = 0, @p int;
    Select @p = COUNT(*) FROM PURCHASING.VENDOR

    WHILE (@p < 1000)
    BEGIN
        SET @i = @i + 1
        SET @p = @p + @i
    END

    IF (@i = 6)
        RETURN 1

    RETURN @p
END;
```

##### Generated Code

```sql
 --** SSC-FDM-0029 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE **
CREATE OR REPLACE PROCEDURE PURCHASING.FOO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I INT := 0;
        P INT;
    BEGIN

        Select
            COUNT(*)
        INTO
            :P
 FROM
            PURCHASING.VENDOR;
        WHILE (:P < 1000) LOOP
            I := :I + 1;
            P := :P + :I;
        END LOOP;
        IF ((:I = 6)) THEN
            RETURN 1;
        END IF;
        RETURN :P;
    END;
$$;
```

##### Oracle:

##### Input Code

```sql
 CREATE FUNCTION employee_function (param1 in NUMBER) RETURN NUMBER is
  var1    employees.employee_ID%TYPE;
  var2    employees.manager_ID%TYPE;
  var3    employees.title%TYPE;
BEGIN
  SELECT employee_ID, manager_ID, title
  INTO var1, var2, var3
  FROM employees
    START WITH manager_ID = param1
    CONNECT BY manager_ID = PRIOR employee_id;
  RETURN var1;
EXCEPTION
   WHEN no_data_found THEN RETURN param1;
END employee_function;
```

##### Generated Code

```sql
 --** SSC-FDM-0029 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE **
CREATE OR REPLACE PROCEDURE employee_function (param1 NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/14/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    var1    employees.employee_ID%TYPE;
    var2    employees.manager_ID%TYPE;
    var3    employees.title%TYPE;
  BEGIN
    SELECT employee_ID, manager_ID, title
    INTO
      :var1,
      :var2,
      :var3
    FROM
      employees
      START WITH manager_ID = :param1
    CONNECT BY
      manager_ID = PRIOR employee_id;
    RETURN :var1;
  EXCEPTION
     WHEN no_data_found THEN
      RETURN :param1;
  END;
$$;
```

### Best Practices

* Separate the inside queries to maintain the same logic.
* The source code may need to be restructured to fit with the Snowflake user-defined functions [approach](https://docs.snowflake.com/en/sql-reference/user-defined-functions.html#udfs-user-defined-functions).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0030

Replaced invalid characters for new identifier

### Description

The given identifier has invalid characters for the output language. Those characters were replaced with their UTF-8 codes.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE PROCEDURE PROC1
AS
    "VAR0" INT;
    "VAR`/1ͷ" VARCHAR(20);
    "o*/o" FLOAT;
    " . " INT;
    ". ." INT;
    "123Name" INT;
    "return" INT;
    yield INT;
    ident#10 INT;
BEGIN
    NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        "VAR0" INT;
        --** SSC-FDM-0030 - IDENTIFIER '"VAR`/1ͷ"' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES **
        VAR_u60_u2F1_uCD_B7 VARCHAR(20);
        --** SSC-FDM-0030 - IDENTIFIER '"o*/o"' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES **
        o_u2A_u2Fo FLOAT;
        --** SSC-FDM-0030 - IDENTIFIER '" . "' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES **
        _u20_u2E_u20 INT;
        --** SSC-FDM-0030 - IDENTIFIER '". ."' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES **
        _u2E_u20_u2E INT;
        "123Name" INT;
        "return" INT;
        yield INT;
        IDENT_HASHTAG_10 INT;
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0031

Dynamic Table required parameters set by default

### Description

Materialized Views (and Join Indexes in the case of Teradata) are migrated to Dynamic Tables in Snowflake. Dynamic Tables require two parameters to be set: TARGET_LAG and WAREHOUSE.

If these parameters are not set in the configuration options, they are set by default during conversion.

Read more about the [required Dynamic Tables parameters here](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table#required-parameters).

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE MATERIALIZED VIEW mv1
AS SELECT * FROM table1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE DYNAMIC TABLE mv1
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT * FROM
table1;
```

#### Best Practices

* Configure the dynamic table required parameters according to your needs.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0032

Parameter is not a literal value, transformation could not be fully applied

### Description

For multiple transformations, SnowConvert AI sometimes requires to validate the contents of a parameter, which is only possible if the parameter is a literal value.

This message is generated to warn the user that SnowConvert AI could not retrieve the value of the parameter because it was passed by reference, causing the transformation of the function or statement to not be completed.

#### Example Code

##### Input Code (Redshift):

```sql
 SELECT TO_CHAR(DATE '2001-01-01', 'YYY/MM/DD'),
TO_CHAR(DATE '2001-01-01', f)
FROM (SELECT 'YYY/MM/DD' as f);
```

##### Generated Code:

```sql
 SELECT
PUBLIC.YEAR_PART_UDF(DATE '2001-01-01', 3) || TO_CHAR(DATE '2001-01-01', '/MM/DD'),
--** SSC-FDM-0032 - PARAMETER 'format_string' IS NOT A LITERAL VALUE, TRANSFORMATION COULD NOT BE FULLY APPLIED **
TO_CHAR(DATE '2001-01-01', f)
FROM (SELECT 'YYY/MM/DD' as f);
```

#### Best Practices

* Try to provide the specified parameter as a literal value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0033

Sample clause behaves differently in Snowflake.

### Description

This message is generated to showcase the functional difference while sampling rows in Snowflake. The differences are related to the quantity of rows retrieved. When in Teradata there is the same quantity of rows in the non-deterministic output, it may change in Snowflake and return a few rows more or less. This is because a probability related topic and it is expected to behaves like that in Snowflake.

If there is a requirement of retrieving the same values and the same quantity, a deterministic output, it is recommended to use a seed in the Snowflake query.

#### Example Code

##### Input Code (Teradata):

```sql
 SELECT * FROM Employee SAMPLE 2;
SELECT * FROM Employee SAMPLE 0.25;
```

##### Generated Code:

```sql
 SELECT
    * FROM
    Employee
--** SSC-FDM-0033 - SAMPLE CLAUSE BEHAVES DIFFERENTLY IN SNOWFLAKE **
SAMPLE(2 ROWS);

SELECT
    * FROM
    Employee
--** SSC-FDM-0033 - SAMPLE CLAUSE BEHAVES DIFFERENTLY IN SNOWFLAKE **
SAMPLE(25);
```

#### Best Practices

* Try to use the seed part of the query when it is required a deterministic output.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0034

Struct converted to VARIANT. Some of its usages might have functional differences.

### Description

Snowflake does not natively support the STRUCT data type. SnowConvert AI automatically converts STRUCT to VARIANT. When used in INSERT statements, STRUCT data will be handled using `OBJECT_CONSTRUCT`. Be aware that this conversion may introduce functional differences in some use cases.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE TABLE test.structTypes
(
  COL1 STRUCT<sc1 INT64>,
  COL2 STRUCT<sc2 STRING(10)>,
  COL3 STRUCT<sc3 STRUCT<sc31 INT64, sc32 INT64>>,
  COL4 STRUCT<sc4 ARRAY<INT64>>,
  COL5 STRUCT<sc5 INT64, sc51 INT64>,
  COL7 STRUCT<sc7 INT64 OPTIONS(description = "A repeated STRING field"), sc71 BOOL>,
  COL8 STRUCT<sc8 INT64 NOT NULL, sc81 BOOL NOT NULL OPTIONS(description = "A repeated STRING field")>
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE test.structTypes
(
  COL1 VARIANT /*** SSC-FDM-0034 - STRUCT<INT64> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL2 VARIANT /*** SSC-FDM-0034 - STRUCT<STRING(10)> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL3 VARIANT /*** SSC-FDM-0034 - STRUCT<STRUCT<INT64, INT64>> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL4 VARIANT /*** SSC-FDM-0034 - STRUCT<ARRAY<INT64>> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL5 VARIANT /*** SSC-FDM-0034 - STRUCT<INT64, INT64> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL7 VARIANT /*** SSC-FDM-0034 - STRUCT<INT, BOOL> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/,
    COL8 VARIANT /*** SSC-FDM-0034 - STRUCT<INT64, BOOLEAN> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/
  )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "05/30/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0035

The INFER_SCHEMA function requires a file path without wildcards to generate the table template, replace the FILE_PATH placeholder with it

### Description

The [INFER_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) function is used in Snowflake to generate the columns definition of a table based on the structure of a file, it requires a LOCATION parameter that specifies the path to a file or folder that will be used to construct the table columns, however, this path does not support regex, meaning that the wildcard `*` character is not supported.

When the table has no columns, SnowConvert AI will check all URIS to find one that does not use wildcards and use it in the INFER_SCHEMA function, when no URI meets such criteria this FDM and a FILE_PATH placeholder will be generated, the placeholder has to be replaced with the path of one of the files referenced by the external table to generate the table columns.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2
OPTIONS(
  FORMAT='JSON',
  URIS=['gs://sc_external_table_bucket/folder_with_json/*']
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY FILE FORMAT SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT
TYPE = JSON;

CREATE OR REPLACE EXTERNAL TABLE test.my_external_table_json2 USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-0035 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_MY_EXTERNAL_TABLE_JSON2_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_json/.*'
FILE_FORMAT = (TYPE = JSON);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0036

The transformed numeric/date format may have a different behavior in Snowflake.

### Description

The transformed numeric formats to Snowflake use [Fixed position formats](../../../../../../sql-reference/sql-format-models.md). The transformed format can fail and generate a different output when there are more digits in the integer part of the number than there are digit positions in the format; all digits are printed as # to indicate overflow.

For date/time custom format specifiers, some SQL Server specifiers are mapped to Snowflake equivalents that may produce slightly different output. For example, `dddd` (full day name) maps to `DY` (abbreviated day name), uppercase `F`–`FFFFFFF` (fractional seconds without trailing zeros) map to `F1`–`F7` (Snowflake always includes trailing zeros), and `z` (UTC offset hours only) maps to `TZH`. These differences are flagged with this FDM marker so you can verify the output matches your requirements.

> **Note:**
>
> **For SQL Server migrations:** Advanced numeric format specifiers (such as `P`, `N`, `%`) are now translated by default without requiring any flag. If you are converting SQL Server code that uses custom single-character date format specifiers (for example, `%y`, `%M`, `%d`, `%H`, `%h`, `%m`, `%s`), consider enabling the `--enableFormatSpecifiersPreview` preview flag. This flag enables access to new Snowflake date/time format specifiers that provide more accurate translations of these formats. See [Preview Features Settings](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) for more details.
>
> **Note:** This requires requesting preview access in your Snowflake account through [this form](https://docs.google.com/forms/u/0/d/1-aIsixSftqhqjkpgBHAzcbSi2mk7s71TMQsRdOBppFw/viewform?edit_requested=true) to use the date/time preview features.

#### Code Example — Numeric Formats

##### Input Code:

##### Sql Server

```sql
SELECT
 FORMAT(value, '00.00') as formatted,
 FORMAT(value, '#,##0') as formatSource
 FROM MY_TABLE;
```

##### Generated Code:

##### Snowflake

```sql
SELECT
 TO_CHAR(value, 'FM9999999999900.00') /*** SSC-FDM-0036 - TRANSFORMATION OF '00.00' FORMAT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ as formatted,
 TO_CHAR(value, 'FM9,999,999,999,990') /*** SSC-FDM-0036 - TRANSFORMATION OF '#,##0' FORMAT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ as formatSource
 FROM
 MY_TABLE;
```

##### Result

```sql
#############
```

#### Code Example — Date/Time Custom Format Specifiers

##### Input Code:

##### Sql Server

```sql
SELECT FORMAT(CAST('12/12/2024' as datetime), 'dddd, MMMM dd yyyy HH:mm:ss.FFF');
```

##### Snowflake

```sql
SELECT
 TO_CHAR(TO_TIMESTAMP_NTZ('12/12/2024'), 'DY, MMMM DD YYYY HH24:MI:SS.F3') /*** SSC-FDM-0036 - TRANSFORMATION OF dddd, MMMM dd yyyy HH:mm:ss.FFF FORMAT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/;
```

> **Tip:**
>
> The generated `TO_TIMESTAMP_NTZ('12/12/2024')` relies on Snowflake’s `TIMESTAMP_INPUT_FORMAT` session parameter for date parsing. If your session uses a non-default format, you may need to set it explicitly (e.g., `ALTER SESSION SET TIMESTAMP_INPUT_FORMAT = 'MM/DD/YYYY'`).

> **Note:**
>
> The following date/time format specifiers are translated with this FDM marker due to behavioral differences between SQL Server and Snowflake:
>
> | SQL Server Specifier | Snowflake Equivalent | Difference |
> | --- | --- | --- |
> | `dddd` (full day name) | `DY` (abbreviated day name) | Snowflake `DY` returns abbreviated names (e.g., “Mon”) instead of full names (e.g., “Monday”) |
> | `F` through `FFFFFFF` (fractional seconds, no trailing zeros) | `F1` through `F7` | Snowflake `F1`–`F7` always include trailing zeros, unlike SQL Server uppercase `F` specifiers which suppress them |
> | `z` (UTC offset hours) | `TZH` | Formatting differences in timezone offset representation |

#### Best Practices

* If the numeric digit does not fit in the format, please update the format by adding more digits based on the possible input data values.
* For date/time formats flagged with this FDM, review the Snowflake output to verify it matches your application’s expected format. You may need to apply additional string manipulation to achieve the exact SQL Server behavior.

## SSC-FDM-0037

Statistics function not needed in Snowflake.

### Description

DROP, COLLECT, or HELP statistics are not needed in Snowflake. Snowflake already collects statistics used for automatic query optimization.

#### Example Code

##### Input Code:

```sql
  HELP STATISTICS TestName;
```

##### Generated Code

```sql
  ----** SSC-FDM-0037 - HELP STATISTICS NOT NEEDED. SNOWFLAKE AUTOMATICALLY COLLECTS STATISTICS. **
  --HELP STATISTICS TestName
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0038

Micro-partitioning is automatically performed on all Snowflake tables.

### Description

This message is added to the CREATE TABLE statement when a PARTITION BY clause is present. The PARTITION BY clause, which controls table partitioning in some databases, is not supported in Snowflake.

In Snowflake, all tables are automatically divided into micro-partitions—contiguous units of storage ranging from 50 MB to 500 MB of uncompressed data. This architecture enables highly granular pruning of large tables, which may consist of millions of micro-partitions.

Snowflake automatically stores metadata for each micro-partition, including:

* The range of values for each column in the micro-partition.
* The number of distinct values.
* Additional properties used for optimization and efficient query processing.

Tables are transparently partitioned based on the order of data as it is inserted or loaded. For more details, see the [Benefits of Micro-partitioning](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions#benefits-of-micro-partitioning).

#### Example Code

##### Input Code:

```sql
 CREATE TABLE orders
    (
      storeid INTEGER NOT NULL,
      productid INTEGER NOT NULL,
      orderdate DATE FORMAT 'yyyy-mm-dd' NOT NULL,
      totalorders INTEGER NOT NULL)
      PRIMARY INDEX (storeid, productid)
      PARTITION BY (RANGE_N(totalorders BETWEEN *, 100, 1000 AND *),RANGE_N(orderdate BETWEEN *, '2005-12-31' AND *) );
```

##### Generated Code

```sql
CREATE OR REPLACE TABLE orders
(
 storeid INTEGER NOT NULL,
 productid INTEGER NOT NULL,
 orderdate DATE NOT NULL,
 totalorders INTEGER NOT NULL)
-- --** SSC-FDM-0038 - MICRO-PARTITIONING IS AUTOMATICALLY HANDLED ON ALL SNOWFLAKE TABLES **
-- PARTITION BY (RANGE_N(totalorders BETWEEN *, 100, 1000 AND *)
--              ,RANGE_N(orderdate BETWEEN *, '2005-12-31' AND *) )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "09/17/2025",  "domain": "no-domain-provided" }}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0039

SQLWARNING may not be captured as an exception in Snowflake.

### Description

In source databases such as Teradata, SQLWARNING can be caught in exception handlers. Snowflake’s exception handling may not capture SQLWARNING in the same way. When a handler is configured to catch SQLWARNING, SnowConvert AI translates it, but the behavior in Snowflake may differ.

#### Example Code

##### Input Code:

```sql
 UPDATE orders SET status = 'processed';
DECLARE sqlwarning CONDITION FOR SQLSTATE '01000';
DECLARE EXIT HANDLER FOR sqlwarning
  INSERT INTO log_table VALUES ('Warning occurred');
```

##### Generated Code:

```sql
 UPDATE orders SET status = 'processed';
DECLARE sqlwarning CONDITION FOR SQLSTATE '01000';
DECLARE EXIT HANDLER FOR
  --** SSC-FDM-0039 - SQLWARNING MAY NOT BE CAPTURED AS AN EXCEPTION IN SNOWFLAKE. **
  sqlwarning
  INSERT INTO log_table VALUES ('Warning occurred');
```

#### Best Practices

* Review exception handling logic that catches SQLWARNING; the handler may not be invoked in Snowflake for the same conditions.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0040

REGEXP_SUBSTR / REGEXP_INSTR function may fail if regex argument is not POSIX

### Description

When translating regex-based functions (such as REGEXP_SUBSTR, REGEXP_INSTR) from source dialects to Snowflake, the regex argument must be valid POSIX syntax. Source dialects may support extended regex features that Snowflake does not. If the regex pattern uses non-POSIX syntax, the function may fail at runtime or produce different results.

#### Example Code

##### Input Code:

```sql
 SELECT REGEXP_INSTR(product_name, format_pattern) FROM products;
```

##### Generated Code:

```sql
 SELECT
  --** SSC-FDM-0040 - REGEXP_INSTR FUNCTION MAY FAIL IF REGEX ARGUMENT IS NOT POSIX **
  REGEXP_INSTR(product_name, format_pattern)
FROM products;
```

#### Best Practices

* Ensure regex patterns use POSIX-compliant syntax.
* Test regex functions with sample data after conversion.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0041

Default parameters were reordered to the end of the parameter list.

### Description

Snowflake requires all parameters with default values to appear after all non-default parameters. When SnowConvert AI detects a procedure whose default parameters are not at the end of the parameter list, it automatically reorders them. Code not provided to SnowConvert AI that uses positional arguments may need to be updated to match the new parameter order.

> **Note:**
>
> This FDM replaces the deprecated [SSC-EWI-0002](../conversion-issues/generalEWI.md), which previously only warned about the issue without performing the reorder.

#### Example Code

##### Input Code (SQL Server):

```sql
 CREATE PROCEDURE dbo.TestProc (@Param1 INT = 10, @Param2 VARCHAR(50))
AS
BEGIN
   SET @Param1 = @Param1;
END
```

##### Generated Code (SQL Server):

```sql
 CREATE OR REPLACE PROCEDURE dbo.TestProc
--** SSC-FDM-0041 - DEFAULT PARAMETERS WERE REORDERED TO THE END OF THE PARAMETER LIST TO MATCH SNOWFLAKE REQUIREMENTS. CALLERS USING POSITIONAL ARGUMENTS MAY NEED TO BE UPDATED **
(PARAM2 STRING, PARAM1 INT DEFAULT 10)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    PARAM1 := :PARAM1;
  END;
$$;
```

##### Input Code (Oracle):

```sql
 CREATE OR REPLACE PROCEDURE TestProc (param1 IN NUMBER DEFAULT 10, param2 IN VARCHAR2)
IS
BEGIN
   param1 := param1;
END;
```

##### Generated Code (Oracle):

```sql
 CREATE OR REPLACE PROCEDURE TestProc
--** SSC-FDM-0041 - DEFAULT PARAMETERS WERE REORDERED TO THE END OF THE PARAMETER LIST TO MATCH SNOWFLAKE REQUIREMENTS. CALLERS USING POSITIONAL ARGUMENTS MAY NEED TO BE UPDATED **
(param2 VARCHAR, param1 NUMBER(38, 18) DEFAULT 10)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    param1 := :param1;
  END;
$$;
```

##### Positional Call Site Conversion

When callers use positional arguments and the parameters have been reordered, SnowConvert AI automatically converts them to named arguments:

```sql
 CREATE PROCEDURE dbo.CallerProc
AS
BEGIN
   EXEC dbo.TestProc 5, 'hello';
END
```

```sql
 CREATE OR REPLACE PROCEDURE dbo.CallerProc ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    CALL dbo.TestProc(PARAM1 => 5, PARAM2 => 'hello');
  END;
$$;
```

#### Best Practices

* Review all callers of the affected procedure. If positional arguments are used, update them to match the new parameter order or convert them to named arguments.
* Consider using named arguments (e.g., `param1 => value`) instead of positional arguments to avoid issues with parameter ordering.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-0042

Interval qualifier changed to DAY TO SECOND, Snowflake does not support mixing year to month and day to second time parts.

### Description

This FDM is emitted when the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled and a source INTERVAL type is converted to `INTERVAL DAY TO SECOND` regardless of its original qualifier. This applies to languages such as BigQuery, PostgreSQL, Greenplum, and Netezza, where **all** INTERVAL types (whether unqualified, `DAY TO SECOND`, or `YEAR TO MONTH`) are normalized to `INTERVAL DAY TO SECOND`.

The reason for this transformation is that these languages allow mixing intervals from the `YEAR TO MONTH` and `DAY TO SECOND` families in all kinds of operations (addition, subtraction, inserts, updates, and so on). This behavior breaks the ANSI SQL standard for the INTERVAL data type, which Snowflake’s implementation is based on. To avoid runtime errors caused by mixing the two interval families, SnowConvert AI forces `DAY TO SECOND` across the entire migration. This approach ensures there are no exceptions related to mixed interval families while minimizing the precision loss inherent to such a type change.

For languages that enforce explicit qualifiers and do not allow mixing (such as Oracle and Teradata), the original qualifier is preserved and this FDM is not emitted.

For more details on how interval types are handled across languages, see the [Interval Data Types](../../../../translation-references/general/interval-data-types.md) translation reference.

#### Example Code

##### Input Code (BigQuery):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE intervals (
    COL1 INTERVAL
);
```

##### Generated Code (BigQuery):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE intervals (
    COL1 INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/
)
;
```

##### Input Code (PostgreSQL):

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  CAST(someColumn AS INTERVAL YEAR TO MONTH),
  someColumn::INTERVAL YEAR TO MONTH
FROM someTable;
```

##### Generated Code (PostgreSQL):

```sql
-- Additional Params: --UseIntervalDatatype
SELECT
  CAST(someColumn AS INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/),
  someColumn :: INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/
FROM someTable;
```

#### Best Practices

* Review all converted INTERVAL columns and expressions in the migrated code. Because the source language allows mixing `YEAR TO MONTH` and `DAY TO SECOND` interval families, every INTERVAL is normalized to `DAY TO SECOND` to prevent runtime errors in Snowflake. Verify that this choice is acceptable for your data.
* If any source columns store year-to-month durations exclusively (for example, subscription lengths or contract terms), the `DAY TO SECOND` normalization may lose semantic meaning. Consider manually changing those specific columns to `INTERVAL YEAR TO MONTH` after migration, but only after confirming that they are never mixed with `DAY TO SECOND` intervals in your queries.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - General Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md
section: Migrations
---

# SnowConvert AI - General Issues

## SSC-EWI-0001

Unrecognized token on the line of the source code.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Critical

#### Description

This issue occurs when there is an error while parsing the source code that is being converted. It means there is a source code syntax error or a specific statement of the code is not being recognized yet.

#### Example Code

The following example illustrates different parsing error scenarios where invalid syntax is placed in the input. Notice how the message varies between every scenario, these contents may be helpful on isolating and fixing the issue. For more information check “Message Contents” below.

##### Input Code:

```sql
 CRATE;

CREATE TABLE someTable(col1 INTEGER, !);

CREATE TABRE badTable(col1 INTEGER);

CREATE PROCEDURE proc1()
BEGIN
    CREATE TABLE badEmbeddedTable(col1 INTEGER);
END;
```

##### Generated Code:

```sql
 -- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '1' COLUMN '1'. **
--CRATE
     ;

CREATE OR REPLACE TABLE someTable (
    col1 INTEGER
--                ,

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '3' COLUMN '37' OF THE SOURCE CODE STARTING AT '!'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS ',' ON LINE '3' COLUMN '35'. FAILED TOKEN WAS '!' ON LINE '3' COLUMN '37'. **
--                  !
                   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/04/2024" }}'
;

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '5' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CREATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CREATE' ON LINE '5' COLUMN '1'. **
--CREATE TABRE badTable(col1 INTEGER)
                                   ;

CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/04/2024" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CREATE OR REPLACE TABLE badEmbeddedTable (
            col1 INTEGER);
    END;
$$;
```

#### Message Contents

1. Starting clause: Specifies the starting location (line, column, and ‘text’) of the unrecognized code. The code will be commented from the ‘text’ element onward for every unrecognized element until the parser locates a possible recovery point.
2. Expected grammar clause: Specifies the type of grammar that the parser was expecting. Check if the commented code has a matching type of the expected grammar.
3. Last matching token clause (OPTIONAL): May appear if the unrecognized code was partially recognized. This signals the point up until the parser recognized valid elements, so check the following tokens in the commented code to make sure they are valid.
4. Failed Token clause (OPTIONAL): May only be present when a “Last matching Token clause” is also present. This represents at which point the parser ultimately determined the code is invalid or not recognized. Make sure this element can be placed in this syntactical location.

#### Deprecated Message Contents

> **Note:**
>
> The items in this list are not actively in usage, and are left here for historical purposes.

1. Recovery Code (DEPRECATED): It is intended to be used as an error code, and may be supplied for better support during parser upgrade requests. It represents how the parser triggered its recovery mechanism.

#### Best Practices

* Check if the source code has the correct syntax.
* The message can be used to isolate and solve the issue.
* If the syntax is not supported, it may be manually changed to a supported syntax.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0002

Default Parameters May Need To Be Reordered

> **Note:**
>
> This EWI is deprecated. SnowConvert AI now automatically reorders default parameters to the end of the parameter list instead of emitting this warning. Please refer to [SSC-FDM-0041](../functional-difference/generalFDM.md) for the updated behavior.

### Severity

Medium

#### Description

Default parameters may need to be reordered. Snowflake only supports default parameters at the end of the parameter declarations.

#### Example Code

##### Input Code:

```sql
 CREATE PROCEDURE MySampleProc
    @Param1 NVARCHAR(50) = NULL,
    @Param2 NVARCHAR(10),
    @Param3 NVARCHAR(10) = NULL,
    @Param4 NVARCHAR(10)
AS
    SELECT 1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE MySampleProc
!!!RESOLVE EWI!!! /*** SSC-EWI-0002 - DEFAULT PARAMETERS MAY NEED TO BE REORDERED. SNOWFLAKE ONLY SUPPORTS DEFAULT PARAMETERS AT THE END OF THE PARAMETERS DECLARATIONS ***/!!!
(PARAM1 STRING DEFAULT NULL, PARAM2 STRING, PARAM3 STRING DEFAULT NULL, PARAM4 STRING)
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ProcedureResultSet RESULTSET;
    BEGIN
        ProcedureResultSet := (
        SELECT 1);
        RETURN TABLE(ProcedureResultSet);
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0003

System column for built-in object has not been translated.

### Severity

Medium

#### Description

This EWI is generated when SnowConvert AI maps a built-in system object (table, view) to the Snowflake-equivalent object, but there is no map for one of its internal columns.

#### Code Example

**Input Code:**

```sql
select name,
       parent_object_id
    from sys.tables;
```

**Output Code:**

##### Snowflake

```sql
select
    TABLE_NAME,
       parent_object_id !!!RESOLVE EWI!!! /*** SSC-EWI-0003 - SYSTEM COLUMN 'parent_object_id' FOR BUILT-IN OBJECT 'SYS.TABLES' HAS NOT BEEN TRANSLATED. ***/!!!
    from
    INFORMATION_SCHEMA.TABLES;
```

## SSC-EWI-0005

### Severity

Critical

#### Description

This issue appears when an unexpected transformation error occurs while trying to convert the source code and the output code file can not be generated.

#### Best Practices

* Check the error log file for more information about the issue.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0006

The current date/numeric format may have a different behavior in Snowflake.

### Severity

Medium

#### Description

This error is added because Snowflake does not support date/numeric formats in some functions as is supported in the source language.

> **Note:**
>
> **For SQL Server migrations:** Advanced numeric format specifiers (such as `P`, `N`, `%`) are now translated by default without requiring any flag. If you are converting SQL Server code that uses custom single-character date format specifiers (such as `%y`, `%M`, `%d`, `%H`, `%h`, `%m`, `%s`), consider enabling the
> `--enableFormatSpecifiersPreview` preview flag. This flag enables access to new Snowflake date/time format specifiers that provide more accurate
> translations of these formats. See [Preview Features Settings](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) for more details.
>
> **Note:** You must [request preview access in your Snowflake account](https://docs.google.com/forms/u/0/d/1-aIsixSftqhqjkpgBHAzcbSi2mk7s71TMQsRdOBppFw/viewform?edit_requested=true) to use the date/time preview features.

The following format elements are the ones that may behave differently in [Snowflake](https://docs.snowflake.com/en/sql-reference/functions-conversion#label-date-time-format-conversion):

#### Redshift Date / Time

| Format Element | Description |
| --- | --- |
| HH | Hour of day (01–12). |
| MS | Millisecond (000–999). |
| US | Microsecond (000000–999999). |
| SSSS, SSSSS | Seconds past midnight (0–86399). |
| Y,YYY | Year (4 or more digits) with comma. |
| YYY | Last 3 digits of year. |
| Y | Last digit of year. |
| IYYY | ISO 8601 week-numbering year(4 or more digits). |
| IYY | Last 3 digits of ISO 8601 week-numbering year. |
| IY | Last 2 digits of ISO 8601 week-numbering year. |
| I | Last digit of ISO 8601 week-numbering year. |
| BC, bc, AD or ad | Era indicator (without periods). |
| B.C., b.c., A.D. or a.d. | Era indicator (with periods). |
| MONTH | Full upper case month name (blank-padded to 9 chars). |
| Month | Full capitalized month name (blank-padded to 9 chars). |
| month | Full lower case month name (blank-padded to 9 chars). |
| DAY | Full upper case day name (blank-padded to 9 chars). |
| Day | Full capitalized day name (blank-padded to 9 chars). |
| day | Full lower case day name (blank-padded to 9 chars). |
| DDD | Day of year (001–366). |
| IDDD | Day of ISO 8601 week-numbering year (001–371; day 1 of the year is Monday of the first ISO week). |
| D | Day of the week, Sunday (1) to Saturday (7). |
| ID | ISO 8601 day of the week, Monday (1) to Sunday (7). |
| W | Week of month (1–5) (the first week starts on the first day of the month). |
| WW | Week number of year (1–53) (the first week starts on the first day of the year). |
| IW | Week number of ISO 8601 week-numbering year (01–53; the first Thursday of the year is in week 1). |
| CC | Century (2 digits) (the twenty-first century starts on 2001-01-01). |
| J | Julian Date. |
| Q | Quarter. |
| RM | Month in upper case Roman numerals (I–XII; I=January). |
| rm | Month in lower case Roman numerals (i–xii; i=January). |
| TZ | Upper case time-zone abbreviation (only supported in `to_char`). |
| tz | Lower case time-zone abbreviation (only supported in `to_char`). |
| TZH | Time-zone hours. |
| TZM | Time-zone minutes. |
| OF | Time-zone offset from UTC (only supported in `to_char`). |
| FM prefix | Fill mode (suppress leading zeroes and padding blanks). |
| TH suffix | Upper case ordinal number suffix. |
| th suffix | Lower case ordinal number suffix. |
| FX prefix | Fixed format global option (see usage notes). |
| TM prefix | Translation mode (use localized day and month names based on [lc_time](https://www.postgresql.org/docs/current/runtime-config-client.html#GUC-LC-TIME)). |
| SP suffix | Spell mode. |

> **Note:**
>
> For more information please refer to [PostgreSQL Date/Time formats](https://www.postgresql.org/docs/current/functions-formatting.html#FUNCTIONS-FORMATTING-DATETIME-TABLE).

> **Note:**
>
> The transformation of the TO_CHAR function supports most of this format elements, for a full list of suppported format elements and their equivalent mappings please refer to the [Translation specification](../../../../translation-references/redshift/redshift-functions.md)

### BigQuery Format

Review the [BigQuery format elements reference](https://cloud.google.com/bigquery/docs/reference/standard-sql/format-elements).

### Numeric

| Pattern | Description |
| --- | --- |
| PR | negative value in angle brackets |
| RN | Roman numeral (input between 1 and 3999) |
| TH or th | ordinal number suffix |
| V | shift specified number of digits (see notes) |
| EEEE | exponent for scientific notation |

> **Note:**
>
> For more information please refer to [PostgreSQL Numeric formats](https://www.postgresql.org/docs/current/functions-formatting.html#FUNCTIONS-FORMATTING-NUMERIC-TABLE).

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 SELECT
   DATE_TRUNC('decade', TIMESTAMP '2017-03-17 02:09:30'),
   DATE_TRUNC('century', TIMESTAMP '2017-03-17 02:09:30'),
   DATE_TRUNC('millennium', TIMESTAMP '2017-03-17 02:09:30');
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
      !!!RESOLVE EWI!!! /*** SSC-EWI-PG0005 - DECADE FORMAT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      DATE_TRUNC('decade', TIMESTAMP '2017-03-17 02:09:30'),
      !!!RESOLVE EWI!!! /*** SSC-EWI-PG0005 - CENTURY FORMAT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      DATE_TRUNC('century', TIMESTAMP '2017-03-17 02:09:30'),
      !!!RESOLVE EWI!!! /*** SSC-EWI-PG0005 - MILLENNIUM FORMAT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      DATE_TRUNC('millennium', TIMESTAMP '2017-03-17 02:09:30');
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0007

### Severity

Critical

#### Description

This error appears when an error occurs in writing the output file.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0008

COLLATE clause may have a different behavior in Snowflake

### Severity

Medium

#### Description

This warning is added when the collate clause is used as a column option because it is supported in Snowflake, but behaves differently in the collate specification. To verify which specifiers are supported in Snowflake, see [Collate specifications](https://docs.snowflake.com/en/sql-reference/collation#label-collation-specification).

#### Example Code

##### Input Code:

```sql
 CREATE TABLE TABLE01 (
    col1 text COLLATE "C"
);
```

##### Generated Code:

```sql
 CREATE TABLE TABLE01 (
    col1 text
              !!!RESOLVE EWI!!! /*** SSC-EWI-0008 - COLLATE CLAUSE MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!! COLLATE "C"
);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0009

Regexp_Substr Function only supports POSIX regular expressions.

### Severity

Low

#### Description

Currently, there is no support in Snowflake for extended regular expression beyond the [POSIX Basic Regular Expression syntax](https://en.wikipedia.org/wiki/Regular_expression#POSIX_basic_and_extended).

This EWI is added every time a function call to *REGEX_SUBSTR, REGEX_REPLACE,* or *REGEX_INSTR* is transformed to Snowflake to warn the user about possible unsupported regular expressions. Some of the features **not supported** are lookahead, lookbehind, and non-capturing groups.

#### Example Code

##### Input Code:

```sql
 SELECT REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0009 - REGEXP_SUBSTR FUNCTION ONLY SUPPORTS POSIX REGULAR EXPRESSIONS ***/!!!
REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

#### Best Practices

* Check the regular expression used in each case to determine whether it needs manual intervention. More information about expanded regex support and alternatives in Snowflake can be found [**here**](https://community.snowflake.com/s/question/0D50Z00007ENLKsSAP/expanded-support-for-regular-expressions-regex)**.**
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0010

### Severity

Critical

#### Description

This error appears when there is not a transformation rule for a specific procedure statement.

#### Best Practices

* Check if the procedure statement is correct.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0011

### Severity

High

#### Description

This error appears when there is an unexpected end of the statement in the source code and the error cannot be handled correctly.

#### Best Practices

* Check if the source code is incomplete or if the statement that is being converted ends correctly.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0012

### Severity

High

#### Description

This error appears when there is an unexpected end of the statement in the source code

#### Example Code

##### Input Code:

```sql
 CREATE VOLATILE SET TABLE VOLATILETABLE
(
    COL1                    INTEGER,
    COL2                    INTEGER,
    COL3                    INTEGER
)
ON COMMIT PRESERVE ROWS;
UPDATE TABLE2 as T2
SET T2.COL1 + VOLATILETABLE.COL1
WHERE T2.COL2 = VOLATILETABLE.COL2
    AND T2.COL3 = VOLATILETABLE.COL3
    AND     T2.COL4 = ( SELECT MAX(T3.COL1)
                                   FROM
                                   TABLE3 T3
                                   WHERE T3.COL1 = T2.COL1);
```

##### Generated Code:

```sql
 --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
CREATE OR REPLACE TEMPORARY TABLE VOLATILETABLE
(
    COL1 INTEGER,
    COL2 INTEGER,
    COL3 INTEGER
)
--    --** SSC-FDM-0008 - ON COMMIT NOT SUPPORTED **
--ON COMMIT PRESERVE ROWS
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "TABLE2", "TABLE3" **
UPDATE TABLE2 AS T2
    SET
        --** SSC-FDM-0025 - UNEXPECTED END OF STATEMENT. PLEASE CHECK THE LINE 9 OF ORIGINAL SOURCE CODE. **
        T2.COL1 + VOLATILETABLE.COL1
    FROM
        VOLATILETABLE
        WHERE T2.COL2 = _VOLATILETABLE.COL2
            AND T2.COL3 = _VOLATILETABLE.COL3
            AND     T2.COL4 = (
                SELECT
                    MAX(T3.COL1)
                                                  FROM
                    TABLE3 T3
                                                  WHERE T3.COL1 = T2.COL1);
```

#### Recommendation

* Check if the source code is incomplete or if the statement that is being converted ends correctly.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0013

### Severity

Critical

#### Description

This error appears when an exception is raised while converting an item from the source code.

#### Recommendation

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0014

### Severity

Critical

#### Description

This error appears when the body of a specific procedure statement is not generated.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0015

Pivot/Unpivot multiple function not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This section describes the different issues that could be triggered by PIVOT and UNPIVOT clauses. The not-supported scenarios are presented in the following table.

|  | PIVOT | UNPIVOT | ORACLE | TERADATA |
| --- | --- | --- | --- | --- |
| MULTIPLE COLUMN | X | X | X | X |
| RENAME COLUMN | X | X | X | X |
| MULTIPLE FUNCTION | X |  | X | X |
| WITH CLAUSE | X |  |  | X |
| XML OUTPUT FORMAT | X |  | X |  |
| IN CLAUSE SUBQUERY | X |  | X | X |
| IN CLAUSE ANY SEQUENCE | X |  | X |  |
| INCLUDE/EXCLUDE NULLS |  | X | X | X |

#### MULTIPLE COLUMN

Multiple columns are not supported by PIVOT and UNPIVOT clauses.

##### Example Code

##### Input Code:

```sql
 SELECT * FROM star1p UNPIVOT ((sales,cogs)  FOR  yr_qtr
    IN ((Q101Sales, Q101Cogs) AS 'Q101A',
        (Q201Sales, Q201Cogs) AS 'Q201A',
        (Q301Sales, Q301Cogs) AS 'Q301A')) AS Tmp;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT * FROM
    star1p
           !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT MULTIPLE COLUMN NOT SUPPORTED ***/!!!
           UNPIVOT ((sales,cogs)  FOR  yr_qtr
    !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT MULTIPLE COLUMN NOT SUPPORTED ***/!!!
    IN ((Q101Sales, Q101Cogs) AS 'Q101A',
        (Q201Sales, Q201Cogs) AS 'Q201A',
        (Q301Sales, Q301Cogs) AS 'Q301A')) AS Tmp;
```

#### RENAME COLUMN

Renaming columns with aliases is not supported in Snowflake UNPIVOT clauses. SnowConvert will remove aliases for functions or columns to create a valid query and check that this change does not affect the original functionality.

For PIVOT, the use of column aliases is only supported in SnowConvert AI for Teradata if the following two conditions are true: all expressions inside the IN clause have an alias associated and SnowConvert AI has information about the columns that will be generated as a result, either by providing the table definition or using a subquery with an explicit column list as input to the clause.

##### Example Code

##### Input Code:

```
CREATE TABLE star1(
	country VARCHAR(20),
	state VARCHAR(10),
	yr INTEGER,
	qtr VARCHAR(3),
	sales INTEGER,
	cogs INTEGER
);

--SAMPLE 1
SELECT * FROM db1.star1p UNPIVOT (column1  FOR  for_column
    IN (col1 AS 'as_col1', col2 AS 'as_col2')) Tmp;

--SAMPLE 2
SELECT *
FROM star1 PIVOT (
	SUM(sales) as ss1 FOR qtr
    IN ('Q1' AS Quarter1,
    	'Q2' AS Quarter2,
        'Q3' AS Quarter3)
)Tmp;

--SAMPLE 3
SELECT
	*
FROM (
	SELECT
		country,
		state,
		yr,
		qtr,
		sales,
		cogs
	FROM star1 ) A
PIVOT (
	SUM(sales) as ss1 FOR qtr
    IN ('Q1' AS Quarter1,
    	'Q2' AS Quarter2,
        'Q3' AS Quarter3)
)Tmp;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE star1 (
	country VARCHAR(20),
	state VARCHAR(10),
	yr INTEGER,
	qtr VARCHAR(3),
	sales INTEGER,
	cogs INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "08/14/2024" }}'
;

--SAMPLE 1
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "db1.star1p" **
SELECT
	* FROM db1.star1p UNPIVOT (column1  FOR  for_column
	    IN (col1 AS 'as_col1', col2 AS 'as_col2')) Tmp !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PivotUnpivotTableReference' NODE ***/!!!;

--SAMPLE 2
SELECT
	*
FROM
	star1 PIVOT (
	SUM(sales) FOR qtr IN ('Q1',
	   	'Q2',
	       'Q3')) Tmp (
		country,
		state,
		yr,
		cogs,
		Quarter1_ss1,
		Quarter2_ss1,
		Quarter3_ss1
	);

--SAMPLE 3
	SELECT
		*
	FROM (
		SELECT
				country,
				state,
				yr,
				qtr,
				sales,
				cogs
			FROM
				star1
	) A
	PIVOT (
		SUM(sales) FOR qtr IN ('Q1',
	    'Q2',
	        'Q3')) Tmp (
		country,
		state,
		yr,
		cogs,
		Quarter1_ss1,
		Quarter2_ss1,
		Quarter3_ss1
	);
```

#### MULTIPLE FUNCTION

Multiple function is not supported for PIVOT clauses, sometimes multiple function queries could be re-written using case statements, see the following Teradata sample for more information <https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/L0kKSOrOeu_68mcW3o8ilw>

##### Example Code

##### Input Code:

```sql
 SELECT *
FROM STAR1 PIVOT(SUM(COL1), SUM(COL2) FOR YR IN ('Y1', 'Y2', 'Y3'))TMP;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
*
FROM
STAR1
      !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT MULTIPLE FUNCTION NOT SUPPORTED ***/!!!
      PIVOT(SUM(COL1), SUM(COL2) FOR YR IN ('Y1', 'Y2', 'Y3'))TMP;
```

#### WITH CLAUSE

Teradata PIVOT has an optional WITH clause, this is not allowed in Snowflake’s PIVOT.

##### Example Code

##### Input Code:

```sql
 SELECT *
FROM STAR1 PIVOT(SUM(COL1) FOR YR IN ('Y1', 'Y2', 'Y3') WITH SUM(*) AS withalias)TMP;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
*
FROM
STAR1 PIVOT(SUM(COL1) FOR YR IN ('Y1', 'Y2', 'Y3')
                                                   !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT WITH CLAUSE NOT SUPPORTED ***/!!!
 WITH SUM(*) AS withalias)TMP;
```

#### XML OUTPUT FORMAT

XML output for the PIVOT clause is not supported by Snowflake.

##### Example Code

##### Input Code:

```sql
 SELECT * FROM   (SELECT product_code, quantity FROM pivot_test)
PIVOT XML (SUM(quantity)
FOR (product_code) IN ('A','B','C'));
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT * FROM
(
SELECT product_code, quantity FROM
pivot_test)
!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT XML OUTPUT FORMAT NOT SUPPORTED ***/!!!
PIVOT (SUM(quantity) FOR product_code IN ( 'A', 'B', 'C'));
```

#### IN CLAUSE SUBQUERY

Subqueries for the IN clause are not supported.

##### Example Code

##### Input Code:

```sql
 SELECT * FROM s1 PIVOT(SUM(COL1) FOR FORCOL IN (SELECT SELCOL FROM S2))DT;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT * FROM
s1 PIVOT (SUM(COL1) FOR FORCOL
                               !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT IN CLAUSE SUBQUERY NOT SUPPORTED ***/!!! IN (SELECT SELCOL FROM
                               S2));
```

#### IN CLAUSE ANY SEQUENCE

This error is triggered when ANY keyword is used in the IN clause. This is currently not supported.

##### **Example Code**

##### Input Code:

```sql
 SELECT * FROM (SELECT product_code, quantity FROM pivot_test)
PIVOT (SUM(quantity)
FOR product_code IN (ANY, ANY, ANY));
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT * FROM (SELECT product_code, quantity FROM
pivot_test)
PIVOT (SUM(quantity)
FOR product_code
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT IN CLAUSE ANY SEQUENCE NOT SUPPORTED ***/!!!
 IN (ANY, ANY, ANY));
```

#### INCLUDE/EXCLUDE NULLS

INCLUDE NULLS or EXCLUDE NULLS are not valid options for UNPIVOT clauses in Snowflake.

##### Example Code

##### Input Code:

```sql
 SELECT * FROM db1.star1p UNPIVOT INCLUDE NULLS (column1  FOR  for_column IN (col1, col2)) Tmp;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT * FROM
db1.star1p
!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT INCLUDE NULLS NOT SUPPORTED ***/!!!
UNPIVOT ( column1 FOR for_column IN (
col1,
col2)) Tmp;
```

#### Best Practices

* Re-write the query if possible, otherwise, no additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0016

Snowflake does not support the options clause.

### Severity

Medium

#### Description

This EWI is added to DDLs statements when the `OPTIONS` has unsupported options by Snowflake.

#### Code Example

**Input Code:**

##### BigQuery

```sql
 CREATE VIEW my_view
OPTIONS (
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
) AS
SELECT column1, column2
FROM my_table;
```

**Output Code:**

##### Snowflake

```sql
 CREATE VIEW my_view
!!!RESOLVE EWI!!! /*** SSC-EWI-0016 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: EXPIRATION_TIMESTAMP, PRIVACY_POLICY. ***/!!!
OPTIONS(
  expiration_timestamp=TIMESTAMP "2026-01-01 00:00:00 UTC",
  privacy_policy='{"aggregation_threshold_policy": {"threshold": 50, "privacy_unit_columns": "ID"}}'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
AS
SELECT column1, column2
FROM
  my_table;
```

## SSC-EWI-0020

CUSTOM UDF INSERTED.

### Severity

Low

### Summary

There are several User-Defined Functions (UDF) provided by SnowConvert AI used to reproduce source language behaviors that are not supported by Snowflake, functionality and descriptions are detailed below.

UDFs can be found in “UDF Helpers” folder created in the output path after the migration has occurred.

#### Best Practices

* Check if the UDF Helpers folder is being created with files inside it.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0021

Not supported.

### Severity

Medium

#### Description

This message appears when a specific node or statement from the source code is not supported in Snowflake.

#### Example Code

##### Input Code:

```sql
 WITH my_av ANALYTIC VIEW AS
(USING sales_av HIERARCHIES(time_hier) ADD MEASURES(lag_sales AS (LAG(sales) OVER (HIERARCHY time_hier OFFSET 1 ))))
SELECT aValue from my_av;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - SubavFactoring NOT SUPPORTED IN SNOWFLAKE ***/!!!
WITH my_av ANALYTIC VIEW AS
(USING sales_av HIERARCHIES(time_hier) ADD MEASURES(lag_sales AS (LAG(sales) OVER (HIERARCHY time_hier OFFSET 1 ))))
SELECT aValue from my_av;
```

#### Best Practices

* If this error happens is because there is no Snowflake equivalent for the node that is being converted.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0022

One or more identifiers in a specific statement are considered parameters by default.

> **Warning:**
>
> The EWI is only generated when Javascript is the target language for Stored Procedures. This is a deprecated translation feature, as Snowflake Scripting is the recommended target language for Stored Procedures.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This error is used to report that one or more identifiers in a specific statement are considered parameters by default.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: -t javascript
CREATE MACRO SAME_MACRO_COLUMN_AND_PARAMATERS (
LOAD_USER_ID (VARCHAR (32), CHARACTER SET LATIN),
UPDATE_USER_ID (VARCHAR (32), CHARACTER SET LATIN)
) AS (
UPDATE TABLE1 SET LOAD_USER_ID = :LOAD_USER_ID, UPDATE_USER_ID = :UPDATE_USER_ID;
INSERT INTO TABLE1 (LOAD_USER_ID, UPDATE_USER_ID) VALUES (:LOAD_USER_ID, :UPDATE_USER_ID);
DELETE FROM TABLE1 WHERE :LOAD_USER_ID = LOAD_USER_ID;
);
```

##### Generated Code:

```sql
-- Additional Params: -t javascript
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
CREATE OR REPLACE PROCEDURE SAME_MACRO_COLUMN_AND_PARAMATERS (LOAD_USER_ID VARCHAR (32), UPDATE_USER_ID VARCHAR (32))
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
// REGION SnowConvert AI Helpers Code
var HANDLE_NOTFOUND;
var _RS, ROW_COUNT, _ROWS, MESSAGE_TEXT, SQLCODE = 0, SQLSTATE = '00000', ERROR_HANDLERS, ACTIVITY_COUNT = 0, INTO, _OUTQUERIES = [], DYNAMIC_RESULTS = -1;
var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
var fixBind = function (arg) {
arg = arg == undefined ? null : arg instanceof Date ? formatDate(arg) : arg;
return arg;
};
var EXEC = function (stmt,binds,noCatch,catchFunction,opts) {
try {
binds = binds ? binds.map(fixBind) : binds;
_RS = snowflake.createStatement({
sqlText : stmt,
binds : binds
});
_ROWS = _RS.execute();
ROW_COUNT = _RS.getRowCount();
ACTIVITY_COUNT = _RS.getNumRowsAffected();
HANDLE_NOTFOUND && HANDLE_NOTFOUND(_RS);
if (INTO) return {
INTO : function () {
return INTO();
}
};
if (_OUTQUERIES.length < DYNAMIC_RESULTS) _OUTQUERIES.push(_ROWS.getQueryId());
if (opts && opts.temp) return _ROWS.getQueryId();
} catch(error) {
MESSAGE_TEXT = error.message;
SQLCODE = error.code;
SQLSTATE = error.state;
var msg = `ERROR CODE: ${SQLCODE} SQLSTATE: ${SQLSTATE} MESSAGE: ${MESSAGE_TEXT}`;
if (catchFunction) catchFunction(error);
if (!noCatch && ERROR_HANDLERS) ERROR_HANDLERS(error); else throw new Error(msg);
}
};
// END REGION

EXEC(`UPDATE TABLE1
   SET
      LOAD_USER_ID = :1,
      UPDATE_USER_ID = :2`,[LOAD_USER_ID,UPDATE_USER_ID]);
// ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
EXEC(`INSERT INTO TABLE1 (LOAD_USER_ID, UPDATE_USER_ID)
VALUES (:1, :2)`,[LOAD_USER_ID,UPDATE_USER_ID]);
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Delete' NODE ***/!!!
//DELETE FROM
//   TABLE1
//WHERE
//   UPPER(RTRIM(:LOAD_USER_ID)) = UPPER(RTRIM(LOAD_USER_ID))
null
$$;
```

#### Best Practices

* Make sure all the dependencies(tables and views) related to the procedure statement are being migrated.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0023

Performance Review - A loop contains an insert, delete, or update statement.

> **Warning:**
>
> The EWI is only generated when Javascript is the target language for Stored Procedures. This is a deprecated translation feature, as Snowflake Scripting is the recommended target language for Stored Procedures.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning indicates a possible consideration that the user should have in terms of performance.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: -t javascript
REPLACE PROCEDURE Database1.Proc1()
BEGIN
    DECLARE lNumber INTEGER DEFAULT 1;
    FOR class1 AS class2 CURSOR FOR
      SELECT COL0,
      TRIM(COL1) AS COL1ALIAS,
      TRIM(COL2),
      COL3
      FROM someDb.prefixCol
    DO
      INSERT INTO TempDB.Table1 (:lgNumber, :lNumber, (',' || :class1.ClassCD || '_Ind CHAR(1) NOT NULL'));
      SET lNumber = lNumber + 1;
    END FOR;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE Database1.Proc1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var LNUMBER = 1;
    /*** SSC-EWI-0023 - PERFORMANCE REVIEW - THIS LOOP CONTAINS AN INSERT, DELETE OR UPDATE STATEMENT ***/
    for(var CLASS2 = new CURSOR(`SELECT
   COL0,
   TRIM(COL1) AS COL1ALIAS,
   TRIM(COL2),
   COL3
FROM
   someDb.prefixCol`,[],false).OPEN();CLASS2.NEXT();) {
        let CLASS1 = CLASS2.CURRENT;
        EXEC(`INSERT INTO TempDB.Table1
VALUES (:lgNumber, :1, (',' || :
!!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE class1.ClassCD MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
:2 || '_Ind CHAR(1) NOT NULL'))`,[LNUMBER,CLASS1.CLASSCD]);
        LNUMBER = LNUMBER + 1;
    }
    CLASS2.CLOSE();
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0025

Binding time variables might require a change in the query.

> **Warning:**
>
> The EWI is only generated when Javascript is the target language for Stored Procedures. This is a deprecated translation feature, as Snowflake Scripting is the recommended target language for Stored Procedures.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

The action of binding time variables might require changes in the query that contains them.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: -t javascript
CREATE PROCEDURE P_1025()
BEGIN
  DECLARE LN_EMP_KEY_NO_PARAM NUMERIC DEFAULT -1;
  DECLARE FLOATVARNAME FLOAT DEFAULT 12.1;
  DECLARE hErrorMsg CHARACTER(30) DEFAULT 'NO ERROR';
  DECLARE CurrTs TIME DEFAULT CURRENT_TIME;
  DECLARE CurrTs2 TIME DEFAULT CURRENT_TIMESTAMP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE P_1025 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  var LN_EMP_KEY_NO_PARAM = -1;
  var FLOATVARNAME = 12.1;
  var HERRORMSG = `NO ERROR`;
  var CURRTS = new Date() /*** SSC-EWI-0025 - BINDING TIME VARIABLE MIGHT REQUIRE CHANGE IN QUERY. ***/;
  var CURRTS2 = new Date();
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0026

Qualified variables may require a cast.

> **Warning:**
>
> The EWI is only generated when Javascript is the target language for Stored Procedures. This is a deprecated translation feature, as Snowflake Scripting is the recommended target language for Stored Procedures.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning is added when there is a query with a variable with a qualified member like an Oracle record or a Teradata for loop variable. Depending on where the variable is being used and the type of value, a cast may be necessary to work properly.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE TABLE1 (COL1 DATE);
CREATE TABLE TABLE2 (COL1 VARCHAR(25));

CREATE OR REPLACE PROCEDURE EXAMPLE
IS
    CURSOR C1 IS SELECT * FROM TABLE1;
BEGIN
    FOR REC1 IN C1 LOOP
		    insert into TABLE2 values (TO_CHAR(REC1.COL1, 'DD-MM-YYYY'));
    END LOOP;
END;
```

##### Generated Code:

```sql
 -- Additional Params: -t javascript
CREATE OR REPLACE TABLE TABLE1 (COL1 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE TABLE2 (COL1 VARCHAR(25))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE EXAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let C1 = new CURSOR(`SELECT * FROM
      TABLE1`,() => []);
	C1.OPEN();
	// ** SSC-EWI-0023 - PERFORMANCE REVIEW - THIS LOOP CONTAINS AN INSERT, DELETE OR UPDATE STATEMENT **
	while ( C1.NEXT() ) {
		let REC1 = C1.CURRENT;
		EXEC(`insert into TABLE2
		    values (TO_CHAR(
		    !!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE REC1.COL1 MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
		    ?, 'DD-MM-YYYY'))`,[REC1.COL1]);
	}
	C1.CLOSE();
$$;
```

##### Generated Code with adjustments:

```sql
 CREATE OR REPLACE TABLE TABLE1 (COL1 TIMESTAMP
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE TABLE2 (COL1 VARCHAR(25))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE EXAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let C1 = new CURSOR(`SELECT * FROM
      TABLE1`,() => []);
	C1.OPEN();
	// ** SSC-EWI-0023 - PERFORMANCE REVIEW - THIS LOOP CONTAINS AN INSERT, DELETE OR UPDATE STATEMENT **
	while ( C1.NEXT() ) {
		let REC1 = C1.CURRENT;
		EXEC(`insert into TABLE2
		    values (TO_CHAR(REC1.COL1::DATE, 'DD-MM-YYYY'))`,[REC1.COL1]);
	}
	C1.CLOSE();
$$;
```

#### Best Practices

* Check if a cast to a Date, Time, or Timestamp is necessary for the binding. Some cases are not necessary because an implicit conversion is done to the value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0027

The following statement uses a variable/literal with an invalid query and it will not be executed.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This warning is used to report that a specific statement uses a variable or literal with an invalid query and for that reason, it will not be executed.

#### Example Code

##### Input Code:

```sql
 REPLACE PROCEDURE TEST.COLLECT_STATS ()
BEGIN
  COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME);

  SET STATS_STATEMENT = 'COLLECT STATS ON ' || OUT_DB || '.' || OUT_TBL || ' COLUMN(' || C4.ColumnName || ');';

  EXECUTE IMMEDIATE STATS_STATEMENT;

  EXECUTE IMMEDIATE 'COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME);';

  SET STATS_STATEMENT_NOT_DYNAMIC = 'COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME);';

  EXECUTE IMMEDIATE STATS_STATEMENT_NOT_DYNAMIC;

END;
;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE TEST.COLLECT_STATS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
--    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. COLLECT **
--    COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME);
    STATS_STATEMENT := 'COLLECT STATS ON ' || OUT_DB || '.' || OUT_TBL || ' COLUMN(' || C4.ColumnName || ')';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!

    EXECUTE IMMEDIATE STATS_STATEMENT;
    !!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!

    EXECUTE IMMEDIATE 'COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME)';
    STATS_STATEMENT_NOT_DYNAMIC := 'COLLECT STATS ON DBC.AccessRights COLUMN(COLNAME)';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!

    EXECUTE IMMEDIATE STATS_STATEMENT_NOT_DYNAMIC;
  END;
$$;
```

#### Best Practices

* Check if a cast to a Date, Time, or Timestamp is necessary for the binding. Some cases are not necessary because an implicit conversion is done to the value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0028

Type not supported by Snowflake

### Severity

Medium

#### Description

This message appears when a type is not supported in Snowflake.

#### Example

##### Input Code (Oracle):

```sql
 CREATE TABLE MYTABLE
(
    COL1 SYS.ANYDATASET
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE MYTABLE
    (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
        COL1 SYS.ANYDATASET
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0029

External table data format not supported in snowflake

### Severity

Medium

#### Description

Snowflake supports the following External Table formats:

| BigQuery | Snowflake |
| --- | --- |
| AVRO | AVRO |
| CSV GOOGLE_SHEETS | CSV |
| NEWLINE_DELIMITED_JSON JSON | JSON |
| ORC | ORC |
| PARQUET | PARQUET |

When an external table has other FORMAT not specified in the above table, this EWI will be generated to inform the user that the FORMAT is not supported.

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table
OPTIONS (
  format = 'DATASTORE_BACKUP',
  uris = ['gs://backup_bucket/backup_folder/*']
);
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0029 - EXTERNAL TABLE DATA FORMAT NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE OR REPLACE EXTERNAL TABLE test.backup_restore_table USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-0035 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_TEST_BACKUP_RESTORE_TABLE_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://backup_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'backup_folder/.*'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0030

The statement below has usages of dynamic SQL

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This error is used to indicate that the statement has usages of dynamic SQL. Each specific source language has its own set of statements that can execute dynamic SQL. Dynamic SQL refers to code that is built as text using the string manipulation tools the database engine language provides.

This scenario is considered a complex pattern because dynamic SQL is built and executed in runtime making it more difficult to track and debug errors. This error is meant to be a helper to spot some problems that a static-code analyzer such as Snow Convert cannot.

#### Code Example

#### Teradata

##### Input

```sql
 REPLACE PROCEDURE teradata_dynamic_sql()
BEGIN
  DECLARE str_sql VARCHAR(20);
  SET str_sql = 'UPDATE TABLE
                    SET COLA = 0,
                        COLB = ''test''';

  EXECUTE IMMEDIATE str_sql;
  EXECUTE IMMEDIATE 'INSERT INTO TABLE1(COL1) VALUES(1)';
  EXECUTE str_sql;
  CALL DBC.SysExecSQL('INSERT INTO TABLE1(COL1) VALUES(1)');
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE teradata_dynamic_sql ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    str_sql VARCHAR(20);
  BEGIN

    str_sql := 'UPDATE "TABLE"
   SET COLA = 0,
       COLB = ''test''';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE str_sql;
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'INSERT INTO TABLE1 (COL1)
VALUES (1);';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE str_sql;
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'INSERT INTO TABLE1 (COL1)
VALUES (1);';
  END;
$$;
```

#### Oracle

##### Input

```sql
 CREATE OR REPLACE PROCEDURE oracle_dynamic_sql
AS
    dynamic_statement VARCHAR(100);
    numeric_variable INTEGER;
    dynamic_statement VARCHAR(100);
    column_variable VARCHAR(100);
    cursor_variable SYS_REFCURSOR;
    c INTEGER;
    dynamic_statement VARCHAR(100);
BEGIN
    dynamic_statement := 'INSERT INTO sample_table(col1) VALUES(1)';
    numeric_variable := 3;
    column_variable := 'col1';

    EXECUTE IMMEDIATE dynamic_statement;
    EXECUTE IMMEDIATE 'INSERT INTO sample_table(col1) VALUES(' || numeric_variable || ')';

    OPEN cursor_variable FOR dynamic_statement;
    OPEN cursor_variable FOR 'SELECT ' || column_variable || ' FROM sample_table';
    OPEN cursor_variable FOR 'SELECT col1 FROM sample_table';

    c := DBMS_SQL.OPEN_CURSOR;
    dynamic_statement := 'SELECT * FROM sample_table';
    DBMS_SQL.PARSE(c, dynamic_statement);
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE oracle_dynamic_sql ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        dynamic_statement VARCHAR(100);
        numeric_variable INTEGER;
        dynamic_statement VARCHAR(100);
        column_variable VARCHAR(100);
        cursor_variable_res RESULTSET;
        c INTEGER;
        dynamic_statement VARCHAR(100);
    BEGIN
        dynamic_statement := 'INSERT INTO sample_table(col1) VALUES(1)';
        numeric_variable := 3;
        column_variable := 'col1';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE :dynamic_statement;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'INSERT INTO sample_table(col1) VALUES(' || NVL(:numeric_variable :: STRING, '') || ')';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_variable_res := (
            EXECUTE IMMEDIATE :dynamic_statement
        );
        LET cursor_variable CURSOR
        FOR
            cursor_variable_res;
        OPEN cursor_variable;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_variable_res := (
            EXECUTE IMMEDIATE 'SELECT ' || NVL(:column_variable :: STRING, '') || ' FROM
   sample_table'
        );
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0133 - THE CURSOR VARIABLE NAMED 'cursor_variable' HAS ALREADY BEEN ASSIGNED IN ANOTHER CURSOR ***/!!!
        LET cursor_variable CURSOR
        FOR
            cursor_variable_res;
        OPEN cursor_variable;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_variable_res := (
            EXECUTE IMMEDIATE 'SELECT col1 FROM
   sample_table'
        );
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0133 - THE CURSOR VARIABLE NAMED 'cursor_variable' HAS ALREADY BEEN ASSIGNED IN ANOTHER CURSOR ***/!!!
        LET cursor_variable CURSOR
        FOR
            cursor_variable_res;
        OPEN cursor_variable;
        c :=
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_SQL.OPEN_CURSOR' IS NOT CURRENTLY SUPPORTED. ***/!!!
        '' AS OPEN_CURSOR;
        dynamic_statement := 'SELECT * FROM
   sample_table';
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_SQL.PARSE' IS NOT CURRENTLY SUPPORTED. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    DBMS_SQL.PARSE(:c, :dynamic_statement);
    END;
$$;
```

#### SQL Server

##### Input

```sql
 CREATE OR ALTER PROCEDURE transact_dynamic_sql
AS
BEGIN
    DECLARE @dynamicStatement AS VARCHAR(200);
    DECLARE @numericVariable AS VARCHAR(200);

    SET @dynamicStatement = 'INSERT INTO sample_table(col1) VALUES(1);';
    SET @numericVariable = '3';

    EXECUTE (@dynamicStatement);
    EXEC ('INSERT INTO sampleTable(col1) VALUES (' + @numericVariable + ');');
    EXECUTE ('INSERT INTO sampleTable(col1) VALUES(10);') AS USER = 'DbAdmin';

    INSERT INTO sampleTable EXECUTE sp_executesql @statement = 'SELECT * FROM sampleTable;';
    INSERT INTO sampleTable EXECUTE ('SELECT * FROM sampleTable;');
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE transact_dynamic_sql ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/13/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        DYNAMICSTATEMENT VARCHAR(200);
        NUMERICVARIABLE VARCHAR(200);
    BEGIN

        DYNAMICSTATEMENT := 'INSERT INTO sample_table (col1) VALUES(1);';
        NUMERICVARIABLE := '3';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE :DYNAMICSTATEMENT;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE 'INSERT INTO sampleTable (col1) VALUES (' || :NUMERICVARIABLE || ');';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - EXECUTE AS USER/LOGIN NOT SUPPORTED IN SNOWFLAKE ***/!!!
        EXECUTE IMMEDIATE 'INSERT INTO sampleTable (col1) VALUES(10);';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INSERT WITH EXECUTE' NODE ***/!!!
        INSERT INTO sampleTable EXECUTE IMMEDIATE 'SELECT
   *
FROM
   sampleTable;';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INSERT WITH EXECUTE' NODE ***/!!!
    INSERT INTO sampleTable EXECUTE IMMEDIATE 'SELECT
   *
FROM
   sampleTable;';
    END;
$$;
```

#### Issues Inside of Dynamic SQL

Something important to take into account is that when migrating dynamic SQL code, SnowConvert AI will not report any type of issue inside of dynamic SQL in the output code or in the assessment reports. This will happen even when the documentation of an issue or the translation specification describes that an issue will always be added to the output code. Here is an example of a migration in Oracle where this situation might be encountered:

##### Oracle

```sql
 SELECT dbms_random.value() FROM dual;

CREATE OR REPLACE PROCEDURE dynamic_sql_procedure
AS
  result VARCHAR(100) := 'SELECT dbms_random.value() from dual';
BEGIN
  NULL;
END;
```

##### Snowflake

```sql
 SELECT
  --** SSC-FDM-OR0033 - DBMS_RANDOM.VALUE DIGITS OF PRECISION ARE LOWER IN SNOWFLAKE **
  DBMS_RANDOM.VALUE_UDF() FROM dual;

CREATE OR REPLACE PROCEDURE dynamic_sql_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    result VARCHAR(100) := 'SELECT
   DBMS_RANDOM.VALUE_UDF() from dual';
  BEGIN
    NULL;
  END;
$$;
```

In the previous example, the query and the variable assignment inside the procedure will be converted exactly the same, the difference is that in the dynamic SQL code the conversion issues will not be shown in the output code and in the assessment reports.

#### Best Practices

* Use this tag to track every dynamically built statement and review its correctness when troubleshooting.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0031

Function not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This warning is used to report that a specific ***built-in function*** of Teradata, Oracle, or SQL Server is not supported.

#### Example Code

##### **Input Code (Oracle):**

```sql
 SELECT VALUE(ST) FROM SampleTable ST;
```

##### **Output Code:**

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0031 - VALUE FUNCTION NOT SUPPORTED ***/!!!
 VALUE(ST) FROM
 SampleTable ST;
```

##### **Input Code (Teradata):**

```sql
 SELECT HASHBUCKET(HASHROW(col1)) FROM my_table;
```

##### **Output Code:**

```sql
 SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHBUCKET FUNCTION NOT SUPPORTED ***/!!!
   HASHBUCKET(
              !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHROW FUNCTION NOT SUPPORTED ***/!!!
              HASHROW(col1))
 FROM
   my_table;
```

> **Note:**
>
> Teradata hash functions (HASHBUCKET, HASHROW, HASHAMP, HASHBAKAMP) are tied to Teradata’s shared-nothing AMP architecture for data distribution. Snowflake manages data distribution internally and has no equivalent mechanism. While Snowflake provides a [HASH](https://docs.snowflake.com/en/sql-reference/functions/hash) function, the HASH function uses a different algorithm, produces values in a different range (Snowflake HASH: signed 64-bit integers; HASHBUCKET: 0–1,048,575), and handles NULLs differently. For this reason, SnowConvert marks these functions with EWI markers rather than attempting an automatic translation.

#### Best Practices

* Please refer to the following links to check the current transformation of the specific function you are trying to convert:

  + [Oracle built-in functions](../../../../translation-references/oracle/functions/README.md)
  + [Teradata built-in functions](../../../../translation-references/teradata/sql-translation-reference/teradata-built-in-functions.md)
  + [SQL Server built-in functions](../../../../translation-references/transact/transact-built-in-functions.md)
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0032

External table requires an external stage to access an external location, define and replace the EXTERNAL_STAGE placeholder

### Description

When transforming the CREATE EXTERNAL TABLE statement, SnowConvert AI will generate an EXTERNAL_STAGE placeholder that has to be replaced with the external stage created for connecting with the external location from Snowflake.

Please refer to the following guides to set up the necessary Storage Integration and External Stage in your Snowflake account:

* [For external tables referencing Amazon S3](https://docs.snowflake.com/en/user-guide/tables-external-s3)
* [For external tables referencing Google Cloud Storage](https://docs.snowflake.com/en/user-guide/tables-external-gcs)
* [For external tables referencing Azure Blob Storage](https://docs.snowflake.com/en/user-guide/tables-external-azure)

#### Code Example

##### Input Code:

##### BigQuery

```sql
 CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER,
  Name STRING,
  Mail STRING,
  Position STRING,
  Salary INTEGER
)
OPTIONS(
  FORMAT='CSV',
  SKIP_LEADING_ROWS=1,
  URIS=['gs://sc_external_table_bucket/folder_with_csv/Employees.csv']
);
```

##### Generated Code:

##### Snowflake

```
CREATE OR REPLACE EXTERNAL TABLE test.Employees_test
(
  Employee_id INTEGER AS CAST(GET_IGNORE_CASE($1, 'c1') AS INTEGER),
  Name STRING AS CAST(GET_IGNORE_CASE($1, 'c2') AS STRING),
  Mail STRING AS CAST(GET_IGNORE_CASE($1, 'c3') AS STRING),
  Position STRING AS CAST(GET_IGNORE_CASE($1, 'c4') AS STRING),
  Salary INTEGER AS CAST(GET_IGNORE_CASE($1, 'c5') AS INTEGER)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs://sc_external_table_bucket, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
PATTERN = 'folder_with_csv/Employees.csv'
FILE_FORMAT = (TYPE = CSV SKIP_HEADER =1);
```

#### Best Practices

* Set up your external connection in the Snowflake account and replace the EXTERNAL_STAGE placeholder to complete the transformation.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0033

Format removed, semantic information not found.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning appears when a column used in a ***CAST*** function with a specific output format was not found in the source code.

#### Example Code

##### Input Code (Teradata):

```sql
 CREATE VIEW SampleView AS
SELECT
    DAY_DATE(FORMAT 'MMM-YYYY')(CHAR(8))
FROM
    SampleTable;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
CREATE OR REPLACE VIEW SampleView
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
    CAST(RPAD(TO_VARCHAR(
    DAY_DATE !!!RESOLVE EWI!!! /*** SSC-EWI-0033 - FORMAT 'MMM-YYYY' REMOVED, SEMANTIC INFORMATION NOT FOUND. ***/!!!), 8) AS CHAR(8))
    FROM
    SampleTable;
```

#### Best Practices

* Make sure all the dependencies(tables and views) related to the procedure statement are being migrated.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0034

Format removed.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning appears when the format of the column used in a CAST function is removed.

#### Example Code

##### Input Code (Teradata):

```sql
 CREATE VIEW SampleView AS
SELECT
    DAY_DATE(FORMAT 'MMM-YYYY') + 1
FROM
    SampleTable;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
CREATE OR REPLACE VIEW SampleView
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
    DAY_DATE !!!RESOLVE EWI!!! /*** SSC-EWI-0034 - FORMAT 'MMM-YYYY' REMOVED. ***/!!! + 1
FROM
    SampleTable;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0035

Check statement not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

***CHECK*** constraint is not supported by Snowflake but it does not affect functionally.

#### Example Code

##### Input Code Oracle :

```sql
 CREATE TABLE "Schema"."BaseTable"(
  "COLUMN1" VARCHAR2(255),
  CHECK ( COLUMN1 IS NOT NULL )
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE "Schema"."BaseTable" (
    "COLUMN1" VARCHAR(255),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
    CHECK ( COLUMN1 IS NOT NULL )
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
  ;
```

##### Input Code Teradata:

```sql
 CREATE TABLE TABLE1,
    NO FALLBACK,
    NO BEFORE JOURNAL,
    NO AFTER JOURNAL
(
    COL0 BYTEINT,
    CONSTRAINT constraint_name CHECK (COL1 < COL2)
)
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TABLE1
(
    COL0 BYTEINT,
    !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
    CONSTRAINT constraint_name CHECK (COL1 < COL2)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

##### Input Code SqlServer

```sql
 ALTER TABLE table_name2
ADD column_name VARCHAR(255)
CONSTRAINT constraint_name
CHECK NOT FOR REPLICATION (column_name > 1);
```

##### Generated Code:

```sql
 ALTER TABLE IF EXISTS table_name2
ADD column_name VARCHAR(255)
!!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
CONSTRAINT constraint_name
CHECK NOT FOR REPLICATION (column_name > 1);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0036

Data type converted to another data type.

### Severity

Low

#### Description

This warning appears when a data type is changed into another one.

#### Example Code

##### Source Code:

```sql
 CREATE TABLE SampleTable (
    SampleYear INTERVAL YEAR(2),
    SampleMonth INTERVAL MONTH(2)
);
```

##### Converted Code:

```sql
 CREATE OR REPLACE TABLE SampleTable (
    SampleYear VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR(2) DATA TYPE CONVERTED TO VARCHAR ***/!!!,
    SampleMonth VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL MONTH(2) DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/23/2024" }}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0040

Clause Not Supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning is added when there is a statement that is not supported in Snowflake.

#### Example Code

In the following example, the `PERCENT` clause from SQL Server is used on the SELECT query, this is not supported by Snowflake.

##### Input Code (SQL Server):

```sql
 SELECT TOP 1 PERCENT * FROM SampleTable;
```

##### Source Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
TOP 1 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP PERCENT' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
	*
FROM
	SampleTable;
```

#### Best Practices

* Review the original functionality of the statement and check if it is actually needed for your specific needs in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0041

The file has an unexpected encoding and was not translated

> **Note:**
>
> This `EWI` is deprecated, please refer to [SSC-OOS-0001](../out-of-scope/generalOOS.md) documentation.

### Description

This issue happens when a source code file has an encoding format not recognized by the tool. Character encoding is the process of assigning numbers to graphical characters, in this context written characters of human language, thus the error indicates the conversion tool could not recognize certain characters.

#### Best Practices

* All files in the input folder should have the same encoding to avoid this error.
* The appropriate encoding should be selected through the conversion settings or by utilizing the –encoding conversion parameter with the [CLI](../../../user-guide/snowconvert/command-line-interface/README.md). To determine which encoding to select online tools such as [Free Online Formater](https://freeonlineformatter.com/encoding-string) can be used or run the command `file -i *` in the case of Linux or OS.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0045

Column Name is Snowflake Reserved Keyword.

### Severity

Medium

#### Description

In some cases, column names that are valid in the source language may conflict with Snowflake’s reserved keywords. These conflicts arise because Snowflake reserves a set of keywords that cannot be used directly as column names without special handling. For details, refer to Snowflake’s official documentation on [reserved and limited keywords](https://docs.snowflake.com/en/sql-reference/reserved-keywords).

#### Code example

##### Input

```sql
 CREATE TABLE T1
(
    LOCALTIME VARCHAR,
    CURRENT_USER VARCHAR
);
```

##### Output

```sql
 CREATE OR REPLACE TABLE T1
    (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0045 - COLUMN NAME 'LOCALTIME' IS A SNOWFLAKE RESERVED KEYWORD ***/!!!
    "LOCALTIME" VARCHAR,
    !!!RESOLVE EWI!!! /*** SSC-EWI-0045 - COLUMN NAME 'CURRENT_USER' IS A SNOWFLAKE RESERVED KEYWORD ***/!!!
    "CURRENT_USER" VARCHAR
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

#### Best Practices

* Consider renaming the columns that use names that are not supported in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0046

Nested function/procedure declarations are considered a complex pattern and not supported in Snowflake.

### Severity

Critical

#### Description

Snowflake does not support the declaration of nested functions/procedures, this warning is added to any create function or create procedure statement in which nested declarations were found.

#### Code example

##### Input

```sql
 CREATE OR REPLACE FUNCTION myFunction
RETURN INTEGER
IS
   total_count INTEGER;
   -- Function Declaration
   FUNCTION function_declaration(param1 VARCHAR) RETURN INTEGER;
   FUNCTION function_definition
   RETURN INTEGER
   IS
   count INTEGER;
   PROCEDURE procedure_declaration(param1 INTEGER)
   IS
       BEGIN
            NULL;
       END;
  BEGIN
    RETURN count;
  end;
BEGIN
    -- Your logic to calculate the total employee count goes here
    RETURN total_count;
END;
```

##### Output

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0046 - NESTED FUNCTION/PROCEDURE DECLARATIONS ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!!
CREATE OR REPLACE FUNCTION myFunction ()
RETURNS FLOAT
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
AS
$$
  let TOTAL_COUNT;
  !!!RESOLVE EWI!!! /*** SSC-EWI-OR0057 - TRANSFORMATION FOR NESTED FUNCTION IS NOT SUPPORTED IN THIS SCENARIO ***/!!!
  /*    -- Function Declaration
     FUNCTION function_declaration(param1 VARCHAR) RETURN INTEGER; */
  // Function Declaration
  ;
  !!!RESOLVE EWI!!! /*** SSC-EWI-OR0057 - TRANSFORMATION FOR NESTED FUNCTION IS NOT SUPPORTED IN THIS SCENARIO ***/!!!
  /*    FUNCTION function_definition
     RETURN INTEGER
     IS
     count INTEGER;
     PROCEDURE procedure_declaration(param1 INTEGER)
     IS
         BEGIN
              NULL;
         END;
    BEGIN
      RETURN count;
    end; */
  ;
  // Your logic to calculate the total employee count goes here
  return TOTAL_COUNT;
$$;
```

#### Best Practices

* Remove the nested declarations from the function/procedure.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0049

A Global Temporary Table is being referenced.

> **Note:**
>
> This `EWI` is deprecated, please refer to [SSC-FDM-0023](../functional-difference/generalFDM.md) documentation.

### Severity

Medium

#### Description

SnowConvert AI transforms Global Temporary tables into regular Create Table. References to these tables may behave different than expected.

#### Code example

##### Input

```sql
 create global temporary table t1
    (col1 varchar);
create view view1 as
    select col1 from t1;
```

##### Output

```sql
 --** SSC-FDM-0009 - GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED. **
CREATE OR REPLACE TABLE t1
    (col1 varchar)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE VIEW view1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
select col1 from
    !!!RESOLVE EWI!!! /*** SSC-EWI-0049 - A Global Temporary Table is being referenced ***/!!!
    t1;
```

#### Related Issues

* [SSC-FDM-0009](../functional-difference/generalFDM.md): GLOBAL TEMPORARY TABLE functionality not supported.

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0052

Unusable object

### Severity

Medium

#### Description

This error happens when the source code uses a parameter or variable that is not supported or was not recognized by the conversion tool.

#### Example code

##### Input Code (Oracle):

```sql
 -- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE PROCEDURE_PARAMETERS(PARAM SDO_GEOMETRY)
AS
    VARIABLE SDO_GEOMETRY;
BEGIN
    VARIABLE := PARAM;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROCEDURE_PARAMETERS (PARAM GEOMETRY)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // REGION SnowConvert AI Helpers Code
    var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
    var fixBind = function (arg) {
        arg = arg instanceof Date ? formatDate(arg) : IS_NULL(arg) ? null : arg;
        return arg;
    };
    var SQL = {
        FOUND : false,
        NOTFOUND : false,
        ROWCOUNT : 0,
        ISOPEN : false
    };
    var _RS, _ROWS, SQLERRM = "normal, successful completion", SQLCODE = 0;
    var getObj = (_rs) => Object.assign(new Object(),_rs);
    var getRow = (_rs) => (values = Object.values(_rs)) && (values = values.splice(-1 * _rs.getColumnCount())) && values;
    var fetch = (_RS,_ROWS,fmode) => _RS.getRowCount() && _ROWS.next() && (fmode ? getObj : getRow)(_ROWS) || (fmode ? new Object() : []);
    var EXEC = function (stmt,binds,opts) {
        try {
            binds = !(arguments[1] instanceof Array) && ((opts = arguments[1]) && []) || (binds || []);
            opts = opts || new Object();
            binds = binds ? binds.map(fixBind) : binds;
            _RS = snowflake.createStatement({
                    sqlText : stmt,
                    binds : binds
                });
            _ROWS = _RS.execute();
            if (opts.sql !== 0) {
                var isSelect = stmt.toUpperCase().trimStart().startsWith("SELECT");
                var affectedRows = isSelect ? _RS.getRowCount() : _RS.getNumRowsAffected();
                SQL.FOUND = affectedRows != 0;
                SQL.NOTFOUND = affectedRows == 0;
                SQL.ROWCOUNT = affectedRows;
            }
            if (opts.row === 2) {
                return _ROWS;
            }
            var INTO = function (opts) {
                if (opts.vars == 1 && _RS.getColumnCount() == 1 && _ROWS.next()) {
                    return _ROWS.getColumnValue(1);
                }
                if (opts.rec instanceof Object && _ROWS.next()) {
                    var recordKeys = Object.keys(opts.rec);
                    Object.assign(opts.rec,Object.fromEntries(new Map(getRow(_ROWS).map((element,Index) => [recordKeys[Index],element]))))
                    return opts.rec;
                }
                return fetch(_RS,_ROWS,opts.row);
            };
            var BULK_INTO_COLLECTION = function (into) {
                for(let i = 0;i < _RS.getRowCount();i++) {
                    FETCH_INTO_COLLECTIONS(into,fetch(_RS,_ROWS,opts.row));
                }
                return into;
            };
            if (_ROWS.getRowCount() > 0) {
                return _ROWS.getRowCount() == 1 ? INTO(opts) : BULK_INTO_COLLECTION(opts);
            }
        } catch(error) {
            RAISE(error.code,error.name,error.message)
        }
    };
    var RAISE = function (code,name,message) {
        message === undefined && ([name,message] = [message,name])
        var error = new Error(message);
        error.name = name
        SQLERRM = `${(SQLCODE = (error.code = code))}: ${message}`
        throw error;
    };
    var FETCH_INTO_COLLECTIONS = function (collections,fetchValues) {
        for(let i = 0;i < collections.length;i++) {
            collections[i].push(fetchValues[i]);
        }
    };
    var IS_NULL = (arg) => !(arg || arg === 0);
    // END REGION

    let VARIABLE = new SDO_GEOMETRY();
    VARIABLE =
        !!!RESOLVE EWI!!! /*** SSC-EWI-0052 - UNUSABLE OBJECT PARAM, ITS DATATYPE WAS NOT TRANSFORMED ***/!!!
        PARAM;
$$;
```

#### Best Practices

* Look for an alternative for the used data type.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0053

Object may not work.

### Severity

Low

#### Description

This error happens when the conversion tool could not determine the data type of a variable. This may happen because the declaration of a variable could be missing.

#### Example code

##### Input Code (Oracle):

```sql
 -- Additional Params: -t javascript
CREATE OR REPLACE PROCEDURE PROCEDURE_VARIABLES
AS
    VARIABLE INTEGER;
BEGIN
    VARIABLE := ANOTHER_VARIABLE;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROCEDURE_VARIABLES ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // REGION SnowConvert AI Helpers Code
    var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
    var fixBind = function (arg) {
        arg = arg instanceof Date ? formatDate(arg) : IS_NULL(arg) ? null : arg;
        return arg;
    };
    var SQL = {
        FOUND : false,
        NOTFOUND : false,
        ROWCOUNT : 0,
        ISOPEN : false
    };
    var _RS, _ROWS, SQLERRM = "normal, successful completion", SQLCODE = 0;
    var getObj = (_rs) => Object.assign(new Object(),_rs);
    var getRow = (_rs) => (values = Object.values(_rs)) && (values = values.splice(-1 * _rs.getColumnCount())) && values;
    var fetch = (_RS,_ROWS,fmode) => _RS.getRowCount() && _ROWS.next() && (fmode ? getObj : getRow)(_ROWS) || (fmode ? new Object() : []);
    var EXEC = function (stmt,binds,opts) {
        try {
            binds = !(arguments[1] instanceof Array) && ((opts = arguments[1]) && []) || (binds || []);
            opts = opts || new Object();
            binds = binds ? binds.map(fixBind) : binds;
            _RS = snowflake.createStatement({
                    sqlText : stmt,
                    binds : binds
                });
            _ROWS = _RS.execute();
            if (opts.sql !== 0) {
                var isSelect = stmt.toUpperCase().trimStart().startsWith("SELECT");
                var affectedRows = isSelect ? _RS.getRowCount() : _RS.getNumRowsAffected();
                SQL.FOUND = affectedRows != 0;
                SQL.NOTFOUND = affectedRows == 0;
                SQL.ROWCOUNT = affectedRows;
            }
            if (opts.row === 2) {
                return _ROWS;
            }
            var INTO = function (opts) {
                if (opts.vars == 1 && _RS.getColumnCount() == 1 && _ROWS.next()) {
                    return _ROWS.getColumnValue(1);
                }
                if (opts.rec instanceof Object && _ROWS.next()) {
                    var recordKeys = Object.keys(opts.rec);
                    Object.assign(opts.rec,Object.fromEntries(new Map(getRow(_ROWS).map((element,Index) => [recordKeys[Index],element]))))
                    return opts.rec;
                }
                return fetch(_RS,_ROWS,opts.row);
            };
            var BULK_INTO_COLLECTION = function (into) {
                for(let i = 0;i < _RS.getRowCount();i++) {
                    FETCH_INTO_COLLECTIONS(into,fetch(_RS,_ROWS,opts.row));
                }
                return into;
            };
            if (_ROWS.getRowCount() > 0) {
                return _ROWS.getRowCount() == 1 ? INTO(opts) : BULK_INTO_COLLECTION(opts);
            }
        } catch(error) {
            RAISE(error.code,error.name,error.message)
        }
    };
    var RAISE = function (code,name,message) {
        message === undefined && ([name,message] = [message,name])
        var error = new Error(message);
        error.name = name
        SQLERRM = `${(SQLCODE = (error.code = code))}: ${message}`
        throw error;
    };
    var FETCH_INTO_COLLECTIONS = function (collections,fetchValues) {
        for(let i = 0;i < collections.length;i++) {
            collections[i].push(fetchValues[i]);
        }
    };
    var IS_NULL = (arg) => !(arg || arg === 0);
    // END REGION

    let VARIABLE;
    VARIABLE =
        !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT ANOTHER_VARIABLE MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
        ANOTHER_VARIABLE;
$$;
```

#### Best Practices

* Make sure the input code has the variable declared.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0054

Unsupported outer join subquery

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This error happens when a correlated subquery is found within an OR logical expression of an OUTER JOIN (Left, Right or Full). In those cases they could produce inconsistent results or cause the following error:

**`SQL compilation error: Unsupported subquery type cannot be evaluated.`**

These limitations with subqueries are briefly mentioned in [Snowflake documentation](https://docs.snowflake.com/en/user-guide/querying-subqueries.html#limitations) and some information about them can also be found in [Snowflake forums.](https://community.snowflake.com/s/question/0D53r00009mIxwYCAS/sql-compilation-error-unsupported-subquery-type-cannot-be-evaluated)

#### Example code

##### Input Code (Teradata):

```sql
 SELECT a.Column1, b.Column2
FROM
    TableA a
    LEFT JOIN TableB b ON (a.Column1 = b.Column1)
    AND (
        a.Column2 = b.Column2
        OR EXISTS(
            SELECT * FROM Table3 c
            WHERE c.Column1 = a.Column1
        )
    );
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
    a.Column1,
    b.Column2
FROM
    TableA a
   LEFT JOIN
        TableB b ON (a.Column1 = b.Column1)
   AND (
       a.Column2 = b.Column2
       OR EXISTS
                !!!RESOLVE EWI!!! /*** SSC-EWI-0054 - CORRELATED SUBQUERIES WITHIN AN OR EXPRESSION OF AN OUTER JOIN COULD CAUSE COMPILATION ERRORS ***/!!!(
                    SELECT
                        * FROM
                        Table3 c
                               WHERE c.Column1 = a.Column1
       )
   );
```

#### Best Practices

* Verify the output code does not produce a compilation error.
* Verify the output code’s functional equivalence.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0056

Custom Types Not Supported

> **Note:**
>
> **Deprecation:** This issue code is deprecated. SnowConvert now translates many Oracle and cross-dialect `CREATE TYPE` definitions to [Snowflake native user-defined types](https://docs.snowflake.com/en/sql-reference/sql/create-type) where supported. This entry’s **Input Code** and **Generated Code** examples are kept for historical reference and may not match current tool output. For current behavior, see the translation reference for your source: [Oracle](../../../../translation-references/oracle/sql-translation-reference/create_type.md), [IBM DB2](../../../../translation-references/db2/db2-create-type.md), [Teradata](../../../../translation-references/teradata/sql-translation-reference/teradata-create-type.md), [SQL Server / Azure Synapse](../../../../translation-references/transact/transact-create-type.md), [PostgreSQL / Greenplum / Netezza](../../../../translation-references/postgres/ddls/postgresql-create-type.md), [Sybase IQ](../../../../translation-references/sybase/sybase-create-type.md).

### Severity

Low

#### Description

This message appears when a user-defined type (UDT) is defined. User-defined types are not supported in Snowflake, so references to the custom type are changed to an appropriate Snowflake type (such as VARIANT or OBJECT).

Snowflake has a UDT Private Preview feature available. For more information about accessing this feature, please contact [udt-prpr@snowflake.com](mailto:udt-prpr%40snowflake.com).

> **Note:**
>
> The type definition is commented on but is still being taken into account for resolving usages, see SSC-EWI-0062 for more information.

#### Example code

##### Input Code (Oracle):

```sql
 CREATE TYPE type1 AS OBJECT (column1 INT);

CREATE OR REPLACE PROCEDURE record_procedure
IS
    TYPE record_typ IS RECORD(col1 INTEGER, col2 FLOAT);
BEGIN
    NULL;
END;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - USER-DEFINED TYPES ARE NOT SUPPORTED IN SNOWFLAKE. REFERENCES WERE CHANGED TO VARIANT. A UDT PRIVATE PREVIEW FEATURE IS AVAILABLE, FOR MORE INFORMATION, PLEASE CONTACT udt-prpr@snowflake.com ***/!!!
CREATE TYPE type1 AS OBJECT (column1 INT)
;

CREATE OR REPLACE PROCEDURE record_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - USER-DEFINED TYPES ARE NOT SUPPORTED IN SNOWFLAKE. REFERENCES WERE CHANGED TO OBJECT. A UDT PRIVATE PREVIEW FEATURE IS AVAILABLE, FOR MORE INFORMATION, PLEASE CONTACT udt-prpr@snowflake.com ***/!!!
        TYPE record_typ IS RECORD(col1 INTEGER, col2 FLOAT);
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* Consider using Snowflake’s OBJECT or VARIANT data types as alternatives to user-defined types for storing complex structured data.
* For more information about the UDT Private Preview feature, contact [udt-prpr@snowflake.com](mailto:udt-prpr%40snowflake.com).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0058

Functionality is not currently supported by Snowflake Scripting

### Severity

Medium

#### Description

This error happens when a statement used in a create procedure is not currently supported by Snowflake Scripting.

#### Example code

##### Input Code (Oracle):

```sql
 CREATE OR REPLACE PROCEDURE PROC01
IS
  number_variable INTEGER;
BEGIN
  EXECUTE IMMEDIATE 'SELECT 1 FROM DUAL' INTO number_variable;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC01 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    number_variable INTEGER;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'SELECT 1 FROM DUAL'
                                           !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'EXECUTE IMMEDIATE RETURNING CLAUSE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                                           INTO number_variable;
  END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0062

Custom type usage changed to variant

### Severity

Low

#### Description

This message appears when a Custom Type is referenced, and then its usage is changed to a variant.

> **Note:**
>
> This message is heavily related to SSC-EWI-0056.

#### Example code

##### Input Code (Oracle):

```sql
 CREATE TYPE type1 AS OBJECT(type1_column1 INT);

CREATE TABLE table1
(
column1 type1
);
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE type1 AS OBJECT(type1_column1 INT)
;

CREATE OR REPLACE TABLE table1
(
column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'type1' USAGE CHANGED TO VARIANT ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
;

CREATE OR REPLACE VIEW table1_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
column1:type1_column1 :: VARCHAR AS type1_column1
FROM
table1;
```

#### Best Practices

* Remember to transform all of its input data into a Variant-compliant data type as well.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-0064

Referenced custom type in query not found

### Severity

High

#### Description

This error happens when a Custom Type is referenced in a source for a DML statement, but the Custom Type was never defined.
For example in a Table Column whose type might be a UDT but it was never defined.

> **Warning:**
>
> Not to be confused with SSC-FDM-0015, which is when it was referenced in a DDL query.

#### Example Code

##### Input Code (Oracle):

```sql
 --Type was never defined
--CREATE TYPE type1;

CREATE TABLE table1
(
--the type will be unresolved
column1 type1
);

SELECT
column1
FROM table1;
```

##### Generated Code:

```sql
 --Type was never defined
--CREATE TYPE type1;
!!!RESOLVE EWI!!! /*** SSC-EWI-0050 - MISSING DEPENDENT OBJECT "type1" ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0063 - 'PUBLIC.table1_view' ADDED BECAUSE 'table1' USED A CUSTOM TYPE ***/!!!
CREATE OR REPLACE TABLE table1
(
--the type will be unresolved
column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0064 - REFERENCED CUSTOM TYPE 'type1' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/!!! /*** SSC-FDM-0015 - DATA TYPE 'type1' NOT RECOGNIZED ***/
);

CREATE OR REPLACE VIEW PUBLIC.table1_view
AS
SELECT
column1
FROM
table1;

SELECT
column1 !!!RESOLVE EWI!!! /*** SSC-EWI-0064 - REFERENCED CUSTOM TYPE 'type1' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/!!!
FROM
table1;
```

#### Best Practices

* Verify that the type that was referenced was defined in the input code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0066

Expression not supported in Snowflake.

### Severity

High

#### Description

This error is used to inform that a *specific* **expression** is not supported in Snowflake.

#### Example Code

##### **Input Code:**

```sql
 SELECT * from T1 where (cast('2016-03-17' as DATE),
       cast('2016-03-21' as DATE)) OVERLAPS
       (cast('2016-03-20' as DATE), cast('2016-03-22' as DATE));
```

##### **Output Code:**

```sql
 SELECT * from
       T1
where
       !!!RESOLVE EWI!!! /*** SSC-EWI-0066 - EXPRESSION 'OVERLAPS' IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! (cast('2016-03-17' as DATE),
       cast('2016-03-21' as DATE)) OVERLAPS
       (cast('2016-03-20' as DATE), cast('2016-03-22' as DATE));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0067

UDF was transformed to Snowflake procedure, calling procedures inside a query is not supported.

### Severity

High

#### Description

This error is added when a call to a UDF (user defined function) is found inside a query. Oracle UDFs and UDFs inside packages and some SQL Server UDFs, are being transformed to Snowflake Stored Procedures, which can not be called from a query.

The function is transformed to a Stored procedure to maintain functional equivalence and the function call is transformed to an empty Snowflake UDF function.

> **Note:**
>
> This EWI is strongly related to [SSC-EWI-0068](../functional-difference/generalFDM.md)

#### Example Code

##### SQL Server:

##### Input Code

```sql
 CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS INT
AS
BEGIN
    DECLARE @i int = 0, @p int;
    Select @p = COUNT(*) FROM PURCHASING.VENDOR

    WHILE (@p < 1000)
    BEGIN
        SET @i = @i + 1
        SET @p = @p + @i
    END

    IF (@i = 6)
        RETURN 1

    RETURN @p
END;
GO

SELECT PURCHASING.FOO() AS RESULT;
```

##### Generated Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "PURCHASING.VENDOR" **
CREATE OR REPLACE PROCEDURE PURCHASING.FOO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I INT := 0;
        P INT;
    BEGIN

        Select
            COUNT(*)
        INTO
            :P
 FROM
            PURCHASING.VENDOR;
        WHILE (:P < 1000) LOOP
            I := :I + 1;
            P := :P + :I;
        END LOOP;
        IF ((:I = 6)) THEN
            RETURN 1;
        END IF;
        RETURN :P;
    END;
$$;

SELECT
    PURCHASING.FOO() !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! AS RESULT;
```

##### Oracle:

##### Input Code

```sql
 CREATE FUNCTION employee_function (param1 in NUMBER) RETURN NUMBER is
  var1    employees.employee_ID%TYPE;
  var2    employees.manager_ID%TYPE;
  var3    employees.title%TYPE;
BEGIN
  SELECT employee_ID, manager_ID, title
  INTO var1, var2, var3
  FROM employees
    START WITH manager_ID = param1
    CONNECT BY manager_ID = PRIOR employee_id;
  RETURN var1;
EXCEPTION
   WHEN no_data_found THEN RETURN param1;
END employee_function;

SELECT employee_function(2) FROM employees;
```

##### Generated Code

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees" **
CREATE OR REPLACE PROCEDURE employee_function (param1 NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    var1    employees.employee_ID%TYPE;
    var2    employees.manager_ID%TYPE;
    var3    employees.title%TYPE;
  BEGIN
    SELECT employee_ID, manager_ID, title
    INTO
      :var1,
      :var2,
      :var3
    FROM
      employees
      START WITH manager_ID = :param1
    CONNECT BY
      manager_ID = PRIOR employee_id;
    RETURN :var1;
  EXCEPTION
     WHEN no_data_found THEN
      RETURN :param1;
  END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees" **

SELECT
  !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! employee_function(2) FROM
  employees;
```

#### Best Practices

* The source code may need to be restructured to fit with the Snowflake user-defined functions [approach](https://docs.snowflake.com/en/sql-reference/user-defined-functions.html#udfs-user-defined-functions).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0068

User defined function was transformed to a Snowflake procedure.

Snowflake user defined functions do not support the same features as Oracle or SQL Server. To maintain the functional equivalence the function is transformed to a Snowflake stored procedure. This will affect their usage in queries.

### Example Code

#### SQL Server:

##### Input Code

```sql
 CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS INT
AS
BEGIN
    DECLARE @i int = 0, @p int;
    Select @p = COUNT(*) FROM PURCHASING.VENDOR

    WHILE (@p < 1000)
    BEGIN
        SET @i = @i + 1
        SET @p = @p + @i
    END

    IF (@i = 6)
        RETURN 1

    RETURN @p
END;
```

##### Generated Code

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE PURCHASING.FOO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "06/25/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I INT := 0;
        P INT;
    BEGIN

        Select
            COUNT(*)
        INTO
            :P
 FROM
            PURCHASING.VENDOR;
        WHILE (:P < 1000) LOOP
            I := :I + 1;
            P := :P + :I;
        END LOOP;
        IF ((:I = 6)) THEN
            RETURN 1;
        END IF;
        RETURN :P;
    END;
$$;
```

##### Oracle:

##### Input Code

```
CREATE OR REPLACE FUNCTION FUN1(PAR1 VARCHAR)
RETURN VARCHAR
IS
    VAR1 VARCHAR(20);
    VAR2 VARCHAR(20);
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 where col1 = 1;
    VAR2 := PAR1 || VAR1;
    RETURN VAR2;
END;
/
```

##### Generated Code

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE FUN1(PAR1 VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    VAR1 VARCHAR(20);
    VAR2 VARCHAR(20);
  BEGIN
    SELECT COL1 INTO
      :VAR1
    FROM
      TABLE1
    where col1 = 1;
    VAR2 := NVL(:PAR1 :: STRING, '') || NVL(:VAR1 :: STRING, '');
    RETURN :VAR2;
  END;
$$;
```

### Best Practices

* Separate the inside queries to maintain the same logic.
* The source code may need to be restructured to fit with the Snowflake user-defined functions [approach](https://docs.snowflake.com/en/sql-reference/user-defined-functions.html#udfs-user-defined-functions).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0073

Pending Functional Equivalence Review

### Severity

Medium

#### Description

This EWI is added when there is a grammar clause in the input platform that has not been reviewed by the SnowConvert AI developer team. The code may require manual revision for it to work in Snowflake.

#### Example Code

##### SQLServer:

##### Input Code

```sql
 CREATE OR ALTER PROC SampleProcedure
AS
BEGIN
   INSERT INTO aTable (columnA = 'varcharValue', columnB = 1);
   INSERT exampleTable VALUES ('Hello', 23);
   INSERT INTO exampleTable DEFAULT VALUES;
END
```

##### Generated Code

```sql
 CREATE OR REPLACE PROCEDURE SampleProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      INSERT INTO aTable (columnA = 'varcharValue', columnB = 1);
      INSERT INTO exampleTable VALUES ('Hello', 23);
      !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INSERT WITH DEFAULT VALUES' NODE ***/!!!
      INSERT INTO exampleTable DEFAULT VALUES;
   END;
$$;
```

Notice in line 6 of the input code, that there is a reference to a `INSERT` statement with `DEFAULT VALUES`, this is currently a not supported statement by SnowConvert AI and that is why in lines 11 and 12 the EWI is generated.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0077

Cycle found between CTE calls. CTEs cannot be ordered.

### Severity

Low

#### Description

This warning is added when a query that has several CTE (Common Table Expression) reference calls creates a cycle that cannot determine the calling order of the CTEs, and then the CTEs cannot be ordered and the query will remain as the source.

#### Example Code

##### Input Code (Teradata):

```sql
 WITH t1(c1) as (SELECT c1 FROM t2),
     t2(c2) as (SELECT c2 FROM t3),
     RECURSIVE t3(c3) as (SELECT c3, someOtherColumn FROM t1, t3)
     SELECT * FROM t1;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0077 - CYCLE FOUND BETWEEN CTE REFERENCE CALLS, CTES CANNOT BE ORDERED AND THE QUERY WILL REMAIN AS ORIGINAL ***/!!!
WITH RECURSIVE t1(c1) AS
(
     SELECT
          c1 FROM t2
),
t2(c2) AS
(
     SELECT
          c2 FROM t3
),
t3(c3) AS
(
     SELECT
          c3,
          someOtherColumn FROM t1, t3
)
SELECT
     * FROM t1;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0080

Default value is not allowed on binary columns

### Severity

Low

#### Description

This EWI is added when the source code has a default value for BINARY data type, which is not supported in Snowflake SQL

##### Example Code

**Input Code (SqlServer):**

```sql
 create table test1345
(
  key1 binary default 0
);
```

**Output Code:**

```sql
 CREATE OR REPLACE TABLE test1345
(
  key1 BINARY
              !!!RESOLVE EWI!!! /*** SSC-EWI-0080 - DEFAULT VALUE IS NOT ALLOWED ON BINARY COLUMNS ***/!!!
              default 0
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

##### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0084

XMLTABLE is not supported.

### Severity

High

#### Description

XMLTABLE function is not currently supported.

#### Example Code

##### Input Code (DB2):

```sql
 SELECT
    *
FROM
    XMLTABLE(
        'stringValue' PASSING BY REF passingExpr AS AliasName
    ) AS XMLTABLENAME
```

##### Generated Code:

```sql
 SELECT
    *
FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-0084 - XMLTABLE IS NOT SUPPORTED BY SNOWFLAKE ***/!!!
    XMLTABLE(
        'stringValue' PASSING BY REF passingExpr AS AliasName
    ) AS XMLTABLENAME
```

#### Best Practices

* Check this [blog](https://www.snowflake.com/blog/easily-load-xml-sql/) for XML transformations in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0086

Replaced invalid characters for new identifier

> **Note:**
>
> This `EWI` is deprecated, please refer to [SSC-FDM-0030](../functional-difference/generalFDM.md) documentation.

### Severity

Low

#### Description

The given identifier has invalid characters for the output language. Those characters were replaced with their UTF-8 codes.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE PROCEDURE PROC1
AS
    "VAR0" INT;
    "VAR`/1ͷ" VARCHAR(20);
    "o*/o" FLOAT;
    " . " INT;
    ". ." INT;
    "123Name" INT;
    "return" INT;
    yield INT;
    ident#10 INT;
BEGIN
    NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        "VAR0" INT;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0086 - IDENTIFIER '"VAR`/1ͷ"' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES ***/!!!
        VAR_u60_u2F1ͷ VARCHAR(20);
        !!!RESOLVE EWI!!! /*** SSC-EWI-0086 - IDENTIFIER '"o*/o"' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES ***/!!!
        o_u2A_u2Fo FLOAT;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0086 - IDENTIFIER '" . "' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES ***/!!!
        _u20_u2E_u20 INT;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0086 - IDENTIFIER '". ."' HAS INVALID CHARACTERS. CHARACTERS WERE REPLACED WITH THEIR UTF-8 CODES ***/!!!
        _u2E_u20_u2E INT;
        "123Name" INT;
        "return" INT;
        yield INT;
        IDENT_HASHTAG_10 INT;
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0092

Materialized View was converted to regular View.

> **Danger:**
>
> Deprecated

### Severity

Low

#### Description

Currently, all Materialized Views are being converted to regular Views. This process eliminates additional clauses that the Materialized Views may have had. For more information, see [Limitations on creating materialized views](https://docs.snowflake.com/en/user-guide/views-materialized.html#label-limitations-on-creating-materialized-views).

#### Example Code

##### Input Code:

```sql
 CREATE MATERIALIZED VIEW MATERIALIZED_VIEW1
SEGMENT CREATION IMMEDIATE
ORGANIZATION HEAP PCTFREE 10 PCTUSED 40 INITRANS 1 MAXTRANS 255
PCTFREE 10 PCTUSED 40 INITRANS 1 MAXTRANS 255
INMEMORY PRIORITY NONE MEMCOMPRESS FOR QUERY LOW DISTRIBUTE AUTO NO DUPLICATE
AS
select
   *
from
   aTable;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0092 - MATERIALIZED VIEW WAS CONVERTED TO REGULAR VIEW. ***/!!!
CREATE OR REPLACE VIEW MATERIALIZED_VIEW1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
select
   *
from
   aTable;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0094

Label declaration not supported

### Severity

Low

#### Description

Currently there is no equivalent for labels declaration in Snow Scripting, so an EWI is added, and the label is commented out

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE OR REPLACE PROCEDURE Example ( grade NUMBER )
IS
BEGIN
	<<CASE1>><<CASE2>>
	CASE grade
		WHEN 10 THEN NULL;
		ELSE NULL;
	END CASE CASE1;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE Example (grade NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		!!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<CASE1>><<CASE2>> ***/!!!
		CASE :grade
			WHEN 10 THEN
				NULL;
			ELSE NULL;
		END CASE;
	END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0095

Create Type not supported in Snowflake

> **Note:**
>
> **Deprecation:** This issue code is deprecated. Snowflake supports native user-defined types, and SnowConvert emits `CREATE TYPE` for many supported patterns. This entry’s **Input Code** and **Generated Code** examples are preserved as historical reference. For current conversion behavior, see the translation reference for your source: [Oracle](../../../../translation-references/oracle/sql-translation-reference/create_type.md), [IBM DB2](../../../../translation-references/db2/db2-create-type.md), [Teradata](../../../../translation-references/teradata/sql-translation-reference/teradata-create-type.md), [SQL Server / Azure Synapse](../../../../translation-references/transact/transact-create-type.md), [PostgreSQL / Greenplum / Netezza](../../../../translation-references/postgres/ddls/postgresql-create-type.md), [Sybase IQ](../../../../translation-references/sybase/sybase-create-type.md).

### Severity

High

#### Description

User-defined types (UDTs) created with the `CREATE TYPE` statement are not currently supported in Snowflake. When SnowConvert AI encounters a `CREATE TYPE` statement, it adds this warning to indicate that manual intervention is required.

Snowflake has a UDT Private Preview feature available. For more information about accessing this feature, please contact [udt-prpr@snowflake.com](mailto:udt-prpr%40snowflake.com).

#### Example Code

##### Input Code (Oracle):

```sql
CREATE OR REPLACE TYPE address_type AS OBJECT (
    street VARCHAR2(100),
    city VARCHAR2(50),
    state VARCHAR2(2),
    zip_code VARCHAR2(10)
);
```

##### Generated Code:

```sql
--** SSC-EWI-0095 - USER-DEFINED TYPE: 'address_type' IS CURRENTLY NOT SUPPORTED IN SNOWFLAKE. A UDT PRIVATE PREVIEW FEATURE IS AVAILABLE, FOR MORE INFORMATION, PLEASE CONTACT udt-prpr@snowflake.com **
CREATE OR REPLACE TYPE address_type AS OBJECT (
    street VARCHAR(100),
    city VARCHAR(50),
    state VARCHAR(2),
    zip_code VARCHAR(10)
);
```

#### Best Practices

* Consider using Snowflake’s OBJECT or VARIANT data types as alternatives to user-defined types for storing complex structured data.
* For more information about the UDT Private Preview feature, contact [udt-prpr@snowflake.com](mailto:udt-prpr%40snowflake.com).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0101

Commented out transaction label name because is not applicable in Snowflake

### Severity

Low

#### Description

Snowflake does not operate transaction label names because there should not be nested transactions to identify in different COMMIT or ROLLBACK statements.

#### Example code

##### Input Code (SQL Server):

```sql
 CREATE PROCEDURE TestTransaction
AS
BEGIN
    DROP TABLE IF EXISTS NEWTABLE;
    CREATE TABLE NEWTABLE(COL1 INT, COL2 VARCHAR);
      BEGIN TRANSACTION LabelA;
        INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
        INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
      COMMIT TRANSACTION LabelA;
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE TestTransaction ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DROP TABLE IF EXISTS NEWTABLE;
        CREATE OR REPLACE TABLE NEWTABLE (
            COL1 INT,
            COL2 VARCHAR
        );
            BEGIN TRANSACTION
            !!!RESOLVE EWI!!! /*** SSC-EWI-0101 - COMMENTED OUT TRANSACTION LABEL NAME BECAUSE IS NOT APPLICABLE IN SNOWFLAKE ***/!!!
            LabelA;
            INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
        INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
            COMMIT;
    END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0102

Removed statement option from code, already handled in conversion to Snowflake

> **Note:**
>
> This `EWI` is deprecated.

### Severity

Low

#### Description

Snowflake statements could remove some options when they are handled by the conversion rule. So it will be removed from the output code but the functionality is equivalent.

#### Example code

##### Input Code (PostgreSQL):

```sql
 -- Case 1:
TRUNCATE ONLY table_base2 RESTART IDENTITY CASCADE;

-- Case 2:
TRUNCATE TABLE table_inherit_and_generated RESTART IDENTITY CASCADE;
```

##### Generated Code:

```sql
 -- Case 1:
!!!RESOLVE EWI!!! /*** SSC-EWI-0102 - REMOVED ONLY OPTION FROM CODE, ALREADY HANDLED IN CONVERSION TO SNOWFLAKE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0102 - REMOVED CASCADE OPTION FROM CODE, ALREADY HANDLED IN CONVERSION TO SNOWFLAKE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0102 - REMOVED RESTART IDENTITY OPTION FROM CODE, ALREADY HANDLED IN CONVERSION TO SNOWFLAKE ***/!!!
TRUNCATE table_base2;

-- Case 2:
!!!RESOLVE EWI!!! /*** SSC-EWI-0102 - REMOVED CASCADE OPTION FROM CODE, ALREADY HANDLED IN CONVERSION TO SNOWFLAKE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0102 - REMOVED RESTART IDENTITY OPTION FROM CODE, ALREADY HANDLED IN CONVERSION TO SNOWFLAKE ***/!!!
TRUNCATE TABLE table_inherit_and_generated;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0107

Interval Literal Not Supported In Current Scenario

### Severity

High

#### Description

Snowflake Intervals can only be used in arithmetic operations. Intervals used in any other scenario are not supported.

#### Example Code

**Input Code:**

```sql
 SELECT INTERVAL '1-5' YEAR TO MONTH FROM DUAL;
```

**Output Code:**

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0107 - INTERVAL LITERAL IS NOT SUPPORTED BY SNOWFLAKE IN THIS SCENARIO  ***/!!!
 INTERVAL '1-5' YEAR TO MONTH FROM DUAL;
```

##### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0108

The following subquery matches at least one of the patterns considered invalid and may produce compilation errors

### Severity

High

#### Description

In Snowflake, there are multiple patterns and elements in a subquery that are not supported and make it not executable. According to the [Snowflake documentation on subqueries](https://docs.snowflake.com/en/user-guide/querying-subqueries#types-supported-by-snowflake) the following subquery types are **supported**:

* Uncorrelated scalar subqueries in any place that a value expression can be used.
* Correlated scalar subqueries in WHERE clauses.
* EXISTS, ANY / ALL, and IN subqueries in WHERE clauses. These subqueries can be correlated or uncorrelated.

Please note that the list above is not exhaustive, meaning that subqueries that match none of the specified types may still be considered valid.

To help avoid errors, SnowConvert AI knows a set of subquery patterns that normally invalidate subqueries, this EWI is added to warn the user that the subquery matches at least one of these patterns and therefore may produce errors when compiled in Snowflake.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE tableA
(
    col1 INTEGER,
    col2 VARCHAR(20)
);

CREATE TABLE tableB
(
    col3 INTEGER,
    col4 VARCHAR(20)
);

INSERT INTO tableA VALUES (50, 'Hey');

INSERT INTO tableB VALUES (50, 'Hey');
INSERT INTO tableB VALUES (50, 'Example');
INSERT INTO tableB VALUES (10, 'Bye');

-- Snowflake only allows the usage of FETCH in subqueries that are uncorrelated scalar, this subquery execution will fail
SELECT col2
FROM tableA
WHERE col2 = (SELECT col4 FROM tableB WHERE col3 = col1 FETCH FIRST ROW ONLY);

-- This subquery is uncorrelated scalar so FETCH is valid to use
SELECT col2
FROM tableA
WHERE col2 = (SELECT col4 FROM tableB FETCH FIRST ROW ONLY);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE tableA
    (
        col1 INTEGER,
        col2 VARCHAR(20)
    )
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/05/2024",  "domain": "test" }}'
    ;

    CREATE OR REPLACE TABLE tableB
    (
        col3 INTEGER,
        col4 VARCHAR(20)
    )
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/05/2024",  "domain": "test" }}'
    ;

    INSERT INTO tableA
    VALUES (50, 'Hey');

    INSERT INTO tableB
    VALUES (50, 'Hey');

    INSERT INTO tableB
    VALUES (50, 'Example');

    INSERT INTO tableB
    VALUES (10, 'Bye');

    -- Snowflake only allows the usage of FETCH in subqueries that are uncorrelated scalar, this subquery execution will fail
SELECT col2
FROM
    tableA
    WHERE col2 =
                 --** SSC-FDM-0002 - CORRELATED SUBQUERIES MAY HAVE SOME FUNCTIONAL DIFFERENCES. **
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (SELECT
                         ANY_VALUE( col4) FROM
                         tableB
                     WHERE col3 = col1
                     FETCH FIRST 1 ROW ONLY);

    -- This subquery is uncorrelated scalar so FETCH is valid to use
SELECT col2
FROM
    tableA
    WHERE col2 = (SELECT col4 FROM
                         tableB
                     FETCH FIRST 1 ROW ONLY);
```

#### Best Practices

* Check the subquery in Snowflake, if it compiles without problems then this EWI can be safely ignored.
* Please check the complex patterns section for subqueries inside the assessment report, it contains a list of the patterns that normally invalidate subqueries and their occurrences, it can be used to review the migrated subqueries and why are they considered invalid.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0109

Alter Table syntax is not applicable in Snowflake.

### Severity

Medium

#### Description

The Alter Table syntax used is not applicable in Snowflake, then this message is being added.

#### Example Code:

##### Input Code:

```sql
 ALTER TABLE SOMENAME DEFAULT COLLATION SOMENAME;

ALTER TABLE SOMENAME ROW ARCHIVAL;

ALTER TABLE SOMENAME MODIFY CLUSTERING;

ALTER TABLE SOMENAME DROP CLUSTERING;

ALTER TABLE SOMENAME SHRINK SPACE COMPACT CASCADE;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
DEFAULT COLLATION SOMENAME;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
ROW ARCHIVAL;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
MODIFY CLUSTERING;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
DROP CLUSTERING;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
SHRINK SPACE COMPACT CASCADE;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0110

Transformation Not Performed Due To Missing Dependencies

### Severity

Low

#### Description

When there are missing dependencies, the EWI is added to indicate that a transformation cannot be executed. SnowConvert AI utilizes abstract syntax trees to create a semantic model of the input code, which is then used to generate new code that replicates the functionality of the original source. However, in this particular scenario, the transformation could not be completed because the semantic model lacks certain dependencies.

#### Example code

##### Input Code :

```sql
 ALTER TABLE MissingTable ADD
CONSTRAINT constraint1  DEFAULT (suser_name()) FOR col1;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "MissingTable" **
!!!RESOLVE EWI!!! /*** SSC-EWI-0110 - TRANSFORMATION NOT PERFORMED DUE TO MISSING DEPENDENCIES ***/!!!

ALTER TABLE MissingTable
ADD
CONSTRAINT constraint1 DEFAULT (CURRENT_USER()) FOR col1;
```

#### Best Practices

* Add the missing dependencies to the input code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0111

Only one level of nesting is allowed for nested procedures in Snowflake.

### Severity

Critical

#### Description

Snowflake supports only a single level of nesting for procedures. Defining a nested procedure inside another nested procedure is not allowed. If this pattern is detected, this error will be generated.

#### Example code

##### Input Code :

```sql
CREATE OR REPLACE PROCEDURE calculate_executive_salary (
    p_result OUT NUMBER
)
AS
    PROCEDURE calculate_senior_level (
        senior_result OUT NUMBER
    )
    AS
        PROCEDURE calculate_base_level (
            base_result OUT NUMBER
        )
        AS
        BEGIN
            base_result := 75000;
        END calculate_base_level;
    BEGIN
        calculate_base_level(senior_result);
        senior_result := senior_result * 1.5;
    END calculate_senior_level;
BEGIN
    calculate_senior_level(p_result);
END calculate_executive_salary;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE calculate_executive_salary (p_result OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        calculate_senior_level PROCEDURE (senior_result NUMBER(38, 18)
           )
        RETURNS NUMBER
        AS
            DECLARE
                !!!RESOLVE EWI!!! /*** SSC-EWI-0111 - ONLY ONE LEVEL OF NESTING IS ALLOWED FOR NESTED PROCEDURES IN SNOWFLAKE. ***/!!!
                PROCEDURE calculate_base_level (
                    base_result OUT NUMBER
                )
                AS
                BEGIN
                    base_result := 75000;
                END calculate_base_level;
                call_results NUMBER;
            BEGIN
                call_results := (
                CALL
                calculate_base_level(:senior_result)
                );
                senior_result := :call_results;
                senior_result := :senior_result * 1.5;
                RETURN senior_result;
            END;
        call_results NUMBER;
        BEGIN
        call_results := (
            CALL
            calculate_senior_level(:p_result)
        );
        p_result := :call_results;
        END;
$$;
```

#### Best Practices

* Refactor your code to avoid more than one level of nested procedures. Move deeply nested procedures to the top level or restructure your logic to comply with Snowflake’s single-level nesting limitation.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0112

Nested procedure overloading is not supported.

### Severity

Critical

#### Description

Snowflake does not support overloading of nested procedures. In other words, you cannot define multiple nested procedures with the same name but different parameter lists within the same parent procedure. If the source code contains overloaded nested procedures, this error will be generated to indicate that such patterns are not supported in Snowflake.

#### Example code

##### Input Code :

```sql
CREATE OR REPLACE PROCEDURE demonstrate_salary_calculations(
    final_summary OUT VARCHAR2
)
AS
    result1 VARCHAR2(100);
    result2 VARCHAR2(100);
    result3 VARCHAR2(100);

    PROCEDURE calculate_salary(
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Standard: 55000';
    END;

    PROCEDURE calculate_salary(
        base_amount IN NUMBER,
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Calculated: ' || (base_amount * 1.15);
    END;

    PROCEDURE calculate_salary(
        employee_level IN VARCHAR2,
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Level ' || UPPER(employee_level) || ': 60000';
    END;

BEGIN
    calculate_salary(result1);
    calculate_salary(50000, result2);
    calculate_salary('senior', result3);
    final_summary := result1 || ' | ' || result2 || ' | ' || result3;
END demonstrate_salary_calculations;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE demonstrate_salary_calculations (final_summary OUT VARCHAR
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        result1 VARCHAR(100);
        result2 VARCHAR(100);
        result3 VARCHAR(100);
        calculate_salary PROCEDURE(output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Standard: 55000';
                RETURN output;
            END;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED. ***/!!!
        calculate_salary PROCEDURE(base_amount NUMBER(38, 18), output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Calculated: ' || NVL((:base_amount * 1.15) :: STRING, '');
                RETURN output;
            END;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED. ***/!!!
        calculate_salary PROCEDURE(employee_level VARCHAR, output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Level ' || NVL(UPPER(:employee_level) :: STRING, '') || ': 60000';
                RETURN output;
            END;
        call_results VARCHAR;
        BEGIN
        call_results := (
            CALL
            calculate_salary(:result1)
        );
        result1 := :call_results;
        call_results := (
            CALL
            calculate_salary(50000, :result2)
        );
        result2 := :call_results;
        call_results := (
            CALL
            calculate_salary('senior', :result3)
        );
        result3 := :call_results;
        final_summary := NVL(:result1 :: STRING, '') || ' | ' || NVL(:result2 :: STRING, '') || ' | ' || NVL(:result3 :: STRING, '');
        END;
$$;
```

#### Best Practices

* Attempting to overload nested procedures in Snowflake will result in compilation errors or unexpected behavior. To ensure compatibility, you should refactor your code to avoid overloading nested procedures. Consider renaming procedures so that each nested procedure has a unique name within its scope, or restructure your logic to eliminate the need for overloading. Additionally, review and update all procedure calls to use the new unique names.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0113

The usage of Snowflake scripting UDF is not supported in this scenario.

### Severity

Medium

#### Description

The usage of Snowflake Scritping UDFs in specific scenarios is not supported. The following cases are not supported:

* Snowflake Scripting UDFs can’t be used when creating a materialized view.
* Snowflake Scripting UDFs can’t be used to specify a default column value.

#### Example code

##### Input Code :

```sql
CREATE TABLE Table1 (
  col1 INT DEFAULT SnowScriptUdf()
);

CREATE MATERIALIZED VIEW CreateView1
AS
SELECT
  col1,
  SnowScriptUdf() AS col2
FROM Table1;
```

##### Generated Code:

```sql
CREATE OR REPLACE TABLE Table1 (
col1 INT DEFAULT SnowScriptUdf() !!!RESOLVE EWI!!! /*** SSC-EWI-0113 - THE USAGE OF SNOWFLAKE SCRIPTING UDF IS NOT SUPPORTED IN THIS SCENARIO. ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/17/2025",  "domain": "no-domain-provided" }}'
;

CREATE OR REPLACE DYNAMIC TABLE CreateView1
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/17/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
      col1,
      SnowScriptUdf() !!!RESOLVE EWI!!! /*** SSC-EWI-0113 - THE USAGE OF SNOWFLAKE SCRIPTING UDF IS NOT SUPPORTED IN THIS SCENARIO. ***/!!! AS col2
FROM
      Table1;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0114

MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING.

### Severity

Medium

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Description

In database systems like DB2, Teradata, and others, it is possible to declare both CONTINUE and EXIT exception handlers in the same procedural block. However, Snowflake Scripting does not support mixing CONTINUE and EXIT handlers within the same EXCEPTION block.

When SnowConvert AI encounters a procedure with both types of handlers declared in the same block, it generates separate EXCEPTION blocks for each handler type and adds this EWI to indicate that manual review and testing are required to ensure the converted code maintains the intended behavior.

**Key Behavioral Differences:**

* **CONTINUE HANDLER**: Allows execution to continue after handling the exception
* **EXIT HANDLER**: Terminates the current block after handling the exception

Since Snowflake cannot mix these behaviors in a single EXCEPTION block, the conversion may result in different execution flow compared to the source system.

#### Example Code

##### Input Code:

**DB2**

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
BEGIN
    DECLARE test_1 INTEGER DEFAULT 10;

    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO error_test VALUES ('EXCEPTION');

    DECLARE EXIT HANDLER FOR SQLSTATE '20000'
        INSERT INTO error_test VALUES ('ERROR 2000');

    SET test_1 = 1 / 0;
    INSERT INTO error_test VALUES ('EXIT');
END;
```

##### Generated Code:

**Snowflake**

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        test_1 INTEGER DEFAULT 10;
    BEGIN
        test_1 := 1 / 0;
        INSERT INTO error_test VALUES ('EXIT');
        EXCEPTION
            WHEN OTHER CONTINUE THEN
                INSERT INTO error_test VALUES ('EXCEPTION')
        !!!RESOLVE EWI!!! /*** SSC-EWI-0114 - MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '20000') THEN
                        INSERT INTO error_test VALUES ('ERROR 2000')
                END
    END;
$$;
```

#### Best Practices

When dealing with mixed CONTINUE and EXIT handlers:

1. **Review Exception Handling Logic**: Carefully review the converted code to understand how exceptions are handled in each block.
2. **Test Thoroughly**: Test all error scenarios to ensure the behavior matches the source system’s expectations.
3. **Consider Refactoring**: If possible, refactor the code to use only one type of handler (either all CONTINUE or all EXIT) within a block.
4. **Use Nested Blocks**: Consider restructuring the logic using nested BEGIN…END blocks, where each block has its own exception handling strategy.
5. **Document Behavior Changes**: Document any differences in exception handling behavior for future maintenance.

##### Recommended Pattern

Instead of mixing handlers, consider this approach:

```sql
BEGIN
    -- Handle operations that should continue on error
    BEGIN
        operation1();
        operation2();
    EXCEPTION
        WHEN OTHER CONTINUE THEN
            log_error('Continue handler');
    END;

    -- Handle operations that should exit on error
    BEGIN
        critical_operation();
    EXCEPTION
        WHEN OTHER EXIT THEN
            log_error('Exit handler');
    END;
END;
```

#### Related Documentation

* [DB2 CONTINUE HANDLER](../../../../translation-references/db2/db2-continue-handler.md)
* [DB2 EXIT HANDLER](../../../../translation-references/db2/db2-exit-handler.md)
* [Teradata Exception Handlers](../../../../translation-references/teradata/teradata-to-snowflake-scripting-translation-reference.md)
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0115

Iceberg table contains unsupported datatypes.

### Severity

Medium

#### Description

This EWI is emitted for tables that contain datatypes currently not supported by Snowflake on Iceberg tables.
Currently, Snowflake offers support for Iceberg tables in V2 format.

#### Example code

##### Input Code :

```sql
-- Additional Params: --TablesTransformationTarget SnowflakeIceberg
CREATE TABLE unsupported_types_table
(
  column1 TIMESTAMP(8) WITH TIME ZONE,
  column2 JSON(1000),
  column3 XML(1000)
);
```

##### Generated Code:

```sql
 -- Additional Params: --TablesTransformationTarget SnowflakeIceberg
!!!RESOLVE EWI!!! /*** SSC-EWI-0115 - ICEBERG TABLE CONTAINS THE FOLLOWING UNSUPPORTED DATATYPES: TIMESTAMP(8) WITH TIME ZONE, JSON(1000), XML(1000) ***/!!!
CREATE OR REPLACE ICEBERG TABLE unsupported_types_table
(
 column1 TIMESTAMP_TZ(8),
 column2 VARIANT,
 column3 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XML DATA TYPE CONVERTED TO VARIANT ***/!!!
)
CATALOG = 'SNOWFLAKE'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 1,  "minor": 0,  "patch": "0.0" }, "attributes": {  "component": "teradata",  "convertedOn": "12/16/2025",  "domain": "no-domain-provided",  "migrationid": "9CebAVkM33qsfTnTrMh3Dw==" }}'
;
```

#### Best Practices

* Consider modifying the columns and logic to make use of datatypes supported in Iceberg tables
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0116

Snowflake does not support interval values inside semi-structured type columns.

### Severity

Medium

#### Description

This EWI is emitted when the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled and an INTERVAL data type appears inside a semi-structured type column such as ARRAY, MAP, or STRUCT. Snowflake does not support storing INTERVAL values inside VARIANT-based columns. The outer type is still converted (for example, STRUCT becomes VARIANT), but the EWI warns that the INTERVAL values within cannot be preserved.

For more details on how interval types are handled across languages, see the [Interval Data Types](../../../../translation-references/general/interval-data-types.md) translation reference.

#### Example Code

##### Input Code (BigQuery):

```sql
CREATE TABLE test.table1
(
  col1 ARRAY<INTERVAL>
);
```

##### Generated Code (BigQuery):

```sql
CREATE TABLE test.table1 (
  col1 ARRAY !!!RESOLVE EWI!!! /*** SSC-EWI-0116 - SNOWFLAKE DOES NOT SUPPORT INTERVAL VALUES INSIDE SEMI-STRUCTURED TYPE COLUMNS ***/!!! DEFAULT []
)
;
```

##### Input Code (Hive):

```sql
CREATE TABLE tb1
(col1 STRUCT<a:INTERVAL, b:INT>);
```

##### Generated Code (Hive):

```sql
CREATE TABLE tb1 (
  col1 VARIANT /*** SSC-FDM-0034 - STRUCT<INTERVAL, INT> CONVERTED TO VARIANT. SOME OF ITS USAGES MIGHT HAVE FUNCTIONAL DIFFERENCES. ***/!!!RESOLVE EWI!!! /*** SSC-EWI-0116 - SNOWFLAKE DOES NOT SUPPORT INTERVAL VALUES INSIDE SEMI-STRUCTURED TYPE COLUMNS ***/!!!
)
;
```

#### Best Practices

* Consider extracting interval values from semi-structured columns into dedicated INTERVAL-typed columns
* If interval values must be stored in VARIANT columns, store them as strings and convert back to intervals when needed using CAST expressions
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0117

Snowflake does not support interval data type in UDFs or Snowflake Scripting.

### Severity

Medium

#### Description

This EWI is emitted when the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled and an INTERVAL data type is used in a context not yet supported by Snowflake Scripting: UDF or procedure parameters, return types, or variable declarations. The INTERVAL type is preserved in the output for reference, but the EWI warns that it will not work at runtime in these contexts.

For more details on how interval types are handled across languages, see the [Interval Data Types](../../../../translation-references/general/interval-data-types.md) translation reference.

#### Example Code

##### Input Code (BigQuery):

```sql
CREATE FUNCTION test.fn1(p1 INTERVAL)
RETURNS INT64
AS (1);
```

##### Generated Code (BigQuery):

```sql
CREATE FUNCTION test.fn1 (p1 INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/!!!RESOLVE EWI!!! /*** SSC-EWI-0117 - SNOWFLAKE DOES NOT SUPPORT THE INTERVAL DATA TYPE IN UDF/PROCEDURE PARAMETERS, RETURN TYPES, OR VARIABLE DECLARATIONS ***/!!!)
RETURNS INT
AS
$$
  1
$$;
```

##### Input Code (Teradata):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE PROCEDURE test_proc(IN p1 INTERVAL DAY TO SECOND)
BEGIN
  SELECT 1;
END;
```

##### Generated Code (Teradata):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE OR REPLACE PROCEDURE test_proc (P1 INTERVAL DAY TO SECOND !!!RESOLVE EWI!!! /*** SSC-EWI-0117 - SNOWFLAKE DOES NOT SUPPORT THE INTERVAL DATA TYPE IN UDF/PROCEDURE PARAMETERS, RETURN TYPES, OR VARIABLE DECLARATIONS ***/!!!)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    SELECT 1;
  END;
$$;
```

#### Best Practices

* Consider using VARCHAR parameters for interval values and converting them inside the procedure body using CAST expressions
* If the interval parameter is used only for datetime arithmetic within the procedure, consider passing the numeric components separately
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0118

Snowflake does not support interval columns in Dynamic Tables.

### Severity

Medium

#### Description

This EWI is emitted when the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled and a materialized view (converted to a Snowflake Dynamic Table) references columns with INTERVAL data types. Snowflake Dynamic Tables do not support INTERVAL-typed columns. The Dynamic Table is still generated, but the EWI warns that it may fail at runtime.

For more details on how interval types are handled across languages, see the [Interval Data Types](../../../../translation-references/general/interval-data-types.md) translation reference.

#### Example Code

##### Input Code (BigQuery):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE src_table
(
  col1 INT64,
  col2 INTERVAL
);

CREATE MATERIALIZED VIEW mv1
AS
SELECT col1, col2 FROM src_table;
```

##### Generated Code (BigQuery):

```sql
-- Additional Params: --UseIntervalDatatype
CREATE TABLE src_table (
  col1 INT,
  col2 INTERVAL DAY TO SECOND /*** SSC-FDM-0042 - INTERVAL QUALIFIER CHANGED TO DAY TO SECOND, SNOWFLAKE DOES NOT SUPPORT MIXING YEAR TO MONTH AND DAY TO SECOND TIME PARTS. ***/
)
;

!!!RESOLVE EWI!!! /*** SSC-EWI-0118 - SNOWFLAKE DOES NOT SUPPORT INTERVAL COLUMNS IN DYNAMIC TABLES ***/!!!
CREATE OR REPLACE DYNAMIC TABLE mv1
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
AS
  SELECT
    col1,
    col2
  FROM
    src_table;
```

#### Best Practices

* Consider excluding INTERVAL columns from the Dynamic Table query, or casting them to VARCHAR before selecting
* If the interval values are needed in downstream queries, consider creating a regular view instead of a Dynamic Table for the INTERVAL columns
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-0119

Interval type column was converted to VARCHAR.

### Severity

Low

#### Description

This EWI is emitted in Dynamic Table contexts when the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is **not** enabled and a source column had an INTERVAL data type that was converted to VARCHAR. This alerts users that the column in the Dynamic Table query originally referenced an interval-typed column that lost its type during conversion.

#### Example Code

##### Input Code (BigQuery):

```sql
CREATE TABLE src_table
(
  col1 INT64,
  col2 INTERVAL
);

CREATE MATERIALIZED VIEW mv1
AS
SELECT col1, col2 FROM src_table;
```

##### Generated Code (BigQuery):

```sql
CREATE TABLE src_table (
  col1 INT,
  col2 VARCHAR(30) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
;

CREATE OR REPLACE DYNAMIC TABLE mv1
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
AS
  SELECT
    col1,
    col2 !!!RESOLVE EWI!!! /*** SSC-EWI-0119 - INTERVAL TYPE COLUMN WAS CONVERTED TO VARCHAR ***/!!!
  FROM
    src_table;
```

#### Best Practices

* Consider enabling the `--UseIntervalDatatype` [preview flag](../../../getting-started/running-snowconvert/conversion/preview-conversion-settings.md) to preserve native INTERVAL types where possible
* Review the VARCHAR columns in the output to ensure the string representation of interval values is compatible with your application logic
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - General Performance Review Messages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md
section: Migrations
---

# SnowConvert AI - General Performance Review Messages

## SSC-PRF-0001

This statement has usages of cursor fetch bulk operations

### Description

This warning indicates that the statement uses cursor fetch bulk operations. These operations allow you to retrieve multiple rows of data from a cursor at once, instead of one row at a time. Using bulk operations improves performance by reducing the number of communications needed between the client and server.

This pattern can become complex if not implemented correctly. For example, retrieving too many rows in a single fetch operation can consume excessive memory. It’s crucial to maintain a balance between the number of rows fetched and the available memory resources.

### Code Example

#### Oracle

##### Input

```sql
 CREATE OR REPLACE PROCEDURE oracle_cursor_fetch_bulk AS
--cursor and variable declarations
BEGIN
    OPEN c1;
    FETCH c1 BULK COLLECT INTO col1;
    CLOSE c1;
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE oracle_cursor_fetch_bulk ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
--cursor and variable declarations
$$
    BEGIN
        OPEN c1;
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        c1 := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:c1)
        );
        col1 := :c1:RESULT;
        CLOSE c1;
    END;
$$;
```

### Best Practices

* For additional support, please contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0002

Case insensitive columns can decrease performance of queries

### Description

Using collation in Snowflake can impact query performance, particularly in WHERE clauses. To learn more about how collation affects performance, please refer to the [Performance Implications of Using Collation](https://docs.snowflake.com/en/sql-reference/collation#performance-implications-of-using-collation).

A warning has been generated to indicate that a column was created with case-insensitive collation. Using this column in queries may cause slower performance.

### Code examples

#### Output

```sql
 CREATE TABLE exampleTable
(
    col1 CHAR(10),
    col2 CHAR(20) COLLATE 'en-ci' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Oracle

##### Input

```sql
 CREATE TABLE exampleTable (
    col1 VARCHAR(50) COLLATE BINARY_CI,
    col2 VARCHAR(50) COLLATE BINARY_CS
);
```

##### Output

```sql
 CREATE OR REPLACE TABLE exampleTable (
       col1 VARCHAR(50) COLLATE BINARY_CI /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/,
       col2 VARCHAR(50) COLLATE BINARY_CS
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
   ;
```

#### Microsoft SQL Server

##### Input

```sql
 CREATE TABLE exampleTable (
    col1 VARCHAR(50) COLLATE Latin1_General_CI_AS,
    col2 VARCHAR(50) COLLATE Latin1_General_CS_AS
);
```

##### Output

```sql
 CREATE OR REPLACE TABLE exampleTable (
    col1 VARCHAR(50) COLLATE 'EN-CI-AS' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/,
    col2 VARCHAR(50) COLLATE 'EN-CS-AS'
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

### Best Practices

* If your application’s performance is significantly affected by case-insensitive collation, consider rewriting your code to avoid using it. However, if the performance impact is acceptable, you can ignore this warning.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0003

Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance

### Severity

Low

### Description

This warning appears when a `FETCH` statement is detected within a loop. The `FETCH` statement retrieves and processes individual rows from a result set one at a time.

Processing large datasets using cursors within loops can become complex, especially when:

* Multiple table joins are involved
* Complex calculations are required
* Large numbers of rows need to be processed

This pattern may lead to performance issues and can be difficult to maintain as the data volume grows.

#### Code Example

#### Teradata

##### Input

```sql
 REPLACE PROCEDURE teradata_fetch_inside_loop()
DYNAMIC RESULT SETS 1
BEGIN
    DECLARE col_name VARCHAR(200);
    DECLARE col_int INTEGER DEFAULT 0;
    DECLARE cursor_var CURSOR FOR SELECT some_column FROM tabla1;
    WHILE (col_int <> 0) DO
        FETCH cursor_var INTO col_name;
        SET col_int = col_int + 1;
    END WHILE;
END;
```

##### Output

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "tabla1" **
CREATE OR REPLACE PROCEDURE teradata_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        col_name VARCHAR(200);
        col_int INTEGER DEFAULT 0;
    BEGIN

        LET cursor_var CURSOR
        FOR
            SELECT
                some_column FROM
                tabla1;
                WHILE (:col_int <> 0) LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
                    FETCH cursor_var INTO col_name;
            col_int := col_int + 1;
                END LOOP;
    END;
$$;
```

#### Oracle

##### Input

```sql
 CREATE PROCEDURE oracle_fetch_inside_loop
IS
  var1 table1.column1%TYPE;
  CURSOR cursor1 IS SELECT COLUMN_NAME FROM table1;
BEGIN
  WHILE true LOOP
    FETCH cursor1 INTO var1;
    EXIT WHEN cursor1%NOTFOUND;
  END LOOP;
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE oracle_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    var1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'table1.column1%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    cursor1 CURSOR
    FOR
      SELECT COLUMN_NAME FROM
        table1;
  BEGIN
    WHILE (true)
                 --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                 LOOP
      --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
         FETCH cursor1 INTO
        :var1;
         IF (var1 IS NULL) THEN
        EXIT;
         END IF;
       END LOOP;
  END;
$$;
```

#### SQL Server

##### Input

```sql
 CREATE OR ALTER PROCEDURE transact_fetch_inside_loop
AS
BEGIN
    DECLARE cursor1 CURSOR
        FOR SELECT col1 FROM my_table;
    WHILE 1=0
    BEGIN
       FETCH NEXT FROM @cursor1 INTO @variable1;
    END
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE transact_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
        cursor1 CURSOR
        FOR
            SELECT
                col1
            FROM
                my_table;
    BEGIN

        WHILE (1=0) LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            FETCH
                CURSOR1
                INTO
                :VARIABLE1;
        END LOOP;
    END;
$$;
```

### Best Practices

* To improve performance and avoid complex patterns, use set-based operations instead of loops. Replace row-by-row processing with SQL statements (SELECT, UPDATE, DELETE) that operate on multiple rows simultaneously using WHERE clauses. This approach is more efficient and easier to maintain.

#### Oracle

```sql
 CREATE OR REPLACE PROCEDURE cursor_fetch_inside_loop
AS
  record_employee employees%rowtype;
  CURSOR emp_cursor IS SELECT * FROM employees;
BEGIN
  OPEN emp_cursor;
  LOOP
    FETCH emp_cursor INTO record_employee;
    EXIT WHEN emp_cursor%notfound;
    INSERT INTO new_employees VALUES (record_employee.first_name, record_employee.last_name);
  END LOOP;
  CLOSE emp_cursor;
END;
```

#### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE cursor_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    record_employee OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    emp_cursor CURSOR
    FOR
      SELECT
        OBJECT_CONSTRUCT( *) sc_cursor_record FROM
        employees;
  BEGIN
    OPEN emp_cursor;
    --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
      --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
      FETCH emp_cursor INTO
        :record_employee;
      IF (record_employee IS NULL) THEN
        EXIT;
      END IF;
      INSERT INTO new_employees
      SELECT
        :record_employee:FIRST_NAME,
        :record_employee:LAST_NAME;
    END LOOP;
  CLOSE emp_cursor;
  END;
$$;
```

Set-based operations can be used to process data more efficiently.

```sql
 CREATE OR REPLACE PROCEDURE cursor_fetch_inside_loop AS
BEGIN
  INSERT INTO new_employees (first_name, last_name)
  SELECT first_name, last_name FROM employees;
END;
```

Set-based operations can be used to process data more efficiently.

```sql
 CREATE OR REPLACE PROCEDURE cursor_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    INSERT INTO new_employees(first_name, last_name)
    SELECT first_name, last_name FROM
      employees;
  END;
$$;
```

### Best Practices

* For additional support, please contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0004

This statement has usages of cursor for loop

### Severity

None

### Description

This warning indicates that the statement contains cursor for loops. A cursor for loop is a programming structure that processes query results one row at a time, allowing you to work with individual records from a result set.

This warning helps identify potential performance issues in cursor FOR loops. Performance problems may arise when:

* The SELECT statement within the cursor returns a large dataset
* The loop contains complex operations
* The loop contains nested loops

While SnowConvert AI can detect these patterns, you should review and optimize the code to ensure efficient execution.

#### Code Example

#### Teradata

##### Input

```sql
 REPLACE PROCEDURE teradata_cursor_for_loop()
BEGIN
    FOR fUsgClass AS cUsgClass CURSOR FOR
        (SELECT col1
        FROM sample_table)
    DO
        SET var1 = fUsgClass.col1;
    END FOR;
END;
```

##### Output

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "sample_table" **
CREATE OR REPLACE PROCEDURE teradata_cursor_for_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-0110 - TRANSFORMATION NOT PERFORMED DUE TO MISSING DEPENDENCIES ***/!!!
        temp_fUsgClass_col1;
    BEGIN
        LET cUsgClass CURSOR
        FOR
            SELECT
                col1
                   FROM
                sample_table;
        --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
        FOR fUsgClass IN cUsgClass DO
            temp_fUsgClass_col1 := fUsgClass.col1;
            var1 := :temp_fUsgClass_col1;
        END FOR;
    END;
$$;
```

#### Oracle

##### Input

```sql
 CREATE OR REPLACE PROCEDURE oracle_cursor_for_loop AS
BEGIN
    FOR r1 IN (SELECT col1 FROM sample_table) LOOP
        NULL;
    END LOOP;
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE oracle_cursor_for_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        LET temporary_for_cursor_0 CURSOR
        FOR
            (SELECT col1 FROM
                    sample_table
            );
        --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
        FOR r1 IN temporary_for_cursor_0 DO
            NULL;
        END FOR;
    END;
$$;
```

### Best Practices

* For additional support, please contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0005

The statement below has usages of nested cursors

> **Note:**
>
> For better readability, we have simplified some sections of the code in this example.

### Severity

None

### Description

This warning indicates that the statement contains nested cursors. A cursor is a database feature that lets you process rows from a query result one at a time. Nested cursors occur when you use one cursor inside another cursor’s loop, which can impact performance and should be used with caution.

Nested cursors can significantly slow down your code’s performance, particularly when working with large amounts of data. This is because each time a cursor operates, it needs to communicate with the database server, creating additional processing overhead and delays.

### Code examples

#### SQL Server

##### Input

```sql
 CREATE OR ALTER PROCEDURE procedureSample
AS
BEGIN
  DECLARE
    @outer_category_id INT,
    @outer_category_name NVARCHAR(50),
    @inner_product_name NVARCHAR(50);

  -- Define the outer cursor
  DECLARE outer_cursor CURSOR FOR
    SELECT category_id, category_name FROM categories;

  -- Open the outer cursor
  OPEN @outer_cursor;

  -- Fetch the first row from the outer cursor
  FETCH NEXT FROM outer_cursor INTO @outer_category_id, @outer_category_name;

  -- Start the outer loop
  WHILE @@FETCH_STATUS = 0
  BEGIN

    PRINT 'Category: ' + @outer_category_name;

    -- Define the inner cursor
    DECLARE inner_cursor CURSOR FOR
      SELECT product_name FROM products WHERE category_id = @outer_category_id;

    -- Open the inner cursor
    OPEN inner_cursor;
	FETCH NEXT FROM inner_cursor INTO @inner_product_name;

    WHILE @@FETCH_STATUS = 0
    BEGIN
      PRINT 'Product: ' + @inner_product_name + ' Category: ' + CAST(@outer_category_id AS NVARCHAR(10));

      -- Fetch the next row from the inner cursor
      FETCH NEXT FROM inner_cursor INTO @inner_product_name;
    END;

    -- Close the inner cursor
    CLOSE inner_cursor;
    DEALLOCATE inner_cursor;

    -- Fetch the next row from the outer cursor
    FETCH NEXT FROM outer_cursor INTO @outer_category_id, @outer_category_name;
  END;

  -- Close the outer cursor
  CLOSE outer_cursor;
  DEALLOCATE outer_cursor;

END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE procedureSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		OUTER_CATEGORY_ID INT;
		OUTER_CATEGORY_NAME VARCHAR(50);
		INNER_PRODUCT_NAME VARCHAR(50);

		-- Define the outer cursor
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		outer_cursor CURSOR
		FOR
			SELECT
				category_id,
				category_name
			FROM
				categories;

		-- Define the inner cursor
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		inner_cursor CURSOR
		FOR
			SELECT
				product_name
			FROM
				products
			WHERE
				category_id = :OUTER_CATEGORY_ID;
	BEGIN

		-- Open the outer cursor
		--** SSC-PRF-0005 - THE STATEMENT BELOW HAS USAGES OF NESTED CURSORS. **
		OPEN OUTER_CURSOR;
  -- Fetch the first row from the outer cursor
		FETCH
			outer_cursor
			INTO
			:OUTER_CATEGORY_ID,
			:OUTER_CATEGORY_NAME;

			-- Start the outer loop

			  -- Define the inner cursor
			WHILE (:FETCH_STATUS = 0) LOOP
			!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PRINT' NODE ***/!!!

			  PRINT 'Category: ' + @outer_category_name;

			-- Open the inner cursor
			OPEN inner_cursor;
			--** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
			FETCH
				inner_cursor
			INTO
				:INNER_PRODUCT_NAME;
			WHILE (:FETCH_STATUS = 0) LOOP
				!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PRINT' NODE ***/!!!
				PRINT 'Product: ' + @inner_product_name + ' Category: ' + CAST(@outer_category_id AS NVARCHAR(10));
				-- Fetch the next row from the inner cursor
				--** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
				FETCH
					inner_cursor
				INTO
					:INNER_PRODUCT_NAME;
			END LOOP;
			-- Close the inner cursor
			CLOSE inner_cursor;
			!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'DEALLOCATE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
			  DEALLOCATE inner_cursor;
			-- Fetch the next row from the outer cursor
			--** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
			FETCH
				outer_cursor
			INTO
				:OUTER_CATEGORY_ID,
				:OUTER_CATEGORY_NAME;
			END LOOP;
  -- Close the outer cursor
			CLOSE outer_cursor;
			!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'DEALLOCATE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
			DEALLOCATE outer_cursor;
	END;
$$;
```

#### Oracle

#### Explicit cursor

##### Input

```sql
 CREATE OR REPLACE PROCEDURE procedureSample AS
BEGIN
DECLARE
  CURSOR outer_cursor IS
    SELECT category_id, category_name FROM categories;

  CURSOR inner_cursor (p_category_id NUMBER) IS
    SELECT product_name FROM products WHERE category_id = p_category_id;

  outer_category_id categories.category_id%TYPE;
  outer_category_name categories.category_name%TYPE;
  inner_product_name products.product_name%TYPE;
BEGIN

  OPEN outer_cursor;
  FETCH outer_cursor INTO outer_category_id, outer_category_name;

  LOOP
    EXIT WHEN outer_cursor%NOTFOUND;
    DBMS_OUTPUT.PUT_LINE('Category: ' || outer_category_name);

    OPEN inner_cursor(outer_category_id);
    LOOP
        FETCH inner_cursor INTO inner_product_name;
        EXIT WHEN inner_cursor%NOTFOUND;
        DBMS_OUTPUT.PUT_LINE('Product: ' || inner_product_name || ' Category: ' || outer_category_id);
    END LOOP;
    CLOSE inner_cursor;

    FETCH outer_cursor INTO outer_category_id, outer_category_name;
  END LOOP;

  CLOSE outer_cursor;
END;
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE procedureSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    DECLARE
      --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
      outer_cursor CURSOR
      FOR
        SELECT category_id, category_name FROM
          categories;
      --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
      inner_cursor CURSOR
      FOR
        SELECT product_name FROM
          products
        WHERE category_id = ?;
      outer_category_id VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'categories.category_id%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
      outer_category_name VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'categories.category_name%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
      inner_product_name VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'products.PRODUCT_NAME%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
      call_results VARIANT;
    BEGIN
      --** SSC-PRF-0005 - THE STATEMENT BELOW HAS USAGES OF NESTED CURSORS. **
      OPEN outer_cursor USING ('DEFAULT VALUE NOT FOUND');
      FETCH outer_cursor INTO
        :outer_category_id,
        :outer_category_name;
      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
      LOOP
        IF (outer_category_id IS NULL) THEN
          EXIT;
        END IF;
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        call_results := (
          CALL DBMS_OUTPUT.PUT_LINE_UDF('Category: ' || NVL(:outer_category_name :: STRING, ''))
        );
        OPEN inner_cursor USING (:outer_category_id);
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
          --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            FETCH inner_cursor INTO
            :inner_product_name;
          IF (inner_product_name IS NULL) THEN
            EXIT;
          END IF;
          --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
          call_results := (
            CALL DBMS_OUTPUT.PUT_LINE_UDF('Product: ' || NVL(:inner_product_name :: STRING, '') || ' Category: ' || NVL(:outer_category_id :: STRING, ''))
          );
        END LOOP;
        CLOSE inner_cursor;
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        FETCH outer_cursor INTO
          :outer_category_id,
          :outer_category_name;
      END LOOP;
      CLOSE outer_cursor;
      RETURN call_results;
    END;
  END;
$$;
```

#### Implicit Cursor

##### Input

```sql
 CREATE OR REPLACE PROCEDURE procedureSample AS
BEGIN
DECLARE
   inner_category_id categories.category_name%TYPE;
   inner_product_name products.product_name%TYPE;
   inner_cursor SYS_REFCURSOR;
BEGIN
   FOR outer_cursor IN (SELECT category_id, category_name FROM categories)
   LOOP
      OPEN inner_cursor
       FOR SELECT product_name, category_id FROM products WHERE category_id = outer_cursor.category_id;
      LOOP
         FETCH inner_cursor INTO inner_product_name, inner_category_id;
         EXIT WHEN inner_cursor%NOTFOUND;
         dbms_output.put_line( 'Category id: '|| outer_cursor.category_id);
         dbms_output.put_line('Product name: ' || inner_product_name);
      END LOOP;
      CLOSE inner_cursor;
   END LOOP;
END;
END;
```

##### Output

```sql
 CREATE OR REPLACE PROCEDURE procedureSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      DECLARE
         inner_category_id VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'categories.category_name%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
         inner_product_name VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'products.product_name%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
         inner_cursor_res RESULTSET;
         call_results VARIANT;
      BEGIN
         LET temporary_for_cursor_0 CURSOR
         FOR
            (SELECT category_id, category_name FROM
                  categories
            );
         --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
         --** SSC-PRF-0005 - THE STATEMENT BELOW HAS USAGES OF NESTED CURSORS. **
         FOR outer_cursor IN temporary_for_cursor_0 DO
            LET inner_cursor CURSOR
            FOR
               SELECT product_name, category_id FROM
                  products
               WHERE category_id = outer_cursor.category_id;
            OPEN inner_cursor;
            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                 LOOP
               --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
                    FETCH inner_cursor INTO
                  :inner_product_name,
                  :inner_category_id;
               IF (inner_product_name IS NULL) THEN
                  EXIT;
               END IF;
               --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
               call_results := (
                  CALL dbms_output.put_line( 'Category id: ' || NVL(outer_cursor.category_id :: STRING, ''))
               );
               --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
               call_results := (
                  CALL dbms_output.put_line('Product name: ' || NVL(:inner_product_name :: STRING, ''))
               );
                 END LOOP;
                 CLOSE inner_cursor;
         END FOR;
         RETURN call_results;
      END;
   END;
$$;
```

### Best Practices

* Nested cursors should be avoided as they can negatively impact performance and make code more complex.
* Instead of nested cursors, use SQL features such as:

  + SQL functions
  + Joins
  + Subqueries
  + Window functions
  + Common Table Expressions (CTEs)
  + Recursive queries
    These alternatives process data in bulk and are more efficient.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0006

Nested cursor inside query is not supported in Snowflake

### Severity

None

### Description

This message appears when a query contains a cursor definition. When a cursor expression is evaluated, it returns and automatically opens a nested cursor. For more details, see [Oracle Cursor Expression](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/CURSOR-Expressions.html#GUID-B28362BE-8831-4687-89CF-9F77DB3698D2).

### Code examples

#### Input

```sql
 SELECT
  category_id,
  category_name,
  CURSOR (
    SELECT
      product_id,
      product_name || ', ' || category_id
    FROM
      products e
    WHERE
      e.category_id = d.category_id
  ) EMP_CUR
FROM
  categories d;
```

#### Output

```sql
 SELECT
  category_id,
  category_name,
  --** SSC-PRF-0006 - NESTED CURSOR INSIDE QUERY IS NOT SUPPORTED IN SNOWFLAKE. **
  CURSOR
    !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (
    SELECT
      product_id,
      NVL(
      product_name :: STRING, '') || ', ' || NVL(category_id :: STRING, '')
    FROM
      products e
    WHERE
      e.category_id = d.category_id
  ) EMP_CUR
FROM
  categories d;
```

### Best Practices

* We recommend avoiding cursors as they can negatively affect performance and make code more complex.
* Instead of using nested cursors, consider these alternatives:

  + SQL functions
  + Joins
  + Subqueries
  + Window functions
  + Common Table Expressions (CTEs)
  + Recursive queries
    These options are better for processing large amounts of data efficiently.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0007

PERFORMANCE REVIEW - CLUSTER BY

### Description

Marks where the usage of CLUSTER BY may cause performance issues.

#### Example Code

##### Teradata:

```sql
 CREATE MULTISET TABLE T_2008,
NO FALLBACK,
NO BEFORE JOURNAL,
NO AFTER JOURNAL,
CHECKSUM = DEFAULT,
DEFAULT MERGEBLOCKRATIO
(
      COL1 NUMBER(20,0) NOT NULL,
      COL2 INTEGER,
      COL3 VARCHAR(4) CHARACTER SET LATIN NOT CASESPECIFIC,
      COL4 DATE FORMAT 'YYYY-MM-DD'
)
PRIMARY INDEX
(
      COL1, COL2
)
PARTITION BY ( RANGE_N(COL4 BETWEEN DATE '2010-01-01' AND DATE '2025-12-31' EACH INTERVAL '1' YEAR ),
CASE_N(
COL3  = 'T',
COL3 = 'M',
COL3 = 'L') ); -- PARTITION BY transformed to CLUSTER BY
```

##### Snowflake:

```sql
CREATE OR REPLACE TABLE T_2008
(
      COL1 NUMBER(20,0) NOT NULL,
      COL2 INTEGER,
      COL3 VARCHAR(4),
      COL4 DATE
)
--** SSC-PRF-0007 - PERFORMANCE REVIEW - CLUSTER BY **
CLUSTER BY (
             !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - RANGE_N FUNCTION NOT SUPPORTED ***/!!!
             RANGE_N(COL4 BETWEEN DATE '2010-01-01' AND DATE '2025-12-31' EACH INTERVAL '1' YEAR ),
!!!RESOLVE EWI!!! /*** SSC-EWI-0031 - CASE_N FUNCTION NOT SUPPORTED ***/!!!
CASE_N(
COL3  = 'T',
COL3 = 'M',
COL3 = 'L'))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
; -- PARTITION BY transformed to CLUSTER BY
```

##### Transact:

```sql
 CREATE TABLE my_table (
    enterprise_cif INT,
    name NVARCHAR(100),
    address NVARCHAR(255),
    created_at DATETIME
)
WITH (
    DISTRIBUTION = HASH(enterprise_cif),
    CLUSTERED INDEX (enterprise_cif)
);
```

##### Snowflake:

```sql
 CREATE OR REPLACE TABLE my_table (
  enterprise_cif INT,
  name VARCHAR(100),
  address VARCHAR(255),
  created_at TIMESTAMP_NTZ(3)
)
--** SSC-PRF-0007 - PERFORMANCE REVIEW - CLUSTER BY **
CLUSTER BY (enterprise_cif)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2024" }}'
;
```

#### Best Practices

* Review the code in order to identify possible performance issues. More information about this topic can be read [here](https://docs.snowflake.com/en/user-guide/tables-clustering-keys.html).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0008

### Description

This message appears when SnowConvert AI detects loop usage in procedural code. Loops such as `LOOP`, `WHILE`, and `FOR` can lead to row-by-row processing and may degrade performance in Snowflake, especially when the loop iterates over large datasets or contains complex logic. The message is informational and prompts a review of the pattern.

#### Code Example

##### PostgreSQL:

```sql
CREATE OR REPLACE FUNCTION loop_example() RETURNS void AS $$
BEGIN
  FOR i IN 1..10 LOOP
    NULL;
  END LOOP;
END;
$$ LANGUAGE plpgsql;
```

##### Snowflake:

```sql
CREATE OR REPLACE PROCEDURE loop_example ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    FOR i IN 1 TO 10
                     LOOP
      NULL;
    END LOOP;
  END;
$$;
;
```

### Best practices

* Prefer set-based SQL operations (SELECT, INSERT, UPDATE, DELETE) over row-by-row loops.
* Avoid nested loops when possible; use joins, CTEs, or window functions instead.
* If loops are required, keep iterations small and limit expensive operations inside the loop.
* Consider refactoring procedural logic into single statements or bulk operations.

## SSC-PRF-0009

CURSOR usage review

### Severity

None

### Description

This message appears when SnowConvert AI detects a cursor declaration in procedural code. Cursors allow row-by-row processing of query results, which can lead to performance issues in Snowflake, especially when processing large datasets.

While cursors are valid in Snowflake Scripting, they introduce overhead because:

* Each row is processed individually rather than as a set
* Multiple round trips to the database may be required
* Memory usage can be higher compared to set-based operations

This warning is informational and prompts a review of whether the cursor usage is necessary or can be replaced with more efficient set-based operations.

### Code Example

#### Oracle

##### Input

```sql
CREATE OR REPLACE PROCEDURE get_first_employee AS
  CURSOR emp_cursor IS SELECT employee_id, first_name FROM employees;
  v_emp_id NUMBER;
  v_name VARCHAR2(100);
BEGIN
  OPEN emp_cursor;
  FETCH emp_cursor INTO v_emp_id, v_name;
  CLOSE emp_cursor;
END;
```

##### Output

```sql
CREATE OR REPLACE PROCEDURE get_first_employee ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    emp_cursor CURSOR
    FOR
      SELECT employee_id, first_name FROM
        employees;
    v_emp_id NUMBER(38, 18);
    v_name VARCHAR(100);
  BEGIN
    OPEN emp_cursor;
    FETCH emp_cursor INTO
      :v_emp_id,
      :v_name;
    CLOSE emp_cursor;
  END;
$$;
```

### Best Practices

* Replace cursor-based row-by-row processing with set-based SQL operations (SELECT, INSERT, UPDATE, DELETE) whenever possible.
* Use JOINs, subqueries, CTEs (Common Table Expressions), or window functions instead of cursors to process multiple rows efficiently.
* If cursors are unavoidable, minimize the work done inside the cursor loop and avoid nested cursors.
* Consider using MERGE statements for upsert operations instead of cursor-based conditional INSERT/UPDATE logic.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-PRF-0010

Partition by removed, at least one of the specified expressions have no iceberg partition transform equivalent

### Severity

None

### Description

Snowflake supports the PARTITION BY clause in Iceberg tables, however, only [Iceberg partition transforms](https://iceberg.apache.org/spec/#partition-transforms) are supported. When transforming paritioning into Iceberg tables, SnowConvert AI will generate the equivalent partition transforms for supported cases. When no partition transform equivalent can be generated for the partition expressions, the PARTITION BY will be removed from the table by commenting it out with this PRF.

This PRF is only generated when SnowConvert AI migrates tables into Iceberg tables using the [Tables translation](../../../getting-started/running-snowconvert/conversion/teradata-conversion-settings.md) conversion setting.

### Code examples

#### Input

```sql
 -- Additional Params: --TablesTransformationTarget SnowflakeIceberg
CREATE TABLE FINANCE.FINANCE_TABLE
(
  customerName VARCHAR(30),
  accountBalance VARCHAR(20)
)
PARTITION BY CASE_N(
accountBalance <  0 ,
accountBalance >=  0);
```

#### Output

```sql
 -- Additional Params: --TablesTransformationTarget SnowflakeIceberg
CREATE OR REPLACE ICEBERG TABLE FINANCE.FINANCE_TABLE
  (
 customerName VARCHAR,
 accountBalance VARCHAR
  )
  CATALOG = 'SNOWFLAKE'
--  --** SSC-PRF-0010 - PARTITION BY REMOVED, AT LEAST ONE OF THE SPECIFIED EXPRESSIONS HAVE NO ICEBERG PARTITION TRANSFORM EQUIVALENT **
--  PARTITION BY CASE_N(
--  accountBalance <  0 ,
--  accountBalance >=  0)
  COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 1,  "minor": 0,  "patch": "0.0" }, "attributes": {  "component": "teradata",  "convertedOn": "12/16/2025",  "domain": "no-domain-provided",  "migrationid": "9CebAVkM33qsfTnTrMh3Dw==" }}'
;
```

### Best Practices

* Analyze the impact of partitioning in the performance of queries over the generated Iceberg tables, if the difference is neglible then this PRF can be safely ignored.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - General Translation Specification
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/general/README.md
section: Migrations
---

# SnowConvert AI - General Translation Specification

Translation references are essential for understanding how SnowConvert AI translates SQL statements from Oracle, Teradata, SQL Server, and all other available SQL languages to Snowflake.

---
title: SnowConvert AI - Getting Started
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/README.md
section: Migrations
---

# SnowConvert AI - Getting Started

Everything you need to get started with SnowConvert.

Getting started with a migration can seem like a daunting task. SnowConvert AI is here to help do the heavy lifting by providing in-depth assessment information and conversion capabilities to accelerate your migration. You can get started down this road by working through this documentation prepared for you by the migration experts here at Snowflake.

Setup

* [Download and Access](download-and-access.md)
* [Installation](../user-guide/snowconvert/how-to-install-the-tool/README.md)

Using the Tool

* [User Guide](../user-guide/snowconvert/README.md) - How to use SnowConvert.
* [Training](https://learn.snowflake.com/en/courses/OD-SC-D/) - There are multiple training courses available from Snowflake. To get the most out of SnowConvert AI, it’s important to understand what it does and does not do.
* [Before you run SnowConvert AI](best-practices.md) - Some best practices to set you up for success.
* Conversion Guide - Executing a conversion is simple, but it is rare that a codebase will convert at 100%.
* [Conversion Quick Start](running-snowconvert/conversion/README.md)
* [Issue Resolution Guide](../technical-documentation/issues-and-troubleshooting/README.md) - You’ve converted, now what? SnowConvert AI tells you what it cannot convert, and helps point you in the right direction.

Time… to get started!

---
title: SnowConvert AI - Google BigQuery
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/google-bigquery.md
section: Migrations
---

# SnowConvert AI - Google BigQuery

## What is SnowConvert AI for Google BigQuery?

SnowConvert AI is a software tool that understands SQL Google BigQuery scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for Google BigQuery performs the following conversions:

### Google BigQuery to Snowflake SQL

SnowConvert AI recognizes the Google BigQuery source code and converts the different statements into the appropriate SQL for the Snowflake target.

### Sample code

#### Input Code:

```sql
CREATE TABLE IF NOT EXISTS your_project_id.my_dataset.product_catalog (
  product_sku STRING,
  stock_level INT64,
  unit_price FLOAT64
);
```

#### Output Code:

```sql
CREATE TABLE IF NOT EXISTS your_project_id.my_dataset.product_catalog (
  product_sku STRING,
  stock_level INT,
  unit_price FLOAT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "bigquery",  "convertedOn": "04/08/2025",  "domain": "test" }}';
```

As you can see, most of the structure remains the same, but some column properties have to be transformed into Snowflake equivalents. For more information please refer to Google [BigQuery Translation References documentation](../../../../translation-references/bigquery/README.md).

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your Google BigQuery files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

In the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Google BigQuery is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Greenplum - CREATE MATERIALIZED VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/create-materialized-view/greenplum-create-materialized-view.md
section: Migrations
---

# SnowConvert AI - Greenplum - CREATE MATERIALIZED VIEW

Translation from Greenplum to Snowflake

## Description

This section explains features exclusive to Greenplum.

For more information, please refer to [`CREATE MATERIALIZED VIEW`](https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/ref_guide-sql_commands-CREATE_MATERIALIZED_VIEW.html) in the documentation.

## Grammar Syntax

```sql
CREATE MATERIALIZED VIEW <table_name>
AS <query>
[
    DISTRIBUTED {
        BY <column> [<opclass>], [ ... ] | RANDOMLY | REPLICATED
        }
]
```

## DISTRIBUTED BY

> **Hint:**
>
> This syntax is translated to its most equivalent form in Snowflake.

The DISTRIBUTED BY clause in Greenplum controls how data is physically distributed across the system’s segments. Meanwhile, CLUSTER BY is a subset of columns in a dynamic table (or expressions on a dynamic table) explicitly designated to co-locate the data in the table in the same micro-partitions. While they operate at different architectural levels, they aim to improve query performance by distributing data efficiently.

### Grammar Syntax

```sql
DISTRIBUTED BY ( <column> [<opclass>] [, ... ] )
```

### Sample Source

Input Code:

#### Greenplum

```sql
CREATE MATERIALIZED VIEW product_summary AS
SELECT
    category,
    COUNT(*) AS total_products,
    MAX(price) AS max_price
FROM products
GROUP BY category
DISTRIBUTED BY (category);
```

Output Code:

##### Snowflake

```sql
CREATE OR REPLACE DYNAMIC TABLE product_summary
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
--** SSC-FDM-GP0001 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF DISTRIBUTED BY **
CLUSTER BY (category)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "04/24/2025",  "domain": "test" }}'
AS
    SELECT
    category,
    COUNT(*) AS total_products,
    MAX(price) AS max_price
FROM
    products
    GROUP BY category;
```

## DISTRIBUTED RANDOMLY - REPLICATED

> **Note:**
>
> This syntax is not needed in Snowflake.

The DISTRIBUTED REPLICATED or DISTRIBUTED RANDOMLY clause in Greenplum controls how data is physically distributed across the system’s segments. As Snowflake automatically handles data storage, these options will be removed in the migration.

## Related EWIs

1. [SSC-FDM-GP0001](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/greenplumFDM.md): The performance of the CLUSTER BY may vary compared to the performance of Distributed By.

---
title: SnowConvert AI - Greenplum - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/create-table/greenplum-create-table.md
section: Migrations
---

# SnowConvert AI - Greenplum - CREATE TABLE

Translation from Greenplum to Snowflake

## Description

This section explains features exclusive to Greenplum.

For more information, please refer to [`CREATE TABLE`](https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/ref_guide-sql_commands-CREATE_TABLE.html) the documentation.

## Grammar Syntax

```sql
CREATE TABLE <table_name> (
  [ <column_name> <data_type> [ ENCODING ( <storage_directive> [, ...] ) ]
] )
[ DISTRIBUTED BY ( <column> [<opclass>] [, ... ] )
    | DISTRIBUTED RANDOMLY
    | DISTRIBUTED REPLICATED ]
```

## ENCODING

> **Note:**
>
> This syntax is not needed in Snowflake.

The compression encoding for a column. In Snowflake, defining ENCODING is unnecessary because it automatically handles data compression, unlike Greenplum, which could set up the encoding manually. For this reason, the ENCODING statement is removed during migration.

### Grammar Syntax

```sql
ENCODING ( <storage_directive> [, ...] )
```

### Sample Source

#### Input Code:

##### Greenplum

```sql
CREATE TABLE TABLE1 (
   COL1 integer ENCODING (compresstype = quicklz, blocksize = 65536)
);
```

#### Output Code:

##### Snowflake

```sql
CREATE TABLE TABLE1 (
   COL1 integer
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "03/26/2025",  "domain": "test" }}'
;
```

## DISTRIBUTED BY

> **Hint:**
>
> This syntax is fully supported in Snowflake.

The DISTRIBUTED BY clause in Greenplum controls how table data is physically distributed across the system’s segments. Meanwhile, CLUSTER BY is a subset of columns in a table (or expressions on a table) that are explicitly designated to co-locate the data in the table in the same micro-partitions.

### Grammar Syntax

```sql
DISTRIBUTED BY ( <column> [<opclass>] [, ... ] )
```

### Sample Source Patterns

#### Input Code:

##### Greenplum

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
DISTRIBUTED BY (colum1, colum2);
```

#### Output Code:

##### Snowflake

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
--** SSC-FDM-GP0001 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF DISTRIBUTED BY **
CLUSTER BY (colum1, colum2)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "03/26/2025",  "domain": "test" }}'
;
```

## DISTRIBUTED RANDOMLY - REPLICATED

> **Note:**
>
> This syntax is not needed in Snowflake.

The DISTRIBUTED REPLICATED or DISTRIBUTED RANDOMLY clause in Greenplum controls how table data is physically distributed across the system’s segments. As Snowflake automatically handles data storage, these options will be removed in the migration.

### Grammar Syntax

```sql
DISTRIBUTED RANDOMLY | DISTRIBUTED REPLICATED
```

### Sample Source Patterns

#### Input Code:

##### Greenplum

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
DISTRIBUTED RANDOMLY;
```

#### Output Code:

##### Snowflake

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "03/26/2025",  "domain": "test" }}'
;
```

## Related EWIs

1. [SSC-FDM-GP0001](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/greenplumFDM.md): The performance of the CLUSTER BY may vary compared to the performance of Distributed By.

---
title: SnowConvert AI - Greenplum Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/greenplumFDM.md
section: Migrations
---

# SnowConvert AI - Greenplum Functional Differences

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Greenplum focuses its assessment and translation capabilities primarily on TABLES and VIEWS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

## SSC-FDM-GP0001

The performance of the CLUSTER BY may vary compared to the performance of Distributed By

### Description

The `DISTRIBUTED BY` in Greenplum is analogous to `CLUSTER BY` in Snowflake. However, performance implications may vary due to architectural differences between Greenplum and Snowflake.

* **`DISTRIBUTED BY`** controls the physical distribution of data across the nodes (segments) in Greenplum’s MPP architecture..
* **`CLUSTER BY`** in Snowflake organizes data into blocks based on designated columns, aiding in filtering and aggregation tasks.

Understanding these mechanisms is crucial for optimizing performance in each respective platform.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
DISTRIBUTED BY (colum1, colum2);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
--** SSC-FDM-GP0001 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF DISTRIBUTED BY **
CLUSTER BY (colum1, colum2)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "03/26/2025",  "domain": "test" }}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Hive - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/built-in-functions.md
section: Migrations
---

# SnowConvert AI - Hive - Built-in functions

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL
> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Built-in Functions

> This article provides an alphabetically-ordered list of built-in functions and operators in Databricks. ([Databricks SQL Language Reference Built-in functions](https://docs.databricks.com/aws/en/sql/language-manual/sql-ref-functions-builtin-alpha)).

| Spark SQL - Databricks SQL | Snowflake |
| --- | --- |
| ABS | ABS |
| ACOS | ACOS |
| ACOSH | ACOSH |
| ADD_MONTHS | ADD_MONTHS |
| ANY_VALUE | ANY_VALUE |
| ANY | BOOLOR_AGG |
| APPROX_COUNT_DISTINCT | APPROX_COUNT_DISTINCT |
| APPROX_PERCENTILE | APPROX_PERCENTILE |
| ARRAY_AGG | ARRAY_AGG |
| ARRAY_APPEND | ARRAY_APPEND |
| ARRAY_COMPACT | ARRAY_COMPACT |
| ARRAY_CONTAINS | ARRAY_CONTAINS |
| ARRAY_DISTINCT | ARRAY_DISTINCT |
| ARRAY_EXCEPT | ARRAY_EXCEPT |
| ARRAY_INSERT | ARRAY_INSERT_UDF  *Note: A User Defined Function is created to replicate the source behaviour.* |
| ARRAY_INTERSECT | ARRAY_INTERSECTION |
| ARRAY_JOIN | ARRAY_TO_STRING |
| ARRAY_MAX | ARRAY_MAX |
| ARRAY_MIN | ARRAY_MIN |
| ARRAY_POSITION(array, element) | ARRAY_POSITION(element, array)  *Note: Parameters are inverted.* |
| ARRAY_PREPEND | ARRAY_PREPEND |
| ARRAY_REMOVE | ARRAY_REMOVE |
| ARRAY_SIZE | ARRAY_SIZE |
| ARRAY | ARRAY_CONSTRUCT |
| ARRAYS_OVERLAP | ARRAYS_OVERLAP |
| ARRAYS_ZIP | ARRAYS_ZIP |
| ASCII | ASCII |
| ASIN | ASIN |
| ASINH | ASINH |
| ATAN | ATAN |
| ATAN2 | ATAN2 |
| ATANH | ATANH |
| AVG | AVG |
| BIT_COUNT | BITCOUNT |
| BIT_GET | GETBIT |
| BOOL_AND | BOOLAND_AGG |
| BOOL_OR | BOOLOR_AGG |
| BTRIM | TRIM |
| CBRT | CBRT |
| CEIL | CEIL |
| CEILING | CEIL |
| CHAR_LENGTH | LENGTH |
| CHARACTER_LENGTH | LENGTH |
| CHR | CHR |
| COALESCE | COALESCE |
| COLLECT_LIST | ARRAY_AGG |
| CONCAT_WS | CONCAT_WS_UDF  Note: A User Defined Function is created to emulate the source behaviour. |
| CONCAT | CONCAT |
| CONTAINS | CONTAINS |
| CORR | CORR |
| COS | COS |
| COSH | COSH |
| COT | COT |
| COUNT_IF | COUNT_IF |
| COUNT | COUNT |
| COVAR_POP | COVAR_POP |
| COVAR_SAMP | COVAR_SAMP |
| CUME_DIST | CUME_DIST |
| CURDATE | CURRENT_DATE |
| CURRENT_DATABASE | CURRENT_DATABASE |
| CURRENT_DATE | CURRENT_DATE |
| CURRENT_SCHEMA | CURRENT_SCHEMA |
| CURRENT_TIMESTAMP | CURRENT_TIMESTAMP |
| CURRENT_USER | CURRENT_USER |
| DATE_ADD | DATEADD |
| DATE_DIFF | DATEDIFF |
| DATE_TRUNC | DATE_TRUNC |
| DATE | DATE |
| DAY | DAY |
| DAYNAME | DAYNAME |
| DAYOFWEEK | DAYOFWEEK |
| DAYOFYEAR | DAYOFYEAR |
| DECODE | DECODE |
| DEGREES | DEGREES |
| DENSE_RANK | DENSE_RANK |
| ENDSWITH | ENDSWITH |
| EVERY | BOOLAND_AGG |
| EXP | EXP |
| FIRST_VALUE | FIRST_VALUE |
| FLOOR | FLOOR |
| GET | GET |
| GETBIT | GETBIT |
| GETDATE | CURRENT_TIMESTAMP |
| GREATEST | GREATEST |
| GROUPING | GROUPING |
| HASH | HASH |
| HEX | HEX_ENCODE |
| HLL_SKETCH_ESTIMATE | HLL_ESTIMATE |
| HOUR | HOUR |
| HOUR | HOUR |
| IF | IFF |
| IFF | IFF |
| IFNULL | IFNULL |
| INITCAP | INITCAP |
| KURTOSIS | KURTOSIS |
| LAG | LAG |
| LAST_DAY | LAST_DAY |
| LAST_DAY | LAST_DAY |
| LAST_VALUE | LAST_VALUE |
| LCASE | LOWER |
| LEAD | LEAD |
| LEAST | LEAST |
| LEFT | LEFT |
| LEN | LEN |
| LENGTH | LENGTH |
| LEVENSHTEIN | EDITDISTANCE |
| LISTAGG | LISTAGG |
| LN | LN |
| LOCATE | CHARINDEX |
| LOG | LOG |
| LOWER | LOWER |
| LPAD | LPAD |
| LTRIM | LTRIM |
| MAP_KEYS | OBJECT_KEYS |
| MAP(key, value, …) | OBJECT_CONSTRUCT(key, value, …)  *Note: The keys are casted to VARCHAR since Snowflake does not allow another type as keys.* |
| MAX_BY | MAX_BY |
| MAX | MAX |
| MD5 | MD5 |
| MEAN | AVG |
| MEDIAN | MEDIAN |
| MIN_BY | MIN_BY |
| MIN | MIN |
| MINUTE | MINUTE |
| MOD | MOD |
| MODE | MODE |
| MONTH | MONTH |
| MONTHS_BETWEEN | MONTHS_BETWEEN |
| NAMED_STRUCT | OBJECT_CONSTRUCT |
| NOW | CURRENT_TIMESTAMP |
| NTH_VALUE | NTH_VALUE |
| NTILE | NTILE |
| NULLIF | NULLIF |
| NULLIFZERO | NULLIFZERO |
| NVL | NVL |
| NVL2 | NVL2 |
| OCTET_LENGTH | OCTET_LENGTH |
| PARSE_JSON | PARSE_JSON |
| PERCENT_RANK | PERCENT_RANK |
| PERCENTILE_APPROX | APPROX_PERCENTILE |
| PERCENTILE_CONT | PERCENTILE_CONT |
| PERCENTILE_DISC | PERCENTILE_DISC |
| PI | PI |
| POSITION | POSITION |
| POW | POW |
| POWER | POWER |
| QUARTER | QUARTER |
| RADIANS | RADIANS |
| RANDOM | RANDOM |
| RANK | RANK |
| REGEXP_COUNT | REGEXP_COUNT |
| REGEXP_INSTR | REGEXP_INSTR |
| REGEXP_REPLACE | REGEXP_REPLACE |
| REGEXP_SUBSTR | REGEXP_SUBSTR |
| REGR_AVGX | REGR_AVGX |
| REGR_AVGY | REGR_AVGY |
| REGR_COUNT | REGR_COUNT |
| REGR_INTERCEPT | REGR_INTERCEPT |
| REGR_R2 | REGR_R2 |
| REGR_SLOPE | REGR_SLOPE |
| REGR_SXX | REGR_SXX |
| REGR_SXY | REGR_SXY |
| REGR_SYY | REGR_SYY |
| REPEAT | REPEAT |
| REPLACE | REPLACE |
| REVERSE | REVERSE |
| RIGHT | RIGHT |
| ROUND | ROUND |
| ROW_NUMBER | ROW_NUMBER |
| RPAD | RPAD |
| RTRIM | RTRIM |
| SECOND | SECOND |
| SESSION_USER | CURRENT_USER |
| SHA1 | SHA1 |
| SHA2 | SHA2 |
| SHIFTLEFT | BITSHIFTLEFT |
| SHIFTRIGHT | BITSHIFTRIGHT |
| SIGN | SIGN |
| SIGNUM | SIGN |
| SIN | SIN |
| SINH | SINH |
| SKEWNESS | SKEW |
| SOME | BOOLOR_AGG |
| SOUNDEX | SOUNDEX |
| SPACE | SPACE |
| SPLIT_PART | SPLIT_PART |
| SQRT | SQRT |
| STARTSWITH | STARTSWITH |
| STD | STDDEV_SAMP |
| STDDEV_POP | STDDEV_POP |
| STDDEV_SAMP | STDDEV_SAMP |
| STDDEV | STDDEV_SAMP |
| STRING | TO_VARCHAR |
| STRUCT | OBJECT_CONSTRUCT |
| SUBSTR | SUBSTR |
| SUBSTRING | SUBSTRING |
| SUM | SUM |
| TAN | TAN |
| TANH | TANH |
| TIMESTAMP | TO_TIMESTAMP |
| TO_CHAR | TO_CHAR |
| TO_DATE | TO_DATE |
| TO_NUMBER | TO_NUMBER |
| TO_TIMESTAMP | TO_TIMESTAMP |
| TO_VARCHAR | TO_VARCHAR |
| TRANSLATE | TRANSLATE |
| TRIM | TRIM |
| TRUNC | TRUNC |
| TRUNC | TRUNC |
| TRY_AVG | AVG |
| TRY_CAST | TRY_CAST |
| TRY_SUM | TRY_SUM |
| TRY_TO_NUMBER | TRY_TO_NUMBER |
| TRY_TO_TIMESTAMP | TRY_TO_TIMESTAMP |
| TYPEOF | TYPEOF |
| UCASE | UPPER |
| UPPER | UPPER |
| USER | CURRENT_USER |
| UUID | UUID_STRING |
| VAR_POP | VAR_POP |
| VAR_SAMP | VAR_SAMP |
| VARIANCE_POP | VARIANCE_POP |
| VARIANCE_SAMP | VARIANCE_SAMP |
| VARIANCE | VARIANCE |
| WIDTH_BUCKET | WIDTH_BUCKET |
| YEAR | YEAR |
| ZEROIFNULL | ZEROIFNULL |

---
title: SnowConvert AI - Hive - CREATE EXTERNAL TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/ddls/create-external-table.md
section: Migrations
---

# SnowConvert AI - Hive - CREATE EXTERNAL TABLE

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL

## Description

> External Tables defines a new table using a Data Source. ([Spark SQL Language Reference CREATE DATASOURCE TABLE](https://spark.apache.org/docs/latest/sql-ref-syntax-ddl-create-table-datasource.html))

```sql
CREATE TABLE [ IF NOT EXISTS ] table_identifier
[ ( col_name1 col_type1 [ COMMENT col_comment1 ], ... ) ]
USING data_source
[ OPTIONS ( key1=val1, key2=val2, ... ) ]
[ PARTITIONED BY ( col_name1, col_name2, ... ) ]
[ CLUSTERED BY ( col_name3, col_name4, ... )
    [ SORTED BY ( col_name [ ASC | DESC ], ... ) ]
    INTO num_buckets BUCKETS ]
[ LOCATION path ]
[ COMMENT table_comment ]
[ TBLPROPERTIES ( key1=val1, key2=val2, ... ) ]
[ AS select_statement ]
```

The CREATE EXTERNAL TABLE statement from Spark/Databricks will be transformed to a CREATE EXTERNAL TABLE statement from [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-external-table); however, this transformation requires user intervention.

To complete the transformation performed by SnowConvert AI, it is necessary to define a [Storage Integration](https://docs.snowflake.com/en/sql-reference/sql/create-storage-integration), an [External Stage](https://docs.snowflake.com/en/sql-reference/sql/create-stage), and (optionally) a [Notification Integration](https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration) that have access to the external source where files are located. Please refer to the following guides on how to set up the connection for each provider:

* [For external tables referencing Amazon S3](https://docs.snowflake.com/en/user-guide/tables-external-s3)
* [For external tables referencing Google Cloud Storage](https://docs.snowflake.com/en/user-guide/tables-external-gcs)
* [For external tables referencing Azure Blob Storage](https://docs.snowflake.com/en/user-guide/tables-external-azure)

Important considerations for the transformations shown on this page:

* The @EXTERNAL_STAGE placeholder must be replaced with the external stage created after following the previous guide.
* It is assumed that the external stage will point to the root of the bucket. This is important to consider because the PATTERN clause generated for each table specifies the file/folder paths starting at the base of the bucket, defining the external stage pointing to a different location in the bucket might produce undesired behavior.
* The `AUTO_REFRESH = FALSE` clause is generated to avoid errors. Please note that automatic refresh of external table metadata is only valid if your Snowflake account cloud provider and the bucket provider are the same, and a Notification Integration was created.

## Sample Source Patterns

### Create External Table with explicit column list

When the column list is provided, SnowConvert AI will automatically generate the AS expression column options for each column to extract the file values.

#### Input Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS external_table
(
  order_id int,
  date string,
  client_name string,
  total float
)
USING AVRO
LOCATION 'gs://sc_external_table_bucket/folder_with_avro/orders.avro';
```

#### Output Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS external_table
(
  order_id int AS CAST(GET_IGNORE_CASE($1, 'order_id') AS int),
  date string AS CAST(GET_IGNORE_CASE($1, 'date') AS string),
  client_name string AS CAST(GET_IGNORE_CASE($1, 'client_name') AS string),
  total float AS CAST(GET_IGNORE_CASE($1, 'total') AS float)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs:, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
FILE_FORMAT = (TYPE = AVRO)
PATTERN = '/sc_external_table_bucket/folder_with_avro/orders.avro'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "spark",  "convertedOn": "06/18/2025",  "domain": "no-domain-provided" }}';
```

### CREATE EXTERNAL TABLE without an explicit column list

When the column list is not provided, Spark automatically detects the schema of the columns from the file structure. To replicate this behavior, SnowConvert AI will generate a USING TEMPLATE clause that makes use of the [INFER_SCHEMA](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) function to generate the column definitions.

Since the INFER_SCHEMA function requires a file format to work, SnowConvert AI will generate a temporary file format for this purpose. This file format is only required when running the CREATE EXTERNAL TABLE statement, and it will be automatically dropped when the session ends.

#### Input Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS external_table_No_Columns
using AVRO
LOCATION 'gs://sc_external_table_bucket/folder_with_avro/orders.avro';
```

#### Output Code:

```sql
CREATE OR REPLACE TEMPORARY FILE FORMAT SC_HIVE_FORMAT_ORDERS_NO_COLUMNS_FORMAT
TYPE = AVRO;
CREATE EXTERNAL TABLE IF NOT EXISTS hive_format_orders_No_Columns USING TEMPLATE (
SELECT
  ARRAY_AGG(OBJECT_CONSTRUCT('COLUMN_NAME', COLUMN_NAME, 'TYPE', TYPE, 'NULLABLE', NULLABLE, 'EXPRESSION', EXPRESSION))
FROM
  --** SSC-FDM-0035 - THE INFER_SCHEMA FUNCTION REQUIRES A FILE PATH WITHOUT WILDCARDS TO GENERATE THE TABLE TEMPLATE, REPLACE THE FILE_PATH PLACEHOLDER WITH IT **
  TABLE(INFER_SCHEMA(LOCATION => '@EXTERNAL_STAGE/FILE_PATH', FILE_FORMAT => 'SC_HIVE_FORMAT_ORDERS_NO_COLUMNS_FORMAT'))
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs:, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
FILE_FORMAT = (TYPE = AVRO)
PATTERN = '/sc_external_table_bucket/folder_with_avro/orders.avro'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "spark",  "convertedOn": "06/18/2025",  "domain": "no-domain-provided" }}';
```

### CREATE EXTERNAL TABLE using Hive format

The creation of External Tables using [Hive Format](https://spark.apache.org/docs/latest/sql-ref-syntax-ddl-create-table-hiveformat.html) is also supported. They will have an FDM added informing the user that inserting into those tables is not supported.

#### Input Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS External_table_hive_format
(
  order_id int,
  date string,
  client_name string,
  total float
)
stored as AVRO
LOCATION 'gs://sc_external_table_bucket/folder_with_avro/orders.avro';
```

#### Output Code:

```sql
--** SSC-FDM-HV0001 - INSERTING VALUES INTO AN EXTERNAL TABLE IS NOT SUPPORTED IN SNOWFLAKE **
CREATE EXTERNAL TABLE IF NOT EXISTS hive_format_orders_Andres
(
  order_id int AS CAST(GET_IGNORE_CASE($1, 'order_id') AS int),
  date string AS CAST(GET_IGNORE_CASE($1, 'date') AS string),
  client_name string AS CAST(GET_IGNORE_CASE($1, 'client_name') AS string),
  total float AS CAST(GET_IGNORE_CASE($1, 'total') AS float)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs:, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
FILE_FORMAT = (TYPE = AVRO)
PATTERN = '/sc_external_table_bucket/folder_with_avro/orders.avro'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "spark",  "convertedOn": "06/18/2025",  "domain": "no-domain-provided" }}';
```

## Known Issues

**1. External tables with unsupported file formats**

Snowflake supports the following Spark formats:

* CSV
* PARQUET
* ORC
* XML
* JSON
* AVRO

Other formats will be marked as not supported.

**2. Unsupported table options**

Some table options are not supported by SnowConvert AI and are marked with an EWI.

## Input Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS hive_format_orders_Andres
(
  order_id int,
  date string,
  client_name string,
  total float
)
using AVRO
LOCATION 'gs://sc_external_table_bucket/folder_with_avro/orders.avro'
Tblproperties (
    'unsupported_table_option' = 'value'
);
```

## Output Code:

```sql
CREATE EXTERNAL TABLE IF NOT EXISTS hive_format_orders_Andres
(
  order_id int AS CAST(GET_IGNORE_CASE($1, 'order_id') AS int),
  date string AS CAST(GET_IGNORE_CASE($1, 'date') AS string),
  client_name string AS CAST(GET_IGNORE_CASE($1, 'client_name') AS string),
  total float AS CAST(GET_IGNORE_CASE($1, 'total') AS float)
)
    !!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs:, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
    LOCATION = @EXTERNAL_STAGE
    AUTO_REFRESH = false
    PATTERN = '/sc_external_table_bucket/folder_with_avro/orders.avro'
    FILE_FORMAT = (TYPE = AVRO)
    !!!RESOLVE EWI!!! /*** SSC-EWI-0016 - SNOWFLAKE DOES NOT SUPPORT THE OPTIONS: 'UNSUPPORTED_TABLE_OPTION'. ***/!!!
    TBLPROPERTIES (
  'unsupported_table_option' = 'value'
    )
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "spark",  "convertedOn": "06/19/2025",  "domain": "no-domain-provided" }}';
```

## Related EWIs

1. [SSC-EWI-0029](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): External table data format not supported in Snowflake
2. [SSC-EWI-0032](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): External table requires an external stage to access an external location, define and replace the EXTERNAL_STAGE placeholder
3. [SSC-FDM-0034](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): The INFER_SCHEMA function requires a file path without wildcards to generate the table template, replace the FILE_PATH placeholder with it
4. [SSC-EWI-0016](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Snowflake does not support the options clause.
5. [SSC-FDM-HV0001](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/hiveFDM.md): Inserting values into an external table is not supported in Snowflake.

---
title: SnowConvert AI - Hive - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/ddls/tables.md
section: Migrations
---

# SnowConvert AI - Hive - CREATE TABLE

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL

## Description

Creates a new table in the current database. You define a list of columns, which each hold data of a distinct type. The owner of the table is the issuer of the CREATE TABLE command.

For more information, please refer to [`CREATE TABLE`](https://spark.apache.org/docs/3.5.3/sql-ref-syntax-ddl-create-table.html) documentation.

## Grammar Syntax

```sql
--DATASOURCE TABLE
CREATE TABLE [ IF NOT EXISTS ] table_identifier
    [ ( col_name1 col_type1 [ COMMENT col_comment1 ], ... ) ]
    USING data_source
    [ OPTIONS ( key1=val1, key2=val2, ... ) ]
    [ PARTITIONED BY ( col_name1, col_name2, ... ) ]
    [ CLUSTERED BY ( col_name3, col_name4, ... )
        [ SORTED BY ( col_name [ ASC | DESC ], ... ) ]
        INTO num_buckets BUCKETS ]
    [ LOCATION path ]
    [ COMMENT table_comment ]
    [ TBLPROPERTIES ( key1=val1, key2=val2, ... ) ]
    [ AS select_statement ]

--HIVE FORMAT TABLE
CREATE [ EXTERNAL ] TABLE [ IF NOT EXISTS ] table_identifier
    [ ( col_name1[:] col_type1 [ COMMENT col_comment1 ], ... ) ]
    [ COMMENT table_comment ]
    [ PARTITIONED BY ( col_name2[:] col_type2 [ COMMENT col_comment2 ], ... )
        | ( col_name1, col_name2, ... ) ]
    [ CLUSTERED BY ( col_name1, col_name2, ...)
        [ SORTED BY ( col_name1 [ ASC | DESC ], col_name2 [ ASC | DESC ], ... ) ]
        INTO num_buckets BUCKETS ]
    [ ROW FORMAT row_format ]
    [ STORED AS file_format ]
    [ LOCATION path ]
    [ TBLPROPERTIES ( key1=val1, key2=val2, ... ) ]
    [ AS select_statement ]

--LIKE TABLE
CREATE TABLE [IF NOT EXISTS] table_identifier LIKE source_table_identifier
    USING data_source
    [ ROW FORMAT row_format ]
    [ STORED AS file_format ]
    [ TBLPROPERTIES ( key1=val1, key2=val2, ... ) ]
    [ LOCATION path ]
```

## IF NOT EXISTS

## Description

> Ensures the table is created only if it does not already exist, preventing duplication and errors in your SQL script.

> **Hint:**
>
> This syntax is fully supported in Snowflake.

## Applies to

* Hive
* Spark
* Databricks

## Grammar Syntax

```sql
IF NOT EXISTS
```

## Sample Source Patterns

## Input Code:

```sql
CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
);
```

## Output Code:

```sql
CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2024" }}';
```

## PARTITION BY

## Description

> Partitions are created on the table, based on the columns specified.

This syntax is not needed in Snowflake.

## Applies to

* Hive
* Spark
* Databricks

## Grammar Syntax

```sql
PARTITIONED BY ( { partition_column [ column_type ] } [, ...] )
```

## Sample Source Patterns

## Input Code:

```sql
CREATE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    total_amount DECIMAL(10, 2),
    order_status STRING
)
PARTITIONED BY (order_status);
```

## Output Code:

```sql
CREATE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    total_amount DECIMAL(10, 2),
    order_status STRING
);
```

## CLUSTERED BY

## Description

> Partitions created on the table will be bucketed into fixed buckets based on the column specified for bucketing.

This grammar is partially supported

## Applies to

* Hive
* Spark
* Databricks

## Grammar Syntax

```sql
CLUSTERED BY (column_name1 [ASC|DESC], ...)
[SORTED BY (sort_column1 [ASC|DESC], ...)]
INTO num_buckets BUCKETS
```

* The **`CLUSTERED BY`** clause, used for performance optimization, will be converted to **`CLUSTER BY`** in Snowflake. Performance may vary between the two architectures.
* The **`SORTED BY`** clause can be removed during migration, as Snowflake automatically handles data sorting within its micro-partitions.
* The **`INTO BUCKETS`** clause, a SparkSQL/Databrick specific partitioning setting, should be entirely eliminated, as it’s not applicable in Snowflake.

## Sample Source Patterns

## Input Code:

```sql
CREATE TABLE table_name (
column1 data_type, column2 data_type, ... ) USING format CLUSTERED BY (bucketing_column1) SORTED BY (sorting_column1 DESC, sorting_column2 ASC) INTO 10 BUCKETS;
```

## Output Code:

```sql
CREATE TABLE table_name ( column1 data_type, column2 data_type, ... ) USING format
CLUSTER BY (bucketing_column1);
```

## ROW FORMAT

## Description

> Specifies the row format for input and output.

This grammar is not supported in Snowflake

## Applies to

* Hive
* Spark
* Databricks

## Grammar Syntax

```sql
ROW FORMAT fow_format

row_format:
   { SERDE serde_class [ WITH SERDEPROPERTIES (serde_key = serde_val [, ...] ) ] |
     { DELIMITED [ FIELDS TERMINATED BY fields_terminated_char [ ESCAPED BY escaped_char ] ]
       [ COLLECTION ITEMS TERMINATED BY collection_items_terminated_char ]
       [ MAP KEYS TERMINATED BY map_key_terminated_char ]
       [ LINES TERMINATED BY row_terminated_char ]
       [ NULL DEFINED AS null_char ] } }
```

## Sample Source Patterns

## Input Code:

```sql
CREATE TABLE parquet_table ( id INT, data STRING )  STORED AS TEXTFILE LOCATION '/mnt/delimited/target' ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' ESCAPED BY '\\' COLLECTION ITEMS TERMINATED BY ';' MAP KEYS TERMINATED BY ':' LINES TERMINATED BY '\n' NULL DEFINED AS 'NULL_VALUE';
```

## Output Code:

```sql
CREATE TABLE delimited_like_delta LIKE source_delta_table STORED AS TEXTFILE LOCATION '/mnt/delimited/target'
!!!RESOLVE EWI!!! /*** SSC-EWI-HV0002 - THE ROW FORMAT CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!! ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' ESCAPED BY '\\' COLLECTION ITEMS TERMINATED BY ';' MAP KEYS TERMINATED BY ':' LINES TERMINATED BY '\n' NULL DEFINED AS 'NULL_VALUE';
```

## STORED AS

## Description

> File format for table storage.

This grammar is not supported in Snowflake

## Applies to

* Hive
* Spark
* Databricks

---
title: SnowConvert AI - Hive - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/ddls/create-view.md
section: Migrations
---

# SnowConvert AI - Hive - CREATE VIEW

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL
> **Warning:**
>
> This grammar is partially supported in Snowflake. Translation pending for these CREATE VIEW elements:

```sql
[ [ GLOBAL ] TEMPORARY ]
[ TBLPROPERTIES ( property_name = property_value [ , ... ] ) ]
```

## Description

> Views are based on the result-set of an `SQL` query. `CREATE VIEW` constructs a virtual table that has no physical data therefore other operations like `ALTER VIEW` and `DROP VIEW` only change metadata. ([Spark SQL Language Reference CREATE VIEW](https://spark.apache.org/docs/latest/sql-ref-syntax-ddl-create-view.html))

## Grammar Syntax

```sql
CREATE [ OR REPLACE ] [ [ GLOBAL ] TEMPORARY ] VIEW [ IF NOT EXISTS ] view_identifier
    create_view_clauses AS query

create_view_clauses :=
[ ( column_name [ COMMENT column_comment ], ... ) ]
[ COMMENT view_comment ]
[ TBLPROPERTIES ( property_name = property_value [ , ... ] ) ]
```

## Sample Source Patterns

### COMMENT clause

#### Input Code:

```sql
CREATE VIEW my_view
COMMENT 'This view selects specific columns from person'
AS
SELECT
   name,
   age,
   address
FROM
   person;
```

#### Output Code:

```sql
CREATE VIEW my_view
COMMENT = '{ "Description": "This view selects specific columns from person", "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "databricks",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
   name,
   age,
   address
FROM
   person;
```

### OR REPLACE

> **Note:**
>
> This clause is fully supported in Snowflake

### TEMPORARY (non-GLOBAL) VIEW

> **Note:**
>
> This clause is fully supported in Snowflake

### IF NOT EXISTS

> **Note:**
>
> This clause is fully supported in Snowflake

### Columns list

> **Note:**
>
> This clause is fully supported in Snowflake

---
title: SnowConvert AI - Hive - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/data-types.md
section: Migrations
---

# SnowConvert AI - Hive - Data Types

Snowflake supports most basic SQL data types (with some restrictions) for
columns, local variables, expressions, parameters, and other
appropriate/suitable locations.

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL

## Exact and approximate numerics

| SparkSQL-DatabricksSQL | Snowflake | Notes |
| --- | --- | --- |
| TINYINT, SHORT | SMALLINT | ​Snowflake's SMALLINT has a larger range (-32768 to +32767) than Spark's TINYINT (-128 to +127). This should generally be a safe transformation. |
| SMALLINT | SMALLINT | Direct equivalent in terms of range. |
| INT, INTEGER | INT, INTEGER | ​Direct equivalent in terms of range. |
| BIGINT | BIGINT​ | Direct equivalent in terms of range. |
| DECIMAL(p, s)​ | NUMBER(p, s) | Snowflake's NUMBER(p, s) is the direct equivalent for fixed-precision and scale numbers. p is the precision (total number of digits) and s is the scale (number of digits to the right of the decimal point). |
| NUMERIC(p, s) | NUMBER(p, s) | Synonym for DECIMAL(p, s), maps directly to Snowflake's NUMBER(p, s). |
| FLOAT | FLOAT | Direct equivalent in terms of range. |
| DOUBLE, DOUBLE PRECISION | DOUBLE | Generally a good equivalent for double-precision floating-point numbers. |
| REAL | REAL | If REAL in your Spark context is strictly single-precision, be mindful of potential precision differences. |

## Date and time

| Hive-Spark-Databricks SQL | Snowflake | Notes |
| --- | --- | --- |
| DATE | DATE | Direct equivalent for storing calendar dates (year, month, day). |
| TIMESTAMP | TIMESTAMP_NTZ | Snowflake offers several timestamp variations. TIMESTAMP_NTZ (no time zone) is often the best general equivalent if your Spark TIMESTAMP doesn’t have specific time zone information tied to the data itself. |

## Character strings

| Hive-Spark-Databricks SQL | Snowflake | Notes |
| --- | --- | --- |
| STRING | VARCHAR | ​Snowflake’s VARCHAR is the most common and flexible string type. It can store variable-length strings. |
| VARCHAR(n)​ | VARCHAR(n) | Direct equivalent for variable-length strings with a maximum length. |
| CHAR(n) | CHAR(n) | Direct equivalent for fixed-length strings. |

## Binary strings

| Hive-Spark-Databricks SQL | Snowflake | Notes |
| --- | --- | --- |
| BINARY | ​BINARY | Direct equivalent for storing raw byte sequences. |

## Boolean type

| Hive-Spark-Databricks SQL | Snowflake | Notes |
| --- | --- | --- |
| BOOLEAN, BOOL | ​BOOLEAN | Direct equivalent for storing boolean (TRUE/FALSE) values. |

## Complex type

| Hive-Spark-Databricks SQL | Snowflake | Notes |
| --- | --- | --- |
| ARRAY<DataType> | ​ARRAY | Snowflake’s ARRAY type can store ordered lists of elements of a specified data type. The dataType within the array should also be mapped accordingly. |
| MAP<keyType, valueType> | VARIANT |  |
| STRUCT<name: dataType, …> | VARIANT |  |
| INTERVAL | VARCHAR(30) | INTERVAL data type is **not supported** in Snowflake. VARCHAR is used instead. With the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md), maps to native Snowflake INTERVAL types. See [Interval Data Types](../general/interval-data-types.md). |

---
title: SnowConvert AI - Hive - SELECT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/ddls/select.md
section: Migrations
---

# SnowConvert AI - Hive - SELECT

Applies to

* Hive SQL
* Spark SQL
* Databricks SQL

## Description

Spark supports a `SELECT` statement and conforms to the ANSI SQL standard. Queries are used to retrieve result sets from one or more tables. ([Spark SQL Language Reference SELECT](https://spark.apache.org/docs/latest/sql-ref-syntax-qry-select.html))

> **Warning:**
>
> This grammar is partially supported in Snowflake. Translation pending for these CREATE VIEW elements:

```sql
[ SORT BY { expression [ ASC | DESC ] [ NULLS { FIRST | LAST } ] [ , ... ] } ]
[ CLUSTER BY { expression [ , ... ] } ]
[ DISTRIBUTE BY { expression [, ... ] } ]
[ WINDOW { named_window [ , WINDOW named_window, ... ] } ]
[ PIVOT clause ]
[ UNPIVOT clause ]
[ LATERAL VIEW clause ] [ ... ]
[ regex_column_names ]
[ TRANSFORM (...) ]
[ LIMIT non_literal_expression ]

from_item :=
join_relation
table_value_function
LATERAL(subquery)
file_format.`file_path`

select_statement { INTERSECT | EXCEPT } { ALL | DISTINCT } select_statement
```

## Grammar Syntax

```sql
[ WITH with_query [ , ... ] ]
select_statement [ { UNION | INTERSECT | EXCEPT } [ ALL | DISTINCT ] select_statement, ... ]
    [ ORDER BY { expression [ ASC | DESC ] [ NULLS { FIRST | LAST } ] [ , ... ] } ]
    [ SORT BY { expression [ ASC | DESC ] [ NULLS { FIRST | LAST } ] [ , ... ] } ]
    [ CLUSTER BY { expression [ , ... ] } ]
    [ DISTRIBUTE BY { expression [, ... ] } ]
    [ WINDOW { named_window [ , WINDOW named_window, ... ] } ]
    [ LIMIT { ALL | expression } ]

select_statement :=
SELECT [ hints , ... ] [ ALL | DISTINCT ] { [ [ named_expression | regex_column_names ] [ , ... ] | TRANSFORM (...) ] }
    FROM { from_item [ , ... ] }
    [ PIVOT clause ]
    [ UNPIVOT clause ]
    [ LATERAL VIEW clause ] [ ... ]
    [ WHERE boolean_expression ]
    [ GROUP BY expression [ , ... ] ]
    [ HAVING boolean_expression ]

with_query :=
expression_name [ ( column_name [ , ... ] ) ] [ AS ] ( query )

from_item :=
table_relation |
join_relation |
table_value_function |
inline_table |
LATERAL(subquery) |
file_format.`file_path`
```

## Sample Source Patterns

### GROUP BY

The `WITH { CUBE | ROLLUP }` syntax is transformed to its `CUBE(expr1, ...)` or `ROLLUP(expr1, ...)` equivalent

#### Input Code:

```sql
-- Basic case of GROUP BY
SELECT id, sum(quantity) FROM dealer GROUP BY 1;

-- Grouping by GROUPING SETS
SELECT city, car_model, sum(quantity) AS sum FROM dealer
    GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), ());

-- Grouping by ROLLUP
SELECT city, car_model, sum(quantity) AS sum FROM dealer
    GROUP BY ROLLUP(city, car_model);

SELECT city, car_model, sum(quantity) AS sum FROM dealer
    GROUP BY city, car_model WITH ROLLUP;

-- Grouping by CUBE
SELECT city, car_model, sum(quantity) AS sum FROM dealer
    GROUP BY CUBE(city, car_model);

SELECT city, car_model, sum(quantity) AS sum FROM dealer
    GROUP BY city, car_model WITH CUBE;
```

#### Output Code:

```sql
-- Basic case of GROUP BY
SELECT id,
    SUM(quantity) FROM
    dealer
GROUP BY 1;

-- Grouping by GROUPING SETS
SELECT city, car_model,
    SUM(quantity) AS sum FROM
    dealer
    GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), () !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'EmptyGroupingSet' NODE ***/!!!);

-- Grouping by ROLLUP
SELECT city, car_model,
    SUM(quantity) AS sum FROM
    dealer
    GROUP BY
    ROLLUP(city, car_model);

SELECT city, car_model,
    SUM(quantity) AS sum FROM
    dealer
GROUP BY
    ROLLUP(city, car_model);

-- Grouping by CUBE
SELECT city, car_model,
    SUM(quantity) AS sum FROM
    dealer
    GROUP BY CUBE(city, car_model) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'CUBE' NODE ***/!!!;

SELECT city, car_model,
    SUM(quantity) AS sum FROM
    dealer
GROUP BY
    CUBE(city, car_model);
```

### Hints

Snowflake performs automatic optimization of JOINs and partitioning, meaning that hints are unnecessary, they are preserved as comments in the output code.

#### Input Code:

```sql
SELECT
/*+ REBALANCE */ /*+ COALESCE(2) */
*
FROM my_table;
```

#### Output Code:

```sql
SELECT
/*+ REBALANCE */ /*+ COALESCE(2) */
*
FROM
my_table;
```

### CTE

The `AS` keyword is optional in Spark/Databricks, however in Snowflake is required so it is added.

#### Input Code:

```sql
WITH my_cte (
   SELECT id, name FROM my_table
)
SELECT *
FROM my_cte
WHERE id = 1;
```

#### Output Code:

```sql
WITH my_cte AS (
     SELECT id, name FROM
        my_table
  )
SELECT *
FROM
     my_cte
WHERE id = 1;
```

### LIMIT

`LIMIT ALL` is removed as it is not needed in Snowflake, LIMIT with a literal value is preserved as-is.

#### Input Code:

```sql
SELECT * FROM my_table LIMIT ALL;

SELECT * FROM my_table LIMIT 5;
```

#### Output Code:

```sql
SELECT * FROM
my_table;

SELECT * FROM
my_table
LIMIT 5;
```

### ORDER BY

> **Note:**
>
> This clause is fully supported in Snowflake

### WHERE

> **Note:**
>
> This clause is fully supported in Snowflake

### HAVING

> **Note:**
>
> This clause is fully supported in Snowflake

### FROM table_relation

> **Note:**
>
> This clause is fully supported in Snowflake

### FROM inline_table

> **Note:**
>
> This clause is fully supported in Snowflake

### UNION [ALL | DISTINCT]

> **Note:**
>
> This clause is fully supported in Snowflake

### INTERSECT (no keywords)

> **Note:**
>
> This clause is fully supported in Snowflake

### EXCEPT (no keywords)

> **Note:**
>
> This clause is fully supported in Snowflake

---
title: SnowConvert AI - Hive Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/hiveFDM.md
section: Migrations
---

# SnowConvert AI - Hive Functional Differences

## SSC-FDM-HV0001

Inserting values into an external table is not supported in Snowflake

### Description

Hive Format tables allow you to insert values, but Snowflake External Tables do not support value insertions. This means that while the table structure will be converted, any operations that attempt to insert data directly into the external table in Snowflake will fail.

### Code Example

#### Input

##### Spark

```sql
 CREATE EXTERNAL TABLE IF NOT EXISTS External_table_hive_format
(
  order_id int,
  date string,
  client_name string,
  total float
)
stored as AVRO
LOCATION 'gs://sc_external_table_bucket/folder_with_avro/orders.avro';
```

#### Output

##### Snowflake

```sql
 --** SSC-FDM-HV0001 - INSERTING VALUES INTO AN EXTERNAL TABLE IS NOT SUPPORTED IN SNOWFLAKE **
CREATE EXTERNAL TABLE IF NOT EXISTS hive_format_orders_Andres
(
  order_id int AS CAST(GET_IGNORE_CASE($1, 'order_id') AS int),
  date string AS CAST(GET_IGNORE_CASE($1, 'date') AS string),
  client_name string AS CAST(GET_IGNORE_CASE($1, 'client_name') AS string),
  total float AS CAST(GET_IGNORE_CASE($1, 'total') AS float)
)
!!!RESOLVE EWI!!! /*** SSC-EWI-0032 - EXTERNAL TABLE REQUIRES AN EXTERNAL STAGE TO ACCESS gs:, DEFINE AND REPLACE THE EXTERNAL_STAGE PLACEHOLDER ***/!!!
LOCATION = @EXTERNAL_STAGE
AUTO_REFRESH = false
FILE_FORMAT = (TYPE = AVRO)
PATTERN = '/sc_external_table_bucket/folder_with_avro/orders.avro'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "spark",  "convertedOn": "06/18/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-HV0002

Partitioned column added to table definition

### Description

For Hive/Spark partitioned tables, the partition columns are stored in the directory structure rather than in the table data. Snowflake does not support this pattern. SnowConvert AI adds the partitioned columns to the table definition as regular columns so the table schema is complete.

### Code Example

#### Input

##### Hive

```sql
 CREATE EXTERNAL TABLE sales_data
(
  product_id INT,
  amount DECIMAL(10,2)
)
PARTITIONED BY (sale_month STRING)
STORED AS PARQUET
LOCATION 's3://bucket/sales/';
```

#### Output

##### Snowflake

```sql
 CREATE EXTERNAL TABLE sales_data (
  product_id INT,
  amount DECIMAL(10,2),
  sale_month STRING
)
--** SSC-FDM-HV0002 - PARTITIONED COLUMN ADDED TO TABLE DEFINITION. **
LOCATION = @EXTERNAL_STAGE
FILE_FORMAT = (TYPE = PARQUET);
```

#### Best Practices

* Verify that partition columns are correctly mapped to your file path structure.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-HV0003

NULL format parameter is not supported in FROM_UNIXTIME

### Description

Hive’s FROM_UNIXTIME function allows a NULL format parameter, in which case it uses a default format. Snowflake’s equivalent (TO_VARCHAR with TO_TIMESTAMP_NTZ) does not support a NULL format parameter. SnowConvert AI passes the NULL through, but the conversion may fail at runtime or behave unexpectedly.

### Code Example

#### Input

##### Hive

```sql
 SELECT FROM_UNIXTIME(1697328000, CAST(NULL AS STRING));
```

#### Output

##### Snowflake

```sql
 SELECT
  --** SSC-FDM-HV0003 - NULL FORMAT PARAMETER IS NOT SUPPORTED IN FROM_UNIXTIME. **
  TO_VARCHAR(TO_TIMESTAMP_NTZ(1697328000), CAST(NULL AS STRING));
```

#### Best Practices

* Replace NULL format parameters with an explicit format string (e.g., ‘yyyy-MM-dd HH:mm:ss’).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-HV0004

INSTR transformed to REGEXP_INSTR changes literal to regex pattern

### Description

Hive’s INSTR function uses literal string matching. Snowflake does not have INSTR; SnowConvert AI translates it to REGEXP_INSTR. REGEXP_INSTR interprets the pattern as a regex, so metacharacters (e.g., `.`, `*`, `$`) will behave differently than in Hive’s literal matching.

### Code Example

#### Input

##### Hive

```sql
 SELECT INSTR('price: $10.99', pattern_col, 1, 1);
```

#### Output

##### Snowflake

```sql
 SELECT
  --** SSC-FDM-HV0004 - HIVE'S INSTR USES LITERAL STRING MATCHING, BUT REGEXP_INSTR INTERPRETS THE PATTERN AS A REGEX. METACHARACTERS WILL BEHAVE DIFFERENTLY. **
  REGEXP_INSTR('price: $10.99', pattern_col, 1, 1);
```

#### Best Practices

* When the pattern contains regex metacharacters, escape them or use REGEXP_REPLACE to sanitize the pattern.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Hive Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/hiveEWI.md
section: Migrations
---

# SnowConvert AI - Hive Issues

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Hive focuses its assessment and translation capabilities primarily on TABLES and VIEWS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

This page provides a comprehensive reference for how SnowConvert AI translates Hive grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

## SSC-EWI-HV0001

The ROW FORMAT clause is not supported in Snowflake

### Severity

Medium

#### Description

This EWI is added when a ROW FORMAT statement is encountered.

#### Code Example

**Input Code:**

##### Hive

```sql
 CREATE TABLE parquet_table (
id INT, data STRING
)
 ROW FORMAT DELIMITED FIELDS TERMINATED BY ',' ESCAPED BY '\\' COLLECTION ITEMS TERMINATED BY ';' MAP KEYS TERMINATED BY ':' LINES TERMINATED BY '\n' NULL DEFINED AS 'NULL_VALUE';
```

**Generated Code:**

##### Snowflake

```sql
 CREATE TABLE parquet_table (
 id INT,
 data STRING
)
!!!RESOLVE EWI!!! /*** SSC-EWI-HV0001 - THE ROW FORMAT CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ',' ESCAPED BY '\\'
COLLECTION ITEMS TERMINATED BY ';'
MAP KEYS TERMINATED BY ':'
LINES TERMINATED BY '\n'
NULL DEFINED AS 'NULL_VALUE'
;
```

---
title: SnowConvert AI - Hive-Spark-Databricks SQL
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/hive-spark-databricks-sql.md
section: Migrations
---

# SnowConvert AI - Hive-Spark-Databricks SQL

SnowConvert AI is a software tool that understands SQL scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI performs the following conversions:

### Hive-Spark- Databricks SQL to Snowflake SQL

SnowConvert AI understands the Hive- Spark - Databricks SQL source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake.

#### Sample code

Hive - Spark - Databricks SQL basic input code:

```sql
CREATE TABLE films (
  code        char(5) CONSTRAINT firstkey PRIMARY KEY,
  title       varchar(40) NOT NULL,
  did         integer NOT NULL,
  date_prod   date
);
```

Snowflake SQL output code:

```sql
CREATE TABLE films (
  code        char(5) CONSTRAINT firstkey PRIMARY KEY,
  title       varchar(40) NOT NULL,
  did         integer NOT NULL,
  date_prod   date
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "databricks",  "convertedOn": "04/24/2025",  "domain": "test" }}';
```

As you can see, most of the structure remains the same. For example, some cases require the data types to be transformed.

# SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI: the software that converts your Hive-Spark-Databricks SQL files securely and automatically to the Snowflake cloud data platform.*
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* Parsing is an initial process by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

On the following few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Hive -Spark - Databricks SQL is capable of. If you’re ready, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Hive-Spark-Databricks SQL
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/README.md
section: Migrations
---

# SnowConvert AI - Hive-Spark-Databricks SQL

> **Conversion Scope:**
>
> SnowConvert AI for Hive, Spark and Databricks SQL currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Hive, Spark and Databricks SQL grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - How to install the tool
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-install-the-tool/README.md
section: Migrations
---

# SnowConvert AI - How to install the tool

## Installation

Now that you’ve [downloaded](../../../getting-started/download-and-access.md) the tool, you can run the installer. Follow the steps below to get started using SnowConvert.

SnowConvert AI can be installed on either of the following operating systems:

* Windows 11 or later.
* MacOS 13.3 Ventura or later.

Follow the steps below to install the tool for your OS.

* [Windows Installation guide](windows.md)
* [MacOS Installation guide](macos.md)
* [Linux Installation guide](linux.md)

For Command Line Interface (CLI) refer to [SnowConvert AI CLI](../command-line-interface/README.md).

---
title: SnowConvert AI - How to Retrieve Your UUID for Offline Activation in Linux
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-request-an-access-code/how-to-retrieve-your-uuid-for-offline-activation-in-linux.md
section: Migrations
---

# SnowConvert AI - How to Retrieve Your UUID for Offline Activation in Linux

SnowConvert AI requires a **UUID** to validate the license for offline activation in Linux. Follow these steps to find and provide the UUID:

## **Step 1: Open a Terminal**

To begin, open a terminal on your Linux system by:

* Pressing **Ctrl + Alt + T**
* Searching for **“Terminal”** in your application menu

## **Step 2: Retrieve the UUID of the Root Device**

You need to obtain the **UUID** of your root device. Try the following commands in the given order until you get a valid UUID.

### **Option 1: Primary Command**

Run the following command first:

```bash
findmnt / -o UUID -n
```

If successful, it will return a UUID similar to this:

```none
5a14ccf7-6bac-47b7-a3d6-6c10822fb10d
```

### **Option 2: Alternative Command (If Option 1 Fails)**

If the first command does not return a UUID in a single line, try:

```bash
blkid -s UUID -o value $(findmnt -n -o SOURCE /)
```

Expected output:

```none
5a14ccf7-6bac-47b7-a3d6-6c10822fb10d
```

### **Option 3: Last Resort (If the Previous Commands Fail)**

If neither of the above commands work, use:

```bash
lsblk -nro UUID
```

This must return **only one UUID**. If multiple UUIDs are listed, this method **will not work**. Ensure that the output contains a **single valid UUID**, or try one of the previous methods again.

## **Step 3: Send the UUID**

Once you have retrieved the correct UUID, copy it and send it to the **SnowConvert AI support team** for activation.

---
title: SnowConvert AI - How to update the tool
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-update-the-tool.md
section: Migrations
---

# SnowConvert AI - How to update the tool

SnowConvert AI checks for updates automatically when you launch the application. But you can check any time if there is a new version available to be downloaded.

You can use the application’s menu to check for updates:

If there is an update available the system will prompt you an alert to download the latest version.

After the download is complete the installation of the new version will start.

Once the update process is finished you can check the updates again and verify that you are using the latest SnowConvert AI version.

Remember to check out our [Release Notes](../../release-notes/release-notes/README.md) page to stay tuned with our latest cool features.

---
title: SnowConvert AI - How to Use SnowConvert AI with Docker
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/others/using-snowconvert-in-a-ubuntu-docker-image.md
section: Migrations
---

# SnowConvert AI - How to Use SnowConvert AI with Docker

## Dependencies

The following dependencies must be installed on the machine:

* [Docker desktop](https://docs.docker.com/desktop/windows/install/)
* [Visual Code](https://code.visualstudio.com/download)
* [Docker Extension in Visual Code](https://marketplace.visualstudio.com/items?itemName=ms-azuretools.vscode-docker)

## Steps

### Create the image config file

Create a file called *“Dockerfile” (no extension)* with the following content. This configuration will be used to build the Docker image.

```bash
FROM ubuntu
COPY snowCli /dockerDestinationFolder
ENV DOTNET_SYSTEM_GLOBALIZATION_INVARIANT=1
RUN apt-get update
RUN apt-get install -y ca-certificates openssl
```

When using the [Ubuntu](https://hub.docker.com/_/ubuntu) image to run the SnowConvert AI CLI for Linux a couple of dependencies must be added to the Dockerfile in order to activate the license, for this purpose [System.Globalization.Invariant](https://docs.microsoft.com/en-us/dotnet/core/run-time-config/globalization) must be turned ON and the OpenSSL must be installed to be able to establish an HTTPS connection for the license validation.

In addition to the dependencies installation, the second line (`COPY` command) is used to copy files from the local machine inside the image. In this case, the *snowCLI* file (located in the same folder as the Dockerfile) will be copied to`/dockerDestinationFolder`inside the image.

### Build the image

Launch Docker Desktop app.

Open Visual Code where the “*Dockerfile”* is located. If you have previously installed the Docker extension for Visual Code, the *“Dockerfile”* will be automatically recognized as a docker configuration file by Visual Code. Right-click on the “Dockerfile” and hit *“Build image…”*

This will prompt for a name to give the image, at the top of Visual Code.

Use any name you want and hit “*Enter”.* That causes Docker to set up the container, by pulling the Ubuntu image, installing dependencies, copying the specified files. Wait for the terminal to finish. Once you see a message like this one, it means the image was successfully built.

```bash
> Executing task: docker build --pull --rm -f "Dockerfile" -t release:Ubuntu "." <

[+] Building 2.0s (11/11) FINISHED                                                                                           0.0s

```

### Run the image

Go to Docker Desktop in the Images tab, and hit run on the recently created image.

Go back to Visual Code, and go to the Docker tab. You should see, under *Containers* the image that was just run. You can expand it and explore the file directory.

### Connect to the container

Finally, if you right-click on the running container and hit *“Attach shell”* you will be able to connect to the container in the Terminal and use all your favorite commands.

You should see your personal files here that were specified to be copied by the COPY command in the configuration file.

---
title: SnowConvert AI - IBM DB2
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/ibm-db2.md
section: Migrations
---

# SnowConvert AI - IBM DB2

## What is SnowConvert AI for IBM DB2?

SnowConvert AI is a software tool that understands SQL IBM DB2 scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for IBM DB2 performs the following conversions:

### IBM DB2 to Snowflake SQL

SnowConvert AI recognizes the IBM DB2 source code and converts the different statements into the appropriate SQL for the Snowflake target.

### Sample code

#### Input Code:

```sql
CREATE TABLE IF NOT EXISTS your_project_id.my_dataset.product_catalog (
  product_ID INT,
  stock_level BLOB
)
;
```

#### Output Code:

```sql
CREATE TABLE IF NOT EXISTS your_project_id.my_dataset.product_catalog (
  product_ID INT,
  stock_level BINARY
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/02/2025",  "domain": "no-domain-provided" }}'
;
```

As you can see, most of the structure remains the same, but some column properties have to be transformed into Snowflake equivalents. For more information please refer to [IBM DB2 Translation References documentation](../../../../translation-references/db2/README.md).

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your IBM DB2 files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

In the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for IBM DB2 is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - IBM DB2 - CONTINUE HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-continue-handler.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CONTINUE HANDLER

## Description

> A CONTINUE handler allows the execution to continue after a condition is encountered. When a condition occurs and a continue handler is invoked, control is passed to the handler. When the handler completes, control returns to the statement following the statement that raised the condition.

In IBM DB2, the `DECLARE CONTINUE HANDLER` statement is used to define actions that should be taken when specific SQL conditions or errors occur during procedure execution, while allowing the procedure to continue running.

When migrating from DB2 to Snowflake, SnowConvert AI transforms CONTINUE HANDLER declarations into equivalent Snowflake Scripting exception handling using EXCEPTION blocks with appropriate logic to continue execution.

For more information about DB2 condition handlers, see [IBM DB2 DECLARE HANDLER](https://www.ibm.com/docs/en/db2/11.5?topic=statements-declare-handler).

## Grammar Syntax

```sql
DECLARE CONTINUE HANDLER FOR condition_value [, ...]
  handler_action_statement;

-- Where condition_value can be:
-- SQLSTATE [VALUE] sqlstate_value
-- condition_name
-- SQLWARNING
-- SQLEXCEPTION
-- NOT FOUND
```

## Sample Source Patterns

### DECLARE CONTINUE HANDLER FOR SQLEXCEPTION

The most common use case is handling SQL exceptions while allowing the procedure to continue.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE error_handler_example()
LANGUAGE SQL
BEGIN
    DECLARE error_count INT DEFAULT 0;

    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
    BEGIN
        SET error_count = error_count + 1;
    END;

    -- These statements may cause errors
    INSERT INTO table1 VALUES (1/0);
    UPDATE table2 SET status = 'completed' WHERE id = -1;
    DELETE FROM table3 WHERE invalid_column = 'test';

    -- This will execute even if errors occurred above
    INSERT INTO error_summary VALUES (error_count);
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE error_handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        error_count INT := 0;
    BEGIN
        -- Statements in procedure body
        INSERT INTO table1 VALUES (1/0);
        UPDATE table2 SET status = 'completed' WHERE id = -1;
        DELETE FROM table3 WHERE invalid_column = 'test';

        -- This will execute even if errors occurred above
        INSERT INTO error_summary VALUES (error_count);

        EXCEPTION
            WHEN OTHER CONTINUE THEN
                error_count := error_count + 1;
    END;
$$;
```

### DECLARE CONTINUE HANDLER FOR SQLSTATE

Handling specific SQLSTATE codes allows more granular control over error handling.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE sqlstate_handler_example()
LANGUAGE SQL
BEGIN
    DECLARE duplicate_key_count INT DEFAULT 0;

    -- Handle duplicate key errors (SQLSTATE 23505)
    DECLARE CONTINUE HANDLER FOR SQLSTATE '23505'
    BEGIN
        SET duplicate_key_count = duplicate_key_count + 1;
    END;

    -- Attempt to insert multiple records
    INSERT INTO users VALUES (1, 'John');
    INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key
    INSERT INTO users VALUES (2, 'Bob');

    -- Log the results
    INSERT INTO process_log VALUES ('Duplicates found: ' || duplicate_key_count);
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE sqlstate_handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        duplicate_key_count INT := 0;
    BEGIN
        -- Attempt to insert multiple records
        INSERT INTO users VALUES (1, 'John');
        INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key
        INSERT INTO users VALUES (2, 'Bob');

        -- Log the results
        INSERT INTO process_log VALUES ('Duplicates found: ' || duplicate_key_count);

        EXCEPTION
            WHEN OTHER CONTINUE THEN
                CASE
                    WHEN (SQLSTATE = '23505') THEN
                        duplicate_key_count := duplicate_key_count + 1;
                END;
    END;
$$;
```

### DECLARE CONTINUE HANDLER FOR NOT FOUND

The NOT FOUND condition is commonly used with cursors and SELECT INTO statements.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE cursor_handler_example()
LANGUAGE SQL
BEGIN
    DECLARE v_id INT;
    DECLARE v_name VARCHAR(100);
    DECLARE v_done INT DEFAULT 0;

    DECLARE CONTINUE HANDLER FOR NOT FOUND
        SET v_done = 1;

    DECLARE cur1 CURSOR FOR
        SELECT id, name FROM employees WHERE department = 'Sales';

    OPEN cur1;

    fetch_loop:
    LOOP
        FETCH cur1 INTO v_id, v_name;

        IF v_done = 1 THEN
            LEAVE fetch_loop;
        END IF;

        INSERT INTO sales_employees VALUES (v_id, v_name);
    END LOOP fetch_loop;

    CLOSE cur1;
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE cursor_handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        v_id INT;
        v_name VARCHAR(100);
        v_done INT := 0;
        cur1 CURSOR FOR
            SELECT id, name FROM employees WHERE department = 'Sales';
    BEGIN
        OPEN cur1;

        LOOP
            BEGIN
                FETCH cur1 INTO v_id, v_name;
            EXCEPTION
                WHEN NO_DATA_FOUND THEN
                    v_done := 1;
            END;

            IF (v_done = 1) THEN
                BREAK;
            END IF;

            INSERT INTO sales_employees VALUES (v_id, v_name);
        END LOOP;

        CLOSE cur1;
    END;
$$;
```

### DECLARE CONTINUE HANDLER FOR SQLWARNING

Handling warnings while allowing execution to continue.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE warning_handler_example()
LANGUAGE SQL
BEGIN
    DECLARE warning_count INT DEFAULT 0;

    DECLARE CONTINUE HANDLER FOR SQLWARNING
    BEGIN
        SET warning_count = warning_count + 1;
        INSERT INTO warning_log VALUES (CURRENT_TIMESTAMP, SQLSTATE, SQLCODE);
    END;

    -- Operations that might generate warnings
    UPDATE products SET price = price * 1.1 WHERE category = 'Electronics';
    DELETE FROM old_records WHERE record_date < CURRENT_DATE - 365 DAYS;

    INSERT INTO process_summary VALUES (warning_count);
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE warning_handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        warning_count INT := 0;
    BEGIN
        -- Note: Snowflake doesn't distinguish warnings from errors in the same way
        -- Warning handling may need to be implemented through validation logic

        BEGIN
            UPDATE products SET price = price * 1.1 WHERE category = 'Electronics';
        EXCEPTION
            WHEN OTHER THEN
                warning_count := warning_count + 1;
                INSERT INTO warning_log
                VALUES (CURRENT_TIMESTAMP(), :SQLSTATE, :SQLCODE);
        END;

        BEGIN
            DELETE FROM old_records WHERE record_date < CURRENT_DATE - 365;
        EXCEPTION
            WHEN OTHER THEN
                warning_count := warning_count + 1;
                INSERT INTO warning_log
                VALUES (CURRENT_TIMESTAMP(), :SQLSTATE, :SQLCODE);
        END;

        INSERT INTO process_summary VALUES (warning_count);
    END;
$$;
```

## Known Issues

### CONTINUE HANDLER Behavior Differences

Applies to

* IBM DB2

#### Description

The exact behavior of DB2’s CONTINUE HANDLER cannot be fully replicated in Snowflake due to architectural differences:

1. **Execution Continuation**: In DB2, a CONTINUE HANDLER allows execution to continue from the statement immediately following the one that raised the condition. In Snowflake, each statement must be wrapped in its own exception block to achieve similar behavior.
2. **Performance Impact**: Wrapping multiple statements in individual exception blocks can impact performance compared to a single handler declaration.
3. **Scope**: DB2 CONTINUE HANDLERs apply to all statements in their scope. In Snowflake, exception handling must be more explicit.

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING
2. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE (applies to FROM clause RETURN DATA UNTIL statements)

### SQLSTATE Mapping

Not all DB2 SQLSTATE codes have direct equivalents in Snowflake. SnowConvert AI performs best-effort mapping:

| DB2 SQLSTATE | Condition | Snowflake Equivalent |
| --- | --- | --- |
| 02000 | NOT FOUND | NO_DATA_FOUND |
| 23xxx | Integrity Constraint Violation | STATEMENT_ERROR |
| 42xxx | Syntax Error | STATEMENT_ERROR |
| 01xxx | Warning | OTHER (requires validation) |

#### Input Code:

##### IBM DB2

```sql
DECLARE CONTINUE HANDLER FOR SQLSTATE '42S02'
BEGIN
    -- Table doesn't exist
    CREATE TABLE missing_table (id INT, name VARCHAR(100));
END;
```

#### Output Code:

##### Snowflake

```sql
BEGIN
    -- Operation that might fail
    SELECT * FROM missing_table;
EXCEPTION
    WHEN STATEMENT_ERROR THEN
        LET errcode := :SQLCODE;
        LET sqlerrmsg := :SQLERRM;
        IF (CONTAINS(sqlerrmsg, 'does not exist') OR CONTAINS(sqlerrmsg, 'Table')) THEN
            -- Table doesn't exist
            CREATE TABLE missing_table (id INT, name VARCHAR(100));
        ELSE
            RAISE;
        END IF;
END;
```

### Multiple CONTINUE Handlers

DB2 allows multiple CONTINUE HANDLERs with different priorities. In Snowflake, handler precedence must be managed through explicit conditional logic using CASE statements.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE multiple_handlers()
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLSTATE '23505'
        INSERT INTO log VALUES ('Duplicate key error');

    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO log VALUES ('General SQL exception');

    INSERT INTO table1 VALUES (1, 'test');
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE multiple_handlers()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        INSERT INTO table1 VALUES (1, 'test');
        EXCEPTION
            WHEN OTHER CONTINUE THEN
                CASE
                    WHEN (SQLSTATE = '23505') THEN
                        INSERT INTO log VALUES ('Duplicate key error')
                    ELSE
                        INSERT INTO log VALUES ('General SQL exception')
                END;
    END;
$$;
```

### Mixed CONTINUE and EXIT Handlers

Applies to

* IBM DB2

#### Description

DB2 allows declaring both CONTINUE and EXIT handlers in the same procedure block. However, Snowflake Scripting does not support mixing CONTINUE and EXIT handlers in the same EXCEPTION block. When this pattern is encountered, SnowConvert AI generates separate EXCEPTION blocks with an EWI warning.

#### Input Code:

##### IBM DB2

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
BEGIN
    DECLARE test_1 INTEGER DEFAULT 10;
    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO error_test VALUES ('EXCEPTION');
    DECLARE EXIT HANDLER FOR SQLSTATE '20000'
        INSERT INTO error_test VALUES ('ERROR 2000');

    SET test_1 = 1 / 0;
    INSERT INTO error_test VALUES ('EXIT');
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        test_1 INTEGER DEFAULT 10;
    BEGIN
        test_1 := 1 / 0;
        INSERT INTO error_test VALUES ('EXIT');
        EXCEPTION
            WHEN OTHER CONTINUE THEN
                INSERT INTO error_test VALUES ('EXCEPTION')
        !!!RESOLVE EWI!!! /*** SSC-EWI-0114 - MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '20000') THEN
                        INSERT INTO error_test VALUES ('ERROR 2000')
                END
    END;
$$;
```

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING

## Best Practices

When working with converted CONTINUE HANDLER code:

1. **Validate Error Handling**: Thoroughly test all error scenarios to ensure the converted code behaves as expected.
2. **Review Performance**: Multiple exception blocks can impact performance. Consider refactoring when appropriate.
3. **Use Appropriate Exception Types**: Map DB2 conditions to the most specific Snowflake exception types available.
4. **Implement Logging**: Add comprehensive logging to track errors and ensure visibility into exception handling.
5. **Consider Transactions**: Use Snowflake’s transaction support to maintain data consistency when errors occur.
6. **Document Behavior Changes**: Document any differences in behavior between DB2 CONTINUE HANDLER and the Snowflake implementation.

## Related Documentation

* [IBM DB2 DECLARE HANDLER](https://www.ibm.com/docs/en/db2/11.5?topic=statements-declare-handler)
* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [Snowflake Stored Procedures](https://docs.snowflake.com/en/sql-reference/stored-procedures-overview)

## See Also

* [DB2 CREATE PROCEDURE](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-procedure-sql)
* [DB2 FROM Clause](db2-from-clause.md)
* [DB2 SELECT Statement](db2-select-statement.md)
* [DB2 Data Types](db2-data-types.md)

---
title: SnowConvert AI - IBM DB2 - CREATE FUNCTION
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-create-function.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CREATE FUNCTION

## Description

> Creates a new user defined function or replaces an existing function for the current database. ([IBM DB2 SQL Language Reference Create Function](https://www.ibm.com/docs/en/db2/12.1.0?topic=statements-create-function-sql-scalar-table-row)).

DB2 User Defined Functions (UDFs) allow developers to extend the built-in functionality of the database by creating custom functions that can be invoked in SQL statements. DB2 supports several types of SQL UDFs:

* **Scalar Functions**: Return a single value and can be used wherever an SQL expression is valid.
* **Table Functions**: Return a table result set and can be used in the FROM clause of SELECT statements.

**SnowConvert AI Translation Support**: SnowConvert AI supports translation of **Inline UDFs**, **SQL Scalar Functions**, and **SQL Table Functions** with the following migration approaches:

* **Inline Functions**: Will be kept as **Inline Functions** in Snowflake.
* **Non-inline UDFs**: If they are Snowflake Scripting compliant, they will be transformed into [**Snowflake Scripting UDFs**](https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-procedural-functions)
* **Complex UDFs**: Any UDFs that don’t fit the above two categories will be migrated to **Stored Procedures** instead.

In cases where UDFs are converted to procedures, an **EWI** message will be added to inform the user why the UDF could not be directly migrated to a Snowflake UDF and was converted to a procedure instead.

## Grammar Syntax

The following is the SQL syntax to create a user defined function in IBM DB2. Click [here](https://www.ibm.com/docs/en/db2/12.1.0?topic=statements-create-function-sql-scalar-table-row#r0003493__title__4) to go to the DB2 specification for this syntax.

```sql
CREATE [ OR REPLACE ] FUNCTION function_name
  [ ( [ { IN | OUT | INOUT } ] parameter_name data_type [ DEFAULT default_clause ]... ) ]
  RETURNS { data_type
          | ROW ( column_name data_type [, column_name data_type ]... )
          | TABLE ( column_name data_type [, column_name data_type ]... )
          | row_type_name
          | anchored_row_data_type
          | ELEMENT OF array_type_name }
  [ LANGUAGE SQL ]
  [ PARAMETER CCSID { ASCII | UNICODE } ]
  [ SPECIFIC specific_name ]
  [ { DETERMINISTIC | NOT DETERMINISTIC } ]
  [ { EXTERNAL ACTION | NO EXTERNAL ACTION } ]
  [ { READS SQL DATA | CONTAINS SQL | MODIFIES SQL DATA } ]
  [ { ALLOW PARALLEL | DISALLOW PARALLEL } ]
  [ STATIC DISPATCH ]
  [ CALLED ON NULL INPUT ]
  [ INHERIT SPECIAL REGISTERS ]
  [ PREDICATES ( predicate_specification ) ]
  [ { INHERIT ISOLATION LEVEL [ { WITHOUT LOCK REQUEST | WITH LOCK REQUEST } ] } ]
  [ { SECURED | NOT SECURED } ]
  RETURN { expression
         | SELECT statement
         | BEGIN [ ATOMIC ]
             [ DECLARE declarations ]
             statement...
           END }
```

## UDF Option List

### Description

DB2 CREATE FUNCTION statements support various options that control the behavior, performance, and security characteristics of the function. These options specify how the function should be executed, what SQL operations it can perform, whether it’s deterministic, and how it handles parallel execution, among other settings.

### Migration Support Table

The following table shows the DB2 to Snowflake UDF option equivalencies:

| DB2 Option | Snowflake Equivalent | Notes |
| --- | --- | --- |
| `LANGUAGE SQL` | `LANGUAGE SQL` | Translated to Snowflake’s equivalent syntax |
| `SPECIFIC specific_name` | Not Needed | Snowflake doesn’t support specific names for UDFs |
| `DETERMINISTIC` / `NOT DETERMINISTIC` | `IMMUTABLE` / `MUTABLE` | Preserved in Snowflake UDF definition |
| `EXTERNAL ACTION` / `NO EXTERNAL ACTION` | Not Needed | Snowflake doesn’t have equivalent option |
| `READS SQL DATA` | `LANGUAGE SQL` | Snowflake UDFs can read data by default |
| `CONTAINS SQL` | `LANGUAGE SQL` | Basic SQL support is default in Snowflake |
| `MODIFIES SQL DATA` | Not Needed | UDFs that modify data are converted to stored procedures |
| `ALLOW PARALLEL` / `DISALLOW PARALLEL` | Not Needed | Snowflake handles parallelization automatically |
| `STATIC DISPATCH` | Not Needed | Not applicable in Snowflake’s architecture |
| `CALLED ON NULL INPUT` | Default Behavior | Snowflake UDFs handle NULL inputs by default |
| `INHERIT SPECIAL REGISTERS` | Not Needed | Snowflake doesn’t have equivalent special registers |
| `PREDICATES (...)` | Not Needed | Snowflake doesn’t support predicate pushdown specifications |
| `INHERIT ISOLATION LEVEL` | Not Needed | Snowflake uses different transaction isolation model |
| `SECURED` / `NOT SECURED` | Not Needed | Snowflake uses different security model |
| `PARAMETER CCSID` | Not Needed | Character set handling differs in Snowflake |

> **Note:**
>
> Options marked as “Not Needed” will be removed during migration because Snowflake either handles the functionality automatically (e.g., parallelization, NULL input handling) or the option is not applicable in Snowflake’s architecture (e.g., special registers, isolation levels).

## INLINE UDF

### Description

Inline UDFs in DB2 are simple functions that contain a single SQL expression or return statement without complex procedural logic. These functions are typically defined with a direct `RETURN` clause followed by a simple expression, calculation, or query.

SnowConvert AI preserves these as **Inline Functions** in Snowflake, maintaining their simplicity and performance characteristics.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

## Sample Source Patterns

### Input Code:

#### Db2 - Inline UDF with expression

```sql
CREATE FUNCTION CALCULATE_TAX (price DECIMAL(10,2), tax_rate DECIMAL(5,4))
RETURNS DECIMAL(10,2)
LANGUAGE SQL
DETERMINISTIC
NO EXTERNAL ACTION
CONTAINS SQL
RETURN price * tax_rate;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION CALCULATE_TAX (price DECIMAL(10,2), tax_rate DECIMAL(5,4))
RETURNS DECIMAL(10,2)
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "CzuaAR6Mu3GfenyLFPxUGw==" }}'
AS
$$
    price * tax_rate
$$;
```

### Input Code:

#### Db2 - Inline UDF with Select

```sql
CREATE FUNCTION GET_EMPLOYEE_COUNT (dept_id INTEGER)
RETURNS INTEGER
LANGUAGE SQL
DETERMINISTIC
READS SQL DATA
RETURN SELECT COUNT(*) FROM employees WHERE department_id = dept_id;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION GET_EMPLOYEE_COUNT (dept_id INTEGER)
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "HjuaARhyBn6VZ1Uyctr5Ag==" }}'
AS
$$
     SELECT
      COUNT(*) FROM
      employees
     WHERE department_id = :dept_id
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

There are no related EWIs.

## Scalar UDF

### Description

Scalar UDFs in DB2 are functions that return a single value and can contain more complex logic than inline functions. These functions may include procedural constructs such as variable declarations, conditional statements, loops, and multiple SQL statements. Unlike inline UDFs, scalar UDFs with complex logic use a `BEGIN...END` block structure to encapsulate their functionality.

**SnowConvert AI Migration**: If the scalar UDF logic is compatible with Snowflake Scripting syntax, it will be translated to a **Snowflake Scripting UDF**. This preserves the function’s behavior while adapting it to Snowflake’s UDF implementation. Functions that contain unsupported constructs will be converted to stored procedures instead.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Sample Source Patterns

#### Input Code:

#### Db2 - Scalar UDF with IF ELSE Statement

```sql
CREATE FUNCTION CALCULATE_DISCOUNT (purchase_amount DECIMAL(10,2), customer_type VARCHAR(20))
RETURNS DECIMAL(10,2)
LANGUAGE SQL
DETERMINISTIC
NO EXTERNAL ACTION
CONTAINS SQL
BEGIN
    DECLARE discount_rate DECIMAL(5,4);
    DECLARE final_discount DECIMAL(10,2);

    IF customer_type = 'PREMIUM' THEN
        SET discount_rate = 0.15;
    ELSIF customer_type = 'GOLD' THEN
        SET discount_rate = 0.10;
    ELSIF customer_type = 'SILVER' THEN
        SET discount_rate = 0.05;
    ELSE
        SET discount_rate = 0.02;
    END IF;

    SET final_discount = purchase_amount * discount_rate;
    RETURN final_discount;
END;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION CALCULATE_DISCOUNT (purchase_amount DECIMAL(10,2), customer_type VARCHAR(20))
RETURNS DECIMAL(10,2)
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "usGaAajDzniYFKtk+Fvnzg==" }}'
AS
$$
    DECLARE
        discount_rate DECIMAL(5,4);
        final_discount DECIMAL(10,2);
    BEGIN
        IF (:customer_type = 'PREMIUM') THEN
            discount_rate := 0.15;
        ELSEIF (:customer_type = 'GOLD') THEN
            discount_rate := 0.10;
        ELSEIF (:customer_type = 'SILVER') THEN
            discount_rate := 0.05;
        ELSE
            discount_rate := 0.02;
        END IF;
        final_discount := purchase_amount * discount_rate;
    RETURN final_discount;
    END
$$;
```

### Input Code:

#### Db2 - Scalar UDF with WHILE Loop

```sql
CREATE FUNCTION CALCULATE_COMPOUND_INTEREST (principal DECIMAL(15,2), rate DECIMAL(5,4), years INTEGER)
RETURNS DECIMAL(15,2)
LANGUAGE SQL
DETERMINISTIC
NO EXTERNAL ACTION
CONTAINS SQL
BEGIN
    DECLARE counter INTEGER DEFAULT 1;
    DECLARE amount DECIMAL(15,2);

    SET amount = principal;

    WHILE counter <= years DO
        SET amount = amount * (1 + rate);
        SET counter = counter + 1;
    END WHILE;

    RETURN amount;
END;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION CALCULATE_COMPOUND_INTEREST (principal DECIMAL(15,2), rate DECIMAL(5,4), years INTEGER)
RETURNS DECIMAL(15,2)
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "usGaAajDzniYFKtk+Fvnzg==" }}'
AS
$$
    DECLARE
        counter INTEGER DEFAULT 1;
        amount DECIMAL(15,2);
    BEGIN
        amount := principal;
        WHILE (:counter <= :years) DO
            amount := amount * (1 + rate);
            counter := counter + 1;
        END WHILE;

        RETURN amount;
    END
$$;
```

### Input Code:

#### Db2 - Scalar UDF with simple select into for variable assignment.

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(
    p_BasePrice DECIMAL(10, 2),
    p_Quantity INT
)
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
BEGIN
    DECLARE v_Discount DECIMAL(5, 2);
    DECLARE v_Subtotal DECIMAL(10, 2);
    DECLARE v_FinalPrice DECIMAL(10, 2);

    SELECT CASE
               WHEN p_Quantity >= 10 THEN 0.15
               WHEN p_Quantity >= 5 THEN 0.10
               ELSE 0.05
           END,
           p_BasePrice * p_Quantity
    INTO v_Discount, v_Subtotal
    FROM SYSIBM.SYSDUMMY1;

    SET v_FinalPrice = v_Subtotal * (1 - v_Discount);

    RETURN v_FinalPrice;
END;
```

### Output Code:

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(
    p_BasePrice DECIMAL(10, 2),
    p_Quantity INT
)
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "o8GaAaDmH3aONT/SIuOCFw==" }}'
AS
$$
    DECLARE
        v_Discount DECIMAL(5, 2);
        v_Subtotal DECIMAL(10, 2);
        v_FinalPrice DECIMAL(10, 2);
    BEGIN
        v_Discount := CASE
                        WHEN :p_Quantity >= 10 THEN 0.15
                        WHEN :p_Quantity >= 5 THEN 0.10
                            ELSE 0.05
                        END;
        v_Subtotal := :p_BasePrice * :p_Quantity;
        v_FinalPrice := v_Subtotal * (1 - v_Discount);

    RETURN v_FinalPrice;
    END
$$;
```

### Input Code:

#### Db2 - Scalar UDF with Values statement for variable assignment.

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(
    p_BasePrice DECIMAL(10, 2),
    p_Quantity INT
)
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
BEGIN
    DECLARE v_Discount DECIMAL(5, 2);
    DECLARE v_Subtotal DECIMAL(10, 2);
    DECLARE v_FinalPrice DECIMAL(10, 2);

    VALUES (CASE
                WHEN p_Quantity >= 10 THEN 0.15
                WHEN p_Quantity >= 5 THEN 0.10
                ELSE 0.05
            END,
            p_BasePrice * p_Quantity)
    INTO v_Discount, v_Subtotal;

    SET v_FinalPrice = v_Subtotal * (1 - v_Discount);

    RETURN v_FinalPrice;
END;
```

### Output Code:

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(
    p_BasePrice DECIMAL(10, 2),
    p_Quantity INT
)
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "usGaAajDzniYFKtk+Fvnzg==" }}'
AS
$$
    DECLARE
        v_Discount DECIMAL(5, 2);
        v_Subtotal DECIMAL(10, 2);
        v_FinalPrice DECIMAL(10, 2);
    BEGIN
        v_Discount := CASE
                            WHEN :p_Quantity >= 10 THEN 0.15
                            WHEN :p_Quantity >= 5 THEN 0.10
                            ELSE 0.05
                        END;
        v_Subtotal :=
        p_BasePrice * p_Quantity;
        v_FinalPrice := v_Subtotal * (1 - v_Discount);

            RETURN v_FinalPrice;
    END
$$;
```

### Known Issues

> **Warning:**
>
> **SnowConvert AI will not translate UDFs containing the following elements into SnowScripting UDFs, as these features are unsupported in SnowScripting UDFs:**
>
> * Access database tables
> * Use cursors
> * Call other UDFs
> * Contain aggregate or window functions
> * Perform DML operations (INSERT/UPDATE/DELETE)
> * Return result sets

### Related EWIs

There are no related EWIs.

## Table UDF

### Description

Table UDFs (Table-Valued Functions) in DB2 are functions that return a table result set rather than a single value. These functions can be used in the FROM clause of SELECT statements. Table UDFs are defined with a `RETURNS TABLE` clause that specifies the structure of the returned table.

**SnowConvert AI Migration**: Table UDFs that are compatible with Snowflake’s table function syntax will be translated to **Snowflake Table Functions**. This preserves their functionality while adapting them to Snowflake’s implementation. Functions that contain unsupported [Snowflake Scripting](https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-procedural-functions) elements will be converted to stored procedures that return result sets instead.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

## Sample Source Patterns

### Input Code:

#### Db2 - Simple Table UDF

```sql
CREATE FUNCTION GET_EMPLOYEES_BY_DEPT (dept_id INTEGER)
RETURNS TABLE (
    employee_id INTEGER,
    employee_name VARCHAR(100),
    salary DECIMAL(10,2),
    hire_date DATE
)
LANGUAGE SQL
DETERMINISTIC
READS SQL DATA
RETURN
    SELECT emp_id, emp_name, emp_salary, emp_hire_date
    FROM employees
    WHERE department_id = dept_id
    ORDER BY emp_name;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION GET_EMPLOYEES_BY_DEPT (dept_id INTEGER)
RETURNS TABLE (
    employee_id INTEGER,
     employee_name VARCHAR(100),
     salary DECIMAL(10,2),
     hire_date DATE
)
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "3TuaAdyRFHCJuSE7bnMCGg==" }}'
AS
$$
    SELECT emp_id, emp_name, emp_salary, emp_hire_date
    FROM
      employees
    WHERE department_id = dept_id
    ORDER BY emp_name
$$;
```

### Input Code:

#### Db2 - Complex Table UDF with Multiple Parameters

```sql
CREATE FUNCTION GET_SALES_REPORT (start_date DATE, end_date DATE, min_amount DECIMAL(10,2))
RETURNS TABLE (
    sales_id INTEGER,
    customer_name VARCHAR(100),
    product_name VARCHAR(100),
    sale_amount DECIMAL(10,2),
    sale_date DATE,
    region VARCHAR(50)
)
LANGUAGE SQL
DETERMINISTIC
READS SQL DATA
RETURN
    SELECT s.sale_id, c.customer_name, p.product_name,
           s.amount, s.sale_date, c.region
    FROM sales s
    JOIN customers c ON s.customer_id = c.customer_id
    JOIN products p ON s.product_id = p.product_id
    WHERE s.sale_date BETWEEN start_date AND end_date
      AND s.amount >= min_amount
    ORDER BY s.sale_date DESC, s.amount DESC;
```

### Output Code:

#### Snowflake

```sql
CREATE FUNCTION GET_SALES_REPORT (start_date DATE, end_date DATE, min_amount DECIMAL(10,2))
RETURNS TABLE (
    sales_id INTEGER,
     customer_name VARCHAR(100),
     product_name VARCHAR(100),
     sale_amount DECIMAL(10,2),
     sale_date DATE,
     region VARCHAR(50)
)
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "3TuaAdyRFHCJuSE7bnMCGg==" }}'
AS
$$
    SELECT s.sale_id, c.customer_name, p.product_name,
           s.amount, s.sale_date, c.region
    FROM
      sales s
    JOIN
        customers c ON s.customer_id = c.customer_id
    JOIN
        products p ON s.product_id = p.product_id
    WHERE s.sale_date BETWEEN start_date AND end_date
      AND s.amount >= min_amount
    ORDER BY s.sale_date DESC, s.amount DESC
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## UDF Converted to Stored Procedure

### Description

Some DB2 UDFs cannot be directly migrated as Snowflake UDFs due to limitations in [Snowflake Scripting UDFs](https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-procedural-functions). When a DB2 UDF contains elements that are not supported in Snowflake Scripting UDFs (such as SQL DML statements, cursors, result sets, or calls to other UDFs), SnowConvert AI will migrate these functions as **Stored Procedures** instead.

**SnowConvert AI Migration**: These UDFs are converted to stored procedures with an **EWI** message explaining why the direct UDF migration was not possible.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Sample Source Patterns

#### Input Code:

#### Db2 - UDF with DML Statement

```sql
CREATE FUNCTION LOG_AUDIT_EVENT (event_type VARCHAR(50), event_details VARCHAR(500))
RETURNS INTEGER
LANGUAGE SQL
DETERMINISTIC
MODIFIES SQL DATA
BEGIN
    INSERT INTO audit_log (event_type, event_details, log_timestamp)
    VALUES (event_type, event_details, CURRENT_TIMESTAMP);

    RETURN 1;
END;
```

#### Output Code:

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE BECAUSE IT CONTAINS THE FOLLOWING: VALUES CLAUSE, INSERT STATEMENT ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "audit_log" **
CREATE OR REPLACE PROCEDURE LOG_AUDIT_EVENT (event_type VARCHAR(50), event_details VARCHAR(500))
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "9DuaAVemZHCpeN/qzhiOkg==" }}'
AS
$$
BEGIN
    INSERT INTO audit_log (event_type, event_details, log_timestamp)
    VALUES (event_type, event_details, CURRENT_TIMESTAMP);

    RETURN 1;
END
$$;
```

### Known Issues

The main limitation is that the resulting converted procedure must be invoked using the CALL syntax, preventing its use directly within standard SQL expressions like the original UDF.

### Related EWIs

1. **SSC-EWI-0068**: User defined function was transformed to a Snowflake procedure.

---
title: SnowConvert AI - IBM DB2 - CREATE PROCEDURE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-create-procedure.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CREATE PROCEDURE

## Description

> Creates a new stored procedure or replaces an existing procedure for the current database. ([IBM DB2 SQL Language Reference Create Procedure](https://www.ibm.com/docs/en/db2/12.1.0?topic=statements-create-procedure-sql)).

## Grammar Syntax

The following is a SQL syntax for creating a procedure in IBM Db2. See the [DB2 CREATE PROCEDURE specification](https://www.ibm.com/docs/en/db2/12.1.0?topic=statements-create-procedure-sql).

```sql
CREATE [ OR REPLACE ] PROCEDURE procedure_name
  ( [ parameter { , parameter }* ] )
LANGUAGE SQL
BEGIN
  statements
END;

parameter := [ IN | OUT | INOUT ] param_name data_type [ DEFAULT expression ]
```

## Sample Source Patterns

### Input Code:

#### Db2

```sql
CREATE OR REPLACE PROCEDURE TEST_PROCEDURE ()
LANGUAGE SQL
BEGIN
    VALUES CURRENT_TIMESTAMP;
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE TEST_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   BEGIN
      SELECT
         CURRENT_TIMESTAMP      ;
   END
$$;
```

## Related EWIs

There are no issues for this transformation.

## DECLARE

### Description

Section to declare all the procedure variables except for loop variables.
Db2 supports multiple DECLARE sections per block statement, since Snowflake does not support this behavior they must be merged into a single declaration statement per block.

### Grammar Syntax

```sql
 [ DECLARE declarations ]
```

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE first_procedure (first_parameter INTEGER)
LANGUAGE SQL
BEGIN
   DECLARE i INTEGER DEFAULT first_parameter;
   SELECT i;
END;

CREATE OR REPLACE PROCEDURE second_procedure (first_parameter INTEGER)
LANGUAGE SQL
BEGIN
   DECLARE i INTEGER DEFAULT first_parameter;
   DECLARE j INTEGER DEFAULT first_parameter;
   SELECT i;
END;
```

##### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE first_procedure (first_parameter INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   DECLARE
      i INTEGER DEFAULT first_parameter;
   BEGIN
      SELECT
         :i;
   END
$$;

CREATE OR REPLACE PROCEDURE second_procedure (first_parameter INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   DECLARE
      i INTEGER DEFAULT first_parameter;
      j INTEGER DEFAULT first_parameter;
   BEGIN
      SELECT
         :i;
   END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## EXCEPTION

### Description

Db2 handles exceptions with handlers declared in the block. A handler can be `CONTINUE` (execution continues) or `EXIT` (leaves the block) and can catch general or specific conditions (for example, `SQLEXCEPTION`, `SQLSTATE 'state'`, `SQLCODE code`).

### Grammar Syntax

```sql
DECLARE { CONTINUE | EXIT } HANDLER FOR condition
  statements;

condition := SQLEXCEPTION | SQLSTATE 'state' | SQLCODE code
```

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE update_employee_sp ()
LANGUAGE SQL
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO error_log(ts, msg) VALUES (CURRENT_TIMESTAMP, 'An exception occurred');

    SELECT var;
END;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE update_employee_sp ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
BEGIN

    SELECT var;
      EXCEPTION
         WHEN OTHER CONTINUE THEN
            INSERT INTO error_log (ts, msg) VALUES (CURRENT_TIMESTAMP, 'An exception occurred')
END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## LABEL

### Description

Labels are used in Db2 to qualify a block or to use the EXIT or END statement. Snowflake does not support labels. However, a workaround is used for accessing outer-block-declared variables which can be accessed by the fully qualified name, such as `outer_block.variable_name`

> **Warning:**
>
> Since labels are not supported in Snowflake, an EWI will be printed.

### Grammar Syntax

```sql
 label : BEGIN
    statements
 END label;
```

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE P_DEMO_SCOPE()
BEGIN
outer_block:
BEGIN
    DECLARE v_scope_test VARCHAR(50) DEFAULT 'I am from the OUTER block';
    INSERT INTO TABLETEST(VALUE, TIME) VALUES(v_scope_test, CURRENT_TIMESTAMP);
    inner_block:
    BEGIN
    DECLARE v_scope_test VARCHAR(50) DEFAULT 'I am from the INNER block';
    SET outer_block.v_scope_test = 'The INNER block changed me!';
    INSERT INTO TABLETEST(VALUE, TIME) VALUES(v_scope_test, CURRENT_TIMESTAMP);

    END inner_block;

    INSERT INTO TABLETEST(VALUE, TIME) VALUES(v_scope_test, CURRENT_TIMESTAMP);

END outer_block;
END;
```

##### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE P_DEMO_SCOPE ()
RETURNS VARCHAR
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "vjqaAbThwXqZ0mSDaENBCw==" }}'
AS
$$
    BEGIN
        DECLARE
            outer_block_v_scope_test VARCHAR(50) DEFAULT 'I am from the OUTER block';
        BEGIN
            INSERT INTO TABLETEST (VALUE, TIME) VALUES(:outer_block_v_scope_test, CURRENT_TIMESTAMP);
            DECLARE
                v_scope_test VARCHAR(50) DEFAULT 'I am from the INNER block';
            BEGIN
                outer_block_v_scope_test := 'The INNER block changed me!';
            INSERT INTO TABLETEST (VALUE, TIME) VALUES(:v_scope_test, CURRENT_TIMESTAMP);
            END;

            INSERT INTO TABLETEST (VALUE, TIME) VALUES(:outer_block_v_scope_test, CURRENT_TIMESTAMP);
        END;
    END
$$;
```

### Known Issues

1. If a variable name is the same as a modified one, it will cause inconsistencies.

### Related EWIs

There are no related EWIs.

## VARIABLE DECLARATION

### Description

Declare variables inside the block’s `DECLARE` area. Variables can specify an initial value using `DEFAULT`. Subsequent assignments use the `SET` statement.

> **Note:**
>
> Variable declarations are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
DECLARE
  name type [ DEFAULT expression ];
```

Notes:

* Use `SET name = expression;` to assign after declaration.

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE VARIABLE_DECLARATION ()
LANGUAGE SQL
BEGIN
    DECLARE v_simple_int INTEGER;
    DECLARE v_default_char CHAR(4) DEFAULT 'ABCD';
    DECLARE v_default_decimal DECIMAL(10,2) DEFAULT 10.00;
    DECLARE v_text VARCHAR(50) DEFAULT 'Test default';
    VALUES v_simple_int, v_default_char, v_default_decimal, v_text;
END;
```

##### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE VARIABLE_DECLARATION ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   DECLARE
      v_simple_int INTEGER;
      v_default_char CHAR(4) DEFAULT 'ABCD';
      v_default_decimal DECIMAL(10,2) DEFAULT 10.00;
      v_text VARCHAR(50) DEFAULT 'Test default';
   BEGIN
      SELECT
         v_simple_int, v_default_char, v_default_decimal, v_text      ;
   END
$$;
```

### Known Issues

No issues were found.

### Related EWIs

There are no related EWIs.

## SET

### Description

Assign a value to a variable within a procedure block.

### Grammar Syntax

```sql
SET variable_name = expression;
```

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE PROC_SET ()
LANGUAGE SQL
BEGIN
    DECLARE v_total INTEGER DEFAULT 0;
    SET v_total = v_total + 10;
END;
```

##### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_SET ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   DECLARE
      v_total INTEGER DEFAULT 0;
   BEGIN
      v_total := v_total + 10;
   END
$$;
```

## IF

### Description

Evaluate conditions and execute different branches. Db2 supports `ELSEIF` and an optional `ELSE` branch.

### Grammar Syntax

```sql
 IF boolean-expression THEN
  statements
[ ELSIF boolean-expression THEN
  statements
[ ELSIF boolean-expression THEN
  statements
    ...] ]
[ ELSE
  statements ]
END IF;
```

### Sample Source Patterns

#### Input Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE PROC1 (paramNumber INTEGER)
LANGUAGE SQL
BEGIN
    DECLARE result VARCHAR(100);
    IF paramNumber = 0 THEN
      SET result = 'zero';
    ELSEIF paramNumber > 0 THEN
      SET result = 'positive';
    ELSEIF paramNumber < 0 THEN
      SET result = 'negative';
    ELSE
      SET result = 'NULL';
    END IF;
END;
```

##### Output Code:

##### Db2

```sql
CREATE OR REPLACE PROCEDURE PROC1 (paramNumber INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "10/31/2025",  "domain": "no-domain-provided",  "migrationid": "tDqaAcdlYXqyx5yxM208hw==" }}'
AS
$$
   DECLARE
      result VARCHAR(100);
   BEGIN
      IF (:paramNumber = 0) THEN
         result := 'zero';
         ELSEIF (:paramNumber > 0) THEN
         result := 'positive';
         ELSEIF (:paramNumber < 0) THEN
         result := 'negative';
         ELSE
         result := 'NULL';
      END IF;
   END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

---
title: SnowConvert AI - IBM DB2 - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-create-table.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CREATE TABLE

## Description

> The complete CREATE TABLE [syntax](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table) for IBM DB2 is big enough that it does not fit on one page. However, the following image shows an overview of the syntax with some logical grouping that is later referenced.

## Grammar Syntax

## As Result Table

### Description

> Specifies that the columns of the new table have the same name, data type, and optionally same data, as the result from the fullselect.

> **Warning:**
>
> AS RESULT TABLE is partially supported in Snowflake. The Copy options do not apply in Snowflake.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_frag-as-result-table) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable1
AS (SELECT * FROM OriginalTable) WITH NO DATA;
```

#### Snowflake

```sql
CREATE TABLE TestTable1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
AS (SELECT * FROM
  OriginalTable
 LIMIT 0
);
```

##### IBM DB2

```sql
 CREATE TABLE TestTable2
AS (SELECT * FROM OriginalTable) WITH DATA
INCLUDING COLUMN DEFAULTS
INCLUDING IDENTITY;
```

##### Snowflake

```sql
CREATE TABLE TestTable2
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
AS (SELECT * FROM
  OriginalTable
 );
```

## Materialized Query Definition

### Description

> Materialized query tables (MQTs) are tables whose definition is based on the result of a query.

Currently, translation for the IBM DB2 Materialized Query is not supported by SnowConvert AI

See the [DB2 materialized query definition documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_materialized-query-definition) for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable4 (ACCTID, LOCID, YEAR, CNT) AS
  (SELECT ACCOUNTID, LOCATIONID, YEAR, COUNT(*)
     FROM TRANS
     GROUP BY ACCOUNTID, LOCATIONID, YEAR )
     DATA INITIALLY DEFERRED
     REFRESH DEFERRED
     MAINTAINED BY SYSTEM
     ENABLE QUERY OPTIMIZATION;
```

#### Snowflake

```sql
  CREATE TABLE TestTable4 (ACCTID, LOCID, YEAR, CNT) AS
(SELECT ACCOUNTID, LOCATIONID, YEAR,
  COUNT(*)
FROM
  TRANS
GROUP BY ACCOUNTID, LOCATIONID, YEAR )
 !!!RESOLVE EWI!!! /*** SSC-EWI-DB0021 - MATERIALIZED QUERY IS NOT SUPPORTED ***/!!!
DATA INITIALLY DEFERRED
REFRESH DEFERRED
MAINTAINED BY SYSTEM
ENABLE QUERY OPTIMIZATION
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Related EWIs

1. [SSC-EWI-DB0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): NODE NOT SUPPORTED

## Of Type

### Description

> Specifies that the columns of the table are based on the attributes of the structured type.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_of) to navigate to the IBM DB2 documentation page for this syntax.

TYPED TABLES are not supported in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
    CREATE TABLE TestTable5 OF Student_t UNDER Person
   INHERIT SELECT PRIVILEGES;
```

#### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-DB0017 - TYPED TABLES ARE NOT SUPPORTED ***/!!!

CREATE TABLE TestTable5 OF Student_t UNDER Person
   INHERIT SELECT PRIVILEGES;
```

### Related EWIs

1. [SSC-EWI-DB0017](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): NODE NOT SUPPORTED

## Staging Table Definition

### Description

> A *staging table* allows incremental maintenance support for deferred materialized query table.

STAGING TABLES are not supported in Snowflake.

See the [DB2 staging data tables](https://www.ibm.com/docs/en/db2/11.5?topic=tables-creating-staging-data) or the [staging table definition syntax](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_staging-table-definition) in the IBM DB2 documentation.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
create table TestTable6 for emp_summary propagate immediate;
```

#### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-DB0018 - STAGING TABLES ARE NOT SUPPORTED ***/!!!
create table TestTable6 for emp_summary propagate immediate;
```

### Related EWIs

1. [SSC-EWI-DB0018](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): NODE NOT SUPPORTED

## Element List

## Check Constraint

### Description

> Constraints are used to specify rules for the data in a table.

See the [DB2 column options documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_frag-column-options) for this syntax.

> **Warning:**
>
> Some CONSTRAINT options are migrated as is to Snowflake but some of them are removed because of platform differences. Check the code example to learn more.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable7(
    COL1 VARCHAR(1),
    CONSTRAINT CN1 CHECK(COL1<1),
    CONSTRAINT CN2 CHECK(SOMENAME DETERMINED BY OTHERNAME),
    CONSTRAINT CN2 CHECK((SOMENAME1, SOMENAME2) DETERMINED BY (SOMENAME3, SOMENAME4))
    );
```

#### Snowflake

```sql
CREATE TABLE TestTable7 (
    COL1 VARCHAR(1),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
    CONSTRAINT CN1 CHECK(COL1<1),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
    CONSTRAINT CN2 CHECK(SOMENAME DETERMINED BY OTHERNAME),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
    CONSTRAINT CN2 CHECK((SOMENAME1, SOMENAME2) DETERMINED BY (SOMENAME3, SOMENAME4))
    )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Related EWIs

1. [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check Statement Not Supported.

## Period Definition

### Description

Defines a period of time in which the data of a row is valid.

> **Warning:**
>
> PERIOD-DEFINITION does not have a functional equivalent in Snowflake.

> **Note:**
>
> Snowflake allows the storage of historical table data for up to 90 days, to know more about this see [Understanding & Using Time Travel](https://docs.snowflake.com/en/user-guide/data-time-travel.html).

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_period-definition) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

```sql
CREATE TABLE TestTable8(
COL1 DATE,
COL2 DATE,
PERIOD SYSTEM_TIME (COL1, COL2));
)
```

```sql
CREATE TABLE TestTable8 (
COL1 DATE,
    COL2 DATE,
    !!!RESOLVE EWI!!! /*** SSC-EWI-DB0003 - PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
    PERIOD SYSTEM_TIME (COL1, COL2))
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
 CREATE OR REPLACE TABLE TestTable9 (
    COL1 VARCHAR(1)
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
 ;
```

### Related EWIs

1. [SSC-EWI-DB0003](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): Period definition is not applicable in Snowflake.

## Referential Constraint

### Description

> Foreign Key Constraints are migrated through ALTER TABLE statements to remove dependencies at the table creation time and therefore facilitate database deployment.

See the [DB2 column options documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_frag-column-options) for this syntax.

### Grammar Syntax

### Sample Source Patterns

```sql
CREATE TABLE TestTable9(
    COL1 VARCHAR(1),
    CONSTRAINT FKCOL1 FOREIGN KEY (COL1) REFERENCES T1,
    CONSTRAINT FKCOL2 FOREIGN KEY (COL1) REFERENCES T1(COL1),
    CONSTRAINT FKCOL3 FOREIGN KEY (COL1) REFERENCES T1(COL1) ON DELETE CASCADE ON UPDATE NO ACTION,
    CONSTRAINT FKCOL4 FOREIGN KEY (COL1) REFERENCES T1(COL1) ENFORCED DISABLE QUERY OPTIMIZATION,
    FOREIGN KEY (COL1) REFERENCES T1
);
```

```sql
 CREATE OR REPLACE TABLE TestTable9 (
    COL1 VARCHAR(1)
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
 ;

 ALTER TABLE TestTable9
 ADD
    CONSTRAINT FKCOL1 FOREIGN KEY (COL1) REFERENCES T1 ;

 ALTER TABLE TestTable9
 ADD
    CONSTRAINT FKCOL2 FOREIGN KEY (COL1) REFERENCES T1 (COL1) ;

 ALTER TABLE TestTable9
 ADD
    CONSTRAINT FKCOL3 FOREIGN KEY (COL1) REFERENCES T1 (COL1) ON DELETE CASCADE ON UPDATE NO ACTION;

 ALTER TABLE TestTable9
 ADD
    CONSTRAINT FKCOL4 FOREIGN KEY (COL1) REFERENCES T1 (COL1) ENFORCED;

 ALTER TABLE TestTable9
 ADD CONSTRAINT TestTable9_COL1_T1
    FOREIGN KEY (COL1) REFERENCES T1 ;
```

## QUERY OPTIMIZATION

### Description

> Specifies whether the constraint or functional dependency can be used for query optimization under appropriate circumstances.

See the [DB2 WITHOUT OVERLAPS documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_without_overlaps) for this syntax.

> **Warning:**
>
> ENABLE QUERY OPTIMIZATION Constraint attributes are removed because they are not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable11
(
COL1 VARCHAR(10),
COL2 VARCHAR(10),
CONSTRAINT ConstraintName UNIQUE (COL1, COL2) ENABLE QUERY OPTIMIZATION
);
```

#### Snowflake

```sql
CREATE TABLE TestTable11
(
COL1 VARCHAR(10),
    COL2 VARCHAR(10),
    CONSTRAINT ConstraintName UNIQUE (COL1, COL2)
    )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

## WITHOUT OVERLAPS

### Description

> BUSINESS_TIME WITHOUT OVERLAPS means that for the other specified keys, the values are unique with respect to time for the BUSINESS_TIME period

See the [DB2 column options documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_frag-column-options) for this syntax.

> **Warning:**
>
> BUSINESS_TIME WITHOUT OVERLAPS Constraint attribute is removed because they are not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable12
(
COL1 VARCHAR(10),
CONSTRAINT ConstraintName UNIQUE (COL1, COL2, BUSINESS_TIME WITHOUT OVERLAPS)
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable12
(
COL1 VARCHAR(10),
    CONSTRAINT ConstraintName UNIQUE (COL1, COL2)
    )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

## Column Options

## COMPRESS

### Description

> Specifies that system default values are to be stored using minimal space.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_compress_system_default) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> COMPRESS SYSTEM DEFAULT is removed because it is not applicable in Snowflake

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable13
(
COL1 VARCHAR(10) COMPRESS SYSTEM DEFAULT
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable13
(
COL1 VARCHAR(10)
)
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Known issues

There are no known issues.

## HIDDEN

### Description

> Specifies whether the column is to be defined as hidden. The hidden attribute determines whether the column is included in an implicit reference to the table, or whether it can be explicitly referenced in SQL statements.

See the [DB2 NOT HIDDEN documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_not_hidden) for this syntax.

> **Warning:**
>
> HIDDEN Option is removed because it is not applicable in Snowflake

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable14
(
COL1 VARCHAR(10) IMPLICITLY HIDDEN
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable14
(
COL1 VARCHAR(10)
)
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

## INLINE LENGTH

### Description

> Identifies the Inline Length of the reference type column.

See the [DB2 INLINE LENGTH documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_inline_length) for this syntax

> **Warning:**
>
> INLINE LENGTH is removed because it is not applicable in Snowflake.

### Grammar Syntax

```none
CREATE TABLE T1
(
COL1 VARCHAR(10) INLINE LENGTH 1024
);
```

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable15
(
COL1 VARCHAR(10) INLINE LENGTH 1024
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable15
(
COL1 VARCHAR(10)
)
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Known issues

There are no known issues.

## LOB OPTIONS

### Description

> Options for the LOB (Large Object Binary) data types

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_frag-lob-options) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> LOB OPTIONS are removed because they are not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable16
(
COL1 VARCHAR(10) LOGGED,
COL2 VARCHAR(10) NOT LOGGED,
COL3 VARCHAR(10) COMPACT,
COL4 VARCHAR(10) NOT COMPACT
)
```

#### Snowflake

```sql
 CREATE TABLE TestTable16
(
COL1 VARCHAR(10),
    COL2 VARCHAR(10),
    COL3 VARCHAR(10),
    COL4 VARCHAR(10)
    )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

## SCOPE

### Description

> Identifies the scope of the reference type column.

For this syntax, see the [IBM CREATE TABLE statement documentation](https://www.ibm.com/docs/en/db2/12.1.x?topic=statements-create-table).

> **Warning:**
>
> SCOPE options are removed because they are not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable17
(
COL1 VARCHAR(10) SCOPE TABLE2,
COL2 VARCHAR(10) SCOPE VIEW1
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable17
(
COL1 VARCHAR(10),
    COL2 VARCHAR(10)
    )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

## SECURED

### Description

> Identifies a security label that exists for the security policy that is associated with the table.

See the [DB2 security label documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_security-label-name) for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable18
(
COL1 VARCHAR(10) COLUMN SECURED WITH securityLabel,
COL2 VARCHAR(10) COLUMN SECURED WITH securityLabel
);
```

#### Snowflake

```sql
 CREATE TABLE TestTable18
(
COL1 VARCHAR(10),
    COL2 VARCHAR(10)
    )
 WITH ROW ACCESS POLICY securityLabel ON (
    COL1,
    COL2
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Known issues

If multiple security labels are declared an [SSC-EWI-DB0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md)will appear in the Snowflake output code as shown below

#### IBM DB2

```sql
CREATE TABLE TestTable19
(
COL1 VARCHAR(10) COLUMN SECURED WITH securityLabel1,
COL2 VARCHAR(10) COLUMN SECURED WITH securityLabel2
);
```

#### Snowflake

```sql
CREATE TABLE TestTable19
(
COL1 VARCHAR(10),
    COL2 VARCHAR(10)
    )
 WITH ROW ACCESS POLICY securityLabel1 ON (
    COL1
 )
 !!!RESOLVE EWI!!! /*** SSC-EWI-DB0001 - WITH ROW ACCESS POLICY CLAUSE DOES NOT SUPPORT MULTIPLE DECLARATION IN SNOWFLAKE ***/!!!
 WITH ROW ACCESS POLICY securityLabel2 ON (
    COL2
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

#### Related EWIs

1. [SSC-EWI-DB0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md)Multiple Row Access policies

## Table Options

## CCSID

### Description

> Specifies the encoding scheme for string data that is stored in the table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_ccsid) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> CCSID is not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable20 (
COL1 INT
) CCSID ASCII;
```

#### Snowflake

```sql
 CREATE TABLE TestTable20 (
COL1 INT
)
-- --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- CCSID ASCII
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Related EWIs

1. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): REMOVED STATEMENT, NOT APPLICABLE IN SNOWFLAKE.

## Compression Options

### Description

> Specifies whether row compression is to be used for the table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_tablespace-name) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> The Compression Options are not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE TABLE TestTable21_01 (
COl1 INT,
COL2 INT
)
COMPRESS YES
;

CREATE TABLE TestTable21_02 (
COl1 INT,
COL2 INT
)
COMPRESS YES ADAPTIVE
;

CREATE TABLE TestTable21_03 (
COl1 INT,
COL2 INT
)
COMPRESS YES STATIC
;

CREATE TABLE TestTable21_04 (
COl1 INT,
COL2 INT
)
COMPRESS NO
;

CREATE TABLE TestTable21_05 (
COl1 INT,
COL2 INT
)
VALUE COMPRESSION
;
```

#### Snowflake

```sql
CREATE TABLE TestTable21_01 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--COMPRESS YES
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;

CREATE TABLE TestTable21_02 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--COMPRESS YES ADAPTIVE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;

CREATE TABLE TestTable21_03 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--COMPRESS YES STATIC
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;

CREATE TABLE TestTable21_04 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--COMPRESS NO
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;

CREATE TABLE TestTable21_05 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--VALUE COMPRESSION
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;
```

### Related EWIs

1. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): REMOVED STATEMENT, NOT APPLICABLE IN SNOWFLAKE.

## Data Capture

### Description

> Indicates whether extra information for inter-database data replication is to be written to the log.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_data_capture) to navigate to the IBM DB2 documentation page for this syntax.

DATA CAPTURE is not supported

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 CREATE TABLE TestTable22
(
	COL1 INT
) DATA CAPTURE CHANGES;
```

#### Snowflake

```sql
 CREATE TABLE TestTable22
(
	COL1 INT
)
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0020 - DATA CAPTURE IS NOT SUPPORTED ***/!!!
 DATA CAPTURE CHANGES
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Related EWIs

1. [SSC-EWI-DB0020](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): NODE NOT SUPPORTED

## REMOVED CLAUSES

### Description

The following clauses are removed in SnowConvert AI since they are not applicable in Snowflake:

* `Distribution` Clause
* `Not Logged Initially` Clause
* `Options` Clause
* `Organize by` Clause
* `Partition by` Clause
* `Security Policy` Clause
* `In` Clause
* `Long In` Clause
* `Index In` Clause
* `With Restrict On` Clause

### Sample Source Patterns

#### IBM DB2

```sql
-- Distribution Clause
 CREATE TABLE TestTable23
(
	COL1 INT
) DISTRIBUTE BY REPLICATION;

-- Not Logged Initially Clause
 CREATE TABLE TestTable24 (
COL1 INT
) NOT LOGGED INITIALLY;

-- Options Clause
 CREATE TABLE TestTable25 (
COL1 INT
) OPTIONS(tableOptionName 'stringConst', tableOptionName2 'stringConst');

-- Organize By Clause
 CREATE TABLE TestTable26
(
	COL1 INT,
	COL2 INT,
	COL3 INT
) ORGANIZE BY ROW;

-- Partition By Clause
 CREATE TABLE TestTable27_01 (
COl1 INT,
COL2 INT
)
PARTITION BY RANGE (COL1 NULLS LAST, COL2 NULLS FIRST)
(PARTITION partitionName STARTING FROM (MINVALUE, MAXVALUE, 3) EXCLUSIVE ENDING AT MAXVALUE EXCLUSIVE IN tablespaceName INDEX IN tablespaceName LONG IN tablespaceName);

-- Partition By Clause
CREATE TABLE TestTable27_02 (
COl1 INT,
COL2 INT
) PARTITION BY (COL1 NULLS LAST) (STARTING MINVALUE INCLUSIVE ENDING 3 EXCLUSIVE IN tablespaceName);

-- Partition By Clause
CREATE TABLE TestTable27_03 (
COL1 INT,
COL2 INT
) PART BY (COL1) (STARTING 1 ENDING 3);

-- Partition By Clause
CREATE TABLE TestTable27_04 (
COL1 INT,
COL2 INT
) PART BY (COL1) (PARTITION 5 STARTING 1 ENDING 3);

-- Partition By Clause
CREATE TABLE TestTable27_05 (
COL1 INT,
COL2 INT
) PARTITION BY (COL1 NULLS LAST)
(STARTING MINVALUE INCLUSIVE ENDING 3 EXCLUSIVE EVERY 3 YEAR);

-- Partition By Clause
CREATE TABLE TestTable27_06 (
COL1 INT,
COL2 INT
)
PARTITION BY (COL1 NULLS LAST)
(STARTING MINVALUE INCLUSIVE VALUES 3 EXCLUSIVE);

-- Partition By Clause
CREATE TABLE TestTable27_07 (
JYEARS INT
)
PARTITION BY RANGE (SKACDY_DAY ASC)
(
PARTITION 1 ENDING AT ('16.10.2019') HASH SPACE 2G,
PARTITION 2 ENDING AT ('17.10.2019')
);

-- Partition By Clause
CREATE TABLE TestTable27_08 (
TRANS_DATE DATE NOT NULL
)
PARTITION BY RANGE ("TRANS_DATE")
(
PART "PART_2019_03_01" STARTING ('2019-03-01') ENDING ('2019-03-01') IN "SLTPAYMFACTD1903",
PART "PART_2021_08_19" STARTING ('2021-08-19') ENDING ('2021-08-19') IN "SLTPAYMFACTD2108",
PARTITION "PART_2021_08_19" STARTING ('2021-08-19') ENDING ('2021-08-19') IN "SLTPAYMFACTD2108"
);

-- Security Policy Clause
 CREATE TABLE TestTable28 (
COL1 INT
) SECURITY POLICY PolicyName;

-- In Clause
 CREATE TABLE TestTable29
(
	COL1 INT
) IN TablescapeName;

-- Long In Clause
 CREATE TABLE TestTable29
(
	COL1 INT
) LONG IN TablespaceName;

-- Index In Clause
 CREATE TABLE TestTable30
(
	COL1 INT
) INDEX IN TablespaceName;

-- With Restrict On Drop Clause
 CREATE TABLE TestTable31 (
COL1 INT
) WITH RESTRICT ON DROP;
```

#### Snowflake

```sql
 -- Distribution Clause
 CREATE TABLE TestTable23
 (
	COL1 INT
)
-- --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- DISTRIBUTE BY REPLICATION
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Not Logged Initially Clause
 CREATE TABLE TestTable24 (
COL1 INT
)
-- --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- NOT LOGGED INITIALLY
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Options Clause
 CREATE TABLE TestTable25 (
COL1 INT
)
-- --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- OPTIONS(tableOptionName 'stringConst', tableOptionName2 'stringConst')
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Organize By Clause
 CREATE TABLE TestTable26
 (
	COL1 INT,
COL2 INT,
COL3 INT
)
-- --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- ORGANIZE BY ROW
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
 CREATE TABLE TestTable27_01 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY RANGE (COL1 NULLS LAST, COL2 NULLS FIRST)
--(PARTITION partitionName STARTING FROM (MINVALUE, MAXVALUE, 3) EXCLUSIVE ENDING AT MAXVALUE EXCLUSIVE IN tablespaceName INDEX IN tablespaceName LONG IN tablespaceName)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_02 (
COl1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY (COL1 NULLS LAST) (STARTING MINVALUE INCLUSIVE ENDING 3 EXCLUSIVE IN tablespaceName)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_03 (
COL1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PART BY (COL1) (STARTING 1 ENDING 3)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_04 (
COL1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PART BY (COL1) (PARTITION 5 STARTING 1 ENDING 3)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_05 (
COL1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY (COL1 NULLS LAST)
--(STARTING MINVALUE INCLUSIVE ENDING 3 EXCLUSIVE EVERY 3 YEAR)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_06 (
COL1 INT,
COL2 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY (COL1 NULLS LAST)
--(STARTING MINVALUE INCLUSIVE VALUES 3 EXCLUSIVE)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_07 (
JYEARS INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY RANGE (SKACDY_DAY ASC)
--(
--PARTITION 1 ENDING AT ('16.10.2019') HASH SPACE 2G,
--PARTITION 2 ENDING AT ('17.10.2019')
--)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Partition By Clause
CREATE TABLE TestTable27_08 (
TRANS_DATE DATE NOT NULL
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--PARTITION BY RANGE ("TRANS_DATE")
--(
--PART "PART_2019_03_01" STARTING ('2019-03-01') ENDING ('2019-03-01') IN "SLTPAYMFACTD1903",
--PART "PART_2021_08_19" STARTING ('2021-08-19') ENDING ('2021-08-19') IN "SLTPAYMFACTD2108",
--PARTITION "PART_2021_08_19" STARTING ('2021-08-19') ENDING ('2021-08-19') IN "SLTPAYMFACTD2108"
--)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Security Policy Clause
 CREATE TABLE TestTable28 (
COL1 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--SECURITY POLICY PolicyName
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- In Clause
 CREATE TABLE TestTable29
(
	COL1 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--IN TablescapeName
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Long In Clause
--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR TestTable29. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
 CREATE TABLE TestTable29
(
	COL1 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--LONG IN TablespaceName
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- Index In Clause
 CREATE TABLE TestTable30
(
	COL1 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--INDEX IN TablespaceName
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';

-- With Restrict On Drop Clause
 CREATE TABLE TestTable31 (
COL1 INT
)
----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
--WITH RESTRICT ON DROP
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}';
```

### Related EWIs

1. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): REMOVED STATEMENT, NOT APPLICABLE IN SNOWFLAKE.

---
title: SnowConvert AI - IBM DB2 - CREATE TYPE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-create-type.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CREATE TYPE

This page describes how SnowConvert translates Db2 **distinct types** (`CREATE DISTINCT TYPE ... AS type`) and structured `CREATE TYPE ... AS (...)` definitions. Distinct types map to Snowflake `CREATE TYPE name AS <base_type>`; attribute lists map to `OBJECT(...)`.

## Distinct types

`CREATE DISTINCT TYPE` becomes `CREATE TYPE`. The `WITH COMPARISONS` clause is not carried forward; base types use the same data-type normalization as the rest of the Db2 migration.

**Source (Db2):**

```sql
CREATE DISTINCT TYPE CURRENCY AS DECIMAL(15,2) WITH COMPARISONS;
```

**Snowflake equivalent:**

```sql
CREATE TYPE CURRENCY AS DECIMAL(15, 2);
```

**Source (Db2):**

```sql
CREATE DISTINCT TYPE EMAIL_ADDR AS VARCHAR(255);
```

**Snowflake equivalent:**

```sql
CREATE TYPE EMAIL_ADDR AS VARCHAR(255);
```

**Source (Db2):**

```sql
CREATE DISTINCT TYPE myschema.PHONE_NUM AS VARCHAR(20) WITH COMPARISONS;
```

**Snowflake equivalent:**

```sql
CREATE TYPE myschema.PHONE_NUM AS VARCHAR(20);
```

## Structured types (attribute list)

Composite-style definitions with `CREATE TYPE name AS (col type, ...)` map to Snowflake `OBJECT(...)`.

**Source (Db2):**

```sql
CREATE TYPE address_t AS (street VARCHAR(100), city VARCHAR(50), state CHAR(2));
```

**Snowflake equivalent:**

```sql
CREATE TYPE address_t AS OBJECT (street VARCHAR(100), city VARCHAR(50), state CHAR(2));
```

**Source (Db2):**

```sql
CREATE TYPE person_t AS (first_name VARCHAR(50), last_name VARCHAR(50), age INTEGER);
```

**Snowflake equivalent:**

```sql
CREATE TYPE person_t AS OBJECT (first_name VARCHAR(50), last_name VARCHAR(50), age INTEGER);
```

**Notes:** Unsupported or highly Db2-specific type features may still emit EWIs/FDMs. For structured types, IBM also documents [`CREATE TYPE` (structured)](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-type-structured).

---
title: SnowConvert AI - IBM DB2 - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-create-view.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - CREATE VIEW

## Description

> The CREATE VIEW statement defines a view on one or more tables, views or nicknames.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-view) to navigate to the IBM DB2 documentation page for this syntax.

## Grammar Syntax

Navigate to the following pages to get more details about the translation spec for the subsections of the CREATE VIEW grammar.

## Examples of Supported Create Views

In order to test a CREATE VIEW, we need a Table with some values. Let’s look at the following code for a table with some inserts.

```sql
 CREATE TABLE PUBLIC.TestTable
(
	ID INT,
	NAME VARCHAR(10)
);

Insert into TestTable Values(1,'MARCO');
Insert into TestTable Values(2,'ESTEBAN');
Insert into TestTable Values(3,'JEFF');
Insert into TestTable Values(4,'OLIVER');
```

Now that we have a Table with some data, we can do a couple of examples about a Create View.

### IBM DB2

```sql
CREATE VIEW ViewTest1 AS
SELECT *
FROM TestTable
WHERE ID > 2;
```

### Snowflake

```sql
CREATE VIEW ViewTest1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/03/2025",  "domain": "no-domain-provided" }}'
AS SELECT *  FROM
 TestTable
WHERE ID > 2;
```

## OF type-name

### Description

> Specifies that the columns of the view are based on the attributes of the structured type identified by type-name.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-view#sdx-synid_type-name) to navigate to the IBM DB2 documentation page for this syntax.

CREATE VIEW OF type-name is not supported in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE VIEW ViewTest2
OF Rootview MODE DB2SQL(REF IS oidColumn USER GENERATED)
AS SELECT * FROM TestTable;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0015 - CREATE VIEW OF TYPE IS NOT SUPPORTED ***/!!!
 CREATE VIEW ViewTest2
OF Rootview MODE DB2SQL(REF IS oidColumn USER GENERATED)
AS SELECT * FROM TestTable;
```

### Related EWIs

1. [SSC-EWI-DB0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): CREATE VIEW OF TYPE IS NOT SUPPORTED

## WITH CHECK OPTION

### Description

> Specifies the constraint that every row that is inserted or updated through the view must conform to the definition of the view. A row that does not conform to the definition of the view is a row that does not satisfy the search conditions of the view.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-view#sdx-synid_cascaded) to navigate to the IBM DB2 documentation page for this syntax.

WITH CHECK OPTION is not supported in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE VIEW ViewTest3 AS
Select * from TestTable
WITH CASCADED CHECK OPTION;
```

##### Snowflake

```sql
CREATE VIEW ViewTest3
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/03/2025",  "domain": "no-domain-provided" }}'
AS
Select * from
 TestTable;
```

## WITH ROW MOVEMENT

### Description

> Specifies the action to take for an updatable UNION ALL view when a row is updated in a way that violates a check constraint on the underlying table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-view#sdx-synid_with_no_row_movement) to navigate to the IBM DB2 documentation page for this syntax.

WITH ROW MOVEMENT is not supported in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
CREATE VIEW ViewTest4
AS Select *
from TestTableId1
WITH ROW MOVEMENT;
```

##### Snowflake

```sql
CREATE VIEW ViewTest4
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/03/2025",  "domain": "no-domain-provided" }}'
AS Select *
from
 TestTableId1
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0005 - MANIPULATION OF DATA IN VIEWS IS NOT SUPPORTED. ***/!!!
WITH ROW MOVEMENT;
```

### Related EWIs

1. [SSC-EWI-DB0005](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): MANIPULATION OF DATA IN VIEWS IS NOT SUPPORTED

---
title: SnowConvert AI - IBM DB2 - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-data-types.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - Data Types

## Description

> Specifies the data type of the column

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-table#sdx-synid_built-in-type) to navigate to the IBM DB2 documentation page for this syntax.

## Transformations

The following table shows the transformation from Db2 to Snowflake.

| Db2 | Snowflake | EWI |
| --- | --- | --- |
| SMALLINT | SMALLINT |  |
| INTEGER | INTEGER |  |
| INT | INT |  |
| BIGINT | BIGINT |  |
| DECIMAL | DECIMAL |  |
| DEC | DEC |  |
| NUMERIC | NUMERIC |  |
| NUM | NUMERIC |  |
| FLOAT | FLOAT |  |
| REAL | REAL |  |
| DOUBLE | DOUBLE |  |
| DECFLOAT | DECFLOAT |  |
| CHARACTER | CHARACTER |  |
| CHAR | CHAR |  |
| VARCHAR | VARCHAR |  |
| CHARACTER VARYING | CHARACTER VARYING |  |
| CHAR VARYING | CHAR VARYING |  |
| CLOB | VARCHAR |  |
| CHARACTER LARGE OBJECT | VARCHAR |  |
| CHAR LARGE OBJECT | VARCHAR |  |
| CLOB | VARCHAR |  |
| CHARACTER LARGE OBJECT | VARCHAR |  |
| CHAR LARGE OBJECT | VARCHAR |  |
| GRAPHIC | BINARY |  |
| VARGRAPHIC | BINARY |  |
| DBCLOB | VARCHAR |  |
| NCHAR | NCHAR |  |
| NATIONAL CHAR | NCHAR |  |
| NATIONAL CHARACTER | NCHAR |  |
| NVARCHAR | NVARCHAR |  |
| NCHAR VARYING | NCHAR VARYING |  |
| NATIONAL CHAR VARYING | NCHAR VARYING |  |
| NATIONAL CHARACTER VARYING | NCHAR VARYING |  |
| NCLOB | VARCHAR |  |
| NCHAR LARGE OBJECT | VARCHAR |  |
| NATIONAL CHARACTER LARGE OBJECT | VARCHAR |  |
| BINARY | BINARY |  |
| VARBINARY | VARBINARY |  |
| BINARY VARYING | BINARY VARYING |  |
| BLOB | BINARY |  |
| BINARY LARGE OBJECT | BINARY |  |
| DATE | DATE |  |
| TIME | TIME |  |
| TIMESTAMP | TIMESTAMP |  |
| XML | VARIANT | [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI) |
| BOOLEAN | BOOLEAN |  |

## Sample Source Patterns

### IBM DB2

```sql
 CREATE TABLE T1
(
	COL1 SMALLINT,
	COL2 INTEGER,
	COL3 INT,
	COL4 BIGINT,
	COL55 DECIMAL,
	COl5 DECIMAL(5,0),
	COL66 DEC,
	COL6 DEC(5,0),
	COL77 NUMERIC,
	COL7 NUMERIC(5,0),
	COL88 NUM,
	COL8 NUM(5,0),
	COL9 FLOAT,
	COL10 FLOAT(53),
	COL11 REAL,
	COL12 DOUBLE,
	COL13 DOUBLE PRECISION,
	COL14 DECFLOAT(34),
	COL144 DECFLOAT,
	COL153 CHARACTER(8 OCTETS) FOR BIT DATA,
	COL163 CHAR(8 OCTETS) FOR BIT DATA,
	COL164 CHAR(8 OCTETS) CCSID ASCII,
	COL171 VARCHAR(8 OCTETS),
	COL172 VARCHAR(8) FOR BIT DATA,
	COL18 CHARACTER VARYING(8),
	COL180 CHARACTER VARYING(8) FOR BIT DATA,
	COL19 CHAR VARYING(8),
	COL199 CHAR VARYING(8) FOR BIT DATA,
	COL20 CLOB(1M),
	COL21 CHARACTER LARGE OBJECT(8K OCTETS),
	COL22 CHAR LARGE OBJECT,
	COL23 GRAPHIC(1),
	COL233 GRAPHIC(1 CODEUNITS16),
	COL234 GRAPHIC(1 CODEUNITS32),
	COL24 VARGRAPHIC(8 CODEUNITS16),
	COL25 DBCLOB(1M),
	COL255 DBCLOB(1K),
	COL26 NCHAR(1),
	COL27 NATIONAL CHAR(2),
	COL28 NATIONAL CHARACTER(3),
	COL29 NVARCHAR(8),
	COL30 NCHAR VARYING(8),
	COL31 NATIONAL CHAR VARYING(8),
	COL32 NATIONAL CHARACTER VARYING(8),
	COL333 NCLOB(1M),
	COL334 NCHAR LARGE OBJECT(5),
	COL335 NATIONAL CHARACTER LARGE OBJECT(1M),
	COL33 BINARY,
	COL34 VARBINARY(14),
	COL35 BINARY VARYING(10),
	COL36 BLOB(1M),
	COL37 BINARY LARGE OBJECT(1M),
	COL38 DATE,
	COL39 TIME,
	COL40 TIMESTAMP,
	COL41 XML,
	COL42 BOOLEAN
);
```

#### Snowflake

```sql
 CREATE TABLE T1
 (

	COL88 NUMERIC,
	COL8 NUMERIC(5,0),
	COL9 FLOAT,
	COL10 FLOAT(53),
	COL11 REAL,
	COL12 DOUBLE,
	COL13 DOUBLE PRECISION,
	COL14 DECFLOAT,
	COL144 DECFLOAT,
	COL153 BINARY,
	COL163 BINARY,
	COL164 CHAR(8),
	COL171 VARCHAR(8),
	COL172 BINARY,
	COL18 CHARACTER VARYING(8),
	COL180 BINARY,
	COL19 CHAR VARYING(8),
	COL199 BINARY,
	COL20 VARCHAR,
	COL21 VARCHAR,
	COL22 VARCHAR,
	COL23 BINARY,
	COL233 BINARY,
	COL234 BINARY,
	COL24 BINARY,
	COL25 VARCHAR,
	COL255 VARCHAR,
	COL26 NCHAR(1),
	COL27 NCHAR(2),
	COL28 NCHAR(3),
	COL29 NVARCHAR(8),
	COL30 NCHAR VARYING(8),
	COL31 NCHAR VARYING(8),
	COL32 NCHAR VARYING(8),
	COL333 VARCHAR,
	COL334 VARCHAR,
	COL335 VARCHAR,
	COL33 BINARY,
	COL34 VARBINARY(14),
	COL35 BINARY VARYING(10),
	COL36 BINARY,
	COL37 BINARY,
	COL38 DATE,
	COL39 TIME,
	COL40 TIMESTAMP,
	COL41 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XMLTYPE DATA TYPE CONVERTED TO VARIANT ***/!!!,
	COL42 BOOLEAN
)
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "08/29/2025",  "domain": "no-domain-provided" }}';
```

## DECFLOAT Data Type

### Description

The `DECFLOAT` data type in IBM DB2 is a decimal floating-point data type that can store decimal numbers with high precision. DB2 supports `DECFLOAT(16)` and `DECFLOAT(34)` precisions.

SnowConvert AI transforms DB2 `DECFLOAT` columns to Snowflake’s native `DECFLOAT` data type in table column definitions and `CAST` expressions.

### Supported Contexts

`DECFLOAT` is supported in the following contexts:

* **Table column definitions**: `DECFLOAT` columns in `CREATE TABLE` statements are transformed to Snowflake `DECFLOAT`
* **CAST expressions**: `CAST(value AS DECFLOAT)` is preserved in Snowflake

### Unsupported Contexts

`DECFLOAT` is **not** supported in the following contexts and will be transformed to `NUMBER(38, 37)` with an FDM warning:

* Procedure parameters
* Function parameters
* Local variable declarations

### INSERT Statement Handling

When inserting data into `DECFLOAT` columns, SnowConvert AI automatically adds `CAST` expressions to ensure proper data type handling:

#### INSERT with VALUES

Numeric literals in `INSERT ... VALUES` statements targeting `DECFLOAT` columns are wrapped with `CAST(... AS DECFLOAT)`:

##### DB2

```sql
CREATE TABLE prices (
    product_id INT,
    price DECFLOAT(34)
);

INSERT INTO prices VALUES (1, 99.99);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE prices (
    product_id INT,
    price DECFLOAT
);

INSERT INTO prices VALUES (1, CAST(99.99 AS DECFLOAT));
```

#### INSERT with SELECT

Column references in `INSERT ... SELECT` statements are also cast when the target column is `DECFLOAT`:

##### DB2

```sql
CREATE TABLE prices (
    product_id INT,
    price DECFLOAT(34)
);

INSERT INTO prices (product_id, price)
SELECT id, amount FROM source_table;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE prices (
    product_id INT,
    price DECFLOAT
);

INSERT INTO prices (product_id, price)
SELECT id, CAST(amount AS DECFLOAT) FROM source_table;
```

### Related EWIs

1. [SSC-FDM-DB0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/db2FDM.md): DECFLOAT is not supported in this context.

## Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

---
title: SnowConvert AI - IBM DB2 - EXIT HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-exit-handler.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - EXIT HANDLER

## Description

> An EXIT handler terminates the current compound statement when the specified condition occurs. When a condition occurs and an exit handler is invoked, control is passed to the handler. When the handler completes, control returns to the caller of the compound statement.

In IBM DB2, the `DECLARE EXIT HANDLER` statement is used to define actions that should be taken when specific SQL conditions or errors occur during procedure execution. Unlike CONTINUE handlers, EXIT handlers terminate the execution of the current block and return control to the caller.

When migrating from DB2 to Snowflake, SnowConvert AI transforms EXIT HANDLER declarations into equivalent Snowflake Scripting exception handling using EXCEPTION blocks with `WHEN OTHER EXIT THEN` or specific exception types.

For more information about DB2 condition handlers, see [IBM DB2 DECLARE HANDLER](https://www.ibm.com/docs/en/db2/11.5?topic=statements-declare-handler).

## Grammar Syntax

```sql
DECLARE EXIT HANDLER FOR condition_value [, ...]
  handler_action_statement;

-- Where condition_value can be:
-- SQLSTATE [VALUE] sqlstate_value
-- condition_name
-- SQLWARNING
-- SQLEXCEPTION
-- NOT FOUND
```

## Sample Source Patterns

### DECLARE EXIT HANDLER FOR SQLEXCEPTION

The most common use case is handling SQL exceptions and exiting the current block.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE error_exit_handler()
LANGUAGE SQL
BEGIN
    DECLARE EXIT HANDLER FOR SQLEXCEPTION
    BEGIN
        INSERT INTO error_log VALUES (CURRENT_TIMESTAMP, 'Error occurred, exiting');
    END;

    -- These statements may cause errors
    INSERT INTO table1 VALUES (1/0);
    UPDATE table2 SET status = 'completed' WHERE id = -1;

    -- This will NOT execute if an error occurred above
    INSERT INTO success_log VALUES ('All operations completed');
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE error_exit_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        -- These statements may cause errors
        INSERT INTO table1 VALUES (1/0);
        UPDATE table2 SET status = 'completed' WHERE id = -1;

        -- This will NOT execute if an error occurred above
        INSERT INTO success_log VALUES ('All operations completed');

        EXCEPTION
            WHEN OTHER THEN
                BEGIN
                    INSERT INTO error_log VALUES (CURRENT_TIMESTAMP(), 'Error occurred, exiting');
                END;
    END;
$$;
```

### DECLARE EXIT HANDLER FOR SQLSTATE

Handling specific SQLSTATE codes with exit behavior.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE sqlstate_exit_handler()
LANGUAGE SQL
BEGIN
    DECLARE EXIT HANDLER FOR SQLSTATE '23505'
    BEGIN
        INSERT INTO error_log VALUES ('Duplicate key error, exiting procedure');
        ROLLBACK;
    END;

    -- Attempt to insert records
    INSERT INTO users VALUES (1, 'John');
    INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key - will trigger handler
    INSERT INTO users VALUES (2, 'Bob');   -- Will NOT execute
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE sqlstate_exit_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        -- Attempt to insert records
        INSERT INTO users VALUES (1, 'John');
        INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key - will trigger handler
        INSERT INTO users VALUES (2, 'Bob');   -- Will NOT execute

        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '23505') THEN
                        BEGIN
                            INSERT INTO error_log VALUES ('Duplicate key error, exiting procedure');
                            ROLLBACK;
                        END;
                END;
    END;
$$;
```

### DECLARE EXIT HANDLER FOR NOT FOUND

The NOT FOUND condition is commonly used with cursors and SELECT INTO statements.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE cursor_exit_handler()
LANGUAGE SQL
BEGIN
    DECLARE v_id INT;
    DECLARE v_name VARCHAR(100);

    DECLARE EXIT HANDLER FOR NOT FOUND
        INSERT INTO log_table VALUES ('No data found, exiting');

    -- This will trigger the handler if no rows found
    SELECT id, name INTO v_id, v_name
    FROM employees
    WHERE department = 'NonExistent';

    -- This will NOT execute if no rows were found
    INSERT INTO results VALUES (v_id, v_name);
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE cursor_exit_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        v_id INT;
        v_name VARCHAR(100);
    BEGIN
        -- This will trigger the handler if no rows found
        SELECT id, name INTO v_id, v_name
        FROM employees
        WHERE department = 'NonExistent';

        -- This will NOT execute if no rows were found
        INSERT INTO results VALUES (v_id, v_name);

        EXCEPTION
            WHEN NO_DATA_FOUND THEN
                INSERT INTO log_table VALUES ('No data found, exiting');
    END;
$$;
```

### Multiple EXIT Handlers

DB2 allows multiple EXIT HANDLERs with different priorities. In Snowflake, handler precedence must be managed through explicit conditional logic using CASE statements.

#### Input Code:

##### IBM DB2

```sql
CREATE PROCEDURE multiple_exit_handlers()
BEGIN
    DECLARE EXIT HANDLER FOR SQLSTATE '23505'
        INSERT INTO log VALUES ('Duplicate key error');

    DECLARE EXIT HANDLER FOR SQLEXCEPTION
        INSERT INTO log VALUES ('General SQL exception');

    INSERT INTO table1 VALUES (1, 'test');
    INSERT INTO success_log VALUES ('Completed');
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE multiple_exit_handlers()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        INSERT INTO table1 VALUES (1, 'test');
        INSERT INTO success_log VALUES ('Completed');

        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '23505') THEN
                        INSERT INTO log VALUES ('Duplicate key error')
                    ELSE
                        INSERT INTO log VALUES ('General SQL exception')
                END;
    END;
$$;
```

## Known Issues

### EXIT HANDLER Behavior

Applies to

* IBM DB2

#### Description

EXIT HANDLER in DB2 terminates the current compound statement and returns control to the caller. In Snowflake, this is achieved using the EXCEPTION block, which automatically exits the current BEGIN…END block when an exception occurs.

The main behavioral differences are:

1. **Execution Termination**: Both DB2 and Snowflake exit the current block when an EXIT handler is triggered.
2. **Statement-level Control**: In DB2, the EXIT handler activates at the statement that causes the error. In Snowflake, the entire remaining block is skipped.
3. **Nested Blocks**: Exit behavior in nested blocks is consistent between DB2 and Snowflake.

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING

### Mixed CONTINUE and EXIT Handlers

Applies to

* IBM DB2

#### Description

DB2 allows declaring both CONTINUE and EXIT handlers in the same procedure block. However, Snowflake Scripting does not support mixing CONTINUE and EXIT handlers in the same EXCEPTION block. When this pattern is encountered, SnowConvert AI generates separate EXCEPTION blocks with an EWI warning.

See the [CONTINUE HANDLER documentation](db2-continue-handler.md) for detailed examples of this limitation.

#### Input Code:

##### IBM DB2

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
BEGIN
    DECLARE test_1 INTEGER DEFAULT 10;
    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO error_test VALUES ('EXCEPTION');
    DECLARE EXIT HANDLER FOR SQLSTATE '20000'
        INSERT INTO error_test VALUES ('ERROR 2000');

    SET test_1 = 1 / 0;
    INSERT INTO error_test VALUES ('COMPLETED');
END;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE with_continueAndExit()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        test_1 INTEGER DEFAULT 10;
    BEGIN
        test_1 := 1 / 0;
        INSERT INTO error_test VALUES ('COMPLETED');
        EXCEPTION
            WHEN OTHER CONTINUE THEN
                INSERT INTO error_test VALUES ('EXCEPTION')
        !!!RESOLVE EWI!!! /*** SSC-EWI-0114 - MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '20000') THEN
                        INSERT INTO error_test VALUES ('ERROR 2000')
                END
    END;
$$;
```

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING

### SQLSTATE Mapping

Not all DB2 SQLSTATE codes have direct equivalents in Snowflake. SnowConvert AI performs best-effort mapping:

| DB2 SQLSTATE | Condition | Snowflake Equivalent |
| --- | --- | --- |
| 02000 | NOT FOUND | NO_DATA_FOUND |
| 23xxx | Integrity Constraint Violation | STATEMENT_ERROR |
| 42xxx | Syntax Error | STATEMENT_ERROR |
| 01xxx | Warning | OTHER |

## Best Practices

When working with converted EXIT HANDLER code:

1. **Understand Exit Behavior**: EXIT handlers terminate the current block. Ensure your application logic accounts for this behavior.
2. **Test Error Scenarios**: Thoroughly test all error conditions to verify that the EXIT handler behaves as expected.
3. **Use Transactions**: Leverage Snowflake’s transaction support to ensure data consistency when errors cause early exits.
4. **Logging**: Implement comprehensive logging in exception handlers to track when and why procedures exit early.
5. **Nested Blocks**: When using nested blocks, understand that EXIT handlers only exit the current block, not the entire procedure.
6. **Return Values**: Consider setting return values or output parameters in exception handlers to indicate the reason for exit.

## Related Documentation

* [IBM DB2 DECLARE HANDLER](https://www.ibm.com/docs/en/db2/11.5?topic=statements-declare-handler)
* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [Snowflake Stored Procedures](https://docs.snowflake.com/en/sql-reference/stored-procedures-overview)

## See Also

* [CONTINUE HANDLER](db2-continue-handler.md)
* [DB2 CREATE PROCEDURE](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-procedure-sql)
* [DB2 SELECT Statement](db2-select-statement.md)
* [DB2 Data Types](db2-data-types.md)

---
title: SnowConvert AI - IBM DB2 - From Clause
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-from-clause.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - From Clause

## Description

> The FROM clause specifies an intermediate result table

See the [DB2 FROM clause documentation](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-from-clause) for this syntax.

## Grammar Syntax

## Table Reference

### Description

> A *table-reference* specifies an intermediate result table.

See the [DB2 table reference documentation](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference) for this syntax.

### Grammar Syntax

Navigate to the following pages to get more details about the translation spec for the subsections of the Table Reference grammar.

## Analyze Table Expression

### Description

> Returns the result of executing a specific data mining model by using an in-database analytics provider, a named model implementation, and input data.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_analyze_table-expression) to navigate to the IBM DB2 documentation page for this syntax.

Analyze Table Expressions are not supported in Snowflake. The output query can be malformed

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 SELECT
   *
FROM v1 ANALYZE_TABLE(
   IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;')
ORDER BY 1;
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0019 - ANALYZE TABLE FACTOR IS NOT SUPPORTED ***/!!!
 v1 ANALYZE_TABLE(
   IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;')
ORDER BY 1;
```

### Related EWIs

1. [SSC-EWI-DB0019](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): ANALYZE TABLE FACTOR IS NOT SUPPORTED

## Collection Derived Table

### Description

> A collection-derived-table can be used to convert the elements of an array into values of a column in separate rows. If WITH ORDINALITY is specified, an extra column of data type INTEGER is appended. This column contains the position of the element in the array.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_frag-collection-derived-table) to navigate to the IBM DB2 documentation page for this syntax.

Collection Derived Tables are not supported in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
SELECT
   *
FROM
   UNNEST(testArray) WITH ORDINALITY;
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0016 - UNNEST FUNCTION IS NOT SUPPORTED ***/!!!
   UNNEST(test) WITH ORDINALITY;
```

### Related EWIs

1. [SSC-EWI-DB0016](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): UNNEST FUNCTION IS NOT SUPPORTED

## Data Change Table Reference

### Description

> A *data-change-table-reference* clause specifies an intermediate result table. This table is based on the rows that are directly changed by the searched UPDATE, searched DELETE, or INSERT statement that is included in the clause.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_data-change-table-reference) to navigate to the IBM DB2 documentation page for this syntax.

Data Change Table Reference is not supported in Snowflake. The output query can be malformed.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 SELECT
   *
FROM
   OLD Table(UPDATE T1 SET NAME = 'Tony' where ID = 4)
```

#### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0006 - INTERMEDIATE RESULT TABLE IS NOT SUPPORTED. ***/!!!
   OLD Table(UPDATE T1 SET NAME = 'Tony' where ID = 4);
```

### Related EWIs

1. [SSC-EWI-DB0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): INTERMEDIATE RESULT TABLE IS NOT SUPPORTED.

## External Table Reference

### Description

> An external table resides in a text-based, delimited or non-delimited file outside of a database. An external-table-reference specifies the name of the file that contains an external table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_external-table-reference) to navigate to the IBM DB2 documentation page for this syntax.

External Table Reference is not supported in Snowflake. The output query can be malformed.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 SELECT
   *
FROM
   EXTERNAL SOMENAME AS T1 LIKE TABLE2 USING(COMPRESS NO)
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0014 - THE USE OF EXTERNAL TABLE REFERENCES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
   EXTERNAL SOMENAME AS T1 LIKE TABLE2 USING(COMPRESS NO);
```

### Related EWIs

1. [SSC-EWI-DB0014](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): THE USE OF EXTERNAL TABLE REFERENCES IS NOT SUPPORTED IN SNOWFLAKE

## Nested Table Expression

### Description

> A fullselect in parentheses is called a *nested table expression*. The intermediate result table is the result of that fullselect.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_frag-nested-table-expression) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> Nested Table Expression is partially applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### Unsupported cases

##### IBM DB2

```sql
 Select
   AValue
from
   LATERAL RETURN DATA UNTIL FEDERATED SQLSTATE VALUE 'stringConstant' WITHIN(
      Select
         AValue
      from
         ATable
   );
```

##### Snowflake

```sql
Select
   AValue
from
   LATERAL
--           --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. CONTINUE HANDLER **
--           RETURN DATA UNTIL FEDERATED SQLSTATE VALUE 'stringConstant' WITHIN
                                                                             (
      Select
         AValue
      from
         ATable
   );
```

### Related EWIs

1. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.

## ONLY TABLE REFERENCE

### Description

> The use of ONLY(table-name) or ONLY(view-name) means that the rows of the applicable subtables or subviews are not included in the intermediate result table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_only-table-reference) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 Select * from ONLY(ATable) AS CorrelationName;
```

##### Snowflake

```sql
 Select * from
   ATable AS CorrelationName;
```

## OUTER TABLE REFERENCE

### Description

> The use of OUTER(table-name) or OUTER(view-name) represents a virtual table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_outer-table-reference) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> OUTER TABLE REFERENCE is not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 Select * from OUTER(ATable) AS CorrelationName;
```

##### Snowflake

```sql
 Select * from
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0004 - OUTER TABLE REFERENCE IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! OUTER(ATable) AS CorrelationName;
```

### Related EWIs

1. [SSC-EWI-DB0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): OUTER TABLE REFERENCE IS NOT SUPPORTED IN SNOWFLAKE.

## Period Specification

> A period-specification identifies an intermediate result table consisting of the rows of the referenced table where the period matches the specification. A period-specification can be specified following the name of a temporal table or the name of a view

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_period-specification) to navigate to the IBM DB2 documentation page for this syntax.

Period Specification is currently not supported by Snowflake.

### Grammar Syntax

### Sample Source Patterns

#### IBM DB2

```sql
 SELECT
   *
FROM
   Table1
FOR BUSINESS_TIME AS OF "12-12-12"
```

#### Snowflake

```sql
SELECT
   *
FROM
   Table1
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0003 - PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
FOR BUSINESS_TIME AS OF "12-12-12";
```

### Related EWIs

1. [SSC-EWI-DB0003](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md): PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE.

## Table Function Reference

### Description

> Table functions return columns of a table, resembling a table created through a simple CREATE TABLE statement. A table function can be used only in the FROM clause of a statement.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=clause-table-reference#sdx-synid_table-function-reference) to navigate to the IBM DB2 documentation page for this syntax.

> **Warning:**
>
> Table Function Reference is not applicable in Snowflake.

### Grammar Syntax

### Sample Source Patterns

For the transformation of Table Function Reference, we must comment out the table-UDF-cardinality-clause. This clause is used for performance reasons, and is not relevant in Snowflake.

#### IBM DB2

```sql
 SELECT * FROM TABLE(TUDF1(3) CARDINALITY 30) AS X;
```

##### Snowflake

```sql
SELECT * FROM TABLE(TUDF1(3)) AS X;
```

Note that each function along with the type of its arguments specified in the table reference must exist, otherwise it will cause errors.

---
title: SnowConvert AI - IBM DB2 - SELECT STATEMENT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/db2/db2-select-statement.md
section: Migrations
---

# SnowConvert AI - IBM DB2 - SELECT STATEMENT

## Description

> A subdivision of the SELECT statement done in IBM DB2.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=queries-fullselect) to navigate to the IBM DB2 documentation page for this syntax.

## Grammar Syntax

## From Clause

All information about this part of the syntax is specified on the [from-clause page](db2-from-clause.md).

## Where Clause

> The WHERE clause specifies an intermediate result table that consists of those rows of R for which the search-condition is true. R is the result of the FROM clause of the subselect.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-where-clause) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

> **SuccessPlaceholder:**
>
> All the grammar specified in this where clause of DB2 is ANSI compliant, equivalent to Snowflake, and is therefore translated as is by SnowConvert AI.

## Group By Clause

> The GROUP BY clause specifies an intermediate result table that consists of a grouping of the rows of R. R is the result of the previous clause of the subselect.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-group-by-clause) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### No explicit column reference

> The following expressions, which do not contain an explicit column reference, can be used in a grouping-expression to identify a column of R:
>
> * ROW CHANGE TIMESTAMP FOR table-designator
> * ROW CHANGE TOKEN FOR table-designator
> * RID_BIT or RID scalar function

ROW CHANGE Expressions and RID/RID_BIT scalar functions are not supported in Snowflake.

#### Sample Source Patterns

##### IBM DB2

```sql
select * from product group by ROW CHANGE TIMESTAMP FOR product;
```

##### Snowflake

```sql
select * from
 product
--!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - GROUP BY ROW CHANGE TIMESTAMP FOR NOT SUPPORTED IN SNOWFLAKE ***/!!!
--group by ROW CHANGE TIMESTAMP FOR product
                                         ;
```

##### IBM DB2

```sql
    select * from product group by RID();
```

##### Snowflake

```sql
select * from
 product
--!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - GROUP BY scalar function RID NOT SUPPORTED IN SNOWFLAKE ***/!!!
--group by RID()
              ;
```

#### Related EWIs

1. [SSC-EWI-0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)

## Fetch Clause

### Description

> Sets a maximum number of rows to be retrieved.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-fetch-clause) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### Fetch without row count

##### IBM DB2

```sql
 SELECT * FROM Product FETCH First Row ONLY;
/* or */
SELECT * FROM Product FETCH First Rows ONLY;
/* or */
SELECT * FROM Product FETCH Next Row ONLY;
/* or */
SELECT * FROM Product FETCH Next Rows ONLY;
```

###### Snowflake

```sql
SELECT * FROM
   Product
FETCH NEXT 1 ROW ONLY;
```

## Offset Clause

### Description

> Sets the number of rows to skip.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-offset-clause) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

#### Offset row-count

##### IBM DB2

```sql
 SELECT * FROM Product OFFSET 3 ROW;
/* or */
SELECT * FROM Product OFFSET 3 ROWS;
```

##### Snowflake

```sql
SELECT * FROM
   Product
LIMIT NULL
OFFSET 3;
```

#### Limit X,Y

##### IBM DB2

```sql
SELECT * FROM Product LIMIT 3,2;
```

##### Snowflake

```sql
SELECT * FROM
   Product
OFFSET 3 ROWS
FETCH NEXT 2 ROWS ONLY;
```

## Order by Clause

### Description

> The ORDER BY clause specifies an ordering of the rows of the result table.

Click [here](https://www.ibm.com/docs/en/db2/11.5?topic=subselect-order-by-clause) to navigate to the IBM DB2 documentation page for this syntax.

### Grammar Syntax

### Sample Source Patterns

The only paths of ORDER BY in Db2 that are not supported in Snowflake are those when it is used with ORDER OF and INPUT SEQUENCE; hence, if these are present, the clause will be marked with an EWI.

#### IBM DB2 Not Supported Examples

```sql
Select * from ORDERBYTest ORDER BY ORDER OF TableDesignator;
Select * from ORDERBYTest ORDER BY INPUT SEQUENCE;
```

##### Snowflake

```sql
Select * from
   ORDERBYTest
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - ORDER BY ORDER OF NOT SUPPORTED IN SNOWFLAKE ***/!!!
ORDER BY ORDER OF TableDesignator;

Select * from
   ORDERBYTest
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - ORDER BY INPUT SEQUENCE NOT SUPPORTED IN SNOWFLAKE ***/!!!
ORDER BY INPUT SEQUENCE;
```

### Related EWIs

1. [SSC-EWI-0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): NODE NOT SUPPORTED

## Values Clause

### Description

> Derives a result table by specifying the actual values, using expressions or row expressions, for each column of a row in the result table. hin

> **Note:**
>
> The VALUES clause is not supported in Snowflake. For this reason, it is translated to a SELECT statement, as shown in the examples below.

### Grammar Syntax

### Sample Source Patterns

The Values clause is not supported in Snowflake. For this reason, the values clause is translated to a select query.

#### IBM DB2

```sql
VALUES 1, 2, 3
```

|  |
| --- |
| 1 |
| 2 |
| 3 |

##### Snowflake

```sql
SELECT 1, 2, 3
```

|  |  |  |
| --- | --- | --- |
| 1 | 2 | 3 |

For the values with multiple rows, a Union is used:

##### IBM DB2

```sql
VALUES (1, 1, 1),
    (2, 2, 2),
    (3, 3, 3)
```

|  |  |  |
| --- | --- | --- |
| 1 | 1 | 1 |
| 2 | 2 | 2 |
| 3 | 3 | 3 |

##### Snowflake

```sql
SELECT
   1, 1, 1
UNION
SELECT
   2, 2, 2
UNION
SELECT
   3, 3, 3
```

|  |  |  |
| --- | --- | --- |
| 1 | 1 | 1 |
| 2 | 2 | 2 |
| 3 | 3 | 3 |

## Removed Clauses

### Description

The following clauses are removed since they are not applicable in Snowflake:

* FOR READ ONLY
* Update Clause
* Optimize for Clause
* Concurrent access resolution Clause
* Isolation Clause

### Sample Source Patterns

#### IBM DB2

```sql
-- For Read Only
SELECT
   *
FROM
   Table1
FOR READ ONLY;

-- Update Clause
SELECT
   *
FROM
   Table1
FOR UPDATE OF
   COL1,
   COL2;

--Optimize For Clause
SELECT
   *
FROM
   Table1
OPTIMIZE FOR 2 ROWS;

-- Concurrent access resolution Clause
SELECT
   *
FROM
   Table1
WAIT FOR OUTCOME;

-- Isolation Clause
SELECT
   *
FROM
   Table1
WITH RR USE AND KEEP EXCLUSIVE LOCKS;
```

##### Snowflake

```sql
-- For Read Only
SELECT
   *
FROM
   Table1;

-- Update Clause
SELECT
   *
FROM
   Table1;

--Optimize For Clause
SELECT
   *
FROM
   Table1;

-- Concurrent access resolution Clause
SELECT
   *
FROM
   Table1;

-- Isolation Clause
SELECT
   *
FROM
   Table1;
```

---
title: SnowConvert AI - IBM DB2 Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/db2FDM.md
section: Migrations
---

# SnowConvert AI - IBM DB2 Functional Differences

## SSC-FDM-DB0001

FUNCTIONALITY MIGHT BE DIFFERENT DEPENDING ON THE DB2 DATABASE.

### Severity

Low

### Description

This message is shown whenever a SQL element behaves differently depending on the DB2 database version ([DB2 for i](https://www.ibm.com/docs/en/i/7.4?topic=database), [DB2 for z/OS](https://www.ibm.com/docs/en/db2-for-zos/12?topic=db2-sql), or [DB2 for Linux, Unix, and Windows](https://www.ibm.com/docs/en/db2/11.5?topic=database-fundamentals)). SnowConvert AI treats all DB2 versions as one and therefore, the translation for the element might have functionality differences when compared to the original platform.

### Cases

Listed below are all the SQL elements so far identified, that behave differently depending on the DB2 database version.

#### CURRENT MEMBER

DB2 for z/OS: [CURRENT MEMBER](https://www.ibm.com/docs/en/db2-for-zos/11?topic=registers-current-member) specifies the member name of a current Db2 data sharing member on which a statement is executing. The value of CURRENT MEMBER is a character string.

Db2 for LUW: The [CURRENT MEMBER](https://www.ibm.com/docs/en/db2/11.5?topic=registers-current-member) special register specifies an INTEGER value that identifies the coordinator member for the statement.

##### Code example

##### Input code:

```sql
 CREATE TABLE T1
(
  COL1 INT,
  COL2 CHAR(8) WITH DEFAULT CURRENT MEMBER
);
```

##### Output code:

```sql
 CREATE TABLE T1
 (
  COL1 INT,
  COL2 CHAR(8) DEFAULT
  --** SSC-FDM-DB0001 - FUNCTIONALITY FOR CURRENT_ROLE MIGHT BE DIFFERENT DEPENDING ON THE DB2 DATABASE. **
  CURRENT_ROLE()
)
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/02/2025",  "domain": "no-domain-provided" }}';
```

### Recommendations

* Review your code and keep in mind that the result transformation can behave differently according to the Db2 version that is being used.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-DB0002

DECFLOAT TYPE CHANGED TO NUMBER BECAUSE IT IS ONLY SUPPORTED IN TABLE COLUMNS AND CAST EXPRESSIONS IN SNOWFLAKE.

### Severity

Low

### Description

This message is shown when a `DECFLOAT` data type is used in a context not supported by Snowflake. In Snowflake, `DECFLOAT` is only permitted in:

* Table column definitions (`CREATE TABLE`)
* `CAST` expressions (`CAST(value AS DECFLOAT)`)

When `DECFLOAT` is used in other contexts such as procedure parameters, function parameters, or local variable declarations, SnowConvert AI transforms it to `NUMBER(38, 37)` and adds this FDM to indicate the functional difference.

### Code Example

#### DB2

```sql
CREATE PROCEDURE TestProc (param1 DECFLOAT)
BEGIN
  DECLARE local_var DECFLOAT;
  SET local_var = param1;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE TestProc (param1 NUMBER(38, 37) --** SSC-FDM-DB0002 - DECFLOAT TYPE CHANGED TO NUMBER BECAUSE IT IS ONLY SUPPORTED IN TABLE COLUMNS AND CAST EXPRESSIONS IN SNOWFLAKE. **
)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
  LET local_var NUMBER(38, 37) --** SSC-FDM-DB0002 - DECFLOAT TYPE CHANGED TO NUMBER BECAUSE IT IS ONLY SUPPORTED IN TABLE COLUMNS AND CAST EXPRESSIONS IN SNOWFLAKE. **
  := NULL;
  local_var := param1;
END;
$$;
```

### Recommendations

* Review the converted code to ensure that using `NUMBER(38, 37)` instead of `DECFLOAT` does not affect your application logic.
* If precise decimal floating-point arithmetic is critical for these parameters or variables, consider refactoring your code to use table columns or `CAST` expressions where `DECFLOAT` is supported.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - IBM DB2 Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/db2EWI.md
section: Migrations
---

# SnowConvert AI - IBM DB2 Issues

## SSC-EWI-DB0001

WITH ROW ACCESS POLICY CLAUSE DOES NOT SUPPORT MULTIPLE DECLARATION

### Severity

Low

### Description

This message is shown whenever SnowConvert AI detects multiple security label column options inside the same `CREATE TABLE` clause, the security label is translated to a row access policy clause and Snowflake does not support multiple row access policy declarations. Therefore, if more than one security labels are found they will be commented out with this EWI.

#### Code example

##### Input code:

```sql
 CREATE TABLE T1
(
COL1 VARCHAR(10) COLUMN SECURED WITH securityLabel1,
COL2 VARCHAR(10) COLUMN SECURED WITH securityLabel2
);
```

##### Output code:

```sql
CREATE TABLE T1
(
COL1 VARCHAR(10),
COL2 VARCHAR(10)
)
WITH ROW ACCESS POLICY securityLabel1 ON (
COL1
)
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0001 - WITH ROW ACCESS POLICY CLAUSE DOES NOT SUPPORT MULTIPLE DECLARATION IN SNOWFLAKE ***/!!!
WITH ROW ACCESS POLICY securityLabel2 ON (
COL2
)
;
```

### Recommendations

* Review your code and ensure that only one security label is inside the `CREATE TABLE` clause
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0003

PERIOD DEFINITION IS NOT SUPPORTED IN SNOWFLAKE.

### Severity

Medium

### Description

DB2 temporal tables do not have a functional equivalent in Snowflake. When an application-period or system-period temporal table declaration is found in the `CREATE TABLE` columns, that column is commented out from the resulting script. The behavior of the SELECT statement will differ from Snowflake because temporal tables are not part of the Snowflake solution and this causes the result to be different if the Select statement is migrated partially, see the example below for more information about this.

#### Select Query

```sql
 SELECT
  ID,
  Start,
  END
FROM
  timetable
FOR system_time as of '2022-05-09-16.20.17.0';
```

##### Result

| ID | START | END |
| --- | --- | --- |
| 1001 | 19:45.3 | 22:39.5 |
| 1002 | 19:45.5 | 22:39.6 |
| 1003 | 19:45.6 | 22:39.8 |
| 1004 | 19:45.7 | 00:00.0 |
| 1005 | 19:45.8 | 00:00.0 |
| 1006 | 19:46.0 | 00:00.0 |
| 7 | 16:21.8 | 00:00.0 |

If the Select statement is migrated partially we get a very different result as shown below.

##### Select Query

```sql
 SELECT
  ID,
  Start,
  END
FROM
  timetable
-- FOR system_time as of '2022-05-09-16.20.17.0';
```

##### Result

|  |  |  |
| --- | --- | --- |
| ID | START | END |
| 2001 | 22:39.5 | 00:00.0 |
| 2002 | 22:39.6 | 00:00.0 |
| 2003 | 22:39.8 | 00:00.0 |
| 1004 | 19:45.7 | 00:00.0 |
| 1005 | 19:45.8 | 00:00.0 |
| 1006 | 19:46.0 | 00:00.0 |
| 7 | 16:21.8 | 00:00.0 |

#### Code example

##### DB2

##### Create table

```sql
CREATE TABLE TestTable (
COL1 DATE,
COL2 DATE,
PERIOD SYSTEM_TIME (COL1, COL2),
PERIOD BUSINESS_TIME (COL1, COL2)
```

##### Select Query

```sql
SELECT
   *
FROM
   Table1
FOR SYSTEM_TIME AS of Value
```

##### Snowflake

##### Create Table

```sql
CREATE TABLE TestTable (
COL1 DATE,
COL2 DATE,
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0003 - PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
PERIOD SYSTEM_TIME (COL1, COL2),
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0003 - PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
PERIOD BUSINESS_TIME (COL1, COL2)
)
```

##### Select Query

```sql
SELECT
   *
FROM
Table1
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0003 - PERIOD SPECIFICATION IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
FOR SYSTEM_TIME AS of Value
```

### Recommendations

* Snowflake allows the storage of historical table data for up to 90 days, to know more about this see [Understanding & Using Time Travel](https://docs.snowflake.com/en/user-guide/data-time-travel.html).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0004

OUTER TABLE REFERENCE NOT APPLICABLE IN SNOWFLAKE

### Severity

Low

### Description

This message is shown when an OUTER table reference is found in a FROM clause inside of a SELECT statement. This clause is used to include from subtables in the intermediate result table of the SELECT statement. Subtables are related to [typed tables](https://www.ibm.com/docs/en/db2/9.7?topic=tables-creating-typed) in the DB2 database, that are created with the [OF clause](https://docs.mobilize.net/snowconvert-limited-access/-MUuBuIkrrZbtDaKcru_/for-ibm-db2/translation-reference/statements/create-table/content-source/of-type) of the CREATE TABLE statement, which is also not supported in Snowflake.

#### Code example

##### Input code:

```sql
Select * from OUTER(ATable);
Select * from ONLY(ATable);
```

##### Output code:

```sql
Select * from
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0004 - OUTER TABLE REFERENCE IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
OUTER(ATable) AS AliasName;

Select * from
ATable;
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0005

MANIPULATION OF DATA IN VIEWS IS NOT SUPPORTED

### Severity

Medium

### Description

This message is shown when it is found in a CREATE VIEW a node or clause that is related to the data manipulation of rows in a CREATE VIEW. Note that in DB2 you can insert or update rows directly from a VIEW meanwhile in Snowflake this is not supported, because of this, nodes or clauses related to this functionality are commented and an EWI is added.

#### Code example

##### Input code:

```sql
CREATE VIEW TestTableId2 AS Select * from TestTableId1 WITH ROW MOVEMENT;
```

##### Output code:

```sql
 CREATE VIEW TestTableId2
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "db2",  "convertedOn": "09/02/2025",  "domain": "no-domain-provided" }}'
 AS Select * from
  TestTableId1
 !!!RESOLVE EWI!!! /*** SSC-EWI-DB0005 - MANIPULATION OF DATA IN VIEWS IS NOT SUPPORTED. ***/!!!
 WITH ROW MOVEMENT;
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0006

INTERMEDIATE RESULT TABLE IS NOT SUPPORTED

### Severity

Medium

### Description

This message is shown when a DATA CHANGE TABLE REFERENCE is found in a FROM Clause. A DATA CHANGE TABLE REFERENCE specifies an intermediate table, which consists of the rows that are changed by an UPDATE, DELETE or INSERT statement included in the DATA CHANGE TABLE REFERENCE.

In Snowflake, this is not supported, since it can’t modify the rows and return a result set of table at the same time, hence the Select is commented.

#### Code example

##### DB2 Input code:

##### Select statement

```sql
 SELECT
   *
FROM
   OLD Table(UPDATE T1 SET NAME = 'Tony' where ID = 4)
```

##### Update statement

```sql
 UPDATE (SELECT EMPNO, SALARY, COMM,
     AVG(SALARY) OVER (PARTITION BY WORKDEPT),
     AVG(COMM) OVER (PARTITION BY WORKDEPT)
     FROM EMPLOYEE E) AS E(EMPNO, SALARY, COMM, AVGSAL, AVGCOMM)
   SET (SALARY, COMM) = (AVGSAL, AVGCOMM)
   WHERE EMPNO = '000120';

 UPDATE TABLE5
INCLUDE (col1 INT, col2 Varchar(10))
SET Column1 = 1;
```

##### Snowflake Output code:

##### Select statement

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0006 - INTERMEDIATE RESULT TABLE IS NOT SUPPORTED. ***/!!!
   OLD Table(UPDATE T1 SET NAME = 'Tony' where ID = 4)
```

##### Update statement

```sql
UPDATE
       !!!RESOLVE EWI!!! /*** SSC-EWI-DB0006 - INTERMEDIATE RESULT TABLE IS NOT SUPPORTED. ***/!!!
 (SELECT EMPNO, SALARY, COMM,
       AVG(SALARY) OVER (PARTITION BY WORKDEPT),
       AVG(COMM) OVER (PARTITION BY WORKDEPT)
       FROM EMPLOYEE E) AS E(EMPNO, SALARY, COMM, AVGSAL, AVGCOMM)
       SET
       SALARY = AVGSAL,
       COMM = AVGCOMM
WHERE EMPNO = '000120';

UPDATE TABLE5
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0006 - INTERMEDIATE RESULT TABLE IS NOT SUPPORTED. ***/!!!
INCLUDE (col1 INT, col2 Varchar(10))
SET Column1 = 1;
```

#### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0007

QUERY AS INSERT TARGET NAME IS NOT SUPPORTED.

### Severity

Medium

### Description

Unlike DB2, Snowflake does not allow using SELECT query results as the target of an INSERT statement, requiring instead that data be inserted directly into tables or materialized views.

#### Code example

##### DB2

##### Query

```sql
 INSERT INTO
   (SELECT * FROM SOMEOTHERTABLE)
VALUES
   (DEFAULT);
```

##### Snowflake

##### Query

```sql
 INSERT INTO
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0007 - QUERY AS INSERT TARGET NAME IS NOT SUPPORTED ***/!!!
   (SELECT * FROM SOMEOTHERTABLE)
VALUES
   (DEFAULT);
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0008

DELETE FROM SELECT STATEMENT IS NOT SUPPORTED.

### Severity

Medium

### Description

Snowflake does not support the use of select queries in the From clause of a Delete statement. If the Delete statement is migrated partially we get an incomplete statement as the From clause will be empty.

#### Code example

##### DB2

##### Select Query

```sql
 DELETE FROM (
SELECT * FROM table1
)
```

##### Snowflake

##### Select Query

```sql
DELETE FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0008 - DELETE FROM SELECT STATEMENT IS NOT SUPPORTED. ***/!!!
 (
SELECT * FROM table1
)
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0009

POSITIONED STATEMENT IS NOT SUPPORTED.

### Severity

Medium

### Description

Snowflake does not support the use of cursors as part of the Delete statement and Update statement. If the statement is migrated partially, we will get rid of the where clause in which the cursor is forming part, making it dangerous to delete or update the whole table.

#### Code example

##### DB2

##### Delete statement

```sql
 DELETE FROM table1
WHERE CURRENT OF cursor1
```

##### Update statement

```sql
 UPDATE table1
     SET col1 = 1
     WHERE CURRENT OF cursor1
```

##### Snowflake

##### Delete statement

```sql
DELETE FROM
table1
WHERE
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0009 - POSITIONED CURRENT OF IS NOT SUPPORTED. ***/!!! CURRENT OF cursor1
```

##### Update statement

```sql
UPDATE TABLE1
SET Column1 = 1
WHERE
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0009 - POSITIONED CURRENT OF IS NOT SUPPORTED. ***/!!!
 CURRENT OF cursor1;
```

### Recommendations

* For additional support, contact SnowConvert support at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-DB0010

ATTRIBUTE NAME IS NOT SUPPORTED IN SNOWFLAKE

### Severity

Medium

### Description

This message is displayed when specifying the attribute of a structured type that is being set (called an attribute assignment). A structured type can be a subtype that allows attributes to be inherited from a supertype.

Snowflake does not support these types of structures.

For more information, see the [DB2 CREATE TYPE (structured) documentation](https://www.ibm.com/docs/en/db2/11.5?topic=statements-create-type-structured).

#### Code Example

##### DB2

```sql
 UPDATE CIRCLES
     SET C..CENTER..X = C..CENTER..Y,
       C..CENTER..Y = C..CENTER..X
     WHERE ID = 999;
```

##### Snowflake

```sql
UPDATE CIRCLES
     SET
          !!!RESOLVE EWI!!! /*** SSC-EWI-DB0010 - ATTRIBUTE NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!! C..CENTER..X =
          !!!RESOLVE EWI!!! /*** SSC-EWI-DB0010 - ATTRIBUTE NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!! C..CENTER..Y,
          !!!RESOLVE EWI!!! /*** SSC-EWI-DB0010 - ATTRIBUTE NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
       C..CENTER..Y =
                      !!!RESOLVE EWI!!! /*** SSC-EWI-DB0010 - ATTRIBUTE NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!! C..CENTER..X
     WHERE ID = 999
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0011

ASSIGNMENT CLAUSE TYPE IS NOT SUPPORTED IN SNOWFLAKE

### Severity

Medium

### Description

This message is displayed when the assignment clause contains an expression not supported by Snowflake

### Cases

#### Update Statement

When an assignment clause presents a multi-column assignment of a row selection, an example of this can be found in the Code example section.

#### Code Example

##### DB2

```sql
 UPDATE EMPLOYEE EU
    SET (EU.COM, EU.SALARY) = (SELECT ES.SALARY FROM EMPLOYEE ES WHERE ES.WORKDEPT = EU.WORKDEPT)
    WHERE EU.EMPNO = '000120';
```

##### Snowflake

```sql
UPDATE EMPLOYEE EU
    SET
        !!!RESOLVE EWI!!! /*** SSC-EWI-DB0011 - ASSIGNMENT CLAUSE TYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
 (EU.COM, EU.SALARY) = (SELECT ES.SALARY FROM
         EMPLOYEE ES WHERE ES.WORKDEPT = EU.WORKDEPT)
    WHERE EU.EMPNO = '000120';
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0012

INVALID NAME AS INSERTION TARGET, USE OF VIEW NAME IS NOT SUPPORTED IN SNOWFLAKE

### Severity

Medium

### Description

Snowflake does not support the use of view name in the insert target name statement.

#### Code Example

##### DB2

```sql
 CREATE VIEW VIEW1 AS SELECT * FROM T;
INSERT INTO VIEW1 (COL1, COL2) VALUES (NULL, DEFAULT);
```

##### Snowflake

```sql
 CREATE VIEW PUBLIC.VIEW1
AS SELECT * FROM
PUBLIC.T;

!!!RESOLVE EWI!!! /*** SSC-EWI-DB0012 - INVALID NAME AS INSERTION TARGET, USE OF VIEW NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
INSERT INTO VIEW1 (COL1, COL2) VALUES (NULL,DEFAULT);
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0013

INVALID NAME AS DELETE TARGET, USE OF VIEW NAME IS NOT SUPPORTED IN SNOWFLAKE

### Severity

Medium

### Description

Snowflake does not support the use of view name in the delete target name statement. For this reason, the result query could not be valid

#### Code Example

##### DB2

```sql
 CREATE VIEW VIEW1 AS SELECT * FROM T;
DELETE FROM VIEW1
```

##### Snowflake

```sql
 CREATE VIEW PUBLIC.VIEW1
AS SELECT * FROM
PUBLIC.T;

DELETE FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0013 - INVALID NAME AS DELETE TARGET, USE OF VIEW NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
 VIEW1
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0014

THE USE OF EXTERNAL TABLE REFERENCES IS NOT SUPPORTED IN SNOWFLAKE

### Severity

Medium

### Description

Snowflake does not support the use of external tables in the Select statement. For this reason, the result query could not be valid

#### Code Example

##### DB2

```sql
 SELECT
   *
FROM
   EXTERNAL SOMENAME AS T1 LIKE TABLE2 USING(COMPRESS NO)
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0014 - THE USE OF EXTERNAL TABLE REFERENCES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
   EXTERNAL SOMENAME AS T1 LIKE TABLE2 USING(COMPRESS NO)
```

### Recommendations

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0015

The use of Create View Of Type is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects a `CREATE VIEW` statement that uses the `OF type` clause. In DB2, typed views are created with the `OF type MODE DB2SQL` syntax and are based on structured types for object-relational modeling. Snowflake does not support typed views or structured types, so the view definition is marked with this EWI and marked as invalid.

#### Code Example

##### DB2

```sql
CREATE VIEW testView
OF Rootview MODE DB2SQL(REF IS oidColumn USER GENERATED)
AS SELECT * FROM testTable;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0015 - CREATE VIEW OF TYPE IS NOT SUPPORTED ***/!!!
CREATE VIEW testView
OF Rootview MODE DB2SQL(REF IS oidColumn USER GENERATED)
AS SELECT * FROM testTable;
```

### Recommendations

* Refactor the typed view into a standard view or materialized view that selects from the underlying table(s)
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0016

The use of Unnest Function is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects the `UNNEST` or `TABLE` function in a `FROM` clause. In DB2, these table functions expand arrays or collections into rows (optionally with `WITH ORDINALITY` for row numbering). Snowflake has different syntax and semantics for array unnesting—`FLATTEN` is the equivalent—so the DB2 `UNNEST`/`TABLE` usage is marked as not supported.

#### Code Example

##### DB2

```sql
SELECT
   *
FROM
   UNNEST(arrray) WITH ORDINALITY
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0016 - UNNEST FUNCTION IS NOT SUPPORTED ***/!!!
   UNNEST(arrray) WITH ORDINALITY;
```

### Recommendations

* Replace DB2 `UNNEST` or `TABLE` with Snowflake `FLATTEN` to expand arrays into rows. Use `FLATTEN(input => array_column)` with appropriate column references
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0017

The use of Typed Tables is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects a `CREATE TABLE` statement that uses the `OF type` or `UNDER` clause. In DB2, typed tables are defined with a structured type hierarchy (e.g., `OF Student_t UNDER Person`) and support inheritance. Snowflake does not support typed tables or structured types, so the table definition is marked with this EWI.

#### Code Example

##### DB2

```sql
CREATE TABLE Student OF Student_t UNDER Person
INHERIT SELECT PRIVILEGES;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0017 - TYPED TABLES ARE NOT SUPPORTED ***/!!!
CREATE TABLE Student OF Student_t UNDER Person
INHERIT SELECT PRIVILEGES;
```

### Recommendations

* Refactor typed tables into standard tables. Model the type hierarchy with separate tables and foreign keys if inheritance relationships need to be preserved
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0018

The use of Staging Tables is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects a `CREATE TABLE` statement that defines a staging table using the `FOR` clause (e.g., `CREATE TABLE emp_summary_s FOR emp_summary PROPAGATE IMMEDIATE`). In DB2, staging tables are used for materialized query table propagation. Snowflake does not support this construct, so the table definition is marked with this EWI.

#### Code Example

##### DB2

```sql
CREATE TABLE emp_summary_s FOR emp_summary PROPAGATE IMMEDIATE;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0018 - STAGING TABLES ARE NOT SUPPORTED ***/!!!
CREATE TABLE emp_summary_s FOR emp_summary PROPAGATE IMMEDIATE;
```

### Recommendations

* Use Snowflake streams and tasks, or materialized views with refresh logic, to achieve similar incremental propagation behavior
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0019

The use of Analyze Table Factor is not supported in Snowflake

### Severity

Low

### Description

This message is shown when SnowConvert AI detects an `ANALYZE_TABLE` table factor in a `FROM` clause. In DB2, `ANALYZE_TABLE` invokes external analytics (e.g., SAS routines) inline in a query. Snowflake does not support this DB2-specific analytics integration, so the table reference is marked with this EWI.

#### Code Example

##### DB2

```sql
SELECT
   *
FROM v1 ANALYZE_TABLE(
   IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;')
ORDER BY 1;
```

##### Snowflake

```sql
SELECT
   *
FROM
   !!!RESOLVE EWI!!! /*** SSC-EWI-DB0019 - ANALYZE TABLE FACTOR IS NOT SUPPORTED ***/!!!
   v1 ANALYZE_TABLE(
   IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;')
ORDER BY 1;
```

### Recommendations

* Implement the analytics logic in Snowflake using Snowpark (Python/Java), stored procedures, or external functions, and restructure the query accordingly
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0020

The use of Data Capture is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects the `DATA CAPTURE CHANGES` (or `DATA CAPTURE NONE`) clause in a `CREATE TABLE` statement. In DB2, this clause controls whether changed data is captured for replication (e.g., Q Replication). Snowflake does not support this DB2-specific clause, so it is marked with this EWI.

#### Code Example

##### DB2

```sql
CREATE TABLE TestTable
(
   COL1 INT
) DATA CAPTURE CHANGES;
```

##### Snowflake

```sql
CREATE TABLE TestTable
(
   COL1 INT
)
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0020 - DATA CAPTURE IS NOT SUPPORTED ***/!!!
 DATA CAPTURE CHANGES
;
```

### Recommendations

* For change data capture in Snowflake, use [Streams](https://docs.snowflake.com/en/user-guide/streams-intro) to track changes on tables
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0021

The use of Materialized Query is not supported in Snowflake

### Severity

Low

### Description

This message is shown when SnowConvert AI detects a `CREATE TABLE ... AS` statement with materialized query options such as `DATA INITIALLY DEFERRED`, `REFRESH DEFERRED`, `MAINTAINED BY SYSTEM`, or `ENABLE QUERY OPTIMIZATION`. In DB2, these options define a refreshable materialized query table. Snowflake materialized views use different syntax and semantics, so these options are marked with this EWI.

#### Code Example

##### DB2

```sql
CREATE TABLE TRANSCNT (ACCTID, LOCID, YEAR, CNT) AS
  (SELECT ACCOUNTID, LOCATIONID, YEAR, COUNT(*)
     FROM TRANS
     GROUP BY ACCOUNTID, LOCATIONID, YEAR )
     DATA INITIALLY DEFERRED
     REFRESH DEFERRED
     MAINTAINED BY SYSTEM
     ENABLE QUERY OPTIMIZATION;
```

##### Snowflake

```sql
CREATE TABLE TRANSCNT (ACCTID, LOCID, YEAR, CNT) AS
  (SELECT ACCOUNTID, LOCATIONID, YEAR, COUNT(*)
     FROM TRANS
     GROUP BY ACCOUNTID, LOCATIONID, YEAR )
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0021 - MATERIALIZED QUERY IS NOT SUPPORTED ***/!!!
     DATA INITIALLY DEFERRED
     REFRESH DEFERRED
     MAINTAINED BY SYSTEM
     ENABLE QUERY OPTIMIZATION
;
```

### Recommendations

* Convert to a Snowflake [materialized view](https://docs.snowflake.com/en/user-guide/views-materialized) if you need automatic refresh. Use `CREATE MATERIALIZED VIEW` with appropriate refresh settings
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-DB0022

The use of With Select Analyzed Table is not supported in Snowflake

### Severity

High

### Description

This message is shown when SnowConvert AI detects a `WITH` (CTE) query in which the main `SELECT` references a table using the `ANALYZE_TABLE` table factor. DB2 allows inline analytics (e.g., SAS routines) via `ANALYZE_TABLE` in such contexts. Snowflake does not support this, so the entire `WITH` query is marked with this EWI.

#### Code Example

##### DB2

```sql
WITH sas_score_in (c1,c2) AS
  (SELECT c1,c2 FROM t1)
  SELECT *
    FROM sas_score_in ANALYZE_TABLE(
    IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;');
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-DB0022 - WITH SELECT ANALYZED TABLE IS NOT SUPPORTED ***/!!!
WITH sas_score_in (c1,c2) AS
  (SELECT c1,c2 FROM t1)
  SELECT *
    FROM sas_score_in ANALYZE_TABLE(
    IMPLEMENTATION 'PROVIDER=SAS; ROUTINE_SOURCE_TABLE=ETLIN.SOURCE_TABLE; ROUTINE_SOURCE_NAME=SCORING_FUN3;');
```

### Recommendations

* Refactor the query to remove `ANALYZE_TABLE`. Implement the analytics logic in Snowflake using Snowpark, stored procedures, or external functions, then integrate results via a separate step or view
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Informatica PowerCenter
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/informatica/README.md
section: Migrations
---

# SnowConvert AI - Informatica PowerCenter

This topic documents the Informatica PowerCenter components supported by SnowConvert AI and describes the output generated for each component category.

For general information about the ETL migration process, prerequisites, and step-by-step migration instructions, see [ETL Migration](../../general/user-guide/etl-migration-replatform.md).

## Control flow elements

Informatica Workflows define the orchestration logic: which Sessions run, in what order, and with what variable context. SnowConvert AI converts Workflow elements into Snowflake [TASK](../../../../sql-reference/sql/create-task.md) DAGs and [stored procedures](../../../../sql-reference/sql/create-procedure.md).

These Informatica Workflow elements are supported:

| Element | Category | Conversion Target | Notes |
| --- | --- | --- | --- |
| Session | Task | Snowflake TASK with `EXECUTE DBT PROJECT` | See Session task section |
| Session Overrides | Configuration | Applied during dbt project generation | Pre/Post SQL, SQL Override |
| Worklet | Container | Nested Snowflake [Task Graph](../../../../user-guide/tasks-graphs.md) | See Worklet task section |
| Start | Task | Initialization TASK | Variable setup via `InitVariablesFromConfig` |
| Variable Assignment | Configuration | Table-driven variable management | See Variable management section |
| Parameter Files | Configuration | Loaded into `control_variables` at runtime | `.txt` and `.xml` formats supported |

> **Note:**
>
> Unlisted Workflow elements generate an EWI code indicating manual conversion is required.

### Session task

A Session in Informatica executes a Mapping at runtime. Each Session is converted into a Snowflake TASK that runs the corresponding dbt project.

**Conversion output:**

* A `CREATE OR REPLACE TASK` statement with `AFTER` dependency on the preceding task
* Variable initialization via `BuildDbtVarsJsonUDF`
* `EXECUTE DBT PROJECT` call with the variable JSON payload

**Example:**

```sql
CREATE OR REPLACE TASK public.wf_daily_load_s_load_customers
WAREHOUSE = DUMMY_WAREHOUSE
AFTER public.wf_daily_load
AS
BEGIN
    LET dbt_vars VARCHAR := public.BuildDbtVarsJsonUDF('wf_daily_load');
    EXECUTE DBT PROJECT public.m_load_customers ARGS = :dbt_vars;
END;
```

**Notes:**

* Replace `DUMMY_WAREHOUSE` with your actual Snowflake warehouse name
* The `AFTER` clause preserves the original Workflow execution order
* Reusable Sessions are resolved to reference the correct Mapping

### Session overrides

Informatica Sessions can override properties defined at the Mapping level. SnowConvert AI captures these overrides and applies them during conversion.

#### Pre-SQL and Post-SQL

* Session-level Pre-SQL statements are generated before the `EXECUTE DBT PROJECT` call within the TASK body
* Session-level Post-SQL statements are generated after the `EXECUTE DBT PROJECT` call
* Pre-SQL and Post-SQL on Target transformations are also supported

#### SQL Override

* When a Session defines a SQL Override for a Source Qualifier, the override takes priority over the Mapping-level SQL
* The override is applied during dbt model generation, replacing the default source query

### Worklet task

Informatica Worklets are reusable, nested sub-workflows that can be embedded inside a parent Workflow. SnowConvert AI converts Worklets into nested Snowflake [Task Graphs](../../../../user-guide/tasks-graphs.md).

**Conversion output:**

* A separate SQL file per Worklet in the `worklets/` directory
* The Worklet’s internal Start task and Session tasks are converted following the same patterns as the parent Workflow
* The parent Workflow references the Worklet root task via an `AFTER` dependency

**Example structure:**

```text
{FolderName}/{MappingName}/
├── wf_daily_load.sql               # Parent Workflow
└── worklets/
    └── wklt_validate.sql             # Worklet sub-graph
```

The Worklet [Task Graph](../../../../user-guide/tasks-graphs.md) preserves the original execution hierarchy. Worklet tasks follow the same naming and variable management patterns as the parent Workflow.

### Start task

The Informatica Start task (the entry point of a Workflow) is converted into an initialization TASK that sets up the variable context:

```sql
CREATE OR REPLACE TASK public.wf_daily_load AS
BEGIN
    CALL public.InitVariablesFromConfig('wf_daily_load');
END;
```

This TASK loads default variable values and applies parameter file overrides before any Session tasks execute.

### dbt project execution

Within the orchestration code, Sessions are executed using Snowflake’s `EXECUTE DBT PROJECT` command:

```sql
EXECUTE DBT PROJECT schema.project_name ARGS = :dbt_vars;
```

**Important requirements:**

* The `project_name` must match the name used when deploying the dbt project (via `CREATE DBT PROJECT` or Snowflake Workspace deployment)
* The `ARGS` parameter passes workflow variables as a JSON payload built by `BuildDbtVarsJsonUDF`
* Each execution runs the entire dbt project with all models in dependency order

**Deployment options:**

* Snowflake CLI: `snow dbt deploy --schema schema_name --database database_name --force package_name`
* Snowflake Workspace: Upload and deploy via UI

### Variable management

Informatica variables (`$$var` assignments) are converted into a table-driven management system. The infrastructure components are generated in the `etl_configuration/` folder.

#### Control variables table

The `control_variables` table stores all workflow variables, parameters, and their values:

| Field | Type | Description |
| --- | --- | --- |
| `variable_name` | VARCHAR | Variable name |
| `variable_value` | VARIANT | Value (accommodates any data type) |
| `variable_type` | VARCHAR | Original Informatica data type |
| `variable_scope` | VARCHAR | Workflow or session name |
| `is_parameter` | BOOLEAN | Distinguishes parameters from variables |
| `is_persistent` | BOOLEAN | Reserved for future use |
| `last_updated_at` | TIMESTAMP | Last update time |

#### UDFs and procedures

| Component | Purpose |
| --- | --- |
| **GetControlVariableUDF** | Retrieves a variable value from the control table |
| **BuildDbtVarsJsonUDF** | Constructs a JSON payload of all variables for a workflow scope, passed as `ARGS` to `EXECUTE DBT PROJECT` |
| **InitVariablesFromConfig** | Initializes variables at workflow start; loads defaults and applies parameter file overrides |
| **UpdateControlVariable** | Updates a variable value during orchestration execution |

#### Parameter file support

Informatica `.txt` and `.xml` parameter files are parsed during migration. Parameter values are loaded into the `control_variables` table at runtime via `InitVariablesFromConfig`, overriding default variable values for each workflow execution.

#### Built-in variables

| Variable | Description |
| --- | --- |
| `SESSSTARTTIME` | Session start timestamp. Captured at task start via `CURRENT_TIMESTAMP()`. |
| `SYSDATE` | Current date and time. Mapped to `CURRENT_TIMESTAMP()`. |

## Data flow components

Informatica Mappings define the data transformation logic: how data flows from sources through transformations to targets. SnowConvert AI converts each Mapping into a standalone [dbt](../../../../user-guide/data-engineering/dbt-projects-on-snowflake.md) project with a three-tier model architecture.

These Informatica Mapping transformations are supported:

| Informatica Component | Category | dbt Output | Naming Pattern | Notes |
| --- | --- | --- | --- | --- |
| **Source Qualifier** | Source | Staging Model | `stg_{source_name}` |  |
| **Source Definition** | Source | Staging Model | `stg_{source_name}` |  |
| **Expression** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Filter** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Joiner** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Lookup (Connected)** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Lookup (Unconnected)** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Aggregator** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Aggregator (No Group By)** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Router** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Sorter** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Union** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Normalizer** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Rank** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Sequence Generator** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Update Strategy** | Transformation | Intermediate Model | `int_{transformation_name}` | Mart uses incremental materialization with merge strategy |
| **Stored Procedure** | Transformation | Intermediate Model | `int_{transformation_name}` |  |
| **Mapplet** | Reuse | dbt Macro | `macros/{mapplet_name}.sql` |  |
| **Target Definition** | Destination | Mart Model | `{target_name}` |  |

> **Note:**
>
> Unlisted Mapping transformations generate an EWI code indicating manual conversion is required.

### Layer organization

| Layer | Materialization | Purpose |
| --- | --- | --- |
| **models/staging/** | View | Clean, type-safe access to source data referenced in `sources.yml`. Generated from Source Qualifier and Source Definition transformations. |
| **models/intermediate/** | Ephemeral | Transformation logic from the original Mapping. Not persisted to database. Generated from Expression, Joiner, Filter, Lookup, and other transformations. |
| **models/marts/** | Incremental or Table | Business-ready data models corresponding to Target Definitions. Uses incremental materialization with merge strategy when an Update Strategy transformation is present. |

### dbt project structure

Each Mapping produces a standalone dbt project:

```text
{MappingName}/
├── dbt_project.yml                   # Materialization config
├── profiles.yml                      # Snowflake connection profile
├── models/
│   ├── sources.yml                   # Source table definitions
│   ├── staging/
│   │   ├── stg_customers.sql
│   │   └── stg_regions.sql
│   ├── intermediate/
│   │   ├── int_expression.sql
│   │   └── int_joiner.sql
│   └── marts/
│       └── customer_dim.sql
└── macros/
    └── *.sql                         # Mapplet-derived macros
```

> **Important:**
>
> Before deploying, replace the `YOUR_SCHEMA` and `YOUR_DB` placeholders in `sources.yml` and `profiles.yml` with your actual Snowflake schema and database names.

### Expression functions

SnowConvert AI supports over 60 Informatica expression functions across the following categories:

| Category | Functions |
| --- | --- |
| **String** | CONCAT, SUBSTR, LENGTH, LOWER, UPPER, LTRIM, RTRIM, LPAD, RPAD, INSTR, REPLACECHR, REPLACESTR, REVERSE, INITCAP, CHR, ASCII |
| **Date and time** | ADD_TO_DATE, GET_DATE_PART, TO_DATE, DATE_DIFF, DATE_COMPARE, LAST_DAY, SYSTIMESTAMP, TO_CHAR |
| **Numeric** | ROUND, TRUNC, ABS, CEIL, FLOOR, POWER, SQRT, EXP, LN, LOG, MOD, MEDIAN, SIGN |
| **Aggregate** | SUM, MIN, MAX, AVG, COUNT, CUME |
| **Type conversion** | TO_INTEGER, TO_DECIMAL, TO_BIGINT, TO_FLOAT, TO_CHAR, TO_DATE |
| **Conditional** | IIF, DECODE, ISNULL, IN |
| **Encoding** | MD5, ENC_BASE64 |
| **Variable management** | SETVARIABLE, SETMAXVARIABLE, SETMINVARIABLE, ABORT |
| **Other** | GREATEST, LEAST, REG_MATCH |

### Naming and sanitization rules

SnowConvert AI applies consistent sanitization rules to all Informatica object names to ensure dbt and Snowflake compatibility:

| Rule | Description | Example |
| --- | --- | --- |
| **Convert to lowercase** | All names converted to lowercase | `M_Load_Customers` → `m_load_customers` |
| **Replace invalid characters** | Spaces, hyphens, and special characters become underscores | `m_load-customer data` → `m_load_customer_data` |
| **Remove consecutive underscores** | Avoids `__` sequences | `m___load` → `m_load` |
| **Prefix with `t_`** | Adds prefix if name starts with a number | `123_mapping` → `t_123_mapping` |
| **Remove quotes and brackets** | Strips surrounding quotes and brackets | `"M_Load"` → `m_load` |

These rules apply uniformly across all generated artifacts: dbt model names, Snowflake TASK names, procedure names, and variable identifiers.

---
title: SnowConvert AI - Issues Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/issues-report.md
section: Migrations
---

# SnowConvert AI - Issues Report

## What is an “Issue”?

An issue is a message that provides relevant information about the transformations done by SnowConvert AI.

### Where can I find it?

The issues report can be found in a folder named *“Reports”*, in the output folder of your conversion. The name of the file itself starts with *“Issues”* so it can easily be located.

The format of the file is **.CSV**.

### What information does it contain?

The issues report contains the following information about all the issues added during the conversion:

| Column | Description |
| --- | --- |
| Session ID | The session ID of the transformation. This is a unique identifier for the transformation session. |
| Severity | One of the following values: Critical, High, Medium, Low, or None. This is an indicator of how much effort it takes to manually solve the problem. The None severity does not punish the conversion rate of the code unit. |
| Code | A unique identifier for the issue. |
| Name | The name of the issue message. |
| Description | The final message that was added to the output code. Something important to take into account is that some of the issues might have slightly different descriptions even though they have the same issue code, this happens because some of the descriptions have dynamic values. |
| Parent File | The relative path of the file where the issues is generated. |
| Line | The text line within the parent file where the issue is generated. |
| Column | The column within the line where the issue is generated. |
| Code Unit Database | The database name (if applicable) of the code unit that contains the issue message. It might be empty because the generated issue has no explicit database name or it is not generated inside a code unit with a name that identifies it. |
| Code Unit Schema | The schema name (if applicable) of the code unit that contains the issue message. It might be empty because the generated issue has no explicit schema name or it is not generated inside a code unit with a name that identifies it. |
| Code Unit Package | The package name (if applicable) of the code unit that contains the issue message. It might be empty because the generated issue has no explicit schema name or it is not generated inside a code unit with a name that identifies it. This column only applies to Oracle SQL migrations. |
| Code Unit Name | The name of the code unit, without database and or schema qualification. This column only applies to code units that have a name that identifies them. |
| Code Unit ID | A string that uniquely identifies the code unit. The name of the object, without database and or schema qualification. |
| Code Unit | The code unit that contains the issue. |
| Code Unit Size | A size classification of the code unit, based on its line of code. The available measurements are XS, S, M, L, and XL. |
| Language | The programming language or SQL dialect of the source code unit. |

### Report Example

Given the following Oracle SQL input code, SnowConvert AI will add the SSC-FDM-OR0035 conversion issue.

```sql
CREATE OR REPLACE PROCEDURE schema1.procedure1
AS
BEGIN
  DBMS_OUTPUT.PUT_LINE('hello world');
END;
```

```none
CREATE OR REPLACE PROCEDURE schema1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
    CALL DBMS_OUTPUT.PUT_LINE_UDF('hello world');
  END;
$$;
```

The row in the issues report for the SSC-FDM-OR0035 conversion issue will have the following information:

| Column | Value |
| --- | --- |
| Session ID | Not available |
| Severity | Low |
| Code | SSC-FDM-OR0035 |
| Name | Custom UDF inserted |
| Description | CUSTOM UDF ‘DBMS_OUTPUT.PUT_LINE_UDF’ INSERTED. |
| Parent File | sample.sql |
| Line | 4 |
| Column | 3 |
| Code Unit Database | N/A |
| Code Unit Schema | schema1 |
| Code Unit Package | N/A |
| Code Unit Name | procedure1 |
| Code Unit Id | schema1.procedure1 |
| Code Unit | CREATE PROCEDURE |
| Code Unit Size | XS |

---
title: SnowConvert AI - Linux
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-install-the-tool/linux.md
section: Migrations
---

# SnowConvert AI - Linux

Support for Linux is limited to the Command Line Interface (CLI).

For Command Line Interface (CLI) refer to [SnowConvert AI CLI](../command-line-interface/README.md).

---
title: SnowConvert AI - MacOS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-install-the-tool/macos.md
section: Migrations
---

# SnowConvert AI - MacOS

## MacOS Installation

1. Click on the [downloaded](../../../getting-started/download-and-access.md) .dmg file.
2. Double-click on the SnowConvert AI logo or drag it into the application’s folder.
   \

Once your installation is complete, you can launch SnowConvert.

### Setting up the CLI

For Command Line Interface (CLI) refer to [SnowConvert AI CLI](../command-line-interface/README.md).

---
title: SnowConvert AI - Migration Assistant
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/README.md
section: Migrations
---

# SnowConvert AI - Migration Assistant

Visual Studio Code Extension

The SnowConvert AI Migration Assistant is an AI-powered tool designed to streamline the resolution of errors, warnings, and issues ([EWIs](../general/technical-documentation/issues-and-troubleshooting/conversion-issues/README.md)) encountered after converting SQL code using SnowConvert.

Integrated within the Snowflake Visual Studio Code extension, the Migration Assistant offers an interactive workflow for navigating, understanding, and fixing EWIs, accelerating your migration to Snowflake.

The assistant leverages the [Snowflake REST API](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-rest-api) to provide explanations and actionable suggestions for EWIs that SnowConvert AI cannot automatically resolve.

> **Warning:**
>
> * The SnowConvert AI Migration Assistant uses **Snowflake Cortex AI** to provide helpful suggestions. Large language models can make mistakes, so it’s essential to review and validate all explanations and fixes before implementation.
> * Using this tool requires signing in to your Snowflake account and having access to **SNOWFLAKE.CORTEX.COMPLETE** and at least one of the [supported models](model-preference.md) by the Assistant.
> * You can use [cross-region inference](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cross-region-inference) if your preferred models are not available in your default region.

## Key Features

* AI-driven analysis of EWIs using the Snowflake REST API.
* Explanations of EWI root causes.
* Chat interaction about SQL-related topics
* Actionable solutions and recommendations.
* Seamless integration with the Snowflake Visual Studio Code extension.

## Supported sources

SnowConvert AI Migration Assistant has been optimized for migrations with Microsoft SQL Server as a source database, and we recommend using it for migrations from this source.

The assistant is designed to work with all supported SnowConvert AI source databases, and in **future releases**, we will optimize results for a wider set of source databases.

## Learn More

* [Getting Started](getting-started.md)
* [Troubleshooting](troubleshooting.md)
* [Legal Notices](legal-notices.md)

---
title: SnowConvert AI - Migration Assistant - Billing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/billing.md
section: Migrations
---

# SnowConvert AI - Migration Assistant - Billing

The SnowConvert AI Migration Assistant uses the [Snowflake Cortex REST API](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-rest-api), which incurs compute costs based on the number of tokens processed. You can view current rates in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf), and get more information on how to monitor LLM usage and costs in your account in the [Snowflake documentation for using Large Language Models](https://docs.snowflake.com/en/user-guide/snowflake-cortex/llm-functions#cost-considerations).

It’s not possible to precisely estimate the cost of a given interaction with the Migration Assistant because each response is custom-generated based on the code object you’re working on. A reasonable estimate for a common, one-cycle interaction to resolve an EWI is 3500 tokens. At current rates of 2.55 credits per million tokens, this results in 0.0089 credits used, at a cost of ~$0.027 for [Enterprise Edition in AWS, US East (Northern Virginia)](https://www.snowflake.com/en/pricing-options/). This is only an estimate, and it is possible for interactions to use significantly more tokens than this, especially if they involve multiple rounds of conversation with the LLM.

Since SnowConvert AI Migration Assistant is using Snowflake Cortex REST API, the only way to get the required information to estimate the costs of the requests executed is by consulting the [CORTEX_ACCOUNT_USAGE_HISTORY](https://docs.snowflake.com/en/sql-reference/account-usage/cortex_functions_usage_history) view by running the following query:

```sql
 SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_FUNCTIONS_USAGE_HISTORY WHERE warehouse_id = 0 ORDER BY start_time DESC;
```

The provided query isolates requests made by the SnowConvert AI Migration Assistant by filtering for calls that did not use a virtual warehouse. This condition is effective because Cortex REST API calls are processed without a warehouse, allowing us to distinguish them from standard SQL-based queries.

> **Warning:**
>
> **Limitation**: This query cannot distinguish between REST API calls made by the SnowConvert AI Migration Assistant and any other REST API calls executed by the same user. Consequently, if a user utilizes the Cortex REST API for other purposes, it will be impossible to isolate and attribute consumption specifically to this tool.

---
title: SnowConvert AI - Migration Assistant - Getting Started
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/getting-started.md
section: Migrations
---

# SnowConvert AI - Migration Assistant - Getting Started

This guide will walk you through the SnowConvert AI Migration Assistant’s basic steps to resolve post-conversion issues in your SQL code.

## Prerequisites

* You have installed the Snowflake Visual Studio Code extension version **GA** **1.14.0** or later.

> **Warning:**
>
> Please be aware that the documentation has been updated to reflect changes in version 1.17.0. The streaming feature, along with some instruction changes, e.g, [Billing](billing.md), are only available in version 1.17.0 or newer.

* You have **.sql** files that contain EWIs from SnowConvert.
* You have a Snowflake account with access to any of the supported models. For more information, please check the [Model Preference documentation](model-preference.md).

## Steps

### 1. Install the Snowflake Visual Studio Code extension

See Snowflake documentation on how to install from the [Visual Studio Marketplace](https://docs.snowflake.com/en/user-guide/vscode-ext#install-the-vs-code-extension-from-visual-studio-marketplace) or from a [.vsix file](https://docs.snowflake.com/en/user-guide/vscode-ext#install-the-vs-code-extension-from-a-vsix-file).

Be sure you’re using version **GA** **1.14.0** or later.

### 2. Sign in to Snowflake with the Visual Studio Code extension

See Snowflake documentation on how to [sign in](https://docs.snowflake.com/en/user-guide/vscode-ext#sign-in-to-snowflake-with-the-vs-code-extension) to Snowflake using the VS Code extension.

### 3. Enable SnowConvert AI Migration Assistant in the Snowflake VS Code Extension Settings

Open the VS Code settings panel and navigate to Extensions. Select the Snowflake extension, and open the settings panel for the Snowflake extension.

In the Snowflake extension settings, you must:

* Check “Enable SnowConvert AI Migration Assistant”

### 4. Set up Model Preference

For more information about how to set up the model preference, please check the [Model Preference](model-preference.md) documentation.

### 5. Open a workspace folder containing SnowConvert AI migration results

First, ensure you have a workspace folder open in Visual Studio Code. Then, access the Snowflake extension by selecting its icon from the activity bar on the left. A “SnowConvert AI Issues” panel will appear at the bottom within the Snowflake extension’s view. This panel automatically populates with a list of all folders and files in the current workspace that have SnowConvert AI migration issues. If no workspace is selected, the following message is prompted on the SnowConvert AI Issues panel: “No SnowConvert AI Migration issues found.”

Once your workspace folder containing SnowConvert AI migration issues is open, you can access the toolbar by hovering over the “SnowConvert AI Issues” panel. This toolbar in the panel’s top-left corner allows you to interact with the list of migration issues identified.

* **🏠 (Return to Workspace Root):** Clicking this icon resets the view to display the entire workspace folder’s initial state.
* **📁 (Select Folder):** Allows you to navigate to and select a specific subfolder within your workspace to focus the issue list.
* **🔄 (Refresh Issues):** Use this to update the list of SnowConvert AI migration issues manually. The list will also update automatically whenever an issue is resolved or a new one is detected.
* **➖ (Collapse All):** Collapses all expanded items in the issues list for a more compact view.

### 6. See SnowConvert AI Migration Issues and click the sparkles for help resolving

Once you’ve opened a folder containing .sql files with migration issues, you will see a list of all the EWIs, FDMs, and PRFs in that folder and the files containing them. Clicking on a migration issue from the list will focus the code editor on the line of code where the issue was found.

> **Note:**
>
> **EWIs** are indicated by the ⚠️ icon.
>
> **FDMs and PRFs** are indicated by the ℹ️ icon.
>
> The folder icon changes from 📁 (collapsed) to 📂 (expanded) to reflect its state.

There are two ways to get AI-powered assistance and recommended solutions for a migration issue:

1. Click the sparkles icon located next to the migration issue in the list.

2. Click on the CodeLenses identified by *SnowConvert AI, which are* located above every migration issue.

### 7. Get help

Once you click the sparkles icon or the CodeLenses, the SnowConvert AI Migration Assistant will query Snowflake Cortex AI with the migration issue and a snippet of the code context surrounding the migration issue. The call to Cortex happens entirely within your Snowflake account, using the connection details you configured in the Snowflake VS Code Extension.

Once a result has been generated, it will appear in a panel to the right of the code editor. The result will contain an explanation of the migration issue in the context of your code, and a suggested fix to make the code run correctly on Snowflake. If the assistant is unable to generate a response with high confidence, it will abstain from providing a recommended solution.

### 8. Interacting with the Migration Assistant

* **Refine Solutions:** If an AI suggestion is incorrect or you prefer a different approach, enter your preferred changes or instructions into the chatbox.
* **Ask SQL-Related Questions:** If the suggestion is correct, you can still ask for clarifications or further explanations on any SQL-related topic.
* **Request Code Modifications:** You can also ask for specific code changes, such as adding a header to your script.

> **Note:**
>
> The assistant will refrain from answering non-SQL-related questions.

---
title: SnowConvert AI - Migration Assistant - Legal Notices
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/legal-notices.md
section: Migrations
---

# SnowConvert AI - Migration Assistant - Legal Notices

This feature relies on the [Snowflake Cortex REST API](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cortex-rest-api) to generate explanations of migration issues and suggest fixes. When the user interacts with the assistant, Usage Data may be collected through the COMPLETE function that is executed in the background.

For additional information about the use of AI, see [Snowflake AI and ML](https://docs.snowflake.com/en/guides-overview-ai-features).

---
title: SnowConvert AI - Migration Assistant - Model Preference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/model-preference.md
section: Migrations
---

# SnowConvert AI - Migration Assistant - Model Preference

The SnowConvert AI Migration Assistant supports configurable AI model preferences with automatic fallback functionality. This feature allows you to customize which AI models are used for generating fixes and in what order they should be attempted.

## Supported Models

The Migration Assistant supports the following AI models through Snowflake Cortex AI:

| Model | Status | Description |
| --- | --- | --- |
| Claude 3.7 Sonnet | Recommended | Best quality responses, optimized for migration assistance |
| Claude 3.5 Sonnet | Stable | High quality alternative if Claude 3.7 is unavailable |
| Claude 4 Sonnet | Experimental | Improved quality over Claude 3.7 Sonnet, latest Claude model |
| Llama 3.1 70B | Experimental | Results may vary, assistant optimized for Claude models |
| Mistral Large 2 | Experimental | Results may vary, assistant optimized for Claude models |

> **Warning:**
>
> The Migration Assistant has been primarily optimized for Claude models. While other models are supported, they may provide varying quality results compared to the Claude models.

## How to Configure Model Preferences

Open VS Code Settings

* Go to File > Preferences > Settings (or Code > Preferences > Settings on macOS)
* Or use the keyboard shortcut: `Ctrl + ,` (Windows/Linux) or `Cmd + ,` (macOS)

Navigate to the Settings

* Search for **Snowflake: Snow Convert Migration Assistant: Model Preference**
* Or navigate to Extensions > Snowflake > **Snowflake: Snow Convert Migration Assistant: Model Preference**

Configure Your Preferences

* **Add models**

  + Select any model from the dropdown list to add it to your preferences
  + The model will be added to the end of your current list
* **Remove models**

  + Click the “X” next to any model to remove it from your preferences
  + You must have at least one model configured to use the assistant
* **Reorder models**

  + You can either:

    - Remove or add models to your desired order.
    - Change the model by clicking the “pencil” icon and selecting one of the available models.
  + The first model in the list will always be attempted first

### Default Configuration

By default, the Migration Assistant comes configured with this model preference order:

1. Claude 3.7 Sonnet (recommended)
2. Claude 3.5 Sonnet (high quality alternative)
3. Llama 3.1 70B (experimental)
4. Mistral Large 2 (experimental)

## Execution Order and Fallback Mechanism

The Migration Assistant uses an intelligent fallback system that works as follows:

1. **Sequential Execution**: Models are tried in the exact order you specify in your preference list
2. **Automatic Fallback**: If the first model fails or is unavailable, the assistant automatically attempts the next model in your list
3. **Complete Cycle**: The process continues through your entire model list until one succeeds or all models have been exhausted
4. **Error Handling**: If all models fail, you’ll receive detailed error information and suggestions for each attempted model

Example Execution Flow:

```none
1. Attempt: Claude 3.7 Sonnet → Failed (model unavailable in region)
2. Attempt: Claude 3.5 Sonnet → Failed (budget exceeded)
3. Attempt: Llama 3.1 70B → Success → Response generated
```

---
title: SnowConvert AI - Migration Assistant - Troubleshooting
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/migration-assistant/troubleshooting.md
section: Migrations
---

# SnowConvert AI - Migration Assistant - Troubleshooting

Guidance on resolving issues you may encounter when using the SnowConvert AI Migration Assistant.

## **1. The Explanation or Fix suggestion is incorrect**

SnowConvert AI Migration Assistant uses Snowflake Cortex AI to generate suggestions and explanations using Large Language Models (LLMs). These models can make mistakes, so please review each output thoroughly and carefully before applying it.

## **2. Error when trying to run Cortex**

If an error occurs when executing the Snowflake Cortex, verify that your Snowflake account has access to Snowflake Cortex features, specifically, the [COMPLETE](https://docs.snowflake.com/en/sql-reference/functions/complete-snowflake-cortex) function, and the [models](model-preference.md) you selected.

If one or more of your selected models are not available in your default Snowflake region, you can configure [Cross-Region Inference](https://docs.snowflake.com/en/user-guide/snowflake-cortex/cross-region-inference) to allow Cortex calls to process in a region where they are available.

Access to these features is **necessary** to use the AI capabilities of the SnowConvert AI Migration Assistant.

## **3. No issues are listed in the SnowConvert AI Issues panel**

If no issues are listed, make sure you have selected a file with the .sql extension that contains EWIs, which are included in the .sql files that SnowConvert AI provides.

## **4. Removed issues are still on the list**

To remove EWIs you’ve resolved from the SnowConvert AI Issues list, refresh the list using the refresh button at the top of the list.

---
title: SnowConvert AI - Missing Objects Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/missing-objects-report.md
section: Migrations
---

# SnowConvert AI - Missing Objects Report

## What is a “Missing Object”?

Missing object is the term used to refer to missing DDL definitions inside the source code that are being referenced by code units. The table below shows which elements could be missing objects in each supported language.

| Object | Teradata | Oracle | Transact-SQL | Redshift | BigQuery | Spark | Databricks | Hive | Vertica | PostgreSQL | Greenplum | Netezza | Azure Synapse | IBM DB2 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Table | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| View | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Procedure | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Function | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Macro | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |  |
| Package Function |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Package Procedure |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| \*Package |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Join Index | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |  |
| Index |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Synonym |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Database Link |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Type | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Materialized View |  | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |  | ✓ | ✓ | ✓ | ✓ |  |
| Trigger | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Sequence | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Constraint |  | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |

> **Note:**
>
> If an asterisk (‘\*’) is listed in the section above, it means that the object is used to call properties from itself that are not considered DDL statements such as constants, variables, or cursors.

### Where can I find it?

The missing objects report can be found in a folder named *“reports”*, in the output folder of your conversion. The name of the file itself starts with *“MissingObjectReferences”* so it can easily be located.

The format of the file is **.CSV**.

### What information does it contain?

The missing objects report contains the following information about all the missing objects found while converting:

| Column | Description |
| --- | --- |
| PartitionKey | The unique identifier of the conversion. |
| FileName | The name of the file in which the object is located. |
| Caller_CodeUnit | The type of code unit that references a missing element. |
| Caller_CodeUnit_Database | The database where the code unit referencing the missing element is deployed. For now, only SQL Server objects can have a database. |
| Caller_CodeUnit_Schema | The schema where the code unit referencing the missing element is deployed. |
| Caller_CodeUnit_Name | The name of the code unit referencing the missing element. |
| Caller_CodeUnit_FullName | The full qualified name of the code unit referencing the missing element. |
| Referenced_Element_Database | The database where the missing element is deployed. For now, only SQL Server objects can have a database. |
| Referenced_Element_Schema | The schema where the missing element is deployed. |
| Referenced_Element_Name | The name of the missing element. |
| Referenced_Element_FullName | The full qualified name of the missing element. |
| Line | The line number inside the file where the reference is located. |
| Relation_Type | Shows the type of relation used through the caller code unit and the MISSING reference. |

### Known Issues

> **Warning:**
>
> Variables defined in shell files used in script files like .bteq are considered missing objects because their definition is not part of the input files that SnowConvert AI processes. E.g. the `myDB` variable is defined in the shell file but this is a file that is not part of the input for SnowConvert AI. Only the .bteq file will be processed and therefore, line 5 will be marked as a missing reference.

```sh
export myDB=exampleDatabase
bteq < example.bteq
```

```sql
.LABEL EX_SQE

create multiset volatile table DR as
   select * from ${myDB}.myTable;
```

> **Warning:**
>
> Preprocessing an Oracle workload by splitting packages can result in extra missing references if the package’s schema is not specified in the extracted objects.

**Original Code**

```sql
CREATE package Schema1.Package1
IS
  CREATE TABLE Table1 (
    col1 INTEGER
  );

  CREATE PROCEDURE Proc1
    BEGIN
      SELECT * FROM Schema1.Table1;
    END

END
```

Notice that in this case, `Table1` is automatically created within the schema `Schema1`, so the reference in line 9 resolves correctly. However, if a package split process is executed prior to the migration and the resulting files are like these ones:

**Modified Code after a package split process**

```sql
  CREATE TABLE Table1 (
    col1 INTEGER1
  );
```

```sql
CREATE PROCEDURE Proc1
    BEGIN
        SELECT * FROM Schema1.Table1;
    END
```

The reference on line 3 of the file `Schema1_Proc1.sql` will be marked as a missing reference, because `Table1` was not explicitly created within the schema `Schema1`.

---
title: SnowConvert AI - Netezza - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/create-table/netezza-create-table.md
section: Migrations
---

# SnowConvert AI - Netezza - CREATE TABLE

Translation from Netezza to Snowflake

## Description

Creates a new table in Netezza. For more information, please refer to [`CREATE TABLE`](https://www.ibm.com/docs/en/netezza?topic=npsscr-create-table) documentation.

> **Warning:**
>
> This grammar is partially supported in Snowflake. Translation pending for these table options:
>
> ```sql
> [ ORGANIZE ON { (<col>) | NONE } ]
> [ ROW SECURITY ]
> [ DATA_VERSION_RETENTION_TIME <number-of-days> ]
> ```

## Grammar Syntax

```sql
CREATE [ TEMPORARY | TEMP ] TABLE [IF NOT EXISTS] <table>
( <col> <type> [<col_constraint>][,<col> <type> [<col_constraint>]…]
<table_constraint> [,<table_constraint>… ] )
[ DISTRIBUTE ON { RANDOM | [HASH] (<col>[,<col>…]) } ]
[ ORGANIZE ON { (<col>) | NONE } ]
[ ROW SECURITY ]
[ DATA_VERSION_RETENTION_TIME <number-of-days> ]
```

## DISTRIBUTE ON RANDOM - DISTRIBUTE ON HASH

> **Note:**
>
> This syntax is not needed in Snowflake.

These clauses controls how table data is physically distributed across the system’s segments. As Snowflake automatically handles data storage, these options will be removed in the migration.

### Grammar Syntax

```sql
DISTRIBUTE ON { RANDOM | [HASH] (<col>[,<col>…]) }
```

### Sample Source Patterns

#### Input Code:

##### Greenplum

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
DISTRIBUTE ON RANDOM;
```

#### Output Code:

##### Snowflake

```sql
CREATE TABLE table1 (colum1 int, colum2 int, colum3 smallint, colum4 int )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "netezza",  "convertedOn": "05/11/2025",  "domain": "test" }}'
;
```

## Related EWIs

1. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - Netezza - Data types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/data-types/netezza-data-types.md
section: Migrations
---

# SnowConvert AI - Netezza - Data types

Current Data types conversion for Netezza to Snowflake.

The following data types are specific to [Netezza](https://www.ibm.com/docs/en/netezza?topic=vc-data-types-aliases). For more information please refer to the [PostgreSQL & based languages data types documentation](postgresql-data-types.md).

| Netezza | Snowflake |
| --- | --- |
| DOUBLE | DOUBLE |
| BYTEINT | BYTEINT |
| INT1 | BYTEINT    *Notes: This type is an alias* of BYTEINT at Netezza. |
| TIMESPAN | *VARCHAR*    *Notes: This type is an alias* of INTERVAL at Netezza*. This data type is **not supported** in Snowflake. VARCHAR is used instead. For more information please refer to* [*SSC-EWI-0036*](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*.* |

## Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

---
title: SnowConvert AI - Object Conversion Summary
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/object-conversion-summary.md
section: Migrations
---

# SnowConvert AI - Object Conversion Summary

## Identified Objects

The count of all the top-level DDL objects (such as Table, View, and Procedure) that the SnowConvert AI identified. If an object has a parsing error that makes it unreconcilable, it would not be identified.

### CSV Associated field name

* TotalIdentifiedObjects

#### Sample

```sql
-- Statement without parsing error
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

-- Statements with parsing error
CREATE TABLE table2(
     column1 INT,
     column2 INT INT
);

CRATE TABLE table3(
     column1 INT
);
```

```sql
-- Statement without parsing error
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- Statements with parsing error
CREATE OR REPLACE TABLE table2 (
     column1 INT
--                ,
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '10' COLUMN '6' OF THE SOURCE CODE STARTING AT 'column2'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS 'INT' ON LINE '10' COLUMN '14'. CODE '15'. **
--     column2 INT INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '13' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '13' COLUMN '1'. CODE '81'. **
--CRATE TABLE table3(
--     column1 INT
--)
 ;
```

**Expected Identified Objects: 2**

**Explanation:** The `table1` presented doesn’t have a parsing error; the `table2` even though it has a parsing error, the parser is still capable of recognizing the object as a table, so both are counted as an identified object; the `table3` has a parsing error that makes it unreconcilable for the parser and, as a consequence, is not counted as an identified object.

## Object Conversion Rate

The percentage of fully converted objects among the objects identified

### Formula

```none
(identify_objects_converted_successfully / total_identify_objects) * 100
```

#### CSV Associated field name

* ObjectConversionRate

#### Sample

```sql
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

CREATE VIEW view1 AS
SELECT orderkey
FROM orders;

CREATE TABLE table2(
     COLNAME VARCHAR(20)
)
ON COMMIT PRESERVE ROWS;
```

```sql
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "orders" **
CREATE OR REPLACE VIEW view1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
     orderkey
FROM
     orders;

CREATE TABLE OR REPLACE table2 (
COLNAME VARCHAR(20)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

**Expected Object Conversion Rate: 66.66%**

**Explanation:** In this example we have 3 statements, all of them have been identified as an object, but just the `table1`, and the `view1` have a 100% conversion rate. The `table3` has an error warning meaning that the conversion of this table is not 100%, that’s why just 2 of the 3 statements are counted as fully converted objects.

## Fully Converted Objects

The number of identify objects that were converted successfully, meaning this objects have a 100% conversion rate.

### CSV Associated field name

* ObjectsSuccessfullyConverted

#### Sample

```sql
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

CREATE VIEW view1 AS
SELECT orderkey
FROM orders;

CREATE TABLE table2(
     COLNAME VARCHAR(20)
)
ON COMMIT PRESERVE ROWS;
```

```sql
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

CREATE OR REPLACE VIEW view1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
     orderkey
FROM
     orders;

CREATE OR REPLACE TABLE table2 (
COLNAME VARCHAR(20)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

**Expected Fully Converted Objects: 2**

**Explanation:** In this example we have 3 statements, all of them have been identified as an object, but just the `table1`, and the `view1` have a 100% conversion rate. The `table3` has an error warning meaning that the conversion of this table is not 100%, that’s why just 2 of the 3 statements are counted as fully converted objects.

## Unrecognized Elements

Represents any code element (or parts of them) such as DML, DDL, control statements, with parsing errors that SnowConvert AI was unable to process.

### CSV Associated field name

* UnrecognizedElements

#### Sample

```sql
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

CREATE VIEWW view1 AS
SELECT orderkey
FROM orders;
```

```sql
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '6' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CREATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CREATE' ON LINE '6' COLUMN '1'. CODE '81'. **
--CREATE VIEWW view1 AS
--SELECT orderkey
--FROM orders;
```

**Expected Unrecognized Elements: 1**

**Explanation:** In this example we have 2 statements, the table1 is successfully identified as an object, in the other hand the view1, has a parsing error that means it’s impossible to identify the view as an object, because of this SnowConvert AI reports 1 Unrecognized object.

## Lines of Code in Unrecognized Elements

Represents the number of lines in unrecognized elements.

### CSV Associated field name

* UnrecognizedElementsLOC

#### Sample

```sql
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

CREATE VIEWW view1 AS
SELECT orderkey
FROM orders;
```

```sql
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '6' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CREATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CREATE' ON LINE '6' COLUMN '1'. CODE '81'. **
--CREATE VIEWW view1 AS
--SELECT orderkey
--FROM orders;
```

**Expected Lines of Code in Unrecognized Elements: 3**

**Explanation:** The element `view1` is an unrecognized element, this means that the lines related to this elements are counted as Lines of Code in Unrecognized Elements.

## Wrapped Objects

Represent the number of wrapped objects present in source input code

> **Note:**
>
> This field applies only to Oracle reports.

### CSV Associated field name

* WrappedObjects

#### Sample

```sql
CREATE OR REPLACE PROCEDURE PROC123 wrapped
a000000
b2
abcd
abcd
abcd
abcd
abcd
abcd
7
5f 9a
s25TmlGXjM9M+sFyW30UiYolBNowg6Rff8upynSmTEOUpAF/NYAbDvDIFsjmTDq1lhTLv74p
xZxnFllpF1iGaIfGOejm9divodC9qOeCQyIa89b2l+uNwqOzJHmOKVySIoi/l9IooFyJs9Es
FQyI4Q==

/
```

```sql
----** SSC-OOS - OUT OF SCOPE CODE UNIT. Wrapped PROCEDURE IS OUT OF TRANSLATION SCOPE. **
--CREATE OR REPLACE PROCEDURE PROC123 wrapped
--a000000
--b2
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--7
--5f 9a
--s25TmlGXjM9M+sFyW30UiYolBNowg6Rff8upynSmTEOUpAF/NYAbDvDIFsjmTDq1lhTLv74p
--xZxnFllpF1iGaIfGOejm9divodC9qOeCQyIa89b2l+uNwqOzJHmOKVySIoi/l9IooFyJs9Es
--FQyI4Q==
```

**Expected Lines of Code in Unrecognized Elements: 1**

**Explanation:** The procedure is declared as a wrapped object, that’s why is counted as a wrapped object.

---
title: SnowConvert AI - Object References Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/object-references-report.md
section: Migrations
---

# SnowConvert AI - Object References Report

> **Note:**
>
> Built-in elements are not considered as part of this report.

## What is an “Object Reference”?

An object reference is the term used to refer to DDL definitions in the source code, that are being referenced by code units. The table below shows which elements could be referenced in each supported language.

| Object | Teradata | Oracle | Transact-SQL | Redshift | BigQuery | Spark | Databricks | Hive | Vertica | PostgreSQL | Greenplum | Netezza | Azure Synapse | IBM DB2 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| Table | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| View | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Procedure | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Function | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |
| Macro | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |  |
| Package Function |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Package Procedure |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| \*Package |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Join Index | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |  |
| Index |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Synonym |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Database Link |  | ✓ |  |  |  |  |  |  |  |  |  |  |  |  |
| Type | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Materialized View |  | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ |  | ✓ | ✓ | ✓ | ✓ |  |
| Trigger | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Sequence | ✓ | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |
| Constraint |  | ✓ | ✓ |  |  |  |  |  |  |  |  |  | ✓ |  |

> **Note:**
>
> If an asterisk (‘\*’) is listed in the section above, it means that the object is used to call properties from itself that are not considered DDL statements such as constants, variables, or cursors.

### Where can I find it?

The object references report can be found in a folder named *“reports”*, in the output folder of your conversion. The name of the file itself starts with *“ObjectReferences”* so it can easily be located.

The format of the file is **.CSV**.

### What information does it contain?

The object references report contains the following information about all the references found while converting:

| Column | Description |
| --- | --- |
| PartitionKey | The unique identifier of the conversion. |
| FileName | The name of the file in which the object is located. |
| Caller_CodeUnit | The type of the code unit referencing an existing element. |
| Caller_CodeUnit_Database | The database of the code unit referencing an existing element. For now, only SQL Server objects can have a database. |
| Caller_CodeUnit_Schema | The schema of the code unit referencing an existing element. |
| Caller_CodeUnit_Name | The name of the code unit referencing an existing element. |
| Caller_CodeUnit_FullName | The fully qualified name of the object referencing an existing element. |
| Referenced_Element_Type | The DDL type of the referenced element. |
| Referenced_Element_Database | The database of the referenced element. For now, only SQL Server objects can have a database. |
| Referenced_Element_Schema | The schema of the referenced element. |
| Referenced_Element_Name | The name of the referenced element. |
| Referenced_Element_FullName | The full qualified name of the referenced element. |
| Line | The line number inside the file where the reference is located. |
| Relation_Type | Shows the type of relation used through the caller code unit and the object reference. |

### Oracle Database Links as object references

To get the information such as database name, schema name, or object name of database link references, we need to know how the database link was defined. Database links contain the most relevant information in the connection string used in its definition. E.g.

#### Database Link with database name

```sql
 CREATE DATABASE LINK remote_hr_db
CONNECT TO hr_user
IDENTIFIED BY hr_password
USING 'RemoteDB';

SELECT * FROM hr.employees@remote_hr_db;
```

Using the example above, the object reference information should look like this:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_Database | Referenced_Element_Schema | Referenced_Element_Name | Referenced_Element_FullName | Line |
| --- | --- | --- | --- | --- | --- | --- |
| SELECT | CREATE DATABASE LINK | RemoteDb | N/A | remote_hr_db | hr.employees@remote_hr_db | 6 |

#### Database Link with database and schema names

```sql
 CREATE DATABASE LINK remote_hr_db1
CONNECT TO hr_user
IDENTIFIED BY hr_password
USING 'RemoteDB.MySchema';

SELECT * FROM employees@remote_hr_db1;
```

Using the example above, the object reference information should look like this:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_Database | Referenced_Element_Schema | Referenced_Element_Name | Referenced_Element_FullName | Line |
| --- | --- | --- | --- | --- | --- | --- |
| SELECT | CREATE DATABASE LINK | RemoteDb | MySchema | remote_hr_db1 | hr.employees@remote_hr_db1 | 6 |

##### Database Link with a connection string

```sql
 CREATE DATABASE LINK remote_hr_db2
CONNECT TO hr_user
IDENTIFIED BY hr_password
USING '(DESCRIPTION=(
          ADDRESS=
          (PROTOCOL=TCP)
          (HOST=10.48.195.17)
          (PORT=1521))
      (CONNECT_DATA=(SID=MyDB)))';

SELECT * FROM employees@remote_hr_db2;
```

Using the example above, the object reference information should look like this:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_Database | Referenced_Element_Schema | Referenced_Element_Name | Referenced_Element_FullName | Line |
| --- | --- | --- | --- | --- | --- | --- |
| SELECT | CREATE DATABASE LINK | MyDB | N/A | remote_hr_db2 | employees@remote_hr_db2 | 6 |

### Relation Type

The relation type represents how a caller code unit is related to an object reference. SnowConvert AI is able to identify the following kinds of relations:

* FOREIGN KEY
* INSERT
* DELETE
* UPDATE
* CALL
* EXECUTE
* SYNONYM
* ALTER
* DROP
* MERGE
* TRUNCATE
* LOCK
* INDEX
* TABLE COLUMN
* GRANT
* REVOKE
* SELECT

  + COLUMN
  + FROM
  + WHERE
  + HAVING
  + GROUP BY
  + JOIN
  + ORDER BY

#### Examples

1. A stored procedure referencing a table through an UPDATE statement:

```sql
 CREATE TABLE TABLE2
(
  COL1 VARCHAR(50) NOT NULL,
  COL2 INT NOT NULL
);

CREATE OR REPLACE PROCEDURE Procedure01 (param1 NUMBER)
IS
BEGIN
    UPDATE TABLE2
    SET COL1 = 'Anderson'
    WHERE COL2 = param1;
END;
```

The report will show something like the following table:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_FullName | Line | Relation_Type |
| --- | --- | --- | --- | --- |
| CREATE PROCEDURE | CREATE TABLE | TABLE2 | 10 | UPDATE |

2. A table referencing another table through a FOREIGN KEY:

```sql
 CREATE TABLE TABLE1
(
  COL1 INT
);

CREATE TABLE TABLE2
(
  COL1 INT,
  CONSTRAINT FK_COL1 FOREIGN KEY (COL1)
    REFERENCES TABLE1(COL1)
);
```

The report will show something like the following table:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_FullName | Line | Relation_Type |
| --- | --- | --- | --- | --- |
| CREATE TABLE | CREATE TABLE | TABLE1 | 10 | FOREIGN KEY |

3. A table referenced by a view in the FROM clause of the SELECT statement:

```sql
 CREATE TABLE TABLE1
(
  COL1 INT
);

CREATE VIEW VIEW1
AS
SELECT * FROM TABLE1;
```

The report will show something like the following table:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_FullName | Line | Relation_Type |
| --- | --- | --- | --- | --- |
| CREATE VIEW | CREATE TABLE | TABLE1 | 8 | SELECT - FROM |

4. A user-defined function (UDF) referenced by a view as a result set column.

```sql
 CREATE FUNCTION FUNCTION1(PARAM1 INT)
RETURN NUMBER
IS
BEGIN
  RETURN(PARAM1 + 1);
END;

CREATE VIEW VIEW1
AS
SELECT FUNCTION1(*) FROM TABLE1;
```

The report will show something like the following table:

| Caller_CodeUnit | Referenced_Element_Type | Referenced_Element_FullName | Line | Relation_Type |
| --- | --- | --- | --- | --- |
| CREATE VIEW | CREATE FUNCTION | FUNCTION1 | 10 | SELECT - COLUMN |

---
title: SnowConvert AI - Open Source Libraries
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/terms-and-conditions/open-source-libraries.md
section: Migrations
---

# SnowConvert AI - Open Source Libraries

## .NET Open Source Libraries

| name | version | type | licenses | license urls |
| --- | --- | --- | --- | --- |
| AWSSDK.Core | 4.0.1.1 | nuget |  | <https://github.com/aws/aws-sdk-net/blob/master/License.txt> |
| AWSSDK.Redshift | 4.0.2.6 | nuget |  | <https://github.com/aws/aws-sdk-net/blob/master/License.txt> |
| AWSSDK.RedshiftServerless | 4.0.0.29 | nuget |  | <https://github.com/aws/aws-sdk-net/blob/master/License.txt> |
| AWSSDK.S3 | 4.0.4 | nuget |  | <https://github.com/aws/aws-sdk-net/blob/master/License.txt> |
| Apache.Arrow | 14.0.2 | nuget |  | <https://github.com/apache/arrow/blob/master/LICENSE.txt> |
| Azure.Core | 1.44.1 | nuget |  | <https://github.com/Azure/azure-sdk-for-net/blob/main/LICENSE.txt> |
| Azure.Identity | 1.13.2 | nuget |  | <https://github.com/Azure/azure-sdk-for-net/blob/main/LICENSE.txt> |
| Azure.Storage.Blobs | 12.24.0 | nuget |  | <https://github.com/Azure/azure-sdk-for-net/blob/main/LICENSE.txt> |
| Azure.Storage.Common | 12.23.0 | nuget |  | <https://github.com/Azure/azure-sdk-for-net/blob/main/LICENSE.txt> |
| BouncyCastle.Cryptography | 2.3.1 | nuget |  | <https://github.com/bcgit/bc-csharp/blob/master/LICENSE.md> |
| Castle.Core | 5.1.1 | nuget |  | <https://github.com/castleproject/Core/blob/master/LICENSE> |
| CommandLineParser | 2.9.1 | nuget |  | <https://github.com/commandlineparser/commandline/blob/master/License.md> |
| ConcurrentHashSet | 1.3.0 | nuget |  | <https://github.com/i3arnon/ConcurrentHashSet/blob/master/LICENSE> |
| CsvHelper | 33.0.1 | nuget |  | <https://github.com/JoshClose/CsvHelper/blob/master/LICENSE.txt> |
| DocumentFormat.OpenXml | 3.3.0 | nuget |  | <https://github.com/dotnet/Open-XML-SDK/blob/master/LICENSE> |
| DocumentFormat.OpenXml.Framework | 3.3.0 | nuget |  | <https://github.com/dotnet/Open-XML-SDK/blob/master/LICENSE> |
| ElectronCgi.DotNet.signed | 20.3.3 | nuget |  | <https://github.com/ruidfigueiredo/electron-cgi/blob/master/LICENSE> |
| Google.Api.Gax | 4.8.0 | nuget |  | <https://github.com/googleapis/gax-dotnet/blob/master/LICENSE> |
| Google.Api.Gax.Rest | 4.8.0 | nuget |  | <https://github.com/googleapis/gax-dotnet/blob/master/LICENSE> |
| Google.Apis | 1.67.0 | nuget |  | <https://github.com/googleapis/google-api-dotnet-client/blob/master/LICENSE> |
| Google.Apis.Auth | 1.67.0 | nuget |  | <https://github.com/googleapis/google-api-dotnet-client/blob/master/LICENSE> |
| Google.Apis.Core | 1.67.0 | nuget |  | <https://github.com/googleapis/google-api-dotnet-client/blob/master/LICENSE> |
| Google.Apis.Storage.v1 | 1.67.0.3365 | nuget |  | <https://github.com/googleapis/google-api-dotnet-client/blob/master/LICENSE> |
| Google.Cloud.Storage.V1 | 4.10.0 | nuget |  | <https://github.com/googleapis/google-cloud-dotnet/blob/master/LICENSE> |
| H.Formatters | 14.0.0 | nuget |  | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| H.Formatters.System.Text.Json | 14.0.0 | nuget |  | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| H.Pipes | 14.0.0 | nuget |  | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| H.Pipes.AccessControl | 14.0.0 | nuget |  | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| IronCompress | 1.6.3 | nuget |  | <https://github.com/aloneguid/ironcompress/blob/master/LICENSE> |
| Markdig | 0.41.1 | nuget |  | <https://github.com/xoofx/markdig/blob/master/license.txt> |
| Microsoft.ApplicationInsights | 2.22.0 | nuget |  | <https://github.com/Microsoft/ApplicationInsights-dotnet/blob/master/LICENSE> |
| Microsoft.Bcl | 1.1.10 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Bcl.Async | 1.0.168 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Bcl.AsyncInterfaces | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Bcl.Build | 1.0.14 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Bcl.Cryptography | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Build | 15.9.20 | nuget |  | <https://github.com/dotnet/msbuild/blob/master/LICENSE> |
| Microsoft.Build.Framework | 17.8.43 | nuget |  | <https://github.com/dotnet/msbuild/blob/master/LICENSE> |
| Microsoft.Build.Tasks.Core | 17.8.43 | nuget |  | <https://github.com/dotnet/msbuild/blob/master/LICENSE> |
| Microsoft.Build.Utilities.Core | 17.8.43 | nuget |  | <https://github.com/dotnet/msbuild/blob/master/LICENSE> |
| Microsoft.CSharp | 4.7.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| Microsoft.Data.SqlClient | 6.0.2 | nuget |  | <https://github.com/dotnet/sqlclient/blob/master/LICENSE> |
| Microsoft.Data.SqlClient.SNI.runtime | 6.0.2 | nuget |  | <https://github.com/dotnet/SqlClient/blob/main/LICENSE> |
| Microsoft.Data.Sqlite | 9.0.4 | nuget |  | <https://github.com/dotnet/efcore/blob/master/LICENSE.txt> |
| Microsoft.Data.Sqlite.Core | 9.0.4 | nuget |  | <https://github.com/dotnet/efcore/blob/master/LICENSE.txt> |
| Microsoft.DotNet.PlatformAbstractions | 3.1.6 | nuget |  | <https://github.com/dotnet/core-setup/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Caching.Abstractions | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Caching.Memory | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Abstractions | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Binder | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.CommandLine | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.EnvironmentVariables | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.FileExtensions | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Json | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.UserSecrets | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.DependencyInjection | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.DependencyInjection.Abstractions | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.DependencyModel | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Diagnostics | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Diagnostics.Abstractions | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileProviders.Abstractions | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileProviders.Physical | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileSystemGlobbing | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Hosting | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Hosting.Abstractions | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Localization | 9.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| Microsoft.Extensions.Localization.Abstractions | 9.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| Microsoft.Extensions.Logging | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Abstractions | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Configuration | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Console | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Debug | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.EventLog | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.EventSource | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.TraceSource | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Options | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Options.ConfigurationExtensions | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Primitives | 9.0.5 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.FeatureManagement | 2.0.0 | nuget |  | <https://github.com/microsoft/FeatureManagement-Dotnet/blob/main/LICENSE> |
| Microsoft.IO.RecyclableMemoryStream | 3.0.1 | nuget |  | <https://github.com/Microsoft/Microsoft.IO.RecyclableMemoryStream/blob/master/LICENSE> |
| Microsoft.Identity.Client | 4.67.2 | nuget |  | <https://github.com/AzureAD/microsoft-authentication-library-for-dotnet/blob/master/LICENSE> |
| Microsoft.Identity.Client.Extensions.Msal | 4.67.2 | nuget |  | <https://github.com/AzureAD/microsoft-authentication-library-for-dotnet/blob/master/LICENSE> |
| Microsoft.IdentityModel.Abstractions | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.IdentityModel.JsonWebTokens | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.IdentityModel.Logging | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.IdentityModel.Protocols | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.IdentityModel.Protocols.OpenIdConnect | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.IdentityModel.Tokens | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| Microsoft.NET.StringTools | 17.8.43 | nuget |  | <https://github.com/dotnet/msbuild/blob/master/LICENSE> |
| Microsoft.NETCore.Platforms | 5.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.NETCore.Targets | 1.1.3 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Net.Http | 2.2.29 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.SqlServer.DacFx | 170.0.94 | nuget |  | <https://github.com/microsoft/DacFx/blob/master/LICENSE.txt> |
| Microsoft.SqlServer.Server | 1.0.0 | nuget |  | <https://github.com/dotnet/sqlclient/blob/master/LICENSE> |
| Microsoft.SqlServer.TransactSql.ScriptDom | 170.18.0 | nuget |  | <https://github.com/microsoft/SqlScriptDOM/blob/master/LICENSE> |
| Microsoft.SqlServer.Types | 160.1000.6 | nuget |  | <https://www.nuget.org/packages/Microsoft.SqlServer.Types/160.1000.6/License> |
| Microsoft.VisualStudio.Setup.Configuration.Interop | 3.2.2146 | nuget |  | <https://www.nuget.org/packages/Microsoft.VisualStudio.Setup.Configuration.Interop/3.2.2146/License> |
| Microsoft.Win32.Registry | 4.7.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Mono.Unix | 7.1.0-final.1.21458.1 | nuget |  | <https://github.com/mono/mono.posix/blob/master/LICENSE> |
| My.Extensions.Localization.Json | 3.4.0 | nuget |  | <https://github.com/hishamco/My.Extensions.Localization.Json/blob/master/LICENSE> |
| NamedPipeServerStream.NetFrameworkVersion | 1.1.13 | nuget |  | <https://github.com/HavenDV/NamedPipeServerStream.NetFrameworkVersion/blob/master/LICENSE.txt> |
| Newtonsoft.Json | 13.0.4 | nuget |  | <https://github.com/JamesNK/Newtonsoft.Json/blob/master/LICENSE.md> |
| Npgsql | 9.0.3 | nuget |  | <https://github.com/npgsql/npgsql/blob/master/LICENSE> |
| Parquet.Net | 5.1.1 | nuget |  | <https://github.com/aloneguid/parquet-dotnet/blob/master/LICENSE> |
| SQLitePCLRaw.bundle_e_sqlite3 | 2.1.2 | nuget |  | <https://github.com/ericsink/SQLitePCL.raw/blob/master/LICENSE.TXT> |
| SQLitePCLRaw.bundle_green | 2.1.11 | nuget |  | <https://github.com/ericsink/SQLitePCL.raw/blob/master/LICENSE.TXT> |
| SQLitePCLRaw.core | 2.1.2 | nuget |  | <https://github.com/ericsink/SQLitePCL.raw/blob/master/LICENSE.TXT> |
| SQLitePCLRaw.lib.e_sqlite3 | 2.1.2 | nuget |  | <https://github.com/ericsink/SQLitePCL.raw/blob/master/LICENSE.TXT> |
| SQLitePCLRaw.provider.e_sqlite3 | 2.1.2 | nuget |  | <https://github.com/ericsink/SQLitePCL.raw/blob/master/LICENSE.TXT> |
| Serilog | 4.2.0 | nuget |  | <https://github.com/serilog/serilog/blob/master/LICENSE> |
| Serilog.AspNetCore | 7.0.0 | nuget |  | <https://github.com/serilog/serilog-aspnetcore/blob/master/LICENSE> |
| Serilog.Extensions.Hosting | 7.0.0 | nuget |  | <https://github.com/serilog/serilog-extensions-hosting/blob/master/LICENSE> |
| Serilog.Extensions.Logging | 9.0.0 | nuget |  | <https://github.com/serilog/serilog-extensions-logging/blob/master/LICENSE> |
| Serilog.Formatting.Compact | 1.1.0 | nuget |  | <https://github.com/serilog/serilog-formatting-compact/blob/master/LICENSE> |
| Serilog.Settings.Configuration | 7.0.0 | nuget |  | <https://github.com/serilog/serilog-settings-configuration/blob/master/LICENSE> |
| Serilog.Sinks.Console | 6.0.0 | nuget |  | <https://github.com/serilog/serilog-sinks-console/blob/master/LICENSE> |
| Serilog.Sinks.Debug | 2.0.0 | nuget |  | <https://github.com/serilog/serilog-sinks-debug/blob/master/LICENSE> |
| Serilog.Sinks.File | 6.0.0 | nuget |  | <https://github.com/serilog/serilog-sinks-file/blob/master/LICENSE> |
| SharpZipLib | 1.4.2 | nuget |  | <https://github.com/icsharpcode/SharpZipLib/blob/master/LICENSE.txt> |
| Snappier | 1.1.6 | nuget |  | <https://github.com/brantburnett/Snappier/blob/master/LICENSE> |
| Spectre.Console | 0.49.1 | nuget |  | <https://github.com/spectreconsole/spectre.console/blob/master/LICENSE.md> |
| Spectre.Console.Cli | 0.49.1 | nuget |  | <https://github.com/spectreconsole/spectre.console/blob/master/LICENSE.md> |
| System.Buffers | 4.3.0 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.ClientModel | 1.1.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.CodeDom | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Collections | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.Collections.Concurrent | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.Collections.Immutable | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.ComponentModel.Composition | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.Configuration.ConfigurationManager | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.Data.DataSetExtensions | 4.5.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Diagnostics.Debug | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Diagnostics.DiagnosticSource | 6.0.1 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Diagnostics.EventLog | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Diagnostics.TraceSource | 4.0.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Diagnostics.Tracing | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Globalization | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Globalization.Calendars | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Globalization.Extensions | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IO | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.IO.Abstractions | 22.0.14 | nuget |  | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| System.IO.Compression | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.IO.Compression.ZipFile | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IO.FileSystem | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.IO.FileSystem.AccessControl | 5.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IO.FileSystem.Primitives | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.IO.Hashing | 6.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IO.Packaging | 8.0.1 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IdentityModel.Tokens.Jwt | 7.5.0 | nuget |  | <https://github.com/AzureAD/azure-activedirectory-identitymodel-extensions-for-dotnet/blob/main/LICENSE.txt> |
| System.Linq | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Linq.Async | 6.0.1 | nuget |  | <https://github.com/dotnet/reactive/blob/master/LICENSE> |
| System.Management | 7.0.2 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Memory | 4.6.3 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Memory.Data | 6.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Net.Http | 4.3.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Net.Primitives | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Numerics.Vectors | 4.5.0 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Reflection | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Reflection.Emit.Lightweight | 4.7.0 | nuget |  | <https://github.com/dotnet/runtime/blob/main/LICENSE.TXT> |
| System.Reflection.Extensions | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Reflection.Metadata | 1.6.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Reflection.Primitives | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Reflection.TypeExtensions | 4.1.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Resources.Extensions | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Resources.ResourceManager | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime | 4.3.1 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.Caching | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Runtime.CompilerServices.Unsafe | 6.1.2 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Runtime.Extensions | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.Handles | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.InteropServices | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.InteropServices.RuntimeInformation | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.Loader | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Runtime.Numerics | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Security.AccessControl | 6.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Algorithms | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Security.Cryptography.Cng | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Csp | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Encoding | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Security.Cryptography.OpenSsl | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Pkcs | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Primitives | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Security.Cryptography.ProtectedData | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.X509Certificates | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.Xml | 7.0.1 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Permissions | 8.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Principal.Windows | 5.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.Encoding | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Text.Encoding.CodePages | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.Encodings.Web | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.Json | 9.0.9 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.RegularExpressions | 4.3.1 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Threading | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Threading.AccessControl | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Threading.Channels | 9.0.4 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Threading.Tasks | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| System.Threading.Tasks.DataFlow | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Threading.Tasks.Dataflow | 7.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Threading.Tasks.Extensions | 4.5.4 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.ValueTuple | 4.5.0 | nuget |  | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Windows.Extensions | 8.0.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| TestableIO.System.IO.Abstractions | 22.0.14 | nuget |  | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| TestableIO.System.IO.Abstractions.Wrappers | 22.0.14 | nuget |  | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| Testably.Abstractions.FileSystem.Interface | 9.0.0 | nuget |  | <https://github.com/Testably/Testably.Abstractions/blob/master/LICENSE> |
| TinyCsvParser | 2.7.1 | nuget |  | <https://github.com/bytefish/TinyCsvParser/blob/master/LICENSE> |
| Tomlyn.Signed | 0.17.0 | nuget |  | <https://github.com/xoofx/Tomlyn/blob/master/license.txt> |
| YamlDotNet | 16.3.0 | nuget |  | <https://github.com/aaubry/YamlDotNet/blob/master/LICENSE.txt> |
| ZstdSharp.Port | 0.8.1 | nuget |  | <https://github.com/oleg-st/ZstdSharp/blob/master/LICENSE> |
| coverlet.collector | 6.0.0 | nuget |  | <https://github.com/coverlet-coverage/coverlet/blob/master/LICENSE> |
| runtime.debian.8-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.fedora.23-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.fedora.24-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.native.System | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.native.System.IO.Compression | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.native.System.Net.Http | 4.3.0 | nuget |  | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| runtime.native.System.Security.Cryptography.Apple | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.opensuse.13.2-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.opensuse.42.1-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.osx.10.10-x64.runtime.native.System.Security.Cryptography.Apple | 4.3.0 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.osx.10.10-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.rhel.7-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.ubuntu.14.04-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.ubuntu.16.04-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |
| runtime.ubuntu.16.10-x64.runtime.native.System.Security.Cryptography.OpenSsl | 4.3.2 | nuget |  | <https://github.com/dotnet/core/blob/main/license-information.md> |

## Node Open Source Libraries

| name | version | type | licenses | license urls |
| --- | --- | --- | --- | --- |
| @babel/runtime | 7.28.4 | npm |  | <https://github.com/babel/babel/blob/master/LICENSE> |
| @cspotcode/source-map-support | 0.8.1 | npm |  | <https://github.com/cspotcode/node-source-map-support#readme/blob/master/LICENSE.md> |
| @datadog/browser-core | 6.23.0 | npm |  | <https://github.com/DataDog/browser-sdk/blob/master/LICENSE> |
| @datadog/browser-rum | 6.23.0 | npm |  | <https://github.com/DataDog/browser-sdk/blob/master/LICENSE> |
| @datadog/browser-rum-core | 6.23.0 | npm |  | <https://github.com/DataDog/browser-sdk/blob/master/LICENSE> |
| @electron/notarize | 3.1.1 | npm |  | <https://github.com/electron/notarize/blob/master/LICENSE> |
| @esbuild/aix-ppc64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/android-arm | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/android-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/android-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/darwin-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/darwin-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/freebsd-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/freebsd-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-arm | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-ia32 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-loong64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-mips64el | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-ppc64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-riscv64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-s390x | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/linux-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/netbsd-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/netbsd-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/openbsd-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/openbsd-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/openharmony-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/sunos-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/win32-arm64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/win32-ia32 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @esbuild/win32-x64 | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| @formatjs/ecma402-abstract | 2.3.6 | npm |  | <https://github.com/formatjs/formatjs/blob/main/LICENSE.md> |
| @formatjs/fast-memoize | 2.2.7 | npm |  | <https://github.com/formatjs/formatjs#readme/blob/master/LICENSE.md> |
| @formatjs/icu-messageformat-parser | 2.11.4 | npm |  | <https://github.com/formatjs/formatjs#readme/blob/master/LICENSE.md> |
| @formatjs/icu-skeleton-parser | 1.8.16 | npm |  | <https://github.com/formatjs/formatjs#readme/blob/master/LICENSE.md> |
| @formatjs/intl | 3.1.8 | npm |  | <https://github.com/formatjs/formatjs/blob/main/LICENSE.md> |
| @formatjs/intl-localematcher | 0.6.2 | npm |  | <https://github.com/formatjs/formatjs#readme/blob/master/LICENSE.md> |
| @inversifyjs/common | 1.5.2 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| @inversifyjs/container | 1.14.1 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| @inversifyjs/core | 9.1.1 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| @inversifyjs/plugin | 0.2.0 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| @inversifyjs/prototype-utils | 0.1.3 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| @inversifyjs/reflect-metadata-utils | 1.4.1 | npm |  | <https://github.com/inversify/monorepo/blob/main/LICENSE> |
| @isaacs/balanced-match | 4.0.1 | npm |  | <https://github.com/isaacs/balanced-match/blob/master/LICENSE.md> |
| @isaacs/brace-expansion | 5.0.0 | npm |  | <https://github.com/isaacs/brace-expansion/blob/main/LICENSE> |
| @isaacs/cliui | 8.0.2 | npm |  | <https://github.com/yargs/cliui#readme/blob/master/LICENSE.md> |
| @jridgewell/gen-mapping | 0.3.13 | npm |  | <https://github.com/jridgewell/gen-mapping/blob/main/LICENSE> |
| @jridgewell/resolve-uri | 3.1.2 | npm |  | <https://github.com/jridgewell/resolve-uri#readme/blob/master/LICENSE.md> |
| @jridgewell/source-map | 0.3.11 | npm |  | <https://github.com/jridgewell/source-map/blob/main/LICENSE> |
| @jridgewell/sourcemap-codec | 1.5.5 | npm |  | <https://github.com/jridgewell/sourcemap-codec/blob/main/LICENSE> |
| @jridgewell/trace-mapping | 0.3.9 | npm |  | <https://github.com/jridgewell/trace-mapping/blob/main/LICENSE> |
| @kurkle/color | 0.3.4 | npm |  | <https://github.com/kurkle/color/blob/master/LICENSE.md> |
| @parcel/watcher | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-android-arm64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-darwin-arm64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-darwin-x64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-freebsd-x64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-arm-glibc | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-arm-musl | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-arm64-glibc | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-arm64-musl | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-x64-glibc | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-linux-x64-musl | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-win32-arm64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-win32-ia32 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @parcel/watcher-win32-x64 | 2.5.1 | npm |  | <https://github.com/parcel-bundler/watcher/blob/master/LICENSE> |
| @pkgjs/parseargs | 0.11.0 | npm |  | <https://github.com/pkgjs/parseargs/blob/main/LICENSE> |
| @react-aria/ssr | 3.9.10 | npm |  | <https://github.com/adobe/react-spectrum/blob/master/LICENSE> |
| @react-aria/utils | 3.31.0 | npm |  | <https://github.com/adobe/react-spectrum/blob/master/LICENSE> |
| @react-stately/flags | 3.1.2 | npm |  | <https://github.com/adobe/react-spectrum/blob/master/LICENSE> |
| @react-stately/utils | 3.10.8 | npm |  | <https://github.com/adobe/react-spectrum/blob/master/LICENSE> |
| @react-types/shared | 3.32.1 | npm |  | <https://github.com/adobe/react-spectrum/blob/master/LICENSE> |
| @rollup/rollup-android-arm-eabi | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-android-arm64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-darwin-arm64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-darwin-x64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-freebsd-arm64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-freebsd-x64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-arm-gnueabihf | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-arm-musleabihf | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-arm64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-arm64-musl | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-loong64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-ppc64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-riscv64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-riscv64-musl | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-s390x-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-x64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-linux-x64-musl | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-openharmony-arm64 | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-win32-arm64-msvc | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-win32-ia32-msvc | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-win32-x64-gnu | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @rollup/rollup-win32-x64-msvc | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| @sqltools/formatter | 1.2.5 | npm |  | <https://github.com/mtxr/vscode-sqltools/blob/master/LICENSE.md> |
| @stylexjs/stylex | 0.15.4 | npm |  | <https://github.com/facebook/stylex/blob/master/LICENSE> |
| @swc/core | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-darwin-arm64 | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-darwin-x64 | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-linux-arm-gnueabihf | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-linux-arm64-gnu | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-linux-arm64-musl | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-linux-x64-gnu | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-linux-x64-musl | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-win32-arm64-msvc | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-win32-ia32-msvc | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/core-win32-x64-msvc | 1.15.0 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/counter | 0.1.3 | npm |  | <https://github.com/swc-project/swc/blob/main/LICENSE> |
| @swc/helpers | 0.5.17 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @swc/types | 0.1.25 | npm |  | <https://github.com/swc-project/swc/blob/master/LICENSE> |
| @tanstack/history | 1.133.28 | npm |  | <https://github.com/TanStack/router/blob/master/LICENSE> |
| @tanstack/query-core | 5.90.7 | npm |  | <https://github.com/TanStack/query/blob/master/LICENSE> |
| @tanstack/query-devtools | 5.90.1 | npm |  | <https://github.com/TanStack/query/blob/master/LICENSE> |
| @tanstack/react-query | 5.90.7 | npm |  | <https://github.com/TanStack/query/blob/master/LICENSE> |
| @tanstack/react-query-devtools | 5.90.2 | npm |  | <https://github.com/TanStack/query/blob/master/LICENSE> |
| @tanstack/react-router | 1.134.13 | npm |  | <https://github.com/TanStack/router/blob/master/LICENSE> |
| @tanstack/react-router-devtools | 1.134.13 | npm |  | <https://github.com/TanStack/router/blob/master/LICENSE> |
| @tanstack/react-store | 0.8.0 | npm |  | <https://github.com/TanStack/store/blob/master/LICENSE> |
| @tanstack/router-core | 1.134.13 | npm |  | <https://github.com/TanStack/router/blob/master/LICENSE> |
| @tanstack/router-devtools-core | 1.134.13 | npm |  | <https://github.com/TanStack/router/blob/master/LICENSE> |
| @tanstack/store | 0.8.0 | npm |  | <https://github.com/TanStack/store/blob/master/LICENSE> |
| @tsconfig/node10 | 1.0.11 | npm |  | <https://github.com/tsconfig/bases/blob/master/LICENSE.md> |
| @tsconfig/node12 | 1.0.11 | npm |  | <https://github.com/tsconfig/bases/blob/master/LICENSE.md> |
| @tsconfig/node14 | 1.0.3 | npm |  | <https://github.com/tsconfig/bases/blob/master/LICENSE.md> |
| @tsconfig/node16 | 1.0.4 | npm |  | <https://github.com/tsconfig/bases/blob/master/LICENSE.md> |
| @types/debug | 4.1.12 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/estree | 1.0.8 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/estree-jsx | 1.0.5 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/hast | 3.0.4 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/hoist-non-react-statics | 3.3.7 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/mdast | 4.0.4 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/ms | 2.1.0 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/node | 22.13.10 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/react | 19.2.2 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/react-transition-group | 4.4.12 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @types/unist | 3.0.3 | npm |  | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| @ungap/structured-clone | 1.3.0 | npm |  | <https://github.com/ungap/structured-clone/blob/master/LICENSE> |
| acorn | 8.15.0 | npm |  | <https://github.com/acornjs/acorn/blob/master/acorn/LICENSE> |
| acorn-walk | 8.3.4 | npm |  | <https://github.com/acornjs/acorn/blob/master/acorn-walk/LICENSE> |
| ajv | 8.17.1 | npm |  | <https://github.com/ajv-validator/ajv/blob/master/LICENSE> |
| ajv-formats | 3.0.1 | npm |  | <https://github.com/ajv-validator/ajv-formats/blob/master/LICENSE> |
| ansi-regex | 6.2.2 | npm |  | <https://github.com/chalk/ansi-regex/blob/master/license> |
| ansi-styles | 6.2.3 | npm |  | <https://github.com/chalk/ansi-styles/blob/master/license> |
| ansis | 3.17.0 | npm |  | <https://github.com/webdiscus/ansis/blob/master/LICENSE> |
| app-root-path | 3.1.0 | npm |  | <https://github.com/inxilpro/node-app-root-path/blob/master/LICENSE> |
| arg | 4.1.3 | npm |  | <https://github.com/vercel/arg#readme/blob/master/LICENSE.md> |
| argparse | 2.0.1 | npm |  | <https://github.com/nodeca/argparse/blob/master/LICENSE> |
| atomically | 2.1.0 | npm |  | <https://github.com/fabiospampinato/atomically/blob/master/license> |
| available-typed-arrays | 1.0.7 | npm |  | <https://github.com/inspect-js/available-typed-arrays/blob/master/LICENSE> |
| bail | 2.0.2 | npm |  | <https://github.com/wooorm/bail/blob/master/license> |
| balanced-match | 1.0.2 | npm |  | <https://github.com/juliangruber/balanced-match/blob/master/LICENSE.md> |
| base64-js | 1.5.1 | npm |  | <https://github.com/beatgammit/base64-js/blob/master/LICENSE> |
| brace-expansion | 2.0.2 | npm |  | <https://github.com/juliangruber/brace-expansion/blob/master/LICENSE> |
| braces | 3.0.3 | npm |  | <https://github.com/micromatch/braces/blob/master/LICENSE> |
| buffer | 6.0.3 | npm |  | <https://github.com/feross/buffer/blob/master/LICENSE> |
| buffer-from | 1.1.2 | npm |  | <https://github.com/LinusU/buffer-from/blob/master/LICENSE> |
| builder-util-runtime | 9.5.0 | npm |  | <https://github.com/electron-userland/electron-builder/blob/master/LICENSE> |
| call-bind | 1.0.8 | npm |  | <https://github.com/ljharb/call-bind/blob/master/LICENSE> |
| call-bind-apply-helpers | 1.0.2 | npm |  | <https://github.com/ljharb/call-bind-apply-helpers/blob/master/LICENSE> |
| call-bound | 1.0.4 | npm |  | <https://github.com/ljharb/call-bound/blob/master/LICENSE> |
| ccount | 2.0.1 | npm |  | <https://github.com/wooorm/ccount#readme/blob/master/LICENSE.md> |
| change-case | 5.4.4 | npm |  | <https://github.com/blakeembrey/change-case/blob/master/LICENSE> |
| character-entities | 2.0.2 | npm |  | <https://github.com/wooorm/character-entities/blob/master/license> |
| character-entities-html4 | 2.1.0 | npm |  | <https://github.com/wooorm/character-entities-html4/blob/master/license> |
| character-entities-legacy | 3.0.0 | npm |  | <https://github.com/wooorm/character-entities-legacy/blob/master/license> |
| character-reference-invalid | 2.0.1 | npm |  | <https://github.com/wooorm/character-reference-invalid/blob/master/license> |
| chart.js | 4.5.1 | npm |  | <https://github.com/chartjs/Chart.js/blob/master/LICENSE.md> |
| chokidar | 4.0.3 | npm |  | <https://github.com/paulmillr/chokidar/blob/master/LICENSE> |
| cliui | 8.0.1 | npm |  | <https://github.com/yargs/cliui#readme/blob/master/LICENSE.md> |
| clsx | 2.1.1 | npm |  | <https://github.com/lukeed/clsx/blob/master/license> |
| color-convert | 2.0.1 | npm |  | <https://github.com/Qix-/color-convert#readme/blob/master/LICENSE.md> |
| color-name | 1.1.4 | npm |  | <https://github.com/colorjs/color-name/blob/master/LICENSE> |
| comma-separated-tokens | 2.0.3 | npm |  | <https://github.com/wooorm/comma-separated-tokens/blob/master/license> |
| commander | 2.20.3 | npm |  | <https://github.com/tj/commander.js/blob/master/LICENSE> |
| conf | 15.0.2 | npm |  | <https://github.com/sindresorhus/conf/blob/master/license> |
| cookie | 1.0.2 | npm |  | <https://github.com/jshttp/cookie/blob/master/LICENSE> |
| cookie-es | 2.0.0 | npm |  | <https://github.com/unjs/cookie-es/blob/master/LICENSE> |
| core-util-is | 1.0.3 | npm |  | <https://github.com/isaacs/core-util-is/blob/master/LICENSE> |
| create-require | 1.1.1 | npm |  | <https://github.com/nuxt-contrib/create-require/blob/master/LICENSE> |
| cross-spawn | 7.0.6 | npm |  | <https://github.com/moxystudio/node-cross-spawn/blob/master/LICENSE> |
| css-mediaquery | 0.1.2 | npm |  | <https://github.com/ericf/css-mediaquery/blob/master/LICENSE> |
| csstype | 3.1.3 | npm |  | <https://github.com/frenic/csstype/blob/master/LICENSE> |
| dayjs | 1.11.19 | npm |  | <https://github.com/iamkun/dayjs/blob/master/LICENSE> |
| debounce-fn | 6.0.0 | npm |  | <https://github.com/sindresorhus/debounce-fn/blob/master/license> |
| debug | 4.4.3 | npm |  | <https://github.com/debug-js/debug#readme/blob/master/LICENSE.md> |
| decimal.js | 10.6.0 | npm |  | <https://github.com/MikeMcl/decimal.js#readme/blob/master/LICENSE.md> |
| decode-named-character-reference | 1.2.0 | npm |  | <https://github.com/wooorm/decode-named-character-reference/blob/master/license> |
| dedent | 1.7.0 | npm |  | <https://github.com/dmnd/dedent/blob/master/LICENSE> |
| define-data-property | 1.1.4 | npm |  | <https://github.com/ljharb/define-data-property/blob/master/LICENSE> |
| dequal | 2.0.3 | npm |  | <https://github.com/lukeed/dequal/blob/master/license> |
| detect-libc | 1.0.3 | npm |  | <https://github.com/lovell/detect-libc/blob/master/LICENSE> |
| devlop | 1.1.0 | npm |  | <https://github.com/wooorm/devlop/blob/master/license> |
| diff | 4.0.2 | npm |  | <https://github.com/kpdecker/jsdiff/blob/master/LICENSE> |
| dom-helpers | 5.2.1 | npm |  | <https://github.com/react-bootstrap/dom-helpers/blob/master/LICENSE> |
| dot-prop | 10.1.0 | npm |  | <https://github.com/sindresorhus/dot-prop/blob/master/license> |
| dotenv | 17.2.3 | npm |  | <https://github.com/motdotla/dotenv/blob/master/LICENSE> |
| dunder-proto | 1.0.1 | npm |  | <https://github.com/es-shims/dunder-proto/blob/master/LICENSE> |
| eastasianwidth | 0.2.0 | npm |  | <https://github.com/komagata/eastasianwidth/blob/master/MIT-LICENSE.txt> |
| electron-cgi | 1.0.6 | npm |  | <https://github.com/ruidfigueiredo/electron-cgi#readme/blob/master/LICENSE.md> |
| electron-debug | 4.1.0 | npm |  | <https://github.com/sindresorhus/electron-debug#readme/blob/master/LICENSE.md> |
| electron-is-accelerator | 0.1.2 | npm |  | <https://github.com/brrd/electron-is-accelerator/blob/master/LICENSE> |
| electron-is-dev | 3.0.1 | npm |  | <https://github.com/sindresorhus/electron-is-dev/blob/master/license> |
| electron-localshortcut | 3.2.1 | npm |  | <https://github.com/parro-it/electron-localshortcut#readme/blob/master/LICENSE.md> |
| electron-log | 5.4.3 | npm |  | <https://github.com/megahertz/electron-log#readme/blob/master/LICENSE.md> |
| electron-store | 11.0.2 | npm |  | <https://github.com/sindresorhus/electron-store/blob/master/license> |
| electron-updater | 6.7.0 | npm |  | <https://github.com/electron-userland/electron-builder/blob/master/LICENSE> |
| emoji-regex | 9.2.2 | npm |  | <https://github.com/mathiasbynens/emoji-regex/blob/main/LICENSE-MIT.txt> |
| entities | 6.0.1 | npm |  | <https://github.com/fb55/entities/blob/master/LICENSE> |
| env-paths | 3.0.0 | npm |  | <https://github.com/sindresorhus/env-paths/blob/master/license> |
| err-code | 2.0.3 | npm |  | <https://github.com/IndigoUnited/js-err-code#readme/blob/master/LICENSE.md> |
| es-define-property | 1.0.1 | npm |  | <https://github.com/ljharb/es-define-property/blob/master/LICENSE> |
| es-errors | 1.3.0 | npm |  | <https://github.com/ljharb/es-errors/blob/master/LICENSE> |
| es-object-atoms | 1.1.1 | npm |  | <https://github.com/ljharb/es-object-atoms/blob/master/LICENSE> |
| esbuild | 0.25.12 | npm |  | <https://github.com/evanw/esbuild/blob/master/LICENSE.md> |
| escalade | 3.2.0 | npm |  | <https://github.com/lukeed/escalade/blob/master/license> |
| escape-string-regexp | 5.0.0 | npm |  | <https://github.com/sindresorhus/escape-string-regexp/blob/master/license> |
| estree-util-is-identifier-name | 3.0.0 | npm |  | <https://github.com/syntax-tree/estree-util-is-identifier-name/blob/master/license> |
| eventemitter3 | 5.0.1 | npm |  | <https://github.com/primus/eventemitter3/blob/master/LICENSE> |
| extend | 3.0.2 | npm |  | <https://github.com/justmoon/node-extend/blob/master/LICENSE> |
| fast-deep-equal | 3.1.3 | npm |  | <https://github.com/epoberezkin/fast-deep-equal/blob/master/LICENSE> |
| fast-diff | 1.3.0 | npm |  | <https://github.com/jhchen/fast-diff/blob/master/LICENSE> |
| fast-uri | 3.1.0 | npm |  | <https://github.com/fastify/fast-uri/blob/main/LICENSE> |
| fdir | 6.5.0 | npm |  | <https://github.com/thecodrr/fdir/blob/master/LICENSE> |
| fill-range | 7.1.1 | npm |  | <https://github.com/jonschlinkert/fill-range/blob/master/LICENSE> |
| for-each | 0.3.5 | npm |  | <https://github.com/Raynos/for-each/blob/master/LICENSE> |
| foreground-child | 3.3.1 | npm |  | <https://github.com/tapjs/foreground-child/blob/master/LICENSE.md> |
| fs-extra | 10.1.0 | npm |  | <https://github.com/jprichardson/node-fs-extra/blob/master/LICENSE> |
| fsevents | 2.3.3 | npm |  | <https://github.com/fsevents/fsevents/blob/master/LICENSE> |
| function-bind | 1.1.2 | npm |  | <https://github.com/Raynos/function-bind/blob/master/LICENSE> |
| get-caller-file | 2.0.5 | npm |  | <https://github.com/stefanpenner/get-caller-file/blob/master/LICENSE.md> |
| get-intrinsic | 1.3.0 | npm |  | <https://github.com/ljharb/get-intrinsic/blob/master/LICENSE> |
| get-proto | 1.0.1 | npm |  | <https://github.com/ljharb/get-proto/blob/master/LICENSE> |
| glob | 11.0.3 | npm |  | <https://github.com/isaacs/node-glob/blob/main/LICENSE.md> |
| goober | 2.1.18 | npm |  | <https://github.com/cristianbote/goober/blob/master/LICENSE> |
| gopd | 1.2.0 | npm |  | <https://github.com/ljharb/gopd/blob/master/LICENSE> |
| graceful-fs | 4.2.11 | npm |  | <https://github.com/isaacs/node-graceful-fs/blob/master/LICENSE.md> |
| has-property-descriptors | 1.0.2 | npm |  | <https://github.com/inspect-js/has-property-descriptors/blob/master/LICENSE> |
| has-symbols | 1.1.0 | npm |  | <https://github.com/inspect-js/has-symbols/blob/master/LICENSE> |
| has-tostringtag | 1.0.2 | npm |  | <https://github.com/inspect-js/has-tostringtag#readme/blob/master/LICENSE.md> |
| hasown | 2.0.2 | npm |  | <https://github.com/inspect-js/hasOwn/blob/master/LICENSE> |
| hast-util-from-parse5 | 8.0.3 | npm |  | <https://github.com/syntax-tree/hast-util-from-parse5/blob/master/license> |
| hast-util-is-element | 3.0.0 | npm |  | <https://github.com/syntax-tree/hast-util-is-element#readme/blob/master/LICENSE.md> |
| hast-util-parse-selector | 4.0.0 | npm |  | <https://github.com/syntax-tree/hast-util-parse-selector/blob/master/license> |
| hast-util-raw | 9.1.0 | npm |  | <https://github.com/syntax-tree/hast-util-raw/blob/master/license> |
| hast-util-to-jsx-runtime | 2.3.6 | npm |  | <https://github.com/syntax-tree/hast-util-to-jsx-runtime/blob/master/license> |
| hast-util-to-parse5 | 8.0.0 | npm |  | <https://github.com/syntax-tree/hast-util-to-parse5/blob/master/license> |
| hast-util-to-text | 4.0.2 | npm |  | <https://github.com/syntax-tree/hast-util-to-text#readme/blob/master/LICENSE.md> |
| hast-util-whitespace | 3.0.0 | npm |  | <https://github.com/syntax-tree/hast-util-whitespace/blob/master/license> |
| hastscript | 9.0.1 | npm |  | <https://github.com/syntax-tree/hastscript#readme/blob/master/LICENSE.md> |
| highlight.js | 11.11.1 | npm |  | <https://github.com/highlightjs/highlight.js/blob/master/LICENSE> |
| hoist-non-react-statics | 3.3.2 | npm |  | <https://github.com/mridgway/hoist-non-react-statics/blob/master/LICENSE.md> |
| html-url-attributes | 3.0.1 | npm |  | <https://github.com/rehypejs/rehype-minify.git#main/blob/master/LICENSE.md> |
| html-void-elements | 3.0.0 | npm |  | <https://github.com/wooorm/html-void-elements/blob/master/license> |
| ieee754 | 1.2.1 | npm |  | <https://github.com/feross/ieee754/blob/master/LICENSE> |
| immediate | 3.0.6 | npm |  | <https://github.com/calvinmetcalf/immediate/blob/master/LICENSE.txt> |
| immutable | 5.1.4 | npm |  | <https://github.com/immutable-js/immutable-js/blob/master/LICENSE> |
| inherits | 2.0.4 | npm |  | <https://github.com/isaacs/inherits/blob/master/LICENSE.md> |
| inline-style-parser | 0.2.6 | npm |  | <https://github.com/remarkablemark/inline-style-parser/blob/master/LICENSE> |
| intl-messageformat | 10.7.18 | npm |  | <https://github.com/formatjs/formatjs/blob/main/LICENSE.md> |
| invariant | 2.2.4 | npm |  | <https://github.com/zertosh/invariant#readme/blob/master/LICENSE.md> |
| inversify | 7.10.4 | npm |  | <https://github.com/inversify/monorepo/blob/master/LICENSE> |
| is-alphabetical | 2.0.1 | npm |  | <https://github.com/wooorm/is-alphabetical/blob/master/license> |
| is-alphanumerical | 2.0.1 | npm |  | <https://github.com/wooorm/is-alphanumerical/blob/master/license> |
| is-callable | 1.2.7 | npm |  | <https://github.com/inspect-js/is-callable/blob/master/LICENSE> |
| is-decimal | 2.0.1 | npm |  | <https://github.com/wooorm/is-decimal/blob/master/license> |
| is-extglob | 2.1.1 | npm |  | <https://github.com/jonschlinkert/is-extglob/blob/master/LICENSE> |
| is-fullwidth-code-point | 3.0.0 | npm |  | <https://github.com/sindresorhus/is-fullwidth-code-point#readme/blob/master/LICENSE.md> |
| is-glob | 4.0.3 | npm |  | <https://github.com/micromatch/is-glob/blob/master/LICENSE> |
| is-hexadecimal | 2.0.1 | npm |  | <https://github.com/wooorm/is-hexadecimal/blob/master/license> |
| is-number | 7.0.0 | npm |  | <https://github.com/jonschlinkert/is-number/blob/master/LICENSE> |
| is-plain-obj | 4.1.0 | npm |  | <https://github.com/sindresorhus/is-plain-obj/blob/master/license> |
| is-typed-array | 1.1.15 | npm |  | <https://github.com/inspect-js/is-typed-array/blob/master/LICENSE> |
| isarray | 2.0.5 | npm |  | <https://github.com/juliangruber/isarray/blob/master/LICENSE> |
| isbot | 5.1.32 | npm |  | <https://github.com/omrilotan/isbot/blob/main/LICENSE> |
| isexe | 2.0.0 | npm |  | <https://github.com/isaacs/isexe/blob/master/LICENSE.md> |
| jackspeak | 4.1.1 | npm |  | <https://github.com/isaacs/jackspeak/blob/master/LICENSE.md> |
| jiti | 2.6.1 | npm |  | <https://github.com/unjs/jiti#readme/blob/master/LICENSE.md> |
| js-tokens | 4.0.0 | npm |  | <https://github.com/lydell/js-tokens/blob/master/LICENSE> |
| js-yaml | 4.1.0 | npm |  | <https://github.com/nodeca/js-yaml/blob/master/LICENSE> |
| json-schema-traverse | 1.0.0 | npm |  | <https://github.com/epoberezkin/json-schema-traverse/blob/master/LICENSE> |
| json-schema-typed | 8.0.1 | npm |  | <https://github.com/RemyRylan/json-schema-typed/blob/master/LICENSE.md> |
| jsonfile | 6.2.0 | npm |  | <https://github.com/jprichardson/node-jsonfile#readme/blob/master/LICENSE.md> |
| jszip | 3.10.1 | npm |  | <https://github.com/Stuk/jszip/blob/main/LICENSE.markdown> |
| keyboardevent-from-electron-accelerator | 2.0.0 | npm |  | <https://github.com/parro-it/keyboardevent-from-electron-accelerator/blob/master/license> |
| keyboardevents-areequal | 0.2.2 | npm |  | <https://github.com/parro-it/keyboardevents-areequal/blob/master/license> |
| lazy-val | 1.0.5 | npm |  | <https://github.com/develar/lazy-val/blob/master/package.json> |
| lie | 3.3.0 | npm |  | <https://github.com/calvinmetcalf/lie/blob/master/license.md> |
| lodash | 4.17.21 | npm |  | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash-es | 4.17.21 | npm |  | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.clonedeep | 4.5.0 | npm |  | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.escaperegexp | 4.1.2 | npm |  | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.isequal | 4.5.0 | npm |  | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| longest-streak | 3.1.0 | npm |  | <https://github.com/wooorm/longest-streak/blob/master/license> |
| loose-envify | 1.4.0 | npm |  | <https://github.com/zertosh/loose-envify/blob/master/LICENSE> |
| lowlight | 3.3.0 | npm |  | <https://github.com/wooorm/lowlight#readme/blob/master/LICENSE.md> |
| lru-cache | 11.2.2 | npm |  | <https://github.com/isaacs/node-lru-cache/blob/master/LICENSE.md> |
| make-error | 1.3.6 | npm |  | <https://github.com/JsCommunity/make-error/blob/master/LICENSE> |
| markdown-table | 3.0.4 | npm |  | <https://github.com/wooorm/markdown-table/blob/master/license> |
| math-intrinsics | 1.1.0 | npm |  | <https://github.com/es-shims/math-intrinsics/blob/master/LICENSE> |
| mdast-util-find-and-replace | 3.0.2 | npm |  | <https://github.com/syntax-tree/mdast-util-find-and-replace/blob/master/license> |
| mdast-util-from-markdown | 2.0.2 | npm |  | <https://github.com/syntax-tree/mdast-util-from-markdown/blob/master/license> |
| mdast-util-gfm | 3.1.0 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm/blob/master/license> |
| mdast-util-gfm-autolink-literal | 2.0.1 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm-autolink-literal/blob/master/license> |
| mdast-util-gfm-footnote | 2.1.0 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm-footnote/blob/master/license> |
| mdast-util-gfm-strikethrough | 2.0.0 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm-strikethrough/blob/master/license> |
| mdast-util-gfm-table | 2.0.0 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm-table/blob/master/license> |
| mdast-util-gfm-task-list-item | 2.0.0 | npm |  | <https://github.com/syntax-tree/mdast-util-gfm-task-list-item/blob/master/license> |
| mdast-util-mdx-expression | 2.0.1 | npm |  | <https://github.com/syntax-tree/mdast-util-mdx-expression/blob/master/license> |
| mdast-util-mdx-jsx | 3.2.0 | npm |  | <https://github.com/syntax-tree/mdast-util-mdx-jsx/blob/master/license> |
| mdast-util-mdxjs-esm | 2.0.1 | npm |  | <https://github.com/syntax-tree/mdast-util-mdxjs-esm/blob/master/license> |
| mdast-util-phrasing | 4.1.0 | npm |  | <https://github.com/syntax-tree/mdast-util-phrasing#readme/blob/master/LICENSE.md> |
| mdast-util-to-hast | 13.2.0 | npm |  | <https://github.com/syntax-tree/mdast-util-to-hast#readme/blob/master/LICENSE.md> |
| mdast-util-to-markdown | 2.1.2 | npm |  | <https://github.com/syntax-tree/mdast-util-to-markdown/blob/master/license> |
| mdast-util-to-string | 4.0.0 | npm |  | <https://github.com/syntax-tree/mdast-util-to-string#readme/blob/master/LICENSE.md> |
| micromark | 4.0.2 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-core-commonmark | 2.0.3 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-extension-gfm | 3.0.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm/blob/master/license> |
| micromark-extension-gfm-autolink-literal | 2.1.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm-autolink-literal/blob/master/license> |
| micromark-extension-gfm-footnote | 2.1.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm-footnote/blob/master/license> |
| micromark-extension-gfm-strikethrough | 2.1.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm-strikethrough/blob/master/license> |
| micromark-extension-gfm-table | 2.1.1 | npm |  | <https://github.com/micromark/micromark-extension-gfm-table/blob/master/license> |
| micromark-extension-gfm-tagfilter | 2.0.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm-tagfilter/blob/master/license> |
| micromark-extension-gfm-task-list-item | 2.1.0 | npm |  | <https://github.com/micromark/micromark-extension-gfm-task-list-item/blob/master/license> |
| micromark-factory-destination | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-factory-label | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-factory-space | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-factory-title | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-factory-whitespace | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-character | 2.1.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-chunked | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-classify-character | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-combine-extensions | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-decode-numeric-character-reference | 2.0.2 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-decode-string | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-encode | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-html-tag-name | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-normalize-identifier | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-resolve-all | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-sanitize-uri | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-subtokenize | 2.1.0 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-symbol | 2.0.1 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromark-util-types | 2.0.2 | npm |  | <https://github.com/micromark/micromark.git#main/blob/master/LICENSE.md> |
| micromatch | 4.0.8 | npm |  | <https://github.com/micromatch/micromatch/blob/master/LICENSE> |
| mimic-function | 5.0.1 | npm |  | <https://github.com/sindresorhus/mimic-function/blob/master/license> |
| minimatch | 9.0.5 | npm |  | <https://github.com/isaacs/minimatch/blob/main/LICENSE> |
| minipass | 7.1.2 | npm |  | <https://github.com/isaacs/minipass/blob/master/LICENSE.md> |
| ms | 2.1.3 | npm |  | <https://github.com/vercel/ms/blob/master/LICENSE> |
| nanoid | 3.3.11 | npm |  | <https://github.com/ai/nanoid/blob/master/LICENSE> |
| node-addon-api | 7.1.1 | npm |  | <https://github.com/nodejs/node-addon-api/blob/main/LICENSE.md> |
| object-assign | 4.1.1 | npm |  | <https://github.com/sindresorhus/object-assign/blob/master/license> |
| object-inspect | 1.13.4 | npm |  | <https://github.com/inspect-js/object-inspect/blob/main/LICENSE> |
| package-json-from-dist | 1.0.1 | npm |  | <https://github.com/isaacs/package-json-from-dist#readme/blob/master/LICENSE.md> |
| pako | 1.0.11 | npm |  | <https://github.com/nodeca/pako/blob/master/LICENSE> |
| papaparse | 5.5.3 | npm |  | <https://github.com/mholt/PapaParse/blob/master/LICENSE> |
| parchment | 3.0.0 | npm |  | <https://github.com/quilljs/parchment/blob/main/LICENSE> |
| parse-entities | 4.0.2 | npm |  | <https://github.com/wooorm/parse-entities/blob/master/license> |
| parse5 | 7.3.0 | npm |  | <https://github.com/inikulin/parse5/blob/master/LICENSE> |
| path-key | 3.1.1 | npm |  | <https://github.com/sindresorhus/path-key/blob/master/license> |
| path-scurry | 2.0.0 | npm |  | <https://github.com/isaacs/path-scurry/blob/master/LICENSE.md> |
| picocolors | 1.1.1 | npm |  | <https://github.com/alexeyraspopov/picocolors/blob/master/LICENSE> |
| picomatch | 4.0.3 | npm |  | <https://github.com/micromatch/picomatch/blob/master/LICENSE> |
| playwright-trx-reporter | 1.0.10 | npm |  | <https://github.com/estruyf/playwright-trx-reporter/blob/master/LICENSE> |
| possible-typed-array-names | 1.1.0 | npm |  | <https://github.com/ljharb/possible-typed-array-names/blob/master/LICENSE> |
| postcss | 8.5.6 | npm |  | <https://github.com/postcss/postcss/blob/master/LICENSE> |
| primereact | 10.9.7 | npm |  | <https://github.com/primefaces/primereact/blob/master/LICENSE.md> |
| process-nextick-args | 2.0.1 | npm |  | <https://github.com/calvinmetcalf/process-nextick-args/blob/master/license.md> |
| promise-retry | 2.0.1 | npm |  | <https://github.com/IndigoUnited/node-promise-retry/blob/master/LICENSE> |
| prop-types | 15.8.1 | npm |  | <https://github.com/facebook/prop-types/blob/master/LICENSE> |
| property-information | 7.1.0 | npm |  | <https://github.com/wooorm/property-information/blob/master/license> |
| punycode | 1.4.1 | npm |  | <https://github.com/mathiasbynens/punycode.js/blob/main/LICENSE-MIT.txt> |
| qs | 6.14.0 | npm |  | <https://github.com/ljharb/qs/blob/master/LICENSE.md> |
| quill | 2.0.3 | npm |  | <https://github.com/slab/quill/blob/master/LICENSE> |
| quill-delta | 5.1.0 | npm |  | <https://github.com/quilljs/delta/blob/master/LICENSE> |
| react | 19.2.0 | npm |  | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-dom | 19.2.0 | npm |  | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-intl | 7.1.14 | npm |  | <https://github.com/formatjs/formatjs/blob/master/LICENSE.md> |
| react-is | 16.13.1 | npm |  | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-markdown | 10.1.0 | npm |  | <https://github.com/remarkjs/react-markdown/blob/master/license> |
| react-router | 7.9.5 | npm |  | <https://github.com/remix-run/react-router/blob/master/LICENSE.md> |
| react-router-dom | 7.9.5 | npm |  | <https://github.com/remix-run/react-router/blob/master/LICENSE.md> |
| react-transition-group | 4.4.5 | npm |  | <https://github.com/reactjs/react-transition-group/blob/master/LICENSE> |
| readable-stream | 2.3.8 | npm |  | <https://github.com/nodejs/readable-stream/blob/master/LICENSE> |
| readdirp | 4.1.2 | npm |  | <https://github.com/paulmillr/readdirp/blob/master/LICENSE> |
| reflect-metadata | 0.2.2 | npm |  | <https://github.com/rbuckton/reflect-metadata/blob/master/LICENSE> |
| rehype-highlight | 7.0.2 | npm |  | <https://github.com/rehypejs/rehype-highlight#readme/blob/master/LICENSE.md> |
| rehype-raw | 7.0.0 | npm |  | <https://github.com/rehypejs/rehype-raw/blob/master/license> |
| remark-gfm | 4.0.1 | npm |  | <https://github.com/remarkjs/remark-gfm/blob/master/license> |
| remark-parse | 11.0.0 | npm |  | <https://github.com/remarkjs/remark.git#main/blob/master/LICENSE.md> |
| remark-rehype | 11.1.2 | npm |  | <https://github.com/remarkjs/remark-rehype/blob/master/license> |
| remark-stringify | 11.0.0 | npm |  | <https://github.com/remarkjs/remark.git#main/blob/master/LICENSE.md> |
| require-directory | 2.1.1 | npm |  | <https://github.com/troygoode/node-require-directory/blob/master/LICENSE> |
| require-from-string | 2.0.2 | npm |  | <https://github.com/floatdrop/require-from-string#readme/blob/master/LICENSE.md> |
| retry | 0.12.0 | npm |  | <https://github.com/tim-kos/node-retry/blob/master/License> |
| rollup | 4.52.5 | npm |  | <https://github.com/rollup/rollup/blob/master/LICENSE.md> |
| safe-buffer | 5.2.1 | npm |  | <https://github.com/feross/safe-buffer/blob/master/LICENSE> |
| sass | 1.93.3 | npm |  | <https://github.com/sass/dart-sass/blob/master/LICENSE> |
| sax | 1.4.3 | npm |  | <https://github.com/isaacs/sax-js/blob/master/LICENSE.md> |
| scheduler | 0.27.0 | npm |  | <https://github.com/facebook/react/blob/master/LICENSE> |
| semver | 7.7.3 | npm |  | <https://github.com/npm/node-semver/blob/master/LICENSE> |
| seroval | 1.3.2 | npm |  | <https://github.com/lxsmnsyc/seroval/blob/master/LICENSE> |
| seroval-plugins | 1.3.3 | npm |  | <https://github.com/lxsmnsyc/seroval/blob/master/LICENSE> |
| set-cookie-parser | 2.7.2 | npm |  | <https://github.com/nfriedly/set-cookie-parser/blob/master/LICENSE> |
| set-function-length | 1.2.2 | npm |  | <https://github.com/ljharb/set-function-length/blob/master/LICENSE> |
| setimmediate | 1.0.5 | npm |  | <https://github.com/yuzujs/setImmediate/blob/master/LICENSE.txt> |
| sha.js | 2.4.12 | npm |  | <https://github.com/crypto-browserify/sha.js/blob/master/LICENSE> |
| shebang-command | 2.0.0 | npm |  | <https://github.com/kevva/shebang-command/blob/master/license> |
| shebang-regex | 3.0.0 | npm |  | <https://github.com/sindresorhus/shebang-regex/blob/master/license> |
| side-channel | 1.1.0 | npm |  | <https://github.com/ljharb/side-channel/blob/master/LICENSE> |
| side-channel-list | 1.0.0 | npm |  | <https://github.com/ljharb/side-channel-list#readme/blob/master/LICENSE.md> |
| side-channel-map | 1.0.1 | npm |  | <https://github.com/ljharb/side-channel-map/blob/master/LICENSE> |
| side-channel-weakmap | 1.0.2 | npm |  | <https://github.com/ljharb/side-channel-weakmap/blob/master/LICENSE> |
| signal-exit | 4.1.0 | npm |  | <https://github.com/tapjs/signal-exit#readme/blob/master/LICENSE.md> |
| solid-js | 1.9.10 | npm |  | <https://github.com/solidjs/solid/blob/master/LICENSE> |
| source-map | 0.6.1 | npm |  | <https://github.com/mozilla/source-map/blob/master/LICENSE> |
| source-map-js | 1.2.1 | npm |  | <https://github.com/7rulnik/source-map-js/blob/master/LICENSE> |
| source-map-support | 0.5.21 | npm |  | <https://github.com/evanw/node-source-map-support#readme/blob/master/LICENSE.md> |
| space-separated-tokens | 2.0.2 | npm |  | <https://github.com/wooorm/space-separated-tokens/blob/master/license> |
| sql-highlight | 6.1.0 | npm |  | <https://github.com/scriptcoded/sql-highlight/blob/master/LICENSE> |
| string-width | 5.1.2 | npm |  | <https://github.com/sindresorhus/string-width/blob/master/license> |
| string_decoder | 1.1.1 | npm |  | <https://github.com/nodejs/string_decoder/blob/master/LICENSE> |
| stringify-entities | 4.0.4 | npm |  | <https://github.com/wooorm/stringify-entities/blob/master/license> |
| strip-ansi | 7.1.2 | npm |  | <https://github.com/chalk/strip-ansi#readme/blob/master/LICENSE.md> |
| stubborn-fs | 2.0.0 | npm |  | <https://github.com/fabiospampinato/stubborn-fs/blob/master/license> |
| stubborn-utils | 1.0.2 | npm |  | <https://github.com/fabiospampinato/stubborn-utils/blob/master/license> |
| style-to-js | 1.1.19 | npm |  | <https://github.com/remarkablemark/style-to-js/blob/master/LICENSE> |
| style-to-object | 1.0.12 | npm |  | <https://github.com/remarkablemark/style-to-object#readme/blob/master/LICENSE.md> |
| styleq | 0.2.1 | npm |  | <https://github.com/necolas/styleq/blob/master/LICENSE> |
| tagged-tag | 1.0.0 | npm |  | <https://github.com/sindresorhus/tagged-tag#readme/blob/master/LICENSE.md> |
| terser | 5.44.1 | npm |  | <https://github.com/terser/terser/blob/master/LICENSE> |
| tiny-invariant | 1.3.3 | npm |  | <https://github.com/alexreardon/tiny-invariant#readme/blob/master/LICENSE.md> |
| tiny-typed-emitter | 2.1.0 | npm |  | <https://github.com/binier/tiny-typed-emitter/blob/master/LICENSE> |
| tiny-warning | 1.0.3 | npm |  | <https://github.com/alexreardon/tiny-warning/blob/master/LICENSE> |
| tinyglobby | 0.2.15 | npm |  | <https://github.com/SuperchupuDev/tinyglobby/blob/master/LICENSE> |
| to-buffer | 1.2.2 | npm |  | <https://github.com/browserify/to-buffer/blob/master/LICENSE> |
| to-regex-range | 5.0.1 | npm |  | <https://github.com/micromatch/to-regex-range/blob/master/LICENSE> |
| trim-lines | 3.0.1 | npm |  | <https://github.com/wooorm/trim-lines/blob/master/license> |
| trough | 2.2.0 | npm |  | <https://github.com/wooorm/trough/blob/master/license> |
| ts-node | 10.9.2 | npm |  | <https://github.com/TypeStrong/ts-node/blob/master/LICENSE> |
| tslib | 2.8.1 | npm |  | <https://github.com/Microsoft/tslib/blob/master/LICENSE.txt> |
| type-fest | 5.2.0 | npm |  | <https://github.com/sindresorhus/type-fest#readme/blob/master/LICENSE.md> |
| typed-array-buffer | 1.0.3 | npm |  | <https://github.com/inspect-js/typed-array-buffer/blob/master/LICENSE> |
| typeorm | 0.3.27 | npm |  | <https://github.com/typeorm/typeorm/blob/master/LICENSE> |
| typescript | 5.9.3 | npm |  | <https://github.com/microsoft/TypeScript/blob/main/LICENSE.txt> |
| uint8array-extras | 1.5.0 | npm |  | <https://github.com/sindresorhus/uint8array-extras/blob/master/license> |
| undici-types | 6.20.0 | npm |  | <https://github.com/nodejs/undici/blob/main/LICENSE> |
| unified | 11.0.5 | npm |  | <https://github.com/unifiedjs/unified/blob/master/license> |
| unist-util-find-after | 5.0.0 | npm |  | <https://github.com/syntax-tree/unist-util-find-after/blob/master/license> |
| unist-util-is | 6.0.1 | npm |  | <https://github.com/syntax-tree/unist-util-is/blob/master/license> |
| unist-util-position | 5.0.0 | npm |  | <https://github.com/syntax-tree/unist-util-position/blob/master/license> |
| unist-util-stringify-position | 4.0.0 | npm |  | <https://github.com/syntax-tree/unist-util-stringify-position/blob/master/license> |
| unist-util-visit | 5.0.0 | npm |  | <https://github.com/syntax-tree/unist-util-visit#readme/blob/master/LICENSE.md> |
| unist-util-visit-parents | 6.0.2 | npm |  | <https://github.com/syntax-tree/unist-util-visit-parents/blob/master/license> |
| universalify | 2.0.1 | npm |  | <https://github.com/RyanZim/universalify/blob/master/LICENSE> |
| url | 0.11.4 | npm |  | <https://github.com/defunctzombie/node-url/blob/master/LICENSE> |
| use-sync-external-store | 1.6.0 | npm |  | <https://github.com/facebook/react#readme/blob/master/LICENSE.md> |
| util-deprecate | 1.0.2 | npm |  | <https://github.com/TooTallNate/util-deprecate/blob/master/LICENSE> |
| uuid | 9.0.1 | npm |  | <https://github.com/uuidjs/uuid/blob/master/LICENSE.md> |
| v8-compile-cache-lib | 3.0.1 | npm |  | <https://github.com/cspotcode/v8-compile-cache-lib/blob/master/LICENSE> |
| vfile | 6.0.3 | npm |  | <https://github.com/vfile/vfile/blob/master/license> |
| vfile-location | 5.0.3 | npm |  | <https://github.com/vfile/vfile-location/blob/master/license> |
| vfile-message | 4.0.3 | npm |  | <https://github.com/vfile/vfile-message/blob/master/license> |
| vite | 7.2.1 | npm |  | <https://github.com/vitejs/vite/blob/master/LICENSE> |
| web-namespaces | 2.0.1 | npm |  | <https://github.com/wooorm/web-namespaces/blob/master/license> |
| when-exit | 2.1.5 | npm |  | <https://github.com/fabiospampinato/when-exit#readme/blob/master/LICENSE.md> |
| which | 2.0.2 | npm |  | <https://github.com/npm/node-which/blob/master/LICENSE> |
| which-typed-array | 1.1.19 | npm |  | <https://github.com/inspect-js/which-typed-array/blob/master/LICENSE> |
| wrap-ansi | 8.1.0 | npm |  | <https://github.com/chalk/wrap-ansi#readme/blob/master/LICENSE.md> |
| y18n | 5.0.8 | npm |  | <https://github.com/yargs/y18n/blob/master/LICENSE> |
| yaml | 2.8.1 | npm |  | <https://github.com/eemeli/yaml/blob/master/LICENSE> |
| yargs | 17.7.2 | npm |  | <https://github.com/yargs/yargs/blob/master/LICENSE> |
| yargs-parser | 21.1.1 | npm |  | <https://github.com/yargs/yargs-parser/blob/master/LICENSE.txt> |
| yn | 3.1.1 | npm |  | <https://github.com/sindresorhus/yn/blob/master/license> |
| zwitch | 2.0.4 | npm |  | <https://github.com/wooorm/zwitch/blob/master/license> |

---
title: SnowConvert AI - Oracle
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/oracle.md
section: Migrations
---

# SnowConvert AI - Oracle

The first step for migration is getting the code that you need to migrate. There are many ways to extract the code from your database. However, we recommend using the extraction scripts provided by Snowflake.

All the source code for these scripts is open source and is available on [GitHub](https://github.com/Snowflake-Labs/SC.DDLExportScripts/).

## Prerequisites

* Access to a server with an Oracle database.
* Permission to run shell scripts with access to the server.
* Tools to connect to the Database like [`sqlplus`](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/SQL-Plus-quick-start.html#GUID-BF1995BD-EF9B-4EA2-9B32-7BFACDEB79DA) or [`sqlcl`](https://www.oracle.com/database/technologies/appdev/sqlcl.html)

## Installing the scripts

Go to <https://github.com/Snowflake-Labs/SC.DDLExportScripts/>

From the Code option, select the drop-down and use the **Download ZIP** option to download the code.

Decompress the ZIP file. The code for Oracle should be under the Oracle folder

When the script is done, the output folder will contain all the DDLs for the migration.

Follow the [Usage instructions](https://github.com/Snowflake-Labs/SC.DDLExportScripts/tree/main/Oracle#readme) to modify the files and run them on your system.

You can then compress this folder to use with [SnowConvert AI](../../../overview.md)

```none
zip -r output.zip ./output
```

---
title: SnowConvert AI - Oracle
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/oracle.md
section: Migrations
---

# SnowConvert AI - Oracle

## What is SnowConvert AI for Oracle?

SnowConvert AI is a software tool that understands [Oracle SQL and PL/SQL](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/index.html), and performs the following conversions:

* Oracle SQL to [Snowflake SQL](https://www.snowflake.com/)
* Oracle PL/SQL to:

  + [Snowflake Scripting](../../../../../../developer-guide/snowflake-scripting/index.md)
  + [JavaScript](../../../../../../developer-guide/stored-procedure/stored-procedures-javascript.md) embedded in Snowflake SQL

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* ***SQL (Structured Query Language):*** The standard language for storing, manipulating, and retrieving data in most modern database architectures.
* ***PL/SQL:*** Procedural Language for SQL. This was created by Oracle, and is still used by Oracle as the scripting language for stored procedures and functions in Oracle.
* ***SnowConvert AI*****:** The software that converts securely and automatically your Oracle files to the Snowflake cloud data platform.
* ***Conversion rule or transformation rule:*** Rules that allow SnowConvert AI to convert from a portion of source code and determine the expected target code.
* ***Parse:*** Parse or parsing is an initial process done by SnowConvert AI to understand the source code, and build up an internal data structure to process the conversion rules.

Let’s dive into some of the code conversions that **Snowflake SnowConvert AI** can perform.

## Code Conversions

### Oracle SQL to Snowflake SQL

SnowConvert AI for Oracle takes in Oracle source code in SQL and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in Snowflake SQL.

#### Example

Here is an example of the conversion of a simple `CREATE TABLE` statement.

The source code:

```sql
CREATE TABLE "MyTable"
(
  "COL1" NUMBER,
  "COL2" NUMBER,
  "COL3" NUMBER GENERATED ALWAYS AS (COL1 * COL2) VIRTUAL,
  "COL4" LONG,
  "COL5" CLOB,
  "COL6" ROWID,
  "COL7" NVARCHAR2(10),
  "COL8" RAW(255),
  CONSTRAINT "PK" PRIMARY KEY ("COL1")
);
```

The migrated Snowflake SQL code:

```sql
CREATE OR REPLACE TABLE "MyTable"
  (
    "COL1" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    "COL2" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    "COL3" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ AS (COL1 * COL2),
    "COL4" VARCHAR,
    "COL5" VARCHAR,
    "COL6" VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!,
    "COL7" VARCHAR(10),
    "COL8" BINARY,
    CONSTRAINT "PK" PRIMARY KEY ("COL1")
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
  ;
```

In this converted SQL you will notice that we are converting many things. A few highlights:

* Adding `PUBLIC` Schema by default for all the Table and view names if the user doesn’t specify one
* `CREATE TABLE` to `CREATE OR REPLACE TABLE`
* Data Type Conversions:

  + `LONG` to `VARCHAR`
  + `CLOB` to `VARCHAR`
  + `ROWID` to `VARCHAR`
  + `NVARCHAR2` to `VARCHAR`
  + `RAW` to `BINARY`
* Data Type Attributes: `GENERATED ALWAYS AS (COL1 * COL2) VIRTUAL` to `AS (COL1 * COL2)`

For more information about data types and their equivalent: [Data Types](../../../../translation-references/oracle/README.md). More examples can be found in the rest of the documentation.

### Oracle PL/SQL

SnowConvert AI takes Oracle stored procedures and functions (**PL/SQL**) and converts them to either **Snowflake Scripting** or **JavaScript** embedded into Snowflake SQL. Oracle `CREATE PROCEDURE` and `REPLACE PROCEDURE` syntax is replaced by Snowflake `CREATE OR REPLACE PROCEDURE` syntax.

#### Example

Here is an example of the conversion of a simple `CREATE PROCEDURE` in Oracle that does an insert into a table used for logging.

> **Note:**
>
> This example will be used for both Snowflake Scripting and JavaScript.

```sql
CREATE OR REPLACE PROCEDURE SC_DEMO.PROC_LOG
      (final_proc  VARCHAR2,
       final_message   VARCHAR2,
       logger_type VARCHAR2 DEFAULT 'I')
AS
BEGIN
  INSERT INTO SC_DEMO.PROC_LOG_TABLE
    VALUES (SC_DEMO.final_logging_seq.NEXTVAL,
            sysdate,
            SUBSTR(logger_type, 1, 1),
            SUBSTR(final_proc, 1, 30),
            SUBSTR(final_message, 1, 1024));
  COMMIT;

END;
```

### To Snowflake Scripting

Snowflake Scripting works as an extension to Snowflake SQL, it adds support for procedural logic and this allows us to create Stored Procedures and replicate similar behaviours and statements of Oracle PL/SQL.

#### Migrated Example

```sql
CREATE OR REPLACE PROCEDURE SC_DEMO.PROC_LOG
(final_proc VARCHAR, final_message VARCHAR,
 logger_type VARCHAR DEFAULT 'I')
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    INSERT INTO SC_DEMO.PROC_LOG_TABLE
      VALUES (SC_DEMO.final_logging_seq.NEXTVAL, CURRENT_TIMESTAMP(),
              SUBSTR(:logger_type, 1, 1),
              SUBSTR(:final_proc, 1, 30),
              SUBSTR(:final_message, 1, 1024));
    --** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
    COMMIT;
  END;
$$;
```

### To JavaScript

JavaScript is called as a scripting language, all inner statements are converted to JavaScript. If you want to understand better the JavaScript API check [this documentation](https://docs.snowflake.com/en/sql-reference/stored-procedures-javascript.html).

#### Migrated Example

```sql
-- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE SC_DEMO.PROC_LOG
(final_proc STRING, final_message STRING,
 logger_type STRING DEFAULT 'I')
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  EXEC(`INSERT INTO SC_DEMO.PROC_LOG_TABLE
  VALUES (SC_DEMO.final_logging_seq.NEXTVAL, CURRENT_TIMESTAMP(),
            SUBSTR(?, 1, 1),
            SUBSTR(?, 1, 30),
            SUBSTR(?, 1, 1024))`,[LOGGER_TYPE,FINAL_PROC,FINAL_MESSAGE]);
  EXEC(`--** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
COMMIT;`);
$$;
```

In this converted SQL you will notice that we have converted to a new language (JavaScript) embedded into Snowflake SQL. There are more than a few highlights, but suffice it to say that this documentation has all the essentials to understand this kind of conversion.

The line that states `// ... Necessary SnowConvert AI Helpers are inserted here ...` will actually have the **SnowConvert AI JavaScript Helpers**. They can be lengthy, so they are removed from this first example.

####

And that’s it! Snowflake SnowConvert AI takes the pain and frustration out of changing data platforms. Learn more about getting started with SnowConvert AI for Oracle on the next page.

---
title: SnowConvert AI - Oracle
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/oracle.md
section: Migrations
---

# SnowConvert AI - Oracle

## Specific CLI arguments

### `--disableSnowScript`

Flag to indicate whether SnowConvert AI should migrate the procedures to Javascript and Python. By default, it is set to **false**.

#### `--disableSynonym`

Flag to indicate whether or not Synonyms should be transformed. By default, it’s set to **true**.

#### `--disablePackagesAsSchemas`

Flag to indicate whether or not the Packages should be transformed to new Schemas.

Please check the naming of the procedure enabling and disabling the flag:

```sql
CREATE OR REPLACE PACKAGE emp_mgmt AS
PROCEDURE remove_emp (employee_id NUMBER );
END emp_mgmt;

CREATE OR REPLACE PACKAGE BODY emp_mgmt AS
PROCEDURE remove_emp (employee_id NUMBER) IS
   BEGIN
      DELETE FROM employees
      WHERE employees.employee_id = remove_emp.employee_id;
      tot_emps := tot_emps - 1;
   END;
END emp_mgmt;
```

```none
CREATE SCHEMA IF NOT EXISTS emp_mgmt
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE emp_mgmt.remove_emp (employee_id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      DELETE FROM
         employees
         WHERE employees.employee_id = remove_emp.employee_id;
         tot_emps :=
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
                     tot_emps - 1;
   END;
$$;
```

```none
-- Additional Params: --disablePackagesAsSchemas
CREATE OR REPLACE PROCEDURE EMP_MGMT_REMOVE_EMP (employee_id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      DELETE FROM
         employees
         WHERE employees.employee_id = remove_emp.employee_id;
         tot_emps :=
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
                     tot_emps - 1;
   END;
$$;
```

#### `--outerJoinsToOnlyAnsiSyntax`

Flag to indicate whether Outer Joins should be transformed to only ANSI syntax.

#### `--disableDateAsTimestamp`

Flag to indicate whether `SYSDATE` should be transformed into `CURRENT_DATE` *or* `CURRENT_TIMESTAMP`. This will also affect all `DATE` columns that will be transformed to `TIMESTAMP`.

```sql
CREATE TABLE DATE_TABLE(
    DATE_COL DATE
);

SELECT SYSDATE FROM DUAL;
```

```sql
CREATE OR REPLACE TABLE DATE_TABLE (
        DATE_COL TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    SELECT
        CURRENT_TIMESTAMP()
    FROM DUAL;
```

```sql
-- Additional Params: --disableDateAsTimestamp
CREATE OR REPLACE TABLE DATE_TABLE (
        DATE_COL DATE /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

SELECT
    CURRENT_DATE()
FROM DUAL;
```

Learn more about how you can get access to the SnowConvert AI for Oracle Command Line Interface tool by filling out the form on our [**Snowflake Migrations Info**](https://www.mobilize.net/services/database-migrations/snowflake/get-info) page.

#### `--arrange`

Flag to indicate whether the input code should be processed before parsing and transformation.

Learn more about this step on our **Processing the code** page.

#### `--dataTypeCustomizationFile`

The path to a .json file that specifies rules of data type transformation considering data type origin and column name. This feature allows you to customize how data types are transformed during migration, including support for transforming `NUMBER` columns to `DECFLOAT`.

When this argument is provided, SnowConvert AI generates a [TypeMappings Report](../../../getting-started/running-snowconvert/review-results/reports/type-mappings-report.md) that shows all data type transformations applied, making it easy to verify your customization rules were applied correctly.

Navigate to the [Data Type Customization](../../../../translation-references/oracle/basic-elements-of-oracle-sql/data-types/README.md) documentation to learn more about configuring data type transformation rules.

---
title: SnowConvert AI - Oracle
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/README.md
section: Migrations
---

# SnowConvert AI - Oracle

Translation specification for Oracle grammar syntax

This documentation shows the transformations from Oracle to Snowflake.

The structure is intended to replicate the [Oracle language reference documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/) to make its navigation as intuitive as possible. You will find the Oracle elements transformation in the same place as they are in their original documentation.

In this translation reference, you will find, code examples, functional equivalence results, recommendations, known issues, and descriptions of each transformation.

The entire documentation is under construction and constant improvement as well as the tool itself, we are constantly updating it to provide the best user experience.

---
title: SnowConvert AI - Oracle - Any Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/any-types.md
section: Migrations
---

# SnowConvert AI - Oracle - Any Types

## Description

> The `Any` types provide highly flexible modeling of procedure parameters and table columns where the actual type is not known. These data types let you dynamically encapsulate and access type descriptions, data instances, and sets of data instances of any other SQL type. ([Oracle SQL Language Reference ANYTYPES Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-5A8C5AC6-BC32-4D78-B0DE-037162106C72))

## ANYDATA

### Description

> This type contains an instance of a given type, with data, plus a description of the type. `ANYDATA` can be used as a table column data type and lets you store heterogeneous values in a single column. The values can be of SQL built-in types as well as user-defined types. ([Oracle SQL Language Reference ANYDATA Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-2FCFAF23-DFE9-4D05-8518-88AB134E0692)).

The `ANYDATA` data type is **not supported** in Snowflake.

```sql
{ SYS.ANYDATA | ANYDATA }
```

### Sample Source Patterns

#### Create Table with ANYDATA

##### Oracle

```sql
CREATE TABLE anydatatable
(
    col1 NUMBER,
    col2 ANYDATA,
    col3 SYS.ANYDATA
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE anydatatable
    (
        col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
        col2 VARIANT,
        col3 VARIANT
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Inserting data into ANYDATA column

##### Oracle

```sql
INSERT INTO anydatatable VALUES(
	555,
	ANYDATA.ConvertVarchar('Another Test Text')
);
```

##### Snowflake

```sql
INSERT INTO anydatatable
VALUES(
	555,
	ANYDATA.ConvertVarchar('Another Test Text') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertVarchar' NODE ***/!!!
);
```

#### Functional Example

> **Warning:**
>
> This example **is not a translation** of SnowConvert AI, it is only used to show the functional equivalence between Oracle `ANYDATA` and Snowflake `VARIANT`

> **Warning:**
>
> We are using the `ANYDATA` built-in package. The conversion for this package is currently **not supported** by SnowConvert.

##### Oracle

```sql
--Create Table
CREATE TABLE anydatatable_example
(
	col1 ANYDATA,
	col2 ANYDATA,
	col3 ANYDATA,
	col4 ANYDATA,
	col5 ANYDATA
);

--Insert data
INSERT INTO anydatatable_example VALUES(
	ANYDATA.ConvertNumber(123),
	ANYDATA.ConvertVarchar('Test Text'),
	ANYDATA.ConvertBFloat(3.14f),
	ANYDATA.ConvertDate(CURRENT_DATE),
	ANYDATA.ConvertTimestamp(CURRENT_TIMESTAMP)
);

--Retrieve information
SELECT
	ANYDATA.AccessNumber(col1) AS col1,
	ANYDATA.AccessVarchar(col2) AS col2,
	ANYDATA.AccessBFloat(col3) AS col3,
	ANYDATA.AccessDate(col4) AS col4,
	ANYDATA.AccessTimestamp(col5) AS col5
FROM anydatatable_example;
```

##### Result

| COL1 | COL2 | COL3 | COL4 | COL5 |
| --- | --- | --- | --- | --- |
| 123 | Test Text | 3.14 | 2021-12-05 18:24:59.000 | 2021-12-05 18:24:59.100 |

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE anydatatable_example
	(
		col1 VARIANT,
		col2 VARIANT,
		col3 VARIANT,
		col4 VARIANT,
		col5 VARIANT
	)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--Insert data
INSERT INTO anydatatable_example
VALUES(
	ANYDATA.ConvertNumber(123) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertNumber' NODE ***/!!!,
	ANYDATA.ConvertVarchar('Test Text') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertVarchar' NODE ***/!!!,
	ANYDATA.ConvertBFloat(3.14) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertBFloat' NODE ***/!!!,
	ANYDATA.ConvertDate(CURRENT_DATE()) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertDate' NODE ***/!!!,
	ANYDATA.ConvertTimestamp(CURRENT_TIMESTAMP()) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.ConvertTimestamp' NODE ***/!!!
);

--Retrieve information
SELECT
	ANYDATA.AccessNumber(col1) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.AccessNumber' NODE ***/!!! AS col1,
	ANYDATA.AccessVarchar(col2) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.AccessVarchar' NODE ***/!!! AS col2,
	ANYDATA.AccessBFloat(col3) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.AccessBFloat' NODE ***/!!! AS col3,
	ANYDATA.AccessDate(col4) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.AccessDate' NODE ***/!!! AS col4,
	ANYDATA.AccessTimestamp(col5) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ANYDATA.AccessTimestamp' NODE ***/!!! AS col5
FROM
	anydatatable_example;
```

##### Result

| COL1 | COL2 | COL3 | COL4 | COL5 |
| --- | --- | --- | --- | --- |
| 123 | “Test Text” | 3.14 | “2021-12-05” | “2021-12-05 18:24:43.326 -0800” |

### Known Issues

#### 1. No access to the ANYDATA built-in package

Most operations with `ANYDATA` columns require to use the `ANYDATA` built-in package, transformation for Oracle built-in packages is not supported by SnowConvert AI yet.

### Related EWIs

1. [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
2. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## ANYDATASET

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> This type contains a description of a given type plus a set of data instances of that type. `ANYDATASET` can be used as a procedure parameter data type where such flexibility is needed. The values of the data instances can be of SQL built-in types as well as user-defined types. ([Oracle SQL Language Reference ANYDATASET Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-CBC6D668-4FDB-40C9-B240-DFDA6420C13B)).

The `ANYDATASET` data type is **not supported** in Snowflake. A possible workaround for this data type could be [Snowflake ARRAY](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html#array), however that transformation is currently not supported by SnowConvert.

```sql
{ SYS.ANYDATASET | ANYDATASET }
```

### Sample Source Patterns

#### Create Table with ANYDATASET

##### Oracle

```sql
CREATE TABLE anydatasettable
(
	col1 NUMBER,
	col2 ANYDATASET,
	col3 SYS.ANYDATASET
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE anydatasettable
	(
		col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
	!!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
		col2 ANYDATASET,
	!!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
		col3 SYS.ANYDATASET
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
	;
```

#### Inserting data into ANYDATASET column

##### Oracle

```sql
DECLARE
    anytype_example    ANYTYPE;
    anydataset_example ANYDATASET;
BEGIN
    ANYDATASET.BEGINCREATE(DBMS_TYPES.TYPECODE_VARCHAR2, anytype_example, anydataset_example);

    anydataset_example.ADDINSTANCE;
    anydataset_example.SETVARCHAR2('First element');

    anydataset_example.ADDINSTANCE;
    anydataset_example.SETVARCHAR2('Second element');

    ANYDATASET.ENDCREATE(anydataset_example);

    INSERT INTO anydatasettable VALUES (123, anydataset_example);
END;
```

##### Snowflake

```sql
DECLARE
    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
    anytype_example    ANYTYPE;
    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
    anydataset_example ANYDATASET;
BEGIN
    CALL
    ANYDATASET.BEGINCREATE(
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_TYPES.TYPECODE_VARCHAR2' IS NOT CURRENTLY SUPPORTED. ***/!!!
    '' AS TYPECODE_VARCHAR2, :anytype_example, :anydataset_example);
    CALL

    anydataset_example.ADDINSTANCE();
    CALL
    anydataset_example.SETVARCHAR2('First element');
    CALL

    anydataset_example.ADDINSTANCE();
    CALL
    anydataset_example.SETVARCHAR2('Second element');
    CALL

    ANYDATASET.ENDCREATE(:anydataset_example);

    INSERT INTO anydatasettable
    VALUES (123, :anydataset_example);
END;
```

### Known Issues

#### 1. Inserts are being parsed incorrectly

Some of the functions needed to create and insert a new `ANYDATASET` object are not being parsed correctly by SnowConvert.

##### 1. No access to the ANYDATASET built-in package

Most operations with `ANYDATASET` columns require to use the `ANYDATASET` built-in package, transformation for Oracle built-in packages is not supported by SnowConvert AI yet.

### Related EWIs

1. [SSC-EWI-OR0076](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Built In Package Not Supported.
2. [SSC-FDM-0006:](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) Number type column may not behave similarly in Snowflake
3. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported by Snowflake.

## ANYTYPE

### Description

> This type can contain a type description of any named SQL type or unnamed transient type. ([Oracle SQL Language Reference ANYTYPE Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-CBC6D668-4FDB-40C9-B240-DFDA6420C13B)).

The `ANYTYPE` data type is **not supported** in Snowflake.

```sql
{ SYS.ANYTYPE | ANYTYPE }
```

### Sample Source Patterns

#### Create Table with ANYTYPE

##### Oracle

```sql
CREATE TABLE anytypetable
(
	col1 NUMBER,
	col2 ANYTYPE,
	col3 SYS.ANYTYPE
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE anytypetable
	(
		col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
	!!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
		col2 ANYTYPE,
	!!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
		col3 SYS.ANYTYPE
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
	;
```

#### Inserting data into ANYTYPE column

##### Oracle

```sql
--Create Custom Type
CREATE OR REPLACE TYPE example_type AS OBJECT (id NUMBER, name VARCHAR(20));

--Insert
INSERT INTO anytypetable VALUES(
    123,
    GETANYTYPEFROMPERSISTENT ('HR', 'EXAMPLE_TYPE')
);
```

##### Snowflake

```sql
--Create Custom Type
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE OR REPLACE TYPE example_type AS OBJECT (id NUMBER, name VARCHAR(20))
;

--Insert
INSERT INTO anytypetable
VALUES(
    123,
    GETANYTYPEFROMPERSISTENT ('HR', 'EXAMPLE_TYPE') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'GETANYTYPEFROMPERSISTENT' NODE ***/!!!
);
```

### Known Issues

#### 1. No access to the ANYTYPE built-in package

Most operations with `ANYDATA` columns require to use the `ANYTYPE` built-in package, transformation for Oracle built-in packages is not supported by SnowConvert AI yet.

### Related EWIs

1. [SSC-EWI-0056](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
2. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported in Snowflake.
4. [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.

---
title: SnowConvert AI - Oracle - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/functions/README.md
section: Migrations
---

# SnowConvert AI - Oracle - Built-in functions

This section shows equivalents between functions in Oracle and in Snowflake.

| Oracle | Snowflake | Notes |
| --- | --- | --- |
| ABS | ABS |  |
| ACOS | ACOS |  |
| ADD_MONTHS | ADD_MONTHS |  |
| ANY_VALUE | ANY_VALUE | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| APPROX_COUNT | *\*to be defined* |  |
| APPROX_COUNT_DISTINCT | APPROX_COUNT_DISTINCT |  |
| APPROX_COUNT_DISTINCT_AGG | *\*to be defined* |  |
| APPROX_COUNT_DISTINCT_DETAIL | *\*to be defined* |  |
| APPROX_MEDIAN | *\*to be defined* |  |
| APPROX_PERCENTILE | APPROX_PERCENTILE |  |
| APPROX_PERCENTILE_AGG | *\*to be defined* |  |
| APPROX_PERCENTILE_DETAIL | *\*to be defined* |  |
| APPROX_RANK | *\*to be defined* |  |
| APPROX_SUM | *\*to be defined* |  |
| ASCII | ASCII |  |
| ASCIISTR | *\*to be defined* |  |
| ASIN | ASIN |  |
| ATAN | ATAN |  |
| ATAN2 | ATAN2 |  |
| AVG | AVG |  |
| BFILENAME | *\*to be defined* |  |
| BIN_TO_NUM | *\*to be defined* |  |
| BITAND | BITAND |  |
| BIT_AND_AGG | BITAND_AGG | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| BITMAP_BIT_POSITION | BITMAP_BIT_POSITION |  |
| BITMAP_BUCKET_NUMBER | BITMAP_BUCKET_NUMBER |  |
| BITMAP_CONSTRUCT___AGG | BITMAP_CONSTRUCT___AGG | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| BITMAP_COUNT | BITMAP_BIT_COUNT | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| BITMAP_OR_AGG | BITMAP_OR___AGG | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| BIT_OR_AGG | BIT_OR_AGG | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| BIT_XOR_AGG | BIT_XOR_AGG | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| CARDINALITY | *\*to be defined* |  |
| CAST | CAST  TO_DATE  TO_NUMBER  TO_TIMESTAMP  Not Supported | The function is converted to stub ***‘CAST_STUB’*** and outputs an error, when comes with one of the following not supported statement: ***‘DEFAULT ON CONVERSION ERROR’*** or ***‘MULTISET’***. Also, it is converted to a stub and outputs an **error** if the **data type** is not supported. The function is converted to the ***‘TO_NUMBER’*** function when the expression to cast is of type ***number*** and outputs an **error** indicating that the explicit cast is not possible to be done. The function is converted to the ***‘TO_DATE’*** function when the expression to cast is of type ***date*** and outputs an **error** indicating that the explicit cast is not possible to be done. The function is converted to the ***‘TO_TIMESTAMP’*** function when the expression to cast is of type ***timestamp*** and outputs an error indicating that the explicit cast is not possible to be done. |
| CEIL | CEIL |  |
| CHARTOROWID | *\*to be defined* |  |
| CHECKSUM | *\*to be defined* |  |
| CHR | CHR | ***USING NCHAR_CS*** statement is not supported by the Snowflake function equivalent. The clause is removed. |
| CLUSTER_DETAILS | *\*to be defined* |  |
| CLUSTER_DISTANCE | *\*to be defined* |  |
| CLUSTER_ID | *\*to be defined* |  |
| CLUSTER_PROBABILITY | *\*to be defined* |  |
| CLUSTER_SET | *\*to be defined* |  |
| COALESCE | COALESCE |  |
| COLLATION | COLLATION |  |
| COLLECT | *\*to be defined* |  |
| COMPOSE | *\*to be defined* |  |
| CON_DBID_TO_ID | *\*to be defined* |  |
| CON_GUID_TO_ID | *\*to be defined* |  |
| CON_NAME_TO_ID | *\*to be defined* |  |
| CON_UID_TO_ID | *\*to be defined* |  |
| CONCAT | CONCAT | Every expression parameter will be inside of an ***NVL(expr, ‘ ‘)*** function to avoid an error in case one of the expressions is null. |
| CONVERT | *\*to be defined* |  |
| CORR | CORR | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| CORR_S | *\*to be defined* |  |
| CORR_K | *\*to be defined* |  |
| COS | COS |  |
| COSH | COSH |  |
| COUNT | COUNT |  |
| COVAR_POP | COVAR_POP | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| COVAR_SAMP | COVAR_SAMP | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| CUBE_TABLE | Not Supported | Converted to a stub ***‘CUBE_TABLE_STUB’*** and an **error** is added. |
| CUME_DIST | CUME_DIST | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| CURRENT_DATE | CURRENT_DATE |  |
| CURRENT_TIMESTAMP | CURRENT_TIMESTAMP |  |
| CV | *\*to be defined* |  |
| DATAOBJ_TO_MAT_PARTITION | *\*to be defined* |  |
| DATAOBJ_TO_PARTITION | *\*to be defined* |  |
| DBTIMEZONE | *\*to be defined* |  |
| DECODE | DECODE |  |
| DECOMPOSE | *\*to be defined* |  |
| DENSE_RANK | DENSE_RANK | There are two kinds of syntax, ***aggregate syntax***, and ***analytic syntax***. The ***aggregate syntax*** is not supported and an **error** is added. The analytic syntax is supported but the ***‘SIBLINGS’*** keyword is removed from the ***‘order by’*** ***clause*** and a **warning** is added. |
| DEPTH | *\*to be defined* |  |
| DEREF | *\*to be defined* |  |
| DUMP | *\*to be defined* |  |
| EMPTY_BLOB | *\*to be defined* |  |
| EMPTY_CLOB | *\*to be defined* |  |
| EXISTSNODE | *\*to be defined* |  |
| EXP | EXP |  |
| EXTRACT (datetime) | EXTRACT (datetime)  Not supported | Kept as an ***EXTRACT*** function but outputs a warning when the function has ***‘MINUTE’*** or ***‘TIMEZONE_MINUTE’*** as the first keyword parameter. Converted to a stub ***‘EXTRACT_STUB’*** and outputs an **error** when the first keyword parameter is ***‘TIMEZOME_REGION’*** or ***‘TIMEZONE_ABBR’*** |
| EXTRACT (XML) | Not Supported | Function related to **XML** is not supported. It is converted to a stub ***‘EXTRACT_STUB’*** and an error is added. Please check the following link about how to handle the loading for XML: |
| EXTRACTVALUE | Not Supported | Converted to a stub ***‘EXTRACTVALUE_STUB’*** and an **error** is added. |
| FEATURE_COMPARE | *\*to be defined* |  |
| FEATURE_DETAILS | *\*to be defined* |  |
| FEATURE_ID | *\*to be defined* |  |
| FEATURE_SET | *\*to be defined* |  |
| FEATURE_VALUE | *\*to be defined* |  |
| FIRST | Not Supported | The statement used to indicate that only the **first** or **last** values of the ***aggregate function*** will be returned is not supported. Outputs an **error**. |
| FIRST_VALUE | FIRST_VALUE | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| FLOOR | FLOOR |  |
| FROM_TZ | *\*to be defined* |  |
| GREATEST | GREATEST |  |
| GROUP_ID | *\*to be defined* |  |
| GROUPING | GROUPING |  |
| GROUPING_ID | GROUPING_ID |  |
| HEXTORAW | *\*to be defined* |  |
| INITCAP | INITCAP |  |
| INSTR | POSITION  REGEXP_INSTR | Parameter order is inverted. With ***‘occurrence’***, `REGEXP_INSTR` is used. Position = -1 is automatically translated. Positions < -1 emit [SSC-EWI-OR0020](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md). Also applies to INSTRB, INSTRC, INSTR2, INSTR4. |
| ITERATION_NUMBER | *\*to be defined* |  |
| JSON_ARRAY | *\*to be defined* |  |
| JSON_ARRAYAGG | *\*to be defined* |  |
| JSON | *\*to be defined* |  |
| JSON_MERGE_PATCH | *\*to be defined* |  |
| JSON_OBJECT | *\*to be defined* |  |
| JSON_OBJECTAGG | *\*to be defined* |  |
| JSON_QUERY | *\*to be defined* |  |
| JSON_SCALAR | *\*to be defined* |  |
| JSON_SERIALIZE | *\*to be defined* |  |
| JSON_TABLE | Not Supported | Outputs an error: ***JSON_TABLE IS NOT SUPPORTED.*** |
| JSON_TRANSFORM | *\*to be defined* |  |
| JSON_VALUE | [*JSON_VALUE_UDF*](custom_udfs.md) |  |
| KURTOSIS_POP | *\*to be defined* |  |
| KURTOSIS_SAMP | *\*to be defined* |  |
| LAG | LAG | When the value expression comes with the ***RESPECT*** |
| LAST | Not Supported | The statement used to indicate that only the **first** or **last** values of the ***aggregate function*** will be returned is not supported. Outputs an **error**. |
| LAST_DAY | LAST_DAY |  |
| LAST_VALUE | LAST_VALUE | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| LEAD | LEAD | When the value expression comes with the ***RESPECT | IGNORE NULLS** statement,* the statement is moved outside the parenthesis in order to match the Snowflake grammar. |
| LEAST | LEAST |  |
| LENGTH | LENGTH |  |
| LISTAGG | LISTAGG | The ***overflow clause*** is removed from the function. |
| LN | LN |  |
| LNNVL | *\*to be defined* |  |
| LOCALTIMESTAMP | LOCALTIMESTAMP |  |
| LOG | LOG |  |
| LOWER | LOWER |  |
| LPAD | LPAD |  |
| LTRIM | LTRIM |  |
| MAKE_REF | *\*to be defined* |  |
| MAX | MAX |  |
| MEDIAN | MEDIAN |  |
| MIN | MIN |  |
| MOD | MOD |  |
| MONTHS_BETWEEN | MONTHS_BETWEEN_UDF | Converted to a ***user-defined function***. |
| NANVL | *\*to be defined* |  |
| NCHR | *\*to be defined* |  |
| NEW_TIME | *\*to be defined* |  |
| NEXT_DAY | NEXT_DAY |  |
| NLS_CHARSET_DESCL_LEN | *\*to be defined* |  |
| NLS_CHARSET_ID | *\*to be defined* |  |
| NLS_CHARSET_NAME | *\*to be defined* |  |
| NLS_COLLATION_ID | *\*to be defined* |  |
| NLS_COLLATION_NAME | *\*to be defined* |  |
| NLS_INITCAP | *\*to be defined* |  |
| NLS_LOWER | *\*to be defined* |  |
| NLS_UPPER | *\*to be defined* |  |
| NLSSORT | COLLATE  Not Supported | When the function is outside of a ***‘where’*** or ***‘order by’*** clause, it is not supported and it is converted to stub ***‘NLSSORT_STUB’*** and an **error** is added. Otherwise, if the function is inside a ***‘where’*** or ***‘order by’*** clause, it is converted to the ***COLLATE*** function. |
| NTH_VALUE | NTH_VALUE |  |
| NTILE | NTILE |  |
| NULLIF | NULLIF |  |
| NUMTODSINTERVAL | Not Supported | While the function itself is not supported, some usages can be migrated manually. For example DATEADD can be used to manually migrate a sum between a Date/Timestamp and this function. |
| NUMTOYMINTERVAL | Not Supported | While the function itself is not supported, some usages can be migrated manually. For example DATEADD can be used to manually migrate a sum between a Date/Timestamp and this function. |
| NVL | NVL |  |
| NVL2 | NVL2 |  |
| ORA_DM_PARTITION_NAME | *\*to be defined* |  |
| ORA_DST_AFFECTED | *\*to be defined* |  |
| ORA_DST_CONVERTED | *\*to be defined* |  |
| ORA_DST_ERROR | *\*to be defined* |  |
| ORA_HASH | Not Supported | Converted to a stub ***‘ORA_HASH_STUB’*** and an **error** is added. |
| PATH | *\*to be defined* |  |
| PERCENT_RANK | PERCENT_RANK | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| PERCENTILE_CONT | PERCENTILE_CONT |  |
| PERCENTILE_DISC | PERCENTILE_DISC |  |
| POWER | POWER |  |
| POWERMULTISET | *\*to be defined* |  |
| POWERMULTISET_BY_CARDINALITY | *\*to be defined* |  |
| PREDICTION | *\*to be defined* |  |
| PREDICTION_BOUNDS | *\*to be defined* |  |
| PREDICTION_COST | *\*to be defined* |  |
| PREDICTION_DETAILS | *\*to be defined* |  |
| PREDICTION_PROBABILITY | *\*to be defined* |  |
| PREDICTION_SET | *\*to be defined* |  |
| PRESENTNNV | *\*to be defined* |  |
| PRESENTV | *\*to be defined* |  |
| PREVIOUS | *\*to be defined* |  |
| RANK | RANK | There are two kinds of syntax, ***aggregate syntax***, and ***analytic syntax***. The ***aggregate syntax*** is not supported and an **error** is added. The analytic syntax is supported but the ***‘SIBLINGS’*** keyword is removed from the ***‘order by’*** ***clause*** and a **warning** is added. |
| RATIO_TO_REPORT | RATIO_TO_REPORT |  |
| RAWTOHEX | *\*to be defined* |  |
| RAWTONHEX | *\*to be defined* |  |
| REF | *\*to be defined* |  |
| REFTOHEX | *\*to be defined* |  |
| REGEXP_COUNT | REGEXP_COUNT |  |
| REGEXP_INSTR | REGEXP_INSTR |  |
| REGEXP_REPLACE | REGEXP_REPLACE | In the ***replace_string*** parameter (the third one) is being added an extra **’’** symbol to escape the other one. In the ***match_param*** parameter (last one) the equivalence works like this: **’c’ -> ‘c’** *specifies case-sensitive* **’i’ -> ‘i’** *specifies case-insensitive* **’n’ -> ‘s’** *allows the period(.), which is the match-any-character character, to match the newline character* **’m’ -> ‘m’** *treats the source string as multiple lines* **’x’ -> ‘e’** *ignores whitespace characters* |
| REGEXP_SUBSTR | REGEXP_SUBSTR | In the ***replace_string*** parameter (the second one) is being added an extra **’’** symbol to escape the other one. In the ***match_param*** parameter the equivalence works like this: **’c’ -> ‘c’** *specifies case-sensitive* **’i’ -> ‘i’** *specifies case-insensitive* **’n’ -> ‘s’** *allows the period(.), which is the match-any-character character, to match the newline character* **’m’ -> ‘m’** *treats the source string as multiple lines* **’x’ -> ‘e’** *ignores whitespace characters* |
| REGR | REGR | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| REMAINDER | *\*to be defined* |  |
| REPLACE | REPLACE |  |
| REVERSE | REVERSE |  |
| ROUND | ROUND |  |
| ROUND_TIES_TO_EVEN | *\*to be defined* |  |
| ROW_NUMBER | ROW_NUMBER |  |
| RPAD | RPAD |  |
| ROWIDTOCHAR | *\*to be defined* |  |
| ROWIDTONCHAR | *\*to be defined* |  |
| RTRIM | RTRIM |  |
| SCN_TO_TIMESTAMP | *\*to be defined* |  |
| SESSIONTIMEZONE | *\*to be defined* |  |
| SET | *\*to be defined* |  |
| SIGN | SIGN |  |
| SINH | SINH |  |
| SKEWNESS_POP | *\*to be defined* |  |
| SKEWNESS_SAMP | *\*to be defined* |  |
| SOUNDEX | SOUNDEX |  |
| SQRT | SQRT |  |
| STANDARD_HASH | SHA1  SHA2  MD5 | Converted based on the algorithm parameter: default/`'SHA1'` → `SHA1`, `'SHA256'`/`'SHA384'`/`'SHA512'` → `SHA2(expr, bits)`, `'MD5'` → `MD5`. A [warning (SSC-FDM-OR0032)](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md) is emitted when the input is a non-string parameter. Dynamic algorithm parameters emit [SSC-EWI-OR0138](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md). |
| STATS_BINOMIAL_TEST | *\*to be defined* |  |
| STATS_CROSSTAB | *\*to be defined* |  |
| STATS_F_TEST | *\*to be defined* |  |
| STATS_KS_TEST | *\*to be defined* |  |
| STATS_MODE | *\*to be defined* |  |
| STATS_MW_TEST | *\*to be defined* |  |
| STATS_ONE_WAY_ANOVA | *\*to be defined* |  |
| STATS_T_TEST | *\*to be defined* |  |
| STATS_WSR_TEST | *\*to be defined* |  |
| STDDEV | STDDEV |  |
| STDDEV_POP | STDDEV_POP |  |
| STDDEV_SAMP | STDDEV_SAMP |  |
| SUBSTR | SUBSTR | All the types of SUBSTR ***(SUBSTRB, SUBSTRC, SUBSTR2, SUBSTR4)*** are being converted to **SUBSTR** |
| SUM | SUM |  |
| SYS_CONNECT_BY_PATH | *\*to be defined* |  |
| SYS_CONTEXT | CURRENT_USER CURRENT_SCHEMA CURRENT_DATABASE IS_ROLE_IN_SESSION CURRENT_CLIENT CURRENT_SESSION Not supported | Depending on the parameters of the function SYS_CONTEXT, it is converted to one of the specified functions. ***’CURRENT_SCHEMA’*** converted to ***CURRENT_SCHEMA()***  ***’CURRENT_USER’*** converted to ***CURRENT_USER()***  ***’DB_NAME’*** converted to ***CURRENT_DATABASE()***  ***’ISDBA’*** converted to ***IS_ROLE_IN_SESSION(‘DBA’)***  ***’SERVICE_NAME’*** converted to ***CURRENT_CLIENT()***  ***’SESSIONID’*** converted to ***CURRENT_SESSION()***  ***’GUEST’*** converted to ***IS_ROLE_IN_SESSION(‘GUEST’)***  ***’SESSION_USER’*** converted to ***CURRENT_USER()***  ***’AUTHENTICATED_IDENTITY’*** converted to ***CURENT_USER()***  When a parameter is not supported it is converted to stub ***’SYS_CONTEXT_STUB’*** |
| SYS_DBURIGEN | *\*to be defined* |  |
| SYS_EXTRACT_UTC | *\*to be defined* |  |
| SYS_GUID | *\*to be defined* |  |
| SYS_OP_ZONE_ID | *\*to be defined* |  |
| SYS_TYPEID | *\*to be defined* |  |
| SYS_XMLAGG | *\*to be defined* |  |
| SYS_XMLGEN | *\*to be defined* |  |
| TAN | TAN |  |
| TANH | TANH |  |
| TIMESTAMP_TO_SCN | *\*to be defined* |  |
| TO_APPROX_COUNT_DISTINCT | *\*to be defined* |  |
| TO_APPROX_PERCENTILE | *\*to be defined* |  |
| TO_BINARY_DOUBLE | *\*to be defined* |  |
| TO_BINARY_FLOAT | *\*to be defined* |  |
| TO_BLOB (bfile) | *\*to be defined* |  |
| TO_BLOB (raw) | *\*to be defined* |  |
| TO_CHAR (character) | TO_CHAR |  |
| TO_CHAR (datetime) | TO_CHAR(datetime) Conditional Expression(CASE) Not Supported | Depending on the format parameter, the function is converted to **conditional expression** ***(CASE WHEN)*** or a ***user-defined function*** or kept as ***TO_CHAR(datetime)***. Sometimes the function will be between another function to get an equivalent result. When the function is not supported it is converted to stub ***‘TO_CHAR_STUB’***. Go to To_Char(datetime) to get more information about this function. |
| TO_CHAR (number) | TO_CHAR (number) | If the ***numeric*** parameter is of type ***double*** or ***float*** the function is commented out and an error is added. When comes a format not supported, the ***format*** parameter is removed from the function and an error is added. Not supported formats: ***C L PR RN TM U V***. If the function has the ***nlsparam*** parameter, it is removed from the function and an error is added. |
| TO_CLOB ( bfile | blob ) | TO_VARCHAR | Outputs a **warning** to indicate the ***bfile/blob*** parameters are considered ***binary***. Also outputs an **error** when the function has more than one parameter. |
| TO_CLOB (character) | TO_VARCHAR | Outputs a **warning** to indicate the ***bfile/blob*** parameters are considered ***binary***. Also outputs an **error** when the function has more than one parameter. |
| TO_DATE | TO_DATE | When comes a ***format*** not supported, the function is commented out and an error is added. Not supported formats: ***FXFMDD-MON-YYYY*** ***J*** ***DDD*** ***MONTH*** ***RM*** ***DD-MON-RR*** ***DD-MON-RRRR*** ***SSSSS*** ***YYYY*** ***YYY*** ***Y*** |
| TO_DSINTERVAL | *\*to be defined* |  |
| TO_LOB | *\*to be defined* |  |
| TO_MULTI_BYTE | *\*to be defined* |  |
| TO_NCHAR | *\*to be defined* |  |
| TO_NCHAR (datetime) | *\*to be defined* |  |
| TO_NCLOB | *\*to be defined* |  |
| TO_NUMBER | TO_NUMBER  Not Supported | The ‘***DEFAULT integer ON CONVERSION ERROR’*** statement is removed and outputs an error,  Converted to a stub ***TO_NUMBER_STUB*** and an error is added when the ***’format’*** parameter is not supported and also when the function has the ***’nlsparam’*** parameter. |
| TO_SINGLE_BYTE | *\*to be defined* |  |
| TO_TIMESTAMP | TO_DATE | When comes a ***format*** not supported, the function is commented out and an error is added. Not supported formats: ***FXFMDD-MON-YYYY*** ***J*** ***DDD*** ***MONTH*** ***RM*** ***DD-MON-RR*** ***DD-MON-RRRR*** ***SSSSS*** ***YYYY*** ***YYY*** ***Y*** |
| TO_TIMESTAMP_TZ | TO_DATE | When comes a ***format*** not supported, the function is commented out and an error is added. Not supported formats: ***FXFMDD-MON-YYYY*** ***J*** ***DDD*** ***MONTH*** ***RM*** ***DD-MON-RR*** ***DD-MON-RRRR*** ***SSSSS*** ***YYYY*** ***YYY*** ***Y*** |
| TO_UTC_TIMESTAMP_TZ | *\*to be defined* |  |
| TO_YMINTERVAL | *\*to be defined* |  |
| TRANSLATE | TRANSLATE |  |
| TRANSLATE_USING | TRANSLATE_USING |  |
| TREAT | *\*to be defined* |  |
| TRIM | TRIM  LTRIM  RTRIM | Depending on the first parameter it will be converted to: ***LEADING*** keyword -> ***LTRIM TRAILING*** keyword -> ***RTRIM BOTH*** keyword -> ***TRIM*** None of these keywords -> keep as **TRIM** function. Also, the order of the ***’trimsource’*** parameter and the **’trimcharacter**’ parameter is inverted, and the ***FROM*** keyword is removed from the function. |
| TRUNC (date) | TRUNC(date) | *‘**DAY’*** expression is added as a second parameter of the function. |
| TRUNC (number) | TRUNC(number) |  |
| TZ_OFFSET | *\*to be defined* |  |
| UID | *\*to be defined* |  |
| UNISTR | TO_VARCHAR(expr) | In the ***expr*** parameter is being added the **‘u’** letter after every **‘'** symbol. |
| UPPER | UPPER |  |
| USER | *\*to be defined* |  |
| USERNV | *\*to be defined* |  |
| VALIDATE_CONVERSION | *\*to be defined* |  |
| VALUE | Not Supported | Converted to a stub ***‘VALUE_STUB’*** and an **error** is added. |
| VAR_POP | VAR_POP |  |
| VAR_SAMP | VAR_SAMP |  |
| VARIANCE | VARIANCE | A warning is being added to indicate the Snowflake counterpart may not be functionally equivalent. |
| VSIZE | *\*to be defined* |  |
| WIDTH_BUCKET | WIDTH_BUCKET |  |
| XMLAGG | *\*to be defined* |  |
| XMLCAST | *\*to be defined* |  |
| XMLCDATA | *\*to be defined* |  |
| XMLCOLATVAL | *\*to be defined* |  |
| XMLCOMMENT | *\*to be defined* |  |
| XMLCONCAT | *\*to be defined* |  |
| XMLDIFF | *\*to be defined* |  |
| XMLELEMENT | *\*to be defined* |  |
| XMLEXISTS | *\*to be defined* |  |
| XMLFOREST | *\*to be defined* |  |
| XMLISVALID | *\*to be defined* |  |
| XMLPARSE | *\*to be defined* |  |
| XMLPATCH | *\*to be defined* |  |
| XMLPI | *\*to be defined* |  |
| XMLQUERY | Not Supported |  |
| XMLSEQUENCE | Not Supported | Converted to a stub ***‘XMLSEQUENCE_STUB’*** and an **error** is added. |
| XMLSERIALIZE | *\*to be defined* |  |
| XMLTABLE | Not Supported | Outputs an error: ***XMLTABLE IS NOT SUPPORTED***. |
| XMLTRANSFORM | *\*to be defined* |  |

## Functions Details.

### To_Char(datetime)

According to the format parameter, the function will be converted to:

| Format | Conversion |
| --- | --- |
| AD or BC  A.D. or B.C. | The function will be converted to a ***conditional expression*** ***(CASE)*** where the **format** is added as a result of the ***’when’*** condition. **For Example:** `from: To_Char(DATE ‘1998-12-25’, ‘AD’)` `to: CASE WHEN YEAR(DATE ‘1998-12-25’) < 0 THEN`**`’BC’`** |
| CC or SCC | The function will be converted to a ***conditional expression*** where the original function body is added as a ***when*** condition but it will be between  a ***MOD*** function, after that the original function is added as a ***then*** result but contained by a ***SUBSTR*** function. **For example:**  `from: To_Char(DATE ‘1998-12-25’,’CC’)` `to: CASE WHEN MOD(YEAR(DATE ‘1998-12-25’), 100) = 0` `THEN SUBSTR(TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’), 1, 2)` |
| D | The function will be converted to the snowflake function equivalent but the function body will be between the ***DAYOFWEEK*** datetime part.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’D’)`  `to: TO_CHAR(DAYOFWEEK(DATE ‘1998-12-25’) + 1)` |
| DAY | The function will be converted to a ***user-defined function*** inside of an ***UPPER*** function. **For Example:** `from: To_Char(DATE ‘1998-12-25’,’DAY’)`  `to: UPPER(SNOWCONVERT.PUBLIC.FULL_DAY_NAME_UDF(DATE ‘1998-12-25’))` |
| DDD | The function will be converted to the snowflake function equivalent but the function body will be between the ***DAYOFYEAR*** datetime part.  **For Example:** `from: To_Char(DATE ‘1998-12-25’,’DDD’)`  `to: TO_CHAR(DAYOFYEAR(DATE ‘1998-12-25’))` |
| DD-MON-RR | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: *’DD-MON-YY’.*  **For Example:**  `from: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’DD-MON-RR’)`  `to: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’DD-MON-YY’)` |
| DL | The function will be converted to a ***user-defined function*** plus the ***’OR’*** operator plus snowflake equivalent keeping the function body but changing the format  to: *’**, MMM DD, YYYY***  **For example:**  `from: To_Char(DATE ‘1998-12-25’,’DL’)`  `to: SNOWCONVERT.PUBLIC.FULL_DAY_NAME_UDF(DATE ‘1998-12-25’)` |
| DS | The function will be converted to a combination of the snowflake function  equivalent inside of the ***LTRIM*** function and the snowflake function equivalent.  All the parts combined with the ***’OR’*** operator.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’DS’)`  `to: LTRIM(TO_CHAR(DATE ‘1998-12-25’, ‘MM’), ‘0’)` |
| DY | The function will be converted to the snowflake function equivalent  inside of the ***UPPER*** function.  **For example:** `from: To_Char(DATE ‘1998-12-25’,’DY’)` `to: UPPER(TO_CHAR(DATE ‘1998-12-25’, ‘DY’))` |
| I | The function will be converted to the snowflake function equivalent  inside of the ***SUBSTR*** function.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’I’)`  `to: SUBSTR(TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’), 4, 1)` |
| IW | The function will be converted to the snowflake function equivalent but the function body will be between the ***WEEKISO*** datetime part.  **For Example:**  `from:To_Char(DATE ‘1998-12-25’,’IW’)`  `to: TO_CHAR(WEEKISO(DATE ‘1998-12-25’))` |
| IY | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’YY’**.*  **For example:**  `from:To_Char(DATE ‘1998-12-25’, ‘IY’)`  `to: TO_CHAR(DATE ‘1998-12-25’, ‘YY’)` |
| IYY | The function will be converted to the snowflake function equivalent  inside of the ***SUBSTR*** function and change the format to: ***’YYYY’***.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’IYY’)`  `to: SUBSTR(TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’), 2, 3)` |
| IYYY | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’YYYY’**.*  **For example:**  `from:To_Char(DATE ‘1998-12-25’, ‘IYYY’)`  `to: TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’)` |
| J | The function will be converted to a conditional expression with ‘B.C.’ as a ***’then’***  result and ***’A.D.***’ as an else result.  **For example:**  `from: To_Char(DATE ‘1998-12-25’,’J’)`  `to:` DATE_TO_JULIANDAYS_UDF(DATE ‘1998-12-25’) |
| MI | The function will be converted to the snowflake equivalent. If the function  argument is ***SYSDATE*** it will be changed to ***CURRENT_TIMESTAMP***, otherwise,  if it is of type date, the function will return null.  **For Example:**  `from: To_Char(SYSDATE,’MI’);`  `to: To_Char(CURRENT_TIMESTAMP,’MI’)` |
| MON | The function will be converted to the snowflake function equivalent inside of the ***UPPER*** function.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’MON’)`  `to: UPPER(TO_CHAR(DATE ‘1998-12-25’, ‘MON’))` |
| MONTH | The function will be converted to the snowflake function equivalent  inside of the ***UPPER*** function and change the format to: ***’MMMM’***.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’MONTH’)`  `to: UPPER(TO_CHAR(DATE ‘1998-12-25’, ‘MMMM’))` |
| Q | The function will be converted to the snowflake function equivalent inside of the ***QUARTER*** function.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’Q’)`  `to: TO_CHAR(QUARTER(DATE ‘1998-12-25’))` |
| RM | The function will be converted to a ***user-defined function.***  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’RM’)`  `to: SNOWCONVERT.PUBLIC.ROMAN_MONTH_UDF(DATE ‘1998-12-25’)` |
| RR | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’YY’**.*  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’RR’)`  `to: TO_CHAR(DATE ‘1998-12-25’, ‘YY’)` |
| RR-MON-DD | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’YY-MON-DD’**.*  **For Example:**  `from: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’RR-MON-DD’)`  `to: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’YY-MON-DD’)` |
| RRRR | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’YYYY’**.*  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’RRRR’)`  `to: TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’)` |
| SS | The function will be converted to a combination of a ***conditional expression*** and the snowflake function equivalent.  All the parts combined with the ***’OR’*** operator. **For Example:** `from: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’SS’)`  `to: CASE WHEN SECOND(TIMESTAMP ‘1998-12-25 09:26:50.12’) = 0` `THEN ‘00’ WHEN SECOND(TIMESTAMP ‘1998-12-25 09:26:50.12’) < 10` `THEN ‘0’` |
| SSSS | The function will be converted to the snowflake function equivalent but the  function body will be a concatenation of ***SECOND***, ***MINUTE,*** and ***HOUR*** datetime parts.  **For Example:**  `from: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’SSSS’)`  `to: TO_CHAR(SECOND(TIMESTAMP ‘1998-12-25 09:26:50.12’) +` `MINUTE(TIMESTAMP ‘1998-12-25 09:26:50.12’) * 60 +` `HOUR(TIMESTAMP ‘1998-12-25 09:26:50.12’) * 3600)` |
| TS | The function will be converted to the snowflake function equivalent keeping the  function body but changing the format to: ***’HH:MI:SS PM’**.*  **For Example:**  `from: To_Char(TIMESTAMP ‘1998-12-25 09:26:50.12’,’TS’)`  `to: TO_CHAR(TIMESTAMP ‘1998-12-25 09:26:50.12’, ‘HH:MI:SS PM’)` |
| W | The function will be converted to the ***TRUNC*** function with the ***DAYOFMONTH*** datetime part.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’W’)`  `to: TRUNC(DAYOFMONTH(DATE ‘1998-12-25’) / 7 + 1)` |
| WW | The function will be converted to the ***TRUNC*** function with the ***DAYOFYEAR*** datetime part.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’WW’)`  `to: TRUNC(DAYOFYEAR(DATE ‘1998-12-25’) / 7 + 1)` |
| Y  YYY | The function will be converted to the snowflake function equivalent  inside of the ***SUBSTR*** function and change the format to: ***’YYYY’***.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’Y’)`  `to: SUBSTR(TO_CHAR(DATE ‘1998-12-25’, ‘YYYY’), 4, 1)` |
| Y,YYY | The function will be converted to a combination of the snowflake function equivalent inside of the **SUBSTR** function and a comma symbol. All the parts combined with the ***’OR’*** operator.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’Y,YYY’)`  `to: SUBSTR(TO_CHAR(YEAR(DATE ‘1998-12-25’)), 1, 1)` |
| YEAR  SYEAR | The function will be converted to a ***user-defined function*** inside of an ***UPPER*** function.  **For Example:**  `from: To_Char(DATE ‘1998-12-25’,’YEAR’)`  `to: UPPER(SNOWCONVERT.PUBLIC.YEAR_NAME_UDF(DATE ‘1998-12-25’))` |

## MAX KEEP DENSE_RANK

### Description

The Oracle `MAX KEEP DENSE_RANK` function is an aggregate function that returns the maximum value from a set of values while considering only the rows that have the first (smallest) rank according to the specified ordering. The `KEEP (DENSE_RANK FIRST ORDER BY ...)` clause filters the rows to include only those with the smallest rank value before applying the MAX function. ([Oracle Aggregate Functions Documentation](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/Aggregate-Functions.html#GUID-62BE676B-AF18-4E63-BD14-25206FEA0848)).

### Sample Source Pattern

#### Syntax

##### Oracle

```none
MAX(expression) KEEP (DENSE_RANK FIRST ORDER BY order_by_expression [ASC|DESC])
```

##### Snowflake SQL

```none
FIRST_VALUE(expression) OVER (ORDER BY order_by_expression [ASC|DESC])
```

### Examples

#### Oracle

**Code:**

```sql
SELECT department_id,
       MAX(salary) KEEP (DENSE_RANK FIRST ORDER BY hire_date) AS first_hired_max_salary
FROM employees
GROUP BY department_id;
```

#### Snowflake SQL

**Code:**

```sql
SELECT department_id,
       FIRST_VALUE(salary)
       OVER (
       ORDER BY hire_date) AS first_hired_max_salary
FROM
       employees
GROUP BY department_id;
```

> **Note:**
>
> To ensure a deterministic order for the rows in a window function’s results, the ORDER BY clause must include a key or combination of keys that makes each row unique.

## MIN KEEP DENSE_RANK

### Description

The Oracle `MIN KEEP DENSE_RANK` function is an aggregate function that returns the minimum value from a set of values while considering only the rows that have the last (highest) rank according to the specified ordering. The `KEEP (DENSE_RANK LAST ORDER BY ...)` clause filters the rows to include only those with the highest rank value before applying the MIN function. ([Oracle Aggregate Functions Documentation](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/Aggregate-Functions.html#GUID-62BE676B-AF18-4E63-BD14-25206FEA0848)).

### Sample Source Pattern

#### Syntax

##### Oracle

```none
MIN(expression) KEEP (DENSE_RANK LAST ORDER BY order_by_expression [ASC|DESC])
```

##### Snowflake SQL

```none
LAST_VALUE(expression) OVER (ORDER BY order_by_expression [ASC|DESC])
```

### Examples

#### Oracle

**Code:**

```sql
SELECT department_id,
       MIN(salary) KEEP (DENSE_RANK LAST ORDER BY hire_date) AS first_hired_min_salary
FROM employees
GROUP BY department_id;
```

#### Snowflake SQL

**Code:**

```sql
SELECT department_id,
       LAST_VALUE(salary)
       OVER (
       ORDER BY hire_date) AS first_hired_min_salary
FROM
       employees
GROUP BY department_id;
```

> **Note:**
>
> To ensure a deterministic order for the rows in a window function’s results, the ORDER BY clause must include a key or combination of keys that makes each row unique.

## NLSSORT

### Description

NLSSORT returns a collation key for the character value char and an explicitly or implicitly specified collation. A collation key is a string of bytes used to sort char according to the specified collation. The property of the collation keys is that mutual ordering of two such keys generated for the given collation when compared according to their binary order is the same as mutual ordering of the source character values when compared according to the given collation.. ([NLSSORT in Oracle](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/NLSSORT.html#GUID-781C6FE8-0924-4617-AECB-EE40DE45096D)).

### Sample Source Pattern

#### Syntax

##### Oracle

```none
NLSSORT(char [, 'nlsparam' ])
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/collate)

```none
COLLATE(<string_expression>, '<collation_specification>')
```

### Examples

#### Oracle

**Code:**

```sql
CREATE TABLE test (name VARCHAR2(15));
INSERT INTO test VALUES ('Gaardiner');
INSERT INTO test VALUES ('Gaberd');
INSERT INTO test VALUES ('Gaasten');

SELECT *
  FROM test
  ORDER BY NLSSORT(name, 'NLS_SORT = XDanish');
```

**Result:**

| NAME |
| --- |
| Gaberd |
| Gaardiner. |
| Gaasten |

##### Snowflake SQL

**Code:**

```sql
CREATE OR REPLACE TABLE test (name VARCHAR(15))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO test
VALUES ('Gaardiner');

INSERT INTO test
VALUES ('Gaberd');

INSERT INTO test
VALUES ('Gaasten');

SELECT *
  FROM
  test
ORDER BY
COLLATE(name, '');
```

**Result:**

| NAME |
| --- |
| Gaberd |
| Gaardiner |
| Gaasten |

## TO_NUMBER

### Description

Converts an input expression to a fixed-point number. For NULL input, the output is NULL.

#### Arguments

**Required:**

&#xNAN;*`<expr>`*

An expression of a numeric, character, or variant type.

**Optional:**

*`<format>`*

The SQL format model used to parse the input *`expr`* and return. For more information, see [SQL Format Models](https://docs.snowflake.com/en/sql-reference/sql-format-models).

*`<precision>`*

The maximal number of decimal digits in the resulting number; from 1 to 38. In Snowflake, precision is not used for determination of the number of bytes needed to store the number and does not have any effect on efficiency, so the default is the maximum (38).

*`<scale>`*

The number of fractional decimal digits (from 0 to *`precision`* - 1). 0 indicates no fractional digits (i.e. an integer number). The default scale is 0.

#### Returns

The function returns `NUMBER(`*`precision`*``` ,`` `` ```*`scale`*`)`.

* If the *`precision`* is not specified, then it defaults to 38.
* If the *`scale`* is not specified, then it defaults to 0.

To more information check the [TO_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/to_decimal) in Snowflake documentation.

```sql
SELECT CAST('123,456E+40' AS NUMBER, '999,999EEE') FROM DUAL;
SELECT CAST('12sdsd3,456E+40' AS NUMBER, '999,999EEE') FROM DUAL;
SELECT CAST('12345sdsd' AS NUMBER, '99999') FROM DUAL;
SELECT CAST('12.345678912345678912345678912345678912' AS NUMBER, '99.999999999999999999999999999999999999') FROM DUAL;
SELECT CAST('               12.345678912345678912345678912345678912' AS NUMBER, '99.999999999999999999999999999999999999') FROM DUAL;
SELECT CAST('               -12.345678912345678912345678912345678912' AS NUMBER, '99.999999999999999999999999999999999999') FROM DUAL;
SELECT CAST('12.34567891234567891234567891234567891267' AS NUMBER, '99.999999999999999999999999999999999999') FROM DUAL;
SELECT CAST('123.456E-40' AS NUMBER, '999.9999EEE') FROM DUAL;
select cast('12,345,678,912,345,678,912,345,678,912,345,678,912' as number, '99,999,999,999,999,999,999,999,999,999,999,999,999') from dual;
SELECT CAST('  123.456E-40' AS NUMBER, '999.9999EEE') FROM DUAL;
select cast('       12,345,678,912,345,678,912,345,678,912,345.678912' as number, '99,999,999,999,999,999,999,999,999,999,999.999999') from dual;

SELECT CAST('12.34567891234567891234567891234567891267+' AS NUMBER, '99.999999999999999999999999999999999999S') FROM DUAL;
select cast('12,345,678,912,345,678,912,345,678,912,345,678,912+' as number, '99,999,999,999,999,999,999,999,999,999,999,999,999S') from dual;

select cast('12.48+' as number, '99.99S') from dual;
select cast('  12.48+' as number, '99.99S') from dual;
select cast('12.48+   ' as number, '99.99S') from dual;

SELECT CAST('123.456+E-2' AS NUMBER, '999.9999SEEE') FROM DUAL;
SELECT CAST('123.456+E-2-' AS NUMBER, '999.9999SEEE') FROM DUAL;

SELECT CAST('12356-' AS NUMBER, '99999S') FROM DUAL;

select cast(' 1.0E+123' as number, '9.9EEEE') from dual;
select cast('1.2E+02' as number, 'FM9.9EEEE') from dual;
select cast('123.45' as number, 'FM999.009') from dual;
select cast('123.00' as number, 'FM999.009') from dual;
select cast(' $123.45' as number, 'L999.99') from dual;
select cast('$123.45' as number, 'FML999.99') from dual;
select cast('1234567890+' as number, '9999999999S') from dual;
```

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '123,456E+40' ***/!!!
 CAST('123,456E+40' AS NUMBER(38, 18) , '999,999EEE') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0053 - INCORRECT INPUT FORMAT '12sdsd3,456E+40' ***/!!! CAST('12sdsd3,456E+40' AS NUMBER(38, 18) , '999,999EEE') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0053 - INCORRECT INPUT FORMAT '12345sdsd' ***/!!! CAST('12345sdsd' AS NUMBER(38, 18) , '99999') FROM DUAL;

SELECT
 TO_NUMBER('12.345678912345678912345678912345678912', '99.999999999999999999999999999999999999', 38, 36)
FROM DUAL;

SELECT
 TO_NUMBER('               12.345678912345678912345678912345678912', '99.999999999999999999999999999999999999', 38, 36)
FROM DUAL;

SELECT
 TO_NUMBER('               -12.345678912345678912345678912345678912', '99.999999999999999999999999999999999999', 38, 36)
FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '12.34567891234567891234567891234567891267' ***/!!! CAST('12.34567891234567891234567891234567891267' AS NUMBER(38, 18) , '99.999999999999999999999999999999999999') FROM DUAL;

SELECT
 TO_NUMBER('123.456E-40', '999.9999EEE', 38, 37)
FROM DUAL;

select
 TO_NUMBER('12,345,678,912,345,678,912,345,678,912,345,678,912', '99,999,999,999,999,999,999,999,999,999,999,999,999', 38, 0)
from dual;

SELECT
 TO_NUMBER('  123.456E-40', '999.9999EEE', 38, 37)
FROM DUAL;

select
 TO_NUMBER('       12,345,678,912,345,678,912,345,678,912,345.678912', '99,999,999,999,999,999,999,999,999,999,999.999999', 38, 6)
from dual;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '12.34567891234567891234567891234567891267+' ***/!!! CAST('12.34567891234567891234567891234567891267+' AS NUMBER(38, 18) , '99.999999999999999999999999999999999999S') FROM DUAL;

select
 TO_NUMBER('12,345,678,912,345,678,912,345,678,912,345,678,912+', '99,999,999,999,999,999,999,999,999,999,999,999,999S', 38, 0)
from dual;

select
 TO_NUMBER('12.48+', '99.99S', 38, 2)
from dual;

select
 TO_NUMBER('  12.48+', '99.99S', 38, 2)
from dual;

select
 TO_NUMBER('12.48+   ', '99.99S', 38, 2)
from dual;

SELECT
 TO_NUMBER('123.456+E-2', '999.9999SEEE', 38, 5)
FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0053 - INCORRECT INPUT FORMAT '123.456+E-2-' ***/!!! CAST('123.456+E-2-' AS NUMBER(38, 18) , '999.9999SEEE') FROM DUAL;

SELECT
 TO_NUMBER('12356-', '99999S', 38, 0)
FROM DUAL;

select
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE ' 1.0E+123' ***/!!! cast(' 1.0E+123' as NUMBER(38, 18) , '9.9EEEE') from dual;

select
 TO_NUMBER('1.2E+02', 'FM9.9EEEE', 38, 0)
from dual;

select
 TO_NUMBER('123.45', 'FM999.009', 38, 2)
from dual;

select
 TO_NUMBER('123.00', 'FM999.009', 38, 2)
from dual;

select
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0045 - CAST TYPE L AND FML NOT SUPPORTED ***/!!! cast(' $123.45' as NUMBER(38, 18) , 'L999.99') from dual;

select
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0045 - CAST TYPE L AND FML NOT SUPPORTED ***/!!! cast('$123.45' as NUMBER(38, 18) , 'FML999.99') from dual;

select
 TO_NUMBER('1234567890+', '9999999999S', 38, 0)
from dual;
```

#### Recommendations

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

### Related EWIs

1. [SSC-EWI-OR0045](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Cast type L and FML are not supported.
2. [SSC-EWI-OR0050](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Input Expression is out of the range.
3. [SSC-EWI-OR0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Incorrect input format.

---
title: SnowConvert AI - Oracle - Built-In packages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/built-in-packages.md
section: Migrations
---

# SnowConvert AI - Oracle - Built-In packages

Translation reference for Built-in packages.

## Description

> Oracle supplies many PL/SQL packages with the Oracle server to extend database functionality and provide PL/SQL access to SQL features. ([Oracle PL/SQL Built-in Packages](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/introduction-to-oracle-supplied-plsql-packages-and-types.html#GUID-4AA6AA30-CAEE-4DCD-B214-9AD51D0229B4))

## DBMS_OUTPUT

### Description

> The `DBMS_OUTPUT` package is especially useful for displaying PL/SQL debugging information. ([Oracle PL/SQL DBMS_OUTPUT](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_OUTPUT.html#GUID-C1400094-18D5-4F36-A2C9-D28B0E12FD8C))

### PUT_LINE procedure

Translation reference for DBMS_OUTPUT.PUT_LINE.

#### Description

> This procedure places a line in the buffer. ([Oracle PL/SQL DBMSOUTPUT.PUT_LINE](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_OUTPUT.html#GUID-19FA480D-591E-4584-9650-5D37C4AFA530))

This UDF is implemented using a temporary table to insert the data to be displayed to replicate the functionality of Oracle `DBMS_OUTPUT.PUT_LINE` function.

#### Syntax

```sql
 DBMS_OUTPUT.PUT_LINE(LOG VARCHAR);
```

#### Custom procedure

##### Setup data

The `DBMS_OUTPUT` schema must be created.

```sql
CREATE SCHEMA IF NOT EXISTS DBMS_OUTPUT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}';
```

##### DBMS_OUTPUT.PUT_LINE(VARCHAR)

##### **Parameters**

* **LOG**: Item in a buffer that you want to display.

```sql
CREATE OR REPLACE procedure DBMS_OUTPUT.PUT_LINE_UDF(LOG VARCHAR)
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS $$

  //Performance may be affected by using this UDF.
  //If you want to start logging information, please uncomment the implementation.
  //Once the calls of DBMS_OUTPUT.PUT_LINE have been done, please use
  //the following query to read all the logs:
  //SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG.

  //snowflake.execute({sqlText:`
  //CREATE TEMPORARY TABLE IF NOT EXISTS DBMS_OUTPUT_LOG
  //(
  //  WHEN TIMESTAMP,
  //  DATABASE VARCHAR,
  //  LOG VARCHAR
  //);`});

  //snowflake.execute({sqlText:`INSERT INTO DBMS_OUTPUT_LOG(WHEN, DATABASE, LOG) VALUES (CURRENT_TIMESTAMP,CURRENT_DATABASE(),?)`, binds:[LOG]});
  return LOG;
$$;
```

> **Note:**
>
> * Note that this is using a temporary table, if you want the data to persist after a session ends, please remove TEMPORARY from the CREATE TABLE.
> * The [temporary tables](https://docs.snowflake.com/en/user-guide/tables-temp-transient.html#temporary-tables) store non-permanent transitory data. They only exist within the session in which they were created and persist only for the rest of the session. After the session ends, the data stored in the table is completely removed from the system and is therefore not recoverable, either by the user who created the table or by Snowflake.

> **Warning:**
>
> If you do not use the temporary table, keep in mind that you may need another column in the table where the USER running DBMS_OUTPUT.PUT_LINE UDF is inserted to avoid confusion.

##### Usage example

###### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC
IS
BEGIN
    DBMS_OUTPUT.PUT_LINE('Test');
END;

CALL PROC();
```

###### Result

```sql
|DBMS_OUTPUT.PUT_LINE('test') |
|-----------------------------|
|test                         |
```

###### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF('Test');
    END;
$$;

CALL PROC();
```

###### Result

```sql
|ROW |WHEN                    |DATABASE    |LOG      |
|----|------------------------|------------|---------|
| 1  |2022-04-25 11:16:23.844 |CODETEST    |test     |
```

#### Known Issues

* The UDF code will remain commented out because it can affect performance, if the user decides to use it, they just need to uncomment the code.
* The user can modify the UDF so that the necessary information is inserted into the DBMS_OUTPUT.PUT_LINE table.

#### Related EWIs

1. [SSC-FDM-OR0035](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Check UDF implementation for DBMS_OUTPUT.PUT_LINE_UDF.

## DBMS_LOB

### Description

> The `DBMS_LOB` package provides subprograms to operate on `BLOBs`, `CLOBs`, `NCLOBs`, `BFILEs`, and temporary `LOBs`. You can use `DBMS_LOB` to access and manipulate specific parts of a LOB or complete LOBs. ([Oracle PL/SQL DBMS_LOB](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_LOB.html#GUID-A35DE03B-41A6-4E55-8CDE-77737FED9306))

### SUBSTR Function

Translation reference for DBMS_LOB.SUBSTR.

#### Description

> This function returns `amount` bytes or characters of a LOB, starting from an absolute `offset` from the beginning of the LOB. ([Oracle PL/SQL DBMS_LOB.SUBSTR](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_LOB.html))

This built-in function is replaced with Snowflake [SUBSTR function](https://docs.snowflake.com/en/sql-reference/functions/substr.html#substr-substring). However, there are some differences.

> **Note:**
>
> The **amount** and **offset** parameters are inverted in Snowflake

#### Syntax

```sql
DBMS_LOB.SUBSTR (
   lob_loc     IN    BLOB,
   amount      IN    INTEGER := 32767,
   offset      IN    INTEGER := 1)
  RETURN RAW;

DBMS_LOB.SUBSTR (
   lob_loc     IN    CLOB   CHARACTER SET ANY_CS,
   amount      IN    INTEGER := 32767,
   offset      IN    INTEGER := 1)
  RETURN VARCHAR2 CHARACTER SET lob_loc%CHARSET;

DBMS_LOB.SUBSTR (
   file_loc     IN    BFILE,
   amount      IN    INTEGER := 32767,
   offset      IN    INTEGER := 1)
  RETURN RAW;
```

#### Function overloads

**DBMS_LOB.SUBSTR(‘string’, amount, offset)**

##### Usage example

###### Oracle

```sql
SELECT
-- 1. "some magic here"
DBMS_LOB.SUBSTR('some magic here', 15, 1) "1",
-- 2. "some"
DBMS_LOB.SUBSTR('some magic here', 4, 1) "2",
-- 3. "me magic here"
DBMS_LOB.SUBSTR('some magic here', 15, 3) "3",
-- 4. "magic"
DBMS_LOB.SUBSTR('some magic here', 5, 6) "4",
-- 5. "here"
DBMS_LOB.SUBSTR('some magic here', 20, 12) "5",
-- 6. " "
DBMS_LOB.SUBSTR('some magic here', 250, 16) "6"
FROM DUAL;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

###### Snowflake

```sql
SELECT
-- 1. "some magic here"
SUBSTR('some magic here', 1, 15) "1",
-- 2. "some"
SUBSTR('some magic here', 1, 4) "2",
-- 3. "me magic here"
SUBSTR('some magic here', 3, 15) "3",
-- 4. "magic"
SUBSTR('some magic here', 6, 5) "4",
-- 5. "here"
SUBSTR('some magic here', 12, 20) "5",
-- 6. " "
SUBSTR('some magic here', 16, 250) "6"
FROM DUAL;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

##### DBMS_LOB.SUBSTR(**B**LOB, amount, offset)

###### Usage example

> **Warning:**
>
> Result values in Oracle and Snowflake are being converted from bytes to strings for easier understanding of the function.
>
> For **Snowflake** consider using:
>
> **hex_decode_string( to_varchar(SUBSTR(blob_column, 1, 6), ‘HEX’));**
>
> and for **Oracle** consider using:
>
> **utl_raw.cast_to_varchar2(DBMS_LOB.SUBSTR(blob_column, 1, 6));**
>
> to obtain the result as a string.

###### Oracle

```sql
-- Create Table
CREATE TABLE blobtable( blob_column BLOB );

-- Insert sample value
INSERT INTO blobtable VALUES (utl_raw.cast_to_raw('some magic here'));

-- Select different examples
SELECT
-- 1. "some magic here"
DBMS_LOB.SUBSTR(blob_column, 15, 1) "1",
-- 2. "some"
DBMS_LOB.SUBSTR(blob_column, 4, 1) "2",
-- 3. "me magic here"
DBMS_LOB.SUBSTR(blob_column, 15, 3) "3",
-- 4. "magic"
DBMS_LOB.SUBSTR(blob_column, 5, 6) "4",
-- 5. "here"
DBMS_LOB.SUBSTR(blob_column, 20, 12) "5",
-- 6. " "
DBMS_LOB.SUBSTR(blob_column, 250, 16) "6"
FROM BLOBTABLE;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

###### Snowflake

```sql
-- Create Table
CREATE OR REPLACE TABLE blobtable ( blob_column BINARY
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

-- Insert sample value
INSERT INTO blobtable
VALUES (
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'utl_raw.cast_to_raw' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS cast_to_raw);

-- Select different examples
SELECT
-- 1. "some magic here"
SUBSTR(blob_column, 1, 15) "1",
-- 2. "some"
SUBSTR(blob_column, 1, 4) "2",
-- 3. "me magic here"
SUBSTR(blob_column, 3, 15) "3",
-- 4. "magic"
SUBSTR(blob_column, 6, 5) "4",
-- 5. "here"
SUBSTR(blob_column, 12, 20) "5",
-- 6. " "
SUBSTR(blob_column, 16, 250) "6"
FROM
BLOBTABLE;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

> **Warning:**
>
> **Note:** `UTL_RAW.CAST_TO_RAW()` is currently not being transformed to `TO_BINARY()`. The function is used to show the functional equivalence of the example.

##### DBMS_LOB.SUBSTR(CLOB, amount, offset)

###### Usage example

###### Oracle

```sql
-- Create Table
CREATE TABLE clobtable(clob_column CLOB);

-- Insert sample value
INSERT INTO clobtable VALUES ('some magic here');

-- Select
SELECT
-- 1. "some magic here"
DBMS_LOB.SUBSTR(clob_column, 15, 1) "1",
-- 2. "some"
DBMS_LOB.SUBSTR(clob_column, 4, 1) "2",
-- 3. "me magic here"
DBMS_LOB.SUBSTR(clob_column, 15, 3) "3",
-- 4. "magic"
DBMS_LOB.SUBSTR(clob_column, 5, 6) "4",
-- 5. "here"
DBMS_LOB.SUBSTR(clob_column, 20, 12) "5",
-- 6. " "
DBMS_LOB.SUBSTR(clob_column, 250, 16) "6"
FROM clobtable;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

###### Snowflake

```sql
-- Create Table
CREATE OR REPLACE TABLE clobtable (clob_column VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
;

-- Insert sample value
INSERT INTO clobtable
VALUES ('some magic here');

-- Select
SELECT
-- 1. "some magic here"
SUBSTR(clob_column, 1, 15) "1",
-- 2. "some"
SUBSTR(clob_column, 1, 4) "2",
-- 3. "me magic here"
SUBSTR(clob_column, 3, 15) "3",
-- 4. "magic"
SUBSTR(clob_column, 6, 5) "4",
-- 5. "here"
SUBSTR(clob_column, 12, 20) "5",
-- 6. " "
SUBSTR(clob_column, 16, 250) "6"
FROM
clobtable;
```

###### Result

```sql
1              |2   |3            |4    |5   |6|
---------------+----+-------------+-----+----+-+
some magic here|some|me magic here|magic|here| |
```

> **Warning:**
>
> **Note:** `UTL_RAW.CAST_TO_RAW()` is currently not being transformed to `TO_BINARY()`. The function is used to show the functional equivalence of the example.

##### DBMS_LOB.SUBSTR(BFILE, amount, offset)

###### Usage example

Using DBMS_LOB.SUBSTR() on a BFILE column returns a substring of the file content.

> **Warning:**
>
> Next example is **not** a current migration, but a functional example to show the differences of the SUBSTR function on BFILE types.

**File Content (file.txt):**

```sql
some magic here
```

###### Oracle

```sql
CREATE OR REPLACE PROCEDURE bfile_substr_procedure
IS
    fil BFILE := BFILENAME('MY_DIR', 'file.txt');
BEGIN
    DBMS_LOB.FILEOPEN(fil, DBMS_LOB.FILE_READONLY);
    DBMS_OUTPUT.PUT_LINE(UTL_RAW.CAST_TO_VARCHAR2(DBMS_LOB.SUBSTR(fil,9,1)));
    --Console Output:
    -- "some magi"
    DBMS_LOB.FILECLOSE(fil);
END;
```

###### Console Log

```sql
DBMS_OUTPUT.PUT_LINE(UTL_RAW.CAST_TO_VARCHAR2(DBMS_LOB.SUBSTR(fil,4,1))) |
-------------------------------------------------------------------------|
some magi                                                                |
```

###### Snowflake

**BFILE** columns are translated into **VARCHAR** columns, therefore applying a `SUBSTR` function on the same column would return a substring of the file name, not the file content.

```sql
CREATE OR REPLACE PROCEDURE bfile_substr_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        fil VARCHAR := PUBLIC.BFILENAME_UDF('MY_DIR', 'file.txt');
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_LOB.FILEOPEN' IS NOT CURRENTLY SUPPORTED. ***/!!!
        DBMS_LOB.FILEOPEN(:fil,
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_LOB.FILE_READONLY' IS NOT CURRENTLY SUPPORTED. ***/!!!
        '' AS FILE_READONLY);
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF(
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'UTL_RAW.CAST_TO_VARCHAR2' IS NOT CURRENTLY SUPPORTED. ***/!!!
        '' AS CAST_TO_VARCHAR2);
        --Console Output:
        -- "some magi"
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_LOB.FILECLOSE' IS NOT CURRENTLY SUPPORTED. ***/!!!
        DBMS_LOB.FILECLOSE(:fil);
    END;
$$;
```

###### Result

| SUBSTR(bfile_column, 1, 9) |
| --- |
| MY_DIR\fi |

#### Known Issues

##### 1. Using DBMS_LOB.SUBSTR with BFILE columns

The current transformation for BFILE datatypes in columns is VARCHAR, where the name of the file is stored as a string. Therefore applying the SUBSTR function on a BFILE column after transformation will return a substring of the file name, while Oracle would return a substring of the file content.

#### Related EWIs

1. [SSC-EWI-OR0076](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Built In Package Not Supported.
2. [SSC-FDM-OR0035](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.

## UTL_FILE

### Description

> With `UTL_FILE` package, PL/SQL programs can read and write text files. ([Oracle PL/SQL UTL_FILE](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-EBC42A36-EB72-4AA1-B75F-8CF4BC6E29B4))

### FCLOSE procedure

Translation reference for UTL_FILE.FCLOSE.

#### Description

> This procedure closes an open file identified by a file handle. ([Oracle PL/SQL UTL_FILE.FCLOSE](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-68874564-1A2C-4071-8D48-60539C805E0D))

This procedure is implemented using Snowflake [STAGE](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html) to store the written text files.

> **Note:**
>
> This procedure needs to be used in conjunction with:
>
> * `UTL_FILE.FOPEN` procedure

#### Syntax

```sql
UTL_FILE.FCLOSE(
    FILE VARCHAR
    );
```

#### Setup data

* The `UTL_FILE` schema must be created.

```sql
CREATE SCHEMA IF NOT EXISTS UTL_FILE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}';
```

* If you want to download the file, run the following command.

```sql
GET @UTL_FILE.utlfile_local_directory/<filename> file://<path_to_file>/<filename>;
```

> **Warning:**
>
> * The [GET](https://docs.snowflake.com/en/sql-reference/sql/get.html) command runs in [Snowflake CLI](https://docs.snowflake.com/en/user-guide/snowsql-install-config.html).

#### Custom procedure overloads

##### UTL_FILE.FCLOSE(VARCHAR)

###### **Parameters**

* **FILE**: Active file handler returned from the call to `UTL_FILE.FOPEN`

###### Functionality

This procedure uses the `FOPEN_TABLES_LINES` table created in the `UTL_FILE.FOPEN` procedure.

This procedure writes to the utlfile_local_directory stage all lines with the same `FHANDLE` from the file in `FOPEN_TABLES_LINES`.

```sql
CREATE OR REPLACE PROCEDURE UTL_FILE.FCLOSE_UDF(FILE VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
     DECLARE
        fhandle VARCHAR;
        fileParse VARIANT;
        File_is_read_only exception;
        fileNameConcat VARCHAR;
        copyIntoQuery VARCHAR ;
    BEGIN
        fileParse:= PARSE_JSON(FILE);
        fhandle:= :fileParse:handle;
        fileNameConcat:= '@UTL_FILE.utlfile_local_directory/'||:fileParse:name;
        copyIntoQuery:= 'COPY INTO '||:fileNameConcat||' FROM (SELECT LINE FROM UTL_FILE.FOPEN_TABLES_LINES WHERE FHANDLE = ? ORDER BY SEQ) FILE_FORMAT= (FORMAT_NAME = my_csv_format COMPRESSION=NONE)   OVERWRITE=TRUE';
        EXECUTE IMMEDIATE :copyIntoQuery USING (fhandle);
        DELETE FROM UTL_FILE.FOPEN_TABLES_LINES WHERE FHANDLE = :fhandle;
        DELETE FROM UTL_FILE.FOPEN_TABLES WHERE FHANDLE = :fhandle;
    END
$$;
```

> **Note:**
>
> * Note that this procedure uses the **stage** that was created previously. For now, if you want to write the file in another stage, you must modify the name.
> * These procedures are implemented for the internal stages in the [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-location.html)

##### Usage example

###### Oracle

```sql
DECLARE
    w_file UTL_FILE.FILE_TYPE;
BEGIN
    w_file:= UTL_FILE.FOPEN('MY_DIR','test.csv','w',1024);
    UTL_FILE.PUT_LINE(w_file,'New line');
    UTL_FILE.FCLOSE(w_file);
END;
```

> **Warning:**
>
> To run this example, see [`ORACLE UTL_FILE`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-FA16A38B-26AA-4002-9BE0-7D3950557F8C)

###### Snowflake

```sql
DECLARE
    w_file OBJECT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'UTL_FILE.FILE_TYPE' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/ := OBJECT_CONSTRUCT();
    call_results VARIANT;
BEGIN
    w_file:=
    --** SSC-FDM-OR0036 - PARAMETERS: 'LOCATION, MAX_LINESIZE_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
    UTL_FILE.FOPEN_UDF('MY_DIR','test.csv','w',1024);
    --** SSC-FDM-OR0036 - PARAMETERS: 'AUTOFLUSH_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
    call_results := (
        CALL UTL_FILE.PUT_LINE_UDF(:w_file,'New line')
    );
    call_results := (
        CALL UTL_FILE.FCLOSE_UDF(:w_file)
    );
    RETURN call_results;
END;
```

#### Known Issues

##### 1. **Modify the procedure for changing the name of the stage.**

The user can modify the procedure if it is necessary to change the name of the stage.

##### 2. Location **static.**

The location used to write to this procedure is static. A new version of the procedure is expected to increase its extensibility by using the location that has the `FILE` parameter.

##### 5. Files supported.

This procedure for now, only writes .CSV files.

#### Related EWIs

1. [SSC-FDM-0015](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Data Type Not Recognized.
2. [SSC-FDM-OR0036](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Unnecessary built-in packages parameters.

### FOPEN procedure

Translation reference for UTL_FILE.FOPEN.

#### Description

> This procedure opens a file. ([Oracle PL/SQL UTL_FILE.FOPEN](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-DF14ADC3-983D-4E0F-BE2C-60733FF58539))

This procedure is implemented using Snowflake [STAGE](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html) to store the text files.

The user is in charge of uploading the local files to the [STAGE](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html) to be used by the procedure.

> **Note:**
>
> This procedure needs to be used in conjunction with:
>
> * `UTL_FILE.FCLOSE` procedure

#### Syntax

```sql
UTL_FILE.FOPEN(
    LOCATION VARCHAR,
    FILENAME VARCHAR,
    OPEN_MODE VARCHAR,
    MAX_LINESIZE NUMBER,
    );
```

#### Setup data

* The `UTL_FILE` schema must be created.

```sql
CREATE SCHEMA IF NOT EXISTS UTL_FILE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}';
```

* Create the stage `utlfile_local_directory`.

```sql
CREATE OR REPLACE FILE FORMAT  my_csv_format TYPE = csv;

CREATE OR REPLACE STAGE utlfile_local_directory
  file_format = my_csv_format;
```

* If the value in the `OPEN_MODE` parameter is **w** or **r** it is necessary to upload the file in the `utlfile_local_directory`.

```sql
PUT file://<path_to_file>/<filename> @UTL_FILE.utlfile_local_directory auto_compress=false;
```

> **Warning:**
>
> * The [PUT](https://docs.snowflake.com/en/sql-reference/sql/put.html) command runs in [Snowflake CLI](https://docs.snowflake.com/en/user-guide/snowsql-install-config.html).

#### Custom procedure overloads

##### UTL_FILE.FOPEN( VARCHAR, VARCHAR)

###### **Parameters**

* **FILENAME:** The name of the file, including extension\*\*.\*\*
* **OPEN_MODE:** Specifies how the file is opened.

###### **Open modes**

The Oracle Built-in package [`UTL_FILE.FOPEN`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-DF14ADC3-983D-4E0F-BE2C-60733FF58539) procedure supports six modes of how to open the file, but only three of them are supported in the Snowscripting procedure.

| OPEN_MODE | DESCRIPTION | STATUS |
| --- | --- | --- |
| w | Write mode | Supported |
| a | Append mode | Supported |
| r | Read mode | Supported |
| rb | Read byte mode | Unsupported |
| wb | Write byte mode | Unsupported |
| ab | Append byte mode | Unsupported |

###### Functionality

This procedure uses two tables with which the operation of opening a file will be emulated. The `FOPEN_TABLES` table will store the files that are open and the `FOPEN_TABLES_LINES` table stores the lines that each file owns.

If the file is opened in write mode, a new file is created, if it is opened in read or append mode, it loads the lines of the file in `FOPEN_TABLES_LINES` and inserts the file in `FOPEN_TABLES`.

```sql
CREATE OR REPLACE PROCEDURE UTL_FILE.FOPEN_UDF(FILENAME VARCHAR,OPEN_MODE VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS $$
    DECLARE
        fhandle VARCHAR;
        key VARCHAR;
        status VARCHAR;
        File_is_not_loaded_on_stage exception;
        fileNameConcat VARCHAR:= '@UTL_FILE.utlfile_local_directory/'||:FILENAME;
        copyIntoQuery VARCHAR DEFAULT 'COPY INTO UTL_FILE.FOPEN_TABLES_LINES (FHANDLE, LINE) FROM (SELECT ? , stageFile.$1 FROM '||:fileNameConcat||' stageFile)';
    BEGIN
        CREATE TABLE IF NOT EXISTS UTL_FILE.FOPEN_TABLES
        (
          FHANDLE VARCHAR,
          FILENAME VARCHAR,
          OPEN_MODE VARCHAR
        );

        CREATE TABLE IF NOT EXISTS UTL_FILE.FOPEN_TABLES_LINES
        (
          SEQ    NUMBER AUTOINCREMENT,
          FHANDLE VARCHAR,
          LINE    VARCHAR
        );
        SELECT FHANDLE INTO fhandle FROM UTL_FILE.FOPEN_TABLES WHERE FILENAME = :FILENAME;
        SELECT UUID_STRING() INTO key;
        IF (OPEN_MODE = 'w') THEN
            INSERT INTO UTL_FILE.FOPEN_TABLES(FHANDLE, FILENAME, OPEN_MODE) VALUES(:key,:FILENAME,:OPEN_MODE);
            RETURN TO_JSON({ 'name': FILENAME, 'handle': key});
        ELSE
            IF (fhandle IS NULL) THEN
                EXECUTE IMMEDIATE :copyIntoQuery USING (key);
                SELECT OBJECT_CONSTRUCT(*):status INTO status FROM table(result_scan(last_query_id()));
                IF (status = 'LOADED') THEN
                    INSERT INTO UTL_FILE.FOPEN_TABLES(FHANDLE, FILENAME, OPEN_MODE) VALUES(:key,:FILENAME,:OPEN_MODE);
                    RETURN TO_JSON({'name': FILENAME, 'handle': key});
                ELSE
                    raise File_is_not_loaded_on_stage;
                END IF;
            ELSE
                UPDATE UTL_FILE.FOPEN_TABLES SET OPEN_MODE = :OPEN_MODE WHERE FHANDLE = :fhandle;
                RETURN TO_JSON({'name': FILENAME, 'handle': fhandle});
           END IF;
        END IF;
    END
$$;
```

> **Note:**
>
> * Note that this procedure uses the **stage** that was created previously. For now, if you want to use another name for the stage, you must modify the procedure.
> * These procedures are implemented for the internal stages in the [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html)

##### Usage example

###### Oracle

```sql
DECLARE
    w_file UTL_FILE.FILE_TYPE;
BEGIN
    w_file:= UTL_FILE.FOPEN('MY_DIR','test.csv','w',1024);
END;
```

> **Warning:**
>
> To run this example, see [`ORACLE UTL_FILE`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-FA16A38B-26AA-4002-9BE0-7D3950557F8C)

###### Snowflake

```sql
DECLARE
    w_file OBJECT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'UTL_FILE.FILE_TYPE' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/ := OBJECT_CONSTRUCT();
BEGIN
    w_file:=
    --** SSC-FDM-OR0036 - PARAMETERS: 'LOCATION, MAX_LINESIZE_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
    UTL_FILE.FOPEN_UDF('MY_DIR','test.csv','w',1024);
END;
```

#### Known Issues

##### 1. **Modify the procedure for changing the name of the stage.**

The user can modify the procedure if it is necessary to change the name of the stage.

##### 2. **`LOCATION` parameter is not used.**

The `LOCATION` parameter is not used now because the stage used in the procedure is static. It is planned for an updated version of the procedure to increase its extensibility by using this parameter to enter the name of the stage where the file you want to open is located.

##### 3. `MAX_LINESIZE` parameter is not used.

The Oracle Built-in package [`UTL_FILE.FOPEN`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-DF14ADC3-983D-4E0F-BE2C-60733FF58539) procedure has the `MAX_LINESIZE` parameter, but in the Snowscripting procedure it is removed because it is not used.

##### 4. `OPEN_MODE` values supported.

This procedure supports *write* (**w**), *read* (**r**), and *append* (**a**) modes to open files.

##### 5. Files supported.

This procedure for now, only supports .CSV files.

#### Related EWIs

1. [SSC-FDM-0015](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Data Type Not Recognized.
2. [SSC-FDM-OR0036](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): UnnecessaryBuiltInPackagesParameters

### PUT_LINE procedure

Translation reference for UTL_FILE.PUT_LINE.

#### Description

> This procedure writes the text string stored in the buffer parameter to the open file identified by the file handle. ([Oracle PL/SQL UTL_FILE.PUT_LINE](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-BC046363-6F14-4128-B4D2-836DDBDB9B48))

#### Syntax

```sql
UTL_FILE.PUT_LINE(
    FILE VARCHAR,
    BUFFER VARCHAR,
    );
```

#### Setup data

* The `UTL_FILE` schema must be created.

```sql
CREATE SCHEMA IF NOT EXISTS UTL_FILE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}';
```

#### Custom UDF

##### UTL_FILE.PUT_LINE(VARCHAR, VARCHAR)

###### **Parameters**

* **FILE**: Active file handler returned from the call to `UTL_FILE.FOPEN`
* **BUFFER:** Text buffer that contains the text to be written to the file\*\*.\*\*

###### Functionality

This procedure uses the `FOPEN_TABLES_LINES` table created in the `UTL_FILE.FOPEN` procedure.

If the `OPEN_MODE` of the file is *write* (**w**) or *append* (**a**), it inserts the buffer into `FOPEN_TABLES_LINES`, but if the `OPEN_MODE` is read (**r**), it throws the `File_is_read_only` exception.

```sql
CREATE OR REPLACE PROCEDURE UTL_FILE.PUT_LINE_UDF(FILE VARCHAR,BUFFER VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS $$
    DECLARE
        openMode VARCHAR;
        openModeTemp VARCHAR;
        fhandle VARCHAR;
        fileParse VARIANT;
        File_is_read_only exception;
    BEGIN
        fileParse:= PARSE_JSON(FILE);
        fhandle:= :fileParse:handle;
        SELECT OPEN_MODE INTO openModeTemp FROM UTL_FILE.FOPEN_TABLES WHERE FHANDLE = :fhandle;
        IF (openModeTemp = 'a' or openModeTemp = 'w') THEN
            INSERT INTO UTL_FILE.FOPEN_TABLES_LINES(FHANDLE,LINE) VALUES(:fhandle,:BUFFER);
        ELSE
            raise File_is_read_only;
        END IF;
    END
$$;

-- This SELECT is manually added and not generated by SnowConvert AI
SELECT * FROM UTL_FILE.FOPEN_TABLES_LINES;
```

> **Warning:**
>
> **Note:**
>
> * To use this procedure you must open the file with UTL_FILE.FOPEN

##### Usage example

###### Oracle

```sql
DECLARE
    w_file UTL_FILE.FILE_TYPE;
BEGIN
    w_file:= UTL_FILE.FOPEN('MY_DIR','test.csv','w',1024);
    UTL_FILE.PUT_LINE(w_file,'New line');
END;
```

> **Warning:**
>
> To run this example, see [`ORACLE UTL_FILE`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-FA16A38B-26AA-4002-9BE0-7D3950557F8C)

###### Snowflake

```sql
DECLARE
    w_file OBJECT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'UTL_FILE.FILE_TYPE' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/ := OBJECT_CONSTRUCT();
    call_results VARIANT;
BEGIN
    w_file:=
    --** SSC-FDM-OR0036 - PARAMETERS: 'LOCATION, MAX_LINESIZE_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
    UTL_FILE.FOPEN_UDF('MY_DIR','test.csv','w',1024);
    --** SSC-FDM-OR0036 - PARAMETERS: 'AUTOFLUSH_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
    call_results := (
        CALL UTL_FILE.PUT_LINE_UDF(:w_file,'New line')
    );
    RETURN call_results;
END;
```

#### Known Issues

##### 1. `AUTOFLUSH` parameter is not used.

The Oracle Built-in package [`UTL_FILE.PUT_LINE`](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/UTL_FILE.html#GUID-BC046363-6F14-4128-B4D2-836DDBDB9B48) procedure has the `AUTOFLUSH` parameter, but in the Snowscripting procedure it is removed because it is not used.

#### Related EWIs

1. [SSC-FDM-0015](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Data Type Not Recognized.
2. [SSC-FDM-OR0036](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Unnecessary built-in packages parameters.

## DBMS_RANDOM

### Description

> The `DBMS_RANDOM` package provides a built-in random number generator. `DBMS_RANDOM` is not intended for cryptography. ([Oracle PL/SQL DBMS_RANDOM](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_RANDOM.html#GUID-8DC48B0C-3707-4172-A306-C0308DD2EB0F))

### VALUE functions

Translation reference for DBMS_RANDOM.VALUE.

#### Description

> The basic function gets a random number, greater than or equal to 0 and less than 1. Alternatively, you can get a random Oracle number **`X`**, where **`X`** is greater than or equal to `low` and less than `high`. ([Oracle PL/SQL DBMS_RANDOM.VALUE](https://docs.oracle.com/en/database/oracle/oracle-database/21/arpls/DBMS_RANDOM.html#GUID-AAD9E936-D74F-440D-9E16-24F3F0DE8D31))

This UDF is implemented using the [Math.random](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/random) function of Javascript to replicate the functionality of Oracle DBMS_RANDOM.VALUE function.

#### Syntax

```sql
DBMS_RANDOM.VALUE()
    RETURN NUMBER;

DBMS_RANDOM.VALUE(
    low NUMBER,
    high NUMBER)
    RETURN NUMBER;
```

#### Custom UDF overloads

##### Setup data

The `DBMS_RANDOM` schema must be created.

```sql
CREATE SCHEMA IF NOT EXISTS DBMS_RANDOM
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}';
```

##### DBMS_RANDOM.VALUE()

###### **Parameters**

* No parameters.

```sql
CREATE OR REPLACE FUNCTION DBMS_RANDOM.VALUE_UDF()
RETURNS DOUBLE
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
  return Math.random();
$$;
```

> **Note:**
>
> **Note:** The UDF only supports approximately between 9 and 10 digits in the decimal part of the number (9 or 10 digits of precision)

##### Usage example

###### Oracle

```sql
SELECT DBMS_RANDOM.VALUE() FROM DUAL;
```

###### Result

```sql
|DBMS_RANDOM.VALUE()                         |
|--------------------------------------------|
|0.47337471168356406022193430290380483126    |
```

> **Note:**
>
> The function can be called either_`DBMS_RANDOM.VALUE()`_ or *`DBMS_RANDOM.VALUE.`*

###### Snowflake

```sql
SELECT
--** SSC-FDM-OR0033 - DBMS_RANDOM.VALUE DIGITS OF PRECISION ARE LOWER IN SNOWFLAKE **
DBMS_RANDOM.VALUE_UDF() FROM DUAL;
```

###### Result

```sql
|DBMS_RANDOM.VALUE() |
|--------------------|
|0.1014560867        |
```

> **Note:**
>
> In Snowflake, you must put the parentheses.

**DBMS_RANDOM.VALUE(NUMBER, NUMBER)**

###### **Parameters**

* **low**: The lowest `NUMBER` from which a random number is generated. The number generated is greater than or equal to `low`.
* **high**: The highest `NUMBER` used as a limit when generating a random number. The number generated will be less than `high`.

```sql
CREATE OR REPLACE FUNCTION DBMS_RANDOM.VALUE_UDF(low double, high double)
RETURNS DOUBLE
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    if (LOW > HIGH) {
        [LOW, HIGH] = [HIGH, LOW];
    }

    const MAX_DECIMAL_DIGITS = 38;
    return (Math.random() * (HIGH - LOW) + LOW).toFixed(MAX_DECIMAL_DIGITS);
$$;
```

> **Note:**
>
> * The Oracle DBMS_RANDOM.VALUE(low, high) function does not require parameters to have a specific order so the Snowflake UDF is implemented to support this feature by always taking out the highest and lowest number.
> * The UDF only supports approximately between 9 and 10 digits in the decimal part of the number (9 or 10 digits of precision).

##### Usage example

###### Oracle

```sql
SELECT DBMS_RANDOM.VALUE(-10,30) FROM DUAL;
```

###### Result

```sql
|DBMS_RANDOM.VALUE(-10,30)                   |
|--------------------------------------------|
|16.0298681859960167648070354679783928085    |
```

###### Snowflake

```sql
SELECT
--** SSC-FDM-OR0033 - DBMS_RANDOM.VALUE DIGITS OF PRECISION ARE LOWER IN SNOWFLAKE **
DBMS_RANDOM.VALUE_UDF(-10,30) FROM DUAL;
```

###### Result

```sql
|DBMS_RANDOM.VALUE(-10,30)   |
|----------------------------|
|-6.346055187                |
```

#### Known Issues

No issues were found.

#### Related EWIs

1. [SSC-FDM-OR0033](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_RANDOM.VALUE Built-In Package precision is lower in Snowflake.

---
title: SnowConvert AI - Oracle - COLLECTIONS AND RECORDS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/collections-and-records.md
section: Migrations
---

# SnowConvert AI - Oracle - COLLECTIONS AND RECORDS

Translation reference to convert Oracle COLLECTIONS and RECORDS to Snowflake Scripting

> **Warning:**
>
> This section is a work in progress, information may change in the future.

## General Description

> PL/SQL lets you define two kinds of composite data types: collection and record, where composite is a data type that stores values that have internal components.
>
> In a collection, the internal components always have the same data type, and are called elements.
>
> In a record, the internal components can have different data types, and are called fields. ([Oracle PL/SQL Language Reference COLLECTIONS AND RECORDS](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-7115C8B6-62F9-496D-BEC3-F7441DFE148A))

> **Note:**
>
> Please take into account the [CREATE TYPE statement translation reference](../sql-translation-reference/create_type.md) since some workarounds can overlap and may be functional in both scenarios.

## Limitations

Snowflake doesn’t support user-defined data types, which includes PL Collections and Records, according to its online documentation [Unsupported Data Types](https://docs.snowflake.com/en/sql-reference/data-types-unsupported.html), but it supports [Semi-structured Data Types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html), which can be used to mimic both the hierarchy-like structure of Record and the element structure of Collection User-defined types. For this reason, there are multiple types of features that have no workaround.

Following are the features for which **NO** workaround is proposed:

### Variable size cannot exceed 16MB

Snowflake sets VARIANT, OBJECT, and ARRAY’s maximum size on 16MBs. This means that if a Record, a Collection, or any element of either exceeds this size it will cause a Runtime Error.

### Varray capacity cannot be limited

Oracle’s varrays offer the capacity to limit the number of elements within them. This is not supported by Snowflake.

## Proposed Workaround

### About Record types definition

The proposed workaround is to use an “OBJECT” semi-structured data type to mimic Oracle’s data type.

### About Collection types definition

There are two different workarounds that depend on the type of collection to be migrated:

* Associative Arrays are proposed to be changed into an “OBJECT” semi-structured data type.
* Varrays and Nested Table Arrays are proposed to be changed into an “ARRAY” semi-structured data type.

## Current SnowConvert AI Support

The next table shows a summary of the current support provided by the SnowConvert AI tool. Please keep in mind that translations may still not be final, and more work may be needed.

| Sub-Feature | Current recognition status | Current translation status | Has Known Workarounds |
| --- | --- | --- | --- |
| Record Type Definitions | Recognized. | Not Translated. | Yes. |
| Associative Array Type Definitions | Not Recognized. | Not Translated. | Yes. |
| Varray Type Definitions | Recognized. | Not Translated. | Yes. |
| Nested Table Array Type Definitions | Recognized. | Not Translated. | Yes. |

## Known Issues

### 1. Associate Arrays are considered a Nested Table

As of now, SnowConvert AI doesn’t differentiate between an Associative Array and a Nested Table meaning they are mixed up in the same assessment counts.

## Related EWIs

No related EWIs.

## Associative Array Type Definition

This is a translation reference to convert the Oracle Associative Array Declaration to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> An associative array (formerly called PL/SQL table or index-by table) is a set of key-value pairs. Each key is a unique index, used to locate the associated value with the syntax `variable_name(index)`.
>
> The data type of `index` can be either a string type (`VARCHAR2`, `VARCHAR`, `STRING`, or `LONG`) or `PLS_INTEGER`. Indexes are stored in sort order, not creation order. For string types, sort order is determined by the initialization parameters `NLS_SORT` and `NLS_COMP`.
>
> ([Oracle PL/SQL Language Reference ASSOCIATIVE ARRAYS](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-8060F01F-B53B-48D4-9239-7EA8461C2170))

> **Warning:**
>
> Not to be confused with the PL/SQL NESTED TABLE Type definition.

For the translation, the type definition is replaced by an OBJECT [Semi-structured Data Type](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) and then its usages are changed accordingly across any operations.

To define an Associative Array type, the syntax is as follows:

```sql
type_definition := TYPE IS TABLE OF datatype INDEX BY indexing_datatype;

indexing_datatype := { PLS_INTEGER
                     | BINARY_INTEGER
                     | string_datatype
                     }
```

To declare a variable of this type:

```none
variable_name collection_type;
```

### Sample Source Patterns

#### Varchar-indexed Associative Array

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE associative_array
IS
    TYPE associate_array_typ IS TABLE OF INTEGER
        INDEX BY VARCHAR2(50);

    associate_array associate_array_typ := associate_array_typ();
    associate_index VARCHAR2(50);
BEGIN
    associate_array('abc') := 1;
    associate_array('bca') := 2;
    associate_array('def') := 3;

    DBMS_OUTPUT.PUT_LINE(associate_array('abc'));
    associate_array('abc') := 4;
    --THROWS 'NO DATA FOUND'
    --DBMS_OUTPUT.PUT_LINE(associate_array('no exists'));

    DBMS_OUTPUT.PUT_LINE(associate_array.COUNT);

    associate_index := associate_array.FIRST;
    WHILE associate_index IS NOT NULL
    LOOP
        DBMS_OUTPUT.PUT_LINE(associate_array(associate_index));
        associate_index := associate_array.NEXT(associate_index);
    END LOOP;
END;

CALL associative_array();
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| 3 |
| 4 |
| 2 |
| 3 |

##### Snowflake

Please note the ‘true’ parameter in the OBJECT_INSERT. This is so that the element is updated if it is already present in the array.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.associative_array ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
      associate_array OBJECT := OBJECT_CONSTRUCT();
      associate_index VARCHAR(50);
   BEGIN
      associate_array := OBJECT_INSERT(associate_array, 'abc', 1, true);
      associate_array := OBJECT_INSERT(associate_array, 'bca', 2, true);
      associate_array := OBJECT_INSERT(associate_array, 'def', 3, true);

      CALL DBMS_OUTPUT.PUT_LINE(:associate_array['abc']);
      CALL DBMS_OUTPUT.PUT_LINE(:associate_array['not found']);

      associate_array := OBJECT_INSERT(:associate_array, 'abc', 4, true);

      CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(OBJECT_KEYS(:associate_array)));

      FOR i IN 1 TO ARRAY_SIZE(OBJECT_KEYS(:associate_array))
      LOOP
         associate_index := OBJECT_KEYS(:associate_array)[:i-1];
         CALL DBMS_OUTPUT.PUT_LINE(:associate_array[:associate_index]);
      END LOOP;
   END;
$$;

CALL PUBLIC.associative_array();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| 3 |
| 4 |
| 2 |
| 3 |

#### Numeric-indexed Associative Array

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE numeric_associative_array
IS
    TYPE numeric_associative_array_typ IS TABLE OF INTEGER
        INDEX BY PLS_INTEGER;

    associate_array numeric_associativ
    e_array_typ := numeric_associative_array_typ();
    associate_index PLS_INTEGER;
BEGIN
    associate_array(1) := -1;
    associate_array(2) := -2;
    associate_array(3) := -3;

    DBMS_OUTPUT.PUT_LINE(associate_array(1));
    associate_array(1) := -4;

    DBMS_OUTPUT.PUT_LINE(associate_array.COUNT);

    associate_index := associate_array.FIRST;
    WHILE associate_index IS NOT NULL
    LOOP
        DBMS_OUTPUT.PUT_LINE(associate_array(associate_index));
        associate_index := associate_array.NEXT(associate_index);
    END LOOP;
END;

CALL numeric_associative_array();
```

##### Result

| DBMS OUTPUT |
| --- |
| -1 |
| 3 |
| -4 |
| -2 |
| -3 |

##### Snowflake

Please note that the numeric value is converted to varchar accordingly when the operation needs it. Additionally, note the ‘true’ parameter in the OBJECT_INSERT. This is so that the element is updated if it is already present in the array.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.numeric_associative_array ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
      associate_array OBJECT := OBJECT_CONSTRUCT();
      associate_index NUMBER;
   BEGIN
      associate_array := OBJECT_INSERT(associate_array, '1', -1, true);
      associate_array := OBJECT_INSERT(associate_array, '2', -2, true);
      associate_array := OBJECT_INSERT(associate_array, '3', -3, true);

      CALL DBMS_OUTPUT.PUT_LINE(:associate_array['1']);

      associate_array := OBJECT_INSERT(:associate_array, '1', -4, true);

      CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(OBJECT_KEYS(:associate_array)));

      FOR i IN 1 TO ARRAY_SIZE(OBJECT_KEYS(:associate_array))
      LOOP
         associate_index := OBJECT_KEYS(:associate_array)[:i-1];
         CALL DBMS_OUTPUT.PUT_LINE(:associate_array[:associate_index::VARCHAR]);
      END LOOP;
   END;
$$;

CALL PUBLIC.numeric_associative_array();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| -1 |
| 3 |
| -4 |
| -2 |
| -3 |

#### Record-element Numeric-indexed Associative Array

In this case, the associative array is composed of a Record-structure, and this structure needs to be preserved. For this purpose, further operations on insertions were added.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE record_associative_array
IS
    TYPE record_typ IS RECORD(col1 INTEGER);
    TYPE record_associative_array_typ IS TABLE OF record_typ
        INDEX BY PLS_INTEGER;

    associate_array record_associati ve_array_typ := record_associative_array_typ();
    associate_index PLS_INTEGER;
BEGIN
    associate_array(1).col1 := -1;
    associate_array(2).col1 := -2;
    associate_array(3).col1 := -3;

    DBMS_OUTPUT.PUT_LINE(associate_array(1).col1);
    associate_array(4).col1 := -4;

    DBMS_OUTPUT.PUT_LINE(associate_array.COUNT);

    associate_index := associate_array.FIRST;
    WHILE associate_index IS NOT NULL
    LOOP
        DBMS_OUTPUT.PUT_LINE(associate_array(associate_index).col1);
        associate_index := associate_array.NEXT(associate_index);
    END LOOP;
END;
/

CALL record_associative_array();
```

##### Result

| DBMS OUTPUT |
| --- |
| -1 |
| 3 |
| -4 |
| -2 |
| -3 |

##### Snowflake

In this scenario, the insertion/update assumes an automatic creation of the record within the associative array and this needs to be taken into account when creating new records.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.record_associative_array ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
      associate_array OBJECT := OBJECT_CONSTRUCT();
      associate_index NUMBER;
   BEGIN
      associate_array := OBJECT_INSERT(associate_array, '1', OBJECT_INSERT(NVL(associate_array['1'], OBJECT_CONSTRUCT()), 'col1', -1, true), true);
      associate_array := OBJECT_INSERT(associate_array, '2', OBJECT_INSERT(NVL(associate_array['2'], OBJECT_CONSTRUCT()), 'col1', -2, true), true);
      associate_array := OBJECT_INSERT(associate_array, '3', OBJECT_INSERT(NVL(associate_array['3'], OBJECT_CONSTRUCT()), 'col1', -3, true), true);

      CALL DBMS_OUTPUT.PUT_LINE(:associate_array['1']:col1);

      associate_array := OBJECT_INSERT(associate_array, '1', OBJECT_INSERT(NVL(associate_array['1'], OBJECT_CONSTRUCT()), 'col1', -4, true), true);

      CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(OBJECT_KEYS(:associate_array)));

      FOR i IN 1 TO ARRAY_SIZE(OBJECT_KEYS(:associate_array))
      LOOP
         associate_index := OBJECT_KEYS(:associate_array)[:i-1];
         CALL DBMS_OUTPUT.PUT_LINE(:associate_array[:associate_index::VARCHAR]:col1);
      END LOOP;
   END;
$$;

CALL PUBLIC.record_associative_array();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| -1 |
| 3 |
| -4 |
| -2 |
| -3 |

### Known Issues

#### 1. They are currently not being recognized

SnowConvert AI treats these collections as Nested Table Arrays. There is a work item to fix this.

### Related EWIs

No related EWIs.

## Collection Methods

This is a translation reference to convert the Oracle Collection Methods to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A collection method is a PL/SQL subprogram—either a function that returns information about a collection or a procedure that operates on a collection. Collection methods make collections easier to use and your applications easier to maintain.
>
> ([Oracle PL/SQL Language Reference COLLECTION METHODS](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-0452FBDC-D9C1-486E-B432-49AF84743A9F))

Some of these methods can be mapped to native Snowflake semi-structured operations. The ones that can’t or have differences will be mapped to a UDF implementation.

### Current SnowConvert AI Support

The next table shows a summary of the current support provided by the SnowConvert AI tool. Please keep in mind that translations may still not be final, and more work may be needed.

| Method | Current recognition status | Current translation status | Mapped to |
| --- | --- | --- | --- |
| DELETE | Not Recognized. | Not Translated. | UDF |
| TRIM | Not Recognized. | Not Translated. | UDF (To be defined) |
| EXTEND | Not Recognized. | Not Translated. | UDF |
| EXISTS | Not Recognized. | Not Translated. | [ARRAY_CONTAINS](https://docs.snowflake.com/en/sql-reference/functions/array_contains.html) |
| FIRST | Not Recognized. | Not Translated. | UDF |
| LAST | Not Recognized. | Not Translated. | UDF |
| COUNT | Not Recognized. | Not Translated. | [ARRAY_SIZE](https://docs.snowflake.com/en/sql-reference/functions/array_size.html) |
| LIMIT | Not Recognized. | Not Translated. | Not Supported. |
| PRIOR | Not Recognized. | Not Translated. | UDF (To be defined) |
| NEXT | Not Recognized. | Not Translated. | UDF (To be defined) |

### Sample Source Patterns

#### COUNT

This method returns the count of “non-undefined” (not to be confused with null) elements within a collection (nested tables can become sparse leaving these elements in between). In associative arrays, it returns the number of keys in the array.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_count
IS
    TYPE varray_typ IS VARRAY(5) OF INTEGER;
    TYPE nt_typ IS TABLE OF INTEGER;
    TYPE aa_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(20);

    associative_array aa_typ := aa_typ('abc'=>1, 'bca'=>1);
    varray_variable varray_typ := varray_typ(1, 2, 3);
    nt_variable nt_typ := nt_typ(1, 2, 3, 4);
BEGIN
    DBMS_OUTPUT.PUT_LINE(associative_array.COUNT);
    DBMS_OUTPUT.PUT_LINE(varray_variable.COUNT);
    DBMS_OUTPUT.PUT_LINE(nt_variable.COUNT);
END;

CALL collection_count();
```

##### Result

| DBMS OUTPUT |
| --- |
| 2 |
| 3 |
| 4 |

##### Snowflake

The snowflake equivalent is the [ARRAY_SIZE](https://docs.snowflake.com/en/sql-reference/functions/array_size.html) method.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.collection_count()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    associative_array OBJECT := OBJECT_CONSTRUCT('abc', 1, 'bca', 1);
    varray_variable ARRAY := ARRAY_CONSTRUCT(1, 2, 3);
    nt_variable ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
BEGIN
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(OBJECT_KEYS(:associative_array)));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(:varray_variable));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(:nt_variable));
END;
$$;

CALL PUBLIC.collection_count();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| 2 |
| 3 |
| 4 |

#### EXISTS

This method returns true if the given element is contained within the collection. In associative arrays, it tests if the key is contained.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_exists
IS
    TYPE nt_typ IS TABLE OF INTEGER;
    TYPE aa_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(20);

    associative_array aa_typ := aa_typ('abc'=>1, 'bca'=>1);
    nt_variable nt_typ := nt_typ(1, 2, 3, 4);
BEGIN
    IF associative_array.EXISTS('abc')
    THEN DBMS_OUTPUT.PUT_LINE('Found');
    END IF;

    IF NOT associative_array.EXISTS('not found')
    THEN DBMS_OUTPUT.PUT_LINE('Not found');
    END IF;

    IF nt_variable.EXISTS(1)
    THEN DBMS_OUTPUT.PUT_LINE('Found');
    END IF;

    IF NOT nt_variable.EXISTS(5)
    THEN DBMS_OUTPUT.PUT_LINE('Not found');
    END IF;
END;
/

CALL collection_exists();
```

##### Result

| DBMS OUTPUT |
| --- |
| 2 |
| 3 |
| 4 |

##### Snowflake

The snowflake equivalent is the [ARRAY_CONTAINS](https://docs.snowflake.com/en/sql-reference/functions/array_contains.html) method. Note that, when using Varchar elements, casting to Variant is necessary.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.collection_exists()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    associative_array OBJECT := OBJECT_CONSTRUCT('abc', 1, 'bca', 1);
    nt_variable ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
BEGIN
    IF (ARRAY_CONTAINS('abc'::VARIANT, OBJECT_KEYS(associative_array)))
    THEN CALL DBMS_OUTPUT.PUT_LINE('Found');
    END IF;

    IF (NOT ARRAY_CONTAINS('not found'::VARIANT, OBJECT_KEYS(associative_array)))
    THEN CALL DBMS_OUTPUT.PUT_LINE('Not found');
    END IF;

    IF (ARRAY_CONTAINS(1, nt_variable))
    THEN CALL DBMS_OUTPUT.PUT_LINE('Found');
    END IF;

    IF (NOT ARRAY_CONTAINS(5, nt_variable))
    THEN CALL DBMS_OUTPUT.PUT_LINE('Not found');
    END IF;
END;
$$;

CALL PUBLIC.collection_exists();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| 2 |
| 3 |
| 4 |

#### FIRST/LAST

These two methods return the First/Last element of the collection, respectively. If the collection is empty it returns null. This operation is mapped to a UDF, which will be added in further revisions.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_first_last
IS
    TYPE nt_typ IS TABLE OF INTEGER;
    TYPE aa_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(20);

    associative_array aa_typ := aa_typ('abc'=>1, 'bca'=>1);
    nt_variable nt_typ := nt_typ();
BEGIN
    DBMS_OUTPUT.PUT_LINE(associative_array.FIRST);
    DBMS_OUTPUT.PUT_LINE(associative_array.LAST);

    DBMS_OUTPUT.PUT_LINE(nt_variable.FIRST);
    DBMS_OUTPUT.PUT_LINE(nt_variable.LAST);
    nt_variable := nt_typ(1, 2, 3, 4);
    DBMS_OUTPUT.PUT_LINE(nt_variable.FIRST);
    DBMS_OUTPUT.PUT_LINE(nt_variable.LAST);
END;
/

CALL collection_first_last();
```

##### Result

| DBMS OUTPUT |
| --- |
| abc |
| bca |
| –These empty spaces are due to it evaluating to null |
|  |
| 1 |
| 4 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.collection_first_last()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    associative_array OBJECT := OBJECT_CONSTRUCT('abc', 1, 'bca', 1);
    nt_variable ARRAY := ARRAY_CONSTRUCT();
BEGIN
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_FIRST(:associative_array));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_LAST(:associative_array));

    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_FIRST(:nt_variable));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_LAST(:nt_variable));
    nt_variable := ARRAY_CONSTRUCT(1, 2, 3, 4);
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_FIRST(:nt_variable));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_LAST(:nt_variable));
END;
$$;

CALL PUBLIC.collection_first_last();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### UDFs

```sql
CREATE OR REPLACE FUNCTION ARRAY_FIRST(array_variable VARIANT)
RETURNS VARIANT
LANGUAGE SQL
AS
$$
    IFF (IS_OBJECT(array_variable),
        ARRAY_FIRST(OBJECT_KEYS(array_variable)),
        IFF (ARRAY_SIZE(array_variable) = 0, null, array_variable[0]))
$$;

CREATE OR REPLACE FUNCTION ARRAY_LAST(array_variable VARIANT)
RETURNS VARIANT
LANGUAGE SQL
AS
$$
    IFF (IS_OBJECT(array_variable),
        ARRAY_LAST(OBJECT_KEYS(array_variable)),
        IFF (ARRAY_SIZE(array_variable) = 0, null, array_variable[ARRAY_SIZE(array_variable)-1]))
$$;
```

##### Result

| DBMS OUTPUT |
| --- |
| abc |
| bca |
| –These empty spaces are due to it evaluating to null |
|  |
| 1 |
| 4 |

#### DELETE

This method is used to remove elements from a Collection. It has three possible variants:

* .DELETE removes all elements.
* .DELETE(n) removes the element whose index matches ‘n’.
* .DELETE(n, m) removes in the indexes from ‘n’ through ‘m’.

> **Note:**
>
> In Oracle, using this operation on Nested Tables causes it to have “undefined” elements within it due to them being sparse.

> **Warning:**
>
> Please note that the second and third versions do not apply to Varrays.

##### Oracle

For the sake of simplicity, this sample only checks on the number of elements but may be modified to display the contents of each collection.

```sql
CREATE OR REPLACE PROCEDURE collection_delete
IS
    TYPE varray_typ IS VARRAY(5) OF INTEGER;
    TYPE nt_typ IS TABLE OF INTEGER;
    TYPE aa_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(20);

    associative_array1 aa_typ := aa_typ('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);
    associative_array2 aa_typ := aa_typ('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);
    associative_array3 aa_typ := aa_typ('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);

    varray_variable1 varray_typ := varray_typ(1, 2, 3, 4);

    nt_variable1 nt_typ := nt_typ(1, 2, 3, 4);
    nt_variable2 nt_typ := nt_typ(1, 2, 3, 4);
    nt_variable3 nt_typ := nt_typ(1, 2, 3, 4);
BEGIN
    varray_variable1.DELETE;--delete everything

    nt_variable1.DELETE;--delete everything
    nt_variable2.DELETE(2);--delete second position
    nt_variable3.DELETE(2, 3);--delete range

    associative_array1.DELETE;--delete everything
    associative_array2.DELETE('def');--delete second position
    associative_array3.DELETE('def', 'jkl');--delete range

    DBMS_OUTPUT.PUT_LINE(varray_variable1.COUNT);
    DBMS_OUTPUT.PUT_LINE(nt_variable1.COUNT);
    DBMS_OUTPUT.PUT_LINE(nt_variable2.COUNT);
    DBMS_OUTPUT.PUT_LINE(nt_variable3.COUNT);

    DBMS_OUTPUT.PUT_LINE(associative_array1.COUNT);
    DBMS_OUTPUT.PUT_LINE(associative_array2.COUNT);
    DBMS_OUTPUT.PUT_LINE(associative_array3.COUNT);
END;
/

CALL collection_delete();
```

##### Result

| DBMS OUTPUT |
| --- |
| 0 |
| 0 |
| 3 |
| 2 |
| 0 |
| 3 |
| 1 |

##### Snowflake

Snowflake does not support deletions from an existing ARRAY and for this reason, the only offered workaround is to rebuild a new ARRAY depending on the original parameters of the DELETE.

> **Note:**
>
> Note that a UDF was added to implement the functionality for the update of the element.
>
> This UDF will be added in later revisions.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.collection_delete()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    associative_array1 OBJECT := OBJECT_CONSTRUCT('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);
    associative_array2 OBJECT := OBJECT_CONSTRUCT('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);
    associative_array3 OBJECT := OBJECT_CONSTRUCT('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);

    varray_variable1 ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);

    nt_variable1 ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
    nt_variable2 ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
    nt_variable3 ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
BEGIN
    varray_variable1 := ARRAY_CONSTRUCT();--delete everything

    nt_variable1 := ARRAY_CONSTRUCT();--delete everything
    nt_variable2 := ARRAY_DELETE_UDF(nt_variable2, 2);--delete second position
    nt_variable3 := ARRAY_DELETE_UDF(nt_variable3, 2, 3);--delete range

    associative_array1 := OBJECT_CONSTRUCT();--delete everything
    associative_array2 := ASSOCIATIVE_ARRAY_DELETE_UDF('def');--delete second position
    associative_array3 := ASSOCIATIVE_ARRAY_DELETE_UDF('def', 'jkl');--delete range

    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(varray_variable1));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(nt_variable1);
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(nt_variable2);
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(nt_variable3);

    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(associative_array1));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(associative_array2));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(associative_array3));
END;
$$;

CALL PUBLIC.collection_first_last();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| 0 |
| 0 |
| 3 |
| 2 |
| 0 |
| 3 |
| 1 |

#### EXTEND

This method is used to append new elements to a Nested Table or a Varray. It has three possible variants:

* .EXTEND inserts a null element.
* .EXTEND(n) inserts ‘n’ null elements.
* .EXTEND(n, i) inserts ‘n’ copies of the element at ‘i’.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_extend
IS
    TYPE varray_typ IS VARRAY(5) OF INTEGER;
    TYPE nt_typ IS TABLE OF INTEGER;

    nt_variable1 nt_typ := nt_typ(1, 2, 3, 4);
    varray_variable1 varray_typ := varray_typ(1, 2, 3);
    varray_variable2 varray_typ := varray_typ(1, 2, 3);
BEGIN
    nt_variable1.EXTEND;
    varray_variable1.EXTEND(2);
    varray_variable2.EXTEND(2, 1);

    DBMS_OUTPUT.PUT_LINE(nt_variable1.COUNT);
    DBMS_OUTPUT.PUT_LINE(varray_variable1.COUNT);
    DBMS_OUTPUT.PUT_LINE(varray_variable2.COUNT);
END;
/

CALL collection_extend();
```

##### Result

| DBMS OUTPUT |
| --- |
| 5 |
| 5 |
| 5 |

##### Snowflake

> **Note:**
>
> Note that a UDF was added to implement the functionality for the update of the element.
>
> This UDF will be added in later revisions.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.collection_first_last()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    nt_variable1 ARRAY := ARRAY_CONSTRUCT(1, 2, 3, 4);
    varray_variable1 ARRAY := ARRAY_CONSTRUCT(1, 2, 3);
    varray_variable2 ARRAY := ARRAY_CONSTRUCT(1, 2, 3);
BEGIN
    nt_variable1 := ARRAY_EXTEND_UDF(nt_variable);
    varray_variable1 := ARRAY_EXTEND_UDF(varray_variable1, 2);
    varray_variable2 := ARRAY_EXTEND_UDF(varray_variable2, 2, 1);

    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(nt_variable1);
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(varray_variable1));
    CALL DBMS_OUTPUT.PUT_LINE(ARRAY_SIZE(varray_variable2));
END;
$$;

CALL PUBLIC.collection_first_last();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### Result

| DBMS OUTPUT |
| --- |
| 5 |
| 5 |
| 5 |

#### TRIM

This method is used to remove the last elements from a Nested Table or a Varray. It has two possible variants:

* .TRIM removes the last element.
* .TRIM(n) removes the last ‘n’ elements.

> **Note:**
>
> This functionality may be implemented using [ARRAY_SLICE](https://docs.snowflake.com/en/sql-reference/functions/array_slice.html)

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_trim
IS
    TYPE varray_typ IS VARRAY(5) OF INTEGER;
    TYPE nt_typ IS TABLE OF INTEGER;

    varray_variable1 varray_typ := varray_typ(1, 2, 3);
    nt_variable1 nt_typ := nt_typ(1, 2, 3, 4);
BEGIN
    varray_variable1.TRIM;
    nt_variable1.TRIM(2);

    DBMS_OUTPUT.PUT_LINE(nt_variable1.COUNT);
    DBMS_OUTPUT.PUT_LINE(varray_variable1.COUNT);
END;
/

CALL collection_trim();
```

##### Result

```none
DBMS OUTPUT
-----------
2
2
```

#### LIMIT

This method returns the maximum limit of a Varray.

> **Danger:**
>
> This method is not supported in Snowflake.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_limit
IS
    TYPE varray_typ1 IS VARRAY(5) OF INTEGER;
    TYPE varray_typ2 IS VARRAY(6) OF INTEGER;

    varray_variable1 varray_typ1 := varray_typ1(1, 2, 3);
    varray_variable2 varray_typ2 := varray_typ2(1, 2, 3, 4);
BEGIN
    DBMS_OUTPUT.PUT_LINE(varray_variable1.LIMIT);
    DBMS_OUTPUT.PUT_LINE(varray_variable2.LIMIT);
END;
/

CALL collection_limit();
```

##### Result

| DBMS OUTPUT |
| --- |
| 5 |
| 6 |

#### PRIOR/NEXT

This method returns the prior/next index, given an index. If there is not a prior/next then it returns null. It is most frequently used to traverse a collection.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE collection_prior_next
IS
    TYPE varray_typ1 IS VARRAY(5) OF INTEGER;
    TYPE aa_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(20);

    varray_variable1 varray_typ1 := varray_typ1(-1, -2, -3);
    associative_array1 aa_typ := aa_typ('abc'=>1, 'def'=>2, 'ghi'=>3, 'jkl'=>4);
BEGIN
    DBMS_OUTPUT.PUT_LINE(varray_variable1.PRIOR(1));
    DBMS_OUTPUT.PUT_LINE(varray_variable1.PRIOR(2));
    DBMS_OUTPUT.PUT_LINE(varray_variable1.NEXT(2));
    DBMS_OUTPUT.PUT_LINE(varray_variable1.NEXT(3));

    DBMS_OUTPUT.PUT_LINE(associative_array1.PRIOR('abc'));
    DBMS_OUTPUT.PUT_LINE(associative_array1.PRIOR('def'));
    DBMS_OUTPUT.PUT_LINE(associative_array1.NEXT('ghi'));
    DBMS_OUTPUT.PUT_LINE(associative_array1.NEXT('jkl'));
    DBMS_OUTPUT.PUT_LINE(associative_array1.PRIOR('not found'));
END;
/

CALL collection_prior_next();
```

##### Result

| DBMS OUTPUT |
| --- |
| – Empty spaces are due to null results |
| 1 |
| 3 |
|  |
|  |
| abc |
| jkl |
|  |
| jkl |

### Known Issues

#### 1. Limit method is not supported in Snowflake

Snowflake does not have support for limited-space varrays. For this reason, this method is not supported.

### Related EWIs

No EWIs related.

## Nested Table Array Type Definition

This is a translation reference to convert the Oracle Nested Table Array Declaration to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future.

> **Note:**
>
> This section is for the PL/SQL Version of the Nested Table Arrays, for the Standalone Version please see [Nested Table Type Definition](../sql-translation-reference/create_type.md).

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> In the database, a nested table is a column type that stores an unspecified number of rows in no particular order.
>
> When you retrieve a nested table value from the database into a PL/SQL nested table variable, PL/SQL gives the rows consecutive indexes, starting at 1. Using these indexes, you can access the individual rows of the nested table variable. The syntax is `variable_name(index)`. The indexes and row order of a nested table might not remain stable as you store and retrieve the nested table from the database.
>
> ([Oracle PL/SQL Language Reference NESTED TABLES](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-5ADB7EE2-71F6-4172-ACD8-FFDCF2787A37))

For the translation, the type definition is replaced by an ARRAY [Semi-structured Data Type](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) and then its usages are changed accordingly across any operations. Please note how the translation for Nested Tables and Varrays are the same.

To define a Nested Table Array type, the syntax is as follows:

```none
type_definition := TYPE IS TABLE OF datatype;
```

To declare a variable of this type:

```none
variable_name collection_type;
```

### Sample Source Patterns

#### Nested Table Array definitions

This illustrates how to create different nested table arrays, and how to migrate the definitions for the variables.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE nested_table_procedure
IS
    TYPE nested_table_array_typ IS TABLE OF INTEGER;
    TYPE nested_table_array_typ2 IS TABLE OF DATE;

    nested_table_array nested_table_array_typ;
    nested_table_array2 nested_table_array_typ2;
BEGIN
    NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE nested_table_procedure()
RETURNS INTEGER
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    -- NO LONGER NEEDED
    /*
    TYPE associative_array_typ IS TABLE OF INTEGER INDEX BY VARCHAR2(30);
    TYPE associative_array_typ2 IS TABLE OF INTEGER INDEX BY PLS_INTEGER;
    */

    associative_array ARRAY;
    associative_array2 ARRAY;
BEGIN
    NULL;
END;
$$;
```

#### Nested Table iteration

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE nested_table_iteration
IS
    TYPE nested_table_typ IS TABLE OF INTEGER;
    nested_table_variable nested_table_typ := nested_table_typ (10, 20, 30);
BEGIN
    FOR i IN 1..nested_table_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(nested_table_variable(i));
    END LOOP;

    nested_table_variable (1) := 40;

    FOR i IN 1..nested_table_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(nested_table_variable(i));
    END LOOP;
END;
/

CALL nested_table_iteration();
```

##### Result

| DBMS OUTPUT |
| --- |
| 10 |
| 20 |
| 30 |
| 40 |
| 20 |
| 30 |

##### Snowflake

> **Note:**
>
> Note that a UDF was added to implement the functionality for the update of the element.
>
> This UDF will be added in later revisions.

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.nested_table_iteration()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
DECLARE
    nested_table_variable ARRAY := ARRAY_CONSTRUCT(10, 20, 30);
BEGIN
    FOR i IN 1 TO ARRAY_SIZE(nested_table_variable)
    LOOP
        CALL DBMS_OUTPUT.PUT_LINE(:nested_table_variable[:i-1]);
    END LOOP;

    nested_table_variable:= INSERT_REPLACE_COLLECTION_ELEMENT_UDF(nested_table_variable, 1, 40);

    FOR i IN 1 TO ARRAY_SIZE(nested_table_variable)
    LOOP
        CALL DBMS_OUTPUT.PUT_LINE(:nested_table_variable[:i-1]);
    END LOOP;
END;
$$;

CALL PUBLIC.nested_table_iteration();
SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG;
```

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.INSERT_REPLACE_COLLECTION_ELEMENT_UDF(varray ARRAY, position INTEGER, newValue VARIANT)
RETURNS ARRAY
LANGUAGE SQL
AS
$$
    ARRAY_CAT(
        ARRAY_APPEND(ARRAY_SLICE(varray, 0, (position)-1), newValue),
        ARRAY_SLICE(varray, position, ARRAY_SIZE(varray)))
$$;
```

##### Result

| DBMS OUTPUT |
| --- |
| 10 |
| 20 |
| 30 |
| 40 |
| 20 |
| 30 |

### Known Issues

#### 1. They are currently not being converted

SnowConvert AI does not support translating these elements.

##### 2. Indexing needs to be modified

Oracle’s indexes start at 1, on Snowflake they will begin at 0.

### Related EWIs

No EWIs related.

## Record Type Definition

This is a translation reference to convert the Oracle Record Declaration to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A record variable is a composite variable whose internal components, called fields, can have different data types. The value of a record variable and the values of its fields can change.
>
> You reference an entire record variable by its name. You reference a record field with the syntax `record.field`.
>
> You can create a record variable in any of these ways:
>
> * Define a record type and then declare a variable of that type.
> * Use `%ROWTYPE` to declare a record variable that represents either a full or partial row of a database table or view.
> * Use `%TYPE` to declare a record variable of the same type as a previously declared record variable.
>
> ([Oracle PL/SQL Language Reference RECORD VARIABLES](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-75875E26-FC7B-4513-A5E2-EDA26F1D67B1))

For the translation, the type definition is replaced by an OBJECT [Semi-structured Data Type](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) and then its usages are changed accordingly across any operations.

To define a Record type, the syntax is as follows:

```none
type_definition := TYPE IS RECORD ( field_definition [, field_definition...] );

field_definition := field_name datatype [ { [NOT NULL default ] | default } ]

default := [ { := | DEFAULT } expression]
```

To declare a variable of this type:

```none
variable_name { record_type
              | rowtype_attribute
              | record_variable%TYPE
              };
```

### Sample Source Patterns

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Record initialization and assignment

This sample attempts to insert two new rows using a record variable which is reassigned mid-procedure.

##### Oracle

```sql
CREATE TABLE record_table(col1 FLOAT, col2 INTEGER);

CREATE OR REPLACE PROCEDURE record_procedure
IS
    TYPE record_typ IS RECORD(col1 INTEGER, col2 FLOAT);
    record_variable record_typ := record_typ(1, 1.5);--initialization
BEGIN
    INSERT INTO record_table(col1, col2)
        VALUES (record_variable.col2, record_variable.col1);--usage

    --reassignment of properties
    record_variable.col1 := 2;
    record_variable.col2 := 2.5;

    INSERT INTO record_table(col1, col2)
        VALUES (record_variable.col2, record_variable.col1);--usage
END;

CALL record_procedure();
SELECT * FROM record_table;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 1.5 | 1 |
| 2.5 | 2 |

##### Snowflake

Notice how the reassignments are replaced by an OBJECT_INSERT that updates if the column already exists, and how the VALUES clause is replaced by a SELECT.

```sql
CREATE OR REPLACE TABLE record_table (col1 FLOAT,
    col2 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE record_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
        TYPE record_typ IS RECORD(col1 INTEGER, col2 FLOAT);
        record_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - record_typ DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT('COL1', 1, 'COL2', 1.5);--initialization

    BEGIN
        INSERT INTO record_table(col1, col2)
        SELECT
            :record_variable:COL2,
            :record_variable:COL1;--usage

        --reassignment of properties
        record_variable := OBJECT_INSERT(record_variable, 'COL1', 2, true);
        record_variable := OBJECT_INSERT(record_variable, 'COL2', 2.5, true);

        INSERT INTO record_table(col1, col2)
        SELECT
            :record_variable:COL2,
            :record_variable:COL1;--usage

    END;
$$;

CALL record_procedure();

SELECT * FROM
    record_table;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 1.5 | 1 |
| 2.5 | 2 |

#### %ROWTYPE Record and Values Record

Since the operations are the ones that define the structure, these definitions can be replaced by an OBJECT datatype, but the values of the record need to be decomposed as inserting the record “as-is” is not supported.

##### Oracle

```sql
CREATE TABLE record_table(col1 INTEGER, col2 VARCHAR2(50), col3 DATE);
CREATE OR REPLACE PROCEDURE insert_record
IS
    record_variable record_table%ROWTYPE;
BEGIN
    record_variable.col1 := 1;
    record_variable.col2 := 'Hello';
    record_variable.col3 := DATE '2020-12-25';

    INSERT INTO record_table VALUES record_variable;
END;

CALL insert_record();
SELECT * FROM record_table;
```

##### Result

| COL1 | COL2 | COL3 |
| --- | --- | --- |
| 1 | “Hello” | 25-DEC-20 |

##### Snowflake

Please note finally how the OBJECT variable needs to be initialized to add the information to it.

```sql
CREATE OR REPLACE TABLE record_table (col1 INTEGER,
    col2 VARCHAR(50),
    col3 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE insert_record ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        record_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    BEGIN
        record_variable := OBJECT_INSERT(record_variable, 'COL1', 1, true);
        record_variable := OBJECT_INSERT(record_variable, 'COL2', 'Hello', true);
        record_variable := OBJECT_INSERT(record_variable, 'COL3', DATE '2020-12-25', true);
        INSERT INTO record_table
        SELECT
            :record_variable:COL1,
            :record_variable:COL2,
            :record_variable:COL3;
    END;
$$;

CALL insert_record();

SELECT * FROM
    record_table;
```

##### Result

| COL1 | COL2 | COL3 |
| --- | --- | --- |
| 1 | “Hello” | 25-DEC-20 |

#### Fetching data into a Record

##### Oracle

```sql
CREATE TABLE record_table(col1 INTEGER, col2 VARCHAR2(50), col3 DATE);
INSERT INTO record_table(col1, col2 , col3)
    VALUES (1, 'Hello', DATE '2020-12-25');

CREATE OR REPLACE PROCEDURE load_cursor_record
IS
    CURSOR record_cursor IS
        SELECT *
        FROM record_table;

    record_variable record_cursor%ROWTYPE;
BEGIN
    OPEN record_cursor;
    LOOP
        FETCH record_cursor INTO record_variable;
        EXIT WHEN record_cursor%NOTFOUND;

        DBMS_OUTPUT.PUT_LINE(record_variable.col1);
        DBMS_OUTPUT.PUT_LINE(record_variable.col2);
        DBMS_OUTPUT.PUT_LINE(record_variable.col3);
    END LOOP;
    CLOSE record_cursor;
END;

CALL load_cursor_record();
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| Hello |
| 25-DEC-20 |

##### Snowflake

Please note the additional OBJECT_CONSTRUCT in the Cursor definition, this is what allows to extract an OBJECT, which then can be used to seamlessly migrate the FETCH statement.

```sql
CREATE OR REPLACE TABLE record_table (col1 INTEGER,
    col2 VARCHAR(50),
    col3 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO record_table(col1, col2 , col3)
    VALUES (1, 'Hello', DATE '2020-12-25');

CREATE OR REPLACE PROCEDURE load_cursor_record ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        record_cursor CURSOR
        FOR
            SELECT
                OBJECT_CONSTRUCT( *) sc_cursor_record
            FROM
                record_table;
    record_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    BEGIN
        OPEN record_cursor;
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        FETCH record_cursor INTO
                :record_variable;
        IF (record_variable IS NULL) THEN
                EXIT;
        END IF;
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF(:record_variable:COL1);
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF(:record_variable:COL2);
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF(:record_variable:COL3::DATE);
    END LOOP;
    CLOSE record_cursor;
    END;
$$;

CALL load_cursor_record();
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| Hello |
| 25-DEC-20 |

#### Assigning a Record Variable in a SELECT INTO

This transformation consists in taking advantage of the OBJECT_CONTRUCT function to initialize the record using the SELECT columns as the arguments.

#### Sample auxiliary code

##### Oracle

```sql
create table sample_table(ID number, NAME varchar2(23));
CREATE TABLE RESULTS (COL1 VARCHAR(20), COL2 VARCHAR(40));
insert into sample_table values(1, 'NAME 1');
insert into sample_table values(2, 'NAME 2');
insert into sample_table values(3, 'NAME 3');
insert into sample_table values(4, 'NAME 4');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE sample_table (ID NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
NAME VARCHAR(23))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE RESULTS (COL1 VARCHAR(20),
COL2 VARCHAR(40))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

insert into sample_table
values(1, 'NAME 1');

insert into sample_table
values(2, 'NAME 2');

insert into sample_table
values(3, 'NAME 3');

insert into sample_table
values(4, 'NAME 4');
```

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE sp_sample1 AS
-- Rowtype variable
rowtype_variable sample_table%rowtype;

--Record variable
TYPE record_typ_def IS RECORD(ID number, NAME varchar2(23));
record_variable_def record_typ_def;

-- Auxiliary variable
name_var VARCHAR(20);
BEGIN
   SELECT * INTO rowtype_variable FROM sample_table WHERE ID = 1 FETCH NEXT 1 ROWS ONLY;
   name_var := rowtype_variable.NAME;
   INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 1', name_var);

   SELECT ID, NAME INTO rowtype_variable FROM sample_table WHERE ID = 2 FETCH NEXT 1 ROWS ONLY;
   name_var := rowtype_variable.NAME;
   INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 2', name_var);

   SELECT * INTO record_variable_def FROM sample_table WHERE ID = 3 FETCH NEXT 1 ROWS ONLY;
   name_var := record_variable_def.NAME;
   INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 3', name_var);

   SELECT ID, NAME INTO record_variable_def FROM sample_table WHERE ID = 4 FETCH NEXT 1 ROWS ONLY;
   name_var := record_variable_def.NAME;
   INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 4', name_var);
END;

call sp_sample1();

SELECT * FROM results;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| SELECT 1 | NAME 1 |
| SELECT 2 | NAME 2 |
| SELECT 3 | NAME 3 |
| SELECT 4 | NAME 4 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE sp_sample1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      -- Rowtype variable
      rowtype_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();

      --Record variable
      !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
      TYPE record_typ_def IS RECORD(ID number, NAME varchar2(23));
      record_variable_def OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - record_typ_def DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();

      -- Auxiliary variable
      name_var VARCHAR(20);
   BEGIN
      SELECT
         OBJECT_CONSTRUCT( *) INTO
         :rowtype_variable
      FROM
         sample_table
      WHERE ID = 1
      FETCH NEXT 1 ROWS ONLY;
      name_var := :rowtype_variable:NAME;
      INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 1', :name_var);

      SELECT
         OBJECT_CONSTRUCT()
      INTO
         :rowtype_variable
      FROM
         sample_table
      WHERE ID = 2
      FETCH NEXT 1 ROWS ONLY;
      name_var := :rowtype_variable:NAME;
      INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 2', :name_var);

      SELECT
         OBJECT_CONSTRUCT( *) INTO
         :record_variable_def
      FROM
         sample_table
      WHERE ID = 3
      FETCH NEXT 1 ROWS ONLY;
      name_var := :record_variable_def:NAME;
      INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 3', :name_var);

      SELECT
         OBJECT_CONSTRUCT('ID', ID, 'NAME', NAME) INTO
         :record_variable_def
      FROM
         sample_table
      WHERE ID = 4
      FETCH NEXT 1 ROWS ONLY;
      name_var := :record_variable_def:NAME;
      INSERT INTO RESULTS(COL1, COL2) VALUES('SELECT 4', :name_var);
   END;
$$;

call sp_sample1();

SELECT * FROM
   results;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| SELECT 1 | NAME 1 |
| SELECT 2 | NAME 2 |
| SELECT 3 | NAME 3 |
| SELECT 4 | NAME 4 |

### Known Issues

#### 1. The following functionalities are currently not being converted:

* Fetching data into a Record.
* Nested records (Records inside records).
* Collections inside records.

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported
3. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.
5. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.
6. [SSC-PRF-0003](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.

## Varray Type Definition

This is a translation reference to convert the Oracle Varray Declaration to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future.

> **Note:**
>
> This section is for the PL/SQL Version of the Varrays, for the Standalone Version please see [Array Type Definition](../sql-translation-reference/create_type.md).

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A varray (variable-size array) is an array whose number of elements can vary from zero (empty) to the declared maximum size.
>
> To access an element of a varray variable, use the syntax `variable_name(index)`. The lower bound of `index` is 1; the upper bound is the current number of elements. The upper bound changes as you add or delete elements, but it cannot exceed the maximum size. When you store and retrieve a varray from the database, its indexes and element order remain stable.
>
> ([Oracle PL/SQL Language Reference VARRAYS](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-collections-and-records.html#GUID-E932FC04-C7AD-4562-9555-8BA05446C0B8))

For the translation, the type definition is replaced by an ARRAY [Semi-structured Data Type](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) and then its usages are changed accordingly across any operations. Please note how the translation for Nested Tables and Varrays are the same.

To define a varray type, the syntax is as follows:

```none
type_definition := { VARRAY | [VARYING] ARRAY } (size_limit) OF datatype
            [NOT NULL];
```

To declare a variable of this type:

```none
variable_name collection_type;
```

### Sample Source Patterns

#### Varray definitions

This illustrates how three different ways to create a varray, and how to migrate these definitions for the variables.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE associative_array_procedure
IS
    TYPE varray_typ IS ARRAY(10) OF INTEGER;
    TYPE varray_typ2 IS VARRAY(10) OF INTEGER;
    TYPE varray_typ3 IS VARYING ARRAY(10) OF INTEGER;

    array_variable varray_typ;
    array_variable2 varray_typ2;
    array_variable3 varray_typ3;
BEGIN
    NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE associative_array_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE varray_typ IS ARRAY(10) OF INTEGER;
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE varray_typ2 IS VARRAY(10) OF INTEGER;
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE varray_typ3 IS VARYING ARRAY(10) OF INTEGER;

        array_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'varray_typ' USAGE CHANGED TO VARIANT ***/!!!;
        array_variable2 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'varray_typ2' USAGE CHANGED TO VARIANT ***/!!!;
        array_variable3 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'varray_typ3' USAGE CHANGED TO VARIANT ***/!!!;
    BEGIN
        NULL;
    END;
$$;
```

#### Varray iteration

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE varray_iteration
IS
    TYPE varray_typ IS VARRAY(3) OF INTEGER;
    varray_variable varray_typ := varray_typ(10, 20, 30);
BEGIN
    FOR i IN 1..varray_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(varray_variable(i));
    END LOOP;

    varray_variable(1) := 40;

    FOR i IN 1..varray_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(varray_variable(i));
    END LOOP;
END;
/

CALL varray_iteration();
```

##### Result

| DBMS OUTPUT |
| --- |
| 10 |
| 20 |
| 30 |
| 40 |
| 20 |
| 30 |

##### Snowflake

> **Note:**
>
> Note that a UDF was added to implement the functionality for the update of the element.
>
> This UDF will be added in later revisions.

```sql
CREATE OR REPLACE PROCEDURE varray_iteration ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE varray_typ IS VARRAY(3) OF INTEGER;
        varray_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'varray_typ' USAGE CHANGED TO VARIANT ***/!!! := varray_typ(10, 20, 30);
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN 1 TO 0 /*varray_variable.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'VARRAY CUSTOM TYPE EXPRESSION' NODE ***/!!!
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            CALL DBMS_OUTPUT.PUT_LINE_UDF(varray_variable(i));
        END LOOP;
            !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
            varray_variable(1) := 40;
            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
            FOR i IN 1 TO 0 /*varray_variable.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'VARRAY CUSTOM TYPE EXPRESSION' NODE ***/!!!
            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
               LOOP
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            CALL DBMS_OUTPUT.PUT_LINE_UDF(varray_variable(i));
               END LOOP;
    END;
$$;

CALL varray_iteration();
```

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.INSERT_REPLACE_COLLECTION_ELEMENT_UDF(varray ARRAY, position INTEGER, newValue VARIANT)
RETURNS ARRAY
LANGUAGE SQL
AS
$$
    ARRAY_CAT(
        ARRAY_APPEND(ARRAY_SLICE(varray, 0, (position)-1), newValue),
        ARRAY_SLICE(varray, position, ARRAY_SIZE(varray)))
$$;
```

##### Result

| DBMS OUTPUT |
| --- |
| 10 |
| 20 |
| 30 |
| 40 |
| 20 |
| 30 |

### Known Issues

#### 1. They are currently not being converted

SnowConvert AI does not support translating these elements.

##### 2. Indexing needs to be modified

Oracle’s indexes start at 1, on Snowflake they will begin at 0.

##### 3. Array Density may not match the original

Since the ARRAY datatype can become sparse, care should be taken when performing additions or deletions of the array. Using [ARRAY_COMPACT()](https://docs.snowflake.com/en/sql-reference/functions/array_compact.html) after such operations can be helpful if the density is a concern.

### Related EWIs

1. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-EWI-OR0108](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The Following Assignment Statement is Not Supported by Snowflake Scripting.
5. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.

## Collection Bulk Operations

This is a translation reference to convert the Oracle Collection Bulk Operations to Snowflake

> **Warning:**
>
> This section is a work in progress, information may change in the future

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The `BULK` `COLLECT` clause, a feature of bulk SQL, returns results from SQL to PL/SQL in batches rather than one at a time.
>
> The `BULK` `COLLECT` clause can appear in:
>
> * `SELECT` `INTO` statement
> * `FETCH` statement
> * `RETURNING` `INTO` clause of:
>
>   + `DELETE` statement
>   + `INSERT` statement
>   + `UPDATE` statement
>   + `EXECUTE` `IMMEDIATE` statement
>
> With the `BULK` `COLLECT` clause, each of the preceding statements retrieves an entire result set and stores it in one or more collection variables in a single operation (which is more efficient than using a loop statement to retrieve one result row at a time).

([Oracle PL/SQL Language Reference BULK COLLECT CLAUSE](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-optimization-and-tuning.html#GUID-19F50644-C88E-49AF-B31C-3EE4B4432714))

This section has some workarounds for SELECTs and FETCH Cursor with Bulk Clauses.

### Sample Source Patterns

#### Source Table

##### Oracle

```sql
CREATE TABLE bulk_collect_table(col1 INTEGER);

INSERT INTO bulk_collect_table VALUES(1);
INSERT INTO bulk_collect_table VALUES(2);
INSERT INTO bulk_collect_table VALUES(3);
INSERT INTO bulk_collect_table VALUES(4);
INSERT INTO bulk_collect_table VALUES(5);
INSERT INTO bulk_collect_table VALUES(6);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE bulk_collect_table (col1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO bulk_collect_table
VALUES(1);

INSERT INTO bulk_collect_table
VALUES(2);

INSERT INTO bulk_collect_table
VALUES(3);

INSERT INTO bulk_collect_table
VALUES(4);

INSERT INTO bulk_collect_table
VALUES(5);

INSERT INTO bulk_collect_table
VALUES(6);
```

#### Bulk Collect from a Table

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE bulk_collect_procedure
IS
    CURSOR record_cursor IS
        SELECT *
        FROM bulk_collect_table;

    TYPE fetch_collection_typ IS TABLE OF record_cursor%ROWTYPE;
    fetch_collection_variable fetch_collection_typ;

    TYPE collection_typ IS TABLE OF bulk_collect_table%ROWTYPE;
    collection_variable collection_typ;
BEGIN
    SELECT * BULK COLLECT INTO collection_variable FROM bulk_collect_table;

    FOR i IN 1..collection_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(collection_variable(i).col1);
    END LOOP;

    collection_variable := null;
    OPEN record_cursor;
    FETCH record_cursor BULK COLLECT INTO collection_variable;
    CLOSE record_cursor;

    FOR i IN 1..collection_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(collection_variable(i).col1+6);
    END LOOP;

    collection_variable := null;
    EXECUTE IMMEDIATE 'SELECT * FROM bulk_collect_table' BULK COLLECT INTO collection_variable;

    FOR i IN 1..collection_variable.COUNT
    LOOP
        DBMS_OUTPUT.PUT_LINE(collection_variable(i).col1+12);
    END LOOP;
END;
/

CALL bulk_collect_procedure();
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |
| 10 |
| 11 |
| 12 |
| 13 |
| 14 |
| 15 |
| 16 |
| 17 |
| 18 |

##### Snowflake

> **Danger:**
>
> EXECUTE IMMEDIATE with Bulk Collect clause has no workarounds offered.

> **Note:**
>
> Please note, that while the FETCH Cursor can be mostly preserved, it is advised to be changed into SELECT statements whenever possible for performance issues.

```sql
CREATE OR REPLACE PROCEDURE bulk_collect_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        record_cursor CURSOR
        FOR
            SELECT *
            FROM
                bulk_collect_table;
--                !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--                TYPE fetch_collection_typ IS TABLE OF record_cursor%ROWTYPE;
    fetch_collection_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'fetch_collection_typ' USAGE CHANGED TO VARIANT ***/!!!;
--                !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--    TYPE collection_typ IS TABLE OF bulk_collect_table%ROWTYPE;
    collection_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'collection_typ' USAGE CHANGED TO VARIANT ***/!!!;
    BEGIN
                !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'RECORDS AND COLLECTIONS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                SELECT * BULK COLLECT INTO collection_variable FROM bulk_collect_table;
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                FOR i IN 1 TO 0 /*collection_variable.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE CUSTOM TYPE EXPRESSION' NODE ***/!!!
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                   LOOP
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            CALL DBMS_OUTPUT.PUT_LINE_UDF(:collection_variable(i).col1);
                   END LOOP;
                !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

                collection_variable := null;
                OPEN record_cursor;
                --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
                record_cursor := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:record_cursor)
                );
                collection_variable := :record_cursor:RESULT;
                CLOSE record_cursor;
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                FOR i IN 1 TO 0 /*collection_variable.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE CUSTOM TYPE EXPRESSION' NODE ***/!!!
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                   LOOP
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            CALL DBMS_OUTPUT.PUT_LINE_UDF(
            !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
            :collection_variable(i).col1+6);
                   END LOOP;
                !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

                collection_variable := null;
                !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
                EXECUTE IMMEDIATE 'SELECT * FROM
   bulk_collect_table'
                      !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'EXECUTE IMMEDIATE RETURNING CLAUSE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                      BULK COLLECT INTO collection_variable;
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                FOR i IN 1 TO 0 /*collection_variable.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE CUSTOM TYPE EXPRESSION' NODE ***/!!!
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                   LOOP
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            CALL DBMS_OUTPUT.PUT_LINE_UDF(
            !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
            :collection_variable(i).col1+12);
                   END LOOP;
    END;
$$;

CALL bulk_collect_procedure();
```

##### Result

| DBMS OUTPUT |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |
| 10 |
| 11 |
| – EXECUTE IMMEDIATE NOT EXECUTED, it’s not supported |

#### SELECT INTO statement case

In this case, the translation specification uses RESULTSETs. Review the documentation for WITH, SELECT, and BULK COLLECT INTO statements here:

with-select-and-bulk-collect-into-statements.md

### Known Issues

#### 1. Heavy performance issues on FETCH Cursor workaround

The workaround for the Fetch cursor has heavy performance requirements due to the Temporary table. It is advised for them to be manually migrated to SELECT statements

##### 2. Execute immediate statements are not transformed

They are not supported by SnowConvert AI but may be manually changed to SELECT statements.

### Related EWIs

1. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review
4. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
5. [SSC-EWI-OR0108](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The Following Assignment Statement is Not Supported by Snowflake Scripting.
6. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.
7. [SSC-PRF-0001](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor fetch bulk operations.
8. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL

## WITH, SELECT, and BULK COLLECT INTO statements

> **Danger:**
>
> This section is a translation specification. Information may change in the future.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This section is a translation specification for the statement WITH subsequent to a SELECT statement which uses a BULK COLLECT INTO statement. For more information review the following documentation:

* [SELECT INTO Statement Documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/SELECT-INTO-statement.html#GUID-6E14E04D-4344-45F3-BE80-979DD26C7A90).
* SnowConvert AI Bulk Collect translation.

### Sample Source Patterns

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

The following query is used for the following examples.

#### Oracle

```sql
-- Sample MySampleTable table
CREATE TABLE MySampleTable (
  MySampleID NUMBER PRIMARY KEY,
  FirstName VARCHAR2(50),
  Salary NUMBER,
  Department VARCHAR2(50)
);

-- Insert some sample data
INSERT INTO MySampleTable (MySampleID, FirstName, Salary, Department)
VALUES (1, 'Bob One', 50000, 'HR');

INSERT INTO MySampleTable (MySampleID, FirstName, Salary, Department)
VALUES (2, 'Bob Two', 60000, 'HR');

INSERT INTO MySampleTable (MySampleID, FirstName, Salary, Department)
VALUES (3, 'Bob Three', 75000, 'IT');

INSERT INTO MySampleTable (MySampleID, FirstName, Salary, Department)
VALUES (4, 'Bob Four', 80000, 'IT');
```

##### Snowflake

```sql
-- Sample MySampleTable table
CREATE OR REPLACE TABLE MySampleTable (
   MySampleID NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ PRIMARY KEY,
   FirstName VARCHAR(50),
   Salary NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
   Department VARCHAR(50)
 )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

-- Insert some sample data
INSERT INTO MySampleTable(MySampleID, FirstName, Salary, Department)
VALUES (1, 'Bob One', 50000, 'HR');

INSERT INTO MySampleTable(MySampleID, FirstName, Salary, Department)
VALUES (2, 'Bob Two', 60000, 'HR');

INSERT INTO MySampleTable(MySampleID, FirstName, Salary, Department)
VALUES (3, 'Bob Three', 75000, 'IT');

INSERT INTO MySampleTable(MySampleID, FirstName, Salary, Department)
VALUES (4, 'Bob Four', 80000, 'IT');
```

#### 1. Inside procedure simple case

> **Danger:**
>
> This is an approach that uses a resultset data type. User-defined types must be reviewed. Review the following [Snowflake documentation](https://docs.snowflake.com/developer-guide/snowflake-scripting/resultsets) to review more information about RESULTSETs.

The following example uses a User-defined type and it is declared indirectly as a table. The translation for this case implements a RESULTSET as a data type in Snowflake. The resultset is stored on a variable which must be returned wrapped on a `TABLE()` function.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE simple_procedure
IS
  TYPE salary_collection IS TABLE OF NUMBER;
  v_salaries salary_collection := salary_collection();

BEGIN
  WITH IT_Employees AS (
    SELECT Salary
    FROM MySampleTable
    WHERE Department = 'IT'
  )
  SELECT Salary BULK COLLECT INTO v_salaries
  FROM IT_Employees;
END;

CALL simple_procedure();
```

##### Result

> **Note:**
>
> The query does not return results but the expected gathered information would be the IT Salary Information used for the example:

| IT_Salary |
| --- |
| 75000 |
| 80000 |

> **Danger:**
>
> One of the limitations of the RESULTSETs is that they cannot be used as tables. E.g.: `select * from my_result_set;` (This is an error, review the following [documentation](https://docs.snowflake.com/developer-guide/snowflake-scripting/resultsets#limitations-of-the-resultset-data-type) for more information).

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE simple_procedure ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  !!!RESOLVE EWI!!! /*** SSC-EWI-OR0072 - PROCEDURAL MEMBER TYPE DEFINITION NOT SUPPORTED. ***/!!!
  /*   TYPE salary_collection IS TABLE OF NUMBER */
  ;
  !!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
  /*   v_salaries salary_collection := salary_collection() */
  ;
  EXEC(`SELECT Salary
    FROM
       MySampleTable
    WHERE Department = 'IT'`);
  [
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PlBulkCollectionItem' NODE ***/!!!
    //v_salaries
    null,V_SALARIES] = EXEC(`SELECT
   Salary
 FROM IT_Employees`);
$$;

CALL simple_procedure();
```

##### Result

| SALARY |
| --- |
| 77500 |
| 80000 |

#### 2. Simple case for iterations: FOR LOOP statement

The following case is to define a translation for iteration with `FOR...LOOP`. In this case, the User-defined type is implicitly a table, thus, it is possible to use a cursor to iterate. Review the following documentation to learn more:

* Snowflake documentation about Returning a [Table for a Cursor.](https://docs.snowflake.com/developer-guide/snowflake-scripting/cursors#returning-a-table-for-a-cursor)
* In this case, there is a need to create a cursor for the iteration. Review the following [Cursor Assignment Syntax](https://docs.snowflake.com/sql-reference/snowflake-scripting/let#cursor-assignment-syntax) documentation.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE simple_procedure
IS
  TYPE salary_collection IS TABLE OF NUMBER;
  v_salaries salary_collection := salary_collection();
  v_average_salary NUMBER;
  salaries_count NUMBER;

BEGIN
  salaries_count := 0;
  WITH IT_Employees AS (
    SELECT Salary
    FROM MySampleTable
    WHERE Department = 'IT'
  )
  SELECT Salary BULK COLLECT INTO v_salaries
  FROM IT_Employees;

  -- Calculate the average salary
  IF v_salaries.COUNT > 0 THEN
    v_average_salary := 0;
    FOR i IN 1..v_salaries.COUNT LOOP
		v_average_salary := v_average_salary + v_salaries(i);
		salaries_count := salaries_count + 1;
    END LOOP;
    v_average_salary := v_average_salary / salaries_count;
  END IF;

  -- Display the average salary
  DBMS_OUTPUT.PUT_LINE('Average Salary for IT Department: ' || v_average_salary);
END;
/

CALL simple_procedure();
```

##### Result

```none
Statement processed.
Average Salary for IT Department: 77500
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE simple_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
--		!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--		TYPE salary_collection IS TABLE OF NUMBER;
		v_salaries VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'salary_collection' USAGE CHANGED TO VARIANT ***/!!! := salary_collection();
		v_average_salary NUMBER(38, 18);
		salaries_count NUMBER(38, 18);
	BEGIN
		salaries_count := 0;
		WITH IT_Employees AS
		(
		  SELECT Salary
		  FROM
		  	MySampleTable
		  WHERE Department = 'IT'
		)
		!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'RECORDS AND COLLECTIONS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
		SELECT Salary BULK COLLECT INTO v_salaries
		FROM IT_Employees;
		-- Calculate the average salary
		IF (null /*v_salaries.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE CUSTOM TYPE EXPRESSION' NODE ***/!!! > 0) THEN
		  v_average_salary := 0;
		  --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
		  FOR i IN 1 TO 0 /*v_salaries.COUNT*/!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE CUSTOM TYPE EXPRESSION' NODE ***/!!!
 		                                                                                                                                                                        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
 		                                                                                                                                                                        LOOP
		  	v_average_salary :=
		  	!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN NUMBER AND salary_collection ***/!!!
		  	:v_average_salary + v_salaries(i);
		  	salaries_count := :salaries_count + 1;
 		                                                                                                                                                                           END LOOP;
		  v_average_salary := :v_average_salary / :salaries_count;
		END IF;
		-- Display the average salary
		--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
		CALL DBMS_OUTPUT.PUT_LINE_UDF('Average Salary for IT Department: ' || NVL(:v_average_salary :: STRING, ''));
	END;
$$;

CALL simple_procedure();
```

##### Result

| SIMPLE_PROCEDURE |
| --- |
| Average Salary for IT Department: 77500 |

### Known Issues

#### 1. Resulset limitations.

There are limitations while using the RESULTSET data type. Review the following [Snowflake documentation](https://docs.snowflake.com/developer-guide/snowflake-scripting/resultsets#limitations-of-the-resultset-data-type) to learn more. Markable limitations are the following:

* Declaring a column of type RESULTSET.
* Declaring a parameter of type RESULTSET.
* Declaring a stored procedure’s return type as a RESULTSET.

##### 2. Execute statements with Bulk Collect clause are not supported.

Review the following documentation.

### Related EWIs

1. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review
4. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
5. [SSC-EWI-OR0072](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Procedural Member not supported
6. [SSC-EWI-OR0104](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Unusable collection variable.
7. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
8. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.

---
title: SnowConvert AI - Oracle - CREATE FUNCTION
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/create-function.md
section: Migrations
---

# SnowConvert AI - Oracle - CREATE FUNCTION

Oracle Create Function to Snowflake Snow Scripting

## Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> A **stored function** (also called a **user function** or **user-defined function**) is a set of PL/SQL statements you can call by name. Stored functions are very similar to procedures, except that a function returns a value to the environment in which it is called. User functions can be used as part of a SQL expression.
>
> A **call specification** declares a Java method or a third-generation language (3GL) routine so that it can be called from PL/SQL. You can also use the `CALL` SQL statement to call such a method or routine. The call specification tells Oracle Database which Java method, or which named function in which shared library, to invoke when a call is made. It also tells the database what type conversions to make for the arguments and return value. [Oracle SQL Language Reference Create Function](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/CREATE-FUNCTION.html).

### Oracle Syntax

For more information, see the [Oracle CREATE FUNCTION documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/CREATE-FUNCTION-statement.html).

#### Oracle Create Function Syntax

```sql
CREATE [ OR REPLACE ] [ EDITIONABLE | NONEDITIONABLE ]
FUNCTION
[ schema. ] function_name
  [ ( parameter_declaration [, parameter_declaration]... ) ] RETURN datatype
[ sharing_clause ]
  [ { invoker_rights_clause
    | accessible_by_clause
    | default_collation_clause
    | deterministic_clause
    | parallel_enable_clause
    | result_cache_clause
    | aggregate_clause
    | pipelined_clause
    | sql_macro_clause
       }...
  ]
{ IS | AS } { [ declare_section ]
    BEGIN statement ...
    [ EXCEPTION exception_handler [ exception_handler ]... ]
    END [ name ] ;
      |
    { java_declaration | c_declaration } } ;
```

### Snowflake Syntax

Snowflake allows 3 different languages in their user-defined functions:

* SQL
* JavaScript
* Java

For now, SnowConvert AI will support only `SQL` and `JavaScript` as target languages.

For more information, see the [Snowflake UDF overview](https://docs.snowflake.com/en/developer-guide/udf/udf-overview).

#### SQL

> **Note:**
>
> SQL user-defined functions only support one query as their body. They can read from the database but are not allowed to write to or modify it ([Scalar SQL UDFs](https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-scalar-functions.html)).

```sql
CREATE [ OR REPLACE ] [ SECURE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

##### JavaScript

> **Note:**
>
> JavaScript user-defined functions allow multiple statements in their bodies but cannot perform queries to the database. ([Scalar JavaScript UDFs](https://docs.snowflake.com/en/developer-guide/udf/javascript/udf-javascript-scalar-functions)).

```sql
CREATE [ OR REPLACE ] [ SECURE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE JAVASCRIPT
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

## Sample Source Patterns

### Sample auxiliary data

> **Note:**
>
> This code was executed for a better understanding of the examples:

#### Oracle

```sql
CREATE TABLE table1 (col1 int, col2 int, col3 varchar2(250), col4 varchar2(250), col5 date);

INSERT INTO table1 VALUES (1, 11, 'val1_1', 'val1_2', TO_DATE('2004/05/03', 'yyyy-MM-dd'));
INSERT INTO table1 VALUES (2, 22, 'val2_1', 'val2_2', TO_DATE('2014/05/03', 'yyyy-MM-dd'));
INSERT INTO table1 VALUES (3, 33, 'val3_1', 'val3_2', TO_DATE('2024/05/03', 'yyyy-MM-dd'));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (col1 int,
col2 int,
col3 VARCHAR(250),
col4 VARCHAR(250),
col5 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/25/2024" }}'
;

INSERT INTO table1
VALUES (1, 11, 'val1_1', 'val1_2', TO_DATE('2004/05/03', 'yyyy-MM-dd'));

INSERT INTO table1
VALUES (2, 22, 'val2_1', 'val2_2', TO_DATE('2014/05/03', 'yyyy-MM-dd'));

INSERT INTO table1
VALUES (3, 33, 'val3_1', 'val3_2', TO_DATE('2024/05/03', 'yyyy-MM-dd'));
```

## Known Issues

No issues were found.

## Related EWIs

1. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior

## Cursor for a return variable

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

This pattern defines a function in Oracle PL/SQL that uses a cursor to fetch a single value and return it.

**Components:**

1. **Function Declaration:**

   * `CREATE FUNCTION functionName(parameters) RETURN returnType`
   * Declares the function with input parameters and the return type.
2. **Variable Declarations:**

   * Declares variables, including the return variable.
3. **Cursor Declaration:**

   * `CURSOR cursorName IS SELECT singleColumn FROM ... WHERE ... [AND col1 = localVar1];`
   * Defines a cursor to select a single column from a table with optional filtering conditions.
4. **BEGIN-END Block:**

   * Variables assignment.
   * Opens the cursor.
   * Fetch the result into the return variable.
   * Closes the cursor.
   * Returns the fetched value.

In this case, the variables are transformed into a common table expression (CTE). As well as the query within the cursor to which, in addition, the `FETCH FIRST 1 ROW ONLY` clause is added to simulate the `FETCH CURSOR` behavior.

`RETURN` statement is transformed to the final select.

### Queries

#### Oracle

```sql
CREATE OR REPLACE FUNCTION func1 (
   company_ IN VARCHAR2,
   book_id_ IN DATE,
   object_id_ IN VARCHAR2 ) RETURN INTEGER
IS
   temp_ table1.col2%TYPE;
   CURSOR get_attr IS
      SELECT col2
      FROM table1
      WHERE col3 = company_
      AND   col4 = object_id_
      AND   col5 = book_id_;
BEGIN
   OPEN get_attr;
   FETCH get_attr INTO temp_;
   CLOSE get_attr;
   RETURN temp_;
END func1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION func1 (company_ VARCHAR, book_id_ TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/, object_id_ VARCHAR)
RETURNS INTEGER
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte1 AS
   (
      SELECT
         (
         SELECT col2
         FROM table1
         WHERE col3 = company_
         AND   col4 = object_id_
         AND   col5 = book_id_
         FETCH FIRST 1 ROW ONLY) AS temp_
   )
   SELECT
      temp_
   FROM
      declaration_variables_cte1
$$;
```

##### Result

| FUNC1() |
| --- |
| 2004-05-03. |

##### Oracle

```sql
CREATE FUNCTION func2 (
   fa_period_   IN NUMBER,
   to_date_     IN DATE DEFAULT NULL,
   from_date_   IN DATE DEFAULT NULL ) RETURN NUMBER
IS
   value_                    NUMBER;
   cond_date_to_             DATE;
   cond_date_from_           DATE;
   CURSOR get_acq_value IS
      SELECT NVL(SUM(col1),0)
      FROM   table1
      WHERE  col3                   IN (DECODE(fa_period_, 1, 'val1_1', 'val2_1'))
      AND    col5           <= cond_date_to_
      AND    col5           >= cond_date_from_;
BEGIN
   value_ := 0;
   cond_date_to_       := Get_Cond_Date( to_date_, 'MAX' );
   cond_date_from_     := Get_Cond_Date( from_date_, 'MIN' );
   OPEN get_acq_value;
   FETCH get_acq_value INTO value_;
   CLOSE get_acq_value;
   RETURN (NVL(value_,0));
END func2;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION func2 (fa_period_ NUMBER(38, 18),
  to_date_ TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ DEFAULT NULL,
  from_date_ TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ DEFAULT NULL )
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte1 AS
   (
      SELECT
         0 AS
         value_,
         Get_Cond_Date( to_date_, 'MAX' ) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Get_Cond_Date' NODE ***/!!! AS
         cond_date_to_,
         Get_Cond_Date( from_date_, 'MIN' ) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Get_Cond_Date' NODE ***/!!! AS
         cond_date_from_
   ),
   declaration_variables_cte2 AS
   (
      SELECT
         (
         SELECT NVL(SUM(col1),0)
         FROM   table1
         WHERE  col3                   IN (DECODE(fa_period_, 1, 'val1_1', 'val2_1'))
         AND    col5           <= cond_date_to_
         AND    col5           >= cond_date_from_
         FETCH FIRST 1 ROW ONLY) AS value_,
         cond_date_to_,
         cond_date_from_
      FROM
         declaration_variables_cte1
   )
   SELECT
      (NVL(value_,0))
   FROM
      declaration_variables_cte2
$$;
```

##### Result

| FUNC1() |
| --- |
| 2004-05-03. |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.
2. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Cursor with IF statement

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

This pattern defines a function that conditionally uses a cursor to fetch and return a value based on an `IF` statement.

**Components:**

1. **Function Declaration:**

   * `CREATE FUNCTION functionName(parameters) RETURN returnType`
   * Declares the function with input parameters and the return type.
2. **Cursor Declaration:**

   * `CURSOR cursorName IS SELECT singleColumn FROM ... WHERE ... [AND col1 = localVar1];`
   * Defines a cursor to select a single column from a table with optional filtering conditions.
3. **Variable Declaration:**

   * Declares variables, including the return variable.
4. **BEGIN-END Block with IF Statement:**

   * Variables assignment.
   * Check if a condition is true.
   * If true, opens the cursor, fetches the result into the return variable, closes the cursor, and returns the fetched value. (The cursor can also be opened in the `ELSE` block and must meet the same conditions)
   * The `ELSE` Block is optional, if it exists, it should only contain a single statement that can be an assignment or a `RETURN` statement.

The variables are transformed into a common table expression (CTE). As well as the query within the cursor to which, in addition, the `FETCH FIRST 1 ROW ONLY` clause is added to simulate the `FETCH CURSOR` behavior.

`IF/ELSE` statement can be handled using the [`CASE EXPRESSION`](https://docs.snowflake.com/en/sql-reference/functions/case) inside the select allowing conditionals inside the queries. `RETURN` statement is transformed to the final select..

### Queries

#### Oracle

```sql
CREATE OR REPLACE FUNCTION func1 (
   company_          IN NUMBER) RETURN NUMBER
IS
   CURSOR getmaxperiod IS
      SELECT max(col2)
      FROM   table1;
   max_period_               NUMBER := 12;
BEGIN
   IF 1 = 1 THEN
      OPEN   getmaxperiod;
      FETCH  getmaxperiod INTO max_period_ ;
      CLOSE  getmaxperiod;
      RETURN max_period_;
   ELSE
      RETURN NULL;
   END IF;
END func1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION func1 (company_ NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte0 AS
   (
      SELECT
         12 AS
         max_period_
   ),
   declaration_variables_cte1 AS
   (
      SELECT
         CASE
            WHEN 1 = 1
               THEN (
               SELECT max(col2)
               FROM   table1
               FETCH FIRST 1 ROW ONLY)
            ELSE NULL
         END AS max_period_
      FROM
         declaration_variables_cte0
   )
   SELECT
      max_period_
   FROM
      declaration_variables_cte1
$$;
```

##### Result

| FUNC2(0) |
| --- |
| NULL |

| FUNC2(1) |
| --- |
| 33 |

##### Oracle

```sql
CREATE OR REPLACE FUNCTION func2(
   company_          IN NUMBER) RETURN NUMBER
IS
   CURSOR getmaxperiod IS
      SELECT max(col2)
      FROM   table1;
   max_period_               NUMBER := 1;
BEGIN
   max_period_:= 2;
   IF company_ = 1 THEN
      RETURN max_period_ * 2;
   ELSE
      OPEN   getmaxperiod;
      FETCH  getmaxperiod INTO max_period_ ;
      CLOSE  getmaxperiod;
      RETURN max_period_;
   END IF;
END func2;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION func2 (company_ NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte0 AS
   (
      SELECT
         1 AS
         max_period_
   ),
   declaration_variables_cte1 AS
   (
      SELECT
         2 AS
         max_period_
      FROM
         declaration_variables_cte0
   ),
   declaration_variables_cte2 AS
   (
      SELECT
         CASE
            WHEN company_ = 1
               THEN max_period_ * 2
            ELSE (
            SELECT max(col2)
            FROM   table1
            FETCH FIRST 1 ROW ONLY)
         END AS max_period_
      FROM
         declaration_variables_cte1
   )
   SELECT
      max_period_
   FROM
      declaration_variables_cte2
$$;
```

##### Result

| FUNC2(0) |
| --- |
| 33 |

| FUNC2(1) |
| --- |
| 2 |

##### Oracle

```sql
CREATE OR REPLACE FUNCTION func3 (
   company_          IN NUMBER) RETURN NUMBER
IS
   CURSOR getmaxperiod IS
      SELECT max(col2)
      FROM   table1;
   max_period_               NUMBER := 0;
BEGIN
   IF company_ = 1 THEN
      OPEN   getmaxperiod;
      FETCH  getmaxperiod INTO max_period_ ;
      CLOSE  getmaxperiod;
   END IF;
   RETURN max_period_;
END func10;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION func3 (company_ NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte0 AS
   (
      SELECT
         0 AS
         max_period_
   ),
   declaration_variables_cte1 AS
   (
      SELECT
         CASE
            WHEN company_ = 1
               THEN (
               SELECT max(col2)
               FROM   table1
               FETCH FIRST 1 ROW ONLY)
            ELSE max_period_
         END AS max_period_
      FROM
         declaration_variables_cte0
   )
   SELECT
      max_period_
   FROM
      declaration_variables_cte1
$$;
```

##### Result

| FUNC2(0) |
| --- |
| 0 |

| FUNC2(1) |
| --- |
| 33 |

### Known Issues

No issues were found.

### Related EWIs

No EWIs related.

## Multiple IF statement

This pattern defines a function that uses conditional statements over local variables.

**Components:**

1. **Function Declaration:**

   * `CREATE FUNCTION functionName(parameters) RETURN returnType`
   * Declares the function with input parameters and the return type.
2. **Variable Declaration:**

   * Declares variables, including the return variable.
3. **BEGIN-END Block with IF Statement:**

   * Check if a condition is true.
   * Each case is used to assign a value over the same variable.

### Conversion:

**`DECLARE SECTION`** : variables with default expression are moved to a common table expression.

**`IF/ELSE`** statement can be handled using the [`CASE EXPRESSION`](https://docs.snowflake.com/en/sql-reference/functions/case) inside the select allowing conditionals inside the queries.

**`RETURN`** statement is transformed to the final select.

#### Oracle

```sql
CREATE OR REPLACE FUNCTION Case1 (
   in_date_ IN DATE,
   min_max_ IN VARCHAR2 )
RETURN DATE
IS
   cond_date_  DATE := CURRENT_DATE;
BEGIN
   IF ( in_date_ IS NULL ) THEN
      IF ( min_max_ = 'MIN' ) THEN
         cond_date_ := FOO1();
      ELSE
         cond_date_ := FOO2();
      END IF;
   ELSE
      cond_date_ := TRUNC(in_date_);
   END IF;
   RETURN cond_date_;
END Case1;
```

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION Case1 (in_date_ TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/, min_max_ VARCHAR)
RETURNS TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte0 AS
   (
      SELECT
         CURRENT_DATE AS
         cond_date_
   ),
   declaration_variables_cte1 AS
   (
      SELECT
         CASE
            WHEN ( in_date_ IS NULL )
               THEN CASE
                  WHEN ( min_max_ = 'MIN' )
                     THEN FOO1() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO1' NODE ***/!!!
                  ELSE FOO2() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO2' NODE ***/!!!
               END
            ELSE TRUNC(in_date_, 'DD')
         END AS cond_date_
      FROM
         declaration_variables_cte0
   )
   SELECT
      cond_date_
   FROM
      declaration_variables_cte1
$$;
```

#### Oracle

```sql
CREATE OR REPLACE FUNCTION Case2 (
   year_        IN NUMBER,
   id           IN NUMBER)
   RETURN VARCHAR2
IS
   base_value_        NUMBER;
   fully_depritiated_ VARCHAR2(5);
   residual_value_    NUMBER;
   acc_depr_prev_     NUMBER;
   acc_depr_          NUMBER;
BEGIN

   base_value_     := FOO1(year_, id);
   acc_depr_       := FOO2(year_, id);
   acc_depr_prev_  := FOO3(year_, id);
   residual_value_ := NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_);

   IF (residual_value_=0 AND base_value_!=0) THEN
      fully_depritiated_ := 'TRUE';
   ELSE
      fully_depritiated_ := 'FALSE';
   END IF;

   RETURN fully_depritiated_;
END Case2;
```

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION Case2 (year_ NUMBER(38, 18), id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte1 AS
   (
      SELECT
         FOO1(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO1' NODE ***/!!! AS

         base_value_,
         FOO2(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO2' NODE ***/!!! AS
         acc_depr_,
         FOO3(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO3' NODE ***/!!! AS
         acc_depr_prev_,
         !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN NUMBER AND unknown ***/!!!
         NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_) AS
         residual_value_,
         CASE
            WHEN (residual_value_=0 AND base_value_!=0)
               THEN 'TRUE'
            ELSE 'FALSE'
         END AS fully_depritiated_
   )
   SELECT
      fully_depritiated_
   FROM
      declaration_variables_cte1
$$;
```

#### Oracle

```sql
CREATE OR REPLACE FUNCTION Case2_1 (
   year_        IN NUMBER,
   id           IN NUMBER)
   RETURN VARCHAR2
IS
   base_value_        NUMBER;
   fully_depritiated_ VARCHAR2(5);
   residual_value_    NUMBER;
   acc_depr_prev_     NUMBER;
   acc_depr_          NUMBER;
BEGIN

   base_value_     := FOO1(year_, id);
   acc_depr_       := FOO2(year_, id);
   acc_depr_prev_  := FOO3(year_, id);
   residual_value_ := NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_);

   IF (residual_value_=0 AND base_value_!=0) THEN
      fully_depritiated_ := 'TRUE';
   ELSE
      fully_depritiated_ := 'FALSE';
   END IF;

   fully_depritiated := fully_depritiated || ' CONCAT FOR TESTING';
   fully_depritiated := fully_depritiated || ' CONCAT FOR TESTING2';
   RETURN fully_depritiated_;
END Case2;
```

#### Snowflake

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "FOO1", "FOO2", "FOO3" **
CREATE OR REPLACE FUNCTION Case2_1 (year_ NUMBER(38, 18), id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS
$$
   WITH declaration_variables_cte1 AS
   (
      SELECT
         FOO1(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO1' NODE ***/!!! AS

         base_value_,
         FOO2(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO2' NODE ***/!!! AS
         acc_depr_,
         FOO3(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO3' NODE ***/!!! AS
         acc_depr_prev_,
         !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN NUMBER AND unknown ***/!!!
         NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_) AS
         residual_value_,
         CASE
            WHEN (residual_value_=0 AND base_value_!=0)
               THEN 'TRUE'
            ELSE 'FALSE'
         END AS fully_depritiated_,
         NVL(fully_depritiated :: STRING, '') || ' CONCAT FOR TESTING' AS

         fully_depritiated
   ),
   declaration_variables_cte2 AS
   (
      SELECT
         NVL(fully_depritiated :: STRING, '') || ' CONCAT FOR TESTING2' AS
         fully_depritiated,
         base_value_,
         acc_depr_,
         acc_depr_prev_,
         residual_value_
      FROM
         declaration_variables_cte1
   )
   SELECT
      fully_depritiated_
   FROM
      declaration_variables_cte2
$$;
```

#### Oracle

```sql
CREATE OR REPLACE FUNCTION Case2_1 (
   year_        IN NUMBER,
   id           IN NUMBER)
   RETURN VARCHAR2
IS
   base_value_        NUMBER;
   fully_depritiated_ VARCHAR2(5);
   residual_value_    NUMBER;
   acc_depr_prev_     NUMBER;
   acc_depr_          NUMBER;
BEGIN

   base_value_     := FOO1(year_, id);
   acc_depr_       := FOO2(year_, id);
   acc_depr_prev_  := FOO3(year_, id);
   residual_value_ := NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_);

   IF (residual_value_=0 AND base_value_!=0) THEN
      fully_depritiated_ := 'TRUE';
   ELSE
      fully_depritiated_ := 'FALSE';
   END IF;

   fully_depritiated := fully_depritiated || ' CONCAT FOR TESTING';
   fully_depritiated := fully_depritiated || ' CONCAT FOR TESTING2';
   RETURN fully_depritiated_;
END Case2;
```

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION Case2_1 (year_ NUMBER(38, 18), id NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "09/06/2024" }}'
AS
$$
   WITH declaration_variables_cte1 AS
   (
      SELECT
         FOO1(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO1' NODE ***/!!! AS

         base_value_,
         FOO2(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO2' NODE ***/!!! AS
         acc_depr_,
         FOO3(year_, id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FOO3' NODE ***/!!! AS
         acc_depr_prev_,
         !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN NUMBER AND unknown ***/!!!
         NVL(base_value_,0) -(acc_depr_ + acc_depr_prev_) AS
         residual_value_,
         CASE
            WHEN (residual_value_=0 AND base_value_!=0)
               THEN 'TRUE'
            ELSE 'FALSE'
         END AS fully_depritiated_,
         NVL(fully_depritiated :: STRING, '') || ' CONCAT FOR TESTING' AS

         fully_depritiated
   ),
   declaration_variables_cte2 AS
   (
      SELECT
         NVL(fully_depritiated :: STRING, '') || ' CONCAT FOR TESTING2' AS
         fully_depritiated,
         base_value_,
         acc_depr_,
         acc_depr_prev_,
         residual_value_
      FROM
         declaration_variables_cte1
   )
   SELECT
      fully_depritiated_
   FROM
      declaration_variables_cte2
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.
2. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.

## Snowflake Script UDF (SCALAR)

Translation reference for Oracle User Defined Functions to [Snowflake Scripting UDFs](../../../../../developer-guide/udf/sql/udf-sql-procedural-functions.md)

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

SnowConvert now supports translating Oracle PL/SQL User Defined Functions directly to **Snowflake Scripting UDFs** (SnowScript UDFs) when they meet specific criteria.

**Snowflake Scripting UDFs** are user-defined functions written using Snowflake’s procedural language syntax (Snowscript) within a SQL UDF body. They support variables, loops, conditional logic, and exception handling without requiring database access.

#### When Functions Become SnowScript UDFs

SnowConvert analyzes each Oracle function and automatically determines the appropriate Snowflake target. A function becomes a SnowScript UDF when it contains **only** procedural logic without data access operations.

### Sample Source Patterns

#### Simple Calculation Function

A basic function that performs calculations without querying data.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION CalculateTax (
    amount_ IN NUMBER,
    tax_rate_ IN NUMBER
) RETURN NUMBER
IS
    tax_amount_ NUMBER;
BEGIN
    tax_amount_ := amount_ * (tax_rate_ / 100);
    RETURN tax_amount_;
END CalculateTax;
```

##### Result

| CALCULATETAX(1000, 15) |
| --- |
| 150 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION CalculateTax (amount_ NUMBER(38, 18), tax_rate_ NUMBER(38, 18)
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "zsqZAVE5n32hZZFtsi0zsg==" }}'
AS
$$
   DECLARE
      tax_amount_ NUMBER(38, 18);
   BEGIN
      tax_amount_ := :amount_ * (:tax_rate_ / 100);
      RETURN :tax_amount_;
   END;
$$;
```

##### Result

| CALCULATETAX(1000, 15) |
| --- |
| 150 |

#### Function with IF/ELSIF/ELSE Logic

Functions using conditional statements for business logic.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION GetShippingCost (
    distance_ IN NUMBER,
    weight_ IN NUMBER
) RETURN NUMBER
IS
    shipping_cost_ NUMBER := 0;
BEGIN
    IF distance_ < 50 THEN
        shipping_cost_ := 10;
    ELSIF distance_ < 100 THEN
        shipping_cost_ := 20;
    ELSIF distance_ < 200 THEN
        shipping_cost_ := 35;
    ELSE
        shipping_cost_ := 50;
    END IF;

    IF weight_ > 20 THEN
        shipping_cost_ := shipping_cost_ * 1.5;
    END IF;

    RETURN shipping_cost_;
END GetShippingCost;
```

##### Result

| GETSHIPPINGCOST(75, 25) |
| --- |
| 30 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION GetShippingCost (distance_ NUMBER(38, 18), weight_ NUMBER(38, 18)
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "zsqZAVE5n32hZZFtsi0zsg==" }}'
AS
$$
   DECLARE
      shipping_cost_ NUMBER(38, 18) := 0;
   BEGIN
      IF (:distance_ < 50) THEN
         shipping_cost_ := 10;
      ELSEIF (:distance_ < 100) THEN
         shipping_cost_ := 20;
      ELSEIF (:distance_ < 200) THEN
         shipping_cost_ := 35;
    ELSE
         shipping_cost_ := 50;
      END IF;
      IF (:weight_ > 20) THEN
         shipping_cost_ := :shipping_cost_ * 1.5;
      END IF;
      RETURN :shipping_cost_;
   END;
$$;
```

##### Result

| GETSHIPPINGCOST(75, 25) |
| --- |
| 30 |

#### Function with FOR Loop

Functions using loops for iterative calculations.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION CalculateCompoundInterest (
    principal_ IN NUMBER,
    rate_ IN NUMBER,
    years_ IN NUMBER
) RETURN NUMBER
IS
    amount_ NUMBER;
    i NUMBER;
BEGIN
    amount_ := principal_;

    FOR i IN 1..years_ LOOP
        amount_ := amount_ * (1 + rate_ / 100);
    END LOOP;

    RETURN ROUND(amount_, 2);
END CalculateCompoundInterest;
```

##### Result

| CALCULATECOMPOUNDINTEREST(1000, 5, 3) |
| --- |
| 1157.63 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION CalculateCompoundInterest (principal_ NUMBER(38, 18), rate_ NUMBER(38, 18), years_ NUMBER(38, 18)
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "zsqZAVE5n32hZZFtsi0zsg==" }}'
AS
$$
   DECLARE
      amount_ NUMBER(38, 18);
      i NUMBER(38, 18);
   BEGIN
      amount_ := :principal_;
      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
      FOR i IN 1 TO :years_
                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                            LOOP
         amount_ := :amount_ * (
                                !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Number AND unknown ***/!!!1 + :rate_ / 100);
                               END LOOP;
      RETURN ROUND(:amount_, 2);
   END;
$$;
```

##### Result

| CALCULATECOMPOUNDINTEREST(1000, 5, 3) |
| --- |
| 1157.63 |

#### CASE and DECODE Logic

Functions using CASE expressions and DECODE for categorization.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION GetCustomerTier (
    annual_spend_ IN NUMBER,
    years_active_ IN NUMBER
) RETURN VARCHAR2
IS
    tier_ VARCHAR2(20);
    base_tier_ VARCHAR2(20);
BEGIN
    -- Determine base tier by spending
    base_tier_ := CASE
        WHEN annual_spend_ >= 10000 THEN 'PLATINUM'
        WHEN annual_spend_ >= 5000 THEN 'GOLD'
        WHEN annual_spend_ >= 2000 THEN 'SILVER'
        ELSE 'BRONZE'
    END;

    -- Upgrade tier if customer is loyal (5+ years)
    IF years_active_ >= 5 THEN
        tier_ := DECODE(base_tier_,
            'GOLD', 'PLATINUM',
            'SILVER', 'GOLD',
            'BRONZE', 'SILVER',
            base_tier_);
    ELSE
        tier_ := base_tier_;
    END IF;

    RETURN tier_;
END GetCustomerTier;
```

##### Result

| GETCUSTOMERTIER(3000, 6) |
| --- |
| GOLD |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION GetCustomerTier (annual_spend_ NUMBER(38, 18), years_active_ NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "zsqZAVE5n32hZZFtsi0zsg==" }}'
AS
$$
   DECLARE
      tier_ VARCHAR(20);
      base_tier_ VARCHAR(20);
   BEGIN
      -- Determine base tier by spending
      base_tier_ := CASE
                WHEN :annual_spend_ >= 10000 THEN 'PLATINUM'
                WHEN :annual_spend_ >= 5000 THEN 'GOLD'
                WHEN :annual_spend_ >= 2000 THEN 'SILVER'
                ELSE 'BRONZE'
            END;
      -- Upgrade tier if customer is loyal (5+ years)
      IF (:years_active_ >= 5) THEN
                tier_ := DECODE(:base_tier_,
                           'GOLD', 'PLATINUM',
                           'SILVER', 'GOLD',
                           'BRONZE', 'SILVER', :base_tier_);
      ELSE
                tier_ := :base_tier_;
      END IF;
      RETURN :tier_;
   END;
$$;
```

##### Result

| GETCUSTOMERTIER(3000, 6) |
| --- |
| GOLD |

#### Select Into variable assignment

Functions using simple select into for variable assignment.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(
    p_BasePrice NUMBER,
    p_Quantity NUMBER
)
RETURN NUMBER
IS
    v_Discount NUMBER;
    v_Subtotal NUMBER;
    v_FinalPrice NUMBER;
BEGIN

    SELECT CASE
               WHEN p_Quantity >= 10 THEN 0.15
               WHEN p_Quantity >= 5 THEN 0.10
               ELSE 0.05
           END,
           p_BasePrice * p_Quantity
    INTO v_Discount, v_Subtotal
    FROM DUAL;

    v_FinalPrice := v_Subtotal * (1 - v_Discount);

    RETURN v_FinalPrice;
END;
```

##### Result

| CALCULATEPRICE(100, 3) |
| --- |
| 285 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION CalculatePrice
(p_BasePrice NUMBER(38, 18), p_Quantity NUMBER(38, 18)
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "DsGaAXVMinypPa0FTZmrKQ==" }}'
AS
$$
    DECLARE
        v_Discount NUMBER(38, 18);
        v_Subtotal NUMBER(38, 18);
        v_FinalPrice NUMBER(38, 18);
    BEGIN
        v_Discount := CASE
                          WHEN :p_Quantity >= 10 THEN 0.15
                          WHEN :p_Quantity >= 5 THEN 0.10
                          ELSE 0.05
                      END;
        v_Subtotal := :p_BasePrice * :p_Quantity;
        v_FinalPrice := :v_Subtotal * (1 - :v_Discount);
        RETURN :v_FinalPrice;
    END;
$$;
```

##### Result

| CALCULATEPRICE(100, 3) |
| --- |
| 285 |

### Known Issues

> **Warning:**
>
> **SnowConvert AI will not translate UDFs containing the following elements into SnowScripting UDFs, as these features are unsupported in SnowScripting UDFs:**
>
> * Access database tables
> * Use cursors
> * Call other UDFs
> * Contain aggregate or window functions
> * Perform DML operations (INSERT/UPDATE/DELETE)
> * Return result sets

### Related EWIs

1. [SSC-EWI-0067](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): UDF was transformed to Snowflake procedure, calling procedures inside a query is not supported.
2. [SSC-EWI-0068](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): User defined function was transformed to a Snowflake procedure.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

---
title: SnowConvert AI - Oracle - Create Materialized Views
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-translation-reference/create-materialized-view.md
section: Migrations
---

# SnowConvert AI - Oracle - Create Materialized Views

Translation reference to convert Oracle Materialized View to Snowflake Dynamic Table

## Description

In SnowConvert AI, Oracle Materialized Views are transformed into Snowflake Dynamic Tables. To properly configure Dynamic Tables, two essential parameters must be defined: TARGET_LAG and WAREHOUSE. If these parameters are left unspecified in the configuration options, SnowConvert AI will default to preassigned values during the conversion, as demonstrated in the example below.

For more information on Materialized Views, click [here](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/CREATE-MATERIALIZED-VIEW.html).

For details on the necessary parameters for Dynamic Tables, click [here](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table).

## Sample Source Patterns

### Oracle

```sql
CREATE MATERIALIZED VIEW sales_total
AS
SELECT SUM(amount) AS total_sales
FROM sales;
```

### Snowflake

```sql
CREATE OR REPLACE DYNAMIC TABLE sales_total
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
SELECT SUM(amount) AS total_sales
FROM
sales;
```

### Refresh Modes

Snowflake dynamic tables support an equivalent to Oracle’s materialized view refresh modes. The corresponding modes are as follows:

* **Oracle**:

  + **FAST**: Refreshes only the rows that have changed.
  + **COMPLETE**: Refreshes the entire materialized view.
  + **FORCE**: Uses FAST if possible, otherwise uses COMPLETE.
* **Snowflake**:

  + **AUTO**: Automatically determines the best refresh method.
  + **FULL**: Refreshes the entire table, equivalent to Oracle’s COMPLETE mode.
  + **INCREMENTAL**: Refreshes only the changed rows.

#### Default Refresh Mode

When using SnowConvert AI, the dynamic table’s default refresh mode is **AUTO**.

#### Mode Mappings

* **Oracle FAST** and **FORCE** -> **Snowflake AUTO**
* **Oracle COMPLETE** -> **Snowflake FULL**

For more details, refer to the official documentation on [Oracle Refresh Modes](https://docs.oracle.com/en/database/oracle/oracle-database/12.2/dwhsg/refreshing-materialized-views.html) and [Snowflake Refresh Modes](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table#optional-parameters).

##### Oracle

```sql
CREATE MATERIALIZED VIEW CUSTOMER_SALES_SUMMARY
REFRESH COMPLETE
AS
SELECT
    CUSTOMER_ID,
    SUM(AMOUNT) AS TOTAL_AMOUNT
FROM
    SALES
GROUP BY
    CUSTOMER_ID;
```

##### Snowflake

```sql
CREATE OR REPLACE DYNAMIC TABLE CUSTOMER_SALES_SUMMARY
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
REFRESH_MODE=FULL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
SELECT
   CUSTOMER_ID,
   SUM(AMOUNT) AS TOTAL_AMOUNT
FROM
   SALES
GROUP BY
   CUSTOMER_ID;
```

## Known Issues

No known errors detected at this time.

## Related EWIs

1. [SSC-FDM-0031](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

---
title: SnowConvert AI - Oracle - CREATE PROCEDURE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/create-procedure.md
section: Migrations
---

# SnowConvert AI - Oracle - CREATE PROCEDURE

Oracle Create Procedure to Snowflake Snow Scripting

## Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> A procedure is a group of PL/SQL statements that you can call by name. A call specification (sometimes called call spec) declares a Java method or a third-generation language (3GL) routine so that it can be called from SQL and PL/SQL. The call spec tells Oracle Database which Java method to invoke when a call is made. It also tells the database what type conversions to make for the arguments and return value. [Oracle SQL Language Reference Create Procedure](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/CREATE-PROCEDURE.html#GUID-771879D8-BBFD-4D87-8A6C-290102142DA3).

For more information regarding Oracle Create Procedure, check [here](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/CREATE-PROCEDURE-statement.html#GUID-5F84DB47-B5BE-4292-848F-756BF365EC54).

### Oracle Create Procedure Syntax

```sql
CREATE [ OR REPLACE ] [ EDITIONABLE | NONEDITIONABLE ]
PROCEDURE
[ schema. ] procedure_name
[ ( parameter_declaration [, parameter_declaration ]... ) ] [ sharing_clause ]
[ ( default_collation_option | invoker_rights_clause | accessible_by_clause)... ]
{ IS | AS } { [ declare_section ]
    BEGIN statement ...
    [ EXCEPTION exception_handler [ exception_handler ]... ]
    END [ name ] ;
      |
    { java_declaration | c_declaration } } ;
```

For more information regarding Snowflake Create Procedure, check [here](https://docs.snowflake.com/en/sql-reference/sql/create-procedure.html#create-procedure).

#### Snowflake Create Procedure Syntax

```sql
CREATE [ OR REPLACE ] PROCEDURE <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS <result_data_type> [ NOT NULL ]
  LANGUAGE SQL
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ]
  [ COMMENT = '<string_literal>' ]
  [ EXECUTE AS { CALLER | OWNER } ]
  AS '<procedure_definition>'
```

## Sample Source Patterns

### 1. Basic Procedure

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
BEGIN
null;
END;
```

##### Snow Scripting

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
BEGIN
null;
END;
$$;
```

### 2. Procedure with Different Parameters

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc2
(
    p1 OUT INTEGER,
    p2 OUT INTEGER,
    p3 INTEGER := 1,
    p4 INTEGER DEFAULT 1
)
AS
BEGIN
	p1 := 17;
	p2 := 93;
END;
```

##### Snow Scripting

```sql
CREATE OR REPLACE PROCEDURE proc2
(p1 OUT INTEGER, p2 OUT INTEGER,
    p3 INTEGER DEFAULT 1,
    p4 INTEGER DEFAULT 1
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		p1 := 17;
		p2 := 93;
	END;
$$;
```

#### Output parameters

Snowflake does not allow output parameters in procedures, a way to simulate this behavior could be to declare a variable and return its value at the end of the procedure.

#### Parameters with default values

Snowflake does not allow setting default values for parameters in procedures, a way to simulate this behavior could be to declare a variable with the default value or overload the procedure.

### 3. Procedure with Additional Settings

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc3
DEFAULT COLLATION USING_NLS_COMP
AUTHID CURRENT_USER
AS
BEGIN
NULL;
END;
```

##### Snow Scripting

```sql
CREATE OR REPLACE PROCEDURE proc3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/14/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
BEGIN
NULL;
END;
$$;
```

### 4. Procedure with Basic Statements

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc4
(
  param1 NUMBER
)
IS
  localVar1 NUMBER;
  countRows NUMBER;
  tempSql VARCHAR(100);
  tempResult NUMBER;
  CURSOR MyCursor IS SELECT COL1 FROM Table1;

BEGIN
    localVar1 := param1;
    countRows := 0;
    tempSql := 'SELECT COUNT(*) FROM Table1 WHERE COL1 =' || localVar1;

    FOR myCursorItem IN MyCursor
        LOOP
            localVar1 := myCursorItem.Col1;
            countRows := countRows + 1;
        END LOOP;
    INSERT INTO Table2 VALUES(countRows, 'ForCursor: Total Row count is: ' || countRows);
    countRows := 0;

    OPEN MyCursor;
    LOOP
        FETCH MyCursor INTO tempResult;
        EXIT WHEN MyCursor%NOTFOUND;
        countRows := countRows + 1;
    END LOOP;
    CLOSE MyCursor;
    INSERT INTO Table2 VALUES(countRows, 'LOOP: Total Row count is: ' || countRows);

    EXECUTE IMMEDIATE tempSql INTO tempResult;
    IF tempResult > 0 THEN
        INSERT INTO Table2 (COL1, COL2) VALUES(tempResult, 'Hi, found value:' || localVar1 || ' in Table1 -- There are ' || tempResult || ' rows');
        COMMIT;
    END IF;
END proc3;
```

##### Snow Scripting

```sql
CREATE OR REPLACE PROCEDURE proc4
(param1 NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    localVar1 NUMBER(38, 18);
    countRows NUMBER(38, 18);
    tempSql VARCHAR(100);
    tempResult NUMBER(38, 18);
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    MyCursor CURSOR
    FOR
      SELECT COL1 FROM
        Table1;
  BEGIN
    localVar1 := :param1;
    countRows := 0;
    tempSql := 'SELECT COUNT(*) FROM
   Table1
WHERE COL1 =' || NVL(:localVar1 :: STRING, '');
    OPEN MyCursor;
    --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
    FOR myCursorItem IN MyCursor DO
      localVar1 := myCursorItem.Col1;
      countRows := :countRows + 1;
    END FOR;
    CLOSE MyCursor;
    INSERT INTO Table2
    VALUES(:countRows, 'ForCursor: Total Row count is: ' || NVL(:countRows :: STRING, ''));
    countRows := 0;
    OPEN MyCursor;
    --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
      --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        FETCH MyCursor INTO
        :tempResult;
      IF (tempResult IS NULL) THEN
        EXIT;
      END IF;
      countRows := :countRows + 1;
    END LOOP;
    CLOSE MyCursor;
    INSERT INTO Table2
    SELECT
      :countRows,
      'LOOP: Total Row count is: ' || NVL(:countRows :: STRING, '');
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!

    EXECUTE IMMEDIATE :tempSql
                               !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'EXECUTE IMMEDIATE RETURNING CLAUSE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                               INTO tempResult;
    IF (:tempResult > 0) THEN
      INSERT INTO Table2(COL1, COL2)
      SELECT
        :tempResult,
        'Hi, found value:' || NVL(:localVar1 :: STRING, '') || ' in Table1 -- There are ' || NVL(:tempResult :: STRING, '') || ' rows';
      --** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
      COMMIT;
    END IF;
  END;
$$;
```

### 5. Procedure with empty `RETURN` statements

In Oracle procedures you can have empty `RETURN` statements to finish the execution of a procedure. In Snowflake Scripting procedures can have `RETURN` statements but they must have a value. By default all empty `RETURN` statements are converted with a `NULL` value.

#### Oracle

```sql
-- Procedure with empty return
CREATE OR REPLACE PROCEDURE MY_PROC
IS
BEGIN
   NULL;
   RETURN;
END;
```

##### Snowflake Scripting

```sql
-- Procedure with empty return
CREATE OR REPLACE PROCEDURE MY_PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      NULL;
      RETURN NULL;
   END;
$$;
```

#### `RETURN` statements in procedures with output parameters

In procedures with output parameters, instead of a `NULL` value an `OBJECT_CONSTRUCT` will be used in the empty `RETURN` statements to simulate the output parameters in Snowflake Scripting.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_OUTPUT_PARAMETERS (
    param1 OUT NUMBER,
    param2 OUT NUMBER,
    param3 NUMBER
)
IS
BEGIN
    IF param3 > 0 THEN
        param1 := 2;
        param2 := 1000;
        RETURN;
    END IF;
    param1 := 5;
    param2 := 3000;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_OUTPUT_PARAMETERS (param1 OUT NUMBER(38, 18), param2 OUT NUMBER(38, 18), param3 NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (:param3 > 0) THEN
            param1 := 2;
            param2 := 1000;
            RETURN NULL;
        END IF;
        param1 := 5;
        param2 := 3000;
    END;
$$;
```

### 6. Procedure with DEFAULT parameters

DEFAULT parameters allow named parameters to be initialized with default values if no value is passed.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE TEST(
    X IN VARCHAR DEFAULT 'P',
    Y IN VARCHAR DEFAULT 'Q'
)
AS
    varX VARCHAR(32767) := NVL(X, 'P');
    varY NUMBER := NVL(Y, 1);
BEGIN
    NULL;
END TEST;

BEGIN
    TEST(Y => 'Y');
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE TEST (
    X VARCHAR DEFAULT 'P',
    Y VARCHAR DEFAULT 'Q'
)
    RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
    EXECUTE AS CALLER
    AS
    $$
        DECLARE
            varX VARCHAR(32767) := NVL(:X, 'P');
            varY NUMBER(38, 18) := NVL(:Y, 1);
        BEGIN
            NULL;
        END;
    $$;

    DECLARE
        call_results VARIANT;

        BEGIN
        CALL
        TEST(Y => 'Y');
        RETURN call_results;
        END;
```

## Known Issues

### 1. Unsupported OUT parameters

Snowflake procedures do not have a native option for output parameters.

#### 2. Unsupported Oracle additional settings

The following Oracle settings and clauses are not supported by Snowflake procedures:

* `sharing_clause`
* `default_collation_option`
* `invoker_rights_clause`
* `accessible_by_clause`
* `java_declaration`
* `c_declaration`

## Related EWIs

1. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting
2. [SSC-EWI-OR0097](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Procedures properties are not supported in Snowflake procedures.
3. [SSC-FDM-OR0012:](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md) COMMIT and ROLLBACK statements require adequate setup to perform as intended.
4. [SSC-PRF-0003](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.
5. [SSC-PRF-0004](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop.
6. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL

---
title: SnowConvert AI - Oracle - Create Table
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-translation-reference/create-table.md
section: Migrations
---

# SnowConvert AI - Oracle - Create Table

In this section you could find information about TABLES, their syntax and current conversions.

## Description

In Oracle, the CREATE TABLE statement is used to create one of the following types of tables: a relational table which is the basic structure to hold user data, or an object table which is a table that uses an object type for a column definition. ([Oracle documentation](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/CREATE-TABLE.html#GUID-F9CE0CC3-13AE-4744-A43C-EAC7A71AAAB6))

**Oracle syntax**

```sql
CREATE [ { GLOBAL | PRIVATE } TEMPORARY | SHARDED | DUPLICATED | [ IMMUTABLE ] BLOCKCHAIN
  | IMMUTABLE  ]
   TABLE
  [ schema. ] table
  [ SHARING = { METADATA | DATA | EXTENDED DATA | NONE } ]
  { relational_table | object_table | XMLType_table }
  [ MEMOPTIMIZE FOR READ ]
  [ MEMOPTIMIZE FOR WRITE ]
  [ PARENT [ schema. ] table ] ;
```

**Snowflake Syntax**

```sql
CREATE [ OR REPLACE ]
    [ { [ { LOCAL | GLOBAL } ] TEMP | TEMPORARY | VOLATILE | TRANSIENT } ]
  TABLE [ IF NOT EXISTS ] <table_name> (
    -- Column definition
    <col_name> <col_type>
      [ inlineConstraint ]
      [ NOT NULL ]
      [ COLLATE '<collation_specification>' ]
      [
        {
          DEFAULT <expr>
          | { AUTOINCREMENT | IDENTITY }
            [
              {
                ( <start_num> , <step_num> )
                | START <num> INCREMENT <num>
              }
            ]
            [ { ORDER | NOORDER } ]
        }
      ]
      [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col_name> , <cond_col1> , ... ) ] ]
      [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
      [ COMMENT '<string_literal>' ]

    -- Additional column definitions
    [ , <col_name> <col_type> [ ... ] ]

    -- Out-of-line constraints
    [ , outoflineConstraint [ ... ] ]
  )
  [ CLUSTER BY ( <expr> [ , <expr> , ... ] ) ]
  [ ENABLE_SCHEMA_EVOLUTION = { TRUE | FALSE } ]
  [ STAGE_FILE_FORMAT = (
     { FORMAT_NAME = '<file_format_name>'
       | TYPE = { CSV | JSON | AVRO | ORC | PARQUET | XML } [ formatTypeOptions ]
     } ) ]
  [ STAGE_COPY_OPTIONS = ( copyOptions ) ]
  [ DATA_RETENTION_TIME_IN_DAYS = <integer> ]
  [ MAX_DATA_EXTENSION_TIME_IN_DAYS = <integer> ]
  [ CHANGE_TRACKING = { TRUE | FALSE } ]
  [ DEFAULT_DDL_COLLATION = '<collation_specification>' ]
  [ COPY GRANTS ]
  [ COMMENT = '<string_literal>' ]
  [ [ WITH ] ROW ACCESS POLICY <policy_name> ON ( <col_name> [ , <col_name> ... ] ) ]
  [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

> **Note:**
>
> For more Snowflake information review the following [documentation](https://docs.snowflake.com/en/sql-reference/sql/create-table).

## Sample Source Patterns

### 2.1. Physical and Table Properties

#### Oracle

```sql
CREATE TABLE "MySchema"."BaseTable"
(
    BaseId NUMBER DEFAULT 10 NOT NULL ENABLE
) SEGMENT CREATION IMMEDIATE
  PCTFREE 0 PCTUSED 40 INITRANS 1 MAXTRANS 255
  COLUMN STORE COMPRESS FOR QUERY HIGH NO ROW LEVEL LOCKING LOGGING
  STORAGE(INITIAL 65536 NEXT 1048576 MINEXTENTS 1 MAXEXTENTS 2147483645
  PCTINCREASE 0 FREELISTS 1 FREELIST GROUPS 1
  BUFFER_POOL DEFAULT FLASH_CACHE DEFAULT CELL_FLASH_CACHE DEFAULT)
  TABLESPACE "MyTableSpace"
  PARTITION BY LIST ("BaseId")
 (
    PARTITION "P20211231"  VALUES (20211231) SEGMENT CREATION DEFERRED
    PCTFREE 10 PCTUSED 40 INITRANS 1 MAXTRANS 255
    ROW STORE COMPRESS ADVANCED LOGGING
    STORAGE(
    BUFFER_POOL DEFAULT FLASH_CACHE DEFAULT CELL_FLASH_CACHE DEFAULT)
    TABLESPACE "MyTableSpace"
  )
  PARALLEL;
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "MySchema"."BaseTable"
 (
     BaseId NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ DEFAULT 10 NOT NULL
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;
```

> **Note:**
>
> Table properties are removed because they are not required after the migration in Snowflake.

### 2.2. Constraints and Constraint States

The following constraints will be commented out:

* `CHECK` Constraint

> **Note:**
>
> The `USING INDEX` constraint will be entirely removed from the output code during the conversion.

#### Oracle

```sql
CREATE TABLE "MySchema"."BaseTable"
(
    BaseId NUMBER DEFAULT 10 NOT NULL ENABLE NOVALIDATE,
    "COL1" NUMBER CHECK( "COL1" IS NOT NULL ),
	  CHECK( "COL1" IS NOT NULL ),
    CONSTRAINT "Constraint1BaseTable" PRIMARY KEY (BaseId)
        USING INDEX PCTFREE 10 INITRANS 2 MAXTRANS 255 COMPUTE STATISTICS
        STORAGE(INITIAL 65536 NEXT 1048576 MINEXTENTS 1 MAXEXTENTS 2147483645
        PCTINCREASE 0 FREELISTS 1 FREELIST GROUPS 1) ENABLE
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "MySchema"."BaseTable"
	(
	    BaseId NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ DEFAULT 10 NOT NULL,
	    "COL1" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ NOT NULL
 	                                                                                                                     !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
 	                                                                                                                     CHECK( "COL1" IS NOT NULL ),
	!!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
		  CHECK( "COL1" IS NOT NULL ),
	    CONSTRAINT "Constraint1BaseTable" PRIMARY KEY (BaseId)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
	;
```

On the other hand, but in the same way, in case you have any constraint state after a NOT NULL constraint as follows:

* `RELY`
* `NO RELY`
* `RELY ENABLE`
* `RELY DISABLE`
* `VALIDATE`
* `NOVALIDATE`

These will also be commented out.

> **Note:**
>
> The ENABLE constraint state will be completely removed from the output code during the conversion process. In the case of the DISABLE state, it will also be removed concurrently with the NOT NULL constraint.

#### Oracle

```sql
CREATE TABLE Table1(
  col1 INT NOT NULL ENABLE,
  col2 INT NOT NULL DISABLE,
  col3 INT NOT NULL RELY
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE Table1 (
    col1 INT NOT NULL,
    col2 INT ,
    col3 INT NOT NULL /*** SSC-FDM-OR0006 - CONSTRAINT STATE RELY REMOVED FROM NOT NULL INLINE CONSTRAINT ***/
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
  ;
```

### 2.3. Foreign Key

If there is a table with a NUMBER column with no precision nor scale, and another table with a NUMBER(\*,0) column that references to the previously mentioned NUMBER column, we will comment out this foreign key.

#### Oracle

```sql
CREATE TABLE "MySchema"."MyTable"
(
    "COL1" NUMBER,
    CONSTRAINT "PK" PRIMARY KEY ("COL1")
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "MySchema"."MyTable"
    (
        "COL1" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
        CONSTRAINT "PK" PRIMARY KEY ("COL1")
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

### 2.4. Virtual Column

#### Oracle

```sql
CREATE TABLE "MySchema"."MyTable"
(
    "COL1" NUMBER GENERATED ALWAYS AS (COL1 * COL2) VIRTUAL
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "MySchema"."MyTable"
    (
        "COL1" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ AS (COL1 * COL2)
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

### 2.5. Identity Column

For identity columns, a sequence is created and assigned to the column.

#### Oracle

```sql
CREATE TABLE "MySchema"."BaseTable"
(
	"COL0" NUMBER GENERATED BY DEFAULT ON NULL
		AS IDENTITY MINVALUE 1 MAXVALUE 9999999999999999999999999999
		INCREMENT BY 1
		START WITH 621
		CACHE 20
		NOORDER  NOCYCLE  NOT NULL ENABLE
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "MySchema"."BaseTable"
	(
		"COL0" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ IDENTITY(621, 1) ORDER NOT NULL
	)
	COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
	;
```

### 2.6. CLOB and BLOB column declaration

Columns declared as CLOB or BLOB will be changed to VARCHAR.

#### Oracle

```sql
CREATE TABLE T
(
 Col1 BLOB DEFAULT EMPTY_BLOB(),
Col5 CLOB DEFAULT EMPTY_CLOB()
);
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE T
 (
  Col1 BINARY,
 Col5 VARCHAR
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;
```

### 2.7. Constraint Name

> **Warning:**
>
> The constraint name is removed from the code because it is not applicable in Snowflake.

#### Oracle

```sql
CREATE TABLE "CustomSchema"."BaseTable"(
 "PROPERTY" VARCHAR2(64) CONSTRAINT "MICROSOFT_NN_PROPERTY" NOT NULL ENABLE
  );
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE "CustomSchema"."BaseTable" (
  "PROPERTY" VARCHAR(64) NOT NULL /*** SSC-FDM-0012 - CONSTRAINT NAME '"MICROSOFT_NN_PROPERTY"' IN NULL OR NOT NULL CONSTRAINT IS NOT SUPPORTED IN SNOWFLAKE ***/
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
   ;
```

### 2.8. Default columns with times

The columns declared as Date types will be cast to match with the specific date type.

#### Oracle

```sql
CREATE TABLE TABLE1
(
"COL1" VARCHAR(50) DEFAULT CURRENT_TIMESTAMP
);

CREATE TABLE TABLE1
(
 COL0 TIMESTAMP(6) DEFAULT CURRENT_TIMESTAMP,
 COL1 TIMESTAMP(6) DEFAULT CURRENT_TIME,
 COL2 TIMESTAMP(6) WITH LOCAL TIME ZONE DEFAULT '1900-01-01 12:00:00',
 COL3 TIMESTAMP(6) WITH TIME ZONE DEFAULT '1900-01-01 12:00:00',
 COL4 TIMESTAMP(6) WITHOUT TIME ZONE DEFAULT '1900-01-01 12:00:00',
 COL5 TIMESTAMP(6) DEFAULT TO_TIMESTAMP('01/01/1900 12:00:00.000000 AM', 'MM/DD/YYYY HH:MI:SS.FF6 AM')
 );
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE TABLE1
 (
 "COL1" VARCHAR(50) DEFAULT TO_VARCHAR(CURRENT_TIMESTAMP(), 'YYYY-MM-DD HH:MI:SS')
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;

 --** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR TABLE1. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
 CREATE OR REPLACE TABLE TABLE1
 (
  COL0 TIMESTAMP(6) DEFAULT CURRENT_TIMESTAMP() :: TIMESTAMP(6),
  COL1 TIMESTAMP(6) DEFAULT CURRENT_TIME() :: TIMESTAMP(6),
  COL2 TIMESTAMP_LTZ(6) DEFAULT '1900-01-01 12:00:00' :: TIMESTAMP_LTZ(6),
  COL3 TIMESTAMP_TZ(6) DEFAULT '1900-01-01 12:00:00' :: TIMESTAMP_TZ(6),
  COL4 TIMESTAMP(6) WITHOUT TIME ZONE DEFAULT '1900-01-01 12:00:00' :: TIMESTAMP(6) WITHOUT TIME ZONE,
  COL5 TIMESTAMP(6) DEFAULT TO_TIMESTAMP('01/01/1900 12:00:00.000000 AM', 'MM/DD/YYYY HH:MI:SS.FF6 AM') :: TIMESTAMP(6)
  )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;
```

### 2.9 Sharing and Memoptimize options

Some options in Oracle are not required in Snowflake. That is the case for the `sharing` and `memoptimize` options, they will be removed in the output code.

#### Oracle

```sql
CREATE TABLE table1
    SHARING = METADATA (
     id NUMBER,
     name VARCHAR2(50),
     date DATE,
     CONSTRAINT pk_table PRIMARY KEY (id)
 ) MEMOPTIMIZE FOR READ;
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (
     id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
     name VARCHAR(50),
     date TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
     CONSTRAINT pk_table PRIMARY KEY (id)
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
 ;
```

### 2.10 AS SubQuery

The following properties and clauses are unsupported when creating a table through `AS SubQuery` in Snowflake.

```sql
[ immutable_table_clauses ]
[ blockchain_table_clauses ]
[ DEFAULT COLLATION collation_name ]
[ ON COMMIT { DROP | PRESERVE } DEFINITION ]
[ ON COMMIT { DELETE | PRESERVE } ROWS ]
[ physical_properties ]
```

#### Oracle

```sql
create table table1
-- NO DROP NO DELETE HASHING USING sha2_512 VERSION v1 -- blockchain_clause not yet supported
DEFAULT COLLATION somename
ON COMMIT DROP DEFINITION
ON COMMIT DELETE ROWS
COMPRESS
NOLOGGING
AS
   select
      *
   from
      table1;
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE table1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
-- NO DROP NO DELETE HASHING USING sha2_512 VERSION v1 -- blockchain_clause not yet supported
AS
   select
      *
   from
      table1;
```

## Known Issues

1. Some properties on the tables may be adapted to or commented on because the behavior in Snowflake is different.

## Related EWIs

1. [SSC-EWI-0035](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.
2. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
3. [SSC-FDM-0019](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Semantic information could not be loaded.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.
5. [SSC-FDM-OR0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Constraint state removed from not null inline constraint.

---
title: SnowConvert AI - Oracle - Create Type
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-translation-reference/create_type.md
section: Migrations
---

# SnowConvert AI - Oracle - Create Type

This is a translation reference to convert Oracle Create Type Statements (UDTs) to snowflake

## General Description

SnowConvert translates many Oracle `CREATE TYPE` statements to **[Snowflake native user-defined types](https://docs.snowflake.com/en/sql-reference/sql/create-type)** where the shape is supported—for example object types with attributes, `VARRAY` mapped to Snowflake `ARRAY`, and nested table types mapped to `ARRAY` of the element type. Unsupported options (subtype inheritance, member bodies, incomplete types, and others) are flagged with Oracle-specific EWIs; see Related EWIs and the [issues reference](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md).

One of the most important features the Oracle database engine offers is an Object-Oriented approach. PL/SQL offers capabilities beyond other relational databases in the form of OOP by using Java-like statements in the form of packages, functions, tables and types. This document will cover the last one and how SnowConvert AI solves it, remaining compliant to functionality.

Oracle supports the following specifications:

* Abstract Data Type (*ADT*) (*including an SQLJ object type*).
* Standalone varying array (*varray*) type.
* Standalone nested table type.
* Incomplete object type.

All this according to the information found in [Oracle Create Type Statement Documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/CREATE-TYPE-statement.html#GUID-389D603D-FBD0-452A-8414-240BBBC57034)

```sql
CREATE [ OR REPLACE ] [ EDITIONABLE | NONEDITIONAL ] TYPE <type name>
[ <type source creation options> ]
[<type definition>]
[ <type properties> ]
```

## Limitations

Snowflake supports **native user-defined types** (`CREATE TYPE … AS OBJECT`, `ARRAY`, etc.) as documented in the [SQL data types overview](https://docs.snowflake.com/en/sql-reference/data-types.html). SnowConvert maps many Oracle type definitions to those native types. Patterns that still have **no** or **partial** mapping—such as subtype inheritance (`UNDER`), member methods and type bodies, table types, and incomplete forward declarations—may require manual redesign or are reported via EWIs (for example [SSC-EWI-OR0139](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) through [SSC-EWI-OR0142](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md)). [Semi-structured Data Types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) remain relevant for legacy scenarios that still use `VARIANT` in converted code.

Following are the User Defined Types features for which **NO** workaround is proposed:

### Subtypes: Type Hierarchy

These statements aren’t supported in Snowflake. SnowConvert AI only recognizes them, but no translation is offered.

```sql
CREATE TYPE person_t AS OBJECT (name VARCHAR2(100), ssn NUMBER)
   NOT FINAL;
/

CREATE TYPE employee_t UNDER person_t
   (department_id NUMBER, salary NUMBER)
   NOT FINAL;
/

CREATE TYPE part_time_emp_t UNDER employee_t (num_hrs NUMBER);
/
```

### Type properties

These refer to the options that are normally used when using OOP in PL/SQL: Persistable, Instantiable and Final.

```sql
CREATE OR REPLACE TYPE type1 AS OBJECT () NOT FINAL NOT INSTANTIABLE NOT PERSISTABLE;
CREATE OR REPLACE TYPE type2 AS OBJECT () FINAL INSTANTIABLE PERSISTABLE;
```

### Nested Table Type

These statements aren’t supported in Snowflake. SnowConvert AI only recognizes them, but no translation is offered.

```sql
CREATE TYPE textdoc_typ AS OBJECT
    ( document_typ      VARCHAR2(32)
    , formatted_doc     BLOB
    ) ;
/

CREATE TYPE textdoc_tab AS TABLE OF textdoc_typ;
/
```

### Type Source Creation Options

These options stand for custom options regarding access and querying the type.

```sql
CREATE TYPE type1 FORCE OID 'abc' SHARING = METADATA DEFAULT COLLATION schema1.collation ACCESSIBLE BY (schema1.unitaccesor) AS OBJECT ();
CREATE TYPE type2 FORCE OID 'abc' SHARING = NONE DEFAULT COLLATION collation ACCESSIBLE BY (PROCEDURE unitaccesor) AS OBJECT ();
CREATE TYPE type3 AUTHID CURRENT_USER AS OBJECT ();
CREATE TYPE type4 AUTHID DEFINER AS OBJECT ();
```

## Proposed workarounds

### About types definition

For the definition, the proposed workaround is to create semi-structure data type to mimic Oracle’s data type.

### About types member function

For the member functions containing logic and DML, the proposed workaround relies on helpers to translate this into stored procedures.

## Current SnowConvert AI Support

The next table shows a summary of the current support provided by the SnowConvert AI tool. Please keep in mind that translations may still not be final, and more work may be needed.

| Type Statement Element | Current recognition status | Current translation status | Has Known Workarounds |
| --- | --- | --- | --- |
| Object Type Definitions | Recognized. | Translated to Snowflake `CREATE TYPE … AS OBJECT` where supported. | Yes. |
| Subtype Definitions | Recognized. | Not Translated. | No. |
| Array Type Definitions | Recognized. | Translated to Snowflake `CREATE TYPE … AS ARRAY` (see [SSC-FDM-OR0051](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md)). | Yes. |
| Nested Table Definitions | Recognized. | Translated to Snowflake `ARRAY` of element type where supported. | Limited. |
| Member Function Definitions | Recognized. | Not Translated. | Yes. |

## Known Issues

### 1. DML usages for Object Types are not being transformed

As of now, only DDL definitions that use User-Defined Types are being transformed into Variant. This means that any Inserts, Updates or Deletes using User-defined Types are not being transformed and need to be manually transformed. There is no EWI for this but there is a work item to add this corresponding EWI.

#### 2. Create Type creation options are not supported

Currently, there is no known workaround for any of the creation options, for these reasons they are not taken into account when defining the type.

## Related EWIs

Deprecation and replacement messaging for legacy “not supported” issues: [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md), [SSC-EWI-0095](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md), [SSC-EWI-OR0007](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md).

Unsupported or incomplete `CREATE TYPE` shapes: [SSC-EWI-OR0139](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md), [SSC-EWI-OR0140](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md), [SSC-EWI-OR0141](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md), [SSC-EWI-OR0142](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md).

Functional differences: [SSC-FDM-OR0051](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md)–[SSC-FDM-OR0054](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md).

## Array Type Definition

This is a translation reference to convert the Array Variant of the Oracle Create Type Statements (UDTs) to Snowflake

> **Note:**
>
> Oracle `VARRAY` types are translated to Snowflake `CREATE TYPE … AS ARRAY ( element_type )`. Fixed varray capacity is **not** preserved; see [SSC-FDM-OR0051](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md). Column usages may still be migrated to `VARIANT` in older or mixed scenarios—verify generated DDL for your workload.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Array Types define an array structure of a previously existing datatype (including other Custom Types).

For many workloads, the type definition is emitted as a Snowflake **native `ARRAY` type**. Usages in tables and PL/SQL may still involve [Semi-structured Data Types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) or `OBJECT` depending on context.

```sql
CREATE TYPE <type name>
AS { VARRAY | [VARYING] ARRAY } ( <size limit> ) OF <data type>
```

### Sample Source Patterns

#### Inserts for the array usage

The next data will be inserted inside the table before querying the select. Please note these Inserts currently need to be manually migrated into Snowflake.

##### Oracle

```sql
INSERT INTO customer_table_demo(customer_table_id, customer_data) VALUES
(1, phone_list_typ_demo('2000-0000', '4000-0000', '0000-0000'));

INSERT INTO customer_table_demo(customer_table_id, customer_data) VALUES
(1, phone_list_typ_demo('8000-2000', '0000-0000', '5000-0000'));
```

##### Snowflake

```sql
INSERT INTO customer_table_demo(customer_table_id, customer_data)
SELECT 1, ARRAY_CONSTRUCT('2000-0000', '4000-0000', '0000-0000');

INSERT INTO customer_table_demo(customer_table_id, customer_data)
SELECT 1, ARRAY_CONSTRUCT('8000-2000', '0000-0000', '5000-0000');
```

#### Array Type usage

##### Oracle

```sql
CREATE TYPE phone_list_typ_demo AS VARRAY(3) OF VARCHAR2(25);
/

CREATE TABLE customer_table_demo (
    customer_table_id INTEGER,
    customer_data phone_list_typ_demo
);
/

SELECT * FROM customer_table_demo;
/
```

##### Results

| CUSTOMER_TABLE_ID | CUSTOMER_DATA |
| --- | --- |
| 1 | [[‘2000-0000’,’4000-0000’,’0000-0000’]] |
| 1 | [[‘8000-2000’,’0000-0000’,’5000-0000’]] |

##### Snowflake

```sql
--** SSC-FDM-OR0051 - ARRAY SIZE LIMIT '3' WAS REMOVED. SNOWFLAKE ARRAYS ARE DYNAMICALLY SIZED. **
CREATE TYPE phone_list_typ_demo AS ARRAY ( VARCHAR(25) );

CREATE OR REPLACE TABLE customer_table_demo (
        customer_table_id INTEGER,
        customer_data phone_list_typ_demo
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE VIEW PUBLIC.customer_table_demo_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
AS
SELECT
        customer_table_id,
        customer_data
FROM
        customer_table_demo;

    SELECT * FROM
        customer_table_demo_view;
```

##### Results

| CUSTOMER_TABLE_ID | CUSTOMER_DATA |
| --- | --- |
| 1 | [[‘2000-0000’, ‘4000-0000’, ‘0000-0000’]] |
| 1 | [[‘8000-2000’, ‘0000-0000’, ‘5000-0000’]] |

### Known Issues

#### 1. Create Type creation options are not supported

Currently, there is no known workaround for any of the creation options, for these reasons they are not taken into account when defining the type.

##### 2. Migrated code output is not functional

The statements are being changed unnecessarily, which makes them no longer be functional on the output code. This will be addressed when a proper transformation for them is in place.

### Related EWIs

1. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
2. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Member Function Definitions

This is a translation reference to convert the Member Functions of the Oracle Create Type Statements (UDTs) to Snowflake

> **Danger:**
>
> SnowConvert AI still does not recognize type member functions nor type body definitions. This page is only used as a future reference for translation.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Like other Class definitions, Oracle’s TYPE can implement methods to expose behaviors based on its attributes. MEMBER FUCTION will be transformed to Snowflake’s Stored Procedures, to maintain functional equivalence due to limitations.

Since functions are being transformed into procedures, the [transformation reference for PL/SQL](../pl-sql-to-snowflake-scripting/README.md) also applies here.

### Sample Source Patterns

#### Inserts for Simple square() member function

The next data will be inserted inside the table before querying the select. Please note these Inserts currently need to be manually migrated into Snowflake.

##### Oracle

```sql
INSERT INTO table_member_function_demo(column1) VALUES
(type_member_function_demo(5));
```

##### Snowflake

```sql
INSERT INTO table_member_function_demo (column1)
SELECT OBJECT_CONSTRUCT('a1', 5);
```

#### Simple square() member function

##### Oracle

```sql
-- TYPE DECLARATION
CREATE TYPE type_member_function_demo AS OBJECT (
    a1 NUMBER,
    MEMBER FUNCTION get_square RETURN NUMBER
);
/

-- TYPE BODY DECLARATION
CREATE TYPE BODY type_member_function_demo IS
   MEMBER FUNCTION get_square
   RETURN NUMBER
   IS x NUMBER;
   BEGIN
      SELECT c.column1.a1*c.column1.a1 INTO x
      FROM table_member_function_demo c;
      RETURN (x);
   END;
END;
/

-- TABLE
CREATE TABLE table_member_function_demo (column1 type_member_function_demo);
/

-- QUERYING DATA
SELECT
    t.column1.get_square()
FROM
    table_member_function_demo t;
/
```

##### Results

| T.COLUMN1.GET_SQUARE() |
| --- |
| 25 |

##### Snowflake

```sql
-- TYPE DECLARATION
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE type_member_function_demo AS OBJECT (
    a1 NUMBER,
    MEMBER FUNCTION get_square RETURN NUMBER
)
;

---- TYPE BODY DECLARATION
--!!!RESOLVE EWI!!! /*** SSC-EWI-OR0007 - CREATE TYPE WITHOUT BODY IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
--CREATE TYPE BODY type_member_function_demo IS
--   MEMBER FUNCTION get_square
--   RETURN NUMBER
--   IS x NUMBER;
--   BEGIN
--      SELECT c.column1.a1*c.column1.a1 INTO x
--      FROM table_member_function_demo c;
--      RETURN (x);
--   END;
--END
   ;

-- TABLE
CREATE OR REPLACE TABLE table_member_function_demo (column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'type_member_function_demo' USAGE CHANGED TO VARIANT ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE VIEW PUBLIC.table_member_function_demo_view
<strong>COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
</strong><strong>AS
</strong>SELECT
    column1:a1 :: NUMBER AS a1
FROM
    table_member_function_demo;

-- QUERYING DATA
SELECT
    t.column1.get_square() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 't.column1.get_square' NODE ***/!!!
FROM
    table_member_function_demo t;
```

##### Results

| GET_SQUARE() |
| --- |
| 25 |

### Known Issues

No Known issues.

### Related EWIs

1. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-EWI-OR0007](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Create Type Not Supported in Snowflake

## Nested Table Type Definition

This is a translation reference to convert the Nested Table Variant of the Oracle Create Type Statements (UDTs) to Snowflake

> **Note:**
>
> Standalone `CREATE TYPE … AS TABLE OF element_type` is translated to Snowflake `CREATE TYPE … AS ARRAY ( element_type )` when the element type is supported. Nested tables used as table columns may still require manual review depending on DML and PL/SQL usage.

### Description

Nested Table Types define an embedded table structure of a previously existing datatype (including other Custom Types). They are closely related to Array Type definitions; SnowConvert maps many patterns to Snowflake `ARRAY`.

```sql
CREATE TYPE <type name> AS TABLE OF <data type>
```

### Sample Source Patterns

#### Nested Table Type usage

##### Oracle

```sql
CREATE TYPE textdoc_typ AS OBJECT (
    document_typ VARCHAR2(32),
    formatted_doc BLOB
);
/

CREATE TYPE textdoc_tab AS TABLE OF textdoc_typ;
/
```

##### Snowflake

```sql
CREATE TYPE textdoc_typ AS OBJECT (
    document_typ VARCHAR(32),
    formatted_doc BINARY
)
;

CREATE TYPE textdoc_tab AS ARRAY ( textdoc_typ );
```

### Known Issues

#### 1. Create Type creation options are not supported

Currently, there is no known workaround for any of the creation options; for these reasons, they are not taken into account when defining the type.

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review
2. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.

## Object Type Definition

This is a translation reference to convert the Object Variant of the Oracle Create Type Statements (UDTs) to Snowflake

> **Note:**
>
> SnowConvert AI supports a translation for Object Type Definitions itself. However, their usages are still a work in progress.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Object Types define a structure of data similar to a record, with the added advantages of the member function definitions. Meaning that their data may be used along some behavior within the type.

For the translation of object types, the type definition is replaced by a [Semi-structured Data Type](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html) and then it is expanded on any usages across the code. For tables this means replacing the column for a Variant, adding a View so that selects (and also Views) to the original table can still function.

```sql
CREATE TYPE <type name> AS OBJECT
( [{<type column definition> | type method definition } , ...]);
```

### Sample Source Patterns

#### Inserts for Simple Type usage

The next data will be inserted inside the table before querying the select. Please note these Inserts currently need to be manually migrated into Snowflake.

##### Oracle

```sql
INSERT INTO customer_table_demo(customer_table_id, customer_data)
VALUES ( 1, customer_typ_demo(1, 'First Name 1', 'Last Name 1'));

INSERT INTO customer_table_demo(customer_table_id, customer_data)
VALUES ( 2, customer_typ_demo(2, 'First Name 2', 'Last Name 2'));
```

##### Snowflake

```sql
INSERT INTO customer_table_demo(customer_table_id, customer_data)
VALUES ( 1, customer_typ_demo(1, 'First Name 1', 'Last Name 1') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'customer_typ_demo' NODE ***/!!!);

INSERT INTO customer_table_demo(customer_table_id, customer_data)
VALUES ( 2, customer_typ_demo(2, 'First Name 2', 'Last Name 2') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'customer_typ_demo' NODE ***/!!!);
```

#### Simple Type usage

##### Oracle

```sql
CREATE TYPE customer_typ_demo AS OBJECT (
    customer_id INTEGER,
    cust_first_name VARCHAR2(20),
    cust_last_name VARCHAR2(20)
);

CREATE TABLE customer_table_demo (
    customer_table_id INTEGER,
    customer_data customer_typ_demo
);

SELECT * FROM customer_table_demo;
```

##### Results

| CUSTOMER_TABLE_ID | CUSTOMER_DATA |
| --- | --- |
| 1 | [1, First Name 1, Last Name 1] |
| 2 | [2, First Name 2, Last Name 2] |

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE customer_typ_demo AS OBJECT (
    customer_id INTEGER,
    cust_first_name VARCHAR2(20),
    cust_last_name VARCHAR2(20)
)
;

CREATE OR REPLACE TABLE customer_table_demo (
        customer_table_id INTEGER,
        customer_data VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'customer_typ_demo' USAGE CHANGED TO VARIANT ***/!!!
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
;

CREATE OR REPLACE VIEW PUBLIC.customer_table_demo_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
AS
SELECT
        customer_table_id,
        customer_data:customer_id :: INTEGER AS customer_id,
        customer_data:cust_first_name :: VARCHAR AS cust_first_name,
        customer_data:cust_last_name :: VARCHAR AS cust_last_name
FROM
        customer_table_demo;

    SELECT * FROM
        customer_table_demo_view;
```

##### Results

| CUSTOMER_TABLE_ID | CUST_ID | CUST_FIRST_NAME | CUST_LAST_NAME |
| --- | --- | --- | --- |
| 1 | 1 | First Name 1 | Last Name 1 |
| 2 | 2 | First Name 2 | Last Name 2 |

#### Inserts for Nested Type Usage

These statements need to be placed between the table creation and the select statement to test the output.

##### Oracle

```sql
INSERT INTO customer_table_demo(customer_id, customer_data) values
(1, customer_typ_demo('Customer 1', email_typ_demo('email@domain.com')));

INSERT INTO customer_table_demo(customer_id, customer_data) values
(2, customer_typ_demo('Customer 2', email_typ_demo('email2@domain.com')));
```

##### Snowflake

```sql
INSERT INTO customer_table_demo(customer_id, customer_data) values
(1, customer_typ_demo('Customer 1', email_typ_demo('email@domain.com') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'email_typ_demo' NODE ***/!!!) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'customer_typ_demo' NODE ***/!!!);

INSERT INTO customer_table_demo(customer_id, customer_data) values
(2, customer_typ_demo('Customer 2', email_typ_demo('email2@domain.com') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'email_typ_demo' NODE ***/!!!) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'customer_typ_demo' NODE ***/!!!);
```

#### Nested Type Usage

##### Oracle

```sql
CREATE TYPE email_typ_demo AS OBJECT (email VARCHAR2(20));

CREATE TYPE customer_typ_demo AS OBJECT (
    cust_name VARCHAR2(20),
    cust_email email_typ_demo
);

CREATE TABLE customer_table_demo (
    customer_id INTEGER,
    customer_data customer_typ_demo
);

SELECT * FROM customer_table_demo;
```

##### Results

| CUSTOMER_ID | CUSTOMER_DATA |
| --- | --- |
| 1 | [Customer 1, [email@domain.com]] |
| 2 | [Customer 2, [email2@domain.com]] |

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE email_typ_demo AS OBJECT (email VARCHAR2(20))
;

!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!

CREATE TYPE customer_typ_demo AS OBJECT (
    cust_name VARCHAR2(20),
    cust_email email_typ_demo
)
;

CREATE OR REPLACE TABLE customer_table_demo (
    customer_id INTEGER,
    customer_data VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'customer_typ_demo' USAGE CHANGED TO VARIANT ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
;

CREATE OR REPLACE VIEW PUBLIC.customer_table_demo_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
AS
SELECT
    customer_id,
    customer_data:cust_name :: VARCHAR AS cust_name,
    customer_data:cust_email:email :: VARCHAR AS email
FROM
    customer_table_demo;

SELECT * FROM
    customer_table_demo_view;
```

##### Results

| CUSTOMER_ID | CUST_NAME | CUST_EMAIL |
| --- | --- | --- |
| 1 | Customer 1 | email@domain.com |
| 2 | Customer 2 | email2@domain.com |

### Known Issues

#### 1. Migrated code output is not the same

The view statement is being changed unnecessarily, which makes the table no longer have the same behavior in the output code. There is a work item to fix this issue.

##### 2. DML for User-defined Types is not being transformed

DML that interacts with elements that have User-defined types within them (like a table) are not being transformed. There is a work item to implement this in the future.

##### 3. Create Type creation options are not supported

Currently, there is no known workaround for any of the creation options, for these reasons they are not taken into account when defining the type.

### Related EWIs

1. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Subtype Definition

This is a translation reference to convert the Subtype Variant of the Oracle Create Type Statements (UDTs) to Snowflake

> **Danger:**
>
> Since there are no known workarounds, SnowConvert AI only recognizes these definitions and does not support any translation for them.

### Description

Subtypes define a structure of data similar to a record, with the added advantages of the member function definitions. Meaning that their data may be used along some behavior within the type. Unlike Object Types, Subtypes are built as an extension to another existing type.

Regarding subtype definitions, there is still no translation, but there might be a way to reimplement them using Object Type Definitions and then using their respective translation.

```sql
CREATE TYPE <type name> UNDER <super type name>
( [{<type column definition> | type method definition } , ...]);
```

### Sample Source Patterns

#### Subtypes under an Object Type

##### Oracle

```sql
CREATE TYPE person_t AS OBJECT (name VARCHAR2(100), ssn INTEGER)
   NOT FINAL;
/

CREATE TYPE employee_t UNDER person_t
   (department_id INTEGER, salary INTEGER)
   NOT FINAL;
/

CREATE TYPE part_time_emp_t UNDER employee_t (num_hrs INTEGER);
/
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE person_t AS OBJECT (name VARCHAR2(100), ssn INTEGER)
   NOT FINAL;

--!!!RESOLVE EWI!!! /*** SSC-EWI-OR0007 - CREATE TYPE SUBTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!

--CREATE TYPE employee_t UNDER person_t
--   (department_id INTEGER, salary INTEGER)
--   NOT FINAL
            ;

--!!!RESOLVE EWI!!! /*** SSC-EWI-OR0007 - CREATE TYPE SUBTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!

--CREATE TYPE part_time_emp_t UNDER employee_t (num_hrs INTEGER)
                                                              ;
```

### Known Issues

#### 1. Create Type creation options are not supported

Currently, there is no known workaround for any of the creation options, for these reasons they are not taken into account when defining the type.

### Related EWIs

1. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
2. [SSC-EWI-OR0007](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Create Type Not Supported in Snowflake.

---
title: SnowConvert AI - Oracle - Create View
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-translation-reference/create-view.md
section: Migrations
---

# SnowConvert AI - Oracle - Create View

In this section, you could find information about Oracle Views and their Snowflake equivalent. The syntax of subquery used to create the view can be found in the SELECT section

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

## Create View

```sql
CREATE OR REPLACE VIEW View1 AS SELECT Column1 from Schema1.Table1;
```

```sql
CREATE OR REPLACE VIEW View1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
SELECT Column1 from
Schema1.Table1;
```

The following clauses for Create View are removed:

* No Force/ Force
* Edition Clause
* Sharing Clause
* Default collation
* Bequeath clause
* Container clause

```sql
CREATE OR REPLACE
NO FORCE
NONEDITIONABLE
VIEW Schema1.View1
SHARING = DATA
DEFAULT COLLATION Collation1
BEQUEATH CURRENT_USER
AS SELECT Column1 from Schema1.Table1
CONTAINER_MAP;
```

```sql
CREATE OR REPLACE VIEW Schema1.View1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
SELECT Column1 from
Schema1.Table1;
```

## Alter View

Alter is not supported by SnowConvert AI yet.

## Drop View

The CASCADE CONSTRAINT clause is not supported yet.

```sql
DROP VIEW Schema1.View1;

DROP VIEW Schema1.View1
CASCADE CONSTRAINTS;
```

```sql
DROP VIEW Schema1.View1;

DROP VIEW Schema1.View1
CASCADE CONSTRAINTS !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'DropBehavior' NODE ***/!!!;
```

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - Oracle - CURSOR
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/cursor.md
section: Migrations
---

# SnowConvert AI - Oracle - CURSOR

## Description

> **Danger:**
>
> This section covers the Translation Reference for Oracle [Explicit Cursor](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/static-sql.html#GUID-89E0242F-42AC-4B21-9DF1-ACD6F4FC03B9). For Oracle [Cursor Variables](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/static-sql.html#GUID-4A6E054A-4002-418D-A1CA-DE849CD7E6D5) there is no equivalent in Snowflake Scripting.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

Cursors are pointers that allow users to iterate through query results. For more information, see the [Oracle Cursors documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/static-sql.html#GUID-F1FE15F9-5C96-4C4E-B240-B7363D25A8F1).

### Oracle Cursor Syntax

**Cursor Definition**

```sql
CURSOR cursor
 [ ( cursor_parameter_dec [, cursor_parameter_dec ]... )]
   [ RETURN rowtype] IS select_statement ;
```

**Cursor Open**

```sql
OPEN cursor [ ( cursor_parameter [ [,] actual_cursor_parameter ]... ) ] ;
```

**Cursor Fetch**

```sql
FETCH { cursor | cursor_variable | :host_cursor_variable }
  { into_clause | bulk_collect_into_clause [ LIMIT numeric_expression ] } ;
```

**Cursor Close**

```sql
CLOSE { cursor | cursor_variable | :host_cursor_variable } ;
```

**Cursor Attributes**

```sql
named_cursor%{ ISOPEN | FOUND | NOTFOUND | ROWCOUNT }
```

**Cursor FOR Loop**

```sql
[ FOR record IN
  { cursor [ ( cursor_parameter_dec
               [ [,] cursor_parameter_dec ]... )]
  | ( select_statement )
  }
    LOOP statement... END LOOP [label] ;
```

Snowflake Scripting has support for cursors, however, they have fewer functionalities compared to Oracle. For more information, see the [Snowflake Scripting cursors documentation](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/cursors.html).

#### Snowflake Scripting Cursor Syntax

**Cursor Declaration**

```sql
<cursor_name> CURSOR FOR <query>
```

**Cursor Open**

```sql
OPEN <cursor_name> [ USING (bind_variable_1 [, bind_variable_2 ...] ) ] ;
```

**Cursor Fetch**

```sql
FETCH <cursor_name> INTO <variable> [, <variable> ... ] ;
```

**Cursor Close**

```sql
CLOSE <cursor_name> ;
```

**Cursor FOR Loop**

```sql
FOR <row_variable> IN <cursor_name> DO
    statement;
    [ statement; ... ]
END FOR [ <label> ] ;
```

## Sample Source Patterns

### 1. Basic cursor example

#### Oracle Cursor Example

```sql
CREATE OR REPLACE PROCEDURE basic_cursor_sample AS
    var1 VARCHAR(20);
    CURSOR cursor1 IS SELECT region_name FROM hr.regions ORDER BY region_name;
BEGIN
    OPEN cursor1;
    FETCH cursor1 INTO var1;
    CLOSE cursor1;
END;
```

##### Snowflake Scripting Cursor Example

```sql
CREATE OR REPLACE PROCEDURE basic_cursor_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 VARCHAR(20);
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            ORDER BY region_name;
    BEGIN
        OPEN cursor1;
        FETCH cursor1 INTO
            :var1;
    CLOSE cursor1;
    END;
$$;
```

### 2. Explicit Cursor For Loop

#### Oracle Explicit Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE explicit_cursor_for_sample AS
    CURSOR cursor1 IS SELECT region_name FROM hr.regions ORDER BY region_name;
BEGIN
    FOR r1 IN cursor1 LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting Explicit Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE explicit_cursor_for_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            ORDER BY region_name;
    BEGIN
                OPEN cursor1;
                --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
                FOR r1 IN cursor1 DO
            NULL;
                END FOR;
                CLOSE cursor1;
    END;
$$;
```

### 3. Implicit Cursor For Loop

#### Oracle Implicit Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE implicit_cursor_for_sample AS
BEGIN
    FOR r1 IN (SELECT region_name FROM hr.regions ORDER BY region_name) LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting Implicit Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE implicit_cursor_for_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        LET temporary_for_cursor_0 CURSOR
        FOR
            (SELECT region_name FROM
                    hr.regions
                ORDER BY region_name);
        --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
        FOR r1 IN temporary_for_cursor_0 DO
            NULL;
        END FOR;
    END;
$$;
```

### 4. Parameterized Cursor

You can use “?” In the filter condition of the cursor at the declaration section define the bind variable. While opening the cursor we can add the additional syntax “USING <bind_variable_1 >” to pass the bind variable.

Below are some examples of scenarios that can occur in the use of parameters in cursors:

#### 4.1 Basic Cursor Parameterized Example

##### Oracle Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample AS
    CURSOR cursor1 (low number, high IN number) IS
        SELECT region_name FROM hr.regions WHERE region_id BETWEEN low AND high;
BEGIN
    OPEN cursor1(3,5);
    CLOSE cursor1;
END;
```

##### Snowflake Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            WHERE region_id BETWEEN ? AND ?;
    BEGIN
                OPEN cursor1 USING (3, 5);
                CLOSE cursor1;
    END;
$$;
```

#### 4.2 Parameterized Cursors With Multiple Sending Parameters

##### Oracle Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample AS
    CURSOR cursor1 (low number DEFAULT 2, high IN number DEFAULT 7) IS
        SELECT region_name FROM hr.regions
        WHERE region_id BETWEEN low AND high OR low < 0;
BEGIN
    OPEN cursor1(3,5);
    OPEN cursor1(3);
    OPEN cursor1;
    OPEN cursor1(high => 15, low => 5);
    OPEN cursor1(high => 15);
    CLOSE cursor1;
END;
```

##### Snowflake Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            WHERE region_id BETWEEN ? AND ?
                OR ? < 0;
    BEGIN
                OPEN cursor1 USING (3, 5, 3);
                OPEN cursor1 USING (3, 7, 3);
                OPEN cursor1 USING (2, 7, 2);
                OPEN cursor1 USING (5, 15, 5);
                OPEN cursor1 USING (2, 15, 2);
                CLOSE cursor1;
    END;
$$;
```

#### 4.3 Parameterized Cursors With Use Of Procedure Parameters In Query

##### Oracle Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample (high_param number) AS
    CURSOR cursor1 (low number DEFAULT 2) IS
        SELECT region_name FROM hr.regions
        WHERE region_id BETWEEN low AND high_param;
BEGIN
    OPEN cursor1(3);
    CLOSE cursor1;
END;
CALL parameterized_cursor_for_sample(5);
```

##### Snowflake Parameterized Cursor Example

```sql
CREATE OR REPLACE PROCEDURE parameterized_cursor_for_sample (high_param NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            WHERE region_id BETWEEN ? AND ?;
    BEGIN
                OPEN cursor1 USING (3, high_param);
                CLOSE cursor1;
    END;
$$;

CALL parameterized_cursor_for_sample(5);
```

### 5. Using Cursors In Fetch And For Loop

Cursors can be controlled through the use of the FOR statement, allowing each and every record of a cursor to be processed while the FETCH statement puts, record by record, the values returned by the cursor into a set of variables, which may be PLSQL records

#### 5.1 Cursors For Loop

##### Oracle Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE p_cursors_for_loop AS
 datePlusOne TIMESTAMP;
 CURSOR c_product(low number, high number) IS
    SELECT name, price, create_on FROM products WHERE price BETWEEN low AND high;
BEGIN
    FOR record_product IN c_product(3,5)
    LOOP
      datePlusOne := record_product.create_on + 1;
      INSERT INTO sold_items values(record_product.name, record_product.price, datePlusOne);
    END LOOP;
END;
```

##### Snowflake Cursor For Loop Example

```sql
CREATE OR REPLACE PROCEDURE p_cursors_for_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  datePlusOne TIMESTAMP(6);
  --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
  c_product CURSOR
  FOR
     SELECT
      OBJECT_CONSTRUCT('NAME', name, 'PRICE', price, 'CREATE_ON', create_on) sc_cursor_record FROM
      products
     WHERE price BETWEEN ? AND ?;
 BEGIN
  OPEN c_product USING (3, 5);
  --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
  FOR record_product IN c_product DO
     LET record_product OBJECT := record_product.sc_cursor_record;
     datePlusOne :=
                    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
                    record_product.CREATE_ON + 1;
                    INSERT INTO sold_items
                    SELECT
      :record_product:NAME,
      :record_product:PRICE,
      :datePlusOne;
  END FOR;
  CLOSE c_product;
 END;
$$;
```

#### 5.2 Cursors Fetch

##### Oracle Cursor Fetch Example

```sql
CREATE OR REPLACE PROCEDURE p_cursors_fetch AS
record_product products%rowtype;
 CURSOR c_product(low number, high number) IS
    SELECT * FROM products WHERE price BETWEEN low AND high;
BEGIN
    OPEN c_product(3,5);
    LOOP
        FETCH c_product INTO record_product;
        EXIT WHEN c_product%notfound;
        INSERT INTO sold_items VALUES (record_product.name, record_product.price);
        INSERT INTO sold_items VALUES record_product;
    END LOOP;
    CLOSE c_product;
END;
```

##### Snowflake Cursor Fetch Example

```sql
CREATE OR REPLACE PROCEDURE p_cursors_fetch ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  record_product OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
  --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
  c_product CURSOR
  FOR
     SELECT
      OBJECT_CONSTRUCT( *) sc_cursor_record FROM
      products
     WHERE price BETWEEN ? AND ?;
 BEGIN
  OPEN c_product USING (3, 5);
  --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
  LOOP
     --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
      FETCH c_product INTO
      :record_product;
      IF (record_product IS NULL) THEN
      EXIT;
      END IF;
      INSERT INTO sold_items
      SELECT
      :record_product:NAME,
      :record_product:PRICE;
      INSERT INTO sold_items
      SELECT
      null !!!RESOLVE EWI!!! /*** SSC-EWI-OR0002 - COLUMNS FROM EXPRESSION products%rowtype NOT FOUND ***/!!!;
  END LOOP;
    CLOSE c_product;
 END;
$$;
```

## Known Issues

### 1. RETURN clause is not supported in Snowflake Scripting Cursor Declaration

The Cursor Declaration for Snowflake Scripting does not include this clause. It can be removed from the Oracle Cursor definition to get functional equivalence.

#### 2. OPEN statement cannot pass values for declared arguments

Even though arguments can be declared for a cursor, their values cannot be assigned in Snowflake Scripting. The best alternative is to use the `USING` clause with bind variables.

#### 3. FETCH statement cannot use records

Snowflake Scripting does not support records. However, it is possible to migrate them using the OBJECT data type and the OBJECT_CONSTRUCT() method. For more information please see the [Record Type Definition Section](collections-and-records.md).

#### 4. FETCH BULK COLLECT INTO clause is not supported in Snowflake Scripting

Snowflake Scripting does not support the BULK COLLECT INTO clause. However, it is possible to use ARRAY_AGG along with a temporal table to construct a new variable with the data corresponding to the Cursor information. For more information please see the [Collection Bulk Operations Section](collections-and-records.md).

#### 5. Cursor attributes do not exist in Snowflake Scripting

Oracle cursors have different attributes that allow the user to check their status like if it is opened or the amount of fetched rows, however, these attributes regarding the cursor status do not exist in Snowflake Scripting.

#### 6. The cursor’s query does not have access to the procedure’s variables and parameters

In Oracle, the query in the cursor declaration has access to procedure variables and parameters but in Snowflake Scripting, it does not. The alternative to this is to use the `USING` clause with bind variables.

#### 7. %NOTFOUND attribute is not supported in Snowflake Scripting Cursor

In Oracle can be used, before the first fetch from an open cursor, cursor_name%NOTFOUND returns TRUE if the last fetch failed to return a row, or FALSE if the last fetch returned a row. Snowflake Scripting does not support the use of this attribute instead it can be validated if the variable assigned to the cursor result contains values

## Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-OR0002](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Columns from expression not found.
3. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
4. [SSC-PRF-0003](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.
5. [SSC-PRF-0004](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop.

## CURSOR DECLARATION

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement removed from the migration; because it is a non-relevant syntax. It means that it is not required in Snowflake.**

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This section explains the translation of the declaration of cursors in Oracle. For more information review the following documentation about [procedures](https://docs.oracle.com/en/database/oracle/oracle-database/19/lnpls/CREATE-PROCEDURE-statement.html#GUID-5F84DB47-B5BE-4292-848F-756BF365EC54) and [cursors](https://docs.oracle.com/en/database/oracle/oracle-database/19/lnpls/cursor-variable-declaration.html#GUID-CE884B31-07F0-46AA-8067-EBAF73821F3D) in Oracle.

### Sample Source Patterns

#### CURSOR DECLARATION

Notice that in this example the `CURSOR` statement has been deleted. This is a non-relevant syntax in the transformation targeted to Snowflake.

##### Oracle

```sql
CREATE PROCEDURE PROC_COLLECTIONS
AS
CURSOR C2 RETURN T1%TYPE;
BEGIN
    NULL;
END
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_COLLECTIONS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Cursor Variables

Translation reference for cursor variables and the OPEN FOR statement

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A cursor variable is like an explicit cursor that is not limited to one query.
>
> ([Oracle PL/SQL Language Reference Cursor Variable Declaration](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/cursor-variable-declaration.html#GUID-CE884B31-07F0-46AA-8067-EBAF73821F3D))

#### Oracle Syntax

**Ref cursor type definition**

```sql
TYPE type IS REF CURSOR
  [ RETURN
    { {db_table_or_view | cursor | cursor_variable}%ROWTYPE
    | record%TYPE
    | record_type
    | ref_cursor_type
    }
  ] ;
```

**Cursor variable declaration**

```sql
cursor_variable type;
```

**OPEN FOR statement**

```sql
OPEN { cursor_variable | :host_cursor_variable}
  FOR select_statement [ using_clause ] ;
```

> **Warning:**
>
> Snowflake Scripting has no direct equivalence with cursor variables and the `OPEN FOR` statement, however, they can be emulated with different workarounds to get functional equivalence.

### Sample Source Patterns

#### 1. OPEN FOR statement with dynamic SQL inside a VARCHAR variable

##### Oracle Example

```sql
CREATE OR REPLACE PROCEDURE procedure1
AS
	query1 VARCHAR(200) := 'SELECT 123 FROM dual';
	cursor_var SYS_REFCURSOR;
BEGIN
	OPEN cursor_var FOR query1;
	CLOSE cursor_var;
END;
```

##### Snowflake Scripting Example

```sql
CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		query1 VARCHAR(200) := 'SELECT 123 FROM dual';
		cursor_var_res RESULTSET;
	BEGIN
		!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
		cursor_var_res := (
			EXECUTE IMMEDIATE :query1
		);
		LET cursor_var CURSOR
		FOR
			cursor_var_res;
		OPEN cursor_var;
		CLOSE cursor_var;
	END;
$$;
```

#### 2. OPEN FOR statement with dynamic SQL inside a string literal.

##### Oracle Example

```sql
CREATE OR REPLACE PROCEDURE procedure2
AS
    cursor_var SYS_REFCURSOR;
BEGIN
    OPEN cursor_var FOR 'SELECT 123 FROM dual';
    CLOSE cursor_var;
END;
```

##### Snowflake Scripting Example

```sql
CREATE OR REPLACE PROCEDURE procedure2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        cursor_var_res RESULTSET;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_var_res := (
            EXECUTE IMMEDIATE 'SELECT 123 FROM dual'
        );
        LET cursor_var CURSOR
        FOR
            cursor_var_res;
        OPEN cursor_var;
        CLOSE cursor_var;
    END;
$$;
```

#### 3. OPEN FOR statement with SELECT statement

##### Oracle Example

```sql
CREATE OR REPLACE PROCEDURE procedure3
AS
	cursor_var SYS_REFCURSOR;
BEGIN
	OPEN cursor_var FOR SELECT 123 FROM dual;
	CLOSE cursor_var;
END;
```

##### Snowflake Scripting Example

```sql
CREATE OR REPLACE PROCEDURE procedure3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		cursor_var_res RESULTSET;
	BEGIN
		LET cursor_var CURSOR
		FOR
			SELECT 123 FROM dual;
		OPEN cursor_var;
		CLOSE cursor_var;
	END;
$$;
```

#### 4. Cursor Variable declared with REF CURSOR type

##### Oracle Example

```sql
CREATE OR REPLACE PROCEDURE procedure4
AS
    TYPE cursor_ref_type1 IS REF CURSOR;
    query1 VARCHAR(200) := 'SELECT 123 FROM dual';
    cursor_var cursor_ref_type1;
BEGIN
    OPEN cursor_var FOR query1;
    CLOSE cursor_var;
END;
```

##### Snowflake Scripting Example

```sql
CREATE OR REPLACE PROCEDURE procedure4 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL REF CURSOR TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE cursor_ref_type1 IS REF CURSOR;
        query1 VARCHAR(200) := 'SELECT 123 FROM dual';
        cursor_var_res RESULTSET;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_var_res := (
            EXECUTE IMMEDIATE :query1
        );
        LET cursor_var CURSOR
        FOR
            cursor_var_res;
        OPEN cursor_var;
        CLOSE cursor_var;
    END;
$$;
```

#### 5. OPEN FOR statement with USING clause

##### Oracle Example

```sql
CREATE OR REPLACE PROCEDURE procedure5
AS
    query1 VARCHAR(200) := 'SELECT col1 FROM cursortable1 WHERE col1 = :a';
    column_filter INTEGER := 1;
    cursor_var SYS_REFCURSOR;
BEGIN
    OPEN cursor_var FOR query1 USING column_filter;
    CLOSE cursor_var;
END;
```

##### Snowflake Scripting Example

```sql
CREATE OR REPLACE PROCEDURE procedure5 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        query1 VARCHAR(200) := 'SELECT col1 FROM
   cursortable1
WHERE col1 = ?';
        column_filter INTEGER := 1;
        cursor_var_res RESULTSET;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        cursor_var_res := (
            EXECUTE IMMEDIATE :query1 USING ( column_filter)
        );
        LET cursor_var CURSOR
        FOR
            cursor_var_res;
        OPEN cursor_var;
        CLOSE cursor_var;
    END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.
2. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.

## PARAMETRIZED CURSOR

Parametrized Cursor is not supported by Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Oracle supports parameters for cursors that are declared. However, Snowflake Scripting does not support this feature, so the declaration and the usage of the cursor are not possible.

#### Example Code

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE parametrized_cursor_sample AS
    CURSOR cursor1(param1 number) IS SELECT region_name FROM hr.regions where region_id = param1 ORDER BY region_name;
    var1 integer;
BEGIN
    OPEN cursor1(123);
    FETCH cursor1 INTO var1;
    CLOSE cursor1;
    FOR r1 IN cursor1(456) LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE parametrized_cursor_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT
                OBJECT_CONSTRUCT('REGION_NAME', region_name) sc_cursor_record FROM
                hr.regions
            where region_id = ?
            ORDER BY region_name;
                var1 integer;
    BEGIN
                OPEN cursor1 USING (123);
                FETCH cursor1 INTO
            :var1;
    CLOSE cursor1;
                OPEN cursor1 USING (456);
                --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
                FOR r1 IN cursor1 DO
            LET r1 OBJECT := r1.sc_cursor_record;
                   NULL;
                END FOR;
                CLOSE cursor1;
    END;
$$;
```

#### Recommendations

* Try using bindings for the query in the cursor and open the cursor with the `USING` clause. Keep in mind that a parameter that is used multiple times on a single cursor may require passing the variable multiple times in the `USING` clause.

##### Snowflake Query

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.parametrized_cursor_sample_fixed ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
      var1 STRING;
      cursor1 CURSOR FOR SELECT region_name FROM hr.regions where region_id = ? ORDER BY region_name;
   BEGIN
      NULL;
      OPEN cursor1 USING (1);
      FETCH cursor1 INTO var1;
      CLOSE cursor1;
      OPEN cursor1 USING (2);
      FOR r1 IN cursor1 DO
         NULL;
      END FOR;
      CLOSE cursor1;
   END;
$$;
```

* Manually change the cursor to use bindings.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

1. [SSC-PRF-0004](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop.

## Workaround for cursors using parameters or procedure variables

### Description

This section describes how to simulate the usage of cursor parameters and procedure variables inside the query of a cursor. The name of the variables or parameters is replaced with bindings using the `?` sign. Then, when the cursor is opened, the values should be passed with the `USING` clause.

> **Note:**
>
> ```none
> Some parts in the output code are omitted for clarity reasons.
> ```

#### Cursor with local variables

Use bindings for the query in the cursor for variable or procedure parameter used and open the cursor with the `USING` clause.

##### Oracle Cursor

```sql
CREATE OR REPLACE PROCEDURE oracle_cursor_sample
AS
    like_value VARCHAR(255);
    CURSOR c1 IS SELECT region_name FROM hr.regions WHERE region_name LIKE like_value ORDER BY region_name;
    r_name VARCHAR(255);
BEGIN
    like_value := 'E%';
    OPEN c1;
    FETCH c1 INTO r_name;
    CLOSE c1;
    like_value := 'A%';
    FOR r1 IN c1 LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting Cursor

```sql
CREATE OR REPLACE PROCEDURE oracle_cursor_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        like_value VARCHAR(255);
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        c1 CURSOR
        FOR
            SELECT region_name FROM
                hr.regions
            WHERE region_name LIKE ?
            ORDER BY region_name;
        r_name VARCHAR(255);
    BEGIN
        like_value := 'E%';
        OPEN c1 USING (like_value);
        FETCH c1 INTO
            :r_name;
    CLOSE c1;
        like_value := 'A%';
        OPEN c1;
        --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
        FOR r1 IN c1 DO
            NULL;
        END FOR;
        CLOSE c1;
    END;
$$;
```

#### Cursor with parameters

Use bindings for the query in the cursor for each parameter used and open the cursor with the `USING` clause. Keep in mind that a parameter that is used multiple times on a single cursor may require passing the variable multiple times in the `USING` clause.

##### Oracle Cursor

```sql
CREATE OR REPLACE PROCEDURE parametrized_cursor_sample AS
    CURSOR cursor1(param1 number) IS SELECT region_name FROM hr.regions where region_id = param1 ORDER BY region_name;
    var1 integer;
BEGIN
    OPEN cursor1(123);
    FETCH cursor1 INTO var1;
    CLOSE cursor1;
    FOR r1 IN cursor1(456) LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting Cursor

```sql
CREATE OR REPLACE PROCEDURE parametrized_cursor_sample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        cursor1 CURSOR
        FOR
            SELECT
                OBJECT_CONSTRUCT('REGION_NAME', region_name) sc_cursor_record FROM
                hr.regions
            where region_id = ?
            ORDER BY region_name;
                var1 integer;
    BEGIN
                OPEN cursor1 USING (123);
                FETCH cursor1 INTO
            :var1;
    CLOSE cursor1;
                OPEN cursor1 USING (456);
                --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
                FOR r1 IN cursor1 DO
            LET r1 OBJECT := r1.sc_cursor_record;
                   NULL;
                END FOR;
                CLOSE cursor1;
    END;
$$;
```

### Related EWIs

1. [SSC-PRF-0004](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop

---
title: SnowConvert AI - Oracle - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/README.md
section: Migrations
---

# SnowConvert AI - Oracle - Data Types

This section shows equivalents between data types in Oracle and Snowflake, as well as some notes on arithmetic differences.

| **Oracle** | **Snowflake** |
| --- | --- |
| ANSI Data Types | *\*Go to the link to get more information* |
| [BFILE](oracle-built-in-data-types.md) | VARCHAR |
| [BINARY_DOUBLE](oracle-built-in-data-types.md) | FLOAT |
| [BINARY_FLOAT](oracle-built-in-data-types.md) | FLOAT |
| [BLOB](oracle-built-in-data-types.md) | BINARY |
| [CHAR (N)](oracle-built-in-data-types.md) | CHAR (N) |
| [CLOB](oracle-built-in-data-types.md) | VARCHAR |
| [DATE](oracle-built-in-data-types.md) | TIMESTAMP |
| [FLOAT](oracle-built-in-data-types.md) | FLOAT |
| [INTERVAL YEAR TO MONTH](oracle-built-in-data-types.md) | VARCHAR(20) |
| [INTERVAL DAY TO SECOND](oracle-built-in-data-types.md) | VARCHAR(20) |
| [JSON](oracle-built-in-data-types.md) | VARIANT |
| [LONG](oracle-built-in-data-types.md) | VARCHAR |
| [LONG RAW](oracle-built-in-data-types.md) | BINARY |
| [NCHAR (N)](oracle-built-in-data-types.md) | NCHAR (N) |
| [NCLOB](oracle-built-in-data-types.md) | VARCHAR |
| [NUMBER(p, s)](oracle-built-in-data-types.md) | NUMBER(p, s) |
| [NVARCHAR2 (N)](oracle-built-in-data-types.md) | VARCHAR (N) |
| [RAW](oracle-built-in-data-types.md) | BINARY |
| [ROWID](rowid-types.md) | VARCHAR(18) |
| [VARCHAR2 (N)](oracle-built-in-data-types.md) | VARCHAR (N) |
| [SDO_GOMETRY](spatial-types.md) | Currently not supported |
| [SDO_TOPO___GEOMETRY](spatial-types.md) | *\*to be defined* |
| [SDO_GEORASTER](spatial-types.md) | *\*to be defined* |
| [SYS.ANYDATA](any-types.md) | VARIANT |
| [SYS.ANYDATASET](any-types.md) | *\*to be defined* |
| [SYS.ANYTYPE](any-types.md) | *\*to be defined* |
| [TIMESTAMP](oracle-built-in-data-types.md) | TIMESTAMP |
| [TIMESTAMP WITH TIME ZONE](oracle-built-in-data-types.md) | TIMESTAMP_TZ |
| [TIMESTAMP WITH LOCAL TIME ZONE](oracle-built-in-data-types.md) | TIMESTAMP_LTZ |
| [URITYPE](xml-types.md) | *\*to be defined* |
| [UROWID](rowid-types.md) | VARCHAR(18) |
| [VARCHAR](oracle-built-in-data-types.md) | VARCHAR |
| [VARCHAR2](oracle-built-in-data-types.md) | VARCHAR |
| [XMLType](xml-types.md) | VARIANT |

## Notes on arithmetic operations

Please be aware that every operation performed on numerical datatypes is internally stored as a Number. Furthermore, depending on the operation performed it is possible to incur an error related to how intermediate values are stored within Snowflake, for more information please check this post on [Snowflake’s post on intermediate numbers in Snowflake](https://community.snowflake.com/s/question/0D50Z00008HhSHCSA3/sql-compilation-error-invalid-intermediate-datatype-number7148).

## ANSI Data Types

### Description

> SQL statements that create tables and clusters can also use ANSI data types and data types from the IBM products SQL/DS and DB2. Oracle recognizes the ANSI or IBM data type name that differs from the Oracle Database data type name. It converts the data type to the equivalent Oracle data type, records the Oracle data type as the name of the column data type, and stores the column data in the Oracle data type based on the conversions shown in the tables that follow. ([Oracle Language Reference ANSI, DB2, and SQL/DS Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-0BC16006-32F1-42B1-B45E-F27A494963FF)).

When creating a new table, Oracle and Snowflake handle some data types as synonyms and aliases and transform them into the default data type. As shown in the next table:

| ANSI | ORACLE | SNOWFLAKE |
| --- | --- | --- |
| CHARACTER (n) | CHAR (n) | VARCHAR |
| CHAR (n) | CHAR (n) | VARCHAR |
| CHARACTER VARYING (n) | VARCHAR2 (n) | VARCHAR |
| CHAR VARYING (n) | VARCHAR2 (n) | VARCHAR |
| NATIONAL CHARACTER (n) | NCHAR (n) | VARCHAR\* |
| NATIONAL CHAR (n) | NCHAR (n) | VARCHAR\* |
| NCHAR (n) | NCHAR (n) | VARCHAR |
| NATIONAL CHARACTER VARYING (n) | NVARCHAR2 (n) | VARCHAR\* |
| NATIONAL CHAR VARYING (n) | NVARCHAR2 (n) | VARCHAR\* |
| NCHAR VARYING (n) | NVARCHAR2 (n) | NUMBER (p, s) |
| NUMERIC [(p, s)] | NUMBER (p, s) | NUMBER (p, s) |
| DECIMAL [(p, s)] | NUMBER (p, s) | NUMBER (38) |
| INTEGER | NUMBER (38) | NUMBER (38) |
| INT | NUMBER (38) | NUMBER (38) |
| SMALLINT | NUMBER (38) | NUMBER (38) |
| FLOAT | FLOAT (126) | DOUBLE |
| DOUBLE PRECISION | FLOAT (126) | DOUBLE |
| REAL | FLOAT (63) | DOUBLE |

To get more information about the translation specification of the Oracle data types, go to [Oracle Built-in Data Types](oracle-built-in-data-types.md).

> **Note:**
>
> VARCHAR\*: Almost all the ANSI datatypes compile in Snowflake, but those marked with an asterisk, are manually converted to VARCHAR.

### Known Issues

No issues were found.

### Related EWIs

EWIs related to these data types are specified in the transformation of the [Oracle Built-in data types.](oracle-built-in-data-types.md)

## Data Type Customization

SnowConvert AI enables Data Type Customization to specify rules for data type transformation based on data type origin and column name. This feature allows you to personalize data type conversions and set precision values more accurately during migration.

For complete documentation on configuring data type customization, including JSON structure, configuration options, and priority rules, see [Data type mappings](../../../../general/getting-started/running-snowconvert/conversion/oracle-conversion-settings.md) in the Oracle Conversion Settings documentation.

### NUMBER to DECFLOAT Transformation

SnowConvert AI supports transforming Oracle `NUMBER` columns to Snowflake `DECFLOAT` data type. This is useful when you need to preserve the exact decimal precision of numeric values during migration.

When a `NUMBER` column is configured to be transformed to `DECFLOAT`:

1. The column data type in `CREATE TABLE` statements is transformed to `DECFLOAT`
2. Numeric literals in `INSERT` statements that target `DECFLOAT` columns are automatically wrapped with `CAST(... AS DECFLOAT)` to ensure proper data type handling
3. Column references in `INSERT ... SELECT` statements are also cast appropriately

#### Example

##### Oracle

```sql
CREATE TABLE products (
    product_id NUMBER(10),
    price NUMBER(15, 2)
);

INSERT INTO products VALUES (1, 99.99);
```

##### Snowflake (with DECFLOAT customization for price column)

```sql
CREATE OR REPLACE TABLE products (
    product_id NUMBER(10),
    price DECFLOAT
);

INSERT INTO products VALUES (1, CAST(99.99 AS DECFLOAT));
```

> **Note:**
>
> The TypeMappings report (TypeMappings.csv) provides a detailed view of all data type transformations applied during conversion. See [TypeMappings Report](../../../../general/getting-started/running-snowconvert/review-results/reports/type-mappings-report.md) for more information.

---
title: SnowConvert AI - Oracle - DML STATEMENTS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/dml-statements.md
section: Migrations
---

# SnowConvert AI - Oracle - DML STATEMENTS

## Description

DML statement extensions differ from normal DML statements because they can use PL/SQL elements like collections and records. So far some of these elements are not supported by snowflake scripting. If one statement is not supported, an EWI will be added during the translation. Other DML statements will be translated as if they were not inside a procedure.

## INSERT Statement Extension

Translation reference to convert Oracle INSERT Statement Extension to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The PL/SQL extension to the SQL `INSERT` statement lets you specify a record name in the `values_clause` of the `single_table_insert` instead of specifying a column list in the `insert_into_clause.` ([Oracle PL/SQL Language Reference INSERT Statement Extension](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/INSERT-statement-extension.html#GUID-D81224C4-06DE-4635-A850-41D29D4A8E1B))

Snowflake INSERT INTO differs from Snowflake Scripting in variable constraints; needing to have the names preceded by a colon ‘:’ to bind the variables’ value.

### Recommendations

> **Note:**
>
> This code was executed for a better understanding of the examples:

#### Oracle

```sql
CREATE TABLE numbers_table(num integer, word varchar2(20));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE PUBLIC.numbers_table (num integer,
word VARCHAR(20));
```

#### INSERT Statement Extension simple case

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc_insert_statement
AS
number_variable integer := 10;
word_variable varchar2(20) := 'ten';
BEGIN
	INSERT INTO numbers_table VALUES(number_variable, word_variable);
	INSERT INTO numbers_table VALUES(11, 'eleven');
END;

CALL proc_insert_statement();
SELECT * FROM numbers_table ;
```

##### Result

| NUM | WORD |
| --- | --- |
| 10 | ten |
| 11 | eleven |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE proc_insert_statement ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		number_variable integer := 10;
		word_variable VARCHAR(20) := 'ten';
	BEGIN
		INSERT INTO numbers_table
		VALUES(:number_variable, :word_variable);
		INSERT INTO numbers_table
		VALUES(11, 'eleven');
	END;
$$;

CALL proc_insert_statement();

SELECT * FROM
	numbers_table;
```

##### Result

| NUM | WORD |
| --- | --- |
| 10 | ten |
| 11 | eleven |

### Known Issues

#### 1. Records are not supported by Snowflake Scripting

Since records are not supported by snowflake scripting, instead of using the `VALUES record` clause, it is necessary to change it into a SELECT clause and split the columns of the record. For more information please see the [Record Type Definition Section](collections-and-records.md).

### Related EWIs

No related EWIs.

## MERGE Statement

Translation reference to convert Oracle MERGE statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The `MERGE` statement is used to select rows from one or more sources for update or insertion into a table or view. It is possible to specify conditions to determine whether to update or insert into the target table or view. This statement is a convenient way to combine multiple operations. It lets you avoid multiple `INSERT`, `UPDATE`, and `DELETE` DML statements. `MERGE` is a deterministic statement. It is not possible to update the same row of the target table multiple times in the same `MERGE` statement. ([Oracle PL/SQL Language Reference MERGE Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/MERGE.html#GUID-5692CCB7-24D9-4C0E-81A7-A22436DC968F))

#### Oracle MERGE Syntax

```sql
MERGE [ hint ]
   INTO [ schema. ] { table | view } [ t_alias ]
   USING { [ schema. ] { table | view }
         | ( subquery )
         } [ t_alias ]
   ON ( condition )
   [ merge_update_clause ]
   [ merge_insert_clause ]
   [ error_logging_clause ] ;

merge_update_clause := WHEN MATCHED THEN
UPDATE SET column = { expr | DEFAULT }
           [, column = { expr | DEFAULT } ]...
[ where_clause ]
[ DELETE where_clause ]

merge_insert_clause := WHEN NOT MATCHED THEN
INSERT [ (column [, column ]...) ]
VALUES ({ expr | DEFAULT }
          [, { expr | DEFAULT } ]...
       )
[ where_clause ]

error_logging_clause := LOG ERRORS
  [ INTO [schema.] table ]
  [ (simple_expression) ]
  [ REJECT LIMIT { integer | UNLIMITED } ]

where_clause := WHERE condition
```

##### Snowflake Scripting MERGE Syntax

```sql
MERGE INTO <target_table> USING <source> ON <join_expr>
{ matchedClause | notMatchedClause } [ ... ]

matchedClause ::= WHEN MATCHED [ AND <case_predicate> ]
THEN { UPDATE SET <col_name> = <expr> [ , <col_name2> = <expr2> ... ] | DELETE } [ ... ]

notMatchedClause ::= WHEN NOT MATCHED [ AND <case_predicate> ]
THEN INSERT [ ( <col_name> [ , ... ] ) ] VALUES ( <expr> [ , ... ] )
```

### Sample Source Patterns

#### Sample auxiliary data

> **Note:**
>
> This code was executed for a better understanding of the examples:

##### Oracle

```sql
CREATE TABLE people_source (
    person_id INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR2(20) NOT NULL,
    last_name VARCHAR2(20) NOT NULL,
    title VARCHAR2(10) NOT NULL
);

CREATE TABLE people_target (
    person_id INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR2(20) NOT NULL,
    last_name VARCHAR2(20) NOT NULL,
    title VARCHAR2(10) NOT NULL
);

CREATE TABLE bonuses (
    employee_id NUMBER,
    bonus NUMBER DEFAULT 100
);

INSERT INTO people_target
VALUES (1, 'John', 'Smith', 'Mr');

INSERT INTO people_target
VALUES (2, 'alice', 'jones', 'Mrs');

INSERT INTO people_source
VALUES (2, 'Alice', 'Jones', 'Mrs.');

INSERT INTO people_source
VALUES (3, 'Jane', 'Doe', 'Miss');

INSERT INTO people_source
VALUES (4, 'Dave', 'Brown', 'Mr');

INSERT INTO
    bonuses(employee_id) (
        SELECT
            e.employee_id
        FROM
            hr.employees e,
            oe.orders o
        WHERE
            e.employee_id = o.sales_rep_id
        GROUP BY
            e.employee_id
    );
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE people_source (
    person_id INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR(20) NOT NULL,
    last_name VARCHAR(20) NOT NULL,
    title VARCHAR(10) NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE people_target (
    person_id INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR(20) NOT NULL,
    last_name VARCHAR(20) NOT NULL,
    title VARCHAR(10) NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE bonuses (
    employee_id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    bonus NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ DEFAULT 100
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO people_target
VALUES (1, 'John', 'Smith', 'Mr');

INSERT INTO people_target
VALUES (2, 'alice', 'jones', 'Mrs');

INSERT INTO people_source
VALUES (2, 'Alice', 'Jones', 'Mrs.');

INSERT INTO people_source
VALUES (3, 'Jane', 'Doe', 'Miss');

INSERT INTO people_source
VALUES (4, 'Dave', 'Brown', 'Mr');

INSERT INTO bonuses(employee_id) (
    SELECT
        e.employee_id
    FROM
        hr.employees e,
        oe.orders o
    WHERE
        e.employee_id = o.sales_rep_id
    GROUP BY
        e.employee_id
);
```

#### MERGE Statement simple case

##### Oracle

```sql
MERGE INTO people_target pt USING people_source ps ON (pt.person_id = ps.person_id)
WHEN MATCHED THEN
UPDATE
SET
    pt.first_name = ps.first_name,
    pt.last_name = ps.last_name,
    pt.title = ps.title
    WHEN NOT MATCHED THEN
INSERT
    (
        pt.person_id,
        pt.first_name,
        pt.last_name,
        pt.title
    )
VALUES
    (
        ps.person_id,
        ps.first_name,
        ps.last_name,
        ps.title
    );

SELECT * FROM people_target;
```

##### Result

| PERSON_ID | FIRST_NAME | LAST_NAME | TITLE |
| --- | --- | --- | --- |
| 1 | John | Smith | Mr |
| 2 | Alice | Jones | Mrs. |
| 3 | Jane | Doe | Miss |
| 4 | Dave | Brown | Mr |

##### Snowflake

```sql
MERGE INTO people_target pt USING people_source ps ON (pt.person_id = ps.person_id)
WHEN MATCHED THEN
    UPDATE
SET
    pt.first_name = ps.first_name,
    pt.last_name = ps.last_name,
    pt.title = ps.title
WHEN NOT MATCHED THEN
INSERT
    (
        pt.person_id,
        pt.first_name,
        pt.last_name,
        pt.title
    )
VALUES
    (
        ps.person_id,
        ps.first_name,
        ps.last_name,
        ps.title
    );

SELECT * FROM
    people_target;
```

##### Result

| PERSON_ID | FIRST_NAME | LAST_NAME | TITLE |
| --- | --- | --- | --- |
| 1 | John | Smith | Mr |
| 2 | Alice | Jones | Mrs. |
| 3 | Jane | Doe | Miss |
| 4 | Dave | Brown | Mr |

#### MERGE Statement with DELETE and where clause

To find an equivalence for the **DELETE** statement and the **where clause**, it is necessary to reorder and implement some changes in the Snowflake merge statement.

##### Changed required:

* Replace the Oracle’s **DELETE where_clause** with a new Snowflake’s **matchedClause** with the **AND predicate** statement
* Replace the **where_clause** from the Oracle’s **merge_insert_clause** with an **AND predicate** statement in the Snowflake’s **notMatchedClause**

##### Oracle

```sql
MERGE INTO bonuses D USING (
    SELECT
        employee_id,
        salary,
        department_id
    FROM
        hr.employees
    WHERE
        department_id = 80
) S ON (D.employee_id = S.employee_id)
WHEN MATCHED THEN
UPDATE
SET
    D.bonus = D.bonus + S.salary *.01 DELETE
WHERE
    (S.salary > 8000)
    WHEN NOT MATCHED THEN
INSERT
    (D.employee_id, D.bonus)
VALUES
    (S.employee_id, S.salary *.01)
WHERE
    (S.salary <= 8000);

SELECT * FROM bonuses ORDER BY employee_id;
```

##### Result

| EMPLOYEE_ID | BONUS |
| --- | --- |
| 153 | 180 |
| 154 | 175 |
| 155 | 170 |
| 159 | 180 |
| 160 | 175 |
| 161 | 170 |
| 164 | 72 |
| 165 | 68 |
| 166 | 64 |
| 167 | 62 |
| 171 | 74 |
| 172 | 73 |
| 173 | 61 |
| 179 | 62 |

##### Snowflake

```sql
--** SSC-FDM-OR0018 - SNOWFLAKE MERGE STATEMENT MAY HAVE SOME FUNCTIONAL DIFFERENCES COMPARED TO ORACLE **
MERGE INTO bonuses D USING (
 SELECT
     employee_id,
     salary,
     department_id
 FROM
     hr.employees
 WHERE
     department_id = 80) S ON (D.employee_id = S.employee_id)
    WHEN MATCHED AND
    (S.salary > 8000) THEN
 DELETE
    WHEN MATCHED THEN
 UPDATE SET
    D.bonus = D.bonus + S.salary *.01
    WHEN NOT MATCHED AND
    (S.salary <= 8000) THEN
 INSERT
 (D.employee_id, D.bonus)
VALUES
 (S.employee_id, S.salary *.01);

SELECT * FROM
bonuses
ORDER BY employee_id;
```

##### Result

| EMPLOYEE_ID | BONUS |
| --- | --- |
| 153 | 180 |
| 154 | 175 |
| 155 | 170 |
| 159 | 180 |
| 160 | 175 |
| 161 | 170 |
| 164 | 72 |
| 165 | 68 |
| 166 | 64 |
| 167 | 62 |
| 171 | 74 |
| 172 | 73 |
| 173 | 61 |
| 179 | 62 |

> **Warning:**
>
> In some cases the changes applied may not work as expected, like the next example:

##### Oracle

```sql
MERGE INTO people_target pt USING people_source ps ON (pt.person_id = ps.person_id)
WHEN MATCHED THEN
UPDATE
SET
    pt.first_name = ps.first_name,
    pt.last_name = ps.last_name,
    pt.title = ps.title DELETE
where
    pt.title = 'Mrs.'
    WHEN NOT MATCHED THEN
INSERT
    (
        pt.person_id,
        pt.first_name,
        pt.last_name,
        pt.title
    )
VALUES
    (
        ps.person_id,
        ps.first_name,
        ps.last_name,
        ps.title
    )
WHERE
    ps.title = 'Mr';

SELECT * FROM people_target;
```

##### Result

| PERSON_ID | FIRST_NAME | LAST_NAME | TITLE |
| --- | --- | --- | --- |
| 1 | John | Smith | Mr |
| 4 | Dave | Brown | Mr |

##### Snowflake

```sql
--** SSC-FDM-OR0018 - SNOWFLAKE MERGE STATEMENT MAY HAVE SOME FUNCTIONAL DIFFERENCES COMPARED TO ORACLE **
MERGE INTO people_target pt USING people_source ps ON (pt.person_id = ps.person_id)
    WHEN MATCHED AND
    pt.title = 'Mrs.' THEN
        DELETE
    WHEN MATCHED THEN
        UPDATE SET
    pt.first_name = ps.first_name,
    pt.last_name = ps.last_name,
    pt.title = ps.title
    WHEN NOT MATCHED AND
    ps.title = 'Mr' THEN
        INSERT
        (
            pt.person_id,
            pt.first_name,
            pt.last_name,
            pt.title
        )
VALUES
        (
            ps.person_id,
            ps.first_name,
            ps.last_name,
            ps.title
        );

SELECT * FROM
        people_target;
```

##### Result

| PERSON_ID | FIRST_NAME | LAST_NAME | TITLE |
| --- | --- | --- | --- |
| 1 | John | Smith | Mr |
| 2 | Alice | Jones | Mrs. |
| 4 | Dave | Brown | Mr |

### Known Issues

#### 1. Oracle’s error_logging_clause is not supported

There is no equivalent for the error logging clause in Snowflake Scripting.

##### 2. Changed applied do not work as expected

Sometimes, the changes applied to achieve the functional equivalence between Oracle’s merge statement and Snowflake’s do not work as expected.

### Related EWIs

1. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
2. [SSC-FDM-OR0018](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Merge statement may not work as expected

## SELECT INTO Statement

Translation reference to convert Oracle SELECT INTO statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The `SELECT` `INTO` statement retrieves values from one or more database tables (as the SQL `SELECT` statement does) and stores them in variables (which the SQL `SELECT` statement does not do). ([Oracle PL/SQL Language Reference SELECT INTO Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/SELECT-INTO-statement.html#GUID-6E14E04D-4344-45F3-BE80-979DD26C7A90))

#### Oracle SELECT INTO Syntax

```sql
SELECT [ { DISTINCT | UNIQUE } | ALL ] select_list
    { into_clause | bulk_collect_into_clause } FROM rest-of-statement ;
```

##### Oracle Into Clause Syntax

```sql
INTO { variable [, variable ]... | record )
```

##### Oracle Bulk Collect Syntax

```sql
BULK COLLECT INTO { collection | :host_array }
  [, { collection | :host_array } ]...
```

##### Snowflake Scripting SELECT INTO Syntax

```sql
SELECT [ { ALL | DISTINCT } ]
    {
          [{<object_name>|<alias>}.]*
        | [{<object_name>|<alias>}.]<col_name>
        | [{<object_name>|<alias>}.]$<col_position>
        | <expr>
        [ [ AS ] <col_alias> ]
    }
    [ , ... ]
    INTO :<variable> [, :<variable> ... ]
    [...]
```

### Sample Source Patterns

#### Sample auxiliary data

> **Note:**
>
> This code was executed for a better understanding of the examples:

##### Oracle

```sql
CREATE TABLE numbers_table(num integer, word varchar2(20));
INSERT INTO numbers_table VALUES (1, 'one');
CREATE TABLE aux_numbers_table(aux_num integer, aux_word varchar2(20));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE numbers_table (num integer,
word VARCHAR(20))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO numbers_table
VALUES (1, 'one');

CREATE OR REPLACE TABLE aux_numbers_table (aux_num integer,
aux_word VARCHAR(20))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### SELECT INTO Statement simple case

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc_select_into_variables
AS
number_variable integer;
word_variable varchar2(20);
BEGIN
	SELECT * INTO number_variable, word_variable FROM numbers_table;
	INSERT INTO aux_numbers_table VALUES(number_variable, word_variable);
END;

CALL proc_select_into_variables();
SELECT * FROM aux_numbers_table;
```

##### Result

| AUX_NUM | AUX_WORD |
| --- | --- |
| 1 | one |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE proc_select_into_variables ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		number_variable integer;
		word_variable VARCHAR(20);
	BEGIN
		SELECT * INTO
			:number_variable,
			:word_variable
		FROM
			numbers_table;
		INSERT INTO aux_numbers_table
		VALUES(:number_variable, :word_variable);
	END;
$$;

CALL proc_select_into_variables();

SELECT * FROM
	aux_numbers_table;
```

##### Result

```none
|AUX_NUM|AUX_WORD|
|-------|--------|
|1      |one     |
```

### Known Issues

#### 1. BULK COLLECT INTO is not supported

Snowflake Scripting does not support the BULK COLLECT INTO clause. However, it is possible to use ARRAY_AGG to construct a new variable. For more information please see the [Collection Bulk Operations Section](collections-and-records.md).

##### 2. Collections and records are not supported

Snowflake Scripting does not support the use of collections nor records. It is possible to migrate them using Semi-structured data types as explained in [Collections and records](collections-and-records.md).

### Related EWIs

No related EWIs.

## Work around to simulate the use of Records

> **Warning:**
>
> This page is deprecated but was left for compatibility purposes. If you want to see the updated section, please refer to [Collections And Records](collections-and-records.md)

### Description

This section describes how to simulate the behavior of Oracle records in SELECT and INSERT Statements, using RESULTSET and CURSORS of Snowflake Scripting.

#### Snowflake Scripting RESULTSET and CURSOR

##### Snowflake RESULTSET Syntax

```sql
<resultset_name> RESULTSET [ DEFAULT ( <query> ) ] ;

LET <resultset_name> RESULTSET [ { DEFAULT | := } ( <query> ) ] ;

LET <resultset_name> RESULTSET [ { DEFAULT | := } ( <query> ) ] ;
```

### Recommendations

> **Note:**
>
> For the following examples, this code was executed to better understanding of the examples:

#### Oracle

```sql
CREATE TABLE numbers_table(num integer, word varchar2(20));
INSERT INTO numbers_table VALUES (1, 'one');
CREATE TABLE aux_numbers_table(aux_num integer, aux_word varchar2(20));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE numbers_table (num integer,
word VARCHAR(20))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO numbers_table
VALUES (1, 'one');

CREATE OR REPLACE TABLE aux_numbers_table (aux_num integer,
aux_word VARCHAR(20))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Using RESULTSET and Cursors instead of Records

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE proc_insert_select_resultset
AS
TYPE number_record_definition IS RECORD(
	rec_num numbers_table.num%type,
	rec_word numbers_table.word%type
);
number_record number_record_definition;
BEGIN
	SELECT * INTO number_record FROM numbers_table;
	INSERT INTO aux_numbers_table VALUES number_record;
END;

CALL proc_insert_select_resultset();
SELECT * FROM aux_numbers_table;
```

##### Result

| AUX_NUM | AUX_WORD |
| --- | --- |
| 1 | one |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE proc_insert_select_resultset ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
		TYPE number_record_definition IS RECORD(
			rec_num numbers_table.num%type,
			rec_word numbers_table.word%type
		);
		number_record OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - number_record_definition DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
	BEGIN
		SELECT
			OBJECT_CONSTRUCT( *) INTO
			:number_record
		FROM
			numbers_table;
		INSERT INTO aux_numbers_table
		SELECT
			:number_record:REC_NUM,
			:number_record:REC_WORD;
	END;
$$;

CALL proc_insert_select_resultset();

SELECT * FROM
	aux_numbers_table;
```

using cursor

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.proc_select_into()
RETURNS INTEGER
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
$$
DECLARE
    NUMBER_VARIABLE INTEGER;
    WORD_VARIABLE VARCHAR;
    NUMBER_RECORD RESULTSET;
BEGIN
    LET c2 CURSOR FOR NUMBER_RECORD;
    FOR row_variable IN c2 DO
        let var1 integer := row_variable.num;
        let var2 varchar := row_variable.word;
        INSERT INTO PUBLIC.aux_numbers_table VALUES(:var1, :var2);
    END FOR;
end;
$$;
```

##### Result

| AUX_NUM | AUX_WORD |
| --- | --- |
| 1 | one |

### Known Issues

#### 1. Limitation in the use of RESULTSET

RESULTSET is very limited in its use. If `table(result_scan(last_query_id()))` statement, should be used just after the RESULTSET’s query is executed. For further information check this [link](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/resultsets.html#limitations-of-the-resultset-data-type).

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.

---
title: SnowConvert AI - Oracle - HELPERS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/helpers.md
section: Migrations
---

# SnowConvert AI - Oracle - HELPERS

In this section you will find helper functions or procedures that are used to achieve functional equivalence of some Oracle features that are not supported natively in Snowflake Scripting.

## Bulk Cursor Helpers

> **Note:**
>
> You might also be interested in [Default FORALL transformation](README.md).

The Cursor is simulated with an `OBJECT` with different information regarding the state of the cursor. A temporary table is created to store the result set of the cursor’s query.

Most of these Procedures return a new Object with the updated state of the cursor.

### INIT_CURSOR

This function initializes a new object with the basic cursor information

```sql
CREATE OR REPLACE FUNCTION INIT_CURSOR(NAME VARCHAR, QUERY VARCHAR)
RETURNS OBJECT
AS
$$
  SELECT OBJECT_CONSTRUCT('NAME', NAME, 'ROWCOUNT', -1, 'QUERY', QUERY, 'ISOPEN', FALSE, 'FOUND', NULL, 'NOTFOUND', NULL)
$$;
```

### OPEN_BULK_CURSOR

These procedures create a temporary table with the query of the cursor. An optional overload exists to support bindings.

```sql
CREATE OR REPLACE PROCEDURE OPEN_BULK_CURSOR(CURSOR OBJECT, BINDINGS ARRAY)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
  var query = `CREATE OR REPLACE TEMPORARY TABLE ${CURSOR.NAME}_TEMP_TABLE AS ${CURSOR.QUERY}`;
  snowflake.execute({ sqlText: query, binds: BINDINGS });
  CURSOR.ROWCOUNT = 0;
  CURSOR.ISOPEN = true;
  return CURSOR;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE OPEN_BULK_CURSOR(CURSOR OBJECT)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL OPEN_BULK_CURSOR(:CURSOR, NULL));
    RETURN :RESULT;
  END;
$$;
```

### CLOSE_BULK_CURSOR

This procedure deletes the temporary table that stored the result set of the cursor and resets the cursor’s properties to their initial state.

```sql
CREATE OR REPLACE PROCEDURE CLOSE_BULK_CURSOR(CURSOR OBJECT)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
  var query = `DROP TABLE ${CURSOR.NAME}_TEMP_TABLE`;
  snowflake.execute({ sqlText: query });
  CURSOR.ROWCOUNT = -1;
  CURSOR.ISOPEN = false;
  CURSOR.FOUND = null;
  CURSOR.NOTFOUND = null;
  return CURSOR;
$$;
```

### FETCH Helpers

Due to Oracle being capable of doing the `FETCH` statement on different kind of scenarios, multiple procedures with overloads were created to handle each case. These helpers save the fetched values into the `RESULT` property in the `CURSOR` object.

Some of the overloads include variations when the `LIMIT` clause was used or not. Other overloads have a `COLUMN_NAMES` argument that is necessary when the `FETCH` statement is being done into a variable that has or contains records with column names that are different to the column names of the query.

#### FETCH_BULK_COLLECTION_RECORDS

These procedures are used when a `FETCH BULK` is done into a collection of records.

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTION_RECORDS(CURSOR OBJECT, LIMIT FLOAT, COLUMN_NAMES ARRAY)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
  var objectConstructArgs = [];
  if (COLUMN_NAMES) {
    for (let i = 0 ; i < COLUMN_NAMES.length ; i++) {
      objectConstructArgs.push("'" + COLUMN_NAMES[i] + "'");
      objectConstructArgs.push('$' + (i + 1));
    }
  } else {
    objectConstructArgs.push('*');
  }
  var limitValue = LIMIT ?? 'NULL';
  var query = `SELECT ARRAY_AGG(OBJECT_CONSTRUCT(${objectConstructArgs.join(', ')})) FROM (SELECT * FROM ${CURSOR.NAME}_TEMP_TABLE LIMIT ${limitValue} OFFSET ${CURSOR.ROWCOUNT})`;
  var stmt = snowflake.createStatement({ sqlText: query});
  var resultSet = stmt.execute();
  resultSet.next();
  CURSOR.RESULT = resultSet.getColumnValue(1);
  CURSOR.ROWCOUNT += CURSOR.RESULT.length;
  CURSOR.FOUND = CURSOR.RESULT.length > 0;
  CURSOR.NOTFOUND = !CURSOR.FOUND;
  return CURSOR;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTION_RECORDS(CURSOR OBJECT)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_COLLECTION_RECORDS(:CURSOR, NULL, NULL));
    RETURN :RESULT;
  END;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTION_RECORDS(CURSOR OBJECT, LIMIT INTEGER)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_COLLECTION_RECORDS(:CURSOR, :LIMIT, NULL));
    RETURN :RESULT;
  END;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTION_RECORDS(CURSOR OBJECT, COLUMN_NAMES ARRAY)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_COLLECTION_RECORDS(:CURSOR, NULL, :COLUMN_NAMES));
    RETURN :RESULT;
  END;
$$;
```

#### FETCH_BULK_COLLECTIONS

These procedures are used when the `FETCH` statement is done into one or multiple collections. Since the columns are specified in this `FETCH` operation, an override for specific `COLUMN_NAMES` is not necessary.

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTIONS(CURSOR OBJECT, LIMIT FLOAT)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
  var limitClause = '';
  var limitValue = LIMIT ?? 'NULL';
  var query = `SELECT * FROM ${CURSOR.NAME}_TEMP_TABLE LIMIT ${limitValue} OFFSET ${CURSOR.ROWCOUNT}`;
  var stmt = snowflake.createStatement({ sqlText: query});
  var resultSet = stmt.execute();
  var column_count = stmt.getColumnCount();
  CURSOR.RESULT = [];
  for (let i = 0 ; i < column_count ; i++) {
    CURSOR.RESULT[i] = [];
  }

  while (resultSet.next()) {
    for (let i = 1 ; i <= column_count ; i++) {
      let columnName = stmt.getColumnName(i);
      CURSOR.RESULT[i - 1].push(resultSet.getColumnValue(columnName));
    }
  }
  CURSOR.ROWCOUNT += stmt.getRowCount();
  CURSOR.FOUND = stmt.getRowCount() > 0;
  CURSOR.NOTFOUND = !CURSOR.FOUND;
  return CURSOR;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_COLLECTIONS(CURSOR OBJECT)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_COLLECTIONS(:CURSOR, NULL));
    RETURN :RESULT;
  END;
$$;
```

#### FETCH_BULK_RECORD_COLLECTIONS

These procedures are used when a `FETCH BULK` is done into a record of collections.

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_RECORD_COLLECTIONS(CURSOR OBJECT, LIMIT FLOAT, COLUMN_NAMES ARRAY)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
  var limitValue = LIMIT ?? 'NULL';
  var query = `SELECT * FROM ${CURSOR.NAME}_TEMP_TABLE LIMIT ${limitValue} OFFSET ${CURSOR.ROWCOUNT}`;
  var stmt = snowflake.createStatement({ sqlText: query});
  var resultSet = stmt.execute();
  var column_count = stmt.getColumnCount();
  CURSOR.RESULT = {};
  if (COLUMN_NAMES)
  {
    for (let i = 0 ; i < COLUMN_NAMES.length ; i++) {
      CURSOR.RESULT[COLUMN_NAMES[i]] = [];
    }
  } else {
    for (let i = 1 ; i <= column_count ; i++) {
      let columnName = stmt.getColumnName(i);
      CURSOR.RESULT[columnName] = [];
    }
  }

  while (resultSet.next()) {
    for (let i = 1 ; i <= column_count ; i++) {
      let columnName = stmt.getColumnName(i);
      let fieldName = COLUMN_NAMES ? COLUMN_NAMES[i - 1] : columnName;
      CURSOR.RESULT[fieldName].push(resultSet.getColumnValue(columnName));
    }
  }
  CURSOR.ROWCOUNT += stmt.getRowCount();
  CURSOR.FOUND = stmt.getRowCount() > 0;
  CURSOR.NOTFOUND = !CURSOR.FOUND;
  return CURSOR;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_RECORD_COLLECTIONS(CURSOR OBJECT)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_RECORD_COLLECTIONS(:CURSOR, NULL, NULL));
    RETURN :RESULT;
  END;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_RECORD_COLLECTIONS(CURSOR OBJECT, LIMIT INTEGER)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_RECORD_COLLECTIONS(:CURSOR, :LIMIT, NULL));
    RETURN :RESULT;
  END;
$$;
```

```sql
CREATE OR REPLACE PROCEDURE FETCH_BULK_RECORD_COLLECTIONS(CURSOR OBJECT, COLUMN_NAMES ARRAY)
RETURNS OBJECT
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT OBJECT;
  BEGIN
    RESULT := (CALL FETCH_BULK_RECORD_COLLECTIONS(:CURSOR, NULL, :COLUMN_NAMES));
    RETURN :RESULT;
  END;
$$;
```

---
title: SnowConvert AI - Oracle - Javascript Helpers
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-javascript/helpers.md
section: Migrations
---

# SnowConvert AI - Oracle - Javascript Helpers

In this section you will find the helper functions used inside procedures that are used to achieve functional equivalence of some Oracle features that are not supported natively in Snowflake.

## Between operator helper

### Between Operator Helper Function Definition

```javascript
var BetweenFunc = function (expression,startExpr,endExpr) {
   if ([expression,startExpr,endExpr].some((arg) => arg == null)) {
      return null;
   }
   return expression >= startExpr && expression <= endExpr;
};
```

## Concat Value Helper

> **Note:**
>
> This helper also uses IS NULL helper.

### Concat Helper Function Definition

Helper method used to concatenate values in a JavaScript Template Literal. This is necessary to check if values are null or not. Oracle handles null values as empty strings in concatenations.

```javascript
 :force:
 var concatValue = (arg) => IS_NULL(arg) ? "" : arg;
```

## Cursor Helper

> **Note:**
>
> You might also be interested in:
>
> * [Cursor FOR LOOP.](README.md)
> * [OPEN, FETCH and CLOSE statements](README.md).
> * [Cursor declaration.](README.md)

> **Note:**
>
> This helper also uses Raise helper and EXEC helper.

### Cursor Helper Function Definition

```javascript
var FETCH_INTO_COLLECTIONS = function (collections,fetchValues) {
   for(let i = 0;i < collections.length;i++) {
      collections[i].push(fetchValues[i]);
   }
};
var CURSOR = function (stmt,binds,isRefCursor,isOut) {
   var statementObj, result_set, total_rows, ISOPEN = false, result_set_table = '', self = this, row_count, found;
   this.CURRENT = new Object;
   this.INTO = function () {
         return self.res;
      };
   this.OPEN = function (openParameters) {
         if (ISOPEN && !isRefCursor) RAISE(-6511,"CURSOR_ALREADY_OPEN","cursor already open");
         var finalStmt = openParameters && openParameters.query || stmt;
         var parameters = openParameters && openParameters.binds || [];
         var finalBinds = binds instanceof Function ? binds(...parameters) : binds;
         finalBinds = finalBinds || parameters;
         try {
            if (isOut) {
               if (!temptable_prefix) {
                  temptable_prefix = `${procname}_TEMP_${(EXEC(`select current_session() || '_' || to_varchar(current_timestamp, 'yyyymmddhh24missss')`,{
                        sql : 0
                     }))[0]}_`;
               }
               if (!result_set_table) {
                  result_set_table = temptable_prefix + outCursorResultNumber++;
                  EXEC(`CREATE OR REPLACE TEMPORARY TABLE ${result_set_table} AS ${finalStmt}`,{
                     sql : 0
                  });
               }
               finalStmt = "SELECT * FROM " + result_set_table
            }
            [result_set,statementObj,total_rows] = [EXEC(finalStmt,finalBinds,{
                  sql : 0,
                  row : 2
               }),_RS,_RS.getColumnCount()]
            ISOPEN = true;
            row_count = 0;
         } catch(error) {
            RAISE(error.code,"error",error.message);
         }
         return this;
      };
   this.NEXT = function () {
         if (total_rows && result_set.next()) {
            this.CURRENT = new Object;
            for(let i = 1;i <= statementObj.getColumnCount();i++) {
               (this.CURRENT)[statementObj.getColumnName(i)] = result_set.getColumnValue(i);
            }
            return true;
         } else return false;
      };
   this.FETCH = function (record) {
         var recordKeys = record ? Object.keys(record) : undefined;
         self.res = [];
         if (!ISOPEN) RAISE(-1001,"INVALID_CURSOR","invalid cursor");
         if (recordKeys && recordKeys.length != statementObj.getColumnCount()) RAISE(-6504,"ROWTYPE_MISMATCH","Return types of Result Set variables or query do not match");
         self.res = fetch(statementObj,result_set);
         if (self.res && self.res.length > 0) {
            found = true;
            row_count++;
            if (recordKeys) {
               for(let i = 0;i < self.res.length;i++) {
                  record[recordKeys[i]] = (self.res)[i];
               }
               return false;
            }
            return true;
         } else found = false;
         return false;
      };
   this.CLOSE = function () {
         if (!ISOPEN) RAISE(-1001,"INVALID_CURSOR","invalid cursor");
         found = row_count = result_set_table = total_rows = result_set = statementObj = undefined;
         ISOPEN = false;
      };
   this.FETCH_BULK_COLLECT_INTO = function (variables,limit) {
         if (variables.length != statementObj.getColumnCount()) RAISE(-6504,"ROWTYPE_MISMATCH","Return types of Result Set variables or query do not match");
         if (limit) {
            for(let i = 0;i < limit && this.FETCH();i++)FETCH_INTO_COLLECTIONS(variables,self.res);
         } else {
            while ( this.FETCH() )
               FETCH_INTO_COLLECTIONS(variables,self.res);
         }
      };
   this.FOUND = () => ISOPEN ? typeof(found) == "boolean" ? found : null : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
   this.NOTFOUND = () => ISOPEN ? typeof(found) == "boolean" ? !found : null : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
   this.ROWCOUNT = () => ISOPEN ? row_count : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
   this.ISOPEN = () => ISOPEN;
   this.SAVE_STATE = function () {
         return {
            tempTable : result_set_table,
            position : row_count
         };
      };
   this.RESTORE_STATE = function (tempTable,position) {
         result_set_table = tempTable
         if (result_set_table) {
            isOut = true
            this.OPEN();
            for(let i = 0;i < position;i++)this.FETCH();
         }
      };
   this.ROWTYPE = () => ROWTYPE(stmt,binds());
};
var outCursorResultNumber = 0;
```

## EXEC Helper

> **Note:**
>
> You might also be interested in:
>
> * [DDL - DML Statements.](README.md)
> * [Commit.](README.md)
> * [Execute Immediate.](README.md)

> **Note:**
>
> EXEC helper depends on IS NULL helper.

### Syntax

EXEC(stmt)
EXEC(stmt, binds[])
EXEC(stmt, opts{})
EXEC(stmt, binds[], opts{})

### Parameters

#### stmt

The string of the SQL statement to execute.

#### binds (optional)

An array with the values or the variables to bind into the SQL statement.

#### opts (optional)

This is a Javascript object to describe how the values returned by the exec should be formatted, this is used for SELECT statements.

##### Valid arguments for opts parameter

The following tables describe, how arguments should be sent to opts parameter in EXEC call:

##### Options when a query returns a single row

| opts | description |
| --- | --- |
| { } | When opts is empty or not sent to exec call, the data will be returned inside an array. |
| {vars: 0} | This has the same effect as the default option. It will return the data inside an array. |
| {vars: 1} | This is used when a query returns just one column and one row. EXEC will return the value directly. This is equivalent to EXEC(stmt)[0] |
| {rec:recordVariable} | Used when you want to store the values returned by the query inside a record. Translation of records is described in [Records translation reference](README.md). Record variable should be passed as an argument. |
| {row: 1} | This option returns a copy of ResultSet, this means that the object returned contains the methods described in [ResultSet Snowflake documentation](https://docs.snowflake.com/en/sql-reference/stored-procedures-api.html#object-resultset). |

##### Options when a query returns multiple rows

| opts | Description |
| --- | --- |
| {row:2} | With this option, it always returns a copy of the ResultSet regardless of the number of rows returned by the EXEC. |

##### General options

| opts | Description |
| --- | --- |
| {sql:0} | It makes sure that the SQL implicit Cursor attribute is not modified after executing the statement. |

### EXEC Helper Function Definition

```javascript
var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
var fixBind = function (arg) {
   arg = arg instanceof Date ? formatDate(arg) : IS_NULL(arg) ? null : arg;
   return arg;
};
var _RS, _ROWS, SQLERRM = "normal, successful completion", SQLCODE = 0;
var getObj = (_rs) => Object.assign(new Object(),_rs);
var getRow = (_rs) => (values = Object.values(_rs)) && (values = values.splice(-1 * _rs.getColumnCount())) && values;
var fetch = (_RS,_ROWS,fmode) => _RS.getRowCount() && _ROWS.next() && (fmode ? getObj : getRow)(_ROWS) || (fmode ? new Object() : []);

var EXEC = function (stmt,binds,opts) {
   try {
      binds = !(arguments[1] instanceof Array) && ((opts = arguments[1]) && []) || (binds || []);
      opts = opts || new Object();
      binds = binds ? binds.map(fixBind) : binds;
      _RS = snowflake.createStatement({
            sqlText : stmt,
            binds : binds
         });
      _ROWS = _RS.execute();
      if (opts.sql !== 0) {
         var isSelect = stmt.toUpperCase().trimStart().startsWith("SELECT");
         var affectedRows = isSelect ? _RS.getRowCount() : _RS.getNumRowsAffected();
         SQL.FOUND = affectedRows != 0;
         SQL.NOTFOUND = affectedRows == 0;
         SQL.ROWCOUNT = affectedRows;
      }
      if (opts.row === 2) {
         return _ROWS;
      }
      var INTO = function (opts) {
         if (opts.vars == 1 && _RS.getColumnCount() == 1 && _ROWS.next()) {
            return _ROWS.getColumnValue(1);
         }
         if (opts.rec instanceof Object && _ROWS.next()) {
            var recordKeys = Object.keys(opts.rec);
            Object.assign(opts.rec,Object.fromEntries(new Map(getRow(_ROWS).map((element,Index) => [recordKeys[Index],element]))))
            return opts.rec;
         }
         return fetch(_RS,_ROWS,opts.row);
      };
      var BULK_INTO_COLLECTION = function (into) {
         for(let i = 0;i < _RS.getRowCount();i++) {
            FETCH_INTO_COLLECTIONS(into,fetch(_RS,_ROWS,opts.row));
         }
         return into;
      };
      if (_ROWS.getRowCount() > 0) {
         return _ROWS.getRowCount() == 1 ? INTO(opts) : BULK_INTO_COLLECTION(opts);
      }
   } catch(error) {
      RAISE(error.code,error.name,error.message)
   }
};
```

### Usage Samples

The following code examples illustrates how EXEC works.

#### EXEC simple case

##### Oracle

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC AS
BEGIN
  --CREATES HARDWARE TABLE WITH COLUMNS ID, DEVICE AND COLOR
  --THIS IS AN EXECUTE IMMEDIATE JUST WITH AN STATEMENT
  EXECUTE IMMEDIATE 'CREATE TABLE HARDWARE (ID NUMBER, DEVICE VARCHAR2(15), COLOR VARCHAR(15))';
END;
```

##### Snowflake

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  //CREATES HARDWARE TABLE WITH COLUMNS ID, DEVICE AND COLOR
  //THIS IS AN EXECUTE IMMEDIATE JUST WITH AN STATEMENT
  EXEC(`CREATE OR REPLACE TABLE HARDWARE (ID NUMBER(38, 18),
   DEVICE VARCHAR(15),
   COLOR VARCHAR(15))`);
$$;
```

#### EXEC with bindings

##### Oracle

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC AS
  ID_VAR NUMBER;
  DEVICE_VAR VARCHAR2(15);
  DEV_COLOR  VARCHAR2(15);
  COLOR_VAR  VARCHAR2(15);
BEGIN
  --EXEC WITH BINDINGS
  --INSERTS A ROW WITH  | 12 | MOUSE | BLACK |  VALUES USING DIRECT BINDING FOR MOUSE
  EXECUTE IMMEDIATE 'INSERT INTO HARDWARE VALUES (12, :MOUSE, ''BLACK'')' USING 'MOUSE';

  --INSERTS A ROW WITH  | 13 | KEYBOARD | WHITE |  VALUES USING DIRECT BINDING FOR 13 AND KEYBOARD
  EXECUTE IMMEDIATE 'INSERT INTO HARDWARE VALUES (:ID, :KEYBOARD, ''WHITE'')' USING 13, 'KEYBOARD';

  --INSERTS A ROW WITH  | 14 | HEADSET | GRAY |  VALUES USING BINDING VARIABLES
  ID_VAR := 14;
  DEVICE_VAR := 'HEADSET';
  COLOR_VAR := 'GRAY';
  EXECUTE IMMEDIATE 'INSERT INTO HARDWARE VALUES (:DEV_ID, :DEV_VAR, :DEV_COLOR)' USING  ID_VAR, DEVICE_VAR, COLOR_VAR;
END;
```

##### Snowflake

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let ID_VAR;
  let DEVICE_VAR;
  let DEV_COLOR;
  let COLOR_VAR;
  //EXEC WITH BINDINGS
  //INSERTS A ROW WITH  | 12 | MOUSE | BLACK |  VALUES USING DIRECT BINDING FOR MOUSE
  EXEC(`INSERT INTO HARDWARE
VALUES (12, ?, 'BLACK')`,[`MOUSE`]);
  //INSERTS A ROW WITH  | 13 | KEYBOARD | WHITE |  VALUES USING DIRECT BINDING FOR 13 AND KEYBOARD
  EXEC(`INSERT INTO HARDWARE
VALUES (?, ?, 'WHITE')`,[13,`KEYBOARD`]);

  //INSERTS A ROW WITH  | 14 | HEADSET | GRAY |  VALUES USING BINDING VARIABLES
  ID_VAR = 14;
  DEVICE_VAR = `HEADSET`;
  COLOR_VAR = `GRAY`;
  EXEC(`INSERT INTO HARDWARE
VALUES (?, ?, ?)`,[ID_VAR,DEVICE_VAR,COLOR_VAR]);
$$;
```

#### EXEC with options

##### Oracle

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC AS
BEGIN
  --STORES THE ID INTO ID_VAR
  EXECUTE IMMEDIATE 'SELECT ID FROM HARDWARE WHERE COLOR = ''BLACK''' INTO ID_VAR;
  DBMS_OUTPUT.PUT_LINE(ID_VAR);

  --STORES THE ID AND DEVICE INTO ID_VAR AND DEV_VAR, USING BINDING FOR COLOR
  COLOR_VAR := 'BLACK';
  EXECUTE IMMEDIATE 'SELECT ID, DEVICE FROM HARDWARE WHERE COLOR = :DEV_COLOR' INTO ID_VAR, DEVICE_VAR USING COLOR_VAR;
  DBMS_OUTPUT.PUT_LINE(ID_VAR || ' ' || DEVICE_VAR);
END;
```

##### Snowflake

```javascript
CREATE OR REPLACE PROCEDURE EXECUTE_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  //STORES THE ID INTO ID_VAR
  [ID_VAR] = EXEC(`SELECT ID FROM
   HARDWARE
WHERE COLOR = 'BLACK'`);
  EXEC(`--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
CALL DBMS_OUTPUT.PUT_LINE_UDF(ID_VAR)`);

  //STORES THE ID AND DEVICE INTO ID_VAR AND DEV_VAR, USING BINDING FOR COLOR
  COLOR_VAR = `BLACK`;
  [ID_VAR,DEVICE_VAR] = EXEC(`SELECT ID, DEVICE FROM
   HARDWARE
WHERE COLOR = ?`,[
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT COLOR_VAR MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    COLOR_VAR]);
  EXEC(`--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
CALL DBMS_OUTPUT.PUT_LINE_UDF(NVL(ID_VAR :: STRING, '') || ' ' || NVL(DEVICE_VAR :: STRING, ''))`);
$$;
```

For the following sample, EXEC call returns [12], with object destructuring `ID_VAR` stores 12:

```sql
[ID_VAR] = EXEC(`SELECT ID FROM PUBLIC.HARDWARE WHERE COLOR = 'BLACK'`);
```

The following two EXEC calls are alternative ways for the previous sample without object destructuring:

```sql
ID_VAR = EXEC(`SELECT ID FROM PUBLIC.HARDWARE WHERE COLOR = 'BLACK'`)[0];
ID_VAR = EXEC(`SELECT ID FROM PUBLIC.HARDWARE WHERE COLOR = 'BLACK'`, {vars:1});
```

Object destructuring also works with bindings as you may note on these statements (EXEC call returns [12, “MOUSE”] values):

```sql
COLOR_VAR = `BLACK`;
[ID_VAR,DEVICE_VAR] = EXEC(`SELECT ID, DEVICE FROM PUBLIC.HARDWARE WHERE COLOR = ?`,[COLOR_VAR]);
```

To obtain the actual result set returned by Snowflake, you can use this syntax:

```sql
let RESULT_SET_COPY;
RESULT_SET_COPY = EXEC(`SELECT * FROM PUBLIC.HARDWARE WHERE COLOR = 'BLACK'`, {row:1});
/* RETURNS
{
  "COLOR": "BLACK",
  "DEVICE": "MOUSE",
  "ID": 12,
  "getColumnCount": {},
  ...
  "next": {}
}*/
```

#### EXEC with record types

> **Note:**
>
> You might be interested in [Records transformation](README.md).

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE EXECUTE_PROC AS
  TYPE DEVTRECTYP IS RECORD (
    ID NUMBER(4) NOT NULL := 0,
    DEV_TYPE VARCHAR2(30) NOT NULL := 'UNKNOWN',
    COLOR VARCHAR2(30) := 'GREEN'
  );

  DEV_VARIABLE DEVTRECTYP;
BEGIN

  --STORES THE ROW VALUES IN THE RECORD
  EXECUTE IMMEDIATE 'SELECT * FROM HARDWARE WHERE COLOR = ''BLACK''' INTO DEV_VARIABLE;
  DBMS_OUTPUT.PUT_LINE(DEV_VARIABLE.ID || ' ' || DEV_VARIABLE.DEV_TYPE || ' ' || DEV_VARIABLE.COLOR);
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE EXECUTE_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  class DEVTRECTYP {
    ID = 0
    DEV_TYPE = `UNKNOWN`
    COLOR = `GREEN`
    constructor() {
      [...arguments].map((element,Index) => this[(Object.keys(this))[Index]] = element)
    }
  }
  let DEV_VARIABLE = new DEVTRECTYP();
  //STORES THE ROW VALUES IN THE RECORD
  EXEC(`SELECT * FROM
   HARDWARE
WHERE COLOR = 'BLACK'`,{
    rec : DEV_VARIABLE
  });
  EXEC(`--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
CALL DBMS_OUTPUT.PUT_LINE_UDF(NVL(? :: STRING, '') || ' ' || NVL(? :: STRING, '') || ' ' || NVL(? :: STRING, ''))`,[DEV_VARIABLE.ID,DEV_VARIABLE.DEV_TYPE,DEV_VARIABLE.COLOR]);
$$;
```

> **Warning:**
>
> This is still a work in progress. The transformation to properly store the record values will be:
>
> ```sql
> EXEC(`SELECT * FROM PUBLIC.HARDWARE WHERE COLOR = 'BLACK'`, {rec:DEV_VARIABLE});
> ```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Object may not work.
2. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation

## Implicit Cursor attribute helper

### Overview

These are the attributes that you can use inside Snowflake stored procedures using this helper:

* FOUND
* NOTFOUND
* ROWCOUNT
* ISOPEN

In Snowflake code, inside the procedures, you will find the initialization of these attributes:

```javascript
 var SQL = {
  FOUND : false,
  NOTFOUND : false,
  ROWCOUNT : 0,
  ISOPEN : false
 };
```

The attribute ISOPEN is always false, just like in Oracle.

### Usage Samples

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
VAR1 VARCHAR(100) := '';
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 WHERE COL1 = 1;
    VAR1 := 'Rows affected: ' || TO_CHAR(SQL%ROWCOUNT);
    VAR1 := 'Error: ' || SQLERRM;

    PKG.TEST_PROC1(SQL%ROWCOUNT, SQL%FOUND, SQL%NOTFOUND);
    PKG.TEST_PROC2(SQLCODE);

    SELECT SQL%ROWCOUNT FROM DUAL;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1 = undefined;
    [VAR1] = EXEC(`SELECT
   COL1
FROM
   TABLE1
WHERE COL1 = 1`);
    VAR1 = `Rows affected: ${concatValue((EXEC(`SELECT
   TO_CHAR(?)`,[SQL.ROWCOUNT]))[0])}`;
    VAR1 = `Error: ${concatValue(SQLERRM)}`;
    EXEC(`CALL

PKG.TEST_PROC1(?, ?, ?)`,[SQL.ROWCOUNT,SQL.FOUND,SQL.NOTFOUND]);
    EXEC(`CALL
PKG.TEST_PROC2(?)`,[SQLCODE]);
    EXEC(`SELECT
       ?
    FROM DUAL`,[SQL.ROWCOUNT]);
$$;
```

> **Note:**
>
> SQLCODE and SQLERRM are converted into helper variables with the same name and are bound in the same way as the cursor variables.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## IS NULL Helper

### IS NULL Helper Function Definition

This helper method is used to transform the NULL predicate. It is also used by other helpers to check if a value is null. This is necessary to handle values like NaN or empty strings as nulls.

Oracle handles empty strings as null values. This helper takes that into account.

```javascript
var IS_NULL = (arg) => !(arg || arg === 0);
```

## Like operator Helper

### Like Operator Helper Function Definition

```javascript
function LIKE(expr,pattern,esc,cs) {
   function fixPattern(pattern,esc) {
      const specials = '/.*+?|(){}[]\\'.split('');
      var newPattern = "";
      var fix = (c) => specials.includes(c) ? '\\' + c : c;
      for(var i = 0;i < pattern.length;i++) {
         var c = pattern[i];
         if (c === esc) {
            newPattern += pattern[i + 1]
            i++
         } else if (c === '%') {
            newPattern += ".*?"
         } else if (c === '_') {
            newPattern += "."
         } else if (c === '[' || ']') {
            newPattern += c
         } else newPattern += fix(c)
      }
      return newPattern;
   }
   return new RegExp(`^${fixPattern(pattern,esc)}$`,cs ? '' : 'i').exec(expr) != null;
}
```

## Package variables helper

> **Note:**
>
> You might also be interested in [variables declaration](README.md) and [package variables inside procedures.](README.md)

### Package variables Helper Function Definition

> **Note:**
>
> Helper depends on IS NULL helper

When a package variable is used inside a procedure, the following helper will be generated:

When a package variable is used inside a procedure, the following helper will be generated:

```javascript
function StateManager(packageName,keepInCache) {
   function getTypeChar(arg) {
      if (arg instanceof Date) {
         return "&";
      } else if (typeof arg == "number") {
         return "#";
      } else if (IS_NULL(arg)) {
         return "~";
      } else {
         return "$";
      }
   }
   function deserialize(arg) {
      if (arg === null) return undefined;
      let prefix = arg[0];
      let rest = arg.substr(1);
      switch(prefix) {
         case "&":return new Date(rest);
         case "#":return parseFloat(rest);
         case "$":return rest;
         case "~":return undefined;
         default:return arg;
      }
   }
   function saveVar(varName,value) {
      let varPackageName = `${packageName}.${varName}`;
      let fixedValue = `${getTypeChar(value)}${fixBind(value)}`;
      EXEC("SELECT SETVARIABLE(?,?)",[varPackageName,fixedValue]);
   }
   function readVar(varName) {
      let varPackageName = `${packageName}.${varName}`;
      return deserialize((EXEC("SELECT GETVARIABLE(?)",[varPackageName]))[0]);
   }
   this.saveState = function () {
         let keys = Object.keys(this.cache);
         for(let key of keys) {
            saveVar(key,(this.cache)[key]);
         }
      }
   this.cache = new Object();
   let c = this.cache;
   let rsProxy = new Proxy(this,{
      get : function (target,prop,receiver) {
         if (!target[prop]) {
            c[prop] === undefined && (c[prop] = readVar(prop));
            return c[prop];
         }
         return Reflect.get(...arguments);
      },
      set : function (target,prop,value) {
         if (target[prop]) return;
         c[prop] = value;
         if (!keepInCache) {
            saveVar(prop,value);
         }
      }
   });
   return rsProxy;
};
var PACKAGE_VARIABLES = new StateManager("PACKAGE_VARIABLES",true);
```

A helper instance is created for each package used to access its variables. Variables will be qualified with the name of the package if they are not qualified with it.

At the end of the procedure, the state of the variables used will be saved using the helper.

Note that in the following statement, name of the variable will change to match the package name:

```javascript
var PACKAGE_VARIABLES = new StateManager("PACKAGE_VARIABLES",true);
```

## Raise Helper

> **Note:**
>
> You might be interested in [Errors and Exception Handling.](README.md)

### Raise Helper Function Definition

```javascript
var RAISE = function (code,name,message) {
    message === undefined && ([name,message] = [message,name])
    var error = new Error(message);
    error.name = name
    SQLERRM = `${(SQLCODE = (error.code = code))}: ${message}`
    throw error;
};
```

## ROWTYPE Helper

> **Note:**
>
> You might be interested in ROWTYPE Record Declaration.

### ROWTYPE Helper Function Definition

```javascript
var ROWTYPE = (stmt, binds = [], obj = new Object()) => {
      EXEC(`SELECT * FROM (${stmt}) LIMIT 0`,binds);
      for(let i = 1;i <= _RS.getColumnCount();i++)obj[_ROWS.getColumnName(i)] = null;
      return obj;
   };
```

---
title: SnowConvert AI - Oracle - Joins
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-queries-and-subqueries/joins.md
section: Migrations
---

# SnowConvert AI - Oracle - Joins

> A join is a query that combines rows from two or more tables, views, or materialized views. Oracle Database performs a join whenever multiple tables appear in the `FROM` clause of the query. ([Oracle SQL Language Reference JOINS](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-568EC26F-199A-4339-BFD9-C4A0B9588937))

## Antijoin

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> An antijoin returns rows from the left side of the predicate for which there are no corresponding rows on the right side of the predicate. It returns rows that fail to match (NOT IN) the subquery on the right side. Antijoin transformation cannot be done if the subquery is on an `OR` branch of the `WHERE` clause. ([Oracle SQL Language Reference Anti Join](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-D688F2E3-7F1E-4339-894F-01A73E62328C)).

No special transformation is performed for this kind of *Join* since Snowflake supports the same syntax.

### Sample Source Patterns

> **Note:**
>
> *Order by clause* added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove it to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Where Not In

##### Oracle

```sql
SELECT e.employee_id, e.first_name, e.last_name FROM hr.employees e
WHERE e.department_id NOT IN

    (SELECT h.department_id FROM hr.departments h WHERE location_id = 1700)

ORDER BY e.last_name
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME |
| --- | --- | --- |
| 174 | Ellen | Abel |
| 166 | Sundar | Ande |
| 130 | Mozhe | Atkinson |
| 105 | David | Austin |
| 204 | Hermann | Baer |
| 167 | Amit | Banda |
| 172 | Elizabeth | Bates |
| 192 | Sarah | Bell |
| 151 | David | Bernstein |
| 129 | Laura | Bissot |

##### Snowflake

```sql
SELECT e.employee_id, e.first_name, e.last_name FROM
    hr.employees e
WHERE e.department_id NOT IN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!!
    (SELECT h.department_id FROM
            hr.departments h WHERE location_id = 1700)

ORDER BY e.last_name
    FETCH FIRST 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME |
| --- | --- | --- |
| 174 | Ellen | Abel |
| 166 | Sundar | Ande |
| 130 | Mozhe | Atkinson |
| 105 | David | Austin |
| 204 | Hermann | Baer |
| 167 | Amit | Banda |
| 172 | Elizabeth | Bates |
| 192 | Sarah | Bell |
| 151 | David | Bernstein |
| 129 | Laura | Bissot |

#### Where Not Exists

##### Oracle

```sql
SELECT   d.department_id, d.department_name
FROM     hr.departments d
WHERE    NOT EXISTS

         (SELECT 1 FROM hr.employees E WHERE
         e.department_id = d.department_id)

ORDER BY d.department_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| DEPARTMENT_ID | DEPARTMENT_NAME |
| --- | --- |
| 120 | Treasury |
| 130 | Corporate Tax |
| 140 | Control And Credit |
| 150 | Shareholder Services |
| 160 | Benefits |
| 170 | Manufacturing |
| 180 | Construction |
| 190 | Contracting |
| 200 | Operations |
| 210 | IT Support |

##### Snowflake

```sql
SELECT   d.department_id, d.department_name
FROM
         hr.departments d
WHERE    NOT EXISTS
                  !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!!
         (SELECT 1 FROM
                           hr.employees E WHERE
         e.department_id = d.department_id)

ORDER BY d.department_id
         FETCH FIRST 10 ROWS ONLY;
```

##### Result

| DEPARTMENT_ID | DEPARTMENT_NAME |
| --- | --- |
| 120 | Treasury |
| 130 | Corporate Tax |
| 140 | Control And Credit |
| 150 | Shareholder Services |
| 160 | Benefits |
| 170 | Manufacturing |
| 180 | Construction |
| 190 | Contracting |
| 200 | Operations |
| 210 | IT Support |

### Known issues

#### 1. Results ordering mismatch between languages

The result of the query will have the same content in both database engines but the order might be different if no *Order By* clause is defined in the query.

### Related EWIs

1. [SSC-EWI-0108](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): This subquery matches a pattern considered invalid and may cause compilation errors.

## Band Join

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A **band join** is a special type of nonequijoin in which key values in one data set must fall within the specified range (“band”) of the second data set. The same table can serve as both the first and second data sets. ([Oracle SQL Language Reference BandJoin](https://docs.oracle.com/en/database/oracle/oracle-database/21/tgsql/joins.html#GUID-24F34188-110F-4245-9DE7-43954092AFE0))

In this section, we will see how a band join is executed in Snowflake and the execution plan is very similar to the improved version of Oracle.

### Sample Source Patterns

> **Note:**
>
> *Order by* clause added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove it to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

> **Warning:**
>
> If you migrate this code without the create tables, the converter won’t be able to load semantic information of the columns and a warning will appear on the arithmetic operations.

#### Basic Band Join case

##### Oracle

```sql
SELECT  e1.last_name ||
        ' has salary between 100 less and 100 more than ' ||
        e2.last_name AS "SALARY COMPARISON"
FROM    employees e1,
        employees e2
WHERE   e1.salary
BETWEEN e2.salary - 100
AND     e2.salary + 100
ORDER BY "SALARY COMPARISON"
FETCH FIRST 10 ROWS ONLY
```

##### Result

| SALARY COMPARISON |
| --- |
| Abel has salary between 100 less and 100 more than Abel |
| Abel has salary between 100 less and 100 more than Cambrault |
| Abel has salary between 100 less and 100 more than Raphaely |
| Ande has salary between 100 less and 100 more than Ande |
| Ande has salary between 100 less and 100 more than Mavris |
| Ande has salary between 100 less and 100 more than Vollman |
| Atkinson has salary between 100 less and 100 more than Atkinson |
| Atkinson has salary between 100 less and 100 more than Baida |
| Atkinson has salary between 100 less and 100 more than Gates |
| Atkinson has salary between 100 less and 100 more than Geoni |

##### Snowflake

```sql
SELECT
                NVL(  e1.last_name :: STRING, '') ||
                ' has salary between 100 less and 100 more than ' || NVL(
                e2.last_name :: STRING, '') AS "SALARY COMPARISON"
FROM
                employees e1,
                employees e2
WHERE   e1.salary
BETWEEN
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!! e2.salary - 100
AND
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!     e2.salary + 100
ORDER BY "SALARY COMPARISON"
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| SALARY COMPARISON |
| --- |
| Abel has salary between 100 less and 100 more than Abel |
| Abel has salary between 100 less and 100 more than Cambrault |
| Abel has salary between 100 less and 100 more than Raphaely |
| Ande has salary between 100 less and 100 more than Ande |
| Ande has salary between 100 less and 100 more than Mavris |
| Ande has salary between 100 less and 100 more than Vollman |
| Atkinson has salary between 100 less and 100 more than Atkinson |
| Atkinson has salary between 100 less and 100 more than Baida |
| Atkinson has salary between 100 less and 100 more than Gates |
| Atkinson has salary between 100 less and 100 more than Geoni |

> **Warning:**
>
> Migrating some `SELECT` statements without the corresponding tables could generate the [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues. To avoid this warning, include the `CREATE TABLE` inside the file.

The results are the same making the BAND JOIN functional equivalent.

#### Execution plan

As extra information, the special thing about the band joins is the execution plan.

The following image shows the [enhanced execution plan](https://docs.oracle.com/en/database/oracle/oracle-database/21/tgsql/joins.html#GUID-24F34188-110F-4245-9DE7-43954092AFE0) (implemented since Oracle 12c) for the test query:

And in the following image, we will see the execution plan in Snowflake:

> **Note:**
>
> The execution plan in Snowflake is very similar to Oracle’s optimized version. The final duration and performance of the query will be affected by many other factors and are completely dependent on each DBMS internal functionality.

### Known Issues

#### 1. Results ordering mismatch between languages

The query result will have the same content in both database engines but the order might be different if no *Order By* clause is defined in the query.

### Related EWIs

* [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md)[:](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) Types resolution issues, the arithmetic operation may not behave correctly between string and date.

## Cartesian Products

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> If two tables in a join query have no join condition, then Oracle Database returns their Cartesian product. Oracle combines each row of one table with each row of the other. ([Oracle SQL Reference Cartesian Products Subsection](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-70DD48FA-BF46-4479-9C3F-146C5616E440))

Oracle and Snowflake are also compatible with the ANSI Cross Join syntax that has the same behavior of a cartesian product.

No special transformation is performed for this kind of *Join* since Snowflake supports the same syntax.

### Sample Source Patterns

> **Note:**
>
> *Order by clause* was added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove it to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Implicit Syntax

##### Oracle

```sql
-- Resulting rows
SELECT * FROM hr.employees, hr.departments
ORDER BY first_name
FETCH FIRST 5 ROWS ONLY;

-- Resulting total rows
SELECT COUNT(*) FROM hr.employees, hr.departments;
```

##### Result 1

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 10 | Administration | 200 | 1700 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 50 | Shipping | 121 | 1500 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 40 | Human Resources | 203 | 2400 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 30 | Purchasing | 114 | 1700 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 20 | Marketing | 201 | 1800 |

##### Result 2

| COUNT(\*) |
| --- |
| 2889 |

##### Snowflake

```sql
-- Resulting rows
SELECT * FROM
hr.employees,
hr.departments
ORDER BY first_name
FETCH FIRST 5 ROWS ONLY;

-- Resulting total rows
SELECT COUNT(*) FROM
hr.employees,
hr.departments;
```

##### Result 1

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 | ST_MAN | 8200.00 |  | 100 | 50 | 40 | Human Resources | 203 | 2400 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 | ST_MAN | 8200.00 |  | 100 | 50 | 20 | Marketing | 201 | 1800 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 | ST_MAN | 8200.00 |  | 100 | 50 | 10 | Administration | 200 | 1700 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 | ST_MAN | 8200.00 |  | 100 | 50 | 50 | Shipping | 121 | 1500 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 | ST_MAN | 8200.00 |  | 100 | 50 | 30 | Purchasing | 114 | 1700 |

##### Result 2

| COUNT(\*) |
| --- |
| 2889 |

#### Cross Join Syntax

##### Oracle

```sql
-- Resulting rows
SELECT * FROM hr.employees CROSS join hr.departments
ORDER BY first_name
FETCH FIRST 5 ROWS ONLY;

-- Resulting total rows
SELECT COUNT(*) FROM hr.employees CROSS join hr.departments;
```

##### Result 1

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 10 | Administration | 200 | 1700 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 50 | Shipping | 121 | 1500 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 40 | Human Resources | 203 | 2400 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 30 | Purchasing | 114 | 1700 |
| 121 | Adam | Fripp | AFRIPP | 650.123.2234 | 2005-04-10 00:00:00.000 | ST_MAN | 8200 |  | 100 | 50 | 20 | Marketing | 201 | 1800 |

##### Result 2

| COUNT(\*) |
| --- |
| 2889 |

##### Snowflake

```sql
-- Resulting rows
SELECT * FROM
hr.employees
CROSS join hr.departments
ORDER BY first_name
FETCH FIRST 5 ROWS ONLY;

-- Resulting total rows
SELECT COUNT(*) FROM
hr.employees
CROSS join hr.departments;
```

### Known issues

#### 1. Results ordering mismatch between languages

The result of the query will have the same content in both database engines but the order might be different if no *Order By* clause is defined in the query.

### Related EWIs

No related EWIs.

## Equijoin

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

An equijoin is an implicit form of the join with a join condition containing an equality operator. For more information, see the [Oracle Equijoin documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-3AA5EB23-2D84-4E19-BD7E-E66A3C59D888).

No special transformation is performed for this kind of *Join* since Snowflake supports the same syntax.

### Sample Source Patterns

> **Note:**
>
> *Order by clause* added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, the *Row Limiting Clause* was added. You can remove it to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Basic Equijoin case

##### Oracle

```sql
 SELECT last_name, job_id, hr.departments.department_id, department_name
FROM hr.employees, hr.departments
WHERE hr.employees.department_id = hr.departments.department_id
ORDER BY last_name
FETCH FIRST 5 ROWS ONLY;
```

##### Result

| LAST_NAME | JOB_ID | DEPARTMENT_ID | DEPARTMENT_NAME |
| --- | --- | --- | --- |
| Abel | SA_REP | 80 | Sales |
| Ande | SA_REP | 80 | Sales |
| Atkinson | ST_CLERK | 50 | Shipping |
| Austin | IT_PROG | 60 | IT |
| Baer | PR_REP | 70 | Public Relations |

##### Snowflake

```sql
 SELECT last_name, job_id, hr.departments.department_id, department_name
FROM
hr.employees,
hr.departments
WHERE hr.employees.department_id = hr.departments.department_id
ORDER BY last_name
FETCH FIRST 5 ROWS ONLY;
```

##### Result

| LAST_NAME | JOB_ID | DEPARTMENT_ID | DEPARTMENT_NAME |
| --- | --- | --- | --- |
| Abel | SA_REP | 80 | Sales |
| Ande | SA_REP | 80 | Sales |
| Atkinson | ST_CLERK | 50 | Shipping |
| Austin | IT_PROG | 60 | IT |
| Baer | PR_REP | 70 | Public Relations |

### Known issues

#### 1. Results ordering mismatch between languages

The result of the query will have the same content in both database engines but the order might be different if no *Order By* clause is defined in the query.

### Related EWIs

No related EWIs.

## Inner Join

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> An inner join (sometimes called a simple join) is a join of two or more tables that returns only those rows that satisfy the join condition. ([Oracle SQL Reference Inner Join Subsection](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-794F7DD5-FB18-4ADC-9E46-ADDA8C30C3C6)).

```sql
{ [ INNER ] JOIN table_reference
 { ON condition
 | USING (column [, column ]...)
 }
| { CROSS
 | NATURAL [ INNER ]
 }
 JOIN table_reference
}
```

### Sample Source Patterns

> **Note:**
>
> *Order by* clause added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove this clause to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Basic Inner Join

In the Inner Join clause “INNER” is an optional keyword, the following queries have two selects that retrieve the same data set.

##### Oracle

```sql
 SELECT
    *
FROM
    hr.employees
INNER JOIN hr.departments ON
    hr.departments.department_id = hr.employees.department_id
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;

SELECT
    *
FROM
    hr.employees
JOIN hr.departments ON
    hr.departments.department_id = hr.employees.department_id
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 90 | Executive | 100 | 1700 |
| 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 00:00:00.000 | AD_VP | 17000 |  | 100 | 90 | 90 | Executive | 100 | 1700 |
| 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 00:00:00.000 | AD_VP | 17000 |  | 100 | 90 | 90 | Executive | 100 | 1700 |
| 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 00:00:00.000 | IT_PROG | 9000 |  | 102 | 60 | 60 | IT | 103 | 1400 |
| 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 00:00:00.000 | IT_PROG | 6000 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 00:00:00.000 | IT_PROG | 4800 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 00:00:00.000 | IT_PROG | 4800 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 00:00:00.000 | IT_PROG | 4200 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 00:00:00.000 | FI_MGR | 12008 |  | 101 | 100 | 100 | Finance | 108 | 1700 |
| 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 00:00:00.000 | FI_ACCOUNT | 9000 |  | 108 | 100 | 100 | Finance | 108 | 1700 |

##### Snowflake

```sql
 SELECT
    *
FROM
hr.employees
INNER JOIN
    hr.departments
    ON
    hr.departments.department_id = hr.employees.department_id
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;

SELECT
    *
FROM
    hr.employees
JOIN
    hr.departments
    ON
    hr.departments.department_id = hr.employees.department_id
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 90 | Executive | 100 | 1700 |
| 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 | AD_VP | 17000.00 |  | 100 | 90 | 90 | Executive | 100 | 1700 |
| 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 | AD_VP | 17000.00 |  | 100 | 90 | 90 | Executive | 100 | 1700 |
| 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 | IT_PROG | 9000.00 |  | 102 | 60 | 60 | IT | 103 | 1400 |
| 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 | IT_PROG | 6000.00 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 | IT_PROG | 4800.00 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 | IT_PROG | 4800.00 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 | IT_PROG | 4200.00 |  | 103 | 60 | 60 | IT | 103 | 1400 |
| 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 | FI_MGR | 12008.00 |  | 101 | 100 | 100 | Finance | 108 | 1700 |
| 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 | FI_ACCOUNT | 9000.00 |  | 108 | 100 | 100 | Finance | 108 | 1700 |

#### Inner Join with using clause

##### Oracle

```sql
SELECT
    *
FROM
    hr.employees
INNER JOIN hr.departments
    USING(department_id)
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 90 | 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | Executive | 100 | 1700 |
| 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 00:00:00.000 | AD_VP | 17000 |  | 100 | Executive | 100 | 1700 |
| 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 00:00:00.000 | AD_VP | 17000 |  | 100 | Executive | 100 | 1700 |
| 60 | 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 00:00:00.000 | IT_PROG | 9000 |  | 102 | IT | 103 | 1400 |
| 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 00:00:00.000 | IT_PROG | 6000 |  | 103 | IT | 103 | 1400 |
| 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 00:00:00.000 | IT_PROG | 4800 |  | 103 | IT | 103 | 1400 |
| 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 00:00:00.000 | IT_PROG | 4800 |  | 103 | IT | 103 | 1400 |
| 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 00:00:00.000 | IT_PROG | 4200 |  | 103 | IT | 103 | 1400 |
| 100 | 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 00:00:00.000 | FI_MGR | 12008 |  | 101 | Finance | 108 | 1700 |
| 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 00:00:00.000 | FI_ACCOUNT | 9000 |  | 108 | Finance | 108 | 1700 |

##### Snowflake

```sql
SELECT
    *
FROM
hr.employees
INNER JOIN
    hr.departments
    USING(department_id)
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 90 | 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | Executive | 100 | 1700 |
| 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 | AD_VP | 17000.00 |  | 100 | Executive | 100 | 1700 |
| 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 | AD_VP | 17000.00 |  | 100 | Executive | 100 | 1700 |
| 60 | 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 | IT_PROG | 9000.00 |  | 102 | IT | 103 | 1400 |
| 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 | IT_PROG | 6000.00 |  | 103 | IT | 103 | 1400 |
| 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 | IT_PROG | 4800.00 |  | 103 | IT | 103 | 1400 |
| 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 | IT_PROG | 4800.00 |  | 103 | IT | 103 | 1400 |
| 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 | IT_PROG | 4200.00 |  | 103 | IT | 103 | 1400 |
| 100 | 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 | FI_MGR | 12008.00 |  | 101 | Finance | 108 | 1700 |
| 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 | FI_ACCOUNT | 9000.00 |  | 108 | Finance | 108 | 1700 |

#### Cross Inner Join

##### Oracle

```sql
SELECT
    *
FROM
    hr.employees
CROSS JOIN hr.departments
ORDER BY department_name, employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 110 | Accounting | 205 | 1700 |
| 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 00:00:00.000 | AD_VP | 17000 |  | 100 | 90 | 110 | Accounting | 205 | 1700 |
| 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 00:00:00.000 | AD_VP | 17000 |  | 100 | 90 | 110 | Accounting | 205 | 1700 |
| 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 00:00:00.000 | IT_PROG | 9000 |  | 102 | 60 | 110 | Accounting | 205 | 1700 |
| 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 00:00:00.000 | IT_PROG | 6000 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 00:00:00.000 | IT_PROG | 4800 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 00:00:00.000 | IT_PROG | 4800 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 00:00:00.000 | IT_PROG | 4200 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 00:00:00.000 | FI_MGR | 12008 |  | 101 | 100 | 110 | Accounting | 205 | 1700 |
| 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 00:00:00.000 | FI_ACCOUNT | 9000 |  | 108 | 100 | 110 | Accounting | 205 | 1700 |

##### Snowflake

```sql
 SELECT
    *
FROM
hr.employees
CROSS JOIN hr.departments
ORDER BY department_name, employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 110 | Accounting | 205 | 1700 |
| 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 | AD_VP | 17000.00 |  | 100 | 90 | 110 | Accounting | 205 | 1700 |
| 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 | AD_VP | 17000.00 |  | 100 | 90 | 110 | Accounting | 205 | 1700 |
| 103 | Alexander | Hunold | AHUNOLD | 590.423.4567 | 2006-01-03 | IT_PROG | 9000.00 |  | 102 | 60 | 110 | Accounting | 205 | 1700 |
| 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 | IT_PROG | 6000.00 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 | IT_PROG | 4800.00 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 | IT_PROG | 4800.00 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 | IT_PROG | 4200.00 |  | 103 | 60 | 110 | Accounting | 205 | 1700 |
| 108 | Nancy | Greenberg | NGREENBE | 515.124.4569 | 2002-08-17 | FI_MGR | 12008.00 |  | 101 | 100 | 110 | Accounting | 205 | 1700 |
| 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 | FI_ACCOUNT | 9000.00 |  | 108 | 100 | 110 | Accounting | 205 | 1700 |

#### Natural Inner Join

##### Oracle

```sql
SELECT
    *
FROM
    hr.employees
NATURAL JOIN hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| MANAGER_ID | DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | DEPARTMENT_NAME | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 00:00:00.000 | AD_VP | 17000 |  | Executive | 1700 |
| 100 | 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 00:00:00.000 | AD_VP | 17000 |  | Executive | 1700 |
| 103 | 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 00:00:00.000 | IT_PROG | 6000 |  | IT | 1400 |
| 103 | 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 00:00:00.000 | IT_PROG | 4800 |  | IT | 1400 |
| 103 | 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 00:00:00.000 | IT_PROG | 4800 |  | IT | 1400 |
| 103 | 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 00:00:00.000 | IT_PROG | 4200 |  | IT | 1400 |
| 108 | 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 00:00:00.000 | FI_ACCOUNT | 9000 |  | Finance | 1700 |
| 108 | 100 | 110 | John | Chen | JCHEN | 515.124.4269 | 2005-09-28 00:00:00.000 | FI_ACCOUNT | 8200 |  | Finance | 1700 |
| 108 | 100 | 111 | Ismael | Sciarra | ISCIARRA | 515.124.4369 | 2005-09-30 00:00:00.000 | FI_ACCOUNT | 7700 |  | Finance | 1700 |
| 108 | 100 | 112 | Jose Manuel | Urman | JMURMAN | 515.124.4469 | 2006-03-07 00:00:00.000 | FI_ACCOUNT | 7800 |  | Finance | 1700 |

##### Snowflake

```sql
SELECT
    *
FROM
hr.employees
NATURAL JOIN
    hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| MANAGER_ID | DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | DEPARTMENT_NAME | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 | AD_VP | 17000.00 |  | Executive | 1700 |
| 100 | 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 | AD_VP | 17000.00 |  | Executive | 1700 |
| 103 | 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 | IT_PROG | 6000.00 |  | IT | 1400 |
| 103 | 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 | IT_PROG | 4800.00 |  | IT | 1400 |
| 103 | 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 | IT_PROG | 4800.00 |  | IT | 1400 |
| 103 | 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 | IT_PROG | 4200.00 |  | IT | 1400 |
| 108 | 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 | FI_ACCOUNT | 9000.00 |  | Finance | 1700 |
| 108 | 100 | 110 | John | Chen | JCHEN | 515.124.4269 | 2005-09-28 | FI_ACCOUNT | 8200.00 |  | Finance | 1700 |
| 108 | 100 | 111 | Ismael | Sciarra | ISCIARRA | 515.124.4369 | 2005-09-30 | FI_ACCOUNT | 7700.00 |  | Finance | 1700 |
| 108 | 100 | 112 | Jose Manuel | Urman | JMURMAN | 515.124.4469 | 2006-03-07 | FI_ACCOUNT | 7800.00 |  | Finance | 1700 |

#### Cross Natural Join

##### Oracle

```sql
SELECT
    *
FROM
    hr.employees
CROSS NATURAL JOIN hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| MANAGER_ID | DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | DEPARTMENT_NAME | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 00:00:00.000 | AD_VP | 17000 |  | Executive | 1700 |
| 100 | 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 00:00:00.000 | AD_VP | 17000 |  | Executive | 1700 |
| 103 | 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 00:00:00.000 | IT_PROG | 6000 |  | IT | 1400 |
| 103 | 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 00:00:00.000 | IT_PROG | 4800 |  | IT | 1400 |
| 103 | 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 00:00:00.000 | IT_PROG | 4800 |  | IT | 1400 |
| 103 | 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 00:00:00.000 | IT_PROG | 4200 |  | IT | 1400 |
| 108 | 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 00:00:00.000 | FI_ACCOUNT | 9000 |  | Finance | 1700 |
| 108 | 100 | 110 | John | Chen | JCHEN | 515.124.4269 | 2005-09-28 00:00:00.000 | FI_ACCOUNT | 8200 |  | Finance | 1700 |
| 108 | 100 | 111 | Ismael | Sciarra | ISCIARRA | 515.124.4369 | 2005-09-30 00:00:00.000 | FI_ACCOUNT | 7700 |  | Finance | 1700 |
| 108 | 100 | 112 | Jose Manuel | Urman | JMURMAN | 515.124.4469 | 2006-03-07 00:00:00.000 | FI_ACCOUNT | 7800 |  | Finance | 1700 |

##### Snowflake

```sql
SELECT
    *
FROM
    hr.employees
    NATURAL JOIN
        hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| MANAGER_ID | DEPARTMENT_ID | EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | DEPARTMENT_NAME | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | 90 | 101 | Neena | Kochhar | NKOCHHAR | 515.123.4568 | 2005-09-21 | AD_VP | 17000.00 |  | Executive | 1700 |
| 100 | 90 | 102 | Lex | De Haan | LDEHAAN | 515.123.4569 | 2001-01-13 | AD_VP | 17000.00 |  | Executive | 1700 |
| 103 | 60 | 104 | Bruce | Ernst | BERNST | 590.423.4568 | 2007-05-21 | IT_PROG | 6000.00 |  | IT | 1400 |
| 103 | 60 | 105 | David | Austin | DAUSTIN | 590.423.4569 | 2005-06-25 | IT_PROG | 4800.00 |  | IT | 1400 |
| 103 | 60 | 106 | Valli | Pataballa | VPATABAL | 590.423.4560 | 2006-02-05 | IT_PROG | 4800.00 |  | IT | 1400 |
| 103 | 60 | 107 | Diana | Lorentz | DLORENTZ | 590.423.5567 | 2007-02-07 | IT_PROG | 4200.00 |  | IT | 1400 |
| 108 | 100 | 109 | Daniel | Faviet | DFAVIET | 515.124.4169 | 2002-08-16 | FI_ACCOUNT | 9000.00 |  | Finance | 1700 |
| 108 | 100 | 110 | John | Chen | JCHEN | 515.124.4269 | 2005-09-28 | FI_ACCOUNT | 8200.00 |  | Finance | 1700 |
| 108 | 100 | 111 | Ismael | Sciarra | ISCIARRA | 515.124.4369 | 2005-09-30 | FI_ACCOUNT | 7700.00 |  | Finance | 1700 |
| 108 | 100 | 112 | Jose Manuel | Urman | JMURMAN | 515.124.4469 | 2006-03-07 | FI_ACCOUNT | 7800.00 |  | Finance | 1700 |

#### Natural Cross Join

##### Oracle

```sql
SELECT
    *
FROM
    hr.employees
NATURAL CROSS JOIN hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 10 | Administration | 200 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 100 | Finance | 108 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 90 | Executive | 100 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 80 | Sales | 145 | 2500 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 70 | Public Relations | 204 | 2700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 60 | IT | 103 | 1400 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 50 | Shipping | 121 | 1500 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 40 | Human Resources | 203 | 2400 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 30 | Purchasing | 114 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 00:00:00.000 | AD_PRES | 24000 |  |  | 90 | 20 | Marketing | 201 | 1800 |

##### Snowflake

```sql
SELECT
    *
FROM
    hr.employees
    CROSS JOIN hr.departments
ORDER BY employee_id
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| EMPLOYEE_ID | FIRST_NAME | LAST_NAME | EMAIL | PHONE_NUMBER | HIRE_DATE | JOB_ID | SALARY | COMMISSION_PCT | MANAGER_ID | DEPARTMENT_ID | DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 80 | Sales | 145 | 2500 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 20 | Marketing | 201 | 1800 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 60 | IT | 103 | 1400 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 70 | Public Relations | 204 | 2700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 90 | Executive | 100 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 30 | Purchasing | 114 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 10 | Administration | 200 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 100 | Finance | 108 | 1700 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 50 | Shipping | 121 | 1500 |
| 100 | Steven | King | SKING | 515.123.4567 | 2003-06-17 | AD_PRES | 24000.00 |  |  | 90 | 40 | Human Resources | 203 | 2400 |

### Known issues

#### 1. Results ordering mismatch between languages

The result of the query will have the same content in both database engines but the order might be different if no *Order By* clause is defined in the query.

### Related EWIs

No related EWIs.

## Outer Join

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> An outer join extends the result of a simple join. An outer join returns all rows that satisfy the join condition and returns some or all those rows from one table for which no rows from the other satisfy the join condition. ([Oracle SQL Language Reference Outer Joins Subsection](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-29A4584C-0741-4E6A-A89B-DCFAA222994A)).

#### Oracle ANSI syntax

```sql
[ query_partition_clause ] [ NATURAL ]
outer_join_type JOIN table_reference
 [ query_partition_clause ]
 [ ON condition
 | USING ( column [, column ]...)
 ]
```

```sql
outer_join_type
{ FULL | LEFT | RIGHT } [ OUTER ]
```

Oracle also supports the (+) operator that can be used to do outer joins. This operator is added to a column expression in the WHERE clause.

```sql
column_expression (+)
```

#### Snowflake ANSI syntax

Snowflake also supports the ANSI syntax for OUTER JOINS, just like Oracle. However, the behavior when using the (+) operator might be different depending on the usage. For more information, see the [Snowflake JOIN documentation](https://docs.snowflake.com/en/sql-reference/constructs/join.html).

The Snowflake grammar is one of the following:

```sql
SELECT ...
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                   ]
                   JOIN <object_ref2>
  [ ON <condition> ]
[ ... ]
```

```sql
SELECT *
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                   ]
                   JOIN <object_ref2>
  [ USING( <column_list> ) ]
[ ... ]
```

```sql
SELECT ...
FROM <object_ref1> [
                     {
                       | NATURAL [ { LEFT | RIGHT | FULL } [ OUTER ] ]
                       | CROSS
                     }
                   ]
                   JOIN <object_ref2>
[ ... ]
```

### Sample Source Patterns

> **Note:**
>
> *Order by* clause added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove it to retrieve the entire result set.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

> **Note:**
>
> For the following examples, these inserts and alter statements were executed to distinguish better the result for each kind of JOIN:

```sql
INSERT INTO hr.regions VALUES (5, 'Oceania');
ALTER TABLE hr.countries DROP CONSTRAINT countr_reg_fk;
INSERT INTO hr.countries VALUES ('--', 'Unknown Country', 0);
```

#### 1. ANSI syntax

Snowflake fully supports the ANSI syntax for SQL JOINS. The behavior is the same for both database engines.

#### Left Outer Join On

##### Oracle

```sql
SELECT * FROM
hr.countries c
LEFT OUTER JOIN hr.regions r ON c.region_id = r.region_id
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0 |  |  |
| AR | Argentina | 2 | 2 | Americas |
| AU | Australia | 3 | 3 | Asia |
| BE | Belgium | 1 | 1 | Europe |
| BR | Brazil | 2 | 2 | Americas |
| CA | Canada | 2 | 2 | Americas |
| CH | Switzerland | 1 | 1 | Europe |
| CN | China | 3 | 3 | Asia |
| DE | Germany | 1 | 1 | Europe |
| DK | Denmark | 1 | 1 | Europe |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
LEFT OUTER JOIN
hr.regions r ON c.region_id = r.region_id
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0.0000000000000000000 |  |  |
| AR | Argentina | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| AU | Australia | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| BE | Belgium | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| BR | Brazil | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| CA | Canada | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| CH | Switzerland | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| CN | China | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| DE | Germany | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| DK | Denmark | 1.0000000000000000000 | 1.0000000000000000000 | Europe |

#### Right Outer Join On

##### Oracle

```sql
SELECT * FROM
hr.countries c
RIGHT OUTER JOIN hr.regions r ON c.region_id = r.region_id
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – |  |  | 5 | Oceania |
| ZW | Zimbabwe | 4 | 4 | Middle East and Africa |
| ZM | Zambia | 4 | 4 | Middle East and Africa |
| US | United States of America | 2 | 2 | Americas |
| UK | United Kingdom | 1 | 1 | Europe |
| SG | Singapore | 3 | 3 | Asia |
| NL | Netherlands | 1 | 1 | Europe |
| NG | Nigeria | 4 | 4 | Middle East and Africa |
| MX | Mexico | 2 | 2 | Americas |
| ML | Malaysia | 3 | 3 | Asia |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
RIGHT OUTER JOIN
hr.regions r ON c.region_id = r.region_id
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – |  | 5.0000000000000000000 | Oceania |  |
| ZW | Zimbabwe | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| ZM | Zambia | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| US | United States of America | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| UK | United Kingdom | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| SG | Singapore | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| NL | Netherlands | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| NG | Nigeria | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| MX | Mexico | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| ML | Malaysia | 3.0000000000000000000 | 3.0000000000000000000 | Asia |

#### Full Outer Join On

##### Oracle

```sql
SELECT * FROM
hr.countries c
FULL OUTER JOIN hr.regions r ON c.region_id = r.region_id
ORDER BY r.region_name DESC, c.country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0 |  |  |
| – |  |  | 5 | Oceania |
| EG | Egypt | 4 | 4 | Middle East and Africa |
| IL | Israel | 4 | 4 | Middle East and Africa |
| KW | Kuwait | 4 | 4 | Middle East and Africa |
| NG | Nigeria | 4 | 4 | Middle East and Africa |
| ZM | Zambia | 4 | 4 | Middle East and Africa |
| ZW | Zimbabwe | 4 | 4 | Middle East and Africa |
| BE | Belgium | 1 | 1 | Europe |
| CH | Switzerland | 1 | 1 | Europe |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
FULL OUTER JOIN
hr.regions r ON c.region_id = r.region_id
ORDER BY r.region_name DESC, c.country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0.0000000000000000000 |  |  |
| – |  |  | 5.0000000000000000000 | Oceania |
| EG | Egypt | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| IL | Israel | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| KW | Kuwait | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| NG | Nigeria | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| ZM | Zambia | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| ZW | Zimbabwe | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| BE | Belgium | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| CH | Switzerland | 1.0000000000000000000 | 1.0000000000000000000 | Europe |

#### 2. Natural Outer Join

Both Oracle and Snowflake support the Natural Outer Join and they behave the same.

> A NATURAL JOIN is identical to an explicit JOIN on the common columns of the two tables, except that the common columns are included only once in the output. (A natural join assumes that columns with the same name, but in different tables, contain corresponding data.)([Snowflake SQL Language Reference JOIN](https://docs.snowflake.com/en/sql-reference/constructs/join.html))

#### Natural Left Outer Join

##### Oracle

```sql
SELECT * FROM
hr.countries c
NATURAL LEFT OUTER JOIN hr.regions r
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 0 | – | Unknown Country |  |
| 2 | AR | Argentina | Americas |
| 3 | AU | Australia | Asia |
| 1 | BE | Belgium | Europe |
| 2 | BR | Brazil | Americas |
| 2 | CA | Canada | Americas |
| 1 | CH | Switzerland | Europe |
| 3 | CN | China | Asia |
| 1 | DE | Germany | Europe |
| 1 | DK | Denmark | Europe |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
NATURAL LEFT OUTER JOIN
hr.regions r
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 0.0000000000000000000 | – | Unknown Country |  |
| 2.0000000000000000000 | AR | Argentina | Americas |
| 3.0000000000000000000 | AU | Australia | Asia |
| 1.0000000000000000000 | BE | Belgium | Europe |
| 2.0000000000000000000 | BR | Brazil | Americas |
| 2.0000000000000000000 | CA | Canada | Americas |
| 1.0000000000000000000 | CH | Switzerland | Europe |
| 3.0000000000000000000 | CN | China | Asia |
| 1.0000000000000000000 | DE | Germany | Europe |
| 1.0000000000000000000 | DK | Denmark | Europe |

#### Natural Right Outer Join

##### Oracle

```sql
SELECT * FROM
hr.countries c
NATURAL RIGHT OUTER JOIN hr.regions r
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 5 |  |  | Oceania |
| 4 | ZW | Zimbabwe | Middle East and Africa |
| 4 | ZM | Zambia | Middle East and Africa |
| 2 | US | United States of America | Americas |
| 1 | UK | United Kingdom | Europe |
| 3 | SG | Singapore | Asia |
| 1 | NL | Netherlands | Europe |
| 4 | NG | Nigeria | Middle East and Africa |
| 2 | MX | Mexico | Americas |
| 3 | ML | Malaysia | Asia |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
NATURAL RIGHT OUTER JOIN
hr.regions r
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 5.0000000000000000000 |  |  | Oceania |
| 4.0000000000000000000 | ZW | Zimbabwe | Middle East and Africa |
| 4.0000000000000000000 | ZM | Zambia | Middle East and Africa |
| 2.0000000000000000000 | US | United States of America | Americas |
| 1.0000000000000000000 | UK | United Kingdom | Europe |
| 3.0000000000000000000 | SG | Singapore | Asia |
| 1.0000000000000000000 | NL | Netherlands | Europe |
| 4.0000000000000000000 | NG | Nigeria | Middle East and Africa |
| 2.0000000000000000000 | MX | Mexico | Americas |
| 3.0000000000000000000 | ML | Malaysia | Asia |

#### 3. Basic Outer Join with USING

Table columns can be joined using the USING keyword. The results will be the same as a basic OUTER JOIN with the ON keyword.

#### Left Outer Join Using

##### Oracle

```sql
SELECT * FROM
hr.countries c
LEFT OUTER JOIN hr.regions r USING (region_id)
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 0 | – | Unknown Country |  |
| 2 | AR | Argentina | Americas |
| 3 | AU | Australia | Asia |
| 1 | BE | Belgium | Europe |
| 2 | BR | Brazil | Americas |
| 2 | CA | Canada | Americas |
| 1 | CH | Switzerland | Europe |
| 3 | CN | China | Asia |
| 1 | DE | Germany | Europe |
| 1 | DK | Denmark | Europe |

##### Snowflake

```sql
SELECT * FROM
hr.countries c
LEFT OUTER JOIN
hr.regions r USING (region_id)
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | COUNTRY_ID | COUNTRY_NAME | REGION_NAME |
| --- | --- | --- | --- |
| 0.0000000000000000000 | – | Unknown Country |  |
| 2.0000000000000000000 | AR | Argentina | Americas |
| 3.0000000000000000000 | AU | Australia | Asia |
| 1.0000000000000000000 | BE | Belgium | Europe |
| 2.0000000000000000000 | BR | Brazil | Americas |
| 2.0000000000000000000 | CA | Canada | Americas |
| 1.0000000000000000000 | CH | Switzerland | Europe |
| 3.0000000000000000000 | CN | China | Asia |
| 1.0000000000000000000 | DE | Germany | Europe |
| 1.0000000000000000000 | DK | Denmark | Europe |

#### 4. (+) Operator

Oracle and Snowflake have a (+) operator that can be used for outer joins too. In some cases, Snowflake may not work properly when using this operator.

For more information regarding this operator in Snowflake, check [this](https://docs.snowflake.com/en/sql-reference/constructs/where.html#joins-in-the-where-clause).

#### Left Outer Join with (+) operator

##### Oracle

```sql
SELECT * FROM hr.countries c, hr.regions r
WHERE c.region_id = r.region_id(+)
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0 |  |  |
| AR | Argentina | 2 | 2 | Americas |
| AU | Australia | 3 | 3 | Asia |
| BE | Belgium | 1 | 1 | Europe |
| BR | Brazil | 2 | 2 | Americas |
| CA | Canada | 2 | 2 | Americas |
| CH | Switzerland | 1 | 1 | Europe |
| CN | China | 3 | 3 | Asia |
| DE | Germany | 1 | 1 | Europe |
| DK | Denmark | 1 | 1 | Europe |

##### Snowflake

```sql
SELECT * FROM
hr.countries c,
hr.regions r
WHERE c.region_id = r.region_id(+)
ORDER BY country_id
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – | Unknown Country | 0.0000000000000000000 |  |  |
| AR | Argentina | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| AU | Australia | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| BE | Belgium | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| BR | Brazil | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| CA | Canada | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| CH | Switzerland | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| CN | China | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| DE | Germany | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| DK | Denmark | 1.0000000000000000000 | 1.0000000000000000000 | Europe |

#### Right Outer Join with (+) operator

##### Oracle

```sql
SELECT * FROM hr.countries c, hr.regions r
WHERE c.region_id (+) = r.region_id
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – |  |  | 5 | Oceania |
| ZW | Zimbabwe | 4 | 4 | Middle East and Africa |
| ZM | Zambia | 4 | 4 | Middle East and Africa |
| US | United States of America | 2 | 2 | Americas |
| UK | United Kingdom | 1 | 1 | Europe |
| SG | Singapore | 3 | 3 | Asia |
| NL | Netherlands | 1 | 1 | Europe |
| NG | Nigeria | 4 | 4 | Middle East and Africa |
| MX | Mexico | 2 | 2 | Americas |
| ML | Malaysia | 3 | 3 | Asia |

##### Snowflake

```sql
SELECT * FROM
hr.countries c,
hr.regions r
WHERE c.region_id (+) = r.region_id
ORDER BY country_id DESC
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME |
| --- | --- | --- | --- | --- |
| – |  |  | 5.0000000000000000000 | Oceania |
| ZW | Zimbabwe | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| ZM | Zambia | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| US | United States of America | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| UK | United Kingdom | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| SG | Singapore | 3.0000000000000000000 | 3.0000000000000000000 | Asia |
| NL | Netherlands | 1.0000000000000000000 | 1.0000000000000000000 | Europe |
| NG | Nigeria | 4.0000000000000000000 | 4.0000000000000000000 | Middle East and Africa |
| MX | Mexico | 2.0000000000000000000 | 2.0000000000000000000 | Americas |
| ML | Malaysia | 3.0000000000000000000 | 3.0000000000000000000 | Asia |

#### Single table joined with multiple tables with (+)

In Oracle, you can join a single table with multiple tables using the (+) operator, however, Snowflake does not support this. Queries with this kind of Outer Joins will be changed to ANSI syntax.

##### Oracle

```sql
SELECT
c.country_id,
c.country_name,
r.region_id,
r.region_name,
l.location_id,
l.street_address,
l.postal_code,
l.city
FROM
hr.countries c, hr.regions r,  hr.locations l
WHERE
c.region_id(+) = r.region_id AND
l.country_id = c.country_id(+)
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_NAME | LOCATION_ID | STREET_ADDRESS | POSTAL_CODE | CITY |
| --- | --- | --- | --- | --- | --- | --- | --- |
|  |  | 1 | Europe | 2000 | 40-5-12 Laogianggen | 190518 | Beijing |
| CH | Switzerland | 1 | Europe | 3000 | Murtenstrasse 921 | 3095 | Bern |
|  |  | 1 | Europe | 2100 | 1298 Vileparle (E) | 490231 | Bombay |
| CH | Switzerland | 1 | Europe | 2900 | 20 Rue des Corps-Saints | 1730 | Geneva |
|  |  | 1 | Europe | 1300 | 9450 Kamiya-cho | 6823 | Hiroshima |
| UK | United Kingdom | 1 | Europe | 2400 | 8204 Arthur St |  | London |
|  |  | 1 | Europe | 3200 | Mariano Escobedo 9991 | 11932 | Mexico City |
| DE | Germany | 1 | Europe | 2700 | Schwanthalerstr. 7031 | 80925 | Munich |
| UK | United Kingdom | 1 | Europe | 2500 | Magdalen Centre, The Oxford Science Park | OX9 9ZB | Oxford |
| IT | Italy | 1 | Europe | 1000 | 1297 Via Cola di Rie | 00989 | Roma |

##### Snowflake

```sql
SELECT
c.country_id,
c.country_name,
r.region_id,
r.region_name,
l.location_id,
l.street_address,
l.postal_code,
l.city
FROM
hr.regions r
CROSS JOIN hr.locations l
LEFT OUTER JOIN
hr.countries c
ON
c.region_id = r.region_id
AND
l.country_id = c.country_id
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_NAME | LOCATION_ID | STREET_ADDRESS | POSTAL_CODE | CITY |
| --- | --- | --- | --- | --- | --- | --- | --- |
|  |  | 1.0000000000000000000 | Europe | 2000 | 40-5-12 Laogianggen | 190518 | Beijing |
| CH | Switzerland | 1.0000000000000000000 | Europe | 3000 | Murtenstrasse 921 | 3095 | Bern |
|  |  | 1.0000000000000000000 | Europe | 2100 | 1298 Vileparle (E) | 490231 | Bombay |
| CH | Switzerland | 1.0000000000000000000 | Europe | 2900 | 20 Rue des Corps-Saints | 1730 | Geneva |
|  |  | 1.0000000000000000000 | Europe | 1300 | 9450 Kamiya-cho | 6823 | Hiroshima |
| UK | United Kingdom | 1.0000000000000000000 | Europe | 2400 | 8204 Arthur St |  | London |
|  |  | 1.0000000000000000000 | Europe | 3200 | Mariano Escobedo 9991 | 11932 | Mexico City |
| DE | Germany | 1.0000000000000000000 | Europe | 2700 | Schwanthalerstr. 7031 | 80925 | Munich |
| UK | United Kingdom | 1.0000000000000000000 | Europe | 2500 | Magdalen Centre, The Oxford Science Park | OX9 9ZB | Oxford |
| IT | Italy | 1.0000000000000000000 | Europe | 1000 | 1297 Via Cola di Rie | 00989 | Roma |

#### Using (+) operator with a column from a not-joined table and a non-column value

In Oracle, you can use the (+) operator with a Column and join it with a value that is not a column from another table. Snowflake can also do this but it will fail if the table of the column was not joined with another table. To solve this issue, the (+) operator is removed from the query when this scenario happens and the result will be the same as in Oracle.

##### Oracle

```sql
SELECT * FROM hr.regions r
WHERE
r.region_name (+) LIKE 'A%'
ORDER BY region_id;
```

##### Result

| REGION_ID | REGION_NAME |
| --- | --- |
| 2 | Americas |
| 3 | Asia |

##### Snowflake

```sql
SELECT * FROM
hr.regions r
WHERE
r.region_name LIKE 'A%'
ORDER BY region_id;
```

##### Result

| REGION_ID | REGION_NAME |
| --- | --- |
| 2.0000000000000000000 | Americas |
| 3.0000000000000000000 | Asia |

### Known issues

For all the unsupported cases, please check the related EWIs to obtain recommendations and possible workarounds.

#### 1. Converted Outer Joins to ANSI syntax might reorder the columns

When a query with a non-ANSI Outer Join is converted to an ANSI Outer Join, it may change the order of the columns in the converted query. To fix this issue, try to select the columns in the specific order required.

##### Oracle

```sql
SELECT
*
FROM
hr.countries c, hr.regions r,  hr.locations l
WHERE
c.region_id(+) = r.region_id AND
l.country_id = c.country_id(+)
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME | LOCATION_ID | STREET_ADDRESS | POSTAL_CODE | CITY | STATE_PROVINCE | COUNTRY_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
|  |  |  | 1 | Europe | 2000 | 40-5-12 Laogianggen | 190518 | Beijing |  | CN |
| CH | Switzerland | 1 | 1 | Europe | 3000 | Murtenstrasse 921 | 3095 | Bern | BE | CH |
|  |  |  | 1 | Europe | 2100 | 1298 Vileparle (E) | 490231 | Bombay | Maharashtra | IN |
| CH | Switzerland | 1 | 1 | Europe | 2900 | 20 Rue des Corps-Saints | 1730 | Geneva | Geneve | CH |
|  |  |  | 1 | Europe | 1300 | 9450 Kamiya-cho | 6823 | Hiroshima |  | JP |
| UK | United Kingdom | 1 | 1 | Europe | 2400 | 8204 Arthur St |  | London |  | UK |
|  |  |  | 1 | Europe | 3200 | Mariano Escobedo 9991 | 11932 | Mexico City | Distrito Federal, | MX |
| DE | Germany | 1 | 1 | Europe | 2700 | Schwanthalerstr. 7031 | 80925 | Munich | Bavaria | DE |
| UK | United Kingdom | 1 | 1 | Europe | 2500 | Magdalen Centre, The Oxford Science Park | OX9 9ZB | Oxford | Oxford | UK |
| IT | Italy | 1 | 1 | Europe | 1000 | 1297 Via Cola di Rie | 00989 | Roma |  | IT |

##### Snowflake

```sql
SELECT
*
FROM
hr.regions r
CROSS JOIN hr.locations l
LEFT OUTER JOIN
hr.countries c
ON
c.region_id = r.region_id
AND
l.country_id = c.country_id
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| REGION_ID | REGION_NAME | LOCATION_ID | STREET_ADDRESS | POSTAL_CODE | CITY | STATE_PROVINCE | COUNTRY_ID | COUNTRY_ID | COUNTRY_NAME | REGION_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 1.0000000000000000000 | Europe | 2000 | 40-5-12 Laogianggen | 190518 | Beijing |  | CN |  |  |  |
| 1.0000000000000000000 | Europe | 3000 | Murtenstrasse 921 | 3095 | Bern | BE | CH | CH | Switzerland | 1.0000000000000000000 |
| 1.0000000000000000000 | Europe | 2100 | 1298 Vileparle (E) | 490231 | Bombay | Maharashtra | IN |  |  |  |
| 1.0000000000000000000 | Europe | 2900 | 20 Rue des Corps-Saints | 1730 | Geneva | Geneve | CH | CH | Switzerland | 1.0000000000000000000 |
| 1.0000000000000000000 | Europe | 1300 | 9450 Kamiya-cho | 6823 | Hiroshima |  | JP |  |  |  |
| 1.0000000000000000000 | Europe | 2400 | 8204 Arthur St |  | London |  | UK | UK | United Kingdom | 1.0000000000000000000 |
| 1.0000000000000000000 | Europe | 3200 | Mariano Escobedo 9991 | 11932 | Mexico City | Distrito Federal, | MX |  |  |  |
| 1.0000000000000000000 | Europe | 2700 | Schwanthalerstr. 7031 | 80925 | Munich | Bavaria | DE | DE | Germany | 1.0000000000000000000 |
| 1.0000000000000000000 | Europe | 2500 | Magdalen Centre, The Oxford Science Park | OX9 9ZB | Oxford | Oxford | UK | UK | United Kingdom | 1.0000000000000000000 |
| 1.0000000000000000000 | Europe | 1000 | 1297 Via Cola di Rie | 00989 | Roma |  | IT | IT | Italy | 1.0000000000000000000 |

##### 2. Outer joined between predicate with an interval with multiple tables

Between predicates can be used for non-ANSI OUTER JOINS. In Oracle, columns inside the interval can be outer joined, even if they come from different tables, however, Snowflake does not support this. For these cases, the between predicate will be commented out.

##### Oracle

```sql
SELECT
*
FROM
hr.countries c, hr.regions r,  hr.locations l WHERE
l.location_id  BETWEEN r.region_id(+) AND c.region_id(+)
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

##### Result

| COUNTRY_ID | COUNTRY_NAME | REGION_ID | REGION_ID | REGION_NAME | LOCATION_ID | STREET_ADDRESS | POSTAL_CODE | CITY | STATE_PROVINCE | COUNTRY_ID |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
|  |  |  | 1 | Europe | 2000 | 40-5-12 Laogianggen | 190518 | Beijing |  | CN |
|  |  |  | 1 | Europe | 3000 | Murtenstrasse 921 | 3095 | Bern | BE | CH |
|  |  |  | 1 | Europe | 2100 | 1298 Vileparle (E) | 490231 | Bombay | Maharashtra | IN |
|  |  |  | 1 | Europe | 2900 | 20 Rue des Corps-Saints | 1730 | Geneva | Geneve | CH |
|  |  |  | 1 | Europe | 1300 | 9450 Kamiya-cho | 6823 | Hiroshima |  | JP |
|  |  |  | 1 | Europe | 2400 | 8204 Arthur St |  | London |  | UK |
|  |  |  | 1 | Europe | 3200 | Mariano Escobedo 9991 | 11932 | Mexico City | Distrito Federal, | MX |
|  |  |  | 1 | Europe | 2700 | Schwanthalerstr. 7031 | 80925 | Munich | Bavaria | DE |
|  |  |  | 1 | Europe | 2500 | Magdalen Centre, The Oxford Science Park | OX9 9ZB | Oxford | Oxford | UK |
|  |  |  | 1 | Europe | 1000 | 1297 Via Cola di Rie | 00989 | Roma |  | IT |

##### Snowflake

```sql
SELECT
*
FROM
hr.countries c,
hr.regions r,
hr.locations l WHERE
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0090 - INVALID NON-ANSI OUTER JOIN BETWEEN PREDICATE CASE FOR SNOWFLAKE. ***/!!!
l.location_id  BETWEEN r.region_id(+) AND c.region_id(+)
ORDER BY r.region_id, l.city
FETCH FIRST 10 ROWS ONLY;
```

### Related EWIs

1. [SSC-EWI-OR0090](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Non-Ansi Outer Join has an invalid Between predicate.

## Self Join

> **Note:**
>
> Some parts in the output codes are omitted for clarity reasons.

### Description

> A self join is a join of a table to itself. This table appears twice in the `FROM` clause and is followed by table aliases that qualify column names in the join condition. ([Oracle SQL Language Reference Self Join Subsection](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-B0F5C614-CBDD-45F6-966D-00BAD6463440))

### Sample Source Patterns

> **Note:**
>
> *Order by* clause added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Basic Self Join case

##### Oracle

```sql
SELECT e1.last_name||' works for '||e2.last_name
   "Employees and Their Managers"
   FROM hr.employees e1, hr.employees e2
   WHERE e1.manager_id = e2.employee_id
      AND e1.last_name LIKE 'R%'
   ORDER BY e1.last_name;
```

##### Result

| Employees and Their Managers |
| --- |
| Rajs works for Mourgos |
| Raphaely works for King |
| Rogers works for Kaufling |
| Russell works for King |

##### Snowflake

```sql
SELECT
   NVL( e1.last_name :: STRING, '') || ' works for ' || NVL(e2.last_name :: STRING, '') "Employees and Their Managers"
FROM
   hr.employees e1,
   hr.employees e2
   WHERE e1.manager_id = e2.employee_id
      AND e1.last_name LIKE 'R%'
   ORDER BY e1.last_name;
```

##### Result

| Employees and Their Managers |
| --- |
| Rajs works for Mourgos |
| Raphaely works for King |
| Rogers works for Kaufling |
| Russell works for King |

> **Note:**
>
> As proved previously the **self join** in Oracle is functionally equivalent to Snowflake.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Semijoin

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> A semijoin returns rows that match an `EXISTS` subquery without duplicating rows from the left side of the predicate when multiple rows on the right side satisfy the criteria of the subquery. Semijoin transformation cannot be done if the subquery is on an `OR` branch of the `WHERE` clause. ([Oracle SQL Language Reference Semijoin Subsection](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Joins.html#GUID-E98C180E-8A17-469D-8E68-56245E28104B))

### Sample Source Patterns

> **Note:**
>
> *Order by* clause added because the result order may vary between Oracle and Snowflake.

> **Note:**
>
> Check this [section](../sample-data.md) to set up the sample database.

#### Basic Semijoin case

##### Oracle

```sql
SELECT * FROM hr.departments
   WHERE EXISTS
   (SELECT * FROM hr.employees
       WHERE departments.department_id = employees.department_id
       AND employees.salary > 2500)
   ORDER BY department_name;
```

##### Result

| DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- |
| 110 | Accounting | 205 | 1700 |
| 10 | Administration | 200 | 1700 |
| 90 | Executive | 100 | 1700 |
| 100 | Finance | 108 | 1700 |
| 40 | Human Resources | 203 | 2400 |
| 60 | IT | 103 | 1400 |
| 20 | Marketing | 201 | 1800 |
| 70 | Public Relations | 204 | 2700 |
| 30 | Purchasing | 114 | 1700 |
| 80 | Sales | 145 | 2500 |
| 50 | Shipping | 121 | 1500 |

##### Snowflake

```sql
SELECT * FROM
   hr.departments
   WHERE EXISTS
   (SELECT * FROM
         hr.employees
       WHERE departments.department_id = employees.department_id
       AND employees.salary > 2500)
   ORDER BY department_name;
```

##### Result

| DEPARTMENT_ID | DEPARTMENT_NAME | MANAGER_ID | LOCATION_ID |
| --- | --- | --- | --- |
| 110 | Accounting | 205 | 1700 |
| 10 | Administration | 200 | 1700 |
| 90 | Executive | 100 | 1700 |
| 100 | Finance | 108 | 1700 |
| 40 | Human Resources | 203 | 2400 |
| 60 | IT | 103 | 1400 |
| 20 | Marketing | 201 | 1800 |
| 70 | Public Relations | 204 | 2700 |
| 30 | Purchasing | 114 | 1700 |
| 80 | Sales | 145 | 2500 |
| 50 | Shipping | 121 | 1500 |

> **Note:**
>
> As proved previously the **semijoin** in Oracle is functionally equivalent to Snowflake.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Oracle - Literals
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/literals.md
section: Migrations
---

# SnowConvert AI - Oracle - Literals

> The terms literal and constant value are synonymous and refer to a fixed data value.
> ([Oracle SQL Language Reference Literals](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Literals.html#GUID-192417E8-A79D-4A1D-9879-68272D925707))

## Interval Literal

Interval Literal Not Supported In Current Scenario

### Description

Snowflake Intervals can only be used in arithmetic operations. Intervals used in any other scenario are not supported.

#### Example Code

##### Oracle

```sql
SELECT INTERVAL '1-5' YEAR TO MONTH FROM DUAL;
```

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0107 - INTERVAL LITERAL IS NOT SUPPORTED BY SNOWFLAKE IN THIS SCENARIO  ***/!!!
 INTERVAL '1-5' YEAR TO MONTH FROM DUAL;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0107](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Interval Literal Not Supported In Current Scenario.

## Interval Type and Date Type

Operation Between Interval Type and Date Type not Supported

### Description

`INTERVAL YEAR TO MONTH` and `INTERVAL DAY TO SECOND` are not a supported data type, they are transformed to `VARCHAR(20)`. Therefore all arithmetic operations between **Date Types** and the original **Interval Type Columns** are not supported.

Furthermore, operations between an Interval Type and Date Type (in this order) are not supported in Snowflake; and these operations use this EWI as well.

#### Example Code

##### Oracle

```sql
CREATE TABLE table_with_intervals
(
    date_col DATE,
    time_col TIMESTAMP,
    intervalYearToMonth_col INTERVAL YEAR TO MONTH,
    intervalDayToSecond_col INTERVAL DAY TO SECOND
);

-- Date + Interval Y to M
SELECT date_col + intervalYearToMonth_col FROM table_with_intervals;

-- Date - Interval D to S
SELECT date_col - intervalDayToSecond_col FROM table_with_intervals;

-- Timestamp + Interval D to S
SELECT time_col + intervalDayToSecond_col FROM table_with_intervals;

-- Timestamp - Interval Y to M
SELECT time_col - intervalYearToMonth_col FROM table_with_intervals;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table_with_intervals
    (
        date_col TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
        time_col TIMESTAMP(6),
        intervalYearToMonth_col VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR TO MONTH DATA TYPE CONVERTED TO VARCHAR ***/!!!,
        intervalDayToSecond_col VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY TO SECOND DATA TYPE CONVERTED TO VARCHAR ***/!!!
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    -- Date + Interval Y to M
    SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! date_col + intervalYearToMonth_col FROM
    table_with_intervals;

    -- Date - Interval D to S
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! date_col - intervalDayToSecond_col FROM
    table_with_intervals;

    -- Timestamp + Interval D to S
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! time_col + intervalDayToSecond_col FROM
    table_with_intervals;

    -- Timestamp - Interval Y to M
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! time_col - intervalYearToMonth_col FROM
    table_with_intervals;
```

#### Recommendations

* Implement the UDF to simulate the Oracle behavior.
* Extract the already transformed value that was stored in the column during migration, and use it as a Snowflake [**Interval Constant**](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) when possible.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-OR0095](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Operation Between Interval Type and Date Type not Supported.
3. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

## Text literals

### Description

> Use the text literal notation to specify values whenever `string` appears in the syntax of expressions, conditions, SQL functions, and SQL statements in other parts of this reference.
>
> ([Oracle SQL Language Reference Text literals](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Literals.html#GUID-1824CBAA-6E16-4921-B2A6-112FB02248DA))

```sql
[ {N | n} ]
{ '[ c ]...'
| { Q | q } 'quote_delimiter c [ c ]... quote_delimiter'
}
```

### Sample Source Patterns

#### Empty string (‘’)

The empty strings are equivalent to *NULL* in Oracle, so in order to emulate the behavior in Snowflake, the empty strings are converted to *NULL* or *undefined* depending if the literal is used inside a procedure or not.

##### Oracle

```sql
SELECT UPPER('') FROM DUAL;
```

##### Result

| UPPER(‘’) |
| --- |
|  |

##### Snowflake

```sql
SELECT UPPER(NULL) FROM DUAL;
```

##### Result

| UPPER(NULL) |
| --- |
|  |

#### Empty string in stored procedures

##### Oracle

```sql
CREATE TABLE empty_string_table(
col1 VARCHAR(10),
col2 VARCHAR(10));

CREATE OR REPLACE PROCEDURE null_proc AS
    var1 INTEGER := '';
    var3 INTEGER := null;
    var2 VARCHAR(20) := 'hello';
BEGIN
    var1 := var1 + 456;
    var2 := var2 || var1;
    IF var1 IS NULL THEN
        INSERT INTO empty_string_table VALUES (var1, var2);
    END IF;
END;

CALL null_proc();

SELECT * FROM empty_string_table;
```

##### Result

| COL1 | COL2 |
| --- | --- |
|  | hello |

##### Snowflake

```sql
CREATE OR REPLACE TABLE empty_string_table (
    col1 VARCHAR(10),
    col2 VARCHAR(10))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
;

CREATE OR REPLACE PROCEDURE null_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 INTEGER := NULL;
        var3 INTEGER := null;
        var2 VARCHAR(20) := 'hello';
    BEGIN
        var1 := :var1 + 456;
        var2 := NVL(:var2 :: STRING, '') || NVL(:var1 :: STRING, '');
        IF (:var1 IS NULL) THEN
            INSERT INTO empty_string_table
            VALUES (:var1, :var2);
        END IF;
    END;
$$;

CALL null_proc();

SELECT * FROM
    empty_string_table;
```

##### Result

| COL1 | COL2 |
| --- | --- |
|  | hello |

#### Empty string in built-in functions

> **Warning:**
>
> The transformation does not apply when the empty string is used as an argument of the *REPLACE* and *CONCAT* functions in order to keep the functional equivalence.

##### Oracle

```sql
SELECT REPLACE('Hello world', '', 'l'), CONCAT('A','') FROM DUAL;
```

##### Result

| REPLACE(‘HELLOWORLD’,’’,’L’) | CONCAT(‘A’,’’) |
| --- | --- |
| Hello world | A |

##### Snowflake

```sql
SELECT REPLACE('Hello world', '', 'l'), CONCAT('A','') FROM DUAL;
```

##### Result

| REPLACE(‘HELLO WORLD’, ‘’, ‘L’) | CONCAT(‘A’,’’) |
| --- | --- |
| Hello world | A |

> **Note:**
>
> If the empty strings are replaced by NULL for these cases, the results of the queries will be different.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Oracle - Oracle Built-in Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/oracle-built-in-data-types.md
section: Migrations
---

# SnowConvert AI - Oracle - Oracle Built-in Data Types

## Extended Data Types

### Description

> Beginning with Oracle Database 12_c_, you can specify a maximum size of 32767 bytes for the `VARCHAR2`, `NVARCHAR2`, and `RAW` data types. You can control whether your database supports this new maximum size by setting the initialization parameter `MAX_STRING_SIZE`.
>
> A `VARCHAR2` or `NVARCHAR2` data type with a declared size of greater than 4000 bytes, or a `RAW` data type with a declared size of greater than 2000 bytes, is an **extended** **data** **type**. ([Oracle SQL Language Reference Extended Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-8EFA29E9-E8D8-40A6-A43E-954908C954A4)).

Oracle allows increasing the database max string size from `STANDARD` to `EXTENDED`, however, Snowflake **does not contain an equivalent** for this functionality.

Therefore `VARCHAR2`, `NVARCHAR2` and `RAW` extended Data Types are not supported in Snowflake, and they are transformed just as regular `VARCHAR2`, `NVARCHAR2`, and `RAW` data types. Check Character Data Types and RAW Data Types for more information.

### Known Issues

#### 1. MAX STRING SIZE not recognized

`ALTER SYSTEM SET MAX_STRING_SIZE='EXTENDED';`

Is not being parsed by SnowConvert.

### Related EWIs

No related EWIs.

## JSON Data Type

### Description

> Oracle Database supports JSON natively with relational database features, including transactions, indexing, declarative querying, and views. Unlike relational data, JSON data can be stored in the database, indexed, and queried without any need for a schema that defines the data. ([Oracle SQL Language Reference JSON Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-E441F541-BA31-4E8C-B7B4-D2FB8C42D0DF)).

The JSON data types are transformed to VARIANT to emulate the Oracle behavior.

```sql
JSON
```

### Sample Source Patterns

#### JSON Data Type as a column in Create Table

##### Oracle

```sql
CREATE TABLE jsontable (
	json_column JSON
);

INSERT INTO jsontable VALUES('{"id": 1, "content":"json content"}');
INSERT INTO jsontable VALUES('{"stringdata": "this is a text","number": 1,"numberNeg": -1,"booleanT": true,"booleanGF": false,"nullvalue": null,"object": {"1": 1,"2": 2},"array": [1, 2, 3]}');
INSERT INTO jsontable VALUES(JSON('{"id": 4}'));

SELECT  * FROM jsontable;
```

##### Result

| COL1 |
| --- |
| {“id”:1,”content”:”json content”} |
| {“stringdata”:”this is a text”,”number”:1,”numberNeg”:-1,”booleanT”:true,”booleanGF”:false,”nullvalue”:null,”object”:{“1”:1,”2”:2},”array”:[1,2,3]} |
| {“id”:4} |

##### Snowflake

```sql
CREATE OR REPLACE TABLE jsontable (
	json_column VARIANT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO jsontable
VALUES('{"id": 1, "content":"json content"}');

INSERT INTO jsontable
VALUES('{"stringdata": "this is a text","number": 1,"numberNeg": -1,"booleanT": true,"booleanGF": false,"nullvalue": null,"object": {"1": 1,"2": 2},"array": [1, 2, 3]}');

INSERT INTO jsontable
VALUES(JSON('{"id": 4}') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'JSON' NODE ***/!!!);

SELECT  * FROM
	jsontable;
```

> **Warning:**
>
> JSON data insertions are not being correctly handled. Check the Recommendations section for workarounds.

### Known Issues

**1. JSON data insertions**

JSON data insertions are not being correctly handled by SnowConvert.

**2. JSON objects manipulation**

The usages of JSON objects (columns, variables, or parameters) are not correctly converted by SnowConvert AI. Check the Recommendations section for workarounds

### Recommendations

#### 1. JSON **Data Type** translation workaround

JSON datatype is translated to *VARIANT*, so the information can be formatted using the Snowflake *PARSE_JSON* function. This approach will allow you to store, query, and operate the JSON data in Snowflake using similar syntax as Oracle.

##### Oracle

```sql
CREATE TABLE jsontable (
	json_column JSON
);

INSERT INTO jsontable VALUES('{"id": 1, "content":"json content"}');
INSERT INTO jsontable VALUES('{"id": 2, "content": {"header": "header text one", "content": "content text one"}}');
INSERT INTO jsontable VALUES('{"id": 3, "content": {"header": "header tex two", "content": "content text two"}}');

SELECT * FROM jsontable;
SELECT 'ID: ' || jt.json_column.id, 'HEADER: ' || UPPER(jt.json_column.content.header) FROM jsontable jt;
```

##### Result 1

| JSON_SERIALIZE(JSON_COLUMN) |
| --- |
| {“id”:1,”content”:”json content”} |
| {“id”:2,”content”:{“header”:”header text one”,”content”:”content text one”}} |
| {“id”:3,”content”:{“header”:”header tex two”,”content”:”content text two”}} |

##### Result 2

| ‘ID:’ JT.JSON_COLUMN.ID | ‘HEADER:’ UPPER(JT.JSON_COLUMN.CONTENT.HEADER) |
| --- | --- |
| ID: 1 | HEADER: |
| ID: 2 | HEADER: “HEADER TEXT ONE” |
| ID: 3 | HEADER: “HEADER TEX TWO” |

##### Snowflake

```sql
CREATE OR REPLACE TABLE jsontable (
	json_column VARIANT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO jsontable
VALUES('{"id": 1, "content":"json content"}');

INSERT INTO jsontable
VALUES('{"id": 2, "content": {"header": "header text one", "content": "content text one"}}');

INSERT INTO jsontable
VALUES('{"id": 3, "content": {"header": "header tex two", "content": "content text two"}}');

SELECT * FROM
	jsontable;

SELECT 'ID: ' || NVL(jt.json_column.id :: STRING, ''), 'HEADER: ' || NVL(UPPER(jt.json_column.content.header) :: STRING, '') FROM
	jsontable jt;
```

##### Result 1

| JSON_COLUMN |
| --- |
| { “content”: “json content”, “id”: 1} |
| { “content”: { “content”: “content text one”, “header”: “header text one” }, “id”: 2} |
| { “content”: { “content”: “content text two”, “header”: “header tex two” }, “id”: 3} |

##### Result 2

| ‘ID: ‘ JT.JSON_COLUMN:ID | ‘HEADER: ‘ UPPER(JT.JSON_COLUMN:CONTENT:HEADER) |
| --- | --- |
| ID: 1 |  |
| ID: 2 | HEADER: HEADER TEXT ONE |
| ID: 3 | HEADER: HEADER TEX TWO |

> **Note:**
>
> You must use *SELECT* as the INSERT *INTO* argument instead of the *VALUES* clause to use the *PARSE_JSON* function.

> **Note:**
>
> Use the ‘:’ instead of the ‘.’ operator to access the JSON object properties. It allows several levels of nesting in both engines.

### Related EWIs

1. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review

## LONG Data Type

> `LONG` columns store variable-length character strings containing up to 2 gigabytes -1, or 231-1 bytes. `LONG` columns have many of the characteristics of `VARCHAR2` columns. You can use `LONG` columns to store long text strings. The length of `LONG` values may be limited by the memory available on your computer. ([Oracle SQL Language Reference Long Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-F6309DF8-162F-48A4-9454-FEE59EC6644F))

```sql
LONG
```

### Sample Source Patterns

#### Long in Create Table

##### Oracle

```sql
CREATE TABLE long_table
(
     id 	  NUMBER,
     long_column  LONG
);

 INSERT INTO long_table VALUES (1, 'this is a text');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE long_table
 (
      id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
      long_column VARCHAR
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;

 INSERT INTO long_table
 VALUES (1, 'this is a text');
```

#### Retrieving data from a Long column

##### Oracle

```sql
SELECT long_column FROM long_table;
```

##### Result

| LONG_COLUMN |
| --- |
| this is a text |

##### Snowflake

```sql
SELECT long_column FROM
long_table;
```

##### Result

| LONG_COLUMN |
| --- |
| this is a text |

### Known Issues

#### 1. The max length of long (Oracle) and varchar (Snowflake) are different

According to [Oracle documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnoci/data-types.html#GUID-A4B5A998-038A-44BA-A673-C41BEAC05C42), Long column can store up to 2 gigabytes of data, but [Snowflake varchar](https://docs.snowflake.com/en/sql-reference/data-types-text.html#varchar) is limited to 16Mb.

##### 2. Cast of Long column

The Long data type can only be cast to a CLOB data type by using the [TO_LOB function](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/TO_LOB.html#GUID-35810313-029E-4CB8-8C27-DF432FA3C253). This function only works when used in the select list of a subquery in an INSERT statement. Consider the following sample

##### Oracle

```sql
CREATE TABLE target_table (col CLOB);

INSERT INTO target_table (SELECT TO_LOB(long_column) FROM long_table);
```

> **Warning:**
>
> If the target table column data type is different from CLOB, Oracle may insert null values or display an error when attempting to insert the data.

### Related EWIs

1. [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake

## RAW and LONG RAW Data types

### Description

> The `RAW` and `LONG` `RAW` data types store data that is not to be explicitly converted by Oracle Database when moving data between different systems. These data types are intended for binary data or byte strings. ([Oracle SQL Language Reference Row and Long Raw Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-4FD497DD-3331-4C25-9147-3CEBEFDBFF22))

```sql
{ LONG RAW | RAW (size) }
```

### Sample Source Patterns

#### Raw and Long Raw in Create Table

##### Oracle

```sql
CREATE TABLE raw_table
(
     id INTEGER,
     raw_column RAW(2000),
     long_raw_column LONG RAW
);

INSERT  INTO raw_table values(1, 'FF00FF00FF', 'FF00FF00FFAABAABABABABA917843210984237123ABABABABAABBAAABBACDFFD');
INSERT  INTO raw_table values(2, 'AAAAAAAAAA', 'ABABABABABABABABABABABABABABABAbABAbABAABABAAABABABABABABABABABABA');
--Insert with largest string posible (2000 HEX characters)
INSERT INTO raw_table VALUES (3, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA1AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA', 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA1AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA')
```

##### Snowflake CREATE OR REPLACE TABLE raw_table

```sql
CREATE OR REPLACE TABLE raw_table
     (
          id INTEGER,
          raw_column BINARY,
          long_raw_column BINARY
     )
     COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
     ;

     INSERT  INTO raw_table
     values(1, 'FF00FF00FF', 'FF00FF00FFAABAABABABABA917843210984237123ABABABABAABBAAABBACDFFD');

     INSERT  INTO raw_table
     values(2, 'AAAAAAAAAA', 'ABABABABABABABABABABABABABABABAbABAbABAABABAAABABABABABABABABABABA');

     --Insert with largest string posible (2000 HEX characters)
INSERT INTO raw_table
     VALUES (3, 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA1AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA', 'AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA1AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA');
```

#### Retrieving data from Raw and Long Raw column

##### Oracle

```sql
SELECT * FROM raw_table ORDER BY id;
```

##### Result

| ID | RAW_COLUMN | LONG_RAW_COLUMN |
| --- | --- | --- |
| 1 |  | ªº««««© 2 B7 :ºººº«ºª»¬ßý |
| 2 | ªªªªª | «««««««««««««««««««ªººªºººººººººº |
| 3 | ªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªª | ªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªª |

##### Snowflake

```sql
SELECT * FROM
raw_table
ORDER BY id;
```

##### Result

| ID | RAW_COLUMN | LONG_RAW_COLUMN |
| --- | --- | --- |
| 1 |  | ªº««««© 2 B7 :ºººº«ºª»¬ßý |
| 2 | ªªªªª | «««««««««««««««««««ªººªºººººººººº |
| 3 | ªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªª | ªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªªª |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Numeric Data Types

### Description

> The Oracle Database numeric data types store positive and negative fixed and floating-point numbers, zero, infinity, and values that are the undefined result of an operation—“not a number” or `NAN`. ([Oracle Language Reference Numeric Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-9401BC04-81C4-4CD5-99E7-C5E25C83F608))

#### Notes on arithmetic operations

Please be aware that every operation performed on numerical datatypes is internally stored as a Number. Furthermore, depending on the operation performed it is possible to incur an error related to how intermediate values are stored within Snowflake, for more information on [Snowflake’s post on intermediate numbers in Snowflake](https://community.snowflake.com/s/question/0D50Z00008HhSHCSA3/sql-compilation-error-invalid-intermediate-datatype-number7148).

## FLOAT Data Type

### Description

> The `FLOAT` data type is a subtype of `NUMBER`. It can be specified with or without precision, which has the same definition it has for`NUMBER`and can range from 1 to 126. Scale cannot be specified but is interpreted from the data. ([Oracle Language Reference Float Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-10D4D073-866D-4BD4-B3E9-ED153D505A6A))

> **Warning:**
>
> #### Notes on arithmetic operations
>
> Please be aware that every operation performed on numerical datatypes is internally stored as a Number. Furthermore, depending on the operation performed it is possible to incur an error related to how intermediate values are stored within Snowflake, for more information please check this post on [Snowflake’s post on intermediate numbers in Snowflake](https://community.snowflake.com/s/question/0D50Z00008HhSHCSA3/sql-compilation-error-invalid-intermediate-datatype-number7148).

### Sample Source Patterns

Please, consider the following table and its inserts for the examples below:

#### Float data type in Create Table

##### Oracle

```sql
CREATE TABLE float_data_type_table(
col1 FLOAT,
col2 FLOAT(5),
col3 FLOAT(126)
);

INSERT INTO float_data_type_table (col1) VALUES (100.55555);
INSERT INTO float_data_type_table (col1) VALUES (1.9);
INSERT INTO float_data_type_table (col2) VALUES (1.23);
INSERT INTO float_data_type_table (col2) VALUES (7.89);
INSERT INTO float_data_type_table (col2) VALUES (12.79);
INSERT INTO float_data_type_table (col2) VALUES (123.45);
INSERT INTO float_data_type_table (col3) VALUES (1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111.99999999999999999999555555);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE float_data_type_table (
col1 FLOAT,
col2 FLOAT(5),
col3 FLOAT(126)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO float_data_type_table(col1) VALUES (100.55555);

INSERT INTO float_data_type_table(col1) VALUES (1.9);

INSERT INTO float_data_type_table(col2) VALUES (1.23);

INSERT INTO float_data_type_table(col2) VALUES (7.89);

INSERT INTO float_data_type_table(col2) VALUES (12.79);

INSERT INTO float_data_type_table(col2) VALUES (123.45);

INSERT INTO float_data_type_table(col3) VALUES (1111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111111.99999999999999999999555555);
```

#### FLOAT

There are no differences between Oracle and Snowflake regarding FLOAT data type without precision.

##### Oracle

```sql
SELECT col1 FROM float_data_type_table;
```

##### Result

| col1 |
| --- |
| 100.55555 |
| 1.9 |

##### Snowflake

```sql
SELECT col1 FROM
float_data_type_table;
```

##### Result

| col1 |
| --- |
| 100.55555 |
| 1.9 |

#### FLOAT ( p )

Queries results may not be equivalent when the precision **(p)** is specified in the`FLOAT`data type. There are small rounding differences.

##### Oracle

```sql
SELECT col2 FROM float_data_type_table;

SELECT col3 FROM float_data_type_table;
```

##### Result

| col2 |
| --- |
| 1.2 |
| 7.9 |
| 13 |
| 120 |
|  |
| col3 |
| —————————————————————————————————- |
| 1111111111111111111111111111111111111100000000000000000000000000000000000000000000000000000000000000 |
|  |

##### Snowflake

```sql
SELECT col2 FROM
float_data_type_table;

SELECT col3 FROM
float_data_type_table;
```

##### Result

| col2 |
| --- |
| 1.23 |
| 7.89 |
| 12.79 |
| 123.45 |
|  |
| col3 |
| —————————————————————————————————- |
| 1111111111111111000000000000000000000000000000000000000000000000000000000000000000000000000000000000 |

### Known Issues

#### 1. FLOAT data type with precision

When the **FLOAT** data type has precision, the queries results may have small rounding differences.

### Related EWIs

No related EWIs.

## NUMBER Data Type

### Description

> The `NUMBER` data type stores zero as well as positive and negative fixed numbers with absolute values from 1.0 x 10-130 to but not including 1.0 x 10126. If you specify an arithmetic expression whose value has an absolute value greater than or equal to 1.0 x 10126, then Oracle returns an error. Each `NUMBER` value requires from 1 to 22 bytes. ([Oracle Language Reference Number Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-75209AF6-476D-4C44-A5DC-5FA70D701B78)).

The `NUMBER` data type can be specified using the following form `NUMBER(p, s)` (both parameters are optional) where:

* `p` is the **precision** or the maximum number of significant decimal digits, where the most significant digit is the left-most nonzero digit, and the least significant digit is the right-most known digit. The precision can range from 0 to 38.
* `s` is the **scale** or the number of digits from the decimal point to the least significant digit. The scale can range from -84 to 127.

On Oracle, not specifying precision (using `NUMBER or NUMBER(*)`) causes the column to be created as an “undefined precision”. This means that Oracle will store values dynamically, allowing to store any number within that column. Snowflake does not support this functionality; for this reason, they will be changed to NUMBER(38, 18), allowing to store the widest variety of numbers.

> **Warning:**
>
> #### Notes on arithmetic operations
>
> Please be aware that every operation performed on numerical data types is internally stored as a Number. Furthermore, depending on the operation performed it is possible to incur an error related to how intermediate values are stored within Snowflake, for more information please check this post on [Snowflake’s post on intermediate numbers in Snowflake](https://community.snowflake.com/s/question/0D50Z00008HhSHCSA3/sql-compilation-error-invalid-intermediate-datatype-number7148) or check the functional equivalence message [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md).

### Sample Source Patterns

Please, consider the following table and its inserts for the examples below:

#### Number data types in Create Table

##### Oracle

```sql
CREATE TABLE number_data_type_table
(
col1 NUMBER,
col2 NUMBER(1),
col3 NUMBER(10, 5),
col4 NUMBER(5, -2),
col5 NUMBER(4, 5)
);

INSERT INTO number_data_type_table(COL1) VALUES(100);
INSERT INTO number_data_type_table(COL2) VALUES(1.99999);
INSERT INTO number_data_type_table(COL3) VALUES(12345.12345);
INSERT INTO number_data_type_table(COL4) VALUES(16430.55555);
INSERT INTO number_data_type_table (COL4) VALUES(17550.55555);
INSERT INTO number_data_type_table(COL5) VALUES(0.00009);
INSERT INTO number_data_type_table(COL5) VALUES(0.000021);
INSERT INTO number_data_type_table(COL5) VALUES(0.012678912);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE number_data_type_table
(
col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
col2 NUMBER(1) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
col3 NUMBER(10, 5) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
col4 NUMBER(5) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0092 - NUMBER DATATYPE NEGATIVE SCALE WAS REMOVED FROM OUTPUT ***/!!! /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
col5 NUMBER(5, 5) /*** SSC-FDM-OR0010 - NUMBER DATATYPE SMALLER PRECISION WAS INCREASED TO MATCH SCALE ***/ /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO number_data_type_table(COL1) VALUES(100);

INSERT INTO number_data_type_table(COL2) VALUES(1.99999);

INSERT INTO number_data_type_table(COL3) VALUES(12345.12345);

INSERT INTO number_data_type_table(COL4) VALUES(16430.55555);

INSERT INTO number_data_type_table(COL4) VALUES(17550.55555);

INSERT INTO number_data_type_table(COL5) VALUES(0.00009);

INSERT INTO number_data_type_table(COL5) VALUES(0.000021);

INSERT INTO number_data_type_table(COL5) VALUES(0.012678912);
```

#### NUMBER ( default case )

When the precision and the scale are not specified, the default values are the maximum available`NUMBER(38, 127)` . The current transformation for the default case is `NUMBER(38,19).`

> **Warning:**
>
> In Oracle, not defining Precision nor scale defaults to an “Undefined Precision and Scale”. It behaves by storing the input “as received”, which means it can both deal with Integer and Floating point numbers. We use **38, 18** to try to cover both of them, by using 20 for integers, and leaving 18 for floating-point digits.

##### Oracle

```sql
SELECT col1 FROM number_data_type_table;
```

##### Result

| col1 |
| --- |
| 100 |

##### Snowflake

```sql
SELECT col1 FROM
number_data_type_table;
```

##### Result

| col1 |
| --- |
| 100.0000000000000000000 |

#### NUMBER ( p )

In this case, the precision will specify the number of digits that the number could have at the left of the decimal point.

##### Oracle

```sql
SELECT col2 FROM number_data_type_table;
```

##### Result

| col2 |
| --- |
| 2 |

##### Snowflake

```sql
SELECT col2 FROM
number_data_type_table;
```

##### Result

| col2 |
| --- |
| 2 |

#### NUMBER ( p, s ) p > s

In the case where the **s** is lower than the **p**, the precision will specify the number of digits that the number could have. The scale will specify the number of significant digits to the right of the decimal point, so the number of digits at the left of the decimal point will depend on the scale specified.

##### Oracle

```sql
SELECT col3 FROM number_data_type_table;
```

##### Result

| col3 |
| --- |
| 12345.12345 |

##### Snowflake

```sql
SELECT col3 FROM
number_data_type_table;
```

##### Result

| col3 |
| --- |
| 12345.12345 |

#### NUMBER ( p, -s )

A negative scale is the number of significant digits to the left of the decimal point, to but not including the least significant digit. For the negative scale, the least significant digit is on the left side of the decimal point, because the actual data is rounded to the specified number of places to the left of the decimal point. The current transformation is to remove the negative scale.

##### Oracle

```sql
SELECT col4 FROM number_data_type_table;
```

##### Result

| col4 |
| --- |
| 16400 |
| 17600 |

##### Snowflake

```sql
SELECT col4 FROM
number_data_type_table;
```

##### Result

| col4 |
| --- |
| 16431 |
| 17551 |

#### NUMBER ( p, s ) s > p

When the scale is greater than the precision, consider the following aspects:

* The number to insert could not have significant digits to the left of the decimal point. Only zero is available.
* The first digit to the right of the decimal point must be zero.
* The precision specifies the maximum number of significant digits to the right of the decimal point.

##### Oracle

```sql
SELECT col5 FROM number_data_type_table;
```

##### Result

| col5 |
| --- |
| 0.00009 |
| 0.00002 |
| 0.01268 |

##### Snowflake

```sql
SELECT col5 FROM
number_data_type_table;
```

##### Result

| col5 |
| --- |
| 0.00009 |
| 0.00002 |
| 0.01268 |

### Known Issues

#### 1. Scale value exceeds the maximum allowed by Snowflake

When specifying a scale greater than the maximum allowed in Snowflake (37) it is being changed to 18. To get more information about this please go to the [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) documentation.

##### 2. Negative scale

Snowflake does not allow negative scale, so it is being removed. This could cause functional inequivalence. To get more information about this issue please go to the [SSC-EWI-0R0092](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) documentation.

### Recommendations

#### 1. UDF for NUMBER datatype Operations

It is possible to migrate these operations manually by using the next UDF when performing arithmetic operations to avoid incurring the issues noted:

##### UDF

```sql
CREATE OR REPLACE FUNCTION fixed_divide(a NUMBER(38,19), b NUMBER(38,19))
RETURNS NUMBER(38,19)
LANGUAGE JAVA
CALLED ON NULL INPUT
HANDLER='TestFunc.divide'
AS
'
import java.math.BigDecimal;
import java.math.RoundingMode;
class TestFunc {
public static BigDecimal divide(BigDecimal a, BigDecimal b) {
return a.divide(b,RoundingMode.HALF_UP);
}
}';
```

### Related EWIs

1. [SSC-EWI-OR0092](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) Number datatype negative scale was removed from output.
2. [SSC-FDM-0006](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake
3. [SSC-FDM-OR0010](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md) Number datatype smaller precision was increased to match scale

## Floating-Point Numbers

### Description

> Floating-point numbers can have a decimal point anywhere from the first to the last digit or can have no decimal point at all. An exponent may optionally be used following the number to increase the range, for example, 1.777 e-20. A scale value is not applicable to floating-point numbers, because the number of digits that can appear after the decimal point is not restricted.Binary floating-point numbers are stored using binary precision (the digits 0 and 1)([Oracle Language Reference Floating-Point Numbers](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-F579F4B8-EF13-4CAF-9B06-03B076861C41))

## BINARY_DOUBLE

### Description

> `BINARY_DOUBLE` is a 64-bit, double-precision floating-point number data type. Each `BINARY_DOUBLE` value requires 8 bytes. In a `BINARY_DOUBLE` column, floating-point numbers have binary precision. The binary floating-point numbers support the special values infinity and `NaN` (not a number). ([Oracle Language Reference Binary_Double data type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-12FE5221-9B49-4110-8D16-BF51BCED5562))

It is possible to specify floating-point numbers within the next limits:

* **Maximum positive finite value** = 1.79769313486231E+308
* **Minimum positive finite value** = 2.22507485850720E-308

### Sample Source Patterns

Please, consider the following table and its inserts for the example below:

#### Binary Double in Create Table

##### Oracle

```sql
CREATE TABLE binary_double_data_type_table
(
COL1 BINARY_DOUBLE
);

INSERT INTO binary_double_data_type_table VALUES(2.22507485850720E-308D);
INSERT INTO binary_double_data_type_table VALUES(1.79769313486231E+308D);
INSERT INTO binary_double_data_type_table VALUES('NaN');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE binary_double_data_type_table
(
COL1 FLOAT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO binary_double_data_type_table
VALUES(2.22507485850720E-308);

INSERT INTO binary_double_data_type_table
VALUES(1.79769313486231E+308);

INSERT INTO binary_double_data_type_table
VALUES('NaN');
```

> **Note:**
>
> **‘NaN’** means ***Not a Number***, this value is allowed by the`BINARY_DOUBLE` data type in Oracle and by the`FLOAT`data type in Snowflake.

#### BINARY_DOUBLE -> FLOAT

Since the`BINARY_DOUBLE`data type is not supported by Snowflake it is being converted to FLOAT.

##### Oracle

```sql
SELECT * FROM binary_double_data_type_table;
```

##### Result

| col1 |
| --- |
| 0 |
| 179769313486231000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 |
| NaN |

##### Snowflake

```sql
SELECT * FROM
binary_double_data_type_table;
```

##### Result

| col1 |
| --- |
| 0 |
| 179769313486231000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000000 |
| NaN |

### Known Issues

#### 1. The BINARY_DOUBLE data type is not supported by Snowflake

The BINARY_DOUBLE data type is converted to FLOAT since it is not supported by Snowflake.

### Related EWIs

No related EWIs.

## BINARY_FLOAT

### Description

> `BINARY_FLOAT` is a 32-bit, single-precision floating-point number data type. Each`BINARY_FLOAT`value requires 4 bytes. In a `BINARY_FLOAT`column, floating-point numbers have binary precision. The binary floating-point numbers support the special values infinity and `NaN` (not a number). ([Oracle Language Reference Binary_Float data type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-CFE7487C-A4D0-4E90-A836-2697C45BDD10))

It is possible to specify floating-point numbers within the next limits:

* **Maximum positive finite value** = 3.40282E+38F
* **Minimum positive finite value** = 1.17549E-38F

### Sample Source Patterns

Please, consider the following table and its inserts for the example below:

#### Binary Float in Create Table

##### Oracle

```sql
CREATE TABLE binary_float_data_type_table
(
col1 BINARY_FLOAT
);

INSERT INTO binary_float_data_type_table VALUES(1.17549E-38F);
INSERT INTO binary_float_data_type_table VALUES(3.40282E+38F);
INSERT INTO binary_float_data_type_table VALUES('NaN');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE binary_float_data_type_table
(
col1 FLOAT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO binary_float_data_type_table
VALUES(1.17549E-38);

INSERT INTO binary_float_data_type_table
VALUES(3.40282E+38);

INSERT INTO binary_float_data_type_table
VALUES('NaN');
```

> **Note:**
>
> **‘NaN’** means ***Not a Number***, this value is allowed by the`BINARY_FLOAT` data type in Oracle and by the`FLOAT`data type in Snowflake.

#### BINARY_FLOAT -> FLOAT

Since the`BINARY_FLOAT`data type is not supported by Snowflake it is being converted to FLOAT.

##### Oracle

```sql
SELECT * FROM binary_float_data_type_table;
```

##### Result

| col1 |
| --- |
| 0 |
| 340282001837565600000000000000000000000 |
| NaN |

##### Snowflake

```sql
SELECT * FROM binary_float_data_type_table;
```

##### Result

| col1 |
| --- |
| 0 |
| 340282000000000000000000000000000000000 |
| NaN |

### Known Issues

#### 1. The BINARY_FLOAT data type is not supported by Snowflake

The BINARY_FLOAT data type is converted to FLOAT since it is not supported by Snowflake.

### Related EWIs

No related EWIs.

## Datetime and Interval Data Types

> The datetime data types are `DATE`, `TIMESTAMP`, `TIMESTAMP` `WITH` `TIME` `ZONE`, and `TIMESTAMP` `WITH` `LOCAL` `TIME` `ZONE`. Values of datetime data types are sometimes called datetimes. The interval data types are `INTERVAL` `YEAR` `TO` `MONTH` and `INTERVAL` `DAY` `TO` `SECOND`. Values of interval data types are sometimes called intervals. ([Oracle SQL Language Reference Datetime and Interval Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-7690645A-0EE3-46CA-90DE-C96DF5A01F8F))

## DATE Data Type

### Description

> Oracle’s date data type stores both date and time information, however Snowflake’s date data type only stores date information. ([Oracle SQL Language Reference Date Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-5405B652-C30E-4F4F-9D33-9A4CB2110F1B))

The default transformation for Oracle `DATE` is to Snowflake `TIMESTAMP`. You can add the `disableDateAsTimestamp` flag (SnowConvert AI Command Line Interface) or **disable** the *Transform Date as Timestamp* setting (SnowConvert AI desktop application) to transform the `DATE` type to `TIMESTAMP`. Keep in mind that Snowflake `DATE` only stores date information and Oracle stores date and time information, if you want to avoid losing information you should transform `DATE` to `TIMESTAMP`.

> **Note:**
>
> **Important Rounding Behavior Difference**: When performing operations between date/timestamp data types and intervals involving seconds, Oracle does not round the seconds but preserves the precision as specified, while Snowflake rounds the seconds to the nearest whole second. This difference in rounding behavior can lead to different results.

### Sample Source Patterns

#### Date in Create Table

##### Oracle

```sql
CREATE TABLE date_table
(
	date_col date
);

INSERT INTO date_table(date_col) VALUES (DATE '2010-10-10');
```

##### Snowflake without –disableDateAsTimestamp flag or with “Transform Date as Timestamp” setting enabled

```sql
CREATE OR REPLACE TABLE date_table
	(
		date_col TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
	;

	INSERT INTO date_table(date_col) VALUES (DATE '2010-10-10');
```

##### Snowflake with –disableDateAsTimestamp flag or with “Transform Date as Timestamp” setting disabled

```sql
CREATE OR REPLACE TABLE date_table
	(
		date_col date
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO date_table(date_col) VALUES (DATE '2010-10-10');
```

#### Retrieving data from a Date column

##### Oracle

```sql
SELECT date_col FROM date_table;
```

###### Result

| DATE_COL |
| --- |
| 2010-10-10 00:00:00.000 |

##### Snowflake

```sql
SELECT date_col FROM
date_table;
```

###### Result

| DATE_COL |
| --- |
| 2010-10-10 00:00:00.000 |

###### Result with disableDateAsTimestamp flag

| DATE_COL |
| --- |
| 2010-10-10 |

### Known Issues

#### 1. Input and output format may differ between languages

In Snowflake, *`DATE`* input and output formats depend on the *`DATE_INPUT_FORMAT`* and *`DATE_OUTPUT_FORMAT`* session variables. Insertions may fail because the `DATE_INPUT_FORMAT` enforces the user to use a specific format when a date is added by text. You can modify those variables using the following syntax.

```sql
ALTER SESSION SET DATE_INPUT_FORMAT = 'YYYY-DD-MM' DATE_OUTPUT_FORMAT = 'DD-MM-YYYY';
```

### Related EWIs

1. [SSC-FDM-OR0042](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior

## INTERVAL DAY TO SECOND Data Type

### Description

> INTERVAL DAY TO SECOND stores a period of time in terms of days, hours, minutes, and seconds. ([Oracle SQL Language Reference INTERVAL DAY TO SECOND Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-B03DD036-66F8-4BD3-AF26-6D4433EBEC1C))

By default, there is no equivalent for this data type in Snowflake and it is transformed to `VARCHAR`.

> **Note:**
>
> **Preview Feature:** When the `--UseIntervalDatatype` [preview flag](../../../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled, Oracle `INTERVAL DAY TO SECOND` columns are preserved as native Snowflake `INTERVAL DAY TO SECOND` types. See the [Interval Data Types](../../../general/interval-data-types.md) translation reference for complete transformation details.

```sql
INTERVAL DAY [(day_precision)] TO SECOND [(fractional_seconds_precision)]
```

### Sample Source Patterns

#### Interval Day to Second in Create Table

##### Oracle

```sql
CREATE TABLE interval_day_to_second_table
(
	interval_day_col1 interval day to second,
	interval_day_col2 interval day(1) to second(4)
);

INSERT INTO interval_day_to_second_table(interval_day_col1) VALUES ( INTERVAL '1 2:3:4.56' DAY TO SECOND );
INSERT INTO interval_day_to_second_table(interval_day_col2) VALUES ( INTERVAL '1 2:3:4.56' DAY(1) TO SECOND(4) );
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE interval_day_to_second_table
	(
		interval_day_col1 VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL day to second DATA TYPE CONVERTED TO VARCHAR ***/!!!,
		interval_day_col2 VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL day(1) to second(4) DATA TYPE CONVERTED TO VARCHAR ***/!!!
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO interval_day_to_second_table(interval_day_col1) VALUES ('1d, 2h, 3m, 4s, 56ms');

	INSERT INTO interval_day_to_second_table(interval_day_col2) VALUES ('1d, 2h, 3m, 4s, 56ms');
```

The Interval value is transformed to a supported Snowflake format and then inserted as text inside the column. Since Snowflake does not support **Interval** as a data type, it is only supported in arithmetic operations. To use the value, it needs to be extracted and used as an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

**Original Oracle value:** `INTERVAL '1 2:3:4.567' DAY TO SECOND`

**Value stored in Snowflake column:** `'1d, 2h, 3m, 4s, 567ms'`

**Value as Snowflake Interval constant:** `INTERVAL '1d, 2h, 3m, 4s, 567ms'`

#### Retrieving data from an Interval Day to Second column

##### Oracle

```sql
SELECT * FROM interval_day_to_second_table;
```

###### Result

| INTERVAL_DAY_COL1 | INTERVAL_DAY_COL2 |
| --- | --- |
| 1 2:3:4.567 |  |
|  | 1 2:3:4.567 |

##### Snowflake

```sql
SELECT * FROM
interval_day_to_second_table;
```

###### Result

| INTERVAL_DAY_COL1 | INTERVAL_DAY_COL2 |
| --- | --- |
| 1d, 2h, 3m, 4s, 56ms |  |
|  | 1d, 2h, 3m, 4s, 56ms |

### Known Issues

#### 1. Only arithmetic operations are supported

Snowflake Intervals have several limitations. Only arithmetic operations between `DATE` or `TIMESTAMP` and [Interval Constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) are supported, every other scenario is not supported.

### Related EWIs

1. [SSC-EWI-0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## INTERVAL YEAR TO MONTH Data Type

### Description

> INTERVAL YEAR TO MONTH stores a period of time using the YEAR and MONTH datetime fields. There is no equivalent in Snowflake so it is transformed to Varchar ([Oracle SQL Language Reference INTERVAL YEAR TO MONTH Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-ED59E1B3-BA8D-4711-B5C8-B0199C676A95))

By default, there is no equivalent for this data type in Snowflake and it is transformed to VARCHAR.

> **Note:**
>
> **Preview Feature:** When the `--UseIntervalDatatype` [preview flag](../../../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled, Oracle `INTERVAL YEAR TO MONTH` columns are preserved as native Snowflake `INTERVAL YEAR TO MONTH` types. See the [Interval Data Types](../../../general/interval-data-types.md) translation reference for complete transformation details.

```sql
INTERVAL YEAR [(year_precision)] TO MONTH
```

### Sample Source Patterns

#### Interval Year To Month in Create Table

##### Oracle

```sql
CREATE TABLE interval_year_to_month_table
(
	interval_year_col1 interval year to month,
	interval_year_col2 interval year(4) to month
);

INSERT INTO interval_year_to_month_table(interval_year_col1) VALUES ( INTERVAL '1-2' YEAR TO MONTH );
INSERT INTO interval_year_to_month_table(interval_year_col2) VALUES ( INTERVAL '1000-11' YEAR(4) TO MONTH );
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE interval_year_to_month_table
	(
		interval_year_col1 VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL year to month DATA TYPE CONVERTED TO VARCHAR ***/!!!,
		interval_year_col2 VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL year(4) to month DATA TYPE CONVERTED TO VARCHAR ***/!!!
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO interval_year_to_month_table(interval_year_col1) VALUES ('1y, 2mm');

	INSERT INTO interval_year_to_month_table(interval_year_col2) VALUES ('1000y, 11mm');
```

The Interval value is transformed to a supported Snowflake format and then inserted as text inside the column. Since Snowflake does not support **Interval** as a data type, it is only supported in arithmetic operations. To use the value, it needs to be extracted and used as an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

**Original Oracle value:** `INTERVAL '1-2' YEAR TO MONTH`

**Value stored in Snowflake column:** `'1y, 2m'`

**Value as Snowflake Interval constant:** `INTERVAL '1y, 2m'`

#### Retrieving data from an Interval Year To Month column

##### Oracle

```sql
SELECT * FROM interval_year_to_month_table;
```

###### Result

| INTERVAL_YEAR_COL1 | INTERVAL_YEAR_COL2 |
| --- | --- |
| 1-2 |  |
|  | 1000-11 |

##### Snowflake

```sql
SELECT * FROM
interval_year_to_month_table;
```

###### Result

| INTERVAL_YEAR_COL1 | INTERVAL_YEAR_COL2 |
| --- | --- |
| 1y, 2m |  |

```none
              |1000y, 11m        |
```

### Known Issues

#### 1. Only arithmetic operations are supported

Snowflake Intervals have several limitations. Only arithmetic operations between `DATE` or `TIMESTAMP` and [Interval Constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) are supported, every other scenario is not supported.

### Related EWIs

* [SSC-EWI-0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## TIMESTAMP Data Type

### Description

> The TIMESTAMP data type is an extension of the DATE data type. It stores the year, month, and day of the DATE data type, plus hour, minute, and second values. ([Oracle SQL Language Reference Timestamp Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-94A82966-D380-4583-9AF1-AEE681881E64))

Both Oracle and Snowflake `TIMESTAMP` data types have the same precision range (0-9) but different default values. In Oracle, the default precision value is 6 and in Snowflake is 9.

However, there is a difference in behavior when an inserted value exceeds the set precision. Oracle rounds up the exceeding decimals, while Snowflake just trims the values.

```sql
TIMESTAMP [(fractional_seconds_precision)]
```

### Sample Source Patterns

#### Timestamp in Create Table

##### Oracle

```sql
CREATE TABLE timestamp_table
(
	timestamp_col1 TIMESTAMP,
	timestamp_col2 TIMESTAMP(7)
);

INSERT INTO timestamp_table(timestamp_col1, timestamp_col2) VALUES (TIMESTAMP '2010-10-10 12:00:00', TIMESTAMP '2010-10-10 12:00:00');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE timestamp_table
	(
		timestamp_col1 TIMESTAMP(6),
		timestamp_col2 TIMESTAMP(7)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO timestamp_table(timestamp_col1, timestamp_col2) VALUES (TIMESTAMP '2010-10-10 12:00:00', TIMESTAMP '2010-10-10 12:00:00');
```

#### Retrieving data from a Timestamp column

##### Oracle

```sql
SELECT * FROM timestamp_table;
```

###### Result

| TIMESTAMP_COL1 | TIMESTAMP_COL2 |
| --- | --- |
| 2010-10-10 12:00:00.000 | 2010-10-10 12:00:00.000 |

##### Snowflake

```sql
SELECT * FROM
timestamp_table;
```

###### Result

| TIMESTAMP_COL1 | TIMESTAMP_COL2 |
| --- | --- |
| 2010-10-10 12:00:00.000 | 2010-10-10 12:00:00.000 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## TIMESTAMP WITH LOCAL TIME ZONE Data Type

### Description

> It differs from TIMESTAMP WITH TIME ZONE in that data stored in the database is normalized to the database time zone, and the time zone information is not stored as part of the column data..([Oracle SQL Language Reference Timestamp with Local Time Zone Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-E7CA339A-2093-4FE4-A36E-1D09593591D3))

The Snowflake equivalent is [TIMESTAMP_LTZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#timestamp-ltz-timestamp-ntz-timestamp-tz).

For more information, see also the TIMESTAMP section.

```none
TIMESTAMP [(fractional_seconds_precision)] WITH LOCAL TIME ZONE
```

### Sample Source Patterns

#### Timestamp with Time Zone in Create Table

##### Oracle

```sql
CREATE TABLE timestamp_with_local_time_zone_table
(
	timestamp_col1 TIMESTAMP(5) WITH LOCAL TIME ZONE
);

INSERT INTO timestamp_with_local_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00');
INSERT INTO timestamp_with_local_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -08:00');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE timestamp_with_local_time_zone_table
	(
		timestamp_col1 TIMESTAMP_LTZ(5)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO timestamp_with_local_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00');

	INSERT INTO timestamp_with_local_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -08:00');
```

#### Retrieving data from a Timestamp with Local Time Zone column

##### Oracle

```sql
SELECT * FROM timestamp_with_local_time_zone_table;
```

##### Result

| TIMESTAMP_COL1 |
| --- |
| 2010-10-10 18:00:00.000 |
| 2010-10-10 20:00:00.000 |

##### Snowflake

```sql
SELECT * FROM
timestamp_with_local_time_zone_table;
```

##### Result

| TIMESTAMP_COL1 |
| --- |
| 2010-10-10 12:00:00.000 -0700 |
| 2010-10-10 12:00:00.000 -0700 |

> **Note:**
>
> Note that the results are different in both engines because each database is set with a different time zone. The Oracle timezone is ‘+00:00’ and the Snowflake timezone is ‘America/Los_Angeles’.

Use the following syntax to change the default timezone of the database:

```sql
ALTER account SET timezone = timezone_string;
```

### Known Issues

#### 1. Default database timezone

The operations with this kind of data type will be affected by the database timezone, the results may be different. You can check the default timezone using the following queries:

##### Oracle

```sql
SELECT dbtimezone FROM dual;
```

##### Snowflake

```sql
SELECT dbtimezone FROM dual;
```

##### 2. Oracle Timestamp with local timezone behavior

When operating timestamps with local timezone data types, Oracle converts the timestamps to the default timezone of the database. To emulate this behavior in Snowflake, the TIMESTAMP_TYPE_MAPPING session parameter should be set to ‘TIMESTAMP_LTZ’.

```sql
ALTER SESSION SET TIMESTAMP_TYPE_MAPPING = 'TIMESTAMP_LTZ';
```

##### 3. Timestamp formats may be different

Snow Convert does not perform any conversion for the date/timestamps format strings, so there may be errors when deploying the code. Example:

##### Oracle

```sql
INSERT INTO timestamp_with_local_time_zone_table (timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -8:00');
```

##### Snowflake

```sql
INSERT INTO timestamp_with_local_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -8:00');
```

> **Warning:**
>
> The query will fail in Snowflake because the default timestamp input format does not recognize ‘-8:00’ as a valid UTC offset. It should be replaced with ‘0800’ or ‘-08:00’ to get the same result.

### Related EWIs

No related EWIs.

## TIMESTAMP WITH TIME ZONE Data Type

### Description

> TIMESTAMP WITH TIME ZONE is a variant of TIMESTAMP that includes a time zone region name or a time zone offset in its value. The Snowflake equivalent is TIMESTAMP_TZ.([Oracle SQL Language Reference Timestamp with Time Zone Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-BE23545B-469A-4A57-8D13-505F2F5DB706))

The Snowflake equivalent is [TIMESTAMP_TZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#timestamp-ltz-timestamp-ntz-timestamp-tz).

For more information, see also the TIMESTAMP section.

```none
TIMESTAMP [(fractional_seconds_precision)] WITH TIME ZONE
```

### Sample Source Patterns

#### Timestamp with Time Zone in Create Table

##### Oracle

```sql
CREATE TABLE timestamp_with_time_zone_table
(
	timestamp_col1 TIMESTAMP(5) WITH TIME ZONE
);

INSERT INTO timestamp_with_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE timestamp_with_time_zone_table
	(
		timestamp_col1 TIMESTAMP_TZ(5)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO timestamp_with_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00');
```

#### Retrieving data from a Timestamp with Time Zone column

##### Oracle

```sql
SELECT * FROM timestamp_with_time_zone_table;
```

##### Result

| TIMESTAMP_COL1 |
| --- |
| 2010-10-10 12:00:00.000 -0600 |

##### Snowflake

```sql
SELECT * FROM
timestamp_with_time_zone_table;
```

##### Result

| TIMESTAMP_COL1 |
| --- |
| 2010-10-10 12:00:00.000 -0700 |

> **Note:**
>
> Note that the timezone is different in both engines because when the timezone is not specified, the default timezone of the database is added.

Use the following syntax to change the default timezone of the database:

```sql
ALTER account SET sqtimezone = timezone_string;
```

### Known Issues

#### 1. Timestamp formats may be different

Snow Convert does not perform any conversion for the date/timestamps format strings, so there may be errors when deploying the code. Example:

##### Oracle

```sql
INSERT INTO timestamp_with_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -8:00');
```

##### Snowflake

```sql
INSERT INTO timestamp_with_time_zone_table(timestamp_col1) VALUES (TIMESTAMP '2010-10-10 12:00:00 -8:00');
```

> **Warning:**
>
> The query will fail in Snowflake because the default timestamp input format does not recognize ‘-8:00’ as a valid UTC offset. It should be replaced with ‘-0800’ or ‘-08:00’ to get the same result.

### Related EWIs

No related EWIs.

## Datetime Arithmetic

This content explains the current transformation for some arithmetic operations between datetime types.

### Description

In Oracle, some arithmetic operations could be performed between DateTime types, like addition, subtraction, multiplication, and division. Currently, SnowConvert AI can resolve some cases of addition and subtraction. These cases are explained below.

### Sample Source Patterns

This is a summary of the current transformation for the different combinations of the addition and subtraction operations with date, timestamps, number, and unknown types.

> **Note:**
>
> **Consider the next table for the examples below.**

#### Oracle

```sql
CREATE OR REPLACE TABLE TIMES (
AsTimeStamp TIMESTAMP(6),
AsTimestampTwo TIMESTAMP(6),
AsDate TIMESTAMP,
AsDateTwo TIMESTAMP
);

INSERT INTO TIMES
VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE TIMES (
 AsTimeStamp TIMESTAMP(6),
 AsTimestampTwo TIMESTAMP(6),
 AsDate TIMESTAMP(6),
 AsDateTwo TIMESTAMP(6)
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;

 INSERT INTO TIMES
 VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));
```

### Addition

#### Combination Matrix

This is a summary of how the migrator resolves the addition operations for the different combinations with date, timestamps, number, and unknown types.

| Addition | Date | Timestamp | Number | Interval | Unknown | Float |
| --- | --- | --- | --- | --- | --- | --- |
| **Date** | INVALID | INVALID | Date + Interval day | Date + Interval IntervalUnit | DATEADD_UDF | DATEADD_UDF |
| **Timestamp** | INVALID | INVALID | Timestamp + Interval day | Timestamp + Interval IntervalUnit | DATEADD_UDF | DATEADD_UDF |
| **Number** | Date + Interval day | Timestamp + Interval day | Number + Number | INVALID | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Number + Float |
| **Interval** | Date + Interval IntervalUnit | Timestamp + Interval IntervalUnit | INVALID | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Unknown + Interval IntervalUnit | INVALID |
| **Unknown** | DATEADD_UDF | DATEADD_UDF | Unknown + Number | Unknown + Interval IntervalUnit | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) |
| **Float** | DATEADD_UDF | DATEADD_UDF | Float + Number | INVALID | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Float + Float |

> **Note:**
>
> An Unknown Type column is the result of the migrator being unable to establish the data type that the column contains. This can happen for many reasons, for example, missing DDLs for the tables being operated on, or columns resulting from operations on views, CTEs, or subqueries.

> **Warning:**
>
> By default, Snow Convert migrates operations of type Date/Timestamp + Interval to the native Snowflake operations, but in some cases may be useful to use [UDF](../../functions/custom_udfs.md) instead. For further details, see Interval UDFs vs. Snowflake native interval operation.

The different paths that the migrator can use for resolving the add operations will be explained below:

#### Invalid

Certain combinations are not valid to perform addition operations in Oracle:

##### Oracle

```sql
SELECT AsDate + AsDateTwo From TIMES;

SELECT AsDate + AsTimeStamp From TIMES;
```

##### Result

```none
SQL Error [975] [42000]: ORA-00975: date + date not allowed

SQL Error [30087] [99999]: ORA-30087: Adding two datetime values is not allowed
```

#### Date + Interval day

This is the current transformation for the addition operation between a date type and a number (and vice versa). For example

##### Oracle

```sql
SELECT AsDate + 1 FROM TIMES;

SELECT 1 + AsDate FROM TIMES;
```

##### Result

| ASDATE+1 |
| --- |
| 2021-11-07 00:00:00.000 |

| 1+ASDATE |
| --- |
| 2021-11-07 00:00:00.000 |

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
 AsDate + 1 FROM
 TIMES;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Number AND unknown ***/!!! 1 + AsDate FROM
 TIMES;
```

##### Result

| ASDATE + INTERVAL ‘1 DAY’ |
| --- |
| 2021-11-07 |

#### Timestamp + Interval day

This is the current transformation for the addition operation between a timestamp type and a number (and vice versa). For example

##### Oracle

```sql
SELECT AsTimestamp + 1 FROM TIMES;

SELECT 1 + AsTimestamp FROM TIMES;
```

##### Result

| ASTIMESTAMP+1 |
| --- |
| 2021-11-06 11:00:00.000 |

| 1+ASTIMESTAMP |
| --- |
| 2021-11-06 11:00:00.000 |

> **Note:**
>
> Note: In Oracle, both DATE and TIMESTAMP columns contain a time component, but Oracle has used the format mask specified by the NLS_DATE_FORMAT parameter to decide how to implicitly convert the date to a string, that is why when performing some operations between TIMESTAMP and Intervals, he result could be shown as DATE, hiding the time component, unless the NLS_DATE_FORMAT parameter is changed.

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
 AsTimestamp + 1 FROM
 TIMES;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Number AND unknown ***/!!! 1 + AsTimestamp FROM
 TIMES;
```

##### Result

| ASTIMESTAMP + INTERVAL ‘1 DAY’ |
| --- |
| 2021-11-06 11:00:00.000 |

#### DATEADD_UDF

For those cases where there is an addition operation between a date or timestamp type and an unknown type, a user-defined function (UDF) is added. See the [DATEADD_UDF implementation](../../functions/custom_udfs.md) for details. The UDF is located in the UDFs folder. For example:

> **Note:**
>
> For the following examples, a subquery will be used, trying to simulate the Unknown Type column

##### Oracle

```sql
SELECT AsDate + (SELECT EXTRACT(DAY FROM AsTimestampTwo) FROM TIMES) FROM TIMES;

SELECT AsTimestamp + (SELECT EXTRACT(DAY FROM AsTimestampTwo) FROM TIMES) FROM TIMES;
```

##### Result

| ASDATE+(SELECTEXTRACT(DAYFROMASTIMESTAMPTWO)FROMTIMES) |
| --- |
| 2021-11-11 00:00:00.000 |

| ASTIMESTAMP+(SELECTEXTRACT(DAYFROMASTIMESTAMPTWO)FROMTIMES) |
| --- |
| 2021-11-10 11:00:00.000 |

##### Snowflake

```sql
SELECT AsDate + (SELECT EXTRACT(DAY FROM AsTimestampTwo) FROM
TIMES
) FROM
TIMES;

SELECT AsTimestamp + (SELECT EXTRACT(DAY FROM AsTimestampTwo) FROM
TIMES
) FROM
TIMES;
```

##### Result

| PUBLIC.DATEADD_UDF( ASDATE, (SELECT EXTRACT(DAY FROM ASTIMESTAMPTWO) FROM PUBLIC.TIMES)) |
| --- |
| 2021-11-11 |

| PUBLIC.DATEADD_UDF( ASTIMESTAMP, (SELECT EXTRACT(DAY FROM ASTIMESTAMPTWO) FROM PUBLIC.TIMES)) |
| --- |
| 2021-11-10 11:00:00.000 |

### Subtraction

#### Combination Matrix

| Subtraction | Date | Timestamp | Number | Interval | Unknown | Float |
| --- | --- | --- | --- | --- | --- | --- |
| **Date** | DATEDIFF | TIMESTAMP_DIFF___UDF | Date - Interval day | Date - Interval IntervalUnit | DATEDIFF_UDF | DATEDIFF_UDF |
| **Timestamp** | TIMESTAMP_DIFF___UDF | TIMESTAMP_DIFF___UDF | Timestamp - Interval day | Timestamp - Interval IntervalUnit | DATEDIFF_UDF | DATEDIFF_UDF |
| **Number** | INVALID | INVALID | Number - Number | INVALID | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Number - Float |
| **Interval** | INVALID | INVALID | INVALID | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Unknown - Interval IntervalUnit | NOT SUPPORTED IN ORACLE |
| **Unknown** | DATEDIFF_UDF | DATEDIFF_UDF | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Unknown - Interval IntervalUnit | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) |
| **Float** | DATEDIFF_UDF | DATEDIFF_UDF | Float - Number | NOT SUPPORTED IN ORACLE | [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) | Float - Float |

> **Note:**
>
> An Unknown Type column is the result of the migrator being unable to establish the data type that the column contains. This can happen for many reasons, for example, missing DDLs for the tables being operated on, or columns resulting from operations on views, CTEs, or subqueries.

> **Warning:**
>
> By default, Snow Convert migrates operations of type Date/Timestamp + Interval to the native Snowflake operations, but in some cases may be useful to use [UDF](../../functions/custom_udfs.md) instead. For further details, see Interval UDFs vs. Snowflake native interval operation.

The different paths that the migrator can use for resolving the subtract operations will be explained below:

#### Invalid

Certain combinations are not valid to perform subtraction operations in Oracle:

##### Oracle

```sql
SELECT 1 - AsDate FROM TIMES;

SELECT 1 - AsTimestamp FROM TIMES;
```

##### Result

```none
SQL Error [932] [42000]: ORA-00932: inconsistent datatypes: expected NUMBER got DATE

SQL Error [932] [42000]: ORA-00932: inconsistent datatypes: expected NUMBER got TIMESTAMP
```

#### DATEDIFF

The subtraction between two operands of date type is converted to the Snowflake DATEDIFF function, using as a time unit (first parameter) ‘day’. For example

##### Oracle

```sql
SELECT AsDate - AsDateTwo FROM TIMES;
```

##### Result

| ASDATE-ASDATETWO |
| --- |
| 1 |

##### Snowflake

```sql
SELECT AsDate - AsDateTwo FROM
TIMES;
```

##### Result

| DATEDIFF(DAY, ASDATETWO, ASDATE) |
| --- |
| 1 |

#### Date - Interval day

This is the current transformation for the subtraction operation between a date type and a number. For example

##### Oracle

```sql
SELECT AsDate - 1 FROM TIMES;

SELECT AsDate + -1 FROM TIMES;
```

##### Result

| ASDATE-1 |
| --- |
| 2021-11-05 00:00:00.000 |

| ASDATE+-1 |
| --- |
| 2021-11-05 00:00:00.000 |

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
 AsDate - 1 FROM
 TIMES;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!! AsDate + -1 FROM
 TIMES;
```

##### Result

| ASDATE - INTERVAL ‘1 DAY’ |
| --- |
| 2021-11-05 |

| ASDATE + INTERVAL ‘-1 DAY’ |
| --- |
| 2021-11-05 |

#### Timestamp - Interval day

This is the current transformation for the addition operation between a timestamp type and a number. For example

##### Oracle

```sql
SELECT AsTimestamp - 1 FROM TIMES;

SELECT AsTimestamp + -1 FROM TIMES;
```

##### Result

| ASTIMESTAMP-1 |
| --- |
| 2021-11-04 11:00:00.000 |

| ASTIMESTAMP+-1 |
| --- |
| 2021-11-04 11:00:00.000 |

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!
 AsTimestamp - 1 FROM
 TIMES;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!! AsTimestamp + -1 FROM
 TIMES;
```

##### Result

| ASTIMESTAMP - INTERVAL ‘1 DAY’ |
| --- |
| 2021-11-04 11:00:00.000 |

| ASTIMESTAMP + INTERVAL ‘-1 DAY’ |
| --- |
| 2021-11-04 11:00:00.000 |

> **Note:**
>
> Note: In Oracle, both DATE and TIMESTAMP columns contain a time component, but Oracle uses the format mask specified by the NLS_DATE_FORMAT parameter to decide how to implicitly convert the date to a string, that is why when performing some operations between the TIMESTAMP and Intervals, the result could be shown as DATE, hiding the time component, unless the NLS_DATE_FORMAT parameter is changed.
>
> For more information, see the [Oracle NLS_DATE_FORMAT documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/NLS_DATE_FORMAT.html#GUID-FC23EEEE-AA9F-4B3C-8CBB-888C9C0CA27F).

#### TIMESTAMP_DIFF_UDF

The subtractions between timestamp types and dates with a timestamp and vice versa; are resolved by inserting the TIMESTAMP_DIFF_UDF user-defined function, (see the [TIMESTAMP_DIFF_UDF implementation](../../functions/custom_udfs.md)). For example

##### Oracle

```sql
SELECT AsTimeStamp - AsTimeStampTwo FROM TIMES;

SELECT AsTimeStamp - AsDateTwo FROM TIMES;

SELECT AsDateTwo - AsTimeStamp FROM TIMES;
```

##### Result

| ASTIMESTAMP-ASTIMESTAMPTWO |
| --- |
| +000000000 01:00:00.000000 |

| ASTIMESTAMP-ASDATETWO |
| --- |
| +000000000 11:00:00.000000 |

| ASDATETWO-ASTIMESTAMP |
| --- |
| -000000000 11:00:00.000000 |

##### Snowflake

```sql
SELECT AsTimeStamp - AsTimeStampTwo FROM
TIMES;

SELECT AsTimeStamp - AsDateTwo FROM
TIMES;

SELECT AsDateTwo - AsTimeStamp FROM
TIMES;
```

##### Result

| PUBLIC.TIMESTAMP_DIFF_UDF( ASTIMESTAMP, ASTIMESTAMPTWO) |
| --- |
| +000000000 01:00:00.00000000 |

| PUBLIC.TIMESTAMP_DIFF_UDF( ASTIMESTAMP, ASDATETWO) |
| --- |
| +000000000 11:00:00.00000000 |

| PUBLIC.TIMESTAMP_DIFF_UDF( ASDATETWO, ASTIMESTAMP) |
| --- |
| -000000000 -11:00:00.00000000 |

#### DATEDIFF_UDF

For those cases where there is an addition operation between a date or timestamp type and an unknown type, a user-defined function (UDF) is added. See the [DATEDIFF_UDF implementation](../../functions/custom_udfs.md), which could be edited to perform what is required. The UDF is located in the UDFs folder. For example:

##### Oracle

```sql
SELECT ASDATE - (EXTRACT(DAY FROM ASDATE)) FROM TIMES;

SELECT ASTIMESTAMP - (EXTRACT(DAY FROM ASDATE)) FROM TIMES;
```

##### Result

| ASDATE-(EXTRACT(DAYFROMASDATE)) |
| --- |
| 2021-10-31 00:00:00.000 |

| ASTIMESTAMP-(EXTRACT(DAYFROMASDATE)) |
| --- |
| 2021-10-30 11:00:00.000 |

##### Snowflake

```sql
SELECT ASDATE - (EXTRACT(DAY FROM ASDATE)) FROM
TIMES;

SELECT ASTIMESTAMP - (EXTRACT(DAY FROM ASDATE)) FROM
TIMES;
```

##### Result

| PUBLIC.DATEDIFF_UDF( ASDATE, (EXTRACT(DAY FROM ASDATE))) |
| --- |
| 2021-10-31 |

| PUBLIC.DATEDIFF_UDF( ASTIMESTAMP, (EXTRACT(DAY FROM ASDATE))) |
| --- |
| 2021-10-30 11:00:00.000 |

### Common Cases

#### Warning: SSC-EWI-OR0036

This warning is used to indicate whether an addition or subtraction operation may not behave correctly due to the operands data types. It means that maybe the result of the operation in Snowflake is not functionally equivalent to Oracle. The addition and subtraction between a date or numeric type and an unknown type are one of the most common cases. For example

##### Oracle

```sql
SELECT AsDate - (EXTRACT(DAY FROM ASDATE)) FROM TIMES;
```

##### Snowflake

```sql
SELECT AsDate - (EXTRACT(DAY FROM ASDATE)) FROM
TIMES;
```

This EWI is added in operations where the type of a column could not be resolved, if the column type is INTERVAL and it is operated only with other intervals, EWI will be added but code will not be commented out. The following example describes this behavior:

##### Oracle

```sql
SELECT INTERVAL '1' DAY + interval_column FROM UNKNOWN_TABLE;
```

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
interval_column + INTERVAL '1 day' FROM
UNKNOWN_TABLE;
```

### Known Issues

#### 1. TIMESTAMP DIFF UDF improvement

The TIMESTAMP_DIFF_UDF must be improved to be able to specify the return type. It means adding a third parameter where it is possible to
specify the time part, such as day, hour, or month.

##### 2. Built-in functions as operators

There is currently no management for date operations between built-in functions that return date types.

##### 3. Multiple operands

Currently, there is no management for date operation with more than two operands, it may work but you may also find issues.

##### 4. Comparison operators

Currently, there is no management for date operations with comparison operators, such as greater than or less than.

##### 5. Output format

The result’s format of the arithmetic operations could be changed by using the next command `ALTER SESSION SET DATE_OUTPUT_FORMAT = 'DESIRED-FORMAT';` in Snowflake.

##### 6. Issues in interval operations with seconds precision

Some operations may differ in precision, specifically those that include intervals with seconds precision, this is because Oracle rounds depending on the precision, Snowflake’s interval does not support seconds with decimal places, to have the same result, it is necessary to change the second decimal places by milliseconds in intervals considering the rounding that Oracle performs. The following example shows this issue

##### Oracle

```sql
SELECT AsTimeStamp+INTERVAL '15.6789' SECOND(2,3) FROM times;

SELECT AsTimeStamp+INTERVAL '15.6783' SECOND(2,3) FROM times;
```

##### Result

| ASTIMESTAMP+INTERVAL’15.6789’SECOND(2,3) |
| --- |
| 2021-11-05 11:00:15.679 |

| ASTIMESTAMP+INTERVAL’15.6783’SECOND(2,3) |
| --- |
| 2021-11-05 11:00:15.678 |

##### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 AsTimeStamp + INTERVAL '15.6789 second'
FROM
 times;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!! AsTimeStamp + INTERVAL '15.6783 second'
FROM
 times;
```

##### Result

| ASTIMESTAMP + INTERVAL ‘15.6789 SECOND’ |
| --- |
| 2021-11-05 11:00:16.000 |

| ASTIMESTAMP + INTERVAL ‘15.6783 SECOND’ |
| --- |
| 2021-11-05 11:00:16.000 |

| ASTIMESTAMP + INTERVAL ‘15 SECOND, 679 MILLISECOND’ |
| --- |
| 2021-11-05 11:00:15.679 |

| ASTIMESTAMP + INTERVAL ‘15 SECOND, 678 MILLISECOND’ |
| --- |
| 2021-11-05 11:00:15.678 |

### Related EWIs

1. [SSC-EWI-0108](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The following subquery matches at least one of the patterns considered invalid and may produce compilation errors.
2. [SSC-EWI-OR0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.

## Interval UDFs vs Snowflake native interval operation

### Description

The following table shows a comparison between the [DATEADD_UDF INTERVAL](../../functions/custom_udfs.md) and [DATEDIFF_UDF INTERVAL](../../functions/custom_udfs.md) vs the [Snowflake native operation](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) for interval arithmetic.

#### Necessary Code

To run the queries of the comparative table it is necessary to run the following code:

```sql
CREATE OR REPLACE TABLE TIMES(
AsTimeStamp TIMESTAMP,
AsTimestampTwo TIMESTAMP,
AsDate DATE,
AsDateTwo DATE
);

INSERT INTO TIMES VALUES (
  TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
  TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
  TO_DATE('06/11/21', 'dd/mm/yy'),
  TO_DATE('05/11/21', 'dd/mm/yy'));

CREATE TABLE UNKNOWN_TABLE(
  Unknown timestamp
);

INSERT INTO UNKNOWN_TABLE VALUES (
  TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.')
);
```

```sql
CREATE OR REPLACE TABLE TIMES (
  AsTimeStamp TIMESTAMP(6),
  AsTimestampTwo TIMESTAMP(6),
  AsDate TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
  AsDateTwo TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
  ;

  INSERT INTO TIMES
  VALUES (
  TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
  TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
  TO_DATE('06/11/21', 'dd/mm/yy'),
  TO_DATE('05/11/21', 'dd/mm/yy'));

  CREATE OR REPLACE TABLE UNKNOWN_TABLE (
  Unknown TIMESTAMP(6)
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
  ;

  INSERT INTO UNKNOWN_TABLE
  VALUES (
  TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.')
);
```

### Comparison Table

#### Oracle

```sql
SELECT AsTimeStamp+INTERVAL '1-1' YEAR(2) TO MONTH FROM TIMES;
SELECT AsTimeStamp-INTERVAL '1-1' YEAR(2) TO MONTH FROM TIMES;
SELECT AsTimeStamp+INTERVAL '2-1' YEAR(4) TO MONTH FROM TIMES;
SELECT AsTimeStamp-INTERVAL '2-1' YEAR(4) TO MONTH FROM TIMES;
SELECT AsTimeStamp+INTERVAL '1' MONTH FROM TIMES;
SELECT AsTimeStamp-INTERVAL '1' MONTH FROM TIMES;
SELECT AsTimeStamp+INTERVAL '2' MONTH FROM TIMES;
SELECT AsTimeStamp-INTERVAL '2' MONTH FROM TIMES;
SELECT AsTimeStamp+INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM TIMES;
SELECT AsTimeStamp-INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM TIMES;
SELECT AsTimeStamp+INTERVAL '1 01:10' DAY TO MINUTE FROM TIMES;
SELECT AsTimeStamp-INTERVAL '1 01:10' DAY TO MINUTE FROM TIMES;
SELECT AsTimeStamp+INTERVAL '1 1' DAY TO HOUR FROM TIMES;
SELECT AsTimeStamp-INTERVAL '1 1' DAY TO HOUR FROM TIMES;
SELECT AsTimeStamp+INTERVAL '10' DAY FROM TIMES;
SELECT AsTimeStamp-INTERVAL '10' DAY FROM TIMES;
SELECT AsTimeStamp+INTERVAL '3:05' HOUR TO MINUTE FROM TIMES;
SELECT AsTimeStamp-INTERVAL '3:05' HOUR TO MINUTE FROM TIMES;
SELECT AsTimeStamp+INTERVAL '5' HOUR FROM TIMES;
SELECT AsTimeStamp-INTERVAL '5' HOUR FROM TIMES;
SELECT AsTimeStamp+INTERVAL '5:10' MINUTE TO SECOND FROM TIMES;
SELECT AsTimeStamp-INTERVAL '5:10' MINUTE TO SECOND FROM TIMES;
SELECT AsTimeStamp+INTERVAL '30' MINUTE FROM TIMES;
SELECT AsTimeStamp-INTERVAL '30' MINUTE FROM TIMES;
SELECT AsTimeStamp+INTERVAL '333' HOUR(3) FROM TIMES;
SELECT AsTimeStamp-INTERVAL '333' HOUR(3) FROM TIMES;
SELECT AsTimeStamp+INTERVAL '15.6789' SECOND(2,3) FROM TIMES;
SELECT AsTimeStamp-INTERVAL '15.6789' SECOND(2,3) FROM TIMES;
SELECT AsDate+INTERVAL '1-1' YEAR(2) TO MONTH FROM TIMES;
SELECT AsDate-INTERVAL '1-1' YEAR(2) TO MONTH FROM TIMES;
SELECT AsDate+INTERVAL '2-1' YEAR(4) TO MONTH FROM TIMES;
SELECT AsDate-INTERVAL '2-1' YEAR(4) TO MONTH FROM TIMES;
SELECT AsDate+INTERVAL '1' MONTH FROM TIMES;
SELECT AsDate-INTERVAL '1' MONTH FROM TIMES;
SELECT AsDate+INTERVAL '2' MONTH FROM TIMES;
SELECT AsDate-INTERVAL '2' MONTH FROM TIMES;
SELECT AsDate+INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM TIMES;
SELECT AsDate-INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM TIMES;
SELECT AsDate+INTERVAL '1 01:10' DAY TO MINUTE FROM TIMES;
SELECT AsDate-INTERVAL '1 01:10' DAY TO MINUTE FROM TIMES;
SELECT AsDate+INTERVAL '1 1' DAY TO HOUR FROM TIMES;
SELECT AsDate-INTERVAL '1 1' DAY TO HOUR FROM TIMES;
SELECT AsDate+INTERVAL '10' DAY FROM TIMES;
SELECT AsDate-INTERVAL '10' DAY FROM TIMES;
SELECT AsDate+INTERVAL '3:05' HOUR TO MINUTE FROM TIMES;
SELECT AsDate-INTERVAL '3:05' HOUR TO MINUTE FROM TIMES;
SELECT AsDate+INTERVAL '5' HOUR FROM TIMES;
SELECT AsDate-INTERVAL '5' HOUR FROM TIMES;
SELECT AsDate+INTERVAL '5:10' MINUTE TO SECOND FROM TIMES;
SELECT AsDate-INTERVAL '5:10' MINUTE TO SECOND FROM TIMES;
SELECT AsDate+INTERVAL '30' MINUTE FROM TIMES;
SELECT AsDate-INTERVAL '30' MINUTE FROM TIMES;
SELECT AsDate+INTERVAL '333' HOUR(3) FROM TIMES;
SELECT AsDate-INTERVAL '333' HOUR(3) FROM TIMES;
SELECT AsDate+INTERVAL '15.6789' SECOND(2,3) FROM TIMES;
SELECT AsDate-INTERVAL '15.6789' SECOND(2,3) FROM TIMES;
SELECT Unknown+INTERVAL '1-1' YEAR(2) TO MONTH FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '1-1' YEAR(2) TO MONTH FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '2-1' YEAR(4) TO MONTH FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '2-1' YEAR(4) TO MONTH FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '1' MONTH FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '1' MONTH FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '2' MONTH FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '2' MONTH FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '1 01:00:00.222' DAY TO SECOND(3) FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '1 01:10' DAY TO MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '1 01:10' DAY TO MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '1 1' DAY TO HOUR FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '1 1' DAY TO HOUR FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '10' DAY FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '10' DAY FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '3:05' HOUR TO MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '3:05' HOUR TO MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '5' HOUR FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '5' HOUR FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '5:10' MINUTE TO SECOND FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '5:10' MINUTE TO SECOND FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '30' MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '30' MINUTE FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '333' HOUR(3) FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '333' HOUR(3) FROM UNKNOWN_TABLE;
SELECT Unknown+INTERVAL '15.6789' SECOND(2,3) FROM UNKNOWN_TABLE;
SELECT Unknown-INTERVAL '15.6789' SECOND(2,3) FROM UNKNOWN_TABLE;
SELECT INTERVAL '1-1' YEAR(2) TO MONTH+ AsTimeStamp FROM TIMES;
SELECT INTERVAL '1-1' YEAR(2) TO MONTH+AsDate FROM TIMES;
SELECT INTERVAL '1-1' YEAR(2) TO MONTH+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '2-1' YEAR(4) TO MONTH+AsTimeStamp FROM TIMES;
SELECT INTERVAL '2-1' YEAR(4) TO MONTH+AsDate FROM TIMES;
SELECT INTERVAL '2-1' YEAR(4) TO MONTH+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '1' MONTH+AsTimeStamp FROM TIMES;
SELECT INTERVAL '1' MONTH+AsDate FROM TIMES;
SELECT INTERVAL '1' MONTH+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '2' MONTH+AsTimeStamp FROM TIMES;
SELECT INTERVAL '2' MONTH+AsDate FROM TIMES;
SELECT INTERVAL '2' MONTH+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '1 01:00:00.222' DAY TO SECOND(3)+AsTimeStamp FROM TIMES;
SELECT INTERVAL '1 01:00:00.222' DAY TO SECOND(3)+AsDate FROM TIMES;
SELECT INTERVAL '1 01:00:00.222' DAY TO SECOND(3)+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '1 01:10' DAY TO MINUTE+AsTimeStamp FROM TIMES;
SELECT INTERVAL '1 01:10' DAY TO MINUTE+AsDate FROM TIMES;
SELECT INTERVAL '1 01:10' DAY TO MINUTE+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '1 1' DAY TO HOUR+AsTimeStamp FROM TIMES;
SELECT INTERVAL '1 1' DAY TO HOUR+AsDate FROM TIMES;
SELECT INTERVAL '1 1' DAY TO HOUR+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '10' DAY+AsTimeStamp FROM TIMES;
SELECT INTERVAL '10' DAY+AsDate FROM TIMES;
SELECT INTERVAL '10' DAY+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '3:05' HOUR TO MINUTE+AsTimeStamp FROM TIMES;
SELECT INTERVAL '3:05' HOUR TO MINUTE+AsDate FROM TIMES;
SELECT INTERVAL '3:05' HOUR TO MINUTE+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '5' HOUR+AsTimeStamp FROM TIMES;
SELECT INTERVAL '5' HOUR+AsDate FROM TIMES;
SELECT INTERVAL '5' HOUR+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '5:10' MINUTE TO SECOND+AsTimeStamp FROM TIMES;
SELECT INTERVAL '5:10' MINUTE TO SECOND+AsDate FROM TIMES;
SELECT INTERVAL '5:10' MINUTE TO SECOND+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '30' MINUTE+AsTimeStamp FROM TIMES;
SELECT INTERVAL '30' MINUTE+AsDate FROM TIMES;
SELECT INTERVAL '30' MINUTE+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '333' HOUR(3)+AsTimeStamp FROM TIMES;
SELECT INTERVAL '333' HOUR(3)+AsDate FROM TIMES;
SELECT INTERVAL '333' HOUR(3)+Unknown FROM UNKNOWN_TABLE;
SELECT INTERVAL '15.6789' SECOND(2,3)+AsTimeStamp FROM TIMES;
SELECT INTERVAL '15.6789' SECOND(2,3)+AsDate FROM TIMES;
SELECT INTERVAL '15.6789' SECOND(2,3)+Unknown FROM UNKNOWN_TABLE;
```

#### Snowflake

```sql
SELECT AsTimeStamp + INTERVAL '1y, 1mm' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '1y, 1mm' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '2y, 1mm' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '2y, 1mm' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '1 month' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '1 month' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '2 month' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '2 month' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '1d, 01h, 10m' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '1d, 01h, 10m' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '1d, 1h' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '1d, 1h' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '10 day' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '10 day' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '3h, 05m' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '3h, 05m' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '5 hour' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '5 hour' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '5m, 10s' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '5m, 10s' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '30 minute' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '30 minute' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '333 hour' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '333 hour' FROM PUBLIC.TIMES;
SELECT AsTimeStamp + INTERVAL '15.6789 second' FROM PUBLIC.TIMES;
SELECT AsTimeStamp - INTERVAL '15.6789 second' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '1y, 1mm' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '1y, 1mm' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '2y, 1mm' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '2y, 1mm' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '1 month' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '1 month' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '2 month' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '2 month' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '1d, 01h, 10m' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '1d, 01h, 10m' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '1d, 1h' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '1d, 1h' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '10 day' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '10 day' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '3h, 05m' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '3h, 05m' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '5 hour' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '5 hour' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '5m, 10s' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '5m, 10s' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '30 minute' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '30 minute' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '333 hour' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '333 hour' FROM PUBLIC.TIMES;
SELECT AsDate + INTERVAL '15.6789 second' FROM PUBLIC.TIMES;
SELECT AsDate - INTERVAL '15.6789 second' FROM PUBLIC.TIMES;
SELECT Unknown + INTERVAL '1y, 1mm' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '1y, 1mm' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '2y, 1mm' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '2y, 1mm' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '1 month' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '1 month' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '2 month' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '2 month' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '1d, 01h, 00m, 00s, 222ms' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '1d, 01h, 10m' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '1d, 01h, 10m' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '1d, 1h' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '1d, 1h' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '10 day' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '10 day' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '3h, 05m' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '3h, 05m' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '5 hour' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '5 hour' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '5m, 10s' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '5m, 10s' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '30 minute' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '30 minute' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '333 hour' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '333 hour' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown + INTERVAL '15.6789 second' FROM PUBLIC.UNKNOWN_TABLE;
SELECT Unknown - INTERVAL '15.6789 second' FROM PUBLIC.UNKNOWN_TABLE;
```

#### Snowflake UDF

```sql
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''1'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''1'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''2'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''2'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''10'' DAY') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''10'' DAY') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''5'' HOUR') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''5'' HOUR') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''30'' MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''30'' MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsTimeStamp,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsTimeStamp,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''1'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''1'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''2'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''2'' MONTH') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''10'' DAY') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''10'' DAY') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''5'' HOUR') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''5'' HOUR') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''30'' MINUTE') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''30'' MINUTE') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(AsDate,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.TIMES;
SELECT DATEDIFF_UDF(AsDate,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.TIMES;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''1-1'' YEAR(2) TO MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''2-1'' YEAR(4) TO MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''1'' MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''1'' MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''2'' MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''2'' MONTH') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''1 01:10'' DAY TO MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''1 1'' DAY TO HOUR') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''10'' DAY') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''10'' DAY') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''3:05'' HOUR TO MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''5'' HOUR') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''5'' HOUR') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''5:10'' MINUTE TO SECOND') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''30'' MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''30'' MINUTE') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''333'' HOUR(3)') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEADD_UDF(UnKnown,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.UNKNOWN_TABLE;
SELECT DATEDIFF_UDF(UnKnown,'INTERVAL ''15.6789'' SECOND(2,3)') FROM PUBLIC.UNKNOWN_TABLE;
```

#### Results

| Oracle | Snowflake Operation | UDF |
| --- | --- | --- |
| 2022-12-05 11:00:00.000 | 2022-12-05 11:00:00.000 | 2022-12-05 11:00:00.000 |
| 2020-10-05 11:00:00.000 | 2020-10-05 11:00:00.000 | 2020-10-05 11:00:00.000 |
| 2023-12-05 11:00:00.000 | 2023-12-05 11:00:00.000 | 2023-12-05 11:00:00.000 |
| 2019-10-05 11:00:00.000 | 2019-10-05 11:00:00.000 | 2019-10-05 11:00:00.000 |
| 2021-12-05 11:00:00.000 | 2021-12-05 11:00:00.000 | 2021-12-05 11:00:00.000 |
| 2021-10-05 11:00:00.000 | 2021-10-05 11:00:00.000 | 2021-10-05 11:00:00.000 |
| 2022-01-05 11:00:00.000 | 2022-01-05 11:00:00.000 | 2022-01-05 11:00:00.000 |
| 2021-09-05 11:00:00.000 | 2021-09-05 11:00:00.000 | 2021-09-05 11:00:00.000 |
| 2021-11-06 12:00:00.222 | 2021-11-06 12:00:00.222 | 2021-11-06 12:00:00.222 |
| 2021-11-04 09:59:59.778 | 2021-11-04 09:59:59.778 | 2021-11-04 09:59:59.778 |
| 2021-11-06 12:10:00.000 | 2021-11-06 12:10:00.000 | 2021-11-06 12:10:00.000 |
| 2021-11-04 09:50:00.000 | 2021-11-04 09:50:00.000 | 2021-11-04 09:50:00.000 |
| 2021-11-06 12:00:00.000 | 2021-11-06 12:00:00.000 | 2021-11-06 12:00:00.000 |
| 2021-11-04 10:00:00.000 | 2021-11-04 10:00:00.000 | 2021-11-04 10:00:00.000 |
| 2021-11-15 11:00:00.000 | 2021-11-15 11:00:00.000 | 2021-11-15 11:00:00.000 |
| 2021-10-26 11:00:00.000 | 2021-10-26 11:00:00.000 | 2021-10-26 11:00:00.000 |
| 2021-11-05 14:05:00.000 | 2021-11-05 14:05:00.000 | 2021-11-05 14:05:00.000 |
| 2021-11-05 07:55:00.000 | 2021-11-05 07:55:00.000 | 2021-11-05 07:55:00.000 |
| 2021-11-05 16:00:00.000 | 2021-11-05 16:00:00.000 | 2021-11-05 16:00:00.000 |
| 2021-11-05 06:00:00.000 | 2021-11-05 06:00:00.000 | 2021-11-05 06:00:00.000 |
| 2021-11-05 11:05:10.000 | 2021-11-05 11:05:10.000 | 2021-11-05 11:05:10.000 |
| 2021-11-05 10:54:50.000 | 2021-11-05 10:54:50.000 | 2021-11-05 10:54:50.000 |
| 2021-11-05 11:30:00.000 | 2021-11-05 11:30:00.000 | 2021-11-05 11:30:00.000 |
| 2021-11-05 10:30:00.000 | 2021-11-05 10:30:00.000 | 2021-11-05 10:30:00.000 |
| 2021-11-19 08:00:00.000 | 2021-11-19 08:00:00.000 | 2021-11-19 08:00:00.000 |
| 2021-10-22 14:00:00.000 | 2021-10-22 14:00:00.000 | 2021-10-22 14:00:00.000 |
| 2021-11-05 11:00:15.679 | 2021-11-05 11:00:16.000 | 2021-11-05 11:00:15.678 |
| 2021-11-05 10:59:44.321 | 2021-11-05 10:59:44.000 | 2021-11-05 11:00:15.678 |
| 2022-12-06 00:00:00.000 | 2022-12-06 | 2022-12-06 |
| 2020-10-06 00:00:00.000 | 2020-10-06 | 2020-10-06 |
| 2023-12-06 00:00:00.000 | 2023-12-06 | 2023-12-06 |
| 2019-10-06 00:00:00.000 | 2019-10-06 | 2019-10-06 |
| 2021-12-06 00:00:00.000 | 2021-12-06 | 2021-12-06 |
| 2021-12-06 00:00:00.000 | 2021-10-06 | 2021-10-06 |
| 2022-01-06 00:00:00.000 | 2022-01-06 | 2022-01-06 |
| 2021-09-06 00:00:00.000 | 2021-09-06 | 2021-09-06 |
| 2021-11-07 01:00:00.000 | 2021-11-07 01:00:00.222 | 2021-11-07 |
| 2021-11-04 22:59:59.000 | 2021-11-04 22:59:59.778 | 2021-11-04 |
| 2021-11-07 01:10:00.000 | 2021-11-07 01:10:00.000 | 2021-11-07 |
| 2021-11-04 22:50:00.000 | 2021-11-04 22:50:00.000 | 2021-11-04 |
| 2021-11-07 01:00:00.000 | 2021-11-07 01:00:00.000 | 2021-11-07 |
| 2021-11-04 23:00:00.000 | 2021-11-04 23:00:00.000 | 2021-11-04 |
| 2021-11-16 00:00:00.000 | 2021-11-16 | 2021-11-16 |
| 2021-10-27 00:00:00.000 | 2021-10-27 | 2021-10-27 |
| 2021-11-06 03:05:00.000 | 2021-11-06 03:05:00.000 | 2021-11-06 |
| 2021-11-05 20:55:00.000 | 2021-11-05 20:55:00.000 | 2021-11-05 |
| 2021-11-06 05:00:00.000 | 2021-11-06 05:00:00.000 | 2021-11-06 |
| 2021-11-05 19:00:00.000 | 2021-11-05 19:00:00.000 | 2021-11-05 |
| 2021-11-06 00:05:10.000 | 2021-11-06 00:05:10.000 | 2021-11-06 |
| 2021-11-05 23:54:50.000 | 2021-11-05 23:54:50.000 | 2021-11-05 |
| 2021-11-06 00:30:00.000 | 2021-11-06 00:30:00.000 | 2021-11-06 |
| 2021-11-05 23:30:00.000 | 2021-11-05 23:30:00.000 | 2021-11-05 |
| 2021-11-19 21:00:00.000 | 2021-11-19 21:00:00.000 | 2021-11-19 |
| 2021-10-23 03:00:00.000 | 2021-10-23 03:00:00.000 | 2021-10-23 |
| 2021-11-06 00:00:15.000 | 2021-11-06 00:00:16.000 | 2021-11-06 |
| 2021-11-05 23:59:44.000 | 2021-11-05 23:59:44.000 | 2021-11-05 |
| 2010-11-01 12:00:00.000 | 2010-11-01 12:00:00.000 | 2010-11-01 12:00:00.000 |
| 2008-09-01 12:00:00.000 | 2008-09-01 12:00:00.000 | 2008-09-01 12:00:00.000 |
| 2011-11-01 12:00:00.000 | 2011-11-01 12:00:00.000 | 2011-11-01 12:00:00.000 |
| 2007-09-01 12:00:00.000 | 2007-09-01 12:00:00.000 | 2007-09-01 12:00:00.000 |
| 2009-11-01 12:00:00.000 | 2009-11-01 12:00:00.000 | 2009-11-01 12:00:00.000 |
| 2009-09-01 12:00:00.000 | 2009-09-01 12:00:00.000 | 2009-09-01 12:00:00.000 |
| 2009-12-01 12:00:00.000 | 2009-12-01 12:00:00.000 | 2009-12-01 12:00:00.000 |
| 2009-08-01 12:00:00.000 | 2009-08-01 12:00:00.000 | 2009-08-01 12:00:00.000 |
| 2009-10-02 13:00:00.222 | 2009-10-02 13:00:00.222 | 2009-10-02 13:00:00.222 |
| 2009-09-30 10:59:59.778 | 2009-09-30 10:59:59.778 | 2009-09-30 10:59:59.778 |
| 2009-10-02 13:10:00.000 | 2009-10-02 13:10:00.000 | 2009-10-02 13:10:00.000 |
| 2009-09-30 10:50:00.000 | 2009-09-30 10:50:00.000 | 2009-09-30 10:50:00.000 |
| 2009-10-02 13:00:00.000 | 2009-10-02 13:00:00.000 | 2009-10-02 13:00:00.000 |
| 2009-09-30 11:00:00.000 | 2009-09-30 11:00:00.000 | 2009-09-30 11:00:00.000 |
| 2009-10-11 12:00:00.000 | 2009-10-11 12:00:00.000 | 2009-10-11 12:00:00.000 |
| 2009-09-21 12:00:00.000 | 2009-09-21 12:00:00.000 | 2009-09-21 12:00:00.000 |
| 2009-10-01 15:05:00.000 | 2009-10-01 15:05:00.000 | 2009-10-01 15:05:00.000 |
| 2009-10-01 08:55:00.000 | 2009-10-01 08:55:00.000 | 2009-10-01 08:55:00.000 |
| 2009-10-01 17:00:00.000 | 2009-10-01 17:00:00.000 | 2009-10-01 17:00:00.000 |
| 2009-10-01 07:00:00.000 | 2009-10-01 07:00:00.000 | 2009-10-01 07:00:00.000 |
| 2009-10-01 12:05:10.000 | 2009-10-01 12:05:10.000 | 2009-10-01 12:05:10.000 |
| 2009-10-01 11:54:50.000 | 2009-10-01 11:54:50.000 | 2009-10-01 11:54:50.000 |
| 2009-10-01 12:30:00.000 | 2009-10-01 12:30:00.000 | 2009-10-01 12:30:00.000 |
| 2009-10-01 11:30:00.000 | 2009-10-01 11:30:00.000 | 2009-10-01 11:30:00.000 |
| 2009-10-15 09:00:00.000 | 2009-10-15 09:00:00.000 | 2009-10-15 09:00:00.000 |
| 2009-09-17 15:00:00.000 | 2009-09-17 15:00:00.000 | 2009-09-17 15:00:00.000 |
| 2009-10-01 12:00:15.679 | 2009-10-01 12:00:16.000 | 2009-10-01 12:00:15.678 |
| 2009-10-01 11:59:44.321 | 2009-10-01 11:59:44.000 | 2009-10-01 11:59:44.321 |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-OR0042](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior

## PL SQL Data Types

## BINARY_INTEGER Data Type

This data type is identical to the PLS_INTEGER data type.

## PLS_INTEGER Data Type

### Description

> The `PLS_INTEGER` data type stores signed integers in the range -2,147,483,648 through 2,147,483,647, represented in 32 bits. ([Oracle Language Reference PLS_INTEGER Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-data-types.html#GUID-9517B7AC-9CEA-4C36-A454-52588BEEBE8F))

The `PLS_INTEGER` datatype is transformed to `NUMBER`. This transformation also applies for each `PLS_INTEGER` subtype:

* `NATURAL`
* `NATURALN`
* `POSITIVE`
* `POSITIVEN`
* `SIGNTYPE`
* `SIMPLE_INTEGER`

> **Warning:**
>
> Some of these subtypes are currently not recognized by SnowConvert AI so they are converted to `VARIANT` and considered user-defined types. There is already a work item to fix the issue.

### Sample Source Patterns

Please, consider the following table and its inserts for the examples below:

#### Code

```sql
CREATE TABLE PLS_INTEGER_TABLE(
	COL NUMBER
);
```

#### PLS_INTEGER usage in procedural blocks

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PLS_INTEGER_EXAMPLE
IS
-- PLS_INTEGER AND BINARY INTEGER ALIASES
PLS_INTEGER_VAR PLS_INTEGER;
BINARY_INTEGER_VAR BINARY_INTEGER;

NUMBER_VAR NUMBER;
BEGIN
	NUMBER_VAR := 2;

	-- maximum possible value
	PLS_INTEGER_VAR := 2147483647;

	-- implicit cast to number
	INSERT INTO PLS_INTEGER_TABLE (COL) VALUES (PLS_INTEGER_VAR);
	PLS_INTEGER_VAR := 2147483647;

	-- operations with other numeric expressions
	INSERT INTO PLS_INTEGER_TABLE (COL) VALUES (PLS_INTEGER_VAR + 1);
	INSERT INTO PLS_INTEGER_TABLE (COL) VALUES (PLS_INTEGER_VAR + NUMBER_VAR);
END;

CALL PLS_INTEGER_EXAMPLE();
SELECT * FROM PLS_INTEGER_TABLE;
```

##### Result

| COL |
| --- |
| 2147483647 |
| 2147483648 |
| 2147483649 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PLS_INTEGER_EXAMPLE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		-- PLS_INTEGER AND BINARY INTEGER ALIASES
		PLS_INTEGER_VAR NUMBER;
		BINARY_INTEGER_VAR NUMBER;

		NUMBER_VAR NUMBER(38, 18);
	BEGIN
		NUMBER_VAR := 2;
	-- maximum possible value
		PLS_INTEGER_VAR := 2147483647;

		-- implicit cast to number
		INSERT INTO PLS_INTEGER_TABLE(COL) VALUES (:PLS_INTEGER_VAR);
		PLS_INTEGER_VAR := 2147483647;

	-- operations with other numeric expressions
	INSERT INTO PLS_INTEGER_TABLE(COL) VALUES (:PLS_INTEGER_VAR + 1);
	INSERT INTO PLS_INTEGER_TABLE(COL) VALUES (:PLS_INTEGER_VAR + :NUMBER_VAR);
	END;
$$;

CALL PLS_INTEGER_EXAMPLE();

SELECT * FROM
	PLS_INTEGER_TABLE;
```

##### Result

| COL |
| --- |
| 2147483647 |
| 2147483648 |
| 2147483649 |

### Known Issues

#### 1. Storage and performance features were not preserved

Oracle `PLS_INTEGER` has some advantages in terms of storage size and performance in arithmetic operations. These features were not emulated because Snowflake `NUMBER` does not have them. For more information, check the [PLS_INTEGER documentation.](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-data-types.html#GUID-9517B7AC-9CEA-4C36-A454-52588BEEBE8F)

### Related EWIs

No related EWIs.

## Character Data Types

> Character data types store character (alphanumeric) data, which are words and free-form text, in the database character set or national character set. ([Oracle SQL Language Reference Character Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-1BABC478-FB47-4962-9B0C-8B8BD059E733))

## CHAR Data type

### Description

> The `CHAR` data type specifies a **fixed**-length character string in the database character set.([Oracle SQL Language Reference CHAR Data type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-85E0A0DD-9E90-4AE1-9AD5-93C89FDCFC49))

As denoted in the Oracle documentation, size in CHAR data type is a length constraint and should not be confused with capacity. Total characters that can be stored in a CHAR may vary according to the database character set and configuration, but commonly the maximum size allowed is 2000.

In Snowflake, CHAR types are synonymous with VARCHAR, and as you can check here:

[Snowflake SQL Language reference text data types](https://docs.snowflake.com/en/sql-reference/data-types-text.html#varchar)

The standard maximum size is quite bigger. But, this doesn’t mean that a Snowflake VARCHAR will consume more storage, as mentioned in their documentation:

> A 1-character string in a VARCHAR(16777216) column only consumes a single character.

```sql
CHAR [ (size [ BYTE | CHAR ]) ]
```

### Sample Source Patterns

#### Char data types in Create Table

##### Oracle

```sql
CREATE TABLE char_data_types
(
	char_column1 CHAR,
	char_column2 CHAR(15),
	char_column3 CHAR(15 BYTE),
	char_column4 CHAR(15 CHAR)
);

INSERT INTO char_data_types VALUES ('H', 'Hello world', 'Hello world', 'Hello world');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE char_data_types
(
	char_column1 CHAR,
	char_column2 CHAR(15),
	char_column3 CHAR(15),
	char_column4 CHAR(15)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO char_data_types
VALUES ('H', 'Hello world', 'Hello world', 'Hello world');
```

#### Retrieving data from char columns

##### Oracle

```sql
SELECT * FROM char_data_types;
```

##### Result

| CHAR_COLUMN1 | CHAR_COLUMN2 | CHAR_COLUMN3 | CHAR_COLUMN4 |
| --- | --- | --- | --- |
| H | Hello world | Hello world | Hello world |

##### Snowflake

```sql
SELECT * FROM
char_data_types;
```

##### Result

| CHAR_COLUMN1 | CHAR_COLUMN2 | CHAR_COLUMN3 | CHAR_COLUMN4 |
| --- | --- | --- | --- |
| H | Hello world | Hello world | Hello world |

> **Note:**
>
> In Oracle, the value is filled with empty spaces to fit the fixed size determined in the column definition. On the other hand, Snowflakes uses dynamic size (keeping the length restriction) to store the value.

#### Checking internal data types for CHAR

As mentioned in the beginning, Snowflake internally uses a VARCHAR for the CHAR type columns, we can confirm it by describing the tables:

##### Oracle

##### Snowflake

> **Note:**
>
> The length restriction is preserved, but the memory that the columns are using is different on each DBMS.

#### Retrieving the size in bytes of each column:

##### Oracle

```sql
SELECT
LENGTHB(char_column1),
LENGTHB(char_column2),
LENGTHB(char_column3),
LENGTHB(char_column4)
FROM char_data_types;
```

##### Result

| LENGTHB(CHAR_COLUMN1) | LENGTHB(CHAR_COLUMN2) | LENGTHB(CHAR_COLUMN3) | LENGTHB(CHAR_COLUMN4) |
| --- | --- | --- | --- |
| 1 | 15 | 15 | 15 |

##### Snowflake

```sql
SELECT
OCTET_LENGTH(char_column1) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(char_column2) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(char_column3) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(char_column4) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/
FROM
char_data_types;
```

##### Result

| OCTET_LENGTH(CHAR_COLUMN1) | OCTET_LENGTH(CHAR_COLUMN2) | OCTET_LENGTH(CHAR_COLUMN3) | OCTET_LENGTH(CHAR_COLUMN4) |
| --- | --- | --- | --- |
| 1 | 11 | 11 | 11 |

> **Note:**
>
> Besides these slight differences, the integrity of the data is preserved.

### Known Issues

**1. Results obtained from some built-in functions may vary**

As explained in the previous section, there may be cases using built-in functions over the columns that may retrieve different results. For example, get the length of a column.

### Related EWIs

1. [SSC-FDM-OR0015](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): LENGTHB transformed to OCTET_LENGTH.

## NCHAR Data Type

### Description

> The NCHAR data type specifies a **fixed**-length character string in the national character set. ([Oracle SQL Language Reference NCHAR](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-FE15E51B-52C6-45D7-9883-4DF47716A17D))

NCHAR allows to store special characters with their Unicode to be preserved across any usage, these special characters may need more bits to be stored and that is why, by default, the NCHAR character set is `AL16UTF16`, contrary to the common character data set for CHAR which is usually `AL32UTF8`.

NCHAR is preserved as NCHAR in Snowflake, but, in the background, Snowflake uses VARCHAR. Transformation information related to CHAR is also valid for NCHAR.

```none
NCHAR [ (size) ]
```

### Sample Souce Patterns

#### Nchar data types in Create Table

##### Oracle

```sql
CREATE TABLE nchar_data_types
(
	nchar_column1 NCHAR,
	nchar_column2 NCHAR(5)
);

INSERT INTO nchar_data_types VALUES ('ភ', 'ភាសាខ');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE nchar_data_types
(
	nchar_column1 NCHAR,
	nchar_column2 NCHAR(5)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO nchar_data_types
VALUES ('ភ', 'ភាសាខ');
```

> **Note:**
>
> In Oracle, trying to insert these values in a CHAR column with the same size, will trigger an error: *value too large for column*.

#### Retrieving information from Nchar columns

##### Oracle

```sql
SELECT * FROM nchar_data_types;
```

##### Result

| NCHAR_COLUMN1 | NCHAR_COLUMN2 |
| --- | --- |
| ភ | ភាសាខ |

##### Snowflake

```sql
SELECT * FROM
nchar_data_types;
```

##### Result

| NCHAR_COLUMN1 | NCHAR_COLUMN2 |
| --- | --- |
| ភ | ភាសាខ |

#### Retrieving the size in bytes of each column

##### Oracle

```sql
SELECT
LENGTHB(nchar_column1),
LENGTHB(nchar_column2)
FROM nchar_data_types;
```

##### Result

| LENGTHB(NCHAR_COLUMN1) | LENGTHB(NCHAR_COLUMN2) |
| --- | --- |

```none
                 2|                    10|
```

##### Snowflake

```sql
SELECT
OCTET_LENGTH(nchar_column1) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(nchar_column2) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/
FROM
nchar_data_types;
```

##### Result

| OCTET_LENGTH(NCHAR_COLUMN1) | OCTET_LENGTH(NCHAR_COLUMN2) |
| --- | --- |

```none
                      3|                         15|
```

Note that the number specified in the column declaration is the size in characters and not in bytes, That is why we see more space used to store those special characters.

> **Note:**
>
> In Snowflake, VARCHAR uses UTF-8, size can vary depending on the Unicode character that can be represented in 1, 2, 3, or 4 bytes. In this case, the Cambodian character is using 3 bytes to be stored.

> **Note:**
>
> Besides these slight differences, the integrity of the data is preserved.

### Known Issues

**1. Results obtained from some built-in functions may vary**

As explained in the previous section, there may be cases using built-in functions over the columns that may retrieve different results. For example, get the length of a column.

### Related EWIs

1. [SSC-FDM-OR0015](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): LENGTHB transformed to OCTET_LENGTH.

## NVARCHAR2 Data Type

### Description

> The `NVARCHAR2` data type specifies a variable-length character string in the national character set. ([Oracle SQL Language Reference NVARCHAR2](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-DF7E10FC-A461-4325-A295-3FD4D150809E))

```none
NVARCHAR2 (size)
```

NVARCHAR2 allows to store special characters with their Unicode to be preserved across any usage, these special characters may need more bits to be stored and that is why, by default, the NVARCHAR2 character set is `AL16UTF16`, contrary to the common character data set for VARCHAR2 which is usually `AL32UTF8`.

NVARCHAR transformed to Snowflake VARCHAR, Transformation information related to VARCHAR2, is also valid for NVARCHAR2.

```none
NVARCHAR2 (size)
```

### Sample Souce Patterns

#### Nvarchar2 data type in Create Table

##### Oracle

```sql
CREATE TABLE nvarchar2_data_types
(
	nvarchar2_column NVARCHAR2 (5)
);

INSERT INTO nvarchar2_data_types VALUES ('ភាសាខ');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE nvarchar2_data_types
	(
		nvarchar2_column VARCHAR(5)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO nvarchar2_data_types
	VALUES ('ភាសាខ');
```

> **Note:**
>
> In Oracle, trying to insert these values in a VARCHAR2 column with the same size, will trigger an error: *value too large for column*.

#### Retrieving information from Nchar columns

##### Oracle

```sql
SELECT * FROM nvarchar2_data_types;
```

##### Result

| NVARCHAR2_COLUMN |
| --- |
| ភាសាខ |

##### Snowflake

```sql
SELECT * FROM
nvarchar2_data_types;
```

##### Result

| NVARCHAR2_COLUMN |
| --- |
| ភាសាខ |

#### Retrieving the size in bytes of each column

##### Oracle

```sql
SELECT
LENGTHB(nvarchar2_column)
FROM nvarchar2_data_types;
```

##### Result

| LENGTHB(NVARCHAR2_COLUMN) |
| --- |
| 10 |

##### Snowflake

```sql
SELECT
OCTET_LENGTH(nvarchar2_column) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/
FROM
nvarchar2_data_types;
```

##### Result

| OCTET_LENGTH(NVARCHAR2_COLUMN) |
| --- |
| 15 |

Note that the number specified in the column declaration is the size in characters and not in bytes, That is why we see more space used to store those special characters.

> **Note:**
>
> In Snowflake, VARCHAR uses UTF-8, size can vary depending on the Unicode character that can be represented in 1, 2, 3, or 4 bytes. In this case, the Cambodian characters are using 3 bytes to be stored.

> **Note:**
>
> Besides these slight differences, the integrity of the data is preserved.

### Known Issues

**1. Results obtained from some built-in functions may vary**

As explained in the previous section, there may be cases using built-in functions over the columns that may retrieve different results. For example, get the length of a column.

### Related EWIs

1. [SSC-FDM-OR0015](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): LENGTHB transformed to OCTET_LENGTH.

## VARCHAR Data Type

### Description

Oracle recommends using VARCHAR2 instead of VARCHAR as explained in their documentation:

[Oracle SQL Language reference Varchar](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-DF7E10FC-A461-4325-A295-3FD4D150809E)

Even though, the syntaxis is parsed and transformed using the [ANSI, DB2, and SQL/DS Data Types.](README.md)

## VARCHAR2 Data Type

### Description

> The `VARCHAR2` data type specifies a **variable**-length character string in the database character set. ([Oracle SQL Language Reference VARCHAR2](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-0DC7FFAA-F03F-4448-8487-F2592496A510))

As denoted in the Oracle documentation, size in VARCHAR2 data type is a length constraint and should not be confused with capacity. Total characters that can be stored in a VARCHAR2 may vary according to the database character set and configuration, but commonly the maximum size allowed is 4000.

VARCHAR2 is translated to Snowflake VARCHAR which can store a bigger number of bytes/characters by default. Either way, the memory used is variable using the size of the value stored in the column as same as in Oracle.

```sql
VARCHAR2 (size [ BYTE | CHAR ])
```

### Sample Source Patterns

#### Varchar2 data types in Create Table

##### Oracle

```sql
CREATE TABLE varchar2_data_types
(
	varchar2_column1 VARCHAR2(5),
	varchar2_column2 VARCHAR2(5 BYTE),
	varchar2_column3 VARCHAR2(5 CHAR)
);

INSERT INTO varchar2_data_types VALUES ('H', 'Hello', 'Hell');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE varchar2_data_types
	(
		varchar2_column1 VARCHAR(5),
		varchar2_column2 VARCHAR(5),
		varchar2_column3 VARCHAR(5)
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO varchar2_data_types
	VALUES ('H', 'Hello', 'Hell');
```

#### Retrieving data from varchar columns

##### Oracle

```sql
SELECT * FROM varchar2_data_types;
```

###### Result

| VARCHAR2_COLUMN1 | VARCHAR2_COLUMN2 | VARCHAR2_COLUMN3 |
| --- | --- | --- |
| H | Hello | Hell |

##### Snowflake

```sql
SELECT * FROM
varchar2_data_types;
```

###### Result

| VARCHAR2_COLUMN1 | VARCHAR2_COLUMN2 | VARCHAR2_COLUMN3 |
| --- | --- | --- |
| H | Hello | Hell |

#### Reviewing the variable size in the columns

##### Oracle

```sql
SELECT
LENGTHB(varchar2_column1),
LENGTHB(varchar2_column2),
LENGTHB(varchar2_column3)
FROM VARCHAR2_DATA_TYPES;
```

###### Result

| LENGTHB(VARCHAR2_COLUMN1) | LENGTHB(VARCHAR2_COLUMN2) | LENGTHB(VARCHAR2_COLUMN3) |
| --- | --- | --- |
| 1 | 5 | 4 |

##### Snowflake

```sql
SELECT
OCTET_LENGTH(varchar2_column1) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(varchar2_column2) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/,
OCTET_LENGTH(varchar2_column3) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/
FROM
VARCHAR2_DATA_TYPES;
```

###### Result

| OCTET_LENGTH(VARCHAR2_COLUMN1) | OCTET_LENGTH(VARCHAR2_COLUMN2) | OCTET_LENGTH(VARCHAR2_COLUMN3) |
| --- | --- | --- |
| 1 | 5 | 4 |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-OR0015](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): LENGTHB transformed to OCTET_LENGTH.

## LOB Data Types

### Description

> The built-in LOB data types `BLOB`, `CLOB`, and `NCLOB` (stored internally) and `BFILE` (stored externally) can store large and unstructured data such as text, image, video, and spatial data. ([Oracle SQL Language Reference LOB Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-1A71C635-188E-4EC9-B821-1DBEC2B45451))

```sql
BFILE
BLOB
CLOB
NCLOB
```

> **Warning:**
>
> LOB data types are **not supported** in Snowflake. Per [Snowflake’s documentation](https://docs.snowflake.com/en/sql-reference/data-types-unsupported.html), it is recommended to transform `CLOB` to `VARCHAR`, and `BLOB` to `BINARY`, however, there are several limitations.
> {% endhint %}
>
> > **Warning:**
> >
> > LOB properties for tables are also **not supported** in Snowflake.
> > {% endhint %}
> >
> > ## BFILE Data Type
> >
> > ### Description
> >
> > > Contains a locator to a large binary file stored outside the database. Enables byte stream I/O access to external LOBs residing on the database server. A `BFILE` column or attribute stores a `BFILE` locator, which serves as a pointer to a binary file on the server file system. The locator maintains the directory name and the filename. ([Oracle SQL Language Reference BFILE Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-3D9CC018-1637-45CB-95CF-DE67319D1A54)).
> >
> > > **Warning:**
> > >
> > > BFILE Data Type is **not supported** in Snowflake. VARCHAR is used instead.

### Sample Source Patterns

#### Bfile data type in Create Table

> **Warning:**
>
> Oracle `BFILE` columns are used to store a locator with the directory and filename. They are changed to Snowflake `VARCHAR` to store the directory and filename into the column. However, loading the content of the file must be done manually.

##### Oracle

```sql
--Create Table
CREATE TABLE bfile_table
(
    col1 BFILE
);

--Insert Bfilename
INSERT INTO bfile_table VALUES (
    BFILENAME('mydirectory', 'myfile.png')
);

--Select
SELECT * FROM bfile_table;
```

##### Result

| COL1 |
| --- |
| [BFILE:myfile.png] |

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE bfile_table
    (
        col1
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0105 - ADDITIONAL WORK IS NEEDED FOR BFILE COLUMN USAGE. BUILD_STAGE_FILE_URL FUNCTION IS A RECOMMENDED WORKAROUND ***/!!!
    VARCHAR
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--Insert Bfilename
INSERT INTO bfile_table
VALUES (PUBLIC.BFILENAME_UDF('mydirectory', 'myfile.png')
);

--Select
SELECT * FROM
    bfile_table;
```

##### Result

| COL1 |
| --- |
| mydirectory\myfile.png |

> **Warning:**
>
> UDF added to replace `BFILENAME()`.

**UDF Added**

```sql
CREATE OR REPLACE FUNCTION PUBLIC.BFILENAME_UDF (DIRECTORYNAME STRING, FILENAME STRING)
RETURNS STRING
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DIRECTORYNAME || '\\' || FILENAME
$$;
```

### Known Issues

#### 1. No access to the DBMS_LOB built-in package

Since LOB data types are not supported in Snowflake there is no equivalent for the `DBMS_LOB` functions and there are no implemented workarounds yet.

### Related EWIs

1. [SSC-EWI-OR0105](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Additional work is needed for BFILE column usage. BUILD_STAGE_URL function is a recommended workaround.

## BLOB Data Type

### Description

> The `BLOB` data type stores unstructured binary large objects. `BLOB` objects can be thought of as bitstreams with no character set semantics. ([Oracle SQL Language Reference BLOB Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-4570CDFD-8F91-44B9-BE7F-13076AA2AEBF)).

> **Warning:**
>
> BLOB Data Type is **not supported** in Snowflake. BINARY is used instead.

### Sample Source Patterns

#### BLOB in Create Table

##### Oracle

```sql
CREATE TABLE blobtable( blob_column BLOB, empty_column BLOB );

INSERT INTO blobtable VALUES (NULL, EMPTY_BLOB());
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE blobtable ( blob_column BINARY,
empty_column BINARY
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO blobtable
VALUES (NULL, TO_BINARY(' '));
```

#### Retrieving Data

##### Oracle

```sql
SELECT * FROM blobtable;
```

##### Result

| BLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| [NULL] | [BLOB] |

##### Snowflake

```sql
SELECT * FROM
blobtable;
```

##### Result

| BLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| NULL |  |

#### Functional Example

> **Warning:**
>
> This example **is not a translation** of SnowConvert AI, it is only used to show the functional equivalence between Oracle `BLOB` and Snowflake `BINARY`

> **Warning:**
>
> We are using “`utl_raw.cast_to_raw`” and “`DBMS_LOB.SUBSTR`” functions. The conversion for these functions is currently **not supported** by SnowConvert.

##### Oracle

```sql
INSERT INTO blobtable VALUES(
utl_raw.cast_to_raw('hello world'), EMPTY_BLOB());

SELECT DBMS_LOB.SUBSTR(blob_column) AS result
FROM blobtable;
```

##### Result

| RESULT |
| --- |
| [NULL] |
| hello world |

##### Snowflake

```sql
INSERT INTO blobtable
VALUES(
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'utl_raw.cast_to_raw' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS cast_to_raw, TO_BINARY(' '));

SELECT
SUBSTR(blob_column, 1) AS result
FROM
blobtable;
```

##### Result

| RESULT |
| --- |
| [NULL] |
| hello world |

### Known Issues

#### 1. The difference in max length BLOB (Oracle) and BINARY (Snowflake)

An [Oracle BLOB](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-4570CDFD-8F91-44B9-BE7F-13076AA2AEBF) column’s maximum size is **(4 gigabytes - 1) \* (database block size)**, but [Snowflake BINARY](https://docs.snowflake.com/en/sql-reference/data-types-text.html#binary) is limited to **8MB**.

##### 2. Empty value with EMPTY_BLOB

Initializing a column using `EMPTY_BLOB()` will return an empty LOB locator. While after translation the column will return a string with ‘ ‘.

##### 3. No access to the DBMS_LOB built-in package

Since LOB data types are not supported in Snowflake there is no equivalent for the `DBMS_LOB` functions and there are no implemented workarounds yet.

### Related EWIs

1. [SSC-EWI-OR0076](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Built In Package Not Supported.

## CLOB Data Type

### Description

> A character large object containing single-byte or multibyte characters. Both fixed-width and variable-width character sets are supported, both using the database character set. ([Oracle SQL Language Reference CLOB Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-0EAC5929-0674-429C-AF42-2D454C982F8F)).

> **Warning:**
>
> CLOB Data Type is **not supported** in Snowflake. VARCHAR is used instead.

### Sample Source Patterns

#### CLOB in Create Table

##### Oracle

```sql
CREATE TABLE clobtable ( clob_column CLOB, empty_column CLOB );

INSERT INTO clobtable VALUES ( 'THIS IS A TEST', EMPTY_CLOB() );
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE clobtable ( clob_column VARCHAR,
empty_column VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO clobtable
VALUES ( 'THIS IS A TEST', TO_VARCHAR(' - '));
```

#### Retrieving Data

##### Oracle

```sql
SELECT * FROM clobtable;
```

##### Result

| CLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| THIS IS A TEST |  |

##### Snowflake

```sql
SELECT * FROM
clobtable;
```

##### Result

| CLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| THIS IS A TEST | - |

### Known Issues

#### 1. The difference in max length CLOB (Oracle) and VARCHAR (Snowflake)

An [Oracle CLOB](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-0EAC5929-0674-429C-AF42-2D454C982F8F) column maximum size is **(4 gigabytes - 1) \* (database block size)**, but [Snowflake VARCHAR](https://docs.snowflake.com/en/sql-reference/data-types-text.html#varchar) is limited to **16MB**.

##### 2. Empty value with EMPTY_CLOB

Initializing a column using `EMPTY_CLOB()` will return an empty LOB locator. While in Snowflake after translation the column will return a string with ‘ `-` ‘.

##### 3. No access to the DBMS_LOB built-in package

Since LOB data types are not supported in Snowflake there is not an equivalent for the `DBMS_LOB` functions and there are no implemented workarounds yet.

### Related EWIs

No related EWIs.

## NCLOB Data type

### Description

> A character large object containing Unicode characters. Both fixed-width and variable-width character sets are supported, both using the database national character set. ([Oracle SQL Language Reference NCLOB Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-AB053D2C-2A40-478E-82E5-B9176C8776FD)).

> **Warning:**
>
> NCLOB Data Type is **not supported** in Snowflake. VARCHAR is used instead.

### Sample Source Patterns

#### NCLOB in Create Table

##### Oracle

```sql
CREATE TABLE nclobtable ( nclob_column NCLOB, empty_column NCLOB );

INSERT INTO nclobtable VALUES ( 'THIS IS A TEST', EMPTY_CLOB() );
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE nclobtable ( nclob_column VARCHAR,
empty_column VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO nclobtable
VALUES ( 'THIS IS A TEST', TO_VARCHAR(' - '));
```

#### Retrieving Data

##### Oracle

```sql
SELECT * FROM nclobtable;
```

##### Result

| NCLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| THIS IS A TEST |  |

##### Snowflake

```sql
SELECT * FROM
nclobtable;
```

##### Result

| NCLOB_COLUMN | EMPTY_COLUMN |
| --- | --- |
| THIS IS A TEST | - |

### Known Issues

#### 1. The difference in max length CLOB (Oracle) and VARCHAR (Snowflake)

An [Oracle NCLOB](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-AB053D2C-2A40-478E-82E5-B9176C8776FD) column maximum size is **(4 gigabytes - 1) \* (database block size)**, but [Snowflake VARCHAR](https://docs.snowflake.com/en/sql-reference/data-types-text.html#varchar) is limited to **16MB**.

##### 2. Empty value with EMPTY_CLOB

Initializing a column using `EMPTY_CLOB()` will return an empty LOB locator. While after translation the column will return a string with ‘ `-` ‘.

##### 3. No access to the DBMS_LOB built-in package

Since LOB data types are not supported in Snowflake there is not an equivalent for the `DBMS_LOB` functions and there are no implemented workarounds yet.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Oracle - PACKAGES
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/packages.md
section: Migrations
---

# SnowConvert AI - Oracle - PACKAGES

## Description

> Use the `CREATE` `PACKAGE` statement to create the specification for a stored package, which is an encapsulated collection of related procedures, functions, and other program objects stored together in the database. The package specification declares these objects. The package body, specified subsequently, defines these objects.([Oracle PL/SQL Language Reference CREATE PACKAGE Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/CREATE-PACKAGE.html#GUID-40636655-899F-47D0-95CA-D58A71C94A56))

Snowflake does not have an equivalent for Oracle packages, so in order to maintain the structure, the packages are transformed into a schema, and all its elements are defined inside it. Also, the package and its elements are renamed to preserve the original schema name.

## BODY

### Description

The header of the PACKAGE BODY is removed and each procedure or function definition is transformed into a standalone function or procedure.

#### CREATE PACKAGE SYNTAX

```none
CREATE [ OR REPLACE ]
[ EDITIONABLE | NONEDITIONABLE ]
PACKAGE BODY plsql_package_body_source
```

### Sample Source Patterns

> **Note:**
>
> The following queries were transformed with the PackagesAsSchema option disabled.

#### Oracle

```sql
CREATE OR REPLACE PACKAGE BODY SCHEMA1.PKG1 AS
    PROCEDURE procedure1 AS
        BEGIN
            dbms_output.put_line('hello world');
        END;
END package1;
```

##### Snowflake

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE SCHEMA1_PKG1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        CALL DBMS_OUTPUT.PUT_LINE_UDF('hello world');
    END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.

## Constants

Translation spec for Package Constants

### Description

PACKAGE CONSTANTS can be declared either in the package declaration or in the PACKAGE BODY. When a package constant is used in a procedure, a new variable is declared with the same name and value as the constant, so the resulting code is pretty similar to the input.

#### Oracle Constant declaration Syntax

```none
constant CONSTANT datatype [NOT NULL] { := | DEFAULT } expression ;
```

### Sample Source Patterns

#### Sample auxiliary code

##### Oracle

```sql
create table table1(id number);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

##### Oracle

```sql
CREATE OR REPLACE PACKAGE PKG1 AS
    PROCEDURE procedure1;
    package_constant CONSTANT NUMBER:= 9999;
END PKG1;

CREATE OR REPLACE PACKAGE BODY PKG1 AS
    PROCEDURE procedure1 AS
    BEGIN
        INSERT INTO TABLE1(ID) VALUES(package_constant);
    END;
END PKG1;

CALL PKG1.procedure1();

SELECT * FROM TABLE1;
```

##### Result

| ID |
| --- |
| 9999 |

##### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS PKG1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE PKG1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        PACKAGE_CONSTANT NUMBER := 9999;
    BEGIN
        INSERT INTO TABLE1(ID) VALUES(:PACKAGE_CONSTANT);
    END;
$$;

CALL PKG1.procedure1();

SELECT * FROM
    TABLE1;
```

##### Result

| ID |
| --- |
| 9999 |

> **Note:**
>
> Note that the`PROCEDURE` definition is being removed since it is not required in Snowflake.

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.

## DECLARATION

### Description

The declaration is converted to a schema, so each inner element is declared inside this schema. All the elements present in the package are commented except for the VARIABLES which have a proper transformation.

#### CREATE PACKAGE SYNTAX

```none
CREATE [ OR REPLACE ]
[ EDITIONABLE | NONEDITIONABLE ]
PACKAGE plsql_package_source
```

### Sample Source Patterns

> **Note:**
>
> The following queries were transformed with the PackagesAsSchema option disabled.

#### Oracle

```sql
CREATE OR REPLACE PACKAGE SCHEMA1.PKG1 AS
   -- Function Declaration
   FUNCTION function_declaration(param1 VARCHAR) RETURN INTEGER;

   -- Procedure Declaration
   PROCEDURE procedure_declaration(param1 VARCHAR2, param2 VARCHAR2);

END PKG1;
```

##### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS SCHEMA1_PKG1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

> **Note:**
>
> Note that both `FUNCTION` and `PROCEDURE` definitions are being removed since they are not required in Snowflake.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## VARIABLES

Translation spec for Package Variables

### Description

PACKAGE VARIABLES can be declared either in the package declaration or in the PACKAGE BODY. Due to its behavior, these variables are converted into [Snowflake session variables](https://docs.snowflake.com/en/sql-reference/session-variables.html) so each usage or assignment is translated to its equivalent in Snowflake.

#### Oracle Variable declaration syntax

```sql
variable datatype [ [ NOT NULL] {:= | DEFAULT} expression ] ;
```

### Sample Source Patterns

#### Sample auxiliary code

##### Oracle

```sql
create table table1(id number);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Variable declaration

##### Oracle

```sql
CREATE OR REPLACE PACKAGE PKG1 AS
    package_variable NUMBER:= 100;
END PKG1;
```

##### Snowflake Scripting

```sql
CREATE SCHEMA IF NOT EXISTS PKG1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

SET "PKG1.PACKAGE_VARIABLE" = '' || (100);
```

#### Variable Usage

Package variable usages are transformed into the Snowflake [GETVARIABLE](https://docs.snowflake.com/en/sql-reference/session-variables.html#session-variable-functions) function which accesses the current value of a session variable. An explicit cast is added to the original variable data type in order to maintain the functional equivalence in the operations where these variables are used.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE PKG1 AS
    PROCEDURE procedure1;
    package_variable NUMBER:= 100;
END PKG1;

CREATE OR REPLACE PACKAGE BODY PKG1 AS
    PROCEDURE procedure1 AS
    BEGIN
        INSERT INTO TABLE1(ID) VALUES(package_variable);
    END;
END PKG1;

CALL SCHEMA1.PKG1.procedure1();

SELECT * FROM TABLE1;
```

##### Result

| ID |
| --- |
| 100 |

##### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS PKG1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

SET "PKG1.PACKAGE_VARIABLE" = '' || (100);

CREATE OR REPLACE PROCEDURE PKG1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO TABLE1(ID) VALUES(GETVARIABLE('PKG1.PACKAGE_VARIABLE') :: NUMBER);
    END;
$$;

CALL SCHEMA1.PKG1.procedure1();

SELECT * FROM
    TABLE1;
```

##### Result

| ID |
| --- |
| 100 |

> **Note:**
>
> Note that the `PROCEDURE` definition in the package is removed since it is not required by Snowflake.

### Variable regular assignment

When a package variable is assigned using the `:=` operator, the assignation is replaced by a SnowConvert AI UDF called UPDATE_PACKAGE_VARIABLE_STATE which is an abstraction of the Snowflake [SETVARIABLE](https://docs.snowflake.com/en/sql-reference/session-variables.html#session-variable-functions) function.

Oracle

#### Oracle

```sql
CREATE OR REPLACE PACKAGE PKG1 AS
    PROCEDURE procedure1;
    package_variable NUMBER:= 100;
END PKG1;

CREATE OR REPLACE PACKAGE BODY PKG1 AS
    PROCEDURE procedure1 AS
    BEGIN
        package_variable := package_variable + 100;
        INSERT INTO TABLE1(ID) VALUES(package_variable);
    END;
END PKG1;

CALL PKG1.procedure1();

SELECT * FROM TABLE1;
```

##### Result

| ID |
| --- |
| 200 |

#### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS PKG1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

SET "PKG1.PACKAGE_VARIABLE" = '' || (100);

CREATE OR REPLACE PROCEDURE PKG1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL UPDATE_PACKAGE_VARIABLE_STATE_UDF('PKG1.PACKAGE_VARIABLE', TO_VARCHAR(GETVARIABLE('PKG1.PACKAGE_VARIABLE') :: NUMBER + 100));
        INSERT INTO TABLE1(ID) VALUES(GETVARIABLE('PKG1.PACKAGE_VARIABLE') :: NUMBER);
    END;
$$;

CALL PKG1.procedure1();

SELECT * FROM
    TABLE1;
```

##### Result

| ID |
| --- |
| 200 |

> **Note:**
>
> Note that the `PROCEDURE` definition in the package is removed since it is not required by Snowflake.

#### Variable assignment as an output argument

When a package variable is used as an output argument a new variable is declared inside the procedure, this variable will catch the output argument value of the procedure, and then the variable will be used to update the session variable which refers to the package variable using the UPDATE_PACKAGE_VARIABLE_STATE mentioned above.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE PKG1 AS
    PROCEDURE procedure1;
    PROCEDURE procedure2(out_param OUT NUMBER);
    package_variable NUMBER:= 100;
END PKG1;

CREATE OR REPLACE PACKAGE BODY PKG1 AS
    PROCEDURE procedure1 AS
    BEGIN
        procedure2(package_variable);
        INSERT INTO TABLE1(ID) VALUES(package_variable);
    END;
    PROCEDURE procedure2 (out_param OUT NUMBER) AS
    BEGIN
        out_param := 1000;
    END;
END PKG1;

CALL PKG1.procedure1();
```

##### Result

| ID |
| --- |
| 1000 |

##### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS PKG1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;

SET "PKG1.PACKAGE_VARIABLE" = '' || (100);

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
CREATE OR REPLACE PROCEDURE PKG1.procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        PKG1_PACKAGE_VARIABLE VARIANT;
    BEGIN
        CALL PKG1.
        procedure2(:PKG1_PACKAGE_VARIABLE);
        CALL UPDATE_PACKAGE_VARIABLE_STATE_UDF('PKG1.PACKAGE_VARIABLE', TO_VARCHAR(:PKG1_PACKAGE_VARIABLE));
        INSERT INTO TABLE1(ID) VALUES(GETVARIABLE('PKG1.PACKAGE_VARIABLE') :: NUMBER);
    END;
$$;

CREATE OR REPLACE PROCEDURE PKG1.procedure2 (out_param OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        out_param := 1000;
    END;
$$;

CALL PKG1.procedure1();
```

##### Result

| ID |
| --- |
| 1000 |

> **Note:**
>
> Note that the `PROCEDURE` definition in the package is removed since it is not required by Snowflake.

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.

---
title: SnowConvert AI - Oracle - PL/SQL to Javascript
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-javascript/README.md
section: Migrations
---

# SnowConvert AI - Oracle - PL/SQL to Javascript

This is a translation reference to convert PL/SQL statements to snowflake JavaScript

## Collections & Records

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Records

> **Note:**
>
> You might also be interested in Records declaration.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE RECORDS_PROC AS
 TYPE DEPTRECTYP IS RECORD (
    DEPT_ID    NUMBER(4) NOT NULL := 10,
    DEPT_NAME  VARCHAR2(30) NOT NULL := 'ADMINISTRATION',
    MGR_ID     NUMBER(6) := 200,
    LOC_ID     NUMBER(4) := 1700
  );

  TYPE NAME_REC IS RECORD (
    FIRST  EMPLOYEES.FIRST_NAME%TYPE,
    LAST   EMPLOYEES.LAST_NAME%TYPE
  );

  TYPE CONTACT IS RECORD (
    NAME  NAME_REC,-- NESTED RECORD
    PHONE EMPLOYEES.PHONE_NUMBER%TYPE
  );

  DEPT1 DEPTRECTYP;
  DEPT_NAME DEPTRECTYP;
  C1 CONTACT;
BEGIN
  DEPT1.DEPT_NAME := 'PURCHASING';
  C1.NAME.FIRST := 'FALVARADO';
  C1.PHONE := '50687818481';
  SELECT * INTO DEPT1 FROM FTABLE46;
  INSERT INTO TABLA1 VALUES (DEPT1.DEPT_NAME);
  INSERT INTO TABLA1 VALUES (DEPT_NAME.DEPT_NAME);
  EXECUTE IMMEDIATE 'SELECT * FROM FTABLE46' INTO DEPT_NAME;
END;
```

#### Snowflake

> **Warning:**
>
> Transformation for “SELECT INTO Record” is in progress.

```sql
CREATE OR REPLACE PROCEDURE RECORDS_PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
  TYPE DEPTRECTYP IS RECORD (
     DEPT_ID    NUMBER(4) NOT NULL := 10,
     DEPT_NAME  VARCHAR2(30) NOT NULL := 'ADMINISTRATION',
     MGR_ID     NUMBER(6) := 200,
     LOC_ID     NUMBER(4) := 1700
   );
  !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!

   TYPE NAME_REC IS RECORD (
     FIRST  EMPLOYEES.FIRST_NAME%TYPE,
     LAST   EMPLOYEES.LAST_NAME%TYPE
   );
  !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!

   TYPE CONTACT IS RECORD (
     NAME  NAME_REC,-- NESTED RECORD
     PHONE EMPLOYEES.PHONE_NUMBER%TYPE
   );

   DEPT1 OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - DEPTRECTYP DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
   DEPT_NAME OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - DEPTRECTYP DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
   C1 OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - CONTACT DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
 BEGIN
  DEPT1 := OBJECT_INSERT(DEPT1, 'DEPT_NAME', 'PURCHASING', true);
  C1 := OBJECT_INSERT(C1, 'FIRST', 'FALVARADO', true);
  C1 := OBJECT_INSERT(C1, 'PHONE', '50687818481', true);
  SELECT
   OBJECT_CONSTRUCT( *) INTO
   :DEPT1
  FROM
   FTABLE46;
  INSERT INTO TABLA1
  SELECT
   :DEPT1.DEPT_NAME:DEPT_ID,
   :DEPT1.DEPT_NAME:DEPT_NAME,
   :DEPT1.DEPT_NAME:MGR_ID,
   :DEPT1.DEPT_NAME:LOC_ID;
  INSERT INTO TABLA1
  SELECT
   :DEPT_NAME.DEPT_NAME:DEPT_ID,
   :DEPT_NAME.DEPT_NAME:DEPT_NAME,
   :DEPT_NAME.DEPT_NAME:MGR_ID,
   :DEPT_NAME.DEPT_NAME:LOC_ID;
  !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
  EXECUTE IMMEDIATE 'SELECT * FROM
   FTABLE46'
            !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'EXECUTE IMMEDIATE RETURNING CLAUSE' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
            INTO DEPT_NAME;
 END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
3. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
4. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL

## Conditional Compilation

### Description

> Provides conditional compilation based on the truth value of a condition.

For more information regarding Oracle Conditional Compilation IF, check [here](https://www.oracle.com/partners/campaign/plsql-conditional-compilation-133587.pdf).

```sql
$IF conditional_expression $THEN
     statement
     [ statement ]...
[ $ELSIF conditional_expression $THEN
     statement
     [ statement ]... ]...
[ $ELSE
     statement
     [ statement ]... ]
$END;
```

### Sample Source Patterns

#### Possible IF variations

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE_DEMO ()
   AS
   BEGIN
      SELECT 2 FROM DUAL;
      $IF $$debug_flag
      $THEN
         SELECT 1 FROM DUAL;
      $END
   END PROCEDURE_DEMO;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE_DEMO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      SELECT 2 FROM DUAL;
      !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'DOLLAR IF STATEMENT' NODE ***/!!!
      $IF $$debug_flag
      $THEN
         SELECT 1 FROM DUAL;
      $END
   END;
$$;
```

### Known issues

1. Transformation of Conditional Compilation is not currently supported.

### Related EWIs

* [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Control Statements

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### IF, ELSIF and ELSE Statement

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
    sal_raise NUMBER;
BEGIN
  IF jobid = 'PU_CLERK' THEN sal_raise := .09;
  ELSIF jobid = 'SH_CLERK' THEN sal_raise := .08;
  ELSIF jobid = 'ST_CLERK' THEN sal_raise := .07;
  ELSE sal_raise := 0;
  END IF;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let SAL_RAISE;
  if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT jobid MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    JOBID == `PU_CLERK`) {
    SAL_RAISE = 0.09;
  } else if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT jobid MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    JOBID == `SH_CLERK`) {
    SAL_RAISE = 0.08;
  } else if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT jobid MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    JOBID == `ST_CLERK`) {
    SAL_RAISE = 0.07;
  } else {
    SAL_RAISE = 0;
  }
$$;
```

### Loop

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
BEGIN
  LOOP
    i := i + 1;
    j := 0;
    LOOP
      j := j + 1;
      s := s + i * j; -- Sum several products
    END LOOP inner_loop;
  END LOOP outer_loop;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  while ( true ) {
    I =
        !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT i MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
        I + 1;
    J = 0;
    while ( true ) {
      J =
          !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT j MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
          J + 1;
      S =
          !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT s MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
          S +
            !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT i MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
            I *
            !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT j MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
            J;
    }
  }
$$;
```

### While Statement

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
I NUMBER := 1;
J NUMBER := 10;
BEGIN
  WHILE I <> J LOOP
    I := I+1;
  END LOOP;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let I = 1;
  let J = 10;
  while ( I != J ) {
    I = I + 1;
  }
$$;
```

### Related EWIs

1. [SSC-EWI-0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Object may not work.

## Declarations

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Variable declaration and assignment

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_VARIABLES
IS
  localVar1 NUMBER;
  localVar2 VARCHAR(100);
  localVar3 VARCHAR2 := 'local variable 3';
  localVar4 VARCHAR2 DEFAULT 'local variable 4';
  localVar5 VARCHAR2 NOT NULL := 'local variable 5';
  localVar6 VARCHAR2 NOT NULL DEFAULT 'local variable 6';
  localVar7 NUMBER := NULL;
  localVar8 NUMBER := '';
BEGIN
    localVar1 := 123;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_VARIABLES ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let LOCALVAR1;
  let LOCALVAR2;
  let LOCALVAR3 = `local variable 3`;
  let LOCALVAR4 = `local variable 4`;
  let LOCALVAR5 = `local variable 5`;
  let LOCALVAR6 = `local variable 6`;
  let LOCALVAR7 = undefined;
  let LOCALVAR8 = undefined;
  LOCALVAR1 = 123;
$$;
```

### Record variable declaration

> **Note:**
>
> You might also be interested in Records transformation section.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_RECORDS
IS
    TYPE DEPTRECTYP IS RECORD (
    DEPT_ID    NUMBER(4) NOT NULL := 10,
    DEPT_NAME  VARCHAR2(30) NOT NULL := 'ADMINISTRATION',
    MGR_ID     NUMBER(6) := 200,
    LOC_ID     NUMBER(4) := 1700
  );

  TYPE NAME_REC IS RECORD (
    FIRST  EMPLOYEES.FIRST_NAME%TYPE,
    LAST   EMPLOYEES.LAST_NAME%TYPE
  );

  TYPE CONTACT IS RECORD (
    NAME  NAME_REC,-- NESTED RECORD
    PHONE EMPLOYEES.PHONE_NUMBER%TYPE
  );
BEGIN
    null;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_RECORDS ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  class DEPTRECTYP {
    DEPT_ID = 10
    DEPT_NAME = `ADMINISTRATION`
    MGR_ID = 200
    LOC_ID = 1700
    constructor() {
      [...arguments].map((element,Index) => this[(Object.keys(this))[Index]] = element)
    }
  }
  class NAME_REC {
    FIRST
    LAST
    constructor() {
      [...arguments].map((element,Index) => this[(Object.keys(this))[Index]] = element)
    }
  }
  class CONTACT {
    NAME = new NAME_REC()
    PHONE
    constructor() {
      [...arguments].map((element,Index) => this[(Object.keys(this))[Index]] = element)
    }
  }
  null;
$$;
```

### Rowtype Record variable declaration

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE ROWTYPE_PROC AS
  varname number := 1;
  CURSOR BOOK_CURSOR IS SELECT * FROM BOOK where 1 = varname;

  BOOK_REC BOOK%ROWTYPE;
  BOOK_CUR_REC BOOK_CURSOR%ROWTYPE;
BEGIN
  BOOK_REC.ID     := 10;
  BOOK_REC.TITLE  := 'A STUDY IN SCARLET';
  BOOK_REC.AUTHOR := 'SIR ARTHUR CONAN DOYLE';

  INSERT INTO BOOK VALUES(BOOK_REC.ID, BOOK_REC.TITLE, BOOK_REC.AUTHOR);
  OPEN BOOK_CURSOR;
  FETCH BOOK_CURSOR INTO BOOK_CUR_REC;
  CLOSE BOOK_CURSOR;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE ROWTYPE_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let VARNAME = 1;
  let BOOK_CURSOR = new CURSOR(`SELECT * FROM
      BOOK
   where 1 = ?`,() => [VARNAME]);
  let BOOK_REC = ROWTYPE(`BOOK`);
  let BOOK_CUR_REC = BOOK_CURSOR.ROWTYPE();
  BOOK_REC.ID = 10;
  BOOK_REC.TITLE = `A STUDY IN SCARLET`;
  BOOK_REC.AUTHOR = `SIR ARTHUR CONAN DOYLE`;
  EXEC(`INSERT INTO BOOK
  VALUES(
  !!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE BOOK_REC.ID MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
  ?,
  !!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE BOOK_REC.TITLE MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
  ?,
  !!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE BOOK_REC.AUTHOR MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
  ?)`,[BOOK_REC.ID,BOOK_REC.TITLE,BOOK_REC.AUTHOR]);
  BOOK_CURSOR.OPEN();
  BOOK_CURSOR.FETCH(BOOK_CUR_REC) && ([BOOK_CUR_REC] = BOOK_CURSOR.INTO());
  BOOK_CURSOR.CLOSE();
$$;
```

### Constant Declaration

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_CONSTANTS
IS
    MY_VAR1 NUMBER;
    MY_CONST_VAR1 CONSTANT INTEGER(4) := 40;
    MY_CONST_VAR2 CONSTANT INTEGER(4) NOT NULL := MY_CONST_VAR1;
    MY_CONST_VAR3 CONSTANT VARCHAR(20) DEFAULT 'const variable';
    MY_CONST_VAR4 CONSTANT REAL NOT NULL DEFAULT 3.14159;
BEGIN
    MY_VAR1 := MY_CONST_VAR1 + MY_CONST_VAR2 + 1;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_CONSTANTS ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let MY_VAR1;
    const MY_CONST_VAR1 = 40;
    const MY_CONST_VAR2 = MY_CONST_VAR1;
    const MY_CONST_VAR3 = `const variable`;
    const MY_CONST_VAR4 = 3.14159;
    const MY_CONST_VAR1 = 40;
    const MY_CONST_VAR2 = MY_CONST_VAR1;
    MY_VAR1 = MY_CONST_VAR1 + MY_CONST_VAR2 + 1;
$$;
```

### Cursor declarations and definition

#### Oracle

> **Note:**
>
> You might also be interested in [Cursor helper](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC_CURSORS
IS
    CURSOR C1 RETURN Table1%ROWTYPE;
    CURSOR C2 RETURN UserDefinedRecordType;
    CURSOR C3 RETURN Table1%ROWTYPE IS
        SELECT * FROM Table1 WHERE ID = 110;
    CURSOR C4 IS
        SELECT * FROM Table1 WHERE ID = 123;
    CURSOR C5 (cursorParam NUMBER ) RETURN Table1%ROWTYPE IS
        SELECT * FROM Table1 WHERE ID = cursorParam;
BEGIN
    null;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC_CURSORS ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let C1 = new CURSOR();
    let C2 = new CURSOR();
    let C3 = new CURSOR(`SELECT * FROM
           Table1
        WHERE ID = 110`,() => []);
    let C4 = new CURSOR(`SELECT * FROM
           Table1
        WHERE ID = 123`,() => []);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    let C5 = new CURSOR(`SELECT * FROM
           Table1
        WHERE ID = ?`,(CURSORPARAM) => [CURSORPARAM]);
    null;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

1. [SSC-EWI-0022](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in this statement were considered parameters by default.
2. [SSC-EWI-0026](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The variable may require a cast to date, time or timestamp.

## Expressions and operators

### Expressions

#### Concatenation Operator

> **Note:**
>
> You might also be interested in [Concat helper.](helpers.md)

Oracle concatenation is achieved in JavaScript using [Template literal](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Template_literals). Also it uses the *Concat Helper* to properly handle concatenations with nulls.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE CONCAT_TEST
IS
NUM1 INTEGER := 123;
NUM2 INTEGER := 321;
VAR1 VARCHAR(10) := 'value';
concat_var VARCHAR(100);
sql_stmt VARCHAR(100);
BEGIN
    concat_var := NUM1 || NUM2 || VAR1 || 'literal';
    sql_stmt := 'INSERT INTO t1 VALUES (''' || concat_var || ''')';
    EXECUTE IMMEDIATE sql_stmt;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE CONCAT_TEST ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let NUM1 = 123;
    let NUM2 = 321;
    let VAR1 = `value`;
    let CONCAT_VAR;
    let SQL_STMT;
    CONCAT_VAR = `${concatValue(NUM1)}${concatValue(NUM2)}${concatValue(VAR1)}literal`;
    SQL_STMT = `INSERT INTO t1
VALUES ('${concatValue(CONCAT_VAR)}')`;
    EXEC(SQL_STMT);
$$;
```

#### Logical Operators

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE BOOLEAN_PROC (b_name VARCHAR2, b_value  BOOLEAN)
IS
BOOL1 BOOLEAN := FALSE;
x NUMBER := 5;
y NUMBER := NULL;
BEGIN

  IF b_value IS NULL THEN
    null;
  ELSIF b_value = TRUE THEN
    null;
  ELSIF b_value = TRUE AND b_value = BOOL1  OR b_value = BOOL1 THEN
    null;
  ELSIF x > y THEN
    null;
  ELSIF x != y AND x <> y THEN
    null;
  ELSE
    null;
  END IF;
END;
```

##### Snowflake

> **Note:**
>
> You might also be interested in [IS NULL helper](helpers.md)[.](helpers.md)

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE BOOLEAN_PROC (b_name STRING, b_value BOOLEAN)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

  // SnowConvert AI Helpers Code section is omitted.

  let BOOL1 = false;
  let X = 5;
  let Y = undefined;
  if (IS_NULL(B_VALUE)) {
    null;
  } else if (B_VALUE == true) {
    null;
  } else if (B_VALUE == true && B_VALUE == BOOL1 || B_VALUE == BOOL1) {
    null;
  } else if (X > Y) {
    null;
  } else if (X != Y && X != Y) {
    null;
  } else {
    null;
  }
$$;
```

#### Comparison Operator

Documentation in progress.

##### IS [NOT] NULL

> **Note:**
>
> You might also be interested in [IS NULL helper](helpers.md).

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE NULL_TEST
IS
NUM1 INTEGER := 789;
BEGIN
    IF NUM1 IS NOT NULL THEN
        NULL;
    END IF;

    NUM1 := NULL;

    IF NUM1 IS NULL THEN
        NULL;
    END IF;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE NULL_TEST ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    // SnowConvert AI Helpers Code section is omitted.

    let NUM1 = 789;
    if (!IS_NULL(NUM1)) {
        null;
    }
    NUM1 = undefined;
    if (IS_NULL(NUM1)) {
        null;
    }
$$;
```

##### Like Operator

> **Note:**
>
> You might also be interested in [Like operator helper.](helpers.md)

When there is a LIKE operation, the helper function will be called instead.

###### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE_WITH_LIKE AS
BEGIN
	IF 'ABC' LIKE '%A%' THEN
		 null;
	END IF;
  IF 'ABC' LIKE 'A%' THEN
     null;
  END IF;
  IF 'ABC' NOT LIKE 'D_%' THEN
     null;
  END IF;
  IF 'ABC' NOT LIKE 'D/%%' ESCAPE '/' THEN
     null;
  END IF;
END;
```

###### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE_WITH_LIKE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	if (LIKE(`ABC`,`%A%`)) {
		null;
	}
	if (LIKE(`ABC`,`A%`)) {
		null;
	}
	if (!LIKE(`ABC`,`D_%`)) {
		null;
	}
	if (!LIKE(`ABC`,`D/%%`,`/`)) {
		null;
	}
$$;
```

##### Between Operator

> **Note:**
>
> You may also be interested in [Between operator helper.](helpers.md)

###### Oracle

```sql
CREATE OR REPLACE PROCEDURE BETWEEN_TEST
IS
NUM1 INTEGER := 789;
US INTEGER := 1000;
BEGIN
    IF 800 BETWEEN US AND NUM1 THEN
        NULL;
    END IF;
    IF 'BA' BETWEEN 'B' AND 'CA' THEN
        NULL;
    END IF;

    -- Assign null to the variable num1
    NUM1 := NULL;

    IF (0 BETWEEN NULL AND NUM1) IS NULL THEN
        NULL;
    END IF;
END;
```

###### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE BETWEEN_TEST ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let NUM1 = 789;
    let US = 1000;
    if (BetweenFunc(800,US,NUM1)) {
        null;
    }
    if (BetweenFunc(`BA`,`B`,`CA`)) {
        null;
    }

    // Assign null to the variable num1
    NUM1 = undefined;
    if (IS_NULL(BetweenFunc(0,undefined,NUM1))) {
        null;
    }
$$;
```

##### IN Operator

###### Oracle

```sql
CREATE OR REPLACE PROCEDURE IN_PROC
IS
letter VARCHAR2(1) := 'm';
BEGIN
  IF letter IN ('a', 'b', 'c') THEN
    null;
  ELSE
    null;
  END IF;
END;
```

###### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE IN_PROC ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let LETTER = `m`;
  if ([`a`,`b`,`c`].includes(LETTER)) {
    null;
  } else {
    null;
  }
$$;
```

#### Boolean Expressions

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE BOOLEAN_TEST
IS
done BOOLEAN;
BEGIN
  -- These WHILE loops are equivalent
  done := FALSE;
  WHILE done = FALSE
    LOOP
      done := TRUE;
    END LOOP;

  done := FALSE;
  WHILE NOT (done = TRUE)
    LOOP
      done := TRUE;
    END LOOP;

  done := FALSE;
  WHILE NOT done
    LOOP
      done := TRUE;
    END LOOP;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE BOOLEAN_TEST ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let DONE;
  // These WHILE loops are equivalent
  DONE = false;
  while ( DONE == false ) {
    DONE = true;
  }
  DONE = false;
  while ( !(DONE == true) ) {
    DONE = true;
  }
  DONE = false;
  while ( !DONE ) {
    DONE = true;
  }
$$;
```

#### Function Expressions

For Function Expressions inside procedures, they are being converted to the corresponding function or expression in Snowflake. These function calls are passed to an EXEC with a CALL or a SELECT depending on the converted value.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE FUNCTIONS_TEST(DATEPARAM DATE)
IS
	STRING_VALUE VARCHAR(20) := 'HELLO';
BEGIN
	STRING_VALUE := TO_CHAR(123);
	STRING_VALUE := TO_CHAR(DATEPARAM, 'dd-mm-yyyy', 'NLS_DATE_LANGUAGE = language');
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE FUNCTIONS_TEST (DATEPARAM TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let STRING_VALUE = `HELLO`;
	STRING_VALUE = (EXEC(`SELECT
   TO_CHAR(123)`))[0];
	STRING_VALUE = (EXEC(`SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0013 - NLS PARAMETER 'NLS_DATE_LANGUAGE = language' NOT SUPPORTED ***/!!!
   TO_CHAR(PUBLIC.CAST_DATE_UDF(?), 'dd-mm-yyyy')`,[DATEPARAM]))[0];
$$;
```

For more information on the function’s transformations check [here](../functions/README.md).

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

1. [SSC-EWI-OR0013](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): NLS parameter is not supported.
2. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

## User defined functions

### General Description

Most Oracle UDFs and UDFs inside packages, are being transformed to Snowflake Stored Procedures, to maintain functional equivalence, due to Snowflake UDFs having some limitations executing DML (Data Manipulation Language) statements.

### Translation

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Create Function

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(PAR1 VARCHAR)
RETURN VARCHAR
IS
    VAR1 VARCHAR(20);
    VAR2 VARCHAR(20);
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 where col1 = 1;
    VAR2 := PAR1 || VAR1;
    RETURN VAR2 ;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (PAR1 VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
    WITH declaration_variables_cte1 AS
    (
        SELECT
            (
            SELECT COL1
            FROM
                TABLE1
            where col1 = 1) AS VAR1,
            NVL(PAR1 :: STRING, '') || NVL(VAR1 :: STRING, '') AS
            VAR2
    )
    SELECT
        VAR2
    FROM
        declaration_variables_cte1
$$;
```

#### Function inside Package

##### Oracle

```sql
CREATE OR REPLACE PACKAGE BODY pkg1 AS
FUNCTION f1(PAR1 VARCHAR) RETURN VARCHAR IS
    VAR1 VARCHAR(20);
    VAR2 VARCHAR(20);
  BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 where col1 = 1;
    VAR2 := PAR1 || VAR1;
    RETURN VAR2 ;
  END f1;
END pkg1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION pkg1.f1(PAR1 VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
  WITH declaration_variables_cte1 AS
  (
    SELECT
      (
      SELECT COL1
      FROM
        TABLE1
      where col1 = 1) AS VAR1,
      NVL(PAR1 :: STRING, '') || NVL(VAR1 :: STRING, '') AS
      VAR2
  )
  SELECT
    VAR2
  FROM
    declaration_variables_cte1
$$;
```

### Return data type mapping

| Oracle PL SQL type | Snowflake equivalent |
| --- | --- |
| NUMBER | FLOAT |
| LONG | VARCHAR |
| VARCHAR2 | STRING |
| BLOB | BINARY |
| BFILE | BINARY |

### Call

#### Inside queries

Calls of functions that were transformed to procedures inside queries are converted into an empty Snowflake JavaScript UDF. This Snowflake UDF is generated in the **STUB_UDF.sql** file inside the **UDF Helpers** directory.

##### Oracle

```sql
CREATE VIEW VIEW1 AS SELECT FUN1(COL2) FROM TABLE1;
CREATE VIEW VIEW2 AS SELECT PKG1.F1(COL1) FROM TABLE1;
```

##### Snowflake

```sql
CREATE OR REPLACE VIEW VIEW1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
SELECT FUN1(COL2) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FUN1' NODE ***/!!! FROM
TABLE1;

CREATE OR REPLACE VIEW VIEW2
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
SELECT PKG1.F1(COL1) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PKG1.F1' NODE ***/!!! FROM
TABLE1;
```

#### Inside other functions or stored procedures

The functions that are converted to procedures are called using the [EXEC Snowflake helper](helpers.md).

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x NUMBER) RETURN NUMBER IS
  VAR1 NUMBER;
  BEGIN
    -- FUN2 is another UDF
    VAR1 := FUN2(pkg1.f1(X, FUN2(10)));
    RETURN VAR1;
  END f1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (x NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
  WITH declaration_variables_cte1 AS
  (
    SELECT
      FUN2(pkg1.f1(X, FUN2(10) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FUN2' NODE ***/!!!) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'pkg1.f1' NODE ***/!!!) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FUN2' NODE ***/!!! AS
      -- FUN2 is another UDF
      VAR1
  )
  SELECT
    VAR1
  FROM
    declaration_variables_cte1
$$;
```

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x NUMBER) RETURN NUMBER IS
  VAR1 NUMBER;
  BEGIN
    -- FUN2 is another UDF
    VAR1 := FUN2(X);
    RETURN VAR1;
  END f1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (x NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
  WITH declaration_variables_cte1 AS
  (
    SELECT
      FUN2(X) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FUN2' NODE ***/!!! AS
      -- FUN2 is another UDF
      VAR1
  )
  SELECT
    VAR1
  FROM
    declaration_variables_cte1
$$;
```

### Different cases and limitations

#### Functions with DMLs

These functions cannot be executed in queries in Oracle, so their usage wont be limited when transforming them to Snowflake Procedures.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x NUMBER)
RETURN NUMBER IS
VAR1 NUMBER;
BEGIN
    VAR1 := VAR1 + 1;
    INSERT INTO TABLE1(col1, col2) VALUES(X, VAR1);
    UPDATE TABLE2 SET COL1 = VAR1 WHERE ID = X;
    RETURN VAR1;
END FUN1;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE FUN1 (x FLOAT)
RETURNS FLOAT
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1;
    VAR1 = VAR1 + 1;
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`INSERT INTO TABLE1(col1, col2) VALUES(?, ?)`,[X,VAR1]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`UPDATE TABLE2
       SET COL1 = ?
       WHERE ID = ?`,[VAR1,X]);
    return VAR1;
$$;
```

#### Functions with only one SELECT INTO

These functions are transformed to Snowflake SQL functions by removing the INTO part of the select.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(PAR1 VARCHAR)
RETURN VARCHAR
IS
    VAR1 VARCHAR(20);
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 where col1 = PAR1;
    RETURN VAR1;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (PAR1 VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
    WITH declaration_variables_cte1 AS
    (
        SELECT
            (
            SELECT COL1
            FROM
                TABLE1
            where col1 = PAR1) AS VAR1
    )
    SELECT
        VAR1
    FROM
        declaration_variables_cte1
$$;
```

#### Functions with only logic

UDFs that do not use any SQL statement are converted into Snowflake JavaScript UDFs.

> **Note:**
>
> When SQL built-in functions are included in the logic the user defined function is converted to a Snowflake procedure. Translation for built in functions to a JavaScript equivalent is planned to be delivered in the future.
>
> Examples for built-in functions: UPPER(), TRIM(), ABS().

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x NUMBER)
RETURN NUMBER IS
VAR1 NUMBER;
BEGIN
    IF x < 5 THEN
        VAR1 := 1;
    ELSE
        VAR1 := 0;
    END IF;
    RETURN VAR1;
END FUNC01;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (x NUMBER(38, 18))
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
    WITH declaration_variables_cte1 AS
    (
        SELECT
            CASE
                WHEN x < 5
                    THEN 1
                ELSE 0
            END AS VAR1
    )
    SELECT
        VAR1
    FROM
        declaration_variables_cte1
$$;
```

#### Functions with more than one SQL statement

> **Warning:**
>
> UDFs transformed into procedures cannot be called from a query.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x NUMBER)
RETURN NUMBER IS
VAR1 NUMBER;
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE1 WHERE ID = X;
    IF VAR1 < 5 THEN
        VAR1 := 1;
    ELSE
        VAR1 := 0;
    END IF;
    UPDATE TABLE1 SET COL1 = VAR1 WHERE ID = X;
    RETURN VAR1;
END FUN1;
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE FUN1 (x FLOAT)
RETURNS FLOAT
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1;
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    [VAR1] = EXEC(`SELECT
   COL1
FROM
   TABLE1
WHERE ID = ?`,[X]);
    if (VAR1 < 5) {
        VAR1 = 1;
    } else {
        VAR1 = 0;
    }
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`UPDATE TABLE1
       SET COL1 = ?
       WHERE ID = ?`,[VAR1,X]);
    return VAR1;
$$;
```

#### Functions with only logic and built-in SQL functions

> **Note:**
>
> This transformation is planned to be delivered in the future, currently all functions are being transformed to stored procedures.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1(x FLOAT)
RETURN NUMBER IS
VAR1 NUMBER;
BEGIN
    IF TRUNC(X) < 5 THEN
        VAR1 := 1;
    ELSE
        VAR1 := 0;
    END IF;
    RETURN VAR1;
END FUNC01;
```

###### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (x FLOAT)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
    WITH declaration_variables_cte1 AS
    (
        SELECT
            CASE
                WHEN TRUNC(X) < 5
                    THEN 1
                ELSE 0
            END AS VAR1
    )
    SELECT
        VAR1
    FROM
        declaration_variables_cte1
$$;
```

#### RETURN CASE

The transformation is the same transformation when the CASE is use to assign a variable.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION FUN1 (flag FLOAT)
RETURN NUMBER IS
BEGIN
  return CASE flag
	WHEN 1 THEN 'one'
	WHEN 2 THEN 'two'
	WHEN 3 THEN 'three'
	WHEN 4 THEN 'four'
	ELSE 'unknown' END;
END FUN1;
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION FUN1 (flag FLOAT)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/13/2024",  "domain": "test" }}'
AS
$$
	SELECT
		CASE flag
			WHEN 1 THEN 'one'
			WHEN 2 THEN 'two'
			WHEN 3 THEN 'three'
			WHEN 4 THEN 'four'
			ELSE 'unknown' END
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0022](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in this statement were considered parameters by default.
2. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-FDM-0029](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): User defined function was transformed to a Snowflake procedure.

## Packages

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Package Declaration

This section shows the equivalence between Oracle Package Declaration members and Snowflake statements.

#### Package Translation options

There are two options to migrate packages, each option will affect directly the naming of the objects inside the package. Check [here](../../../general/getting-started/running-snowconvert/conversion/oracle-conversion-settings.md) how you can change this mode in the UI.

Let’s suppose that we have the next scenario in Oracle:

* A package named `MY_PACKAGE.`
* A procedure inside the package named `MY_PROCEDURE.`

##### Option 1 (Using new schema)

With this option, packages are transformed into new schemas. Package elements like functions and procedures are created inside the new schema. If the package is already inside a schema, the name of the package will be joined with the name of the schema with an underscore.

This is the **default** option for translating packages.

Result:

* A schema will be created with the name `MY_PACKAGE`.
* Qualified name of the procedure will be updated to `MY_PACKAGE.MY_PROCEDURE`.
* If the package is inside a schema then the procedure will be updated to `MY_SCHEMA_MY_PACKAGE.MY_PROCEDURE`.

##### Option 2

With this option, the name of the package elements will be joined with the package name with an underscore. New schemas will not be created.

Result:

* Name of the procedure will be updated to `MY_PACKAGE_MY_PROCEDURE`.
* If the package is inside a schema then the procedure will be updated to `MY_SCHEMA.MY_PACKAGE_MY_PROCEDURE`.

#### Create Package

The CREATE PACKAGE statement will be converted to a CREATE SCHEMA statement. Any member inside the package will be converted outside of the package.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE MY_PACKAGE AS
-- Other elements...
END MY_PACKAGE ;
```

##### Transformation with option 1 (Using new schema)

```sql
CREATE IF NOT EXISTS SCHEMA MY_PACKAGE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
-- Other elements...
```

##### Transformation with option 2

With this option, the Schema won’t be generated and only the inner elements will be kept but with their names renamed.

```sql
-- Other elements...
```

#### Procedure and function declaration

Procedure and function declarations are not necessary for the transformation to Snowflake. Existing procedure or function declarations will be commented out.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE MY_PACKAGE AS
  PROCEDURE MY_PROCEDURE(PARAM1 VARCHAR2);
  FUNCTION MY_FUNCTION(PARAM1 VARCHAR2) RETURN NUMBER ;
END MY_PACKAGE;
```

##### Transformation with option 1 (Using new schema)

```sql
CREATE SCHEMA IF NOT EXISTS MY_PACKAGE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

> **Note:**
>
> Note that that for option 1, the `PROCEDURE` definition in the package is removed since it is not required by Snowflake.

#### Variables declaration

> **Note:**
>
> You might also be interested in [variables helper.](helpers.md)

Oracle package variables are transformed into Snowflake Session Variables. A prefix is added to the values to know what type it is inside stored procedures. If the value should be null, a “~” is added. Because of this, variables that depend on other variables will require a SUBSTR and a CAST.

##### Data type and Code mappings

| Data type or value | Code |
| --- | --- |
| Numeric types | # |
| Datetime types | & |
| String types | $ |
| NULL values | ~ |

The transformation of the variables will be always the same regardless of the transformation option.

###### Oracle

```sql
CREATE OR REPLACE PACKAGE PACKAGE_VARIABLES AS
    VAR1 integer := 333;
    VAR2 INTEGER := VAR1 + 456;
	  VAR3 DATE := CURRENT_DATE;
	  VAR4 VARCHAR(20) := 'HELLO WORLD';
	  VAR5 INTEGER;
END;
```

###### Snowflake

```sql
CREATE SCHEMA IF NOT EXISTS PACKAGE_VARIABLES
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

SET "PACKAGE_VARIABLES.VAR1" = '' || (333);

SET "PACKAGE_VARIABLES.VAR2" = (SELECT
	'' || (GETVARIABLE('PACKAGE_VARIABLES.VAR1') :: INTEGER + 456));

SET "PACKAGE_VARIABLES.VAR3" = (SELECT
	'' || (CURRENT_DATE()));

SET "PACKAGE_VARIABLES.VAR4" = '' || ('HELLO WORLD');

SET "PACKAGE_VARIABLES.VAR5" = '~';
```

#### Constants declaration

Constants declaration will be declared inside the procedure or functions that use them. Existing package constants declaration will be commented out and a warning will be added.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE PACKAGE_CONSTANTS
IS
const_name CONSTANT VARCHAR(10) := 'Snow';
PROCEDURE PROCEDURE1;
END PACKAGE_CONSTANTS;

CREATE OR REPLACE PACKAGE BODY PACKAGE_CONSTANTS
IS
PROCEDURE MY_PROCEDURE IS
   BEGIN
      INSERT INTO DBUSER ("USER_NAME")
      VALUES (const_name);
   END;

END PACKAGE_CONSTANTS;
```

**Transformation with option 1**

```sql
CREATE SCHEMA IF NOT EXISTS PACKAGE_CONSTANTS
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE PACKAGE_CONSTANTS.MY_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      CONST_NAME VARCHAR(10) := 'Snow';
   BEGIN
      INSERT INTO DBUSER("USER_NAME")
      VALUES (:CONST_NAME);
   END;
$$;
```

> **Note:**
>
> Note that the `PROCEDURE` definition in the package is removed since it is not required by Snowflake.

#### Other Package members

The transformation for other package members like cursors, exceptions and user defined types, is still a work in progress.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE MY_PACKAGE_EX AS
    an_exception EXCEPTION;
END MY_PACKAGE_EX;
```

##### Transformation with option 1

```sql
CREATE SCHEMA IF NOT EXISTS MY_PACKAGE_EX
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

!!!RESOLVE EWI!!! /*** SSC-EWI-OR0049 - PACKAGE EXCEPTIONS in stateful package MY_PACKAGE_EX are not supported yet ***/!!!
an_exception EXCEPTION;
```

### Package Body Definition

This section shows the equivalence between Oracle Package Body Definition members and Snowflake statements.

#### Create Package Body

Elements inside a Package Body are going to be extracted from the package. The package body will disappear so the Create Package Body statement is removed in the converted code.

#### Procedure Definition

Stored Procedures inside packages use the same transformations defined in the PL/SQL Translation Reference.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE BODY PACKAGE_PROCEDURE
IS
PROCEDURE MY_PROCEDURE (MY_PARAM VARCHAR) IS
   BEGIN
      null;
   END;

END PACKAGE_PROCEDURE;
```

##### Transformation with option 1

```sql
CREATE OR REPLACE PROCEDURE PACKAGE_PROCEDURE.MY_PROCEDURE (MY_PARAM STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

   null;
$$;
```

##### Transformation with option 2

```sql
CREATE OR REPLACE PROCEDURE PACKAGE_PROCEDURE_MY_PROCEDURE (MY_PARAM STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   // REGION SnowConvert AI Helpers Code
   null;
$$;
```

#### Function Definition

Functions inside package bodies are converted into Snowflake stored procedures.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE BODY PACKAGE_FUNCTION
IS
FUNCTION MY_FUNCTION (MY_PARAM VARCHAR) RETURN NUMBER
AS
   BEGIN
      null;
   END;
END PACKAGE_FUNCTION;
```

##### Transformation with option 1

```sql
CREATE OR REPLACE FUNCTION PACKAGE_FUNCTION.MY_FUNCTION (MY_PARAM STRING)
RETURNS FLOAT
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
$$
   // SnowConvert AI Helpers Code section is omitted.
   null;
$$;
```

##### Transformation with option 2

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE FUNCTION PACKAGE_FUNCTION_MY_FUNCTION (MY_PARAM STRING)
RETURNS NUMBER
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
$$
   // REGION SnowConvert AI Helpers Code
   null;
$$;
```

#### Other package body members

Please refer to the “other package members” section in Package declaration.

### Using package members

#### Call of procedures inside packages

If the procedure is inside a package and the package is inside a schema, the call will be renamed.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE02(param1 NUMBER, param2 VARCHAR)
IS
BEGIN
    SCHEMA1.PACKAGE1.PROCEDURE01(param1, param2);
END;

CALL SCHEMA1.PACKAGE1.PROCEDURE01(param1, param2);
```

##### Transformation with option 1

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE02 (param1 FLOAT, param2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    EXEC(`CALL
SCHEMA1.PACKAGE1.PROCEDURE01(?, ?)`,[PARAM1,PARAM2]);
$$;

CALL SCHEMA1.PACKAGE1.PROCEDURE01(param1, param2);
```

##### Transformation with option 2

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

With this option, the call of the procedures will be renamed accordingly to the rename of the procedure declaration. The schema name will be separated from the procedure name with a dot.

###### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PUBLIC.PROCEDURE02 (param1 FLOAT, param2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   // REGION SnowConvert AI Helpers Code
   EXEC(`CALL SCHEMA1.PACKAGE1_PROCEDURE01(?, ?)`,[PARAM1,PARAM2]);
$$;

CALL SCHEMA1.PACKAGE1_PROCEDURE01(param1, param2);
```

#### Package variables inside procedures

> **Note:**
>
> Packages variables are transformed to session variables. Those variables are usable through the “[Package variables helper](helpers.md)”.

> **Note:**
>
> This sample is using variables declared in packages Variables declaration section.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE BODY PACKAGE_VARIABLES AS
  PROCEDURE P1 AS
    BEGIN
			VAR1 := VAR1 + 888;
			INSERT INTO TABLE1 values (VAR1);
         INSERT INTO TABLE2 values (VAR4);
    END;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PACKAGE_VARIABLES.P1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	VAR1 =
			!!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT VAR1 MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
			VAR1 + 888;
	EXEC(`INSERT INTO TABLE1
			values (VAR1)`);
	EXEC(`INSERT INTO TABLE2
         values (VAR4)`);
$$;
```

### Known Issues

No issues were found.

#### Related EWIs

1. [SSC-EWI-0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Object may not work.
2. [SSC-EWI-OR0049](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Package constants in stateful package are not supported yet.

## Procedures

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

**Example 1:** Basic Procedure Conversion

### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
BEGIN
null;
END;
```

### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    null;
$$;
```

**Example 2:** Procedure Conversion with basic statements: Declaration, Assignment, Cursor Declaration, FOR Cursor, Open, LOOP, CLOSE, IF,

### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
(
  param1 NUMBER
)
IS
  localVar1 NUMBER;
  countRows NUMBER;
  tempSql VARCHAR(100);
  tempResult NUMBER;
  CURSOR MyCursor
    IS
       SELECT COL1 FROM Table1;

BEGIN
    localVar1 := param1;
    countRows := 0;
    tempSql := 'SELECT COUNT(*) FROM Table1 WHERE COL1 =' || localVar1;

    FOR myCursorItem IN MyCursor
        LOOP
            localVar1 := myCursorItem.Col1;
            countRows := countRows + 1;
        END LOOP;
    INSERT INTO Table2 VALUES(countRows, 'ForCursor: Total Row count is: ' || countRows);
    countRows := 0;

    OPEN MyCursor;
    LOOP
        FETCH MyCursor INTO tempResult;
        EXIT WHEN MyCursor%NOTFOUND;
        countRows := countRows + 1;
    END LOOP;
    CLOSE MyCursor;
    INSERT INTO Table2 VALUES(countRows, 'LOOP: Total Row count is: ' || countRows);

    EXECUTE IMMEDIATE tempSql INTO tempResult;
    IF tempResult > 0 THEN
        INSERT INTO Table2 (COL1, COL2) VALUES(tempResult, 'Hi, found value:' || localVar1 || ' in Table1 -- There are ' || tempResult || ' rows');
        COMMIT;
    END IF;
END PROC1;
```

### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1
(param1 FLOAT
)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // REGION SnowConvert AI Helpers Code
  var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
  var fixBind = function (arg) {
    arg = arg instanceof Date ? formatDate(arg) : IS_NULL(arg) ? null : arg;
    return arg;
  };
  var SQL = {
    FOUND : false,
    NOTFOUND : false,
    ROWCOUNT : 0,
    ISOPEN : false
  };
  var _RS, _ROWS, SQLERRM = "normal, successful completion", SQLCODE = 0;
  var getObj = (_rs) => Object.assign(new Object(),_rs);
  var getRow = (_rs) => (values = Object.values(_rs)) && (values = values.splice(-1 * _rs.getColumnCount())) && values;
  var fetch = (_RS,_ROWS,fmode) => _RS.getRowCount() && _ROWS.next() && (fmode ? getObj : getRow)(_ROWS) || (fmode ? new Object() : []);
  var EXEC = function (stmt,binds,opts) {
    try {
      binds = !(arguments[1] instanceof Array) && ((opts = arguments[1]) && []) || (binds || []);
      opts = opts || new Object();
      binds = binds ? binds.map(fixBind) : binds;
      _RS = snowflake.createStatement({
          sqlText : stmt,
          binds : binds
        });
      _ROWS = _RS.execute();
      if (opts.sql !== 0) {
        var isSelect = stmt.toUpperCase().trimStart().startsWith("SELECT");
        var affectedRows = isSelect ? _RS.getRowCount() : _RS.getNumRowsAffected();
        SQL.FOUND = affectedRows != 0;
        SQL.NOTFOUND = affectedRows == 0;
        SQL.ROWCOUNT = affectedRows;
      }
      if (opts.row === 2) {
        return _ROWS;
      }
      var INTO = function (opts) {
        if (opts.vars == 1 && _RS.getColumnCount() == 1 && _ROWS.next()) {
          return _ROWS.getColumnValue(1);
        }
        if (opts.rec instanceof Object && _ROWS.next()) {
          var recordKeys = Object.keys(opts.rec);
          Object.assign(opts.rec,Object.fromEntries(new Map(getRow(_ROWS).map((element,Index) => [recordKeys[Index],element]))))
          return opts.rec;
        }
        return fetch(_RS,_ROWS,opts.row);
      };
      var BULK_INTO_COLLECTION = function (into) {
        for(let i = 0;i < _RS.getRowCount();i++) {
          FETCH_INTO_COLLECTIONS(into,fetch(_RS,_ROWS,opts.row));
        }
        return into;
      };
      if (_ROWS.getRowCount() > 0) {
        return _ROWS.getRowCount() == 1 ? INTO(opts) : BULK_INTO_COLLECTION(opts);
      }
    } catch(error) {
      RAISE(error.code,error.name,error.message)
    }
  };
  var RAISE = function (code,name,message) {
    message === undefined && ([name,message] = [message,name])
    var error = new Error(message);
    error.name = name
    SQLERRM = `${(SQLCODE = (error.code = code))}: ${message}`
    throw error;
  };
  var FETCH_INTO_COLLECTIONS = function (collections,fetchValues) {
    for(let i = 0;i < collections.length;i++) {
      collections[i].push(fetchValues[i]);
    }
  };
  var IS_NULL = (arg) => !(arg || arg === 0);
  var CURSOR = function (stmt,binds,isRefCursor,isOut) {
    var statementObj, result_set, total_rows, ISOPEN = false, result_set_table = '', self = this, row_count, found;
    this.CURRENT = new Object;
    this.INTO = function () {
        return self.res;
      };
    this.OPEN = function (openParameters) {
        if (ISOPEN && !isRefCursor) RAISE(-6511,"CURSOR_ALREADY_OPEN","cursor already open");
        var finalStmt = openParameters && openParameters.query || stmt;
        var parameters = openParameters && openParameters.binds || [];
        var finalBinds = binds instanceof Function ? binds(...parameters) : binds;
        finalBinds = finalBinds || parameters;
        try {
          if (isOut) {
            if (!temptable_prefix) {
              temptable_prefix = `${procname}_TEMP_${(EXEC(`select current_session() || '_' || to_varchar(current_timestamp, 'yyyymmddhh24missss')`,{
                  sql : 0
                }))[0]}_`;
            }
            if (!result_set_table) {
              result_set_table = temptable_prefix + outCursorResultNumber++;
              EXEC(`CREATE OR REPLACE TEMPORARY TABLE ${result_set_table} AS ${finalStmt}`,{
                sql : 0
              });
            }
            finalStmt = "SELECT * FROM " + result_set_table
          }
          [result_set,statementObj,total_rows] = [EXEC(finalStmt,finalBinds,{
              sql : 0,
              row : 2
            }),_RS,_RS.getColumnCount()]
          ISOPEN = true;
          row_count = 0;
        } catch(error) {
          RAISE(error.code,"error",error.message);
        }
        return this;
      };
    this.NEXT = function () {
        if (total_rows && result_set.next()) {
          this.CURRENT = new Object;
          for(let i = 1;i <= statementObj.getColumnCount();i++) {
            (this.CURRENT)[statementObj.getColumnName(i)] = result_set.getColumnValue(i);
          }
          return true;
        } else return false;
      };
    this.FETCH = function (record) {
        var recordKeys = record ? Object.keys(record) : undefined;
        self.res = [];
        if (!ISOPEN) RAISE(-1001,"INVALID_CURSOR","invalid cursor");
        if (recordKeys && recordKeys.length != statementObj.getColumnCount()) RAISE(-6504,"ROWTYPE_MISMATCH","Return types of Result Set variables or query do not match");
        self.res = fetch(statementObj,result_set);
        if (self.res && self.res.length > 0) {
          found = true;
          row_count++;
          if (recordKeys) {
            for(let i = 0;i < self.res.length;i++) {
              record[recordKeys[i]] = (self.res)[i];
            }
            return false;
          }
          return true;
        } else found = false;
        return false;
      };
    this.CLOSE = function () {
        if (!ISOPEN) RAISE(-1001,"INVALID_CURSOR","invalid cursor");
        found = row_count = result_set_table = total_rows = result_set = statementObj = undefined;
        ISOPEN = false;
      };
    this.FETCH_BULK_COLLECT_INTO = function (variables,limit) {
        if (variables.length != statementObj.getColumnCount()) RAISE(-6504,"ROWTYPE_MISMATCH","Return types of Result Set variables or query do not match");
        if (limit) {
          for(let i = 0;i < limit && this.FETCH();i++)FETCH_INTO_COLLECTIONS(variables,self.res);
        } else {
          while ( this.FETCH() )
            FETCH_INTO_COLLECTIONS(variables,self.res);
        }
      };
    this.FOUND = () => ISOPEN ? typeof(found) == "boolean" ? found : null : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
    this.NOTFOUND = () => ISOPEN ? typeof(found) == "boolean" ? !found : null : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
    this.ROWCOUNT = () => ISOPEN ? row_count : RAISE(-1001,"INVALID_CURSOR","invalid cursor");
    this.ISOPEN = () => ISOPEN;
    this.SAVE_STATE = function () {
        return {
          tempTable : result_set_table,
          position : row_count
        };
      };
    this.RESTORE_STATE = function (tempTable,position) {
        result_set_table = tempTable
        if (result_set_table) {
          isOut = true
          this.OPEN();
          for(let i = 0;i < position;i++)this.FETCH();
        }
      };
    this.ROWTYPE = () => ROWTYPE(stmt,binds());
  };
  var outCursorResultNumber = 0;
  var concatValue = (arg) => IS_NULL(arg) ? "" : arg;
  // END REGION

  let LOCALVAR1;
  let COUNTROWS;
  let TEMPSQL;
  let TEMPRESULT;
  let MYCURSOR = new CURSOR(`SELECT COL1 FROM
          Table1`,() => []);
  LOCALVAR1 = PARAM1;
  COUNTROWS = 0;
  TEMPSQL = `SELECT COUNT(*) FROM
   Table1
WHERE COL1 =${concatValue(LOCALVAR1)}`;
  MYCURSOR.OPEN();
  while ( MYCURSOR.NEXT() ) {
    let MYCURSORITEM = MYCURSOR.CURRENT;
    LOCALVAR1 = MYCURSORITEM.COL1;
    COUNTROWS = COUNTROWS + 1;
  }
  MYCURSOR.CLOSE();
  EXEC(`INSERT INTO Table2
    VALUES(?, 'ForCursor: Total Row count is: ' || NVL(? :: STRING, ''))`,[COUNTROWS,COUNTROWS]);
  COUNTROWS = 0;
  MYCURSOR.OPEN();
  while ( true ) {
    MYCURSOR.FETCH(TEMPRESULT) && ([TEMPRESULT] = MYCURSOR.INTO());
    if (MYCURSOR.NOTFOUND()) {
      break;
    }
    COUNTROWS = COUNTROWS + 1;
  }
  MYCURSOR.CLOSE();
  EXEC(`INSERT INTO Table2
    VALUES(?, 'LOOP: Total Row count is: ' || NVL(? :: STRING, ''))`,[COUNTROWS,COUNTROWS]);
  [TEMPRESULT] = EXEC(TEMPSQL);
  if (TEMPRESULT > 0) {
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`INSERT INTO Table2(COL1, COL2) VALUES(?, 'Hi, found value:' || NVL(? :: STRING, '') || ' in Table1 -- There are ' || NVL(? :: STRING, '') || ' rows')`,[TEMPRESULT,LOCALVAR1,TEMPRESULT]);
    EXEC(`--** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
COMMIT;`);
  }
$$;
```

#### Call of procedures inside other procedure

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE01(param1 NUMBER, param2 VARCHAR)
IS
BEGIN
INSERT INTO TABLE1 VALUES(param1, param2);
END;

CREATE OR REPLACE PROCEDURE PROCEDURE02(param1 NUMBER, param2 VARCHAR)
IS
BEGIN
PROCEDURE01(param1, param2);
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE01 (param1 FLOAT, param2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	EXEC(`INSERT INTO TABLE1
	VALUES(?, ?)`,[PARAM1,PARAM2]);
$$;

CREATE OR REPLACE PROCEDURE PROCEDURE02 (param1 FLOAT, param2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	EXEC(`CALL
	PROCEDURE01(?, ?)`,[PARAM1,PARAM2]);
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0022](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in this statement were considered parameters by default.
2. [SSC-FDM-OR0012](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): COMMIT and ROLLBACK statements require adequate setup to perform as intended.

## SQL Language Elements

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Cursor FOR LOOP

> **Note:**
>
> You might also be interested in [Cursor helper](helpers.md) and Cursor declaration.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
    MyVariable1 NUMBER;
    MyOtherVariable2 NUMBER := 1;
    CURSOR C1 IS
        SELECT * FROM Table1 WHERE ID = 123;
    CURSOR C2 (paramCursor1 NUMBER) IS
        SELECT COL1 AS C_1 FROM TABLE1 WHERE ID = paramCursor1;
BEGIN
    FOR myCursorRecord IN C1
        LOOP
            MyVariable1 := myCursorRecord.Col1;
        END LOOP;

    FOR myCursorRecord IN (SELECT * FROM Table1 WHERE ID = MyVariable1)
        LOOP
            MyVariable1 := myCursorRecord.Col1;
        END LOOP;

    <<Block1>>
    FOR myCursorRecord IN C2 (MyOtherVariable2)
        LOOP
            MyVariable1 := myCursorRecord.Col1;
        END LOOP Block1;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let MYVARIABLE1;
    let MYOTHERVARIABLE2 = 1;
    let C1 = new CURSOR(`SELECT * FROM
           Table1
        WHERE ID = 123`,() => []);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    let C2 = new CURSOR(`SELECT COL1 AS C_1 FROM
           TABLE1
        WHERE ID = ?`,(PARAMCURSOR1) => [PARAMCURSOR1]);
    C1.OPEN();
    while ( C1.NEXT() ) {
        let MYCURSORRECORD = C1.CURRENT;
        MYVARIABLE1 = MYCURSORRECORD.COL1;
    }
    C1.CLOSE();
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    for(var MYCURSORRECORD_CURSOR = new CURSOR(`(SELECT * FROM
      Table1
   WHERE ID = ?
)`,[MYVARIABLE1]).OPEN();MYCURSORRECORD_CURSOR.NEXT();) {
        let MYCURSORRECORD = MYCURSORRECORD_CURSOR.CURRENT;
        MYVARIABLE1 = MYCURSORRECORD.COL1;
    }
    MYCURSORRECORD_CURSOR.CLOSE();
    C2.OPEN({
        binds : [MYOTHERVARIABLE2]
    });
    while ( C2.NEXT() ) {
        let BLOCK1 = C2.CURRENT;
        MYVARIABLE1 = MYCURSORRECORD.COL1;
    }
    C2.CLOSE();
$$;
```

### OPEN, FETCH and CLOSE Statement

> **Note:**
>
> You might also be interested in [Cursor helper](helpers.md) and Cursor declaration.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC2
IS
    col1Value   table1.COL1%TYPE;
    col2Value   table1.COL2%TYPE;
    entireRow   table1%ROWTYPE;
    TYPE MyRowType IS RECORD ( COLUMN1 NUMBER, COLUMN2 NUMBER);
    entireRow_1 MyRowType;
    CURSOR C1 IS  SELECT * FROM table1;
    C2 SYS_REFCURSOR;
    TYPE COLLECTION_TYPE IS TABLE OF TABLE1.COL1%TYPE;
    MY_COLLECTION MY_COLLECTION_TYPE := MY_COLLECTION_TYPE();
    SOME_SELECT VARCHAR(200);
BEGIN
    OPEN C1;
    FETCH C1 INTO col1Value, col2Value;
    CLOSE C1;

    OPEN C1;
    FETCH C1 INTO entireRow;
    CLOSE C1;

    OPEN C1;
    FETCH C1 INTO entireRow_1;
    CLOSE C1;

    OPEN C2 FOR 'SELECT COL1 FROM TABLE1 WHERE COL1 <> :v' USING 123;
    FETCH C2 BULK COLLECT INTO MY_COLLECTION LIMIT 2;
    CLOSE C2;

    OPEN C2 FOR SELECT * FROM TABLE1 WHERE COL1 = NUM1;
    CLOSE C2;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let COL1VALUE;
    let COL2VALUE;
    let ENTIREROW = ROWTYPE(`table1`);
    class MYROWTYPE {
        COLUMN1
        COLUMN2
        constructor() {
            [...arguments].map((element,Index) => this[(Object.keys(this))[Index]] = element)
        }
    }
    let ENTIREROW_1 = new MYROWTYPE();
    let C1 = new CURSOR(`SELECT * FROM
   table1`,() => []);
    let C2 = new CURSOR(undefined,undefined,true);
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0072 - PROCEDURAL MEMBER TYPE DEFINITION NOT SUPPORTED. ***/!!!
    /*     TYPE COLLECTION_TYPE IS TABLE OF TABLE1.COL1%TYPE */
    ;
    let MY_COLLECTION = new MY_COLLECTION_TYPE();
    let SOME_SELECT;
    C1.OPEN();
    C1.FETCH(COL1VALUE,COL2VALUE) && ([COL1VALUE,COL2VALUE] = C1.INTO());
    C1.CLOSE();
    C1.OPEN();
    C1.FETCH(ENTIREROW) && ([ENTIREROW] = C1.INTO());
    C1.CLOSE();
    C1.OPEN();
    C1.FETCH(ENTIREROW_1) && ([ENTIREROW_1] = C1.INTO());
    C1.CLOSE();
    C2.OPEN({
        query : `SELECT COL1 FROM
   TABLE1
WHERE COL1 <> ?`,
        binds : [123]
    });
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
    /*     FETCH C2 BULK COLLECT INTO MY_COLLECTION LIMIT 2 */
    ;
    C2.CLOSE();
    C2.OPEN({
        query : `SELECT * FROM
   TABLE1
WHERE COL1 = NUM1`
    });
    C2.CLOSE();
$$;
```

> **Warning:**
>
> Transformation for the following lines corresponds to custom types, which are work in progress:
>
> ```sql
> entireRow   table1%ROWTYPE; // ROW TYPES
> TYPE COLLECTION_TYPE IS TABLE OF TABLE1.COL1%TYPE; // COLLECTIONS
> ```
>
> Currently the next statement is being emitted but the class is not being created yet. A warning will be applied in the future to all the uses of the unsupported custom types.
>
> ```javascript
> let MY_COLLECTION = new MY_COLLECTION_TYPE();
> ```

### SQL Implicit Cursor

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE SP_IMPLICIT_CURSOR_SAMPLE AUTHID DEFINER IS
  VAR_AUX  NUMBER(3);
  STMT_STAT1  NUMBER(3):= 0;
  STMT_STAT2  NUMBER(3):= 0;
  STMT_STAT3  NUMBER(3):= 0;
BEGIN
  EXECUTE IMMEDIATE 'CREATE TABLE FTABLE35(COL1 NUMBER(3))';
  IF SQL%FOUND THEN
    STMT_STAT1 := 1;
  END IF;
  IF SQL%NOTFOUND THEN
   STMT_STAT2 := 1;
  END IF;
  IF SQL%ISOPEN THEN
   STMT_STAT3 := 1;
  END IF;
  EXECUTE IMMEDIATE 'INSERT INTO FTABLE33 VALUES(:D1,:D2,:D3,:D4)' USING SQL%ROWCOUNT, STMT_STAT1, STMT_STAT2, STMT_STAT3;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE SP_IMPLICIT_CURSOR_SAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PlInvokerRightsClause' NODE ***/!!!
  //AUTHID DEFINER
  null
  // SnowConvert AI Helpers Code section is omitted.

  let VAR_AUX;
  let STMT_STAT1 = 0;
  let STMT_STAT2 = 0;
  let STMT_STAT3 = 0;
  EXEC(`CREATE OR REPLACE TABLE FTABLE35 (COL1 NUMBER(3)
)`);
  if (SQL.FOUND) {
    STMT_STAT1 = 1;
  }
  if (SQL.NOTFOUND) {
    STMT_STAT2 = 1;
  }
  if (SQL.ISOPEN) {
    STMT_STAT3 = 1;
  }
  EXEC(`INSERT INTO FTABLE33
VALUES(?, ?, ?, ?)`,[SQL.ROWCOUNT /*** SSC-FDM-OR0009 - SQL IMPLICIT CURSOR VALUES MAY DIFFER ***/,STMT_STAT1,STMT_STAT2,STMT_STAT3]);
$$;
```

### EXIT

> **Note:**
>
> You might also be interested in Loop and while statements.

> **Warning:**
>
> Transformation for labels is a work in progress.

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE1
IS
  i NUMBER := 0;
  j NUMBER := 0;
  k NUMBER := 0;
BEGIN
  <<loop_a>>
  LOOP
    i := i + 1;

    <<loop_b>>
    LOOP
      j := j + 1;

      <<loop_c>>
      LOOP
        k := k + j + i;
        EXIT;
      END LOOP loop_c;

      EXIT loop_b WHEN (j > 3);
    END LOOP loop_b;

    EXIT loop_a WHEN (i > 3);
  END LOOP loop_a;

END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let I = 0;
  let J = 0;
  let K = 0;
  while ( true ) {
    I = I + 1;
    while ( true ) {
      J = J + 1;
      while ( true ) {
        K = K + J + I;
        break;
      }
      !!!RESOLVE EWI!!! /*** SSC-EWI-OR0075 - LABELS IN STATEMENTS ARE NOT SUPPORTED. ***/!!!
      /*
            EXIT loop_b WHEN (j > 3) */
      ;
    }
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0075 - LABELS IN STATEMENTS ARE NOT SUPPORTED. ***/!!!
    /*
        EXIT loop_a WHEN (i > 3) */
    ;
  }
$$;
```

### Execute Immediate

> **Note:**
>
> You might also be interested in [EXEC helper](helpers.md)

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE sp_sample5 AS
   sql_stmt    VARCHAR2(200);
   plsql_block VARCHAR2(500);
   emp_id      NUMBER(4) := 7566;
   dept_id     NUMBER(2) := 20;
   dept_id2     NUMBER(2) := 12;
   dept_id_upd VARCHAR(14);
   dept_name   VARCHAR2(14) := 'PERSONNEL';
   location    VARCHAR2(13) := 'DALLAS';
   dept_rec     deptt%ROWTYPE;
   TYPE NumList IS TABLE OF NUMBER;
   sals   NumList;
BEGIN
   EXECUTE IMMEDIATE 'CREATE TABLE dept (id NUMBER, name varchar(14), location varchar2(13))';
   sql_stmt := 'INSERT INTO dept VALUES (:1, :2, :3)';
   EXECUTE IMMEDIATE sql_stmt USING dept_id, dept_name, location;
   sql_stmt := 'SELECT * FROM dept WHERE id = :idd';
   EXECUTE IMMEDIATE sql_stmt INTO dept_rec USING dept_id;
   sql_stmt := 'UPDATE dept SET id = 200 WHERE id = :1 RETURNING name INTO :2';
   EXECUTE IMMEDIATE sql_stmt USING dept_id RETURNING INTO dept_id_upd;
   sql_stmt := 'delete from dept where id = :1 RETURNING name INTO :2';
   EXECUTE IMMEDIATE sql_stmt USING dept_id RETURNING INTO dept_id_upd;
   EXECUTE IMMEDIATE 'INSERT INTO dept VALUES (12, ''NAME1'', ''TEXAS'')';
   EXECUTE IMMEDIATE 'INSERT INTO DEPT VALUES(13, ''' || dept_name || ''', ''LA'')';
   EXECUTE IMMEDIATE 'DELETE FROM dept WHERE id = :num' USING dept_id2;
   EXECUTE IMMEDIATE 'ALTER SESSION SET NLS_DATE_FORMAT = ''DD-MM-YYYY''';
   EXECUTE IMMEDIATE 'SELECT id FROM dept' BULK COLLECT INTO sals;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE sp_sample5 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

   let SQL_STMT;
   let PLSQL_BLOCK;
   let EMP_ID = 7566;
   let DEPT_ID = 20;
   let DEPT_ID2 = 12;
   let DEPT_ID_UPD;
   let DEPT_NAME = `PERSONNEL`;
   let LOCATION = `DALLAS`;
   let DEPT_REC = ROWTYPE(`deptt`);
   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0072 - PROCEDURAL MEMBER TYPE DEFINITION NOT SUPPORTED. ***/!!!
   /*    TYPE NumList IS TABLE OF NUMBER */
   ;
   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
   /*    sals   NumList */
   ;
   EXEC(`CREATE OR REPLACE TABLE dept (id NUMBER(38, 18),
   name varchar(14),
   location VARCHAR(13))`);
   SQL_STMT = `INSERT INTO dept
VALUES (?, ?, ?)`;
   EXEC(SQL_STMT,[DEPT_ID,DEPT_NAME,LOCATION]);
   SQL_STMT = `SELECT * FROM
   dept
WHERE id = ?`;
   EXEC(SQL_STMT,[DEPT_ID],{
      rec : dept_rec
   });
   SQL_STMT = `UPDATE dept
   SET id = 200 WHERE id = ?
   RETURNING name INTO :2`;
   !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'THIS EXECUTE IMMEDIATE CASE' NODE ***/!!!
   /*    EXECUTE IMMEDIATE sql_stmt USING dept_id RETURNING INTO dept_id_upd */
   ;
   SQL_STMT = `delete FROM
   dept
where id = ?
RETURNING name INTO :2`;
   !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'THIS EXECUTE IMMEDIATE CASE' NODE ***/!!!
   /*    EXECUTE IMMEDIATE sql_stmt USING dept_id RETURNING INTO dept_id_upd */
   ;
   EXEC(`INSERT INTO dept
VALUES (12, 'NAME1', 'TEXAS')`);
   EXEC(`INSERT INTO DEPT
VALUES(13, '${concatValue(DEPT_NAME)}', 'LA')`);
   EXEC(`DELETE FROM
   dept
WHERE id = ?`,[DEPT_ID2]);
   EXEC(`ALTER SESSION SET DATE_INPUT_FORMAT = 'DD-MM-YYYY' DATE_OUTPUT_FORMAT = 'DD-MM-YYYY'`);
   !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'THIS EXECUTE IMMEDIATE CASE' NODE ***/!!!
   /*    EXECUTE IMMEDIATE 'SELECT id FROM dept' BULK COLLECT INTO sals */
   ;
$$;
```

> **Warning:**
>
> Since the “RETURNING INTO” clause requires special analysis of the statement executed, its translation is planned to be delivered in the future.

> **Warning:**
>
> Transformation for the following line corresponds to collection types, which is work in progress:
>
> ```sql
> TYPE NumList IS TABLE OF NUMBER;
> ```
>
> Currently the next statement is being emitted but the class is not being created yet. A warning will be applied in the future to all the uses of the unsupported custom types.
>
> ```javascript
> let SALS = new NUMLIST();
> ```
>
> Also the following `EXECUTE IMMEDIATE` related with the `BULK COLLECT` into the `sals` variable, is also work in progress.
>
> ```sql
> EXECUTE IMMEDIATE 'SELECT id FROM dept' BULK COLLECT INTO sals;
> ```

### Errors and Exception Handling

> **Note:**
>
> You might also be interested in [Raise helper](helpers.md)

#### Raise Helper Usage

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE HANDLERS_WITH_OTHERS_COMMENTS AUTHID DEFINER IS
  deadlock_detected EXCEPTION;
  deadlock_dex EXCEPTION;
  PRAGMA EXCEPTION_INIT(deadlock_detected, -60);
  PRAGMA EXCEPTION_INIT(deadlock_dex, -63);
BEGIN

  IF true THEN
    RAISE NO_DATA_FOUND;
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20010, SQLERRM);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20000, SQLERRM, PARM);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20000, SQLERRM, TRUE);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20000, SQLERRM, FALSE);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20000, 'CUSTOM ERROR MESSAGE', TRUE);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20010, 'SECOND CUSTOM ERROR MESSAGE', TRUE);
  END IF;
  IF TRUE THEN
    RAISE_APPLICATION_ERROR(-20010, 'OTHER CUSTOM ERROR MESSAGE', FALSE);
  END IF;

EXCEPTION
    WHEN EXC_NAME THEN
        --Handle Exc_name  found exception
        null;
    WHEN NO_DATA_FOUND THEN
        --Handle No data found exception
        null;
    WHEN OTHERS THEN
        --Handler for others exception
        null;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE HANDLERS_WITH_OTHERS_COMMENTS ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PlInvokerRightsClause' NODE ***/!!!
  //AUTHID DEFINER
  null
  // SnowConvert AI Helpers Code section is omitted.

  try {
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0052 - EXCEPTION DECLARATION IS HANDLED BY RAISE FUNCTION ***/!!!
    /*   deadlock_detected EXCEPTION */
    ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0052 - EXCEPTION DECLARATION IS HANDLED BY RAISE FUNCTION ***/!!!
    /*   deadlock_dex EXCEPTION */
    ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
    /*   PRAGMA EXCEPTION_INIT(deadlock_detected, -60) */
    ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
    /*   PRAGMA EXCEPTION_INIT(deadlock_dex, -63) */
    ;
    if (true) {
      RAISE(100,`NO_DATA_FOUND`,`Single row SELECT returned no rows or your program referenced a deleted element in a nested table or an uninitialized element in an associative array (index-by table).`);
    }
    if (true) {
      RAISE(-20010,SQLERRM);
    }
    if (true) {
      // ** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT PARM WAS REMOVED. **
      RAISE(-20000,SQLERRM);
    }
    if (true) {
      // ** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT TRUE WAS REMOVED. **
      RAISE(-20000,SQLERRM);
    }
    if (true) {
      RAISE(-20000,SQLERRM);
    }
    if (true) {
      // ** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT TRUE WAS REMOVED. **
      RAISE(-20000,`CUSTOM ERROR MESSAGE`);
    }
    if (true) {
      // ** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT TRUE WAS REMOVED. **
      RAISE(-20010,`SECOND CUSTOM ERROR MESSAGE`);
    }
    if (true) {
      RAISE(-20010,`OTHER CUSTOM ERROR MESSAGE`);
    }
  } catch(error) {
    switch(error.name) {
      case `EXC_NAME`: {
        //Handle Exc_name  found exception
        null;
        break;
      }
      case `NO_DATA_FOUND`: {
        //Handle No data found exception
        null;
        break;
      }
      default: {
        //Handler for others exception
        null;
        break;
      }
    }
  }
$$;
```

When there is not OTHERS handler, SnowConvert AI uses the “default” case in the switch that throws the original Error Object.

#### Commit

> **Note:**
>
> You might also be interested in [EXEC helper](helpers.md)

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 NUMBER, param2 NUMBER)
IS
BEGIN
    INSERT INTO TABLE1 VALUES(param1, param2);
    COMMIT;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 FLOAT, param2 FLOAT)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    EXEC(`INSERT INTO TABLE1
    VALUES(?, ?)`,[PARAM1,PARAM2]);
    EXEC(`--** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
COMMIT;`);
$$;
```

#### CASE

##### Oracle

```sql
CREATE OR REPLACE EDITIONABLE PROCEDURE PROCEDURE2 ()
IS
  localVar1 NUMBER;
  localVar2 VARCHAR(100);
BEGIN
CASE (localVar1)
WHEN 1 THEN
    localVar2 := 'one';
WHEN 2 THEN
    localVar := 'two';
WHEN 3 THEN
    lovalVar := 'three';
ELSE
    localVar := 'error';
END CASE;

CASE
WHEN localVar = 1 THEN
    localVar2 := 'one';
WHEN localVar = 2 THEN
    localVar := 'two';
WHEN localVar = 3 THEN
    lovalVar := 'three';
ELSE
    localVar := 'error';
END CASE;
END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
--** SSC-FDM-OR0007 - SNOWFLAKE DOESN'T SUPPORT VERSIONING OF OBJECTS. DEVELOPERS SHOULD CONSIDER ALTERNATE APPROACHES FOR CODE VERSIONING. **
CREATE OR REPLACE PROCEDURE PROCEDURE2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  let LOCALVAR1;
  let LOCALVAR2;
  switch(LOCALVAR1) {
    case 1:LOCALVAR2 = `one`;
    break;
    case 2:LOCALVAR = `two`;
    break;
    case 3:LOVALVAR = `three`;
    break;
    default:LOCALVAR = `error`;
    break;
  }
  if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT localVar MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    LOCALVAR == 1) {
    LOCALVAR2 = `one`;
  } else if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT localVar MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    LOCALVAR == 2) {
    LOCALVAR = `two`;
  } else if (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT localVar MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
    LOCALVAR == 3) {
    LOVALVAR = `three`;
  } else {
    LOCALVAR = `error`;
  }
$$;
```

#### CASE in a variable assignment

##### Oracle

```sql
CREATE OR REPLACE EDITIONABLE PROCEDURE PROCEDURE2 ()
IS
  localVar1 NUMBER;
BEGIN
	var1 := CASE flag
	WHEN 1 THEN 'one'
	WHEN 2 THEN 'two'
	WHEN 3 THEN 'three'
	WHEN 4 THEN 'four'
	ELSE 'unknown' END;

END;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
--** SSC-FDM-OR0007 - SNOWFLAKE DOESN'T SUPPORT VERSIONING OF OBJECTS. DEVELOPERS SHOULD CONSIDER ALTERNATE APPROACHES FOR CODE VERSIONING. **
CREATE OR REPLACE PROCEDURE PROCEDURE2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let LOCALVAR1;
	VAR1 =
					!!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT flag MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
					FLAG == 1 && `one` || (
						!!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT flag MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
						FLAG == 2 && `two` || (
							!!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT flag MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
							FLAG == 3 && `three` || (
								!!!RESOLVE EWI!!! /*** SSC-EWI-0053 - OBJECT flag MAY NOT WORK PROPERLY, ITS DATATYPE WAS NOT RECOGNIZED ***/!!!
								FLAG == 4 && `four` || `unknown`)));
$$;
```

#### Call to external C or Java programs

##### Oracle

```sql
CREATE OR REPLACE EDITIONABLE PROCEDURE "OWB_REP_OWNER"."WB_RT_DP_CREATE_FKPARTITION" (prfID IN NUMBER,datatype IN VARCHAR2) AUTHID CURRENT_USER AS LANGUAGE JAVA NAME 'oracle.wh.service.impl.dataProfile.analysis.storedprocs.ForeignKey.createFKPartition(int,java.lang.String)';
```

##### Snowflake

```sql
----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE PROCEDURE IS OUT OF TRANSLATION SCOPE. **
--CREATE OR REPLACE EDITIONABLE PROCEDURE "OWB_REP_OWNER"."WB_RT_DP_CREATE_FKPARTITION" (prfID IN NUMBER,datatype IN VARCHAR2) AUTHID CURRENT_USER AS LANGUAGE JAVA NAME 'oracle.wh.service.impl.dataProfile.analysis.storedprocs.ForeignKey.createFKPartition(int,java.lang.String)'
                                                                                                                                                                                                                                                                                   ;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0022](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in a specific statement are considered parameters by default.
2. [SSC-EWI-0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Object may not work.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-EWI-OR0052](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Exception declaration is handled by the raise function.
5. [SSC-EWI-OR0072](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Procedural Member not supported.
6. [SSC-EWI-OR0075](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Current of clause is not supported in Snowflake.
7. [SSC-EWI-OR0104](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Unusable collection variable.
8. [SSC-FDM-OR0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Snowflake does not support the versioning of objects. Developers should consider alternate approaches for code versioning.
9. [SSC-FDM-OR0009](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): SQL IMPLICIT CURSOR VALUES MAY DIFFER.
10. [SSC-FDM-OR0011](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): The Boolean argument was removed because the “add to stack” options is not supported.
11. [SSC-FDM-OR0012:](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md) COMMIT and ROLLBACK statements require adequate setup to perform as intended.

## DDL - DML Statements

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> All statements use the [EXEC helper.](helpers.md)

### SELECT

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 VARCHAR)
IS
    VAR1 NUMBER := 789;
BEGIN
    SELECT * FROM TABLE01;
    SELECT DISTINCT COL1 FROM TABLE01;
    SELECT * FROM TABLE01 WHERE COL1 = VAR1;
    SELECT * FROM TABLE01 WHERE COL1 = PARAM1;
    SELECT * FROM TABLE01 WHERE COL1 = PARAM1 AND COL2 = VAR1;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1 = 789;
    EXEC(`SELECT * FROM
       TABLE01`);
    EXEC(`SELECT DISTINCT COL1 FROM
       TABLE01`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`SELECT * FROM
       TABLE01
    WHERE COL1 = ?`,[VAR1]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`SELECT * FROM
       TABLE01
    WHERE COL1 = ?`,[PARAM1]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`SELECT * FROM
       TABLE01
    WHERE COL1 = ?
       AND COL2 = ?`,[PARAM1,VAR1]);
$$;
```

### SELECT INTO

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 VARCHAR, param2 VARCHAR)
IS
    VAR1 NUMBER;
    VAR2 NUMBER;
BEGIN
    SELECT COL1 INTO VAR1 FROM TABLE01;
    SELECT COL1 INTO VAR1 FROM TABLE01 WHERE COL2 = PARAM1;
    SELECT COL1 INTO VAR1, VAR2 FROM TABLE01;
    SELECT COL1 INTO VAR1, VAR2 FROM TABLE01
        WHERE COL2 = param1 AND COL3 = param1;
END
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 STRING, param2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1;
    let VAR2;
    [VAR1] = EXEC(`SELECT
   COL1
FROM
   TABLE01`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    [VAR1] = EXEC(`SELECT
   COL1
FROM
   TABLE01
WHERE COL2 = ?`,[PARAM1]);
    [VAR1,VAR2] = EXEC(`SELECT
   COL1
FROM
   TABLE01`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    [VAR1,VAR2] = EXEC(`SELECT
   COL1
FROM
   TABLE01
       WHERE COL2 = ?
   AND COL3 = ?`,[PARAM1,PARAM1]);
$$;
```

### INSERT and INSERT INTO SELECT

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 VARCHAR)
IS
    var1 NUMBER := 789;
BEGIN
    INSERT INTO TABLE01 VALUES('name', 123);
    INSERT INTO TABLE01 VALUES(param1, 456);
    INSERT INTO TABLE01 VALUES(param1, var1);
    INSERT INTO TABLE01 (col1, col2)
    SELECT col1, col2 FROM TABLE02 tb2
    WHERE tb2.col1 = 'myName';
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (param1 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let VAR1 = 789;
    EXEC(`INSERT INTO TABLE01
    VALUES('name', 123)`);
    EXEC(`INSERT INTO TABLE01
    VALUES(?, 456)`,[PARAM1]);
    EXEC(`INSERT INTO TABLE01
    VALUES(?, ?)`,[PARAM1,VAR1]);
    EXEC(`INSERT INTO TABLE01(col1, col2)
    SELECT col1, col2 FROM
       TABLE02 tb2
    WHERE tb2.col1 = 'myName'`);
$$;
```

### DELETE

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1 (PARAM1 VARCHAR)
IS
    VAR1 NUMBER := 0;
BEGIN
    DELETE FROM TABLE1 WHERE COL2 = 1;
    DELETE FROM TABLE1 WHERE COL2 = VAR1;
    DELETE FROM TABLE1 WHERE COL1 = PARAM1;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (PARAM1 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

    let VAR1 = 0;
    EXEC(`DELETE FROM
       TABLE1
    WHERE COL2 = 1`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`DELETE FROM
       TABLE1
    WHERE COL2 = ?`,[VAR1]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`DELETE FROM
       TABLE1
    WHERE COL1 = ?`,[PARAM1]);
$$;
```

### UPDATE

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1(PARAM1 VARCHAR)
IS
    VAR1 NUMBER := 3;
BEGIN
    UPDATE TABLE1 SET COL2 = 1 where COL2 = 0;
    UPDATE TABLE1 SET COL1 = VAR1 where COL1 = 0;
    UPDATE TABLE1 SET COL1 = 'name' where COL1 = PARAM11;
    UPDATE TABLE1 SET COL2 = VAR1 where COL1 = PARAM1;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 (PARAM1 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

    let VAR1 = 3;
    EXEC(`UPDATE TABLE1
       SET COL2 = 1 where COL2 = 0`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`UPDATE TABLE1
       SET COL1 = ?
       where COL1 = 0`,[VAR1]);
    EXEC(`UPDATE TABLE1
       SET COL1 = 'name' where COL1 = PARAM11`);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`UPDATE TABLE1
       SET COL2 = ?
       where COL1 = ?`,[VAR1,PARAM1]);
$$;
```

### MERGE

#### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC1
IS
BEGIN
	MERGE INTO TABLE01 t01
	USING TABLE02 t02
		ON (t01.col2 = t02.col2)
	WHEN MATCHED THEN
		UPDATE SET t01.col1 = t02.col2;
END;
```

#### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	EXEC(`MERGE INTO TABLE01 t01
	USING TABLE02 t02
		ON (t01.col2 = t02.col2)
		WHEN MATCHED THEN
		   UPDATE SET t01.col1 = t02.col2`);
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0022](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in a specific statement are considered parameters by default.

## Synonyms

Synonyms used inside PL/SQL blocks are changed to the referenced object and the Schema will be added if necessary.

### Implicit Schema added

When the procedure or function is inside a schema and the synonym is inside that schema, but it is being used without the schema, the converted code will add the schema.

#### Oracle

```sql
CREATE TABLE schema_one.TABLE_TEST1(
    COL1 INTEGER,
    COL2 DATE DEFAULT SYSDATE
    );

CREATE OR REPLACE SYNONYM schema_one.MY_SYNONYM1 FOR schema_one.TABLE_TEST1;

create or replace procedure schema_one.procedure1  as
returnval integer;
begin
    select col1 into returnval from my_synonym1;
end;
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE schema_one.TABLE_TEST1 (
        COL1 INTEGER,
        COL2 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ DEFAULT CURRENT_TIMESTAMP()
        )
        COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
        ;

--        --** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **

--        CREATE OR REPLACE SYNONYM schema_one.MY_SYNONYM1 FOR schema_one.TABLE_TEST1
                                                                                   ;

        CREATE OR REPLACE PROCEDURE schema_one.procedure1 ()
        RETURNS VARCHAR
        LANGUAGE SQL
        COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
        EXECUTE AS CALLER
        AS
        $$
        DECLARE
                returnval integer;
        BEGIN
                select col1 into
                    :returnval
                from
                    schema_one.TABLE_TEST1;
        END;
        $$;
```

### Schema of referenced object added

When the synonym references an object that is in a specific schema, the schema name will be added to the referenced object.

#### Oracle

```sql
CREATE OR REPLACE SYNONYM MY_SYNONYM2 FOR schema_one.TABLE_TEST1;

create or replace procedure procedure2  as
returnval integer;
begin
    select col1 into returnval from my_synonym2;
end;
```

#### Snowflake

```sql
----** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **
--CREATE OR REPLACE SYNONYM MY_SYNONYM2 FOR schema_one.TABLE_TEST1
                                                                ;

CREATE OR REPLACE PROCEDURE procedure2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let RETURNVAL;
    [RETURNVAL] = EXEC(`SELECT
   col1
from
   schema_one.TABLE_TEST1`);
$$;
```

### Related EWIs

1. [SSC-FDM-OR0005](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Synonyms are not supported in Snowflake but references to this synonym were changed by the original object name.
2. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

## Triggers

> **Warning:**
>
> Triggers are not supported by Snowflake, and then they will not be migrated automatically.

Snowflake at this moment does not provide a direct mechanism for triggers, but some Snowflake features can be used to achieve similar results.

We recommend that you perform an analysis of your triggers, and classify them by purpose:

* **Audit Triggers:** the intention of these triggers is to capture information and record the changes done on some tables into other tables.
* **Initialization Triggers:** the intention of these triggers is to add some default values to the new records. They are usually before or after insert triggers
* **Business Rule Barrier Triggers**: these usually apply for BEFORE/AFTER DELETE or UPDATE. These triggers are meant to create a *barrier* to avoid data entry or deletion that will break some business rules.
* **Instead of Triggers**: used for example to allow inserts on views are not supported. The recommendation will be to turn that logic into a stored procedure and introduce calls whenever they were used for insert/delete/update operations.
* **Database Triggers:** cannot be replicated, it is also recommended to encapsulate this logic into a stored procedure. But this logic will need to be manually invoked.
* **Generic After Triggers**: for some **after** triggers, streams, and tasks can be leveraged see section below.

### Audit Trigger

```sql
CREATE OR REPLACE TRIGGER SCHEMA.TRIGGER_NAME
BEFORE UPDATE OR INSERT ON SCHEMA.TRIGGER_NAME FOR EACH ROW
BEGIN
:NEW.LAST_UPDATE := SYSDATE;
END;
```

Before UPDATE triggers for audit cases like this cannot be handled directly. For the INSERT case you can use the default value case explained for the initialization trigger. However for the update case the only option will be to use a task as it is explained later for AFTER triggers. However the LAST_*UPDATE will not be accurate, there will be an offset because the recorded modification will be at the time of task execution (for example if the tasks executes each 5min then the LAST_UPDATE will be recorded 5min later)*.

For UPDATE cases trying to capture the CURRENT_USER is not possible.

Other cases of AUDIT triggers are when they register changes of a table into an update table. Using the AFTER trigger technique describe later can be used but again USER information cannot be tracked and TIME information will not be accurate.

### Initialization Trigger

```sql
CREATE OR REPLACE TRIGGER SCHEMA.TRIGGER_NAME
BEFORE INSERT ON SCHEMA.TABLE1 FOR EACH ROW
BEGIN
   SELECT SCHEMA.TABLE.NEXTVAL INTO :NEW.COLUMN_SEQ FROM DUAL;
   SELECT USER INTO :NEW.UPDATED_BY FROM DUAL;
   SELECT SYSTIMESTAMP INTO :NEW.UPDATED_TM FROM DUAL;
END
```

For these triggers, you might use [Snowflake Default column values](https://docs.snowflake.com/en/sql-reference/sql/create-table.html#optional-parameters) for example for sequence values.

You can also use `CURRENT_`*`USER`() and `CURRENT_TIMESTAMP` instead of `USER` or `SYS_TIMESTAMP`*

This only applies for BEFORE INSERT or AFTER INSERT cases.

### Business Rule Barrier

```sql
CREATE OR REPLACE EDITIONABLE TRIGGER SCHEMA.TRIGGER_NAME
BEFORE DELETE ON SCHEMA.TABLE FOR EACH ROW
BEGIN
   IF (:OLD.termination_date is NULL OR
   :OLD.termination_date >= TRUNC(SYSDATE)+1 ) THEN
     RAISE_APPLICATION_ERROR(-30001,'An employee must be terminated before deleteing the row');
 END IF;
```

For these cases you will need to in-line the trigger actions after/before the DELETE or UPDATE is performed.

A task is not recommended here because tasks are run on an schedule, and then the row will already be modified.

> **Warning:**
>
> This section shows a known workaround for partially implementing *AFTER* Triggers.

### **GENERIC AFTER TRIGGER**

#### Example 1: Basic Trigger conversion

##### Oracle

```sql
CREATE TRIGGER example_trigger
AFTER INSERT ON table1
SELECT * FROM DUAL;
```

##### Snowflake

> **Note:**
>
> SnowConvert AI helpers Code removed from the example. You can find them [here.](helpers.md)

```sql
----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE TRIGGER IS OUT OF TRANSLATION SCOPE. **
--CREATE TRIGGER example_trigger
--AFTER INSERT ON table1
--SELECT * FROM DUAL
```

### In-depth explanation for the snowflake code

#### Streams

These take care of storing the changes made to the table. Please note:

* These will store the delta between the current table state, and the last offset stored by the stream itself. Please take this into account for billing purposes.
* Notice that these do **not** store the information of updates, but rather store them as an insertion.
* In the same manner, they cannot be configured to track only deletions or only updates, and thus they should have to be filtered in the procedure and the task itself (see below).

#### Procedures

These take care of running the trigger’s SQL statement(s). Please note:

* There is a need to flush the stream, hence the new stream creation at the end of the procedure.
* Any actions that need to be filtered (like AFTER-INSERTs-only triggers) will need to be filtered in the stored procedure itself.

#### Tasks

These take care of regularly verifying for stream changes and accordingly execute the trigger’s SQL statement(s). Please note:

* The Tasks work on a schedule, an action does not trigger them. This means that there will be trigger scheduled checks with no data changes performed in the table.
* Tasks cannot be configured to run more than once every sixty (60) seconds, as the minimum time is one (1) minute.
* Once the stream has detected changes there will be, in the worst-case scenario, sixty (60) seconds of delay between the change detection and the trigger execution.
* While adding the WHEN avoids Task execution, snowflake still adds Charge every time it is evaluated; and said Charge will be added to the bill when the trigger actually executes.
* The Task needs a Warehouse to be executed in and will need to be manually set by the client.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## TYPE attribute

### Description

This chapter is related to transforming the [TYPE attribute](https://docs.oracle.com/en/database/oracle/oracle-database/18/lnpls/TYPE-attribute.html#GUID-EAB44F7E-B2AB-4AC6-B83D-B586193D75FC) when it references a column, variable, record, collection, or cursor. The transformation involves getting the referenced item data type and replacing the referencing item TYPE attribute for the data type obtained.

### Sample Source Patterns

#### TYPE attribute for columns

In this case, the referenced item is a column from a table created previously.

##### Oracle

```sql
CREATE TABLE table1(
col1 NUMBER
);

CREATE OR REPLACE PROCEDURE procedure1
IS
var1 table1.col1%TYPE;
BEGIN
NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (
col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
var1 NUMBER(38, 18);
BEGIN
NULL;
END;
$$;
```

#### TYPE attribute for variables

In this case, the referenced item is a variable declared previously.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure1
IS
var0 FLOAT;
var1 var0%TYPE;
var2 var1%TYPE;
var3 var2%TYPE;
BEGIN
NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
var0 FLOAT;
var1 FLOAT;
var2 FLOAT;
var3 FLOAT;
BEGIN
NULL;
END;
$$;
```

> **Note:**
>
> Further information about FLOAT datatype can be found in [FLOAT Data Type](../basic-elements-of-oracle-sql/data-types/oracle-built-in-data-types.md) section

#### TYPE attribute for records

In this case, the referenced item is a record declared previously.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure1
IS
TYPE record_typ_def IS RECORD(field1 NUMBER);
record_var record_typ_def;
var1 record_var%TYPE;
var2 record_var.field1%TYPE;
BEGIN
NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
TYPE record_typ_def IS RECORD(field1 NUMBER);
record_var OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - record_typ_def DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
var1 OBJECT := OBJECT_CONSTRUCT();
var2 NUMBER(38, 18);
BEGIN
NULL;
END;
$$;
```

In the example before, the variable which is referencing the record variable is changed to `OBJECT` as same as the record variable, and the variable which is referencing the record field is changed to the record field data type (`NUMBER (38, 18)`).

> **Warning:**
>
> These changes don’t work for embedded records.

> **Note:**
>
> Further information about records can be found in Collection & Records section.

#### TYPE attribute for collections

In this case, the referenced item is a collection variable, but since collections are not supported, the referencing item TYPE attribute is changed to VARIANT data type.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure1
IS
TYPE collection_type IS TABLE OF NUMBER;
collection_var collection_type;
var1 collection_var%TYPE;
BEGIN
NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
--!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--TYPE collection_type IS TABLE OF NUMBER;
collection_var VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'collection_type' USAGE CHANGED TO VARIANT ***/!!!;
var1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'collection_var%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
BEGIN
NULL;
END;
$$;
```

#### TYPE attribute for cursors

In this case, the referenced item is a cursor variable, but since REF cursors are not supported, the referencing item TYPE attribute is changed to VARIANT data type.

##### Oracle

```sql
CREATE TABLE table1 (col1 NUMBER);

CREATE OR REPLACE PROCEDURE procedure1
IS
TYPE cursor_type IS REF CURSOR RETURN table1%ROWTYPE;
cursor_var cursor_type;
var1 cursor_var%TYPE;
BEGIN
NULL;
END;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (col1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
--!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL REF CURSOR TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--TYPE cursor_type IS REF CURSOR RETURN table1%ROWTYPE;
cursor_var_res RESULTSET;
var1_res RESULTSET;
BEGIN
NULL;
END;
$$;
```

> **Note:**
>
> For those cases when the data type of the referenced item cannot be obtained, the referencing item TYPE attribute is changed to `VARIANT`.

### Knows Issues

#### 1. Cursors and collections declarations are not supported.

Collection and cursor variable declarations are not supported yet so the referencing item TYPE attribute is changed to VARIANT and a warning is added in these cases.

##### 2. Original data type could not be obtained.

When the referenced item data type could not be obtained the referencing item TYPE attribute is changed to VARIANT and a warning is added.

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
3. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
4. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
5. [SSC-EWI-OR0129](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The statement below has usages of nested cursors.
6. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.

---
title: SnowConvert AI - Oracle - PL/SQL to Snowflake Scripting
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pl-sql-to-snowflake-scripting/README.md
section: Migrations
---

# SnowConvert AI - Oracle - PL/SQL to Snowflake Scripting

## ASSIGNMENT STATEMENT

### Description

> The assignment statement sets the value of a data item to a valid value.
> ([Oracle PL/SQL Language Reference ASSIGNMENT Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/assignment-statement.html#GUID-4C3BEFDF-3FFA-4E9D-96D0-4C5E13E08643))

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Oracle Assignment Syntax

```sql
assignment_statement_target := expression ;

assignment_statement_target =
{ collection_variable [ ( index ) ]
| cursor_variable
| :host_cursor_variable
| object[.attribute]
| out_parameter
| placeholder
| record_variable[.field]
| scalar_variable
}
```

##### Snowflake Scripting Assignment Syntax

```sql
LET <variable_name> <type> { DEFAULT | := } <expression> ;

LET <variable_name> { DEFAULT | := } <expression> ;
```

> **Note:**
>
> `LET` keyword is not needed for assignment statements when the variable has been declared before. Check [Snowflake Assignment documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/let.html#let) for more information.

### Sample Source Patterns

#### 1. Scalar Variables

##### Oracle

```sql
CREATE TABLE TASSIGN (
    COL1 NUMBER,
    COL2 NUMBER,
    COL3 VARCHAR(20),
    COL4 VARCHAR(20)
);

CREATE OR REPLACE PROCEDURE PSCALAR
AS
   var1  NUMBER := 40;
   var2  NUMBER := 22.50;
   var3  VARCHAR(20);
   var4  BOOLEAN;
   var5  NUMBER;
BEGIN
   var1 := 1;
   var2 := 2.1;
   var2 := var2 + var2;
   var3 := 'Hello World';
   var4 := true;
   var4 := var1 > 500;
   IF var4 THEN
      var5 := 0;
   ELSE
      var5 := 1;
   END IF;
  INSERT INTO TASSIGN VALUES(var1, var2, var3, var5);
END;

CALL PSCALAR();

SELECT * FROM TASSIGN;
```

##### Result

| COL1 | COL2 | COL3 | COL4 |
| --- | --- | --- | --- |
| 1 | 4.2 | Hello World | 1 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE TASSIGN (
     COL1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
     COL2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
     COL3 VARCHAR(20),
     COL4 VARCHAR(20)
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 ;

 CREATE OR REPLACE PROCEDURE PSCALAR ()
 RETURNS VARCHAR
 LANGUAGE SQL
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
 EXECUTE AS CALLER
 AS
 $$
     DECLARE
     var1 NUMBER(38, 18) := 40;
     var2 NUMBER(38, 18) := 22.50;
     var3  VARCHAR(20);
     var4  BOOLEAN;
     var5 NUMBER(38, 18);
     BEGIN
     var1 := 1;
     var2 := 2.1;
     var2 := :var2 + :var2;
     var3 := 'Hello World';
     var4 := true;
     var4 := :var1 > 500;
     IF (:var4) THEN
       var5 := 0;
       ELSE
       var5 := 1;
       END IF;
       INSERT INTO TASSIGN
       VALUES(:var1, :var2, :var3, :var5);
     END;
 $$;

 CALL PSCALAR();

SELECT * FROM
     TASSIGN;
```

##### Result

| COL1 | COL2 | COL3 | COL4 |
| --- | --- | --- | --- |
| 1.000000000000000000 | 4.000000000000000000 | Hello World | 1 |

> **Warning:**
>
> Transformation for some data types needs to be updated, it may cause different results. For example, NUMBER to NUMBER rounds the value and the decimal point is lost. There is already a work item for this issue.

#### 2. Out Parameter Assignment

To get more information about how the output parameters are being converted, please go to the following article Output Parameters.

#### 3. Not Supported Assignments

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE pinvalid(out_parameter   IN OUT NUMBER)
AS
record_variable       employees%ROWTYPE;

TYPE cursor_type IS REF CURSOR;
cursor1   cursor_type;
cursor2   SYS_REFCURSOR;

TYPE collection_type IS TABLE OF NUMBER INDEX BY VARCHAR(64);
collection_variable     collection_type;

BEGIN
--Record Example
  record_variable.last_name := 'Ortiz';

--Cursor Example
  cursor1 := cursor2;

--Collection
  collection_variable('Test') := 5;

--Out Parameter
  out_parameter := 123;
END;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees" **
CREATE OR REPLACE PROCEDURE pinvalid (out_parameter OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    record_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL REF CURSOR TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--    TYPE cursor_type IS REF CURSOR;
    cursor1_res RESULTSET;
    cursor2_res RESULTSET;
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--    TYPE collection_type IS TABLE OF NUMBER INDEX BY VARCHAR(64);
    collection_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'collection_type' USAGE CHANGED TO VARIANT ***/!!!;
  BEGIN
    --Record Example
    record_variable := OBJECT_INSERT(record_variable, 'LAST_NAME', 'Ortiz', true);

    --Cursor Example
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
      cursor1 := :cursor2;

    --Collection
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
      collection_variable('Test') := 5;
    --Out Parameter
    out_parameter := 123;
  END;
$$;
```

### Known Issues

#### 1. Several Unsupported Assignment Statements

Currently, transformation for cursor, collection, record, and user-defined type variables are not supported by Snow Scripting. Therefore assignment statements using these variables are commented and marked as not supported. Changing these variables to Snowflake [semi-structured data types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html#semi-structured-data-types) could help as a workaround in some scenarios.

### Related EWIs

1. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
3. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
4. [SSC-EWI-OR0108](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The Following Assignment Statement is Not Supported by Snowflake Scripting.
5. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
6. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.

## CALL

### Description

There are two types of call statements in Oracle:

#### 1-CALL Statement:

> Use the `CALL` statement to execute a routine (a standalone procedure or function, or a procedure or function defined within a type or package) from within SQL. ([Oracle SQL Language Reference CALL](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/CALL.html#GUID-6CD7B9C4-E5DC-4F3C-9B6A-876AD2C63545))

#### 2-Call Specification:

> A call specification declares a Java method or a C language subprogram so that it can be invoked from PL/SQL. ([Oracle SQL Language Reference Call Specification](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/call-specification.html#GUID-C5F117AE-E9A2-499B-BA6A-35D072575BAD))

The CALL Specification is not supported in Snowflake Scripting since this is part of the development libraries for C and JAVA, not a SQL statement, therefore this statement is not transformed.

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## CASE

Translation reference for CASE statements

### Description

> The `CASE` statement chooses from a sequence of conditions and runs a corresponding statement. For more information regarding Oracle CASE, check [here](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/CASE-statement.html#GUID-F4251A23-0284-4990-A156-00A92F83BC35).

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Simple case

##### Oracle CASE Syntax

```sql
[ <<label>> ] CASE case_operand
  WHEN boolean_expression THEN statement ;
  [ WHEN boolean_expression THEN statement ; ]...
  [ ELSE statement [ statement ]... ;
END CASE [ label ] ;
```

##### Snowflake Scripting CASE Syntax

```sql
CASE ( <expression_to_match> )
    WHEN <expression> THEN
        <statement>;
        [ <statement>; ... ]
    [ WHEN ... ]
    [ ELSE
        <statement>;
        [ <statement>; ... ]
    ]
END [ CASE ] ;
```

#### Searched case

##### Oracle CASE Syntax

```sql
[ <<label>> ] CASE
  WHEN boolean_expression THEN statement ;
  [ WHEN boolean_expression THEN statement ; ]...
  [ ELSE statement [ statement ]... ;
END CASE [ label ];
```

##### Snowflake Scripting CASE Syntax

```sql
CASE
    WHEN <boolean_expression> THEN
        <statement>;
        [ <statement>; ... ]
    [ WHEN ... ]
    [ ELSE
        <statement>;
        [ <statement>; ... ]
    ]
END [ CASE ] ;
```

### Sample Source Patterns

#### Sample auxiliary table

##### Oracle

```sql
CREATE TABLE case_table(col varchar(30));
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE case_table (col varchar(30))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Simple Case

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE caseExample1 ( grade NUMBER )
IS
RESULT VARCHAR(20);
BEGIN
   <<CASE1>>
   CASE grade
    WHEN 10 THEN RESULT:='Excellent';
    WHEN 9 THEN RESULT:='Very Good';
    WHEN 8 THEN RESULT:='Good';
    WHEN 7 THEN RESULT:='Fair';
    WHEN 6 THEN RESULT:='Poor';
    ELSE RESULT:='No such grade';
  END CASE CASE1;
  INSERT INTO CASE_TABLE(COL) VALUES (RESULT);
END;

CALL caseExample1(6);

CALL caseExample1(4);

CALL caseExample1(10);

SELECT * FROM CASE_TABLE;
```

##### Result

| COL |
| --- |
| Poor |
| No such grade |
| Excellent |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE caseExample1 (grade NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT VARCHAR(20);
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<CASE1>> ***/!!!
    CASE :grade
      WHEN 10 THEN
        RESULT := 'Excellent';
      WHEN 9 THEN
        RESULT := 'Very Good';
      WHEN 8 THEN
        RESULT := 'Good';
      WHEN 7 THEN
        RESULT := 'Fair';
      WHEN 6 THEN
        RESULT := 'Poor';
        ELSE
        RESULT := 'No such grade';
    END CASE;
    INSERT INTO CASE_TABLE(COL) VALUES (:RESULT);
  END;
$$;

CALL caseExample1(6);

CALL caseExample1(4);

CALL caseExample1(10);

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "CASE_TABLE" **

SELECT * FROM
  CASE_TABLE;
```

##### Result

| COL |
| --- |
| Poor |
| No such grade |
| Excellent |

#### Searched Case

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE caseExample2 ( grade NUMBER )
IS
RESULT VARCHAR(20);
BEGIN
    <<CASE1>>
    CASE
    	WHEN grade = 10 THEN RESULT:='Excellent';
    	WHEN grade = 9 THEN RESULT:='Very Good';
    	WHEN grade = 8 THEN RESULT:='Good';
    	WHEN grade = 7 THEN RESULT:='Fair';
    	WHEN grade = 6 THEN RESULT:='Poor';
    	ELSE RESULT:='No such grade';
  END CASE CASE1;
  INSERT INTO CASE_TABLE(COL) VALUES (RESULT);
END;

CALL caseExample2(6);
CALL caseExample2(4);
CALL caseExample2(10);
SELECT * FROM CASE_TABLE;
```

##### Result

| COL |
| --- |
| Poor |
| No such grade |
| Excellent |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE caseExample2 (grade NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    RESULT VARCHAR(20);
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<CASE1>> ***/!!!
    CASE
      WHEN :grade = 10 THEN
        RESULT := 'Excellent';
      WHEN :grade = 9 THEN
        RESULT := 'Very Good';
      WHEN :grade = 8 THEN
        RESULT := 'Good';
      WHEN :grade = 7 THEN
        RESULT := 'Fair';
      WHEN :grade = 6 THEN
        RESULT := 'Poor';
        ELSE
        RESULT := 'No such grade';
    END CASE;
    INSERT INTO CASE_TABLE(COL) VALUES (:RESULT);
  END;
$$;

CALL caseExample2(6);

CALL caseExample2(4);

CALL caseExample2(10);

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "CASE_TABLE" **
SELECT * FROM
  CASE_TABLE;
```

##### Result

| COL |
| --- |
| Poor |
| No such grade |
| Excellent |

### Known issues

#### 1. Labels are not supported in Snowflake Scripting CASE syntax

The labels are commented out or removed depending on their position.

### Related EWIs

1. [SSC-EWI-0094](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Label declaration not supported.
2. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.

## COMPOUND STATEMENTS

This section is a translation specification for the compound statements

> **Warning:**
>
> This section is a work in progress, information may change in the future.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### General description

> The basic unit of a PL/SQL source program is the block, which groups related declarations and statements.
>
> A PL/SQL block is defined by the keywords DECLARE, BEGIN, EXCEPTION, and END. These keywords divide the block into a declarative part, an executable part, and an exception-handling part. Only the executable part is required. ([PL/SQL Anonymous Blocks](https://livesql.oracle.com/apex/livesql/file/tutorial_KS0KNKP218J86THKN85XU37.html))

The **`BEGIN...END`** block in Oracle can have the following characteristics:

1. Be nested.
2. Contain the DECLARE statement for variables.
3. Group multiple SQL or PL/SQL statements.

#### Oracle syntax

```sql
[DECLARE <Variable declaration>]
BEGIN
  <Executable statements>
[EXCEPTION <Exception handler>]
END
```

#### Snowflake syntax

```sql
BEGIN
    <statement>;
    [ <statement>; ... ]
[ EXCEPTION <exception_handler> ]
END;
```

> **Note:**
>
> In Snowflake, a BEGIN/END block can be the top-level construct inside an anonymous block ([Snowflake documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/begin#usage-notes)).

### Sample Source Patterns

#### 1. IF-ELSE block

Review the following documentation about IF statements to learn more: SnowConvert AI IF statements translation and [Snowflake IF statement documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/if)

##### Oracle

```sql
DECLARE
    age NUMBER := 18;
BEGIN
    IF age >= 18 THEN
        DBMS_OUTPUT.PUT_LINE('You are an adult.');
    ELSE
        DBMS_OUTPUT.PUT_LINE('You are a minor.');
    END IF;
END;
```

##### Result

```none
Statement processed.
You are an adult.
```

##### Snowflake

> **Warning:**
>
> When calling a procedure or user-defined function (UDF), generating code is needed to support the equivalence as `call_results` variable. In this case, is used to print the information.
>
> Review the user-defined function (UDF) used [here](../built-in-packages.md).

```sql
DECLARE
    age NUMBER(38, 18) := 18;
    call_results VARIANT;
BEGIN
    IF (:age >= 18) THEN
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        call_results := (
            CALL DBMS_OUTPUT.PUT_LINE_UDF('You are an adult.')
        );
    ELSE
        --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
        call_results := (
            CALL DBMS_OUTPUT.PUT_LINE_UDF('You are a minor.')
        );
    END IF;
    RETURN call_results;
END;
```

##### Result

```sql
anonymous block
You are an adult.
```

#### 2. CASE statement

For more information, review the following documentation: SnowConvert AI CASE statement documentation and [Snowflake CASE documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/case)

##### Oracle

```sql
BEGIN
   DECLARE
      day_of_week NUMBER := 3;
   BEGIN
      CASE day_of_week
         WHEN 1 THEN DBMS_OUTPUT.PUT_LINE('Sunday');
         WHEN 2 THEN DBMS_OUTPUT.PUT_LINE('Monday');
         WHEN 3 THEN DBMS_OUTPUT.PUT_LINE('Tuesday');
         WHEN 4 THEN DBMS_OUTPUT.PUT_LINE('Wednesday');
         WHEN 5 THEN DBMS_OUTPUT.PUT_LINE('Thursday');
         WHEN 6 THEN DBMS_OUTPUT.PUT_LINE('Friday');
         WHEN 7 THEN DBMS_OUTPUT.PUT_LINE('Saturday');
         ELSE DBMS_OUTPUT.PUT_LINE('Invalid day');
      END CASE;
   END;
END;
```

##### Result

```none
Statement processed.
Tuesday
```

##### Snowflake

> **Warning:**
>
> When calling a procedure or user-defined function (UDF), generating code is needed to support the equivalence as `call_results` variable. In this case, is used to print the information.
>
> Review the user-defined function (UDF) used [here](../built-in-packages.md).

```sql
DECLARE
   call_results VARIANT;
BEGIN
   DECLARE
      day_of_week NUMBER(38, 18) := 3;
   BEGIN
      CASE :day_of_week
         WHEN 1 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Sunday')
            );
         WHEN 2 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Monday')
            );
         WHEN 3 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Tuesday')
            );
         WHEN 4 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Wednesday')
            );
         WHEN 5 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Thursday')
            );
         WHEN 6 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Friday')
            );
         WHEN 7 THEN
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Saturday')
            );
         ELSE
            --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
            call_results := (
               CALL DBMS_OUTPUT.PUT_LINE_UDF('Invalid day')
            );
      END CASE;
   END;
   RETURN call_results;
END;
```

##### Result

```sql
anonymous block
Tuesday
```

#### 3. LOOP statements

For more information review the following documentation: SnowConvert AI FOR LOOP and Snowflake [LOOP documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/loop) and [FOR documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/for).

##### Oracle

```sql
BEGIN
    FOR i IN 1..10 LOOP
        NULL;
    END LOOP;
END;
```

##### Result

```none
Statement processed.
```

##### Snowflake

##### First Tab

```sql
BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN 1 TO 10
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                                    NULL;
                                END LOOP;
END;
```

##### Result

```none
anonymous block
```

#### 4. Procedure call and OUTPUT parameters

Anonymous block in Oracle may have calls to procedures. Furthermore, the following documentation may be useful: [SnowConvert AI Procedure documentation](create-procedure.md).

The following example uses the OUT parameters, the information about the current transformation can be found here: SnowConvert AI OUTPUT Parameters

##### Oracle

```sql
-- Procedure declaration
CREATE OR REPLACE PROCEDURE calculate_sum(
    p_num1 IN NUMBER,
    p_num2 IN NUMBER,
    p_result OUT NUMBER
)
IS
BEGIN
    -- Calculate the sum of the two numbers
    p_result := p_num1 + p_num2;
END;
/

-- Anonymous block with a procedure call
DECLARE
    -- Declare variables to hold the input and output values
    v_num1 NUMBER := 10;
    v_num2 NUMBER := 20;
    v_result NUMBER;
BEGIN
    -- Call the procedure with the input values and get the result
    calculate_sum(v_num1, v_num2, v_result);

    -- Display the result
    DBMS_OUTPUT.PUT_LINE('The sum of ' || v_num1 || ' and ' || v_num2 || ' is ' || v_result);
END;
/
```

##### Result

```none
Statement processed.
The sum of 10 and 20 is 30
```

##### Snowflake

```sql
-- Procedure declaration
CREATE OR REPLACE PROCEDURE calculate_sum (p_num1 NUMBER(38, 18), p_num2 NUMBER(38, 18), p_result OUT NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
    -- Calculate the sum of the two numbers
        p_result := :p_num1 + :p_num2;
    END;
$$;

-- Anonymous block with a procedure call
DECLARE
    -- Declare variables to hold the input and output values
    v_num1 NUMBER(38, 18) := 10;
    v_num2 NUMBER(38, 18) := 20;
    v_result NUMBER(38, 18);
    call_results VARIANT;
BEGIN
    CALL
    -- Call the procedure with the input values and get the result
    calculate_sum(:v_num1, :v_num2, :v_result);

    -- Display the result
    --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
    call_results := (
        CALL DBMS_OUTPUT.PUT_LINE_UDF('The sum of ' || NVL(:v_num1 :: STRING, '') || ' and ' || NVL(:v_num2 :: STRING, '') || ' is ' || NVL(:v_result :: STRING, ''))
    );
    RETURN call_results;
END;
```

##### Result

```none
anonymous block
The sum of 10 and 20 is 30
```

#### 5. Alter session

For more information, review the following documentation: [Alter session documentation](../sql-translation-reference/README.md).

Notice that in Oracle, the block `BEGIN...END` should use the `EXECUTE IMMEDIATE` statement to run `alter session` statements.

##### Oracle

```sql
DECLARE
     lv_sql_txt VARCHAR2(200);
BEGIN
     lv_sql_txt := 'ALTER SESSION SET nls_date_format = ''DD-MM-YYYY''';
     EXECUTE IMMEDIATE lv_sql_txt;
END;
```

##### Result

```none
Statement processed.
Done
```

##### Snowflake

```sql
DECLARE
     lv_sql_txt VARCHAR(200);
BEGIN
     lv_sql_txt := 'ALTER SESSION SET nls_date_format = ''DD-MM-YYYY''';
     !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!
     EXECUTE IMMEDIATE :lv_sql_txt;
END;
```

##### Result

```none
anonymous block
Done
```

#### 6. Cursors

The following example displays the usage of a `cursor` inside a `BEGIN...END` block. Review the following documentation to learn more: [Cursor documentation](cursor.md).

##### Oracle

```sql
CREATE TABLE employee (
    ID_Number	NUMBER,
    emp_Name	VARCHAR(200),
    emp_Phone	NUMBER
);

INSERT INTO employee VALUES (1, 'NameA NameZ', 1234567890);
INSERT INTO employee VALUES (2, 'NameB NameY', 1234567890);

DECLARE
    var1 VARCHAR(20);
    CURSOR cursor1 IS SELECT emp_Name FROM employee ORDER BY ID_Number;
BEGIN
    OPEN cursor1;
    FETCH cursor1 INTO var1;
    CLOSE cursor1;
	DBMS_OUTPUT.PUT_LINE(var1);
END;
```

##### Result

```none
Statement processed.
NameA NameZ
```

##### Snowflake

> **Warning:**
>
> When calling a procedure or user-defined function (UDF), generating code is needed to support the equivalence as `call_results` variable. In this case, is used to print the information.
>
> Review the user-defined function (UDF) used [here](../built-in-packages.md).

```sql
CREATE OR REPLACE TABLE employee (
	   ID_Number NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
	   emp_Name	VARCHAR(200),
	   emp_Phone NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO employee
VALUES (1, 'NameA NameZ', 1234567890);

INSERT INTO employee
VALUES (2, 'NameB NameY', 1234567890);

DECLARE
    var1 VARCHAR(20);
	   --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
	   cursor1 CURSOR
	   FOR
		SELECT emp_Name FROM
			employee
		ORDER BY ID_Number;
	   call_results VARIANT;
BEGIN
	   OPEN cursor1;
	   FETCH cursor1 INTO
		:var1;
	   CLOSE cursor1;
	   --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
	   call_results := (
		CALL DBMS_OUTPUT.PUT_LINE_UDF(:var1)
	   );
	   RETURN call_results;
END;
```

##### Result

```none
anonymous block
NameA NameZ
```

#### 7. Select statements

For more information review the following documentation: [Select documentation](../sql-queries-and-subqueries/selects.md).

##### Oracle

```sql
CREATE TABLE employee (
    ID_Number NUMBER,
    emp_Name VARCHAR(200),
    emp_Phone NUMBER
);

INSERT INTO employee VALUES (1, 'NameA NameZ', 1234567890);
INSERT INTO employee VALUES (2, 'NameB NameY', 1234567890);

DECLARE
    var_Result NUMBER;
BEGIN
    SELECT COUNT(*) INTO var_Result FROM employee;
    DBMS_OUTPUT.PUT_LINE(var_Result);
END;
```

##### Result

```none
Statement processed.
2
```

##### Snowflake

> **Warning:**
>
> When calling a procedure or user-defined function (UDF), generating code is needed to support the equivalence as `call_results` variable. In this case, is used to print the information.
>
> Review the user-defined function (UDF) used [here](../built-in-packages.md).

```sql
CREATE OR REPLACE TABLE employee (
       ID_Number NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
       emp_Name VARCHAR(200),
       emp_Phone NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
   ;

   INSERT INTO employee
   VALUES (1, 'NameA NameZ', 1234567890);

   INSERT INTO employee
   VALUES (2, 'NameB NameY', 1234567890);

   DECLARE
    var_Result NUMBER(38, 18);
       call_results VARIANT;
   BEGIN
       SELECT COUNT(*) INTO
           :var_Result
       FROM
           employee;
       --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
       call_results := (
           CALL DBMS_OUTPUT.PUT_LINE_UDF(:var_Result)
       );
       RETURN call_results;
   END;
```

##### Result

```none
anonymous block
2
```

#### 8. Join Statements

For more information review the following documentation: [Joins documentation](../sql-queries-and-subqueries/joins.md).

##### Oracle

```sql
CREATE TABLE t1 (col1 INTEGER);
CREATE TABLE t2 (col1 INTEGER);

INSERT INTO t1 (col1) VALUES (2);
INSERT INTO t1 (col1) VALUES (3);
INSERT INTO t1 (col1) VALUES (4);

INSERT INTO t2 (col1) VALUES (1);
INSERT INTO t2 (col1) VALUES (2);
INSERT INTO t2 (col1) VALUES (2);
INSERT INTO t2 (col1) VALUES (3);

DECLARE
    total_price FLOAT;
    CURSOR cursor1 IS SELECT t1.col1 as FirstTable, t2.col1 as SecondTable
    FROM t1 INNER JOIN t2
        ON t2.col1 = t1.col1
    ORDER BY 1,2;
BEGIN
    total_price := 0.0;
    FOR rec IN cursor1 LOOP
      total_price := total_price + rec.FirstTable;
    END LOOP;
    DBMS_OUTPUT.PUT_LINE(total_price);
END;
```

##### Result

```none
Statement processed.
7
```

##### Snowflake

> **Warning:**
>
> When calling a procedure or user-defined function (UDF), generating code is needed to support the equivalence as `call_results` variable. In this case, is used to print the information.
>
> Review the user-defined function (UDF) used [here](../built-in-packages.md).

```sql
CREATE OR REPLACE TABLE t1 (col1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE t2 (col1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO t1(col1) VALUES (2);

INSERT INTO t1(col1) VALUES (3);

INSERT INTO t1(col1) VALUES (4);

INSERT INTO t2(col1) VALUES (1);

INSERT INTO t2(col1) VALUES (2);

INSERT INTO t2(col1) VALUES (2);

INSERT INTO t2(col1) VALUES (3);

DECLARE
    total_price FLOAT;
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    cursor1 CURSOR
    FOR
        SELECT t1.col1 as FIRSTTABLE, t2.col1 as SECONDTABLE
           FROM
            t1
            INNER JOIN
                t2
               ON t2.col1 = t1.col1
           ORDER BY 1,2;
    call_results VARIANT;
BEGIN
    total_price := 0.0;
    OPEN cursor1;
    --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
    FOR rec IN cursor1 DO
        total_price :=
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN FLOAT AND unknown ***/!!!
        :total_price + rec.FIRSTTABLE;
    END FOR;
    CLOSE cursor1;
    --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
    call_results := (
        CALL DBMS_OUTPUT.PUT_LINE_UDF(:total_price)
    );
    RETURN call_results;
END;
```

#### 9. Exception handling

##### Oracle

```sql
DECLARE
      v_result NUMBER;
BEGIN
   v_result := 1 / 0;
   EXCEPTION
      WHEN ZERO_DIVIDE THEN
         DBMS_OUTPUT.PUT_LINE( SQLERRM );
END;
```

##### Result

```none
Statement processed.
ORA-01476: divisor is equal to zero
```

##### Snowflake

> **Warning:**
>
> `ZERO_DIVIDE` exception in Snowflake is not supported.

```sql
DECLARE
      v_result NUMBER(38, 18);
      error_results VARIANT;
BEGIN
      v_result := 1 / 0;
   EXCEPTION
      WHEN ZERO_DIVIDE THEN
      --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
      error_results := (
         CALL DBMS_OUTPUT.PUT_LINE_UDF( SQLERRM )
      );
      RETURN error_results;
END;
```

##### Result

```none
anonymous block
Division by zero
```

### Known issues

1. Unsupported GOTO statements in Oracle.
2. Exceptions that use GOTO statements may be affected too.
3. Cursor functionality may be adapted under current restrictions on translations.

### Related EWIs

1. [SSC-EWI-0027](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md):The following statement uses a variable/literal with an invalid query and it will not be executed.
2. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
3. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.
4. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
5. [SSC-PRF-0004](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop.
6. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL

## CONTINUE

Translation reference to convert Oracle CONTINUE statement to Snowflake Scripting

### Description

> The `CONTINUE` statement exits the current iteration of a loop, either conditionally or unconditionally, and transfers control to the next iteration of either the current loop or an enclosing labeled loop.
> ([Oracle PL/SQL Language Reference CONTINUE Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/CONTINUE-statement.html#GUID-3ED7E5D5-E2D0-42D1-8A7F-97FFC7372775))

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Oracle CONTINUE Syntax

```sql
CONTINUE [ label ] [ WHEN boolean_expression ] ;
```

##### Snowflake Scripting CONTINUE Syntax

```sql
{ CONTINUE | ITERATE } [ <label> ] ;
```

### Sample Source Patterns

#### 1. Simple Continue

Code skips the `INSERT` statement by using `CONTINUE`.

> **Note:**
>
> This case is functionally equivalent.

##### Oracle

```sql
CREATE TABLE continue_testing_table_1 (iterator VARCHAR2(5));

CREATE OR REPLACE PROCEDURE continue_procedure_1
IS
I NUMBER := 0;
J NUMBER := 20;
BEGIN
    WHILE I <= J LOOP
        I := I + 1;
        CONTINUE;
        INSERT INTO continue_testing_table_1
        VALUES (TO_CHAR(I));
    END LOOP;
END;

CALL continue_procedure_1();
SELECT * FROM continue_testing_table_1;
```

##### Result

| ITERATOR |
| --- |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE continue_testing_table_1 (iterator VARCHAR(5))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE continue_procedure_1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I NUMBER(38, 18) := 0;
        J NUMBER(38, 18) := 20;
    BEGIN
        WHILE (:I <= :J)
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                             I := :I + 1;
                             CONTINUE;
                                    INSERT INTO continue_testing_table_1
                                    VALUES (TO_CHAR(:I));
                                END LOOP;
    END;
$$;

CALL continue_procedure_1();

SELECT * FROM
    continue_testing_table_1;
```

##### Result

| ITERATOR |
| --- |

#### 2. Continue with condition

Code skips inserting even numbers by using `CONTINUE`.

> **Note:**
>
> This case is not functionally equivalent, but, you can turn the condition into an `IF` statement.

##### Oracle

```sql
CREATE TABLE continue_testing_table_2 (iterator VARCHAR2(5));

CREATE OR REPLACE PROCEDURE continue_procedure_2
IS
I NUMBER := 0;
J NUMBER := 20;
BEGIN
    WHILE I <= J LOOP
        I := I + 1;
        CONTINUE WHEN MOD(I,2) = 0;
        INSERT INTO continue_testing_table_2 VALUES(TO_CHAR(I));
    END LOOP;
END;

CALL continue_procedure_2();
SELECT * FROM continue_testing_table_2;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 3 |
| 5 |
| 7 |
| 9 |
| 11 |
| 13 |
| 15 |
| 17 |
| 19 |
| 21 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE continue_testing_table_2 (iterator VARCHAR(5))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE continue_procedure_2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I NUMBER(38, 18) := 0;
        J NUMBER(38, 18) := 20;
    BEGIN
        WHILE (:I <= :J)
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                             I := :I + 1;
                             IF (MOD(:I,2) = 0) THEN
                                 CONTINUE;
                             END IF;
                                    INSERT INTO continue_testing_table_2
                             VALUES(TO_CHAR(:I));
                                END LOOP;
    END;
$$;

CALL continue_procedure_2();

SELECT * FROM
    continue_testing_table_2;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 3 |
| 5 |
| 7 |
| 9 |
| 11 |
| 13 |
| 15 |
| 17 |
| 19 |
| 21 |

#### 3. Continue with label and condition

Code skips line 19, and the inner loop is only executed once because the `CONTINUE` is always jumping to the outer loop using the label.

> **Note:**
>
> This case is functionally equivalent applying the same process as the previous sample.

> **Note:**
>
> Note that labels are going to be commented out.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE continue_procedure_3
IS
I NUMBER := 0;
J NUMBER := 10;
K NUMBER := 0;
BEGIN
    <<out_loop>>
    WHILE I <= J LOOP
        I := I + 1;
        INSERT INTO continue_testing_table_3 VALUES('I' || TO_CHAR(I));

        <<in_loop>>
        WHILE K <= J * 2 LOOP
            K := K + 1;
            CONTINUE out_loop WHEN K > J / 2;
            INSERT INTO continue_testing_table_3 VALUES('K' || TO_CHAR(K));
        END LOOP in_loop;

        K := 0;
    END LOOP out_loop;
END;

CALL continue_procedure_3();
SELECT * FROM continue_testing_table_3;
```

##### Result

| ITERATOR |
| --- |
| I1 |
| K1 |
| K2 |
| K3 |
| K4 |
| K5 |
| I2 |
| I3 |
| I4 |
| I5 |
| I6 |
| I7 |
| I8 |
| I9 |
| I10 |
| I11 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE continue_procedure_3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I NUMBER(38, 18) := 0;
        J NUMBER(38, 18) := 10;
        K NUMBER(38, 18) := 0;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<out_loop>> ***/!!!
        WHILE (:I <= :J)
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                             I := :I + 1;
                                    INSERT INTO continue_testing_table_3
                             VALUES('I' || NVL(TO_CHAR(:I) :: STRING, ''));
                             !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<in_loop>> ***/!!!
                             WHILE (:K <= :J * 2)
                                                  --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                                  LOOP
                                                      K := :K + 1;
                                                      IF (:K > :J / 2) THEN
                                                          CONTINUE out_loop;
                                                      END IF;
                                        INSERT INTO continue_testing_table_3
                                                      VALUES('K' || NVL(TO_CHAR(:K) :: STRING, ''));
                                    END LOOP in_loop;
                             K := 0;
                                END LOOP out_loop;
    END;
$$;

CALL continue_procedure_3();

SELECT * FROM
    continue_testing_table_3;
```

##### Result

| ITERATOR |
| --- |
| I1 |
| K1 |
| K2 |
| K3 |
| K4 |
| K5 |
| I2 |
| I3 |
| I4 |
| I5 |
| I6 |
| I7 |
| I8 |
| I9 |
| I10 |
| I11 |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0094](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Label declaration not supported.

## DECLARE

Translation reference to convert Oracle DECLARE statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Oracle DECLARE statement is an optional part of the PL/SQL block statement. It allows the creation of variables, constants, procedures declarations, and definitions, functions declarations, and definitions, exceptions, cursors, types, and many other statements. For more information regarding Oracle DECLARE, check [here](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/block.html#GUID-9ACEB9ED-567E-4E1A-A16A-B8B35214FC9D).

#### Oracle DECLARE Syntax

```sql
declare_section body

declare_section::= { item_list_1 [ item_list_2 ] | item_list_2 }

item_list_1::=
{ type_definition
| cursor_declaration
| item_declaration
| function_declaration
| procedure_declaration
}
 ...

item_list_2::=
{ cursor_declaration
| cursor_definition
| function_declaration
| function_definition
| procedure_declaration
| procedure_definition
}
 ...

item_declaration::=
{ collection_variable_decl
| constant_declaration
| cursor_variable_declaration
| exception_declaration
| record_variable_declaration
| variable_declaration
}

body::= BEGIN statement ...
  [ EXCEPTION exception_handler [ exception_handler ]... ] END [ name ] ;
```

##### Snowflake Scripting DECLARE Syntax

```sql
[ DECLARE
  { <variable_declaration> | <cursor_declaration> | <exception_declaration> | <resultset_declaration> }
  [, { <variable_declaration> | <cursor_declaration> | <exception_declaration> | <resultset_declaration> } ... ]
]
BEGIN
    <statement>;
    [ <statement>; ... ]
[ EXCEPTION <exception_handler> ]
END [ <label> ] ;
```

### Sample Source Patterns

#### Variable declaration

##### Oracle Variable Declaration Syntax

```sql
variable_declaration::=
variable datatype [ [ NOT NULL] {:= | DEFAULT} expression ] ;
```

##### Snowflake Scripting Variable Declaration Syntax

```sql
<variable_name> <type>;

<variable_name> DEFAULT <expression> ;

<variable_name> <type> DEFAULT <expression> ;
```

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE var_decl_proc
IS
var1 NUMBER;
var2 NUMBER := 1;
var3 NUMBER NOT NULL := 1;
var4 NUMBER DEFAULT 1;
var5 NUMBER NOT NULL DEFAULT 1;
BEGIN
    NULL;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE var_decl_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 NUMBER(38, 18);
        var2 NUMBER(38, 18) := 1;
        var3 NUMBER(38, 18) := 1 /*** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE ***/;
        var4 NUMBER(38, 18) DEFAULT 1;
        var5 NUMBER(38, 18) DEFAULT 1 /*** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE ***/;
    BEGIN
        NULL;
    END;
$$;
```

#### Constant declaration

> **Warning:**
>
> Constants are not supported in Snowflake Scripting, however, they are being transformed to variables to simulate the behavior.

##### Oracle Constant Declaration Syntax

```sql
constant_declaration::=
constant CONSTANT datatype [NOT NULL] { := | DEFAULT } expression ;
```

##### Snowflake Scripting Variable Declaration Syntax

```sql
<variable_name> <type>;

<variable_name> DEFAULT <expression> ;

<variable_name> <type> DEFAULT <expression> ;
```

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE const_decl_proc
IS
my_const1 CONSTANT NUMBER := 40;
my_const2 CONSTANT NUMBER NOT NULL := 40;
my_const2 CONSTANT NUMBER DEFAULT 40;
my_const2 CONSTANT NUMBER NOT NULL DEFAULT 40;
BEGIN
    NULL;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE const_decl_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
        my_const1 NUMBER(38, 18) := 40;
        --** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
        --** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE **
        my_const2 NUMBER(38, 18) := 40;
        --** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
        my_const2 NUMBER(38, 18) DEFAULT 40;
        --** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
        --** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE **
        my_const2 NUMBER(38, 18) DEFAULT 40;
    BEGIN
        NULL;
    END;
$$;
```

#### Cursor declaration

##### Oracle Cursor Declaration Syntax

```sql
cursor_declaration::= CURSOR cursor
  [( cursor_parameter_dec [, cursor_parameter_dec ]... )]
    RETURN rowtype;

cursor_parameter_dec::= parameter [IN] datatype [ { := | DEFAULT } expression ]

rowtype::=
{ {db_table_or_view | cursor | cursor_variable}%ROWTYPE
  | record%TYPE
  | record_type
  }
```

##### Snowflake Scripting Cursor Declaration Syntax

```sql
<cursor_name> CURSOR [ ( <argument> [, <argument> ... ] ) ]
        FOR <query> ;
```

> **Danger:**
>
> The Oracle ***cursor declaration*** is not required so it might be commented out on the output code. The ***cursor definition*** will be used instead of and it will be converted to the Snowflake Scripting ***cursor declaration***. Please go to the [CURSOR](https://github.com/snowflake-mountain/SC.Docs/blob/main/translation-reference/translation-reference-1/pl-sql-to-snowflake-scripting/broken-reference/#README) section to get more information about cursor definition.

#### Exception declaration

The exception declaration sometimes could be followed by the exception initialization, the current transformation takes both and merge them into the Snowflake Scripting exception declaration. The original `PRAGMA` `EXCEPTION_INIT` will be commented out.

##### Oracle Exception Declaration Syntax

```sql
exception_declaration::= exception EXCEPTION;

PRAGMA EXCEPTION_INIT ( exception, error_code ) ;
```

##### Snowflake Scripting Exception Declaration Syntax

```sql
<exception_name> EXCEPTION [ ( <exception_number> , '<exception_message>' ) ] ;
```

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure_exception
IS
my_exception EXCEPTION;
my_exception2 EXCEPTION;
PRAGMA EXCEPTION_INIT ( my_exception2, -20100 );
my_exception3 EXCEPTION;
PRAGMA EXCEPTION_INIT ( my_exception3, -19000 );
BEGIN
    NULL;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE procedure_exception ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        my_exception EXCEPTION;
        my_exception2 EXCEPTION (-20100, '');
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
        PRAGMA EXCEPTION_INIT ( my_exception2, -20100 );
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0099 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS ***/!!!
        my_exception3 EXCEPTION;
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
PRAGMA EXCEPTION_INIT ( my_exception3, -19000 );
    BEGIN
        NULL;
    END;
$$;
```

#### Not supported cases

The next Oracle declaration statements are not supported by the Snowflake Scripting declaration block:

1. Cursor variable declaration.
2. Collection variable declaration.
3. Record variable declaration.
4. Type definition (all its variants).
5. Function declaration and definition.
6. Procedure declaration and definition.

### Known issues

#### 1. The variable declarations with NOT NULL constraints are not supported by Snow Scripting.

The creation of variables with `NOT NULL` constraint throws an error in Snow Scripting.

##### 2. The cursor declaration has no equivalent to Snowflake Scripting.

The Oracle cursor declaration is useless so it might be commented out in the output code. The cursor definition will be used instead and it will be converted to the Snowflake Scripting cursor declaration.

##### 3. The exception code exceeds Snowflake Scripting limits.

Oracle exception code is being removed when it exceeds the Snowflake Scripting code limits. The exception code must be an integer between -20000 and -20999.

##### 3. The not supported cases.

There are some Oracle declaration statements that are not supported by the Snowflake Scripting declaration block, so it might be commented out and a warning will be added.

### Related EWIs

1. [SSC-EWI-OR0051](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): PRAGMA EXCEPTION_INIT is not supported.
2. [SSC-EWI-OR0099](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The exception code exceeds the Snowflake Scripting limit.
3. [SSC-FDM-0016](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Constants are not supported by Snowflake Scripting. It was transformed into a variable.
4. [SSC-FDM-OR0025](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Not Null constraint is not supported in Snowflake Procedures.

## DEFAULT PARAMETERS

This article is about the current transformation of the default parameters and how their functionality is being emulated.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

A **default parameter** is a parameter that has a value in case an argument is not passed in the procedure or function call. Since Snowflake doesn’t support default parameters, SnowConvert AI inserts the default value in the procedure or function call.

In the declaration, the DEFAULT VALUE clause of the parameter is removed. Both syntaxes, the `:=` symbol and the `DEFAULT` clause, are supported.

### Sample Source Patterns

#### Sample auxiliaryy code

##### Oracle

```sql
CREATE TABLE TABLE1(COL1 NUMBER, COL2 NUMBER);
CREATE TABLE TABLE2(COL1 NUMBER, COL2 NUMBER, COL2 NUMBER);0016
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE TABLE1 (COL1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
COL2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE TABLE TABLE2 (COL1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
COL2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
COL2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
```

#### Default parameter declaration

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_PARAMS1 (
    param1 NUMBER,
    param2 NUMBER default TO_NUMBER(1)
)
AS
BEGIN
	INSERT INTO TABLE1 (COL1, COL2)
    VALUES(param1, param2);
END;
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_PARAMS2 (
    param1 NUMBER default 1,
    param2 NUMBER default 2
)
AS
BEGIN
	INSERT INTO TABLE1 (COL1, COL2)
    VALUES(param1, param2);
END;

CREATE OR REPLACE PROCEDURE PROCEDURE_WITH_DEAFAULT_PARAMS3 (
    param1 NUMBER DEFAULT 100,
    param2 NUMBER,
    param3 NUMBER DEFAULT 1000
)
IS
BEGIN
	INSERT INTO TABLE2(COL1, COL2, COL3)
    VALUES (param1, param2, param3);
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_PARAMS1 (param1 NUMBER(38, 18),
   param2 NUMBER(38, 18) DEFAULT TO_NUMBER(1)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		INSERT INTO TABLE1(COL1, COL2)
		   VALUES(:param1, :param2);
	END;
$$;

CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_PARAMS2 (
   param1 NUMBER(38, 18) DEFAULT 1,
   param2 NUMBER(38, 18) DEFAULT 2
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		INSERT INTO TABLE1(COL1, COL2)
		   VALUES(:param1, :param2);
	END;
$$;

CREATE OR REPLACE PROCEDURE PROCEDURE_WITH_DEAFAULT_PARAMS3 (
   param1 NUMBER(38, 18) DEFAULT 100, param2 NUMBER(38, 18),
   param3 NUMBER(38, 18) DEFAULT 1000
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		INSERT INTO TABLE2(COL1, COL2, COL3)
		   VALUES (:param1, :param2, :param3);
	END;
$$;
```

#### Calling procedures with default parameters

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_CALLS
AS
BEGIN
    PROC_WITH_DEFAULT_PARAMS1(10, 15);
    PROC_WITH_DEFAULT_PARAMS1(10);
    PROC_WITH_DEFAULT_PARAMS2(10, 15);
    PROC_WITH_DEFAULT_PARAMS2(10);
    PROC_WITH_DEFAULT_PARAMS2();
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_CALLS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL
        PROC_WITH_DEFAULT_PARAMS1(10, 15);
        CALL
        PROC_WITH_DEFAULT_PARAMS1(10);
        CALL
        PROC_WITH_DEFAULT_PARAMS2(10, 15);
        CALL
        PROC_WITH_DEFAULT_PARAMS2(10);
        CALL
        PROC_WITH_DEFAULT_PARAMS2();
    END;
$$;
```

In order to check that the functionality is being emulated correctly the following query is going to execute the procedure and a `SELECT` from the table mentioned before.

##### Oracle

```sql
CALL PROC_WITH_DEFAULT_CALLS();

SELECT * FROM TABLE1;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 10 | 15 |
| 10 | 1 |
| 10 | 15 |
| 10 | 2 |
| 1 | 2 |

##### Snowflake Scripting

```sql
CALL PROC_WITH_DEFAULT_CALLS();

SELECT * FROM TABLE1;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 10 | 15 |
| 10 | 1 |
| 10 | 15 |
| 10 | 2 |
| 1 | 2 |

#### Calling procedures with named arguments and default parameters

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_CALLS2
AS
BEGIN
    PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 20, 30);
    PROCEDURE_WITH_DEAFAULT_PARAMS3(param1 => 10, param2 => 20, param3 => 30);
    PROCEDURE_WITH_DEAFAULT_PARAMS3(param3 => 10, param1 => 20, param2 => 30);
    PROCEDURE_WITH_DEAFAULT_PARAMS3(param3 => 10, param2 => 30);
    PROCEDURE_WITH_DEAFAULT_PARAMS3(param2 => 10, param3 => 30);
    PROCEDURE_WITH_DEAFAULT_PARAMS3(param2 => 10);
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_CALLS2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 20, 30);
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 20, 30);
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 20, 30);
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 30);
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10, 30);
        CALL
        PROCEDURE_WITH_DEAFAULT_PARAMS3(10);
    END;
$$;
```

In order to check that the functionality is being emulated correctly the following query is going to execute the procedure and a `SELECT` from the table mentioned before.

##### Oracle

```sql
CALL PROC_WITH_DEFAULT_CALLS2();

SELECT * FROM TABLE2;
```

##### Result

| COL1 | COL2 | COL3 |
| --- | --- | --- |
| 10 | 20 | 30 |
| 10 | 20 | 30 |
| 20 | 30 | 10 |
| 100 | 30 | 10 |
| 100 | 10 | 30 |
| 100 | 10 | 1000 |

##### Snowflake Scripting

```sql
CALL PROC_WITH_DEFAULT_CALLS2();

SELECT * FROM TABLE2;
```

##### Result

| COL1 | COL2 | COL3 |
| --- | --- | --- |
| 10 | 20 | 30 |
| 10 | 20 | 30 |
| 20 | 30 | 10 |
| 100 | 30 | 10 |
| 100 | 10 | 30 |
| 100 | 10 | 1000 |

### Known Issues

1. No issues found

### Related EWIs

No related EWIs.

## EXECUTE IMMEDIATE

Translation reference to convert Oracle EXECUTE IMMEDIATE statement to Snowflake Scripting

### Description

> The `EXECUTE` `IMMEDIATE` statement builds and runs a dynamic SQL statement in a single operation.
>
> Native dynamic SQL uses the `EXECUTE` `IMMEDIATE` statement to process most dynamic SQL statements. ([Oracle PL/SQL Language Reference EXECUTE IMMEDIATE Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/EXECUTE-IMMEDIATE-statement.html#GUID-C3245A95-B85B-4280-A01F-12307B108DC8))

#### Oracle EXECUTE IMMEDIATE Syntax

```sql
EXECUTE IMMEDIATE <dynamic statement> [<additional clause> , ...];

dynamic statement::= { '<string literal>' | <variable> }

additional clauses::=
{ <into clause> [<using clause>]
| <bulk collect into clause> [<using clause>]
| <using clause> [<dynamic return clause>]
| <dynamic return clasue> }
```

Snowflake Scripting has support for this statement, albeit with some functional differences. For more information on the Snowflake counterpart, please visit [Snowflake’s EXECUTE IMMEDIATE documentation](https://docs.snowflake.com/en/LIMITEDACCESS/snowscript-introduction.html#execute-immediate).

##### Snow Scripting EXECUTE IMMEDIATE Syntax

```sql
EXECUTE IMMEDIATE <dynamic statement> ;

dynamic statement::= {'<string literal>' | <variable> | $<session variable>}
```

### Sample Source Patterns

The next samples will create a table, and attempt to drop the table using Execute Immediate.

#### Using a hard-coded string

##### Oracle

```sql
CREATE TABLE immediate_dropped_table(
    col1 INTEGER
);

CREATE OR REPLACE PROCEDURE dropping_procedure
AS BEGIN
    EXECUTE IMMEDIATE 'DROP TABLE immediate_dropped_table PURGE';
END;

CALL dropping_procedure();
SELECT * FROM immediate_dropped_table;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE immediate_dropped_table (
    col1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE dropping_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE 'DROP TABLE immediate_dropped_table';
    END;
$$;

CALL dropping_procedure();

SELECT * FROM
    immediate_dropped_table;
```

#### Storing the string in a variable

##### Oracle

```sql
CREATE TABLE immediate_dropped_table(
    col1 INTEGER
);

CREATE OR REPLACE PROCEDURE dropping_procedure
AS
BEGIN
    DECLARE
        statement_variable VARCHAR2(500) := 'DROP TABLE immediate_dropped_table PURGE';
    BEGIN
        EXECUTE IMMEDIATE statement_variable;
    END;
END;

CALL dropping_procedure();
SELECT * FROM immediate_dropped_table;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE immediate_dropped_table (
    col1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE dropping_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DECLARE
            statement_variable VARCHAR(500) := 'DROP TABLE immediate_dropped_table';
        BEGIN
            !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
            EXECUTE IMMEDIATE :statement_variable;
        END;
    END;
$$;

CALL dropping_procedure();

SELECT * FROM
    immediate_dropped_table;
```

#### Concatenation for parameters in dynamic statement

##### Oracle

```sql
CREATE TABLE immediate_dropped_table(
    col1 INTEGER
);

CREATE OR REPLACE PROCEDURE dropping_procedure(param1 VARCHAR2)
AS
BEGIN
    DECLARE
        statement_variable VARCHAR2(500) := 'DROP TABLE ' || param1 || ' PURGE';
    BEGIN
        EXECUTE IMMEDIATE statement_variable;
    END;
END;

CALL dropping_procedure();
SELECT * FROM immediate_dropped_table;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE immediate_dropped_table (
    col1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE dropping_procedure (param1 VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DECLARE
            statement_variable VARCHAR(500) := 'DROP TABLE ' || NVL(:param1 :: STRING, '');
        BEGIN
            !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
            EXECUTE IMMEDIATE :statement_variable;
        END;
    END;
$$;

CALL dropping_procedure();

SELECT * FROM
    immediate_dropped_table;
```

#### USING Clause transformation

##### Oracle

```sql
CREATE TABLE immediate_inserted_table(COL1 INTEGER);

CREATE OR REPLACE PROCEDURE inserting_procedure_using(param1 INTEGER)
AS
BEGIN
    EXECUTE IMMEDIATE 'INSERT INTO immediate_inserted_table VALUES (:1)' USING param1;
END;

CALL inserting_procedure_using(1);

SELECT * FROM immediate_inserted_table;
```

##### Results

| COL1 |
| --- |
| 1 |

##### Snowflake Scripting

> **Note:**
>
> Please note parenthesis are required for parameters in the USING Clause in Snowflake Scripting.

```sql
CREATE OR REPLACE TABLE immediate_inserted_table (COL1 INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE inserting_procedure_using (param1 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE 'INSERT INTO immediate_inserted_table
VALUES (?)' USING ( param1);
    END;
$$;

CALL inserting_procedure_using(1);

SELECT * FROM
    immediate_inserted_table;
```

##### Results

| COL1 |
| --- |
| 1 |

### Known Issues

#### 1. Immediate Execution results cannot be stored in variables.

SnowScripting does not support INTO nor BULK COLLECT INTO clauses. For this reason, results will need to be passed through other means.

##### 2. Numeric Placeholders

Numeric Names for placeholders are currently not being recognized by SnowConvert AI, but there is a work item to fix this issue.

##### 3. Argument Expressions are not supported by Snowflake Scripting

In Oracle it is possible to use Expressions as Arguments for the Using Clause; however, this is not supported by Snowflake Scripting, and they are commented out.

##### 4. Dynamic SQL Execution queries may be marked incorrectly as non-runnable.

In some scenarios there an execute statement may be commented regardless of being safe or non-safe to run so please take this into account:

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE inserting_procedure_variable_execute_concatenation_parameter(param1 INTEGER)
IS
    query VARCHAR2(500) := 'INSERT INTO immediate_inserted_table VALUES (';
BEGIN
    EXECUTE IMMEDIATE query || param1 || ')';
END;
```

##### Snowflake Scripting

> **Note:**
>
> Please note parenthesis are required for parameters in the USING Clause in Snowflake Scripting.

```sql
CREATE OR REPLACE PROCEDURE inserting_procedure_variable_execute_concatenation_parameter (param1 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    DECLARE
        query VARCHAR(500) := 'INSERT INTO immediate_inserted_table VALUES (';
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        !!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!
        EXECUTE IMMEDIATE NVL(:query :: STRING, '') || NVL(:param1 :: STRING, '') || ')';
    END;
$$;
```

### Related EWIs

1. [SSC-EWI-0027](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Variable with invalid query.
2. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.

## EXIT

Translation reference to convert Oracle EXIT statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The `EXIT` statement exits the current iteration of a loop, either conditionally or unconditionally, and transfers control to the end of either the current loop or an enclosing labeled loop.
> ([Oracle PL/SQL Language Reference EXIT Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/EXIT-statement.html#GUID-66E20B6C-3606-42AD-A7DB-C8EC782B94D8))

#### Oracle EXIT Syntax

```sql
EXIT [ label ] [ WHEN boolean_expression ] ;
```

##### Snowflake Scripting EXIT Syntax

```sql
{ BREAK | EXIT } [ <label> ] ;
```

### Sample Source Patterns

> **Note:**
>
> Note that you can change `EXIT`with `BREAK`and everything will work the same.

#### 1. Simple Exit

Code skips the `INSERT` statement by using `EXIT`.

> **Note:**
>
> This case is functionally equivalent.

##### Oracle

```sql
CREATE TABLE exit_testing_table_1 (
    iterator VARCHAR2(5)
);

CREATE OR REPLACE PROCEDURE exit_procedure_1
IS
I NUMBER := 0;
J NUMBER := 20;
BEGIN
    WHILE I <= J LOOP
        I := I + 1;
        EXIT;
        INSERT INTO exit_testing_table_1 VALUES(TO_CHAR(I));
    END LOOP;
END;

CALL exit_procedure_1();
SELECT * FROM exit_testing_table_1;
```

##### Result

| ITERATOR |
| --- |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE exit_testing_table_1 (
       iterator VARCHAR(5)
   )
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   ;

   CREATE OR REPLACE PROCEDURE exit_procedure_1 ()
   RETURNS VARCHAR
   LANGUAGE SQL
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   EXECUTE AS CALLER
   AS
   $$
       DECLARE
           I NUMBER(38, 18) := 0;
           J NUMBER(38, 18) := 20;
       BEGIN
           WHILE (:I <= :J)
                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                            LOOP
                                I := :I + 1;
                                EXIT;
                                       INSERT INTO exit_testing_table_1
                                VALUES(TO_CHAR(:I));
                                   END LOOP;
       END;
   $$;

   CALL exit_procedure_1();

   SELECT * FROM
       exit_testing_table_1;
```

##### Result

| ITERATOR |
| --- |

#### 2. Exit with condition

Code exits the loop when the iterator is greater than 5.

> **Note:**
>
> This case is functionally equivalent by turning the condition into an `IF` statement.

##### Oracle

```sql
CREATE TABLE exit_testing_table_2 (
    iterator VARCHAR2(5)
);

CREATE OR REPLACE PROCEDURE exit_procedure_2
IS
I NUMBER := 0;
J NUMBER := 20;
BEGIN
    WHILE I <= J LOOP
        EXIT WHEN I > 5;
        I := I + 1;
        INSERT INTO exit_testing_table_2 VALUES(TO_CHAR(I));
    END LOOP;
END;

CALL exit_procedure_2();
SELECT * FROM exit_testing_table_2;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE exit_testing_table_2 (
       iterator VARCHAR(5)
   )
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   ;

   CREATE OR REPLACE PROCEDURE exit_procedure_2 ()
   RETURNS VARCHAR
   LANGUAGE SQL
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   EXECUTE AS CALLER
   AS
   $$
       DECLARE
           I NUMBER(38, 18) := 0;
           J NUMBER(38, 18) := 20;
       BEGIN
           WHILE (:I <= :J)
                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                            LOOP
                                IF (:I > 5) THEN
                                    EXIT;
                                END IF;
                                I := :I + 1;
                                       INSERT INTO exit_testing_table_2
                                VALUES(TO_CHAR(:I));
                                   END LOOP;
       END;
   $$;

   CALL exit_procedure_2();

   SELECT * FROM
       exit_testing_table_2;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |

#### 3. Exit with label and condition

Code breaks both loops by using the `EXIT` statement pointing to the outer loop.

> **Note:**
>
> This case is functionally equivalent applying the same process as the previous sample.

> **Note:**
>
> Note that labels are going to be commented out.

##### Oracle

```sql
CREATE TABLE exit_testing_table_3 (
    iterator VARCHAR2(5)
);

CREATE OR REPLACE PROCEDURE exit_procedure_3
IS
I NUMBER := 0;
J NUMBER := 10;
K NUMBER := 0;
BEGIN
    <<out_loop>>
    WHILE I <= J LOOP
        I := I + 1;
        INSERT INTO exit_testing_table_3 VALUES('I' || TO_CHAR(I));

        <<in_loop>>
        WHILE K <= J * 2 LOOP
            K := K + 1;
                EXIT out_loop WHEN K > J / 2;
            INSERT INTO exit_testing_table_3 VALUES('K' || TO_CHAR(K));
        END LOOP in_loop;

        K := 0;
    END LOOP out_loop;
END;

CALL exit_procedure_3();
SELECT * FROM exit_testing_table_3;
```

##### Result

| ITERATOR |
| --- |
| I1 |
| K1 |
| K2 |
| K3 |
| K4 |
| K5 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE exit_testing_table_3 (
       iterator VARCHAR(5)
   )
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   ;

   CREATE OR REPLACE PROCEDURE exit_procedure_3 ()
   RETURNS VARCHAR
   LANGUAGE SQL
   COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
   EXECUTE AS CALLER
   AS
   $$
       DECLARE
           I NUMBER(38, 18) := 0;
           J NUMBER(38, 18) := 10;
           K NUMBER(38, 18) := 0;
       BEGIN
           !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<out_loop>> ***/!!!
           WHILE (:I <= :J)
                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                            LOOP
                                I := :I + 1;
                                       INSERT INTO exit_testing_table_3
                                VALUES('I' || NVL(TO_CHAR(:I) :: STRING, ''));
                                !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<in_loop>> ***/!!!
                                WHILE (:K <= :J * 2)
                                                     --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                                     LOOP
                                                         K := :K + 1;
                                                         IF (:K > :J / 2) THEN
                                                             EXIT out_loop;
                                                         END IF;
                                           INSERT INTO exit_testing_table_3
                                                         VALUES('K' || NVL(TO_CHAR(:K) :: STRING, ''));
                                       END LOOP in_loop;
                                K := 0;
                                   END LOOP out_loop;
       END;
   $$;

   CALL exit_procedure_3();

   SELECT * FROM
       exit_testing_table_3;
```

##### Result

| ITERATOR |
| --- |
| I1 |
| K1 |
| K2 |
| K3 |
| K4 |
| K5 |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0094](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Label declaration not supported.

## EXPRESSIONS

Translation reference for Oracle expressions to Snow Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The following table has a summary of how to transform the different [Oracle Expression kinds](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8) into Snow Scripting.

| **Syntax** | **Conversion status** | **Notes** |
| --- | --- | --- |
| [Character Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDGJCJE) | Partial | Partially Supported Common scenarios |
| [Numeric Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDIEJAI) | Partial | Partially Supported Common scenarios |
| [Date Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDIAFJD) | Partial | Partially Supported Common scenarios |
| [Boolean Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDDGEFH) | Partial | Not supported boolean expressions |
| [Simple Case Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDIFFCB) | Full | N/A |
| [Searched Case Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CHDGJEJJ) | Full | N/A |
| [Collection Constructor](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/expression.html#GUID-D4700B45-F2C8-443E-AEE7-2BD20FFD45B8__CJACBCAB) | Not Translated | Snowflake does not have a native equivalent for Oracle collections. See [Collections and Records](collections-and-records.md). |
| [Qualified Expressions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/qualified-expression.html#GUID-1C475462-11D2-4D0B-B2D1-497491F88746__SECTION_O3N_JWF_4JB) | Not Translated | Snowflake does not have a native equivalent for Oracle record types. See [Collections and Records](collections-and-records.md). |

#### Partially supported common scenarios

##### Oracle Constants

##### Oracle

```sql
CREATE TABLE EXPRESSIONS_TABLE(col VARCHAR(30));
CREATE OR REPLACE PROCEDURE EXPRESSIONS_SAMPLE
IS
RESULT VARCHAR(50);
CONST CONSTANT VARCHAR(20) := 'CONSTANT TEXT';
BEGIN
	-- CONSTANT EXPRESSIONS
	RESULT := CONST;
	INSERT INTO EXPRESSIONS_TABLE(COL) VALUES (RESULT);
END;

CALL EXPRESSIONS_SAMPLE();
SELECT * FROM EXPRESSIONS_TABLE;
```

##### Result

| COL |
| --- |
| CONSTANT TEXT |

##### Snowflake

```sql
CREATE OR REPLACE TABLE EXPRESSIONS_TABLE (col VARCHAR(30))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE EXPRESSIONS_SAMPLE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		RESULT VARCHAR(50);
		--** SSC-FDM-0016 - CONSTANTS ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING. IT WAS TRANSFORMED TO A VARIABLE **
		CONST VARCHAR(20) := 'CONSTANT TEXT';
	BEGIN
		-- CONSTANT EXPRESSIONS
		RESULT := :CONST;
		INSERT INTO EXPRESSIONS_TABLE(COL) VALUES (:RESULT);
	END;
$$;

CALL EXPRESSIONS_SAMPLE();

SELECT * FROM
	EXPRESSIONS_TABLE;
```

##### Result

| COL |
| --- |
| CONSTANT TEXT |

#### Not supported numeric expressions

##### Oracle

```sql
CREATE TABLE NUMERIC_EXPRESSIONS_TABLE(col number);

CREATE OR REPLACE PROCEDURE NUMERIC_EXPRESSIONS
IS
RESULT NUMBER;
CURSOR C1 IS SELECT * FROM NUMERIC_EXPRESSIONS_TABLE;
TYPE NUMERIC_TABLE IS TABLE OF NUMBER(10);
COLLECTION NUMERIC_TABLE;
BEGIN
	-- CURSOR EXPRESSIONS
	OPEN C1;
	RESULT := C1%ROWCOUNT;
	CLOSE C1;
	INSERT INTO NUMERIC_EXPRESSIONS_TABLE(COL) VALUES (RESULT);

	-- ** OPERATOR
	RESULT := 10 ** 2;
	INSERT INTO NUMERIC_EXPRESSIONS_TABLE(COL) VALUES (RESULT);

	-- COLLECTION EXPRESSIONS
	COLLECTION := NUMERIC_TABLE(1, 2, 3, 4, 5, 6);
	RESULT := COLLECTION.COUNT + COLLECTION.FIRST;
	INSERT INTO NUMERIC_EXPRESSIONS_TABLE(COL) VALUES (RESULT);

	-- IMPLICIT CURSOR EXPRESSIONS
	UPDATE NUMERIC_EXPRESSIONS_TABLE SET COL = COL + 4;
	RESULT := SQL%ROWCOUNT;
	INSERT INTO NUMERIC_EXPRESSIONS_TABLE(COL) VALUES (RESULT);
END;

CALL NUMERIC_EXPRESSIONS();
SELECT * FROM NUMERIC_EXPRESSIONS_TABLE;
```

##### Result

| COL |
| --- |
| 4 |
| 104 |
| 11 |
| 3 |

#### Not supported boolean expressions

##### Oracle

```sql
--Aux function to convert BOOLEAN to VARCHAR
CREATE OR REPLACE FUNCTION convert_bool(p1 in BOOLEAN)
RETURN VARCHAR
AS
var1 VARCHAR(20) := 'FALSE';
BEGIN
IF p1 THEN
var1 := 'TRUE';
END IF;
RETURN var1;
END;

--Table
CREATE TABLE t_boolean_table
(
conditional_predicate VARCHAR(20),
collection_variable VARCHAR(20),
sql_variable VARCHAR(20)
)

--Main Procedure
CREATE OR REPLACE PROCEDURE p_boolean_limitations
AS

TYPE varray_example IS VARRAY(4) OF VARCHAR(15);
colection_example varray_example := varray_example('John', 'Mary', 'Alberto', 'Juanita');
collection_variable BOOLEAN;
conditional_predicate BOOLEAN;
sql_variable BOOLEAN;

--Result variables
col1 VARCHAR(20);
col2 VARCHAR(20);
col3 VARCHAR(20);
BEGIN

--Conditional predicate
conditional_predicate := INSERTING;

--Collection.EXISTS(index)
collection_variable := colection_example.EXISTS(2);

--Cursor FOUND / NOTFOUND / ISOPEN
sql_variable:= SQL%FOUND OR SQL%NOTFOUND OR SQL%ISOPEN;

--Convert BOOLEAN to VARCHAR to insert
col1 := convert_bool(conditional_predicate);
col2 := convert_bool(collection_variable);
col3 := convert_bool(sql_variable);

INSERT INTO t_boolean_table VALUES (col1, col2, col3);

END;

CALL p_boolean_limitations();

SELECT * FROM t_boolean_table;
```

### Related EWIs.

1. [SSC-FDM-0016](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Constants are not supported by Snowflake Scripting. It was transformed to a variable.

## FOR LOOP

### Description

> With each iteration of the `FOR` `LOOP` statement, its statements run, its index is either incremented or decremented, and control returns to the top of the loop. ([Oracle PL/SQL Language Reference FOR LOOP Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/FOR-LOOP-statement.html#GUID-D00F8F0B-ECFC-48B6-B399-D8B5114E7E21)).

#### Oracle Syntax

```sql
FOR
pls_identifier [ MUTABLE | IMMUTABLE ] [ constrained_type ]
[ , iterand_decl ]

IN

[ REVERSE ] iteration_control pred_clause_seq
[, qual_iteration_ctl]...

LOOP
statement...
END LOOP [ label ] ;
```

##### Snowflake Scripting Syntax

```sql
FOR <counter_variable> IN [ REVERSE ] <start> TO <end> { DO | LOOP }
    statement;
    [ statement; ... ]
END { FOR | LOOP } [ <label> ] ;
```

Snowflake Scripting supports `FOR LOOP` that loops a specified number of times. The upper and lower bounds must be `INTEGER`. Check more information in the [Snowflake Scripting documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/for.html#for).

Oracle `FOR LOOP` behavior can also be modified by using the statements:

* CONTINUE
* EXIT
* GOTO
* RAISE

### Sample Source Patterns

#### 1. FOR LOOP

> **Note:**
>
> This case is functionally equivalent.

##### Oracle FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P1
AS
BEGIN
    FOR i IN 1..10
    LOOP
        NULL;
    END LOOP;

    FOR i IN VAR1..VAR2
    LOOP
        NULL;
    END LOOP;

    FOR i IN REVERSE 1+2..10+5
    LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN 1 TO 10
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            NULL;
        END LOOP;
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN VAR1 TO VAR2
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            NULL;
        END LOOP;
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN REVERSE 1+2 TO 10+5
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            NULL;
        END LOOP;
    END;
$$;
```

#### 2. FOR LOOP with additional clauses

##### Oracle FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P2
AS
BEGIN
    FOR i IN 1..10 WHILE i <= 5 LOOP
        NULL;
    END LOOP;

    FOR i IN 5..15 BY 5 LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "WHILE" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR i IN 1 TO 10
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                                    NULL;
                                END LOOP;
                         !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "BY" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         FOR i IN 5 TO 15
                                          --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                          LOOP
                                           NULL;
                                       END LOOP;
    END;
$$;
```

#### 3. FOR LOOP with multiple conditions

##### Oracle FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P3
AS
BEGIN
    FOR i IN REVERSE 1..3,
    REVERSE i+5..i+7
    LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0100 - FOR LOOP WITH MULTIPLE CONDITIONS IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR i IN REVERSE 1 TO 3
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            NULL;
        END LOOP;
    END;
$$;
```

#### 4. FOR LOOP with unsupported format

##### Oracle FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P3
AS
TYPE values_aat IS TABLE OF PLS_INTEGER INDEX BY PLS_INTEGER;
l_employee_values   values_aat;
BEGIN
    FOR power IN REPEAT power*2 WHILE power <= 64 LOOP
        NULL;
    END LOOP;

    FOR i IN VALUES OF l_employee_values LOOP
        NULL;
    END LOOP;
END;
```

##### Snowflake Scripting FOR LOOP Example

```sql
CREATE OR REPLACE PROCEDURE P3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE values_aat IS TABLE OF PLS_INTEGER INDEX BY PLS_INTEGER;
        l_employee_values VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'values_aat' USAGE CHANGED TO VARIANT ***/!!!;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0103 - FOR LOOP FORMAT IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "WHILE" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR power IN REPEAT power*2 WHILE power <= 64
                                                      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                                      LOOP
            NULL;
        END LOOP;
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0103 - FOR LOOP FORMAT IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **

        FOR i IN VALUES OF :l_employee_values
                                              --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                              LOOP
            NULL;
        END LOOP;
    END;
$$;
```

> **Warning:**
>
> Transformation for custom types is currently not supported for Snowflake Scripting.

### Known Issues

#### 1. For With Multiple Conditions

Oracle allows multiple conditions in a single `FOR LOOP` however, Snowflake Scripting only allows one condition per `FOR LOOP`. Only the first condition is migrated and the others are ignored during transformation. Check SSC-FDM-OR0022.

##### Oracle

```sql
FOR i IN REVERSE 1..3,
REVERSE i+5..i+7
LOOP
    NULL;
END LOOP;
```

##### Snowflake Scripting FOR LOOP Example

```sql
--** SSC-FDM-OR0022 - FOR LOOP WITH MULTIPLE CONDITIONS IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING **
FOR i IN REVERSE 1 TO 3 LOOP
    NULL;
END LOOP;
```

**2. Mutable vs Inmutable Counter Variable**

Oracle allows modifying the value of the `FOR LOOP` variable inside the loop. The [current documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/for.html#usage-notes) includes this functionality but Snowflake recommends avoiding this. Modifying the value of this variable may not behave correctly in Snowflake Scripting.

**3. Integer vs Float number for Upper or Lower Bound**

Snowflake Scripting only allows an `INTEGER` or an expression that evaluates to an `INTEGER` as a bound for the `FOR LOOP` condition. Floating numbers will be rounded up or down and alter the original bound.

**4. Oracle Unsupported Clauses**

Oracle allows additional clauses to the `FOR LOOP` condition. Like the **BY** clause for a stepped increment in the condition. And the **WHILE** and **WHEN** clause for boolean expressions. These additional clauses are not supported in Snowflake Scripting and are ignored during transformation. Check [SSC-EWI-OR0101](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md).

##### Oracle

```sql
FOR i IN 5..15 BY 5 LOOP
    NULL;
END LOOP;
```

##### Snowflake Scripting

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "BY" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
FOR i IN 5 TO 15 LOOP
    NULL;
END LOOP;
```

**5. Unsupported Formats**

Oracle allows different types of conditions for a `FOR LOOP`. It supports boolean expressions, collections, records… However, Snowflake scripting only supports `FOR LOOP` with defined integers as bounds. All other formats are marked as not supported and require additional manual effort to be transformed. Check [SSC-EWI-OR0103](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md).

### Related EWIs

1. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-OR0100](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): For Loop With Multiple Conditions Is Currently Not Supported By Snowflake Scripting. Only First Condition Is Used.
4. [SSC-EWI-OR0101](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Specific For Loop Clause Is Currently Not Supported By Snowflake Scripting.
5. [SSC-EWI-OR0103](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): For Loop Format Is Currently Not Supported By Snowflake Scripting.

## FORALL

### Description

> The `FORALL` statement runs one DML statement multiple times, with different values in the `VALUES` and `WHERE` clauses. ([Oracle PL/SQL Language Reference FORALL Statement](https://docs.oracle.com/database/121/LNPLS/forall_statement.htm#LNPLS01321)).

#### Oracle Syntax

```sql
FORALL index IN bounds_clause [ SAVE ] [ EXCEPTIONS ] dml_statement ;
```

> **Warning:**
>
> Snowflake Scripting has no direct equivalence with the `FORALL` statement, however can be emulated with different workarounds to get functional equivalence.

### Sample Source Patterns

#### Setup Data

##### Oracle

##### Tables 1

```sql
CREATE TABLE table1 (
    column1 NUMBER,
    column2 NUMBER
);

INSERT INTO table1 (column1, column2) VALUES (1, 2);
INSERT INTO table1 (column1, column2) VALUES (2, 3);
INSERT INTO table1 (column1, column2) VALUES (3, 4);
INSERT INTO table1 (column1, column2) VALUES (4, 5);
INSERT INTO table1 (column1, column2) VALUES (5, 6);

CREATE TABLE table2 (
    column1 NUMBER,
    column2 NUMBER
);

INSERT INTO table2 (column1, column2) VALUES (1, 2);
```

##### Tables 2

```sql
CREATE TABLE error_table (
    ORA_ERR_NUMBER$ NUMBER,
    ORA_ERR_MESG$ VARCHAR2(2000),
    ORA_ERR_ROWID$ ROWID,
    ORA_ERR_OPTYP$ VARCHAR2(2),
    ORA_ERR_TAG$ VARCHAR2(2000)
);

--departments
CREATE TABLE parent_table(
    Id   INT PRIMARY KEY,
    Name VARCHAR2(10)
);
INSERT INTO parent_table VALUES (10, 'IT');
INSERT INTO parent_table VALUES (20, 'HR');
INSERT INTO parent_table VALUES (30, 'INFRA');

--employees
CREATE TABLE source_table(
  Id INT PRIMARY KEY,
  Name VARCHAR2(20) NOT NULL,
  DepartmentID INT REFERENCES parent_table(Id)
);
INSERT INTO source_table VALUES (101, 'Anurag111111111', 10);
INSERT INTO source_table VALUES (102, 'Pranaya11111111', 20);
INSERT INTO source_table VALUES (103, 'Hina11111111111', 30);

--a copy of source
CREATE TABLE target_table(
  Id INT PRIMARY KEY,
  Name VARCHAR2(10) NOT NULL,
  DepartmentID INT REFERENCES parent_table(Id)
);

INSERT INTO target_table VALUES (101, 'Anurag', 10);
```

##### Snowflake

##### Tables 1

```sql
CREATE OR REPLACE TABLE table1 (
    column1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    column2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO table1(column1, column2) VALUES (1, 2);

INSERT INTO table1(column1, column2) VALUES (2, 3);

INSERT INTO table1(column1, column2) VALUES (3, 4);

INSERT INTO table1(column1, column2) VALUES (4, 5);

INSERT INTO table1(column1, column2) VALUES (5, 6);

CREATE OR REPLACE TABLE table2 (
    column1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    column2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO table2(column1, column2) VALUES (1, 2);
```

##### Tables 2

```sql
CREATE OR REPLACE TABLE error_table (
  "ORA_ERR_NUMBER$" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
  "ORA_ERR_MESG$" VARCHAR(2000),
  "ORA_ERR_ROWID$" VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!,
  "ORA_ERR_OPTYP$" VARCHAR(2),
  "ORA_ERR_TAG$" VARCHAR(2000)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--departments
CREATE OR REPLACE TABLE parent_table (
      Id   INT PRIMARY KEY,
      Name VARCHAR(10)
  )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO parent_table
VALUES (10, 'IT');

INSERT INTO parent_table
VALUES (20, 'HR');

INSERT INTO parent_table
VALUES (30, 'INFRA');

--employees
CREATE OR REPLACE TABLE source_table (
  Id INT PRIMARY KEY,
  Name VARCHAR(20) NOT NULL,
  DepartmentID INT REFERENCES parent_table (Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO source_table
VALUES (101, 'Anurag111111111', 10);

INSERT INTO source_table
VALUES (102, 'Pranaya11111111', 20);

INSERT INTO source_table
VALUES (103, 'Hina11111111111', 30);

--a copy of source
CREATE OR REPLACE TABLE target_table (
  Id INT PRIMARY KEY,
  Name VARCHAR(10) NOT NULL,
  DepartmentID INT REFERENCES parent_table (Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO target_table
VALUES (101, 'Anurag', 10);
```

#### 1. FORALL With Collection of Records

##### Oracle

> **Note:**
>
> The three cases below have the same transformation to Snowflake Scripting and are functionally equivalent.

##### Source

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS SELECT * FROM table1;
    TYPE tableType IS TABLE OF cursorVariable%ROWTYPE;
    tableVariable tableType;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO tableVariable LIMIT 100;
        EXIT WHEN tableVariable.COUNT = 0;

        FORALL forIndex IN 1..tableVariable.COUNT
            INSERT INTO table2 (column1, column2)
            VALUES (tableVariable(forIndex).column1, tableVariable(forIndex).column2);
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |

```none
   1|	2|
   1|       2|
   2|       3|
   3|       4|
   4|       5|
   5|       6|
```

##### Snowflake

##### FORALL With Collection of Records

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2(column1, column2)
        (
            SELECT
                column1,
                column2
            FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

> **Note:**
>
> The EWIs SSC-PRF-0001 and SSC-PRF-0003 are added in every FETCH BULK COLLECT occurrence into FORALL statement.

#### 2. FORALL With INSERT INTO

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                * FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 3. FORALL With Multiple Fetched Collections

##### Oracle

##### With INSERT INTO

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    column1Collection dbms_sql.NUMBER_table;
    column2Collection dbms_sql.NUMBER_table;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO column1Collection, column2Collection limit 20;
        EXIT WHEN column1Collection.COUNT = 0;
        FORALL forIndex IN 1..column1Collection.COUNT
            INSERT INTO table2 VALUES (
                column1Collection(forIndex),
                column2Collection(forIndex)
            );
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### With UPDATE

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    column1Collection dbms_sql.NUMBER_table;
    column2Collection dbms_sql.NUMBER_table;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO column1Collection, column2Collection limit 2;
        EXIT WHEN column1Collection.COUNT = 0;
        FORALL forIndex IN 1..column1Collection.COUNT
            UPDATE table2 SET column2 = column2Collection(forIndex)
            WHERE column1 = column1Collection(forIndex);
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results INSERT INTO

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |
| 1 | 2 |

##### Results UPDATE

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2. |

##### Snowflake

##### With INSERT INTO

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                $1,
                $2
            FROM
                table1
        );
    END;
$$;
```

##### With UPDATE

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column2 = column1Collection.$2
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS column1Collection
            WHERE
                column1 = column1Collection.$1;
    END;
$$;
```

##### Results INSERT INTO

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

##### Results UPDATE

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |

#### 4. FORALL With Record of Collections

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE recordType IS RECORD(
        column1Collection dbms_sql.NUMBER_table,
        column2Collection dbms_sql.NUMBER_table
    );
    columnRecord recordType;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO columnRecord.column1Collection, columnRecord.column2Collection limit 20;
        FORALL forIndex IN 1..columnRecord.column1Collection.COUNT
            INSERT INTO table2 VALUES (
                columnRecord.column1Collection(forIndex),
                columnRecord.column2Collection(forIndex)
            );
        EXIT WHEN cursorVariable%NOTFOUND;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### Scripting FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                $1,
                $2
            FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 5. FORALL With Dynamic SQL

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    cursorVariable SYS_REFCURSOR;
    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
    query VARCHAR(200) := 'SELECT * FROM table1';
BEGIN
    OPEN cursorVariable FOR query;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        query VARCHAR(200) := 'SELECT * FROM
   table1';
    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE 'CREATE OR REPLACE TEMPORARY TABLE query AS ' || :query;
        INSERT INTO table2
        (
            SELECT
                *
            FROM
                query
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 6. FORALL With Literal SQL

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE SampleProcedure
IS
TYPE TabRecType IS RECORD (
    column1 NUMBER,
    column2 NUMBER
);
TYPE tabType IS TABLE OF TabRecType;
cursorRef SYS_REFCURSOR;
tab tabType;
BEGIN
    OPEN cursorRef FOR 'SELECT src.column1, src.column2 FROM ' || 'table1' || ' src';

    LOOP
        BEGIN
            FETCH cursorRef BULK COLLECT INTO tab LIMIT 1000;
            FORALL i IN 1..tab.COUNT
                INSERT INTO table2 (column1, column2)
                VALUES (tab(i).column1, tab(i).column2);

            EXIT WHEN cursorRef%NOTFOUND;
        END;
    END LOOP;

    CLOSE cursorRef;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE SampleProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        EXECUTE IMMEDIATE 'CREATE OR REPLACE TEMPORARY TABLE cursorRef_TEMP_TABLE AS ' || 'SELECT src.column1, src.column2 FROM ' || 'table1' || ' src';
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2(column1, column2)
        (
            SELECT
                *
            FROM
                cursorRef_TEMP_TABLE
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

#### 7. FORALL With Parametrized Cursors

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    intVariable INTEGER := 7;
    CURSOR cursorVariable(param1 INTEGER, param2 INTEGER default 5) IS
        SELECT * FROM table1
        WHERE
            column2 = intVariable OR
            column1 BETWEEN param1 AND param2;
    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable(1);
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 20;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        intVariable INTEGER := 7;
    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                * FROM
                table1
                   WHERE
                       column2 = :intVariable
                OR
                       column1 BETWEEN 1 AND 5
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

#### 8. FORALL Without LOOPS

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE  myProcedure IS
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    SELECT * BULK COLLECT INTO collectionVariable FROM table1;
        FORALL forIndex IN 1..collectionVariable.COUNT
            INSERT INTO table2 VALUES (
                collectionVariable (forIndex).column1,
                collectionVariable (forIndex).column2
            );
        collectionVariable.DELETE;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                column1,
                column2
            FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 9. FORALL With UPDATE Statements

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            UPDATE table2 SET column1 = '54321' WHERE column2 = collectionVariable(forIndex).column2;
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column1 = '54321'
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS collectionVariable
            WHERE
                column2 = collectionVariable.column2;
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |

#### 10. FORALL With DELETE Statements

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            DELETE FROM table2 WHERE column2 = collectionVariable(forIndex).column2;
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

```none
no data found
```

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        DELETE FROM
            table2
        USING (
            SELECT
                * FROM
                table1) collectionVariable
                WHERE
            table2.column2 = collectionVariable.column2;
    END;
$$;
```

##### Results

```none
Query produced no results
```

#### 11. FORALL With PACKAGE References

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PACKAGE MyPackage AS
    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
END;
/

CREATE OR REPLACE PROCEDURE InsertIntoPackage(param integer) IS
BEGIN
    SELECT
        param,
        param BULK COLLECT INTO MyPackage.collectionVariable
    FROM
        DUAL;
END;
/

CREATE OR REPLACE PROCEDURE InsertUsingPackage IS
BEGIN
        FORALL forIndex IN MyPackage.collectionVariable.FIRST..MyPackage.collectionVariable.LAST
            INSERT INTO table2 VALUES MyPackage.collectionVariable(forIndex);
        MyPackage.collectionVariable.DELETE;
END;
/

DECLARE
    param_value INTEGER := 10;
BEGIN
    InsertIntoPackage(param_value);
    InsertUsingPackage;
END;

select * from table2;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 10 | 10 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE SCHEMA IF NOT EXISTS MyPackage
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

!!!RESOLVE EWI!!! /*** SSC-EWI-OR0049 - PACKAGE TYPE DEFINITIONS in stateful package MyPackage are not supported yet ***/!!!
TYPE collectionTypeDefinition IS
    TABLE OF table1%ROWTYPE;

CREATE OR REPLACE TEMPORARY TABLE MYPACKAGE_COLLECTIONVARIABLE (
);

CREATE OR REPLACE PROCEDURE InsertIntoPackage (param integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DELETE FROM
            MYPACKAGE_COLLECTIONVARIABLE;
        INSERT INTO MYPACKAGE_COLLECTIONVARIABLE
        (
            SELECT
                :param,
                :param
            FROM
        DUAL
        );
    END;
$$;

CREATE OR REPLACE PROCEDURE InsertUsingPackage ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                *
            FROM
                MYPACKAGE_COLLECTIONVARIABLE
        );
    END;
$$;

DECLARE
    param_value INTEGER := 10;
    call_results VARIANT;
BEGIN
    CALL
    InsertIntoPackage(:param_value);
    CALL
    InsertUsingPackage();
    RETURN call_results;
END;

select * from
    table2;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 10.000000000000000000 | 10.000000000000000000 |

> **Warning:**
>
> The transformation above only works if the variable defined in the package is a record of collections.

#### 12. FORALL With MERGE Statements

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
        MERGE INTO table2 tgt
            USING (
                SELECT
                    collectionVariable(forIndex).column1 column1,
                    collectionVariable(forIndex).column2 column2
                FROM DUAL
            ) src
           ON (tgt.column1 = src.column1)
        WHEN MATCHED THEN
            UPDATE SET
               tgt.column2 = src.column2 * 2
        WHEN NOT MATCHED THEN
            INSERT (column1, column2)
            VALUES (src.column1, src.column2);
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 4 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        MERGE INTO table2 tgt
            USING (
                SELECT
                    collectionVariable.column1 column1,
                    collectionVariable.column2 column2
                FROM
                    (
                        SELECT
                            * FROM
                            table1
                    ) collectionVariable
            ) src
           ON (tgt.column1 = src.column1)
        WHEN MATCHED THEN
            UPDATE SET
               tgt.column2 = src.column2 * 2
        WHEN NOT MATCHED THEN
            INSERT (column1, column2)
            VALUES (src.column1, src.column2);
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |
| 1.000000000000000000 | 4.000000000000000000 |

> **Warning:**
>
> The transformation above only works if the `SELECT` statement inside the `MERGE` is selecting from `DUAL` table.

#### 13. Default FORALL transformation

> **Note:**
>
> You might also be interested in [Bulk Cursor Helpers](helpers.md).

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS SELECT * FROM table1;
    TYPE columnsRecordType IS RECORD (column1 dbms_sql.NUMBER_table, column2 dbms_sql.NUMBER_table);
    recordVariable columnsRecordType;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
    col1 dbms_sql.NUMBER_table;
    col2 dbms_sql.NUMBER_table;
BEGIN
    OPEN cursorVariable;
    FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
    FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
        INSERT INTO table2 (column1, column2)
        VALUES (collectionVariable(forIndex).column1, collectionVariable(forIndex).column2);

    FETCH cursorVariable BULK COLLECT INTO col1, col2 limit 2;
    FORALL forIndex IN col1.FIRST..col1.LAST
        INSERT INTO table2 (column1, column2)
        VALUES (col1(forIndex), col2(forIndex));

    LOOP
        FETCH cursorVariable BULK COLLECT INTO recordVariable limit 2;
        EXIT WHEN recordVariable.column1.COUNT = 0;
        FORALL forIndex IN recordVariable.column1.FIRST..recordVariable.column1.LAST
            INSERT INTO table2 (column1, column2)
            VALUES (recordVariable.column1(forIndex), recordVariable.column2(forIndex));
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "table1", "table2" **
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        cursorVariable OBJECT := INIT_CURSOR_UDF('cursorVariable', '   SELECT * FROM
      table1');
        !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
           TYPE columnsRecordType IS RECORD (column1 dbms_sql.NUMBER_table, column2 dbms_sql.NUMBER_table);
           recordVariable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - columnsRecordType DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--           TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
           collectionVariable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'collectionTypeDefinition' USAGE CHANGED TO VARIANT ***/!!!;
           col1 VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'dbms_sql.NUMBER_table' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/;
           col2 VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'dbms_sql.NUMBER_table' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/;
        FORALL INTEGER;
    BEGIN
        cursorVariable := (
            CALL OPEN_BULK_CURSOR_UDF(:cursorVariable)
        );
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        cursorVariable := (
            CALL FETCH_BULK_COLLECTION_RECORDS_UDF(:cursorVariable, 2)
        );
        collectionVariable := :cursorVariable:RESULT;
        FORALL := ARRAY_SIZE(:collectionVariable);
        INSERT INTO table2(column1, column2)
        (
            SELECT
                :collectionVariable[forIndex]:column1,
                : collectionVariable[forIndex]:column2
            FROM
                (
                    SELECT
                        seq4() AS forIndex
                    FROM
                        TABLE(GENERATOR(ROWCOUNT => :FORALL))
                )
        );
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        cursorVariable := (
            CALL FETCH_BULK_COLLECTIONS_UDF(:cursorVariable, 2)
        );
        col1 := :cursorVariable:RESULT[0];
        col2 := :cursorVariable:RESULT[1];
        FORALL := ARRAY_SIZE(:col1);
        INSERT INTO table2(column1, column2)
        (
            SELECT
                :col1[forIndex],
                : col2[forIndex]
            FROM
                (
                    SELECT
                        seq4() AS forIndex
                    FROM
                        TABLE(GENERATOR(ROWCOUNT => :FORALL))
                )
        );
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **

        LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
            cursorVariable := (
                CALL FETCH_BULK_RECORD_COLLECTIONS_UDF(:cursorVariable, 2)
            );
            recordVariable := :cursorVariable:RESULT;
            IF (ARRAY_SIZE(:recordVariable:column1) = 0) THEN
                EXIT;
            END IF;
            FORALL := ARRAY_SIZE(:recordVariable:column1);
            INSERT INTO table2(column1, column2)
            (
                SELECT
                    :recordVariable:column1[forIndex],
                    : recordVariable:column2[forIndex]
                FROM
                    (
                        SELECT
                            seq4() AS forIndex
                        FROM
                            TABLE(GENERATOR(ROWCOUNT => :FORALL))
                    )
            );
        END LOOP;
        cursorVariable := (
            CALL CLOSE_BULK_CURSOR_UDF(:cursorVariable)
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

> **Note:**
>
> This transformation is done only when none of the previously mentioned transformations can be done.

#### 14. Multiple FORALL inside a LOOP clause

> **Note:**
>
> This pattern applies when there is more than one FORALL in the same procedure and it meets the following structure.

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;

    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 20;
        EXIT WHEN collectionVariable.COUNT = 0;

        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);

        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            UPDATE table2 SET column1 = '54321' WHERE column2 = collectionVariable(forIndex).column2;

    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |
| 54321 | 2 |
| 54321 | 3 |
| 54321 | 4 |
| 54321 | 5 |
| 54321 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                * FROM
                table1
        );
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column1 = '54321'
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS collectionVariable
            WHERE
                column2 = collectionVariable.column2;
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |
| 54321 | 2 |
| 54321 | 3 |
| 54321 | 4 |
| 54321 | 5 |
| 54321 | 6 |

#### 15. Multiple FORALL inside different LOOP clauses

> **Note:**
>
> This pattern applies when there is more than one FORALL in the same procedure and it meets the following structure.

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;

    CURSOR cursorVariable2 IS
        SELECT * FROM table1;

    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;

    TYPE collectionTypeDefinition2 IS
        TABLE OF table1%ROWTYPE;
    collectionVariable2 collectionTypeDefinition2;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
    END LOOP;
    CLOSE cursorVariable;

    OPEN cursorVariable2;
    LOOP
        FETCH cursorVariable2 BULK COLLECT INTO collectionVariable2 limit 2;
        EXIT WHEN collectionVariable2.COUNT = 0;
        FORALL forIndex IN collectionVariable2.FIRST..collectionVariable2.LAST
            UPDATE table2 SET column1 = '54321' WHERE column2 = collectionVariable2(forIndex).column2;
    END LOOP;
    CLOSE cursorVariable2;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |
| 54321 | 2 |
| 54321 | 3 |
| 54321 | 4 |
| 54321 | 5 |
| 54321 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                * FROM
                table1
        );
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column1 = '54321'
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS collectionVariable2
            WHERE
                column2 = collectionVariable2.column2;
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |
| 54321 | 2 |
| 54321 | 3 |
| 54321 | 4 |
| 54321 | 5 |
| 54321 | 6 |

#### 16. FORALL with MERGE INTO with LOG ERRORS

> **Warning:**
>
> This pattern is not yet implemmented

##### Oracle

##### LOG ERRORS

```sql
CREATE OR REPLACE PROCEDURE procedure_example (
    department_id_in   IN source_table.DepartmentID%TYPE)
IS
    TYPE employee_ids_t IS TABLE OF source_table%ROWTYPE
    INDEX BY PLS_INTEGER;
    employee_list   employee_ids_t;
BEGIN
    SELECT *
        BULK COLLECT INTO employee_list
        FROM source_table
        WHERE DepartmentID = procedure_example.department_id_in;

    FORALL indx IN 1 .. employee_list.COUNT
      MERGE INTO target_table
      USING (SELECT * FROM DUAL) src
      ON (id = employee_list(indx).id)
      WHEN MATCHED THEN
        UPDATE SET
          name = employee_list(indx).Name
      WHEN NOT MATCHED THEN
        INSERT (Id, Name, DepartmentID)
        VALUES (employee_list(indx).Id, employee_list(indx).Name, employee_list(indx).DepartmentID)
      LOG ERRORS INTO error_table('MERGE INTO ERROR')
      REJECT LIMIT UNLIMITED;

END;

CALL procedure_example(10);

select * from target_table;
select * from error_table;
```

##### Snowflake

##### LOG ERRORS

```sql
--Generated by SnowConvert---------------
CREATE OR REPLACE TRANSIENT TABLE target_staging_table(
  Id INT PRIMARY KEY,
  Name VARCHAR2(10) NOT NULL,
  DepartmentID INT REFERENCES parent_table(Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
--Generated by SnowConvert---------------

CREATE OR REPLACE PROCEDURE procedure_example (DEPARTMENT_ID_IN INT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'source_table.DepartmentID%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CREATE OR REPLACE TEMP TABLE SOURCE_TEMPORAL AS
        WITH source_data as (
            SELECT *
            FROM source_table
            WHERE DEPARTMENTID =: DEPARTMENT_ID_IN
        )
        SELECT source_data.*, parent_table.id as PARENT_KEY
        FROM source_data
        left join parent_table on source_data.DepartmentID = parent_table.id;

        --All records violating foreign key integrity
        INSERT INTO error_table (ERROR, COLUMN_NAME, REJECTED_RECORD)
        SELECT
            'Foreign Key Constraint Violated' ERROR,'KEY_COL' COLUMN_NAME, id
        FROM SOURCE_TEMPORAL
        WHERE PARENT_KEY IS NULL;

        DELETE FROM SOURCE_TEMPORAL
        WHERE PARENT_KEY IS NULL;

        BEGIN
            MERGE INTO target_table
            USING SOURCE_TEMPORAL SRC
            ON SRC.id = target_table.id
            WHEN MATCHED THEN
                UPDATE SET
                    name = SRC.name
            WHEN NOT MATCHED THEN
               INSERT (Id, Name, DepartmentID)
               VALUES (SRC.Id, SRC.Name, SRC.DepartmentID);
        EXCEPTION
            WHEN OTHER THEN
                CREATE OR REPLACE TEMPORARY STAGE my_int_stage
                  COPY_OPTIONS = (ON_ERROR='continue');

                --Create my file and populate with data
                COPY INTO @my_int_stage/my_file FROM (
                SELECT  * exclude(PARENT_KEY) FROM SOURCE_TEMPORAL
                ) OVERWRITE = TRUE ;

                COPY INTO target_staging_table(id, name, DepartmentID)
                FROM (
                  SELECT
                    -- distinct
                    t.$1, t.$2, t.$3
                  FROM @my_int_stage/my_file t
                  ) ON_ERROR = CONTINUE;

                INSERT INTO ERROR_TABLE (ERROR, FILE, LINE, CHARACTER, CATEGORY, CODE, SQL_STATE, COLUMN_NAME, ROW_NUMBER, REJECTED_RECORD)
                SELECT
                    ERROR, FILE,LINE, CHARACTER, CATEGORY, CODE, SQL_STATE, COLUMN_NAME, ROW_NUMBER, REJECTED_RECORD
                FROM TABLE(VALIDATE(target_staging_table, JOB_ID => '_last')) order by line; --The last charge on the current session

                MERGE INTO target_table
                USING target_staging_table staging
                ON staging.id = target_table.id
                WHEN MATCHED THEN
                    UPDATE SET
                        name = staging.name
                WHEN NOT MATCHED THEN
                INSERT (Id, Name, DepartmentID)
                VALUES (staging.Id, staging.Name, staging.DepartmentID);
        END;

        return 'Awesome!';
    END;
$$;

CALL procedure_example(10);

SELECT * FROM target_table;
SELECT * FROM error_table;
```

#### 17. FORALL with INSERT with LOG ERRORS

> **Warning:**
>
> This pattern is not yet implemmented

##### Oracle

##### LOG ERRORS

```sql
CREATE OR REPLACE PROCEDURE procedure_example (
    department_id_in   IN source_table.DepartmentID%TYPE)
IS
    TYPE employee_ids_t IS TABLE OF source_table%ROWTYPE
    INDEX BY PLS_INTEGER;
    employee_list   employee_ids_t;
BEGIN
    SELECT *
        BULK COLLECT INTO employee_list
        FROM source_table
        WHERE DepartmentID = procedure_example.department_id_in;

    FORALL indx IN 1 .. employee_list.COUNT
        INSERT INTO target_table(Id, Name, DepartmentID)
        VALUES (employee_list(indx).Id, employee_list(indx).Name, employee_list(indx).DepartmentID)
        LOG ERRORS INTO error_table('MERGE INTO ERROR')
        REJECT LIMIT UNLIMITED;
END;
```

##### Snowflake

##### LOG ERRORS

```sql
--Generated by SnowConvert---------------
CREATE OR REPLACE TRANSIENT TABLE target_staging_table(
  Id INT PRIMARY KEY,
  Name VARCHAR2(10) NOT NULL,
  DepartmentID INT REFERENCES parent_table(Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;
--Generated by SnowConvert---------------

CREATE OR REPLACE PROCEDURE procedure_example (DEPARTMENT_ID_IN INT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'employees.DepartmentID%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CREATE OR REPLACE TEMP TABLE SOURCE_TEMPORAL AS
        WITH source_data as (
            SELECT *
            FROM source_table
            WHERE DEPARTMENTID =: DEPARTMENT_ID_IN
        )
        SELECT source_data.*, parent_table.id as PARENT_KEY
        FROM source_data
        left join parent_table on source_data.DepartmentID = parent_table.id;

        --All records violating foreign key integrity
        INSERT INTO error_table (ERROR, COLUMN_NAME, REJECTED_RECORD)
        SELECT
            'Foreign Key Constraint Violated' ERROR,'KEY_COL' COLUMN_NAME, id
        FROM SOURCE_TEMPORAL
        WHERE PARENT_KEY IS NULL;

        DELETE FROM SOURCE_TEMPORAL
        WHERE PARENT_KEY IS NULL;

        BEGIN
            INSERT INTO target_table (Id, Name, DepartmentID)
            SELECT SRC.Id, SRC.Name, SRC.DepartmentID FROM SOURCE_TEMPORAL SRC;
        EXCEPTION
            WHEN OTHER THEN
                CREATE OR REPLACE TEMPORARY STAGE my_int_stage
                  COPY_OPTIONS = (ON_ERROR='continue');

                --Create my file and populate with data
                COPY INTO @my_int_stage/my_file FROM (
                SELECT  * exclude(PARENT_KEY) FROM SOURCE_TEMPORAL
                ) OVERWRITE = TRUE ;

                COPY INTO target_staging_table(id, name, DepartmentID)
                FROM (
                  SELECT
                    -- distinct
                    t.$1, t.$2, t.$3
                  FROM @my_int_stage/my_file t
                  ) ON_ERROR = CONTINUE;

                INSERT INTO ERROR_TABLE (ERROR, FILE, LINE, CHARACTER, CATEGORY, CODE, SQL_STATE, COLUMN_NAME, ROW_NUMBER, REJECTED_RECORD)
                SELECT
                    ERROR, FILE,LINE, CHARACTER, CATEGORY, CODE, SQL_STATE, COLUMN_NAME, ROW_NUMBER, REJECTED_RECORD
                FROM TABLE(VALIDATE(target_staging_table, JOB_ID => '_last')) order by line; --The last charge on the current session

                INSERT INTO target_table (Id, Name, DepartmentID)
                SELECT staging.Id, staging.Name, staging.DepartmentID FROM target_staging_table staging;
        END;
    END;
$$;

CALL procedure_example(10);

SELECT * FROM target_table;
SELECT * FROM error_table;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.
2. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
3. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
4. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
5. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
6. [SSC-EWI-OR0049](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Package constants in stateful package are not supported yet.
7. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
8. [SSC-FDM-0015:](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) ​Referenced custom type in query not found.
9. [SSC-PRF-0001](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor fetch bulk operations.
10. [SSC-PRF-0003](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.

## IF

### Description

The `IF` statement either runs or skips a sequence of one or more statements, depending on the value of a `BOOLEAN` expression. For more information regarding Oracle IF, check [here](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/IF-statement.html#GUID-B7D65A8E-B0C3-448F-B79C-6C330190A266).

```sql
IF boolean_expression THEN
    statement
    [ statement ]...
[
ELSIF boolean_expression THEN
    statement
    [ statement ]... ]...
   [
ELSE
statement [ statement ]... ] END IF ;
```

```sql
IF ( <condition> ) THEN
    <statement>;
    [ <statement>; ... ]
[
ELSEIF ( <condition> ) THEN
    <statement>;
    [ <statement>; ... ]
]
[
ELSE
    <statement>;
    [ <statement>; ... ]
]
END IF;
```

### Sample Source Patterns

#### Sample auxiliary table

```sql
CREATE TABLE if_table(col1 varchar(30));
```

```sql
CREATE OR REPLACE TABLE PUBLIC.if_table (col1 varchar(30));
```

#### Possible IF variations

##### Oracle

###### Code 1

```sql
CREATE OR REPLACE PROCEDURE ifExample1 ( flag NUMBER )
IS
BEGIN
    IF flag = 1 THEN
        INSERT INTO if_table(col1) VALUES ('one');
    END IF;
END;

CALL ifExample1(1);
SELECT * FROM if_table;
```

###### Code 2

```sql
CREATE OR REPLACE PROCEDURE ifExample2 ( flag NUMBER )
IS
BEGIN
    IF flag = 1 THEN
        INSERT INTO if_table(col1) VALUES ('one');
    ELSE
        INSERT INTO if_table(col1) VALUES ('Unexpected input.');
    END IF;
END;

CALL ifExample2(2);
SELECT * FROM if_table;
```

###### Code 3

```sql
CREATE OR REPLACE PROCEDURE ifExample3 ( flag NUMBER )
IS
BEGIN
    IF flag = 1 THEN
        INSERT INTO if_table(col1) VALUES ('one');
    ELSIF flag = 2 THEN
        INSERT INTO if_table(col1) VALUES ('two');
    ELSIF flag = 3 THEN
        INSERT INTO if_table(col1) VALUES ('three');
    END IF;
END;

CALL ifExample3(3);
SELECT * FROM if_table;
```

###### Code 4

```sql
CREATE OR REPLACE PROCEDURE ifExample4 ( flag NUMBER )
IS
BEGIN
    IF flag = 1 THEN
        INSERT INTO if_table(col1) VALUES ('one');
    ELSIF flag = 2 THEN
        INSERT INTO if_table(col1) VALUES ('two');
    ELSIF flag = 3 THEN
        INSERT INTO if_table(col1) VALUES ('three');
    ELSE
        INSERT INTO if_table(col1) VALUES ('Unexpected input.');
    END IF;
END;

CALL ifExample4(4);
SELECT * FROM if_table;
```

###### Result 1

| COL1 |
| --- |
| one |

###### Result 2

| COL1 |
| --- |
| Unexpected input. |

###### Result 3

| COL1 |
| --- |
| three |

###### Result 4

| COL1 |
| --- |
| Unexpected input. |

##### Snowflake Scripting

###### Code 1

```sql
CREATE OR REPLACE PROCEDURE ifExample1 (flag NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (:flag = 1) THEN
            INSERT INTO if_table(col1) VALUES ('one');
        END IF;
    END;
$$;

CALL ifExample1(1);

SELECT * FROM
    if_table;
```

###### Code 2

```sql
CREATE OR REPLACE PROCEDURE ifExample2 (flag NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (:flag = 1) THEN
            INSERT INTO if_table(col1) VALUES ('one');
        ELSE
            INSERT INTO if_table(col1) VALUES ('Unexpected input.');
        END IF;
    END;
$$;

CALL ifExample2(2);

SELECT * FROM
    if_table;
```

###### Code 3

```sql
CREATE OR REPLACE PROCEDURE ifExample3 (flag NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (:flag = 1) THEN
            INSERT INTO if_table(col1) VALUES ('one');
        ELSEIF (:flag = 2) THEN
            INSERT INTO if_table(col1) VALUES ('two');
        ELSEIF (:flag = 3) THEN
            INSERT INTO if_table(col1) VALUES ('three');
        END IF;
    END;
$$;

CALL ifExample3(3);

SELECT * FROM
    if_table;
```

###### Code 4

```sql
CREATE OR REPLACE PROCEDURE ifExample4 (flag NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (:flag = 1) THEN
            INSERT INTO if_table(col1) VALUES ('one');
        ELSEIF (:flag = 2) THEN
            INSERT INTO if_table(col1) VALUES ('two');
        ELSEIF (:flag = 3) THEN
            INSERT INTO if_table(col1) VALUES ('three');
        ELSE
            INSERT INTO if_table(col1) VALUES ('Unexpected input.');
        END IF;
    END;
$$;

CALL ifExample4(4);

SELECT * FROM if_table;
```

###### Result 1

| COL1 |
| --- |
| one |

###### Result 2

| COL1 |
| --- |
| Unexpected input. |

###### Result 3

| COL1 |
| --- |
| three |

###### Result 4

| COL1 |
| --- |
| Unexpected input. |

### Known issues

No issues were found.

### Related EWIs

No related EWIs.

## IS EMPTY

This is a translation reference to convert the Oracle IS EMPTY statement to Snowflake

> **Warning:**
>
> This section is a work in progress; information may change in the future.

### Description

> Use the IS [NOT] EMPTY conditions to test whether a specified nested table is empty, regardless whether any elements of the collection are NULL. ([Documentation](https://docs.oracle.com/cd/B14117_01/server.101/b10759/conditions013.htm)).

#### Oracle syntax

```sql
nested_table IS [ NOT ] EMPTY
```

### Sample Source Patterns

#### Oracle

The following example shows the usage of the IS EMPTY statement. The statement is applied over a nested table which uses a UDT as the definition type. The output shows the name of the employees who do not have a phone number.

```sql
CREATE TYPE phone_number_type AS OBJECT (phone_number VARCHAR2(30));
/

CREATE TYPE phone_number_list AS TABLE OF phone_number_type;

CREATE TABLE employee (
    emp_id NUMBER,
    emp_name VARCHAR2(50),
    phone_numbers_col phone_number_list
) NESTED TABLE phone_numbers_col STORE AS nested_tab return as value;

INSERT INTO employee VALUES (
    1,
    'John Doe',
    phone_number_list(phone_number_type('1234567890'))
);
/

INSERT INTO employee VALUES (
    2,
    'Jane Smith',
    phone_number_list()
);

SELECT emp_name
FROM employee
WHERE phone_numbers_col IS EMPTY;
```

##### Output

| EMP_NAME |
| --- |
| Jane Smith |

##### Snowflake

The Snowflake query shown below is the equivalence of the functionality of the IS EMPTY statement. Particularly, the IS EMPTY statement has a difference between a NULL and an EMPTY object.

Notice that the User-Defined Types are transformed to a VARIANT. The VARIANT type in Snowflake is able to store objects and arrays. Since a nested table is a sequence of information, the ARRAY type is the most suitable type to redefine them and verify is the object ARRAY is empty.

The ARRAY_SIZE equivalent solution also allows to ask for nullability of the nested table (transformed to VARIANT). In other words, the VARIANT type can also store NULLs and empty ARRAYs.

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE phone_number_type AS OBJECT (phone_number VARCHAR2(30))
;

!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NESTED TABLE' NODE ***/!!!

CREATE TYPE phone_number_list AS TABLE OF phone_number_type;

CREATE OR REPLACE TABLE employee (
    emp_id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    emp_name VARCHAR(50),
    phone_numbers_col VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'phone_number_list' USAGE CHANGED TO VARIANT ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE VIEW PUBLIC.employee_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
AS
SELECT
    emp_id,
    emp_name,
    phone_numbers_col
FROM
    employee;

INSERT INTO employee
VALUES (
    1,
    'John Doe',
    phone_number_list(phone_number_type('1234567890') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'phone_number_type' NODE ***/!!!) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'phone_number_list' NODE ***/!!!
);

INSERT INTO employee
VALUES (
    2,
    'Jane Smith',
    phone_number_list() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'phone_number_list' NODE ***/!!!
);

SELECT emp_name
FROM
    employee
WHERE
    ARRAY_SIZE( phone_numbers_col) = 0;
```

##### Output

| EMP_NAME |
| --- |
| Jane Smith |

#### Other possible combinations

| Description | Oracle | Snowflake |
| --- | --- | --- |
| Ask for a IS NOT EMPTY | ``` (...) WHERE phone_numbers_col IS NOT EMPTY; ``` | ``` (...) WHERE ARRAY_SIZE(phone_numbers_col) != 0; ``` |
| Ask for NULL instead of EMPTY | ``` (...) WHERE phone_numbers_col IS NULL; ``` | ``` (...) WHERE ARRAY_SIZE(phone_numbers_col) IS NULL; ``` |

### Known Issues

#### **1. User-defined types are being transformed into Variant.**

User-defined types are not supported thus they are transformed into Variant types which could need manual effort to ensure some functionalities.

Review the following page for more information:

[create-type-statement](../sql-translation-reference/create_type.md)

##### **2. Nested tables are not supported.**

Nested tables are not currently supported. The best approach based on this equivalence is to handle nested tables as Variant but declare Arrays with JSON data inside and execute the PARSE_JSON Snowflake function to populate the nested information.

Review the following pages for more information:

[nested-table-array-type-definition.md](collections-and-records.md)

[nested-table-type-definition.md](../sql-translation-reference/create_type.md)

##### **3. Insert statements are not supported for User-defined types.**

Since User-defined types are not supported in consequence the Insert statements to these types are not supported. Specifically in nested tables, the `INSERT INTO ... VALUES` statement has to be changed to a `INSERT INTO ...SELECT` because the ARRAY_CONSTRUCT function is expected to be used in that pattern.

Review the following page for more information:

[object-type-definition.md](../sql-translation-reference/create_type.md)

##### **4. Logic should be adapted to `ARRAY` types.**

Since the nested tables should be equivalently transformed to `VARIANT` and behave as `ARRAYs,`the functionality and logic of implementing procedures and interaction with the data should be adapted.

Review the following examples:

##### 4.1 Procedures equivalence

##### Oracle

```sql
create or replace procedure proc1
as
    col1 phone_number_list:= phone_number_list();
begin
   IF col1 IS EMPTY
   THEN
    dbms_output.put_line('IS EMPTY');
   END IF;
end;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      col1 VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'phone_number_list' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/ := phone_number_list() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'phone_number_list' NODE ***/!!!;
   BEGIN
      IF (ARRAY_SIZE(:col1) = 0) THEN
         --** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
         CALL DBMS_OUTPUT.PUT_LINE_UDF('IS EMPTY');
      END IF;
   END;
$$;
```

##### Output

| PROC1 |
| --- |
| IS EMPTY |

##### 4.2 Select statements

Outputs may differ from tables to `ARRAYs`.

##### Oracle

```sql
SELECT
    t.*
FROM
    employee e,
    table(e.phone_numbers_col) t
WHERE
    emp_id = 1;
```

##### Output

| PHONE_NUMBER |
| --- |
| 1234567890 |

##### Snowflake

```sql
SELECT
    t.*
FROM
    employee e,
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0035 - TABLE FUNCTION IS NOT SUPPORTED WHEN IT IS USED AS A COLLECTION OF EXPRESSIONS ***/!!!
    table(e.phone_numbers_col) t
WHERE
    emp_id = 1;
```

##### Output

| PHONE_NUMBERS_COL |
| --- |
| [ 1234567890 ] |

### Related EWIs

1. [SSC-EWI-0056](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Create Type Not Supported.
2. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-EWI-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The table function is not supported when it is used as a collection of expressions.
5. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
6. [SSC-FDM-0015:](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) ​Referenced custom type in query not found.
7. [SSC-FDM-OR0035](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): DBMS_OUTPUT.PUTLINE check UDF implementation.

## LOCK TABLE

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement removed from the migration; because it is a non-relevant syntax. It means that it is not required in Snowflake.**

### Description

In Oracle, the `LOCK TABLE` statement allows to explicitly acquire a shared or exclusive table lock on the specified table. The table lock lasts until the end of the current transaction. Review more information [here](https://docs.oracle.com/javadb/10.6.2.1/ref/rrefsqlj40506.html).

**Syntax**

```sql
LOCK TABLE tableName IN { SHARE | EXCLUSIVE } MODE
```

### Sample Source Patterns

#### Locking table

Notice that in this example the `LOCK TABLE` statement has been deleted. This is because Snowflake handles locking in a different method through transactions.

##### Oracle

```sql
LOCK TABLE table1 IN EXCLUSIVE MODE;
```

##### Snowflake

```sql
[Empty output]
```

## LOG ERROR

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The `FORALL` statement runs one DML statement multiple times, with different values in the `VALUES` and `WHERE` clauses. ([Oracle PL/SQL Language Reference FORALL Statement](https://docs.oracle.com/database/121/LNPLS/forall_statement.htm#LNPLS01321)).

#### Oracle Syntax

```sql
FORALL index IN bounds_clause [ SAVE ] [ EXCEPTIONS ] dml_statement ;
```

> **Warning:**
>
> Snowflake Scripting has no direct equivalence with the `FORALL` statement, however can be emulated with different workarounds to get functional equivalence.

### Sample Source Patterns

#### Setup Data

##### Oracle

##### Tables

```sql
CREATE TABLE error_table (
    ORA_ERR_NUMBER$ NUMBER,
    ORA_ERR_MESG$ VARCHAR2(2000),
    ORA_ERR_ROWID$ ROWID,
    ORA_ERR_OPTYP$ VARCHAR2(2),
    ORA_ERR_TAG$ VARCHAR2(2000)
);

--departments
CREATE TABLE parent_table(
    Id INT PRIMARY KEY,
    Name VARCHAR2(10)
);

INSERT INTO parent_table VALUES (10, 'IT');
INSERT INTO parent_table VALUES (20, 'HR');
INSERT INTO parent_table VALUES (30, 'INFRA');

--employees
CREATE TABLE source_table(
    Id INT PRIMARY KEY,
    Name VARCHAR2(20) NOT NULL,
    DepartmentID INT REFERENCES parent_table(Id)
);

INSERT INTO source_table VALUES (101, 'Anurag111111111', 10);
INSERT INTO source_table VALUES (102, 'Pranaya11111111', 20);
INSERT INTO source_table VALUES (103, 'Hina11111111111', 30);

--a copy of source
CREATE TABLE target_table(
    Id INT PRIMARY KEY,
    Name VARCHAR2(10) NOT NULL,
    DepartmentID INT REFERENCES parent_table(Id)
);

INSERT INTO target_table VALUES (101, 'Anurag', 10);
```

##### Snowflake

##### Tables

```sql
CREATE OR REPLACE TABLE error_table (
    "ORA_ERR_NUMBER$" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
    "ORA_ERR_MESG$" VARCHAR(2000),
    "ORA_ERR_ROWID$" VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!,
    "ORA_ERR_OPTYP$" VARCHAR(2),
    "ORA_ERR_TAG$" VARCHAR(2000)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--departments
CREATE OR REPLACE TABLE parent_table (
        Id INT PRIMARY KEY,
        Name VARCHAR(10)
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO parent_table
VALUES (10, 'IT');

INSERT INTO parent_table
VALUES (20, 'HR');

INSERT INTO parent_table
VALUES (30, 'INFRA');

--employees
CREATE OR REPLACE TABLE source_table (
    Id INT PRIMARY KEY,
    Name VARCHAR(20) NOT NULL,
    DepartmentID INT REFERENCES parent_table (Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO source_table
VALUES (101, 'Anurag111111111', 10);

INSERT INTO source_table
VALUES (102, 'Pranaya11111111', 20);

INSERT INTO source_table
VALUES (103, 'Hina11111111111', 30);

--a copy of source
CREATE OR REPLACE TABLE target_table (
    Id INT PRIMARY KEY,
    Name VARCHAR(10) NOT NULL,
    DepartmentID INT REFERENCES parent_table (Id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO target_table
VALUES (101, 'Anurag', 10);
```

#### 1. MERGE INTO Inside a FORALL

##### Oracle

> **Note:**
>
> The three cases below have the same transformation to Snowflake Scripting and are functionally equivalent.

##### Case 1

```sql
CREATE OR REPLACE PROCEDURE procedure_example (
    department_id_in   IN source_table.DepartmentID%TYPE)
IS
    TYPE employee_ids_t IS TABLE OF source_table%ROWTYPE
    INDEX BY PLS_INTEGER;
    employee_list   employee_ids_t;
BEGIN
    SELECT *
        BULK COLLECT INTO employee_list
        FROM source_table
        WHERE DepartmentID = procedure_example.department_id_in;

    FORALL indx IN 1 .. employee_list.COUNT
      MERGE INTO target_table
      USING (SELECT * FROM DUAL) src
      ON (id = employee_list(indx).id)
      WHEN MATCHED THEN
        UPDATE SET
          name = employee_list(indx).Name
      WHEN NOT MATCHED THEN
        INSERT (Id, Name, DepartmentID)
        VALUES (employee_list(indx).Id, employee_list(indx).Name, employee_list(indx).DepartmentID)
      LOG ERRORS INTO error_table('MERGE INTO ERROR')
      REJECT LIMIT UNLIMITED;

END;

CALL procedure_example(10);

select * from target_table;
select * from error_table;
```

##### Snowflake

##### FORALL With Collection of Records

```sql
CREATE OR REPLACE PROCEDURE procedure_example (department_id_in VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'source_table.DepartmentID%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE employee_ids_t IS TABLE OF source_table%ROWTYPE
--        INDEX BY PLS_INTEGER;
        employee_list VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'employee_ids_t' USAGE CHANGED TO VARIANT ***/!!!;
        FORALL INTEGER;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'RECORDS AND COLLECTIONS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        SELECT *
            BULK COLLECT INTO employee_list
            FROM source_table
            WHERE DepartmentID = procedure_example.department_id_in;
        FORALL := ARRAY_SIZE(:employee_list);
          MERGE INTO target_table
          USING (SELECT * FROM
                (
                    SELECT
                        seq4() AS indx
                    FROM
                        TABLE(GENERATOR(ROWCOUNT => :FORALL))
                )) src
          ON (id = : employee_list[indx]:id)
        WHEN MATCHED THEN
        UPDATE SET
          name = : employee_list[indx]:Name
        WHEN NOT MATCHED THEN
        INSERT (Id, Name, DepartmentID)
        VALUES (:employee_list[indx]:Id, : employee_list[indx]:Name, : employee_list[indx]:DepartmentID)
--        --** SSC-FDM-OR0031 - THE ERROR LOGGING CLAUSE IN DML STATEMENTS IS NOT SUPPORTED BY SNOWFLAKE **
--          LOG ERRORS INTO error_table('MERGE INTO ERROR')
--          REJECT LIMIT UNLIMITED
                                ;
    END;
$$;

CALL procedure_example(10);

select * from
    target_table;

select * from
    error_table;
```

#### 2. FORALL With INSERT INTO

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                * FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 3. FORALL With Multiple Fetched Collections

##### Oracle

##### With INSERT INTO

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    column1Collection dbms_sql.NUMBER_table;
    column2Collection dbms_sql.NUMBER_table;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO column1Collection, column2Collection limit 20;
        EXIT WHEN column1Collection.COUNT = 0;
        FORALL forIndex IN 1..column1Collection.COUNT
            INSERT INTO table2 VALUES (
                column1Collection(forIndex),
                column2Collection(forIndex)
            );
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### With UPDATE

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    column1Collection dbms_sql.NUMBER_table;
    column2Collection dbms_sql.NUMBER_table;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO column1Collection, column2Collection limit 2;
        EXIT WHEN column1Collection.COUNT = 0;
        FORALL forIndex IN 1..column1Collection.COUNT
            UPDATE table2 SET column2 = column2Collection(forIndex)
            WHERE column1 = column1Collection(forIndex);
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results INSERT INTO

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |
| 1 | 2 |

##### Results UPDATE

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |

##### Snowflake

##### With INSERT INTO

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                $1,
                $2
            FROM
                table1
        );
    END;
$$;
```

##### With UPDATE

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column2 = column1Collection.$2
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS column1Collection
            WHERE
                column1 = column1Collection.$1;
    END;
$$;
```

##### Results INSERT INTO

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

##### Results UPDATE

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |

#### 4. FORALL With Record of Collections

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE recordType IS RECORD(
        column1Collection dbms_sql.NUMBER_table,
        column2Collection dbms_sql.NUMBER_table
    );
    columnRecord recordType;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO columnRecord.column1Collection, columnRecord.column2Collection limit 20;
        FORALL forIndex IN 1..columnRecord.column1Collection.COUNT
            INSERT INTO table2 VALUES (
                columnRecord.column1Collection(forIndex),
                columnRecord.column2Collection(forIndex)
            );
        EXIT WHEN cursorVariable%NOTFOUND;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### Scripting FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                $1,
                $2
            FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 5. FORALL With Dynamic SQL

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    cursorVariable SYS_REFCURSOR;
    TYPE collectionTypeDefinition IS
        TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
    query VARCHAR(200) := 'SELECT * FROM table1';
BEGIN
    OPEN cursorVariable FOR query;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            INSERT INTO table2 VALUES collectionVariable(forIndex);
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        query VARCHAR(200) := 'SELECT * FROM
   table1';
    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE 'CREATE OR REPLACE TEMPORARY TABLE query AS ' || :query;
        INSERT INTO table2
        (
            SELECT
                *
            FROM
                query
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000 |

#### 6. FORALL Without LOOPS

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE  myProcedure IS
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    SELECT * BULK COLLECT INTO collectionVariable FROM table1;
        FORALL forIndex IN 1..collectionVariable.COUNT
            INSERT INTO table2 VALUES (
                collectionVariable (forIndex).column1,
                collectionVariable (forIndex).column2
            );
        collectionVariable.DELETE;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1 | 2 |
| 1 | 2 |
| 2 | 3 |
| 3 | 4 |
| 4 | 5 |
| 5 | 6 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        INSERT INTO table2
        (
            SELECT
                column1,
                column2
            FROM
                table1
        );
    END;
$$;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 1.000000000000000000 | 2.000000000000000000 |
| 1.000000000000000000 | 2.000000000000000000 |
| 2.000000000000000000 | 3.000000000000000000 |
| 3.000000000000000000 | 4.000000000000000000 |
| 4.000000000000000000 | 5.000000000000000000 |
| 5.000000000000000000 | 6.000000000000000000 |

#### 7. FORALL With UPDATE Statements

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            UPDATE table2 SET column1 = '54321' WHERE column2 = collectionVariable(forIndex).column2;
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

| COLUMN1 | COLUMN2 |
| --- | --- |
| 54321 | 2 |

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        UPDATE table2
            SET column1 = '54321'
            FROM
                (
                    SELECT
                        * FROM
                        table1) AS collectionVariable
            WHERE
                column2 = collectionVariable.column2;
    END;
$$;
```

##### Results

```none
ambiguous column name 'COLUMN2'
```

#### 8. FORALL With DELETE Statements

##### Oracle

##### FORALL Example

```sql
CREATE OR REPLACE PROCEDURE myProcedure IS
    CURSOR cursorVariable IS
        SELECT * FROM table1;
    TYPE collectionTypeDefinition IS TABLE OF table1%ROWTYPE;
    collectionVariable collectionTypeDefinition;
BEGIN
    OPEN cursorVariable;
    LOOP
        FETCH cursorVariable BULK COLLECT INTO collectionVariable limit 2;
        EXIT WHEN collectionVariable.COUNT = 0;
        FORALL forIndex IN collectionVariable.FIRST..collectionVariable.LAST
            DELETE FROM table2 WHERE column2 = collectionVariable(forIndex).column2;
        collectionVariable.DELETE;
    END LOOP;
    CLOSE cursorVariable;
END;
```

##### Results

```none
no data found
```

##### Snowflake

##### FORALL Equivalent

```sql
CREATE OR REPLACE PROCEDURE myProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$

    BEGIN
        --** SSC-PRF-0001 - THIS STATEMENT HAS USAGES OF CURSOR FETCH BULK OPERATIONS **
        --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        DELETE FROM
            table2
        USING (
            SELECT
                * FROM
                table1) collectionVariable
                WHERE
            table2.column2 = collectionVariable.column2;
    END;
$$;
```

##### Results

```none
Query produced no results
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0030](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.
2. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
3. [SSC-EWI-0058](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
4. [SSC-EWI-0062](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Custom type usage changed to variant.
5. [SSC-EWI-OR0129](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): TYPE attribute could not be resolved.
6. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
7. [SSC-FDM-OR0031:](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md) The error logging clause in DML statements is not supported by Snowflake.
8. [SSC-PRF-0001](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor fetch bulk operations.
9. [SSC-PRF-0003](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.

## LOOP

Translation reference to convert Oracle LOOP statement to Snowflake Scripting

### Description

> With each iteration of the basic `LOOP` statement, its statements run and control returns to the top of the loop. The `LOOP` statement ends when a statement inside the loop transfers control outside the loop or raises an exception.
> ([Oracle PL/SQL Language Reference BASIC LOOP Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/basic-LOOP-statement.html#GUID-99AC48AC-D868-43C4-9E4D-6A7671942A39))

#### Oracle BASIC LOOP Syntax

```sql
LOOP statement... END LOOP [ label ] ;
```

##### Snowflake Scripting BASIC LOOP Syntax

```sql
LOOP
  <statement>;
  [ <statement>; ... ]
END LOOP [ <label> ] ;
```

Oracle `BASIC LOOP` behavior can also be modified by using the statements:

* CONTINUE
* EXIT
* GOTO
* RAISE

### Sample Source Patterns

#### Loop simple case

> **Note:**
>
> This case is functionally equivalent.

##### Oracle

```sql
CREATE TABLE loop_testing_table
(
    iterator VARCHAR2(5)
);

CREATE OR REPLACE PROCEDURE loop_procedure
IS
I NUMBER := 1;
J NUMBER := 10;
BEGIN
  LOOP
    EXIT WHEN I = J;
    INSERT INTO loop_testing_table VALUES(TO_CHAR(I));
    I := I+1;
  END LOOP;
END;

CALL loop_procedure();
SELECT * FROM loop_testing_table;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE loop_testing_table
  (
      iterator VARCHAR(5)
  )
  COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
  ;

  CREATE OR REPLACE PROCEDURE loop_procedure ()
  RETURNS VARCHAR
  LANGUAGE SQL
  COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
  EXECUTE AS CALLER
  AS
  $$
  DECLARE
      I NUMBER(38, 18) := 1;
      J NUMBER(38, 18) := 10;
  BEGIN
      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
      LOOP
        IF (:I = :J) THEN
          EXIT;
        END IF;
        INSERT INTO loop_testing_table
        VALUES(TO_CHAR(:I));
        I := :I +1;
      END LOOP;
  END;
  $$;

  CALL loop_procedure();

  SELECT * FROM
  loop_testing_table;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## OUTPUT PARAMETERS

### Description

An **output parameter** is a parameter whose value is passed out of the stored procedure/function module, back to the calling PL/SQL block. Since the output parameters are not supported by Snowflake Scripting, a solution has been implemented in order to emulate their functionality.

### Sample Source Patterns

#### Single out parameter

##### Oracle

```sql
-- Procedure with output parameter declaration
CREATE OR REPLACE PROCEDURE proc_with_single_output_parameters(param1 OUT NUMBER)
IS
BEGIN
    param1 := 123;
END;

-- Procedure with output parameter being called
CREATE OR REPLACE PROCEDURE proc_calling_proc_with_single_output_parameters
IS
    var1 NUMBER;
BEGIN
    proc_with_single_output_parameters(var1);
    INSERT INTO TABLE01 VALUES(var1, -1);
END;
```

##### Snowflake Scripting

```sql
-- Procedure with output parameter declaration
CREATE OR REPLACE PROCEDURE proc_with_single_output_parameters (param1 OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        param1 := 123;
    END;
$$;

-- Procedure with output parameter being called
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE01" **
CREATE OR REPLACE PROCEDURE proc_calling_proc_with_single_output_parameters ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 NUMBER(38, 18);
    BEGIN
        CALL
        proc_with_single_output_parameters(:var1);
        INSERT INTO TABLE01
        VALUES(:var1, -1);
    END;
$$;
```

#### Multiple out parameter

##### Oracle

```sql
-- Procedure with output parameters declaration
CREATE OR REPLACE PROCEDURE proc_with_multiple_output_parameters(
    param1 OUT NUMBER,
    param2 IN OUT NUMBER
)
IS
BEGIN
    param1 := 123;
    param2 := 456;
END;

-- Procedure with output parameters being called
CREATE OR REPLACE PROCEDURE proc_calling_proc_with_multiple_output_parameters
IS
    var1 NUMBER;
    var2 NUMBER;
BEGIN
    proc_with_multiple_output_parameters(var1, var2);
    INSERT INTO TABLE01 VALUES(var1, var2);
END;
```

##### Snowflake Scripting

```sql
-- Procedure with output parameters declaration
CREATE OR REPLACE PROCEDURE proc_with_multiple_output_parameters (param1 OUT NUMBER(38, 18), param2 OUT NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        param1 := 123;
        param2 := 456;
    END;
$$;

-- Procedure with output parameters being called
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE01" **
CREATE OR REPLACE PROCEDURE proc_calling_proc_with_multiple_output_parameters ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 NUMBER(38, 18);
        var2 NUMBER(38, 18);
    BEGIN
        CALL
        proc_with_multiple_output_parameters(:var1, :var2);
        INSERT INTO TABLE01
        VALUES(:var1, :var2);
    END;
$$;
```

In order to check that the functionality is being emulated correctly the following query is going to execute the procedure and a `SELECT` from the table mentioned before.

##### Oracle

```sql
CALL proc_with_single_output_parameters();
CALL proc_with_multiple_output_parameters();

SELECT * FROM table01;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 123 | -1 |
| 123 | 456 |

##### Snowflake Scripting

```sql
CALL proc_with_single_output_parameters();
CALL proc_with_multiple_output_parameters();

SELECT * FROM table01;
```

##### Result

| COL1 | COL2 |
| --- | --- |
| 123.000000000000000000 | -1 |
| 123.000000000000000000 | 456.000000000000000000 |

#### Customer data type OUT parameters

When the output parameter is a customer type, the process is similar to a regular data type.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure_udtype_out_params (
    p_employee_id NUMBER,
    p_address OUT address_type
)
AS
BEGIN
    -- Retrieve the employee's address based on the employee ID.
    SELECT home_address INTO p_address
    FROM employees
    WHERE employee_id = p_employee_id;
END;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "address_type", "employees" **
CREATE OR REPLACE PROCEDURE procedure_udtype_out_params (p_employee_id NUMBER(38, 18), p_address OUT VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'address_type' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        -- Retrieve the employee's address based on the employee ID.
        SELECT home_address INTO
            :p_address
        FROM
            employees
        WHERE employee_id = :p_employee_id;
    END;
$$;
```

#### Cursor OUT parameters

Cursor out parameters are not supported in Snowflake; despite that, a workaround that emulates Oracle’s behavior is applied to the transformed code. The procedure with the out parameters generates a temporary table with a dynamic name, and the procedure call will define the name of the temp table as a string to create the table within the procedure call.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE get_employees_by_dept (
  p_department_id IN NUMBER,
  p_employee_cursor OUT SYS_REFCURSOR
)
AS
BEGIN
 OPEN p_employee_cursor FOR
     SELECT employee_id, first_name, last_name
     FROM   employees_sample
     WHERE  department_id = p_department_id
     ORDER BY last_name;
END get_employees_by_dept;
/

CREATE OR REPLACE PROCEDURE proc_calling_proc_with_cursor()
AS
DECLARE
   l_emp_id NUMBER;
   l_first_name VARCHAR;
   l_last_name VARCHAR;
   l_cursor  SYS_REFCURSOR;
BEGIN
   get_employees_by_dept(10, l_cursor);
   LOOP
       FETCH l_cursor INTO l_emp_id, l_first_name, l_last_name;
       EXIT WHEN l_cursor%NOTFOUND;
       INSERT INTO employee VALUES (l_emp_id, l_first_name, l_last_name);
    END LOOP;
    CLOSE l_cursor;
END;
/
```

##### Snowflake Scripting

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees_sample" **
CREATE OR REPLACE PROCEDURE get_employees_by_dept (p_department_id NUMBER(38, 18), p_employee_cursor VARCHAR
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 BEGIN
  CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:p_employee_cursor) AS
   SELECT employee_id, first_name, last_name
   FROM
    employees_sample
   WHERE  department_id = :p_department_id
   ORDER BY last_name;
 END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employee" **
CREATE OR REPLACE PROCEDURE proc_calling_proc_with_cursor ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
    l_emp_id NUMBER(38, 18);
    l_first_name VARCHAR;
    l_last_name VARCHAR;
    l_cursor_res RESULTSET;
 BEGIN
    CALL
    get_employees_by_dept(10, 'proc_calling_proc_with_cursor_l_cursor');
    LET l_cursor CURSOR
    FOR
   SELECT
    *
   FROM
    IDENTIFIER('proc_calling_proc_with_cursor_l_cursor');
    OPEN l_cursor;
    --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
   --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
        FETCH l_cursor INTO
    :l_emp_id,
    :l_first_name,
    :l_last_name;
   IF (l_emp_id IS NULL) THEN
    EXIT;
   END IF;
        INSERT INTO employee
   SELECT
    :l_emp_id,
    :l_first_name,
    :l_last_name;
     END LOOP;
        CLOSE l_cursor;
 END;
$$;
```

#### Record OUT parameters

Records are not natively supported in Snowflake; however, a workaround was used to emulate them as output parameters. By defining an OBJECT variable instead of the record, we could emulate the record’s field structure by assigning the out parameter result to each object property. Additionally, for each record field assigned as an out parameter, a new variable with the field type will be generated.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure_with_out_params(
  param1 OUT INTEGER,
  param2 OUT INTEGER)
IS
BEGIN
  param1 := 123;
  param2 := 456;
END;

CREATE OR REPLACE PROCEDURE test_proc
IS
  TYPE custom_record1 IS RECORD(field3 INTEGER, field4 INTEGER);
  TYPE custom_record2 IS RECORD(field1 INTEGER, field2 custom_record1);
  var1 custom_record2;
BEGIN
  procedure_with_out_params(var1.field1, var1.field2.field4);
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE procedure_with_out_params (param1 OUT INTEGER, param2 OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    param1 := 123;
    param2 := 456;
  END;
$$;

CREATE OR REPLACE PROCEDURE test_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
    TYPE custom_record1 IS RECORD(field3 INTEGER, field4 INTEGER);
    !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO OBJECT ***/!!!
    TYPE custom_record2 IS RECORD(field1 INTEGER, field2 custom_record1);
    var1 OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - custom_record2 DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    var1_field1 INTEGER;
    var1_field2_field4 INTEGER;
  BEGIN
    CALL
    procedure_with_out_params(:var1_field1, :var1_field2_field4);
    var1 := OBJECT_INSERT(COALESCE(var1, OBJECT_CONSTRUCT()), 'field1', :var1_field1, true);
    var1 := OBJECT_INSERT(COALESCE(var1, OBJECT_CONSTRUCT()), 'field2', OBJECT_INSERT(COALESCE(var1:field2, OBJECT_CONSTRUCT()), 'field4', :var1_field2_field4, true), true);
  END;
$$;
```

#### Package Variables as OUT parameters

Packages are not supported in Snowflake, so their local members, like variables or constants, should also be preserved using a workaround. In this scenario, the package variable would be emulated using a session variable that would be updated after setting a local variable with the output parameter result.

##### Oracle

```sql
CREATE OR REPLACE PACKAGE scha1.pkg1 AS
    PKG_VAR1 NUMBER;
END my_package;
/

CREATE OR REPLACE PROCEDURE PROC_WITH_OUT_PARAM(param1 OUT NUMBER)
AS
BEGIN
   param1 := 0;
END;
CREATE OR REPLACE PROCEDURE PROC ()
AS
BEGIN
   PROC_WITH_OUT_PARAM(param1 => scha1.pkg1.PKG_VAR1);
END;
```

##### Snowflake Scripting

```sql
CREATE SCHEMA IF NOT EXISTS SCHA1_PKG1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;

SET "SCHA1_PKG1.PKG_VAR1" = '~';

CREATE OR REPLACE PROCEDURE PROC_WITH_OUT_PARAM (param1 OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      param1 := 0;
   END;
$$;

CREATE OR REPLACE PROCEDURE PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      SCHA1_PKG1_PKG_VAR1 VARIANT;
   BEGIN
      CALL
      PROC_WITH_OUT_PARAM(param1 => :SCHA1_PKG1_PKG_VAR1);
      CALL UPDATE_PACKAGE_VARIABLE_STATE_UDF('SCHA1_PKG1.PKG_VAR1', TO_VARCHAR(:SCHA1_PKG1_PKG_VAR1));
   END;
$$;
```

### Known Issues

#### 1. Procedures with output parameters inside packages may not work correctly

Currently, there is an issue collecting the semantic information of procedures that reside inside packages, which is why the transformation for output parameters may work partially or not work at all. There is already a work in progress to resolve this issue.

#### 2. Some data types may not work properly

As seen in the transformation, when retrieving the value from the called procedures, an implicit cast is performed from VARIANT to the type specified by the variable. Since there are a lot of possible data types, some casts may fail or contain different data.

### Related EWIs

1. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
2. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
3. [SSC-FDM-0015](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Data Type Not Recognized.

## NESTED PROCEDURES

### Description

In Oracle’s PL/SQL, `NESTED` `PROCEDURES` definition refers to a procedure that is declared and defined within the declarative section of another PL/SQL block. This parent block can be an another procedure, a function, or a package body. For more information please refer to [Oracle procedure declarations and definitions](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/procedure-declaration-and-definition.html#GUID-9A48D7CE-3720-46A4-B5CA-C2250CA86AF2__CJACCJID).

> **Note:**
>
> The transformations described below are specific to procedures embedded within other procedures or packages.

### Sample Source Patterns

#### IN Parameter Mode for Nested Procedures

The IN keyword will be removed, as Snowflake nested procedures only support IN parameters implicitly.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_basic_salary (
    p_base_salary IN NUMBER,
    p_bonus_amount IN NUMBER
)
AS
    v_total_salary NUMBER := p_base_salary;
    PROCEDURE add_bonus (
        p_bonus_to_add IN NUMBER
    )
    AS
    BEGIN
        v_total_salary := v_total_salary + p_bonus_to_add;
        INSERT INTO salary_logs (description, result_value)
        VALUES ('Bonus added', v_total_salary);
    END add_bonus;
BEGIN
    INSERT INTO salary_logs (description, result_value)
    VALUES ('Starting calculation', v_total_salary);
    add_bonus(p_bonus_to_add => p_bonus_amount);
    INSERT INTO salary_logs (description, result_value)
    VALUES ('Final salary', v_total_salary);
END calculate_basic_salary;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_basic_salary (p_base_salary NUMBER(38, 18), p_bonus_amount NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        v_total_salary NUMBER(38, 18) := :p_base_salary;
        add_bonus PROCEDURE (p_bonus_to_add NUMBER(38, 18)
           )
        RETURNS VARCHAR
        AS
            BEGIN
                v_total_salary := :v_total_salary + :p_bonus_to_add;
            INSERT INTO salary_logs(description, result_value)
            VALUES ('Bonus added', :v_total_salary);
            END;
        BEGIN
        INSERT INTO salary_logs(description, result_value)
        VALUES ('Starting calculation', :v_total_salary);
        CALL
        add_bonus(:p_bonus_amount);
        INSERT INTO salary_logs(description, result_value)
        VALUES ('Final salary', :v_total_salary);
        END;
$$;
```

#### OUT Parameter Mode for Nested Procedures

SnowScript’s nested procedures do not support output parameters. To replicate this functionality in Snowflake, a RETURN type must be created based on the output parameters.

If there’s only one output parameter, that parameter will be returned at the end. In cases with multiple output parameters, an object construct will be generated containing their values. During the call, these values will be assigned to a variable, and subsequently, these results will be assigned to the corresponding variables or parameters.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_net_salary (
    p_base_salary IN NUMBER,
    p_bonus_amount IN NUMBER,
    p_net_salary OUT NUMBER
)
AS
    PROCEDURE calculate_tax (
        p_gross_amount IN NUMBER,
        p_net_result OUT NUMBER
    )
    AS
    BEGIN
        p_net_result := p_gross_amount * 0.8;
    END calculate_tax;
BEGIN
    calculate_tax(p_base_salary + p_bonus_amount, p_net_salary);
END calculate_net_salary;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_net_salary (p_base_salary NUMBER(38, 18), p_bonus_amount NUMBER(38, 18), p_net_salary OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        calculate_tax PROCEDURE (p_gross_amount NUMBER(38, 18), p_net_result NUMBER(38, 18)
           )
        RETURNS NUMBER
        AS
            BEGIN
                p_net_result := :p_gross_amount * 0.8;
                RETURN p_net_result;
            END;
        call_results NUMBER;
        BEGIN
        call_results := (
            CALL
            calculate_tax(:p_base_salary + :p_bonus_amount, :p_net_salary)
        );
        p_net_salary := :call_results;
        END;
$$;
```

#### Multiple OUT Parameters in Nested Procedures

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_comprehensive_salary (
    p_base_salary IN NUMBER,
    p_bonus_amount IN NUMBER,
    p_final_salary OUT NUMBER,
    p_tax_calculated OUT NUMBER,
    p_total_gross OUT NUMBER
)
AS
    l_running_total NUMBER := p_base_salary;
    l_tax_amount NUMBER;
    l_net_amount NUMBER;
    PROCEDURE calculate_all_components (
        p_base_amount IN NUMBER,
        p_bonus_amt IN NUMBER,
        p_running_total_inout IN OUT NUMBER,
        p_tax_out OUT NUMBER,
        p_net_out OUT NUMBER
    )
    AS
    BEGIN
        p_running_total_inout := p_base_amount + p_bonus_amt;
        p_tax_out := p_running_total_inout * 0.25;
        p_net_out := p_running_total_inout - p_tax_out;
    END calculate_all_components;
BEGIN
    calculate_all_components(
        p_base_amount => p_base_salary,
        p_bonus_amt => p_bonus_amount,
        p_running_total_inout => l_running_total,
        p_tax_out => l_tax_amount,
        p_net_out => l_net_amount
    );

    p_final_salary := l_net_amount;
    p_tax_calculated := l_tax_amount;
    p_total_gross := l_running_total;
END calculate_comprehensive_salary;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_comprehensive_salary (p_base_salary NUMBER(38, 18), p_bonus_amount NUMBER(38, 18), p_final_salary OUT NUMBER(38, 18), p_tax_calculated OUT NUMBER(38, 18), p_total_gross OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        l_running_total NUMBER(38, 18) := :p_base_salary;
        l_tax_amount NUMBER(38, 18);
        l_net_amount NUMBER(38, 18);
        calculate_all_components PROCEDURE (p_base_amount NUMBER(38, 18), p_bonus_amt NUMBER(38, 18), p_running_total_inout NUMBER(38, 18), p_tax_out NUMBER(38, 18), p_net_out NUMBER(38, 18)
           )
        RETURNS VARIANT
        AS
            BEGIN
                p_running_total_inout := :p_base_amount + :p_bonus_amt;
                p_tax_out := :p_running_total_inout * 0.25;
                p_net_out := :p_running_total_inout - :p_tax_out;
                RETURN OBJECT_CONSTRUCT('p_running_total_inout', :p_running_total_inout, 'p_tax_out', :p_tax_out, 'p_net_out', :p_net_out);
            END;
        call_results VARIANT;
        BEGIN
        call_results := (
            CALL
            calculate_all_components(:p_base_salary, :p_bonus_amount, :l_running_total, :l_tax_amount, :l_net_amount)
        );
        l_running_total := :call_results:p_running_total_inout;
        l_tax_amount := :call_results:p_tax_out;
        l_net_amount := :call_results:p_net_out;
        p_final_salary := :l_net_amount;
        p_tax_calculated := :l_tax_amount;
        p_total_gross := :l_running_total;
        END;
$$;
```

#### Multi-level Nested Procedures

Snowflake only permits one level of nesting for nested procedures. Therefore, a nested procedure within another nested procedure is not supported. If this occurs, the transformation will include the error `!!!RESOLVE EWI!!! /*** SSC-EWI-0111 - ONLY ONE LEVEL OF NESTING IS ALLOWED FOR NESTED PROCEDURES IN SNOWFLAKE. ***/!!!`

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_executive_salary (
    p_result OUT NUMBER
)
AS
    PROCEDURE calculate_senior_level (
        senior_result OUT NUMBER
    )
    AS
        PROCEDURE calculate_base_level (
            base_result OUT NUMBER
        )
        AS
        BEGIN
            base_result := 75000;
        END calculate_base_level;
    BEGIN
        calculate_base_level(senior_result);
        senior_result := senior_result * 1.5;
    END calculate_senior_level;
BEGIN
    calculate_senior_level(p_result);
END calculate_executive_salary;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_executive_salary (p_result OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        calculate_senior_level PROCEDURE (senior_result NUMBER(38, 18)
           )
        RETURNS NUMBER
        AS
            DECLARE
                !!!RESOLVE EWI!!! /*** SSC-EWI-0111 - ONLY ONE LEVEL OF NESTING IS ALLOWED FOR NESTED PROCEDURES IN SNOWFLAKE. ***/!!!
                PROCEDURE calculate_base_level (
                    base_result OUT NUMBER
                )
                AS
                BEGIN
                    base_result := 75000;
                END calculate_base_level;
                call_results NUMBER;
            BEGIN
                call_results := (
                CALL
                calculate_base_level(:senior_result)
                );
                senior_result := :call_results;
                senior_result := :senior_result * 1.5;
                RETURN senior_result;
            END;
        call_results NUMBER;
        BEGIN
        call_results := (
            CALL
            calculate_senior_level(:p_result)
        );
        p_result := :call_results;
        END;
$$;
```

#### Default Values in Nested Procedures

Nested procedure arguments do not support default clauses. Therefore, if a nested procedure call omits an optional parameter, the default value for that argument must be submitted within the procedure call. SnowConvert AI automatically identifies these scenarios and fills the procedure calls appropriately.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_total_compensation (
    p_base_salary IN NUMBER,
    p_final_compensation OUT NUMBER
)
AS
    v_total NUMBER := p_base_salary;
    l_bonus NUMBER;
    PROCEDURE add_bonus (
        p_salary_amount IN NUMBER,
        p_multiplier IN NUMBER DEFAULT 1.1,
        p_calculated_bonus OUT NUMBER
    )
    AS
    BEGIN
        p_calculated_bonus := p_salary_amount * (p_multiplier - 1);
    END add_bonus;
BEGIN
    add_bonus(p_base_salary, p_calculated_bonus => l_bonus);
    v_total := v_total + l_bonus;
    add_bonus(p_base_salary, 1.2, p_calculated_bonus => l_bonus);
    v_total := v_total + l_bonus;
    p_final_compensation := v_total;
END calculate_total_compensation;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_total_compensation (p_base_salary NUMBER(38, 18), p_final_compensation OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        v_total NUMBER(38, 18) := :p_base_salary;
        l_bonus NUMBER(38, 18);
        add_bonus PROCEDURE (p_salary_amount NUMBER(38, 18), p_multiplier NUMBER(38, 18), p_calculated_bonus NUMBER(38, 18)
           )
        RETURNS NUMBER
        AS
            BEGIN
                p_calculated_bonus := :p_salary_amount * (:p_multiplier - 1);
                RETURN p_calculated_bonus;
            END;
        call_results NUMBER;
        BEGIN
        call_results := (
            CALL
            add_bonus(:p_base_salary, 1.1, :l_bonus)
        );
        l_bonus := :call_results;
        v_total := :v_total + :l_bonus;
        call_results := (
            CALL
            add_bonus(:p_base_salary, 1.2, :l_bonus)
        );
        l_bonus := :call_results;
        v_total := :v_total + :l_bonus;
        p_final_compensation := :v_total;
        END;
$$;
```

#### Nested Procedure Overloading

Snowflake does not support the overloading of nested procedures. If this occurs, the EWI `SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED` will be added.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE demonstrate_salary_calculations(
    final_summary OUT VARCHAR2
)
AS
    result1 VARCHAR2(100);
    result2 VARCHAR2(100);
    result3 VARCHAR2(100);
    PROCEDURE calculate_salary(
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Standard: 55000';
    END;
    PROCEDURE calculate_salary(
        base_amount IN NUMBER,
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Calculated: ' || (base_amount * 1.15);
    END;
    PROCEDURE calculate_salary(
        employee_level IN VARCHAR2,
        output OUT VARCHAR2
    )
    AS
    BEGIN
        output := 'Level ' || UPPER(employee_level) || ': 60000';
    END;
BEGIN
    calculate_salary(result1);
    calculate_salary(50000, result2);
    calculate_salary('senior', result3);
    final_summary := result1 || ' | ' || result2 || ' | ' || result3;
END demonstrate_salary_calculations;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE demonstrate_salary_calculations (final_summary OUT VARCHAR
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        result1 VARCHAR(100);
        result2 VARCHAR(100);
        result3 VARCHAR(100);
        calculate_salary PROCEDURE(output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Standard: 55000';
                RETURN output;
            END;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED. ***/!!!
        calculate_salary PROCEDURE(base_amount NUMBER(38, 18), output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Calculated: ' || NVL((:base_amount * 1.15) :: STRING, '');
                RETURN output;
            END;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED. ***/!!!
        calculate_salary PROCEDURE(employee_level VARCHAR, output VARCHAR
            )
        RETURNS VARCHAR
        AS
            BEGIN
                output := 'Level ' || NVL(UPPER(:employee_level) :: STRING, '') || ': 60000';
                RETURN output;
            END;
        call_results VARCHAR;
        BEGIN
        call_results := (
            CALL
            calculate_salary(:result1)
        );
        result1 := :call_results;
        call_results := (
            CALL
            calculate_salary(50000, :result2)
        );
        result2 := :call_results;
        call_results := (
            CALL
            calculate_salary('senior', :result3)
        );
        result3 := :call_results;
        final_summary := NVL(:result1 :: STRING, '') || ' | ' || NVL(:result2 :: STRING, '') || ' | ' || NVL(:result3 :: STRING, '');
        END;
$$;
```

#### Nested procedure without a parameter list

In Snowflake, a nested procedure definition requires empty parentheses `()` to be syntactically valid when it has no parameters; contrary to Oracle, where they are not needed. SnowConvert AI will add these automatically during translation.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE reset_salary_system
AS
    PROCEDURE cleanup_salary_data
    AS
    BEGIN
        DELETE FROM salary_results;
        INSERT INTO salary_results VALUES (0);
    END cleanup_salary_data;
BEGIN
    cleanup_salary_data();
END reset_salary_system;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE reset_salary_system ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        cleanup_salary_data PROCEDURE ()
        RETURNS VARCHAR
        AS
            BEGIN
                DELETE FROM
                salary_results;
            INSERT INTO salary_results
                VALUES (0);
            END;
        BEGIN
        CALL
        cleanup_salary_data();
        END;
$$;
```

#### Nested procedure with REFCURSOR output parameter

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE process_department_salaries (
    p_department_id IN NUMBER
)
AS
    v_employee_cursor SYS_REFCURSOR;
    v_employee_id employees.employee_id%TYPE;
    v_first_name employees.first_name%TYPE;
    v_last_name employees.last_name%TYPE;
    PROCEDURE get_department_employees (
        p_dept_id IN NUMBER,
        p_cursor OUT SYS_REFCURSOR
    )
    AS
    BEGIN
        OPEN p_cursor FOR
            SELECT employee_id, first_name, last_name
            FROM employees
            WHERE department_id = p_dept_id;
    END get_department_employees;
BEGIN
    get_department_employees(p_department_id, v_employee_cursor);
    LOOP
        FETCH v_employee_cursor INTO v_employee_id, v_first_name, v_last_name;
        EXIT WHEN v_employee_cursor%NOTFOUND;
        INSERT INTO salary_audit VALUES (v_employee_id, v_first_name || ' ' || v_last_name);
    END LOOP;
    CLOSE v_employee_cursor;
END process_department_salaries;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE process_department_salaries (p_department_id NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        v_employee_cursor_res RESULTSET;
        v_employee_id NUMBER(38, 18);
        v_first_name VARCHAR(50);
        v_last_name VARCHAR(50);
        get_department_employees PROCEDURE (p_dept_id NUMBER(38, 18), p_cursor VARCHAR
           )
        RETURNS VARCHAR
        AS
            BEGIN
                CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:p_cursor) AS
                SELECT employee_id, first_name, last_name
                FROM
                    employees
                WHERE department_id = :p_dept_id;
                RETURN p_cursor;
            END;
        call_results VARCHAR;
        BEGIN
        call_results := (
            CALL
            get_department_employees(:p_department_id, 'process_department_salaries_v_employee_cursor')
        );
        LET v_employee_cursor CURSOR
        FOR
            SELECT
                *
            FROM
                IDENTIFIER('process_department_salaries_v_employee_cursor');
        OPEN v_employee_cursor;
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            FETCH v_employee_cursor INTO
                :v_employee_id,
                :v_first_name,
                :v_last_name;
            IF (v_employee_id IS NULL) THEN
                EXIT;
            END IF;
            INSERT INTO salary_audit
            SELECT
                :v_employee_id,
                NVL(:v_first_name :: STRING, '') || ' ' || NVL(:v_last_name :: STRING, '');
        END LOOP;
        CLOSE v_employee_cursor;
        END;
$$;
```

#### Nested procedure with NOCOPY parameter option

In Oracle PL/SQL, the NOCOPY keyword is an optimization hint for `OUT` and `IN OUT` procedure parameters. By default, Oracle passes these parameters by value, creating an expensive copy of the data during the call and copying it back upon completion. This can cause significant performance overhead for large data structures.

NOCOPY instructs Oracle to pass by reference instead, allowing the procedure to directly modify the original data. This eliminates copying overhead and improves performance. However, changes are immediate and are not implicitly rolled back if an unhandled exception occurs within the procedure.

Therefore, we will remove the NOCOPY parameters option and add the FDM `SSC-FDM-OR0050 - EXCEPTIONS WITH NOCOPY PARAMETERS MAY LEAD TO DATA INCONSISTENCY`. This is because procedure execution terminates upon hitting an exception, preventing the `RETURN` statement from being reached. As a result, the variable in the caller’s declare block retains its initial values, as the procedure fails to successfully return a new value for assignment.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE calculate_bonus_with_nocopy (
    p_base_salary IN NUMBER,
    p_multiplier IN NUMBER,
    p_bonus_result OUT NOCOPY NUMBER
)
AS
    PROCEDURE compute_bonus(bonus_amount OUT NOCOPY NUMBER)
    AS
    BEGIN
        IF p_multiplier = 0 THEN
            bonus_amount := NULL;
        ELSE
            bonus_amount := p_base_salary * p_multiplier * 0.1;
        END IF;
    END compute_bonus;
BEGIN
    compute_bonus(p_bonus_result);
END calculate_bonus_with_nocopy;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE calculate_bonus_with_nocopy (p_base_salary NUMBER(38, 18), p_multiplier NUMBER(38, 18), p_bonus_result OUT NUMBER(38, 18)
    )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/22/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
        DECLARE
        compute_bonus PROCEDURE(bonus_amount
        --** SSC-FDM-OR0050 - EXCEPTIONS WITH NOCOPY PARAMETERS MAY LEAD TO DATA INCONSISTENCY. **
        NUMBER(38, 18))
        RETURNS NUMBER
        AS
            BEGIN
                IF (:p_multiplier = 0) THEN
                bonus_amount := NULL;
            ELSE
                bonus_amount := :p_base_salary * :p_multiplier * 0.1;
                END IF;
                RETURN bonus_amount;
            END;
        call_results NUMBER;
        BEGIN
        call_results := (
            CALL
            compute_bonus(:p_bonus_result)
        );
        p_bonus_result := :call_results;
        END;
$$;
```

### Known Issues

#### 1. Multi-level Nested Procedures

Our transformation efforts for nested procedures in Snowflake are limited to those nested directly within other procedures, supporting only one level of nesting. If the nesting level exceeds one, or if a procedure is nested within a standalone function, transformation is not supported, and the EWI `!!!RESOLVE EWI!!! /*** SSC-EWI-0111 - ONLY ONE LEVEL OF NESTING IS ALLOWED FOR NESTED PROCEDURES IN SNOWFLAKE. ***/!!!` will be added.

#### 2. Nested procedures overloading

Additionally, overloading of nested procedures is not supported in Snowflake. In such cases, the EWI `!!!RESOLVE EWI!!! /*** SSC-EWI-0112 - NESTED PROCEDURE OVERLOADING IS NOT SUPPORTED. ***/!!!` will be added.

#### 3. Nested procedures within anonymous blocks

Transformation for nested procedures within anonymous blocks is currently pending. The EWI `!!!RESOLVE EWI!!! /*** SSC-EWI-OR0057 - TRANSFORMATION FOR NESTED PROCEDURE OR FUNCTION IS NOT SUPPORTED IN THIS SCENARIO ***/!!!` will be added.

### Related EWIs

1. [SSC-FDM-OR0050](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Exceptions with `NOCOPY` parameters may lead to data inconsistency.
2. [SSC-EWI-OR0057](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Transformation for nested procedure or function is not supported.
3. [SSC-EWI-0111](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Only one level of nesting is allowed for nested procedures in Snowflake.
4. [SSC-EWI-0112](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Nested procedure overloading is not supported.

## PROCEDURE CALL

Translation reference for PROCEDURE CALL aka SUBPROGRAM INVOCATION

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This section describes the syntax for subprogram invocations within PL blocks, such as procedures or anonymous blocks.

For more information on this subject, please refer to Oracle’s Subprogram documentation: ([Oracle PL/SQL Language Reference Subprogram Invocation Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/plsql-subprograms.html#GUID-C04B6BF9-1B19-42F9-82D8-CA137E97A024))

Procedure calls can be migrated to Snowflake as long as there are no optional parameters and their order matches the formal parameters. Please note that Procedure invocations get migrated to a Call statement.

#### Oracle Subprogram Invocation Syntax

```sql
<subprogram invocation> := subprogram_name [ ( [ parameter [, parameter]... ] ) ]

<parameter> := {
  <actual parameter>
  | <formal parameter name> => <actual parameter>
  }
```

Snowflake Scripting has support for this statement, albeit with some functional differences.

##### Snow Scripting Subprogram Invocation Syntax

```sql
<subprogram invocation> := CALL subprogram_name [ ( [ parameter [, parameter]... ] ) ]

<parameter> := {
  <actual parameter>
  | <formal parameter name> => <actual parameter>
  }
```

### Sample Source Patterns

> **Note:**
>
> **Consider the next table and procedure for the examples below.**

#### Oracle

```sql
CREATE TABLE procedure_call_test_table(
    col1 INTEGER
);

-- Simple Called procedure
CREATE OR REPLACE PROCEDURE called_procedure (param1 INTEGER)
AS
BEGIN
    INSERT INTO procedure_call_test_table VALUES (param1);
END;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE procedure_call_test_table (
        col1 INTEGER
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    -- Simple Called procedure
CREATE OR REPLACE PROCEDURE called_procedure (param1 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO procedure_call_test_table
        VALUES (:param1);
    END;
$$;
```

#### Simple call

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE simple_calling_procedure
AS
BEGIN
    called_procedure(1);
END;

CALL simple_calling_procedure();

SELECT * FROM procedure_call_test_table;
```

##### Result

| COL1 |
| --- |
| 1 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE simple_calling_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL
        called_procedure(1);
    END;
$$;

CALL simple_calling_procedure();

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "procedure_call_test_table" **

SELECT * FROM
    procedure_call_test_table;
```

##### Result

| COL1 |
| --- |
| 1 |

#### Calling a procedure with an optional parameter

> **Warning:**
>
> This sample contains manual intervention for some functional differences and is used to explain them. For more information on these differences, please check the Known Issues section below.

##### Oracle

```sql
-- Procedure with optional parameters
CREATE OR REPLACE PROCEDURE proc_optional_parameters (param1 INTEGER, param2 INTEGER := 8, param3 INTEGER)
AS
BEGIN
    INSERT INTO procedure_call_test_table VALUES (param1);
    INSERT INTO procedure_call_test_table VALUES (param2);
    INSERT INTO procedure_call_test_table VALUES (param3);
END;

CREATE OR REPLACE PROCEDURE calling_procedure
AS
BEGIN
    -- positional convention
    proc_optional_parameters(1, 2, 3);

    -- named convention
    proc_optional_parameters(param1 => 4, param2 => 5, param3 => 6);

    -- named convention, second gets ommited
    proc_optional_parameters(param1 => 7, param3 => 9);

    -- named convention, different order
    proc_optional_parameters(param3 => 12, param1 => 10, param2 => 11);
END;

CALL calling_procedure();

SELECT * FROM procedure_call_test_table;
```

##### Result

| COL1 |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |
| 10 |
| 11 |
| 12 |

##### Snowflake Scripting

```sql
-- Procedure with optional parameters
CREATE OR REPLACE PROCEDURE proc_optional_parameters
                                                     --** SSC-FDM-0041 - DEFAULT PARAMETERS WERE REORDERED TO THE END OF THE PARAMETER LIST TO MATCH SNOWFLAKE REQUIREMENTS. CALLERS USING POSITIONAL ARGUMENTS MAY NEED TO BE UPDATED **
                                                     (param1 INTEGER, param2 INTEGER DEFAULT 8, param3 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO procedure_call_test_table
        VALUES (:param1);
        INSERT INTO procedure_call_test_table
        VALUES (:param2);
        INSERT INTO procedure_call_test_table
        VALUES (:param3);
    END;
$$;

CREATE OR REPLACE PROCEDURE calling_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL
        -- positional convention
        proc_optional_parameters(1, 2, 3);
        CALL

        -- named convention
        proc_optional_parameters(param1 => 4, param2 => 5, param3 => 6);
        CALL

        -- named convention, second gets ommited
        proc_optional_parameters(param1 => 7, param3 => 9);
        CALL

        -- named convention, different order
        proc_optional_parameters(param1 => 10, param2 => 11, param3 => 12);
    END;
$$;

CALL calling_procedure();

SELECT * FROM
    procedure_call_test_table;
```

##### Result

| COL1 |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |
| 10 |
| 11 |
| 12 |

### Known Issues

#### 1. Default parameter reordering

Snowflake requires default parameters to appear at the end of the parameter list. SnowConvert AI automatically reorders them and emits an [SSC-FDM-0041](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) notice. When positional callers are detected, they are converted to named arguments.

##### 2. Named parameters are accepted, but not functionally equivalent

Named parameters are supported in Snowflake. When default parameters are reordered, SnowConvert AI automatically converts positional call sites to use named arguments to preserve the original semantics.

##### 3. Calling Subprograms with Out Parameters is not supported

Snowflake does not have support for parameter modes, however, a solution is being implemented to emulate their functionality. To get more information about the transformation for output parameters please go to the following article Output Parameters.

### Related EWIs

1. [SSC-FDM-0041](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Default parameters were reordered to the end of the parameter list.
2. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.

## RAISE

### Description

> The `RAISE` statement explicitly raises an exception.
>
> Outside an exception handler, you must specify the exception name. Inside an exception handler, if you omit the exception name, the `RAISE` statement reraises the current exception.([Oracle PL/SQL Language Reference Raise Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/RAISE-statement.html#GUID-5F58843F-84C8-4768-A7B3-0E318948A88B))

The statement is fully supported by Snowflake Scripting, but please take into account that there might be some differences when having some Commit and Rollback Statement.

```sql
RAISE <exception_name> ;
```

Snowflake Scripting has support for this statement.

```sql
RAISE <exception_name> ;
```

### Sample Source Patterns

#### Simple exception throw

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE simple_exception_throw_handle(param1 INTEGER)
IS
    my_exception EXCEPTION;
    my_other_exception EXCEPTION;
BEGIN
    IF param1 > 0
        THEN RAISE my_exception;
    END IF;
EXCEPTION
    WHEN my_exception THEN
        IF param1 = 1
            THEN RAISE;
        END IF;
        RAISE my_other_exception;
END;

--Completes without issue
CALL simple_exception_throw_handle(0);
--Throws my_exception
CALL simple_exception_throw_handle(1);
--Throws my_exception, catches then raises second my_other_exception
CALL simple_exception_throw_handle(2);
```

###### Result

```sql
Call completed.
-----------------------------------------------------------------------
Error starting at line : 31 in command -
CALL simple_exception_throw_handle(1)
Error report -
ORA-06510: PL/SQL: unhandled user-defined exception
ORA-06512: at "SYSTEM.SIMPLE_EXCEPTION_THROW_HANDLE", line 12
ORA-06512: at "SYSTEM.SIMPLE_EXCEPTION_THROW_HANDLE", line 7
ORA-06512: at line 1
06510. 00000 -  "PL/SQL: unhandled user-defined exception"
*Cause:    A user-defined exception was raised by PL/SQL code, but
           not handled.
*Action:   Fix the problem causing the exception or write an exception
           handler for this condition. Or you may need to contact your
           application administrator or DBA.
-----------------------------------------------------------------------
Error starting at line : 33 in command -
CALL simple_exception_throw_handle(2)
Error report -
ORA-06510: PL/SQL: unhandled user-defined exception
ORA-06512: at "SYSTEM.SIMPLE_EXCEPTION_THROW_HANDLE", line 14
ORA-06510: PL/SQL: unhandled user-defined exception
ORA-06512: at "SYSTEM.SIMPLE_EXCEPTION_THROW_HANDLE", line 7
ORA-06512: at line 1
06510. 00000 -  "PL/SQL: unhandled user-defined exception"
*Cause:    A user-defined exception was raised by PL/SQL code, but
           not handled.
*Action:   Fix the problem causing the exception or write an exception
           handler for this condition. Or you may need to contact your
           application administrator or DBA.
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE simple_exception_throw_handle (param1 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        my_exception EXCEPTION;
        my_other_exception EXCEPTION;
    BEGIN
        IF (:param1 > 0) THEN
            RAISE my_exception;
        END IF;
        EXCEPTION
            WHEN my_exception THEN
            IF (:param1 = 1) THEN
                    RAISE;
            END IF;
                RAISE my_other_exception;
        END;
$$;

--Completes without issue
CALL simple_exception_throw_handle(0);

--Throws my_exception
CALL simple_exception_throw_handle(1);

--Throws my_exception, catches then raises second my_other_exception
CALL simple_exception_throw_handle(2);
```

###### Result

```sql
Call Completed
-----------------------------------------------------------------------
Uncaught exception of type 'MY_EXCEPTION' on line 7 at position 9
-----------------------------------------------------------------------
Uncaught exception of type 'MY_OTHER_EXCEPTION' on line 14 at position 9
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## RAISE_APPICATION_ERROR

Translation reference for the raise_application_error statement.

### General description

The procedure `RAISE_APPLICATION_ERROR` lets you issue user-defined `ORA-` error messages from stored subprograms. That way, you can report errors to your application and avoid returning unhandled exceptions ([`Oracle documentation`](https://docs.oracle.com/cd/B19306_01/appdev.102/b14261/errors.htm)).

#### **Oracle syntax**

```sql
raise_application_error(
      error_number, message[, {TRUE | FALSE}]);
```

> **Note:**
>
> The `error_number` is a negative integer in the range -20000 .. -20999 and `message` is a character string up to 2048 bytes long.
>
> If the optional third parameter is **TRUE**, the error is placed on the stack of previous errors. If the parameter is **FALSE** (the default), the error replaces all previous errors.

The equivalent statement in Snowflake is the RAISE clause, nevertheless, it is required to declare the user-defined exception as a variable before calling the RAISE statement for it.

#### **Snowflake Syntax**

```sql
<exception_name> EXCEPTION [ ( <exception_number> , '<exception_message>' ) ] ;
```

> **Note:**
>
> For more information review the following [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/exception#label-snowscript-introduction-exceptions-handling-an-exception-examples).

### Sample Source Patterns

#### 1. Exception in functions without declaring section

In this scenario, the function without a declaring section is translated to a procedure with the exception declaration. Please note that:

* The exception variable name is declared in upper case.
* The exception variable name is based on the description and an ending is composed of an exception code name followed by a consecutive number.
* The declaring section is created even though the initial function or procedure does not contain it.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION TEST(
    SAMPLE_A IN NUMBER DEFAULT NULL,
    SAMPLE_B IN NUMBER DEFAULT NULL
)
RETURN NUMBER
AS
BEGIN
    raise_application_error(-20001, 'First exception message', FALSE);
    raise_application_error(-20002, 'Second exception message');
  RETURN 1;
END TEST;
```

##### Output

```none
ORA-20001: First exception message
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE TEST (
    SAMPLE_A NUMBER(38, 18) DEFAULT NULL,
    SAMPLE_B NUMBER(38, 18) DEFAULT NULL
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    FIRST_EXCEPTION_MESSAGE_EXCEPTION_CODE_0 EXCEPTION (-20001, 'FIRST EXCEPTION MESSAGE');
    SECOND_EXCEPTION_MESSAGE_EXCEPTION_CODE_1 EXCEPTION (-20002, 'SECOND EXCEPTION MESSAGE');
  BEGIN
    --** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT FALSE WAS REMOVED. **
    RAISE FIRST_EXCEPTION_MESSAGE_EXCEPTION_CODE_0;
    RAISE SECOND_EXCEPTION_MESSAGE_EXCEPTION_CODE_1;
    RETURN 1;
  END;
$$;
```

##### Output

```none
FIRST EXCEPTION MESSAGE
```

#### 2. Exception code number outside limits

The following example shows the translation commented out in the procedure body. It is because the code is outside the applicable code limits in Snowflake. The solution is to change the exception code for an available code in the query section.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION TEST(
    SAMPLE_A IN NUMBER DEFAULT NULL,
    SAMPLE_B IN NUMBER DEFAULT NULL
)
RETURN NUMBER
AS
BEGIN
    raise_application_error(-20000, 'My exception message');
    RETURN 1;
END TEST;
```

##### Output

```none
ORA-20000: My exception message
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE TEST (
    SAMPLE_A NUMBER(38, 18) DEFAULT NULL,
    SAMPLE_B NUMBER(38, 18) DEFAULT NULL
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_EXCEPTION_MESSAGE_EXCEPTION_CODE_0 EXCEPTION (-20000, 'MY EXCEPTION MESSAGE');
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0099 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS ***/!!!
        RAISE MY_EXCEPTION_MESSAGE_EXCEPTION_CODE_0;
        RETURN 1;
    END;
$$;
```

##### Output

```none
 Invalid error code '-20,000'. Must be between -20,999 and -20,000
```

#### 3. Exception stack functionality

The exception stack functionality is not supported in Snowflake and is removed from the exception declaration.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION TEST(
    SAMPLE_A IN NUMBER DEFAULT NULL,
    SAMPLE_B IN NUMBER DEFAULT NULL
)
RETURN NUMBER
AS
BEGIN
    raise_application_error(-20001, 'My exception message', TRUE);
    RETURN 1;
END TEST;
```

##### Output

```none
ORA-20001: My exception message
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE TEST (
    SAMPLE_A NUMBER(38, 18) DEFAULT NULL,
    SAMPLE_B NUMBER(38, 18) DEFAULT NULL
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        MY_EXCEPTION_MESSAGE_EXCEPTION_CODE_0 EXCEPTION (-20001, 'MY EXCEPTION MESSAGE');
    BEGIN
        --** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT TRUE WAS REMOVED. **
        RAISE MY_EXCEPTION_MESSAGE_EXCEPTION_CODE_0;
        RETURN 1;
    END;
$$;
```

##### Output

```none
MY EXCEPTION MESSAGE
```

#### 4. Multiple exceptions with the same exception code

Multiple exceptions with the same can coexist in the declaring section and raise statements.

##### Oracle

```sql
CREATE OR REPLACE FUNCTION TEST(
    SAMPLE_A IN NUMBER DEFAULT NULL,
    SAMPLE_B IN NUMBER DEFAULT NULL
)
RETURN NUMBER
AS
BEGIN
    IF TRUE THEN
        raise_application_error(-20001, 'The first exception');
    ELSE
        raise_application_error(-20001, 'Other exception inside');
    END IF;
    RETURN 1;
END TEST;
```

##### Output

```none
ORA-20000: The first exception
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE TEST (
    SAMPLE_A NUMBER(38, 18) DEFAULT NULL,
    SAMPLE_B NUMBER(38, 18) DEFAULT NULL
)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        THE_FIRST_EXCEPTION_EXCEPTION_CODE_0 EXCEPTION (-20001, 'THE FIRST EXCEPTION');
        OTHER_EXCEPTION_INSIDE_EXCEPTION_CODE_1 EXCEPTION (-20001, 'OTHER EXCEPTION INSIDE');
    BEGIN
        IF (TRUE) THEN
            RAISE THE_FIRST_EXCEPTION_EXCEPTION_CODE_0;
            ELSE
            RAISE OTHER_EXCEPTION_INSIDE_EXCEPTION_CODE_1;
            END IF;
            RETURN 1;
    END;
$$;
```

##### Output

```none
THE FIRST EXCEPTION
```

### Known Issues

1. SQLREM function may be reviewed.
2. Exception code number outside the applicable limits in Snowflake has to be changed to an available code exception.
3. Add to a stack of errors is not supported.

### Related EWIs

1. [SSC-EWI-OR0099](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The exception code exceeds the Snowflake Scripting limit.
2. [SSC-FDM-0029](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): User defined function was transformed to a Snowflake procedure.
3. [SSC-FDM-OR0011](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): The boolean argument was removed because the “add to stack” options is not supported.

## UDF CALL

Translation reference for User-defined function (UDF) Call

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

As is widely acknowledged, non-scalar user-defined functions (UDFs) in Oracle are converted into Snowflake stored procedures to accommodate more intricate functionalities.

This transformation also alters the way the function is invoked, transitioning from a traditional function call to a stored procedure call.

For additional details regarding the invocation of stored procedures, refer to the documentation accessible here: PROCEDURE CALL.

### Sample Source Patterns

> **Note:**
>
> **Consider the next function and tables for the examples below.**

#### Oracle

```sql
CREATE OR REPLACE FUNCTION sum_to_varchar_function(p_number1 IN NUMBER, p_number2 IN NUMBER)
RETURN VARCHAR
IS
    result VARCHAR(100);
BEGIN
    result := TO_CHAR(p_number1 + p_number2);
    RETURN result;
END sum_to_varchar_function;

CREATE TABLE example_table (
    id NUMBER,
    column1 NUMBER
);
INSERT INTO example_table VALUES (1, 15);

CREATE TABLE result_table (
    id NUMBER,
    result_col VARCHAR(100)
);
```

##### Snowflake

```sql
CREATE OR REPLACE FUNCTION sum_to_varchar_function (p_number1 NUMBER(38, 18), p_number2 NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/14/2024",  "domain": "test" }}'
AS
$$
    WITH declaration_variables_cte1 AS
    (
        SELECT
            TO_CHAR(p_number1 + p_number2) AS
            result
    )
    SELECT
        result
    FROM
        declaration_variables_cte1
$$;

CREATE OR REPLACE TABLE example_table (
       id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
       column1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/14/2024",  "domain": "test" }}'
;

INSERT INTO example_table
VALUES (1, 15);

CREATE OR REPLACE TABLE result_table (
    id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
       result_col VARCHAR(100)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "11/14/2024",  "domain": "test" }}'
;
```

#### UDF Call

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure_calling_function(param1 IN NUMBER)
IS
    result_value VARCHAR(200);
BEGIN
    result_value := sum_to_varchar_function(3, param1);
    INSERT INTO result_table VALUES (1, result_value);
END;

BEGIN
    procedure_calling_function(5);
END;
```

##### Result

```none
ID	RESULT_COL
1	8
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE procedure_calling_function (param1 NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        result_value VARCHAR(200);
    BEGIN
        result_value := sum_to_varchar_function(3, :param1) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'sum_to_varchar_function' NODE ***/!!!;
        INSERT INTO result_table
        VALUES (1, :result_value);
    END;
$$;

DECLARE
    call_results VARIANT;

    BEGIN
    CALL
    procedure_calling_function(5);
    RETURN call_results;
    END;
```

##### Result

```none
ID	RESULT_COL
1	8
```

#### UDF Call within a query

When a function call is embedded within a query, the invocation process becomes more intricate due to Snowflake’s limitation of not being able to call procedures directly within queries. To overcome this limitation, the procedure invocation is moved outside the query, and the result is assigned to a variable. This variable is then referenced within the query, thereby achieving functional equivalence. This approach allows for the execution of more complex behaviors within Snowflake queries while adhering to the procedural constraints.

##### Oracle

```sql
CREATE OR REPLACE PROCEDURE procedure_calling_function(param1 IN NUMBER)
IS
    result_value VARCHAR(200);
    result_value2 VARCHAR(200);
BEGIN
    SELECT
        sum_to_varchar_function(1, param1) AS result_column,
        sum_to_varchar_function(2, param1) AS result_column2
    INTO result_value, result_value2
    FROM example_table ext;

    INSERT INTO result_table VALUES (1, result_value);
    INSERT INTO result_table VALUES (2, result_value2);
END;

BEGIN
    procedure_calling_function(5);
END;
```

##### Result

```none
ID	RESULT_COL
1	6
2   7
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE procedure_calling_function (param1 NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        result_value VARCHAR(200);
        result_value2 VARCHAR(200);
    BEGIN
        SELECT
            sum_to_varchar_function(1, :param1) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'sum_to_varchar_function' NODE ***/!!! AS result_column,
            sum_to_varchar_function(2, :param1) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'sum_to_varchar_function' NODE ***/!!! AS result_column2
        INTO
            :result_value,
            :result_value2
        FROM
            example_table ext;

        INSERT INTO result_table
        VALUES (1, :result_value);
        INSERT INTO result_table
        VALUES (2, :result_value2);
    END;
$$;

DECLARE
    call_results VARIANT;

    BEGIN
    CALL
    procedure_calling_function(5);
    RETURN call_results;
    END;
```

##### Result

```none
ID	RESULT_COL
1	6
2   7
```

### Known Issues

#### 1. Unsupported Usage of UDFs in Queries with Query Dependencies

When calling User-Defined Functions (UDFs) within queries with query dependencies, scenarios involving embedded functions with columns as arguments are not supported. This limitation arises because the column values cannot be accessed from outside the query. Examples of unsupported scenarios include:

```sql
BEGIN
    SELECT
        sum_to_varchar_function(ext.col1, ext.col2) -- columns as arguments not supported
    INTO
        result_value
    FROM example_table ext;
END;
```

The supported scenarios include function calls with other types of arguments such as literal values, external variables, or parameters. For instance:

```sql
BEGIN
    SELECT
        sum_to_varchar_function(100, param1)
    INTO
        result_value
    FROM example_table ext;
END;
```

In the supported scenarios, the function can effectively be migrated.

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
3. [SSC-FDM-0029](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): User defined function was transformed to a Snowflake procedure.

## WHILE

Translation reference to convert Oracle WHILE statement to Snowflake Scripting

### Description

> The `WHILE` `LOOP` statement runs one or more statements while a condition is `TRUE`.
> ([Oracle PL/SQL Language Reference WHILE Statement](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/WHILE-LOOP-statement.html#GUID-9339C3AD-7F41-4D3F-9B2D-6FC5DCE44C6B))

#### Oracle WHILE Syntax

```sql
WHILE boolean_expression
  LOOP statement... END LOOP [ label ] ;
```

##### Snowflake Scripting WHILE Syntax

```sql
WHILE ( <condition> ) { DO | LOOP }
  <statement>;
  [ <statement>; ... ]
END { WHILE | LOOP } [ <label> ] ;
```

Oracle `WHILE` behavior can also be modified by using the statements:

* CONTINUE
* EXIT
* GOTO
* RAISE

### Sample Source Patterns

#### While simple case

> **Note:**
>
> This case is functionally equivalent.

##### Oracle

```sql
CREATE TABLE while_testing_table
(
    iterator VARCHAR2(5)
);

CREATE OR REPLACE PROCEDURE while_procedure
IS
I NUMBER := 1;
J NUMBER := 10;
BEGIN
  WHILE I <> J LOOP
    INSERT INTO while_testing_table VALUES(TO_CHAR(I));
    I := I+1;
  END LOOP;
END;

CALL while_procedure();
SELECT * FROM while_testing_table;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE TABLE while_testing_table
(
    iterator VARCHAR(5)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

CREATE OR REPLACE PROCEDURE while_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
    I NUMBER(38, 18) := 1;
    J NUMBER(38, 18) := 10;
BEGIN
    WHILE (:I <> :J)
    --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
      INSERT INTO while_testing_table
      VALUES(TO_CHAR(:I));
      I := :I +1;
    END LOOP;
END;
$$;

CALL while_procedure();

SELECT * FROM
while_testing_table;
```

##### Result

| ITERATOR |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 5 |
| 6 |
| 7 |
| 8 |
| 9 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Oracle - Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/etl-bi-repointing/power-bi-oracle-repointing.md
section: Migrations
---

# SnowConvert AI - Oracle - Power BI Repointing

## Description

The Power BI repointing is a feature that provides an easy way to redefine the connections from the M language in the Power Query Editor. This means that the connection parameters will be redefined to point to the Snowflake migration database context. For Oracle, the method in M Language that defined the connection is `Oracle.Database(...).` In Snowflake, there is a connector that depends on some other parameters and the main connection is defined by `Snowflake.Database(...)` method.

## Source Pattern Samples

### Entity Repointing Case: Table

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a table.

**Oracle Connection in the Power Query Editor**

```sql
let
    Source = Oracle.Database("the_oracle_server", [HierarchicalNavigation=true]),
    #"C##POWERBI_USER" = Source{[Schema="C##POWERBI_USER"]}[Data],
    EMPLOYEES_B1 = #"C##POWERBI_USER"{[Name="EMPLOYEES_B"]}[Data]
in
    EMPLOYEES_B1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="C##POWERBI_USER", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="EMPLOYEES_B", Kind="Table"]}[Data],
    EMPLOYEES_B1 = Table.RenameColumns(SourceSfTbl, {{ "EMPLOYEE_ID", "EMPLOYEE_ID"}, { "FIRST_NAME", "FIRST_NAME"}, { "LAST_NAME", "LAST_NAME"}, { "DEPARTMENT_ID", "DEPARTMENT_ID"}})
in
    EMPLOYEES_B1
```

### Entity Repointing Case: View

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a view.

**Oracle Connection in the Power Query Editor**

```sql
let
    Source = Oracle.Database("the_oracle_server", [HierarchicalNavigation=true]),
    #"C##POWERBI_USER" = Source{[Schema="C##POWERBI_USER"]}[Data],
    DEPARTMENTS_V1 = #"C##POWERBI_USER"{[Name="DEPARTMENTS_V"]}[Data]
in
    DEPARTMENTS_V1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="C##POWERBI_USER", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="DEPARTMENTS_V", Kind="View"]}[Data],
    DEPARTMENTS_V1 = Table.RenameColumns(SourceSfTbl, {{ "DEPARTMENT_ID", "DEPARTMENT_ID"}, { "DEPARTMENT_NAME", "DEPARTMENT_NAME"}})
in
    DEPARTMENTS_V1
```

### Embedded SQL Case

This case refers to connections that contain embedded SQL inside them. This sample shows a simple query, but SnowConvert AI covers a range of larger scenarios. Besides, depending on the migrated query, there may be warning messages known as EWI—PRF—FDM. This will help the user identify patterns that need extra attention.

**Oracle Connection in the Power Query Editor**

```sql
let
    Source = Oracle.Database("the_oracle_server", [HierarchicalNavigation=true, Query="SELECT * FROM DEPARTMENTS_V"])
in
    Source
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "SELECT * FROM
DEPARTMENTS_V", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "DEPARTMENT_ID", "DEPARTMENT_ID"}, { "DEPARTMENT_NAME", "DEPARTMENT_NAME"}})
in
    Source
```

---
title: SnowConvert AI - Oracle - Pseudocolumns
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/pseudocolumns.md
section: Migrations
---

# SnowConvert AI - Oracle - Pseudocolumns

## ROWID

Translation spec for ROWID pseudocolumn

### Description

> For each row in the database, the `ROWID` pseudocolumn returns the address of the row. ([Oracle SQL Language Reference Rowid pseudocolumn](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/ROWID-Pseudocolumn.html#GUID-F6E0FBD2-983C-495D-9856-5E113A17FAF1))

Snowflake does not have an equivalent for ROWID. The pseudocolumn is transformed to *NULL* in order to avoid runtime errors.

```sql
ROWID
```

### Sample Source Patterns

#### Oracle

```sql
CREATE TABLE sample_table
(
    sample_column varchar(10)
);

INSERT INTO sample_table(sample_column) VALUES ('text 1');
INSERT INTO sample_table(sample_column) VALUES ('text 2');

SELECT ROWID FROM sample_table;
SELECT MAX(ROWID) FROM sample_table;
```

##### Result Query 1

```sql
|ROWID             |
|------------------|
|AAASfCAABAAAIcpAAA|
|AAASfCAABAAAIcpAAB|
```

##### Result Query 2

```sql
|MAX(ROWID)        |
|------------------|
|AAASfCAABAAAIcpAAB|
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE sample_table
    (
        sample_column varchar(10)
    )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO sample_table(sample_column) VALUES ('text 1');

INSERT INTO sample_table(sample_column) VALUES ('text 2');

SELECT
--** SSC-FDM-OR0030 - ROWID PSEUDOCOLUMN IS NOT SUPPORTED IN SNOWFLAKE, IT WAS CONVERTED TO NULL TO AVOID RUNTIME ERRORS **
'' AS ROWID
FROM
sample_table;

SELECT MAX(
--** SSC-FDM-OR0030 - ROWID PSEUDOCOLUMN IS NOT SUPPORTED IN SNOWFLAKE, IT WAS CONVERTED TO NULL TO AVOID RUNTIME ERRORS **
'' AS ROWID) FROM
sample_table;
```

##### Result Query 1

| NULL |
| --- |
|  |
|  |

### Known Issues

No issues were found.

### Related EWIs

* [SSC-FDM-OR0030](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): ROWID pseudocolumn is not supported in Snowflake

## ROWNUM

Translation spec for ROWNUM pseudocolumn

### Description

> For each row returned by a query, the `ROWNUM` pseudocolumn returns a number indicating the order in which Oracle selects the row from a table or set of joined rows. ([Oracle SQL Language Reference Rownum pseudocolumn](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/ROWNUM-Pseudocolumn.html#GUID-2E40EC12-3FCF-4A4F-B5F2-6BC669021726))

Snowflake does not have an equivalent for ROWNUM. The approach for the transformation is taking advantage of the Snowflake [seq8](https://docs.snowflake.com/en/sql-reference/functions/seq1.html) function to emulate the functionality.

```sql
ROWNUM
```

### Sample Source Patterns

#### Oracle

```sql
-- Table with sample data
CREATE TABLE TABLE1(COL1 VARCHAR(20), COL2 NUMBER);
INSERT INTO TABLE1 (COL1, COL2) VALUES('ROWNUM: ', null);
INSERT INTO TABLE1 (COL1, COL2) VALUES('ROWNUM: ', null);

-- Query 1: ROWNUM in a select

@@ -159,10 +171,10 @@ SELECT ROWNUM FROM TABLE1;
-- Query 2: ROWNUM in DML
UPDATE TABLE1 SET COL2 = ROWNUM;
SELECT * FROM TABLE1;
```

##### Result Query 1

```sql
|ROWNUM|
|------|
|1     |
|2     |
```

##### Result Query 2

```sql
|COL1    |COL2|
|--------|----|
|ROWNUM: |1   |
|ROWNUM: |2   |
```

##### Snowflake

```sql
-- Table with sample data
CREATE OR REPLACE TABLE TABLE1 (COL1 VARCHAR(20),
COL2 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}}'
;

INSERT INTO TABLE1(COL1, COL2) VALUES('ROWNUM: ', null);

INSERT INTO TABLE1(COL1, COL2) VALUES('ROWNUM: ', null);

-- Query 1: ROWNUM in a select
SELECT
seq8() + 1
FROM
TABLE1;

-- Query 2: ROWNUM in DML
UPDATE TABLE1
SET COL2 = seq8() + 1;

SELECT * FROM
TABLE1;
```

##### Result Query 1

```sql
|SEQ8() + 1|
|----------|
|1         |
|2         |
```

##### Result Query 2

```sql
|COL1    |COL2|
|--------|----|
|ROWNUM: |1   |
|ROWNUM: |2   |
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-0006:](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) Number type column may not behave similarly in Snowflake

---
title: SnowConvert AI - Oracle - Rowid Data Type
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/rowid-types.md
section: Migrations
---

# SnowConvert AI - Oracle - Rowid Data Type

## Description

> Each row in the database has an address. ([Oracle SQL Language Reference Rowid Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-4231B94A-97E9-4B59-91EB-E7B2D0DA438C))

## ROWID DataType

### Description

> The rows in heap-organized tables that are native to Oracle Database have row addresses called rowids. You can examine a rowid row address by querying the pseudocolumn ROWID. Values of this pseudocolumn are strings representing the address of each row. These strings have the data type ROWID. You can also create tables and clusters that contain actual columns having the ROWID data type. ([Oracle SQL Language Reference ROWID Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-AEF1FE4C-2DE5-4BE7-BB53-83AD8F1E34EF))

```none
ROWID
```

### Sample Source Patterns

#### ROWID in Create Table

##### Oracle

```sql
CREATE TABLE rowid_table
(
    rowid_column ROWID
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE rowid_table
    (
        rowid_column VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Insert data in the ROWID column

It is possible to insert data in ROWID columns if the insert has a valid ROWID, as shown in the example below. Unfortunately retrieving ROWID from a table is not allowed.

##### Oracle

```sql
INSERT INTO rowid_table VALUES ('AAATtCAAMAAAADLABD');

SELECT rowid_column FROM rowid_table;
```

##### Result

| ROWID_COLUMN |
| --- |
| AAATtCAAMAAAADLABD |

##### Snowflake

```sql
INSERT INTO rowid_table
VALUES ('AAATtCAAMAAAADLABD');

SELECT rowid_column FROM
rowid_table;
```

##### Result

| ROWID_COLUMN |
| --- |
| AAATtCAAMAAAADLABD |

### Known Issues

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove this clause to retrieve the entire result set.

**1. Retrieving ROWID from a table that does not have an explicit column with this data type**

As mentioned in the [Snowflake forum](https://community.snowflake.com/s/question/0D50Z00007jUWEU/how-to-convert-oracle-rowids-to-snowflake-sql), ROWID is not supported by Snowflake. The following query displays an error in Snowflake since hr.employees do not contain a ROWID column.

#### Oracle

```sql
SELECT
    ROWID
FROM
    hr.employees
FETCH NEXT 10 ROWS ONLY;
```

##### Result

| ROWID |
| --- |
| AAATtCAAMAAAADLABD |
| AAATtCAAMAAAADLABV |
| AAATtCAAMAAAADLABX |
| AAATtCAAMAAAADLAAv |
| AAATtCAAMAAAADLAAV |
| AAATtCAAMAAAADLAAD |
| AAATtCAAMAAAADLABL |
| AAATtCAAMAAAADLAAP |
| AAATtCAAMAAAADLAA6 |
| AAATtCAAMAAAADLABg |

##### Snowflake

```sql
SELECT
    --** SSC-FDM-OR0030 - ROWID PSEUDOCOLUMN IS NOT SUPPORTED IN SNOWFLAKE, IT WAS CONVERTED TO NULL TO AVOID RUNTIME ERRORS **
    '' AS ROWID
FROM
    hr.employees
FETCH NEXT 10 ROWS ONLY;
```

##### Result

> **Danger:**
>
> SQL compilation error: invalid identifier ‘ROWID’

### Related EWIs

1. [SSC-EWI-0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-FDM-OR0030](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): ROWID pseudocolumn is not supported in Snowflake.

## UROWID Data Type

### Description

> Oracle uses universal rowids (urowids) to store the addresses of index-organized and foreign tables. Index-organized tables have logical urowids and foreign tables have foreign urowids.([Oracle SQL Language Reference UROWID Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-E9F3AE1C-AA6D-4262-A15F-778833251361))

```none
UROWID [(size)]
```

### Sample Source Patterns

#### UROWID in Create Table

##### Oracle

```sql
CREATE TABLE urowid_table
(
    urowid_column UROWID,
    urowid_sized_column UROWID(40)
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE urowid_table
    (
        urowid_column VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - UROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!,
        urowid_sized_column VARCHAR(18) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - UROWID DATA TYPE CONVERTED TO VARCHAR ***/!!!
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Insert data in the UROWID column

Just like ROWID, it is possible to insert data in UROWID columns if the insert has a valid UROWID, but retrieving from a table is not allowed.

##### Oracle

```sql
INSERT INTO urowid_table VALUES ('*BAMAAJMCVUv+','*BAMAAJMCVUv+');

SELECT * FROM urowid_table;
```

##### Result

| UROWID_COLUMN | UROWID_SIZED_COLUMN |
| --- | --- |
| \*BAMAAJMCVUv+ | \*BAMAAJMCVUv+ |

##### Snowflake\*\* SSC-FDM-0007 - MISSING DEPENDENT OBJECT “urowid_table” \*\*

```sql
INSERT INTO urowid_table
VALUES ('*BAMAAJMCVUv+','*BAMAAJMCVUv+');

SELECT * FROM
urowid_table;
```

##### Result

| UROWID_COLUMN | UROWID_SIZED_COLUMN |
| --- | --- |
| \*BAMAAJMCVUv+ | \*BAMAAJMCVUv+ |

### Known Issues

> **Note:**
>
> Since the result set is too large, *Row Limiting Clause* was added. You can remove this clause to retrieve the entire result set.

**1. Retrieving UROWID from a table that does not have an explicit column with this data type**

The following query displays an error in Snowflake since hr.countries do not contain a ROWID (as mentioned in [Oracle’s documentation](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-E9F3AE1C-AA6D-4262-A15F-778833251361) UROWID is accessed with `SELECT` … `ROWID` statement) column.

#### Oracle

```sql
SELECT
    rowid,
    country_name
FROM
    hr.countries FETCH NEXT 10 ROWS ONLY;
```

##### Result

| ROWID | COUNTRY_NAME |
| --- | --- |
| \*BAMAAJMCQVL+ | Argentina |
| \*BAMAAJMCQVX+ | Australia |
| \*BAMAAJMCQkX+ | Belgium |
| \*BAMAAJMCQlL+ | Brazil |
| \*BAMAAJMCQ0H+ | Canada |
| \*BAMAAJMCQ0j+ | Switzerland |
| \*BAMAAJMCQ07+ | China |
| \*BAMAAJMCREX+ | Germany |
| \*BAMAAJMCREv+ | Denmark |
| \*BAMAAJMCRUf+ | Egypt |

##### Snowflake

```sql
SELECT
        --** SSC-FDM-OR0030 - ROWID PSEUDOCOLUMN IS NOT SUPPORTED IN SNOWFLAKE, IT WAS CONVERTED TO NULL TO AVOID RUNTIME ERRORS **
        '' AS rowid,
        country_name
FROM
        hr.countries
FETCH NEXT 10 ROWS ONLY;
```

##### Result

> **Danger:**
>
> SQL compilation error: invalid identifier ‘ROWID’

##### 2. EWI should be displayed by SnowConvert AI

EWI should be displayed when trying to select UROWID column. There is a work item to add the corresponding EWI.

> **Danger:**
>
> This issue has been marked as critical and will be fixed in the upcoming releases.

### Related EWIs

1. [SSC-EWI-0036](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-FDM-OR0030](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): ROWID pseudocolumn is not supported in Snowflake.

---
title: SnowConvert AI - Oracle - Sample data
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sample-data.md
section: Migrations
---

# SnowConvert AI - Oracle - Sample data

Sample data used in examples

Some of the code examples are based on the [Oracle Sample database Schemas](https://docs.oracle.com/en/database/oracle/oracle-database/21/comsc/introduction-to-sample-schemas.html#GUID-844E92D8-A4C8-4522-8AF5-761D4BE99200). You can install a duplicate locally to reproduce the queries with the following [repository](https://github.com/oracle/db-sample-schemas). To reproduce the queries in Snowflake you will need to migrate the “hr_cre.sql” and “hr_popul.sql” files from the “human_resources” directory using the SnowConvert AI tool and then deploy the resulting code.

---
title: SnowConvert AI - Oracle - Select
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-queries-and-subqueries/selects.md
section: Migrations
---

# SnowConvert AI - Oracle - Select

In this section you could find information about the select query syntax and its conversions.

> **Note:**
>
> Some parts in the output codes are omitted for clarity reasons.

## Overall Select Translation

### Simple select

#### Oracle:

```sql
select * from table1;
select col1 from schema1.table1;
```

#### Snowflake:

```sql
select * from
table1;

select col1 from
schema1.table1;
```

### Where clause

#### Oracle:

```sql
select col1 from schema1.table1 WHERE col1 = 1 and id > 0 or id < 1;
```

#### Snowflake:

```sql
select col1 from
schema1.table1
WHERE col1 = 1 and id > 0 or id < 1;
```

### Order By clause

#### Oracle:

```sql
select col1 from schema1.table1 order by id ASC;
```

#### Snowflake:

```sql
select col1 from
schema1.table1
order by id ASC;
```

### Group by

#### Oracle:

```sql
select col1 from schema1.table1 GROUP BY id;
```

#### Snowflake:

```sql
select col1 from
schema1.table1
GROUP BY id;
```

### Model Clause

The model clause is not supported yet.

### Row Limiting Clause

#### Oracle:

```sql
-- Using ONLY
select * from TableFetch1 FETCH FIRST 2 ROWS ONLY;
select * from TableFetch1 FETCH FIRST 20 percent ROWS ONLY;
select * from TableFetch1 order by col1 FETCH FIRST 2 ROWS with ties;
select * from TableFetch1 order by col1 FETCH FIRST 20 percent ROWS with ties;

-- Using OFFSET clause
select * from TableFetch1 offset 2 rows FETCH FIRST 2 ROWS ONLY;
select * from TableFetch1 offset 2 rows FETCH FIRST 60 percent rows ONLY;
select * from TableFetch1
order by col1 offset 2 rows FETCH NEXT 2 ROWs with ties;
select * from TableFetch1
order by col1 offset 2 rows FETCH FIRST 60 percent ROWs with ties;

-- Using WITH TIES clause
select * from TableFetch1 FETCH FIRST 2 ROWS with ties;
select * from TableFetch1 FETCH FIRST 20 percent ROWS with ties;
select * from TableFetch1 offset 2 rows FETCH NEXT 2 ROWs with ties;
select * from TableFetch1 offset 2 rows FETCH FIRST 60 percent ROWs with ties;

-- Using ORDER BY clause
select * from TableFetch1 order by col1 FETCH FIRST 2 ROWS ONLY;
select * from TableFetch1 order by col1 FETCH FIRST 20 percent ROWS ONLY;
select * from TableFetch1 order by col1 offset 2 rows FETCH FIRST 2 ROWS ONLY;
select * from TableFetch1
order by col1 offset 2 rows FETCH FIRST 60 percent ROWS ONLY;

select * from TableFetch1 FETCH FIRST ROWS ONLY;

select * from TableFetch1 offset 2 rows;
```

#### Snowflake:

```sql
-- Using ONLY
select * from
TableFetch1
FETCH FIRST 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
ORDER BY
NULL) - 1) / COUNT(*) OVER () < 20 / 100;

select * from
TableFetch1
QUALIFY
RANK() OVER (
order by col1) <= 2;

select * from
TableFetch1
QUALIFY
(RANK() OVER (
order by col1) - 1) / COUNT(*) OVER () < 20 / 100;

-- Using OFFSET clause
select * from
TableFetch1
offset 2 rows FETCH FIRST 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
ORDER BY
NULL) - 1 - 2) / COUNT(*) OVER () < 60 / 100
LIMIT NULL OFFSET 2;

select * from
TableFetch1
QUALIFY
RANK() OVER (
order by col1) - 2 <= 2
LIMIT NULL OFFSET 2;

select * from
TableFetch1
QUALIFY
(RANK() OVER (
order by col1) - 1 - 2) / COUNT(*) OVER () < 60 / 100
LIMIT NULL OFFSET 2;

-- Using WITH TIES clause
select * from
TableFetch1
FETCH FIRST 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
ORDER BY
NULL) - 1) / COUNT(*) OVER () < 20 / 100;

select * from
TableFetch1
offset 2 rows FETCH NEXT 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
ORDER BY
NULL) - 1 - 2) / COUNT(*) OVER () < 60 / 100
LIMIT NULL OFFSET 2;

-- Using ORDER BY clause
select * from
TableFetch1
order by col1
FETCH FIRST 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
order by col1) - 1) / COUNT(*) OVER () < 20 / 100;

select * from
TableFetch1
order by col1 offset 2 rows FETCH FIRST 2 ROWS ONLY;

select * from
TableFetch1
QUALIFY
(ROW_NUMBER() OVER (
order by col1) - 1 - 2) / COUNT(*) OVER () < 60 / 100
LIMIT NULL OFFSET 2;

select * from
TableFetch1
FETCH FIRST 1 ROWS ONLY;

select * from
TableFetch1
LIMIT NULL OFFSET 2;
```

> **Note:**
>
> In Oracle, the `FETCH` / `OFFSET WITH TIES` is ignored when no `ORDER BY` is specified in the `SELECT`. This case will be transformed to a `FETCH` / `OFFSET` with the ONLY keyword in Snowflake, please note that in Snowflake the `ONLY` keyword has no effect in the results and is used just for readability.

## Pivot

Snowflake does not support the following statements:
- Rename columns
- Multiple Columns

### Oracle:

```sql
select * from schema1.table1
PIVOT(count(*) as count1 FOR (column1, column2) IN (row1 as rowName));
```

#### Snowflake:

```sql
select * from
schema1.table1
!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
PIVOT (count(*)
                !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT MULTIPLE COLUMN NOT SUPPORTED ***/!!!
                FOR (column1, column2)
!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
IN (row1 as rowName));
```

## Unpivot

Snowflake does not support the following statements:
- INCLUDE / EXCLUDE NULLS

### Oracle:

```sql
select * from schema1.table1
UNPIVOT INCLUDE NULLS (column1 FOR column2 IN (ANY, ANY));
```

#### Snowflake:

```sql
select * from
schema1.table1
!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT INCLUDE NULLS NOT SUPPORTED ***/!!!
UNPIVOT ( column1 FOR column2 IN (
ANY,
ANY));
```

## Transformation of JOIN (+) to ANSI Syntax

> **Danger:**
>
> This translation is currently deactivated and it’s only meant for reference for translations done with previous versions of SnowConvert AI. For the current translation check the section above.

SnowConvert AI translates the NON-ANSI special outer join (+) syntax to ANSI outer join syntax. This subsection shows some examples:

### To LEFT OUTER JOIN

Example 1:

#### Oracle:

```sql
-- Additional Params: --OuterJoinsToOnlyAnsiSyntax
SELECT d.department_name,
       e.employee_name
FROM   departments d, employees e
WHERE  d.department_id = e.department_id (+)
AND    d.department_id >= 30;
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name
FROM
       departments d
       LEFT OUTER JOIN
              employees e
              ON d.department_id = e.department_id
WHERE
       d.department_id >= 30;
```

Example 2:

#### Oracle:

```sql
-- Additional Params: --OuterJoinsToOnlyAnsiSyntax
SELECT d.department_name,
       e.employee_name
FROM   departments d, employees e
WHERE  d.department_id(+)  = e.department_id
AND    d.department_id >= 30;
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name
FROM
       employees e
       LEFT OUTER JOIN
              departments d
              ON d.department_id = e.department_id
WHERE
       d.department_id >= 30;
```

Example 3: Multiple join

#### Oracle:

```sql
-- Additional Params: --OuterJoinsToOnlyAnsiSyntax
SELECT d.department_name,
       e.employee_name
FROM   departments d, employees e, projects p
WHERE  e.department_id(+) = d.department_id
AND    p.department_id(+) = d.department_id
AND    d.department_id >= 30;
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name
FROM
       departments d
       LEFT OUTER JOIN
              employees e
              ON e.department_id = d.department_id
       LEFT OUTER JOIN
              projects p
              ON p.department_id = d.department_id
WHERE
       d.department_id >= 30;
```

Example 4: Join with other kinds of conditional

#### Oracle:

```sql
-- Additional Params: --OuterJoinsToOnlyAnsiSyntax
SELECT d.department_name,
       e.employee_name
FROM   departments d, employees e
WHERE  d.department_id(+)  = e.department_id
AND    d.location(+) IN ('CHICAGO', 'BOSTON', 'NEW YORK')
AND    d.department_id >= 30;
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name
FROM
       employees e
       LEFT OUTER JOIN
              departments d
              ON d.department_id = e.department_id
              AND d.location IN ('CHICAGO', 'BOSTON', 'NEW YORK')
WHERE
       d.department_id >= 30;
```

Example 5: Join with (+) inside a function

#### Oracle:

```sql
-- Additional Params: --OuterJoinsToOnlyAnsiSyntax
SELECT d.department_name,
       e.employee_name
FROM   departments d, employees e
WHERE SUBSTR(d.department_name, 1, NVL(e.department_id, 1) ) = e.employee_name(+);
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name
FROM
       departments d
       LEFT OUTER JOIN
              employees e
              ON SUBSTR(d.department_name, 1, NVL(e.department_id, 1) ) = e.employee_name;
```

> **Warning:**
>
> Please be aware that some of the patterns that were translated to LEFT OUTER JOIN could retrieve the rows in a different order.

### To CROSS JOIN

Example 6: Complex case that requires the use of CROSS JOIN

#### Oracle:

```sql
SELECT d.department_name,
       e.employee_name,
       p.project_name,
       c.course_name
FROM   departments d, employees e, projects p, courses c
WHERE
e.salary (+) >= 2000 AND
d.department_id = e.department_id (+)
AND p.department_id = e.department_id(+)
AND c.course_id  = e.department_id(+)
AND d.department_id >= 30;
```

#### Snowflake:

```sql
SELECT d.department_name,
       e.employee_name,
       p.project_name,
       c.course_name
FROM
       departments d
       CROSS JOIN projects p
       CROSS JOIN courses c
       LEFT OUTER JOIN
              employees e
              ON
              e.salary >= 2000
              AND
              d.department_id = e.department_id
              AND p.department_id = e.department_id
              AND c.course_id  = e.department_id
WHERE
       d.department_id >= 30;
```

## Hierarchical Queries

Hierarchical queries in Snowflake allow you to organize and retrieve data in a tree-like structure, typically using the [`CONNECT BY`](https://docs.snowflake.com/en/sql-reference/constructs/connect-by) clause. This clause joins a table to itself to process hierarchical data in the table.

### Sample Source Patterns

#### Oracle:

```sql
SELECT employee_ID, manager_ID, title
FROM employees
START WITH manager_ID = 1
CONNECT BY manager_ID = PRIOR employee_id;
```

#### Snowflake:

```sql
SELECT employee_ID, manager_ID, title
FROM
employees
START WITH manager_ID = 1
CONNECT BY
manager_ID = PRIOR employee_id;
```

### Related EWIs

1. [SSC-EWI-0015](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pivot/Unpivot multiple functions not supported.

## Select Flashback Query

### Description

**Oracle**

The flashback query claused in Oracle retrieves past data from a table, view, or materialized view. In Oracle, the uses can include:

* Restoring deleted data or undoing an incorrect commit, comparing current data with the corresponding data at an earlier time, checking the state of transactional data at a particular time, and reporting generation tools to past data, among others. ([Oracle Flashback query documentation](https://docs.oracle.com/cd/E11882_01/appdev.112/e41502/adfns_flashback.htm#ADFNS01003)).

**Snowflake**

The equivalent mechanism in Snowflake to query data from the past is the `AT | BEGIN` query. Notice that the only equivalent is for the `AS OF` statements.

Furthermore, Snowflake has complete “Time Travel” documentation that allows querying data to clone objects such as tables, views, and schemas. There are limitations on the days to access the past or deleted data (90 days before passing to Fail-safe status). For more information, review the [Snowflake Time Travel Documentation](https://docs.snowflake.com/en/user-guide/data-time-travel).

**Oracle syntax**

```sql
{ VERSIONS BETWEEN
  { SCN | TIMESTAMP }
  { expr | MINVALUE } AND { expr | MAXVALUE }
| AS OF { SCN | TIMESTAMP } expr
}
```

**Snowflake Syntax**

```sql
SELECT ...
FROM ...
  {
   AT( { TIMESTAMP => <timestamp> | OFFSET => <time_difference> | STATEMENT => <id> | STREAM => '<name>' } ) |
   BEFORE( STATEMENT => <id> )
  }
[ ... ]
```

> **Note:**
>
> Notice that the query ID must reference a query executed within 14 days. If the query ID references a query over 14 days old, the following error is returned: `Error: statement <query_id> not found`. To work around this limitation, use the time stamp for the referenced query. ([Snowflake AT | Before documentation](https://docs.snowflake.com/en/sql-reference/constructs/at-before#syntax))

### Sample Source Patterns

The following data is used in the following examples to generate the query outputs.

#### Oracle

```sql
CREATE TABLE Employee (
    EmployeeID NUMBER PRIMARY KEY,
    FirstName VARCHAR2(50),
    LastName VARCHAR2(50),
    EmailAddress VARCHAR2(100),
    HireDate DATE,
    SalaryAmount NUMBER(10, 2)
);

INSERT INTO Employee VALUES (1, 'Bob', 'SampleNameA', 'sample@example.com', TO_DATE('2023-01-15', 'YYYY-MM-DD'), 11111.00);
INSERT INTO Employee VALUES (2, 'Bob', 'SampleNameB', 'sample@example.com', TO_DATE('2023-01-15', 'YYYY-MM-DD'), 11111.00);
INSERT INTO Employee VALUES (3, 'Bob', 'SampleNameC', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);
INSERT INTO Employee VALUES (4, 'Bob', 'SampleNameD', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);
INSERT INTO Employee VALUES (5, 'Bob', 'SampleNameE', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE Employee (
       EmployeeID NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ PRIMARY KEY,
       FirstName VARCHAR(50),
       LastName VARCHAR(50),
       EmailAddress VARCHAR(100),
       HireDate TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
       SalaryAmount NUMBER(10, 2) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
   ;

   INSERT INTO Employee
   VALUES (1, 'Bob', 'SampleNameA', 'sample@example.com', TO_DATE('2023-01-15', 'YYYY-MM-DD'), 11111.00);

   INSERT INTO Employee
   VALUES (2, 'Bob', 'SampleNameB', 'sample@example.com', TO_DATE('2023-01-15', 'YYYY-MM-DD'), 11111.00);

   INSERT INTO Employee
   VALUES (3, 'Bob', 'SampleNameC', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);

   INSERT INTO Employee
   VALUES (4, 'Bob', 'SampleNameD', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);

   INSERT INTO Employee
   VALUES (5, 'Bob', 'SampleNameE', 'sample@example.com', TO_DATE('2022-03-10', 'YYYY-MM-DD'), 11111.00);
```

#### 1. AS OF with TIMESTAMP case

##### Oracle

```sql
SELECT * FROM employees
AS OF TIMESTAMP
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS')
WHERE last_name = 'SampleName';
```

##### Snowflake

```sql
SELECT * FROM
employees
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0135 - DATA RETENTION PERIOD MAY PRODUCE NO RESULTS ***/!!!
AT (TIMESTAMP =>
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS'))
WHERE last_name = 'SampleName';
```

#### 2. AS OF with SCN case

##### Oracle

```sql
SELECT * FROM employees
AS OF SCN
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS')
WHERE last_name = 'SampleName';
```

##### Snowflake

```sql
SELECT * FROM
employees
!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'FLASHBACK QUERY' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
AS OF SCN
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS')
WHERE last_name = 'SampleName';
```

### Known Issues

1. The option when it is using SCN is not supported.
2. The VERSION statement is not supported in Snowflake.

### Related EWIs

1. [SSC-EWI-0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
2. [SSC-EWI-OR0135](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Current of clause is not supported in Snowflake.
3. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

---
title: SnowConvert AI - Oracle - SnowConvert AI Custom UDFs
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/functions/custom_udfs.md
section: Migrations
---

# SnowConvert AI - Oracle - SnowConvert AI Custom UDFs

## Description

Some Oracle built-in functions and functionalities may not be available or may behave differently in Snowflake. To minimize these differences, some functions are replaced with SnowConvert AI Custom UDFs.

These UDFs are automatically created during migration, in the `UDF Helper` folder, inside the `Output` folder. There is one file per custom UDF.

## BFILENAME UDF

### Description

This function takes the directory name and the file name parameters of the Oracle `BFILENAME()` as `STRING` and returns a concatenation of them using `\`. Since `BFILE` is translated to `VARCHAR`, the `BFILENAME` result is handled as text.

> **Warning:**
>
> The `\` must be changed to match the corresponding operating system file concatenation character.

### Custom UDF overloads

#### BFILENAME_UDF(string, string)

It concatenates the directory path and the file name.

**Parameters**

1. **DIRECTORYNAME**: A `STRING` that represents the directory path.
2. **FILENAME**: A `STRING` that represents the file name.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.BFILENAME_UDF (DIRECTORYNAME STRING, FILENAME STRING)
RETURNS STRING
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DIRECTORYNAME || '\\' || FILENAME
$$;
```

##### Oracle

```sql
--Create Table
CREATE TABLE bfile_table ( col1 BFILE );

--Insert Bfilename
INSERT INTO bfile_table VALUES ( BFILENAME('mydirectory', 'myfile.png') );

--Select
SELECT * FROM bfile_table;
```

##### Result

| COL1 |
| --- |
| [BFILE:myfile.png] |

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE bfile_table ( col1
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0105 - ADDITIONAL WORK IS NEEDED FOR BFILE COLUMN USAGE. BUILD_STAGE_FILE_URL FUNCTION IS A RECOMMENDED WORKAROUND ***/!!!
VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--Insert Bfilename
INSERT INTO bfile_table
VALUES (PUBLIC.BFILENAME_UDF('mydirectory', 'myfile.png') );

--Select
SELECT * FROM
bfile_table;
```

##### Result

| COL1 |
| --- |
| mydirectory\myfile.png |

### Known Issues

#### 1. No access to the DBMS_LOB built-in package

Since LOB data types are not supported in Snowflake there is not an equivalent for the `DBMS_LOB` functions and there are no implemented workarounds yet.

### Related EWIs

1. [SSC-EWI-OR0105](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Additional Work Is Needed For BFILE Column Usage.

## CAST_DATE UDF

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This custom UDF is added to avoid runtime exceptions caused by format differences when casting strings to `DATE`, inside procedures and functions.

### Custom UDF overloads

#### CAST_DATE_UDF(datestr)

It creates a `DATE` from a `STRING`.

**Parameters**

1. **DATESTR**: A `STRING` that represents a `DATE` with a specific format.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.CAST_DATE_UDF(DATESTR STRING)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	SELECT TO_DATE(DATESTR,'YYYY-MM-DD"T"HH24:MI:SS.FF')
$$;
```

##### Oracle

```sql
--Create Table
CREATE TABLE jsdateudf_table( col1 DATE );

--Create Procedure
CREATE OR REPLACE PROCEDURE jsdateudf_proc ( par1 DATE )
IS
BEGIN
    INSERT INTO jsdateudf_table VALUES(par1);
END;

--Insert Date
CALL jsdateudf_proc('20-03-1996');

--Select
SELECT * FROM jsdateudf_table;
```

##### Result

| COL1 |
| --- |
| 1996-03-20 00:00:00.000 |

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE jsdateudf_table ( col1 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

--Create Procedure
CREATE OR REPLACE PROCEDURE jsdateudf_proc (par1 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO jsdateudf_table
        VALUES(:par1);
    END;
$$;

--Insert Date
CALL jsdateudf_proc('20-03-1996');

--Select
SELECT * FROM
    jsdateudf_table;
```

##### Result

| COL1 |
| --- |
| 1996-03-20 |

### Known Issues

#### 1. Oracle DATE contains TIMESTAMP

Take into consideration that Oracle `DATE` contains an empty `TIMESTAMP` (00:00:00.000), while Snowflake `DATE` does not. SnowConvert AI allows transforming `DATE` to `TIMESTAMP` with the SysdateAsCurrentTimestamp flag.

### Related EWIs

1. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior

## DATE_TO_JULIANDAYS_UDF

### Description

The DATE_TO_JULIANDAYS_UDF() function takes a DATE and returns the number of days since January 1, 4712 BC. This function is equivalent to the Oracle TO_CHAR(DATE,’J’)

### Custom UDF overloads

#### DATE_TO_JULIANDAYS_UDF(date)

**Parameters**

1. **INPUT_DATE**: The `DATE` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATE_TO_JULIAN_DAYS_UDF(input_date DATE)
RETURNS NUMBER
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    DATEDIFF(DAY,TO_DATE('00000101','YYYYMMDD'),TO_DATE('01/01/4712','DD/MM/YYYY')) +
    DATEDIFF(DAY,TO_DATE('00000101','YYYYMMDD'),input_date) + 38
    // Note: The 38 on the equation marks the differences in days between calendars and must be updated on the year 2099
$$
;
```

#### Usage Example

##### Oracle

```sql
--Create Table
CREATE TABLE datetojulian_table (col1 DATE);

INSERT INTO datetojulian_table VALUES (DATE '2020-01-01');
INSERT INTO datetojulian_table VALUES (DATE '1900-12-31');
INSERT INTO datetojulian_table VALUES (DATE '1904-02-29');
INSERT INTO datetojulian_table VALUES (DATE '1903-03-01');
INSERT INTO datetojulian_table VALUES (DATE '2000-12-31');

--Select
SELECT TO_CHAR(col1, 'J') FROM datetojulian_table;
```

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE datetojulian_table (col1 TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO datetojulian_table
VALUES (DATE '2020-01-01');

INSERT INTO datetojulian_table
VALUES (DATE '1900-12-31');

INSERT INTO datetojulian_table
VALUES (DATE '1904-02-29');

INSERT INTO datetojulian_table
VALUES (DATE '1903-03-01');

INSERT INTO datetojulian_table
VALUES (DATE '2000-12-31');

--Select
SELECT
PUBLIC.DATE_TO_JULIAN_DAYS_UDF(col1)
FROM
datetojulian_table;
```

### Known Issues

No issues were found.

### Related EWIs

* [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior

## DATEADD UDF

### Description

This UDF is used as a template for all cases when there is an addition between a `DATE` or `TIMESTAMP` type and `FLOAT` type.

### Custom UDF overloads

#### DATEADD_UDF(date, float)

**Parameters**

1. **FIRST_PARAM**: The first `DATE` of the operation.
2. **SECOND_PARAM**: The `FLOAT` to be added.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(FIRST_PARAM DATE, SECOND_PARAM FLOAT)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
SELECT FIRST_PARAM + SECOND_PARAM::NUMBER
$$;
```

#### DATEADD_UDF(float, date)

**Parameters**

1. **FIRST_PARAM**: The `FLOAT` to be added.
2. **SECOND_PARAM**: The `DATE` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(FIRST_PARAM FLOAT, SECOND_PARAM DATE)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
SELECT FIRST_PARAM::NUMBER + SECOND_PARAM
$$;
```

#### DATEADD_UDF(timestamp, float)

**Parameters**

1. **FIRST_PARAM**: The first `TIMESTAMP` of the operation.
2. **SECOND_PARAM**: The `FLOAT` to be added.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM FLOAT)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
SELECT DATEADD(day, SECOND_PARAM,FIRST_PARAM)
$$;
```

#### DATEADD_UDF(float, timestamp)

**Parameters**

1. **FIRST_PARAM**: The`FLOAT` of the operation.
2. **SECOND_PARAM**: The`TIMESTAMP` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(FIRST_PARAM FLOAT, SECOND_PARAM TIMESTAMP)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
SELECT DATEADD(day, FIRST_PARAM,SECOND_PARAM)
$$;
```

#### Usage example

##### Oracle

```sql
SELECT
    TO_TIMESTAMP('03/08/2009, 12:47 AM', 'dd/mm/yy, hh:mi AM')+62.40750856543442
FROM DUAL;
```

##### Result

| TO_TIMESTAMP(‘03/08/2009,12:47AM’,’DD/MM/YY,HH:MIAM’)+62.40750856543442 |
| --- |
| 2009-10-04 10:33:49.000 |

##### Snowflake

```sql
SELECT
    PUBLIC.DATEADD_UDF(TO_TIMESTAMP('03/08/2009, 12:47 AM', 'dd/mm/yy, hh:mi AM'), 62.40750856543442)
FROM DUAL;
```

##### Result

|PUBLIC.DATEADD_UDF(

| TO_TIMESTAMP(‘03/08/2009, 12:47 AM’, ‘DD/MM/YY, HH12:MI AM’), 62.40750856543442) |
| --- |
| 2009-10-04 00:47:00.000 |

### Known Issues

#### 1. Differences in time precision

When there are operations between Dates or Timestamps and Floats, the time may differ from Oracle’s. There is an action item to fix this issue.

### Related EWIs

No EWIs related.

## DATEDIFF UDF

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This UDF is used as a template for all cases when there is a subtraction between a `DATE,` `TIMESTAMP,` and any other type (except Intervals).

### Custom UDF overloads

#### DATEDIFF_UDF(date, date)

**Parameters**

1. **FIRST_PARAM**: The first `DATE` of the operation.
2. **SECOND_PARAM**: The `DATE` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM DATE)
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	FIRST_PARAM - SECOND_PARAM
$$;
```

#### DATEDIFF_UDF(date, **timestamp**)

**Parameters**

1. **FIRST_PARAM**: The first `DATE` of the operation.
2. **SECOND_PARAM**: The `TIMESTAMP` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM TIMESTAMP)
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	FIRST_PARAM - SECOND_PARAM::DATE
$$;
```

#### DATEDIFF_UDF(date, integer)

**Parameters**

1. **FIRST_PARAM**: The first `DATE` of the operation.
2. **SECOND_PARAM**: The `INTEGER` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM DATE, SECOND_PARAM INTEGER)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DATEADD(day,SECOND_PARAM*-1 ,FIRST_PARAM)
$$;
```

#### DATEDIFF_UDF(timestamp, timestamp)

**Parameters**

1. **FIRST_PARAM**: The first `TIMESTAMP` of the operation.
2. **SECOND_PARAM**: The `TIMESTAMP` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM TIMESTAMP)
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DATEDIFF(day,SECOND_PARAM ,FIRST_PARAM)
$$;
```

#### DATEDIFF_UDF(timestamp, date)

**Parameters**

1. **FIRST_PARAM**: The first `TIMESTAMP` of the operation.
2. **SECOND_PARAM**: The `DATE` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM DATE)
RETURNS INTEGER
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DATEDIFF(day,SECOND_PARAM ,FIRST_PARAM)
$$;
```

#### DATEDIFF_UDF(timestamp, number)

**Parameters**

1. **FIRST_PARAM**: The first `TIMESTAMP` of the operation.
2. **SECOND_PARAM**: The `NUMBER` to be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(FIRST_PARAM TIMESTAMP, SECOND_PARAM NUMBER)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
	DATEADD(day,SECOND_PARAM*-1,FIRST_PARAM)
$$;
```

#### Usage example

> **Note:**
>
> The unknown is a column whose type could not be resolved, it could be a timestamp, date integer, or number.

> **Note:**
>
> **`--disableDateAsTimestamp`**
>
> Flag to indicate whether `SYSDATE` should be transformed into `CURRENT_DATE` *or* `CURRENT_TIMESTAMP`. This will also affect all `DATE` columns that will be transformed to `TIMESTAMP`.

##### Oracle

```sql
--Create Table
CREATE TABLE times(AsTimeStamp TIMESTAMP, AsDate DATE);

--Subtraction operations
SELECT AsDate - unknown FROM times, unknown_table;
SELECT unknown - AsTimeStamp FROM times;
SELECT AsTimeStamp - unknown FROM times;
SELECT unknown - AsDate FROM times;
```

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE times (AsTimeStamp TIMESTAMP(6),
AsDate TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;

--Subtraction operations
SELECT
PUBLIC.DATEDIFF_UDF(
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN DATE AND unknown ***/!!!
 AsDate, unknown) FROM
times,
unknown_table;

SELECT
PUBLIC.DATEDIFF_UDF(
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND TIMESTAMP ***/!!!
 unknown, AsTimeStamp) FROM
times;

SELECT
PUBLIC.DATEDIFF_UDF(
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN TIMESTAMP AND unknown ***/!!!
 AsTimeStamp, unknown) FROM
times;

SELECT
PUBLIC.DATEDIFF_UDF(
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND DATE ***/!!!
 unknown, AsDate) FROM
times;
```

### Known Issues

#### 1. Functional differences for timestamps

Sometimes the Snowflake value returned by the UDF may differ from the Oracle one due to the time. Consider the following example

##### Oracle

```sql
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT  INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));

CREATE TABLE TIMES(AsTimeStamp TIMESTAMP);
INSERT INTO TIMES VALUES (TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'));

SELECT AsTimeStamp - unknown FROM times, unknown_table;
```

##### Result

| ASTIMESTAMP-UNKNOWN |
| --- |
| 4417 23:0:0.0 |

##### Snowflake

```sql
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));
CREATE OR REPLACE TABLE TIMES (AsTimeStamp TIMESTAMP(6)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO TIMES
VALUES (TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'));

SELECT
PUBLIC.DATEDIFF_UDF(
                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN TIMESTAMP AND unknown ***/!!!
 AsTimeStamp, unknown) FROM
times,
unknown_table;
```

##### Result

| PUBLIC.DATEDIFF_UDF( ASTIMESTAMP, UNKNOWN) |
| --- |
| 4418 |

### Related EWIs

1. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
2. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

## JSON_VALUE UDF

Translation reference to convert Oracle JSON_VALUE function to Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

As per Oracle’s documentation, this function uses the [SQL/JSON Path Expression](https://docs.oracle.com/en/database/oracle/oracle-database/19/adjsn/json-path-expressions.html#GUID-7B610884-39CD-4910-85E7-C251D342D879) to request information about a portion of a JSON instance. The returning value is always a scalar value, else the function returns `NULL` by default.

```none
JSON_VALUE
  ( expr [ FORMAT JSON ], [ JSON_basic_path_expression ]
    [ JSON_value_returning_clause ] [ JSON_value_on_error_clause ]
    [ JSON_value_on_empty_clause ][ JSON_value_on_mismatch_clause ]
  )
```

The JSON_VALUE_UDF is a Snowflake implementation of the JSONPath specification that uses a modified version of the original JavaScript implementation developed by [Stefan Goessner](https://goessner.net/index.html).

### Sample Source Patterns

#### Setup Data

Run these queries to run queries in the JSON_VALUE Patterns section.

##### Oracle

```sql
CREATE TABLE MY_TAB (
    my_json VARCHAR(5000)
);

INSERT INTO MY_TAB VALUES ('{
    "store": {
      "book": [
        { "category": "reference",
          "author": "Nigel Rees",
          "title": "Sayings of the Century",
          "price": 8.95
        },
        { "category": "fiction",
          "author": "Evelyn Waugh",
          "title": "Sword of Honour",
          "price": 12.99
        },
        { "category": "fiction",
          "author": "Herman Melville",
          "title": "Moby Dick",
          "isbn": "0-553-21311-3",
          "price": 8.99
        },
        { "category": "fiction",
          "author": "J. R. R. Tolkien",
          "title": "The Lord of the Rings",
          "isbn": "0-395-19395-8",
          "price": 22.99
        }
      ],
      "bicycle": {
        "color": "red",
        "price": 19.95
      }
    }
  }');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE MY_TAB (
       my_json VARCHAR(5000)
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
   ;

   INSERT INTO MY_TAB
   VALUES ('{
    "store": {
      "book": [
        { "category": "reference",
          "author": "Nigel Rees",
          "title": "Sayings of the Century",
          "price": 8.95
        },
        { "category": "fiction",
          "author": "Evelyn Waugh",
          "title": "Sword of Honour",
          "price": 12.99
        },
        { "category": "fiction",
          "author": "Herman Melville",
          "title": "Moby Dick",
          "isbn": "0-553-21311-3",
          "price": 8.99
        },
        { "category": "fiction",
          "author": "J. R. R. Tolkien",
          "title": "The Lord of the Rings",
          "isbn": "0-395-19395-8",
          "price": 22.99
        }
      ],
      "bicycle": {
        "color": "red",
        "price": 19.95
      }
    }
  }');
```

#### JSON_VALUE Patterns

##### Oracle

```sql
-- 'Sayings of the Century'
SELECT JSON_VALUE(MY_JSON, '$..book[0].title') AS VALUE FROM MY_TAB;

-- NULL
-- gets books in positions 0, 1, 2 and 3 but returns null (default behavior) since a non scalar value was returned
SELECT JSON_VALUE(MY_JSON, '$..book[0,1 to 3,3]') AS VALUE FROM MY_TAB;

-- 'Sayings of the Century'
SELECT JSON_VALUE(MY_JSON, '$.store.book[*]?(@.category == "reference").title') AS VALUE FROM MY_TAB;

-- 'MY ERROR MESSAGE'
-- triggers error because the result is a non scalar value (is an object)
SELECT JSON_VALUE(MY_JSON, '$..book[0]' DEFAULT 'MY ERROR MESSAGE' ON ERROR DEFAULT 'MY EMPTY MESSAGE' ON EMPTY) AS VALUE FROM MY_TAB;

-- 'MY EMPTY MESSAGE'
-- triggers the on empty class because does not exists in the first book element
SELECT JSON_VALUE(MY_JSON, '$..book[0].isbn' DEFAULT 'MY ERROR MESSAGE' ON ERROR DEFAULT 'MY EMPTY MESSAGE' ON EMPTY) AS VALUE FROM MY_TAB;

-- Oracle error message: ORA-40462: JSON_VALUE evaluated to no value
-- this is a custom message from the UDF when no match is found and the ON ERROR clause is set to ERROR
SELECT JSON_VALUE(MY_JSON, '$..book[0].isbn' ERROR ON ERROR) AS VALUE FROM MY_TAB;

-- NULL
SELECT JSON_VALUE(MY_JSON, '$..book[0].isbn' NULL ON ERROR) AS VALUE FROM MY_TAB;

-- Oracle error message: ORA-40462: JSON_VALUE evaluated to no value
-- this is a custom message from the UDF when no match is found and the ON EMPTY clause is set to ERROR
SELECT JSON_VALUE(MY_JSON, '$..book[0].isbn' ERROR ON EMPTY) AS VALUE FROM MY_TAB;

-- NULL
SELECT JSON_VALUE(MY_JSON, '$..book[0].isbn' NULL ON EMPTY) AS VALUE FROM MY_TAB;

-- 'Sayings of the Century'
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING VARCHAR2) AS VALUE FROM MY_TAB;

-- 'Sayin'
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING VARCHAR2(5) TRUNCATE) AS VALUE FROM MY_TAB;

-- 'Sayings of the Century'
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING CLOB) AS VALUE FROM MY_TAB;

-- NULL
-- This is because the title field is a string and the function expects a number result type
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING NUMBER) AS VALUE FROM MY_TAB;

-- 420
-- This is because the title field is a string and the function expects a number result type
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING NUMBER DEFAULT 420 ON ERROR) AS VALUE FROM MY_TAB;

-- Oracle error message: ORA-01858: a non-numeric character was found where a numeric was expected
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' RETURNING DATE ERROR ON ERROR) AS VALUE FROM MY_TAB;

-- ORA-40450: invalid ON ERROR clause
SELECT JSON_VALUE(MY_JSON, '$..book[0].title' ERROR ON MISMATCH) AS VALUE FROM MY_TAB;
```

##### Results

| JSON Path | Query result |
| --- | --- |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0,1 to 3,3]'` | `NULL` |
| `'$.store.book[*]?(@.category == "reference").title'` | `'Sayings of the Century'` |
| `'$..book[0]'` | `'MY ERROR MESSAGE'` |
| `'$..book[0].isbn'` | `'MY EMPTY MESSAGE'` |
| `'$..book[0].isbn'` | `ORA-40462: JSON_VALUE evaluated to no value` |
| `'$..book[0].isbn'` | `NULL` |
| `'$..book[0].isbn'` | `ORA-40462: JSON_VALUE evaluated to no value` |
| `'$..book[0].isbn'` | `NULL` |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0].title'` | `'Sayin'` |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0].title'` | `NULL` |
| `'$..book[0].title'` | `420` |
| `'$..book[0].title'` | `ORA-01858: a non-numeric character was found where a numeric was expected` |
| `'$..book[0].title'` | `ORA-40450: invalid ON ERROR clause` |

##### Snowflake

```sql
-- 'Sayings of the Century'
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', NULL, NULL, NULL) AS VALUE FROM
MY_TAB;

-- NULL
-- gets books in positions 0, 1, 2 and 3 but returns null (default behavior) since a non scalar value was returned
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0,1 to 3,3]', NULL, NULL, NULL) AS VALUE FROM
MY_TAB;

-- 'Sayings of the Century'
SELECT
JSON_VALUE_UDF(MY_JSON, '$.store.book[*]?(@.category == "reference").title', NULL, NULL, NULL) AS VALUE FROM
MY_TAB;

-- 'MY ERROR MESSAGE'
-- triggers error because the result is a non scalar value (is an object)
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0]', NULL, 'MY ERROR MESSAGE' :: VARIANT, 'MY EMPTY MESSAGE' :: VARIANT) AS VALUE FROM
MY_TAB;

-- 'MY EMPTY MESSAGE'
-- triggers the on empty class because does not exists in the first book element
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].isbn', NULL, 'MY ERROR MESSAGE' :: VARIANT, 'MY EMPTY MESSAGE' :: VARIANT) AS VALUE FROM
MY_TAB;

-- Oracle error message: ORA-40462: JSON_VALUE evaluated to no value
-- this is a custom message from the UDF when no match is found and the ON ERROR clause is set to ERROR
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].isbn', NULL, 'SSC_ERROR_ON_ERROR' :: VARIANT, NULL) AS VALUE FROM
MY_TAB;

-- NULL
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].isbn', NULL, 'SSC_NULL_ON_ERROR' :: VARIANT, NULL) AS VALUE FROM
MY_TAB;

-- Oracle error message: ORA-40462: JSON_VALUE evaluated to no value
-- this is a custom message from the UDF when no match is found and the ON EMPTY clause is set to ERROR
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].isbn', NULL, NULL, 'SSC_ERROR_ON_EMPTY' :: VARIANT) AS VALUE FROM
MY_TAB;

-- NULL
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].isbn', NULL, NULL, 'SSC_NULL_ON_EMPTY' :: VARIANT) AS VALUE FROM
MY_TAB;

-- 'Sayings of the Century'
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', 'string', NULL, NULL) AS VALUE FROM
MY_TAB;

-- 'Sayin'
SELECT
LEFT(JSON_VALUE_UDF(MY_JSON, '$..book[0].title', 'string', NULL, NULL), 5) AS VALUE FROM
MY_TAB;

-- 'Sayings of the Century'
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', 'string', NULL, NULL) AS VALUE FROM
MY_TAB;

-- NULL
-- This is because the title field is a string and the function expects a number result type
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', 'number', NULL, NULL) AS VALUE FROM
MY_TAB;

-- 420
-- This is because the title field is a string and the function expects a number result type
SELECT
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', 'number', 420 :: VARIANT, NULL) AS VALUE FROM
MY_TAB;

-- Oracle error message: ORA-01858: a non-numeric character was found where a numeric was expected
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - RETURNING CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
JSON_VALUE_UDF(MY_JSON, '$..book[0].title', NULL, 'SSC_ERROR_ON_ERROR' :: VARIANT, NULL) AS VALUE FROM
MY_TAB;

-- ORA-40450: invalid ON ERROR clause
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - ON MISMATCH CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
SON_VALUE_UDF(MY_JSON, '$..book[0].title', NULL, NULL, NULL) AS VALUE FROM
MY_TAB;
```

##### Results

| JSON Path | Query result |
| --- | --- |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0,1 to 3,3]'` | `NULL` |
| `'$.store.book[*]?(@.category == "reference").title'` | `'Sayings of the Century'` |
| `'$..book[0]'` | `'MY ERROR MESSAGE'` |
| `'$..book[0].isbn'` | `'MY EMPTY MESSAGE'` |
| `'$..book[0].isbn'` | `"SSC_CUSTOM_ERROR - NO MATCH FOUND"` |
| `'$..book[0].isbn'` | `NULL` |
| `'$..book[0].isbn'` | `"SSC_CUSTOM_ERROR - NO MATCH FOUND"` |
| `'$..book[0].isbn'` | `NULL` |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0].title'` | `'Sayin'` |
| `'$..book[0].title'` | `'Sayings of the Century'` |
| `'$..book[0].title'` | `NULL` |
| `'$..book[0].title'` | `420` |
| `'$..book[0].title'` | **NOT SUPPORTED** |
| `'$..book[0].title'` | **NOT SUPPORTED** |

### Known Issues

#### 1. Returning Type Clause is not fully supported

Now, the only supported types when translating the functionality of the RETURNING TYPE clause are `VARCHAR2`, `CLOB` and `NUMBER`.

For all the other types supported by the original JSON_VALUE function, the JSON_VALUE_UDF will behave as if no RETURNING TYPE clause was specified.

Unsupported types:

* `DATE`
* `TIMESTAMP [WITH TIME ZONE]`
* `SDO_GEOMETRY`
* `CUSTOM TYPE`

#### 2. ON MISMATCH Clause is not supported

Now, the ON MISMATCH clause is not supported, and a warning EWI is placed instead. Thus, the translated code will behave as if no ON MISMATCH clause was originally specified.

#### 3. Complex filters are not supported

Complex filters with more than one expression will return null as they are not supported.

For example, with the same data as before, this JSON path `$.store.book[*]?(@.category == "reference").title` is supported and will return `'Sayings of the Century'`.

However, `$.store.book[*]?(@.category == "reference" && @.price < 10).title` will return `null` since more than one expression is used in the filter.

### Related EWIs

1. [SSC-EWI-0021](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Not supported in Snowflake.

## JULIAN TO GREGORIAN DATE UDF

### Description

This User Defined Function (UDF) is used to transform or cast the Julian date format to a Gregorian date format. Julian dates can be received in three different formats such as JD Edwards World, astronomy or ordinary format.

### Custom UDF overloads

#### JULIAN_TO_GREGORIAN_DATE_UDF(julianDate, formatSelected)

It returns a string with the Gregorian date format YYYY-MM-DD.

##### Parameters:

**JulianDate**: The Julian date to be cast. It can be either CYYDDD (where C is the century) or YYYYDDD.

**formatSelected**: It represents the format in which the Julian date should be processed. Besides, it is a CHAR and can accept the following formats:

| Format available | Letter representation in CHAR | Description |
| --- | --- | --- |
| Astronomy standardized | ‘J’ | It is the default format. The cast is based in the expected conversion of the Astronomical Applications Department of the US. The Julian Date format for this is YYYYDDD. |
| JD Edwards World | ‘E’ | The expected Julian date to be received in this case should be CYYDDD (where C represents the century and is operationalized to be added 19 to the corresponding number). |
| Ordinal dates | ‘R’ | The ordinal dates are an arrangement of numbers which represent a concisely date. The format is YYYYDDD and can be easily read because the year part is not mutable. |

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.JULIAN_TO_GREGORIAN_DATE_UDF(JULIAN_DATE CHAR(7), FORMAT_SELECTED CHAR(1))
RETURNS variant
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    const CONST_FOR_MODIFIED_JULIAN_DATE = 0.5;
    const BEGINNING_OF_GREG_CALENTAR = 2299161;
    const CONST_AFTER_GREG_VALUE = 1867216.25;
    const DIVIDENT_TO_GET_CENTURY = 36524.25;
    const LEAP_YEAR_CONSTANT = 4;
    const CONST_TO_GET_DAY_OF_MONTH = 30.6001;

    //Functions definitions

    function julianToGregorian(julianDate){
        const JD = julianDate + CONST_FOR_MODIFIED_JULIAN_DATE; //setting modified julian date
        const Z = Math.floor(JD); //setting fractional part of julian day
        const F = JD - Z; //fractional part of the julian date
        let A, alpha, B, C, D, E, year, month, day;

        //verification for the beginning of gregorian calendar
        if(Z < BEGINNING_OF_GREG_CALENTAR){
            A=Z;
        } else {
            //alpha is for dates after the beginning of gregorian calendar
            alpha = Math.floor((Z-CONST_AFTER_GREG_VALUE) / DIVIDENT_TO_GET_CENTURY);
            A=Z+1+alpha - Math.floor(alpha/LEAP_YEAR_CONSTANT);
        }

        B = A + 1524;
        C = Math.floor((B-122.1)/365.25);
        D = Math.floor(365.25*C);
        E = Math.floor((B-D)/CONST_TO_GET_DAY_OF_MONTH);

        day= Math.floor(B-D-Math.floor(CONST_TO_GET_DAY_OF_MONTH*E)+F);
        month=(E<14)? E -1: E-13;
        year=(month>2)? C-4716: C-4715;

        return new Date(year, month-1, day);
    }

function cyydddToGregorian(julianDate){
        var c=Math.floor(julianDate/1000);
        var yy=(c<80)? c+2000: c+1900;
        var ddd=julianDate%1000;
        var date= new Date(yy, 0);
        date.setDate(ddd);
        return date;
    }

function ordinalDate(ordinalDate){
    const year = parseInt(ordinalDate.toString().substring(0,4));
    const dayOfYear = parseInt(ordinalDate.toString().substring(4));
    const date = new Date(year, 0); //Set date to the first day of year
    date.setDate(dayOfYear);
    return date;
}

function formatDate(toFormatDate){
    toFormatDate = toFormatDate.toDateString();
    let year = toFormatDate.split(" ")[3];
    let month = toFormatDate.split(" ")[1];
    let day = toFormatDate.split(" ")[2];
    return new Date(month + day + ", " + Math.abs(year)).toISOString().split('T')[0]
}

    switch(FORMAT_SELECTED){
        case 'E':
            //JD Edwards World formar, century added  - CYYDDD
            var result = formatDate(cyydddToGregorian(parseInt(JULIAN_DATE)));
            return result;
        break;
        case 'J':
            //astronomical format YYYYDDD
            return formatDate(julianToGregorian(parseInt(JULIAN_DATE)));
        break;
        case 'R':
            //ordinal date format YYYYDDD
            return formatDate(ordinalDate(parseInt(JULIAN_DATE)));
        break;
        default: return null;
    }

$$
;
```

### Usage Example

#### Oracle

```sql
select to_date('2020001', 'J') from dual;
```

##### Result

| TO_DATE(‘2020001’, ‘J’) |
| --- |
| 18-JUN-18 |

##### Formatted result

| TO_CHAR(TO_DATE(‘2020001’, ‘J’), ‘YYYY-MON-DD’) |
| --- |
| 0818-JUN-18 |

* *Note: The date must be formatted to visualize all digits of the year.*

#### Snowflake

```sql
select
PUBLIC.JULIAN_TO_GREGORIAN_DATE_UDF('2020001', 'J')
from dual;
```

##### Result

| JULIAN_TO_GREGORIAN_DATE_UDF(‘2020001’, ‘J’) |
| --- |
| “0818-06-18” |

### Known Issues

1. Any other format: If the Julian Date is formatted in any other not supported format, there would be differences in the output.
2. Ranges of B.C. dates may represent inconsistencies due to unsupported Snowflake functions for dates.

#### Related EWIs

No EWIs related.

## MONTHS BETWEEN UDF [DEPRECATED]

> **Danger:**
>
> This UDF has been deprecated. Current transformation for **Oracle** [MONTHS_BETWEEN()](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/MONTHS_BETWEEN.html#GUID-E4A1AEC0-F5A0-4703-9CC8-4087EB889952) is **Snowflake** [MONTHS_BETWEEN()](https://docs.snowflake.com/en/sql-reference/functions/months_between.html#months-between).

### Description

> `MONTHS_BETWEEN` returns number of months between dates `date1` and `date2`. ([Oracle MONTHS_BETWEEN SQL Language Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/MONTHS_BETWEEN.html#GUID-E4A1AEC0-F5A0-4703-9CC8-4087EB889952))

```sql
MONTHS_BETWEEN(date1, date2)
```

Oracle `MONTHS_BETWEEN` and Snowflake `MONTHS_BETWEEN` function, have some functional differences, to minimize these differences and replicate Oracle `MONTHS_BETWEEN` function better, we added a custom UDF.

### Custom UDF overloads

#### MONTHS_BETWEEN_UDF(timestamp_ltz, timestamp_ltz)

**Parameters**

1. **FIRST_DATE**: The first `TIMESTAMP_LTZ` of the operation.
2. **SECOND_DATE**: The second `TIMESTAMP_LTZ` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION MONTHS_BETWEEN_UDF(FIRST_DATE TIMESTAMP_LTZ, SECOND_DATE TIMESTAMP_LTZ)
RETURNS NUMBER
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
ROUND(MONTHS_BETWEEN(FIRST_DATE, SECOND_DATE))
$$
;
```

##### Oracle

```sql
SELECT
	MONTHS_BETWEEN('2000-03-20 22:01:11', '1996-03-20 10:01:11'),
	MONTHS_BETWEEN('1996-03-20 22:01:11', '2000-03-20 10:01:11'),
	MONTHS_BETWEEN('1982-05-11 22:31:19', '1900-01-25 15:21:15'),
	MONTHS_BETWEEN('1999-12-25 01:15:16', '1900-12-11 02:05:16')
FROM DUAL;
```

##### Result

| MONTHS_BETWEEN(‘2000-03-2022:01:11’,’1996-03-2010:01:11’) | MONTHS_BETWEEN(‘1996-03-2022:01:11’,’2000-03-2010:01:11’) | MONTHS_BETWEEN(‘1982-05-1122:31:19’,’1900-01-2515:21:15’) | MONTHS_BETWEEN(‘1999-12-2501:15:16’,’1900-12-1102:05:16’) |
| --- | --- | --- | --- |
| 48 | -48 | 987.558021206690561529271206690561529271 | 1188.450492831541218637992831541218637993 |

##### Snowflake

```sql
SELECT
	MONTHS_BETWEEN('2000-03-20 22:01:11', '1996-03-20 10:01:11'),
	MONTHS_BETWEEN('1996-03-20 22:01:11', '2000-03-20 10:01:11'),
	MONTHS_BETWEEN('1982-05-11 22:31:19', '1900-01-25 15:21:15'),
	MONTHS_BETWEEN('1999-12-25 01:15:16', '1900-12-11 02:05:16')
FROM DUAL;
```

##### Result

| MONTHS_BETWEEN_UDF(‘2000-03-20 22:01:11’, ‘1996-03-20 10:01:11’) | MONTHS_BETWEEN_UDF(‘1996-03-20 22:01:11’, ‘2000-03-20 10:01:11’) | MONTHS_BETWEEN_UDF(‘1982-05-11 22:31:19’, ‘1900-01-25 15:21:15’) | MONTHS_BETWEEN_UDF(‘1999-12-25 01:15:16’, ‘1900-12-11 02:05:16’) |
| --- | --- | --- | --- |
| 48.000000 | -48.000000 | 987.558024 | 1188.450497 |

### Known Issues

#### 1. Precision may differ from Oracle

Some results may differ in the number of decimal digits.

### Related EWIs

No related EWIs.

## REGEXP LIKE UDF

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> `REGEXP_LIKE` performs regular expression matching. This condition evaluates strings using characters as defined by the input character set. ([Oracle Language Regerence REGEXP_LIKE Condition](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Pattern-matching-Conditions.html#GUID-D2124F3A-C6E4-4CCA-A40E-2FFCABFD8E19))

```sql
REGEXP_LIKE(source_char, pattern [, match_param ])
```

Oracle `REGEXP_LIKE` and Snowflake `REGEXP_LIKE` condition, have some functional differences, to minimize these differences and replicate Oracle `REGEXP_LIKE` function better, we added a custom UDF. The main idea is to escape the backslash symbol from the regular expression where it is required. These are the special characters that need to be escaped when they come with a backslash: `'d', 'D', 'w', 'W', 's', 'S', 'A', 'Z', 'n'`. Also, the **backreference expression** (matches the same text as most recently matched by the “number specified” capturing group) needs to be escaped.

### Custom UDF overloads

#### REGEXP_LIKE_UDF(string, string)

##### Parameters

1. **COL:** is the character expression that serves as the search value.
2. **PATTERN:** is the regular expression.

##### UDF

```sql
CREATE OR REPLACE FUNCTION REGEXP_LIKE_UDF(COL STRING, PATTERN STRING)
RETURNS BOOLEAN
LANGUAGE JAVASCRIPT
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
return COL.match(new RegExp(PATTERN));
$$;
```

##### Oracle

##### Snowflake

#### REGEXP_LIKE_UDF(string, string, string)

##### Parameters

1. **COL:** is the character expression that serves as the search value.
2. **PATTERN:** is the regular expression.
3. **MATCHPARAM**: is a character expression that let’s change the default matching behavior of the condition. In the following table, there are the Oracle characters with their description and their equivalent in the UDF.

| Match Parameter | Description | UDF Equivalent |
| --- | --- | --- |
| ‘i’ | Specifies case-insensitive matching, even if the determined collation of the condition is case-sensitive. | ‘i’ |
| ‘c’ | Specifies case-sensitive and accent-sensitive matching, even if the determined collation of the condition is case-insensitive or accent-insensitive. | Does not have an equivalent. It is being removed from the parameter.. |
| ‘n’ | Allows the period (.), which is the match-any-character wildcard character, to match the newline character. If you omit this parameter, then the period does not match the newline character. | ‘s’ |
| ‘m’ | Treats the source string as multiple lines. Oracle interprets `^` and `$` as the start and end, respectively, of any line anywhere in the source string, rather than only at the start or end of the entire source string. If you omit this parameter, then Oracle treats the source string as a single line. | ‘m’ |
| ‘x’ | Ignores whitespace characters. By default, whitespace characters match themselves. | Does not have an equivalent. It is being removed from the parameter. |

##### UDF

```sql
CREATE OR REPLACE FUNCTION REGEXP_LIKE_UDF(COL STRING, PATTERN STRING, MATCHPARAM STRING)
RETURNS BOOLEAN
LANGUAGE JAVASCRIPT
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
return COL.match(new RegExp(PATTERN, MATCHPARAM));
$$;
```

##### Oracle

##### Snowflake

### Known Issues

#### **1. UDF match parameter may not behave as expected**

Due to all the characters available in the Oracle match parameter does not have their equivalent in the user-defined function, the query result may have some functional differences compared to Oracle.

##### 2. UDF pattern parameter does not allow only ‘\’ as a regular expression

If as a pattern parameter the regular expression used is only ‘\’ an exception will be thrown like this: JavaScript execution error: Uncaught SyntaxError: Invalid regular expression: //: \ at end of pattern in REGEXP_LIKE_UDF at ‘return COL.match(new RegExp(PATTERN));’ position 17 stackstrace: REGEXP_LIKE_UDF

## TIMESTAMP DIFF UDF

### Description

Snowflake does not support the addition operation between `TIMESTAMP` data types with the `-` operand. To replicate this functionality, we have added a custom UDF.

### Custom UDF overloads

#### TIMESTAMP_DIFF_UDF(timestamp, timestamp)

**Parameters**

1. **LEFT_TS**: The first `TIMESTAMP` of the operation.
2. **RIGHT_TS**: The `TIMESTAMP` to be added.

##### UDF

```sql
CREATE OR REPLACE FUNCTION TIMESTAMP_DIFF_UDF(LEFT_TS TIMESTAMP, RIGHT_TS TIMESTAMP )
RETURNS VARCHAR
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH RESULTS(days,hours,min,sec,millisecond,sign) AS
(
  SELECT
  abs(TRUNC(x/1000/3600/24)) days,
  abs(TRUNC(x/1000/60 / 60)-trunc(x/1000/3600/24)*24) hours,
  abs(TRUNC(MOD(x/1000,3600)/60)) min,
  abs(TRUNC(MOD(x/1000,60))) sec,
  abs(TRUNC(MOD(x,1000))) millisecond,
  SIGN(x)
  FROM (SELECT TIMESTAMPDIFF(millisecond, RIGHT_TS, LEFT_TS) x ,SIGN(TIMESTAMPDIFF(millisecond, RIGHT_TS, LEFT_TS)) sign))
  SELECT
  IFF(SIGN>0,'+','-') || TRIM(TO_CHAR(days,'000000000')) || ' ' || TO_CHAR(hours,'00') || ':' || TRIM(TO_CHAR(min,'00')) || ':' || TRIM(TO_CHAR(sec,'00')) || '.' || TRIM(TO_CHAR(millisecond,'00000000'))
  from RESULTS
$$;
```

##### Oracle

```sql
--Create Table
CREATE TABLE timestampdiff_table (col1 TIMESTAMP, col2 TIMESTAMP);

--Insert data
INSERT INTO timestampdiff_table VALUES ('2000-03-20 22:01:11', '1996-03-20 10:01:11');
INSERT INTO timestampdiff_table VALUES ('1996-03-20 22:01:11', '2000-03-20 10:01:11');
INSERT INTO timestampdiff_table VALUES ('1982-05-11 22:31:19', '1900-01-25 15:21:15');
INSERT INTO timestampdiff_table VALUES ('1999-12-25 01:15:16', '1900-12-11 02:05:16');

--Select
SELECT col1 - col2 FROM timestampdiff_table;
```

##### Result

| COL1-COL2 |
| --- |
| 1461 12:0:0.0 |
| -1460 12:0:0.0 |
| 30056 7:10:4.0 |
| 36172 23:10:0.0 |

##### Snowflake

```sql
--Create Table
CREATE OR REPLACE TABLE timestampdiff_table (col1 TIMESTAMP(6),
col2 TIMESTAMP(6)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

--Insert data
INSERT INTO timestampdiff_table
VALUES ('2000-03-20 22:01:11', '1996-03-20 10:01:11');

INSERT INTO timestampdiff_table
VALUES ('1996-03-20 22:01:11', '2000-03-20 10:01:11');

INSERT INTO timestampdiff_table
VALUES ('1982-05-11 22:31:19', '1900-01-25 15:21:15');

INSERT INTO timestampdiff_table
VALUES ('1999-12-25 01:15:16', '1900-12-11 02:05:16');

--Select
SELECT
PUBLIC.TIMESTAMP_DIFF_UDF( col1, col2) FROM
timestampdiff_table;
```

##### Result

| TIMESTAMP_DIFF_UDF( COL1, COL2) |
| --- |
| +000001461 12:00:00.00000000 |
| -000001460 12:00:00.00000000 |
| +000030056 07:10:04.00000000 |
| +000036172 23:10:00.00000000 |

### Known Issues

#### 1. TIMESTAMP format may differ from Oracle

The `TIMESTAMP` format may differ from Oracle, please consider the `TIMESTAMP_OUTPUT_FORMAT` [setting](https://docs.snowflake.com/en/user-guide/date-time-input-output.html#output-formats) when working with `TIMESTAMP` data types.

### Related EWIs

No related EWIs.

## TRUNC (date) UDF

### Description

> The `TRUNC` (date) function returns `date` with the time portion of the day truncated to the unit specified by the format model `fmt`. ([Oracle TRUNC(date) SQL Language Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/TRUNC-date.html#GUID-BC82227A-2698-4EC8-8C1A-ABECC64B0E79))

```sql
TRUNC(date [, fmt ])
```

Oracle `TRUNC` and Snowflake `TRUNC` function with date arguments have some functional differences.

`TRUNC_UDF` helper will be added to handle the following cases:

1. The format is not supported by Snowflake.

2. The format exists in Snowflake but works differently.

3. The tool cannot determine the datatype of the first argument.

4. The format is provided as a column or expression and not as a literal.

### Custom UDF overloads

#### TRUNC_UDF(date)

It applies an explicit `DATE` [cast](https://docs.snowflake.com/en/sql-reference/functions/cast.html) to the input Timestamp.

**Parameters**

1. **INPUT**: The Timestamp with Time Zone ([TIMESTAMP_LTZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#timestamp-ltz-timestamp-ntz-timestamp-tz)) that needs to be truncated.

> **Warning:**
>
> The default parameter for the UDF is `TIMESTAMP_LTZ`. It may need to be changed to `TIMESTAMP_TZ` or `TIMESTAMP_NTZ` to match the default `TIMESTAMP` used by the user.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.TRUNC_UDF(INPUT TIMESTAMP_LTZ)
RETURNS DATE
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    INPUT::DATE
$$;
```

##### Oracle

```sql
SELECT
TRUNC(
	TO_TIMESTAMP ( '20-Mar-1996 21:01:11 ', 'DD-Mon-YYYY HH24:MI:SS' )
	)
"Date" FROM DUAL;
```

##### Result

| Date |
| --- |
| 1996-03-20 00:00:00.000 |

##### Snowflake

```sql
SELECT
TRUNC(
	TO_TIMESTAMP ( '20-Mar-1996 21:01:11 ', 'DD-Mon-YYYY HH24:MI:SS' ), 'DD'
	)
"Date" FROM DUAL;
```

##### Result

| DATE |
| --- |
| 1996-03-20 |

#### TRUNC_UDF(date, fmt)

Manually creates a new date using `DATE_FROM_PARTS()` [function](https://docs.snowflake.com/en/sql-reference/functions/date_from_parts.html#date-from-parts), depending on the format category used.

**Parameters**

1. **DATE_TO_TRUNC**: The Timestamp with Time Zone ([TIMESTAMP_LTZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#timestamp-ltz-timestamp-ntz-timestamp-tz)) that needs to be truncated.
2. **DATE_FMT**: The date format as a VARCHAR. Same [formats that are supported](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/ROUND-and-TRUNC-Date-Functions.html#GUID-8E10AB76-21DA-490F-A389-023B648DDEF8) in Oracle.

> **Warning:**
>
> The default parameter for the UDF is `TIMESTAMP_LTZ`. It may need to be changed to `TIMESTAMP_TZ` or `TIMESTAMP_NTZ` to match the default `TIMESTAMP` used by the user.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.TRUNC_UDF(DATE_TO_TRUNC TIMESTAMP_LTZ, DATE_FMT VARCHAR(5))
RETURNS DATE
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
CAST(CASE
WHEN UPPER(DATE_FMT) IN ('CC','SCC') THEN DATE_FROM_PARTS(CAST(LEFT(CAST(YEAR(DATE_TO_TRUNC) as CHAR(4)),2) || '01' as INTEGER),1,1)
WHEN UPPER(DATE_FMT) IN ('SYYYY','YYYY','YEAR','SYEAR','YYY','YY','Y') THEN DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1)
WHEN UPPER(DATE_FMT) IN ('IYYY','IYY','IY','I') THEN
    CASE DAYOFWEEK(DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 0 THEN DATEADD(DAY, 1, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 1 THEN DATEADD(DAY, 0, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 2 THEN DATEADD(DAY, -1, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 3 THEN DATEADD(DAY, -2, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 4 THEN DATEADD(DAY, -3, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 5 THEN DATEADD(DAY, 3, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
         WHEN 6 THEN DATEADD(DAY, 2, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
    END
WHEN UPPER(DATE_FMT) IN ('MONTH','MON','MM','RM') THEN DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),MONTH(DATE_TO_TRUNC),1)
WHEN UPPER(DATE_FMT)IN ('Q') THEN DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),(QUARTER(DATE_TO_TRUNC)-1)*3+1,1)
WHEN UPPER(DATE_FMT) IN ('WW') THEN DATEADD(DAY, 0-MOD(TIMESTAMPDIFF(DAY,DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1),DATE_TO_TRUNC),7), DATE_TO_TRUNC)
WHEN UPPER(DATE_FMT) IN ('IW') THEN DATEADD(DAY, 0-MOD(TIMESTAMPDIFF(DAY,(CASE DAYOFWEEK(DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 0 THEN DATEADD(DAY, 1, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 1 THEN DATEADD(DAY, 0, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 2 THEN DATEADD(DAY, -1, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 3 THEN DATEADD(DAY, -2, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 4 THEN DATEADD(DAY, -3, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 5 THEN DATEADD(DAY, 3, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                                 WHEN 6 THEN DATEADD(DAY, 2, DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),1,1))
                                                               END),      DATE_TO_TRUNC),7), DATE_TO_TRUNC)
WHEN UPPER(DATE_FMT) IN ('W') THEN DATEADD(DAY, 0-MOD(TIMESTAMPDIFF(DAY,DATE_FROM_PARTS(YEAR(DATE_TO_TRUNC),MONTH(DATE_TO_TRUNC),1),DATE_TO_TRUNC),7), DATE_TO_TRUNC)
WHEN UPPER(DATE_FMT) IN ('DDD', 'DD','J') THEN DATE_TO_TRUNC
WHEN UPPER(DATE_FMT) IN ('DAY', 'DY','D') THEN DATEADD(DAY, 0-DAYOFWEEK(DATE_TO_TRUNC), DATE_TO_TRUNC)
WHEN UPPER(DATE_FMT) IN ('HH', 'HH12','HH24') THEN DATE_TO_TRUNC
WHEN UPPER(DATE_FMT) IN ('MI') THEN DATE_TO_TRUNC
END AS DATE)
$$
;
```

### TRUNC format scenarios

> **Warning:**
>
> The results format depends on the DateTime output formats configurated for the database.

#### 1. Natively supported formats

##### Oracle

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YYYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YEAR') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'Y') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'Q') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MONTH') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MON') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MM') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DD') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'HH') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MI') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’YYYY’) |
| --- |
| 01-JAN-22 |
| 01-JAN-22 |
| 01-JAN-22 |
| 01-JAN-22 |
| 01-JAN-22 |
| 01-APR-22 |
| 01-APR-22 |
| 01-APR-22 |
| 01-APR-22 |
| 20-APR-22 |
| 20-APR-22 |
| 20-APR-22 |

##### Snowflake

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YYYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YEAR') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'YY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'Y') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'Q') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MONTH') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MON') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MM') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DD') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'HH') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'MI') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’YYYY’) |
| --- |
| 2022-01-01 |
| 2022-01-01 |
| 2022-01-01 |
| 2022-01-01 |
| 2022-01-01 |
| 2022-04-01 |
| 2022-04-01 |
| 2022-04-01 |
| 2022-04-01 |
| 2022-04-20 |
| 2022-04-20 |
| 2022-04-20 |

#### 2. Formats mapped to another format

##### Oracle

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS')) FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'SYYYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'SYEAR') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'RM') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'IW') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DDD') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'J') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'HH12') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'HH24') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’)) |
| --- |
| 20-APR-22 |
| 01-JAN-22 |
| 01-JAN-22 |
| 01-APR-22 |
| 18-APR-22 |
| 20-APR-22 |
| 20-APR-22 |
| 20-APR-22 |
| 20-APR-22 |

##### Snowflake

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'DD') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'YYYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'YEAR') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'MM') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'WK') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'DD') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'D') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'HH') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'), 'HH') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’), ‘DD’) |
| --- |
| 2022-04-20 |
| 2022-01-01 |
| 2022-01-01 |
| 2022-04-01 |
| 2022-04-18 |
| 2022-04-20 |
| 2022-04-20 |
| 2022-04-20 |
| 2022-04-20 |

#### 3. Day formats

##### Oracle

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DAY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'D') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’DAY’) |
| --- |
| 17-APR-22 |
| 17-APR-22 |
| 17-APR-22 |

##### Snowflake

```sql
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DAY') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'DY') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'D') FROM DUAL;
```

##### Result

| TRUNC_UDF(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’DAY’) |
| --- |
| 2022-04-17 |
| 2022-04-17 |
| 2022-04-17 |

#### 4. Unsupported formats

##### Oracle

```sql
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'CC') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'SCC') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'IYYY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'IY') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'I') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'WW') FROM DUAL UNION ALL
SELECT TRUNC(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'W') FROM DUAL;
```

##### Result

| TRUNC(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’CC’) |
| --- |
| 01-JAN-01 |
| 01-JAN-01 |
| 03-JAN-22 |
| 03-JAN-22 |
| 03-JAN-22 |
| 16-APR-22 |
| 15-APR-22 |

##### Snowflake

```sql
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'CC') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'SCC') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'IYYY') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'IY') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'I') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'WW') FROM DUAL UNION ALL
SELECT
TRUNC_UDF(TO_DATE('20/04/2022 13:21:10','DD/MM/YYYY HH24:MI:SS'),'W') FROM DUAL;
```

##### Result

| TRUNC_UDF(TO_DATE(‘20/04/2022 13:21:10’,’DD/MM/YYYY HH24:MI:SS’),’CC’) |
| --- |
| 2001-01-01 |
| 2001-01-01 |
| 2022-01-03 |
| 2022-01-03 |
| 2022-01-03 |
| 2022-04-16 |
| 2022-04-15 |

> **Note:**
>
> When the `TRUNC` function is used with an unsupported format or a parameter that cannot be handled by SnowConvert AI. To avoid any issues, the format is replaced with a valid format, or `TRUNC_UDF` is added.

### Known Issues

#### 1. Oracle DATE contains TIMESTAMP

Take into consideration that Oracle `DATE` contains an empty `TIMESTAMP` (00:00:00.000), while Snowflake `DATE` does not.

### Related EWIs

No related EWIs.

## TRUNC (number) UDF

### Description

> The `TRUNC` (number) function returns `n1` truncated to `n2` decimal places. If `n2` is omitted, then `n1` is truncated to 0 places. `n2` can be negative to truncate (make zero) `n2` digits left of the decimal point. ([Oracle TRUNC(number) SQL Language Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/TRUNC-number.html#GUID-911AE7FE-E04A-471D-8B0E-9C50EBEFE07D))

```none
TRUNC(n1 [, n2 ])
```

TRUNC_UDF for numeric values will be added to handle cases **where the first column has an unrecognized data type.**

Example:

```sql
SELECT TRUNC(column1) FROM DUAL;
```

If the definition of `column1` was not provided to the tool. Then the `TRUNC_UDF` will be added and in execution time, the overload of `TRUNC_UDF` will handle the case if it is a numeric or a date type.

Please refer to [TRUNC (DATE)](README.md) section.

The following sections provide the proof that `TRUNC_UDF` will handle perfectly numeric values.

### Custom UDF overloads

#### TRUNC_UDF(n1)

It calls Snowflake `TRUNC` [function](https://docs.snowflake.com/en/sql-reference/functions/trunc.html#truncate-trunc) with the input number. This overload exists to handle the different types of parameter scenarios, in case that information is not available during the migration.

**Parameters**

1. **INPUT**: The `NUMBER` that needs to be truncated.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.TRUNC_UDF(INPUT NUMBER)
RETURNS INT
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    TRUNC(INPUT)
$$;
```

##### Oracle

```sql
--TRUNC(NUMBER)
SELECT
	TRUNC ( 1.000001 ),
	TRUNC ( 15.79 ),
	TRUNC ( -975.975 ),
	TRUNC ( 135.135 )
FROM DUAL;
```

##### Result

| TRUNC(1.000001) | TRUNC(15.79) | TRUNC(-975.975) | TRUNC(135.135) |
| --- | --- | --- | --- |
| 1 | 15 | -975 | 135 |

##### Snowflake

```sql
--TRUNC(NUMBER)
SELECT
	TRUNC ( 1.000001 ),
	TRUNC ( 15.79 ),
	TRUNC ( -975.975 ),
	TRUNC ( 135.135 )
FROM DUAL;
```

##### Result

| TRUNC_UDF(1.000001) | TRUNC_UDF(15.79) | TRUNC_UDF(-975.975) | TRUNC_UDF(135.135) |
| --- | --- | --- | --- |
| 1 | 15 | -975 | 135 |

#### TRUNC_UDF(n1, n2)

It calls Snowflake `TRUNC` [function](https://docs.snowflake.com/en/sql-reference/functions/trunc.html#truncate-trunc) with the input number and the scale. This overload exists to handle the different types of parameter scenarios, in case that information is not available during the migration.

**Parameters**

1. **INPUT**: The `NUMBER` that needs to be truncated.
2. **SCALE**: Represents the number of digits the output will include after the decimal point.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.TRUNC_UDF(INPUT NUMBER, SCALE NUMBER)
RETURNS INT
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    TRUNC(INPUT, SCALE)
$$;
```

##### Oracle

```sql
--TRUNC(NUMBER, SCALE)
SELECT
	TRUNC ( 1.000001, -2 ),
	TRUNC ( 1.000001, -1 ),
	TRUNC ( 1.000001, 0 ),
	TRUNC ( 1.000001, 1 ),
	TRUNC ( 1.000001, 2 ),
	TRUNC ( 15.79, -2),
	TRUNC ( 15.79, -1),
	TRUNC ( 15.79, 0),
	TRUNC ( 15.79, 1 ),
	TRUNC ( 15.79, 50 ),
	TRUNC ( -9.6, -2 ),
	TRUNC ( -9.6, -1 ),
	TRUNC ( -9.6, 0 ),
	TRUNC ( -9.6, 1 ),
	TRUNC ( -9.6, 2 ),
	TRUNC ( -975.975, -3 ),
	TRUNC ( -975.975, -2 ),
	TRUNC ( -975.975, -1 ),
	TRUNC ( -975.975, 0 ),
	TRUNC ( -975.975, 1 ),
	TRUNC ( -975.975, 2 ),
	TRUNC ( -975.975, 3 ),
	TRUNC ( -975.975, 5 ),
	TRUNC ( 135.135, -10 ),
	TRUNC ( 135.135, -2 ),
	TRUNC ( 135.135, 0 ),
	TRUNC ( 135.135, 1 ),
	TRUNC ( 135.135, 2 ),
	TRUNC ( 135.135, 3 ),
	TRUNC ( 135.135, 5 )
FROM DUAL;
```

##### Result

| TRUNC(1.000001,-2) | TRUNC(1.000001,-1) | TRUNC(1.000001,0) | TRUNC(1.000001,1) | TRUNC(1.000001,2) | TRUNC(15.79,-2) | TRUNC(15.79,-1) | TRUNC(15.79,0) | TRUNC(15.79,1) | TRUNC(15.79,50) | TRUNC(-9.6,-2) | TRUNC(-9.6,-1) | TRUNC(-9.6,0) | TRUNC(-9.6,1) | TRUNC(-9.6,2) | TRUNC(-975.975,-3) | TRUNC(-975.975,-2) | TRUNC(-975.975,-1) | TRUNC(-975.975,0) | TRUNC(-975.975,1) | TRUNC(-975.975,2) | TRUNC(-975.975,3) | TRUNC(-975.975,5) | TRUNC(135.135,-10) | TRUNC(135.135,-2) | TRUNC(135.135,0) | TRUNC(135.135,1) | TRUNC(135.135,2) | TRUNC(135.135,3) | TRUNC(135.135,5) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | 0 | 1 | 1 | 1 | 0 | 10 | 15 | 15.7 | 15.79 | 0 | 0 | -9 | -9.6 | -9.6 | 0 | -900 | -970 | -975 | -975.9 | -975.97 | -975.975 | -975.975 | 0 | 100 | 135 | 135.1 | 135.13 | 135.135 | 135.135 |

##### Snowflake

```sql
--TRUNC(NUMBER, SCALE)
SELECT
	TRUNC ( 1.000001, -2 ),
	TRUNC ( 1.000001, -1 ),
	TRUNC ( 1.000001, 0 ),
	TRUNC ( 1.000001, 1 ),
	TRUNC ( 1.000001, 2 ),
	TRUNC ( 15.79, -2),
	TRUNC ( 15.79, -1),
	TRUNC ( 15.79, 0),
	TRUNC ( 15.79, 1 ),
	TRUNC ( 15.79, 50 ),
	TRUNC ( -9.6, -2 ),
	TRUNC ( -9.6, -1 ),
	TRUNC ( -9.6, 0 ),
	TRUNC ( -9.6, 1 ),
	TRUNC ( -9.6, 2 ),
	TRUNC ( -975.975, -3 ),
	TRUNC ( -975.975, -2 ),
	TRUNC ( -975.975, -1 ),
	TRUNC ( -975.975, 0 ),
	TRUNC ( -975.975, 1 ),
	TRUNC ( -975.975, 2 ),
	TRUNC ( -975.975, 3 ),
	TRUNC ( -975.975, 5 ),
	TRUNC ( 135.135, -10 ),
	TRUNC ( 135.135, -2 ),
	TRUNC ( 135.135, 0 ),
	TRUNC ( 135.135, 1 ),
	TRUNC ( 135.135, 2 ),
	TRUNC ( 135.135, 3 ),
	TRUNC ( 135.135, 5 )
FROM DUAL;
```

##### Result

| TRUNC_UDF ( 1.000001, -2 ) | TRUNC_UDF ( 1.000001, -1 ) | TRUNC_UDF ( 1.000001, 0 ) | TRUNC_UDF ( 1.000001, 1 ) | TRUNC_UDF ( 1.000001, 2 ) | TRUNC_UDF ( 15.79, -2) | TRUNC_UDF ( 15.79, -1) | TRUNC_UDF ( 15.79, 0) | TRUNC_UDF ( 15.79, 1 ) | TRUNC_UDF ( 15.79, 50 ) | TRUNC_UDF ( -9.6, -2 ) | TRUNC_UDF ( -9.6, -1 ) | TRUNC_UDF ( -9.6, 0 ) | TRUNC_UDF ( -9.6, 1 ) | TRUNC_UDF ( -9.6, 2 ) | TRUNC_UDF ( -975.975, -3 ) | TRUNC_UDF ( -975.975, -2 ) | TRUNC_UDF ( -975.975, -1 ) | TRUNC_UDF ( -975.975, 0 ) | TRUNC_UDF ( -975.975, 1 ) | TRUNC_UDF ( -975.975, 2 ) | TRUNC_UDF ( -975.975, 3 ) | TRUNC_UDF ( -975.975, 5 ) | TRUNC_UDF ( 135.135, -10 ) | TRUNC_UDF ( 135.135, -2 ) | TRUNC_UDF ( 135.135, 0 ) | TRUNC_UDF ( 135.135, 1 ) | TRUNC_UDF ( 135.135, 2 ) | TRUNC_UDF ( 135.135, 3 ) | TRUNC_UDF ( 135.135, 5 ) |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| 0 | 0 | 1 | 1.0 | 1.00 | 0 | 10 | 15 | 15.7 | 15.79 | 0 | 0 | -9 | -9.6 | -9.6 | 0 | -900 | -970 | -975 | -975.9 | -975.97 | -975.975 | -975.975 | 0 | 100 | 135 | 135.1 | 135.13 | 135.135 | 135.135 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

# SnowConvert AI - Oracle - INTERVAL UDFs

## Necessary code to run INTERVAL UDFs

To run any of the interval UDFs, it is necessary to run the following code before:

```sql
CREATE OR REPLACE FUNCTION PUBLIC.INTERVAL2MONTHS_UDF
(INPUT_VALUE VARCHAR())
RETURNS INTEGER
IMMUTABLE
AS
$$
CASE WHEN SUBSTR(INPUT_VALUE,1,1) = '-' THEN
   12 * CAST(SUBSTR(INPUT_VALUE,1 , POSITION('-', INPUT_VALUE,2)-1) AS INTEGER)
   - CAST(SUBSTR(INPUT_VALUE,POSITION('-', INPUT_VALUE)+1) AS INTEGER)
ELSE
   12 * CAST(SUBSTR(INPUT_VALUE,1 , POSITION('-', INPUT_VALUE,2)-1) AS INTEGER)
   + CAST(SUBSTR(INPUT_VALUE,POSITION('-', INPUT_VALUE)+1) AS INTEGER)
END
$$;

CREATE OR REPLACE FUNCTION PUBLIC.INTERVAL2SECONDS_UDF
(INPUT_PART VARCHAR(30), INPUT_VALUE VARCHAR())
RETURNS DECIMAL(20,6)
IMMUTABLE
AS
$$
CASE WHEN SUBSTR(INPUT_VALUE,1,1) = '-' THEN
   DECODE(INPUT_PART,
           'DAY',              86400 * INPUT_VALUE,
           'DAY TO HOUR',      86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS DECIMAL(10,0))
                               - 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1) AS DECIMAL(10,0)),
           'DAY TO MINUTE',    86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) AS INTEGER),
           'DAY TO SECOND',    86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               - CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'DAY TO SECOND(3)',  86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               - CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'HOUR(3)',          3600 * INPUT_VALUE,
           'HOUR',             3600 * INPUT_VALUE,
           'HOUR TO MINUTE',   3600 * CAST(SUBSTR(INPUT_VALUE,1 , POSITION(':', INPUT_VALUE)-1) AS INTEGER)
                               - 60 * CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE)+1) AS INTEGER),
           'HOUR TO SECOND',   3600 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               - CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'MINUTE',           60 * INPUT_VALUE,
           'MINUTE TO SECOND', 60 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               - CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) AS DECIMAL(10,6)),
           'SECOND(2,3)',      INPUT_VALUE,
           'SECOND',           INPUT_VALUE
            )
ELSE
   DECODE(INPUT_PART,
           'DAY',              86400 * INPUT_VALUE,
           'DAY TO HOUR',      86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1) AS INTEGER),
           'DAY TO MINUTE',    86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) AS INTEGER),
           'DAY TO SECOND',    86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               + CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'DAY TO SECOND(3)',    86400 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 3600 * CAST(SUBSTR(INPUT_VALUE, POSITION(' ', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               + CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'HOUR(3)',          3600 * INPUT_VALUE,
           'HOUR',             3600 * INPUT_VALUE,
           'HOUR TO MINUTE',   3600 * CAST(SUBSTR(INPUT_VALUE,1 , POSITION(':', INPUT_VALUE)-1) AS INTEGER)
                               + 60 * CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE)+1) AS INTEGER),
           'HOUR TO SECOND',   3600 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + 60 * CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1, POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) - POSITION(':', INPUT_VALUE) - 1) AS INTEGER)
                               + CAST(SUBSTR(INPUT_VALUE,POSITION(':', INPUT_VALUE, POSITION(':', INPUT_VALUE)+1)+1) AS DECIMAL(10,6)),
           'MINUTE',           60 * INPUT_VALUE,
           'MINUTE TO SECOND', 60 * CAST(SUBSTR(INPUT_VALUE, 1, POSITION(':', INPUT_VALUE)-POSITION(' ', INPUT_VALUE)-1) AS INTEGER)
                               + CAST(SUBSTR(INPUT_VALUE, POSITION(':', INPUT_VALUE)+1) AS DECIMAL(10,6)),
           'SECOND(2,3)',      INPUT_VALUE,
           'SECOND',           INPUT_VALUE
        )
END
$$;
```

## DATEADD UDF INTERVAL

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This UDF is used to resolve operations with intervals like:

* INTERVAL + DATE
* INTERVAL + TIMESTAMP
* DATE + INTERVAL
* DATE + TIMESTAMP
* INTERVAL + UNKNOWN
* UNKNOWN + INTERVAL

> **Note:**
>
> An UNKNOWN type is a column or expression whose type could not be resolved by Snow Convert, it tends to happen when the DDLs for tables are not included in the migration or when there is an expression or subquery that can return different data types.

### Custom UDF overloads

#### DATEADD_UDF(string, date)

**Parameters**

1. **INTERVAL_VALUE**: The interval `String` of the operation.
2. **D**: The `DATE` where the interval will be added.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(INTERVAL_VALUE STRING,D DATE)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT

    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)::DATE
    END CASE
FROM VARS
$$;
```

#### DATEADD_UDF(date, string)

**Parameters**

1. **D**: The `DATE` where the interval will be added.
2. **INTERVAL_VALUE**: The interval `String` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(D DATE, INTERVAL_VALUE STRING)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT

    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)::DATE
    END CASE
FROM VARS
$$;
```

#### DATEADD_UDF(string, timestamp)

**Parameters**

1. **INTERVAL_VALUE**: The interval `String` of the operation.
2. **D**: The `TIMESTAMP` where the interval will be added.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(INTERVAL_VALUE STRING,D TIMESTAMP)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT

    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)
    END CASE
FROM VARS
$$;
```

#### DATEADD_UDF(timestamp, string)

**Parameters**

1. **D**: The `TIMESTAMP` where the interval will be added.
2. **INTERVAL_VALUE**: The interval `String` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEADD_UDF(D TIMESTAMP, INTERVAL_VALUE STRING)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT

    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)
    END CASE
FROM VARS
$$;
```

#### Usage example

> **Note:**
>
> **`--disableDateAsTimestamp`**
>
> Flag to indicate whether `SYSDATE` should be transformed into `CURRENT_DATE` *or* `CURRENT_TIMESTAMP`. This will also affect all `DATE` columns that will be transformed to `TIMESTAMP`.

##### Oracle

```sql
-- DROP TABLE UNKNOWN_TABLE;
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT  INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));

CREATE TABLE TIMES(
AsTimeStamp TIMESTAMP,
AsTimestampTwo TIMESTAMP,
AsDate DATE,
AsDateTwo DATE
);

INSERT INTO TIMES VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));

SELECT
 AsTimeStamp+INTERVAL '1-1' YEAR(2) TO MONTH,
 AsTimeStamp+INTERVAL '2-1' YEAR(4) TO MONTH,
 AsTimeStamp+INTERVAL '1' MONTH,
 AsTimeStamp+INTERVAL '2' MONTH,
 AsDate+INTERVAL '1-1' YEAR(2) TO MONTH,
 AsDate+INTERVAL '2-1' YEAR(4) TO MONTH,
 AsDate+INTERVAL '1' MONTH,
 AsDate+INTERVAL '2' MONTH,
 Unknown+INTERVAL '1 01:00:00.222' DAY TO SECOND(3),
 Unknown+INTERVAL '1 01:10' DAY TO MINUTE,
 Unknown+INTERVAL '1 1' DAY TO HOUR,
 INTERVAL '1' MONTH+AsTimeStamp,
 INTERVAL '1' MONTH+AsDate,
 INTERVAL '1' MONTH+Unknown,
 INTERVAL '2' MONTH+AsTimeStamp,
 INTERVAL '2' MONTH+AsDate,
 INTERVAL '2' MONTH+Unknown
FROM TIMES, UNKNOWN_TABLE;
```

##### Results

```none
|ASTIMESTAMP+INTERVAL'1-1'YEAR(2)TOMONTH|ASTIMESTAMP+INTERVAL'2-1'YEAR(4)TOMONTH|ASTIMESTAMP+INTERVAL'1'MONTH|ASTIMESTAMP+INTERVAL'2'MONTH|ASDATE+INTERVAL'1-1'YEAR(2)TOMONTH|ASDATE+INTERVAL'2-1'YEAR(4)TOMONTH|ASDATE+INTERVAL'1'MONTH|ASDATE+INTERVAL'2'MONTH|UNKNOWN+INTERVAL'101:00:00.222'DAYTOSECOND(3)|UNKNOWN+INTERVAL'101:10'DAYTOMINUTE|UNKNOWN+INTERVAL'11'DAYTOHOUR|INTERVAL'1'MONTH+ASTIMESTAMP|INTERVAL'1'MONTH+ASDATE|INTERVAL'1'MONTH+UNKNOWN|INTERVAL'2'MONTH+ASTIMESTAMP|INTERVAL'2'MONTH+ASDATE|INTERVAL'2'MONTH+UNKNOWN|
|---------------------------------------|---------------------------------------|----------------------------|----------------------------|----------------------------------|----------------------------------|-----------------------|-----------------------|---------------------------------------------|-----------------------------------|-----------------------------|----------------------------|-----------------------|------------------------|----------------------------|-----------------------|------------------------|
|2022-12-05 11:00:00.000                |2023-12-05 11:00:00.000                |2021-12-05 11:00:00.000     |2022-01-05 11:00:00.000     |2022-12-06 00:00:00.000           |2023-12-06 00:00:00.000           |2021-12-06 00:00:00.000|2022-01-06 00:00:00.000|2009-10-02 13:00:00.222                      |2009-10-02 13:10:00.000            |2009-10-02 13:00:00.000      |2021-12-05 11:00:00.000     |2021-12-06 00:00:00.000|2009-11-01 12:00:00.000 |2022-01-05 11:00:00.000     |2022-01-06 00:00:00.000|2009-12-01 12:00:00.000 |
```

##### Snowflake

> **Note:**
>
> This configuration was used in Snowflake

```sql
ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT= 'DD-MON-YY HH.MI.SS.FF6 AM';
ALTER SESSION SET DATE_OUTPUT_FORMAT= 'DD-MON-YY';
```

```sql
-- DROP TABLE UNKNOWN_TABLE;
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT  INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));
CREATE OR REPLACE TABLE TIMES (
 AsTimeStamp TIMESTAMP(6),
 AsTimestampTwo TIMESTAMP(6),
 AsDate TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
 AsDateTwo TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
 )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO TIMES
VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "UNKNOWN_TABLE" **

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '1y, 1mm',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '2y, 1mm',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '1 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '2 month',
 AsDate+ INTERVAL '1y, 1mm',
 AsDate+ INTERVAL '2y, 1mm',
 AsDate+ INTERVAL '1 month',
 AsDate+ INTERVAL '2 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown + INTERVAL '1d, 01h, 00m, 00s, 222ms',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown + INTERVAL '1d, 01h, 10m',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown + INTERVAL '1d, 1h',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '1 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsDate + INTERVAL '1 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown + INTERVAL '1 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp + INTERVAL '2 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsDate + INTERVAL '2 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown + INTERVAL '2 month'
FROM
 TIMES,
 UNKNOWN_TABLE;
```

##### Results

```none
|DATEADD_UDF(ASTIMESTAMP,'INTERVAL ''1-1'' YEAR(2) TO MONTH')|DATEADD_UDF(ASTIMESTAMP,'INTERVAL ''2-1'' YEAR(4) TO MONTH')|DATEADD_UDF(ASTIMESTAMP,'INTERVAL ''1'' MONTH')|DATEADD_UDF(ASTIMESTAMP,'INTERVAL ''2'' MONTH')|DATEADD_UDF(ASDATE,'INTERVAL ''1-1'' YEAR(2) TO MONTH')|DATEADD_UDF(ASDATE,'INTERVAL ''2-1'' YEAR(4) TO MONTH')|DATEADD_UDF(ASDATE,'INTERVAL ''1'' MONTH')|DATEADD_UDF(ASDATE,'INTERVAL ''2'' MONTH')|DATEADD_UDF(UNKNOWN,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)')|DATEADD_UDF(UNKNOWN,'INTERVAL ''1 01:10'' DAY TO MINUTE')|DATEADD_UDF(UNKNOWN,'INTERVAL ''1 1'' DAY TO HOUR')|DATEADD_UDF('INTERVAL ''1'' MONTH',ASTIMESTAMP)|DATEADD_UDF('INTERVAL ''1'' MONTH',ASDATE)|DATEADD_UDF('INTERVAL ''1'' MONTH',UNKNOWN)|DATEADD_UDF('INTERVAL ''2'' MONTH',ASTIMESTAMP)|DATEADD_UDF('INTERVAL ''2'' MONTH',ASDATE)|DATEADD_UDF('INTERVAL ''2'' MONTH',UNKNOWN)|
|------------------------------------------------------------|------------------------------------------------------------|-----------------------------------------------|-----------------------------------------------|-------------------------------------------------------|-------------------------------------------------------|------------------------------------------|------------------------------------------|-------------------------------------------------------------------|---------------------------------------------------------|---------------------------------------------------|-----------------------------------------------|------------------------------------------|-------------------------------------------|-----------------------------------------------|------------------------------------------|-------------------------------------------|
|2022-12-05 11:00:00.000                                     |2023-12-05 11:00:00.000                                     |2021-12-05 11:00:00.000                        |2022-01-05 11:00:00.000                        |2022-12-06                                             |2023-12-06                                             |2021-12-06                                |2022-01-06                                |2009-10-02 13:00:00.222                                            |2009-10-02 13:10:00.000                                  |2009-10-02 13:00:00.000                            |2021-12-05 11:00:00.000                        |2021-12-06                                |2009-11-01 12:00:00.000                    |2022-01-05 11:00:00.000                        |2022-01-06                                |2009-12-01 12:00:00.000                    |
```

### Known Issues

#### 1. INTERVAL + INTERVAL Operation is not supported

Snowflake does not support INTERVAL + INTERVAL operations.

### Related EWIs

1. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
2. [SSC-EWI-OR0095](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Operation Between Interval Type and Date Type not Supported.
3. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

## DATEDIFF UDF INTERVAL

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This UDF is used to resolve operations with intervals like:

* INTERVAL - UNKNOWN
* UNKNOWN - INTERVAL
* DATE - INTERVAL
* TIMESTAMP - INTERVAL

> **Note:**
>
> An UNKNOWN type is a column or expression whose type could not be resolved by Snow Convert, it tends to happen when the DDLs for tables are not included in the migration or when there is an expression or subquery that can return different data types.

### Custom UDF overloads

#### DATEADD_DDIF(string, date)

**Parameters**

1. **INTERVAL_VALUE**: The interval `String` of the operation.
2. **D**: The `DATE` where the interval will be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(INTERVAL_VALUE STRING,D DATE)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT
    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,-1*PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,-1*TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,-1*1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)::DATE
    END CASE
FROM VARS
$$;
```

#### DATEADD_DIFF(date, string)

**Parameters**

1. **D**: The `DATE` where the interval will be subtracted.
2. **INTERVAL_VALUE**: The interval `String` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(D DATE, INTERVAL_VALUE STRING)
RETURNS DATE
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT
    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,-1*PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,-1*TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,-1*1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)::DATE
    END CASE
FROM VARS
$$;
```

#### DATEADD_DIFF(string, timestamp)

**Parameters**

1. **INTERVAL_VALUE**: The interval `String` of the operation.
2. **D**: The `TIMESTAMP` where the interval will be subtracted.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(INTERVAL_VALUE STRING,D TIMESTAMP)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT
    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,-1*PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,-1*TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,-1*1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)
    END CASE
FROM VARS
$$;
```

#### DATEADD_DIFF(timestamp, string)

**Parameters**

1. **D**: The `TIMESTAMP` where the interval will be subtracted.
2. **INTERVAL_VALUE**: The interval `String` of the operation.

##### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.DATEDIFF_UDF(D TIMESTAMP, INTERVAL_VALUE STRING)
RETURNS TIMESTAMP
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH VARS(INPUT_VALUE, INPUT_PART) AS (
SELECT SUBSTR(INTERVAL_VALUE,11,POSITION('''',INTERVAL_VALUE,11)-11),
       TRIM(SUBSTR(INTERVAL_VALUE,POSITION('''',INTERVAL_VALUE,11)+1)))
SELECT
    CASE WHEN INPUT_PART='YEAR(2) TO MONTH' OR INPUT_PART='YEAR(4) TO MONTH' THEN
        DATEADD(MONTHS,-1*PUBLIC.INTERVAL_TO_MONTHS_UDF(INPUT_VALUE),D)
    WHEN INPUT_PART='MONTH' THEN
        DATEADD(MONTHS,-1*TO_NUMBER(INPUT_VALUE),D)
    ELSE
        DATEADD(MICROSECONDS,-1*1000000*PUBLIC.INTERVAL_TO_SECONDS_UDF(INPUT_PART, INPUT_VALUE),D)
    END CASE
FROM VARS
$$;
```

#### Usage example

> **Note:**
>
> **`--disableDateAsTimestamp`**
>
> Flag to indicate whether `SYSDATE` should be transformed into `CURRENT_DATE` *or* `CURRENT_TIMESTAMP`. This will also affect all `DATE` columns that will be transformed to `TIMESTAMP`.

##### Oracle

```sql
-- DROP TABLE UNKNOWN_TABLE;
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT  INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));

CREATE TABLE TIMES(
AsTimeStamp TIMESTAMP,
AsTimestampTwo TIMESTAMP,
AsDate DATE,
AsDateTwo DATE
);

INSERT INTO TIMES VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));

SELECT
 AsTimeStamp-INTERVAL '1-1' YEAR(2) TO MONTH,
 AsTimeStamp-INTERVAL '2-1' YEAR(4) TO MONTH,
 AsTimeStamp-INTERVAL '1' MONTH,
 AsTimeStamp-INTERVAL '2' MONTH,
 AsDate-INTERVAL '1-1' YEAR(2) TO MONTH,
 AsDate-INTERVAL '2-1' YEAR(4) TO MONTH,
 AsDate-INTERVAL '1' MONTH,
 AsDate-INTERVAL '2' MONTH,
 Unknown-INTERVAL '1 01:00:00.222' DAY TO SECOND(3),
 Unknown-INTERVAL '1 01:10' DAY TO MINUTE,
 Unknown-INTERVAL '1 1' DAY TO HOUR
FROM TIMES, UNKNOWN_TABLE;
```

##### Result

```none
|ASTIMESTAMP-INTERVAL'1-1'YEAR(2)TOMONTH|ASTIMESTAMP-INTERVAL'2-1'YEAR(4)TOMONTH|ASTIMESTAMP-INTERVAL'1'MONTH|ASTIMESTAMP-INTERVAL'2'MONTH|ASDATE-INTERVAL'1-1'YEAR(2)TOMONTH|ASDATE-INTERVAL'2-1'YEAR(4)TOMONTH|ASDATE-INTERVAL'1'MONTH|ASDATE-INTERVAL'2'MONTH|UNKNOWN-INTERVAL'101:00:00.222'DAYTOSECOND(3)|UNKNOWN-INTERVAL'101:10'DAYTOMINUTE|UNKNOWN-INTERVAL'11'DAYTOHOUR|
|---------------------------------------|---------------------------------------|----------------------------|----------------------------|----------------------------------|----------------------------------|-----------------------|-----------------------|---------------------------------------------|-----------------------------------|-----------------------------|
|2020-10-05 11:00:00.000                |2019-10-05 11:00:00.000                |2021-10-05 11:00:00.000     |2021-09-05 11:00:00.000     |2020-10-06 00:00:00.000           |2019-10-06 00:00:00.000           |2021-10-06 00:00:00.000|2021-09-06 00:00:00.000|2009-09-30 10:59:59.778                      |2009-09-30 10:50:00.000            |2009-09-30 11:00:00.000      |
```

##### Snowflake

> **Note:**
>
> This configuration was used in Snowflake

```sql
ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT= 'DD-MON-YY HH.MI.SS.FF6 AM';
ALTER SESSION SET DATE_OUTPUT_FORMAT= 'DD-MON-YY';
```

```sql
-- DROP TABLE UNKNOWN_TABLE;
-- CREATE TABLE UNKNOWN_TABLE(Unknown timestamp);
-- INSERT  INTO UNKNOWN_TABLE VALUES (TO_TIMESTAMP('01/10/09, 12:00 P.M.', 'dd/mm/yy, hh:mi P.M.'));
CREATE OR REPLACE TABLE TIMES (
 AsTimeStamp TIMESTAMP(6),
 AsTimestampTwo TIMESTAMP(6),
 AsDate TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
 AsDateTwo TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
 )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO TIMES
VALUES (
TO_TIMESTAMP('05/11/21, 11:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_TIMESTAMP('05/11/21, 10:00 A.M.', 'dd/mm/yy, hh:mi A.M.'),
TO_DATE('06/11/21', 'dd/mm/yy'),
TO_DATE('05/11/21', 'dd/mm/yy'));

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "UNKNOWN_TABLE" **

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp - INTERVAL '1y, 1mm',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp - INTERVAL '2y, 1mm',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp - INTERVAL '1 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!!
 AsTimeStamp - INTERVAL '2 month',
 AsDate- INTERVAL '1y, 1mm',
 AsDate- INTERVAL '2y, 1mm',
 AsDate- INTERVAL '1 month',
 AsDate- INTERVAL '2 month',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown - INTERVAL '1d, 01h, 00m, 00s, 222ms',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown - INTERVAL '1d, 01h, 10m',
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!!
 Unknown - INTERVAL '1d, 1h'
FROM
 TIMES,
 UNKNOWN_TABLE;
```

##### Result

```none
|DATEDIFF_UDF(ASTIMESTAMP,'INTERVAL ''1-1'' YEAR(2) TO MONTH')|DATEDIFF_UDF(ASTIMESTAMP,'INTERVAL ''2-1'' YEAR(4) TO MONTH')|DATEDIFF_UDF(ASTIMESTAMP,'INTERVAL ''1'' MONTH')|DATEDIFF_UDF(ASTIMESTAMP,'INTERVAL ''2'' MONTH')|DATEDIFF_UDF(ASDATE,'INTERVAL ''1-1'' YEAR(2) TO MONTH')|DATEDIFF_UDF(ASDATE,'INTERVAL ''2-1'' YEAR(4) TO MONTH')|DATEDIFF_UDF(ASDATE,'INTERVAL ''1'' MONTH')|DATEDIFF_UDF(ASDATE,'INTERVAL ''2'' MONTH')|DATEDIFF_UDF(UNKNOWN,'INTERVAL ''1 01:00:00.222'' DAY TO SECOND(3)')|DATEDIFF_UDF(UNKNOWN,'INTERVAL ''1 01:10'' DAY TO MINUTE')|DATEDIFF_UDF(UNKNOWN,'INTERVAL ''1 1'' DAY TO HOUR')|
|-------------------------------------------------------------|-------------------------------------------------------------|------------------------------------------------|------------------------------------------------|--------------------------------------------------------|--------------------------------------------------------|-------------------------------------------|-------------------------------------------|--------------------------------------------------------------------|----------------------------------------------------------|----------------------------------------------------|
|2020-10-05 11:00:00.000                                      |2019-10-05 11:00:00.000                                      |2021-10-05 11:00:00.000                         |2021-09-05 11:00:00.000                         |2020-10-06                                              |2019-10-06                                              |2021-10-06                                 |2021-09-06                                 |2009-09-30 10:59:59.778                                             |2009-09-30 10:50:00.000                                   |2009-09-30 11:00:00.000                             |
```

### Known Issues

#### 1. INTERVAL - INTERVAL Operation is not supported

Snowflake does not support INTERVAL - INTERVAL operations.

### Related EWIs

1. [SSC-EWI-OR0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Types resolution issues, the arithmetic operation may not behave correctly between string and date.
2. [SSC-EWI-OR0095](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Operation Between Interval Type and Date Type not Supported.
3. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
4. [SSC-FDM-OR0042](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Date Type Transformed To Timestamp Has A Different Behavior.

---
title: SnowConvert AI - Oracle - Spatial Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/spatial-types.md
section: Migrations
---

# SnowConvert AI - Oracle - Spatial Types

## Description

> Oracle Spatial and Graph is designed to make spatial data management easier and more natural to users of location-enabled applications, geographic information system (GIS) applications, and geoimaging applications. ([Oracle SQL Language Reference Spatial Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-B4DF3B59-1600-4FA2-B7ED-AF7B734256BF))

```none
{ SDO_Geometry | SDO_Topo_Geometry |SDO_GeoRaster }
```

## SDO_GEOMETRY

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> The geometric description of a spatial object is stored in a single row, in a single column of object type SDO_GEOMETRY in a user-defined table. Any table that has a column of type SDO_GEOMETRY must have another column, or set of columns, that defines a unique primary key for that table. ([Oracle SQL Language Reference SDO_GEOMETRY Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-022A5008-1E15-4AA4-938E-7FD75C594087))

Definition of SDO_GEOMETRY object:

```none
CREATE TYPE SDO_GEOMETRY AS OBJECT
  (sgo_gtype        NUMBER,
   sdo_srid         NUMBER,
   sdo_point        SDO_POINT_TYPE,
   sdo_elem_info    SDO_ELEM_INFO_ARRAY,
   sdo_ordinates    SDO_ORDINATE_ARRAY);
/
```

The `SDO_GEOMETRY` object is **not supported** in Snowflake. A workaround for this data type is to use [Snowflake GEOGRAPHY](https://docs.snowflake.com/en/sql-reference/data-types-geospatial.html), however that transformation is currently not supported by SnowConvert.

### Sample Source Patterns

#### SDO_GEOMETRY in Create Table

##### Oracle

```sql
CREATE TABLE geometry_table(
    geometry_column SDO_GEOMETRY
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE geometry_table (
        geometry_column GEOMETRY
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Inserting data in SDO_GEOMETRY Table

##### Oracle

```sql
INSERT INTO geometry_table VALUES (
	SDO_GEOMETRY('POINT(-79 37)')
);

INSERT INTO geometry_table VALUES (
    SDO_GEOMETRY('LINESTRING(1 3, 1 5, 2 7)')
);

INSERT INTO geometry_table VALUES (
    MDSYS.SDO_GEOMETRY(
		2001,
		8307,
		MDSYS.SDO_POINT_TYPE (
			-86.13631,
			40.485424,
			NULL),
		NULL,
		NULL
	)
);

INSERT  INTO geometry_table VALUES (
SDO_GEOMETRY(
    2003,
    12,
    SDO_POINT_TYPE(12, 14, -5),
    SDO_ELEM_INFO_ARRAY(1,1003,3),
    SDO_ORDINATE_ARRAY(1,1, 5,7)
  )
);

INSERT INTO geometry_table VALUES (
NULL);
```

##### Snowflake

```sql
INSERT INTO geometry_table
VALUES (
	SDO_GEOMETRY('POINT(-79 37)') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_GEOMETRY' NODE ***/!!!
);

INSERT INTO geometry_table
VALUES (
    SDO_GEOMETRY('LINESTRING(1 3, 1 5, 2 7)') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_GEOMETRY' NODE ***/!!!
);

INSERT INTO geometry_table
VALUES (
    MDSYS.SDO_GEOMETRY(
		2001,
		8307,
		MDSYS.SDO_POINT_TYPE (
			-86.13631,
			40.485424,
			NULL) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'MDSYS.SDO_POINT_TYPE' NODE ***/!!!,
		NULL,
		NULL
	) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'MDSYS.SDO_GEOMETRY' NODE ***/!!!
);

INSERT  INTO geometry_table
VALUES (
SDO_GEOMETRY(
    2003,
    12,
    SDO_POINT_TYPE(12, 14, -5) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_POINT_TYPE' NODE ***/!!!,
    SDO_ELEM_INFO_ARRAY(1,1003,3) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_ELEM_INFO_ARRAY' NODE ***/!!!,
    SDO_ORDINATE_ARRAY(1,1, 5,7) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_ORDINATE_ARRAY' NODE ***/!!!
  ) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_GEOMETRY' NODE ***/!!!
);

INSERT INTO geometry_table
VALUES (
NULL);
```

#### Migration using the GEOGRAPHY data type

##### Oracle

```sql
CREATE TABLE geometry_table(
    geometry_column SDO_GEOMETRY
);

INSERT INTO geometry_table VALUES (
	SDO_GEOMETRY('POINT(-79 37)')
);

INSERT INTO geometry_table VALUES (
    SDO_GEOMETRY('LINESTRING(1 3, 1 5, 2 7)')
);

/*
--NOT SUPPORTED BY SNOWFLAKE GEOGRAPHY
INSERT INTO geometry_table VALUES (
    MDSYS.SDO_GEOMETRY(
		2001,
		8307,
		MDSYS.SDO_POINT_TYPE (
			-86.13631,
			40.485424,
			NULL),
		NULL,
		NULL
	)
);
INSERT  INTO geometry_table VALUES (
SDO_GEOMETRY(
    2003,
    12,
    SDO_POINT_TYPE(12, 14, -5),
    SDO_ELEM_INFO_ARRAY(1,1003,3),
    SDO_ORDINATE_ARRAY(1,1, 5,7)
  )
);
*/

SELECT * FROM geometry_table;
```

##### Result

| GEOMETRY_COLUMN |
| --- |
| [2001, null, [-79, 37, null], [NULL], [NULL]] |
| [2002, null, [null, null, null], [1,2,1], [1,3,1,5,2,7]] |

##### Snowflake

```sql
CREATE OR REPLACE TABLE geometry_table (
	    geometry_column GEOMETRY
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO geometry_table
	VALUES (
	SDO_GEOMETRY('POINT(-79 37)') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_GEOMETRY' NODE ***/!!!
);

	INSERT INTO geometry_table
	VALUES (
    SDO_GEOMETRY('LINESTRING(1 3, 1 5, 2 7)') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_GEOMETRY' NODE ***/!!!
);

	/*
--NOT SUPPORTED BY SNOWFLAKE GEOGRAPHY
INSERT INTO geometry_table VALUES (
    MDSYS.SDO_GEOMETRY(
		2001,
		8307,
		MDSYS.SDO_POINT_TYPE (
			-86.13631,
			40.485424,
			NULL),
		NULL,
		NULL
	)
);
INSERT  INTO geometry_table VALUES (
SDO_GEOMETRY(
    2003,
    12,
    SDO_POINT_TYPE(12, 14, -5),
    SDO_ELEM_INFO_ARRAY(1,1003,3),
    SDO_ORDINATE_ARRAY(1,1, 5,7)
  )
);
*/

SELECT * FROM
	    geometry_table;
```

##### Result

| GEOMETRY_COLUMN |
| --- |
| POINT(-79 37) |
| LINESTRING(1 3,1 5,2 7) |

### Related EWIs

1. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## SDO_GEORASTER

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> In the GeoRaster object-relational model, a raster grid or image object is stored in a single row, in a single column of object type `SDO_GEORASTER` in a user-defined table. ([Oracle SQL Language Reference SDO_GEORASTER Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-CFEFCFAC-4756-4B90-B88D-D89B861C1628)).

Definition of SDO_GEORASTER object:

```none
CREATE TYPE SDO_GEORASTER AS OBJECT
  (rasterType         NUMBER,
   spatialExtent      SDO_GEOMETRY,
   rasterDataTable    VARCHAR2(32),
   rasterID           NUMBER,
   metadata           XMLType);
/
```

> **Note:**
>
> SDO_GEORASTER is disabled by default, to enable its usage, follow the steps described in [this section](https://docs.oracle.com/database/121/SPATL/ensuring-that-georaster-works-properly-installation-or-upgrade.htm#GUID-20119C51-6B07-4535-954E-7C55850F51F3) of Oracle documentation.

The `SDO_GEORASTER` object is **not supported** in Snowflake.

### Sample Source Patterns

#### SDO_GEORASTER in Create Table

##### Oracle

```sql
CREATE TABLE georaster_table(
    georaster_column SDO_GEORASTER
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE georaster_table (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
        georaster_column SDO_GEORASTER
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

##### Inserting data in SDO_GEORASTER Table

##### Oracle

```sql
INSERT INTO georaster_table VALUES (null);
INSERT INTO georaster_table VALUES (sdo_geor.init('RDT_11', 1));
```

##### Snowflake

```sql
INSERT INTO georaster_table
VALUES (null);

INSERT INTO georaster_table
VALUES (
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'sdo_geor.init' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS init);
```

### Known Issues

**1. SDO_GEORASTER Data Type not transformed**

SDO_GEORASTER Data Type is not being transformed by SnowConvert.

### Related EWIs

1. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported.
2. [SSC-EWI-OR0076:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) Built In Package Not Supported.

## SDO_TOPO_GEOMETRY

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> This type describes a topology geometry, which is stored in a single row, in a single column of object type `SDO_TOPO_GEOMETRY` in a user-defined table. ([Oracle SQL Language Reference SDO_TOPO_GEOMETRY Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-66AF10E5-137D-444B-B9BC-89B2B340E278)).

Definition of SDO_TOPO_GEOMETRY object:

```none
CREATE TYPE SDO_TOPO_GEOMETRY AS OBJECT
  (tg_type        NUMBER,
   tg_id          NUMBER,
   tg_layer_id    NUMBER,
   topology_id    NUMBER);
/
```

The `SDO_TOPO_GEOMETRY` object is **not supported** in Snowflake.

### Sample Source Patterns

#### SDO_TOPO_GEOMETRY in Create Table

##### Oracle

```sql
CREATE TABLE topo_geometry_table(
    topo_geometry_column SDO_TOPO_GEOMETRY
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE topo_geometry_table (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
        topo_geometry_column SDO_TOPO_GEOMETRY
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Inserting data in SDO_TOPO_GEOMETRY Table

##### Oracle

```sql
INSERT INTO topo_geometry_table VALUES (SDO_TOPO_GEOMETRY(1,2,3,4));
INSERT INTO topo_geometry_table VALUES (NULL);
```

##### Snowflake

```sql
INSERT INTO topo_geometry_table
VALUES (SDO_TOPO_GEOMETRY(1,2,3,4) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SDO_TOPO_GEOMETRY' NODE ***/!!!);

INSERT INTO topo_geometry_table
VALUES (NULL);
```

### Known Issues

**1. SDO_TOPO_GEOMETRY Data Type not transformed**

SDO_TOPO_GEOMETRY Data Type is not being transformed by SnowConvert.

### Related EWIs

1. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported.
2. [SSC-EWI-0073:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Pending functional equivalence review.

---
title: SnowConvert AI - Oracle - SQL Statements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-translation-reference/README.md
section: Migrations
---

# SnowConvert AI - Oracle - SQL Statements

This document details all the similarities, differences in SQL syntax and how SnowConvert AI would translate those SQL syntaxes into a functional Snowflake SQL Syntax.

## Alter Table

This section shows you the translations related to ALTER TABLE.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### 1. Description

Use the ALTER TABLE statement to alter the definition of a nonpartitioned table, a partitioned table, a table partition, or a table subpartition. For object tables or relational tables with object columns, use ALTER TABLE to convert the table to the latest definition of its referenced type after the type has been altered ([Oracle documentation](https://docs.oracle.com/cd/E11882_01/server.112/e41084/statements_3001.htm#SQLRF01001)).

**Oracle syntax**

```sql
ALTER TABLE [ schema. ] table
  [ alter_table_properties
  | column_clauses
  | constraint_clauses
  | alter_table_partitioning
  | alter_external_table
  | move_table_clause
  ]
  [ enable_disable_clause
  | { ENABLE | DISABLE } { TABLE LOCK | ALL TRIGGERS }
  ] ...
  ;
```

> **Note:**
>
> To review Snowflake syntax, review the following [documentation](https://docs.snowflake.com/en/sql-reference/sql/alter-table).

#### 2. Sample Source Patterns

#### 2.1. Alter table with clauses

> **Warning:**
>
> **memoptimize_read_clause** and **memoptimize_read_clause** are not applicable in Snowflake so are being removed.

##### Oracle

```sql
ALTER TABLE SOMESCHEMA.SOMENAME
MEMOPTIMIZE FOR READ
MEMOPTIMIZE FOR WRITE
 ADD (SOMECOLUMN NUMBER , SOMEOTHERCOLUMN VARCHAR(23))
 (PARTITION PT NESTED TABLE COLUMN_VALUE STORE AS SNAME
 ( SUBPARTITION SPART NESTED TABLE COLUMN_VALUE STORE AS SNAME))
ENABLE TABLE LOCK;
```

##### Snowflake

```sql
ALTER TABLE SOMESCHEMA.SOMENAME
ADD (SOMECOLUMN NUMBER(38, 18), SOMEOTHERCOLUMN VARCHAR(23));
```

> **Note:**
>
> Only some **column_clauses and constraint_clauses** are applicable in Snowflake. In Oracle alter table allows modifying properties from partitions created but in Snowflake, these actions are not required

#### 2.2. Alter table with not supported cases

##### Oracle

```sql
ALTER TABLE SOMENAME MODIFY COLUMN SCOLUMN NOT SUBSTITUTABLE AT ALL LEVELS FORCE;

ALTER TABLE SOMENAME MODIFY(SCOLUMN VISIBLE,SCOLUMN INVISIBLE);

ALTER TABLE SOMENAME MODIFY VARRAY VARRAYITEM (
STORAGE(PCTINCREASE 10));
```

##### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
ALTER TABLE SOMENAME
MODIFY COLUMN SCOLUMN NOT SUBSTITUTABLE AT ALL LEVELS FORCE;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
MODIFY(SCOLUMN VISIBLE,SCOLUMN INVISIBLE);

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!

ALTER TABLE SOMENAME
MODIFY VARRAY VARRAYITEM (
STORAGE(PCTINCREASE 10));
```

#### 2.3. ADD CONSTRAINT action

The ADD CONSTRAINT action has an equivalent in Snowflake, but it only one constraint can be added per ALTER TABLE statement, so it will be commented when the statement contains two or more constraints.

> **Warning:**
>
> **enable_disable_clause** is removed since it is not relevant in Snowflake.

##### Oracle

```sql
-- MULTIPLE CONSTRAINT ADDITION SCENARIO
ALTER TABLE TABLE1 ADD (
CONSTRAINT TABLE1_PK
PRIMARY KEY
(ID)
ENABLE VALIDATE,
CONSTRAINT TABLE1_FK foreign key(ID2)
references TABLE2 (ID) ON DELETE CASCADE);

-- ONLY ONE CONSTRAINT ADDITION SCENARIO
ALTER TABLE TABLE1 ADD (
CONSTRAINT TABLE1_FK foreign key(ID2)
references TABLE2 (ID) ON DELETE CASCADE);
```

##### Snowflake

```sql
-- MULTIPLE CONSTRAINT ADDITION SCENARIO
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0067 - MULTIPLE CONSTRAINT DEFINITION IN A SINGLE STATEMENT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
ALTER TABLE TABLE1
ADD (
CONSTRAINT TABLE1_PK
PRIMARY KEY
(ID) ,
CONSTRAINT TABLE1_FK foreign key(ID2)
references TABLE2 (ID) ON DELETE CASCADE);

-- ONLY ONE CONSTRAINT ADDITION SCENARIO
ALTER TABLE TABLE1
ADD
CONSTRAINT TABLE1_FK foreign key(ID2)
references TABLE2 (ID) ON DELETE CASCADE;
```

### Known Issues

1. Some properties on the tables may be adapted to or not applicable.

### Related EWIs

1. [SSC-EWI-0109](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Alter Table syntax is not applicable in Snowflake.
2. [SSC-EWI-OR0067](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Multiple constraint definition in a single statement is not supported in Snowflake.

## Create Database Link

> **Warning:**
>
> Currently, ***Create Database Link*** statement is not being converted but it is being parsed. Also, if your source code has`create database link` statements, these are going to be accounted for in the ***Assessment Report.***

### **Example of a Source Code**

```sql
CREATE PUBLIC DATABASE LINK db_link_name
CONNECT TO CURRENT_USER
USING 'connect string'

CREATE DATABASE LINK db_link_name2
CONNECT TO user_name IDENTIFIED BY user_password
USING 'connect string'

CREATE PUBLIC DATABASE LINK db_link_name3
```

### Snowflake output

```sql
----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **
--CREATE PUBLIC DATABASE LINK db_link_name
--CONNECT TO CURRENT_USER
--USING 'connect string'

----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **
--CREATE DATABASE LINK db_link_name2
--CONNECT TO user_name IDENTIFIED BY user_password
--USING 'connect string'

----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **

--CREATE PUBLIC DATABASE LINK db_link_name3
```

### Database Link References

If in your input code you use objects from the database link the output code will keep the name of these objects but the name of the database link that they are using will be removed.

#### Example of a Source Code

```sql
-- CREATE DATABASE LINK STATEMENTS
CREATE DATABASE LINK mylink1
    CONNECT TO user1 IDENTIFIED BY password1
    USING 'my_connection_string1';

CREATE DATABASE LINK mylink2
    CONNECT TO user2 IDENTIFIED BY password2
    USING 'my_connection_string2';

-- SQL statements that use the database links
SELECT * FROM products@mylink1;

INSERT INTO employees@mylink2
    (employee_id, last_name, email, hire_date, job_id)
    VALUES (999, 'Claus', 'sclaus@oracle.com', SYSDATE, 'SH_CLERK');

UPDATE jobs@mylink2 SET min_salary = 3000
    WHERE job_id = 'SH_CLERK';

DELETE FROM employees@mylink2
    WHERE employee_id = 999;

-- SQL statement where it uses an object from
-- a database link that is not created
SELECT * FROM products@mylink;
```

#### Snowflake output

```sql
---- CREATE DATABASE LINK STATEMENTS
----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **
--CREATE DATABASE LINK mylink1
--    CONNECT TO user1 IDENTIFIED BY password1
--    USING 'my_connection_string1'

----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **

--CREATE DATABASE LINK mylink2
--    CONNECT TO user2 IDENTIFIED BY password2
--    USING 'my_connection_string2'

-- SQL statements that use the database links
SELECT * FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink1 | USER: user1/password1 | CONNECTION: 'my_connection_string1' ] ***/!!!
    products;

INSERT INTO
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink2 | USER: user2/password2 | CONNECTION: 'my_connection_string2' ] ***/!!!
employees
    (employee_id, last_name, email, hire_date, job_id)
    VALUES (999, 'Claus', 'sclaus@oracle.com', CURRENT_TIMESTAMP(), 'SH_CLERK');

UPDATE
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink2 | USER: user2/password2 | CONNECTION: 'my_connection_string2' ] ***/!!!
jobs
    SET min_salary = 3000
    WHERE job_id = 'SH_CLERK';

DELETE FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink2 | USER: user2/password2 | CONNECTION: 'my_connection_string2' ] ***/!!!
    employees
    WHERE employee_id = 999;

-- SQL statement where it uses an object from
-- a database link that is not created
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "mylink" **
SELECT * FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink | USER: / | CONNECTION:  ] ***/!!!
    products;
```

### Related EWIs

1. [SSC-EWI-OR0123](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Db Link connections not supported.
2. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.

## Drop Table

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

A Drop Table statement is used to remove a table. This statement varies a little between [Oracle](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/DROP-TABLE.html#GUID-39D89EDC-155D-4A24-837E-D45DDA757B45) and [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/drop-table.html). Please double-check each documentation for more information regarding the differences.

In Oracle, the Drop Table syntax is:

```sql
DROP TABLE <table_name> [ CASCADE CONSTRAINTS ] [ PURGE ]
```

In Snowflake, the Drop table syntax is:

```sql
DROP TABLE [ IF EXISTS ] <table_name> [ CASCADE | RESTRICT ]
```

The main difference is that Snowflake does not have an equal for the PURGE clause, as the table will not be permanently removed from the system. Though, the CASCADE CONSTRAINTS and the CASCADE clauses *are* the same. Both drop the table, even if foreign keys exist that reference this table.

#### Examples

Now, let’s see some code examples, and what it would look like after it has been transformed. Each example uses a different variation of the Drop Table statement.

##### Example 1:

This example uses the **Drop Table** statement as simple as possible.

**Input Code:**

```sql
DROP TABLE TEST_TABLE1;
```

**Transformed Code:**

```sql
DROP TABLE TEST_TABLE1;
```

##### Example 2:

This example uses the **Drop Table** statement with the PURGE clause. Remember there is no equivalent in Snowflake for the PURGE clause inside a Drop Table statement.

**Input Code:**

```sql
DROP TABLE TEST_TABLE1 PURGE;
```

**Transformed Code:**

```sql
DROP TABLE TEST_TABLE1;
```

##### Example 3:

This example uses the **Drop Table** statement with the CASCADE CONSTRAINTS clause.

**Input Code:**

```sql
DROP TABLE TEST_TABLE1 CASCADE CONSTRAINTS;
```

**Transformed Code:**

```sql
DROP TABLE TEST_TABLE1 CASCADE;
```

In the transformed code, the CONSTRAINTS word is removed from the CASCADE CONSTRAINTS clause.

##### Example 4:

This example uses the **Drop Table** statement with the CASCADE CONSTRAINTS and the PURGE clauses.

**Input Code:**

```sql
DROP TABLE TEST_TABLE1 CASCADE CONSTRAINTS PURGE;
```

**Transformed Code:**

```sql
DROP TABLE TEST_TABLE1 CASCADE;
```

As seen, the code changes. In the new Snowflake code, the PURGE clause is removed and the CONSTRAINTS word is also removed from the CASCADE clause.

#### Functional Equivalence

Run the following code to check for functional equivalence, bear in mind the only part that is not equivalent is the PURGE clause, which in Oracle removes completely the table from the system and there is no equal for Snowflake. In both cases, the table is dropped even if it’s referenced in another table.

**Oracle:**

```sql
CREATE TABLE TEST_TABLE2 (
    col2 INTEGER,
    CONSTRAINT constraint_name PRIMARY KEY (col2)
);

CREATE TABLE OTHER_TABLE (
    other_col INTEGER REFERENCES TEST_TABLE2 (col2)
);

DROP TABLE TEST_TABLE2 CASCADE CONSTRAINTS PURGE;
```

**Snowflake:**

```sql
CREATE OR REPLACE TABLE TEST_TABLE2 (
       col2 INTEGER,
       CONSTRAINT constraint_name PRIMARY KEY (col2)
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
   ;

   CREATE OR REPLACE TABLE OTHER_TABLE (
          other_col INTEGER REFERENCES TEST_TABLE2 (col2)
      )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
   ;

   DROP TABLE TEST_TABLE2 CASCADE;
```

### Related EWIs

No related EWIs.

## Create Index

> **Warning:**
>
> Currently, ***Create Index*** statement is not being converted but it is being parsed. Also, if your source code has create `index` statements, these are going to be accounted for in the ***Assessment Report.***

### Example of a *create index* parsed code:

```sql
CREATE UNIQUE INDEX COL1_INDEX ILM (ADD POLICY OPTIMIZE AFTER 10 DAYS OF NO ACCESS) ON CLUSTER CLUSTER1
ONLINE USABLE DEFERRED INVALIDATION;

CREATE BITMAP INDEX COL1_INDEX ILM (ADD POLICY OPTIMIZE ( ON FUNC1 )) ON TABLE1 AS TAB1 (COL1 ASC) GLOBAL PARTITION BY RANGE (COL1, COL2) ( PARTITION VALUES LESS THAN (MAXVALUE) ) UNUSABLE IMMEDIATE INVALIDATION;

CREATE MULTIVALUE INDEX COL1_INDEX ILM (ADD POLICY SEGMENT TIER TO LOW_COST_TBS) ON TABLE1( TAB1 COL1 DESC, TAB1 COL2 ASC) FROM TABLE1 AS TAB1 WHERE COL1 > 0 LOCAL STORE IN (STORAGE1)
VISIBLE USABLE DEFERRED INVALIDATION;

CREATE INDEX COL1_INDEX ILM (DELETE POLICY POLICY1) ON CLUSTER CLUSTER1
PCTFREE 10
LOGGING
ONLINE
TABLESPACE DEFAULT
NOCOMPRESS
SORT
REVERSE
VISIBLE
INDEXING PARTIAL
NOPARALLEL;

CREATE INDEX COL1_INDEX ILM (DELETE_ALL) ON TABLE1 AS TAB1 (COL1 ASC) LOCAL (
PARTITION PARTITION1 TABLESPACE TABLESPACE1 NOCOMPRESS USABLE) DEFERRED INVALIDATION;

CREATE INDEX COL1_INDEX ON TABLE1 (COL1 ASC) GLOBAL
PARTITION BY HASH (COL1, COL2) (PARTITION PARTITION1 LOB(LOB1) STORE AS BASICFILE LOB_NAME (TABLESPACE TABLESPACE1)) USABLE IMMEDIATE INVALIDATION;

CREATE INDEX COL1_INDEX ON TABLE1 (COL1 DESC, COL2 ASC) INDEXTYPE IS INDEXTYPE1 LOCAL ( PARTITION PARTITION1 PARAMETERS('PARAMS')) NOPARALLEL PARAMETERS('PARAMS') USABLE DEFERRED INVALIDATION;

CREATE INDEX COL1_INDEX ON TABLE1 (COL1 ASC) INDEXTYPE IS XDB.XMLINDEX LOCAL ( PARTITION PARTITION1) PARALLEL 6 UNUSABLE IMMEDIATE INVALIDATION;
```

> **Note:**
>
> Due to architectural reasons, Snowflake does not support indexes so, SnowConvert AI will remove all the code related to the creation of indexes. Snowflake automatically creates micro-partitions for every table that help speed up the performance of DML operations, the user does not have to worry about creating or managing these micro-partitions.
>
> Usually, this is enough to have an exceptionally good query performance. However, there are ways to improve it by creating data clustering keys. [Snowflake’s official page](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.html) provides more information about micro-partitions and data clustering.

## Create Sequence

Let’s first see a code example, and what it would look like after it has been transformed.

### Oracle:

```sql
CREATE SEQUENCE SequenceSample
START WITH 1000
INCREMENT BY 1
NOCACHE
NOCYCLE;
```

### Snowflake:

```sql
CREATE OR REPLACE SEQUENCE SequenceSample
START WITH 1000
INCREMENT BY 1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

The first change that it is done is to apply the schema or datawarehouse to the name of the sequence. The second transformation consists in removing some elements and add them as comments, since oracle has some elements in the create sequence that are not supported in Snowflake.

In Oracle, after the name of the Sequence, the elements that are NOT commented are the following

* START WITH 1000
* INCREMENT BY 1

If the element is not one of those, it will be commented and added as a warning just before the create sequence, like in the example.

The following elements are the ones that are removed

* MAXVALUE
* NOMAXVALUE
* MINVALUE
* NOMINVALUE
* CYCLE
* NOCYCLE
* CACHE
* NOCACHE
* ORDER
* NOORDER
* KEEP
* NOKEEP
* SESSION
* GLOBAL
* SCALE
* EXTEND
* SCALE
* NOEXTEND
* NOSCALE
* SHARD
* EXTEND
* SHARD
* NOEXTEND
* NOSHARD

### SEQUENCE EXPRESSIONS

* NEXTVAL: Snowflake grammar is the same as the Oracle one.
* CURRVAL: Snowflake does not have an equivalent so it is transformed to a stub function. Check this [link](https://docs.snowflake.com/en/user-guide/querying-sequences.html#currval-not-supported) to understand Snowflake’s approach.

#### Oracle:

```sql
select seq1.nextval from dual;
select seq1.currval from dual;
```

#### Snowflake:

```sql
select seq1.nextval from dual;

select
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0069 - THE SEQUENCE CURRVAL PROPERTY IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! seq1.currval from dual;
```

### Sequence START WITH

`START WITH` statement value may exceed the maximum value allowed by Snowflake. What Snowflake said about the start value is: *Specifies the first value returned by the sequence. Supported values are any value that can be represented by a 64-bit two’s compliment integer (from `-2^63` to `2^63-1`)*. So according to the previously mentioned, the max value allowed is **9223372036854775807** for positive numbers and **9223372036854775808** for negative numbers.

#### Example Code

##### Oracle:

```sql
CREATE SEQUENCE SEQUENCE1
START WITH 9223372036854775808;

CREATE SEQUENCE SEQUENCE2
START WITH -9223372036854775809;
```

##### Snowflake:

```sql
CREATE OR REPLACE SEQUENCE SEQUENCE1
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0068 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. ***/!!!
START WITH 9223372036854775808
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';

CREATE OR REPLACE SEQUENCE SEQUENCE2
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0068 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. ***/!!!
START WITH -9223372036854775809
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

### Related EWIs

1. [SSC-EWI-OR0069](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The sequence CURRVAL property is not supported in Snowflake.
2. [SSC-EWI-OR0068](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): The sequence start value exceeds the max value allowed by Snowflake.

## Alter Session

### Alter session

Alter session has an equivalent in Snowflake and some the variables are mapped to Snowflake variables. If a permutation of Alter Session is not supported the node will be commented and a warning will be added.

#### Oracle:

```sql
alter session set nls_date_format = 'DD-MM-YYYY';
```

#### Snowflake:

```sql
ALTER SESSION SET DATE_INPUT_FORMAT = 'DD-MM-YYYY' DATE_OUTPUT_FORMAT = 'DD-MM-YYYY';
```

### Session Parameters Reference

> **Note:**
>
> The session parameters that doesn’t appear in the table are not currently being transformed.

| Session Parameter | Snowflake transformation |
| --- | --- |
| NLS_DATE_FORMAT | DATE_INPUT_FORMAT and DATE_OUTPUT_FORMAT |
| NLS_NUMERIC_CHARACTERS | NOT SUPPORTED |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Create Synonym

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Create Synonym

Synonyms are not supported in Snowflake. The references to the Synonyms will be changed for the original Object.

#### Oracle:

```sql
CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA;
```

#### Snowflake:

```sql
----** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **
--CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA
                                                       ;
```

#### **Example 1**: Synonym that refers to a table.

Oracle source code:

```sql
CREATE TABLE TABLITA
(
    COLUMN1 NUMBER
);

CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA;

SELECT * FROM B.TABLITA_SYNONYM WHERE B.TABLITA_SYNONYM.COLUMN1 = 20;
```

Snowflake migrated code: you’ll notice that the `SELECT` originally refers to a synonym, but now it refers to the table that points the synonym.

```sql
CREATE OR REPLACE TABLE TABLITA
    (
        COLUMN1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;

--    --** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **

--    CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA
                                                           ;

SELECT * FROM
    TABLITA
    WHERE
    TABLITA.COLUMN1 = 20;
```

#### **Example 2**: Synonym that refers to another synonym.

Oracle source code:

```sql
CREATE TABLE TABLITA
(
    COLUMN1 NUMBER
);

CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA;
CREATE OR REPLACE SYNONYM C.TABLITA_SYNONYM2 FOR B.TABLITA_SYNONYM;

SELECT * FROM C.TABLITA_SYNONYM2 WHERE C.TABLITA_SYNONYM2.COLUMN1 = 20;

UPDATE C.TABLITA_SYNONYM2 SET COLUMN1 = 10;

INSERT INTO C.TABLITA_SYNONYM2 VALUES (1);
```

Snowflake migrated code: you’ll notice that originally the `SELECT` , `UPDATE`, `INSERT` refers to a synonym, and now it refers to the atomic object, which is a table.

```sql
CREATE OR REPLACE TABLE TABLITA
    (
        COLUMN1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;

--    --** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **

--    CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA
                                                           ;

--    --** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **
--CREATE OR REPLACE SYNONYM C.TABLITA_SYNONYM2 FOR B.TABLITA_SYNONYM
                                                                  ;

SELECT * FROM
    TABLITA
    WHERE
    TABLITA.COLUMN1 = 20;

    UPDATE TABLITA
    SET COLUMN1 = 10;

    INSERT INTO TABLITA
    VALUES (1);
```

#### **Example 3**: Synonym that refers to a view

Oracle Source Code

```sql
CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA;

CREATE OR REPLACE SYNONYM C.TABLITA_SYNONYM2 FOR B.TABLITA_SYNONYM;

CREATE VIEW VIEW_ORGINAL AS SELECT * FROM C.TABLITA_SYNONYM2;

CREATE OR REPLACE SYNONYM VIEW_SYNONYM FOR VIEW_ORGINAL;

SELECT * FROM VIEW_SYNONYM;
```

Snowflake migrated code: you’ll notice that the `SELECT` originally refers to a synonym, and now it refers to the atomic objects, which is a view.

```sql
----** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **
--CREATE OR REPLACE SYNONYM B.TABLITA_SYNONYM FOR TABLITA
                                                       ;

----** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **
--CREATE OR REPLACE SYNONYM C.TABLITA_SYNONYM2 FOR B.TABLITA_SYNONYM
                                                                  ;

CREATE OR REPLACE VIEW VIEW_ORGINAL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT * FROM
TABLITA;

----** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **

--CREATE OR REPLACE SYNONYM VIEW_SYNONYM FOR VIEW_ORGINAL
                                                       ;

SELECT * FROM
VIEW_ORGINAL;
```

### Related EWIs

1. [SSC-FDM-0001](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Views selecting all columns from a single table are not required in Snowflake.
2. [SSC-FDM-0006](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Number type column may not behave similarly in Snowflake.
3. [SSC-FDM-OR0005](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md): Synonyms are not supported in Snowflake but references to this synonym were changed by the original object name.

---
title: SnowConvert AI - Oracle - SQL*Plus
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/sql-plus.md
section: Migrations
---

# SnowConvert AI - Oracle - SQL\*Plus

This is a translation reference to convert SQL Plus statements to SnowSQL (CLI Client)

## Accept

> **Warning:**
>
> Transformation for this command is pending

### Description

> Reads a line of input and stores it in a given substitution variable.. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/ACCEPT.html#GUID-5D07E526-202B-429B-9E0C-005D1E37BBAB))

#### Oracle Syntax

```none
ACC[EPT] variable [NUM[BER] | CHAR | DATE | BINARY_FLOAT | BINARY_DOUBLE] [FOR[MAT] format] [DEF[AULT] default] [PROMPT text|NOPR[OMPT]] [HIDE]
```

Snowflake does not have a direct equivalent to this command. To emulate this functionality, the SnowCLI`!system` command will be used by taking advantage of the system resources for the input operations.

#### 1. Accept command

##### Oracle

##### Command

```sql
ACCEPT variable_name CHAR PROMPT 'Enter the variable value >'
```

##### SnowSQL (CLI Client)

##### Command

```sql
!print Enter the value
!system read aux && echo '!define variable_name='"$aux" > sc_aux_file.sql
!load sc_aux_file.sql
!system rm sc_aux_file.sql
```

> **Warning:**
>
> Note that this approach only applies to macOS and Linux. If you want to run these queries in Windows you may need a terminal that supports a Linux bash script language.

### Known Issues

No Known Issues.

### Related EWIs

No related EWIs.

## Append

> **Warning:**
>
> Transformation for this command is pending

### Description

> Adds specified text to the end of the current line in the SQL buffer. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/APPEND.html#GUID-43CA6E91-0BC9-4298-8823-BDB2512FC97F))

#### Oracle Syntax

```none
A[PPEND] text
```

Snowflake does not have a direct equivalent to this command. The Snowflake [`!edit`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#edit) command can be used to edit the last query using a predefined text editor. Whenever this approach does not cover all the `APPEND` functionality but it is an alternative.

#### 1. Append command

##### Oracle

##### Command

```sql
APPEND SOME TEXT
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'APPEND STATEMENT' NODE ***/!!!
APPEND SOME TEXT;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Archive Log

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `ARCHIVE LOG` command displays information about redoing log files. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/WHENEVER-OSERROR.html#GUID-A52F926F-D6EC-434E-9C7E-CFDB76422E94))

#### Oracle Syntax

```none
ARCHIVE LOG LIST
```

Snowflake does not have a direct equivalent to this command. The Snowflake [`!options`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#options-opts)command can be used to display the location path of some log files, however, it does not fully comply with the behavior expected by the `ARCHIVE LOG` command. At transformation time, an EWI will be added.

#### 1. Archive Log command

##### Oracle

##### Command

```sql
ARCHIVE LOG LIST
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ARCHIVE LOG STATEMENT' NODE ***/!!!
ARCHIVE LOG LIST;
```

### Known Issues

No Known Issues.

### Related EWIs

* [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Attribute

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `ATTRIBUTE` command specifies display characteristics for a given attribute of an Object Type column. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/ATTRIBUTE.html#GUID-E37F3F55-23A9-42DD-BAA2-719BC5C5DD32))

#### Oracle Syntax

```none
ATTR[IBUTE] [type_name.attribute_name [option ...]]
```

Snowflake does not have a direct equivalent to this command.

#### 1. Attribute command

##### Oracle

##### Command

```sql
ATTRIBUTE Address.street_address FORMAT A10
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ATTRIBUTE STATEMENT' NODE ***/!!!
ATTRIBUTE Address.street_address FORMAT A10;
```

> **Warning:**
>
> The code for the EWI is not defined yet.

### Known Issues

**1. SnowSQL can set the format of a column**

Currently, SnowSQL does not support custom types nor does it have a command to format columns. However, you can use the following workaround to format columns in your query result:

```sql
SELECT SUBSTR(street_address, 1, 4) FROM person

SELECT TO_VARCHAR(1000.89, '$9,999.99')

SELECT to_varchar('03-Feb-2023'::DATE, 'yyyy.mm.dd');
```

This alternative solution must consider an additional strategy to disable when in Oracle the `ATTRIBUTE` command receives the OFF option.

### Related EWIs

No related EWIs.

## Break

> **Warning:**
>
> Transformation for this command is pending

### Description

> Specifies where changes occur in a report and the formatting action to perform. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/BREAK.html))

#### Oracle Syntax

```none
BRE[AK] [ON report_element [action [action]]] ...

report_element := {column|expr|ROW|REPORT}

action := [SKI[P] n|[SKI[P]] PAGE] [NODUP[LICATES]|DUP[LICATES]]
```

Snowflake does not support the use of this command and does not have any that might resemble its functionality. At the time of transformation, an EWI will be added.

#### 1. BREAK command

##### Oracle

##### Command

```sql
BREAK ON customer_age SKIP 5 DUPLICATES;
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BREAK STATEMENT' NODE ***/!!!
BREAK ON customer_age SKIP 5 DUPLICATES;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Btitle

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `BTITLE` command places and formats a specified title at the bottom of each report page, or lists the current BTITLE definition. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/BTITLE.html#GUID-5046ABAA-1E2B-4A91-85BB-51EC2B6BD104))

#### Oracle Syntax

```none
BTI[TLE] [printspec [text | variable] ...] | [ON | OFF]
```

Snowflake does not have a direct equivalent to this command.

#### 1. Btitle command

##### Oracle

##### Command

```sql
BTITLE BOLD 'This is the banner title'
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTITLE STATEMENT' NODE ***/!!!
BTITLE BOLD 'This is the banner title';
```

### Known Issues

**1. SnowSQL does not support the display of custom headers and footers in query**

Currently, SnowSQL does not support the display of custom headers and footers in query output. However, you can use the following workaround to display header and footer information in your query output:

```sql
SELECT column1,
       column2
FROM my_table;

SELECT 'This is the banner title' AS BTITLE;

--Another alternative
!print 'This is the banner title'

--To emulate BTITLE COL 5 'This is the banner title'
SELECT CONCAT(SPACE(5), 'This is the banner title');
```

This alternative solution must consider an additional strategy to disable when in Oracle the `BTITLE` command receives the OFF option.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Change

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `CHANGE` command Changes the first occurrence of the specified text on the current line in the buffer. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/CHANGE.html#GUID-9002CADF-74E2-427D-A404-F8019C7A2791))

#### Oracle Syntax

```none
C[HANGE] sepchar old [sepchar [new [sepchar]]]
```

Snowflake does not have a direct equivalent to this command. The Snowflake [`!edit`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#edit) command can be used to edit the last query using a predefined text editor. Whenever this approach does not cover all the `CHANGE` functionality but it is an alternative.

#### 1. Change command

##### Oracle

##### Command

```sql
CHANGE /old/new/
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'CHANGE STATEMENT' NODE ***/!!!
CHANGE /old/new/;
```

### Known Issues

**1. Unsupported scenarios**

The CHANGE command can be presented in various ways, of which 2 of them are not currently supported by the translator, these are presented below:

```sql
3  WHERE col_id = 1
```

Entering a line number followed by a string will replace the line regardless of the text that follows the line number. This scenario is not supported as this does not follow the command grammar.

```sql
CHANGE/OLD/NEW/
```

Enter the text to replace followed by the command without using spaces. This scenario is not supported since it does not follow the logic of tokenization by spaces.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Column

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `COLUMN` command specifies display attributes for a given column. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/COLUMN.html#GUID-643B665F-B134-4A0B-88F7-10400D6D199E))

#### Oracle Syntax

```none
COL[UMN] [{column | expr} [option ...]]
```

Snowflake does not support the use of this command and does not have any that might resemble its functionality. At the time of transformation, an EWI will be added.

#### 1. Column command

The `COLUMN` command with no clauses to list all current column display attributes.

##### Oracle

##### Command

```sql
COLUMN column_id ALIAS col_id NOPRINT
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'COLUMN STATEMENT' NODE ***/!!!
COLUMN column_id ALIAS col_id NOPRINT;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Define

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `DEFINE` command specifies a user or predefined variable and assigns a CHAR value to it, or lists the value and variable type of a single variable or all variables. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/DEFINE.html#GUID-72D4998C-EC2C-4FA6-9F7F-A305C407D666))

#### Oracle Syntax

```none
DEF[INE] [variable] | [variable = text]
```

##### SnowSQL (CLI Client) !define

```none
!define [variable] | [variable=text]
```

> **Note:**
>
> Snowflake recommends not adding whitespace in the variable value assignment statement.

#### 1. Define with simple variable assignment

> **Hint:**
>
> This case is functionally equivalent.

The `DEFINE` command is replaced by the [`!define`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#define) command.

##### Oracle

##### Command

```sql
DEFINE column_id = test

DEFINE column_id = &column_reference
```

##### SnowSQL (CLI Client)

##### Command

```sql
!define column_id = test

!define column_id = &column_reference
```

For referring to a previously defined variable, & is preceded by the name of the variable, if the variable does not exist, Oracle allows its execution time assignment, however, Snowflake would throw an error indicating the non-existence of said variable

#### 2. Define without variable assignments

> **Warning:**
>
> This case is not functionally equivalent.

##### Oracle

##### Command

```sql
DEFINE column_id
```

##### SnowSQL (CLI Client)

##### Command

```sql
!define column_id
```

The DEFINE command used without the assignment statement is used in Oracle to show the definition of the variable, on the other hand in Snowflake this way of using the DEFINE command would reset the assignment of the variable, so a way to simulate the behavior presented in Oracle it is by using the SELECT command.

This solution would be something like this:

##### Command

```sql
select '&column_id';
```

### Known Issues

**1. Enabling variable substitution**

To enable SnowSQL CLI to substitute values for the variables, you must set the variable_substitution configuration option to true. This process can be done at installation, when starting a database instance, or by running the following command:

#### Command

```sql
!set variable_substitution=true
```

**2. Predefined variables**

There are nine predefined variables during SQL\*Plus installation. These variables can be used later by the user. The SnowSQL CLI client only has two predefined variables `__ROWCOUNT` and `__SFQID`.

## Host

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `HOST` command executes an operating system command without leaving SQL\*Plus. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/HOST.html#GUID-E6391C3D-E87E-4BCA-B903-A4402D7E399B))

#### Oracle Syntax

```none
HO[ST] [command]
```

##### SnowSQL (CLI Client) !system

```none
!system <command>
```

#### 1. Set with simple variable assignment

> **Hint:**
>
> This case is functionally equivalent.

The `HOST` command is replaced by the [`!system`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#system) command.

##### Oracle

##### Command

```sql
HOST dir *.sql
```

##### SnowSQL (CLI Client)

##### Command

```sql
!system dir *.sql
```

### Known Issues

No Known Issues.

### Related EWIs

No related EWIs.

## Prompt

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `PROMPT` command sends the specified message or a blank line to the user’s screen. If you omit a text, `PROMPT` displays a blank line on the user’s screen. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/PROMPT.html#GUID-2B2DE976-FBA5-4565-8B21-058289A16234))

#### Oracle Syntax

```none
PRO[MPT] [text]
```

##### SnowSQL (CLI Client) !print

```none
!print [text]
```

#### 1. Simple print

The `PROMPT` command is replaced by the [`!print`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#print) command.

> **Hint:**
>
> This case is functionally equivalent.

##### Oracle

##### Command

```sql
PROMPT

PROMPT text

PROMPT db_link_name = "&1"
```

##### SnowSQL (CLI Client)

##### Command

```sql
!print

!print text

!print db_link_name = "&1"
```

### Known Issues

No Known Issues

### Related EWIs

No related EWIs.

## Remark

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `REMARK` command begins a comment in a script. SQL\*Plus does not interpret the comment as a command.. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/REMARK.html#GUID-F4BF8426-AFE4-49C9-B073-57CB91B440F8))

#### Oracle Syntax

```none
REM[ARK] comment
```

Snowflake does not have a direct equivalent for this command. However, some of its functionalities can be emulated.

#### 1. Remark after the first line

> **Hint:**
>
> This case is functionally equivalent.

When the `REMARK` command is not at the beginning of a script you can use the standard SQL comment markers and double hyphens.

##### Oracle

##### Command

```sql
SELECT 'hello world' FROM dual;
REMARK and now exit the session
EXIT;
```

##### SnowSQL (CLI Client)

##### Command

```sql
select 'hello world';
-- and now exit the session
!exit
```

#### 2. Remark on the first line

> **Warning:**
>
> This case is not functionally equivalent.

When the `REMARK` command is at the beginning of a script, scenarios could appear such as:

Case 1: The next line is a query, in which case the conversion to Snowflake of the `REMARK` command succeeds.

Case 2: The next line is another SQL\*Plus command, in which case the conversion cannot be performed since Snowflake is not capable of executing either of the two statements (This also applies to the scenario where there is only one statement in the script statement that corresponds to the `REMARK` command).

Below are some examples, where the first two could not be translated correctly.

##### Oracle

##### Command

```sql
REMARK single line

REMARK first line
HOST dir *.sql

REMARK first line
SELECT 'hello world' FROM dual;
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'REMARK STATEMENT' NODE ***/!!!
REMARK single line;

!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'REMARK STATEMENT' NODE ***/!!!
REMARK first line;

!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'HOST STATEMENT' NODE ***/!!!
HOST dir *.sql;

!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'REMARK STATEMENT' NODE ***/!!!
REMARK first line;
SELECT 'hello world' FROM dual;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Set

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `SET` command sets a system variable to alter the SQL\*Plus environment settings for your current session. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/SET.html#GUID-9095C4FF-F4EB-4218-84AA-83061186625F))

#### Oracle Syntax

```none
SET system_variable value
```

##### SnowSQL (CLI Client) !set

```none
!set <option>=<value>
```

> **Note:**
>
> Snowflake recommends not adding whitespace in the variable value assignment statement.

#### 1. Set with simple variable assignment

> **Hint:**
>
> This case is functionally equivalent.

The `SET` command is replaced by the [`!set`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#set) command.

##### Oracle

##### Command

```sql
SET wrap on
```

##### SnowSQL (CLI Client)

##### Command

```sql
!set wrap=true
```

#### 2. Define without variable assignments

> **Warning:**
>
> This case is not functionally equivalent.

Oracle allows bypassing the key-value rule for assigning values to system variables with a numeric domain, assigning the value of 0 by default in such cases. In Snowflake this is not allowed, so an alternative is to set the value of 0 to a said variable explicitly.

##### Oracle

##### Command

```sql
SET pagesize
```

##### SnowSQL (CLI Client)

##### Command

```sql
!set rowset_size=0
```

### Known Issues

**1. Predefined variables**

The SET command only works for system variables, which may differ in quantity, name, or domain between the two languages, so a review should be done on the variable being used within the command to find its correct Snowflake equivalence. To see the list of system variables in Oracle you can use the command `SHOW ALL` whereas in Snowflake you can use [`!options`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#options-opts).

### Related EWIs

No related EWIs.

## Show

> **Warning:**
>
> Transformation for this command is pending

### Description

> Shows the value of a SQLPlus system variable or the current SQLPlus environment. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/SHOW.html#GUID-6BB1499D-E537-43D1-A209-401F5DB95E16))

#### Oracle Syntax

```none
SHO[W] system_variable  ALL BTI[TLE]  CON_ID  CON_NAME EDITION  ERR[ORS] [ {ANALYTIC VIEW | ATTRIBUTE DIMENSION | HIERARCHY | FUNCTION | PROCEDURE | PACKAGE | PACKAGE BODY | TRIGGER | VIEW | TYPE | TYPE BODY | DIMENSION | JAVA CLASS } [schema.]name]HISTORY  LNO  LOBPREF[ETCH]  PARAMETER[S] [parameter_name]  PDBS PNO  RECYC[LEBIN] [original_name]  REL[EASE]  REPF[OOTER]  REPH[EADER]  ROWPREF[ETCH] SGA SPOO[L]  SPPARAMETER[S] [parameter_name]  SQLCODE STATEMENTC[ACHE] TTI[TLE] USER XQUERY
```

Snowflake does not have a direct equivalent for this command. However, some of its functionalities can be emulated.

#### 1. Show ERRORS

> Shows the compilation errors of a stored procedure (includes stored functions, procedures, and packages). After you use the CREATE command to create a stored procedure, a message is displayed if the stored procedure has any compilation errors.

In Snowflake, performing an extra statement to display all the compilation errors is unnecessary. The compilation errors are displayed immediately when executing the CREATE statement.

##### Oracle

##### Command

```sql
CREATE OR REPLACE PROCEDURE RANCOM_PROC
AS
BEGIN
  INSERT INTO NE_TABLE SELECT 1 FROM DUAL;
END;

SHOW ERRORS
```

##### Result

```none
LINE/COL ERROR
-------- -----------------------------------------------------------------
4/3      PL/SQL: SQL Statement ignored
4/10     PL/SQL: ORA-00925: missing INTO keyword
```

> **Note:**
>
> Note that the INTO keyword is misspelled to cause a compilation error.

##### SnowSQL (CLI Client)

##### Command

```sql
CREATE OR REPLACE PROCEDURE RANCOM_PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    INSERT INTO NE_TABLE
    SELECT 1 FROM DUAL;
  END;
$$;

!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SHOW STATEMENT' NODE ***/!!!

SHOW ERRORS;
```

##### Result

```none
001003 (42000): SQL compilation error:
syntax error line 3 at position 7 unexpected 'INT'.
syntax error line 3 at position 11 unexpected 'PUBLIC'.
```

#### Show ALL

> Lists the settings of all SHOW options, except ERRORS and SGA, in alphabetical order.

To display all the possible options in SnowCLI you can run the [`!options`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#options-opts) command.

##### Oracle

##### Command

```sql
show all;
```

##### Result

```none
appinfo is OFF and set to "SQL*Plus"
arraysize 15
autocommit OFF
autoprint OFF
autorecovery OFF
autotrace OFF
blockterminator "." (hex 2e)
btitle OFF and is the first few characters of the next SELECT statement
cmdsep OFF
colinvisible OFF
coljson OFF
colsep " "
compatibility version NATIVE
concat "." (hex 2e)
copycommit 0
COPYTYPECHECK is ON
define "&" (hex 26)
describe DEPTH 1 LINENUM OFF INDENT ON
echo OFF
editfile "afiedt.buf"
embedded OFF
errorlogging is OFF
escape OFF
escchar OFF
exitcommit ON
FEEDBACK ON for 6 or more rows SQL_ID OFF
flagger OFF
flush ON
fullcolname OFF
heading ON
headsep "|" (hex 7c)
history is OFF
instance "local"
jsonprint NORMAL
linesize 80
lno 5
loboffset 1
lobprefetch 0
logsource ""
long 80
longchunksize 80
markup HTML OFF HEAD "<style type='text/css'> body {font:10pt Arial,Helvetica,sans-serif; color:black; background:White;} p {font:10pt Arial,Helvetica,sans-serif; color:black; background:White;} table,tr,td {font:10pt Arial,Helvetica,sans-serif; color:Black; background:#f7f7e7; padding:0px 0px 0px 0px; margin:0px 0px 0px 0px;} th {font:bold 10pt Arial,Helvetica,sans-serif; color:#336699; background:#cccc99; padding:0px 0px 0px 0px;} h1 {font:16pt Arial,Helvetica,Geneva,sans-serif; color:#336699; background-color:White; border-bottom:1px solid #cccc99; margin-top:0pt; margin-bottom:0pt; padding:0px 0px 0px 0px;-
} h2 {font:bold 10pt Arial,Helvetica,Geneva,sans-serif; color:#336699; background-color:White; margin-top:4pt; margin-bottom:0pt;} a {font:9pt Arial,Helvetica,sans-serif; color:#663300; background:#ffffff; margin-top:0pt; margin-bottom:0pt; vertical-align:top;}</style><title>SQL*Plus Report</title>" BODY "" TABLE "border='1' width='90%' align='center' summary='Script output'" SPOOL OFF ENTMAP ON PREFORMAT OFF
markup CSV OFF DELIMITER , QUOTE ON
newpage 1
null ""
numformat ""
numwidth 10
pagesize 14
PAUSE is OFF
pno 1
recsep WRAP
recsepchar " " (hex 20)
release 2103000000
repfooter OFF and is NULL
repheader OFF and is NULL
rowlimit OFF
rowprefetch 1
securedcol is OFF
serveroutput OFF
shiftinout INVISIBLE
showmode OFF
spool OFF
sqlblanklines OFF
sqlcase MIXED
sqlcode 0
sqlcontinue "> "
sqlnumber ON
sqlpluscompatibility 21.0.0
sqlprefix "#" (hex 23)
sqlprompt "SQL> "
sqlterminator ";" (hex 3b)
statementcache is 0
suffix "sql"
tab ON
termout ON
timing OFF
trimout ON
trimspool OFF
ttitle OFF and is the first few characters of the next SELECT statement
underline "-" (hex 2d)
USER is "SYSTEM"
verify ON
wrap : lines will be wrapped
xmloptimizationcheck OFF
```

##### SnowSQL (CLI Client)

##### Command

```sql
!options
```

##### Result

| Name | Value | Help |
| --- | --- | --- |
| auto_completion | True | Displays auto-completion suggestions for commands and Snowflake objects |
| client_session_keep_alive | False | Keeps the session active indefinitely, even if there is no activity from the user. |
| client_store_temporary_credential | False | Enable Linux users to use temporary file to store ID_TOKEN. |
| connection_options | {} | Set arbitrary connection parameters in underlying Python connector connections. |
| echo | False | Outputs the SQL command to the terminal when it is executed |
| editor | vim | Changes the editor to use for the !edit command |
| empty_for_null_in_tsv | False | Outputs an empty string for NULL values in TSV format |
| environment_variables | [] | Specifies the environment variables to be set in the SnowSQL variables. |
|  |  | The variable names should be comma separated. |
| execution_only | False | Executes queries only. No data will be fetched |
| exit_on_error | False | Quits when SnowSQL encounters an error |
| fix_parameter_precedence | True | Fix the connection parameter precedence in the order of 1) Environment variables, 2) Connection parameters, 3) Default connection parameters. |
| force_put_overwrite | False | Forces OVERWRITE=true for PUT. This is to mitigate S3’s eventually consistent issue. |
| friendly | True | Shows the splash text and goodbye messages |
| header | True | Outputs the header in query results |
| insecure_mode | False | Turns off OSCP certificate checks |
| key_bindings | emacs | Changes keybindings for navigating the prompt to emacs or vi |
| log_bootstrap_file | ../snowsql_rt.log_bo.. | SnowSQL bootstrap log file location |
| log_file | ../snowsql_rt.log | SnowSQL main log file location |
| log_level | DEBUG | Changes the log level (critical, debug, info, error, warning) |
| login_timeout | 120 | Login timeout in seconds. |
| noup | False | Turns off auto upgrading Snowsql |
| ocsp_fail_open | True | Sets the fail open mode for OCSP Failures. For help please refer the documentation. |
| output_file | None | Writes output to the specified file in addition to the terminal |
| output_format | psql | Sets the output format for query results. |
| paging | False | Enables paging to pause output per screen height. |
| progress_bar | True | Shows progress bar while transferring data. |
| prompt_format | [user]#[warehouse]@[.. | Sets the prompt format. For help, see the documentation |
| quiet | False | Hides all output |
| remove_comments | False | Removes comments before sending query to Snowflake |
| remove_trailing_semicolons | False | Removes trailing semicolons from SQL text before sending queries to Snowflake |
| results | True | If set to off, queries will be sent asynchronously, but no results will be fetched. |
|  |  | Use !queries to check the status. |
| rowset_size | 1000 | Sets the size of rowsets to fetch from the server. |
|  |  | Set the option low for smooth output, high for fast output. |
| sfqid | False | Turns on/off Snowflake query id in the summary. |
| sfqid_in_error | False | Turns on/off Snowflake query id in the error message |
| sql_delimiter | ; | Defines what reserved keyword splits SQL statements from each other. |
| sql_split | snowflake.connector… | Choose SQL spliter implementation. Currently snowflake.connector.util_text, or snowflake.cli.sqlsplit. |
| stop_on_error | False | Stops all queries yet to run when SnowSQL encounters an error |
| syntax_style | default | Sets the colors for the text of SnowSQL. |
| timing | True | Turns on/off timing for each query |
| timing_in_output_file | False | Includes timing in the output file. |
| variable_substitution | False | Substitutes variables (starting with ‘&’) with values |
| version | 1.2.24 | SnowSQL version |
| wrap | True | Truncates lines at the width of the terminal screen |
| ———————————– | ———————— | ———————————————————————————————————————————————– |

### Known Issues

**1. It’s not possible in SnowCLI to display the value of a single option.**

SnowCLI does not provide a way to display the value of a specific option. You may use `!options` to watch the value of the option.

**2. Research is pending to match each SQLPLUS option to a SnowflakeCLI equivalent.**

It is pending to define an equivalent for each SQLPLUS option.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Spool

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `SPOOL` command stores query results in a file, or optionally sends the file to a printer. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/SPOOL.html#GUID-61492052-ECCB-45C8-AF94-AB9794C60BEA))

#### Oracle Syntax

```none
SPO[OL] [file_name[.ext] [CRE[ATE] | REP[LACE] | APP[END]] | OFF | OUT]
```

##### SnowSQL (CLI Client) !spool

```none
!spool [<file_name>] | [off]
```

#### 1. Spool without options

> **Hint:**
>
> This case is functionally equivalent.

When the `SPOOL` command is not accompanied by any option, by default it creates a new file with the specified name and extension. The `SPOOL` command is replaced by the [`!spool`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#spool) command.

##### Oracle

##### Command

```sql
SPOOL temp
SPOOL temp.txt
```

##### SnowSQL (CLI Client)

##### Command

```sql
!spool temp
!spool temp.txt
```

#### 2. Spool with write options

> **Warning:**
>
> This case is not functionally equivalent.

Oracle allows 3 types of options when writing to a file through the `SPOOL` command, the CREATE and APPEND options create a file for writing from scratch and concatenate text to the end of an existing file (or create a new one if it doesn’t exist) respectively. Snowflake does not support these options, however, its default behavior is to create a file and if it exists, concatenate the text in it. The REPLACE option, on the other hand, writes to the specific file replacing the existing content. To simulate this behavior in Snowflake it is recommended to delete the file where you want to write and start writing again, as shown in the following code

##### Oracle

##### Command

```sql
SPOOL temp.txt CREATE
SPOOL temp.txt APPEND
SPOOL temp.txt REPLACE
```

##### SnowSQL (CLI Client)

##### Command

```sql
!spool temp.txt
!spool temp.txt

!system del temp.txt
!spool temp.txt
```

#### 3. Spool turn off

> **Hint:**
>
> This case is functionally equivalent.

Oracle has two options to turn off results spooling, OFF and OUT. both are meant to stop rolling, with the difference that the second also sends the file to the computer’s standard (default) printer. This option is not available on some operating systems. Snowflake only has the option to turn off results spooling

##### Oracle

##### Command

```sql
SPOOL OFF
SPOOL OUT
```

##### SnowSQL (CLI Client)

##### Command

```sql
!spool off
!spool off
```

### Known Issues

No Known Issues.

### Related EWIs

No related EWIs.

## Start

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `START` command runs the SQL\*Plus statements in the specified script. The script can be called from the local file system or from a web server. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/START.html#GUID-A8D3613E-A141-42FB-8288-654427BAB28F))

#### Oracle Syntax

```none
STA[RT] {url | file_name[.ext] } [arg...]
```

##### SnowSQL (CLI Client) !load

```none
!(load | source) {url | file_name[.ext] }
```

The Snowflake [`!source`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#source-load) and [`!load`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#source-load) commands are equivalent.

#### 1. Simple start

The `START` command is replaced by the [`!load`](https://docs.snowflake.com/en/user-guide/snowsql-use.html#source-load) command.

> **Hint:**
>
> This case is functionally equivalent.

##### Oracle

##### Command

```sql
START C:\Users\My_User\Desktop\My\Path\insert_script.sql
```

##### SnowSQL (CLI Client)

##### Command

```sql
!load C:\Users\My_User\Desktop\My\Path\insert_script.sql
```

#### 2. Start with arguments

##### Oracle

##### Command

```sql
START C:\Users\My_User\Desktop\My\Path\insert_script.sql 123 456 789
```

##### SnowSQL (CLI Client)

##### Command

```sql
!load C:\Users\My_User\Desktop\My\Path\insert_script.sql
```

> **Warning:**
>
> Script arguments are currently not supported for SnowSQL (CLI Client).

### Known Issues

**1. Arguments are not supported in the SnowSQL CLI Client**

Oracle can pass down multiple arguments to a script and can be accessed with &1, &2, and so on, but this cannot be done in the SnowSQL CLI Client. You can simulate arguments by declaring variables with the `!define` command. Keep in mind that these values are defined globally for all the scripts so the behavior may not be equivalent.

This workaround would look something like this:

```sql
!set variable_substitution=true
!define 1=123
!define 2=456
!define 3=789
!load C:\Users\My_User\Desktop\My\Path\insert_script.sql
```

### Related EWIs

No related EWIs.

## Whenever oserror

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `WHENEVER OSERROR` command Performs the specified action (exits SQL\*Plus by default) if an operating system error occurs. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/WHENEVER-OSERROR.html#GUID-A52F926F-D6EC-434E-9C7E-CFDB76422E94))

#### Oracle Syntax

```none
WHENEVER OSERROR {EXIT [SUCCESS | FAILURE | n | variable | :BindVariable]  [COMMIT | ROLLBACK] | CONTINUE [COMMIT | ROLLBACK | NONE]}
```

Snowflake does not support the use of this command and does not have any that might resemble its functionality. At the time of transformation, an EWI will be added.

#### 1. Whenever oserror command

##### Oracle

##### Command

```sql
WHENEVER OSERROR EXIT
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'WHENEVER ERROR STATEMENT' NODE ***/!!!
WHENEVER OSERROR EXIT;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Whenever sqlerror

> **Warning:**
>
> Transformation for this command is pending

### Description

> The `WHENEVER SQLERROR` command Performs the specified action (exits SQL\*Plus by default) if a SQL command or PL/SQL block generates an error. ([Oracle SQL Plus User’s Guide and Reference](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqpug/WHENEVER-SQLERROR.html#GUID-66C1C12C-5E95-4440-A37B-7CCE7E33491C))

#### Oracle Syntax

```none
WHENEVER SQLERROR {EXIT [SUCCESS | FAILURE | WARNING | n | variable  | :BindVariable] [COMMIT | ROLLBACK] | CONTINUE [COMMIT | ROLLBACK | NONE]}
```

Snowflake does not support the use of this command and does not have any that might resemble its functionality. At the time of transformation, an EWI will be added.

#### 1. Whenever sqlerror command

##### Oracle

##### Command

```sql
WHENEVER SQLERROR EXIT
```

##### SnowSQL (CLI Client)

##### Command

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'WHENEVER ERROR STATEMENT' NODE ***/!!!
WHENEVER SQLERROR EXIT;
```

### Known Issues

No Known Issues.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - Oracle - User-Defined Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/user-defined-types.md
section: Migrations
---

# SnowConvert AI - Oracle - User-Defined Types

## Description

> User-defined data types use Oracle built-in data types and other user-defined data types as the building blocks of object types that model the structure and behavior of data in applications. The sections that follow describe the various categories of user-defined types. ([Oracle SQL Language Reference User-defined Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-7CF27C66-9908-4C02-9401-06C2F2C4021C))

> **Warning:**
>
> Snowflake does not have any support for User-defined Types. This page is meant to be a summary of Oracle’s features. For the current status of User-defined Types in the SnowConvert AI tool please refer to the [Create Type Statement Page](../../sql-translation-reference/create_type.md) and its subpages.

## Object Types

> **Note:**
>
> SnowConvert AI offers partial translation for Object Types, for more information on this, please refer to the next section: [Object type definition](../../sql-translation-reference/create_type.md)

## REF Data Types

> **Danger:**
>
> Ref Data Types are not recognized by SnowConvert AI, and are instead shown as unrecognized “User-defined Functions”. For more information about them, please read the REF Data Types subpage.

> An object identifier (represented by the keyword `OID`) uniquely identifies an object and enables you to reference the object from other objects or from relational tables. A data type category called `REF` represents such references. A `REF` data type is a container for an object identifier. `REF` values are pointers to objects. ([Oracle SQL Language Reference REF Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-C9818949-BB51-4EB1-9A6D-2BE1F53B105D))

## Varrays

> **Warning:**
>
> SnowConvert AI only recognizes these elements but does not offer any translation for them, for more information on this, please refer to the next section: [Array type definition](../../sql-translation-reference/create_type.md)

## Nested Tables

> **Warning:**
>
> SnowConvert AI only recognizes these elements but does not offer any translation for them since there are no known workarounds for them, for more information on this, please refer to the next section:[Nested table type definition](../../sql-translation-reference/create_type.md)

## Known Issues

### 1. DML usages for Object Types are not being transformed

As of now, only DDL definitions that use User-Defined Types are being transformed into Variant. This means that any Inserts, Updates or Deletes using User-defined Types are not being transformed and need to be manually transformed. There is no EWI for this but there is a work item to add this corresponding EWI.

#### 2. Nested Table types are not being transformed

There is no known workaround for implementing Nested Tables, for this reason SnowConvert AI only offers recognition of these elements.

#### 3. Array types are not being transformed

For now SnowConvert AI only recognizes these elements. A known workaround exists and there is a work item to implement them.

#### 4. REF Data Types are not supported by SnowConvert AI, but there is no EWI related to them

They are not supported, and instead are reported as an unknown User-Defined Function, but there is a work item to add this corresponding EWI.

## Related EWIs

No related EWIs.

## REF Data Types

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> An object identifier (represented by the keyword `OID`) uniquely identifies an object and enables you to reference the object from other objects or relational tables. A data type category called `REF` represents such references. A `REF` data type is a container for an object identifier. `REF` values are pointers to objects. ([Oracle SQL Language Reference REF Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-C9818949-BB51-4EB1-9A6D-2BE1F53B105D))

REF Data types are not supported in Snowflake, and there is no current workaround to implement a similar component.

As of now, they are currently being recognized as user-defined functions and “DANGLING” clauses are not being recognized. Finally, the OID clause in view is being removed, as there is no workaround for them.

```sql
CREATE VIEW generic_view AS
SELECT REF(type) AS ref_col, MAKE_REF(type, identifier_column) AS make_ref_col
FROM generic_table;

SELECT v.ref_col, v.make_ref_col
FROM generic_view v
WHERE v.ref_col IS NOT DANGLING AND v.make_ref_col IS NOT DANGLING
```

### Sample Source Patterns

#### Types and Tables for References

Please consider the following types, tables, inserts and view. They will be used for the next pattern section.

##### Oracle

```sql
CREATE TYPE email_typ_demo AS OBJECT
	( email_id INTEGER
	, email VARCHAR2(30)
	);

CREATE TYPE customer_typ_demo AS OBJECT
    ( customer_id        INTEGER
    , cust_first_name    VARCHAR2(20)
    , cust_last_name     VARCHAR2(20)
    , email_id			 INTEGER
    ) ;

CREATE TABLE email_table_demo OF email_typ_demo;
CREATE TABLE customer_table_demo OF customer_typ_demo;

INSERT INTO customer_table_demo VALUES
(customer_typ_demo(1, 'First Name 1', 'Last Name 1', 1));

INSERT INTO customer_table_demo VALUES
(customer_typ_demo(2, 'First Name 2', 'Last Name 2', 2));

INSERT INTO email_table_demo VALUES
(email_typ_demo(1, 'abc@def.com'));

CREATE VIEW email_object_view OF email_typ_demo WITH OBJECT IDENTIFIER (email_id) AS
SELECT * FROM email_table_demo;
```

#### Selects and Views using REFs

##### Oracle

```sql
CREATE VIEW email_object_view OF email_typ_demo WITH OBJECT IDENTIFIER (email_id) AS
SELECT * FROM email_table_demo;

CREATE VIEW customer_view AS
SELECT REF(ctb) AS customer_reference
     , MAKE_REF(email_object_view, ctb.email_id) AS email_ref
FROM customer_table_demo ctb;

SELECT c.customer_reference.cust_first_name, c.email_ref.email
FROM customer_view c;

SELECT c.customer_reference.cust_first_name, c.email_ref.email
FROM customer_view c
WHERE c.email_ref IS NOT DANGLING;
```

##### Result with danglings

| CUSTOMER_REFERENCE.CUST_FIRST_NAME | EMAIL_REF.EMAIL |
| --- | --- |
| First Name 1 | abc@def.com |
| First Name 2 |  |

##### Result with no danglings

| CUSTOMER_REFERENCE.CUST_FIRST_NAME | EMAIL_REF.EMAIL |
| --- | --- |
| First Name 1 | abc@def.com |

##### Snowflake

```sql
CREATE OR REPLACE VIEW email_object_view
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT * FROM
     email_table_demo;

CREATE OR REPLACE VIEW customer_view
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
SELECT REF(ctb) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'REF' NODE ***/!!! AS customer_reference
     , MAKE_REF(email_object_view, ctb.email_id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'MAKE_REF' NODE ***/!!! AS email_ref
FROM
     customer_table_demo ctb;

     SELECT c.customer_reference.cust_first_name, c.email_ref.email
     FROM
     customer_view c;

     SELECT c.customer_reference.cust_first_name, c.email_ref.email
FROM
     customer_view c
WHERE c.email_ref;
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '14' COLUMN '19' OF THE SOURCE CODE STARTING AT 'IS'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS ';' ON LINE '10' COLUMN '21'. FAILED TOKEN WAS 'IS' ON LINE '14' COLUMN '19'. CODE '94'. **
--                   IS NOT DANGLING
```

### Known Issues

**1. REF and MAKE_REF are not being recognized**

Instead they are currently being marked as user-defined functions.

**2. DANGLING clause is not being recognized**

DANGLING clauses are causing parsing errors when running SnowConvert.

#### 3. OID Clauses in view are not supported by SnowConvert AI, but there is no EWI related to them

The OID clause is not supported by either SnowConvert AI, nor Snowflake but there should be an EWI related to them.

### Related EWIs

1. [SSC-EWI-0001](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.
2. [SSC-EWI-0073](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-FDM-0001](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Views selecting all columns from a single table are not required in Snowflake.

---
title: SnowConvert AI - Oracle - Wrapped objects
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/wrapped-objects.md
section: Migrations
---

# SnowConvert AI - Oracle - Wrapped objects

Input code can contain wrapped objects depending on the extraction tool used to produce it. Encrypted code will be exported as a “nonsense” group of characters which are preceded with the “wrapped” word. We call these blocks wrapped objects, they may run in Oracle but won’t be transformed by SnowConvert.

This wrapped code can cause **low conversion rates** in the tool because for now, the migrator tries to recognize those blocks and comment out the entire object. This code is considered not supported and will affect the conversion rate negatively.

The following objects can appear wrapped:

* Functions
* Procedures
* Packages
* Package bodies
* Types
* Type bodies

This is how the source code may look like (sometimes with thousands of lines of code):

```sql
CREATE OR REPLACE PACKAGE BOOKS_ADMIN.PKG_2 wrapped
a000000
b2
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
abcd
9
78 ba
ob/kXtqN74HGC6XDBIra6MlzY6Awg5m49TOf9b9c56Wf0HgJuHQrjwb1mYHHywjS/l6mf3Qq
5OYQspR6c+ZxVUzWIZSscYTm1uRwz/bR/6nKqhfqnFDKDvNnp2tgdQvIa+HIuDO4dAlLwlxp
lgxH+pYJWqEuDFbXPsyxoIvAgcctyaamw2YsCg==

/
```

And this is how the output should look:

```sql
----** SSC-OOS - OUT OF SCOPE CODE UNIT. Wrapped PACKAGE IS OUT OF TRANSLATION SCOPE. **
--CREATE OR REPLACE PACKAGE BOOKS_ADMIN.PKG_2 wrapped
--a000000
--b2
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--abcd
--9
--78 ba
--ob/kXtqN74HGC6XDBIra6MlzY6Awg5m49TOf9b9c56Wf0HgJuHQrjwb1mYHHywjS/l6mf3Qq
--5OYQspR6c+ZxVUzWIZSscYTm1uRwz/bR/6nKqhfqnFDKDvNnp2tgdQvIa+HIuDO4dAlLwlxp
--lgxH+pYJWqEuDFbXPsyxoIvAgcctyaamw2YsCg==

/
```

The objects recognized as wrapped, are being counted in the assessment reports. Find a total wrapped objects count in the second page of the Assessment.docx report:

Also, you can find counts for each specific wrapped object that was recognized in the corresponding statement section:

As a user of the tool you may want to:

* Decrypt and extract the objects again from your database.
* Remove these objects from your source code.
* No actions. Objects should be assessed and commented out, but the conversion rate may drop.

---
title: SnowConvert AI - Oracle - XML Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/basic-elements-of-oracle-sql/data-types/xml-types.md
section: Migrations
---

# SnowConvert AI - Oracle - XML Types

## Description

> Extensible Markup Language (XML) is a standard format developed by the World Wide Web Consortium (W3C) for representing structured and unstructured data on the World Wide Web. Universal resource identifiers (URIs) identify resources such as Web pages anywhere on the Web. Oracle provides types to handle XML and URI data, as well as a class of URIs called `DBURIRef` types to access data stored within the database itself. ([Oracle SQL Language Reference XML Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-BF935A5E-3E6C-42C0-AA18-05D3A268D7D8))

## URIFactory Package

### Description

> Oracle also provides the `URIFactory` package, which can create and return instances of the various subtypes of the `URITypes`. The package analyzes the URL string, identifies the type of URL (HTTP, `DBURI`, and so on), and creates an instance of the subtype. ([Oracle SQL Language Reference URIFactory Package](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-1CA616C7-BFA7-4AFC-A199-7589DA049CB6))

URIFactory contains the following subprograms:

* GETURI
* ESCAPEURI
* UNESCAPURI
* REGISTERURLHANDLER
* UNREGISTERURLHANDLER

### GETURI

#### Oracle

```sql
SELECT SYS.URIFACTORY.GETURI('http://localhost/').GETURL() FROM dual;
```

#### Result

| SYS.URIFACTORY.GETURI(‘HTTP://LOCALHOST/’).GETURL() |
| --- |
| http://localhost/ |

#### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'SYS.URIFACTORY.GETURI' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS GETURI.GETURL() FROM dual;
```

### ESCAPEURI

#### Oracle

```sql
SELECT SYS.URIFACTORY.ESCAPEURI('http://www.<->') FROM dual;
```

#### Result

| SYS.URIFACTORY.ESCAPEURI(‘HTTP://WWW.<->’) |
| --- |
| http://www.%3C-%3E |

#### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'SYS.URIFACTORY.ESCAPEURI' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS ESCAPEURI
FROM dual;
```

### UNESCAPEURI

#### Oracle

```sql
SELECT SYS.URIFACTORY.UNESCAPEURI('http://www.%24-%26-%3C-%3E-%3F') FROM dual;
```

#### Result

| SYS.URIFACTORY.UNESCAPEURI(‘HTTP://WWW.%24-%26-%3C-%3E-%3F’) |
| --- |
| http://www.$-&-<->-? |

#### Snowflake

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'SYS.URIFACTORY.UNESCAPEURI' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS UNESCAPEURI
FROM dual;
```

### REGISTERURLHANDLER

#### Oracle

```sql
CREATE TABLE url_table (urlcol varchar2(80));
INSERT INTO url_table VALUES ('http://www.google.com/');

CREATE OR REPLACE TYPE SCURIType UNDER SYS.URIType (
  OVERRIDING MEMBER FUNCTION getClob RETURN CLOB,
  OVERRIDING MEMBER FUNCTION getBlob RETURN BLOB,
  OVERRIDING MEMBER FUNCTION getExternalURL RETURN VARCHAR2,
  OVERRIDING MEMBER FUNCTION getURI RETURN VARCHAR2,
  STATIC FUNCTION createURI(url IN VARCHAR2) RETURN SCURIType);
/

CALL URIFACTORY.REGISTERURLHANDLER('sc://','HR','SCURITYPE');

INSERT INTO url_table VALUES ('SC://company1/company2=22/comp');
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE url_table (urlcol VARCHAR(80))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO url_table
VALUES ('http://www.google.com/');

--!!!RESOLVE EWI!!! /*** SSC-EWI-OR0007 - CREATE TYPE SUBTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!

--CREATE OR REPLACE TYPE SCURIType UNDER SYS.URIType (
--  OVERRIDING MEMBER FUNCTION getClob RETURN CLOB,
--  OVERRIDING MEMBER FUNCTION getBlob RETURN BLOB,
--  OVERRIDING MEMBER FUNCTION getExternalURL RETURN VARCHAR2,
--  OVERRIDING MEMBER FUNCTION getURI RETURN VARCHAR2,
--  STATIC FUNCTION createURI(url IN VARCHAR2) RETURN SCURIType)
                                                              ;

CALL URIFACTORY.REGISTERURLHANDLER('sc://','HR','SCURITYPE');

INSERT INTO url_table
VALUES ('SC://company1/company2=22/comp');
```

### UNREGISTERURLHANDLER

#### Oracle

```sql
CALL URIFACTORY.UNREGISTERURLHANDLER('sc://');
```

#### Snowflake

```sql
CALL URIFACTORY.UNREGISTERURLHANDLER('sc://');
```

### Known Issues

**1. Subprograms of URIFactory Package are not recognized**

SnowConvert AI does not transform subprograms of built-in packages. Most of the functionality of URI types is not currently supported by Snowflake.

**2. Missing EWIs for URIFactory Package**

The output code should display an EWI indicating that some functionality is not supported by Snowflake. There is a work item to fix this issue.

### Related EWIs

1. [SSC-EWI-OR0007](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Create Type Not Supported in Snowflake.
2. [SSC-EWI-OR0076](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md): Built In Package Not Supported.

## XMLType

### Description

> This Oracle-supplied type can be used to store and query XML data in the database. `XMLType` has member functions you can use to access, extract, and query the XML data using XPath expressions. ([Oracle SQL Language Reference XML Data Type](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-639B3C49-AE4A-43F2-91BF-19BD53FE8193))

Snowflake handles semi-structured data types (including XMLTYPE) using the VARIANT data type, for this reason, XMLTYPEs are to be migrated to VARIANT, and then usages of functions used to manipulate and query XML must be migrated to Snowflake’s counterparts. For more information on how to use XML in Snowflake, please refer to [this post](https://community.snowflake.com/s/article/More-Tips-and-Tricks-for-Working-with-XML-in-Snowflake) in the Snowflake forum and the [TO_XML](https://docs.snowflake.com/en/sql-reference/functions/to_xml.html) function documentation in Snowflake.

```sql
XMLTYPE
```

### Sample Source Patterns

#### XMLType in Create Table

##### Oracle

```sql
CREATE TABLE xml_table(
    xml_column XMLTYPE
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE xml_table (
        xml_column VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XMLTYPE DATA TYPE CONVERTED TO VARIANT ***/!!!
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
```

#### Insert data in the XML column

##### Oracle

```sql
INSERT INTO xml_table VALUES(
    XMLType(
'<?xml version="1.0"?>
<note>
  <to>SnowConvert AI</to>
  <from>Oracle</from>
  <heading>Greeting</heading>
  <body>Hello there!</body>
</note>')
);
```

##### Snowflake

```sql
INSERT INTO xml_table
VALUES(
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0016 - FUNCTION RELATED WITH XML NOT SUPPORTED ***/!!!
    XMLType(
'<?xml version="1.0"?>
<note>
  <to>SnowConvert AI</to>
  <from>Oracle</from>
  <heading>Greeting</heading>
  <body>Hello there!</body>
</note>')
);
```

### Known Issues

#### 1. XMLType manipulation and query functions are not recognized

The functions for manipulating and querying XML such as XMLTYPE() are not being recognized nor transformed by SnowConvert.

### Related EWIs

1. [SSC-EWI-0036:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Data type converted to another data type.
2. [SSC-EWI-OR0016:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md) XML is not supported.

## URI Data Types

### Description

> Oracle supplies a family of URI types—`URIType`, `DBURIType`, `XDBURIType`, and `HTTPURIType`—which are related by an inheritance hierarchy. `URIType` is an object type and the others are subtypes of `URIType`. ([Oracle SQL Language Reference URI Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-6C9AC925-4E3F-476D-BB63-5A70CC12FC40))

## DBURIType

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> `DBURIType` can be used to store `DBURIRef` values, which reference data inside the database. Storing `DBURIRef` values lets you reference data stored inside or outside the database and access the data consistently. ([Oracle SQL Language Reference URI Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-6C9AC925-4E3F-476D-BB63-5A70CC12FC40))

```none
DBURIType
```

### Sample Source Patterns

> **Note:**
>
> Check this [section](../../sample-data.md) to set up the sample database.

#### DBURIType in create table

##### Oracle

```sql
CREATE TABLE dburitype_table(
    db_uritype_column DBURITYPE,
    sys_db_uritype_column SYS.DBURITYPE
);

INSERT INTO dburitype_table (db_uritype_column) VALUES (
    dburitype.createUri('/HR/EMPLOYEES/ROW[EMPLOYEE_ID=205]/FIRST_NAME ')
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE dburitype_table (
        db_uritype_column VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'DBURITYPE' USAGE CHANGED TO VARIANT ***/!!!,
        !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
        sys_db_uritype_column SYS.DBURITYPE
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;

    CREATE OR REPLACE VIEW PUBLIC.dburitype_table_view
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
    AS
    SELECT
        db_uritype_column,
        sys_db_uritype_column
    FROM
        dburitype_table;

        INSERT INTO dburitype_table(db_uritype_column) VALUES (
    dburitype.createUri('/HR/EMPLOYEES/ROW[EMPLOYEE_ID=205]/FIRST_NAME ') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'dburitype.createUri' NODE ***/!!!
);
```

#### Retrieving data from DBURIType column

##### Oracle

```sql
SELECT dt.db_uritype_column.getclob() FROM dburitype_table dt;
```

##### Result

| DT.DB_URITYPE_COLUMN.GETCLOB() |
| --- |
| xml version="1.0"?¶ <FIRST_NAME>Shelley</FIRST_NAME>¶ |

This result query has XML syntax, this is how it is displayed:

```xml
<?xml version="1.0"?>
 <FIRST_NAME>Shelley</FIRST_NAME>
```

##### Snowflake

```sql
SELECT dt.db_uritype_column.getclob() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'dt.db_uritype_column.getclob' NODE ***/!!! FROM
dburitype_table dt;
```

> **Warning:**
>
> getclob function is not being transformed by the tool, but is necessary to display the data in Oracle, this transformation is going to be available in future releases.

### Known Issues

**1. DBURIType** **Data Type not recognized**

DBURIType is parsed and converted as Custom Data Type by SnowConvert AI or as not supported type if it uses the prefix SYS, there is a work item to fix this issue

### Related EWIs

1. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported.
2. [SSC-EWI-0062:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Custom type usage changed to variant
3. [SSC-EWI-0073:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Pending functional equivalence review

## HTTPURIType

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> You can use `HTTPURIType` to store URLs to external Web pages or to files. Oracle accesses these files using HTTP (Hypertext Transfer Protocol). ([Oracle SQL Language Reference URI Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-6C9AC925-4E3F-476D-BB63-5A70CC12FC40))

```none
HTTPURITYPE
```

### Sample Source Patterns

#### HTTPURIType in create table

##### Oracle

```sql
CREATE TABLE httpuritype_table(
    http_uritype_column HTTPURITYPE,
    sys_http_uritype_column SYS.HTTPURITYPE
);

INSERT INTO httpuritype_table (http_uritype_column) VALUES(
    HTTPURITYPE.createuri('http://localhost/')
);
INSERT INTO httpuritype_table (http_uritype_column) VALUES(
    HTTPURITYPE.createuri('www.google.com')
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE httpuritype_table (
	    http_uritype_column VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'HTTPURITYPE' USAGE CHANGED TO VARIANT ***/!!!,
	    !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
	    sys_http_uritype_column SYS.HTTPURITYPE
	)
	COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
	;

	CREATE OR REPLACE VIEW PUBLIC.httpuritype_table_view
	COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
	AS
	SELECT
	    http_uritype_column,
	    sys_http_uritype_column
	FROM
	    httpuritype_table;

	    INSERT INTO httpuritype_table(http_uritype_column) VALUES(
    HTTPURITYPE.createuri('http://localhost/') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'HTTPURITYPE.createuri' NODE ***/!!!
);

	    INSERT INTO httpuritype_table(http_uritype_column) VALUES(
    HTTPURITYPE.createuri('www.google.com') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'HTTPURITYPE.createuri' NODE ***/!!!
);
```

#### Retrieving data from HTTPURIType column

##### Oracle

```sql
SELECT
	ut.http_uritype_column.getUrl(),
	ut.http_uritype_column.getExternalUrl()
FROM
	httpuritype_table ut;
```

##### Result

| UT.HTTP_URITYPE_COLUMN.GETURL() | UT.HTTP_URITYPE_COLUMN.GETEXTERNALURL() |
| --- | --- |
| http://localhost/ | http://localhost/ |
| http://www.google.com | http://www.google.com |

##### Snowflake

```sql
SELECT
	ut.http_uritype_column.getUrl() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ut.http_uritype_column.getUrl' NODE ***/!!!,
	ut.http_uritype_column.getExternalUrl() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ut.http_uritype_column.getExternalUrl' NODE ***/!!!
FROM
	httpuritype_table ut;
```

> **Warning:**
>
> getUrl and getExternalUrl functions are not being transformed by the tool, but are necessary to display the data in Oracle, this transformation is going to be available in future releases.

### Known Issues

**1. HTTPURIType** **Data Type not recognized**

HTTPURIType is parsed and converted as Custom Data Type by SnowConvert AI or as not supported type if it uses the prefix SYS, there is a work item to fix this issue

### Related EWIs

1. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported.
2. [SSC-EWI-0062:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Custom type usage changed to variant.
3. [SSC-EWI-0073:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Pending functional equivalence review.

## XDBURIType

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> You can use `XDBURIType` to expose documents in the XML database hierarchy as URIs that can be embedded in any `URIType` column in a table. The `XDBURIType` consists of a URL, which comprises the hierarchical name of the XML document to which it refers and an optional fragment representing the XPath syntax. ([Oracle SQL Language Reference URI Data Types](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/Data-Types.html#GUID-6C9AC925-4E3F-476D-BB63-5A70CC12FC40))

```none
XDBURITYPE
```

### Sample Source Patterns

#### XDBURIType in create table

##### Oracle

```sql
CREATE TABLE xdburitype_table(
    xdb_uritype_column XDBURITYPE,
    sys_xdb_uritype_column SYS.XDBURITYPE
);

INSERT INTO xdburitype_table (xdb_uritype_column) VALUES(
    xdburitype('/home/OE/employees/emp_selby.xml')
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE xdburitype_table (
        xdb_uritype_column VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'XDBURITYPE' USAGE CHANGED TO VARIANT ***/!!!,
        !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
        sys_xdb_uritype_column SYS.XDBURITYPE
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    CREATE OR REPLACE VIEW PUBLIC.xdburitype_table_view
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "" }}'
    AS
    SELECT
        xdb_uritype_column,
        sys_xdb_uritype_column
    FROM
        xdburitype_table;

        INSERT INTO xdburitype_table(xdb_uritype_column) VALUES(
    xdburitype('/home/OE/employees/emp_selby.xml') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'xdburitype' NODE ***/!!!
);
```

#### Retrieving data from XDBURIType column

##### Oracle

```sql
SELECT ut.xdb_uritype_column.getclob() FROM xdburitype_table ut;
```

##### Result

| UT.XDB_URITYPE_COLUMN.GETCLOB() |
| --- |
| <emp_name>selby</emp_name> |

This result query has XML syntax, this is how it is displayed:

```xml
<emp_name>selby</emp_name>
```

##### Snowflake

```sql
SELECT ut.xdb_uritype_column.getclob() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ut.xdb_uritype_column.getclob' NODE ***/!!! FROM
xdburitype_table ut;
```

> **Warning:**
>
> getclob function is not being transformed by the tool, but is necessary to display the data in Oracle, this transformation is going to be available in future releases.

### Known Issues

**1. XDBURIType** **Data Type not recognized**

XDBURIType is parsed and converted as Custom Data Type by SnowConvert AI or as not supported type if it uses the prefix SYS, there is a work item to fix this issue

### Related EWIs

1. [SSC-EWI-0028](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Type not supported.
2. [SSC-EWI-0062:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Custom type usage changed to variant
3. [SSC-EWI-0073:](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Pending functional equivalence review

---
title: SnowConvert AI - Oracle Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/oracleFDM.md
section: Migrations
---

# SnowConvert AI - Oracle Functional Differences

## SSC-FDM-OR0001

> **Note:**
>
> This FDM was added for an old version of Oracle SnowConvert AI. Currently, it is deprecated.

### Description

This error is related to the ***Assessment*** report file. It appears when an error occurs while writing the assessment details report file.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0002

The sequence start value exceeds the max value allowed by Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0068](../conversion-issues/oracleEWI.md) documentation

### Description

This error appears when the `START WITH` statement value exceeds the maximum value allowed by Snowflake. What Snowflake said about the start value is: *Specifies the first value returned by the sequence. Supported values are any value that can be represented by a 64-bit two’s compliment integer (from `-2^63` to `2^63-1`)*. So according to the previously mentioned, the max value allowed is **9223372036854775807** for positive numbers and **9223372036854775808** for negative numbers.

#### Example Code

##### Input Code:

```sql
 CREATE SEQUENCE SEQUENCE1
START WITH 9223372036854775808;
```

```sql
 CREATE SEQUENCE SEQUENCE2
START WITH -9223372036854775809;
```

##### Generated Code:

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE1
--** SSC-FDM-OR0002 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. **
START WITH 9223372036854775808
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE2
--** SSC-FDM-OR0002 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. **
START WITH -9223372036854775809
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

#### Best Practices

* It can be recommended to just reset the sequence and modify its usage too. **NOTE**: the target column must have enough space for holding this value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0003

Search clause removed from the with element statement.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0038](../conversion-issues/oracleEWI.md) documentation

### Description

The [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is employed to define the order in which rows are processed in a SELECT statement. This functionality allows for a customized traversal of the data, ensuring that the results are returned in a specific sequence based on the specified criteria. It is important to note, however, that this behavior, characterized by the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142), is not supported in Snowflake.

In databases such as Oracle, the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is commonly used in conjunction with recursive queries or common table expressions (CTEs) to influence the sequence in which hierarchical data is explored. By designating a particular column or set of columns in the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142), you can control the depth-first or breadth-first traversal of the hierarchy, impacting the order in which rows are processed.

In Snowflake, [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) message will be generated, and the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is subsequently eliminated.

#### Example Code

##### Input Code:

```sql
 WITH dup_hiredate(eid, emp_last, mgr_id, reportLevel, hire_date, job_id) AS
(SELECT aValue from atable) SEARCH DEPTH FIRST BY hire_date SET order1 SELECT aValue from atable;
```

##### Generated Code:

```sql
 WITH dup_hiredate(eid, emp_last, mgr_id, reportLevel, hire_date, job_id) AS
(
SELECT aValue from
atable
) /*** SSC-FDM-OR0003 - SEARCH CLAUSE REMOVED FROM THE WITH ELEMENT STATEMENT ***/
SELECT aValue from
atable;
```

#### Recommendation

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0004

Siblings keyword removed from the order by clause because Snowflake does not support it.

### Description

In Oracle, the [ORDER BY SIBLINGS](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2171079) clause can be used in hierarchical queries to preserve the order of the data given by the hierarchy, while applying a reorder of the values that are siblings in the same hierarchy. This is not supported in Snowflake.

#### Example Code

##### Input Code:

```sql
 SELECT LEVEL,
       LPAD(' ', 2 * (LEVEL - 1)) || NAME AS FORMATTED_NAME,
       JOB_TITLE
FROM EMPLOYEES
START WITH MANAGER_ID IS NULL
CONNECT BY PRIOR EMPLOYEE_ID = MANAGER_ID
ORDER SIBLINGS BY NAME;
```

##### Generated Code:

```sql
 SELECT LEVEL,
       NVL(
       LPAD(' ', 2 * (
                      !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '-' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!LEVEL - 1)) :: STRING, '') || NVL(NAME :: STRING, '') AS FORMATTED_NAME,
       JOB_TITLE
FROM
       EMPLOYEES
START WITH MANAGER_ID IS NULL
CONNECT BY
       PRIOR EMPLOYEE_ID = MANAGER_ID
ORDER BY
       NAME /*** SSC-FDM-OR0004 - SIBLINGS KEYWORD REMOVED FROM ORDER BY CLAUSE BECAUSE SNOWFLAKE DOES NOT SUPPORT IT ***/;
```

* While the exact same ordering achieved with the SIBLINGS clause might not be accessible, there are a few alternatives to get a similar result.

  + Embed the query within an outer query that applies the desired sorting using `ORDER BY`.
  + Create a CTE with the hierarchical query using `CONNECT BY` and reference the CTE in a subsequent query to apply `ORDER BY` for sibling sorting (rows at the same level).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0005

Synonyms are not supported in Snowflake but references to this synonym were changed by the original object name.

### Description

Synonyms are not supported in Snowflake. The synonyms are replaced by the original name.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE TABLE1
(
    COLUMN1 NUMBER
);

CREATE OR REPLACE SYNONYM B.TABLE1_SYNONYM FOR TABLE1;
SELECT * FROM B.TABLE1_SYNONYM WHERE B.TABLE1_SYNONYM.COLUMN1 = 20;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TABLE1
    (
        COLUMN1 NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;

--    --** SSC-FDM-OR0005 - SYNONYMS NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS SYNONYM WERE CHANGED BY THE ORIGINAL OBJECT NAME. **

--    CREATE OR REPLACE SYNONYM B.TABLE1_SYNONYM FOR TABLE1
                                                         ;
SELECT * FROM
    TABLE1
    WHERE
    TABLE1.COLUMN1 = 20;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0006

Constraint state removed from not null inline constraint.

### Description

This warning occurs when the not null column constraint contains one of the following Oracle constraint states as part of the column inline definition:

```sql
 [ RELY | NORELY | RELY DISABLE | RELY ENABLE | VALIDATE | NOVALIDATE ]
```

Snowflake does not support these states; therefore, they will be removed from the `NOT NULL` inline constraint.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE Table1(
  col1 INT NOT NULL RELY
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE Table1 (
    col1 INT NOT NULL /*** SSC-FDM-OR0006 - CONSTRAINT STATE RELY REMOVED FROM NOT NULL INLINE CONSTRAINT ***/
  )
  COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
  ;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0007

Snowflake does not support the versioning of objects. Developers should consider alternate approaches for code versioning.

### Description

Snowflake doesn’t support the versioning of objects. The modifier EDITIONABLE or NONEDITIONABLE is removed in the converted code and a warning is added.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE EDITIONABLE PROCEDURE FUN1 (n number)is
l_result number;
begin
    DELETE FROM employees;
end;
```

##### Generated Code:

```sql
 --** SSC-FDM-OR0007 - SNOWFLAKE DOESN'T SUPPORT VERSIONING OF OBJECTS. DEVELOPERS SHOULD CONSIDER ALTERNATE APPROACHES FOR CODE VERSIONING. **
CREATE OR REPLACE PROCEDURE FUN1 (n NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        l_result NUMBER(38, 18);
    BEGIN
        DELETE FROM
            employees;
    END;
$$;
```

#### Best Practices

* The user should consider alternate approaches for code versioning.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0008

Set Quantifier Not Supported

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0071](../conversion-issues/oracleEWI.md) documentation

### Description

Quantifier ‘all’ is not supported in Snowflake. The modifier is removed from the source code, and a warning is added; the resulting code may behave unexpectedly.

#### Example Code

##### Input Code:

```sql
 SELECT location_id  FROM locations
MINUS ALL
SELECT location_id  FROM departments;
```

##### Generated Code:

```sql
 SELECT location_id  FROM
locations
--** SSC-FDM-OR0008 - QUANTIFIER 'ALL' NOT SUPPORTED FOR THIS SET OPERATOR, RESULTS MAY DIFFER **
MINUS
SELECT location_id  FROM
departments;
```

In Snowflake, the `INTERSECT` and `MINUS/EXCEPT` operators will always remove duplicate values.

#### Best Practices

* Check alternatives in Snowflake to emulate the functionality of the “all” quantifier.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0009

SQL implicit cursor values may differ.

### Description

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag `-t JavaScript` or `--PLTargetLanguage JavaScript`

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

This EWI is shown when SQL implicit cursor value is used. This is because Oracle uses different values depending on the type of query. For example, for `SELECT` the value used to set SQL implicit cursor values are the number of rows returned by the query. When the query type is `UPDATE/CREATE/DELETE/INSERT` the value used is the number of rows affected, this is the main reason why this EWI is displayed.

#### Example Code

##### Input Code:

```
-- Additional Params: -t JavaScript
--Transformation for implicit cursor
CREATE OR REPLACE PROCEDURE SP_SAMPLE AUTHID DEFINER IS
  stmt_no  POSITIVE;
BEGIN
  IF SQL%ROWCOUNT = 0 THEN
   EXIT ;
  END IF;
  IF SQL%ISOPEN THEN
   EXIT ;
  END IF;
  IF SQL%FOUND THEN
   EXIT ;
  END IF;
  IF SQL%NOTFOUND THEN
   EXIT ;
  END IF;
END;
```

##### Generated Code:

```sql
 -- Additional Params: -t JavaScript
--Transformation for implicit cursor
CREATE OR REPLACE PROCEDURE SP_SAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PlInvokerRightsClause' NODE ***/!!!
  //AUTHID DEFINER
  null
  // SnowConvert AI Helpers Code section is omitted.

  let STMT_NO = new POSITIVE();
  if (SQL.ROWCOUNT /*** SSC-FDM-OR0009 - SQL IMPLICIT CURSOR VALUES MAY DIFFER ***/ == 0) {
    break;
  }
  if (SQL.ISOPEN) {
    break;
  }
  if (SQL.FOUND) {
    break;
  }
  if (SQL.NOTFOUND) {
    break;
  }
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0010

NUMBER datatype smaller precision was increased to match scale.

### Description

The `NUMBER` data type stores fixed and floating-point numbers. This data is portable among different operating systems running the Oracle Database. The `NUMBER` data type is recommended for most cases in which you must store numeric data. The syntax is the following `NUMBER (X, Y)`, where ***X*** is the precision and ***Y*** is the scale.

For example, `NUMBER(5, 3)` is a number that has ***2*** digits before the decimal and ***3*** digits after the decimal, just like the following:

```none
12.345
```

Another important considerations:

1. Scale ***Y*** specifies the maximum number of digits to the right of the decimal point.
2. Scale-Precision ***Y-X*** specifies the minimum number of zeros present after the decimal point.

This message is shown when a `NUMBER` has a smaller precision than its scale. Snowflake does not support this feature, and this message is used to indicate that the precision’s value was increased to maintain equivalence.

> **Note:**
>
> Please consider that there are cases where this issue can either stack alongside other known transformations or not happen at all. For example, cases where the scale is replaced by nineteen and the former precision is greater than nineteen; will NOT show this message.

#### Example Code

##### Input Code:

##### Queries

```sql
 CREATE TABLE SampleNumberTable(Col1 NUMBER(4, 5));

INSERT INTO SampleNumberTable (Col1)
VALUES (0.00009);

INSERT INTO SampleNumberTable (Col1)
VALUES (0.000021);

INSERT INTO SampleNumberTable (Col1)
VALUES (0.012678912);

SELECT * FROM SampleNumberTable;
```

##### Result

```none
Col1   |
-------+
0.00009|
0.00002|
0.01268|
```

##### Generated Code:

##### Queries

```sql
 CREATE OR REPLACE TABLE SampleNumberTable (Col1 NUMBER(5, 5) /*** SSC-FDM-OR0010 - NUMBER DATATYPE SMALLER PRECISION WAS INCREASED TO MATCH SCALE ***/ /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO SampleNumberTable(Col1)
VALUES (0.00009);

INSERT INTO SampleNumberTable(Col1)
VALUES (0.000021);

INSERT INTO SampleNumberTable(Col1)
VALUES (0.012678912);

SELECT * FROM
SampleNumberTable;
```

##### Result

```none
Col1   |
-------+
0.00009|
0.00002|
0.01268|
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0011

The boolean argument was removed because the “add to stack” options is not supported.

### Description

This warning is displayed when the third optional argument of *RAISE_APPLICATION_ERROR* was removed during the migration. This functionality is not supported by Snowflake.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE FUNCTION TEST(SAMPLE_A      IN NUMBER DEFAULT NULL,
                               SAMPLE_B       IN NUMBER DEFAULT NULL)
  RETURN NUMBER
 AS
BEGIN
    raise_application_error(-20001, 'First exception message', FALSE);
  RETURN 1;
END TEST;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE TEST (SAMPLE_A NUMBER(38, 18) DEFAULT NULL,
                               SAMPLE_B NUMBER(38, 18) DEFAULT NULL)
RETURNS NUMBER(38, 18)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  FIRST_EXCEPTION_MESSAGE_EXCEPTION_CODE_0 EXCEPTION (-20001, 'FIRST EXCEPTION MESSAGE');
 BEGIN
  --** SSC-FDM-OR0011 - ADD TO STACK OF ERRORS IS NOT SUPPORTED, BOOLEAN ARGUMENT FALSE WAS REMOVED. **
  RAISE FIRST_EXCEPTION_MESSAGE_EXCEPTION_CODE_0;
  RETURN 1;
 END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0012

COMMIT and ROLLBACK statements require adequate setup to perform as intended.

### Description

COMMIT and ROLLBACK statements require adequate setup to perform as intended in Snowflake. The following instruction needs to be executed in Snowflake to simulate the correct functionality of these statements:

```sql
 ALTER SESSION SET AUTOCOMMIT = false;
```

#### Example Code

##### Input Code

```sql
 COMMIT;
ROLLBACK;
```

##### Generated Code

```sql
 --** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
COMMIT;

--** SSC-FDM-OR0012 - ROLLBACK REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
ROLLBACK;
```

#### Best Practices

* Execute the query mentioned in the description section before you start to execute your code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-FDM-OR0013

The cycle clause is not supported in Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0039](../conversion-issues/oracleEWI.md) documentation.

### Description

This message is shown when SnowConvert AI finds a query with a CYCLE clause. Which is not supported in Snowflake, so it is commented out from the code.

This clause marks when there is a recursion.

For more details see the [documentation](https://docs.oracle.com/en/database/oracle/oracle-database/23/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__GUID-8EE64250-3C9A-40C7-A81D-46695F8B2EB9) about the clause functionality.

#### Example Code

#### Connect By

##### Input Code:

```sql
 CREATE OR REPLACE FORCE NONEDITIONABLE VIEW VIEW01 AS
SELECT
      UNIQUE A.*
FROM
      TABLITA A
WHERE
      A.X = A.C CONNECT BY NOCYCLE A.C = 0 START WITH A.B = 1
HAVING
      X = 1
GROUP BY
      A.C;
```

##### Generated Code:

```sql
 CREATE OR REPLACE VIEW VIEW01
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
AS
SELECT DISTINCT
      A.*
FROM
      TABLITA A
WHERE
      A.X = A.C
GROUP BY
      A.C
HAVING
      X = 1
--** SSC-FDM-OR0013 - CYCLE CLAUSE IS NOT SUPPORTED IN SNOWFLAKE **
CONNECT BY
      A.C = 0 START WITH A.B = 1;
```

#### Best Practices

* If there are cycles in the data hierarchy, you can review this [article](https://docs.snowflake.com/en/user-guide/queries-cte#cause-1-cyclic-data-hierarchy) to deal with them.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0014

Foreign key data type mismatch.

### Description

This error happens when there is a mismatch in a foreign key data type.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE "MyDb"."MyTable"
(
    "COL1" NUMBER,
    CONSTRAINT "PK" PRIMARY KEY ("COL1")
);

CREATE TABLE "MyDb"."MyTable1"
(
    "COL1" NUMBER(*,0),
    CONSTRAINT "FK1" FOREIGN KEY ("COL1") REFERENCES "MyDb"."MyTable" ("COL1")
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE "MyDb"."MyTable"
    (
        "COL1" NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
        CONSTRAINT "PK" PRIMARY KEY ("COL1")
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    CREATE OR REPLACE TABLE "MyDb"."MyTable1"
    (
        "COL1" NUMBER(38) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    ALTER TABLE "MyDb"."MyTable1"
    ADD
    --** SSC-FDM-OR0014 - FOREIGN KEY DATA TYPE MISMATCH **
    CONSTRAINT "FK1" FOREIGN KEY ("COL1") REFERENCES "MyDb"."MyTable" ("COL1");
```

> **Note:**
>
> Note that “MyDb”.”MyTable1”.COL1 and “MyDb”.”MyTable”.COL1 are of different types and the ERROR is displayed.

#### Best Practices

* If there are cycles in the data hierarchy, you can review this [article](https://docs.snowflake.com/en/user-guide/queries-cte#cause-1-cyclic-data-hierarchy) to deal with them.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0015

LENGTHB transformed to OCTET_LENGTH results may vary due to memory management of DBMS.

### Description

This issue happens when there is an invocation to [LENGTHB](https://docs.oracle.com/en/database/oracle/oracle-database/21/sqlrf/LENGTH.html#GUID-8F97F652-5AE8-4457-AFD7-7A6F25551E0C) function that returns the size of a column or literal in bytes. This function is transformed into [OCTET_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/octet_length.html) Snowflake’s function.

When the parameter to the function is a column, the result will be the size of the value that the column has, this size may vary from Oracle to Snowflake, the type of the column plays an important role in the result returned by the function.

#### Example Code

##### Input Code:

##### Queries

```sql
 CREATE TABLE char_table
(
	char_column1 CHAR(15)
);

INSERT INTO char_table VALUES ('Hello world');

SELECT char_column1, LENGTHB(char_column1), LENGTH('Hello world') FROM char_table;
```

##### Result

```none
|CHAR_COLUMN1   |LENGTHB(CHAR_COLUMN1)|LENGTH('HELLOWORLD')|
|---------------|---------------------|--------------------|
|Hello world    |15                   |11                  |
```

##### Generated Code:

##### Queries

```
CREATE OR REPLACE TABLE char_table
(
	char_column1 CHAR(15)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO char_table
VALUES ('Hello world');

SELECT char_column1,
OCTET_LENGTH(char_column1) /*** SSC-FDM-OR0015 - LENGTHB TRANSFORMED TO OCTET_LENGTH RESULTS MAY VARY DUE TO MEMORY MANAGEMENT OF DBMS ***/, LENGTH('Hello world') FROM
char_table;
```

##### Result

```none
|CHAR_COLUMN1|OCTET_LENGTH(CHAR_COLUMN1)|LENGTH('HELLO WORLD')|
|------------|--------------------------|---------------------|
|Hello world |11                        |11                   |
```

#### Best Practices

* Manually check the data types used.
* Check the encoding of the columns used because OCTET_LENGTH can return bigger sizes when the string contains Unicode code points.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0016

COMMIT and ROLLBACK options were removed because Snowflake does not require them

### Description

COMMIT and ROLLBACK statement options are being removed because Snowflake does not require them.

#### Example Code

##### Input Code

```sql
 COMMIT WORK FORCE '22.57.53';
ROLLBACK WORK FORCE '22.57.53';
```

##### Generated Code

```sql
 --** SSC-FDM-OR0016 - COMMIT OPTIONS REMOVED BECAUSE SNOWFLAKE DOES NOT REQUIRE THEM **
--** SSC-FDM-OR0012 - COMMIT REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
COMMIT WORK;

--** SSC-FDM-OR0016 - ROLLBACK OPTIONS REMOVED BECAUSE SNOWFLAKE DOES NOT REQUIRE THEM **
--** SSC-FDM-OR0012 - ROLLBACK REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED **
ROLLBACK WORK;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0017

DBTimezone was removed to use the default value of the Timestamp.

### Description

DBTIMEZONE keyword was removed from the AT TIME ZONE expression.

#### Example Code

##### Input Code:

```sql
 SELECT TIMESTAMP '1998-12-25 09:26:50.12' AT TIME ZONE DBTIMEZONE FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0017 - DBTIMEZONE WAS REMOVED TO USE THE DEFAULT VALUE OF THE TIMESTAMP **
TO_TIMESTAMP_LTZ( TIMESTAMP '1998-12-25 09:26:50.12')
FROM DUAL;
```

#### Best Practices

* You may need to set the TIMEZONE session parameter to get equal results.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0018

Merge statement may not work as expected

### Description

This warning is used to indicate that the Snowflake merge statement may have some functional differences compared to Oracle.

#### Example Code

##### Input Code:

```sql
 MERGE INTO people_target pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
WHEN MATCHED THEN UPDATE
  SET pt.first_name = ps.first_name,
      pt.last_name = ps.last_name,
      pt.title = ps.title
  DELETE where pt.title  = 'Mrs.'
WHEN NOT MATCHED THEN INSERT
  (pt.person_id, pt.first_name, pt.last_name, pt.title)
  VALUES (ps.person_id, ps.first_name, ps.last_name, ps.title)
  WHERE ps.title = 'Mr';
```

##### Generated Code:

```sql
 --** SSC-FDM-OR0018 - SNOWFLAKE MERGE STATEMENT MAY HAVE SOME FUNCTIONAL DIFFERENCES COMPARED TO ORACLE **
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "people_target", "people_source" **
MERGE INTO people_target pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
      WHEN MATCHED AND pt.title  = 'Mrs.' THEN
        DELETE
      WHEN MATCHED THEN
        UPDATE SET
          pt.first_name = ps.first_name,
               pt.last_name = ps.last_name,
               pt.title = ps.title
      WHEN NOT MATCHED AND ps.title = 'Mr' THEN
        INSERT
        (pt.person_id, pt.first_name, pt.last_name, pt.title)
        VALUES (ps.person_id, ps.first_name, ps.last_name, ps.title);
```

#### Best Practices

* If you are getting different results compared to Oracle, consider the following:

  + For execution order prioritization, go to the next [link](https://docs.snowflake.com/en/sql-reference/sql/merge.html#usage-notes) to get more information.

    - Execute the skipped DML statements outside (before or after accordingly) the merge statement.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0019

Window frame output may not be equivalent

### Description

This warning is added when a ROWS window frame unit is found within the source code.

ROWS works by using physical row numbers for its computing, which may differ once it is migrated to the target platform. Manually adding extra ORDER BY clauses can help mitigate or remove this issue.

> **Note:**
>
> Note that as the [Oracle documentation](https://docs.oracle.com/en/database/oracle/oracle-database/23/sqlrf/Analytic-Functions.html) states:
> “The value returned by an analytic function with a logical offset is always deterministic. However, the value returned by an analytic function with a physical offset may produce nondeterministic results unless the ordering expression results in a unique ordering. You may have to specify multiple columns in the `order_by_clause` to achieve this unique ordering.”
>
> According to this is recommended to check if the function returned deterministic results beforehand to avoid any issues.

#### Example Code

##### Input Code:

```sql
 SELECT
SUM(C_BIRTH_DAY)
OVER (
    ORDER BY C_BIRTH_COUNTRY
    ROWS UNBOUNDED PRECEDING) AS MAX1
FROM WINDOW_TABLE;
```

##### Generated Code:

```sql
 SELECT
SUM(C_BIRTH_DAY)
OVER (
    ORDER BY C_BIRTH_COUNTRY ROWS UNBOUNDED PRECEDING /*** SSC-FDM-OR0019 - WINDOW FRAME OUTPUT MAY NOT BE EQUIVALENT ***/) AS MAX1
FROM
WINDOW_TABLE;
```

#### Best Practices

* Ensure deterministic ordering for rows to ensure deterministic outputs when running in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0020

PRAGMA EXCEPTION_INIT is not supported.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0051](../conversion-issues/oracleEWI.md) documentation.

### Description

This warning is added when PRAGMA EXCEPTION_INIT function is invoked within a procedure. Exception Name and SQL Code of the exceptions are set in the RAISE function. When it is converted to Snowflake Scripting, the SQL Code is added to the Exception declaration, however, some code values may be invalid in Snowflake Scripting.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE AUTHID DEFINER IS
  NEW_EXCEPTION EXCEPTION;
  PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63);
  NEW_EXCEPTION2 EXCEPTION;
  PRAGMA EXCEPTION_INIT ( NEW_EXCEPTION2, -20100 );
BEGIN

  IF true THEN
    RAISE NEW_EXCEPTION;
  END IF;

EXCEPTION
    WHEN NEW_EXCEPTION THEN
        --Handle Exceptions
        NULL;
END;
/
```

##### Generated Code:

##### Snowflake Scription

```sql
 CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0097 - PROCEDURE PROPERTIES ARE NOT SUPPORTED IN SNOWFLAKE PROCEDURES ***/!!!
AS
$$
  DECLARE
    --** SSC-FDM-OR0023 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS **
    NEW_EXCEPTION EXCEPTION;
    --** SSC-FDM-OR0020 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED **
    PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63);
    NEW_EXCEPTION2 EXCEPTION (-20100, '');
    --** SSC-FDM-OR0020 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED **
  PRAGMA EXCEPTION_INIT ( NEW_EXCEPTION2, -20100 );
  BEGIN
    IF (true) THEN
      RAISE NEW_EXCEPTION;
    END IF;
    EXCEPTION
        WHEN NEW_EXCEPTION THEN
            --Handle Exceptions
            NULL;
    END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0021

For Loop With Float Number As Bound May Not Behave Correctly In Snowflake Scripting

### Description

Snowflake Scripting only allows an `INTEGER` or an expression that evaluates to an `INTEGER` as a bound for the `FOR LOOP` condition. Floating numbers will be rounded up or down and alter the original bound.

The lower bound will be rounded to the closest integer number. For example:

**3.1 -> 3**, **6.7 -> 7**, **4.5 -> 5**

However the upper bound will be truncated to the closest lower integer. For example:

**3.1 -> 3**, **6.7 -> 6**, **4.5 -> 4**

#### Snowflake Scripting

```sql
 CREATE OR REPLACE PROCEDURE p1()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    DECLARE
        var1 VARCHAR DEFAULT '';
        var2 VARCHAR DEFAULT '';
        var3 VARCHAR DEFAULT '';
    BEGIN
        --Loop 1
        FOR i IN 1.2 TO 5.2 DO
            var1 := var1 || ' ' || i::VARCHAR;
        END FOR;

        --Loop 2
        FOR i IN 1.7 TO 5.5 DO
            var2 := var2 || ' ' || i::VARCHAR;
        END FOR;

        --Loop 3
        FOR i IN 1.5 TO 5.8 DO
            var3 := var3 || ' ' || i::VARCHAR;
        END FOR;
        RETURN  ' Loop1: ' || var1 ||
                ' Loop2: ' || var2 ||
                ' Loop3: ' || var3;
    END;
$$;

CALL p1();
```

##### Result

```none
P1                                                |
--------------------------------------------------+
 Loop1:  1 2 3 4 5                                |
 Loop2:  2 3 4 5                                  |
 Loop3:  2 3 4 5                                  |
```

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE p1
AS
BEGIN
FOR i NUMBER(5,1) IN 1.2 .. 5.7 LOOP
    NULL;
END LOOP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE p1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-FDM-OR0021 - FOR LOOP WITH FLOAT NUMBER AS LOWER OR UPPER BOUND MAY NOT BEHAVE CORRECTLY IN SNOWFLAKE SCRIPTING **
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        FOR i IN 1.2 TO 5.7
                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                            LOOP
                                   NULL;
END LOOP;
    END;
$$;
```

#### Best Practices

* Rewrite the FOR LOOP condition so it uses integers.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0022

For Loop With Multiple Conditions Is Currently Not Supported By Snowflake Scripting. Only First Condition Is Used

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0100](../conversion-issues/oracleEWI.md) documentation.

### Description

Oracle allows multiple conditions in a single `FOR LOOP` however, Snowflake Scripting only allows one condition per `FOR LOOP`. Only the first condition is migrated and the others are ignored during transformation.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE P3
AS
BEGIN
FOR i IN REVERSE 1..3,
REVERSE i+5..i+7
LOOP
    NULL;
END LOOP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE P3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-FDM-OR0022 - FOR LOOP WITH MULTIPLE CONDITIONS IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING **
        FOR i IN REVERSE 1 TO 3 LOOP
            NULL;
        END LOOP;
    END;
$$;
```

#### Best Practices

* Separate the `FOR LOOP` into different loops or rewrite the condition.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0023

The exception code exceeds the Snowflake Scripting limit

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0099](../conversion-issues/oracleEWI.md) documentation.

### Description

This warning appears when an exception declaration error code exceeds the Snowflake Scripting exception number limits. The number must be an integer between -20000 and -20999.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE procedure_exception
IS
my_exception EXCEPTION;
PRAGMA EXCEPTION_INIT ( my_exception, -19000 );
BEGIN
    NULL;
END;
```

##### Generated Code:

```
CREATE OR REPLACE PROCEDURE procedure_exception ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        --** SSC-FDM-OR0023 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS **
        my_exception EXCEPTION;
        --** SSC-FDM-OR0020 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED **
        PRAGMA EXCEPTION_INIT ( my_exception, -19000 );
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* Check if the exception code is between the limits allowed by Snowflake Scripting, if not change it for another exception number available.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0024

Columns from expression not found

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0002](../conversion-issues/oracleEWI.md) documentation.

### Description

This error happens when the columns of a Select Expression were unable to be resolved, usually when it either refers to a Type Access whose reference wasn’t resolved or a column with a User Defined Type whose columns haven’t been defined; such as a Type Without Body or Object Type with no columns.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE record_unknown_table_proc
AS
    unknownTable_variable_rowtype unknownTable%ROWTYPE;
BEGIN
    INSERT INTO MyTable values unknownTable_variable_rowtype;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE record_unknown_table_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        unknownTable_variable_rowtype OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    BEGIN
        INSERT INTO MyTable
        SELECT
            null /*** SSC-FDM-OR0024 - COLUMNS FROM EXPRESSION unknownTable%ROWTYPE NOT FOUND ***/;
    END;
$$;
```

#### Related EWIs

1. [SSC-EWI-0036](../conversion-issues/generalEWI.md): Data type converted to another data type.

#### Best Practices

* Verify that the type definition that was referenced does have columns within it.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0025

Not Null constraint is not supported in Snowflake Procedures

### Description

The Oracle variable declaration `NOT NULL` constraint is not supported in variable declarations inside procedures in Snowflake.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC04
IS
 var3 FLOAT NOT NULL := 100;
BEGIN
NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC04 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  var3 FLOAT := 100 /*** SSC-FDM-OR0025 - NOT NULL CONSTRAINT IS NOT SUPPORTED BY SNOWFLAKE ***/;
 BEGIN
  NULL;
 END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0026

Type not supported in cast operation.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0045](../conversion-issues/oracleEWI.md) documentation.

### Description

This error happens when a type is not supported in a cast operation.

#### Example

##### Input Code:

```sql
 select cast(' $123.45' as number, 'L999.99') from dual;
```

##### Generated Code:

```sql
 select
--** SSC-FDM-OR0026 - CAST TYPE NOT SUPPORTED **
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0011 - THE FORMAT PARAMETER ' $123.45' IS NOT SUPPORTED ***/!!!
 cast(' $123.45' as NUMBER(38, 18) , 'L999.99') from dual;
```

### Related EWIs

1. SSC-EWI-OR0011: The format parameter is not supported.

#### Best Practices

* The cast is converted to a user-defined function (UDF/Stub), so you can modify it to emulate the behavior of the cast function.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0027

DEFAULT ON CONVERSION ERROR is not supported.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0029](../conversion-issues/oracleEWI.md) documentation

### Description

Default on conversion error not supported in Snowflake

#### Example Code

##### Input Code:

```sql
 SELECT TO_NUMBER('2,00' DEFAULT 0 ON CONVERSION ERROR) "Value" FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0027 - DEFAULT ON CONVERSION ERROR NOT SUPPORTED IN SNOWFLAKE IN SNOWFLAKE **
TO_NUMBER('2,00') "Value" FROM DUAL;
```

#### Best Practices

* You might create UDF to emulate the behavior of `DEFAULT` value `ON CONVERSION ERROR`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0028

SYS_CONTEXT parameter is not supported.

> **Note:**
>
> This FDM is deprecated, please refer to SSC-EWI-OR0031 documentation.

### Description

This error happens when a SYS_CONTEXT function parameter is not supported.

#### Example Code

##### Input Code:

```sql
 SELECT SYS_CONTEXT ('USERENV', 'NLS_SORT') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0028 - 'NLS_SORT' SYS_CONTEXT PARAMETER NOT SUPPORTED IN SNOWFLAKE **
SYS_CONTEXT ('USERENV', 'NLS_SORT') FROM DUAL;
```

#### Best Practices

* The function is converted to a user defined function(stub), so you can modify it to emulate the behavior of the SYS_CONTEXT parameter.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0029

This ALTER SESSION configuration is not supported in Snowflake.

### Description

A clause or configuration of the ALTER SESSION statement is not currently supported.

#### Example Code

##### Input Code:

```sql
 ALTER SESSION SET SQL_TRACE TRUE;
```

##### Generated Code:

```sql
 ----** SSC-FDM-OR0029 - THIS ALTER SESSION CONFIGURATION IS NOT SUPPORTED IN SNOWFLAKE **
--ALTER SESSION SET SQL_TRACE TRUE
                                ;
```

#### Best Practices

* For session variables, you can check the Snowflake [documentation](https://docs.snowflake.com/en/sql-reference/parameters.html) to find an equivalent.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0030

ROWID pseudocolumn is not supported in Snowflake

### Description

When ROWID is used as a pseudocolumn in a query it is transformed to null to avoid runtime errors and the EWI is added. There is still no transformation to emulate the functionality.

#### Example Code

##### Input Code Oracle:

```sql
 SELECT ROWID FROM T1;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0030 - ROWID PSEUDOCOLUMN IS NOT SUPPORTED IN SNOWFLAKE, IT WAS CONVERTED TO NULL TO AVOID RUNTIME ERRORS **
'' AS ROWID
FROM
T1;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0031

The error logging clause in DML statements is not supported by Snowflake

### Description

This error is used to advise that the error_logging clause in Oracle’s DML statements is not supported by Snowflake’s DML statements.

#### Example Code

##### Input Code:

```sql
 MERGE INTO people_target pt
USING people_source ps ON (pt.person_id = ps.person_id)
WHEN MATCHED THEN UPDATE
  SET pt.first_name = ps.first_name,
      pt.last_name = ps.last_name,
      pt.title = ps.title
LOG ERRORS;
```

##### Generated Code:

```sql
 MERGE INTO people_target pt
USING people_source ps ON (pt.person_id = ps.person_id)
  WHEN MATCHED THEN
    UPDATE
    SET pt.first_name = ps.first_name,
        pt.last_name = ps.last_name,
        pt.title = ps.title
--  --** SSC-FDM-OR0031 - THE ERROR LOGGING CLAUSE IN DML STATEMENTS IS NOT SUPPORTED BY SNOWFLAKE **
--LOG ERRORS
          ;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0032

StandardHash function with input non-string parameter generates a different result in Snowflake.

### Description

This warning is used when `STANDARD_HASH` function in Oracle with input non-string parameter generates a different result in Snowflake.

> **Note:**
>
> When the algorithm parameter is a dynamic expression (not a string literal), the function cannot be converted and [SSC-EWI-OR0138](../conversion-issues/oracleEWI.md) is emitted instead.

#### Example Code

##### Input Code:

##### Query

```sql
 SELECT STANDARD_HASH(1+1) FROM DUAL;
```

##### Result

```none
 STANDARD_HASH(1+1)                               |
--------------------------------------------------+
 E39323970701D93598FC1D357F4BF04578CE3242         |
```

##### Generated Code:

##### Query

```
SELECT
--** SSC-FDM-OR0032 - STANDARD HASH FUNCTION WITH INPUT NON-STRING PARAMETER GENERATES A DIFFERENT RESULT IN SNOWFLAKE **
SHA1(1+1)
FROM DUAL;
```

##### Result

```
 SHA1(1+1)                                        |
--------------------------------------------------+
 da4b9237bacccdf19c0760cab7aec4a8359010b0         |
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0033

DBMS_RANDOM.VALUE Built-In Package precision is lower in Snowflake

Description

This message is shown when SnowConvert AI migrates a DBMS_RANDOM.VALUE Oracle built-in package function*.* This warning indicates that the UDF added to emulate the functionality has lower precision than the original function*.*

### Example code

#### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE built_in_package_proc
IS
var1 NUMBER;
BEGIN
    SELECT DBMS_RANDOM.VALUE() INTO var1 FROM DUAL;

    SELECT DBMS_RANDOM.VALUE(2,10) INTO var1 FROM DUAL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE built_in_package_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        var1 NUMBER(38, 18);
    BEGIN
        SELECT
            --** SSC-FDM-OR0033 - DBMS_RANDOM.VALUE DIGITS OF PRECISION ARE LOWER IN SNOWFLAKE **
            DBMS_RANDOM.VALUE_UDF() INTO
            :var1
        FROM DUAL;

        SELECT
            --** SSC-FDM-OR0033 - DBMS_RANDOM.VALUE DIGITS OF PRECISION ARE LOWER IN SNOWFLAKE **
            DBMS_RANDOM.VALUE_UDF(2,10) INTO
            :var1
        FROM DUAL;
    END;
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0034

Sequence start value with ‘LIMIT VALUE’ is not supported by Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0001](../conversion-issues/oracleEWI.md) documentation.

### Description

This error appears when the `START WITH` statement value is `LIMIT VALUE`.

In Oracle this clause is used only in ALTER TABLE

> * `START` `WITH` `LIMIT VALUE`, which is specific to `identity_options`, can only be used with `ALTER` `TABLE` `MODIFY`. If you specify `START` `WITH` `LIMIT VALUE`, then Oracle Database locks the table and finds the maximum identity column value in the table (for increasing sequences) or the minimum identity column value (for decreasing sequences) and assigns the value as the sequence generator’s high water mark. The next value returned by the sequence generator will be the high water mark + `INCREMENT` `BY` `integer` for increasing sequences, or the high water mark - `INCREMENT` `BY` `integer` for decreasing sequences.

#### [ALTER TABLE ORACLE](https://docs.oracle.com/en/database/oracle/oracle-database/23/sqlrf/ALTER-TABLE.html#GUID-552E7373-BF93-477D-9DA3-B2C9386F2877)

#### Example Code

##### Input Code:

```sql
 CREATE SEQUENCE SEQUENCE1
  START WITH LIMIT VALUE;
```

##### Generated Code:

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE1
  --** SSC-FDM-OR0034 - SEQUENCE START VALUE WITH 'LIMIT VALUE' IS NOT SUPPORTED BY SNOWFLAKE. **
  START WITH LIMIT VALUE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0035

DBMS_OUTPUT.PUTLINE check UDF implementation

### Description

This message is shown when SnowConvert AI migrates a `DBMS_OUTPUT.PUT_LINE` Oracle built-in package function*.* This warning tells you to check the added UDF*.*

This EWI exists to tell the user to review the `DBMS_OUTPUT.PUT_LINE_UDF` implementation where the following information will be found:

> **Warning:**
>
> Performance may be affected by using this UDF. If you want to start logging information, please uncomment the implementation. Note that this is using a temporary table, if you want the data to persist after a session ends, please remove TEMPORARY from the CREATE TABLE.
>
> Once the calls of `DBMS_OUTPUT.PUT_LINE_UDF` has been done, please use the following query to read all the logs: `SELECT * FROM DBMS_OUTPUT.DBMS_OUTPUT_LOG.`

#### Example code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE builtin_package_call
IS
BEGIN
	DBMS_OUTPUT.PUT_LINE(1);
	DBMS_OUTPUT.PUT_LINE("Test");
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE builtin_package_call ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
		CALL DBMS_OUTPUT.PUT_LINE_UDF(1);
		--** SSC-FDM-OR0035 - CHECK UDF IMPLEMENTATION FOR DBMS_OUTPUT.PUT_LINE_UDF. **
		CALL DBMS_OUTPUT.PUT_LINE_UDF("Test");
	END;
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0036

Unnecessary built-in packages parameters

### Description

This message is displayed when SnowConvert AI migrates an Oracle built-in package procedure or function, and some of the arguments are removed from the call.

Some of the original parameters may not have an equivalent in Snowflake or may not be needed in the transformed version, those parameters are removed from the produced code but are preserved in the EWI message so the user can still track them.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE built_in_package_proc
IS
w_file UTL_FILE.FILE_TYPE;
BEGIN
    w_file:= UTL_FILE.FOPEN('MY_DIR','test.txt','W',32760);
    UTL_FILE.PUT_LINE(w_file,'New line');
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE built_in_package_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        w_file OBJECT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'UTL_FILE.FILE_TYPE' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/ := OBJECT_CONSTRUCT();
    BEGIN
        --** SSC-FDM-OR0036 - PARAMETERS: 'LOCATION, MAX_LINESIZE_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
        CALL UTL_FILE.FOPEN_UDF('test.txt', 'W');
        SELECT
            *
        INTO
            w_file
        FROM
            TABLE(RESULT_SCAN(LAST_QUERY_ID()));
        --** SSC-FDM-OR0036 - PARAMETERS: 'AUTOFLUSH_UDF' UNNECESSARY IN THE IMPLEMENTATION. **
        CALL UTL_FILE.PUT_LINE_UDF(:w_file, 'New line');
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0037

The used syntax in select is not supported in Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0004](../conversion-issues/oracleEWI.md) documentation

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This warning happens when a clause in a select is not supported in Snowflake. The not supported clauses are:

* CONTAINERS
* SUBQUERY RESTRICTION
* HIERARCHIES
* EXTERNAL MODIFY
* DBLINK
* SHARDS
* PARTITION
* SUBPARTITION
* HIERARCHICAL

#### Example Code

##### Input Code:

```sql
 SELECT * FROM TABLE1 EXTERNAL MODIFY (LOCATION 'file.csv' REJECT LIMIT UNLIMITED);
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
SELECT * FROM
TABLE1
--       --** SSC-FDM-OR0037 - THE 'OPTIONAL MODIFIED EXTERNAL' SYNTAX IN SELECT IS NOT SUPPORTED IN SNOWFLAKE **
--       EXTERNAL MODIFY (LOCATION 'file.csv' REJECT LIMIT UNLIMITED)
                                                                   ;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0038

Boolean cursor attribute is not supported.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0128](../conversion-issues/oracleEWI.md) documentation.

### Description

This message is used to indicate that a boolean cursor attribute is not supported in SnowScript or that there is no transformation that emulates its functionality in SnowScript. The following table shows the boolean cursor attributes that can be emulated:

| Boolean Cursor Attribute | Status |
| --- | --- |
| `%FOUND` | Can be emulated |
| `%NOTFOUND` | Can be emulated |
| `%ISOPEN` | Not Supported |

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE cursor_attributes_proc
IS
    is_open_attr BOOLEAN;
    found_attr BOOLEAN;
    my_record table1%ROWTYPE;
    CURSOR my_cursor IS SELECT * FROM table1;
BEGIN
    OPEN my_cursor;
    LOOP
        FETCH my_cursor INTO my_record;
        EXIT WHEN my_cursor%NOTFOUND;
        is_open_attr := my_cursor%ISOPEN;
        found_attr := my_cursor%FOUND;
    END LOOP;
    CLOSE my_cursor;
END;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table1" **
CREATE OR REPLACE PROCEDURE cursor_attributes_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        is_open_attr BOOLEAN;
        found_attr BOOLEAN;
        my_record OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
        my_cursor CURSOR
        FOR
            SELECT
                OBJECT_CONSTRUCT( *) sc_cursor_record FROM
                table1;
    BEGIN
        OPEN my_cursor;
        LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            FETCH my_cursor INTO
                :my_record;
            IF (my_record IS NULL) THEN
                EXIT;
            END IF;
            is_open_attr := null /*my_cursor%ISOPEN*/ /*** SSC-FDM-OR0038 - BOOLEAN CURSOR ATTRIBUTE %ISOPEN IS NOT SUPPORTED IN SNOWFLAKE ***/;
            found_attr := my_record IS NOT NULL;
        END LOOP;
        CLOSE my_cursor;
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0039

Create Type Not Supported in Snowflake

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0007](../conversion-issues/oracleEWI.md) documentation

### Description

This message is added when a Create Type statement not supported by Snowflake is used.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE TYPE type6 UNDER type5(COL1 INTEGER);
```

##### Generated Code:

```sql
 ----** SSC-FDM-OR0039 - CREATE TYPE SUBTYPE IS NOT SUPPORTED IN SNOWFLAKE **
--CREATE TYPE type6 UNDER type5(COL1 INTEGER)
                                           ;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0040

Numeric characters cannot be altered in Snowflake. The decimal separator in Snowflake is the dot character.

### Description

Numeric characters cannot be altered in Snowflake. The decimal separator in Snowflake is the dot character. The ALTER session statement is commented and a warning is added.

#### Example Code

##### Oracle:

```sql
 ALTER SESSION SET NLS_NUMERIC_CHARACTERS = ',.';
```

##### Snowflake Scripting:

```sql
 ----** SSC-FDM-OR0040 - NUMERIC CHARACTERS CANNOT BE ALTERED IN SNOWFLAKE. THE DECIMAL SEPARATOR IN SNOWFLAKE IS THE DOT CHARACTER. **
--ALTER SESSION SET NLS_NUMERIC_CHARACTERS = ',.'
                                               ;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0041

Built In Package Not Supported.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0076](../conversion-issues/oracleEWI.md) documentation

### Description

Translation for built-in packages is not currently supported.

#### Example Code

##### Input Code (Oracle):

```sql
 SELECT
UTL_RAW.CAST_TO_RAW('some magic here'),
DBMS_UTILITY.GET_TIME
FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0041 - TRANSLATION FOR BUILT-IN PACKAGE 'UTL_RAW.CAST_TO_RAW' IS NOT CURRENTLY SUPPORTED. **
'' AS CAST_TO_RAW,
--** SSC-FDM-OR0041 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_UTILITY.GET_TIME' IS NOT CURRENTLY SUPPORTED. **
'' AS GET_TIME
FROM DUAL;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0042

Date Type Transformed To Timestamp Has A Different Behavior

### Description

Date type is being transformed to either Date or Timestamp type depending on flag [–disableDateAsTimestamp](../../../user-guide/snowconvert/command-line-interface/oracle.md), because Date type in Snowflake has a different behavior than Oracle.

#### Key Differences

|  | Oracle DATE | Snowflake DATE |
| --- | --- | --- |
| Functionality | Stores date and time information | Stores only date information (year, month, day) |
| Internal Storage | Binary number representing seconds since epoch | Compact format optimized for dates |
| Use Cases | General-purpose date and time storage | Scenarios where only date information is needed |
| Advantages | Supports both date and time | More efficient storage for dates |
| Limitations | Can’t store date and time components separately. | Doesn’t store time information |

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE TABLE "PUBLIC"."TABLE1"
(
    "CREATED_DATE" DATE,
    "UPDATED_DATE" DATE
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE "PUBLIC"."TABLE1"
    (
        "CREATED_DATE" TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
        "UPDATED_DATE" TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

```sql
 -- Additional Params: --disableDateAsTimestamp
CREATE OR REPLACE TABLE "PUBLIC"."TABLE1"
    (
        "CREATED_DATE" DATE /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
        "UPDATED_DATE" DATE /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0043

BFILE/BLOB parameters are considered binary. A format may be needed.

### Description

This error happens when a TO_CLOB is converted to a TO_VARCHAR function. A format may be needed for BFILE/BLOB parameters.

#### Example Code

##### Input Code:

```sql
 SELECT TO_CLOB('Lorem ipsum dolor sit amet') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-OR0043 - BFILE/BLOB PARAMETERS ARE CONSIDERED BINARY, FORMAT MAY BE NEEDED. **
TO_VARCHAR('Lorem ipsum dolor sit amet')
FROM DUAL;
```

#### Best Practices

* Check if outputs in the input code and converted code are equivalent and add a format parameter if needed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0044

REGEXP_LIKE_UDF match parameter may not behave correctly

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This warning appears when the Oracle `REGEXP_LIKE`condition comes with the third parameter (match parameter)*.* The reason to add the warning is that the `REGEXP_LIKE_UDF`used to replace the `REGEXP_LIKE`does not recognize all the characters used by the match parameter, so the result of the query in Snowflake may not be equivalent to Oracle.

#### Example Code

##### Input Code Oracle:

```sql
 SELECT last_name
FROM hr.employees
WHERE REGEXP_LIKE (last_name, '([aeiou])\1', 'i')
ORDER BY last_name;
```

##### Generated Code:

```sql
 SELECT last_name
FROM
hr.employees
WHERE
--** SSC-FDM-OR0044 - REGEXP_LIKE_UDF MATCH PARAMETER MAY HAVE SOME FUNCTIONAL DIFFERENCES COMPARED TO ORACLE **
PUBLIC.REGEXP_LIKE_UDF(last_name, '([aeiou])\\1', 'i')
ORDER BY last_name;
```

* When the `REGEXP_LIKE`condition comes with one of the characters that are not supported by the user-defined function, maybe a possible solution is to change the regular expression to simulate the behavior of the missing character in the match parameter. To know more about the character not supported go to [REGEXP_LIKE_UDF](../../../../translation-references/oracle/functions/README.md) documentation.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0045

Partitions Clauses are Handled by Snowflake

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-OR0010](../conversion-issues/oracleEWI.md) documentation

### Description

This warning appears when the `PARTITION` and `SUBPARTITION` clauses appear within a query. Snowflake handle partitions automatically

#### Example Code

##### Input Code:

```sql
 SELECT * FROM TABLITA PARTITION(col1);
```

##### Generated Code:

```sql
 SELECT * FROM
TABLITA
--        --** SSC-FDM-OR0045 - PARTITIONS CLAUSES ARE HANDLED BY SNOWFLAKE **
--        PARTITION(col1)
                       ;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0046

The Subquery Restriction is not Possible in Snowflake

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This warning happens when a Subquery Restriction appears in a `SELECT` Statement.

#### Example Code

##### Input Code:

```sql
 SELECT * FROM LATERAL(SELECT * FROM TABLITA WITH READ ONLY CONSTRAINT T);
```

##### Generated Code:

```sql
 SELECT * FROM LATERAL(SELECT * FROM
TABLITA
--        --** SSC-FDM-OR0046 - THE SUBQUERY RESTRICTION IS NOT POSSIBLE IN SNOWFLAKE **
--        WITH READ ONLY CONSTRAINT T
                                   );
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0047

It may be needed to set a TimeStampOutput format.

### Description

TIMESTAMP_OUTPUT_FORMAT session parameter may need to be set to ‘DD-MON-YY HH24.MI.SS.FF AM TZH:TZM’ for timestamp output equivalence.

#### Example Code

##### Input Code:

```sql
 SELECT SYSTIMESTAMP FROM DUAL;
```

##### Example of default TIMESTAMP output in Oracle

> **Output:**
>
> 13-JAN-21 04.18.37.288656 PM +00:00

##### Generated Code:

```sql
 SELECT
CURRENT_TIMESTAMP() /*** SSC-FDM-OR0047 - YOU MAY NEED TO SET TIMESTAMP OUTPUT FORMAT ('DD-MON-YY HH24.MI.SS.FF AM TZH:TZM') ***/
FROM DUAL;
```

##### Example of default TIMESTAMP output in Snowflake

> **Output:**
>
> 2021-01-13 08:18:19.720 -080

#### Best Practices

* To change the timestamp output format in Snowflake use the following query:

  ```
  ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'DD-MON-YY HH24.MI.SS.FF AM TZH:TZM';
  ```
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0048

Date or timestamp output format has to be set

### Description

When SnowConvert AI transforms a DATE or TIMESTAMP to VARCHAR (for example, in a DEFAULT clause using SYSDATE or TRUNC(CURRENT_DATE())), the output depends on the OUTPUT_FORMAT and TIMESTAMP_OUTPUT_FORMAT session parameters. These may not match Oracle’s default format. Set the session parameters to match the Oracle values for equivalent output.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE orders (
   order_id INT,
   created_date VARCHAR(30) DEFAULT TO_CHAR(TRUNC(SYSDATE))
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE orders (
   order_id INT,
   created_date VARCHAR(30) DEFAULT TO_VARCHAR(TRUNC(CURRENT_TIMESTAMP(), 'DD')) /*** SSC-FDM-OR0048 - TRANSFORMATION OF DATE/TIMESTAMP TO VARCHAR DEPENDS ON THE OUTPUT_FORMAT SESSION PARAMETERS, SET THEM TO MATCH THE ORACLE VALUES ***/
);
```

#### Best Practices

* Set TIMESTAMP_OUTPUT_FORMAT and OUTPUT_FORMAT session parameters to match Oracle’s NLS format (e.g., ‘DD-MON-YY HH24.MI.SS.FF AM TZH:TZM’).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0049

LAG function might fail if default value type differs from the expression type.

### Description

In Oracle, the `LAG` function automatically converts the default value’s data type to match the expression’s type. Snowflake, however, does not perform this implicit conversion. Therefore, a warning is issued to indicate that the `LAG` function may fail if the data types are incompatible.

#### Example Code

##### Input Code:

```sql
 SELECT
    LAG(salary, 2, '0') OVER (ORDER BY salary) AS salary_two_steps_back
FROM
    employees;
```

##### Generated Code:

```sql
 SELECT
    --** SSC-FDM-OR0049 - LAG FUNCTION MIGHT FAIL IF DEFAULT VALUE TYPE DIFFERS FROM THE EXPRESSION TYPE. **
    LAG(salary, 2, '0')
    OVER (ORDER BY salary) AS salary_two_steps_back
FROM
    employees;
```

#### Best Practices

* Verify that the data type of the default value matches the data type of the expression in the `LAG` function. If they differ, explicitly cast the default value to the expression’s data type.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0050

Exceptions with `NOCOPY` parameters may lead to data inconsistency.

### Description

In Oracle PL/SQL, the `NOCOPY` keyword is an optimization hint for `OUT` and `IN OUT` procedure parameters. By default, Oracle passes these parameters by value, creating an expensive copy of the data during the call and copying it back upon completion. This can cause significant performance overhead for large data structures.

`NOCOPY` instructs Oracle to pass by reference instead, allowing the procedure to directly modify the original data. This eliminates copying overhead and improves performance. However, changes are immediate and are not implicitly rolled back if an unhandled exception occurs within the procedure.

Therefore, we will remove the NOCOPY parameters option and add this FDM. This is because procedure execution terminates upon hitting an exception, preventing the `RETURN` statement from being reached. As a result, the variable in the caller’s declare block retains its initial values, as the procedure fails to successfully return a new value for assignment.

#### Example Code

##### Input Code:

```sql
CREATE OR REPLACE PROCEDURE calculate_division_with_nocopy (
    p_numerator IN NUMBER,
    p_denominator IN NUMBER,
    p_result OUT NOCOPY NUMBER
)
IS
    PROCEDURE calculate_division(result OUT NOCOPY NUMBER)
    AS
    BEGIN
    result := 20;
    result := p_numerator / p_denominator;
    END calculate_division;
BEGIN
    calculate_division(p_result);
        EXCEPTION
        WHEN OTHERS THEN
            p_result := p_result;
END calculate_division_with_nocopy;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE calculate_division_with_nocopy (p_numerator NUMBER(38, 18), p_denominator NUMBER(38, 18), p_result OUT NUMBER(38, 18)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/23/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        calculate_division PROCEDURE(result
        --** SSC-FDM-OR0050 - EXCEPTIONS WITH NOCOPY PARAMETERS MAY LEAD TO DATA INCONSISTENCY. **
        NUMBER(38, 18))
        RETURNS NUMBER
        AS
            BEGIN
                result := 20;
                result := :p_numerator / :p_denominator;
                RETURN result;
            END;
        call_results NUMBER;
    BEGIN
        call_results := (
            CALL
            calculate_division(:p_result)
        );
        p_result := :call_results;
        EXCEPTION
        WHEN OTHER THEN
            p_result := :p_result;
        END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0051

Array size limit removed. Snowflake arrays are dynamically sized.

### Severity

None (functional difference)

### Description

When an Oracle `VARRAY` type is converted to a Snowflake `ARRAY` type, any **fixed maximum size** on the varray is not preserved because Snowflake arrays grow dynamically. SnowConvert emits this FDM to document that behavioral difference.

#### Example Code

##### Oracle:

```sql
CREATE TYPE PhoneNumbers AS VARRAY(10) OF VARCHAR2(20);
```

##### Snowflake:

```sql
--** SSC-FDM-OR0051 - ARRAY SIZE LIMIT '10' WAS REMOVED. SNOWFLAKE ARRAYS ARE DYNAMICALLY SIZED. **
CREATE TYPE PhoneNumbers AS ARRAY ( VARCHAR(20) );
```

#### Best Practices

* If application logic relies on a maximum number of elements, enforce it in application code or with constraints outside the type definition.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0052

Collection EXISTS index adjusted from 1-based to 0-based indexing.

### Severity

None (functional difference)

### Description

Oracle nested table and collection APIs often use **1-based** indexing; Snowflake `ARRAY` indexing is **0-based**. SnowConvert may adjust `EXISTS` index expressions and emits this FDM when that adjustment applies.

#### Best Practices

* Review all collection index arithmetic after migration.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0053

Embedded collection type definition removed. Collection variables are transformed to ARRAY.

### Severity

None (functional difference)

### Description

PL/SQL **nested collection types** declared inside a block are not kept as separate type definitions in Snowflake Scripting; variables are migrated toward `ARRAY` usage. This FDM documents that the embedded type definition was removed as part of that transformation.

#### Best Practices

* Validate runtime behavior for nested collections and associative arrays after conversion.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-OR0054

Embedded record type definition removed. Record variables are transformed to OBJECT.

### Severity

None (functional difference)

### Description

PL/SQL **RECORD** types declared inside a procedure or block are inlined or replaced with Snowflake Scripting `OBJECT`-style handling; the original embedded `TYPE ... IS RECORD` definition may be commented or removed with this FDM.

#### Best Practices

* Review accesses to record fields and default initialization after migration.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Oracle Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/oracleEWI.md
section: Migrations
---

# SnowConvert AI - Oracle Issues

## SSC-EWI-OR0001

Sequence start value with ‘LIMIT VALUE’ is not supported by Snowflake.

### Description

This error appears when the `START WITH` statement value is `LIMIT VALUE`.

In Oracle this clause is used only in ALTER TABLE

> * `START` `WITH` `LIMIT VALUE`, which is specific to `identity_options`, can only be used with `ALTER` `TABLE` `MODIFY`. If you specify `START` `WITH` `LIMIT VALUE`, then Oracle Database locks the table and finds the maximum identity column value in the table (for increasing sequences) or the minimum identity column value (for decreasing sequences) and assigns the value as the sequence generator’s high water mark. The next value returned by the sequence generator will be the high water mark + `INCREMENT` `BY` `integer` for increasing sequences, or the high water mark - `INCREMENT` `BY` `integer` for decreasing sequences.

#### [ALTER TABLE ORACLE](https://docs.oracle.com/en/database/oracle/oracle-database/23/sqlrf/ALTER-TABLE.html#GUID-552E7373-BF93-477D-9DA3-B2C9386F2877)

#### Example Code

##### Input Code:

```sql
 CREATE SEQUENCE SEQUENCE1
  START WITH LIMIT VALUE;
```

##### Generated Code:

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE1
  !!!RESOLVE EWI!!! /*** SSC-EWI-OR0001 - SEQUENCE START VALUE WITH 'LIMIT VALUE' IS NOT SUPPORTED BY SNOWFLAKE. ***/!!!
  START WITH LIMIT VALUE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}';
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0002

Columns from expression not found

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

High

### Description

This error happens when the columns of a Select Expression were unable to be resolved, usually when it either refers to a Type Access whose reference wasn’t resolved or a column with a User Defined Type whose columns haven’t been defined; such as a Type Without Body or Object Type with no columns.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE record_unknown_table_proc
AS
    unknownTable_variable_rowtype unknownTable%ROWTYPE;
BEGIN
    INSERT INTO MyTable values unknownTable_variable_rowtype;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE record_unknown_table_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        unknownTable_variable_rowtype OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
    BEGIN
        INSERT INTO MyTable
        SELECT
            null !!!RESOLVE EWI!!! /*** SSC-EWI-OR0002 - COLUMNS FROM EXPRESSION unknownTable%ROWTYPE NOT FOUND ***/!!!;
    END;
$$;
```

#### Best Practices

* Verify that the type definition that was referenced does have columns within it.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0004

The used syntax in select is not supported in Snowflake.

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Severity

High

### Description

This warning happens when a clause in a select is not supported in Snowflake. The not supported clauses are:

* CONTAINERS
* HIERARCHIES
* EXTERNAL MODIFY
* SHARDS

#### Example Code

##### Input Code:

```sql
 SELECT * FROM TABLE1 EXTERNAL MODIFY (LOCATION 'file.csv' REJECT LIMIT UNLIMITED);
```

##### Generated Code:

```sql
 SELECT * FROM
TABLE1
       !!!RESOLVE EWI!!! /*** SSC-EWI-OR0004 - THE 'OPTIONAL MODIFIED EXTERNAL' SYNTAX IN SELECT IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
       EXTERNAL MODIFY (LOCATION 'file.csv' REJECT LIMIT UNLIMITED);
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0005

BFILE/BLOB parameters are considered binary. A format may be needed.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-OR0043](../functional-difference/oracleFDM.md) documentation.

### Severity

Low

### Description

This error happens when a TO_CLOB is converted to a TO_VARCHAR function. A format may be needed for BFILE/BLOB parameters.

#### Example Code

##### Input Code:

```sql
 SELECT TO_CLOB('Lorem ipsum dolor sit amet') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0005 - BFILE/BLOB PARAMETERS ARE CONSIDERED BINARY, FORMAT MAY BE NEEDED ***/!!!
TO_VARCHAR('Lorem ipsum dolor sit amet')
FROM DUAL;
```

#### Best Practices

* Check if outputs in the input code and converted code are equivalent and add a format parameter if needed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0006

It may be needed to set a TimeStampOutput format.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-OR0047](../functional-difference/oracleFDM.md) documentation.

### Severity

Low

### Description

TIMESTAMP_OUTPUT_FORMAT session parameter may need to be set to ‘DD-MON-YY HH24.MI.SS.FF AM TZH:TZM’ for timestamp output equivalence.

#### Example Code

##### Input Code:

```sql
 SELECT SYSTIMESTAMP FROM DUAL;
```

##### Example of default TIMESTAMP output in Oracle

> **Output:**
>
> 13-JAN-21 04.18.37.288656 PM +00:00

##### Generated Code:

```sql
 SELECT
CURRENT_TIMESTAMP() !!!RESOLVE EWI!!! /*** SSC-EWI-OR0006 - YOU MAY NEED TO SET TIMESTAMP OUTPUT FORMAT ('DD-MON-YY HH24.MI.SS.FF AM TZH:TZM') ***/!!!
FROM DUAL;
```

##### Example of default TIMESTAMP output in Snowflake

> **Output:**
>
> 2021-01-13 08:18:19.720 -080

#### Best Practices

* To change the timestamp output format in Snowflake use the following query:

  ```
  ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'DD-MON-YY HH24.MI.SS.FF AM TZH:TZM';
  ```
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0007

Create Type Not Supported in Snowflake

> **Note:**
>
> **Deprecation:** This Oracle-specific issue is deprecated in favor of [SSC-EWI-0095](generalEWI.md) (general) and updated `CREATE TYPE` handling. The **Example Code** below is unchanged for historical accuracy. Unsupported `CREATE TYPE` shapes may now surface under SSC-EWI-OR0139, SSC-EWI-OR0140, SSC-EWI-OR0141, or SSC-EWI-OR0142 as applicable.

### Description

This message is added when a Create Type statement not supported by Snowflake is used.

#### Example Code

##### Input Code (Oracle):

```sql
 CREATE TYPE type6 UNDER type5(COL1 INTEGER);
```

##### Generated Code:

```sql
 --!!!RESOLVE EWI!!! /*** SSC-EWI-OR0007 - CREATE TYPE SUBTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
--CREATE TYPE type6 UNDER type5(COL1 INTEGER)
                                           ;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0008

Unknown format, it may have unexpected behavior.

### Severity

Low

### Description

This error is added for unknown date formats that may have unexpected behavior.

#### Example Code

##### Input Code:

```sql
 SELECT TO_CHAR(DATE '1998-12-25','iw-iyyy') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0008 - UNKNOWN FORMAT, MAY HAVE UNEXPECTED BEHAVIOR ***/!!!
 TO_CHAR(DATE '1998-12-25','iw-iyyy'') FROM DUAL;
```

> **Note:**
>
> Note that ‘iw-iyyy’’ is not a supported format.

#### Best Practices

* Check for this [documentation](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#date-and-time-formats) for the supported timestamp formats.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0009

JSON_TABLE is not supported.

### Severity

High

### Description

JSON_TABLE function is not currently supported.

#### Example Code

##### Input Code:

```sql
 SELECT jt.phones
FROM j_purchaseorder,
JSON_TABLE(po_document, '$.ShippingInstructions'
COLUMNS
(phones VARCHAR2(100) FORMAT JSON PATH '$.Phone')) AS jt;
```

##### Generated Code:

```sql
 SELECT jt.phones
FROM
j_purchaseorder,
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0009 - JSON_TABLE IS NOT SUPPORTED ***/!!!
JSON_TABLE(po_document, '$.ShippingInstructions'
COLUMNS
(phones VARCHAR(100) FORMAT JSON PATH '$.Phone')) AS jt;
```

#### Best Practices

* You can take advantage of the [FLATTEN](https://docs.snowflake.com/en/sql-reference/functions/flatten.html) function in Snowflake to emulate the functionality of JSON_TABLE.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0010

Partitions Clauses are Handled by Snowflake. It requires manual fix

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Severity

Critical

### Description

This warning appears when the `PARTITION` and `SUBPARTITION` clauses appear within a query. Snowflake handle partitions automatically

#### Example Code

##### Input Code:

```sql
 SELECT * FROM table1 PARTITION(col1);
```

##### Generated Code:

```sql
 SELECT * FROM
table1
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0010 - PARTITIONS CLAUSES ARE HANDLED BY SNOWFLAKE. IT REQUIRES MANUAL FIX ***/!!!
        PARTITION(col1);
```

#### Best Practices

* Manual change is required to get equivalent functionality in Snowflake. A `WHERE` condition is needed to filter the rows for the specific partition. However, with this workaround, performance is affected.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0011

The format parameter is not supported.

### Severity

Medium

### Description

The format parameter is not currently supported by Snowflake for Cast functions in especial cases. For example, when we use “MONTH” or “DAY” inside the DATE or TIMESTAMP format.

```none
"MONTH/DD/YYYY" or "MM/DAY/YY" ...
```

Other scenario is when you are working with CAST function using NUMBER currently Snowflake need to have 4 arguments to show the decimal part, for now the output code not offer all arguments needed for Snowflake, you need to add the rest arguments for [TO_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/to_decimal) function.

#### Example Code

##### Input Code:

```sql
 SELECT CAST('12.48' AS NUMBER, '99.99') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
TO_NUMBER('12.48', '99.99', 38, 2)
FROM DUAL;
```

##### Input Code:

```sql
 SELECT CAST('FEBRUARY/18/24' as DATE, 'MONTH/DD/YY') FROM DUAL;
SELECT CAST('FEB/MON/24' as DATE, 'MON/DAY/YY') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0011 - THE FORMAT PARAMETER 'MONTH/DD/YY' IS NOT SUPPORTED ***/!!!
TO_TIMESTAMP ('FEBRUARY/18/24' , 'MONTH/DD/YY')
FROM DUAL;

SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0011 - THE FORMAT PARAMETER 'MON/DAY/YY' IS NOT SUPPORTED ***/!!!
TO_TIMESTAMP ('FEB/MON/24' , 'MON/DAY/YY')
FROM DUAL;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0013

NLS parameter is not supported.

### Severity

Medium

### Description

NLS parameter is not currently supported for the following functions:

* TOCHAR
* TODATE
* TONUMBER
* TOTIMESTAMP
* CAST

#### Example Code

##### Input Code:

```sql
 SELECT TO_NUMBER('-AusDollars100','9G999D99', ' NLS_NUMERIC_CHARACTERS = '',.''NLS_CURRENCY= ''AusDollars''') "Amount" FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0013 - NLS PARAMETER ' NLS_NUMERIC_CHARACTERS = '',.''NLS_CURRENCY= ''AusDollars''' NOT SUPPORTED ***/!!!
TO_NUMBER('-AusDollars100', '9G999D99') "Amount" FROM DUAL;
```

## SSC-EWI-OR0014

NLSSORT not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

### Description

NLSSORT function is not currently supported in the body of a select.

#### Example Code

##### Input Code:

```sql
 SELECT NLSSORT(name, 'NLS_SORT = ENGLISH') FROM products;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0014 - FUNCTION NLSSORT IS NOT SUPPORTED ***/!!!
 NLSSORT(name, 'NLS_SORT = ENGLISH') FROM
 products;
```

#### Best Practices

* NLSSORT is converted to a user-defined function (UDF/Stub), so you can modify it to emulate the functionality.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0016

XML is not supported.

### Severity

Medium

### Description

The following XML related functions are not supported:

* EXTRACT
* EXTRACTVALUE
* XMLSEQUENCE
* XMLTYPE

#### Example Code

##### Input Code:

```sql
 select * from table(XMLSequence(XMLType('
<Product ProductCode="200">
 <BrandName>Notebook</BrandName>
 <ProductList>
  <Item ItemNo="200A"><Price>900</Price></Item>
  <Item ItemNo="200B"><Price>700</Price></Item>
  <Item ItemNo="200C"><Price>650</Price></Item>
  <Item ItemNo="200D"><<Price>750</Price></Item>
</ProductList>
</Product>')));
```

##### Generated Code:

```sql
 select * from table(
                    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0016 - FUNCTION RELATED WITH XML NOT SUPPORTED ***/!!!
XMLSequence(
            !!!RESOLVE EWI!!! /*** SSC-EWI-OR0016 - FUNCTION RELATED WITH XML NOT SUPPORTED ***/!!!XMLType('
<Product ProductCode="200">
 <BrandName>Notebook</BrandName>
 <ProductList>
  <Item ItemNo="200A"><Price>900</Price></Item>
  <Item ItemNo="200B"><Price>700</Price></Item>
  <Item ItemNo="200C"><Price>650</Price></Item>
  <Item ItemNo="200D"><<Price>750</Price></Item>
</ProductList>
</Product>')));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0020

Negative values not supported for function.

### Severity

Medium

### Description

Snowflake does not support negative values for the function, then this will cause different behavior when executed. This EWI is emitted when a function like `INSTR` uses a negative position parameter that cannot be automatically translated.

> **Note:**
>
> `INSTR` with position = -1 is automatically translated to a functionally equivalent Snowflake expression and does not trigger this EWI. Only positions less than -1 (e.g., -3, -5) emit this warning.

#### Example Code

##### Input Code:

```sql
 SELECT INSTR('CORPORATE FLOOR','OR', -3, 2) FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
REGEXP_INSTR('CORPORATE FLOOR','OR', -3, 2) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0020 - NEGATIVE VALUES NOT SUPPORTED FOR FUNCTION ***/!!! FROM DUAL;
```

#### Best Practices

* Create a User Defined Function that can handle the negative parameter or look for another alternative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0023

AGGREGATE function not supported.

### Severity

High

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This error is added when an aggregate function as

* DENSE_RANK()
* RANK()
* PERCENT_RANK()
* CUME_DIST()

is not supported in Snowflake.

#### Example Code

##### Input Code:

```sql
 SELECT DENSE_RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM employees;

SELECT RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM employees;

SELECT PERCENT_RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM employees;

SELECT CUME_DIST(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM employees;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0023 - DENSE_RANK AGGREGATE FUNCTION SYNTAX IS NOT SUPPORTED BY SNOWFLAKE. ***/!!!
 DENSE_RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM
 employees;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0023 - RANK AGGREGATE FUNCTION SYNTAX IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM
 employees;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0023 - PERCENT_RANK AGGREGATE FUNCTION SYNTAX IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! PERCENT_RANK(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM
 employees;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0023 - CUME_DIST AGGREGATE FUNCTION SYNTAX IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! CUME_DIST(12000) WITHIN GROUP (ORDER BY salary DESC NULLS FIRST) FROM
 employees;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0026

ROWID is not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

### Description

ROWID statement is not currently supported.

#### Example Code

##### Oracle:

```sql
 SELECT QUERY_NAME.ROWID from TABLE1;
```

##### Snowflake Scripting:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0026 - ROWID NOT SUPPORTED ***/!!!
 QUERY_NAME.ROWID from
 TABLE1;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0029

DEFAULT ON CONVERSION ERROR is not supported.

### Description

Default on conversion error not supported in Snowflake

#### Example Code

##### Input Code:

```sql
 SELECT TO_NUMBER('2,00' DEFAULT 0 ON CONVERSION ERROR) "Value" FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
PUBLIC.TO_NUMBER_UDF('2,00', 0) "Value" FROM DUAL;
```

#### Best Practices

* You might create UDF to emulate the behavior of `DEFAULT` value `ON CONVERSION ERROR`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0030

KEEP statement used in the aggregate function is not supported

### Severity

Medium

### Description

This error appears to advertise that the KEEP statement used to indicate that only the first or last values of the aggregate function will be returned is not supported

#### Example Code

##### Input Code:

```sql
 SELECT
    department_id,
    MIN(salary) KEEP (
        DENSE_RANK FIRST
        ORDER BY
            commission_pct
    ) "Worst"
FROM
    employees;
```

##### Generated Code:

```sql
 SELECT
    department_id,
    MIN(salary)
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0030 - KEEP STATEMENT USED IN THE AGGREGATE FUNCTION IS NOT SUPPORTED ***/!!!
 KEEP (
        DENSE_RANK FIRST
        ORDER BY
            commission_pct
    ) "Worst"
FROM
 employees;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0031

SYS_CONTEXT parameter is not supported.

### Severity

Low

### Description

This error happens when a SYS_CONTEXT function parameter is not supported. Snowflake support similar context functions, check the [page](https://docs.snowflake.com/en/sql-reference/functions-context) to more information

#### Example Code

##### Input Code:

```sql
 SELECT SYS_CONTEXT ('USERENV', 'NLS_SORT') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0031 - 'NLS_SORT' SYS_CONTEXT PARAMETER NOT SUPPORTED IN SNOWFLAKE ***/!!!
 SYS_CONTEXT ('USERENV', 'NLS_SORT') FROM DUAL;
```

#### Best Practices

* The function is converted to a user defined function(stub), so you can modify it to emulate the behavior of the SYS_CONTEXT parameter.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0032

Parameter with the specified format is not supported.

### Severity

Medium

### Description

This error happens when a parameter in a function is not supported.

#### Example Code

##### Input Code:

```sql
 SELECT TO_CHAR(DATE '1998-12-25', 'AM') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0032 - PARAMETER USED IN THE FUNCTION 'TO_CHAR' WITH FORMAT AM IS NOT SUPPORTED ***/!!!
 TO_CHAR(DATE '1998-12-25', 'AM') FROM DUAL;
```

#### Best Practices

* The function is converted to a user defined function(stub), so you can modify it to emulate the behavior of the parameter.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0033

PL/SQL declaration in WITH is not supported.

### Severity

Medium

### Description

PL/SQL declarations in WITH statements are not supported.

#### Example Code

##### Input Code:

```sql
 WITH FUNCTION get_domain ( url VARCHAR2 ) RETURN VARCHAR2 IS pos BINARY_INTEGER;
len BINARY_INTEGER;
BEGIN
pos := INSTR(url, 'www.');
len := INSTR(SUBSTR(url, pos + 4), '.') - 1;
END; SELECT aValue from aTable;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
WITH
     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0033 - PLDECLARATION IN WITH NOT SUPPORTED ***/!!!
 FUNCTION get_domain ( url VARCHAR2 ) RETURN VARCHAR2 IS pos BINARY_INTEGER;
len BINARY_INTEGER;
BEGIN
pos := INSTR(url, 'www.');
len := INSTR(SUBSTR(url, pos + 4), '.') - 1;
END; SELECT aValue from
aTable;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0035

The table function is not supported when it is used as a collection of expressions.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

### Description

TABLE function is not supported in Snowflake when it is used as a collection of expressions.

#### Example Code

##### Input Code:

```sql
 SELECT
TABLE2.COLUMN_VALUES
FROM TABLE1 i, TABLE(i.groups) TABLE2;
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
TABLE2.COLUMN_VALUES
FROM
TABLE1 i,
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0035 - TABLE FUNCTION IS NOT SUPPORTED WHEN IT IS USED AS A COLLECTION OF EXPRESSIONS ***/!!! TABLE(i.groups) TABLE2;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0036

Types resolution issues, the arithmetic operation may not behave correctly between string and date.

### Severity

Low

### Description

This issue happens when an arithmetic operation may not behave correctly between two certain data types.

#### Example Code

##### Input Code:

```sql
 SELECT
    SYSDATE,
    SYSDATE + '1',
    SYSDATE + 'A'
from
    dual;
```

##### Generated Code:

```sql
 SELECT
    CURRENT_TIMESTAMP(),
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Date AND String ***/!!!
    CURRENT_TIMESTAMP() + '1',
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Date AND String ***/!!!
    CURRENT_TIMESTAMP() + 'A'
from
    dual;
```

> **Note:**
>
> Note that the operation between a String and Date may not behave correctly.

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0038

Search clause removed from the with element statement.

### Severity

Low

### Description

The [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is employed to define the order in which rows are processed in a SELECT statement. This functionality allows for a customized traversal of the data, ensuring that the results are returned in a specific sequence based on the specified criteria. It is important to note, however, that this behavior, characterized by the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142), is not supported in Snowflake.

In databases such as Oracle, the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is commonly used in conjunction with recursive queries or common table expressions (CTEs) to influence the sequence in which hierarchical data is explored. By designating a particular column or set of columns in the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142), you can control the depth-first or breadth-first traversal of the hierarchy, impacting the order in which rows are processed.

In Snowflake, [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) message will be generated, and the [`search_clause`](https://docs.oracle.com/en/database/oracle/oracle-database/19/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__I2077142) is subsequently eliminated.

#### Example Code

##### Input Code:

```sql
 WITH dup_hiredate(eid, emp_last, mgr_id, reportLevel, hire_date, job_id) AS
(SELECT aValue from atable) SEARCH DEPTH FIRST BY hire_date SET order1 SELECT aValue from atable;
```

##### Generated Code:

```sql
 WITH dup_hiredate(eid, emp_last, mgr_id, reportLevel, hire_date, job_id) AS
(
SELECT aValue from
atable
) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0038 - SEARCH CLAUSE REMOVED FROM THE WITH ELEMENT STATEMENT ***/!!!
SELECT aValue from
atable;
```

#### Recommendation

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0039

The nocycle clause is not supported in Snowflake.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

### Description

This message is shown when SnowConvert AI finds a query with a NOCYCLE clause, which is not supported in Snowflake.

This clause marks when there is a recursion.

For more details see the [documentation](https://docs.oracle.com/en/database/oracle/oracle-database/23/sqlrf/SELECT.html#GUID-CFA006CA-6FF1-4972-821E-6996142A51C6__GUID-8EE64250-3C9A-40C7-A81D-46695F8B2EB9) about the clause functionality.

#### Example Code

#### Connect By

##### Input Code:

```sql
 CREATE OR REPLACE FORCE NONEDITIONABLE VIEW VIEW01 AS
SELECT
      UNIQUE A.*
FROM
      TABLITA A
WHERE
      A.X = A.C CONNECT BY NOCYCLE A.C = 0 START WITH A.B = 1
HAVING
      X = 1
GROUP BY
      A.C;
```

##### Generated Code:

```sql
 CREATE OR REPLACE VIEW VIEW01
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
SELECT DISTINCT
      A.*
FROM
      TABLITA A
WHERE
      A.X = A.C
GROUP BY
      A.C
HAVING
      X = 1
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0039 - NOCYCLE CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
CONNECT BY
      A.C = 0 START WITH A.B = 1;
```

#### Best Practices

* If there are cycles in the data hierarchy, you can review this [article](https://docs.snowflake.com/en/user-guide/queries-cte#cause-1-cyclic-data-hierarchy) to deal with them.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).
* Please review the following link for manual workaround: <https://community.snowflake.com/s/article/NOCYCLE-workaround>

## SSC-EWI-OR0042

Model clause is not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

### Description

This message is shown when SnowConvert AI finds a query with a MODEL clause, which is not supported in Snowflake.

#### Example Code

##### Input Code:

```sql
 SELECT
   employee_id,
   salary
FROM
   employees
MODEL
DIMENSION BY (employee_id)
MEASURES (salary)
();
```

##### Generated Code:

```sql
 SELECT
   employee_id,
   salary
FROM
   employees
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0042 - MODEL CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
MODEL
DIMENSION BY (employee_id)
MEASURES (salary)
();
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0045

Cast type L and FML is not supported

### Severity

Medium

### Description

This issue happens when trying to cast using FML or L format that is not applicable in Snowflake, then the code is commented out and this message is being added.

#### Example Code:

##### Input Code:

```sql
 SELECT CAST(' $123.45' as number, 'L999.99') FROM DUAL;
SELECT CAST('$123.45' as number, 'FML999.99') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0045 - CAST TYPE L AND FML NOT SUPPORTED ***/!!!
 CAST(' $123.45' as NUMBER(38, 18) , 'L999.99') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0045 - CAST TYPE L AND FML NOT SUPPORTED ***/!!! CAST('$123.45' as NUMBER(38, 18) , 'FML999.99') FROM DUAL;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0046

Alter Table syntax is not applicable in Snowflake.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-EWI-0109](generalEWI.md) documentation

### Severity

Medium

### Description

The Alter Table syntax used is not applicable in Snowflake, then the code is commented out and this message is being added.

#### Example Code:

##### Input Code:

```sql
 ALTER TABLE SOMENAME DEFAULT COLLATION SOMENAME;

ALTER TABLE SOMENAME ROW ARCHIVAL;

ALTER TABLE SOMENAME MODIFY CLUSTERING;

ALTER TABLE SOMENAME DROP CLUSTERING;

ALTER TABLE SOMENAME SHRINK SPACE COMPACT CASCADE;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SOMENAME" **
ALTER TABLE SOMENAME
DEFAULT COLLATION SOMENAME;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SOMENAME" **

ALTER TABLE SOMENAME
ROW ARCHIVAL;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SOMENAME" **

ALTER TABLE SOMENAME
MODIFY CLUSTERING;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SOMENAME" **

ALTER TABLE SOMENAME
DROP CLUSTERING;

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SOMENAME" **

ALTER TABLE SOMENAME
SHRINK SPACE COMPACT CASCADE;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0047

TO_NCHAR transformed to TO_VARCHAR, it may not be compilable in Snowflake.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This warning is added when the function `TO_NCHAR` was found and it was transformed into a `TO_VARCHAR` function.

There are multiple cases where the transformation causes a compilation error, or the output is not the same.

#### Example Code

##### Input Code:

```sql
 select TO_NCHAR(sysdate,'DY','nls_date_language=english') from dual
```

##### Generated Code:

```sql
 select
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0047 - TO_NCHAR TRANSFORMED TO TO_VARCHAR, IT MAY NOT BE COMPILABLE IN SNOWFLAKE ***/!!!
TO_VARCHAR(CURRENT_TIMESTAMP(),'DY','nls_date_language=english') from dual;
```

The example from above will result in an error if it is used in Snowflake.

Not all cases are causing errors.

##### Input Code:

```sql
 SELECT TO_NCHAR(SYSDATE, 'YYYY-MM-DD') FROM dual;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0047 - TO_NCHAR TRANSFORMED TO TO_VARCHAR, IT MAY NOT BE COMPILABLE IN SNOWFLAKE ***/!!!
TO_VARCHAR(CURRENT_TIMESTAMP(), 'YYYY-MM-DD') FROM dual;
```

The last example does not cause an error in Snowflake, and the output is equivalent if executed.

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0049

Package constants in stateful package are not supported yet.

### Severity

Critical

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This warning is added when there is a member of a Stateful Package that is not supported yet.

This feature is planned to be delivered in the future.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PACKAGE MY_PACKAGE
AS
    TYPE COLLECTIONTYPEDEFINITION IS TABLE OF BULKCOLLECTTABLE%ROWTYPE;
END;
```

##### Generated Code:

```sql
 CREATE SCHEMA IF NOT EXISTS MY_PACKAGE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

!!!RESOLVE EWI!!! /*** SSC-EWI-OR0049 - PACKAGE TYPE DEFINITIONS in stateful package MY_PACKAGE are not supported yet ***/!!!
TYPE COLLECTIONTYPEDEFINITION IS TABLE OF BULKCOLLECTTABLE%ROWTYPE;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0050

Input Expression is out of the range

### Severity

Medium

### Description

This issue happens when trying to cast an input value that is out of range. It means the precision values are not applicable in Snowflake, then the code is commented out and this message is being added.

#### Example Code:

##### Input Code:

```sql
 SELECT CAST('123,456E+40' AS NUMBER, '999,999EEE') FROM DUAL;
SELECT CAST('12.34567891234567891234567891234567891267+' AS NUMBER, '99.999999999999999999999999999999999999S') FROM DUAL;
SELECT CAST('12.34567891234567891234567891234567891267' AS NUMBER, '99.999999999999999999999999999999999999') FROM DUAL;
select cast(' 1.0E+123' as number, '9.9EEEE') from dual;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '123,456E+40' ***/!!!
 CAST('123,456E+40' AS NUMBER(38, 18) , '999,999EEE') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '12.34567891234567891234567891234567891267+' ***/!!! CAST('12.34567891234567891234567891234567891267+' AS NUMBER(38, 18) , '99.999999999999999999999999999999999999S') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE '12.34567891234567891234567891234567891267' ***/!!! CAST('12.34567891234567891234567891234567891267' AS NUMBER(38, 18) , '99.999999999999999999999999999999999999') FROM DUAL;

select
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0050 - INPUT EXPRESSION IS OUT OF THE RANGE ' 1.0E+123' ***/!!! cast(' 1.0E+123' as NUMBER(38, 18) , '9.9EEEE') from dual;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0051

PRAGMA EXCEPTION_INIT is not supported.

### Severity

Low

### Description

This EWI is added when PRAGMA EXCEPTION_INIT function is invoked within a procedure. Exception Name and SQL Code of the exceptions are set in the RAISE function. When it is converted to Snowflake Scripting, the SQL Code is added to the Exception declaration, however, some code values may be invalid in Snowflake Scripting.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE AUTHID DEFINER IS
  NEW_EXCEPTION EXCEPTION;
  PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63);
  NEW_EXCEPTION2 EXCEPTION;
  PRAGMA EXCEPTION_INIT ( NEW_EXCEPTION2, -20100 );
BEGIN

  IF true THEN
    RAISE NEW_EXCEPTION;
  END IF;

EXCEPTION
    WHEN NEW_EXCEPTION THEN
        --Handle Exceptions
        NULL;
END;
/
```

##### Generated Code:

##### Snowflake script

```sql
 CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0099 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS ***/!!!
    NEW_EXCEPTION EXCEPTION;
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
    PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63);
    NEW_EXCEPTION2 EXCEPTION (-20100, '');
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
  PRAGMA EXCEPTION_INIT ( NEW_EXCEPTION2, -20100 );
  BEGIN
    IF (true) THEN
      RAISE NEW_EXCEPTION;
    END IF;
    EXCEPTION
        WHEN NEW_EXCEPTION THEN
            --Handle Exceptions
            NULL;
    END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0052

Exception declaration is handled by the raise function.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag `-t JavaScript` or `--PLTargetLanguage JavaScript`

### Description

Exceptions can be defined in both languages, Oracle and Snowflake, but the RAISE function is designed to do declaration, assignment, and throw the error. This is why the Exception declaration is commented out and the warning is displayed.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE AUTHID DEFINER IS
  NEW_EXCEPTION EXCEPTION;
  PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63);
BEGIN

  IF true THEN
    RAISE NEW_EXCEPTION;
  END IF;

EXCEPTION
    WHEN NEW_EXCEPTION THEN
        --Handle Exceptions
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE EXCEPTION_DECLARATION_SAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
  !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PlInvokerRightsClause' NODE ***/!!!
  //AUTHID DEFINER
  null
  // SnowConvert AI Helpers Code section is omitted.

  try {
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0052 - EXCEPTION DECLARATION IS HANDLED BY RAISE FUNCTION ***/!!!
    /*   NEW_EXCEPTION EXCEPTION */
    ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
    /*   PRAGMA EXCEPTION_INIT(NEW_EXCEPTION, -63) */
    ;
    if (true) {
      RAISE(-63,`NEW_EXCEPTION`,`NEW_EXCEPTION`);
    }
  } catch(error) {
    switch(error.name) {
      case `NEW_EXCEPTION`: {
        break;
      }
      default: {
        throw error;
        break;
      }
    }
  }
  //Handle Exceptions
  ;
$$;
```

> **Note:**
>
> Some parts of the output code are omitted to improve readability.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0053

Incorrect input format

### Severity

Medium

### Description

This issue happens when trying to cast using a wrong input format, then the code is commented out and this message is being added.

#### Example Code:

##### Input Code:

```sql
 SELECT CAST('12sdsd3,456E+40' AS NUMBER, '999,999EEE') FROM DUAL;
SELECT CAST('12345sdsd' AS NUMBER, '99999') FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0053 - INCORRECT INPUT FORMAT '12sdsd3,456E+40' ***/!!!
 CAST('12sdsd3,456E+40' AS NUMBER(38, 18) , '999,999EEE') FROM DUAL;

SELECT
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0053 - INCORRECT INPUT FORMAT '12345sdsd' ***/!!! CAST('12345sdsd' AS NUMBER(38, 18) , '99999') FROM DUAL;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0057

Transformation for nested procedure or function is not supported in this scenario.

### Severity

Critical

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

Translation of nested functions inside other functions or procedures is not supported. Similarly, procedures nested within functions or anonymous blocks are not currently supported.

However, nested procedures within other procedures or packages are supported. For additional details, see the [Nested Procedures Documentation](../../../../translation-references/oracle/pl-sql-to-snowflake-scripting/README.md).

#### Example Code

##### Input Code:

```sql
CREATE OR REPLACE function FOO1 RETURN INTEGER AS
    FUNCTION FOO2 RETURN INTEGER AS
    BEGIN
        RETURN 123;
    END;
BEGIN
    RETURN FOO2() + 456;
END;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0046 - NESTED FUNCTION/PROCEDURE DECLARATIONS ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!!
CREATE OR REPLACE PROCEDURE FOO1 ()
RETURNS INTEGER
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0057 - TRANSFORMATION FOR NESTED FUNCTION IS NOT SUPPORTED IN THIS SCENARIO ***/!!!
        FUNCTION FOO2 RETURN INTEGER AS
        BEGIN
            RETURN 123;
        END;
    BEGIN
        RETURN FOO2() + 456;
    END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0067

Multiple constraint definition in a single statement is not supported in Snowflake.

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

Multiple Constraint Definition in a single ALTER TABLE statement is not supported in Snowflake.

#### Example Code

##### Oracle:

```sql
 ALTER TABLE TABLE1 ADD (
  CONSTRAINT TABLE1_PK
  PRIMARY KEY
  (ID)
  ENABLE VALIDATE,
  CONSTRAINT TABLE1_FK foreign key(ID2)
  references TABLE2 (ID) ON DELETE CASCADE);
```

##### Snowflake Scripting:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-OR0067 - MULTIPLE CONSTRAINT DEFINITION IN A SINGLE STATEMENT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
ALTER TABLE TABLE1
ADD (
  CONSTRAINT TABLE1_PK
  PRIMARY KEY
  (ID) ,
  CONSTRAINT TABLE1_FK foreign key(ID2)
  references TABLE2 (ID) ON DELETE CASCADE);
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0068

The sequence start value exceeds the max value allowed by Snowflake.

### Severity

Medium

### Description

This error appears when the `START WITH` statement value exceeds the maximum value allowed by Snowflake. What Snowflake said about the start value is: *Specifies the first value returned by the sequence. Supported values are any value that can be represented by a 64-bit two’s complement integer (from `-2^63` to `2^63-1`)*. So according to the previously mentioned, the max value allowed is **9223372036854775807** for positive numbers and **9223372036854775808** for negative numbers.

#### Example Code

##### Input Code:

```sql
 CREATE SEQUENCE SEQUENCE1
START WITH 9223372036854775808;
```

```sql
 CREATE SEQUENCE SEQUENCE2
START WITH -9223372036854775809;
```

##### Generated Code:

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE1
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0068 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. ***/!!!
START WITH 9223372036854775808
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

```sql
 CREATE OR REPLACE SEQUENCE SEQUENCE2
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0068 - SEQUENCE START VALUE EXCEEDS THE MAX VALUE ALLOWED BY SNOWFLAKE. ***/!!!
START WITH -9223372036854775809
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}';
```

#### Best Practices

* It can be recommended to just reset the sequence and modify its usage too. **NOTE**: the target column must have enough space to hold this value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0069

The sequence CURRVAL property is not supported in Snowflake.

### Severity

Medium

### Description

The sequence CURRVAL property is not supported in Snowflake.

#### Example Code

##### Oracle:

```sql
 select seq1.currval from dual;
```

##### Snowflake Scripting:

```sql
 select
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0069 - THE SEQUENCE CURRVAL PROPERTY IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
 seq1.currval from dual;
```

#### Best Practices

* You can check this [link](https://docs.snowflake.com/en/user-guide/querying-sequences.html#currval-not-supported) to see what Snowflake suggests to handle situations where the CURRVAL property is used.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0070

Binary Operation Not Supported

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

A binary operation is not currently supported, a user-defined function is added.

#### Example Code

##### Oracle:

```sql
 -- Unsupported operation: EXCEPT DISTINCT
SELECT someValue MULTISET EXCEPT DISTINCT multiset_except FROM customers_demo;
```

##### Snowflake Scripting:

```sql
 -- Unsupported operation: EXCEPT DISTINCT
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0070 - BINARY OPERATION MULTISET EXCEPT IS NOT SUPPORTED ***/!!!
 someValue MULTISET EXCEPT DISTINCT multiset_except FROM
 customers_demo;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0071

Set Quantifier Not Supported

### Severity

Low

### Description

Quantifier ‘all’ is not supported in Snowflake. The modifier is removed from the source code, and a warning is added; the resulting code may behave unexpectedly.

#### Example Code

##### Input Code:

```sql
 SELECT location_id  FROM locations
MINUS ALL
SELECT location_id  FROM departments;
```

##### Generated Code:

```sql
 SELECT location_id  FROM
locations
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0071 - QUANTIFIER 'ALL' NOT SUPPORTED FOR THIS SET OPERATOR, RESULTS MAY DIFFER ***/!!!
MINUS
SELECT location_id  FROM
departments;
```

In Snowflake, the INTERSECT and MINUS/EXCEPT operators will always remove duplicate values.

#### Best Practices

* Check alternatives in Snowflake to emulate the functionality of the “all” quantifier. Below is a workaround for `MINUS ALL` and `EXCEPT ALL`.

```sql
 SELECT location_id FROM
(
    SELECT location_id, ROW_NUMBER()OVER(PARTITION BY location_id ORDER BY 1) rn
    FROM locations
    MINUS
    SELECT number_val, ROW_NUMBER()OVER(PARTITION BY location_id ORDER BY 1) rn
    FROM departments
);
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0072

Procedural Member not supported

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag -t JavaScript or –PLTargetLanguage JavaScript

### Description

A procedural member is not currently supported. Example of procedural members:

* Constant declarations.
* Cursor declarations.
* Pragma declarations.
* Variable declarations.

#### Example Code

##### Oracle:

```sql
 -- Additional Params: -t JavaScript
CREATE OR REPLACE EDITIONABLE PROCEDURE PROCEDURE1
   IS
   PRAGMA AUTONOMOUS_TRANSACTION;
BEGIN
    NULL;
END;
```

##### Snowflake Scripting:

```sql
 --** SSC-FDM-OR0007 - SNOWFLAKE DOESN'T SUPPORT VERSIONING OF OBJECTS. DEVELOPERS SHOULD CONSIDER ALTERNATE APPROACHES FOR CODE VERSIONING. **
CREATE OR REPLACE PROCEDURE PROCEDURE1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0072 - PROCEDURAL MEMBER PRAGMA DECLARATION NOT SUPPORTED. ***/!!!
   /*    PRAGMA AUTONOMOUS_TRANSACTION */
   ;
   null;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0075

Labels in statements not supported

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag -t JavaScript or –PLTargetLanguage JavaScript

### Description

Labels in statements not supported to reference a code block.

#### Example Code

##### Oracle:

```sql
 --Additional Params: -t JavaScript
CREATE OR REPLACE EDITIONABLE PROCEDURE PROCEDURE1
IS
BEGIN
    -- procedure body
    EXIT loop_b;
    -- procedure body continuation
END;
```

##### Snowflake Scripting:

```sql
--Additional Params: -t JavaScript
--** SSC-FDM-OR0007 - SNOWFLAKE DOESN'T SUPPORT VERSIONING OF OBJECTS. DEVELOPERS SHOULD CONSIDER ALTERNATE APPROACHES FOR CODE VERSIONING. **
CREATE OR REPLACE PROCEDURE PROCEDURE1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    // REGION SnowConvert AI Helpers Code
    var RAISE = function (code,name,message) {
        message === undefined && ([name,message] = [message,name])
        var error = new Error(message);
        error.name = name
        SQLERRM = `${(SQLCODE = (error.code = code))}: ${message}`
        throw error;
    };
    var SQL = {
        FOUND : false,
        NOTFOUND : false,
        ROWCOUNT : 0,
        ISOPEN : false
    };
    var _RS, _ROWS, SQLERRM = "normal, successful completion", SQLCODE = 0;
    var getObj = (_rs) => Object.assign(new Object(),_rs);
    var getRow = (_rs) => (values = Object.values(_rs)) && (values = values.splice(-1 * _rs.getColumnCount())) && values;
    var fetch = (_RS,_ROWS,fmode) => _RS.getRowCount() && _ROWS.next() && (fmode ? getObj : getRow)(_ROWS) || (fmode ? new Object() : []);
    var EXEC = function (stmt,binds,opts) {
        try {
            binds = !(arguments[1] instanceof Array) && ((opts = arguments[1]) && []) || (binds || []);
            opts = opts || new Object();
            binds = binds ? binds.map(fixBind) : binds;
            _RS = snowflake.createStatement({
                    sqlText : stmt,
                    binds : binds
                });
            _ROWS = _RS.execute();
            if (opts.sql !== 0) {
                var isSelect = stmt.toUpperCase().trimStart().startsWith("SELECT");
                var affectedRows = isSelect ? _RS.getRowCount() : _RS.getNumRowsAffected();
                SQL.FOUND = affectedRows != 0;
                SQL.NOTFOUND = affectedRows == 0;
                SQL.ROWCOUNT = affectedRows;
            }
            if (opts.row === 2) {
                return _ROWS;
            }
            var INTO = function (opts) {
                if (opts.vars == 1 && _RS.getColumnCount() == 1 && _ROWS.next()) {
                    return _ROWS.getColumnValue(1);
                }
                if (opts.rec instanceof Object && _ROWS.next()) {
                    var recordKeys = Object.keys(opts.rec);
                    Object.assign(opts.rec,Object.fromEntries(new Map(getRow(_ROWS).map((element,Index) => [recordKeys[Index],element]))))
                    return opts.rec;
                }
                return fetch(_RS,_ROWS,opts.row);
            };
            var BULK_INTO_COLLECTION = function (into) {
                for(let i = 0;i < _RS.getRowCount();i++) {
                    FETCH_INTO_COLLECTIONS(into,fetch(_RS,_ROWS,opts.row));
                }
                return into;
            };
            if (_ROWS.getRowCount() > 0) {
                return _ROWS.getRowCount() == 1 ? INTO(opts) : BULK_INTO_COLLECTION(opts);
            }
        } catch(error) {
            RAISE(error.code,error.name,error.message)
        }
    };
    var FETCH_INTO_COLLECTIONS = function (collections,fetchValues) {
        for(let i = 0;i < collections.length;i++) {
            collections[i].push(fetchValues[i]);
        }
    };
    var IS_NULL = (arg) => !(arg || arg === 0);
    var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
    var fixBind = function (arg) {
        arg = arg instanceof Date ? formatDate(arg) : IS_NULL(arg) ? null : arg;
        return arg;
    };
    // END REGION

    /*     -- procedure body
        EXIT loop_b */
    // procedure body
    // procedure body
    ;
    // procedure body continuation
    ;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0076

Built In Package Not Supported.

### Severity

Medium

### Description

Translation for built-in packages is not currently supported.

#### Example Code

##### Input Code (Oracle):

```sql
 SELECT
UTL_RAW.CAST_TO_RAW('some magic here'),
DBMS_UTILITY.GET_TIME
FROM DUAL;
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'UTL_RAW.CAST_TO_RAW' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS CAST_TO_RAW,
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0076 - TRANSLATION FOR BUILT-IN PACKAGE 'DBMS_UTILITY.GET_TIME' IS NOT CURRENTLY SUPPORTED. ***/!!!
'' AS GET_TIME
FROM DUAL;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0078

Unable to parse dynamic SQL statement inside Execute Immediate.

### Severity

Medium

### Description

SnowConvert AI could not parse the dynamic SQL statement inside the Execute Immediate.

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag `-t JavaScript` or `--PLTargetLanguage JavaScript`

#### Example Code

##### Oracle:

```sql
 --Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE PROC1 AS
BEGIN
    EXECUTE IMMEDIATE 'NOT A VALID SQL STATEMENT';
END;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0078 - UNABLE TO PARSE DYNAMIC SQL STATEMENT ***/!!!
    /*EXEC(`NOT A VALID SQL STATEMENT`)*/
    ;
$$;
```

#### Best Practices

* Check the dynamic SQL statement for any syntax error.
* Review the SnowConvert AI documentation to see if the statement is still unsupported.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0082

Cannot Convert Nested Type Attribute Expression

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Severity

Medium

### Description

This error message appears when a query, like a select, tries to access an attribute within a column that was defined as a type. These cannot be automatically converted, but they can be quickly converted by hand.

#### Example Code:

##### Input Code Oracle:

```sql
 CREATE TYPE type1 AS OBJECT (
  attribute1 VARCHAR2(20),
  attribute2 NUMBER
);
CREATE TYPE type2 AS OBJECT (
  property1 type1,
  property2 DATE
);
CREATE TABLE my_table (
  id NUMBER PRIMARY KEY,
  column1 type2
);
INSERT INTO my_table VALUES (
  1, type2(type1('value1', 100), SYSDATE)
);
SELECT column1.property1.attribute1, column1.property2
FROM my_table;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE type1 AS OBJECT (
  attribute1 VARCHAR2(20),
  attribute2 NUMBER
)
;

!!!RESOLVE EWI!!! /*** SSC-EWI-0056 - CUSTOM TYPES ARE NOT SUPPORTED IN SNOWFLAKE BUT REFERENCES TO THIS CUSTOM TYPE WERE CHANGED TO VARIANT ***/!!!
CREATE TYPE type2 AS OBJECT (
  property1 type1,
  property2 DATE
)
;

CREATE OR REPLACE TABLE my_table (
  id NUMBER(38, 18) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/ PRIMARY KEY,
  column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'type2' USAGE CHANGED TO VARIANT ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
;

CREATE OR REPLACE VIEW my_table_view
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
  id,
  column1:property1:attribute1 :: VARCHAR AS attribute1,
  column1:property1:attribute2 :: NUMBER AS attribute2,
  column1:property2 :: DATE AS property2
FROM
  my_table;

INSERT INTO my_table
VALUES (
  1, type2(type1('value1', 100) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'type1' NODE ***/!!!, CURRENT_TIMESTAMP()) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'type2' NODE ***/!!!
);

SELECT column1.property1.attribute1,
  column1.property2
FROM
  my_table;
```

#### Best Practices

* The code can be manually fixed by changing the ‘.’ accessor for the ‘:’ wherever a type column is being accessed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0087

Ordering of the Outer Joins failed

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This issue happens when an error occurred while reordering the new ANSI JOIN clauses in a query that previously had outer joins with the (+) operator. A query with a cycle of tables joining each other in the WHERE clause can provoke this issue.

When this EWI is present, the JOIN clauses may not work properly due to their order.

#### Example Code

##### Input Code Oracle:

```sql
 SELECT
l.location_id, l.state_province,
r.region_id, r.region_name,
c.country_id, c.country_name
FROM
hr.countries c,  hr.regions r,  hr.locations l, hr.departments d WHERE
l.location_id (+) = c.region_id AND
c.region_id (+) = r.region_id AND
r.region_id (+) = c.region_id AND
l.location_id (+) = d.location_id;
```

##### Generated Code:

```sql
 SELECT
l.location_id, l.state_province,
r.region_id, r.region_name,
c.country_id, c.country_name
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0087 - ORDERING THE OUTER JOINS FAILED. QUERY MAY NOT BEHAVE CORRECTLY ***/!!!
hr.departments d
LEFT OUTER JOIN
hr.locations l
ON
l.location_id = c.region_id
AND
l.location_id = d.location_id
LEFT OUTER JOIN
hr.countries c
ON
c.region_id = r.region_id
LEFT OUTER JOIN
hr.regions r
ON
r.region_id = c.region_id;
```

* Make sure the query is valid and does not have tables that are being joined to each other.
* If the issue still occurs, try qualifying the name of each column in the WHERE clause with the name of the table.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0089

REGEXP_LIKE_UDF match parameter may not behave correctly

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-OR0044](../functional-difference/oracleFDM.md) documentation.

### Severity

Low

### Description

This warning appears when the Oracle `REGEXP_LIKE`condition comes with the third parameter (match parameter)*.* The reason to add the warning is that the `REGEXP_LIKE_UDF`used to replace the `REGEXP_LIKE`does not recognize all the characters used by the match parameter, so the result of the query in Snowflake may not be equivalent to Oracle.

#### Example Code

##### Input Code Oracle:

```sql
 SELECT last_name
FROM hr.employees
WHERE REGEXP_LIKE (last_name, '([aeiou])\1', 'i')
ORDER BY last_name;
```

##### Generated Code:

```sql
 SELECT last_name
FROM
hr.employees
WHERE
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0089 - REGEXP_LIKE_UDF MATCH PARAMETER MAY HAVE SOME FUNCTIONAL DIFFERENCES COMPARED TO ORACLE. ***/!!!
PUBLIC.REGEXP_LIKE_UDF(last_name, '([aeiou])\\1', 'i')
ORDER BY last_name;
```

* When the `REGEXP_LIKE` condition includes characters that are not supported by the user-defined function, you can change the regular
  expression to simulate the behavior of the missing character in the match parameter. For more information about unsupported characters,
  see [REGEXP_LIKE_UDF](../../../../translation-references/oracle/functions/README.md).
* For additional support, contact Snowflake at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-OR0090

Non-Ansi Outer Join has an invalid Between predicate

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This issue happens when there is an OUTER JOIN with the (+) operator inside a BETWEEN clause that cannot be executed in Snowflake. This generally happens when multiple tables are used in the interval of the BETWEEN clause.

#### Example Code

##### Input Code Oracle:

```sql
 SELECT
*
FROM
hr.countries c, hr.regions r,  hr.locations l WHERE
l.location_id  BETWEEN r.region_id(+) AND c.region_id(+);
```

##### Generated Code:

```sql
 SELECT
*
FROM
hr.countries c,
hr.regions r,
hr.locations l WHERE
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0090 - INVALID NON-ANSI OUTER JOIN BETWEEN PREDICATE CASE FOR SNOWFLAKE. ***/!!!
l.location_id  BETWEEN r.region_id(+) AND c.region_id(+);
```

#### Best Practices

* Manually change the Outer Join to ANSI syntax.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0092

NUMBER datatype negative scale was removed from output

### Severity

Low

### Description

This issue happens when a NUMBER with a negative scale is being used to apply rounding to the NUMBER. Snowflake does not support this feature, and this message is used to indicate that the Scale was removed.

#### Example Code

##### Input Code Oracle:

##### Queries

```sql
 CREATE TABLE number_table
(
	col1 NUMBER(38),
	col2 NUMBER(38, -1),
	col3 NUMBER(*, -2)
);

INSERT INTO number_table(col1, col2, col3) VALUES (555, 555, 555);

SELECT * FROM number_table;
```

##### Result

```none
COL1|COL2|COL3|
----+----+----+
 555| 560| 600|
```

##### Generated Code:

##### Queries

```sql
 CREATE OR REPLACE TABLE number_table
	(
		col1 NUMBER(38) /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
		col2 NUMBER(38) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0092 - NUMBER DATATYPE NEGATIVE SCALE WAS REMOVED FROM OUTPUT ***/!!! /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/,
		col3 NUMBER(38) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0092 - NUMBER DATATYPE NEGATIVE SCALE WAS REMOVED FROM OUTPUT ***/!!! /*** SSC-FDM-0006 - NUMBER TYPE COLUMN MAY NOT BEHAVE SIMILARLY IN SNOWFLAKE. ***/
	)
	COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
	;

	INSERT INTO number_table(col1, col2, col3) VALUES (555, 555, 555);

	SELECT * FROM
	number_table;
```

##### Result

```sql
 |COL1|COL2|COL3|
|----|----|----|
|555 |555 |555 |
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0095

Operation Between Interval Type and Date Type not Supported

### Severity

Low

### Description

`INTERVAL YEAR TO MONTH` and `INTERVAL DAY TO SECOND` are not a supported data type, they are transformed to `VARCHAR(20)`. Therefore all arithmetic operations between **Date Types** and the original **Interval Type Columns** are not supported.

Furthermore, operations between an Interval Type and Date Type (in this order) are not supported in Snowflake; and these operations use this EWI as well.

#### Example Code

##### Input Code:

```sql
 CREATE TABLE table_with_intervals
(
    date_col DATE,
    time_col TIMESTAMP,
    intervalYearToMonth_col INTERVAL YEAR TO MONTH,
    intervalDayToSecond_col INTERVAL DAY TO SECOND
);

-- Date + Interval Y to M
SELECT date_col + intervalYearToMonth_col FROM table_with_intervals;

-- Date - Interval D to S
SELECT date_col - intervalDayToSecond_col FROM table_with_intervals;

-- Timestamp + Interval D to S
SELECT time_col + intervalDayToSecond_col FROM table_with_intervals;

-- Timestamp - Interval Y to M
SELECT time_col - intervalYearToMonth_col FROM table_with_intervals;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table_with_intervals
    (
        date_col TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/,
        time_col TIMESTAMP(6),
        intervalYearToMonth_col VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR TO MONTH DATA TYPE CONVERTED TO VARCHAR ***/!!!,
        intervalDayToSecond_col VARCHAR(20) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY TO SECOND DATA TYPE CONVERTED TO VARCHAR ***/!!!
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
    ;

    -- Date + Interval Y to M
    SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! date_col + intervalYearToMonth_col FROM
    table_with_intervals;

    -- Date - Interval D to S
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! date_col - intervalDayToSecond_col FROM
    table_with_intervals;

    -- Timestamp + Interval D to S
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! time_col + intervalDayToSecond_col FROM
    table_with_intervals;

    -- Timestamp - Interval Y to M
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0095 - OPERATION BETWEEN INTERVAL TYPE AND DATE TYPE NOT SUPPORTED ***/!!! time_col - intervalYearToMonth_col FROM
    table_with_intervals;
```

#### Best Practices

* Implement the UDF to simulate the Oracle behavior.
* Extract the already transformed value that was stored in the column during migration, and use it as a Snowflake [**Interval Constant**](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) when possible.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

####

## SSC-EWI-OR0097

Procedure Properties are Not Supported in Snowflake Procedures

### Severity

Low

### Description

Oracle `CREATE PROCEDURE` additional properties are not required and have no equivalent by Snowflake `CREATE PROCEDURE`.

#### Example Code

##### Input Code Oracle:

```sql
 CREATE OR REPLACE PROCEDURE PROC01
DEFAULT COLLATION USING_NLS_COMP
AUTHID CURRENT_USER
ACCESSIBLE BY (PROCEDURE PROC03)
AS
BEGIN
    NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE PROC01 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0097 - PROCEDURE PROPERTIES ARE NOT SUPPORTED IN SNOWFLAKE PROCEDURES ***/!!!
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0099

The exception code exceeds the Snowflake Scripting limit

### Severity

Low

### Description

This EWI appears when an exception declaration error code exceeds the Snowflake Scripting exception number limits. The number must be an integer between -20000 and -20999.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE procedure_exception
IS
my_exception EXCEPTION;
PRAGMA EXCEPTION_INIT ( my_exception, -19000 );
BEGIN
    NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE procedure_exception ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0099 - EXCEPTION CODE NUMBER EXCEEDS SNOWFLAKE SCRIPTING LIMITS ***/!!!
        my_exception EXCEPTION;
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0051 - PRAGMA EXCEPTION_INIT IS NOT SUPPORTED ***/!!!
        PRAGMA EXCEPTION_INIT ( my_exception, -19000 );
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* Check if the exception code is between the limits allowed by Snowflake Scripting, if not change it for another exception number available.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0100

For Loop With Multiple Conditions Is Currently Not Supported By Snowflake Scripting. Only First Condition Is Used

### Severity

Low

### Description

Oracle allows multiple conditions in a single `FOR LOOP` however, Snowflake Scripting only allows one condition per `FOR LOOP`. Only the first condition is migrated and the others are ignored during transformation.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE P3
AS
BEGIN
FOR i IN REVERSE 1..3,
REVERSE i+5..i+7
LOOP
    NULL;
END LOOP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE P3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0100 - FOR LOOP WITH MULTIPLE CONDITIONS IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR i IN REVERSE 1 TO 3
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            NULL;
        END LOOP;
    END;
$$;
```

#### Best Practices

* Separate the `FOR LOOP` into different loops or rewrite the condition.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0101

Specific For Loop Clause Is Currently Not Supported By Snowflake Scripting

### Severity

Low

### Description

Oracle allows additional clauses to the `FOR LOOP` condition. Like the **BY,** **WHILE,** and **WHEN** clauses. Both **WHILE** and **WHEN** clauses allow for an extra boolean expression as a condition. While the **BY** clause allows a stepped increment in the iteration. These additional clauses are not supported in Snowflake Scripting and are ignored during transformation.

#### Example Code

##### Input Code Oracle:

```sql
 CREATE OR REPLACE PROCEDURE P2
AS
BEGIN
FOR i IN 1..10 WHILE i <= 5 LOOP
    NULL;
END LOOP;

FOR i IN 5..15 BY 5 LOOP
    NULL;
END LOOP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE P2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "WHILE" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR i IN 1 TO 10
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         LOOP
                                NULL;
END LOOP;
                         !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "BY" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
                         --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                         FOR i IN 5 TO 15
                                          --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                          LOOP
                                   NULL;
END LOOP;
    END;
$$;
```

#### Best Practices

* Separate the `FOR LOOP` into different loops or rewrite the condition.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0103

For Loop Format Is Currently Not Supported By Snowflake Scripting

### Severity

High

### Description

Oracle allows different types of conditions for a `FOR LOOP`. It supports boolean expressions, collections, records… However, Snowflake scripting only supports `FOR LOOP` with defined integers as bounds. All other formats are marked as not supported and require additional manual effort to be transformed.

[Oracle iteration control clauses](https://docs.oracle.com/en/database/oracle/oracle-database/21/lnpls/iterator.html#GUID-BD211E6F-8B4A-494A-AECF-AC26A241FF98) that are not supported in Snowflake `FOR LOOP`:

* `single_expression_control`
* `values_of_control`
* `indices_of_control`
* `pairs_of_control`

> **Danger:**
>
> `cursor_iteration_control` is currently marked as not supported. Removing parenthesis from the expression should transform it as a CURSOR FOR LOOP.
>
> **Original:**
>
> `FOR i IN (cursor_variable) LOOP NULL; END LOOP;`
>
> **Should be changed to:**
>
> `FOR i IN cursor_variable LOOP NULL; END LOOP;`

#### Example Code

##### Input Code Oracle:

```sql
 CREATE OR REPLACE PROCEDURE P3
AS
TYPE values_aat IS TABLE OF PLS_INTEGER INDEX BY PLS_INTEGER;
l_employee_values   values_aat;
BEGIN
FOR power IN REPEAT power*2 WHILE power <= 64 LOOP
    NULL;
END LOOP;

FOR i IN VALUES OF l_employee_values LOOP
    NULL;
END LOOP;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE P3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        TYPE values_aat IS TABLE OF PLS_INTEGER INDEX BY PLS_INTEGER;
        l_employee_values VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'values_aat' USAGE CHANGED TO VARIANT ***/!!!;
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0103 - FOR LOOP FORMAT IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0101 - FOR LOOP WITH "WHILE" CLAUSE IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        FOR power IN REPEAT power*2 WHILE power <= 64
                                                      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                                      LOOP
            NULL;
        END LOOP;
        !!!RESOLVE EWI!!! /*** SSC-EWI-OR0103 - FOR LOOP FORMAT IS CURRENTLY NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **

        FOR i IN VALUES OF :l_employee_values
                                              --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                              LOOP
            NULL;
        END LOOP;
    END;
$$;
```

#### Best Practices

* Rewrite the `FOR LOOP` condition or use a different kind of `LOOP` to simulate the behavior.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0104

Unusable collection variable

### Severity

High

### Description

Oracle collections are not currently supported by SnowConvert AI, all the collection types variables and their usages will be commented out.

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag `-t JavaScript` or `--PLTargetLanguage JavaScript`

#### Example Code

##### Input Code Oracle:

```sql
 -- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE collection_variable_sample_proc
IS
    TYPE POPULATION IS TABLE OF NUMBER INDEX BY VARCHAR2(64); --Associative array
    city_population POPULATION := POPULATION();
    i  VARCHAR2(64);
BEGIN
	city_population('Smallville')  := 2000;
    city_population('Midland')     := 750000;

    i := city_population.FIRST;
    i := city_population.NEXT(1);
END;
```

##### Output Cod

```sql
 CREATE OR REPLACE PROCEDURE collection_variable_sample_proc ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "12/16/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	!!!RESOLVE EWI!!! /*** SSC-EWI-OR0072 - PROCEDURAL MEMBER TYPE DEFINITION NOT SUPPORTED. ***/!!!
	/*     TYPE POPULATION IS TABLE OF NUMBER INDEX BY VARCHAR2(64) */
	;
	!!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
	/*     city_population POPULATION := POPULATION() */
	;
	let I;
	!!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
	/* 	city_population('Smallville')  := 2000 */
	;
	!!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
	/*     city_population('Midland')     := 750000 */
	;
	I =
		!!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
		/*city_population.FIRST*/
		null;
	I =
		!!!RESOLVE EWI!!! /*** SSC-EWI-OR0104 - UNUSABLE VARIABLE, ITS TYPE WAS NOT TRANSFORMED ***/!!!
		/*city_population.NEXT(1)*/
		null;
$$;
```

#### Best Practices

* No end-user action is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0105

Additional work is needed for BFILE column usage. BUILD_STAGE_URL function is a recommended workaround

### Severity

Low

### Description

The transformation for `BFILE` datatype is `VARCHAR`. However, the translation for the Oracle built-in functions used to interact with BFILE types is currently not supported. The column is migrated to a `VARCHAR` to store the file path and name. For more information, see the `BFILENAME_UDF` documentation.

> **Note:**
>
> The `BUILD_STAGE_FILE_URL` function is a recommended workaround to work with files in Snowflake. It returns a link to the specified file stored in a [stage](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html#create-stage). See the [BUILD_STAGE_FILE_URL function documentation](https://docs.snowflake.com/en/sql-reference/functions/build_stage_file_url.html#build-stage-file-url).

#### Example Code

##### Input Code Oracle:

```sql
 CREATE TABLE bfiletable ( bfile_column BFILE );

INSERT INTO bfiletable VALUES ( BFILENAME('mydirectory', 'myfile.png') );
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE bfiletable ( bfile_column
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0105 - ADDITIONAL WORK IS NEEDED FOR BFILE COLUMN USAGE. BUILD_STAGE_FILE_URL FUNCTION IS A RECOMMENDED WORKAROUND ***/!!!
VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
;

INSERT INTO bfiletable
VALUES (PUBLIC.BFILENAME_UDF('mydirectory', 'myfile.png') );
```

#### Best Practices

* Use the `BUILD_STAGE_FILE_URL` and the other [file functions](https://docs.snowflake.com/en/sql-reference/functions-file.html#file-functions) to handle files.

##### Snowflake Query

```sql
 CREATE OR REPLACE TABLE bfiletable ( bfile_column
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0105 - ADDITIONAL WORK IS NEEDED FOR BFILE COLUMN USAGE. BUILD_STAGE_FILE_URL FUNCTION IS A RECOMMENDED WORKAROUND ***/!!!
VARCHAR
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
;

INSERT INTO bfiletable
VALUES (PUBLIC.BFILENAME_UDF('mydirectory', 'myfile.png') );
```

##### Result

```none
URL                                                                                                   |
------------------------------------------------------------------------------------------------------+
https://thecompany.snowflakecomputing.com/api/files/CODETEST/PUBLIC/MY_STAGE/%2Fmydirectory%2Fmyfile.jpg|
```

> **Note:**
>
> This function works with different cloud storage options, but for information regarding using local files with stages, check this [documentation](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-stage.html#staging-data-files-from-a-local-file-system).

* Change the data type to a supported type.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0108

The Following Assignment Statement is Not Supported by Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

### Description

Some Oracle variable types do not have a direct translation in Snowflake. Currently, transformation for cursor, collection, record, and user-defined type variables; as well as placeholders, objects, and output parameters are not supported by Snow Scripting.

Changing these variables to Snowflake [semi-structured data types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html#semi-structured-data-types) could help as a workaround in some scenarios.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE pinvalidassign(out_parameter   IN OUT NUMBER)
AS
record_variable       employees%ROWTYPE;

TYPE cursor_type IS REF CURSOR;
cursor1   cursor_type;
cursor2   SYS_REFCURSOR;

TYPE collection_type IS TABLE OF NUMBER INDEX BY VARCHAR(64);
collection_variable     collection_type;

BEGIN
--Record Example
  record_variable.last_name := 'Ortiz';

--Cursor Example
  cursor1 := cursor2;

--Collection
  collection_variable('Test') := 5;

--Out Parameter
  out_parameter := 123;
END;
```

##### Generated Code:

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "employees" **
CREATE OR REPLACE PROCEDURE pinvalidassign (out_parameter OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    record_variable OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL REF CURSOR TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--    TYPE cursor_type IS REF CURSOR;
    cursor1_res RESULTSET;
    cursor2_res RESULTSET;
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'PL COLLECTION TYPE DEFINITION' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--    TYPE collection_type IS TABLE OF NUMBER INDEX BY VARCHAR(64);
    collection_variable VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0062 - CUSTOM TYPE 'collection_type' USAGE CHANGED TO VARIANT ***/!!!;
  BEGIN
    --Record Example
    record_variable := OBJECT_INSERT(record_variable, 'LAST_NAME', 'Ortiz', true);

    --Cursor Example
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
      cursor1 := :cursor2;

    --Collection
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0108 - THE FOLLOWING ASSIGNMENT STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
      collection_variable('Test') := 5;
    --Out Parameter
    out_parameter := 123;
  END;
$$;
```

#### Best Practices

* Change the variable data type or try to simulate the behavior using Snowflake [semi-structured data types](https://docs.snowflake.com/en/sql-reference/data-types-semistructured.html#semi-structured-data-types).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0109

Expressions as arguments of Using Clause are not supported by Snowflake Scripting

### Severity

Medium

### Description

Oracle supports using expressions as arguments to any USING Clause for the EXECUTE IMMEDIATE statements. This functionality is not supported by Snowflake Scripting.

Snowflake Scripting does support variable expressions, and this it is possible to replace the expression by manually assigning it to a variable (see example below).

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE expression_arguments
IS
  immediate_input INTEGER := 0;
BEGIN
  EXECUTE IMMEDIATE 'INSERT INTO immediate_table VALUES (:value)' USING immediate_input+1;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE expression_arguments ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    immediate_input INTEGER := 0;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'INSERT INTO immediate_table
VALUES (?)' USING (
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0109 - EXPRESSIONS AS ARGUMENTS OF USING CLAUSE IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
    :immediate_input +1);
  END;
$$;
```

##### Manually migrated Execute Immediate procedure:

Replacing this procedure with the one above will solve the compilation error, and yield the same results as Oracle.

```sql
 CREATE OR REPLACE PROCEDURE PUBLIC.expression_arguments ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   DECLARE
      immediate_input INTEGER := 0;
      using_argument_variable INTEGER;
   BEGIN
      using_argument_variable := immediate_input+1;
      EXECUTE IMMEDIATE 'INSERT INTO PUBLIC.immediate_table VALUES (?)' USING (using_argument_variable );
   END;
$$;
```

#### Best Practices

* Procedures can be manually migrated by adding a variable and then assigning the expression to said variable.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0110

For Update Clause is not supported in Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

High

### Description

There is no equivalent for `FOR UPDATE` clause in Snow Scripting so an EWI is added and the clause is commented out

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE for_update_clause
AS
    update_record f_employee%rowtype;
    CURSOR c1 IS SELECT * FROM f_employee FOR UPDATE OF employee_number nowait;
BEGIN
    FOR CREC IN C1 LOOP
	UPDATE f_employee SET employee_number = employee_number + 1000 WHERE CURRENT OF c1;
	IF crec.id = 2 THEN
	    DELETE FROM f_employee WHERE CURRENT OF c1;
	    EXIT;
	END IF;
    END LOOP;
END;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "f_employee" **
CREATE OR REPLACE PROCEDURE for_update_clause ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		update_record OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
		--** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
		c1 CURSOR
		FOR
			SELECT * FROM
				f_employee
			!!!RESOLVE EWI!!! /*** SSC-EWI-OR0110 - FOR UPDATE CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
			FOR UPDATE OF employee_number nowait;
	BEGIN
		OPEN C1;
		--** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
		FOR CREC IN C1 DO
			!!!RESOLVE EWI!!! /*** SSC-EWI-OR0136 - CURRENT OF CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
			UPDATE f_employee
				SET employee_number =
 				                     !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!! employee_number + 1000 WHERE CURRENT OF c1;
			IF (crec.id = 2) THEN
--				!!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'CURRENT OF' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--				DELETE FROM
--					f_employee
--				WHERE CURRENT OF c1
				                   ;
				EXIT;
			END IF;
		END FOR;
		CLOSE C1;
	END;
$$;
```

#### Best Practices

* Handle the column update in the `UPDATE/DELETE` query for more details check SSC-EWI-OR0136.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0116

Operations between Intervals are not supported

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This error is added when there is an arithmetical operation whose operands are only intervals, this kind of operation is not supported by Snowflake.

#### Example Code

##### Input Code:

```sql
 SELECT INTERVAL '1-1' YEAR(2) TO MONTH + INTERVAL '1-1' YEAR(2) + INTERVAL '1-1' YEAR(2) TO MONTH FROM dual;

SELECT INTERVALCOLUMN + INTERVAL '1-1' YEAR(2) TO MONTH FROM INTERVALTABLE;
```

##### Generated Code:

```sql
 SELECT
--INTERVAL '1-1 year' + INTERVAL '1y, 1mm' + INTERVAL '1y, 1mm'
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0116 - OPERATIONS BETWEEN INTERVALS ARE NOT SUPPORTED BY SNOWFLAKE ***/!!!
null
FROM dual;

SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN Unknown AND Interval ***/!!! INTERVALCOLUMN + INTERVAL '1y, 1mm'
FROM
INTERVALTABLE;
```

#### Best Practices

* Depending on where the operation is located, it could be relocated and made valid by adding dates or timestamps.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0118

Built-In Views/Tables are not supported by Snowflake

### Severity

Medium

### Description

Oracle has a [set of built-in views and tables](https://docs.oracle.com/en/database/oracle/oracle-database/21/refrn/static-data-dictionary-views-1.html#GUID-41B62782-83FA-4066-8C56-0D0B66CC0EC7), that are not present in Snowflake, SnowConvert AI adds an error message to queries and statements that use these elements.

#### Example Code

##### Input Code:

```sql
 SELECT * FROM ALL_COL_COMMENTS;
SELECT * FROM (SELECT * FROM ALL_COL_COMMENTS);
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0118 - TRANSLATION FOR ORACLE BUILT-IN TABLE/VIEW 'ALL_COL_COMMENTS' IS NOT CURRENTLY SUPPORTED. ***/!!!
 * FROM
 ALL_COL_COMMENTS;

SELECT * FROM (SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0118 - TRANSLATION FOR ORACLE BUILT-IN TABLE/VIEW 'ALL_COL_COMMENTS' IS NOT CURRENTLY SUPPORTED. ***/!!! * FROM
ALL_COL_COMMENTS);
```

#### Best Practices

* Some information provided by Oracle Built-In views, can be found in Snowflake [Information Schema](https://docs.snowflake.com/en/sql-reference/info-schema.html#snowflake-information-schema) or using [SHOW](https://docs.snowflake.com/en/sql-reference/sql/show.html) command.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0121

Using DBMS_LOB.SUBSTR built-in package with a BFILE column is not supported in Snowflake

### Severity

Medium

### Description

Oracle BFILE columns are migrated to VARCHAR in Snowflake. The file name is stored as a string in the new column. Therefore, using a SUBSTR function, in Snowflake, on the migrated column will return a substring of the file name. While Oracle DBMS_LOB.SUBSTR will return a substring of the file content. For more information review [BFILE data type](../../../../translation-references/oracle/basic-elements-of-oracle-sql/data-types/oracle-built-in-data-types.md).

#### Example Code

##### Input Code:

```sql
 CREATE TABLE table1
(
    bfile_column BFILE
)
SELECT
DBMS_LOB.SUBSTR(bfile_column, 15, 1)
FROM table1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table1
    (
        bfile_column
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0105 - ADDITIONAL WORK IS NEEDED FOR BFILE COLUMN USAGE. BUILD_STAGE_FILE_URL FUNCTION IS A RECOMMENDED WORKAROUND ***/!!!
    VARCHAR
    )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
    ;
    SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0121 - USING DBMS_LOB.SUBSTR ON BFILE SOURCE COLUMN IS NOT SUPPORTED ON SNOWFLAKE ***/!!!
    SUBSTR(bfile_column, 1, 15)
    FROM
    table1;
```

#### Best Practices

* To handle files with Snowflake, see the [UTL_FILE handling documentation](../../../../translation-references/oracle/built-in-packages.md).
* For additional support, contact SnowConvert at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-OR0123

Database Link connections not supported

### Severity

Medium

### Description

A database link connection reference was removed from the object name because the database links and its references are not supported in Snowflake. The only part that is kept is the name before the `@` character.

#### Example Code

##### Input Code:

```sql
 -- Creation of the database link
CREATE DATABASE LINK mylink
    CONNECT TO user1 IDENTIFIED BY password1
    USING 'connection_str';

-- Statements that use the database link we created
SELECT * FROM employees@mylink;

INSERT INTO employees@mylink
    (employee_id, last_name, email, hire_date, job_id)
    VALUES (999, 'Claus', 'sclaus@oracle.com', SYSDATE, 'SH_CLERK');

UPDATE employees@mylink SET min_salary = 3000
    WHERE job_id = 'SH_CLERK';

DELETE FROM employees@mylink
    WHERE employee_id = 999;
```

##### Generated Code:

```sql
 ---- Creation of the database link
----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE DATABASE LINK IS OUT OF TRANSLATION SCOPE. **
--CREATE DATABASE LINK mylink
--    CONNECT TO user1 IDENTIFIED BY password1
--    USING 'connection_str'

    -- Statements that use the database link we created
SELECT * FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink | USER: user1/password1 | CONNECTION: 'connection_str' ] ***/!!!
    employees;

INSERT INTO
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink | USER: user1/password1 | CONNECTION: 'connection_str' ] ***/!!!
employees
    (employee_id, last_name, email, hire_date, job_id)
    VALUES (999, 'Claus', 'sclaus@oracle.com', CURRENT_TIMESTAMP(), 'SH_CLERK');

UPDATE
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink | USER: user1/password1 | CONNECTION: 'connection_str' ] ***/!!!
employees
    SET min_salary = 3000
    WHERE job_id = 'SH_CLERK';

DELETE FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-OR0123 - DBLINK CONNECTIONS NOT SUPPORTED [ DBLINK : mylink | USER: user1/password1 | CONNECTION: 'connection_str' ] ***/!!!
    employees
    WHERE employee_id = 999;
```

#### Best Practices

* It is important to check that all DB Links have different names, if two DB Links share the same and the code is migrated multiple times, then the EWI can change de information based on what DB Link is processed first.
* Move the database objects from the database link reference into the same database instance that is being used in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0126

Unusable object because its built-in custom type is not supported

### Severity

Medium

### Description

This error appears to indicate whether an object with a built-in custom type is being used.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE proc01 is
   var1 DBMS_SQL.VARCHAR2_TABLE;
   var2 CTX_CLS.DOC_TAB;
BEGIN
   varX := var1.property;
   varY := var2(1);
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc01 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      var1 VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'DBMS_SQL.VARCHAR2_TABLE' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/;
      var2 VARIANT /*** SSC-FDM-0015 - REFERENCED CUSTOM TYPE 'CTX_CLS.DOC_TAB' IN QUERY NOT FOUND, USAGES MAY BE AFFECTED ***/;
   BEGIN
      varX := var1.property !!!RESOLVE EWI!!! /*** SSC-EWI-OR0126 - UNUSABLE OBJECT var1, BUILT-IN CUSTOM TYPES ARE NOT SUPPORTED ***/!!!;
      varY := var2(1) !!!RESOLVE EWI!!! /*** SSC-EWI-OR0126 - UNUSABLE OBJECT var2, BUILT-IN CUSTOM TYPES ARE NOT SUPPORTED ***/!!!;
   END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWI

* [SSC-FDM-0015](../functional-difference/generalFDM.md): Data Type Not Recognized.

## SSC-EWI-OR0128

Boolean cursor attribute is not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

### Description

This message is used to indicate that a boolean cursor attribute is not supported in SnowScript or that there is no transformation that emulates its functionality in SnowScript. The following table shows the boolean cursor attributes that can be emulated:

| Boolean Cursor Attribute | Status |
| --- | --- |
| `%FOUND` | Can be emulated |
| `%NOTFOUND` | Can be emulated |
| `%ISOPEN` | Not Supported |

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE cursor_attributes_proc
IS
    is_open_attr BOOLEAN;
    found_attr BOOLEAN;
    my_record table1%ROWTYPE;
    CURSOR my_cursor IS SELECT * FROM table1;
BEGIN
    OPEN my_cursor;
    LOOP
        FETCH my_cursor INTO my_record;
        EXIT WHEN my_cursor%NOTFOUND;
        is_open_attr := my_cursor%ISOPEN;
        found_attr := my_cursor%FOUND;
    END LOOP;
    CLOSE my_cursor;
END;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table1" **
CREATE OR REPLACE PROCEDURE cursor_attributes_proc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        is_open_attr BOOLEAN;
        found_attr BOOLEAN;
        my_record OBJECT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - ROWTYPE DATA TYPE CONVERTED TO OBJECT ***/!!! := OBJECT_CONSTRUCT();
        --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
        my_cursor CURSOR
        FOR
            SELECT
                OBJECT_CONSTRUCT( *) sc_cursor_record FROM
                table1;
    BEGIN
        OPEN my_cursor;
        --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
        LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
            FETCH my_cursor INTO
                :my_record;
            IF (my_record IS NULL) THEN
                EXIT;
            END IF;
            is_open_attr := null /*my_cursor%ISOPEN*/!!!RESOLVE EWI!!! /*** SSC-EWI-OR0128 - BOOLEAN CURSOR ATTRIBUTE %ISOPEN IS NOT SUPPORTED IN SNOWFLAKE ***/!!!;
            found_attr := my_record IS NOT NULL;
        END LOOP;
    CLOSE my_cursor;
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0129

TYPE attribute could not be resolved.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This warning appears when the `TYPE`attribute referenced item could not be resolved and the referencing item’s data type could not be obtained. So the `VARIANT`data type will be assigned instead.

#### Example Code

##### Input Code:

```sql
 CREATE OR REPLACE PROCEDURE procedure01
IS
var1 table01.col1%TYPE;
BEGIN
NULL;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE procedure01 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
var1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0129 - TYPE ATTRIBUTE 'table01.col1%TYPE' COULD NOT BE RESOLVED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
BEGIN
NULL;
END;
$$;
```

#### Best Practices

* Check for the referenced item data type and replace it manually in the referencing item TYPE attribute.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0133

Cursor variable has already been assigned

### Severity

Medium

### Description

When an `OPEN FOR` statement is converted, a cursor assignment with the same name as the cursor variable used in the input code is added along with other statements to emulate its functionality. Since it is possible to use multiple `OPEN FOR` statements with the same cursor variable, there will be multiple cursor assignments with the same name in the output code. Leaving the output code as it is will cause compilation errors when executed in Snowflake.

#### Example code

##### Input code

```sql
 CREATE OR REPLACE PROCEDURE open_for_procedure
AS
	query1 VARCHAR(200) := 'SELECT 123 FROM dual';
	query2 VARCHAR(200) := 'SELECT 456 FROM dual';
	my_cursor_variable SYS_REFCURSOR;
BEGIN
	OPEN my_cursor_variable FOR query1;
	OPEN my_cursor_variable FOR query2;
END;
```

##### Generated Code

```sql
 CREATE OR REPLACE PROCEDURE open_for_procedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"oracle"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		query1 VARCHAR(200) := 'SELECT 123 FROM dual';
		query2 VARCHAR(200) := 'SELECT 456 FROM dual';
		my_cursor_variable_res RESULTSET;
	BEGIN
		!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
		my_cursor_variable_res := (
			EXECUTE IMMEDIATE :query1
		);
		LET my_cursor_variable CURSOR
		FOR
			my_cursor_variable_res;
		OPEN my_cursor_variable;
		!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
		my_cursor_variable_res := (
			EXECUTE IMMEDIATE :query2
		);
		!!!RESOLVE EWI!!! /*** SSC-EWI-OR0133 - THE CURSOR VARIABLE NAMED 'my_cursor_variable' HAS ALREADY BEEN ASSIGNED IN ANOTHER CURSOR ***/!!!
		LET my_cursor_variable CURSOR
		FOR
			my_cursor_variable_res;
		OPEN my_cursor_variable;
	END;
$$;
```

### Related EWI

1. [SSC-EWI-0030](generalEWI.md): The statement below has usages of dynamic SQL.

#### Best Practices

* To solve the compilation errors of the output code the cursor assignments that have the SSC-EWI-OR0133 message should be renamed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0135

Data Retention Period May Produce No Results

### Severity

Low

### Description

If a query is executed in Snowflake using time travel, it could return no results if the specified time is no longer in the range of the data retention period. We recommend to read more about [Snowflake’s Time Travel.](https://docs.snowflake.com/en/user-guide/data-time-travel)

#### Example code

##### Input code

```sql
 SELECT * FROM employees
AS OF TIMESTAMP
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS')
WHERE last_name = 'SampleName';
```

##### Generated Code

```sql
 SELECT * FROM
employees
!!!RESOLVE EWI!!! /*** SSC-EWI-OR0135 - DATA RETENTION PERIOD MAY PRODUCE NO RESULTS ***/!!!
AT (TIMESTAMP =>
TO_TIMESTAMP('2023-09-27 07:00:00', 'YYYY-MM-DD HH:MI:SS'))
WHERE last_name = 'SampleName';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0136

Current of clause is not supported in Snowflake

### Severity

Critical

### Description

Some statements like UPDATE and DELETE can use have a CURRENT OF clause inside the WHERE clause, this is not currently supported by Snowflake.

#### Example Code

##### Oracle:

```sql
 CREATE OR REPLACE PROCEDURE proc_update_current_of
AS
  CURSOR C1
  IS
    SELECT * FROM F_EMPLOYEE FOR UPDATE OF SALARY nowait;
BEGIN
  FOR CREC IN C1
  LOOP
    UPDATE F_EMPLOYEE SET SALARY=SALARY+2000 WHERE CURRENT OF C1;
  END LOOP;
END;
```

##### Snowflake Scripting:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "F_EMPLOYEE" **
CREATE OR REPLACE PROCEDURE proc_update_current_of ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --** SSC-PRF-0009 - PERFORMANCE REVIEW - CURSOR USAGE **
    C1 CURSOR
    FOR
      SELECT * FROM
        F_EMPLOYEE
      !!!RESOLVE EWI!!! /*** SSC-EWI-OR0110 - FOR UPDATE CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
      FOR UPDATE OF SALARY nowait;
  BEGIN
      OPEN C1;
      --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
      FOR CREC IN C1 DO
      !!!RESOLVE EWI!!! /*** SSC-EWI-OR0136 - CURRENT OF CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
      UPDATE F_EMPLOYEE
        SET SALARY=
                   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0036 - TYPES RESOLUTION ISSUES, ARITHMETIC OPERATION '+' MAY NOT BEHAVE CORRECTLY BETWEEN unknown AND Number ***/!!!SALARY+2000 WHERE CURRENT OF C1;
      END FOR;
      CLOSE C1;
  END;
$$;
```

### Related EWI

1. SSC-EWI-OR0036: Types resolution issues, the arithmetic operation may not behave correctly between string and date.
2. [SSC-PRF-0004](../performance-review/generalPRF.md): This statement has usages of cursor for loop.
3. SSC-EWI-OR0110: For Update Clause is not supported in Snowflake.

#### Best Practices

* Redesign the query to normal `UPDATE` or `DELETE` specifying the columns in the `WHERE` clause, consider that if there are duplicate records in the table the query can affect them multiple times.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0137

Type attribute reference might be unsupported, so it was transformed to variant data type.

### Severity

Critical

### Description

TYPE ATTRIBUTE ‘TYPEUSED%TYPE’ MIGHT BE UNSUPPORTED, SO IT WAS TRANSFORMED TO VARIANT

#### Example Code

##### Oracle:

```sql
CREATE OR REPLACE TABLE MYTABLE
(
  LOG_ID URITYPE
);

CREATE OR REPLACE PROCEDURE some_procedure()
IS
  L_MESSAGE MYTABLE.LOG_ID%TYPE;
BEGIN
  NULL;
END;
```

##### Snowflake Scripting:

```sql
CREATE OR REPLACE TABLE MYTABLE
  (
  !!!RESOLVE EWI!!! /*** SSC-EWI-0028 - TYPE NOT SUPPORTED BY SNOWFLAKE ***/!!!
    LOG_ID URITYPE
  )
  COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/01/2025",  "domain": "no-domain-provided",  "migrationid": "aqCZAdErg3K0P04NglqCCg==" }}'
  ;

  CREATE OR REPLACE PROCEDURE some_procedure ()
  RETURNS VARCHAR
  LANGUAGE SQL
  COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "oracle",  "convertedOn": "10/01/2025",  "domain": "no-domain-provided",  "migrationid": "aqCZAdErg3K0P04NglqCCg==" }}'
  EXECUTE AS CALLER
  AS
  $$
  DECLARE
      L_MESSAGE VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-OR0137 - TYPE ATTRIBUTE 'MYTABLE.LOG_ID%TYPE' MIGHT BE UNSUPPORTED, SO IT WAS TRANSFORMED TO VARIANT ***/!!!;
  BEGIN
      NULL;
  END;
  $$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0138

STANDARD_HASH with dynamic algorithm parameter cannot be converted.

### Severity

Low

### Description

This error is added when the `STANDARD_HASH` function uses a dynamic (non-literal) algorithm parameter, such as a variable or expression. SnowConvert AI cannot determine the target hash function at compile time because the algorithm must be a string literal (`'SHA1'`, `'SHA256'`, `'SHA384'`, `'SHA512'`, or `'MD5'`).

The function is left unconverted and the user must manually resolve the algorithm at runtime.

#### Example Code

##### Oracle:

```sql
 SELECT STANDARD_HASH(col1, algorithm_var) FROM table1;
```

##### Snowflake Scripting:

```sql
 SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-OR0138 - STANDARD_HASH WITH DYNAMIC ALGORITHM PARAMETER CANNOT BE CONVERTED. THE ALGORITHM MUST BE A STRING LITERAL (SHA1, SHA256, SHA384, SHA512, OR MD5). ***/!!!
   STANDARD_HASH(col1, algorithm_var)
 FROM
   table1;
```

### Related EWI

1. [SSC-FDM-OR0032](../functional-difference/oracleFDM.md): StandardHash function with input non-string parameter generates a different result in Snowflake.

#### Best Practices

* Replace the dynamic algorithm parameter with a string literal (e.g., `'SHA256'`) so SnowConvert AI can determine the correct Snowflake hash function.
* If the algorithm must be dynamic at runtime, manually convert the `STANDARD_HASH` call to a `CASE` expression that maps each algorithm to the corresponding Snowflake function (`SHA1`, `SHA2`, `MD5`).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0139

CREATE TYPE with incomplete definition is not supported

### Severity

Medium

### Description

This message is added when Oracle `CREATE TYPE` is used as a **forward declaration** (object type declared without a body). Snowflake does not support that incomplete form; supply a full type definition or migrate the type manually.

#### Best Practices

* Replace forward declarations with a complete `CREATE TYPE ... AS OBJECT (...)` (or equivalent) before conversion, or create the type manually in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0140

CREATE TYPE with member methods or constructors is not supported

### Severity

Medium

### Description

This message is added when a `CREATE TYPE` definition includes **MEMBER** methods, **MAP**/**ORDER** methods, or **constructor** specifications that Snowflake native UDTs do not support in the same way as Oracle. The DDL may be partially emitted or flagged for manual review.

#### Best Practices

* Move procedural logic to stored procedures or functions; keep `CREATE TYPE` to attributes only where possible.
* Review [Oracle CREATE TYPE translation reference](../../../../translation-references/oracle/sql-translation-reference/create_type.md) for supported patterns.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0141

CREATE TYPE with subtype inheritance is not supported

### Severity

Medium

### Description

This message is added when `CREATE TYPE` uses Oracle **UNDER** (subtype inheritance). Snowflake does not model type inheritance the same way as Oracle.

#### Best Practices

* Model hierarchies with separate object types and views, or flatten to a single object type; manual redesign is often required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-OR0142

CREATE TYPE definition is not supported

### Severity

Medium

### Description

This message is added when a `CREATE TYPE` definition cannot be translated to a supported Snowflake native user-defined type. The statement may be commented or flagged depending on context.

#### Best Practices

* Simplify the type definition to supported Snowflake constructs, or implement the type manually.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Out of Scope
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/out-of-scope/generalOOS.md
section: Migrations
---

# SnowConvert AI - Out of Scope

## SSC-OOS-0001

The file has an unexpected encoding and was not translated

### Description

This error occurs when the tool cannot recognize the character encoding format of a source code file. Character encoding is a method of converting text characters into numerical values that computers can process. When the tool encounters characters it cannot interpret, it generates this error.

### Best Practices

* Ensure all files in the input folder use the same character encoding to prevent encoding-related errors.
* Choose the correct encoding using either the conversion settings or by specifying the –encoding parameter in the [CLI](../../../user-guide/snowconvert/command-line-interface/README.md). You can identify the correct encoding using tools like [Free Online Formater](https://freeonlineformatter.com/encoding-string), or by running `file -i *` on Linux or macOS.
* For additional assistance, contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-OOS

Out of scope code unit.

### Description

This issue is generated when SnowConvert AI encounters a top-level SQL statement or code unit that is outside the translation scope. The specific construct is identified in the issue message (e.g., `GRANT`, `REVOKE`, `CREATE FUNCTION` in an unsupported language). SnowConvert AI comments out the entire statement and adds this marker. The limitation may be due to Snowflake not supporting the construct or SnowConvert AI not yet implementing its translation.

### Code Example

#### Input Code:

```sql
GRANT SELECT ON Employees TO ReportingRole;
```

#### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-OOS - OUT OF SCOPE CODE UNIT. GRANT IS OUT OF TRANSLATION SCOPE. ***/!!!
--GRANT SELECT ON Employees TO ReportingRole;
```

### Best Practices

* **Review the commented-out statement:** Determine whether the construct is needed in Snowflake and implement it manually using Snowflake-native syntax (e.g., [GRANT](https://docs.snowflake.com/en/sql-reference/sql/grant-privilege) for access control statements).
* **Check for Snowflake equivalents:** Many out-of-scope constructs have Snowflake counterparts with different syntax. Consult the [Snowflake SQL reference](https://docs.snowflake.com/en/sql-reference) for the appropriate replacement.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Out-of-Scope
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/out-of-scope/README.md
section: Migrations
---

# SnowConvert AI - Out-of-Scope

Examples of Out-of-scope code units if multiple SQL Languages

## Description

As explained in the [conversion scope page](../../../getting-started/running-snowconvert/review-results/snowconvert-scopes.md), certain code units cannot be automatically converted. Below are examples showing how these unsupported code units appear in the output folder.

### Teradata

#### Function with unsupported language:

```sql
 CREATE FUNCTION CFEXTERNALINC (p1 INTEGER)
  RETURNS TABLE(
     c1 INTEGER
   )
   LANGUAGE java
   NO SQL
   PARAMETER STYLE SQL
     EXTERNAL NAME 'CS!fnc_tbf001udt.c'
```

#### Results from Snowflake:

```sql
 ----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE FUNCTION IS OUT OF TRANSLATION SCOPE. **
--CREATE FUNCTION CFEXTERNALINC (p1 INTEGER)
--  RETURNS TABLE(
--     c1 INTEGER
--   )
--   LANGUAGE java
--   NO SQL
--   PARAMETER STYLE SQL
--     EXTERNAL NAME 'CS!fnc_tbf001udt.c'
                                       ;
```

### Oracle Migration

#### Wrapped type definition:

```sql
 CREATE TYPE data_typ1 wrapped
a000000
b2
6CodpsEHq3I=
```

#### Results from Snowflake:

```sql
 ----** SSC-OOS - OUT OF SCOPE CODE UNIT. Wrapped TYPE IS OUT OF TRANSLATION SCOPE. **
--CREATE TYPE data_typ1 wrapped
--a000000
--b2
--6CodpsEHq3I=
```

### Transact-SQL (T-SQL)

#### Trigger:

```sql
 CREATE TRIGGER reminder1
ON Sales.Customer
AFTER INSERT, UPDATE
AS RAISERROR ('Notify Customer Relations', 16, 10);
```

#### Results from Snowflake:

```sql
 ----** SSC-OOS - OUT OF SCOPE CODE UNIT. CREATE TRIGGER IS OUT OF TRANSLATION SCOPE. **
--CREATE TRIGGER reminder1
--ON Sales.Customer
--AFTER INSERT, UPDATE
--AS RAISERROR ('Notify Customer Relations', 16, 10);
```

## Best Practices

* For additional support, please contact us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Output Code
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/output-code.md
section: Migrations
---

# SnowConvert AI - Output Code

## Source Code

Suppose this is the input source code you’ve migrated:

```sql
CREATE TABLE! TABLE_Invalid
(
  COL1 VARCHAR2(255),
  COL2 VARCHAR2
);

CREATE TABLE TABLE1
(
  COL1 INT,
  COL2 VARCHAR2!
);

CREATE OR REPLACE VIEW VIEW1
AS
    SELECT
        UNKOWN_FUNCTION(1),
        COL1,
        COL2
    FROM TABLE1
;
```

## Output code

```sql
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '0' OF THE SOURCE CODE STARTING AT 'CREATE'. EXPECTED 'Create table Statement' GRAMMAR. LAST MATCHING TOKEN WAS 'TABLE' ON LINE '1' COLUMN '7'. FAILED TOKEN WAS '!' ON LINE '1' COLUMN '12'. CODE '63'. **
--CREATE TABLE! TABLE_Invalid
--(
--  COL1 VARCHAR2(255),
--  COL2 VARCHAR2
--)
 ;

        CREATE OR REPLACE TABLE TABLE1
        (
          COL1 INT
--                  ,
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '10' COLUMN '3' OF THE SOURCE CODE STARTING AT 'COL2'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS 'VARCHAR2' ON LINE '10' COLUMN '8'. CODE '15'. **
--  COL2 VARCHAR2!
)
        COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

        --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "UNKOWN_FUNCTION" **
        CREATE OR REPLACE VIEW VIEW1
        COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
        AS
        SELECT
          !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'UNKOWN_FUNCTION' NODE ***/!!!
          UNKOWN_FUNCTION(1),
          COL1,
          COL2
    FROM
          TABLE1
;
```

### How to interpret the output code?

* There is one parsing error in line number one. This is because of an invalid token `CREATE TABLE!`
* There is another parsing error on line 10. This is because of an invalid token`VARCHAR2!`
* There is an unknown function `UNKNOWN_FUNCTION` , which is translated as is, but warning SSC-EWI-0073 is added to indicate that this is something that has not been checked yet and therefore, the functional equivalence cannot be assured.

---
title: SnowConvert AI - Overall Conversion Summary
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/overall-conversion-summary.md
section: Migrations
---

# SnowConvert AI - Overall Conversion Summary

## Total Files

Represents the number of files discovered in the input address and that were successfully migrated by SnowConvert.

### CSV Associated field name

* TotalFiles

#### Sample

```none
input_folder
   ├> sql_file.sql
   ├> notes.txt
   └> views.csv
```

```none
output_folder
   └> sql_file.sql
```

**Expected Total Files:** 1

**Explanation:** With the previous sample, we will only have the SQL file as valid for migration, as the other two files have an extension that SnowConvert AI cannot recognize.

## SQL Files

> **Note:**
>
> This field applies only to Teradata reports.

This is the number of files detected in the input folder that have an extension of .sql, .ddl, or .dml.

### CSV Associated field name

* SqlFileCount

#### Sample

```none
input_folder
    ├> ddl_file.ddl
    ├> dml_file.dml
    ├> sql_file.sql
    ├> other_file.ignore
    └> bteq_file.bteq
```

```none
output_folder
    ├> ddl_file.ddl
    ├> dml_file.dml
    ├> sql_file.sql
    └> bteq_file_BTEQ.py
```

**Expected SQL Files:** 3

**Explanation:** In this case, the 3 files with extensions DDL, DML, and SQL are recognized as SQL Files. Other extensions are not counted for SQL Files. Teradata script files are not counted for SQL files, those are counted for Script files.

## Script Files

> **Note:**
>
> This field applies only to Teradata reports.

This is the number of files in the input folder that are of the following type:

* **BTEQ**: .bteq, .btq
* **FastLoad:** .fload, .fl
* **MultiLoad:** .mload, .mld, ml
* **TPump:** .tpump, .tp
* **TPT:** .tpt

### CSV Associated field name

* ScriptFileCount

#### Sample

```none
input_folder
    ├> bteq_file.bteq
    ├> btq_file.btq
    ├> fload_file.fload
    ├> mload_file.mload
    ├> sql_file.sql
    ├> tpt_file.tpt
    └> tpump_file.tpump
```

```none
output_folder
    ├> bteq_file_BTEQ.py
    ├> btq_file_BTEQ.py
    ├> fload_file_FastLoad.py
    ├> mload_file_MultiLoad.py
    ├> sql_file.sql
    ├> tpt_file_TPT.py
    └> tpump_file_TPump.py
```

**Expected Script Files:** 6

**Explanation:** In this case, the 6 files with extensions with Script file extensions are recognized as Script Files. The 2 extensions for BTEQ files previously mentioned are counted but the SQL file is not counted because it is a SQL File.

## Total Files Not Generated

Represents the number of files found in the input address that, because of a failure in SnowConvert AI, failed to generate the migrated output file.

### CSV Associated field name

* TotalFilesNotGenerated

#### Sample

```none
input_folder
   ├> input1.sql
   ├> input2.sql
   └> input3.sql
```

```none
output_folder
   ├> input1.sql
   └> input2.sql
```

**Expected Total File Not Generated:** 1

**Explanation:**

## Conversion Speed

Represents the number of lines processed per second during the migration.

### Formula

```none
total_lines_of_code / conversion_time
```

#### CSV Associated field name

* ConversionSpeed

#### Sample

```sql
CREATE TABLE table1(
     column1 INT,
     column2 INT
     column3 INT
);

CREATE VIEW view1 AS
SELECT orderkey
FROM orders;
```

**Expected Conversion Speed: 4 lines/sec**

**Explanation:** Let’s say that the example execution time was 2 seconds, taking into account that the number of lines is 8. Applying the formula 8/2 = 4, so the Converting Speed is 4 lines per sec.

## Conversion Time

Represents the duration of SnowConvert AI’s migration.

### CSV Associated field name

* ElapsedTime

## Total Conversion Errors

The total count of conversion errors that occurred during the conversion process. This type of error could be related to file I/O, memory management, or any abnormal situation that cannot be handled by SnowConvert AI. These are unhandled code exceptions and are considered critical issues.

### CSV Associated field name

* TotalConversionErrors

## Total Parsing Errors

The total count of parsing errors that occurred during the code analysis process. A parsing error occurs when the parser (the component that reads the source code files) encounters something unexpected. This usually means a syntax error, which refers to a code element in the file that did not match the SQL grammar specification that the parser was expecting. In other cases, these errors can also occur because the parser is not yet ready to support a specific grammar. Parsing errors are also considered critical issues. If this number is high in relation to the migration workload size, input code revision is advised.

### CSV Associated field name

* TotalParsingErrors

#### Sample

```sql
-- Statement without parsing error
CREATE TABLE table1(
     column1 INT,
     column2 INT
);

-- Statements with parsing error
CRATE TABLE table2(
     column1 INT
);

CREATE VIEW view1 AS
SELECT orderkey
FROM FROM orders;
```

```sql
-- Statement without parsing error
CREATE OR REPLACE TABLE table1 (
     column1 INT,
     column2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '8' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '8' COLUMN '1'. CODE '81'. **
---- Statements with parsing error
--CRATE TABLE table2(
--     column1 INT
--)
 ;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "orders" **
CREATE OR REPLACE VIEW view1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
AS
SELECT
     orderkey
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '14' COLUMN '1' OF THE SOURCE CODE STARTING AT 'FROM'. EXPECTED 'FROM' GRAMMAR. LAST MATCHING TOKEN WAS 'FROM' ON LINE '14' COLUMN '1'. FAILED TOKEN WAS 'FROM' ON LINE '14' COLUMN '6'. CODE '44'. **
--FROM
    ;
```

**Expected Total Parsing Errors: 2**

**Explanation:** The first table presented doesn’t have a parsing error, all of it grammar is correct, but the two following statements present parsing errors because they have a grammar problem, like the second table that the `CREATE` has a spelling mistake, or the double `FROM` on the `SELECT` of the view.

## Total Warnings

The total count of warnings that SnowConvert AI generated for the given input. A warning is inserted when the translation of a specific element is mostly functionally equivalent but there are some corner cases in which some user intervention might be required. They have low severity because their intention is to provide information that can be reviewed if the code shows any kind of functional difference when executed on the target platform.

### CSV Associated field name

* TotalWarnings

#### Sample

```sql
CREATE TABLE table1(
     COL1 SYS.XMLTYPE
);

SELECT TIMESTAMP '1998-12-25 09:26:50.12' AT LOCAL
FROM DUAL;

CREATE TABLE table2(
INTERVAL_YEAR_TYPE INTERVAL YEAR(2)
);
```

```sql
CREATE OR REPLACE TABLE table1 (
     COL1 SYS.XMLTYPE
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

SELECT
     TIMESTAMP '1998-12-25 09:26:50.12'
FROM
     DUAL;

CREATE OR REPLACE TABLE table2 (
INTERVAL_YEAR_TYPE VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR(2) DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

**Expected Total Warnings: 3**

**Explanation:** In the last example, there is a type of warning in all three statements.

## Total Lines of Code (LOC)

The total number of lines of code in the input files, that were processed by the conversion tool.

> **Note:**
>
> Blank lines are not counted.

### CSV Associated field name

* TotalLinesOfCode

#### Sample

```sql
CREATE TABLE table1(
 column1 INT
);

-- Create View
CREATE VIEW view1 AS
SELECT orderkey
FROM orders;
```

**Expected Total Lines of Code(LOC): 8**

**Explanation:** Although the file shows 10 lines, the valid code lines are 8, because blank lines are not counted.

---
title: SnowConvert AI - Performance Review Messages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/performance-review/README.md
section: Migrations
---

# SnowConvert AI - Performance Review Messages

A Performance Review (PRF) issue indicates that while SnowConvert AI successfully translated the source code to valid Snowflake syntax, the resulting code may not perform optimally in Snowflake. When you encounter a PRF issue in the converted code, we recommend reviewing that section carefully and considering whether you can rewrite it to improve performance.

---
title: SnowConvert AI - PostgreSQL & Based Languages Conversion Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/postgresql-conversion-settings.md
section: Migrations
---

# SnowConvert AI - PostgreSQL & Based Languages Conversion Settings

This topic applies to the following sources:

* PostgreSQL
* Amazon Redshift
* Greenplum
* Netezza

## Prepare Code Settings

### **Description**

**Prepare my code:** Flag to indicate whether the input code should be processed before parsing and transformation. This can be useful to improve the parsing process. By default, it’s set to FALSE. When this flag is active, a new folder called `source_processed` will be generated and used for the migration.

Searches the input code for routine bodies using literals as definition delimiters. This is a non-standard PostgreSQL specific grammar that allows the user to define a procedure body using single quotes as delimiters. To facilitate SnowConvert AI transformation, the arrange step will transform these occurrences into standard procedure bodies, using `$$` as delimiters. Also, it will change dollar-quoted literals to regular single-quoted literals. All these changes will produce semantically equivalent code.

### **Example**

#### **Input**

```postgresql
CREATE OR REPLACE PROCEDURE proc1 (x varchar default 'pigs')
LANGUAGE plpgsql
AS
'
begin
    --test
   insert into tabletest2 values ($$Dianne''s pigs$$);
   x = ''Diannes pigs'';
end;
';
```

#### **Output**

```postgresql
CREATE OR REPLACE PROCEDURE proc1 (x varchar default 'pigs')
LANGUAGE plpgsql AS $$
begin
    --test
   insert into tabletest2 values ('Dianne''s pigs');
   x = 'Diannes pigs';
end;
 $$;
```

---
title: SnowConvert AI - PostgreSQL - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/postgresql-built-in-functions.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - Built-in functions

## Applies to

* PostgreSQL
* Greenplum
* Netezza

> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Aggregate Functions

> Aggregate functions compute a single result value from a set of input values. ([PostgreSQL Language Reference Aggregate Functions](https://www.postgresql.org/docs/12/functions-aggregate.html)).

| PostgreSQL | Snowflake |
| --- | --- |
| [AVG](https://www.postgresql.org/docs/12/functions-aggregate.html) | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg)    *Notes:* PostgreSQL *and Snowflake may show different precision/decimals due to data type rounding/formatting.* |
| [COUNT](https://www.postgresql.org/docs/12/functions-aggregate.html) | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| [MAX](https://www.postgresql.org/docs/12/functions-aggregate.html) | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) |
| [MEDIAN](https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/ref_guide-function-summary.html#topic31) | [MEDIAN](https://docs.snowflake.com/en/sql-reference/functions/median)    *Notes**: Snowflake does not allow the use of date types**, while* PostgreSQL *does. (See* [SSC-FDM-PG0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*).* |
| [MIN](https://www.postgresql.org/docs/12/functions-aggregate.html) | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) |
| [PERCENTILE_CONT](https://www.postgresql.org/docs/9.4/functions-aggregate.html#FUNCTIONS-ORDEREDSET-TABLE) | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont) |
| [STDDEV/STDDEV_SAMP](https://www.postgresql.org/docs/12/functions-aggregate.html) (*expression*) | [STDDEV/STDDEV_SAMP](https://docs.snowflake.com/en/sql-reference/functions/stddev) (*expression*) |
| [STDDEV_POP](https://www.postgresql.org/docs/12/functions-aggregate.html) (*expression*) | [STDDEV_POP](https://docs.snowflake.com/en/sql-reference/functions/stddev_pop) (*expression*) |
| [SUM](https://www.postgresql.org/docs/12/functions-aggregate.html) | [SUM](https://docs.snowflake.com/en/sql-reference/functions/sum) |
| [VARIANCE/VAR_SAMP](https://www.postgresql.org/docs/12/functions-aggregate.html) (*expression*) | [VARIANCE/VAR_SAMP](https://docs.snowflake.com/en/sql-reference/functions/variance)  (*expression*) |
| [VAR_POP](https://www.postgresql.org/docs/12/functions-aggregate.html) (*expression*) | [VAR_POP](https://docs.snowflake.com/en/sql-reference/functions/variance_pop) (*expression*) |

## Conditional expressions

| PostgreSQL | Snowflake |
| --- | --- |
| [COALESCE](https://www.postgresql.org/docs/12/functions-conditional.html) ( value *[, …]* ) | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce) ( *expression*, *expression*, … ) |
| [GREATEST](https://www.postgresql.org/docs/12/functions-conditional.html) ( value [, …] ) | [GREATEST_IGNORE_NULLS](https://docs.snowflake.com/en/sql-reference/functions/greatest_ignore_nulls) ( <expr1> [, <expr2> … ] ) |
| [LEAST](https://www.postgresql.org/docs/12/functions-conditional.html) ( value [, …] ) | [LEAST_IGNORE_NULLS](https://docs.snowflake.com/en/sql-reference/functions/least_ignore_nulls) ( <expr1> [, <expr2> … ]) |
| [NULLIF](https://www.postgresql.org/docs/12/functions-conditional.html) | [NULLIF](https://docs.snowflake.com/en/sql-reference/functions/nullif)   *Notes: PostgreSQL’s NULLIF ignores trailing spaces in some string comparisons, unlike Snowflake. Therefore, the transformation adds RTRIM for equivalence.* |

## Data type formatting functions

> Data type formatting functions provide an easy way to convert values from one data type to another. For each of these functions, the first argument is always the value to be formatted and the second argument contains the template for the new format. ([PostgreSQL Language Reference Data type formatting functions](https://www.postgresql.org/docs/12/functions-formatting.html)).

| PostgreSQL | Snowflake |
| --- | --- |
| [TO_CHAR](https://www.postgresql.org/docs/12/functions-formatting.html) | [TO_CHAR](https://docs.snowflake.com/en/sql-reference/functions/to_char)    *Notes: Snowflake’s support for this function is partial (see* [*SSC-EWI-PG0005*](broken-reference)*).* |
| [TO_DATE](https://www.postgresql.org/docs/12/functions-formatting.html) | [TO_DATE](https://docs.snowflake.com/en/sql-reference/functions/to_date)    *Notes: Snowflake’s `TO_DATE` fails on invalid dates like ‘20010631’ (June has 30 days), unlike* PostgreSQL’*s lenient `TO_DATE`. Use `TRY_TO_DATE` in Snowflake to handle these cases by returning NULL. (see* [*SSC-EWI-PG0005*](broken-reference)*,* [*SSC-FDM-0032*](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)*).* |

## Date and time functions

| PostgreSQL | Snowflake |
| --- | --- |
| [AT TIME ZONE ‘timezone’](https://www.postgresql.org/docs/12/functions-datetime.html#FUNCTIONS-DATETIME-ZONECONVERT) | [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) ( <source_tz> , <target_tz> , <source_timestamp_ntz> )    [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) ( <target_tz> , <source_timestamp> )    *Notes:* PostgreSQL *defaults to UTC; the Snowflake function requires explicit UTC specification. Therefore, it will be added as the target timezone.* |
| [CURRENT_DATE](https://www.postgresql.org/docs/8.2/functions-datetime.html) | [CURRENT_DATE()](https://docs.snowflake.com/en/sql-reference/functions/current_date) |
| [DATE_PART/PGDATE_PART](https://www.postgresql.org/docs/12/functions-datetime.html#FUNCTIONS-DATETIME-EXTRACT) | [DATE_PART](https://docs.snowflake.com/en/sql-reference/functions/date_part)    *Notes: this function is partially supported by Snowflake. (See* [*SSC-EWI-PG0005*](broken-reference)*).* |
| [DATE_TRUNC](https://www.postgresql.org/docs/12/functions-datetime.html#FUNCTIONS-DATETIME-EXTRACT) | [DATE_TRUNC](https://docs.snowflake.com/en/sql-reference/functions/date_trunc)    *Notes: Invalid date part formats are translated to Snowflake-compatible formats.* |
| [TO_TIMESTAMP](https://www.postgresql.org/docs/12/functions-datetime.html#FUNCTIONS-DATETIME-EXTRACT) | [TO_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/to_timestamp) |
| [EXTRACT](https://www.postgresql.org/docs/current/functions-datetime.html#FUNCTIONS-DATETIME-EXTRACT) | [EXTRACT](https://docs.snowflake.com/en/sql-reference/functions/extract)  *Notes:* Part-time or Date time supported: DAY, DOW, DOY, EPOCH, HOUR, MINUTE, MONTH, QUARTER, SECOND, WEEK, YEAR. |
| [TIMEZONE](https://www.postgresql.org/docs/16/functions-datetime.html#FUNCTIONS-DATETIME-ZONECONVERT) | [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) |

> **Note:**
>
> PostgreSQL timestamps default to microsecond precision (6 digits); Snowflake defaults to nanosecond precision (9 digits). Adjust precision as needed using ALTER SESSION (for example, `ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF2';`). Precision loss may occur depending on the data type used.
>
> Since some formats are incompatible with Snowflake, adjusting the account parameters [DATE_INPUT_FORMAT or TIME_INPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/date-time-input-output#data-loading) might maintain functional equivalence between platforms.

## JSON Functions

| PostgreSQL | Snowflake |
| --- | --- |
| [JSON_EXTRACT_PATH_TEXT](https://www.postgresql.org/docs/9.3/functions-json.html) | [JSON_EXTRACT_PATH_TEXT](https://docs.snowflake.com/en/sql-reference/functions/json_extract_path_text)    *Notes:*   1. PostgreSQL *treats newline, tab, and carriage return characters literally; Snowflake interprets them.* 2. *A JSON literal and dot-separated path are required to access nested objects in the Snowflake function.* 3. *Paths with spaces in variables must be quoted.* |

## Math functions

| PostgreSQL | Snowflake |
| --- | --- |
| [ACOS](https://www.postgresql.org/docs/12/functions-math.html) | [ACOS](https://docs.snowflake.com/en/sql-reference/functions/acos) |
| [ASIN](https://www.postgresql.org/docs/12/functions-math.html) | [ASIN](https://docs.snowflake.com/en/sql-reference/functions/asin) |
| [ATAN](https://www.postgresql.org/docs/12/functions-math.html) | [ATAN](https://docs.snowflake.com/en/sql-reference/functions/atan) |
| [ATAN2](https://www.postgresql.org/docs/12/functions-math.html) | [ATAN2](https://docs.snowflake.com/en/sql-reference/functions/atan2) |
| [CBRT](https://www.postgresql.org/docs/12/functions-math.html) | [CBRT](https://docs.snowflake.com/en/sql-reference/functions/cbrt) |
| [CEIL/CEILING](https://www.postgresql.org/docs/12/functions-math.html) | [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil) |
| [COS](https://www.postgresql.org/docs/12/functions-math.html) | [COS](https://docs.snowflake.com/en/sql-reference/functions/cos) |
| [COT](https://www.postgresql.org/docs/12/functions-math.html) | [COT](https://docs.snowflake.com/en/sql-reference/functions/cot) |
| [DEGREES](https://www.postgresql.org/docs/12/functions-math.html) | [DEGREES](https://docs.snowflake.com/en/sql-reference/functions/degrees) |
| [LN](https://www.postgresql.org/docs/12/functions-math.html) | [LN](https://docs.snowflake.com/en/sql-reference/functions/ln) |
| [EXP](https://www.postgresql.org/docs/12/functions-math.html) | [EXP](https://docs.snowflake.com/en/sql-reference/functions/exp) |
| [FLOOR](https://www.postgresql.org/docs/12/functions-math.html) | [FLOOR](https://docs.snowflake.com/en/sql-reference/functions/floor) |
| [LOG](https://www.postgresql.org/docs/12/functions-math.html) | [LOG](https://docs.snowflake.com/en/sql-reference/functions/log) |
| [MOD](https://www.postgresql.org/docs/12/functions-math.html) | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod) |
| [PI](https://www.postgresql.org/docs/12/functions-math.html) | [PI](https://docs.snowflake.com/en/sql-reference/functions/pi) |
| [POWER/POW](https://www.postgresql.org/docs/12/functions-math.html) | [POWER/POW](https://docs.snowflake.com/en/sql-reference/functions/pow) |
| [RADIANS](https://www.postgresql.org/docs/12/functions-math.html) | [RADIANS](https://docs.snowflake.com/en/sql-reference/functions/radians) |
| [RANDOM](https://www.postgresql.org/docs/12/functions-math.html) | [RANDOM](https://docs.snowflake.com/en/sql-reference/functions/random) |
| [ROUND](https://www.postgresql.org/docs/12/functions-math.html) | [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round) |
| [SIN](https://www.postgresql.org/docs/12/functions-math.html) | [SIN](https://docs.snowflake.com/en/sql-reference/functions/sin) |
| [SIGN](https://www.postgresql.org/docs/12/functions-math.html) | [SIGN](https://docs.snowflake.com/en/sql-reference/functions/sign) |
| [SQRT](https://www.postgresql.org/docs/12/functions-math.html) | [SQRT](https://docs.snowflake.com/en/sql-reference/functions/sqrt) |
| [TAN](https://www.postgresql.org/docs/12/functions-math.html) | [TAN](https://docs.snowflake.com/en/sql-reference/functions/tan) |
| [TRUNC](https://www.postgresql.org/docs/12/functions-math.html) | [TRUNC](https://docs.snowflake.com/en/sql-reference/functions/trunc) |

> **Note:**
>
> PostgreSQL and Snowflake results may differ in scale.

## String functions

> String functions process and manipulate character strings or expressions that evaluate to character strings. ([PostgreSQL Language Reference String functions](https://www.postgresql.org/docs/12/functions-string.html)).

| PostgreSQL | Snowflake |
| --- | --- |
| [ASCII](https://www.postgresql.org/docs/12/functions-string.html) | [ASCII](https://docs.snowflake.com/en/sql-reference/functions/ascii) |
| [BTRIM](https://www.postgresql.org/docs/12/functions-string.html) | [TRIM](https://docs.snowflake.com/en/sql-reference/functions/trim) |
| [CHAR_LENGTH](https://www.postgresql.org/docs/12/functions-string.html) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [CHARACTER_LENGTH](https://www.postgresql.org/docs/12/functions-string.html) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [CHR](https://www.postgresql.org/docs/9.1/functions-string.html) | [CHR](https://docs.snowflake.com/en/sql-reference/functions/chr) |
| [CONCAT](https://www.postgresql.org/docs/12/functions-string.html) | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) |
| [INITCAP](https://www.postgresql.org/docs/12/functions-string.html) | [INITCAP](https://docs.snowflake.com/en/sql-reference/functions/initcap) |
| [LEFT/RIGHT](https://www.postgresql.org/docs/12/functions-string.html) | [LEFT](https://docs.snowflake.com/en/sql-reference/functions/left)/[RIGHT](https://docs.snowflake.com/en/sql-reference/functions/right) |
| [LOWER](https://www.postgresql.org/docs/12/functions-string.html) | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower) |
| [OCTET_LENGTH](https://www.postgresql.org/docs/12/functions-string.html) | [OCTET_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/octet_length)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*).* |
| [QUOTE_IDENT](https://www.postgresql.org/docs/12/functions-string.html) (*string*) | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) (‘”’, *string,* ‘”’) |
| [REGEXP_REPLACE](https://www.postgresql.org/docs/12/functions-string.html) | [REGEXP_REPLACE](https://docs.snowflake.com/en/sql-reference/functions/regexp_replace)    *Notes: This function includes a `parameters` argument that enables the user to interpret the pattern using the Perl Compatible Regular Expression (PCRE) dialect, represented by the `p` value, this is removed to avoid any issues*. *(See* [*SSC-EWI-0009*](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*,* [*SC-FDM-0032*](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)*,* [*SSC-FDM-PG0011*](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*).* |
| [REPEAT](https://www.postgresql.org/docs/12/functions-string.html) | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat) |
| [REPLACE](https://www.postgresql.org/docs/12/functions-string.html) | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| [REVERSE](https://www.postgresql.org/docs/12/functions-string.html) | [REVERSE](https://docs.snowflake.com/en/sql-reference/functions/reverse) |
| [SPLIT_PART](https://www.postgresql.org/docs/12/functions-string.html) | [SPLIT_PART](https://docs.snowflake.com/en/sql-reference/functions/split_part)    *Notes: Snowflake and* PostgreSQL *handle SPLIT_PART differently with case-insensitive collations.* |
| [STRPOS](https://www.postgresql.org/docs/12/functions-string.html) (*string*, *substring* ) | [POSITION](https://docs.snowflake.com/en/sql-reference/functions/position) ( <expr1> IN <expr> ) |
| [SUBSTRING](https://www.postgresql.org/docs/12/functions-string.html) | [*SUBSTRING*](https://docs.snowflake.com/en/sql-reference/functions/substr)    *Notes:* Snowflake partially supports this function. PostgreSQL’s `SUBSTRING`, with a non-positive `start_position`, calculates `start_position + number_characters` (returning ‘’ if the result is non-positive). Snowflake’s behavior differs. |
| [TRANSLATE](https://www.postgresql.org/docs/12/functions-string.html) | [TRANSLATE](https://docs.snowflake.com/en/sql-reference/functions/translate) |
| [TRIM](https://www.postgresql.org/docs/12/functions-string.html) | [*TRIM*](https://docs.snowflake.com/en/sql-reference/functions/trim)    *Notes:* PostgreSQL *uses keywords (BOTH, LEADING, TRAILING) for trim; Snowflake uses TRIM, LTRIM, RTRIM.* |
| [UPPER](https://www.postgresql.org/docs/12/functions-string.html) | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) |

## Window functions

| PostgreSQL | Snowflake |
| --- | --- |
| [AVG](https://www.postgresql.org/docs/9.4/functions-aggregate.html) | [*AVG*](https://docs.snowflake.com/en/sql-reference/functions/avg)    *Notes: AVG rounding/formatting can vary by data type between* PostgreSQL *and Snowflake.* |
| [COUNT](https://www.postgresql.org/docs/9.4/functions-aggregate.html) | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| [DENSE_RANK](https://www.postgresql.org/docs/current/functions-window.html) | [DENSE_RANK](https://docs.snowflake.com/en/sql-reference/functions/dense_rank)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [FIRST_VALUE](https://www.postgresql.org/docs/current/functions-window.html) | [FIRST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/first_value)    *Notes: Snowflake needs ORDER BY; missing clauses get `ORDER BY <expr>.`* |
| [LAG](https://www.postgresql.org/docs/current/functions-window.html) | [LAG](https://docs.snowflake.com/en/sql-reference/functions/lag) |
| [LAST_VALUE](https://www.postgresql.org/docs/current/functions-window.html) | [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value)    *Notes: Snowflake needs ORDER BY; missing clauses get `ORDER BY <expr>`.* |
| [LEAD](https://www.postgresql.org/docs/current/functions-window.html) | [LEAD](https://docs.snowflake.com/en/sql-reference/functions/lead)    *Notes:* PostgreSQL *allows constant or expression offsets; Snowflake allows only constant offset*s. |
| [NTH_VALUE](https://www.postgresql.org/docs/current/functions-window.html) | [NTH_VALUE](https://docs.snowflake.com/en/sql-reference/functions/nth_value)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [NTILE](https://www.postgresql.org/docs/current/functions-window.html) | [NTILE](https://docs.snowflake.com/en/sql-reference/functions/ntile)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`. (See* [SSC-FDM-PG0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*).* |
| [PERCENT_RANK](https://www.postgresql.org/docs/current/functions-window.html) | [PERCENT_RANK](https://docs.snowflake.com/en/sql-reference/functions/percent_rank)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [PERCENTILE_CONT](https://www.postgresql.org/docs/9.4/functions-aggregate.html) | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont)    *Notes: Rounding varies between platforms.* |
| [PERCENTILE_DISC](https://www.postgresql.org/docs/9.4/functions-aggregate.html) | [PERCENTILE_DISC](https://docs.snowflake.com/en/sql-reference/functions/percentile_disc) |
| [RANK](https://www.postgresql.org/docs/current/functions-window.html) | [RANK](https://docs.snowflake.com/en/sql-reference/functions/rank) |
| [ROW_NUMBER](https://www.postgresql.org/docs/current/functions-window.html) | [ROW_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/row_number)    N*otes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |

## Related EWIs

* [SSC-FDM-0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Parameter is not a literal value, transformation could not be fully applied
* [SSC-FDM-PG0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Function syntactically supported by Snowflake but may have functional differences.
* [SSC-EWI-0009](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Regexp_Substr Function only supports POSIX regular expressions.
* [SSC-FDM-PG0011](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): The use of the COLLATE column constraint has been disabled for this pattern-matching condition.

---
title: SnowConvert AI - PostgreSQL - CREATE MATERIALIZED VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/create-materialized-view/postgresql-create-materialized-view.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - CREATE MATERIALIZED VIEW

Translation reference to convert PostgreSQL Materialized View to Snowflake Dynamic Table

## Applies to

* PostgreSQL
* Greenplum
* Netezza

## Description

In SnowConvert AI, Materialized Views are transformed into Snowflake Dynamic Tables. To properly configure Dynamic Tables, two essential parameters must be defined: TARGET_LAG and WAREHOUSE. If these parameters are left unspecified in the configuration options, SnowConvert AI will default to preassigned values during the conversion, as demonstrated in the example below.

## Grammar Syntax

```sql
CREATE MATERIALIZED VIEW [ IF  NOT EXISTS ] <table_name>
    [ (<column_name> [, ...] ) ]
    [ USING <method> ]
    [ WITH ( <storage_parameter> [= <value>] [, ... ] ) ]
    [ TABLESPACE <tablespace_name> ]
    AS <query>
    [ WITH [ NO ] DATA ]
```

## Code Examples

### Simple Case

Input Code:

#### PostgreSQL

```sql
CREATE MATERIALIZED VIEW product_summary AS
SELECT
    category,
    COUNT(*) AS total_products,
    MAX(price) AS max_price
FROM products
GROUP BY category;
```

Output Code:

##### Snowflake

```sql
CREATE OR REPLACE DYNAMIC TABLE product_summary
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/14/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
    category,
    COUNT(*) AS total_products,
    MAX(price) AS max_price
FROM
    products
GROUP BY category;
```

### IF NOT EXISTS

> **Hint:**
>
> This syntax is fully supported in Snowflake.

This clause has been removed during the migration from PostgreSQL to Snowflake.

### USING, TABLESPACE, and WITH

> **Note:**
>
> This syntax is not needed in Snowflake.

These clauses are removed during the conversion process. In PostgreSQL, they are used to further customize data storage manually. This is something that Snowflake handles automatically (micro partitions), and it is typically not a concern.

## Related EWIs

1. [SSC-FDM-0031](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

---
title: SnowConvert AI - PostgreSQL - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/create-table/postgresql-create-table.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - CREATE TABLE

Translation from PostgreSQL to Snowflake

## Applies to

* PostgreSQL
* Greenplum
* Netezza

## Description

Creates a new table in PostgreSQL. You define a list of columns, each of which holds data of a distinct type. The owner of the table is the issuer of the CREATE TABLE command.

For more information, please refer to `CREATE TABLE` documentation.

## Grammar Syntax

```sql
CREATE [ [ GLOBAL | LOCAL ] { TEMPORARY | TEMP } | UNLOGGED ] TABLE [ IF NOT EXISTS ] table_name ( [
  { column_name data_type [ STORAGE { PLAIN | EXTERNAL | EXTENDED | MAIN | DEFAULT } ] [ COMPRESSION compression_method ] [ COLLATE collation ] [ column_constraint [ ... ] ]
    | table_constraint
    | LIKE source_table [ like_option ... ] }
    [, ... ]
] )
[ INHERITS ( parent_table [, ... ] ) ]
[ PARTITION BY { RANGE | LIST | HASH } ( { column_name | ( expression ) } [ COLLATE collation ] [ opclass ] [, ... ] ) ]
[ USING method ]
[ WITH ( storage_parameter [= value] [, ... ] ) | WITHOUT OIDS ]
[ ON COMMIT { PRESERVE ROWS | DELETE ROWS | DROP } ]
[ TABLESPACE tablespace_name ]

CREATE [ [ GLOBAL | LOCAL ] { TEMPORARY | TEMP } | UNLOGGED ] TABLE [ IF NOT EXISTS ] table_name
    OF type_name [ (
  { column_name [ WITH OPTIONS ] [ column_constraint [ ... ] ]
    | table_constraint }
    [, ... ]
) ]
[ PARTITION BY { RANGE | LIST | HASH } ( { column_name | ( expression ) } [ COLLATE collation ] [ opclass ] [, ... ] ) ]
[ USING method ]
[ WITH ( storage_parameter [= value] [, ... ] ) | WITHOUT OIDS ]
[ ON COMMIT { PRESERVE ROWS | DELETE ROWS | DROP } ]
[ TABLESPACE tablespace_name ]

CREATE [ [ GLOBAL | LOCAL ] { TEMPORARY | TEMP } | UNLOGGED ] TABLE [ IF NOT EXISTS ] table_name
    PARTITION OF parent_table [ (
  { column_name [ WITH OPTIONS ] [ column_constraint [ ... ] ]
    | table_constraint }
    [, ... ]
) ] { FOR VALUES partition_bound_spec | DEFAULT }
[ PARTITION BY { RANGE | LIST | HASH } ( { column_name | ( expression ) } [ COLLATE collation ] [ opclass ] [, ... ] ) ]
[ USING method ]
[ WITH ( storage_parameter [= value] [, ... ] ) | WITHOUT OIDS ]
[ ON COMMIT { PRESERVE ROWS | DELETE ROWS | DROP } ]
[ TABLESPACE tablespace_name ]

where column_constraint is:

[ CONSTRAINT constraint_name ]
{ NOT NULL |
  NULL |
  CHECK ( expression ) [ NO INHERIT ] |
  DEFAULT default_expr |
  GENERATED ALWAYS AS ( generation_expr ) STORED |
  GENERATED { ALWAYS | BY DEFAULT } AS IDENTITY [ ( sequence_options ) ] |
  UNIQUE [ NULLS [ NOT ] DISTINCT ] index_parameters |
  PRIMARY KEY index_parameters |
  REFERENCES reftable [ ( refcolumn ) ] [ MATCH FULL | MATCH PARTIAL | MATCH SIMPLE ]
    [ ON DELETE referential_action ] [ ON UPDATE referential_action ] }
[ DEFERRABLE | NOT DEFERRABLE ] [ INITIALLY DEFERRED | INITIALLY IMMEDIATE ]

and table_constraint is:

[ CONSTRAINT constraint_name ]
{ CHECK ( expression ) [ NO INHERIT ] |
  UNIQUE [ NULLS [ NOT ] DISTINCT ] ( column_name [, ... ] ) index_parameters |
  PRIMARY KEY ( column_name [, ... ] ) index_parameters |
  EXCLUDE [ USING index_method ] ( exclude_element WITH operator [, ... ] ) index_parameters [ WHERE ( predicate ) ] |
  FOREIGN KEY ( column_name [, ... ] ) REFERENCES reftable [ ( refcolumn [, ... ] ) ]
    [ MATCH FULL | MATCH PARTIAL | MATCH SIMPLE ] [ ON DELETE referential_action ] [ ON UPDATE referential_action ] }
[ DEFERRABLE | NOT DEFERRABLE ] [ INITIALLY DEFERRED | INITIALLY IMMEDIATE ]

and like_option is:

{ INCLUDING | EXCLUDING } { COMMENTS | COMPRESSION | CONSTRAINTS | DEFAULTS | GENERATED | IDENTITY | INDEXES | STATISTICS | STORAGE | ALL }

and partition_bound_spec is:

IN ( partition_bound_expr [, ...] ) |
FROM ( { partition_bound_expr | MINVALUE | MAXVALUE } [, ...] )
  TO ( { partition_bound_expr | MINVALUE | MAXVALUE } [, ...] ) |
WITH ( MODULUS numeric_literal, REMAINDER numeric_literal )

index_parameters in UNIQUE, PRIMARY KEY, and EXCLUDE constraints are:

[ INCLUDE ( column_name [, ... ] ) ]
[ WITH ( storage_parameter [= value] [, ... ] ) ]
[ USING INDEX TABLESPACE tablespace_name ]

exclude_element in an EXCLUDE constraint is:

{ column_name | ( expression ) } [ COLLATE collation ] [ opclass [ ( opclass_parameter = value [, ... ] ) ] ] [ ASC | DESC ] [ NULLS { FIRST | LAST } ]

referential_action in a FOREIGN KEY/REFERENCES constraint is:

{ NO ACTION | RESTRICT | CASCADE | SET NULL [ ( column_name [, ... ] ) ] | SET DEFAULT [ ( column_name [, ... ] ) ] }
```

## Tables Options

### TEMPORARY | TEMP, or IF NOT EXISTS

> **Hint:**
>
> This syntax is fully supported in Snowflake.

### GLOBAL | LOCAL

> **Note:**
>
> This syntax is not needed in Snowflake.

According to PostgreSQL’s documentation, GLOBAL | LOCAL are present for SQL Standard compatibility, but have no effect in PostgreSQL and are deprecated. For that reason, SnowConvert AI will remove these keyworks during the migration process.

#### Sample Source

Input Code:

##### PostgreSQL

```sql
CREATE GLOBAL TEMP TABLE TABLE1 (
   COL1 integer
);
```

Output Code:

##### Snowflake

```sql
CREATE TEMPORARY TABLE TABLE1 (
   COL1 integer
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/09/2025",  "domain": "no-domain-provided" }}';
```

### UNLOGGED TABLE

> **Note:**
>
> This syntax is not needed in Snowflake.

UNLOGGED tables offer a significant speed advantage because they are not written to the write-ahead log. Snowflake doesn’t support this functionality, so the `UNLOGGED` clause will be commented out.

### Code Example

#### Input Code:

##### Greenplum

```sql
CREATE UNLOGGED TABLE TABLE1 (
  COL1 integer
);
```

#### Output Code:

##### Snowflake

```sql
CREATE
--       --** SSC-FDM-PG0005 - UNLOGGED TABLE IS NOT SUPPORTED IN SNOWFLAKE, DATA WRITTEN MAY HAVE DIFFERENT PERFORMANCE. **
--       UNLOGGED
                TABLE TABLE1 (
   COL1 integer
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/09/2025",  "domain": "no-domain-provided" }}';
```

## Column Attributes

### CHECK Attribute

> **Danger:**
>
> This syntax is not supported in Snowflake.

The CHECK clause specifies an expression producing a Boolean result that new or updated rows must satisfy for an insert or update operation to succeed. Snowflake does not have an equivalence with this clause; SnowConvert AI will add an EWI. This will be applied as a CHECK attribute or table constraint.

Grammar Syntax

```sql
CHECK  ( <expression> )
```

#### Sample Source

Input Code:

##### PostgreSQL

```sql
CREATE TABLE table1 (
    product_id INT PRIMARY KEY,
    quantity INT CHECK (quantity >= 0)
);
```

Output Code:

##### Snowflake

```sql
CREATE TABLE table1 (
    product_id INT PRIMARY KEY,
    quantity INT
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!! CHECK (quantity >= 0)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/09/2025",  "domain": "no-domain-provided" }}';
```

### GENERATED BY DEFAULT AS IDENTITY

> **Hint:**
>
> This syntax is fully supported in Snowflake.

Specifies that the column is a default IDENTITY column and enables you to assign a unique value to the column automatically.

Grammar Syntax

```sql
 GENERATED { ALWAYS | BY DEFAULT } AS IDENTITY [ ( <sequence_options> ) ]
```

#### Sample Source

Input Code:

##### PostgreSQL

```sql
CREATE TABLE table1 (
idValue INTEGER GENERATED ALWAYS AS IDENTITY)
```

Output Code:

##### Snowflake

```sql
CREATE TABLE table1 (
idValue INTEGER IDENTITY(1, 1) ORDER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/09/2025",  "domain": "no-domain-provided" }}'
```

## Table Constraints

### Primary Key, Foreign Key, and Unique

> **Warning:**
>
> This syntax is partially supported in Snowflake.

SnowConvert AI keeps the constraint definitions; however, in Snowflake, unique, primary, and foreign keys are used for documentation and do not enforce constraints or uniqueness. They help describe table relationships but don’t impact data integrity or performance.

## Table Attributes

### LIKE option

> **Warning:**
>
> This syntax is partially supported in Snowflake.

The `LIKE` clause specifies a table from which the new table automatically copies all column names, their data types, and their not-null constraints. PostgreSQL supports several options, while Snowflake does not so that SnowConvert AI will remove the options like.

#### Grammar Syntax

```sql
  LIKE source_table { INCLUDING | EXCLUDING }
  { AM | COMMENTS | CONSTRAINTS | DEFAULTS | ENCODING | GENERATED | IDENTITY | INDEXES | RELOPT | STATISTICS | STORAGE | ALL }
```

#### Sample Source Patterns

Input Code:

##### PostgreSQL

```sql
CREATE TABLE source_table (
    id INT,
    name VARCHAR(255),
    created_at TIMESTAMP,
    status BOOLEAN
);

CREATE TABLE target_table_no_constraints (LIKE source_table INCLUDING DEFAULTS EXCLUDING CONSTRAINTS EXCLUDING INDEXES);
```

Output Code:

##### Snowflake

```sql
CREATE TABLE source_table (
    id INT,
    name VARCHAR(255),
    created_at TIMESTAMP,
    status BOOLEAN
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/12/2025",  "domain": "no-domain-provided" }}';
CREATE TABLE target_table_no_constraints LIKE source_table;
```

### ON COMMIT

> **Warning:**
>
> This syntax is partially supported.

Specifies the behaviour of the temporary table when a commit is done.

#### Grammar Syntax

```sql
ON COMMIT { PRESERVE ROWS | DELETE ROWS | DROP }
```

## Sample Source Patterns

### Input Code:

#### PostgreSQL

```sql
CREATE GLOBAL TEMPORARY TABLE temp_data_delete (
    id INT,
    data TEXT
) ON COMMIT DELETE ROWS;
```

#### Output Code:

##### Snowflake

```sql
CREATE TEMPORARY TABLE temp_data_delete (
    id INT,
    data TEXT
)
----** SSC-FDM-0008 - ON COMMIT NOT SUPPORTED **
--ON COMMIT DELETE ROWS
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/12/2025",  "domain": "no-domain-provided" }}';
```

### PARTITION BY, USING, TABLESPACE, and WITH

> **Note:**
>
> This syntax is not needed in Snowflake.

These clauses in Snowflake are unnecessary because they automatically handle the data storage, unlike PostgreSQL, which could be set up manually. For this reason, these clauses are removed during migration.

## Related EWIs

1. [SSC-EWI-0035](../../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.
2. [SSC-FDM-PG0005](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): UNLOGGED Table is not supported in Snowflake; data written may have different performance.
3. [SSC-FDM-0008](../../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): On Commit not supported.

---
title: SnowConvert AI - PostgreSQL - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/postgresql-create-view.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - CREATE VIEW

Translation from PostgreSQL to Snowflake

## Applies to

* PostgreSQL
* Greenplum
* Netezza

## Description

This command creates a view in a database, which is run every time the view is referenced in a query.

For more information, please refer to [`CREATE VIEW`](https://www.postgresql.org/docs/current/sql-createview.html) documentation.

## Grammar Syntax

```sql
CREATE [OR REPLACE] [TEMP | TEMPORARY] [RECURSIVE] VIEW <name> [ ( <column_name> [, ...] ) ]
    [ WITH ( view_option_name [= view_option_value] [, ... ] ) ]
    AS <query>
    [ WITH [ CASCADED | LOCAL ] CHECK OPTION ]
```

## Code Examples

### [OR REPLACE] [TEMP | TEMPORARY] [RECURSIVE]

> **Hint:**
>
> This syntax is fully supported in Snowflake.

#### Input Code:

##### PostgreSQL

```sql
CREATE OR REPLACE VIEW view1 AS
    SELECT
        product_id,
        SUM(quantity) AS sum_quantity
    FROM
        table1
    GROUP BY
        product_id;

CREATE TEMPORARY RECURSIVE VIEW view2 AS
    SELECT
        product_id,
        SUM(quantity) AS sum_quantity
    FROM
        table1
    GROUP BY
        product_id;
```

#### Output Code:

##### Snowflake

```sql
CREATE OR REPLACE VIEW view1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/14/2025",  "domain": "no-domain-provided" }}'
AS
    SELECT
        product_id,
        SUM(quantity) AS sum_quantity
    FROM
table1
    GROUP BY
        product_id;

CREATE TEMPORARY RECURSIVE VIEW view2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/14/2025",  "domain": "no-domain-provided" }}'
AS
    SELECT
        product_id,
        SUM(quantity) AS sum_quantity
    FROM
table1
    GROUP BY
        product_id;
```

### WITH CHECK CLAUSE

This WITH CHECK CLAUSE clause on a view enforces that any data inserted or updated through the view must satisfy the view’s defining conditions. LOCAL checks only the current view’s conditions, while CASCADED checks conditions of the view and all underlying views. It prevents creating rows that are invisible through the view and cannot be used with recursive views.

> **Danger:**
>
> This syntax is not supported in Snowflake.

#### Input Code:

##### PostgreSQL

```sql
CREATE VIEW updatable_products AS
    SELECT id, name, price
    FROM products
    WHERE price > 0
WITH LOCAL CHECK OPTION;
```

#### Output Code:

##### Snowflake

```sql
CREATE VIEW updatable_products
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/14/2025",  "domain": "no-domain-provided" }}'
AS
    SELECT id, name, price
    FROM
products
    WHERE price > 0;
```

### WITH PARAMETERS OPTIONS

This WITH PARAMETERS OPTIONS allows setting optional properties for the view, such as how modifications through the view are checked (check_option) and whether to enforce row-level security (security_barrier).

> **Danger:**
>
> This syntax is not supported in Snowflake.

#### Input Code:

##### PostgreSQL

```sql
CREATE VIEW large_orders WITH (security_barrier=true, check_option=local) AS
    SELECT order_id, customer_id, total_amount
    FROM orders
    WHERE total_amount > 1000;
```

#### Output Code:

##### Snowflake

```sql
CREATE VIEW large_orders
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "05/14/2025",  "domain": "no-domain-provided" }}'
AS
    SELECT order_id, customer_id, total_amount
    FROM
orders
    WHERE total_amount > 1000;
```

### VALUES OPTION

> **Hint:**
>
> This syntax is fully supported in Snowflake.

#### Input Code:

##### PostgreSQL

```sql
CREATE VIEW numbers_view (number_1) AS
    VALUES (1,2), (2,2), (3,2), (4,2), (5,2);
```

#### Output Code:

##### Snowflake

```sql
CREATE VIEW numbers_view
AS
SELECT
*
FROM
(
        VALUES (1,2), (2,2), (3,2), (4,2), (5,2)
) AS numbers_view (
        number_1
);
```

---
title: SnowConvert AI - PostgreSQL - Data types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/data-types/postgresql-data-types.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - Data types

Current Data types conversion for PostgreSQL to Snowflake.

## Applies to

* PostgreSQL
* Greenplum
* Netezza

Snowflake supports most basic [SQL data types](https://docs.snowflake.com/en/sql-reference/intro-summary-data-types) (with some restrictions) for use in columns, local variables, expressions, parameters, and any other appropriate/suitable locations.

## Numeric Data Types

| PostgreSQL | Snowflake |
| --- | --- |
| INT | INT |
| INT2 | SMALLINT |
| INT4 | INTEGER |
| INT8 | INTEGER |
| INTEGER | INTEGER |
| BIGINT | BIGINT |
| DECIMAL | DECIMAL |
| DOUBLE PRECISION | DOUBLE PRECISION |
| NUMERIC​ | NUMERIC |
| SMALLINT | SMALLINT |
| FLOAT | FLOAT |
| FLOAT4 | FLOAT4 |
| FLOAT8 | FLOAT8 |
| REAL | REAL​ |
| BIGSERIAL/SERIAL8 | INTEGER  *Note: Snowflake supports defining columns as IDENTITY, which automatically generates sequential values. This is the more concise and often preferred approach in Snowflake.* |

## Character Types

| PostgreSQL | Snowflake |
| --- | --- |
| VARCHAR | VARCHAR  *Note: VARCHAR holds Unicode UTF-8 characters. If no length is specified, the default is the maximum allowed length (16,777,216).* |
| CHAR | CHAR |
| CHARACTER | CHARACTER  *Note:* Snowflake’s CHARACTER is an alias for VARCHAR. |
| NCHAR | NCHAR |
| BPCHAR | VARCHAR  *Note: BPCHAR data type is **not supported** in Snowflake. VARCHAR is used instead. For more information please refer to* [*SSC-FDM-PG0002*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*.* |
| CHARACTER VARYING | CHARACTER VARYING |
| NATIONAL CHARACTER | NCHAR |
| NATIONAL CHARACTER VARYING | NCHAR VARYING |
| TEXT | TEXT |
| [NAME](https://www.postgresql.org/docs/current/datatype-character.html) (Special character type) | VARCHAR |

## Boolean Types

| PostgreSQL | Snowflake |
| --- | --- |
| BOOL/BOOLEAN | BOOLEAN |

## Binary Types

| PostgreSQL | Snowflake |
| --- | --- |
| BYTEA | BINARY |

## Bit String Types

| PostgreSQL | Snowflake |
| --- | --- |
| BIT | CHARACTER |
| BIT VARYING | CHARACTER VARYING |
| VARBIT | CHARACTER VARYING |

## Date & Time Data

| PostgreSQL | Snowflake |
| --- | --- |
| DATE | DATE |
| TIME | TIME |
| TIME WITH TIME ZONE | TIME  *Note: Time zone not supported for time data type. For more information, please refer to* [*SSC-FDM-0005*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)*.* |
| TIME WITHOUT TIME ZONE | TIME |
| TIMESTAMP | TIMESTAMP |
| TIMESTAMPTZ | TIMESTAMP_TZ |
| TIMESTAMP WITH TIME ZONE | TIMESTAMP_TZ |
| TIMESTAMP WITHOUT TIME ZONE | TIMESTAMP_NTZ |
| INTERVAL YEAR TO MONTH | VARCHAR  *Note: Data type is **not supported** in Snowflake. VARCHAR is used instead. For more information please refer to* [*SSC-EWI-0036*](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*. With the `–UseIntervalDatatype` preview flag, maps to `INTERVAL DAY TO SECOND`. See [Interval Data Types](../../general/interval-data-types).* |
| INTERVAL DAY TO SECOND | VARCHAR  *Note: Data type is **not supported** in Snowflake. VARCHAR is used instead. For more information please refer to* [*SSC-EWI-0036*](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*. With the `–UseIntervalDatatype` preview flag, maps to `INTERVAL DAY TO SECOND`. See [Interval Data Types](../../general/interval-data-types).* |

## Pseudo Types

| PostgreSQL | Snowflake |
| --- | --- |
| UNKNOWN | TEXT  *Note: Data type is **not supported** in Snowflake. TEXT is used instead. For more information please refer to* [*SSC-EWI-0036*](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*.* |

## Array Types

| PostgreSQL | Snowflake |
| --- | --- |
| type [] | ARRAY  *Note: Strongly typed array transformed to ARRAY without type checking. For more information please refer to* [*SSC-FDM-PG0016*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md)*.* |

## Related EWIs

1. [SSC-FDM-PG0002](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Bpchar converted to varchar.
2. [SSC-FDM-PG0003](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Bytea Converted To Binary
3. [SSC-FDM-PG0014](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Unknown Pseudotype transformed to Text Type
4. [SSC-FDM-0005](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): TIME ZONE not supported for time data type.
5. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
6. [SSC-EWI-PG0016](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/postgresqlEWI.md): Bit String Type converted to Varchar Type.
7. [SSC-FDM-PG0016](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): *Strongly typed array transformed to ARRAY without type checking*.

---
title: SnowConvert AI - PostgreSQL - Expressions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/postgresql-expressions.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - Expressions

## ALL & ANY array expressions

<> ALL & = ANY array expressions

### Description

> An expression used to **evaluate and compare** each element of an array against a specified expression. ([PostgreSQL Language Reference ANY & ALL (array)](https://www.postgresql.org/docs/current/functions-comparisons.html#FUNCTIONS-COMPARISONS-ANY-SOME))

### Grammar Syntax

```sql
 expression operator ANY (array expression)
expression operator ALL (array expression)
```

To support this expression SnowConvert AI translates the `<> ALL` to `NOT IN` and the `= ANY` to `IN`

### Sample Source Patterns

#### Input Code:

##### PostgreSQL

```sql
 SELECT some_column <> ALL (ARRAY[1, 2, 3])
FROM some_table;

SELECT *
FROM someTable
WHERE column_name = ANY (ARRAY[1, 2, 3]);
```

##### Output Code:

##### Snowflake

```sql
 SELECT some_column NOT IN (1, 2, 3)
FROM some_table;

SELECT *
 FROM someTable
 WHERE column_name IN (1, 2, 3);
```

#### Known Issues

There are no known issues

#### Related EWIs

There are no related EWIs.

---
title: SnowConvert AI - PostgreSQL - PostgreSQL interactive terminal
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/postgresql-interactive-terminal.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - PostgreSQL interactive terminal

PSQL commands

## Applies to

* PostgreSQL
* Netezza

## Description

> PSQL is a terminal-based front-end to PostgreSQL. It enables you to type in queries interactively, issue them to PostgreSQL, and see the query results. Alternatively, input can be from a file. In addition, it provides a number of meta-commands and various shell-like features to facilitate writing scripts and automating a wide variety of tasks. ([PSQL documentation](https://www.postgresql.org/docs/9.2/app-psql.html)).

In Snowflake, **PSQL commands are not applicable.** While no longer needed for execution, SnowConvert AI retains the original PSQL command as a comment

## Sample Source Patterns

### Input Code:

#### Greenplum

```sql
\set ON_ERROR_STOP TRUE
```

### Output Code:

#### Snowflake

```sql
----** SSC-FDM-PG0015 - PSQL COMMAND IS NOT APPLICABLE IN SNOWFLAKE. **
--\set ON_ERROR_STOP TRUE
```

## Related EWIs

1. [SSC-FDM-PG0015](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md) : PSQL command is not applicable in Snowflake.

---
title: SnowConvert AI - PostgreSQL - Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/etl-bi-repointing/power-bi-postgres-repointing.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - Power BI Repointing

## Description

The Power BI repointing is a feature that provides an easy way to redefine the connections from the M language in the Power Query Editor. This means that the connection parameters will be redefined to point to the Snowflake migration database context. For Postgres, the method in M Language that defined the connection is `PostgreSQL.Database(...).` In Snowflake, there is a connector that depends on some other parameters and the main connection is defined by `Snowflake.Database(...)` method.

## Source Pattern Samples

### Entity Repointing Case: Table

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a table.

**PostgreSQL Connection in the Power Query Editor**

```sql
let
    Source = PostgreSQL.Database("your_connection", "mydatabase"),
    public_products = Source{[Schema="public",Item="products"]}[Data]
in
    public_products
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="public", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="PRODUCTS", Kind="Table"]}[Data],
    public_products = Table.RenameColumns(SourceSfTbl, {{ "PRODUCT_ID", "product_id"}, { "PRODUCT_NAME", "product_name"}, { "PRICE", "price"}, { "STOCK_QUANTITY", "stock_quantity"}})
in
    public_products
```

### Entity Repointing Case: View

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a view. The view uses the same pattern as the tables. It will only be validated with the symbol table; otherwise, it will be converted to a table. DDLs are important to this pattern.

**PostgreSQL Connection in the Power Query Editor**

```sql
let
    Source = PostgreSQL.Database("your_connection", "mydatabase"),
    public_expensive_products = Source{[Schema="public",Item="expensive_products"]}[Data]
in
    public_expensive_products
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="public", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="EXPENSIVE_PRODUCTS", Kind="View"]}[Data],
    public_expensive_products = Table.RenameColumns(SourceSfTbl, {{ "PRODUCT_ID", "product_id"}, { "PRODUCT_NAME", "product_name"}, { "PRICE", "price"}})
in
    public_expensive_products
```

### Embedded SQL Case

This case refers to connections that contain embedded SQL inside them. This sample shows a simple query, but SnowConvert AI covers a range of larger scenarios. Besides, depending on the migrated query, there may be warning messages known as EWI—PRF—FDM. This will help the user identify patterns that need extra attention.

**PostgreSQL Connection in the Power Query Editor**

```sql
let
    Source = Value.NativeQuery(PostgreSQL.Database("your_connection", "mydatabase"), "SELECT * FROM expensive_products", null, [EnableFolding=true])
in
    Source
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "SELECT * FROM
expensive_products", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "PRODUCT_ID", "product_id"}, { "PRODUCT_NAME", "product_name"}, { "PRICE", "price"}})
in
    Source
```

---
title: SnowConvert AI - PostgreSQL - String Comparison
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/postgresql-string-comparison.md
section: Migrations
---

# SnowConvert AI - PostgreSQL - String Comparison

In PostgreSQL and PostgreSQL-based languages (Greenplum, RedShift, Netezza), when comparing fixed-length data types (CHAR, CHARACTER, etc) or comparing fixed-length data types against varchar data types, trailing spaces are ignored. This means that a string like `'water '` (value with a trailing space) would be considered equal to `'water'` (value without a trailing space).

If you compare

```sql
CHAR(6) 'hello', which is stored as 'hello ', with one padded character
```

against

```sql
CHAR(6) 'hello ', with no need to add any padding character
```

They are effectively the same after trailing spaces.

Meanwhile, Snowflake does not have fixed-length character types and takes a more literal approach for its `VARCHAR` data type, treating strings exactly as they are stored, including any trailing blanks. Therefore, in Snowflake, `'water '` is *not* considered equal to `'water'`.

To prevent trailing spaces from affecting string comparison outcomes in PostgreSQL to Snowflake conversions, SnowConvert AI automatically adds `BTRIM` to relevant comparisons as our team has identified. This ensures consistent behavior.

## Sample Source Patterns

Let’s use the following script data to explain string comparison.

```sql
create table table1(c1 char(2), c2 char(2), c3 VARCHAR(2), c4 VARCHAR(2));

insert into table1 values ('a','a ','a','a ');

insert into table1 values ('b','b','b','b');
```

### NULLIF

#### Varchar Data Type

Input Code:

##### PostgreSQL

```sql
SELECT NULLIF(c3,c4) FROM table1;
```

Output Code:

##### Snowflake

```sql
SELECT
NULLIF(c3,c4) FROM
table1;
```

#### Char Data Types

Input Code:

##### PostgreSQL

```sql
select nullif(c1,c2) AS case2 from table1;
```

Output Code:

##### Snowflake

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table1" **
select
nullif(c1,c2) AS case2 from
table1;
```

### GREATEST or LEAST

Input Code:

#### PostgreSQL

```sql
select '"' || greatest(c1, c2) || '"' AS greatest, '"' || least(c1, c2) || '"' AS least from table1;
```

Output Code:

##### Snowflake

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "table1" **
select '"' || GREATEST_IGNORE_NULLS(c1, c2) || '"' AS greatest, '"' || LEAST_IGNORE_NULLS(c1, c2) || '"' AS least from
table1;
```

---
title: SnowConvert AI - PostgreSQL / Greenplum / Netezza - CREATE TYPE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/ddls/postgresql-create-type.md
section: Migrations
---

# SnowConvert AI - PostgreSQL / Greenplum / Netezza - CREATE TYPE

This page describes how SnowConvert translates source `CREATE TYPE` statements for PostgreSQL-family dialects to Snowflake **[native user-defined types](https://docs.snowflake.com/en/sql-reference/sql/create-type)** where supported.

## PostgreSQL

PostgreSQL composite types declared with `CREATE TYPE name AS (...)` are translated to Snowflake `CREATE TYPE ... AS OBJECT (...)`. Other variants (for example `ENUM`, `RANGE`, `CREATE TYPE ... AS BASE`) follow separate rules or default handling and may emit conversion issues.

### Composite types

**Source (PostgreSQL):**

```sql
CREATE TYPE address AS (street VARCHAR(200), city VARCHAR(100), zipcode VARCHAR(10));
```

**Snowflake equivalent:**

```sql
CREATE TYPE address AS OBJECT (street VARCHAR(200), city VARCHAR(100), zipcode VARCHAR(10));
```

#### With schema

**Source (PostgreSQL):**

```sql
CREATE TYPE myschema.person AS (first_name VARCHAR(50), last_name VARCHAR(50), age INTEGER);
```

**Snowflake equivalent:**

```sql
CREATE TYPE myschema.person AS OBJECT (first_name VARCHAR(50), last_name VARCHAR(50), age INTEGER);
```

#### Single attribute

**Source (PostgreSQL):**

```sql
CREATE TYPE wrapper AS (value INTEGER);
```

**Snowflake equivalent:**

```sql
CREATE TYPE wrapper AS OBJECT (value INTEGER);
```

**Notes:** Non-composite `CREATE TYPE` forms are not covered in full on this page; consult EWIs/FDMs and the issue catalogs for enumerations and other definitions.

## Greenplum

Greenplum inherits PostgreSQL-style composite types. SnowConvert maps `CREATE TYPE name AS (...)` to Snowflake `CREATE TYPE ... AS OBJECT (...)`, consistent with the PostgreSQL translation path.

**Source (Greenplum):**

```sql
CREATE TYPE address AS (street VARCHAR(200), city VARCHAR(100), zipcode VARCHAR(10));
```

**Snowflake equivalent:**

```sql
CREATE TYPE address AS OBJECT (street VARCHAR(200), city VARCHAR(100), zipcode VARCHAR(10));
```

**Notes:** For additional `CREATE TYPE` variants (for example `ENUM`), see PostgreSQL-oriented rules and issue catalogs; Greenplum shares the same replacer family as PostgreSQL for supported composite patterns.

## Netezza

Netezza SQL uses the same shared ANSI-style `CREATE TYPE` transformation pipeline as Db2 for the patterns below (scalar alias and attribute-list composite types).

### Type alias (`CREATE TYPE ... AS` scalar)

When the parser produces a predefined (scalar) body, Snowflake receives a normalized `CREATE TYPE name AS <datatype>`.

**Illustrative source (Netezza-style alias):**

```sql
CREATE TYPE email_addr AS VARCHAR(255);
```

**Snowflake equivalent:**

```sql
CREATE TYPE email_addr AS VARCHAR(255);
```

### Structured type (attribute list)

`CREATE TYPE name AS (attr type, ...)` maps to Snowflake `OBJECT(...)`.

**Illustrative source:**

```sql
CREATE TYPE address_t AS (street VARCHAR(100), city VARCHAR(50), state CHAR(2));
```

**Snowflake equivalent:**

```sql
CREATE TYPE address_t AS OBJECT (street VARCHAR(100), city VARCHAR(50), state CHAR(2));
```

**Notes:** Db2 **`CREATE DISTINCT TYPE`** is not a Netezza construct; for IBM Db2 distinct types, see [CREATE TYPE (IBM DB2)](../../db2/db2-create-type.md). Oracle and other dialects have separate translation references.

---
title: SnowConvert AI - PostgreSQL Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md
section: Migrations
---

# SnowConvert AI - PostgreSQL Functional Differences

> **Note:**
>
> SnowConvert AI for PostgreSQL currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.
>
> If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-PG0001

FOUND could have a different behavior in Snowflake in some scenarios.

### Severity

Low

#### Description

The FOUND property in PostgreSQL is a property based on the last executed query, it can be affected by some statements such as `INSERT`, `UPDATE`, `DELETE`, `MERGE`, `SELECT INTO`, `PERFORM`, `FETCH` and `FOR` loops. To read more details about this property, this is [PostgreSQL documentation](https://www.postgresql.org/docs/current/plpgsql-statements.html#PLPGSQL-STATEMENTS-DIAGNOSTICS).

In Snowflake there is not a direct translation for this property, for the following scenarios:

* `INSERT`
* `UPDATE`
* `DELETE`
* `MERGE`

The converted code will be `SQLFOUND` Snowflake property ([Here is the documentation](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/dml-status)) since it behaves like the PostgreSQL `FOUND` property.

For the other cases such as:

* `SELECT INTO`
* `PERFORM`
* `FETCH`

The converted code will be a custom UDF (`IS_FOUND_UDF`) that behaves like the PostgreSQL `FOUND` property.

This happens because `SQLFOUND` changes its value only when at least one row is affected by the last executed query, if the last query does not change any row, it does not change.

While the `IS_FOUND_UDF` only works for statements that returns rows, if no row is returned it, it will return `FALSE`.

##### SQLFOUND Example

```sql
INSERT INTO SampleTable (SampleColumn1)
VALUES ('SampleValue0.1');
```

The last query affects a table, so the `SQLFOUND` is the closest to the PostgreSQL functionality.

##### IS_FOUND_UDF Example

```sql
SELECT SampleColumn FROM SampleTable;
```

The last query will return a row but does not change anything, so the `IS_FOUND_UDF()` is the closest to the PostgreSQL functionality.

##### IS_FOUND_UDF Source Code

```sql
CREATE OR REPLACE FUNCTION FOUND_UDF()
RETURNS BOOLEAN
LANGUAGE SQL
IMMUTABLE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "udf",  "convertedOn": "09/09/2024" }}'
AS
$$
SELECT (count(*) != 0) FROM TABLE(result_scan(last_query_id()))
$$;
```

#### Code Example

##### Insert Statement:

##### PostgreSQL

```sql
-- Found property used with INSERT statement.
CREATE OR REPLACE PROCEDURE FoundUsingInsertProcedure()
LANGUAGE plpgsql
AS $$
BEGIN
    -- Insert into SampleTable
    INSERT INTO SampleTable (SampleColumn1)
    VALUES ('SampleValue0.1');

    SELECT FOUND;
END;
$$;
```

##### Snowflake

```sql
-- Found property used with INSERT statement.
CREATE OR REPLACE PROCEDURE FoundUsingInsertProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS $$
BEGIN
    -- Insert into SampleTable
    INSERT INTO SampleTable (SampleColumn1)
    VALUES ('SampleValue0.1');

    SELECT
        SQLFOUND /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
END;
$$;
```

##### Update Statement:

##### PostgreSQL

```sql
 -- Found property used with UPDATE statement.
CREATE OR REPLACE PROCEDURE FoundUsingUpdateProcedure()
LANGUAGE plpgsql
AS
$$
    BEGIN
        UPDATE SampleTable
        SET SampleColumn1 = 'SampleValue0.1'
        WHERE SampleColumn1 = 'SampleValue0.1';
        SELECT FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with UPDATE statement.
CREATE OR REPLACE PROCEDURE FoundUsingUpdateProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    BEGIN
        UPDATE SampleTable
        SET SampleColumn1 = 'SampleValue0.1'
        WHERE SampleColumn1 = 'SampleValue0.1';
        SELECT
        SQLFOUND /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

##### Delete Statement:

##### PostgreSQL

```sql
 -- Found property used with DELETE statement.
CREATE OR REPLACE PROCEDURE FoundUsingDeleteProcedure()
LANGUAGE plpgsql
AS
$$
    BEGIN
        DELETE FROM SampleTable
        WHERE SampleColumn1 = 'SampleValue0.1';
        SELECT FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with DELETE statement.
CREATE OR REPLACE PROCEDURE FoundUsingDeleteProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    BEGIN
        DELETE FROM
            SampleTable
        WHERE SampleColumn1 = 'SampleValue0.1';
        SELECT
        SQLFOUND /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

##### Merge Statement:

##### PostgreSQL

```sql
 -- Found property used with MERGE statement.
CREATE OR REPLACE PROCEDURE FoundUsingMergeProcedure()
LANGUAGE plpgsql
AS
$$
    BEGIN
        MERGE INTO SampleTableB B
        USING (SELECT * FROM SampleTableA) A
        ON B.SampleColumn1 = A.SampleColumn2
        WHEN MATCHED THEN DELETE;
        SELECT FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with MERGE statement.
CREATE OR REPLACE PROCEDURE FoundUsingMergeProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    BEGIN
        MERGE INTO SampleTableB B
        USING (SELECT * FROM SampleTableA) A
        ON B.SampleColumn1 = A.SampleColumn2
        WHEN MATCHED THEN DELETE !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'MergeStatement' NODE ***/!!!;
        SELECT
        SQLFOUND /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

##### Select Into Statement

##### PostgreSQL

```sql
 -- Found property used with SELECT INTO statement.
CREATE OR REPLACE PROCEDURE FoundUsingSelectIntoProcedure()
LANGUAGE plpgsql
AS
$$
    DECLARE
        SampleNumber INTEGER;
    BEGIN
        SELECT 1 INTO SampleNumber;
        SELECT FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with SELECT INTO statement.
CREATE OR REPLACE PROCEDURE FoundUsingSelectIntoProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    DECLARE
        SampleNumber INTEGER;
    BEGIN
        SELECT 1 INTO
        : SampleNumber;
        SELECT
        FOUND_UDF() /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

##### Perform Statement:

##### PostgreSQL

```sql
 -- Found property used with PERFORM statement.
CREATE OR REPLACE PROCEDURE FoundUsingPerformProcedure()
LANGUAGE plpgsql
AS
$$
    BEGIN
        PERFORM 1;
        RETURN FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with PERFORM statement.
CREATE OR REPLACE PROCEDURE FoundUsingPerformProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    BEGIN
    SELECT
        1;
    RETURN FOUND_UDF() /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

##### Fetch Statement:

##### PostgreSQL

```sql
 -- Found property used with FETCH statement.
CREATE OR REPLACE PROCEDURE FoundUsingFetchProcedure ()
LANGUAGE plpgsql
AS
$$
    DECLARE
        SampleRow VARCHAR;
        SampleCursor CURSOR FOR SELECT EmptyColumn FROM EmptyTable;
    BEGIN
        OPEN SampleCursor;
        FETCH SampleCursor;
        CLOSE SampleCursor;
        SELECT FOUND;
    END;
$$;
```

##### Snowflake

```sql
 -- Found property used with FETCH statement.
CREATE OR REPLACE PROCEDURE FoundUsingFetchProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    DECLARE
        SampleRow VARCHAR;
        SampleCursor CURSOR FOR SELECT EmptyColumn FROM
        EmptyTable;
    BEGIN
        OPEN SampleCursor;
    !!!RESOLVE EWI!!! /*** SSC-EWI-PG0015 - FETCH CURSOR WITHOUT TARGET VARIABLES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
        FETCH SampleCursor;
        CLOSE SampleCursor;
        SELECT
        FOUND_UDF() /*** SSC-FDM-PG0001 - FOUND COULD HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE IN SOME SCENARIOS. ***/;
    END;
$$;
```

## SSC-FDM-PG0002

Bpchar converted to varchar.

### Description

This warning is added because bpchar type (“blank-padded char”) may have some functional equivalence difference compared to the varchar data type in Snowflake. However, both data types can store the values up to the “n” length of characters and consume storage for only the amount of actual data stored. The main difference occurs when there are blanks at the end of the data, where bpchar does not store them but snowflake does.

For this reason, we can use the RTRIM function so that these blanks are not stored. But there may be cases where the functionality is not completely equivalent.

#### Code Example

##### Input Code:

##### Column Definition

```sql
CREATE TABLE table1 (
    col1 BPCHAR,
    col2 BPCHAR(20)
);
```

##### Explicit Cast

```sql
SELECT 'Y'::BPCHAR;
SELECT 'Y   '::BPCHAR(20);
SELECT COL1::BPCHAR(20) FROM tbl;
```

##### Generated Code:

##### Column Definition

```sql
CREATE TABLE table1 (
    col1 VARCHAR /*** SSC-FDM-PG0002 - BPCHAR CONVERTED TO VARCHAR. THESE TYPES MAY HAVE SOME FUNCTIONAL DIFFERENCES. ***/,
    col2 VARCHAR(20) /*** SSC-FDM-PG0002 - BPCHAR CONVERTED TO VARCHAR. THESE TYPES MAY HAVE SOME FUNCTIONAL DIFFERENCES. ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';
```

##### Explicit Cast

```sql
SELECT 'Y':: VARCHAR /*** SSC-FDM-PG0002 - BPCHAR CONVERTED TO VARCHAR. THESE TYPES MAY HAVE SOME FUNCTIONAL DIFFERENCES. ***/;

SELECT
    RTRIM( 'Y   ') :: VARCHAR(20) /*** SSC-FDM-PG0002 - BPCHAR CONVERTED TO VARCHAR. THESE TYPES MAY HAVE SOME FUNCTIONAL DIFFERENCES. ***/;

SELECT
    RTRIM( COL1) :: VARCHAR(20) /*** SSC-FDM-PG0002 - BPCHAR CONVERTED TO VARCHAR. THESE TYPES MAY HAVE SOME FUNCTIONAL DIFFERENCES. ***/
FROM
    tbl;
```

#### Best Practices

* The **`rtrim`** function can resolve storage differences in case you want those blanks not to be stored. This case is handled in the [explicit cast](https://docs.snowflake.com/en/sql-reference/functions/cast.html), however, there may be other scenarios where it has to be handled manually. For more information refer to the Snowflake documentation about [RTRIM](https://docs.snowflake.com/en/sql-reference/functions/rtrim.html).

## SSC-FDM-PG0003

Bytea Converted To Binary

### Description

This warning is added because when the bytea data type is converted to binary the size limit is greatly reduced from 1GB to 8MB.

#### Code Example

##### Input Code:

```sql
CREATE TABLE tbl(
    col BYTEA
);
```

##### Generated Code:

```sql
CREATE TABLE tbl (
    col BINARY /*** SSC-FDM-PG0003 - BYTEA CONVERTED TO BINARY. SIZE LIMIT REDUCED FROM 1GB TO 8MB ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* For more information refer to the Snowflake documentation about [Binary Data Type](https://docs.snowflake.com/en/sql-reference/data-types-text.html#binary).

## SSC-FDM-PG0004

The date output format may vary

### Description

The date output format may vary depending on the Timestamp type and the timestamp_output_format being used, see the [Snowflake CURRENT_TIMESTAMP documentation](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp.html).

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
CREATE TABLE table1 (
    dt_update timestamp without time zone DEFAULT clock_timestamp()
);
```

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE table1 (
    dt_update TIMESTAMP_NTZ DEFAULT CAST(
    --** SSC-FDM-PG0004 - THE DATE OUTPUT FORMAT MAY VARY DEPENDING ON THE TIMESTAMP TYPE AND THE TIMESTAMP_OUTPUT_FORMAT BEING USED. **
    CURRENT_TIMESTAMP() AS TIMESTAMP_NTZ)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';
```

### Samples

Example with CREATE TABLE.

#### Input Code:

##### PostgreSQL

```sql
CREATE TABLE sample2 (
    platform_id integer NOT NULL,
    dt_update timestamp with time zone DEFAULT clock_timestamp()
);

insert into postgres.public.sample2 (platform_id) values (1);

select *, clock_timestamp() from postgres.public.sample2;
```

##### Results

| platform_id | dt_update | clock_timestamp |
| --- | --- | --- |
| 1 | 2023-02-05 22:47:34.275 -0600 | 2023-02-05 23:16:15.754 -0600 |

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE sample2 (
    platform_id integer NOT NULL,
    dt_update TIMESTAMP_TZ DEFAULT CAST(
--** SSC-FDM-PG0004 - THE DATE OUTPUT FORMAT MAY VARY DEPENDING ON THE TIMESTAMP TYPE AND THE TIMESTAMP_OUTPUT_FORMAT BEING USED. **
CURRENT_TIMESTAMP() AS TIMESTAMP_TZ)
);

insert into postgres.public.sample2 (platform_id) values (1);
ALTER SESSION SET timestamp_output_format = 'YYYY-MM-DD HH24:MI:SS.FF';

select *,
CURRENT_TIMESTAMP(3)
from
postgres.public.sample2;
```

##### Results

| PLATFORM_ID | DT_UPDATE | CURRENT_TIMESTAMP(3) |
| --- | --- | --- |
| 1 | 2023-02-05 20:52:30.082000000 | 2023-02-05 21:20:31.593 |

Example with SELECT with clock_timestamp().

##### Input Code

##### PostgreSQL

```sql
select clock_timestamp();
```

##### Results

| clock_timestamp |
| --- |
| 2023-02-05 23:24:13.740 |

##### Generated Code

##### Snowflake

```sql
ALTER SESSION SET timestamp_output_format = 'YYYY-MM-DD HH24:MI:SS.FF';
select
    CURRENT_TIMESTAMP(3);
```

##### Results

| CURRENT_TIMESTAMP(3) |
| --- |
| 2023-02-05 21:29:24.258 |

## SSC-FDM-PG0005

UNLOGGED Table is not supported in Snowflake; data written may have different performance.

### Description

PostgreSQL’s `UNLOGGED` tables offer a significant speed advantage by skipping write-ahead logging (WAL). However, their data isn’t replicated to mirror instances. Snowflake doesn’t support this functionality, so the `UNLOGGED` clause will be commented out.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
CREATE UNLOGGED TABLE TABLE1 (
   COL1 integer
);
```

##### Generated Code:

##### Snowflake

```sql
CREATE
--       --** SSC-FDM-PG0005 - UNLOGGED TABLE IS NOT SUPPORTED IN SNOWFLAKE, DATA WRITTEN MAY HAVE DIFFERENT PERFORMANCE. **
--       UNLOGGED
                TABLE TABLE1 (
COL1 integer
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "greenplum",  "convertedOn": "04/21/2025",  "domain": "test" }}';
```

## SSC-FDM-PG0006

Set search path with multiple schemas.

### Description

Set search path with multiple schemas is not supported in Snowflake, see the [Snowflake USE SCHEMA documentation](https://docs.snowflake.com/en/sql-reference/sql/use-schema.html).

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 SET SEARCH_PATH TO schema1, schema2, schema3;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-PG0006 - SET SEARCH PATH WITH MULTIPLE SCHEMAS IS NOT SUPPORTED IN SNOWFLAKE **
USE SCHEMA schema1 /*, schema2, schema3*/;
```

## SSC-FDM-PG0007

NULL is converted to ‘’ and may have a different behavior in Snowflake.

### Severity

Low

#### Description

In PostgreSQL the removal of a comment is handled by using the `NULL` term. However, in Snowflake, a similar method for removing a comment is to assign the value of an empty string `''` to provide the same result. This approach ensures that the comment is effectively mapped to an empty string with a similar behavior.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 COMMENT ON TABLE mytable IS NULL;
```

##### Generated Code:

##### Snowflake

```sql
 COMMENT ON TABLE mytable IS '' /*** SSC-FDM-PG0007 - NULL IS CONVERTED TO '' AND MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/;
```

## SSC-FDM-PG0008

Select into unlogged tables are not supported by Snowflake.

### Description

Select Into is not supported by Snowflake, this functionality was emulated with `CREATE TABLE AS`. In addition, Snowflake always uses transaction logs to protect tables and ensure data integrity and recoverability. Consequently, tables with the `UNLOGGED` option are not supported by Snowflake.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
select column1
      into UNLOGGED NewTable
      from oldTable;
```

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE IF NOT EXISTS NewTable AS
      select column1
--      --** SSC-FDM-PG0008 - SELECT INTO UNLOGGED TABLES ARE NOT SUPPORTED BY SNOWFLAKE. **
--            into UNLOGGED NewTable
            from
            oldTable;
```

## SSC-FDM-PG0009

Sequence nextval property snowflake does not guarantee generating sequence numbers without gaps

### Description

Snowflake does not guarantee generating sequence numbers without gaps. The generated numbers consistently increase in value (or decrease in value if the step size is negative) but are not necessarily contiguous.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 SELECT nextval('seq1');
```

##### Generated Code:

##### Snowflake

```sql
 SELECT seq1.nextval /*** SSC-FDM-PG0009 - THE SEQUENCE NEXTVAL PROPERTY SNOWFLAKE DOES NOT GUARANTEE GENERATING SEQUENCE NUMBERS WITHOUT GAPS. ***/;
```

## SSC-FDM-PG0010

Datatype of the left operand could not be determined. Results may vary due to the behavior of Snowflake’s bitwise function

### Description

The bitwise operators [`<<`](https://www.postgresql.org/docs/9.4/functions-bitstring.html) and [`>>`](https://www.postgresql.org/docs/9.4/functions-bitstring.html) are converted to the corresponding Snowflake functions [`BITSHIFTLEFT`](https://docs.snowflake.com/en/sql-reference/functions/bitshiftleft) and [`BITSHIFTRIGHT`](https://docs.snowflake.com/en/sql-reference/functions/bitshiftright). However, this transformation depends on knowing semantic information about the left operand, more specifically its datatype.

For shift operations involving integer left operands, the MOD function should be applied to the right operand to get equivalent results, as well as using the `INTEGER_BITSHIFTLEFT_UDF` helper for ensuring the equivalence of the shift left operation on integers. When the datatype of the left operand can not be determined, SnowConvert AI will generate this FDM to warn about the potential functional differences.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
CREATE TABLE someTable (
  intCol INTEGER,
  smallIntCol SMALLINT,
  varbyteCol VARBYTE,
  incrementValue INTEGER
)
;

SELECT
  intCol << incrementValue,
  smallIntCol >> incrementValue,
  varbyteCol << incrementValue
FROM someTable;

SELECT missingCol << incrementValue FROM missingTable;
```

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE someTable (
  intCol INTEGER,
  smallIntCol SMALLINT,
  varbyteCol BINARY,
  incrementValue INTEGER
)
;

SELECT
  PUBLIC.INTEGER_BITSHIFTLEFT_UDF(
  intCol, MOD(incrementValue, 32), 32),
  BITSHIFTRIGHT(
  smallIntCol, MOD(incrementValue, 16)),
  BITSHIFTLEFT(
  varbyteCol, incrementValue)
FROM
  someTable;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "missingTable" **
SELECT
  --** SSC-FDM-PG0010 - DATATYPE OF THE LEFT OPERAND COULD NOT BE DETERMINED. RESULTS MAY VARY DUE TO THE BEHAVIOR OF SNOWFLAKE'S BITSHIFTLEFT BITWISE FUNCTION **
  BITSHIFTLEFT( missingCol, incrementValue) FROM
  missingTable;
```

#### Best Practices

* Ensure the source code you migrate has no missing depedencies, by providing any missing object to SnowConvert AI the operands semantic information should be extracted correctly and this FDM should no longer appear

## SSC-FDM-PG0011

The use of the COLLATE column constraint has been disabled for this pattern-matching condition

### Description

This message is added when a pattern-matching condition uses arguments with COLLATE specifications, as they are not currently supported in Snowflake’s regular expression function. Consequently, the COLLATE clause must be disabled to use this function, which may result in differences in the results.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE CASE_INSENSITIVE,
col2 VARCHAR(30) COLLATE CASE_SENSITIVE);

INSERT INTO collateTable values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
col1 SIMILAR TO 'Hello%' as ci,
col2 SIMILAR TO 'Hello%' as cs
FROM collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| TRUE | FALSE |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE 'en-ci',
col2 VARCHAR(30) COLLATE 'en-cs'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "01/16/2025",  "domain": "test" }}';

INSERT INTO collateTable
values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
RLIKE(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col1, ''), 'Hello.*', 's') as ci,
RLIKE(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col2, ''), 'Hello.*', 's') as cs
FROM
collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| FALSE | FALSE |

#### Best Practices

* If you require equivalence for these scenarios, you can manually add the following parameters to the function to achieve functional equivalence:

  | Parameter | Description |
  | --- | --- |
  | `c` | Case-sensitive matching |
  | `i` | Case-insensitive matching |
* For more information please refer to the following [link](https://docs.snowflake.com/en/sql-reference/functions-regexp#specifying-the-parameters-for-the-regular-expression).

## SSC-FDM-PG0012

NOT NULL constraint has been removed. Assigning NULL to this variable will no longer cause a failure.

### Description

In PostgreSQL, specifying the NOT NULL constraint ensures that assigning a null value to a variable results in a runtime error. Since this clause does not exist in Snowflake, it is removed during transformation and assigning a NULL to this variable will no longer fail in execution.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE OR REPLACE PROCEDURE variable_Not_Null()
LANGUAGE plpgsql
AS $$
DECLARE
    v_notnull VARCHAR NOT NULL DEFAULT 'Test default';
BEGIN
    v_notnull := NULL;
    -- Procedure logic
END;
$$;
```

##### Result

[22004] ERROR: NULL cannot be assigned to variable “v_notnull” declared NOT NULL

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE variable_Not_Null ()
RETURNS VARCHAR
LANGUAGE SQL
AS $$
DECLARE
    --** SSC-FDM-PG0012 - NOT NULL CONSTRAINT HAS BEEN REMOVED. ASSIGNING NULL TO THIS VARIABLE WILL NO LONGER CAUSE A FAILURE. **
    v_notnull VARCHAR DEFAULT 'Test default';
BEGIN
    v_notnull := NULL;
    -- Procedure logic
END;
$$;
```

##### Result

> **Note:**
>
> This assignment will not fail in Snowflake.

#### Best Practices

* Review the procedure logic to ensure this variable is not assigned a `NULL` value.

## SSC-FDM-PG0013

Function syntactically supported by Snowflake but may have functional differences

### Description

This functional difference message indicates that while Snowflake supports the function’s syntax (either directly or through an equivalent mapping), its behavior might be *different* from the original in some situations.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
SELECT
    LISTAGG(skill) WITHIN GROUP (ORDER BY skill) OVER (PARTITION BY
    employee_name) AS employee_skills
FROM
    employees;
```

##### Generated Code:

##### Snowflake

```sql
SELECT
--** SSC-FDM-PG0013 - FUNCTION SYNTACTICALLY SUPPORTED BY SNOWFLAKE BUT MAY HAVE FUNCTIONAL DIFFERENCES **
LISTAGG(skill) WITHIN GROUP (ORDER BY skill) OVER (PARTITION BY
employee_name) AS employee_skills
FROM
    employees;
```

#### Best Practices

* Carefully evaluate the functional behavior for unexpected results, as differences may only occur in specific scenarios.

## SSC-FDM-PG0014

Unknown Pseudotype transformed to Text Type

### Description

This functional difference message indicates that UNKNOWN Pseudo Type used in PostgreSQL is not supported in Snowflake and is transformed to a Text Type.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
CREATE TABLE PSEUDOTYPES
(
  COL1 UNKNOWN
)
```

##### Generated Code:

##### Snowflake

```sql
CREATE TABLE PSEUDOTYPES (
  COL1 TEXT /*** SSC-FDM-PG0014 -  UNKNOWN PSEUDOTYPE TRANSFORMED TO TEXT TYPE ***/
)
```

#### Best Practices

* Carefully evaluate the usages for the columns with Unknown Data Types, as differences may occur in specific scenarios.

## SSC-FDM-PG0015

PSQL command is not applicable in Snowflake

### Description

In Snowflake, **PSQL commands are not applicable.** While no longer needed for execution, SnowConvert AI retains the original PSQL command as a comment.

#### Example Code

##### Input Code:

```sql
 \set ON_ERROR_STOP TRUE
```

##### Generated Code:

```sql
 ----** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. COMMAND OPTION **
--\set ON_ERROR_STOP TRUE
```

## SSC-FDM-PG0016

Strongly typed array transformed to ARRAY without type checking.

### Description

SnowConvert AI will add this warning because PostgreSQL supports arrays of any built-in or user-defined base type, enum type, composite type, range type, or domain, whereas Snowflake does not. In Snowflake, each value in a semi-structured array is of type VARIANT.

#### Example Code

##### Input Code:

```sql
CREATE TABLE sal_emp (
    name            text,
    pay_by_quarter  integer[],
    schedule        text[][]
);
```

##### Generated Code:

```sql
CREATE TABLE sal_emp (
    name            text,
    pay_by_quarter ARRAY /*** SSC-FDM-PG0016 - STRONGLY TYPED ARRAY 'INTEGER[]' TRANSFORMED TO ARRAY WITHOUT TYPE CHECKING ***/,
    schedule ARRAY /*** SSC-FDM-PG0016 - STRONGLY TYPED ARRAY 'TEXT[][]' TRANSFORMED TO ARRAY WITHOUT TYPE CHECKING ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "06/03/2025",  "domain": "no-domain-provided" }}';
```

## SSC-FDM-PG0017

User Defined function that returns a void was transformed to a Snowflake procedure.

### Description

SnowConvert AI will generate a warning for any function that returns void. This is because functions returning void typically indicate a procedure rather than a value-producing operation, which can sometimes require special handling during conversion.

#### Example Code

##### Input Code:

```sql
CREATE OR REPLACE FUNCTION log_user_activity(
    user_id_param INT,
    action_param TEXT
)
RETURNS VOID AS $$
BEGIN
    INSERT INTO user_activity_log (user_id, action, activity_timestamp)
    VALUES (user_id_param, action_param, NOW());
END;
$$ LANGUAGE plpgsql;
```

##### Generated Code:

```sql
--** SSC-FDM-PG0017 - USER DEFINED FUNCTION THAT RETURNS VOID WAS TRANSFORMED TO SNOWFLAKE PROCEDURE **
CREATE OR REPLACE PROCEDURE log_user_activity (
user_id_param INT,
    action_param TEXT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "07/23/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
    INSERT INTO user_activity_log (user_id, action, activity_timestamp)
    VALUES (:user_id_param, : action_param, CURRENT_TIMESTAMP());
END;
$$;
```

## SSC-FDM-PG0018

Analyze statement is commented out, which is not applicable in Snowflake.

### Description

SnowConvert AI flags ANALYZE statements with a warning and comments them out. While ANALYZE is used in PostgreSQL for collecting table statistics, Snowflake automatically manages this process, making the statement redundant and generally unnecessary post-conversion.

#### Example Code

##### Input Code:

```sql
ANALYZE customers (first_name, last_name)
```

##### Generated Code:

```sql
----** SSC-FDM-PG0018 - ANALYZE STATEMENT IS COMMENTED OUT, WHICH IS NOT APPLICABLE IN SNOWFLAKE. **
--ANALYZE customers (first_name, last_name)
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - PostgreSQL Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/postgresqlEWI.md
section: Migrations
---

# SnowConvert AI - PostgreSQL Issues

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for PostgreSQL focuses its assessment and translation capabilities primarily on TABLES and VIEWS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

## SSC-EWI-PG0001

Age is not supported on Snowflake

### Severity

Medium

#### Description

This error is added because SnowConvert AI does not support the `age()` functionality.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 SELECT
   age(date1::date, date2::date)
FROM
   Table1;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "age", "Table1" **
SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-PG0001 - AGE IS NOT SUPPORTED ON SNOWFLAKE. ***/!!!
   AGE(date1::date, date2::date)
FROM
   Table1;
```

#### Best Practices

* The `Datediff` time function can solve some cases where the objective of the query is to obtain a specific range of values but this has to be handled manually for each scenario. For more information please refer to the Snowflake documentation about [Datediff](https://docs.snowflake.com/en/sql-reference/functions/datediff.html).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0002

Constraint index parameter not supported

### Severity

Low

#### Description

The use of the following index parameters in constraints are not supported by Snowflake.

* INCLUDE
* WITH
* USING INDEX TABLESPACE

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE TABLE Table1 (
    code        char(5),
    date_prod   date,
    CONSTRAINT production UNIQUE(date_prod) INCLUDE(code)
);

CREATE TABLE Table2 (
    name    varchar(40),
    UNIQUE(name) WITH (fillfactor=70)
);

CREATE TABLE Table3 (
    name    varchar(40),
    PRIMARY KEY(name) USING INDEX TABLESPACE tablespace_name
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE Table1 (
    code        char(5),
    date_prod   date,
    CONSTRAINT production UNIQUE(date_prod)
                                            !!!RESOLVE EWI!!! /*** SSC-EWI-PG0002 - INCLUDE PARAMETER NOT APPLICABLE. CONSTRAINT INDEX PARAMETERS ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!! INCLUDE(code)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';

CREATE TABLE Table2 (
    name    varchar(40),
    UNIQUE(name)
                 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0002 - WITH PARAMETER NOT APPLICABLE. CONSTRAINT INDEX PARAMETERS ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!! WITH (fillfactor=70)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';

CREATE TABLE Table3 (
    name    varchar(40),
    PRIMARY KEY(name)
                      !!!RESOLVE EWI!!! /*** SSC-EWI-PG0002 - USING PARAMETER NOT APPLICABLE. CONSTRAINT INDEX PARAMETERS ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!! USING INDEX TABLESPACE tablespace_name
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0003

Inheritance not supported

### Severity

Low

#### Description

Inheritance between tables is allowed in PostgreSQL, but Snowflake does not support it. For more information about inheritance in PostgreSQL click [here](https://www.postgresql.org/docs/current/ddl-inherit.html).

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 ALTER TABLE Table1
ADD CONSTRAINT const3 UNIQUE (zip);
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0003 - TABLE INHERITANCE IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
ALTER TABLE Table1
ADD CONSTRAINT const3 UNIQUE (zip);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0004

Exclude constraint not supported

### Severity

Medium

#### Description

The exclude constraint used in PostgreSQL is not supported by Snowflake.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE TABLE Table1 (
    id      int,
    EXCLUDE USING gist (id WITH &&)
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE Table1 (
    id      int,
    !!!RESOLVE EWI!!! /*** SSC-EWI-PG0004 - EXCLUDE CONSTRAINT IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
    EXCLUDE USING gist (id WITH &&)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0006

Reference to a variable using the Label is not supported by Snowflake.

### Severity

Medium

#### Description

This error is added when a FOR loop’s body references a variable using the label. Snowflake does not support referencing a variable using the qualified name.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE OR REPLACE PROCEDURE procedure1(out result VARCHAR(100))
LANGUAGE plpgsql
AS $$
BEGIN
result := '<';
<<outer_loop>>
for i in 1..3 loop
  <<inner_loop>>
  for i in 4..6 loop
  result := result || '(' || outer_loop.i || ', ' || i || ')';
  end loop inner_loop;
end loop outer_loop;
result := result || '>';
END;
$$;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (result OUT VARCHAR(100))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
result := '<';
for i in 1 TO 3
                --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                loop
  for i in 4 TO 6
                  --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                  loop
  result := result || '(' ||
                             !!!RESOLVE EWI!!! /*** SSC-EWI-PG0006 - REFERENCE TO A VARIABLE USING THE LABEL IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! outer_loop.i || ', ' || i || ')';
  end loop inner_loop;
end loop outer_loop;
result := result || '>';
END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0007

Into clause in Dynamic SQL is not support in Snowflake

### Severity

Low

#### Description

PostgreSQL Dynamic SQL allows the `INTO` clause to store query results in variables. Snowflake does not support this functionality. Therefore, the `INTO` clause will be flagged with an EWI’.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE OR REPLACE PROCEDURE get_max_id(table_name VARCHAR, OUT max_id INTEGER)
AS $$
DECLARE
    sql_statement VARCHAR;
BEGIN
    sql_statement := 'SELECT MAX(id) FROM ' || table_name || ';';
    EXECUTE sql_statement INTO max_id;
END;
$$ LANGUAGE plpgsql;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE get_max_id (table_name VARCHAR, max_id OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS $$
DECLARE
    sql_statement VARCHAR;
BEGIN
    sql_statement := 'SELECT MAX(id) FROM ' || table_name || ';';
    EXECUTE IMMEDIATE sql_statement
                                    !!!RESOLVE EWI!!! /*** SSC-EWI-PG0007 - INTO CLAUSE IN DYNAMIC SQL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! INTO max_id;
END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0008

The use of interval within a to_char function is not compatible with Snowflake.

### Severity

High

#### Description

The use of `interval` within the `to_char` to convert date/times data types into text is not supported in Snowflake.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 SELECT to_char(interval '15h 2m 12s', 'HH24:MI:SS');
```

##### Generated Code:

##### Snowflake

```sql
 SELECT to_char(INTERVAL '15h, 2m, 12s', 'HH24:MI:SS') !!!RESOLVE EWI!!! /*** SSC-EWI-PG0008 - THE USE OF INTERVAL WITHIN TO_CHAR IS NOT SUPPORTED BY SNOWFLAKE. ***/!!!;
```

For more information please refer to

* PostgreSQL [to_char](https://www.postgresql.org/docs/15/functions-formatting.html).
* Snowflake [to_char](https://docs.snowflake.com/en/sql-reference/functions/to_char).

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0009

Comment on ‘Type’ is not supported by Snowflake.

### Severity

Low

#### Description

In the original code, there are various objects that can receive comments. However, in Snowflake, several of these objects do not exist, and thus, comments cannot be assigned to them. The code for handling these scenarios is commented out to prevent any potential errors.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 COMMENT ON RULE rule_name on TABLE_NAME IS 'this is a comment';
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0009 - COMMENT ON 'RULE' IS NOT SUPPORTED BY SNOWFLAKE. ***/!!!
COMMENT ON RULE rule_name on TABLE_NAME IS 'this is a comment';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0010

Create temporary sequence is not supported by Snowflake

### Severity

Low

#### Description

When a temporary sequence is created in PostgreSQL, it is only created for the active session and is automatically deleted when you log out of the session. However, this functionality is not available in Snowflake, so it is generated as a normal sequence. When executed, a similar sequence name may already exist, which will cause an error for an existing object.

#### Code Example

##### Input code:

##### PostgreSQL

```sql
 CREATE TEMPORARY SEQUENCE sequence1;
CREATE TEMP SEQUENCE sequence2;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-PG0009 - THE SEQUENCE NEXTVAL PROPERTY SNOWFLAKE DOES NOT GUARANTEE GENERATING SEQUENCE NUMBERS WITHOUT GAPS. **
CREATE TEMPORARY !!!RESOLVE EWI!!! /*** SSC-EWI-PG0010 - CREATE TEMPORARY SEQUENCE IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! SEQUENCE sequence1;

--** SSC-FDM-PG0009 - THE SEQUENCE NEXTVAL PROPERTY SNOWFLAKE DOES NOT GUARANTEE GENERATING SEQUENCE NUMBERS WITHOUT GAPS. **
 CREATE TEMP !!!RESOLVE EWI!!! /*** SSC-EWI-PG0010 - CREATE TEMPORARY SEQUENCE IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! SEQUENCE sequence2;
```

### Best Practices

* If you have a creation problem, you can try to rename the sequence to avoid collisions.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-PG0011

The sequence option ‘option_name’ is not supported by Snowflake.

### Severity

Low

#### Description

Some options available in PostgreSQL for the sequence statement are not supported by Snowflake.

The unsupported options are:

* Unlogged.
* AS <data_type>.
* MinValue.
* MaxValue.
* No MinValue.
* No MaxValue.
* Cache.
* Cycle.
* Owner By.

#### Code Example

##### Input code:

##### PostgreSQL

```sql
 CREATE UNLOGGED SEQUENCE sequence_name;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-PG0009 - THE SEQUENCE NEXTVAL PROPERTY SNOWFLAKE DOES NOT GUARANTEE GENERATING SEQUENCE NUMBERS WITHOUT GAPS. **
CREATE UNLOGGED !!!RESOLVE EWI!!! /*** SSC-EWI-PG0011 - 'UNLOGGED' IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! SEQUENCE sequence_name;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0012

NOT VALID constraint option is not supported by Snowflake.

### Description

The [`NOT VALID`](https://www.postgresql.org/docs/current/sql-altertable.html#SQL-ALTERTABLE-DESC-ADD-TABLE-CONSTRAINT) constraint option is used in the context of adding or altering a constraint to indicate that the constraint should be added or modified without checking the existing data for compliance with the constraint. This clause is not supported by Snowflake.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 ALTER TABLE Table1 *
ADD CONSTRAINT const UNIQUE (zip) NOT VALID;
```

##### Generated Code:

##### Snowflake

```sql
 ALTER TABLE Table1
ADD CONSTRAINT const UNIQUE (zip)
                                  !!!RESOLVE EWI!!! /*** SSC-EWI-PG0012 - NOT VALID CONSTRAINT OPTION IS NOT SUPPORTED BY SNOWFLAKE. ***/!!! NOT VALID;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0014

Snowflake scripting cursors do not support fetch orientation

### Severity

Medium

#### Description

In Snowflake, the [FETCH cursor](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/fetch) statement always fetches the next row in the cursor. When transforming the code, SnowConvert AI will transform cursor orientations that are equivalent to a FETCH NEXT as they are functionally equivalent in Snowflake, namely:

* `FETCH NEXT`
* `FETCH FORWARD`
* `FETCH RELATIVE 1`
* `FETCH` (no orientation specified)

Any other orientation is unsupported and the FETCH statement will be marked with this EWI to reflect that.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE OR REPLACE PROCEDURE cursor_test()
AS $$
BEGIN
   FETCH FORWARD FROM cursor1 INTO my_var;
   FETCH FIRST FROM cursor1 INTO my_var;
   FETCH LAST FROM cursor1 INTO my_var;
END;
$$;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE cursor_test ()
RETURNS VARCHAR
AS $$
BEGIN
   FETCH
   	cursor1 INTO my_var;
   !!!RESOLVE EWI!!! /*** SSC-EWI-PG0014 - SNOWFLAKE SCRIPTING CURSORS DO NOT SUPPORT FETCH ORIENTATION. ***/!!!
   FETCH FIRST FROM cursor1 INTO my_var;
   !!!RESOLVE EWI!!! /*** SSC-EWI-PG0014 - SNOWFLAKE SCRIPTING CURSORS DO NOT SUPPORT FETCH ORIENTATION. ***/!!!
   FETCH LAST FROM cursor1 INTO my_var;
END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0015

Fetch cursor without target variables is not supported in Snowflake

### Severity

Medium

#### Description

In PostgreSQL, it is possible to use a [FETCH statement](https://www.postgresql.org/docs/current/sql-fetch.html) without INTO to print on the console the values of fetched rows. However, Snowflake requires the [FETCH statement](https://docs.snowflake.com/en/sql-reference/snowflake-scripting/fetch) to specify the INTO clause with the variables where the fetched row values are going to be stored.

Whenever a FETCH with no INTO is found in the code, SnowConvert AI will generate this EWI to notify the user that this type of FETCH is not supported.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 FETCH PRIOR FROM cursor1;
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0015 - FETCH CURSOR WITHOUT TARGET VARIABLES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FETCH PRIOR FROM cursor1;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0016

Bit String Type converted to Varchar Type

### Severity

Low

#### Description

When migrating from PostgreSQL, be aware that its BIT String Types and related functions are not natively supported in Snowflake. These data types will be converted to Snowflake’s VARCHAR. This conversion means that any PostgreSQL queries or application logic that depend on bitwise operations on these columns will require significant modification to achieve the same functionality in Snowflake.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE TABLE table1 (
   col1 bit(10)
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
   col1 CHARACTER(10) !!!RESOLVE EWI!!! /*** SSC-EWI-PG0016 - BIT DATA TYPE CONVERTED TO CHARACTER ***/!!!
);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0017

Transformation for routine body literal is not supported.

### Severity

Low

#### Description

SnowConvert AI does not support transformation for quoted literal routine body. Use the [arrange option](../../../getting-started/running-snowconvert/conversion/postgresql-conversion-settings.md) to modify them to dollar routine body.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
CREATE OR REPLACE PROCEDURE proc1 (x varchar default 'pigs')
LANGUAGE plpgsql
AS
'
begin
    --test
   insert into tabletest2 values ($$Dianne''s pigs$$);
   x = ''Diannes pigs'';
end;
';
```

##### Generated Code:

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE proc1 (x varchar default 'pigs' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'ParameterDefaultExpr' NODE ***/!!!)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "01/13/2026",  "domain": "no-domain-provided",  "migrationid": "m7mbAfEK5XyHKQR4pRek1g==" }}'
EXECUTE AS CALLER
AS
   !!!RESOLVE EWI!!! /*** SSC-EWI-PG0017 - TRANSFORMATION FOR ROUTINE BODY LITERAL IS NOT SUPPORTED. USE ARRANGE OPTION. ***/!!!
'
begin
    --test
   insert into tabletest2 values ($$Dianne''s pigs$$);
   x = ''Diannes pigs'';
end;
';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0018

SnowConvert AI does not transform Python code, review the function body to ensure it is Snowflake ready

### Severity

Medium

#### Description

SnowConvert AI does not transform Python code in function bodies. The Python code is passed through unchanged. Review the function body to ensure it is Snowflake-ready before deployment.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE FUNCTION pymax (a integer, b integer)
  RETURNS integer
AS $$
  /*if a > b:
    return a
  return b*/
$$ LANGUAGE plpythonu;
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0018 - SNOWCONVERT AI DOES NOT TRANSFORM PYTHON CODE, REVIEW THE FUNCTION BODY TO ENSURE IT IS SNOWFLAKE READY ***/!!!
CREATE FUNCTION pymax (a integer, b integer)
RETURNS integer
LANGUAGE PYTHON
RUNTIME_VERSION = '3.13'
HANDLER = 'main_py'
AS
  $$
  /*if a > b:
  return a
  return b*/
  $$
;
```

#### Best Practices

* Review all Python code in function bodies for Snowflake compatibility.
* Use the [arrange option](../../../getting-started/running-snowconvert/conversion/postgresql-conversion-settings.md) if the function uses Python syntax that requires preprocessing.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-PG0019

SnowConvert AI does not support Python code parsing, use the arrange option to enable Python code preprocessing

### Severity

Low

#### Description

SnowConvert AI does not support parsing Python code in function bodies. When the arrange option is not activated, Python syntax may not be recognized, and the code may be commented out or left unprocessed. Use the arrange option to enable Python code preprocessing before conversion.

#### Code Example

##### Input Code:

##### PostgreSQL

```sql
 CREATE FUNCTION pymax (a integer, b integer)
  RETURNS integer
AS $$
  if a > b:
    return a
  return b
$$ LANGUAGE plpythonu;
```

##### Generated Code:

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0019 - SNOWCONVERT AI DOES NOT SUPPORT PYTHON CODE PARSING, USE THE ARRANGE OPTION TO ENABLE PYTHON CODE PREPROCESSING ***/!!!
CREATE FUNCTION pymax (a integer, b integer)
RETURNS integer
LANGUAGE PYTHON
RUNTIME_VERSION = '3.13'
HANDLER = 'main_py'
AS
  $$
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON SOME LINE OF THE SOURCE CODE. LAST MATCHING TOKEN WAS 'if' ON LINE '4' COLUMN '3'. **
--  if a > b:
--    return a
--  return b
  $$
;
```

#### Best Practices

* Enable the arrange option before conversion to preprocess Python code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - PostgreSQL-Greenplum-Netezza
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/postgresql-and-based-languages.md
section: Migrations
---

# SnowConvert AI - PostgreSQL-Greenplum-Netezza

## What is SnowConvert AI for PostgreSQL-Greenplum-Netezza?

SnowConvert AI is a software tool that understands PostgreSQL, Greenplum or Netezza scripts and converts the source code into functionally equivalent Snowflake code.

The PostgreSQL based languages currently supported by SnowConvert AI are:

* [Greenplum](https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/landing-index.html)
* [Netezza](https://www.ibm.com/docs/en/netezza)

## Conversion Types

Specifically, SnowConvert AI for PostgreSQL-Greenplum-Netezza performs the following conversions:

### PostgreSQL-Greenplum-Netezza to Snowflake SQL

SnowConvert AI understands the PostgreSQL, Greenplum or Netezza source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake.

#### Sample code

PostgreSQL basic input code:

```sql
CREATE TABLE films (
    code        char(5) CONSTRAINT firstkey PRIMARY KEY,
    title       varchar(40) NOT NULL,
    did         integer NOT NULL,
    date_prod   date,
    kind        varchar(10),
    len         interval hour to minute
);
```

Snowflake SQL output code:

```sql
CREATE TABLE films (
    code        char(5) CONSTRAINT firstkey PRIMARY KEY,
    title       varchar(40) NOT NULL,
    did         integer NOT NULL,
    date_prod   date,
    kind        varchar(10),
    len VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "postgresql",  "convertedOn": "04/24/2025",  "domain": "test" }}';
```

As you can see, most of the structure remains the same. For example, some cases require the data types to be transformed.

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI: the software that converts your PostgreSQL, Greenplum or Netezza files securely and automatically to the Snowflake cloud data platform.*
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* Parsing is an initial process by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

On the following few pages, you’ll learn more about the kind of conversions that SnowConvert AI for *PostgreSQL-Greenplum-Netezza* is capable of. If you’re ready, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - PostgreSQL-Greenplum-Netezza
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/README.md
section: Migrations
---

# SnowConvert AI - PostgreSQL-Greenplum-Netezza

This documentation serves as a comprehensive resource, providing detailed information on both PostgreSQL and the SQL languages derived from it, specifically:

1. [PostgreSQL](https://www.postgresql.org/docs/current/)
2. [Greenplum](https://techdocs.broadcom.com/us/en/vmware-tanzu/data-solutions/tanzu-greenplum/7/greenplum-database/landing-index.html)
3. [Netezza](https://www.ibm.com/docs/en/netezza)

This page provides a comprehensive reference for how SnowConvert AI translates PostgreSQL, Greenplum, and Netezza grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Preview Features Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md
section: Migrations
---

# SnowConvert AI - Preview Features Settings

## Preview Features Settings

The Preview Features Settings in SnowConvert AI allow you to enable conversions that utilize **Snowflake Public Preview features**. By entering any of the available flags in the textbox, SnowConvert AI can generate code that takes advantage of Snowflake features that are currently in public preview status, rather than being limited to only generally available (GA) Snowflake features.

> **Warning:**
>
> Preview features are Snowflake features that are available for evaluation and testing purposes but are not yet generally available (GA). They should not be used in production systems. For more details about Snowflake preview features, see the [Snowflake Preview Terms of Service](https://www.snowflake.com/legal/preview-terms-of-service/).

### Understanding Snowflake Preview Features

Snowflake Public Preview features are new capabilities that have been implemented and tested in Snowflake but may not have complete usability or corner-case handling. When you enable preview features in SnowConvert AI, the conversion process can generate code that uses these preview features when they provide better conversion results.

### How to Use Preview Features

1. **Enable in SnowConvert AI**: Enter any of the available flags in the textbox within the Preview Features Settings to allow SnowConvert AI to generate code using Snowflake preview features
2. **Enable in Snowflake**: Ensure that preview features are enabled in your Snowflake account using system functions like `SYSTEM_ENABLE_PREVIEW_ACCESS`
3. **Test thoroughly**: Always test the converted code in a non-production Snowflake environment when using preview features

### Important Considerations

* **Snowflake account compatibility**: Your Snowflake account must have preview features enabled to use the generated code
* **Feature stability**: Snowflake preview features may change behavior or be removed in future Snowflake releases
* **Production restrictions**: Code using preview features should not be deployed to production Snowflake environments
* **Documentation**: SnowConvert AI may add comments indicating when preview features are being used

### Accessing Preview Features Settings

To configure preview features in SnowConvert AI:

1. Navigate to the **Conversion Settings** section in the SnowConvert AI interface
2. Select the **Preview Features** tab or section
3. Enter any of the available flags in the textbox to allow SnowConvert AI to use Snowflake preview features. Please be sure that each flag is spelled correctly; if any flag is misspelled, all flags will be ignored during conversion.
4. Proceed with conversion - SnowConvert AI will automatically use preview features when they improve conversion results.

### Using Preview Features from CLI

When using SnowConvert AI from the command line interface (CLI), you can enable preview features by using the `--previewFlags` argument. The value must be wrapped with quotes and contain the flags in the following format:

```bash
--previewFlags "\"--enableFlag1 --enableFlag2\""
```

**Example:**

```bash
snowct [command] --previewFlags "\"--enableFlag\"" [other arguments]
```

For multiple flags:

```bash
snowct [command] --previewFlags "\"--enableFlag --enableAnotherFlag\"" [other arguments]
```

### Best Practices

* **Understand implications**: Ensure you understand that the converted code will require Snowflake preview features to be enabled

> **Note:**
>
> For the most current information about which Snowflake preview features SnowConvert AI can utilize, consult the latest SnowConvert AI release notes or contact support.

## Available Preview Features

The following section lists the preview feature flags that can be entered in the textbox to enable specific Snowflake preview features during conversion. Each flag enables SnowConvert AI to use particular Snowflake preview capabilities.

### **`--enableSnowScriptUDF`**

*Deprecated since version 1.19.7 This feature is already in General Availability*

This option enables SnowConvert AI to translate User-Defined Functions, taking advantage of the SnowScript UDF Preview Feature. Learn more from the documentation here: [Snowflake Scripting UDFs](../../../../../../developer-guide/udf/sql/udf-sql-procedural-functions.md).

Available only for the following languages:

* Sql Server.
* Azure Synapse.

### **`--enableFormatSpecifiersPreview`**

This option enables SnowConvert AI to utilize **new Snowflake date/time format specifiers** that are currently in preview. These improvements in Snowflake’s formatting capabilities provide better translation accuracy for SQL Server date/time formatting functions.

> **Note:**
>
> **Numeric format specifiers are now GA.** Advanced numeric format specifiers (`P`, `N`, custom `%` patterns, and `TM9` grouping) are now translated by default without requiring this flag. The translations in the Numeric format specifiers (now default) section below are always active.

**What This Flag Enables:**

This preview feature introduces new date/time format elements in **Snowflake’s TO_CHAR function**, allowing SnowConvert AI to generate more accurate translations of SQL Server `FORMAT()` calls:

1. **New Date/Time Format Elements** - Non-padded format specifiers (Y, MO, D, H24, H12, ME, S, P)

These are **Snowflake improvements**, not just SnowConvert AI translation features. Your Snowflake account must have these preview features enabled to execute the converted date/time code.

**Note:** To use date/time code generated with this flag, you must request access to these preview features in your Snowflake account. Submit your request using this form: [Snowflake Format Improvements Preview Access Request](https://docs.google.com/forms/u/0/d/1-aIsixSftqhqjkpgBHAzcbSi2mk7s71TMQsRdOBppFw/viewform?edit_requested=true)

**Date Format Specifiers (Preview)**

This flag enables SnowConvert AI to use new Snowflake date/time format elements that support non-padded output, providing accurate translations of SQL Server’s custom single-character format specifiers.

**New Snowflake Format Elements:**

These format elements are **new in Snowflake** (in preview) and enable better migration from SQL Server:

* `Y` - Year last 2 digits without padding (e.g., `25` from 2025, `5` from 2005)
* `MO` - Month without padding (e.g., `3` for March)
* `D` - Day without padding (e.g., `5` for the 5th day)
* `H24` - Hour in 24-hour format without padding (e.g., `14` for 2 PM)
* `H12` - Hour in 12-hour format without padding (e.g., `2` for 2 PM)
* `ME` - Minute without padding (e.g., `7` for 07 minutes)
* `S` - Second without padding (e.g., `3` for 03 seconds)
* `P` - Single-character AM/PM indicator (e.g., `A` for AM, `P` for PM)

**Translation Examples:**

The following examples show how SQL Server `FORMAT()` patterns are translated to Snowflake using these new format elements:

| SQL Server Code | SQL Server Output | Snowflake Translation | Snowflake Output |
| --- | --- | --- | --- |
| `FORMAT(CAST('2025-03-05' AS DATE), '%M')` | `3` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05'), 'MO')` | `3` |
| `FORMAT(CAST('2025-03-05' AS DATE), '%d')` | `5` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05'), 'D')` | `5` |
| `FORMAT(CAST('2025-03-05' AS DATE), '%y')` | `25` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05'), 'Y')` | `25` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), '%H')` | `14` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'H24')` | `14` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), '%h')` | `2` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'H12')` | `2` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), '%m')` | `7` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'ME')` | `7` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), '%s')` | `3` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'S')` | `3` |

**Combined Format Patterns:**

| SQL Server Code | SQL Server Output | Snowflake Translation | Snowflake Output |
| --- | --- | --- | --- |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), 'M/d/yyyy H:m:s')` | `3/5/2025 14:7:3` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'MO/D/YYYY H24:ME:S')` | `3/5/2025 14:7:3` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), 'M/d/yyyy h:m:s tt')` | `3/5/2025 2:7:3 PM` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'MO/D/YYYY H12:ME:S PM')` | `3/5/2025 2:7:3 PM` |
| `FORMAT(CAST('2025-03-05 14:07:03' AS DATETIME), 'h:m:s t')` | `2:7:3 P` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05 14:07:03'), 'H12:ME:S P')` | `2:7:3 P` |
| `FORMAT(CAST('2025-03-05' AS DATE), 'M/d/%y')` | `3/5/25` | `TO_CHAR(TO_TIMESTAMP_NTZ('2025-03-05'), 'MO/D/Y')` | `3/5/25` |

**Key Points:**

* **MO** replaces SQL Server’s `%M` (uppercase M = month)
* **ME** replaces SQL Server’s `%m` (lowercase m = minute)
* **H24** replaces SQL Server’s `%H` (uppercase H = 24-hour)
* **H12** replaces SQL Server’s `%H` (lowercase h = 12-hour)
* **P** provides single-character AM/PM output (A or P)
* All formats maintain SQL Server’s behavior of no leading zeros

#### Numeric format specifiers (now default)

The following numeric format translations are now applied by default without requiring the `--enableFormatSpecifiersPreview` flag. SnowConvert AI translates SQL Server’s percentage and number formatting patterns using Snowflake’s numeric format capabilities.

**Percentage Formats (P and %):**

SQL Server’s `P` format and custom `%` patterns automatically multiply values by 100 and add percentage symbols. The Snowflake translations use fixed-point formats with `%` symbols:

| SQL Server Code | SQL Server Output | Snowflake Translation | Snowflake Output |
| --- | --- | --- | --- |
| `FORMAT(0.1234, 'P')` | `12.34 %` | `TO_CHAR(0.1234, 'FM9,999,999,999,999.00%')` | `12.34 %` |
| `FORMAT(0.1234, 'P0')` | `12 %` | `TO_CHAR(0.1234, 'FM9,999,999,999,999%')` | `12 %` |
| `FORMAT(0.1234, 'P2')` | `12.34 %` | `TO_CHAR(0.1234, 'FM9,999,999,999,999.00%')` | `12.34 %` |
| `FORMAT(0.1234, '0.00%')` | `12.34%` | `TO_CHAR(0.1234, 'FM9999999999999.00%')` | `12.34%` |
| `FORMAT(0.1234, '#,#.00%')` | `12.34%` | `TO_CHAR(0.1234, 'FM9,999,999,999,999.00%')` | `12.34%` |
| `FORMAT(0.1234, '%0.00')` | `%12.34` | `TO_CHAR(0.1234, '%FM9999999999999.00')` | `%12.34` |

**Number Formats (N):**

SQL Server’s `N` format provides thousand separators and controlled decimal precision. The Snowflake translations use the enhanced `TM9` format element with arguments:

| SQL Server Code | SQL Server Output | Snowflake Translation | Snowflake Output |
| --- | --- | --- | --- |
| `FORMAT(1234567.89, 'N')` | `1,234,567.89` | `TO_CHAR(1234567.89, 'TM9(2,3)')` | `1,234,567.89` |
| `FORMAT(1234567.89, 'N0')` | `1,234,568` | `TO_CHAR(1234567.89, 'TM9(0,3)')` | `1,234,568` |
| `FORMAT(1234567.89, 'N1')` | `1,234,567.9` | `TO_CHAR(1234567.89, 'TM9(1,3)')` | `1,234,567.9` |
| `FORMAT(1234567.89, 'N4')` | `1,234,567.8900` | `TO_CHAR(1234567.89, 'TM9(4,3)')` | `1,234,567.8900` |
| `FORMAT(-1234567.89, 'N2')` | `-1,234,567.89` | `TO_CHAR(-1234567.89, 'TM9(2,3)')` | `-1,234,567.89` |

**TM9 Format Element Enhancement (Now Default)**

The existing Snowflake `TM9` format element has been enhanced to accept two optional arguments for better control over numeric formatting. This is a **Snowflake improvement** that enables better translations from SQL Server. These translations are now applied by default.

**Syntax:** `TM9(fractional_digits, grouping_size)`

**Translation Examples:**

| SQL Server Code | Snowflake Translation | Input Value | Snowflake Output | Description |
| --- | --- | --- | --- | --- |
| `FORMAT(x, 'N2')` | `TO_CHAR(x, 'TM9(2,3)')` | `1234.56789` | `1,234.57` | 2 decimals with grouping |
| `FORMAT(x, 'N0')` | `TO_CHAR(x, 'TM9(0,3)')` | `1234.56789` | `1,235` | No decimals, rounded |
| `FORMAT(x, 'N4')` | `TO_CHAR(x, 'TM9(4,3)')` | `1234567.89` | `1,234,567.8900` | 4 decimals with grouping |
| *(Direct usage)* | `TO_CHAR(x, 'TM9(ALL,3)')` | `1234.56789` | `1,234.56789` | All decimals, grouped |
| *(Direct usage)* | `TO_CHAR(x, 'TM9(3)')` | `1234.56789` | `1234.568` | 3 decimals, no grouping |
| *(Direct usage)* | `TO_CHAR(x, 'TM9')` | `1234.56789` | `1234.56789` | All decimals, no grouping (default) |

**Behavior Details:**

```sql
-- Snowflake examples with TM9 enhancement
SELECT
    TO_CHAR(1234.56789, 'TM9')           AS default_format,    -- 1234.56789
    TO_CHAR(1234.56789, 'TM9(2)')        AS two_decimals,      -- 1234.57
    TO_CHAR(1234.56789, 'TM9(0)')        AS integer_only,      -- 1235
    TO_CHAR(1234.56789, 'TM9(ALL, 3)')   AS all_with_group,    -- 1,234.56789
    TO_CHAR(1234567.89, 'TM9(3, 3)')     AS three_with_group,  -- 1,234,567.890
    TO_CHAR(-1234567.89, 'TM9(2, 3)')    AS negative_value;    -- -1,234,567.89
```

**Available for:** SQL Server only

### **`--UseIntervalDatatype`**

This option enables SnowConvert AI to translate INTERVAL data types to **native Snowflake INTERVAL types** (`INTERVAL YEAR TO MONTH` and `INTERVAL DAY TO SECOND`) instead of converting them to `VARCHAR`. This takes advantage of the Snowflake INTERVAL data type that is currently in public preview. Learn more from the documentation here: [Snowflake INTERVAL Data Type](../../../../../../sql-reference/data-types-datetime.md).

For a comprehensive reference on how interval transformations work across all languages, see the [Interval Data Types](../../../../translation-references/general/interval-data-types.md) translation reference.

**What This Flag Enables:**

1. **Native INTERVAL column types** - INTERVAL columns in `CREATE TABLE` are preserved as Snowflake INTERVAL types instead of being converted to `VARCHAR(30)`
2. **Interval literal normalization** - Dialect-specific interval literal syntax is normalized to Snowflake-compatible INTERVAL literals
3. **Interval arithmetic preservation** - Datetime subtraction expressions that produce intervals are transformed to use Snowflake’s native interval output
4. **CAST to INTERVAL** - CAST expressions targeting interval types are preserved

**Translation Examples:**

| Source SQL | Without Flag | With `--UseIntervalDatatype` |
| --- | --- | --- |
| `col1 INTERVAL DAY TO SECOND` (Oracle, Teradata) | `col1 VARCHAR(30)` + SSC-EWI-0036 | `col1 INTERVAL DAY TO SECOND` |
| `col1 INTERVAL` (BigQuery, PostgreSQL) | `col1 VARCHAR(30)` + SSC-EWI-0036 | `col1 INTERVAL DAY TO SECOND` + SSC-FDM-0042 |
| `SELECT INTERVAL '5-10' YEAR TO MONTH` | `SELECT '5-10'` (string literal) | `SELECT INTERVAL '5-10' YEAR TO MONTH` |
| `SELECT (ts1 - ts2) DAY TO SECOND` | Not transformed to interval | `SELECT ts1 - ts2 INTERVAL DAY TO SECOND` |

**Known Limitations:**

When this flag is enabled, SnowConvert AI emits warnings for scenarios where Snowflake does not yet fully support INTERVAL:

* **Dynamic Tables** ([SSC-EWI-0118](../../../technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support INTERVAL columns in Dynamic Tables
* **UDFs and Snowflake Scripting** ([SSC-EWI-0117](../../../technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support the INTERVAL data type in UDF/procedure parameters, return types, or variable declarations
* **Semi-structured types** ([SSC-EWI-0116](../../../technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)): Snowflake does not support INTERVAL values inside VARIANT, ARRAY, or other semi-structured type columns
* **Qualifier normalization** ([SSC-FDM-0042](../../../technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md)): For languages with unqualified or mixed INTERVAL types, the qualifier is changed to `DAY TO SECOND` because Snowflake does not support mixing year-to-month and day-to-second time parts

**Available for:** All supported languages (Teradata, Oracle, SQL Server, Azure Synapse, BigQuery, Hive, Spark, Databricks, PostgreSQL, Greenplum, Netezza, Redshift, Vertica, DB2)

---
title: SnowConvert AI - Recent Release Notes
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/release-notes/release-notes/README.md
section: Migrations
---

# SnowConvert AI - Recent Release Notes

## Version 2.20.0 (Apr 13, 2026)

### New Features

#### General

* `USER-DEFINED TYPES` translation to Snowflake-native `USER-DEFINED TYPES`. Enabled for:

  + IBM DB2
  + Oracle
  + PostgreSQL
  + SQLServer
  + Sybase IQ
  + Teradata
* `INTERVAL` datatype translation to Snowflake-native `INTERVAL` datatype (PuPr). Enabled for:

  + Amazon RedShift
  + Google BigQuery
  + IBM Netezza
  + Oracle
  + PostgreSQL/Greenplum
  + Spark/Hive SQL
  + Teradata
* Added Snowflake account-level feature flags.
* Added mechanism to set default source connection at project level.

#### Informatica

* Added Decision workflow task translation.
* Added Assignment workflow task translation.

#### Teradata

* Added Teradata support for code extraction.
* Added support for code add for Teradata projects.
* Added column `FORMAT` attribute support in DML statements.
* Added `SIGNAL SQLSTATE` to `RAISE` transformation with FDM for unsupported `SET` items.

### Improvements

#### SQL Server

* Added `FORMAT` date specifiers `dddd`, `F`–`FFFFFFF`, `z` with FDM markers.

#### Teradata

* Replaced `SSC-EWI-TD0031` with `RTRIM` fix for `LIKE` on `CHAR` columns.
* Moved `USING` clause to `EXECUTE IMMEDIATE` for `PREPARE` with variable markers.
* Added Teradata driver upload validation against approved versions for AI Verification.

### Bug Fixes

#### Informatica

* Fixed duplicate key error with partition-specific session attributes.

#### Oracle

* Fixed `TRIM`/`LTRIM`/`RTRIM` on `RAW` (`BINARY`) columns.

#### SSIS

* Fixed Execute SQL task not being converted when invoking stored procedure.
* Fixed column naming with single or double quotes in `SQLCommand` of OLEDB Source generating runtime errors.
* Fixed parsing error EWIs not being generated on SQL output code for tasks executing SQL.

#### Teradata

* Fixed `to_binary()` incorrectly throwing `SSC-EWI-0073`.

#### General

* Fixed Migration Skill promo banner layout on the Home page.
* Fixed `NullReferenceException` from engine execution with actionable error message (`CVT0011`).
* Fixed wrong counting in selection summary in AI code conversion.
* Fixed CSnake corrupted environment when pip is not installed.
* Fixed routing to code conversion from an unavailable page.

## Version 2.19.0 (Apr 08, 2026)

### New Features

#### Informatica

* Added `REG_REPLACE` FDM, `ERROR`, and `REG_EXTRACT` expression function migration support.
* Added `SET_DATE_PART` translation to Snowflake `DATEADD`.

#### Teradata

* Added Teradata connection support.
* Added Teradata driver upload functionality for two-sided AI Verification.

#### General

* Integrated deployment reports into the code deployment workflow.
* Added Migration Skill promotional banner to the Home page.
* Added support for user-defined alias type extraction.

### Improvements

#### SQL Server

* Added dedicated EWI for `SAVE TRANSACTION` statement.

#### Teradata

* Improved `SSC-FDM-TD0013` to avoid false positives in certain situations.

#### General

* Added path validation before conversion.
* Suppressed `FDM-0007` for `DROP IF EXISTS` statements in lineage phase.

### Bug Fixes

#### SSIS

* Fixed variable redeclarations inside of containers.
* Fixed plus operator for `VARBINARY` concatenation.

#### General

* Fixed `CONCAT_WS` to wrap value arguments in `ARRAY_CONSTRUCT`.
* Fixed AI code conversion reporting different numbers in results.
* Fixed AI Verification showing default testing mode instead of two-sided mode when status is pending.

## Version 2.18.0 (Mar 30, 2026)

### New Features

#### Informatica

* Added Stored Procedure Connected Normal mode translation.

#### PostgreSQL

* Added transformation support for the `QUOTE_LITERAL` function.

#### RedShift

* Added support for Iceberg migrations.
* Enabled Cloud Data Migration for Redshift.
* Added support for `CASCADE DROP` transformation.

#### SSIS

* Added conversion of `UserName`, `PackageName`, `PackageID`, and `ExecutionInstanceGUID` SSIS System Variables.
* Enabled SSIS Data Flow Simplification as a GA feature.

#### Teradata

* Enabled AI Conversion for Teradata.
* Added `CREATE DATABASE` translation to Snowflake.

#### General

* Added Cloud Data Validation foundation layer.
* Added CSV report writer for code extraction.
* Added extraction log writer for code extraction.
* Integrated extraction reports into the code extraction workflow.
* Added opt-in support for cloud deployment status tracking.

### Improvements

#### Informatica

* Added Functional Difference Messages (FDMs) to Informatica PowerCenter reusable transformations.

#### General

* Added autoplay to the carousel component.
* Improved uploaded drivers retrieval for AI Verification.
* Enabled ReadyToRun pre-compilation for improved startup performance.
* Improved AI code conversion error messages when Cortex/LLM access is denied.
* Improved two-sided AI Verification environment handling and navigation stability.
* Improved performance by using local credentials first to avoid unnecessary Snowflake connections.

### Bug Fixes

#### BigQuery

* Fixed false positive `EWI-0073` in BigQuery transaction replacers.

#### SQL Server

* Fixed `CONVERT` with `GETDATE` in `DEFAULT` expressions.
* Fixed missing comma before `CONSTRAINT` in `CREATE TABLE`.

#### SSIS

* Fixed identifier sanitization for Snowflake root task names.
* Fixed SSIS `ExecutePackage` connection manager-based references.

#### General

* Fixed telemetry and logs directory permissions.
* Fixed error code when connection test fails.
* Fixed default object type from `views` to `view` in selector report creation.
* Fixed AI Conversion “not processed” typo in results display.

## Version 2.17.0 (Mar 25, 2026)

### New Features

#### Informatica

* Enabled Informatica PowerCenter Replatform support in the UI.

#### Oracle

* Added `CONNECT_BY_ROOT` pass-through support.

#### PostgreSQL

* Added support for the `EXTRACT` built-in function.

#### RedShift

* Added support for the `EXTRACT` built-in function.
* Added support for the `HLL` aggregate function.

#### SQL Server

* Added `OBJECTPROPERTY()` function conversion support.
* Added `UNPIVOT` operator translation support.
* Added support for `GOTO`/`LABEL` statement translation to nested procedures.
* Added SQL Server Agent Job closure support with ownership, tagging, notification, and multi-schedule fan-out.
* Added `CREATE SYNONYM` statement support.

#### SSIS

* Added Microsoft Cache Transform (`Microsoft.Cache`) translation to dbt.
* Added `--simplify-ssis-dataflow` option for SSIS conversion.

#### General

* Added Teradata driver upload functionality for two-sided AI Verification.
* Added carousel with CLI and Cloud Data Migration information.
* Enabled Cloud Data Migration by default.
* Added ability to deploy objects when status is verified by user.

### Improvements

#### SQL Server

* Added defensive guards to `GOTO`/`LABEL` decomposition logic.
* Added `WAITFOR TIME` commenting with Functional Difference Message (FDM).
* Added `SSC-FDM-TS0055` for database-scoped `CREATE USER`.
* Added FDM for global temporary tables.
* Changed `DEALLOCATE` to use FDM instead of EWI.

#### SSIS

* Normalized reusability tracker keys and conditionally suppressed `SSC-EWI-SSIS0008`.
* Simplified SSIS FileSystem Task SQL output.

#### General

* Enhanced AI Verification with estimation formulas retrieval.
* Added validation for two-sided AI Verification source files (UTF-8 no BOM, LF line endings).
* Improved engine crash exception propagation to provide clearer error messages.
* Added TaskManager crash diagnostics for Windows `CVT0008` errors.
* Added filename generation in the element inventory.
* Removed `VARCHAR` length limits from control variables schema.

### Bug Fixes

#### RedShift

* Fixed database not being added to qualified object names on extraction.

#### General

* Fixed code conversion failing when using offline mode.
* Fixed consolidated source not being preserved across repeated code extractions.
* Fixed error message to show supported languages for invalid source language errors.
* Fixed rendering of icons for UTF-8 consoles.

## Version 2.16.1 (Mar 20, 2026)

### New Features

#### Informatica

* Added support for the `SUM` aggregate function translation to Snowflake.
* Added support for the `MIN` aggregate function translation to Snowflake.
* Added support for `GET_DATE_PART` to Snowflake `DATE_PART` translation.
* Added support for `IS_NUMBER` function translation to Snowflake.
* Added support for `IS_DATE` inline translation to Snowflake `TRY_TO_DATE`.
* Added support for `TO_DATE` function translation to Snowflake.
* Added Source Qualifier Pre/Post SQL, session overrides, and variable migration support.
* Added Sorter transformation to dbt translation with `ORDER BY`, `DISTINCT`, and `LOWER` support.
* Added Sequence Generator transformation translation to `ROW_NUMBER()`.

#### SQL Server

* Added SQL Server Agent Job support for CRON schedule building, `sp_send_dbmail`, and `sp_add_category` translation.
* Added `SCOPE_IDENTITY()` transformation to Snowflake time-travel queries.
* Added support for `DROP PROCEDURE` statement with type signatures.

#### SSIS

* Added ADO NET Source (`Microsoft.DataReaderSourceAdapter`) translation to dbt.

#### PowerBI

* Added implementation flag support on entity connector types.

#### General

* Added Cloud Data Exchange Worker for data migration workflows.
* Added infrastructure diagnostics for Cloud Data Migration.

### Improvements

#### Hive

* Included table naming convention support.

#### Oracle

* Added `INSTR` negative position workaround for `position = -1`.

#### SQL Server

* Added `CONVERT` with style mapping to `TO_DATE`/`TO_TIMESTAMP`.
* Added `CREATE STATISTICS` commenting with `SSC-FDM-TS0048` Functional Difference Message (FDM).
* Replaced `EWI-0035` with specific FDMs for `CHECK` constraint handling.
* Removed partition placement `ON` clause from constraints in `CREATE TABLE`.
* Added new Error Warning Information (EWI) for `OBJECT_SCHEMA_NAME`.
* Updated `SSC-FDM-TS0035` trigger FDM message.

#### SSIS

* Improved per-executable resilience, pipeline column enrichment, and GUID resolution.
* Added package name as a prefix of the Snowflake task names.

#### Teradata

* Included `FORMAT` literal in `TD0040` message.

#### General

* Deprecated the selector for code deployment, restricting it to SQL Server and Redshift only.
* Improved conversion performance for code-only projects by disabling registry generation.
* Renamed `legacy_structure` project type to `Full` and `Code` for clarity.
* Improved assessment cleanup by removing engine output after execution.
* Removed legacy `converted` folder references from the project structure.
* Added AI verification status column with parent tooltips in the UI.
* Cloud Data Migration now uses images from the SPCS Image Registry directly.

### Bug Fixes

#### Oracle

* Fixed a variable named `current_date` being incorrectly converted to a function call.

#### RedShift

* Fixed database not being recognized in Redshift code units.
* Fixed EWI on out cursor transformation.

#### SQL Server

* Fixed `FOR XML PATH('')` transformation dropping expressions.

#### SSIS

* Fixed version upgrade emitting malformed XML.
* Fixed nested Jinja in dbt `source()` calls for VariableTable access mode.

#### General

* Fixed language flags to avoid repeated short names.
* Fixed code conversion input and output path resolution in the UI.
* Fixed AI code conversion on Windows by preparing report paths before zipping.
* Fixed AI Conversion status display and progress tracking.
* Fixed ArrangeLog Logger not being disposed, causing Windows file locking.

## Version 2.15.1 (Mar 13, 2026)

### New Features

#### General

* Added an assessment report page with a code tag component for viewing conversion results.

### Improvements

#### General

* Improved code deployment to include dependencies when a `WHERE` clause filter is defined.
* Scoped the new project structure and registry to SQL Server and Redshift platforms.
* Added source identifier tracking to schema objects during extraction.
* Updated application update notifications to display the download URL directly.

### Bug Fixes

#### General

* Fixed broken Data Validation endpoints.
* Fixed a terminal output issue where progress bars could collide with other output.

## Version 2.15.0 (Mar 13, 2026)

### New Features

#### Hive

* Added support for the `GET_JSON_OBJECT` function.

#### SQL Server

* Added non-ASCII identifier double-quoting transformation for Transact-SQL.

#### Teradata

* Added support for Teradata file extraction.

#### SSIS

* Added conversion support for `Microsoft.CharacterMap` transformations.
* Added conversion of `sp_add_jobstep` for SQL Server Agent Job orchestration.

#### Informatica

* Added conversion of the `TRUNC` function to Snowflake `TRUNC`.
* Added conversion of the `DECODE` function to Snowflake.
* Added conversion of the `ADD_TO_DATE` function to Snowflake `DATEADD`.
* Added conversion of the `LTRIM` function to Snowflake.
* Added conversion of the `RTRIM` function to Snowflake.
* Added conversion of the `MAX` aggregate function to Snowflake.

#### General

* Added support for iteratively adding code units to a project.
* Added local report generation for Cloud Data Migration.
* Integrated Testing Orchestration (seed, capture, and validate) into the desktop application.

### Improvements

#### SQL Server

* Added a Functional Difference Message (FDM) for `SET IDENTITY_INSERT` statements, which are now commented out during conversion.

#### PowerBI

* Improved Power BI version integration and applied general cleanup.

#### General

* Integrated the resync command with the conversion engine for re-scanning modified converted files.
* Integrated resync into the code accept workflow to keep issue metadata in sync.
* Enhanced the selection summary panel layout for smaller window sizes.
* Disabled connections with invalid authenticators in Snowflake.
* Extended the close-app guard to cover active Code Extraction jobs.
* Added application type information to CSV reports.
* Improved IPC channel error handling.

### Bug Fixes

#### SQL Server

* Fixed an issue where semicolons in SQL Server passwords caused connection failures.
* Fixed alias duplication in `UPDATE...FROM` translation.
* Fixed `XML.value()` instance parsing and absolute-path handling.

#### SSIS

* Fixed `FuzzyLookup` to only propagate passthrough columns.
* Fixed `ExcelSource` not displaying Error Warning Information (EWI) markers in dbt models.

#### Informatica

* Fixed unresolved EWI description placeholders for Mapplet subtypes.

#### PowerBI

* Fixed SQL extraction from Power Query connections when schema and object names use double quotes without an ending semicolon.

#### General

* Fixed account locator extraction for AI Code Conversion when using Key Pair authentication.
* Fixed an issue where the Key Pair passphrase argument was incorrectly omitted when empty or null.
* Fixed an issue where code formatting could cause errors.
* Fixed a race condition that could affect job initialization during concurrent operations.
* Applied general stability fixes.

## Version 2.14.0 (Mar 06, 2026)

### New Features

#### SSIS

* Added conversion support for `ExcelDestination` components to Snowflake dbt.
* Added conversion support for SQL Server Agent Job `TSQL` command references.
* Added a catch-all replacer for unsupported SQL Server Agent Job procedures.
* Added conversion of `sp_delete_job` to `DROP TASK IF EXISTS` for SQL Server Agent Jobs.
* Added conversion of `sp_update_job` to `ALTER TASK ... SUSPEND/RESUME` for SQL Server Agent Jobs.
* Added conversion support for `Microsoft.FileSystemTask` transformations.
* Added conversion support for `Microsoft.FuzzyLookup` transformations.
* Added support for SSIS `Project.params` conversion.

#### SQL Server

* Added conversion of `WAITFOR DELAY` to `CALL SYSTEM$WAIT`.

#### General

* Added Single Sign-On (SSO) support for executing AI Verification jobs.
* Added Time-based One-Time Password (TOTP) Multi-Factor Authentication (MFA) support.
* Added a resync command for re-scanning modified converted files to update issue metadata.
* Added estimation report generation support.
* Added the last job status per code unit on the selection page.

### Improvements

#### SSIS

* Improved support for older SSIS package formats with automatic version upgrade.

#### RedShift

* Improved the layout of the Redshift connection form.

#### General

* Improved specification and migration report artifact downloads to include terminal job status.
* Extended the close-app guard to cover data migration, data validation, and deployment jobs.
* Improved connection error messages with better formatting.
* Improved log and report file path handling.
* Added individual object listing in the code deploy success summary.
* Added validation for AI Verification job files in the project directory.
* Added application type information to the assessment report.
* Added a flag to skip split operations during code extraction.

### Bug Fixes

#### SQL Server

* Fixed the double-dot identifier transformation to correctly reference the latest active database.
* Fixed `BEGIN TRAN` syntax conversion to Snowflake.

#### SSIS

* Fixed handling of empty XML elements and hardened upgrade error handling.

#### RedShift

* Fixed Redshift code unit name matching in AI Verification.

#### General

* Fixed Data Validation Framework (DVF) progress corruption.

## Version 2.13.0 (Mar 03, 2026)

### New Features

#### RedShift

* Added support for RedShift function extraction.

#### SSIS

* Added support for converting `sp_stop_job` to `ALTER TASK ... SUSPEND` for SQL Server Agent Job orchestration.
* Added an orchestration file generator for SQL Server Agent Job conversion.

#### PowerBI

* Added `QUOTED_IDENTIFIERS_IGNORE_CASE` validation for column renaming support in Power BI conversions.

#### General

* Added a `CodeSyncService` and `IssuesService` for synchronizing issue metadata (Error Warning Information, Functional Difference Messages, Out of Scope, Performance Review) in the code unit registry when converted files are modified.
* Added connection timeout support for Snowflake credentials with authentication-specific defaults.

### Improvements

#### General

* Added a terminal link and icon to the SnowConvert AI home page for quick access to the command-line interface.
* Updated `AgentJobOptions` to support custom database usage and extra file dependencies.
* AI Verification files are now persisted in the project directory instead of temporary folders.
* Improved error handling for source connection resolution.

### Bug Fixes

#### General

* Fixed an issue where the application did not handle .NET backend process crashes gracefully.

## Version 2.12.0 (Feb 27, 2026)

### New Features

#### General

* Added a `SessionManagerService` for session lifecycle management.
* Moved the license file activation from the Login page to the Help menu.

### Improvements

#### General

* Implemented mode-aware documentation links in update notification banners.

### Bug Fixes

#### General

* Fixed an issue where missing C# dependencies caused build errors.
* Fixed a crash on the progress page when step titles were unmapped, by filtering out steps without display strings.
* Fixed slow loading for AI Verification jobs.

## Version 2.11.0 (Feb 27, 2026)

### New Features

#### General

* Implemented state management for code deployment.
* Added a Testing Orchestration engine with CSnakes Python interop.
* Implemented file size validation for AI processing.
* Added T-SQL to Snowflake TDD workflow skills for AI-assisted translation development.

#### Hive

* Added support for the Hive `InStr` function.
* Added support for the Hive `COLLECT_SET` function.

#### Netezza, SQL Server, Sybase, Teradata

* Added a `disable-use-database-generation` flag for schemas.

#### PostgreSQL

* Added PostgreSQL `RETURN QUERY` transformation support.
* Added preprocess and partial transformation support for Python functions in PostgreSQL.

#### RedShift

* Added support for transforming cursor out parameters in RedShift.

#### SSIS

* Added multi-target column unpivot translation support for SSIS.
* Added Microsoft.Pivot transformation support for dbt translation.
* Added Early Warning Indicators (EWI) for unreviewed SSIS expression functions.
* Introduced Agent Job 2.1 with base replacer and EWI/FDM codes for SSIS.

### Improvements

#### General

* Enhanced error handling in AI verification results by adding `lastError` to the job state.
* Enhanced the AI Verification Orchestrator to handle transient PENDING statuses.
* Enhanced AI verification job status handling by adding a ‘FAILED’ state and improving error logging.

#### PowerBI

* Removed the renaming of calculated columns in Power BI conversions.
* Removed table aliases when generating column renames for Power BI.

### Fixes

#### General

* Fixed a bug related to Extract Code and Code Unit Registry.

#### RedShift

* Fixed an issue with data migration navigation.

#### SSIS

* Fixed FlatFileSource naming to use the component path for unique identifiers.

## Version 2.10.0 (Feb 24, 2026)

### New Features

#### Hive

* Added support for the Hive `regexp_extract` built-in function.
* Added a replacer for the Hive `ISNULL` function, incorporating validation best practices.

#### SSIS

* Implemented conversion for SSIS Excel Source components to Snowflake dbt.

#### General

* Added an option to disable the generation of `USE DATABASE` statements.

### Improvements

#### SQL Server

* Optimized Transact-SQL conversions by using the native `parse_json()` function instead of a custom UDF helper.

#### SSIS

* Added sanitization of identifiers for the flat file source translator.
* Improved the SSIS TDD skill based on insights from Microsoft Pivot migration retrospectives.

#### General

* Added test generation capabilities guard and `TST0003` error handling.
* Renamed the Data Validation CSnakes bridge to `_dvf_csnakes_bridge.py`.

### Bug Fixes

#### Power BI

* Fixed an issue where query values were lost when migrating multiple Power BI Template (`.pbit`) files simultaneously.

## Version 2.9.0 (Feb 23, 2026)

### New Features

#### Hive

* Added support for converting the `FROM_UTC_TIMESTAMP` function from Hive to Snowflake.

#### SSIS

* Added support for SSIS Aggregate transformation in dbt translation.
* Added an `ssis-tdd` skill to support the SSIS-to-Snowflake Test-Driven Development (TDD) workflow.
* Added support for translating `Microsoft.Sort` transformations from SSIS to Snowflake.

#### Power BI

* Added an optional `ROLE` parameter for Power BI conversions.

#### General

* Enhanced AI Verification by adding support for Key Pair Authentication.
* Added registry generation to the Extract command.
* Integrated with the `scai test seed` test generation library.
* Added `CUBE` to the list of supported ANSI built-in functions.

### Improvements

#### General

* Enhanced AI conversion logs with improved detail and readability.
* Improved the display of friendly error messages and server-side error details on the AI code conversion error page.

### Bug Fixes

#### SSIS

* Fixed dot-delimited table names being incorrectly split during dbt model generation for SSIS conversions.
* Fixed the `ExcelSource` conversion to the previous Snowflake dbt TDD implementation.

#### General

* Fixed an issue with AI Verification where double-nested source directories were not resolved correctly.
* Fixed an issue with AI Verification two-sided mode serialization.
* Fixed an issue with Data Validation and the minimum Python environment version installation.

## Version 2.8.0 (Feb 19, 2026)

### New Features

#### Hive

* Added support for converting the `FROM_UTC_TIMESTAMP` function from Hive to Snowflake.

#### SSIS

* Added support for SSIS Aggregate transformation in dbt translation.
* Added an SSIS-TDD skill to support the SSIS-to-Snowflake Test-Driven Development workflow.
* Added support for Microsoft.Sort operations in SSIS-to-Snowflake conversions.

#### PowerBI

* Added an optional `ROLE` parameter for Power BI conversions.

#### General

* Introduced an `-o` option for ETL AI verification.
* Implemented Snowflake credential override options and improved authentication error messages.
* Added `CUBE` to the list of supported ANSI built-in functions.

### Improvements

#### General

* Enhanced internal data transfer objects, input building, and progress handling.
* Updated the version retrieval script to save results directly to a file.
* Unified AI Verification with the Standard Job Orchestrator for streamlined operations.
* Refactored job orchestration to support asynchronous job initiation.
* Updated the ‘limit reached’ message for improved clarity.

## Version 2.7.0 (Feb 13, 2026)

### New Features

#### General

* Added Snow-to-Snow support in Data Validation.
* Added AI verification support for `SEQUENCE` object types.
* Added support for a new AI code conversion contract.

### Improvements

#### SSIS

* Extended `IsString` functionality to detect strings through parentheses, function calls, casts, and concatenation (`+`) expressions.

#### Informatica

* Improved the conversion of the `INSTR` built-in function in expressions.

#### General

* Implemented code unit deduplication in the Job Status Mapper to prevent duplicate entries.
* Added a link to the assessment report and improved the ordering of cards on the project page.
* Enabled offline mode for license installation and improved connection error logging.
* Improved error messages for Snowflake authentication failures and refactored connection resolution for commands.
* Introduced `AiVerificationTestBase` to support language as a parameter in AI verification tests.
* Updated the AI initial prompt message and corrected a UI icon.
* Enhanced AI verification job execution to prioritize the `source_Processed` directory when available.
* Added `InvalidInputError` handling for AI verification inputs.

### Fixes

#### SSIS

* Fixed inconsistent Start/End block tagging in stored procedure output.

#### General

* Fixed an issue where SnowConvert AI displayed unsupported objects as pending.

## Version 2.6.3 (Feb 11, 2026)

### New Features

#### SQL Server

* Added support for `INSERT INTO EXECUTE` in procedures.
* Added support for `CONCAT_NULL_YIELDS_NULL`, `NUMERIC_ROUNDABORT`, and `ARITHABORT` SET options.

#### SSIS

* Added OLE DB Command (DELETE/UPDATE) support for SSIS to dbt translation.
* Added conversion of completion precedence constraints.

#### Power BI

* Added support for repointing connections identified as pending changes.

#### General

* Added Terms and Conditions acceptance functionality.

### Improvements

#### SSIS

* Improved the placement of SSIS task comment tags for better readability.

#### General

* Enhanced the project header to display source dialect names alongside project names.
* Improved error handling for array-type errors in conversion results.

### Bug Fixes

#### SQL Server

* Fixed resultset handling for set operators.

#### SSIS

* Fixed false column collision on Lookup No Match output paths.
* Fixed start tags not being added to unsupported control flow tasks.

#### General

* Fixed deployment error messages not being cleared after a successful redeployment.
* Fixed an issue where empty capabilities caused unexpected behavior.

## Version 2.5.0 (Feb 06, 2026)

### New Features

#### Hive

* Added support for Hive Date Format.

#### General

* Added new outcomes for source and converted dependency failures in AI Verification.
* Added a SQL platform selector.
* Implemented logic to split logs by execution and to clean up logs older than 30 days.

### Improvements

#### General

* Changed connection testing to use session state instead of querying data.
* Updated conversion status messages and adjusted icon usage in status templates for better clarity and consistency.
* Updated the `deployment_order` column to allow null values and enhanced parsing logic in the CodeConversionService.
* Changed conversion status and type to object type.
* Enhanced script extraction descriptions and integrated source dialect information in the CodeConversionService CodeLoadPage.
* Added quick access to the connections file from the Login page.
* Ensured all files are copied in AI output when AI flags are enabled.

## Version 2.3.3 (Feb 04, 2026)

### New Features

#### BigQuery

* Enabled BigQuery dialect support for AI Verification.

#### PostgreSQL

* Enabled PostgreSQL dialect support for AI Verification.

#### Redshift

* Added support for the `STRTOL` function.
* Added support for assigning query results to variables within stored procedures.

#### SQL Server

* Added support for the `HASHBYTES` function.

#### General

* Added accept command for AI-Convert.
* Added object selector support in Code Deployment.
* Added source and target mapping information in the ETL.Elements report.
* Added support for the `FROM_UNIXTIME` built-in function.

### Improvements

#### SQL Server

* Improved code formatting.
* Excluded aliases from the Object References report.

#### General

* Added ‘Learn More’ links to footers and introduced `body-small-italic` text variant.
* Migrated from `InternalError` to the `ScError` interface for enhanced error handling.
* Added safeguard to prevent accidental application closure during active conversions.
* Enhanced error handling for code conversions and added tooltips to results tables in Data Migration and Validation.
* Removed unused reject change functionality.
* Refactored connection resolution to separate credential validation from connectivity testing.
* Updated DataMigrator component and refactored Snowflake credential handling.

### Bug Fixes

#### PostgreSQL

* Fixed the ordering of generated function options to prevent deployment errors in Snowflake.

#### General

* Fixed an issue preventing the reporting issue modal from displaying correctly.
* Fixed `HOST_NAME()` function conversion to `CURRENT_IP_ADDRESS()` when using FDM.

## Version 2.3.2 (Jan 30, 2026)

### New Features

#### General

* Added a legacy view for code process results.
* Added support for using a `.yaml` configuration file in the Data Validation command.
* Added direct access to OpenLogs from the application menu.
* Added support for an object selector in Data Validation.

### Improvements

#### General

* Implemented a mechanism to read from `connections.toml` and fall back to `config.toml` when retrieving the default connection.

### Fixes

#### General

* Fixed an issue where AI Verification was missing `.sql` content.
* Fixed an issue where the application did not navigate to results after partially successful migration and validation, and resolved name overflow in progress cards.

## Version 2.3.1 (Jan 28, 2026)

### New Features

#### RedShift

* Added support for RedShift in Data Validation.

#### General

* Added a Sign Out option to the application menu.
* Implemented an offline mode for project commands.
* Added support for the `Unix_TimeStamp` built-in function.

### Improvements

#### RedShift

* Improved error propagation within the Data Validation feature.

#### General

* Added a loading fallback mechanism for the Login page.
* Added dynamic model validation to AI Verification processes.
* Refactored the AI Verification Service to manage job options on a per-project basis.
* Set the connection project default and improved AI Verification commands.
* Improved handling of `ConvertedObjectFixAttemptFailed` status in two-sided conversion runs.
* Added migration project context, loading it from a project-relative path.

### Fixes

#### Teradata

* Fixed the ordering of column options when column-level collation is generated for Teradata conversions.

#### General

* Fixed path comparison logic to correctly handle null or whitespace input in the `ValidateInputPath` method.
* Resolved an issue causing blinking when opening a project.

## Version 2.3.0 (Jan 26, 2026)

### Improvements

#### SSIS

* Added support for converting Microsoft.BulkInsertTask.
* Implemented conversion support for Execute Package Variable Binding.
* Added support for converting `INSERT...OUTPUT INSERTED` statements to `INSERT + SELECT INTO`.

#### PostgreSQL

* Updated the transformation of `<<` and `>>` bitwise operators to correctly handle integer type differences when converting from PostgreSQL to Snowflake.

#### General

* Updated Snowflake authentication to be handled directly through the IConnectionResolver, utilizing the provided connection when available.
* Implemented data validation improvements.

### Fixes

#### SSIS

* Resolved a styling issue affecting cards in the ETL SSIS Report.
* Fixed an issue where the “Check dependencies” button in the ETL SSIS Report was not functioning.

#### General

* Corrected the architecture name used for macOS Intel installers.
* Removed the access code requirement and implemented a direct redirect to the login page.
* Added source system text and corrected color display.

## Version 2.2.7 (Jan 22, 2026)

### Improvements

#### General

* Cached the VS Code path to improve performance.
* Updated connection redirection to fetch the last used credentials.

### Fixes

#### Oracle

* Added support for the `NULLIF` function to trim transformations for Oracle.

#### RedShift

* Fixed the RedShift connection handling for the default port and removed duplicated authentication method input.

#### SQL Server

* Fixed an issue where `JSON_UDF.sql` was not being deployed for SQL Server.
* Fixed integer divisions to correctly use truncation for SQL Server.

## Version 2.2.6 (Jan 21, 2026)

### Improvements

#### General

* Implemented renaming in testing mode and on the verify button.
* Changed the name of “AI Verification” to “AI Code Conversion”.
* Created data models for selector commands and refactored the TopLevelCodeUnits reader.
* Added `useUserMetadata` to set user metadata for telemetry.
* Replaced temporary credentials with active credentials across the application.
* Added the `useCredentialStorage` hook for managing credential IDs in local storage.
* Added a host URL input field to the Snowflake Connection Form.
* Added an AI Verification disclaimer.
* Added pre-extraction checks for the Code Extraction command, including a source connection test.
* Added collation support to code extraction.
* Added a Technical Discovery section.

### Fixes

#### Redshift

* Changed the `WITH NO SCHEMA BINDING` transformation to always remove it for Redshift conversions.

#### SQL Server

* Fixed an issue with `SELECT` statements containing a variable in the `TOP` clause for SQL Server conversions.

#### Teradata

* Removed the `NEXT` keyword in fetch next statements for Teradata conversions.

#### General

* Fixed an issue where procedures and views were not marked as verified by the user.
* Fixed an issue in two-sided validation.
* Fixed path validation when autocompleting.
* Implemented UI fixes for the “Use the extraction script” section.
* Fixed an issue with temporary tables having an incorrect schema.

## Version 2.2.5 (Jan 16, 2026)

### New Features

#### Hive/Spark/Databricks

* Added `DATE_SUB` function replacers.

#### PostgreSQL

* Added an arrange option for routines with quoted definitions.
* Added preprocessing for single-quoted procedure bodies.

#### Teradata

* Added support for `.IMPORT` and `.SET` commands in Mload.

#### SSIS

* Added an ETL SSIS Report.
* Enhanced variables conversion with improvements and refactoring.

#### PowerBI

* Added support for column renaming in embedded SQL cases.

#### General

* Implemented project input validations.
* Added user metadata to telemetry upon login.
* Added list, cancel, and status commands for AI Verification.
* Added an initial Login Page with enhanced licensing features.
* Added support for AI Verification contracts.
* Added transformation of SQL code within `dbt_project.yml` variables.

### Improvements

#### SQL Server

* Improved the transformation of the `ROW` keyword as a reserved keyword.
* Removed unnecessary commas from Transact-SQL transformation output.

#### Teradata

* Updated the collation conversion setting to align with the tool’s new default behavior.
* Updated current collation support to ensure compatibility with Iceberg Tables transformations.
* Removed CREATE STAGE and PUT commands from the EXECUTE IMMEDIATE block in Mload transformations.

#### General

* Improved AI Verification status reporting.
* Improved consistency of capitalization in Settings tabs.
* Consolidated the definition of source languages that support database connections for an improved end-to-end flow.
* Updated the code extraction command to enhance object type handling and improve progress reporting.
* Refactored the Translation API and implemented logic for Mapplet CTE generation.
* Refactored SQL transformation for variables.
* Added a Mapplet CTE Builder Module.

### Fixes

#### SQL Server

* Fixed an issue with the `coalesce` function for integer types when an empty string was present.

#### General

* Corrected an issue where AI verification results were not showing two-sided comparisons.
* Removed Etl instrumentation argument.
* Added null validation to the RAISERROR helper parameters.

## Version 2.2.2 (Jan 13, 2026)

### Improvements

#### General

* SQL Server extraction process now adds ‘GO’ statements after `USE database` commands in object definition files to allow files to be executable.
* Metadata extraction now intelligently focuses on extracting only supported object types for each specific platform.
* Improved conversion settings and disabled action buttons when a conversion status is pending (e.g., during AI verification).
* Added a ConnectionInfoBanner component to clearly display saved connection information.
* Refactored credential management by removing secret handling methods and simplifying connection processes.
* Introduced a database dropdown in data migration and validation screens and removed unnecessary required fields in connection configurations.
* Updated an internal dependency from ‘balto’ to ‘stellar’.
* Enhanced internal utilities for managing test results and progress.

## Version 2.2.1 (Jan 08, 2026)

### New Feature

#### General

* The Missing Object Report has been merged into Object References, streamlining reporting. This unifies both valid and missing object references in a single report.

### Improvements

#### General

* Implemented a mechanism to parse and modify `.toml` files for credentials for Snowflake and various source languages.

## Version 2.2.0 (Jan 07, 2026)

### New Features

#### General

* Implemented initial code unit state management using JSON files.
* Added a new JobStorageService.
* Implemented AI verification server and interfaces.
* Added support for identity columns in table definition queries.
* Added required credentials configuration for the Data Migration Connection Page and updated the Data Validation Connection Page to include KeyPair as a supported authentication method.
* Migrated AI Verification to the new jobs infrastructure and services.
* Added models for the AI Verification job.
* Defined reader and writer components for TOML files.

### Improvements

#### General

* Updated the VersionInfoProvider to strip branch and commit hash from the version string.
* Added a missing singletons registry for dependency injection.
* Made the `reportFilePath` optional in `CodeUnitConversionProgress`.
* Moved the `SpcsManager` and its dependencies to the Databases project for better organization.
* Implemented an AI verification orchestrator.
* Refactored credential management methods to utilize `CreateOrEditCredentials`.
* Updated conversion status in other features when verified by a user.
* Removed AI Verification v1.
* Enhanced password input fields to prevent overflowing.
* Refactored AI Verification to use `CredentialsId` instead of `ConnectionString`.
* Improvements in the deploy command.
* Updated application version retrieval to use semantic versioning.

### Fixes

#### Teradata

* Fixed parenthesis issues that caused incorrect `PARTITION BY` generation for Iceberg table transformations in Teradata.

#### General

* Fixed SSO URL handling in Snowflake credentials configuration.
* Addressed minor issues related to AI verification.
* Fixed relative paths in AI verification job execution and application state management.

## Version 2.1.0 (Dec 18, 2025)

### New Features

#### IBM DB2

* Implemented DECFLOAT transformation.

#### Oracle

* Added support for transforming `NUMBER` to `DECFLOAT` using the [Data Type Mappings](../../getting-started/running-snowconvert/conversion/oracle-conversion-settings.md) feature.
* Added a new report [TypeMappings.csv](../../getting-started/running-snowconvert/review-results/reports/type-mappings-report.md) that displays the data types that were changed using the Data Type Mappings feature.

#### PowerBI

* Added support for the Transact connector pattern for queries and multiple properties in the property list for PowerBI.

#### Teradata

* Added a new conversion setting [Tables Translation](../../getting-started/running-snowconvert/conversion/teradata-conversion-settings.md) which allows transforming all tables in the source code to a specific table type supported by Snowflake.
* Enabled conversion of tables to Snowflake-managed Iceberg tables.

#### SSIS

* Added support for full cache in SSIS lookup transformations.

#### General

* Added temporary credentials retrieval for AI Verification jobs.
* Added summary cards for selection and result pages.
* Implemented full support for the Git Service.
* Added ‘verified by user’ checkboxes and bulk actions to the selection and results pages.
* Added a dependency tag for AI Verification.
* Implemented the generation of a SqlObjects Report.

### Improvements

#### RedShift

* Optimized RedShift transformations to only add escape characters when necessary in `LIKE` conditions.

#### SSIS

* Improved Microsoft.DerivedColumn migrations for SSIS.

#### General

* Added the number of copied files to relevant outputs.
* Changed some buttons to the footer for improved UI consistency.

### Fixes

#### Teradata

* Fixed transformation of bash variables substitution in scripts.

## Version 2.0.86 (Dec 10, 2025)

### Improvements

#### RedShift

* Added support for the `MURMUR3_32_HASH` function.
* Replaced Redshift epoch and interval patterns with Snowflake `TO_TIMESTAMP`.

#### SSIS

* Added support for converting Microsoft SendMailTask to Snowflake SYSTEM.
* Implemented SSIS event handler translation for `OnPreExecute` and `OnPostExecute`.

#### SQL Server

* Enhanced transformation for the `Round` function with three arguments.

#### Informatica

* Updated `InfPcIntegrationTestBase` to import the real implementation of translators and other necessary components.

#### General

* Enhanced procedure name handling and improved identifier splitting logic.
* Improved object name normalization in DDL extracted code.
* Implemented a temporal variable to keep credentials in memory and retrieve the configuration file.
* Updated the TOML Credential Manager.
* Improved error suggestions.
* Added missing path validations related to ETL.
* Improved the application update mechanism.
* Implemented an exception to be thrown when calling the `ToToml` method for Snowflake credentials.
* Changed the log path and updated the cache path.
* Implemented a mechanism to check for updates.
* Merged the Missing Object References Report with ObjectReferences.
* Changed values in the name and description columns of the ETL.Issues report.
* Added support for Open Source and Converted models in AI Verification.
* Added a new custom JSON localizer
* Added a dialog to appear when accepting changes if multiple code units are present in the same file.
* Added a `FileSystemService`.
* Added an expression in the ETL issues report for `SSISExpressionCannotBeConverted`.

### Fixes

#### SQL Server

* Fixed a bug that caused the report database to be generated incorrectly.
* Fixed a bug that caused unknown Code Units to be duplicated during arrangement.

#### General

* Fixed an issue that prevented the cancellation of AI Verification jobs.
* Fixed an issue to support EAI in the AI specification file.
* Fixed an issue where the progress number was not being updated.
* Fixed the handling of application shutdowns during updates.

## Version 2.0.57 (Dec 03, 2025)

### Improvements

#### SQL Server

* Enhanced SQL Server code extraction to return schema-qualified objects.

#### General

* Enhanced Project Service and Snowflake Authentication for improved execution.
* Removed GS validation from client-side, as it is now performed on the server side.
* Implemented connection validation to block deployment, data migration, and data validation when a connection is unavailable.
* Enhanced conversion to use the source dialect from project initialization.
* Improved `CodeUnitStatusMapper` to accurately handle progress status in UI status determination.
* Implemented batch insert functionality for enhanced object result processing.

### Fixes

#### General

* Resolved an issue where conversion settings were not being saved correctly.
* Corrected data validation select tree to properly skip folders.
* Fixed content centering issues in the UI.
* Normalized object names in AI Verification responses to prevent missing status entries in the catalog.

## Version 2.0.34 (Nov 27, 2025)

### Improvements

#### General

* Resolved an issue where PowerBI was not correctly displayed in the list of supported languages.

## Version 2.0.30 (Nov 26, 2025)

### New Features 🚀

#### IBM DB2

* Added transformation support for SELECT INTO and VALUES statement for variable assignments within User-Defined Functions (UDFs).

#### Oracle

* Added transformation support for SELECT INTO for variables assignments within User-Defined Functions (UDFs).

#### SQL Server

* Added transformation support for SELECT INTO for variables assignments within User-Defined Functions (UDFs).

#### SSIS

* Added support for SSIS Event Handlers.

#### General

* Introduced AI Verification Contract Model Codes.
* Created a base component for the Code Processing View.
* Implemented YAML reading and writing services with an enhanced `info` command.
* Created an execution type selector page.

### Improvements

#### General

* Updated GS Version to 9.50.99 to ensure compatibility with newer versions of GS.
* Expanded job storage test coverage across data validation, migration, deployment, extraction, metadata, and AI verification.
* Refactored the FilteredObjectExplorer layout and removed unnecessary container styles from the AI Verification Selection Page and Mappings Page to improve UI consistency.
* Enhanced deployment database selection.

### Fixes

#### SQL Server

* Resolved an error that occurred when parsing SQL Server connections with a specific port.

#### General

* Resolved an issue where mappings were not functioning correctly in code conversion.
* Corrected the process for cleaning the conversion directory before conversion.
* Fixed deployment dropdown functionality.

## Version 2.0.8 (Nov 21, 2025)

### Improvements

#### Teradata

* Added support for `GOTO-LABELS` in SnowScript.

#### Spark SQL

* Added support for transformation rules to handle `INSERT OVERWRITE` statements.

#### SQL Server

* Added support for the `CREATE SEQUENCE` statement.

#### General

* Added support for IBM Db2 from the UI.
* Fixed Db2 support in SourceDialect and Update Conversion settings window.
* Added support for tables with large numbers of rows (> 2.5B) in data migrator.

## Version 2.0.0 (Nov 20, 2025)

The SnowConvert AI interface is revised to improve efficiency, control, and usability.
In the improved interface, you can run specific flows independently, including extraction, deployment, and
data validation. There is now a dedicated project page to show you which flows you can run. The improved
interface gives you more granular control over your project and makes managing complex workflows easier.

For more information, [SnowConvert AI: Project Creation](../../user-guide/project-creation.md)

## Version 1.21.0 (Nov 08, 2025)

### Improvements

#### SQL Server

* Added support for the CREATE SEQUENCE statement.

#### General

* Added a notification to inform users about the SnowConvertAI 2.0 New UI experience.

## Version 1.20.11 (Nov 07, 2025)

### Improvements

#### RedShift

* Added support for the `CURRENT_SETTING` timezone.

#### Spark SQL

* Added support for `INSERT BY NAME` and removed the `TABLE` keyword and partition clause.

#### PowerBI

* Supported dynamic parameterization in connectors with embedded queries in WHERE clauses.

## Version 1.20.10 (Nov 06, 2025)

### Improvements and Fixes

#### RedShift

* Added support for HLL functions.
* Added support for JSON functions.
* Added support for OBJECT_TRANSFORM.

#### SSIS

* Added conversion support for SSIS Microsoft.ExpressionTask.
* Modified the condition used to determine if an SSIS package is reusable.

#### Teradata

* Added support for named arguments in the EXECUTE (Macro Form) statement.
* Fixed an issue where scripts were not being migrated to Snowscript.
* The Continue Handler is now available for Scripts.

#### PostgreSQL

* Fixed an issue where procedures did not have the EXECUTE AS CALLER clause generated by default when the SECURITY clause was absent in the input.

#### RedShift

* Fixed an issue where non-ASCII characters in columns were not quoted during data migration.

#### SQL Server

* Fixed an issue where default GETDATE column constraints applied an unnecessary double cast in the column definition.

## Version 1.20.8 (Nov 05, 2025)

### Improvements

#### General

* Added support for alert preview notifications.
* Improved Claude model validation to inform users about required access.

## Version 1.20.7 (Oct 31, 2025)

### IBM DB2 Stored Procedures & User-Defined Functions Support

SnowConvert AI now supports the conversion of DB2 stored procedures to Snowflake equivalents, enabling seamless migration of procedural code. This feature includes support for variable operations, and control flow statements. Also, DB2 user-defined functions will be converted to Snowflake Scripting UDFs when possible.

### New Features 🚀

#### SSIS

* Implemented SSIS to Snowflake string literal escape sequence conversion.

### Improvements

#### Teradata

* Updated scripts transformation to utilize a `continue` handler.

#### General

* Cleaned up `package.json` for customer distribution.

### Fixes

#### BigQuery

* Fixed an aggregation issue that occurred when column aliases had the same name as table columns.

#### Oracle

* Fixed `NOT NULL` constraint behavior with `INLINE`, `CHECK`, and `PK` constraints.

#### SSIS

* Fixed an issue where comment tags were not displayed in converted reusable packages.

#### General

* Updated database dependencies and resolved dependencies vulnerabilities.

## Version 1.20.6 (Oct 29, 2025)

### New Features 🚀

#### SSIS

* **SSIS Replatform migration (Public Preview)** - SnowConvert AI now supports SSIS package migration to Snowflake in Public Preview, enabling automated conversion of SSIS workflows to modern cloud-native data pipelines.

#### BigQuery

* Added support for the `JSON_TYPE` built-in function.
* Added support for the `SAFE.POW` function.
* Added transformation for array slice patterns.
* Added more array pattern support.

#### IBM DB2

* Added support for `CONTINUE HANDLER`.

#### Oracle

* Added support for the `PARTITION` clause in `MERGE` statements.

#### RedShift

* Added support for `CONTINUE HANDLER`.

#### Teradata

* Added support for `CONTINUE HANDLER`.

#### PowerBI

* Added the `HierarchicalNavigation` flag as an optional parameter in native connectors.

### Improvements

#### Oracle

* Enhanced arithmetic operations with `TIMESTAMP` values.

#### SSIS

* Enhanced the UI for the SSIS Replatform Public Preview release with improved user experience and workflow optimization.

#### General

* Removed the SSC-EWI-0009 warning from non-literal expressions and added FDM instead.
* AiVerification - Added support for `n_tests` parameter in configuration file.

### Fixes

#### BigQuery

* Fixed an issue where UDF files were not generated.

#### General

* Fixed an issue where symbols were not loaded in views and set operations.

## Version 1.20.3 (Oct 20, 2025)

### Fixes

#### General

* Fixed incorrect verified objects count calculation during validation process.
* Updated warehouse validation error messages to maintain consistency with connector messaging.

## Version 1.20.2 (Oct 20, 2025)

### New Features 🚀

#### BigQuery

* Added support for `REGEXP_EXTRACT_ALL` and `ROW_NUMBER` built-in functions.
* Added support for the `ARRAY` built-in function.

### Improvements

#### SQL Server

* Migrated `XACT_STATE` to `CURRENT_TRANSACTION`.
* Enabled `ROLLBACK` transformation within explicit transactions.

#### PowerBI

* Improved the ‘Pending Work Description’ on repointing reports.

### Fixes

#### BigQuery

* Fixed an issue where literals inside `IN UNNEST` were not being transformed.
* Fixed `SAFE_CAST` behavior when the input type is not `VARCHAR`.

#### General

* Fixed queries containing aggregate functions and multiple columns.

## Version 1.20.1 (Oct 16, 2025)

### New Features 🚀

#### BigQuery

* Added support for the `REGEXP_REPLACE` function.
* Added support for the `FORMAT` function with the `%t` argument.
* Added support for the `BYTE_LENGTH` function.
* Added support for the `TIMESTAMP_TRUNC` function.

### Improvements

#### Teradata

* Improved the preservation of default values in `SELECT INTO` statements for empty results.

#### PowerBI

* Improved M-Query source retrieving from metadata files.

### Fixes

#### PowerBI

* Fixed an issue where the parameter list was not read correctly when the connection pattern was rejected.
* Added description in ETLAndBiRepointing assessment report when non-database or non-applicable connectors are unmodified.

## Version 1.20.0 (Oct 15, 2025)

### New Features 🚀

#### BigQuery

* Added support for `TimestampDiff`, `Safe_Divide`, and `Except` functions.
* Added support for the `ARRAY_AGG` function.
* Added the `UNNEST` built-in symbol.
* Added support for the `UNIX_SECONDS` built-in function.
* Added support for `UNIX_MILLIS` and `UNIX_MICROS` built-in functions.
* Added support for `ARRAY_CONCAT`, `TIMESTAMP_MILLIS`, and `ENDS_WITH` functions.
* Added support for `JSON_QUERY`, `JSON_EXTRACT`, `JSON_QUERY_ARRAY`, and `JSON_EXTRACT_ARRAY` functions.

#### Teradata

* Added support for `.REMARK` in SnowScript.

#### SSIS

* Implemented SSIS ForEach File Enumerator translation logic.

#### Tableau

* Added repointing assessment for Tableau repointing.

#### General

* Added support for the MD5 function.

### Improvements

#### Teradata

* Commented out ERROR LEVEL in BTEQ.

#### SSIS

* Enhanced identifier sanitization for SSIS.
* Improved retrieval of the “CopyFromReferenceColumn” property for output columns in SSIS Lookup.

#### General

* Added account information to AiVerification logs.
* Added warehouse validation to the Snowflake login.
* Wrapped control variable values with `TO_VARIANT` in `UpdateControlVariable` calls.
* Refactored SQL task creation to use `CREATE OR REPLACE` syntax.
* Added `INSERT...SELECT` with `TO_VARIANT` for control variables.
* Added transformation for string case-insensitive comparisons.

### Fixes

#### General

* Fixed name collisions of tasks in the main control flow with container tasks.
* Fixed “No expression translation for negative numbers (or unary ‘minus’)” issues.

## Version 1.19.7 (Oct 10, 2025)

### New Features 🚀

#### BigQuery

* Support was added for the `TO_HEX` and `ARRAY_TO_STRING_FUNCTION`.

#### Oracle

* SnowScript UDF is now generally available.

#### SQL Server

* SnowScript UDF is now generally available.

#### General

* A feature flag was added to hide additional options for the AiVerification API.
* An interface was added to Abstract Syntax Trees (ASTs) for representing string comparisons.
* Partial support was added for the `ARRAY_CONCAT_AGG` function.
* Support was added for the `REGEXP_EXTRACT` function.
* Transformation support was added for the `UNNEST` function within an `IN` predicate.
* Support was added for the `NET.IPV4_TO_INT64` function.

### Improvements

* The maximum GS version was bumped to 9.37 to extend the period of SC usage until the end of October.
* The `TSqlNotSupportedStatementReplacer` rule is now bypassed when processing ETL SQL fragments.
* Predecessor name generation now uses the SSIS package file name.

### Fixes

* The verified objects count issue was resolved.

## Version 1.19.6 (Oct 08, 2025)

### New Features 🚀

#### SSIS

* Added core foundation for ForEach File Enumerator.
* Added base infrastructure for Dynamic SQL.
* Added cursor-based iteration structure for SSIS ForEach Loop containers.
* Added dynamic SQL support for SSIS Execute SQL Task.

#### Oracle

* Added support for the BigQuery UNNEST operator.
* Added support for JSON_VALUE_ARRAY built-in function.
* Added support for multiple built-in functions.
* Added support for TIMESTAMP_SECONDS.
* Added support for SnowScript UDF.

#### SQL Server

* Added TSqlSetIdentityInsertReplacer to handle SET IDENTITY_INSERT in Transact.

#### General

* Added AI Verification PuPr Followup items.
* Added ‘connection’ element in Tableau repointing.
* Added semicolons to execute SQL task statements inside containers.
* Added transformation for NET.SAFE_IP_FROM_STRING function.
* Added support for HLL_COUNT.MERGE function.
* Added support for NET.IP_NET_MASK function.
* Added ETL Preprocess Task.
* Added transformation for HLL_COUNT.INIT function.
* Added support for offset array accessor function.

### Improvements

* Added serialization and deserialization of query symbols in the migration context.

### Fixes

#### Teradata

* Fixed issues related to CAST formats.

#### Oracle

* Fixed an issue where supported formats were incorrectly marked as unsupported.
* Fixed a bug related to symbol key creation.

#### General

* Fixed an issue with symbol key creation when loading symbols with context.
* Fixed an issue with quotes and value length in Tableau repointing.

## Version 1.19.5 (Oct 03, 2025)

### New Features 🚀

#### General

* Added support for Snowflake select asterisk column expressions.

#### SSIS

* Added support for converting CAST expressions.

### Improvements

#### General

* Enhanced the SnowflakeLogin method to handle email users correctly.
* Moved the Declare Statement Replacer to SQL.

### Fixes

#### SSIS

* Fixed an issue where SSIS containers BEGIN END without a semicolon.

#### Oracle

* Fixed an issue with incorrect function transformation.
* Fixed an issue where SYS_REFCURSOR was not being migrated correctly.

#### dbt

* Fixed an issue with conditional split downstream ref() calls.

## Version 1.19.4 (Oct 1, 2025)

### New Features 🚀

#### PowerBI

* Expanded test scenarios for Teradata Power BI repointing.

#### SSIS

* Introduced support for the SSIS Merge Join transformation.
* Implemented orchestrator task variable wrappers for SSIS tasks.

#### Teradata

* Ensured script files are now correctly reported as code units when Snowscript is the target.

#### Oracle

* Implemented a warning system for users when a referenced datatype might be unsupported.
* Renamed `RAISE_MESSAGE_UDF.sql` to `RAISE_MESSAGE.sql` for clarity and consistency.

#### SQL Server

* Added parameters as an identifier for improved recognition.

#### dbt

* Relocated configuration files to the ETL output directory and removed analyses and snapshots folders from dbt projects.

#### General

* Added transformations for BTEQ labels to support nested procedures.
* Enabled result binding for the Execute SQL Task.
* Included Migration ID in object tagging and relevant reports for enhanced telemetry.
* Reduced the frequency of the AI Verification prompt.

### Improvements

#### Oracle

* Enhanced the conversion process for `%TYPE` declarations.
* Refactored DB2 variable declarations for improved consistency.

#### General

* Improved collision detection and resolution mechanisms for ETL transformations.

### Fixes

#### Oracle

* Resolved an issue where `RAISE_MESSAGE_UDF.sql` was incorrectly referenced in PostgreSQL tests.
* Addressed a problem where EWI (Error Warning Information) was not being added to unresolved types in Oracle.

#### General

* Improved error handling and logging within the `AiVerificationHttpClient`.

## Version 1.19.3 (Sep 29, 2025)

### Improvements

#### General

* Enhanced VerifiedTemplate to better manage child verification states.

### Fixes

#### General

* Fixed an issue where the role was not being propagated correctly to the login endpoint.
* Fixed an issue where ZIP files created in Windows did not preserve proper Unix permissions.

## Version 1.19.2 (Sep 26, 2025)

### New Features 🚀

#### PowerBI

* Added support for dynamic or custom concatenation for greater flexibility in data transformations.

#### SSIS

* Implemented the core infrastructure for recursive conversion of SSIS containers (e.g., `For Loop`, `Foreach Loop`), enabling the processing of more complex structures.

### Improvements

#### SSIS

* Completed the implementation of inlined conversion for containers to better handle control flows.

#### Teradata

* Reordered UDFs and updated the default time format (`HH:MI:SS.FF6`) to improve conversion compatibility.

#### dbt

* Removed angled brackets (`<>`) from generated YML configuration files to prevent potential syntax errors.
* Removed unnecessary tags from models generated during ETL conversions to produce cleaner code.
* Simplified the names of generated models in ETL conversions to enhance project readability.

### Fixes

#### PostgreSQL

* Resolved an error in the `RAISE_MESSAGE_UDF` when it was called with only two parameters.

#### General

* Updated SQLite storage filename to include file extension for better file management.
* Corrected an issue in the `TRANSFORM_SP_EXECUTE_SQL_STRING_UDF` helper where `datetime` values were formatted incorrectly in dynamic SQL.
* Applied internal fixes related to Nuget package management.
* Corrected an incorrect enumeration in an internal resource file (`IssueResources.json`).

## Version 1.19.0 (Sep 24, 2025)

### New Features 🚀

#### General

* Added support for IDENTITY in CTAS statements.
* Enhanced telemetry settings in data migration configuration for improved metrics collection control.

#### Tableau

* Added initial infrastructure for converting Tableau projects.

#### ETL & SSIS

* Implemented new output structure for ETL conversions, grouped by filename.
* Added support for ISNULL function conversion and variables in “Derived Column” expressions.
* Enhanced SSIS assessment report and task generation using original package names.

#### DB2

* Added support for DECLARE TABLE statement transformation.

#### BigQuery

* Added support for REGEXP_CONTAINS function.

#### dbt

* Refactored dbt project generator to unify variable conversion logic.

### Fixes

#### PowerBI

* Fixed CommandTimeout parameter and schema uppercase conversion issues.

#### SSIS

* Fixed critical bug with plus operator (+) on numeric operands.

#### General

* Corrected conversion rate calculation.
* Enhanced DROP TABLE handling and COALESCE type resolution.
* Removed conversion of On Commit Preserve Rows node for Teradata.

## Version 1.18.3 (Sep 22, 2025)

### Fixes

* Improved the refresh deployment catalog functionality in the end-to-end experience.
* Fixed navigation issues with the Retry Conversion flow.

## Version 1.18.0 (Sep 18, 2025)

### New Features 🚀

#### PuPr AI Verification

* Added new [AI Verification](../../../snowconvert-ai-verification.md) step for SQL Server migrations.

#### SQL Server

* [Preview Feature] Support for UDF translation to [Snowflake Scripting UDFs](../../../../../developer-guide/udf/sql/udf-sql-procedural-functions.md)
* Support for `ERROR_NUMBER` to `SQLCODE`.
* Support for `COL_LENGTH` built-in function.

#### Teradata

* Support for the `TD_MONTH_BEGIN`, `TD_WEEK_BEGIN`, and `TD_WEEK_END` built-in functions.
* Support for hex literals in the `OREPLACE` built-in function.

#### Oracle

* Support `ASCIISTR` built-in function.
* Support for `MAX DENSE_RANK FIRST` and `MIN DENSE_RANK LAST` clauses.

#### SSIS

* Added SSIS Microsoft.Merge transformation
* Enhanced SSIS variable handling transformation

### Fixes

#### Oracle

* Improved recognition of correlated queries.

## Version 1.17.6 (Sep 5, 2025)

### Fixes

* Fixed crashes in code conversion on SnowConvert classic mode.

## Version 1.17.2 (Sep 4, 2025)

### Fixes

* Fixed visual issues in the object selection screen.

## Version 1.17.1 (Sep 1, 2025)

### New Features 🚀

#### General

* [IBM DB2 SQL Support](../../getting-started/running-snowconvert/supported-languages/ibm-db2.md)
  SnowConvert AI now supports the conversion of Tables and Views to Snowflake. This feature includes support for the following:

  + Translation of [Tables](../../../translation-references/db2/db2-create-table.md).
  + Translation of Views.
  + Translation of [Data Types](../../../translation-references/db2/db2-data-types.md).
  + Translation of Built-in Functions.
* Added new columns to [Top Level Code Unit report](../../getting-started/running-snowconvert/review-results/reports/top-level-code-units-report.md): Code Unit Database, Code Unit Schema and Code Unit Name

#### PostgreSQL & Based Languages

* Support for Bitwise Functions

### Fixes

#### General

* Modified [SSC-EWI-0040](../../technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) to specify the error node.

#### Teradata

* Removed .SET FORMAT from BTEQ transformation
* Fixed several BTEQ parsing errors
* Added support for BTEQ `.MESSAGEOUT` command
* Added pending transformation for shell variables inside conditions
* Added support for BTEQ `.SET FOLDLINE` command
* Added transformation for `.SET TITLEDASHES` command
* Downgraded EWI to FDM for `STATISTICS` BTEQ clause
* Downgraded EWI to FDM for `PERIOD` BTEQ clause

#### Oracle

* Fix transformation for DATE type attribute

#### SQL Server

* Improved the handling of procedures containing `SELECT INTO` statements that return a query.
* Transform `@@DateFirst` to `GET_WEEK_START`
* `Numeric format` function support
* `Convert` function support
* `Datename` function support
* Removed the symbol `@` in the conversion that uses XML queries.
* `Print` statement support
* Formats for datetime support.
* Improved the update statement by removing the table name from the clause when it appears in the target table.
* Error functions translation support.

## Version 1.16.2 (Aug 19, 2025)

### New Features 🚀

#### General

* Added a [new report](../../getting-started/running-snowconvert/review-results/reports/functions-usage-report.md), SQLFunctionsUsage.csv, that summarizes the invocations of built-in and user-defined functions grouped by their migration status. This report allows users to get details about function usages, whether they were transformed to Snowflake with no problem, or whether they require an additional post-conversion action.

#### Teradata

* Added transformation for the period `CONTAINS` clause

### Fixes

#### Oracle

* Fixed the `GENERATED ALWAYS` AS expr column option not being transformed
* Fixed dynamic SQL code strings not having their literal values properly escaped in the output

#### SQL Server

* Fixed the `DATETIME2` datatype not transformed correctly when precision is specified
* Fixed object names without brackets not being renamed when using the renamed feature
* Promoted SSC-FDM-TS0015 to EWI [SSC-EWI-TS0015](../../technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md) to fix objects with unsupported datatypes incorrectly marked as successfuly transformed
* Fixed some virtual columns transformed to datatype `VARIANT` instead of the right datatype for their expression
* Implemented transformation for the `STRING_SPLIT` function, previously being left as is in the output code
* Fixed `CREATE FUNCTION` bodies not generated when a `SELECT` statement was found in the `ELSE` clause of an `IF` statement
* Fixed identifiers containing the `@` character producing parsing errors
* Fixed the `DATE_PART` function incorrect transformation when the weekday part is specified
* Fixed the empty statements generated by parsing error recovery causing a pending functional equivalence error to be reported
* Fixed the `DATENAME` function transformation not generating the necessary UDF definitions in the `UDF Helpers` folder
* Fixed the `TRY_CAST/TRY_CONVERT` functions not being transformed in some cases

## Version 1.16.1 (Aug 11, 2025)

### New Features 🚀

* Added Key Pair authentication to login to Snowflake.
* Upgraded data validation Python support to 3.13.

## Version 1.16.0 (Aug 8, 2025)

### Fixes

* Fixed issue with retrieving access codes from SnowConvert due to certificate handling problems.
* Added Data Validation manual execution instruction and scripts.

## Version 1.15.1 (Aug 6, 2025)

### New Features 🚀

* Added support for PostgreSQL Array Expression and Array Access.

### Fixes

* Fixed transformation for Oracle’s JSON_OBJECT function.
* Updated links to the new [official documentation site](../../../overview.md).
* Fixed bug when clicking on retry conversion on a non E2E platform.
* Fixed optional fields in Snowflake connection form.
* Fixed some Oracle functions not being transformed to the correct target.

## Version 1.14.0 (Jul 30, 2025)

### New Features 🚀

* Added Migration Project Context feature.

## Version 1.13.0 (Jul 28, 2025)

### New Features 🚀

* Enhanced data migration performance by increasing default timeout values for large-scale operations including data extraction, analysis, and loading processes.
* Support for [nested procedures](../../../translation-references/oracle/pl-sql-to-snowflake-scripting/README.md) in Oracle.

### Fixes

* Routed SnowConvert AI API traffic from Azure-hosted domains (*.azurewebsites.net) to Snowflake-hosted domains (*.snowflake.com) to streamline integration and deliver a unified user experience.
* Fixed SSO authentication token caching during data migration processes, eliminating repeated authentication prompts that previously opened new browser tabs for each request.

## Version 1.12.1 (Jul 21, 2025)

### New Features 🚀

Conversion Option for External Tables for Hive-Spark-Databricks SQL.

### Fixes

* Backtick Identifiers Support in Sybase.
* Translation for Amazon Redshift COMMENT ON statement.
* Non-returning functions translated to stored procedures for PostgreSQL.

## Version 1.11.1 (Jul 11, 2025)

### New Features 🚀

Support for new Snowflake Out Arguments syntax within Snowflake Scripting on Teradata, Oracle, SQL Server, and Redshift migrations.

### Fixes

Enhanced Teradata Data Type Handling: JSON to VARIANT migration.
Improved recovery on Redshift procedures written with Python.

## Version 1.11.0 (Jul 1, 2025)

### New Features 🚀

New [Data Validation framework integration](../../user-guide/data-validation.md) for SQL Server End-to-End experience: Now, users can validate their data after migrating it. The Data Validation framework offers the following validations:
Schema validation: Validate the table structure to attest the correct mappings among datatypes.
Metrics validation: Generate metrics of the data stored in a table, ensuring the consistency of your data post-migration.

## Version 1.3.0 (Mar 25, 2025)

### Sybase IQ Support

SnowConvert AI now supports the conversion of Sybase IQ Create Table to Snowflake. This feature includes support for the following:

### New Features 🚀

* Sybase:

  + Translation of Regular and Temporary Tables
  + Translation of Constraints
  + Translation of Data Types

### Azure Synapse

* Fix Object References not shown in Object References and Missing Object References reports.
* Added parsing support for Materialized Views with distribution clause

## Version 1.2.17 (Mar 18, 2025)

### Azure Synapse Support

SnowConvert AI is adding support for Azure Synapse to Snowflake, now enabling direct translation for Azure Synapse SQL scripts and stored procedures to Snowflake’s SQL dialect. This complements our existing support for Transact-SQL (T-SQL) and provides a more comprehensive solution for users migrating from Microsoft’s data warehousing ecosystem.

### New Features 🚀

* **Common**:

  + Add a Relation Type column to the [Object References](../../getting-started/running-snowconvert/review-results/reports/object-references-report.md) and [Missing Object References](../../getting-started/running-snowconvert/review-results/reports/missing-objects-report.md) reports.

## Version 1.2.16 (Mar 10, 2025)

### Redshift Stored Procedures Support

SnowConvert AI now supports the conversion of Redshift stored procedures to Snowflake, enabling seamless migration of procedural code. This feature includes support for variable operations, control flow statements, cursor handling, and transaction management capabilities.

### New Features 🚀

Stored procedures new supported functionality.

* **General support**:

  + Transformation for `SELECT INTO` variables inside stored procedures.
  + Transformation for `CASE` statements without ELSE clauses.
  + Transformation of `RETURN` statement in Redshift.
  + Support of `RAISE` for logging, warnings, and exceptions.
* **Variable Binding**:

  + Support for binding variables in stored procedures.
  + Handling positional arguments for binding variables.
  + Variable bindings in the `OPEN cursor` statement.
* **Transaction Support**:

  + Initial support for `COMMIT`, `ROLLBACK`, and `TRUNCATE` statements.
* **Cursor Operations**:

  + Support for the `FETCH` statement.
  + Transformation for `refcursor variable declaration`.
* **DML Operations**:

  + Transformations for `INSERT`, `UPDATE`, `MERGE`, `SELECT INTO` statements.
* `**Control Flow Statements**`:

  + Support for basic control flows statements.
  + Transformations of Labels Stats against loops.
* **DDL Operations**:

  + Support for `CREATE TABLE AS` statement.

### Breaking Changes ⛓️‍💥

* Renamed Code Unit Name to Code Unit ID in Top-Level Code Units report.

## Version 1.2.6 (Feb 26, 2025)

### Oracle

* Fixed CONSTRAINT clauses incorrectly reported as parsing errors.

### Redshift

Added

* Support for **Declare** statement.
* Support for **Merge** statement.
* Support for **Update** statement.
* Support for variable declaration with **Refcursor** type.
* Support for **Declare**, **Open** and **Close** Cursor.

### Teradata

* Fixed ‘chars’, and ‘characters’ built-in functions being reported as missing references.

## Version 1.2.5 (Feb 7, 2025)

### Common

* Improved SnowConvert AI CLI help messages.

## Version 1.2.4 (Feb 7, 2025)

### Common

* Improved SnowConvert AI CLI help messages.

### Teradata

* Improved EWI consistency on DATE casting.

## Version 1.2.1 (Jan 31, 2025)

### Common

**Fixed**

* Improved mechanism to validate the SnowConvert AI license by preventing the use of the powershell current user profile settings, ensuring a smoother execution.

## Version 1.2.0 (Jan 28, 2025)

* **Free** access for anyone with a corporate email.
* **Redshift** conversion is now supported under preview.
* Remove assessment step. Assessment and conversion are now completed in only one step.
* Introduction of the new Code Completeness Score and Code Unit Methodology.
* Improved messages like Functional Difference Messages (FDMs), Performance Reviews (PRFs) and EWIs (error, warnings, and issues).

### Common

**Fixed**

* Usage of correlated scalar subqueries erroneously causing [SSC-EWI-0108](../../technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) under certain scenarios.

### Teradata

**Fixed**

* Set **Character Set** as optional in description of columns in derived tables.

## Version 1.1.91 (Dec 19, 2024)

### Common

**Fixed**

* Correlated scalar subqueries missing an aggregate function.
* Uncorrelated scalar subqueries are being marked as unsupported.

### Teradata

#### Added

* Added “ANSI/TERA Session Mode” and “Use COLLATE for Case Specification” settings:

  + ANSI mode with COLLATE.
  + ANSI mode without COLLATE.
  + TERA mode with COLLATE.
  + TERA mode without COLLATE.
* Support parsing of GENERATED TIMECOLUMN column option.
* Support parsing of TD_NORMALIZE_MEET function.\

#### Fixed

* Fixed inconsistencies in column names when it comes to Snowflake reserved keywords.
* Parsing errors in PARTITION BY RANGE_N clause.
* Improved support for COALESCE expression.

### SQL Server

#### Fixed

* Some functions were incorrectly marked as a pending functional.

## Version 1.1.80 (Dec 5, 2024)

### Common

**Fixed**

* SnowConvert AI was incorrectly marking scalar subqueries as invalid when some function aliases were used.
* Crash when SnowConvert AI didn’t have read/write permissions to configuration folder.

### Teradata

#### Fixed

* Renaming feature now contemplates function with parameters.
* The UPDATE statement with ELSE INSERT syntax was not converted correctly.

### SQL Server

#### Fixed

* SnowConvert AI now successfully converts @@ROWCOUNT using the global variable SQLROWCOUNT.
* View and column names from sys objects are now be paired with INFORMATION_SCHEMA.

## Version 1.1.69 (Nov 14, 2024)

### SQL Server

#### Fixed

* BIT Datatype with DEFAULT value is not converted to true or false but 1 or 0.

### Oracle

#### Fixed

* Code missing when converting a function with CONNECT BY.

## Version 1.1.67 (Oct 30, 2024)

### Teradata

#### Fixed

* Flag TeraModeForStirngComparison is set to true as default.

### SQL Server

#### Fixed

* Columns with default value are now converted correctly with their respective data type casting.

### Oracle

#### Fixed

* Code missing when converting a function with CONNECT BY.

## Version 1.1.63 (Oct 24, 2024)

### Common

* Recovery codes removed from the parsing error messages.
* Windows close button now works as intended.
* Added a new field **domain** to the comment clause for each DDL SnowConvert AI generates.

### Teradata

**Added**

* Support for UNION ALL clause with different data types and column sizes.
* Support for sp_executeql.

#### Fixed

* Inconsistencies in string comparison in Tera mode and ANSI mode.
* Complex column alias with syntax ‘’n is not being recognized by SnowConvert.

### SQL Server

**Added**

* FDM in every corelated subquery.

#### Fixed

* Issue with WITH DISTRIBUTION and CLUSTERED in table creation.

### Oracle

#### Fixed

* Issue that caused SP conversion to fail when using .rownum within a FOR statement.

## Version 1.1.61 (Oct 18, 2024)

### Teradata

#### Fixed

* Conversion of stored procedures inside macros is now supported.
* StringSimilarity Teradata Function is now converted successfully

### Oracle

#### Fixed

* DATEDIFF_UDF now returns date difference with timestamp as parameter with decimals (time part difference).

## Version 1.1.56 (Oct 9, 2024)

### Teradata

#### Fixed

* Create a Stored Procedure to compliance the same flow as in Teradata (StoredProcedure inside a Macro)
* Use a UDF Helper to emulate the functionality given for a VALIDTIME column in Teradata

### Oracle

#### Fixed

* Empty Create Statement
* Return date difference with timestamp as parameter with decimals (time part difference).

## Version 1.1.54 (Oct 3, 2024)

### Common

* Improved the auto-update mechanism.

### Teradata

#### Fixed

* UDF called “PERIOD_TO_TIME_UDF” is now included as part of the code output if it is used in the converted code.
* UDF called “DATE_TO_PERIOD_UDF” is now included as part of the code output if it is used in the converted code.

### SQL Server

#### Fixed

* The CLUSTERED clause is no longer in the output code.

### Oracle

#### Fixed

* PARTITION clause in queries is now identified as an EWI instead of FDM.

## Version 1.1.52 (Sep 24, 2024)

### Common

* Adding an informative message when there is no communication to the licensing API and a link with more information of what is happening.
* A new column named “Lines of Code” was added on the report, specifically the “2.1 Conversion Rates Summary” table

### Teradata

#### Fixed

* Cast to CHAR/CHARACTER causing parsing error

### SQL Server

#### Fixed

* Empty STAT EWI when there is an extra ‘;’.
* Continue statement is not marked as an EWI any more.

### Oracle

#### Fixed

* `DATE_TO_RR_FORMAT_UDF` is now included on the output if there is a reference to it on the input source code.

## Version 1.1.45 (Sep 12, 2024)

### Common

Fix Encoding issue SSC-EWI-0041

#### Teradata

Added

* New conversion setting for TERA MODE strings comparison transformation

Fixed

* Anonymous block of code converted to a stored procedure.
* PRIMARY TIME INDEX not being parsed.

#### SQL Server

Fixed

* Empty stat should not be classified as pending functional
* SQL report has a text referring to Teradata

#### Oracle

Added

* Oracle function conversion to Functions (single statement)

Fixed

* DATE_TO_RR_FORMAT_UDF is added in the view conversion but is not part of the SC output

## Version 1.1.38 (Aug 29, 2024)

### Common

* Improved the performance for running SnowConvert.

#### Teradata

* Added translation for EXTRACT function.
* Fix translation in procedure when there is a presence of IMMUTABLE/VOLATILE.
* Improved translation of EXTRACT_TIMESTAMP_DIFFERENCE_UDF to support timestamp as parameter.

#### SQL Server

* Improved error handling when translating long-named columns.

#### Oracle

* Added translation for STANDARD_HASH function.
* Improved the parser to be able to read DBMS_DATAPUMP.detach.

## Version 1.1.33 (Aug 9, 2024)

### Common

* Fixed numerous SSC-EWI-0013 occurrences.
* Improved UI experience when user does not have read/write permissions on a particular local directory.

#### Teradata

* Added translation for `PREPARE STATEMENT`, `ACTIVITY_COUNT`, `DAY_OF_MONTH`, `DAY_OF_WEEK`, `WEEK_OF_CALENDAR`, `MONTH_OF_CALENDAR`.
* Added translation for `CREATE SCHEMA`.
* Fixed `INTERVAL` literal not converted in minus operations.
* Improved parser capability to read `LATEST` as a column name.

#### Oracle

* Improved translation on PL/SQL parameter data types: VARCHAR and INTEGER.
* Fixed duplicated comments in PL/SQL procedure declarations.

## Version 1.1.26 (Jul 28, 2024)

### Oracle

* Add parsing of `ACCESS PARAMETERS` table options.
* Add parsing of `XMLType` table.
* Added translation for `FUNCTION` definition within anonymous blocks.
* Fixed duplicated code SSC-FDM-OR0045.
* Improve parsing of `XMLSchema` specification.

#### SQLServer

* Fixed `EXECUTE AS` statement wrongly transformed to `EXECUTE IMMEDIATE`.
* Fixed temporary table generated erroneously.
* Improve parsing of `WITH xmlnamespaces` statement.

## Version 1.1.16 (Jun 26, 2024)

### Teradata

* Fixed translation of `LIKE NOT CASESPECIFIC`.
* Improved translation of variable declarations inside `BEGIN…END`.
* Improved parsing of `AS OF` clause and `WITH TIE`S option from `CREATE VIEW`.

#### Oracle

* Fixed translation for columns with whitespaces in `CREATE VIEW`.
* Improved description of `SSC-EWI-OR0042`.
* Improved parsing of `ACCESSIBLE BY` clause and `SQL_MACRO` option from `CREATE FUNCTION`.
* Improved parsing of the `DECLARE` statement.

#### SQLServer

* Fixed translation of `BEGIN…END` showing pending functional equivalence.
* Added translation for `FOR XML PATH` clause.

## Version 1.1.9 (Jun 12, 2024)

### Common

* Added more info in the COMMENT clause of each object.

#### Teradata

* Added an EWI 0073 to `PREPARE` statement.
* Added `OR REPLACE` to `CREATE TABLE`

#### Oracle

* Added translation for Materialized View’s `REFRESH_MODE` property.
* Improved parsing capability to read MODEL clause and to read CREATE VIEW alternate routes.

## Version 1.1.8 (May 31, 2024)

### Common

* Added translation of Materialized View to Dynamic Tables.
* Improved CodeUnit Report to show more code units.

#### SQLServer

* Added translation of SET ANSI_NULLS.
* Added translation of INSERT that contains a FROM Subquery + MERGE INTO pattern.

## Version 1.1.6 (May 21, 2024)

### Teradata

* Fixed translation for `Cast('POINT(x t)' As ST_GEOMETRY`
* Fixed translation of casting from one format to another.
* Fixed translation related to `DATEADD_UDF` and `TO_INTERVAL_UDF`

#### Oracle

* Improved parsing capability to read `JSON_OBJECT` and `JSON_ARRAYAGG` built-in functions.

#### SQLServer

* Improved Missing Object References report’s content.
* Improved robustness during the semantic analysis phase and translation phase.

## Version 1.1.5 (May 10, 2024)

### Common

* Provide more information and details for SSC-EWI-0001
* Improved robustness of assessment mode when providing free tables.

#### Teradata

* Improved translation related to date handling.
* Improved parsing capability to read code that contains block comments.
* Improved parsing capability to read NOT NULL column option before the data type declaration in a table.
* Improved the functionality of TIMESTAMP_DIFFERENCE_UDF and EXTRACT_TIMESTAMP_DIFFERENCE_UDF.

#### SQL Server

* Improved translation for ALTER TABLE CHECK constraint.

## Version 1.1.4 (May 2, 2024)

### Common

* Added a new assessment report EmbeddedCodeUnitReport, for more information, please visit [here](../../getting-started/running-snowconvert/review-results/reports/embedded-code-units-report.md).
* Improved the TopLevelCodeUnitReport. Added four more columns: FDM Count, PRF Count, FDM and PRF. For more information, please visit [here](../../getting-started/running-snowconvert/review-results/reports/embedded-code-units-report.md).
* Fixed an unexpected error in creating an assessment report.

#### Teradata

* Added translation for CONTINUE HANDLER.
* Added new parsing capability for BYTE data type.
* Improved binding variable translations.

#### Oracle

* Added and improved parsing capability to read EXPLAIN PLAN statement, U-Literals and CTAS.
* Improve CURSOR translation when it has to define a cursor with object_construct.
* Improved translation of procedure parameters avoiding deployment errors.

#### SQLServer

* Added translation for DB_ID function.
* Added basic translation for CREATE SCHEMA.
* Added an FDM for CREATE INDEX.
* Improved ALTER TABLE translation.

---
title: SnowConvert AI - Redshift
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/redshift.md
section: Migrations
---

# SnowConvert AI - Redshift

The first step in migration is getting the code you need to migrate. There are many ways to extract the code from your database, but we recommend using the extraction scripts provided by Snowflake.

All the source code for these scripts is open source and is available on [GitHub](https://github.com/Snowflake-Labs/SC.DDLExportScripts/).

## Prerequisites

* Access to a Redshift cluster with a Redshift database.
* Access to the database preferably with a super user or database owner.
* Installation of AWS CLI.
* Access to the AWS Portal.

## Installing the scripts

Go to <https://github.com/Snowflake-Labs/SC.DDLExportScripts/>.

From the Code option, select the drop-down and use the **Download ZIP** option to download the code.

Decompress the ZIP file. The code for Teradata should be under the Teradata folder.

Follow the [Usage instructions](https://github.com/Snowflake-Labs/SC.DDLExportScripts/blob/main/Redshift/README.md) to modify the files and run them on your system.

## Package the results

When the script is done, the output folder will contain all the DDLs for the migration. You can then compress this folder to use it with [SnowConvert AI](../../../overview.md).

E.g. run:

Copy

```none
zip -r output.zip ./output
```

---
title: SnowConvert AI - Redshift
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/redshift.md
section: Migrations
---

# SnowConvert AI - Redshift

## What is SnowConvert AI for Redshift?

SnowConvert AI is a software tool that understands SQL Redshift scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for Redshift performs the following conversions:

### Redshift to Snowflake SQL

SnowConvert AI recognizes the Redshift source code and converts the different statements into the appropriate SQL for the Snowflake target.

### Sample code

#### Input Code

```sql
CREATE TABLE table1 (
    col1 INTEGER GENERATED BY DEFAULT AS IDENTITY(1,1)
);
```

#### Output Code

```sql
CREATE TABLE table1 (
    col1 INTEGER IDENTITY(1,1) ORDER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

As you can see, most of the structure remains the same, but some column properties have to be transformed to Snowflake equivalents. For more information please refer to [Redshift Translation References documentation](../../../../translation-references/redshift/README.md).

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your Redshift files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

In the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Redshift is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Redshift
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/redshift.md
section: Migrations
---

# SnowConvert AI - Redshift

## Specific CLI arguments

The following CLI arguments are specific for executing migrations with **SnowConvert AI for Redshift**

### `--RenamingFile`

The path to a .json file that specifies new names for certain objects such as Tables, Views, Procedures, Functions, and Macros. This parameter can’t be used with the `customSchema` argument. Navigate to the [Renaming Feature](renaming-feature.md) to learn more about this argument.

---
title: SnowConvert AI - Redshift
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/README.md
section: Migrations
---

# SnowConvert AI - Redshift

Translation specification for Redshift grammar syntax

This page provides a comprehensive reference for how SnowConvert AI translates [Redshift grammar elements](https://docs.aws.amazon.com/redshift/latest/dg/cm_chap_SQLCommandRef.html) to Snowflake equivalents. In this translation reference, you will find, code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Redshift - Basic elements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-basic-elements.md
section: Migrations
---

# SnowConvert AI - Redshift - Basic elements

## Names and identifiers

Names and identifiers translation for Redshift

### Description

> Names identify database objects, including tables and columns, as well as users and passwords. The terms *name* and *identifier* can be used interchangeably. There are two types of identifiers, standard identifiers and quoted or delimited identifiers. Identifiers must consist of only UTF-8 printable characters. ASCII letters in standard and delimited identifiers are case-insensitive and are folded to lowercase in the database. ([Redshift SQL Language reference Names and identifiers](https://docs.aws.amazon.com/redshift/latest/dg/r_names.html)).

### Standard identifiers

Standard SQL identifiers adhere to a set of rules and must:

* Begin with an ASCII single-byte alphabetic character or underscore character, or a UTF-8 multibyte character two to four bytes long.
* Subsequent characters can be ASCII single-byte alphanumeric characters, underscores, or dollar signs, or UTF-8 multibyte characters two to four bytes long.
* Be between 1 and 127 bytes in length, not including quotation marks for delimited identifiers.
* Contain no quotation marks and no spaces.
* Not be a reserved SQL keyword. ([Redshift SQL Language reference Standard identifiers](https://docs.aws.amazon.com/redshift/latest/dg/r_names.html#r_names-standard-identifiers))

> **Note:**
>
> This syntax is fully supported by Snowflake.

### Special characters identifiers

In Redshift, there is support for using some special characters as part of the name of the identifier. These could be used in any part of an identifier. For this reason, to emulate this behavior, replace these unsupported special characters with a new value valid in Snowflake.

* The **#** character is replaced by a **_H_**.

> **Note:**
>
> In Redshift, if you specify a table name that begins with **‘# ‘**, the table is created as a temporary table.

#### Sample Source Patterns

##### Input Code:

##### Redshift

```sql
 CREATE TABLE #TABLE_NAME
(
    COL#1 int,
    "col2#" int
);

INSERT INTO #TABLE_NAME(COL#1, "col2#") VALUES (1,20),(2,21),(3,22);

SELECT col#1, "col2#" as col# FROM #TABLE_NAME;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TEMP TABLE _H_TABLE_NAME
(
	COL_H_1 int,
	"col2#" int
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/04/2025",  "domain": "test" }}';

INSERT INTO _H_TABLE_NAME (COL_H_1, "col2#") VALUES (1,20),(2,21),(3,22);

SELECT
	col_H_1,
	"col2#" as col_H_
FROM
	_H_TABLE_NAME;
```

### Delimited identifiers

> Delimited identifiers (**also known as quoted identifiers**) begin and end with double quotation marks (“). If you use a delimited identifier, you must use the double quotation marks for every reference to that object. The identifier can contain any standard UTF-8 printable characters other than the double quotation mark itself. Therefore, you can create column or table names that include otherwise illegal characters, such as spaces or the percent symbol. ([Redshift SQL Language reference Delimited identifiers](https://docs.aws.amazon.com/redshift/latest/dg/r_names.html#r_names-delimited-identifiers)).

In Redshift, identifiers can be enclosed in quotes and are [not case-sensitive by default](https://docs.aws.amazon.com/redshift/latest/dg/r_enable_case_sensitive_identifier.html). However, in Snowflake, they are [case-sensitive by default](https://docs.aws.amazon.com/redshift/latest/dg/r_enable_case_sensitive_identifier.html). For this reason, to emulate this behavior, we are removing the quotes from all identifiers that are **enclosed in quotes, are not reserved keywords in Snowflake, and contain alphanumeric characters**. **Reserved** **keywords** in Snowflake will always be enclosed in double quotes and defined in lowercase.

> **Warning:**
>
> This change could impact the desired behavior if the [`enable_case_sensitive_identifier`](https://docs.aws.amazon.com/redshift/latest/dg/r_enable_case_sensitive_identifier.html) flag is set to true in your configuration. Future updates will allow users to define the desired transformation for these identifiers.

#### Sample Source Patterns

For this scenario, please keep in mind that “LATERAL” and “INCREMENT” are reserved words in Snowflake, while “LOCAL” is not a reserved word.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE lateral
(
    INCREMENT int,
    "local" int
);

INSERT INTO lateral(INCREMENT, "local") VALUES (1,20),(2,21),(3,22);

SELECT lateral.INCREMENT, "local" FROM LATERAL;
```

##### Result

| increment | local |
| --- | --- |
| 1 | 20 |
| 2 | 21 |
| 3 | 22 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE "lateral"
(
    "increment" int,
    local int
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "12/10/2024",  "domain": "test" }}';

INSERT INTO "lateral" ("increment", local) VALUES (1,20),(2,21),(3,22);

SELECT
    "lateral"."increment",
    local
FROM
    "lateral";
```

##### Result

| increment | LOCAL |
| --- | --- |
| 1 | 20 |
| 2 | 21 |
| 3 | 22 |

### Quoted identifiers in Functions

In Redshift, function names can be enclosed in quotes and are [not case-sensitive by default](https://docs.aws.amazon.com/redshift/latest/dg/r_enable_case_sensitive_identifier.html). However, in Snowflake, functions may cause issues if they are in quotes and written in lowercase. For this reason, in Snowflake, any function name enclosed in quotes will always be transformed to uppercase and the quotation marks will be removed.

#### Sample Source Patterns

##### Input Code:

##### Redshift

```sql
 SELECT "getdate"();
```

##### Result

| “GETDATE”() |
| --- |
| 2024-11-21 22:08:53.000000 |

##### Output Code:

##### Snowflake

```sql
 SELECT GETDATE();
```

##### Result

| “GETDATE”() |
| --- |
| 2024-11-21 22:08:53.000 +0000 |

#### Recommendations

> To work around this limitation, Snowflake provides the [QUOTED_IDENTIFIERS_IGNORE_CASE](https://docs.snowflake.com/en/sql-reference/parameters.html#label-quoted-identifiers-ignore-case) session parameter, which causes Snowflake to treat lowercase letters in double-quoted identifiers as uppercase when creating and finding objects.
>
> ([Snowflake SQL Language Reference Identifier requirements](https://docs.snowflake.com/en/sql-reference/identifiers-syntax#migrating-from-databases-that-treat-double-quoted-identifiers-as-case-insensitive)).

## Reserved Keywords

Reserved keywords translation for Redshift

### Description

In Redshift you can use some of the [Snowflake reserved keywords](https://docs.snowflake.com/en/sql-reference/reserved-keywords) as column names, table names, etc. For this reason, it is necessary that these words are enclosed in double quotes in order to be able to use them.

> **Note:**
>
> Please be aware that in Snowflake when these names are enclosed in double quotes, they are **case-sensitive**. For this reason It is important to emphasize that when a reserved keyword is used in Snowflake it is always transformed with double quotes and in lowercase. For more information please refer to [Snowflake identifiers documentation.](https://docs.snowflake.com/en/sql-reference/identifiers-syntax#label-delimited-identifier)

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE alter
(
    alter INT
);

CREATE TABLE CONNECT
(
    CONNECT INT
);

DROP TABLE alter;
DROP TABLE CONNECT;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE "alter"
(
    "alter" INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';

CREATE TABLE "connect"
(
    "connect" INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';

DROP TABLE "alter";
DROP TABLE "connect";
```

### Related EWIs

No related EWIs.

### Known Issues

No issues were found.

---
title: SnowConvert AI - Redshift - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-functions.md
section: Migrations
---

# SnowConvert AI - Redshift - Built-in functions

> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Aggregate Functions

> Aggregate functions compute a single result value from a set of input values. ([Redshift SQL Language Reference Aggregate Functions](https://docs.aws.amazon.com/redshift/latest/dg/c_Aggregate_Functions.html)).

| Redshift | Snowflake |
| --- | --- |
| [ANY_VALUE](https://docs.snowflake.com/en/sql-reference/functions/any_value) ( [ DISTINCT | ALL ] expression ) |
| [AVG](https://docs.aws.amazon.com/redshift/latest/dg/r_AVG.html) ( [ DISTINCT | ALL ] *expression* ) | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) ( [ DISTINCT ] expression)    *Notes: Redshift and Snowflake may show different precision/decimals due to data type rounding/formatting.* |
| [COUNT](https://docs.aws.amazon.com/redshift/latest/dg/r_COUNT.html) | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| [LISTAGG](https://docs.aws.amazon.com/redshift/latest/dg/r_LISTAGG.html) | [LISTAGG](https://docs.snowflake.com/en/sql-reference/functions/listagg)    *Notes: Redshift’s DISTINCT ignores trailing spaces (‘a ‘ = ‘a’); Snowflake’s does not. (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [MAX](https://docs.aws.amazon.com/redshift/latest/dg/r_MAX.html) | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) |
| [MEDIAN](https://docs.aws.amazon.com/redshift/latest/dg/r_MEDIAN.html) | [MEDIAN](https://docs.snowflake.com/en/sql-reference/functions/median)    *Notes**: Snowflake does not allow the use of date types**, while Redshift does. (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [MIN](https://docs.aws.amazon.com/redshift/latest/dg/r_MIN.html) | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) |
| [PERCENTILE_CONT](https://docs.aws.amazon.com/redshift/latest/dg/r_PERCENTILE_CONT.html) | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont) |
| [STDDEV/STDDEV_SAMP](https://docs.aws.amazon.com/redshift/latest/dg/r_STDDEV_functions.html) ( [ DISTINCT | ALL ] *expression*)    [STDDEV_POP](https://docs.aws.amazon.com/redshift/latest/dg/r_STDDEV_functions.html) ( [ DISTINCT |
| [SUM](https://docs.aws.amazon.com/redshift/latest/dg/r_SUM.html) | [SUM](https://docs.snowflake.com/en/sql-reference/functions/sum) |
| [VARIANCE/VAR_SAMP](https://docs.aws.amazon.com/redshift/latest/dg/r_VARIANCE_functions.html) ( [ DISTINCT | ALL ] *expression*)    [VAR_POP](https://docs.aws.amazon.com/redshift/latest/dg/r_VARIANCE_functions.html) ( [ DISTINCT |

## Array Functions

> Creates an array of the SUPER data type. ([Redshift SQL Language Reference Array Functions](https://docs.aws.amazon.com/redshift/latest/dg/c_Array_Functions.html)).

| Redshift | Snowflake |
| --- | --- |
| [ARRAY](https://docs.aws.amazon.com/redshift/latest/dg/r_array.html) ( [ expr1 ] [ , expr2 [ , … ] ] ) | [ARRAY_CONSTRUCT](https://docs.snowflake.com/en/sql-reference/functions/array_construct)  ( [ <expr1> ] [ , <expr2> [ , … ] ] ) |
| [ARRAY_CONCAT](https://docs.aws.amazon.com/redshift/latest/dg/r_array_concat.html) ( super_expr1, super_expr2 ) | [ARRAY_CAT](https://docs.snowflake.com/en/sql-reference/functions/array_cat) ( <array1> , <array2> ) |
| [ARRAY_FLATTEN](https://docs.aws.amazon.com/redshift/latest/dg/array_flatten.html)  ( *super_expr1*,*super_expr2*,.. ) | [ARRAY_FLATTEN](https://docs.snowflake.com/en/sql-reference/functions/array_flatten) ( <array> )    *Notes: the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [GET_ARRAY_LENGTH](https://docs.aws.amazon.com/redshift/latest/dg/get_array_length.html) ( *super_expr* ) | [ARRAY_SIZE](https://docs.snowflake.com/en/sql-reference/functions/array_size) ( <array> | <variant>) |
| [SPLIT_TO_ARRAY](https://docs.aws.amazon.com/redshift/latest/dg/split_to_array.html) ( *string*,*delimiter* ) | [SPLIT](https://docs.snowflake.com/en/sql-reference/functions/split) (<string>, <separator>)    *Notes: Redshift allows missing delimiters; Snowflake requires them, defaulting to comma* |
| [SUBARRAY](https://docs.aws.amazon.com/redshift/latest/dg/r_subarray.html) ( *super_expr*, *start_position*, *length* ) | [ARRAY_SLICE](https://docs.snowflake.com/en/sql-reference/functions/array_slice) ( <array> , <from> , <to> )    *Notes: Function names and the second argument differ; adjust arguments for equivalence.* |

## Conditional expressions

| Redshift | Snowflake |
| --- | --- |
| [DECODE](https://docs.aws.amazon.com/redshift/latest/dg/r_DECODE_expression.html) | [DECODE](https://docs.snowflake.com/en/sql-reference/functions/decode)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [COALESCE](https://docs.aws.amazon.com/redshift/latest/dg/r_NVL_function.html) ( *expression*, *expression*, … ) | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce) ( *expression*, *expression*, … ) |
| [GREATEST](https://docs.aws.amazon.com/redshift/latest/dg/r_GREATEST_LEAST.html) ( value [, …] ) | [GREATEST_IGNORE_NULLS](https://docs.snowflake.com/en/sql-reference/functions/greatest_ignore_nulls) ( <expr1> [, <expr2> … ] ) |
| [LEAST](https://docs.aws.amazon.com/redshift/latest/dg/r_GREATEST_LEAST.html) ( value [, …] ) | [LEAST_IGNORE_NULLS](https://docs.snowflake.com/en/sql-reference/functions/least_ignore_nulls) ( <expr1> [, <expr2> … ]) |
| [NVL](https://docs.aws.amazon.com/redshift/latest/dg/r_NVL_function.html)( *expression*, *expression*, … ) | [*NVL*](https://docs.snowflake.com/en/sql-reference/functions/nvl) *( expression, expression )*    *Notes: Redshift’s NVL accepts multiple arguments; Snowflake’s NVL accepts only two. To match Redshift behavior, NVL with more than two arguments is converted to COALESCE.* |
| [NVL2](https://docs.aws.amazon.com/redshift/latest/dg/r_NVL2.html) | [NVL2](https://docs.snowflake.com/en/sql-reference/functions/nvl2) |
| [NULLIF](https://docs.aws.amazon.com/redshift/latest/dg/r_NULLIF_function.html) | [NULLIF](https://docs.snowflake.com/en/sql-reference/functions/nullif)    *Notes: Redshift’s NULLIF ignores trailing spaces in some string comparisons, unlike Snowflake. Therefore, the transformation adds RTRIM for equivalence.* |

## Data type formatting functions

> Data type formatting functions provide an easy way to convert values from one data type to another. For each of these functions, the first argument is always the value to be formatted and the second argument contains the template for the new format. ([Redshift SQL Language Reference Data type formatting functions](https://docs.aws.amazon.com/redshift/latest/dg/r_Data_type_formatting.html)).

| Redshift | Snowflake |
| --- | --- |
| [TO_CHAR](https://docs.aws.amazon.com/redshift/latest/dg/r_TO_CHAR.html) | [TO_CHAR](https://docs.snowflake.com/en/sql-reference/functions/to_char)    *Notes: Snowflake’s support for this function is partial (see* [*SSC-EWI-0006*](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*).* |
| [TO_DATE](https://docs.aws.amazon.com/redshift/latest/dg/r_TO_DATE_function.html) | [TO_DATE](https://docs.snowflake.com/en/sql-reference/functions/to_date)    *Notes: Snowflake’s `TO_DATE` fails on invalid dates like ‘20010631’ (June has 30 days), unlike Redshift’s lenient `TO_DATE`. Use `TRY_TO_DATE` in Snowflake to handle these cases by returning NULL. (see* [*SSC-FDM-RS0004*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshift/ssc-fdm-rs0004.md)*,* [*SSC-EWI-0006*](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*,* [*SSC-FDM-0032*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/general/ssc-fdm-0032.md)*).* |

## Date and time functions

| Redshift | Snowflake |
| --- | --- |
| [ADD_MONTHS](https://docs.aws.amazon.com/redshift/latest/dg/r_ADD_MONTHS.html) | [ADD_MONTHS](https://docs.snowflake.com/en/sql-reference/functions/add_months)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [AT TIME ZONE ‘timezone’](https://docs.aws.amazon.com/redshift/latest/dg/r_AT_TIME_ZONE.html) | [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) ( <source_tz> , <target_tz> , <source_timestamp_ntz> )    [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) ( <target_tz> , <source_timestamp> )    *Notes: Redshift defaults to UTC; the Snowflake function requires explicit UTC specification. Therefore, it will be added as the target timezone.* |
| [CONVERT_TIMEZONE](https://docs.aws.amazon.com/redshift/latest/dg/CONVERT_TIMEZONE.html) | [CONVERT_TIMEZONE](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone) |
| [CURRENT_DATE](https://docs.aws.amazon.com/redshift/latest/dg/r_CURRENT_DATE_function.html) | [CURRENT_DATE()](https://docs.snowflake.com/en/sql-reference/functions/current_date) |
| [DATE](https://docs.aws.amazon.com/redshift/latest/dg/r_TO_DATE_function.html) | [DATE](https://docs.snowflake.com/en/sql-reference/functions/to_date) |
| [DATEADD/DATE_ADD](https://docs.aws.amazon.com/redshift/latest/dg/r_DATEADD_function.html) ( *datepart*, *interval*, {*date* | *time* | *timetz* | *timestamp*} ) | [DATE_ADD](https://docs.snowflake.com/en/sql-reference/functions/dateadd) ( <date_or_time_part>, <value>, <date_or_time_expr> )    *Notes: Invalid date part formats are translated to Snowflake-compatible formats.* |
| [DATEDIFF/DATE_DIFF](https://docs.aws.amazon.com/redshift/latest/dg/r_DATEDIFF_function.html) | [DATEDIFF](https://docs.snowflake.com/en/sql-reference/functions/datediff)    *Notes: Invalid date part formats are translated to Snowflake-compatible formats.* |
| [DATE_PART/PGDATE_PART](https://docs.aws.amazon.com/redshift/latest/dg/r_DATE_PART_function.html) | [DATE_PART](https://docs.snowflake.com/en/sql-reference/functions/date_part)    *Notes: this function is partially supported by Snowflake. (See* [*SSC-EWI-0006*](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*).* |
| [DATE_PART_YEAR](https://docs.aws.amazon.com/redshift/latest/dg/r_DATE_PART_YEAR.html) (*date*) | [YEAR](https://docs.snowflake.com/en/sql-reference/functions/year) ( <date_or_timestamp_expr> )    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [DATE_TRUNC](https://docs.aws.amazon.com/redshift/latest/dg/r_DATE_TRUNC.html) | [DATE_TRUNC](https://docs.snowflake.com/en/sql-reference/functions/date_trunc)    *Notes: Invalid date part formats are translated to Snowflake-compatible formats.* |
| [GETDATE](https://docs.aws.amazon.com/redshift/latest/dg/r_GETDATE.html)() | [GETDATE](https://docs.snowflake.com/en/sql-reference/functions/getdate)() |
| [LAST_DAY](https://docs.aws.amazon.com/redshift/latest/dg/r_LAST_DAY.html) | [LAST_DAY](https://docs.snowflake.com/en/sql-reference/functions/last_day)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [NEXT_DAY](https://docs.aws.amazon.com/redshift/latest/dg/r_NEXT_DAY.html) | [NEXT_DAY](https://docs.snowflake.com/en/sql-reference/functions/next_day)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [SYSDATE](https://docs.aws.amazon.com/redshift/latest/dg/r_SYSDATE.html) | [SYSDATE](https://docs.snowflake.com/en/sql-reference/functions/sysdate)() |
| [TIMESTAMP](https://docs.aws.amazon.com/redshift/latest/dg/r_TO_TIMESTAMP.html) | [TO_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/to_timestamp) |
| [TRUNC](https://docs.aws.amazon.com/redshift/latest/dg/r_TRUNC_date.html) | [TRUNC](https://docs.snowflakhttps/docs.snowflake.com/en/sql-reference/functions/trunc2e.com/en/sql-reference/functions/trunc2) |
| [EXTRACT](https://docs.aws.amazon.com/redshift/latest/dg/r_EXTRACT_function.html) | [EXTRACT](https://docs.snowflake.com/en/sql-reference/functions/extract)  *Notes:* Part-time or Date time supported: DAY, DOW, DOY, EPOCH, HOUR, MINUTE, MONTH, QUARTER, SECOND, WEEK, YEAR. |

> **Note:**
>
> Redshift timestamps default to microsecond precision (6 digits); Snowflake defaults to nanosecond precision (9 digits). Adjust precision as needed using ALTER SESSION (for example, `ALTER SESSION SET TIMESTAMP_OUTPUT_FORMAT = 'YYYY-MM-DD HH24:MI:SS.FF2';`). Precision loss may occur depending on the data type used.
>
> Since some formats are incompatible with Snowflake, adjusting the account parameters [DATE_INPUT_FORMAT or TIME_INPUT_FORMAT](https://docs.snowflake.com/en/sql-reference/date-time-input-output#data-loading) might maintain functional equivalence between platforms.

## Hash Functions

> A hash function is a mathematical function that converts a numerical input value into another value. ([Redshift SQL Language Reference Hash functions](https://docs.aws.amazon.com/redshift/latest/dg/hash-functions.html)).

| Redshift | Snowflake |
| --- | --- |
| [FNV_HASH](https://docs.aws.amazon.com/redshift/latest/dg/r_FNV_HASH.html) (value [, seed]) | [*HASH*](https://docs.snowflake.com/en/sql-reference/functions/hash) *( <expr> [ , <expr> … ]* |

## JSON Functions

| Redshift | Snowflake |
| --- | --- |
| [JSON_EXTRACT_PATH_TEXT](https://docs.aws.amazon.com/redshift/latest/dg/JSON_EXTRACT_PATH_TEXT.html) | [JSON_EXTRACT_PATH_TEXT](https://docs.snowflake.com/en/sql-reference/functions/json_extract_path_text)    *Notes:*   1. *Redshift treats newline, tab, and carriage return characters literally; Snowflake interprets them.* 2. *A JSON literal and dot-separated path are required to access nested objects in the Snowflake function.* 3. *Paths with spaces in variables must be quoted.* |

## Math functions

| Redshift | Snowflake |
| --- | --- |
| [ACOS](https://docs.aws.amazon.com/redshift/latest/dg/r_ACOS.html) | [ACOS](https://docs.snowflake.com/en/sql-reference/functions/acos) |
| [ASIN](https://docs.aws.amazon.com/redshift/latest/dg/r_ASIN.html) | [ASIN](https://docs.snowflake.com/en/sql-reference/functions/asin) |
| [ATAN](https://docs.aws.amazon.com/redshift/latest/dg/r_ATAN.html) | [ATAN](https://docs.snowflake.com/en/sql-reference/functions/atan) |
| [ATAN2](https://docs.aws.amazon.com/redshift/latest/dg/r_ATAN2.html) | [ATAN2](https://docs.snowflake.com/en/sql-reference/functions/atan2) |
| [CBRT](https://docs.aws.amazon.com/redshift/latest/dg/r_CBRT.html) | [CBRT](https://docs.snowflake.com/en/sql-reference/functions/cbrt) |
| [CEIL/CEILING](https://docs.aws.amazon.com/redshift/latest/dg/r_CEILING_FLOOR.html) | [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil) |
| [COS](https://docs.aws.amazon.com/redshift/latest/dg/r_COS.html) | [COS](https://docs.snowflake.com/en/sql-reference/functions/cos) |
| [COT](https://docs.aws.amazon.com/redshift/latest/dg/r_COT.html) | [COT](https://docs.snowflake.com/en/sql-reference/functions/cot) |
| [DEGREES](https://docs.aws.amazon.com/redshift/latest/dg/r_DEGREES.html) | [DEGREES](https://docs.snowflake.com/en/sql-reference/functions/degrees) |
| [DEXP](https://docs.aws.amazon.com/redshift/latest/dg/r_DEXP.html) | [EXP](https://docs.snowflake.com/en/sql-reference/functions/exp) |
| [DLOG1/LN](https://docs.aws.amazon.com/redshift/latest/dg/r_DLOG1.html) | [LN](https://docs.snowflake.com/en/sql-reference/functions/ln) |
| [DLOG10](https://docs.aws.amazon.com/redshift/latest/dg/r_DLOG10.html) (*number*) | [LOG](https://docs.snowflake.com/en/sql-reference/functions/log) (10, *number*) |
| [EXP](https://docs.aws.amazon.com/redshift/latest/dg/r_EXP.html) | [EXP](https://docs.snowflake.com/en/sql-reference/functions/exp) |
| [FLOOR](https://docs.aws.amazon.com/redshift/latest/dg/r_FLOOR.html) | [FLOOR](https://docs.snowflake.com/en/sql-reference/functions/floor) |
| [LOG](https://docs.aws.amazon.com/redshift/latest/dg/r_LOG.html) | [LOG](https://docs.snowflake.com/en/sql-reference/functions/log) |
| [MOD](https://docs.aws.amazon.com/redshift/latest/dg/r_MOD.html) | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod) |
| [PI](https://docs.aws.amazon.com/redshift/latest/dg/r_PI.html) | [PI](https://docs.snowflake.com/en/sql-reference/functions/pi) |
| [POWER/POW](https://docs.aws.amazon.com/redshift/latest/dg/r_POWER.html) | [POWER/POW](https://docs.snowflake.com/en/sql-reference/functions/pow) |
| [RADIANS](https://docs.aws.amazon.com/redshift/latest/dg/r_RADIANS.html) | [RADIANS](https://docs.snowflake.com/en/sql-reference/functions/radians) |
| [RANDOM](https://docs.aws.amazon.com/redshift/latest/dg/r_RANDOM.html) | [RANDOM](https://docs.snowflake.com/en/sql-reference/functions/random) |
| [ROUND](https://docs.aws.amazon.com/redshift/latest/dg/r_ROUND.html) | [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round) |
| [SIN](https://docs.aws.amazon.com/redshift/latest/dg/r_SIN.html) | [SIN](https://docs.snowflake.com/en/sql-reference/functions/sin) |
| [SIGN](https://docs.aws.amazon.com/redshift/latest/dg/r_SIGN.html) | [SIGN](https://docs.snowflake.com/en/sql-reference/functions/sign) |
| [SQRT](https://docs.aws.amazon.com/redshift/latest/dg/r_SQRT.html) | [SQRT](https://docs.snowflake.com/en/sql-reference/functions/sqrt) |
| [TAN](https://docs.aws.amazon.com/redshift/latest/dg/r_TAN.html) | [TAN](https://docs.snowflake.com/en/sql-reference/functions/tan) |
| [TRUNC](https://docs.aws.amazon.com/redshift/latest/dg/r_TRUNC.html) | [TRUNC](https://docs.snowflake.com/en/sql-reference/functions/trunc) |

> **Note:**
>
> Redshift and Snowflake results may differ in scale.

## String functions

> String functions process and manipulate character strings or expressions that evaluate to character strings. ([Redshift SQL Language Reference String functions](https://docs.aws.amazon.com/redshift/latest/dg/String_functions_header.html)).

| Redshift | Snowflake |
| --- | --- |
| [ASCII](https://docs.aws.amazon.com/redshift/latest/dg/r_ASCII.html) | [ASCII](https://docs.snowflake.com/en/sql-reference/functions/ascii) |
| [BTRIM](https://docs.aws.amazon.com/redshift/latest/dg/r_BTRIM.html) | [TRIM](https://docs.snowflake.com/en/sql-reference/functions/trim) |
| [CHAR_LENGTH](https://docs.aws.amazon.com/redshift/latest/dg/r_CHAR_LENGTH.html) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [CHARACTER_LENGTH](https://docs.aws.amazon.com/redshift/latest/dg/r_CHARACTER_LENGTH.html) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [CHARINDEX](https://docs.aws.amazon.com/redshift/latest/dg/r_CHARINDEX.html) | [CHARINDEX](https://docs.snowflake.com/en/sql-reference/functions/charindex) |
| [CHR](https://docs.aws.amazon.com/redshift/latest/dg/r_CHR.html) | [CHR](https://docs.snowflake.com/en/sql-reference/functions/chr) |
| [CONCAT](https://docs.aws.amazon.com/redshift/latest/dg/r_CONCAT.html) | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) |
| [INITCAP](https://docs.aws.amazon.com/redshift/latest/dg/r_INITCAP.html) | [INITCAP](https://docs.snowflake.com/en/sql-reference/functions/initcap) |
| [LEFT/RIGHT](https://docs.snowflake.com/en/sql-reference/functions/initcap) | [LEFT](https://docs.snowflake.com/en/sql-reference/functions/left)/[RIGHT](https://docs.snowflake.com/en/sql-reference/functions/right)    *Notes: For negative lengths in `LEFT`/`RIGHT`, Snowflake returns an empty string; Redshift raises an error.* |
| [LEN](https://docs.aws.amazon.com/redshift/latest/dg/r_LEN.html) | [LEN](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [LOWER](https://docs.aws.amazon.com/redshift/latest/dg/r_LOWER.html) | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower) |
| [OCTET_LENGTH](https://docs.aws.amazon.com/redshift/latest/dg/r_OCTET_LENGTH.html) | [OCTET_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/octet_length)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [QUOTE_IDENT](https://docs.aws.amazon.com/redshift/latest/dg/r_QUOTE_IDENT.html) (*string*) | [CONCAT](https://docs.snowflake.com/en/sql-reference/functions/concat) (‘”’, *string,* ‘”’) |
| [REGEXP_REPLACE](https://docs.aws.amazon.com/redshift/latest/dg/REGEXP_REPLACE.html) | [REGEXP_REPLACE](https://docs.snowflake.com/en/sql-reference/functions/regexp_replace)    *Notes: This function includes a `parameters` argument that enables the user to interpret the pattern using the Perl Compatible Regular Expression (PCRE) dialect, represented by the `p` value, this is removed to avoid any issues*. *(See* [*SSC-EWI-0009*](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)*,* [*SC-FDM-0032*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/general/ssc-fdm-0032.md)*,* [*SSC-FDM-PG0011*](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0011.md)*).* |
| [REPEAT](https://docs.aws.amazon.com/redshift/latest/dg/r_REPEAT.html) | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat) |
| [REPLACE](https://docs.aws.amazon.com/redshift/latest/dg/r_REPLACE.html) | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| [REPLICATE](https://docs.aws.amazon.com/redshift/latest/dg/r_REPLICATE.html) | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat) |
| [REVERSE](https://docs.aws.amazon.com/redshift/latest/dg/r_REVERSE.html) | [REVERSE](https://docs.snowflake.com/en/sql-reference/functions/reverse) |
| [SOUNDEX](https://docs.aws.amazon.com/redshift/latest/dg/SOUNDEX.html) | [SOUNDEX](https://docs.snowflake.com/en/sql-reference/functions/soundex)    *Notes: Certain special characters, the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [SPLIT_PART](https://docs.aws.amazon.com/redshift/latest/dg/SPLIT_PART.html) | [SPLIT_PART](https://docs.snowflake.com/en/sql-reference/functions/split_part)    *Notes: Snowflake and Redshift handle SPLIT_PART differently with case-insensitive collations.* |
| [STRPOS](https://docs.aws.amazon.com/redshift/latest/dg/r_STRPOS.html) (*string*, *substring* ) | [POSITION](https://docs.snowflake.com/en/sql-reference/functions/position) ( <expr1> IN <expr> ) |
| [SUBSTRING](https://docs.aws.amazon.com/redshift/latest/dg/r_SUBSTRING.html) | [*SUBSTRING*](https://docs.snowflake.com/en/sql-reference/functions/substr)    *Notes:* Snowflake partially supports this function. Redshift’s `SUBSTRING`, with a non-positive `start_position`, calculates `start_position + number_characters` (returning ‘’ if the result is non-positive). Snowflake’s behavior differs. (See [SSC-EWI-RS0006](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshift/ssc-ewi-rs0006.md)). |
| [TEXTLEN](https://docs.aws.amazon.com/redshift/latest/dg/r_TEXTLEN.html) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) |
| [TRANSLATE](https://docs.aws.amazon.com/redshift/latest/dg/r_TRANSLATE.html) | [TRANSLATE](https://docs.snowflake.com/en/sql-reference/functions/translate) |
| [TRIM](https://docs.aws.amazon.com/redshift/latest/dg/r_TRIM.html) | [*TRIM*](https://docs.snowflake.com/en/sql-reference/functions/trim)    *Notes: Redshift uses keywords (BOTH, LEADING, TRAILING) for trim; Snowflake uses TRIM, LTRIM, RTRIM.* |
| [UPPER](https://docs.aws.amazon.com/redshift/latest/dg/r_UPPER.html) | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) |

## SUPER type information functions

| Redshift | Snowflake |
| --- | --- |
| [IS_ARRAY](https://docs.aws.amazon.com/redshift/latest/dg/r_is_array.html) | [IS_ARRAY](https://docs.snowflake.com/en/sql-reference/functions/is_array)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [IS_BOOLEAN](https://docs.aws.amazon.com/redshift/latest/dg/r_is_boolean.html) | [IS_BOOLEAN](https://docs.snowflake.com/en/sql-reference/functions/is_boolean)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |

## Window functions

| Redshift | Snowflake |
| --- | --- |
| [AVG](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_AVG.html) | [*AVG*](https://docs.snowflake.com/en/sql-reference/functions/avg)    *Notes: AVG rounding/formatting can vary by data type between Redshift and Snowflake.* |
| [COUNT](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_COUNT.html) | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) |
| [DENSE_RANK](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_DENSE_RANK.html) | [DENSE_RANK](https://docs.snowflake.com/en/sql-reference/functions/dense_rank)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [FIRST_VALUE](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_first_value.html) | [FIRST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/first_value)    *Notes: Snowflake needs ORDER BY; missing clauses get `ORDER BY <expr>.`* |
| [LAG](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_LAG.html) | [LAG](https://docs.snowflake.com/en/sql-reference/functions/lag) |
| [LAST_VALUE](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_last_value.html) | [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value)    *Notes: Snowflake needs ORDER BY; missing clauses get `ORDER BY <expr>`.* |
| [LEAD](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_LEAD.html) | [LEAD](https://docs.snowflake.com/en/sql-reference/functions/lead)    *Notes: Redshift allows constant or expression offsets; Snowflake allows only constant offset*s. |
| [LISTAGG](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_LISTAGG.html) | [LISTAGG](https://docs.snowflake.com/en/sql-reference/functions/listagg)    *Notes: Redshift’s DISTINCT ignores trailing spaces (‘a ‘ = ‘a’); Snowflake’s does not. (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [MEDIAN](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_MEDIAN.html) | [MEDIAN](https://docs.snowflake.com/en/sql-reference/functions/median)    *Notes**: Snowflake does not allow the use of date types**, while Redshift does. (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [NTH_VALUE](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_NTH.html) | [NTH_VALUE](https://docs.snowflake.com/en/sql-reference/functions/nth_value)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [NTILE](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_NTILE.html) | [NTILE](https://docs.snowflake.com/en/sql-reference/functions/ntile)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`. (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [PERCENT_RANK](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_PERCENT_RANK.html) | [PERCENT_RANK](https://docs.snowflake.com/en/sql-reference/functions/percent_rank)    *Notes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [PERCENTILE_CONT](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_PERCENTILE_CONT.html) | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont)    *Notes: Rounding varies between platforms.* |
| [PERCENTILE_DISC](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_PERCENTILE_DISC.html) | [PERCENTILE_DISC](https://docs.snowflake.com/en/sql-reference/functions/percentile_disc) |
| [RANK](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_RANK.html) | [RANK](https://docs.snowflake.com/en/sql-reference/functions/rank) |
| [RATIO_TO_REPORT](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_RATIO_TO_REPORT.html) | [RATIO_TO_REPORT](https://docs.snowflake.com/en/sql-reference/functions/ratio_to_report)    *Notes:* *the results may vary between platforms (See* [SSC-FDM-PG0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresql/ssc-fdm-pg0013.md)*).* |
| [ROW_NUMBER](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_ROW_NUMBER.html) | [ROW_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/row_number)    N*otes: ORDER BY is mandatory in Snowflake; missing clauses are replaced with `ORDER BY 1`.* |
| [STDDEV_SAMP](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_STDDEV.html) | STDDEV |
| [VAR_SAMP](https://docs.aws.amazon.com/redshift/latest/dg/r_WF_VARIANCE.html) | VARIANCE |

## Known Issues

1. For more information, see [Quoted identifiers in functions](redshift-basic-elements.md).

## Related EWIs

* [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Date or time format is not supported in Snowflake.
* [SSC-FDM-0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Parameter is not a literal value, transformation could not be fully applied
* [SSC-FDM-RS0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Invalid dates will cause errors in Snowflake.
* [SSC-FDM-PG0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Function syntactically supported by Snowflake but may have functional differences.

## IDENTITY

### Description

The IDENTITY function is a system function that operates on a specified column of a table to determine the initial value for the identity. If the initial value is not available, it defaults to the value provided in the function. This will be translation to a Sequence in Snowflake.

### Grammar Syntax

```sql
 "identity"(oid_id, oid_table_id, default)
```

> **Note:**
>
> This function is no longer supported in Redshift. It uses the default value to define the identity and behaves like a standard identity column.

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE IF NOT EXISTS table_test
(
    id integer,
    inventory_combo BIGINT  DEFAULT "identity"(850178, 0, '5,3'::text)
);

INSERT INTO table_test (id) VALUES
    (1),
    (2),
    (3),
    (4);

SELECT * FROM table_test;
```

##### Results

| id | inventory_combo |
| --- | --- |
| 1 | 5 |
| 2 | 8 |
| 3 | 11 |
| 3 | 14 |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS table_test
(
    id integer,
    inventory_combo BIGINT IDENTITY(5,3) ORDER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/13/2024",  "domain": "test" }}';

INSERT INTO table_test (id) VALUES
    (1),
    (2),
    (3),
    (4);

SELECT * FROM
    table_test;
```

##### Results

| id | inventory_combo |
| --- | --- |
| 1 | 5 |
| 2 | 8 |
| 3 | 11 |
| 3 | 14 |

### Related EWIs

There are no known issues.

## TO_CHAR

Date function

## Description

> TO_CHAR converts a timestamp or numeric expression to a character-string data format. ([Redshift SQL Language Reference TO_CHAR function](https://docs.aws.amazon.com/redshift/latest/dg/r_TO_CHAR.html))

> **Warning:**
>
> This function is partially supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/functions/to_char).

For more information, see [Quoted identifiers in functions](redshift-basic-elements.md).

## Grammar Syntax

```sql
 TO_CHAR(timestamp_expression | numeric_expression , 'format')
```

## Sample Source Patterns

### Input Code:

#### Redshift

```sql
 SELECT TO_CHAR(timestamp '2009-12-31 23:15:59', 'YYYY'),
       TO_CHAR(timestamp '2009-12-31 23:15:59', 'YYY'),
       TO_CHAR(timestamp '2009-12-31 23:15:59', 'TH'),
       "to_char"(timestamp '2009-12-31 23:15:59', 'MON-DY-DD-YYYY HH12:MIPM'),
       TO_CHAR(125.8, '999.99'),
       "to_char"(125.8, '999.99');
```

##### Results

| TO_CHAR | TO_CHAR | TO_CHAR | TO_CHAR | TO_CHAR |
| --- | --- | --- | --- | --- |
| 2009 | 009 | DEC-THU-31-2009 11:15PM | 125.80 | 125.80 |

#### Output Code:

##### Snowflake

```sql
 SELECT
       TO_CHAR(timestamp '2009-12-31 23:15:59', 'YYYY'),
       PUBLIC.YEAR_PART_UDF(timestamp '2009-12-31 23:15:59', 3),
       TO_CHAR(timestamp '2009-12-31 23:15:59', 'TH') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - TH FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!,
       PUBLIC.MONTH_SHORT_UDF(timestamp '2009-12-31 23:15:59', 'uppercase') || '-' || PUBLIC.DAYNAME_SHORT_UDF(timestamp '2009-12-31 23:15:59', 'uppercase') || TO_CHAR(timestamp '2009-12-31 23:15:59', '-DD-YYYY HH12:MI') || PUBLIC.MERIDIAN_INDICATORS_UDF(timestamp '2009-12-31 23:15:59', 'uppercase'),
       TO_CHAR(125.8, '999.99'),
       TO_CHAR(125.8, '999.99');
```

##### Results

| TO_CHAR | TO_CHAR |
| --- | --- |
| 2009 | Dec-Thu-31-2009 11:15PM |

## Known Issues

No issues were found.

## Related EWIs

* [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The current date/numeric format may have a different behavior in Snowflake.

## For datetime values

Translation specification for the TO_CHAR function when transforming date or timestamp values to string

### Description

> The following format strings apply to functions such as TO_CHAR. These strings can contain datetime separators (such as ‘`-`’, ‘`/`’, or ‘`:`’) and the following “dateparts” and “timeparts”. ([Redshift Datetime format strings reference page](https://docs.aws.amazon.com/redshift/latest/dg/r_FORMAT_strings.html))

### Grammar Syntax

```none
TO_CHAR (timestamp_expression, 'format')
```

The following table specifies the mapping of each format element to Snowflake:

| Redshift | Snowflake |
| --- | --- |
| `BC, AD, bc, ad` (upper and lowercase era indicators) | `PUBLIC.ERA_INDICATORS_UDF` |
| `B.C,. A.D., b.c., a.d.` (upper and lowercase era indicators with points) | `PUBLIC.ERA_INDICATORS_WITH_POINTS_UDF` |
| `CC` | `PUBLIC.CENTURY_UDF` |
| `YYYY` and `YY` | Directly supported |
| `YYY` and `Y` | `PUBLIC.YEAR_PART_UDF` |
| `Y,YYY` | `PUBLIC.YEAR_WITH_COMMA_UDF` |
| `IYYY` | `YEAROFWEEKISO` |
| `I, IY, IYY` | `PUBLIC.ISO_YEAR_PART_UDF` |
| `Q` | `QUARTER` |
| `MONTH, Month, month` | `PUBLIC.FULL_MONTH_NAME_UDF` |
| `MON, Mon, mon` | `PUBLIC.MONTH_SHORT_UDF` |
| `RM, rm` | `PUBLIC.ROMAN_NUMERALS_MONTH_UDF` |
| `W` | `PUBLIC.WEEK_OF_MONTH_UDF` |
| `WW` | `PUBLIC.WEEK_NUMBER_UDF` |
| `IW` | `WEEKISO` |
| `DAY, Day, day` | `PUBLIC.DAYNAME_LONG_UDF` |
| `DY, Dy, dy` | `PUBLIC.DAYNAME_SHORT_UDF` |
| `DDD` | `DAYOFYEAR` |
| `IDDD` | `PUBLIC.DAY_OF_YEAR_ISO_UDF` |
| `D` | `PUBLIC.DAY_OF_WEEK_UDF`    *Notes: For this UDF to work correctly the Snowflake session parameter `WEEK_START` should have its default value (`0`).* |
| `ID` | `DAYOFWEEKISO` |
| `J` | `PUBLIC.JULIAN_DAY_UDF` |
| `HH24` | Directly supported |
| `HH` | `HH12` |
| `HH12` | Directly supported |
| `MI` | Directly supported |
| `SS` | Directly supported |
| `MS` | `FF3` |
| `US` | `FF6` |
| `AM, PM, am, pm` (upper and lowercase meridian indicators) | `PUBLIC.MERIDIAN_INDICATORS_UDF` |
| `A.M., P.M., a.m., p.m.` (upper and lowercase meridian indicators with points) | `PUBLIC.MERIDIAN_INDICATORS_WITH_POINTS_UDF` |
| `TZ` and `tz` | `UTC` and `utc`    *Notes: According to the* [*redshift documentation*](https://docs.aws.amazon.com/redshift/latest/dg/r_Datetime_types.html#r_Datetime_types-timestamptz)*, all timestamp with time zone are stored in UTC, which causes this format element to return a fixed result.* |
| `OF` | +00    *Notes: According to the* [*redshift documentation*](https://docs.aws.amazon.com/redshift/latest/dg/r_Datetime_types.html#r_Datetime_types-timestamptz)*, all timestamp with time zone are stored in UTC, which causes this format element to return a fixed result.* |
| `SSSS` | `PUBLIC.SECONDS_PAST_MIDNIGHT` |
| `SP` | *Notes: This is a PostgreSQL template pattern modifier for “spell mode”, however it does nothing on Redshift, so it is removed from the output.* |
| `FX` | *Notes: This is another template pattern modifier for “fixed format”, however it has no use on the TO_CHAR function so it is removed.* |

### Sample Source Patterns

#### Direct format elements transformation (no functions/UDFs)

The result is preserved as a single TO_CHAR function

##### *Redshift*

##### Query

```sql
 SELECT TO_CHAR('2013-10-03 13:50:15.456871'::TIMESTAMP, 'DD/MM/YY HH:MI:SS.MS') AS col1;
```

##### Result

```none
+----------------------+
|col1                  |
+----------------------+
|03/10/13 01:50:15.456 |
+----------------------+
```

##### *Snowflake*

##### Query

```sql
 SELECT TO_CHAR('2013-10-03 13:50:15.456871'::TIMESTAMP, 'DD/MM/YY HH12:MI:SS.FF3') AS col1;
```

##### Result

```none
+----------------------+
|col1                  |
+----------------------+
|03/10/13 01:50:15.456 |
+----------------------+
```

#### Format transformation using functions/UDFs

The result is a concatenation of multiple TO_CHAR, UDFs and Snowflake built-in functions that generate the equivalent string representation of the datetime value

##### *Redshift*

##### Query

```sql
 SELECT TO_CHAR(DATE '2025-07-05', '"Today is " Month DAY DD, "it belongs to the week " IW') AS result;
```

##### Result

```none
+-------------------------------------------------------------+
|result                                                       |
+-------------------------------------------------------------+
|Today is  July      SATURDAY  05, it belongs to the week  27 |
+-------------------------------------------------------------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
    'Today is ' ||
    TO_CHAR(DATE '2025-07-05', ' ') ||
    PUBLIC.FULL_MONTH_NAME_UDF(DATE '2025-07-05', 'firstOnly') ||
    ' ' ||
    PUBLIC.DAYNAME_LONG_UDF(DATE '2025-07-05', 'uppercase') ||
    TO_CHAR(DATE '2025-07-05', ' DD, ') ||
    'it belongs to the week ' ||
    TO_CHAR(DATE '2025-07-05', ' ') ||
    WEEKISO(DATE '2025-07-05') AS result;
```

##### Result

```none
+-------------------------------------------------------------+
|result                                                       |
+-------------------------------------------------------------+
|Today is  July      SATURDAY  05, it belongs to the week  27 |
+-------------------------------------------------------------+
```

#### Quoted text

Format elements in double quoted text are added to the output directly without interpreting them, escaped double quotes are transformed to their Snowflake escaped equivalent.

##### *Redshift*

##### Query

```sql
 SELECT
    TO_CHAR(DATE '2025-01-16', 'MM "TESTING DD" DD') AS result1,
    TO_CHAR(DATE '2025-01-16', 'MM TESTING \\"DD\\" DD') AS result2,
    TO_CHAR(DATE '2025-01-16', 'MM "TESTING \\"DD\\"" DD') AS result3;
```

##### Result

```none
+-----------------+-------------------+-------------------+
|result1          |result2            |result3            |
+-----------------+-------------------+-------------------+
|01 TESTING DD 16 |01 TEST5NG "16" 16 |01 TESTING "DD" 16 |
+-----------------+-------------------+-------------------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
    TO_CHAR(DATE '2025-01-16', 'MM ') || 'TESTING DD' || TO_CHAR(DATE '2025-01-16', ' DD') AS result1,
    TO_CHAR(DATE '2025-01-16', 'MM TEST') || PUBLIC.ISO_YEAR_PART_UDF(DATE '2025-01-16', 1) || TO_CHAR(DATE '2025-01-16', 'NG ""DD"" DD') AS result2,
    TO_CHAR(DATE '2025-01-16', 'MM ') || 'TESTING "DD"' || TO_CHAR(DATE '2025-01-16', ' DD') AS result3;
```

##### Result

```none
+-----------------+-------------------+-------------------+
|result1          |result2            |result3            |
+-----------------+-------------------+-------------------+
|01 TESTING DD 16 |01 TEST5NG "16" 16 |01 TESTING "DD" 16 |
+-----------------+-------------------+-------------------+
```

### Known Issues

#### Template pattern modifiers not supported

The following format template modifiers:

* FM (fill mode)
* TH and th (uppercase and lowercase ordinal number suffix)
* TM (translation mode)

Are not supported, including them in a format will generate [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)

Input code:

```sql
 SELECT TO_CHAR(CURRENT_DATE, 'FMMonth'),
TO_CHAR(CURRENT_DATE, 'DDTH'),
TO_CHAR(CURRENT_DATE, 'DDth'),
TO_CHAR(CURRENT_DATE, 'TMMonth');
```

Output code:

```sql
 SELECT
TO_CHAR(CURRENT_DATE(), 'FM') || PUBLIC.FULL_MONTH_NAME_UDF(CURRENT_DATE(), 'firstOnly') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - FMMonth FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!,
TO_CHAR(CURRENT_DATE(), 'DDTH') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - DDTH FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!,
TO_CHAR(CURRENT_DATE(), 'DDth') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - DDth FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!,
TO_CHAR(CURRENT_DATE(), 'TM') || PUBLIC.FULL_MONTH_NAME_UDF(CURRENT_DATE(), 'firstOnly') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - TMMonth FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!;
```

**Format parameter passed through variable**

When the format parameter is passed as a variable instead of a string literal, the transformation of format elements can not be applied, an FDM will be added to the uses of the function warning about it.

Input code:

```sql
 SELECT TO_CHAR(d, 'YYYY/MM/DD'),
TO_CHAR(d, f)
FROM (SELECT TO_DATE('2001-01-01','YYYY-MM-DD') as d, 'DD/MM/YYYY' as f);
```

Output code:

```sql
 SELECT TO_CHAR(d, 'YYYY/MM/DD'),
--** SSC-FDM-0032 - PARAMETER 'format_string' IS NOT A LITERAL VALUE, TRANSFORMATION COULD NOT BE FULLY APPLIED **
TO_CHAR(d, f)
FROM (SELECT TO_DATE('2001-01-01','YYYY-MM-DD') as d, 'DD/MM/YYYY' as f);
```

### Related EWIs

1. [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The current date/numeric format may have a different behavior in Snowflake.
2. [SSC-FDM-0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Parameter is not a literal value, transformation could not be fully applied

---
title: SnowConvert AI - Redshift - Conditions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-conditions.md
section: Migrations
---

# SnowConvert AI - Redshift - Conditions

## BETWEEN

### Description

> A `BETWEEN` condition tests expressions for inclusion in a range of values, using the keywords `BETWEEN` and `AND`. ([Redshift SQL Language Reference BETWEEN condition](https://docs.aws.amazon.com/redshift/latest/dg/r_range_condition.html))

### Grammar Syntax

```sql
 expression [ NOT ] BETWEEN expression AND expression
```

> **Note:**
>
> This function is fully supported by [Snowflake](https://docs.snowflake.com/en/sql-reference/functions/coalesce).

### Sample Source Patterns

#### Setup Table

##### Redshift

```sql
 CREATE TABLE sales (
    id INTEGER IDENTITY(1,1),
    price FLOAT,
    departmentId INTEGER,
    saleDate DATE
);

INSERT INTO sales (price, departmentId, saleDate) VALUES
(5000, 1, '2008-01-01'),
(8000, 1, '2018-01-01'),
(5000, 2, '2010-01-01'),
(7000, 3, '2010-01-01'),
(5000, 1, '2018-01-01'),
(4000, 4, '2010-01-01'),
(3000, 4, '2018-01-01'),
(9000, 5, '2008-01-01'),
(7000, 5, '2018-01-01'),
(6000, 5, '2006-01-01'),
(5000, 5, '2008-01-01'),
(5000, 4, '2018-01-01'),
(8000, 3, '2006-01-01'),
(7000, 3, '2016-01-01'),
(2000, 2, '2018-01-01');
```

##### Snowflake

```sql
 CREATE TABLE sales (
    id INTEGER IDENTITY(1,1) ORDER,
    price FLOAT,
    departmentId INTEGER,
    saleDate DATE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/08/2025",  "domain": "test" }}';

INSERT INTO sales (price, departmentId, saleDate) VALUES
(5000, 1, '2008-01-01'),
(8000, 1, '2018-01-01'),
(5000, 2, '2010-01-01'),
(7000, 3, '2010-01-01'),
(5000, 1, '2018-01-01'),
(4000, 4, '2010-01-01'),
(3000, 4, '2018-01-01'),
(9000, 5, '2008-01-01'),
(7000, 5, '2018-01-01'),
(6000, 5, '2006-01-01'),
(5000, 5, '2008-01-01'),
(5000, 4, '2018-01-01'),
(8000, 3, '2006-01-01'),
(7000, 3, '2016-01-01'),
(2000, 2, '2018-01-01');
```

##### Input Code:

##### Redshift

```sql
 SELECT COUNT(*) FROM sales
WHERE departmentId BETWEEN 2 AND 4;

SELECT * FROM sales
WHERE departmentId BETWEEN 4 AND 2;

SELECT * FROM sales
WHERE departmentId NOT BETWEEN 4 AND 2;

SELECT * FROM sales
WHERE departmentId BETWEEN 2 AND 4
AND saleDate BETWEEN '2010-01-01' and '2016-01-01';

select 'some ' between c_start and c_end
from( select 'same' as c_start, 'some' as c_end );
```

##### Results

| count |
| --- |
| 8 |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
|  |  |  |  |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
| 1 | 5000 | 1 | 2008-01-01 |
| 2 | 8000 | 1 | 2018-01-01 |
| 3 | 5000 | 2 | 2010-01-01 |
| 4 | 7000 | 3 | 2010-01-01 |
| 5 | 5000 | 1 | 2018-01-01 |
| 6 | 4000 | 4 | 2010-01-01 |
| 7 | 3000 | 4 | 2018-01-01 |
| 8 | 9000 | 5 | 2008-01-01 |
| 9 | 7000 | 5 | 2018-01-01 |
| 10 | 6000 | 5 | 2006-01-01 |
| 11 | 5000 | 5 | 2008-01-01 |
| 12 | 5000 | 4 | 2018-01-01 |
| 13 | 8000 | 3 | 2006-01-01 |
| 14 | 7000 | 3 | 2016-01-01 |
| 15 | 2000 | 2 | 2018-01-01 |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
| 3 | 5000 | 2 | 2010-01-01 |
| 4 | 7000 | 3 | 2010-01-01 |
| 6 | 4000 | 4 | 2010-01-01 |
| 14 | 7000 | 3 | 2016-01-01 |

##### Output Code:

##### Snowflake

```sql
 SELECT COUNT(*) FROM
    sales
WHERE departmentId BETWEEN 2 AND 4;

SELECT * FROM
    sales
WHERE departmentId BETWEEN 4 AND 2;

SELECT * FROM
    sales
WHERE departmentId NOT BETWEEN 4 AND 2;

SELECT * FROM
    sales
WHERE departmentId BETWEEN 2 AND 4
AND saleDate BETWEEN '2010-01-01' and '2016-01-01';

select
    RTRIM( 'some ') between c_start and c_end
from( select 'same' as c_start, 'some' as c_end );
```

##### Results

| count |
| --- |
| 8 |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
|  |  |  |  |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
| 1 | 5000 | 1 | 2008-01-01 |
| 2 | 8000 | 1 | 2018-01-01 |
| 3 | 5000 | 2 | 2010-01-01 |
| 4 | 7000 | 3 | 2010-01-01 |
| 5 | 5000 | 1 | 2018-01-01 |
| 6 | 4000 | 4 | 2010-01-01 |
| 7 | 3000 | 4 | 2018-01-01 |
| 8 | 9000 | 5 | 2008-01-01 |
| 9 | 7000 | 5 | 2018-01-01 |
| 10 | 6000 | 5 | 2006-01-01 |
| 11 | 5000 | 5 | 2008-01-01 |
| 12 | 5000 | 4 | 2018-01-01 |
| 13 | 8000 | 3 | 2006-01-01 |
| 14 | 7000 | 3 | 2016-01-01 |
| 15 | 2000 | 2 | 2018-01-01 |

| id | price | departmentid | saledate |
| --- | --- | --- | --- |
| 3 | 5000 | 2 | 2010-01-01 |
| 4 | 7000 | 3 | 2010-01-01 |
| 6 | 4000 | 4 | 2010-01-01 |
| 14 | 7000 | 3 | 2016-01-01 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Comparison Condition

Conditions

### Description

> Comparison conditions state logical relationships between two values. All comparison conditions are binary operators with a Boolean return type.
>
> ([RedShift SQL Language Reference Comparison Condition](https://docs.aws.amazon.com/redshift/latest/dg/r_comparison_condition.html))

### Grammar Syntax

Redshift supports the comparison operators described in the following table:

| Operator | Syntax | Description |
| --- | --- | --- |
| < | a < b | Value a is less than value b. |
| > | a > b | Value a is greater than value b. |
| <= | a <= b | Value a is less than or equal to value b. |
| >= | a >= b | Value a is greater than or equal to value b. |
| = | a = b | Value a is equal to value b. |
| <> | != | a <> b | a != b | Value a is not equal to value b. |
| ANY | SOME | a = ANY(subquery) | Value a is equal to any value returned by the subquery. |
| ALL | a <> ALL or != ALL (subquery) | Value a is not equal to any value returned by the subquery. |
| IS TRUE | FALSE | UNKNOWN | a IS TRUE | Value a is Boolean TRUE. |

### Use of comparison operators on Strings

It is important to note that in Redshift, comparison operators on strings ignore trailing blank spaces. To replicate this behavior in Snowflake, the transformation applies the `RTRIM` function to remove trailing spaces, ensuring equivalent functionality. For more information: [Significance of trailing blanks](https://docs.aws.amazon.com/redshift/latest/dg/r_Character_types.html#r_Character_types-significance-of-trailing-blanks)

### Conversion Table

Most of the operators are directly supported by Snowflake; however, the following operators require transformation:

| Redshift | Snowflake | Comments |
| --- | --- | --- |
| (expression) IS TRUE | expression | Condition is `TRUE`. |
| (expression) IS FALSE | NOT (expression) | Condition is `FALSE`. |
| (expression) IS UNKNOWN | expression IS NULL | Expression evaluates to `NULL` (same as `UNKNOWN`). |

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE example_data (
    id INT,
    value INT,
    status BOOLEAN,
    category VARCHAR(10)
);

INSERT INTO example_data (id, value, status, category) VALUES
(1, 50, TRUE, 'A'),
(2, 30, FALSE, 'B'),
(3, 40, NULL, 'C'),
(4, 70, TRUE, 'A '),
(5, 60, FALSE, 'B');

SELECT *
FROM example_data
WHERE value < 60 AND value > 40;

SELECT *
FROM example_data
WHERE value <= 60 AND value >= 40;

SELECT *
FROM example_data
WHERE category = 'A';

SELECT *
FROM example_data
WHERE category != 'A' AND category <> 'B';

SELECT *
FROM example_data
WHERE category = ANY(SELECT category FROM example_data WHERE value > 60); --SOME

SELECT *
FROM example_data
WHERE value <> ALL (SELECT value FROM example_data WHERE status = TRUE);

SELECT *
FROM example_data
WHERE status IS TRUE;

SELECT *
FROM example_data
WHERE status IS FALSE;

SELECT *
FROM example_data
WHERE status IS UNKNOWN;
```

##### Results

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 3 | 40 | null | C |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 3 | 40 | null | C |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 2 | 30 | false | B |
| 4 | 40 | null | C |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 2 | 30 | false | B |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 4 | 40 | null | C |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE example_data (
    id INT,
    value INT,
    status BOOLEAN,
    category VARCHAR(10)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO example_data (id, value, status, category) VALUES
(1, 50, TRUE, 'A'),
(2, 30, FALSE, 'B'),
(3, 40, NULL, 'C'),
(4, 70, TRUE, 'A '),
(5, 60, FALSE, 'B');

SELECT *
FROM
    example_data
WHERE value < 60 AND value > 40;

SELECT *
FROM
    example_data
WHERE value <= 60 AND value >= 40;

SELECT *
FROM
    example_data
WHERE category = 'A';

SELECT *
FROM
    example_data
WHERE category != 'A' AND category <> 'B';

SELECT *
FROM
    example_data
WHERE category = ANY(SELECT category FROM
            example_data
        WHERE value > 60); --SOME

SELECT *
FROM
    example_data
WHERE value <> ALL (SELECT value FROM
            example_data
        WHERE status = TRUE);

SELECT *
FROM
    example_data
WHERE status;

SELECT *
FROM
    example_data
WHERE
    NOT status;

SELECT *
FROM
    example_data
WHERE status IS NULL;
```

##### Results

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 3 | 40 | null | C |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 3 | 40 | null | C |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 2 | 30 | false | B |
| 4 | 40 | null | C |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 1 | 50 | true | A |
| 4 | 70 | true | A |

| id | value | status | category |
| --- | --- | --- | --- |
| 2 | 30 | false | B |
| 5 | 60 | false | B |

| id | value | status | category |
| --- | --- | --- | --- |
| 4 | 40 | null | C |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

## EXISTS

### Description

> EXISTS conditions test for the existence of rows in a subquery, and return true if a subquery returns at least one row. If NOT is specified, the condition returns true if a subquery returns no rows. ([Redshift SQL Language Reference EXISTS condition](https://docs.aws.amazon.com/redshift/latest/dg/r_exists_condition.html))

### Grammar Syntax

```sql
 [ NOT ] EXISTS (table_subquery)
```

> **Note:**
>
> This function is fully supported by [Snowflake](https://docs.snowflake.com/en/sql-reference/functions/coalesce).

### Sample Source Patterns

#### Setup Table

```sql
 CREATE TABLE ExistsTest (
    id INTEGER,
    name VARCHAR(30),
    lastname VARCHAR(30)
);

INSERT INTO ExistsTest (id, name, lastname) VALUES
 (1, 'name1', 'lastname1'),
 (2, 'name2', NULL),
 (3, 'name3', 'lastname3'),
 (4, 'name4', NULL);
```

```sql
 CREATE TABLE ExistsTest (
    id INTEGER,
    name VARCHAR(30),
    lastname VARCHAR(30)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/08/2025",  "domain": "test" }}'

INSERT INTO ExistsTest (id, name, lastname) VALUES
 (1, 'name1', 'lastname1'),
 (2, 'name2', NULL),
 (3, 'name3', 'lastname3'),
 (4, 'name4', NULL);
```

##### Input Code:

##### Redshift

```sql
 SELECT * FROM ExistsTest
WHERE EXISTS (
SELECT 1 FROM ExistsTest
WHERE lastname = 'lastname1'
)
ORDER BY id;
```

##### Results

| ID | NAME | LASTNAME |
| --- | --- | --- |
| 1 | name1 | lastname1 |
| 2 | name2 | NULL |
| 3 | name3 | lastname3 |
| 4 | name4 | NULL |

##### Output Code:

##### Snowflake

```sql
 SELECT * FROM
ExistsTest
WHERE EXISTS (
SELECT 1 FROM
ExistsTest
WHERE lastname = 'lastname1'
)
ORDER BY id;
```

##### Results

| ID | NAME | LASTNAME |
| --- | --- | --- |
| 1 | name1 | lastname1 |
| 2 | name2 | NULL |
| 3 | name3 | lastname3 |
| 4 | name4 | NULL |

### Related EWIs

No related EWIs.

### Known Issues

No issues were found.

## IN

### Description

> An IN condition tests a value for membership in a set of values or in a subquery. ([Redshift SQL Language Reference IN condition](https://docs.aws.amazon.com/redshift/latest/dg/r_in_condition.html))

### Grammar Syntax

```sql
 expression [ NOT ] IN (expr_list | table_subquery)
```

> **Note:**
>
> This function is fully supported by [Snowflake](https://docs.snowflake.com/en/sql-reference/functions/coalesce).

### Sample Source Patterns

#### Setup Table

##### Redshift

```sql
 CREATE TABLE sales (
    id INTEGER IDENTITY(1,1),
    price FLOAT,
    saleDate DATE
);

INSERT INTO sales (price, saleDate) VALUES
(5000, '12/19/2024'),
(4000, '12/18/2024'),
(2000, '12/17/2024'),
(1000, '11/11/2024'),
(7000, '10/10/2024'),
(7000, '05/12/2024');

CREATE TABLE InTest (
col1 Varchar(20) COLLATE CASE_INSENSITIVE,
col2 Varchar(30) COLLATE CASE_SENSITIVE,
d1 date,
num integer,
idx integer);

INSERT INTO InTest values ('A', 'A', ('2012-03-02'), 4,6);
INSERT INTO InTest values ('a', 'a', ('2014-01-02'), 41,7);
```

##### Snowflake

```sql
 CREATE TABLE InTest (
    id INTEGER IDENTITY(1,1) ORDER,
    price FLOAT,
    saleDate DATE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/09/2025",  "domain": "test" }}';

INSERT INTO InTest (price, saleDate) VALUES
(5000, '12/19/2024'),
(4000, '12/18/2024'),
(2000, '12/17/2024'),
(1000, '11/11/2024'),
(7000, '10/10/2024'),
(7000, '05/12/2024');

CREATE TABLE InTest (
col1 Varchar(20) COLLATE 'en-ci',
col2 Varchar(30) COLLATE 'en-cs',
d1 date,
num integer,
idx integer)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/16/2025",  "domain": "test" }}';

INSERT INTO InTest
values ('A', 'A', ('2012-03-02'), 4,6);
INSERT INTO InTest
values ('a', 'a', ('2014-01-02'), 41,7);
```

##### Input Code:

##### Redshift

```sql
 SELECT * FROM sales
WHERE id IN (2,3);

SELECT 5 IN (
SELECT id FROM sales
WHERE price = 7000
) AS ValidId;

select t.col1 in ('a ','b','c') as r1, t.col2 in ('a ','b','c') as r2 from InTest t order by t.idx;
```

##### Results

| ID | PRICE | SALEDATE |
| --- | --- | --- |
| 2 | 4000 | 2024-12-18 |
| 3 | 2000 | 2024-12-17 |

| VALIDID |
| --- |
| TRUE |

| R1 | R2 |
| --- | --- |
| TRUE | FALSE |
| TRUE | TRUE |

##### Output Code:

##### Snowflake

```sql
 SELECT * FROM
    sales
WHERE id IN (2,3);

SELECT 5 IN (
SELECT id FROM
 sales
WHERE price = 7000
) AS ValidId;

select t.col1 in (RTRIM('a '), RTRIM('b'), RTRIM('c')) as r1, t.col2 in (RTRIM('a '), RTRIM('b'), RTRIM('c')) as r2 from
InTest t order by t.idx;
```

##### Results

| ID | PRICE | SALEDATE |
| --- | --- | --- |
| 2 | 4000 | 2024-12-18 |
| 3 | 2000 | 2024-12-17 |

| VALIDID |
| --- |
| TRUE |

| R1 | R2 |
| --- | --- |
| TRUE | FALSE |
| TRUE | TRUE |

### Related EWIs

No related EWIs.

### Known Issues

No issues were found.

## Logical Conditions

### Description

> Logical conditions combine the result of two conditions to produce a single result. All logical conditions are binary operators with a Boolean return type. ([Redshift SQL Language reference Logical Conditions](https://docs.aws.amazon.com/redshift/latest/dg/r_logical_condition.html)).

> **Note:**
>
> This grammar is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/operators-logical).

### Grammar Syntax

```sql
expression
{ AND | OR }
expression
NOT expression
```

| E1 | E2 | E1 AND E2 | E1 OR E2 | NOT E2 |
| --- | --- | --- | --- | --- |
| TRUE | TRUE | TRUE | TRUE | FALSE |
| TRUE | FALSE | FALSE | TRUE | TRUE |
| TRUE | UNKNOWN | UNKNOWN | TRUE | UNKNOWN |
| FALSE | TRUE | FALSE | TRUE |  |
| FALSE | FALSE | FALSE | FALSE |  |
| FALSE | UNKNOWN | FALSE | UNKNOWN |  |
| UNKNOWN | TRUE | UNKNOWN | TRUE |  |
| UNKNOWN | FALSE | FALSE | UNKNOWN |  |
| UNKNOWN | UNKNOWN | UNKNOWN | UNKNOWN |  |

### Sample Source Patterns

#### Setup data

##### Redshift

```sql
 CREATE TABLE employee (
    employee_id INT,
    active BOOLEAN,
    department VARCHAR(100),
    hire_date DATE,
    salary INT
);

INSERT INTO employee (employee_id, active, department, hire_date, salary) VALUES
    (1, TRUE, 'Engineering', '2021-01-15', 70000),
    (2, FALSE, 'HR', '2020-03-22', 50000),
    (3, NULL, 'Marketing', '2019-05-10', 60000),
    (4, TRUE, 'Engineering', NULL, 65000),
    (5, TRUE, 'Sales', '2018-11-05', NULL);
```

##### Input Code:

##### Redshift

```sql
 SELECT
    employee_id,
    (active AND department = 'Engineering') AS is_active_engineering,
    (department = 'HR' OR salary > 60000) AS hr_or_high_salary,
    NOT active AS is_inactive,
    (hire_date IS NULL) AS hire_date_missing,
    (salary IS NULL OR salary < 50000) AS low_salary_or_no_salary
FROM employee;
```

##### Results

| EMPLOYEE_ID | IS_ACTIVE_ENGINEERING | HR_OR_HIGH_SALARY | IS_INACTIVE | HIRE_DATE_MISSING | LOW_SALARY_OR_NO_SALARY |
| --- | --- | --- | --- | --- | --- |
| 1 | TRUE | TRUE | FALSE | FALSE | FALSE |
| 2 | FALSE | TRUE | TRUE | FALSE | FALSE |
| 3 | FALSE | FALSE | NULL | FALSE | FALSE |
| 4 | TRUE | TRUE | FALSE | TRUE | FALSE |
| 5 | FALSE | NULL | FALSE | FALSE | TRUE |

**Output Code:**

##### Snowflake

```sql
 SELECT
    employee_id,
    (active AND department = 'Engineering') AS is_active_engineering,
    (department = 'HR' OR salary > 60000) AS hr_or_high_salary,
    NOT active AS is_inactive,
    (hire_date IS NULL) AS hire_date_missing,
    (salary IS NULL OR salary < 50000) AS low_salary_or_no_salary
FROM
    employee;
```

##### Results

| EMPLOYEE_ID | IS_ACTIVE_ENGINEERING | HR_OR_HIGH_SALARY | IS_INACTIVE | HIRE_DATE_MISSING | LOW_SALARY_OR_NO_SALARY |
| --- | --- | --- | --- | --- | --- |
| 1 | TRUE | TRUE | FALSE | FALSE | FALSE |
| 2 | FALSE | TRUE | TRUE | FALSE | FALSE |
| 3 | FALSE | FALSE | NULL | FALSE | FALSE |
| 4 | TRUE | TRUE | FALSE | TRUE | FALSE |
| 5 | FALSE | NULL | FALSE | FALSE | TRUE |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

## NULL

### Description

> The null condition tests for nulls, when a value is missing or unknown. ([Redshift SQL Language Reference NULL condition](https://docs.aws.amazon.com/redshift/latest/dg/r_null_condition.html))

### Grammar Syntax

```sql
 expression IS [ NOT ] NULL
```

> **Note:**
>
> This function is fully supported by [Snowflake](https://docs.snowflake.com/en/sql-reference/functions/coalesce).

### Sample Source Patterns

#### Setup Table

##### Redshift

```sql
 CREATE TABLE NullTest (
    id INTEGER,
    name VARCHAR(30),
    lastname VARCHAR(30)
);

INSERT INTO NullTest (id, name, lastname) VALUES
 (1, 'name1', 'lastname1'),
 (2, 'name2', NULL),
 (3, 'name3', 'lastname3'),
 (4, 'name4', NULL);
```

##### Snowflake

```sql
 CREATE TABLE NullTest (
    id INTEGER,
    name VARCHAR(30),
    lastname VARCHAR(30)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/08/2025",  "domain": "test" }}';

INSERT INTO NullTest (id, name, lastname) VALUES
 (1, 'name1', 'lastname1'),
 (2, 'name2', NULL),
 (3, 'name3', 'lastname3'),
 (4, 'name4', NULL);
```

##### Input Code:

##### Redshift

```sql
 SELECT * FROM nulltest
WHERE lastname IS NULL;
```

##### Results

| ID | NAME | LASTNAME |
| --- | --- | --- |
| 2 | name2 | NULL |
| 4 | name4 | NULL |

##### Output Code:

##### Snowflake

```sql
 SELECT * FROM
    nulltest
WHERE lastname IS NULL;
```

##### Results

| ID | NAME | LASTNAME |
| --- | --- | --- |
| 2 | name2 | NULL |
| 4 | name4 | NULL |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Pattern-matching conditions

### Description

A pattern-matching operator searches a string for a pattern specified in the conditional expression and returns true or false depend on whether it finds a match. Amazon Redshift uses three methods for pattern matching:

* LIKE expressions The LIKE operator compares a string expression, such as a column name, with a pattern that uses the wildcard characters `%` (percent) and `_` (underscore). LIKE pattern matching always covers the entire string. LIKE performs a case-sensitive match and ILIKE performs a case-insensitive match.
* SIMILAR TO regular expressions The SIMILAR TO operator matches a string expression with a SQL standard regular expression pattern, which can include a set of pattern-matching metacharacters that includes the two supported by the LIKE operator. SIMILAR TO matches the entire string and performs a case-sensitive match.
* POSIX-style regular expressions POSIX regular expressions provide a more powerful means for pattern matching than the LIKE and SIMILAR TO operators. POSIX regular expression patterns can match any portion of the string and performs a case-sensitive match.
  ([Redshift SQL Language reference Pattern-matching conditions](https://docs.aws.amazon.com/redshift/latest/dg/pattern-matching-conditions.html)).

### Known Issues

* In Snowflake, the behavior for scenarios such as (`LIKE`, `SIMILAR` TO, and `POSIX Operators`) can vary when the column is of type CHAR. For example:

### Code

```sql
 CREATE TEMPORARY TABLE pattern_matching_sample (
  col1 CHAR(10),
  col2 VARCHAR(10)
);

INSERT INTO pattern_matching_sample VALUES ('1','1');
INSERT INTO pattern_matching_sample VALUES ('1234567891','1234567891');
INSERT INTO pattern_matching_sample VALUES ('234567891','234567891');

SELECT
col1 LIKE '%1' as "like(CHAR(10))",
COL2 LIKE '%1' as "like(VARCHAR(10))"
FROM
pattern_matching_sample;
```

#### Redshift Results

| like(CHAR(10)) | like(VARCHAR(10)) |
| --- | --- |
| FALSE | TRUE |
| TRUE | TRUE |
| FALSE | TRUE |

##### Snowflake Results

| like(CHAR(10)) | like(VARCHAR(10)) |
| --- | --- |
| TRUE | TRUE |
| TRUE | TRUE |
| TRUE | TRUE |

It appears that, because CHAR(10) is “fixed-length,” it assumes the ‘%1’ pattern must match a ‘1’ in the 10th position of a CHAR(10) column. However, in Snowflake, it matches if a ‘1’ exists in the string, with any sequence of zero or more characters preceding it.

## LIKE

Pattern-matching conditions

### Description

> The LIKE operator compares a string expression, such as a column name, with a pattern that uses the wildcard characters % (percent) and _ (underscore). LIKE pattern matching always covers the entire string. To match a sequence anywhere within a string, the pattern must start and end with a percent sign. ([Redshift SQL Language reference LIKE](https://docs.aws.amazon.com/redshift/latest/dg/r_patternmatching_condition_like.html)).

> **Note:**
>
> This grammar is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/operators-logical).

> **Note:**
>
> In Snowflake the cases where the escape character is not provided, the default Redshift escape character `'\\'` will be added for full equivalence.

### Grammar Syntax

```sql
 expression [ NOT ] LIKE | ILIKE pattern [ ESCAPE 'escape_char' ]
```

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE like_ex(name VARCHAR(20));

INSERT INTO like_ex VALUES
  ('John  Dddoe'),
  ('Joe   Doe'),
  ('Joe   Doe '),
  (' Joe   Doe '),
  (' Joe \n Doe '),
  ('John_down'),
  ('Joe down'),
  ('Elaine'),
  (''),
  (null),
  ('1000 times'),
  ('100%');
```

#### Like

##### Input Code:

##### Redshift

```sql
SELECT name
  FROM like_ex
  WHERE name LIKE '%Jo%oe%'
  ORDER BY name;
```

##### Results

| NAME |
| --- |
| Joe Doe |
| Joe Doe |
| Joe Doe |
| Joe Doe |
| John Dddoe |

**Output Code:**

##### Snowflake

```sql
 SELECT name
  FROM like_ex
  WHERE name LIKE '%Jo%oe%' ESCAPE '\\'
  ORDER BY name;
```

##### Results

| NAME |
| --- |
| Joe Doe |
| Joe Doe |
| Joe Doe |
| Joe Doe |
| John Dddoe |

#### Not like

##### Input Code:

##### Redshift

```sql
 SELECT name
  FROM like_ex
  WHERE name NOT LIKE '%Jo%oe%'
  ORDER BY name;
```

##### Results

| NAME |
| --- |
|  |
| 100% |
| 1000 times |
| Elaine |
| Joe down |
| John_down |

**Output Code:**

##### Snowflake

```sql
 SELECT name
  FROM like_ex
  WHERE name NOT LIKE '%Jo%oe%' ESCAPE '\\'
  ORDER BY name;
```

##### Results

| NAME |
| --- |
|  |
| 100% |
| 1000 times |
| Elaine |
| Joe down |
| John_down |

#### Escape characters

##### Input Code:

##### Redshift

```sql
 SELECT name
  FROM like_ex
  WHERE name LIKE '%J%h%^_do%' ESCAPE '^'
  ORDER BY name;

SELECT name
 FROM like_ex
 WHERE name LIKE '100\\%'
 ORDER BY 1;
```

##### Results

| NAME |
| --- |
| John_down |

| NAME |
| --- |
| 100% |

**Output Code:**

##### Snowflake

```sql
 SELECT name
  FROM like_ex
  WHERE name LIKE '%J%h%^_do%' ESCAPE '^'
  ORDER BY name;

SELECT name
 FROM like_ex
 WHERE name LIKE '100\\%' ESCAPE '\\'
 ORDER BY 1;
```

##### Results

| NAME |
| --- |
| John_down |

| NAME |
| --- |
| 100% |

#### ILike

##### Input Code:

##### Redshift

```sql
 SELECT 'abc' LIKE '_B_' AS r1,
       'abc' ILIKE '_B_' AS r2;
```

##### Results

| R1 | R2 |
| --- | --- |
| FALSE | TRUE |

**Output Code:**

##### Snowflake

```sql
 SELECT 'abc' LIKE '_B_' ESCAPE '\\' AS r1,
       'abc' ILIKE '_B_' ESCAPE '\\' AS r2;
```

##### Results

| R1 | R2 |
| --- | --- |
| FALSE | TRUE |

#### Operators

The following operators are translated as follows:

| Redshift | Snowflake |
| --- | --- |
| ~~ | LIKE |
| !~~ | NOT LIKE |
| ~~\* | ILIKE |
| !~~\* | NOT ILIKE |

##### Input Code:

##### Redshift

```sql
 SELECT 'abc' ~~ 'abc' AS r1,
       'abc' !~~ 'a%' AS r2,
       'abc' ~~* '_B_' AS r3,
       'abc' !~~* '_B_' AS r4;
```

##### Results

| R1 | R2 | R3 | R4 |
| --- | --- | --- | --- |
| TRUE | FALSE | TRUE | FALSE |

**Output Code:**

##### Snowflake

```sql
 SELECT 'abc' LIKE 'abc' ESCAPE '\\' AS r1,
       'abc' NOT LIKE 'a%' ESCAPE '\\' AS r2,
       'abc' ILIKE '_B_' ESCAPE '\\' AS r3,
       'abc' NOT ILIKE '_B_' ESCAPE '\\' AS r4;
```

##### Results

| R1 | R2 | R3 | R4 |
| --- | --- | --- | --- |
| TRUE | FALSE | TRUE | FALSE |

### Known Issues

1. The behavior of fixed char types may differ. See Known issues for more information.

### Related EWIs

There are no known issues.

## POSIX Operators

Pattern-matching conditions

### Description

> A POSIX regular expression is a sequence of characters that specifies a match pattern. A string matches a regular expression if it is a member of the regular set described by the regular expression. POSIX regular expression patterns can match any portion of a string. ([Redshift SQL Language reference POSIX Operators](https://docs.aws.amazon.com/redshift/latest/dg/pattern-matching-conditions-posix.html)).

> **Warning:**
>
> This grammar is partially supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/operators-logical). POSIX Operators are transformed to [REGEXP_COUNT](https://docs.snowflake.com/en/sql-reference/functions/regexp_count) in Snowflake.

### Grammar Syntax

```sql
 expression [ ! ] ~ pattern
```

### POSIX pattern-matching metacharacters

POSIX pattern matching supports the following metacharacters (all the cases are supported in Snowflake):

| POSIX | Description |
| --- | --- |
| . | Matches any single character. |
| `*` | Matches zero or more occurrences. |
| `+` | Matches one or more occurrences. |
| `?` | Matches zero or one occurrence. |
| `|` | Specifies alternative matches. |
| `^` | Matches the beginning-of-line character. |
| `$` | Matches the end-of-line character. |
| `$` | Matches the end of the string. |
| [ ] | Brackets specify a matching list, that should match one expression in the list. |
| `( )` | Parentheses group items into a single logical item. |
| `{m}` | Repeat the previous item exactly *m* times. |
| `{m,}` | Repeat the previous item *m* or more times. |
| `{m,n}` | Repeat the previous item at least *m* and not more than *n* times. |
| `[: :]` | Matches any character within a POSIX character class. In the following character classes, Amazon Redshift supports only ASCII characters, just like Snowflake: `[:alnum:]`, `[:alpha:]`, `[:lower:]`, `[:upper:]` |

The parameters ‘m’ (enables multiline mode) and ‘s’ (allows the POSIX wildcard character `.` to match new lines) are used to achieve full equivalence in Snowflake. For more information please refer to [Specifying the parameters for the regular expression in Snowflake](https://docs.snowflake.com/en/sql-reference/functions-regexp#specifying-the-parameters-for-the-regular-expression).

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE posix_test_table (
    id INT,
    column_name VARCHAR(255)
);

INSERT INTO posix_test_table (id, column_name)
VALUES
    (1, 'abc123\nhello world'),
    (2, 'test string\nwith multiple lines\nin this entry'),
    (3, '123abc\nanother line\nabc123'),
    (4, 'line1\nline2\nline3'),
    (5, 'start\nmiddle\nend'),
    (6, 'a@b#c!\nmore text here'),
    (7, 'alpha\nbeta\ngamma'),
    (8, 'uppercase\nlowercase'),
    (9, 'line1\nline2\nline3\nline4'),
    (10, '1234567890\nmore digits'),
    (11, 'abc123\nabc456\nabc789'),
    (12, 'start\nend\nmiddle'),
    (13, 'this is the first line\nthis is the second line'),
    (14, 'special characters\n!@#$%^&*()');
```

#### . : Matches any character

##### Input Code:

##### Redshift

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE column_name ~ 'a.c';
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 11 | abc123 abc456 abc789 |

**Output Code:**

##### Snowflake

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE REGEXP_COUNT(column_name, 'a.c', 1, 'ms') > 0;
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 11 | abc123 abc456 abc789 |

#### \* : Matches zero or more occurrences.

##### Input Code:

##### Redshift

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE column_name ~ 'a*b';
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 6 | a@b#c! more text here |
| 7 | alpha beta gamma |
| 11 | abc123 abc456 abc789 |

**Output Code:**

##### Snowflake

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE REGEXP_COUNT(column_name, 'a*b', 1, 'ms') > 0;
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 6 | a@b#c! more text here |
| 7 | alpha beta gamma |
| 11 | abc123 abc456 abc789 |

#### ? : Matches zero or one occurrence

##### Input Code:

##### Redshift

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE column_name !~ 'a?b';
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 2 | test string with multiple lines in this entry |
| 4 | line1 line2 line3 |
| 5 | start middle end |
| 8 | uppercase lowercase |
| 9 | line1 line2 line3 line4 |
| 10 | 1234567890 more digits |
| 12 | start end middle |
| 13 | this is the first line this is the second line |
| 14 | special characters !@#$%^&\*() |

**Output Code:**

##### Snowflake

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE REGEXP_COUNT(column_name, 'a?b', 1, 'ms') = 0;
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 2 | test string with multiple lines in this entry |
| 4 | line1 line2 line3 |
| 5 | start middle end |
| 8 | uppercase lowercase |
| 9 | line1 line2 line3 line4 |
| 10 | 1234567890 more digits |
| 12 | start end middle |
| 13 | this is the first line this is the second line |
| 14 | special characters !@#$%^&\*() |

#### ^ : Matches the beginning-of-line character

##### Input Code:

##### Redshift

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE column_name ~ '^abc';
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 11 | abc123 abc456 abc789 |

**Output Code:**

##### Snowflake

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE REGEXP_COUNT(column_name, '^abc', 1, 'ms') > 0;
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 1 | abc123 hello world |
| 3 | 123abc another line abc123 |
| 11 | abc123 abc456 abc789 |

#### $ : Matches the end of the string.

##### Input Code:

##### Redshift

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE column_name !~ '123$';
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 2 | test string with multiple lines in this entry |
| 4 | line1 line2 line3 |
| 5 | start middle end |
| 6 | a@b#c! more text here |
| 7 | alpha beta gamma |
| 8 | uppercase lowercase |
| 9 | line1 line2 line3 line4 |
| 10 | 1234567890 more digits |
| 12 | start end middle |
| 13 | this is the first line this is the second line |
| 14 | special characters !@#$%^&\*() |

**Output Code:**

##### Snowflake

```sql
 SELECT id, column_name
FROM posix_test_table
WHERE REGEXP_COUNT(column_name, '123$', 1, 'ms') = 0;
```

##### Results

| ID | COLUMN_NAME |
| --- | --- |
| 2 | test string with multiple lines in this entry |
| 4 | line1 line2 line3 |
| 5 | start middle end |
| 6 | a@b#c! more text here |
| 7 | alpha beta gamma |
| 8 | uppercase lowercase |
| 9 | line1 line2 line3 line4 |
| 10 | 1234567890 more digits |
| 12 | start end middle |
| 13 | this is the first line this is the second line |
| 14 | special characters !@#$%^&\*() |

#### Usage of collate columns

Arguments with COLLATE specifications are not currently supported in the RLIKE function. As a result, the COLLATE clause must be disabled to use this function. However, this may lead to differences in the results.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE CASE_INSENSITIVE,
col2 VARCHAR(30) COLLATE CASE_SENSITIVE);

INSERT INTO collateTable values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
col1 ~ 'Hello.*' as ci,
col2 ~ 'Hello.*' as cs
FROM collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| TRUE | FALSE |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE 'en-ci',
col2 VARCHAR(30) COLLATE 'en-cs'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/16/2025",  "domain": "test" }}';

INSERT INTO collateTable
values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
REGEXP_COUNT(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col1, ''), 'Hello.*', 1, 'ms') > 0 as ci,
REGEXP_COUNT(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col2, ''), 'Hello.*', 1, 'ms') > 0 as cs
FROM
collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| FALSE | FALSE |

If you require equivalence for these scenarios, you can manually add the following parameters to the function to achieve functional equivalence:

| Parameter | Description |
| --- | --- |
| `c` | Case-sensitive matching |
| `i` | Case-insensitive matching |

### Known Issues

### Known Issues

1. The behavior of fixed char types may differ. See Known issues for more information.
2. Arguments with COLLATE specifications are not currently supported in the REGEXP_COUNT function.

### Related EWIs

* [SSC-FDM-PG0011](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): The use of the COLLATE column constraint has been disabled for this pattern-matching condition.

## SIMILAR TO

Pattern-matching conditions

### Description

> The SIMILAR TO operator matches a string expression, such as a column name, with a SQL standard regular expression pattern. A SQL regular expression pattern can include a set of pattern-matching metacharacters, including the two supported by the [LIKE](https://docs.aws.amazon.com/redshift/latest/dg/r_patternmatching_condition_like.html) operator. ([Redshift SQL Language reference SIMILAR TO](https://docs.aws.amazon.com/redshift/latest/dg/pattern-matching-conditions-similar-to.html)).

> **Warning:**
>
> This grammar is partially supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/operators-logical). SIMILAR TO is transformed to [RLIKE](https://docs.snowflake.com/en/sql-reference/functions/rlike) in Snowflake.

### Grammar Syntax

```sql
 expression [ NOT ] SIMILAR TO pattern [ ESCAPE 'escape_char' ]
```

### Pattern-matching metacharacters

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| ```{code} sql :force: % ``` | ```{code} sql :force: .\* ``` | Matches any sequence of zero or more characters. To achieve full equivalence in Snowflake, we need to replace the '%' operator with '.\*' in the pattern. |
| ```{code} sql :force: _ ``` | ```{code} sql :force: . ``` | Matches any single character. To achieve full equivalence in Snowflake, we need to replace the `_` operator with `.` and add the `s` parameter to enable the POSIX wildcard character `.` to match newline characters. |
| ```{code} sql :force: | ``` | ```{code} sql :force: | ``` | Denotes alternation. This case is fully supported in Snowflake. |
| ```{code} sql :force: \* ``` | ```{code} sql :force: \* ``` | Repeat the previous item zero or more times. This can have a different behavior when newline characters are included. |
| ```{code} sql :force: + ``` | ```{code} sql :force: + ``` | Repeat the previous item one or more times. This can have a different behavior when newline characters are included. |
| ```{code} sql :force: ? ``` | ```{code} sql :force: ? ``` | Repeat the previous item zero or one time. This can have a different behavior when newline characters are included. |
| ```{code} sql :force: {m} ``` | ```{code} sql :force: {m} ``` | Repeat the previous item exactly *m* times and it is fully supported in Snowflake. |
| ```{code} sql :force: {m,} ``` | ```{code} sql :force: {m,} ``` | Repeat the previous item at least *m* and not more than *n* times and it is fully supported in Snowflake. |
| ```{code} sql :force: {m,n} ``` | ```{code} sql :force: {m,n} ``` | Repeat the previous item *m* or more times and it is fully supported in Snowflake. |
| ```{code} sql :force: () ``` | ```{code} sql :force: () ``` | Parentheses group items into a single logical item and it is fully supported in Snowflake. |
| ```{code} sql :force: [...] ``` | ```{code} sql :force: [...] ``` | A bracket expression specifies a character class, just as in POSIX regular expressions. |

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE similar_table_ex (
    column_name VARCHAR(255)
);

INSERT INTO similar_table_ex (column_name)
VALUES
    ('abc_123'),
    ('a_cdef'),
    ('bxyz'),
    ('abcc'),
    ('start_hello'),
    ('apple'),
    ('banana'),
    ('xyzabc'),
    ('abc\ncccc'),
    ('\nabccc'),
    ('abc%def'),
    ('abc_xyz'),
    ('abc_1_xyz'),
    ('applepie'),
    ('start%_abc'),
    ('ab%_xyz'),
    ('abcs_123_xyz'),
    ('aabc123'),
    ('xyzxyz'),
    ('123abc\nanother line\nabc123');
```

#### % : Matches any sequence of zero or more characters

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO '%abc%';
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abcc |
| xyzabc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| start%_abc |
| abcs_123_xyz |
| aabc123 |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, '.*abc.*', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abcc |
| xyzabc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| start%_abc |
| abcs_123_xyz |
| aabc123 |

#### _ : Matches any single character

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'a_c%';
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| a_cdef |
| abcc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| abcs_123_xyz |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, 'a.c.*', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| a_cdef |
| abcc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| abcs_123_xyz |

#### | : Denotes alternation

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'a|b%';
```

##### Results

| COLUMN_NAME |
| --- |
| bxyz |
| banana |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, 'a|b.*', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| bxyz |
| banana |

#### {m, n} : Repeat the previous item exactly *m* times.

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'abc{2,4}';
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, 'abc{2,4}', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |

#### + : Repeat the previous item one or more times

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'abc+';
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |
| abc cccc |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, 'abc+', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |

#### \* : Repeat the previous item zero or more times

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'abc*c';
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |
| abc cccc |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, 'abc*c', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |

#### ? : Repeat the previous item zero or one time

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO 'abc?c';
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |
| abc ccc |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM
similar_table_ex
WHERE
RLIKE( column_name, 'abc?c', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abcc |

#### () : Parentheses group items into a single logical item

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO '(abc|xyz)%';
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abcc |
| xyzabc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| abcs_123_xyz |
| xyzxyz |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, '(abc|xyz).*', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abcc |
| xyzabc |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| abcs_123_xyz |
| xyzxyz |

#### […] : Specifies a character class

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO '[a-c]%';
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| a_cdef |
| bxyz |
| abcc |
| apple |
| banana |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| applepie |
| ab%_xyz |
| abcs_123_xyz |
| aabc123 |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM similar_table_ex
WHERE RLIKE (column_name, '[a-c].*', 's');
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| a_cdef |
| bxyz |
| abcc |
| apple |
| banana |
| abc cccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| applepie |
| ab%_xyz |
| abcs_123_xyz |
| aabc123 |

#### Escape characters

The following characters will be escaped if they appear in the pattern and are not the escape character itself:

* .
* $
* ^

##### Input Code:

##### Redshift

```sql
 SELECT column_name
FROM similar_table_ex
WHERE column_name SIMILAR TO '%abc^_%' ESCAPE '^';

SELECT '$0.87' SIMILAR TO '$[0-9]+(.[0-9][0-9])?' r1;
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abc_xyz |
| abc_1_xyz |

| R1 |
| --- |
| TRUE |

**Output Code:**

##### Snowflake

```sql
 SELECT column_name
FROM
similar_table_ex
WHERE
RLIKE( column_name, '.*abc\_.*', 's');

SELECT
RLIKE( '$0.87', '\\$[0-9]+(\\.[0-9][0-9])?', 's') r1;
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abc_xyz |
| abc_1_xyz |

| R1 |
| --- |
| TRUE |

#### Pattern stored in a variable

If these patterns are stored in a variable, the required adjustments for equivalence will not be applied. You can refer to the recommendations outlined in the table at the beginning of this document for additional equivalence guidelines.

##### Input Code:

##### Redshift

```sql
 WITH pattern AS (
    SELECT '%abc%'::VARCHAR AS search_pattern
)
SELECT column_name
FROM similar_table_ex, pattern
WHERE column_name SIMILAR TO pattern.search_pattern;
```

##### Results

| COLUMN_NAME |
| --- |
| abc_123 |
| abcc |
| xyzabc |
| abc cccc |
| abccc |
| abc%def |
| abc_xyz |
| abc_1_xyz |
| start%_abc |
| abcs_123_xyz |
| aabc123 |
| 123abc another line abc123 |

**Output Code:**

##### Snowflake

```sql
 WITH pattern AS (
    SELECT '%abc%'::VARCHAR AS search_pattern
)
SELECT column_name
FROM
similar_table_ex,
pattern
WHERE
RLIKE( column_name,
                    --** SSC-FDM-0032 - PARAMETER 'search_pattern' IS NOT A LITERAL VALUE, TRANSFORMATION COULD NOT BE FULLY APPLIED **
                    pattern.search_pattern, 's');
```

##### Results

| COLUMN_NAME |
| --- |
| Query produced no results |

#### Usage of collate columns

Arguments with COLLATE specifications are not currently supported in the RLIKE function. As a result, the COLLATE clause must be disabled to use this function. However, this may lead to differences in the results.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE CASE_INSENSITIVE,
col2 VARCHAR(30) COLLATE CASE_SENSITIVE);

INSERT INTO collateTable values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
col1 SIMILAR TO 'Hello%' as ci,
col2 SIMILAR TO 'Hello%' as cs
FROM collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| TRUE | FALSE |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE collateTable (
col1 VARCHAR(20) COLLATE 'en-ci',
col2 VARCHAR(30) COLLATE 'en-cs'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/16/2025",  "domain": "test" }}';

INSERT INTO collateTable
values ('HELLO WORLD!', 'HELLO WORLD!');

SELECT
RLIKE(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col1, ''), 'Hello.*', 's') as ci,
RLIKE(COLLATE(
--** SSC-FDM-PG0011 - THE USE OF THE COLLATE COLUMN CONSTRAINT HAS BEEN DISABLED FOR THIS PATTERN-MATCHING CONDITION. **
col2, ''), 'Hello.*', 's') as cs
FROM
collateTable;
```

##### Results

| CI | CS |
| --- | --- |
| FALSE | FALSE |

If you require equivalence for these scenarios, you can manually add the following parameters to the function to achieve functional equivalence:

| Parameter | Description |
| --- | --- |
| `c` | Case-sensitive matching |
| `i` | Case-insensitive matching |

### Known Issues

1. The behavior of fixed char types may differ.
2. The `RLIKE` function uses POSIX extended regular expressions, which may result in different behavior in certain cases, especially when line breaks are involved. It appears that when line breaks are present in the string and a match occurs on one line, it returns a positive result for the entire string, even though the match only occurred on a single line and not across the whole string. For example:

#### Redshift code

```sql
 CREATE TABLE table1 (
col1 VARCHAR(20)
);

INSERT INTO table1 values ('abcccc'), ('abc\neab'), ('abc\nccc');

SELECT col1
FROM table1
WHERE col1 SIMILAR TO 'abc*c';
```

##### Snowflake code

```sql
 CREATE TABLE table1 (
col1 VARCHAR(20)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/14/2025",  "domain": "test" }}';

INSERT INTO table1
values ('abcccc'), ('abc\neab'), ('abc\nccc');

SELECT col1
FROM
table1
WHERE
RLIKE( col1, 'abc*c', 's');
```

##### Redshift Results

| COL1 |
| --- |
| abcccc |
| abc eab |
| abc ccc |

##### Snowflake Results

| COL1 |
| --- |
| abcccc |

1. To achieve maximum equivalence, some modifications are made to the pattern operators.
2. If these patterns are stored in a variable, SnowConvert AI does not apply the necessary adjustments for equivalence.
3. Arguments with COLLATE specifications are not currently supported in the RLIKE function.

### Related EWIs

* [SSC-FDM-0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Parameter is not a literal value, transformation could not be fully applied.
* [SSC-FDM-PG0011](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): The use of the COLLATE column constraint has been disabled for this pattern-matching condition.

---
title: SnowConvert AI - Redshift - CONTINUE HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-continue-handler.md
section: Migrations
---

# SnowConvert AI - Redshift - CONTINUE HANDLER

## Description

Amazon Redshift, which uses PL/pgSQL for procedural logic, does not have a native `DECLARE CONTINUE HANDLER` statement in the same way as systems like DB2 or Teradata. In Redshift, exception handling is managed through `EXCEPTION` blocks within procedures.

However, when migrating code from database systems that use CONTINUE HANDLERs (such as DB2, Teradata, or other systems), SnowConvert AI transforms these constructs into equivalent Snowflake Scripting exception handling mechanisms.

A CONTINUE HANDLER allows execution to continue after an error occurs, performing specific actions when certain conditions are met. In Snowflake, this behavior is emulated using EXCEPTION blocks with appropriate error handling logic.

For more information about Redshift exception handling, see [Exception Handling in PL/pgSQL](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-messages-errors).

## Grammar Syntax

Redshift does not have native CONTINUE HANDLER syntax. However, when converting from other database systems, the source pattern typically looks like:

```sql
-- Pattern from source systems (e.g., DB2, Teradata)
DECLARE CONTINUE HANDLER FOR condition_value
  handler_action_statement;
```

In Redshift, exception handling uses:

```sql
BEGIN
  -- statements
EXCEPTION
  WHEN condition THEN
    -- handler statements
END;
```

## Sample Source Patterns

### CONTINUE HANDLER Conversion to Snowflake

When migrating stored procedures from systems with CONTINUE HANDLER to Snowflake via Redshift, SnowConvert AI transforms them into Snowflake-compatible exception handling.

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
-- Example pattern from source system
CREATE PROCEDURE example_handler_procedure()
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLSTATE '02000'
    BEGIN
        -- Handler action: log the error
        INSERT INTO error_log VALUES (CURRENT_TIMESTAMP, 'No data found');
    END;

    -- Main procedure logic
    SELECT column1 INTO result_var FROM table1 WHERE id = 999;
    INSERT INTO results VALUES (result_var);
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE example_handler_procedure()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        result_var VARCHAR;
    BEGIN
        BEGIN
            -- Main procedure logic
            SELECT column1 INTO result_var FROM table1 WHERE id = 999;
        EXCEPTION
            WHEN NO_DATA_FOUND THEN
                -- Handler action: log the error
                INSERT INTO error_log
                VALUES (CURRENT_TIMESTAMP(), 'No data found');
                -- Continue execution by not re-raising
        END;

        INSERT INTO results VALUES (result_var);
    END;
$$;
```

### CONTINUE HANDLER with SQLEXCEPTION

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE multi_statement_handler()
BEGIN
    DECLARE error_count INT DEFAULT 0;

    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
    BEGIN
        SET error_count = error_count + 1;
    END;

    -- Multiple statements that might fail
    UPDATE table1 SET status = 'processed' WHERE id = -1;
    DELETE FROM table2 WHERE amount = 0/0;
    INSERT INTO table3 VALUES (1, 'Success');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE multi_statement_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        error_count INT := 0;
    BEGIN
        -- Multiple statements with individual exception handling
        BEGIN
            UPDATE table1 SET status = 'processed' WHERE id = -1;
        EXCEPTION
            WHEN OTHER THEN
                error_count := error_count + 1;
        END;

        BEGIN
            DELETE FROM table2 WHERE amount = 0/0;
        EXCEPTION
            WHEN OTHER THEN
                error_count := error_count + 1;
        END;

        INSERT INTO table3 VALUES (1, 'Success');
    END;
$$;
```

### CONTINUE HANDLER for NOT FOUND

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE cursor_with_handler()
BEGIN
    DECLARE done INT DEFAULT 0;
    DECLARE val INT;

    DECLARE CONTINUE HANDLER FOR NOT FOUND
        SET done = 1;

    DECLARE cur CURSOR FOR SELECT id FROM table1;

    OPEN cur;

    read_loop: LOOP
        FETCH cur INTO val;
        IF done = 1 THEN
            LEAVE read_loop;
        END IF;
        -- Process val
        INSERT INTO results VALUES (val);
    END LOOP;

    CLOSE cur;
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE cursor_with_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        done INT := 0;
        val INT;
        cur CURSOR FOR SELECT id FROM table1;
    BEGIN
        OPEN cur;

        LOOP
            BEGIN
                FETCH cur INTO val;
            EXCEPTION
                WHEN NO_DATA_FOUND THEN
                    done := 1;
            END;

            IF (done = 1) THEN
                BREAK;
            END IF;

            -- Process val
            INSERT INTO results VALUES (val);
        END LOOP;

        CLOSE cur;
    END;
$$;
```

## Known Issues

### Limited CONTINUE HANDLER Emulation

The conversion from CONTINUE HANDLER to Snowflake exception handling has some limitations:

1. **Execution Flow**: True CONTINUE HANDLER behavior (continuing from the exact point of error) cannot be fully replicated in Snowflake.
2. **Performance**: Wrapping individual statements in exception blocks can impact performance.
3. **Granularity**: Statement-level exception handling may be required to properly emulate CONTINUE HANDLER behavior.

### SQLSTATE Mapping

Not all SQLSTATE codes from source systems map directly to Snowflake exception types. SnowConvert AI performs best-effort mapping:

* `SQLSTATE '02000'` (NO DATA) → `NO_DATA_FOUND`
* `SQLSTATE '23xxx'` (Integrity Constraint Violation) → `STATEMENT_ERROR`
* Generic SQLEXCEPTION → `OTHER`

#### Known Issues

When migrating CONTINUE HANDLER patterns from other systems to Redshift and then to Snowflake, be aware that exception handling behavior may differ between systems. Thorough testing is recommended to ensure the converted code maintains the intended behavior.

### SQLWARNING Handling

Source systems that use CONTINUE HANDLER for SQLWARNING conditions present special challenges:

* Snowflake does not distinguish between warnings and errors in the same way
* Warnings in source systems may be errors in Snowflake
* Manual review of warning handling logic is recommended

#### Example

##### Source Pattern

```sql
DECLARE CONTINUE HANDLER FOR SQLWARNING
BEGIN
    INSERT INTO warning_log VALUES (SQLCODE, 'Warning occurred');
END;
```

##### Snowflake

```sql
-- Warning handling may need to be implemented through validation logic
BEGIN
    -- Perform validation before operation
    IF EXISTS (SELECT 1 FROM table1 WHERE condition) THEN
        INSERT INTO warning_log VALUES (0, 'Warning occurred');
    END IF;
EXCEPTION
    WHEN OTHER THEN
        -- Handle actual errors
        INSERT INTO error_log VALUES (:SQLCODE, :SQLERRM);
END;
```

## Best Practices

When working with converted CONTINUE HANDLER code in Snowflake:

1. **Test Thoroughly**: Verify that error handling behavior matches the original system’s behavior.
2. **Review Performance**: Multiple exception blocks can impact performance; consider refactoring where appropriate.
3. **Validate Error Conditions**: Ensure that all error conditions from the source system are properly handled.
4. **Use Transactions**: Leverage Snowflake’s transaction support for data consistency.
5. **Monitor Execution**: Use Snowflake’s logging capabilities to track exception handling.

## Related Documentation

* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [Redshift Exception Handling](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-messages-errors)
* [CREATE PROCEDURE](rs-sql-statements-create-procedure.md)

## See Also

* [EXCEPTION](rs-sql-statements-create-procedure.md)
* [RAISE](rs-sql-statements-create-procedure.md)
* [DECLARE](rs-sql-statements-create-procedure.md)

---
title: SnowConvert AI - Redshift - CREATE PROCEDURE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/rs-sql-statements-create-procedure.md
section: Migrations
---

# SnowConvert AI - Redshift - CREATE PROCEDURE

## Description

> Creates a new stored procedure or replaces an existing procedure for the current database. ([Redshift SQL Language Reference Create Procedure](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_PROCEDURE.html)).

See the following definitions for more information about procedure clauses:

* ARGUMENTS MODE
* POSITIONAL ARGUMENTS
* NONATOMIC
* PROCEDURE BODY
* SECURITY (DEFINER | INVOKER)

## Grammar Syntax

The following is the SQL syntax to create a Procedure in Amazon Redshift. See the [Redshift CREATE PROCEDURE specification](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_PROCEDURE.html) for this syntax.

```sql
 CREATE [ OR REPLACE ] PROCEDURE sp_procedure_name
  ( [ [ argname ] [ argmode ] argtype [, ...] ] )
[ NONATOMIC ]
AS $$
  procedure_body
$$ LANGUAGE plpgsql
[ { SECURITY INVOKER | SECURITY DEFINER } ]
[ SET configuration_parameter { TO value | = value } ]
```

## Sample Source Patterns

### Input Code:

#### Redshift

```sql
 CREATE PROCEDURE TEST_PROCEDURE()
LANGUAGE PLPGSQL
AS
$$
BEGIN
    NULL;
END;
$$;
```

#### Output Code:

##### Snowflake

```sql
 CREATE PROCEDURE TEST_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/07/2025",  "domain": "test" }}'
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

## Related EWIs

There are no issues for this transformation.

## ALIAS DECLARATION

### Description

If the stored procedure’s signature omits the argument name, you can declare an alias for the argument.

There is no support for this in Snowflake.

To achieve functional equivalence, aliases will be removed, and all usages will be renamed.

When an alias is declared for a parameter nameless, a generated name will be created for the parameter and the usages. When the alias is for a parameter with name the alias will be replaced by the real parameter name.

### Grammar Syntax

```sql
 name ALIAS FOR $n;
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE test_procedure (integer)
LANGUAGE plpgsql
AS
$$
DECLARE
    first_alias ALIAS  FOR $1;
    second_alias ALIAS  FOR $1;
BEGIN
   INSERT INTO t1
   VALUES (first_alias + 1);
   INSERT INTO t1
   VALUES (second_alias + 2);
END;
$$;

--Notice the parameter already has a name
--and we are defining two alias to the same parameter
CREATE OR REPLACE PROCEDURE test_procedure (PARAMETER1 integer)
LANGUAGE plpgsql
AS
$$
DECLARE
    first_alias ALIAS  FOR $1;
    second_alias ALIAS  FOR $1;
BEGIN
   INSERT INTO t1
   VALUES (first_alias + 1);
   INSERT INTO t1
   VALUES (second_alias + 2);
END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE test_procedure (SC_ARG1 integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
$$
BEGIN
   INSERT INTO t1
   VALUES (:SC_ARG1 + 1);
   INSERT INTO t1
   VALUES (:SC_ARG1 + 2);
END;
$$;

--Notice the parameter already has a name
--and we are defining two alias to the same parameter
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "t1" **
CREATE OR REPLACE PROCEDURE test_procedure (PARAMETER1 integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
$$
BEGIN
   INSERT INTO t1
   VALUES (:PARAMETER1 + 1);
   INSERT INTO t1
   VALUES (:PARAMETER1 + 2);
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## ARGUMENTS MODE

### Description

Amazon Redshift stored procedures support parameters that can be passed during procedure invocation. These parameters allow you to provide input values, retrieve output values, or use them for input and output operations. Below is a detailed explanation of the types of parameters, their modes, and examples of their usage. Snowflake only supports input values.

#### IN (Input Parameters)

Purpose: Used to pass values into the procedure.

Default Mode: If no mode is specified, parameters are considered IN.

Behavior: Values passed to the procedure cannot be modified inside the procedure.

##### OUT (Output Parameters)

Purpose: Used to return values from the procedure.

Behavior: Parameters can be modified inside the procedure and are returned to the caller. You cannot send an initial value.

##### INOUT (Input/Output Parameters)

Purpose: Used to pass values into the procedure and modify them to return updated values.

Behavior: Combines the behavior of IN and OUT. You must send an initial value regardless of the output.

### Grammar Syntax

```sql
 [ argname ] [ argmode ] argtype
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
CREATE OR REPLACE PROCEDURE SP_PARAMS(
IN PARAM1 INTEGER,
OUT PARAM2 INTEGER,
INOUT PARAM3 INTEGER)
AS
$$
    BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP_PARAMS (PARAM1 INTEGER, PARAM2 OUT INTEGER, PARAM3 OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SCC-EWI-0028](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) : Type not supported by Snowflake.
2. [SSC-EWI-RS0010](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): Top-level procedure call with out parameters is not supported.

## PROCEDURE BODY

> **Hint:**
>
> SnowConvert does not support translation for PostgreSQL string constant definition in procedures.
> Use [arrange](../../general/getting-started/running-snowconvert/conversion/postgresql-conversion-settings.md) option

### Description

Like Redshift, Snowflake supports CREATE PROCEDURE using $$ procedure_logic $$ as the body. There is a difference in the Redshift syntax where a word can be inside the $$ like $word$ and used as a delimiter body like $word$ procedure_logic $word$. SnowConvert AI will transform it by removing the word, leaving the $$.

### Grammar Syntax

```sql
 AS
$Alias$
  procedure_body
$Alias$
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE SP()
AS
$somename$
BEGIN
   NULL;
END;
$somename$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/07/2025",  "domain": "test" }}'
AS
$$
   BEGIN
      NULL;
   END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## BLOCK STATEMENT

### Description

PL/pgSQL is a block-structured language. The complete body of a procedure is defined in a block, which contains variable declarations and PL/pgSQL statements. A statement can also be a nested block, or subblock.

### Grammar Syntax

```sql
 [ <<label>> ]
[ DECLARE
  declarations ]
BEGIN
  statements
EXCEPTION
  WHEN OTHERS THEN
    statements
END [ label ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE MY_PROCEDURE()
AS
$$
    BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE MY_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## DECLARE

### Description

Section to declare all the procedure variables except for loop variables.
Redshift supports multiple DECLARE sections per block statement, since Snowflake does not support this behavior they must be merged into a single declaration statement per block.

### Grammar Syntax

```sql
 [ DECLARE declarations ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE first_procedure (first_parameter integer)
LANGUAGE plpgsql
    AS
$$
DECLARE
    i int := first_parameter;
BEGIN
   select i;
END;
$$;

CREATE OR REPLACE PROCEDURE second_procedure (first_parameter integer)
LANGUAGE plpgsql
    AS
$$
DECLARE
    i int := first_parameter;
DECLARE
    j int := first_parameter;
BEGIN
   select i;
END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE first_procedure (first_parameter integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/11/2025",  "domain": "test" }}'
    AS
$$
   DECLARE
      i int := first_parameter;
BEGIN
   select i;
END;
$$;

CREATE OR REPLACE PROCEDURE second_procedure (first_parameter integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/11/2025",  "domain": "test" }}'
    AS
$$
   DECLARE
      i int := first_parameter;
      j int := first_parameter;
BEGIN
   select i;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## EXCEPTION

### Description

When an exception occurs, and you add an exception-handling block, you can write RAISE statements and most other PL/pgSQL statements. For example, you can raise an exception with a custom message or insert a record into a logging table.

### Grammar Syntax

```sql
 EXCEPTION
  WHEN OTHERS THEN
    statements
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE update_employee_sp() AS
$$
BEGIN
    select var;
EXCEPTION WHEN OTHERS THEN
    RAISE INFO 'An exception occurred.';
END;
$$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE update_employee_sp ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
$$
BEGIN
    select var;
EXCEPTION WHEN OTHER THEN
        CALL RAISE_MESSAGE_UDF('INFO', 'An exception occurred.');
        RAISE;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## LABEL

### Description

Labels are used in Redshift to qualify a block or to use the EXIT or END statement. Snowflake does not support labels.

> **Warning:**
>
> Since labels are not supported in Snowflake, an EWI will be printed.

### Grammar Syntax

```sql
 [<<label>>]
BEGIN
    ...
END [label]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE test_procedure (first_parameter integer)
LANGUAGE plpgsql
AS
$$
    <<Begin_block_label>>
BEGIN
   INSERT INTO my_test_table
   VALUES (first_parameter);
END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE test_procedure (first_parameter integer)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
$$
   !!!RESOLVE EWI!!! /*** SSC-EWI-0094 - LABEL DECLARATION FOR A STATEMENT IS NOT SUPPORTED BY SNOWFLAKE SCRIPTING <<Begin_block_label>> ***/!!!
BEGIN
   INSERT INTO my_test_table
   VALUES (:first_parameter);
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-EWI-0094](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Label declaration not supported

## NONATOMIC

### Description

The NONATOMIC commits after each statement in the stored procedure. Snowflake supports an AUTOCOMMIT parameter. The default setting for AUTOCOMMIT is TRUE (enabled).

While AUTOCOMMIT is enabled, Each statement outside an explicit transaction is treated as inside its implicit single-statement transaction. In other words, that statement is automatically committed if it succeeds and automatically rolled back if it fails. In other words, Snowflake works as NONATOMIC “by default”.

### Grammar Syntax

```sql
 NONATOMIC
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE SP_NONATOMIC()
NONATOMIC
AS
$$
    BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP_NONATOMIC ()
RETURNS VARCHAR
----** SSC-FDM-RS0008 - SNOWFLAKE USES AUTOCOMMIT BY DEFAULT. **
--NONATOMIC
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## POSITIONAL ARGUMENTS

### Description

Redshift supports nameless parameters by referencing the parameters by their position using $. Snowflake does not support this behavior. To ensure functional equivalence, SnowConvert AI can convert those references by the parameter’s name if the name is present in the definition. If not, SnowConvert AI will generate a name for the parameter, and the uses will be replaced with the new name.

### Grammar Syntax

```sql
 $n
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE SP_POSITIONAL_REFERENCES(
INTEGER,
param2 INTEGER,
INTEGER)
AS
$$
    DECLARE
        localVariable INTEGER := 0;
    BEGIN
        localVariable := $2 + $3 + $1;
    END;
$$
LANGUAGE plpgsql;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP_POSITIONAL_REFERENCES (SC_ARG1
INTEGER,
param2 INTEGER, SC_ARG3 INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
$$
    DECLARE
        localVariable INTEGER := 0;
    BEGIN
        localVariable := param2 + SC_ARG3 + SC_ARG1;
    END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## RAISE

### Description

> Use the `RAISE level` statement to report messages and raise errors.
>
> ([Redshift SQL Language Reference RAISE](https://docs.aws.amazon.com/es_es/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-messages-errors))

> **Note:**
>
> RAISE are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 RAISE level 'format' [, variable [, ...]];
```

In Amazon Redshift, the `RAISE` statement is used to generate messages in the console or throw custom exceptions. Redshift allows you to specify different *levels* to indicate the severity of the message. In Snowflake, this functionality can be emulated using a user-defined function (UDF) that makes a call to the console depending on the specified level.

1. **Exception**:
   When the level is “EXCEPTION”, a custom exception is raised with a general message: *“To view the EXCEPTION MESSAGE, you need to check the log.”* The exception code is `-20002`, which informs the user that the custom message can be found in the logs. This is due to limitations when sending custom exceptions in Snowflake.
2. **Warning**:
   If the level is “WARNING”, `SYSTEM$LOG_WARN` is used to print the warning message to Snowflake’s log, which helps highlight potential issues without interrupting the flow of execution.
3. **Info**:
   For any other level (such as “INFO”), `SYSTEM$LOG_INFO` is used to print the message to the console log, providing more detailed feedback about the system’s state without causing critical disruptions.

This approach allows emulating Redshift’s severity levels functionality, adapting them to Snowflake’s syntax and features, while maintaining flexibility and control over the messages and exceptions generated during execution.

**Limitations**

* To view logs in Snowflake, it is necessary to have specific privileges, such as the `ACCOUNTADMIN` or `SECURITYADMIN` roles.
* Logs in Snowflake are not available immediately and may have a slight delay before the information is visible.
* Personalized error messages in exceptions are not displayed like in Redshift. To view custom messages, you must access the logs directly.

For further information, please refer to the following [page](https://docs.snowflake.com/developer-guide/logging-tracing/logging-snowflake-scripting).

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE raise_example(IN user_id INT)
LANGUAGE plpgsql
AS $$
BEGIN
	RAISE EXCEPTION 'User % not exists.', user_id;
END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE raise_example (user_id INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/11/2025",  "domain": "test" }}'
AS $$
BEGIN
	CALL RAISE_MESSAGE_UDF('EXCEPTION', 'User % not exists.', array_construct(:user_id));
END;
$$;
```

#### UDFs

##### RAISE_MESSAGE_UDF

```sql
 CREATE OR REPLACE PROCEDURE RAISE_MESSAGE_UDF(LEVEL VARCHAR, MESSAGE VARCHAR, ARGS VARIANT)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
    DECLARE
        MY_EXCEPTION EXCEPTION (-20002, 'To view the EXCEPTION MESSAGE, you need to check the log.');
        SC_RAISE_MESSAGE VARCHAR;
    BEGIN
        SC_RAISE_MESSAGE := STRING_FORMAT_UDF(MESSAGE, ARGS);
        IF (LEVEL = 'EXCEPTION') THEN
            SYSTEM$LOG_ERROR(SC_RAISE_MESSAGE);
            RAISE MY_EXCEPTION;
        ELSEIF (LEVEL = 'WARNING') THEN
            SYSTEM$LOG_WARN(SC_RAISE_MESSAGE);
            RETURN 'Warning printed successfully';
        ELSE
            SYSTEM$LOG_INFO(SC_RAISE_MESSAGE);
            RETURN 'Message printed successfully';
        END IF;
    END;
$$;
```

##### STRING_FORMAT_UDF

```sql
 CREATE OR REPLACE FUNCTION PUBLIC.STRING_FORMAT_UDF(PATTERN VARCHAR, ARGS VARIANT)
RETURNS VARCHAR
LANGUAGE JAVASCRIPT
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "udf",  "convertedOn": "02/11/2025",  "domain": "test" }}'
AS
$$
	var placeholder_str = "{%}";
	var result = PATTERN.replace(/(?<!%)%(?!%)/g, placeholder_str).replace("%%","%");
	for (var i = 0; i < ARGS.length; i++)
	{
		result = result.replace(placeholder_str, ARGS[i]);
	}
	return result;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## RETURN

### Description

> The RETURN statement returns back to the caller from a stored procedure. ([Redshift SQL Language Reference Return](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-return)).

The conversion of the return statement from Amazon Redshift to Snowflake is straightforward, only considering adding a `NULL` to the return statement on Snowflake.

### Grammar Syntax

```sql
 RETURN;
```

### Sample Source Patterns

#### Simple Case

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
AS
$$
BEGIN
   RETURN;
END
$$ LANGUAGE plpgsql;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/12/2025",  "domain": "test" }}'
AS
$$
BEGIN
  RETURN NULL;
END
$$;
```

#### When the procedure has out parameters

SnowConvert AI returns a variant with parameters set up as output parameters. So, for each return, SnowConvert AI will add a variant as a return value.

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (OUT output_value VARCHAR)
AS
$$
BEGIN
   RETURN;
END
$$ LANGUAGE plpgsql;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (output_value OUT VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS
$$
BEGIN
   RETURN NULL;
END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## SECURITY (DEFINER | INVOKER)

### Description

The SECURITY clause in Amazon Redshift stored procedures defines the access control and permissions context under which the procedure executes. This determines whether the procedure uses the privileges of the owner (creator) or the caller (user invoking the procedure).

### Grammar Syntax

```sql
 [ { SECURITY INVOKER | SECURITY DEFINER } ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE SP_SECURITY_INVOKER( )
AS
$$
    BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql
SECURITY INVOKER
;

CREATE OR REPLACE PROCEDURE SP_SECURITY_DEFINER( )
AS
$$
     BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql
SECURITY DEFINER;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP_SECURITY_INVOKER ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/07/2025",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        NULL;
    END;
$$
;

CREATE OR REPLACE PROCEDURE SP_SECURITY_DEFINER ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/07/2025",  "domain": "test" }}'
EXECUTE AS OWNER
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## VARIABLE DECLARATION

### Description

> Declare all variables in a block, except for loop variables, in the block’s DECLARE section.
>
> ([Redshift SQL Language Reference Variable Declaration](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-structure.html#r_PLpgSQL-variable-declaration))

> **Note:**
>
> Variable declarations are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 DECLARE
name [ CONSTANT ] type [ NOT NULL ] [ { DEFAULT | := } expression ];
```

In Redshift, the `CONSTANT` keyword prevents variable reassignment during execution. Since Snowflake does not support this keyword, it is removed during transformation. This does not impact functionality, as the logic should not attempt to reassign a constant variable.

The `NOT NULL` constraint in Redshift ensures a variable cannot be assigned a null value and requires a non-null default value. As Snowflake does not support this constraint, it is removed during transformation. However, the default value is retained to maintain functionality.

A variable declare with a Refcursor is transformed to Resultset type, for more information.

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_DECLARATION()
LANGUAGE plpgsql
AS $$
DECLARE
    v_simple_int INT;
    v_default_char CHAR(4) DEFAULT 'ABCD';
    v_default_float FLOAT := 10.00;
    v_constant_char CONSTANT CHAR(4) := 'ABCD';
    v_notnull VARCHAR NOT NULL DEFAULT 'Test default';
    v_refcursor REFCURSOR;
BEGIN
-- Procedure logic
END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_DECLARATION ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
        DECLARE
            v_simple_int INT;
            v_default_char CHAR(4) DEFAULT 'ABCD';
            v_default_float FLOAT := 10.00;
            v_constant_char CHAR(4) := 'ABCD';
            --** SSC-FDM-PG0012 - NOT NULL CONSTRAINT HAS BEEN REMOVED. ASSIGNING NULL TO THIS VARIABLE WILL NO LONGER CAUSE A FAILURE. **
            v_notnull VARCHAR DEFAULT 'Test default';
            v_refcursor RESULTSET;
BEGIN
            NULL;
-- Procedure logic
END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-PG0012](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): NOT NULL constraint has been removed. Assigning NULL to this variable will no longer cause a failure.

## TRANSACTIONS

## COMMIT

### Description

> Commits the current transaction to the database. This command makes the database updates from the transaction permanent. ([Redshift SQL Language Reference COMMIT](https://docs.aws.amazon.com/redshift/latest/dg/r_COMMIT.html))

Grammar Syntax

```none
COMMIT [WORK | TRANSACTION]
```

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

##### Snowflake

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

#### COMMIT with TRANSACTION keyword

The TRANSACTION keyword is not supported in Snowflake. However, since it does not have an impact on functionality it will just be removed.

##### Redshift

##### Query

```sql
 COMMIT TRANSACTION;
```

##### Snowflake

##### Query

```sql
 COMMIT;
```

#### COMMIT in a default transaction behavior procedure (without NONATOMIC clause)

To avoid out of scope transaction exceptions in Snowflake, the usages of COMMIT will be matched with BEGIN TRANSACTION.

When multiple COMMIT statements are present in the procedure, multiple BEGIN TRANSACTION statements will be generated after every COMMIT to emulate the Redshift transaction behavior.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test VALUES (a);
    COMMIT;
    INSERT INTO transaction_values_test VALUES (a + 1);
    COMMIT;
END
$$;

CALL transaction_test(120);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 120  |
| 121  |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a);
    COMMIT;
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a + 1);
    COMMIT;
END
$$;

CALL transaction_test(120);

SELECT * FROM
    transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 120  |
| 121  |
+------+
```

#### COMMIT in a procedure with NONATOMIC behavior

The NONATOMIC behavior from Redshift is emulated in Snowflake by using the session parameter AUTOCOMMIT set to true.

Since the AUTOCOMMIT session parameter is assumed to be true by SnowConvert AI, the COMMIT statement inside NONATOMIC procedures is left as is.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure(a int)
    NONATOMIC
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a + 2);
    INSERT INTO transaction_values_test values (a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 12   |
| 13   |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure (a int)
RETURNS VARCHAR
--    --** SSC-FDM-RS0008 - SNOWFLAKE USES AUTOCOMMIT BY DEFAULT. **
--    NONATOMIC
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    INSERT INTO transaction_values_test
    values (:a + 2);
    INSERT INTO transaction_values_test
    values (:a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM
transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 12   |
| 13   |
+------+
```

### Known Issues

**1. COMMIT inside a nested procedure call**

In Redshift, when a COMMIT statement is specified in a nested procedure call, the command will commit all pending work from previous statements in the current and parent scopes. Committing the parent scope actions is not supported in Snowflake, when this case is detected an FDM will be generated.

#### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test VALUES (a);
    COMMIT;
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    INSERT INTO transaction_values_test values (a + 1);
    INSERT INTO transaction_values_test values (a + 2);
    CALL transaction_test(a + 3);
END
$$;
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a);
    COMMIT;
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    INSERT INTO transaction_values_test
    values (:a);
    INSERT INTO transaction_values_test
    values (:a + 1);
    INSERT INTO transaction_values_test
    values (:a + 2);
    --** SSC-FDM-RS0006 - CALLED PROCEDURE CONTAINS USAGES OF COMMIT/ROLLBACK, MODIFYING THE CURRENT TRANSACTION IN CHILD SCOPES IS NOT SUPPORTED IN SNOWFLAKE **
    CALL transaction_test(:a + 3);
END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-FDM-RS0006](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Called procedure contains usages of COMMIT/ROLLBACK, modifying the current transaction in child scopes is not supported in Snowflake.

## ROLLBACK

### Description

> Stops the current transaction and discards all updates made by that transaction. ([Redshift SQL Language Reference ROLLBACK](https://docs.aws.amazon.com/redshift/latest/dg/r_ROLLBACK.html))

Grammar Syntax

```none
ROLLBACK [WORK | TRANSACTION]
```

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

##### Snowflake

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

#### ROLLBACK with TRANSACTION keyword

The TRANSACTION keyword is not supported in Snowflake. However, since it does not have an impact on functionality it will just be removed.

##### Redshift

##### Query

```sql
 ROLLBACK TRANSACTION;
```

##### Snowflake

##### Query

```sql
 ROLLBACK;
```

#### ROLLBACK in a default transaction behavior procedure (without NONATOMIC clause)

To avoid out of scope transaction exceptions in Snowflake, the usages of ROLLBACK will be matched with BEGIN TRANSACTION.

When multiple transaction control statements are present in the procedure, multiple BEGIN TRANSACTION statements will be generated after every each one of them to emulate the Redshift transaction behavior.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    COMMIT;
    insert into transaction_values_test values (80);
    insert into transaction_values_test values (55);
    ROLLBACK;
END
$$;

CALL transaction_test(120);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 120  |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test values (:a);
    COMMIT;
    BEGIN TRANSACTION;
    insert into transaction_values_test values (80);
    insert into transaction_values_test values (55);
    ROLLBACK;
END
$$;

CALL transaction_test(120);

SELECT * FROM
    transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 120  |
+------+
```

#### ROLLBACK in a procedure with NONATOMIC behavior

The NONATOMIC behavior from Redshift is emulated in Snowflake by using the session parameter AUTOCOMMIT set to true.

Since the AUTOCOMMIT session parameter is assumed to be true by SnowConvert AI, the ROLLBACK statement inside NONATOMIC procedures is left as is.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure(a int)
    NONATOMIC
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    INSERT INTO transaction_values_test values (a + 1);
    ROLLBACK;
    INSERT INTO transaction_values_test values (a + 2);
    INSERT INTO transaction_values_test values (a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 10   |
| 11   |
| 12   |
| 13   |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure (a int)
RETURNS VARCHAR
--    --** SSC-FDM-RS0008 - SNOWFLAKE USES AUTOCOMMIT BY DEFAULT. **
--    NONATOMIC
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    INSERT INTO transaction_values_test
    values (:a);
    INSERT INTO transaction_values_test
    values (:a + 1);
    ROLLBACK;
    INSERT INTO transaction_values_test
    values (:a + 2);
    INSERT INTO transaction_values_test
    values (:a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM
transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 10   |
| 11   |
| 12   |
| 13   |
+------+
```

### Known Issues

**1. ROLLBACK inside a nested procedure call**

In Redshift, when a ROLLBACK statement is specified in a nested procedure call, the command will commit all pending work from previous statements in the current and parent scopes. Committing the parent scope actions is not supported in Snowflake, when this case is detected an FDM will be generated.

#### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    ROLLBACK;
    INSERT INTO transaction_values_test values (a + 1);
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    CALL transaction_test(a + 3);
    COMMIT;
END
$$;
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test (a int)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a);
    ROLLBACK;
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a + 1);
    COMMIT;
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test (a int)
RETURNS VARCHAR
    LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a);
    --** SSC-FDM-RS0006 - CALLED PROCEDURE CONTAINS USAGES OF COMMIT/ROLLBACK, MODIFYING THE CURRENT TRANSACTION IN CHILD SCOPES IS NOT SUPPORTED IN SNOWFLAKE **
    CALL transaction_test(:a + 3);
    COMMIT;
END
$$;
```

**2. ROLLBACK of DDL statements**

In Snowflake, DDL statements perform an implicit commit whenever they are executed inside a procedure, making effective all the work before executing the DDL as well as the DDL itself. This causes the ROLLBACK statement to not be able to discard any changes before that point, this issue will be informed using an FDM.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE rollback_ddl(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    CREATE TABLE someRollbackTable
    (
        col1 INTEGER
    );

    INSERT INTO someRollbackTable values (a);
    ROLLBACK;
END
$$;
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE rollback_ddl (a int)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a);
    CREATE TABLE someRollbackTable
    (
        col1 INTEGER
    );
    BEGIN TRANSACTION;
    INSERT INTO someRollbackTable
    values (:a);
    --** SSC-FDM-RS0007 - DDL STATEMENTS PERFORM AN AUTOMATIC COMMIT IN SNOWFLAKE. ROLLBACK WILL NOT UNDO DDL-COMMITTED CHANGES. **
    ROLLBACK;
END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-FDM-RS0006](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Called procedure contains usages of COMMIT/ROLLBACK, modifying the current transaction in child scopes is not supported in Snowflake.
2. [SSC-FDM-RS0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): DDL statements perform an automatic COMMIT in Snowflake. ROLLBACK will not undo DDL-committed changes.

## TRUNCATE

### Description

> Deletes all of the rows from a table without doing a table scan ([Redshift SQL Language Reference TRUNCATE](https://docs.aws.amazon.com/redshift/latest/dg/r_TRUNCATE.html))

Grammar Syntax

```none
TRUNCATE [TABLE] table_name
```

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

##### Snowflake

##### Query

```sql
 CREATE TABLE transaction_values_test
(
    col1 INTEGER
);
```

#### TRUNCATE in a default transaction behavior procedure (without NONATOMIC clause)

Since the TRUNCATE statement automatically commits the transaction it is executed in, any of its usages will generate a COMMIT statement in Snowflake to emulate this behavior.

Since a COMMIT statement is generated the same BEGIN TRANSACTION statement generation will be applied to TRUNCATE. For more information check the COMMIT translation specification.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE truncate_in_procedure(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test VALUES (a);
    TRUNCATE TABLE transaction_values_test;
    INSERT INTO transaction_values_test VALUES (a + 12);
    COMMIT;
END
$$;

CALL truncate_in_procedure(10);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 22   |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE truncate_in_procedure (a int)
RETURNS VARCHAR
    LANGUAGE SQL
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a);
    TRUNCATE TABLE transaction_values_test;
    COMMIT;
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a + 12);
    COMMIT;
END
$$;

CALL truncate_in_procedure(10);

SELECT * FROM
    transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 22   |
+------+
```

#### TRUNCATE in a procedure with NONATOMIC behavior

The NONATOMIC behavior from Redshift is emulated in Snowflake by using the session parameter AUTOCOMMIT set to true.

Since the AUTOCOMMIT session parameter is assumed to be true by SnowConvert AI, the TRUNCATE statement inside NONATOMIC procedures is left as is, there is no need to generate a COMMIT statement because every statement is automatically committed when executed.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure(a int)
    NONATOMIC
    LANGUAGE plpgsql
    AS $$
BEGIN
    TRUNCATE TABLE transaction_values_test;
    INSERT INTO transaction_values_test values (a);
    INSERT INTO transaction_values_test values (a + 1);
    ROLLBACK;
    INSERT INTO transaction_values_test values (a + 2);
    INSERT INTO transaction_values_test values (a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 10   |
| 11   |
| 12   |
| 13   |
+------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE nonatomic_procedure (a int)
RETURNS VARCHAR
--    --** SSC-FDM-RS0008 - SNOWFLAKE USES AUTOCOMMIT BY DEFAULT. **
--    NONATOMIC
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    TRUNCATE TABLE transaction_values_test;
    INSERT INTO transaction_values_test
    values (:a);
    INSERT INTO transaction_values_test
    values (:a + 1);
    ROLLBACK;
    INSERT INTO transaction_values_test
    values (:a + 2);
    INSERT INTO transaction_values_test
    values (:a + 3);
    COMMIT;
END
$$;

CALL nonatomic_procedure(10);

SELECT * FROM
transaction_values_test;
```

##### Result

```none
+------+
| col1 |
+------+
| 10   |
| 11   |
| 12   |
| 13   |
+------+
```

### Known Issues

**1. TRUNCATE inside a nested procedure call**

In Redshift, when a COMMIT statement is specified in a nested procedure call, the command will commit all pending work from previous statements in the current and parent scopes. Committing the parent scope actions is not supported in Snowflake, when this case is detected an FDM will be generated.

#### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test VALUES (a);
    TRUNCATE TABLE transaction_values_test;
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test(a INT)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    INSERT INTO transaction_values_test values (a + 1);
    INSERT INTO transaction_values_test values (a + 2);
    CALL transaction_test(a + 3);
END
$$;
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    VALUES (:a);
    TRUNCATE TABLE transaction_values_test;
    COMMIT;
END
$$;

CREATE OR REPLACE PROCEDURE nested_transaction_test (a INT)
RETURNS VARCHAR
    LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    INSERT INTO transaction_values_test
    values (:a);
    INSERT INTO transaction_values_test
    values (:a + 1);
    INSERT INTO transaction_values_test
    values (:a + 2);
    --** SSC-FDM-RS0006 - CALLED PROCEDURE CONTAINS USAGES OF COMMIT/ROLLBACK, MODIFYING THE CURRENT TRANSACTION IN CHILD SCOPES IS NOT SUPPORTED IN SNOWFLAKE **
    CALL transaction_test(:a + 3);
END
$$;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-FDM-RS0006](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Called procedure contains usages of COMMIT/ROLLBACK, modifying the current transaction in child scopes is not supported in Snowflake.

## CONDITIONS

## CASE

### Description

> The `CASE` statement in Redshift lets you return values based on conditions, enabling conditional logic in queries. It has two forms: simple and searched. ([Redshift SQL Language Reference Conditionals: Case](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-conditionals-case)).

### Simple Case

A simple CASE statement provides conditional execution based on equality of operands.

> **Note:**
>
> Simple Case are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 CASE search-expression
WHEN expression [, expression [ ... ]] THEN
  statements
[ WHEN expression [, expression [ ... ]] THEN
  statements
  ... ]
[ ELSE
  statements ]
END CASE;
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE proc1(x INT)
LANGUAGE plpgsql
AS $$
BEGIN
  CASE x
WHEN 1, 2 THEN
  NULL;
ELSE
  NULL;
END CASE;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE proc1 (x INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/14/2025",  "domain": "test" }}'
AS $$
BEGIN
  CASE x
    WHEN 1 THEN
      NULL;
    WHEN 2 THEN
      NULL;
   ELSE
     NULL;
  END CASE;
END;
$$;
```

### Searched Case

> **Note:**
>
> Searched Case are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 CASE
WHEN boolean-expression THEN
  statements
[ WHEN boolean-expression THEN
  statements
  ... ]
[ ELSE
  statements ]
END CASE;
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE PROCEDURE PROC1 (paramNumber int)
LANGUAGE plpgsql
AS $$
DECLARE
    result VARCHAR(100);
BEGIN
CASE
  WHEN paramNumber BETWEEN 0 AND 10 THEN
    result := 'value is between zero and ten';
  WHEN paramNumber BETWEEN 11 AND 20 THEN
    result := 'value is between eleven and twenty';
  END CASE;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE PROCEDURE PROC1 (paramNumber int)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
    DECLARE
      result VARCHAR(100);
      case_not_found EXCEPTION (-20002, 'Case not found.');
BEGIN
CASE
  WHEN paramNumber BETWEEN 0 AND 10 THEN
    result := 'value is between zero and ten';
  WHEN paramNumber BETWEEN 11 AND 20 THEN
    result := 'value is between eleven and twenty';
  ELSE
    RAISE case_not_found;
  END CASE;
END;
$$;
```

#### CASE Without ELSE

In Redshift, when a `CASE` expression is executed and none of the validated conditions are met, and there is no `ELSE` defined, the exception ‘CASE NOT FOUND’ is triggered. In Snowflake, the code executes but returns no result. To maintain the same functionality in Snowflake in this scenario, an exception with the same name will be declared and executed if none of the `CASE` conditions are met.

> **Note:**
>
> Case Without Else are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (input_value INT)
AS $$
BEGIN
  CASE input_value
  WHEN 1 THEN
   NULL;
  END CASE;
END;
$$ LANGUAGE plpgsql;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (input_value INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
    DECLARE
      case_not_found EXCEPTION (-20002, 'Case not found.');
BEGIN
  CASE input_value
  WHEN 1 THEN
   NULL;
  ELSE
   RAISE case_not_found;
  END CASE;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## IF

### Description

> This statement allows you to make decisions based on certain conditions. ([Redshift SQL Language Reference Conditionals: IF](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-conditionals-if)).

SnowConvert AI will add the parenthesis in the conditions and change the keyword ELSIF by ELSEIF since Redshift does not require the parenthesis in the conditions and ELSIF is the keyword.

### Grammar Syntax

```sql
 IF boolean-expression THEN
  statements
[ ELSIF boolean-expression THEN
  statements
[ ELSIF boolean-expression THEN
  statements
    ...] ]
[ ELSE
  statements ]
END IF;
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE PROCEDURE PROC1 (paramNumber int)
LANGUAGE plpgsql
AS $$
DECLARE
    result VARCHAR(100);
BEGIN
    IF paramNumber = 0 THEN
      result := 'zero';
    ELSIF paramNumber > 0 THEN
      result := 'positive';
    ELSIF paramNumber < 0 THEN
      result := 'negative';
    ELSE
      result := 'NULL';
    END IF;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE PROCEDURE PROC1 (paramNumber int)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            result VARCHAR(100);
BEGIN
            IF (:paramNumber = 0) THEN
                result := 'zero';
            ELSEIF (:paramNumber > 0) THEN
                result := 'positive';
            ELSEIF (:paramNumber < 0) THEN
                result := 'negative';
              ELSE
                result := 'NULL';
            END IF;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## LOOPS

### Description

These statements are used to repeat a block of code until the specified condition. ([Redshift SQL Language Reference Loops](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-loops)).

CONTINUE
FOR
LOOP
WHILE
EXIT

## CONTINUE

### Description

> When the CONTINUE conditions are true, the loop can continue the execution, when is false stop the loop. ([Redshift SQL Language Reference Conditionals: CONTINUE](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-loops)).

> **Warning:**
>
> CONTINUE are partial supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 CONTINUE [ label ] [ WHEN expression ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (x INT)
    LANGUAGE plpgsql
AS $$
DECLARE
    i INTEGER := 0;
BEGIN
    <<simple_loop_when>>
    LOOP
        i := i + 1;
        CONTINUE WHEN i = 5;
        RAISE INFO 'i %', i;
        EXIT simple_loop_when WHEN (i >= x);
    END LOOP;
END;
$$;

CREATE OR REPLACE PROCEDURE procedure11 (x INT)
    LANGUAGE plpgsql
AS $$
DECLARE
    i INTEGER := 0;
BEGIN
    LOOP
        i := i + 1;
		IF (I = 5) THEN
        	CONTINUE;
		END IF;
        RAISE INFO 'i %', i;
        EXIT WHEN (i >= x);
    END LOOP;
END;
$$;
```

##### Results

| Console Output |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 6 |
| 7 |

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE procedure1 (x INT)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
    		DECLARE
    			i INTEGER := 0;
BEGIN
    			--** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
        i := i + 1;
        IF (:i = 5) THEN
        	CONTINUE;
        END IF;
        CALL RAISE_MESSAGE_UDF('INFO', 'i %', array_construct(:i));
        IF ((:i >= : x)) THEN
        	EXIT simple_loop_when;
        END IF;
    END LOOP simple_loop_when;
END;
$$;

CREATE OR REPLACE PROCEDURE procedure11 (x INT)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
    		DECLARE
    			i INTEGER := 0;
BEGIN
    			--** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
    LOOP
        i := i + 1;
		IF (:I = 5) THEN
        	CONTINUE;
		END IF;
        CALL RAISE_MESSAGE_UDF('INFO', 'i %', array_construct(:i));
        IF ((:i >= : x)) THEN
        	EXIT;
        END IF;
    END LOOP;
END;
$$;
```

##### Results

| Console Output |
| --- |
| 1 |
| 2 |
| 3 |
| 4 |
| 6 |
| 7 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## EXIT

### Description

> Stop the loop execution when the conditions defined in the WHEN statement are true ([Redshift SQL Language Reference Conditionals: EXIT](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-loops)).

> **Warning:**
>
> EXIT are partial supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 EXIT [ label ] [ WHEN expression ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop_when(x int)
LANGUAGE plpgsql
AS $$
DECLARE i INTEGER := 0;
BEGIN
  <<simple_loop_when>>
  LOOP
    RAISE INFO 'i %', i;
    i := i + 1;
    EXIT simple_loop_when WHEN (i >= x);
  END LOOP;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop_when (x int)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
    DECLARE
      i INTEGER := 0;
BEGIN
      --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
  LOOP
        CALL RAISE_MESSAGE_UDF('INFO', 'i %', array_construct(:i));
    i := i + 1;
        IF ((:i >= : x)) THEN
          EXIT simple_loop_when;
        END IF;
  END LOOP simple_loop_when;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## FOR

### Grammar Syntax

Integer variant

```sql
 [<<label>>]
FOR name IN [ REVERSE ] expression .. expression LOOP
  statements
END LOOP [ label ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
AS $$
BEGIN
  FOR i IN 1..10 LOOP
    NULL;
  END LOOP;

  FOR i IN REVERSE 10..1 LOOP
    NULL;
  END LOOP;
END;
$$ LANGUAGE plpgsql;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
  FOR i IN 1 TO 10
                   --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                   LOOP
    NULL;
  END LOOP;

  FOR i IN REVERSE 10 TO 1
                           --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                           LOOP
    NULL;
  END LOOP;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

1. [SSC-EWI-PG0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/postgresqlEWI.md): Reference a variable using the Label is not supported by Snowflake.

## LOOP

### Description

> A simple loop defines an unconditional loop that is repeated indefinitely until terminated by an EXIT or RETURN statement. ([Redshift SQL Language Reference Conditionals: Simple Loop](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-loops)).

> **Warning:**
>
> Simple Loop are partial supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 [<<label>>]
LOOP
  statements
END LOOP [ label ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop()
LANGUAGE plpgsql
AS $$
BEGIN
  <<simple_while>>
  LOOP
    RAISE INFO 'I am raised once';
    EXIT simple_while;
    RAISE INFO 'I am not raised';
  END LOOP;
  RAISE INFO 'I am raised once as well';
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
  --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
  LOOP
    CALL RAISE_MESSAGE_UDF('INFO', 'I am raised once');
    EXIT simple_while;
    CALL RAISE_MESSAGE_UDF('INFO', 'I am not raised');
  END LOOP simple_while;
  CALL RAISE_MESSAGE_UDF('INFO', 'I am raised once as well');
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## WHILE

### Grammar Syntax

```sql
 [<<label>>]
WHILE expression LOOP
  statements
END LOOP [ label ];
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop_when()
    LANGUAGE plpgsql
AS $$
DECLARE
    i INTEGER := 0;
BEGIN
    WHILE I > 5 AND I > 10 LOOP
        NULL;
    END LOOP;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE simple_loop_when ()
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
            DECLARE
                i INTEGER := 0;
BEGIN
                WHILE (:I > 5 AND : I > 10)
                                            --** SSC-PRF-0008 - PERFORMANCE REVIEW - LOOP USAGE **
                                            LOOP
        NULL;
    END LOOP;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## CURSORS

## CLOSE CURSOR

### Description

> Closes all of the free resources that are associated with an open cursor.. ([Redshift SQL Language Reference Close Cursor](https://docs.aws.amazon.com/redshift/latest/dg/close.html)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 CLOSE cursor
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test()
AS $$
BEGIN
   CLOSE cursor1;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test ()
RETURNS VARCHAR
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/05/2025",  "domain": "test" }}'
AS $$
BEGIN
   CLOSE cursor1;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## FETCH CURSOR

### Description

> Retrieves rows using a cursor. ([Redshift SQL Language reference Fetch](https://docs.aws.amazon.com/redshift/latest/dg/fetch.html))

Transformation information

```sql
 FETCH [ NEXT | ALL | {FORWARD [ count | ALL ] } ] FROM cursor

FETCH cursor INTO target [, target ...];
```

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE cursor_example
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

INSERT INTO cursor_example VALUES (10, 'hello');
```

##### Snowflake

##### Query

```sql
 CREATE TABLE cursor_example
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

INSERT INTO cursor_example VALUES (10, 'hello');
```

#### Fetch into

The FETCH into statement from Redshift is fully equivalent in Snowflake

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_into_example()
LANGUAGE plpgsql
AS $$
DECLARE my_cursor CURSOR FOR
        SELECT col1, col2
        FROM cursor_example;
        some_id INT;
        message VARCHAR(20);
BEGIN
    OPEN my_cursor;
    FETCH my_cursor INTO some_id, message;
    CLOSE my_cursor;
    INSERT INTO cursor_example VALUES (some_id * 10, message || ' world!');
END;
$$;

CALL fetch_into_example();

SELECT * FROM cursor_example;
```

##### Result

```none
+------+-------------+
| col1 | col2        |
+------+-------------+
| 10   | hello       |
| 100  | hello world!|
+------+-------------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_into_example ()
RETURNS VARCHAR
LANGUAGE SQL
AS $$
DECLARE
    my_cursor CURSOR FOR
    SELECT col1, col2
    FROM
    cursor_example;
    some_id INT;
    message VARCHAR(20);
BEGIN
    OPEN my_cursor;
    FETCH my_cursor INTO some_id, message;
    CLOSE my_cursor;
    INSERT INTO cursor_example
			VALUES (:some_id * 10, :message || ' world!');
END;
$$;

CALL fetch_into_example();

SELECT * FROM
	cursor_example;
```

##### Result

```none
+------+-------------+
| col1 | col2        |
+------+-------------+
| 10   | hello       |
| 100  | hello world!|
+------+-------------+
```

### Known Issues

**1. Fetch without target variables is not supported**

Snowflake requires the FETCH statement to specify the INTO clause with the variables where the fetched row values are going to be stored. When a FETCH statement is found in the code with no INTO clause an EWI will be generated.

Input Code:

```sql
 FETCH FORWARD FROM cursor1;
```

Output Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-PG0015 - FETCH CURSOR WITHOUT TARGET VARIABLES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FETCH FORWARD FROM cursor1;
```

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-EWI-PG0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/postgresqlEWI.md): Fetch cursor without target variables is not supported in Snowflake

## OPEN CURSOR

### Description

> Before you can use a cursor to retrieve rows, it must be opened. ([Redshift SQL Language Reference Open Cursor](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-cursors)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 OPEN bound_cursor_name [ ( argument_values ) ];
```

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE cursor_example
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

CREATE TABLE cursor_example_results
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

INSERT INTO cursor_example VALUES (10, 'hello');
```

##### Snowflake

##### Query

```sql
 CREATE TABLE cursor_example
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

CREATE TABLE cursor_example_results
(
	col1 INTEGER,
	col2 VARCHAR(20)
);

INSERT INTO cursor_example VALUES (10, 'hello');
```

#### Open cursor without arguments

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test()
AS $$
BEGIN
   OPEN cursor1;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test ()
RETURNS VARCHAR
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/05/2025",  "domain": "test" }}'
AS $$
BEGIN
   OPEN cursor1;
END;
$$;
```

#### Open cursor with arguments

Cursor arguments have to be bound per each one of its uses, SnowConvert AI will generate the bindings, as well as reorder and repeat the passed values to the OPEN statement as needed to satisfy the bindings.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE cursor_open_test()
LANGUAGE plpgsql
AS $$
DECLARE
    cursor2 CURSOR (val1 VARCHAR(20), val2 INTEGER) FOR SELECT col1 + val2, col2 FROM cursor_example where val1 = col2 and val2 > col1;
    res1 INTEGER;
    res2 VARCHAR(20);
BEGIN
    OPEN cursor2('hello', 50);
    FETCH cursor2 INTO res1, res2;
    CLOSE cursor2;
    INSERT INTO cursor_example_results VALUES (res1, res2);
END;
$$;

call cursor_open_test();

SELECT * FROM cursor_example_results;
```

##### Result

```none
+------+-------+
| col1 | col2  |
+------+-------+
| 60   | hello |
+------+-------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE cursor_open_test ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            cursor2 CURSOR FOR SELECT col1 + ?, col2 FROM
                cursor_example
            where
                ? = col2 and ? > col1;
            res1 INTEGER;
            res2 VARCHAR(20);
BEGIN
    OPEN cursor2 USING (50, 'hello', 50);
    FETCH cursor2 INTO res1, res2;
    CLOSE cursor2;
    INSERT INTO cursor_example_results
            VALUES (:res1, : res2);
END;
$$;

call cursor_open_test();
SELECT * FROM
cursor_example_results;
```

##### Result

```none
+------+-------+
| col1 | col2  |
+------+-------+
| 60   | hello |
+------+-------+
```

#### Open cursor with procedure parameters or local variables

The procedure parameters or local variables have to be bound per each one of its uses in the cursor query, SnowConvert AI will generate the bindings and add the parameter or variable names to the OPEN statement, even if the cursor originally had no parameters.

##### Redshift

##### Query

```sql
 CREATE OR REPLACE PROCEDURE cursor_open_test(someValue iNTEGER)
LANGUAGE plpgsql
AS $$
DECLARE
    charVariable VARCHAR(20) DEFAULT 'hello';
    cursor2 CURSOR FOR SELECT col1 + someValue, col2 FROM cursor_example where charVariable = col2 and someValue > col1;
    res1 INTEGER;
    res2 VARCHAR(20);
BEGIN
    OPEN cursor2;
    FETCH cursor2 INTO res1, res2;
    CLOSE cursor2;
    INSERT INTO cursor_example_results VALUES (res1, res2);
END;
$$;

call cursor_open_test(30);
```

##### Result

```none
+------+-------+
| col1 | col2  |
+------+-------+
| 40   | hello |
+------+-------+
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE PROCEDURE cursor_open_test (someValue iNTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            charVariable VARCHAR(20) DEFAULT 'hello';
            cursor2 CURSOR FOR SELECT col1 + ?, col2 FROM
                cursor_example
            where
                ? = col2 and ? > col1;
            res1 INTEGER;
            res2 VARCHAR(20);
BEGIN
    OPEN cursor2 USING (someValue, charVariable, someValue);
    FETCH cursor2 INTO res1, res2;
    CLOSE cursor2;
    INSERT INTO cursor_example_results
            VALUES (:res1, : res2);
END;
$$;

call cursor_open_test(30);
```

##### Result

```none
+------+-------+
| col1 | col2  |
+------+-------+
| 40   | hello |
+------+-------+
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## DECLARE CURSOR

### Description

> Defines a new cursor. Use a cursor to retrieve a few rows at a time from the result set of a larger query. ([Redshift SQL Language Reference Declare Cursor](https://docs.aws.amazon.com/redshift/latest/dg/declare.html)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 name CURSOR [ ( arguments ) ] FOR query
```

### Sample Source Patterns

#### Input Code:

### Input Code:

#### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test()
AS $$
DECLARE
   -- Declare the cursor
   cursor1 CURSOR FOR SELECT 1;
   cursor2 CURSOR (key integer) FOR SELECT 2 where 1 = key;

BEGIN
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE cursor_test ()
RETURNS VARCHAR
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
      DECLARE
         -- Declare the cursor
         cursor1 CURSOR FOR SELECT 1;
         cursor2 CURSOR FOR SELECT 2 where 1 = ?;
BEGIN
         NULL;
END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## DECLARE REFCURSOR

### Description

> A `refcursor` data type simply holds a reference to a cursor. You can create a cursor variable by declaring it as a variable of type `refcursor`
>
> ([Redshift SQL Language Reference Refcursor Declaration](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-cursors))

> **Note:**
>
> Refcursor declarations are fully supported by [Snowflake](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#declaring-a-variable).

### Grammar Syntax

```sql
 DECLARE
name refcursor;
```

Since Snowflake does not support the `REFCURSOR` data type, its functionality is replicated by converting the `REFCURSOR` variable into a `RESULTSET` type. The query used to open the `REFCURSOR` is assigned to the `RESULTSET` variable, after which a new cursor is created and linked to the `RESULTSET` variable. Additionally, all references to the original `REFCURSOR` within the cursor logic are updated to use the new cursor, thereby replicating the original functionality.

### Sample Source Patterns

#### Case: Single use

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR()
LANGUAGE plpgsql
AS $$
DECLARE
  v_curs1 refcursor;
BEGIN
  OPEN v_curs1 FOR SELECT column1_name, column2_name FROM your_table;
-- Cursor logic
  CLOSE v_curs1;
 END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
  DECLARE
   v_curs1 RESULTSET;
BEGIN
   v_curs1 := (
    SELECT column1_name, column2_name FROM your_table
   );
   LET v_curs1_Resultset_1 CURSOR
   FOR
    v_curs1;
   OPEN v_curs1_Resultset_1;
-- Cursor logic
  CLOSE v_curs1_Resultset_1;
 END;
$$;
```

##### Case: Cursor with Dynamic Sql

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR_DYNAMIC(min_salary NUMERIC)
LANGUAGE plpgsql
AS $$
DECLARE
    cur refcursor;
    qry TEXT;
BEGIN
    qry := 'SELECT id, name FROM employees WHERE salary > ' || min_salary;

    OPEN cur FOR EXECUTE qry;
-- Cursor logic
    CLOSE cur;
END;
$$;

CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR_DYNAMIC2(min_salary NUMERIC)
LANGUAGE plpgsql
AS $$
DECLARE
    cur refcursor;
BEGIN
    OPEN cur FOR EXECUTE 'SELECT id, name FROM employees WHERE salary > ' || min_salary;
-- Cursor logic
    CLOSE cur;
END;
$$;
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR_DYNAMIC (min_salary NUMERIC)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
        DECLARE
            cur RESULTSET;
    qry TEXT;
BEGIN
    qry := 'SELECT id, name FROM employees WHERE salary > ' || min_salary;
            cur := (
                EXECUTE IMMEDIATE qry
            );
            LET cur_Resultset_1 CURSOR
            FOR
                cur;
            OPEN cur_Resultset_1;
-- Cursor logic
    CLOSE cur_Resultset_1;
END;
$$;

CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR_DYNAMIC2 (min_salary NUMERIC)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
        DECLARE
            cur RESULTSET;
BEGIN
            cur := (
                EXECUTE IMMEDIATE 'SELECT id, name FROM employees WHERE salary > ' || min_salary
            );
            LET cur_Resultset_2 CURSOR
            FOR
                cur;
            OPEN cur_Resultset_2;
-- Cursor logic
    CLOSE cur_Resultset_2;
END;
$$;
```

##### Case: Multiple uses:

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR()
LANGUAGE plpgsql
AS $$
DECLARE
  v_curs1 refcursor;
BEGIN
  OPEN v_curs1 FOR SELECT column1_name, column2_name FROM your_table;
-- Cursor logic
  CLOSE v_curs1;
  OPEN v_curs1 FOR SELECT column3_name, column4_name FROM your_table2;
-- Cursor logic
  CLOSE v_curs1;
 END;
$$;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE VARIABLE_REFCURSOR ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS $$
  DECLARE
   v_curs1 RESULTSET;
BEGIN
   v_curs1 := (
    SELECT column1_name, column2_name FROM your_table
   );
   LET v_curs1_Resultset_1 CURSOR
   FOR
    v_curs1;
   OPEN v_curs1_Resultset_1;
-- Cursor logic
  CLOSE v_curs1_Resultset_1;
   v_curs1 := (
    SELECT column3_name, column4_name FROM your_table2
   );
   LET v_curs1_Resultset_2 CURSOR
   FOR
    v_curs1;
   OPEN v_curs1_Resultset_2;
-- Cursor logic
  CLOSE v_curs1_Resultset_2;
 END;
$$;
```

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

---
title: SnowConvert AI - Redshift - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-sql-statements-create-table.md
section: Migrations
---

# SnowConvert AI - Redshift - CREATE TABLE

Create Table Syntax Grammar.

## Description

Creates a new table in the current database. You define a list of columns, which each hold data of a distinct type. The owner of the table is the issuer of the CREATE TABLE command.

For more information please refer to [`CREATE TABLE`](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) documentation.

## Grammar Syntax

```sql
 CREATE [ [LOCAL ] { TEMPORARY | TEMP } ] TABLE
[ IF NOT EXISTS ] table_name
( { column_name data_type [column_attributes] [ column_constraints ]
  | table_constraints
  | LIKE parent_table [ { INCLUDING | EXCLUDING } DEFAULTS ] }
  [, ... ]  )
[ BACKUP { YES | NO } ]
[table_attributes]

where column_attributes are:
  [ DEFAULT default_expr ]
  [ IDENTITY ( seed, step ) ]
  [ GENERATED BY DEFAULT AS IDENTITY ( seed, step ) ]
  [ ENCODE encoding ]
  [ DISTKEY ]
  [ SORTKEY ]
  [ COLLATE CASE_SENSITIVE | COLLATE CASE_INSENSITIVE  ]

and column_constraints are:
  [ { NOT NULL | NULL } ]
  [ { UNIQUE  |  PRIMARY KEY } ]
  [ REFERENCES reftable [ ( refcolumn ) ] ]

and table_constraints  are:
  [ UNIQUE ( column_name [, ... ] ) ]
  [ PRIMARY KEY ( column_name [, ... ] )  ]
  [ FOREIGN KEY (column_name [, ... ] ) REFERENCES reftable [ ( refcolumn ) ]

and table_attributes are:
  [ DISTSTYLE { AUTO | EVEN | KEY | ALL } ]
  [ DISTKEY ( column_name ) ]
  [ [COMPOUND | INTERLEAVED ] SORTKEY ( column_name [,...]) |  [ SORTKEY AUTO ] ]
  [ ENCODE AUTO ]
```

## BACKUP

### Description

Enables Amazon Redshift to automatically adjust the encoding type for all columns in the table to optimize query performance. In Snowflake, the concept of `BACKUP` as seen in other databases is not directly applicable. Snowflake automatically handles data backup and recovery through its built-in features like Time Travel and Fail-safe, eliminating the need for manual backup operations. For these reasons, the statement `BACKUP` is removed during the transformation process

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 BACKUP { YES | NO }
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
BACKUP YES;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## IF NOT EXISTS

### Description

In Amazon Redshift, `IF NOT EXISTS` is used in table creation commands to avoid errors if the table already exists. When included, it ensures that the table is created only if it does not already exist, preventing duplication and errors in your SQL script.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 IF NOT EXISTS
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

### Related EWIs

There are no known issues.

## LOCAL

### Description

In Amazon Redshift, `LOCAL TEMPORARY` or `TEMP` are used to create temporary tables that exist only for the duration of the session. These tables are session-specific and automatically deleted when the session ends. They are useful for storing intermediate results or working data without affecting the permanent database schema.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 LOCAL { TEMPORARY | TEMP }
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE LOCAL TEMPORARY TABLE table1 (
    col1 INTEGER
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE LOCAL TEMPORARY TABLE table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

### Related EWIs

There are no known issues.

## DISTKEY

### Description

In Amazon Redshift, `DISTKEY` is used to distribute data across cluster nodes to optimize query performance. Snowflake, however, automatically handles data distribution and storage without needing explicit distribution keys. Due to differences in architecture and data management approaches, Snowflake does not have a direct equivalent to Redshift’s `DISTKEY`.

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 DISTKEY ( column_name )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
DISTKEY (col1);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTKEY OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTKEY (col1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}';
```

### Related EWIs

1. [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

## DISTSTYLE

### Description

Keyword that defines the data distribution style for the whole table.

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 DISTSTYLE { AUTO | EVEN | KEY | ALL }
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
DISTSTYLE AUTO;

CREATE TABLE table2 (
    col1 INTEGER
)
DISTSTYLE EVEN;

CREATE TABLE table3 (
    col1 INTEGER
)
DISTSTYLE KEY
DISTKEY (col1);

CREATE TABLE table4 (
    col1 INTEGER
)
DISTSTYLE ALL;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTSTYLE AUTO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE AUTO
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';

CREATE TABLE table2 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTSTYLE EVEN OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE EVEN
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';

CREATE TABLE table3 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTSTYLE KEY OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE KEY
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;

CREATE TABLE table4 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTSTYLE ALL OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE ALL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

### Related EWIs

1. [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

## ENCODE

### Description

In Snowflake, defining `ENCODE` is unnecessary because it automatically handles data compression, unlike Redshift, which requires manual encoding settings. For this reason, the ENCODE statement is removed during migration.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 ENCODE AUTO
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
ENCODE AUTO;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## SORTKEY

### Description

The keyword that specifies that the column is the sort key for the table. In Snowflake, `SORTKEY` from Redshift can be migrated to `CLUSTER BY` because both optimize data storage for query performance. `CLUSTER BY` in Snowflake organizes data on specified columns, similar to how `SORTKEY` orders data in Redshift.

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 [COMPOUND | INTERLEAVED ] SORTKEY ( column_name [,...]) | [ SORTKEY AUTO ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 VARCHAR,
    col3 INTEGER,
    col4 INTEGER
)
COMPOUND SORTKEY (col1, col3);

CREATE TABLE table2 (
    col1 INTEGER
)
INTERLEAVED SORTKEY (col1);

CREATE TABLE table3 (
    col1 INTEGER
)
SORTKEY AUTO;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 VARCHAR,
    col3 INTEGER,
    col4 INTEGER
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF CLUSTER BY IN SNOWFLAKE MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY IN REDSHIFT. **
CLUSTER BY (col1, col3)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;

CREATE TABLE table2 (
    col1 INTEGER
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF CLUSTER BY IN SNOWFLAKE MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY IN REDSHIFT. **
CLUSTER BY (col1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;

CREATE TABLE table3 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - SORTKEY AUTO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--SORTKEY AUTO
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

### Related EWIs

1. [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.
2. [SSC-FDM-RS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): The performance of CLUSTER BY in Snowflake may vary compared to the performance of SORTKEY in Redshift.

## FOREIGN KEY

### Description

Constraint that specifies a foreign key constraint, which requires that a group of one or more columns of the new table must only contain values that match values in the referenced column or columns of some row of the referenced table.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

> **Warning:**
>
> The translation for Foreign Key will be delivered in the future.

### Grammar Syntax

```sql
 FOREIGN KEY (column_name [, ... ] ) REFERENCES reftable [ ( refcolumn )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table15 (
    col1 INTEGER,
    FOREIGN KEY (col1) REFERENCES table_test (col1)
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table15 (
    col1 INTEGER
--                ,
--    --** SSC-FDM-RS0003 - SNOWCONVERT AI TRANSLATION FOR REDSHIFT FOREIGN KEY CONSTRAINTS IS PENDING. **
--    FOREIGN KEY (col1) REFERENCES table_test (col1)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/26/2024" }}';
```

### Related EWIs

* [SSC-FDM-RSOOO3](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Foreign Key translation will be supported in the future.

## PRIMARY KEY

### Description

Specifies that a column or a number of columns of a table can contain only unique non-null values

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

> **Note:**
>
> In Snowflake, unique, primary and foreign keys are used for documentation and do not enforce constraints or uniqueness. They help describe table relationships but don’t impact data integrity or performance.

### Grammar Syntax

```sql
 PRIMARY KEY ( column_name [, ... ] )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 INTEGER,
    PRIMARY KEY (col1)
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 INTEGER,
    PRIMARY KEY (col1)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## UNIQUE

### Description

Specifies that a group of one or more columns of a table can contain only unique values.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

> **Note:**
>
> In Snowflake, unique, primary and foreign keys are used for documentation and do not enforce constraints or uniqueness. They help describe table relationships but don’t impact data integrity or performance.

### Grammar Syntax

```sql
 UNIQUE ( column_name [, ... ] )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 INTEGER,
    UNIQUE ( col1, col2 )
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER,
    col2 INTEGER,
    UNIQUE ( col1, col2 )
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## NOT NULL | NULL

### Description

NOT NULL specifies that the column isn’t allowed to contain null values. NULL, the default, specifies that the column accepts null values.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 NOT NULL | NULL
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER NOT NULL,
    col2 INTEGER NULL
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER NOT NULL,
    col2 INTEGER NULL
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## REFERENCES

### Description

Specifies a foreign key constraint, which implies that the column must contain only values that match values in the referenced column of some row of the referenced table

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 REFERENCES reftable [ ( refcolumn ) ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER REFERENCES table_test (col1)
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER REFERENCES table_test (col1)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## UNIQUE | PRIMARY KEY

### Description

Specifies that the column can contain only unique values. In Snowflake, both UNIQUE and PRIMARY KEY are used to document and structure data, but they do not have active data validation functionality in the sense that you might expect in other database systems that enforce these restrictions at the storage level.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

> **Note:**
>
> In Snowflake, unique, primary and foreign keys are used for documentation and do not enforce constraints or uniqueness. They help describe table relationships but don’t impact data integrity or performance.

### Grammar Syntax

```sql
 UNIQUE | PRIMARY KEY
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER PRIMARY KEY,
    col2 INTEGER UNIQUE
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER PRIMARY KEY,
    col2 INTEGER UNIQUE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## COLLATE

### Description

Specifies whether string search or comparison on the column is CASE_SENSITIVE or CASE_INSENSITIVE.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

> **Note:**
>
> The default collation language is English. If your database uses a different language, please update the ‘en-’ prefix to match your database’s language. For more information, please refer to this [link](https://docs.snowflake.com/en/sql-reference/collation#label-collation-specification).

### Grammar Syntax

```sql
 COLLATE CASE_SENSITIVE | COLLATE CASE_INSENSITIVE
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 TEXT COLLATE CASE_SENSITIVE,
    col2 TEXT COLLATE CASE_INSENSITIVE
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 TEXT COLLATE 'en-cs',
    col2 TEXT COLLATE 'en-ci'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Known issues

There are no known issues.

## DEFAULT

### Description

Assigns a default data value for the column.

See the [Redshift CREATE TABLE DEFAULT clause documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html#create-table-default) for this syntax.

### Grammar Syntax

```sql
 DEFAULT default_expr
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER DEFAULT 1
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER DEFAULT 1
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## DISTKEY

### Description

In Amazon Redshift, `DISTKEY` is used to distribute data across cluster nodes to optimize query performance. Snowflake, however, automatically handles data distribution and storage without needing explicit distribution keys. Due to differences in architecture and data management approaches, Snowflake does not have a direct equivalent to Redshift’s `DISTKEY`. For these reasons, the statement `DISTKEY` is removed during the transformation process

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 DISTKEY
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER DISTKEY
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## ENCODE

### Description

The compression encoding for a column. In Snowflake, defining `ENCODE` is unnecessary because it automatically handles data compression, unlike Redshift, which requires manual encoding settings. For this reason, the ENCODE statement is removed during migration.

See the [Redshift CREATE TABLE ENCODE clause documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html#create-table-encode) for this syntax.

### Grammar Syntax

```sql
 ENCODE encoding
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER ENCODE DELTA
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## GENERATED BY DEFAULT AS IDENTITY

### Description

Specifies that the column is a default IDENTITY column and enables you to automatically assign a unique value to the column.

See the [Redshift IDENTITY column documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html#identity-generated-bydefault-clause) for this syntax.

### Grammar Syntax

```sql
 GENERATED BY DEFAULT AS IDENTITY ( seed, step )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER GENERATED BY DEFAULT AS IDENTITY(1,1)
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER IDENTITY(1,1) ORDER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

### Related EWIs

There are no known issues.

## IDENTITY

### Description

> Clause that specifies that the column is an IDENTITY column. ([RedShift SQL Language Reference Identity](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html#identity-clause)).

### Grammar Syntax

```sql
 IDENTITY ( seed, step )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    doc INTEGER,
    id1 INTEGER IDENTITY(1,1),
    id2 INTEGER  DEFAULT "identity"(674435, 0, ('5,3'::character varying)::text),
    id3 INTEGER  DEFAULT default_identity(963861, 1, '1,2'::text),
    id4 INTEGER  DEFAULT "default_identity"(963861, 1, '1,6'::text)
);

INSERT INTO table1 (doc) VALUES (1),(2),(3);

SELECT * FROM table1;
```

##### Results

| DOC | ID1 | ID2 | ID3 | ID4 |
| --- | --- | --- | --- | --- |
| 1 | 1 | 5 | 1 | 1 |
| 2 | 2 | 8 | 3 | 7 |
| 3 | 3 | 11 | 5 | 13 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    doc INTEGER,
    id1 INTEGER IDENTITY(1,1) ORDER,
    id2 INTEGER IDENTITY(5,3) ORDER,
    id3 INTEGER IDENTITY(1,2) ORDER,
    id4 INTEGER IDENTITY(1,6) ORDER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "12/04/2024",  "domain": "test" }}';

INSERT INTO table1 (doc) VALUES (1),(2),(3);

SELECT * FROM
 table1;
```

##### Results

| DOC | ID1 | ID2 | ID3 | ID4 |
| --- | --- | --- | --- | --- |
| 1 | 1 | 5 | 1 | 1 |
| 2 | 2 | 8 | 3 | 7 |
| 3 | 3 | 11 | 5 | 13 |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

## SORTKEY

### Description

The keyword that specifies that the column is the sort key for the table. In Snowflake, `SORTKEY` from Redshift can be migrated to `CLUSTER BY` because both optimize data storage for query performance. `CLUSTER BY` in Snowflake organizes data on specified columns, similar to how `SORTKEY` orders data in Redshift.

See the [Redshift data sorting documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Sorting_data.html) for this syntax.

### Grammar Syntax

```sql
 SORTKEY
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER SORTKEY
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF CLUSTER BY IN SNOWFLAKE MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY IN REDSHIFT. **
CLUSTER BY (col1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

### Known issues

1. [SSC-FDM-RS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): The performance of CLUSTER BY in Snowflake may vary compared to the performance of SORTKEY in Redshift.

---
title: SnowConvert AI - Redshift - CREATE TABLE AS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-sql-statements-create-table-as.md
section: Migrations
---

# SnowConvert AI - Redshift - CREATE TABLE AS

Create Table As Syntax Grammar.

## Description

Creates a new table based on a query. The owner of this table is the user that issues the command.

For more information please refer to [`CREATE TABLE AS`](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_AS.html) documentation.

## Grammar Syntax

```sql
 CREATE [ [ LOCAL ] { TEMPORARY | TEMP } ]
TABLE table_name
[ ( column_name [, ... ] ) ]
[ BACKUP { YES | NO } ]
[ table_attributes ]
AS query

where table_attributes are:
[ DISTSTYLE { AUTO | EVEN | ALL | KEY } ]
[ DISTKEY( distkey_identifier ) ]
[ [ COMPOUND | INTERLEAVED ] SORTKEY( column_name [, ...] ) ]
```

# SnowConvert AI - Redshift - Table Start

## BACKUP

### Description

Enables Amazon Redshift to automatically adjust the encoding type for all columns in the table to optimize query performance. In Snowflake, the concept of `BACKUP` as seen in other databases is not directly applicable. Snowflake automatically handles data backup and recovery through its built-in features like Time Travel and Fail-safe, eliminating the need for manual backup operations. For these reasons, the statement `BACKUP` is removed during the transformation process

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 BACKUP { YES | NO }
```

### Sample Source Patterns

#### NO option

An FDM is added since Snowflake, by default, always creates a backup of the created table.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1
BACKUP NO
AS SELECT * FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
----** SSC-FDM-RS0001 - BACKUP NO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--BACKUP NO
AS SELECT * FROM
table_test;
```

#### YES option

The option is removed since Snowflake, by default, applies a backup to the created table.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1
BACKUP YES
AS SELECT * FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
AS SELECT * FROM
table_test;
```

###

### Related EWIs

* [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

## COLUMNS

### Description

The name of a column in the new table. If no column names are provided, the column names are taken from the output column names of the query.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 ( column_name [, ... ] )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1
(
    col1, col2, col3
)
AS SELECT col1, col2, col3 FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
(
    col1, col2, col3
)
AS SELECT col1, col2, col3 FROM
        table_test;
```

### Related EWIs

There are no known issues.

## LOCAL

### Description

In Amazon Redshift, `LOCAL TEMPORARY` or `TEMP` are used to create temporary tables that exist only for the duration of the session. These tables are session-specific and automatically deleted when the session ends. They are useful for storing intermediate results or working data without affecting the permanent database schema.

See the [Redshift CREATE TABLE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html) for this syntax.

### Grammar Syntax

```sql
 LOCAL { TEMPORARY | TEMP }
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE LOCAL TEMP TABLE table1
AS SELECT FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE LOCAL TEMP TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
AS SELECT FROM
table_test;
```

### Related EWIs

There are no known issues.

# SnowConvert AI - Redshift - Table Attributes

## DISTKEY

### Description

In Amazon Redshift, `DISTKEY` is used to distribute data across cluster nodes to optimize query performance. Snowflake, however, automatically handles data distribution and storage without needing explicit distribution keys. Due to differences in architecture and data management approaches, Snowflake does not have a direct equivalent to Redshift’s `DISTKEY`. For these reasons, the statement `DISTKEY` is removed during the transformation process

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 DISTKEY ( column_name )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1
DISTKEY (col1)
AS SELECT * FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
----** SSC-FDM-RS0001 - DISTKEY OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTKEY (col1)
AS SELECT * FROM
table_test;
```

### Related EWIs

* [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

## DISTSTYLE

### Description

Keyword that defines the data distribution style for the whole table.

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 DISTSTYLE { AUTO | EVEN | KEY | ALL }
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1
DISTSTYLE AUTO
AS SELECT * FROM table_test;

CREATE TABLE table2
DISTSTYLE EVEN
AS SELECT * FROM table_test;

CREATE TABLE table3
DISTSTYLE ALL
AS SELECT * FROM table_test;

CREATE TABLE table4
DISTSTYLE KEY
DISTKEY (col1)
AS SELECT * FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
----** SSC-FDM-RS0001 - DISTSTYLE AUTO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE AUTO
AS SELECT * FROM
table_test;

CREATE TABLE table2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
----** SSC-FDM-RS0001 - DISTSTYLE EVEN OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE EVEN
AS SELECT * FROM
table_test;

CREATE TABLE table3
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
----** SSC-FDM-RS0001 - DISTSTYLE ALL OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE ALL
AS SELECT * FROM
table_test;

CREATE TABLE table4
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
----** SSC-FDM-RS0001 - DISTSTYLE KEY OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE KEY
----** SSC-FDM-RS0001 - DISTKEY OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTKEY (col1)
AS SELECT * FROM
table_test;
```

### Related EWIs

1. [SSC-FDM-RS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

## SORTKEY

### Description

The keyword that specifies that the column is the sort key for the table. In Snowflake, `SORTKEY` from Redshift can be migrated to `CLUSTER BY` because both optimize data storage for query performance. `CLUSTER BY` in Snowflake organizes data on specified columns, similar to how `SORTKEY` orders data in Redshift.

See the [Redshift data distribution documentation](https://docs.aws.amazon.com/redshift/latest/dg/t_Distributing_data.html) for this syntax.

### Grammar Syntax

```sql
 [ COMPOUND | INTERLEAVED ] SORTKEY( column_name [, ...] )
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1,
    col2,
    col3,
    col4
)
COMPOUND SORTKEY (col1, col3)
AS SELECT * FROM table_test;

CREATE TABLE table2 (
    col1
)
INTERLEAVED SORTKEY (col1)
AS SELECT * FROM table_test;

CREATE TABLE table3 (
    col1
)
SORTKEY (col1)
AS SELECT * FROM table_test;
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE table1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
(
    col1,
    col2,
    col3,
    col4
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY **
CLUSTER BY (col1, col3)
AS SELECT * FROM
        table_test;

CREATE TABLE table2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
(
    col1
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY **
CLUSTER BY (col1)
AS SELECT * FROM
        table_test;

CREATE TABLE table3
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
(
    col1
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF THE CLUSTER BY MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY **
CLUSTER BY (col1)
AS SELECT * FROM
        table_test;
```

### Related EWIs

1. [SSC-FDM-RS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): The performance of the CLUSTER BY may vary compared to the performance of Sortkey.

---
title: SnowConvert AI - Redshift - Data types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-data-types.md
section: Migrations
---

# SnowConvert AI - Redshift - Data types

Current Data types conversion for Redshift in SnowConvert AI.

Snowflake supports most basic [SQL data types](https://docs.snowflake.com/en/sql-reference/intro-summary-data-types) (with some restrictions) for use in columns, local variables, expressions, parameters, and any other appropriate/suitable locations.

## Numeric Data Types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| INT | INT | Snowflake’s INT is an alias for NUMBER. |
| INT2 | SMALLINT | Snowflake’s INT2 is an alias for NUMBER. |
| INT4 | INTEGER | Snowflake’s INT4 is an alias for NUMBER. |
| INT8 | INTEGER | Snowflake’s INT8 is an alias for NUMBER. |
| INTEGER | INTEGER | Snowflake’s INTEGER is an alias for NUMBER. |
| BIGINT | BIGINT | Snowflake’s BIGINT is an alias for NUMBER. |
| DECIMAL | DECIMAL | Snowflake’s DECIMAL is an alias for NUMBER. |
| DOUBLE PRECISION | DOUBLE PRECISION | Snowflake’s DOUBLE PRECISION is an alias for FLOAT. |
| NUMERIC​ | NUMERIC | Snowflake’s NUMERIC is an alias for NUMBER. |
| SMALLINT | SMALLINT | Snowflake’s SMALLINT is an alias for NUMBER. |
| FLOAT | FLOAT | Snowflake uses double-precision (64 bit) IEEE 754 floating-point numbers. |
| FLOAT4 | FLOAT4 | Snowflake’s FLOAT4 is an alias for FLOAT. |
| FLOAT8 | FLOAT8 | Snowflake’s FLOAT8 is an alias for FLOAT. |
| REAL | REAL​ | Snowflake’s REAL is an alias for FLOAT. |

## Character Types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| VARCHAR | VARCHAR | VARCHAR holds Unicode UTF-8 characters. If no length is specified, the default is the maximum allowed length (16,777,216). |
| CHAR | CHAR | Snowflake’s CHAR is an alias for VARCHAR. |
| CHARACTER | CHARACTER | Snowflake’s CHARACTER is an alias for VARCHAR. |
| NCHAR | NCHAR | Snowflake’s NCHAR is an alias for VARCHAR. |
| BPCHAR | VARCHAR | BPCHAR data type is **not supported** in Snowflake. VARCHAR is used instead. For more information please refer to [SSC-FDM-PG0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md). |
| NVARCHAR | NVARCHAR | Snowflake’s NVARCHAR is an alias for VARCHAR. |
| CHARACTER VARYING | CHARACTER VARYING | Snowflake’s CHARACTER VARYING is an alias for VARCHAR. |
| NATIONAL CHARACTER | NCHAR | Snowflake’s NCHAR is an alias for VARCHAR. |
| NATIONAL CHARACTER VARYING | NCHAR VARYING | Snowflake’s NCHAR VARYING is an alias for VARCHAR. |
| TEXT | TEXT | Snowflake’s TEXT is an alias for VARCHAR. |
| [NAME](https://www.postgresql.org/docs/current/datatype-character.html) (Special character type) | VARCHAR | VARCHAR holds Unicode UTF-8 characters. If no length is specified, the default is the maximum allowed length (16,777,216). |

> **Note:**
>
> When the MAX precision argument is present in the Redshift data types, they are transformed to the default max precision supported by Snowflake.

## Boolean Types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| BOOL | BOOLEAN |  |
| BOOLEAN | BOOLEAN |  |

## Binary Data Types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| VARBYTE | VARBINARY | VARBINARY is synonymous with BINARY. |
| VARBINARY | VARBINARY | VARBINARY is synonymous with BINARY. |
| BINARY | BINARY | The maximum length is 8 MB (8,388,608 bytes) |
| BINARY VARYING | BINARY VARYING | BINARY VARYING is synonymous with BINARY. |

> **Warning:**
>
> The maximum length for binary types in Redshift is 16 MB (16,777,216 bytes), however in [Snowflake](https://docs.snowflake.com/en/sql-reference/data-types-text#data-types-for-binary-strings) it is 8 MB (8,388,608 bytes). Please consider this reduction in the maximum length.

## Date & Time Data Types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| DATE | DATE | DATE accepts dates in the most common forms (such as `YYYY-MM-DD` and `DD-MON-YYYY`) |
| TIME | TIME | Storing times in the form of `HH:MI:SS`. Time precision can range from 0 (seconds) to 9 (nanoseconds). The default precision is 9. |
| TIMETZ | TIME | Time zone not supported for time data type. For more information please refer to [SSC-FDM-0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md). |
| TIME WITH TIME ZONE | TIME | Time zone not supported for time data type. For more information please refer to [SSC-FDM-0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md). |
| TIME WITHOUT TIME ZONE | TIME | Snowflake supports a single TIME data type for storing times in the form of `HH:MI:SS`. |
| TIMESTAMP | TIMESTAMP | Timestamp precision can range from 0 (seconds) to 9 (nanoseconds). |
| TIMESTAMPTZ | TIMESTAMP_TZ | TIMESTAMP_TZ internally stores UTC time together with an associated *time zone offset*. |
| TIMESTAMP WITH TIME ZONE | TIMESTAMP_TZ | TIMESTAMP_TZ internally stores UTC time together with an associated *time zone offset*. |
| TIMESTAMP WITHOUT TIME ZONE | TIMESTAMP_NTZ | TIMESTAMP_NTZ internally stores “wallclock” time with a specified precision. |
| INTERVAL YEAR TO MONTH | VARCHAR | The interval data type is not supported by Snowflake. Transformed to VARCHAR. With the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md), preserved as native `INTERVAL YEAR TO MONTH`. See [Interval Data Types](../general/interval-data-types.md). |
| INTERVAL DAY TO SECOND | VARCHAR | The interval data type is not supported by Snowflake. Transformed to VARCHAR. With the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md), preserved as native `INTERVAL DAY TO SECOND`. See [Interval Data Types](../general/interval-data-types.md). |

## Other data types

| Redshift | Snowflake | Notes |
| --- | --- | --- |
| GEOMETRY | GEOMETRY | The coordinates are represented as pairs of real numbers (x, y). Currently, only 2D coordinates are supported. |
| GEOGRAPHY | GEOGRAPHY | The GEOGRAPHY data type follows the WGS 84 standard. |
| HLLSKETCH | N/A | Data type not supported in Snowflake. For more information please refer to [SSC-EWI-RS0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md). |
| SUPER | VARIANT | Can contain a value of any other data type, including OBJECT and ARRAY values. |

## Related EWIs

1. [SSC-FDM-PG0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Bpchar converted to varchar.
2. [SSC-FDM-0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): TIME ZONE not supported for time data type.
3. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
4. [SSC-EWI-RS0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): HLLSKETCH data type not supported in Snowflake.

## INTERVAL DAY TO SECOND Data Type

### Description

> INTERVAL DAY TO SECOND specify an interval literal to define a duration in days, hours, minutes, and seconds. ([RedShift SQL Language Reference Interval data type](https://docs.aws.amazon.com/redshift/latest/dg/r_interval_data_types.html#r_interval_data_types-syntax))

By default, there is no equivalent for this data type in Snowflake and it is transformed to `VARCHAR`.

> **Note:**
>
> **Preview Feature:** When the `--UseIntervalDatatype` [preview flag](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled, Redshift INTERVAL columns are preserved as native Snowflake INTERVAL types. See the [Interval Data Types](../general/interval-data-types.md) translation reference for complete transformation details.

### Grammar Syntax

```sql
 INTERVAL day_to_second_qualifier [ (fractional_precision) ]

day_to_second_qualifier:
{ DAY | HOUR | MINUTE | SECOND | DAY TO HOUR | DAY TO MINUTE | DAY TO SECOND |
HOUR TO MINUTE | HOUR TO SECOND | MINUTE TO SECOND }
```

> **Warning:**
>
> The use of the Interval data type is planned for implementation in future updates.

### Sample Source Patterns

#### Interval Day to Second in Create Table

##### Input

##### Redshift

```sql
 CREATE TABLE interval_day_to_second_table
(
	interval_day_col1 INTERVAL DAY TO HOUR,
	interval_day_col2 INTERVAL DAY TO SECOND(4)
);

INSERT INTO interval_day_to_second_table(interval_day_col1) VALUES ( INTERVAL '1 2' DAY TO HOUR );
INSERT INTO interval_day_to_second_table(interval_day_col2) VALUES ( INTERVAL '1 2:3:4.56' DAY TO SECOND(4));
```

##### Output

##### Snowflake

```sql
 CREATE TABLE interval_day_to_second_table
(
	interval_day_col1 VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY TO HOUR DATA TYPE CONVERTED TO VARCHAR ***/!!!,
	interval_day_col2 VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY TO SECOND(4) DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"redshift"}}'
;

INSERT INTO interval_day_to_second_table(interval_day_col1) VALUES ('1days, 2hours');

INSERT INTO interval_day_to_second_table(interval_day_col2) VALUES ('1days, 2hours, 3mins, 4secs, 56ms');
```

The Interval value is transformed to a supported Snowflake format and then inserted as text inside the column. Since Snowflake does not support **Interval** as a data type, it is only supported in arithmetic operations. To use the value, it needs to be extracted and used as an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

**Original Redshift value:** `INTERVAL '1 2:3:4.567' DAY TO SECOND`

**Value stored in Snowflake column:** `'1days, 2hours, 3mins, 4secs, 56ms'`

**Value as Snowflake Interval constant:** `INTERVAL '1days, 2hours, 3mins, 4secs, 56ms'`

#### Retrieving data from an Interval Day to Second column

##### Input

##### Redshift

```sql
 SELECT * FROM interval_day_to_second_table;
```

##### Result

| interval_day_col1 | interval_day_col2 |
| --- | --- |
| 1 days 2 hours 0 mins 0.0 secs | NULL |
| NULL | 1 days 2 hours 3 mins 4.56 secs |

##### Output

##### Snowflake

```sql
 SELECT * FROM
interval_day_to_second_table;
```

##### Result

| interval_day_col1 | interval_day_col2 |
| --- | --- |
| 1d, 2h | NULL |
| NULL | 1d, 2h, 3m, 4s, 56ms |

### Known Issues

#### 1. Only arithmetic operations are supported

Snowflake Intervals have several limitations. Only arithmetic operations between `DATE` or `TIMESTAMP` and [Interval Constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) are supported, every other scenario is not supported.

### Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## INTERVAL YEAR TO MONTH Data Type

### Description

> INTERVAL YEAR TO MONTH specify an interval data type to store a duration of time in years and months. ([RedShift SQL Language Reference Interval data type](https://docs.aws.amazon.com/redshift/latest/dg/r_interval_data_types.html#r_interval_data_types-syntax))

There is no equivalent for this data type in Snowflake, it is currently transformed to VARCHAR.

### Grammar Syntax

```sql
 INTERVAL {YEAR | MONTH | YEAR TO MONTH}
```

> **Warning:**
>
> The use of the Interval data type is planned for implementation in future updates.

### Sample Source Patterns

#### Interval Year To Month in Create Table

##### Input:

##### Redshift

```sql
 CREATE TABLE interval_year_to_month_table
(
	interval_year_col1 INTERVAL YEAR,
	interval_year_col2 INTERVAL MONTH,
 	interval_year_col3 INTERVAL YEAR TO MONTH
);

INSERT INTO interval_year_to_month_table(interval_year_col1) VALUES ( INTERVAL '12' YEAR);
INSERT INTO interval_year_to_month_table(interval_year_col2) VALUES ( INTERVAL '5' MONTH);
INSERT INTO interval_year_to_month_table(interval_year_col3) VALUES ( INTERVAL '1000-11' YEAR TO MONTH );
```

##### Output

##### Snowflake

```sql
 CREATE TABLE interval_year_to_month_table
(
	interval_year_col1 VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR DATA TYPE CONVERTED TO VARCHAR ***/!!!,
	interval_year_col2 VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL MONTH DATA TYPE CONVERTED TO VARCHAR ***/!!!,
	interval_year_col3 VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR TO MONTH DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"redshift"}}'
;

INSERT INTO interval_year_to_month_table(interval_year_col1) VALUES ('12year, 0mons');

INSERT INTO interval_year_to_month_table(interval_year_col2) VALUES ('0year, 5mons');

INSERT INTO interval_year_to_month_table(interval_year_col3) VALUES ('1000year, 11mons');
```

The Interval value is transformed to a supported Snowflake format and then inserted as text inside the column. Since Snowflake does not support **Interval** as a data type, it is only supported in arithmetic operations. To use the value, it needs to be extracted and used as an [Interval constant](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) (if possible).

**Original Redshift value:** `INTERVAL '1-2' YEAR TO MONTH`

**Value stored in Snowflake column:** `'1y, 2m'`

**Value as Snowflake Interval constant:** `INTERVAL '1y, 2m'`

#### Retrieving data from an Interval Year To Month column

##### Input

##### Redshift

```sql
 SELECT * FROM interval_year_to_month_table;
```

##### Result

| interval_year_col1 | interval_year_col2 | interval_year_col2 |
| --- | --- | --- |
| 12 years 0 mons | NULL | NULL |
| NULL | 0 years 5 mons | NULL |
| NULL | NULL | 1000 years 11 mons |

##### Output

##### Snowflake

```sql
 SELECT * FROM
interval_year_to_month_table;
```

##### Result

| interval_year_col1 | interval_year_col2 | interval_year_col2 |
| --- | --- | --- |
| 12 y 0 mm | NULL | NULL |
| NULL | 0 y 5 mm | NULL |
| NULL | NULL | 1000 y 11 mons |

### Known Issues

#### 1. Only arithmetic operations are supported

Snowflake Intervals have several limitations. Only arithmetic operations between `DATE` or `TIMESTAMP` and [Interval Constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) are supported, every other scenario is not supported.

### Related EWIs

* [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## Numeric Format Models

### Description

These are the different Numeric Formats supported by [Redshift](https://docs.aws.amazon.com/redshift/latest/dg/r_Numeric_formating.html) and its equivalent in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql-format-models#numeric-format-models).

| Redshift | Snowflake | Comments |
| --- | --- | --- |
| 0 | 0 |  |
| 9 | 9 |  |
| . (period), D | . (period), D |  |
| , (comma) | , (comma) |  |
| CC |  | Currently there is no equivalent for Century Code in Snowflake. |
| FM | FM |  |
| PR |  | Currently there is no equivalent for this format in Snowflake. |
| S | S | Explicit numeric sign. |
| L | $ | Currency symbol placeholder. |
| G | G |  |
| MI | MI | Minus sign (for negative numbers) |
| PL | S | Currently there is no equivalent for plus sign in Snowflake. So it is translated to the explicit numeric sign. |
| SG | S | Explicit numeric Sign in the specified position. |
| RN |  | Currently there is no equivalent for Roman Numerals in Snowflake. |
| TH |  | Currently there is no equivalent for Ordinal suffix in Snowflake |

### Sample Source Patterns

#### Uses in To_Number function

##### Input:

##### Redshift

```sql
 select to_number('09423', '999999999') as multiple_nines
    , to_number('09423', '00000') as exact_zeros
    , to_number('123.456', '999D999') as decimals
    , to_number('123,031.30', 'FM999,999D999') as fill_mode
    , to_number('$ 12,454.88', '$999,999.99') as currency
;
```

##### Results

| multiple_nines | exact_zeros | decimals | fill_mode | currency |
| --- | --- | --- | --- | --- |
| 9423 | 9423 | 123.456 | 123031.30 | 1254.88 |

##### Output

##### Snowflake

```sql
 select to_number('09423', '999999999') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''999999999'' NODE ***/!!! as multiple_nines
    , to_number('09423', '00000') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''00000'' NODE ***/!!! as exact_zeros
    , to_number('123.456', '999D999') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''999D999'' NODE ***/!!! as decimals
    , to_number('123,031.30', 'FM999,999D999') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''FM999,999D999'' NODE ***/!!! as fill_mode
    , to_number('$ 12,454.88', '$999,999.99') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''$999,999.99'' NODE ***/!!! as currency
;
```

##### Results

| multiple_nines | exact_zeros | decimals | fill_mode | currency |
| --- | --- | --- | --- | --- |
| 9423 | 9423 | 123.456 | 123031.300 | 12454.88 |

##### Input:

##### Redshift

```sql
 select to_number('$ 12,454.88', 'FML99G999D99') as currency_L
    , to_number('123-', '999S') as signed_number_end
    , to_number('+12454.88', 'PL99G999D99') as plus_sign
    , to_number('-12,454.88', 'MI99G999D99') as minus_sign
    , to_number('-12,454.88', 'SG99G999D99') as signed_number
;
```

##### Results

| currency_L | signed_number_end | plus_sign | minus_sign | signed_number |
| --- | --- | --- | --- | --- |
| 12454.8 | -123 | 1254.88 | -12454.88 | -12454.88 |

##### Output:

##### Snowflake

```sql
 select to_number('$ 12,454.88', 'FML99G999D99') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - 'FML99G999D99' FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!! as currency_L
    , to_number('123-', '999S') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''999S'' NODE ***/!!! as signed_number_end
    , to_number('+12454.88', 'PL99G999D99') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - 'PL99G999D99' FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!! as plus_sign
    , to_number('-12,454.88', 'MI99G999D99') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''MI99G999D99'' NODE ***/!!! as minus_sign
    , to_number('-12,454.88', 'SG99G999D99') !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR ''SG99G999D99'' NODE ***/!!! as signed_number
;
```

##### Results

| currency_L | signed_number_end | plus_sign | minus_sign | signed_number |
| --- | --- | --- | --- | --- |
| 12454.8 | -123 | 1254.88 | -12454.88 | -12454.88 |

#### Uses in To_Char function

##### Input:

##### Redshift

```sql
 select to_char(-123, '999S') as signed_number
    , to_char(12454.88, 'FM99G999D99') as decimal_number
    , to_char(-12454.88, '99G999D99') as negative
    , to_char(-12454.88, 'MI99G999D99') as minus_sign
    , to_char(+12454.88, 'PL99G999D99') as plus_sign
    , to_char(09423, '999999999') as multiple_nines
    , to_char(09423, '00000') as exact_zeros
;
```

##### Results

| signed_number | decimal_number | negative | minus_sign | plus_sign | multiple_ninesmultiple_nines | exact_zerosexact_zeros |
| --- | --- | --- | --- | --- | --- | --- |
| '123-' | '12,454.88' | '-12,454.88' | '12454.88' | '-12,454.88' | '09423' | '09423' |

##### Output:

##### Snowflake

```sql
 select
    TO_CHAR(-123, '999S') as signed_number,
    TO_CHAR(12454.88, 'FM99G999D99') as decimal_number,
    TO_CHAR(-12454.88, '99G999D99') as negative,
    TO_CHAR(-12454.88, 'MI99G999D99') as minus_sign,
    TO_CHAR(+12454.88, 'S99G999D99') as plus_sign,
    TO_CHAR(09423, '999999999') as multiple_nines,
    TO_CHAR(09423, '00000') as exact_zeros
;
```

##### Results

| signed_number | decimal_number | negative | minus_sign | plus_sign | multiple_ninesmultiple_nines | exact_zerosexact_zeros |
| --- | --- | --- | --- | --- | --- | --- |
| '123-' | '12,454.88' | '-12,454.88' | '12454.88' | '-12,454.88' | '09423' | '09423' |

#### Unsupported format

The following format is not supported, for which it will be marked with an EWI.

##### Input:

```sql
 SELECT to_char(123031, 'th999,999')
```

##### Output:

```sql
 SELECT
TO_CHAR(123031, 'th999,999') !!!RESOLVE EWI!!! /*** SSC-EWI-0006 - th999,999 FORMAT MAY FAIL OR MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/!!!
```

### Known Issues

#### 1. Using numeric signs inside the number not supported.

When any numeric sign format (MI, SG or PL) is used inside the number, instead of at the start, or at the end of the number is not supported in snowflake

Example

```sql
 select to_number('12,-454.88', '99GMI999D99')
```

### Related EWIs

* [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The current date/numeric format may have a different behavior in Snowflake.

---
title: SnowConvert AI - Redshift - EXIT HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-exit-handler.md
section: Migrations
---

# SnowConvert AI - Redshift - EXIT HANDLER

## Description

Amazon Redshift, which uses PL/pgSQL for procedural logic, supports EXIT handlers in stored procedures through EXCEPTION blocks. An EXIT handler terminates the current block when a specific condition is met and transfers control to the handler code.

When migrating code from database systems that use EXIT HANDLERs (such as DB2, Teradata, or other systems) to Snowflake, SnowConvert AI transforms these constructs into equivalent Snowflake Scripting exception handling mechanisms.

An EXIT HANDLER causes the procedure to exit the current block and return control to the caller after executing the handler code. In Snowflake, this behavior is emulated using EXCEPTION blocks with appropriate logic.

For more information about Redshift exception handling, see [Exception Handling in PL/pgSQL](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-messages-errors).

## Grammar Syntax

Redshift does not have native `DECLARE EXIT HANDLER` syntax. However, when converting from other database systems, the source pattern typically looks like:

```sql
-- Pattern from source systems (e.g., DB2, Teradata)
DECLARE EXIT HANDLER FOR condition_value
  handler_action_statement;
```

In Redshift, exception handling uses:

```sql
BEGIN
  -- statements
EXCEPTION
  WHEN condition THEN
    -- handler statements that exit the block
END;
```

## Sample Source Patterns

### EXIT HANDLER Conversion to Snowflake

When migrating stored procedures from systems with EXIT HANDLER to Snowflake via Redshift, SnowConvert AI transforms them into Snowflake-compatible exception handling.

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
-- Example pattern from source system
CREATE PROCEDURE exit_handler_procedure()
BEGIN
    DECLARE EXIT HANDLER FOR SQLEXCEPTION
    BEGIN
        INSERT INTO error_log VALUES (CURRENT_TIMESTAMP, 'Error occurred, exiting');
        ROLLBACK;
    END;

    -- Main procedure logic
    INSERT INTO orders VALUES (1, 100.00);
    UPDATE inventory SET quantity = quantity - 1 WHERE product_id = 1;

    -- This will NOT execute if an error occurred
    INSERT INTO audit_log VALUES ('Transaction completed');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE exit_handler_procedure()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        -- Main procedure logic
        INSERT INTO orders VALUES (1, 100.00);
        UPDATE inventory SET quantity = quantity - 1 WHERE product_id = 1;

        -- This will NOT execute if an error occurred
        INSERT INTO audit_log VALUES ('Transaction completed');

        EXCEPTION
            WHEN OTHER THEN
                BEGIN
                    INSERT INTO error_log
                    VALUES (CURRENT_TIMESTAMP(), 'Error occurred, exiting');
                    ROLLBACK;
                END;
    END;
$$;
```

### EXIT HANDLER with Specific SQLSTATE

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE specific_error_exit()
BEGIN
    DECLARE EXIT HANDLER FOR SQLSTATE '23505'
    BEGIN
        INSERT INTO error_log VALUES ('Duplicate key error');
    END;

    INSERT INTO users VALUES (1, 'John');
    INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key

    -- This will NOT execute
    INSERT INTO success_log VALUES ('Completed');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE specific_error_exit()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    BEGIN
        INSERT INTO users VALUES (1, 'John');
        INSERT INTO users VALUES (1, 'Jane');  -- Duplicate key

        -- This will NOT execute
        INSERT INTO success_log VALUES ('Completed');

        EXCEPTION
            WHEN OTHER EXIT THEN
                CASE
                    WHEN (SQLSTATE = '23505') THEN
                        INSERT INTO error_log VALUES ('Duplicate key error')
                END;
    END;
$$;
```

### EXIT HANDLER for NOT FOUND

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE not_found_exit()
BEGIN
    DECLARE v_name VARCHAR(100);

    DECLARE EXIT HANDLER FOR NOT FOUND
        INSERT INTO log_table VALUES ('No data found, exiting');

    SELECT name INTO v_name FROM employees WHERE id = 9999;

    -- This will NOT execute if no data found
    INSERT INTO results VALUES (v_name);
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE not_found_exit()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        v_name VARCHAR(100);
    BEGIN
        SELECT name INTO v_name FROM employees WHERE id = 9999;

        -- This will NOT execute if no data found
        INSERT INTO results VALUES (v_name);

        EXCEPTION
            WHEN NO_DATA_FOUND THEN
                INSERT INTO log_table VALUES ('No data found, exiting');
    END;
$$;
```

### EXIT HANDLER with Cursor

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE cursor_exit_handler()
BEGIN
    DECLARE v_id INT;
    DECLARE v_name VARCHAR(100);
    DECLARE v_count INT := 0;

    DECLARE EXIT HANDLER FOR SQLEXCEPTION
    BEGIN
        INSERT INTO error_log VALUES ('Error in cursor processing');
        RETURN -1;
    END;

    DECLARE cur CURSOR FOR SELECT id, name FROM employees;

    OPEN cur;
    LOOP
        FETCH cur INTO v_id, v_name;
        EXIT WHEN NOT FOUND;

        -- Process each row
        INSERT INTO processed_employees VALUES (v_id, v_name);
        v_count := v_count + 1;
    END LOOP;
    CLOSE cur;

    RETURN v_count;
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE cursor_exit_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/15/2025" }}'
AS
$$
    DECLARE
        v_id INT;
        v_name VARCHAR(100);
        v_count INT := 0;
        cur CURSOR FOR SELECT id, name FROM employees;
    BEGIN
        OPEN cur;
        LOOP
            FETCH cur INTO v_id, v_name;
            IF (SQLCODE != 0) THEN
                BREAK;
            END IF;

            -- Process each row
            INSERT INTO processed_employees VALUES (v_id, v_name);
            v_count := v_count + 1;
        END LOOP;
        CLOSE cur;

        RETURN v_count;

        EXCEPTION
            WHEN OTHER THEN
                BEGIN
                    INSERT INTO error_log VALUES ('Error in cursor processing');
                    RETURN -1;
                END;
    END;
$$;
```

## Known Issues

### EXIT HANDLER Behavior

The conversion from EXIT HANDLER to Snowflake exception handling provides equivalent termination behavior:

1. **Block Termination**: Both EXIT HANDLER and Snowflake EXCEPTION blocks terminate the current BEGIN…END block.
2. **Return Control**: After executing the handler code, control returns to the caller.
3. **Execution Flow**: Statements after the error point are not executed.

### Multiple EXIT Handlers

When multiple EXIT HANDLERs are defined with different conditions, they must be merged into conditional logic:

#### Source Pattern

```sql
DECLARE EXIT HANDLER FOR SQLSTATE '23505'
    INSERT INTO log VALUES ('Duplicate key');

DECLARE EXIT HANDLER FOR SQLEXCEPTION
    INSERT INTO log VALUES ('General error');
```

#### Snowflake

```sql
EXCEPTION
    WHEN OTHER EXIT THEN
        CASE
            WHEN (SQLSTATE = '23505') THEN
                INSERT INTO log VALUES ('Duplicate key')
            ELSE
                INSERT INTO log VALUES ('General error')
        END;
```

### Mixed CONTINUE and EXIT Handlers

Source systems that allow mixing CONTINUE and EXIT handlers in the same block present special challenges. Snowflake does not support this pattern in a single EXCEPTION block.

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING

### SQLSTATE Mapping

Not all SQLSTATE codes from source systems map directly to Snowflake exception types. SnowConvert AI performs best-effort mapping:

| Source SQLSTATE | Condition | Snowflake Equivalent |
| --- | --- | --- |
| 02000 | NO DATA | NO_DATA_FOUND |
| 23xxx | Integrity Constraint | STATEMENT_ERROR |
| 42xxx | Syntax Error | STATEMENT_ERROR |
| Other | General | OTHER |

## Best Practices

When working with converted EXIT HANDLER code in Snowflake:

1. **Understand Exit Semantics**: EXIT handlers terminate the current block. Verify this matches your requirements.
2. **Test Error Conditions**: Thoroughly test all error scenarios to ensure proper exit behavior.
3. **Use Return Values**: Consider using RETURN statements in exception handlers to communicate status.
4. **Implement Logging**: Add comprehensive logging to track when and why procedures exit.
5. **Transaction Management**: Use Snowflake’s transaction support to maintain data consistency.
6. **Nested Blocks**: Remember that EXIT only affects the current block, not outer blocks or the entire procedure.
7. **Error Information**: Capture error details (SQLCODE, SQLERRM, SQLSTATE) in exception handlers for debugging.

## Related Documentation

* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [Redshift Exception Handling](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-messages-errors)
* [CREATE PROCEDURE](rs-sql-statements-create-procedure.md)

## See Also

* [CONTINUE HANDLER](redshift-continue-handler.md)
* [EXCEPTION](rs-sql-statements-create-procedure.md)
* [RAISE](rs-sql-statements-create-procedure.md)
* [DECLARE](rs-sql-statements-create-procedure.md)

---
title: SnowConvert AI - Redshift - Expressions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-expressions.md
section: Migrations
---

# SnowConvert AI - Redshift - Expressions

## Expression lists

### Description

> An expression list is a combination of expressions, and can appear in membership and comparison conditions (WHERE clauses) and in GROUP BY clauses. ([Redshift SQL Language Reference Expression lists](https://docs.aws.amazon.com/redshift/latest/dg/r_expression_lists.html)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 expression , expression , ... | (expression, expression, ...)
```

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE table1 (
    quantity VARCHAR(50),
    fruit VARCHAR(50)
);

CREATE TABLE table2 (
    quantity VARCHAR(50),
    fruit VARCHAR(50)
);

CREATE TABLE table3 (
    id INT,
    name VARCHAR(50),
    quantity INT,
    fruit VARCHAR(50),
    price INT
);

INSERT INTO table1 (quantity, fruit)
VALUES
    ('one', 'apple'),
    ('two', 'banana'),
    ('three', 'cherry');

INSERT INTO table2 (quantity, fruit)
VALUES
    ('one', 'apple'),
    ('two', 'banana'),
    ('four', 'orange');

INSERT INTO table3 (id, name, quantity, fruit, price)
VALUES
    (1, 'Alice', 1, 'apple', 100),
    (2, 'Bob', 5, 'banana', 200),
    (3, 'Charlie', 10, 'cherry', 300),
    (4, 'David', 15, 'orange', 400);
```

#### IN Clause

##### Input Code:

##### Redshift

```sql
SELECT *
FROM table3
WHERE quantity IN (1, 5, 10);
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

##### Output Code:

##### Snowflake

```sql
 SELECT *
FROM
    table3
WHERE quantity IN (1, 5, 10);
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

#### Comparisons

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM table3
WHERE (quantity, fruit) = (1, 'apple');
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |

##### Output Code:

##### Snowflake

```sql
 SELECT *
FROM
    table3
WHERE (quantity, fruit) = (1, 'apple');
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |

> **Note:**
>
> Expression list comparisons with the following operators may have a different behavior in Snowflake. ( **`< , <= , > , >=`**). These operators are transformed into logical `AND` operations to achieve full equivalence in Snowflake.

##### Input Code:

##### Redshift

```sql
 SELECT (1,8,20) < (2,2,0) as r1,
       (1,null,2) > (1,0,8) as r2,
       (null,null,2) < (1,0,8) as r3,
       (1,0,null) <= (1,1,0) as r4,
       (1,1,0) >= (1,1,20) as r5;
```

##### Result

| R1 | R2 | R3 | R4 | R5 |
| --- | --- | --- | --- | --- |
| FALSE | FALSE | NULL | NULL | FALSE |

##### Output Code:

##### Snowflake

```sql
 SELECT
    (1 < 2
    AND 8 < 2
    AND 20 < 0) as r1,
    (1 > 1
    AND null > 0
    AND 2 > 8) as r2,
    (null < 1
    AND null < 0
    AND 2 < 8) as r3,
    (1 <= 1
    AND 0 <= 1
    AND null <= 0) as r4,
    (1 >= 1
    AND 1 >= 1
    AND 0 >= 20) as r5;
```

##### Result

| R1 | R2 | R3 | R4 | R5 |
| --- | --- | --- | --- | --- |
| FALSE | FALSE | NULL | NULL | FALSE |

#### Nested tuples

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM table3
WHERE (quantity, fruit) IN ((1, 'apple'), (5, 'banana'), (10, 'cherry'));
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

##### Output Code

##### Snowflake

```sql
 SELECT *
FROM
    table3
WHERE (quantity, fruit) IN ((1, 'apple'), (5, 'banana'), (10, 'cherry'));
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

#### Case statement

##### Input Code:

##### Redshift

```sql
 SELECT
    CASE
        WHEN quantity IN (1, 5, 10) THEN 'Found'
        ELSE 'Not Found'
    END AS result
FROM table3;
```

##### Result

| RESULT |
| --- |
| Found |
| Found |
| Found |
| Not Found |
| Not Found |
| Not Found |

##### Output Code

##### Snowflake

```sql
 SELECT
    CASE
        WHEN quantity IN (1, 5, 10) THEN 'Found'
        ELSE 'Not Found'
    END AS result
FROM
    table3;
```

##### Result

| RESULT |
| --- |
| Found |
| Found |
| Found |
| Not Found |
| Not Found |
| Not Found |

#### Multiple Expressions

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM table3
WHERE (quantity, fruit) IN ((1, 'apple'), (5, 'banana'), (10, 'cherry'))
  AND price IN (100, 200, 300);
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

##### Output Code

##### Snowflake

```sql
 SELECT *
FROM
    table3
WHERE (quantity, fruit) IN ((1, 'apple'), (5, 'banana'), (10, 'cherry'))
  AND price IN (100, 200, 300);
```

##### Result

| ID | NAME | QUANTITY | FRUIT | PRICE |
| --- | --- | --- | --- | --- |
| 1 | Alice | 1 | apple | 100 |
| 2 | Bob | 5 | banana | 200 |
| 3 | Charlie | 10 | cherry | 300 |

#### Joins

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM table1 t1
JOIN table2 t2
    ON (t1.quantity, t1.fruit) = (t2.quantity, t2.fruit)
WHERE t1.quantity = 'one' AND t1.fruit = 'apple';
```

##### Result

| QUANTITY | FRUIT | QUANTITY | FRUIT |
| --- | --- | --- | --- |
| one | apple | one | apple |

##### Output Code

##### Snowflake

```sql
 SELECT *
FROM
table1 t1
JOIN
        table2 t2
    ON (t1.quantity, t1.fruit) = (t2.quantity, t2.fruit)
WHERE t1.quantity = 'one' AND t1.fruit = 'apple';
```

##### Result

| QUANTITY | FRUIT | QUANTITY | FRUIT |
| --- | --- | --- | --- |
| one | apple | one | apple |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

## Compound Expressions

### Description

> A compound expression is a series of simple expressions joined by arithmetic operators. A simple expression used in a compound expression must return a numeric value.
>
> ([RedShift SQL Language Reference Compound expressions](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html))

### Grammar Syntax

```sql
 expression operator {expression | (compound_expression)}
```

### Conversion Table

| Redshift | Snowflake | Comments |
| --- | --- | --- |
| [`||`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (Concatenation) | [`||`](https://docs.snowflake.com/en/sql-reference/functions/concat) | Fully supported by Snowflake |

### Sample Source Patterns

#### Input Code:

#### Redshift

```sql
 CREATE TABLE concatenation_demo (
    col1 VARCHAR(20),
    col2 INTEGER,
    col3 DATE
);

INSERT INTO concatenation_demo (col1, col2, col3) VALUES
('Hello', 42, '2023-12-01'),
(NULL, 0, '2024-01-01'),
('Redshift', -7, NULL);

SELECT
    col1 || ' has number ' || col2 AS concat_string_number
FROM concatenation_demo;

SELECT
    col1 || ' on ' || col3 AS concat_string_date
FROM concatenation_demo;

SELECT
    COALESCE(col1, 'Unknown') || ' with number ' || COALESCE(CAST(col2 AS VARCHAR), 'N/A') AS concat_with_null_handling
FROM concatenation_demo;
```

##### Results

| concat_string_number |
| --- |
| Hello has number 42 |
| <NULL> |
| Redshift has number -7 |

| concat_string_date |
| --- |
| Hello on 2023-12-01 |
| <NULL> |
| <NULL> |

| concat_with_null_handling |
| --- |
| Hello with number 42 |
| Unknown with number 0 |
| Redshift with number -7 |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE concatenation_demo (
    col1 VARCHAR(20),
    col2 INTEGER,
    col3 DATE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "12/16/2024",  "domain": "test" }}';

INSERT INTO concatenation_demo (col1, col2, col3) VALUES
('Hello', 42, '2023-12-01'),
(NULL, 0, '2024-01-01'),
('Redshift', -7, NULL);

SELECT
    col1 || ' has number ' || col2 AS concat_string_number
FROM
    concatenation_demo;

SELECT
    col1 || ' on ' || col3 AS concat_string_date
FROM
    concatenation_demo;

SELECT
    COALESCE(col1, 'Unknown') || ' with number ' || COALESCE(CAST(col2 AS VARCHAR), 'N/A') AS concat_with_null_handling
FROM
    concatenation_demo;
```

##### Results

| concat_string_number |
| --- |
| Hello has number 42 |
| <NULL> |
| Redshift has number -7 |

| concat_string_date |
| --- |
| Hello on 2023-12-01 |
| <NULL> |
| <NULL> |

| concat_with_null_handling |
| --- |
| Hello with number 42 |
| Unknown with number 0 |
| Redshift with number -7 |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

### Arithmetic operators

Operators

Translation for Arithmetic Operators

#### Conversion Table

| Redshift | Snowflake | Comments |
| --- | --- | --- |
| [+/-](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (positive and negative sign/operator) | [+/-](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) | Fully supported by Snowflake |
| [^](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (exponentiation) | [POWER](https://docs.snowflake.com/en/sql-reference/functions/pow) | Fully supported by Snowflake |
| [\*](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (multiplication) | [\*](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) | Fully supported by Snowflake |
| [/](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (division) | [/](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) | Redshift division between integers always returns integer value, FLOOR function is added to emulate this behavior. |
| [%](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (modulo) | [%](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) | Fully supported by Snowflake |
| [+](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (addition) | [+](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) and [||](https://docs.snowflake.com/en/sql-reference/functions/concat) | Fully supported by Snowflake. When string are added, it is transformed to a concat. |
| [-](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (subtraction) | [-](https://docs.snowflake.com/en/sql-reference/operators-arithmetic) | Fully supported by Snowflake |
| [@](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (absolute value) | [ABS](https://docs.snowflake.com/en/sql-reference/functions/abs) | Fully supported by Snowflake |
| [|/](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (square root) | [SQRT](https://docs.snowflake.com/en/sql-reference/functions/sqrt) | Fully supported by Snowflake |
| [||/](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (cube root) | [CBRT](https://docs.snowflake.com/en/sql-reference/functions/cbrt) | Fully supported by Snowflake |

#### Sample Source Patterns

##### Addition, Subtraction, Positive & Negative

**Input Code:**

##### Input Code:

##### Redshift

```sql
 CREATE TABLE test_math_operations (
    base_value DECIMAL(10, 2),
    multiplier INT,
    divisor INT,
    description VARCHAR(100),
    created_at TIMESTAMP,
    category VARCHAR(50)
);

INSERT INTO test_math_operations (base_value, multiplier, divisor, description, created_at, category)
VALUES
(100.50, 2, 5, 'Basic test', '2024-12-01 10:30:00', 'Type A'),
(250.75, 3, 10, 'Complex operations', '2024-12-02 15:45:00', 'Type B'),
(-50.25, 5, 8, 'Negative base value', '2024-12-03 20:00:00', 'Type C'),
(0, 10, 2, 'Zero base value', '2024-12-04 09:15:00', 'Type D');

SELECT +base_value AS positive_value,
       -base_value AS negative_value,
       (base_value + multiplier - divisor) AS add_sub_result,
       created_at + INTERVAL '1 day' AS next_day,
       created_at - INTERVAL '1 hour' AS one_hour_before,
       description + category as string_sum,
       base_value + '5' as int_string_sum,
       '5' + base_value as string_int_sum
FROM test_math_operations;
```

##### Results

| positive_value | negative_value | add_sub_result | next_day | one_hour_before | string_sum | int_string_sum | string_int_sum |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 100.50 | -100.50 | 97.50 | 2024-12-02 10:30:00.000000 | 2024-12-01 09:30:00.000000 | Basic testType A | 105.5 | 105.5 |
| 250.75 | -250.75 | 243.75 | 2024-12-03 15:45:00.000000 | 2024-12-02 14:45:00.000000 | Complex operationsType B | 255.75 | 255.75 |
| -50.25 | 50.25 | -53.25 | 2024-12-04 20:00:00.000000 | 2024-12-03 19:00:00.000000 | Negative base valueType C | -45.25 | -45.25 |
| 0.00 | 0.00 | 8.00 | 2024-12-05 09:15:00.000000 | 2024-12-04 08:15:00.000000 | Zero base valueType D | 5 | 5 |

**Output Code:**

##### Snowflake

```sql
 CREATE TABLE test_math_operations (
    base_value DECIMAL(10, 2),
    multiplier INT,
    divisor INT,
    description VARCHAR(100),
    created_at TIMESTAMP,
    category VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO test_math_operations (base_value, multiplier, divisor, description, created_at, category)
VALUES
(100.50, 2, 5, 'Basic test', '2024-12-01 10:30:00', 'Type A'),
(250.75, 3, 10, 'Complex operations', '2024-12-02 15:45:00', 'Type B'),
(-50.25, 5, 8, 'Negative base value', '2024-12-03 20:00:00', 'Type C'),
(0, 10, 2, 'Zero base value', '2024-12-04 09:15:00', 'Type D');

SELECT +base_value AS positive_value,
       -base_value AS negative_value,
       (base_value + multiplier - divisor) AS add_sub_result,
       created_at + INTERVAL '1 day' AS next_day,
       created_at - INTERVAL '1 hour' AS one_hour_before,
       description + category as string_sum,
       base_value + '5' as int_string_sum,
       '5' + base_value as string_int_sum
FROM
       test_math_operations;
```

##### Results

| positive_value | negative_value | add_sub_result | next_day | one_hour_before | string_sum | int_string_sum | string_int_sum |
| --- | --- | --- | --- | --- | --- | --- | --- |
| 100.5 | -100.5 | 97.5 | 2024-12-02 10:30:00 | 2024-12-01 09:30:00 | Basic testType A | 105.5 | 105.5 |
| 250.75 | -250.75 | 243.75 | 2024-12-03 15:45:00 | 2024-12-02 14:45:00 | Complex operationsType B | 255.75 | 255.75 |
| -50.25 | 50.25 | -53.25 | 2024-12-04 20:00:00 | 2024-12-03 19:00:00 | Negative base valueType C | -45.25 | -45.25 |
| 0 | 0 | 8 | 2024-12-05 09:15:00 | 2024-12-04 08:15:00 | Zero base valueType D | 5 | 5 |

#### Exponentiation, multiplication, division & modulo

##### Input Code:

##### Redshift

```sql
 CREATE TABLE test_math_operations (
    base_value DECIMAL(10, 2),
    multiplier INT,
    divisor INT,
    mod_value INT,
    exponent INT
);

INSERT INTO test_math_operations (base_value, multiplier, divisor, mod_value, exponent)
VALUES
(100.50, 2, 5, 3, 2),
(250.75, 3, 10, 7, 3),
(-50.25, 5, 8, 4, 4),
(0, 10, 2, 1, 5);

SELECT
    base_value ^ exponent AS raised_to_exponent,
    (base_value * multiplier) AS multiplied_value,
    (base_value / divisor) AS divided_value,
    base_value::int / divisor as int_division,
    (mod_value % 2) AS modulo_result,
    (base_value + multiplier - divisor) AS add_sub_result,
    (base_value + (multiplier * (divisor - mod_value))) AS controlled_eval
FROM
    test_math_operations;
```

##### Results

| raised_to_exponent | multiplied_value | divided_value | int_division | modulo_result | add_sub_result | controlled_eval |
| --- | --- | --- | --- | --- | --- | --- |
| 10100.25 | 201 | 20.1 | 20 | 1 | 97.5 | 104.5 |
| 15766047.296875 | 752.25 | 25.075 | 25 | 1 | 243.75 | 259.75 |
| 6375940.62890625 | -251.25 | -6.28125 | -6 | 0 | -53.25 | -30.25 |
| 0 | 0 | 0 | 0 | 1 | 8 | 10 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE test_math_operations (
    base_value DECIMAL(10, 2),
    multiplier INT,
    divisor INT,
    mod_value INT,
    exponent INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "12/10/2024",  "domain": "test" }}';

INSERT INTO test_math_operations (base_value, multiplier, divisor, mod_value, exponent)
VALUES
(100.50, 2, 5, 3, 2),
(250.75, 3, 10, 7, 3),
(-50.25, 5, 8, 4, 4),
(0, 10, 2, 1, 5);

SELECT
    POWER(
    base_value, exponent) AS raised_to_exponent,
    (base_value * multiplier) AS multiplied_value,
    (base_value / divisor) AS divided_value,
    FLOOR(
    base_value::int / divisor) as int_division,
    (mod_value % 2) AS modulo_result,
    (base_value + multiplier - divisor) AS add_sub_result,
    (base_value + (multiplier * (divisor - mod_value))) AS controlled_eval
FROM
    test_math_operations;
```

##### Results

| raised_to_exponent | multiplied_value | divided_value | int_division | modulo_result | add_sub_result | controlled_eval |
| --- | --- | --- | --- | --- | --- | --- |
| 10100.25 | 201 | 20.1 | 20 | 1 | 97.5 | 104.5 |
| 15766047.2969 | 752.25 | 25.075 | 25 | 1 | 243.75 | 259.75 |
| 6375940.6289 | -251.25 | -6.2812 | -7 | 0 | -53.25 | -30.25 |
| 0 | 0 | 0 | 0 | 1 | 8 | 10 |

#### Absolute value, Square root and Cube root

##### Input Code:

##### Redshift

```sql
 CREATE TABLE unary_operators
(
    col1 INTEGER,
    col2 INTEGER
);

INSERT INTO unary_operators VALUES
(14, 10),
(-8, 8),
(975, 173),
(-1273, 187);

SELECT
|/ col2 AS square_root,
||/ col1 AS cube_root,
@ col1 AS absolute_value
FROM unary_operators;
```

##### Results

```none
+-------------------+--------------------+--------------+
|square_root        |cube_root           |absolute_value|
+-------------------+--------------------+--------------+
|3.1622776601683795 |2.4101422641752306  |14            |
|2.8284271247461903 |-2                  |8             |
|13.152946437965905 |9.915962413403873   |975           |
|13.674794331177344 |-10.837841647592736 |1273          |
+-------------------+--------------------+--------------+
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE unary_operators
(
    col1 INTEGER,
    col2 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "12/17/2024",  "domain": "test" }}';

INSERT INTO unary_operators
VALUES
(14, 10),
(-8, 8),
(975, 173),
(-1273, 187);

SELECT
    SQRT(col2) AS square_root,
    CBRT(col1) AS cube_root,
    ABS(col1) AS absolute_value
FROM
    unary_operators;
```

##### Results

```none
+-------------+--------------+--------------+
|square_root  |cube_root     |absolute_value|
+-------------+--------------+--------------+
|3.16227766   |2.410142264   |14            |
|2.828427125  |-2            |8             |
|13.152946438 |9.915962413   |975           |
|13.674794331 |-10.837841648 |1273          |
+-------------+--------------+--------------+
```

#### Known Issues

1. In Snowflake, it is possible to use the unary operators `+`and `-` with string values, however in Redshift it is not valid.

#### Related EWIs

No related EWIs.

## Bitwise operators

Operators

Translation for Bitwise Operators

### Conversion Table

| Redshift | Snowflake | Comments |
| --- | --- | --- |
| [`&`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (AND) | [`BITAND`](https://docs.snowflake.com/en/sql-reference/functions/bitand) | Fully supported by Snowflake |
| [`|`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (OR) | [`BITOR`](https://docs.snowflake.com/en/sql-reference/functions/bitor) | Fully supported by Snowflake |
| [`<<`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (Shift Left) | [`BITSHIFTLEFT`](https://docs.snowflake.com/en/sql-reference/functions/bitshiftleft) |  |
| [`>>`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (Shift Right) | [`BITSHIFTRIGHT`](https://docs.snowflake.com/en/sql-reference/functions/bitshiftright) |  |
| [`#`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html#r_compound_expressions-arguments) (XOR) | [`BITXOR`](https://docs.snowflake.com/en/sql-reference/functions/bitxor) | Fully supported by Snowflake |
| [`~`](https://docs.aws.amazon.com/redshift/latest/dg/r_compound_expressions.html) (NOT) | [`BITNOT`](https://docs.snowflake.com/en/sql-reference/functions/bitnot) | Fully supported by Snowflake |

### Sample Source Patterns

#### Setup data

##### Redshift

##### Query

```sql
 CREATE TABLE bitwise_demo (
    col1 INTEGER,
    col2 INTEGER,
    col3 INTEGER,
    col4 VARBYTE(5),
    col5 VARBYTE(7)
);

INSERT INTO bitwise_demo (col1, col2, col3, col4, col5) VALUES
-- Binary: 110, 011, 1111, 0100100001100101011011000110110001101111, 0100100001101001
(6, 3, 15, 'Hello'::VARBYTE, 'Hi'::VARBYTE),
-- Binary: 1010, 0101, 0111, 0100000101000010, 01000011
(10, 5, 7, 'AB'::VARBYTE, 'C'::VARBYTE),
-- Binary: 11111111, 10000000, 01000000, 010000100111100101100101, 01000111011011110110111101100100010000100111100101100101
(255, 128, 64, 'Bye'::VARBYTE, 'GoodBye'::VARBYTE),
-- Edge case with small numbers and a negative number
(1, 0, -1, 'Hey'::VARBYTE, 'Ya'::VARBYTE);
```

##### *Snowflake*

##### Query

```sql
 CREATE TABLE bitwise_demo (
    col1 INTEGER,
    col2 INTEGER,
    col3 INTEGER,
    col4 BINARY(5),
    col5 BINARY(7)
);

-- Binary: 110, 011, 1111, 0100100001100101011011000110110001101111, 0100100001101001
INSERT INTO bitwise_demo (col1, col2, col3, col4, col5) SELECT 6, 3, 15, TO_BINARY(HEX_ENCODE('Hello')), TO_BINARY(HEX_ENCODE('Hi'));
-- Binary: 1010, 0101, 0111, 0100000101000010, 01000011
INSERT INTO bitwise_demo (col1, col2, col3, col4, col5) SELECT 10, 5, 7, TO_BINARY(HEX_ENCODE('AB')), TO_BINARY(HEX_ENCODE('C'));
-- Binary: 11111111, 10000000, 01000000, 010000100111100101100101, 01000111011011110110111101100100010000100111100101100101
INSERT INTO bitwise_demo (col1, col2, col3, col4, col5) SELECT 255, 128, 64, TO_BINARY(HEX_ENCODE('Bye')), TO_BINARY(HEX_ENCODE('GoodBye'));
-- Edge case with small numbers and a negative number
INSERT INTO bitwise_demo (col1, col2, col3, col4, col5) SELECT 1, 0, -1, TO_BINARY(HEX_ENCODE('Hey')), TO_BINARY(HEX_ENCODE('Ya'));
```

#### Bitwise operators on integer values

##### Input Code:

##### Redshift

```sql
 SELECT
    -- Bitwise AND
    col1 & col2 AS bitwise_and,  -- col1 AND col2

    -- Bitwise OR
    col1 | col2 AS bitwise_or,   -- col1 OR col2

    -- Left Shift
    col3 << 1 AS left_shift_col3, -- col3 shifted left by 1

    -- Right Shift
    col3 >> 1 AS right_shift_col3, -- col3 shifted right by 1

    -- XOR
    col1 # col2 AS bitwise_xor, -- col1 XOR col2

    -- NOT
    ~ col3 AS bitwise_not -- NOT col3

FROM bitwise_demo;
```

##### Results

```none
+-------------+------------+-----------------+------------------+-------------+-------------+
| bitwise_and | bitwise_or | left_shift_col3 | right_shift_col3 | bitwise_xor | bitwise_not |
+-------------+------------+-----------------+------------------+-------------+-------------+
|2            |7           |30               |7                 |5            |-16          |
|0            |15          |14               |3                 |15           |-8           |
|128          |255         |128              |32                |127          |-65          |
|0            |1           |-2               |-1                |1            |0            |
+-------------+------------+-----------------+------------------+-------------+-------------+
```

**Output Code:**

##### Snowflake

```sql
 SELECT
        BITAND(
        -- Bitwise AND
        col1, col2) AS bitwise_and,  -- col1 AND col2
        BITOR(

        -- Bitwise OR
        col1, col2) AS bitwise_or,   -- col1 OR col2
        -- Left Shift
        --** SSC-FDM-PG0010 - RESULTS MAY VARY DUE TO THE BEHAVIOR OF SNOWFLAKE'S BITSHIFTLEFT BITWISE FUNCTION **
        BITSHIFTLEFT(
        col3, 1) AS left_shift_col3, -- col3 shifted left by 1
        -- Right Shift
        --** SSC-FDM-PG0010 - RESULTS MAY VARY DUE TO THE BEHAVIOR OF SNOWFLAKE'S BITSHIFTRIGHT BITWISE FUNCTION **
        BITSHIFTRIGHT(
        col3, 1) AS right_shift_col3, -- col3 shifted right by 1
        BITXOR(

        -- XOR
        col1, col2) AS bitwise_xor, -- col1 XOR col2
        -- NOT
        BITNOT(col3) AS bitwise_not -- NOT col3
FROM
        bitwise_demo;
```

##### Results

```none
+-------------+------------+-----------------+------------------+-------------+-------------+
| bitwise_and | bitwise_or | left_shift_col3 | right_shift_col3 | bitwise_xor | bitwise_not |
+-------------+------------+-----------------+------------------+-------------+-------------+
|2            |7           |30               |7                 |5            |-16          |
|0            |15          |14               |3                 |15           |-8           |
|128          |255         |128              |32                |127          |-65          |
|0            |1           |-2               |-1                |1            |0            |
+-------------+------------+-----------------+------------------+-------------+-------------+
```

#### Bitwise operators on binary data

For the `BITAND`, `BITOR` and `BITXOR` functions the`'LEFT'` parameter is added to insert padding in case both binary values have different length, this is done to avoid errors when comparing the values in Snowflake.

##### Redshift

##### Query

```sql
 SELECT
    -- Bitwise AND
    col4 & col5 AS bitwise_and,  -- col4 AND col5

    -- Bitwise OR
    col4 | col5 AS bitwise_or,   -- col4 OR col5

    -- XOR
    col4 # col5 AS bitwise_xor, -- col4 XOR col5

    -- NOT
    ~ col4 AS bitwise_not -- NOT col4

FROM bitwise_demo;
```

##### Result

```none
+-----------------+-----------------+-----------------+-------------+
| bitwise_and     | bitwise_or      | bitwise_xor     | bitwise_not |
+-----------------+-----------------+-----------------+-------------+
|0x0000004869     |0x48656C6C6F     |0x48656C2406     |0xB79A939390 |
|0x0042           |0x4143           |0x4101           |0xBEBD       |
|0x00000000427965 |0x476F6F64427965 |0x476F6F64000000 |0xBD869A     |
|0x004161         |0x487D79         |0x483C18         |0xB79A86     |
+-----------------+-----------------+-----------------+-------------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
    BITAND(
    -- Bitwise AND
    col4, col5, 'LEFT') AS bitwise_and,  -- col4 AND col5
    BITOR(

    -- Bitwise OR
    col4, col5, 'LEFT') AS bitwise_or,   -- col4 OR col5

    -- XOR
    BITXOR(col4, col5, 'LEFT') AS bitwise_xor, -- col4 XOR col5

    -- NOT
    BITNOT(col4) AS bitwise_not -- NOT col4

    FROM bitwise_demo;
```

##### Result

```none
+---------------+---------------+---------------+-------------+
| bitwise_and   | bitwise_or    | bitwise_xor   | bitwise_not |
+---------------+---------------+---------------+-------------+
|0000004869     |48656C6C6F     |48656C2406     |B79A939390   |
|0042           |4143           |4101           |BEBD         |
|00000000427965 |476F6F64427965 |476F6F64000000 |BD869A       |
|004161         |487D79         |483C18         |B79A86       |
+---------------+---------------+---------------+-------------+
```

### Known Issues

No issues were found.

### Related EWIs

* [SSC-FDM-PG0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/postgresqlFDM.md): Results may vary due to the behavior of Snowflake’s bitwise function.

---
title: SnowConvert AI - Redshift - Literals
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-basic-elements-literals.md
section: Migrations
---

# SnowConvert AI - Redshift - Literals

## Description

> A literal or constant is a fixed data value, composed of a sequence of characters or a numeric constant. ([Redshift SQL Language reference Literals](https://docs.aws.amazon.com/redshift/latest/dg/r_Literals.html)).

Amazon Redshift supports several types of literals, including:

* Numeric literals for integer, decimal, and floating-point numbers.
* Character literals, also referred to as strings, character strings, or character constants.
* Datetime and interval literals, used with datetime data types.

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 -- Number literals.
SELECT 42 AS integer_literal, -- Simple integer
    -123 AS negative_integer, -- Negative integer
    3.14159 AS decimal_literal, -- Decimal number
    1E0 AS simple_float; -- Floating-point representation of 1

-- Character literals.
SELECT 'Hello, World!' AS simple_string,
    'Line1\nLine2' AS newline_character, -- Interprets \n as literal
    'Tab\tCharacter' AS tab_character, -- Interprets \t as literal
    'The value is ' || 42 AS mixed_literal;
```

##### Result

| integer_literal | negative_integer | decimal_literal | simple_float |
| --- | --- | --- | --- |
| 42 | -123 | 3.14159 | 1 |

| simple_string | newline_character | tab_character | mixed_literal |
| --- | --- | --- | --- |
| 42 | Line1  Line2 | Tab Character | The value is 42 |

Output Code:

##### Snowflake

```sql
 -- Number literals.
SELECT 42 AS integer_literal, -- Simple integer
    -123 AS negative_integer, -- Negative integer
    3.14159 AS decimal_literal, -- Decimal number
    1E0 AS simple_float; -- Floating-point representation of 1

-- Character literals.
SELECT 'Hello, World!' AS simple_string,
    'Line1\nLine2' AS newline_character, -- Interprets \n as literal
    'Tab\tCharacter' AS tab_character, -- Interprets \t as literal
    'The value is ' || 42 AS mixed_literal;
```

##### Result

| integer_literal | negative_integer | decimal_literal | simple_float |
| --- | --- | --- | --- |
| 42 | -123 | 3.14159 | 1 |

| simple_string | newline_character | tab_character | mixed_literal |
| --- | --- | --- | --- |
| 42 | Line1  Line2 | Tab Character | The value is 42 |

## Known Issues

This functionality is not currently supported in Snowflake, but it will be supported through a future migration.

```sql
 select $MyTagForLiteral$
This is
a test
of a tag literal
$MyTagForLiteral$ as c1;
```

## Related EWIs

There are no known issues.

## Date, time, and timestamp literals

### Description

> Date, time, and timestamp literals supported by Amazon Redshift.([Redshift SQL Language reference Date, Time, Timestamp Literals](https://docs.aws.amazon.com/redshift/latest/dg/r_Date_and_time_literals.html)).

#### Sample Source Patterns

##### Input Code:

##### Redshift

```sql
 --invalid
SELECT
DATEADD(month, 1, 'January 8, 1999'),
DATEADD(month, 1, '2000-Jan-31'),
DATEADD(month, 1, 'Jan-31-2000'),
DATEADD(month, 1, '20000215'),
DATEADD(month, 1, '080215'),
DATEADD(month, 1, '2008.366'),
DATEADD(month, 1, 'now');

--valid
SELECT
DATEADD(month, 1, '1999-01-08'),
DATEADD(month, 1, '1/8/1999'),
DATEADD(month, 1, '01/02/00'),
DATEADD(month, 1, '31-Jan-2000');
```

Output Code:

##### Snowflake

```sql
 --invalid
SELECT
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - 'January 8, 1999' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! 'January 8, 1999'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - '2000-Jan-31' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! '2000-Jan-31'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - 'Jan-31-2000' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! 'Jan-31-2000'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - '20000215' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! '20000215'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - '080215' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! '080215'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - '2008.366' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! '2008.366'),
 DATEADD(month, 1,
                   !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - 'now' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! 'now');

--valid
SELECT
 DATEADD(month, 1, '1999-01-08'),
 DATEADD(month, 1, '1/8/1999'),
 DATEADD(month, 1, '01/02/00'),
 DATEADD(month, 1, '31-Jan-2000');
```

### Known Issues

Some DATE, TIME, and TIMESTAMP formats may produce different results in Redshift compared to Snowflake.

### Related EWIs

* [SSC-EWI-RS0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): Date literal is not supported in Snowflake.

## Interval Literals

### Description

> Interval literals can be used in datetime calculations, such as, adding intervals to dates and timestamps, summing intervals, and subtracting an interval from a date or timestamp. Interval literals can be used as input values to interval data type columns in a table.. ([Redshift SQL Language reference Interval Literals](https://docs.aws.amazon.com/redshift/latest/dg/r_interval_data_types.html#r_interval_data_types-syntax-literal)).

> **Warning:**
>
> This grammar is partially supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/operators-logical).

### Grammar Syntax

```sql
 INTERVAL quoted-string [ year_to_month_qualifier ]
INTERVAL quoted-string [ day_to_second_qualifier ] [ (fractional_precision) ]
```

[Snowflake Intervals](https://docs.snowflake.com/en/sql-reference/data-types-datetime#interval-constants) can only be used in arithmetic operations. Intervals used in any other scenario are not supported.

The following formats are the only ones recognized and fully transformed by SnowConvert AI, allowing optional fields and most of the abbreviations without interval styles:

```tex
 1. 1 year 1 month 1 day 2 hour 3 minutes 4 seconds 123 ms
2. hh:mm:ss.ms
3. 1 year 1 month 1 day hh:mm:ss.ms
```

Snowflake does not support literals with arithmetic signs. If the Literal contains an hour expression the expression can be partially transformed.

### Sample Source Patterns

#### Supported scenarios

##### Input Code:

##### Redshift

```sql
 SELECT
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1year 1month 1day 2hour 3 minute 4.1233455second' AS c1,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1year 1month 1day 2hour 3 minute 4.123second' AS c2,
'2024-01-01 00:00:00' ::TIMESTAMP +  INTERVAL '1.234567' AS c3,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' AS c4,
('2024-01-01 00:00:00'::timestamp without time zone + '1 day 02:03:04.123'::interval) AS c5,
('2024-01-01 00:00:00'::timestamp without time zone + '1 year 1 mon 00:00:01'::interval) AS c6,
('2024-01-01 00:00:00'::timestamp without time zone + '1 year 1 mon'::interval) AS c7,
('2024-01-01 00:00:00'::timestamp without time zone + '00:00:01.234567'::interval) AS c8,
('2024-01-01 00:00:00'::timestamp without time zone + '1 year 1 mon 1 day 02:03:04.123'::interval) AS c9,
('2024-01-01 00:00:00'::timestamp without time zone + '00:03:04.5678'::interval) AS c10,
('2024-01-01 00:00:00'::timestamp without time zone + '1 day 02:03:00'::interval) AS c11,
('2024-01-01 00:00:00'::timestamp without time zone + '3 days 01:59:00'::interval) AS c11,
('2024-01-01 00:00:00'::timestamp without time zone + '1 year 1 mon'::interval) AS c12,
('2024-01-01 00:00:00'::timestamp without time zone + '10 years'::interval) AS c13,
('2024-01-01 00:00:00'::timestamp without time zone + '1000 years'::interval) AS c14,
('2024-01-01 00:00:00'::timestamp without time zone + '100 years'::interval) AS c15,
('2024-01-01 00:00:00'::timestamp without time zone + '1 year 1 mon'::interval) AS c16
;
```

##### Output Code:

##### Snowflake

```sql
 SELECT
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1year, 1month, 1day, 2hour, 3 minute, 4 seconds, 123 ms' AS c1,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1year, 1month, 1day, 2hour, 3 minute, 4 seconds, 123 ms' AS c2,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 seconds, 234 ms' AS c3,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' AS c4,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 day, 02 hour, 03 minutes, 04 seconds, 123 ms') AS c5,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon, 00 hour, 00 minutes, 01 seconds') AS c6,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon') AS c7,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '00 hour, 00 minutes, 01 seconds, 234 ms') AS c8,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon, 1 day, 02 hour, 03 minutes, 04 seconds, 123 ms') AS c9,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '00 hour, 03 minutes, 04 seconds, 567 ms') AS c10,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 day, 02 hour, 03 minutes, 00 seconds') AS c11,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '3 days , 01 hour, 59 minutes, 00 seconds') AS c11,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon') AS c12,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '10 years') AS c13,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1000 years') AS c14,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '100 years') AS c15,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon') AS c16
;
```

#### Pending translation scenarios

##### Input Code:

##### Redshift

```sql
 SELECT
INTERVAL '1year 1month 1day 2hour 3 minute 4.1233455second',
'2024-01-01 00:00:00' ::TIMESTAMP +  INTERVAL '1.234567' SECOND AS c2,
'2024-01-01 00:00:00' ::TIMESTAMP +  INTERVAL '1.234567' SECOND (3) AS c3,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' YEAR AS c4,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' MONTH AS c5,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 2:3:4.5678' DAY TO MINUTE AS c6,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 2:3:4.5678' DAY TO SECOND AS c7,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 2:3' AS c8,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 49:59:0' AS c9,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 49:59:0' DAY AS c10,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 - 1 1'  AS c11,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1-1' AS c12,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 year -1 day' AS c13,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '3:4.5678' AS c14,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1-1 0 second 0 millisecond' AS c15,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 decade' AS c16,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 millenium' AS c17,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 century' AS c18,
('2024-01-01 00:00:00'::timestamp without time zone + ('00:00:01.234567')::interval second) AS c19,
('2024-01-01 00:00:00'::timestamp without time zone + ('00:00:01.235')::interval second (3)) AS c20,
('2024-01-01 00:00:00'::timestamp without time zone + ('1 year')::interval year) AS c21,
('2024-01-01 00:00:00'::timestamp without time zone + ('1 year 1 mon')::interval month) AS c22,
('2024-01-01 00:00:00'::timestamp without time zone + ('1 day 02:03:00')::interval day to minute) AS c23,
('2024-01-01 00:00:00'::timestamp without time zone + ('1 day 02:03:04.5678')::interval day to second) AS c24,
('2024-01-01 00:00:00'::timestamp without time zone + '-01:56:55.877'::interval) AS c25,
('2024-01-01 00:00:00'::timestamp without time zone + ('3 days')::interval day) AS c26;
```

##### Output Code:

##### Snowflake

```sql
 SELECT
INTERVAL '1year 1month 1day 2hour 3 minute 4.1233455second' !!!RESOLVE EWI!!! /*** SSC-EWI-0107 - INTERVAL LITERAL IS NOT SUPPORTED BY SNOWFLAKE IN THIS SCENARIO  ***/!!!,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 seconds, 234 ms' SECOND !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c2,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 seconds, 234 ms' SECOND (3) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c3,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' YEAR !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c4,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '13 months' MONTH !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c5,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 , 2 hour, 3 minutes, 4 seconds, 567 ms' DAY TO MINUTE !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c6,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 , 2 hour, 3 minutes, 4 seconds, 567 ms' DAY TO SECOND !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c7,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 2:3' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c8,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 , 49 hour, 59 minutes, 0 seconds' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c9,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 , 49 hour, 59 minutes, 0 seconds' DAY !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c10,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 - 1 1' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!  AS c11,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1-1' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c12,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 year -1 day' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c13,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '3:4.5678' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c14,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1-1 0 second 0 millisecond' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c15,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 decade' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c16,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 millenium' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c17,
'2024-01-01 00:00:00' ::TIMESTAMP + INTERVAL '1 century' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!! AS c18,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '00 hour, 00 minutes, 01 seconds, 234 ms' second !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c19,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '00 hour, 00 minutes, 01 seconds, 235 ms' second (3) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c20,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year' year !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c21,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 year, 1 mon' month !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c22,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 day, 02 hour, 03 minutes, 00 seconds' day to minute !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c23,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + INTERVAL '1 day, 02 hour, 03 minutes, 04 seconds, 567 ms' day to second !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c24,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + '-01:56:55.877':: VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DATA TYPE CONVERTED TO VARCHAR ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c25,
('2024-01-01 00:00:00':: TIMESTAMP_NTZ + ('3 days'):: VARCHAR !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DATA TYPE CONVERTED TO VARCHAR ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INTERVAL FORMAT' NODE ***/!!!) AS c26;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0107](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Interval Literal Not Supported In Current Scenario.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## NULLS

### Description

> If a column in a row is missing, unknown, or not applicable, it is a null value or is said to contain null. ([Redshift SQL Language reference Nulls Literals](https://docs.aws.amazon.com/redshift/latest/dg/r_Nulls.html)).

Nulls can appear in fields of any data type that are not restricted by primary key or NOT NULL constraints. A null is not equivalent to the value zero or to an empty string.

#### Sample Source Patterns

##### Input Code:

##### Redshift

```sql
 SELECT NULL IN (NULL, 0, 1, 2 ,3, 4);
SELECT 1 + NULL, 1 - NULL, 1 * NULL, 1 / NULL, 1 % NULL;
```

##### Result

| Select1 |
| --- |
| NULL |

| 1+NULL | 1\*NULL |
| --- | --- |
| NULL | NULL |

Output Code:

##### Snowflake

```sql
 SELECT NULL IN (NULL, 0, 1, 2 ,3, 4);
SELECT 1 + NULL, 1 - NULL, 1 * NULL, 1 / NULL, 1 % NULL;
```

##### Result

| Select1 |
| --- |
| NULL |

| 1+NULL | 1\*NULL |
| --- | --- |
| NULL | NULL |

### Known Issues

No issues were found.

### Related EWIs

There are no known issues.

---
title: SnowConvert AI - Redshift - Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/etl-bi-repointing/power-bi-redshift-repointing.md
section: Migrations
---

# SnowConvert AI - Redshift - Power BI Repointing

## Description

The Power BI repointing is a feature that provides an easy way to redefine the connections from the M language in the Power Query Editor. This means that the connection parameters will be redefined to point to the Snowflake migration database context. For Redshift, the method in M Language that defined the connection is `AmazonRedshift.Database(...).` In Snowflake, there is a connector that depends on some other parameters and the main connection is defined by `Snowflake.Database(...)` method.

## Source Pattern Samples

### Entity Repointing Case: Table

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a table.

**Redshift Connection in the Power Query Editor**

```sql
let
    Source = AmazonRedshift.Database("your_connection","snowconvert"),
    public = Source{[Name="public"]}[Data],
    authors1 = public{[Name="authors"]}[Data]
in
    authors1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="public", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="AUTHORS", Kind="Table"]}[Data],
    authors1 = Table.RenameColumns(SourceSfTbl, {{ "AUTHOR_ID", "author_id"}, { "FIRST_NAME", "first_name"}, { "LAST_NAME", "last_name"}, { "BIRTH_YEAR", "birth_year"}})
in
    authors1
```

### Entity Repointing Case: View

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a view.

**Redshift Connection in the Power Query Editor**

```sql
let
    Source = AmazonRedshift.Database("your_connection","snowconvert"),
    public = Source{[Name="public"]}[Data],
    author_books_view1 = public{[Name="author_books_view"]}[Data]
in
    author_books_view1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="public", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="AUTHOR_BOOKS_VIEW", Kind="Table"]}[Data],
    author_books_view1 = Table.RenameColumns(SourceSfTbl, {{ "BOOK_TITLE", "book_title"}, { "AUTHOR_FULL_NAME", "author_full_name"}, { "PUBLICATION_YEAR", "publication_year"}, { "GENRE", "genre"}})
in
    author_books_view1
```

### Embedded SQL Case

This case refers to connections that contain embedded SQL inside them. This sample shows a simple query, but SnowConvert AI covers a range of larger scenarios. Besides, depending on the migrated query, there may be warning messages known as EWI—PRF—FDM. This will help the user identify patterns that need extra attention.

**Redshift Connection in the Power Query Editor**

```sql
let
    Source = Value.NativeQuery(AmazonRedshift.Database("your_connection","snowconvert"), "SELECT * FROM authors LIMIT 5", null, [EnableFolding=true])
in
    Source
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT ""authors"" **
SELECT * FROM
authors
LIMIT 5", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "AUTHOR_ID", "author_id"}, { "FIRST_NAME", "first_name"}, { "LAST_NAME", "last_name"}, { "BIRTH_YEAR", "birth_year"}})
in
    Source
```

---
title: SnowConvert AI - Redshift - SELECT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/rs-sql-statements-select.md
section: Migrations
---

# SnowConvert AI - Redshift - SELECT

## SELECT

### Description

Returns rows from tables, views, and user-defined functions. ([Redshift SQL Language Reference SELECT statement](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_synopsis.html))

### Grammar Syntax

```sql
 [ WITH with_subquery [, ...] ]
SELECT
[ TOP number | [ ALL | DISTINCT ]
* | expression [ AS output_name ] [, ...] ]
[ FROM table_reference [, ...] ]
[ WHERE condition ]
[ [ START WITH expression ] CONNECT BY expression ]
[ GROUP BY expression [, ...] ]
[ HAVING condition ]
[ QUALIFY condition ]
[ { UNION | ALL | INTERSECT | EXCEPT | MINUS } query ]
[ ORDER BY expression [ ASC | DESC ] ]
[ LIMIT { number | ALL } ]
[ OFFSET start ]
```

For more information please refer to each of the following links:

1. WITH clause
2. SELECT list
3. FROM clause
4. WHERE clause
5. CONNECT BY clause
6. GROUP BY clause
7. HAVING clause
8. QUALIFY clause
9. UNION, INTERSECT, and EXCEPT
10. ORDER BY clause

## CONNECT BY clause

### Description

The `CONNECT BY` clause specifies the relationship between rows in a hierarchy. You can use `CONNECT BY` to select rows in a hierarchical order by joining the table to itself and processing the hierarchical data. ([Redshift SQL Language Reference CONNECT BY Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_CONNECT_BY_clause.html))

> **Note:**
>
> The [CONNECT BY clause](https://docs.snowflake.com/en/sql-reference/constructs/connect-by) is supported in Snowflake.

### Grammar Syntax

```sql
 [START WITH start_with_conditions]
CONNECT BY connect_by_conditions
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT COUNT(*)
FROM
Employee "start"
CONNECT BY PRIOR id = manager_id
START WITH name = 'John';
```

##### Results

| COUNT(\*) |
| --- |
| 12 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT
  COUNT(*)
FROM
  Employee "start"
CONNECT BY PRIOR id = manager_id
START WITH name = 'John';
```

##### Results

| COUNT(\*) |
| --- |
| 12 |

### Related EWIs

There are no known issues.

## FROM clause

### Description

The `FROM` clause in a query lists the table references (tables, views, and subqueries) that data is selected from. If multiple table references are listed, the tables must be joined, using appropriate syntax in either the `FROM` clause or the `WHERE` clause. If no join criteria are specified, the system processes the query as a cross-join. ([Redshift SQL Language Reference FROM Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_FROM_clause30.html))

> **Warning:**
>
> The [FROM clause](https://docs.snowflake.com/en/sql-reference/constructs/from) is partially supported in Snowflake. [Object unpivoting](https://docs.aws.amazon.com/redshift/latest/dg/query-super.html#unpivoting) is not currently supported.

### Grammar Syntax

```sql
 FROM table_reference [, ...]

<table_reference> ::=
with_subquery_table_name [ table_alias ]
table_name [ * ] [ table_alias ]
( subquery ) [ table_alias ]
table_reference [ NATURAL ] join_type table_reference
   [ ON join_condition | USING ( join_column [, ...] ) ]
table_reference PIVOT (
   aggregate(expr) [ [ AS ] aggregate_alias ]
   FOR column_name IN ( expression [ AS ] in_alias [, ...] )
) [ table_alias ]
table_reference UNPIVOT [ INCLUDE NULLS | EXCLUDE NULLS ] (
   value_column_name
   FOR name_column_name IN ( column_reference [ [ AS ]
   in_alias ] [, ...] )
) [ table_alias ]
UNPIVOT expression AS value_alias [ AT attribute_alias ]
```

### Sample Source Patterns

#### Join types

Snowflake supports all types of joins. For more information, see [the JOIN documentation.](https://docs.snowflake.com/en/sql-reference/constructs/join)

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE department (
    id INT,
    name VARCHAR(50),
    manager_id INT
);

INSERT INTO department(id, name, manager_id) VALUES
(1, 'HR', 100),
(2, 'Sales', 101),
(3, 'Engineering', 102),
(4, 'Marketing', 103);

SELECT e.name AS employee_name, d.name AS department_name
FROM employee e
INNER JOIN department d ON e.manager_id = d.manager_id;

SELECT e.name AS employee_name, d.name AS department_name
FROM employee e
LEFT JOIN department d ON e.manager_id = d.manager_id;

SELECT d.name AS department_name, e.name AS manager_name
FROM department d
RIGHT JOIN employee e ON d.manager_id = e.id;

SELECT e.name AS employee_name, d.name AS department_name
FROM employee e
FULL JOIN department d ON e.manager_id = d.manager_id;
```

##### Results

##### Inner Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Sofía | Engineering |

##### Left Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| Carlos | null |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Saanvi | null |
| Shirley | null |
| Sofía | Engineering |
| Zhang | null |

##### Right Join

| DEPARTMENT_NAME | MANAGER_NAME |
| --- | --- |
| HR | Carlos |
| Sales | John |
| Engineering | Jorge |
| Marketing | Kwaku |
| null | Liu |
| null | Mateo |
| null | Nikki |
| null | Paulo |
| null | Richard |
| null | Saanvi |
| null | Shirley |
| null | Sofía |
| null | Zhang |

##### Full Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| Carlos | null |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Saanvi | null |
| Shirley | null |
| Sofía | Engineering |
| Zhang | null |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE department (
    id INT,
    name VARCHAR(50),
    manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO department (id, name, manager_id) VALUES
(1, 'HR', 100),
(2, 'Sales', 101),
(3, 'Engineering', 102),
(4, 'Marketing', 103);

SELECT e.name AS employee_name, d.name AS department_name
FROM
employee e
INNER JOIN
  department d ON e.manager_id = d.manager_id;

SELECT e.name AS employee_name, d.name AS department_name
FROM
employee e
LEFT JOIN
  department d ON e.manager_id = d.manager_id;

SELECT d.name AS department_name, e.name AS manager_name
FROM
department d
RIGHT JOIN
  employee e ON d.manager_id = e.id;

SELECT e.name AS employee_name, d.name AS department_name
FROM
employee e
FULL JOIN
  department d ON e.manager_id = d.manager_id;
```

##### Results

##### Inner Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Sofía | Engineering |

##### Left Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| Carlos | null |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Saanvi | null |
| Shirley | null |
| Sofía | Engineering |
| Zhang | null |

##### Right Join

| DEPARTMENT_NAME | MANAGER_NAME |
| --- | --- |
| HR | Carlos |
| Sales | John |
| Engineering | Jorge |
| Marketing | Kwaku |
| null | Liu |
| null | Mateo |
| null | Nikki |
| null | Paulo |
| null | Richard |
| null | Saanvi |
| null | Shirley |
| null | Sofía |
| null | Zhang |

##### Full Join

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| Carlos | null |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Saanvi | null |
| Shirley | null |
| Sofía | Engineering |
| Zhang | null |

#### Pivot Clause

> **Note:**
>
> Column aliases cannot be used in the IN clause of the PIVOT query in Snowflake.

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM
    (SELECT e.manager_id, d.name AS department, e.id AS employee_id
     FROM employee e
     JOIN department d ON e.manager_id = d.manager_id) AS SourceTable
PIVOT
    (
     COUNT(employee_id)
     FOR department IN ('HR', 'Sales', 'Engineering', 'Marketing')
    ) AS PivotTable;
```

##### Results

| MANAGER_ID | ‘HR’ | ‘Sales’ | ‘Engineering’ | ‘Marketing’ |
| --- | --- | --- | --- | --- |
| 100 | 1 | 0 | 0 | 0 |
| 101 | 0 | 3 | 0 | 0 |
| 102 | 0 | 0 | 2 | 0 |
| 103 | 0 | 0 | 0 | 3 |

##### Output Code:

##### Snowflake

```sql
 SELECT *
FROM
    (SELECT e.manager_id, d.name AS department, e.id AS employee_id
     FROM
     employee e
     JOIN
         department d ON e.manager_id = d.manager_id) AS SourceTable
PIVOT
    (
     COUNT(employee_id)
     FOR department IN ('HR', 'Sales', 'Engineering', 'Marketing')
    ) AS PivotTable;
```

##### Results

| MANAGER_ID | ‘HR’ | ‘Sales’ | ‘Engineering’ | ‘Marketing’ |
| --- | --- | --- | --- | --- |
| 100 | 1 | 0 | 0 | 0 |
| 101 | 0 | 3 | 0 | 0 |
| 102 | 0 | 0 | 2 | 0 |
| 103 | 0 | 0 | 0 | 3 |

#### Unpivot Clause

> **Note:**
>
> Column aliases cannot be used in the IN clause of the UNPIVOT query in Snowflake.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE count_by_color (quality VARCHAR, red INT, green INT, blue INT);

INSERT INTO count_by_color VALUES ('high', 15, 20, 7);
INSERT INTO count_by_color VALUES ('normal', 35, NULL, 40);
INSERT INTO count_by_color VALUES ('low', 10, 23, NULL);

SELECT *
FROM (SELECT red, green, blue FROM count_by_color) UNPIVOT (
    cnt FOR color IN (red, green, blue)
);

SELECT *
FROM (SELECT red, green, blue FROM count_by_color) UNPIVOT (
    cnt FOR color IN (red r, green as g, blue)
);
```

##### Results

| COLOR | CNT |
| --- | --- |
| RED | 15 |
| RED | 35 |
| RED | 10 |
| GREEN | 20 |
| GREEN | 23 |
| BLUE | 7 |
| BLUE | 40 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE count_by_color (quality VARCHAR, red INT, green INT, blue INT)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO count_by_color
VALUES ('high', 15, 20, 7);
INSERT INTO count_by_color
VALUES ('normal', 35, NULL, 40);
INSERT INTO count_by_color
VALUES ('low', 10, 23, NULL);

SELECT *
FROM (SELECT red, green, blue FROM
            count_by_color
    ) UNPIVOT (
    cnt FOR color IN (red, green, blue)
);

SELECT *
FROM (SELECT red, green, blue FROM
            count_by_color
) UNPIVOT (
    cnt FOR color IN (red
                          !!!RESOLVE EWI!!! /*** SSC-EWI-RS0005 - SNOWCONVERT AI TRANSLATION FOR COLUMN ALIASES IN THE PIVOT/UNPIVOT IN CLAUSE IS PENDING. ***/!!!
 r, green
          !!!RESOLVE EWI!!! /*** SSC-EWI-RS0005 - SNOWCONVERT AI TRANSLATION FOR COLUMN ALIASES IN THE PIVOT/UNPIVOT IN CLAUSE IS PENDING. ***/!!!
 as g, blue)
);
```

##### Results

| COLOR | CNT |
| --- | --- |
| RED | 15 |
| GREEN | 20 |
| BLUE | 7 |
| RED | 35 |
| BLUE | 40 |
| RED | 10 |
| GREEN | 23 |

### Related EWIs

1. [SSC-EWI-RS0005](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): SnowConvert AI translation for column aliases in the PIVOT/UNPIVOT IN clause is pending.

## GROUP BY clause

### Description

The `GROUP BY` clause identifies the grouping columns for the query. Grouping columns must be declared when the query computes aggregates with standard functions such as `SUM`, `AVG`, and `COUNT`. ([Redshift SQL Language Reference GROUP BY Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_GROUP_BY_clause.html))

> **Note:**
>
> The [GROUP BY clause](https://docs.snowflake.com/en/sql-reference/constructs/group-by) is fully supported in Snowflake.

### Grammar Syntax

```sql
 GROUP BY group_by_clause [, ...]

group_by_clause := {
    expr |
    GROUPING SETS ( () | group_by_clause [, ...] ) |
    ROLLUP ( expr [, ...] ) |
    CUBE ( expr [, ...] )
    }
```

### Sample Source Patterns

#### Grouping sets

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM employee
GROUP BY GROUPING SETS
    ((manager_id), ())
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM
    employee
GROUP BY GROUPING SETS
    ((manager_id), ())
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

#### Group by Cube

##### Input Code:

##### Redshift

```sql
 SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM
    employee
GROUP BY CUBE(manager_id)
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

##### Output Code:

##### Snowflake

```sql
 SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM
    employee
GROUP BY CUBE(manager_id)
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

#### Group by Rollup

##### Input Code:

##### Redshift

```sql
 SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM
    employee
GROUP BY ROLLUP(manager_id)
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

##### Output Code:

##### Snowflake

```sql
 SELECT
    manager_id,
    COUNT(id) AS total_employees
FROM
    employee
GROUP BY ROLLUP(manager_id)
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
| null | 1 |
| null | 13 |

### Related EWIs

There are no known issues.

## HAVING clause

### Description

The `HAVING` clause applies a condition to the intermediate grouped result set that a query returns. ([Redshift SQL Language Reference HAVING Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_HAVING_clause.html))

> **Note:**
>
> The [HAVING clause](https://docs.snowflake.com/en/sql-reference/constructs/having) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ HAVING condition ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT manager_id, COUNT(id) AS total_employees
FROM
employee
GROUP BY manager_id
HAVING COUNT(id) > 2
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 101 | 3 |
| 103 | 3 |
| 104 | 3 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT manager_id, COUNT(id) AS total_employees
FROM
employee
GROUP BY manager_id
HAVING COUNT(id) > 2
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 101 | 3 |
| 103 | 3 |
| 104 | 3 |

### Related EWIs

There are no known issues.

## ORDER BY clause

### Description

The `ORDER BY` clause sorts the result set of a query. ([Redshift SQL Language Reference Order By Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_ORDER_BY_clause.html))

> **Note:**
>
> The [ORDER BY clause](https://docs.snowflake.com/en/sql-reference/constructs/order-by) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ ORDER BY expression [ ASC | DESC ] ]
[ NULLS FIRST | NULLS LAST ]
[ LIMIT { count | ALL } ]
[ OFFSET start ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
    id INT,
    name VARCHAR(20),
    manager_id INT,
    salary DECIMAL(10, 2)
);

INSERT INTO employee (id, name, manager_id, salary) VALUES
(100, 'Carlos', NULL, 120000.00),
(101, 'John', 100, 90000.00),
(102, 'Jorge', 101, 95000.00),
(103, 'Kwaku', 101, 105000.00),
(104, 'Paulo', 102, 110000.00),
(105, 'Richard', 102, 85000.00),
(106, 'Mateo', 103, 95000.00),
(107, 'Liu', 103, 108000.00),
(108, 'Zhang', 104, 95000.00);

SELECT id, name, manager_id, salary
FROM employee
ORDER BY salary DESC NULLS LAST, name ASC NULLS FIRST
LIMIT 5
OFFSET 2;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 107 | Liu | 103 | 108000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 108 | Zhang | 104 | 95000.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
    id INT,
    name VARCHAR(20),
    manager_id INT,
    salary DECIMAL(10, 2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id, salary) VALUES
(100, 'Carlos', NULL, 120000.00),
(101, 'John', 100, 90000.00),
(102, 'Jorge', 101, 95000.00),
(103, 'Kwaku', 101, 105000.00),
(104, 'Paulo', 102, 110000.00),
(105, 'Richard', 102, 85000.00),
(106, 'Mateo', 103, 95000.00),
(107, 'Liu', 103, 108000.00),
(108, 'Zhang', 104, 95000.00);

SELECT id, name, manager_id, salary
FROM
    employee
ORDER BY salary DESC NULLS LAST, name ASC NULLS FIRST
LIMIT 5
OFFSET 2;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 107 | Liu | 103 | 108000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 108 | Zhang | 104 | 95000.00 |

### Related EWIs

There are no known issues.

## QUALIFY clause

### Description

The `QUALIFY` clause filters results of a previously computed window function according to user‑specified search conditions. You can use the clause to apply filtering conditions to the result of a window function without using a subquery. ([Redshift SQL Language Reference QUALIFY Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_QUALIFY_clause.html))

> **Note:**
>
> The [QUALIFY clause](https://docs.snowflake.com/en/sql-reference/constructs/qualify) is supported in Snowflake.

### Grammar Syntax

```sql
 QUALIFY condition
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE store_sales
(
    ss_sold_date DATE,
    ss_sold_time TIME,
    ss_item TEXT,
    ss_sales_price FLOAT
);

INSERT INTO store_sales VALUES ('2022-01-01', '09:00:00', 'Product 1', 100.0),
                               ('2022-01-01', '11:00:00', 'Product 2', 500.0),
                               ('2022-01-01', '15:00:00', 'Product 3', 20.0),
                               ('2022-01-01', '17:00:00', 'Product 4', 1000.0),
                               ('2022-01-01', '18:00:00', 'Product 5', 30.0),
                               ('2022-01-02', '10:00:00', 'Product 6', 5000.0),
                               ('2022-01-02', '16:00:00', 'Product 7', 5.0);

SELECT *
FROM store_sales ss
WHERE ss_sold_time > time '12:00:00'
QUALIFY row_number()
OVER (PARTITION BY ss_sold_date ORDER BY ss_sales_price DESC) <= 2;
```

##### Results

| SS_SOLD_DATE | SS_SOLD_TIME | SS_ITEM | SS_SALES_PRICE |
| --- | --- | --- | --- |
| 2022-01-01 | 17:00:00 | Product 4 | 1000 |
| 2022-01-01 | 18:00:00 | Product 5 | 30 |
| 2022-01-02 | 16:00:00 | Product 7 | 5 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE store_sales
(
    ss_sold_date DATE,
    ss_sold_time TIME,
    ss_item TEXT,
    ss_sales_price FLOAT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO store_sales
VALUES ('2022-01-01', '09:00:00', 'Product 1', 100.0),
                               ('2022-01-01', '11:00:00', 'Product 2', 500.0),
                               ('2022-01-01', '15:00:00', 'Product 3', 20.0),
                               ('2022-01-01', '17:00:00', 'Product 4', 1000.0),
                               ('2022-01-01', '18:00:00', 'Product 5', 30.0),
                               ('2022-01-02', '10:00:00', 'Product 6', 5000.0),
                               ('2022-01-02', '16:00:00', 'Product 7', 5.0);

SELECT *
FROM
    store_sales ss
WHERE ss_sold_time > time '12:00:00'
QUALIFY
    ROW_NUMBER()
OVER (PARTITION BY ss_sold_date ORDER BY ss_sales_price DESC) <= 2;
```

##### Results

| SS_SOLD_DATE | SS_SOLD_TIME | SS_ITEM | SS_SALES_PRICE |
| --- | --- | --- | --- |
| 2022-01-02 | 16:00:00 | Product 7 | 5 |
| 2022-01-01 | 17:00:00 | Product 4 | 1000 |
| 2022-01-01 | 18:00:00 | Product 5 | 30 |

### Related EWIs

There are no known issues.

## SELECT list

### Description

> The SELECT list names the columns, functions, and expressions that you want the query to return. The list represents the output of the query. ([Redshift SQL Language Reference SELECT list](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_list.html))

> **Note:**
>
> The [query start options](https://docs.snowflake.com/en/sql-reference/sql/select) are fully supported in Snowflake. Just keep in mind that in Snowflake the `DISTINCT` and `ALL` options must go at the beginning of the query.

> **Note:**
>
> In Redshift, if your application allows foreign keys or invalid primary keys, it can cause queries to return incorrect results. For example, a SELECT DISTINCT query could return duplicate rows if the primary key column does not contain all unique values. ([Redshift SQL Language Reference SELECT list](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_list.html))

### Grammar Syntax

```sql
 SELECT
[ TOP number ]
[ ALL | DISTINCT ] * | expression [ AS column_alias ] [, ...]
```

### Sample Source Patterns

#### Top clause

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT TOP 5 id, name, manager_id
FROM employee;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 100 | Carlos | null |
| 101 | John | 100 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT TOP 5 id, name, manager_id
FROM
    employee;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 100 | Carlos | null |
| 101 | John | 100 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |

#### ALL

##### Input Code:

##### Redshift

```sql
SELECT ALL manager_id
FROM employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 101 |
| 101 |
| 102 |
| 103 |
| 103 |
| 103 |
| 104 |
| 104 |
| 102 |
| 104 |

##### Output Code:

##### Snowflake

```sql
 SELECT ALL manager_id
FROM
    employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 101 |
| 101 |
| 102 |
| 103 |
| 103 |
| 103 |
| 104 |
| 104 |
| 102 |
| 104 |

#### DISTINCT

##### Input Code:

##### Redshift

```sql
SELECT DISTINCT manager_id
FROM employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 102 |
| 103 |
| 104 |

##### Output Code:

##### Snowflake

```sql
SELECT DISTINCT manager_id
FROM
    employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 102 |
| 103 |
| 104 |

### Related EWIs

There are no known issues.

## UNION, INTERSECT, and EXCEPT

### Description

The `UNION`, `INTERSECT`, and `EXCEPT` *set operators* are used to compare and merge the results of two separate query expressions. ([Redshift SQL Language Reference Set Operators](https://docs.aws.amazon.com/redshift/latest/dg/r_UNION.html))

> **Note:**
>
> [Set operators](https://docs.snowflake.com/en/sql-reference/operators-query) are fully supported in Snowflake.

### Grammar Syntax

```sql
 query
{ UNION [ ALL ] | INTERSECT | EXCEPT | MINUS }
query
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

UNION

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 102

UNION ALL

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

INTERSECT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 103

EXCEPT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 104;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |
| 102 | Jorge | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |

##### Output Code:

##### Snowflake

```sql
 SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

UNION

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 102

UNION ALL

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

INTERSECT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 103

EXCEPT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 104;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |

### Related EWIs

There are no known issues.

## WHERE clause

### Description

> The `WHERE` clause contains conditions that either join tables or apply predicates to columns in tables. ([Redshift SQL Language Reference WHERE Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_WHERE_clause.html))

> **Note:**
>
> The [WHERE clause](https://docs.snowflake.com/en/sql-reference/constructs/where) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WHERE condition ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT id, name, manager_id
FROM employee
WHERE name LIKE 'J%';
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 102 | Jorge | 101 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/05/2024",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT id, name, manager_id
FROM
  employee
WHERE name LIKE 'J%' ESCAPE '\\';
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 102 | Jorge | 101 |

### Related EWIs

There are no known issues.

## WITH clause

### Description

A `WITH` clause is an optional clause that precedes the SELECT list in a query. The `WITH` clause defines one or more *common_table_expressions*. Each common table expression (CTE) defines a temporary table, which is similar to a view definition. You can reference these temporary tables in the `FROM` clause. ([Redshift SQL Language Reference WITH Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_WITH_clause.html))

> **Note:**
>
> The [WITH clause](https://docs.snowflake.com/en/sql-reference/constructs/with) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WITH [RECURSIVE] common_table_expression [, common_table_expression , ...] ]

--Where common_table_expression can be either non-recursive or recursive.
--Following is the non-recursive form:
CTE_table_name [ ( column_name [, ...] ) ] AS ( query )

--Following is the recursive form of common_table_expression:
CTE_table_name (column_name [, ...] ) AS ( recursive_query )
```

### Sample Source Patterns

#### Recursive form

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

WITH RECURSIVE john_org(id, name, manager_id, level) AS
( SELECT id, name, manager_id, 1 AS level
  FROM employee
  WHERE name = 'John'
  UNION ALL
  SELECT e.id, e.name, e.manager_id, level + 1 AS next_level
  FROM employee e, john_org j
  WHERE e.manager_id = j.id and level < 4
)
SELECT DISTINCT id, name, manager_id FROM john_org ORDER BY manager_id;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 110 | Liu | 101 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 201 | Sofía | 102 |
| 106 | Mateo | 102 |
| 105 | Richard | 103 |
| 104 | Paulo | 103 |
| 110 | Nikki | 103 |
| 205 | Zhang | 104 |
| 120 | Saanvi | 104 |
| 200 | Shirley | 104 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

WITH RECURSIVE john_org(id, name, manager_id, level) AS
( SELECT id, name, manager_id, 1 AS level
  FROM
    employee
  WHERE name = 'John'
  UNION ALL
  SELECT e.id, e.name, e.manager_id, level + 1 AS next_level
  FROM
    employee e,
    john_org j
  WHERE e.manager_id = j.id and level < 4
)
SELECT DISTINCT id, name, manager_id FROM
  john_org
ORDER BY manager_id;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |
| 110 | Nikki | 103 |
| 104 | Paulo | 103 |
| 105 | Richard | 103 |
| 120 | Saanvi | 104 |
| 200 | Shirley | 104 |
| 205 | Zhang | 104 |

#### Non recursive form

##### Input Code:

##### Redshift

```sql
 WITH ManagerHierarchy AS (
    SELECT id AS employee_id, name AS employee_name, manager_id
    FROM employee
)
SELECT e.employee_name AS employee, m.employee_name AS manager
FROM ManagerHierarchy e
LEFT JOIN ManagerHierarchy m ON e.manager_id = m.employee_id;
```

##### Results

| EMPLOYEE | MANAGER |
| --- | --- |
| Carlos | null |
| John | Carlos |
| Jorge | John |
| Kwaku | John |
| Liu | John |
| Mateo | Jorge |
| Sofía | Jorge |
| Nikki | Kwaku |
| Paulo | Kwaku |
| Richard | Kwaku |
| Saanvi | Paulo |
| Shirley | Paulo |
| Zhang | Paulo |

##### Output Code:

##### Snowflake

```sql
 WITH ManagerHierarchy AS (
    SELECT id AS employee_id, name AS employee_name, manager_id
    FROM
    employee
)
SELECT e.employee_name AS employee, m.employee_name AS manager
FROM
    ManagerHierarchy e
LEFT JOIN
    ManagerHierarchy m ON e.manager_id = m.employee_id;
```

##### Results

| EMPLOYEE | MANAGER |
| --- | --- |
| John | Carlos |
| Jorge | John |
| Kwaku | John |
| Liu | John |
| Mateo | Jorge |
| Sofía | Jorge |
| Nikki | Kwaku |
| Paulo | Kwaku |
| Richard | Kwaku |
| Saanvi | Paulo |
| Shirley | Paulo |
| Zhang | Paulo |
| Carlos | null |

### Related EWIs

There are no known issues.

---
title: SnowConvert AI - Redshift - SELECT INTO
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/rs-sql-statements-select-into.md
section: Migrations
---

# SnowConvert AI - Redshift - SELECT INTO

## Description

> Returns rows from tables, views, and user-defined functions and inserts them into a new table. ([Redshift SQL Language Reference SELECT statement](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_synopsis.html))

## Grammar Syntax

```sql
 [ WITH with_subquery [, ...] ]
SELECT
[ TOP number ] [ ALL | DISTINCT ]
* | expression [ AS output_name ] [, ...]
INTO [ TEMPORARY | TEMP ] [ TABLE ] new_table
[ FROM table_reference [, ...] ]
[ WHERE condition ]
[ GROUP BY expression [, ...] ]
[ HAVING condition [, ...] ]
[ { UNION | INTERSECT | { EXCEPT | MINUS } } [ ALL ] query ]
[ ORDER BY expression
[ ASC | DESC ]
[ LIMIT { number | ALL } ]
[ OFFSET start ]
```

For more information please refer to each of the following links:

1. [WITH clause](rs-sql-statements-select.md)
2. [SELECT list](rs-sql-statements-select.md)
3. [FROM clause](rs-sql-statements-select.md)
4. [WHERE clause](rs-sql-statements-select.md)
5. [CONNECT BY clause](rs-sql-statements-select.md)
6. [GROUP BY clause](rs-sql-statements-select.md)
7. [HAVING clause](rs-sql-statements-select.md)
8. [QUALIFY clause](rs-sql-statements-select.md)
9. [UNION, INTERSECT, and EXCEPT](rs-sql-statements-select.md)
10. [ORDER BY clause](rs-sql-statements-select.md)
11. LIMIT and OFFSET clauses
12. Local Variables and Parameters

## FROM clause

### Description

> The `FROM` clause in a query lists the table references (tables, views, and subqueries) that data is selected from. If multiple table references are listed, the tables must be joined, using appropriate syntax in either the `FROM` clause or the `WHERE` clause. If no join criteria are specified, the system processes the query as a cross-join. ([Redshift SQL Language Reference FROM Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_FROM_clause30.html))

> **Warning:**
>
> The [FROM clause](https://docs.snowflake.com/en/sql-reference/constructs/from) is partially supported in Snowflake. [Object unpivoting](https://docs.aws.amazon.com/redshift/latest/dg/query-super.html#unpivoting) is not currently supported.

### Grammar Syntax

```sql
 FROM table_reference [, ...]

<table_reference> ::=
with_subquery_table_name [ table_alias ]
table_name [ * ] [ table_alias ]
( subquery ) [ table_alias ]
table_reference [ NATURAL ] join_type table_reference
   [ ON join_condition | USING ( join_column [, ...] ) ]
table_reference PIVOT (
   aggregate(expr) [ [ AS ] aggregate_alias ]
   FOR column_name IN ( expression [ AS ] in_alias [, ...] )
) [ table_alias ]
table_reference UNPIVOT [ INCLUDE NULLS | EXCLUDE NULLS ] (
   value_column_name
   FOR name_column_name IN ( column_reference [ [ AS ]
   in_alias ] [, ...] )
) [ table_alias ]
UNPIVOT expression AS value_alias [ AT attribute_alias ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE department (
    id INT,
    name VARCHAR(50),
    manager_id INT
);

INSERT INTO department(id, name, manager_id) VALUES
(1, 'HR', 100),
(2, 'Sales', 101),
(3, 'Engineering', 102),
(4, 'Marketing', 103);

SELECT e.name AS employee_name, d.name AS department_name
INTO employees_in_department
FROM employee e
INNER JOIN department d ON e.manager_id = d.manager_id;
```

##### Results

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Sofía | Engineering |

##### Output Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE department (
    id INT,
    name VARCHAR(50),
    manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO department (id, name, manager_id) VALUES
(1, 'HR', 100),
(2, 'Sales', 101),
(3, 'Engineering', 102),
(4, 'Marketing', 103);

CREATE TABLE IF NOT EXISTS employees_in_department AS
  SELECT e.name AS employee_name, d.name AS department_name
  FROM
    employee e
  INNER JOIN
      department d ON e.manager_id = d.manager_id;
```

##### Results

| EMPLOYEE_NAME | DEPARTMENT_NAME |
| --- | --- |
| John | HR |
| Jorge | Sales |
| Kwaku | Sales |
| Liu | Sales |
| Mateo | Engineering |
| Nikki | Marketing |
| Paulo | Marketing |
| Richard | Marketing |
| Sofía | Engineering |

### Known Issues

There are no known issues.

### Related EWIs.

See [SELECT](rs-sql-statements-select.md) transformation for related EWIs.

## GROUP BY clause

### Description

> The `GROUP BY` clause identifies the grouping columns for the query. Grouping columns must be declared when the query computes aggregates with standard functions such as `SUM`, `AVG`, and `COUNT`. ([Redshift SQL Language Reference GROUP BY Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_GROUP_BY_clause.html))

> **Note:**
>
> The [GROUP BY clause](https://docs.snowflake.com/en/sql-reference/constructs/group-by) is fully supported in Snowflake.

### Grammar Syntax

```sql
 GROUP BY expression [, ...]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT
    manager_id,
    COUNT(id) AS total_employees
INTO manager_employees
FROM employee
GROUP BY manager_id
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
|  | 1 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE IF NOT EXISTS manager_employees AS
  SELECT
      manager_id,
      COUNT(id) AS total_employees
  FROM
      employee
  GROUP BY manager_id
  ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 100 | 1 |
| 101 | 3 |
| 102 | 2 |
| 103 | 3 |
| 104 | 3 |
|  | 1 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## HAVING clause

### Description

> The `HAVING` clause applies a condition to the intermediate grouped result set that a query returns. ([Redshift SQL Language Reference HAVING Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_HAVING_clause.html))

> **Note:**
>
> The [HAVING clause](https://docs.snowflake.com/en/sql-reference/constructs/having) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ HAVING condition ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT manager_id, COUNT(id) AS total_employees
INTO manager_employees
FROM
employee
GROUP BY manager_id
HAVING COUNT(id) > 2
ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 101 | 3 |
| 103 | 3 |
| 104 | 3 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE IF NOT EXISTS manager_employees AS
  SELECT manager_id, COUNT(id) AS total_employees
  FROM
    employee
  GROUP BY manager_id
  HAVING COUNT(id) > 2
  ORDER BY manager_id;
```

##### Results

| MANAGER_ID | TOTAL_EMPLOYEES |
| --- | --- |
| 101 | 3 |
| 103 | 3 |
| 104 | 3 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## LIMIT and OFFSET clauses

### Description

> The LIMIT and OFFSET clauses retrieves and skips the number of rows specified in the number.

> **Note:**
>
> The [LIMIT and OFFSET](https://docs.snowflake.com/en/sql-reference/constructs/limit) clauses are fully supported in Snowflake.

### Grammar Syntax

```sql
 [ LIMIT { number | ALL } ]
[ OFFSET start ]
```

### Sample Source Patterns

#### LIMIT number

##### Input Code:

##### Redshift

```sql
 SELECT id, name, manager_id, salary
INTO limited_employees
FROM employee
LIMIT 5;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 100 | Carlos |  | 120000.00 |
| 101 | John | 100 | 90000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 104 | Paulo | 102 | 110000.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS limited_employees AS
SELECT id, name, manager_id, salary
FROM
employee
LIMIT 5;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 100 | Carlos |  | 120000.00 |
| 101 | John | 100 | 90000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 104 | Paulo | 102 | 110000.00 |

#### LIMIT ALL

##### Input Code:

##### Redshift

```sql
 SELECT id, name, manager_id, salary
INTO limited_employees
FROM employee
LIMIT ALL;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 100 | Carlos |  | 120000.00 |
| 101 | John | 100 | 90000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 104 | Paulo | 102 | 110000.00 |
| 105 | Richard | 102 | 85000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 107 | Liu | 103 | 108000.00 |
| 108 | Zhang | 104 | 95000.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS limited_employees AS
SELECT id, name, manager_id, salary
FROM
employee
LIMIT NULL;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 100 | Carlos |  | 120000.00 |
| 101 | John | 100 | 90000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 104 | Paulo | 102 | 110000.00 |
| 105 | Richard | 102 | 85000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 107 | Liu | 103 | 108000.00 |
| 108 | Zhang | 104 | 95000.00 |

#### OFFSET without LIMIT

Snowflake doesn’t support OFFSET without LIMIT. The LIMIT is added after transformation with NULL, which is the default LIMIT.

##### Input Code:

##### Redshift

```sql
 SELECT id, name, manager_id, salary
INTO limited_employees
FROM employee
OFFSET 5;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 105 | Richard | 102 | 85000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 107 | Liu | 103 | 108000.00 |
| 108 | Zhang | 104 | 95000.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS limited_employees AS
SELECT id, name, manager_id, salary
FROM
employee
LIMIT NULL
OFFSET 5;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 105 | Richard | 102 | 85000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 107 | Liu | 103 | 108000.00 |
| 108 | Zhang | 104 | 95000.00 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## Local Variables and Parameters

### Description

> Redshift also allows SELECT INTO variables when the statement is executed inside stored procedures.

> **Note:**
>
> This pattern is fully supported in Snowflake.

### Grammar Syntax

```sql
 SELECT [ select_expressions ] INTO target [ select_expressions ] FROM ...;
```

### Sample Source Patterns

#### SELECT INTO with expressions at the left

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE test_sp1(out param1 int)
AS $$
DECLARE
    var1 int;
BEGIN
     select 10, 100 into param1, var1;
END;
$$ LANGUAGE plpgsql;
```

##### Results

| param1 |
| --- |
| 10 |

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE test_sp1 (param1 OUT int)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            var1 int;
BEGIN
     select 10, 100 into
                : param1,
                : var1;
END;
$$;
```

##### Results

| TEST_SP1 |
| --- |
| { “param1”: 10 } |

#### SELECT INTO with expressions at the right

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE test_sp1(out param1 int)
AS $$
DECLARE
    var1 int;
BEGIN
     select into param1, var1 10, 100;
END;
$$ LANGUAGE plpgsql;
```

##### Results

| param1 |
| --- |
| 10 |

##### Output Code:

Since Snowflake doesn’t support this grammar for SELECT INTO, the expressions are moved to the left of the INTO.

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE test_sp1 (param1 OUT int)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            var1 int;
BEGIN
     select
                10, 100
            into
                : param1,
                : var1;
END;
$$;
```

##### Results

| TEST_SP1 |
| --- |
| { “param1”: 10 } |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## ORDER BY clause

### Description

> The `ORDER BY` clause sorts the result set of a query. ([Redshift SQL Language Reference Order By Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_ORDER_BY_clause.html))

> **Note:**
>
> The [ORDER BY clause](https://docs.snowflake.com/en/sql-reference/constructs/order-by) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ ORDER BY expression [ ASC | DESC ] ]
[ NULLS FIRST | NULLS LAST ]
[ LIMIT { count | ALL } ]
[ OFFSET start ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
    id INT,
    name VARCHAR(20),
    manager_id INT,
    salary DECIMAL(10, 2)
);

INSERT INTO employee (id, name, manager_id, salary) VALUES
(100, 'Carlos', NULL, 120000.00),
(101, 'John', 100, 90000.00),
(102, 'Jorge', 101, 95000.00),
(103, 'Kwaku', 101, 105000.00),
(104, 'Paulo', 102, 110000.00),
(105, 'Richard', 102, 85000.00),
(106, 'Mateo', 103, 95000.00),
(107, 'Liu', 103, 108000.00),
(108, 'Zhang', 104, 95000.00);

SELECT id, name, manager_id, salary
INTO salaries
FROM employee
ORDER BY salary DESC NULLS LAST, name ASC NULLS FIRST
LIMIT 5
OFFSET 2;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 107 | Liu | 103 | 108000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 108 | Zhang | 104 | 95000.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
    id INT,
    name VARCHAR(20),
    manager_id INT,
    salary DECIMAL(10, 2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id, salary) VALUES
(100, 'Carlos', NULL, 120000.00),
(101, 'John', 100, 90000.00),
(102, 'Jorge', 101, 95000.00),
(103, 'Kwaku', 101, 105000.00),
(104, 'Paulo', 102, 110000.00),
(105, 'Richard', 102, 85000.00),
(106, 'Mateo', 103, 95000.00),
(107, 'Liu', 103, 108000.00),
(108, 'Zhang', 104, 95000.00);

CREATE TABLE IF NOT EXISTS salaries AS
    SELECT id, name, manager_id, salary
    FROM
        employee
    ORDER BY salary DESC NULLS LAST, name ASC NULLS FIRST
    LIMIT 5
    OFFSET 2;
```

##### Results

| ID | NAME | MANAGER_ID | SALARY |
| --- | --- | --- | --- |
| 107 | Liu | 103 | 108000.00 |
| 103 | Kwaku | 101 | 105000.00 |
| 102 | Jorge | 101 | 95000.00 |
| 106 | Mateo | 103 | 95000.00 |
| 108 | Zhang | 104 | 95000.00 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## SELECT list

### Description

> The SELECT list names the columns, functions, and expressions that you want the query to return. The list represents the output of the query. ([Redshift SQL Language Reference SELECT list](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_list.html))

> **Note:**
>
> The [query start options](https://docs.snowflake.com/en/sql-reference/sql/select) are fully supported in Snowflake. Just keep in mind that in Snowflake the `DISTINCT` and `ALL` options must go at the beginning of the query.

> **Note:**
>
> In Redshift, if your application allows foreign keys or invalid primary keys, it can cause queries to return incorrect results. For example, a SELECT DISTINCT query could return duplicate rows if the primary key column does not contain all unique values. ([Redshift SQL Language Reference SELECT list](https://docs.aws.amazon.com/redshift/latest/dg/r_SELECT_list.html))

### Grammar Syntax

```sql
 SELECT
[ TOP number ]
[ ALL | DISTINCT ] * | expression [ AS column_alias ] [, ...]
```

### Sample Source Patterns

#### Top clause

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT TOP 5 id, name, manager_id
INTO top_employees
FROM employee;

SELECT * FROM top_employees;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 100 | Carlos | null |
| 101 | John | 100 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee
(
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE IF NOT EXISTS top_employees AS
SELECT TOP 5 id, name, manager_id
  FROM
    employee;

SELECT * FROM
  top_employees;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 100 | Carlos | null |
| 101 | John | 100 |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |

#### ALL

##### Input Code:

##### Redshift

```sql
SELECT ALL manager_id
INTO manager
FROM employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 101 |
| 101 |
| 102 |
| 103 |
| 103 |
| 103 |
| 104 |
| 104 |
| 102 |
| 104 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS manager AS
SELECT ALL manager_id
FROM
employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 101 |
| 101 |
| 102 |
| 103 |
| 103 |
| 103 |
| 104 |
| 104 |
| 102 |
| 104 |

#### DISTINCT

##### Input Code:

##### Redshift

```sql
SELECT DISTINCT manager_id
INTO manager
FROM employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 102 |
| 103 |
| 104 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS manager AS
SELECT DISTINCT manager_id
FROM
employee;
```

##### Results

| MANAGER_ID |
| --- |
| null |
| 100 |
| 101 |
| 102 |
| 103 |
| 104 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## UNION, INTERSECT, and EXCEPT

### Description

> The `UNION`, `INTERSECT`, and `EXCEPT` *set operators* are used to compare and merge the results of two separate query expressions. ([Redshift SQL Language Reference Set Operators](https://docs.aws.amazon.com/redshift/latest/dg/r_UNION.html))

> **Note:**
>
> [Set operators](https://docs.snowflake.com/en/sql-reference/operators-query) are fully supported in Snowflake.

### Grammar Syntax

```sql
 query
{ UNION [ ALL ] | INTERSECT | EXCEPT | MINUS }
query
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 SELECT id, name, manager_id
INTO some_employees
FROM
employee
WHERE manager_id = 101

UNION

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 102

UNION ALL

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

INTERSECT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 103

EXCEPT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 104;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |
| 102 | Jorge | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS some_employees AS
SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

UNION

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 102

UNION ALL

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 101

INTERSECT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 103

EXCEPT

SELECT id, name, manager_id
FROM
employee
WHERE manager_id = 104;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 102 | Jorge | 101 |
| 103 | Kwaku | 101 |
| 110 | Liu | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## WHERE clause

### Description

> The `WHERE` clause contains conditions that either join tables or apply predicates to columns in tables. ([Redshift SQL Language Reference WHERE Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_WHERE_clause.html))

> **Note:**
>
> The [WHERE clause](https://docs.snowflake.com/en/sql-reference/constructs/where) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WHERE condition ]
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

SELECT id, name, manager_id
INTO employee_names
FROM employee
WHERE name LIKE 'J%';
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 102 | Jorge | 101 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
  id INT,
  name VARCHAR(20),
  manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/06/2025",  "domain": "test" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

CREATE TABLE IF NOT EXISTS employee_names AS
  SELECT id, name, manager_id
  FROM
    employee
  WHERE name LIKE 'J%' ESCAPE '\\';
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 102 | Jorge | 101 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

## WITH clause

### Description

> A `WITH` clause is an optional clause that precedes the SELECT INTO in a query. The `WITH` clause defines one or more *common_table_expressions*. Each common table expression (CTE) defines a temporary table, which is similar to a view definition. You can reference these temporary tables in the `FROM` clause. ([Redshift SQL Language Reference WITH Clause](https://docs.aws.amazon.com/redshift/latest/dg/r_WITH_clause.html))

> **Note:**
>
> The [WITH clause](https://docs.snowflake.com/en/sql-reference/constructs/with) is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WITH [RECURSIVE] common_table_expression [, common_table_expression , ...] ]

--Where common_table_expression can be either non-recursive or recursive.
--Following is the non-recursive form:
CTE_table_name [ ( column_name [, ...] ) ] AS ( query )

--Following is the recursive form of common_table_expression:
CTE_table_name (column_name [, ...] ) AS ( recursive_query )
```

### Sample Source Patterns

#### Non-Recursive form

##### Input Code:

##### Redshift

```sql
 CREATE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    total_amount DECIMAL(10,2)
);

INSERT INTO orders (order_id, customer_id, order_date, total_amount)
VALUES
(1, 101, '2024-02-01', 250.00),
(2, 102, '2024-02-02', 600.00),
(3, 103, '2024-02-03', 150.00),
(4, 104, '2024-02-04', 750.00),
(5, 105, '2024-02-05', 900.00);

WITH HighValueOrders AS (
    SELECT
        order_id,
        customer_id,
        order_date,
        total_amount
    FROM orders
    WHERE total_amount > 500
)
SELECT * INTO high_value_orders FROM HighValueOrders;

SELECT * FROM high_value_orders;
```

##### Results

| ORDER_ID | CUSTOMER_ID | ORDER_DATE | TOTAL_AMOUNT |
| --- | --- | --- | --- |
| 2 | 102 | 2024-02-02 | 600.00 |
| 4 | 104 | 2024-02-04 | 750.00 |
| 5 | 105 | 2024-02-05 | 900.00 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    total_amount DECIMAL(10,2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}';

INSERT INTO orders (order_id, customer_id, order_date, total_amount)
VALUES
(1, 101, '2024-02-01', 250.00),
(2, 102, '2024-02-02', 600.00),
(3, 103, '2024-02-03', 150.00),
(4, 104, '2024-02-04', 750.00),
(5, 105, '2024-02-05', 900.00);

CREATE TABLE IF NOT EXISTS high_value_orders AS
WITH HighValueOrders AS (
    SELECT
        order_id,
        customer_id,
        order_date,
        total_amount
    FROM
        orders
    WHERE total_amount > 500
    )
    SELECT *
    FROM
    HighValueOrders;

SELECT * FROM
    high_value_orders;
```

##### Results

| ORDER_ID | CUSTOMER_ID | ORDER_DATE | TOTAL_AMOUNT |
| --- | --- | --- | --- |
| 2 | 102 | 2024-02-02 | 600.00 |
| 4 | 104 | 2024-02-04 | 750.00 |
| 5 | 105 | 2024-02-05 | 900.00 |

#### Recursive form

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employee (
   id INT,
   name VARCHAR(20),
   manager_id INT
);

INSERT INTO employee(id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);

WITH RECURSIVE john_org(id, name, manager_id, level)
AS
(
   SELECT id, name, manager_id, 1 AS level
   FROM employee
   WHERE name = 'John'
   UNION ALL
   SELECT e.id, e.name, e.manager_id, level + 1 AS next_level
   FROM employee e, john_org j
   WHERE e.manager_id = j.id and level < 4
)
SELECT DISTINCT id, name, manager_id into new_org FROM john_org ORDER BY manager_id;

SELECT * FROM new_org;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 103 | Kwaku | 101 |
| 102 | Jorge | 101 |
| 110 | Liu | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |
| 105 | Richard | 103 |
| 110 | Nikki | 103 |
| 104 | Paulo | 103 |
| 120 | Saanvi | 104 |
| 200 | Shirley | 104 |
| 205 | Zhang | 104 |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employee (
   id INT,
   name VARCHAR(20),
   manager_id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}';

INSERT INTO employee (id, name, manager_id) VALUES
(100, 'Carlos', null),
(101, 'John', 100),
(102, 'Jorge', 101),
(103, 'Kwaku', 101),
(110, 'Liu', 101),
(106, 'Mateo', 102),
(110, 'Nikki', 103),
(104, 'Paulo', 103),
(105, 'Richard', 103),
(120, 'Saanvi', 104),
(200, 'Shirley', 104),
(201, 'Sofía', 102),
(205, 'Zhang', 104);
CREATE TABLE IF NOT EXISTS new_org AS
WITH RECURSIVE john_org(id, name, manager_id, level)
AS
(
   SELECT id, name, manager_id, 1 AS level
   FROM
         employee
   WHERE name = 'John'
   UNION ALL
   SELECT e.id, e.name, e.manager_id, level + 1 AS next_level
   FROM
         employee e,
         john_org j
   WHERE e.manager_id = j.id and level < 4
   )
   SELECT DISTINCT id, name, manager_id
   FROM
   john_org
   ORDER BY manager_id;
SELECT * FROM
   new_org;
```

##### Results

| ID | NAME | MANAGER_ID |
| --- | --- | --- |
| 101 | John | 100 |
| 103 | Kwaku | 101 |
| 102 | Jorge | 101 |
| 110 | Liu | 101 |
| 106 | Mateo | 102 |
| 201 | Sofía | 102 |
| 105 | Richard | 103 |
| 110 | Nikki | 103 |
| 104 | Paulo | 103 |
| 120 | Saanvi | 104 |
| 200 | Shirley | 104 |
| 205 | Zhang | 104 |

### Known Issues

There are no known issues.

### Related EWIs.

There are no related EWIs.

---
title: SnowConvert AI - Redshift - SQL Statements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-sql-statements.md
section: Migrations
---

# SnowConvert AI - Redshift - SQL Statements

Translation reference for all the supported statements by SnowConvert AI for Redshift.

## CALL

### Description

> Runs a stored procedure. The CALL command must include the procedure name and the input argument values. You must call a stored procedure by using the CALL statement. ([Redshift SQL Language Reference CALL](https://docs.aws.amazon.com/redshift/latest/dg/r_CALL_procedure.html)).

### Grammar Syntax

```sql
 CALL sp_name ( [ argument ] [, ...] )
```

### Sample Source Patterns

#### Base scenario

##### Input Code:

##### Redshift

```sql
 CREATE PROCEDURE sp_insert_values(IN arg1 INT, IN arg2 DATE)
LANGUAGE plpgsql
AS
$$
BEGIN
    INSERT INTO event VALUES (arg1, arg2);
END;
$$;

CALL sp_insert_values(1, CURRENT_DATE);
```

##### Output Code:

##### Redshift

```sql
 CREATE PROCEDURE sp_insert_values (arg1 INT, arg2 DATE)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS
$$
BEGIN
    INSERT INTO event
    VALUES (:arg1, : arg2);
END;
$$;

CALL sp_insert_values(1, CURRENT_DATE());
```

#### Call using Output Parameters Mode (INOUT, OUT)

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE sp_calculate_sum_product(IN a NUMERIC, IN b NUMERIC, INOUT sum_result NUMERIC, INOUT product_result NUMERIC)
LANGUAGE plpgsql
AS $$
BEGIN
    sum_result := a + b;
    product_result := a * b;
END;
$$;

CREATE OR REPLACE PROCEDURE call_sp_calculate_sum_product()
LANGUAGE plpgsql
AS $$
DECLARE
    sum_value NUMERIC DEFAULT null;
    product_value NUMERIC DEFAULT null;
BEGIN
    CALL sp_calculate_sum_product(FLOOR(20.5)::NUMERIC, CEIL(20.7)::NUMERIC, sum_value, product_value);
    INSERT INTO test VALUES (sum_value, product_value);
END;
$$;

CALL call_sp_calculate_sum_product();
```

##### Output Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE sp_calculate_sum_product (a NUMERIC, b NUMERIC, sum_result OUT NUMERIC, product_result OUT NUMERIC)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
    sum_result := a + b;
    product_result := a * b;
END;
$$;

CREATE OR REPLACE PROCEDURE call_sp_calculate_sum_product ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
DECLARE
    sum_value NUMERIC DEFAULT null;
    product_value NUMERIC DEFAULT null;
BEGIN
    CALL sp_calculate_sum_product(FLOOR(20.5)::NUMERIC, CEIL(20.7)::NUMERIC, : sum_value, : product_value);
    INSERT INTO test
    VALUES (:sum_value, : product_value);
END;
$$;

CALL call_sp_calculate_sum_product();
```

### Known Issues

* Output parameters from calls outside procedures won’t work.

### Related EWIs.

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review

## CREATE DATABASE

### Grammar Syntax

```sql
 CREATE DATABASE database_name
[ { [ WITH ]
    [ OWNER [=] db_owner ]
    [ CONNECTION LIMIT { limit | UNLIMITED } ]
    [ COLLATE { CASE_SENSITIVE | CASE_INSENSITIVE } ]
    [ ISOLATION LEVEL { SERIALIZABLE | SNAPSHOT } ]
  }
  | { [ WITH PERMISSIONS ] FROM DATASHARE datashare_name ] OF [ ACCOUNT account_id ] NAMESPACE namespace_guid }
  | { FROM { { ARN '<arn>' } { WITH DATA CATALOG SCHEMA '<schema>' | WITH NO DATA CATALOG SCHEMA } }
             | { INTEGRATION '<integration_id>'} }
  | { IAM_ROLE  {default | 'SESSION' | 'arn:aws:iam::<account-id>:role/<role-name>' } }
```

For more information please refer to Redshift [`CREATE DATABASE` documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_DATABASE.html).

### Sample Source Patterns

#### Basic samples

##### Input Code:

##### Redshift

```sql
 CREATE DATABASE database_name;
```

##### Output Code:

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_name
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/25/2024" }}';
```

#### Collate Clause

##### Input Code:

##### Redshift

```sql
 CREATE DATABASE database_collate
COLLATE CASE_INSENSITIVE;
```

##### Output Code:

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_collate
DEFAULT_DDL_COLLATION='en-ci'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

#### Connection Limit Clause

##### Input Code:

##### Redshift

```sql
 CREATE DATABASE database_connection
CONNECTION LIMIT UNLIMITED;
```

##### Output Code:

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_connection
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

> **Warning:**
>
> The connection limit clause is removed since the connection concurrency in Snowflake is managed by warehouse. For more information, see the [Snowflake MAX_CONCURRENCY_LEVEL parameter](https://docs.snowflake.com/en/sql-reference/parameters#label-max-concurrency-level).

#### From ARN Clause

##### Input Code:

##### Redshift

```sql
 CREATE DATABASE database_fromARN
FROM ARN 'arn' WITH NO DATA CATALOG SCHEMA IAM_ROLE 'arn:aws:iam::<account-id>:role/<role-name';
```

##### Output Code:

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_fromARN
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

> **Warning:**
>
> This clause is removed since it is used to reference [Amazon Resources](https://docs.aws.amazon.com/IAM/latest/UserGuide/reference-arns.html), not valid in Snowflake.

#### From Datashare Clause

##### Input Code

##### Redshift

```sql
 CREATE DATABASE database_fromDatashare
FROM DATASHARE datashare_name OF NAMESPACE 'namespace_guid';
```

##### Output Code

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS  database_fromDatashare
FROM DATASHARE datashare_name OF NAMESPACE 'namespace_guid' !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'FromDatashareAttribute' NODE ***/!!!
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

> **Note:**
>
> The transformation for Datashare is planned to be delivered in the future.

#### Owner Clause

##### Input Code

##### Redshift

```sql
 CREATE DATABASE database_Owner
OWNER db_owner
ENCODING 'encoding';
```

##### Output Code

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_Owner
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

> **Warning:**
>
> Please be aware that for this case, the owner clause is removed from the code since Snowflake databases are owned by roles, not individual users. For more information please refer to [Snowflake `GRANT OWNERSHIP` documentation](https://docs.snowflake.com/en/sql-reference/sql/grant-ownership).

#### Isolation Level Clause

##### Input Code

##### Redshift

```sql
 CREATE DATABASE database_Isolation
ISOLATION LEVEL SNAPSHOT;
```

##### Output Code

##### Snowflake

```sql
 CREATE DATABASE IF NOT EXISTS database_Isolation
ISOLATION LEVEL SNAPSHOT !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'IsolationLevelAttribute' NODE ***/!!!
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/24/2024" }}';
```

> **Note:**
>
> The transformation for Isolation Level is planned to be delivered in the future.

### Related EWIs

* [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review

## CREATE EXTERNAL TABLE

### Description

Currently SnowConvert AI is transforming `CREATE EXTERNAL TABLES` to regular tables, that implies additional effort because data stored in external RedShift tables must be transferred to the Snowflake database.

### Grammar Syntax

```sql
 CREATE EXTERNAL TABLE
external_schema.table_name
(column_name data_type [, …] )
[ PARTITIONED BY (col_name data_type [, … ] )]
[ { ROW FORMAT DELIMITED row_format |
  ROW FORMAT SERDE 'serde_name'
  [ WITH SERDEPROPERTIES ( 'property_name' = 'property_value' [, ...] ) ] } ]
STORED AS file_format
LOCATION { 's3://bucket/folder/' | 's3://bucket/manifest_file' }
[ TABLE PROPERTIES ( 'property_name'='property_value' [, ...] ) ]

CREATE EXTERNAL TABLE
external_schema.table_name
[ PARTITIONED BY (col_name [, … ] ) ]
[ ROW FORMAT DELIMITED row_format ]
STORED AS file_format
LOCATION { 's3://bucket/folder/' }
[ TABLE PROPERTIES ( 'property_name'='property_value' [, ...] ) ]
 AS
 { select_statement }
```

See the [Redshift CREATE EXTERNAL TABLE specification](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_EXTERNAL_TABLE.html) for this syntax.

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE EXTERNAL TABLE
external_schema.sales_data
(
    sales_id INT,
    product_id INT,
    sales_amount DECIMAL(10, 2),
    sales_date DATE
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE
LOCATION 's3://mybucket/sales_data/';
```

##### Output Code:

##### Snowflake

```sql
 --** SSC-FDM-0004 - EXTERNAL TABLE TRANSLATED TO REGULAR TABLE **
CREATE TABLE external_schema.sales_data
(
    sales_id INT,
    product_id INT,
    sales_amount DECIMAL(10, 2),
    sales_date DATE
)
--ROW FORMAT DELIMITED
--FIELDS TERMINATED BY ','
--STORED AS TEXTFILE
--LOCATION 's3://mybucket/sales_data/'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;
```

#### Create External Table AS

##### Input Code:

##### Redshift

```sql
 CREATE EXTERNAL TABLE spectrum.partitioned_lineitem
PARTITIONED BY (l_shipdate, l_shipmode)
STORED AS parquet
LOCATION 'S3://amzn-s3-demo-bucket/cetas/partitioned_lineitem/'
AS SELECT l_orderkey, l_shipmode, l_shipdate, l_partkey FROM local_table;
```

##### Output Code:

##### Snowflake

```sql
 --** SSC-FDM-0004 - EXTERNAL TABLE TRANSLATED TO REGULAR TABLE **
CREATE TABLE spectrum.partitioned_lineitem
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
--PARTITIONED BY (l_shipdate, l_shipmode)
--STORED AS parquet
--LOCATION 'S3://amzn-s3-demo-bucket/cetas/partitioned_lineitem/'
AS SELECT l_orderkey, l_shipmode, l_shipdate, l_partkey FROM
local_table;
```

### Recommendations

* For the usage of Create External Table in Snowflake you may refer to [Snowflake’s documentation.](https://docs.snowflake.com/en/sql-reference/sql/create-external-table)

### Related EWIs

1. [SSC-FDM-0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): External table translated to regular table

## CREATE MATERIALIZED VIEW

### Description

In SnowConvert AI, Redshift Materialized Views are transformed into Snowflake Dynamic Tables. To properly configure Dynamic Tables, two essential parameters must be defined: TARGET_LAG and WAREHOUSE. If these parameters are left unspecified in the configuration options, SnowConvert AI will default to preassigned values during the conversion, as demonstrated in the example below.

For more information, see the [Redshift CREATE MATERIALIZED VIEW documentation](https://docs.aws.amazon.com/redshift/latest/dg/materialized-view-create-sql-command.html).

For details on the necessary parameters, see the [Snowflake CREATE DYNAMIC TABLE documentation](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table).

### Grammar Syntax

The following is the SQL syntax to create a view in Amazon Redshift. See the [Redshift CREATE MATERIALIZED VIEW specification](https://docs.aws.amazon.com/redshift/latest/dg/materialized-view-create-sql-command.html) for this syntax.

```sql
 CREATE MATERIALIZED VIEW mv_name
[ BACKUP { YES | NO } ]
[ table_attributes ]
[ AUTO REFRESH { YES | NO } ]
AS query
```

### Sample Source Patterns

#### Input Code:

##### Redshift

```sql
 CREATE MATERIALIZED VIEW mv_baseball AS
SELECT ball AS baseball FROM baseball_table;
```

##### Output Code:

##### Snowflake

```sql
 CREATE DYNAMIC TABLE mv_baseball
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "11/26/2024",  "domain": "test" }}'
AS
    SELECT ball AS baseball FROM
        baseball_table;
```

> **Note:**
>
> For the table attributes documentation you can check de following documentation:
>
> * [Sortkey](redshift-sql-statements-create-table.md)
> * [DistKey](redshift-sql-statements-create-table.md)
> * [DistStyle](redshift-sql-statements-create-table.md)

> **Warning:**
>
> The BACKUP and AUTO REFRESH clauses are deleted since they are not applicable in a Snowflake’s Dynamic Table

### Related Ewis

* [SSC-FDM-0031](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

## CREATE SCHEMA

### Grammar Syntax

```sql
 CREATE SCHEMA [ IF NOT EXISTS ] schema_name [ AUTHORIZATION username ]
           [ QUOTA {quota [MB | GB | TB] | UNLIMITED} ] [ schema_element [ ... ]

CREATE SCHEMA AUTHORIZATION username [ QUOTA {quota [MB | GB | TB] | UNLIMITED} ]
[ schema_element [ ... ] ]
```

For more information please refer to [Redshift `CREATE SCHEMA` documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_SCHEMA.html).

### Sample Source Patterns

#### Basic samples

##### Input Code:

##### Redshift

```sql
 CREATE SCHEMA s1;

CREATE SCHEMA IF NOT EXISTS s2;

CREATE SCHEMA s3
CREATE TABLE t1
(
    col1 INT
)
CREATE VIEW v1 AS SELECT * FROM t1;
```

##### Output Code:

##### Snowflake

```sql
 CREATE SCHEMA IF NOT EXISTS s1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;

CREATE SCHEMA IF NOT EXISTS s2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;

CREATE SCHEMA IF NOT EXISTS s3
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;
CREATE TABLE t1
(
    col1 INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;
CREATE VIEW v1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
AS SELECT * FROM
    t1;
```

#### Authorization Clause

##### Input Code:

##### Redshift

```sql
 CREATE SCHEMA s1 AUTHORIZATION miller;
```

##### Output Code:

##### Snowflake

```sql
 CREATE SCHEMA IF NOT EXISTS s1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;
```

> **Warning:**
>
> Please be aware that for this case, the authorization clause is removed from the code since Snowflake schemas are owned by roles, not individual users. For more information please refer to [Snowflake `GRANT OWNERSHIP` documentation](https://docs.snowflake.com/en/sql-reference/sql/grant-ownership).

#### Quota Clause

##### Input Code:

##### Redshift

```sql
 CREATE SCHEMA s1 QUOTA UNLIMITED;

CREATE SCHEMA s2 QUOTA 10 TB;
```

##### Output Code:

##### Snowflake

```sql
 CREATE SCHEMA IF NOT EXISTS s1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;

CREATE SCHEMA IF NOT EXISTS s2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;
```

> **Note:**
>
> In Snowflake is not allowed to define a quota per scheme. Storage management is done at the account and warehouse level, and Snowflake handles it automatically. For this reason it is removed from the code.

#### Create Schema Authorization

In Redshift when the schema name is not specified but the authorization clause is defined, a new schema is created with the owner’s name. For this reason this behavior is replicated in Snowflake.

##### Input Code:

##### Redshift

```sql
 CREATE SCHEMA AUTHORIZATION miller;
```

##### Output Code:

##### Snowflake

```sql
 CREATE SCHEMA IF NOT EXISTS miller
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/23/2024" }}'
;
```

### Related EWIs

There are no known issues.

## CREATE FUNCTION

### Description

This command defines a user-defined function (UDF) within the database. These functions encapsulate reusable logic that can be invoked within SQL queries.

### Grammar Syntax

The following is the SQL syntax to create a view in Amazon Redshift. See the [Redshift CREATE VIEW specification](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_VIEW.html) for this syntax.

```sql
 CREATE [ OR REPLACE ] FUNCTION f_function_name
( { [py_arg_name  py_arg_data_type |
sql_arg_data_type } [ , ... ] ] )
RETURNS data_type
{ VOLATILE | STABLE | IMMUTABLE }
AS $$
  { python_program | SELECT_clause }
$$ LANGUAGE { plpythonu | sql }
```

### SQL Language

#### Volatility category

In Snowflake, `VOLATILE` and `IMMUTABLE` function volatility are functionally equivalent. Given that `STABLE` is inherently transformed to the default `VOLATILE` behavior, explicit use of `STABLE` will be deleted.

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE FUNCTION get_sale(INTEGER)
RETURNS FLOAT
STABLE
AS $$
SELECT price FROM sales where id = $1
$$ LANGUAGE SQL;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE FUNCTION get_sale (SC_ARG1 INTEGER)
RETURNS FLOAT
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
SELECT price FROM
sales
where id = SC_ARG1
$$
;
```

### Python Language

Within the SnowConvert AI scope, the Python language for `CREATE FUNCTION` statements is not supported. Consequently, the language `plpythonu` will be flagged with an EWI (SSC-EWI-0073), and its body could appear with parsing errors.

#### Input Code:

##### Redshift

```sql
 create function f_py_greater (a float, b float)
  returns float
stable
as $$
  if a > b:
    return a
  return b
$$ language plpythonu;
```

##### Output Code:

##### Snowflake

```sql
 create function f_py_greater (a float, b float)
returns float
language plpythonu !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'LANGUAGE PLPythonU' NODE ***/!!!
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
as $$
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '5' COLUMN '3' OF THE SOURCE CODE STARTING AT 'if'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'if' ON LINE '5' COLUMN '3'. **
--  if a > b:
--    return a
--  return b
$$
;
```

### Related EWIs

There are no known issues.

## CREATE VIEW

### Description

This command creates a view in a database, which is run every time the view is referenced in a query. Using the WITH NO SCHEMA BINDING clause, you can create views to an external table or objects that don’t exist yet. This clause, however, requires you to specify the qualified name of the object or table that you are referencing.

### Grammar Syntax

The following is the SQL syntax to create a view in Amazon Redshift. See the [Redshift CREATE VIEW specification](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_VIEW.html) for this syntax.

```sql
 CREATE [ OR REPLACE ] VIEW name [ ( column_name [, ...] ) ] AS query
[ WITH NO SCHEMA BINDING ]
```

### Sample Source Patterns

Considering the obligatory and optional clauses in Redshifts command, the output after migration to Snowflake is very similar.

#### Input Code:

##### Redshift

```sql
 CREATE VIEW myuser
AS
SELECT lastname FROM users;

CREATE VIEW myuser2
AS
SELECT lastname FROM users2
WITH NO SCHEMA BINDING;
```

##### Output Code:

##### Snowflake

```sql
 CREATE VIEW myuser
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/16/2025",  "domain": "test" }}'
AS
SELECT lastname FROM
users;

CREATE VIEW myuser2
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "01/16/2025",  "domain": "test" }}'
AS
SELECT lastname FROM
users2
!!!RESOLVE EWI!!! /*** SSC-EWI-RS0003 - WITH NO SCHEMA BINDING STATEMENT CAN NOT BE REMOVED DUE TO MISSING REFERENCES. ***/!!!
WITH NO SCHEMA BINDING;
```

There are some exceptions, however, of one unsupported clause from Redshift, therefore an EWI was implemented to cover this case.

### Related EWIs

* [SSC-EWI-RS0003](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): With no schema binding statement is not supported in Snowflake.

## DELETE

### Description

> Deletes rows from tables. ([Redshift SQL Language Reference Delete Statement](https://docs.aws.amazon.com/redshift/latest/dg/r_DELETE.html)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WITH [RECURSIVE] common_table_expression [, common_table_expression , ...] ]
DELETE [ FROM ] { table_name | materialized_view_name }
    [ USING table_name, ... ]
    [ WHERE condition ]
```

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE employees (
    id INT PRIMARY KEY,
    name VARCHAR(255) NOT NULL,
    department VARCHAR(255),
    manager_id INT REFERENCES employees(id)
);

INSERT INTO employees (id, name, department, manager_id) VALUES
(1, 'Alice', 'Sales', 2),
(2, 'Bob', 'Sales', 1),
(3, 'Charlie', 'Sales', 1),
(4, 'David', 'Marketing', 2),
(5, 'Eve', 'Marketing', 4),
(6, 'Frank', 'Marketing', 4),
(7, 'Grace', 'Engineering', 6),
(8, 'Helen', 'Engineering', 7),
(9, 'Ivy', 'Engineering', 7),
(10, 'John', 'Sales', 3),
(11, 'Joe', 'Engineering', 5);

CREATE TABLE departments (
    department_name VARCHAR(255)
);

INSERT INTO departments (department_name) VALUES
('Sales'),
('Marketing'),
('Engineering');
```

#### From Clause

Update a table by referencing information from other tables. In Redshift, the FROM keyword is optional, but in Snowflake, it is mandatory. Therefore, it will be added in cases where it’s missing.

##### Input Code:

##### Redshift

```sql
 DELETE employees;

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
|  |  |  |  |

##### Output Code:

##### Snowflake

```sql
 DELETE FROM
    employees;

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
|  |  |  |  |

#### Where Clause

Restricts updates to rows that match a condition. When the condition returns true, the specified SET columns are updated. The condition can be a simple predicate on a column or a condition based on the result of a subquery. This clause is fully equivalent in Snowflake.

##### Input Code:

##### Redshift

```sql
 DELETE FROM employees
WHERE department = 'Marketing';

SELECT * FROM employees
ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 1 | Alice | Sales | 2 |
| 2 | Bob | Sales | 1 |
| 3 | Charlie | Sales | 1 |
| 7 | Grace | Engineering | 6 |
| 8 | Helen | Engineering | 7 |
| 9 | Ivy | Engineering | 7 |
| 10 | John | Sales | 3 |
| 11 | Joe | Engineering | 5 |

##### Output Code:

##### Snowflake

```sql
 DELETE FROM
    employees
WHERE department = 'Marketing';

SELECT * FROM
    employees
ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 1 | Alice | Sales | 2 |
| 2 | Bob | Sales | 1 |
| 3 | Charlie | Sales | 1 |
| 7 | Grace | Engineering | 6 |
| 8 | Helen | Engineering | 7 |
| 9 | Ivy | Engineering | 7 |
| 10 | John | Sales | 3 |
| 11 | Joe | Engineering | 5 |

#### Using Clause

This clause introduces a list of tables when additional tables are referenced in the WHERE clause condition. This clause is fully equivalent in Snowflake.

##### Input Code:

##### Redshift

```sql
 DELETE FROM employees
USING departments d
WHERE employees.department = d.department_name
AND d.department_name = 'Sales';

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 4 | David | Marketing | 2 |
| 5 | Eve | Marketing | 4 |
| 6 | Frank | Marketing | 4 |
| 7 | Grace | Engineering | 6 |
| 8 | Helen | Engineering | 7 |
| 9 | Ivy | Engineering | 7 |
| 11 | Joe | Engineering | 5 |

##### Output Code:

##### Snowflake

```sql
 DELETE FROM employees
USING departments d
WHERE employees.department = d.department_name
AND d.department_name = 'Sales';

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 4 | David | Marketing | 2 |
| 5 | Eve | Marketing | 4 |
| 6 | Frank | Marketing | 4 |
| 7 | Grace | Engineering | 6 |
| 8 | Helen | Engineering | 7 |
| 9 | Ivy | Engineering | 7 |
| 11 | Joe | Engineering | 5 |

#### WITH clause

This clause specifies one or more Common Table Expressions (CTE). The output column names are optional for non-recursive CTEs, but mandatory for recursive ones.

Since this clause cannot be used in an DELETE statement, it is transformed into temporary tables with their corresponding queries. After the DELETE statement is executed, these temporary tables are dropped to clean up, release resources, and avoid name collisions when creating tables within the same session. Additionally, if a regular table with the same name exists, it will take precedence again, since the temporary table [has priority](https://docs.snowflake.com/en/user-guide/tables-temp-transient#potential-naming-conflicts-with-other-table-types) over any other table with the same name in the same session.

##### Non-Recursive CTE

##### Input Code:

##### Redshift

```sql
 WITH sales_employees AS (
    SELECT id
    FROM employees
    WHERE department = 'Sales'
), engineering_employees AS (
    SELECT id
    FROM employees
    WHERE department = 'Engineering'
)
DELETE FROM employees
WHERE id IN (SELECT id FROM sales_employees)
   OR id IN (SELECT id FROM engineering_employees);

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 4 | David | Marketing | 2 |
| 5 | Eve | Marketing | 4 |
| 6 | Frank | Marketing | 4 |

##### Output Code:

##### Snowflake

```sql
 CREATE TEMPORARY TABLE sales_employees AS
SELECT id
FROM employees
WHERE department = 'Sales';

CREATE TEMPORARY TABLE engineering_employees AS
SELECT id
FROM employees
WHERE department = 'Engineering';

DELETE FROM
    employees
WHERE id IN (SELECT id FROM sales_employees)
   OR id IN (SELECT id FROM engineering_employees);

DROP TABLE sales_employees;
DROP TABLE engineering_employees;

SELECT * FROM
    employees
ORDER BY id;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 4 | David | Marketing | 2 |
| 5 | Eve | Marketing | 4 |
| 6 | Frank | Marketing | 4 |

##### Recursive CTE

##### Input Code:

##### Redshift

```sql
 WITH RECURSIVE subordinate_hierarchy(id, name, department, level) AS (
    SELECT id, name, department, 0 as level
    FROM employees
    WHERE department = 'Marketing'

    UNION ALL

    SELECT e.id, e.name, e.department, sh.level + 1
    FROM employees e
    INNER JOIN subordinate_hierarchy sh ON e.manager_id = sh.id
)
DELETE FROM employees
WHERE id IN (SELECT id FROM subordinate_hierarchy);
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 1 | Alice | Sales | 2 |
| 2 | Bob | Sales | 1 |
| 3 | Charlie | Sales | 1 |
| 10 | John | Sales | 3 |

##### Output Code:

##### Snowflake

```sql
 CREATE TEMPORARY TABLE subordinate_hierarchy AS
   WITH RECURSIVE subordinate_hierarchy(id, name, department, level) AS (
       SELECT id, name, department, 0 as level
       FROM
           employees
       WHERE department = 'Marketing'

       UNION ALL

       SELECT e.id, e.name, e.department, sh.level + 1
       FROM
           employees e
       INNER JOIN
               subordinate_hierarchy sh ON e.manager_id = sh.id
   )
   SELECT
       id,
       name,
       department,
       level
   FROM
       subordinate_hierarchy;

   DELETE FROM
   employees
   WHERE id IN (SELECT id FROM
           subordinate_hierarchy
   );

   DROP TABLE subordinate_hierarchy;
```

##### Result

| ID | NAME | DEPARTMENT | MANAGER_ID |
| --- | --- | --- | --- |
| 1 | Alice | Sales | 2 |
| 2 | Bob | Sales | 1 |
| 3 | Charlie | Sales | 1 |
| 10 | John | Sales | 3 |

#### Delete Materialized View

In Redshift, you can apply the DELETE statement to materialized views used for [streaming ingestion](https://docs.aws.amazon.com/redshift/latest/dg/materialized-view-streaming-ingestion.html). In Snowflake, these views are transformed into dynamic tables, and the DELETE statement cannot be used on dynamic tables. For this reason, an EWI will be added.

##### Input Code:

##### Redshift

```sql
 CREATE MATERIALIZED VIEW emp_mv AS
SELECT id, name, department FROM employees WHERE department = 'Engineering';

DELETE FROM emp_mv
WHERE id = 2;
```

##### Output Code:

##### Snowflake

```sql
 CREATE DYNAMIC TABLE emp_mv
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/11/2025",  "domain": "test" }}'
AS
SELECT id, name, department FROM
employees
WHERE department = 'Engineering';

!!!RESOLVE EWI!!! /*** SSC-EWI-RS0008 - MATERIALIZED VIEW IS TRANSFORMED INTO A DYNAMIC TABLE, AND THE DELETE STATEMENT CANNOT BE USED ON DYNAMIC TABLES IN SNOWFLAKE. ***/!!!
DELETE FROM
emp_mv
WHERE id = 2;
```

### Known Issues

* Replicating the functionality of the `WITH` clause requires creating temporary tables mirroring each Common Table Expression (CTE). However, this approach fails if a temporary table with the same name already exists within the current session, causing an error.

### Related EWIs

1. [SSC-FDM-0031](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default.
2. [SSC-EWI-RS0008](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): Materialized view is transformed into a dynamic table, and the DELETE statement cannot be used on dynamic tables in Snowflake.

## EXECUTE

### Description

> The `EXECUTE` `IMMEDIATE` statement builds and runs a dynamic SQL statement in a single operation.
>
> Native dynamic SQL uses the `EXECUTE` `IMMEDIATE` statement to process most dynamic SQL statements. ([Redshift Language Reference EXECUTE Statement](https://docs.aws.amazon.com/redshift/latest/dg/c_PLpgSQL-statements.html#r_PLpgSQL-dynamic-sql))

### Grammar Syntax

```sql
 EXECUTE command-string [ INTO target ];
```

### Sample Source Patterns

Concated Example

Input Code

#### Redshift

```sql
 CREATE OR REPLACE PROCEDURE create_dynamic_table(table_name VARCHAR)
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'CREATE TABLE IF NOT EXISTS ' || table_name || ' (id INT, value VARCHAR);';
EXECUTE sql_statement;
END;
$$ LANGUAGE plpgsql;
```

Output Code

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE create_dynamic_table (table_name VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'CREATE TABLE IF NOT EXISTS ' || table_name || ' (id INT, value VARCHAR)';
!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
EXECUTE IMMEDIATE sql_statement;
END;
$$;
```

#### Function Transformation

##### Input Code

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE insert_with_dynamic()
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'insert into orders(order_date) values ("getdate"());';
EXECUTE sql_statement;
END;
$$ LANGUAGE plpgsql;
```

##### Output Code

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE insert_with_dynamic ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'insert into orders (order_date) values (GETDATE())';
!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
EXECUTE IMMEDIATE sql_statement;
END;
$$;
```

#### Error In Query Parsing

##### Input Code

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE bad_statement(table_name VARCHAR)
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'bad statement goes here';
EXECUTE sql_statement;
END;
$$ LANGUAGE plpgsql;
```

##### Output Code

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE bad_statement (table_name VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
DECLARE
sql_statement VARCHAR;
BEGIN
sql_statement := 'bad statement goes here';
!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-0027 - THE FOLLOWING STATEMENT USES A VARIABLE/LITERAL WITH AN INVALID QUERY AND IT WILL NOT BE EXECUTED ***/!!!
EXECUTE IMMEDIATE sql_statement;
END;
$$;
```

#### INTO Clause

##### Input Code

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE get_max_id(table_name VARCHAR, OUT max_id INTEGER)
AS $$
DECLARE
    sql_statement VARCHAR;
BEGIN
    sql_statement := 'SELECT MAX(id) FROM ' || table_name || ';';
    EXECUTE sql_statement INTO max_id;
END;
$$ LANGUAGE plpgsql;
```

##### Output Code

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE get_max_id (table_name VARCHAR, max_id OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS $$
        DECLARE
            sql_statement VARCHAR;
BEGIN
    sql_statement := 'SELECT
   MAX(id) FROM
   ' || table_name;
            !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
            EXECUTE IMMEDIATE sql_statement
                                            !!!RESOLVE EWI!!! /*** SSC-EWI-PG0007 - INTO CLAUSE IN DYNAMIC SQL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!! INTO max_id;
END;
$$;
```

### Known Issues

#### 1. Execution results cannot be stored in variables.

SnowScripting does not support INTO nor BULK COLLECT INTO clauses. For this reason, results will need to be passed through other means.

##### 2. Dynamic SQL Execution queries may be marked incorrectly as non-runnable.

In some scenarios there an execute statement may be commented regardless of being safe or non-safe to run so please take this into account:

### Related EWIs

1. [SSC-EWI-0027](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Variable with invalid query.
2. [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.

## INSERT

### Description

> Inserts new rows into a table. ([Redshift SQL Language Reference Insert Statement](https://docs.aws.amazon.com/redshift/latest/dg/r_INSERT_30.html#r_INSERT_30-synopsis)).

> **Warning:**
>
> This syntax is partially supported in Snowflake.

### Grammar Syntax

```sql
 INSERT INTO table_name [ ( column [, ...] ) ]
{DEFAULT VALUES |
VALUES ( { expression | DEFAULT } [, ...] )
[, ( { expression | DEFAULT } [, ...] )
[, ...] ] |
query }
```

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE employees (
    id INTEGER IDENTITY(1,1),
    name VARCHAR(100),
    salary INT DEFAULT 20000,
    department VARCHAR(50) DEFAULT 'Marketing'
);

CREATE TABLE new_employees (
    name VARCHAR(100),
    salary INT,
    department VARCHAR(50)
);

INSERT INTO new_employees (name, salary, department)
VALUES
    ('Grace Lee', 32000, 'Operations'),
    ('Hannah Gray', 26000, 'Finance');
```

#### Default Values

It inserts a complete row with its default values. If any columns do not have default values, NULL values are inserted in those columns.

This clause cannot specify individual columns; it always inserts a complete row with its default values. Additionally, columns with the NOT NULL constraint cannot be included in the table definition. To replicate this behavior in Snowflake, SnowConvert AI insert a column with a DEFAULT value in the table. This action inserts a complete row, using the default value for every column.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE employees (
    id INTEGER IDENTITY(1,1),
    name VARCHAR(100),
    salary INT DEFAULT 20000,
    department VARCHAR(50) DEFAULT 'Marketing'
);

INSERT INTO employees
DEFAULT VALUES;

SELECT * FROM employees ORDER BY id;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | NULL | 20000 | Marketing |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE employees (
    id INTEGER IDENTITY(1,1) ORDER,
    name VARCHAR(100),
    salary INT DEFAULT 20000,
    department VARCHAR(50) DEFAULT 'Marketing'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}';

INSERT INTO employees (id)
VALUES (DEFAULT);

SELECT * FROM
    employees
ORDER BY id;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | NULL | 20000 | Marketing |

#### Query

Insert one or more rows into the table by using a query. All rows produced by the query will be inserted into the table. The query must return a column list that is compatible with the table’s columns, although the column names do not need to match. This functionality is fully equivalent in Snowflake.

##### Input Code:

##### Redshift

```sql
 INSERT INTO employees (name, salary, department)
SELECT name, salary, department FROM new_employees;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Grace Lee | 32000 | Operations |
| 2 | Hannah Gray | 26000 | Finance |

##### Output Code:

##### Snowflake

```sql
 INSERT INTO employees (name, salary, department)
SELECT name, salary, department FROM
    new_employees;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Grace Lee | 32000 | Operations |
| 2 | Hannah Gray | 26000 | Finance |

### Known Issues

* Certain expressions cannot be used in the VALUES clause in Snowflake. For example, in Redshift, the [JSON_PARSE](https://docs.aws.amazon.com/redshift/latest/dg/JSON_PARSE.html) function can be used within the VALUES clause to insert a JSON value into a SUPER data type. In Snowflake, however, the [PARSE_JSON](https://docs.snowflake.com/en/sql-reference/functions/parse_json) function cannot be used in the VALUES clause to insert a JSON value into a VARIANT data type. Instead, a query can be used in place of the VALUES clause. For more details, please refer to the [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/sql/insert#usage-notes). You can also check the [following article](https://community.snowflake.com/s/article/Cannot-use-DATE-FROM-PARTS-function-inside-the-VALUES-clause) for further information.

### Related EWIs

There are no known issues.

## MERGE

### Grammar Syntax

```sql
 MERGE INTO target_table
USING source_table [ [ AS ] alias ]
ON match_condition
[ WHEN MATCHED THEN { UPDATE SET col_name = { expr } [,...] | DELETE }
WHEN NOT MATCHED THEN INSERT [ ( col_name [,...] ) ] VALUES ( { expr } [, ...] ) |
REMOVE DUPLICATES ]
```

For more information please refer to Redshift [MERGE documentation](https://docs.aws.amazon.com/redshift/latest/dg/r_MERGE.html).

### Sample Source Patterns

#### UPDATE - INSERT

There are no differences between both languages. The code is kept in its original form.

##### Input Code:

##### Redshift

```sql
 MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN UPDATE SET id = source.id, name = source.name
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

##### Output Code:

##### Snowflake

```sql
 --** SSC-FDM-RS0005 - REDSHIFT MERGE STATEMENT REJECTS DUPLICATE SOURCE ROWS. SNOWFLAKE ALLOWS DUPLICATES, WHICH MAY PRODUCE NON-DETERMINISTIC RESULTS. **
MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN UPDATE SET id = source.id, name = source.name
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

#### DELETE - INSERT

There are no differences between both languages. The code is kept in its original form.

##### Input Code:

##### Redshift

```sql
 MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN DELETE
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

##### Output Code:

##### Snowflake

```sql
 --** SSC-FDM-RS0005 - REDSHIFT MERGE STATEMENT REJECTS DUPLICATE SOURCE ROWS. SNOWFLAKE ALLOWS DUPLICATES, WHICH MAY PRODUCE NON-DETERMINISTIC RESULTS. **
MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN DELETE
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

#### REMOVE DUPLICATES

The REMOVE DUPLICATES clause is not supported in Snowflake, however, there is a workaround that could emulate the original behavior.

The output code will have three new statements:

* A TEMPORARY TABLE with the duplicate values from the source and target table that matches the condition
* An INSERT statement that adds the pending values to the target table after the merge
* A DROP statement that drops the generated temporary table.

These are necessary since the DROP DUPLICATES behavior removes the duplicate values from the target table and then inserts the values that match the condition from the source table.

##### Input Code:

##### Redshift

```sql
 CREATE TABLE target (id INT, name CHAR(10));
CREATE TABLE source (id INT, name CHAR(10));

INSERT INTO target VALUES (30, 'Tony'), (30, 'Daisy'), (11, 'Alice'), (23, 'Bill'), (23, 'Nikki');
INSERT INTO source VALUES (23, 'David'), (22, 'Clarence');

MERGE INTO target USING source ON target.id = source.id REMOVE DUPLICATES;
```

##### Results

| ID | NAME |
| --- | --- |
| 30 | Daisy |
| 22 | Clarence |
| 30 | Tony |
| 11 | Alice |
| 23 | David |

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE target (id INT, name CHAR(10))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}';

CREATE TABLE source (id INT, name CHAR(10))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}';

INSERT INTO target
VALUES (30, 'Tony'), (30, 'Daisy'), (11, 'Alice'), (23, 'Bill'), (23, 'Nikki');

INSERT INTO source
VALUES (23, 'David'), (22, 'Clarence');

CREATE TEMPORARY TABLE source_duplicates AS
SELECT DISTINCT
source.*
FROM
source
INNER JOIN
target
ON target.id = source.id;
--** SSC-FDM-RS0005 - REDSHIFT MERGE STATEMENT REJECTS DUPLICATE SOURCE ROWS. SNOWFLAKE ALLOWS DUPLICATES, WHICH MAY PRODUCE NON-DETERMINISTIC RESULTS. **
MERGE INTO target
USING source ON target.id = source.id
WHEN MATCHED THEN
DELETE
WHEN NOT MATCHED THEN
INSERT
VALUES (source.id, source.name);
INSERT INTO target

SELECT
*
FROM
source_duplicates;

DROP TABLE IF EXISTS source_duplicates CASCADE;
```

##### Results

| ID | NAME |
| --- | --- |
| 22 | Clarence |
| 30 | Tony |
| 30 | Daisy |
| 11 | Alice |
| 23 | David |

### Known Issues

There are no known issues.

### Related EWIs

1. [SSC-EWI-RS0009](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md): Semantic information not found for the source table.
2. [SSC-FDM-RS0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md): Redshift MERGE rejects duplicate source rows. Snowflake allows them, which may produce different results.

## UPDATE

### Description

> Updates values in one or more table columns when a condition is satisfied. ([Redshift SQL Language Reference Update Statement](https://docs.aws.amazon.com/redshift/latest/dg/r_UPDATE.html)).

> **Note:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 [ WITH [RECURSIVE] common_table_expression [, common_table_expression , ...] ]
            UPDATE table_name [ [ AS ] alias ] SET column = { expression | DEFAULT } [,...]

[ FROM fromlist ]
[ WHERE condition ]
```

### Sample Source Patterns

#### **Setup data**

##### Redshift

```sql
 CREATE TABLE employees (
    id INTEGER IDENTITY(1,1),
    name VARCHAR(100),
    salary DECIMAL DEFAULT 20000,
    department VARCHAR(50) DEFAULT 'Marketing'
);

INSERT INTO employees (name, salary, department)
VALUES
    ('Alice', 500000, 'HR'),
    ('Bob', 600000, 'Engineering'),
    ('Charlie', 700000, 'Engineering'),
    ('David', 400000, 'Marketing'),
    ('Eve', 450000, 'HR'),
    ('Frank', 750000, 'Engineering'),
    ('Grace', 650000, 'Engineering'),
    ('Helen', 390000, 'Marketing'),
    ('Ivy', 480000, 'HR'),
    ('Jack', 420000, 'Engineering'),
    ('Ken', 700000, 'Marketing'),
    ('Liam', 600000, 'Engineering'),
    ('Mona', 470000, 'HR');

CREATE TABLE department_bonus (
    department VARCHAR(100),
    bonus DECIMAL
);

INSERT INTO department_bonus (department, bonus)
VALUES
    ('HR', 10000),
    ('Engineering', 50000),
    ('Marketing', 20000),
    ('Sales', 5000);
```

#### Alias

Although Snowflake’s grammar does not specify that a table alias can be used, it’s valid code in Snowflake.

##### Input Code:

##### Redshift

```sql
 UPDATE employees AS e
SET salary = salary + 5000
WHERE e.salary < 600000;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 505000 | HR |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 405000 | Marketing |
| 5 | Eve | 455000 | HR |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 395000 | Marketing |
| 9 | Ivy | 485000 | HR |
| 10 | Jack | 425000 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 475000 | HR |

##### Output Code:

##### Snowflake

```sql
 UPDATE employees AS e
SET salary = salary + 5000
WHERE e.salary < 600000;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 505000 | HR |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 405000 | Marketing |
| 5 | Eve | 455000 | HR |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 395000 | Marketing |
| 9 | Ivy | 485000 | HR |
| 10 | Jack | 425000 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 475000 | HR |

#### WITH clause

This clause specifies one or more Common Table Expressions (CTE). The output column names are optional for non-recursive CTEs, but mandatory for recursive ones.

Since this clause cannot be used in an UPDATE statement, it is transformed into temporary tables with their corresponding queries. After the UPDATE statement is executed, these temporary tables are dropped to clean up, release resources, and avoid name collisions when creating tables within the same session. Additionally, if a regular table with the same name exists, it will take precedence again, since the temporary table [has priority](https://docs.snowflake.com/en/user-guide/tables-temp-transient#potential-naming-conflicts-with-other-table-types) over any other table with the same name in the same session.

##### Non-Recursive CTE

##### Input Code:

##### Redshift

```sql
 WITH avg_salary_cte AS (
    SELECT AVG(salary) AS avg_salary FROM employees
)
UPDATE employees
SET salary = (SELECT avg_salary FROM avg_salary_cte)
WHERE salary < 500000;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 500000 | HR |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 546923 | Marketing |
| 5 | Eve | 546923 | HR |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 546923 | Marketing |
| 9 | Ivy | 546923 | HR |
| 10 | Jack | 546923 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 546923 | HR |

##### Output Code:

##### Snowflake

```sql
 CREATE TEMPORARY TABLE avg_salary_cte AS
SELECT AVG(salary) AS avg_salary FROM
employees;

UPDATE employees
SET salary = (SELECT avg_salary FROM
      avg_salary_cte
)
WHERE salary < 500000;

DROP TABLE avg_salary_cte;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 500000 | HR |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 546923 | Marketing |
| 5 | Eve | 546923 | HR |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 546923 | Marketing |
| 9 | Ivy | 546923 | HR |
| 10 | Jack | 546923 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 546923 | HR |

##### Recursive CTE

##### Input Code:

##### Redshift

```sql
 WITH RECURSIVE bonus_updates(id, name, department, salary, level) AS (
    SELECT e.id,
           e.name,
           e.department,
           e.salary + CASE
                          WHEN db.bonus IS NOT NULL THEN db.bonus
                          ELSE 0
               END AS new_salary,
           1 AS level
    FROM employees e
    LEFT JOIN department_bonus db ON e.department = db.department
    UNION ALL
    SELECT e.id,
           e.name,
           e.department,
           e.salary + CASE
                          WHEN db.bonus IS NOT NULL THEN db.bonus
                          ELSE 0
               END + (e.salary * 0.05) AS new_salary,
           bu.level + 1
    FROM employees e
    JOIN department_bonus db ON e.department = db.department
    JOIN bonus_updates bu ON e.id = bu.id
    WHERE bu.level < 3
)
UPDATE employees
SET salary = bu.new_salary
FROM (SELECT id, AVG(salary) as new_salary FROM bonus_updates GROUP BY id) as bu
WHERE employees.id = bu.id
  AND bu.new_salary > employees.salary;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 526666 | HR |
| 2 | Bob | 670000 | Engineering |
| 3 | Charlie | 773333 | Engineering |
| 4 | David | 433333 | Marketing |
| 5 | Eve | 475000 | HR |
| 6 | Frank | 825000 | Engineering |
| 7 | Grace | 721666 | Engineering |
| 8 | Helen | 423000 | Marketing |
| 9 | Ivy | 506000 | HR |
| 10 | Jack | 484000 | Engineering |
| 11 | Ken | 743333 | Marketing |
| 12 | Liam | 670000 | Engineering |
| 13 | Mona | 495668 | HR |

##### Output Code:

##### Snowflake

```sql
 CREATE TEMPORARY TABLE bonus_updates AS
  --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "employees", "department_bonus" **
 WITH RECURSIVE bonus_updates(id, name, department, salary, level) AS (
     SELECT e.id,
            e.name,
            e.department,
            e.salary + CASE
                           WHEN db.bonus IS NOT NULL THEN db.bonus
                           ELSE 0
                END AS new_salary,
            1 AS level
     FROM
            employees e
     LEFT JOIN
                           department_bonus db ON e.department = db.department
     UNION ALL
     SELECT e.id,
            e.name,
            e.department,
            e.salary + CASE
                           WHEN db.bonus IS NOT NULL THEN db.bonus
                           ELSE 0
                END + (e.salary * 0.05) AS new_salary,
            bu.level + 1
     FROM
            employees e
     JOIN
                           department_bonus db ON e.department = db.department
     JOIN
                           bonus_updates bu ON e.id = bu.id
     WHERE bu.level < 3
 )
 SELECT
     id,
     name,
     department,
     salary,
     level
 FROM
     bonus_updates;

UPDATE employees
SET salary = bu.new_salary
FROM (SELECT id, AVG(salary) as new_salary
FROM bonus_updates
GROUP BY id) as bu
WHERE employees.id = bu.id
  AND bu.new_salary > employees.salary;

DROP TABLE bonus_updates;
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 526667 | HR |
| 2 | Bob | 670000 | Engineering |
| 3 | Charlie | 773333 | Engineering |
| 4 | David | 433333 | Marketing |
| 5 | Eve | 475000 | HR |
| 6 | Frank | 825000 | Engineering |
| 7 | Grace | 721667 | Engineering |
| 8 | Helen | 423000 | Marketing |
| 9 | Ivy | 506000 | HR |
| 10 | Jack | 484000 | Engineering |
| 11 | Ken | 743333 | Marketing |
| 12 | Liam | 670000 | Engineering |
| 13 | Mona | 495667 | HR |

#### SET DEFAULT values

##### Input Code:

##### Redshift

```sql
 UPDATE employees
SET salary = DEFAULT, department = 'Sales'
WHERE department = 'HR';
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 20000 | Sales |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 400000 | Marketing |
| 5 | Eve | 20000 | Sales |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 390000 | Marketing |
| 9 | Ivy | 20000 | Sales |
| 10 | Jack | 420000 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 20000 | Sales |

##### Output Code:

##### Snowflake

```sql
 UPDATE employees
SET salary = DEFAULT, department = 'Sales'
WHERE
    department = 'HR';
```

##### Result

| ID | NAME | SALARY | DEPARTMENT |
| --- | --- | --- | --- |
| 1 | Alice | 20000 | Sales |
| 2 | Bob | 600000 | Engineering |
| 3 | Charlie | 700000 | Engineering |
| 4 | David | 400000 | Marketing |
| 5 | Eve | 20000 | Sales |
| 6 | Frank | 750000 | Engineering |
| 7 | Grace | 650000 | Engineering |
| 8 | Helen | 390000 | Marketing |
| 9 | Ivy | 20000 | Sales |
| 10 | Jack | 420000 | Engineering |
| 11 | Ken | 700000 | Marketing |
| 12 | Liam | 600000 | Engineering |
| 13 | Mona | 20000 | Sales |

#### SET clause

It is responsible for modifying values in the columns. Similar to Snowflake, update queries with multiple matches per row will throw an error when the configuration parameter [ERROR_ON_NONDETERMINISTIC_UPDATE](https://docs.aws.amazon.com/redshift/latest/dg/r_error_on_nondeterministic_update.html) is set to true. This flag works the same way in Snowflake, and it even uses the same name, [ERROR_ON_NONDETERMINISTIC_UPDATE](https://docs.snowflake.com/en/sql-reference/parameters#label-error-on-nondeterministic-update).

However, when this flag is turned off, no error is returned, and one of the matched rows is used to update the target row. The selected joined row is nondeterministic and arbitrary in both languages; the behavior may not be consistent across executions, which could lead to data inconsistencies.

##### Setup data:

##### Redshift

```sql
 CREATE TABLE target (
  k INT,
  v INT
);

CREATE TABLE src (
  k INT,
  v INT
);

INSERT INTO target (k, v) VALUES (0, 10);

INSERT INTO src (k, v) VALUES
  (0, 14),
  (0, 15),
  (0, 16);
```

##### Input Code:

##### Redshift

```sql
 UPDATE target
  SET v = src.v
  FROM src
  WHERE target.k = src.k;

SELECT * FROM target;
```

##### Result

| K | V |
| --- | --- |
| 0 | 16 |

##### Output Code:

##### Snowflake

```sql
 UPDATE target
  SET v = src.v
  FROM src
  WHERE target.k = src.k;

SELECT * FROM target;
```

##### Result

| K | V |
| --- | --- |
| 0 | 14 |

### Known Issues

* Update queries with multiple matches per row may cause data inconsistencies. Although both platforms have the flag [ERROR_ON_NONDETERMINISTIC_UPDATE](https://docs.aws.amazon.com/redshift/latest/dg/r_error_on_nondeterministic_update.html), these values will always be nondeterministic. Snowflake offers recommendations for handling these scenarios. See the [Snowflake UPDATE examples](https://docs.snowflake.com/en/sql-reference/sql/update#examples) for more details.
* Replicating the functionality of the `WITH` clause requires creating temporary tables mirroring each Common Table Expression (CTE). However, this approach fails if a temporary table with the same name already exists within the current session, causing an error.

### Related EWIs

There are no known issues.

---
title: SnowConvert AI - Redshift - System catalog tables
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/redshift-system-catalog.md
section: Migrations
---

# SnowConvert AI - Redshift - System catalog tables

> **Note:**
>
> This is a work in progress.

## Description

> The system catalogs store schema metadata, such as information about tables and columns. System catalog tables have a PG prefix.
>
> The standard PostgreSQL catalog tables are accessible to Amazon Redshift users. ([Redshift SQL Language reference System catalog tables](https://docs.aws.amazon.com/redshift/latest/dg/c_intro_catalog_views.html)).

The following table outlines how SnowConvert AI transforms references to SQL functions defined in the `pg_catalog` in Redshift.

## Mapping of SQL functions from the `pg_catalog`

| Redshift | Snowflake |
| --- | --- |
| pg_catalog.row_number() | [row_number()](https://docs.snowflake.com/en/sql-reference/functions/row_number) |
| pg_catalog.replace() | [replace()](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| pg_catalog.lead() | [lead()](https://docs.snowflake.com/en/sql-reference/functions/lead) |

---
title: SnowConvert AI - Redshift Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/redshiftFDM.md
section: Migrations
---

# SnowConvert AI - Redshift Functional Differences

## SSC-FDM-RS0001

Data storage option is not supported in Snowflake. Data distribution is automatically handled by Snowflake.

### Description

In Snowflake, it is not necessary to explicitly define `SORTKEY` and `DISTSTYLE` when migrating from Redshift because Snowflake’s architecture inherently manages data distribution and optimization. Snowflake automatically handles data partitioning and indexing, optimizing query performance without requiring manual configuration of these parameters.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
DISTSTYLE AUTO;

CREATE TABLE table2 (
    col1 INTEGER
)
SORTKEY AUTO;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - DISTSTYLE AUTO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--DISTSTYLE AUTO
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';

CREATE TABLE table2 (
    col1 INTEGER
)
----** SSC-FDM-RS0001 - SORTKEY AUTO OPTION IS NOT SUPPORTED IN SNOWFLAKE. DATA STORAGE IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--SORTKEY AUTO
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* It is advisable to assess the use of `CLUSTER BY` in Snowflake during migration from Redshift, as it may improve query performance by optimizing data locality for frequently queried columns.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0002

The performance of CLUSTER BY in Snowflake may vary compared to the performance of SORTKEY in Redshift.

### Description

The `SORTKEY` (excluding `SORTKEY AUTO`) in Amazon Redshift are analogous to `CLUSTER BY` in Snowflake. However, performance implications may vary due to architectural differences between Redshift and Snowflake.

* **`SORTKEY`** improves performance by maintaining data in a sorted order based on specified columns. This is particularly beneficial for range queries and ordering operations.
* **`CLUSTER BY`** in Snowflake organizes data into blocks based on designated columns, aiding in filtering and aggregation tasks. However, it is less stringent about ordering compared to `SORTKEY`.

Understanding these mechanisms is crucial for optimizing performance in each respective platform.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
SORTKEY (col1);

CREATE TABLE table2 (
    col1 INTEGER SORTKEY
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    col1 INTEGER
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF CLUSTER BY IN SNOWFLAKE MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY IN REDSHIFT **
CLUSTER BY (col1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
;

CREATE TABLE table2 (
    col1 INTEGER
)
--** SSC-FDM-RS0002 - THE PERFORMANCE OF CLUSTER BY IN SNOWFLAKE MAY VARY COMPARED TO THE PERFORMANCE OF SORTKEY IN REDSHIFT **
CLUSTER BY (col1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* **Benchmark after migration:** Run representative queries on both platforms to compare performance, as `CLUSTER BY` uses micro-partitioning rather than physical sort order.
* **Consider automatic clustering:** For large tables with frequent queries on specific columns, enable [automatic clustering](../../../../../../user-guide/tables-auto-reclustering.md) in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0003

Pending SnowConvert AI translation for Redshift foreign key constraints.

### Description

Pending SnowConvert AI translation for Redshift foreign key constraints. Snowflake supports [foreign key constraints](../../../../../../sql-reference/constraints-overview.md), but they are not enforced and serve only as referential integrity metadata. This is a SnowConvert AI limitation, not a Snowflake platform limitation.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE TABLE TABLE1 (
    id INTEGER,
    PRIMARY KEY (id)
);

CREATE TABLE TABLE2 (
	id INTEGER,
	id_table1 INTEGER,
	FOREIGN KEY (id_table1) REFERENCES TABLE1 (col1)
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE TABLE1 (
    id INTEGER,
    PRIMARY KEY (id)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/26/2024" }}';

CREATE TABLE TABLE2 (
	id INTEGER,
	id_table1 INTEGER
--	                 ,
--    --** SSC-FDM-RS0003 - PENDING SNOWCONVERT AI TRANSLATION FOR REDSHIFT FOREIGN KEY CONSTRAINTS. **
--	FOREIGN KEY (id_table1) REFERENCES TABLE1 (col1)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/26/2024" }}';
```

#### Best Practices

* You can manually [alter tables](https://docs.snowflake.com/en/sql-reference/sql/alter-table) with Foreign Keys and add them.

```sql
 ALTER TABLE TABLE2 ADD CONSTRAINT
FOREIGN KEY (id_table1) REFERENCES TABLE1 (col1)
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0004

It is possible that the date is wrong and Snowflake does not accept wrong dates

### Description

In Snowflake, using `TO_DATE` with an invalid date string (like ‘20010631’) results in an error because it enforces strict validation, rejecting any non-existent dates. In contrast, Redshift’s `TO_DATE` can adjust such invalid dates to the nearest valid date (e.g., rolling June 31 to July 1) if the `is_strict` parameter is set to false. This difference highlights how Snowflake prioritizes data integrity by not automatically correcting invalid dates, while Redshift allows for more flexibility in date handling.

#### Code Example

##### Input Code:

##### Redshift

```sql
 SELECT TO_DATE('20010631', 'YYYYMMDD', FALSE);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
TRY_TO_DATE(/*** SSC-FDM-RS0004 - INVALID DATES WILL CAUSE ERRORS IN SNOWFLAKE ***/ '20010631', 'YYYYMMDD');
```

#### Best Practices

* Check that the date is valid in the TRY_TO_DATE().
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0005

Redshift MERGE rejects duplicate source rows. Snowflake allows them, which may produce different results.

### Description

In Redshift, the `MERGE` statement throws an error when the source table contains duplicate rows matching the join condition. Snowflake allows `MERGE` to execute with duplicate source rows, which may produce non-deterministic results when multiple source rows match the same target row.

#### Code Example

##### Input Code:

##### Redshift

```sql
 MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN DELETE
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-RS0005 - REDSHIFT MERGE STATEMENT REJECTS DUPLICATE SOURCE ROWS. SNOWFLAKE ALLOWS DUPLICATES, WHICH MAY PRODUCE NON-DETERMINISTIC RESULTS. **
MERGE INTO target USING source ON target.id = source.id
WHEN MATCHED THEN DELETE
WHEN NOT MATCHED THEN INSERT VALUES (source.id, source.name);
```

#### Best Practices

* **Deduplicate source data:** Add a `QUALIFY ROW_NUMBER() OVER (PARTITION BY join_key ORDER BY ...) = 1` to the source subquery to ensure each target row matches at most one source row.
* **Validate results:** After migration, compare `MERGE` output row counts between Redshift and Snowflake to detect non-deterministic behavior.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0006

Called procedure contains usages of COMMIT/ROLLBACK. Modifying the current transaction in child scopes is not supported in Snowflake.

### Description

In Redshift, it is allowed to use the statements COMMIT and ROLLBACK inside a procedure to make permanent or discard the changes on a transaction that was opened on an outer scope.

Snowflake works with the concept of [scoped transactions](https://docs.snowflake.com/en/sql-reference/transactions#scoped-transactions), which treats each procedure call as a separate transaction, this limits the effects of the COMMIT and ROLLBACK statements to the scope of the procedure they are declared in.

The aforementioned functional difference will be warned with this FDM when calls to a procedure with COMMIT or ROLLBACK are detected by SnowConvert.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE inner_transaction_procedure(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    ROLLBACK;
    INSERT INTO transaction_values_test values (a + 1);
END
$$;

CREATE OR REPLACE PROCEDURE outer_transaction_procedure(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    -- This insert is also affected by the ROLLBACK in inner_transaction_procedure
    INSERT INTO transaction_values_test values (a);
    CALL inner_transaction_procedure(a + 3);
    COMMIT;
END
$$;

CALL outer_transaction_procedure(10);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE inner_transaction_procedure (a int)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a);
    ROLLBACK;
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a + 1);
    COMMIT;
END
$$;

CREATE OR REPLACE PROCEDURE outer_transaction_procedure (a int)
RETURNS VARCHAR
    LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    -- This insert is also affected by the ROLLBACK in inner_transaction_procedure
    INSERT INTO transaction_values_test
    values (:a);
    --** SSC-FDM-RS0006 - CALLED PROCEDURE CONTAINS USAGES OF COMMIT/ROLLBACK. MODIFYING THE CURRENT TRANSACTION IN CHILD SCOPES IS NOT SUPPORTED IN SNOWFLAKE **
    CALL inner_transaction_procedure(:a + 3);
    COMMIT;
END
$$;

CALL outer_transaction_procedure(10);
```

#### Best Practices

* **Refactor transaction control:** Move `COMMIT` and `ROLLBACK` statements into the outermost procedure or use [scoped transactions](../../../../../../sql-reference/transactions.md) where supported.
* **Use caller’s rights:** Ensure the calling procedure manages the transaction boundary, as Snowflake’s scoped transactions isolate child procedure changes.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0007

DDL statements perform an automatic COMMIT in Snowflake. ROLLBACK will not undo DDL-committed changes.

### Description

In Snowflake, [DDL statements perform an automatic commit](https://docs.snowflake.com/en/sql-reference/transactions#ddl) after their execution, making permanent all the changes in the current transaction, meaning they can not be discarded by a ROLLBACK.

When a ROLLBACK statement is found in a procedure that also contains a DDL statement, SnowConvert AI will generate this FDM to inform about the DDL autocommit behavior.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE rollback_ddl(a int)
    LANGUAGE plpgsql
    AS $$
BEGIN
    INSERT INTO transaction_values_test values (a);
    CREATE TABLE someRollbackTable
    (
        col1 INTEGER
    );

    INSERT INTO someRollbackTable values (a);
    ROLLBACK;
END
$$;

CALL rollback_ddl(10);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE rollback_ddl (a int)
RETURNS VARCHAR
    LANGUAGE SQL
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
    AS $$
BEGIN
    BEGIN TRANSACTION;
    INSERT INTO transaction_values_test
    values (:a);
    CREATE TABLE someRollbackTable
    (
        col1 INTEGER
    );
    BEGIN TRANSACTION;
    INSERT INTO someRollbackTable
    values (:a);
    --** SSC-FDM-RS0007 - DDL STATEMENTS PERFORM AN AUTOMATIC COMMIT IN SNOWFLAKE. ROLLBACK WILL NOT UNDO DDL-COMMITTED CHANGES **
    ROLLBACK;
END
$$;

CALL rollback_ddl(10);
```

#### Best Practices

* **Separate DDL from DML transactions:** Move DDL statements outside the transaction block, or execute them before `BEGIN TRANSACTION` to avoid implicit commits affecting DML operations.
* **Use conditional logic:** If DDL creation is conditional, check for object existence with `IF NOT EXISTS` to avoid unnecessary autocommits.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-RS0008

Snowflake uses autocommit by default. The NONATOMIC option is not supported in Snowflake.

### Description

In Redshift, the `NONATOMIC` option on `CREATE PROCEDURE` allows individual statements within the procedure to commit independently. In Snowflake, [autocommit](../../../../../../sql-reference/transactions.md) is the default behavior — each statement is automatically committed unless wrapped in an explicit `BEGIN TRANSACTION` block. The `NONATOMIC` keyword is removed during migration because Snowflake’s autocommit provides equivalent semantics.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE SP_NONATOMIC()
NONATOMIC
AS
$$
    BEGIN
        NULL;
    END;
$$
LANGUAGE plpgsql;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE SP_NONATOMIC ()
RETURNS VARCHAR
----** SSC-FDM-RS0008 - SNOWFLAKE USES AUTOCOMMIT BY DEFAULT. THE NONATOMIC OPTION IS NOT SUPPORTED IN SNOWFLAKE. **
--NONATOMIC
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "02/10/2025",  "domain": "test" }}'
AS
$$
    BEGIN
        NULL;
    END;
$$;
```

#### Best Practices

* **Verify transaction behavior:** If the original Redshift procedure relied on `NONATOMIC` for partial commits, test the migrated Snowflake procedure to confirm that autocommit provides the expected semantics.
* **Add explicit transactions where needed:** If you need atomic (all-or-nothing) behavior for a group of statements in Snowflake, wrap them in `BEGIN TRANSACTION` … `COMMIT`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Redshift Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/redshiftEWI.md
section: Migrations
---

# SnowConvert AI - Redshift Issues

## SSC-EWI-RS0002

Set “configuration parameter” is not supported in Snowflake.

### Severity

Medium

#### Description

The [`SET configuration parameter`](https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_PROCEDURE.html) clause in Redshift procedures is not supported in Snowflake. Snowflake uses [ALTER SESSION SET](../../../../../../sql-reference/sql/alter-session.md) or session-level parameters instead. For more information, refer to [CREATE PROCEDURE documentation](../../../../../../sql-reference/sql/create-procedure.md).

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE procedure2(
    IN input_param INTEGER,
    OUT output_param NUMERIC
)
AS $$
BEGIN
    output_param := input_param * 1.7;
END;
$$
LANGUAGE plpgsql
SET enable_numeric_rounding to ON;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE procedure2 (input_param INTEGER, output_param OUT NUMERIC)
RETURNS VARCHAR
LANGUAGE SQL
!!!RESOLVE EWI!!! /*** SSC-EWI-RS0002 - SET CONFIGURATION PARAMETER 'enable_numeric_rounding' IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
SET enable_numeric_rounding to ON
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
    output_param := input_param * 1.7;
END;
$$;
```

#### Best Practices

* **Use ALTER SESSION SET:** Snowflake provides [ALTER SESSION SET](../../../../../../sql-reference/sql/alter-session.md) to configure session-level parameters. Review whether the Redshift configuration parameter has an equivalent Snowflake session parameter.
* **Remove if unnecessary:** Some Redshift configuration parameters (e.g., `enable_numeric_rounding`) have no Snowflake equivalent and may be safely removed if Snowflake’s default behavior meets your requirements.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0003

View with no schema binding can not be removed due to missing references.

> **Note:**
>
> This issue is deprecated and no longer generated by SnowConvert AI since [version 2.2.6](../../../release-notes/release-notes/README.md)

### Severity

Medium

#### Description

Redshift documentation for `CREATE VIEW` includes an optional clause that specifies that the particular view is not bound to the database objects such as tables or functions, nor to those objects that it is referencing. The documentation also clarifies that in such cases that this clause is used, the referenced objects must be qualified with a schema name. This clause allows to create a view and reference objects that might not exist yet. Their existence will be verified once the view is queried, but not at its definition.

However, there is no equivalent command nor obvious workaround to implement this functionality in Snowflake, furthermore, the Snowflake documentation suggests that the views are linked to a specific schema and so are the referenced objects in the view.

If the references linked to the View are present in the input code, the statement will be removed without issue. However, if the necessary references are missing, a warning message will be added to inform the user that the statement cannot be removed due to the missing references.

SnowConvert AI performs analysis solely on the input code and does not account for objects already deployed in Snowflake. Therefore the output may have some issues pointing to missing references, if the references are already present in the Snowflake database, the user can safely remove the statement without any issues.

#### Code Examples

##### Input Code:

##### Redshift

```sql
 CREATE VIEW myView AS SELECT col1 FROM public.missingTable
WITH NO SCHEMA BINDING;
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "public.missingTable" **
CREATE VIEW myView
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}'
AS SELECT col1 FROM
public.missingTable
!!!RESOLVE EWI!!! /*** SSC-EWI-RS0003 - WITH NO SCHEMA BINDING STATEMENT CAN NOT BE REMOVED DUE TO MISSING REFERENCES. ***/!!!
WITH NO SCHEMA BINDING;
```

#### Best Practices

* To resolve this issue, it is suggested to add the missing references to the input code, if the object is already deployed in the Snowflake database, the statement can be remove without issue.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0004

HLLSKETCH data type not supported in Snowflake.

### Severity

High

#### Description

This conversion issue is added because the HLLSKETCH data type is not supported in Snowflake.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE table1
(
    col_hllsketch HLLSKETCH
);
```

##### Generated Code:

```sql
 CREATE TABLE table1
(
    col_hllsketch HLLSKETCH !!!RESOLVE EWI!!! /*** SSC-EWI-RS0004 - HLLSKETCH DATA TYPE NOT SUPPORTED IN SNOWFLAKE. ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "09/17/2024" }}';
```

#### Best Practices

* Please verify all [aggregate functions](https://docs.snowflake.com/en/user-guide/querying-approximate-cardinality#sql-functions) provided by Snowflake to estimate cardinality using [HyperLogLog](https://docs.snowflake.com/en/user-guide/querying-approximate-cardinality#overview).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0005

Pending SnowConvert AI translation for column aliases in the PIVOT/UNPIVOT IN clause.

### Severity

High

#### Description

Pending SnowConvert AI translation for column aliases in the `PIVOT/UNPIVOT` `IN` clause. Snowflake now supports the `AS` clause for specifying column aliases in [`PIVOT`](../../../../../../sql-reference/constructs/pivot.md) and [`UNPIVOT`](../../../../../../sql-reference/constructs/unpivot.md) operations (added October 2025). This is a SnowConvert AI limitation, not a Snowflake platform limitation.

#### Code Example

##### Input Code:

##### Redshift

```sql
 SELECT *
FROM count_by_color UNPIVOT (
    cnt FOR color IN (red AS r, green AS g, blue AS b)
);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT *
FROM
    count_by_color UNPIVOT (
    cnt FOR color IN (red
                          !!!RESOLVE EWI!!! /*** SSC-EWI-RS0005 - PENDING SNOWCONVERT AI TRANSLATION FOR COLUMN ALIASES IN THE PIVOT/UNPIVOT IN CLAUSE. ***/!!! AS r, green
                                                                                                                                                                              !!!RESOLVE EWI!!! /*** SSC-EWI-RS0005 - PENDING SNOWCONVERT AI TRANSLATION FOR COLUMN ALIASES IN THE PIVOT/UNPIVOT IN CLAUSE. ***/!!! AS g, blue
                                                                                                                                                                                                                                                                                                                                 !!!RESOLVE EWI!!! /*** SSC-EWI-RS0005 - PENDING SNOWCONVERT AI TRANSLATION FOR COLUMN ALIASES IN THE PIVOT/UNPIVOT IN CLAUSE. ***/!!! AS b)
);
```

#### Best Practices

* **Use native Snowflake support:** Snowflake now supports the `AS` clause for column aliases in `PIVOT/UNPIVOT IN` clauses. Remove the EWI marker and use the aliases directly:

```sql
 SELECT *
FROM count_by_color UNPIVOT (
    cnt FOR color IN (red AS r, green AS g, blue AS b)
);
```

* **Further reading:** [Snowflake UNPIVOT](../../../../../../sql-reference/constructs/unpivot.md), [Snowflake PIVOT](../../../../../../sql-reference/constructs/pivot.md)
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0006

The behavior of the SUBSTRING function on binary data differs between Redshift and Snowflake.

### Severity

Medium

#### Description

The behavior of the `SUBSTRING` function on binary data differs between Redshift and Snowflake. In Redshift, `SUBSTRING` on `VARBYTE` operates on raw bytes. In Snowflake, `SUBSTRING` on `BINARY` operates on hex-encoded character pairs, so the same positional arguments may return different results.

#### Code Example

##### Input Code:

##### Redshift

```sql
 SELECT SUBSTRING('12345'::varbyte, 2, 4) AS substring_binary;
SELECT SUBSTRING('abc'::varbyte, 2, 4) AS substring_binary;
```

##### Generated Code:

##### Snowflake

```sql
 SELECT SUBSTRING('12345':: BINARY, 2, 4) !!!RESOLVE EWI!!! /*** SSC-EWI-RS0006 - THE BEHAVIOR OF THE SUBSTRING FUNCTION ON BINARY DATA DIFFERS BETWEEN REDSHIFT AND SNOWFLAKE. ***/!!! AS substring_binary;
SELECT SUBSTRING('abc':: BINARY, 2, 4) !!!RESOLVE EWI!!! /*** SSC-EWI-RS0006 - THE BEHAVIOR OF THE SUBSTRING FUNCTION ON BINARY DATA DIFFERS BETWEEN REDSHIFT AND SNOWFLAKE. ***/!!! AS substring_binary;
```

#### Best Practices

* **Verify binary output:** Compare `SUBSTRING` results on binary columns between Redshift and Snowflake to confirm correctness after migration.
* **Adjust offsets:** Because Snowflake’s `BINARY` type uses hex encoding, you may need to multiply position and length arguments by 2 to achieve equivalent byte-level extraction.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0007

Date literal is not supported in Snowflake.

### Severity

High

#### Description

Some DATE, TIME, or TIMESTAMP literal formats used in Redshift (e.g., `'2000-Jan-31'`, `'Jan-31-2000'`) are not recognized by Snowflake. These literals must be rewritten to a [supported Snowflake date format](../../../../../../sql-reference/data-types-datetime.md) or converted using `TO_DATE` with an explicit format string.

#### Code Example

##### Input Code:

##### Redshift

```sql
 select datediff(century, '2000-Jan-31', 'Jan-31-2000');
```

##### Generated Code:

##### Snowflake

```sql
  select
 DATEDIFF(YEAR,
                !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - '2000-Jan-31' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
                '2000-Jan-31',
                               !!!RESOLVE EWI!!! /*** SSC-EWI-RS0007 - 'Jan-31-2000' DATE LITERAL IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
                               'Jan-31-2000') / 100;
```

#### Best Practices

* **Use ISO 8601 format:** Rewrite date literals to `'YYYY-MM-DD'` format, which is universally supported in Snowflake.
* **Use TO_DATE with format string:** If the original format must be preserved, use `TO_DATE('Jan-31-2000', 'MON-DD-YYYY')` to explicitly parse the date.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0008

Delete statement cannot be used on dynamic tables in Snowflake.

### Severity

High

#### Description

In Redshift, you can apply the DELETE statement to materialized views used for [streaming ingestion](https://docs.aws.amazon.com/redshift/latest/dg/materialized-view-streaming-ingestion.html). In Snowflake, materialized views are transformed into [dynamic tables](../../../../../../user-guide/dynamic-tables-about.md), and the DELETE statement cannot be used on dynamic tables.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE MATERIALIZED VIEW mv AS
SELECT id, name, department_id FROM employees WHERE department_id = 101;

DELETE FROM mv
WHERE id = 2;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE DYNAMIC TABLE mv
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
SELECT id, name, department_id FROM
employees
WHERE department_id = 101;

!!!RESOLVE EWI!!! /*** SSC-EWI-RS0008 - MATERIALIZED VIEW IS TRANSFORMED INTO A DYNAMIC TABLE, AND THE DELETE STATEMENT CANNOT BE USED ON DYNAMIC TABLES IN SNOWFLAKE. ***/!!!
DELETE FROM
mv
WHERE id = 2;
```

#### Best Practices

* **Replace the dynamic table definition:** Because dynamic tables cannot be directly deleted from, you can achieve the same result by altering the dynamic table’s underlying query to exclude the rows you want to remove.
* **Use a regular table:** If row-level DML (INSERT, UPDATE, DELETE) is required, consider using a regular table with a scheduled task instead of a dynamic table.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0009

Source table semantic information not found in the code provided to SnowConvert AI.

### Severity

Low

#### Description

Snowflake does not support the `MERGE ... REMOVE DUPLICATES` clause. SnowConvert AI generates a workaround that includes an `INSERT WHEN NOT MATCHED` clause, which requires knowledge of the source table’s columns. If the source table definition was not included in the code provided to SnowConvert AI, the column list cannot be generated and must be added manually.

#### Code Example

##### Input Code:

##### Redshift

```sql
 MERGE INTO target USING source ON target.id = source.id REMOVE DUPLICATES;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TEMPORARY TABLE source_duplicates AS
SELECT DISTINCT
source.*
FROM
source
INNER JOIN
target
ON target.id = source.id;
!!!RESOLVE EWI!!! /*** SSC-EWI-RS0009 - SEMANTIC INFORMATION NOT FOUND FOR THE SOURCE TABLE IN THE CODE PROVIDED TO SNOWCONVERT AI. COLUMNS TO BE INSERTED MAY BE ADDED MANUALLY. ***/!!!
--** SSC-FDM-RS0005 - REDSHIFT MERGE STATEMENT REJECTS DUPLICATE SOURCE ROWS. SNOWFLAKE ALLOWS DUPLICATES, WHICH MAY PRODUCE NON-DETERMINISTIC RESULTS. **
MERGE INTO target
USING source ON target.id = source.id
WHEN MATCHED THEN
DELETE
WHEN NOT MATCHED THEN
INSERT
VALUES ();
INSERT INTO target
SELECT
*
FROM
source_duplicates;
DROP TABLE IF EXISTS source_duplicates CASCADE;
```

#### Best Practices

* **Include all source DDL:** Provide the source table’s `CREATE TABLE` statement in the input code so SnowConvert AI can resolve columns automatically.
* **Add columns manually:** If the source table definition is unavailable, fill in the `INSERT ... VALUES ()` clause with the correct column list from your Redshift catalog.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-RS0010

Top-level procedure call with out parameters is not supported in Snowflake.

### Severity

Low

#### Description

Redshift allows top-level `CALL` statements to invoke procedures with `OUT` parameters without declaring a variable to receive the output. Snowflake requires that `OUT` parameters be assigned to a variable, which is only possible inside a stored procedure or anonymous block.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE OR REPLACE PROCEDURE get_total_sales_by_product(
    IN p_product_name VARCHAR(100),
    OUT p_total_sales DECIMAL(18, 2)
)
AS $$
BEGIN
    NULL;
END;
$$ LANGUAGE plpgsql;

CALL get_total_sales_by_product('Laptop');
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE get_total_sales_by_product (p_product_name VARCHAR(100), p_total_sales OUT DECIMAL(18, 2))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "redshift",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
AS $$
BEGIN
NULL;
END;
$$;
!!!RESOLVE EWI!!! /*** SSC-EWI-RS0010 - TOP-LEVEL PROCEDURE CALL WITH OUT PARAMETERS IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
CALL get_total_sales_by_product('Laptop');
```

#### Best Practices

* Move the call into an anonymous block and declare a variable to pass as an output parameter.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Renaming feature
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/renaming-feature.md
section: Migrations
---

# SnowConvert AI - Renaming feature

Renaming objects during a database migration process is something that a lot of users need to do. For this reason, SnowConvert AI enables the Renaming feature to allow defining new names for the following types of user-defined objects:

> **Note:**
>
> This feature is supported for Teradata, Sql Server and Redshift **ONLY**.

* Schemas
* Tables
* Views
* Materialized Views
* Procedures
* Functions
* Macros

> **Note:**
>
> The renaming feature will apply to both the object definition and the object’s uses.

These objects are usually qualified within a schema or a database, so, depending on the Database platform, the object `Table1` might be referenced simply as `Table1`, as `MySchema.Table1` or as `MyDatabase.MySchema.Table1`. It is **essential** to fully qualify each object in the renaming file to avoid ambiguity.

The new object names are specified via a .json file with the following format.

> **Note:**
>
> Note that this example contains a “Macros” section, this is a **Teradata** specific element, and may vary depending on the specified language.

```json
{
  "Schemas": {
    "SchemaName": "NewSchema"
  },
  "Tables": {
    "SchemaName.TableName": "NewSchema.TableNameChanged",
    "Table1": "Table2"
  },
  "TablesRegex": [
    {
      "RegexExpr": "(Schema1)\\.(.*)",
      "RegexReplace": "Prefix_$1.$2"
    }
  ],

  "Views": {
    "ViewName": "ViewNameChanged",
    "MaterializedViewName": "MaterializedViewNameChanged",
  },
  "ViewsRegex": [
    {
      "RegexExpr": "(Schema1)\\.(.*)",
      "RegexReplace": "$2.$1"
    }
  ],

  "Procedures": {
    "ProcedureName": "ProcedureNameChanged"
  },
  "ProceduresRegex": [
    {
      "RegexExpr": "(Schema1)\\.(.*)",
      "RegexReplace": "$2.$1"
    }
  ],

  "Macros": {
    "SchemaName.MacroName": "MacroNameChanged",
    "SimpleMacro": "SimpleMacroSf"
  },
  "MacrosRegex": [
    {
      "RegexExpr": "(Schema1)\\.(.*)",
      "RegexReplace": "$2.$1"
    }
  ],

  "Functions": {
    "SchemaName.FunctionName": "FunctionNameChanged",
    "SimpleFunction": "SimpleFunctionSf"
  },
  "FunctionsRegex": [
    {
      "RegexExpr": "(Schema1)\\.(.*)",
      "RegexReplace": "$2.$1"
    }
  ]
}
```

## Usage

In order to use the renaming feature you have to execute the CLI version of SnowConvert AI with the following argument `--RenamingFile` and provide the path to the .json file containing the renaming information. An example of the command can look like this:

> snowct.exe -i “somePath/input” -o “somePath/output” –RenamingFile “somePath/renamings.json”

### Renaming modes

Notice there are two fields for each kind of object: `"Tables"` and `"TablesRegex"`*,* `"Views"` and `"ViewsRegex"`, and so on. This is because there are two ways in which renamings can be specified.

#### Object by object (line by line)

In this mode, each line represents an object, and it must contain the original fully qualified name and the new name. So, if we want to move an object named “Table1” inside the schema *“OriginalSchema”* to the schema *“SchemaSF”*, the line must be like this:

```json
"OriginalSchema.Table1": "SchemaSF.Table1"
```

If we also want to rename it to “Table2”, the line should be like this:

```json
"OriginalSchema.Table1": "SchemaSF.Table2"
```

This information has to be specified in the `"Tables"`*,* `"Views"`*,* `"Procedures"`*,* `"Macros"` *and* `"Functions"` sections of the .json file and each line must be separated with a comma. Let’s take a look at an example:

**TableExample1**

```json
"Tables": {
    "Schema1.Table1": "SF_Schema1.SF_Table1",
    "Schema1.Table2": "SF_Schema1.SF_Table2",
    "Schema1.Table3": "SF_Schema1.SF_Table3"
  },
```

The above sample is saying that the only three tables in the whole workload to be renamed are the ones called “*Table1*”, “*Table2*” and “*Table3*”, all located inside the “Schema1” schema; they must be renamed to “*SF_Table1”, “SF*_*Table2”* and *“SF*_*Table3”,* respectively; and finally, they will be located under the *“SF_Schema1*” schema in Snowflake.

#### Regular expressions

If there is a need to rename multiple objects in the same way, the feature also allows regular expressions to define patterns to apply to objects of the same kind. Two lines are required to specify each renaming, the first line is `"RegexExpr"` which is the matching expression and the second line is the `"RegexReplace"` which is the replacing expression. This information has to be provided in the `"TablesRegex"`*,* `"ViewsRegex"`*,* `"ProceduresRegex"`*,* `"MacrosRegex"` and `"FunctionsRegex"` sections of the .json file. So, the previous example can also be written in the following manner, using the regular expression feature.

**TableExample2**

```json
"TablesRegex": [
    {
      "RegexExpr": "Schema1\\.(.*)",
      "RegexReplace": "SF_Schema1.SF_$1"
    }
  ],
```

The only difference is that this way applies to all tables located within the “Schema1” schema. The regex expression would match all tables defined within the “Schema1” schema and will create a capturing group with everything after the dot. The regex replace will move the tables to the “SF_Schema1” schema and will add the “SF_” prefix to all tables found referencing the first group created ($1) in the regex expression.

#### Renaming priority

There might be renamings that apply to the same object and only one of them is chosen. Within the same section, SnowConvert AI will apply the first renaming that matches the current object’s name, and it will stop trying to rename that object. So in the following example, despite the fact that `"Tables"` section specifies renaming “Table1” to “Table1-a” and also to “Table1-b”, SnowConvert AI will only rename it to “Table1-a”.

```json
"Tables": {
    "Schema1.Table1": "Schema1.Table1-a",
    "Schema1.Table1": "Schema1.Table1-b",
  },
```

Also, SnowConvert AI will try to rename an object first checking the object by object renaming section before trying the regular expressions section. So, in the following example despite the fact that both renamings can apply to the same object “Schema1.Table1”, only the one defined in the `"Tables"` section is applied.

```json
"Tables": {
    "Schema1.Table1": "Schema1.TableA",
  },
  "TablesRegex": [
    {
      "RegexExpr": "Schema1\\.(.*)",
      "RegexReplace": "Schema1.SF_$1"
    }
  ],
```

#### Example

Let’s say we have the following input code.

**Input Code**

```sql
CREATE TABLE CLIENT (
    ID INTEGER,
    NAME varchar(20));

CREATE TABLE TICKET (
    CLIENT_ID INTEGER,
    FOREIGN KEY (CLIENT_ID_FK) REFERENCES CLIENT(ID));

SELECT * FROM CLIENT;
```

And the following renaming information

**Renaming File (.JSON)**

```json
{
  "Tables": {
    "CLIENT": "USER"
  }
}
```

This would be the output code with and without renaming.

#### Snowflake output code

```sql
CREATE OR REPLACE TABLE CLIENT (
    ID INTEGER,
    NAME varchar(20))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/13/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE TICKET (
    CLIENT_ID INTEGER,
       FOREIGN KEY (CLIENT_ID_FK) REFERENCES CLIENT (ID))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/13/2024",  "domain": "test" }}'
;

SELECT
    * FROM
    CLIENT;
```

```sql
CREATE OR REPLACE TABLE USER (
    ID INTEGER,
    NAME varchar(20))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/13/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE TICKET (
    CLIENT_ID INTEGER,
       FOREIGN KEY (CLIENT_ID_FK) REFERENCES USER (ID))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/13/2024",  "domain": "test" }}'
;

SELECT
    * FROM
    USER;
```

Notice how all the references to “CLIENT” are renamed to “USER”

---
title: SnowConvert AI - Reports
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/README.md
section: Migrations
---

# SnowConvert AI - Reports

## Glossary

In this section, we try to explain concepts used in multiple report documents generated by SnowConvert.

* **Lines of Code (LOC):** the total number of lines in the text of the source code files, excluding blank lines, that were processed by the conversion tool. A comment is considered a line of code.
* **Parsing EWIs:** the total count of parsing errors that occurred during the code analysis process. A parsing error occurs when the parser (the component that reads and understands the source code files) encounters something unexpected. This usually means a syntax error, which refers to a code element in the file that did not match the SQL grammar specification that the parser was expecting. In other cases, these errors can also occur because the parser is not yet ready to support a specific grammar. Parsing errors are considered critical issues because if the code is not parsed, SnowConvert AI cannot assess it or translate it. If this number is high in relation to the migration workload size, input code revision is advised.
* **Unrecognized Elements:** any code element (or parts of them) such as DML, DDL, control statements, with parsing errors that SnowConvert AI was unable to process.
* **Lines of Code in Unrecognized Elements:** the total lines of code in all the unrecognized elements. This is a good indicator of how much code SnowConvert AI was **not** able to process.
* **(Top-Level) Code Units:** a Code Unit is the most atomic, standalone executable element. In most cases, these are statements (like DDL or DML), but they also include script files because those are executed as a single element. They are classified as top-level because they are usually the “root” elements for a database dialect, and they can contain other “smaller” definitions. The top-level code units vary from one SQL dialect to another (Oracle, Teradata, SQL Server, etc). Parsing errors might cause SnowConvert AI to **not** be able to properly count all top-level code units.
* **Lines of Code Conversion Rate:** the percentage of lines of code that were successfully converted by SnowConvert AI into Snowflake code. Take into consideration that unrecognized elements (because of parsing issues) will affect this metric, as their source code will be counted as not converted. Furthermore, a successful element conversion might not be fully equivalent in Snowflake because of platform differences or limitations. In these cases, while the conversion rate is not punished, SnowConvert AI will generate an FDM to alert about the possible difference in functionality. A 90% conversion rate for a code unit means that only 10% of its lines of code were not converted, and therefore, EWIs are generated for them.
* **Fully Converted Code Units:** the percentage of top-level code units that were fully converted without any error in any of their sub-parts. They are considered ready for deployment. Any code unit whose conversion rate is less than 100% is not counted as fully converted.

## [Assessment Report (docx)](assessment-report/README.md)

The assessment report is a document that summarizes the estimation of code conversion rate, and a lot of other useful information for the user to estimate how far are they to achieving a functional equivalent snowflake code.

## [Top-Level Code Units Report](top-level-code-units-report.md)

The top-level code unit report provides a general overview of the main objects present in your source code. These top-level objects have useful information about the state of the conversion and can be used to make decisions on what the next steps should be after converting.˚

## [Issues Report](issues-report.md)

The issues report is a file containing information about all the issues that happened during the migration process.

## [Elements Report](elements-report.md)

The Elements report shows a summarized count of the Grammar Elements found during the migration process. The summarization is done on a multi-column basis, so there’s a distinction between the same grammar elements if they belong in different contexts. For example a SELECT query may be part of a PROCEDURE, or a VIEW, or even be in a script file. Using this report you should able to see the elements with some nuance, and review their overall transformation status.

## [Functions Usage Report](functions-usage-report.md)

The Functions Usage report summarizes the invocations of built-in and user-defined functions found during the conversion process, grouped by their migration status. This report allows the user to get details about function usages, whether they were transformed to Snowflake with no problem, or whether they require an additional post-conversion action.

## [ETL Replatform Issues Report](etl-replatform-issues-report.md)

The ETL Replatform Issues Report (EWIs Report) provides a detailed inventory of errors, warnings, and issues encountered during SSIS to dbt migration. Use this report to identify ETL components that require manual intervention or review.

## [ETL Replatform Component Summary Report](etl-replatform-report.md)

The ETL Replatform Component Summary Report provides a comprehensive inventory of all identified SSIS components and their migration outcomes. Use this report to understand the overall ETL migration scope and identify areas requiring attention.

## [TypeMappings Report](type-mappings-report.md)

The TypeMappings report is only generated when the Data Type Customization feature is used. It displays the data type transformations applied based on your customization file. Use this report to verify that your custom rules (such as NUMBER to DECFLOAT transformations) were applied correctly to the expected columns and objects.

---
title: SnowConvert AI - Review Results
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/README.md
section: Migrations
---

# SnowConvert AI - Review Results

The output from SnowConvert AI includes both Snowflake-Ready code and reports designed to give you more information about the conversion that just took place. You’ll also be given more information about the objects present in your source data warehouse.

On the following pages, we’ll dive deeper into the following topics:

* [**Output Code**](output-code.md)
* [**Reports**](reports/README.md)

---
title: SnowConvert AI - Running SnowConvert AI
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/README.md
section: Migrations
---

# SnowConvert AI - Running SnowConvert AI

---
title: SnowConvert AI - Schemas
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/schemas.md
section: Migrations
---

# SnowConvert AI - Schemas

> **Note:**
>
> This page of the documentation is for Oracle only.

## Number of Schemas Containing Objects

Represents the number of schemas that contain identified top-level objects. Each different schema name will only count as one single schema. In case an object does not have an explicit schema in its name, SnowConvert AI will count all of those names as one single schema because it is assumed that those objects are defined in the default schema of Oracle.

It is important to consider that this number will only be incremented by the names used to create the top-level objects, the references to the object names will not be counted in this assessment value.

> **Warning:**
>
> The Database Link top-level object in Oracle is not defined under any schema, this top-level object does not apply to this assessment value.

### Sample

```sql
CREATE TABLE schema1.table1 (col1 VARCHAR(255));
CREATE TABLE SCHEMA1.table2 (col1 VARCHAR(255));

CREATE TABLE schema2.table3 (col1 VARCHAR(255));

CREATE TABLE "SCHEMA3"."table4" (col1 VARCHAR(255));
CREATE TABLE "schema3"."table5" (col1 VARCHAR(255));

CREATE TABLE table6 (col1 VARCHAR(255));
CREATE TABLE table7 (col1 VARCHAR(255));
```

**Expected Number of Schemas Containing Objects:** 5

**Explanation:** Since `table1` and `table2` come from the same schema only one schema will be counted for those two objects. With the schema name of `table3` that will count as another different schema and finally. `table4` and `table5` have the schema names with double quotes and with uppercase and lowercase, this will make these two schemas count as different ones. `table6` and `table7` do not have an explicit schema name so SnowConvert AI will assume that both objects come from the same default schema.

---
title: SnowConvert AI - Scripts - Files
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/scripts-files.md
section: Migrations
---

# SnowConvert AI - Scripts - Files

> **Note:**
>
> This page of the documentation is for Teradata only.

## Conversion Rate - Files Generated

Indicates the file generation percentage grouped by valid file extension (shown in the image above).

> **Note:**
>
> You can refer to further information about this topic in the [Conversion Rate Modes](README.md) section of our documentation.

### Formula

```none
(successfully_generated_files / total_valid_files) * 100
```

#### Associated CSV Field names

* **BTEQ Files Conversion Rate:** BTEQFilesConversionRate
* **FastLoad Files Conversion Rate:** FastLoadFilesConversionRate
* **MultiLoad Files Conversion Rate**: MultiLoadFilesConversionRate
* **TPT Files Conversion Rate**: TPTFilesConversionRate
* **TPump Files Conversion Rate**: TPumpFilesConversionRate

## Conversion Rate - Lines of Code (LOC)

Indicates the Lines of Code conversion percentage per file extension.

### Formula

```none
(successfully_converted_lines / total_line_amount_per_file_extension) * 100
```

#### Associated CSV Field names

* **BTEQ LOC Conversion Rate:** BTEQLoCConversionRate
* **FastLoad LOC Conversion Rate**: FastLoadLoCConversionRate
* **MultiLoad LOC Conversion Rate**: MultiLoadLoCConversionRate
* **TPT LOC Conversion Rate**: TPTLoCConversionRate
* **TPump LOC Conversion Rate**: TPumpLoCConversionRate

## Total File Quantity

Indicates the total amount of files of each type. It is used to calculate the `Files Generated` conversion rate.

### Associated CSV Field names

* **BTEQ Total File Quantity**: BTEQFileCount
* **FastLoad Total File Quantity:** FastLoadFileCount
* **MultiLoad Total File Quantity:** MultiLoadFileCount
* **TPT Total File Quantity:** TPTFileCount
* **TPump Total File Quantity:** TPumpFileCount

#### Sample

```none
input folder
  ├> one.bteq
  ├> two.tpt
  ├> three.doc
  └> readme.txt
```

```none
output folder
  ├> one_bteq.py
  └> two_tpt.py
```

From the previous, we will get:

* Number of BTEQ files: 1
* Number of TPT files: 1

## Total LOC

Indicates the total amount of lines of code per file extension. It is used to calculate the `Lines of Code` conversion..

### Associated CSV Field names

* **BTEQ Total LOC:** BTEQLinesCount
* **FastLoad Total LOC:** FastLoadLinesCount
* **MultiLoad Total LOC:** MultiLoadLinesCount
* **TPT Total LOC:** TPTLinesCount
* **TPump Total LOC:** TPumpLinesCount

Indicates the total amount of parsing errors per file extension.

#### Associated CSV Field names

* **BTEQ Total Parsing Errors:** BTEQTotalParsingErrors
* **FastLoad Total Parsing Errors:** FastLoadTotalParsingErrors
* **MultiLoad Total Parsing Errors:** MultiLoadTotalParsingErrors
* **TPT Total Parsing Errors:** TPTTotalParsingErrors
* **TPump Total Parsing Errors:** TPumpTotalParsingErrors

#### Sample

```sql
CREATE TABLE TABLE_INVALID [
  first_column INTEGER
];
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()

#** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CREATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CREATE' ON LINE '1' COLUMN '1'. CODE '81'. **
#--CREATE TABLE TABLE_INVALID [
#--  first_column INTEGER
#--]
```

**Explanation**: In the above example, there is a parsing error when creating the table due to the incorrect use of the square brackets (`[]`), lines 1 and 3. This will be shown in the report as 1 parsing error in the TPT files row.

---
title: SnowConvert AI - Scripts - Identified Objects
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/scripts-identified-objects.md
section: Migrations
---

# SnowConvert AI - Scripts - Identified Objects

> **Note:**
>
> This page of the documentation is for Teradata only.

The breakdown of all the database objects created or modified in all script files (BTEQ, BTQ, FL, ML, TPUMP, TPT).

## Conversion Rate - Object

> **Note:**
>
> An object is considered successfully migrated if it does not have issues with medium, high, or critical severity.

Represents the percentage of identified objects by SnowConvert AI that were successfully migrated. This helps determine the number of objects that were successfully migrated and the objects that need manual work to complete the migration of the objects to Snowflake. If `N/A` is listed in the column, it means that the object type is not supported in Snowflake. A “`-`” could also be listed in this column. This means that the set of files migrated by SnowConvert AI did not contain objects of the specific type that could be identified.

### Formula

```none
(successfully_converted_scripts_objects / total_scripts_objects) * 100
```

#### CSV Associated Field Names

* **Tables:** ScriptTableObjectConversionRate
* **Views:** ScriptViewObjectConversionRate
* **Join Index:** ScriptJoinIndexObjectConversionRate
* **Macro:** ScriptMacroObjectConversionRate
* **Procedures:** ScriptProcedureObjectConversionRate
* **Functions:** ScriptFunctionObjectConversionRate
* **Triggers**: ScriptTriggerObjectConversionRate
* **Indexes:** N/A

#### Sample

```sql
CREATE SET TABLE Tables_Database.Employee
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);

CRATE SET TABLE Tables_Database.Employee2
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee (
      Associate_Id INTEGER,
      UNIQUE (Associate_Id)
    )
    """)
  #** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '5' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '5' COLUMN '1'. CODE '81'. **
  #
  #--CRATE SET TABLE Tables_Database.Employee2
  #--   (Associate_Id     INTEGER)
  #--UNIQUE PRIMARY INDEX (Associate_Id)

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Expected Object Conversion Rate:** 50%

**Explanation:** With the previous sample code we will have a 50% Object Conversion Rate because only 1 of the 2 identified tables were successfully migrated to Snowflake.

## Conversion Rate - Lines of Code (LOC)

Indicates the Lines of Code conversion percentage per file extension.

### Formula

```none
(script_success_lines / script_total_lines) * 100
```

#### Associated CSV Field names

* **Tables:** ScriptTableLoCConversionRate
* **Views:** ScriptViewLocConversionRate
* **Join Index:** ScriptJoinIndexLoCConversionRate
* **Macros:** ScriptMacroLoCConversionRate
* **Procedures:** ScriptProcedureLoCConversionRate
* **Functions:** ScriptFunctionLoCConversionRate
* **Triggers**: ScriptTriggerLoCConversionRate
* **Indexes:** N/A

#### Sample

```sql
CREATE SET TABLE Tables_Database.Employee
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);

CREATE SET TABLE Tables_Database.Employee2
   (Associate_Id     ANYTYPE!)
UNIQUE PRIMARY INDEX (Associate_Id);
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee (
      Associate_Id INTEGER,
      UNIQUE (Associate_Id)
    )
    """)
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee2 (
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '6' COLUMN '5' OF THE SOURCE CODE STARTING AT 'Associate_Id'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS 'ANYTYPE' ON LINE '6' COLUMN '22'. CODE '15'. **
--                                                       Associate_Id     ANYTYPE!,
      UNIQUE (Associate_Id)
    )
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Expected LOC Conversion Rate: 83.33**%

**Explanation:** With the previous sample code we will have a 83.33% LOC Conversion Rate because line 5 of the input code `(Associate_Id ANYTYPE!)` could not be migrated and only 5 of the 6 total lines of code were migrated successfully.

> **Note:**
>
> You can refer to further information about this topic in the [Conversion Rate Modes](README.md) section of our documentation.

## Total Object Quantity

Represents the total amount of objects identified by SnowConvert AI during the parsing phase.

### CSV Associated Field Names

* **Tables:** ScriptTableTotalOccurrences
* **Views:** ScriptViewTotalOccurrences
* **Join Index:** ScriptJoinIndexTotalOccurrences
* **Macros:** ScriptMacroTotalOccurrences
* **Procedures:** ScriptProcedureTotalOccurrences
* **Functions:** ScriptFunctionTotalOccurrences
* **Triggers**: ScriptTriggerTotalOccurrences
* **Indexes:** ScriptIndexTotalOccurrences

#### Sample

```sql
-- Successfully parsed table.
CREATE SET TABLE Tables_Database.Employee
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);

-- Table with a parsing error that could not be identified.
CRATE SET TABLE Tables_Database.Employee2
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  # Successfully parsed table.
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee (
      Associate_Id INTEGER,
      UNIQUE (Associate_Id)
    )
    """)
  #** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '7' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '7' COLUMN '1'. CODE '81'. **
  #
  #---- Table with a parsing error that could not be identified.
  #--CRATE SET TABLE Tables_Database.Employee2
  #--   (Associate_Id     INTEGER)
  #--UNIQUE PRIMARY INDEX (Associate_Id)

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Expected Total Object Quantity:** 1.

**Explanation:** One table was completely parsed by SnowConvert AI during the parsing phase but the other table has a parsing error that causes SnowConvert AI to not identify it as a table object.

## Lines of Code

Represents the total number of lines of code for the identified top-level objects. It is important to take into account that the lines of code of the top-level object, as well as the comments, are used for this column. On the other hand, empty lines will not be counted in this column.

### CSV Associated Field Names

* **Tables:** ScriptTableTotalLinesOfCode
* **Views:** ScriptViewTotalLinesOfCode
* **Join Index:** ScriptJoinIndexTotalLinesOfCode
* **Macros:** ScriptMacroTotalLinesOfCode
* **Procedures:** ScriptProcedureTotalLinesOfCode
* **Functions:** ScriptFunctionTotalLinesOfCode
* **Triggers**: ScriptTriggerTotalLinesOfCode
* **Indexes:** ScriptIndexTotalLinesOfCode

#### Sample

```sql
-- Hello World
CREATE SET TABLE Tables_Database.Employee
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);

CREATE SET TABLE Tables_Database.Employee2
   (-- hello world
   Associate_Id     ANYTYPE!)
UNIQUE PRIMARY INDEX (Associate_Id);
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  # Hello World
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee (
      Associate_Id INTEGER,
      UNIQUE (Associate_Id)
    )
    """)
  # hello world
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee2 (
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '8' COLUMN '4' OF THE SOURCE CODE STARTING AT 'Associate_Id'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS 'ANYTYPE' ON LINE '8' COLUMN '21'. CODE '15'. **
--   Associate_Id     ANYTYPE!,
      UNIQUE (Associate_Id)
    )
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Expected Lines of Code:** 8

**Explanation:** In this case, we have 6 lines that come from the code used for the `CREATE TABLE` statements and 2 for comments that are inside of the top-level objects.

## Parsing Errors

Represents the number of parsing errors that are inside of the identified objects.

### CSV Associated Field Names

* **Tables:** ScriptTableTotalParsingErrors
* **Views:** ScriptViewTotalParsingErrors
* **Join Index:** ScriptJoinIndexTotalParsingErrors
* **Macros:** ScriptMacroTotalLinesOfCode
* **Procedures:** ScriptProcedureTotalParsingErrors
* **Functions:** ScriptFunctionTotalParsingErrors
* **Triggers**: ScriptTriggerTotalParsingErrors
* **Indexes:** ScriptIndexTotalParsingErrors

#### Sample

```sql
-- Successfully parsed table.
CREATE SET TABLE Tables_Database.Employee
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);

-- Table with a parsing error that could not be identified.
CRATE SET TABLE Tables_Database.Employee2
   (Associate_Id     INTEGER)
UNIQUE PRIMARY INDEX (Associate_Id);
```

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  # Successfully parsed table.
  exec("""
    --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
    CREATE OR REPLACE TABLE Tables_Database.Employee (
      Associate_Id INTEGER,
      UNIQUE (Associate_Id)
    )
    """)
  #** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '7' COLUMN '1' OF THE SOURCE CODE STARTING AT 'CRATE'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'CRATE' ON LINE '7' COLUMN '1'. CODE '81'. **
  #
  #---- Table with a parsing error that could not be identified.
  #--CRATE SET TABLE Tables_Database.Employee2
  #--   (Associate_Id     INTEGER)
  #--UNIQUE PRIMARY INDEX (Associate_Id)

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Expected Parsing Errors:** 1

**Explanation:** Only one parsing error will be reported in the **Parsing Errors** column because SnowConvert AI was able to only identify the first table. Since the second table was not identified, those parsing errors will not be counted in the **Parsing Errors** column.

---
title: SnowConvert AI - Scripts Line Conversion Summary
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/scripts-line-conversion-summary.md
section: Migrations
---

# SnowConvert AI - Scripts Line Conversion Summary

> **Note:**
>
> This section applies only to Teradata reports.

These fields are counted for the following Script files:

* BTEQ: .bteq, .btq
* FastLoad: .fload, .fl
* MultiLoad: .mload, .mld, ml
* TPump: .tpump, .tp
* TPT: .tpt

## Lines of Code

Represents the number of lines of code found in the Script files. This counting includes comments but does not include empty lines or lines with only whitespaces unless they are inside block comments or strings. Lines of code that were not recognized are counted as well.

### Samples

> **Note:**
>
> Samples of the SQL Conversion Summary [Lines of Code](sql-conversion-summary.md) also apply to Scripts Lines of Code.

```none
.RUN FILE 'myscript.txt'

.SET FORMAT ON;
```

**Expected Lines of Code:** 2

```none
DATABASE tduser;
```

**Expected lines of code:** 1

```none
.LAYOUT Something;

INSERT INTO myTable (
    myValue
)
VALUES (
    123
);
```

**Expected lines of code:** 7

```none
.logtable TheDatabase.tpumplog;
```

**Expected lines of code:** 1

```none
DEFINE JOB my_job
DESCRIPTION 'A description
goes here'
(
     DEFINE SCHEMA my_schema
     DESCRIPTION 'The schema' (value VARCHAR (10));

   STEP setup_tables
   (
      APPLY ('DELETE FROM &some.name;')
      TO OPERATOR (DDL_OPERATOR () );
   );
);
```

**Expected lines of code:** 12

#### CSV Associated Field Names

* ScriptTotalLoc

## LOC Conversion Percentage

This is the percentage of fully converted lines divided by the total lines of code. Unrecognized Lines of code count as not converted. Comments count as converted.

### Formula

```none
scripts_converted_lines_of_code / scripts_total_lines_of_code
```

#### Samples

> **Note:**
>
> Samples of the SQL Conversion Summary [LOC Conversion Percentage](sql-conversion-summary.md) also apply to Scripts Lines of Code.

#### CSV Associated Field Names

* ScriptTotalLoc

## Unrecognized Lines of Code

This is the number of lines of code that had an element that was not recognized.

### Samples

> **Note:**
>
> Samples of the SQL Conversion Summary [Unrecognized Lines of Code](sql-conversion-summary.md) also apply to Scripts Unrecognized Lines of Code.

#### CSV Associated Field Names

* ScriptsUnrecognizedElementsLOC

---
title: SnowConvert AI - SnowConvert AI Scopes
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/snowconvert-scopes.md
section: Migrations
---

# SnowConvert AI - SnowConvert AI Scopes

## Scope definitions

### Submitted Scope

Every single file in the input path is considered the *Submitted Scope.* However, there can be files with unrecognized extensions or unsupported encodings that will not be processed by SnowConvert AI. Even though the assessment documents provide the list of excluded files, their content is not parsed (recognized).

For more information about unrecognized extensions and unsupported encodings, see the [Validation section](../validation/README.md).

### Assessment Scope

The portion of the Submitted Scope that is seen as valid by SnowConvert AI is considered the *Assessment Scope, that is* all files with recognized extensions and supported encodings. SnowConvert AI will try to parse every single file in this scope in order to be able to provide assessment information.

### Conversion Scope

There can be elements within the Assessment Scope that are not part of the conversion scope. SnowConvert AI classifies specific top-level code units as out-of-scope for multiple reasons, such as:

* they are not relevant in Snowflake
* there is no comparable code unit in Snowflake
* the code unit definition is not readable (ex: encrypted)
* the code unit definition is in a not supported programming language (ex: java)

Lines of code of code units out of the conversion scope will not be used to calculate conversion rates, but they will be used to provide some information in the assessment documents. For example, a Database Link object In Oracle is considered out of scope, however, references made to this object are still counted and reported in the [Object References Report](reports/object-references-report.md).

The following is the list of Code Units per language considered out of the conversion scope.

#### Teradata out-of-conversion scope code units

* Triggers
* Grants
* Functions or procedures with unsupported language

#### Oracle out-of-conversion scope code units

* Triggers
* Grants
* DB Links
* Wrapped Objects
* Functions or procedures with unsupported languages

#### Transact SQL out-of-conversion scope code units

* Triggers
* Grants

#### Redshift out-of-conversion scope code units

* Grants

---
title: SnowConvert AI - SnowConvert AI UDFs
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/function-references/snowconvert-udfs.md
section: Migrations
---

# SnowConvert AI - SnowConvert AI UDFs

## Summary

SnowConvert AI includes several User-Defined Functions (UDFs) that help replicate behaviors from source languages which Snowflake doesn’t natively support. Here’s what these functions do:

## UDFs Location

User-Defined Functions (UDFs) are located in the “UDF Helpers” folder, which is created in the output directory after the migration process completes.

---
title: SnowConvert AI - Spark Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/sparkEWI.md
section: Migrations
---

# SnowConvert AI - Spark Issues

> **Note:**
>
> Conversion Scope
>
> SnowConvert AI for Spark SQL focuses its assessment and translation capabilities primarily on TABLES and VIEWS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

This page provides a comprehensive reference for how SnowConvert AI translates Spark grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

## SSC-EWI-SPK0001

CREATE TABLE without columns is not supported in Snowflake

### Severity

Medium

#### Description

This EWI is added when a `CREATE TABLE` statement is encountered without column definitions.

#### Code Example

**Input Code:**

##### Spark

```sql
 CREATE TABLE table_name1;
```

**Output Code:**

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-SPK0001 - CREATE TABLE WITHOUT COLUMNS IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE TABLE table_name1;
```

---
title: SnowConvert AI - SQL Conversion Summary
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/assessment-report/sql-conversion-summary.md
section: Migrations
---

# SnowConvert AI - SQL Conversion Summary

## Code Conversion Rate

> **Note:**
>
> This field applies to Oracle and SQLServer

The conversion rate is the percentage of the total source code that was successfully converted by SnowConvert AI into functionally equivalent Snowflake code. Every time that SnowConvert AI identifies not supported elements, *i.e,* fragments in the input source code that were not converted into Snowflake, this will affect the conversion rate. You can read more about the different conversion rate modes and how they are calculated by SnowConvert AI [here](README.md).

### CSV Associated Field Names

> **Note:**
>
> The CSV field associated is going to depend on the conversion rate mode used.

* **Code Conversion Rate:**

  + SqlLoCConversionRate
  + SqlCharacterConversionRate

## Lines of Code

> **Note:**
>
> This field applies only to Teradata reports.

Represents the number of lines of code found in the SQL files. This counting includes comments but does not include empty lines or lines with only whitespaces unless they are inside block comments or strings. Lines of code that were not recognized are counted as well.

### Samples

```sql
SELECT 123 FROM my_table;
```

**Expected Lines of Code:** 1

```sql
SELECT 123
FROM my_table;
```

**Expected lines of code:** 2

```sql
SELECT 123
FROM my_table;

Unrecognized statement
```

**Expected lines of code:** 3

```sql
SELECT '123

abc' FROM my_table;
```

**Expected lines of code: 3**

**Explanation:** In this case, we have an empty line inside a string. Since this is part of the selected string, is considered part of the code and is counted as a line of code.

```sql
invalid '

' code
```

**Expected lines of code: 3**

**Explanation:** In this case, even if the code was not recognized, there was still a string containing the empty line. Such cases will count the empty line of code as well.

```sql
-- Hello world
```

**Expected Lines of Code:** 1

```sql
/* hello

world */
```

**Expected Lines of Code:** 3

**Explanation:** In this case, the second line is part of the block comment in the example, so this is counted as one line of code as well.

#### CSV Associated Field Names

* SqlLinesOfCode

## LOC Conversion Percentage

> **Note:**
>
> This field applies only to Teradata reports.

This is the percentage of fully converted lines divided by the total lines of code. Unrecognized Lines of code count as not converted. Comments count as converted.

Elements that contain an EWI with medium severity or higher will count as not converted. These elements may include more than one line depending on how the input code was formatted.

### Formula

```none
sql_converted_lines_of_code / sql_total_lines_of_code
```

#### Samples

```sql
CREATE TABLE t1
(
col1 INTEGER
);
```

**Expected LOC Conversion Percentage:** 100%

**Explanation:** The entire table is supported. Because of this, the conversion rate is 100%.

```sql
CREATE TABLE t1
(
NOT A VALID ELEMENT
);
```

**Expected LOC Conversion Percentage:** 75%

**Explanation:** In this case, the third line is unrecognized. The other 3 lines are identified and converted properly, causing a conversion rate of 75%.

```sql
CREATE TABLE t1 (
NOT A VALID ELEMENT );
```

**Expected LOC Conversion Percentage:** 50%

**Explanation:** Even though this is the same code as Sample 2, the format of the code is different. In this case, the first line is considered converted, and the second line has an unrecognized part, causing the line to be counted as not supported. Because of this, the conversion rate is 50%.

```sql
CREATE TABLE t1 (
  col1 INTEGER
);

SELECT CAST (123 AS INTERVAL DAY(4));
```

**Expected LOC Conversion Percentage:** 75%

**Explanation:** In this case, the 3 lines of the `CREATE TABLE` are supported, but the `SELECT` has a `CAST` to `INTERVAL` which is not supported, causing line 5 to be counted as unsupported.

```sql
-- Hello world
Unrecognized statement
```

**Expected LOC Conversion Percentage:** 50%

**Explanation:** In this case, the first line comment is considered as converted and the second line, an unrecognized element, is not supported, causing a 50% conversion rate.

#### CSV Associated Field Names

* SqlLoCConversionRate

## Unrecognized Lines of Code

> **Note:**
>
> This field applies only to Teradata reports.

This is the number of lines of code that had an element that was not recognized.

```none
Unrecognized Element
```

**Unrecognized Lines of Code:** 1

```none
invalid '

' something
```

**Unrecognized Lines of Code:** 3

**Explanation:** In this case, there is a string that starts at line 1 and ends at line 3. However, the entire block of code was not recognized, causing the 3 lines to be counted as unrecognized lines of code.

### CSV Associated Field Names

* SqlUnrecognizedElementsLOC

---
title: SnowConvert AI - SQL Server
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/sql-server.md
section: Migrations
---

# SnowConvert AI - SQL Server

The first step for migration is getting the code that you need to migrate. There are many ways to extract the code from your database. We recommend that you use [SQL Server Management Studio (SSMS)](https://learn.microsoft.com/en-us/sql/ssms/download-sql-server-management-studio-ssms?view=sql-server-ver16). We also provide an alternative for MacOS and Linux environments.

## Prerequisites

* Access to a server with an SQLServer database.

## Extraction through SQL Server Management Studio (SSMS)

SQL Server Management Studio (SSMS) is only available for Windows. Go to the next section for Mac OS and Linux.

1. Open SSMS.
2. Connect to the desired server and server instance with credentials that allow
   visibility of the desired database(s).
3. In the main SSMS window, open **Object Explorer** if not already opened.
4. In the Object Explorer pane, expand **Databases** if not already expanded.
5. Right-click on the desired database and select **Tasks** -> **Generate Scripts**…

6. If the Introduction page of the Generate Scripts dialog is shown, click **Next**. Otherwise, proceed to the next step.

7. On the Choose Objects page of the Generate Scripts dialog:

* Select the **Select specific database objects** radio button and put a **check** in all the database object type **checkboxes** displayed **EXCEPT Users** (NOTE: the list of database object types presented depends on the presence of database objects in the chosen database. Thus, your list of database object types may look different. Just select all database object types EXCEPT Users).
* Click **Next**

8. On the Set Scripting Options page of the Generate Scripts dialog:

* Click the **Save as script file** button and **One script file per object**

* Click the **Advanced** button.

* In the Advanced Scripting Options dialog box, make sure the following Options are set as indicated, keeping the default for all other Option

| Section | Setting. | Value |
| --- | --- | --- |
| General | Include System Constraint names | True |
| empty | Script Extended Properties | True |
| Table/View Options | Script Indexes | True |
| - | Script Triggers | True |

* When done, click **OK** to return to the Set Scripting Options window of the Generate Scripts dialog.

* Select the **Save as script file** radio button.
* Click the **ellipsis** (…) to the right of the File name: field.
* Navigate to a suitable location, enter a descriptive value in the File Name: field (for example, **<server_name>**_**<instance_name>**_**<database_name>**), and click Save.
* Select the **ANSI text** radio button.
* Click Next.

9. On the Summary page of the Generate Scripts dialog, confirm the settings are correct and click **Next >** when ready to start the
   extraction (that is, the extraction will commence when you click **Next >**). The Save Scripts page will appear and will show the
   extraction progress.

10. On the Save Scripts page of the Generate Scripts dialog box (not shown), confirm all Results were Success and click **Finish**.
11. Repeat steps 5 through 10 for each desired database (using a different file name for each). When all databases have been extracted successfully, proceed to the next step.
12. Transmit the resulting file(s) to Snowflake for further analysis.

### Package the results

When the extraction process is finished, compress the results and send them over.

## Table sizing report

1. Option A: For all databases in scope, right click on the database, Reports >
   Standard Reports > Disk Usage By Table. A report will be generated, right click on
   the report and export as Excel.

2. Option B: Run the following script:

```sql
USE <DB_NAME>;
SELECT
 t.NAME AS TableName,
 s.NAME AS SchemaName,
 SUM(a.total_pages) * 8 / 1024 AS TotalSpaceMB,
 SUM(a.used_pages) * 8 / 1024 AS UsedSpaceMB,
 (SUM(a.total_pages) - SUM(a.used_pages)) * 8 / 1024 AS
UnusedSpaceMB
FROM
 sys.tables t
INNER JOIN
 sys.indexes i ON t.OBJECT_ID = i.object_id
INNER JOIN
 sys.partitions p ON i.object_id = p.OBJECT_ID AND i.index_id =
p.index_id
INNER JOIN
 sys.allocation_units a ON p.partition_id = a.container_id
LEFT OUTER JOIN
 sys.schemas s ON t.schema_id = s.schema_id
GROUP BY
 t.NAME, s.NAME, p.Rows
ORDER BY
 TotalSpaceMB DESC;
```

---
title: SnowConvert AI - SQL Server
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/sql-server.md
section: Migrations
---

# SnowConvert AI - SQL Server

## What is SnowConvert AI for SQL Server?

SnowConvert AI is a software tool that understands SQL Server scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for SQL Server performs the following conversions:

### SQL Server to Snowflake SQL

SnowConvert AI understands the SQL Server source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake.

#### Sample code

SQL Server basic input code:

```sql
CREATE TABLE Persons (
    PersonID int,
    LastName varchar(255),
    FirstName varchar(255),
    Address varchar(255),
    City varchar(255)
);
```

Snowflake SQL output code:

```sql
CREATE OR REPLACE TABLE Persons (
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255),
    Address VARCHAR(255),
    City VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

As you can see, most of the structure remains the same. There are some cases where the datatypes have to be transformed, for example.

### SQL Server Stored Procedures to JavaScript Embedded in Snowflake SQL

SnowConvert AI takes SQL Server stored procedures and converts them to JavaScript embedded into Snowflake SQL. SQL Server’s CREATE PROCEDURE is replaced by Snowflake’s CREATE OR REPLACE PROCEDURE. JavaScript is called as a scripting language, and all of the inner statements are converted to JavaScript.

#### Sample code

SQL Server basic stored procedure:

```sql
CREATE PROCEDURE SelectAllCustomers
AS
SELECT * FROM Customers
GO;
```

Snowflake SQL output code, with embedded JavaScript:

```sql
-- Additional Params: -t JavaScript
CREATE OR REPLACE PROCEDURE SelectAllCustomers ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   // REGION SnowConvert AI Helpers Code
   // END REGION

 EXEC(`SELECT
   *
FROM
   Customers`);
$$;
```

* When creating the JavaScript code, there is a portion of code added as a *helper*, required for an easier transformation of the contents of the procedure.
* You can expect to see warnings with an associated code to help you find out what is happening in the converted code. (See [issues and troubleshooting](../../../technical-documentation/issues-and-troubleshooting/README.md))

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your SQL Server files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

On the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for SQL Server is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Sql Server
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/sql-server.md
section: Migrations
---

# SnowConvert AI - Sql Server

## Specific CLI arguments

### `-u, --usedatabase`

Flag to indicate whether or not the Transact SQL USE statement should be translated.

#### `-p, --arrange <ARRANGE OPTION>` [`<ARRANGE OPTION>`]

Flag to indicate whether or not to preprocess or arrange the source code before its transformation. By default, it’s set to FALSE.

| Arrange Option | Description |
| --- | --- |
| prettyprint | Applies indentation to the original code and get it well organized. |
| generatereports | Generates extra reports after the arrangement. |
| multiple | Applies arrangement to multiple databases represented as multiple folders, and keeps their original structure. |

---
title: SnowConvert AI - SQL Server - CREATE FUNCTION
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-function.md
section: Migrations
---

# SnowConvert AI - SQL Server - CREATE FUNCTION

Translation reference for the Transact-SQL User Defined Functions

Applies to

* SQL Server
* Azure Synapse Analytics

## Description

SQL Server only supports two types of [User Defined Functions](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15):

* [Scalar](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15#a-using-a-scalar-valued-user-defined-function-that-calculates-the-iso-week)
* [Table-Valued](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15#b-creating-an-inline-table-valued-function)

Using these UDFs types, is possible to subcategorized them into **simple and complex,** according to the inner logic.

Simple UDFs, matches the SQL Server syntax with Snowflake syntax. This type doesn’t add any logic and goes straightforward to the result. These are usually match to Snowflake’s SQL UDFs.
SnowConvert supports translating SQL Server Scalar User Defined Functions directly to [Snowflake Scripting UDFs](../../../../developer-guide/udf/sql/udf-sql-procedural-functions.md) when they meet specific criteria.

Complex UDFs, makes extensive use of a particular statements ([INSERT](https://docs.microsoft.com/en-us/sql/t-sql/statements/insert-transact-sql?view=sql-server-ver15), [DELETE](https://docs.microsoft.com/en-us/sql/t-sql/statements/delete-transact-sql?view=sql-server-ver15), [UPDATE](https://docs.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver15), [SET](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/set-local-variable-transact-sql?view=sql-server-ver15), [DECLARE](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/declare-local-variable-transact-sql?view=sql-server-ver15), etc) or [control-of-flow](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/control-of-flow?view=sql-server-ver15) blocks ([IF…ELSE](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/if-else-transact-sql?view=sql-server-ver15), [WHILE](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/while-transact-sql?view=sql-server-ver15), etc) and usually represents a mismatch or violation to Snowflake’s SQL UDFs definition.

## Limitations

Transact UDFs have some limitations not present in other database engines (*such as Oracle and Teradata*). These limitations helps the translations by narrowing the failure scope. This means, there are specific scenarios we can expect to avoid.

Here are some of the limitations SQL Server has on UDFs

* UDFs cannot be used to perform actions that modify the database state
* User-defined functions cannot contain an OUTPUT INTO clause that has a table as its target
* User-defined functions cannot return multiple result sets. Use a stored procedure if you need to return multiple result sets.

For the full list, please check this link [Create User-defined Functions (Database engine)](https://docs.microsoft.com/en-us/sql/relational-databases/user-defined-functions/create-user-defined-functions-database-engine)

scalar.md

inline-table-valued.md

## INLINE TABLE-VALUED

Translation reference to convert Transact-SQL UDF (User Defined Functions) with TABLE return type to Snowflake.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> Inline Table-Valued functions are table expression that can accept parameters, perform a SELECT statement and return a TABLE ([SQL Server Language Reference Creating an inline table-valued function](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15#b-creating-an-inline-table-valued-function)).

#### Transact Syntax

```sql
 -- Transact-SQL Inline Table-Valued Function Syntax
CREATE [ OR ALTER ] FUNCTION [ schema_name. ] function_name
( [ { @parameter_name [ AS ] [ type_schema_name. ] parameter_data_type
    [ = default ] [ READONLY ] }
    [ ,...n ]
  ]
)
RETURNS TABLE
    [ WITH <function_option> [ ,...n ] ]
    [ AS ]
    RETURN [ ( ] select_stmt [ ) ]
[ ; ]
```

#### Snowflake SQL Syntax

```sql
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS TABLE ( <output_col_name> <output_col_type> [, <output_col_name> <output_col_type> ... ] )
  AS '<sql_expression>'sql
```

### Sample Source Patterns

The following section describes all the possible source code patterns that can appear in this kind of `CREATE FUNCTION` syntax.

For Inline Table-Valued functions, there can only exist one statement per body that could be:

* `SELECT` Statement
* `WITH` Common Table Expression

#### Select and return values directly from one table

This is the simplest scenario, performing a simple select from a table and returning those values

##### Transact-SQL

##### Inline Table-Valued

```sql
CREATE FUNCTION GetDepartmentInfo()
RETURNS TABLE
AS
RETURN
(
  SELECT DepartmentID, Name, GroupName
  FROM HumanResources.Department
);

GO

SELECT * from GetDepartmentInfo()
```

##### Result

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

##### Snowflake SQL

##### Inline Table-Valued

```sql
CREATE OR REPLACE FUNCTION GetDepartmentInfo ()
RETURNS TABLE(
  DepartmentID STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN DepartmentID WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  Name STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN Name WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  GroupName STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN GroupName WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
$$
    SELECT
    CAST(DepartmentID AS STRING),
    CAST(Name AS STRING),
    CAST(GroupName AS STRING)
    FROM
    HumanResources.Department
$$;

SELECT
    *
from
    TABLE(GetDepartmentInfo());
```

##### Result

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

#### Select and return values from multiple tables renaming columns and using built in functions

This is an example of a query using built-in functions in a select statement getting data from different tables, renaming columns and returning a table.

##### Transact-SQL

##### Inline Table-Valued

```sql
CREATE FUNCTION GetPersonBasicInfo()
RETURNS TABLE
AS
RETURN
(
 SELECT TOP (20)
      P.PersonType,
      P.FirstName,
      E.JobTitle,
   E.Gender,
      YEAR(E.HireDate) as HIREYEAR
  FROM
      Person.Person P
  INNER JOIN
      HumanResources.Employee E
  ON
      P.BusinessEntityID = E.BusinessEntityID
);

GO

SELECT * FROM GetPersonBasicInfo();
```

##### Result

| PersonType | FirstName | JobTitle | Gender | HIREYEAR |
| --- | --- | --- | --- | --- |
| EM | Ken | Chief Executive Officer | M | 2009 |
| EM | Terri | Vice President of Engineering | F | 2008 |
| EM | Roberto | Engineering Manager | M | 2007 |
| EM | Rob | Senior Tool Designer | M | 2007 |
| EM | Gail | Design Engineer | F | 2008 |
| EM | Jossef | Design Engineer | M | 2008 |
| EM | Dylan | Research and Development Manager | M | 2009 |
| EM | Diane | Research and Development Engineer | F | 2008 |
| EM | Gigi | Research and Development Engineer | F | 2009 |
| EM | Michael | Research and Development Manager | M | 2009 |
| EM | Ovidiu | Senior Tool Designer | M | 2010 |
| EM | Thierry | Tool Designer | M | 2007 |
| EM | Janice | Tool Designer | F | 2010 |
| EM | Michael | Senior Design Engineer | M | 2010 |
| EM | Sharon | Design Engineer | F | 2011 |
| EM | David | Marketing Manager | M | 2007 |
| EM | Kevin | Marketing Assistant | M | 2007 |
| EM | John | Marketing Specialist | M | 2011 |
| EM | Mary | Marketing Assistant | F | 2011 |
| EM | Wanida | Marketing Assistant | F | 2011 |

##### Snowflake SQL

##### Inline Table-Valued

```sql
CREATE OR REPLACE FUNCTION GetPersonBasicInfo ()
RETURNS TABLE(
 PersonType STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN PersonType WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 FirstName STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN FirstName WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 JobTitle STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN JobTitle WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 Gender STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN Gender WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 HIREYEAR INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
$$
  SELECT
  TOP 20
  CAST(P.PersonType AS STRING),
  CAST(P.FirstName AS STRING),
  CAST(E.JobTitle AS STRING),
  CAST(E.Gender AS STRING),
  YEAR(E.HireDate :: TIMESTAMP) as HIREYEAR
   FROM
  Person.Person P
   INNER JOIN
   HumanResources.Employee E
   ON P.BusinessEntityID = E.BusinessEntityID
$$;

SELECT
  *
FROM
  TABLE(GetPersonBasicInfo());
```

##### Result

| PersonType | FirstName | JobTitle | Gender | HIREYEAR |
| --- | --- | --- | --- | --- |
| EM | Ken | Chief Executive Officer | M | 2009 |
| EM | Terri | Vice President of Engineering | F | 2008 |
| EM | Roberto | Engineering Manager | M | 2007 |
| EM | Rob | Senior Tool Designer | M | 2007 |
| EM | Gail | Design Engineer | F | 2008 |
| EM | Jossef | Design Engineer | M | 2008 |
| EM | Dylan | Research and Development Manager | M | 2009 |
| EM | Diane | Research and Development Engineer | F | 2008 |
| EM | Gigi | Research and Development Engineer | F | 2009 |
| EM | Michael | Research and Development Manager | M | 2009 |
| EM | Ovidiu | Senior Tool Designer | M | 2010 |
| EM | Thierry | Tool Designer | M | 2007 |
| EM | Janice | Tool Designer | F | 2010 |
| EM | Michael | Senior Design Engineer | M | 2010 |
| EM | Sharon | Design Engineer | F | 2011 |
| EM | David | Marketing Manager | M | 2007 |
| EM | Kevin | Marketing Assistant | M | 2007 |
| EM | John | Marketing Specialist | M | 2011 |
| EM | Mary | Marketing Assistant | F | 2011 |
| EM | Wanida | Marketing Assistant | F | 2011 |

#### Select columns using WITH statement

The body of an inline table-valued function can also be specified using a WITH statement as shown below.

##### Transact-SQL

##### Inline Table-Valued

```sql
CREATE FUNCTION GetMaritalStatusByGender
(
 @P_Gender nchar(1)
)

RETURNS TABLE
AS
RETURN
(
  WITH CTE AS
 (
  SELECT BusinessEntityID, MaritalStatus, Gender
  FROM HumanResources.Employee
  where Gender = @P_Gender
 )
  SELECT
 MaritalStatus, Gender, CONCAT(P.FirstName,' ', P.LastName) as Name
  FROM
 CTE INNER JOIN Person.Person P
  ON
 CTE.BusinessEntityID = P.BusinessEntityID
);

GO

select * from GetMaritalStatusByGender('F');
```

##### Result

| MaritalStatus | Gender | Name |
| --- | --- | --- |
| S | F | Terri Duffy |
| M | F | Gail Erickson |
| S | F | Diane Margheim |
| M | F | Gigi Matthew |
| M | F | Janice Galvin |
| M | F | Sharon Salavaria |
| S | F | Mary Dempsey |
| M | F | Wanida Benshoof |
| M | F | Mary Gibson |
| M | F | Jill Williams |
| S | F | Jo Brown |
| M | F | Britta Simon |
| M | F | Margie Shoop |
| M | F | Rebecca Laszlo |
| M | F | Suchitra Mohan |
| M | F | Kim Abercrombie |
| S | F | JoLynn Dobney |
| M | F | Nancy Anderson |
| M | F | Ruth Ellerbrock |
| M | F | Doris Hartwig |
| M | F | Diane Glimp |
| M | F | Bonnie Kearney |
| M | F | Denise Smith |
| S | F | Diane Tibbott |
| M | F | Carole Poland |
| M | F | Carol Philips |
| M | F | Merav Netz |
| S | F | Betsy Stadick |
| S | F | Danielle Tiedt |
| S | F | Kimberly Zimmerman |
| M | F | Elizabeth Keyser |
| M | F | Mary Baker |
| M | F | Alice Ciccu |
| M | F | Linda Moschell |
| S | F | Angela Barbariol |
| S | F | Kitti Lertpiriyasuwat |
| S | F | Susan Eaton |
| S | F | Kim Ralls |
| M | F | Nicole Holliday |
| S | F | Anibal Sousa |
| M | F | Samantha Smith |
| S | F | Olinda Turner |
| S | F | Cynthia Randall |
| M | F | Sandra Reátegui Alayo |
| S | F | Linda Randall |
| S | F | Shelley Dyck |
| S | F | Laura Steele |
| S | F | Susan Metters |
| S | F | Katie McAskill-White |
| M | F | Barbara Decker |
| M | F | Yvonne McKay |
| S | F | Janeth Esteves |
| M | F | Brenda Diaz |
| M | F | Lorraine Nay |
| M | F | Paula Nartker |
| S | F | Lori Kane |
| M | F | Kathie Flood |
| S | F | Belinda Newman |
| M | F | Karen Berge |
| M | F | Lori Penor |
| M | F | Jo Berry |
| M | F | Laura Norman |
| M | F | Paula Barreto de Mattos |
| M | F | Mindy Martin |
| M | F | Deborah Poe |
| S | F | Candy Spoon |
| M | F | Barbara Moreland |
| M | F | Janet Sheperdigian |
| S | F | Wendy Kahn |
| S | F | Sheela Word |
| M | F | Linda Meisner |
| S | F | Erin Hagens |
| M | F | Annette Hill |
| S | F | Jean Trenary |
| S | F | Stephanie Conroy |
| S | F | Karen Berg |
| M | F | Janaina Bueno |
| M | F | Linda Mitchell |
| S | F | Jillian Carson |
| S | F | Pamela Ansman-Wolfe |
| S | F | Lynn Tsoflias |
| M | F | Amy Alberts |
| S | F | Rachel Valdez |
| M | F | Jae Pak |

##### Snowflake SQL

##### Inline Table-Valued

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "HumanResources.Employee", "Person.Person" **
CREATE OR REPLACE FUNCTION GetMaritalStatusByGender
(P_GENDER STRING
)
RETURNS TABLE(
 MaritalStatus STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN MaritalStatus WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 Gender STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN Gender WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
 Name VARCHAR
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
AS
$$
 --** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
  WITH CTE AS
 (
  SELECT
   BusinessEntityID,
   MaritalStatus,
   Gender
  FROM
   HumanResources.Employee
  where
   Gender = :P_GENDER
 )
  SELECT
  CAST(MaritalStatus AS STRING),
  CAST(Gender AS STRING),
  CONCAT(P.FirstName,' ', P.LastName) as Name
  FROM
  CTE
  INNER JOIN
   Person.Person P
  ON CTE.BusinessEntityID = P.BusinessEntityID
$$;

select
  *
from
  TABLE(GetMaritalStatusByGender('F'));
```

##### Result

| MaritalStatus | Gender | Name |
| --- | --- | --- |
| S | F | Terri Duffy |
| M | F | Gail Erickson |
| S | F | Diane Margheim |
| M | F | Gigi Matthew |
| M | F | Janice Galvin |
| M | F | Sharon Salavaria |
| S | F | Mary Dempsey |
| M | F | Wanida Benshoof |
| M | F | Mary Gibson |
| M | F | Jill Williams |
| S | F | Jo Brown |
| M | F | Britta Simon |
| M | F | Margie Shoop |
| M | F | Rebecca Laszlo |
| M | F | Suchitra Mohan |
| M | F | Kim Abercrombie |
| S | F | JoLynn Dobney |
| M | F | Nancy Anderson |
| M | F | Ruth Ellerbrock |
| M | F | Doris Hartwig |
| M | F | Diane Glimp |
| M | F | Bonnie Kearney |
| M | F | Denise Smith |
| S | F | Diane Tibbott |
| M | F | Carole Poland |
| M | F | Carol Philips |
| M | F | Merav Netz |
| S | F | Betsy Stadick |
| S | F | Danielle Tiedt |
| S | F | Kimberly Zimmerman |
| M | F | Elizabeth Keyser |
| M | F | Mary Baker |
| M | F | Alice Ciccu |
| M | F | Linda Moschell |
| S | F | Angela Barbariol |
| S | F | Kitti Lertpiriyasuwat |
| S | F | Susan Eaton |
| S | F | Kim Ralls |
| M | F | Nicole Holliday |
| S | F | Anibal Sousa |
| M | F | Samantha Smith |
| S | F | Olinda Turner |
| S | F | Cynthia Randall |
| M | F | Sandra Reátegui Alayo |
| S | F | Linda Randall |
| S | F | Shelley Dyck |
| S | F | Laura Steele |
| S | F | Susan Metters |
| S | F | Katie McAskill-White |
| M | F | Barbara Decker |
| M | F | Yvonne McKay |
| S | F | Janeth Esteves |
| M | F | Brenda Diaz |
| M | F | Lorraine Nay |
| M | F | Paula Nartker |
| S | F | Lori Kane |
| M | F | Kathie Flood |
| S | F | Belinda Newman |
| M | F | Karen Berge |
| M | F | Lori Penor |
| M | F | Jo Berry |
| M | F | Laura Norman |
| M | F | Paula Barreto de Mattos |
| M | F | Mindy Martin |
| M | F | Deborah Poe |
| S | F | Candy Spoon |
| M | F | Barbara Moreland |
| M | F | Janet Sheperdigian |
| S | F | Wendy Kahn |
| S | F | Sheela Word |
| M | F | Linda Meisner |
| S | F | Erin Hagens |
| M | F | Annette Hill |
| S | F | Jean Trenary |
| S | F | Stephanie Conroy |
| S | F | Karen Berg |
| M | F | Janaina Bueno |
| M | F | Linda Mitchell |
| S | F | Jillian Carson |
| S | F | Pamela Ansman-Wolfe |
| S | F | Lynn Tsoflias |
| M | F | Amy Alberts |
| S | F | Rachel Valdez |
| M | F | Jae Pak |

### Known issues

No issues were found

### Related EWIs

1. [SSC-FDM-TS0012](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Information for the expression was not found. CAST to STRING used
2. [SSC-PRF-TS0001](../../general/technical-documentation/issues-and-troubleshooting/performance-review/sqlServerPRF.md): Performance warning - recursion for CTE not checked. Might require a recursive keyword.
3. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review

## MULTI-STATEMENT TABLE-VALUED

Translation reference to convert Transact-SQL UDF (User Defined Functions) with TABLE return type to Snowflake.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> All the code samples on this page have not been implemented yet in SnowConvert AI. They should be interpreted as a reference for how each scenario should be translated to Snowflake. These translations may change in the future.Some parts in the output code are omitted for clarity reasons.

### Description

Multi-statement table-valued is similar to Inline-statement table-valued (INLINE TABLE-VALUED). However Multi-statement table-valued may have more than one statement in its function body, the table columns are specified in the return type and it has a BEGIN/END block ([SQL Server Language Reference Creating a multi-statement table-valued function](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15#c-creating-a-multi-statement-table-valued-function)

#### Transact-SQL Syntax

```sql
CREATE [ OR ALTER ] FUNCTION [ schema_name. ] function_name
( [ { @parameter_name [ AS ] [ type_schema_name. ] parameter_data_type
    [ = default ] [READONLY] }
    [ ,...n ]
  ]
)
RETURNS @return_variable TABLE <table_type_definition>
    [ WITH <function_option> [ ,...n ] ]
    [ AS ]
    BEGIN
        function_body
        RETURN
    END
[ ; ]
```

#### Snowflake SQL

```sql
CREATE OR REPLACE FUNCTION <name> ( [ <arguments> ] )
  RETURNS TABLE ( <output_col_name> <output_col_type> [, <output_col_name> <output_col_type> ... ] )
  AS '<sql_expression>'
```

### Sample Source Patterns

The following section describes all the possible source code patterns that can appear in this kind ofCREATE FUNCTION syntax.

The function body of Multi-Statement Table-Valued function must be a SELECT statement. For this reason the others statements must be called separately.

#### **Insert values in a table**

Inserts one or more rows into the table and returns the table with the new values

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTION calc_behavioral_segment()
RETURNS @behavioral_segments TABLE (behavioral_segment VARCHAR(50))
AS
BEGIN
 DECLARE @col varchar(15)
 SET @col = 'Unknown'
 INSERT INTO @behavioral_segments
 SELECT @col

 RETURN
END

SELECT * FROM calc_behavioral_segment();
```

##### Result

| BEHAVIORAL_SEGMENT |
| --- |
| Unknown |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION calc_behavioral_segment ()
RETURNS BEHAVIORAL_SEGMENTS TABLE (
 behavioral_segment VARCHAR(50))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 DECLARE @col varchar(15)
 SET @col = 'Unknown'
 INSERT INTO @behavioral_segments
 SELECT @col

 RETURN
END

SELECT * FROM calc_behavioral_segment();;
```

##### Results

| BEHAVIORAL_SEGMENT |
| --- |
| Unknown |

#### Insert value according to if/else statement

Inserts a row into the table according to the condition and returns the table with the new value

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTION odd_or_even_number(@number INT)
RETURNS @numbers TABLE (number_type VARCHAR(15))
AS
BEGIN
 IF ((@number % 2) = 0)
 BEGIN
  INSERT @numbers SELECT 'Even'
 END

 ELSE
 BEGIN
  INSERT @numbers SELECT 'Odd'
 END

 RETURN
END

SELECT * FROM odd_or_even_number(9);
```

##### Result

| NUMBER_TYPE |
| --- |
| Odd |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION odd_or_even_number (NUMBER INT)
RETURNS NUMBERS TABLE (
 number_type VARCHAR(15))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 IF ((@number % 2) = 0)
 BEGIN
  INSERT @numbers SELECT 'Even'
 END

 ELSE
 BEGIN
  INSERT @numbers SELECT 'Odd'
 END

 RETURN
END

SELECT * FROM odd_or_even_number(9);;
```

##### Result

| NUMBER_TYPE |
| --- |
| Odd |

#### Inserts multiple according to if/else statement

The example below inserts more than one value into the table and more than one variable is modified according to the condition. Returns the table with the new values

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTION new_employee_hired(@id VARCHAR (50), @position VARCHAR(50), @experience VARCHAR(15))
RETURNS @new_employee TABLE (id_employee VARCHAR (50), working_from_home BIT, team VARCHAR(15), computer VARCHAR(15))
AS
BEGIN
 DECLARE @wfh BIT
 DECLARE @team VARCHAR(15)
 DECLARE @computer VARCHAR(15)

 IF @position = 'DEVELOPER'
 BEGIN
  SET @team = 'TEAM_1'
  SET @computer = 'LAPTOP'
 END

 IF @position = 'IT'
 BEGIN
  SET @team = 'TEAM_2'
  SET @computer = 'DESKTOP'
 END

 IF @experience = 'JUNIOR'
 BEGIN
  SET @wfh = '0'
 END
 IF @experience = 'SENIOR'
 BEGIN
  SET @wfh = '1'
 END

 INSERT INTO @new_employee VALUES (@id, @wfh, @team, @computer)
 RETURN
END

SELECT * FROM new_employee_hired('123456789', 'DEVELOPER', 'SENIOR');
```

##### Result

| ID_EMPLOYEE | WORKING_FROM_HOME | TEAM | COMPUTER |
| --- | --- | --- | --- |
| 123456789 | 1 | TEAM_1 | LAPTOP |

##### Snowflake

##### MULTI-STATEMENT TABLE-VALUED

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION new_employee_hired (ID STRING, POSITION STRING, EXPERIENCE STRING)
RETURNS NEW_EMPLOYEE TABLE (
 id_employee VARCHAR(50),
 working_from_home BOOLEAN,
 team VARCHAR(15),
 computer VARCHAR(15))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 DECLARE @wfh BIT
 DECLARE @team VARCHAR(15)
 DECLARE @computer VARCHAR(15)

 IF @position = 'DEVELOPER'
 BEGIN
  SET @team = 'TEAM_1'
  SET @computer = 'LAPTOP'
 END

 IF @position = 'IT'
 BEGIN
  SET @team = 'TEAM_2'
  SET @computer = 'DESKTOP'
 END

 IF @experience = 'JUNIOR'
 BEGIN
  SET @wfh = '0'
 END
 IF @experience = 'SENIOR'
 BEGIN
  SET @wfh = '1'
 END

 INSERT INTO @new_employee VALUES (@id, @wfh, @team, @computer)
 RETURN
END

SELECT * FROM new_employee_hired('123456789', 'DEVELOPER', 'SENIOR');;
```

##### Result

| ID_EMPLOYEE | WORKING_FROM_HOME | TEAM | COMPUTER |
| --- | --- | --- | --- |
| 123456789 | 1 | TEAM_1 | LAPTOP |

> **Warning:**
>
> In case there are nested if statements and more than one variables are modified in the statements it is necessary to use a stored procedure.

#### Update values previously inserted

Updates columns values of the table into the function body and returns it with the new values.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTION get_employees_history()
RETURNS @employee_history TABLE (
 department_name NVARCHAR(50),
 first_name NVARCHAR(50),
 last_name NVARCHAR(50),
 start_date DATE,
 end_date DATE,
 job_title NVARCHAR(50),
 months_working INT
)
BEGIN
 INSERT INTO @employee_history
 SELECT D.name AS department_name, P.first_name, P.last_name, EH.start_date, EH.end_date, E.job_title, 0 FROM Department D
 LEFT OUTER JOIN employee_department_history EH
  ON D.department_ID = EH.department_ID
 INNER JOIN  Employee E
  ON E.business_entity_ID = EH.business_entity_ID
 INNER JOIN Person P
  ON P.business_entity_ID = E.business_entity_ID

 UPDATE @employee_history
 SET
  months_working =
  CASE WHEN end_date IS NULL THEN DATEDIFF(MONTH, start_date, GETDATE())
  ELSE DATEDIFF(MONTH, start_date, end_date)
 END
 RETURN;
END;

SELECT TOP(10) * FROM get_employees_history();
```

##### Result

| DEPARTMENT_NAME | FIRST_NAME | LAST_NAME | START_DATE | END_DATE | JOB_TITLE | MONTHS_WORKING |
| --- | --- | --- | --- | --- | --- | --- |
| Sales | Syed | Abbas | 2013-03-14 | NULL | Pacific Sales Manager | 106 |
| Production | Kim | Abercrombie | 2010-01-16 | NULL | Production Technician - WC60 | 144 |
| Quality Assurance | Hazem | Abolrous | 2009-02-28 | NULL | Quality Assurance Manager | 155 |
| Shipping and Receiving | Pilar | Ackerman | 2009-01-02 | NULL | Shipping and Receiving Supervisor | 156 |
| Production | Jay | Adams | 2009-03-05 | NULL | Production Technician - WC60 | 154 |
| Information Services | François | Ajenstat | 2009-01-17 | NULL | Database Administrator | 156 |
| Sales | Amy | Alberts | 2012-04-16 | NULL | European Sales Manager | 117 |
| Production | Greg | Alderson | 2008-12-02 | NULL | Production Technician - WC45 | 157 |
| Quality Assurance | Sean | Alexander | 2008-12-28 | NULL | Quality Assurance Technician | 157 |
| Facilities and Maintenance | Gary | Altman | 2009-12-02 | NULL | Facilities Manager | 145 |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION get_employees_history ()
RETURNS EMPLOYEE_HISTORY TABLE (
 department_name VARCHAR(50),
 first_name VARCHAR(50),
 last_name VARCHAR(50),
 start_date DATE,
 end_date DATE,
 job_title VARCHAR(50),
 months_working INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
BEGIN
 INSERT INTO @employee_history
 SELECT D.name AS department_name, P.first_name, P.last_name, EH.start_date, EH.end_date, E.job_title, 0 FROM Department D
 LEFT OUTER JOIN employee_department_history EH
  ON D.department_ID = EH.department_ID
 INNER JOIN  Employee E
  ON E.business_entity_ID = EH.business_entity_ID
 INNER JOIN Person P
  ON P.business_entity_ID = E.business_entity_ID

 UPDATE @employee_history
 SET
  months_working =
  CASE WHEN end_date IS NULL THEN DATEDIFF(MONTH, start_date, GETDATE())
  ELSE DATEDIFF(MONTH, start_date, end_date)
 END
 RETURN;
END;

SELECT TOP(10) * FROM get_employees_history();;
```

##### Result

| DEPARTMENT_NAME | FIRST_NAME | LAST_NAME | START_DATE | END_DATE | JOB_TITLE | MONTHS_WORKING |
| --- | --- | --- | --- | --- | --- | --- |
| Sales | Syed | Abbas | 2013-03-14 | NULL | Pacific Sales Manager | 106 |
| Production | Kim | Abercrombie | 2010-01-16 | NULL | Production Technician - WC60 | 144 |
| Quality Assurance | Hazem | Abolrous | 2009-02-28 | NULL | Quality Assurance Manager | 155 |
| Shipping and Receiving | Pilar | Ackerman | 2009-01-02 | NULL | Shipping and Receiving Supervisor | 156 |
| Production | Jay | Adams | 2009-03-05 | NULL | Production Technician - WC60 | 154 |
| Information Services | François | Ajenstat | 2009-01-17 | NULL | Database Administrator | 156 |
| Sales | Amy | Alberts | 2012-04-16 | NULL | European Sales Manager | 117 |
| Production | Greg | Alderson | 2008-12-02 | NULL | Production Technician - WC45 | 157 |
| Quality Assurance | Sean | Alexander | 2008-12-28 | NULL | Quality Assurance Technician | 157 |
| Facilities and Maintenance | Gary | Altman | 2009-12-02 | NULL | Facilities Manager | 145 |

#### Multiple return clauses

In the following sample there is more than one return clause, this is because depending on the situation it is not necessary to keep executing the whole function.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTIONcreate_new_team(@team_name VARCHAR(50))
</strong>RETURNS @new_team TABLE (type VARCHAR(50), name VARCHAR(50))
AS
BEGIN
 DECLARE @employees INT
 SET @employees = (SELECT count(*) FROM employee)
 DECLARE @type VARCHAR(15)
 SET @type = 'small_team'
 IF (@employees &#x3C; 8)
 BEGIN
  INSERT @new_team VALUES (@type, @team_name)
  RETURN
 END

 SET @type = 'big_team'
 INSERT @new_team VALUES (@type, @team_name)

 RETURN
END

SELECT * FROMcreate_new_team('Team1');
```

##### Result

| TYPE | NAME |
| --- | --- |
| SMALL_TEAM | TEAM1 |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTIONcreate_new_team (TEAM_NAME STRING)
RETURNS NEW_TEAM TABLE (
 type VARCHAR(50),
 name VARCHAR(50))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 DECLARE @employees INT
 SET @employees = (SELECT count(*) FROM employee)
 DECLARE @type VARCHAR(15)
 SET @type = 'small_team'
 IF (@employees < 8)
 BEGIN
  INSERT @new_team VALUES (@type, @team_name)
  RETURN
 END

 SET @type = 'big_team'
 INSERT @new_team VALUES (@type, @team_name)

 RETURN
END

SELECT * FROMcreate_new_team('Team1');;
```

##### Result

| TYPE | NAME |
| --- | --- |
| SMALL_TEAM | TEAM1 |

> **Warning:**
>
> This transformation is applied when there is only one value to insert, if there is more than one value it is necessary to use a stored procedure.

#### Complex cases

The example is a complex case that uses nested `if` statements and inserts a value depending on the true condition.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR ALTER FUNCTION vacation_status(@id VARCHAR (50))
RETURNS @status TABLE (vacation_status VARCHAR(30))
AS
BEGIN
 DECLARE @hire_date DATETIME
 SET @hire_date = (SELECT @hire_date FROM employee WHERE employeeId = @id)
 DECLARE @vacation_hours INT
 SET @vacation_hours = (SELECT count(vacation_hours) FROM employee WHERE employeeId = @id)
 DECLARE @time_working INT
 SET @time_working = (SELECT DATEDIFF(MONTH, @hire_date,GETDATE()))

 IF (@vacation_hours > 0)
 BEGIN
  IF (@time_working > 3)
  BEGIN
   IF (@vacation_hours < 120)
   BEGIN
    INSERT INTO @status VALUES ('Ok')
   END

   IF (@vacation_hours = 120)
   BEGIN
    INSERT INTO @status values ('In the limit')
   END

   IF (@vacation_hours > 120)
   BEGIN
    INSERT INTO @status VALUES ('With excess')
   END
  END
  ELSE
  BEGIN
   INSERT INTO @status values ('Hired recently')
  END
 END
 ELSE
 BEGIN
  INSERT INTO @status values ('No hours')
 END
 RETURN
END

SELECT * FROM vacation_status('adventure-worksken0')
```

##### Result

| VACATION_STATUS |
| --- |
| OK |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION vacation_status (ID STRING)
RETURNS STATUS TABLE (
 vacation_status VARCHAR(30))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 DECLARE @hire_date DATETIME
 SET @hire_date = (SELECT @hire_date FROM employee WHERE employeeId = @id)
 DECLARE @vacation_hours INT
 SET @vacation_hours = (SELECT count(vacation_hours) FROM employee WHERE employeeId = @id)
 DECLARE @time_working INT
 SET @time_working = (SELECT DATEDIFF(MONTH, @hire_date,GETDATE()))

 IF (@vacation_hours > 0)
 BEGIN
  IF (@time_working > 3)
  BEGIN
   IF (@vacation_hours < 120)
   BEGIN
    INSERT INTO @status VALUES ('Ok')
   END

   IF (@vacation_hours = 120)
   BEGIN
    INSERT INTO @status values ('In the limit')
   END

   IF (@vacation_hours > 120)
   BEGIN
    INSERT INTO @status VALUES ('With excess')
   END
  END
  ELSE
  BEGIN
   INSERT INTO @status values ('Hired recently')
  END
 END
 ELSE
 BEGIN
  INSERT INTO @status values ('No hours')
 END
 RETURN
END

SELECT * FROM vacation_status('adventure-worksken0');
```

##### Second Tab

| VACATION_STATUS |
| --- |
| OK |

### Known Issues

#### While statements along side queries

The problem with this example is that there’s no way of transforming the while statement to a CTE inside the `WITH` clause of the main select, this forces us to transform this statement to store procedure to maintain the same logic.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
--Additional Params: -t JavaScript
CREATE OR ALTER FUNCTION get_group_name
(@department_id INT)
RETURNS @group_names TABLE (group_name VARCHAR(15))
AS
BEGIN
DECLARE @name VARCHAR(30) = 'Another Department'
WHILE @name = 'Another Department'
BEGIN
 IF (@department_id &#x3C; 3)
 BEGIN
  SET @name = 'engineering'
 END

 IF @department_id = 3
 BEGIN
  SET @name = 'Tool Design'
 END

 SELECT @department_id = @department_id / 3
END
INSERT @group_names SELECT @name
RETURN
END

SELECT * FROM get_group_name(9);
```

##### Result

| GROUP_NAME |
| --- |
| Tool Design |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!
CREATE OR ALTER FUNCTION get_group_name
(DEPARTMENT_ID INT)
RETURNS @group_names TABLE (
 group_name VARCHAR(15))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
DECLARE @name VARCHAR(30) = 'Another Department'
WHILE @name = 'Another Department'
BEGIN
 IF (@department_id < 3)
 BEGIN
  SET @name = 'engineering'
 END

 IF @department_id = 3
 BEGIN
  SET @name = 'Tool Design'
 END

 SELECT @department_id = @department_id / 3
END
INSERT @group_names SELECT @name
RETURN
END

SELECT * FROM get_group_name(9);;
```

##### Result

| GROUP_NAME |
| --- |
| Tool Design |

#### Declare Cursor

User-defined functions cannot DECLARE, OPEN, FETCH, CLOSE or DEALLOCATE a `CURSOR`. Use a Stored Procedure to work with cursors.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
 --Additional Params: -t JavaScript

CREATE OR ALTER FUNCTION amount_new_specimens(@id int)
RETURNS @new_specimens TABLE (amount int)
AS
BEGIN
 DECLARE @first_specimen VARCHAR(30) ;
 set @first_specimen = (select name_specimen from specimen where specimen_id = @id);
 DECLARE @second_specimen VARCHAR(30);

 DECLARE @specimens TABLE (name_specimen VARCHAR(30))

 DECLARE Cursor1 CURSOR
 FOR SELECT name_specimen
 FROM specimen

 OPEN cursor1
 FETCH NEXT FROM cursor1
 INTO @second_specimen;

 WHILE @@FETCH_STATUS = 0
 BEGIN
  IF @first_specimen <> @second_specimen
  BEGIN
   INSERT INTO @specimens values (CONCAT_WS('-', @first_specimen, @second_specimen))
  END
  FETCH NEXT FROM cursor1
  INTO @second_specimen;
 END

 CLOSE cursor1;
 DEALLOCATE cursor1;

 INSERT INTO @new_specimens SELECT COUNT(*) FROM @specimens
 RETURN
END

SELECT * FROM amount_new_specimens(1);
```

##### Result

| AMOUNT |
| --- |
| 3 |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
 --Additional Params: -t JavaScript
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'TABLE VALUED FUNCTIONS' NODE ***/!!!

CREATE OR ALTER FUNCTION amount_new_specimens (ID INT)
RETURNS @new_specimens TABLE (
 amount INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
BEGIN
 DECLARE @first_specimen VARCHAR(30) ;
 set @first_specimen = (select name_specimen from specimen where specimen_id = @id);
 DECLARE @second_specimen VARCHAR(30);

 DECLARE @specimens TABLE (name_specimen VARCHAR(30))

 DECLARE Cursor1 CURSOR
 FOR SELECT name_specimen
 FROM specimen

 OPEN cursor1
 FETCH NEXT FROM cursor1
 INTO @second_specimen;

 WHILE @@FETCH_STATUS = 0
 BEGIN
  IF @first_specimen <> @second_specimen
  BEGIN
   INSERT INTO @specimens values (CONCAT_WS('-', @first_specimen, @second_specimen))
  END
  FETCH NEXT FROM cursor1
  INTO @second_specimen;
 END

 CLOSE cursor1;
 DEALLOCATE cursor1;

 INSERT INTO @new_specimens SELECT COUNT(*) FROM @specimens
 RETURN
END

SELECT * FROM amount_new_specimens(1);;
```

##### Result

| AMOUNT |
| --- |
| 3 |

#### Different statements are not supported in Common Tables Expressions

The clauses `UPDATE`, `INSERT`, `DELETE`, `ALTER` or `DROP` are not supported on the body of common tables expressions, even after their declaration using a delimitator. For this reason, the function can be modified to work as a stored procedure.

##### Transact-SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
 --Additional Params: -t JavaScript

CREATE OR ALTER PROCEDURE product_history
AS
BEGIN
 DECLARE @product_history TABLE (
  product_name NVARCHAR(50),
  rating INT
 )
 INSERT INTO @product_history
 SELECT P.Name AS product_name, AVG(ALL R.rating) FROM Production.product P
 INNER JOIN  Production.product_review R
  ON R.product_ID = P.product_ID
 GROUP BY P.Name;

 DELETE FROM @product_history
 WHERE rating < 2;

 SELECT * FROM @product_history;

END
GO;

EXEC product_history
```

##### Result

| PRODUCT_NAME | Rating |
| --- | --- |
| HL Mountain Pedal | 3 |
| Mountain Bike Socks, M | 5 |
| Road-550-W Yellow, 40 | 5 |

##### Snowflake SQL

##### MULTI-STATEMENT TABLE-VALUED

```sql
CREATE OR REPLACE PROCEDURE product_history ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
 // REGION SnowConvert AI Helpers Code
 var _RS, ROW_COUNT, _ROWS, MESSAGE_TEXT, SQLCODE = 0, SQLSTATE = '00000', OBJECT_SCHEMA_NAME  = 'UNKNOWN', ERROR_HANDLERS, NUM_ROWS_AFFECTED, PROC_NAME = arguments.callee.name, DOLLAR_DOLLAR = '$' + '$';
 function* sqlsplit(sql) {
  var part = '';
  var ismark = () => sql[i] == '$' && sql[i + 1] == '$';
  for(var i = 0;i < sql.length;i++) {
   if (sql[i] == ';') {
    yield part + sql[i];
    part = '';
   } else if (ismark()) {
    part += sql[i++] + sql[i++];
    while ( i < sql.length && !ismark() ) {
     part += sql[i++];
    }
    part += sql[i] + sql[i++];
   } else part += sql[i];
  }
  if (part.trim().length) yield part;
 };
 var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
 var fixBind = function (arg) {
  arg = arg == undefined ? null : arg instanceof Date ? formatDate(arg) : arg;
  return arg;
 };
 var EXEC = (stmt,binds = [],severity = "16",noCatch = false) => {
  binds = binds ? binds.map(fixBind) : binds;
  for(var stmt of sqlsplit(stmt)) {
   try {
    _RS = snowflake.createStatement({
      sqlText : stmt,
      binds : binds
     });
    _ROWS = _RS.execute();
    ROW_COUNT = _RS.getRowCount();
    NUM_ROWS_AFFECTED = _RS.getNumRowsAffected();
    return {
     THEN : (action) => !SQLCODE && action(fetch(_ROWS))
    };
   } catch(error) {
    let rStack = new RegExp('At .*, line (\\d+) position (\\d+)');
    let stackLine = error.stackTraceTxt.match(rStack) || [0,-1];
    MESSAGE_TEXT = error.message.toString();
    SQLCODE = error.code.toString();
    SQLSTATE = error.state.toString();
    snowflake.execute({
     sqlText : `SELECT UPDATE_ERROR_VARS_UDF(?,?,?,?,?,?)`,
     binds : [stackLine[1],SQLCODE,SQLSTATE,MESSAGE_TEXT,PROC_NAME,severity]
    });
    throw error;
   }
  }
 };
 // END REGION

  EXEC(`CREATE OR REPLACE TEMPORARY TABLE T_product_history (
   product_name VARCHAR(50),
   rating INT
)`);
 EXEC(` INSERT INTO T_product_history
 SELECT
    P.Name AS product_name,
    AVG(ALL R.rating) FROM
    Production.product P
    INNER JOIN
       Production.product_review R
       ON R.product_ID = P.product_ID
 GROUP BY
    P.Name`);
 EXEC(`DELETE FROM
   T_product_history
WHERE
   rating < 2`);
 EXEC(`
 SELECT
    *
 FROM
    T_product_history`);
$$;
;

CALL product_history();
```

##### Result

| PRODUCT_NAME | Rating |
| --- | --- |
| HL Mountain Pedal | 3 |
| Mountain Bike Socks, M | 5 |
| Road-550-W Yellow, 40 | 5 |

### Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review

## SCALAR

Translation reference to convert Transact-SQL UDF (User Defined Functions) with scalar return type to Snowflake.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> A scalar user-defined function is a Transact-SQL or common language runtime (CLR) routine that accepts parameters, performs an action, such as a complex calculation, and returns the result of that action as a scalar value. ([SQL Server Language ReferenceCREATE FUNCTION subsection](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-function-transact-sql?view=sql-server-ver15)).

> **Note:**
>
> These functions are usually used inside the `SELECT`statement, or single variable setup (most likely inside a stored procedure).

#### Transact-SQL Syntax

```sql
 -- Transact-SQL Scalar Function Syntax
CREATE [ OR ALTER ] FUNCTION [ schema_name. ] function_name
( [ { @parameter_name [ AS ][ type_schema_name. ] parameter_data_type
 [ = default ] [ READONLY ] }
    [ ,...n ]
  ]
)
RETURNS return_data_type
    [ WITH <function_option> [ ,...n ] ]
    [ AS ]
    BEGIN
        function_body
        RETURN scalar_expression
    END
[ ; ]
```

#### Snowflake Syntax

Snowflake allows 3 different languages in their user defined functions:

* SQL
* JavaScript
* Java

For now, SnowConvert AI will support only `SQL` and `JavaScript` as target languages.

##### SQL

> **Note:**
>
> SQL user defined functions only supports one query as their body. They can read from the database, but is not allowed to write or modify it. ([Scalar SQL UDFs Reference](https://docs.snowflake.com/en/developer-guide/udf/sql/udf-sql-scalar-functions.html)).

```sql
CREATE [ OR REPLACE ] [ SECURE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

##### JavaScript

> **Note:**
>
> JavaScript user defined functions allows multiple statements in their bodies, but cannot perform queries to the database. (Scalar JavaScript UDFs Reference)

```sql
CREATE [ OR REPLACE ] [ SECURE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
  RETURNS { <result_data_type> | TABLE ( <col_name> <col_data_type> [ , ... ] ) }
  [ [ NOT ] NULL ]
  LANGUAGE JAVASCRIPT
  [ { CALLED ON NULL INPUT | { RETURNS NULL ON NULL INPUT | STRICT } } ]
  [ VOLATILE | IMMUTABLE ]
  [ COMMENT = '<string_literal>' ]
  AS '<function_definition>'
```

### Sample Source Patterns

#### Set and Declare Statements

The most common statements in function bodies are the `DECLARE` and `SET` statements. For `DECLARE` statements without default value, the transformation will be ignored. `SET` statements and `DECLARE` statements with a default value, will be transformed to a `COMMON TABLE EXPRESSION.` Each common table expression will contain a column that represents the local variable value.

##### Transact-SQL

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.GetVendorName()
RETURNS NVARCHAR(50) AS
BEGIN
 DECLARE @result NVARCHAR(50)
 DECLARE @BUSINESSENTITYID INT

 SET @BUSINESSENTITYID = 1492

 SELECT @result = Name FROM PURCHASING.VENDOR WHERE BUSINESSENTITYID = @BUSINESSENTITYID

 RETURN @result
END

GO

SELECT PURCHASING.GetVendorName() as vendor_name;
```

##### Result

| vendor_name |
| --- |
| Australia Bike Retailer |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.GetVendorName ()
RETURNS VARCHAR(50)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (
  SELECT
   1492 AS BUSINESSENTITYID
 ),
 CTE2 AS
 (
  SELECT
   Name AS RESULT
  FROM
   PURCHASING.VENDOR
  WHERE
   BUSINESSENTITYID = (
    SELECT
     BUSINESSENTITYID
    FROM
     CTE1
   )
 )
 SELECT
  RESULT
 FROM
  CTE2
$$;

SELECT
 PURCHASING.GetVendorName() as vendor_name;
```

##### Result

| VENDOR_NAME |
| --- |
| Australia Bike Retailer |

#### If/Else Statement Transformation

If/Else statement can be handled in different ways, they can be either transformed to javascript or to SQL using the [CASE EXPRESSION](https://docs.snowflake.com/en/sql-reference/functions/case.html) inside the select allowing conditionals inside the queries, while the javascript transformation is pretty straightforward, the Case statement might not be so obvious at first glance.

##### Transact-SQL

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.HasActiveFlag(@BusinessEntityID int)
RETURNS VARCHAR(10) AS
BEGIN
 DECLARE @result VARCHAR(10)
 DECLARE @ActiveFlag BIT

 SELECT @ActiveFlag = ActiveFlag from PURCHASING.VENDOR v where v.BUSINESSENTITYID = @BusinessEntityID

 IF @ActiveFlag = 1
  SET @result = 'YES'
 ELSE IF @ActiveFlag = 0
  SET @result = 'NO'

 RETURN @result
END

GO

SELECT PURCHASING.HasActiveFlag(1516) as has_active_flag;
```

##### Result

| has_active_flag |
| --- |
| NO |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.HasActiveFlag (P_BUSINESSENTITYID INT)
RETURNS VARCHAR(10)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (

  SELECT
   ActiveFlag AS ACTIVEFLAG
  from
   PURCHASING.VENDOR v
  where
   v.BUSINESSENTITYID = P_BUSINESSENTITYID
 ),
 CTE2 AS
 (
  SELECT
   CASE
    WHEN (
     SELECT
      ACTIVEFLAG
     FROM
      CTE1
    ) = 1
     THEN 'YES'
    WHEN (
     SELECT
      ACTIVEFLAG
     FROM
      CTE1
    ) = 0
     THEN 'NO'
   END AS RESULT
 )
 SELECT
  RESULT
 FROM
  CTE2
$$;

SELECT
 PURCHASING.HasActiveFlag(1516) as has_active_flag;
```

##### Result

| HAS_ACTIVE_FLAG |
| --- |
| NO |

#### Nested Statements

For nested statements, the structured programming is being transformed to a single query. The statements in the control-of-flow are going to be nested in table structures to preserve the execution order.

> **Note:**
>
> `CASE EXPRESSIONS` only can return one value per statement

##### Example

> **Note:**
>
> The following code in both programming paradigms is functionally equivalent.

##### Structured Programming

```sql
 DECLARE @VendorId AS int;
DECLARE @AccountNumber AS VARCHAR(50);
SELECT @VendorId = poh.VendorID
    FROM Purchasing.PurchaseOrderHeader poh
    WHERE PurchaseOrderID = 1
SELECT @AccountNumber = v.AccountNumber
    FROM Purchasing.Vendor v
    WHERE v.BusinessEntityID = @VendorId
```

##### SQL

```sql
 SELECT V.AccountNumber AccountNumber
FROM (SELECT poh.VendorID VendorId
         FROM Purchasing.PurchaseOrderHeader poh
         WHERE PurchaseOrderID = 1
) T1, Purchasing.Vendor v
WHERE v.BusinessEntityID = T1.VendorId
```

##### Result

| AccountNumber |
| --- |
| LITWARE0001 |

#### Conditional variables through SELECTs

Variable definition and assignment within conditional statements tends to be somewhat problematic, because references to the variable further down the code would have to know where the variable was last modified. Not only that, but if the reference is within another conditional statement, then there would have to be some kind of redirect that references the previous known assignment to the variable.

This is all aggravated by nesting and complex querying that can be found on input code. That’s why a specific [EWI](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) is added when these patterns are found.

In the following scenario, the first `IF` statement can be transformed without problems, because the contents are straightforward enough. The second and third `IF` statements are commented out because they’re not supported at the moment, since there are statements other than variable assignments through `SELECT`.

##### SQL Server

##### Query

```sql
CREATE or ALTER FUNCTION PURCHASING.SELECTINUDF (
    @param1 varchar(12)
)
RETURNS int
AS
BEGIN
    declare @var1 int;
    declare @var2 int;
    declare @var3 int;

    IF @param1 = 'first'
    BEGIN
        select @var1 = col1 + 10 from table1 WHERE id = 0;
        select @var2 = col1 + 20 from table1 WHERE id = 0;
        select @var3 = col1 + 30 from table1 WHERE id = 0;
    END

    IF @param1 = 'second'
    BEGIN
        declare @var4 int = 10;
        select @var1 = col1 + 40 from table1 WHERE id = 0;
        select @var2 = col1 + 40 from table1 WHERE id = 0;
    END

    IF @param1 = 'third'
    BEGIN
        select col1 from table1 where id = 0;
        select @var1 = col1 + 50 from table1 WHERE id = 0;
        select @var2 = col1 + 50 from table1 WHERE id = 0;
    END

    RETURN @var1
END

SELECT PURCHASING.SELECTINUDF('first') as result; -- Assuming table1.col1 is 0 when ID = 0
```

##### Result

| RESULT |
| --- |
| 10 |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.SELECTINUDF (PARAM1 STRING)
RETURNS INT
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
    WITH CTE1 AS
    (
        SELECT
            CASE
                WHEN PARAM1 = 'first'
                    THEN (SELECT
                        col1 + 10 AS VAR1 from
                        table1
                        WHERE
                        id = 0)
            END AS VAR1,
            CASE
                WHEN PARAM1 = 'first'
                        THEN (SELECT
                        col1 + 20 AS VAR2 from
                        table1
                        WHERE
                        id = 0)
            END AS VAR2,
            CASE
                WHEN PARAM1 = 'first'
                        THEN (SELECT
                        col1 + 30 AS VAR3 from
                        table1
                        WHERE
                        id = 0)
            END AS VAR3
    ),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'IF STATEMENT' NODE ***/!!!
    CTE2 AS
    (
        /*    IF @param1 = 'second'
            BEGIN
                declare @var4 int = 10;
                select @var1 = col1 + 40 from table1 WHERE id = 0;
                select @var2 = col1 + 40 from table1 WHERE id = 0;
            END*/
        SELECT
            null
    ),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'IF STATEMENT' NODE ***/!!!
    CTE3 AS
    (
        /*    IF @param1 = 'third'
            BEGIN
                select col1 from table1 where id = 0;
                select @var1 = col1 + 50 from table1 WHERE id = 0;
                select @var2 = col1 + 50 from table1 WHERE id = 0;
            END*/
        SELECT
            null
    ),
    CTE4 AS
    (

        SELECT
            PURCHASING.SELECTINUDF('first') as result
    )
    SELECT
        VAR1
    FROM
        CTE4
$$ -- Assuming table1.col1 is 0 when ID = 0
;
```

##### Result

| RESULT |
| --- |
| 10 |

#### Assign and return a variable

In this simple pattern, there is a variable declaration, then, that variable is set using a `SELECT` statement and finally returned. This is going to be migrated to a [Common Table Expression](https://docs.snowflake.com/en/sql-reference/constructs/with.html) to keep the original behavior.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION Purchasing.GetTotalFreight()
RETURNS MONEY AS
BEGIN
 DECLARE @Result MONEY
 SELECT @Result = ISNULL(SUM(t.Freight), 0) from Purchasing.PurchaseOrderHeader t
 return @Result
END

GO

select Purchasing.GetTotalFreight() as Result;
```

##### Result

| Result |
| --- |
| 1583978.2263 |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION Purchasing.GetTotalFreight ()
RETURNS NUMBER(38, 4)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (
  SELECT
   NVL(SUM(t.Freight), 0) AS RESULT from
   Purchasing.PurchaseOrderHeader t
 )
 SELECT
  RESULT
 FROM
  CTE1
$$;

select
 Purchasing.GetTotalFreight() as Result;
```

##### Result

| RESULT |
| --- |
| 1583978.2263 |

#### Multiple Function Calls

For this specific pattern there are no obvious queries, but there are multiple calls to multiple functions working on the same variable and returning it at the end. Since Snowflake only supports queries inside its functions, the solution for this block is going to be adding it to a Select and nesting the calls inside, making sure the return value is the same as the one on the source.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.Foo
(
 @PARAM1 INT
)
RETURNS varchar(25)
AS
BEGIN
 DECLARE @filter INT = @PARAM1
 DECLARE @NAME VARCHAR(25) = (SELECT Name from Purchasing.Vendor v where BusinessEntityID = @filter)
 SET @NAME = REPLACE(@NAME, 'Australia', 'USA')
 SET @NAME = REPLACE(@NAME, 'Bike', 'Car')
 RETURN @NAME
END

GO

SELECT PURCHASING.Foo(1492) AS Name;
```

##### Result

| Name |
| --- |
| USA Car Retailer |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.Foo (PARAM1 INT)
RETURNS VARCHAR(25)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (
  SELECT
   PARAM1 AS FILTER
 ),
 CTE2 AS
 (
  SELECT
   (SELECT
     Name
    from
     Purchasing.Vendor v
    where
     BusinessEntityID = (
      SELECT
       FILTER
      FROM
       CTE1
     )
   ) AS NAME
 ),
 CTE3 AS
 (
  SELECT
   REPLACE((
    SELECT
     NAME
    FROM
     CTE3
   ), 'Australia', 'USA') AS NAME
 ),
 CTE4 AS
 (
  SELECT
   REPLACE((
    SELECT
     NAME
    FROM
     CTE4
   ), 'Bike', 'Car') AS NAME
 )
 SELECT
  NAME
 FROM
  CTE4
$$;

SELECT
 PURCHASING.Foo(1492) AS Name;
```

##### Result

| NAME |
| --- |
| USA Car Retailer |

#### Increase a variable based on multiple IF conditions and return its value

For this pattern, a variable is modified (increased in this case) using multiple IF conditions. In the beginning, a set of variables is initialized and used to determine whether the result variable should be increased or not. Finally, the result variable is returned.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS MONEY
AS
BEGIN
 declare @firstValue MONEY
 declare @secondValue MONEY
 declare @Result MONEY
 select  @Result = 0
 select  @firstValue = SubTotal from Purchasing.PurchaseOrderHeader where PurchaseOrderID = 1
 select  @secondValue = SubTotal from Purchasing.PurchaseOrderHeader where PurchaseOrderID = 2
 if @firstValue is not null
  select @Result = @Result + @firstValue
 if @secondValue is not null
  select @Result = @Result + @secondValue
 return @Result
END

GO

SELECT PURCHASING.Foo() AS Result;
```

##### Result

| Result |
| --- |
| 473.1415 |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.FOO ()
RETURNS NUMBER(38, 4)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (
  select
   0 AS RESULT
 ),
 CTE2 AS
 (
  select
   SubTotal AS FIRSTVALUE
  from
   Purchasing.PurchaseOrderHeader
  where
   PurchaseOrderID = 1
 ),
 CTE3 AS
 (
  select
   SubTotal AS SECONDVALUE
  from
   Purchasing.PurchaseOrderHeader
  where
   PurchaseOrderID = 2
 ),
 CTE4 AS
 (
  SELECT
   CASE
    WHEN (
     SELECT
      FIRSTVALUE
     FROM
      CTE2
    ) is not null
     THEN (
     select
      (
       SELECT
        RESULT
       FROM
        CTE1
      ) + (
       SELECT
        FIRSTVALUE
       FROM
        CTE2
      ) AS RESULT)
   END AS RESULT
 ),
 CTE5 AS
 (
  SELECT
   CASE
    WHEN (
     SELECT
      SECONDVALUE
     FROM
      CTE3
    ) is not null
     THEN (
     select
      (
       SELECT
        RESULT
       FROM
        CTE1
      ) + (
       SELECT
        SECONDVALUE
       FROM
        CTE3
      ) AS RESULT)
    ELSE (SELECT
     RESULT
    FROM
     CTE4)
   END AS RESULT
 )
 SELECT
  RESULT
 FROM
  CTE5
$$;

SELECT
 PURCHASING.Foo() AS Result;
```

##### Result

| RESULT |
| --- |
| 473.1415 |

#### Two or more RETURN statements

For this pattern, the `IF` block containing the return clause that breaks the code flow is added at the end of the body, like the final statement to be executed in a `CASE` expression.

##### Basic Case

For this particular scenario, there is no logic between the conditional `RETURN` statement and the final `RETURN` statement, so all body will be mapped to a single `CASE EXPRESSION`.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION [PURCHASING].[FOO] ()
RETURNS INT
AS
BEGIN
 IF exists (SELECT PreferredVendorStatus FROM Purchasing.Vendor v )
  RETURN 1

 RETURN 0
END

GO

SELECT PURCHASING.FOO() as result;
```

##### Result

| result |
| --- |
| 1 |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.FOO ()
RETURNS INT
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 SELECT
  CASE
   WHEN exists (SELECT
     PreferredVendorStatus
    FROM
     Purchasing.Vendor v
   )
    THEN 1
   ELSE 0
  END
$$;

SELECT
 PURCHASING.FOO() as result;
```

##### Result

| RESULT |
| --- |
| 1 |

#### Common Table Expressions

Common table expressions will be kept as in the original code, and they are going to be concatenated with the generated ones. SnowConvert AI is able to identify first all the original `COMMON TABLE EXPRESSION` names to avoid generating duplicated names.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION [PURCHASING].[FOO]
(
 @status INT
)
Returns INT
As
Begin
 Declare @result as int = 0

 ;WITH ctetable(RevisionNumber) as
 (
  SELECT RevisionNumber
  FROM Purchasing.PurchaseOrderHeader poh
  where poh.Status = @status
 ),
 finalCte As
 (
  SELECT RevisionNumber FROM ctetable
 )

 Select @result = count(RevisionNumber) from finalCte
 return @result;
End

GO

SELECT PURCHASING.FOO(4) as result;
```

##### Result

| result |
| --- |
| 3689 |

##### Snowflake

##### Query

```sql
CREATE OR REPLACE FUNCTION PURCHASING.FOO (STATUS INT)
Returns INT
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
 WITH CTE1 AS
 (
  SELECT
   0 AS RESULT
 ),
 ctetable (
  RevisionNumber
 ) as
  (
   SELECT
   RevisionNumber
   FROM
   Purchasing.PurchaseOrderHeader poh
   where
   poh.Status = STATUS
  ),
  finalCte As
  (
   SELECT
   RevisionNumber
  FROM
   ctetable
  ),
  CTE2 AS
  (
  Select
   COUNT(RevisionNumber) AS RESULT from
   finalCte
  )
  SELECT
  RESULT
  FROM
  CTE2
$$;

SELECT
  PURCHASING.FOO(4) as result;
```

##### Result

| RESULT |
| --- |
| 3689 |

#### Transform to JavaScript UDFs

If there are multiple statements and the function does not access the database in any way, it can be transformed into a JavaScript function keeping the functional equivalence

##### SQL Server

##### Query 1

```sql
CREATE OR ALTER FUNCTION PURCHASING.GetFiscalYear
(
 @DATE AS DATETIME
)
RETURNS INT
AS
BEGIN
 DECLARE @FiscalYear AS INT
 DECLARE @CurMonth AS INT
 SET @CurMonth = DATEPART(M,@DATE)
 SET @FiscalYear = DATEPART(YYYY, @DATE)
 IF (@CurMonth >= 7)
 BEGIN
  SET @FiscalYear = @FiscalYear + 1
 END
 RETURN @FiscalYear
END

GO

SELECT PURCHASING.GetFiscalYear('2020-10-10') as DATE;
```

##### Query 2

```sql
CREATE OR ALTER FUNCTION PURCHASING.[getCleanChargeCode]
(
 @ChargeCode varchar(50)
)
returns varchar(50) as
begin
 declare @CleanChargeCode varchar(50),@Len int,@Pos int=2
 set @Pos=LEN(@ChargeCode)-1
 while @Pos > 1
 begin
  set @CleanChargeCode=RIGHT(@ChargeCode,@Pos)
  if TRY_CAST(@CleanChargeCode as bigint) is not null
   return @CleanChargeCode
  set @Pos=@Pos-1
 end
 set @Pos=LEN(@ChargeCode)-1
 while @Pos > 1
 begin
  set @CleanChargeCode=LEFT(@ChargeCode,@Pos)
  if TRY_CAST(@CleanChargeCode as bigint) is not null
   return @CleanChargeCode
  set @Pos=@Pos-1
 end
 return null
end

GO

SELECT PURCHASING.[getCleanChargeCode]('16test') AS CleanChargeCode;
```

##### Result 1

| DATE |
| --- |
| 2021 |

##### Result 2

| CleanChargeCode |
| --- |
| 16 |

##### Snowflake

##### Query 1

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE PURCHASING.GetFiscalYear (DATE TIMESTAMP_NTZ(3))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  FISCALYEAR INT;
  CURMONTH INT;
 BEGIN

  CURMONTH := DATE_PART(month, :DATE :: TIMESTAMP);
  FISCALYEAR := DATE_PART(year, :DATE :: TIMESTAMP);
  IF ((:CURMONTH >= 7)) THEN
   BEGIN
    FISCALYEAR := :FISCALYEAR + 1;
   END;
  END IF;
  RETURN :FISCALYEAR;
 END;
$$;

SELECT
 PURCHASING.GetFiscalYear('2020-10-10') !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! as DATE;
```

##### Query 2

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE PURCHASING.getCleanChargeCode (CHARGECODE STRING)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  CLEANCHARGECODE VARCHAR(50);
  LEN INT;
  POS INT := 2;
 BEGIN

  POS := LEN(:CHARGECODE)-1;
  WHILE (:POS > 1) LOOP
   CLEANCHARGECODE := RIGHT(:CHARGECODE, :POS);
   IF (CAST(:CLEANCHARGECODE AS BIGINT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/!!!RESOLVE EWI!!! /*** SSC-EWI-TS0074 - CAST RESULT MAY BE DIFFERENT FROM TRY_CAST FUNCTION DUE TO MISSING DEPENDENCIES ***/!!! is not null) THEN
    RETURN :CLEANCHARGECODE;
   END IF;
   POS := :POS -1;
  END LOOP;
  POS := LEN(:CHARGECODE)-1;
  WHILE (:POS > 1) LOOP
   CLEANCHARGECODE := LEFT(:CHARGECODE, :POS);
   IF (CAST(:CLEANCHARGECODE AS BIGINT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/!!!RESOLVE EWI!!! /*** SSC-EWI-TS0074 - CAST RESULT MAY BE DIFFERENT FROM TRY_CAST FUNCTION DUE TO MISSING DEPENDENCIES ***/!!! is not null) THEN
    RETURN :CLEANCHARGECODE;
   END IF;
   POS := :POS -1;
  END LOOP;
  RETURN null;
 END;
$$;

SELECT
 PURCHASING.getCleanChargeCode('16test') !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! AS CleanChargeCode;
```

##### Result 1

| DATE |
| --- |
| 2021.0 |

##### Result 2

| CLEANCHARGECODE |
| --- |
| 16 |

### Known Issues

> **Warning:**
>
> User-defined functions cannot be used to perform actions that modify the database state

> **Warning:**
>
> User-defined functions cannot contain an `OUTPUT INTO` clause that has a table as its target

> **Warning:**
>
> User-defined functions cannot DECLARE, OPEN, FETCH, CLOSE or DEALLOCATE a `CURSOR`. Use a Stored Procedure if you need to use cursors.

> **Warning:**
>
> User-defined functions cannot perform control-of-flow statements such as WHILE if there is at least one call to the database

> **Warning:**
>
> User-defined functions with references to other user-defined functions that were transformed to Stored Procedures, will be transformed to Stored Procedures too.

> **Warning:**
>
> User-defined functions that use [@@ROWCOUNT](https://docs.microsoft.com/en-us/sql/t-sql/functions/rowcount-transact-sql?view=sql-server-ver15) are not supported in SQL and should be transformed to stored procedures to keep the functional equivalence.

> **Warning:**
>
> User-defined functions that have `SELECT` statements assigning a variable to itself is not supported in Snowflake. See also [SELECT @local_variable](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/select-local-variable-transact-sql?view=sql-server-ver15)

For all the unsupported cases, please check the related EWIs and the patterns below to obtain recommendations and possible workarounds.

#### Conditionals other than if/else statements along side queries

The next scenario involves the use of the “while statement” along side other queries. The problem with this example is that there’s no way of transforming the while statement to a CTE inside the `WITH` clause of the main select, this forces us to transform this statement to JavaScript procedure to maintain the same logic.

##### SQL Server

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS INT
AS
BEGIN
    DECLARE @i int = 0, @p int;
    Select @p = COUNT(*) FROM PURCHASING.VENDOR

    WHILE (@p < 1000)
    BEGIN
        SET @i = @i + 1
        SET @p = @p + @i
    END

    IF (@i = 6)
        RETURN 1

    RETURN @p
END

GO

SELECT PURCHASING.FOO() as result;
```

##### Result

| result |
| --- |
| 1007 |

**Snowflake**

##### Query

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE PURCHASING.FOO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        I INT := 0;
        P INT;
    BEGIN

        Select
            COUNT(*)
        INTO
            :P
 FROM
            PURCHASING.VENDOR;
        WHILE (:P < 1000) LOOP
            I := :I + 1;
            P := :P + :I;
        END LOOP;
        IF ((:I = 6)) THEN
            RETURN 1;
        END IF;
        RETURN :P;
    END;
$$;

SELECT
    PURCHASING.FOO() !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! as result;
```

##### Result

| FOO |
| --- |
| 1007 |

#### Assign a variable using its own value iterating through a rowset

In the following example, the variable `@names` is used to concatenate multiple values from a column into one single string. The variable is updated on each iteration as shown, which is not supported by Snowflake UDFs. For this scenario, the function should be transformed into a *procedure*.

**SQL Server**

##### Query

```sql
CREATE OR ALTER FUNCTION PURCHASING.FOO()
RETURNS VARCHAR(8000)
AS
BEGIN
    DECLARE @names varchar(8000)
    SET @names = ''
    SELECT @names = ISNULL(@names + ' ', '') + Name from Purchasing.Vendor v
    return @names
END

GO

select PURCHASING.FOO() as names;
```

##### Result

| names |
| --- |
| Australia Bike Retailer Allenson Cycles Advanced Bicycles Trikes, Inc. Morgan Bike Accessories Cycling Master Chicago Rent-All Greenwood Athletic Company Compete Enterprises, Inc International Light Speed Training Systems Gardner Touring Cycles Internati |

**Snowflake query**

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
CREATE OR REPLACE PROCEDURE PURCHASING.FOO ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        NAMES VARCHAR(8000);
    BEGIN

        NAMES := '';
        SELECT
            NVL(:NAMES || ' ', '') + Name
        INTO
            :NAMES
        from
            Purchasing.Vendor v;
        RETURN :NAMES;
    END;
$$;

select
    PURCHASING.FOO() !!!RESOLVE EWI!!! /*** SSC-EWI-0067 - UDF WAS TRANSFORMED TO SNOWFLAKE PROCEDURE, CALLING PROCEDURES INSIDE QUERIES IS NOT SUPPORTED ***/!!! as names;
```

> **Warning:**
>
> For the described scenarios above, consider the following limitations:
>
> 1. All the calls to user-defined functions in DML queries such as `SELECT`, `INSERT`, `DELETE`, `UPDATE` or `MERGE` will fail because calls to Stored Procedures within these queries are not allowed.
> 2. Calls to user-defined functions inside procedures, should be preceeded by the `CALL` keyword.
> 3. User-defined functions used in [COMPUTED COLUMNS](https://docs.microsoft.com/en-us/sql/relational-databases/tables/specify-computed-columns-in-a-table?view=sql-server-ver15) will fail during the execution.

### Related EWIs

1. [SSC-EWI-0067](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): UDF was transformed to Snowflake procedure, calling procedures inside a query is not supported.
2. [SSC-EWI-0068](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): User defined function was transformed to a Snowflake procedure.
3. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Snowflake Script UDF (SCALAR)

Translation reference for SQL Server Scalar User Defined Functions to [Snowflake Scripting UDFs](../../../../developer-guide/udf/sql/udf-sql-procedural-functions.md)

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

SnowConvert supports translating SQL Server Scalar User Defined Functions directly to **Snowflake Scripting UDFs** (SnowScript UDFs) when they meet specific criteria, instead of converting all functions to Stored Procedures.

**Snowflake Scripting UDFs** are user-defined functions written using Snowflake’s procedural language syntax (Snowscript) within a SQL UDF body. They support variables, loops, conditional logic, and exception handling.

#### When Functions Become SnowScript UDFs

SnowConvert analyzes each SQL Server function and automatically determines the appropriate Snowflake target. A function becomes a SnowScript UDF when it contains **only** procedural logic without data access operations.

### Sample Source Patterns

#### Simple Calculation Function

A basic scalar function that performs calculations without querying data.

##### SQL Server

```sql
CREATE FUNCTION dbo.CalculateProfit
(
    @Cost DECIMAL(10,2),
    @Revenue DECIMAL(10,2)
)
RETURNS DECIMAL(10,2)
AS
BEGIN
    DECLARE @Profit DECIMAL(10,2)
    SET @Profit = @Revenue - @Cost
    RETURN @Profit
END
GO

SELECT dbo.CalculateProfit(100.00, 150.00) as Profit;
```

##### Result

| Profit |
| --- |
| 50.00 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.CalculateProfit (COST DECIMAL(10,2), REVENUE DECIMAL(10,2))
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "QsqZARsvG3aeleeXZB43fg==" }}'
AS
$$
   DECLARE
 PROFIT DECIMAL(10, 2);
   BEGIN

 PROFIT := :REVENUE - :COST;
 RETURN :PROFIT;
   END;
$$;

SELECT
   dbo.CalculateProfit(100.00, 150.00) as Profit;
```

##### Result

| PROFIT |
| --- |
| 50.00 |

#### Function with Conditional Logic (IF/ELSE)

Functions using IF/ELSE statements for business logic.

##### SQL Server

```sql
CREATE FUNCTION dbo.GetDiscountRate
(
    @CustomerType VARCHAR(20),
    @OrderAmount DECIMAL(10,2)
)
RETURNS DECIMAL(5,2)
AS
BEGIN
    DECLARE @Discount DECIMAL(5,2)

    IF @CustomerType = 'Premium'
        SET @Discount = 0.15
    ELSE IF @CustomerType = 'Standard'
        SET @Discount = 0.10
    ELSE
        SET @Discount = 0.05

    IF @OrderAmount > 1000
        SET @Discount = @Discount + 0.05

    RETURN @Discount
END
GO

SELECT dbo.GetDiscountRate('Premium', 1200.00) as DiscountRate;
```

##### Result

| DiscountRate |
| --- |
| 0.20 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.GetDiscountRate (CUSTOMERTYPE STRING, ORDERAMOUNT DECIMAL(10,2))
RETURNS DECIMAL(5, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "QsqZARsvG3aeleeXZB43fg==" }}'
AS
$$
   DECLARE
 DISCOUNT DECIMAL(5, 2);
   BEGIN

 IF (:CUSTOMERTYPE = 'Premium') THEN
 DISCOUNT := 0.15;
 ELSEIF (:CUSTOMERTYPE = 'Standard') THEN
 DISCOUNT := 0.10;
 ELSE
 DISCOUNT := 0.05;
 END IF;
 IF (:ORDERAMOUNT > 1000) THEN
 DISCOUNT := :DISCOUNT + 0.05;
 END IF;
 RETURN :DISCOUNT;
   END;
$$;

SELECT
   dbo.GetDiscountRate('Premium', 1200.00) as DiscountRate;
```

##### Result

| DISCOUNTRATE |
| --- |
| 0.20 |

#### Function with WHILE Loop

Functions using WHILE loops for iterative calculations.

##### SQL Server

```sql
CREATE FUNCTION dbo.Factorial
(
    @Number INT
)
RETURNS BIGINT
AS
BEGIN
    DECLARE @Result BIGINT = 1
    DECLARE @Counter INT = 1

    WHILE @Counter <= @Number
    BEGIN
        SET @Result = @Result * @Counter
        SET @Counter = @Counter + 1
    END

    RETURN @Result
END
GO

SELECT dbo.Factorial(5) as FactorialResult;
```

##### Result

| FactorialResult |
| --- |
| 120 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.Factorial (NUMBER INT)
RETURNS BIGINT
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "QsqZARsvG3aeleeXZB43fg==" }}'
AS
$$
  DECLARE
  RESULT BIGINT := 1;
  COUNTER INT := 1;
  BEGIN

    WHILE (:COUNTER <= :NUMBER) LOOP
      RESULT := :RESULT * :COUNTER;
      COUNTER := :COUNTER + 1;
    END LOOP;
    RETURN :RESULT;
  END;
$$;

SELECT
   dbo.Factorial(5) as FactorialResult;
```

##### Result

| FACTORIALRESULT |
| --- |
| 120 |

#### String Manipulation Function

Complex string operations using loops and conditional logic.

##### SQL Server

```sql
CREATE FUNCTION dbo.CleanPhoneNumber
(
    @Phone VARCHAR(20)
)
RETURNS VARCHAR(10)
AS
BEGIN
    DECLARE @Clean VARCHAR(10) = ''
    DECLARE @i INT = 1
    DECLARE @Char CHAR(1)

    WHILE @i <= LEN(@Phone)
    BEGIN
        SET @Char = SUBSTRING(@Phone, @i, 1)
        IF @Char BETWEEN '0' AND '9'
            SET @Clean = @Clean + @Char
        SET @i = @i + 1
    END

    RETURN @Clean
END
GO

SELECT dbo.CleanPhoneNumber('(555) 123-4567') as CleanPhone;
```

##### Result

| CleanPhone |
| --- |
| 5551234567 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.CleanPhoneNumber (PHONE STRING)
RETURNS VARCHAR(10)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "QsqZARsvG3aeleeXZB43fg==" }}'
AS
$$
   DECLARE
 CLEAN VARCHAR(10) := '';
 I INT := 1;
 CHAR CHAR(1);
   BEGIN

 WHILE (:I <= LEN(:PHONE)) LOOP
 CHAR := SUBSTRING(:PHONE, :I, 1);
 IF (:CHAR BETWEEN '0' AND '9') THEN
  CLEAN := :CLEAN + :CHAR;
 END IF;
 I := :I + 1;
 END LOOP;
 RETURN :CLEAN;
   END;
$$;

SELECT
   dbo.CleanPhoneNumber('(555) 123-4567') as CleanPhone;
```

##### Result

| CLEANPHONE |
| --- |
| 5551234567 |

#### CASE Statement Logic

Functions using CASE expressions for categorization.

##### SQL Server

```sql
CREATE FUNCTION dbo.GetGrade
(
    @Score INT
)
RETURNS CHAR(1)
AS
BEGIN
    DECLARE @Grade CHAR(1)

    SET @Grade = CASE
        WHEN @Score >= 90 THEN 'A'
        WHEN @Score >= 80 THEN 'B'
        WHEN @Score >= 70 THEN 'C'
        WHEN @Score >= 60 THEN 'D'
        ELSE 'F'
    END

    RETURN @Grade
END
GO

SELECT dbo.GetGrade(85) as Grade;
```

##### Result

| Grade |
| --- |
| B |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.GetGrade (SCORE INT)
RETURNS CHAR(1)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2025",  "domain": "no-domain-provided",  "migrationid": "QsqZARsvG3aeleeXZB43fg==" }}'
AS
$$
   DECLARE
 GRADE CHAR(1);
   BEGIN

 CASE
 WHEN :SCORE >= 90 THEN
  GRADE := 'A';
 WHEN :SCORE >= 80 THEN
  GRADE := 'B';
 WHEN :SCORE >= 70 THEN
  GRADE := 'C';
 WHEN :SCORE >= 60 THEN
  GRADE := 'D';
 ELSE
  GRADE := 'F';
 END;
 RETURN :GRADE;
   END;
$$;

SELECT
   dbo.GetGrade(85) as Grade;
```

##### Result

| GRADE |
| --- |
| B |

#### Select Into variable assignment

Functions using simple select into for variable assignment.

##### SQL Server

```sql
CREATE FUNCTION dbo.CalculatePrice
(
    @BasePrice DECIMAL(10, 2),
    @Quantity INT
)
RETURNS DECIMAL(10, 2)
AS
BEGIN
    DECLARE @Discount DECIMAL(5, 2);
    DECLARE @Subtotal DECIMAL(10, 2);
    DECLARE @FinalPrice DECIMAL(10, 2);

    SELECT @Discount = CASE
                           WHEN @Quantity >= 10 THEN 0.15
                           WHEN @Quantity >= 5 THEN 0.10
                           ELSE 0.05
                       END,
           @Subtotal = @BasePrice * @Quantity;

    SET @FinalPrice = @Subtotal * (1 - @Discount);

    RETURN @FinalPrice;
END;
```

##### Result

| CALCULATEPRICE(100, 3) |
| --- |
| 285 |

##### Snowflake (SnowScript UDF)

```sql
CREATE OR REPLACE FUNCTION dbo.CalculatePrice (BASEPRICE DECIMAL(10, 2), QUANTITY INT)
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/26/2025",  "domain": "no-domain-provided",  "migrationid": "T8GaASfFsHeOffK4v3SnIQ==" }}'
AS
$$
    DECLARE
        DISCOUNT DECIMAL(5, 2);
        SUBTOTAL DECIMAL(10, 2);
        FINALPRICE DECIMAL(10, 2);
    BEGIN

        DISCOUNT := CASE
                                      WHEN :QUANTITY >= 10 THEN 0.15
                                      WHEN :QUANTITY >= 5 THEN 0.10
                                      ELSE 0.05
                                  END;
        SUBTOTAL := :BASEPRICE * :QUANTITY;
        FINALPRICE := :SUBTOTAL * (1 - :DISCOUNT);
        RETURN :FINALPRICE;
    END;
$$;
```

##### Result

| CALCULATEPRICE(100, 3) |
| --- |
| 285 |

### Known Issues

> **Warning:**
>
> **SnowConvert AI will not translate UDFs containing the following elements into SnowScripting UDFs, as these features are unsupported in SnowScripting UDFs:**
>
> * Access database tables
> * Use cursors
> * Call other UDFs
> * Contain aggregate or window functions
> * Perform DML operations (INSERT/UPDATE/DELETE)
> * Return result sets

### Related EWIs

1. [SSC-EWI-0067](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): UDF was transformed to Snowflake procedure, calling procedures inside a query is not supported.
2. [SSC-EWI-0068](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): User defined function was transformed to a Snowflake procedure.
3. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - SQL Server Conversion Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/sql-server-conversion-settings.md
section: Migrations
---

# SnowConvert AI - SQL Server Conversion Settings

This topic applies to the following sources:

* SQL Server
* Azure Synapse Analytics

Before conversion, you use SnowConvert AI to extract database objects from your source system to prepare them for
the conversion process. For more information, see [SnowConvert AI: Data Extraction](../../../user-guide/extraction.md).

## General conversion settings

1. **Comment objects with missing dependencies:** Flag to indicate whether to comment on nodes that have missing dependencies.
2. **Set encoding of the input files:** Check [General Conversion Settings](general-conversion-settings.md) for more details.

> **Note:**
>
> To review the Settings that apply to all supported languages, go to the following [article](general-conversion-settings.md).

## DB objects names settings

1. **Schema:** The string value specifies the custom schema name to apply. If not specified, the original database name will be used. Example: DB1.**myCustomSchema**.Table1.
2. **Database:** The string value specifies the custom database name to apply. Example: **MyCustomDB**.PUBLIC.Table1.
3. **Default:** None of the above settings will be used in the objects names.

## Prepare Code Settings

### **Description**

**Prepare my code:** Flag to indicate whether the input code should be processed before parsing and transformation. This can be useful to improve the parsing process. By default, it’s set to FALSE.

Splits the input code top-level objects into multiple files. The containing folders would be organized as follows:

Copy

```none
└───A new folder named ''[input_folder_name]_Processed''
    └───Top-level object type
        └───Schema name
```

### **Example**

#### **Input**

```none
├───in
│       script_name.sql
```

#### **Output**

Assume that the name of the files is the name of the top-level objects in the input files.

```none
├───in_Processed
    ├───procedure
    │   └───dbo
    │           A_PROCEDURE.sql
    │           ANOTHER_PROCEDURE.sql
    │           YET_ANOTHER_PROCEDURE.sql
    │
    └───table
        └───dbo
                MY_TABLE.sql
                ADDITIONAL_TABLE.sql
                THIRD_TABLE.sql
```

### Requirements

We highly recommend using [SQL Server Management Studio (SSMS)](https://learn.microsoft.com/en-us/sql/ssms/download-sql-server-management-studio-ssms?view=sql-server-ver16) to obtain the script.

## Stored Procedures Target Languages Settings

On this page, you can choose whether stored procedures are migrated to JavaScript embedded in Snow SQL, or to Snowflake Scripting. The default option is Snowflake Scripting.

**Reset Settings:** The reset settings option appears on every page. If you’ve made changes, you can reset SnowConvert AI to its original default settings.

## **Next steps for SQL Server databases**

For SQL Server databases, you can use SnowConvert AI to complete the following tasks after conversion:

* [Deployment](../../../user-guide/deployment.md)
* [Data migration](../../../user-guide/data-migration.md)
* [Data validation](../../../user-guide/data-validation.md)

---
title: SnowConvert AI - SQL Server-Azure Synapse
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/README.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse

This page provides a comprehensive reference for how SnowConvert AI translates Transact grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - SQL Server-Azure Synapse - ALTER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-alter-statement.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - ALTER

Translation reference for all the DDL statements that are preceded by the `ALTER` keyword.

## TABLE

### Description

Modifies a table definition by altering, adding, or dropping columns and constraints. ALTER TABLE also reassigns and rebuilds partitions, or disables and enables constraints and triggers. (<https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-transact-sql>)

## DROP CONSTRAINT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

SnowConvert AI translates `ALTER TABLE ... DROP CONSTRAINT` statements to Snowflake, with the following adjustments:

* **IF EXISTS at the constraint level** is stripped because Snowflake does not support `IF EXISTS` on `DROP CONSTRAINT`. Instead, `IF EXISTS` is added at the table level (`ALTER TABLE IF EXISTS`).
* **WITH (index options)** such as `WITH ( ONLINE = OFF )` are removed because Snowflake does not support index options.

### Sample Source Patterns

#### Basic DROP CONSTRAINT

##### SQL Server

```sql
ALTER TABLE [dbo].[MyTable] DROP CONSTRAINT [MyPK];
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS dbo.MyTable DROP CONSTRAINT MyPK;
```

#### DROP CONSTRAINT and DROP COLUMN together

##### SQL Server

```sql
ALTER TABLE [dbo].[MyTable] DROP CONSTRAINT [MyPK];
ALTER TABLE [dbo].[MyTable] DROP COLUMN [MyPK];
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS dbo.MyTable DROP CONSTRAINT MyPK;

ALTER TABLE IF EXISTS dbo.MyTable DROP COLUMN IF EXISTS MyPK;
```

### Known Issues

**1. IF EXISTS on DROP CONSTRAINT is not supported in Snowflake**

Snowflake does not support `IF EXISTS` directly on `DROP CONSTRAINT`. SnowConvert AI strips it and adds `IF EXISTS` at the table level. If the constraint does not exist, the statement will fail.

**2. WITH (index options) are removed**

SQL Server-specific index options like `WITH ( ONLINE = OFF )` have no equivalent in Snowflake and are silently removed.

## CHECK CONSTRAINT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

SnowConvert AI comments out `ALTER TABLE ... CHECK CONSTRAINT ...` and `ALTER TABLE ... NOCHECK CONSTRAINT ...` statements because enabling or disabling constraints is not applicable in Snowflake.

This behavior applies to the `CHECK CONSTRAINT` action. It does not apply to unsupported `ADD CHECK (...)` constraint definitions, which continue to be flagged separately.

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE
    [Person].[EmailAddress] CHECK CONSTRAINT [FK_EmailAddress_Person_BusinessEntityID]
GO
```

#### Snowflake

```sql
----** SSC-FDM-TS0054 - CHECK CONSTRAINT STATEMENT REMOVED, ENABLING/DISABLING CONSTRAINTS IS NOT APPLICABLE IN SNOWFLAKE **
--ALTER TABLE IF EXISTS Person.EmailAddress CHECK CONSTRAINT FK_EmailAddress_Person_BusinessEntityID;
```

### Known Limitations

* Snowflake constraints are informational only, so SQL Server workflows that depend on enabling or disabling constraints must be redesigned manually.
* This section only covers the `CHECK CONSTRAINT` action. Unsupported `CHECK` constraint definitions may still emit [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md).

### Related Issues

* [SSC-FDM-TS0054](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): CHECK/NOCHECK CONSTRAINT statement removed.
* [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unsupported `CHECK` constraint definitions.

## ADD

### Description

> **Note:**
>
> In SQL Server, the ADD clause permits multiple actions per ADD, whereas Snowflake only allows a sequence of ADD column actions. Consequently, SnowConvert AI divides the ALTER TABLE ADD clause into individual ALTER TABLE statements.

There is a subset of functionalities provided by the ADD keyword, allowing the addition of different elements to the target table. These include:

* Column definition
* Computed column definition
* Table constraint
* Column set definition

## TABLE CONSTRAINT

Applies to

* SQL Server
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Specifies the properties of a PRIMARY KEY, FOREIGN KEY, UNIQUE, or CHECK constraint that is part of a new column definition added to a table by using [ALTER TABLE](https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-transact-sql?view=sql-server-ver16). (<https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-column-constraint-transact-sql>)

Translation for column constraints is relatively straightforward. There are several parts of the syntax that are not required or not supported in Snowflake.

These parts include:

* `CLUSTERED | NONCLUSTERED`
* `WITH FILLFACTOR = fillfactor`
* `WITH ( index_option [, ...n ] )`
* `ON { partition_scheme_name ( partition_column_name ) | filegroup | "default" }`
* `NOT FOR REPLICATION`
* `CHECK [ NOT FOR REPLICATION ]`

#### Syntax in SQL Server

```sql
 [ CONSTRAINT constraint_name ]
{
    { PRIMARY KEY | UNIQUE }
        [ CLUSTERED | NONCLUSTERED ]
        (column [ ASC | DESC ] [ ,...n ] )
        [ WITH FILLFACTOR = fillfactor
        [ WITH ( <index_option>[ , ...n ] ) ]
        [ ON { partition_scheme_name ( partition_column_name ... )  | filegroup | "default" } ]
    | FOREIGN KEY
        ( column [ ,...n ] )
        REFERENCES referenced_table_name [ ( ref_column [ ,...n ] ) ]
        [ ON DELETE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ ON UPDATE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ NOT FOR REPLICATION ]
    | CONNECTION
        ( { node_table TO node_table }
          [ , {node_table TO node_table }]
          [ , ...n ]
        )
        [ ON DELETE { NO ACTION | CASCADE } ]
    | DEFAULT constant_expression FOR column [ WITH VALUES ]
    | CHECK [ NOT FOR REPLICATION ] ( logical_expression )
}
```

#### Syntax in [**Snowflake**](https://docs.snowflake.com/en/sql-reference/sql/create-table-constraint.html#inline-unique-primary-foreign-key)

```sql
 inlineUniquePK ::=
  [ CONSTRAINT <constraint_name> ]
  { UNIQUE | PRIMARY KEY }
  [ [ NOT ] ENFORCED ]
  [ [ NOT ] DEFERRABLE ]
  [ INITIALLY { DEFERRED | IMMEDIATE } ]
  [ ENABLE | DISABLE ]
  [ VALIDATE | NOVALIDATE ]
  [ RELY | NORELY ]

 [ CONSTRAINT <constraint_name> ]
  { UNIQUE | PRIMARY KEY }
  [ [ NOT ] ENFORCED ]
  [ [ NOT ] DEFERRABLE ]
  [ INITIALLY { DEFERRED | IMMEDIATE } ]
  [ ENABLE | DISABLE ]
  [ VALIDATE | NOVALIDATE ]
  [ RELY | NORELY ]
```

### Sample Source Patterns

#### Multiple ALTER TABLE instances

##### SQL Server

```sql
 -- PRIMARY KEY
ALTER TABLE
    [Person]
ADD
    CONSTRAINT [PK_EmailAddress_BusinessEntityID_EmailAddressID] PRIMARY KEY CLUSTERED (
        [BusinessEntityID] ASC,
        [EmailAddressID] ASC
    ) ON [PRIMARY]
GO

-- FOREING KEY TO ANOTHER TABLE
ALTER TABLE
    [Person].[EmailAddress] WITH CHECK
ADD
    CONSTRAINT [FK_EmailAddress_Person_BusinessEntityID] FOREIGN KEY([BusinessEntityID]) REFERENCES [Person].[Person] ([BusinessEntityID]) ON DELETE CASCADE
GO
```

##### Snowflake

```sql
 -- PRIMARY KEY
ALTER TABLE Person
ADD
    CONSTRAINT PK_EmailAddress_BusinessEntityID_EmailAddressID PRIMARY KEY (BusinessEntityID, EmailAddressID);

-- FOREING KEY TO ANOTHER TABLE
ALTER TABLE Person.EmailAddress
!!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
WITH CHECK
ADD
    CONSTRAINT FK_EmailAddress_Person_BusinessEntityID FOREIGN KEY(BusinessEntityID) REFERENCES Person.Person (BusinessEntityID) ON DELETE CASCADE ;
```

#### DEFAULT within constraints

##### SQL Server

```sql
CREATE TABLE Table1
(
   COL_VARCHAR VARCHAR,
   COL_INT INT,
   COL_DATE DATE
);

ALTER TABLE
    Table1
ADD
    CONSTRAINT [DF_Table1_COL_INT] DEFAULT ((0)) FOR [COL_INT]
GO

ALTER TABLE
    Table1
ADD
    COL_NEWCOLUMN VARCHAR,
    CONSTRAINT [DF_Table1_COL_VARCHAR] DEFAULT ('NOT DEFINED') FOR [COL_VARCHAR]
GO

ALTER TABLE
    Table1
ADD
    CONSTRAINT [DF_Table1_COL_DATE] DEFAULT (getdate()) FOR [COL_DATE]
GO
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE Table1 (
   COL_VARCHAR VARCHAR DEFAULT ('NOT DEFINED'),
   COL_INT INT DEFAULT ((0)),
   COL_DATE DATE DEFAULT (CURRENT_TIMESTAMP() :: TIMESTAMP)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE Table1
--ADD
--    CONSTRAINT DF_Table1_COL_INT DEFAULT ((0)) FOR COL_INT
                                                          ;

ALTER TABLE Table1
ADD COL_NEWCOLUMN VARCHAR;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE Table1
--ADD
--CONSTRAINT DF_Table1_COL_VARCHAR DEFAULT ('NOT DEFINED') FOR COL_VARCHAR
                                                                        ;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE Table1
--ADD
--    CONSTRAINT DF_Table1_COL_DATE DEFAULT (CURRENT_TIMESTAMP() :: TIMESTAMP) FOR COL_DATE
                                                                                         ;
```

### Known Issues

**1. DEFAULT is only supported within** `CREATE TABLE` and `ALTER TABLE ... ADD COLUMN`

SQL Server supports defining a `DEFAULT` property within a constraint, while Snowflake only allows that when adding the column through `CREATE TABLE` or `ALTER TABLE ... ADD COLUMN`. `DEFAULT` properties within the `ADD CONSTRAINT` syntax are not supported and will be translated to ALTER TABLE ALTER COLUMN.

### Related EWIs

1. [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.
2. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
3. [SSC-FDM-TS0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Default constraint was commented out and may have been added to a table definition.

## CHECK

Applies to

* SQL Server

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

When CHECK clause is in the ALTER statement, SnowConvert AI will comment out the entire statement, since it is not supported.

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE dbo.doc_exd
ADD CONSTRAINT exd_check CHECK NOT FOR REPLICATION (column_a > 1);
```

#### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
ALTER TABLE dbo.doc_exd
ADD CONSTRAINT exd_check CHECK NOT FOR REPLICATION (column_a > 1);
```

### Known Issues

**1.** **ALTER TABLE CHECK clause is not supported in Snowflake.**

The entire ALTER TABLE CHECK clause is commented out, since it is not supported in Snowflake.

### Related EWIs

* [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.

## CONNECTION

Applies to

* SQL Server

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

When CONNECTION clause is in the ALTER statement, SnowConvert AI will comment out the entire statement, since it is not supported.

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE bought
ADD COL2 VARCHAR(32), CONSTRAINT EC_BOUGHT1 CONNECTION (Customer TO Product, Supplier TO Product)
ON DELETE NO ACTION;
```

#### Snowflake

```sql
ALTER TABLE bought
ADD COL2 VARCHAR(32);

!!!RESOLVE EWI!!! /*** SSC-EWI-0109 - ALTER TABLE SYNTAX NOT APPLICABLE IN SNOWFLAKE ***/!!!
ALTER TABLE bought
ADD
CONSTRAINT EC_BOUGHT1 CONNECTION (Customer TO Product, Supplier TO Product)
ON DELETE NO ACTION;
```

### Known Issues

**1.** **ALTER TABLE CONNECTION clause is not supported in Snowflake.**

The entire ALTER TABLE CONNECTION clause is commented out, since it is not supported in Snowflake.

### Related EWIs

* [SSC-EWI-0109](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Alter Table syntax is not applicable in Snowflake.

## DEFAULT

Applies to

* SQL Server

### Description

When DEFAULT clause is in the ALTER statement, SnowConvert AI will comment out the entire statement, since it is not supported.

The only functional scenario happens when the table definition is on the same file, in this way the default is added in the column definition.

### Sample Source Patterns

#### SQL Server

```sql
CREATE TABLE table1
(
  col1 integer not null,
  col2 varchar collate Latin1_General_CS,
  col3 date not null
)

ALTER TABLE table1
ADD CONSTRAINT col1_constraint DEFAULT 50 FOR col1;

ALTER TABLE table1
ADD CONSTRAINT col2_constraint DEFAULT 'hello world' FOR col2;

ALTER TABLE table1
ADD CONSTRAINT col3_constraint DEFAULT getdate() FOR col3;
```

#### Snowflake

```sql
CREATE OR REPLACE TABLE table1 (
  col1 INTEGER not null DEFAULT 50,
  col2 VARCHAR COLLATE 'EN-CS' DEFAULT 'hello world',
  col3 DATE not null DEFAULT CURRENT_TIMESTAMP() :: TIMESTAMP
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE table1
--ADD CONSTRAINT col1_constraint DEFAULT 50 FOR col1
                                                  ;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE table1
--ADD CONSTRAINT col2_constraint DEFAULT 'hello world' FOR col2
                                                             ;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE table1
--ADD CONSTRAINT col3_constraint DEFAULT CURRENT_TIMESTAMP() :: TIMESTAMP FOR col3
                                                                                ;
```

### Known Issues

**1. ALTER TABLE DEFAULT clause is not supported in Snowflake.**

The entire ALTER TABLE DEFAULT clause is commented out, since it is not supported in Snowflake.

### Related EWIs

1. [SSC-FDM-TS0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Default constraint was commented out and may have been added to a table definition.

## FOREIGN KEY

Applies to

* SQL Server

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

Snowflake supports the grammar for Referential Integrity Constraints, and their properties to facilitate the migration from other databases.

#### SQL Server

```sql
FOREIGN KEY
        ( column [ ,...n ] )
        REFERENCES referenced_table_name [ ( ref_column [ ,...n ] ) ]
        [ ON DELETE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ ON UPDATE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ NOT FOR REPLICATION ]
```

#### Snowflake

```sql
  [ FOREIGN KEY ]
  REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
  [ MATCH { FULL | SIMPLE | PARTIAL } ]
  [ ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
       [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ] ]
  [ [ NOT ] ENFORCED ]
  [ [ NOT ] DEFERRABLE ]
  [ INITIALLY { DEFERRED | IMMEDIATE } ]
  [ ENABLE | DISABLE ]
  [ VALIDATE | NOVALIDATE ]
  [ RELY | NORELY ]
```

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE [Tests].[dbo].[Employee]
ADD CONSTRAINT FK_Department FOREIGN KEY(DepartmentID) REFERENCES Department(DepartmentID)
ON UPDATE CASCADE
ON DELETE NO ACTION
NOT FOR REPLICATION;
```

#### Snowflake

```sql
ALTER TABLE Tests.dbo.Employee
ADD CONSTRAINT FK_Department FOREIGN KEY(DepartmentID) REFERENCES Department (DepartmentID)
ON UPDATE CASCADE
ON DELETE NO ACTION;
```

> **Note:**
>
> Constraints are not enforced in Snowflake, excepting NOT NULL.
>
> Primary and Foreign Key are only used for documentation purposes more than design constraints.

## ON PARTITION

Applies to

* SQL Server
> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> Notice that this statement is removed from the migration because it is a non-relevant syntax. It means that it is not required in Snowflake.

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

In Transact SQL Server, the `on partition` statement is used inside `alter` statements and is used to divide the data across the database. For more information, see the [SQL Server partitioned tables and indexes documentation](https://learn.microsoft.com/en-us/sql/relational-databases/partitions/partitioned-tables-and-indexes?view=sql-server-ver16).

### Sample Source Patterns

#### On Partition

Notice that in this example the `ON PARTITION` has been removed. This is because Snowflake provides an integrated partitioning methodology. Thus, the syntax is not relevant.

##### SQL SERVER

```sql
ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE
ON partition_scheme_name (partition_column_name);
```

##### Snowflake

```sql
ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE;
```

## PRIMARY KEY

Applies to

* SQL Server

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

SQL Server primary key has many clauses that are not applicable for Snowflake. So, most of the statement will be commented out.

#### Syntax in SQL Server

```sql
{ PRIMARY KEY | UNIQUE }
[ CLUSTERED | NONCLUSTERED ]
(column [ ASC | DESC ] [ ,...n ] )
[ WITH FILLFACTOR = fillfactor
[ WITH ( <index_option>[ , ...n ] ) ]
[ ON { partition_scheme_name ( partition_column_name ... )  | filegroup | "default" } ]
```

#### Syntax in Snowflake

```sql
[ CONSTRAINT <constraint_name> ]
{ UNIQUE | PRIMARY KEY } ( <col_name> [ , <col_name> , ... ] )
[ [ NOT ] ENFORCED ]
[ [ NOT ] DEFERRABLE ]
[ INITIALLY { DEFERRED | IMMEDIATE } ]
[ ENABLE | DISABLE ]
[ VALIDATE | NOVALIDATE ]
[ RELY | NORELY ]
```

### Sample Source Patterns

> **Warning:**
>
> Notice that `WITH FILLFACTOR` statement has been removed from the translation because it is not relevant in Snowflake syntax.

#### SQL Server

```sql
ALTER TABLE Production.TransactionHistoryArchive
   ADD CONSTRAINT PK_TransactionHistoryArchive_TransactionID PRIMARY KEY
   CLUSTERED (TransactionID)
   WITH (FILLFACTOR = 75, ONLINE = ON, PAD_INDEX = ON)
   ON "DEFAULTLOCATION";
```

#### Snowflake

```sql
ALTER TABLE Production.TransactionHistoryArchive
   ADD CONSTRAINT PK_TransactionHistoryArchive_TransactionID PRIMARY KEY (TransactionID);
```

## COLUMN DEFINITION

ALTER TABLE ADD column_name

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Specifies the properties of a column that are added to a table by using [ALTER TABLE](https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-transact-sql?view=sql-server-ver16).

Adding a [column definition](https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-column-definition-transact-sql?view=sql-server-ver16) in Snowflake does have some differences compared to SQL Server.

For instance, several parts of the SQL Server grammar are not required or entirely not supported by Snowflake. These include:

* FILESTREAM
* [ROWGUIDCOL](https://learn.microsoft.com/en-us/sql/t-sql/statements/alter-table-column-definition-transact-sql?view=sql-server-ver16)
* ENCRYPTED WITH …
* SPARSE

Additionally, a couple other parts are partially supported, and require additional work to be implemented to properly emulate the original functionality. Specifically, we’re talking about the `MASKED WITH` property, which will be covered in the patterns section of this page.

#### SQL Server

```sql
column_name <data_type>
[ FILESTREAM ]
[ COLLATE collation_name ]
[ NULL | NOT NULL ]
[
    [ CONSTRAINT constraint_name ] DEFAULT constant_expression [ WITH VALUES ]
    | IDENTITY [ ( seed , increment ) ] [ NOT FOR REPLICATION ]
]
[ ROWGUIDCOL ]
[ SPARSE ]
[ ENCRYPTED WITH
  ( COLUMN_ENCRYPTION_KEY = key_name ,
      ENCRYPTION_TYPE = { DETERMINISTIC | RANDOMIZED } ,
      ALGORITHM =  'AEAD_AES_256_CBC_HMAC_SHA_256'
  ) ]
[ MASKED WITH ( FUNCTION = ' mask_function ') ]
[ <column_constraint> [ ...n ] ]
```

#### Snowflake

```sql
ADD [ COLUMN ] <col_name> <col_type>
        [ { DEFAULT <expr> | { AUTOINCREMENT | IDENTITY } [ { ( <start_num> , <step_num> ) | START <num> INCREMENT <num> } ] } ]
                            /* AUTOINCREMENT (or IDENTITY) supported only for columns with numeric data types (NUMBER, INT, FLOAT, etc.). */
                            /* Also, if the table is not empty (i.e. rows exist in the table), only DEFAULT can be altered.               */
        [ inlineConstraint ]
        [ [ WITH ] MASKING POLICY <policy_name> [ USING ( <col1_name> , cond_col_1 , ... ) ] ]
```

### Sample Source Patterns

#### Basic pattern

This pattern showcases the removal of elements from the original ALTER TABLE.

##### SQL Server

```sql
ALTER TABLE table_name
ADD column_name INTEGER;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD column_name INTEGER;
```

#### COLLATE

Collation allows you to specify broader rules when talking about string comparison.

##### SQL Server

```sql
ALTER TABLE table_name
ADD COLUMN new_column_name VARCHAR
COLLATE Latin1_General_CI_AS;
```

Since the collation rule nomenclature varies from SQL Server to Snowflake, it is necessary to make adjustments.

##### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD COLUMN new_column_name VARCHAR COLLATE 'EN-CI-AS' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/;
```

#### MASKED WITH

This pattern showcases the translation for MASKED WITH property. CREATE OR REPLACE MASKING POLICY is inserted somewhere before the first usage, and then referenced by a SET MASKING POLICY clause.

The name of the new MASKING POLICY will be the concatenation of the name and arguments of the original MASKED WITH FUNCTION, as seen below:

##### SQL Server

```sql
ALTER TABLE table_name
ALTER COLUMN column_name
ADD MASKED WITH ( FUNCTION = ' random(1, 999) ' );
```

##### Snowflake

```sql
--** SSC-FDM-TS0022 - MASKING ROLE MUST BE DEFINED PREVIOUSLY BY THE USER **
CREATE OR REPLACE MASKING POLICY "random_1_999" AS
(val SMALLINT)
RETURNS SMALLINT ->
CASE
WHEN current_role() IN ('YOUR_DEFINED_ROLE_HERE')
THEN val
ELSE UNIFORM(1, 999, RANDOM()) :: SMALLINT
END;

ALTER TABLE IF EXISTS table_name MODIFY COLUMN column_name/*** SSC-FDM-TS0021 - A MASKING POLICY WAS CREATED AS SUBSTITUTE FOR MASKED WITH ***/  SET MASKING POLICY "random_1_999";
```

#### DEFAULT

This pattern showcases some of the basic translation scenarios for DEFAULT property.

##### SQL Server

```sql
ALTER TABLE table_name
ADD intcol INTEGER DEFAULT 0;

ALTER TABLE table_name
ADD varcharcol VARCHAR(20) DEFAULT '';

ALTER TABLE table_name
ADD datecol DATE DEFAULT CURRENT_TIMESTAMP;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD intcol INTEGER DEFAULT 0;

ALTER TABLE IF EXISTS table_name
ADD varcharcol VARCHAR(20) DEFAULT '';

ALTER TABLE IF EXISTS table_name
ADD datecol DATE
                 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0078 - DEFAULT OPTION NOT ALLOWED IN SNOWFLAKE ***/!!!
                 DEFAULT CURRENT_TIMESTAMP;
```

#### ENCRYPTED WITH

This pattern showcases the translation for ENCRYPTED WITH property, which is commented out in the output code.

##### SQL Server

```sql
ALTER TABLE table_name
ADD encryptedcol VARCHAR(20)
ENCRYPTED WITH
  ( COLUMN_ENCRYPTION_KEY = key_name ,
      ENCRYPTION_TYPE = RANDOMIZED ,
      ALGORITHM =  'AEAD_AES_256_CBC_HMAC_SHA_256'
  );
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD encryptedcol VARCHAR(20)
----** SSC-FDM-TS0009 - ENCRYPTED WITH NOT SUPPORTED IN SNOWFLAKE **
--ENCRYPTED WITH
--  ( COLUMN_ENCRYPTION_KEY = key_name ,
--      ENCRYPTION_TYPE = RANDOMIZED ,
--      ALGORITHM =  'AEAD_AES_256_CBC_HMAC_SHA_256'
--  )
   ;
```

#### NOT NULL

The SQL Server NOT NULL clause has the same pattern and functionality as the Snowflake NOT NULL clause

##### SQL Server

```sql
ALTER TABLE table2 ADD
column_test INTEGER NOT NULL,
column_test2 INTEGER NULL,
column_test3 INTEGER;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table2 ADD column_test INTEGER NOT NULL;

ALTER TABLE IF EXISTS table2 ADD column_test2 INTEGER NULL;

ALTER TABLE IF EXISTS table2 ADD column_test3 INTEGER;
```

#### IDENTITY

This pattern showcases the translation for IDENTITY. The `NOT FOR REPLICATION` portion is removed in Snowflake.

##### SQL Server

```sql
ALTER TABLE table3 ADD
column_test INTEGER IDENTITY(1, 100) NOT FOR REPLICATION;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table3 ADD column_test INTEGER IDENTITY(1, 100) ORDER;
```

### Unsupported clauses

#### FILESTREAM

The original behavior of `FILESTREAM` is not replicable in Snowflake, and merits commenting out the entire `ALTER TABLE` statement.

##### SQL Server

```sql
ALTER TABLE table2
ADD column1 varbinary(max)
FILESTREAM;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table2
ADD column1 VARBINARY
!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'FILESTREAM COLUMN OPTION' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FILESTREAM;
```

#### SPARSE

In SQL Server, [SPARSE](https://docs.microsoft.com/en-us/sql/relational-databases/tables/use-sparse-columns) is used to define columns that are optimized for NULL storage. However, when we’re using Snowflake, we are not required to use this clause.

Snowflake performs [optimizations over tables](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.html#benefits-of-micro-partitioning) automatically, which mitigates the need for manual user-made optimizations.

##### SQL Server

```sql
-- ADD COLUMN DEFINITION form
ALTER TABLE table3
ADD column1 int NULL SPARSE;

----------------------------------------
/* It also applies to the other forms */
----------------------------------------

-- CREATE TABLE form
CREATE TABLE table3
(
    column1 INT SPARSE NULL
);

-- ALTER COLUMN form
ALTER TABLE table3
ALTER COLUMN column1 INT NULL SPARSE;
```

##### Snowflake

```sql
-- ADD COLUMN DEFINITION form
ALTER TABLE IF EXISTS table3
ALTER COLUMN column1
                     !!!RESOLVE EWI!!! /*** SSC-EWI-TS0061 - ALTER COLUMN COMMENTED OUT BECAUSE SPARSE COLUMN IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                     INT NULL SPARSE;

----------------------------------------
/* It also applies to the other forms */
----------------------------------------

-- CREATE TABLE form
CREATE OR REPLACE TABLE table3
(
    column1 INT
                !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'SPARSE COLUMN OPTION' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                SPARSE NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

-- ALTER COLUMN form
ALTER TABLE IF EXISTS table3
ALTER COLUMN column1
                     !!!RESOLVE EWI!!! /*** SSC-EWI-TS0061 - ALTER COLUMN COMMENTED OUT BECAUSE SPARSE COLUMN IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                     INT NULL SPARSE;
```

#### ROWGUIDCOL

##### SQL Server

```sql
ALTER TABLE table_name
ADD column_name UNIQUEIDENTIFIER
ROWGUIDCOL;
```

##### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD column_name VARCHAR
!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'ROWGUIDCOL COLUMN OPTION' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
ROWGUIDCOL;
```

### Known Issues

**1. Roles and users have to be previously set up for masking policies**

Snowflake’s Masking Policies can be applied to columns only after the policies were created. This requires the user to create the policies and assign them to roles, and these roles to users, to work properly. Masking Policies can behave differently depending on which user is querying.

> **Warning:**
>
> SnowConvert AI does not perform this setup automatically.

**2. Masking policies require a Snowflake Enterprise account or higher.**

The Snowflake documentation states that masking policies are available on Enterprise or higher rank accounts.

> **Note:**
>
> For further details visit [CREATE MASKING POLICY — Snowflake Documentation](https://docs.snowflake.com/en/sql-reference/sql/create-masking-policy.html#create-masking-policy).

**3. DEFAULT only supports constant values**

SQL Server’s DEFAULT property is partially supported by Snowflake, as long as its associated value is a constant.

**4.** **FILESTREAM clause is not supported in Snowflake.**

The entire FILESTSTREAM clause is commented out, since it is not supported in Snowflake.

**5.** **SPARSE clause is not supported in Snowflake.**

The entire SPARSE clause is commented out, since it is not supported in Snowflake. When it is added within an ALTER COLUMN statement, and it’s the only modification being made to the column, the entire statement is removed since it’s no longer adding anything.

### Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
2. [SSC-EWI-TS0061](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): ALTER COLUMN not supported.
3. [SSC-EWI-TS0078](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Default value not allowed in Snowflake.
4. [SSC-FDM-TS0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Encrypted with not supported in Snowflake.
5. [SSC-FDM-TS0021](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): A MASKING POLICY was created as a substitute for MASKED WITH.
6. [SSC-FDM-TS0022](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): The user must previously define the masking role.
7. [SSC-PRF-0002](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Case-insensitive columns can decrease the performance of queries.

## COLUMN CONSTRAINT

ALTER TABLE ADD COLUMN … COLUMN CONSTRAINT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Specifies the properties of a PRIMARY KEY, FOREIGN KEY or CHECK that is part of a new [column constraint](https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-column-constraint-transact-sql?view=sql-server-ver16) added to a table by using [Alter Table.](https://docs.microsoft.com/en-us/sql/t-sql/statements/alter-table-transact-sql?view=sql-server-ver16)

#### SQL Server

```sql
[ CONSTRAINT constraint_name ]
{
    [ NULL | NOT NULL ]
    { PRIMARY KEY | UNIQUE }
        [ CLUSTERED | NONCLUSTERED ]
        [ WITH FILLFACTOR = fillfactor ]
        [ WITH ( index_option [, ...n ] ) ]
        [ ON { partition_scheme_name (partition_column_name)
            | filegroup | "default" } ]
    | [ FOREIGN KEY ]
        REFERENCES [ schema_name . ] referenced_table_name
            [ ( ref_column ) ]
        [ ON DELETE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ ON UPDATE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
        [ NOT FOR REPLICATION ]
    | CHECK [ NOT FOR REPLICATION ] ( logical_expression )
}
```

#### Snowflake

```sql
CREATE TABLE <name> ( <col1_name> <col1_type>    [ NOT NULL ] { inlineUniquePK | inlineFK }
                     [ , <col2_name> <col2_type> [ NOT NULL ] { inlineUniquePK | inlineFK } ]
                     [ , ... ] )

ALTER TABLE <name> ADD COLUMN <col_name> <col_type> [ NOT NULL ] { inlineUniquePK | inlineFK }
```

Where:

```sql
inlineUniquePK ::=
  [ CONSTRAINT <constraint_name> ]
  { UNIQUE | PRIMARY KEY }
  [ [ NOT ] ENFORCED ]
  [ [ NOT ] DEFERRABLE ]
  [ INITIALLY { DEFERRED | IMMEDIATE } ]
  [ ENABLE | DISABLE ]
  [ VALIDATE | NOVALIDATE ]
  [ RELY | NORELY ]
```

```sql
inlineFK :=
  [ CONSTRAINT <constraint_name> ]
  [ FOREIGN KEY ]
  REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
  [ MATCH { FULL | SIMPLE | PARTIAL } ]
  [ ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
       [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ] ]
  [ [ NOT ] ENFORCED ]
  [ [ NOT ] DEFERRABLE ]
  [ INITIALLY { DEFERRED | IMMEDIATE } ]
  [ ENABLE | DISABLE ]
  [ VALIDATE | NOVALIDATE ]
  [ RELY | NORELY ]
```

## CHECK

Applies to

* SQL Server

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

When CHECK clause is in the ALTER statement, SnowConvert AI will comment out the entire statement, since it is not supported.

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE table_name
ADD column_name VARCHAR(255)
CONSTRAINT constraint_name
CHECK NOT FOR REPLICATION (column_name > 1);
```

#### Snowflake

```sql
ALTER TABLE IF EXISTS table_name
ADD column_name VARCHAR(255)
!!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
CONSTRAINT constraint_name
CHECK NOT FOR REPLICATION (column_name > 1);
```

### Known Issues

**1.** **ALTER TABLE CHECK clause is not supported in Snowflake.**

The entire ALTER TABLE CHECK clause is commented out, since it is not supported in Snowflake.

### Related EWIs

* [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.

## FOREIGN KEY

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The syntax for the Foreign Key is fully supported by Snowflake, except for the `[ NOT FOR REPLICATION ]` and the `WITH CHECK` clauses.

#### SQL Server

Review the following [SQL Server documentation](https://learn.microsoft.com/en-us/sql/t-sql/statements/alter-table-transact-sql?view=sql-server-ver16#syntax-for-memory-optimized-tables) for more information.

```sql
[ FOREIGN KEY ]
REFERENCES [ schema_name . ] referenced_table_name
[ ( ref_column ) ]
[ ON DELETE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
[ ON UPDATE { NO ACTION | CASCADE | SET NULL | SET DEFAULT } ]
[ NOT FOR REPLICATION ]
```

#### Snowflake

```sql
[ FOREIGN KEY ]
REFERENCES <ref_table_name> [ ( <ref_col_name> ) ]
[ MATCH { FULL | SIMPLE | PARTIAL } ]
[ ON [ UPDATE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ]
     [ DELETE { CASCADE | SET NULL | SET DEFAULT | RESTRICT | NO ACTION } ] ]
[ [ NOT ] ENFORCED ]
[ [ NOT ] DEFERRABLE ]
[ INITIALLY { DEFERRED | IMMEDIATE } ]
[ ENABLE | DISABLE ]
[ VALIDATE | NOVALIDATE ]
[ RELY | NORELY ]
```

### Sample Source Patterns

#### General case

##### SQL Server

```sql
ALTER TABLE dbo.student
ADD CONSTRAINT Fk_empid FOREIGN KEY(emp_id)
REFERENCES dbo.emp(id);

ALTER TABLE dbo.student
ADD CONSTRAINT Fk_empid FOREIGN KEY(emp_id)
REFERENCES dbo.emp(id)
NOT FOR REPLICATION;
```

##### Snowflake

```sql
ALTER TABLE dbo.student
ADD CONSTRAINT Fk_empid FOREIGN KEY(emp_id)
REFERENCES dbo.emp (id);

ALTER TABLE dbo.student
ADD CONSTRAINT Fk_empid FOREIGN KEY(emp_id)
REFERENCES dbo.emp (id);
```

#### WITH CHECK / NO CHECK case

Notice that Snowflake logic does not support the CHECK clause in the creation of foreign keys. The `WITH CHECK` statement is marked as not supported. Besides, the `WITH NO CHECK` clause is removed because it is the default behavior in Snowflake and the equivalence is the same.

Please, review the following examples to have a better understanding of the translation.

##### SQL Server

```sql
ALTER TABLE testTable
WITH CHECK ADD CONSTRAINT testFK1 FOREIGN KEY (table_id)
REFERENCES otherTable (Othertable_id);

ALTER TABLE testTable
WITH NOCHECK ADD CONSTRAINT testFK2 FOREIGN KEY (table_id)
REFERENCES otherTable (Othertable_id);
```

##### Snowflake

```sql
ALTER TABLE testTable
----** SSC-FDM-0014 - CHECK STATEMENT NOT SUPPORTED **
--WITH CHECK
           ADD CONSTRAINT testFK1 FOREIGN KEY (table_id)
REFERENCES otherTable (Othertable_id);

ALTER TABLE testTable
ADD CONSTRAINT testFK2 FOREIGN KEY (table_id)
REFERENCES otherTable (Othertable_id);
```

### Known Issues

**1.** **NOT FOR REPLICATION clause.**

Snowflake has a different approach to the replication cases. Please, review the following [documentation](https://docs.snowflake.com/en/user-guide/account-replication-considerations).

**2. WITH CHECK clause.**

Snowflake does not support the `WITH CHECK` statement. Review the following [documentation](https://docs.snowflake.com/en/sql-reference/constraints-overview) for more information.

## PRIMARY KEY / UNIQUE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

All of the optional clauses of the PRIMARY KEY / UNIQUE constraint are removed in Snowflake.

**Syntax in SQL Server**

```sql
{ PRIMARY KEY | UNIQUE }
    [ CLUSTERED | NONCLUSTERED ]
    [ WITH FILLFACTOR = fillfactor ]
    [ WITH ( index_option [, ...n ] ) ]
    [ ON { partition_scheme_name (partition_column_name)
        | filegroup | "default" } ]
```

### Sample Source Patterns

#### SQL Server

```sql
ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name PRIMARY KEY
NONCLUSTERED;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE
WITH FILLFACTOR = 80;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name PRIMARY KEY
WITH (PAD_INDEX = off);

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE
ON partition_scheme_name (partition_column_name);
```

#### Snowflake

```sql
ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name PRIMARY KEY;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name PRIMARY KEY;

ALTER TABLE table_name
ADD column_name INTEGER
CONSTRAINT constraint_name UNIQUE;
```

---
title: SnowConvert AI - SQL Server-Azure Synapse - ANSI_NULLS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-ansi-nulls.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - ANSI_NULLS

Applies to

* SQL Server
* Azure Synapse Analytics

## Description

This statement specifies the ISO-compliant behavior of the Equals and Not Equal to comparison operators when used with null values in SQLServer. Please visit [SET ANSI_NULLS](https://learn.microsoft.com/en-us/sql/t-sql/statements/set-ansi-nulls-transact-sql?view=sql-server-ver16) to get more information about this statement.

## Transact-SQL Syntax

```sql
 SET ANSI_NULLS { ON | OFF }
```

## Sample Source Patterns

### SET ANSI_NULLS ON

*“SET ANSI_NULLS ON affects a comparison only if one of the operands of the comparison is either a variable that is NULL or a literal NULL. If both sides of the comparison are columns or compound expressions, the setting does not affect the comparison.*” (SQLServer ANSI_NULLS article).

Snowflake does not support this statement, so in the case of ANSI_NULLS ON, this is marked with an FDM ([SSC-FDM-TS0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md))
because it does not have relevance in executing equal and not equal comparison operations.
Here, you can find an explanation of the [NULL treatment in Snowflake](https://community.snowflake.com/s/article/NULL-handling-in-Snowflake).

#### SQL Server

```sql
 SET ANSI_NULLS ON;
```

#### Snowflake

```sql
 ----** SSC-FDM-TS0027 - SET ANSI_NULLS ON STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE **
--SET ANSI_NULLS ON
```

### SET ANSI_NULLS OFF

“*When ANSI_NULLS is OFF, the Equals (`=`) and Not Equal To (`<>`) comparison operators do not follow the ISO standard. A SELECT statement that uses `WHERE column_name = NULL` returns the rows that have null values in column_name. A SELECT statement that uses `WHERE column_name <> NULL` returns the rows that have non-NULL values in the column*”. (SQLServer ANSI_NULLS article).

In the case of the ANSI_NULLS OFF statement, this one is marked with an EWI ([SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)) because it requires extra manual effort.

#### SQL Server

```sql
 SET ANSI_NULLS OFF;
```

#### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'SIMPLE SET STATEMENT' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
SET ANSI_NULLS OFF;
```

## Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement is not supported in Snowflake
2. [SSC-FDM-0027](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): SET ANSI_NULLS ON statement may have different behavior in Snowflake

---
title: SnowConvert AI - SQL Server-Azure Synapse - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-built-in-functions.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Built-in functions

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Aggregate

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| TransactSQL | Snowflake | Notes |
| APPROX_COUNT_DISTINCT | APPROX_COUNT_DISTINCT |  |
| AVG​ | AVG |  |
| CHECKSUM_AGG | *\*to be defined* |  |
| COUNT | COUNT |  |
| COUNT_BIG | *\*to be defined* |  |
| GROUPING | GROUPING |  |
| GROUPING_ID | GROUPING_ID |  |
| MAX | MAX |  |
| MIN | MIN |  |
| STDEV | STDDEV, STDEV_SAMP |  |
| STDEVP | STDDEV_POP |  |
| SUM | SUM |  |
| VAR | VAR_SAMP |  |
| VARP | VAR_POP​ |  |

## Analytic

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| CUME_DIST | CUME_DIST |  |
| FIRST_VALUE | FIRST_VALUE |  |
| LAG | LAG |  |
| LAST_VALUE | LAST_VALUE |  |
| LEAD | LEAD |  |
| PERCENTILE_CONT | PERCENTILE_CONT |  |
| PERCENTILE_DISC | PERCENTILE_DISC |  |
| PERCENT_RANK | PERCENT_RANK |  |

## Collation

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| COLLATIONPROPERTY | *\*to be defined* |  |
| TERTIARY_WEIGHTS | *\*to be defined* |  |

## Configuration

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| ​@@DBTS | *\*to be defined* |  |
| @@LANGID | *\*to be defined* |  |
| @@LANGUAGE | *\*to be defined* |  |
| @@LOCK_TIMEOUT | *\*to be defined* |  |
| @@MAX_CONNECTIONS | *\*to be defined* |  |
| @@MAX_PRECISION | *\*to be defined* |  |
| @@NESTLEVEL | *\*to be defined* |  |
| @@OPTIONS | *\*to be defined* |  |
| @@REMSERVER | *\*to be defined* |  |
| @@SERVERNAME | CONCAT(’[app.snowflake.com](http://app.snowflake.com/)’, CURRENT_ACCOUNT( )) |  |
| @@SERVICENAME | *\*to be defined* |  |
| @@SPID | *\*to be defined* |  |
| @@TEXTSIZE | *\*to be defined* |  |
| @@VERSION | *\*to be defined* | Can be mimicked by using CURRENT_VERSION |

## Conversion

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| CAST | CAST | Returns NULL if the value isn’t a number, otherwise returns the numeric value as its. When using operators such as <, >, =, <> then must be followed by a NULL |
| CONVERT | Check CONVERT | Same behavior as CAST |
| PARSE | *\*to be defined* |  |
| TRY_CAST | TRY_CAST | Returns NULL if the value isn’t a number, otherwise returns the numeric value as its. When using operators such as <, >, =, <> then must be followed by a NULL |
| TRY_CONVERT | *\*to be defined* | Same behavior as TRY_CAST |
| TRY_PARSE | TRY_CAST | Behavior may be different when parsing an integer as date or timestamp. |

## Cryptographic

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| ASYMKEY_ID | *\*to be defined* |  |
| ASYMKEYPROPERTY | *\*to be defined* |  |
| CERTENCODED | *\*to be defined* |  |
| CERTPRIVATEKEY | *\*to be defined* |  |
| DECRYPTBYASYMKEY | *\*to be defined* |  |
| DECRYPTBYCERT | *\*to be defined* |  |
| DECRYPTBYKEY | *\*to be defined* |  |
| DECRYPTBYKEYAUTOASYMKEY | *\*to be defined* |  |
| DECRYPTBYKEYAUTOCERT | *\*to be defined* |  |
| DECRYPTBYPASSPHRASE | _\*to be defined_​ | Can be mimicked by using DENCRYPT_RAW |
| ENCRYPTBYASYMKEY | *\*to be defined* |  |
| ENCRYPTBYCERT | *\*to be defined* |  |
| ENCRYPTBYKEY | *\*to be defined* |  |
| ENCRYPTBYPASSPHRASE | *\*to be defined* | Can be mimicked by using ENCRYPT_RAW |
| HASHBYTES | **MD5, SHA1, SHA2** | Currently only supported separated hash. Use proper one according to the required algorithm  **MD5**, is a 32-character hex-encoded  **SHA1**, has a 40-character hex-encoded string containing the 160-bit  **SHA2**, a hex-encoded string containing the N-bit SHA-2 message digest. Sizes are:  224 = SHA-224  256 = SHA-256 (Default)  384 = SHA-384  512 = SHA-512 |
| IS_OBJECTSIGNED | *\*to be defined* |  |
| KEY_GUID | *\*to be defined* |  |
| KEY_ID | *\*to be defined* |  |
| KEY_NAME | *\*to be defined* |  |
| SIGNBYASYMKEY | *\*to be defined* |  |
| SIGNBYCERT | *\*to be defined* |  |
| SYMKEYPROPERTY | *\*to be defined* |  |
| VERIFYSIGNEDBYCERT | *\*to be defined* |  |

## Cursor

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| @@CURSOR_ROWS | *\*to be defined* | ​ |
| @@FETCH_STATUS | *\*to be defined* |  |
| CURSOR_STATUS | *\*to be defined* |  |

## Data type

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| DATALENGTH | OCTET_LENGTH | ​Snowflake doesn’t use fractional bytes so length is always calculated as 8 \* OCTET_LENGTH |
| IDENT_SEED | *\*to be defined* |  |
| IDENT_CURRENT | *\*to be defined* |  |
| IDENTITY | *\*to be defined* |  |
| IDENT_INCR | *\*to be defined* |  |
| SQL_VARIANT_PROPERTY | *\*to be defined* |  |

## Date & Time

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| @@DATEFIRST | *\*to be defined* |  |
| @@LANGUAGE | *\*to be defined* |  |
| CURRENT_TIMESTAMP | CURRENT_TIMESTAMP |  |
| CURRENT_TIMEZONE | *\*to be defined* |  |
| DATEADD | DATEADD |  |
| DATEDIFF | DATEDIFF |  |
| DATEDIFF_BIG | *\*to be defined* |  |
| DATEFROMPARTS | DATE_FROM_PARTS |  |
| DATENAME | *\*to be defined* | This function receives two arguments: a datepart and date. It returns a string. Here are the supported dateparts from TSQL to Snowflake  **year, yyyy, yy** -> DATE_PART(YEAR, “$date”) **quarter, qq, q** -> DATE_PART(QUARTER, “$date”) **month, mm, m** -> **MONTHNAME**( “$date”), though only providing a three-letter english month name **dayofyear, dy, y** -> DATE_PART(DAYOFYEAR, “$date”) **day, dd, d** -> DATE_PART(DAY, “$date”) **week, wk, ww** -> DATE_PART(WEEK, “$date”)  **weekday, dw** -> **DAYNAME**(“$date”), though only providing an three-letter english day name **hour, hh** -> DATE_PART(HOUR, “$date”) **minute, n** -> DATE_PART(MINUTE, “$date”) **second, ss, s** -> DATE_PART(SECOND, “$date”) **millisecond, ms** -> DATE_PART(MS, “$date”) **microsecond, mcs** -> DATE_PART(US, “$date”) **nanosecond, ns** -> DATE_PART(NS, “$date”) **TZoffset, tz** -> needs a special implementation to get the time offset |
| DATEPART | DATE_PART |  |
| DATETIME2FROMPARTS | *\*to be defined* |  |
| DATETIMEFROMPARTS | *\*to be defined* | ​Can be mimicked by using a combination of **DATE_FROM_PARTS and TIME_FROM_PARTS** |
| DATETIMEOFFSETFROMPARTS | *\*to be defined* |  |
| DAY | DAY |  |
| EOMONTH | *\*to be defined* | Can be mimicked by using **LAST_DAY** |
| GETDATE | GETDATE |  |
| GETUTCDATE | *\*to be defined* | Can be mimicked by using **CONVERT_TIMEZONE** |
| ISDATE | *\*to be defined* | Can be mimicked by using **TRY_TO_DATE**  Returns NULL if the value isn’t a **date**, otherwise returns the date value as its. When using operators such as <, >, =, <> then must be followed by a NULL |
| MONTH | MONTH |  |
| SMALLDATETIMEFROMPARTS | *\*to be defined* | ​​Can be mimicked by using a combination of **DATE_FROM_PARTS and TIME_FROM_PARTS** |
| SWITCHOFFSET | *\*to be defined* | ​Can be mimicked by using **CONVERT_TIMEZONE** |
| SYSDATETIME | LOCALTIME |  |
| SYSDATETIMEOFFSET | *\*to be defined* | ​Can be mimicked by using **CONVERT_TIMEZONE and LOCALTIME** |
| SYSUTCDATETIME | *\*to be defined* | ​​Can be mimicked by using **CONVERT_TIMEZONE and LOCALTIME** |
| TIMEFROMPARTS | TIME_FROM_PARTS | ​ |
| TODATETIMEOFFSET | *\*to be defined* | ​Can be mimicked by using **CONVERT_TIMEZONE** |
| YEAR | YEAR |  |

## JSON

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| ISJSON | CHECK_JSON | ​This is a ‘preview feature’ in Snowflake |
| JSON_VALUE | *\*to be defined* | Can be mimic by using  TO_VARCHAR(GET_PATH(PARSE_JSON(JSON), PATH)) |
| JSON_QUERY | *\*to be defined* |  |
| JSON_MODIFY | *\*to be defined* |  |

## Mathematical

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| ABS | ABS |  |
| ACOS | ACOS |  |
| ASIN | ASIN |  |
| ATAN | ATAN |  |
| ATN2 | ATAN2 |  |
| CEILING | CEIL |  |
| COS | COS |  |
| COT | COT |  |
| DEGREES | DEGREES |  |
| EXP | EXP |  |
| FLOOR | FLOOR |  |
| LOG | LN |  |
| LOG10 | LOG |  |
| PI | PI |  |
| POWER | POWER |  |
| RADIANS | RADIANS |  |
| RAND | RANDOM |  |
| ROUND | ROUND |  |
| SIGN | SIGN |  |
| SIN | SIN |  |
| SQRT | SQRT |  |
| SQUARE | SQUARE |  |

## Logical

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| CHOOSE | *\*to be defined* | Can be mimic by using DECODE |
| GREATEST | GREATEST |  |
| IIF | IIF |  |
| LEAST | LEAST |  |
| NULLIF | NULLIF |  |

## Metadata

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| TransactSQL | Snowflake | Notes |
| @@PROCID | *\*to be defined* |  |
| APP_NAME | *\*to be defined* |  |
| APPLOCK_MODE | *\*to be defined* |  |
| APPLOCK_TEST | *\*to be defined* |  |
| ASSEMBLYPROPERTY | *\*to be defined* |  |
| COL_LENGTH | A UDF named COL_LENGTH_UDF is provided to retrieve this information. This UDF works only with VARCHAR types, as specified in the Transact-SQL documentation. For other data types, it returns NULL. |  |
| COL_NAME | *\*to be defined* |  |
| COLUMNPROPERTY | *\*to be defined* |  |
| DATABASE_PRINCIPAL_ID | *\*to be defined* | Maps to CURRENT_USER when no args |
| DATABASEPROPERTYEX | *\*to be defined* |  |
| DB_ID | *\*to be defined* | We recommend changing to CURRENT_DATABASE(). If there is a need to emulate this functionality.  SELECT DATE_PART(EPOCH,CREATED) FROM INFORMATION_SCHEMA.DATABASES WHERE DATABASE_NAME = ‘DB’ ;  Can achieve something similar |
| DB_NAME | *\*to be defined* | Mostly used in the procedurename mentioned above |
| FILE_ID | *\*to be defined* |  |
| FILE_IDEX | *\*to be defined* |  |
| FILE_NAME | *\*to be defined* |  |
| FILEGROUP_ID | *\*to be defined* |  |
| FILEGROUP_NAME | *\*to be defined* |  |
| FILEGROUPPROPERTY | *\*to be defined* |  |
| FILEPROPERTY | *\*to be defined* |  |
| FULLTEXTCATALOGPROPERTY | *\*to be defined* |  |
| FULLTEXTSERVICEPROPERTY | *\*to be defined* |  |
| INDEX_COL | *\*to be defined* |  |
| INDEXKEY_PROPERTY | *\*to be defined* |  |
| INDEXPROPERTY | *\*to be defined* |  |
| NEXT VALUE FOR | *\*to be defined* |  |
| OBJECT_DEFINITION | *\*to be defined* |  |
| OBJECT_ID | *\*to be defined* | In most cases can be replaced. Most cases are like: IF OBJECT_ID(‘dbo.TABLE’) IS NOT NULL DROP TABLE dbo.Table which can be replaced by a DROP TABLE IF EXISTS (this syntax is also supported in SQL SERVER). If the object_id needs to be replicated, a UDF is added depending on the second parameter of the function call. |
| OBJECT_NAME | *\*to be defined* | Can be replaced by: CREATE OR REPLACE PROCEDURE FOO() RETURNS STRING LANGUAGE JAVASCRIPT AS ‘ var rs = snowflake.execute({sqlText:`SELECT CURRENT_DATABASE() | '.' | ?`, binds:[arguments.callee.name]}); rs.next(); var procname = rs.getColumnValue(1); return procname; ‘; |
| OBJECT_NAME(@@PROCID) | ‘ObjectName’ | This transformation only occurs when it is inside a DeclareStatement.  ObjectName is the name of the TopLevelObject that contains the Function. |
| OBJECT_SCHEMA_NAME | *\*to be defined* |  |
| OBJECT_SCHEMA_NAME(@@PROCID) | :OBJECT_SCHEMA_NAME | This transformation only occurs when it is inside a DeclareStatement. |
| OBJECTPROPERTY | *\*to be defined* |  |
| OBJECTPROPERTYEX | *\*to be defined* |  |
| ORIGINAL_DB_NAME | *\*to be defined* |  |
| PARSENAME | PARSENAME_UDF | It creates a UDF to emulate the same behavior of Parsename function. |
| *\*to be defined* |  |  |
| SCHEMA_NAME | *\*to be defined* |  |
| SCOPE_IDENTITY | *\*to be defined* | It this is needed I would recommend to use sequences, and capture the value before insert |
| SERVERPROPERTY | *\*to be defined* |  |
| STATS_DATE | *\*to be defined* |  |
| TYPE_ID | *\*to be defined* |  |
| TYPE_NAME | *\*to be defined* |  |
| TYPEPROPERTY | *\*to be defined* |  |
| VERSION | *\*to be defined* |  |

## Ranking

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| DENSE_RANK | DENSE_RANK |  |
| NTILE | NTILE |  |
| RANK | RANK |  |
| ROW_NUMBER | ROW_NUMBER |  |

## Replication

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| PUBLISHINGSERVERNAME | *\*to be defined* |  |

## Rowset

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| OPENDATASOURCE | *\*to be defined* |  |
| OPENJSON | *\*to be defined* |  |
| QPENQUERY | *\*to be defined* |  |
| OPENROWSET | *\*to be defined* |  |
| OPENXML | OPENXML_UDF | User-defined function used as a equivalent behavior in Snowflake. |
| STRING_SPLIT | SPLIT_TO_TABLE | The enable_ordinal flag in Transact-SQL’s STRING_SPLIT is not directly supported by Snowflake’s SPLIT_TO_TABLE function. If the ordinal column is required, a user-defined function (UDF) named STRING_SPLIT_UDF will be generated to replicate this behavior. Without the ordinal column, note that STRING_SPLIT returns a single column named value, while SPLIT_TO_TABLE returns three columns: value, index (equivalent to ordinal), and seq. For additional details, see the [SPLIT_TO_TABLE documentation](https://docs.snowflake.com/en/sql-reference/functions/split_to_table). |

## Security

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| CERTENCODED | *\*to be defined* |  |
| CERTPRIVATEKEY | *\*to be defined* |  |
| CURRENT_USER | CURRENT_USER |  |
| DATABASE_PRINCIPAL_ID | *\*to be defined* |  |
| HAS_PERMS_BY_NAME | *\*to be defined* |  |
| IS_MEMBER | *\*to be defined* | Change to query INFORMATION_SCHEMA although the client might require defining new roles |
| IS_ROLEMEMBER | *\*to be defined* | Snowflake’s a similar function  **IS_ROLE_IN_SESSION** |
| IS_SRVROLEMEMBER | *\*to be defined* |  |
| LOGINPROPERTY | *\*to be defined* |  |
| ORIGINAL_LOGIN | *\*to be defined* |  |
| PERMISSIONS | *\*to be defined* |  |
| PWDCOMPARE | *\*to be defined* |  |
| PWDENCRYPT | *\*to be defined* |  |
| SCHEMA_ID | *\*to be defined* |  |
| SCHEMA_NAME | *\*to be defined* |  |
| SESSION_USER | *\*to be defined* |  |
| SUSER_ID | *\*to be defined* |  |
| SUSER_NAME | *\*to be defined* |  |
| SUSER_SID | *\*to be defined* |  |
| SUSER_SNAME | *\*to be defined* |  |
| sys.fn_builtin_permissions | *\*to be defined* |  |
| sys.fn_get_audit_file | *\*to be defined* |  |
| sys.fn_my_permissions | *\*to be defined* |  |
| SYSTEM_USER | *\*to be defined* |  |
| USER_ID | *\*to be defined* |  |
| USER_NAME | *\*to be defined* | Maps to CURRENT_USER |

## String

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| ASCII | ASCII |  |
| CHAR | CHR, CHAR |  |
| CHARINDEX | CHARINDEX |  |
| CONCAT | CONCAT |  |
| CONCAT_WS | CONCAT_WS |  |
| COALESCE | COALESCE |  |
| DIFFERENCE | *\*to be defined* |  |
| FORMAT | TO_CHAR | Supports numeric format specifiers (P, N, %) and date/time custom specifiers (dd, MM, yyyy, HH, mm, ss, fff, dddd, F–FFFFFFF, z, and more). Some date/time specifiers (`dddd`, `F`–`FFFFFFF`, `z`) are translated with SSC-FDM-0036 due to behavioral differences. Additional single-character specifiers (`%y`, `%M`, `%d`, etc.) require [`--enableFormatSpecifiersPreview`](../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md). SSC-EWI-0006 may be generated for remaining unsupported formats. |
| LEFT | LEFT |  |
| LEN | LEN |  |
| LOWER | LOWER |  |
| LTRIM | LTRIM |  |
| NCHAR | *\*to be defined* |  |
| PATINDEX | *\*to be defined* | Map to REGEXP_INSTR |
| QUOTENAME | QUOTENAME_UDF | It creates a UDF to emulate the same behavior of Quotename function |
| REPLACE | REPLACE |  |
| REPLICATE | REPEAT |  |
| REVERSE | REVERSE |  |
| RIGHT | RIGHT |  |
| RTRIM | RTRIM |  |
| SOUNDEX | SOUNDEX |  |
| SPACE | *\*to be defined* |  |
| STR | *\*to be defined* |  |
| STRING_AGG | *\*to be defined* |  |
| STRING_ESCAPE | *\*to be defined* |  |
| STRING_SPLIT | SPLIT_TO_TABLE |  |
| STUFF | *\*to be defined* | CREATE OR REPLACE FUNCTION STUFF(S string, STARTPOS int, LENGTH int, NEWSTRING string) RETURNS string LANGUAGE SQL AS ‘ left(S, STARTPOS) |
| SUBSTRING | SUBSTRING |  |
| TRANSLATE | TRANSLATE |  |
| TRIM | TRIM |  |
| UNICODE | UNICODE |  |
| UPPER | UPPER |  |

## System

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| $PARTITION | *\*to be defined* |  |
| @@ERROR | *\*to be defined* |  |
| @@IDENTITY | *\*to be defined* | It this is needed I would recommend to use sequences, and capture the value before insert |
| @@PACK_RECEIVED | *\*to be defined* |  |
| @@ROWCOUNT | *\*to be defined* |  |
| @@TRANCOUNT | *\*to be defined* |  |
| BINARY_CHECKSUM | *\*to be defined* |  |
| CHECKSUM | *\*to be defined* |  |
| COMPRESS | COMPRESS | ​Snowflake’s version has a method argument to indicate the compression method. These are the valid values: **SNAPPY, ZLIB, ZSTD, BZ2**  The compression level is specified in parentheses and must be a non-negative integer |
| CONNECTIONPROPERTY | *\*to be defined* |  |
| CONTEXT_INFO | *\*to be defined* |  |
| CURRENT_REQUEST_ID | *\*to be defined* |  |
| CURRENT_TRANSACTION_ID | *\*to be defined* |  |
| DECOMPRESS | *\*to be defined* | Snowflake has two functions for these: **DECOMPRESS_BINARY** and **DECOMPRESS_STRING**​ |
| ERROR_LINE | *\*to be defined* | SnowScript: Not supported in Snowflake with **[SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)**.  JavaScript: Will map to **ERROR_LINE** helper. EXEC helper will capture the Exception line property from the stack trace. |
| ERROR_MESSAGE | SQLERRM | Added **SSC-FDM-TS0023** returned error message could be different in Snowflake. |
| ERROR_NUMBER | *\*to be defined* | SnowScript: Not supported in Snowflake with **[SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)**.  JavaScript: Will map to **ERROR_NUMBER** helper. EXEC helper will capture the Exception code property. |
| ERROR_PROCEDURE | *Mapped* | SnowScript: Use current procedure name, added **SSC-FDM-TS0023** result value is based on the stored procedure where the function is called instead of where the exception occurs.  JavaScript: Will map to **ERROR_PROCEDURE** helper, taken from the `arguments.callee.name` procedure property |
| ERROR_SEVERITY | *\*to be defined* | SnowScript: Not supported in Snowflake with **[SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md)**. |
| ERROR_STATE | SQLSTATE | SnowScript: Converted to **SQLSTATE** snowflake property, added **SSC-FDM-TS0023** returned value could be different in Snowflake.  JavaScript: Helper will capture Exception state property |
| FORMATMESSAGE | FORMATEMESSAGE_UDF | It creates a UDF to emulate the same behavior of FORMATMESSAGE function but with some limitations. |
| GET_FILESTREAM_TRANSACTION_CONTEXT | *\*to be defined* |  |
| GETANSINULL | *\*to be defined* |  |
| HOST_ID | *\*to be defined* |  |
| HOST_NAME | *\*to be defined* |  |
| ISNULL | NVL |  |
| ISNUMERIC | *\*to be defined* | No direct equivalent but can be mapped to a custom UDF, returning the same values as in TSQL. |
| MIN_ACTIVE_ROWVERSION | *\*to be defined* | ​ |
| NEWID | *\*to be defined* | ​Maps to UUID_STRING |
| NEWSEQUENTIALID | *\*to be defined* | ​ |
| ROWCOUNT_BIG | *\*to be defined* | ​ |
| SESSION_CONTEXT | *\*to be defined* | ​ |
| SESSION_ID | *\*to be defined* | ​ |
| XACT_STATE | *\*to be defined* | ​ |

## System Statistical

| TransactSql | Snowflake | Notes |
| --- | --- | --- |
| @@CONNECTIONS | *\*to be defined* | ​Snowflake’s a similar function: **LOGIN_HISTORY.**  Returns login events within a specified time range |
| @@PACK_RECEIVED | *\*to be defined* |  |
| @@CPU_BUSY | *\*to be defined* |  |
| @@PACK_SENT | *\*to be defined* |  |
| @@TIMETICKS | *\*to be defined* |  |
| @@IDLE | *\*to be defined* |  |
| @@TOTAL_ERRORS | *\*to be defined* |  |
| @@IO_BUSY | *\*to be defined* |  |
| @@TOTAL_READ | *\*to be defined* |  |
| @@PACKET_ERRORS | *\*to be defined* |  |
| @@TOTAL_WRITE | *\*to be defined* |  |

## Text & Image

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| TEXTPTR | *\*to be defined* |  |
| TEXTVALID | *\*to be defined* |  |

## Trigger

| TransactSQL | Snowflake | Notes |
| --- | --- | --- |
| COLUMNS_UPDATED | *\*to be defined* |  |
| EVENTDATA | *\*to be defined* |  |
| TRIGGER_NESTLEVEL | *\*to be defined* |  |
| UPDATE | *\*to be defined* |  |

# System functions

This section describes the functional equivalents of system functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to the creation of UDFs in Snowflake.

## ISNULL

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Replaces NULL with the specified replacement value. ([ISNULL in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/isnull-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
ISNULL ( check_expression , replacement_value )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/nvl.html)

```sql
NVL( <expr1> , <expr2> )
```

### Examples

#### SQL Server

```sql
SELECT ISNULL(NULL, 'SNOWFLAKE') AS COMPANYNAME;
```

**Result:**

| COMPANYNAME |
| --- |
| SNOWFLAKE |

##### Snowflake SQL

```sql
SELECT
NVL(NULL, 'SNOWFLAKE') AS COMPANYNAME;
```

**Result:**

| COMPANYNAME |
| --- |
| SNOWFLAKE |

## NEWID

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Creates a unique value of type uniqueidentifier. ([NEWID in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/newid-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
NEWID ( )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/nvl.html)

```sql
UUID_STRING()
```

### Examples

> **Warning:**
>
> Outputs may differ because it generates a unique ID in runtime

#### SQL Server

```sql
SELECT NEWID ( ) AS ID;
```

**Result:**

| ID |
| --- |
| 47549DDF-837D-41D2-A59C-A6BC63DF7910 |

##### Snowflake SQL

```sql
SELECT
UUID_STRING( ) AS ID;
```

**Result:**

| ID |
| --- |
| 6fd4312a-7925-4ad9-85d8-e039efd82089 |

## NULLIF

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a null value if the two specified expressions are equal.

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
NULLIF ( check_expression , replacement_value )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/nullif.html)

```sql
NULLIF( <expr1> , <expr2> )
```

### Examples

#### SQL Server

```sql
SELECT NULLIF(6,9) AS RESULT1, NULLIF(5,5) AS RESULT2;
```

**Result:**

| RESULT1 | RESULT2 |
| --- | --- |
| 6 | null |

##### Snowflake SQL

```sql
SELECT
NULLIF(6,9) AS RESULT1,
NULLIF(5,5) AS RESULT2;
```

**Result:**

| RESULT1 | RESULT2 |
| --- | --- |
| 6 | null |

## @@ROWCOUNT

Applies to

* SQL Server

### Description

Returns the number of rows affected by the last statement. ([@@ROWCOUNT in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/rowcount-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
@@ROWCOUNT
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/dml-status)

```sql
SQLROWCOUNT
```

### Examples

#### SQL Server

```sql
CREATE TABLE table1
(
    column1 INT
);

CREATE PROCEDURE procedure1
AS
BEGIN
    declare @addCount int = 0;

    INSERT INTO table1 (column1) VALUES (1),(2),(3);
    set @addCount = @addCount + @@ROWCOUNT

   select @addCount
END
;
GO

EXEC procedure1;
```

**Result:**

|  |
| --- |
| 3 |

##### Snowflake SQL

```sql
CREATE OR REPLACE TABLE table1
(
    column1 INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/13/2024",  "domain": "test" }}'
;

CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/13/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ADDCOUNT INT := 0;
        ProcedureResultSet RESULTSET;
    BEGIN

        INSERT INTO table1 (column1) VALUES (1),(2),(3);
        ADDCOUNT := :ADDCOUNT + SQLROWCOUNT;
        ProcedureResultSet := (

       select
            :ADDCOUNT);
        RETURN TABLE(ProcedureResultSet);
    END;
$$;

CALL procedure1();
```

**Result:**

| :ADDCOUNT |
| --- |
| 3 |

## FORMATMESSAGE

Applies to

* SQL Server

### Description

Constructs a message from an existing message in sys.messages or from a provided string. ([FORMATMESSAGE in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/formatmessage-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

Since Snowflake does not support `FORMATMESSAGE` function, the FORMATMESSAGE_UDF is added to simulate its behavior.

### Syntax

#### SQL Server

```sql
FORMATMESSAGE ( { msg_number  | ' msg_string ' | @msg_variable} , [ param_value [ ,...n ] ] )
```

### Examples

#### SQL Server

```sql
SELECT FORMATMESSAGE('This is the %s and this is the %s.', 'first variable', 'second variable') AS RESULT;
```

**Result:**

| RESULT |
| --- |
| This is the first variable and this is the second variable. |

#### Snowflake

```sql
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('This is the %s and this is the %s.', ARRAY_CONSTRUCT('first variable', 'second variable')) AS RESULT;
```

**Result:**

| RESULT |
| --- |
| This is the first variable and this is the second variable. |

### Related EWIs

1. [SSC-FDM-TS0008](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): FORMATMESSAGE function was converted to UDF.

## FORMATMESSAGE_UDF

Snowflake does not have a function with the functionality of `FORMATMESSAGE`. SnowConvert AI generates the following Python UDF to emulate the behavior of `FORMATMESSAGE`.

```sql
CREATE OR REPLACE FUNCTION FORMATMESSAGE_UDF(MESSAGE STRING, ARGS ARRAY)
RETURNS STRING
LANGUAGE python
IMMUTABLE
RUNTIME_VERSION = '3.8'
HANDLER = 'format_py'
as
$$
def format_py(message,args):
  return message % (*args,)
$$;
```

This UDF may not work correctly on some cases:

* Using the `%I64d` placeholder will throw an error.
* If the number of substitution arguments is different than the number of place holders, it will throw an error.
* Some unsigned placeholders like `%u` or `%X` will not behave properly when formatting the value.
* It cannot handle message_ids.

## String functions

This section describes the functional equivalents of string functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to the creation of UDFs in Snowflake.

## CHAR

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a single-byte character with the integer sent as a parameter on the ASCII table ([CHAR in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/char-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
CHAR( expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/chr.html)

```sql
{CHR | CHAR} ( <input> )
```

##### JavaScript

[JavaScript complete documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/fromCharCode)

```sql
String.fromCharCode( expression1, ... , expressionN )
```

### Examples

#### SQL Server

```sql
SELECT CHAR(170) AS SMALLEST_A
```

**Output:**

| SMALLEST_A |
| --- |
| ª |

##### Snowflake SQL

```sql
SELECT
CHAR(170) AS SMALLEST_A;
```

**Result:**

| SMALLEST_A |
| --- |
| ª |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION get_char(expression float)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
  return String.fromCharCode( EXPRESSION );
$$;

SELECT GET_CHAR(170) SMALLEST_A;
```

**Result:**

| SMALLEST_A |
| --- |
| ª |

## CHARINDEX

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the index of the first occurrence of the specified value sent as a parameter when it matches ([CHARINDEX in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/charindex-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
CHARINDEX( expression_to_find, expression_to_search [, start] )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/charindex.html)

```sql
CHARINDEX( <expr1>, <expr2> [ , <start_pos> ] )
```

##### JavaScript

[JavaScript complete documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/indexOf)

```sql
String.indexOf( search_value [, index] )
```

### Examples

#### SQL Server

```sql
SELECT CHARINDEX('t', 'Customer') AS MatchPosition;
```

**Result:**

| INDEX |
| --- |
| 33 |

##### Snowflake SQL

```sql
SELECT
CHARINDEX('t', 'Customer') AS MatchPosition;
```

**Result:**

| INDEX |
| --- |
| 33 |

##### JavaScript

> **Note:**
>
> Indexes in Transact start at 1, instead of JavaScript which start at 0.

```sql
CREATE OR REPLACE FUNCTION get_index
(
  expression_to_find varchar,
  expression_to_search varchar,
  start_index  float
)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
  return EXPRESSION_TO_SEARCH.indexOf(EXPRESSION_TO_FIND, START_INDEX)+1;
$$;

SELECT GET_INDEX('and', 'Give your heart and soul to me, and life will always be la vie en rose', 20) AS INDEX;
```

**Result:**

| INDEX |
| --- |
| 33 |

## COALESCE

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Evaluates the arguments in order and returns the current value of the first expression that initially doesn’t evaluate to NULL. For example,SELECT COALESCE(NULL, NULL, ‘third_value’, ‘fourth_value’); returns the third value because the third value is the first value that isn’t null. ([COALESCE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/coalesce-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
COALESCE ( expression [ ,...n ] )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/coalesce.html)

```sql
COALESCE( <expr1> , <expr2> [ , ... , <exprN> ] )
```

### Examples

#### SQL Server

```sql
SELECT TOP 10 StartDate,
COALESCE(EndDate,'2000-01-01') AS FIRST_NOT_NULL
FROM HumanResources.EmployeeDepartmentHistory
```

**Result:**

| StartDate | FIRST_NOT_NULL |
| --- | --- |
| 2009-01-14 | 2000-01-01 |
| 2008-01-31 | 2000-01-01 |
| 2007-11-11 | 2000-01-01 |
| 2007-12-05 | 2010-05-30 |
| 2010-05-31 | 2000-01-01 |
| 2008-01-06 | 2000-01-01 |
| 2008-01-24 | 2000-01-01 |
| 2009-02-08 | 2000-01-01 |
| 2008-12-29 | 2000-01-01 |
| 2009-01-16 | 2000-01-01 |

##### Snowflake SQL

```sql
SELECT TOP 10
StartDate,
COALESCE(EndDate,'2000-01-01') AS FIRST_NOT_NULL
FROM
HumanResources.EmployeeDepartmentHistory;
```

**Result:**

| StartDate | FIRST_NOT_NULL |
| --- | --- |
| 2009-01-14 | 2000-01-01 |
| 2008-01-31 | 2000-01-01 |
| 2007-11-11 | 2000-01-01 |
| 2007-12-05 | 2010-05-30 |
| 2010-05-31 | 2000-01-01 |
| 2008-01-06 | 2000-01-01 |
| 2008-01-24 | 2000-01-01 |
| 2009-02-08 | 2000-01-01 |
| 2008-12-29 | 2000-01-01 |
| 2009-01-16 | 2000-01-01 |

## CONCAT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Makes a concatenation of string values with others. ([CONCAT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/concat-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
CONCAT ( string_value1, string_value2 [, string_valueN ] )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/concat.html)

```sql
CONCAT( <expr1> [ , <exprN> ... ] )

<expr1> || <expr2> [ || <exprN> ... ]
```

##### JavaScript

[JavaScript complete documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/concat)

```sql
 String.concat( expression1, ..., expressionN )
```

### Examples

#### SQL Server

```sql
SELECT CONCAT('Ray',' ','of',' ','Light') AS TITLE;
```

**Output:**

| TITLE |
| --- |
| Ray of Light |

##### Snowflake SQL

```sql
SELECT
CONCAT('Ray',' ','of',' ','Light') AS TITLE;
```

**Output:**

| TITLE |
| --- |
| Ray of Light |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION concatenate_strs(strings array)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
  var result = ""
  STRINGS.forEach(element => result = result.concat(element));
  return result;
$$;
SELECT concatenate_strs(array_construct('Ray',' ','of',' ','Light')) TITLE;
```

**Output:**

```none
   TITLE|
```

————|
Ray of Light|

## LEFT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the right part of a character string with the specified number of characters. ([RIGHT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/right-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
LEFT ( character_expression , integer_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/left.html)

```sql
LEFT ( <expr> , <length_expr> )
```

##### JavaScript

Function used to emulate the behavior

```sql
function LEFT(string, index){
    if(index < 0){
        throw new RangeError('Invalid INDEX on LEFT function');
    }
    return string.slice( 0, index);
  }
return LEFT(STR, INDEX);
```

### Examples

#### SQL Server

```sql
SELECT LEFT('John Smith', 5) AS FIRST_NAME;
```

**Output:**

| FIRST_NAME |
| --- |
| John |

##### Snowflake SQL

```sql
SELECT LEFT('John Smith', 5) AS FIRST_NAME;
```

**Output:**

| FIRST_NAME |
| --- |
| John |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION left_str(str varchar, index float)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
    function LEFT(string, index){
      if(index < 0){
          throw new RangeError('Invalid INDEX on LEFT function');
      }
      return string.slice( 0, index);
    }
  return LEFT(STR, INDEX);
$$;
SELECT LEFT_STR('John Smith', 5) AS FIRST_NAME;
```

**Output:**

| FIRST_NAME |
| --- |
| John |

## LEN

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the length of a string ([LEN in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/len-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
LEN( string_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/length.html)

```sql
LENGTH( <expression> )
LEN( <expression> )
```

##### JavaScript

[JavaScript SQL complete documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/String/length)

```sql
 string.length
```

### Examples

#### SQL Server

```sql
SELECT LEN('Sample text') AS [LEN];
```

**Output:**

| LEN |
| --- |
| 11 |

##### Snowflake SQL

```sql
SELECT LEN('Sample text') AS LEN;
```

**Output:**

| LEN |
| --- |
| 11 |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION get_len(str varchar)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  return STR.length;
$$;
SELECT GET_LEN('Sample text') LEN;
```

**Output:**

| LEN |
| --- |
| 11 |

## LOWER

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts a string to lowercase ([LOWER in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/lower-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
LOWER ( character_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/lower.html)

```sql
LOWER( <expr> )
```

##### JavaScript

[JavaScript SQL complete documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/toLowerCase)

```sql
 String.toLowerCase( )
```

### Examples

#### SQL Server

```sql
SELECT LOWER('YOU ARE A PREDICTION OF THE GOOD ONES') AS LOWERCASE;
```

**Output:**

| LOWERCASE |
| --- |
| you are a prediction of the good ones |

##### Snowflake SQL

```sql
SELECT LOWER('YOU ARE A PREDICTION OF THE GOOD ONES') AS LOWERCASE;
```

**Output:**

| LOWERCASE |
| --- |
| you are a prediction of the good ones |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION to_lower(str varchar)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
  return STR.toLowerCase();
$$;

SELECT TO_LOWER('YOU ARE A PREDICTION OF THE GOOD ONES') LOWERCASE;
```

**Output:**

| LOWERCASE |
| --- |
| you are a prediction of the good ones |

## NCHAR

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the UNICODE character of an integer sent as a parameter ([NCHAR in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/nchar-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

```sql
NCHAR( expression )
```

##### Arguments

`expression`: Integer expression.

##### Return Type

String value, it depends on the input received.

### Examples

#### Query

```sql
SELECT NCHAR(170);
```

##### Result

|  |
| --- |
| ª |

> **Note:**
>
> The equivalence for this function in JavaScript is documented in CHAR.

## REPLACE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Replaces all occurrences of a specified string value with another string value. ([REPLACE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/replace-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
REPLACE ( string_expression , string_pattern , string_replacement )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/replace.html)

```sql
REPLACE( <subject> , <pattern> [ , <replacement> ] )
```

##### JavaScript

```sql
 String.replace( pattern, new_expression)
```

### Examples

#### SQL Server

```sql
SELECT REPLACE('Real computer software', 'software','science') AS COLUMNNAME;
```

**Output:**

```sql
COLUMNNAME           |
---------------------|
Real computer science|
```

##### Snowflake SQL

```sql
SELECT REPLACE('Real computer software', 'software','science') AS COLUMNNAME;
```

**Output:**

```sql
COLUMNNAME           |
---------------------|
Real computer science|
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION REPLACER (str varchar, pattern varchar, new_expression varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
   return STR.replace( PATTERN, NEW_EXPRESSION );
$$;

SELECT REPLACER('Real computer software', 'software', 'science') AS COLUMNNAME;
```

**Output:**

```sql
COLUMNNAME             |
---------------------|
Real computer science|
```

## REPLICATE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Replicates a string value a specified number of times ([REPLICATE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/replicate-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
REPLICATE( string_expression, number_expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/repeat.html)

```sql
REPEAT(<input>, <n>)
```

##### JavaScript

[JavaScript Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/String/repeat)

```sql
String.repeat( number_expression )
```

### Examples

#### SQL Server

```sql
SELECT REPLICATE('Staying alive',5) AS RESULT
```

**Result:**

```sql
RESULT                                                           |
-----------------------------------------------------------------|
Staying aliveStaying aliveStaying aliveStaying aliveStaying alive|
```

##### Snowflake SQL

```sql
SELECT REPEAT('Staying alive',5) AS RESULT;
```

**Result:**

```sql
RESULT                                                           |
-----------------------------------------------------------------|
Staying aliveStaying aliveStaying aliveStaying aliveStaying alive|
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION REPEAT_STR (str varchar, occurrences float)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$

   return STR.repeat( OCCURRENCES );
$$;

SELECT REPEAT_STR('Staying alive ', 5) AS RESULT;
```

**Result:**

```sql
RESULT                                                           |
-----------------------------------------------------------------|
Staying aliveStaying aliveStaying aliveStaying aliveStaying alive|
```

## RIGHT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the right part of a character string with the specified number of characters. ([RIGHT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/right-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
RIGHT ( character_expression , integer_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/right.html)

```sql
RIGHT( <expr> , <length_expr> )
```

##### JavaScript

UDF used to emulate the behavior

```sql
 function RIGHT(string, index){
      if(index< 0){
          throw new RangeError('Invalid INDEX on RIGHT function');
      }
      return string.slice( string.length - index, string.length );
    }
```

### Examples

#### SQL Server

```sql
SELECT RIGHT('John Smith', 5) AS LAST_NAME;
```

**Output:**

```sql
   LAST_NAME|
------------|
       Smith|
```

##### Snowflake SQL

```sql
SELECT RIGHT('John Smith', 5) AS LAST_NAME;
```

**Output:**

```sql
   LAST_NAME|
------------|
       Smith|
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION right_str(str varchar, index float)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
    function RIGHT(string, index){
      if(index< 0){
          throw new RangeError('Invalid INDEX on RIGHT function');
      }
      return string.slice( string.length - index, string.length );
    }
  return RIGHT(STR, INDEX);
$$;

SELECT RIGHT_STR('John Smith', 5) AS LAST_NAME;
```

**Output:**

```sql
   LAST_NAME|
------------|
       Smith|
```

## RTRIM

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a character expression after it removes leading blanks ([RTRIM in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/rtrim-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
RTRIM( string_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/rtrim.html)

```sql
RTRIM(<expr> [, <characters> ])
```

##### JavaScript

Custom function used to emulate the behavior

```sql
 function RTRIM(string){
    return string.replace(/s+$/,"");
}
```

### Examples

#### SQL Server

**Input:**

```sql
SELECT RTRIM('LAST TWO BLANK SPACES  ') AS [RTRIM]
```

**Output:**

```sql
RTRIM                |
---------------------|
LAST TWO BLANK SPACES|
```

##### Snowflake SQL

```sql
SELECT RTRIM('LAST TWO BLANK SPACES  ') AS RTRIM;
```

**Result:**

```sql
RTRIM                |
---------------------|
LAST TWO BLANK SPACES|
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION rtrim(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  function RTRIM(string){
    return string.replace(/s+$/,"");
    }
   return RTRIM( STR );
$$;

SELECT RTRIM('LAST TWO BLANK SPACES  ') AS RTRIM;
```

**Result:**

```sql
RTRIM                |
---------------------|
LAST TWO BLANK SPACES|
```

## SPACE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a number of occurrences of blank spaces ([SPACE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/space-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SPACE ( integer_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/space.html)

```sql
SPACE(<n>)
```

##### JavaScript

Custom function used to emulate the behavior

```sql
 function SPACE( occurrences ){
    return ' '.repeat( occurrences );
}
```

### Examples

#### SQL Server

**Input:**

```sql
SELECT CONCAT('SOME', SPACE(5), 'TEXT') AS RESULT;
```

**Output:**

```sql
RESULT       |
-------------|
SOME     TEXT|
```

##### Snowflake SQL

**Input:**

```sql
SELECT CONCAT('SOME', SPACE(5), 'TEXT') AS RESULT;
```

**Output:**

```sql
RESULT       |
-------------|
SOME     TEXT|
```

##### JavaScript

**Input:**

```sql
 CREATE OR REPLACE FUNCTION SPACE(occurrences float)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
    function SPACE( occurrences ){
    return ' '.repeat( occurrences );
    }
    return SPACE( OCCURRENCES );
$$;

SELECT CONCAT('SOME', SPACE(5), 'TEXT') RESULT;
```

**Output:**

```sql
RESULT       |
-------------|
SOME     TEXT|
```

## SUBSTRING

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a character expression after it removes leading blanks ([RTRIM in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/rtrim-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SUBSTRING( string_expression, start, length )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/substr.html)

```sql
SUBSTR( <base_expr>, <start_expr> [ , <length_expr> ] )

SUBSTRING( <base_expr>, <start_expr> [ , <length_expr> ] )
```

##### JavaScript

Custom function used to emulate the behavior

```sql
 string.substring( indexA [, indexB])
```

### Examples

#### SQL Server

**Input:**

```sql
SELECT SUBSTRING('abcdef', 2, 3) AS SOMETEXT;
```

**Output:**

```sql
SOMETEXT|
--------|
bcd     |
```

##### Snowflake SQL

```sql
SELECT SUBSTRING('abcdef', 2, 3) AS SOMETEXT;
```

**Result:**

```sql
SOMETEXT|
--------|
bcd     |
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION REPLACER_LENGTH(str varchar, index float, length float)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
    var start = INDEX - 1;
    var end = STR.length - (LENGTH - 1);
    return STR.substring(start, end);
$$;

SELECT REPLACER_LENGTH('abcdef', 2, 3) AS SOMETEXT;
```

**Result:**

```sql
SOMETEXT|
--------|
bcd     |
```

## UPPER

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts a string to uppercase ([UPPER in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/upper-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
UPPER( string_expression )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/upper.html)

```sql
UPPER( <expr> )
```

##### JavaScript

[JavaScript SQL complete documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/String/toUpperCase)

```sql
 String.toUpperCase( )
```

### Examples

#### SQL Server

```sql
SELECT UPPER('you are a prediction of the good ones') AS [UPPER]
```

**Output:**

```sql
+-------------------------------------|
|UPPER                                |
+-------------------------------------|
|YOU ARE A PREDICTION OF THE GOOD ONES|
+-------------------------------------|
```

##### Snowflake SQL

```sql
SELECT
UPPER('you are a prediction of the good ones') AS UPPER;
```

**Output:**

```sql
+-------------------------------------|
|UPPER                                |
+-------------------------------------|
|YOU ARE A PREDICTION OF THE GOOD ONES|
+-------------------------------------|
```

##### JavaScript

```sql
 CREATE OR REPLACE FUNCTION to_upper(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  return STR.toUpperCase();
$$;

SELECT TO_UPPER('you are a prediction of the good ones') UPPER;
```

**Output:**

```sql
UPPER                                |
-------------------------------------|
YOU ARE A PREDICTION OF THE GOOD ONES|
```

## ASCII

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the number code of a character on the ASCII table ([ASCII in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/ascii-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
ASCII( expression )
```

#### Arguments

`expression`: `VARCVHAR` or `CHAR` expression.

#### Return Type

`INT`.

### Examples

### Query

```sql
SELECT ASCII('A') AS A , ASCII('a') AS a;
```

#### Result

```sql
          A|          a|
-----------| ----------|
         65|         97|
```

## ASCII in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the number code of a character on the ASCII table ([JavaScript charCodeAt function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/String/charCodeAt)).

### Sample Source Pattern

#### Syntax

```sql
 string.charCodeAt( [index] )
```

##### Arguments

`index`(Optional): Index of string to get character and return its code number on the ASCII table. If this parameter is not specified, it takes 0 as default. \

##### Return Type

`Int`.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION get_ascii(c char)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  return C.charCodeAt();
$$;

SELECT GET_ASCII('A') A, GET_ASCII('a') a;
```

##### Result

```sql
          A|          a|
-----------| ----------|
         65|         97|
```

## QUOTENAME

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a string delimited using quotes ([QUOTENAME in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/quotename-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
QUOTENAME( string_expression [, quote_character])
```

#### Arguments

`string_expression`: String to delimit.

`quote_character`: one-character to delimit the string.

#### Return Type

`NVARCHAR(258)`. Null if the quote is different of (‘), ([]), (“), ( () ), ( >< ), ({}) or (`).

### Examples

### Query

```sql
SELECT QUOTENAME('Hello', '`') AS HELLO;
```

#### Result

```sql
    HELLO|
---------|
  `Hello`|
```

## QUOTENAME in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, this function is not available in JavaScript, but it can be implemented using predefined functions.

### Sample Source Pattern

#### Implementation Example

```sql
 function QUOTENAME(string, quote){
    return quote.concat(string, quote);
}
```

##### Arguments

`string`: String expression to delimit.

`quote`: Quote to be used as a delimiter.

##### Return Type

String.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION QUOTENAME(str varchar, quote char)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
  function QUOTENAME(string, quote){
    const allowed_quotes = /[\']|[\"]|[(]|[)]|[\[]|[\]]|[\{]|[\}]|[\`]/;

    if(!allowed_quotes.test(quote)) throw new TypeError('Invalid Quote');

    return quote.concat(string, quote);
  }
   return QUOTENAME(STR, QUOTE);
$$;

SELECT QUOTENAME('Hola', '`') HELLO;
```

##### Result

```sql
    HELLO|
---------|
  `Hello`|
```

## CONCAT_WS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Makes a concatenation of string values with others using a separator between them ([CONCAT_WS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/concat-ws-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
CONCAT_WS( separator, expression1, ... ,expressionN )
```

#### Arguments

`separator`: Separator to join.

`expression1, ... ,expressionN:` Expression to be found into a string.

#### Return Type

String value, depends on the input received.

### Examples

### Query

```sql
SELECT CONCAT_WS(' ', 'Mariah','Carey') AS NAME;
```

#### Result

```sql
        NAME|
------------|
Mariah Carey|
```

## Join in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Concatenates the string arguments to the calling string using a separator ([JavaScript Join function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Array/join)).

### Sample Source Pattern

#### Syntax

```sql
 Array.join( separator )
```

##### Arguments

`separator`: Character to join.

##### Return Type

`String`.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION join_strs(separator varchar, strings array)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  return STRINGS.join(SEPARATOR);
$$;
SELECT join_strs(' ',array_construct('Mariah','Carey')) NAME;
```

##### Result

```sql
        NAME|
------------|
Mariah Carey|
```

## SOUNDEX

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a four-character code to evaluate the similarity of two strings ([SOUNDEX in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/soundex-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
SOUNDEX( string_expression )
```

#### Arguments

`string_expression`: String expression to reverse.

#### Return Type

The same data type of the string expression sent as a parameter.

### Examples

### Query

```sql
SELECT SOUNDEX('two') AS TWO , SOUNDEX('too') AS TOO;
```

#### Result

```sql
      TWO|      TOO|
---------|---------|
     T000|     T000|
```

## SOUNDEX in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, JavaScript does not provide a method that executes the SOUNDEX algorithm, but it can be implemented manually.

### Sample Source Pattern

#### Implementation Example

```sql
 const dic = {A:0, B:1, C:2, D:3, E:0, F:1, G:2, H:0, I:0, J:2, K:2, L:4, M:5, N:5, O:0, P:1, Q:2, R:6, S:2, T:3, U:0, V:1, W:0, X:2, Y:0, Z:2};

  function getCode(letter){
      return dic[letter.toUpperCase()];
  }

  function SOUNDEX(word){
    var initialCharacter = word[0].toUpperCase();
    var initialCode = getCode(initialCharacter);
    for(let i = 1; i < word.length; ++i) {
        const letterCode = getCode(word[i]);
        if (letterCode && letterCode != initialCode) {
             initialCharacter += letterCode;
             if(initialCharacter.length == 4) break;
        }
        initialCode = letterCode;
    }

      return initialCharacter.concat( '0'.repeat( 4 - initialCharacter.length));

  }
```

##### Arguments

`word`: String expression to get its SOUNDEX equivalence.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION get_soundex(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  const dic = {A:0, B:1, C:2, D:3, E:0, F:1, G:2, H:0, I:0, J:2, K:2, L:4, M:5, N:5, O:0, P:1, Q:2, R:6, S:2, T:3, U:0, V:1, W:0, X:2, Y:0, Z:2};

  function getCode(letter){
      return dic[letter.toUpperCase()];
  }

  function SOUNDEX(word){
    var initialCharacter = word[0].toUpperCase();
    var initialCode = getCode(initialCharacter);
    for(let i = 1; i < word.length; ++i) {
        const letterCode = getCode(word[i]);
        if (letterCode && letterCode != initialCode) {
             initialCharacter += letterCode;
             if(initialCharacter.length == 4) break;
        }
        initialCode = letterCode;
    }

    return initialCharacter.concat( '0'.repeat( 4 - initialCharacter.length));
  }

  return SOUNDEX( STR );
$$;

SELECT GET_SOUNDEX('two') AS TWO , GET_SOUNDEX('too') AS TOO;
```

##### Result

```sql
      TWO|      TOO|
---------|---------|
     T000|     T000|
```

## REVERSE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Reverses a string ([REVERSE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/reverse-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
REVERSE( string_expression )
```

#### Arguments

`string_expression`: String expression to reverse.

#### Return Type

The same data type of the string expression sent as a parameter.

### Examples

### Query

```sql
SELECT REVERSE('rotator') AS PALINDROME;
```

#### Result

```sql
      PALINDROME|
----------------|
         rotator|
```

## reverse in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, this function is not available in JavaScript, but it can be implemented using predefined functions.

### Sample Source Pattern

#### Implementation Example

```sql
 function REVERSE(string){
    return string.split("").reverse().join("");
}
```

##### Arguments

`string`: String expression to reverse.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION REVERSE(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
   return STR.split("").reverse().join("");
$$;

SELECT REVERSE('rotator') PALINDROME;
```

##### Result

```sql
      PALINDROME|
----------------|
         rotator|
```

## STRING_ESCAPE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Escapes special characters in texts and returns text with escaped characters. ([STRING_ESCAPE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/string-escape-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
STRING_ESCAPE( text, type )
```

#### Arguments

`text`: Text to escape characters.

`type`: Format type to escape characters. Currently, JSON is the only format supported.

#### Return Type

`VARCHAR`.

### Examples

### Query

```sql
SELECT STRING_ESCAPE('\   /  \\    "     ', 'json') AS [ESCAPE];
```

#### Result

```sql
ESCAPE|
--------------------------|
  \\   \/  \\\\    \"     |
```

## stringify in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts an object to a JSON string format ([JavaScript stringify function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/JSON/stringify)).

### Sample Source Pattern

#### Syntax

```sql
 JSON.stringify( value )
```

##### Arguments

`value`: Object expression to convert.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION string_escape (str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
   return JSON.stringify( STR );
$$;

SELECT STRING_ESCAPE('\   /  \\    "     ') ESCAPE;
```

##### Result

```sql
                    ESCAPE|
--------------------------|
  \\   \/  \\\\    \"     |
```

## TRIM

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a character expression without blank spaces ([TRIM in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/trim-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
TRIM( string_expression )
```

#### Arguments

`string_expression:` String expressions to convert.

#### Return Type

`VARCHAR` or `NVARCHAR`

### Examples

### SQL Server

```sql
SELECT TRIM('  FIRST AND LAST TWO BLANK SPACES  ') AS [TRIM];
```

**Output:**

```sql
+-------------------------------|
|TRIM                           |
+-------------------------------|
|FIRST AND LAST TWO BLANK SPACES|
+-------------------------------|
```

#### Snowflake SQL

```sql
SELECT TRIM('  FIRST AND LAST TWO BLANK SPACES  ') AS TRIM;
```

**Output:**

```sql
+-------------------------------|
|TRIM                           |
+-------------------------------|
|FIRST AND LAST TWO BLANK SPACES|
+-------------------------------|
```

## trim in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Replaces the occurrences of a pattern using a new one sent as a parameter ([JavaScript Replace function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/String/replace)).

### Sample Source Pattern

#### Syntax

```sql
 String.trim( )
```

##### Arguments

This function does not receive any parameters.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION TRIM_STR(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
   return STR.trim( );
$$;

SELECT TRIM_STR('  FIRST AND LAST TWO BLANK SPACES  ')TRIM
```

##### Result

```sql
                           TRIM|
-------------------------------|
FIRST AND LAST TWO BLANK SPACES|
```

## DIFFERENCE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns an integer measuring the difference between two strings using the SOUNDEX algorithm ([DIFFERENCE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/difference-transact-sql?view=sql-server-ver15)).
It counts the common characters of the strings resulting by executing the SOUNDEX algorithm.

### Sample Source Pattern

### Syntax

```sql
DIFFERENCE( expression1, expression1 )
```

#### Arguments

`expression1, expression2:` String expressions to be compared.

#### Return Type

`Int`.

### Examples

### Query

```sql
SELECT DIFFERENCE('Like', 'Mike');
```

#### Result

```sql
    Output |
-----------|
         3 |
```

## DIFFERENCE in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, this functionality is not available in JS, but this can be implemented easily.

> **Note:**
>
> This functions requires the SOUNDEX algorithm implementation.

### Sample Source Pattern

#### Implementation Example

```sql
 function DIFFERENCE(strA, strB) {
    var count = 0;
    for (var i = 0; i < strA.length; i++){
       if ( strA[i] == strB[i] ) count++;
    }

    return count;
}
```

##### Arguments

`strA, strB`: String expressions resulting by executing the SOUNDEX algorithm.

##### Return Type

`String`.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION SOUNDEX_DIFFERENCE(str_1 varchar, str_2 varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
    function DIFFERENCE(strA, strB) {
      var count = 0;
      for (var i = 0; i < strA.length; i++){
         if ( strA[i] == strB[i] ) count++;
      }

    return count;
    }

    return DIFFERENCE(STR_1, STR_2);
$$;

SELECT SOUNDEX_DIFFERENCE(GET_SOUNDEX('two'), GET_SOUNDEX('too')) DIFFERENCE;
```

##### Result

```sql
   DIFFERENCE|
-------------|
            4|
```

## FORMAT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a value formatted with the specified format and optional culture ([FORMAT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/format-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
FORMAT( value, format [, culture])
```

#### Arguments

`value:` String expressions to give format.

format: Desired format.

culture (Optional): NVarchar argument specifying culture. If it is not specified, takes the languages of the current session.

#### Return Type

NULL if the culture parameter is invalid, otherwise, it follows the next data types:

| Category |  | .NET type |
| --- | --- | --- |
| Numeric | bigint | Int64 |
| Numeric | int | Int32 |
| Numeric | smallint | Int16 |
| Numeric | tinyint | Byte |
| Numeric | decimal | SqlDecimal |
| Numeric | numeric | SqlDecimal |
| Numeric | float | Double |
| Numeric | real | Single |
| Numeric | smallmoney | Decimal |
| Numeric | money | Decimal |
| Date and Time | date | DateTime |
| Date and Time | time | TimeSpan |
| Date and Time | datetime | DateTime |
| Date and Time | smalldatetime | DateTime |
| Date and Time | datetime2 | DateTime |
| Date and Time | datetimeoffset | DateTimeOffset |

### Examples

### Query

```sql
SELECT FORMAT(CAST('2022-01-24' AS DATE), 'd', 'en-gb')  AS 'Great Britain';
```

#### Result

```sql
  GREAT BRITAIN|
---------------|
     24/01/2022|
```

##### Query

```sql
SELECT FORMAT(244900.25, 'C', 'cr-CR')  AS 'CURRENCY';
```

##### Result

| CURRENCY |
| --- |
| ₡244,900.25 |

### Date/Time Custom Format Specifiers

SnowConvert AI translates many SQL Server custom date/time format specifiers to their Snowflake `TO_CHAR` equivalents. Some specifiers have behavioral differences and are flagged with [SSC-FDM-0036](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md).

The following date/time specifiers are translated with an FDM marker:

| SQL Server Specifier | Description | Snowflake Equivalent | Behavioral Difference |
| --- | --- | --- | --- |
| `dddd` | Full day name | `DY` | Snowflake returns abbreviated day names (e.g., “Mon” vs “Monday”) |
| `F`–`FFFFFFF` | Fractional seconds (1–7 digits, no trailing zeros) | `F1`–`F7` | Snowflake always includes trailing zeros |
| `z` | UTC offset (hours only) | `TZH` | Formatting differences in offset representation |

#### Date/Time Conversion Example

##### Query

```sql
SELECT FORMAT(CAST('12/12/2024' as datetime), 'dddd, MMMM dd yyyy HH:mm:ss.FFF');
```

##### Snowflake Equivalent

```sql
SELECT
 TO_CHAR(TO_TIMESTAMP_NTZ('12/12/2024'), 'DY, MMMM DD YYYY HH24:MI:SS.F3') /*** SSC-FDM-0036 - TRANSFORMATION OF dddd, MMMM dd yyyy HH:mm:ss.FFF FORMAT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/;
```

#### Related Issues

1. [SSC-FDM-0036](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): The transformed date format may have a different behavior in Snowflake.
2. [SSC-EWI-0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Generated for format specifiers that remain unsupported.

## FORMAT in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

There are different functions to format date and integer values in JavaScript. Unfortunately, these functionalities are not integrated into one method.

### DateTime values

#### Syntax

```sql
 Intl.DateTimeFormat( format ).format( value )
```

##### Arguments

`locales` (Optional): String expression of the format to apply.

`options` (Optional): Object with different supported properties for formats of numeric expressions ([JavaScript NumberFormat function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Intl/NumberFormat/NumberFormat)).

`value`: Numeric expression to format.

##### Return Type

`String`.

### Numeric values

#### Syntax

```sql
 Intl.NumberFormat( [locales [, options]] ).format( value )
```

##### Arguments

`locales` (Optional): String expression of the format to apply.

`options` (Optional): Object with different supported properties for formats of numeric expressions ([JavaScript NumberFormat function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Intl/NumberFormat/NumberFormat)).

`value`: Numeric expression to format.

##### Return Type

`String`.

### Examples

#### DateTime

##### Query

```sql
 CREATE OR REPLACE FUNCTION format_date(date timestamp, format varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  return new Intl.DateTimeFormat( FORMAT ).format( DATE );
$$;
SELECT FORMAT_DATE(TO_DATE('2022-01-24'), 'en-gb') GREAT_BRITAIN;
```

##### Result

```sql
  GREAT_BRITAIN|
---------------|
     24/01/2022|
```

#### Numeric

##### Query

```sql
 CREATE OR REPLACE FUNCTION format_numeric(number float, locales varchar, options variant)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  return new Intl.NumberFormat( LOCALES , OPTIONS ).format( NUMBER );
$$;
SELECT FORMAT_NUMERIC(244900.25, 'de-DE', PARSE_JSON('{ style: "currency", currency: "CRC" }')) CURRENCY;
```

##### Result

```sql
       CURRENCY|
---------------|
 244.900,25 CRC|
```

## PATINDEX

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the starting position of the first occurrence of a pattern in a specified expression ([PATINDEX in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/patindex-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
PATINDEX( pattern, expression )
```

#### Arguments

`pattern`: Pattern to find.

`expression`: Expression to search.

#### Return Type

Integer. Returns 0 if the pattern is not found.

### Examples

### Query

```sql
SELECT PATINDEX( '%on%', 'No, no, non esistono più') AS [PATINDEX]
```

#### Result

```sql
    PATINDEX|
------------|
          10|
```

## search in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Finds the index of a pattern using REGEX ([JavaScript search function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/String/search)).

### Sample Source Pattern

#### Syntax

```sql
 String.search( regex )
```

##### Arguments

`regex`: Regular expression which matches with the desired pattern.

##### Return Type

Integer. If the pattern does not match with any part of the string, returns -1.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION get_index_pattern(pattern varchar, str varchar)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
  function GET_PATTERN(pattern, string){
    return string.search(new RegExp( pattern ));
    }
   return GET_PATTERN(PATTERN, STR) + 1;
$$;

SELECT GET_INDEX_PATTERN('on+', 'No, no, non esistono più') PATINDEX;
```

##### Result

```sql
    PATINDEX|
------------|
          10|
```

## STR

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns character data converted from numeric data. The character data is right-justified, with a specified length and decimal precision. ([STR in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/str-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

### Syntax

#### SQL Server

```bnf
STR ( float_expression [ , length [ , decimal ] ] )
```

##### Snowflake SQL

```sql
STR_UDF( numeric_expression, number_format )
```

#### Arguments

`numeric_expression`: Float expression with a decimal point.

`length` (Optional): Length that the returning expression will have, including point notation, decimal, and float parts.

`decimal`(Optional): Is the number of places to the right of the decimal point.

#### Return Type

`VARCHAR`.

### Examples

### SQL Server

**Input:**

```sql
/* 1 */
SELECT STR(123.5);

/* 2 */
SELECT STR(123.5, 2);

/* 3 */
SELECT STR(123.45, 6);

/* 4 */
SELECT STR(123.45, 6, 1);
```

**Output:**

```sql
1) 124
2) **
3) 123
4) 123.5
```

#### Snowflake SQL

**Input:**

```sql
/* 1 */
SELECT
PUBLIC.STR_UDF(123.5, '99999');

/* 2 */
SELECT
PUBLIC.STR_UDF(123.5, '99');

/* 3 */
SELECT
PUBLIC.STR_UDF(123.45, '999999');

/* 4 */
SELECT
PUBLIC.STR_UDF(123.45, '9999.9');
```

**Output:**

```sql
1) 124

2) ##

3) 123
4) 123.5
```

## STR in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, this functionality is not available in JS, but it can be implemented easily using the predefined functions for strings.

### Sample Source Pattern

#### Implementation Example

```sql
 function validLength(number, max_length, float_precision) {
  var float_point = number.match(/[\.][0-9]+/);
  /*if the number does not have point float, checks if the float precision
   * and current number are greater than max_length
   */
   if(!float_point) return number.length + float_precision + 1 < max_length;
    //removes the '.' and checks if there is overflow with the float_precision
    return number.length - float_point[0].trim('.').length + float_precision  < max_length;
}
 function STR(number, max_length, float_precision) {
  var number_str = number.toString();
   //if the expression exceeds the max_length, returns '**'
   if(number_str.length > max_length || float_precision > max_length) return '**';
   if(validLength(number_str, max_length, float_precision)) {
      return number.toFixed(float_precision);
    }
    return number.toFixed(max_length - float_precision);
}
```

##### Arguments

`number`: Float expression with a decimal point.

`max_length`: Length that the returning expression will have, including point notation, decimal, and float parts.

`float_precision`: Is the number of places to the right of the decimal point.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION STR(number float, max_length float, float_precision float)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
    function validLength(number, max_length, float_precision) {
        var float_point = number.match(/[\.][0-9]+/);
        if(!float_point) return number.length + float_precision + 1 < max_length;
        return number.length - float_point[0].trim('.').length + float_precision  < max_length;
    }
    function STR(number, max_length, float_precision) {
      var number_str = number.toString();
      if(number_str.length > max_length || float_precision > max_length) return '**';
      if(validLength(number_str, max_length, float_precision)) {
        return number.toFixed(float_precision);
      }
      return number.toFixed(max_length - float_precision);
    }
    return STR( NUMBER, MAX_LENGTH, FLOAT_PRECISION );
$$;

SELECT STR(12345.674, 12, 6);
```

##### Result

```sql
           STR|
--------------|
  12345.674000|
```

## LTRIM

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a character expression after it removes leading blanks ([LTRIM in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/ltrim-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
LTRIM( string_expression )
```

#### Arguments

`string_expression:` String expressions to convert.

#### Return Type

`VARCHAR` or `NVARCHAR`

### Examples

### Query

```sql
SELECT LTRIM('  FIRST TWO BLANK SPACES') AS [LTRIM]
```

#### Result

```sql
                 LTRIM|
----------------------|
FIRST TWO BLANK SPACES|
```

## LTRIM in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, this function is not available in JavaScript, but it can be implemented using regular expressions.

### Sample Source Pattern

#### Implementation Example

```sql
 function LTRIM(string){
    return string.replace(/^s+/,"");
}
```

##### Arguments

`string`: String expression to remove blank spaces.

##### Return Type

String.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION ltrim(str varchar)
  RETURNS string
  LANGUAGE JAVASCRIPT
AS
$$
  function LTRIM(string){
    return string.replace(/^s+/,"");
    }
   return LTRIM(S TR );
$$;

SELECT LTRIM('  FIRST TWO BLANK SPACES') AS LTRIM;
```

##### Result

```sql
                 LTRIM|
----------------------|
FIRST TWO BLANK SPACES|
```

## Ranking functions

This section describes the functional equivalents of ranking functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to their usage in stored procedures in Snowflake.

## DENSE_RANK

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This function returns the rank of each row within a result set partition, with no gaps in the ranking values. The rank of a specific row is one plus the number of distinct rank values that come before that specific row. ([DENSE_RANK in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/dense-rank-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
 DENSE_RANK ( ) OVER ( [ <partition_by_clause> ] < order_by_clause > )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/dense_rank.html)

```sql
 DENSE_RANK( )
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '15' OF THE SOURCE CODE STARTING AT 'OVER'. EXPECTED 'BATCH' GRAMMAR. CODE '80'. **
--              OVER ( [ <partition_by_clause> ] < order_by_clause > )
```

### Examples

#### SQL Server

```sql
SELECT TOP 10 BUSINESSENTITYID, NATIONALIDNUMBER, RANK() OVER (ORDER BY NATIONALIDNUMBER) AS RANK FROM HUMANRESOURCES.EMPLOYEE AS TOTAL
```

**Result:**

```sql
BUSINESSENTITYID|NATIONALIDNUMBER|DENSE_RANK|
----------------|----------------|----------|
              57|10708100        |         1|
              54|109272464       |         2|
             273|112432117       |         3|
               4|112457891       |         4|
             139|113393530       |         5|
             109|113695504       |         6|
             249|121491555       |         7|
             132|1300049         |         8|
             214|131471224       |         9|
              51|132674823       |        10|
```

##### Snowflake SQL

```sql
SELECT TOP 10
BUSINESSENTITYID,
NATIONALIDNUMBER,
RANK() OVER (ORDER BY NATIONALIDNUMBER) AS RANK
FROM
HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

```sql
BUSINESSENTITYID|NATIONALIDNUMBER|DENSE_RANK|
----------------|----------------|----------|
              57|10708100        |         1|
              54|109272464       |         2|
             273|112432117       |         3|
               4|112457891       |         4|
             139|113393530       |         5|
             109|113695504       |         6|
             249|121491555       |         7|
             132|1300049         |         8|
             214|131471224       |         9|
              51|132674823       |        10|
```

#### Related EWIs

* [SSC-EWI-0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.

## RANK

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the rank of each row within the partition of a result set. The rank of a row is one plus the number of ranks that come before the row in question. ([RANK in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/rank-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
 RANK ( ) OVER ( [ partition_by_clause ] order_by_clause )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/rank.html)

```sql
 RANK( )
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '9' OF THE SOURCE CODE STARTING AT 'OVER'. EXPECTED 'BATCH' GRAMMAR. CODE '80'. **
--        OVER ( [ partition_by_clause ] order_by_clause )
```

### Examples

#### SQL Server

```sql
SELECT TOP 10 BUSINESSENTITYID, NATIONALIDNUMBER, RANK() OVER (ORDER BY NATIONALIDNUMBER) AS RANK FROM HUMANRESOURCES.EMPLOYEE AS TOTAL
```

**Result:**

```sql
BUSINESSENTITYID|NATIONALIDNUMBER|RANK|
----------------|----------------|----|
              57|10708100        |   1|
              54|109272464       |   2|
             273|112432117       |   3|
               4|112457891       |   4|
             139|113393530       |   5|
             109|113695504       |   6|
             249|121491555       |   7|
             132|1300049         |   8|
             214|131471224       |   9|
              51|132674823       |  10|
```

##### Snowflake SQL

```sql
SELECT TOP 10
BUSINESSENTITYID,
NATIONALIDNUMBER,
RANK() OVER (ORDER BY NATIONALIDNUMBER) AS RANK
FROM
HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

```sql
BUSINESSENTITYID|NATIONALIDNUMBER|RANK|
----------------|----------------|----|
              57|10708100        |   1|
              54|109272464       |   2|
             273|112432117       |   3|
               4|112457891       |   4|
             139|113393530       |   5|
             109|113695504       |   6|
             249|121491555       |   7|
             132|1300049         |   8|
             214|131471224       |   9|
              51|132674823       |  10|
```

#### Related EWIs

* [SSC-EWI-0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.

## ROW_NUMBER

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Numbers the output of a result set. More specifically, returns the sequential number of a row within a partition of a result set, starting at 1 for the first row in each partition. ([ROW_NUMBER in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/row-number-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
 ROW_NUMBER ( )
    OVER ( [ PARTITION BY value_expression , ... [ n ] ] order_by_clause )
```

##### Snowflake SQL

[Snowflake SQL complete documentation](https://docs.snowflake.com/en/sql-reference/functions/row_number.html)

```sql
 ROW_NUMBER( )
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '2' COLUMN '5' OF THE SOURCE CODE STARTING AT 'OVER'. EXPECTED 'BATCH' GRAMMAR. CODE '80'. **
--    OVER ( [ PARTITION BY value_expression , ... [ n ] ] order_by_clause )
```

### Examples

#### SQL Server

```sql
SELECT
ROW_NUMBER() OVER(ORDER BY NAME  ASC) AS RowNumber,
NAME
FROM HUMANRESOURCES.DEPARTMENT
```

**Output:**

```sql
RowNumber|NAME                      |
---------|--------------------------|
        1|Document Control          |
        2|Engineering               |
        3|Executive                 |
        4|Facilities and Maintenance|
        5|Finance                   |
        6|Human Resources           |
        7|Information Services      |
        8|Marketing                 |
        9|Production                |
       10|Production Control        |
       11|Purchasing                |
       12|Quality Assurance         |
       13|Research and Development  |
       14|Sales                     |
       15|Shipping and Receiving    |
       16|Tool Design               |
```

##### Snowflake SQL

```sql
SELECT
ROW_NUMBER() OVER(ORDER BY NAME ASC) AS RowNumber,
NAME
FROM
HUMANRESOURCES.DEPARTMENT;
```

**Output:**

```sql
RowNumber|NAME                      |
---------|--------------------------|
        1|Document Control          |
        2|Engineering               |
        3|Executive                 |
        4|Facilities and Maintenance|
        5|Finance                   |
        6|Human Resources           |
        7|Information Services      |
        8|Marketing                 |
        9|Production                |
       10|Production Control        |
       11|Purchasing                |
       12|Quality Assurance         |
       13|Research and Development  |
       14|Sales                     |
       15|Shipping and Receiving    |
       16|Tool Design               |
```

#### Related EWIs

* [SSC-EWI-0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.

## Logical functions

This section describes the functional equivalents of logical functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to their usage in stored procedures in Snowflake.

## IIF

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns one of two values, depending on whether the Boolean expression evaluates to true or false. ([IIF in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/logical-functions-iif-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
IIF( boolean_expression, true_value, false_value )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/iff.html)

```sql
IFF( <condition> , <expr1> , <expr2> )
```

### Examples

#### SQL Server

```sql
SELECT IIF( 2 > 3, 'TRUE', 'FALSE' ) AS RESULT
```

**Result:**

```sql
RESULT|
------|
 FALSE|
```

##### Snowflake SQL

```sql
SELECT
IFF( 2 > 3, 'TRUE', 'FALSE' ) AS RESULT;
```

**Result:**

```sql
RESULT|
------|
 FALSE|
```

## XML Functions

This section describes the translation of XML functions in Transact-SQL to Snowflake SQL.

## Query

Applies to

* SQL Server
> **Warning:**
>
> This transformation will be delivered in the future

### Description

Specifies an XQuery against an instance of the **xml** data type. The result is of **xml** type. The method returns an instance of untyped XML. ([`Query() in Transact-SQL`](https://learn.microsoft.com/en-us/sql/t-sql/xml/query-method-xml-data-type?view=sql-server-ver16))

### Sample Source Patterns

The following example details the transformation for .query( )

#### SQL Server

##### Input

```sql
 CREATE TABLE xml_demo(object_col XML);

INSERT INTO xml_demo (object_col)
   SELECT
        '<Root>
<ProductDescription ProductID="1" ProductName="Road Bike">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

INSERT INTO xml_demo (object_col)
   SELECT
        '<Root>
<ProductDescription ProductID="2" ProductName="Skate">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

SELECT
    xml_demo.object_col.query('/Root/ProductDescription/Features/Warranty') as Warranty,
    xml_demo.object_col.query('/Root/ProductDescription/Features/Maintenance') as Maintenance
from xml_demo;
```

##### Output

```sql
 Warranty                                     | Maintenance                                                                          |
----------------------------------------------|--------------------------------------------------------------------------------------|
<Warranty>1 year parts and labor</Warranty>   | <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>  |
<Warranty>1 year parts and labor</Warranty>   | <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>  |
```

##### Snowflake SQL

##### Input

```sql
 CREATE OR REPLACE TABLE xml_demo (
    object_col VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XML DATA TYPE CONVERTED TO VARIANT ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO xml_demo (object_col)
SELECT
        '<Root>
<ProductDescription ProductID="1" ProductName="Road Bike">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

INSERT INTO xml_demo (object_col)
SELECT
        '<Root>
<ProductDescription ProductID="2" ProductName="Skate">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

SELECT
    XMLGET(XMLGET(XMLGET(object_col, 'ProductDescription'), 'Features'), 'Warranty') as Warranty,
    XMLGET(XMLGET(XMLGET(object_col, 'ProductDescription'), 'Features'), 'Maintenance') as Maintenance
from
    xml_demo;
```

##### Output

```sql
 Warranty                                     | Maintenance                                                                          |
----------------------------------------------|--------------------------------------------------------------------------------------|
<Warranty>1 year parts and labor</Warranty>   | <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>  |
<Warranty>1 year parts and labor</Warranty>   | <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>  |
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## Value

Applies to

* SQL Server
> **Warning:**
>
> This transformation will be delivered in the future

### Description

Performs an XQuery against the XML and returns a value of SQL type. This method returns a scalar value. ([`value() in Transact-SQL`](https://learn.microsoft.com/en-us/sql/t-sql/xml/value-method-xml-data-type?view=sql-server-ver16)).

### Sample Source Patterns

The following example details the transformation for .value( )

#### SQL Server

##### Input

```sql
 CREATE TABLE xml_demo(object_col XML);

INSERT INTO xml_demo (object_col)
   SELECT
        '<Root>
<ProductDescription ProductID="1" ProductName="Road Bike">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

INSERT INTO xml_demo (object_col)
   SELECT
        '<Root>
<ProductDescription ProductID="2" ProductName="Skate">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

SELECT
    xml_demo.object_col.value('(/Root/ProductDescription/@ProductID)[1]', 'int' ) as ID,
    xml_demo.object_col.value('(/Root/ProductDescription/@ProductName)[1]', 'varchar(max)' ) as ProductName,
    xml_demo.object_col.value('(/Root/ProductDescription/Features/Warranty)[1]', 'varchar(max)' ) as Warranty
from xml_demo;
```

##### Output

```sql
 ID | ProductName | Warranty               |
----|-------------|------------------------|
1   | Road Bike   | 1 year parts and labor |
2   | Skate       | 1 year parts and labor |
```

##### Snowflake SQL

##### Input

```sql
 CREATE OR REPLACE TABLE xml_demo (
    object_col VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XML DATA TYPE CONVERTED TO VARIANT ***/!!!
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO xml_demo (object_col)
SELECT
        '<Root>
<ProductDescription ProductID="1" ProductName="Road Bike">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

INSERT INTO xml_demo (object_col)
SELECT
        '<Root>
<ProductDescription ProductID="2" ProductName="Skate">
<Features>
  <Warranty>1 year parts and labor</Warranty>
  <Maintenance>3 year parts and labor extended maintenance is available</Maintenance>
</Features>
</ProductDescription>
</Root>';

SELECT
    GET(XMLGET(object_col, 'ProductDescription'), '@ProductID') :: INT as ID,
    GET(XMLGET(object_col, 'ProductDescription'), '@ProductName') :: VARCHAR as ProductName,
    GET(XMLGET(XMLGET(XMLGET(object_col, 'ProductDescription'), 'Features'), 'Warranty', 0), '$') :: VARCHAR as Warranty
from
    xml_demo;
```

##### Output

```sql
 ID | PRODUCTNAME | WARRANRTY              |
----|-------------|------------------------|
1   | Road Bike   | 1 year parts and labor |
2   | Skate       | 1 year parts and labor |
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## Aggregate functions

This section describes the functional equivalents of aggregate functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to the creation of UDFs in Snowflake.

## COUNT

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This function returns the number of items found in a group. COUNT operates like the COUNT_BIG function. These functions differ only in the data types of their return values. COUNT always returns an int data type value. COUNT_BIG always returns a bigint data type value. ([COUNT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/count-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
COUNT ( { [ [ ALL | DISTINCT ] expression ] | * } )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/count.html)

```sql
COUNT( [ DISTINCT ] <expr1> [ , <expr2> ... ] )
```

### Examples

#### SQL Server

```sql
SELECT COUNT(NATIONALIDNUMBER) FROM HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

| TOTAL |
| --- |
| 290 |

##### Snowflake SQL

```sql
SELECT
COUNT(NATIONALIDNUMBER) FROM
HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

| TOTAL |
| --- |
| 290 |

## COUNT_BIG

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This function returns the number of items found in a group. COUNT_BIG operates like the COUNT function. These functions differ only in the data types of their return values. COUNT_BIG always returns a bigint data type value. COUNT always returns an int data type value. ([COUNT_BIG in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/count-big-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
COUNT_BIG ( { [ [ ALL | DISTINCT ] expression ] | * } )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/count.html)

```sql
COUNT( [ DISTINCT ] <expr1> [ , <expr2> ... ] )
```

### Examples

#### SQL Server

```sql
SELECT COUNT_BIG(NATIONALIDNUMBER) FROM HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

| TOTAL |
| --- |
| 290 |

##### Snowflake SQL

```sql
SELECT
COUNT(NATIONALIDNUMBER) FROM
HUMANRESOURCES.EMPLOYEE AS TOTAL;
```

**Result:**

| TOTAL |
| --- |
| 290 |

## SUM

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Returns the sum of all the values, or only the DISTINCT values, in the expression. SUM can be used with numeric columns only. Null values are ignored. ([SUM in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/sum-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SUM ( [ ALL | DISTINCT ] expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/sum.html)

```sql
SUM( [ DISTINCT ] <expr1> )
```

### Examples

#### SQL Server

```sql
SELECT SUM(VACATIONHOURS) FROM HUMANRESOURCES.EMPLOYEE AS TOTALVACATIONHOURS;
```

**Result:**

| TOTALVACATIONHOURS |
| --- |
| 14678 |

##### Snowflake SQL

```sql
SELECT
SUM(VACATIONHOURS) FROM
HUMANRESOURCES.EMPLOYEE AS TOTALVACATIONHOURS;
```

**Result:**

| TOTALVACATIONHOURS |
| --- |
| 14678 |

## SnowConvert AI custom UDFs

### Description

Some Transact-SQL functions or behaviors may not be available or may behave differently in Snowflake. To minimize these differences, some functions are replaced with SnowConvert AI Custom UDFs.

These UDFs are automatically created during migration, in the `UDF Helper` folder, inside the `Output` folder. There is one file per custom UDF.

## OPENXML UDF

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This custom UDF is added to process a rowset view over an XML document. This would be used for declarations in because it works as a rowset provider.

[Optional parameters](https://learn.microsoft.com/en-us/sql/t-sql/functions/openxml-transact-sql?view=sql-server-ver16#remarks) and different node types are not supported in this version of the UDF. The element node is processed by default.

### Custom UDF overloads

**Parameters**

1. **XML**: A `VARCHAR` that represents the readable content of the XML.
2. **PATH**: A varchar that contains the pattern of the nodes to be processed as rows.

#### UDF

```sql
CREATE OR REPLACE FUNCTION OPENXML_UDF(XML VARCHAR, PATH VARCHAR)
RETURNS TABLE(VALUE VARIANT)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
SELECT VALUE from TABLE(FLATTEN(input=>XML_JSON_SIMPLE(PARSE_XML(XML)), path=>PATH))
$$;

CREATE OR REPLACE FUNCTION XML_JSON_SIMPLE(XML VARIANT)
RETURNS OBJECT
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
function toNormalJSON(xmlJSON) {
    var finalres = {};
    var name=xmlJSON['@'];
    var res = {};
    finalres[name] = res;
    for(var key in xmlJSON)
    {
        if (key == "@")
        {
            res["$name"] = xmlJSON["@"];
        }
        else if (key == "$") {
            continue;
        }
        else if (key.startsWith("@"))
        {
            // This is an attribute
            res[key]=xmlJSON[key];
        }
        else
        {
            var elements = xmlJSON['$']
            var value = xmlJSON[key];
            res[key] = [];
            if (Array.isArray(value))
            {
                for(var elementKey in value)
                {
                    var currentElement = elements[elementKey];
                    var fixedElement = toNormalJSON(currentElement);
                    res[key].push(fixedElement);
                }
            }
            else if (value === 0)
            {
                var fixedElement = toNormalJSON(elements);
                res[key].push(fixedElement);
            }
        }
    }
    return finalres;
}
return toNormalJSON(XML);
$$;
```

##### Transact-SQL

##### Query

```sql
DECLARE @idoc INT, @doc VARCHAR(1000);
SET @doc ='
<ROOT>
<Customer CustomerID="VINET" ContactName="Paul Henriot">
   <Order CustomerID="VINET" EmployeeID="5" OrderDate="1996-07-04T00:00:00">
      <OrderDetail OrderID="10248" ProductID="11" Quantity="12"/>
      <OrderDetail OrderID="10248" ProductID="42" Quantity="10"/>
   </Order>
</Customer>
<Customer CustomerID="LILAS" ContactName="Carlos Gonzlez">
   <Order CustomerID="LILAS" EmployeeID="3" OrderDate="1996-08-16T00:00:00">
      <OrderDetail OrderID="10283" ProductID="72" Quantity="3"/>
   </Order>
</Customer>
</ROOT>';

EXEC sp_xml_preparedocument @idoc OUTPUT, @doc;

SELECT *  FROM OPENXML (@idoc, '/ROOT/Customer',1)
WITH (CustomerID  VARCHAR(10), ContactName VARCHAR(20));
```

##### Result

```sql
CustomerID  | ContactName
----------------------------|
VINET     | Paul Henriot
LILAS     | Carlos Gonzlez
```

##### Snowflake

> **Note:**
>
> The following example is isolated into a stored procedure because environment variables only support 256 bytes of storage, and the XML demo code uses more than that limit.

##### Query

```sql
DECLARE
IDOC INT;
DOC VARCHAR(1000);
BlockResultSet RESULTSET;
BEGIN
DOC := '
<ROOT>
<Customer CustomerID="VINET" ContactName="Paul Henriot">
   <Order CustomerID="VINET" EmployeeID="5" OrderDate="1996-07-04T00:00:00">
      <OrderDetail OrderID="10248" ProductID="11" Quantity="12"/>
      <OrderDetail OrderID="10248" ProductID="42" Quantity="10"/>
   </Order>
</Customer>
<Customer CustomerID="LILAS" ContactName="Carlos Gonzlez">
   <Order CustomerID="LILAS" EmployeeID="3" OrderDate="1996-08-16T00:00:00">
      <OrderDetail OrderID="10283" ProductID="72" Quantity="3"/>
   </Order>
</Customer>
</ROOT>';
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0075 - TRANSLATION FOR BUILT-IN PROCEDURE 'sp_xml_preparedocument' IS NOT CURRENTLY SUPPORTED. ***/!!!

EXEC sp_xml_preparedocument :IDOC OUTPUT, :DOC;
BlockResultSet := (

SELECT
Left(value:Customer['@CustomerID'], '10') AS 'CustomerID',
Left(value:Customer['@ContactName'], '20') AS 'ContactName'
FROM
OPENXML_UDF(:IDOC, ':ROOT:Customer'));
RETURN TABLE(BlockResultSet);
END;
```

##### Result

| CustomerID | ContactName |
| --- | --- |
| VINET | Paul Henriot |
| LILAS | Carlos Gonzlez |

##### Query

```sql
SET code = '<ROOT>
<Customer CustomerID="VINET" ContactName="Paul Henriot">
   <Order CustomerID="VINET" EmployeeID="5" OrderDate="1996-07-04T00:00:00">
      <OrderDetail OrderID="10248" ProductID="11" Quantity="12"/>
   </Order>
</Customer>
</ROOT>';
SELECT
Left(value:Customer['@CustomerID'],10) as "CustomerID",
Left(value:Customer['@ContactName'],20) as "ContactName"
FROM TABLE(OPENXML_UDF($code,'ROOT:Customer'));
```

##### Result

| CustomerID | ContactName |
| --- | --- |
| VINET | Paul Henriot |

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-TS0075](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Built In Procedure Not Supported.

## STR UDF

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This custom UDF converts numeric data to character data.

### Custom UDF overloads

#### Parameters

1. **FLOAT_EXPR**: A numeric expression to be converted to varchar.
2. **FORMAT**: A varchar expression with the length and number of decimals of the resulting varchar. This format is automatically generated in SnowConvert.

##### UDF

```sql
 CREATE OR REPLACE FUNCTION PUBLIC.STR_UDF(FLOAT_EXPR FLOAT, FORMAT VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    TRIM(TRIM(SELECT TO_CHAR(FLOAT_EXPR, FORMAT)), '.')
$$;

CREATE OR REPLACE FUNCTION PUBLIC.STR_UDF(FLOAT_EXPR FLOAT)
RETURNS VARCHAR
LANGUAGE SQL
IMMUTABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
    STR_UDF(FLOAT_EXPR, '999999999999999999')
$$;
```

##### Transact-SQL

##### Query

```sql
SELECT
    STR(123.5) as A,
    STR(123.5, 2) as B,
    STR(123.45, 6) as C,
    STR(123.45, 6, 1) as D;
```

##### Result

| A | B | C | D |
| --- | --- | --- | --- |
| 124 | \*\* | 123 | 123.5 |

##### Snowflake

##### Query

```sql
SELECT
    PUBLIC.STR_UDF(123.5, '99999') as A,
    PUBLIC.STR_UDF(123.5, '99') as B,
    PUBLIC.STR_UDF(123.45, '999999') as C,
    PUBLIC.STR_UDF(123.45, '9999.9') as D;
```

## SWITCHOFFSET_UDF

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This custom UDF is added to return a datetimeoffset value that is changed from the stored time zone offset to a specified new time zone offset.

### Custom UDF overloads

**Parameters**

1. **source_timestamp**: A TIMESTAMP_TZ that can be resolved to a datetimeoffset(n) value.
2. **target_tz**: A varchar that represents the time zone offset

#### UDF

```sql
CREATE OR REPLACE FUNCTION PUBLIC.SWITCHOFFSET_UDF(source_timestamp TIMESTAMP_TZ, target_tz varchar)
RETURNS TIMESTAMP_TZ
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
WITH tz_values AS (
SELECT
    RIGHT(source_timestamp::varchar, 5) as source_tz,

    REPLACE(source_tz::varchar, ':', '') as source_tz_clean,
    REPLACE(target_tz::varchar, ':', '') as target_tz_clean,

    target_tz_clean::integer - source_tz_clean::integer as offset,

    RIGHT(offset::varchar, 2) as tz_min,
    PUBLIC.OFFSET_FORMATTER(RTRIM(offset::varchar, tz_min)) as tz_hrs,

    TIMEADD( hours, tz_hrs::integer, source_timestamp ) as adj_hours,
    TIMEADD( minutes, (LEFT(tz_hrs, 1) || tz_min)::integer, adj_hours::timestamp_tz ) as new_timestamp

FROM DUAL)
SELECT
    (LEFT(new_timestamp, 24) || ' ' || target_tz)::timestamp_tz
FROM tz_values
$$;

-- ==========================================================================
-- Description: The function OFFSET_FORMATTER(offset_hrs varchar) serves as
-- an auxiliary function to format the offset hours and its prefix operator.
-- ==========================================================================
CREATE OR REPLACE FUNCTION PUBLIC.OFFSET_FORMATTER(offset_hrs varchar)
RETURNS varchar
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"udf"}}'
AS
$$
CASE
   WHEN LEN(offset_hrs) = 0 THEN '+' || '0' || '0'
   WHEN LEN(offset_hrs) = 1 THEN '+' || '0' || offset_hrs
   WHEN LEN(offset_hrs) = 2 THEN
        CASE
            WHEN LEFT(offset_hrs, 1) = '-' THEN '-' || '0' || RIGHT(offset_hrs, 1)
            ELSE '+' || offset_hrs
        END
    ELSE offset_hrs
END
$$;
```

##### Transact-SQL

##### Query

```sql
SELECT
  '1998-09-20 7:45:50.71345 +02:00' as fr_time,
  SWITCHOFFSET('1998-09-20 7:45:50.71345 +02:00', '-06:00') as cr_time;
```

##### Result

```sql
SELECT
  '1998-09-20 7:45:50.71345 +02:00' as fr_time,
  SWITCHOFFSET('1998-09-20 7:45:50.71345 +02:00', '-06:00') as cr_time;
```

##### Snowflake

##### Query

```sql
SELECT
  '1998-09-20 7:45:50.71345 +02:00' as fr_time,
  PUBLIC.SWITCHOFFSET_UDF('1998-09-20 7:45:50.71345 +02:00', '-06:00') as cr_time;
```

##### Result

| fr_time | cr_time |
| --- | --- |
| 1998-09-20 7:45:50.71345 +02:00 | 1998-09-19 23:45:50.7134500 -06:00 |

## Metadata functions

This section describes the functional equivalents of metadata functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to their usage in stored procedures in Snowflake.

## DB_NAME

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the name of a specified database.([DB_NAME in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/db-name-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
 DB_NAME ( [ database_id ] )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/current_database.html)

```sql
 CURRENT_DATABASE() /*** SSC-FDM-TS0010 - CURRENT_DATABASE function has different behavior in certain cases ***/
```

### Examples

#### SQL Server

```sql
SELECT DB_NAME();
```

**Result:**

| RESULT |
| --- |
| ADVENTUREWORKS2019 |

##### Snowflake SQL

```sql
SELECT
CURRENT_DATABASE() /*** SSC-FDM-TS0010 - CURRENT_DATABASE function has different behavior in certain cases ***/;
```

**Result:**

| RESULT |
| --- |
| ADVENTUREWORKS2019 |

### Known issues

**1. CURRENT_DATABASE function has different behavior in certain cases**

DB_NAME function can be invoked with the **database_id** parameter, which returns the name of the specified database. Without parameters, the function returns the current database name. However, Snowflake does not support this parameter and the CURRENT_DATABASE function will always return the current database name.

### Related EWIs

1. [SSC-FDM-TS0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): CURRENT_DATABASE function has different behavior in certain cases.

## OBJECT_ID

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the database object identification number of a schema-scoped object.[(OBJECT_ID in Transact-SQL)](https://learn.microsoft.com/en-us/sql/t-sql/functions/object-id-transact-sql?view=sql-server-ver16).

#### SQL Server syntax

```sql
 OBJECT_ID ( '[ database_name . [ schema_name ] . | schema_name . ]
  object_name' [ ,'object_type' ] )
```

### Sample Source Patterns

#### 1. Default transformation

##### SQL Server

```sql
 IF OBJECT_ID_UDF('DATABASE2.DBO.TABLE1') is not null) THEN
            DROP TABLE IF EXISTS TABLE1;
        END IF;
```

##### Snowflake SQL

```sql
 BEGIN
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '1' COLUMN '0' OF THE SOURCE CODE STARTING AT 'IF'. EXPECTED 'If Statement' GRAMMAR. LAST MATCHING TOKEN WAS 'null' ON LINE '1' COLUMN '48'. FAILED TOKEN WAS ')' ON LINE '1' COLUMN '52'. CODE '70'. **
--IF OBJECT_ID_UDF('DATABASE2.DBO.TABLE1') is not null) THEN
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
DROP TABLE IF EXISTS TABLE1;
END;
```

#### 2. Unknown database

##### SQL Server

```sql
 IF OBJECT_ID_UDF('DATABASE1.DBO.TABLE1') is not null) THEN
            DROP TABLE IF EXISTS TABLE1;
        END IF;
```

##### Snowflake SQL

```sql
  IF (
 OBJECT_ID_UDF('DATABASE1.DBO.TABLE1') is not null) THEN
     DROP TABLE IF EXISTS TABLE1;
 END IF;
```

#### 3. Different object names

##### SQL Server

```sql
 IF OBJECT_ID_UDF('DATABASE1.DBO.TABLE2') is not null) THEN
            DROP TABLE IF EXISTS TABLE1;
        END IF;
```

##### Snowflake SQL

```sql
  IF (
 OBJECT_ID_UDF('DATABASE1.DBO.TABLE2') is not null) THEN
     DROP TABLE IF EXISTS TABLE1;
 END IF;
```

### Known issues

**1. OBJECT_ID_UDF function has different behavior in certain cases**

OBJECT_ID returns the object identification number but the OBJECT_ID_UDF returns a boolean value, so that they are equivalent only when OBJECT_ID is used with not null condition.

### Related EWIs

* [SSC-EWI-0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.
* [SSC-FDM-0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies

## Analytic Functions

This section describes the functional equivalents of analytic functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to the creation of UDFs in Snowflake.

## LAG

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Accesses data from a previous row in the same result set without the use of a self-join starting with SQL Server 2012 (11.x).
LAG provides access to a row at a given physical offset that comes before the current row.
Use this analytic function in a SELECT statement to compare values in the current row with values in a previous row. (COUNT in Transact-SQL).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
LAG (scalar_expression [,offset] [,default])
    OVER ( [ partition_by_clause ] order_by_clause )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/count.html)

```sql
COUNT( [ DISTINCT ] <expr1> [ , <expr2> ... ] )
```

### Examples

#### SQL Server

```sql
SELECT TOP 10
LAG(E.VacationHours,1) OVER(ORDER BY E.NationalIdNumber) as PREVIOUS,
E.VacationHours AS ACTUAL
FROM HumanResources.Employee E
```

**Result:**

| PREVIOUS | ACTUAL |
| --- | --- |
| NULL | 10 |
| 10 | 89 |
| 89 | 10 |
| 10 | 48 |
| 48 | 0 |
| 0 | 95 |
| 95 | 55 |
| 55 | 67 |
| 67 | 84 |
| 84 | 85 |

##### Snowflake SQL

```sql
SELECT TOP 10
LAG(E.VacationHours,1) OVER(ORDER BY E.NationalIdNumber) as PREVIOUS,
E.VacationHours AS ACTUAL
FROM
HumanResources.Employee E;
```

**Result:**

| PREVIOUS | ACTUAL |
| --- | --- |
| NULL | 10 |
| 10 | 89 |
| 89 | 10 |
| 10 | 48 |
| 48 | 0 |
| 0 | 95 |
| 95 | 55 |
| 55 | 67 |
| 67 | 84 |
| 84 | 85 |

## Data Type functions

This section describes the functional equivalents of data type functions in Transact-SQL to Snowflake SQL and JavaScript code.

## DATALENGTH

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the number of bytes used to represent any expression. ([DATALENGTH in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/datalength-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATALENGTH ( expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/cast.html)

```sql
OCTET_LENGTH(<string_or_binary>)
```

### Examples

#### SQL Server

```sql
SELECT DATALENGTH('SomeString') AS SIZE;
```

**Result:**

| SIZE |
| --- |
| 10 |

##### Snowflake SQL

```sql
SELECT OCTET_LENGTH('SomeString') AS SIZE;
```

**Result:**

| SIZE |
| --- |
| 10 |

## Mathematical functions

This section describes the functional equivalents of mathematical functions in Transact-SQL to Snowflake SQL and JavaScript code, oriented to their usage in stored procedures in Snowflake.

## ABS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

A mathematical function that returns the absolute (positive) value of the specified numeric expression. (`ABS` changes negative values to positive values. `ABS` has no effect on zero or positive values.) ([ABS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/abs-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
ABS( expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/abs.html)

```sql
ABS( <num_expr> )
```

##### JavaScript

[JavaScript complete documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Math/abs)

```sql
Math.abs( expression )
```

### Examples

#### SQL Server

```sql
SELECT ABS(-5);
```

**Result:**

| ABS(-5) |
| --- |
| 5 |

##### Snowflake SQL

```sql
SELECT ABS(-5);
```

**Result:**

| ABS(-5) |
| --- |
| 5 |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION compute_abs(a float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  return Math.abs(A);
$$
;
SELECT COMPUTE_ABS(-5);
```

**Result:**

| COMPUTE_ABS(-5) |
| --- |
| 5 |

### Related Documentation

* [Transact-SQL supported numeric types](https://docs.microsoft.com/en-us/sql/t-sql/data-types/numeric-types?view=sql-server-ver15)

## AVG

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> SnowConvert AI Helpers Code section is omitted.

This function returns the average of the values in a group. It ignores null values. ([AVG in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/avg-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
AVG ( [ ALL | DISTINCT ] expression )
   [ OVER ( [ partition_by_clause ] order_by_clause ) ]
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/avg.html)

```sql
AVG( [ DISTINCT ] <expr1> )

AVG( [ DISTINCT ] <expr1> ) OVER (
                                 [ PARTITION BY <expr2> ]
                                 [ ORDER BY <expr3> [ ASC | DESC ] [ <window_frame> ] ]
                                 )
```

### Examples

#### SQL Server

```sql
SELECT AVG(VACATIONHOURS) AS AVG_VACATIONS FROM HUMANRESOURCES.EMPLOYEE;
```

**Result:**

| AVG_VACATIONS |
| --- |
| 50 |

##### Snowflake SQL

```sql
SELECT AVG(VACATIONHOURS) AS AVG_VACATIONS FROM HUMANRESOURCES.EMPLOYEE;
```

**Result:**

| AVG_VACATIONS |
| --- |
| 50 |

## CEILING

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

A mathematical function that returns the smallest greater integer greater/equal to the number sent as a parameter ([CEILING in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/ceiling-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
CEILING( expression )
```

##### Snowflake SQL

```sql
CEIL( <input_expr> [, <scale_expr> ] )
```

##### JavaScript

```sql
 Math.ceil( expression )
```

### Examples

#### SQL Server

```sql
SELECT CEILING(642.20);
```

**Result:**

| CEILING(642.20) |
| --- |
| 643 |

##### Snowflake SQL

```sql
SELECT CEIL(642.20);
```

**Result:**

| CEIL(642.20) |
| --- |
| 643 |

##### JavaScript

```sql
CREATE OR REPLACE FUNCTION compute_ceil(a double)
RETURNS double
LANGUAGE JAVASCRIPT
AS
$$
  return Math.ceil(A);
$$
;
SELECT COMPUTE_CEIL(642.20);
```

**Result:**

```sql
COMPUTE_CEIL(642.20)|
--------------------|
                 643|
```

## FLOOR

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the largest integer less than or equal to the specified numeric expression. ([FLOOR in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/floor-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
FLOOR ( numeric_expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/floor.html)

```sql
FLOOR( <input_expr> [, <scale_expr> ] )
```

### Examples

#### SQL Server

```sql
SELECT FLOOR (124.87) AS FLOOR;
```

**Result:**

```sql
FLOOR|
-----|
  124|
```

##### Snowflake SQL

```sql
SELECT FLOOR (124.87) AS FLOOR;
```

**Result:**

```sql
FLOOR|
-----|
  124|
```

## POWER

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the value of the specified expression to the specified power. ([POWER in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/power-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
POWER ( float_expression , y )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/pow.html)

```sql
POW(x, y)

POWER (x, y)
```

### Examples

#### SQL Server

```sql
SELECT POWER(2, 10.0) AS IntegerResult
```

**Result:**

```sql
IntegerResult |
--------------|
          1024|
```

##### Snowflake SQL

```sql
SELECT POWER(2, 10.0) AS IntegerResult;
```

**Result:**

```sql
IntegerResult |
--------------|
          1024|
```

### Related Documentation

* [SQL Server supported numeric types](https://docs.microsoft.com/en-us/sql/t-sql/data-types/numeric-types?view=sql-server-ver15)

## ROUND

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a numeric value, rounded to the specified length or precision. ([ROUND in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/round-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
ROUND ( numeric_expression , length [ ,function ] )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/round.html)

```sql
ROUND( <input_expr> [, <scale_expr> ] )
```

### Examples

#### SQL Server

```sql
SELECT ROUND(123.9994, 3) AS COL1, ROUND(123.9995, 3) AS COL2;
```

**Result:**

```sql
COL1    |COL2    |
--------|--------|
123.9990|124.0000|
```

##### Snowflake SQL

```sql
SELECT ROUND(123.9994, 3) AS COL1,
ROUND(123.9995, 3) AS COL2;
```

**Result:**

```sql
COL1   | COL2  |
--------|------|
123.999|124.000|
```

### Related Documentation

* [SQL Server supported numeric types](https://docs.microsoft.com/en-us/sql/t-sql/data-types/numeric-types?view=sql-server-ver15)

## SQRT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the square root of the specified float value. ([SQRT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/sqrt-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SQRT ( float_expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/sqrt.html)

```sql
SQRT(expr)
```

### Examples

#### SQL Server

```sql
SELECT SQRT(25) AS RESULT;
```

**Result:**

```sql
RESULT|
------|
   5.0|
```

##### Snowflake SQL

```sql
SELECT SQRT(25) AS RESULT;
```

**Result:**

```sql
RESULT|
------|
   5.0|
```

## SQUARE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the square of the specified float value. ([SQUARE in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/square-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SQUARE ( float_expression )  ****
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/square.html)

```sql
SQUARE(expr)
```

### Examples

#### SQL Server

```sql
SELECT SQUARE (5) AS SQUARE;
```

**Result:**

```sql
SQUARE|
------|
  25.0|
```

##### Snowflake SQL

```sql
SELECT SQUARE (5) AS SQUARE;
```

**Result:**

```sql
SQUARE|
------|
    25|
```

## STDEV

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Returns the statistical standard deviation of all values in the specified expression. ([STDEV in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/degrees-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
 STDEV ( [ ALL | DISTINCT ] expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/stddev.html)

```sql
 STDDEV( [ DISTINCT ] <expression_1> )
```

### Examples

#### SQL Server

```sql
SELECT
    STDEV(VACATIONHOURS)
FROM
    HUMANRESOURCES.EMPLOYEE AS STDEV;
```

**Result:**

```sql
           STDEV|
----------------|
28.7862150320948|
```

##### Snowflake SQL

```sql
SELECT
    STDDEV(VACATIONHOURS)
FROM
    HUMANRESOURCES.EMPLOYEE AS STDEV;
```

**Result:**

```sql
       STDEV|
------------|
28.786215034|
```

## STDEVP

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Returns the statistical standard deviation for the population for all values in the specified expression. ([STDVEP in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/degrees-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
STDEVP ( [ ALL | DISTINCT ] expression )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/stddev_pop.html)

```sql
STDDEV_POP( [ DISTINCT ] expression_1)
```

### Examples

#### SQL Server

```sql
SELECT
    STDEVP(VACATIONHOURS) AS STDEVP_VACATIONHOURS
FROM
    HumanResources.Employee;
```

**Result:**

```sql
STDEVP_VACATIONHOURS|
--------------------|
  28.736540767245085|
```

##### Snowflake SQL

```sql
SELECT
    STDDEV_POP(VACATIONHOURS) AS STDEVP_VACATIONHOURS
FROM
    HumanResources.Employee;
```

**Result:**

```sql
STDEVP_VACATIONHOURS|
--------------------|
        28.736540763|
```

## VAR

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Returns the statistical variance of all values in the specified expression. ([VAR in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/var-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
VAR ( [ ALL | DISTINCT ] expression )
```

##### Snowflake SQL

```sql
VAR_SAMP( [DISTINCT] <expr1> )
```

### Examples

#### SQL Server

```sql
SELECT
    VAR(VACATIONHOURS)
FROM
    HUMANRESOURCES.EMPLOYEE AS VAR;
```

**Result:**

```sql
             VAR|
----------------|
28.7862150320948|
```

##### Snowflake SQL

```sql
SELECT
    VAR_SAMP(VACATIONHOURS)
FROM
    HUMANRESOURCES.EMPLOYEE AS VAR;
```

**Result:**

```sql
       VAR|
----------|
828.646176|
```

## POWER

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the value of the specified expression for a specific power.
([POWER in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/power-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
POWER( base, exp )
```

#### Arguments

`base`: Base of number, it must be a float expression.
`exp`: Power to which raise the base.

#### Return Type

The return type depends on the input expression:

| Input Type | Return Type |
| --- | --- |
| float, real | float |
| decimal(p, s) | decimal(38, s) |
| int, smallint, tinyint | int |
| bigint | bigint |
| money, smallmoney | money |
| bit, char, nchar, varchar, nvarchar | float |

### Examples

### Query

```sql
SELECT POWER(2, 3)
```

#### Result

```sql
POWER(2, 3)|
-----------|
        8.0|
```

## POW in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the base of the exponent power.
([JavaScript POW function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/pow)).

### Sample Source Pattern

#### Syntax

```sql
 Math.pow( base, exp )
```

##### Arguments

`base`: Base of number, it must be a float expression.
`exp`: Power to which raise the base.

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION compute_pow(base float, exp float)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
    return Math.pow(BASE, EXP);
$$
;
SELECT COMPUTE_POW(2, 3);
```

##### Result

```sql
COMPUTE_POW(2, 3)|
-----------------|
                8|
```

## ACOS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arccosine in radians of the number sent as a parameter ([ACOS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/acos-transact-sql?view=sql-server-ver15)).

Mathematically, the arccosine is the inverse function of the cosine, resulting in the following definition:
$$y = cos^{-1} \Leftrightarrow x = cos(y)$$

For $$y = cos^{-1}(x)$$:
- Range: $$0\leqslant y \leqslant \pi$$ or $$0^{\circ}\leqslant y \leqslant 180^{\circ}$$
- Domain: $$-1\leqslant x \leqslant 1$$

### Sample Source Pattern

### Syntax

```sql
ACOS ( expression )
```

#### Arguments

`expression`: Numeric **float** expression, where expression is in$$[-1,1]$$.

#### Return Type

Numeric float expression between 0 and π. If the numeric expression sent by parameter is out of the domain $$[-1, 1]$$, the database engine throws an error.

### Examples

### Query

```sql
SELECT ACOS(-1.0);
```

#### Result

```sql
ACOS(-1.0)       |
-----------------|
3.141592653589793|
```

## ACOS in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arccosine of a specified number
([JavaScript ACOS function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Math/acos)).

### Sample Source Pattern

#### Syntax

```sql
 Math.acos( expression )
```

##### Arguments

`expression`: Numeric expression, where expression is in$$[-1,1]$$.

##### Return Type

Numeric expression between 0 and π. If the numeric expression sent by parameter is out of the range of the arccosine in radians $$[-1, 1]$$, the function returns NaN.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION compute_acos(a double)
  RETURNS double
  LANGUAGE JAVASCRIPT
AS
$$
  return Math.acos(A);
$$
;
SELECT COMPUTE_ACOS(-1);
```

##### Result

```sql
COMPUTE_ACOS(-1)|
---------------|
    3.141592654|
```

## ASIN

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arcsine in radians of the number sent as parameter ([ASIN in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/asin-transact-sql?view=sql-server-ver15)).

The arcsine is the inverse function of the sine , summarized in the next definition:
$$y = sin^{-1} \Leftrightarrow x = sin(x)$$

For $$y = sin^{-1}(x)$$:
- Range: $$-\frac{\pi}{2}\leqslant y \leqslant \frac{\pi}{2}$$ or $$-90^{\circ}\leqslant y \leqslant 90^{\circ}$$
- Domain: $$-1\leqslant x \leqslant 1$$

### Sample Source Pattern

### Syntax

```sql
ASIN( expression )
```

#### Arguments

`expression`: Numeric **float** expression, where expression is in$$[-1,1]$$.

#### Return Type

Numeric float expression between $$-\frac{\pi}{2}$$ and $$\frac{\pi}{2}$$. If the numeric expression sent by parameter is not in $$[-1, 1]$$, the database engine throws an error.

### Examples

### Query

```sql
SELECT ASIN(0.5);
```

#### Result

```sql
ASIN(0.5)         |
------------------|
0.5235987755982989|
```

## ASIN in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arcsine of a specified number
([JavaScript ASIN function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Math/asin)).

### Sample Source Pattern

#### Syntax

```sql
 Math.asin( expression )
```

##### Arguments

`expression`: Numeric expression, where expression is in$$[-1,1]$$.

##### Return Type

Numeric expression between $$-\frac{\pi}{2}$$ and $$\frac{\pi}{2}$$. If the numeric expression sent by parameter is out of the domain of the arccosine $$[-1, 1]$$, the function returns NaN.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION compute_asin(a float)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
  return Math.asin(A);
$$
;
SELECT COMPUTE_ASIN(0.5);
```

##### Result

```sql
COMPUTE_ASIN(1)   |
------------------|
      0.5235987756|
```

## COS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the cosine of the angle sent through parameters (must be measured in radians) ([COS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/cos-transact-sql?view=sql-server-ver15)).

The cosine is defined as:
$$y = cos(x)$$

### Sample Source Pattern

### Syntax

```sql
COS( expression )
```

#### Arguments

`expression`: Numeric **float** expression, where expression is in $$\mathbb{R}$$.

#### Return Type

Numeric float expression in $$[-1, 1]$$.

### Examples

### Query

```sql
SELECT COS(PI())
```

#### Result

```sql
COS(PI())|
---------|
     -1.0|
```

## COS in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Static function that returns the cosine of an angle in radians
([JavaScript COS function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/cos)).

### Sample Source Pattern

#### Syntax

```sql
 Math.cos( expression )
```

##### Arguments

`expression:` Numeric expressions.

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION compute_cos(angle float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  return Math.cos(ANGLE);
$$
;
SELECT COMPUTE_COS(PI());
```

##### Result

```sql
COMPUTE_COS(PI())|
-----------------|
               -1|
```

## COT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the cotangent of the angle in radians sent through parameters ([COT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/cot-transact-sql?view=sql-server-ver15)).

The cosine is defined as:
$$cot(x) = \frac{cos(x)}{sin(x)}$$ or $$cot(x) = \frac{1}{tan(x)}$$
To calculate the cosine, the parameter must comply with the constraints of sine and cosine functions.

### Sample Source Pattern

### Syntax

```sql
COT( expression )
```

#### Arguments

`expression`: Numeric **float** expression, where expression is in $$\mathbb{R}-{sin(expression)=0 \wedge tan(expression) =0}$$.

#### Return Type

Numeric float expression in $$\mathbb{R}$$.

### Examples

### Query

```sql
SELECT COT(1)
```

#### Result

```sql
COT(1)            |
------------------|
0.6420926159343306|
```

## COT in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Unfortunately, the object `Math`in JavaScript does not provide a method to calculate the cotangent of a given angle.
This could be calculated using the equation: $$cot(x) = \frac{cos(x)}{sin(x)}$$

### Sample Source Pattern

#### Implementation example

```sql
 function cot(angle){
    return Math.cos(angle)/Math.sin(angle);
}
```

##### Arguments

`angle:` Numeric expression in radians.

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION compute_cot(angle float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  function cot(angle){
    return Math.cos(angle)/Math.sin(angle);
  }
  return cot(ANGLE);

$$
;
SELECT COMPUTE_COT(1);
```

##### Result

```sql
COMPUTE_COT(1);   |
------------------|
0.6420926159343308|
```

## RADIANS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts degrees to radians.
([RADIANS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/radians-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
RADIANS( expression )
```

#### Arguments

`expression`: Numeric expression in degrees.

#### Return Type

Same data type sent through parameter as a numeric expression in radians.

### Examples

### Query

```sql
SELECT RADIANS(180.0)
```

#### Result

| RADIANS(180) |
| --- |
| 3.141592653589793116 |

> **Note:**
>
> Cast the parameter of this function to float, otherwise, the above statement will return 3 instead of PI value.

## RADIANS in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

JavaScript does not provide a method to convert degrees to radians of a given angle.
This could be calculated using the equation: $$Radians = \frac{\pi}{180^{\circ}} \cdot angle$$

### Sample Source Pattern

#### Implementation example

```sql
 function radians(angle){
    return (Math.PI/180) * angle;
}
```

##### Arguments

`angle`: Float expression in degrees.

##### Return Type

Same data type sent through parameter as a numeric expression in radians.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION RADIANS(angle float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
    function radians(angle){
      return (Math.PI/180) * angle;
    }
    return radians(ANGLE);
$$
;
SELECT RADIANS(180);
```

##### Result

| RADIANS(180) |
| --- |
| 3.141592654 |

## PI

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the constant value of PI
([PI in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/pi-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
PI( )
```

#### Arguments

This method does not receive any parameters.

#### Return Type

Float.

### Examples

### Query

```sql
CREATE PROCEDURE CIRCUMFERENCE @radius float
AS
    SELECT 2 * PI() * @radius;
GO:

EXEC CIRCUMFERENCE @radius = 2;
```

#### Result

```sql
CIRCUMFERENCE @radius = 2 |
--------------------------|
          12.5663706143592|
```

## PI in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Constant which represents the PI number (approximately 3.141592…)
([JavaScript PI Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/PI)).

### Sample Source Pattern

#### Syntax

```sql
 Math.PI
```

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION circumference(radius float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  function circumference(r){
    return 2 * Math.PI * r;
  }
  return circumference(RADIUS);
$$
;
SELECT CIRCUMFERENCE(2);
```

##### Result

```sql
  CIRCUMFERENCE(2)|
------------------|
12.566370614359172|
```

## DEGREES

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts the angle in radians sent through parameters to degrees ([DEGREES in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/degrees-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
DEGREES( expression )
```

#### Arguments

`expression`: Numeric **float** expression in radians.

#### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

### Query

```sql
SELECT DEGREES(PI())
```

#### Result

```sql
DEGREES(PI())|
-------------|
        180.0|
```

## DEGREES in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

JavaScript does not provide a method to convert radians to degrees of a given angle.
This could be calculated using the equation: $$Degrees = \frac{180^{\circ}}{\pi} \cdot angle$$

### Sample Source Pattern

#### Implementation example

```sql
 function degress(angle){
    return (180/Math.PI) * angle;
}
```

##### Arguments

`angle`: Numeric expression in radians.

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION compute_degrees(angle float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  function degrees(angle){
    return (180/Math.PI) * angle;
  }
  return degrees(ANGLE);

$$
;
SELECT COMPUTE_DEGREES(PI());
```

##### Result

```sql
COMPUTE_DEGREES(PI())|
---------------------|
                180.0|
```

## LOG

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the natural logarithm of a number
([LOG in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/log-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
LOG( expression [, base ] )
```

#### Arguments

`expression`: Numeric expression.

`base` (optional): Base to calculate the logarithm of a number, it is Euler by default.

#### Return Type

Float.

### Examples

### Query

```sql
SELECT LOG(8, 2)
```

#### Result

```sql
LOG(8, 2)  |
-----------|
          3|
```

## LOG in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the logarithm using the Euler’s number as a base. ([JavaScript LOG function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/log)).

> **Warning:**
>
> Unfortunately, JavaScript does not provide a method that receives a logarithm base through its parameters, but this can be solved by dividing the base by the argument.

### Sample Source Pattern

#### Syntax

```sql
 Math.log( expression )
```

##### Arguments

`expression`: Numeric expression. It must be positive, otherwise returns NaN.\

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION base_log(base float, exp float)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
  function getBaseLog(x, y){
    return Math.log(y)/Math.log(x);
  }
  return getBaseLog(EXP, BASE)
$$
;
SELECT BASE_LOG(2, 8);
```

##### Result

```sql
BASE_LOG(2, 8)|
--------------|
             3|
```

## ATAN

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arctangent in radians of the number sent as a parameter ([ATAN in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/atan-transact-sql?view=sql-server-ver15)).

The arctangent is the inverse function of the tangent, summarized in the next definition:
$$y = arctan^{-1} \Leftrightarrow x = tan(x)$$

For $$y = tan^{-1}(x)$$:
- Range: $$-\frac{\pi}{2}\leqslant y \leqslant \frac{\pi}{2}$$ or $$-90^{\circ}\leqslant y \leqslant 90^{\circ}$$
- Domain: $$\mathbb{R}$$

### Sample Source Pattern

### Syntax

```sql
ATAN( expression )
```

#### Arguments

`expression`: Numeric **float** expression, or a numeric type which could be converted to float.

#### Return Type

Numeric float expression between $$-\frac{\pi}{2}$$ and $$\frac{\pi}{2}$$.

### Examples

### Query

```sql
SELECT ATAN(-30);
```

#### Result

```sql
ATAN(-30)          |
-------------------|
-1.5374753309166493|
```

## ATAN in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arctangent of a specified number
([JavaScript ATAN function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Math/atan)).

### Sample Source Pattern

#### Syntax

```sql
 Math.atan( expression )
```

##### Arguments

`expression`: Numeric expression.

##### Return Type

Numeric expression between $$-\frac{\pi}{2}$$ and $$\frac{\pi}{2}$$.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION compute_atan(a float)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
  return Math.atan(A);
$$
;
SELECT COMPUTE_ATAN(-30);
```

##### Result

```sql
COMPUTE_ATAN(-30)|
-----------------|
     -1.537475331|
```

## ATN2

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arctangent in radians of two coordinates sent as a parameter ([ATN2 in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/atn2-transact-sql?view=sql-server-ver15)).

For $$z = tan^{-1}(x, y)$$:
- Range: $$-\pi\leqslant z \leqslant \pi$$ or $$-180^{\circ}\leqslant z \leqslant 180^{\circ}$$
- Domain: $$\mathbb{R}$$

### Sample Source Pattern

### Syntax

```sql
ATN2( expression_1, expression_2 )
```

#### Arguments

`expression1`and `expression2`: Numeric expressions.

#### Return Type

Numeric expression between $$-\pi$$ and $$\pi$$.

### Examples

### Query

```sql
SELECT ATN2(7.5, 2);
```

#### Result

```sql
ATN2(7.5, 2)      |
------------------|
1.3101939350475555|
```

## ATAN2 in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Function that returns the arctangent of two parameters
([JavaScript ATAN2 function Documentation](https://developer.mozilla.org/es/docs/Web/JavaScript/Reference/Global_Objects/Math/atan2)).

### Sample Source Pattern

#### Syntax

```sql
 Math.atan2( expression_1, expression_2 )
```

##### Arguments

`expression_1`and `expression_2`: Numeric expressions.

##### Return Type

Numeric expression between $$-\pi$$ and $$\pi$$.

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION compute_atan2(x float, y float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  return Math.atan2(X, Y);
$$
;
SELECT COMPUTE_ATAN2(7.5, 2);
```

##### Result

```sql
ATAN2(7.5, 3)     |
------------------|
       1.310193935|
```

## LOG10

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the base 10 logarithm of a number
([LOG10 in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/log10-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
LOG10( expression )
```

#### Arguments

`expression`: Numeric expression, must be positive.

#### Return Type

Float.

### Examples

### Query

```sql
SELECT LOG10(5)
```

#### Result

```sql
LOG10(5)         |
-----------------|
0.698970004336019|
```

## LOG10 in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the base 10 logarithm of a number
([JavaScript LOG10 function Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/log10)).

### Sample Source Pattern

#### Syntax

```sql
 Math.log10( expression )
```

##### Arguments

`expression`: Numeric expression. It must be positive, otherwise returns NaN.\

##### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

#### Query

```sql
 CREATE OR REPLACE FUNCTION compute_log10(argument float)
  RETURNS float
  LANGUAGE JAVASCRIPT
AS
$$
    return Math.log10(ARGUMENT);
$$
;
SELECT COMPUTE_LOG10(7.5);
```

##### Result

```sql
COMPUTE_LOG10(5)|
----------------|
    0.6989700043|
```

## EXP

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the exponential value of Euler ([EXP in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/exp-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

### Syntax

```sql
EXP( expression )
```

#### Arguments

`expression`: Numeric expression.

#### Return Type

Same data type sent through parameter as a numeric expression.

### Examples

### Query

```sql
SELECT EXP(LOG(20)), LOG(EXP(20))
GO
```

#### Result

```sql
EXP(LOG(20))   |LOG(EXP(20))    |
---------------|----------------|
           20.0|            20.0|
```

## EXP in JS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Constant which represents Euler’s number (approximately 2.718…)
([JavaScript Euler’s Number Documentation](https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/Math/E)).
JavaScript allows make different operations using this constant, instead of Transact-SQL which only supports the exponential of Euler.

### Sample Source Pattern

#### Syntax

```sql
 Math.E
```

### Examples

#### Query

```sql
CREATE OR REPLACE FUNCTION compute_exp(x float)
RETURNS float
LANGUAGE JAVASCRIPT
AS
$$
  return Math.E**X;
$$
;
SELECT COMPUTE_EXP(LN(20)), LN(COMPUTE_EXP(20));
```

##### Result

```sql
COMPUTE_EXP(LOG(20))|LOG(COMPUTE_EXP(20))|
--------------------|--------------------|
                20.0|                20.0|
```

## Conversion functions

This section describes the functional equivalents of date & time functions in Transact-SQL to Snowflake SQL code.

## CONVERT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Convert an expression of one data type to another. ([CONVERT in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/cast-and-convert-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
CONVERT ( data_type [ ( length ) ] , expression [ , style ] )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/cast.html)

```sql
CAST( <source_expr> AS <target_data_type> )
```

### Examples

#### SQL Server

```sql
SELECT CONVERT(INT, '1998') as MyDate
```

##### Result

| MyDate |
| --- |
| 1998 |

##### Snowflake SQL

```sql
SELECT
CAST('1998' AS INT) as MyDate;
```

##### Result

| MYDATE |
| --- |
| 1998 |

##### Casting date type to varchar

##### SQL Server

```sql
SELECT CONVERT(varchar, getdate(), 1) AS RESULT;
```

##### Result

| RESULT |
| --- |
| 12/08/22 |

##### Swowflake SQL

```sql
SELECT
TO_VARCHAR(CURRENT_TIMESTAMP() :: TIMESTAMP, 'mm/dd/yy') AS RESULT;
```

##### Result

| RESULT |
| --- |
| 12/08/22 |

##### Casting date type to varchar with size

##### SQL Server

```sql
SELECT CONVERT(varchar(2), getdate(), 1) AS RESULT;
```

##### Result

| RESULT |
| --- |
| 07 |

##### Snowflake SQL

```sql
SELECT
LEFT(TO_VARCHAR(CURRENT_TIMESTAMP() :: TIMESTAMP, 'mm/dd/yy'), 2) AS RESULT;
```

##### Result

| RESULT |
| --- |
| 07 |

The supported formats for dates casts are:

**Date formats**

| Code | Format |
| --- | --- |
| 1 | mm/dd/yy |
| 2 | yy.mm.dd |
| 3 | dd/mm/yy |
| 4 | dd.mm.yy |
| 5 | dd-mm-yy |
| 6 | dd-Mon-yy |
| 7 | Mon dd, yy |
| 10 | mm-dd-yy |
| 11 | yy/mm/dd |
| 12 | yymmdd |
| 23 | yyyy-mm-dd |
| 101 | mm/dd/yyyy |
| 102 | yyyy.mm.dd |
| 103 | dd/mm/yyyy |
| 104 | dd.mm.yyyy |
| 105 | dd-mm-yyyy |
| 106 | dd Mon yyyy |
| 107 | Mon dd, yyyy |
| 110 | mm-dd-yyyy |
| 111 | yyyy/mm/dd |
| 112 | yyyymmdd |

**Time formats**

| Code | Format |
| --- | --- |
| 8 | hh:mm:ss |
| 14 | hh:mm:ss:ff3 |
| 24 | hh:mm:ss |
| 108 | hh:mm:ss |
| 114 | hh:mm:ss:ff3 |

**Date and time formats**

|  |  |
| --- | --- |
| 0 | Mon dd yyyy hh:mm AM/PM |
| 9 | Mon dd yyyy hh:mm:ss:ff3 AM/PM |
| 13 | dd Mon yyyy hh:mm:ss:ff3 AM/PM |
| 20 | yyyy-mm-dd hh:mm:ss |
| 21 | yyyy-mm-dd hh:mm:ss:ff3 |
| 22 | mm/dd/yy hh:mm:ss AM/PM |
| 25 | yyyy-mm-dd hh:mm:ss:ff3 |
| 100 | Mon dd yyyy hh:mm AM/PM |
| 109 | Mon dd yyyy hh:mm:ss:ff3 AM/PM |
| 113 | dd Mon yyyy hh:mm:ss:ff3 |
| 120 | yyyy-mm-dd hh:mm:ss |
| 121 | yyyy-mm-dd hh:mm:ss:ff3 |
| 126 | yyyy-mm-dd T hh:mm:ss:ff3 |
| 127 | yyyy-mm-dd T hh:mm:ss:ff3 |

**Islamic calendar dates**

| Code | Format |
| --- | --- |
| 130 | dd mmm yyyy hh:mi:ss:ff3 AM/PM |
| 131 | dd mmm yyyy hh:mi:ss:ff3 AM/PM |

If there is no pattern matching with the current code, it will be formatted to `yyyy-mm-dd hh:mm:ss`

##### Converting string to DATE or DATETIME with style

When `CONVERT` targets a `DATE`, `DATETIME`, or `DATETIME2` type and includes a **literal** style code, SnowConvert AI maps it to `TO_DATE` or `TO_TIMESTAMP` with the corresponding Snowflake format string.

##### SQL Server

```sql
SELECT
    CONVERT(DATE, StartDate, 101) AS StartDt,
    CONVERT(DATE, EndDate, 103) AS EndDt,
    CONVERT(DATETIME, EventTime, 120) AS EventTs
FROM Events
```

##### Snowflake SQL

```sql
SELECT
  TO_DATE(StartDate, 'mm/dd/yyyy') AS StartDt,
  TO_DATE(EndDate, 'dd/mm/yyyy') AS EndDt,
  TO_TIMESTAMP(EventTime, 'yyyy-mm-dd hh:mm:ss') AS EventTs
FROM
  Events;
```

The following table shows which target types produce `TO_DATE` versus `TO_TIMESTAMP`:

| Target Type | Snowflake Function |
| --- | --- |
| DATE | TO_DATE |
| DATETIME | TO_TIMESTAMP |
| DATETIME2 | TO_TIMESTAMP |

##### Converting VARBINARY / BINARY with style

When converting to `VARBINARY` or `BINARY` with a hex style (1 or 2), SnowConvert AI maps to `TO_BINARY(expr, 'HEX')`. Style 0 (default/ASCII) maps to a plain `CAST`. For `VARBINARY(MAX)`, the outer `CAST` is omitted.

##### SQL Server

```sql
SELECT CONVERT(VARBINARY(16), @UGIDString, 0);
SELECT CONVERT(VARBINARY(16), @UGIDString, 1);
SELECT CONVERT(VARBINARY(MAX), @HexData, 2);
SELECT CONVERT(BINARY(16), @HexData, 1);
```

##### Snowflake SQL

```sql
SELECT CAST(@UGIDString AS VARBINARY(16));
SELECT CAST(TO_BINARY(@UGIDString, 'HEX') AS VARBINARY(16));
SELECT TO_BINARY(@HexData, 'HEX');
SELECT CAST(TO_BINARY(@HexData, 'HEX') AS BINARY(16));
```

##### Converting with a dynamic style variable

When the style argument is a variable or expression instead of a literal, SnowConvert AI cannot determine the format string at conversion time. The function falls back to `CAST` and emits [SSC-EWI-TS0098](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md).

##### SQL Server

```sql
SELECT CONVERT(DATE, @InputDate, @Style)
```

##### Snowflake SQL

```sql
SELECT
  !!!RESOLVE EWI!!! /*** SSC-EWI-TS0098 - CONVERT WITH A VARIABLE OR EXPRESSION AS THE STYLE ARGUMENT CANNOT BE AUTOMATICALLY MAPPED TO A SNOWFLAKE FORMAT STRING. REPLACE WITH THE APPROPRIATE TO_DATE/TO_TIMESTAMP CALL WITH THE KNOWN FORMAT STRING. ***/!!!
  CAST(@InputDate AS DATE);
```

### Related EWIs

1. [SSC-EWI-TS0098](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): CONVERT with a non-literal style cannot be mapped to a Snowflake format string.

## TRY_CONVERT

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a value cast to the specified data type if the cast succeeds; otherwise, returns null.

([SQL Server Language Reference TRY_CONVERT](https://docs.microsoft.com/en-us/sql/t-sql/functions/try-convert-transact-sql?view=sql-server-ver15))

#### Syntax

```sql
TRY_CONVERT ( data_type [ ( length ) ], expression [, style ] )
```

### Source Patterns

#### Basic Transformation

To transform this function, we have to check the parameters of the TRY_CONVERT first.

```sql
TRY_CONVERT( INT, 'test')
```

If the expression that needs to be casted is a string, it will be transfomed to TRY_CAST, which is a function of Snowflake.

```sql
TRY_CAST( 'test' AS INT)
```

#### TRY_CAST

The TRY_CAST shares the same transformation with TRY_CONVERT.

##### Example

##### Sql Server

```sql
SELECT TRY_CAST('12345' AS NUMERIC) NUMERIC_RESULT,
 TRY_CAST('123.45' AS DECIMAL(20,2)) DECIMAL_RESULT,
 TRY_CAST('123' AS INT) INT_RESULT,
 TRY_CAST('123.02' AS FLOAT) FLOAT_RESULT,
 TRY_CAST('123.02' AS DOUBLE PRECISION) DOUBLE_PRECISION_RESULT,

 TRY_CAST('2017-01-01 12:00:00' AS DATE) DATE_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS DATETIME) DATETIME_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS SMALLDATETIME) SMALLDATETIME_RESULT,
 TRY_CAST('12:00:00' AS TIME) TIME_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS TIMESTAMP) TIMESTAMP_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS DATETIMEOFFSET) DATETIMEOFFSET_RESULT,

 TRY_CAST(1234 AS VARCHAR) VARCHAR_RESULT,
 TRY_CAST(1 AS CHAR) CHAR_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS SQL_VARIANT) SQL_VARIANT_RESULT,
 TRY_CAST('LINESTRING(-122.360 47.656, -122.343 47.656 )' AS GEOGRAPHY) GEOGRAPHY_RESULT;
```

The result will be the same with the example of TRY_CONVERT.

##### Snowflake

```sql
SELECT
 TRY_CAST('12345' AS NUMERIC(38, 18)) NUMERIC_RESULT,
 TRY_CAST('123.45' AS DECIMAL(20,2)) DECIMAL_RESULT,
 TRY_CAST('123' AS INT) INT_RESULT,
 TRY_CAST('123.02' AS FLOAT) FLOAT_RESULT,
 TRY_CAST('123.02' AS DOUBLE PRECISION) DOUBLE_PRECISION_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS DATE) DATE_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS TIMESTAMP_NTZ(3)) DATETIME_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS TIMESTAMP_NTZ(0)) SMALLDATETIME_RESULT,
 TRY_CAST('12:00:00' AS TIME(7)) TIME_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS BINARY(8)) TIMESTAMP_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS TIMESTAMP_TZ(7)) DATETIMEOFFSET_RESULT,
 TO_VARCHAR(1234) VARCHAR_RESULT,
 TO_CHAR(1) CHAR_RESULT,
 TRY_CAST('2017-01-01 12:00:00' AS VARIANT) SQL_VARIANT_RESULT,
 TRY_CAST('LINESTRING(-122.360 47.656, -122.343 47.656 )' AS GEOGRAPHY) GEOGRAPHY_RESULT;
```

### Known Issues

If the data type is Varchar or Char, then it will be transformed differently.

```sql
TRY_CONVERT(VARCHAR, 1234);
TRY_CONVERT(CHAR, 1);
```

If TRY_CAST is used with VARCHAR or CHAR in Snowflake, it will cause an error, so it will be transformed to

```sql
TO_VARCHAR(1234);
TO_CHAR(1);
```

The same happens with the data types of SQL_VARIANT and GEOGRAPHY.

```sql
TRY_CONVERT(SQL_VARIANT, '2017-01-01 12:00:00');
TRY_CONVERT(GEOGRAPHY, 'LINESTRING(-122.360 47.656, -122.343 47.656 )');
```

Are transformed to

```sql
TO_VARIANT('2017-01-01 12:00:00');
TO_GEOGRAPHY('LINESTRING(-122.360 47.656, -122.343 47.656 )');
```

If the expression is not a string, there is a very high chance that it will fail, since the TRY_CAST of snowflake works only with string expressions.

In this case, another transformation will be done

```sql
TRY_CAST(14.85 AS INT)
```

Will be transformed to

```sql
CAST(14.85 AS INT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/
```

Now, with these transformation, there could be problems depending on what is being done with the functions. The TRY_CONVERT of SqlServer returns nulls if the convertion was not possible.

This can be used to do logic like this

```sql
SELECT
    CASE
        WHEN TRY_CONVERT( INT, 'Expression') IS NULL
        THEN 'FAILED'
        ELSE 'SUCCEDDED'
    END;
```

That type of conditions with the TRY_CONVERT can be used with the TRY_CAST, but what happens if it is transformed to TO_VARCHAR, TOCHAR or to the CAST? If the convertion in those functions fails, it will cause an error instead of just returning null.

#### Examples

In this sample we have several TRY_CONVERT with different data types

##### SQL Server

```sql
SELECT TRY_CONVERT(NUMERIC, '12345') NUMERIC_RESULT,
 TRY_CONVERT(DECIMAL(20,2), '123.45') DECIMAL_RESULT,
 TRY_CONVERT(INT, '123') INT_RESULT,
 TRY_CONVERT(FLOAT, '123.02') FLOAT_RESULT,
 TRY_CONVERT(DOUBLE PRECISION, '123.02') DOUBLE_PRECISION_RESULT,

 TRY_CONVERT(DATE, '2017-01-01 12:00:00') DATE_RESULT,
 TRY_CONVERT(DATETIME, '2017-01-01 12:00:00') DATETIME_RESULT,
 TRY_CONVERT(SMALLDATETIME, '2017-01-01 12:00:00') SMALLDATETIME_RESULT,
 TRY_CONVERT(TIME, '12:00:00') TIME_RESULT,
 TRY_CONVERT(TIMESTAMP, '2017-01-01 12:00:00') TIMESTAMP_RESULT,
 TRY_CONVERT(DATETIMEOFFSET, '2017-01-01 12:00:00') DATETIMEOFFSET_RESULT,

 TRY_CONVERT(VARCHAR, 1234) VARCHAR_RESULT,
 TRY_CONVERT(CHAR, 1) CHAR_RESULT,
 TRY_CONVERT(SQL_VARIANT, '2017-01-01 12:00:00') SQL_VARIANT_RESULT,
 TRY_CONVERT(GEOGRAPHY, 'LINESTRING(-122.360 47.656, -122.343 47.656 )') GEOGRAPHY_RESULT;
```

If we migrate that select, we will get the following result

##### Snowflake

```sql
SELECT
 CAST('12345' AS NUMERIC(38, 18)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ NUMERIC_RESULT,
 CAST('123.45' AS DECIMAL(20,2)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ DECIMAL_RESULT,
 CAST('123' AS INT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ INT_RESULT,
 CAST('123.02' AS FLOAT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ FLOAT_RESULT,
 CAST('123.02' AS DOUBLE PRECISION) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ DOUBLE_PRECISION_RESULT,
 CAST('2017-01-01 12:00:00' AS DATE) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ DATE_RESULT,
 CAST('2017-01-01 12:00:00' AS TIMESTAMP_NTZ(3)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ DATETIME_RESULT,
 CAST('2017-01-01 12:00:00' AS TIMESTAMP_NTZ(0)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ SMALLDATETIME_RESULT,
 CAST('12:00:00' AS TIME(7)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ TIME_RESULT,
 CAST('2017-01-01 12:00:00' AS BINARY(8)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ TIMESTAMP_RESULT,
 CAST('2017-01-01 12:00:00' AS TIMESTAMP_TZ(7)) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/ DATETIMEOFFSET_RESULT,
 TO_VARCHAR(1234) VARCHAR_RESULT,
 TO_CHAR(1) CHAR_RESULT,
 TO_VARIANT('2017-01-01 12:00:00') SQL_VARIANT_RESULT,
 TO_GEOGRAPHY('LINESTRING(-122.360 47.656, -122.343 47.656 )') GEOGRAPHY_RESULT;
```

Let’s execute each one and compare the result.

| Alias | SqlServer Result | Snowflake Result |
| --- | --- | --- |
| NUMERIC_RESULT | 12345 | 12345 |
| DECIMAL_RESULT | 123.45 | 123.45 |
| INT_RESULT | 123 | 123 |
| FLOAT_RESULT | 123.02 | 123.02 |
| DOUBLE_PRECISION_RESULT | 123.02 | 123.02 |
| DATE_RESULT | 2017-01-01 | 2017-01-01 |
| DATETIME_RESULT | 2017-01-01 12:00:00.000 | 2017-01-01 12:00:00.000 |
| SMALLDATETIME_RESULT | 2017-01-01 12:00:00 | 2017-01-01 12:00:00.000 |
| TIME_RESULT | 12:00:00.0000000 | 12:00:00 |
| TIMESTAMP_RESULT | 0x323031372D30312D | 2017-01-01 12:00:00.000 |
| DATETIMEOFFSET_RESULT | 2017-01-01 12:00:00.0000000 +00:00 | 2017-01-01 12:00:00.000 -0800 |
| VARCHAR_RESULT | 1234 | 1234 |
| CHAR_RESULT | 1 | 1 |
| SQL_VARIANT_RESULT | 2017-01-01 12:00:00 | “2017-01-01 12:00:00” |
| GEOGRAPHY_RESULT | 0xE610000001148716D9CEF7D34740D7A3703D0A975EC08716D9CEF7D34740CBA145B6F3955EC0 | { “coordinates”: [ [ -122.36, 47.656 ], [ -122.343, 47.656 ] ], “type”: “LineString” } |

### Related EWIs

1. [SSC-FDM-TS0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): TRY_CONVERT/TRY_CAST could not be converted to TRY_CAST.

## Date & Time functions

This section describes the functional equivalents of date & time functions in Transact-SQL to Snowflake SQL and JavaScript code.

## AT TIME ZONE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Converts an *inputdate* to the corresponding *datetimeoffset* value in the target time zone. ([AT TIME ZONE in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/queries/at-time-zone-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
inputdate AT TIME ZONE timezone
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone.html)

```sql
CONVERT_TIMEZONE( <source_tz> , <target_tz> , <source_timestamp_ntz> )

CONVERT_TIMEZONE( <target_tz> , <source_timestamp> )
```

### Examples

#### SQL Server

```sql
SELECT CAST('2022-11-24 11:00:45.2000000 +00:00' as datetimeoffset) at time zone 'Alaskan Standard Time';
```

**Result:**

```sql
                          DATE|
------------------------------|
2022-11-24 02:00:45.200 -09:00|
```

##### Snowflake SQL

```sql
SELECT
CONVERT_TIMEZONE('America/Anchorage', CAST('2022-11-24 11:00:45.2000000 +00:00' as TIMESTAMP_TZ(7)));
```

**Result:**

```sql
                          DATE|
------------------------------|
2022-11-24 02:00:45.200 -09:00|
```

##### SQL Server

```sql
SELECT current_timestamp at time zone 'Central America Standard Time';
```

**Result:**

```sql
                          DATE|
------------------------------|
2022-10-10 10:55:50.090 -06:00|
```

##### Snowflake SQL

```sql
SELECT
CONVERT_TIMEZONE('America/Costa_Rica', CURRENT_TIMESTAMP() /*** SSC-FDM-TS0024 - CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases ***/);
```

**Result:**

```sql
                          DATE|
------------------------------|
2022-10-10 10:55:50.090 -06:00|
```

### Known Issues

1. Snowflake does not support all the time zones that SQL Server does. You can check the supported time zones at this [link](https://docs.snowflake.com/en/sql-reference/functions/convert_timezone.html).

#### SQL Server

```sql
SELECT current_timestamp at time zone 'Turks And Caicos Standard Time';
```

**Result:**

```sql
                          DATE|
------------------------------|
2022-12-14 20:04:18.317 -05:00|
```

##### Snowflake SQL

```sql
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0063 - TIME ZONE NOT SUPPORTED IN SNOWFLAKE ***/!!!
CURRENT_TIMESTAMP() at time zone 'Turks And Caicos Standard Time';
```

### Related EWIs

1. [SSC-FDM-TS0024](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases.
2. [SSC-EWI-TS0063](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Time zone not supported in Snowflake.

## DATEADD

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns an integer representing the specified datepart of the specified date. ([DATEPART in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/abs-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATEADD (datepart , number , date )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/dateadd.html)

```sql
DATEADD( <date_or_time_part>, <value>, <date_or_time_expr> )
```

### Examples

#### SQL Server

```sql
SELECT DATEADD(year,123, '20060731') as ADDDATE;
```

**Result:**

```sql
                 ADDDATE|
------------------------|
 2129-07-31 00:00:00.000|
```

##### Snowflake SQL

```sql
SELECT
DATEADD(year, 123, '20060731') as ADDDATE;
```

**Result:**

```sql
                 ADDDATE|
------------------------|
 2129-07-31 00:00:00.000|
```

## DATEDIFF

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the count (as a signed integer value) of the specified datepart boundaries crossed between the specified startdate and enddate. ([DATEDIFF in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/datediff-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATEDIFF ( datepart , startdate , enddate )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/datediff.html)

```sql
DATEDIFF( <date_or_time_part>, <date_or_time_expr1>, <date_or_time_expr2> )
```

### Examples

#### SQL Server

```sql
SELECT DATEDIFF(year,'2005-12-31 23:59:59.9999999', '2006-01-01 00:00:00.0000000');
```

**Result:**

| DIFF |
| --- |
| 1 |

##### Snowflake SQL

```sql
SELECT DATEDIFF(year,'2005-12-31 23:59:59.9999999', '2006-01-01 00:00:00.0000000');
```

**Result:**

| DIFF |
| --- |
| 1 |

## DATEFROMPARTS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns a **date** value that maps to the specified year, month, and day values.([DATEFROMPARTS in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/datefromparts-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATEFROMPARTS ( year, month, day )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/date_from_parts.html)

```sql
DATE_FROM_PARTS( <year>, <month>, <day> )
```

### Examples

#### SQL Server

```sql
SELECT DATEFROMPARTS ( 2010, 12, 31 ) AS RESULT;
```

**Result:**

| RESULT |
| --- |
| 2022-12-12 |

##### Snowflake SQL

```sql
SELECT DATE_FROM_PARTS ( 2010, 12, 31 ) AS RESULT;
```

**Result:**

| RESULT |
| --- |
| 2022-12-12 |

## DATENAME

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns a character string representing the specified datepart of the specified date. ([DATENAME in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/datename-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATENAME ( datepart , date )
```

##### Snowflake SQL

> **Note:**
>
> This transformation uses several functions depending on the inputs

```sql
DATE_PART( <date_or_time_part> , <date_or_time_expr> )
MONTHNAME( <date_or_timestamp_expr> )
DAYNAME( <date_or_timestamp_expr> )
```

### Examples

#### SQL Server

```sql
SELECT DATENAME(month, getdate()) AS DATE1,
DATENAME(day, getdate()) AS DATE2,
DATENAME(dw, GETDATE()) AS DATE3;
```

**Result:**

| DATE1 | DATE2 | DATE3 |
| --- | --- | --- |
| May | 3 | Tuesday |

##### Snowflake SQL

```sql
SELECT
MONTHNAME_UDF(CURRENT_TIMESTAMP() :: TIMESTAMP) AS DATE1,
DAYNAME_UDF(CURRENT_TIMESTAMP() :: TIMESTAMP) AS DATE2,
DAYNAME(CURRENT_TIMESTAMP() :: TIMESTAMP) AS DATE3;
```

**Result:**

| DATE1 | DATE2 | DATE3 |
| --- | --- | --- |
| May | Tue | Tue |

## DATEPART

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns an integer representing the specified datepart of the specified date. ([DATEPART in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/abs-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DATEPART ( datepart , date )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/date_part.html)

```sql
DATE_PART( <date_or_time_part> , <date_or_time_expr> )
```

### Examples

#### SQL Server

```sql
SELECT DATEPART(YEAR, '10-10-2022') as YEAR
```

**Result:**

| YEAR |
| --- |
| 2022 |

##### Snowflake SQL

```sql
SELECT
DATE_PART(YEAR, '10-10-2022' :: TIMESTAMP) as YEAR;
```

**Result:**

| YEAR |
| --- |
| 2022 |

## DAY

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns an integer that represents the day (day of the month) of the specified date. ([DAY in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/day-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
DAY ( date )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/year.html)

```sql
DAY( <date_or_timestamp_expr> )
```

### Examples

#### SQL Server

```sql
SELECT DAY('10-10-2022') AS DAY
```

**Result:**

| DAY |
| --- |
| 10 |

##### Snowflake SQL

```sql
SELECT DAY('10-10-2022' :: TIMESTAMP) AS DAY;
```

**Result:**

| DAY |
| --- |
| 10 |

## EOMONTH

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

This function returns the last day of the month containing a specified date, with an optional offset. ([EOMONTH in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/eomonth-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
EOMONTH ( start_date [, month_to_add ] )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/last_day.html)

```sql
LAST_DAY( <date_or_time_expr> [ , <date_part> ] )
```

### Examples

#### SQL Server

```sql
SELECT EOMONTH (GETDATE()) AS Result;
```

**Result:**

| RESULT |
| --- |
| 2022-05-31 |

##### Snowflake SQL

```sql
SELECT
LAST_DAY(DATEADD('month', 0, CURRENT_TIMESTAMP() :: TIMESTAMP)) AS Result;
```

**Result:**

| RESULT |
| --- |
| 2022-05-31 |

## GETDATE

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns the current database system timestamp as a **datetime** value without the database time zone offset. ([GETDATE in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/getdate-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
GETDATE()
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp.html)

```sql
CURRENT_TIMESTAMP( [ <fract_sec_precision> ] )
```

### Examples

#### SQL Server

```sql
SELECT GETDATE() AS DATE;
```

**Result:**

| DATE |
| --- |
| 2022-05-06 09:54:42.757 |

##### Snowflake SQL

```sql
SELECT CURRENT_TIMESTAMP() :: TIMESTAMP AS DATE;
```

**Result:**

| DATE |
| --- |
| 2022-05-06 08:55:05.422 |

## MONTH

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns an integer that represents the month of the specified *date*. ([MONTH in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/month-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
MONTH( date )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/year.html)

```sql
MONTH ( <date_or_timestamp_expr> )
```

### Examples

#### SQL Server

```sql
SELECT MONTH('10-10-2022') AS MONTH
```

**Result:**

| MONTH |
| --- |
| 10 |

##### Snowflake SQL

```sql
SELECT MONTH('10-10-2022' :: TIMESTAMP) AS MONTH;
```

**Result:**

| MONTH |
| --- |
| 10 |

## SWITCHOFFSET

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The SWITCHOFFSET adjusts a given timestamp value to a specific timezone offset. This is done through numerical values. More information can be found at [SWITCHOFFSET (Transact-SQL)](https://learn.microsoft.com/en-us/sql/t-sql/functions/switchoffset-transact-sql?view=sql-server-ver16).

### Sample Source Pattern

#### Syntax

A UDF Helper accomplish functional equivalence, also it shares the same syntax as the SQLServer’s SWITCHOFFSET function.

##### SQLServer

```sql
 SWITCHOFFSET ( datetimeoffset_expression, timezoneoffset_expression )
```

##### Snowflake SQL

```sql
 SWITCHOFFSET_UDF ( timestamp_tz_expression, timezoneoffset_expression )
```

#### Example

##### SQLServer

```sql
SELECT
  '1998-09-20 7:45:50.71345 +02:00' as fr_time,
  SWITCHOFFSET('1998-09-20 7:45:50.71345 +02:00', '-06:00') as cr_time;
```

**Result:**

| fr_time | cr_time |
| --- | --- |
| 1998-09-20 7:45:50.71345 +02:00 | 1998-09-19 23:45:50.7134500 -06:00 |

##### Snowflake SQL

```sql
SELECT
  '1998-09-20 7:45:50.71345 +02:00' as fr_time,
  PUBLIC.SWITCHOFFSET_UDF('1998-09-20 7:45:50.71345 +02:00', '-06:00') as cr_time;
```

**Result:**

| fr_time | cr_time |
| --- | --- |
| 1998-09-20 7:45:50.71345 +02:00 | 1998-09-19 23:45:50.7134500 -06:00 |

## SYSDATETIME

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a datetime2(7) value that contains the date and time of the computer on which the instance of SQL Server is running. ([SYSDATETIME in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/sysdatetime-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SYSDATETIME ( )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/localtime.html)

```sql
LOCALTIME()
```

### Examples

#### SQL Server

```sql
SELECT SYSDATETIME ( ) AS SYSTEM_DATETIME;
```

**Result:**

| SYSTEM_DATETIME |
| --- |
| 2022-05-06 12:08:05.501 |

##### Snowflake SQL

```sql
SELECT LOCALTIME ( ) AS SYSTEM_DATETIME;
```

**Result:**

| SYSTEM_DATETIME |
| --- |
| 211:09:14 |

## SYSUTCDATETIME

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns a datetime2(7) value that contains the date and time of the computer on which the instance of SQL Server is running. ([SYSUTCDATETIME in Transact-SQL](https://learn.microsoft.com/en-us/sql/t-sql/functions/sysutcdatetime-transact-sql?view=sql-server-ver16)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
SYSUTCDATETIME ( )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/localtime.html)

```sql
SYSDATE()
```

### Examples

#### SQL Server

```sql
SELECT SYSUTCDATETIME() as SYS_UTC_DATETIME;
```

**Result:**

| SYSTEM_UTC_DATETIME |
| --- |
| 2023-02-02 20:59:28.0926502 |

##### Snowflake SQL

```sql
SELECT
SYSDATE() as SYS_UTC_DATETIME;
```

**Result:**

| SYSTEM_UTC_DATETIME |
| --- |
| 2023-02-02 21:02:05.557 |

## YEAR

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Returns an integer that represents the year of the specified *date*. ([YEAR in Transact-SQL](https://docs.microsoft.com/en-us/sql/t-sql/functions/year-transact-sql?view=sql-server-ver15)).

### Sample Source Pattern

#### Syntax

##### SQL Server

```sql
YEAR( date )
```

##### Snowflake SQL

[Snowflake SQL Documentation](https://docs.snowflake.com/en/sql-reference/functions/year.html)

```sql
YEAR ( <date_or_timestamp_expr> )
```

### Examples

#### SQL Server

```sql
SELECT YEAR('10-10-2022') AS YEAR
```

**Result:**

| YEAR |
| --- |
| 2022 |

##### Snowflake SQL

```sql
SELECT YEAR('10-10-2022' :: TIMESTAMP) AS YEAR;
```

**Result:**

| YEAR |
| --- |
| 2022 |

---
title: SnowConvert AI - SQL Server-Azure Synapse - Built-in procedures
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-built-in-procedures.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Built-in procedures

## SP_ADDEXTENDEDPROPERTY_UDP

Applies to

* SQL Server

### Description

Adds a new extended property to a database object.

#### SQLServer syntax

```sql
 sp_addextendedproperty
    [ @name = ] N'name'
    [ , [ @value = ] value ]
    [ , [ @level0type = ] 'level0type' ]
    [ , [ @level0name = ] N'level0name' ]
    [ , [ @level1type = ] 'level1type' ]
    [ , [ @level1name = ] N'level1name' ]
    [ , [ @level2type = ] 'level2type' ]
    [ , [ @level2name = ] N'level2name' ]
[ ; ]
```

### Custom UDP

Keeps the same parameters as the original procedure

#### UDP

```sql
 -- <copyright file="SP_ADDEXTENDEDPROPERTY_UDP.sql" company="Snowflake Inc">
--        Copyright (c) 2019-2023 Snowflake Inc. All rights reserved.
-- </copyright>

-- =======================================================================================================
-- Description: The sp_addextendedproperty provides an equivalent functionality for adding extended
--              properties in Snowflake. This version is only supporting 'MS_Description' property to
--              add comments at schema/table/view/procedure/function level.
--              Comments on columns are only supported for tables.
--              If the name of the object includes double quotes, they need to be added as part of the
--              parameter values, for example level1name='"My_Col"'.

-- Parameters:
--   name:       Name of the extended property. 'MS_Description' is the only supported in this version.
--   value:      Value of the extended property. Cannot be null for 'MS_Description' property.
--   level0type: Type of level 0 object. SCHEMA is the only supported value in this version.
--   level0name: Value associated to the level 0 object.
--   level1type: Type of level 1 object. TABLE/VIEW/PROCEDURE/FUNCTION are the only supported values in this
--               version.
--   level1name: Value associated to the level 1 object.
--   level2type: Type of level 2 object. COLUMN is the only supported value in this version.
--   level2name: Value associated to the level 2 object.

-- Return:      This procedure returns a message with the result of the execution. If an exception occurs,
--              the exception is raised.
-- =======================================================================================================

CREATE OR REPLACE PROCEDURE SP_ADDEXTENDEDPROPERTY_UDP(
    name varchar,
    value varchar,
    level0type varchar DEFAULT '',
    level0name varchar DEFAULT '',
    level1type varchar DEFAULT '',
    level1name varchar DEFAULT '',
    level2type varchar DEFAULT '',
    level2name varchar DEFAULT '')

RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
   DECLARE  stmt VARCHAR;
            str_result VARCHAR;
BEGIN
    IF(lower(name) = 'ms_description') THEN --Comments on
        IF (value IS NOT NULL) THEN

            --Comment on table column
            IF(lower(level0type) = 'schema' and lower(level1type) = 'table' and lower(level2type) = 'column') THEN
                stmt := 'COMMENT ON COLUMN ' || level0name || '.' || level1name || '.' || level2name || ' IS ''' || value || ''';';

            --Comment on table/view/procedure/function
            ELSEIF(lower(level0type) = 'schema' and lower(level1type) in ('table', 'view', 'procedure', 'function') and level2type IS NULL) THEN
                stmt := 'COMMENT ON ' || upper(level1type) || ' ' || level0name || '.' || level1name || ' IS ''' || value || ''';';

            --Comment on schema
            ELSEIF(lower(level0type) = 'schema' and level1type IS NULL) THEN
                stmt := 'COMMENT ON ' || upper(level0type) || ' ' || level0name || ' IS ''' || value || ''';';

ELSE
                str_result := 'ERROR: COMMENT ON level0type: ' || level0type || ' | level1type: ' || nvl(level1type,'') || ' | level2type: ' || nvl(level2type,'') || ' is not supported yet.';
END IF;

            IF(stmt IS NOT NULL) THEN
                EXECUTE IMMEDIATE :stmt;
                str_result := name || ' extended property was successfully created.';
END IF;
ELSE
            str_result := 'ERROR: NULL value for COMMENT ON is not supported.';
END IF;
ELSE
        str_result := 'ERROR: ' || name || ' extended property is not supported yet.';
END IF;
RETURN str_result;
END;
```

##### SQL Server

```sql
 EXEC sys.sp_addextendedproperty @name=N'MS_Description', @value=N'Technical identifier.' , @level0type=N'SCHEMA',@level0name=N'Monitoring', @level1type=N'TABLE',@level1name=N'tProcessingIssue', @level2type=N'COLUMN',@level2name=N'ID'
```

##### Snowflake

```sql
 CALL SP_ADDEXTENDEDPROPERTY_UDP('MS_Description', 'Technical identifier.', 'SCHEMA', 'Monitoring', 'TABLE', 'tProcessingIssue', 'COLUMN', 'ID');
```

---
title: SnowConvert AI - SQL Server-Azure Synapse - CONTINUE HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-continue-handler.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - CONTINUE HANDLER

## Description

In SQL Server and Azure Synapse Analytics, exception handling is primarily managed through `TRY...CATCH` blocks. Unlike some other database systems (such as Teradata or DB2), SQL Server does not have a native `DECLARE CONTINUE HANDLER` statement.

However, when migrating code from other database systems that use CONTINUE HANDLERs, SnowConvert AI transforms these constructs into equivalent Snowflake Scripting exception handling mechanisms.

A CONTINUE HANDLER in the source system allows execution to continue after an error occurs, performing specific actions when certain conditions are met. In Snowflake, this is achieved using EXCEPTION blocks with conditional logic.

For more information about SQL Server error handling, see [TRY…CATCH (Transact-SQL)](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/try-catch-transact-sql).

## Grammar Syntax

SQL Server does not have native CONTINUE HANDLER syntax. However, when converting from other database systems, the source pattern typically looks like:

```sql
-- Pattern from source systems (e.g., DB2, Teradata)
DECLARE CONTINUE HANDLER FOR condition_value
  handler_action_statement;
```

## Sample Source Patterns

### CONTINUE HANDLER Conversion from DB2/Teradata

When migrating stored procedures from DB2 or Teradata that contain CONTINUE HANDLER declarations, SnowConvert AI transforms them into Snowflake-compatible exception handling.

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
-- Example pattern from source system
CREATE PROCEDURE example_procedure()
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLSTATE '02000'
    BEGIN
        -- Handler action
        SET error_count = error_count + 1;
    END;

    -- Main procedure logic
    SELECT * FROM non_existent_table;
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE example_procedure()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        error_count INTEGER := 0;
    BEGIN
        BEGIN
            -- Main procedure logic
            SELECT * FROM non_existent_table;
        EXCEPTION
            WHEN OTHER THEN
                -- Handler action
                error_count := error_count + 1;
                -- Continue execution
        END;
    END;
$$;
```

### CONTINUE HANDLER with SQLEXCEPTION

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE handler_example()
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO error_log VALUES (SQLCODE, SQLERRM);

    -- Procedure body with multiple statements
    DELETE FROM table1 WHERE id = 0/0;
    INSERT INTO table2 VALUES (1, 'Success');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        BEGIN
            -- Procedure body with multiple statements
            DELETE FROM table1 WHERE id = 0/0;
        EXCEPTION
            WHEN OTHER THEN
                INSERT INTO error_log
                SELECT :SQLCODE, :SQLERRM;
                -- Continue execution
        END;

        INSERT INTO table2 VALUES (1, 'Success');
    END;
$$;
```

## Known Issues

### Limited CONTINUE HANDLER Support

Applies to

* SQL Server
* Azure Synapse Analytics

SQL Server’s native `TRY...CATCH` mechanism does not have an exact equivalent to CONTINUE HANDLER. When an error occurs in a TRY block, control immediately passes to the CATCH block, and execution does not continue from the point of error.

SnowConvert AI attempts to emulate CONTINUE HANDLER behavior in Snowflake, but there are limitations:

1. **Execution Flow**: True CONTINUE HANDLER behavior (continuing from the exact point of error) cannot be fully replicated.
2. **Statement-level Wrapping**: Individual statements may need to be wrapped in separate exception blocks.
3. **Performance**: Multiple nested exception blocks can impact performance.

#### Known Issues

When migrating CONTINUE HANDLER patterns from other database systems through SQL Server to Snowflake, be aware that exception handling behavior may differ. The TRY…CATCH pattern in SQL Server is converted to Snowflake’s EXCEPTION blocks, but semantic differences may exist. Thorough testing is recommended to ensure the converted code maintains the intended behavior.

### SQLWARNING and NOT FOUND Conditions

Applies to

* SQL Server
* Azure Synapse Analytics

CONTINUE HANDLERs for SQLWARNING and NOT FOUND conditions require special handling in Snowflake:

* **SQLWARNING**: Snowflake does not distinguish between warnings and errors in the same way as source systems.
* **NOT FOUND**: Typically used for cursor operations or SELECT INTO statements that return no rows.

#### Example

##### Source Pattern

```sql
DECLARE CONTINUE HANDLER FOR NOT FOUND
    SET done = TRUE;
```

##### Snowflake

```sql
-- Handled through conditional logic rather than exception handling
IF (SELECT COUNT(*) FROM table1) = 0 THEN
    done := TRUE;
END IF;
```

## Best Practices

When working with converted CONTINUE HANDLER code:

1. **Review Exception Handling**: Verify that the converted exception handling logic matches the intended behavior.
2. **Test Error Scenarios**: Thoroughly test error conditions to ensure the application behavior is correct.
3. **Consider Refactoring**: In some cases, refactoring the error handling logic may provide better performance and maintainability.
4. **Use Transactions**: Leverage Snowflake’s transaction support to ensure data consistency.

## Related Documentation

* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [SQL Server TRY…CATCH](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/try-catch-transact-sql)
* [TRY CATCH Translation Reference](transact-create-procedure-snow-script.md)

## See Also

* [CREATE PROCEDURE](transact-create-procedure.md)
* [CREATE PROCEDURE - Snowflake Scripting](transact-create-procedure-snow-script.md)
* [General Statements](transact-general-statements.md)

---
title: SnowConvert AI - SQL Server-Azure Synapse - CREATE INDEX
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-index.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - CREATE INDEX

Translation reference to convert CREATE INDEX statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics
> **Warning:**
>
> Currently, ***Create Index*** statement is not being converted but it is being parsed. Also, if your source code has Create `index` statements, these are going to be accounted for in the ***Assessment Report.***

**Example of Create Index**

## SQLServer

```sql
CREATE INDEX my_index_name ON my_table (column1, column2);

CREATE TABLE table_1(
   date_time DATETIME,
   INDEX ix_PatientBaseEpisodes_Version NONCLUSTERED (VersionStamp)
) ON [PRIMARY]
```

## Snowflake

```sql
 ----** SSC-FDM-0021 - CREATE INDEX IS NOT SUPPORTED BY SNOWFLAKE **
--CREATE INDEX my_index_name ON my_table (column1, column2)

CREATE OR REPLACE TABLE table_1 (
  date_time TIMESTAMP_NTZ(3)
--                            ,
--  --** SSC-FDM-0021 - CREATE INDEX IS NOT SUPPORTED BY SNOWFLAKE **
--   INDEX ix_PatientBaseEpisodes_Version NONCLUSTERED (VersionStamp)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "06/06/2025",  "domain": "no-domain-provided" }}'
;
```

> **Note:**
>
> Due to architectural reasons, Snowflake does not support indexes so, SnowConvert AI will remove all the code related to the creation of indexes. Snowflake automatically creates micro-partitions for every table that help speed up the performance of DML operations, the user does not have to worry about creating or managing these micro-partitions.
>
> Usually, this is enough to have a very good query performance however, there are ways to improve it by creating data clustering keys. [Snowflake’s official page](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.html) provides more information about micro-partitions and data clustering.

---
title: SnowConvert AI - SQL Server-Azure Synapse - CREATE PROCEDURE (Snowflake Scripting)
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-procedure-snow-script.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - CREATE PROCEDURE (Snowflake Scripting)

## BEGIN and COMMIT Transaction

Translation reference to convert Transact-SQL BEGIN and COMMIT transaction to Snowflake SQL

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Snowflake SQL, a transaction can be started explicitly by executing a BEGIN statement. Snowflake supports the synonyms `BEGIN WORK` and `BEGIN TRANSACTION`. Snowflake recommends using `BEGIN TRANSACTION`.

A transaction can be ended explicitly by executing COMMIT. For more information, see the [Snowflake Transactions documentation](https://docs.snowflake.com/en/sql-reference/transactions.html).

### Sample Source Patterns

The following examples detail the BEGIN and COMMIT transaction statements.

#### Transact-SQL

##### BEGIN/COMMIT TRANSACTION

```sql
CREATE PROCEDURE TestTransaction
AS
BEGIN
    DROP TABLE IF EXISTS NEWTABLE;
    CREATE TABLE NEWTABLE(COL1 INT, COL2 VARCHAR);
      BEGIN TRANSACTION;
         INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
         INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
      COMMIT TRANSACTION;
END
```

##### Begin/Commit transaction with label

```sql
CREATE PROCEDURE TestTransaction
AS
BEGIN
    DROP TABLE IF EXISTS NEWTABLE;
    CREATE TABLE NEWTABLE(COL1 INT, COL2 VARCHAR);
      BEGIN TRANSACTION LabelA;
        INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
        INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
      COMMIT TRANSACTION LabelA;
END
```

##### Snowflake SQL

##### BEGIN/COMMIT

```sql
CREATE OR REPLACE PROCEDURE TestTransaction ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DROP TABLE IF EXISTS NEWTABLE;
        CREATE OR REPLACE TABLE NEWTABLE (
            COL1 INT,
            COL2 VARCHAR
        );
            BEGIN TRANSACTION;
            INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
         INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
            COMMIT;
    END;
$$;
```

##### BEGIN/COMMIT transaction with label

```sql
 CREATE OR REPLACE PROCEDURE TestTransaction ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        DROP TABLE IF EXISTS NEWTABLE;
        CREATE OR REPLACE TABLE NEWTABLE (
            COL1 INT,
            COL2 VARCHAR
        );
            BEGIN TRANSACTION
            !!!RESOLVE EWI!!! /*** SSC-EWI-0101 - COMMENTED OUT TRANSACTION LABEL NAME BECAUSE IS NOT APPLICABLE IN SNOWFLAKE ***/!!!
            LabelA;
            INSERT INTO NEWTABLE VALUES (1, 'MICHAEL');
        INSERT INTO NEWTABLE VALUES(2, 'JACKSON');
            COMMIT;
    END;
$$;
```

### Known Issues

1. Nested transactions are not supported in Snowflake. Review the following documentation for more information: <https://docs.snowflake.com/en/sql-reference/transactions>

### Related EWIs

1. [SSC-EWI-0101](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Commented out transaction label name because is not applicable in Snowflake.

## CALL

Translation reference for CALL statement

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The CALL statement is not supported in Snowflake Scripting since this is part of the ODBC API and not a SQL statement, therefore this statement is not translated.

## CASE

Translation reference to convert Transact-SQL Case expression to Snowflake Scripting

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Transact-SQL has two possible formats of the Case expression. both of them for the purpose of evaluating expressions and conditionally obtaining results. The first one refers to a Simple Case Expression that will evaluate if an input_expression matches one or more of the when_expression. The second one will evaluate each Boolean_expression independently. The else clause is supported in both formats.

According to the official Transact-SQL Case documentation:

CASE can be used in any statement or clause that allows a valid expression. For example, you can use CASE in statements such as SELECT, UPDATE, DELETE and SET, and in clauses such as select_list, IN, WHERE, ORDER BY, and HAVING.

For more information, see the [Transact-SQL CASE documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/case-transact-sql?view=sql-server-ver15).

```sql
 -- Simple CASE expression:
CASE input_expression
     WHEN when_expression THEN result_expression [ ...n ]
     [ ELSE else_result_expression ]
END

-- Searched CASE expression:
CASE
     WHEN boolean_expression THEN result_expression [ ...n ]
     [ ELSE else_result_expression ]
END
```

Note: Transact-SQL allows to optionally encapsulate the input_expression and the boolean_expression in parentheses; Snowflake Scripting too.

### Sample Source Patterns

The following examples detail two scenarios where the Case expression can be used and their differences from Snowflake Scripting.

#### Select using Case

##### Transact-SQL

##### Simple CASE

```sql
CREATE OR ALTER PROCEDURE SelectCaseDemoProcedure
AS
      SELECT TOP 10
          LOGINID,
          CASE (MARITALSTATUS)
              WHEN 'S' THEN 'SINGLE'
              WHEN 'M' THEN 'MARIED'
              ELSE 'OTHER'
          END AS status
      FROM HUMANRESOURCES.EMPLOYEE;
GO

EXEC SelectCaseDemoProcedure;
```

##### Searched CASE

```sql
CREATE OR ALTER PROCEDURE SelectCaseDemoProcedure
AS
      SELECT TOP 10
          LOGINID,
          CASE
              WHEN MARITALSTATUS = 'S' THEN 'SINGLE'
              WHEN MARITALSTATUS = 'M' THEN 'MARIED'
              ELSE 'OTHER'
          END AS status
      FROM HUMANRESOURCES.EMPLOYEE;
GO

EXEC SelectCaseDemoProcedure;
```

##### Result

| sqlLOGINID | status |
| --- | --- |
| adventure-works\ken0 | SINGLE |
| adventure-works\terri0 | SINGLE |
| adventure-works\roberto0 | MARIED |
| adventure-works\rob0 | SINGLE |
| adventure-works\gail0 | MARIED |
| adventure-works\jossef0 | MARIED |
| adventure-works\dylan0 | MARIED |
| adventure-works\diane1 | SINGLE |
| adventure-works\gigi0 | MARIED |
| adventure-works\michael6 | MARIED |

##### Snowflake Scripting

Note that in this scenario there are no differences regarding the Case expression itself.

> **Warning:**
>
> The declaration and assignment of the `res` variable is to demonstrate the functional equivalence between both languages. It does not appear in the actual output.

##### Simple CASE

```sql
CREATE OR REPLACE PROCEDURE SelectCaseDemoProcedure ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
      DECLARE
            ProcedureResultSet RESULTSET;
      BEGIN
            ProcedureResultSet := (
            SELECT TOP 10
                  LOGINID,
                CASE (MARITALSTATUS)
                    WHEN 'S' THEN 'SINGLE'
                    WHEN 'M' THEN 'MARIED'
                    ELSE 'OTHER'
                END AS status
            FROM
                  HUMANRESOURCES.EMPLOYEE);
            RETURN TABLE(ProcedureResultSet);
      END;
$$;

CALL SelectCaseDemoProcedure();
```

##### Searched CASE

```sql
CREATE OR REPLACE PROCEDURE SelectCaseDemoProcedure ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
      DECLARE
            ProcedureResultSet RESULTSET;
      BEGIN
            ProcedureResultSet := (
            SELECT TOP 10
                  LOGINID,
                CASE
                    WHEN MARITALSTATUS = 'S' THEN 'SINGLE'
                    WHEN MARITALSTATUS = 'M' THEN 'MARIED'
                    ELSE 'OTHER'
                END AS status
            FROM
                  HUMANRESOURCES.EMPLOYEE);
            RETURN TABLE(ProcedureResultSet);
      END;
$$;

CALL SelectCaseDemoProcedure();
```

##### Result

| LOGINID | STATUS |
| --- | --- |
| adventure-worksken0 | SINGLE |
| adventure-works erri0 | SINGLE |
| adventure-worksoberto0 | MARIED |
| adventure-worksob0 | SINGLE |
| adventure-worksgail0 | MARIED |
| adventure-worksjossef0 | MARIED |
| adventure-worksdylan0 | MARIED |
| adventure-worksdiane1 | SINGLE |
| adventure-worksgigi0 | MARIED |
| adventure-worksmichael6 | MARIED |

#### Set using Case

The AdventureWorks2019 database was used in both languages to obtain the same results.

##### Transact-SQL

##### Simple Case

```sql
CREATE OR ALTER PROCEDURE SetCaseDemoProcedure
AS
    DECLARE @value INT;
    DECLARE @result INT;
    SET @value = 5;

    SET @result =
        CASE @value
            WHEN 1 THEN @value * 10
            WHEN 3 THEN @value * 20
            WHEN 5 THEN @value * 30
            WHEN 7 THEN @value * 40
            ELSE -1
        END;

    RETURN @result
GO

DECLARE @result INT;
EXEC @result = SetCaseDemoProcedure;
PRINT @result;
```

##### Searched Case

```sql
CREATE OR ALTER PROCEDURE SetCaseDemoProcedure
AS
    DECLARE @value INT;
    DECLARE @result INT;
    SET @value = 5;

    SET @result =
        CASE
            WHEN @value = 1 THEN @value * 10
            WHEN @value = 3 THEN @value * 20
            WHEN @value = 5 THEN @value * 30
            WHEN @value = 7 THEN @value * 40
            ELSE -1
        END;

    RETURN @result
GO

DECLARE @result INT;
EXEC @result = SetCaseDemoProcedure;
PRINT @result;
```

##### Result

| result |
| --- |
| 150 |

##### Snowflake Scripting

> **Warning:**
>
> Snowflake Scripting does not allow setting a case expression directly to a variable. Both Transact-SQL Case expression formats translate to the following grammar in Snowflake Scripting.

##### SimpleCase

```sql
CREATE OR REPLACE PROCEDURE SetCaseDemoProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        VALUE INT;
        RESULT INT;
    BEGIN

        VALUE := 5;
        CASE (:VALUE)
            WHEN 1 THEN
                RESULT := :VALUE * 10;
            WHEN 3 THEN
                RESULT := :VALUE * 20;
            WHEN 5 THEN
                RESULT := :VALUE * 30;
            WHEN 7 THEN
                RESULT := :VALUE * 40;
            ELSE
                RESULT := -1;
        END;
        RETURN :RESULT;
    END;
$$;

DECLARE
    RESULT INT;
BEGIN
    CALL SetCaseDemoProcedure();
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Print' NODE ***/!!!
    PRINT @result;
END;
```

##### Searched Case

```sql
CREATE OR REPLACE PROCEDURE SetCaseDemoProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        VALUE INT;
        RESULT INT;
    BEGIN

        VALUE := 5;
        CASE
            WHEN :VALUE = 1 THEN
                RESULT := :VALUE * 10;
            WHEN :VALUE = 3 THEN
                RESULT := :VALUE * 20;
            WHEN :VALUE = 5 THEN
                RESULT := :VALUE * 30;
            WHEN :VALUE = 7 THEN
                RESULT := :VALUE * 40;
            ELSE
                RESULT := -1;
        END;
        RETURN :RESULT;
    END;
$$;

DECLARE
    RESULT INT;
BEGIN
    CALL SetCaseDemoProcedure();
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Print' NODE ***/!!!
    PRINT @result;
END;
```

##### Result

| result |
| --- |
| 150 |

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## CREATE PROCEDURE

Translation reference to convert Transact-SQL CREATE PROCEDURE clauses to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The create procedure statement allows the creation of stored procedures that can:

* Accept input parameters and return multiple values in the form of output parameters to the calling procedure or batch.
* Contain programming statements that perform operations in the database, including calling other procedures.
* Return a status value to a calling procedure or batch to indicate success or failure (and the reason for failure).

For more information, see the [Transact-SQL CREATE PROCEDURE documentation](https://docs.microsoft.com/en-us/sql/t-sql/statements/create-procedure-transact-sql?view=sql-server-ver15).

```sql
CREATE [ OR ALTER ] { PROC | PROCEDURE }
    [schema_name.] procedure_name [ ; number ]
    [ { @parameter [ type_schema_name. ] data_type }
        [ VARYING ] [ = default ] [ OUT | OUTPUT | [READONLY]
    ] [ ,...n ]
[ WITH <procedure_option> [ ,...n ] ]
[ FOR REPLICATION ]
AS { [ BEGIN ] sql_statement [;] [ ...n ] [ END ] }
[;]
```

### Sample Source Patterns

#### Stored procedure without body

A stored procedure without a body is an unusual scenario that is allowed in Transact-SQL. Snowflake Scripting does not allow defining procedures without a body, but the following example shows the equivalence.

##### Transact-SQL

##### Procedure

```sql
CREATE PROC SampleProcedure AS;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE SampleProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      RETURN '';
   END;
$$;
```

#### Basic stored procedure

The following example details a simple stored procedure that will include a new Privacy department into the AdventureWorks2019 database.

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE Add_Privacy_Department
AS
EXECUTE ('INSERT INTO HumanResources.Department VALUES (''Privacy'', ''Executive General and Administration'', default)');
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE Add_Privacy_Department ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE 'INSERT INTO HumanResources.Department VALUES ('Privacy', 'Executive General and Administration', default);';
  END;
$$;
```

#### Alter procedure

The transformation for the ALTER procedure is equivalent to the basic procedure.

##### Transact-SQL

```sql
ALTER PROCEDURE procedureName
AS
SELECT 1 AS ThisDB;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE procedureName ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
DECLARE
ProcedureResultSet RESULTSET;
BEGIN
ProcedureResultSet := (
SELECT 1 AS ThisDB);
RETURN TABLE(ProcedureResultSet);
END;
$$;
```

#### Using parameters

You can use parameters to drive your logic or construct dynamic SQL statements inside your stored procedure. In the following example a simple SetNewPrice stored procedure is constructed, which sets a new product price based on the arguments sent by the caller.

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE SetNewPrice @ProductID INT, @NewPrice MONEY
AS
  BEGIN
    DECLARE @dynSqlStatement AS VARCHAR(300);
    SET @dynSqlStatement = 'UPDATE Production.ProductListPriceHistory SET ListPrice = ' + CAST(@NewPrice AS VARCHAR(10)) + ' WHERE ProductID = ' + CAST(@ProductID AS VARCHAR(10)) + ' AND EndDate IS NULL';
    EXECUTE (@dynSqlStatement);
  END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE SetNewPrice (PRODUCTID INT, NEWPRICE NUMBER(38, 4))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    DYNSQLSTATEMENT VARCHAR(300);
  BEGIN

    DYNSQLSTATEMENT := 'UPDATE Production.ProductListPriceHistory
   SET
      ListPrice = ' || CAST(:NEWPRICE AS VARCHAR(10)) || '
   WHERE
      ProductID = ' || CAST(:PRODUCTID AS VARCHAR(10)) || '
      AND EndDate IS NULL;';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :DYNSQLSTATEMENT;
  END;
$$;
```

#### Output Parameters

Transact-SQL output keyword indicates that the parameter is an output parameter, whose value will be returned to the stored procedure caller. For example, the following procedure will return the number of vacation hours of a specific employee.

##### Transact-SQL

```sql
CREATE PROCEDURE GetVacationHours
   @employeeId INT,
   @vacationHours INT OUTPUT
AS
BEGIN
   SELECT @vacationHours = VacationHours
   FROM HumanResources.Employee
   WHERE NationalIDNumber = @employeeID
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE GetVacationHours (EMPLOYEEID INT, VACATIONHOURS OUT INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      SELECT
         VacationHours
      INTO
         :VACATIONHOURS
      FROM
         HumanResources.Employee
      WHERE
         NationalIDNumber = :EMPLOYEEID;
   END;
$$;
```

#### Optional Parameters

A parameter is considered optional if the parameter has a default value specified when it is declared. It is not necessary to provide a value for an optional parameter in a procedure call.

##### Transact-SQL

```sql
CREATE PROCEDURE OPTIONAL_PARAMETER @VAR1 INT = 1, @VAR2 INT = 2
AS
    BEGIN
        RETURN NULL;
    END

GO

EXEC OPTIONAL_PARAMETER @VAR2 = 4
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE OPTIONAL_PARAMETER (VAR1 INT DEFAULT 1, VAR2 INT DEFAULT 2)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        RETURN NULL;
    END;
$$;

CALL OPTIONAL_PARAMETER(VAR2 => 4);
```

#### EXECUTE AS

Transact-SQL’s EXECUTE AS clause defines the execution context of the stored procedure, specifying which user account the Database Engine uses to validate permissions on objects that are referenced within the procedure. For example, we can modify the previous GetVacationHours procedure to define different execution contexts.

* Owner (default in Snowflake Scripting)

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE GetVacationHours
   @employeeId INT,
   @vacationHours INT OUTPUT
WITH EXECUTE AS OWNER
AS
BEGIN
   SELECT @vacationHours = VacationHours
   FROM HumanResources.Employee
   WHERE NationalIDNumber = @employeeID
END;
```

##### Snowflake Scripting

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "HumanResources.Employee" **
CREATE OR REPLACE PROCEDURE GetVacationHours (EMPLOYEEID INT, VACATIONHOURS OUT INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS OWNER
AS
$$
   BEGIN
      SELECT
         VacationHours
      INTO
         :VACATIONHOURS
      FROM
         HumanResources.Employee
      WHERE
         NationalIDNumber = :EMPLOYEEID;
   END;
$$;
```

#### Caller

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE GetVacationHours
   @employeeId INT,
   @vacationHours INT OUTPUT
WITH EXECUTE AS CALLER
AS
BEGIN
   SELECT @vacationHours = VacationHours
   FROM HumanResources.Employee
   WHERE NationalIDNumber = @employeeID
END;
```

##### Snowflake Scripting

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "HumanResources.Employee" **
CREATE OR REPLACE PROCEDURE GetVacationHours (EMPLOYEEID INT, VACATIONHOURS OUT INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      SELECT
         VacationHours
      INTO
         :VACATIONHOURS
      FROM
         HumanResources.Employee
      WHERE
         NationalIDNumber = :EMPLOYEEID;
   END;
$$;
```

> **Warning:**
>
> SELF and specific user (‘user_name’) execution contexts are not supported in Snowflake Scripting.

#### READONLY AND VARYING PARAMETERS

Snowflake does not support `READONLY` and `VARYING` parameter types, an FDM is added instead.

##### Transact-SQL

```sql
 CREATE OR ALTER PROCEDURE GetVacationHours
   @Param1 INT READONLY,
   @Param2 INT VARYING
AS
BEGIN
   SELECT * FROM Table1;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE GetVacationHours (PARAM1 INT !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'READONLY PARAMETERS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!, PARAM2 INT !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'VARYING PARAMETERS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!)
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      ProcedureResultSet RESULTSET;
   BEGIN
      ProcedureResultSet := (
      SELECT
         *
      FROM
         Table1);
      RETURN TABLE(ProcedureResultSet);
   END;
$$;
```

### Known Issues

#### Unsupported Optional Arguments

* [VARYING] Applies only to **cursor** parameters.Specifies the result set supported as an output parameter. This parameter is dynamically constructed by the procedure and its contents may vary. Snowflake scripting does not support CURSOR as a valid return data type.
* [= default] Makes a parameter optional through the definition of a default value. Snowflake scripting does not natively supports default parameter values.
* [READONLY] Indicates that the parameter cannot be updated or modified within the body of the procedure. Currently unsupported in Snowflake Scripting.
* [WITH RECOMPILE] Forces the database engine to compile the stored procedure’s query plan each time it is executed. Currently unsupported in Snowflake Scripting.
* [WITH ENCRYPTION] Used to encrypt the text of a stored procedure. Only users with access to system tables or database files (such as sysadmin users) will be able to access the procedure text after its creation. Currently unsupported in Snowflake Scripting.
* [FOR REPLICATION] Restricts the stored procedure to be executed only during replication. Currently unsupported in Snowflake Scripting.

### Related EWIs

1. [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.
2. [SSC-EWI-0058](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.

## CURSOR

Translation reference to convert Transact-SQL CURSOR statement to Snowflake Scripting

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Transact-SQL statements produce a complete result set, but there are times when the results are best processed one row at a time. Opening a cursor on a result set allows processing the result set one row at a time. You can assign a cursor to a variable or parameter with a **cursor** data type. For more information, see the [Transact-SQL Cursors documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/cursors-transact-sql?view=sql-server-ver15).

```sql
 //ISO Syntax
DECLARE cursor_name [ INSENSITIVE ] [ SCROLL ] CURSOR
     FOR select_statement
     [ FOR { READ ONLY | UPDATE [ OF column_name [ ,...n ] ] } ]
[;]

//Transact-SQL Extended Syntax
DECLARE cursor_name CURSOR [ LOCAL | GLOBAL ]
     [ FORWARD_ONLY | SCROLL ]
     [ STATIC | KEYSET | DYNAMIC | FAST_FORWARD ]
     [ READ_ONLY | SCROLL_LOCKS | OPTIMISTIC ]
     [ TYPE_WARNING ]
     FOR select_statement
     [ FOR UPDATE [ OF column_name [ ,...n ] ] ]
[;]
```

```sql
 FETCH
          [ [ NEXT | PRIOR | FIRST | LAST
                    | ABSOLUTE { n | @nvar }
                    | RELATIVE { n | @nvar }
               ]
               FROM
          ]
{ { [ GLOBAL ] cursor_name } | @cursor_variable_name }
[ INTO @variable_name [ ,...n ] ]
```

```sql
OPEN { { [ GLOBAL ] cursor_name } | cursor_variable_name }
```

```sql
CLOSE { { [ GLOBAL ] cursor_name } | cursor_variable_name }
```

```sql
DEALLOCATE { { [ GLOBAL ] cursor_name } | @cursor_variable_name }
```

### Sample Source Patterns

#### Transact-SQL

Notice that the following parameters are inherently supported by Snowflake Scripting.

* [LOCAL].
* [FORWARD_ONLY].
* [FAST_FORWARD] Specifies a FORWARD_ONLY (FETCH NEXT only) and READ_ONLY
* [READ_ONLY] the WHERE CURRENT OF does not exist in Snowflake Scripting.

##### Cursor

```sql
CREATE TABLE vEmployee   (
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255),
);

INSERT INTO vEmployee(PersonID, LastName, FirstName)
VALUES
    (1, 'AA', 'A'),
    (2, 'BB', 'B'),
    (3, 'CC', 'C'),
    (4, 'DD', 'D'),
    (5, 'EE', 'E'),
    (6, 'FF', 'F'),
    (7, 'GG', 'G');

CREATE OR ALTER PROCEDURE CursorExample
AS
    DECLARE
        @CursorVar CURSOR,
	@firstName VARCHAR;

    SET @CursorVar = CURSOR LOCAL FORWARD_ONLY STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;

    OPEN @CursorVar;

    FETCH NEXT FROM @CursorVar INTO @firstName;
    FETCH NEXT FROM @CursorVar INTO @firstName;

    CLOSE @CursorVar;

    SELECT @firstName;
GO
```

##### Result

```none
B
```

##### Snowflake Scripting

##### Cursor

```sql
CREATE OR REPLACE TABLE vEmployee (
	PersonID INT,
	LastName VARCHAR(255),
	FirstName VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES
    (1, 'AA', 'A'),
    (2, 'BB', 'B'),
    (3, 'CC', 'C'),
    (4, 'DD', 'D'),
    (5, 'EE', 'E'),
    (6, 'FF', 'F'),
    (7, 'GG', 'G');

CREATE OR REPLACE PROCEDURE CursorExample ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		CURSORVAR CURSOR
		FOR
			SELECT FirstName
			FROM vEmployee;
		FIRSTNAME VARCHAR;
		ProcedureResultSet RESULTSET;
	BEGIN

		OPEN CURSORVAR;
		FETCH
			CURSORVAR
		INTO
			:FIRSTNAME;
		FETCH
			CURSORVAR
		INTO
			:FIRSTNAME;
		CLOSE CURSORVAR;
		ProcedureResultSet := (
		SELECT
			:FIRSTNAME);
		RETURN TABLE(ProcedureResultSet);
	END;
$$;
```

##### Result

```none
B
```

### Known Issues

The following parameters are not supported:

DECLARE CURSOR

* [ GLOBAL ] Allows referencing the cursor name in any stored procedure or batch executed by the connection. Snowflake Scripting only allows the use of the cursor locally.
* [ SCROLL ] Snowflake Scripting only support FETCH NEXT.
* [ KEYSET | DYNAMIC ] If after opening a cursor and update to the table is made, these options may display some of the changes when fetching the cursor, Snowflake scripting only supports STATIC, in other words, after the cursor is opened the changes to the table are not detected by the cursor.
* [SCROLL_LOCKS] Specifies that positioned updates or deletes made through the cursor are guaranteed to succeed, Snowflake Scripting cannot guarantee it.
* [OPTIMISTIC] When an update or delete is made through the cursor it uses comparisons of timestamp column values, or a checksum value if the table has no timestamp column, to determine whether the row was modified after it was read into the cursor. Snowflake Scripting does not have an internal process to replicate it.
* [TYPE_WARNING]

FETCH

* [PRIOR | FIRST | LAST] Snowscripting only support NEXT.
* [ABSOLUTE] Snowflake Scripting only supports NEXT but the behavior can be replicated.
* [RELATIVE] Snowflake Scripting but the behavior can be replicated.
* [ GLOBAL ] Allows referencing the cursor name in any stored procedure or batch executed by the connection. Snowflake Scripting only allows the use of the cursor locally.
* FETCH without INTO is not supported.
* When the FETCH statement is located inside a loop it is considered a complex pattern as it may have an impact on the Snowflake translated code performance. Check the related issues section for more information.

#### Fetch inside loop sample

##### SQL Server

```sql
CREATE OR ALTER PROCEDURE cursor_procedure1
AS
BEGIN
DECLARE cursor1 CURSOR FOR SELECT col1 FROM my_table;
WHILE 1=0
   BEGIN
      FETCH NEXT FROM @cursor1 INTO @variable1;
   END
END;
```

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE cursor_procedure1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   DECLARE
      --** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
      cursor1 CURSOR
      FOR
         SELECT
            col1
         FROM
            my_table;
   BEGIN

      WHILE (1=0) LOOP
         --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
         FETCH
            CURSOR1
            INTO
            :VARIABLE1;
      END LOOP;
   END;
$$;
```

#### OPEN

* [ GLOBAL ] Allows referencing the cursor name in any stored procedure or batch executed by the connection. Snowflake Scripting only allows the use of the cursor locally.

CLOSE

* [ GLOBAL ] Allows referencing the cursor name in any stored procedure or batch executed by the connection. Snowflake Scripting only allows the use of the cursor locally.

DEALLOCATE removes a cursor reference. Snowflake Scripting doesn’t require explicit deallocation because cursors are automatically deallocated when they go out of scope. SnowConvert AI comments out the statement with [SSC-FDM-TS0057](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md).

WHERE CURRENT OF the use of this statement is not supported, for example:

```sql
CREATE OR ALTER PROCEDURE CursorWithCurrent
AS
    DECLARE
        @CursorVar CURSOR;

    SET @CursorVar = CURSOR
	FOR
	SELECT FirstName
	FROM vEmployee;

    OPEN @CursorVar;

    FETCH NEXT FROM @CursorVar;
    FETCH NEXT FROM @CursorVar;

    UPDATE vEmployee SET LastName = 'Changed' WHERE CURRENT OF @CursorVar;

    CLOSE @CursorVar;
GO
```

Environment variables

* @@CURSOR_ROWS
* @@FETCH_STATUS

### Related EWIs

1. [SSC-FDM-TS0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Snowflake Scripting cursor rows are not modifiable.
2. [SSC-PRF-0003](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.

## DECLARE

Translation reference to convert Transact-SQL DECLARE statement to Snowflake Scripting

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Transact-SQL DECLARE statement allows the creation of variables that can be used in the scope of the batch or a stored procedure. For more information, see the [Transact-SQL DECLARE documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/declare-local-variable-transact-sql?view=sql-server-ver15).

```sql
-- Syntax for SQL Server and Azure SQL Database

DECLARE
{
    { @local_variable [AS] data_type  [ = value ] }
  | { @cursor_variable_name CURSOR }
} [,...n]
| { @table_variable_name [AS] <table_type_definition> }

<table_type_definition> ::=
     TABLE ( { <column_definition> | <table_constraint> } [ ,...n] )

<column_definition> ::=
     column_name { scalar_data_type | AS computed_column_expression }
     [ COLLATE collation_name ]
     [ [ DEFAULT constant_expression ] | IDENTITY [ (seed ,increment ) ] ]
     [ ROWGUIDCOL ]
     [ <column_constraint> ]

<column_constraint> ::=
     { [ NULL | NOT NULL ]
     | [ PRIMARY KEY | UNIQUE ]
     | CHECK ( logical_expression )
     | WITH ( <index_option > )
     }

<table_constraint> ::=
     { { PRIMARY KEY | UNIQUE } ( column_name [ ,...n] )
     | CHECK ( search_condition )
     }
```

### Sample Source Patterns

#### Declare variables

Variables can be created in different ways. Variables may or may not have a default value and several variables can be declared in the same line.

Notice that Snowflake Scripting does not allow creating more than one variable per line.

##### Transact-SQL

```sql
DECLARE @find VARCHAR(30);
DECLARE @find2 VARCHAR(30) = 'Default';
DECLARE @var VARCHAR(5), @var2 varchar(5);
```

##### Snowflake Scripting

```sql
DECLARE
    FIND VARCHAR(30);
    FIND2 VARCHAR(30) := 'Default';
    VAR VARCHAR(5);
    VAR2 VARCHAR(5);
BEGIN
    RETURN '';
END;
```

#### Declare table variables

Transact-SQL allows the creation of table variables that can be used as regular tables. Snowflake scripting does not support this, instead, a table can be created and then dropped at the end of the procedure.

##### Transact-SQL

```sql
DECLARE @MyTableVar TABLE(
    column1 varchar(10));
```

##### Snowflake Scripting

```sql
BEGIN
    DECLARE
        T_MYTABLEVAR TABLE(
            column1 VARCHAR(10));
END;
```

#### DECLARE statement outside routines (functions and procedures)

Unlike Transact-SQL, Snowflake does not support executing isolated statements like DECLARE outside routines like functions or procedures. For this scenario, the statement should be encapsulated in an anonymous block, as shown in the following examples. This statement is usually used before a `SET STATEMENT`.

##### Transact-SQL

```sql
DECLARE @Group nvarchar(50), @Sales MONEY;
SET @Group = N'North America';
SET @Sales = 2000000;
```

##### Snowflake Scripting

```sql
DECLARE
    _GROUP VARCHAR(50);
    SALES NUMBER(38, 4);
BEGIN
    _GROUP := 'North America';
    SALES := 2000000;
END;
```

If there is a scenario with only DECLARE statements, the BEGIN…END block should have a RETURN NULL statement to avoid errors, since this block can’t be empty.

##### Transact-SQL

```sql
DECLARE @Group nvarchar(50), @Sales MONEY;
```

##### Snowflake Scripting

```sql
DECLARE
    _GROUP VARCHAR(50);
    SALES NUMBER(38, 4);
BEGIN
    RETURN '';
END;
```

## EXECUTE

Translation reference to convert Transact-SQL Execute statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Transact-SQL EXECUTE statement allows the execution of a command string or character string within a Transact-SQL batch, a scalar-valued user-defined function, or a stored procedure. For more information, see the [Transact-SQL EXECUTE documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/execute-transact-sql?view=sql-server-ver15).

```sql
 -- Execute a character string
{ EXEC | EXECUTE }
    ( { @string_variable | [ N ]'tsql_string' } [ + ...n ] )
    [ AS { LOGIN | USER } = ' name ' ]
[;]

-- Execute a stored procedure or function
[ { EXEC | EXECUTE } ]
    {
      [ @return_status = ]
      { module_name [ ;number ] | @module_name_var }
        [ [ @parameter = ] { value
                           | @variable [ OUTPUT ]
                           | [ DEFAULT ]
                           }
        ]
      [ ,...n ]
      [ WITH <execute_option> [ ,...n ] ]
    }
[;]
```

### Sample Source Patterns

#### Execution of character string

EXECUTE can be used to perform SQL operations passed directly as literals. In the following example it is used within a stored procedure that will insert a new privacy department into the AdventureWorks2019 database.

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE AddPrivacyDepartment
AS
EXECUTE ('INSERT INTO HumanResources.Department VALUES (''Privacy'', ''Executive General and Administration'', default)');
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE AddPrivacyDepartment ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
BEGIN
!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
EXECUTE IMMEDIATE 'INSERT INTO HumanResources.Department VALUES ('Privacy', 'Executive General and Administration', default);';
END;
$$;
```

#### Execution of stored procedure

EXECUTE can also be used to call an existing stored procedure. The following example will call the AddPrivacyDepartment procedure that was created above. It will then run a SELECT to verify that the new department was successfully included.

##### Transact-SQL

```sql
EXECUTE AddPrivacyDepartment;
SELECT DepartmentID, Name, GroupName FROM HumanResources.Department;
```

##### Result

| DepartmentID | Name | GroupName | ModifiedDate |
| --- | --- | --- | --- |
| 1 | Engineering | Research and Development | 2008-04-30 00:00:00.000 |
| 2 | Tool Design | Research and Development | 2008-04-30 00:00:00.000 |
| 3 | Sales | Sales and Marketing | 2008-04-30 00:00:00.000 |
| 4 | Marketing | Sales and Marketing | 2008-04-30 00:00:00.000 |
| 5 | Purchasing | Inventory Management | 2008-04-30 00:00:00.000 |
| 6 | Research and Development | Research and Development | 2008-04-30 00:00:00.000 |
| 7 | Production | Manufacturing | 2008-04-30 00:00:00.000 |
| 8 | Production Control | Manufacturing | 2008-04-30 00:00:00.000 |
| 9 | Human Resources | Executive General and Administration | 2008-04-30 00:00:00.000 |
| 1 0 | Finance | Executive General and Administration | 2008-04-30 00:00:00.000 |
| 1 1 | Information Services | Executive General and Administration | 2008-04-30 00:00:00.000 |
| 1 2 | Document Control | Quality Assurance | 2008-04-30 00:00:00.000 |
| 1 3 | Quality Assurance | Quality Assurance | 2008-04-30 00:00:00.000 |
| 1 4 | Facilities and Maintenance | Executive General and Administration | 2008-04-30 00:00:00.000 |
| 1 5 | Shipping and Receiving | Inventory Management | 2008-04-30 00:00:00.000 |
| 1 6 | Executive | Executive General and Administration | 2008-04-30 00:00:00.000 |
| 1 7 | Privacy | Executive General and Administration | 2021-11-17 12:42:54.640 |

##### Snowflake Scripting

```sql
 CALL AddPrivacyDepartment();

SELECT
DepartmentID,
Name,
GroupName
FROM
HumanResources.Department;
```

##### Result

| DEPARTMENTID | NAME | GROUPNAME | MODIFIEDDATE |
| --- | --- | --- | --- |
| 1 | Engineering | Research and Development | 2021-11-17 10:29:36.963 |
| 2 | Tool Design | Research and Development | 2021-11-17 10:29:37.463 |
| 3 | Sales | Sales and Marketing | 2021-11-17 10:29:38.192 |
| 4 | Marketing | Sales and Marketing | 2021-11-17 10:29:38.733 |
| 5 | Purchasing | Inventory Management | 2021-11-17 10:29:39.298 |
| 6 | Research and Development | Research and Development | 2021-11-17 10:31:53.770 |
| 7 | Production | Manufacturing | 2021-11-17 10:31:55.082 |
| 8 | Production Control | Manufacturing | 2021-11-17 10:31:56.638 |
| 9 | Human Resources | Executive General and Administration | 2021-11-17 10:31:57.507 |
| 10 | Finance | Executive General and Administration | 2021-11-17 10:31:58.473 |
| 11 | Information Services | Executive General and Administration | 2021-11-17 10:34:35.200 |
| 12 | Document Control | Quality Assurance | 2021-11-17 10:34:35.741 |
| 13 | Quality Assurance | Quality Assurance | 2021-11-17 10:34:36.277 |
| 14 | Facilities and Maintenance | Executive General and Administration | 2021-11-17 10:34:36.832 |
| 15 | Shipping and Receiving | Inventory Management | 2021-11-17 10:34:37.373 |
| 16 | Executive | Executive General and Administration | 2021-11-17 10:34:37.918 |
| 17 | Privacy | Executive General and Administration | 2021-11-17 10:46:43.345 |

#### Execution of local variable and use of parameters

A common use case for the EXECUTE statement is when dynamic SQL statements are needed. In this cases instead of executing a string literal, the statement could be constructed dynamically and assigned to a local variable, which will then be executed. A set of arguments can be sent to the called stored procedure to construct the dynamic SQL command.

In the following example a simple SetNewPrice stored procedure is constructed, which uses the EXECUTE statement to set a new product price based on the arguments sent by the caller. Lastly a SELECT is performed to confirm the new product price.

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE SetNewPrice @ProductID INT, @NewPrice MONEY
AS
  DECLARE @dynSqlStatement AS VARCHAR(300);
  SET @dynSqlStatement = 'UPDATE Production.ProductListPriceHistory SET ListPrice = ' + CAST(@NewPrice AS VARCHAR(10)) + ' WHERE ProductID = ' + CAST(@ProductID AS VARCHAR(10)) + ' AND EndDate IS NULL';
  EXECUTE (@dynSqlStatement);
GO

EXECUTE Set_New_Price @ProductID = 707, @NewPrice = 34.99;
SELECT ListPrice FROM Production.ProductListPriceHistory WHERE ProductID = 707 AND EndDate IS NULL;
```

##### Result

| ListPrice |
| --- |
| 34.9900 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE SetNewPrice (PRODUCTID INT, NEWPRICE NUMBER(38, 4))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    DYNSQLSTATEMENT VARCHAR(300);
  BEGIN

    DYNSQLSTATEMENT := 'UPDATE Production.ProductListPriceHistory
   SET
      ListPrice = ' || CAST(:NEWPRICE AS VARCHAR(10)) || '
   WHERE
      ProductID = ' || CAST(:PRODUCTID AS VARCHAR(10)) || '
      AND EndDate IS NULL;';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :DYNSQLSTATEMENT;
  END;
$$;

CALL Set_New_Price(707, 34.99);

SELECT
  ListPrice
FROM
  Production.ProductListPriceHistory
WHERE
  ProductID = 707 AND EndDate IS NULL;
```

##### Result

| LISTPRICE |
| --- |
| 34.9900 |

### Known Issues

#### Using return codes

Transact-SQL EXECUTE syntax contains the @return_status optional argument, which allows creating a scalar variable to store the return status of a scalar-valued user defined function.

It can also be used in stored procedures although the returning status will be limited to integer data type.

To represent this functionality, we could slightly modify the above example and create a user defined function to calculate the new product price as an average of the historical prices. Instead of passing it to the stored procedure, we could now call the CalculateAveragePrice function to obtain the new price, and store it in the return variable to construct the dynamic SQL.

##### Transact-SQL

##### Execute

```sql
CREATE OR ALTER FUNCTION CalculateAveragePrice(@pid INT)
RETURNS MONEY
AS
BEGIN
  DECLARE @average AS MONEY;
  SELECT @average = AVG(LISTPRICE) FROM Production.ProductListPriceHistory WHERE ProductID = @pid;
  RETURN @average;
END;
GO

CREATE OR ALTER PROCEDURE SetNewPrice @ProductID INT
AS
  DECLARE @averageHistoricalPrice MONEY;
  EXECUTE @averageHistoricalPrice = [dbo].Calculate_Average_Price @pid=@ProductID;
  UPDATE Production.ProductListPriceHistory SET ListPrice = @averageHistoricalPrice WHERE ProductID =  @ProductID AND EndDate IS NULL;
GO

EXECUTE Set_New_Price @ProductID = 707;
SELECT ListPrice FROM Production.ProductListPriceHistory WHERE ProductID = 707 AND EndDate IS NULL;
```

##### Result

| ListPrice |
| --- |
| 34.0928 |

##### Snowflake Scripting

```sql
CREATE OR REPLACE FUNCTION CalculateAveragePrice (PID INT)
RETURNS NUMBER(38, 4)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
$$
  WITH CTE1 AS
  (
    SELECT
      AVG(LISTPRICE) AS AVERAGE FROM
      Production.ProductListPriceHistory
    WHERE
      ProductID = PID
  )
  SELECT
    AVERAGE
  FROM
    CTE1
$$;

CREATE OR REPLACE PROCEDURE SetNewPrice (PRODUCTID INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    AVERAGEHISTORICALPRICE NUMBER(38, 4);
  BEGIN

    CALL dbo.Calculate_Average_Price(:PRODUCTID);
    UPDATE Production.ProductListPriceHistory
      SET
        ListPrice = :AVERAGEHISTORICALPRICE
      WHERE
        ProductID = :PRODUCTID
        AND EndDate IS NULL;
  END;
$$;

CALL Set_New_Price(707);

SELECT
  ListPrice
FROM
  Production.ProductListPriceHistory
WHERE
  ProductID = 707 AND EndDate IS NULL;
```

#### Unsupported Optional arguments

* @return_status
* ;number
* @module__name_v_ar
* WITH RECOMPILE, WITH RESULT SETS NONE, WITH <result set definition>

### Related EWIs

1. [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.

## IF

Translation reference to convert Transact-SQL IF..ELSE clauses to Snowflake Scripting

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The IF clause allows an SQL statement or a block of statements to be conditionally executed as long as the Boolean expression is true; otherwise, the statements in the optional ELSE clause will be executed. Transact-SQL also supports embedding multiple IF… ELSE clauses in case multiple conditions are required, or the CASE clause can also be used.

For more information, see the [Transact-SQL IF…ELSE documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/if-else-transact-sql?view=sql-server-ver15).

```sql
 IF Boolean_expression
     { sql_statement | statement_block }
[ ELSE
     { sql_statement | statement_block } ]
```

Note: To define a statement block, use the control-of-flow keywords `BEGIN` and `END`.

### Sample Source Patterns

#### Transact-SQL

The following code refers to an IF… ELSE in Transact-SQL that conditions the variable @value to identify if it is less than 5, if it is between 5 and 10, or if it has any other value. Since @value is initialized as 7, the second condition must be true and the result must be 200.

##### IF…ELSE

```sql
CREATE OR ALTER PROCEDURE IfElseDemoProcedure
AS
    DECLARE @value INT;
    SET @value = 7;

    IF @value < 5
        SET @value = 100;
    ELSE IF @value >= 5 AND @value < 10
        BEGIN
            SET @value = 300;
            SET @value = @value - 100;
        END;
    ELSE
        SET @value = -1;

    RETURN @value
GO

DECLARE @result INT;
EXEC @result = IfElseDemoProcedure;
PRINT @result;
```

##### Result

| result |
| --- |
| 200 |

##### Snowflake Scripting

> **Note:**
>
> Notice that in Snowflake Scripting, the embedded IF… ELSE condition is called ELSEIF.
>
> Besides, the Boolean condition is encapsulated in parentheses and the clause always ends with the END IF expression.
>
> In addition, in Snowflake Scripting it is not necessary to use the BEGIN and END keywords to define a statement block, however it can be used if required.

##### IF…ELSE

```sql
CREATE OR REPLACE PROCEDURE IfElseDemoProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        VALUE INT;
    BEGIN

        VALUE := 7;
        IF (:VALUE < 5) THEN
            VALUE := 100;
        ELSEIF (:VALUE >= 5 AND :VALUE < 10) THEN
            BEGIN
                VALUE := 300;
                VALUE := :VALUE - 100;
            END;
        ELSE
            VALUE := -1;
        END IF;
        RETURN :VALUE;
    END;
$$;

DECLARE
    RESULT INT;
BEGIN
    CALL IfElseDemoProcedure();
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Print' NODE ***/!!!
    PRINT @result;
END;
```

##### Result

| result |
| --- |
| 200 |

#### IF statement outside routines (functions and procedures)

Unlike Transact-SQL, Snowflake does not support executing isolated statements like IF…ELSE outside routines like functions or procedures. For this scenario, the statement should be encapsulated in an anonymous block, as shown in the following example.
You can read more about how to correctly return the output values in the [SELECT section](transact-select.md).

##### Transact-SQL

```sql
DECLARE @maxWeight FLOAT, @productKey INTEGER
SET @maxWeight = 100.00
SET @productKey = 424
IF @maxWeight <= 99
    SELECT @productKey,  'This product is too heavy to ship and is only available for pickup.'
ELSE
    SELECT @productKey, 'This product is available for shipping or pickup.'
```

##### Snowflake Scripting

```sql
DECLARE
    MAXWEIGHT FLOAT;
    PRODUCTKEY INTEGER;
    BlockResultSet1 VARCHAR;
    BlockResultSet2 VARCHAR;
    return_arr ARRAY := array_construct();
BEGIN
    MAXWEIGHT := 100.00;
    PRODUCTKEY := 424;
    IF (:MAXWEIGHT <= 99) THEN
        BlockResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
        CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:BlockResultSet1) AS
            SELECT
                :PRODUCTKEY,  'This product is too heavy to ship and is only available for pickup.';
        return_arr := array_append(return_arr, :BlockResultSet1);
    ELSE
        BlockResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
        CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:BlockResultSet2) AS
            SELECT
                :PRODUCTKEY, 'This product is available for shipping or pickup.';
        return_arr := array_append(return_arr, :BlockResultSet2);
    END IF;
    --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
    RETURN return_arr;
END;
```

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.

## LABEL and GOTO

Translation reference for LABEL and GOTO statements in Transact-SQL.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

T-SQL supports `GOTO` for unconditional jumps to labeled statements within a procedure. Snowflake Scripting does not support `GOTO` or labeled jump targets natively.

When GOTO/Label patterns appear inside a stored procedure with **forward-only** jumps to **top-level** labels, SnowConvert AI automatically transforms them using a **nested procedure decomposition** approach. The procedure body is split into sections at each label declaration, and each section becomes its own nested procedure — code before the first label is placed in a nested procedure called `SC_PROCESS`, while each labeled section becomes a nested procedure named after its label. Every `GOTO label` is then replaced with `CALL label(); RETURN 'PROCESS FINISHED';`, which transfers control to the target section and exits the current one. To preserve sequential execution when no `GOTO` is taken, each section automatically calls the next one at its end (fall-through). All local variable declarations are moved up to the parent procedure’s `DECLARE` block so that every nested procedure can access them through Snowflake’s lexical scoping. Any `RETURN @value` inside a label section is translated to an assignment `SC_EXIT_CODE := :expr;`, and the outer procedure body simply calls `SC_PROCESS()` and then returns the exit code.

When the pattern **cannot** be transformed, the original `GOTO` and label statements are preserved with EWI markers. This happens with **backward GOTOs** (where the target label appears before the GOTO in source order, which would require recursive nested calls), **GOTO/Label in anonymous blocks or UDFs** (which do not support nested procedure definitions), and **labels inside nested control flow** such as `IF`, `WHILE`, or `TRY` blocks (which cannot be extracted into top-level nested procedures).

### Sample Source Patterns

#### Forward GOTO with single label (transformed)

A common T-SQL pattern uses `GOTO` to skip to a cleanup or exit section when an error is detected. SnowConvert AI transforms this by wrapping the main logic and the cleanup label into separate nested procedures.

##### Transact-SQL

```sql
CREATE PROCEDURE dbo.ValidateOrderInput
AS
BEGIN
    DECLARE @ErrorCode INT = 0
    IF @ErrorCode = 0
    BEGIN
        SET @ErrorCode = 1
        GOTO Cleanup
    END
    SET @ErrorCode = -1
Cleanup:
    RETURN @ErrorCode
END
```

##### Snowflake SQL

```sql
CREATE OR REPLACE PROCEDURE dbo.ValidateOrderInput ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SC_EXIT_CODE VARCHAR;
    ERRORCODE INT := 0;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        IF (:ERRORCODE = 0) THEN
          BEGIN
            ERRORCODE := 1;
            BEGIN
              CALL Cleanup();
              RETURN 'PROCESS FINISHED';
            END;
          END;
        END IF;
        ERRORCODE := -1;
        CALL Cleanup();
      END;
    Cleanup PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        SC_EXIT_CODE := :ERRORCODE;
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN :SC_EXIT_CODE;
  END;
$$;
```

#### Multiple labels with fall-through (transformed)

When a procedure has multiple labeled sections, SnowConvert AI preserves sequential fall-through by having each nested procedure call the next one at its end. A `GOTO` can also skip ahead to any label, bypassing intermediate sections.

##### Transact-SQL

```sql
CREATE PROCEDURE dbo.ProcessShipment @Status VARCHAR(100) OUTPUT
AS
BEGIN
    SET @Status = 'Received'
    IF @Status = 'skip' GOTO Ship
Validate:
    SET @Status = 'Validated'
Pack:
    SET @Status = 'Packed'
Ship:
    SET @Status = 'Shipped'
    RETURN 0
END
```

##### Snowflake SQL

```sql
CREATE OR REPLACE PROCEDURE dbo.ProcessShipment (STATUS OUT STRING)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SC_EXIT_CODE VARCHAR;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        STATUS := 'Received';
        IF (:STATUS = 'skip') THEN
          BEGIN
            CALL Ship();
            RETURN 'PROCESS FINISHED';
          END;
        END IF;
        CALL Validate();
      END;
    Validate PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        STATUS := 'Validated';
        CALL Pack();
      END;
    Pack PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        STATUS := 'Packed';
        CALL Ship();
      END;
    Ship PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        STATUS := 'Shipped';
        SC_EXIT_CODE := 0;
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN :SC_EXIT_CODE;
  END;
$$;
```

#### GOTO inside nested IF (transformed)

`GOTO` statements inside nested `IF` or `BEGIN...END` blocks are also transformed. The `CALL`/`RETURN` pair exits from any depth of nesting, effectively reproducing the jump-out behavior of the original `GOTO`.

##### Transact-SQL

```sql
CREATE PROCEDURE dbo.ApproveExpenseReport @ManagerApproved INT, @BudgetAvailable INT
AS
BEGIN
    DECLARE @ApprovalStatus INT = 0
    IF @ManagerApproved = 1
    BEGIN
        IF @BudgetAvailable = 1
            GOTO Finalize
        SET @ApprovalStatus = 1
    END
    SET @ApprovalStatus = 2
Finalize:
    RETURN 0
END
```

##### Snowflake SQL

```sql
CREATE OR REPLACE PROCEDURE dbo.ApproveExpenseReport (MANAGERAPPROVED INT, BUDGETAVAILABLE INT)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SC_EXIT_CODE VARCHAR;
    APPROVALSTATUS INT := 0;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        IF (:MANAGERAPPROVED = 1) THEN
          BEGIN
            IF (:BUDGETAVAILABLE = 1) THEN
              BEGIN
                CALL Finalize();
                RETURN 'PROCESS FINISHED';
              END;
            END IF;
            APPROVALSTATUS := 1;
          END;
        END IF;
        APPROVALSTATUS := 2;
        CALL Finalize();
      END;
    Finalize PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        SC_EXIT_CODE := 0;
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN :SC_EXIT_CODE;
  END;
$$;
```

#### Backward GOTO — not transformed (EWI kept)

When a `GOTO` targets a label that appears *before* the `GOTO` in the source (a backward jump), SnowConvert AI cannot apply the nested procedure decomposition because it would require recursive calls, which Snowflake does not support for nested procedures. In these cases, the `GOTO` and label are preserved with EWI markers for manual resolution.

##### Transact-SQL

```sql
CREATE PROCEDURE dbo.RetryDatabaseConnection
AS
BEGIN
    DECLARE @Attempts INT = 0
RetryConnection:
    SET @Attempts = @Attempts + 1
    IF @Attempts < 3
        GOTO RetryConnection
    RETURN 0
END
```

##### Snowflake SQL

```sql
CREATE OR REPLACE PROCEDURE dbo.RetryDatabaseConnection ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ATTEMPTS INT := 0;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0045 - LABELED STATEMENT IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
    RetryConnection:
    ATTEMPTS := :ATTEMPTS + 1;
    IF (:ATTEMPTS < 3) THEN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TS0087 - GOTO IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      GOTO RetryConnection
    END IF;
    RETURN 0;
  END;
$$;
```

#### LABEL and GOTO outside stored procedures (not transformed)

When `GOTO` and labels appear in batch scripts outside of a stored procedure, the nested procedure decomposition cannot be applied because Snowflake anonymous blocks do not support nested procedure definitions. The statements are preserved with EWI markers.

##### Transact-SQL

```sql
CREATE TABLE AuditLog(EventID INT);
GOTO InsertSecond
InsertFirst:
    INSERT INTO AuditLog VALUES (1);
InsertSecond:
    INSERT INTO AuditLog VALUES (2);
```

##### Snowflake Scripting

```sql
BEGIN
    CREATE OR REPLACE TABLE AuditLog (
        EventID INT
    );
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0087 - GOTO IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
    GOTO InsertSecond;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0045 - LABELED STATEMENT IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
    InsertFirst:
    INSERT INTO AuditLog VALUES (1);

    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0045 - LABELED STATEMENT IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
    InsertSecond:
    INSERT INTO AuditLog VALUES (2);
END;
```

### Related EWIs

1. [SSC-EWI-TS0045](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Labeled Statement is not supported in Snowflake Scripting.
2. [SSC-EWI-TS0087](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): GOTO is not supported in Snowflake.
3. [SSC-EWI-TS0103](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): GOTO targeting a label inside a nested block is not supported in Snowflake.
4. [SSC-FDM-TS0055](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Label statement commented out (no GOTO references the label).

## OUTPUT PARAMETERS

This article is about the current transformation of the output parameters and how their functionality is being emulated.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

An **output parameter** is a parameter whose value is passed out of the stored procedure, back to the calling SQL block. Since the output parameters are not supported by Snowflake Scripting, a solution has been implemented to emulate their functionality.

### Sample Source Patterns

#### Single OUT parameter

The most basic scenario for OUT parameters is when the procedure only has one. In this case, we simply return the OUT parameter at the end of the procedure body.

The EXEC procedure has to be translated as well, for this a CALL is created, the parameters are passed without any modifier (“OUT” is removed), and subsequently, an assignment is done so the parameter is associated with it’s respective resulting value.

##### Transact-SQL

```sql
 -- Procedure with output parameter
CREATE PROCEDURE dbo.outmain
@name VARCHAR (255) OUTPUT
AS
SET @name = 'Jane';

GO

-- Auxiliary procedure that calls the main procedure
CREATE PROCEDURE dbo.outaux
AS
DECLARE @name VARCHAR (255);
EXEC dbo.outmain
    @name = @name OUTPUT;
```

##### Snowflake Scripting

```sql
 -- Procedure with output parameter
CREATE OR REPLACE PROCEDURE dbo.outmain (NAME OUT STRING)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        NAME := 'Jane';
    END;
$$;

-- Auxiliary procedure that calls the main procedure
CREATE OR REPLACE PROCEDURE dbo.outaux ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        NAME VARCHAR(255);
    BEGIN

        CALL dbo.outmain(:NAME);
    END;
$$;
```

#### Multiple OUT parameters

When more than one OUT parameters are found, the RETURNS clause of the procedure changes to VARIANT. This is to accommodate the OBJECT_CONSTRUCT that is going to be used to store the values of the OUT parameters.

On top of that, a RETURN statement is added to the end of the procedure’s body. This is where the OBJECT_COSNTRUCT is created and all the OUT parameter values are stored within it. This object will then be used by the caller to assign the parameters value to the corresponding result.

##### Transact-SQL

```sql
CREATE OR ALTER PROCEDURE basicProc (
    @col1 INT OUT,
    @col2 VARCHAR(10) OUT
) AS
BEGIN
    SET @col1 = 4;
    SET @col2 = 'test';
END;

GO

CREATE OR ALTER PROCEDURE basicProcCall AS
BEGIN
    DECLARE @var1 INT = 0;
    DECLARE @var2 VARCHAR(10) = 'EMPTY';

    EXEC basicProc @var1 OUT, @var2 OUT;
    INSERT INTO TABLE1(col1, col2) VALUES (@var1, @var2);
END;

GO

EXEC basicProcCall;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE basicProc (COL1 OUT INT, COL2 OUT STRING)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        COL1 := 4;
        COL2 := 'test';
    END;
$$;

CREATE OR REPLACE PROCEDURE basicProcCall ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        VAR1 INT := 0;
        VAR2 VARCHAR(10) := 'EMPTY';
    BEGIN

        CALL basicProc(:VAR1, :VAR2);
        INSERT INTO TABLE1 (col1, col2) VALUES (:VAR1, :VAR2);
    END;
$$;

CALL basicProcCall();
```

#### OUT parameters and return values

Transact-SQL allows procedures to have return values. When a procedure has both a return value and OUT parameter(s), a similar approach to the Multiple OUT parameters scenario is followed. The original return value is treated as an OUT parameter would be treated, so it’s stored within the OBJECT_CONSTRUCT and extracted inside the caller procedure.

##### Transact-SQL

```sql
 -- Procedure with multiple output parameters
CREATE PROCEDURE dbo.outmain
@name VARCHAR (255) OUTPUT
AS
SET @name = 'Jane';
RETURN 0;

GO

-- Auxiliary procedure that calls the main procedure
CREATE PROCEDURE dbo.outaux
AS
DECLARE @name VARCHAR (255);
DECLARE @returnValue INT;
EXEC @returnValue = dbo.outmain
    @name = @name OUTPUT;
```

##### Snowflake Scripting

##### Query

```sql
 -- Procedure with multiple output parameters
CREATE OR REPLACE PROCEDURE dbo.outmain (NAME OUT STRING)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        NAME := 'Jane';
        RETURN 0;
    END;
$$;

-- Auxiliary procedure that calls the main procedure
CREATE OR REPLACE PROCEDURE dbo.outaux ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        NAME VARCHAR(255);
        RETURNVALUE INT;
    BEGIN

        CALL dbo.outmain(:NAME);
    END;
$$;
```

#### Customer data type OUT parameters

when the output parameter is a custom type, the process is similar to a regular data type.

##### Transact-SQL

```sql
 CREATE PROCEDURE procedure_udtype_out_params(
  @p_employee_id INT,
  @p_phone [dbo].[PhoneNumber] OUTPUT
) AS
BEGIN
  SELECT @p_phone = phone
  FROM employees
  WHERE employee_id = @p_employee_id;
END;
```

##### Snowflake Scripting

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "[dbo].[PhoneNumber]", "employees" **
CREATE OR REPLACE PROCEDURE procedure_udtype_out_params (P_EMPLOYEE_ID INT, P_PHONE OUT VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE DBO.PHONENUMBER IS NOT SUPPORTED IN SNOWFLAKE ***/!!! NOT NULL)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    SELECT
      phone
    INTO
      :P_PHONE
    FROM
      employees
    WHERE
      employee_id = :P_EMPLOYEE_ID;
  END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-EWI-TS0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Data type is not supported in Snowflake.

## SET

Translation reference to convert Transact-SQL SET statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Sets the specified local variable, previously created by using the DECLARE @*local_variable* statement, to the specified value. For more information, see the [Transact-SQL SET documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/set-local-variable-transact-sql?view=sql-server-ver15).

There are four SET cases that are the following:

```sql
SET
{ @local_variable
    [ . { property_name | field_name } ] = { expression | udt_name { . | :: } method_name }
}
|
{ @SQLCLR_local_variable.mutator_method
}
|
{ @local_variable
    {+= | -= | *= | /= | %= | &= | ^= | |= } expression
}
|
  { @cursor_variable =
    { @cursor_variable | cursor_name
    | { CURSOR [ FORWARD_ONLY | SCROLL ]
        [ STATIC | KEYSET | DYNAMIC | FAST_FORWARD ]
        [ READ_ONLY | SCROLL_LOCKS | OPTIMISTIC ]
        [ TYPE_WARNING ]
    FOR select_statement
        [ FOR { READ ONLY | UPDATE [ OF column_name [ ,...n ] ] } ]
      }
    }
}
```

### Sample Source Patterns

#### Transact-SQL

##### Case 1

```sql
CREATE OR ALTER PROCEDURE SetProcedure
AS
    DECLARE @MyCounter INT;
    DECLARE @FloatCounter FLOAT;

    --Numerical operators
    SET @MyCounter = 3;
    SET @MyCounter += 1;  --@MyCounter has 4
    SET @MyCounter -= 1;  --@MyCounter has 3
    SET @MyCounter *= 2;  --@MyCounter has 6

    SET @MyCounter /= 3;  --@MyCounter has 2
    SET @MyCounter = 6;
    SET @MyCounter /= 5;  --@MyCounter has 1
    SET @MyCounter = 6;
    SET @MyCounter /= 7;  --@MyCounter has 0
    SET @FloatCounter = 10;
    SET @FloatCounter /= 4;  --@FloatCounter has 2.5

    SET @MyCounter = 6;
    SET @MyCounter %= 4;  --@MyCounter has 2

    --Logical operators
    SET @MyCounter &= 3;  --@MyCounter has 2
    SET @MyCounter ^= 2;  --@MyCounter has 0
    SET @MyCounter |= 0;  --@MyCounter has 0

    RETURN @MyCounter;
GO

DECLARE @result INT;
EXEC @result = SetProcedure;
PRINT @result;
```

##### Case 2

```sql
CREATE TABLE vEmployee (
    PersonID int,
    LastName varchar(255),
    FirstName varchar(255)
);

CREATE OR ALTER PROCEDURE SetCursor
AS
    DECLARE @CursorVar CURSOR;

    SET @CursorVar = CURSOR SCROLL DYNAMIC
        FOR
	SELECT LastName, FirstName
	FROM vEmployee
	WHERE LastName like 'B%';
GO
```

##### Result 1

| Result |
| --- |
| 0 |

##### Snowflake Scripting

##### Case 1

```sql
CREATE OR REPLACE PROCEDURE SetProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        MYCOUNTER INT;
        FLOATCOUNTER FLOAT;
    BEGIN

        --Numerical operators
        MYCOUNTER := 3;
        MYCOUNTER := MYCOUNTER + 1;  --@MyCounter has 4

        MYCOUNTER := MYCOUNTER - 1;  --@MyCounter has 3

        MYCOUNTER := MYCOUNTER * 2;  --@MyCounter has 6

        MYCOUNTER := MYCOUNTER / 3;  --@MyCounter has 2

        MYCOUNTER := 6;
        MYCOUNTER := MYCOUNTER / 5;  --@MyCounter has 1

        MYCOUNTER := 6;
        MYCOUNTER := MYCOUNTER / 7;  --@MyCounter has 0

        FLOATCOUNTER := 10;
        FLOATCOUNTER := FLOATCOUNTER / 4;  --@FloatCounter has 2.5

        MYCOUNTER := 6;
        MYCOUNTER := MYCOUNTER % 4;  --@MyCounter has 2

    --Logical operators
        MYCOUNTER := BITAND(MYCOUNTER, 3);  --@MyCounter has 2

        MYCOUNTER := BITXOR(MYCOUNTER, 2);  --@MyCounter has 0

        MYCOUNTER := BITOR(MYCOUNTER, 0);  --@MyCounter has 0

        RETURN :MYCOUNTER;
    END;
$$;

DECLARE
    RESULT INT;
BEGIN
    CALL SetProcedure();
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Print' NODE ***/!!!
    PRINT @result;
END;
```

##### Case 2

```sql
CREATE OR REPLACE TABLE vEmployee (
	PersonID INT,
	LastName VARCHAR(255),
	FirstName VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

CREATE OR REPLACE PROCEDURE SetCursor ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		!!!RESOLVE EWI!!! /*** SSC-EWI-TS0037 - SNOWFLAKE SCRIPTING CURSORS ARE NON-SCROLLABLE, ONLY FETCH NEXT IS SUPPORTED ***/!!!
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CURSORVAR CURSOR
		FOR
			SELECT LastName, FirstName
			FROM vEmployee
			WHERE LastName like 'B%';
	BEGIN

		RETURN '';
	END;
$$;
```

##### Result 1

| Result |
| --- |
| 0 |

#### SET statement outside routines (functions and procedures)

Unlike Transact-SQL, Snowflake does not support executing isolated statements like SET outside routines like functions or procedures. For this scenario, the statement should be encapsulated in an anonymous block, as shown in the following examples. This statement is usually used after a DECLARE STATEMENT.

##### Transact-SQL

```sql
DECLARE @Group nvarchar(50), @Sales MONEY;
SET @Group = N'North America';
SET @Sales = 2000000;
```

##### Snowflake Scripting

```sql
DECLARE
    _GROUP VARCHAR(50);
    SALES NUMBER(38, 4);
BEGIN
    _GROUP := 'North America';
    SALES := 2000000;
END;
```

If there is a scenario with only SET statements, the DECLARE block is not necessary. Probably this scenario will produce runtime errors if there is an attempt of setting a value to a variable that is not declared.

##### Transact-SQL

```sql
SET @Group = N'North America';
```

##### Snowflake Scripting

```sql
BEGIN
    _GROUP := 'North America';
END;
```

### Known Issues

#### 1. SET of a local variable with property name

This type of set is not currently supported by Snowflake scripting.

```sql
 // TSQL custom data type with properties example
DECLARE @p Point;
SET @p.X = @p.X + 1.1;
```

##### 2. SET of a local variable with mutator method

This type of set is not currently supported by Snowflake scripting.

```sql
 // TSQL custom data type with mutator method
SET @p.SetXY(22, 23);
```

### Related EWIs

1. [SSC-EWI-TS0037](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Snowflake Scripting Cursors are non-scrollable.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-FDM-TS0013](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Snowflake Scripting cursor rows are not modifiable.

## TRY CATCH

Translation reference for TRY CATCH statement in Transact-SQL.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Implements error handling for Transact SQL. A group of Transact-SQL statements can be enclosed in a TRY block. If an error occurs in the TRY block, control is usually passed to another group of statements that is enclosed in a CATCH block.

### Sample Source Patterns

The following example details the transformation for TRY CATCH inside procedures.

#### Transact-SQL

```sql
CREATE PROCEDURE ERROR_HANDLING_PROC
AS
BEGIN
    BEGIN TRY
        -- Generate divide-by-zero error.
        SELECT 1/0;
    END TRY
    BEGIN CATCH
        -- Execute error retrieval routine.
        SELECT 'error';
    END CATCH;
END;
```

#### Output

```none
|   error    |
```

##### Snowflake SQL

```sql
CREATE OR REPLACE PROCEDURE ERROR_HANDLING_PROC ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        BEGIN
            -- Generate divide-by-zero error.
            SELECT
                TRUNC( 1/0);
        EXCEPTION
            WHEN OTHER THEN
                -- Execute error retrieval routine.
                SELECT 'error';
        END;
    END;
$$;
```

##### Output

```none
|    error    |
```

#### Try catch outside routines (functions and procedures)

##### Transact-SQL

```sql
 BEGIN TRY
    SELECT 1/0;
END TRY
BEGIN CATCH
    SELECT 'error';
END CATCH;
```

##### Snowflake Scripting

```sql
DECLARE
    BlockResultSet1 VARCHAR;
    BlockResultSet2 VARCHAR;
    return_arr ARRAY := array_construct();
BEGIN
    BEGIN
        BlockResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
        CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:BlockResultSet1) AS
            SELECT
                TRUNC( 1/0);
        return_arr := array_append(return_arr, :BlockResultSet1);
    EXCEPTION
        WHEN OTHER THEN
            BlockResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
            CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:BlockResultSet2) AS
                SELECT 'error';
            return_arr := array_append(return_arr, :BlockResultSet2);
    END;
    --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
    RETURN return_arr;
END;
```

### Related EWIs

1. [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.

## WHILE

Translation reference to convert Transact-SQL While Statement to Snowflake Scripting

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The While statement allows an SQL statement or a block of statements to be repeatedly executed as long as the specified condition is true. The execution of statements in the WHILE loop can be controlled from inside the loop with the `BREAK` and `CONTINUE` keywords.

For more information, see the [Transact-SQL WHILE documentation](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/while-transact-sql?view=sql-server-ver15).

```sql
 WHILE Boolean_expression
     { sql_statement | statement_block | BREAK | CONTINUE }
```

Note: To define a statement block, use the control-of-flow keywords `BEGIN` and `END`.

### Sample Source Patterns

#### Basic source pattern code

##### Transact-SQL

The following code refers to a While Loop in Transact-SQL that iterates the @Iteration variable and controls the flow of the loop to terminate when the value of @Iteration equals 10.

> **Note:**
>
> Statements after the `CONTINUE` keyword will not be executed.

##### While

```sql
CREATE OR ALTER PROCEDURE WhileDemoProcedure
AS
    DECLARE @iteration INT;
    SET @iteration = 1;

    WHILE @iteration < 100
    BEGIN
        IF @iteration = 10
            BREAK;
        ELSE
            BEGIN
                SET @iteration = @iteration + 1;
                CONTINUE;
                SET @iteration = 2 * @iteration;
            END;
    END;
    RETURN @iteration;
GO

DECLARE @result INT;
EXEC @result = WhileDemoProcedure;
PRINT @result;
```

##### Result

| iteration |
| --- |
| 10 |

##### Snowflake Scripting

> **Note:**
>
> As well as Transact-SQL, in Snowflake Scripting the statements after the `CONTINUE` keyword will not be executed.
>
> Notice that in Snowflake Scripting it is not necessary to use the BEGIN and END keywords to define a statement block, however it can be used if required.

##### While

```sql
CREATE OR REPLACE PROCEDURE WhileDemoProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ITERATION INT;
    BEGIN

        ITERATION := 1;
        WHILE (:ITERATION &#x3C; 100) LOOP
            IF (:ITERATION = 10) THEN
                BREAK;
            ELSE
                BEGIN
                    ITERATION := :ITERATION + 1;
                    CONTINUE;
                    ITERATION := 2 * :ITERATION;
                END;
            END IF;
        END LOOP;
        RETURN :ITERATION;
    END;
$$;

DECLARE
    RESULT INT;
BEGIN
    CALL WhileDemoProcedure();
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Print' NODE ***/!!!
    PRINT @result;
END;
```

##### Loop keyword

Snowflake Scripting allows to use `LOOP` keyword instead of `DO` and the `END LOOP` expression instead of `END WHILE` .

```sql
WHILE (Boolean_expression) LOOP
    -- statement or statement block
END LOOP;
```

##### Result

| Iteration |
| --- |
| 10 |

#### While with empty body Source Pattern

##### Transact-SQL

> **Note:**
>
> Please note this example was written while the IF ELSE statement was not supported, the differences in the results should disappear when support for the statement is implemented.

```sql
CREATE OR ALTER PROCEDURE WhileEmptyBodyProc
AS
BEGIN
    DECLARE @MyVar INT;
    SET @MyVar = 1;
    WHILE (@MyVar < 100)
        BEGIN
            IF @MyVar < 50
                SET @MyVar *= 5;
            ELSE
                SET @MyVar *= 3;
        END;
    RETURN @MyVar;
END;

DECLARE @result INT;
EXEC @result = WhileEmptyBodyProc;
PRINT @result;
```

##### Result

| result |
| --- |
| 125 |

##### Snowflake Scripting

This statement can not have an empty body in Snowflake Scripting, to solve this cases a default BREAK statement is added when an empty body is detected.

```sql
CREATE OR REPLACE PROCEDURE WhileEmptyBodyProc ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        MYVAR INT;
        RESULT INT;
    BEGIN
        BEGIN

            MYVAR := 1;
            WHILE (:MYVAR < 100) LOOP
                IF (:MYVAR < 50) THEN
                    MYVAR := MYVAR * 5;
                ELSE
                    MYVAR := MYVAR * 3;
                END IF;
            END LOOP;
            RETURN :MYVAR;
        END;

        CALL WhileEmptyBodyProc();
        !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PRINT' NODE ***/!!!
        PRINT @result;
    END;
$$;
```

##### Result

| result |
| --- |
| 1 |

#### WHILE statement outside routines (functions and procedures)

Unlike Transact-SQL, Snowflake does not support executing isolated statements like WHILE outside routines like functions or procedures. For this scenario, the statement should be encapsulated in an anonymous block, as shown in the following example.

##### Transact-SQL

```sql
DECLARE @iteration INT;
SET @iteration = 1;

WHILE @iteration < 100
BEGIN
    IF @iteration = 10
        BREAK;
    ELSE
        BEGIN
            SET @iteration = @iteration + 1;
            CONTINUE;
            SET @iteration = 2 * @iteration;
        END;
    END;
```

##### Snowflake Scripting

```sql
DECLARE
    ITERATION INT;
BEGIN
    ITERATION := 1;
    WHILE (:ITERATION < 100) LOOP
        IF (:ITERATION = 10) THEN
            BREAK;
        ELSE
            BEGIN
                ITERATION := :ITERATION + 1;
                CONTINUE;
                ITERATION := 2 * :ITERATION;
            END;
        END IF;
    END LOOP;
END;
```

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - SQL Server-Azure Synapse - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-table.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - CREATE TABLE

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

## Basic CREATE TABLE

### Source

```sql
CREATE TABLE [MYSCHEMA].[MYTABLE]
(
    [COL1] INT IDENTITY (1,1) NOT NULL,
    [COL2] INT,
    [COL2 COL3 COL4] VARCHAR,
    [COL VARCHAR_SPANISH] [VARCHAR](20) COLLATE Modern_Spanish_CI_AI DEFAULT 'HOLA',
    [COL VARCHAR_LATIN] [VARCHAR](20) COLLATE Latin1_General_CI_AI DEFAULT 'HELLO'
);
```

### Expected

```sql
CREATE OR REPLACE TABLE MYSCHEMA.MYTABLE
(
    COL1 INT IDENTITY(1,1) ORDER NOT NULL,
    COL2 INT,
    "COL2 COL3 COL4" VARCHAR,
    "COL VARCHAR_SPANISH" VARCHAR(20) COLLATE 'ES-CI-AI' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/ DEFAULT 'HOLA',
    "COL VARCHAR_LATIN" VARCHAR(20) COLLATE 'EN-CI-AI' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/ DEFAULT 'HELLO'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;
```

## TEMPORARY TABLES

In the source code, there can be some table names that start with the character #.

```sql
CREATE TABLE #MyLocalTempTable (
        COL1 INT,
        COL2 INT
);
```

If that is the case, they are transformed into temporary tables in the output code.

Let’s see how the code from above would be migrated.

```sql
CREATE OR REPLACE TEMPORARY TABLE T_MyLocalTempTable (
        COL1 INT,
        COL2 INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

As you can see, **TEMPORARY** was added to the definition of the table, and the character **#** was replaced with **T_**.

Also, all references of the table will be transformed too, to match the new name given to the temporary table.

## NULL and NOT NULL Column Option

`NULL` and `NOT NULL` column options are supported in Snowflake.

### Source

```sql
CREATE TABLE [SCHEMA1].[TABLE1](
	[COL1] [varchar](20) NOT NULL
) ON [PRIMARY]
GO

CREATE TABLE [SCHEMA1].[TABLE2](
	[COL1] [varchar](20) NULL
) ON [PRIMARY]
GO
```

### Expected

```sql
CREATE OR REPLACE TABLE SCHEMA1.TABLE1 (
	COL1 VARCHAR(20) NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

CREATE OR REPLACE TABLE SCHEMA1.TABLE2 (
	COL1 VARCHAR(20) NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## IDENTITY Column Option

For identity columns, a sequence is created and assigned to the column.

### Source

```sql
CREATE TABLE acct3.UnidentifiedCash3 (
UnidentifiedCash_ID3 INT IDENTITY (666, 313) NOT NULL
);
```

### Expected

```sql
CREATE OR REPLACE TABLE acct3.UnidentifiedCash3 (
UnidentifiedCash_ID3 INT IDENTITY(666, 313) ORDER NOT NULL
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;
```

## DEFAULT Column Option

The default Expr is supported in Snowflake, however, in Sql Server it can come together with a constraint Name. Since that part is not supported in Snowflake, it has been removed, and a warning has been added.

### Source

```sql
CREATE TABLE [SCHEMA1].[TABLE1] (
    [COL1] VARCHAR (10) CONSTRAINT [constraintName] DEFAULT ('0') NOT NULL
);
```

### Expected

```sql
CREATE OR REPLACE TABLE SCHEMA1.TABLE1 (
COL1 VARCHAR(10) DEFAULT ('0') /*** SSC-FDM-0012 - CONSTRAINT NAME 'constraintName' IN DEFAULT EXPRESSION CONSTRAINT IS NOT SUPPORTED IN SNOWFLAKE ***/ NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## COLUMN Constraint

### Source

```sql
CREATE TABLE [SalesLT].[Address](
	[AddressID] [int] IDENTITY(1,1) NOT FOR REPLICATION NOT NULL,
	[AddressLine1] [nvarchar](60) NOT NULL,
	[AddressLine2] [nvarchar](60) NULL,
	[City] [nvarchar](30) NOT NULL,
	[StateProvince] [dbo].[Name] NOT NULL,
	[CountryRegion] [dbo].[Name] NOT NULL,
	[PostalCode] [nvarchar](15) NOT NULL,
	[rowguid] [uniqueidentifier] ROWGUIDCOL  NOT NULL,
	[ModifiedDate] [datetime] NOT NULL,
	CONSTRAINT [PK_Address_AddressID] PRIMARY KEY CLUSTERED
	(
		[AddressID] ASC
	)WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY],
	CONSTRAINT [AK_Address_rowguid] UNIQUE NONCLUSTERED
	(
		[rowguid] ASC
	)WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]
```

### Expected

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "[dbo].[Name]" **
CREATE OR REPLACE TABLE SalesLT.Address (
	AddressID INT IDENTITY(1,1) ORDER NOT NULL,
	AddressLine1 VARCHAR(60) NOT NULL,
	AddressLine2 VARCHAR(60) NULL,
	City VARCHAR(30) NOT NULL,
	StateProvince VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE DBO.NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!! NOT NULL,
	CountryRegion VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE DBO.NAME IS NOT SUPPORTED IN SNOWFLAKE ***/!!! NOT NULL,
	PostalCode VARCHAR(15) NOT NULL,
	rowguid VARCHAR
 	               !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'ROWGUIDCOL COLUMN OPTION' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
 	               ROWGUIDCOL  NOT NULL,
	ModifiedDate TIMESTAMP_NTZ(3) NOT NULL,
		CONSTRAINT PK_Address_AddressID PRIMARY KEY (AddressID),
		CONSTRAINT AK_Address_rowguid UNIQUE (rowguid)
	)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
;
```

## COLLATE Column Option

For the Collate transformation, please check the following [link](transact-general-statements.md)

## ENCRYPTED WITH Column Option

The Encrypted With is not supported in Snowflake, so it is being removed, and a warning is added.

### Source

```sql
CREATE TABLE [SCHEMA1].[TABLE1] (
    [COL1] NVARCHAR(60) ENCRYPTED WITH (COLUMN_ENCRYPTION_KEY = MyCEK, ENCRYPTION_TYPE = RANDOMIZED, ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256')
);
```

### Expected

```sql
CREATE OR REPLACE TABLE SCHEMA1.TABLE1 (
    COL1 VARCHAR(60)
--                     --** SSC-FDM-TS0009 - ENCRYPTED WITH NOT SUPPORTED IN SNOWFLAKE **
--                     ENCRYPTED WITH (COLUMN_ENCRYPTION_KEY = MyCEK, ENCRYPTION_TYPE = RANDOMIZED, ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256')
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## NOT FOR REPLICATION

The NOT FOR REPLICATION option is not supported in Snowflake. It is used for the identity that is being migrated to a `SEQUENCE`.

> **Warning:**
>
> Notice that `NOT FOR REPLICATION` is a statement that is not required in Snowflake because it is translated to an equivalent, so it is removed.

### Source

```sql
CREATE TABLE [TABLE1] (
    [COL1] INT IDENTITY (1, 1) NOT FOR REPLICATION NOT NULL
) ON [PRIMARY];
```

### Output

```sql
CREATE OR REPLACE TABLE TABLE1 (
    COL1 INT IDENTITY(1, 1) ORDER NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## ON PRIMARY

The `ON PRIMARY` option is a statement that is used in SQL Server to define on which file an object, e.g. a table, is going to be created. Such as on a primary or secondary file group inside the database. Snowflake provides a different logic and indicates distinct constraints. Please review the following [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/constraints) for more information.

### Source

```sql
CREATE TABLE [TABLE1](
[COL1] [nvarchar](255) COLLATE SQL_Latin1_General_CP1_CI_AS NOT NULL
 CONSTRAINT [pk_dimAddress_AddressId] PRIMARY KEY CLUSTERED ([COL1])
 WITH (PAD_INDEX = OFF, STATISTICS_NORECOMPUTE = OFF, IGNORE_DUP_KEY = OFF, ALLOW_ROW_LOCKS = ON, ALLOW_PAGE_LOCKS = ON, OPTIMIZE_FOR_SEQUENTIAL_KEY = OFF) ON [PRIMARY]
) ON [PRIMARY]
```

### Output

```sql
CREATE OR REPLACE TABLE TABLE1 (
 COL1 VARCHAR(255) COLLATE 'EN-CI-AS' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/ /*** SSC-FDM-TS0002 - COLLATION FOR VALUE CP1 NOT SUPPORTED ***/ NOT NULL
  CONSTRAINT pk_dimAddress_AddressId PRIMARY KEY (COL1)
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/23/2024" }}'
 ;
```

## ASC/DESC Column Sorting

Column sorting is not supported in Snowflake, the `ASC` or `DESC` keywords are being removed.

### Source

```sql
CREATE TABLE [TABLE1](
	[COL1] [int] NOT NULL,
 CONSTRAINT [constraint1] PRIMARY KEY CLUSTERED ([COL1] ASC)
) ON [PRIMARY]
```

### Output

```sql
CREATE OR REPLACE TABLE TABLE1 (
	COL1 INT NOT NULL,
	 CONSTRAINT constraint1 PRIMARY KEY (COL1)
	)
	COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/23/2024" }}'
	;
```

## COMPUTED Columns

Computed columns are supported in Snowflake, we just need to add the explicit data type in order to be able to deploy the table, for example.

### Source

```sql
CREATE TABLE [TABLE1](
	[COL2] [int] NOT NULL,
	[COL2] [int] NOT NULL,
	[COL1] AS (COL3 * COL2),
)
```

### Output

```sql
CREATE OR REPLACE TABLE TABLE1 (
	COL2 INT NOT NULL,
	COL2 INT NOT NULL,
	COL1 VARIANT AS (COL3 * COL2) /*** SSC-FDM-TS0014 - COMPUTED COLUMN WAS TRANSFORMED TO ITS SNOWFLAKE EQUIVALENT, FUNCTIONAL EQUIVALENCE VERIFICATION PENDING. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

If the computed expression cannot transform, a warning is added, and a simple column definition with the expression return type will be used instead, like in the following example:

### Source

```sql
CREATE TABLE [TABLE1](
	[Col1] AS (CONVERT ([XML], ExpressionValue))
)
```

The expression `CONVERT ([NUMERIC], ExpressionValue)` is not supported yet by SnowConvert AI, so, after it is inspected, SnowConvert AI will determine that its type is XML, so the transformation will be

### Output

```sql
CREATE OR REPLACE TABLE TABLE1 (
	Col1 TEXT AS (CAST(ExpressionValue AS VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - XML DATA TYPE CONVERTED TO VARIANT ***/!!!)) /*** SSC-FDM-TS0014 - COMPUTED COLUMN WAS TRANSFORMED TO ITS SNOWFLAKE EQUIVALENT, FUNCTIONAL EQUIVALENCE VERIFICATION PENDING. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

SnowConvert AI will run a process to determine the original expression type in SQL Server. However, the column will have the equivalent target type. In the previous example, the column type in SQLServer was XML, but the target type in Snowflake for storing an XML is TEXT. For more information about data type mapping, check the [data types sections](transact-data-types.md).

## MASKED WITH Column Option

In SQL Server the data masking is used to keep sensitive information from nonprivileged users. Review the [SQL SERVER documentation](https://learn.microsoft.com/en-us/sql/relational-databases/security/dynamic-data-masking?view=sql-server-ver16) for more information. In Snowflake, there is a dynamic data masking functionality but it is available to Enterprise Edition only. Please review the following [Snowflake documentation](https://docs.snowflake.com/en/user-guide/security-column-ddm-use).

### Input

```sql
CREATE TABLE TABLE1
(
	[COL1] [nvarchar](50) MASKED WITH (FUNCTION = 'default()') NULL
);
```

### Output

```sql
CREATE OR REPLACE TABLE TABLE1
(
	COL1 VARCHAR(50)
 	                !!!RESOLVE EWI!!! /*** SSC-EWI-TS0017 - COLUMN MASKING NOT SUPPORTED IN CREATE TABLE ***/!!!
 MASKED WITH (FUNCTION = 'default()') NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## ROWGUIDCOL Column Option

`ROWGUIDCOL` is no applicable in Snowflake. It is used in SQL Server for [UNIQUEIDENTIFIER](https://docs.microsoft.com/en-us/sql/t-sql/data-types/uniqueidentifier-transact-sql?view=sql-server-ver15) types that are currently translated to `VARCHAR`. For example:

### Input

```sql
CREATE TABLE TABLEROWID (
    [ROWGUID] UNIQUEIDENTIFIER ROWGUIDCOL NOT NULL
) ON [PRIMARY];
```

### Output

```sql
CREATE OR REPLACE TABLE TABLEROWID (
    ROWGUID VARCHAR
                    !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'ROWGUIDCOL COLUMN OPTION' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                    ROWGUIDCOL NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

## GENERATED ALWAYS AS ROW START/END Column Option

`ROW START/END` is not supported in Snowflake. An error is added when SnowConvert AI try to transform this kind of column option.

### Input

```sql
CREATE TABLE TABLEROWID (
    [COL1] DATETIME GENERATED ALWAYS AS ROW START NOT NULL
) ON [PRIMARY];
```

### Output

```sql
CREATE OR REPLACE TABLE TABLEROWID (
    COL1 TIMESTAMP_NTZ(3) GENERATED ALWAYS AS ROW START !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'GeneratedClause' NODE ***/!!! NOT NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0036](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.
2. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
3. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
4. [SSC-EWI-TS0017](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Masking not supported.
5. [SSC-FDM-0012](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Constraint in default expression is not supported.
6. [SSC-FDM-TS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): This message is shown when there is a collate clause that is not supported in Snowflake.
7. [SSC-FDM-TS0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Encrypted with not supported in Snowflake.
8. [SSC-FDM-TS0014](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Computed column transformed.
9. [SSC-EWI-TS0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Data type is not supported in Snowflake.
10. [SSC-PRF-0002](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Case-insensitive columns can decrease the performance of queries.

## Azure Synapse Analytics

Translation specification for Azure Synapse Analytics Tables

Applies to

* Azure Synapse Analytics

### Description

This section presents the translation for syntax specific to [Azure Synapse Analytics Tables](https://learn.microsoft.com/en-us/sql/t-sql/statements/create-table-azure-sql-data-warehouse?view=aps-pdw-2016-au7).

#### CREATE TABLE

```sql
CREATE TABLE { database_name.schema_name.table_name | schema_name.table_name | table_name }
(
    { column_name <data_type>  [ <column_options> ] } [ ,...n ]
)
[ WITH ( <table_option> [ ,...n ] ) ]
[;]
```

#### CREATE TABLE AS

```sql
CREATE TABLE { database_name.schema_name.table_name | schema_name.table_name | table_name }
    [ ( column_name [ ,...n ] ) ]
    WITH (
      <distribution_option> -- required
      [ , <table_option> [ ,...n ] ]
    )
    AS <select_statement>
    OPTION <query_hint>
[;]
```

### Source Patterns

#### WITH table options

Azure Synapse Analytics presents an additional syntax for defining table options.

```sql
<table_option> ::=
    {
       CLUSTERED COLUMNSTORE INDEX -- default for Azure Synapse Analytics
      | CLUSTERED COLUMNSTORE INDEX ORDER (column [,...n])
      | HEAP --default for Parallel Data Warehouse
      | CLUSTERED INDEX ( { index_column_name [ ASC | DESC ] } [ ,...n ] ) -- default is ASC
    }
    {
        DISTRIBUTION = HASH ( distribution_column_name )
      | DISTRIBUTION = HASH ( [distribution_column_name [, ...n]] )
      | DISTRIBUTION = ROUND_ROBIN -- default for Azure Synapse Analytics
      | DISTRIBUTION = REPLICATE -- default for Parallel Data Warehouse
    }
    | PARTITION ( partition_column_name RANGE [ LEFT | RIGHT ] -- default is LEFT
        FOR VALUES ( [ boundary_value [,...n] ] ) )
```

Snowflake automatically handles table optimization through mechanisms like micro-partitioning. For this reason, an equivalent syntax for some of these table options does not exist in Snowflake. Therefore, it is not necessary to define some of Transact’s table options.

Table options that will be omitted:

* CLUSTERED COLUMNSTORE INDEX (without column)
* HEAP
* DISTRIBUTION
* PARTITION

`CLUSTERED [ COLUMNSTORE ] INDEX` with columns, will be transformed to Snowflake’s `CLUSTER BY`. A performance review PRF will be added as it is advised to check if defining a CLUSTER KEY is necessary.

##### Transact

```sql
CREATE TABLE my_table (
    enterprise_cif INT,
    name NVARCHAR(100),
    address NVARCHAR(255),
    created_at DATETIME
)
WITH (
    DISTRIBUTION = HASH(enterprise_cif),
    CLUSTERED INDEX (enterprise_cif)
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE my_table (
  enterprise_cif INT,
  name VARCHAR(100),
  address VARCHAR(255),
  created_at TIMESTAMP_NTZ(3)
)
--** SSC-PRF-0007 - PERFORMANCE REVIEW - CLUSTER BY **
CLUSTER BY (enterprise_cif)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/09/2024" }}'
;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-PRF-0007](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): PERFORMANCE REVIEW - CLUSTER BY.

## TEXTIMAGE_ON

Applies to

* SQL Server
> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> Notice that this statement removed from the migration because it is a non-relevant syntax. It means that it is not required in Snowflake.\*\*

### Description

`TEXTIMAGE_ON [PRIMARY]` is a way in Transact to handle the large information groups inside a table. In Snowflake it is not required to define these kinds of characteristics because Snowflake handles large data files or information in a different arrangement.

### Sample

### Source Patterns

Notice that in this example the `TEXTIMAGE_ON [PRIMARY]` has been removed due to the unnecessary syntax.

#### SQL Server

```sql
 CREATE TABLE [dbo].[TEST_Person](
	[date_updated] [datetime] NULL
 ) TEXTIMAGE_ON [PRIMARY]
```

#### Snowflake

```sql
 CREATE OR REPLACE TABLE dbo.TEST_Person (
	date_updated TIMESTAMP_NTZ(3) NULL
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
 ;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - SQL Server-Azure Synapse - CREATE TYPE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-type.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - CREATE TYPE

SnowConvert maps SQL Server alias types (`CREATE TYPE ... FROM base_type`) to Snowflake `CREATE TYPE ... AS base_type`. Table types and other unsupported variants are flagged for manual review.

Applies to

* SQL Server
* Azure Synapse Analytics

## Alias types (`FROM` base type)

`FROM` is rewritten to `AS`, nullable/nullability modifiers on the alias definition are dropped in the Snowflake output, and schema-qualified names are preserved.

**Source (T-SQL):**

```sql
CREATE TYPE EmailAddress FROM VARCHAR(255);
```

**Snowflake equivalent:**

```sql
CREATE TYPE EmailAddress AS VARCHAR(255);
```

**Source (T-SQL):**

```sql
CREATE TYPE EmailAddress FROM VARCHAR(255) NOT NULL;
```

**Snowflake equivalent:**

```sql
CREATE TYPE EmailAddress AS VARCHAR(255);
```

**Source (T-SQL):**

```sql
CREATE TYPE dbo.PhoneNumber FROM VARCHAR(20) NOT NULL;
```

**Snowflake equivalent:**

```sql
CREATE TYPE dbo.PhoneNumber AS VARCHAR(20);
```

**Source (T-SQL):**

```sql
CREATE TYPE Currency FROM DECIMAL(15,2);
```

**Snowflake equivalent:**

```sql
CREATE TYPE Currency AS DECIMAL(15, 2);
```

## Table types (`AS TABLE`)

`CREATE TYPE ... AS TABLE (...)` is not supported as a Snowflake user-defined type in this form; the converter emits an EWI and leaves the statement for manual resolution.

**Source (T-SQL):**

```sql
CREATE TYPE dbo.MyTableType AS TABLE (Id INT, Name VARCHAR(100));
```

**Snowflake equivalent (with EWI):**

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0107 - CREATE TYPE AS TABLE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE TYPE dbo.MyTableType AS TABLE (
  Id INT,
  Name VARCHAR(100)
);
```

**Notes:** See [SSC-EWI-TS0107](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md) in the SQL Server conversion issues documentation.

---
title: SnowConvert AI - SQL Server-Azure Synapse - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-data-types.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Data Types

Snowflake supports most basic SQL data types (with some restrictions) for use in columns, local variables, expressions, parameters, and any other appropriate/suitable locations.

Applies to

* SQL Server
* Azure Synapse Analytics

## Exact and approximate numerics

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| BIGINT | BIGINT | ​Note that BIGINT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| BIT | BOOLEAN | SQLServer only accepts ​1, 0, or NULL |
| DECIMAL | DECIMAL | ​Snowflake’s DECIMAL is synonymous with NUMBER |
| FLOAT | FLOAT | ​This data type behaves equally on both systems.  Precision 7-15 digits, float (1-24)  Storage 4 - 8 bytes, float (25-53) |
| INT | INT | Note that INT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| MONEY | NUMBER(38, 4) | [See note on this conversion below.] |
| REAL​ | REAL | Snowflake’s REAL is synonymous with FLOAT |
| SMALLINT | SMALLINT​ | ​This data type behaves equally |
| SMALLMONEY | NUMBER(38, 4) | [See note on this conversion below.] |
| TINYINT​ | TINYINT | Note that TINYINT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| NUMERIC | NUMERIC | ​Snowflake’s NUMERIC is synonymous with NUMBER |

**NOTE:**

* For the conversion of integer data types (INT, SMALLINT, BIGINT, TINYINT), each is converted to the alias in Snowflake with the same name. Each of those aliases is actually converted to NUMBER(38,0), a data type that is considerably larger than the integer datatype. Below is a comparison of the range of values that can be present in each data type:

  + Snowflake NUMBER(38,0): -99999999999999999999999999999999999999 to +99999999999999999999999999999999999999
  + SQLServer TINYINT: 0 to 255
  + SQLServer INT: -2^31 (-2,147,483,648) to 2^31-1 (2,147,483,647)
  + SQLServer BIGINT: -2^63 (-9,223,372,036,854,775,808) to 2^63-1 (9,223,372,036,854,775,807)
  + SQLServer SMALLINT: -2^15 (-32,768) to 2^15-1 (32,767)
* For Money and Smallmoney: ​

  + Currency or monetary data does not need to be enclosed in single quotation marks ( ‘ ). It is important to remember that while you can specify monetary values preceded by a currency symbol, SQL Server does not store any currency information associated with the symbol, it only stores the numeric value.
  + Please take care on the translations for the DMLs

## Date and time

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| DATE | DATE | ​SQLServer accepts range from 0001-01-01 to 9999-12-31 |
| DATETIME2 | TIMESTAMP_NTZ(7)​ | Snowflake’s DATETIME is an alias for TIMESTAMP_NTZ |
| DATETIME | TIMESTAMP_NTZ(3) | Snowflake’s DATETIME is an alias for TIMESTAMP_NTZ​ |
| DATETIMEOFFSET | TIMESTAMP_TZ(7) | Snowflake’s timestamp precision ranges from 0 to 9 (*this value’s the default*)  Snowflake’s operations are performed in the current session’s time zone, controlled by the TIMEZONE session parameter |
| SMALLDATETIME | TIMESTAMP_NTZ | Snowflake’s DATETIME truncates the TIME information  That is, 1955-12-13 12:43:10 is saved as 1955-12-13 |
| TIME | TIME | ​This data type behaves equally on both systems.  Range 00:00:00.0000000 through 23:59:59.9999999 |
| TIMESTAMP | BINARY(8) | SQL Server `timestamp` is a synonym for `rowversion` and stores a unique BINARY(8) value, not a date/time type. See [SSC-FDM-TS0046](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md). |

## Character strings

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| CHAR | CHAR | ​SQLServer’s max string size in bytes is 8000 whereas Snowflake is 167772161. |
| TEXT​ | TEXT |  |
| VARCHAR​ | VARCHAR | SQLServer’s max string size in bytes is 8000 whereas Snowflake is 167772161. SQLServer’s VARCHAR(MAX) has no equivalent in Snowflake, it is converted to VARCHAR to take the largest possible size by default. |

## Unicode character strings

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| NCHAR | NCHAR | Synonymous with VARCHAR except default length is VARCHAR(1). |
| NTEXT | TEXT | Snowflake uses TEXT data type as a synonym for VARCHAR  ​SQLServer’s NTEXT(MAX) has no equivalent in Snowflake, it is converted to VARCHAR to take the largest possible size by default. |
| NVARCHAR | VARCHAR | Snowflake uses this data type as a synonym for VARCHAR  ​SQLServer’s NVARCHAR(MAX) has no equivalent in Snowflake, it is converted to VARCHAR to take the largest possible size by default. |

## Binary strings

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| BINARY | ​BINARY | In Snowflake the maximum length is 8 MB (8,388,608 bytes) and length is always measured in terms of bytes. |
| VARBINARY | VARBINARY | Snowflake uses this data type as a synonym for BINARY.  Snowflake often represents each byte as 2 hexadecimal characters |
| IMAGE | VARBINARY | ​Snowflake uses this data type as a synonym for BINARY.  Snowflake often represents each byte as 2 hexadecimal characters |

## Other data types

| T-SQL | Snowflake | Notes |
| --- | --- | --- |
| CURSOR | *\*to be defined* | Not supported by Snowflake.  Translate into Cursor helpers |
| HIERARCHYID | *\*to be defined* | Not supported by Snowflake |
| SQL_VARIANT | VARIANT | Maximum size of 16 MB compressed.  A value of any data type can be implicitly cast to a VARIANT value |
| GEOMETRY | *\*to be defined* | Not supported by Snowflake |
| GEOGRAPHY | GEOGRAPHY | The objects store in Snowflake’s GEOGRAPHY data type must be WKT / WKB / EWKT / EWKB / GeoJSON geospatial objects to support LineString and Polygon objects |
| TABLE | *\*to be defined* | Not supported by Snowflake |
| ROWVERSION | BINARY(8) | SQL Server `rowversion` auto-generates a unique binary value on each INSERT/UPDATE. Snowflake BINARY(8) does not replicate this behavior. See [SSC-FDM-TS0046](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md). |
| UNIQUEIDENTIFIER | VARCHAR | ​​Snowflake uses STRING type as a synonym for VARCHAR. Because of conversion Snowflake often represents each byte as 2 hexadecimal characters |
| XML | VARIANT | ​Snowflake uses VARIANT data type as a synonym for XML |
| SYSNAME | VARCHAR(128) | NOT NULL constraint added to the column definition |

---
title: SnowConvert AI - SQL Server-Azure Synapse - DMLs
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-dmls.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - DMLs

## BETWEEN

Returns TRUE when the input expression (numeric or string) is within the
specified lower and upper boundary.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

**Source Code**

```sql
-- Additional Params: -t JavaScript
CREATE PROCEDURE ProcBetween
AS
BEGIN
declare @aValue int = 1;
IF(@aValue BETWEEN 1 AND 2)
   return 1
END;
GO
```

**Code Expected**

```sql
CREATE OR REPLACE PROCEDURE ProcBetween ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

   let AVALUE = 1;
   if (SELECT(`   ? BETWEEN 1 AND 2`,[AVALUE])) {
      return 1;
   }
$$;
```

## BULK INSERT

Translation reference for the Bulk Insert statement.

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

The direct translation for [BULK INSERT](https://docs.microsoft.com/en-us/sql/t-sql/statements/bulk-insert-transact-sql?view=sql-server-ver15) is the Snowflake [COPY INTO](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html) statement. The `COPY INTO` does not use directly the file path to retrieve the values. The file should exist before in a [STAGE](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html). Also the options used in the `BULK INSERT` should be specified in a Snowflake [FILE FORMAT](https://docs.snowflake.com/en/sql-reference/sql/create-file-format.html) that will be consumed by the `STAGE` or directly by the `COPY INTO`.

To add a file to some `STAGE` you should use the [PUT](https://docs.snowflake.com/en/sql-reference/sql/put.html) command. Notice that the command can be executed only from the [SnowSQL CLI](https://docs.snowflake.com/en/user-guide/snowsql.html). Here is an example of the steps we should do before executing a `COPY INTO`:

### SQL Server

```sql
-- Additional Params: -t JavaScript
CREATE PROCEDURE PROCEDURE_SAMPLE
AS

CREATE TABLE #temptable
 ([col1] varchar(100),
  [col2] int,
  [col3] varchar(100))

BULK INSERT #temptable FROM 'C:\test.txt'
WITH
(
   FIELDTERMINATOR ='\t',
   ROWTERMINATOR ='\n'
);

GO
```

### Snowflake

```sql
CREATE OR REPLACE FILE FORMAT FILE_FORMAT_638434968243607970
FIELD_DELIMITER = '\t'
RECORD_DELIMITER = '\n';

CREATE OR REPLACE STAGE STAGE_638434968243607970
FILE_FORMAT = FILE_FORMAT_638434968243607970;

--** SSC-FDM-TS0004 - PUT STATEMENT IS NOT SUPPORTED ON WEB UI. YOU SHOULD EXECUTE THE CODE THROUGH THE SNOWFLAKE CLI **
PUT file://C:\test.txt @STAGE_638434968243607970 AUTO_COMPRESS = FALSE;

CREATE OR REPLACE PROCEDURE PROCEDURE_SAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
 // SnowConvert AI Helpers Code section is omitted.

 EXEC(`CREATE OR REPLACE TEMPORARY TABLE T_temptable
(
   col1 VARCHAR(100),
   col2 INT,
   col3 VARCHAR(100))`);
 EXEC(`COPY INTO T_temptable FROM @STAGE_638434968243607970/test.txt`);
$$
```

As you see in the code above, SnowConvert AI identifies all the `BULK INSERTS` in the code, and for each instance, a new `STAGE` and `FILE FORMAT` will be created before the copy into execution. In addition, after the creation of the `STAGE`, a `PUT` command will be created as well to add the file to the stage.

The names of the generated statements are auto-generated using the current timestamp in seconds, to avoid collisions between their usages.

Finally, all the options for the bulk insert are being mapped to file format options if apply. If the option is not supported in Snowflake, it will be commented and a warning will be added. See also [SSC-FDM-TS0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md).

#### Supported bulk options

| SQL Server | Snowflake |
| --- | --- |
| FORMAT | TYPE |
| FIELDTERMINATOR | FIELD_DELIMITER |
| FIRSTROW | SKIP_HEADER |
| ROWTERMINATOR | RECORD_DELIMITER |
| FIELDQUOTE | FIELD_OPTIONALLY_ENCLOSED_BY |

### Related EWIs

1. [SSC-FDM-TS0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): PUT STATEMENT IS NOT SUPPORTED ON WEB UI.

## Common Table Expression (CTE)

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

Common table expressions are supported in Snowflake SQL by default.

### Syntax

#### Snowflake SQL

Subquery:

```sql
[ WITH
       <cte_name1> [ ( <cte_column_list> ) ] AS ( SELECT ...  )
   [ , <cte_name2> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
   [ , <cte_nameN> [ ( <cte_column_list> ) ] AS ( SELECT ...  ) ]
]
SELECT ...
```

Recursive CTE:

```sql
[ WITH [ RECURSIVE ]
       <cte_name1> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause )
   [ , <cte_name2> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause ) ]
   [ , <cte_nameN> ( <cte_column_list> ) AS ( anchorClause UNION ALL recursiveClause ) ]
]
SELECT ...
```

Where:

```sql
anchorClause ::=
    SELECT <anchor_column_list> FROM ...

 recursiveClause ::=
     SELECT <recursive_column_list> FROM ... [ JOIN ... ]
```

#### Noteworthy details

The RECURSIVE keyword does not exist in T-SQL, and the transformation does not actively add the keyword to the result. A warning is added to the output code to state this behavior.

#### Common Table Expression with SELECT INTO

The following transformation occurs when the WITH expression is followed by an SELECT INTO statement and it will be transformed into a [TEMPORARY TABLE](https://docs.snowflake.com/en/user-guide/tables-temp-transient.html).

#### SQL Server:

```sql
WITH ctetable(col1, col2) AS
    (
        SELECT	col1, col2 FROM	t1 poh WHERE poh.col1 = 16 and poh.col2 = 4
    ),
    employeeCte AS
    (
	SELECT BUSINESSENTITYID, VACATIONHOURS FROM employee WHERE BUSINESSENTITYID = (SELECT col1 FROM ctetable)
    ),
    finalCte AS
    (
        SELECT BUSINESSENTITYID, VACATIONHOURS FROM employeeCte
    ) SELECT * INTO #table2 FROM finalCte;

SELECT * FROM #table2;
```

#### Snowflake:

```sql
CREATE OR REPLACE TEMPORARY TABLE T_table2 AS
	WITH ctetable (
		col1,
		col2
	) AS
		   (
		       SELECT
		   		col1,
		   		col2
		       FROM
		   		t1 poh
		       WHERE
		   		poh.col1 = 16 and poh.col2 = 4
		   ),
		   		employeeCte AS
		   		    (
		   			SELECT
		   		BUSINESSENTITYID,
		   		VACATIONHOURS
		       FROM
		   		employee
		       WHERE
		   		BUSINESSENTITYID = (SELECT
		   						col1
		   					FROM
		   						ctetable
		   		)
		   		    ),
		   		finalCte AS
		   		    (
		   		        SELECT
		   		BUSINESSENTITYID,
		   		VACATIONHOURS
		       FROM
		   		employeeCte
		   		    )
		   		SELECT
		       *
		       FROM
		       finalCte;

		       SELECT
		       *
		       FROM
		       T_table2;
```

#### Common Table Expression with other expressions

The following transformation occurs when the WITH expression is followed by INSERT or DELETE statements.

#### SQL Server:

```sql
WITH CTE AS( SELECT * from table1)
INSERT INTO Table2 (a,b,c,d)
SELECT a,b,c,d
FROM CTE
WHERE e IS NOT NULL;
```

#### Snowflake:

```sql
INSERT INTO Table2 (a, b, c, d)
WITH CTE AS( SELECT
*
from
table1
)
SELECT
a,
b,
c,
d
FROM
CTE AS CTE
WHERE
e IS NOT NULL;
```

#### Common Table Expression with Delete From

For this transformation, it will only apply for a CTE (Common Table Expression) with a Delete From, however, only for some specifics CTE. It must have only one CTE, and it must have inside a function of ROW_NUMBER or RANK.

The purpose of the CTE with the Delete must be to remove duplicates from a table. In case that the CTE with Delete intents to remove another kind of data, this transformation will not apply.

Let’s see an example. For a working example, we must first create a table with some data.

```sql
CREATE TABLE WithQueryTest
(
    ID BIGINT,
    Value BIGINT,
    StringValue NVARCHAR(258)
);

Insert into WithQueryTest values(100, 100, 'First');
Insert into WithQueryTest values(200, 200, 'Second');
Insert into WithQueryTest values(300, 300, 'Third');
Insert into WithQueryTest values(400, 400, 'Fourth');
Insert into WithQueryTest values(100, 100, 'First');
```

Note that there is a duplicated value. The lines 8 and 12 insert the same value. Now we are going to eliminate the duplicates rows in a table.

```sql
WITH Duplicated AS (
SELECT *, ROW_NUMBER() OVER (PARTITION BY ID ORDER BY ID) AS RN
FROM WithQueryTest
)
DELETE FROM Duplicated
WHERE Duplicated.RN > 1
```

If we execute a Select from the table, it will show the following result

| ID | Value | StringValue |
| --- | --- | --- |
| 100 | 100 | First |
| 200 | 200 | Second |
| 300 | 300 | Third |
| 400 | 400 | Fourth |

Note that the duplicated rows have been removed. To preserve this functionality in Snowflake, which does not support DELETE from a CTE, SnowConvert transforms the statement into the following:

```sql
CREATE OR REPLACE TABLE PUBLIC.WithQueryTest AS SELECT
*
FROM PUBLIC.WithQueryTest
QUALIFY ROW_NUMBER()
OVER (PARTITION BY ID ORDER BY ID) = 1 ;
```

As you can see, the query is transformed to a Create Or Replace Table.

To test it in Snowflake, you will need the table.

```sql
CREATE OR REPLACE TABLE PUBLIC.WithQueryTest
(
ID BIGINT,
Value BIGINT,
StringValue VARCHAR(258)
);

Insert into PUBLIC.WithQueryTest values(100, 100, 'First');
Insert into PUBLIC.WithQueryTest values(200, 200, 'Second');
Insert into PUBLIC.WithQueryTest values(300, 300, 'Third');
Insert into PUBLIC.WithQueryTest values(400, 400, 'Fourth');
Insert into PUBLIC.WithQueryTest values(100, 100, 'First');
```

Now, if we execute the result of the transformation, and then a Select to check if the duplicated rows were deleted, this would be the result.

| ID | Value | StringValue |
| --- | --- | --- |
| 100 | 100 | First |
| 200 | 200 | Second |
| 300 | 300 | Third |
| 400 | 400 | Fourth |

#### Common Table Expression with MERGE statement

The following transformation occurs when the WITH expression is followed by MERGE statement and it will be transformed into a [MERGE INTO](https://docs.snowflake.com/en/sql-reference/sql/merge.html).

##### SQL Server:

```sql
WITH ctetable(col1, col2) as
    (
        SELECT col1, col2
        FROM t1 poh
        where poh.col1 = 16 and poh.col2 = 88
    ),
    finalCte As
    (
        SELECT col1 FROM ctetable
    )
    MERGE
  table1 AS target
  USING finalCte AS source
  ON (target.ID = source.COL1)
  WHEN MATCHED THEN UPDATE SET target.ID = source.Col1
  WHEN NOT MATCHED THEN INSERT (ID, col1) VALUES (source.COL1, source.COL1 );
```

##### Snowflake:

```sql
MERGE INTO table1 AS target
USING (
  --** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
  WITH ctetable (
    col1,
    col2
  ) as
       (
           SELECT
           col1,
           col2
           FROM
           t1 poh
           where
           poh.col1 = 16 and poh.col2 = 88
       ),
           finalCte As
               (
                   SELECT
           col1
           FROM
           ctetable
               )
           SELECT
           *
           FROM
           finalCte
) AS source
ON (target.ID = source.COL1)
WHEN MATCHED THEN
           UPDATE SET
           target.ID = source.Col1
WHEN NOT MATCHED THEN
           INSERT (ID, col1) VALUES (source.COL1, source.COL1);
```

#### Common Table Expression with UPDATE statement

The following transformation occurs when the WITH expression is followed by an UPDATE statement and it will be transformed into an [UPDATE](https://docs.snowflake.com/en/sql-reference/sql/update.html) statement.

##### SQL Server:

```sql
WITH ctetable(col1, col2) AS
    (
        SELECT col1, col2
        FROM table2 poh
        WHERE poh.col1 = 5 and poh.col2 = 4
    )
UPDATE tab1
SET ID = 8, COL1 = 8
FROM table1 tab1
INNER JOIN ctetable CTE ON tab1.ID = CTE.col1;
```

##### Snowflake:

```sql
UPDATE dbo.table1 tab1
    SET
        ID = 8,
        COL1 = 8
    FROM
        (
            WITH ctetable (
                col1,
                col2
            ) AS
                   (
                       SELECT
                           col1,
                           col2
                       FROM
                           table2 poh
                       WHERE
                           poh.col1 = 5 and poh.col2 = 4
                   )
                   SELECT
                       *
                   FROM
                       ctetable
        ) AS CTE
    WHERE
        tab1.ID = CTE.col1;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0108](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The following subquery matches at least one of the patterns considered invalid and may produce compilation errors.
2. [SSC-PRF-TS0001](../../general/technical-documentation/issues-and-troubleshooting/performance-review/sqlServerPRF.md): Performance warning - recursion for CTE not checked. Might require a recursive keyword.

## DELETE

Translation reference for Transact-SQL Delete statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Removes one or more rows from a table or view in SQL Server. For more information, see the [SQL Server DELETE documentation](https://docs.microsoft.com/en-us/sql/t-sql/statements/delete-transact-sql?view=sql-server-ver15).

```sql
 [ WITH <common_table_expression> [ ,...n ] ]
DELETE
    [ TOP ( expression ) [ PERCENT ] ]
    [ FROM ]
    { { table_alias
      | <object>
      | rowset_function_limited
      [ WITH ( table_hint_limited [ ...n ] ) ] }
      | @table_variable
    }
    [ <OUTPUT Clause> ]
    [ FROM table_source [ ,...n ] ]
    [ WHERE { <search_condition>
            | { [ CURRENT OF
                   { { [ GLOBAL ] cursor_name }
                       | cursor_variable_name
                   }
                ]
              }
            }
    ]
    [ OPTION ( <Query Hint> [ ,...n ] ) ]
[; ]

<object> ::=
{
    [ server_name.database_name.schema_name.
      | database_name. [ schema_name ] .
      | schema_name.
    ]
    table_or_view_name
}
```

### Sample Source Patterns

#### Sample Data

##### SQL Server

```sql
CREATE TABLE Employees (
    EmployeeID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    DepartmentID INT
);

CREATE TABLE Departments (
    DepartmentID INT PRIMARY KEY,
    DepartmentName VARCHAR(50)
);

INSERT INTO Employees (EmployeeID, FirstName, LastName, DepartmentID) VALUES
(1, 'John', 'Doe', 1),
(2, 'Jane', 'Smith', 2),
(3, 'Bob', 'Johnson', 1),
(4, 'Alice', 'Brown', 3),
(5, 'Michael', 'Davis', NULL);

INSERT INTO Departments (DepartmentID, DepartmentName) VALUES
(1, 'Sales'),
(2, 'Marketing'),
(3, 'Engineering'),
(4, 'Finance');
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE Employees (
    EmployeeID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    DepartmentID INT
);

CREATE OR REPLACE TABLE Departments (
    DepartmentID INT PRIMARY KEY,
    DepartmentName VARCHAR(50)
);

INSERT INTO Employees (EmployeeID, FirstName, LastName, DepartmentID) VALUES
(1, 'John', 'Doe', 1),
(2, 'Jane', 'Smith', 2),
(3, 'Bob', 'Johnson', 1),
(4, 'Alice', 'Brown', 3),
(5, 'Michael', 'Davis', NULL);

INSERT INTO Departments (DepartmentID, DepartmentName) VALUES
(1, 'Sales'),
(2, 'Marketing'),
(3, 'Engineering'),
(4, 'Finance');
```

#### Basic Case

The transformation for the DELETE statement is fairly straightforward, with some caveats. One of these caveats is the way Snowflake supports multiple sources in the FROM clause, however, there is an equivalent in Snowflake as shown below.

##### SQL Server

```sql
 DELETE T1 FROM Departments T2, Employees T1 WHERE T1.DepartmentID = T2.DepartmentID
```

##### Snowflake

```sql
DELETE FROM
Employees T1
USING Departments T2
WHERE
T1.DepartmentID = T2.DepartmentID;
```

> **Note:**
>
> Note that, since the original DELETE was for T1, the presence of TABLE2 T2 in the FROM clause requires the creation of the USING clause.

#### Delete duplicates from a table

The following documentation explains a [common pattern used to remove duplicated rows from a table in SQL Server](https://learn.microsoft.com/en-us/troubleshoot/sql/database-engine/development/remove-duplicate-rows-sql-server-tab#method-2). This approach uses the `ROW_NUMBER` function to partition the data based on the `key_value` which may be one or more columns separated by commas. Then, delete all records that received a row number value that is greater than 1. This value indicates that the records are duplicates. You can read the referenced documentation to understand the behavior of this method and recreate it.

```sql
DELETE T
FROM
(
SELECT *
, DupRank = ROW_NUMBER() OVER (
              PARTITION BY key_value
              ORDER BY ( {expression} )
            )
FROM original_table
) AS T
WHERE DupRank > 1
```

The following example uses this approach to remove duplicates from a table and its equivalent in Snowflake. The transformation consists of performing an [INSERT OVERWRITE](https://docs.snowflake.com/en/sql-reference/sql/insert#optional-parameters) statement which truncates the table (removes all data) and then inserts again the rows in the same table ignoring the duplicated ones. The output code is generated considering the same `PARTITION BY` and `ORDER BY` clauses used in the original code.

##### SQL Server

Create a table with duplicated rows

##### Insert duplicates

```sql
 create table duplicatedRows(
    someID int,
    col2 bit,
    col3 bit,
    col4 bit,
    col5 bit
);

insert into duplicatedRows VALUES(10, 1, 0, 0, 1);
insert into duplicatedRows VALUES(10, 1, 0, 0, 1);
insert into duplicatedRows VALUES(11, 1, 1, 0, 1);
insert into duplicatedRows VALUES(12, 0, 0, 1, 1);
insert into duplicatedRows VALUES(12, 0, 0, 1, 1);
insert into duplicatedRows VALUES(13, 1, 0, 1, 0);
insert into duplicatedRows VALUES(14, 1, 0, 1, 0);
insert into duplicatedRows VALUES(14, 1, 0, 1, 0);

select * from duplicatedRows;
```

##### Output

| someID | col2 | col3 | col4 | col5 |
| --- | --- | --- | --- | --- |
| 10 | true | false | false | true |
| 10 | true | false | false | true |
| 11 | true | true | false | true |
| 12 | false | false | true | true |
| 12 | false | false | true | true |
| 13 | true | false | true | false |
| 14 | true | false | true | false |
| 14 | true | false | true | false |

##### Remove duplicates

```sql
 DELETE f FROM (
	select  someID, row_number() over (
		partition by someID, col2
		order by
			case when COL3 = 1 then 1 else 0 end
			+ case when col4 = 1 then 1 else 0 end
			+ case when col5 = 1 then 1 else 0 end
			asc
		) as rownum
	from
		duplicatedRows
	) f where f.rownum > 1;

select * from duplicatedRows;
```

##### Output

| someID | col2 | col3 | col4 | col5 |
| --- | --- | --- | --- | --- |
| 10 | true | false | false | true |
| 11 | true | true | false | true |
| 12 | false | false | true | true |
| 13 | true | false | true | false |
| 14 | true | false | true | false |

##### Snowflake

Create a table with duplicated rows

##### Insert duplicates

```sql
 create table duplicatedRows(
    someID int,
    col2 BOOLEAN,
    col3 BOOLEAN,
    col4 BOOLEAN,
    col5 BOOLEAN
);

insert into duplicatedRows VALUES(10, 1, 0, 0, 1);
insert into duplicatedRows VALUES(10, 1, 0, 0, 1);
insert into duplicatedRows VALUES(11, 1, 1, 0, 1);
insert into duplicatedRows VALUES(12, 0, 0, 1, 1);
insert into duplicatedRows VALUES(12, 0, 0, 1, 1);
insert into duplicatedRows VALUES(13, 1, 0, 1, 0);
insert into duplicatedRows VALUES(14, 1, 0, 1, 0);
insert into duplicatedRows VALUES(14, 1, 0, 1, 0);

select * from duplicatedRows;
```

##### Output

| someID | col2 | col3 | col4 | col5 |
| --- | --- | --- | --- | --- |
| 10 | true | false | false | true |
| 10 | true | false | false | true |
| 11 | true | true | false | true |
| 12 | false | false | true | true |
| 12 | false | false | true | true |
| 13 | true | false | true | false |
| 14 | true | false | true | false |
| 14 | true | false | true | false |

##### Remove duplicates

```sql
   insert overwrite into duplicatedRows
            SELECT
                *
            FROM
                duplicatedRows
            QUALIFY
                ROW_NUMBER()
                over
                (partition by someID, col2
		    order by
			case when COL3 = 1 then 1 else 0 end
			+ case when col4 = 1 then 1 else 0 end
			+ case when col5 = 1 then 1 else 0 end
			asc) = 1;

select * from duplicatedRows;
```

##### Output

| someID | col2 | col3 | col4 | col5 |
| --- | --- | --- | --- | --- |
| 10 | true | false | false | true |
| 11 | true | true | false | true |
| 12 | false | false | true | true |
| 13 | true | false | true | false |
| 14 | true | false | true | false |

> **Warning:**
>
> Consider that there may be several variations of this pattern, but all of them are based on the same principle and have the same structure.

#### DELETE WITH INNER JOIN

##### SQL SERVER

```sql
DELETE ee
FROM Employees ee INNER JOIN Departments dept
ON ee.DepartmentID = dept.DepartmentID;

SELECT * FROM Employees;
```

#### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 5 | Michael | Davis | null |
| 6 | Lucas | Parker | 8 |

##### Snowflake

```sql
DELETE FROM
    Employees ee
USING Departments dept
WHERE
    ee.DepartmentID = dept.DepartmentID;

SELECT
    *
FROM
    Employees;
```

##### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 5 | Michael | Davis | null |
| 6 | Lucas | Parker | 8 |

#### DELETE WITH LEFT JOIN

##### SQL Server

```sql
DELETE Employees
FROM Employees LEFT JOIN Departments
ON Employees.DepartmentID = Departments.DepartmentID
WHERE Departments.DepartmentID IS NULL;

SELECT * FROM Employees;
```

##### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 1 | John | Doe | 1 |
| 2 | Jane | Smith | 2 |
| 3 | Bob | Johnson | 1 |
| 4 | Alice | Brown | 3 |

##### Snowflake

```sql
DELETE FROM
    Employees
USING Departments
WHERE
    Departments.DepartmentID IS NULL
    AND Employees.DepartmentID = Departments.DepartmentID(+);

SELECT
    *
FROM
    Employees;
```

##### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 1 | John | Doe | 1 |
| 2 | Jane | Smith | 2 |
| 3 | Bob | Johnson | 1 |
| 4 | Alice | Brown | 3 |

#### DELETE WITH RIGHT JOIN

##### SQL SERVER

```sql
DELETE Employees
FROM Employees RIGHT JOIN Departments
ON Employees.DepartmentID = Departments.DepartmentID
WHERE Employees.DepartmentID IS NOT NULL;

SELECT * FROM Employees;
```

##### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 5 | Michael | Davis | null |
| 6 | Lucas | Parker | 8 |

##### Snowflake

```sql
DELETE FROM
    Employees
USING Departments
WHERE
    Employees.DepartmentID IS NOT NULL
    AND Employees.DepartmentID(+) = Departments.DepartmentID;

SELECT
    *
FROM
    Employees;
```

##### Output

| EmployeeID | FirstName | LastName | DepartmentID |
| --- | --- | --- | --- |
| 5 | Michael | Davis | null |
| 6 | Lucas | Parker | 8 |

### Known Issues

1. **FULL JOIN not supported**
   The FULL JOIN can not be represented using the (+) syntax. When found, SnowConvert AI will warn the user about this with an FDM.

#### SQL Server

```sql
DELETE Employees
FROM Employees FULL OUTER JOIN Departments
ON Employees.DepartmentID = Departments.DepartmentID
WHERE Departments.DepartmentID IS NULL;
```

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0081 - USING A FULL JOIN IN A DELETE STATEMENT IS NOT SUPPORTED ***/!!!
DELETE FROM
    Employees
USING Departments
WHERE
    Departments.DepartmentID IS NULL
    AND Employees.DepartmentID = Departments.DepartmentID;
```

### Related EWIs

1. [SSC-EWI-TS0081](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Using a full join in a delete statement is not supported

## DROP STATEMENT

DROP statements

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

### DROP TABLE

#### Transact-SQL

```sql
DROP TABLE [ IF EXISTS ] <table_name> [ ,...n ]
[ ; ]
```

#### Snowflake

```sql
DROP TABLE [ IF EXISTS ] <name> [ CASCADE | RESTRICT ]
```

#### Translation

Translation for single `DROP TABLE` statements is very straightforward. As long as there is only one table being dropped within the statement, it’s left as-is.

For example:

```sql
DROP TABLE IF EXISTS [table_name]
```

```sql
DROP TABLE IF EXISTS table_name;
```

The only noteworthy difference between SQL Server and Snowflake appears when the input statement drops more than one table. In these scenarios, a different `DROP TABLE` statement is created for each table being dropped.

For example:

##### SQL Server

```sql
DROP TABLE IF EXISTS [table_name], [table_name2], [table_name3]
```

##### Snowflake

```sql
DROP TABLE IF EXISTS table_name;

DROP TABLE IF EXISTS table_name2;

DROP TABLE IF EXISTS table_name3;
```

## EXISTS

Transact-SQL subqueries using EXISTS statement transformation details

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

### Types of Subqueries

Subqueries can be categorized as correlated or uncorrelated:

A correlated subquery, refers to one or more columns from outside of the subquery. (The columns are typically referenced inside the WHERE clause of the subquery.) A correlated subquery can be thought of as a filter on the table that it refers to, as if the subquery were evaluated on each row of the table in the outer query.

An uncorrelated subquery, has no such external column references. It is an independent query, the results of which are returned to and used by the outer query once (not per row).

The EXISTS statement is considered a correlated subquery.

#### SQL SERVER

```sql
-- Additional Params: -t JavaScript
CREATE PROCEDURE ProcExists
AS
BEGIN
IF(EXISTS(Select AValue from ATable))
  return 1;
END;
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE ProcExists ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  if (SELECT(`   EXISTS(Select
         AValue
      from
         ATable
   )`)) {
    return 1;
  }
$$;
```

## IN

Transact-SQL subqueries using IN statement transformation details

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

The IN operator checks if an expression is included in the values returned by a subquery.

### SQL SERVER

```sql
-- Additional Params: -t JavaScript
CREATE PROCEDURE dbo.SP_IN_EXAMPLE
AS
	DECLARE @results as VARCHAR(50);

	SELECT @results = COUNT(*) FROM TABLE1

	IF @results IN (1,2,3)
		SELECT 'is IN';
	ELSE
		SELECT 'is NOT IN';

	return
GO

-- =============================================
-- Example to execute the stored procedure
-- =============================================
EXECUTE dbo.SP_IN_EXAMPLE
GO
```

### Snowflake

```sql
CREATE OR REPLACE PROCEDURE dbo.SP_IN_EXAMPLE ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let RESULTS;
	SELECT(`   COUNT(*) FROM
   TABLE1`,[],(value) => RESULTS = value);
	if ([1,2,3].includes(RESULTS)) {
	} else {
	}
	return;
$$;

-- =============================================
-- Example to execute the stored procedure
-- =============================================
CALL dbo.SP_IN_EXAMPLE();
```

## INSERT

Translation reference for SQL Server Insert statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

### Description

Adds one or more rows to a table or a view in SQL Server. For more information, see the [SQL Server INSERT documentation](https://docs.microsoft.com/en-us/sql/t-sql/statements/insert-transact-sql?view=sql-server-ver15).

#### Syntax comparison

The basic insert grammar is equivalent between both SQL languages. However there are still some other syntax elements in SQL Server that show differences, for example, one allows the developer to add a value to a column by using the assign operator. The syntax mentioned will be transformed to the basic insert syntax too.

##### Snowflake

```sql
INSERT [ OVERWRITE ] INTO <target_table> [ ( <target_col_name> [ , ... ] ) ]
       {
         VALUES ( { <value> | DEFAULT | NULL } [ , ... ] ) [ , ( ... ) ]  |
         <query>
       }
```

##### SQL Server

```sql
[ WITH <common_table_expression> [ ,...n ] ]
INSERT
{
        [ TOP ( expression ) [ PERCENT ] ]
        [ INTO ]
        { <object> | rowset_function_limited
          [ WITH ( <Table_Hint_Limited> [ ...n ] ) ]
        }
    {
        [ ( column_list ) ]
        [ <OUTPUT Clause> ]
        { VALUES ( { DEFAULT | NULL | expression } [ ,...n ] ) [ ,...n     ]
        | derived_table
        | execute_statement
        | <dml_table_source>
        | DEFAULT VALUES
        }
    }
}
[;]

<object> ::=
{
    [ server_name . database_name . schema_name .
      | database_name .[ schema_name ] .
      | schema_name .
    ]
  table_or_view_name
}

<dml_table_source> ::=
    SELECT <select_list>
    FROM ( <dml_statement_with_output_clause> )
      [AS] table_alias [ ( column_alias [ ,...n ] ) ]
    [ WHERE <search_condition> ]
        [ OPTION ( <query_hint> [ ,...n ] ) ]
```

### Sample Source Patterns

#### Basic INSERT

##### SQL Server

```sql
INSERT INTO TABLE1 VALUES (1, 2, 123, 'LiteralValue');
```

##### Snowflake

```sql
INSERT INTO TABLE1 VALUES (1, 2, 123, 'LiteralValue');
```

#### INSERT with assign operator

##### SQL Server

```sql
INSERT INTO aTable (columnA = 'varcharValue', columnB = 1);
```

##### Snowflake

```sql
INSERT INTO aTable (columnA = 'varcharValue', columnB = 1);
```

#### INSERT with no INTO

##### SQL Server

```sql
INSERT exampleTable VALUES ('Hello', 23);
```

##### Snowflake

```sql
INSERT INTO exampleTable VALUES ('Hello', 23);
```

#### INSERT with common table expression

##### SQL Server

```sql
WITH ctevalues (textCol, numCol) AS (SELECT 'cte string', 155)
INSERT INTO exampleTable SELECT * FROM ctevalues;
```

##### Snowflake

```sql
INSERT INTO exampleTable
WITH ctevalues (
textCol,
numCol
) AS (SELECT 'cte string', 155)
SELECT
*
FROM
ctevalues AS ctevalues;
```

#### INSERT with Table DML Factor with MERGE as DML

This case is so specific where the `INSERT` statement has a `SELECT` query, and the `FROM` clause of the `SELECT` mentioned contains a `MERGE` DML statement.
Looking for an equivalent in Snowflake, the next statements are created: a temporary table, the merge statement converted, and finally, the insert statement.

##### SQL Server

```sql
INSERT INTO T3
SELECT
  col1,
  col2
FROM (
  MERGE T1 USING T2
  	ON T1.col1 = T2.col1
  WHEN NOT MATCHED THEN
    INSERT VALUES ( T2.col1, T2.col2 )
  WHEN MATCHED THEN
    UPDATE SET T1.col2 = t2.col2
  OUTPUT
  	$action ACTION_OUT,
    T2.col1,
    T2.col2
) AS MERGE_OUT
 WHERE ACTION_OUT='UPDATE';
```

##### Snowflake

```sql
--** SSC-FDM-TS0026 - DELETE CASE IS NOT BEING CONSIDERED, PLEASE CHECK IF THE ORIGINAL MERGE PERFORMS IT **
CREATE OR REPLACE TEMPORARY TABLE MERGE_OUT AS
SELECT
	CASE WHEN T1.$1 IS NULL THEN 'INSERT' ELSE 'UPDATE' END ACTION_OUT,
	T2.col1,
	T2.col2
FROM T2 LEFT JOIN T1 ON T1.col1 = T2.col1;

MERGE INTO T1
USING T2
ON T1.col1 = T2.col1
WHEN NOT MATCHED THEN INSERT VALUES (T2.col1, T2.col2)
WHEN MATCHED THEN UPDATE SET T1.col2 = t2.col2
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - OUTPUT CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
OUTPUT
	$action ACTION_OUT,
	T2.col1,
	T2.col2 ;

INSERT INTO T3
SELECT col1, col2
FROM MERGE_OUT
WHERE ACTION_OUT ='UPDATE';
```

**NOTE:** As the pattern’s name suggests, it is ONLY for cases where the insert comes with a select…from which the body contains a MERGE statement.

### Known Issues

**1. Syntax elements that require special mappings:**

* [INTO]: This keyword is obligatory in Snowflake and should be added if not present.
* [DEFAULT VALUES]: Inserts the default value in all columns specified in the insert. Should be transformed to VALUES (DEFAULT, DEFAULT, …), the amount of DEFAULTs added equals the number of columns the insert will modify. For now, there is a warning being added.

#### SQL Server

```sql
INSERT INTO exampleTable DEFAULT VALUES;
```

#### Snowflake

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'INSERT WITH DEFAULT VALUES' NODE ***/!!!
INSERT INTO exampleTable DEFAULT VALUES;
```

**2. Syntax elements not supported or irrelevant:**

* [TOP (expression) [PERCENT]]: Indicates the amount or percent of rows that will be inserted. Not supported.
* [rowset_function_limited]: It is either OPENQUERY() or OPENROWSET(), used to read data from remote servers. Not supported.
* [WITH table_hint_limited]: These are used to get reading/writing locks on tables. Not relevant in Snowflake.
* [<OUTPUT Clause>]: Specifies a table or result set in which the inserted rows will also be inserted. Not supported.
* [execute_statement]: Can be used to run a query to get data from. Not supported.
* [dml_table_source]: A temporary result set generated by the OUTPUT clause of another DML statement. Not supported.

**3. The DELETE case is not being considered.**

* For the *INSERT with Table DML Factor with MERGE as DML* pattern, the DELETE case is not being considered in the solution, so if the source code merge statement has a DELETE case please consider that it might not work as expected.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-FDM-TS0026](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): DELETE case is not being considered.

## MERGE

Transact-SQL MERGE statement transformation details

Applies to

* SQL Server
* Azure Synapse Analytics

### Syntax comparison

#### Snowflake

```sql
MERGE
    INTO <target_table>
    USING <source>
    ON <join_expr>
    { matchedClause | notMatchedClause } [ ... ]
```

#### Transact-SQL

```sql
-- SQL Server and Azure SQL Database
[ WITH <common_table_expression> [,...n] ]
MERGE
    [ TOP ( expression ) [ PERCENT ] ]
    [ INTO ] <target_table> [ WITH ( <merge_hint> ) ] [ [ AS ] table_alias ]
    USING <table_source> [ [ AS ] table_alias ]
    ON <merge_search_condition>
    [ WHEN MATCHED [ AND <clause_search_condition> ]
        THEN <merge_matched> ] [ ...n ]
    [ WHEN NOT MATCHED [ BY TARGET ] [ AND <clause_search_condition> ]
        THEN <merge_not_matched> ]
    [ WHEN NOT MATCHED BY SOURCE [ AND <clause_search_condition> ]
        THEN <merge_matched> ] [ ...n ]
    [ <output_clause> ]
    [ OPTION ( <query_hint> [ ,...n ] ) ]
;
```

#### Example

Given the following source code:

##### SQL Server

```sql
MERGE
INTO
  targetTable WITH(KEEPIDENTITY, KEEPDEFAULTS, HOLDLOCK, IGNORE_CONSTRAINTS, IGNORE_TRIGGERS, NOLOCK, INDEX(value1, value2, value3)) as tableAlias
USING
  tableSource AS tableAlias2
ON
  mergeSetCondition > mergeSetCondition
WHEN MATCHED BY TARGET AND pi.Quantity - src.OrderQty >= 0
  THEN UPDATE SET pi.Quantity = pi.Quantity - src.OrderQty
OUTPUT $action, DELETED.v AS DELETED, INSERTED.v INSERTED INTO @localVar(col, list)
OPTION(RECOMPILE);
```

You can expect to get something like this:

##### Snowflake

```sql
MERGE INTO targetTable as tableAlias
USING tableSource AS tableAlias2
ON mergeSetCondition > mergeSetCondition
WHEN MATCHED AND pi.Quantity - src.OrderQty >= 0 THEN
  UPDATE SET
    pi.Quantity = pi.Quantity - src.OrderQty
    !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - OUTPUT CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
   OUTPUT $action, DELETED.v AS DELETED, INSERTED.v INSERTED INTO @localVar(col, list);
```

### Related EWIs

1. [SSC-EWI-0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Syntax not supported in Snowflake.

## SELECT

Translation reference to convert SQL Server Select statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

### Description

Allows the selection of one or more rows or columns of one or more tables in SQL Server.
For more information, see the [SQL Server SELECT documentation](https://docs.microsoft.com/en-us/sql/t-sql/queries/select-transact-sql?view=sql-server-ver15).

```sql
<SELECT statement> ::=
    [ WITH { [ XMLNAMESPACES ,] [ <common_table_expression> [,...n] ] } ]
    <query_expression>
    [ ORDER BY <order_by_expression> ]
    [ <FOR Clause>]
    [ OPTION ( <query_hint> [ ,...n ] ) ]
<query_expression> ::=
    { <query_specification> | ( <query_expression> ) }
    [  { UNION [ ALL ] | EXCEPT | INTERSECT }
        <query_specification> | ( <query_expression> ) [...n ] ]
<query_specification> ::=
SELECT [ ALL | DISTINCT ]
    [TOP ( expression ) [PERCENT] [ WITH TIES ] ]
    < select_list >
    [ INTO new_table ]
    [ FROM { <table_source> } [ ,...n ] ]
    [ WHERE <search_condition> ]
    [ <GROUP BY> ]
    [ HAVING < search_condition > ]
```

### Sample Source Patterns

#### SELECT WITH COLUMN ALIASES

The following example demonstrates how to use column aliases in Snowflake. The first two columns, from the SQL Server code, are expected to be transformed from an assignment form into a normalized form using the `AS` keyword. The third and fourth columns are using valid Snowflake formats.

##### SQL Server

```sql
SELECT
    MyCol1Alias = COL1,
    MyCol2Alias = COL2,
    COL3 AS MyCol3Alias,
    COL4 MyCol4Alias
FROM TABLE1;
```

##### Snowflake

```sql
SELECT
    COL1 AS MyCol1Alias,
    COL2 AS MyCol2Alias,
    COL3 AS MyCol3Alias,
    COL4 MyCol4Alias
FROM
    TABLE1;
```

#### SELECT TOP

##### SQL Server

```sql
SELECT TOP 1 * from ATable;
```

##### Snowflake

```sql
SELECT TOP 1
*
from
ATable;
```

#### SELECT INTO

The following example shows the `SELECT INTO` is transformed into a `CREATE TABLE AS`, this is because in Snowflake there is no equivalent for
`SELECT INTO` and to create a table based on a query has to be with the `CREATE TABLE AS`.

##### SQL Server

```sql
SELECT * INTO NEWTABLE FROM TABLE1;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE NEWTABLE AS
SELECT
*
FROM
TABLE1;
```

Another case is when including set operators such as `EXCEPT` and `INTERSECT`. The transformation is basically the same as the previous one.

##### SQL Server

```sql
SELECT * INTO NEWTABLE FROM TABLE1
EXCEPT
SELECT * FROM TABLE2
INTERSECT
SELECT * FROM TABLE3;
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE NEWTABLE AS
SELECT
*
FROM
TABLE1
EXCEPT
SELECT
*
FROM
TABLE2
INTERSECT
SELECT
*
FROM
TABLE3;
```

#### SELECT TOP Additional Arguments

Since `PERCENT` and `WITH TIES` keywords affect the result, and they are not supported by Snowflake, they will be commented out and added as an error.

##### SQL Server

```sql
SELECT TOP 1 PERCENT * from ATable;
SELECT TOP 1 WITH TIES * from ATable;
SELECT TOP 1 PERCENT WITH TIES * from ATable;
```

##### Snowflake

```sql
SELECT
TOP 1 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP PERCENT' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
*
from
ATable;

SELECT
TOP 1 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP WITH TIES' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
*
from
ATable;

SELECT
TOP 1 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP PERCENT AND WITH TIES' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
*
from
ATable;
```

#### SELECT FOR

The `FOR XML` clause is transformed differently depending on whether the path is empty or not.

**FOR XML PATH(‘’) — Empty path (string concatenation pattern):**
`FOR XML PATH('')` is a common SQL Server pattern used for string concatenation (before `STRING_AGG` was introduced). When the path is empty and there is no `ROOT` clause, the query is transformed to use `LISTAGG` with `CONCAT` instead of XML functions, because the intent is string aggregation rather than XML generation.

##### SQL Server

```sql
SELECT ',' + column1,
       ' ' + column2
FROM my_table
FOR XML PATH('');
```

##### Snowflake

```sql
SELECT
  LISTAGG ( CONCAT(',' || column1, ' ' || column2), '')
FROM
  my_table;
```

When there is a single expression, `CONCAT` is omitted:

##### SQL Server

```sql
SELECT ',' + column1 FROM my_table FOR XML PATH('');
```

##### Snowflake

```sql
SELECT
  LISTAGG ( ',' || column1, '')
FROM
  my_table;
```

**FOR XML PATH — Non-empty path (XML generation):**
When the path is not empty, the `FOR XML PATH` clause is converted to use `FOR_XML_UDF` with `OBJECT_CONSTRUCT` to produce XML output. This conversion emits [SSC-FDM-TS0016](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md) because the resulting XML format in Snowflake may differ from SQL Server.

##### SQL Server

```sql
SELECT t.id, t.name as full_name, t.hint FROM demo t FOR XML PATH;
```

##### Snowflake

```sql
SELECT
--** SSC-FDM-TS0016 - XML COLUMNS IN SNOWFLAKE MIGHT HAVE A DIFFERENT FORMAT **
PUBLIC.FOR_XML_UDF(OBJECT_CONSTRUCT('id', t.id, 'full_name', t.name, 'hint', t.hint), 'row')
FROM
demo t;
```

#### SELECT OPTION

The `OPTION` clause is not supported by Snowflake. It will be commented out and added as a warning during the transformation.

Notice that the `OPTION` statement has been removed from transformation because it is not relevant or not needed in Snowflake.

##### SQL Server

```sql
SELECT column1, column2 FROM my_table OPTION (HASH GROUP, FAST 10);
```

##### Snowflake

```sql
SELECT
column1,
column2
FROM
my_table;
```

#### SELECT WITH

The `WITH` clause is not supported by Snowflake. It will be commented out and added as a warning during the transformation.

Notice that the `WITH(NOLOCK, NOWAIT)` statement has been removed from transformation because it is not relevant or not needed in Snowflake.

##### SQL Server

```sql
SELECT AValue from ATable WITH(NOLOCK, NOWAIT);
```

##### Snowflake

```sql
SELECT
AValue
from
ATable;
```

### Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
2. [SSC-FDM-TS0016](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): XML columns in Snowflake might have a different format

## SET OPERATORS

Applies to

* SQL Server
* Azure Synapse Analytics

Set Operators in both TSQL and Snowflake present the same syntax and supported scenarios(EXCEPT, INTERSECT, UNION and UNION ALL), with the exception of the MINUS which is not supported in TSQL, resulting in the same code during conversion.

```sql
SELECT LastName, FirstName FROM employees
UNION ALL
SELECT FirstName, LastName FROM contractors;

SELECT ...
INTERSECT
SELECT ...

SELECT ...
EXCEPT
SELECT ...
```

## TRUNCATE

Transact-SQL TRUNCATE statement transformation details

Applies to

* SQL Server
* Azure Synapse Analytics

Some parts in the output code are omitted for clarity reasons.

### SQL Server

```sql
TRUNCATE TABLE TABLE1;
```

### Snowflake

```sql
TRUNCATE TABLE TABLE1;
```

## UPDATE

Translation reference to convert SQL Server Update statement to Snowflake

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Changes existing data in a table or view in SQL Server. For more information, see the [SQL Server UPDATE documentation](https://docs.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver15).

```sql
[ WITH <common_table_expression> [...n] ]
UPDATE
    [ TOP ( expression ) [ PERCENT ] ]
    { { table_alias | <object> | rowset_function_limited
         [ WITH ( <Table_Hint_Limited> [ ...n ] ) ]
      }
      | @table_variable
    }
    SET
        { column_name = { expression | DEFAULT | NULL }
          | { udt_column_name.{ { property_name = expression
                                | field_name = expression }
                                | method_name ( argument [ ,...n ] )
                              }
          }
          | column_name { .WRITE ( expression , @Offset , @Length ) }
          | @variable = expression
          | @variable = column = expression
          | column_name { += | -= | *= | /= | %= | &= | ^= | |= } expression
          | @variable { += | -= | *= | /= | %= | &= | ^= | |= } expression
          | @variable = column { += | -= | *= | /= | %= | &= | ^= | |= } expression
        } [ ,...n ]

    [ <OUTPUT Clause> ]
    [ FROM{ <table_source> } [ ,...n ] ]
    [ WHERE { <search_condition>
            | { [ CURRENT OF
                  { { [ GLOBAL ] cursor_name }
                      | cursor_variable_name
                  }
                ]
              }
            }
    ]
    [ OPTION ( <query_hint> [ ,...n ] ) ]
[ ; ]

<object> ::=
{
    [ server_name . database_name . schema_name .
    | database_name .[ schema_name ] .
    | schema_name .
    ]
    table_or_view_name}
```

### Sample Source Patterns

### Basic UPDATE

The conversion for a regular UPDATE statement is very straightforward. Since the basic UPDATE structure is supported by default in Snowflake,
the outliers are the parts where you are going to see some differences.

#### SQL Server

```sql
Update UpdateTest1
Set Col1 = 5;
```

#### Snowflake

```sql
Update UpdateTest1
Set
Col1 = 5;
```

### Cartesian Products

SQL Server allows add circular references between the target table of the Update Statement and the FROM Clause/ In execution time, the database optimizer removes any cartesian product generated. Otherwise, Snowflake currently does not optimize this scenario, producing a cartesian product that can be checked in the Execution Plan.\

To resolve this, if there is a JOIN where one of their tables is the same as the update target, this reference is removed and added to the WHERE clause, and it is used to just filter the data and avoid making a set operation.

#### SQL Server

```sql
UPDATE [HumanResources].[EMPLOYEEDEPARTMENTHISTORY_COPY]
SET
	BusinessEntityID = b.BusinessEntityID ,
	DepartmentID = b.DepartmentID,
	ShiftID = b.ShiftID,
	StartDate = b.StartDate,
	EndDate = b.EndDate,
	ModifiedDate = b.ModifiedDate
	FROM [HumanResources].[EMPLOYEEDEPARTMENTHISTORY_COPY] AS a
	RIGHT OUTER JOIN [HumanResources].[EmployeeDepartmentHistory] AS b
	ON a.BusinessEntityID = b.BusinessEntityID and a.ShiftID = b.ShiftID;
```

#### Snowflake

```sql
UPDATE HumanResources.EMPLOYEEDEPARTMENTHISTORY_COPY a
	SET
		BusinessEntityID = b.BusinessEntityID,
		DepartmentID = b.DepartmentID,
		ShiftID = b.ShiftID,
		StartDate = b.StartDate,
		EndDate = b.EndDate,
		ModifiedDate = b.ModifiedDate
	FROM
		HumanResources.EmployeeDepartmentHistory AS b
	WHERE
		a.BusinessEntityID(+) = b.BusinessEntityID
		AND a.ShiftID(+) = b.ShiftID;
```

### OUTPUT clause

The OUTPUT clause is not supported by Snowflake.

#### SQL Server

```sql
Update UpdateTest2
Set Col1 = 5
OUTPUT
	deleted.Col1,
	inserted.Col1
	into ValuesTest;
```

#### Snowflake

```sql
Update UpdateTest2
	Set
		Col1 = 5
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - OUTPUT CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
OUTPUT
	deleted.Col1,
	inserted.Col1
	into ValuesTest;
```

### CTE

The WITH CTE clause is moved to the internal query in the update statement to be supported by Snowflake.

#### SQL Server

```sql
With ut as (select * from UpdateTest3)
Update x
Set Col1 = 5
from ut as x;
```

#### Snowflake

```sql
UPDATE UpdateTest3
Set
Col1 = 5
FROM
(
WITH ut as (select
*
from
UpdateTest3
)
SELECT
*
FROM
ut
) AS x;
```

### TOP clause

The TOP clause is not supported by Snowflake.

#### SQL Server

```sql
Update TOP(10) UpdateTest4
Set Col1 = 5;
```

#### Snowflake

```sql
Update
--       !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - TOP CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
-- TOP(10)
         UpdateTest4
Set
Col1 = 5;
```

### WITH TABLE HINT LIMITED

The Update WITH clause in not supported by Snowflake.

#### SQL Server

```sql
Update UpdateTest5 WITH(TABLOCK)
Set Col1 = 5;
```

#### Snowflake

```sql
Update UpdateTest5
Set
Col1 = 5;
```

### Related EWIs

1. [SSC-EWI-0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Syntax not supported in Snowflake.

## UPDATE WITH JOIN

Translation specification for UPDATE statement with WHERE and JOIN clauses

> **Warning:**
>
> *This is a work in progress and may change in the future.*

### Description

The pattern UPDATE FROM is used to update data based on data from other tables. This [SQLServer documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver16#OtherTables) provides a simple sample.

Review the following SQL Server syntax from the [documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver16#UpdateExamples).

#### SQL Server Syntax

```sql
UPDATE [table_name]
SET column_name = expression [, ...]
[FROM <table_source> [, ...]]
[WHERE <search_condition>]
[OPTION (query_hint)]
```

* **`table_name`**: The table or view you are updating.
* **`SET`**: Specifies the columns and their new values. The `SET` clause assigns a new value (or expression) to one or more columns.
* **`FROM`**: Used to specify one or more source tables (*like a **join**)*. It helps define where the data comes from to perform the update.
* **`WHERE`**: Specifies which rows should be updated based on the condition(s). Without this clause, all rows in the table would be updated.
* **`OPTION (query_hint)`**: Specifies hints for query optimization.

##### Snowflake syntax

The Snowflake syntax can also be reviewed in the [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/sql/update).

> **Note:**
>
> Snowflake does not support `JOINs` in `UPDATE` clause.

```sql
 UPDATE <target_table>
       SET <col_name> = <value> [ , <col_name> = <value> , ... ]
        [ FROM <additional_tables> ]
        [ WHERE <condition> ]
```

**Required parameters**

* ***`target_table:`***Specifies the table to update.
* ***`col_name:`***Specifies the name of a column in *`target_table`*. Do not include the table name. E.g., `UPDATE t1 SET t1.col = 1` is invalid.
* ***`value`**`:`*Specifies the new value to set in *`col_name`*.

**Optional parameters**

* **``` FROM`` ```** ***`additional_tables:`*** Specifies one or more tables to use for selecting rows to update or for setting new values. *Note that repeating the target table results in a self-join.*
* **``` WHERE`` ```** ***`condition:`***The expression that specifies the rows in the target table to update. Default: No value (all rows of the target table are updated)

#### Translation Summary

| SQL Server JOIN type | Snowflake Best Alternative |
| --- | --- |
| Single `INNER JOIN` | Use the target table in the `FROM` clause to emulate an `INNER JOIN`. |
| Multiple `INNER JOIN` | Use the target table in the `FROM` clause to emulate an `INNER JOIN`. |
| Multiple `INNER JOIN` + Agregate condition | Use subquery + IN Operation |
| Single `LEFT JOIN` | Use subquery + IN Operation |
| Multiple `LEFT JOIN` | Use Snowflake `UPDATE` reordering the statements as needed.  **`UPDATE`** `[target_table_name]`  **`SET`** `[all_set_statements]`  **`FROM`** `[all_left_join_tables_separated_by_comma]`  **`WHERE`** `[all_clauses_into_the_ON_part]` |
| Multiple `RIGHT JOIN` | Use Snowflake `UPDATE` reordering the statements as needed.  **`UPDATE`** `[target_table_name]`  **`SET`** `[all_set_statements]`  **`FROM`** `[all_right_join_tables_separated_by_comma]`  **`WHERE`** `[all_clauses_into_the_ON_part]` |
| Single RIGHT JOIN | Use the table in the `FROM` clause and add filters in the `WHERE` clause as needed. |

*Note-1: Simple JOIN may use the table in the `FROM` clause and add filters in the `WHERE` clause as needed.*

*Note-2: Other approaches may include (+) operand to define the JOINs.*

### Sample Source Patterns

#### Setup data

##### SQLServer

```sql
CREATE TABLE Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    ProductID INT,
    Quantity INT,
    OrderDate DATE
);

CREATE TABLE Customers (
    CustomerID INT PRIMARY KEY,
    CustomerName VARCHAR(100)
);

CREATE TABLE Products (
    ProductID INT PRIMARY KEY,
    ProductName VARCHAR(100),
    Price DECIMAL(10, 2)
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    ProductID INT,
    Quantity INT,
    OrderDate DATE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/12/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE Customers (
    CustomerID INT PRIMARY KEY,
    CustomerName VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/12/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE Products (
    ProductID INT PRIMARY KEY,
    ProductName VARCHAR(100),
    Price DECIMAL(10, 2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/12/2024",  "domain": "test" }}'
;
```

Data Insertion for samples

```sql
-- Insert Customer Data
INSERT INTO Customers (CustomerID, CustomerName) VALUES (1, 'John Doe');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (2, 'Jane Smith');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (3, 'Alice Johnson');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (4, 'Bob Lee');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (5, 'Charlie Brown');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (6, 'David White');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (7, 'Eve Black');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (8, 'Grace Green');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (9, 'Hank Blue');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (10, 'Ivy Red');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (11, 'Jack Grey');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (12, 'Kim Yellow');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (13, 'Leo Purple');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (14, 'Mona Pink');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (15, 'Nathan Orange');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (16, 'Olivia Cyan');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (17, 'Paul Violet');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (18, 'Quincy Brown');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (19, 'Rita Silver');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (20, 'Sam Green');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (21, 'Tina Blue');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (22, 'Ursula Red');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (23, 'Vince Yellow');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (24, 'Wendy Black');
INSERT INTO Customers (CustomerID, CustomerName) VALUES (25, 'Xander White');

-- Insert Product Data
INSERT INTO Products (ProductID, ProductName, Price) VALUES (1, 'Laptop', 999.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (2, 'Smartphone', 499.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (3, 'Tablet', 299.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (4, 'Headphones', 149.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (5, 'Monitor', 199.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (6, 'Keyboard', 49.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (7, 'Mouse', 29.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (8, 'Camera', 599.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (9, 'Printer', 99.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (10, 'Speaker', 129.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (11, 'Charger', 29.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (12, 'TV', 699.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (13, 'Smartwatch', 199.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (14, 'Projector', 499.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (15, 'Game Console', 399.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (16, 'Speaker System', 299.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (17, 'Earphones', 89.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (18, 'USB Drive', 15.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (19, 'External Hard Drive', 79.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (20, 'Router', 89.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (21, 'Printer Ink', 49.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (22, 'Flash Drive', 9.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (23, 'Gamepad', 34.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (24, 'Webcam', 49.99);
INSERT INTO Products (ProductID, ProductName, Price) VALUES (25, 'Docking Station', 129.99);

-- Insert Orders Data
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (1, 1, 1, 2, '2024-11-01');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (2, 2, 2, 1, '2024-11-02');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (3, 3, 3, 5, '2024-11-03');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (4, 4, 4, 3, '2024-11-04');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (5, NULL, 5, 7, '2024-11-05');  -- NULL Customer
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (6, 6, 6, 2, '2024-11-06');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (7, 7, NULL, 4, '2024-11-07');  -- NULL Product
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (8, 8, 8, 1, '2024-11-08');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (9, 9, 9, 3, '2024-11-09');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (10, 10, 10, 2, '2024-11-10');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (11, 11, 11, 5, '2024-11-11');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (12, 12, 12, 2, '2024-11-12');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (13, NULL, 13, 8, '2024-11-13');  -- NULL Customer
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (14, 14, NULL, 4, '2024-11-14');  -- NULL Product
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (15, 15, 15, 3, '2024-11-15');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (16, 16, 16, 2, '2024-11-16');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (17, 17, 17, 1, '2024-11-17');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (18, 18, 18, 4, '2024-11-18');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (19, 19, 19, 3, '2024-11-19');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (20, 20, 20, 6, '2024-11-20');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (21, 21, 21, 3, '2024-11-21');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (22, 22, 22, 5, '2024-11-22');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (23, 23, 23, 2, '2024-11-23');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (24, 24, 24, 4, '2024-11-24');
INSERT INTO Orders (OrderID, CustomerID, ProductID, Quantity, OrderDate) VALUES (25, 25, 25, 3, '2024-11-25');
```

#### Case 1: Single `INNER JOIN` Update

For INNER JOIN, if the table is used inside the FROM statements, it automatically turns into INNER JOIN. Notice that there are several approaches to support JOINs in UPDATE statements in Snowflake. This is one of the simplest patterns to ensure readability.

##### SQL Server

```sql
UPDATE Orders
SET Quantity = 10
FROM Orders O
INNER JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerName = 'John Doe';

-- Select the changes
SELECT Orders.CustomerID, Orders.Quantity, Customers.CustomerName
FROM Orders, Customers
WHERE Orders.CustomerID = Customers.CustomerID
AND Customers.CustomerName = 'John Doe';
```

##### Output

| CustomerID | Quantity | CustomerName |
| --- | --- | --- |
| 1 | 10 | John Doe |

##### Snowflake

```sql
UPDATE Orders O
SET O.Quantity = 10
FROM
  Customers C
WHERE
  C.CustomerName = 'John Doe'
  AND O.CustomerID = C.CustomerID;

-- Select the changes
SELECT Orders.CustomerID, Orders.Quantity, Customers.CustomerName
FROM Orders, Customers
WHERE Orders.CustomerID = Customers.CustomerID
AND Customers.CustomerName = 'John Doe';
```

##### Output

| CustomerID | Quantity | CustomerName |
| --- | --- | --- |
| 1 | 10 | John Doe |

**Other approaches:**

MERGE INTO

```sql
MERGE INTO Orders O
USING Customers C
ON O.CustomerID = C.CustomerID
WHEN MATCHED AND C.CustomerName = 'John Doe' THEN
  UPDATE SET O.Quantity = 10;
```

IN Operation

```sql
UPDATE Orders O
SET O.Quantity = 10
WHERE O.CustomerID IN
  (SELECT CustomerID FROM Customers WHERE CustomerName = 'John Doe');
```

#### Case 2: Multiple `INNER JOIN` Update

##### SQL Server

```sql
 UPDATE Orders
SET Quantity = 5
FROM Orders O
INNER JOIN Customers C ON O.CustomerID = C.CustomerID
INNER JOIN Products P ON O.ProductID = P.ProductID
WHERE C.CustomerName = 'Alice Johnson' AND P.ProductName = 'Tablet';

-- Select the changes
SELECT Orders.CustomerID, Orders.Quantity, Customers.CustomerName FROM Orders, Customers
WHERE Orders.CustomerID = Customers.CustomerID
AND Customers.CustomerName = 'Alice Johnson';
```

##### Output

| CustomerID | Quantity | CustomerName |
| --- | --- | --- |
| 3 | 5 | Alice Johnson |

##### Snowflake

```sql
UPDATE Orders O
SET O.Quantity = 5
FROM Customers C, Products P
WHERE O.CustomerID = C.CustomerID
  AND C.CustomerName = 'Alice Johnson'
  AND P.ProductName = 'Tablet'
  AND O.ProductID = P.ProductID;

-- Select the changes
SELECT Orders.CustomerID, Orders.Quantity, Customers.CustomerName FROM Orders, Customers
WHERE Orders.CustomerID = Customers.CustomerID
AND Customers.CustomerName = 'Alice Johnson';
```

##### Output

| CustomerID | Quantity | CustomerName |
| --- | --- | --- |
| 3 | 5 | Alice Johnson |

#### Case 3: Multiple `INNER JOIN` Update with Aggregate Condition

##### SQL Server

```sql
UPDATE Orders
SET Quantity = 6
FROM Orders O
INNER JOIN Customers C ON O.CustomerID = C.CustomerID
INNER JOIN Products P ON O.ProductID = P.ProductID
WHERE C.CustomerID IN (SELECT CustomerID FROM Orders WHERE Quantity > 3)
AND P.Price < 200;

SELECT C.CustomerID, C.CustomerName, O.Quantity, P.Price FROM Orders O
INNER JOIN Customers C ON O.CustomerID = C.CustomerID
INNER JOIN Products P ON O.ProductID = P.ProductID
WHERE C.CustomerID IN (SELECT CustomerID FROM Orders WHERE Quantity > 3)
AND P.Price < 200;
```

##### Output

| CustomerID | CustomerName | Quantity | Price |
| --- | --- | --- | --- |
| 11 | Jack Grey | 6 | 29.99 |
| 18 | Quincy Brown | 6 | 15.99 |
| 20 | Sam Green | 6 | 89.99 |
| 22 | Ursula Red | 6 | 9.99 |
| 24 | Wendy Black | 6 | 49.99 |

##### Snowflake

```sql
UPDATE Orders O
SET Quantity = 6
WHERE O.CustomerID IN (SELECT CustomerID FROM Orders WHERE Quantity > 3)
AND O.ProductID IN (SELECT ProductID FROM Products WHERE Price < 200);

-- Select changes
SELECT C.CustomerID, C.CustomerName, O.Quantity, P.Price FROM Orders O
INNER JOIN Customers C ON O.CustomerID = C.CustomerID
INNER JOIN Products P ON O.ProductID = P.ProductID
WHERE C.CustomerID IN (SELECT CustomerID FROM Orders WHERE Quantity > 3)
AND P.Price < 200;
```

##### Output

| CustomerID | CustomerName | Quantity | Price |
| --- | --- | --- | --- |
| 11 | Jack Grey | 6 | 29.99 |
| 18 | Quincy Brown | 6 | 15.99 |
| 20 | Sam Green | 6 | 89.99 |
| 22 | Ursula Red | 6 | 9.99 |
| 24 | Wendy Black | 6 | 49.99 |

#### Case 4: Single `LEFT JOIN` Update

##### SQL Server

```sql
UPDATE Orders
SET Quantity = 13
FROM Orders O
LEFT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerID IS NULL AND O.ProductID = 13;

-- Select the changes
SELECT * FROM orders
WHERE CustomerID IS NULL;
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | OrderDate |
| --- | --- | --- | --- | --- |
| 5 | null | 5 | 7 | 2024-11-05 |
| 13 | null | 13 | 13 | 2024-11-13 |

##### Snowflake

```sql
UPDATE Orders
SET Quantity = 13
WHERE OrderID IN (
  SELECT O.OrderID
  FROM Orders O
  LEFT JOIN Customers C ON O.CustomerID = C.CustomerID
  WHERE C.CustomerID IS NULL AND O.ProductID = 13
);

-- Select the changes
SELECT * FROM orders
WHERE CustomerID IS NULL;
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | OrderDate |
| --- | --- | --- | --- | --- |
| 5 | null | 5 | 7 | 2024-11-05 |
| 13 | null | 13 | 13 | 2024-11-13 |

> **Note:**
>
> This approach in Snowflake will not work because it does not update the necessary rows:
>
> `UPDATE Orders O SET O.Quantity = 13 FROM Customers C WHERE O.CustomerID = C.CustomerID AND C.CustomerID IS NULL AND O.ProductID = 13;`

#### Case 5: Multiple `LEFT JOIN` and `RIGHT JOIN` Update

This is a more complex pattern. To translate multiple LEFT JOINs, please review the following pattern:

> **Note:**
>
> `LEFT JOIN` and `RIGHT JOIN` will depend on the order in the `FROM` clause.

```sql
UPDATE [target_table_name]
SET [all_set_statements]
FROM [all_left_join_tables_separated_by_comma]
WHERE [all_clauses_into_the_ON_part]
```

##### SQL Server

```sql
UPDATE Orders
SET
    Quantity = C.CustomerID
FROM Orders O
LEFT JOIN Customers C ON C.CustomerID = O.CustomerID
LEFT JOIN Products P ON P.ProductID = O.ProductID
WHERE C.CustomerName = 'Alice Johnson'
  AND P.ProductName = 'Tablet';

SELECT O.OrderID, O.CustomerID, O.ProductID, O.Quantity, O.OrderDate
FROM Orders O
LEFT JOIN Customers C ON C.CustomerID = O.CustomerID
LEFT JOIN Products P ON P.ProductID = O.ProductID
WHERE C.CustomerName = 'Alice Johnson'
  AND P.ProductName = 'Tablet';
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | OrderDate |
| --- | --- | --- | --- | --- |
| 3 | 3 | 3 | 3 | 2024-11-12 |

##### Snowflake

```sql
UPDATE Orders O
SET O.Quantity = C.CustomerID
FROM Customers C, Products P
WHERE O.CustomerID = C.CustomerID
  AND C.CustomerName = 'Alice Johnson'
  AND P.ProductName = 'Tablet'
  AND O.ProductID = P.ProductID;

  SELECT O.OrderID, O.CustomerID, O.ProductID, O.Quantity, O.OrderDate
FROM Orders O
LEFT JOIN Customers C ON C.CustomerID = O.CustomerID
LEFT JOIN Products P ON P.ProductID = O.ProductID
WHERE C.CustomerName = 'Alice Johnson'
  AND P.ProductName = 'Tablet';
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | OrderDate |
| --- | --- | --- | --- | --- |
| 3 | 3 | 3 | 3 | 2024-11-12 |

#### Case 6: Mixed `INNER JOIN` and `LEFT JOIN` Update

##### SQL Server

```sql
UPDATE Orders
SET Quantity = 4
FROM Orders O
INNER JOIN Products P ON O.ProductID = P.ProductID
LEFT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerID IS NULL AND P.ProductName = 'Monitor';

-- Select changes
SELECT O.CustomerID, C.CustomerName, O.Quantity FROM Orders O
INNER JOIN Products P ON O.ProductID = P.ProductID
LEFT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerID IS NULL AND P.ProductName = 'Monitor';
```

##### Output

| CustomerID | CustomerName | Quantity |
| --- | --- | --- |
| null | null | 4 |

##### Snowflake

```sql
UPDATE Orders O
SET Quantity = 4
WHERE O.ProductID IN (SELECT ProductID FROM Products WHERE ProductName = 'Monitor')
AND O.CustomerID IS NULL;

-- Select changes
SELECT O.CustomerID, C.CustomerName, O.Quantity FROM Orders O
INNER JOIN Products P ON O.ProductID = P.ProductID
LEFT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerID IS NULL AND P.ProductName = 'Monitor';
```

##### Output

| CustomerID | CustomerName | Quantity |
| --- | --- | --- |
| null | null | 4 |

#### Case 7: Single `RIGHT JOIN` Update

##### SQL Server

```sql
UPDATE O
SET O.Quantity = 1000
FROM Orders O
RIGHT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE C.CustomerName = 'Alice Johnson';

-- Select changes
SELECT
    O.OrderID,
    O.CustomerID,
    O.ProductID,
    O.Quantity,
    O.OrderDate,
    C.CustomerName
FROM
    Orders O
RIGHT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE
    C.CustomerName = 'Alice Johnson';
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | CustomerName |
| --- | --- | --- | --- | --- |
| 3 | 3 | 3 | 1000 | Alice Johnson |

##### Snowflake

```sql
UPDATE Orders O
SET O.Quantity = 1000
FROM Customers C
WHERE O.CustomerID = C.CustomerID
  AND C.CustomerName = 'Alice Johnson';

  -- Select changes
SELECT
    O.OrderID,
    O.CustomerID,
    O.ProductID,
    O.Quantity,
    O.OrderDate,
    C.CustomerName
FROM
    Orders O
RIGHT JOIN Customers C ON O.CustomerID = C.CustomerID
WHERE
    C.CustomerName = 'Alice Johnson';
```

##### Output

| OrderID | CustomerID | ProductID | Quantity | CustomerName |
| --- | --- | --- | --- | --- |
| 3 | 3 | 3 | 1000 | Alice Johnson |

#### Known Issues

* Since `UPDATE` in Snowflake does not allow the usage of `JOINs` directly, there may be cases that do not match the patterns described.

## UPDATE with LEFT and RIGHT JOIN

Translation specification for the UPDATE statement with JOINs.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Warning:**
>
> Partially supported in Snowflake

### Description

The pattern UPDATE FROM is used to update data based on data from other tables. This [SQLServer documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver16#OtherTables) provides a simple sample.

Review the following SQL Server syntax from the [documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/update-transact-sql?view=sql-server-ver16#UpdateExamples).

#### SQL Server Syntax

```sql
UPDATE [table_name]
SET column_name = expression [, ...]
[FROM <table_source> [, ...]]
[WHERE <search_condition>]
[OPTION (query_hint)]
```

* **`table_name`**: The table or view you are updating.
* **`SET`**: Specifies the columns and their new values. The `SET` clause assigns a new value (or expression) to one or more columns.
* **`FROM`**: Used to specify one or more source tables (*like a **join**)*. It helps define where the data comes from to perform the update.
* **`WHERE`**: Specifies which rows should be updated based on the condition(s). Without this clause, all rows in the table would be updated.
* **`OPTION (query_hint)`**: Specifies hints for query optimization.

##### Snowflake syntax

The Snowflake syntax can also be reviewed in the [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/sql/update).

> **Note:**
>
> Snowflake does not support `JOINs` in `UPDATE` clause.

```sql
 UPDATE <target_table>
       SET <col_name> = <value> [ , <col_name> = <value> , ... ]
        [ FROM <additional_tables> ]
```

**Required parameters**

* ***`target_table:`***Specifies the table to update.
* ***`col_name:`***Specifies the name of a column in *`target_table`*. Do not include the table name. E.g., `UPDATE t1 SET t1.col = 1` is invalid.
* ***`value`**`:`*Specifies the new value to set in *`col_name`*.

**Optional parameters**

* **``` FROM`` ```** ***`additional_tables:`*** Specifies one or more tables to use for selecting rows to update or for setting new values. *Note that repeating the target table results in a self-join.*
* **``` WHERE`` ```** ***`condition:`***The expression that specifies the rows in the target table to update. Default: No value (all rows of the target table are updated)

#### Translation Summary

As it is explained in the grammar description, there is not straight forward equivalent solution for JOINs inside the UPDATE cluase. For this reason, the approach to transform this statements is to add the operator (+) on the column that logically will add the required data into the table. This operator (+) is added to the cases on which the tables are referenced in the `LEFT`/`RIGHT` `JOIN` section.

Notice that there are other languages that use this operator (+) and the position of the operator may determine the type of join. In this specific case in Snowflake, the position will not determine the join type but the association with the logically needed tables and columns will.

Even when there are other alternative as MERGE clause or the usages of a CTE; these alternatives tend to turn difficult to read when there are complex queries, and get extensive.

### Sample Source Patterns

#### Setup data

##### SQL Server

```sql
 CREATE TABLE GenericTable1 (
    Col1 INT,
    Col2 VARCHAR(10),
    Col3 VARCHAR(10),
    Col4 VARCHAR(10),
    Col5 VARCHAR(10),
    Col6 VARCHAR(100)
);

CREATE TABLE GenericTable2 (
    Col1 VARCHAR(10),
    Col2 VARCHAR(10),
    Col3 VARCHAR(10),
    Col4 VARCHAR(10),
    Col5 VARCHAR(10)
);

CREATE TABLE GenericTable3 (
    Col1 VARCHAR(10),
    Col2 VARCHAR(100),
    Col3 CHAR(1)
);

INSERT INTO GenericTable1 (Col1, Col2, Col3, Col4, Col5, Col6)
VALUES
(1, 'A1', 'B1', 'C1', NULL, NULL),
(2, 'A2', 'B2', 'C2', NULL, NULL),
(3, 'A3', 'B3', 'C3', NULL, NULL);

INSERT INTO GenericTable2 (Col1, Col2, Col3, Col4, Col5)
VALUES
('1', 'A1', 'B1', 'C1', 'X1'),
('2', 'A2', 'B2', 'C2', 'X2'),
('3', 'A3', 'B3', 'C3', 'X3');

INSERT INTO GenericTable3 (Col1, Col2, Col3)
VALUES
('X1', 'Description1', 'A'),
('X2', 'Description2', 'A'),
('X3', 'Description3', 'A');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE GenericTable1 (
    Col1 INT,
    Col2 VARCHAR(10),
    Col3 VARCHAR(10),
    Col4 VARCHAR(10),
    Col5 VARCHAR(10),
    Col6 VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "12/18/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE GenericTable2 (
    Col1 VARCHAR(10),
    Col2 VARCHAR(10),
    Col3 VARCHAR(10),
    Col4 VARCHAR(10),
    Col5 VARCHAR(10)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "12/18/2024",  "domain": "test" }}'
;

CREATE OR REPLACE TABLE GenericTable3 (
    Col1 VARCHAR(10),
    Col2 VARCHAR(100),
    Col3 CHAR(1)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "12/18/2024",  "domain": "test" }}'
;

INSERT INTO GenericTable1 (Col1, Col2, Col3, Col4, Col5, Col6)
VALUES
(1, 'A1', 'B1', 'C1', NULL, NULL),
(2, 'A2', 'B2', 'C2', NULL, NULL),
(3, 'A3', 'B3', 'C3', NULL, NULL);

INSERT INTO GenericTable2 (Col1, Col2, Col3, Col4, Col5)
VALUES
('1', 'A1', 'B1', 'C1', 'X1'),
('2', 'A2', 'B2', 'C2', 'X2'),
('3', 'A3', 'B3', 'C3', 'X3');

INSERT INTO GenericTable3 (Col1, Col2, Col3)
VALUES
('X1', 'Description1', 'A'),
('X2', 'Description2', 'A'),
('X3', 'Description3', 'A');
```

#### LEFT JOIN

##### SQL Server

```sql
 UPDATE T1
SET
    T1.Col5 = T2.Col5,
    T1.Col6 = T3.Col2
FROM GenericTable1 T1
LEFT JOIN GenericTable2 T2 ON
    T2.Col1 COLLATE SQL_Latin1_General_CP1_CI_AS = T1.Col1
    AND T2.Col2 = T1.Col2
    AND T2.Col3 = T1.Col3
    AND T2.Col4 = T1.Col4
LEFT JOIN GenericTable3 T3 ON
    T3.Col1 = T2.Col5 AND T3.Col3 = 'A';
```

##### Output Before Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *null* | *null* |
| 2 | A2 | B2 | C2 | *null* | *null* |
| 3 | A3 | B3 | C3 | *null* | *null* |

##### Output After Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *X1* | *Description1* |
| 2 | A2 | B2 | C2 | *X2* | *Description2* |
| 3 | A3 | B3 | C3 | *X3* | *Description3* |

##### Snowflake

```sql
 UPDATE dbo.GenericTable1 T1
    SET
        T1.Col5 = T2.Col5,
        T1.Col6 = T3.Col2
    FROM
        GenericTable2 T2,
        GenericTable3 T3
    WHERE
        T2.Col1(+) COLLATE 'EN-CI-AS' /*** SSC-FDM-TS0002 - COLLATION FOR VALUE CP1 NOT SUPPORTED ***/ = T1.Col1
        AND T2.Col2(+) = T1.Col2
        AND T2.Col3(+) = T1.Col3
        AND T2.Col4(+) = T1.Col4
        AND T3.Col1(+) = T2.Col5
        AND T3.Col3 = 'A';
```

##### Output Before Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *null* | *null* |
| 2 | A2 | B2 | C2 | *null* | *null* |
| 3 | A3 | B3 | C3 | *null* | *null* |

##### Output After Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *X1* | *Description1* |
| 2 | A2 | B2 | C2 | *X2* | *Description2* |
| 3 | A3 | B3 | C3 | *X3* | *Description3* |

#### RIGHT JOIN

##### SQL Server

```sql
UPDATE T1
SET
    T1.Col5 = T2.Col5
FROM GenericTable2 T2
RIGHT JOIN GenericTable1 T1 ON
    T2.Col1 COLLATE SQL_Latin1_General_CP1_CI_AS = T1.Col1
    AND T2.Col2 = T1.Col2
    AND T2.Col3 = T1.Col3
    AND T2.Col4 = T1.Col4;
```

##### Output Before Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *null* | *null* |
| 2 | A2 | B2 | C2 | *null* | *null* |
| 3 | A3 | B3 | C3 | *null* | *null* |

##### Output After Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *\*\*X1* | *null* |
| 2 | A2 | B2 | C2 | *\*\*X2* | *null* |
| 3 | A3 | B3 | C3 | *\*\*X3* | *null* |

##### Snowflake

```sql
 UPDATE dbo.GenericTable1 T1
    SET
        T1.Col5 = T2.Col5
    FROM
        GenericTable2 T2,
        GenericTable1 T1
    WHERE
        T2.Col1 COLLATE 'EN-CI-AS' /*** SSC-FDM-TS0002 - COLLATION FOR VALUE CP1 NOT SUPPORTED ***/ = T1.Col1
        AND T2.Col2 = T1.Col2(+)
        AND T2.Col3 = T1.Col3(+)
        AND T2.Col4 = T1.Col4(+);
```

##### Output Before Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *null* | *null* |
| 2 | A2 | B2 | C2 | *null* | *null* |
| 3 | A3 | B3 | C3 | *null* | *null* |

##### Output After Query

| Col1 | Col2 | Col3 | Col4 | Col5 | Col6 |
| --- | --- | --- | --- | --- | --- |
| 1 | A1 | B1 | C1 | *\*\*X1* | *null* |
| 2 | A2 | B2 | C2 | *\*\*X2* | *null* |
| 3 | A3 | B3 | C3 | *\*\*X3* | *null* |

### Known Issues

* There may be patterns that cannot be translated due to differences in logic.
* If your query pattern applies, review non-deterministic rows: “When a [FROM](https://docs.snowflake.com/en/sql-reference/constructs/from)
  clause contains a [JOIN](https://docs.snowflake.com/en/sql-reference/constructs/join) between tables (e.g. `t1` and `t2`), a target row in
  `t1` may join against (i.e. match) more than one row in table `t2`. When this occurs, the target row is called a *multi-joined row*. When
  updating a multi-joined row, the
  [ERROR_ON_NONDETERMINISTIC_UPDATE](https://docs.snowflake.com/en/sql-reference/parameters.html#label-error-on-nondeterministic-update)
  session parameter controls the outcome of the update” ([Snowflake documentation](https://docs.snowflake.com/en/sql-reference/sql/update)).

---
title: SnowConvert AI - SQL Server-Azure Synapse - EXIT HANDLER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-exit-handler.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - EXIT HANDLER

## Description

In SQL Server and Azure Synapse Analytics, exception handling is primarily managed through `TRY...CATCH` blocks. Unlike some other database systems (such as Teradata or DB2), SQL Server does not have a native `DECLARE EXIT HANDLER` statement.

However, when migrating code from other database systems that use EXIT HANDLERs, SnowConvert AI transforms these constructs into equivalent Snowflake Scripting exception handling mechanisms.

An EXIT HANDLER in source systems terminates the current block when a specific condition is met and transfers control to the handler code before returning to the caller. In Snowflake, this is achieved using EXCEPTION blocks with appropriate exit behavior.

For more information about SQL Server error handling, see [TRY…CATCH (Transact-SQL)](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/try-catch-transact-sql).

## Grammar Syntax

SQL Server does not have native EXIT HANDLER syntax. However, when converting from other database systems, the source pattern typically looks like:

```sql
-- Pattern from source systems (e.g., DB2, Teradata)
DECLARE EXIT HANDLER FOR condition_value
  handler_action_statement;
```

## Sample Source Patterns

### EXIT HANDLER Conversion from DB2/Teradata

When migrating stored procedures from DB2 or Teradata that contain EXIT HANDLER declarations, SnowConvert AI transforms them into Snowflake-compatible exception handling.

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
-- Example pattern from source system
CREATE PROCEDURE exit_handler_example()
BEGIN
    DECLARE EXIT HANDLER FOR SQLEXCEPTION
    BEGIN
        INSERT INTO error_log VALUES (SQLCODE, SQLERRM, CURRENT_TIMESTAMP);
        ROLLBACK;
    END;

    -- Main procedure logic
    INSERT INTO orders VALUES (1, 100.00);
    UPDATE inventory SET quantity = quantity - 1 WHERE product_id = 1;

    -- This will NOT execute if an error occurred
    INSERT INTO audit_log VALUES ('Transaction completed successfully');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE exit_handler_example()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        -- Main procedure logic
        INSERT INTO orders VALUES (1, 100.00);
        UPDATE inventory SET quantity = quantity - 1 WHERE product_id = 1;

        -- This will NOT execute if an error occurred
        INSERT INTO audit_log VALUES ('Transaction completed successfully');

        EXCEPTION
            WHEN OTHER THEN
                BEGIN
                    INSERT INTO error_log
                    VALUES (:SQLCODE, :SQLERRM, CURRENT_TIMESTAMP());
                    ROLLBACK;
                END;
    END;
$$;
```

### EXIT HANDLER with Specific Error Codes

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE specific_error_handler()
BEGIN
    DECLARE EXIT HANDLER FOR SQLSTATE '23505'
    BEGIN
        INSERT INTO error_log VALUES ('Duplicate key error');
        RETURN -1;
    END;

    INSERT INTO users VALUES (1, 'John Doe');
    INSERT INTO users VALUES (1, 'Jane Doe');  -- Will trigger handler

    -- This will NOT execute
    INSERT INTO success_log VALUES ('All inserts completed');
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE specific_error_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO users VALUES (1, 'John Doe');
        INSERT INTO users VALUES (1, 'Jane Doe');  -- Will trigger handler

        -- This will NOT execute
        INSERT INTO success_log VALUES ('All inserts completed');

        EXCEPTION
            WHEN OTHER THEN
                LET errcode := :SQLCODE;
                LET sqlerrmsg := :SQLERRM;
                IF (errcode = '100183' OR CONTAINS(sqlerrmsg, 'duplicate key')) THEN
                    INSERT INTO error_log VALUES ('Duplicate key error');
                    RETURN -1;
                ELSE
                    RAISE;
                END IF;
    END;
$$;
```

### EXIT HANDLER with NOT FOUND

#### Input Code:

##### Source (DB2/Teradata Pattern)

```sql
CREATE PROCEDURE not_found_handler()
BEGIN
    DECLARE v_name VARCHAR(100);

    DECLARE EXIT HANDLER FOR NOT FOUND
    BEGIN
        INSERT INTO log_table VALUES ('Record not found');
        RETURN 0;
    END;

    SELECT name INTO v_name FROM employees WHERE id = 9999;

    -- This will NOT execute if no record found
    INSERT INTO results VALUES (v_name);
END;
```

#### Output Code:

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE not_found_handler()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        v_name VARCHAR(100);
    BEGIN
        SELECT name INTO v_name FROM employees WHERE id = 9999;

        -- This will NOT execute if no record found
        INSERT INTO results VALUES (v_name);

        EXCEPTION
            WHEN NO_DATA_FOUND THEN
                BEGIN
                    INSERT INTO log_table VALUES ('Record not found');
                    RETURN 0;
                END;
    END;
$$;
```

## Known Issues

### EXIT HANDLER Behavior

Applies to

* SQL Server
* Azure Synapse Analytics

SQL Server’s native `TRY...CATCH` mechanism provides similar functionality to EXIT HANDLER. When an error occurs in a TRY block, control passes to the CATCH block, and execution does not continue after the CATCH block in the current scope.

SnowConvert AI transforms EXIT HANDLER patterns to Snowflake EXCEPTION blocks, which provide equivalent exit behavior:

1. **Execution Termination**: The current block is terminated when an exception occurs.
2. **Control Flow**: Control passes to the exception handler, executes the handler code, then exits the block.
3. **Return Behavior**: The procedure can return a value or status from within the exception handler.

### Multiple EXIT Handlers

When multiple EXIT HANDLERs are defined in the source system, they must be merged into a single EXCEPTION block with conditional logic:

#### Source Pattern

```sql
DECLARE EXIT HANDLER FOR SQLSTATE '23505'
    INSERT INTO log VALUES ('Duplicate key');

DECLARE EXIT HANDLER FOR SQLEXCEPTION
    INSERT INTO log VALUES ('General error');
```

#### Snowflake

```sql
EXCEPTION
    WHEN OTHER THEN
        LET errcode := :SQLCODE;
        LET sqlerrmsg := :SQLERRM;
        IF (errcode = '100183' OR CONTAINS(sqlerrmsg, 'duplicate key')) THEN
            INSERT INTO log VALUES ('Duplicate key');
        ELSE
            INSERT INTO log VALUES ('General error');
        END IF;
```

### Mixed CONTINUE and EXIT Handlers

Applies to

* SQL Server
* Azure Synapse Analytics

Source systems may allow mixing CONTINUE and EXIT handlers in the same block. This pattern cannot be directly replicated in Snowflake, as EXCEPTION blocks handle errors uniformly.

When this pattern is encountered:

* Separate EXCEPTION blocks may be generated
* An EWI warning (`SSC-EWI-0114`) is added
* Manual review is recommended

#### Related EWIs

1. [SSC-EWI-0114](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): MIXED CONTINUE AND EXIT EXCEPTION HANDLERS IN THE SAME BLOCK ARE NOT SUPPORTED BY SNOWFLAKE SCRIPTING

## Best Practices

When working with converted EXIT HANDLER code:

1. **Understand Exit Semantics**: EXIT handlers terminate the current block. Verify this matches your application’s requirements.
2. **Test Error Scenarios**: Thoroughly test all error conditions to ensure proper exit behavior.
3. **Use Transactions**: Leverage Snowflake’s transaction support for data consistency.
4. **Return Values**: Use RETURN statements in exception handlers to communicate exit status to callers.
5. **Logging**: Implement comprehensive error logging to track when and why procedures exit.
6. **Nested Blocks**: Remember that EXIT behavior only affects the current block, not outer blocks.

## Related Documentation

* [Snowflake Exception Handling](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/exceptions)
* [SQL Server TRY…CATCH](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/try-catch-transact-sql)
* [TRY CATCH Translation Reference](transact-create-procedure-snow-script.md)

## See Also

* [CONTINUE HANDLER](transact-continue-handler.md)
* [CREATE PROCEDURE](transact-create-procedure.md)
* [CREATE PROCEDURE - Snowflake Scripting](transact-create-procedure-snow-script.md)
* [General Statements](transact-general-statements.md)

---
title: SnowConvert AI - SQL Server-Azure Synapse - General Language Elements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-general-statements.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - General Language Elements

In this section you could find information about general statements of Transact-SQL.

## COLLATE

Applies to

* SQL Server
* Azure Synapse Analytics

The transformation of the collate depends on its value, since it can be supported or not supported.

Currently, these are the languages that are supported for the transformation, if they are found in the collate, they will be transformed into its Snowflake equivalent.

| SQL Server | Snowflake |
| --- | --- |
| Latin1_General | EN |
| Modern_Spanish | ES |
| French | FR |

If the language is not one of the above, the collate will be commented.

The collate in SQL Server comes with additional specifications, such as **CI, CS, AI,** and **AS**. If there are additional specifications that are unsupported, they will be commented in the result.

### Source

```sql
SELECT 'a' COLLATE Latin1_General_CI_AS;

SELECT 'a' COLLATE Modern_Spanish_CI_AS;

SELECT 'a' COLLATE French_CI_AS;

SELECT 'a' COLLATE Albanian_BIN;

SELECT 'a' COLLATE Latin1_General_CI_AS_WS;

SELECT 'a' COLLATE Latin1_General_CI_AS_KS_WS;

SELECT 'a' COLLATE Albanian_CI_AI;
```

### Expected

```sql
SELECT 'a' COLLATE 'EN-CI-AS';

SELECT 'a' COLLATE 'ES-CI-AS';

SELECT 'a' COLLATE 'FR-CI-AS';

SELECT 'a'
--           !!!RESOLVE EWI!!! /*** SSC-EWI-TS0077 - COLLATION Albanian_BIN NOT SUPPORTED ***/!!!
-- COLLATE Albanian_BIN
                     ;

SELECT 'a' COLLATE 'EN-CI-AS' /*** SSC-FDM-TS0002 - COLLATION FOR VALUE WS NOT SUPPORTED ***/;

SELECT 'a' COLLATE 'EN-CI-AS' /*** SSC-FDM-TS0002 - COLLATION FOR VALUES KS,WS NOT SUPPORTED ***/;

SELECT 'a'
--           !!!RESOLVE EWI!!! /*** SSC-EWI-TS0077 - COLLATION Albanian_CI_AI NOT SUPPORTED ***/!!!
-- COLLATE Albanian_CI_AI
                       ;
```

Let’s see an example of collate in a Create Table

### Source

```sql
CREATE TABLE TABLECOLLATE
(
    COL1 VARCHAR COLLATE Latin1_General_CI_AS
);
```

### Expected

```sql
CREATE OR REPLACE TABLE TABLECOLLATE
(
    COL1 VARCHAR COLLATE 'EN-CI-AS' /*** SSC-PRF-0002 - CASE INSENSITIVE COLUMNS CAN DECREASE THE PERFORMANCE OF QUERIES ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

As you can see, the transformation of Collate inside a Select or a Table is the same.

### Related EWIs

1. [SSC-EWI-TS0077](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): This message is shown when there is a collate clause that is not supported in Snowflake.
2. [SSC-FDM-TS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): This message is shown when there is a collate clause that is not supported in Snowflake.
3. [SSC-PRF-0002](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Case-insensitive columns can decrease the performance of queries.

## COMPUTED COLUMN

The computed expression could not be transformed.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The expression of a computed column could not be transformed.

#### Code Example

##### Input Code:

```sql
CREATE TABLE [TestTable](
    [Col1] AS (CONVERT ([REAL], ExpressionValue))
);
```

##### Output Code:

```sql
CREATE OR REPLACE TABLE TestTable (
    Col1 REAL AS (CAST(ExpressionValue AS REAL)) /*** SSC-FDM-TS0014 - COMPUTED COLUMN WAS TRANSFORMED TO ITS SNOWFLAKE EQUIVALENT, FUNCTIONAL EQUIVALENCE VERIFICATION PENDING. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

#### Recommendations

* Add manual changes to the not-transformed expression.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

1. [SSC-FDM-TS0014](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Computed column transformed.

## OUTER APPLY

Outer apply statement equivalence translation.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

When OUTER APPLY is specified, one row is produced for each row of the left rowset even when the right-side rowset expression returns an empty rowset for that row. ([OUTER APPLY Definition](https://learn.microsoft.com/en-us/u-sql/statements-and-expressions/select/from/select-selecting-from-cross-apply-and-outer-apply))

### Syntax

```sql
   Apply_Operator :=
       'CROSS' 'APPLY'
  |    'OUTER' 'APPLY'.
```

### Snowflake equivalence

Despite the unsupported statement OUTER APPLY in Snowflake, there is an equivalent statement, which is LATERAL. Hence, the translation for the statement is conducted to get the same functionality through the use of alternative solutions.

Nevertheless, the LATERAL statement in Snowflake has two variations in syntax. In fact, the INNER JOIN LATERAL variation is used in this specific translation.

The INNER JOIN LATERAL grammar from Snowflake is the following:

```sql
 SELECT ...
FROM <left_hand_table_expression> INNER JOIN LATERAL ( <inline_view> )
...
```

> **Note:**
>
> *<inline_view>* must not be a table name.

And, the single LATERAL statement is shown below:

```sql
 SELECT ...
FROM <left_hand_table_expression>, LATERAL ( <inline_view> )
...
```

### Sample source

The following example shows a general translation between OUTER APPLY and INNER JOIN LATERAL:

#### SQL Server

```sql
SELECT  p.ProjectName, e.ProjectName, e.FirstName
FROM Project p
OUTER APPLY (
    SELECT
        ProjectName,
        FirstName,
        LastName
    FROM Employees e
) e;
```

#### Output

| p.ProjectName | e.ProjectName | FirstName |
| --- | --- | --- |
| Project A | Project A | John |
| Project A | Project A | Jane |
| Project A | Project B | Michael |
| Project B | Project A | John |
| Project B | Project A | Jane |
| Project B | Project B | Michael |
| Project C | Project A | John |
| Project C | Project A | Jane |
| Project C | Project B | Michael |

#### Snowflake

```sql
 SELECT
    p.ProjectName,
    e.ProjectName,
    e.FirstName
FROM
    Project p
    INNER JOIN
        LATERAL (
                   SELECT
                       ProjectName,
                       FirstName,
                       LastName
                   FROM
                       Employees e
               ) e;
```

#### Output

| PROJECTNAME | PROJECTNAME_2 | FIRSTNAME |
| --- | --- | --- |
| Project A | Project A | John |
| Project A | Project A | Jane |
| Project A | Project B | Michael |
| Project B | Project A | John |
| Project B | Project A | Jane |
| Project B | Project B | Michael |
| Project C | Project A | John |
| Project C | Project A | Jane |
| Project C | Project B | Michael |

### Known issues

Since the translation is an equivalence from the input, there are some limitations.

* TOP and WHERE statements may be reviewed for optimal behavior.
* A correlation name at the end of the statement may be needed. In Snowflake, the query does not represent a problem if the correlation name is not in the query, but functionality may change and does not form part of the accepted pattern in SQL Server.

#### SQL Server

```sql
SELECT
    SATT.UNIVERSAL_NAME
FROM
SAMPLE_ATLAS AS SATT
OUTER APPLY (
    SELECT
        TOP 1 UNIVERSAL_NAME,
        INTERNATIONAL_NAME,
        CODE_IDENTIFIER
    FROM
        SAMPLE_GLOBE AS SG
    WHERE
        SG.GLOBE_KEY = SATT.MbrPersGenKey
    ORDER BY
        GLOBE_KEY
);
```

##### Translation output

```sql
SELECT
            UNIVERSAL_NAME
FROM
            SAMPLE_ATLAS
            AS SATT
            OUTER APPLY
                        /*** MSC-ERROR - MSCCP0001 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/ (SELECT TOP 1
                                                UNIVERSAL_NAME,
                                                INTERNATIONAL_NAME,
                                                CODE_IDENTIFIER
                                    FROM
                                                SAMPLE_GLOBE AS SG
                                    WHERE
                                                SG.GLOBE_KEY = SATT.MbrPersGenKey
                                    ORDER BY GLOBE_KEY
                        );
```

* Specific statements that are not supported may comment out all the block code (example taken from: [JSON Example](https://learn.microsoft.com/en-us/sql/relational-databases/json/validate-query-and-change-json-data-with-built-in-functions-sql-server?view=sql-server-ver16)).

##### SQL Server

```sql
SELECT
    SATT.UNIVERSAL_NAME
FROM
SAMPLE_ATLAS AS SATT
INNER JOIN LATERAL (
    SELECT
        TOP 1 UNIVERSAL_NAME,
        INTERNATIONAL_NAME,
        CODE_IDENTIFIER
    FROM
        SAMPLE_GLOBE AS SG
    WHERE
        SG.GLOBE_KEY = SATT.MbrPersGenKey
    ORDER BY
        GLOBE_KEY
);
```

##### Translation output

```sql
SELECT
	familyName,
	c.givenName AS childGivenName,
	c.firstName AS childFirstName,
	p.givenName AS petName
FROM
	Families f
	LEFT OUTER JOIN
		OPENJSON(f.doc) /*** MSC-WARNING - MSCEWI4030 - Equivalence from CROSS APPLY to LEFT OUTER JOIN must be checked. ***/;
-- ** MSC-ERROR - MSCEWI1001 - UNRECOGNIZED TOKEN ON LINE 7 OF THE SOURCE CODE. **
--		WITH (familyName nvarchar(100), children nvarchar(max) AS JSON)
--		CROSS APPLY OPENJSON(children)
--		WITH (givenName nvarchar(100), firstName nvarchar(100), pets nvarchar(max) AS JSON) as c
--			OUTER APPLY OPENJSON (pets)
--			WITH (givenName nvarchar(100))  as p
```

### Related EWIs

No related EWIs.

## USE

Transact-SQL USE statement Snowflake equivalence.

Applies to

* SQL Server

The [USE](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/use-transact-sql?view=sql-server-ver15) statement has its own equivalent in Snowflake. The statement will be translated to the [USE DATABASE](https://docs.snowflake.com/en/sql-reference/sql/use-database.html) statement in Snowflake.

### Translation Examples

#### Source

```sql
USE [MY DATABASE]
```

#### Output

```sql
USE DATABASE "MY DATABASE";
```

#### Database name

The `database name` specified in the `USE` statement, could have a change if it comes inside *Square Brackets* **`([ ])`**. The first bracket and the last bracket will be replaced with *quotes.* Example:

##### Source

```sql
[MYDATABASE]
[[[MYDATABASE]]
```

##### Output

```sql
"MYDATABASE"
"[[MYDATABASE]"
```

#### User Defined Database

If a user specifies to the Conversion Tool a custom database name to be applied to all the objects by using the `-d` parameter, and wants the USE statements to be transformed, the Database name should be applied just to the `USE` statement and not to the objects. This will override the specified database from the use statement. Example:

##### Source

```sql
-- Additional Params: -d MYCUSTOMDB
USE [MY DATABASE]

CREATE TABLE [TableName1].[TableName2](
	[ColumnName1] varchar NULL
);
```

##### Output

```sql
-- Additional Params: -d MYCUSTOMDB
USE DATABASE MYCUSTOMDB;

CREATE OR REPLACE TABLE MYCUSTOMDB.TableName1.TableName2 (
	ColumnName1 VARCHAR NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## EXECUTE

Applies to

* SQL Server
* Azure Synapse Analytics

The translation for **Exec** or **Execute** Statements is not supported in Snowflake, but it will be translated to **CALL** statement.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Input

```sql
Exec db.sp1
```

### Output

```sql
CALL db.sp1();
```

For more information about Execute visit: [Execute inside Procedures](transact-create-procedure.md)

## PRINT

Applies to

* SQL Server
* Azure Synapse Analytics

The **Print** statement is not directly supported in Snowflake, but it will be translated to its closest equivalent, the **SYSTEM$LOG_INFO** built-in function.

### Input

```sql
PRINT 'My message';
```

### Output (Inside SnowScript)

```sql
SYSTEM$LOG_INFO('My message');
```

### Output (Outside of SnowScript)

When the **Print** statement is used outside of a stored procedure, it is required to be called from a SnowConvert AI UDP.

```sql
CALL PUBLIC.LOG_INFO_UDP('My message');
```

Before you can begin logging messages, you must set up an event table. For more information, see: [Logging messages in Snowflake Scripting](../../../../developer-guide/logging-tracing/logging-snowflake-scripting.md)

## System Stored Procedures

## SP_EXECUTESQL

Translation specification for the system procedure SP_EXECUTESQL.

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

The SP_EXECUTESQL system stored procedure is used to execute a Transact-SQL statement or batch that can be reused many times, or one that is built dynamically. The statement or batch can contain embedded parameters.

This functionality can be emulated in Snowflake through the EXECUTE IMMEDIATE statement and with a user-defined function (UDF) for embedded parameters.

For more information about the user-defined function (UDF) used for this translation,
check [TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(STRING, STRING, ARRAY, ARRAY)](../../general/technical-documentation/function-references/sql-server/README.md).

#### Syntax

##### Transact

```sql
 sp_executesql [ @stmt = ] N'statement'
[
    [ , [ @params = ] N'@parameter_name data_type [ { OUT | OUTPUT } ] [ , ...n ]' ]
    [ , [ @param1 = ] 'value1' [ , ...n ] ]
]
```

### Sample Source Patterns

All patterns will transform SP_EXECUTESQL into Snowflake’s EXECUTE IMMEDIATE statement and only modify the SQL string to be executed when using embedded parameters.

> **Warning:**
>
> [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) (Usage of Dynamic SQL) will be added for all patterns. Even though the translation for SP_EXECUTESQL is equivalent to Snowflake, in this context, this EWI indicates that the SQL string might require manual fixes for the translation to execute as intended.

#### Setup Data

##### Transact

```sql
 CREATE TABLE PERSONS(
  NAME VARCHAR(25),
  ID INT,
  AGE INT
);

-- DATA
INSERT INTO PERSONS VALUES ('John Smith', 1, 24);
INSERT INTO PERSONS VALUES ('John Doe', 2, 21);
INSERT INTO PERSONS VALUES ('Mary Keller', 3, 32);
INSERT INTO PERSONS VALUES ('Mundane Man', 4, 18);
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE PERSONS (
  NAME VARCHAR(25),
  ID INT,
  AGE INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
;

-- DATA
INSERT INTO PERSONS VALUES ('John Smith', 1, 24);
INSERT INTO PERSONS VALUES ('John Doe', 2, 21);
INSERT INTO PERSONS VALUES ('Mary Keller', 3, 32);
INSERT INTO PERSONS VALUES ('Mundane Man', 4, 18);
```

#### Without embedded parameters

When no embedded parameters are being used, the SP_EXECUTESQL is transformed into an EXECUTE IMMEDIATE statement and use the SQL string without modifications.

##### Transact

```sql
 CREATE PROCEDURE SIMPLE_SINGLE_QUERY
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    SET @SQLString = N'SELECT * FROM PERSONS';
    EXECUTE sp_executesql @SQLString;
END

GO

EXEC SIMPLE_SINGLE_QUERY;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE SIMPLE_SINGLE_QUERY ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQLSTRING VARCHAR(500);
    ProcedureResultSet RESULTSET;
  BEGIN

    SQLSTRING := 'SELECT
   *
FROM
   PERSONS;';
    ProcedureResultSet := (
      !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
      EXECUTE IMMEDIATE :SQLSTRING
    );
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL SIMPLE_SINGLE_QUERY();
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |

#### With embedded parameters for data binding

For embedded parameters for data binding, the SP_EXECUTESQL is transformed into an EXECUTE IMMEDIATE statement, and the SQL string is modified through the `TRANSFORM_SP_EXECUTE_SQL_STRING_UDF`.

The result of the EXECUTE IMMEDIATE is assigned to the `ProcedureResultSet` variable and later returned as `TABLE(ProcedureResultSet)`.

##### Transact

```sql
 CREATE PROCEDURE QUERY_WITH_DATA_BINDING_PARAMS
AS
BEGIN
    DECLARE @IntVariable INT;
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);

    SET @IntVariable = 21;
    SET @SQLString = N'SELECT * FROM PERSONS WHERE AGE = @age';
    SET @ParmDefinition = N'@age INT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @age = @IntVariable;
END

GO

EXEC QUERY_WITH_DATA_BINDING_PARAMS;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Doe | 2 | 21 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE QUERY_WITH_DATA_BINDING_PARAMS ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    INTVARIABLE INT;
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    ProcedureResultSet RESULTSET;
  BEGIN

    INTVARIABLE := 21;
    SQLSTRING := 'SELECT
   *
FROM
   PERSONS
WHERE
   AGE = @age;';
    PARMDEFINITION := '@age INT';
    ProcedureResultSet := (
      !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
      EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('AGE'), ARRAY_CONSTRUCT(:INTVARIABLE))
    );
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL QUERY_WITH_DATA_BINDING_PARAMS();
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Doe | 2 | 21 |

#### With embedded OUTPUT parameters

For embedded OUTPUT parameters, the SP_EXECUTESQL is transformed into an EXECUTE IMMEDIATE statement, and the SQL string is modified through the `TRANSFORM_SP_EXECUTE_SQL_STRING_UDF`.

Additionally, a `SELECT $1, ..., $n INTO :outputParam1, ..., :outputParamN FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))` is added to the result of each column to the corresponding OUTPUT parameter.

> **Warning:**
>
> SSC-FDM-TS0028 is added to the SELECT INTO statement. It is essential for the parameters in the INTO clause to appear in the same order as they were assigned in the original SQL String.
>
> Otherwise, manual changes are required to meet this requirement.

##### Transact

```sql
CREATE PROCEDURE QUERY_WITH_OUTPUT_PARAMS
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParamDefinition NVARCHAR(500);
    DECLARE @MaxAge INT;

    SET @SQLString = N'SELECT @MaxAgeOUT = max(AGE) FROM PERSONS';
    SET @ParamDefinition = N'@MaxAgeOUT INT OUTPUT';
    EXECUTE sp_executesql @SQLString, @ParamDefinition, @MaxAgeOUT = @MaxAge OUTPUT;

    SELECT @MaxAge;
END

GO

EXEC QUERY_WITH_OUTPUT_PARAMS;
```

##### Results

| <anonymous> |
| --- |
| 32 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE QUERY_WITH_OUTPUT_PARAMS ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/27/2024",  "domain": "test" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        SQLSTRING VARCHAR(500);
        PARAMDEFINITION VARCHAR(500);
        MAXAGE INT;
        ProcedureResultSet RESULTSET;
    BEGIN

        SQLSTRING := 'SELECT
   MAX(AGE) FROM
   PERSONS;';
        PARAMDEFINITION := '@MaxAgeOUT INT OUTPUT';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARAMDEFINITION, ARRAY_CONSTRUCT('MAXAGEOUT'), ARRAY_CONSTRUCT(:MAXAGE));
        --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
        SELECT
            $1
        INTO
            :MAXAGE
        FROM
            TABLE(RESULT_SCAN(LAST_QUERY_ID()));
        ProcedureResultSet := (
        SELECT
            :MAXAGE);
        RETURN TABLE(ProcedureResultSet);
    END;
$$;

CALL QUERY_WITH_OUTPUT_PARAMS();
```

##### Results

| :MAXAGE::NUMBER(38,0) |
| --- |
| 32 |

#### With both embedded OUTPUT parameters and data binding

The translation is the same as for only OUTPUT parameters.

##### Transact

```sql
CREATE PROCEDURE QUERY_WITH_BOTH_PARAMS
AS
BEGIN
    DECLARE @AgeVariable INT;
    DECLARE @IdVariable INT;
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    SET @AgeVariable = 30;
    SET @IdVariable = 100;
    SET @SQLString = N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE AGE < @age AND ID < @id;';
    SET @ParmDefinition = N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @age = @AgeVariable, @id = @IdVariable, @MaxAgeOUT = @MaxAge OUTPUT, @MaxIdOUT = @MaxId OUTPUT;

    SELECT @MaxAge, @MaxId;
END

GO

EXEC QUERY_WITH_BOTH_PARAMS;
```

##### Results

| <anonymous> | <anonymous> |
| --- | --- |
| 24 | 4 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE QUERY_WITH_BOTH_PARAMS ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    AGEVARIABLE INT;
    IDVARIABLE INT;
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    AGEVARIABLE := 30;
    IDVARIABLE := 100;
    SQLSTRING := 'SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   AGE < @age AND ID < @id;';
    PARMDEFINITION := '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('AGE', 'ID', 'MAXAGEOUT', 'MAXIDOUT'), ARRAY_CONSTRUCT(:AGEVARIABLE, :IDVARIABLE, :MAXAGE, :MAXID));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXAGE,
      :MAXID
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL QUERY_WITH_BOTH_PARAMS();
```

##### Results

| :MAXAGE::NUMBER(38,0) | :MAXID::NUMBER(38,0) |
| --- | --- |
| 24 | 4 |

#### Parameters not in order of definition

This pattern follows the same rules as the previous patterns. `TRANSFORM_SP_EXECUTE_SQL_STRING_UDF` replaces the parameter values in the correct order.

##### Transact

```sql
CREATE PROCEDURE QUERY_PARAMS_NOT_IN_ORDER_OF_DEF
AS
BEGIN
    DECLARE @AgeVariable INT;
    DECLARE @IdVariable INT;
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    SET @AgeVariable = 30;
    SET @IdVariable = 100;
    SET @SQLString = N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE AGE < @age AND ID < @id;';
    SET @ParmDefinition = N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @id = @IdVariable, @MaxAgeOUT = @MaxAge OUTPUT, @age = @AgeVariable, @MaxIdOUT = @MaxId OUTPUT;

    SELECT @MaxAge, @MaxId;
END

GO

EXEC QUERY_PARAMS_NOT_IN_ORDER_OF_DEF;

CREATE PROCEDURE QUERY_PARAMS_NOT_IN_ORDER_OF_DEF_2
AS
BEGIN
    DECLARE @AgeVariable INT;
    DECLARE @IdVariable INT;
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    SET @AgeVariable = 30;
    SET @IdVariable = 100;
    SET @SQLString = N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE AGE < @age AND ID < @id;';
    SET @ParmDefinition = N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @AgeVariable, @MaxAgeOUT = @MaxAge OUTPUT, @id = @IdVariable, @MaxIdOUT = @MaxId OUTPUT;

    SELECT @MaxAge, @MaxId;
END

GO

EXEC QUERY_PARAMS_NOT_IN_ORDER_OF_DEF_2;
```

##### Results

| <anonymous> | <anonymous> |
| --- | --- |
| 24 | 4 |

| <anonymous> | <anonymous> |
| --- | --- |
| 24 | 4 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE QUERY_PARAMS_NOT_IN_ORDER_OF_DEF ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    AGEVARIABLE INT;
    IDVARIABLE INT;
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    AGEVARIABLE := 30;
    IDVARIABLE := 100;
    SQLSTRING := 'SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   AGE < @age AND ID < @id;';
    PARMDEFINITION := '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('ID', 'MAXAGEOUT', 'AGE', 'MAXIDOUT'), ARRAY_CONSTRUCT(:IDVARIABLE, :MAXAGE, :AGEVARIABLE, :MAXID));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXAGE,
      :MAXID
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL QUERY_PARAMS_NOT_IN_ORDER_OF_DEF();

CREATE OR REPLACE PROCEDURE QUERY_PARAMS_NOT_IN_ORDER_OF_DEF_2 ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    AGEVARIABLE INT;
    IDVARIABLE INT;
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    AGEVARIABLE := 30;
    IDVARIABLE := 100;
    SQLSTRING := 'SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   AGE < @age AND ID < @id;';
    PARMDEFINITION := '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('', 'MAXAGEOUT', 'ID', 'MAXIDOUT'), ARRAY_CONSTRUCT(:AGEVARIABLE, :MAXAGE, :IDVARIABLE, :MAXID));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXAGE,
      :MAXID
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL QUERY_PARAMS_NOT_IN_ORDER_OF_DEF_2();
```

##### Results

| :MAXAGE::NUMBER(38,0) | :MAXID::NUMBER(38,0) |
| --- | --- |
| 24 | 4 |

| :MAXAGE::NUMBER(38,0) | :MAXID::NUMBER(38,0) |
| --- | --- |
| 24 | 4 |

#### Execute direct values

This translation also handles the cases where the values are directly assigned instead of using variables.

##### Transact

```sql
CREATE PROCEDURE QUERY_WITH_DIRECT_PARAMS_VALUES_ALL
AS
BEGIN
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    EXECUTE sp_executesql
        N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE ID < @id AND AGE < @age;',
        N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT',
        30,
        100,
        @MaxAge OUTPUT,
        @MaxId OUTPUT;

    SELECT @MaxAge, @MaxId;
END

GO

EXEC QUERY_WITH_DIRECT_PARAMS_VALUES_ALL;
```

##### Results

| <anonymous> | <anonymous> |
| --- | --- |
| 24 | 4 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE QUERY_WITH_DIRECT_PARAMS_VALUES_ALL ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/07/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF('SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   ID < @id AND AGE < @age;', '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT', ARRAY_CONSTRUCT('', '', '', ''), ARRAY_CONSTRUCT(
    30,
    100, :MAXAGE, :MAXID));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXAGE,
      :MAXID
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL QUERY_WITH_DIRECT_PARAMS_VALUES_ALL();
```

##### Results

| :MAXAGE::NUMBER(38,0) | :MAXID::NUMBER(38,0) |
| --- | --- |
| 24 | 4 |

#### SQL string dynamically built

This pattern follows the same rules as the previous patterns. However, assigning the result of the EXECUTE IMMEDIATE statement might not be added if the SQL string is not a simple single query with or without embedded parameters.

Furthermore, the SQL string must start with the literal value `'SELECT'` for SnowConvert AI to correctly identify that a SELECT statement is going to be executed.

##### Transact

```sql
CREATE PROCEDURE DYNAMIC_WITH_PARAMS
AS
BEGIN
    DECLARE @IntVariable INT;
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);
    DECLARE  @where_clause nvarchar(100);

    SET @where_clause = 'WHERE AGE = @age';
    SET @IntVariable = 21;
    SET @SQLString = N'SELECT * FROM PERSONS ' + @where_clause;
    SET @ParmDefinition = N'@age INT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @age = @IntVariable;
END

GO

EXEC DYNAMIC_WITH_PARAMS;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Doe | 2 | 21 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE DYNAMIC_WITH_PARAMS ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    INTVARIABLE INT;
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    WHERE_CLAUSE VARCHAR(100);
    ProcedureResultSet RESULTSET;
  BEGIN

    WHERE_CLAUSE := 'WHERE AGE = @age';
    INTVARIABLE := 21;
    SQLSTRING := 'SELECT
   *
FROM
   PERSONS ' || :WHERE_CLAUSE || ';';
    PARMDEFINITION := '@age INT';
    ProcedureResultSet := (
      !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
      EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('AGE'), ARRAY_CONSTRUCT(:INTVARIABLE))
    );
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL DYNAMIC_WITH_PARAMS();
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Doe | 2 | 21 |

#### Returning multiple result sets

Snowflake Scripting procedures only allow one result set to be returned per procedure.

To replicate Transact-SQL behavior, when two or more result sets are to be returned, they are stored in temporary tables. The Snowflake Scripting procedure will return an array containing the names of the temporary tables. For more information, check [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md).

##### Transact

```sql
CREATE PROCEDURE WITH_MULTIPLE_RETURNS
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @ParmDefinition NVARCHAR(500);

    SET @SQLString = N'SELECT * FROM PERSONS WHERE AGE = @age';
    SET @ParmDefinition = N'@age INT';
    EXECUTE sp_executesql @SQLString, @ParmDefinition, @age = 21;

    SET @SQLString = N'INSERT INTO PERSONS VALUES (''INSERT FIRST'', 1200, 230);';
    EXECUTE sp_executesql @SQLString;

    SET @SQLString = N'SELECT * FROM PERSONS';
    EXECUTE sp_executesql @SQLString;
END

GO

EXECUTE WITH_MULTIPLE_RETURNS;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Doe | 2 | 21 |

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |
| INSERT FIRST | 1200 | 230 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE WITH_MULTIPLE_RETURNS ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/07/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQLSTRING VARCHAR(500);
    PARMDEFINITION VARCHAR(500);
    ProcedureResultSet1 VARCHAR;
    ProcedureResultSet2 VARCHAR;
    return_arr ARRAY := array_construct();
  BEGIN

    SQLSTRING := 'SELECT
   *
FROM
   PERSONS
WHERE
   AGE = @age;';
    PARMDEFINITION := '@age INT';
    ProcedureResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF(:SQLSTRING, :PARMDEFINITION, ARRAY_CONSTRUCT('AGE'), ARRAY_CONSTRUCT(21));
    CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet1) AS
      SELECT
        *
      FROM
        TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    return_arr := array_append(return_arr, :ProcedureResultSet1);
    SQLSTRING := 'INSERT INTO PERSONS VALUES ('INSERT FIRST', 1200, 230);';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :SQLSTRING;
    SQLSTRING := 'SELECT
   *
FROM
   PERSONS;';
    ProcedureResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :SQLSTRING;
    CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet2) AS
      SELECT
        *
      FROM
        TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    return_arr := array_append(return_arr, :ProcedureResultSet2);
    --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
    RETURN return_arr;
  END;
$$;

CALL WITH_MULTIPLE_RETURNS();
```

##### Results

| WITH_MULTIPLE_RETURNS |
| --- |
| [ “RESULTSET_88C35D7A_1E5B_455D_97A4_247806E583A5”, “RESULTSET_B2345B61_A015_43CB_BA11_6D3E013EF262” ] |

### Known Issues

#### 1. Invalid code is detected

`SP_EXECUTESQL` can execute more than one SQL statement inside the SQL string. Snowflake also supports executing multiple SQL statements, but need to be enclosed in a `BEGIN ... END` block.
Furthermore, when executing multiple statements from a `BEGIN ... END` block, the `EXECUTE IMMEDIATE` will not return a resultset.
The translation for these cases is not yet supported by SnowConvert AI.
For more information, check [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md).

Thus, when this case is detected, in the translated code, the `EXECUTE IMMEDIATE` will not be assigned to the `ProcedureResultSet`.

##### Transact

```sql
CREATE PROCEDURE WITH_INVALID_CODE_DETECTED
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    SET @SQLString = N'INSERT INTO PERSONS VALUES (''INSERT FIRST'', 1200, 230); SELECT * FROM PERSONS;';
    EXECUTE sp_executesql @SQLString;
END

GO

EXEC WITH_INVALID_CODE_DETECTED;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |
| INSERT FIRST | 1200 | 230 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE WITH_INVALID_CODE_DETECTED ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQLSTRING VARCHAR(500);
  BEGIN

    SQLSTRING := 'INSERT INTO PERSONS VALUES ('INSERT FIRST', 1200, 230); SELECT
   *
FROM
   PERSONS;';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :SQLSTRING;
  END;
$$;

CALL WITH_INVALID_CODE_DETECTED();
```

##### Results

```sql
000006 (0A000): Uncaught exception of type 'STATEMENT_ERROR' on line 10 at position 4 : Multiple SQL statements in a single API call are not supported; use one API call per statement instead.
```

#### 2. Valid or Invalid code is not detected

When the SQL string is built dynamically through concatenations, SnowConvert AI might not detect what statement is going to be executed. Thus, in the translated code, the `EXECUTE IMMEDIATE` will not be assigned to the `ProcedureResultSet`.

##### Transact

```sql
CREATE PROCEDURE WITH_INVALID_CODE_NOT_DETECTED
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    DECLARE @SQLInsert NVARCHAR(500);
    SET @SQLInsert = N'INSERT INTO PERSONS VALUES (''INSERT FIRST'', 1200, 230)';
    SET @SQLString = @SQLInsert + N'SELECT * FROM PERSONS;';
    EXECUTE sp_executesql @SQLString;
END

GO

EXEC WITH_INVALID_CODE_NOT_DETECTED;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |
| INSERT FIRST | 1200 | 230 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE WITH_INVALID_CODE_NOT_DETECTED ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQLSTRING VARCHAR(500);
    SQLINSERT VARCHAR(500);
  BEGIN

    SQLINSERT := 'INSERT INTO PERSONS VALUES ('INSERT FIRST', 1200, 230);';
    SQLSTRING := :SQLINSERT || 'SELECT * FROM PERSONS;';
    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE :SQLSTRING;
  END;
$$;

CALL WITH_INVALID_CODE_NOT_DETECTED();
```

##### Results

```sql
000006 (0A000): Uncaught exception of type 'STATEMENT_ERROR' on line 10 at position 4 : Multiple SQL statements in a single API call are not supported; use one API call per statement instead.
```

#### 3. Invalid code is mistaken as valid

If the SQL string starts with a SELECT statement and is followed by more statements, SnowConvert AI will detect this as a valid code and try to assign the result of the `EXECUTE IMMEDIATE` to the `ProcedureResultSet`. This leads to a compilation error.
For more information, check [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md).

##### Transact

```sql
CREATE PROCEDURE WITH_INVALID_CODE_MISTAKEN_AS_VALID
AS
BEGIN
    DECLARE @SQLString NVARCHAR(500);
    SET @SQLString = N'SELECT * FROM PERSONS; SELECT * FROM PERSONS;';
    EXECUTE sp_executesql @SQLString;
END

GO

EXEC WITH_INVALID_CODE_MISTAKEN_AS_VALID;
```

##### Results

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |

| Name | ID | AGE |
| --- | --- | --- |
| John Smith | 1 | 24 |
| John Doe | 2 | 21 |
| Mary Keller | 3 | 32 |
| Mundane Man | 4 | 18 |

##### Snowflake

```sql
CREATE OR REPLACE PROCEDURE WITH_INVALID_CODE_MISTAKEN_AS_VALID ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/04/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQLSTRING VARCHAR(500);
    ProcedureResultSet RESULTSET;
  BEGIN

    SQLSTRING := 'SELECT
   *
FROM
   PERSONS; SELECT
   *
FROM
   PERSONS;';
    ProcedureResultSet := (
      !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
      EXECUTE IMMEDIATE :SQLSTRING
    );
    RETURN TABLE(ProcedureResultSet);
  END;
$$;

CALL WITH_INVALID_CODE_MISTAKEN_AS_VALID();
```

##### Results

```sql
000006 (0A000): Uncaught exception of type 'STATEMENT_ERROR' on line 10 at position 4 : Multiple SQL statements in a single API call are not supported; use one API call per statement instead.
```

### Related EWIs

1. [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL
2. [SSC-FDM-TS0028](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Output parameters must have the same order as they appear in the executed code.
3. [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.

## SP_RENAME

Stored Procedure to Rename certain objects in SQL Server

Applies to

* SQL Server
* Azure Synapse Analytics

The SP_RENAME system store procedure can be emulated in Snowflake in certain scenarios. In general, the equivalent is the EXECUTE IMMEDIATE using a dynamic statement with the ALTER TABLE and the original parameters.

### Translation Examples for Tables

#### Source

```sql
EXEC sp_rename 'TABLE1', 'TABLENEW1'
```

#### Output

```sql
EXECUTE IMMEDIATE 'ALTER TABLE TABLE1 RENAME TO TABLENEW1';
```

##### Source

```sql
DECLARE @varname1 nvarchar(50) = 'previous_name'
DECLARE @varname2 nvarchar(50) = 'newer_name'
EXEC sp_rename @varname1, @varname2
```

##### Output

```sql
DECLARE
VARNAME1 VARCHAR(50) := 'previous_name';
VARNAME2 VARCHAR(50) := 'newer_name';
BEGIN
EXECUTE IMMEDIATE 'ALTER TABLE ' || :VARNAME1 || ' RENAME TO ' || :VARNAME2;
END;
```

#### Translation Examples for Columns

##### Source

```sql
EXEC sp_rename 'sample_BACKUP_2.column_old', 'column_new', 'COLUMN'
EXEC sp_rename 'database1.sample_BACKUP_3.column_old', 'column_new', 'COLUMN'
```

##### Output

```sql
EXECUTE IMMEDIATE 'ALTER TABLE sample_BACKUP_2 RENAME COLUMN column_old TO column_new';

EXECUTE IMMEDIATE 'ALTER TABLE database1.sample_BACKUP_3 RENAME COLUMN column_old TO column_new';
```

##### Source

```sql
DECLARE @oldColumnName nvarchar(50) = 'previous_name'
DECLARE @newColumnName nvarchar(50) = 'newer_name'
DECLARE @tableName nvarchar(50) = 'TABLE'
EXEC sp_rename @objname = @tableName + '.' + @oldColumnName, @newname = @newColumnName, @objtype = 'COLUMN';
```

##### Output

```sql
DECLARE
OLDCOLUMNNAME VARCHAR(50) := 'previous_name';
NEWCOLUMNNAME VARCHAR(50) := 'newer_name';
TABLENAME VARCHAR(50) := 'TABLE';
BEGIN
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0075 - TRANSLATION FOR BUILT-IN PROCEDURE 'SP_RENAME' IS NOT CURRENTLY SUPPORTED. ***/!!!
EXEC sp_rename OBJNAME = :TABLENAME || '.' || :OLDCOLUMNNAME, NEWNAME = :NEWCOLUMNNAME, OBJTYPE = 'COLUMN';
END;
```

### Related EWIs

1. [SSC-EWI-TS0075](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md):
   Translation for Built-In Procedure Is Not Currently Supported.

## WAITFOR DELAY

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

In SQL Server, [`WAITFOR DELAY`](https://learn.microsoft.com/en-us/sql/t-sql/language-elements/waitfor-transact-sql?view=sql-server-ver16) pauses execution for a specified duration. SnowConvert AI transforms `WAITFOR DELAY` statements to Snowflake’s [`CALL SYSTEM$WAIT()`](https://docs.snowflake.com/en/sql-reference/functions/system_wait) function, which provides equivalent delay functionality.

The time string is parsed and converted to seconds (or milliseconds for sub-second precision). Variables and parameters are passed through directly with an EWI warning, since `SYSTEM$WAIT` expects a numeric value rather than a time string.

> **Note:**
>
> `WAITFOR TIME` (which pauses until a specific time of day) has no Snowflake equivalent and remains flagged with SSC-EWI-0073.

### Translation Examples

#### WAITFOR DELAY with literal time

##### Input Code:

```sql
 CREATE PROCEDURE proc1()
AS
BEGIN
  WAITFOR DELAY '00:00:30';
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    CALL SYSTEM$WAIT(30);
  END;
$$;
```

#### WAITFOR DELAY with sub-second precision

##### Input Code:

```sql
 CREATE PROCEDURE proc1()
AS
BEGIN
  WAITFOR DELAY '00:00:00.500';
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    CALL SYSTEM$WAIT(500, 'MILLISECONDS');
  END;
$$;
```

#### WAITFOR DELAY with variable

##### Input Code:

```sql
 CREATE PROCEDURE proc1(@WaitTime INT)
AS
BEGIN
  WAITFOR DELAY @WaitTime;
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 (WAITTIME INT)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0094 - WAITFOR DELAY WITH VARIABLE ':WAITTIME' WAS CONVERTED TO SYSTEM$WAIT, BUT THE VARIABLE MAY CONTAIN A TIME STRING IN 'HH:MM:SS' FORMAT. SYSTEM$WAIT EXPECTS A NUMERIC VALUE IN SECONDS. ***/!!!
    CALL SYSTEM$WAIT(:WAITTIME);
  END;
$$;
```

#### WAITFOR DELAY at script level

##### Input Code:

```sql
 WAITFOR DELAY '00:00:30';
```

##### Generated Code:

```sql
 CALL SYSTEM$WAIT(30);
```

### Known Limitations

* `WAITFOR TIME` (pause until a specific time of day) has no Snowflake equivalent and is flagged with SSC-EWI-0073.
* When a variable is used, SSC-EWI-TS0094 is emitted because `SYSTEM$WAIT` expects a numeric value but the variable may contain a time string in `'HH:MM:SS'` format.

### Related EWIs

1. [SSC-EWI-TS0094](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): WAITFOR DELAY variable may contain a time string incompatible with SYSTEM$WAIT.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending functional equivalence review (emitted for WAITFOR TIME).

## CREATE STATISTICS

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

SnowConvert AI comments out `CREATE STATISTICS` statements because Snowflake automatically collects optimizer statistics and does not require this statement.

### Translation Example

#### Input Code:

```sql
CREATE STATISTICS Stats1 ON dbo.Table1(col1);
```

#### Generated Code:

```sql
----** SSC-FDM-0037 - CREATE STATISTICS NOT NEEDED. SNOWFLAKE AUTOMATICALLY COLLECTS STATISTICS. **
--CREATE STATISTICS Stats1 ON dbo.Table1 (
--  col1
--);
```

### Additional Example

#### Input Code:

```sql
CREATE STATISTICS NamePurchase ON AdventureWorks2022.Person.Person(BusinessEntityID, EmailPromotion) WITH FULLSCAN, NORECOMPUTE;
```

#### Generated Code:

```sql
----** SSC-FDM-0037 - CREATE STATISTICS NOT NEEDED. SNOWFLAKE AUTOMATICALLY COLLECTS STATISTICS. **
--CREATE STATISTICS NamePurchase ON AdventureWorks2022.Person.Person (
--  BusinessEntityID,
--  EmailPromotion
--) WITH FULLSCAN, NORECOMPUTE ;
```

### Known Limitations

* Any operational process that explicitly creates or refreshes statistics in SQL Server should be reviewed, because Snowflake manages optimizer statistics automatically.

### Related Issues

1. [SSC-FDM-0037](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Statistics function not needed in Snowflake.

## CREATE SYNONYM

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

Snowflake does not support synonyms. SnowConvert AI comments out `CREATE SYNONYM` statements with the `SSC-FDM-TS0059` marker and replaces all references to the synonymn with the original base object name, so the generated Snowflake code is functionally equivalent.

### Translation Example

#### Input Code:

```sql
CREATE SYNONYM MyProduct FOR inventory.product;
```

#### Generated Code:

```sql
----** SSC-FDM-TS0059 - SYNONYMS ARE NOT SUPPORTED IN SNOWFLAKE. REFERENCES TO THIS SYNONYM HAVE BEEN REPLACED WITH THE ORIGINAL OBJECT NAME. **
--CREATE SYNONYM MyProduct FOR inventory.product;
```

### Additional Example

When a synonym is used as a table reference in a query, SnowConvert AI replaces it with the original object name.

#### Input Code:

```sql
CREATE SYNONYM MyProduct FOR inventory.product;
GO
SELECT * FROM MyProduct;
```

#### Generated Code:

```sql
----** SSC-FDM-TS0059 - SYNONYMS ARE NOT SUPPORTED IN SNOWFLAKE. REFERENCES TO THIS SYNONYM HAVE BEEN REPLACED WITH THE ORIGINAL OBJECT NAME. **
--CREATE SYNONYM MyProduct FOR inventory.product;

SELECT
  *
FROM
  inventory.product;
```

### Related Issues

1. [SSC-FDM-TS0059](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): Synonyms are not supported in Snowflake. References to this synonym have been replaced with the original object name.

---
title: SnowConvert AI - SQL Server-Azure Synapse - Materialized View
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-materialized-view.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Materialized View

Translation reference to convert Materialized View to Snowflake Dynamic Table

Applies to

* Azure Synapse Analytics

## Description

In SnowConvert AI, Materialized Views are transformed into Snowflake Dynamic Tables. To properly configure Dynamic Tables, two essential parameters must be defined: TARGET_LAG and WAREHOUSE. If these parameters are left unspecified in the configuration options, SnowConvert AI will default to preassigned values during the conversion, as demonstrated in the example below.

For more information on Materialized Views, click [here](https://learn.microsoft.com/en-us/sql/t-sql/statements/create-materialized-view-as-select-transact-sql?view=azure-sqldw-latest).

For details on the necessary parameters for Dynamic Tables, click [here](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table).

## Sample Source Patterns

### SQL Server

```sql
CREATE MATERIALIZED VIEW sales_total
AS
SELECT SUM(amount) AS total_sales
FROM sales;
```

### Snowflake

```sql
 CREATE OR REPLACE DYNAMIC TABLE sales_total
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
AS
SELECT SUM(amount) AS total_sales
FROM
sales;
```

## Related EWIs

1. [SSC-FDM-0031](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

---
title: SnowConvert AI - SQL Server-Azure Synapse - Procedures
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-procedure.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Procedures

This section documents the transformation of the syntax and the procedure’s TSQL statements to snowflake javascript

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

## 1. CREATE PROCEDURE Translation

Snowflake `CREATE PROCEDURE` is defined in SQL Syntax whereas its inner statements are defined in JavaScript.

### Transact

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE HumanResources.uspGetAllEmployees
     @FirstName NVARCHAR(50),
     @Age INT
AS
    -- TSQL Statements and queries...
GO
```

### Snowflake

```sql
CREATE OR REPLACE PROCEDURE HumanResources.uspGetAllEmployees (FIRSTNAME STRING, AGE INT)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.
$$;
```

### Parameter’s DATA TYPE

Parameters data types are being translated to Snowflake equivalent. See also [Data Types](transact-data-types.md).

### EXEC helper

To be able to run statements from a procedure in the Snowflake environment, these statements have to be preprocessed and adapted to reflect their execution in several variables that are specific to the source language.

SnowConvert AI automatically translates the supported statements and makes use of an EXEC helper. This helper provides access and update capabilities to many variables that simulate how the execution of these statements would be in their native environment.

For instance, you may see that in the migrated procedures, there is a block of code that is always added. We are going to explain the basic structure of this code in the next section. Please keep in mind that we are always evaluating and searching for new and improved ways to streamline the transformations and any helper that we require.

#### Structure

The basic structure of the EXEC helper is as follows:

1. **Variable declaration section**: Here, we declare the different variables or objects that will contain values associated with the execution of the statements inside the procedure. This includes values such as the number of rows affected by a statement, or even the result set itself.
2. **fixBind function declaration**: This is an auxiliary function used to fix binds when they are of Date type.
3. **EXEC function declaration**: This is the main EXEC helper function. It receives the statement to execute, the array of binds (basically the variables or parameters that may be modified by the execution and require data permanence throughout the execution of the procedure), the noCatch flag that determines if the ERROR_HANDLERS must be used, and the catchFunction function for executing custom code when there’s an exception in the execution of the statement. The body of the EXEC function is very straightforward; execute the statement and store every valuable data produced by its execution, all inside an error handling block.
4. **ERROR VARS:** The EXEC catch block sets up a list of error variables such as `MESSAGE_TEXT`, `SQLCODE`, `SQLSTATE`, `PROC_NAME` and `ERROR_LINE` that could be used to retrieve values from user defined functions, to emulate the SQL Server [ERROR_LINE](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-line-transact-sql?view=sql-server-ver15), [ERROR_MESSAGE](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-message-transact-sql?view=sql-server-ver15), [ERROR_NUMBER](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-number-transact-sql?view=sql-server-ver15), [ERROR_PROCEDURE](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-procedure-transact-sql?view=sql-server-ver15) and [ERROR_STATE](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-state-transact-sql?view=sql-server-ver15) built in functions behavior. After all of these variables are set with one value, the `UPDATE_ERROR_VARS` user defined function, will be in charge of update some environment variables with the error values, to have access to them in the SQL scope.

#### Code

The following code block represents the EXEC helper inside a procedure:

```sql
   var _RS, ROW_COUNT, _ROWS, MESSAGE_TEXT, SQLCODE = 0, SQLSTATE = '00000', ERROR_HANDLERS, NUM_ROWS_AFFECTED, INTO;
   var fixBind = function (arg) {
      arg = arg == undefined ? null : arg instanceof Date ? arg.toISOString() : arg;
      return arg;
   };
   var fetch = (count,rows,stmt) => (count && rows.next() && Array.apply(null,Array(stmt.getColumnCount())).map((_,i) => rows.getColumnValue(i + 1))) || [];
   var EXEC = (stmt,binds = [],noCatch = false) => {
      binds = binds ? binds.map(fixBind) : binds;
      for(var stmt of stmt.split(";").filter((_) => _)) {
         try {
            _RS = snowflake.createStatement({
                  sqlText : stmt,
                  binds : binds
               });
            _ROWS = _RS.execute();
            ROW_COUNT = _RS.getRowCount();
            NUM_ROWS_AFFECTED = _RS.getNumRowsAffected();
            return {
               THEN : (action) => !SQLCODE && action(fetch(_ROWS))
            };
         } catch(error) {
            let rStack = new RegExp('At .*, line (\\d+) position (\\d+)');
            let stackLine = error.stackTraceTxt.match(rStack) || [0,-1];
            MESSAGE_TEXT = error.message.toString();
            SQLCODE = error.code.toString();
            SQLSTATE = error.state.toString();
            snowflake.execute({sqlText: `SELECT UPDATE_ERROR_VARS_UDF(?,?,?,?,?)`,binds: [stackLine[1], SQLCODE, SQLSTATE, MESSAGE_TEXT, PROC_NAME]});
            throw error;
         }
      }
   };
```

**Simple EXEC example**

This is a simple example of an EXEC call inside a Stored Procedure

**Source Code**

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE dbo.EXEC_EXAMPLE_1
AS
   EXECUTE('SELECT 1 AS Message');
GO
```

```sql
 -- =============================================
-- Example to execute the stored procedure
-- =============================================
EXECUTE dbo.EXEC_EXAMPLE_1
GO
```

**Expected code**

```sql
CREATE OR REPLACE PROCEDURE dbo.EXEC_EXAMPLE_1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// REGION SnowConvert AI Helpers Code
	var _RS, ROW_COUNT, _ROWS, MESSAGE_TEXT, SQLCODE = 0, SQLSTATE = '00000', OBJECT_SCHEMA_NAME  = 'dbo', ERROR_HANDLERS, NUM_ROWS_AFFECTED, PROC_NAME = arguments.callee.name, DOLLAR_DOLLAR = '$' + '$';
	function* sqlsplit(sql) {
		var part = '';
		var ismark = () => sql[i] == '$' && sql[i + 1] == '$';
		for(var i = 0;i < sql.length;i++) {
			if (sql[i] == ';') {
				yield part + sql[i];
				part = '';
			} else if (ismark()) {
				part += sql[i++] + sql[i++];
				while ( i < sql.length && !ismark() ) {
					part += sql[i++];
				}
				part += sql[i] + sql[i++];
			} else part += sql[i];
		}
		if (part.trim().length) yield part;
	};
	var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
	var fixBind = function (arg) {
		arg = arg == undefined ? null : arg instanceof Date ? formatDate(arg) : arg;
		return arg;
	};
	var EXEC = (stmt,binds = [],severity = "16",noCatch = false) => {
		binds = binds ? binds.map(fixBind) : binds;
		for(var stmt of sqlsplit(stmt)) {
			try {
				_RS = snowflake.createStatement({
						sqlText : stmt,
						binds : binds
					});
				_ROWS = _RS.execute();
				ROW_COUNT = _RS.getRowCount();
				NUM_ROWS_AFFECTED = _RS.getNumRowsAffected();
				return {
					THEN : (action) => !SQLCODE && action(fetch(_ROWS))
				};
			} catch(error) {
				let rStack = new RegExp('At .*, line (\\d+) position (\\d+)');
				let stackLine = error.stackTraceTxt.match(rStack) || [0,-1];
				MESSAGE_TEXT = error.message.toString();
				SQLCODE = error.code.toString();
				SQLSTATE = error.state.toString();
				snowflake.execute({
					sqlText : `SELECT UPDATE_ERROR_VARS_UDF(?,?,?,?,?,?)`,
					binds : [stackLine[1],SQLCODE,SQLSTATE,MESSAGE_TEXT,PROC_NAME,severity]
				});
				throw error;
			}
		}
	};
	// END REGION

	EXEC(`SELECT 1 AS Message;`);
$$;
```

**EXEC within a Stored Procedure with a parameter**

In this example, the EXEC command is inside a Stored Procedure and receives a parameter value

**Source Code**

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE dbo.EXEC_EXAMPLE_2
	@p1 varchar(50) = N''
AS
	EXEC ('SELECT ' + @p1);
GO
```

```sql
 -- =============================================
-- Example to execute the stored procedure
-- =============================================
EXECUTE dbo.EXEC_EXAMPLE_2 N'''Hello World!'''
GO
```

**Expected Code**

```sql
CREATE OR REPLACE PROCEDURE dbo.EXEC_EXAMPLE_2 (P1 STRING DEFAULT '')
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	EXEC(`SELECT
   ${P1};`);
$$;
```

**EXEC invoking a Stored Procedure with a parameter**

In this example, the EXEC invokes another Stored Procedure and passes a parameter

**Source Code**

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE dbo.EXEC_EXAMPLE_3
	@p1 varchar(50) = N''
AS
	EXEC EXEC_EXAMPLE_2 @p1
GO
```

```sql
 -- =============================================
-- Example to execute the stored procedure
-- =============================================
EXECUTE dbo.EXEC_EXAMPLE_3 N'''Hello World!'''
GO
```

**Expected Code**

```sql
CREATE OR REPLACE PROCEDURE dbo.EXEC_EXAMPLE_3 (P1 STRING DEFAULT '')
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	EXEC(`CALL EXEC_EXAMPLE_2(?)`,[P1]);
$$;
```

### Parameters with Default Value.

In SQL Server, there can be parameters with a default value in case these are not specified when a procedure is being called.

#### SQL Server

```sql
CREATE PROCEDURE PROC_WITH_DEFAULT_PARAMS1
@PARAM1 INT = 0, @PARAM2 INT = 0, @PARAM3 INT = 0, @PARAM4 INT = 0
AS
BEGIN
    .
    .
    .
END
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC_WITH_DEFAULT_PARAMS1(param1 int default 0, param2 int default 0, param3 int default 0, param4 int default 0)
RETURNS TABLE()
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
    .
    .
    .
$$;
```

```sql
CALL PROC_WITH_DEFAULT_PARAMS1(param2 => 10, param4 => 15);
```

### CURSOR helper

```sql
CREATE OR REPLACE PROCEDURE PROC1()
RETURNS STRING
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
	  var CURSOR = function (stmt, binds) {
	  var statementObj, result_set, total_rows, isOpen = false, self = this, row_count;

	  this.CURRENT = new Object;

	  var fetch = (count,rows,stmt) => (count && rows.next() && Array.apply(null,Array(stmt.getColumnCount())).map((_,i) => rows.getColumnValue(i + 1))) || [];

	  var fixBind = function (arg) {
	      arg = arg == undefined ? null : arg instanceof Date ? formatDate(arg) : arg;
	      return arg;
	   };

	  this.OPEN = function(openParameters) {
		  if (result_set == undefined) {
			try {
				if (openParameters) binds = openParameters;
				if (binds instanceof Function) binds = binds();
				var finalBinds = binds && binds.map(fixBind);
				var finalStmt = stmt instanceof Function ? stmt() : stmt;
				statementObj = snowflake.createStatement({
					sqlText : finalStmt,
					binds : finalBinds
				});
				result_set = statementObj.execute();
				total_rows = statementObj.getRowCount();
				isOpen = true;
				row_count = 0;
			} catch(error) {
				RAISE(error.code,"error",error.message);
			}
			else {
				isOpen = true;
			}
		  }

	      return this;
	  };

	  this.CURSOR_ROWS = function () {
	      return total_rows;
	  };

	  this.FETCH_STATUS = function() {
	      if(total_rows > row_count)
	          return 0;
	      else
	          return -1;
	  };

	  this.FETCH_NEXT = function() {
		  self.res = [];
	      if (isOpen) {
			  self.res = fetch(total_rows,result_set,statementObj);
			  if (self.res)
				  row_count++;
		  }
		  return self.res && self.res.length > 0;
	  };

	  this.INTO = function () {
	      return self.res;
	  };

	  this.CLOSE = function () {
	      isOpen = false;
	  };

	  this.DEALLOCATE = function() {
	      this.CURRENT = row_count = result_set_table = total_rows = result_set = statementObj = self = undefined;
	  };
  };

  var COL1, COL2;
  var sql_stmt = ``;

  let c = new CURSOR(`SELECT COL1, COL2 FROM TABLE1;`,() => []);

  c.OPEN();
  c.FETCH_NEXT();
  [COL1, COL2] = c.INTO();
  while ( c.FETCH_STATUS()) {

        sql_stmt = `INSERT INTO TABLE2 (COL1, COL2) VALUES (` + COL1+ `, ` + COL2 + `)`;

        snowflake.createStatement({
            sqlText : sql_stmt
         }).execute();
  }

  c.CLOSE();
  c.DEALLOCATE();

  return 'sucess';
$$;
```

### Insert Into EXEC Helper

The Insert into Exec helper generates a function called Insert `insertIntoTemporaryTable(sql).` This function will allow the transformation for `INSERT INTO TABLE_NAME EXEC(...)` from TSQL to Snowflake to imitate the behavior from the original statement by inserting it’s data into a temporary table and then re-adding it into the original Insert.

For more information on how the code for this statement is modified look at the section for Insert Into Exec

> **Note:**
>
> This Generated code for the INSERT INTO EXEC, may present performance issues when handling EXECUTE statements containing multiple queries inside.

```sql
   function insertIntoTemporaryTable(sql) {
    var table = "SnowConvertPivotTemporaryTable";
    return EXEC('CREATE OR REPLACE TEMPORARY TABLE ${table} AS ${sql}');
  }

  insertIntoTemporaryTable(`${DBTABLES}`)
  EXEC(`INSERT INTO MYDB.PUBLIC.T_Table SELECT * FROM MYDB.PUBLIC.SnowConvertPivotTemporaryTable`);
  EXEC(`DROP TABLE SnowConvertPivotTemporaryTable`)
```

### LIKE Helper

In case that a like expression is found in a procedure, for example

```sql
CREATE PROCEDURE ProcedureLike @VariableValue VARCHAR(50) AS
BEGIN
	IF @VariableValue like '%c%'
	BEGIN
		Select AValue from ATable;
	END;
END;
```

Since the inside of the procedure is transformed to javascript, the like expression will throw an error. To avoid and keep the functionality, a function is added at the start of the procedure if a like expression is found.

```sql
   function LIKE(expr,pattern,esc,cs) {
    function fixPattern(pattern,esc) {
      const specials = '/.*+?|(){}[]\\'.split('');
      var newPattern = "";
      var fix = (c) => specials.includes(c) ? '\\' + c : c;
      for(var i = 0;i < pattern.length;i++) {
        var c = pattern[i];
        if (c === esc) {
          newPattern += pattern[i + 1]
          i++
        } else if (c === '%') {
          newPattern += ".*?"
        } else if (c === '_') {
          newPattern += "."
        } else if (c === '[' || ']') {
          newPattern += c
        } else newPattern += fix(c)
      }
      return newPattern;
    }
    return new RegExp(`^${fixPattern(pattern,esc)}$`,cs ? '' : 'i').exec(expr) != null;
  }
```

With this function, we can replicate the functionality of the like expression of sql. Let’s see the diferent cases that it can be used

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE ProcedureLike @VariableValue VARCHAR(50) AS
BEGIN
	IF @VariableValue like '%c%'
	BEGIN
		Select AValue from ATable;
	END;
	IF @VariableValue not like '%c%'
	BEGIN
		Select BValue from BTable;
	END;
  IF @VariableValue like '%c!%%' escape '!'
	BEGIN
		Select CValue from CTable;
	END;
END;
```

In the last code, there is a normal like a not like, and a like with escape. The transformation will be

```sql
CREATE OR REPLACE PROCEDURE ProcedureLike (VARIABLEVALUE STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	if (LIKE(VARIABLEVALUE,`%c%`)) {
		{
			EXEC(`		Select
		   AValue
		from
		   ATable`);
		}
	}
	if (!LIKE(VARIABLEVALUE,`%c%`)) {
		{
			EXEC(`		Select
		   BValue
		from
		   BTable`);
		}
	}
	if (LIKE(VARIABLEVALUE,`%c!%%`,`!`)) {
		{
			EXEC(`		Select
		   CValue
		from
		   CTable`);
		}
	}
$$;
```

Note that the likes are transformed to function calls

```sql
LIKE(VARIABLEVALUE,`%c%`)
!LIKE(VARIABLEVALUE,`%c%`)
LIKE(VARIABLEVALUE,`%c!%%`,`!`)
```

The parameters that the function LIKE receive are the followings:

* The expression that is being evaluated.
* The pattern of comparison
* If it is present, the escape character, this is an optional parameter.

### Select Helper

Generates a function called SELECT when a scalar value has to be set to a variable

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE MAX_EMPLOYEE_ID
AS
BEGIN
   DECLARE @VARIABLE INT
   SET @VARIABLE = (SELECT MAX(EMPLOYEE_ID) FROM EMPLOYEES);
   RETURN @VARIABLE
END;
```

In this case, it will generate the following code with the SELECT helper

```sql
CREATE OR REPLACE PROCEDURE MAX_EMPLOYEE_ID ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   // SnowConvert AI Helpers Code section is omitted.

   let VARIABLE;
   VARIABLE = EXEC(`SELECT
   MAX(EMPLOYEE_ID) FROM
   EMPLOYEES`);
   return VARIABLE;
$$;
```

The SELECT helper could be used as well to insert into a local value a retrieved value from a query. The helper was designed specifically to support the same behavior of the SQL Server [SELECT @local_variable](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/select-local-variable-transact-sql?view=sql-server-ver15). The `args` parameter, represents each operation applied to all of the local variables inside the select. See also SELECT @Variable. For example:

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE [PROCEDURE1] AS

DECLARE @VAR1 int;
DECLARE @VAR2 int;
select @VAR1 = col1 + col2, @VAR2 += col1 from table1;

GO
```

In this case the variable assignments will be translated to `JavaScript` lambdas to emulate the SQL Server behavior.

```sql
CREATE OR REPLACE PROCEDURE PROCEDURE1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
// SnowConvert AI Helpers Code section is omitted.

let VAR1;
let VAR2;
SELECT(`   col1 + col2,
   col1
   from
   table1`,[],(value) => VAR1 = value,(value) => VAR2 += value);
$$;
```

### RAISERROR Helper

This helper is generated when there exists usages of a RAISERROR call in the source code. Example:

```sql
 var RAISERROR = (message,severity,state) => {
    snowflake.execute({
      sqlText : `SELECT UPDATE_ERROR_VARS_UDF(?,?,?)`,
      binds : [message,severity,state]
    });
    var msg = `Message: ${message}, Level: ${severity}, State: ${state}`;
    throw msg;
  };
```

The RAISERROR executes the *UPDATE_ERROR_VARS_UDF* to store the value of the error message, severity and state as environment variables, in case they need to be used by calling any of the ERROR built in functions. Finally, the error message is thrown with the same format as SQL Server does.

### Identity Function Helper

This helper is generated whenever the [Identity Function](https://docs.microsoft.com/en-us/sql/t-sql/functions/identity-function-transact-sql?view=sql-server-ver15) is used on a Select Into inside a procedure.

```sql
  var IdentityHelper = (seed,increment) => {
      var sequenceString = "`CREATE OR REPLACE SEQUENCE SnowConvert_Temp_Seq START = ${seed} INCREMENT = ${increment}`";
      return EXEC(sequenceString);
```

The parameters for this helper are the same as the original function, it is created to generate a sequence to mimic the identity function behavior in TSQL, the changes to the original code are:

* An additional method call to the IdentityHelper function using the same parameters found in the source code.
* And call to the IDENTITY_UDF a function design to get the next value in the sequence.

```sql
   IdentityHelper(1,1)
   EXEC(`CREATE TABLE PUBLIC.department_table3 AS SELECT IDENTITY_UDF() /*** MSC-WARNING - MSCEWI1046 - 'identity' FUNCTION MAPPED TO 'IDENTITY_UDF', FUNCTIONAL EQUIVALENCE VERIFICATION PENDING ***/ as Primary_Rank
from PUBLIC.department_table`);
```

Just like in the TSQL if no parameters are given (1,1) will be the default values.

### CALL Procedure Helper

This helper is generated whenever there is a call to what previously was a user defined function, but is now a procedure as a result of the translation process.

```sql
    var CALL = (sql,binds = [],...args) => {
      EXEC("CALL " + sql,binds);
      _ROWS.next();
      return (_ROWS.getColumnValue(1))[0];
   };
```

The purpose of this helper is to encapsulate the logic required for calling procedures as if they were functions.

Please keep in mind that this functionality is limited, since procedures cannot be invoked within queries such as SELECT.

Example of use, assuming that `FooSelfAssign(@PAR INT)` was translated to a procedure:

```sql
 // Input code
DECLARE @VAR1 INT = FooSelfAssign(1);
DECLARE @VAR4 INT = FooSelfAssign(FooSelfAssign(FooSelfAssign(FooSelfAssign(4))));
```

```sql
 // Output code
let VAR1 = CALL(`FooSelfAssign(1)`)
let VAR4 = CALL(`FooSelfAssign(?)`,[CALL(`FooSelfAssign(?)`,[CALL(`FooSelfAssign(?)`,[CALL(`FooSelfAssign(4)`)])])]);
```

Note that the translation for VAR1 is very straightforward, but for VAR4, the outmost CALL contains a list with the rest of the CALLs, as bindings.

Each successive CALL is translated to a binding, if it’s contained within another CALL.

## 2. Variables

### DECLARE @Variable

#### SQL Server

```sql
DECLARE @product_list VARCHAR(MAX) = ' ';
DECLARE @Variable1 AS VARCHAR(100), @Variable2 AS VARCHAR(100);
```

#### Snowflake

```sql
let PRODUCT_LIST = ` `;
let VARIABLE1;
let VARIABLE2;
```

### DECLARE @Variable Table

In this case, the DECLARE is used to declare a variable table, let’s see an example.

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE PROC1
AS
BEGIN
DECLARE @VariableNameTable TABLE
 (
 [Col1] INT NOT NULL,
 [Col2] INT NOT NULL
 );
INSERT INTO @VariableNameTable Values(111,222);
Select * from @VariableNameTable;
END

Exec PROC1;
```

If we execute that code in Sql Server, we will get the following result

| col1 | col2 |
| --- | --- |
| 111 | 222 |

Now, let’s see the transformation in Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
 // SnowConvert AI Helpers Code section is omitted.

 {
  EXEC(`CREATE OR REPLACE TEMPORARY TABLE T_VariableNameTable
(
   Col1 INT NOT NULL,
   Col2 INT NOT NULL
)`);
  EXEC(`INSERT INTO T_VariableNameTable Values(111,222)`);
  EXEC(`Select
   *
from
   T_VariableNameTable`);
 }
 EXEC(`CALL PROC1()`);
$$;
```

Note that from the lines **61** to **67** are the results of those statements inside the procedure.

The Declare Variable Table is turned into a Temporary Table. Note that the name, which that in the name the character @ was replaced for T_.

If we execute that code in Snowflake, we will not get any result. it will display just null. That’s because that last Select is now in the EXEC helper. So, how do we know that the table is there?

Since it was created as a temporary table inside the Procedure in an EXEC, we can do a Select to that table outside of the Procedure.

```sql
 Select * from PUBLIC.T_VariableNameTable;
```

If we execute that statement, we will get the following result

| col1 | col2 |
| --- | --- |
| 111 | 222 |

### SET @Variable

For now, the Set Variable is transformed depending on the expression that is has on the right side.

If the expression has a transformation, it will be transformed to it’s JavaScript equivalent.

Example

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE PROC1
AS
BEGIN
	SET @product_list2 = '';
    SET @product_list = '';
    SET @var1 += '';
    SET @var2 &= '';
    SET @var3 ^= '';
    SET @var4 |= '';
    SET @var5 /= '';
    SET @var6 %= '';
    SET @var7 *= '';
    SET @var8 -= '';
    SET @ProviderStatement = 'SELECT * FROM TABLE1
WHERE COL1 = '+@PARAM1+ ' AND COL2 = ' + @LOCALVAR1;
    SET @NotSupported = functionValue(a,b,c);
END
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	PRODUCT_LIST2 = ``;
	PRODUCT_LIST = ``;
	VAR1 += ``;
	VAR2 &= ``;
	VAR3 ^= ``;
	VAR4 |= ``;
	VAR5 /= ``;
	VAR6 %= ``;
	VAR7 *= ``;
	VAR8 -= ``;
	PROVIDERSTATEMENT = `SELECT
   *
FROM
   TABLE1
WHERE
   COL1 = ${PARAM1}
   AND COL2 = ${LOCALVAR1};`;
	NOTSUPPORTED = SELECT(`   functionValue(a,b,c) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'functionValue' NODE ***/!!!`);
$$;
```

As you can see in the example, the value of the variable NOTSUPPORTED is commented since it is not being transformed for the time being. Note that means that the transformation is not completed yet.

Other kinds of sets are commented, for example the following

#### SQL Server

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE PROC1
AS
BEGIN
SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED
;

SET NOCOUNT ON
;

SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED
;

SET NOCOUNT OFF
;
END
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
// SnowConvert AI Helpers Code section is omitted.

/*** SSC-EWI-0040 - THE 'SET' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
/*SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED*/
;
/*** SSC-EWI-0040 - THE 'SET' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
/*SET NOCOUNT ON*/
;
/*** SSC-EWI-0040 - THE 'SET' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
/*SET TRANSACTION ISOLATION LEVEL READ UNCOMMITTED*/
;
/*** SSC-EWI-0040 - THE 'SET' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
/*SET NOCOUNT OFF*/
;
$$;
```

### SELECT @Variable

For now, the `SELECT @variable` is being transformed into a simple select, removing the variable assignations, and keeping the expressions at the right side of the operator. The assignment operations of the local variables in the select, will be replaced with `arrow` functions that represent the same behavior of the operation being did during the local variable assignment in `SQL Server`.

#### SQL Server

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE PROC1 AS
DECLARE @VAR1 int;
DECLARE @VAR2 int;
SELECT @VAR1 = COL1 + COL2, @VAR2 = COL3 FROM TABLE1;
GO
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
// SnowConvert AI Helpers Code section is omitted.

let VAR1;
let VAR2;
SELECT(`   COL1 + COL2,
   COL3
   FROM
   TABLE1`,[],(value) => VAR1 = value,(value) => VAR2 = value);
$$;
```

## 3. Statements translation

### SELECT

#### Basic form

The basic SELECT form does not have bindings, so the translation implies the creation of a call to the EXEC helper function, with one parameter.
For example:

```sql
 -- Source code:
SELECT * FROM DEMO_TABLE_1;
```

```sql
 // Translated code:
EXEC(`SELECT * FROM DEMO_TABLE_1`);
```

### IF

#### SQL Server

```sql
IF Conditional_Expression
   -- SQL Statement
ELSE IF Conditiona_Expression2
   -- SQL Statement
ELSE
   -- SQL Statement
```

#### Snowflake

```sql
 if (Conditional_Expression) {
    // SQL Statement
} else if (Conditional_Expression2) {
    // SQL Statement
} else{
    // SQL Statement
}
```

### WHILE

#### SQL Server

```sql
WHILE ( Conditional_Expression )
BEGIN
   -- SQL STATEMENTS
END;
```

#### Snowflake

```sql
while ( Conditional_Expression )
{
  // SQL STATEMENTS
}
```

### EXEC / EXECUTE

#### SQL Server

```sql
 -- Execute simple statement
Exec('Select 1');

-- Execute statement using Dynamic Sql
Exec('Select ' + @par1 + ' from [db].[t1]');

-- Execute Procedure with parameter
EXEC db.sp2 'Create proc [db].[p3] AS', @par1, 1
```

#### Snowflake

```sql
 -- Execute simple statement
EXEC(`Select 1`);

-- Execute statement using Dynamic Sql
EXEC(`Select ${PAR1} from MYDB.db.t1`);

-- Execute Procedure with parameter
EXEC(`CALL db.sp2(/*** SSC-EWI-0038 - THIS STATEMENT MAY BE A DYNAMIC SQL THAT COULD NOT BE RECOGNIZED AND CONVERTED ***/
'Select * from MYDB.db.t1', ?, 1, Default)`,[PAR1]);
```

### THROW

The transformation for THROW ensures that the catch block that receives the error has access to the information specified in the original statement.

#### SQL Server

```sql
 -- Case 1
THROW

-- Case 2
THROW 123, 'The error message', 1

-- Case 3
THROW @var1, @var2, @var3
```

#### Snowflake

```sql
 // Case 1
throw {};

// Case 2
throw { code: 123, message: "The error message", status: 1 };

// Case 3
throw { code: VAR1, message: VAR2, status: VAR3 };
```

### RAISERROR

SQL Server [RAISERROR](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/raiserror-transact-sql?view=sql-server-ver15)  function is not supported in Snowflake.
SnowConvert AI identifies all the usages to generate a helper that emulates the original behavior. Example:

#### SQL Server

```sql
 -- Additional Params: -t JavaScript
CREATE OR ALTER PROCEDURE  RAISERRORTEST AS
BEGIN
    DECLARE @MessageTXT VARCHAR = 'ERROR MESSAGE';
    RAISERROR (N'E_INVALIDARG', 16, 1);
    RAISERROR ('Diagram does not exist or you do not have permission.', 16, 1);
    RAISERROR(@MessageTXT, 16, 1);
END
GO
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE RAISERRORTEST ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    let MESSAGETXT = `ERROR MESSAGE`;
    RAISERROR("E_INVALIDARG","16","1");
    RAISERROR("Diagram does not exist or you do not have permission.","16","1");
    RAISERROR(MESSAGETXT,"16","1");
$$;
```

### BREAK/CONTINUE

The break/continue transformation, ensures flow of the code to be stopped or continue with another block.

#### SQL Server

```sql
-- Additional Params: -t JavaScript
CREATE PROCEDURE ProcSample
AS
BEGIN
IF @@ROWCOUNT > 0
  Continue;
ELSE
  BREAK;
END
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE ProcSample ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  if (ROW_COUNT > 0) {
    continue;
  } else {
    break;
  }
$$;
```

### INSERT INTO EXEC

The code is modify slightly due to the `INSERT INTO [Table] EXEC(...)` Statement not being supported in Snowflake this allows us to replicate the behavior by adding a few lines of code:

* The first line added is a call to the `insertIntoTemporaryTable` to where the extracted code from the argument inside the `EXEC`, this will Insert the result set into a Temporary table. For more information on the function check the Insert Into EXEC Helper section.
* The Insert’s Exec is removed from the code and a query retrieving the results of the EXEC from the temporary table.

```sql
SELECT * FROM MYDB.PUBLIC.SnowConvertPivotTemporaryTable
```

* The last line added is a DROP TABLE statement for the Temporary Table added.

```sql
   DROP TABLE SnowConvertPivotTemporaryTable
```

#### SQL Server

```sql
INSERT INTO #Table1
EXEC ('SELECT
Table1.ID
FROM Population');

INSERT INTO #Table1
EXEC (@DBTables);
```

#### Snowflake

```sql
  insertIntoTemporaryTable(`SELECT Table1.ID FROM MYDB.PUBLIC.Population)
  EXEC(`INSERT INTO MYDB.PUBLIC.T_Table1 SELECT * FROM MYDB.PUBLIC.SnowConvertPivotTemporaryTable`);
  EXEC(`DROP TABLE SnowConvertPivotTemporaryTable`)

  insertIntoTemporaryTable(`${DBTABLES}`)
  EXEC(`INSERT INTO MYDB.PUBLIC.T_Table1 SELECT * FROM MYDB.PUBLIC.SnowConvertPivotTemporaryTable`);
  EXEC(`DROP TABLE SnowConvertPivotTemporaryTable`)
```

### BEGIN TRANSACTION

BEGIN TRANSACTION is transformed to Snowflake’s BEGIN command, and inserted into an EXEC helper call.

The helper is in charge of actually executing the resulting BEGIN.

#### SQL Server

```sql
 -- Input code
BEGIN TRAN @transaction_name;
```

#### Snowflake

```sql
 // Output code
EXEC(`BEGIN`, []);
```

### COMMIT TRANSACTION

COMMIT TRANSACTION is transformed to Snowflake’s COMMIT command, and inserted into an EXEC helper call.

The helper is in charge of actually executing the resulting COMMIT.

#### SQL Server

```sql
 -- Input code
COMMIT TRAN @transaction_name;
```

#### Snowflake

```sql
 // Output code
EXEC(`COMMIT`, []);
```

### ROLLBACK TRANSACTION

ROLLBACK TRANSACTION is transformed to Snowflake’s ROLLBACK command, and inserted into an EXEC helper call.

The helper is in charge of actually executing the resulting ROLLBACK .

#### SQL Server

```sql
 -- Input code
ROLLBACK TRAN @transaction_name;
```

#### Snowflake

```sql
 // Output code
EXEC(`ROLLBACK`, []);
```

### WAITFOR DELAY

WAITFOR DELAY clause is transformed to Snowflake’s `SYSTEM$WAIT` function. The *time_to_pass* parameter of the DELAY is transformed to seconds, for usage as a parameter in the `SYSTEM$WAIT` function.

The other variants of the WAITFOR clause are not supported in Snowflake, and are therefore marked with the corresponding message.

#### SQL Server

```sql
 -- Input code
1) WAITFOR DELAY '02:00';
2) WAITFOR TIME '13:30';
3) WAITFOR (RECEIVE TOP (1)
   @dh = conversation_handle,
   @mt = message_type_name,
   @body = message_body
   FROM [eqe]), TIMEOUT 5000;
```

#### Snowflake

```sql
 // Output code
1) EXEC(`SYSTEM$WAIT(120)`,[]);
2) /*** SSC-EWI-0040 - THE 'WAIT FOR' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
   /*WAITFOR TIME '13:30'*/
   ;
3) /*** SSC-EWI-0040 - THE 'WAIT FOR' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
   /*WAITFOR (RECEIVE TOP (1)
      @dh = conversation_handle,
      @mt = message_type_name,
      @body = message_body
      FROM [eqe]), TIMEOUT 5000*/
   ;
```

## 3. Cursors

Since `CURSORS` are not supported in Snowflake, SnowConvert AI maps their functionality to a `JavaScript` helper that emulates the original behavior in the target platform. Example:

### SQL Server

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE [procCursorHelper] AS

DECLARE vendor_cursor CURSOR FOR
    SELECT VendorID, Name
    FROM Purchasing.Vendor
    WHERE PreferredVendorStatus = 1
    ORDER BY VendorID;
GO
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procCursorHelper ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var VENDOR_CURSOR = new CURSOR(`SELECT
       VendorID,
       Name
    FROM
       Purchasing.Vendor
    WHERE
       PreferredVendorStatus = 1
    ORDER BY VendorID`,[],false);
$$;
```

### DECLARE CURSOR

#### SQL Server

```sql
DECLARE myCursor1 CURSOR FOR SELECT COL1 FROM TABLE1
```

#### Snowflake

```sql
let myCursor1 = new CURSOR(`SELECT COL1 FROM TABLE1`,() => []);
```

### OPEN

#### SQL Server

```sql
OPEN myCursor1
OPEN GLOBAL myCursor2
```

#### Snowflake

```sql
myCursor1.OPEN();
myCursor2.OPEN()
```

### FETCH

#### SQL Server

```sql
DECLARE @VALUE1 INT
FETCH NEXT FROM myCursor1 into @VALUE1
```

#### Snowflake

```sql
var VALUE1;
myCursor1.FETCH_NEXT();
VALUE1 = myCursor1.INTO();
```

### CLOSE

#### SQL Server

```sql
CLOSE myCursor1
CLOSE GLOBAL myCursor2
```

#### Snowflake

```sql
myCursor1.CLOSE()
myCursor2.CLOSE()
```

### DEALLOCATE

#### SQL Server

```sql
DEALLOCATE myCursor1
DEALLOCATE GLOBAL myCursor2
```

#### Snowflake

```sql
myCursor1.DEALLOCATE()
myCursor2.DEALLOCATE()
```

### @@FETCH_STATUS

#### SQL Server

```sql
 @@FETCH_STATUS
```

#### Snowflake

```sql
myCursor1.FETCH_STATUS()
```

### @@CURSOR_ROWS

#### SQL Server

```sql
 @@CURSOR_ROWS
```

#### Snowflake

```sql
myCursor1.FETCH_STATUS()
```

## 4. Expressions

### Binary Operations

#### SQL Server

```sql
SET @var1 = 1 + 1;
SET @var1 = 1 - 1;
SET @var1 = 1 / 1;
SET @var1 = 1 * 1;
SET @var1 = 1 OR 1;
SET @var1 = 1 AND 1;
```

#### Snowflake

```sql
VAR1 = 1 + 1;
VAR1 = 1 - 1;
VAR1 = 1 / 1;
VAR1 = 1 * 1;
VAR1 = 1 || 1;
VAR1 = 1 && 1;
```

### Conditionals

#### SQL Server

```sql
@var1 > 0
@var1 = 0
@var1 < 0
@var1 <> 0
```

#### Snowflake

```sql
VAR1 > 0
VAR1 = 0
VAR1 < 0
VAR1 != 0
```

#### NULL Predicate

##### SQL Server

```sql
@var1 is null
@var2 is not null
```

##### Snowflake

```sql
VAR1 == null
VAR2 != null
```

## 5. Labels and Goto

`Labels` have not the same behavior in JavaScript as SQL Server has. To simulate the behavior, they are being transformed to `functions` . Its usage is being replaced with a call of the generated function that contains all the logic of the label. Example:

### Source Code

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE [procWithLabels]
AS
SUCCESS_EXIT:
	SET @ErrorStatus = 0
	RETURN @ErrorStatus

ERROR_EXIT:
	RETURN @ErrorStatus
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procWithLabels ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	SUCCESS_EXIT();
	ERROR_EXIT();
	function SUCCESS_EXIT() {
		ERRORSTATUS = 0;
		return ERRORSTATUS;
	}
	function ERROR_EXIT() {
		return ERRORSTATUS;
	}
$$;
```

As you see in the example above, the function declarations that were the labels in the source code, will be put at the end of the code to make it cleaner.

`GOTO` is another command that does not exist in JavaScript. To simulate its behavior, their usages are being transformed to calls to the function (label) that is referenced, preceded by a return statement. Example:

#### SQL Server

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE [procWithLabels]
AS
DECLARE @ErrorStatus int = 0;
IF @ErrorStatus <> 0 GOTO ERROR_EXIT

SUCCESS_EXIT:
	SET @ErrorStatus = 0
	RETURN @ErrorStatus

ERROR_EXIT:
	RETURN @ErrorStatus
```

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE procWithLabels ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	// SnowConvert AI Helpers Code section is omitted.

	let ERRORSTATUS = 0;
	if (ERRORSTATUS != 0) {
		return ERROR_EXIT();
	}
	SUCCESS_EXIT();
	ERROR_EXIT();
	function SUCCESS_EXIT() {
		ERRORSTATUS = 0;
		return ERRORSTATUS;
	}
	function ERROR_EXIT() {
		return ERRORSTATUS;
	}
$$;
```

As you see in the example above, the `return` is added to the function call, to stop the code flow as SQL Server does with the `GOTO` .

## Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - SQL Server-Azure Synapse - QUOTED_IDENTIFIER
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-quoted-identifier.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - QUOTED_IDENTIFIER

Applies to

* SQL Server
* Azure Synapse Analytics

## Description

This statement controls whether double quotation marks are used to delimit identifiers (such as table names, column names, etc.) or string literals in SQL Server. When `SET QUOTED_IDENTIFIER` is ON, identifiers can be delimited by double quotation marks, and literals must be delimited by single quotation marks. When OFF, double quotation marks are treated as string literal delimiters. Please visit [SET QUOTED_IDENTIFIER](https://learn.microsoft.com/en-us/sql/t-sql/statements/set-quoted-identifier-transact-sql?view=sql-server-ver17) to get more information about this statement.

## Transact-SQL Syntax

```sql
 SET QUOTED_IDENTIFIER { ON | OFF }
```

## Behavior Comparison

### SQL Server Behavior

In SQL Server, the `SET QUOTED_IDENTIFIER` setting determines how double quotes are interpreted:

* **When ON (default)**: Double quotes delimit identifiers, allowing special characters and reserved keywords in object names
* **When OFF**: Double quotes are treated as string literal delimiters (similar to single quotes)

### Snowflake Behavior

Snowflake always treats double quotes as identifier delimiters (equivalent to SQL Server’s `QUOTED_IDENTIFIER ON`). There is no equivalent to the `OFF` setting. Key differences include:

1. **Case Sensitivity**:

   * Unquoted identifiers are automatically converted to uppercase
   * Quoted identifiers preserve exact case and become case-sensitive
2. **QUOTED_IDENTIFIERS_IGNORE_CASE Parameter**: Controls case sensitivity for quoted identifiers

## Sample Source Patterns

### SET QUOTED_IDENTIFIER ON

When `QUOTED_IDENTIFIER` is ON in SQL Server, double quotes can be used to delimit identifiers containing spaces or special characters.

#### SQL Server

```sql
 SET QUOTED_IDENTIFIER ON;

 CREATE TABLE "Order Details" (
     "Order ID" INT,
     "Product Name" VARCHAR(50),
     "Unit Price" DECIMAL(10,2)
 );

 SELECT "Order ID", "Product Name" FROM "Order Details";
```

#### Snowflake

```sql
----** SSC-FDM-TS0033 - SET QUOTED_IDENTIFIER STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE **
--SET QUOTED_IDENTIFIER ON

 CREATE OR REPLACE TABLE "Order Details" (
     "Order ID" INT,
     "Product Name" VARCHAR(50),
     "Unit Price" DECIMAL(10, 2)
 )
 COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/22/2025",  "domain": "no-domain-provided" }}'
;

 SELECT
     "Order ID",
     "Product Name"
 FROM
     "Order Details";
```

**Example of the Difference**

Let’s assume you’ve migrated a table from a SQL Server database with a case-insensitive collation (_CI):

#### SQL Server (with _CI collation):

```sql
-- This statement is valid
SELECT "MyColumn" FROM "MyTable";

-- This statement is also valid and returns the same result
SELECT "mycolumn" FROM "MyTable";
```

In this case, the _CI collation makes the two SELECT statements interchangeable.

#### Snowflake:

```sql
-- This statement is valid
SELECT "MyColumn" FROM "MyTable";

-- This statement will fail because "mycolumn" does not match "MyColumn"
SELECT "mycolumn" FROM "MyTable";
-- ERROR:  SQL compilation error: error in select clause: mycolumn does not exist
```

The Snowflake behavior is different because it respects the case of the quoted identifier by default.
It could be approachable by altering the session using.

```sql
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = TRUE;
```

If you want to set the parameter at the account level, you can use the following command:

```sql
ALTER ACCOUNT SET QUOTED_IDENTIFIERS_IGNORE_CASE = TRUE;
```

This will set the parameter for all sessions associated with the account.
For further information, check the following [documentation](https://docs.snowflake.com/en/sql-reference/identifiers-syntax);

### SET QUOTED_IDENTIFIER OFF

When `QUOTED_IDENTIFIER` is OFF in SQL Server, double quotes are treated as string delimiters.

#### SQL Server

```sql
 SET QUOTED_IDENTIFIER OFF;

 -- Double quotes treated as string literals
 SELECT * FROM customers WHERE name = "John Doe";

 -- Must use square brackets for identifiers with spaces
 SELECT [Order ID] FROM [Order Details];
```

#### Snowflake

```sql
 ----** SSC-FDM-TS0028 - QUOTED_IDENTIFIER OFF behavior not supported in Snowflake **
 -- Double quotes always delimit identifiers in Snowflake
 -- Use single quotes for string literals
 SELECT * FROM customers WHERE name = 'John Doe';

 -- Double quotes delimit identifiers (case-sensitive)
 SELECT "Order ID" FROM "Order Details";
```

## Migration Considerations

1. **Review Identifier Casing**: Ensure consistent casing when migrating to Snowflake, especially for quoted identifiers
2. **String Literals**: Replace double-quoted string literals with single-quoted literals
3. **Use QUOTED_IDENTIFIERS_IGNORE_CASE**: Consider setting this parameter to `TRUE` early in migration to reduce case sensitivity issues
4. **Test Thoroughly**: Verify all object references work correctly after migration

## Related EWIs and FDMs

1. **SSC-FDM-TS0028**: SET QUOTED_IDENTIFIER OFF behavior not supported in Snowflake - double quotes always delimit identifiers

---
title: SnowConvert AI - SQL Server-Azure Synapse - SELECT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-select.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - SELECT

## SELECT

Translation reference for SELECT statement inside procedures in Transact-SQL.

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Multiple result sets are returned in temporary tables

### Description

Snowflake SQL supports returning tables as a return type for Stored Procedures, but unlike Transact-SQL, Snowflake does not support returning multiple resultsets in the same procedure. For this scenario, all the query IDs are stored in a temporary table and returned as an array.

### Sample Source Patterns

The following example details the transformation when there is only one SELECT statement in the procedure.

#### Transact-SQL

##### Single Resultset

```sql
CREATE PROCEDURE SOMEPROC()
AS
BEGIN
        SELECT * from AdventureWorks.HumanResources.Department;
END
```

##### Output

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

##### Snowflake SQL

##### Single Resultset

```sql
CREATE OR REPLACE PROCEDURE SOMEPROC ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
        DECLARE
                ProcedureResultSet RESULTSET;
        BEGIN
                ProcedureResultSet := (
                SELECT
                        *
                from
                        AdventureWorks.HumanResources.Department);
                RETURN TABLE(ProcedureResultSet);
        END;
$$;
```

##### Output

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

The following example details the transformation when there are many SELECT statements in the procedure.

##### Transact-SQL

##### Multiple Resultset

```sql
 CREATE PROCEDURE SOMEPROC()
AS
BEGIN
        SELECT * from AdventureWorks.HumanResources.Department;
        SELECT * from AdventureWorks.HumanResources.Shift;
END
```

##### Output

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

| ShiftID | Name | StartTime | EndTime | ModifiedDate |
| --- | --- | --- | --- | --- |
| 1 | Day | 07:00:00 | 15:00:00 | 2008-04-30 00:00:00.000 |
| 2 | Evening | 15:00:00 | 23:00:00 | 2008-04-30 00:00:00.000 |
| 3 | Night | 23:00:00 | 07:00:00 | 2008-04-30 00:00:00.000 |

##### Snowflake SQL

##### Single Resultset

```sql
CREATE OR REPLACE PROCEDURE SOMEPROC ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
        DECLARE
                ProcedureResultSet1 VARCHAR;
                ProcedureResultSet2 VARCHAR;
                return_arr ARRAY := array_construct();
        BEGIN
                ProcedureResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
                CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet1) AS
                        SELECT
                                *
                        from
                                AdventureWorks.HumanResources.Department;
                return_arr := array_append(return_arr, :ProcedureResultSet1);
                ProcedureResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
                CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet2) AS
                        SELECT
                                *
                        from
                                AdventureWorks.HumanResources.Shift;
                return_arr := array_append(return_arr, :ProcedureResultSet2);
                --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
                RETURN return_arr;
        END;
$$;
```

##### Output

| DepartmentID | Name | GroupName |
| --- | --- | --- |
| 1 | Engineering | Research and Development |
| 2 | Tool Design | Research and Development |
| 3 | Sales | Sales and Marketing |
| 4 | Marketing | Sales and Marketing |
| 5 | Purchasing | Inventory Management |
| 6 | Research and Development | Research and Development |
| 7 | Production | Manufacturing |
| 8 | Production Control | Manufacturing |
| 9 | Human Resources | Executive General and Administration |
| 10 | Finance | Executive General and Administration |
| 11 | Information Services | Executive General and Administration |
| 12 | Document Control | Quality Assurance |
| 13 | Quality Assurance | Quality Assurance |
| 14 | Facilities and Maintenance | Executive General and Administration |
| 15 | Shipping and Receiving | Inventory Management |
| 16 | Executive | Executive General and Administration |

| ShiftID | Name | StartTime | EndTime | ModifiedDate |
| --- | --- | --- | --- | --- |
| 1 | Day | 07:00:00 | 15:00:00 | 2008-04-30 00:00:00.000 |
| 2 | Evening | 15:00:00 | 23:00:00 | 2008-04-30 00:00:00.000 |
| 3 | Night | 23:00:00 | 07:00:00 | 2008-04-30 00:00:00.000 |

### Known Issues

1. The query results should be accessed by using the IDs returned by the Stored Procedure

### Related EWIs

1. [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.

## TOP

Applies to

* SQL Server
* Azure Synapse Analytics

### Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

Limits the rows returned in a query result set to a specified number of rows or percentage of rows. When you use `TOP` with the `ORDER BY` clause, the result set is limited to the first *N* number of ordered rows. Otherwise, `TOP` returns the first ***N*** number of rows in an undefined order. Use this clause to specify the number of rows returned from a `SELECT` statement. Or, use `TOP` to specify the rows affected by an `INSERT`, `UPDATE`, `MERGE`, or `DELETE` statement. ([Transact-SQL TOP documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/top-transact-sql?view=sql-server-ver16))

#### Syntax in Transact-SQL

```sql
 TOP (expression) [PERCENT] [ WITH TIES ]
```

> **Note:**
>
> To get more information about the **`TOP`** arguments please check the [Transact-SQL TOP documentation](https://learn.microsoft.com/en-us/sql/t-sql/queries/top-transact-sql?view=sql-server-ver16#arguments).

##### Syntax in Snowflake

```sql
 TOP <n>
```

> **Note:**
>
> To get more information about **`TOP`** arguments please check the [Snowflake TOP documentation](https://docs.snowflake.com/en/sql-reference/constructs/top_n#parameters).

### Sample Source Patterns

To execute correctly the following samples it is required run the next `CREATE TABLE` statement:

#### Transact-SQL

```sql
 CREATE TABLE Cars(
    Model VARCHAR(15),
    Price MONEY,
    Color VARCHAR(10)
);

INSERT Cars VALUES ('sedan', 10000, 'red'),
('convertible', 15000, 'blue'),
('coupe', 20000, 'red'),
('van', 8000, 'blue'),
('sub', 8000, 'green');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE Cars (
    Model VARCHAR(15),
    Price NUMBER(38, 4),
    Color VARCHAR(10)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

INSERT INTO Cars VALUES ('sedan', 10000, 'red'),
('convertible', 15000, 'blue'),
('coupe', 20000, 'red'),
('van', 8000, 'blue'),
('sub', 8000, 'green');
```

#### Common Case

##### Transact-SQL

##### Query

```sql
 SELECT TOP(1) Model, Color, Price
FROM Cars
WHERE Color = 'red'
```

##### Result

| Model | Color | Price |
| --- | --- | --- |
| sedan | red | 10000.0000 |

##### Snowflake

##### Query

```sql
 SELECT
TOP 1
Model,
Color,
Price
FROM
Cars
WHERE
Color = 'red';
```

##### Result

| MODEL | COLOR | PRICE |
| --- | --- | --- |
| sedan | red | 10,000 |

#### TOP using PERCENT

##### Transact-SQL

##### Query

```sql
 SELECT TOP(50)PERCENT Model, Color, Price FROM Cars
```

##### Result

| Model | Color | Prices |
| --- | --- | --- |
| sedan | red | 10000.0000 |
| convertible | blue | 15000.0000 |
| coupe | green | 20000.0000 |

##### Snowflake

##### Query

```sql
SELECT
TOP 50 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP PERCENT' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
Model,
Color,
Price
FROM
Cars;
```

##### Result

| MODEL | COLOR | PRICE |
| --- | --- | --- |
| sedan | red | 10,000 |
| convertible | blue | 15,000 |
| coupe | red | 20,000 |
| van | blue | 8,000 |
| sub | green | 8,000 |

> **Warning:**
>
> Since `PERCENT` argument is not supported by Snowflake it is being removed from the `TOP` clause, that’s why the result of executing the query in Snowflake is not equivalent to Transact-SQL.

#### TOP WITH TIES

##### Transact-SQL

##### Query

```sql
 SELECT TOP(50)PERCENT WITH TIES Model, Color, Price FROM Cars ORDER BY Price;
```

##### Result

| Model | Color | Price |
| --- | --- | --- |
| van | blue | 8000.0000 |
| sub | green | 8000.0000 |
| sedan | red | 10000.0000 |

##### Snowflake

##### Query

```sql
 SELECT
 TOP 50 !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'TOP PERCENT AND WITH TIES' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
  Model,
  Color,
  Price
 FROM
  Cars
 ORDER BY Price;
```

##### Result

| MODEL | COLOR | PRICE |
| --- | --- | --- |
| sub | green | 8,000 |
| van | blue | 8,000 |
| sedan | red | 10,000 |
| convertible | blue | 15,000 |
| coupe | red | 20,000 |

> **Warning:**
>
> Since `WITH TIES` argument is not supported by Snowflake it is being removed from the `TOP` clause, that’s why the result of executing the query in Snowflake is not equivalent to Transact-SQL.

### Known Issues

#### 1. PERCENT argument is not supported by Snowflake

Since the `PERCENT` argument is not supported by Snowflake it is being removed from the `TOP` clause and a warning is being added. Functional equivalence mismatches in the results could happen.

##### 2. WITH TIES argument is not supported by Snowflake

Since the `WITH TIES` argument is not supported by Snowflake it is being removed from the `TOP` clause and a warning is being added. Functional equivalence mismatches in the results could happen.

### Related EWIs

1. [SSC-EWI-0040](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.

---
title: SnowConvert AI - SQL Server-Azure Synapse - System Tables
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-system-tables.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - System Tables

Translation spec for Transact-SQL System Tables

## System tables

| Transact-SQL | Snowflake SQL | Notes |  |
| --- | --- | --- | --- |
| SYS.ALL_VIEWS | INFORMATION_SCHEMA.VIEWS |  |  |
| SYS.ALL_COLUMNS | INFORMATION_SCHEMA.COLUMNS |  |  |
| SYS.COLUMNS | INFORMATION_SCHEMA.COLUMNS |  |  |
| SYS.OBJECTS | INFORMATION_SCHEMA.OBJECT_PRIVILEGES |  |  |
| SYS.PROCEDURES | INFORMATION_SCHEMA.PROCEDURES |  |  |
| SYS.SEQUENCES | INFORMATION_SCHEMA.SEQUENCES |  |  |
| SYS.ALL_OBJECTS | INFORMATION_SCHEMA.OBJECT_PRIVILEGES |  |  |
| ALL_PARAMETERS | **Not supported** |  |  |
| SYS.ALL_SQL_MODULES | **Not supported** |  |  |
| SYS.ALLOCATION_UNITS | **Not supported** |  |  |
| SYS.ASSEMBLY_MODULES | **Not supported** |  |  |
| SYS.CHECK_CONSTRAINTS | **Not supported** |  |  |
| SYS.COLUMN_STORE_DICTIONARIES | **Not supported** |  |  |
| SYS.COLUMN_STORE_ROW_GROUPS | **Not supported** |  |  |
| SYS.COLUMN_STORE_SEGMENTS | **Not supported** |  |  |
| SYS.COMPUTED_COLUMNS | **Not supported** |  |  |
| SYS.DEFAULT_CONSTRAINTS | **Not supported** |  |  |
| SYS.EVENTS | **Not supported** |  |  |
| SYS.EVENT_NOTIFICATIONS | **Not supported** |  |  |
| SYS.EVENT_NOTIFICATION_EVENT_TYPES | **Not supported** |  |  |
| SYS.EXTENDED_PROCEDURES | **Not supported** |  |  |
| SYS.EXTERNAL_LANGUAGE_FILES | **Not supported** |  |  |
| SYS.EXTERNAL_LANGUAGES | **Not supported** |  |  |
| SYS.EXTERNAL_LIBRARIES | **Not supported** |  |  |
| SYS.EXTERNAL_LIBRARY_FILES | **Not supported** |  |  |
| SYS.FOREIGN_KEYS | INFORMATION_SCHEMA.TABLE_CONSTRAINTS |  |  |
| SYS.FOREIGN_KEY_COLUMNS | **Not supported** |  |  |
| SYS.FUNCTION_ORDER_COLUMNS | **Not supported** |  |  |
| SYS.HASH_INDEXES | **Not supported** |  |  |
| SYS.INDEXES | **Not supported** |  |  |
| SYS.INDEX_COLUMNS | **Not supported** |  |  |
| SYS.INDEX_RESUMABLE_OPERATIONS | **Not supported** |  |  |
| SYS.INTERNAL_PARTITIONS | **Not supported** |  |  |
| SYS.INTERNAL_TABLES | **Not supported** |  |  |
| SYS.KEY_CONSTRAINTS | **Not supported** |  |  |
| SYS.MASKED_COLUMNS | **Not supported** |  |  |
| SYS.MEMORY_OPTIMIZED_TABLES_INTERNAL_ATTRIBUTES | **Not supported** |  |  |
| SYS.MODULE_ASSEMBLY_USAGES | **Not supported** |  |  |
| SYS.NUMBERED_PROCEDURES | **Not supported** |  |  |
| SYS.NUMBERED_PROCEDURE_PARAMETERS | **Not supported** |  |  |
| SYS.PARAMETERS | **Not supported** |  |  |
| SYS.PARTITIONS | **Not supported** |  |  |
| SYS.PERIODS | **Not supported** |  |  |
| SYS.SERVER_ASSEMBLY_MODULES | **Not supported** |  |  |
| SYS.SERVER_EVENTS | **Not supported** |  |  |
| SYS.SERVER_EVENT_NOTIFICATIONS | **Not supported** |  |  |
| SYS.SERVER_SQL_MODULE | **Not supported** |  |  |
| SYS.SERVER_TRIGGERS | **Not supported** |  |  |
| SYS._SERVER_TRIGGER_EVENTS | **Not supported** |  |  |
| SYS.SQL_DEPENDENCIES | **Not supported** |  |  |
| SYS.SQL_EXPRESSION_DEPENDENCIES | **Not supported** |  |  |
| SYS.SQL_MODULES | **Not supported** |  |  |
| SYS.STATS | **Not supported** |  |  |
| SYS.STATS_COLUMNS | **Not supported** |  |  |
| SYS.SYNONYMS | **Not supported** |  |  |
| SYS.SYSTEM_COLUMNS | **Not supported** |  |  |
| SYS.SYSTEM_OBJECTS | **Not supported** |  |  |
| SYS.SYSTEM_PARAMETERS | **Not supported** |  |  |
| SYS.SYSCONSTRAINTS | INFORMATION_SCHEMA.TABLE_CONSTRAINTS |  |  |
| SYS.SYSTEM_SQL_MODULES” | **Not supported** |  |  |

## SYSCONSTRAINTS

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The `sysconstraints` compatibility view maps constraint IDs to the objects and tables they belong to. It is a legacy system table from earlier SQL Server versions ([SQL Server documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-compatibility-views/sys-sysconstraints-transact-sql)).

SnowConvert AI transforms queries against `sysconstraints` (or `sys.sysconstraints`) into queries against Snowflake’s `INFORMATION_SCHEMA.TABLE_CONSTRAINTS`. The transformation also rewrites common `OBJECT_NAME()` patterns:

| sysconstraints pattern | Snowflake equivalent |
| --- | --- |
| `OBJECT_NAME(constid) = 'X'` | `CONSTRAINT_NAME = 'X'` |
| `OBJECT_NAME(id) = 'Y'` | `TABLE_NAME = 'Y'` |

When `OBJECT_NAME()` is called with an unrecognized argument or compared against a non-literal value, the expression is preserved and [SSC-EWI-TS0104](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md) is emitted.

### Sample Source Patterns

#### 1. Basic sysconstraints query

##### SQL Server

```sql
SELECT 1 FROM sysconstraints WHERE OBJECT_NAME(constid) = 'CPK' AND OBJECT_NAME(id) = 'T';
```

##### Snowflake

```sql
SELECT
  1
FROM
  INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
  CONSTRAINT_NAME = 'CPK'
  AND TABLE_NAME = 'T';
```

#### 2. Qualified sys.sysconstraints

##### SQL Server

```sql
SELECT 1 FROM sys.sysconstraints WHERE OBJECT_NAME(constid) = 'PK_Orders' AND OBJECT_NAME(id) = 'Orders';
```

##### Snowflake

```sql
SELECT
  1
FROM
  INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
  CONSTRAINT_NAME = 'PK_Orders'
  AND TABLE_NAME = 'Orders';
```

#### 3. sysconstraints inside IF EXISTS with DROP CONSTRAINT

##### SQL Server

```sql
IF ( EXISTS(SELECT 1 FROM sysconstraints WHERE OBJECT_NAME(constid) = 'LoanDynamicMBSLoadCPK'))
BEGIN
  ALTER TABLE [dbo].[LoanDynamicMBSLoad] DROP CONSTRAINT [LoanDynamicMBSLoadCPK] WITH ( ONLINE = OFF )
END
GO
```

##### Snowflake

```sql
BEGIN
  IF ((EXISTS (
    SELECT
      1
    FROM
      INFORMATION_SCHEMA.TABLE_CONSTRAINTS
    WHERE
      CONSTRAINT_NAME = 'LoanDynamicMBSLoadCPK'
  ))) THEN
    BEGIN
      ALTER TABLE IF EXISTS dbo.LoanDynamicMBSLoad DROP CONSTRAINT LoanDynamicMBSLoadCPK;
    END;
  END IF;
END;
```

> **Note:**
>
> The `WITH ( ONLINE = OFF )` clause is removed because Snowflake does not support index options on `DROP CONSTRAINT`. The `IF EXISTS` at the constraint level is also stripped because Snowflake does not support it in that position.

#### 4. Unmapped argument emits EWI

##### SQL Server

```sql
SELECT 1 FROM sysconstraints WHERE OBJECT_NAME(status) = 'X';
```

##### Snowflake

```sql
SELECT
  1
FROM
  INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
  !!!RESOLVE EWI!!! /*** SSC-EWI-TS0104 - 'OBJECT_NAME(status) in sysconstraints query' COULD NOT BE AUTOMATICALLY CONVERTED IN SYSTEM TABLE QUERY. MANUAL REVIEW REQUIRED ***/!!!
  OBJECT_NAME(status) = 'X';
```

### Known Issues

#### 1. Only `OBJECT_NAME(constid)` and `OBJECT_NAME(id)` are automatically mapped

Other arguments to `OBJECT_NAME()` inside sysconstraints queries cannot be automatically resolved and will emit [SSC-EWI-TS0104](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md).

#### 2. Non-literal comparisons are not converted

If `OBJECT_NAME()` is compared to a column reference, variable, or expression (instead of a string literal), the expression is preserved with an EWI annotation.

#### 3. JOIN statements with sysconstraints are not supported

Queries that join `sysconstraints` with other tables are not automatically translated.

### Related EWIs

1. [SSC-EWI-TS0104](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): System table query pattern could not be automatically converted.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## SYS.FOREIGN_KEYS

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Contains a row per object that is a FOREIGN KEY constraint ([SQLServer Documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-foreign-keys-transact-sql?view=sql-server-ver16)).

The columns for **FOREIGN KEY** (sys.foreign_keys) are the following:

| Column name | Data type | Description | Has equivalent column in Snowflake |
| --- | --- | --- | --- |
|  | - | For a list of columns that this view inherits, see [sys.objects (Transact-SQL).](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-objects-transact-sql?view=sql-server-ver16) | Partial |
| referenced_object_id | int | ID of the referenced object. | No |
| key_index_id | int | ID of the key index within the referenced object. | No |
| is_disabled | bit | FOREIGN KEY constraint is disabled. | No |
| is_not_for_replication | bit | FOREIGN KEY constraint was created by using the NOT FOR REPLICATION option. | No |
| is_not_trusted | bit | FOREIGN KEY constraint has not been verified by the system. | No |
| delete_referential_action | tinyint | The referential action that was declared for this FOREIGN KEY when a delete happens. See [SQLServer Documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-foreign-keys-transact-sql?view=sql-server-ver16). | No |
| delete_referential_action_desc | nvarchar(60) | Description of the referential action that was declared for this FOREIGN KEY when a delete occurs. See [SQLServer Documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-foreign-keys-transact-sql?view=sql-server-ver16). | No |
| update_referential_action | tinyint | The referential action that was declared for this FOREIGN KEY when an update happens. See [SQLServer Documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-foreign-keys-transact-sql?view=sql-server-ver16). | No |
| update_referential_action_desc | nvarchar(60) | Description of the referential action that was declared for this FOREIGN KEY when an update happens. See [SQLServer Documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-foreign-keys-transact-sql?view=sql-server-ver16). | No |
| is_system_named | bit | 1 = Name was generated by the system.  0 = Name was supplied by the user. | No |

The inherited columns from **sys.objects** are the following:

For more information, review the [sys.objects documentation](https://learn.microsoft.com/en-us/sql/relational-databases/system-catalog-views/sys-objects-transact-sql?view=sql-server-ver16).

| Column name | Data type | Description | Has equivalent column in Snowflake |
| --- | --- | --- | --- |
| name | sysname | Object name. | Yes |
| object_id | int | Object identification number. Is unique within a database. | No |
| principal_id | int | ID of the individual owner, if different from the schema owner. | No |
| schema_id | int | ID of the schema that the object is contained in. | No |
| parent_object_id | int | ID of the object to which this object belongs. | No |
| type | char(2) | Object type | Yes |
| type_desc | nvarchar(60) | Description of the object type | Yes |
| create_date | datetime | Date the object was created. | Yes |
| modify_date | datetime | Date the object was last modified by using an ALTER statement. | Yes |
| is_ms_shipped | bit | Object is created by an internal SQL Server component. | No |
| is_published | bit | Object is created by an internal SQL Server component. | No |
| is_schema_published | bit | Only the schema of the object is published. | No |

> **Warning:**
>
> Notice that, in this case, for the sys.foreign_keys, there is no equivalence in Snowflake. But, the equivalence is made under the columns inherited from sys.objects.

#### Applicable column equivalence

| SQLServer | Snowflake | Limitations | Applicable |
| --- | --- | --- | --- |
| name | CONSTRAINT_NAME | Names auto-generated by the database may be reviewed to the target Snowflake auto-generated name, | Yes |
| type | CONSTRAINT_TYPE | The type column has a variety of options. But, in this case, the support is only for the letter ‘F’ which represents the foreign keys. | No. Because of the extra validation to determine the foreign keys from all table constraints, it is not applicable. |
| type_desc | CONSTRAINT_TYPE | No limitations found. | No. Because of the extra validation to determine the foreign keys from all table constraints, it is not applicable. |
| create_date | CREATED | Data type differences. | Yes |
| modify_date | LAST_ALTERED | Data type differences. | Yes |
| parent_object_id | CONSTRAINT_CATALOG, CONSTRAINT_SCHEMA, TABLE_NAME | Columns are generated only for the cases that use the OBJECT_ID() function and, the name has a valid pattern. | Yes |

##### Syntax in SQL Server

```sql
SELECT ('column_name' | * )
FROM sys.foreign_keys;
```

##### Syntax in Snowflake

```sql
SELECT ('column_name' | * )
FROM information_schema.table_constraints
WHERE CONSTRAINT_TYPE = 'FOREIGN KEY';
```

> **Note:**
>
> Since the equivalence for the system foreign keys is the catalog view in Snowflake for in ormation_schema.table_constraints, it is necessary to define the type of the constraint in an additional ‘WHERE’ clause to identify foreign key constraints from other constraints.

### Sample Source Patterns

To accomplish correctly the following samples, it is required to run the following statements:

#### SQL Server

```sql
CREATE TABLE Customers (
    CustomerID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    Email VARCHAR(100)
);

CREATE TABLE Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    OrderDate DATE,
    TotalAmount DECIMAL(10, 2),
    CONSTRAINT FK_Name_Test FOREIGN KEY (CustomerID) REFERENCES Customers(CustomerID)
);

INSERT INTO Customers (CustomerID, FirstName, LastName, Email)
VALUES
    (1, 'John', 'Doe', 'john.doe@example.com'),
    (2, 'Jane', 'Smith', 'jane.smith@example.com');

INSERT INTO Orders (OrderID, CustomerID, OrderDate, TotalAmount)
VALUES
    (101, 1, '2023-09-01', 100.50),
    (102, 1, '2023-09-02', 75.25),
    (103, 2, '2023-09-03', 50.00);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE Customers (
    CustomerID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    Email VARCHAR(100)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

CREATE OR REPLACE TABLE Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    OrderDate DATE,
    TotalAmount DECIMAL(10, 2),
       CONSTRAINT FK_Name_Test FOREIGN KEY (CustomerID) REFERENCES Customers (CustomerID)
   )
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

INSERT INTO Customers (CustomerID, FirstName, LastName, Email)
VALUES
    (1, 'John', 'Doe', 'john.doe@example.com'),
    (2, 'Jane', 'Smith', 'jane.smith@example.com');

INSERT INTO Orders (OrderID, CustomerID, OrderDate, TotalAmount)
VALUES
    (101, 1, '2023-09-01', 100.50),
    (102, 1, '2023-09-02', 75.25),
    (103, 2, '2023-09-03', 50.00);
```

#### 1. Simple Select Case

##### SQL Server

```sql
SELECT *
FROM sys.foreign_keys;
```

##### Result

| name | object_id | principal_id | schema_id | type | type_desc | create_date | modify_date | parent_object_id | is_ms_shipped | is_published | is_schema_published | referenced_object_id | key_index_id | is_disabled | is_not_for_replication | is_not_trusted | delete_referential_action | delete_referential_action_desc | update_referential_action | update_referential_action_desc | is_system_named |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FK_Name_Test | 1719677174 | NULL | 1 | F | FOREIGN_KEY_CONSTRAINT | 2023-09-11 22:20:04.160 | 2023-09-11 22:20:04.160 | 1687677060 | false | true | false | 1655676946 | 1 | false | false |  | 0 | NO_ACTION | 0 | NO_ACTION | true |

##### Snowflake

```sql
SELECT *
FROM
INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME | TABLE_CATALOG | TABLE_SCHEMA | TABLE_NAME | CONSTRAINT_TYPE | IS_DEFERRABLE | INITIALLY_DEFERRED | ENFORCED | COMMENT | CREATED | LAST_ALTERED | RELY |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DBTEST | PUBLIC | FK_Name_Test | DATETEST | PUBLIC | ORDERS | FOREIGN KEY | NO | YES | NO | null | 2023-09-11 15:23:51.969 -0700 | 2023-09-11 15:23:52.097 -0700 | NO |

> **Warning:**
>
> Results differ due to the differences in column objects and missing equivalence. The result may be checked.

#### 2. Name Column Case

##### SQL Server

```sql
SELECT * FROM sys.foreign_keys WHERE name = 'FK_Name_Test';
```

##### Result

| name | object_id | principal_id | schema_id | type | type_desc | create_date | modify_date | parent_object_id | is_ms_shipped | is_published | is_schema_published | referenced_object_id | key_index_id | is_disabled | is_not_for_replication | is_not_trusted | delete_referential_action | delete_referential_action_desc | update_referential_action | update_referential_action_desc | is_system_named |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FK_Name_Test | 1719677174 | NULL | 1 | F | FOREIGN_KEY_CONSTRAINT | 2023-09-11 22:20:04.160 | 2023-09-11 22:20:04.160 | 1687677060 | false | true | false | 1655676946 | 1 | false | false |  | 0 | NO_ACTION | 0 | NO_ACTION | true |

##### Snowflake

```sql
SELECT * FROM
INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
CONSTRAINT_NAME = 'FK_NAME_TEST'
AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME | TABLE_CATALOG | TABLE_SCHEMA | TABLE_NAME | CONSTRAINT_TYPE | IS_DEFERRABLE | INITIALLY_DEFERRED | ENFORCED | COMMENT | CREATED | LAST_ALTERED | RELY |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DBTEST | PUBLIC | FK_Name_Test | DATETEST | PUBLIC | ORDERS | FOREIGN KEY | NO | YES | NO | null | 2023-09-11 15:23:51.969 -0700 | 2023-09-11 15:23:52.097 -0700 | NO |

> **Warning:**
>
> This translation may require verification if the constraint name is auto-generated by the database and used in the query. For more information review the Known Issues section.

#### 3. Parent Object ID Case

In this example, a database and schema were created to exemplify the processing of the names to create different and equivalent columns.

##### SQL Server

```sql
use database_name_test
create schema schema_name_test

CREATE TABLE schema_name_test.Customers (
    CustomerID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    Email VARCHAR(100)
);

CREATE TABLE schema_name_test.Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    OrderDate DATE,
    TotalAmount DECIMAL(10, 2),
    CONSTRAINT FK_Name_Test FOREIGN KEY (CustomerID) REFERENCES schema_name_test.Customers(CustomerID)
);

INSERT INTO schema_name_test.Customers (CustomerID, FirstName, LastName, Email)
VALUES
    (1, 'John', 'Doe', 'john.doe@example.com'),
    (2, 'Jane', 'Smith', 'jane.smith@example.com');

INSERT INTO schema_name_test.Orders (OrderID, CustomerID, OrderDate, TotalAmount)
VALUES
    (101, 1, '2023-09-01', 100.50),
    (102, 1, '2023-09-02', 75.25),
    (103, 2, '2023-09-03', 50.00);

SELECT * FROM sys.foreign_keys WHERE name = 'FK_Name_Test' AND parent_object_id = OBJECT_ID(N'database_name_test.schema_name_test.Orders')
```

##### Result

| name | object_id | principal_id | schema_id | type | type_desc | create_date | modify_date | parent_object_id | is_ms_shipped | is_published | is_schema_published | referenced_object_id | key_index_id | is_disabled | is_not_for_replication | is_not_trusted | delete_referential_action | delete_referential_action_desc | update_referential_action | update_referential_action_desc | is_system_named |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FK_Name_Test | 1719677174 | NULL | 1 | F | FOREIGN_KEY_CONSTRAINT | 2023-09-11 22:20:04.160 | 2023-09-11 22:20:04.160 | 1687677060 | false | true | false | 1655676946 | 1 | false | false |  | 0 | NO_ACTION | 0 | NO_ACTION | true |

##### Snowflake

```sql
USE DATABASE database_name_test;

CREATE SCHEMA IF NOT EXISTS schema_name_test
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;

CREATE OR REPLACE TABLE schema_name_test.Customers (
    CustomerID INT PRIMARY KEY,
    FirstName VARCHAR(50),
    LastName VARCHAR(50),
    Email VARCHAR(100)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;

CREATE OR REPLACE TABLE schema_name_test.Orders (
    OrderID INT PRIMARY KEY,
    CustomerID INT,
    OrderDate DATE,
    TotalAmount DECIMAL(10, 2),
       CONSTRAINT FK_Name_Test FOREIGN KEY (CustomerID) REFERENCES schema_name_test.Customers (CustomerID)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO schema_name_test.Customers (CustomerID, FirstName, LastName, Email)
VALUES
    (1, 'John', 'Doe', 'john.doe@example.com'),
    (2, 'Jane', 'Smith', 'jane.smith@example.com');

INSERT INTO schema_name_test.Orders (OrderID, CustomerID, OrderDate, TotalAmount)
VALUES
    (101, 1, '2023-09-01', 100.50),
    (102, 1, '2023-09-02', 75.25),
    (103, 2, '2023-09-03', 50.00);

SELECT * FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    CONSTRAINT_NAME = 'FK_NAME_TEST'
    AND CONSTRAINT_CATALOG = 'DATABASE_NAME_TEST'
    AND CONSTRAINT_SCHEMA = 'SCHEMA_NAME_TEST'
    AND TABLE_NAME = 'ORDERS'
    AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME | TABLE_CATALOG | TABLE_SCHEMA | TABLE_NAME | CONSTRAINT_TYPE | IS_DEFERRABLE | INITIALLY_DEFERRED | ENFORCED | COMMENT | CREATED | LAST_ALTERED | RELY |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DATABASE_NAME_TEST | SCHEMA_NAME_TEST | FK_Name_Test | DATABASE_NAME_TEST | SCHEMA_NAME_TEST | ORDERS | FOREIGN KEY | NO | YES | NO | null | 2023-09-11 15:23:51.969 -0700 | 2023-09-11 15:23:52.097 -0700 | NO |

> **Warning:**
>
> If the name coming inside the OBJECT_ID() function does not have a valid pattern, it will not be converted due to name processing limitations on special characters.

> **Warning:**
>
> Review the database that is being used in Snowflake.

#### 4. Type Column Case

The ‘F’ in SQL Server means ‘Foreign Key’ and it is removed due to the validation at the ending to specify the foreign key from all the table constraints.

##### SQL Server

```sql
 SELECT * FROM sys.foreign_keys WHERE type = 'F';
```

##### Result

| name | object_id | principal_id | schema_id | type | type_desc | create_date | modify_date | parent_object_id | is_ms_shipped | is_published | is_schema_published | referenced_object_id | key_index_id | is_disabled | is_not_for_replication | is_not_trusted | delete_referential_action | delete_referential_action_desc | update_referential_action | update_referential_action_desc | is_system_named |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FK_Name_Test | 1719677174 | NULL | 3 | F | FOREIGN_KEY_CONSTRAINT | 2023-09-11 22:20:04.160 | 2023-09-11 22:20:04.160 | 1687677060 | false | true | false | 1655676946 | 1 | false | false |  | 0 | NO_ACTION | 0 | NO_ACTION | true |

##### Snowflake

```sql
 SELECT * FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    type = 'F' AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME | TABLE_CATALOG | TABLE_SCHEMA | TABLE_NAME | CONSTRAINT_TYPE | IS_DEFERRABLE | INITIALLY_DEFERRED | ENFORCED | COMMENT | CREATED | LAST_ALTERED | RELY |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DBTEST | PUBLIC | FK_Name_Test | DATETEST | PUBLIC | ORDERS | FOREIGN KEY | NO | YES | NO | null | 2023-09-11 15:23:51.969 -0700 | 2023-09-11 15:23:52.097 -0700 | NO |

#### 5. Type Desc Column Case

The ‘type_desc’ column is removed due to the validation at the ending to specify the foreign key from all the table constraints.

##### SQL Server

```sql
SELECT
    *
FROM
    sys.foreign_keys
WHERE
    type_desc = 'FOREIGN_KEY_CONSTRAINT';
```

##### Result

| name | object_id | principal_id | schema_id | type | type_desc | create_date | modify_date | parent_object_id | is_ms_shipped | is_published | is_schema_published | referenced_object_id | key_index_id | is_disabled | is_not_for_replication | is_not_trusted | delete_referential_action | delete_referential_action_desc | update_referential_action | update_referential_action_desc | is_system_named |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| FK_Name_Test | 1719677174 | NULL | 3 | F | FOREIGN_KEY_CONSTRAINT | 2023-09-11 22:20:04.160 | 2023-09-11 22:20:04.160 | 1687677060 | false | true | false | 1655676946 | 1 | false | false |  | 0 | NO_ACTION | 0 | NO_ACTION | true |

##### Snowflake

```sql
SELECT
    *
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    type_desc = 'FOREIGN_KEY_CONSTRAINT' AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_CATALOG | CONSTRAINT_SCHEMA | CONSTRAINT_NAME | TABLE_CATALOG | TABLE_SCHEMA | TABLE_NAME | CONSTRAINT_TYPE | IS_DEFERRABLE | INITIALLY_DEFERRED | ENFORCED | COMMENT | CREATED | LAST_ALTERED | RELY |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| DBTEST | PUBLIC | FK_Name_Test | DATETEST | PUBLIC | ORDERS | FOREIGN KEY | NO | YES | NO | null | 2023-09-11 15:23:51.969 -0700 | 2023-09-11 15:23:52.097 -0700 | NO |

#### 6. Modify Date Column Simple Case

##### SQL Server

```sql
SELECT *
FROM sys.foreign_keys
WHERE modify_date = CURRENT_TIMESTAMP;
```

##### Result

```none
The query produced no results.
```

##### Snowflake

```sql
SELECT *
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    LAST_ALTERED = CURRENT_TIMESTAMP()
    AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

```none
The query produced no results.
```

#### 7. Modify Date Column with DATEDIFF() Case

The following example shows a more complex scenario where the columns from sys.foreign_keys (inherited from sys.objects) are inside a function DATEDIFF. In this case, the argument corresponding to the applicable equivalence is changed to the corresponding column from the information.schema in Snowflake.

##### SQL Server

```sql
SELECT *
FROM sys.foreign_keys
WHERE DATEDIFF(DAY, modify_date, GETDATE()) <= 30;
```

##### Result

```none
The foreign keys altered in the last 30 days.
```

##### Snowflake

```sql
SELECT *
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    DATEDIFF(DAY, LAST_ALTERED, CURRENT_TIMESTAMP() :: TIMESTAMP) <= 30
    AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

```none
The foreign keys altered in the last 30 days.
```

#### 8. Create Date Column Case

##### SQL Server

```sql
SELECT *
FROM sys.foreign_keys
WHERE create_date = '2023-09-12 14:36:38.060';
```

##### Result

```none
The foreign keys that were created on the specified date and time.
```

##### Snowflake

```sql
SELECT *
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    CREATED = '2023-09-12 14:36:38.060'
    AND CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

```none
The foreign keys that were created on the specified date and time.
```

> **Warning:**
>
> The result may change if the creation date is specific due to the time on which the queries were executed. It is possible to execute a specified query at one time on the origin database and then execute the objects at another time in the new Snowflake queries.

#### 9. Selected Columns Single Name Case

##### SQL Server

```sql
SELECT name
FROM sys.foreign_keys;
```

##### Result

| name |
| --- |
| FK_Name_Test |

##### Snowflake

```sql
SELECT
    CONSTRAINT_NAME
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_NAME |
| --- |
| FK_Name_Test |

#### 10. Selected Columns Qualified Name Case

##### SQL Server

```sql
SELECT
    fk.name
FROM sys.foreign_keys AS fk;
```

##### Result

| name |
| --- |
| FK_Name_Test |

##### Snowflake

```sql
SELECT
    fk.CONSTRAINT_NAME
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS AS fk
WHERE
    CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### Result

| CONSTRAINT_NAME |
| --- |
| FK_Name_Test |

### Known Issues

#### 1. The ‘name’ column may not show a correct output if the constraint does not have a user-created name

If the referenced name is one auto-generated from the database, it would be probable to review it and use the wanted value.

##### 2. When selecting columns, there is a limitation that depends on the applicable columns that are equivalent in Snowflake

Since the columns from sys.foreign_keys are not completely equivalent in Snowflake, some results may change due to the limitations on the equivalence.

##### 3. The OBJECT_ID() function may have a valid pattern to be processed or the database, schema or table could not be extracted

Based on the name that receives the OBJECT_ID() function, the processing of this name will be limited and dependent on formatting.

##### 4. Name Column With OBJECT_NAME() Function Case

Since the OBJECT_NAME() function is not supported yet, the transformations related to this function are not supported.

##### SQL Server

```sql
SELECT name AS ForeignKeyName,
    OBJECT_NAME(parent_object_id) AS ReferencingTable,
    OBJECT_NAME(referenced_object_id) AS ReferencedTable
FROM sys.foreign_keys;
```

##### Snowflake

```sql
SELECT
    name AS ForeignKeyName,
    OBJECT_NAME(parent_object_id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'OBJECT_NAME' NODE ***/!!! AS ReferencingTable,
    OBJECT_NAME(referenced_object_id) !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'OBJECT_NAME' NODE ***/!!! AS ReferencedTable
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    CONSTRAINT_TYPE = 'FOREIGN KEY';
```

##### 5. SCHEMA_NAME() and TYPE_NAME() functions are also not supported yet.

##### 6. Different Join statement types may be not supported if the system table is not supported. Review the supported system tables.

##### 7. Cases with JOIN statements are not supported.

##### 8. Names with alias AS are not supported.

### Related EWIs

1. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

---
title: SnowConvert AI - SQL Server-Azure Synapse - Views
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/transact-create-view.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse - Views

Applies to

* SQL Server
* Azure Synapse Analytics
> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

In this section, we will check the transformation for the create view.

## Sample Source Patterns

### SIMPLE CREATE VIEW

The following example shows a transformation for a simple `CREATE VIEW` statement.

#### Transact

```sql
CREATE VIEW VIEWNAME
AS
SELECT AValue from ATable;
```

##### Snowflake

```sql
CREATE OR REPLACE VIEW VIEWNAME
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
SELECT
AValue
from
ATable;
```

## CREATE OR ALTER VIEW

The **CREATE OR ALTER** definition used in SQL Server is transformed to **CREATE OR REPLACE** in Snowflake.

### Transact

```sql
CREATE OR ALTER VIEW VIEWNAME
AS
SELECT AValue from ATable;
```

#### Snowflake

```sql
CREATE OR REPLACE VIEW VIEWNAME
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
SELECT
AValue
from
ATable;
```

## CREATE VIEW WITH

In this type of View, after the name of the View, the following clauses can come

* `WITH ENCRYPTION`
* `WITH SCHEMABINDING`
* `WITH VIEW_METADATA`

> **Warning:**
>
> Notice that the above clauses are removed from the translation. because they are not relevant in Snowflake syntax.

### Transact

```sql
CREATE OR ALTER VIEW VIEWNAME
WITH ENCRYPTION
AS
SELECT AValue from ATable;
```

### Snowflake

```sql
CREATE OR REPLACE VIEW VIEWNAME
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
SELECT
AValue
from
ATable;
```

## CREATE VIEW AS SELECT WITH CHECK OPTION

In this type of View, the clause **`WITH CHECK OPTION`** comes after the end of the Select statement used in the Create View.

> **Warning:**
>
> Notice that `WITH CHECK OPTION`is removed from the translation, because is not relevant in Snowflake syntax.

### Transact

```sql
CREATE OR ALTER VIEW VIEWNAME
AS
SELECT AValue from ATable
WITH CHECK OPTION;
```

### Snowflake

```sql
CREATE OR REPLACE VIEW VIEWNAME
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
SELECT
AValue
from
ATable;
```

## CREATE VIEW AS COMMON TABLE EXPRESSION

Common Table Expressions must be used to retrieve the data:

### Transact

```sql
CREATE VIEW EMPLOYEEIDVIEW
AS
WITH CTE AS ( SELECT NationalIDNumber from [HumanResources].[Employee]
UNION ALL
SELECT BusinessEntityID FROM [HumanResources].[EmployeeDepartmentHistory] )
SELECT * FROM MyCTE;
```

### Snowflake

```sql
CREATE OR REPLACE VIEW EMPLOYEEIDVIEW
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
--** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
WITH CTE AS ( SELECT
NationalIDNumber
from
HumanResources.Employee
UNION ALL
SELECT
BusinessEntityID
FROM
HumanResources.EmployeeDepartmentHistory
)
SELECT
*
FROM
MyCTE;
```

## UNSUPPORTED SCENARIOS

Common table expressions with Update, Insert or Delete statements will be commented out because they are not supported in Snowflake and SQLServer.

In the case where an invalid CTE is added to the view, this will be completely commented out.

```sql
 --!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - COMMON TABLE EXPRESSION IN VIEW NOT SUPPORTED ***/!!!
--CREATE OR REPLACE VIEW PUBLIC.EmployeeInsertVew
--AS
--WITH MyCTE AS ( SELECT
--NationalIDNumber
--from
--HumanResources.Employee
--UNION ALL
--SELECT
--BusinessEntityID
--FROM
--HumanResources.EmployeeDepartmentHistory
--)
--INSERT INTO PUBLIC.Dummy
```

### FINAL SAMPLE

Let’s see a final sample, let’s put together all the cases that we have seen so far and see how the transformation would be

#### Transact

```sql
CREATE OR ALTER VIEW VIEWNAME
WITH ENCRYPTION
AS
Select AValue from ATable
WITH CHECK OPTION;
```

##### Snowflake

```sql
CREATE OR REPLACE VIEW VIEWNAME
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
Select
AValue
from
ATable;
```

As you can see, we changed the **OR ALTER** with **OR REPLACE** and we removed the clause **WITH ENCRYPTION** that comes after the view name and the **WITH CHECK OPTION** that comes after the Select.

### Related EWIs

1. [SSC-PRF-TS0001](../../general/technical-documentation/issues-and-troubleshooting/performance-review/sqlServerPRF.md): Performance warning - recursion for CTE not checked. Might require a recursive keyword.

---
title: SnowConvert AI - SQL Server-Azure Synapse Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse Functional Differences

Applies to

* SQL Server
* Azure Synapse Analytics

## SSC-FDM-TS0001

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TS0077](../conversion-issues/sqlServerEWI.md) documentation

### Description

This message is shown when there is a collate clause that is not supported in Snowflake.

#### Code example

##### Input Code:

```sql
 SELECT 'a' COLLATE Albanian_BIN;

SELECT 'a' COLLATE Albanian_CI_AI;

CREATE TABLE ExampleTable (
    ID INT,
    Name VARCHAR(50) COLLATE collateName
);
```

##### Generated Code:

```sql
 SELECT 'a'
--           --** SSC-FDM-TS0001 - COLLATION Albanian_BIN NOT SUPPORTED **
--           COLLATE Albanian_BIN
                               ;

SELECT 'a'
--           --** SSC-FDM-TS0001 - COLLATION Albanian_CI_AI NOT SUPPORTED **
--           COLLATE Albanian_CI_AI
                                 ;

CREATE OR REPLACE TABLE ExampleTable (
    ID INT,
    Name VARCHAR(50)
--                     --** SSC-FDM-TS0001 - COLLATION collateName NOT SUPPORTED **
--                     COLLATE collateName
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0002

### Description

This message is shown when there is a collate clause that is not supported in Snowflake.

#### Code Example

##### Input Code:

```sql
 SELECT 'a' COLLATE Latin1_General_CI_AS_WS;
```

##### Generated Code:

```sql
 SELECT 'a' COLLATE 'EN-CI-AS' /*** SSC-FDM-TS0002 - COLLATION FOR VALUE WS NOT SUPPORTED ***/;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0003

XP_LOGININFO mapped to custom UDF

### Description

This message is shown when the XP_LOGININFO procedure is executed and returns the following set of columns ([See SQL SERVER documentation for more info](https://learn.microsoft.com/en-us/sql/relational-databases/system-stored-procedures/xp-logininfo-transact-sql?view=sql-server-ver16))

|  |  |  |  |  |
| --- | --- | --- | --- | --- |
| account name | type | privilege | mapped login name | permission path |

To replicate this behavior, there is a query that select the columns from the APPLICABLE_ROLES view in Snowflake, which returns the following set of columns ([See Snowflake documentation for more info](https://docs.snowflake.com/en/sql-reference/info-schema/applicable_roles.html))

| GRANTEE | ROLE_NAME | ROLE_OWNER | IS_GRANTABLE |
| --- | --- | --- | --- |

SQL Server original columns are mapped as shown in the next table. They may be not completely equivalent.

| SQL Server | Snowflake |  |
| --- | --- | --- |
| account name | GRANTEE |  |
| type | ROLE_OWNER |  |
| privilege | ROLE_NAME |  |
| mapped login name | GRANTEE |  |
| permission path | NULL |  |

#### Example code

##### Input code:

```sql
 EXEC xp_logininfo

EXEC xp_logininfo 'USERNAME'
```

##### Generated Code:

```sql
 --** SSC-FDM-TS0003 - XP_LOGININFO MAPPED TO CUSTOM UDF XP_LOGININFO_UDF AND MIGHT HAVE DIFFERENT BEHAVIOR **
SELECT
*
FROM
TABLE(XP_LOGININFO_UDF());

--** SSC-FDM-TS0003 - XP_LOGININFO MAPPED TO CUSTOM UDF XP_LOGININFO_UDF AND MIGHT HAVE DIFFERENT BEHAVIOR **
SELECT
*
FROM
TABLE(XP_LOGININFO_UDF('USERNAME'));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0004

### Description

This message is shown when a `BULK INSERT` was transformed and a `PUT` command is added to the output code. It happens because the `PUT` command cannot be executed using the SnowSQL Web UI. To successfully execute it, any user should have the SnowCLI installed before.

#### Code Example

##### Input Code:

```sql
 BULK INSERT #temptable FROM 'path/to/file.txt'
WITH
(
   FIELDTERMINATOR ='\t',
   ROWTERMINATOR ='\n'
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE FILE FORMAT FILE_FORMAT_638466175888203490
FIELD_DELIMITER = '\t'
RECORD_DELIMITER = '\n';

CREATE OR REPLACE STAGE STAGE_638466175888203490
FILE_FORMAT = FILE_FORMAT_638466175888203490;

--** SSC-FDM-TS0004 - PUT STATEMENT IS NOT SUPPORTED ON WEB UI. YOU SHOULD EXECUTE THE CODE THROUGH THE SNOWFLAKE CLI **
PUT file://path/to/file.txt @STAGE_638466175888203490 AUTO_COMPRESS = FALSE;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "#temptable" **
COPY INTO T_temptable FROM @STAGE_638466175888203490/file.txt;
```

#### Best Practices

* Install SnowCLI.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0005

TRY_CONVERT/TRY_CAST could not be converted to TRY_CAST

### Description

This FDM is added when a TRY_CONVERT or TRY_CAST cannot be converted to a TRY_CAST in Snowflake.

[Snowflake’s TRY_CAST](https://docs.snowflake.com/en/sql-reference/functions/try_cast) function has a limitation as it only allows the conversion of string expressions. However, Transact’s `TRY_CONVERT` and `TRY_CAST` functions allow any data type expression.

Currently, the transformation from `TRY_CONVERT` or `TRY_CAST` to Snowflake’s `TRY_CAST` is only performed for string expressions or expressions that the tool can identify as strings in its context.

#### Code Example

##### Input Code:

```sql
 SELECT TRY_CAST(14.85 AS INT);
SELECT TRY_CONVERT(VARCHAR, 1234);
SELECT TRY_CONVERT(CHAR, 1);
SELECT TRY_CONVERT(SQL_VARIANT, '2017-01-01 12:00:00');
SELECT TRY_CONVERT(GEOGRAPHY, 'LINESTRING(-122.360 47.656, -122.343 47.656 )');
```

##### Generated Code:

```sql
 SELECT
CAST(14.85 AS INT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/;
SELECT
TO_VARCHAR(1234);
SELECT
TO_CHAR(1);
SELECT
TO_VARIANT('2017-01-01 12:00:00');
SELECT
TO_GEOGRAPHY('LINESTRING(-122.360 47.656, -122.343 47.656 )');
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0006

EXECUTE AS ‘user_name’ clause does not exist in Snowflake and the user calling the procedure should have all the required privileges.

### Description

This message is shown when SnowConvert AI finds a procedure with an `EXECUTE AS 'user_name'` clause. This is not supported in Snowflake, so it is changed `EXECUTE AS CALLER.`

This clause specifies the security context under which to execute the procedure.

> **Note:**
>
> For more details see the [documentation](https://learn.microsoft.com/en-us/sql/t-sql/statements/execute-as-clause-transact-sql?view=sql-server-ver16&amp;tabs=sqlserver) about the clause functionality.

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE SelectAllCustomers
WITH EXECUTE AS 'user_name'
AS
BEGIN
      SELECT * FROM Customers;
END;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "Customers" **
CREATE OR REPLACE PROCEDURE SelectAllCustomers ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
--** SSC-FDM-TS0006 - EXECUTE AS 'user_name' CLAUSE DOES NOT EXIST IN SNOWFLAKE AND THE USER CALLING THE PROCEDURE SHOULD HAVE ALL THE REQUIRED PRIVILEGES **
AS
$$
      DECLARE
            ProcedureResultSet RESULTSET;
      BEGIN
            ProcedureResultSet := (
            SELECT
                  *
            FROM
                  Customers);
            RETURN TABLE(ProcedureResultSet);
      END;
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0007

FOR REPLICATION clause does not exist in Snowflake.

### Description

This message is shown when SnowConvert AI finds a procedure with a `FOR REPLICATION` clause. This is not supported in Snowflake, so it is removed.

This clause specifies that the procedure is created for replication. Consequently, it can’t be executed on the Subscriber.

> **Note:**
>
> For more details see the [documentation](https://learn.microsoft.com/en-us/sql/t-sql/statements/create-procedure-transact-sql?view=sql-server-ver16#for-replication) about the clause functionality.

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE SelectAllCustomers
WITH FOR REPLICATION
AS
BEGIN
      SELECT * FROM Customers;
END;
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "Customers" **
CREATE OR REPLACE PROCEDURE SelectAllCustomers ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
--** SSC-FDM-TS0007 - FOR REPLICATION CLAUSE DOES NOT EXIST IN SNOWFLAKE **
AS
$$
      DECLARE
            ProcedureResultSet RESULTSET;
      BEGIN
            ProcedureResultSet := (
            SELECT
                  *
            FROM
                  Customers);
            RETURN TABLE(ProcedureResultSet);
      END;
$$;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0008

FORMATMESSAGE function was converted to UDF

### Description

This Warning is added because the `FORMATMESSAGE` function is being used and it was replaced by `FORMATMESSAGE_UDF`. The reason to add the warning is because the `FORMATMESSAGE_UDF` used to replace the `FORMATMESSAGE` does not handle properly all kinds of formats and it may throw an error on certain conditions.

Unsigned numerical values that are given as negative will preserve the sign instead of converting the value. Also, the `%I64d` placeholder is not supported by the UDF so it will throw an error when it is used.

In the FORMATMESSAGE_UDF, an error will happen if the given number of arguments is different than the number of placeholders.

This UDF does not support using message number IDs.

#### Code Example

##### Input Code:

```sql
 SELECT FORMATMESSAGE('Unsigned int %u, %u', 50, -50); -- Unsigned int 50, 4294967246
SELECT FORMATMESSAGE('Unsigned octal %o, %o', 50, -50); -- Unsigned octal 62, 37777777716
SELECT FORMATMESSAGE('Unsigned hexadecimal %X, %x', -11, -50); -- Unsigned hexadecimal FFFFFFF5, ffffffce
SELECT FORMATMESSAGE('Unsigned octal with prefix: %#o', -50); -- Unsigned octal with prefix: 037777777716
SELECT FORMATMESSAGE('Unsigned hexadecimal with prefix: %#X, %x', -11,-50); -- Unsigned hexadecimal with prefix: 0XFFFFFFF5, ffffffce
SELECT FORMATMESSAGE('Bigint %I64d', 3000000000); -- Bigint 3000000000
SELECT FORMATMESSAGE('My message: %s %s %s', 'Hello', 'World'); -- My message: Hello World (null)
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Unsigned int %u, %u', ARRAY_CONSTRUCT(50, -50)); -- Unsigned int 50, 4294967246
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Unsigned octal %o, %o', ARRAY_CONSTRUCT(50, -50)); -- Unsigned octal 62, 37777777716
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Unsigned hexadecimal %X, %x', ARRAY_CONSTRUCT(-11, -50)); -- Unsigned hexadecimal FFFFFFF5, ffffffce
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Unsigned octal with prefix: %#o', ARRAY_CONSTRUCT(-50)); -- Unsigned octal with prefix: 037777777716
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Unsigned hexadecimal with prefix: %#X, %x', ARRAY_CONSTRUCT(-11, -50)); -- Unsigned hexadecimal with prefix: 0XFFFFFFF5, ffffffce
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('Bigint %I64d', ARRAY_CONSTRUCT(3000000000)); -- Bigint 3000000000
SELECT
--** SSC-FDM-TS0008 - FORMATMESSAGE WAS CONVERTED TO CUSTOM UDF FORMATMESSAGE_UDF AND IT MIGHT HAVE A DIFFERENT BEHAVIOR. **
FORMATMESSAGE_UDF('My message: %s %s %s', ARRAY_CONSTRUCT('Hello', 'World')); -- My message: Hello World (null)
```

#### Best Practices

* Avoid using `%I64d` placeholder in the message.
* Use directly the message as a string instead of using a message ID for the first argument.
* Make sure the number of placeholders is the same as the number of arguments after the message.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0009

Encrypted with not supported in Snowflake.

### Description

This warning is added when there is an `ENCRYPTED WITH` used in a Column Definition. Since this is not supported in Snowflake, it is being removed and a warning is added.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE [SCHEMA1].[TABLE1] (
    [COL1] NVARCHAR(60)
        ENCRYPTED WITH (
            COLUMN_ENCRYPTION_KEY = MyCEK,
            ENCRYPTION_TYPE = RANDOMIZED,
            ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256'
        )
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE SCHEMA1.TABLE1 (
    COL1 VARCHAR(60)
--    --** SSC-FDM-TS0009 - ENCRYPTED WITH NOT SUPPORTED IN SNOWFLAKE **
--           ENCRYPTED WITH (
--               COLUMN_ENCRYPTION_KEY = MyCEK,
--               ENCRYPTION_TYPE = RANDOMIZED,
--               ALGORITHM = 'AEAD_AES_256_CBC_HMAC_SHA_256'
--           )
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
   ;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0010

CURRENT_DATABASE function has different behavior in certain cases.

### Description

This EWI is added when the function DB_NAME is transformed to CURRENT_DATABASE because Snowflake does not support the database_id parameter and the CURRENT_DATABASE function will always return the current database name.

#### Code Example

##### Input Code:

```sql
 SELECT DB_NAME(someId);
```

##### Generated Code:

```sql
 SELECT
CURRENT_DATABASE() /*** SSC-FDM-TS0010 - CURRENT_DATABASE function has different behavior in certain cases ***/;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0011

Default value not allowed in Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TS0078](../conversion-issues/sqlServerEWI.md) documentation

### Description

This error is added to the code when expressions like function calls, variable names, or named constants follow the default option.

Snowflake only supports explicit constants like numbers or strings.

#### Code Example

##### Input Code:

```sql
 ALTER TABLE
    T_ALTERTABLETEST
ADD
    COLUMN COL10 INTEGER DEFAULT RANDOM(10);
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "T_ALTERTABLETEST", "RANDOM" **
ALTER TABLE IF EXISTS T_ALTERTABLETEST
ADD
    COLUMN COL10 INTEGER
--                         --** SSC-FDM-TS0011 - DEFAULT OPTION NOT ALLOWED IN SNOWFLAKE **
--                         DEFAULT RANDOM(10)
                                           ;
```

#####

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0012

Information for the column was not found. STRING used to match CAST operation

### Description

This EWI is added in Table-Valued User Defined Functions where the return type of a column can not be determined during the conversion. `STRING` is used as a default to match the `CAST` operation in the `SELECT` statement <!–TODO: search for a broken reference.->

#### Code Example

##### Input Code:

```sql
 CREATE FUNCTION GetDepartmentInfo()
RETURNS TABLE
AS
RETURN
(
  SELECT DepartmentID, Name, GroupName
  FROM HumanResources.Department
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE FUNCTION GetDepartmentInfo ()
RETURNS TABLE(
  DepartmentID STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN DepartmentID WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  Name STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN Name WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  GroupName STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN GroupName WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
AS
$$
    SELECT
    CAST(DepartmentID AS STRING),
    CAST(Name AS STRING),
    CAST(GroupName AS STRING)
    FROM
    HumanResources.Department
$$;
```

#### Best Practices

* The user should check which is the correct data type that could not be found and change it in the `RETURNS TABLE` statement definition.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0013

Snowflake Scripting cursor rows are not modifiable.

### Description

This EWI is added when Cursors are open to modification in the input code. Snowflake Scripting does not allow modifying cursor rows.

#### Example Code:

##### Input Code:

```sql
 CREATE OR ALTER PROCEDURE modifiablecursorTest
AS
BEGIN
    -- Should be marked with SSC-FDM-TS0013
    DECLARE CursorVar CURSOR
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar2 INSENSITIVE CURSOR
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar3 CURSOR KEYSET SCROLL_LOCKS
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar4 CURSOR DYNAMIC OPTIMISTIC
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar6 CURSOR STATIC
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar7 CURSOR READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    -- Shouid not be marked
    DECLARE CursorVar5 CURSOR STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    RETURN 'DONE';
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE modifiablecursorTest ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		-- Should be marked with SSC-FDM-TS0013
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar2 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar3 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar4 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar6 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		--** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
		CursorVar7 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		-- Shouid not be marked
		CursorVar5 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
	BEGIN
		RETURN 'DONE';
	END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0014

Computed column transformed

### Description

This warning is added when an SQL Server computed column is transformed to its Snowflake equivalent. It is added because, in some cases, the functional equivalence could be affected.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE [TestTable](
    [Col1] AS (CONVERT ([REAL], ExpressionValue))
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TestTable (
    Col1 REAL AS (CAST(ExpressionValue AS REAL)) /*** SSC-FDM-TS0014 - COMPUTED COLUMN WAS TRANSFORMED TO ITS SNOWFLAKE EQUIVALENT, FUNCTIONAL EQUIVALENCE VERIFICATION PENDING. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

#### Best Practices

* No additional user actions are required; it is just informative.
* Add manual changes to the not-transformed expression.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0016

XML columns in Snowflake might have a different format

### Description

This warning is added when an SQL Server FOR XML clause with a non-empty path is transformed to its Snowflake equivalent using `FOR_XML_UDF`. It is added because columns in XML could be different.

> **Note:**
>
> `FOR XML PATH('')` (empty path without a `ROOT` clause) is a common SQL Server string concatenation pattern and is **not** an XML generation scenario. These cases are transformed to `LISTAGG` instead of `FOR_XML_UDF`, and this FDM is not emitted. See the [SELECT FOR](../../../../translation-references/transact/transact-dmls.md) section for details.

#### Code Example

Given the following table called `employee` as an example.

| Id | Name | Hint |
| --- | --- | --- |
| 1 | Kinslee Park | Developer |
| 2 | Ezra Mata | Developer |
| 3 | Aliana Quinn | Manager |

##### Input Code:

##### Code

```sql
 SELECT
  	e.id,
  	e.name as full_name,
  	e.hint
  FROM
  	employee e
  FOR XML PATH;
```

##### Output

```html
 <row>
    <id>1</id>
    <full_name>Kinslee Park</full_name>
    <hint>Developer</hint>
</row>
<row>
    <id>2</id>
    <full_name>Ezra Mata</full_name>
    <hint>Developer</hint>
</row>
<row>
    <id>3</id>
    <full_name>Aliana Quinn</full_name>
    <hint>Manager</hint>
</row>
```

##### Generated Code:

##### Code

```sql
 SELECT
	--** SSC-FDM-TS0016 - XML COLUMNS IN SNOWFLAKE MIGHT HAVE A DIFFERENT FORMAT **
	FOR_XML_UDF(OBJECT_CONSTRUCT('id', e.id, 'full_name', e.name, 'hint', e.hint), 'row')
FROM
	employee e;
```

##### Output

```html
 <row type="OBJECT">
    <full_name type="VARCHAR">Kinslee Park</full_name>
    <hint type="VARCHAR">Developer</hint>
    <id type="INTEGER">1</id>
</row>
<row type="OBJECT">
    <full_name type="VARCHAR">Ezra Mata</full_name>
    <hint type="VARCHAR">Developer</hint>
    <id type="INTEGER">2</id>
</row>
<row type="OBJECT">
    <full_name type="VARCHAR">Aliana Quinn</full_name>
    <hint type="VARCHAR">Manager</hint>
    <id type="INTEGER">3</id>
</row>
```

#### Best Practices

* No additional user actions are required; it is just informative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0017

CURRENT_USER function does not support a user ID as a parameter.

### Description

This EWI is added when functions like `SUSER_NAME` or `SUSER_SNAME` contain the user identifier as a parameter because this last one is not supported in the CURRENT_USER function in Snowflake.

#### Input Code:

```sql
 SELECT SUSER_NAME(0x010500000000000515000000a065cf7e784b9b5fe77c87705a2e0000);
```

##### Generated Code:

```sql
 SELECT
CURRENT_USER() /*** SSC-FDM-TS0017 - User ID parameter used in SUSER_NAME function is not supported in CURRENT_USER function and it was removed. ***/;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0018

Database console command is not supported

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TS0079](../conversion-issues/sqlServerEWI.md) documentation

### Description

This FDM is added when SnowConvert AI finds a DBCC statement inside the input code.
Most DBCC statements are not supported in Snowflake.

#### Code Example

##### Input Code:

```sql
 DBCC CHECKIDENT(@a, RESEED, @b) WITH NO_INFOMSGS
```

##### Generated Code:

```sql
 ----** SSC-FDM-TS0018 - DATABASE CONSOLE COMMAND 'CHECKIDENT' IS NOT SUPPORTED. **
--DBCC CHECKIDENT(@a, RESEED, @b) WITH NO_INFOMSGS
```

#### Best Practices

* No additional user actions are required; it is just informative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0019

RAISERROR Error Message may differ because of the SQL Server string format.

### Description

This EWI is added to notify that the RAISERROR Error Message may differ because of the SQL Server string format.

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE RAISERROR_PROCEDURE
AS
BEGIN
RAISERROR ('This is a sample error message with the first parameter %d and the second parameter %*.*s',
           10,
           1,
           123,
	   7,
	   7,
	   'param2');
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE RAISERROR_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		--** SSC-FDM-TS0019 - RAISERROR ERROR MESSAGE MAY DIFFER BECAUSE OF THE SQL SERVER STRING FORMAT **
		SELECT
			RAISERROR_UDF('This is a sample error message with the first parameter %d and the second parameter %*.*s',
			10,
			1, array_construct(
			123,
7,
7,
'param2'));
	END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0020

Default constraint was commented out and may have been added to a table definition.

### Description

This FDM is added when the default constraint is present in an Alter Table statement.

Currently, support for that constraint is unavailable. A workaround to transform it is to define the table before using Alter Table. This allows SnowConvert AI to identify the references, and the default constraint is consolidated in the table definition. Otherwise, the constraint is only commented out.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE table1(
  col1 integer,
  col2 varchar collate Latin1_General_CS,
  col3 date
);

ALTER TABLE table1
ADD col4 integer,
  CONSTRAINT col1_constraint DEFAULT 50 FOR col1,
  CONSTRAINT col1_constraint DEFAULT (getdate()) FOR col1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table1 (
  col1 INTEGER DEFAULT 50,
  col2 VARCHAR COLLATE 'EN-CS',
  col3 DATE
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;

ALTER TABLE table1
ADD col4 INTEGER;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE table1
--ADD
--CONSTRAINT col1_constraint DEFAULT 50 FOR col1
                                              ;

----** SSC-FDM-TS0020 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION **

--ALTER TABLE table1
--ADD
--CONSTRAINT col1_constraint DEFAULT (CURRENT_TIMESTAMP() :: TIMESTAMP) FOR col1
                                                                              ;
```

#### Known Issues

* When different default constraints are declared over the same column, only the first will be reflected on the Create Table Statement.
* When a default constraint is declared on a missing column, the transformation cannot be performed due to the lack of dependencies.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0021

A MASKING POLICY was created as a substitute for MASKED WITH.

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This EWI is added when the Alter Table statement contains a MASKED WITH clause. The reason this is added is to inform that an approximate MASKING POLICY was created as a substitute for the MASKED WITH function.

#### Code Example

##### Input Code:

```sql
 ALTER TABLE table_name
ALTER COLUMN column_name
ADD MASKED WITH (FUNCTION = 'default()');
```

##### Generated Code:

```sql
 --** SSC-FDM-TS0022 - MASKING ROLE MUST BE DEFINED PREVIOUSLY BY THE USER **
CREATE OR REPLACE MASKING POLICY "default" AS
(val STRING)
RETURNS STRING ->
CASE
WHEN current_role() IN ('YOUR_DEFINED_ROLE_HERE')
THEN val
ELSE 'xxxxx'
END;

ALTER TABLE IF EXISTS table_name MODIFY COLUMN column_name/*** SSC-FDM-TS0021 - A MASKING POLICY WAS CREATED AS SUBSTITUTE FOR MASKED WITH ***/  SET MASKING POLICY "default";
```

> **Note:**
>
> The MASKING POLICY will be created previous to the ALTER TABLE statement. And it is expected to have an approximate behavior. Some tweaks might be needed in regard to roles and user privileges.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0022

The user must previously define the masking role.

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This is EWI occurs when a MASKING POLICY is created and a role or privilege must be linked to it so the data masking could work properly.

#### Code Example

##### Input code

```sql
 ALTER TABLE tableName
ALTER COLUMN columnName
ADD MASKED WITH (FUNCTION = 'partial(1, "xxxxx", 1)');
```

##### Generated Code:

```sql
 --** SSC-FDM-TS0022 - MASKING ROLE MUST BE DEFINED PREVIOUSLY BY THE USER **
CREATE OR REPLACE MASKING POLICY "partial_1_xxxxx_1" AS
(val STRING)
RETURNS STRING ->
CASE
WHEN current_role() IN ('YOUR_DEFINED_ROLE_HERE')
THEN val
ELSE LEFT(val, 1) || 'xxxxx' || RIGHT(val, 1)
END;

ALTER TABLE IF EXISTS tableName MODIFY COLUMN columnName/*** SSC-FDM-TS0021 - A MASKING POLICY WAS CREATED AS SUBSTITUTE FOR MASKED WITH ***/  SET MASKING POLICY "partial_1_xxxxx_1";
```

> **Note:**
>
> As shown on line 6, there is a placeholder where the defined roles can be placed. There is room for one or several values separated by commas. Also, here, the use of single quotes is mandatory for each of the values.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0023

Error function could be different in Snowflake

### Description

This EWI is added in the transformation of the following ERRORs functions due to the corresponding behavior change.

* **ERROR_MESSAGE** The message of SQLERRM could be different in Snowflake.
* **ERROR_STATE** The target SQLSTATE property could return a different number due to platform differences.
* **ERROR_PROCEDURE** Transformation changed to return the stored procedure where the function is called.

#### Input Code:

```sql
CREATE PROCEDURE ProcError
AS
BEGIN
Declare @ErrorState INT = ERROR_STATE();
Declare @ErrorMessage INT = ERROR_MESSAGE();
Declare @ErrorProc INT = ERROR_PROCEDURE();
Select 1;
END;
```

#### Generated Code

```sql
CREATE OR REPLACE PROCEDURE ProcError ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "09/01/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
DECLARE
ERRORSTATE INT := SQLSTATE /*** SSC-FDM-TS0023 - ERROR STATE COULD BE DIFFERENT IN SNOWFLAKE ***/;
ERRORMESSAGE INT := SQLERRM /*** SSC-FDM-TS0023 - ERROR MESSAGE COULD BE DIFFERENT IN SNOWFLAKE ***/;
ERRORPROC INT := 'ProcError' /*** SSC-FDM-TS0023 - ERROR PROCEDURE NAME COULD BE DIFFERENT IN SNOWFLAKE ***/;
ProcedureResultSet RESULTSET;
BEGIN

ProcedureResultSet := (
Select 1);
RETURN TABLE(ProcedureResultSet);
END;
$$;
```

#### Recommendation

If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-FDM-TS0024

CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases.

### Description

This FDM is added when the `At Time Zone` has the `CURRENT_TIMESTAMP`. This is because the result might differ in some instances.

The main difference is that in SQL Server, CURRENT_TIMESTAMP returns the current system date and time in the server time zone and in Snowflake CURRENT_TIMESTAMP returns the current date and time in the UTC (Coordinated Universal Time) time zone.

#### Input Code:

##### Sql Server

```sql
 SELECT current_timestamp at time zone 'Hawaiian Standard Time';
```

##### Result

`2024-02-08 16:52:55.317 -10:00`

##### Generated Code:

##### Snowflake

```sql
 SELECT
CONVERT_TIMEZONE('Pacific/Honolulu', CURRENT_TIMESTAMP() /*** SSC-FDM-TS0024 - CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases ***/);
```

##### Result

`2024-02-08 06:53:46.994 -1000`

#### Best Practices

This is an example if you want to keep the same format in Snowflake.

##### SQL Server

```sql
 SELECT current_timestamp at time zone 'Hawaiian Standard Time';
```

##### Result

`2024-02-08 16:33:49.143 -10:00`

In Snowflake you can use [ALTER SESSION](https://docs.snowflake.com/en/sql-reference/sql/alter-session) to change the default time zone. For example:

##### Snowflake

```sql
 ALTER SESSION SET TIMEZONE = 'Pacific/Honolulu';

SELECT
CONVERT_TIMEZONE('Pacific/Honolulu', 'UTC', CURRENT_TIMESTAMP());
```

##### Result

`2024-02-08 16:33:49.143`

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0025

DB_ID_UDF may have a different behavior in certain cases.

### Description

This FDM is added to clarify that the DB_ID_UDF tries to emulate the [DB_ID](https://learn.microsoft.com/en-us/sql/t-sql/functions/db-id-transact-sql?view=sql-server-ver16) SqlServer function as well as possible. In SqlServer, the identifier assigned to a database is unique, and if the database is deleted, this ID won’t ever be used again; otherwise, in Snowflake, this identifier corresponds to the number assigned to the database when it is created; it is also unique, but it is a consecutive number which means that if this database is deleted, this number is going to be assigned to the database that was created after the deleted one.

#### Input Code:

##### Sql Server

```sql
 SELECT DB_ID('my_database');
```

##### Result

`6`

##### Generated Code:

##### Snowflake

```sql
 SELECT
DB_ID_UDF('my_database') /*** SSC-FDM-TS0025 - DB_ID_UDF MAY HAVE A DIFFERENT BEHAVIOR IN CERTAIN CASES ***/;
```

##### Result

`6`

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0026

DELETE case is not being considered in the temporary table

### Description

There is an INSERT statement pattern that requires a specific transformation, which involves the creation of a temporary table. This FDM notifies that the DELETE case is not considered in the transformation mentioned. Please visit [INSERT with Table DML Factor with MERGE as DML](../../../../translation-references/transact/transact-dmls.md) to get more information about this pattern.

#### Input Code:

##### Sql Server

```sql
 INSERT INTO T3
SELECT
	col1,
  col2
FROM (
  MERGE T1 USING T2
  	ON T1.col1 = T2.col1
  WHEN NOT MATCHED THEN
    INSERT VALUES ( T2.col1, T2.col2 )
  WHEN MATCHED THEN
    UPDATE SET T1.col2 = t2.col2
  OUTPUT
  	$action ACTION_OUT,
    T2.col1,
    T2.col2
) AS MERGE_OUT
 WHERE ACTION_OUT='UPDATE';
```

##### Generated Code:

##### Snowflake

```sql
 --** SSC-FDM-TS0026 - DELETE CASE IS NOT BEING CONSIDERED, PLEASE CHECK IF THE ORIGINAL MERGE PERFORMS IT **
CREATE OR REPLACE TEMPORARY TABLE MERGE_OUT AS
	SELECT
		CASE
			WHEN T1.$1 IS NULL
				THEN 'INSERT'
			ELSE 'UPDATE'
		END ACTION_OUT,
		T2.col1,
		T2.col2
	FROM
		T2
		LEFT JOIN
			T1
			ON T1.col1 = T2.col1;

MERGE INTO T1
USING T2
ON T1.col1 = T2.col1
WHEN NOT MATCHED THEN
	   INSERT VALUES (T2.col1, T2.col2)
WHEN MATCHED THEN
	UPDATE SET
		T1.col2 = t2.col2
		!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - OUTPUT CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
		OUTPUT
			$action ACTION_OUT,
		  T2.col1,
		  T2.col2 ;

		INSERT INTO T3
		SELECT
	col1,
	col2
		FROM
	MERGE_OUT
		WHERE
	ACTION_OUT ='UPDATE';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0027

SET ANSI_NULLS ON statement may have a different behavior in Snowflake

### Description

This FDM notifies that the SET ANSI_NULLS ON statement may behave differently in Snowflake. For more information about this statement,
go to the [ANSI_NULLS](../../../../translation-references/transact/transact-ansi-nulls.md) article.

#### Input Code

```sql
 SET ANSI_NULLS ON;
```

##### Generated Code

```sql
 ----** SSC-FDM-TS0027 - SET ANSI_NULLS ON STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE **
--SET ANSI_NULLS ON
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0028

Output parameters must have the same order as they appear in the executed code

### Description

This FDM notifies that the output parameters in the SP_EXECUTESQL statement must be in the same order as they appear in the SQL string to execute. Otherwise, the output values will not be correctly assigned.

### Code Example

#### Correct case

As can be seen, `@MaxAgeOUT` and `@MaxIdOU`T appear in the same order in both the SQL string and the output parameters.

Thus, when converting the code, the `SELECT $1, $2 INTO :MAXAGE, :MAXID FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))` will assign the values correctly.

##### Transact

```sql
 CREATE PROCEDURE CORRECT_OUTPUT_PARAMS_ORDER
AS
BEGIN
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    EXECUTE sp_executesql
        N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE ID < @id AND AGE < @age;',
        N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT',
        30,
        100,
        @MaxAgeOUT = @MaxAge OUTPUT,
        @MaxIdOut = @MaxId OUTPUT;

    SELECT @MaxAge, @MaxId;
END
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE CORRECT_OUTPUT_PARAMS_ORDER ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/07/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF('SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   ID < @id AND AGE < @age;', '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT', ARRAY_CONSTRUCT('', '', 'MAXAGEOUT', 'MAXIDOUT'), ARRAY_CONSTRUCT(
    30,
    100, :MAXAGE, :MAXID));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXAGE,
      :MAXID
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

#### Problematic case

As can be seen, `@MaxAgeOUT` and `@MaxIdOUT` in the output parameters appear in a different order compared to the SQL string.

Thus, when converting the code, the `SELECT $1, $2 INTO :MAXID, :MAXAGE FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))` will assign the values incorrectly. `Max(AGE)` will be assigned to `:MAXID` and `Max(ID)` to `:MAXAGE`.

This needs to be manually fixed by either changing the order of the output parameters in the SELECT INTO statement or by changing the order in the SQL string.

##### Transact

```sql
 CREATE PROCEDURE INCORRECT_OUTPUT_PARAMS_ORDER
AS
BEGIN
    DECLARE @MaxAge INT;
    DECLARE @MaxId INT;

    EXECUTE sp_executesql
        N'SELECT @MaxAgeOUT = max(AGE), @MaxIdOut = max(ID) FROM PERSONS WHERE ID < @id AND AGE < @age;',
        N'@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT',
        30,
        100,
        @MaxIdOut = @MaxId OUTPUT,
        @MaxAgeOUT = @MaxAge OUTPUT;

    SELECT @MaxAge, @MaxId;
END
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE INCORRECT_OUTPUT_PARAMS_ORDER ()
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "10/07/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    MAXAGE INT;
    MAXID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
    EXECUTE IMMEDIATE TRANSFORM_SP_EXECUTE_SQL_STRING_UDF('SELECT
   MAX(AGE),
   MAX(ID) FROM
   PERSONS
WHERE
   ID < @id AND AGE < @age;', '@age INT, @id INT, @MaxAgeOUT INT OUTPUT, @MaxIdOUT INT OUTPUT', ARRAY_CONSTRUCT('', '', 'MAXIDOUT', 'MAXAGEOUT'), ARRAY_CONSTRUCT(
    30,
    100, :MAXID, :MAXAGE));
    --** SSC-FDM-TS0028 - OUTPUT PARAMETERS MUST HAVE THE SAME ORDER AS THEY APPEAR IN THE EXECUTED CODE **
    SELECT
      $1,
      $2
    INTO
      :MAXID,
      :MAXAGE
    FROM
      TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    ProcedureResultSet := (
    SELECT
      :MAXAGE,
      :MAXID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

### Best Practices

* Make sure the OUTPUT parameters are in the same order as they appear in the SQL string.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0029

SET NOCOUNT statement is commented out, which is not applicable in Snowflake.

### Description

When SnowConvert AI encounters a `SET NOCOUNT` statement, it adds this FDM. SnowConvert AI then comments out the `SET NOCOUNT` statement because it is not relevant in the Snowflake environment.

### Code example

#### Input Code:

```sql
 SET NOCOUNT ON;
```

##### Generated Code

```sql
 ----** SSC-FDM-TS0029 - SET NOCOUNT STATEMENT IS COMMENTED OUT, WHICH IS NOT APPLICABLE IN SNOWFLAKE. **
--SET NOCOUNT ON
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0030

SET ANSI_PADDING ON statement is commented out, which is equivalent in Snowflake.

### Description

Snowflake always preserves trailing spaces in string values when they are inserted into columns. This behavior is equivalent to `SET ANSI_PADDING ON` in SQL Server. Therefore, when SnowConvert AI encounters a `SET ANSI_PADDING ON` statement, it adds this FDM and comments it out.

> **Note:**
>
> When `SET ANSI_PADDING OFF` is encountered instead, SnowConvert AI raises [SSC-EWI-TS0002](../conversion-issues/sqlServerEWI.md) because `ANSI_PADDING OFF` behavior cannot be replicated in Snowflake. In SQL Server, `ANSI_PADDING` is a column-level storage property set at column creation time, not a session-level behavior. See SSC-EWI-TS0002 for details on the limitations and recommended manual remediation steps.

### Code example

#### Input Code:

```sql
 SET ANSI_PADDING ON;
```

##### Generated Code

```sql
 ----** SSC-FDM-TS0030 - SET ANSI_PADDING ON STATEMENT IS COMMENTED OUT, WHICH IS EQUIVALENT IN SNOWFLAKE. **
--SET ANSI_PADDING ON
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0031

SET ANSI_WARNINGS ON statement is commented out because Snowflake generally adheres to ANSI-standard behaviors.

### Description

Snowflake generally behaves as if `ANSI_WARNINGS` is `ON` by default, especially concerning error handling for arithmetic overflow, division by zero, and string truncation. You typically don’t need to explicitly “set” an equivalent to `ANSI_WARNINGS` in Snowflake. Therefore, when SnowConvert AI encounters a `SET ANSI_WARNINGS ON` statement, it adds this FDM and comments it out.

### Code example

#### Input Code:

```sql
 SET ANSI_WARNINGS ON;
```

##### Generated Code

```sql
 ----** SSC-FDM-TS0031 - SET ANSI_WARNINGS ON STATEMENT IS COMMENTED OUT, WHICH SNOWFLAKE GENERALLY ADHERES TO ANSI-STANDARD BEHAVIORS. **
--SET ANSI_WARNINGS ON
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0032

IDENTITY column property not supported in CREATE TABLE AS STATEMENT, emulated using ROW_NUMBER().

### Description

Snowflake does not have a direct way to perform a CREATE TABLE AS with an identity column. Although SnowConvert adds a ROW_NUMBER column instead of the IDENTITY to simulate the enumeration of the identity. This transformation does not create an identity column, which means rows inserted after creation won’t be automatically incremented.

### Code example

#### Input Code:

```sql
with peers as
(
    select
    *
    from (
    values
        ('Luis', 'Miguel'),
        ('Cory', 'Wong'),
        ('Steve', 'Vai'),
        ('John', 'Petrucci'),
        ('Paul', 'Gilbert')
    ) as info(name, lastname)
)
select
    rowm = IDENTITY(int,1,1),
    *
into #MYTABLE
from peers;
```

##### Generated Code

```sql
--** SSC-FDM-TS0032 - IDENTITY COLUMN PROPERTY NOT SUPPORTED IN CREATE TABLE AS STATEMENT, EMULATED WITH USING ROW_NUMBER **
CREATE OR REPLACE TEMPORARY TABLE T_MYTABLE AS
     WITH peers as
(
    select
     *
    from (
    values
        ('Luis', 'Miguel'),
        ('Cory', 'Wong'),
        ('Steve', 'Vai'),
        ('John', 'Petrucci'),
        ('Paul', 'Gilbert')
    ) as info (
      name,
      lastname
     )
)
     SELECT
    ROW_NUMBER()
    OVER (
    ORDER BY
     NULL) AS rowm,
    *
from
    peers;
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0033

SET QUOTED_IDENTIFIER STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE.

### Description

**SQL Server Behavior**

In SQL Server, SET QUOTED_IDENTIFIER ON is a syntax setting that is separate from collation. The database’s or column’s collation (for example, _CI for Case-Insensitive or _CS for Case-Sensitive) dictates whether quoted identifiers are case-sensitive or not. If a database has a _CI collation, then “MyColumn” and “mycolumn” are treated as the same.

**Snowflake Behavior**

In Snowflake, the behavior is simpler and more strict:

Unquoted Identifiers: Automatically stored and resolved in all uppercase, making them case-insensitive (mytable is the same as MYTABLE).

Quoted Identifiers: By default, identifiers enclosed in double quotes (“MyColumn”) are case-sensitive. They are stored exactly as you typed them.

### Code example

#### Input Code:

```sql
SET QUOTED_IDENTIFIER ON
GO

-- the table is defined as "Products Test"
-- this query will work because the case is ignored.
select
*
from [products test];

SET QUOTED_IDENTIFIER OFF

-- this query will fail because the case is preserved
select
*
from [products test];
GO
```

##### Generated Code

```sql
----** SSC-FDM-TS0033 - SET QUOTED_IDENTIFIER STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE **
--SET QUOTED_IDENTIFIER ON

-- the table is defined as "Products Test"
-- this query will work because the case is ignored.
select
  *
from
  "products test";

----** SSC-FDM-TS0033 - SET QUOTED_IDENTIFIER STATEMENT MAY HAVE A DIFFERENT BEHAVIOR IN SNOWFLAKE **
--SET QUOTED_IDENTIFIER OFF

-- this query will fail because the case is preserved
select
  *
from
  "products test";
```

**How to Achieve Equivalence in Snowflake**

To get the same case-insensitive behavior for quoted identifiers as in SQL Server, you can set the QUOTED_IDENTIFIERS_IGNORE_CASE session parameter to TRUE in Snowflake.

```sql
-- This will make quoted identifiers case-insensitive for the session
ALTER SESSION SET QUOTED_IDENTIFIERS_IGNORE_CASE = TRUE;

-- Now, this query will succeed
select
  *
from
  "products test";
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0034

### Description

This FDM is generated when a `DATA_COMPRESSION` clause is encountered in a `CREATE TABLE` or `ALTER TABLE` statement. In SQL Server, `DATA_COMPRESSION` is used to specify whether data should be compressed (using ROW or PAGE compression) to reduce storage space and improve I/O performance. **Snowflake automatically handles data compression** using its proprietary compression algorithms, making the `DATA_COMPRESSION` clause unnecessary and unsupported. SnowConvert comments out the `DATA_COMPRESSION` clause during conversion.

### Example Code

#### Input (SQL Server):

```sql
CREATE TABLE Employees (
    EmployeeID INT PRIMARY KEY,
    Name NVARCHAR(100),
    Department NVARCHAR(50),
    Salary DECIMAL(10, 2)
)
WITH (DATA_COMPRESSION = PAGE);
```

#### Output (Snowflake):

```sql
CREATE OR REPLACE TABLE Employees (
    EmployeeID INT PRIMARY KEY,
    Name NVARCHAR(100),
    Department NVARCHAR(50),
    Salary DECIMAL(10, 2)
)
WITH (
--    --** SSC-FDM-TS0034 - DATA_COMPRESSION IS AUTOMATICALLY HANDLED BY SNOWFLAKE. **
--    DATA_COMPRESSION = PAGE
                           )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/06/2025",  "domain": "no-domain-provided",  "migrationid": "sFmaAZAnCnm6VvGeJrE4BQ==" }}'
;
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0035

Triggers are not supported in Snowflake. ENABLE/DISABLE trigger operation is out of the translation scope of SnowConvert AI.

### Description

Triggers are not supported in Snowflake. In SQL Server, ENABLE TRIGGER and DISABLE TRIGGER operations control whether a trigger fires when its triggering event occurs. Since Snowflake does not have an equivalent trigger mechanism, these operations are out of the translation scope of SnowConvert AI. When SnowConvert AI encounters an `ALTER TABLE ... ENABLE TRIGGER` or `ALTER TABLE ... DISABLE TRIGGER` statement, it comments out the trigger clause and generates this issue.

#### Code Example

##### Input Code:

```sql
ALTER TABLE Employees
ENABLE TRIGGER AuditEmployeeChanges;
GO
```

##### Generated Code:

```sql
ALTER TABLE IF EXISTS Employees
----** SSC-FDM-TS0035 - TRIGGERS ARE NOT SUPPORTED IN SNOWFLAKE. ENABLE TRIGGER OPERATION IS OUT OF THE TRANSLATION SCOPE OF SNOWCONVERT AI. **
--ENABLE TRIGGER
--  AuditEmployeeChanges
        ;
```

#### Best Practices

* **Review trigger dependencies:** Identify all triggers that were enabled or disabled in the source SQL Server code and document their business logic. Determine whether that logic should be implemented as Snowflake streams and tasks, stored procedures, or application-layer logic.
* **Consider Snowflake streams and tasks:** Snowflake’s [streams](https://docs.snowflake.com/en/user-guide/streams-intro) capture change data on tables, and [tasks](https://docs.snowflake.com/en/user-guide/tasks-intro) can be scheduled to process that data — together they provide event-driven behavior similar to SQL Server triggers.
* **Remove commented-out statements:** After migrating the trigger logic to Snowflake-native constructs, remove the commented-out ENABLE/DISABLE TRIGGER statements to keep the codebase clean.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0036

HOST_NAME replaced with CURRENT_IP_ADDRESS, which returns the client IP address instead of the workstation name.

### Description

This FDM is generated when SnowConvert AI encounters the `HOST_NAME()` function. In SQL Server, `HOST_NAME()` returns the workstation name of the client connection. Snowflake does not have a direct equivalent; `CURRENT_IP_ADDRESS()` is used as the closest alternative, but it returns the client’s IP address rather than the hostname. This is a functional difference because the returned values have different formats and semantics.

#### Code Example

##### Input Code:

```sql
SELECT HOST_NAME();
```

##### Generated Code:

```sql
SELECT
    CURRENT_IP_ADDRESS() /*** SSC-FDM-TS0036 - HOST_NAME REPLACED WITH CURRENT_IP_ADDRESS, WHICH RETURNS THE CLIENT IP ADDRESS INSTEAD OF THE WORKSTATION NAME ***/;
```

#### Best Practices

* If your application uses `HOST_NAME()` for auditing or logging, verify that the IP address provides sufficient information for your use case.
* If the workstation name is required, consider passing it as a session parameter via `ALTER SESSION SET` or storing it in a context variable.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0037

SET statement is not applicable in Snowflake as it has equivalent default behavior.

### Description

This FDM is generated when SnowConvert AI encounters a `SET` statement whose specified value matches Snowflake’s default behavior. For example, `SET CONCAT_NULL_YIELDS_NULL ON` is the default in Snowflake (NULL concatenation yields NULL), `SET NUMERIC_ROUNDABORT OFF` matches Snowflake’s default of not raising errors on precision loss, and `SET ARITHABORT ON/OFF` has no behavioral impact in Snowflake. Since the setting is already the default, the statement is commented out.

#### Code Example

##### Input Code:

```sql
SET CONCAT_NULL_YIELDS_NULL ON;
```

##### Generated Code:

```sql
----** SSC-FDM-TS0037 - SET CONCAT_NULL_YIELDS_NULL ON STATEMENT IS NOT APPLICABLE IN SNOWFLAKE AS IT HAS EQUIVALENT DEFAULT BEHAVIOR. **
--SET CONCAT_NULL_YIELDS_NULL ON;
```

#### Best Practices

* No action is required — the commented-out statement reflects behavior that is already the default in Snowflake.
* If the non-default value of the same option is used elsewhere (e.g., `SET CONCAT_NULL_YIELDS_NULL OFF`), that will generate a separate EWI (SSC-EWI-TS0089) because the non-default behavior cannot be replicated.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

* [SSC-EWI-TS0089](../conversion-issues/sqlServerEWI.md): SET statement not supported in Snowflake (for non-default values).

## SSC-FDM-TS0038

Agent Job migrated to Snowflake Task orchestration.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_add_job` call that creates a SQL Server Agent Job containing SSIS package steps. The Agent Job definition is migrated to a Snowflake Task orchestration model. The original `sp_add_job` call is commented out and replaced with generated orchestration files in the `ETL/AGENTJOBS/` output directory. The generated output includes Snowflake Task definitions, orchestrator stored procedures, and schedule mappings.

#### Code Example

##### Input Code:

```sql
DECLARE @jobId BINARY(16);
EXEC msdb.dbo.sp_add_job
    @job_name = N'ETL_Nightly_Load',
    @enabled = 1,
    @job_id = @jobId OUTPUT;
```

##### Generated Code:

```sql
DECLARE
  JOBID BINARY(16);
BEGIN
--  --** SSC-FDM-TS0038 - AGENT JOB 'ETL_Nightly_Load' MIGRATED TO SNOWFLAKE TASK ORCHESTRATION. GENERATED OUTPUT IN ETL/AGENTJOBS/. **
--  EXEC msdb.dbo.sp_add_job @job_name = N'ETL_Nightly_Load', @enabled = 1, @job_id = @jobId OUTPUT
                                                                                                 ;
END;
```

#### Best Practices

* Review the generated files in the `ETL/AGENTJOBS/` output directory. These include Snowflake Task definitions and orchestrator stored procedures that replace the Agent Job.
* Validate the task scheduling and step ordering match your original Agent Job configuration.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0039

Agent Job schedule mapped to CRON expression in Snowflake Task.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_add_jobschedule` or `sp_add_schedule`/`sp_attach_schedule` call that defines a schedule for a SQL Server Agent Job. The schedule parameters (`freq_type`, `freq_interval`, `active_start_time`) are mapped to a CRON expression for use in the corresponding Snowflake Task definition. The original schedule call is commented out.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_add_jobschedule
    @job_id = @jobId,
    @name = N'Nightly_2AM',
    @enabled = 1,
    @freq_type = 4,
    @freq_interval = 1,
    @active_start_time = 020000;
```

##### Generated Code:

```sql
--  --** SSC-FDM-TS0039 - AGENT JOB SCHEDULE 'Nightly_2AM' MAPPED TO CRON EXPRESSION IN SNOWFLAKE TASK. **
--  EXEC msdb.dbo.sp_add_jobschedule @job_id = @jobId, @name = N'Nightly_2AM', @enabled = 1, @freq_type = 4, @freq_interval = 1, @active_start_time = 020000
                                                                                                                                                          ;
```

#### Best Practices

* Verify the generated CRON expression in the Snowflake Task definition matches your intended schedule. Complex SQL Server schedules (e.g., monthly on specific days, bi-weekly) may need manual adjustment.
* Review the `ETL/AGENTJOBS/` output for the generated `CREATE TASK ... SCHEDULE = 'USING CRON ...'` statement.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0040

Agent Job step migrated to orchestrator Stored Procedure.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_add_jobstep` call for an Agent Job step with a `TSQL` or `SSIS` subsystem. The step is migrated to an orchestrator stored procedure that is generated in the `ETL/AGENTJOBS/` output directory. The original `sp_add_jobstep` call is commented out. For SSIS steps, the SSIS package is also processed through SnowConvert AI’s ETL-to-dbt pipeline.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_add_jobstep
    @job_name = N'ETL_Nightly_Load',
    @step_name = N'LoadSalesData',
    @step_id = 1,
    @subsystem = N'SSIS',
    @command = N'/ISSERVER "SalesETL.dtsx"';
```

##### Generated Code:

```sql
----** SSC-FDM-TS0040 - AGENT JOB STEP 'LoadSalesData' (SSIS) MIGRATED TO ORCHESTRATOR STORED PROCEDURE. GENERATED OUTPUT IN ETL/AGENTJOBS/. **
--EXEC msdb.dbo.sp_add_jobstep @job_name = N'ETL_Nightly_Load', @step_name = N'LoadSalesData', @step_id = 1, @subsystem = N'SSIS', @command = N'/ISSERVER "SalesETL.dtsx"'
```

#### Best Practices

* Review the generated orchestrator stored procedure in `ETL/AGENTJOBS/` to ensure the step logic is correctly translated.
* For SSIS steps, also review the generated dbt models and SQL files produced by the ETL-to-dbt pipeline.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0041

sp_delete_job translated to DROP TASK IF EXISTS.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_delete_job` call for a SQL Server Agent Job that has been migrated to a Snowflake Task. The `sp_delete_job` call is translated to a `DROP TASK IF EXISTS` statement targeting the corresponding Snowflake Task. The task name is derived from the original job name with a `TASK_` prefix and uppercase formatting.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_delete_job
    @job_name = N'ETL_Nightly_Load';
```

##### Generated Code:

```sql
--** SSC-FDM-TS0041 - SP_DELETE_JOB FOR AGENT JOB 'ETL_Nightly_Load' TRANSLATED TO DROP TASK IF EXISTS. **
DROP TASK IF EXISTS TASK_ETL_NIGHTLY_LOAD;
```

#### Best Practices

* Verify that the task name `TASK_{JOB_NAME}` matches the task created by the Agent Job migration (SSC-FDM-TS0038).
* Note that dropping a task in Snowflake also removes its schedule. If the task has dependent tasks, those must be updated separately.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0042

sp_start_job translated to EXECUTE TASK.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_start_job` call for a SQL Server Agent Job that has been migrated to a Snowflake Task. The call is translated to an `EXECUTE TASK` statement that triggers the corresponding Snowflake Task immediately, regardless of its schedule.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_start_job
    @job_name = N'ETL_Nightly_Load';
```

##### Generated Code:

```sql
--** SSC-FDM-TS0042 - SP_START_JOB FOR AGENT JOB 'ETL_Nightly_Load' TRANSLATED TO EXECUTE TASK. **
EXECUTE TASK TASK_ETL_NIGHTLY_LOAD;
```

#### Best Practices

* `EXECUTE TASK` triggers a single immediate run of the task. It does not affect the task’s schedule or resume/suspend state.
* Ensure the task has been created and is in a `STARTED` state if you also need it to run on schedule.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0043

sp_stop_job translated to ALTER TASK SUSPEND.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_stop_job` call for a SQL Server Agent Job that has been migrated to a Snowflake Task. The call is translated to `ALTER TASK ... SUSPEND`, which prevents future scheduled runs of the task. Note that `ALTER TASK SUSPEND` does not stop an already-running execution — it only prevents future runs from being triggered.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_stop_job
    @job_name = N'ETL_Nightly_Load';
```

##### Generated Code:

```sql
--** SSC-FDM-TS0043 - SP_STOP_JOB FOR AGENT JOB 'ETL_Nightly_Load' TRANSLATED TO ALTER TASK SUSPEND. NOTE: THIS PREVENTS FUTURE RUNS BUT CANNOT STOP AN IN-PROGRESS EXECUTION. **
EXECUTE IMMEDIATE 'ALTER TASK TASK_ETL_NIGHTLY_LOAD SUSPEND';
```

#### Best Practices

* Be aware that `ALTER TASK SUSPEND` only prevents future scheduled executions. If the task is currently running, the in-progress execution will complete.
* In SQL Server, `sp_stop_job` attempts to cancel an in-progress job step. This capability does not exist in Snowflake’s Task model.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0044

sp_update_job @enabled translated to ALTER TASK RESUME or SUSPEND.

### Description

This FDM is generated when SnowConvert AI encounters an `sp_update_job` call with the `@enabled` parameter for a SQL Server Agent Job that has been migrated to a Snowflake Task. When `@enabled=1`, the call is translated to `ALTER TASK ... RESUME` (starts the task’s schedule). When `@enabled=0`, it is translated to `ALTER TASK ... SUSPEND` (pauses the task’s schedule).

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_update_job
    @job_name = N'ETL_Nightly_Load',
    @enabled = 0;
```

##### Generated Code:

```sql
--** SSC-FDM-TS0044 - SP_UPDATE_JOB @ENABLED FOR AGENT JOB 'ETL_Nightly_Load' TRANSLATED TO ALTER TASK RESUME/SUSPEND. **
EXECUTE IMMEDIATE 'ALTER TASK TASK_ETL_NIGHTLY_LOAD SUSPEND';
```

#### Best Practices

* Verify that `RESUME` and `SUSPEND` map correctly to your intended enable/disable behavior.
* If `sp_update_job` is called with parameters other than `@enabled` (e.g., `@description`), those calls will generate SSC-EWI-TS0093 instead, as metadata updates are not applicable in Snowflake’s Task model.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0046

Rowversion/timestamp data type auto-generates unique values in SQL Server but not in Snowflake.

### Description

This FDM is generated when SnowConvert AI encounters a column with the `ROWVERSION` or `TIMESTAMP` data type (they are synonyms in SQL Server). In SQL Server, these data types automatically generate unique binary values on every `INSERT` and `UPDATE`, providing a mechanism for optimistic concurrency control. SnowConvert AI maps the type to `BINARY(8)`, which preserves the storage format but does not replicate the auto-generation behavior.

#### Code Example

##### Input Code:

```sql
CREATE TABLE Orders (
    OrderID INT PRIMARY KEY,
    OrderDate DATE,
    RowVer ROWVERSION
);
```

##### Generated Code:

```sql
CREATE OR REPLACE TABLE Orders (
    OrderID INT PRIMARY KEY,
    OrderDate DATE,
    RowVer BINARY(8) /*** SSC-FDM-TS0046 - ROWVERSION/TIMESTAMP DATA TYPE AUTO-GENERATES UNIQUE VALUES ON INSERT AND UPDATE IN SQL SERVER. THIS BEHAVIOR IS NOT REPLICATED IN SNOWFLAKE BINARY(8). ***/
)
;
```

#### Best Practices

* If your application uses `ROWVERSION` for optimistic concurrency control, implement an alternative pattern in Snowflake. Options include:

  + A `NUMBER` column with a Snowflake sequence, updated via a stream/task or stored procedure on each modification.
  + A `TIMESTAMP_NTZ` column set to `CURRENT_TIMESTAMP()` on insert/update using a default value and a stream-triggered task.
* If the column is only used for auditing (not concurrency), a `TIMESTAMP_NTZ DEFAULT CURRENT_TIMESTAMP()` column may suffice.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0047

SET IDENTITY_INSERT commented out.

### Description

This FDM is generated when SnowConvert AI encounters a `SET IDENTITY_INSERT ... ON` or `SET IDENTITY_INSERT ... OFF` statement. In SQL Server, `SET IDENTITY_INSERT ON` allows explicit values to be inserted into an identity column, and `OFF` re-enables the automatic identity generation. In Snowflake, explicit inserts into `IDENTITY`/`AUTOINCREMENT` columns are allowed by default without any special setting. However, the sequence counter does not automatically adjust to account for explicitly inserted values, which may cause conflicts.

#### Code Example

##### Input Code:

```sql
SET IDENTITY_INSERT dbo.Employees ON;
```

##### Generated Code:

```sql
----** SSC-FDM-TS0047 - SET IDENTITY_INSERT COMMENTED OUT. SNOWFLAKE ALLOWS EXPLICIT INSERTS INTO IDENTITY/AUTOINCREMENT COLUMNS BY DEFAULT, BUT THE SEQUENCE COUNTER DOES NOT ADJUST TO EXPLICITLY INSERTED VALUES. **
--SET IDENTITY_INSERT dbo.Employees ON;
```

#### Best Practices

* After explicitly inserting values into an identity column in Snowflake, manually adjust the underlying sequence to avoid conflicts: `ALTER SEQUENCE seq_name SET START = <max_inserted_value + increment>`.
* If you rely on toggling `IDENTITY_INSERT` in batch load scripts, remove the `SET` statements and add a sequence adjustment step at the end of the batch.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0047

SET IDENTITY_INSERT commented out.

### Description

In SQL Server, [`SET IDENTITY_INSERT`](https://learn.microsoft.com/en-us/sql/t-sql/statements/set-identity-insert-transact-sql?view=sql-server-ver16) controls whether explicit values can be inserted into the identity column of a table. When set to `ON`, it allows explicit inserts; when set to `OFF` (the default), it prevents them.

In Snowflake, there is no equivalent statement because explicit inserts into `IDENTITY` / `AUTOINCREMENT` columns are **always allowed by default**. However, unlike SQL Server, the underlying sequence counter in Snowflake does not adjust to account for explicitly inserted values, which may lead to duplicate key conflicts on subsequent inserts.

SnowConvert AI comments out the `SET IDENTITY_INSERT` statement and attaches this FDM with a context-specific reason depending on whether the original statement was `ON` or `OFF`.

#### Code Example

##### SET IDENTITY_INSERT ON

###### Input Code:

```sql
 SET IDENTITY_INSERT dbo.MyTable ON;
```

###### Generated Code:

```sql
 ----** SSC-FDM-TS0047 - SET IDENTITY_INSERT COMMENTED OUT. SNOWFLAKE ALLOWS EXPLICIT INSERTS INTO IDENTITY/AUTOINCREMENT COLUMNS BY DEFAULT, BUT THE SEQUENCE COUNTER DOES NOT ADJUST TO EXPLICITLY INSERTED VALUES. **
--SET IDENTITY_INSERT dbo.MyTable ON;
```

##### SET IDENTITY_INSERT OFF

###### Input Code:

```sql
 SET IDENTITY_INSERT dbo.MyTable OFF;
```

###### Generated Code:

```sql
 ----** SSC-FDM-TS0047 - SET IDENTITY_INSERT COMMENTED OUT. SNOWFLAKE DOES NOT SUPPORT RESTRICTING EXPLICIT INSERTS INTO IDENTITY/AUTOINCREMENT COLUMNS. **
--SET IDENTITY_INSERT dbo.MyTable OFF;
```

#### Best Practices

* After migration, verify that any tables with `IDENTITY` / `AUTOINCREMENT` columns do not experience duplicate key conflicts caused by the sequence counter not reflecting explicitly inserted values.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0046

Rowversion/timestamp data type auto-generation behavior not replicated in Snowflake.

### Description

In SQL Server, the [`rowversion`](https://learn.microsoft.com/en-us/sql/t-sql/data-types/rowversion-transact-sql?view=sql-server-ver16) data type (also known as `timestamp`) automatically generates a unique `BINARY(8)` value every time a row is inserted or updated. This is commonly used for optimistic concurrency control.

Snowflake does not have an equivalent mechanism. The `rowversion`/`timestamp` data type is mapped to `BINARY(8)`, but Snowflake’s `BINARY(8)` column will **not** auto-generate unique values on INSERT or UPDATE. Any application logic that depends on auto-incrementing row version values will need to be revised.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE t1
(
    RowVer TIMESTAMP
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE t1 (
  RowVer BINARY(8) /*** SSC-FDM-TS0046 - ROWVERSION/TIMESTAMP DATA TYPE AUTO-GENERATES UNIQUE VALUES ON INSERT AND UPDATE IN SQL SERVER. THIS BEHAVIOR IS NOT REPLICATED IN SNOWFLAKE BINARY(8). ***/
)
;
```

#### Best Practices

* Review any application logic that depends on `rowversion`/`timestamp` for optimistic concurrency control and adjust accordingly.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0053

WITH CHECK clause removed. Snowflake constraints are informational only and not enforced.

### Description

This message is shown when an `ALTER TABLE ... WITH CHECK ADD CONSTRAINT ... FOREIGN KEY ...` statement is converted. SnowConvert AI removes the `WITH CHECK` clause because Snowflake constraints are informational and are not enforced, so the validation semantics do not apply.

#### Code Example

##### Input Code:

```sql
ALTER TABLE dAccount
WITH CHECK ADD CONSTRAINT testFK
FOREIGN KEY (account_id) REFERENCES dInvoiceAccounts (account_id);
```

##### Generated Code:

```sql
--** SSC-FDM-TS0053 - WITH CHECK CLAUSE REMOVED, SNOWFLAKE CONSTRAINTS ARE INFORMATIONAL ONLY AND NOT ENFORCED **
ALTER TABLE dAccount ADD CONSTRAINT testFK FOREIGN KEY (account_id) REFERENCES dInvoiceAccounts (account_id);
```

#### Best Practices

* Review whether the source workflow depended on SQL Server validating existing data when the constraint was added.
* If validation is required after migration, implement an explicit data-quality check in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0054

CHECK/NOCHECK CONSTRAINT statement removed. Enabling or disabling constraints is not applicable in Snowflake.

### Description

This message is shown when `ALTER TABLE ... CHECK CONSTRAINT ...` or `ALTER TABLE ... NOCHECK CONSTRAINT ...` is converted. SnowConvert AI comments out the statement because Snowflake does not support enabling or disabling constraints in the same way SQL Server does.

#### Code Example

##### Input Code:

```sql
ALTER TABLE dbo.FactPoolSummary CHECK CONSTRAINT DimPoolFKFactPoolSummary01;
```

##### Generated Code:

```sql
----** SSC-FDM-TS0054 - CHECK CONSTRAINT STATEMENT REMOVED, ENABLING/DISABLING CONSTRAINTS IS NOT APPLICABLE IN SNOWFLAKE **
--ALTER TABLE IF EXISTS dbo.FactPoolSummary CHECK CONSTRAINT DimPoolFKFactPoolSummary01;
```

#### Best Practices

* Review any operational process that temporarily disables constraints during bulk loads or maintenance.
* If the source process relied on constraint state transitions, redesign that workflow explicitly for Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0055

Label statement commented out. No GOTO references this label so it is not required in Snowflake.

### Description

This message is shown when a labeled statement in a stored procedure has no corresponding `GOTO` that references it. Since the label serves no control flow purpose, SnowConvert AI comments it out. The statements that follow the label are preserved and execute normally.

#### Code Example

In this example, the `Cleanup` label exists in the procedure body but no `GOTO Cleanup` references it, so the label is commented out while the `RETURN` statement beneath it is preserved:

##### Input Code:

```sql
CREATE PROCEDURE dbo.UpdateCustomerStatus
AS
BEGIN
    DECLARE @ErrorCode INT = 0
    SET @ErrorCode = 1
Cleanup:
    RETURN @ErrorCode
END
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE dbo.UpdateCustomerStatus ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ERRORCODE INT := 0;
  BEGIN
    ERRORCODE := 1;

--    --** SSC-FDM-TS0055 - LABEL STATEMENT COMMENTED OUT. NO GOTO REFERENCES THIS LABEL SO IT IS NOT REQUIRED IN SNOWFLAKE. **
--    Cleanup:
    RETURN :ERRORCODE;
  END;
$$;
```

#### Best Practices

* No action is required. The label had no effect on control flow and was safely removed.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

* [SSC-EWI-TS0045](../conversion-issues/sqlServerEWI.md): Labeled statement is not supported in Snowflake Scripting.

## SSC-FDM-TS0056

CREATE USER statement commented out, database-scoped user management is not applicable in Snowflake.

### Description

In SQL Server, `CREATE USER` creates a database-scoped user that is typically tied to a SQL Server login (`FOR LOGIN`) or a Windows domain account, and may include a `DEFAULT_SCHEMA` assignment. Snowflake does not have an equivalent concept — users are managed at the account level rather than within individual databases, so there is no direct translation. SnowConvert AI comments out the entire statement and adds this FDM marker. All variants are handled, including `FOR LOGIN`, `WITH DEFAULT_SCHEMA`, and combinations of both.

#### Code Example

##### Input Code:

```sql
CREATE USER [CORP\DEV-LA] FOR LOGIN [CORP\DEV-LA]
CREATE USER [Corp\IceService] FOR LOGIN [CORP\IceService] WITH DEFAULT_SCHEMA=[dbo]
CREATE USER [CORP\GOLDEV] WITH DEFAULT_SCHEMA=[dbo]
```

##### Generated Code:

```sql
----** SSC-FDM-TS0056 - CREATE USER STATEMENT COMMENTED OUT, DATABASE-SCOPED USER MANAGEMENT IS NOT APPLICABLE IN SNOWFLAKE **
--CREATE USER [CORP\DEV-LA] FOR LOGIN [CORP\DEV-LA]

----** SSC-FDM-TS0056 - CREATE USER STATEMENT COMMENTED OUT, DATABASE-SCOPED USER MANAGEMENT IS NOT APPLICABLE IN SNOWFLAKE **
--CREATE USER [Corp\IceService] FOR LOGIN [CORP\IceService] WITH DEFAULT_SCHEMA = [dbo]

----** SSC-FDM-TS0056 - CREATE USER STATEMENT COMMENTED OUT, DATABASE-SCOPED USER MANAGEMENT IS NOT APPLICABLE IN SNOWFLAKE **
--CREATE USER [CORP\GOLDEV] WITH DEFAULT_SCHEMA = [dbo]
```

#### Best Practices

* Review the commented-out `CREATE USER` statements and recreate the users at the Snowflake account level using `CREATE USER` with Snowflake’s syntax.
* Map SQL Server logins and Windows domain accounts to Snowflake’s identity providers (SSO, SCIM, or key-pair authentication).
* Default schema assignments can be configured per user in Snowflake using `ALTER USER ... SET DEFAULT_NAMESPACE`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0057

DEALLOCATE is not required in Snowflake Scripting. Cursors are automatically deallocated when they go out of scope.

### Description

In SQL Server, `DEALLOCATE` releases the data structures and locks held by a cursor. In Snowflake Scripting, cursors are automatically deallocated when they go out of scope, so there’s no functional need for an explicit `DEALLOCATE` statement. SnowConvert AI comments out the statement and adds this FDM marker to indicate no user action is required.

#### Code Example

##### Input Code:

```sql
CREATE PROCEDURE dbo.SimpleCursorProc
AS
BEGIN
    DECLARE @ItemId INT

    DECLARE item_curs CURSOR FOR
        SELECT ItemId FROM dbo.Items

    OPEN item_curs

    FETCH NEXT FROM item_curs INTO @ItemId

    CLOSE item_curs
    DEALLOCATE item_curs
END
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE dbo.SimpleCursorProc ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ITEMID INT;
    --** SSC-FDM-TS0013 - SNOWFLAKE SCRIPTING CURSOR ROWS ARE NOT MODIFIABLE **
    item_curs CURSOR
    FOR
      SELECT
        ItemId
      FROM
        dbo.Items;
  BEGIN
    OPEN item_curs;
    FETCH
      item_curs
    INTO
      :ITEMID;
    CLOSE item_curs;
--    --** SSC-FDM-TS0057 - DEALLOCATE IS NOT REQUIRED IN SNOWFLAKE SCRIPTING. CURSORS ARE AUTOMATICALLY DEALLOCATED WHEN THEY GO OUT OF SCOPE. **
--    DEALLOCATE item_curs
  END;
$$;
```

#### Best Practices

* No additional user actions are required. The commented-out `DEALLOCATE` statement can safely be left as is or removed entirely.
* Snowflake automatically deallocates cursors when the procedure or block scope ends.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0059

Synonym references renamed to original object name.

### Description

This message is shown when a `CREATE SYNONYM` statement is converted. Snowflake does not support synonyms, so SnowConvert AI comments out the `CREATE SYNONYM` statement and replaces all references to the synonym with the original object name.

#### Code Example

##### Input Code:

```sql
CREATE SYNONYM MyProduct FOR inventory.product;
GO
SELECT * FROM MyProduct;
```

##### Generated Code:

```sql
----** SSC-FDM-TS0059 - SYNONYMS ARE NOT SUPPORTED IN SNOWFLAKE. REFERENCES TO THIS SYNONYM HAVE BEEN REPLACED WITH THE ORIGINAL OBJECT NAME. **
--CREATE SYNONYM MyProduct FOR inventory.product;

SELECT
  *
FROM
  inventory.product;
```

#### Best Practices

* SnowConvert AI only replaces synonym references found within the converted scripts. Any external code that references the synonym — such as ETL pipelines, application queries, or stored procedures in other scripts — must be updated manually to use the original object name.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TS0060

The database_id parameter in OBJECT_SCHEMA_NAME is not supported in Snowflake and has been removed.

### Description

SQL Server’s `OBJECT_SCHEMA_NAME` function accepts an optional second parameter `database_id` that allows cross-database schema lookups:

```sql
OBJECT_SCHEMA_NAME(object_id, database_id)
```

Snowflake’s `INFORMATION_SCHEMA` views are scoped to the current database by default and do not support a numeric `database_id` parameter. When SnowConvert AI encounters the two-argument form, it converts the function to `OBJECT_SCHEMA_NAME_UDF` using only the first argument and drops the `database_id` parameter, adding this FDM to indicate the behavioral difference.

#### Code Example

##### Input Code:

```sql
SELECT OBJECT_SCHEMA_NAME(1, 1);
```

##### Generated Code:

```sql
SELECT
PUBLIC.OBJECT_SCHEMA_NAME_UDF(1) /*** SSC-FDM-TS0060 - THE DATABASE_ID PARAMETER IN OBJECT_SCHEMA_NAME IS NOT SUPPORTED IN SNOWFLAKE AND HAS BEEN REMOVED ***/;
```

##### Input Code (dynamic database_id):

```sql
SELECT OBJECT_SCHEMA_NAME(object_id, DB_ID()) FROM sys.objects;
```

##### Generated Code:

```sql
SELECT
   PUBLIC.OBJECT_SCHEMA_NAME_UDF(object_id) /*** SSC-FDM-TS0060 - THE DATABASE_ID PARAMETER IN OBJECT_SCHEMA_NAME IS NOT SUPPORTED IN SNOWFLAKE AND HAS BEEN REMOVED ***/
FROM
   sys.objects;
```

#### Best Practices

* If the original query targets a different database, you may need to manually qualify the UDF call or switch database context using `USE DATABASE`.
* For single-database migrations, the removed `database_id` parameter typically has no impact since `INFORMATION_SCHEMA` queries default to the current database.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - SQL Server-Azure Synapse Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse Issues

Applies to

* SQL Server
* Azure Synapse Analytics
* Sybase

## SSC-EWI-TS0001

User defined function body not generated

### Severity

Critical

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI appears when SnowConvert AI handles a critical exception that causes the function body not to be generated during its translation.

#### Example Code

##### SQL Server

```sql
 CREATE FUNCTION func1 ()
RETURNS VARCHAR
SELECT
   *
FROM
   TABLE1
```

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
CREATE OR REPLACE FUNCTION func1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0001 - THE BODY WAS NOT GENERATED FOR FUNCTION 'func1' ***/!!!
AS
$$

$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0002

The ANSI_PADDING OFF is not supported in Snowflake.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

In SQL Server, `SET ANSI_PADDING` controls how trailing blanks are stored in `CHAR`, `VARCHAR`, `BINARY`, and `VARBINARY` columns. When `SET ANSI_PADDING OFF` is active during a `CREATE TABLE` or `ALTER TABLE` statement, SQL Server records this setting as a **column-level property** on each column defined in that scope. This means the trimming behavior is permanently associated with the column definition, regardless of the session setting at the time data is inserted.

Specifically, when a column is created with `ANSI_PADDING OFF`:

* `VARCHAR` columns have trailing blanks trimmed on insert.
* `VARBINARY` columns have trailing zeros trimmed on insert.
* `CHAR` columns are trimmed of trailing blanks (instead of being right-padded to the defined length).

Snowflake has no equivalent column-level property. Snowflake always preserves trailing spaces in string values (equivalent to `ANSI_PADDING ON`). There is no way to configure a Snowflake column to automatically trim trailing blanks on insert.

Since this is a storage-level semantic difference that cannot be automatically translated, SnowConvert AI flags the statement with this EWI.

#### Example Code

##### SQL Server

```sql
 SET ANSI_PADDING OFF;
```

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0002 - THE ANSI_PADDING OFF IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
SET ANSI_PADDING OFF;
```

#### Limitations

Snowflake does not support `ANSI_PADDING OFF` semantics at any level (session, database, or column).

* **Wrapping inserts with `RTRIM`**: This only handles explicit `INSERT` statements in the migrated code. It does not cover data loaded through ETL pipelines, external tools, or application code that also relied on the column’s `ANSI_PADDING OFF` property.
* **Session-level setting**: Snowflake has no session parameter equivalent to `SET ANSI_PADDING`.
* **Column-level constraint or default**: Snowflake does not allow defining a column property that automatically trims trailing spaces.

To fully replicate `ANSI_PADDING OFF` behavior in Snowflake, manual intervention is required at the data pipeline level for every affected column.

#### Best Practices

* Identify all columns that were created under `SET ANSI_PADDING OFF` in the source SQL Server database. You can query `sys.columns` with `is_ansi_padded = 0` to find them.
* For each affected column, ensure that all data insertion paths (SQL scripts, ETL pipelines, application code) apply `RTRIM` before inserting into the corresponding Snowflake column.
* Consider creating a Snowflake stored procedure or UDF wrapper that enforces trimming for the affected columns.
* Review whether the trimming behavior is actually relied upon by downstream consumers. In some cases, `ANSI_PADDING OFF` was set by default in legacy scripts without the application depending on the trimming behavior.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0003

The ANSI_WARNINGS OFF is not supported in Snowflake.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

In Transact-SQL, the statement `SET ANSI_WARNINGS OFF` disables warnings such as Division by Zero or arithmetic overflow. Since `SET ANSI_WARNINGS OFF` is not a directly configurable setting in Snowflake, SnowConvert AI will generate this EWI.

#### Example Code

##### SQL Server

```sql
 SET ANSI_WARNINGS OFF;
```

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0003 - THE ANSI_WARNINGS OFF IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
SET ANSI_WARNINGS OFF;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0009

The following transaction may contain nested transactions and this is considered a complex pattern not supported in Snowflake.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

High

#### Description

This error is added to indicate when a transaction may contain nested transactions. In SQL Server, transactions can be nested. This means that it is possible to start a new transaction within an existing transaction. If after the first BEGIN statement, we execute another one, a new transaction will be opened and the current transaction count will be increased by one.\

On the other hand this is not supported in Snowflake, what will happen is that the second BEGIN statement will be ignored and we will still have only one transaction. For more information please refer to [SQL Server Transactions](https://learn.microsoft.com/en-us/sql/t-sql/language-elements/transactions-transact-sql?view=sql-server-ver16).

#### Code Example

##### Input Code:

```sql
 CREATE PROC transactionsTest
AS
BEGIN TRANSACTION
   SELECT @@TRANCOUNT AS TransactionCount_AfterFirstTransaction
   INSERT INTO TESTSCHEMA.TESTTABLE(ID) VALUES (1), (2)
   BEGIN TRANSACTION
      SELECT @@TRANCOUNT AS TransactionCount_AfterSecondTransaction
      INSERT INTO TESTSCHEMA.TESTTABLE(ID) VALUES (3), (4)
   COMMIT;
   SELECT @@TRANCOUNT AS TransactionCount_AfterFirstCommit
COMMIT;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE transactionsTest ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  ProcedureResultSet1 VARCHAR;
  ProcedureResultSet2 VARCHAR;
  ProcedureResultSet3 VARCHAR;
  return_arr ARRAY := array_construct();
 BEGIN
  !!!RESOLVE EWI!!! /*** SSC-EWI-TS0009 - THE FOLLOWING TRANSACTION MAY CONTAIN NESTED TRANSACTIONS WHICH ARE NOT SUPPORTED IN SNOWFLAKE. ***/!!!
  BEGIN TRANSACTION;
  ProcedureResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
  CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet1) AS
   SELECT
    :TRANCOUNT AS TransactionCount_AfterFirstTransaction;
  return_arr := array_append(return_arr, :ProcedureResultSet1);
  INSERT INTO TESTSCHEMA.TESTTABLE (ID) VALUES (1), (2);
  BEGIN TRANSACTION;
  ProcedureResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
  CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet2) AS
   SELECT
    :TRANCOUNT AS TransactionCount_AfterSecondTransaction;
  return_arr := array_append(return_arr, :ProcedureResultSet2);
  INSERT INTO TESTSCHEMA.TESTTABLE (ID) VALUES (3), (4);
  COMMIT;
  ProcedureResultSet3 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
  CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet3) AS
   SELECT
    :TRANCOUNT AS TransactionCount_AfterFirstCommit;
  return_arr := array_append(return_arr, :ProcedureResultSet3);
  COMMIT;
  --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
  RETURN return_arr;
 END;
$$;
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '12' COLUMN '1' OF THE SOURCE CODE STARTING AT 'END'. EXPECTED 'BATCH' GRAMMAR. **
--END
   ;
```

#### Best Practices

* In Snowflake nested transactions will not cause compilation errors, they will simply be ignored. You can access the assessment reports to check if nested transactions are present.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWI

1. [SSC-FDM-0020](../functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables
2. [SSC-EWI-0001](generalEWI.md): Unrecognized token on the line of the source code.
3. [SSC-EWI-0040](generalEWI.md): Statement Not Supported.

## SSC-EWI-TS0010

Common table expression in view not supported in Snowflake.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

High

#### Description

This error is added when an invalid CTE is inside a view since views are materialized representations of queries, which means that they only define how data is retrieved and presented, not how it is manipulated.

#### Code Example

##### Input Code:

```sql
 Create View viewName
as
with commonTableExpressionName (
   columnName
) as
(
   select
      1
)
((select
   1 as col2)
union
(
   select
      1 as col3
));
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0010 - COMMON TABLE EXPRESSION IN VIEW NOT SUPPORTED IN SNOWFLAKE. ***/!!!
CREATE OR REPLACE VIEW viewName
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
AS
!!!RESOLVE EWI!!! /*** SSC-EWI-0021 - WITH CTE NOT SUPPORTED IN SNOWFLAKE ***/!!!
with commonTableExpressionName (
   columnName
) as
(
   select
      1
)
((select
   1 as col2)
union
(
   select
      1 as col3
));
```

### Related EWI

1. [SSC-EWI-0021](generalEWI.md): Not supported.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0013

Computed column transformed

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0013](../functional-difference/sqlServerFDM.md) documentation

### Severity

Low

#### Description

This warning is added when an SQL Server computed column is transformed to its Snowflake equivalent. It is added because, in some cases, the functional equivalence could be affected.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE [TestTable](
    [Col1] AS (CONVERT ([REAL], ExpressionValue))
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TestTable (
    Col1 REAL AS (CAST(ExpressionValue AS REAL)) /*** SSC-FDM-TS0014 - COMPUTED COLUMN WAS TRANSFORMED TO ITS SNOWFLAKE EQUIVALENT, FUNCTIONAL EQUIVALENCE VERIFICATION PENDING. ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

#### Best Practices

* No additional user actions are required; it is just informative.
* Add manual changes to the not-transformed expression.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0015

Data type is not supported in Snowflake

### Severity

Medium

### Description

This warning is added when an SQL Server column has an unsupported type in Snowflake.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE table1
(
    column1 customType,
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table1
(
    column1 VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE CUSTOMTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

#### Best Practices

* Check the Snowflake data types [documentation](https://docs.snowflake.com/en/sql-reference/data-types.html) to find an equivalent for the data type.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0016

Translation for ODBC Scalar function pending

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Description

This EWI is added when SnowConvert AI finds an ODBC Scalar function inside the input code.
User-defined functions are not supported in ODBC Scalar Function.

#### Code Example

##### Input Code:

```sql
 SELECT {fn CURRENT_DATE_UDF()};
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "CURRENT_DATE_UDF" **
SELECT
CURRENT_DATE_UDF() !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'CURRENT_DATE_UDF' NODE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-TS0016 - USER DEFINED FUNCTIONS ARE NOT SUPPORTED IN ODBC SCALAR FUNCTION. ***/!!!;
```

### Related EWI

1. [SSC-EWI-0073](generalEWI.md): Pending Functional Equivalence Review.

#### Best Practices

* No additional user actions are required; it is just informative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0017

Masking not supported

### Severity

Low

#### Description

This EWI is added when SnowConvert AI finds a masked column inside a `CREATE TABLE` statement. This functionality doesn’t work by adding the option in the column declaration. Manual effort is needed to have the same behavior as SQL Server.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE TABLE1
(
  [COL1] nvarchar MASKED WITH (FUNCTION = 'default()') NULL,
  [COL2] varchar(100) MASKED WITH (FUNCTION = 'partial(1, "xxxxx", 1)') NULL,
  [COL3] varchar(100) MASKED WITH (FUNCTION = 'email()') NOT NULL,
  [COL4] smallint MASKED WITH (FUNCTION = 'random(1, 100)') NULL
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TABLE1
(
  COL1 VARCHAR
               !!!RESOLVE EWI!!! /*** SSC-EWI-TS0017 - COLUMN MASKING NOT SUPPORTED IN CREATE TABLE ***/!!!
               MASKED WITH (FUNCTION = 'default()') NULL,
  COL2 VARCHAR(100)
                    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0017 - COLUMN MASKING NOT SUPPORTED IN CREATE TABLE ***/!!!
 MASKED WITH (FUNCTION = 'partial(1, "xxxxx", 1)') NULL,
  COL3 VARCHAR(100)
                    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0017 - COLUMN MASKING NOT SUPPORTED IN CREATE TABLE ***/!!!
 MASKED WITH (FUNCTION = 'email()') NOT NULL,
  COL4 SMALLINT
                !!!RESOLVE EWI!!! /*** SSC-EWI-TS0017 - COLUMN MASKING NOT SUPPORTED IN CREATE TABLE ***/!!!
                MASKED WITH (FUNCTION = 'random(1, 100)') NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;
```

#### Best Practices

SnowConvert AI is not generating `MASKING POLICIES` in the current version, so they have to be created manually. E.g.:

The first step is to create a masking policy administrator role.

```sql
 create role masking_admin;
```

The second one is to grant the necessary privileges to the created role.

```sql
 grant create masking policy on schema PUBLIC to role masking_admin;
allow table_owner role to set or unset the ssn_mask masking policy -- (optional)
grant apply on masking policy ssn_mask to role table_owner;
```

The next step is to create the masking policy functions.

```sql
 -- default mask
create or replace masking policy default_mask as (val string) returns string ->
case
when current_role() in ('ANALYST') then val
else 'xxxx'
end;

-- partial mask
create or replace masking policy partial_mask as (val string) returns string ->
case
when current_role() in ('ANALYST') then val
else LEFT(val,1) || 'xxxxx' || RIGHT(val,1)
end;

-- email mask
create or replace masking policy email_mask as (val string) returns string ->
case
when current_role() in ('ANALYST') then val
else LEFT(val,1) || 'XXX@XXX.com'
end;

-- random mask
create or replace masking policy random_mask as (val smallint) returns smallint ->
case
when current_role() in ('ANALYST') then val
else UNIFORM(1,100,RANDOM())::SMALLINT
end;
```

> **Note:**
>
> For sample purposes, we are taking some examples of masking functions in SQL Server, and manually translating it into its equivalent in Snowflake.

The final step is to add the masking policy to the column that originally had the masking option in SQL Server.

```sql
 alter table if exists TABLE1 modify column COL1 set masking policy default_mask;
alter table if exists TABLE1 modify column COL2 set masking policy partial_mask;
alter table if exists TABLE1 modify column COL3 set masking policy email_mask;
alter table if exists TABLE1 modify column COL4 set masking policy random_mask;
```

If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0023

Bulk option not supported

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when some option in a `BULK INSERT` could not be mapped. The translated bulk options should be reflected as `FILE FORMAT` options.

#### Code Example

##### Input Code:

```sql
 BULK INSERT #PCE FROM 'E:\PCE_Look-up_table.txt'
WITH
(
   FIELDTERMINATOR ='\t',
   ROWTERMINATOR ='\n',
   FIRE_TRIGGERS
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE FILE FORMAT FILE_FORMAT_638461199649565070
FIELD_DELIMITER = '\t'
RECORD_DELIMITER = '\n'
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0023 - 'FIRE_TRIGGERS' BULK OPTION COULD NOT BE TRANSFORMED TO ANY OF THE EXISTING FILE FORMAT OPTIONS ***/!!!
FIRE_TRIGGERS;

CREATE OR REPLACE STAGE STAGE_638461199649565070
FILE_FORMAT = FILE_FORMAT_638461199649565070;

--** SSC-FDM-TS0004 - PUT STATEMENT IS NOT SUPPORTED ON WEB UI. YOU SHOULD EXECUTE THE CODE THROUGH THE SNOWFLAKE CLI **
PUT file://E:\PCE_Look-up_table.txt @STAGE_638461199649565070 AUTO_COMPRESS = FALSE;

COPY INTO T_PCE FROM @STAGE_638461199649565070/PCE_Look-up_table.txt;
```

#### Best Practices

* Visit the SnowSQL CLI [user guide](https://docs.snowflake.com/en/user-guide/snowsql.html).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWI

1. [SSC-FDM-TS0004](../functional-difference/sqlServerFDM.md): PUT statement not supported on UI.

## SSC-EWI-TS0024

Incomplete transformation for Bulk Insert

### Severity

Low

#### Description

This EWI is added when a `BULK INSERT` inside a stored procedure was not identified at all, so the dependencies for the complete transformation will not be generated. Also the transformed `COPY INTO` retrieves the file from a `tempStage` that needs to be created manually.

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE BULK_PROC2
AS
BULK INSERT dbo.table1 FROM 'E:\test.txt'
WITH
(
   FIELDTERMINATOR ='\t',
   ROWTERMINATOR ='\n'
);

GO
```

##### Generated Code:

```sql
 CREATE OR REPLACE FILE FORMAT FILE_FORMAT_638461207064166040
FIELD_DELIMITER = '\t'
RECORD_DELIMITER = '\n';

CREATE OR REPLACE STAGE STAGE_638461207064166040
FILE_FORMAT = FILE_FORMAT_638461207064166040;

--** SSC-FDM-TS0004 - PUT STATEMENT IS NOT SUPPORTED ON WEB UI. YOU SHOULD EXECUTE THE CODE THROUGH THE SNOWFLAKE CLI **
PUT file://E:\test.txt @STAGE_638461207064166040 AUTO_COMPRESS = FALSE;

CREATE OR REPLACE PROCEDURE BULK_PROC2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
   // REGION SnowConvert AI Helpers Code
   // END REGION

   EXEC(`COPY INTO dbo.table1 FROM @STAGE_638461207064166040/test.txt`);
$$
```

#### Best Practices

* To retrieve the file, manually create a [STAGE](https://docs.snowflake.com/en/sql-reference/sql/create-stage.html) and a [FILE FORMAT](https://docs.snowflake.com/en/sql-reference/sql/create-file-format.html).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0025

ERROR_SEVERITY function transformed

### Severity

Low

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag -t JavaScript or –PLTargetLanguage JavaScript

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when [ERROR_SEVERITY](https://docs.microsoft.com/en-us/sql/t-sql/functions/error-severity-transact-sql?view=sql-server-ver15) built-in function is translated. By default, the function will return 16 as it is the most common severity in SQL Server. The generated UDF should retrie

#### Code Example

##### Input Code:

```sql
 -- Additional Params: -t JavaScript
CREATE procedure proc1()
as
BEGIN TRY
    -- Generate a divide-by-zero error.
    SELECT 1/0 from table1;
END TRY
BEGIN CATCH
    return ERROR_SEVERITY();
END CATCH;
GO
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    try {
        EXEC(`    -- Generate a divide-by-zero error.
    SELECT
       TRUNC( 1/0) from
       table1`);
    } catch(error) {
        return SELECT(`   !!!RESOLVE EWI!!! /*** SSC-EWI-TS0025 - CUSTOM UDF 'ERROR_SEVERITY_UDF' INSERTED FOR ERROR_SEVERITY FUNCTION. ***/!!!
   ERROR_SEVERITY_UDF()`);
    }
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0026

With Delete Query turned to Create Table.

### Severity

Low

#### Description

This EWI is added when a Common Table Expression With a Delete From is transformed to a Create or Replace Table.

#### Code Example

##### Input Code:

```sql
 WITH Duplicated AS (
SELECT *, ROW_NUMBER() OVER (PARTITION BY ID ORDER BY ID) AS RN
FROM WithQueryTest
)
DELETE FROM Duplicated
WHERE Duplicated.RN > 1
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0026 - WITH DELETE QUERY TURNED TO CREATE TABLE ***/!!!
CREATE OR REPLACE TABLE WithQueryTest AS
SELECT
*
FROM
WithQueryTest
QUALIFY
ROW_NUMBER()
OVER (PARTITION BY
ID
ORDER BY ID) = 1;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0032

Bulk Insert Partially Translated

> **Warning:**
>
> The EWI is only generated when Javascript is the target language for Stored Procedures. This is a deprecated translation feature, as Snowflake Scripting is the recommended target language for Stored Procedures.

### Severity

High

> **Note:**
>
> Generate Procedures and Macros using JavaScript as the target language adding the following flag -t JavaScript or –PLTargetLanguage JavaScript

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added to a literal that was originally a concatenation, when the contained code had a `BULK INSERT` statement. The `PUT` command resulting from the `BULK INSERT` translation is not supported when executing code that was originally Dynamic SQL.

For this reason, the `PUT` command must be extracted from the output code and executed manually outside of the procedure that contains it. Keep in mind that if there are many `BULK INSERT` statements in Dynamic SQL sentences within the procedure, it is advised to split this procedure to be able to manually execute the corresponding `PUT` command for each translated `BULK INSERT`.

#### Code Example

##### Input Code:

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE  [dbo].[Load_FuelMgtMasterData]
AS
    BEGIN
        SET NOCOUNT ON;

        DECLARE
            @SQLString VARCHAR(500)
        ,   @ImportName VARCHAR(200)
        ,   @Today DATE
        ,   @Yesterday DATE
        ,   @SourceAffiliates VARCHAR(200);

        SET @Today = GETDATE();
        SET @Yesterday = DATEADD(DAY, -1, @Today);
        TRUNCATE TABLE dbo.SourceFM_Affiliates;
        SET @ImportName = '\\' + +@@ServerName
            + '\WorkA\merchantportal\affiliates.txt';
        SET @SQLString = 'BULK INSERT ' + @SourceAffiliates + ' FROM '''
            + @ImportName + '''';
        EXEC (@SQLString);
    END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE dbo.Load_FuelMgtMasterData ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    /*** SSC-EWI-0040 - THE 'SET' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/
    /*        SET NOCOUNT ON*/
    ;
    let SQLSTRING;
    let IMPORTNAME;
    let TODAY;
    let YESTERDAY;
    let SOURCEAFFILIATES;
    TODAY = SELECT(`   CURRENT_TIMESTAMP() :: TIMESTAMP`);
    YESTERDAY = SELECT(`   DATEADD(DAY, -1, ?)`,[TODAY]);
    EXEC(`        TRUNCATE TABLE dbo.SourceFM_Affiliates`);
    IMPORTNAME = `\\` + SERVERNAME + `\WorkA\merchantportal\affiliates.txt`;
    SQLSTRING =
        // ** SSC-EWI-TS0032 - THE BULK INSERT WAS PART OF A DYNAMIC SQL, WHICH MAKES SOME OF THE TRANSLATED ELEMENTS INVALID UNLESS EXECUTED OUTSIDE DYNAMIC CODE. **
        `CREATE OR REPLACE FILE FORMAT FILE_FORMAT_638923328992788100;

CREATE OR REPLACE STAGE STAGE_638923328992788100
FILE_FORMAT = FILE_FORMAT_638923328992788100;

PUT file://${IMPORTNAME} @STAGE_638923328992788100 AUTO_COMPRESS = FALSE;

COPY INTO ${SOURCEAFFILIATES}
FROM @STAGE_638923328992788100/${IMPORTNAME}`;
    EXEC(`${SQLSTRING}`);
$$;
```

#### Best Practices

* Extract the `PUT` command that resulted from the Dynamic `BULK INSERT` statement, and execute it before calling the procedure.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0034

RETURNS clause incomplete due to missing symbols

### Severity

High

#### Description

This EWI is added to the output code when the `RETURNS TABLE` clause of a `CREATE FUNCTION` could not be properly generated. This happens when the columns that must be specified in the resulting `RETURNS TABLE` clause cannot be inferred by SnowConvert AI, thus leaving the `RETURNS TABLE` clause empty.

#### Code Example

##### Input Code:

```sql
 CREATE FUNCTION Sales.ufn_SalesByStore2()
RETURNS TABLE
AS
RETURN
(
  WITH CTE AS (
  SELECT DepartmentID, Name, GroupName
  FROM HumanResources.Department
  )
  SELECT tab.* FROM CTE tab
);

GO

SELECT * FROM GetDepartmentInfo();
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "HumanResources.Department" **
CREATE OR REPLACE FUNCTION Sales.ufn_SalesByStore2 ()
RETURNS TABLE(
  DepartmentID STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN DepartmentID WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  Name STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN Name WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/,
  GroupName STRING /*** SSC-FDM-TS0012 - INFORMATION FOR THE COLUMN GroupName WAS NOT FOUND. STRING DATATYPE USED TO MATCH CAST AS STRING OPERATION ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
AS
$$
  --** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
    WITH CTE AS (
    SELECT
      DepartmentID,
      Name,
      GroupName
    FROM
      HumanResources.Department
    )
    SELECT tab.* FROM
    CTE tab
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "GetDepartmentInfo" **

SELECT
    *
FROM
    TABLE(GetDepartmentInfo());
```

#### Best Practices

* The causes for this issue may vary. Be sure to include all the objects that your code needs. If the issue persists even though the migration has access to all the necessary objects, please do contact us with information about your specific scenario.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0035

Declaring a Cursor Variable that it is never initialized is not supported.

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

Currently, a Cursor Variable that is declared but never initialized is not supported by Snowflake. Thus, the EWI is added, and the code commented out.

#### Code Example

##### Input Code:

```sql
 CREATE OR ALTER PROCEDURE notInitializedCursorTest
AS
BEGIN
    -- Should be marked with SSC-EWI-TS0035
    DECLARE @MyCursor CURSOR, @MyCursor2 CURSOR;
    -- Should not be marked
    DECLARE cursorVar CURSOR FORWARD_ONLY STATIC READ_ONLY
        FOR
        SELECT someCol
        FROM someTable;
    RETURN 'DONE';
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE notInitializedCursorTest ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        -- Should be marked with SSC-EWI-TS0035
        !!!RESOLVE EWI!!! /*** SSC-EWI-TS0035 - CURSOR VARIABLE DECLARED BUT NEVER INITIALIZED, THIS IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
        MYCURSOR CURSOR;
        !!!RESOLVE EWI!!! /*** SSC-EWI-TS0035 - CURSOR VARIABLE DECLARED BUT NEVER INITIALIZED, THIS IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
        MYCURSOR2 CURSOR;
        -- Should not be marked
        cursorVar CURSOR
        FOR
            SELECT
                someCol
            FROM
                someTable;
    BEGIN

        RETURN 'DONE';
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0036

Snowflake Scripting only supports Local Cursors.

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when Cursors other than Local Cursors are identified. Currently, Snowflake Scripting only supports Local Cursors. Thus, all Cursors are translated as Local Cursors.

#### Code Example

##### Input Code:

```sql
 CREATE OR ALTER PROCEDURE globalCursorTest
AS
BEGIN
    -- Should be marked with SSC-EWI-TS0036
    DECLARE MyCursor CURSOR GLOBAL STATIC READ_ONLY
        FOR
        SELECT *
        FROM exampleTable;
    -- Should not be marked
    DECLARE MyCursor2 CURSOR LOCAL STATIC READ_ONLY
        FOR
        SELECT testCol
        FROM myTable;
    RETURN 'DONE';
END;
```

```sql
 CREATE OR REPLACE PROCEDURE globalCursorTest ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        -- Should be marked with SSC-EWI-TS0036
        !!!RESOLVE EWI!!! /*** SSC-EWI-TS0036 - SNOWFLAKE SCRIPTING ONLY SUPPORTS LOCAL CURSORS ***/!!!
        MyCursor CURSOR
        FOR
            SELECT
                *
            FROM
                exampleTable;
        -- Should not be marked
        MyCursor2 CURSOR
        FOR
            SELECT
                testCol
            FROM
                myTable;
    BEGIN

        RETURN 'DONE';
    END;
$$;
```

```sql

```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0037

Snowflake Scripting Cursors are non-scrollable.

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

Snowflake Scripting Cursors are non-scrollable. Currently, only FETCH NEXT is supported.

#### Code Example

##### Input Code:

```sql
 CREATE OR ALTER PROCEDURE scrollablecursorTest
AS
BEGIN
    -- Should be marked with SSC-EWI-TS0037
    DECLARE CursorVar CURSOR SCROLL STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    -- Should not be marked
    DECLARE CursorVar2 CURSOR STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE CursorVar3 CURSOR FORWARD_ONLY STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    RETURN 'DONE';
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE scrollablecursorTest ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		-- Should be marked with SSC-EWI-TS0037
		!!!RESOLVE EWI!!! /*** SSC-EWI-TS0037 - SNOWFLAKE SCRIPTING CURSORS ARE NON-SCROLLABLE, ONLY FETCH NEXT IS SUPPORTED ***/!!!
		CursorVar CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		-- Should not be marked
		CursorVar2 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		CursorVar3 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
	BEGIN

		RETURN 'DONE';
	END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0039

Multiple SET Statements for the same cursor found.

### Severity

Medium

#### Description

This EWI is added when multiple SET Statements for the same cursor are found; All additional SET Statements are also commented out. This happens because having multiple SET Statements for the same cursor is not valid in Snowflake Scripting.

#### Example Code:

##### This EWI is added when multiple SET Statements for the same cursor are found; All additional SET Statements are also commented out. This happens because having multiple SET Statements for the same cursor is not valid in Snowflake Scripting.

#### Example Code:

##### Input Code:

```sql
 CREATE OR ALTER PROCEDURE multipleSetExample
AS
BEGIN
    DECLARE @MyCursor CURSOR;
    DECLARE @MyCursor2 CURSOR STATIC READ_ONLY
	FOR
	SELECT FirstName
	FROM vEmployee;
    DECLARE @MyCursor3 CURSOR;

    SET @MyCursor = CURSOR STATIC READ_ONLY
        FOR
        SELECT col3
        FROM defaultTable;

    SET @MyCursor3 = CURSOR STATIC READ_ONLY
    FOR
    SELECT *
    FROM someTable;

    SET @MyCursor = CURSOR DYNAMIC
        FOR
        SELECT col2
        FROM exampleTable;

    SET @MyCursor2 = CURSOR STATIC READ_ONLY
        FOR
        SELECT col3
        FROM defaultTable;

    RETURN 'DONE';
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE multipleSetExample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		MYCURSOR CURSOR
		FOR
			SELECT col3
			FROM defaultTable;
		MYCURSOR2 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		MYCURSOR3 CURSOR
		FOR
			SELECT *
			FROM someTable;
	BEGIN

		DECLARE
		MYCURSOR CURSOR
		FOR
			SELECT col3
			FROM defaultTable;
		MYCURSOR2 CURSOR
		FOR
			SELECT
				FirstName
			FROM
				vEmployee;
		MYCURSOR3 CURSOR
		FOR
			SELECT *
			FROM someTable;
	BEGIN

		!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'SET CURSOR' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-TS0039 - CURSOR VARIABLE MYCURSOR SET MULTIPLE TIMES, THIS IS NOT VALID IN SNOWFLAKE SCRIPTING ***/!!!

		SET @MyCursor = CURSOR DYNAMIC
		    FOR
		    SELECT col2
		    FROM exampleTable;
		!!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'SET CURSOR' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-TS0039 - CURSOR VARIABLE MYCURSOR2 SET MULTIPLE TIMES, THIS IS NOT VALID IN SNOWFLAKE SCRIPTING ***/!!!

    SET @MyCursor2 = CURSOR STATIC READ_ONLY
        FOR
        SELECT col3
        FROM defaultTable;
		RETURN 'DONE';
	END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0041

XML data type methods are not supported in Snowflake.

### Severity

Medium

#### Description

This EWI is added for the following [XML data type methods](https://docs.microsoft.com/en-us/sql/t-sql/xml/xml-data-type-methods?view=sql-server-ver15) that are not supported in Snowflake SQL:

* Value
* Query
* Exist
* Modify
* Nodes

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE xml_procedure
    @inUserGroupsXML XML
AS
BEGIN
    SELECT  entities.entity.value('TypeID[1]', 'VARCHAR(100)') AS TypeID
        ,entities.entity.value('Name[1]', 'VARCHAR(100)') AS Name
    INTO  #tmpUserGroups
    FROM  @inUserGroupsXML.nodes('/entities/entity') entities(entity)
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE xml_procedure (INUSERGROUPSXML TEXT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CREATE OR REPLACE TEMPORARY TABLE T_tmpUserGroups AS
            SELECT
                XMLGET(entity, '$') :: VARCHAR(100) AS TypeID
                ,
                XMLGET(entity, '$') :: VARCHAR(100) AS Name
            FROM
                !!!RESOLVE EWI!!! /*** SSC-EWI-TS0041 - XML TYPE METHOD nodes IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                T_inUserGroupsXML('/entities/entity') entities (
                    entity
                );
    END;
$$;
```

#### Best Practices

* Consider using UDFs to emulate the behavior of the source code
* You can [check this documentation](https://medium.com/snowflake/working-with-xml-in-snowflake-part-ii-774b4d32399) and review some possible approaches to work with XML datatypes in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0043

WITH XMLNAMESPACES is not supported in Snowflake.

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added fort the [WITH XMLNAMESPACES](https://docs.microsoft.com/en-us/sql/relational-databases/xml/add-namespaces-to-queries-with-with-xmlnamespaces?view=sql-server-ver15) clause which is not supported in Snowflake SQL

#### Code Example

##### Input Code:

```sql
 WITH XMLNAMESPACES ('uri' as ns1)
SELECT ProductID as 'ns1:ProductID',
Name      as 'ns1:Name',
Color     as 'ns1:Color'
FROM Production.Product
WHERE ProductID = 316
FOR XML RAW, ELEMENTS XSINIL
```

##### Generated Code:

```sql
 --** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
WITH
     !!!RESOLVE EWI!!! /*** SSC-EWI-TS0043 - WITH XMLNAMESPACES IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
 XMLNAMESPACES ('uri' as VARIANT !!!RESOLVE EWI!!! /*** SSC-EWI-TS0015 - DATA TYPE NS1 IS NOT SUPPORTED IN SNOWFLAKE ***/!!! NOT NULL)
SELECT
ProductID AS "ns1:ProductID",
Name AS "ns1:Name",
Color AS "ns1:Color"
FROM
Production.Product
WHERE
ProductID = 316
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0044 - FOR XML RAW CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FOR XML RAW, ELEMENTS XSINIL;
```

#### Best Practices

* Consider using UDFs to emulate the behavior of the source code. The following code provides suggestions of UDFs that can be used to achieve recreating the original behavior:

##### SQL Server

```sql
 CREATE  TABLE PRODUCT (ProductID INTEGER, Name VarChar(20), Color VarChar(20));
INSERT INTO PRODUCT(PRODUCTID, NAME, COLOR) VALUES(1,'UMBRELLA','RED');
INSERT INTO PRODUCT(PRODUCTID, NAME, COLOR) VALUES(2,'SHORTS','BLUE');
INSERT INTO PRODUCT(PRODUCTID, NAME, COLOR) VALUES(3,'BALL','YELLOW');

WITH XMLNAMESPACES ('uri' as ns1)
SELECT ProductID as 'ns1:ProductID',
       Name      as 'ns1:Name',
       Color     as 'ns1:Color'
FROM Product
FOR XML RAW
```

##### Snowflake SQL

```sql
 CREATE OR REPLACE TABLE PRODUCT (
       ProductID INTEGER,
       Name VARCHAR(20),
       Color VARCHAR(20))
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/12/2024" }}'
;

INSERT INTO PRODUCT (PRODUCTID, NAME, COLOR) VALUES(1,'UMBRELLA','RED');
INSERT INTO PRODUCT (PRODUCTID, NAME, COLOR) VALUES(2,'SHORTS','BLUE');
INSERT INTO PRODUCT (PRODUCTID, NAME, COLOR) VALUES(3,'BALL','YELLOW');

--** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **

WITH
     !!!RESOLVE EWI!!! /*** SSC-EWI-TS0043 - WITH XMLNAMESPACES IS NOT SUPPORTED IN SNOWFLAKE ***/!!! XMLNAMESPACES ('uri' as ns1)
SELECT
       ProductID AS "ns1:ProductID",
       Name AS "ns1:Name",
       Color AS "ns1:Color"
FROM
       Product
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0044 - FOR XML RAW CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FOR XML RAW;
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWI

1. [SSC-PRF-TS0001](../performance-review/sqlServerPRF.md): Performance warning - recursion for CTE not checked. Might require a recursive keyword.
2. SSC-EWI-TS0044: FOR XML clause is not supported in Snowflake.
3. SSC-EWI-TS0015: Data type not supported in Snowflake

## SSC-EWI-TS0044

FOR XML clause is not supported in Snowflake.

### Severity

Critical

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added for the [FOR XML](https://docs.microsoft.com/en-us/sql/relational-databases/xml/for-xml-sql-server?view=sql-server-ver15) clause which is not supported in Snowflake SQL

#### Code Example

##### Input Code:

```sql
 SELECT TOP 1 LastName
FROM AdventureWorks2019.Person.Person
FOR XML AUTO;
```

##### Generated Code:

```sql
 SELECT TOP 1
LastName
FROM
AdventureWorks2019.Person.Person
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0044 - FOR XML AUTO CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FOR XML AUTO;
```

#### Best Practices

* Consider using UDFs to emulate the behavior of the source code. The following code provides suggestions of UDFs that can be used to achieve recreating the original behavior:

SQL Server

##### Query

```sql
 CREATE TABLE TEMPTABLE (Ref INT, Des NVARCHAR(100), Qty INT)

INSERT INTO tempTable VALUES (100001, 'Normal', 1), (100002, 'Foobar', 1), (100003, 'Hello World', 2)

GO

-- FOR XML
SELECT *
FROM TempTable
FOR XML AUTO

GO

-- FOR XML RAW
SELECT *
FROM TempTable
FOR XML RAW
```

##### Result

```sql
 -- FOR XML
<TempTable Ref="100001" Des="Normal" Qty="1"/><TempTable Ref="100002" Des="Foobar" Qty="1"/><TempTable Ref="100003" Des="Hello World" Qty="2"/>

-- FOR XML RAW
<row Ref="100001" Des="Normal" Qty="1"/><row Ref="100002" Des="Foobar" Qty="1"/><row Ref="100003" Des="Hello World" Qty="2"/>
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE TABLE TEMPTABLE (
Ref INT,
Des VARCHAR(100),
Qty INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

INSERT INTO tempTable VALUES (100001, 'Normal', 1), (100002, 'Foobar', 1), (100003, 'Hello World', 2);

-- FOR XML
SELECT
*
FROM
TempTable
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0044 - FOR XML AUTO CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FOR XML AUTO;

-- FOR XML RAW
SELECT
*
FROM
TempTable
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0044 - FOR XML RAW CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
FOR XML RAW;
```

##### Result

```sql
 -- FOR XML
<TempTable DES="Normal" QTY="1" REF="100001"  /><TempTable DES="Foobar" QTY="1" REF="100002"  /><TempTable DES="Hello World" QTY="2" REF="100003"  />

-- FOR XML RAW
<row DES="Normal" QTY="1" REF="100001"  /><row DES="Foobar" QTY="1" REF="100002"  /><row DES="Hello World" QTY="2" REF="100003"  />
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0045

Labeled Statement is not supported in Snowflake Scripting.

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when SnowConvert AI encounters a [labeled statement](https://docs.microsoft.com/en-us/sql/t-sql/language-elements/goto-transact-sql?view=sql-server-ver15) in T-SQL that cannot be automatically transformed.

When GOTO/Label patterns appear inside a stored procedure with only forward jumps and top-level labels, SnowConvert AI automatically transforms them into nested procedure definitions with `CALL`/`RETURN` semantics — no EWI is emitted in those cases. See the [LABEL and GOTO translation reference](../../../../translation-references/transact/transact-create-procedure-snow-script.md) for details on the transformation.

This EWI is only emitted when the label **cannot** be transformed. This happens when the procedure contains a backward `GOTO` (one that targets a label appearing earlier in the source, which would require recursive calls), when labels appear inside anonymous blocks or UDFs (which do not support nested procedure definitions), or when labels are declared inside nested control flow blocks like `IF`, `WHILE`, or `TRY` (which cannot be extracted into nested procedures).

#### Code Example

The following example shows a backward GOTO pattern (retry logic) where the label `RetryConnection` appears before the `GOTO` that targets it, preventing automatic transformation:

##### Input Code:

```sql
CREATE PROCEDURE dbo.RetryDatabaseConnection
AS
BEGIN
    DECLARE @Attempts INT = 0
RetryConnection:
    SET @Attempts = @Attempts + 1
    IF @Attempts < 3
        GOTO RetryConnection
    RETURN 0
END
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE dbo.RetryDatabaseConnection ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ATTEMPTS INT := 0;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0045 - LABELED STATEMENT IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
    RetryConnection:
    ATTEMPTS := :ATTEMPTS + 1;
    IF (:ATTEMPTS < 3) THEN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TS0087 - GOTO IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      GOTO RetryConnection
    END IF;
    RETURN 0;
  END;
$$;
```

#### Best Practices

* For backward GOTO patterns like retry logic, refactor the control flow to use `WHILE` or `LOOP` constructs instead.
* For labels in anonymous blocks or UDFs, restructure the code into separate procedures or use `IF/ELSE` control flow.
* Forward GOTO/Label patterns inside stored procedures are automatically transformed — no manual action is required for those cases.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

1. SSC-EWI-TS0087: GOTO is not supported in Snowflake.
2. SSC-EWI-TS0103: GOTO targeting a label inside a nested block is not supported in Snowflake.

## SSC-EWI-TS0046

System table is not supported in Snowflake.

### Severity

Medium

#### Description

This EWI is added when referencing [SQL Server system tables](https://docs.microsoft.com/en-us/sql/relational-databases/system-catalog-views/object-catalog-views-transact-sql?view=sql-server-ver15) not supported or without equivalent in Snowflake SQL. See the [supported and unsupported system tables reference](../../../../translation-references/transact/transact-system-tables.md) for the complete list.

#### Code Example

##### Input Code:

```sql
 SELECT *
FROM
    sys.all_sql_modules
WHERE
    [STATE] = 0; -- state must be ONLINE
```

##### Generated Code:

```sql
 SELECT
    *
FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0046 - SYSTEM TABLE sys.all_sql_modules IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
    sys.all_sql_modules
WHERE
    STATE = 0; -- state must be ONLINE
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0047

RAISERROR Error Message may differ because of the SQL Server string format.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0019](../functional-difference/sqlServerFDM.md) documentation

### Severity

Low

#### Description

This EWI is added to notify that the RAISERROR Error Message may differ because of the SQL Server string format.

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE RAISERROR_PROCEDURE
AS
BEGIN
RAISERROR ('This is a sample error message with the first parameter %d and the second parameter %*.*s',
           10,
           1,
           123,
	   7,
	   7,
	   'param2');
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE RAISERROR_PROCEDURE ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		!!!RESOLVE EWI!!! /*** SSC-EWI-TS0047 - RAISERROR ERROR MESSAGE MAY DIFFER BECAUSE OF THE SQL SERVER STRING FORMAT ***/!!!
		SELECT
			RAISERROR_UDF('This is a sample error message with the first parameter %d and the second parameter %*.*s',
			10,
			1, array_construct(
			123,
7,
7,
'param2'));
	END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0049

Multiple Line If Body translation planned to be delivered in the future.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

Most of the cases in`IF` statements that contain a `Begin ... End` block inside their body are supported. This is a successful scenario (no SSC-EWI-TS0049 generated).

#### Code Example

##### Input Code:

```sql
 CREATE OR ALTER FUNCTION [PURCHASING].[FOO](@status INT)
Returns INT
As
Begin
    declare @result as int = 10;
    SELECT @result = quantity FROM TABLE1 WHERE COL1 = @status;
    IF @result = 3
    BEGIN
        IF @result>0 SELECT @result=0  ELSE SELECT @result=1
        SELECT @result = 1
    END
    return @result;
End
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0068 - USER DEFINED FUNCTION WAS TRANSFORMED TO SNOWFLAKE PROCEDURE ***/!!!
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TABLE1" **
CREATE OR REPLACE PROCEDURE PURCHASING.FOO (STATUS INT)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        RESULT INT := 10;
    BEGIN

        SELECT
            quantity
        INTO
            :RESULT
        FROM
            TABLE1
        WHERE
            COL1 = :STATUS;
        IF (:RESULT = 3) THEN
            BEGIN
                IF (:RESULT >0) THEN SELECT
                        0
                    INTO
                        :RESULT;
                ELSE
                    SELECT
                        1
                    INTO
                        :RESULT;
                END IF;
        SELECT
                    1
                INTO
                    :RESULT;
            END;
        END IF;
        RETURN :RESULT;
    END;
$$;
```

> **Note:**
>
> In a general code example (Like the on top) the conversion is done successfully. But there are some edge cases where the “IF” statement is not converted and the EWI will be generated.

#### Manual Support

##### Case 1: Single Statement

For these cases, the transformation would be straightforward, since the transformed statement would appear in a select clause

```sql
 IF @result = 0
BEGIN
    SET @result =1
END
```

```sql
 CASE WHEN (SELECT RESULT FROM CTE2)= 0 THEN
( SELECT 1 AS RESULT )
```

##### Case 2: Multiple Statements

For cases in which multiple statements are being transformed, we should transform the N Statement, and use it as the source table for the N+1 Statement.

```sql
 IF @result = 0
BEGIN
    Statement1
    Statement2
    Statement3
END
```

```sql
 CASE WHEN (SELECT RESULT FROM CTE2)= 0 THEN
(
    SELECT TransformedStatement3
    FROM (
        SELECT TransformedStatement2
        FROM (
            SELECT TransformedStatement1
        ) T1
    ) T2
)
```

##### Case 3: Multiple set statements

For these cases, it will be necessary to replicate a transformation for each set statement.

```sql
 IF @result = 0
BEGIN
    SET @var1 = 1
    SET @var2 = 3
    SET @var3 = @var2
END
```

```sql
 WITH CTE1 AS (
    SELECT
        CASE WHEN (SELECT
                        RESULT
                    FROM
                        CTE0) = 0 THEN
        (SELECT 1) AS VAR1)
WITH CTE2 AS (
    SELECT
        CASE WHEN (SELECT
                        RESULT
                    FROM
                        CTE0)= 0 THEN
        (SELECT 3) AS VAR2)
WITH CTE3 AS (
    SELECT
        CASE WHEN (SELECT
                        RESULT
                    FROM
                        CTE0)= 0 THEN
        (SELECT T1.VAR2
        FROM ((SELECT 3) AS VAR2) AS T1) AS VAR3)
...
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0055

Default constraint was commented out and may have been added to a table definition.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0020](../functional-difference/sqlServerFDM.md) documentation.

### Severity

Medium

#### Description

This EWI is added when the default constraint is present in an Alter Table statement.

Currently, there is no support for that constraint. A workaround available to transform it, is when the table is previously defined to the Alter Table, in this way we identify the references, and the default constraint is unified on the table definition; otherwise, the constraint is only commented out.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE table1(
  col1 integer,
  col2 varchar collate Latin1_General_CS,
  col3 date
);

ALTER TABLE table1
ADD col4 integer,
  CONSTRAINT col1_constraint DEFAULT 50 FOR col1,
  CONSTRAINT col1_constraint DEFAULT 30 FOR col1;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE table1 (
  col1 INTEGER DEFAULT 50,
  col2 VARCHAR COLLATE 'EN-CS',
  col3 DATE
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
;

ALTER TABLE table1
ADD col4 INTEGER,
  CONSTRAINT col1_constraint
                             !!!RESOLVE EWI!!! /*** SSC-EWI-TS0055 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION ***/!!!
                             DEFAULT 50 FOR col1,
  CONSTRAINT col1_constraint
                             !!!RESOLVE EWI!!! /*** SSC-EWI-TS0055 - DEFAULT CONSTRAINT MAY HAVE BEEN ADDED TO TABLE DEFINITION ***/!!!
                             DEFAULT 30 FOR col1;
```

> **Note:**
>
> If all the content of the Alter Table is invalid, the Alter Table will be commented out.

#### Known Issues

When different default constraints are declared over the same column, only the first will be reflected on the Create Table Statement.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0056

A MASKING POLICY was created as a substitute for MASKED WITH.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0021](../functional-difference/sqlServerFDM.md) documentation

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when the Alter Table statement contains a MASKED WITH clause. The reason this is added is to inform that an approximate MASKING POLICY was created as a substitute for the MASKED WITH function.

#### Code Example

##### Input Code:

```sql
 ALTER TABLE table_name
ALTER COLUMN column_name
ADD MASKED WITH (FUNCTION = 'default()');
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0057 - MASKING ROLE MUST BE DEFINED PREVIOUSLY BY THE USER ***/!!!
CREATE OR REPLACE MASKING POLICY "default" AS
(val STRING)
RETURNS STRING ->
CASE
WHEN current_role() IN ('YOUR_DEFINED_ROLE_HERE')
THEN val
ELSE 'xxxxx'
END;

ALTER TABLE IF EXISTS table_name MODIFY COLUMN column_name!!!RESOLVE EWI!!! /*** SSC-EWI-TS0056 - A MASKING POLICY WAS CREATED AS SUBSTITUTE FOR MASKED WITH ***/!!!  SET MASKING POLICY "default";
```

> **Note:**
>
> The MASKING POLICY will be created previous to the ALTER TABLE statement. And it is expected to have and approximate behaviour. Some tweaks might be needed in regards to roles and user privileges. <!– TODO: You can relate to Broken link broken-reference “mention” for further details.>

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0057

The user must previously define the masking role.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0022](../functional-difference/sqlServerFDM.md) documentation

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This is EWI occurs when a MASKING POLICY is created and a role or privilege must be linked to it so the data masking could work properly.

#### Code Example

##### Input code

```sql
 ALTER TABLE tableName
ALTER COLUMN columnName
ADD MASKED WITH (FUNCTION = 'partial(1, "xxxxx", 1)');
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0057 - MASKING ROLE MUST BE DEFINED PREVIOUSLY BY THE USER ***/!!!
CREATE OR REPLACE MASKING POLICY "partial_1_xxxxx_1" AS
(val STRING)
RETURNS STRING ->
CASE
WHEN current_role() IN ('YOUR_DEFINED_ROLE_HERE')
THEN val
ELSE LEFT(val, 1) || 'xxxxx' || RIGHT(val, 1)
END;

ALTER TABLE IF EXISTS tableName MODIFY COLUMN columnName!!!RESOLVE EWI!!! /*** SSC-EWI-TS0056 - A MASKING POLICY WAS CREATED AS SUBSTITUTE FOR MASKED WITH ***/!!!  SET MASKING POLICY "partial_1_xxxxx_1";
```

> **Note:**
>
> As shown on line 6, there is a placeholder where the defined roles can be placed. There is room for one or several values separated by commas. Also, here, the use of single quotes is mandatory for each of the values.

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0060

Datetime interval not supported by Snowflake.

### Severity

Medium

#### Description

This EWI is added when one of the following time parts is used as a parameter for a date-related function because they are not supported in Snowflake. For more information go to ‘supported date time parts ([Date & Time Functions | Snowflake Documentation](https://docs.snowflake.com/en/sql-reference/functions-date-time#label-supported-date-time-parts)).

#### Code Example

##### Input code

```sql
 SELECT
    -- Supported
    DATEPART(second, getdate()),
    -- Not supported
    DATEPART(millisecond, getdate()),
    DATEPART(microsecond, getdate());
```

##### Generated Code:

```sql
 SELECT
    -- Supported
    DATE_PART(second, CURRENT_TIMESTAMP() :: TIMESTAMP),
    -- Not supported
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0060 - TIME PART 'millisecond' NOT SUPPORTED AS A FUNCTION PARAMETER ***/!!!
    DATEPART(millisecond, CURRENT_TIMESTAMP() :: TIMESTAMP),
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0060 - TIME PART 'microsecond' NOT SUPPORTED AS A FUNCTION PARAMETER ***/!!!
    DATEPART(microsecond, CURRENT_TIMESTAMP() :: TIMESTAMP);
```

#### Best Practices

* An UDF could be created to manually extract unsupported time parts in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0061

ALTER COLUMN not supported

### Severity

Medium

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added whenever there is an unsupported ALTER COLUMN statement

#### Code Example

##### Input Code:

```sql
 ALTER TABLE SampleTable
ALTER COLUMN SampleColumn INT NULL SPARSE;
```

##### Generated Code:

```sql
 ALTER TABLE IF EXISTS SampleTable
ALTER COLUMN SampleColumn
                          !!!RESOLVE EWI!!! /*** SSC-EWI-TS0061 - ALTER COLUMN COMMENTED OUT BECAUSE SPARSE COLUMN IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
                          INT NULL SPARSE;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0063

Time zone not supported in Snowflake.

### Severity

Critical

#### Description

This EWI is added when there are Time zones that are not supported in Snowflake

#### Code Example

##### Input Code:

```sql
 SELECT current_timestamp at time zone 'Turks And Caicos Standard Time';
```

##### Generated Code:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0063 - TIME ZONE NOT SUPPORTED IN SNOWFLAKE ***/!!!
CURRENT_TIMESTAMP() at time zone 'Turks And Caicos Standard Time'
                                                                 ;
```

#### Best Practices

* A user defined function can be created to support multiple timezones.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0067

Invalid parameters in OPENXML table-valued function.

### Severity

Critical

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when there are invalid parameters in the OPENXML, specifically when the XML path cannot be accessed.

To avoid this EWI, please send the explicit node path through the parameters.

##### Input Code:

```sql
 SELECT
    *
FROM
    OPENXML (@idoc, @path, 1) WITH (
        CustomerID VARCHAR(10),
        ContactName VARCHAR(20)
    );
```

##### Generated Code:

```sql
 SELECT
    *
FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0067 - INVALID PARAMETERS IN OPENXML TABLE-VALUED FUNCTION ***/!!!
    OPENXML(@idoc, @path, 1);
```

##### Input code (Explicit parameter)

```sql
 SELECT
    *
FROM
    OPENXML (@idoc, '/ROOT/Customer', 1) WITH(
        CustomerID VARCHAR(10),
        ContactName VARCHAR(20)
    );
```

##### Generated Code (Explicit parameter)

```sql
 SELECT
    Left(value:Customer['@CustomerID'], '10') AS 'CustomerID',
    Left(value:Customer['@ContactName'], '20') AS 'ContactName'
FROM
    OPENXML_UDF($idoc, ':ROOT:Customer');
```

#### Best Practices

* Try to see if the path can be explicitly passed as a parameter.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0070

CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases.

> **Note:**
>
> This `EWI` is deprecated, please refer to [SSC-FDM-TS0024](../functional-difference/sqlServerFDM.md) documentation.

### Description

This EWI is added when the At Time Zone has the CURRENT_TIMESTAMP. This is because the result may have different results in some instances.

The main difference is that in SQL Server, CURRENT_TIMESTAMP returns the current system date and time in the server time zone and in Snowflake CURRENT_TIMESTAMP returns the current date and time in the UTC (Coordinated Universal Time) time zone.

#### Input Code:

##### Sql Server

```sql
 SELECT current_timestamp at time zone 'Hawaiian Standard Time';
```

##### Result

`2024-02-08 16:52:55.317 -10:00`

##### Generated Code:

##### Snowflake

```sql
 SELECT
CONVERT_TIMEZONE('Pacific/Honolulu', CURRENT_TIMESTAMP() !!!RESOLVE EWI!!! /*** SSC-EWI-TS0070 - CURRENT_TIMESTAMP in At Time Zone statement may have a different behavior in certain cases ***/!!!);
```

##### Result

`2024-02-08 06:53:46.994 -1000`

#### Best Practices

This is an example if you want to keep the same format in Snowflake.

##### SQL Server

```sql
 SELECT current_timestamp at time zone 'Hawaiian Standard Time';
```

##### Result

`2024-02-08 16:33:49.143 -10:00`

In Snowflake you can use [ALTER SESSION](https://docs.snowflake.com/en/sql-reference/sql/alter-session) to change the default time zone. For example:

##### Snowflake

```sql
 ALTER SESSION SET TIMEZONE = 'Pacific/Honolulu';

SELECT
CONVERT_TIMEZONE('Pacific/Honolulu', 'UTC', CURRENT_TIMESTAMP());
```

##### Result

`2024-02-08 16:33:49.143`

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0072

RETURN statement will be ignored due to previous RETURN statement

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added when SELECT statements and OUPUT parameters should be returned. In this case, the resultsets from the SELECT statements are prioritized.

##### Input Code:

```sql
 CREATE PROCEDURE SOMEPROC(@product_count INT OUTPUT,  @123 INT OUTPUT)
AS
BEGIN
		SELECT * from AdventureWorks.HumanResources.Department;
        SELECT * from AdventureWorks.HumanResources.Employee;
END
```

##### Generated Code:

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "AdventureWorks.HumanResources.Department", "AdventureWorks.HumanResources.Employee" **
CREATE OR REPLACE PROCEDURE SOMEPROC (PRODUCT_COUNT OUT INT, _123 OUT INT)
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		ProcedureResultSet1 VARCHAR;
		ProcedureResultSet2 VARCHAR;
		return_arr ARRAY := array_construct();
	BEGIN
		ProcedureResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet1) AS
			SELECT
				*
			from
				AdventureWorks.HumanResources.Department;
		return_arr := array_append(return_arr, :ProcedureResultSet1);
		ProcedureResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet2) AS
			SELECT
				*
			from
				AdventureWorks.HumanResources.Employee;
		return_arr := array_append(return_arr, :ProcedureResultSet2);
		--** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
		RETURN return_arr;
	END;
$$;
```

#### Best Practices

* Remove the RETURN statement that should be ignored.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWI

1. [SSC-FDM-0020](../functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables;

## SSC-EWI-TS0073

Error message could be different in snowflake

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TS0023](../functional-difference/sqlServerFDM.md) documentation

### Severity

Low

#### Description

This EWI is added in the transformation of ERROR_MESSAGE(). The exact message of the error could change in Snowflake.

##### Input Code:

```sql
 SET @varErrorMessage = ERROR_MESSAGE()
```

##### Generated Code

```sql
 BEGIN
VARERRORMESSAGE := SQLERRM !!!RESOLVE EWI!!! /*** SSC-EWI-TS0073 - ERROR MESSAGE COULD BE DIFFERENT IN SNOWFLAKE ***/!!!;
END;
```

#### Recommendation

If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-TS0074

Cast result may be different from TRY_CAST/TRY_CONVERT function due to missing dependencies

### Severity

Low

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI is added in the transformation of TRY_CAST and TRY_CONVERT functions. The exact result of these functions may change in Snowflake due to missing dependencies (SnowConvert AI couldn’t resolve some data types). This could be because the dependency was not in the source code.

##### Input Code:

```sql
 SELECT TRY_CONVERT( INT, col1) FROM TABLE1;

SELECT TRY_CAST(COL1 AS FLOAT) FROM TABLE1
```

##### Generated Code

```sql
 SELECT
CAST(col1 AS INT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/!!!RESOLVE EWI!!! /*** SSC-EWI-TS0074 - CAST RESULT MAY BE DIFFERENT FROM TRY_CONVERT FUNCTION DUE TO MISSING DEPENDENCIES ***/!!!
FROM
TABLE1;

SELECT
CAST(COL1 AS FLOAT) /*** SSC-FDM-TS0005 - TRY_CONVERT/TRY_CAST COULD NOT BE CONVERTED TO TRY_CAST ***/!!!RESOLVE EWI!!! /*** SSC-EWI-TS0074 - CAST RESULT MAY BE DIFFERENT FROM TRY_CAST FUNCTION DUE TO MISSING DEPENDENCIES ***/!!!
FROM
TABLE1;
```

#### Recommendation

If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## SSC-EWI-TS0075

Built In Procedure Not Supported

### Severity

Medium

#### Description

Translation for built-in procedures is not currently supported.

#### Example Code

##### Input Code:

```sql
 EXEC sp_column_privileges_rowset_rmt 'Caption';
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0075 - TRANSLATION FOR BUILT-IN PROCEDURE 'sp_column_privileges_rowset_rmt' IS NOT CURRENTLY SUPPORTED. ***/!!!
EXEC sp_column_privileges_rowset_rmt 'Caption';
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0076

Default Parameters May Need To Be Reordered

> **Note:**
>
> This EWI is deprecated. SnowConvert AI now automatically reorders default parameters to the end of the parameter list. Please refer to [SSC-FDM-0041](../functional-difference/generalFDM.md) for the updated behavior.

### Severity

Medium

#### Description

Default parameters may need to be reordered. Snowflake only supports default parameters at the end of the parameters declarations.

#### Example Code

##### Input Code:

```sql
 CREATE PROCEDURE MySampleProc
    @Param1 NVARCHAR(50) = NULL,
    @Param2 NVARCHAR(10),
    @Param3 NVARCHAR(10) = NULL,
    @Param4 NVARCHAR(10)
AS
    SELECT 1;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0076 - DEFAULT PARAMETERS MAY NEED TO BE REORDERED. SNOWFLAKE ONLY SUPPORTS DEFAULT PARAMETERS AT THE END OF THE PARAMETERS DECLARATIONS. ***/!!!
CREATE OR REPLACE PROCEDURE MySampleProc (PARAM1 STRING DEFAULT NULL, PARAM2 STRING, PARAM3 STRING DEFAULT NULL, PARAM4 STRING)
RETURNS TABLE()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"transact"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ProcedureResultSet RESULTSET;
    BEGIN
        ProcedureResultSet := (
        SELECT 1);
        RETURN TABLE(ProcedureResultSet);
    END;
$$;
```

#### Best Practices

* No end-user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0077

Collation Not Supported

### Severity

Low

#### Description

This message is shown when there is a collate clause that is not supported in Snowflake.

#### Code example

##### Input Code:

```sql
 SELECT 'a' COLLATE Albanian_BIN;

SELECT 'a' COLLATE Albanian_CI_AI;

CREATE TABLE ExampleTable (
    ID INT,
    Name VARCHAR(50) COLLATE collateName
);
```

##### Generated Code:

```sql
 SELECT 'a'
--           !!!RESOLVE EWI!!! /*** SSC-EWI-TS0077 - COLLATION Albanian_BIN NOT SUPPORTED ***/!!!
-- COLLATE Albanian_BIN
                     ;

SELECT 'a'
--           !!!RESOLVE EWI!!! /*** SSC-EWI-TS0077 - COLLATION Albanian_CI_AI NOT SUPPORTED ***/!!!
-- COLLATE Albanian_CI_AI
                       ;

CREATE OR REPLACE TABLE ExampleTable (
    ID INT,
    Name VARCHAR(50)
--                     !!!RESOLVE EWI!!! /*** SSC-EWI-TS0077 - COLLATION collateName NOT SUPPORTED ***/!!!
-- COLLATE collateName
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"transact"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0078

Default value not allowed in Snowflake.

### Severity

Medium

#### Description

This error is added to the code when expressions like function calls, variable names, or named constants follow the default option.

Snowflake only supports explicit constants like numbers or strings.

#### Code Example

##### Input Code:

```sql
 ALTER TABLE
    T_ALTERTABLETEST
ADD
    COLUMN COL10 INTEGER DEFAULT RANDOM(10);
```

##### Generated Code:

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "T_ALTERTABLETEST", "RANDOM" **
ALTER TABLE IF EXISTS T_ALTERTABLETEST
ADD
    COLUMN COL10 INTEGER
                         !!!RESOLVE EWI!!! /*** SSC-EWI-TS0078 - DEFAULT OPTION NOT ALLOWED IN SNOWFLAKE ***/!!!
                         DEFAULT RANDOM(10);
```

#####

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0079

Database console command is not supported

### Severity

Medium

#### Description

This EWI is added when SnowConvert AI finds a DBCC statement inside the input code.
Most DBCC statements are not supported in Snowflake.

#### Code Example

##### Input Code:

```sql
 DBCC CHECKIDENT(@a, RESEED, @b) WITH NO_INFOMSGS
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TS0079 - DATABASE CONSOLE COMMAND 'CHECKIDENT' IS NOT SUPPORTED. ***/!!!
DBCC CHECKIDENT(@a, RESEED, @b) WITH NO_INFOMSGS;
```

#### Best Practices

* No additional user actions are required; it is just informative.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0080

Changing the execution context at runtime is not supported in Snowflake

### Severity

High

#### Description

Users in SQL Server can use the command `EXECUTE AS` to temporarily change the execution context, this modifies the execution privileges and affects the results of context-dependent functions like `USER_NAME()`. The `REVERT` command can be used to restore the context previous to the last `EXECUTE AS`.

Snowflake only supports the definition of an execution context in procedures, using either the `CREATE PROCEDURE` or `ALTER PROCEDURE` statements. Changing the context at runtime is not supported.

#### Code Example

Input Code:

```sql
 CREATE PROCEDURE proc1()
WITH EXECUTE AS OWNER
AS
BEGIN
	SELECT USER_NAME();
	EXECUTE AS CALLER;
	SELECT USER_NAME();
	REVERT;
	SELECT USER_NAME();
END

GO
```

Output Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "07/05/2024" }}'
EXECUTE AS OWNER
AS
$$
	DECLARE
		ProcedureResultSet1 VARCHAR;
		ProcedureResultSet2 VARCHAR;
		ProcedureResultSet3 VARCHAR;
		return_arr ARRAY := array_construct();
	BEGIN
		ProcedureResultSet1 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet1) AS
			SELECT
				CURRENT_USER();
		return_arr := array_append(return_arr, :ProcedureResultSet1);
		!!!RESOLVE EWI!!! /*** SSC-EWI-TS0080 - CHANGING THE EXECUTION CONTEXT AT RUNTIME IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
	EXECUTE AS CALLER;
		ProcedureResultSet2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet2) AS
			SELECT
				CURRENT_USER();
		return_arr := array_append(return_arr, :ProcedureResultSet2);
		!!!RESOLVE EWI!!! /*** SSC-EWI-TS0080 - CHANGING THE EXECUTION CONTEXT AT RUNTIME IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
	REVERT;
		ProcedureResultSet3 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:ProcedureResultSet3) AS
			SELECT
				CURRENT_USER();
		return_arr := array_append(return_arr, :ProcedureResultSet3);
		--** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
		RETURN return_arr;
	END;
$$;
```

#### Best Practices

* Refactor the code so it works without having to switch the context.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0081

Using a full join in a delete statement is not supported

### Description

When transforming the DELETE statement, SnowConvert AI extracts the table references found in the FROM clause of the statement and moves them to the USING clause of the Snowflake delete statement.

The following EWI warns the user about the limitations of the outer join (+) syntax in Snowflake. To preserve the LEFT and RIGHT JOINs used in the original code, outer join syntax (+) is added to the conditions to indicate such behavior. However, in Snowflake, the (+) syntax can’t be used to indicate FULL JOINs. For more information, see [Joins in the WHERE clause](https://docs.snowflake.com/en/sql-reference/constructs/where#joins-in-the-where-clause).

#### Example code

##### Input Code :

```sql
DELETE Employees
FROM Employees FULL OUTER JOIN Departments
ON Employees.DepartmentID = Departments.DepartmentID
WHERE Departments.DepartmentID IS NULL;
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0081 - USING A FULL JOIN IN A DELETE STATEMENT IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
DELETE FROM
Employees
USING Departments
WHERE
Departments.DepartmentID IS NULL
AND Employees.DepartmentID = Departments.DepartmentID;
```

#### Best Practices

* Check the logic of your FULL JOIN, it might be possible to rewrite it as other type of JOIN. For example, the code included in the example code is essentially the same as a LEFT JOIN:

Input:

```sql
DELETE Employees
FROM Employees LEFT OUTER JOIN Departments
ON Employees.DepartmentID = Departments.DepartmentID
WHERE Departments.DepartmentID IS NULL;
```

Output:

```sql
 DELETE FROM
    Employees
USING Departments
WHERE
    Departments.DepartmentID IS NULL
    AND Employees.DepartmentID = Departments.DepartmentID(+);
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0082

CROSS APPLY has been converted to LEFT OUTER JOIN and requires manual validation.

### Description

Manual validation is required because the conversion from CROSS APPLY to LEFT OUTER JOIN can lead to incorrect results or unexpected behavior in Snowflake. While the two functions might seem similar, they handle certain situations differently, especially when the subquery has no matches or the subquery is correlated with the outer table.

#### Example code

##### Setup Data

```sql
-- Create a table to store monthly sales or metric data
CREATE TABLE sales_metrics (
    metric_id INT PRIMARY KEY,
    january_value VARCHAR(35),
    february_value VARCHAR(35),
    march_value VARCHAR(35)
);

-- Insert sample data
INSERT INTO sales_metrics (metric_id, january_value, february_value, march_value) VALUES
(1, 'sales-jan-1', 'sales-feb-1', 'sales-march-1'),
(2, 'sales-jan-2', 'sales-feb-2', 'sales-march-2');
```

##### Input Code :

```sql
SELECT
    m.metric_id,
    monthly_data.metric_value,
    monthly_data.month_number
FROM
    sales_metrics m
CROSS APPLY (
    SELECT m.january_value AS metric_value, '01' AS month_number
    UNION ALL
    SELECT m.february_value AS metric_value, '02' AS month_number
    UNION ALL
    SELECT m.march_value AS metric_value, '03' AS month_number
) AS monthly_data;
```

##### Generated Code:

```sql
SELECT
    m.metric_id,
    monthly_data.metric_value,
    monthly_data.month_number
FROM
    sales_metrics m
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0082 - CROSS APPLY HAS BEEN CONVERTED TO LEFT OUTER JOIN AND REQUIRES MANUAL VALIDATION. ***/!!!
    LEFT OUTER JOIN
        (
               SELECT
                m.january_value AS metric_value, '01' AS month_number
               UNION ALL
               SELECT
                m.february_value AS metric_value, '02' AS month_number
               UNION ALL
               SELECT
                m.march_value AS metric_value, '03' AS month_number
           ) AS monthly_data;
```

#### Best Practices

### Key Scenarios Where LEFT OUTER JOIN May Fail

* **Filtering Behavior:** If the original `CROSS APPLY` was intended to filter out rows from the main table that have no matches in the subquery, a `LEFT OUTER JOIN` will not replicate this behavior. Instead, it will include those rows with `NULL` values for the joined columns, which may not be the intended result.
* **Correlated Subqueries:** `CROSS APPLY` is specifically designed to support correlated subqueries, where the subquery references columns from the outer query. A standard `LEFT OUTER JOIN` does not support this pattern in the same way. Attempting to convert a correlated `CROSS APPLY` to a `LEFT OUTER JOIN` can lead to syntax errors, Cartesian products (duplicate rows), or logically incorrect results.
* **Result Set Differences:** The semantics of `CROSS APPLY` and `LEFT OUTER JOIN` differ, especially when the subquery returns no rows. `CROSS APPLY` will exclude such rows from the result, while `LEFT OUTER JOIN` will include them with `NULL` values.

**Recommendation:** Always review and test the output of queries where `CROSS APPLY` has been converted to `LEFT OUTER JOIN` to ensure correctness.

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0083

### Error Message

**ROLLBACK TRANSACTION requires the appropriate setup to work as intended.**

### Severity

**Low**

### Description

This EWI is generated when a `ROLLBACK TRANSACTION` statement is encountered, indicating that SnowConvert has successfully transformed the statement into a Snowflake-compatible format. However, the transformation requires manual verification because Snowflake’s transaction rollback behavior differs significantly from SQL Server’s `ROLLBACK TRANSACTION` functionality.

### Example Code

#### Input (SQL Server):

```sql
BEGIN TRANSACTION MyTransaction;

    -- Some operations
    INSERT INTO Employees (Name, Department) VALUES ('Alice', 'Engineering');

    IF @@ERROR <> 0
    BEGIN
        ROLLBACK TRANSACTION MyTransaction;  -- Named transaction rollback
    END
    ELSE
    BEGIN
        COMMIT TRANSACTION MyTransaction;
    END
```

#### Output (Snowflake Scripting):

```sql
BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BeginTransaction' NODE ***/!!!
    BEGIN TRANSACTION MyTransaction;

        -- Some operations
    --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "Employees" **
        INSERT INTO Employees (Name, Department) VALUES ('Alice', 'Engineering');
    IF (:ERROR <> 0) THEN
        BEGIN
            !!!RESOLVE EWI!!! /*** SSC-EWI-TS0083 - ROLLBACK TRANSACTION REQUIRES THE APPROPRIATE SETUP TO WORK AS INTENDED. ***/!!!
            ROLLBACK TRANSACTION MyTransaction;  -- Named transaction rollback

        END;
    ELSE
        BEGIN
            COMMIT;
        END;
    END IF;
END;
```

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0085

INSERT WITH EXECUTE statement requires manual review.

### Severity

Medium

#### Description

This issue is generated when SnowConvert AI encounters an `INSERT ... EXECUTE` statement that cannot be automatically transformed. In SQL Server, `INSERT ... EXEC` inserts the result set of a stored procedure or dynamic SQL into a table. Snowflake does not support this syntax directly. When the statement appears at the top level (outside a stored procedure), SnowConvert AI cannot apply its standard transformation pattern and flags the statement for manual review.

#### Code Example

##### Input Code:

```sql
INSERT INTO SalesReport
EXEC GenerateQuarterlySales @Quarter = 1;
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0085 - INSERT WITH EXECUTE NODE NEEDS TO BE CHECKED. ***/!!!
INSERT INTO SalesReport EXEC GenerateQuarterlySales @Quarter = 1;
```

#### Best Practices

* Rewrite the logic using Snowflake Scripting: call the procedure separately, capture its result with `RESULT_SCAN(LAST_QUERY_ID())`, and then `INSERT INTO ... SELECT` from the result set.
* If the procedure returns a fixed schema, consider using a temporary table or `TABLE()` function to capture the output.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0086

OPENQUERY is not supported in Snowflake.

### Severity

High

#### Description

This issue is generated when SnowConvert AI encounters an `OPENQUERY` function. In SQL Server, `OPENQUERY` executes a pass-through query on a linked server and returns the result as a table. Snowflake does not have an equivalent linked server or `OPENQUERY` mechanism. The statement is preserved as-is with an EWI marker for manual migration.

#### Code Example

##### Input Code:

```sql
SELECT *
FROM OPENQUERY(OracleFinance, 'SELECT account_id, balance FROM accounts WHERE status = ''ACTIVE''');
```

##### Generated Code:

```sql
SELECT
    *
FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0086 - OPENQUERY NODE NEEDS TO BE CHECKED. ***/!!! OPENQUERY (OracleFinance, 'SELECT account_id, balance FROM accounts WHERE status = ''ACTIVE''');
```

#### Best Practices

* Replace `OPENQUERY` with Snowflake external tables, external stages, or data sharing to access data from external sources.
* If the linked server points to another database, consider migrating that data into Snowflake or using Snowflake’s connector ecosystem (e.g., Snowflake Connector for Oracle).
* For real-time access patterns, evaluate Snowflake External Network Access or External Functions.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0087

GOTO is not supported in Snowflake.

### Severity

High

#### Description

This issue is generated when SnowConvert AI encounters a `GOTO` statement that cannot be automatically transformed.

When GOTO/Label patterns appear inside a stored procedure with only forward jumps to top-level labels, SnowConvert AI automatically transforms them into nested procedure definitions with `CALL`/`RETURN` semantics — no EWI is emitted in those cases. See the [LABEL and GOTO translation reference](../../../../translation-references/transact/transact-create-procedure-snow-script.md) for details on the transformation.

This EWI is only emitted when the `GOTO` **cannot** be transformed. This happens with backward GOTOs (where the target label appears before the `GOTO` in the source, which would require recursive calls), or when the `GOTO` appears inside anonymous blocks or UDFs (which do not support nested procedure definitions in Snowflake).

#### Code Example

The following example shows a backward GOTO used for retry logic. Because `RetryConnection` appears before the `GOTO` that jumps to it, the transformation cannot be applied and the EWI is emitted:

##### Input Code:

```sql
CREATE PROCEDURE dbo.RetryDatabaseConnection
AS
BEGIN
    DECLARE @Attempts INT = 0
RetryConnection:
    SET @Attempts = @Attempts + 1
    IF @Attempts < 3
        GOTO RetryConnection
    RETURN 0
END
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE dbo.RetryDatabaseConnection ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ATTEMPTS INT := 0;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0045 - LABELED STATEMENT IS NOT SUPPORTED IN SNOWFLAKE SCRIPTING ***/!!!
    RetryConnection:
    ATTEMPTS := :ATTEMPTS + 1;
    IF (:ATTEMPTS < 3) THEN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TS0087 - GOTO IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
      GOTO RetryConnection
    END IF;
    RETURN 0;
  END;
$$;
```

#### Best Practices

* For backward GOTO patterns like retry logic, refactor the control flow to use `WHILE` or `LOOP` constructs instead.
* For GOTO in anonymous blocks or UDFs, restructure the code into separate procedures or use `IF/ELSE` control flow.
* Forward GOTO patterns inside stored procedures are automatically transformed — no manual action is required for those cases.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

* SSC-EWI-TS0045: Labeled statement is not supported in Snowflake Scripting.
* SSC-EWI-TS0103: GOTO targeting a label inside a nested block is not supported in Snowflake.

## SSC-EWI-TS0088

Unsupported sequence options were removed during conversion.

### Severity

Low

#### Description

This issue is generated when SnowConvert AI encounters a `CREATE SEQUENCE` statement with options that are not supported in Snowflake, such as `MINVALUE`, `MAXVALUE`, or `CYCLE`. These options are removed during conversion because Snowflake sequences only support `START WITH` and `INCREMENT BY`. The EWI message lists the specific options that were removed.

#### Code Example

##### Input Code:

```sql
CREATE SEQUENCE InvoiceNumberSeq
START WITH 1000
INCREMENT BY 5
MINVALUE 100
MAXVALUE 50000
CYCLE;
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0088 - SEQUENCE OPTIONS 'MIN VALUE, MAX VALUE, CYCLE' WERE REMOVED, THEY ARE NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE SEQUENCE InvoiceNumberSeq
  START WITH 1000
  INCREMENT BY 5
;
```

#### Best Practices

* If your application relies on `CYCLE` behavior, implement a wrapper UDF that resets the sequence value when it exceeds a threshold.
* If `MINVALUE` or `MAXVALUE` bounds are critical, add application-level validation or a Snowflake task to monitor sequence values.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0089

SET statement is not supported in Snowflake.

### Severity

Low

#### Description

This issue is generated when SnowConvert AI encounters a `SET` statement that changes a session option not supported in Snowflake and whose non-default value cannot be replicated. For example, `SET CONCAT_NULL_YIELDS_NULL OFF` changes SQL Server’s NULL concatenation behavior, but Snowflake always treats `NULL || value` as `NULL` (equivalent to `CONCAT_NULL_YIELDS_NULL ON`). Similarly, `SET NUMERIC_ROUNDABORT ON` raises errors on precision loss, which Snowflake does not support. The original statement is preserved with an EWI marker.

#### Code Example

##### Input Code:

```sql
SET CONCAT_NULL_YIELDS_NULL OFF;
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0089 - SET CONCAT_NULL_YIELDS_NULL OFF IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
SET CONCAT_NULL_YIELDS_NULL OFF;
```

#### Best Practices

* Review the downstream code that depends on this `SET` option. For `CONCAT_NULL_YIELDS_NULL OFF`, replace `NULL` concatenation patterns with explicit `NVL()` or `COALESCE()` calls to handle NULL values.
* For `NUMERIC_ROUNDABORT ON`, add explicit rounding or precision checks in the application logic.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

* [SSC-FDM-TS0037](../functional-difference/sqlServerFDM.md): SET statement with equivalent default behavior in Snowflake (e.g., `SET CONCAT_NULL_YIELDS_NULL ON`).

## SSC-EWI-TS0090

Agent Job step uses an unsupported subsystem and requires manual migration.

### Severity

Medium

#### Description

This issue is generated when SnowConvert AI encounters a SQL Server Agent Job step that uses a subsystem other than `TSQL` or `SSIS` (e.g., `CmdExec`, `PowerShell`, `ANALYSISCOMMAND`). These subsystems execute operating system commands or external tools that have no direct equivalent in Snowflake. The original `sp_add_jobstep` call is preserved with an EWI marker, and the step is not included in the generated Snowflake Task orchestration.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_add_jobstep
    @job_name = N'NightlyArchive',
    @step_name = N'ArchiveOldRecords',
    @step_id = 1,
    @subsystem = N'CmdExec',
    @command = N'powershell.exe -File "C:\Scripts\archive_records.ps1"';
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0090 - AGENT JOB STEP 'ArchiveOldRecords' USES UNSUPPORTED SUBSYSTEM 'CmdExec'. MANUAL MIGRATION REQUIRED. ***/!!!
EXEC msdb.dbo.sp_add_jobstep @job_name = N'NightlyArchive', @step_name = N'ArchiveOldRecords', @step_id = 1, @subsystem = N'CmdExec', @command = N'powershell.exe -File "C:\Scripts\archive_records.ps1"';
```

#### Best Practices

* For `CmdExec` or `PowerShell` steps, evaluate whether the logic can be rewritten as a Snowflake stored procedure, external function, or Snowflake task with a SQL body.
* For SSIS steps, use SnowConvert AI’s built-in ETL-to-dbt migration, which automatically generates orchestrator stored procedures.
* Consider using Snowflake External Functions or Snowpark for operations that require external compute.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0091

Agent Job notification procedure requires manual setup of a Snowflake notification integration.

### Severity

Medium

#### Description

This issue is generated when SnowConvert AI encounters a SQL Server Agent notification procedure such as `sp_send_dbmail`, `sp_notify_operator`, or similar email/alert procedures within an Agent Job context. These procedures rely on SQL Server’s Database Mail or Operator subsystem, which has no direct equivalent in Snowflake. A Snowflake notification integration must be manually configured to replicate this functionality.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_send_dbmail
    @profile_name = N'DBA_Alerts',
    @recipients = N'admin@example.com',
    @subject = N'ETL job completed';
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0091 - AGENT JOB NOTIFICATION PROCEDURE 'sp_send_dbmail' REQUIRES MANUAL SETUP OF A SNOWFLAKE NOTIFICATION INTEGRATION. ***/!!!
EXEC msdb.dbo.sp_send_dbmail @profile_name = N'DBA_Alerts', @recipients = N'admin@example.com', @subject = N'ETL job completed';
```

#### Best Practices

* Set up a [Snowflake Notification Integration](https://docs.snowflake.com/en/sql-reference/sql/create-notification-integration) with an email provider or cloud messaging service (AWS SNS, Azure Event Grid, GCP Pub/Sub).
* Use `SYSTEM$SEND_EMAIL()` in Snowflake to send email notifications from stored procedures and tasks.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0092

Agent Job procedure references a dynamic job name that cannot be resolved statically.

### Severity

Medium

#### Description

This issue is generated when SnowConvert AI encounters an Agent Job management procedure (`sp_start_job`, `sp_stop_job`, `sp_delete_job`, `sp_update_job`) where the `@job_name` parameter is a variable rather than a string literal. Because the job name cannot be resolved at conversion time, SnowConvert AI cannot determine which Snowflake Task to reference. The original statement is preserved with an EWI marker for manual resolution.

#### Code Example

##### Input Code:

```sql
DECLARE @jobName NVARCHAR(128);
SET @jobName = N'ETL_Daily_Load';
EXEC msdb.dbo.sp_start_job @job_name = @jobName;
```

##### Generated Code:

```sql
DECLARE
  JOBNAME NVARCHAR(128);
BEGIN
  JOBNAME := 'ETL_Daily_Load';
  !!!RESOLVE EWI!!! /*** SSC-EWI-TS0092 - AGENT JOB PROCEDURE 'sp_start_job' REFERENCES A DYNAMIC JOB NAME THAT CANNOT BE RESOLVED STATICALLY. ***/!!!
  EXEC msdb.dbo.sp_start_job @job_name = @jobName;
END;
```

#### Best Practices

* If the job name is known at design time, replace the variable with a string literal so SnowConvert AI can resolve it to the corresponding Snowflake Task (e.g., `EXECUTE TASK TASK_ETL_DAILY_LOAD`).
* If the job name is truly dynamic, use `EXECUTE IMMEDIATE` to build the `EXECUTE TASK` or `ALTER TASK` statement dynamically in Snowflake Scripting.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0093

Agent Job procedure is not supported.

### Severity

Low

#### Description

This issue is generated when SnowConvert AI encounters a SQL Server Agent Job system procedure that does not have a supported translation to Snowflake. This includes procedures like `sp_update_jobstep`, `sp_add_jobserver`, and `sp_update_job` (when used without the `@enabled` parameter). These procedures manage Agent Job metadata that has no equivalent in Snowflake’s Task framework. The original statement is preserved with an EWI marker.

#### Code Example

##### Input Code:

```sql
EXEC msdb.dbo.sp_update_jobstep
    @job_name = N'ETL_Nightly_Load',
    @step_id = 1,
    @step_name = N'UpdatedStepName';
```

##### Generated Code:

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0093 - AGENT JOB PROCEDURE 'sp_update_jobstep' IS NOT SUPPORTED. ***/!!!
EXEC msdb.dbo.sp_update_jobstep @job_name = N'ETL_Nightly_Load', @step_id = 1, @step_name = N'UpdatedStepName';
```

#### Best Practices

* Review whether the procedure’s functionality is still needed in the Snowflake environment. Many Agent Job metadata operations (renaming steps, assigning servers) are not applicable in Snowflake’s Task model.
* If the procedure modifies job scheduling or enablement, use `ALTER TASK` in Snowflake instead.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0094

### Error Message

**WAITFOR DELAY variable may contain a time string incompatible with SYSTEM$WAIT.**

### Severity

**Medium**

### Description

This EWI is generated when a `WAITFOR DELAY` statement uses a variable or parameter expression instead of a literal time value. The statement is transformed to `CALL SYSTEM$WAIT()`, which expects a numeric value representing seconds (or milliseconds). However, the variable may hold a time string in `'HH:MM:SS'` format, which is incompatible with `SYSTEM$WAIT`.

For more information about `SYSTEM$WAIT`, see the [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/functions/system_wait).

#### Code Example

##### Input Code:

```sql
 CREATE PROCEDURE proc1(@WaitTime INT)
AS
BEGIN
  WAITFOR DELAY @WaitTime;
END
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE proc1 (WAITTIME INT)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0094 - WAITFOR DELAY WITH VARIABLE ':WAITTIME' WAS CONVERTED TO SYSTEM$WAIT, BUT THE VARIABLE MAY CONTAIN A TIME STRING IN 'HH:MM:SS' FORMAT. SYSTEM$WAIT EXPECTS A NUMERIC VALUE IN SECONDS. ***/!!!
    CALL SYSTEM$WAIT(:WAITTIME);
  END;
$$;
```

#### Best Practices

* Ensure the variable passed to `SYSTEM$WAIT` contains a numeric value in seconds, not a time string in `'HH:MM:SS'` format.
* If the variable holds a time string, convert it to seconds before passing it to `SYSTEM$WAIT`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0095

SCOPE_IDENTITY() called without a preceding INSERT to an identity table in the same scope.

### Severity

**High**

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

SnowConvert AI was unable to determine the target table for SCOPE_IDENTITY(). No preceding INSERT to an identity table found in the same scope.

This EWI is generated when `SCOPE_IDENTITY()` is called without a preceding `INSERT` statement to a table with an `IDENTITY` column in the same procedural scope. In SQL Server, `SCOPE_IDENTITY()` returns the last identity value inserted in the current scope, but without a detectable `INSERT`, SnowConvert AI cannot generate the appropriate time-travel query.

The function call is kept as-is with this EWI, requiring manual review to determine the correct implementation.

#### Code Example

##### Input Code (SQL Server):

```sql
 CREATE PROCEDURE GetLastId
AS
BEGIN
    DECLARE @LastID INT = SCOPE_IDENTITY();
    SELECT @LastID;
END;
```

##### Generated Code (Snowflake):

```sql
 CREATE OR REPLACE PROCEDURE GetLastId ()
RETURNS TABLE()
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    LASTID INT := SCOPE_IDENTITY() !!!RESOLVE EWI!!! /*** SSC-EWI-TS0095 - SNOWCONVERT AI WAS UNABLE TO DETERMINE THE TARGET TABLE FOR SCOPE_IDENTITY(). NO PRECEDING INSERT TO AN IDENTITY TABLE FOUND IN THE SAME SCOPE. ***/!!!;
    ProcedureResultSet RESULTSET;
  BEGIN
    ProcedureResultSet := (SELECT
      :LASTID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

#### Best Practices

* Review the stored procedure logic to identify where the INSERT statement occurs
* If the INSERT is in a different scope (e.g., nested block), refactor the code to make the INSERT detectable
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0096

SCOPE_IDENTITY() references a table that cannot be resolved in the symbol table.

### Severity

**High**

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

SnowConvert AI was unable to resolve the target table for SCOPE_IDENTITY(). Missing table definition.

This EWI is generated when `SCOPE_IDENTITY()` follows an INSERT statement, but the target table cannot be resolved in the symbol table. This may occur when:

* The table is defined in an external file not included in the conversion
* The table name uses dynamic SQL or is otherwise unresolvable
* The table definition is missing or incomplete

Without a resolvable table reference, SnowConvert AI cannot determine which identity column to query in the generated time-travel query.

#### Code Example

##### Input Code (SQL Server):

```sql
 CREATE PROCEDURE InsertOrder @CustomerID INT
AS
BEGIN
    DECLARE @OrderID INT;
    INSERT INTO UnknownTable (CustomerID) VALUES (@CustomerID);
    SET @OrderID = SCOPE_IDENTITY();
    SELECT @OrderID;
END;
```

##### Generated Code (Snowflake):

```sql
 CREATE OR REPLACE PROCEDURE InsertOrder (CUSTOMERID INT)
RETURNS TABLE()
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ORDERID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    INSERT INTO UnknownTable (CustomerID) VALUES (:CUSTOMERID);
    ORDERID := SCOPE_IDENTITY() !!!RESOLVE EWI!!! /*** SSC-EWI-TS0096 - SNOWCONVERT AI WAS UNABLE TO RESOLVE THE TARGET TABLE FOR SCOPE_IDENTITY(). MISSING TABLE DEFINITION. ***/!!!;
    ProcedureResultSet := (SELECT
      :ORDERID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

#### Best Practices

* Ensure all table definitions are included in the conversion input
* Verify that the table name in the INSERT statement matches the table definition
* If the table is external, provide the schema definition or manually implement the identity retrieval logic
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0097

SCOPE_IDENTITY() references a table without an identifiable identity column.

### Severity

**High**

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

SnowConvert AI was unable to identify the identity column for SCOPE_IDENTITY(). Missing column definition.

This EWI is generated when `SCOPE_IDENTITY()` follows an INSERT statement to a table that exists in the symbol table but does not have an IDENTITY column defined. In SQL Server, `SCOPE_IDENTITY()` only returns values for tables with identity columns. Without an identifiable identity column, SnowConvert AI cannot generate the appropriate `MAX(identity_column)` query for the time-travel transformation.

#### Code Example

##### Input Code (SQL Server):

```sql
 CREATE TABLE Orders (OrderID INT, CustomerID INT);
GO

CREATE PROCEDURE InsertOrder @CustomerID INT
AS
BEGIN
    DECLARE @OrderID INT;
    INSERT INTO Orders (CustomerID) VALUES (@CustomerID);
    SET @OrderID = SCOPE_IDENTITY();
    SELECT @OrderID;
END;
```

##### Generated Code (Snowflake):

```sql
 CREATE OR REPLACE TABLE Orders (
  OrderID INT,
  CustomerID INT
)
;

CREATE OR REPLACE PROCEDURE InsertOrder (CUSTOMERID INT)
RETURNS TABLE()
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    ORDERID INT;
    ProcedureResultSet RESULTSET;
  BEGIN

    INSERT INTO Orders (CustomerID) VALUES (:CUSTOMERID);
    ORDERID := SCOPE_IDENTITY() !!!RESOLVE EWI!!! /*** SSC-EWI-TS0097 - SNOWCONVERT AI WAS UNABLE TO IDENTIFY THE IDENTITY COLUMN FOR SCOPE_IDENTITY(). MISSING COLUMN DEFINITION. ***/!!!;
    ProcedureResultSet := (SELECT
      :ORDERID);
    RETURN TABLE(ProcedureResultSet);
  END;
$$;
```

#### Best Practices

* Verify that the table definition includes an IDENTITY column specification
* If the table should have an identity column, add the IDENTITY constraint to the CREATE TABLE statement before conversion
* If the table should not use SCOPE_IDENTITY(), refactor the code to use a different method for retrieving the inserted ID
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0098

CONVERT with a non-literal style cannot be mapped to a Snowflake format string.

### Severity

**Medium**

### Description

This EWI is generated when the third argument of `CONVERT` is a variable or expression instead of a literal style code. SnowConvert AI can map literal style values to Snowflake format strings for `TO_DATE` and `TO_TIMESTAMP`, but when the style is dynamic it cannot determine the correct format at conversion time. In those cases SnowConvert AI falls back to `CAST`.

#### Code Example

##### Input Code (SQL Server):

```sql
SELECT CONVERT(DATE, @InputDate, @Style);
```

##### Generated Code (Snowflake SQL):

```sql
SELECT
  !!!RESOLVE EWI!!! /*** SSC-EWI-TS0098 - CONVERT WITH A VARIABLE OR EXPRESSION AS THE STYLE ARGUMENT CANNOT BE AUTOMATICALLY MAPPED TO A SNOWFLAKE FORMAT STRING. REPLACE WITH THE APPROPRIATE TO_DATE/TO_TIMESTAMP CALL WITH THE KNOWN FORMAT STRING. ***/!!!
  CAST(@InputDate AS DATE);
```

#### Best Practices

* Replace the dynamic style argument with a known literal format whenever possible.
* If the style varies at runtime, rewrite the expression manually using the correct `TO_DATE`, `TO_TIMESTAMP`, or conditional logic in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0099

The OBJECT_SCHEMA_NAME function is not supported in Snowflake.

> **Note:**
>
> This EWI is deprecated. `OBJECT_SCHEMA_NAME` is now converted to a helper UDF `OBJECT_SCHEMA_NAME_UDF`. When the two-argument form is used, refer to [SSC-FDM-TS0060](../functional-difference/sqlServerFDM.md) for information about the removed `database_id` parameter.

### Severity

**Low**

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Description

This EWI was generated when SnowConvert AI encountered the SQL Server `OBJECT_SCHEMA_NAME(object_id [, database_id])` function, which returns the schema name for a schema-scoped object given its numeric object ID. Snowflake does not use numeric object IDs for metadata lookups.

This EWI has been replaced by an automatic conversion to a helper UDF (`OBJECT_SCHEMA_NAME_UDF`) that queries `INFORMATION_SCHEMA` to resolve schema names.

#### Code Example

##### Input Code (SQL Server):

```sql
SELECT OBJECT_SCHEMA_NAME(1);
```

##### Generated Code (Snowflake SQL) — Previous behavior:

```sql
SELECT
OBJECT_SCHEMA_NAME(1) !!!RESOLVE EWI!!! /*** SSC-EWI-TS0099 - THE OBJECT_SCHEMA_NAME FUNCTION IS NOT SUPPORTED IN SNOWFLAKE ***/!!!;
```

##### Generated Code (Snowflake SQL) — Current behavior:

```sql
SELECT
PUBLIC.OBJECT_SCHEMA_NAME_UDF(1);
```

#### Best Practices

* This EWI is no longer emitted. `OBJECT_SCHEMA_NAME` calls are now automatically converted to `OBJECT_SCHEMA_NAME_UDF`.
* Review the generated UDF to ensure it resolves the correct schema for your objects.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0103

GOTO targeting a label inside a nested block is not supported in Snowflake.

### Severity

High

#### Description

This EWI is generated when a `GOTO` targets a label that is declared inside a nested control flow block (such as `IF`, `WHILE`, `BEGIN...END`, or `TRY...CATCH`). SnowConvert AI’s GOTO/Label decomposition can only transform labels that are declared at the top level of a procedure body — labels buried inside nested blocks cannot be extracted into standalone nested procedures. When this happens, the `GOTO` is preserved with this EWI marker, while top-level labels in the same procedure are still transformed normally.

#### Code Example

In this example, `Done` is a top-level label and is transformed into a nested procedure. However, `HandlePartialFailure` is declared inside a `BEGIN...END` block, so the `GOTO` targeting it cannot be transformed:

##### Input Code:

```sql
CREATE PROCEDURE dbo.ImportCustomerData
AS
BEGIN
    DECLARE @BatchId INT = 0, @IsValid INT = 0, @RowCount INT = 0
    IF @BatchId = 1 GOTO Done
    BEGIN
        IF @IsValid = 1 GOTO HandlePartialFailure
        SET @RowCount = 1
    HandlePartialFailure:
        SET @RowCount = 2
    END
Done:
    RETURN 0
END
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE dbo.ImportCustomerData ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SC_EXIT_CODE VARCHAR;
    BATCHID INT := 0;
    ISVALID INT := 0;
    ROWCOUNT INT := 0;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        IF (:BATCHID = 1) THEN
          BEGIN
            CALL Done();
            RETURN 'PROCESS FINISHED';
          END;
        END IF;
        BEGIN
          IF (:ISVALID = 1) THEN
            !!!RESOLVE EWI!!! /*** SSC-EWI-TS0103 - GOTO TARGETING A LABEL INSIDE A NESTED BLOCK IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
            GOTO HandlePartialFailure
          END IF;
          ROWCOUNT := 1;
          HandlePartialFailure:
          ROWCOUNT := 2;
        END;
        CALL Done();
      END;
    Done PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        SC_EXIT_CODE := 0;
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN :SC_EXIT_CODE;
  END;
$$;
```

#### Best Practices

* Move the nested label to the top level of the procedure body so SnowConvert AI can transform it automatically.
* Alternatively, replace the GOTO with structured `IF/ELSE` or `LOOP` control flow to avoid the jump entirely.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

* SSC-EWI-TS0045: Labeled statement is not supported in Snowflake Scripting.
* SSC-EWI-TS0087: GOTO is not supported in Snowflake.

## SSC-EWI-TS0104

System table query pattern could not be automatically converted.

### Severity

**Medium**

### Description

This EWI is generated when SnowConvert AI encounters a query pattern inside a system table query (such as `sysconstraints`) that it cannot translate automatically. Common triggers include:

* `OBJECT_NAME()` called with an argument that doesn’t map to a known column (for example, `OBJECT_NAME(status)` instead of `OBJECT_NAME(constid)` or `OBJECT_NAME(id)`)
* `OBJECT_NAME()` compared against a non-literal value (a column reference, variable, or expression instead of a string literal)

In these cases the original expression is preserved and the EWI is emitted so you can review and rewrite the query manually.

#### Code Example

##### Input Code (SQL Server):

```sql
SELECT 1 FROM sysconstraints WHERE OBJECT_NAME(status) = 'X';
```

##### Generated Code (Snowflake SQL):

```sql
SELECT
    1
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0104 - 'OBJECT_NAME(status) in sysconstraints query' COULD NOT BE AUTOMATICALLY CONVERTED IN SYSTEM TABLE QUERY. MANUAL REVIEW REQUIRED ***/!!!
    OBJECT_NAME(status) = 'X';
```

##### Input Code (SQL Server) - Non-literal comparison:

```sql
SELECT 1 FROM sysconstraints WHERE OBJECT_NAME(constid) = col1;
```

##### Generated Code (Snowflake SQL):

```sql
SELECT
    1
FROM
    INFORMATION_SCHEMA.TABLE_CONSTRAINTS
WHERE
    !!!RESOLVE EWI!!! /*** SSC-EWI-TS0104 - 'Non-literal comparison with OBJECT_NAME in sysconstraints query' COULD NOT BE AUTOMATICALLY CONVERTED IN SYSTEM TABLE QUERY. MANUAL REVIEW REQUIRED ***/!!!
    OBJECT_NAME(constid) = col1;
```

#### Best Practices

* Replace `OBJECT_NAME(constid)` with `CONSTRAINT_NAME` and `OBJECT_NAME(id)` with `TABLE_NAME` when querying `INFORMATION_SCHEMA.TABLE_CONSTRAINTS`.
* Ensure that the comparison value is a string literal. If you need to compare against a variable or column, rewrite the query to use the equivalent `INFORMATION_SCHEMA` column directly.
* Review the [sysconstraints translation reference](../../../../translation-references/transact/transact-system-tables.md) for supported transformation patterns.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TS0107

CREATE TYPE AS TABLE cannot be converted to Snowflake

### Severity

Medium

### Description

SQL Server **table types** created with `CREATE TYPE ... AS TABLE (...)` are not mapped to a single Snowflake `CREATE TYPE` equivalent. SnowConvert flags this pattern so you can redesign using tables, views, or other Snowflake constructs.

#### Example Code

##### Input Code (SQL Server):

```sql
CREATE TYPE dbo.OrderLines AS TABLE (
    order_id INT,
    line_no  INT,
    qty      DECIMAL(10, 2)
);
```

##### Generated Code (Snowflake SQL):

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-TS0107 - CREATE TYPE AS TABLE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
CREATE TYPE dbo.OrderLines AS TABLE (
    order_id INT,
    line_no INT,
    qty DECIMAL(10, 2)
);
```

#### Best Practices

* Model reusable row shapes as permanent or transient tables, or use `OBJECT`/`VARIANT` patterns where appropriate.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - SQL Server-Azure Synapse Performance Review Messages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/performance-review/sqlServerPRF.md
section: Migrations
---

# SnowConvert AI - SQL Server-Azure Synapse Performance Review Messages

Applies to

* SQL Server
* Azure Synapse Analytics

## SSC-PRF-TS0001

Performance warning - recursion for CTE not checked. Might require a recursive keyword.

### Description

This warning appears when SnowConvert AI detects a Common Table Expression (CTE) but has not verified whether the CTE contains recursive operations in its query definition.

Snowflake SQL requires the RECURSIVE keyword for recursive Common Table Expressions (CTEs). Currently, SnowConvert AI does not automatically detect recursive queries to determine whether the RECURSIVE keyword should be included. This warning notifies you that you may need to manually add the RECURSIVE keyword for recursive CTEs.

Support for this validation may be added in future releases as requirements evolve.

### Code Example

#### Input Code:

```sql
 WITH Sales_CTE (SalesPersonID, NumberOfOrders)
AS
(
    SELECT SalesPersonID, 2
    FROM Sales.SalesOrderHeader
    WHERE SalesPersonID IS NOT NULL
    GROUP BY SalesPersonID
)
SELECT 2 AS "Average Sales Per Person"
FROM Sales_CTE;
```

#### Generated Code:

```sql
 --** SSC-PRF-TS0001 - PERFORMANCE WARNING - RECURSION FOR CTE NOT CHECKED. MIGHT REQUIRE RECURSIVE KEYWORD **
WITH Sales_CTE (
    SalesPersonID,
    NumberOfOrders
) AS
(
    SELECT
        SalesPersonID, 2
    FROM
        Sales.SalesOrderHeader
    WHERE
        SalesPersonID IS NOT NULL
    GROUP BY
        SalesPersonID
)
SELECT 2 AS "Average Sales Per Person"
FROM
    Sales_CTE;
```

### Best Practices

* The RECURSIVE keyword is optional and won’t affect your query results. However, it may influence how Snowflake allocates resources during execution. We recommend reviewing Snowflake’s CTE documentation and contacting us if you’d like automatic RECURSIVE keyword addition for compatible CTE queries.
* For additional assistance, please email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - SSIS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/ssis/README.md
section: Migrations
---

# SnowConvert AI - SSIS

This section provides a comprehensive reference of SSIS elements and components that SnowConvert can convert to dbt and Snowflake. Control Flow elements (tasks and containers) become orchestration logic, while Data Flow components (sources, transformations, destinations) become dbt models.

## Control Flow Elements

These SSIS Control Flow tasks and containers are supported:

| Element | Category | Conversion Target | Notes |
| --- | --- | --- | --- |
| [Microsoft.Pipeline (Data Flow Task)](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/data-flow?view=sql-server-ver17) | Task | Complete dbt Project | - |
| [Microsoft.ExecuteSQLTask](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/execute-sql-task?view=sql-server-ver17) | Task | Inline SQL or Stored Procedure | - |
| [Microsoft.ExecutePackageTask](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/execute-package-task?view=sql-server-ver17) | Task | Inline EXECUTE TASK or PROCEDURE call | - |
| [Microsoft.SendMailTask](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/send-mail-task?view=sql-server-ver17) | Task | SYSTEM$SEND_EMAIL with Notification Integration | Some features not supported; See Send Mail Task section |
| [Microsoft.BulkInsertTask](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/bulk-insert-task?view=sql-server-ver17) | Task | COPY INTO with inline FILE_FORMAT | Requires stage setup; See Bulk Insert Task section |
| [STOCK:SEQUENCE (Sequence Container)](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/sequence-container?view=sql-server-ver17) | Container | Inline sequential execution | - |
| [STOCK:FORLOOP (For Loop Container)](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/for-loop-container?view=sql-server-ver17) | Container | Sequential execution | Manual iteration logic required; Check [EWI SSC-EWI-SSIS0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) for more information |
| [STOCK:FOREACHLOOP (ForEach Loop Container)](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/foreach-loop-container?view=sql-server-ver17) | Container | LIST/CURSOR pattern | Requires stage mapping; Check [EWI SSC-EWI-SSIS0014](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) for more information |
| [Event Handlers](https://learn.microsoft.com/en-us/sql/integration-services/integration-services-ssis-event-handlers?view=sql-server-ver17) | Container | Not converted | Implement manually using Snowflake exception handling |

**Note**: Unlisted Control Flow elements generate EWI [SSC-EWI-SSIS0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md).

### Container Conversion Details

SSIS containers (Sequence, For Loop, ForEach, Event Handlers) are converted using an inline approach where container logic is expanded within the parent TASK or procedure rather than creating separate procedures.

#### Sequence Containers

[Sequence containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/sequence-container?view=sql-server-ver17) are converted inline within the parent TASK. The container’s boundaries are marked with comments in the generated code, and all tasks within the container execute sequentially in the same TASK scope.

**Conversion characteristics:**

* No separate procedure or TASK is created for the container
* Container boundaries are clearly marked BEGIN … END blocks
* All tasks execute sequentially within the parent TASK
* Task execution order based on precedence constraints is maintained
* **Limitation**: Only “Success” precedence constraints are fully supported. Conditional execution based on task outcomes (Failure or Completion constraints) is not currently implemented and will require manual post-migration adjustments

**Behavioral differences:**

* FDM generated: [SSC-FDM-SSIS0003](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md)
* Variable scoping differs from SSIS: Container variables are accessible throughout the entire parent TASK, not just within the container scope

**Example:**

```sql
-- BEGIN Sequence Container: MySequence
-- Task 1 within sequence
EXECUTE DBT PROJECT public.DataFlow1 ARGS='build --target dev';
-- Task 2 within sequence
EXECUTE DBT PROJECT public.DataFlow2 ARGS='build --target dev';
-- END Sequence Container: MySequence
```

#### For Loop Containers

[For Loop containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/for-loop-container?view=sql-server-ver17) are converted to sequential execution of their contained tasks. However, the loop iteration logic itself requires manual implementation.

**Conversion limitations:**

* The container executes once by default (iteration logic not automatically converted)
* InitExpression, EvalExpression, and AssignExpression require manual conversion
* An [EWI (SSC-EWI-SSIS0004)](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) is generated to indicate manual work is needed

**Required manual steps:**

1. Review EvalExpression to understand the loop termination condition
2. Implement the iteration using Snowflake’s WHILE loop construct
3. Update AssignExpression logic for proper loop counter management

#### ForEach Loop Containers

**File Enumerator (Supported)**

[ForEach File Enumerator containers](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/foreach-loop-container?view=sql-server-ver17) are converted to Snowflake stage operations using the LIST command and cursor pattern:

```sql
-- List files from Snowflake stage
LIST @<STAGE_PLACEHOLDER>/FolderPath PATTERN = '.*/file_pattern\.csv';

-- Create cursor for iteration
LET file_cursor CURSOR FOR
   SELECT REGEXP_SUBSTR($1, '[^/]+$') AS FILE_VALUE
   FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
   WHERE $1 NOT LIKE '%FolderPath/%/%';

-- Iterate through files
FOR file_row IN file_cursor DO
   User_CurrentFileName := :file_row.FILE_VALUE;
   EXECUTE DBT PROJECT public.My_DataFlow_Project ARGS='build --target dev';
END FOR;
```

**Configuration requirements:**

After migration, you’ll need to:

* Replace `<STAGE_PLACEHOLDER>` with your actual Snowflake stage name
* Ensure the folder path is correctly mapped to a Snowflake stage
* Verify that files are properly staged in Snowflake

An [EWI (SSC-EWI-SSIS0014)](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) is generated to remind you of this manual configuration step.

**Other Enumerator Types**

Other ForEach enumerator types (ForEach Item, ForEach ADO, ForEach NodeList, etc.) aren’t currently supported. SnowConvert generates an [EWI (SSC-EWI-SSIS0004)](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) for these cases. Consider implementing the equivalent logic using Snowflake queries or scripting constructs.

#### Event Handlers

[Event handlers](https://learn.microsoft.com/en-us/sql/integration-services/integration-services-ssis-event-handlers?view=sql-server-ver17) (OnError, OnWarning, OnPreExecute, OnPostExecute, etc.) aren’t supported. EWIs are generated. Implement manually using Snowflake exception handling.

### Execute SQL Task

[Execute SQL Tasks](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/execute-sql-task?view=sql-server-ver17) are converted as inline SQL statements or separate stored procedures, depending on complexity and result set bindings.

**Conversion approach:**

* **Simple SQL statements**: Converted inline within the parent TASK
* **Complex statements with result sets**: May be converted to separate stored procedures
* **Result bindings**: Handled where possible; unsupported patterns generate [EWI SSC-EWI-SSIS0011](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md)

### Execute Package Task

[Execute Package Tasks](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/execute-package-task?view=sql-server-ver17) are handled differently based on package type:

| Package Type | Conversion | Notes |
| --- | --- | --- |
| **Local** (single reference) | Inline execution within parent TASK | Package logic expanded inline |
| **Reusable** (2+ references or parameters) | CALL to stored procedure | Enables synchronous execution with parameters; generates [FDM SSC-FDM-SSIS0005](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| **External** | CALL with path resolution | Generates [EWI SSC-EWI-SSIS0008](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) for manual verification |

**Asynchronous execution note:**

TASK-based Execute Package conversions run asynchronously. For synchronous behavior, packages are converted to stored procedures. See [EWI SSC-EWI-SSIS0005](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md).

### Send Mail Task

[Send Mail Tasks](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/send-mail-task?view=sql-server-ver17) are converted to Snowflake Tasks that use `SYSTEM$SEND_EMAIL` with a dynamically created Notification Integration.

#### Key Differences from SSIS

| Aspect | SSIS | Snowflake |
| --- | --- | --- |
| Email Service | Custom SMTP server | Snowflake’s built-in email service |
| Configuration | SMTP Connection Manager | Notification Integration |
| Sender Address | Custom FROM address | Fixed by Snowflake account |
| CC/BCC Support | Full support | Not supported (merged into recipients) |
| Attachments | File attachments supported | Not supported |
| HTML Body | Supported | Plain text only |
| Priority | High/Normal/Low | Not supported |

#### Property Mapping

| SSIS Property | Snowflake Equivalent | Notes |
| --- | --- | --- |
| ToLine | `ALLOWED_RECIPIENTS` + recipients parameter | Direct mapping |
| FromLine | Prepended to message body | [FDM SSC-FDM-SSIS0008](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| CCLine | Added to recipients list | [FDM SSC-FDM-SSIS0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| BCCLine | Added to recipients list | [FDM SSC-FDM-SSIS0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) (privacy concern) |
| Subject | `subject` parameter | Direct mapping |
| MessageSource | `message` parameter | Direct mapping |
| MessageSourceType (DirectInput) | Supported | - |
| MessageSourceType (Variable) | Supported | Variable reference converted |
| MessageSourceType (FileConnection) | Not supported | [EWI SSC-EWI-SSIS0017](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| Priority | Not supported | [EWI SSC-EWI-SSIS0016](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| FileAttachments | Not supported | [EWI SSC-EWI-SSIS0015](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| SMTPConnection | Managed by Snowflake | [FDM SSC-FDM-SSIS0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| BodyFormat (HTML) | Not supported | [EWI SSC-EWI-SSIS0018](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |

#### Conversion Output Structure

Each Send Mail Task is converted to a Snowflake Task containing:

1. **Notification Integration Creation**: Created dynamically via `EXECUTE IMMEDIATE`
2. **SYSTEM$SEND_EMAIL Call**: Sends the email through the integration

```sql
CREATE OR REPLACE TASK public.my_package_send_mail_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.my_package
AS
BEGIN
   -- Step 1: Create Notification Integration dynamically
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com", "team@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;

   -- Step 2: Send the email
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com,team@example.com', 'Subject', 'Message body');
END;
```

#### Conversion Examples

**Basic Email (To, Subject, Body):**

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Daily Report', 'The daily report is ready.');
END;
```

**Email with FROM Address:**

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("noreply@company.com", "admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   --** SSC-FDM-SSIS0008 - SNOWFLAKE'S EMAIL INTEGRATION USES A FIXED SENDER ADDRESS. THE ORIGINAL FROM ADDRESS HAS BEEN PREPENDED TO THE MESSAGE BODY FOR REFERENCE. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'noreply@company.com,admin@example.com', 'Notification', 'Email sent by: noreply@company.com

Package completed successfully.');
END;
```

**Email with Multiple Features (attachments, priority, CC):**

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("noreply@company.com", "admin@example.com", "team@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0015 - SNOWFLAKE'S SYSTEM$SEND_EMAIL DOES NOT SUPPORT FILE ATTACHMENTS. CONSIDER USING STAGED FILES WITH SHARED LINKS OR ALTERNATIVE DELIVERY METHODS. ***/!!!
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0016 - EMAIL PRIORITY SETTINGS (HIGH/NORMAL/LOW) ARE NOT SUPPORTED BY SYSTEM$SEND_EMAIL AND WILL BE IGNORED. ***/!!!
   --** SSC-FDM-SSIS0008 - SNOWFLAKE'S EMAIL INTEGRATION USES A FIXED SENDER ADDRESS. THE ORIGINAL FROM ADDRESS HAS BEEN PREPENDED TO THE MESSAGE BODY FOR REFERENCE. **
   --** SSC-FDM-SSIS0009 - SNOWFLAKE'S SYSTEM$SEND_EMAIL DOES NOT SUPPORT CC ADDRESSING. ALL CC RECIPIENTS HAVE BEEN ADDED TO THE MAIN RECIPIENTS LIST. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'noreply@company.com,admin@example.com,team@example.com', 'Monthly Report', 'Email sent by: noreply@company.com

Please review the attached monthly report.');
END;
```

#### Prerequisites for Snowflake Email

Before using converted Send Mail Tasks:

1. **Email Notification Integration permissions**: Account admin must grant `CREATE INTEGRATION ON ACCOUNT` to the executing role
2. **Recipient verification**: All email addresses in `ALLOWED_RECIPIENTS` must be verified in Snowflake
3. **Update warehouse name**: Replace `DUMMY_WAREHOUSE` with your actual warehouse name

#### Workarounds for Unsupported Features

**File Attachments:**

Upload files to a Snowflake stage and share links instead:

```sql
-- Upload file to stage
PUT file://report.pdf @my_stage;

-- Get shareable link (valid for 1 hour)
LET file_url STRING := GET_PRESIGNED_URL(@my_stage, 'report.pdf', 3600);

-- Include link in email body
CALL SYSTEM$SEND_EMAIL('my_integration', 'admin@example.com', 'Report Available',
  'Download the report from: ' || :file_url);
```

**BCC Privacy:**

Send separate emails to maintain recipient privacy:

```sql
-- Send to main recipients
CALL SYSTEM$SEND_EMAIL('my_integration', 'admin@example.com', 'Subject', 'Message');

-- Send separately to BCC recipients
CALL SYSTEM$SEND_EMAIL('my_integration', 'audit@example.com', 'Subject', 'Message');
```

### Bulk Insert Task

[Bulk Insert Tasks](https://learn.microsoft.com/en-us/sql/integration-services/control-flow/bulk-insert-task?view=sql-server-ver17) are converted to Snowflake Tasks that use `COPY INTO` with an inline FILE_FORMAT. The conversion generates a stage placeholder that you must configure before execution.

#### Key Differences from SSIS

| Aspect | SSIS | Snowflake |
| --- | --- | --- |
| Data Source | File system path or UNC path | Snowflake Stage (internal or external) |
| File Format | Format file (.fmt/.xml) or inline options | FILE_FORMAT object or inline options |
| Native Format | Native/WideNative supported | Not supported (CSV, JSON, Parquet, etc.) |
| Row Filtering | FirstRow/LastRow options | Not directly supported |
| Batch Control | BatchSize configurable | Automatic management |
| Error Handling | MaximumErrors count | ON_ERROR behavior |
| Triggers | FireTriggers option | Not supported (use Streams/Tasks) |
| Table Locking | TableLock option | Not needed (MVCC) |

#### Property Mapping

| SSIS Property | Snowflake Equivalent | Notes |
| --- | --- | --- |
| DestinationTableName | `COPY INTO table` | Square brackets `[]` removed |
| DataFileType (Char) | `TYPE = 'CSV'` | Direct mapping |
| DataFileType (Native) | Not supported | [EWI SSC-EWI-SSIS0020](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| FieldTerminator | `FIELD_DELIMITER` | Parsed from SSIS format |
| RowTerminator | `RECORD_DELIMITER` | Parsed from SSIS format |
| FirstRow | `SKIP_HEADER` | Value - 1 |
| LastRow | Not supported | [EWI SSC-EWI-SSIS0021](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| MaximumErrors | `ON_ERROR` | [FDM SSC-FDM-SSIS0011](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| KeepNulls=True | `NULL_IF = ()` | Empty tuple |
| KeepNulls=False | `NULL_IF = ('', 'NULL', 'null')` | Default behavior |
| KeepIdentity=False | FDM generated | [FDM SSC-FDM-SSIS0017](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| TableLock=True | Not needed | [FDM SSC-FDM-SSIS0014](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| FireTriggers=True | Not supported | [EWI SSC-EWI-SSIS0022](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| FormatFile | Not supported | [EWI SSC-EWI-SSIS0023](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md) |
| CheckConstraints=True | Always enforced | [FDM SSC-FDM-SSIS0016](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| BatchSize | Automatic | [FDM SSC-FDM-SSIS0012](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |
| SortedData | Not available | [FDM SSC-FDM-SSIS0015](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) |

#### Terminator Parsing

SSIS uses specific tokens for field and row terminators. These are converted to Snowflake escape sequences:

| SSIS Format | Snowflake Output |
| --- | --- |
| `{CR}{LF}` | `\r\n` |
| `{CR}` | `\r` |
| `{LF}` | `\n` |
| `{TAB}` | `\t` |
| `Tab` | `\t` |
| `Comma {,}` | `,` |
| `Semicolon {;}` | `;` |
| `Vertical Bar {|}` | `|` |

#### Conversion Output Structure

Each Bulk Insert Task is converted to a Snowflake Task containing a COPY INTO statement with an inline FILE_FORMAT:

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   ---- Start block 'Package\BulkInsertTask'
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0024 - THE STAGE AND FILE UPLOAD ARE NOT INCLUDED IN THE TRANSLATION. CREATE A SNOWFLAKE STAGE AND UPLOAD THE SOURCE FILE BEFORE EXECUTING THE COPY INTO STATEMENT. REPLACE {STAGE_PLACEHOLDER} WITH YOUR STAGE NAME. ***/!!!
   COPY INTO target_table
   FROM '@{STAGE_PLACEHOLDER}'
   PATTERN = '.*data_file.*'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\r\n', SKIP_HEADER = 1, NULL_IF = ('', 'NULL', 'null'), ERROR_ON_COLUMN_COUNT_MISMATCH = FALSE)
   ON_ERROR = CONTINUE;
   ---- End block 'Package\BulkInsertTask'
END;
```

#### Conversion Examples

**Basic Bulk Insert (CSV with default options):**

```sql
CREATE OR REPLACE TASK public.package_load_customers
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0024 - THE STAGE AND FILE UPLOAD ARE NOT INCLUDED IN THE TRANSLATION. CREATE A SNOWFLAKE STAGE AND UPLOAD THE SOURCE FILE BEFORE EXECUTING THE COPY INTO STATEMENT. REPLACE {STAGE_PLACEHOLDER} WITH YOUR STAGE NAME. ***/!!!
   COPY INTO Customers
   FROM '@{STAGE_PLACEHOLDER}'
   PATTERN = '.*customers\.csv.*'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\r\n', SKIP_HEADER = 0, NULL_IF = ('', 'NULL', 'null'), ERROR_ON_COLUMN_COUNT_MISMATCH = FALSE)
   ON_ERROR = CONTINUE;
END;
```

**Bulk Insert with Tab Delimiter and Header Skip:**

```sql
CREATE OR REPLACE TASK public.package_load_products
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0024 - THE STAGE AND FILE UPLOAD ARE NOT INCLUDED IN THE TRANSLATION. CREATE A SNOWFLAKE STAGE AND UPLOAD THE SOURCE FILE BEFORE EXECUTING THE COPY INTO STATEMENT. REPLACE {STAGE_PLACEHOLDER} WITH YOUR STAGE NAME. ***/!!!
   COPY INTO Products
   FROM '@{STAGE_PLACEHOLDER}'
   PATTERN = '.*products\.txt.*'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = '\t', RECORD_DELIMITER = '\n', SKIP_HEADER = 1, NULL_IF = ('', 'NULL', 'null'), ERROR_ON_COLUMN_COUNT_MISMATCH = FALSE)
   ON_ERROR = SKIP_FILE_10;
END;
```

**Bulk Insert with Multiple EWIs (Native format, LastRow, FireTriggers):**

```sql
CREATE OR REPLACE TASK public.package_load_orders
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0020 - SSIS BULKINSERTTASK NATIVE OR WIDENATIVE DATA FILE TYPE IS NOT SUPPORTED IN SNOWFLAKE. EXPORT SOURCE DATA TO CSV FORMAT BEFORE MIGRATION. ***/!!!
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0021 - SSIS BULKINSERTTASK LASTROW OPTION IS NOT SUPPORTED IN SNOWFLAKE. USE TEMPORARY TABLE WITH ROW_NUMBER AND LIMIT/OFFSET TO SELECT SPECIFIC ROW RANGE. ***/!!!
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0022 - SSIS BULKINSERTTASK FIRETRIGGERS OPTION IS NOT SUPPORTED IN SNOWFLAKE. CONSIDER USING SNOWFLAKE STREAMS AND TASKS TO IMPLEMENT TRIGGER-LIKE BEHAVIOR. ***/!!!
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0024 - THE STAGE AND FILE UPLOAD ARE NOT INCLUDED IN THE TRANSLATION. CREATE A SNOWFLAKE STAGE AND UPLOAD THE SOURCE FILE BEFORE EXECUTING THE COPY INTO STATEMENT. REPLACE {STAGE_PLACEHOLDER} WITH YOUR STAGE NAME. ***/!!!
   COPY INTO Orders
   FROM '@{STAGE_PLACEHOLDER}'
   PATTERN = '.*orders\.dat.*'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\r\n', SKIP_HEADER = 0, NULL_IF = ('', 'NULL', 'null'), ERROR_ON_COLUMN_COUNT_MISMATCH = FALSE)
   ON_ERROR = CONTINUE;
END;
```

#### Stage Setup (Required)

Before executing converted Bulk Insert Tasks, you must:

1. **Create a Snowflake stage:**

```sql
CREATE OR REPLACE STAGE my_bulk_stage;
```

2. **Upload files using SnowSQL CLI:**

```bash
PUT file:///path/to/data.csv @my_bulk_stage AUTO_COMPRESS = FALSE;
```

3. **Replace the stage placeholder in generated code:**

```sql
-- Change this:
FROM '@{STAGE_PLACEHOLDER}'

-- To this:
FROM '@my_bulk_stage'
```

4. **Verify files are staged:**

```sql
LIST @my_bulk_stage;
```

#### Workarounds for Unsupported Features

**Native Data Format:**

Export SQL Server data to CSV format before migration. The native binary format is not supported by Snowflake.

**LastRow Filtering:**

Load to staging table and filter:

```sql
-- Load all data
COPY INTO staging_table FROM '@my_stage' ...;

-- Insert only rows up to LastRow value
INSERT INTO target_table
SELECT * FROM (
  SELECT *, ROW_NUMBER() OVER (ORDER BY 1) AS rn
  FROM staging_table
) WHERE rn <= 1000;  -- Original LastRow value
```

**FireTriggers (Trigger-like Behavior):**

Use Snowflake Streams and Tasks:

```sql
-- Create stream to capture inserts
CREATE OR REPLACE STREAM target_stream ON TABLE target_table;

-- Create task to process inserts (trigger logic)
CREATE OR REPLACE TASK process_inserts
  WAREHOUSE = my_warehouse
  SCHEDULE = '1 minute'
  WHEN SYSTEM$STREAM_HAS_DATA('target_stream')
AS
  INSERT INTO audit_table
  SELECT *, CURRENT_TIMESTAMP()
  FROM target_stream
  WHERE METADATA$ACTION = 'INSERT';
```

### dbt Project Execution

Within the orchestration code, Data Flow Tasks are executed using Snowflake’s [`EXECUTE DBT PROJECT`](../../../../sql-reference/sql/execute-dbt-project.md) command:

```sql
EXECUTE DBT PROJECT schema.project_name ARGS='build --target dev'
```

**Important requirements:**

* The `project_name` must match the name you used when deploying the dbt project (via `CREATE DBT PROJECT` or Snowflake Workspace deployment)
* Arguments passed are standard dbt CLI arguments (like `build`, `run`, `test`)
* Each execution runs the entire dbt project with all models in dependency order

**Deployment:**

Before executing dbt projects in orchestration, deploy them using:

* Snowflake CLI: `snow dbt deploy --schema schema_name --database database_name --force package_name`
* Snowflake Workspace: Upload and deploy via UI

For complete deployment instructions, see the [user guide](../../general/user-guide/etl-migration-replatform.md).

## Data Flow Components

These SSIS Data Flow sources, transformations, and destinations are supported:

| Component | Category | dbt Mapping | Model Naming | Notes |
| --- | --- | --- | --- | --- |
| **Source Components** |  |  |  |  |
| [Microsoft.OLEDBSource](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/ole-db-source?view=sql-server-ver17) | Source | Staging Model | `stg_raw__{component_name}` | - |
| [Microsoft.FlatFileSource](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/flat-file-source?view=sql-server-ver17) | Source | Staging Model | `stg_raw__{component_name}` | - |
| **Transformation Components** |  |  |  |  |
| [Microsoft.DerivedColumn](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/derived-column-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (SELECT with expressions) | `int_{component_name}` | - |
| [Microsoft.DataConvert](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/data-conversion-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (CAST expressions) | `int_{component_name}` | - |
| [Microsoft.Lookup](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/lookup-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (LEFT JOIN) | `int_{component_name}` | Might present functional differences for ORDER BY requirements. Check [FDM SSC-FDM-SSIS0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) for more information |
| [Microsoft.UnionAll](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/union-all-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (UNION ALL) | `int_{component_name}` | - |
| [Microsoft.Merge](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/merge-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (UNION ALL) | `int_{component_name}` | Might present functional differences for sorted output. Check [FDM SSC-FDM-SSIS0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) for more information |
| [Microsoft.MergeJoin](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/merge-join-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (JOIN) | `int_{component_name}` | Might present functional differences for ORDER BY requirements. Check [FDM SSC-FDM-SSIS0004](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md) for more information |
| [Microsoft.ConditionalSplit](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/conditional-split-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (Router pattern with CTEs) | `int_{component_name}` | - |
| [Microsoft.Multicast](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/multicast-transformation?view=sql-server-ver17) | Transformation | Intermediate Model (SELECT pass-through) | `int_{component_name}` | - |
| [Microsoft.RowCount](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/transformations/row-count-transformation?view=sql-server-ver17) | Transformation | Intermediate Model with macro | `int_{component_name}` | Uses m_update_row_count_variable macro |
| **Destination Components** |  |  |  |  |
| [Microsoft.OLEDBDestination](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/ole-db-destination?view=sql-server-ver17) | Destination | Mart Model (table) | `{target_table_name}` | - |
| [Microsoft.FlatFileDestination](https://learn.microsoft.com/en-us/sql/integration-services/data-flow/flat-file-destination?view=sql-server-ver17) | Destination | Mart Model (table) | `{target_table_name}` | - |

**Note**: Unlisted Data Flow components generate EWI [SSC-EWI-SSIS0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md).

---
title: SnowConvert AI - SSIS Conversion Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/ssisEWI.md
section: Migrations
---

# SnowConvert AI - SSIS Conversion Issues

This section provides detailed documentation for all Error, Warning, and Information (EWI) messages that SnowConvert may generate during SSIS to dbt conversion.

For assistance with any EWI, you can use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions, or contact [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com) for additional support.

## SSC-EWI-SSIS0001

SSIS component requires manual implementation.

### Severity

Critical

### Description

This EWI is added when an SSIS component cannot be automatically converted to Snowflake SQL or dbt. The component is not supported by SnowConvert’s conversion engine and requires manual implementation. This typically occurs with custom components, third-party transformations, or components that have no direct equivalent in Snowflake’s architecture.

The conversion will place a placeholder comment in the generated code indicating where manual intervention is required.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0001 - SSIS COMPONENT IS NOT SUPPORTED BY SNOWCONVERT ***/!!!
-- Component: ScriptComponent1 requires manual implementation
```

### Best Practices

* Review the original SSIS component’s logic and data transformation requirements
* If possible, implement equivalent functionality using Snowflake SQL, dbt models, or Snowflake stored procedures
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0002

SSIS expression requires manual translation to Snowflake SQL.

### Severity

High

### Description

This EWI is generated when an SSIS expression contains syntax that cannot be automatically translated to Snowflake SQL. This commonly occurs with:

* Complex nested expressions with unsupported functions
* SSIS-specific functions without direct Snowflake equivalents
* Malformed expressions (e.g., unbalanced parentheses)
* Expressions using unsupported operators or type conversions

The generated code will include a placeholder where the expression should be manually translated.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_simple_expression
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   LET User_message VARCHAR := public.GetControlVariableUDF('User_message', 'package_simple_expression') :: VARCHAR;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0002 - SSIS EXPRESSION CANNOT BE CONVERTED TO SNOWFLAKE SQL: @[User::message] = UPPER( @[User::message] || ***/!!!
   ;
   CALL public.UpdateControlVariable('User_message', 'package_simple_expression', TO_VARIANT(:User_message));
END;
```

### Best Practices

* Carefully review the original SSIS expression logic
* If possible, manually translate the expression to valid Snowflake SQL syntax
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0003

Embedded SQL requires manual translation to Snowflake syntax.

### Severity

High

### Description

This EWI is added when SQL statements embedded in SSIS components (such as OLE DB Source, Lookup, or Execute SQL Task) cannot be automatically converted to Snowflake syntax. This typically occurs when:

* The source SQL dialect has syntax incompatible with Snowflake
* The SQL contains system-specific functions or objects
* The SQL uses features not supported in Snowflake

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0003 - EMBEDDED SQL CANNOT BE CONVERTED FROM SQL SERVER TO SNOWFLAKE SQL ***/!!!
-- Original SQL: SELECT CustomerID, CustomerName FROM DimCustomer
--                WHERE CONVERT(VARCHAR, LastModified, 101) > '01/01/2020'
```

## SSC-EWI-SSIS0004

SSIS Control Flow Element requires manual implementation.

### Severity

High

### Description

This EWI is generated when an SSIS control flow element cannot be converted to Snowflake scripting. This can occur with various unsupported control flow tasks and containers, including but not limited to:

**Common Scenarios:**

* Control flow task types not yet supported by SnowConvert
* Container iteration logic that cannot be directly translated
* Complex control flow patterns without Snowflake equivalents
* Control flow elements with configurations that cannot be mapped to Snowflake

The specific control flow element that triggered this EWI will be identified in the error message, and manual implementation is required using Snowflake’s procedural SQL constructs.

### Common Cases

#### For Loop Container

For Loop containers with iteration logic (initialization, condition, increment) that cannot be automatically converted.

**Converted Code:**

```sql
CREATE OR REPLACE TASK simpleforloop_package_for_loop_container
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.simpleforloop_package_execute_sql_task
AS
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0004 - SSIS CONTROL FLOW ELEMENT 'FORLOOP CONTAINER ITERATION LOGIC' CANNOT BE CONVERTED TO SNOWFLAKE SCRIPTING. ***/!!!
BEGIN
   -- Loop body tasks here
END;
```

**Best Practices for For Loop:**

* Convert to Snowflake’s WHILE loops with explicit counter variables

#### ForEach Loop Container (Non-File Enumerator)

ForEach Loop containers with enumerators other than the File Enumerator (which has its own EWI SSC-EWI-SSIS0014).

**Converted Code:**

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0004 - SSIS CONTROL FLOW ELEMENT 'FOREACH CONTAINER ITERATION LOGIC' CANNOT BE CONVERTED TO SNOWFLAKE SCRIPTING. ***/!!!
FOR record IN cursor DO
   -- Loop body tasks here
END FOR;
```

**Best Practices for ForEach Loop:**

* Identify the enumerator type (ADO, NodeList, Variable, etc.)

#### Other Unsupported Control Flow Elements

Other control flow tasks and elements that may generate this EWI include:

* Custom tasks without Snowflake equivalents
* Third-party control flow elements
* Certain configurations of standard tasks
* Complex event handlers

## SSC-EWI-SSIS0005

Execute Package Task converted to asynchronous EXECUTE TASK.

### Severity

High

### Description

This EWI indicates that an SSIS Execute Package Task has been converted to a Snowflake TASK, which executes asynchronously by default. In SSIS, Execute Package Task runs synchronously within the parent package’s execution context. In Snowflake, EXECUTE TASK triggers the task to run asynchronously, which may affect orchestration logic, error handling, and variable passing between packages.

### Converted Code

```sql
CREATE OR REPLACE TASK parent_package_execute_package_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.parent_package_previous_task
AS
EXECUTE TASK public.childpackage_child_package;
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0005 - THIS TASK RUNS ASYNCHRONOUSLY. Original SSIS Execute Package Task ran synchronously. ***/!!!
```

## SSC-EWI-SSIS0006

Execute Package Task variable bindings require manual implementation.

### Severity

High

### Description

This EWI is generated when an Execute Package Task contains variable bindings (parameter mappings between parent and child packages) that could not be automatically converted. SSIS allows parent packages to pass variable values to child packages through parameter bindings. This mechanism requires manual implementation in Snowflake.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0006 - SSIS EXECUTE PACKAGE TASK VARIABLE BINDINGS WERE NOT CONVERTED. ***/!!!
-- Original binding: ParentVariable -> ChildParam
-- Implement parameter passing mechanism manually
EXECUTE TASK public.childpackage;
```

## SSC-EWI-SSIS0007

Property expressions require manual implementation.

### Severity

High

### Description

This EWI is generated when an SSIS executable (task or container) uses property expressions to dynamically set properties at runtime, and these expressions could not be converted. Property expressions in SSIS allow dynamic configuration of task properties using expressions based on variables or parameters. In Snowflake, similar dynamic behavior must be implemented manually.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0007 - SSIS EXECUTABLE CONTAINS PROPERTY EXPRESSIONS that were not converted. ***/!!!
-- Original property expression: SqlStatementSource = @[User::SqlQuery]
-- Implement dynamic SQL logic manually
```

### Best Practices

* Use EXECUTE IMMEDIATE for dynamic SQL execution
* Validate dynamically constructed SQL to prevent injection risks
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0008

Execute Package Task references external package not in conversion scope.

### Severity

Medium

### Description

This EWI indicates that an Execute Package Task references a package that exists outside the current project or conversion scope. Ensure all dependent packages are available for the migration process.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0008 - EXECUTE PACKAGE TASK REFERENCES AN EXTERNAL PACKAGE ***/!!!
-- External package: C:\ExternalPackages\UtilityPackage.dtsx
-- Ensure this package is converted and accessible
EXECUTE TASK public.utilitypackage;
```

### Best Practices

* Create an inventory of all external package dependencies
* If possible, include external packages in the conversion scope
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0009

Unexpected error during component conversion.

### Severity

High

### Description

This EWI indicates that an unexpected error occurred during the conversion of a specific component. This is typically a rare occurrence and may be caused by:

* Corrupted package metadata
* Unusual component configuration
* Edge cases not covered by the converter

The component may have been partially converted. Review the generated code and contact support if the issue persists.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0009 - UNEXPECTED EXCEPTION CONVERTING COMPONENT. ***/!!!
-- Review component configuration and generated code
```

## SSC-EWI-SSIS0010

Multiple models write to the same destination table.

### Severity

High

### Description

This EWI is generated when multiple SSIS components write to the same destination table, resulting in multiple dbt models targeting the same table. This typically occurs when:

* Multiple Data Flow Tasks write to the same table
* Different packages in the same conversion write to the same table
* The same table is used as a destination in multiple transformations

### Converted Code

```sql
-- Model 1: models/factsales.sql
-- Model 2: models/factsales_1.sql (automatically renamed)
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0010 - A model associated with table 'FactSales' is already defined. ***/!!!
```

### Best Practices

* Review duplicate destination references in your packages
* If possible, consolidate multiple writes to the same table into a single model
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0011

Result binding configured for non-SELECT statement.

### Severity

High

### Description

This EWI is generated when an Execute SQL Task has a result binding configured (to capture query results into a variable), but the SQL statement is not a SELECT query. Result bindings only work with SELECT statements that return result sets. If the SQL statement is an INSERT, UPDATE, DELETE, or procedural statement, the result binding cannot be applied and must be manually addressed.

For non-query statements, consider using OUTPUT parameters or separate SELECT statements to retrieve values.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0011 - RESULT BINDING IS CONFIGURED FOR NON-QUERY STATEMENT. RESULT BINDING ONLY WORKS WITH SELECT QUERIES. ***/!!!
DELETE FROM Customers WHERE CustomerId = 1;
-- Original result binding: RowCount -> User::DeletedRows
```

### Best Practices

* Use separate SELECT statements to retrieve values after DML operations
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0012

XML result set requires manual implementation.

### Severity

High

### Description

This EWI is generated when an Execute SQL Task is configured to return results as XML, which is not directly supported in the conversion. SnowConvert supports SINGLEROW and FULLRESULTSET result types, but XML result sets require manual implementation.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0012 - XML RESULT SET TYPE IS NOT SUPPORTED. ONLY SINGLEROW AND FULLRESULTSET RESULT SET TYPES ARE SUPPORTED. ***/!!!
-- Original SQL: SELECT * FROM Customers FOR XML AUTO
-- Convert to supported result type or use JSON format
```

## SSC-EWI-SSIS0013

Complex property expression requires manual implementation.

### Severity

High

### Description

This EWI is generated when a property expression in an SSIS task contains patterns that cannot be automatically converted. SnowConvert supports simple property expressions such as:

* Single variable references: `@[User::VariableName]`
* Simple string concatenation: `"SELECT * FROM " + @[User::TableName]`

More complex patterns involving multiple operations, nested expressions, or complex string manipulation require manual conversion.

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0013 - PROPERTY EXPRESSION FOR SQLSTATEMENTSOURCE CONTAINS UNSUPPORTED PATTERNS. ONLY SINGLE VARIABLE REFERENCES OR SIMPLE STRING CONCATENATION WITH LITERALS AND VARIABLE REFERENCES IS SUPPORTED. ***/!!!
-- Implement complex expression logic manually
```

## SSC-EWI-SSIS0014

ForEach File Enumerator requires Snowflake stage mapping.

### Severity

High

### Description

This EWI is generated when a ForEach File Enumerator Container is used to iterate over files in a folder. In SSIS, this references local or network file system paths. In Snowflake, files must be staged in Snowflake internal or external stages. You must:

* Map the folder path to a Snowflake stage
* Replace the `<STAGE_PLACEHOLDER>` with your actual stage name
* Ensure files are uploaded to the stage before execution
* Implement the file enumeration logic using Snowflake’s LIST command

### Converted Code

```sql
-- !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0014 - THE FOLDER PATH REQUIRES MANUAL MAPPING TO A SNOWFLAKE STAGE. ***/!!!
-- Original folder: C:\Data\InputFiles\
-- Replace <STAGE_PLACEHOLDER> with your stage name
-- Example: @my_stage/input_files/
DECLARE
   file_cursor CURSOR FOR
      SELECT relative_path
      FROM TABLE(RESULT_SCAN(
         SELECT system$list_files('<STAGE_PLACEHOLDER>')
      ))
      WHERE relative_path LIKE '%.csv';
```

### Best Practices

* Create a Snowflake stage for file storage
* Upload files to the stage using SnowSQL, Snowpipe, or cloud storage integration
* Update the stage reference in the generated code
* Test file enumeration with actual staged files
* Use Snowflake’s LIST command to verify files are accessible
* Document stage naming conventions for your project
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0015

Send Mail Task file attachments are not supported in Snowflake.

### Severity

High

### Description

This EWI is generated when an SSIS Send Mail Task is configured with file attachments. Snowflake’s `SYSTEM$SEND_EMAIL` procedure does not support sending file attachments directly. The email will be sent without the attachments.

To include file content in emails, you must upload files to a Snowflake stage and generate shareable links using `GET_PRESIGNED_URL()`.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0015 - SNOWFLAKE'S SYSTEM$SEND_EMAIL DOES NOT SUPPORT FILE ATTACHMENTS. CONSIDER USING STAGED FILES WITH SHARED LINKS OR ALTERNATIVE DELIVERY METHODS. ***/!!!
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Report Attached', 'Please review the attached report.');
END;
```

### Best Practices

* Upload files to a Snowflake stage before sending the email
* Generate pre-signed URLs using `GET_PRESIGNED_URL(@stage_name, 'filename', expiration_seconds)`
* Include the shareable link in the email body instead of the attachment
* Consider using external file sharing services for large attachments
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0016

Send Mail Task email priority is not supported in Snowflake.

### Severity

Medium

### Description

This EWI is generated when an SSIS Send Mail Task is configured with a priority setting (High, Normal, or Low). Snowflake’s `SYSTEM$SEND_EMAIL` does not support email priority headers. The email will be sent without priority information.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0016 - EMAIL PRIORITY SETTINGS (HIGH/NORMAL/LOW) ARE NOT SUPPORTED BY SYSTEM$SEND_EMAIL AND WILL BE IGNORED. ***/!!!
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Urgent Alert', 'Critical error detected.');
END;
```

### Best Practices

* Add priority indicators to the email subject line (e.g., `[URGENT]`, `[HIGH PRIORITY]`)
* No other manual action required for basic functionality
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0017

Send Mail Task file-based message source is not supported.

### Severity

High

### Description

This EWI is generated when an SSIS Send Mail Task uses a File Connection as the message source type. Snowflake’s `SYSTEM$SEND_EMAIL` requires the message body to be provided as a string value directly. Loading email content from external files is not supported during conversion.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0017 - LOADING EMAIL BODY FROM A FILE CONNECTION IS NOT SUPPORTED. THE MESSAGE SOURCE MUST BE DIRECT INPUT OR FROM A VARIABLE. ***/!!!
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Newsletter', '');
END;
```

### Best Practices

* Load the file content into a Snowflake stage
* Read the file content using Snowflake functions and store it in a variable
* Pass the variable content to `SYSTEM$SEND_EMAIL`
* Consider migrating email templates to Snowflake tables or variables
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0018

Send Mail Task HTML body format is not supported in Snowflake.

### Severity

Medium

### Description

This EWI is generated when an SSIS Send Mail Task is configured with HTML body format. Snowflake’s `SYSTEM$SEND_EMAIL` only supports plain text email bodies. HTML tags will appear as literal text in the email and will not be rendered.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0018 - SYSTEM$SEND_EMAIL ONLY SUPPORTS PLAIN TEXT EMAIL BODIES. HTML FORMATTING WILL NOT BE PRESERVED. ***/!!!
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Formatted Report', '<html><body><h1>Report</h1><p>Status: <b>Success</b></p></body></html>');
END;
```

### Best Practices

* Convert HTML content to plain text before sending
* Use text formatting conventions like asterisks for emphasis (`*bold*`)
* Use ASCII art or spacing for tabular data
* Consider using external email services if HTML formatting is required
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0020

Bulk Insert Task native data file type is not supported in Snowflake.

### Severity

Critical

### Description

This EWI is generated when an SSIS Bulk Insert Task is configured with `DataFileType` set to `DTSBulkInsert_DataFileType_Native` or `DTSBulkInsert_DataFileType_WideNative`. These binary formats are SQL Server proprietary and cannot be read by Snowflake’s COPY INTO command.

Native data files store data in SQL Server’s internal binary format for faster bulk operations. Since Snowflake cannot interpret these formats, the source data must be exported to a supported format (CSV, JSON, Parquet, etc.) before migration.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0020 - SSIS BULKINSERTTASK NATIVE OR WIDENATIVE DATA FILE TYPE IS NOT SUPPORTED IN SNOWFLAKE. EXPORT SOURCE DATA TO CSV FORMAT BEFORE MIGRATION. ***/!!!
   COPY INTO target_table
   FROM '@{STAGE_PLACEHOLDER}'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* Export data from SQL Server to CSV or another supported format before migration
* Consider Parquet format for better performance with large datasets
* Update the FILE_FORMAT in the generated code to match your exported format
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0021

Bulk Insert Task LastRow option is not supported in Snowflake.

### Severity

High

### Description

This EWI is generated when an SSIS Bulk Insert Task specifies a `LastRow` value to limit the number of rows loaded. Snowflake’s COPY INTO command does not support stopping at a specific row number.

To achieve similar functionality, you must load the data into a staging table and then use SQL with ROW_NUMBER() and LIMIT to select the desired row range.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0021 - SSIS BULKINSERTTASK LASTROW OPTION IS NOT SUPPORTED IN SNOWFLAKE. USE TEMPORARY TABLE WITH ROW_NUMBER AND LIMIT/OFFSET TO SELECT SPECIFIC ROW RANGE. ***/!!!
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV', SKIP_HEADER = 1);
END;
```

### Best Practices

* Load data into a staging table first, then use SQL to filter rows
* Consider pre-processing the source file to contain only the needed rows
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Load all data to a staging table, then insert only the rows up to the desired LastRow value:

```sql
-- Step 1: Load all data to staging
COPY INTO staging_table FROM '@my_stage' FILE_FORMAT = (TYPE = 'CSV');

-- Step 2: Insert only rows up to LastRow value (e.g., 1000)
INSERT INTO target_table
SELECT * FROM (
  SELECT *, ROW_NUMBER() OVER (ORDER BY 1) AS rn
  FROM staging_table
) WHERE rn <= 1000;
```

## SSC-EWI-SSIS0022

Bulk Insert Task FireTriggers option is not supported in Snowflake.

### Severity

Medium

### Description

This EWI is generated when an SSIS Bulk Insert Task has the `FireTriggers` option enabled. In SQL Server, this option causes INSERT triggers to fire during the bulk load operation. Snowflake does not have traditional database triggers.

To implement trigger-like behavior in Snowflake, use Streams and Tasks to detect and process data changes after the load completes.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0022 - SSIS BULKINSERTTASK FIRETRIGGERS OPTION IS NOT SUPPORTED IN SNOWFLAKE. CONSIDER USING SNOWFLAKE STREAMS AND TASKS TO IMPLEMENT TRIGGER-LIKE BEHAVIOR. ***/!!!
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* Review the original SQL Server trigger logic
* Create a Snowflake Stream on the target table to capture data changes
* Create a scheduled Task to process the stream data (equivalent to trigger logic)
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Implement trigger-like behavior using Snowflake Streams and Tasks:

```sql
-- Step 1: Create a stream to capture inserts
CREATE OR REPLACE STREAM target_table_stream ON TABLE target_table;

-- Step 2: Create a task to process the stream (trigger logic)
CREATE OR REPLACE TASK process_inserts
  WAREHOUSE = my_warehouse
  SCHEDULE = '1 minute'
  WHEN SYSTEM$STREAM_HAS_DATA('target_table_stream')
AS
  INSERT INTO audit_table
  SELECT *, CURRENT_TIMESTAMP()
  FROM target_table_stream
  WHERE METADATA$ACTION = 'INSERT';
```

## SSC-EWI-SSIS0023

Bulk Insert Task FormatFile is not supported in Snowflake.

### Severity

Medium

### Description

This EWI is generated when an SSIS Bulk Insert Task uses a format file (`.fmt` or `.xml`) to define the data layout. Snowflake does not support SQL Server format files. Instead, you must create an equivalent Snowflake FILE FORMAT object that defines the same parsing rules.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0023 - SSIS BULKINSERTTASK FORMAT FILE IS NOT SUPPORTED IN SNOWFLAKE. CREATE EQUIVALENT FILE FORMAT OBJECT MANUALLY BASED ON FORMAT FILE CONTENTS. ***/!!!
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* Review the original format file to understand field definitions, terminators, and data types
* Create an equivalent Snowflake FILE FORMAT object
* For complex format files with column mappings, consider using a staging table with explicit column selection
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Create an equivalent Snowflake FILE FORMAT based on the format file contents:

```sql
CREATE OR REPLACE FILE FORMAT my_format
  TYPE = 'CSV'
  FIELD_DELIMITER = ','
  RECORD_DELIMITER = '\n'
  SKIP_HEADER = 1
  FIELD_OPTIONALLY_ENCLOSED_BY = '"'
  NULL_IF = ('', 'NULL');
```

## SSC-EWI-SSIS0024

Bulk Insert Task stage and file upload not included in translation.

### Severity

Medium

### Description

This EWI is always generated for Bulk Insert Task conversions to remind you that Snowflake requires files to be staged before they can be loaded. Unlike SSIS which can read directly from file system paths, Snowflake’s COPY INTO command requires files to be in a Snowflake stage (internal or external).

You must create a stage, upload your source files, and replace the `{STAGE_PLACEHOLDER}` in the generated code with your actual stage name.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0024 - THE STAGE AND FILE UPLOAD ARE NOT INCLUDED IN THE TRANSLATION. CREATE A SNOWFLAKE STAGE AND UPLOAD THE SOURCE FILE BEFORE EXECUTING THE COPY INTO STATEMENT. REPLACE {STAGE_PLACEHOLDER} WITH YOUR STAGE NAME. ***/!!!
   COPY INTO target_table
   FROM '@{STAGE_PLACEHOLDER}'
   PATTERN = '.*data_file.*'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\n')
   ON_ERROR = CONTINUE;
END;
```

### Best Practices

* Create a Snowflake stage for your source files
* Upload files using SnowSQL CLI with the PUT command
* Replace `{STAGE_PLACEHOLDER}` with your stage name in the generated code
* Verify files are staged correctly using `LIST @my_stage;`
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Create a stage and upload your source files before executing the COPY INTO:

```sql
-- 1. Create a stage
CREATE OR REPLACE STAGE my_bulk_stage;

-- 2. Upload file using SnowSQL CLI
-- PUT file:///path/to/data.csv @my_bulk_stage AUTO_COMPRESS = FALSE;

-- 3. Replace @{STAGE_PLACEHOLDER} in generated code
```

## SSC-EWI-SSIS0025

Flat File Source stage path variable requires manual mapping to a Snowflake stage.

### Severity

High

### Description

This EWI is generated when a Flat File Source component references a stage path variable that must be manually mapped to a Snowflake stage. In SSIS, Flat File Source components read directly from file system paths. In Snowflake, files must be staged before they can be queried. The generated code uses a dbt variable for the stage path, which must be updated to point to the actual Snowflake stage containing the source file.

### Converted Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0025 - FLAT FILE SOURCE STAGE PATH VARIABLE 'Data_Flow_Task_Flat_File_Source_stage_path' REQUIRES MANUAL MAPPING TO A SNOWFLAKE STAGE. ***/!!!
SELECT
   $1 :: NUMERIC AS id,
   $2 :: VARCHAR(100) AS name,
   $3 :: NUMERIC(10, 2) AS amount,
   $4 :: DATE AS order_date
FROM
   @{{ var('Data_Flow_Task_Flat_File_Source_stage_path') }} (FILE_FORMAT => 'TestPackage_Data_Flow_Task_Flat_File_Source')
WHERE
   METADATA$FILE_ROW_NUMBER > 1
```

### Best Practices

* Create a Snowflake stage and upload your flat files before execution
* Update the stage path variable in the task CONFIG section to point to your Snowflake stage (e.g., `@my_stage/input_files/`)
* Verify the FILE_FORMAT object matches your source file’s delimiters and encoding
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0026

SSIS property expression is not supported.

### Severity

High

### Description

This EWI is generated when an SSIS component uses a property expression that cannot be converted to Snowflake. Property expressions in SSIS dynamically set component properties at runtime using SSIS expression syntax. Certain property expressions — particularly those referencing project parameters, complex concatenations, or unsupported expression functions — cannot be automatically translated. The original expression is preserved in the generated code as a comment for manual resolution.

### Converted Code

```sql
CREATE OR REPLACE TASK public.TestPackage
CONFIG = $${
  "package": {
    "Data_Flow_Task_Flat_File_Source_stage_path": {"value": "!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0026 - PROPERTY EXPRESSION 'ConnectionString' IS NOT SUPPORTED. **/ @[$Project::FlatFileDir] + \"\\\\input.csv\"", "type": "VARCHAR", "is_parameter": false}
  }
}$$
AS
BEGIN
   LET config VARCHAR := SYSTEM$GET_TASK_GRAPH_CONFIG('package');
   LET scope VARCHAR := 'TestPackage';
   CALL public.ClearVariables(:scope);
   CALL public.InitVariablesFromConfig(:scope, :config);
END;
```

### Best Practices

* Review the original SSIS property expression and manually implement the equivalent logic
* For connection string expressions, replace with the actual Snowflake stage path or connection reference
* Use dbt variables (`{{ var('...') }}`) or Snowflake task CONFIG parameters for runtime configuration
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0027

Flat File Source connection manager not found.

### Severity

High

### Description

This EWI is generated when a Flat File Source component references a connection manager that cannot be found in the SSIS package. This typically occurs when:

* The connection manager is defined at the project level and not embedded in the `.dtsx` package
* The connection manager reference is broken or misconfigured
* The `.conmgr` file is missing from the project

Without the connection manager, SnowConvert cannot determine the file path, format, or column definitions, so the generated code contains null placeholder columns.

### Converted Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0027 - FLAT FILE SOURCE CONNECTION MANAGER NOT FOUND. ***/!!!
SELECT
   null AS col_a,
   null AS col_b
```

### Best Practices

* Locate the missing connection manager (`.conmgr` file) and include it in the conversion scope
* Verify the connection manager reference ID matches between the Data Flow component and the package
* Manually define the source query with the correct columns and stage path
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0028

Excel Source error row redirection requires alternative implementation.

### Severity

High

### Description

This EWI appears in the migration assessment report when an SSIS Excel Source component has an error output configured with `RedirectRow` disposition. In SSIS, error rows can be redirected to a separate output for logging or reprocessing. This pattern cannot be directly translated to Snowflake SQL.

The generated dbt model processes all rows without error redirection. To achieve similar error handling behavior in Snowflake, use `TRY_TO_*` casting functions with error flag columns for defensive error handling, and create separate error capture models if needed.

### Converted Code

The generated dbt model reads data normally without error handling:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      TRY_TO_DOUBLE(data['id'] :: VARCHAR) AS id,
      data['name'] :: VARCHAR AS name
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
```

To add error row redirection, modify the model like this:

```sql
-- Add an error flag column to identify rows with conversion issues
SELECT
   TRY_TO_DOUBLE(data['id'] :: VARCHAR) AS id,
   data['name'] :: VARCHAR AS name,
   CASE WHEN TRY_TO_DOUBLE(data['id'] :: VARCHAR) IS NULL AND data['id'] :: VARCHAR IS NOT NULL THEN TRUE ELSE FALSE END AS _has_error
FROM
   excel_raw_data
```

### Best Practices

* Use `TRY_TO_NUMBER`, `TRY_TO_DATE`, and other `TRY_TO_*` functions for defensive casting
* Add error flag columns to identify rows with conversion issues
* Create a separate dbt model to capture and log error rows: `SELECT * FROM {{ ref('my_model') }} WHERE _has_error = TRUE`
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0029

Excel Source parameterized queries require dbt vars or macros.

### Severity

High

### Description

This EWI appears in the migration assessment report when an SSIS Excel Source component uses parameter mappings to pass runtime values into the query. SSIS allows parameterized queries in Excel Source using `?` placeholders bound to SSIS variables. This pattern cannot be directly translated.

The generated dbt model reads the full Excel sheet without applying parameter filtering. You must manually add the equivalent filtering using dbt `var()` functions or Jinja templating.

### Converted Code

The generated dbt model reads all data from the sheet:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      data['ProductName'] :: VARCHAR AS ProductName,
      TRY_TO_DOUBLE(data['Quantity'] :: VARCHAR) AS Quantity
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
-- Add equivalent parameter filtering here:
-- WHERE ProductName = '{{ var("filter_product") }}'
```

### Best Practices

* Replace SSIS parameter placeholders with dbt `{{ var('param_name') }}` references
* Pass values at runtime using `dbt run --vars '{"param_name": "value"}'`
* Add WHERE clauses or CTE filters to replicate the original parameterized query logic
* Consider using dbt macros for complex parameterized logic
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0030

Excel Source project-level connection manager requires external resolution.

### Severity

Medium

### Description

This EWI appears in the migration assessment report when an SSIS Excel Source component references a connection manager defined at the project level (in a `.conmgr` file) rather than embedded in the `.dtsx` package. SnowConvert cannot automatically resolve project-level connection managers because the `.conmgr` file may not be included in the conversion scope.

The generated dbt model may use a placeholder or default stage path. You must locate the corresponding `.conmgr` file and extract the `ConnectionString` property to determine the Excel file path and HDR (header row) setting, then update the stage file path variable in `dbt_project.yml`.

### Converted Code

The generated dbt model uses a stage file path variable that needs to be updated:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      data['Column1'] :: VARCHAR AS Column1
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
```

### Best Practices

* Locate the `.conmgr` file in your SSIS project directory — it will be named after the connection manager
* Open the `.conmgr` file and find the `ConnectionString` property to get the Excel file path and `HDR=YES/NO` setting
* Include `.conmgr` files in the conversion input folder and re-run the conversion for better results
* Update the `stage_file_path` variable in `dbt_project.yml` to point to your Snowflake stage
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0031

Excel Source CONFIG variable requires manual mapping.

### Severity

Medium

### Description

This EWI appears in the migration assessment report when the conversion creates a CONFIG variable for an Excel Source stage file path. The generated orchestration task includes a CONFIG section with a default value derived from the original SSIS package’s connection string. You must update this variable to reference the correct Snowflake stage and Excel file path before execution.

The default value follows the pattern `@excel_source_stage/<filename>.xlsx` and is also registered in `dbt_project.yml` under the `vars:` section.

### Converted Code

The orchestration task includes the stage file path as a CONFIG variable:

```sql
CREATE OR REPLACE TASK public.my_package
WAREHOUSE=DUMMY_WAREHOUSE
CONFIG = $${
  "package": {
    "Data_Flow_Task_Excel_Source_stage_file_path": {"value": "@excel_source_stage/sales_data.xlsx", "type": "VARCHAR", "is_parameter": false}
  }
}$$
AS
BEGIN
   LET config VARCHAR := SYSTEM$GET_TASK_GRAPH_CONFIG('package');
   LET scope VARCHAR := 'my_package';
   CALL public.ClearVariables(:scope);
   CALL public.InitVariablesFromConfig(:scope, :config);
END;
```

### Best Practices

* Upload your Excel file to a Snowflake stage: `PUT file:///path/to/sales_data.xlsx @my_stage AUTO_COMPRESS = FALSE;`
* Update the CONFIG variable value to point to the staged file (e.g., `@my_stage/sales_data.xlsx`)
* Also update the corresponding variable in `dbt_project.yml` under `vars:`
* Verify the file is accessible from the stage using `LIST @my_stage;`
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0032

Excel Source variable-based access mode not supported.

### Severity

High

### Description

This EWI appears in the migration assessment report when an SSIS Excel Source component uses a variable-based access mode to dynamically resolve the sheet name or SQL query at runtime. SSIS supports four access modes:

* **0**: Table or view (static sheet name) — supported
* **1**: Table name or view name from variable — **not supported**
* **2**: SQL command (static query) — supported
* **3**: SQL command from variable — **not supported**

For modes 1 and 3, the sheet name or query is stored in an SSIS variable and resolved at runtime. This dynamic resolution cannot be automatically translated. The generated model reads the full sheet using the static `OpenRowset` value from the component configuration. You must manually add the dynamic sheet or query logic using dbt vars.

### Converted Code

The generated dbt model uses the static sheet name from the component definition:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      data['ProductName'] :: VARCHAR AS ProductName,
      TRY_TO_DOUBLE(data['Quantity'] :: VARCHAR) AS Quantity
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
```

### Best Practices

* Replace the static sheet name with a dbt `{{ var('sheet_name') }}` reference if it needs to be dynamic
* Pass the sheet name at runtime using `dbt run --vars '{"sheet_name": "Sheet1"}'`
* If the variable resolved to a SQL command (access mode 3), convert the query logic to WHERE clauses or CTEs in the dbt model
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0033

Excel Source SQL command filtering logic not preserved.

### Severity

Medium

### Description

This EWI appears in the migration assessment report when an SSIS Excel Source component uses the SQL Command access mode (mode 2) with a query that includes filtering, joins, or aggregation logic. For example, the original SSIS component might use `SELECT * FROM [Sheet1$] WHERE Status = 'Active'`.

The generated dbt model reads the full Excel sheet via the `excel_source_udf` UDF and does not apply the original SQL filtering. You must add the equivalent filtering as downstream CTEs or WHERE clauses in the dbt model.

### Converted Code

The generated dbt model reads all data from the sheet without the original SQL filtering:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      data['ProductName'] :: VARCHAR AS ProductName,
      data['Status'] :: VARCHAR AS Status,
      TRY_TO_DOUBLE(data['Quantity'] :: VARCHAR) AS Quantity
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
-- Original SSIS SQL: SELECT * FROM [Sheet1$] WHERE Status = 'Active'
-- Add the equivalent filtering:
WHERE
   Status = 'Active'
```

### Best Practices

* Check the assessment report for the original SQL command text
* Add the equivalent WHERE clause, JOINs, or aggregation as CTEs in the dbt model
* Test the output row count to ensure it matches the original SSIS component’s results
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0034

Flat file format is not supported.

### Severity

High

### Description

This EWI is generated when an SSIS Flat File Source component uses a file format that cannot be automatically converted to Snowflake. Snowflake’s COPY INTO command and FILE_FORMAT objects support delimited (CSV) files but do not directly support certain SSIS flat file formats such as:

* **FixedWidth**: Files where columns are defined by character position and width
* **RaggedRight**: Files where the last column has a variable length delimiter

For these formats, the source file must be pre-processed or transformed into a delimited format before loading into Snowflake.

### Converted Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0034 - FLAT FILE FORMAT 'FixedWidth' IS NOT SUPPORTED. ***/!!!
SELECT
   null AS col_a,
   null AS col_b
```

### Best Practices

* Convert FixedWidth files to CSV or another delimited format before staging in Snowflake
* For RaggedRight files, add a consistent delimiter to the last column
* Use Python UDFs or external functions for complex format transformations
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0035

Non-standard header row delimiter is not supported.

### Severity

High

### Description

This EWI is generated when an SSIS Flat File connection manager specifies a header row delimiter that is not a newline character. Snowflake’s FILE_FORMAT expects header rows to be terminated by the standard record delimiter (typically `\n` or `\r\n`). Non-standard header row delimiters cannot be configured in Snowflake.

### Converted Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0035 - FLAT FILE HEADER ROW DELIMITER IS NOT A NEWLINE CHARACTER. THIS IS NOT SUPPORTED. ***/!!!
SELECT
   $1 :: VARCHAR(50) AS col_a,
   $2 :: VARCHAR(50) AS col_b
FROM
   @{{ var('Data_Flow_Task_Flat_File_Source_stage_path') }} (FILE_FORMAT => 'TestPackage_Data_Flow_Task_Flat_File_Source')
WHERE
   METADATA$FILE_ROW_NUMBER > 1
```

### Best Practices

* Pre-process the source file to replace the non-standard header delimiter with a newline character
* Alternatively, remove the header row and use `SKIP_HEADER = 0` in the FILE_FORMAT
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0036

Flat File Source feature is not supported.

### Severity

High

### Description

This EWI is generated when a Flat File Source component uses a feature that is not supported by SnowConvert’s conversion engine. The specific unsupported feature is identified in the EWI message. Common unsupported features include:

* **Multiple Flat Files connection manager**: Connection managers configured to read from multiple files simultaneously
* **Per-column delimiters**: Different delimiters for individual columns
* **Per-column text qualified settings**: Individual text qualifier settings per column
* **Unsupported CodePage**: Character encoding not supported by Snowflake
* **Unsupported LocaleID**: Locale-specific formatting not supported by Snowflake
* **Multi-character text qualifier**: Text qualifiers longer than a single character

### Converted Code

```sql
!!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0036 - MULTIPLE FLAT FILES CONNECTION MANAGER IS NOT SUPPORTED. ***/!!!
SELECT
   null AS col_a,
   null AS col_b
```

### Best Practices

* Review the specific unsupported feature identified in the EWI message
* For multiple flat files, use Snowflake’s PATTERN option in COPY INTO to load multiple files from a stage
* For per-column delimiters, pre-process the file to use a uniform delimiter
* For unsupported code pages, convert the file encoding to UTF-8 before staging
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0037

SSIS expression function not reviewed for Snowflake equivalence.

### Severity

Medium

### Description

This EWI is generated when an SSIS expression uses a function that has not been reviewed for Snowflake equivalence. The function has been passed through to the generated code as-is, but it may not exist or behave differently in Snowflake. Manual review is required to verify or replace the function with the appropriate Snowflake equivalent.

### Converted Code

```sql
WITH source_data AS
(
   SELECT
      Name
   FROM
      {{ ref('stg_raw__ole_db_source') }}
)
SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0037 - SSIS EXPRESSION FUNCTION 'TOKEN' HAS NOT BEEN REVIEWED FOR SNOWFLAKE EQUIVALENCE ***/!!!
   TOKEN(Name, ',', 1) AS TokenResult,
   Name AS Name
FROM
   source_data
```

### Best Practices

* Look up the SSIS expression function in the [SSIS Expression Reference](https://learn.microsoft.com/en-us/sql/integration-services/expressions/functions-ssis-expression) to understand its behavior
* Find the equivalent Snowflake function (e.g., SSIS `TOKEN` → Snowflake `SPLIT_PART`)
* Replace the function call with the Snowflake equivalent and test the output
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0038

SSIS FileSystemTask path requires manual mapping to a Snowflake stage.

### Severity

High

### Description

This EWI is generated when an SSIS File System Task uses a File Connection Manager to reference a file or directory path. In SSIS, File System Tasks operate directly on local or network file system paths. In Snowflake, file operations use stages instead of file system paths.

The generated code uses `@<STAGE_PLACEHOLDER>` where the original path was. You must replace this placeholder with the actual Snowflake stage and path that corresponds to the original file location.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_delete_report_file
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   ---- Start block 'Package\Delete Report File'
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0038 - THE PATH(S) REQUIRES MANUAL MAPPING TO A SNOWFLAKE STAGE(S). ***/!!!
   EXECUTE IMMEDIATE 'REMOVE @<STAGE_PLACEHOLDER>/Data/Input/report.txt';
   ---- End block 'Package\Delete Report File'

END;
```

### Best Practices

* Replace `@<STAGE_PLACEHOLDER>` with your actual Snowflake stage name (e.g., `@my_stage`)
* Create the stage if it does not exist: `CREATE OR REPLACE STAGE my_stage;`
* Ensure files are uploaded to the stage before executing file operations
* For operations referencing multiple paths, each `@<STAGE_PLACEHOLDER>` may map to a different stage
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0039

SSIS FileSystemTask overwrite=false not supported in Snowflake.

### Severity

High

### Description

This EWI is generated when an SSIS File System Task is configured with `OverwriteDestinationFile = False`. In SSIS, this setting causes the task to fail if the destination file or directory already exists, providing a safety guard against accidental overwrites.

Snowflake’s `COPY FILES INTO` command always silently overwrites existing files at the destination. There is no built-in mechanism to check for file existence before copying. If your workflow depends on the fail-if-exists behavior, you must implement a manual check.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_copy_file_no_overwrite
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_SourceFilePath VARCHAR := public.GetControlVariableUDF('User_SourceFilePath', 'package') :: VARCHAR;
   LET User_DestFilePath VARCHAR := public.GetControlVariableUDF('User_DestFilePath', 'package') :: VARCHAR;
   ---- Start block 'Package\Copy File No Overwrite'
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0039 - THE ORIGINAL SSIS TASK WAS CONFIGURED TO FAIL IF THE DESTINATION EXISTS. SNOWFLAKE DOES NOT SUPPORT THIS BEHAVIOR AND WILL SILENTLY OVERWRITE. ***/!!!
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   EXECUTE IMMEDIATE CONCAT('COPY FILES INTO ', :User_DestFilePath, ' FROM ', :User_SourceFilePath);
   ---- End block 'Package\Copy File No Overwrite'

END;
```

### Best Practices

* If fail-if-exists behavior is required, add a pre-check using `LIST @stage/path` and raise an error if the file exists
* Consider using `SYSTEM$STREAM_HAS_DATA` or a control table to track file processing state
* Review whether the overwrite guard is critical to your workflow or simply a safety measure
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Implement a file existence check before copying:

```sql
-- Check if destination file exists before copying
LET file_count INTEGER := (SELECT COUNT(*) FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
   WHERE "name" LIKE '%dest_file.csv%');
IF (:file_count > 0) THEN
   RAISE 'Destination file already exists. Aborting copy to prevent overwrite.';
END IF;
```

## SSC-EWI-SSIS0041

SSIS FileSystemTask references an external or unresolvable connection manager.

### Severity

High

### Description

This EWI is generated when an SSIS File System Task references a connection manager for its source or destination path that is not defined within the `.dtsx` package file. This typically occurs when the connection manager is defined at the project level (in a `.conmgr` file) or when the reference is broken.

Without the connection manager definition, SnowConvert cannot resolve the file path. The unresolvable path is replaced with `@<STAGE_PLACEHOLDER>` in the generated code.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_copy_from_external_source
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_DestFilePath VARCHAR := public.GetControlVariableUDF('User_DestFilePath', 'package') :: VARCHAR;
   ---- Start block 'Package\Copy From External Source'
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0041 - THE CONNECTION MANAGER REFERENCED BY TASKSOURCEPATH OR TASKDESTINATIONPATH IS NOT DEFINED IN THE DTSX FILE. IT MAY BE A PROJECT-LEVEL CONNECTION MANAGER. THE PATH CANNOT BE RESOLVED. ***/!!!
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   EXECUTE IMMEDIATE CONCAT('COPY FILES INTO ', :User_DestFilePath, ' FROM @<STAGE_PLACEHOLDER>');
   ---- End block 'Package\Copy From External Source'

END;
```

### Best Practices

* Locate the missing connection manager in your SSIS project (check for `.conmgr` files)
* Include `.conmgr` files in the conversion input folder and re-run the conversion
* Replace `@<STAGE_PLACEHOLDER>` with the correct Snowflake stage path derived from the connection manager’s file path
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0043

SSIS FileSystemTask SetAttributes operation not supported in Snowflake.

### Severity

High

### Description

This EWI is generated when an SSIS File System Task is configured with the `SetAttributes` operation. In SSIS, this operation sets file system attributes such as Read-Only, Hidden, Archive, or System on a file or directory. Snowflake stages do not support file attributes — all staged files are accessible without attribute-based access control.

The generated code contains an empty statement (`;`) with diagnostic comments preserving the original task configuration for reference.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_set_file_attributes
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   ---- Start block 'Package\Set File Attributes'
   ---- TaskOperationType="SetAttributes"
   ---- TaskSourcePath="Data/report.txt"
   ---- TaskIsSourceVariable="False"
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0043 - SNOWFLAKE STAGES DO NOT SUPPORT FILE ATTRIBUTES (READ-ONLY, HIDDEN, ETC.). THIS OPERATION CANNOT BE TRANSLATED. ***/!!!
    ;
   ---- End block 'Package\Set File Attributes'

END;
```

### Best Practices

* Review whether the file attribute setting is critical to your workflow
* If access control is needed, use Snowflake role-based access control (RBAC) on the stage instead
* The empty statement can be safely removed if no alternative is needed
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SSIS0044

SSIS FileSystemTask COPY FILES destination appends source filename.

### Severity

High

### Description

This EWI is generated for every file-level copy, move, or rename operation in an SSIS File System Task. Snowflake’s `COPY FILES INTO` command treats the destination as a directory prefix and appends the source filename automatically. This means:

* If your SSIS task copies `source.csv` to a destination path `@stage/output/renamed.csv`, the actual result will be `@stage/output/renamed.csv/source.csv` — which is likely not the intended behavior
* If the destination is a directory path (e.g., `@stage/output/`), the behavior is correct

Review the destination path in the generated code and adjust manually if the destination refers to a specific file rather than a directory.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_copy_data_file
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_SourceFilePath VARCHAR := public.GetControlVariableUDF('User_SourceFilePath', 'package') :: VARCHAR;
   LET User_DestFilePath VARCHAR := public.GetControlVariableUDF('User_DestFilePath', 'package') :: VARCHAR;
   ---- Start block 'Package\Copy Data File'
   !!!RESOLVE EWI!!! /*** SSC-EWI-SSIS0044 - THE COPY FILES COMMAND TREATS THE DESTINATION AS A DIRECTORY PREFIX AND APPENDS THE SOURCE FILENAME. IF THE DESTINATION PATH REFERS TO A SPECIFIC FILE RATHER THAN A DIRECTORY, THE RESULTING PATH MAY BE INCORRECT. REVIEW AND ADJUST MANUALLY IF NEEDED. ***/!!!
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   EXECUTE IMMEDIATE CONCAT('COPY FILES INTO ', :User_DestFilePath, ' FROM ', :User_SourceFilePath);
   ---- End block 'Package\Copy Data File'

END;
```

### Best Practices

* If the destination is a directory, the generated code works correctly as-is
* If the destination is meant to be a specific filename (rename scenario), use a two-step approach: copy to a directory, then rename using `COPY FILES INTO` + `REMOVE`
* For move operations, verify that the source file is correctly removed after the copy
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this EWI
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - SSIS Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/ssisFDM.md
section: Migrations
---

# SnowConvert AI - SSIS Functional Differences

This section provides detailed documentation for all Functional Difference Messages (FDMs) that SnowConvert may generate during SSIS to dbt conversion. FDMs indicate where the converted code functions correctly but has behavioral differences from the original SSIS implementation.

For assistance with any FDM, you can use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions, or contact [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com) for additional support.

## SSC-FDM-SSIS0001

Replace NULL with appropriate ORDER BY column(s) to ensure deterministic first match selection.

### Severity

None

### Description

This FDM is generated when a Lookup transformation is converted to SQL JOIN. In SSIS, the Lookup transformation returns the first matching row based on the order rows are read from the reference table. In standard SQL, when multiple rows match the join condition without an ORDER BY clause, any matching row may be returned, making the result non-deterministic.

To ensure consistent behavior matching SSIS, add an ORDER BY clause to the query that retrieves the first match.

### Converted Code

```sql
WITH lookup_reference AS
(
   SELECT
      SalesTerritoryKey,
      SalesTerritoryAlternateKey,
      SalesTerritoryRegion,
      SalesTerritoryCountry,
      SalesTerritoryGroup,
      SalesTerritoryImage
   FROM
      {{ ref('stg_raw__lookup') }}
   QUALIFY
      ROW_NUMBER() OVER (
      PARTITION BY
         SalesTerritoryKey
      ORDER BY
         (
            SELECT
               --** SSC-FDM-SSIS0001 - REPLACE NULL WITH APPROPRIATE ORDER BY COLUMN(S) TO ENSURE DETERMINISTIC FIRST MATCH SELECTION. SSIS LOOKUP RETURNS THE FIRST MATCHING ROW, SO PROPER ORDERING IS REQUIRED WHEN MULTIPLE ROWS MATCH THE JOIN CONDITION. **
               null
         )) = 1
),
input_data AS
(
   SELECT
      EmployeeKey EmployeeKey,
      SalesTerritoryKey SalesTerritoryKey,
      BaseRate BaseRate,
      FirstName FirstName,
      LastName LastName
   FROM
      {{ ref('stg_raw__ole_db_source') }}
)
SELECT
   input_data.EmployeeKey,
   input_data.SalesTerritoryKey,
   input_data.BaseRate,
   input_data.FirstName,
   input_data.LastName,
   lookup_reference.SalesTerritoryRegion Region,
   lookup_reference.SalesTerritoryCountry Country
FROM
   input_data
   INNER JOIN
      lookup_reference
      ON lookup_reference.SalesTerritoryKey = input_data.SalesTerritoryKey
```

### Best Practices

* Replace `null` with appropriate ORDER BY columns (e.g., `ORDER BY modified_date DESC, customer_id`)
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0002

Add an ORDER BY clause to ensure sorted output.

### Severity

Low

### Description

This FDM is generated when a Merge transformation is converted to UNION ALL. In SSIS, the Merge transformation requires sorted inputs and naturally produces a sorted, deterministic output preserving the merge order. The equivalent SQL UNION ALL does not guarantee any particular order unless an explicit ORDER BY clause is added.

If the order of rows matters for downstream processing or matches SSIS behavior, add an ORDER BY clause to the final query.

### Converted Code

```sql
--** SSC-FDM-SSIS0002 - ADD AN ORDER BY CLAUSE TO ENSURE SORTED OUTPUT. **
WITH source1 AS (
   SELECT ProductID, ProductName, Price
   FROM {{ ref('stg_products') }}
),
source2 AS (
   SELECT ProductID, ProductName, Price
   FROM {{ ref('stg_new_products') }}
)
SELECT * FROM source1
UNION ALL
SELECT * FROM source2
-- Add ORDER BY ProductID if sorted output is required
```

### Best Practices

* Add `ORDER BY` clause matching the original SSIS sort keys if order matters
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0003

The SSIS container was converted inline.

### Severity

None

### Description

This FDM indicates that an SSIS container (Sequence Container, For Loop, or ForEach Loop) was converted inline rather than as a separate procedural block. In SSIS, containers create variable scopes and logical groupings. In the Snowflake conversion, container contents are expanded inline within the parent execution context.

This approach offers benefits:

* Improved debugging (direct visibility of all steps)
* Better performance (reduced nesting overhead)
* Simplified execution flow

However, variable scoping works differently—variables are in the parent scope rather than container scope.

### Converted Code

```sql
CREATE OR REPLACE TASK package_main
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0003 - THE SSIS 'SEQUENCE' CONTAINER WAS CONVERTED INLINE. Original container name: Package\Sequence Container **
   BEGIN
      -- Execute SQL Task 1
      INSERT INTO staging_table SELECT * FROM source_table1;

      -- Execute SQL Task 2
      INSERT INTO target_table SELECT * FROM staging_table;
   END;
END;
```

### Best Practices

* Review variable scope changes if the container had local variables
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0004

Add an ORDER BY clause to ensure sorted output.

### Severity

Low

### Description

This FDM is generated when a Merge Join transformation is converted to a standard SQL JOIN. In SSIS, the Merge Join transformation requires sorted inputs and naturally produces a sorted, deterministic output based on the join keys and the merge algorithm. The equivalent SQL JOIN does not guarantee any particular order unless an explicit ORDER BY clause is added.

If the order of rows matters for downstream processing or to match SSIS behavior exactly, add an ORDER BY clause.

### Converted Code

```sql
SELECT
   --** SSC-FDM-SSIS0004 - ADD AN ORDER BY CLAUSE TO ENSURE SORTED OUTPUT. THE SSIS MERGE JOIN TRANSFORMATION ASSUMES SORTED INPUTS AND NATURALLY PRODUCES A SORTED, DETERMINISTIC OUTPUT. THE EQUIVALENT SQL JOIN DOES NOT GUARANTEE ORDER. **
   employeeassignments.employee_id,
   tasks.project_id AS "project identifier",
   employeeassignments.assignment_start_date,
   employeeassignments.assigned_hours,
   tasks.task_id
FROM
   {{ ref('stg_employee_assignments') }} AS employeeassignments
   INNER JOIN {{ ref('stg_tasks') }} AS tasks
      ON employeeassignments.task_id = tasks.task_id
-- Add ORDER BY employee_id, task_id if sorted output is required
```

### Best Practices

* Add `ORDER BY` clause on the join keys if order matters
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0005

Package was converted to stored procedure because it is being reused by other packages.

### Severity

None

### Description

This FDM indicates that an SSIS package was converted to a Snowflake stored PROCEDURE instead of a TASK because it is called by at least one ExecutePackage task from another control flow. This design choice provides several benefits:

**Benefits of PROCEDURE over TASK:**

* **Synchronous execution**: Calling packages wait for completion (matches SSIS behavior)
* **Reusability**: Can be called from multiple locations with different parameters
* **Return values**: Can return status codes or result sets to callers
* **Simpler orchestration**: Direct CALL statements instead of complex EXECUTE TASK chains

**Difference from SSIS:**

* Must be explicitly called with `CALL procedure_name()` instead of automatic execution
* Parameters must be passed explicitly in the CALL statement
* No automatic task scheduling (must be invoked programmatically)

### Converted Code

```sql
--** SSC-FDM-SSIS0005 - PACKAGE WAS CONVERTED TO STORED PROCEDURE BECAUSE IT IS BEING REUSED BY OTHER PACKAGES. **
CREATE OR REPLACE PROCEDURE public.utilitypackage(input_param VARCHAR)
RETURNS VARCHAR
LANGUAGE SQL
AS
$$
BEGIN
   -- Package logic here
   INSERT INTO log_table VALUES (CURRENT_TIMESTAMP(), :input_param);
   RETURN 'SUCCESS';
END;
$$;

-- Parent Package 1 calls the procedure
CREATE OR REPLACE TASK public.parent_package_1_execute_utility
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   CALL public.utilitypackage('param_value_1');
END;

-- Parent Package 2 calls the procedure
CREATE OR REPLACE TASK public.parent_package_2_execute_utility
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   CALL public.utilitypackage('param_value_2');
END;
```

### Best Practices

* Use CALL statements to invoke the procedure from parent packages
* Pass parameters explicitly in the CALL statement
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0006

Event handler stored procedure created but not automatically triggered.

### Severity

None

### Description

This FDM indicates that an SSIS event handler has been converted to a Snowflake stored procedure, but unlike SSIS where event handlers are automatically triggered by runtime events (OnError, OnPreExecute, OnPostExecute, etc.), the generated stored procedure must be invoked manually or through a custom triggering mechanism.

In SSIS, event handlers automatically fire when their associated event occurs during package execution. In Snowflake, there is no built-in event handler mechanism. The converted stored procedure contains the event handler logic but requires explicit invocation using a `CALL` statement.

### Converted Code

```sql
--** SSC-FDM-SSIS0006 - EVENT HANDLER STORED PROCEDURE CREATED BUT NOT AUTOMATICALLY TRIGGERED. MANUAL INVOCATION OR TRIGGERING MECHANISM IMPLEMENTATION REQUIRED. **
CREATE OR REPLACE PROCEDURE public.package_execute_sql_task_onerror_handler ()
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
   BEGIN
      LET User_ErrorCount NUMERIC := public.GetControlVariableUDF('User_ErrorCount', 'MyPackage') :: NUMERIC;
      INSERT INTO TaskErrorLog (TaskName, ErrorTime) VALUES ('Execute SQL Task', CURRENT_TIMESTAMP());
      RETURN 'SUCCESS';
   END;
$$;
```

### Best Practices

* Add explicit `CALL` statements to invoke the event handler procedure at appropriate points in your orchestration
* For OnError handlers, wrap task execution in BEGIN…EXCEPTION blocks and call the handler in the exception handler
* For OnPreExecute/OnPostExecute handlers, add CALL statements before/after the relevant task execution
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0007

Send Mail Task SMTP connection settings are managed by Snowflake.

### Severity

None

### Description

This FDM indicates that SSIS Send Mail Task SMTP connection settings were not converted. In SSIS, you configure custom SMTP server settings through an SMTP Connection Manager. In Snowflake, email delivery is managed entirely through the built-in Notification Integration service, and custom SMTP servers cannot be specified.

This is informational only and does not require action. Snowflake’s email service is reliable and properly configured.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   --** SSC-FDM-SSIS0007 - CUSTOM SMTP SERVER SETTINGS ARE NOT APPLICABLE. SNOWFLAKE MANAGES EMAIL DELIVERY THROUGH THE NOTIFICATION INTEGRATION. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com', 'Test', 'Test message');
END;
```

### Best Practices

* No manual action required
* Snowflake handles email delivery through its managed infrastructure
* Ensure recipients are verified in your Snowflake account
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0008

Send Mail Task FROM address added to email body.

### Severity

None

### Description

This FDM indicates that the SSIS Send Mail Task FROM address has been preserved by prepending it to the email body. Snowflake’s email service uses a fixed sender address managed by your Snowflake account and does not allow custom FROM addresses.

The original FROM address is included in the message body so recipients can see who intended to send the email.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("noreply@company.com", "admin@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   --** SSC-FDM-SSIS0008 - SNOWFLAKE'S EMAIL INTEGRATION USES A FIXED SENDER ADDRESS. THE ORIGINAL FROM ADDRESS HAS BEEN PREPENDED TO THE MESSAGE BODY FOR REFERENCE. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'noreply@company.com,admin@example.com', 'Notification', 'Email sent by: noreply@company.com

Package completed successfully.');
END;
```

### Best Practices

* No manual action required for basic functionality
* The FROM address is preserved in the message body for reference
* Consider updating email templates if the sender information format needs adjustment
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0009

Send Mail Task CC addresses added to recipients list.

### Severity

None

### Description

This FDM indicates that CC (carbon copy) recipients from the SSIS Send Mail Task have been merged into the main recipients list. Snowflake’s `SYSTEM$SEND_EMAIL` does not distinguish between TO and CC recipients. All recipients receive the email, but they will not see the CC distinction in their email client.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com", "team@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   --** SSC-FDM-SSIS0009 - SNOWFLAKE'S SYSTEM$SEND_EMAIL DOES NOT SUPPORT CC ADDRESSING. ALL CC RECIPIENTS HAVE BEEN ADDED TO THE MAIN RECIPIENTS LIST. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com,team@example.com', 'Status Update', 'All systems operational.');
END;
```

### Best Practices

* No manual action required for basic functionality
* All recipients will receive the email successfully
* If TO/CC distinction is important, consider adding recipient information in the email body
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0010

Send Mail Task BCC addresses added to recipients list.

### Severity

None

### Description

This FDM indicates that BCC (blind carbon copy) recipients from the SSIS Send Mail Task have been merged into the main recipients list. This is an important behavioral change: in SSIS, BCC recipients are hidden from other recipients. In Snowflake, all recipients are visible to each other because `SYSTEM$SEND_EMAIL` does not support BCC addressing.

**Privacy concern:** Recipients who were originally BCC’d will now be visible to all other recipients.

### Converted Code

```sql
BEGIN
   BEGIN
      LET my_package_Send_Mail_Task_integration_sql STRING := 'CREATE OR REPLACE NOTIFICATION INTEGRATION my_package_Send_Mail_Task
  TYPE=EMAIL
  ENABLED=TRUE
  ALLOWED_RECIPIENTS=("admin@example.com", "audit@example.com")';
      EXECUTE IMMEDIATE :my_package_Send_Mail_Task_integration_sql;
   END;
   --** SSC-FDM-SSIS0010 - SNOWFLAKE'S SYSTEM$SEND_EMAIL DOES NOT SUPPORT BCC ADDRESSING. ALL BCC RECIPIENTS HAVE BEEN ADDED TO THE MAIN RECIPIENTS LIST, MAKING THEM VISIBLE TO ALL RECIPIENTS. **
   CALL SYSTEM$SEND_EMAIL('my_package_Send_Mail_Task', 'admin@example.com,audit@example.com', 'Audit Trail', 'Process completed.');
END;
```

### Best Practices

* Review if BCC privacy is required for your use case
* If recipients must remain hidden, send separate emails to each BCC recipient
* Consider implementing a wrapper procedure that sends individual emails for BCC scenarios
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0011

Bulk Insert Task MaximumErrors has semantic differences in Snowflake.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `MaximumErrors` setting has been converted to Snowflake’s `ON_ERROR` option, but the behavior differs. In SSIS, `MaximumErrors` specifies the maximum number of errors allowed before the bulk insert fails. In Snowflake, `ON_ERROR` controls the behavior when errors occur but works differently:

| SSIS MaximumErrors | Snowflake ON_ERROR |
| --- | --- |
| 0 (fail on first error) | `ABORT_STATEMENT` |
| N (fail after N errors) | `SKIP_FILE_N` (skips file after N errors) |
| Large value (continue) | `CONTINUE` |

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0011 - SSIS BULKINSERTTASK MAXIMUMERRORS SPECIFIES ERROR COUNT THRESHOLD. SNOWFLAKE ON_ERROR CONTROLS BEHAVIOR WHEN ERRORS OCCUR. REVIEW ERROR HANDLING STRATEGY. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV')
   ON_ERROR = SKIP_FILE_5;
END;
```

### Best Practices

* Review your error tolerance requirements
* `ON_ERROR = CONTINUE` is most permissive (skips bad records)
* `ON_ERROR = ABORT_STATEMENT` stops on first error
* `ON_ERROR = SKIP_FILE_N` skips the file after N errors per file
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0012

Bulk Insert Task BatchSize is managed automatically by Snowflake.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `BatchSize` setting is not applicable in Snowflake. In SSIS, `BatchSize` controls how many rows are committed in each batch transaction. Snowflake’s COPY INTO command manages batching automatically for optimal performance and does not expose batch size configuration.

Snowflake uses micro-partitions and automatic parallelization to achieve high-performance data loading without manual batch tuning.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0012 - SSIS BULKINSERTTASK BATCHSIZE IS NOT AVAILABLE IN SNOWFLAKE. SNOWFLAKE MANAGES BATCHING AUTOMATICALLY FOR OPTIMAL PERFORMANCE. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* No manual action required
* Snowflake automatically optimizes batch processing
* For very large loads, consider splitting into multiple files for parallel processing
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0013

Bulk Insert Task KeepIdentity cannot override Snowflake autoincrement behavior.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `KeepIdentity` option behavior differs in Snowflake. In SQL Server, `KeepIdentity=True` preserves identity values from the source file, while `KeepIdentity=False` allows SQL Server to generate new identity values.

In Snowflake, COPY INTO always loads values from the file as-is. If you need Snowflake to generate identity values, you must either:

* Remove the identity column from the source file
* Load into a staging table and use INSERT with column mapping

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0013 - SSIS BULKINSERTTASK KEEPIDENTITY CANNOT OVERRIDE SNOWFLAKE AUTOINCREMENT. USE EXPLICIT COLUMN MAPPING TO LOAD IDENTITY VALUES INTO NON-AUTOINCREMENT COLUMNS. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* If preserving identity values: load directly (Snowflake default behavior)
* If generating new identity values: remove the identity column from the source file, or use a staging table approach
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

To have Snowflake auto-generate identity values, load to a staging table and insert with an explicit column list:

```sql
-- Step 1: Load all columns to staging
COPY INTO staging_table FROM '@my_stage' FILE_FORMAT = (TYPE = 'CSV');

-- Step 2: Insert with column list (exclude identity column)
INSERT INTO target_table (col1, col2, col3)
SELECT col1, col2, col3 FROM staging_table;
```

## SSC-FDM-SSIS0014

Bulk Insert Task TableLock is not needed in Snowflake.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `TableLock` option is not applicable in Snowflake. In SQL Server, `TableLock=True` acquires a table-level lock during bulk insert for better performance by reducing lock overhead.

Snowflake uses Multi-Version Concurrency Control (MVCC), which allows concurrent reads during writes without explicit locking. Table locks are not needed or supported because Snowflake handles concurrency automatically.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0014 - SSIS BULKINSERTTASK TABLELOCK IS NOT NEEDED IN SNOWFLAKE. MVCC ARCHITECTURE ALLOWS CONCURRENT READS DURING WRITES WITHOUT EXPLICIT LOCKING. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* No manual action required
* Snowflake’s MVCC architecture handles concurrency automatically
* Readers see consistent data without being blocked by writers
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0015

Bulk Insert Task SortedData hint is not available in Snowflake.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `SortedData` option is not applicable in Snowflake. In SQL Server, specifying `SortedData` with a column name hints that the data is pre-sorted, allowing SQL Server to optimize the bulk insert by avoiding re-sorting for clustered index maintenance.

Snowflake does not use traditional indexes. For query optimization on sorted data access patterns, use the `CLUSTER BY` clause on table definitions instead.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0015 - SSIS BULKINSERTTASK SORTEDDATA HINT IS NOT AVAILABLE IN SNOWFLAKE. USE CLUSTER BY ON TABLE DEFINITION FOR SIMILAR OPTIMIZATION. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* If sort optimization is important, define clustering on the target table
* Snowflake automatically manages micro-partition pruning
* Clustering is most beneficial for very large tables with common filter patterns
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

#### Manual Support

Define clustering on the target table for similar optimization to SortedData:

```sql
ALTER TABLE target_table CLUSTER BY (sort_column);
```

## SSC-FDM-SSIS0016

Bulk Insert Task CheckConstraints is always enforced in Snowflake.

### Severity

None

### Description

This FDM indicates that the SSIS Bulk Insert Task `CheckConstraints` option behavior differs in Snowflake. In SQL Server, `CheckConstraints=False` (the default for bulk insert) disables CHECK constraint validation during the load for better performance.

In Snowflake, constraints are always validated during data loading. However, Snowflake’s constraint enforcement is different from SQL Server—NOT NULL constraints are enforced, but CHECK, UNIQUE, and FOREIGN KEY constraints are not enforced (they are informational only).

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0016 - SSIS BULKINSERTTASK CHECKCONSTRAINTS OPTION IS IMPLICIT IN SNOWFLAKE. CONSTRAINTS ARE ALWAYS VALIDATED DURING DATA LOADING. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV');
END;
```

### Best Practices

* Review constraint requirements for data integrity
* NOT NULL constraints are enforced by Snowflake
* CHECK, UNIQUE, and FOREIGN KEY constraints are informational only
* Implement data validation logic in your ETL process if strict constraint checking is required
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0017

Bulk Insert Task field terminator not specified, using SSIS default value.

### Severity

None

### Description

This FDM is generated when an SSIS Bulk Insert Task does not explicitly specify a field terminator (`FieldTerminator`) in the `.dtsx` package configuration. When no field terminator is specified, SnowConvert uses the SSIS default value (typically a tab character `\t` or comma `,`). The default value is applied to the `FIELD_DELIMITER` option in the generated Snowflake `COPY INTO` statement.

Verify that the default field terminator matches the actual format of your data file to ensure correct column parsing.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0017 - FIELD TERMINATOR NOT SPECIFIED IN DTSX. USING SSIS DEFAULT VALUE ','. VERIFY THIS MATCHES YOUR DATA FILE FORMAT. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\r\n')
   ON_ERROR = CONTINUE;
END;
```

### Best Practices

* Verify the default field terminator matches your actual data file format
* Open the source data file and confirm the column separator character
* Update the `FIELD_DELIMITER` value in the FILE_FORMAT if it does not match
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0018

Bulk Insert Task row terminator not specified, using SSIS default value.

### Severity

None

### Description

This FDM is generated when an SSIS Bulk Insert Task does not explicitly specify a row terminator (`RowTerminator`) in the `.dtsx` package configuration. When no row terminator is specified, SnowConvert uses the SSIS default value (typically `\r\n` on Windows). The default value is applied to the `RECORD_DELIMITER` option in the generated Snowflake `COPY INTO` statement.

Verify that the default row terminator matches the actual format of your data file to ensure correct row parsing.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_bulk_insert_task
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   --** SSC-FDM-SSIS0018 - ROW TERMINATOR NOT SPECIFIED IN DTSX. USING SSIS DEFAULT VALUE '\r\n'. VERIFY THIS MATCHES YOUR DATA FILE FORMAT. **
   COPY INTO target_table
   FROM '@my_stage'
   FILE_FORMAT = (TYPE = 'CSV', FIELD_DELIMITER = ',', RECORD_DELIMITER = '\r\n')
   ON_ERROR = CONTINUE;
END;
```

### Best Practices

* Verify the default row terminator matches your actual data file format
* Files created on Windows typically use `\r\n`, while Unix/Linux files use `\n`
* Update the `RECORD_DELIMITER` value in the FILE_FORMAT if it does not match
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0019

Excel Source file staging required.

### Severity

None

### Description

This FDM appears in the migration assessment report for every Excel Source component conversion. It is a reminder that Snowflake requires Excel files to be staged before they can be processed.

In SSIS, Excel Source components read directly from file system paths (e.g., `C:\Data\sales.xlsx`). In Snowflake, files must be uploaded to a stage (internal or external) before they can be queried by the generated `excel_source_udf` UDF.

The assessment report message includes the stage file path variable name used in the generated code, which you must configure with the actual location of your staged Excel file.

### Converted Code

The generated dbt model references the stage path via a dbt variable:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sales', 'YES'))
),
parsed_data AS
(
   SELECT
      data['ProductName'] :: VARCHAR AS ProductName,
      TRY_TO_DOUBLE(data['Quantity'] :: VARCHAR) AS Quantity
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
```

Update the variable in `dbt_project.yml`:

```yaml
vars:
  Package_Data_Flow_Task_Excel_Source_stage_file_path: '@my_stage/sales.xlsx'
```

### Best Practices

* Create a Snowflake stage: `CREATE OR REPLACE STAGE my_stage;`
* Upload your Excel file: `PUT file:///path/to/sales.xlsx @my_stage AUTO_COMPRESS = FALSE;`
* Update the stage file path variable in `dbt_project.yml` to point to the staged file
* Verify the file is staged correctly using `LIST @my_stage;`
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0020

Excel Source legacy format may have behavioral differences.

### Severity

None

### Description

This FDM appears in the migration assessment report when an SSIS Excel Source component references a legacy `.xls` (Excel 97-2003) or binary `.xlsb` file format. These legacy formats may have minor differences in date and number handling compared to the modern `.xlsx` format.

The generated `excel_source_udf` UDF auto-detects the format from the filename extension, so the generated code works with all Excel formats. However, certain data type conversions — particularly dates stored as serial numbers — may behave differently between formats.

In legacy Excel formats, dates are stored as serial numbers (days since 1899-12-30). If date columns appear as numbers in the output, use the `DATEADD` serial conversion pattern.

### Converted Code

The generated dbt model is the same for all Excel formats:

```sql
WITH excel_raw_data AS
(
   SELECT
      data
   FROM
      TABLE(excel_source_udf('{{ var('Package_Data_Flow_Task_Excel_Source_stage_file_path') }}', 'Sheet1', 'YES'))
),
parsed_data AS
(
   SELECT
      data['OrderId'] :: VARCHAR AS OrderId,
      TRY_TO_DATE(data['OrderDate'] :: VARCHAR) AS OrderDate,
      TRY_TO_DOUBLE(data['Amount'] :: VARCHAR) AS Amount
   FROM
      excel_raw_data
)
SELECT
   *
FROM
   parsed_data
```

If date columns come through as serial numbers, convert them manually:

```sql
-- Convert Excel date serial number to Snowflake DATE
DATEADD('day', TRY_TO_DOUBLE(data['OrderDate'] :: VARCHAR), '1899-12-30' :: DATE) AS OrderDate
```

### Best Practices

* Convert legacy `.xls` or `.xlsb` files to `.xlsx` format before migration if possible
* For date columns that appear as numbers, use `DATEADD('day', serial_number, '1899-12-30')` to convert
* Test the output data types and values against the original SSIS component’s results
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0021

SSIS Pivot PassThroughUnmatchedPivotKeys behavioral difference.

### Severity

Low

### Description

This FDM is generated when an SSIS Pivot transformation has the `PassThroughUnmatchedPivotKeys` property enabled. In SSIS, this setting causes rows with unmatched pivot key values to be passed through to a separate output for further processing. The converted SQL uses `CASE WHEN ... THEN` with `GROUP BY`, which produces `NULL` values for unmatched pivot keys instead of routing them to a separate output.

If your downstream logic depends on capturing unmatched pivot key rows, you must implement additional filtering to separate matched and unmatched rows.

### Converted Code

```sql
--** SSC-FDM-SSIS0021 - SSIS PIVOT WITH PASSTHROUGHUNMATCHEDPIVOTKEYS ENABLED PASSES UNMATCHED PIVOT KEY ROWS TO A SEPARATE OUTPUT. THE CONVERTED SQL PRODUCES NULL VALUES FOR UNMATCHED PIVOT KEYS INSTEAD. **
WITH source_data AS
(
   SELECT
      CustomerName,
      Product,
      Quantity
   FROM
      {{ ref('stg_raw__source') }}
)
SELECT
   CustomerName,
   MAX(CASE
      WHEN Product = 'Bike'
         THEN Quantity
   END) AS Bike
FROM
   source_data
GROUP BY
   CustomerName
```

### Best Practices

* If unmatched rows need separate handling, add a WHERE clause to filter rows where all pivot columns are NULL
* Consider creating a separate model for unmatched rows using `NOT IN` or `NOT EXISTS` against the expected pivot key values
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0022

SSIS Pivot assumes input data is sorted by Set Key.

### Severity

Low

### Description

This FDM is generated when an SSIS Pivot transformation is converted to SQL. The SSIS Pivot transformation assumes that the input data is sorted by the Set Key column, and the transformation’s output preserves this sort order. The converted SQL uses `GROUP BY`, which does not require or preserve sort order. If downstream consumers depend on sorted output, an explicit `ORDER BY` clause must be added.

### Converted Code

```sql
--** SSC-FDM-SSIS0022 - THE SSIS PIVOT TRANSFORMATION ASSUMES INPUT DATA IS SORTED BY THE SET KEY COLUMN. THE CONVERTED SQL USES GROUP BY WHICH DOES NOT REQUIRE OR PRESERVE SORT ORDER. VERIFY THAT DOWNSTREAM CONSUMERS DO NOT DEPEND ON SORTED OUTPUT. **
WITH source_data AS
(
   SELECT
      Region,
      CustomerName,
      Product,
      Quantity
   FROM
      {{ ref('stg_raw__source') }}
)
SELECT
   Region,
   CustomerName,
   MAX(CASE
      WHEN Product = 'Bike'
         THEN Quantity
   END) AS Bike,
   MAX(CASE
      WHEN Product = 'Helmet'
         THEN Quantity
   END) AS Helmet
FROM
   source_data
GROUP BY
   Region,
   CustomerName
```

### Best Practices

* Add an `ORDER BY` clause if downstream consumers depend on sorted output
* Use the same sort key as the original SSIS Pivot Set Key column
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0023

SSIS expression circular variable reference detected.

### Severity

Medium

### Description

This FDM is generated when SnowConvert detects a circular reference while expanding an SSIS variable expression. A circular reference occurs when a variable’s expression references itself (directly or indirectly through other variables). When this happens, SnowConvert uses the default value for the variable’s data type instead of attempting to resolve the infinite loop.

Review the original SSIS variable definitions to determine the intended value and update the generated code accordingly.

### Converted Code

```sql
--** SSC-FDM-SSIS0023 - CIRCULAR REFERENCE DETECTED WHEN EXPANDING VARIABLE 'User::Counter'. THE DEFAULT VALUE FOR TYPE 'Int32' HAS BEEN USED INSTEAD. **
CREATE OR REPLACE TASK public.my_package
WAREHOUSE=DUMMY_WAREHOUSE
AS
BEGIN
   LET User_Counter NUMERIC := 0;
   -- Original variable expression referenced itself: @[User::Counter] + 1
   -- Default value for Int32 (0) was used instead
END;
```

### Best Practices

* Review the original SSIS variable definitions to identify the circular dependency chain
* Determine the intended initial value and update the generated code manually
* Break the circular dependency by reorganizing variable assignments
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0024

SSIS FileSystemTask directory path translated to Snowflake stage prefix.

### Severity

None

### Description

This FDM indicates that an SSIS File System Task directory operation has been translated to use Snowflake stage prefix-based paths. Unlike traditional file systems where directories are distinct entities, Snowflake stages use prefix-based paths — a “directory” is simply a common prefix shared by multiple file paths.

When the generated code uses `REMOVE @stage/path/` with a trailing slash, it deletes **all files** that match the prefix pattern `path/`, not a single directory entry. This is functionally equivalent to recursively deleting a directory in a traditional file system, but the semantics are different.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_delete_temp_directory
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_TempFolder VARCHAR := public.GetControlVariableUDF('User_TempFolder', 'package') :: VARCHAR;
   ---- Start block 'Package\Delete Temp Directory'
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   --** SSC-FDM-SSIS0024 - SNOWFLAKE STAGES USE PREFIX-BASED PATHS, NOT REAL DIRECTORIES. THE REMOVE COMMAND WITH A TRAILING SLASH DELETES ALL FILES MATCHING THE PREFIX PATTERN. **
   EXECUTE IMMEDIATE CONCAT('REMOVE ', :User_TempFolder, '/');
   ---- End block 'Package\Delete Temp Directory'

END;
```

### Best Practices

* Ensure the trailing slash is present in directory operations to match all files under the prefix
* Be cautious: `REMOVE @stage/data/` will delete all files starting with `data/`, including nested paths like `data/subdir/file.txt`
* For delete-directory-content operations (keep the directory but remove its contents), the generated code re-creates a `.keep` placeholder file after the REMOVE to preserve the prefix
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0025

SSIS FileSystemTask variable must contain a valid Snowflake stage path.

### Severity

None

### Description

This FDM indicates that the generated File System Task operation uses an SSIS variable to build the stage path dynamically at runtime. The variable value must contain a valid Snowflake stage path (e.g., `@my_stage/path/to/file.txt`) when the task executes.

In SSIS, File System Task variables typically contain local file system paths (e.g., `C:\Data\input.csv`). After migration, these variables must be updated to hold Snowflake stage paths instead. The variable value is set through the task CONFIG section or the `UpdateControlVariable` procedure.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_delete_source_file
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_SourceFilePath VARCHAR := public.GetControlVariableUDF('User_SourceFilePath', 'package') :: VARCHAR;
   ---- Start block 'Package\Delete Source File'
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   EXECUTE IMMEDIATE CONCAT('REMOVE ', :User_SourceFilePath);
   ---- End block 'Package\Delete Source File'

END;
```

### Best Practices

* Update the variable’s default value in the task CONFIG section to use a Snowflake stage path (e.g., `@my_stage/data/input.csv`)
* Ensure the stage path includes the `@` prefix followed by the stage name
* For directory paths, include a trailing slash (e.g., `@my_stage/data/output/`)
* Test with `LIST @my_stage` to verify the file exists at the expected path before running the task
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SSIS0026

SSIS FileSystemTask CreateDirectory translated using COPY INTO placeholder file.

### Severity

None

### Description

This FDM indicates that an SSIS File System Task `CreateDirectory` operation has been translated using a workaround because Snowflake stages do not support empty directories. In a traditional file system, you can create an empty directory. In Snowflake, a “directory” only exists as long as there are files with that prefix.

To simulate directory creation, the generated code writes a `.dummy` placeholder file to the target path using `COPY INTO`. This ensures the prefix exists and can be referenced by subsequent operations.

### Converted Code

```sql
CREATE OR REPLACE TASK public.package_create_output_directory
WAREHOUSE=DUMMY_WAREHOUSE
AFTER public.package
AS
BEGIN
   LET User_NewDirectory VARCHAR := public.GetControlVariableUDF('User_NewDirectory', 'package') :: VARCHAR;
   ---- Start block 'Package\Create Output Directory'
   --** SSC-FDM-SSIS0026 - SINCE SNOWFLAKE STAGES USE PREFIX-BASED PATHS, THE DIRECTORY IS CREATED BY WRITING A .DUMMY PLACEHOLDER FILE TO THE TARGET PATH VIA COPY INTO. **
   --** SSC-FDM-SSIS0025 - THE VARIABLE(S) VALUE(S) MUST CONTAIN A VALID SNOWFLAKE STAGE PATH. **
   EXECUTE IMMEDIATE CONCAT('COPY INTO ', :User_NewDirectory, '/.dummy FROM (SELECT ''empty'') FILE_FORMAT = (TYPE = CSV COMPRESSION = NONE) OVERWRITE = TRUE SINGLE = TRUE');
   ---- End block 'Package\Create Output Directory'

END;
```

### Best Practices

* The `.dummy` file is a zero-overhead placeholder and can be left in place
* If the original SSIS task was configured with `OverwriteDestinationFile = False`, the `OVERWRITE = TRUE` clause is omitted, meaning the COPY INTO will fail if the `.dummy` file already exists (simulating “fail if directory exists”)
* Subsequent file operations to the same directory path will work correctly regardless of the `.dummy` file
* Use the [SnowConvert Migration Assistant](../../../../migration-assistant/README.md) to get AI-powered explanations and actionable solutions for this FDM
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Supported Languages
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/README.md
section: Migrations
---

# SnowConvert AI - Supported Languages

* [Teradata](teradata.md)
* [Oracle](oracle.md)
* [SQL Server](sql-server.md)
* [RedShift](redshift.md)
* [Azure Synapse](azure-synapse.md)
* [Google BigQuery](google-bigquery.md)
* [IBM DB2](ibm-db2.md)
* [Sybase IQ](sybase-iq.md)
* [PostgreSQL-Greenplum-Netezza](postgresql-and-based-languages.md)
* [Hive-Spark- Databricks SQL](hive-spark-databricks-sql.md)
* [Vertica](vertica.md)

---
title: SnowConvert AI - Sybase IQ
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/sybase-iq.md
section: Migrations
---

# SnowConvert AI - Sybase IQ

The first step for migration is getting the code that you need to migrate. There are many ways to extract the code from your database. However, we recommend using the extraction scripts provided by Snowflake.

All the source code for these scripts is open source and is available on [GitHub](https://github.com/Snowflake-Labs/SC.DDLExportScripts/).

## Prerequisites

* Sybase IQ client utilities installed and accessible, specifically iqunload (or iqunload.bat on Windows).
* Sufficient privileges for the user in the target database to extract DDL.
* Disk space in the output directory for the consolidated SQL and split files.
* A valid Sybase IQ connection string (examples below).

## Installing the scripts

Go to <https://github.com/Snowflake-Labs/SC.DDLExportScripts/>

From the Code option, select the drop-down and use the **Download ZIP** option to download the code.

Decompress the ZIP file. The code for Sybase IQ should be under the “Sybase IQ” folder.

Follow the [Usage instructions](https://github.com/Snowflake-Labs/SC.DDLExportScripts/tree/main/Sybase%20IQ#readme) to modify the files and run them on your system.

Once the script execution finishes, the output folder will contain all the DDLs for the migration.

---
title: SnowConvert AI - Sybase IQ
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/sybase-iq.md
section: Migrations
---

# SnowConvert AI - Sybase IQ

## What is SnowConvert AI for Sybase IQ?

SnowConvert AI is a software tool that understands Sybase IQ scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for Sybase IQ performs the following conversions:

### Sybase IQ to Snowflake SQL

SnowConvert AI understands the Sybase IQ source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake.

#### Sample code

Sybase IQ basic input code:

```sql
 CREATE TABLE Persons (
    PersonID int,
    LastName varchar(255),
    FirstName varchar(255),
    Address varchar(255),
    City varchar(255)
);
```

Snowflake SQL output code:

```sql
 CREATE OR REPLACE TABLE Persons (
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255),
    Address VARCHAR(255),
    City VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"sybase"}}'
;
```

As you can see, most of the structure remains the same. For example, some cases require the data types to be transformed.

#### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI: the software that converts your Sybase IQ files securely and automatically to the Snowflake cloud data platform.*
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* Parsing is an initial process by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

On the following few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Sybase IQ is capable of. If you’re ready, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Sybase IQ
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/README.md
section: Migrations
---

# SnowConvert AI - Sybase IQ

Translation specification for Sybase IQ grammar syntax

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Sybase IQ currently supports assessment and translation for TABLES, VIEWS, STORED PROCEDURES, and FUNCTIONS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Sybase IQ grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Sybase IQ - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-built-in-functions.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - Built-in functions

> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

## Built-in Functions

> This section describes each SQL function individually. ([Sybase SQL Language Reference Functions](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a531c50684f21015af65db434a6a1ab5.html?version=16.1.5.0&amp;locale=en-US)).

| Sybase | Snowflake Equivalent |
| --- | --- |
| [ABS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a532439384f21015be5cb176f7ecbae4.html) ( numeric-expression) | [ABS](https://docs.snowflake.com/en/sql-reference/functions/abs) ( numeric-expression) |
| [ACOS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a532c20484f21015a4a5f8c26e3af9c7.html) ( numeric-expression) | [ACOS](https://docs.snowflake.com/en/sql-reference/functions/acos) ( numeric-expression) |
| [ARGN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53342da84f21015892d9495d775376f.html) (integer-expression, expression [ , …] ) | None  *Note: Snowflake does not contain a similar built-in function, a UDF might be created to emulate the Sybase behavior.* |
| [ASCII](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a533e3a684f21015a2a0af73e4a9ad1c.html) ( string-expression) | [ASCII](https://docs.snowflake.com/en/sql-reference/functions/ascii) ( numeric-expression) |
| [ASIN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a534668f84f2101599958685dfc4673b.html) ( numeric-expression) | [ASIN](https://docs.snowflake.com/en/sql-reference/functions/asin) ( numeric-expression) |
| [ATAN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a534e83384f21015a17bd5947c1575b2.html) ( numeric-expression) | [ATAN](https://docs.snowflake.com/en/sql-reference/functions/atan) ( numeric-expression) |
| [ATAN2](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5356c1b84f210159f68d03274510fe6.html) (numeric-expression, numeric-expression) | [ATAN2](https://docs.snowflake.com/en/sql-reference/functions/atan2) ( numeric-expression, numeric-expression) |
| [AVG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a535f04784f2101590f89a693842c970.html) ( [DISTINCT] column-name) [OVER ...] | [AVG](https://docs.snowflake.com/en/sql-reference/functions/avg) ( [DISTINCT] column-name) [OVER ...] |
| [BFILE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5366ee684f21015b5b0e80fe42dc31f.html) ( file-name-expression, large-object-column ) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [BIGINTTOHEX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5370dd584f21015a902e1868e059b79.html) (integer-expression) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [BIT_LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a537928a84f210158191ea44ca58ee8e.html) (large-object-column) | [BIT_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/bit_length) (string_or_binary)  *Note: Snowflake doesn't use fractional bytes, so length is always calculated as 8 \* OCTET_LENGTH.* |
| [BYTE_INSERTSTR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f411c06ce21014834ca45160d818e3.html)( insert-position , source-string , insert-string ) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [BYTE_LENGTH64](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a538947b84f21015b13989839189a494.html)(large-object-column) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [BYTE_LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53816b784f210159b849878d71ab1a8.html)(string-expression) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [CAST](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53996d784f21015a34086a244c40db1.html) (expression AS data type) | [CAST](https://docs.snowflake.com/en/sql-reference/functions/cast)(source_expr AS target_data_type) |
| [CEIL](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53a419c84f21015b689e542cbf26996.html) (numeric-expression) | [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil)( input_expr [, scale_expr ] ) |
| [CHAR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53b50f084f210159a74ba4e4e50f914.html) (integer-expression) | [CHAR](https://docs.snowflake.com/en/sql-reference/functions/chr) (integer-expression) |
| [CHAR_LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53bd3d384f21015bcf88da636a1a768.html) (string-expression) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) (string or binary-expression) |
| [CHAR_LENGTH64](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53c545784f21015bc94cb2f1bd99abc.html)(long-varchar-expression) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length) (string or binary-expression) |
| [CHARINDEX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53cde2984f210158cbd968731b1879c.html)(string-expression1, string-expression2) | [CHARINDEX](https://docs.snowflake.com/en/sql-reference/functions/charindex)string-expression1, string-expression2. [start-pos]) |
| [COALESCE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53d627984f21015a1fa9a5eb36a5dde.html) (expression, expression, [...]) | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce) (expression, expression, [...]) |
| [COL_LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53de5ea84f21015b906dcfb2dc6b447.html) (table-name, column-name) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [COL_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53e665984f21015b7c5d3e09dd1397a.html)(table-id, column-id [, database-id]) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [CONNECTION_PROPERTY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53eeaf984f21015974f97e3388d1738.html)( { integer-expression1 | string-expression } … [ , integer-expression2 ] ) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [CONVERT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53f6efb84f21015af0e8594ce5cd68e.html)( data-type, expression [ , format-style ] ) | [CAST](https://docs.snowflake.com/en/sql-reference/functions/cast)(source_expr AS target_data_type) |
| [CORR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a53fefea84f21015a7ac9e118cc9795c.html)( dependent-expression, independent-expression ) [OVER ...] | [CORR](https://docs.snowflake.com/en/sql-reference/functions/corr)( dependent-expression, independent-expression ) [OVER ...] |
| [COS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5406e3184f21015956e83d802a05631.html) ( numeric-expression) | [COS](https://docs.snowflake.com/en/sql-reference/functions/cos) ( numeric-expression) |
| [COT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a540f97a84f21015bfc68a88c0565f03.html) ( numeric-expression) | [COT](https://docs.snowflake.com/en/sql-reference/functions/cot) ( numeric-expression) |
| [COVAR_POP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a541901c84f21015b699cc40f6738ebc.html) ( dependent-expression, independent-expression ) [OVER ...] | [COVAR_POP](https://docs.snowflake.com/en/sql-reference/functions/covar_pop) ( dependent-expression, independent-expression ) [OVER ...] |
| [COVAR_SAMP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5420eb484f21015be7f881364efd165.html) ( dependent-expression, independent-expression ) [OVER ...] | [COVAR_SAMP](https://docs.snowflake.com/en/sql-reference/functions/covar_samp) ( dependent-expression, independent-expression ) [OVER ...] |
| [COUNT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54290fd84f21015b7dddc9484de19d0.html)( \* | expression | DISTINCT column-name ) [OVER ...] | [COUNT](https://docs.snowflake.com/en/sql-reference/functions/count) ( \* | expression | DISTINCT column-name ) [OVER ...] |
| [CUME_DIST](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54314be84f210159603ce84a892876c.html) () [OVER ...] | [CUME_DIST](https://docs.snowflake.com/en/sql-reference/functions/cume_dist) [OVER ...] |
| [DATE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a544131284f21015a70ed7c8e7db2f8b.html)(string-expression) | [DATE](https://docs.snowflake.com/en/sql-reference/functions/to_date)(string-expression, [format]) |
| [DATEADD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5449deb84f210159a75e748a099539f.html)( date-part, numeric-expression, date-expression ) | [DATEADD](https://docs.snowflake.com/en/sql-reference/functions/dateadd)( date-part, numeric-expression, date-expression ) |
| [DATECEILING](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a545210684f21015bfabb0f3f2ce3eae.html) ( date-part, numeric-expression[, multiple-expression] ) | None  *Note: Snowflake does not contain a similar built-in function.* |
| [DATEDIFF](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a545a63784f210158075c22cd6f85d3a.html)( date-part, date-expression1, date-expression2 ) | [DATEDIFF](https://docs.snowflake.com/en/sql-reference/functions/datediff) ( date-part, date-expression1, date-expression2 )  *Note: Transformation Needs Review.* |
| [DATEFLOOR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5462b6184f21015a0c4efd06d244945.html) ( date-part, datetime-expression [, multiple-expression ] ) | None  *Note: Pending Transformation.* |
| [DATENAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5472b7084f21015892b91f8f67b6ef9.html) ( date-part, date-expression ) | None  *Note: Pending Transformation.* |
| [DATEPART](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a547b06f84f210158ab3bd499f292d99.html)( date-part, date-expression ) | None  *Note: Pending Transformation.* |
| [DATEROUND](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5483a3f84f21015ba1087485982b02f.html)( date-part, datetime-expression [, multiple-expression ] ) | None  *Note: Pending Transformation.* |
| [DATETIME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a548c21f84f210158350cf2fab822610.html)( expression ) | [TO_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/to_timestamp) (expression) |
| [DAY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5493fe284f2101587fac052951c6f01.html)( date-expression ) | [DAY](https://docs.snowflake.com/en/sql-reference/functions/year)( date-expression ) |
| [DAYNAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a549c43b84f21015a569d8e52c4af3f8.html)( date-expression ) | DAYNAME_UDF(date-expression ) |
| [DAYS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54a45b584f21015a4c2ab2c117fc738.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | DAYS_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [DB_ID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54ac47c84f2101591f3cf37067c4ad5.html)( [ database-name ] ) | DB_ID_UDF  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [DB_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54b690484f21015b21ebe23e239b7fb.html)( [ database-id ] ) | CURRENT_DATABASE( ) |
| [DB_PROPERTY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54c05bf84f210159e15ebbba6819ce4.html)( { property-id | property-name } [ , { database-id | database-name } ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [DEGREES](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54c87d684f21015a9b9f518179a73ff.html)( numeric-expression ) | [DEGREES](https://docs.snowflake.com/en/sql-reference/functions/degrees)( numeric-expression ) |
| [DENSE_RANK](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54d078b84f21015b96984e51c0cb74a.html) () [OVER ...] | [DENSE_RANK](https://docs.snowflake.com/en/sql-reference/functions/dense_rank) () [OVER ...] |
| [DIFFERENCE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54d8aac84f210158ef283ad984de764.html)( string-expression1, string-expression2 ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [DOW](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54e817784f21015bfbbc50ea9eaecba.html)( date-expression ) | [DAYOFWEEK](https://docs.snowflake.com/en/sql-reference/functions/year)( date-expression ) |
| [ENCRYPT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f72ceb6ce210149256a7523672a8bb.html)( string-expression , key [ , algorithm-format [ , initialization-vector ] ] ) | [ENCRYPT](https://docs.snowflake.com/en/sql-reference/functions/encrypt)( value_to_encrypt , passphrase , [ [ additional_authenticated_data , ] encryption_method ] )  *Note: Pending Review* |
| [ERRORMSG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a54f2ead84f210158668ce108de25460.html)( [ sqlstate | sqlcode ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [EXP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55131d984f21015966fac9e1cb19b02.html)(numeric-expression) | [EXP](https://docs.snowflake.com/en/sql-reference/functions/exp)(numeric-expression) |
| [EXP_WEIGHTED_AVG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a551b4fb84f210158a07f463ff01b5e2.html)(expression, period-expression) [OVER ...] | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [EXTRACT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/5225b3c7a7c54e5ab49d6888dd756800.html)( date-part FROM timestamp-expression ) | [EXTRACT](https://docs.snowflake.com/en/sql-reference/functions/extract)( date-part FROM timestamp-expression )  *Note: Pending Review* |
| [FIRST_VALUE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5523f3c84f21015aa0092a61fcc2714.html) | [FIRST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/first_value) |
| [FLOOR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a552c1cc84f21015bfc3d6309d6785d6.html)(numeric-expression) | [FLOOR](https://docs.snowflake.com/en/sql-reference/functions/floor)(numeric-expression) |
| [GETDATE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a553449784f21015aba2a0fc3f4ce78c.html)() | [GETDATE](https://docs.snowflake.com/en/sql-reference/functions/getdate)() |
| [HASH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f88c0c6ce210149fbfd5e80df422c4.html)( expression [ , algorithm ] ) | [HASH](https://docs.snowflake.com/en/sql-reference/functions/hash)  *Note: Pending Transformation* |
| [HEXTOBIGINT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55548d184f21015b2d58684e0bb094a.html)( hexadecimal-string ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HEXTOINT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a555d0f984f210158262871887ce5bc9.html)( hexadecimal-string ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HOUR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55651ad84f210158eceac6470043938.html)( datetime-expression ) | [HOUR](https://docs.snowflake.com/en/sql-reference/functions/hour-minute-second)( datetime-expression ) |
| [HOURS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a556e14084f210158443b519970bb86d.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | HOURS_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [HTML_DECODE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8bbdd6ce21014a76ca7e38126b096.html)( string-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTML_ENCODE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8c4b16ce21014aa14a17df2f5d8b1.html)( string-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTTP_DECODE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8d6e36ce210148909b1cd1df91fc7.html)( string-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTTP_ENCODE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8df8c6ce21014969efaff4e96db5c.html)( string-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTTP_HEADER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8e9406ce210149888f72003377b08.html)( header-field-name [ , instance ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTTP_RESPONSE_HEADER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8f31b6ce21014b2f9e1f931f49fcd.html)( header-field-name [ , instance ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [HTTP_VARIABLE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81f8fd606ce21014a283ae906e71712f.html)( var-name [ , instance [ , attribute ] ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [IFNULL](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a557e29b84f21015b460f69ff0fed6da.html) ( expression1, expression2 [ , expression3 ] ) | [IFNULL](https://docs.snowflake.com/en/sql-reference/functions/ifnull)  *Note:*  *Is transformed to*  *IFF(input is null, expression2, expression3) when the expression3 is present if not the third parameter will be NULL.* |
| [INSERTSTR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a558efff84f210159092915333b9e6df.html)( numeric-expression, base_expr, insert_expr ) | [INSERT](https://docs.snowflake.com/en/sql-reference/functions/insert)( base_expr, pos, len, insert_expr ) |
| [INTTOHEX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55971e984f21015845192079b46b239.html)(integer-expression) | None |
| [ISDATE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a559f0f684f21015b95ee838e6da62dc.html)( string-expression ) | IS_DATE_UDF |
| [ISNULL](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55a73cd84f21015ae0b9236251e12e7.html)( expression, expression [ …, expression ] ) | [COALESCE](https://docs.snowflake.com/en/sql-reference/functions/coalesce)( expression, expression [ …, expression ] ) |
| [ISNUMERIC](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55af5d284f21015867a9c978b63f5c1.html)( string-expression ) | IS_NUMERIC_UDF( string-expression ) |
| [LAG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55b772a84f2101583fef0038bcd8bb0.html)( value_expr [, offset [, default ] ] ) [OVER ...] | [LAG](https://docs.snowflake.com/en/sql-reference/functions/lag) ( value_expr [, offset [, default ] ] ) [OVER ...] |
| [LAST_VALUE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55bfa7784f21015b86bd5dcfa28a6a5.html)(expression [IGNORE NULLS | RESPECT NULLS]) OVER ... | [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value)(expression [IGNORE NULLS | RESPECT NULLS]) OVER ... |
| [LCASE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55c82d484f210158fe3bfeba4f0e0bd.html) ( string-expression ) | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower) ( string-expression ) |
| [LEAD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55d051484f21015b82fe3d1795a7a94.html) ( value_expr [, offset [, default ] ] ) [OVER ...] | [LEAD](https://docs.snowflake.com/en/sql-reference/functions/lead) ( value_expr [, offset [, default ] ] ) [OVER ...] |
| [LEFT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55d883284f210158c5ec15e3e69239f.html)( string-expression, numeric-expression ) | [LEFT](https://docs.snowflake.com/en/sql-reference/functions/left)( string-expression, numeric-expression ) |
| [LEN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55e08c884f210159d0cec6bce940d82.html)( string-expression ) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length)( string-expression ) |
| [LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55ea65684f21015a60794ef54777c14.html)( string-expression ) | [LENGTH](https://docs.snowflake.com/en/sql-reference/functions/length)( string-expression ) |
| [LIST](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a2984e5584f21015bddde2495874815d.html)  ([ALL | DISTINCT] string-expresssion [, 'delimiter-string'] [ORDER BY order-by-expression [ ASC | DESC ], ... ] ) [OVER ...] | [LISTAGG](https://docs.snowflake.com/en/sql-reference/functions/listagg)  ([ DISTINCT ] expr1  [, delimiter ] ) [ WITHIN GROUP ( orderby_clause ) ] OVER ( [ PARTITION BY expr2 ] )  *Note: ALL Keyword not supported in snowflake.* |
| [LN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a55f245c84f21015b1f7fdabe2f902dc.html)(numeric-expression) | [LN](https://docs.snowflake.com/en/sql-reference/functions/ln)(numeric-expression) |
| [LOG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a560332084f21015bf3b92161333e171.html)(numeric-expression) | [LN](https://docs.snowflake.com/en/sql-reference/functions/ln)(numeric-expression) |
| [LOG10](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a560b1f984f210158a13cb8a24202e26.html)(numeric-expression) | [LOG(10, N)](https://docs.snowflake.com/en/sql-reference/functions/log) |
| [LOWER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a561324784f2101582439eaf6377b80b.html)( string-expression ) | [LOWER](https://docs.snowflake.com/en/sql-reference/functions/lower)( string-expression ) |
| [LPAD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/7bf4b4293b56487bbabf9c2f3d01b364.html)( str, n [, pattern ] ) | [LPAD](https://docs.snowflake.com/en/sql-reference/functions/lpad)( str, n [, pattern ] ) |
| [LTRIM](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a561eaf184f2101596bab303110c20fb.html)( string-expression, [ trim_character_set ] ) | [LTRIM](https://docs.snowflake.com/en/sql-reference/functions/ltrim)( string-expression, [ trim_character_set ] )  *Note: Snowflake is case-sensitive by default and affects operations with strings.* |
| [MAX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5626d6684f210158cafad316e131142.html) ([DISTINCT] column-name) [OVER ...] | [MAX](https://docs.snowflake.com/en/sql-reference/functions/max) (column-name) [OVER ...]  *Note: Usage of the DISTINCT keyword does not affect the result.* |
| [MEDIAN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a562edfc84f210159175c2831fabbd47.html) ( [ ALL | DISTINCT ] expression ) [OVER ...] | [MEDIAN](https://docs.snowflake.com/en/sql-reference/functions/median) ( expression ) [OVER ...]  *Note: Usage of the ALL has no effect on the function since it counts all by default. The DISTINCT keyword is not supported.* |
| [MIN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5638af584f210158d1fe90a3fb7c0ec.html) ([DISTINCT] column-name) [OVER ...] | [MIN](https://docs.snowflake.com/en/sql-reference/functions/min) ( expression ) [OVER ...]  *Note: Usage of the DISTINCT keyword does not affect the result.* |
| [MINUTE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5640f2284f21015825db935889f60d9.html)( datetime-expression ) | [MINUTE](https://docs.snowflake.com/en/sql-reference/functions/hour-minute-second)( datetime-expression ) |
| [MINUTES](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5648d4484f21015975efebd7ac03399.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | MINUTES_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [MOD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5650e7684f21015b1dcafaf320a4d00.html)( dividend, divisor ) | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod)( dividend, divisor ) |
| [MONTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a565928184f21015aecd84c01c4c2078.html)( date-expression ) | [MONTH](https://docs.snowflake.com/en/sql-reference/functions/year)( date-expression ) |
| [MONTHNAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a566193184f2101587e8896021cbc6c7.html)( date-expression ) | MONTHNAME_UDF( date-expression ) |
| [MONTHS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a566ced484f21015ad419bb64c76680c.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | MONTH_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [NEWID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56756f884f2101589eefadf085512d9.html) ( ) | [UUID_STRING](https://docs.snowflake.com/en/sql-reference/functions/uuid_string)( ) |
| [NEXT_CONNECTION](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a567dab684f21015b1ad9ffdb01bb91a.html)( { connection-id }, { database-id } ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [NEXT_HTTP_HEADER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81fa91566ce210149757da0cc8b98f41.html)( string-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [NOW](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a568dfde84f210159d57b7ca3bb6ca84.html)( \* ) | [CURRENT_TIMESTAMP](https://docs.snowflake.com/en/sql-reference/functions/current_timestamp)( ) |
| [NTILE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5695f3f84f21015a23ae9730b31eef2.html) ( expression1 ) OVER  (  ORDER BY expression2 [ ASC | DESC ]  ) | [NTILE](https://docs.snowflake.com/en/sql-reference/functions/ntile) ( constant_value ) OVER ( [ PARTITION BY expr1 ] ORDER BY expr2 [ { ASC | DESC } ] ) |
| [NULLIF](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a569fd1184f210159b61c1d4823ce243.html)( expression1, expression2 ) | [NULLIF](https://docs.snowflake.com/en/sql-reference/functions/nullif)( expression1, expression2 ) |
| [NUMBER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56a888784f21015bbaed2c2a214738e.html)( \* ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [OBJECT_ID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56b078284f210158dec9fd05131e60d.html) ( object-name ) | OBJECT_ID_UDF |
| [OBJECT_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56b844884f21015ba6d84cedfda5d23.html) ( object-id [ , database-id ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [OCTET_LENGTH](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56c053484f21015952de04bc4dab521.html)( string-expression ) | [OCTET_LENGTH](https://docs.snowflake.com/en/sql-reference/functions/octet_length)( string-expression ) |
| [PATINDEX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56c8f8684f210158653d0c858b0e559.html)( '%pattern%', string-expression ) | PATINDEX_UDF( '%pattern%', string-expression ) |
| [PERCENT_RANK](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56d183584f21015881bb3f46bb765ee.html) ( ) OVER ( ORDER BY expression [ ASC | DESC ] ) | [PERCENT_RANK](https://docs.snowflake.com/en/sql-reference/functions/percent_rank) ( ) OVER ( [ PARTITION BY expr1 ] ORDER BY expr2 [ { ASC | DESC } ] [ fixedRangeFrame ] ) |
| [PERCENTILE_CONT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56d9fa784f21015b6c8d94588153331.html) ( expression1 ) WITHIN GROUP ( ORDER BY expression2 [ ASC | DESC ] ) | [PERCENTILE_CONT](https://docs.snowflake.com/en/sql-reference/functions/percentile_cont) ( percentile ) WITHIN GROUP (ORDER BY order_by_expr) OVER ( [ PARTITION BY expr3 ] ) |
| [PERCENTILE_DISC](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56e219484f21015b3a4f46749d3faf5.html) ( expression1> ) WITHIN GROUP ( ORDER BY expression2> [ ASC | DESC ] ) | [PERCENTILE_DISC](https://docs.snowflake.com/en/sql-reference/functions/percentile_disc) ( percentile ) WITHIN GROUP (ORDER BY order_by_expr ) OVER ( [ PARTITION BY expr3 ] ) |
| [PI](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56ea16284f21015b398e51fb08558f3.html) ( \* ) | [PI](https://docs.snowflake.com/en/sql-reference/functions/pi) ( ) |
| [POWER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56f22b284f210159c928a9db0c5907e.html) ( numeric-expression1, numeric-expression2 ) | [POWER](https://docs.snowflake.com/en/sql-reference/functions/pow) ( numeric-expression1, numeric-expression2 ) |
| [PROPERTY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a56fa4db84f2101581d1eea9ca3957e2.html) ( { property-id | property-name } ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [PROPERTY_DESCRIPTION](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57024b884f21015a1819d79e3571f53.html) ( { property-id | property-name } ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [PROPERTY_IS_TRACKABLE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/65b794ca66ab4b7fb8e60b888f9fc1f4.html) ( property-id ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [PROPERTY_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a570a7e184f2101584578b1e641ba61b.html) ( property-id ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [PROPERTY_NUMBER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57131a184f2101585959e45321b1e95.html) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [QUARTER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a571b27b84f21015b649cee091ad3bd6.html) ( date-expression ) | [QUARTER](https://docs.snowflake.com/en/sql-reference/functions/year) ( date-expression ) |
| [QUARTERSTR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/8fbd6b73408a49d1aa5c88d99954bf7c.html) ( date-expression,[ quarter_start_month ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [RADIANS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a572340384f21015b1d3dab0d7a76062.html) (numeric-expression) | [RADIANS](https://docs.snowflake.com/en/sql-reference/functions/radians) (numeric-expression) |
| [RAND](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a572b2db84f210159574b044cfd9dcb6.html) ( [ integer-expression ] ) | [RANDOM](https://docs.snowflake.com/en/sql-reference/functions/random) ( [ integer-expression ] ) |
| [RANK](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57337e084f21015aa46b31299b91d70.html) ( ) OVER ( [ PARTITION BY ] ORDER BY expression [ ASC | DESC ] ) | [RANK](https://docs.snowflake.com/en/sql-reference/functions/rank) ( ) OVER ( [ PARTITION BY ] ORDER BY expression [ { ASC | DESC } ] [ window_frame ] ) |
| [READ_SERVER_FILE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/81fb732a6ce21014b442b1082d0be5af.html) ( filename [ , start [ , length ] ] ) | *None*  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [REGR_AVGX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a573b70d84f21015a55f85bddd70d598.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_AVGX](https://docs.snowflake.com/en/sql-reference/functions/regr_avgx) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_AVGY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a574426e84f210159d5d8adecd1f70f2.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_AVGY](https://docs.snowflake.com/en/sql-reference/functions/regr_avgy) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_COUNT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a574c56884f21015b7b6f6bde76a2e6a.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_COUNT](https://docs.snowflake.com/en/sql-reference/functions/regr_count) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_INTERCEPT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57548bd84f21015a72397703df578ba.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_INTERCEPT](https://docs.snowflake.com/en/sql-reference/functions/regr_intercept) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_R2](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a575c77684f210158d23e68bbd456148.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_R2](https://docs.snowflake.com/en/sql-reference/functions/regr_r2) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_SLOPE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57647a684f21015af3cb26e82eae9cd.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_SLOPE](https://docs.snowflake.com/en/sql-reference/functions/regr_slope) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_SXX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a576c83284f21015b9d5bbf81742e83a.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_SXX](https://docs.snowflake.com/en/sql-reference/functions/regr_sxx) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_SXY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57748fd84f21015bfd08e9110638b53.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_SXY](https://docs.snowflake.com/en/sql-reference/functions/regr_sxy) ( dependent-expression, independent-expression ) [OVER ...] |
| [REGR_SYY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57806cb84f21015933dc43a04d2cc9f.html) ( dependent-expression, independent-expression ) [OVER ...] | [REGR_SYY](https://docs.snowflake.com/en/sql-reference/functions/regr_syy) ( dependent-expression, independent-expression ) [OVER ...] |
| [REMAINDER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5788e7284f21015a4caecc7b2f96b10.html) ( dividend, divisor ) | [MOD](https://docs.snowflake.com/en/sql-reference/functions/mod)( dividend, divisor ) |
| [REPEAT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a579104184f2101598d4cd02edf61346.html) ( string-expression, integer-expression ) | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat)( string-expression, integer-expression ) |
| [REPLACE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a579952184f210159e17940c17a6d8f7.html)( original-string, search-string, replace-string ) | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace)( original-string, search-string, replace-string ) |
| [REPLICATE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57a156384f2101597df9d785635d3b0.html)( string-expression, integer-expression ) | [REPEAT](https://docs.snowflake.com/en/sql-reference/functions/repeat)( string-expression, integer-expression ) |
| [REVERSE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57a972e84f2101584c3b9d17a08b0f9.html) ( expression ) | [REVERSE](https://docs.snowflake.com/en/sql-reference/functions/reverse) ( expression ) |
| [RIGHT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57b364f84f210158a90b2b566be1d36.html) ( string-expression, numeric-expression ) | [RIGHT](https://docs.snowflake.com/en/sql-reference/functions/right) ( string-expression, numeric-expression ) |
| [ROUND](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57bbb0684f21015822ddb659e37c042.html) ( numeric-expression, integer-expression ) | [ROUND](https://docs.snowflake.com/en/sql-reference/functions/round) ( numeric-expression, integer-expression ) |
| [ROW_NUMBER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57c3ea884f21015b4f8c850a5a5357f.html) OVER ( [ PARTITION BY window partition ] ORDER BY window ordering ) | [ROW_NUMBER](https://docs.snowflake.com/en/sql-reference/functions/row_number) OVER ( [ PARTITION BY window partition ] ORDER BY window ordering ) |
| [ROWID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57cbfb484f21015b1a6f34fe17463d2.html)( table-name) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [RPAD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/3a8714b7782a4730b091194c3b54aca0.html)( str, n [, pattern ] ) | [RPAD](https://docs.snowflake.com/en/sql-reference/functions/rpad)( str, n [, pattern ] ) |
| [RTRIM](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57d411084f21015969acd7d63bcc34c.html) ( string-expression, [ trim_character_set ] ) | [RTRIM](https://docs.snowflake.com/en/sql-reference/functions/rtrim)( string-expression, [ trim_character_set ] )  Note: Snowflake is case-sensitive |
| [SECOND](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57dc03b84f210158836c9258c67e700.html)( datetime-expression ) | [SECOND](https://docs.snowflake.com/en/sql-reference/functions/hour-minute-second)( datetime-expression ) |
| [SECONDS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57e4e7d84f21015bdabf289394cd2ce.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | SECONDS_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [SIGN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57ed58c84f21015bb5e803787dd27eb.html) (numeric-expression) | [SIGN](https://docs.snowflake.com/en/sql-reference/functions/sign) (numeric-expression) |
| [SIMILAR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57f56c484f21015b142b043da48dee3.html) ( string-expression1, string-expression2 ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [SIN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a57fd70a84f21015a70cd54791443340.html) (numeric-expression) | [SIN](https://docs.snowflake.com/en/sql-reference/functions/sin) (numeric-expression) |
| [SORTKEY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5805ddb84f2101591ffe19db63f3521.html) ( string-expression [, { collation-id | collation-name [ ( collation-tailoring-string ) ] } ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [SOUNDEX](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a580dde084f21015b422a82fcc67a159.html)( string-expression ) | [SOUNDEX](https://docs.snowflake.com/en/sql-reference/functions/soundex)( string-expression ) |
| [SPACE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5815e2c84f210158cf48f3c618df22c.html)(numeric-expression) | [SPACE](https://docs.snowflake.com/en/sql-reference/functions/space)(numeric-expression) |
| [SQLFLAGGER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a581e75f84f210158c3cd3ba6b97a9eb.html)( sql-standard-string, sql-statement-string ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [SQRT](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5826d0c84f210159ad8a785b1b1ac0b.html) (numeric-expression) | [SQRT](https://docs.snowflake.com/en/sql-reference/functions/sqrt) (numeric-expression) |
| [SQUARE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a582f08784f210158c9aebe92c8ae80f.html) (numeric-expression) | [SQUARE](https://docs.snowflake.com/en/sql-reference/functions/square) (numeric-expression) |
| [STDDEV](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a583716084f2101595c8e7a0abc4d989.html) ( [ ALL ] expression ) [OVER ...] | [STDDEV](https://docs.snowflake.com/en/sql-reference/functions/stddev) ( expression ) [OVER ...] |
| [STDDEV_POP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a583f35984f21015b952ffc0a8c12597.html) ( [ ALL ] expression ) [OVER ...] | [STDDEV_POP](https://docs.snowflake.com/en/sql-reference/functions/stddev_pop) ( expression ) [OVER ...] |
| [STDDEV_SAMP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a584728f84f210158226d1181b68d335.html) ( [ ALL ] expression ) [OVER ...] | [STDDEV_SAMP](https://docs.snowflake.com/en/sql-reference/functions/stddev) ( expression ) [OVER ...] |
| [STR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a584f54284f21015bb43e961aa835036.html)( numeric-expression [ , length[ , decimal ] ] ) | STR_UDF |
| [STR_REPLACE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5857e0a84f210158b54cac40679f568.html)( string_expr1, string_expr2, string_expr3 ) | [REPLACE](https://docs.snowflake.com/en/sql-reference/functions/replace) |
| [STRING](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a586010d84f210158657b25cdb264bf0.html)( string-expression [ , … ] ) | [ARRAY_TO_STRING](https://docs.snowflake.com/en/sql-reference/functions/array_to_string)([...]. '') |
| [STRTOUUID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58683c184f21015bb5cb68f114bbcb9.html) (string-expression) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [STUFF](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58705b984f21015b314be7887f1392a.html) ( string-expression1, start, length, string-expression2 ) | [INSERT](https://docs.snowflake.com/en/sql-reference/functions/insert) |
| [SUBSTRING](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58787e784f21015acc5ecadf5b1a9a0.html)( string-expression, start [ , length ] ) | [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr) |
| [SUBSTR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58787e784f21015acc5ecadf5b1a9a0.html)( string-expression, start [ , length ] ) | [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr) |
| [SUBSTRING64](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a588072e84f21015ad978a0d7bc662d8.html)( string-expression, start [ , length ] ) | [SUBSTR](https://docs.snowflake.com/en/sql-reference/functions/substr) |
| [SUM](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5889fe484f21015b024abf6dcede473.html) ( expression | DISTINCT column-name ) [OVER ...] | [SUM](https://docs.snowflake.com/en/sql-reference/functions/sum) ( expression | DISTINCT column-name ) [OVER ...] |
| [SUSER_ID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5892d8e84f21015ad62d1b43e0bae2e.html) ( [ user-name ] ) | [CURRENT_USER](https://docs.snowflake.com/en/sql-reference/functions/current_user)( ) |
| [SUSER_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a589ad8084f21015ab2eb0b0272e8c41.html) ( [ user-id ] ) | [CURRENT_USER](https://docs.snowflake.com/en/sql-reference/functions/current_user)( ) |
| [TAN](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58a2ec084f2101584a8c423a3ca9750.html) (numeric-expression) | [TAN](https://docs.snowflake.com/en/sql-reference/functions/tan) (numeric-expression) |
| [TODAY](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58aae9284f21015a550a97595a91cc9.html)( [\*] ) | CURRENT_DATE( ) |
| [TRIM](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58b326684f210158b01c6a84254a2f2.html)( string-expression, [ trim_character_set ] ) | [TRIM](https://docs.snowflake.com/en/sql-reference/functions/trim)( string-expression, [ trim_character_set ] ) |
| [TRUNCNUM](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58baf5b84f21015961fcdf7ec6e1b8b.html)( numeric-expression, integer-expression ) | [TRUNC](https://docs.snowflake.com/en/sql-reference/functions/trunc)( numeric-expression, integer-expression ) |
| [UCASE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58c382984f2101586a7d1f6dcf499c3.html)(string-expression) | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper)(string-expression) |
| [UPPER](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58cbc0284f21015ac14f5baa190b878.html)(string-expression) | [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper)(string-expression) |
| [USER_ID](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58d3bab84f2101590ac91b509c292c5.html)( [ user-name ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [USER_NAME](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58dbf3184f21015b67ac2deee9f7081.html)( [ user-id ] ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [UUIDTOSTR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58e3ffd84f2101593c5c09c7d64fec4.html)( uuid-expression ) | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [VAR_POP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58ec03e84f21015b373c5236f4567a1.html)( [ ALL ] expression ) [OVER ...] | [VAR_POP](https://docs.snowflake.com/en/sql-reference/functions/var_pop)( [ ALL ] expression ) [OVER ...] |
| [VAR_SAMP](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58f41a384f2101582a6bccb68243889.html)( [ ALL ] expression ) [OVER ...] | [VAR_SAMP](https://docs.snowflake.com/en/sql-reference/functions/var_samp)( [ ALL ] expression ) [OVER ...] |
| [VARIANCE](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a58fdc8684f210158b82f182e03b637a.html) ( [ ALL ] expression ) [OVER ...] | [VARIANCE](https://docs.snowflake.com/en/sql-reference/functions/variance) ( [ ALL ] expression ) [OVER ...] |
| [WEEKS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a590601384f210158a02bf2d5a2c1783.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | WEEKS_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [WEIGHTED_AVG](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a590e30584f210158df8d2242a037242.html) (expression) OVER (window-spec); | None  *Note: Snowflake does not have any built-in function to emulate this behavior.* |
| [WIDTH_BUCKET](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a591658384f21015a3a2e821679c9000.html) ( expression, min_value, max_value, num_buckets ) | [WIDTH_BUCKET](https://docs.snowflake.com/en/sql-reference/functions/width_bucket) ( expression, min_value, max_value, num_buckets ) |
| [YEAR](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a591eb9d84f210159e35a75b4b036a0d.html)( date-expression ) | [YEAR](https://docs.snowflake.com/en/sql-reference/functions/year)( date-expression ) |
| [YEARS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a5926bf484f210159b3980226202882f.html)( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) | YEARS_UDF( datetime-expression ) | ( datetime-expression, datetime-expression ) | ( datetime-expression, integer-expression ) |
| [YMD](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a592fc9184f21015bfa68c6078363fae.html)( integer-expression1, integer-expression2, integer-expression3 ) | [DATE_FROM_PARTS](https://docs.snowflake.com/en/sql-reference/functions/date_from_parts) ( year, month, day ) |

## Related EWIs

[SSC-FDM-0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): GLOBAL TEMPORARY TABLE functionality not supported.
[SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
[SSC-EWI-TS0060](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sqlServerEWI.md): Datetime interval not supported by Snowflake.
[SSC-FDM-TS0025](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): DB_ID_UDF may have a different behavior in certain cases.
[SSC-FDM-TS0010](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sqlServerFDM.md): CURRENT_DATABASE function has different behavior in certain cases.\

---
title: SnowConvert AI - Sybase IQ - CREATE FUNCTION
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-create-function.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - CREATE FUNCTION

## Description

Creates a user-defined function (UDF) that returns a scalar value.
SnowConvert AI translates Sybase IQ UDFs to Snowflake UDFs, mapping parameters and data types to Snowflake equivalents.

> **Note:**
>
> Sybase IQ supports Transact-SQL as language. For Transact-SQL to Snowflake guidance, see the SQL Server/Azure Synapse translation reference: [Built-in Functions](../transact/transact-built-in-functions.md).
> This section documents statement translation specific to Sybase IQ.

## Grammar Syntax

```sql
CREATE FUNCTION [ <owner>. ]<function-name>
   ( <parameter-name> <data-type> [ , ... ] )
RETURNS <return-data-type>
BEGIN
   <function-body>
   RETURN <expression>;
END
```

## Sample Source Patterns

### Input Code:

#### Sybase

```sql
CREATE FUNCTION dbo.fn_tax(p_amount DECIMAL(10,2))
RETURNS DECIMAL(10,2)
BEGIN
    RETURN p_amount * 0.16;
END
```

### Output Code:

#### Snowflake

```sql
CREATE OR REPLACE FUNCTION dbo.fn_tax (p_amount DECIMAL(10, 2))
RETURNS DECIMAL(10, 2)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "02/02/2026",  "domain": "no-domain-provided",  "migrationid": "GSCcAR8gMXmhqgt7dMJukg==" }}'
AS
$$
  BEGIN
    RETURN p_amount * 0.16;
  END;
$$;
;
```

## Notes

* Parameter and return data types are translated to their Snowflake equivalents.
* For procedural logic that cannot be expressed as a SQL UDF, SnowConvert AI may use a JavaScript UDF or recommend a stored procedure.

---
title: SnowConvert AI - Sybase IQ - CREATE PROCEDURE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-create-procedure.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - CREATE PROCEDURE

## Description

Creates a stored procedure that encapsulates one or more SQL statements and optional control-flow logic.
SnowConvert AI translates Sybase IQ procedures to Snowflake stored procedures, mapping parameters and data types to Snowflake equivalents.

> **Note:**
>
> Sybase IQ procedures supports Transact-SQL as language. For Transact-SQL to Snowflake guidance, see the SQL Server/Azure Synapse translation reference: [CREATE PROCEDURE](../transact/transact-create-procedure.md).
> This section documents statement translation specific to Sybase IQ.

## Grammar Syntax

```sql
CREATE [ OR REPLACE | TEMPORARY ] PROCEDURE [ owner.]procedure-name
    ( [ parameter-list ] )
    [ AS ]
    compound-statement

parameter-list ::= parameter { , parameter }*

parameter ::= [ IN | OUT | INOUT ] parameter-name datatype

compound-statement ::= BEGIN
    statement-list
END
```

## Sample Source Patterns

### Input Code:

#### Sybase

```sql
CREATE PROCEDURE dbo.usp_update_sales
    (p_id INT, p_amount DECIMAL(10,2))
AS
    UPDATE sales
    SET amount = p_amount
    WHERE id = p_id;
```

### Output Code:

#### Snowflake

```sql
CREATE OR REPLACE PROCEDURE dbo.usp_update_sales (p_id INT, p_amount DECIMAL(10, 2))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "02/02/2026",  "domain": "no-domain-provided",  "migrationid": "8x+cAQEkgXqRnjrS+t0q4A==" }}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    UPDATE sales
    SET
        amount = p_amount
    WHERE
        id = p_id;
  END;
$$;
```

## Notes

* Parameter and return data types are translated to their Snowflake equivalents.
* SnowConvert AI may adjust procedure bodies to conform to Snowflake Scripting requirements.

---
title: SnowConvert AI - Sybase IQ - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-create-table.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - CREATE TABLE

## Description

Creates a new table in the current database. You define a list of columns, which each hold data of a distinct type. The owner of the table is the issuer of the CREATE TABLE command.

For more information, please refer to [`CREATE TABLE`](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html) documentation.

## Grammar Syntax

```sql
 CREATE [ { GLOBAL | LOCAL } TEMPORARY ] TABLE
   [ IF NOT EXISTS ] [ <owner>. ]<table-name>
   … ( <column-definition> [ <column-constraint> ] …
   [ , <column-definition> [ <column-constraint> ] …]
   [ , <table-constraint> ] … )
   |{ ENABLE | DISABLE } RLV STORE

   …[ IN <dbspace-name> ]
   …[ ON COMMIT { DELETE | PRESERVE } ROWS ]
   [ AT <location-string> ]
   [PARTITION BY
     <range-partitioning-scheme>
     | <hash-partitioning-scheme>
     | <composite-partitioning-scheme> ]

<column-definition> ::=
   <column-name> <data-type>
    [ [ NOT ] NULL ]
    [ DEFAULT <default-value> | IDENTITY ]
    [ PARTITION | SUBPARTITION ( <partition-name> IN  <dbspace-name> [ , ... ] ) ]

<default-value> ::=
   <special-value>
   | <string>
   | <global variable>
   | [ - ] <number>
   | ( <constant-expression> )
   | <built-in-function>( <constant-expression> )
   | AUTOINCREMENT
   | CURRENT DATABASE
   | CURRENT REMOTE USER
   | NULL
   | TIMESTAMP
   | LAST USER

<special-value> ::=
   CURRENT
   { DATE | TIME | TIMESTAMP | USER | PUBLISHER }
   | USER

<column-constraint> ::=
   IQ UNIQUE ( <integer> )
   | { [ CONSTRAINT <constraint-name> ]
     { UNIQUE
        | PRIMARY KEY
        | REFERENCES <table-name> [ ( <column-name> ) ] [ ON { UPDATE | DELETE } RESTRICT ] }
      [ IN <dbspace-name> ]
      | CHECK ( <condition> )
   }

<table-constraint> ::=
    [ CONSTRAINT <constraint-name> ]
   {  { UNIQUE | PRIMARY KEY } ( <column-name> [ , … ] )
     [ IN <dbspace-name> ]
     | <foreign-key-constraint>
     | CHECK ( <condition> )
   }

<foreign-key-constraint> ::=
   FOREIGN KEY [ <role-name> ] [ ( <column-name> [ , <column-name> ] … ) ]
   …REFERENCES <table-name> [ ( <column-name> [ , <column-name> ] … ) ]
   …[ <actions> ] [ IN <dbspace-name> ]

<actions> ::=
   [ ON { UPDATE | DELETE } RESTRICT ]

<location-string> ::=
   { <remote-server-name>. [ <db-name> ].[ <owner> ].<object-name>
      | <remote-server-name>; [ <db-name> ]; [ <owner> ];<object-name> }

<range-partitioning-scheme> ::=
   RANGE ( <partition-key> ) ( <range-partition-decl> [,<range-partition-decl> … ] )

<partition-key> ::= <column-name>

<range-partition-declaration> ::=
    <range-partition-name> VALUES <= ( {<constant> |  MAX } ) [ IN <dbspace-name> ]

<hash-partitioning-scheme> ::=
   HASH ( <partition-key> [ , <partition-key>, … ] )

<composite-partitioning-scheme> ::=
   <hash-partitioning-scheme> SUBPARTITION BY <range-partitioning-scheme>
```

## TEMPORARY TABLES

### Description

In Sybase IQ `GLOBAL | LOCAL TEMPORARY` is used to create temporary tables that exist only for the session. These tables are session-specific and automatically deleted when the session ends. They help store intermediate results or work data without affecting the permanent database schema. It also can be created only by adding an `#` at the beginning of the name.

> **Warning:**
>
> This syntax is partially supported in Snowflake.

### Grammar Syntax

```sql
 CREATE [ { GLOBAL | LOCAL } TEMPORARY ] TABLE
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE LOCAL TEMPORARY TABLE TABLE01 (
    col1 INTEGER
);

CREATE GLOBAL TEMPORARY TABLE TABLE02 (
    col1 INTEGER
);

CREATE TABLE #TABLE03(
    col1 INTEGER
);
```

##### Output Code:

##### Sybase

```sql
 CREATE OR REPLACE TEMPORARY TABLE TABLE01 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;

--** SSC-FDM-0009 - GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED. **
CREATE OR REPLACE TABLE TABLE02 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;

CREATE OR REPLACE TEMPORARY TABLE T_TABLE03 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;
```

### Related EWIs

[SSC-FDM-0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): GLOBAL TEMPORARY TABLE functionality not supported.

## IF NOT EXISTS

### Description

> Ensures the table is created only if it does not already exist, preventing duplication and errors in your SQL script. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html)).

> **SuccessPlaceholder:**
>
> This syntax is fully supported in Snowflake.

### Grammar Syntax

```sql
 IF NOT EXISTS
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE TABLE IF NOT EXISTS table1 (
    col1 INTEGER
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2024" }}';
```

## (ENABLE | DISABLE) RLV STORE

### Description

> Controls Row-Level Versioning Store functionality. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html)).

> **Note:**
>
> This syntax is not needed in Snowflake.

### Grammar Syntax

```sql
 { ENABLE | DISABLE } RLV STORE
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE TABLE rlv_table
(id INT)
ENABLE RLV STORE;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE rlv_table
(
id INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;
```

## IN DBSPACE

### Description

> Specifies the DB space for data storage. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html)).

> **Note:**
>
> This syntax is not needed in Snowflake. Snowflake automatically handles storage.

### Grammar Syntax

```sql
 IN <dbspace-name>
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE TABLE dbspace_table (
    id INT PRIMARY KEY
)
IN my_dbspace;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE dbspace_table (
    id INT PRIMARY KEY
);
```

## ON COMMIT

### Description

> Specifies the behaviour of the temporary table when a commit is done. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html))

> **Warning:**
>
> This syntax is partially supported.

### Grammar Syntax

```sql
 [ ON COMMIT { DELETE | PRESERVE } ROWS ]
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE LOCAL TEMPORARY TABLE temp_employees (
    DATA VARCHAR(255)
) ON COMMIT DELETE ROWS;

CREATE LOCAL TEMPORARY TABLE temp_projects (
    DATA VARCHAR(255)
) ON COMMIT PRESERVE ROWS;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TEMPORARY TABLE temp_employees (
    DATA VARCHAR(255)
)
--    --** SSC-FDM-0008 - ON COMMIT NOT SUPPORTED **
--    ON COMMIT DELETE ROWS
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;

CREATE OR REPLACE TEMPORARY TABLE temp_projects (
    DATA VARCHAR(255)
) ON COMMIT PRESERVE ROWS
    COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;
```

### Related EWIs

[SSC-FDM-0008](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): On Commit not supported.

## AT LOCATION

### Description

> Creates a remote table (proxy). ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html))

> **Danger:**
>
> This syntax is not supported in Snowflake.

### Grammar Syntax

```sql
 AT <location-string>
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE TABLE t1
(
    DATA VARCHAR(10)
)
AT 'SERVER_A.db1.joe.t1';
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE t1
(
    DATA VARCHAR(10)
)
    !!!RESOLVE EWI!!! /*** SSC-EWI-SY0002 - UNSUPPORTED REMOTE TABLE SYNTAX ***/!!!
AT 'SERVER_A.db1.joe.t1'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/11/2025",  "domain": "no-domain-provided" }}'
;
```

### Related EWIs

[SSC-EWI-SY0002](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md): UNSUPPORTED REMOTE TABLE SYNTAX.

## PARTITION BY

### Description

> All rows of a table partition are physically colocated. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html))

> **Note:**
>
> This syntax is not needed in Snowflake.

### Grammar Syntax

```sql
 PARTITION BY
     <range-partitioning-scheme>
     | <hash-partitioning-scheme>
     | <composite-partitioning-scheme>

<range-partitioning-scheme> ::=
   RANGE ( <partition-key> ) ( <range-partition-decl> [,<range-partition-decl> … ] )

<partition-key> ::= <column-name>

<range-partition-declaration> ::=
    <range-partition-name> VALUES <= ( {<constant> |  MAX } ) [ IN <dbspace-name> ]

<hash-partitioning-scheme> ::=
   HASH ( <partition-key> [ , <partition-key>, … ] )

<composite-partitioning-scheme> ::=
   <hash-partitioning-scheme> SUBPARTITION BY <range-partitioning-scheme>
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 -- Range Partitioning
CREATE TABLE sales (
    sale_id INT,
    sale_date DATE,
    amount DECIMAL(10, 2)
)
PARTITION BY RANGE (sale_date) (
    p1 VALUES <= ('2023-01-01'),
    p2 VALUES <= ('2024-01-01'),
    p3 VALUES <= (MAXVALUE)
);

-- Hash Partitioning
CREATE TABLE customers (
    customer_id INT,
    customer_name VARCHAR(255)
)
PARTITION BY HASH (customer_id);

-- Composite Partitioning (Hash-Range)
CREATE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    amount DECIMAL(10,2)
)
PARTITION BY HASH (customer_id)
SUBPARTITION BY RANGE (order_date) (
    p1 VALUES <= ('2023-01-01'),
    p2 VALUES <= ('2024-01-01'),
    p3 VALUES <= (MAXVALUE)
);
```

##### Output Code:

##### Snowflake

```sql
 -- Range Partitioning
CREATE OR REPLACE TABLE sales (
    sale_id INT,
    sale_date DATE,
    amount DECIMAL(10, 2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;

-- Hash Partitioning
CREATE OR REPLACE TABLE customers (
    customer_id INT,
    customer_name VARCHAR(255)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;

-- Composite Partitioning (Hash-Range)
CREATE OR REPLACE TABLE orders (
    order_id INT,
    customer_id INT,
    order_date DATE,
    amount DECIMAL(10,2)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
;
```

## CONSTRAINTS

### Description

> This ensures the accuracy and reliability of the data in the table. If there is any violation between the constraint and the data action, the action is aborted. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a619764084f21015b8039a8346dc622c.html))

> **Warning:**
>
> This syntax is partially supported.

### Grammar Syntax

```sql
 <table-constraint> ::=
    [ CONSTRAINT <constraint-name> ]
   {  { UNIQUE | PRIMARY KEY } ( <column-name> [ , … ] )
     [ IN <dbspace-name> ]
     | <foreign-key-constraint>
     | CHECK ( <condition> )
   }

<foreign-key-constraint> ::=
   FOREIGN KEY [ <role-name> ] [ ( <column-name> [ , <column-name> ] … ) ]
   …REFERENCES <table-name> [ ( <column-name> [ , <column-name> ] … ) ]
   …[ <actions> ] [ IN <dbspace-name> ]

<actions> ::=
   [ ON { UPDATE | DELETE } RESTRICT ]
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 CREATE TABLE t_constraint (
    id1 INT NOT NULL,
    id2 INT PRIMARY KEY,
    age INT CHECK (age >= 18),
    email VARCHAR(255) UNIQUE,
    product_id INT REFERENCES products(id) ON DELETE RESTRICT IN SOMEPLACE,
    cod_iq VARCHAR(20) IQ UNIQUE(5),
    CONSTRAINT unq_name_email UNIQUE (name, email),
    CONSTRAINT fk_ord_line FOREIGN KEY (ord_id, line_id) REFERENCES ord_lines(ord_id,line_id)
);
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE t_constraint (
    id1 INT NOT NULL,
    id2 INT PRIMARY KEY,
    age INT
            !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
            CHECK (age >= 18),
    email VARCHAR(255) UNIQUE,
    product_id INT REFERENCES products (id) ON DELETE RESTRICT ,
    cod_iq VARCHAR(20)
                       !!!RESOLVE EWI!!! /*** SSC-EWI-SY0003 - UNSUPPORTED IQ UNIQUE CONSTRAINT ***/!!!
 IQ UNIQUE(5),
       CONSTRAINT unq_name_email UNIQUE (name, email),
       CONSTRAINT fk_ord_line FOREIGN KEY (ord_id, line_id) REFERENCES ord_lines (ord_id, line_id)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;
```

### Related EWIs

[SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): CHECK STATEMENT NOT SUPPORTED.

[SSC-EWI-SY0003](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md): UNSUPPORTED IQ UNIQUE CONSTRAINT.

## DEFAULT

### Description

Defines the default value of a column in a create table.

> **Warning:**
>
> This syntax is partially supported in Snowflake.

### Grammar Syntax

```sql
 <default-value> ::=
   <special-value>
   | <string>
   | <global variable>
   | [ - ] <number>
   | ( <constant-expression> )
   | <built-in-function>( <constant-expression> )
   | AUTOINCREMENT
   | CURRENT DATABASE
   | CURRENT REMOTE USER
   | NULL
   | TIMESTAMP
   | LAST USER

<special-value> ::=
   CURRENT
   { DATE | TIME | TIMESTAMP | USER | PUBLISHER }
   | USER
```

### Sample Source Patterns

#### Input Code:

##### Sybase

```sql
 create table t_defaults
(
col1 timestamp default current utc timestamp,
col2 timestamp default current timestamp,
col3 varchar default current user,
col4 varchar default current remote user,
col5 varchar default last user,
col6 varchar default current publisher,
col7 varchar default current date,
col8 varchar default current database,
col9 varchar default current time,
col10 varchar default user,
col11 int default autoincrement,
col12 int identity,
col13 int default -10,
col14 int default 'literal',
col15 int default null
)
;
```

##### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE t_defaults
(
    col1 timestamp default CURRENT_TIMESTAMP,
    col2 timestamp default CURRENT_TIMESTAMP,
    col3 VARCHAR default CURRENT_USER,
    col4 VARCHAR default
                         !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE CURRENT REMOTE USER IN SNOWFLAKE ***/!!! current remote user,
    col5 VARCHAR default
                         !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE LAST USER IN SNOWFLAKE ***/!!! last user,
    col6 VARCHAR default
                         !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE CURRENT PUBLISHER IN SNOWFLAKE ***/!!! current publisher,
    col7 VARCHAR default CURRENT_DATE,
    col8 VARCHAR default CURRENT_DATABASE,
    col9 VARCHAR default CURRENT_TIME,
    col10 VARCHAR DEFAULT CURRENT_USER,
    col11 INT IDENTITY ORDER,
    col12 INT IDENTITY ORDER,
    col13 INT default -10,
    col14 INT default 'literal',
    col15 INT default null
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "03/19/2025",  "domain": "test" }}'
;
```

---
title: SnowConvert AI - Sybase IQ - CREATE TYPE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-create-type.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - CREATE TYPE

Sybase alias types use the same pattern as SQL Server: `CREATE TYPE ... FROM base_type` becomes Snowflake `CREATE TYPE ... AS base_type`, with nullability keywords on the source definition removed in the output.

**Source (Sybase):**

```sql
CREATE TYPE EmailAddress FROM VARCHAR(255);
```

**Snowflake equivalent:**

```sql
CREATE TYPE EmailAddress AS VARCHAR(255);
```

**Source (Sybase):**

```sql
CREATE TYPE PhoneNumber FROM VARCHAR(20) NOT NULL;
```

**Snowflake equivalent:**

```sql
CREATE TYPE PhoneNumber AS VARCHAR(20);
```

**Notes:** For table types and other Transact-SQL constructs not covered here, see [CREATE TYPE (SQL Server / Azure Synapse)](../transact/transact-create-type.md); alias-type behavior is shared between Transact-SQL and Sybase.

---
title: SnowConvert AI - Sybase IQ - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-create-view.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - CREATE VIEW

## Description

Creates a new view in the current database. You define a list of columns, which each hold data of a distinct type. The owner of the view is the issuer of the CREATE VIEW command.

For more information, please refer to [`CREATE VIEW`](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a61a051684f210158cced2d83231bd8a.html?version=16.1.5.0&amp;locale=en-US) documentation.

## Grammar Syntax

```sql
 CREATE [ OR REPLACE ] VIEW
   … [ owner.]view-name [ ( column-name [ , … ] ) ]
   … AS select-without-order-by
   … [ WITH CHECK OPTION ]
```

## Sample Source Patterns

### Input Code:

#### Sybase

```sql
 CREATE OR REPLACE VIEW VIEW1
AS
SELECT
COL1, COL2
FROM T1
WITH CHECK OPTION;
```

#### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE VIEW VIEW1
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "04/15/2025",  "domain": "test" }}'
AS
SELECT
COL1,
COL2
FROM
T1;
```

---
title: SnowConvert AI - Sybase IQ - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-data-types.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - Data Types

Snowflake supports most basic SQL data types (with some restrictions) for columns, local variables, expressions, parameters, and other appropriate/suitable locations.

## Exact and approximate numerics

| Sybase | Snowflake | Notes |
| --- | --- | --- |
| Sybase | Snowflake | Notes |
| BIGINT | BIGINT | ​Note that BIGINT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| BIT | BOOLEAN | Sybase only accepts ​1, 0, or NULL |
| DECIMAL | DECIMAL | ​Snowflake's DECIMAL is synonymous with NUMBER |
| FLOAT | FLOAT | ​This data type behaves equally on both systems.  Precision 7-15 digits, float (1-24)  Storage 4 - 8 bytes, float (25-53) |
| INT | INT | Note that INT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| SMALLINT | SMALLINT​ | ​This data type behaves equally |
| TINYINT​ | TINYINT | Note that TINYINT in Snowflake is an alias for NUMBER(38,0)  [See note on this conversion below.] |
| NUMERIC | NUMERIC | ​Snowflake's NUMERIC is synonymous with NUMBER |

**NOTE:**

* Each is converted to the alias in Snowflake with the same name for the conversion of integer data types (INT, SMALLINT, BIGINT, TINYINT). Each of those aliases is converted to NUMBER(38,0), a data type considerably larger than the integer datatype. Below is a comparison of the range of values that can be present in each data type:

  + Snowflake NUMBER(38,0): -99999999999999999999999999999999999999 to +99999999999999999999999999999999999999
  + Sybase TINYINT: 0 to 255
  + Sybase INT: -2^31 (-2,147,483,648) to 2^31-1 (2,147,483,647)
  + Sybase BIGINT: -2^63 (-9,223,372,036,854,775,808) to 2^63-1 (9,223,372,036,854,775,807)
  + Sybase SMALLINT: -2^15 (-32,768) to 2^15-1 (32,767)

## Date and time

| Sybase | Snowflake | Notes |
| --- | --- | --- |
| DATE | DATE | Sybase accepts range from 0001-01-01 to 9999-12-31 |
| DATETIME | TIMESTAMP_NTZ(3) | Snowflake’s DATETIME is an alias for TIMESTAMP_NTZ​ |
| SMALLDATETIME | TIMESTAMP_NTZ | Snowflake’s DATETIME truncates the TIME information  That is, 1955-12-13 12:43:10 is saved as 1955-12-13 |
| TIME | TIME | ​This data type behaves equally on both systems.  Range 00:00:00.0000000 through 23:59:59.9999999 |
| TIMESTAMP | TIMESTAMP |  |

## Character strings

| Sybase | Snowflake | Notes |
| --- | --- | --- |
| CHAR | CHAR | ​Snowflake’s max string size in bytes is 167772161. |
| TEXT​ | TEXT |  |
| VARCHAR​ | VARCHAR | Snowflake’s max string size in bytes is 167772161. |

## Unicode character strings

| Sybase | Snowflake | Notes |
| --- | --- | --- |
| NCHAR | NCHAR | Synonymous with VARCHAR except default length is VARCHAR(1). |
| NTEXT | TEXT | NTEXT is a Sybase domain type, implemented as a LONG NVARCHAR. |
| NVARCHAR | VARCHAR | Snowflake’s max string size in bytes is 167772161. |

## Binary strings

| Sybase | Snowflake | Notes |
| --- | --- | --- |
| BINARY | ​BINARY | In Snowflake the maximum length is 8 MB (8,388,608 bytes) and length is always measured in terms of bytes. |
| VARBINARY | VARBINARY | Snowflake uses this data type as a synonym for BINARY.  Snowflake often represents each byte as 2 hexadecimal characters |

---
title: SnowConvert AI - Sybase IQ - SELECT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/sybase/sybase-select-statement.md
section: Migrations
---

# SnowConvert AI - Sybase IQ - SELECT

## Description

> Retrieves information from the database. ([Sybase SQL Language Reference](https://help.sap.com/docs/SAP_IQ/a899599784f21015a466ed42e24d07f9/a624e72e84f210159276a39335acd358.html?version=16.0.11&amp;locale=en-US))

> **Warning:**
>
> This syntax is partially supported in Snowflake.

## Grammar Syntax

```sql
 SELECT
[ ALL | DISTINCT ]
[ row-limitation-option1 ]
   	select-list
   … 	[ INTO { host-variable-list | variable-list | table-name } ]
   … 	[ INTO LOCAL TEMPORARY TABLE { table-name } ]
   … 	[ FROM table-list ]
   … 	[ WHERE search-condition ]
   … 	[ GROUP BY [ expression [, ...]
         | ROLLUP ( expression [, ...] )
         | CUBE ( expression [, ...] ) ] ]
   … 	[ HAVING search-condition ]
   … 	[ ORDER BY { expression | integer } [ ASC | DESC ] [, ...] ]
   | 	[ FOR JSON json-mode ]
   … [ row-limitation-option ]

select-list:
   { column-name
   | expression [ [ AS ] alias-name ]
   | *
   }

row-limitation-option1:
   FIRST
   | TOP {ALL | limit-expression} [START AT startat-expression ]

limit-expression:
    simple-expression

startat-expression:
    simple-expression

row-limitation-option2:
   LIMIT { [ offset-expression, ] limit-expression
   | limit-expression OFFSET offset-expression }

offset-expression:
   simple-expression

simple-expression:
   integer
   | variable
   | ( simple-expression )
   | ( simple-expression { + | - | * } simple-expression )

..FROM <table-expression> [,...]

<table-expression> ::=
   <table-name>
   | <view-name>
   | <procedure-name>
   | <common-table-expression>
   | ( <subquery> ) [ [ AS ] <derived-table-name> ( <column_name, ...>) ] ]
   | <derived-table>
   | <join-expression>
   | ( <table-expression> , ... )
   | <openstring-expression>
   | <apply-expression>
   | <contains-expression>
   | <dml-derived-table>

<table-name> ::=
   [ <userid>.] <table-name> ]
   [ [ AS ] <correlation-name> ]
   [ FORCE INDEX ( <index-name> ) ]

<view-name> ::=
   [ <userid>.]<view-name> [ [ AS ] <correlation-name> ]

<procedure-name> ::=
   [  <owner>, ] <procedure-name> ([ <parameter>, ...])
   [  WITH(<column-name datatype>, )]
   [ [ AS ] <correlation-name> ]

<parameter> ::=
   <scalar-expression> | <table-parameter>

<table-parameter> ::=
   TABLE (<select-statement)> [ OVER ( <table-parameter-over> )]

<table-parameter-over> ::=
   [ PARTITION BY {ANY
   | NONE|< table-expression> } ]
   [ ORDER BY { <expression> | <integer> }
   [ ASC | DESC ] [, ...] ]

<derived-table> ::=
   ( <select-statement> )
   	[ AS ] <correlation-name> [ ( <column-name>, ... ) ]

<join-expression> ::=
   <table-expression> <join-operator> <table-expression>
   	[ ON <join-condition> ]

<join-operator> ::=
   [ KEY | NATURAL ] [ <join-type> ] JOIN | CROSS JOIN

<join-type> ::=
   INNER
     | LEFT [ OUTER ]
     | RIGHT [ OUTER ]
     | FULL [ OUTER ]

<openstring-expression> ::=
   OPENSTRING ( { FILE | VALUE } <string-expression> )
     WITH ( <rowset-schema> )
   	[ OPTION ( <scan-option> ...  ) ]
   	[ AS ] <correlation-name>

<apply-expression> ::=
   <table-expression> { CROSS | OUTER } APPLY <table-expression>

<contains-expression> ::=
   { <table-name>  | <view-name> } CONTAINS
   ( <column-name> [,...], <contains-query> )
   [ [ AS ] <score-correlation-name> ]

<rowset-schema> ::=
   <column-schema-list>
	   | TABLE [<owner>.]<table-name> [ ( <column-list> ) ]

<column-schema-list> ::=
   { <column-name user-or-base-type> |  filler( ) } [ , ... ]

<column-list> ::=
   { <column-name> | filler( ) } [ , ... ]

<scan-option> ::=
   BYTE ORDER MARK { ON | OFF }
   | COMMENTS INTRODUCED BY <comment-prefix>
   | DELIMITED BY <string>
   | ENCODING <encoding>
   | ESCAPE CHARACTER <character>
   | ESCAPES { ON | OFF }
   | FORMAT { TEXT  | BCP  }
   | HEXADECIMAL { ON | OFF }
   | QUOTE <string>
   | QUOTES { ON | OFF }
   | ROW DELIMITED BY string
   | SKIP <integer>
   | STRIP { ON | OFF | LTRIM | RTRIM | BOTH }

<contains-query> ::= <string>

<dml-derived-table> ::=
   ( <dml-statement>  ) REFERENCING ( [ <table-version-names>  | NONE ] )

<dml-statement> ::=
   <insert-statement>
   <update-statement>
   <delete-statement>

<table-version-names> ::=
   OLD [ AS ] <correlation-name> [ FINAL [ AS ] <correlation-name> ]
     | FINAL [ AS ] <correlation-name>
```

## Sample Source Patterns

### Row Limitation

Sybase allows row limitation in a query by using the TOP clause with an optional START AT. Snowflake does not support this syntax but it can be transformed as shown below to achieve the same functionality.

#### Input Code:

##### Sybase

```sql
 SELECT
TOP 10 START AT 2
COL1
FROM TABLE1;

SELECT
FIRST
COL1
FROM TABLE1;

SELECT
COL1
FROM TABLE1
LIMIT 2, 1;

SELECT
COL1
FROM TABLE1
LIMIT 1 OFFSET 2;
```

#### Output Code:

##### Snowflake

```sql
 SELECT
COL1
FROM
TABLE1
LIMIT 10 OFFSET 2;

SELECT
TOP 1
COL1
FROM
TABLE1;

SELECT
COL1
FROM
TABLE1
LIMIT 1 OFFSET 2;

SELECT
COL1
FROM
TABLE1
LIMIT 1 OFFSET 2;
```

### Into Clause

In Sybase, a table can be defined by selecting multiple rows and defining a name to store the data retrieved. Snowflake does not support this behavior but can be emulated by doing a CREATE TABLE AS.

#### Input Code:

##### Sybase

```sql
 SELECT
* INTO mynewtable
FROM TABLE1;

SELECT
* INTO LOCAL TEMPORARY TABLE mynewtable
FROM TABLE1;

SELECT
* INTO #mynewtable
FROM TABLE1;
```

#### Output Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE mynewtable AS
SELECT
*
FROM
TABLE1;

CREATE OR REPLACE TEMPORARY TABLE mynewtable AS
SELECT
*
FROM
TABLE1;

CREATE OR REPLACE TEMPORARY TABLE T_mynewtable AS
SELECT
*
FROM
TABLE1;
```

### Force Index

Snowflake does not contain indexes for query optimization.

#### Input Code:

##### Sybase

```sql
 SELECT * FROM MyTable FORCE INDEX (MyIndex);
```

#### Output Code:

##### Snowflake

```sql
 SELECT
*
FROM
MyTable
--        --** SSC-FDM-SY0002 - FORCE INDEX IS NOT SUPPORTED IN SNOWFLAKE **
--        FORCE INDEX (MyIndex)
                             ;
```

### TABLE FUNCTIONS

Snowflake allows calling a stored procedure(when the procedure meets certain [limitations](https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-selecting-from#limitations-for-selecting-from-a-stored-procedure)) or a table value function in a FROM clause, but RESULTSETS and windowing cannot be used as parameters.

#### Input Code:

##### Sybase

```sql
 SELECT * FROM
MyProcedure(TABLE (SELECT * FROM TABLE1));

SELECT * FROM MyProcedure(1, 'test');

SELECT * FROM
MyProcedure(
TABLE (SELECT * FROM TABLE1)
OVER (PARTITION BY Col1 ORDER BY Col2 DESC));

SELECT * FROM
MyProcedure(
TABLE (SELECT * FROM AnotherTable) );
```

#### Output Code:

##### Snowflake

```sql
 SELECT
*
FROM
TABLE(MyProcedure(
                  !!!RESOLVE EWI!!! /*** SSC-EWI-SY0004 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T RECEIVE A QUERY AS PARAMETER ***/!!!TABLE (SELECT * FROM TABLE1)));

SELECT
*
FROM
TABLE(MyProcedure(1, 'test'));

SELECT
*
FROM
TABLE(MyProcedure(
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0004 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T RECEIVE A QUERY AS PARAMETER ***/!!!
TABLE (SELECT * FROM TABLE1)
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0005 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T BE USED WITH OVER EXPRESSION ***/!!!
OVER (PARTITION BY Col1 ORDER BY Col2 DESC)));

SELECT
*
FROM
TABLE(MyProcedure(
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0004 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T RECEIVE A QUERY AS PARAMETER ***/!!!
TABLE (SELECT * FROM AnotherTable) ));
```

### OPEN STRING

Snowflake does not support [OPENSTRING](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a7749cf084f21015b73b899c1520fb06.html#parameters) functionality.

#### Input Code:

##### Sybase

```sql
 -- Openstring from file
SELECT * FROM
OPENSTRING (FILE '/path/to/file.txt')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;

-- Openstring from value
SELECT * FROM
OPENSTRING (VALUE '1,test')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;

-- Openstring with options
SELECT * FROM
OPENSTRING (FILE '/path/to/file.csv')
WITH (Col1 INT, Col2 VARCHAR(20))
OPTION (DELIMITED BY ',' QUOTE '"') AS OS;
```

#### Output Code:

##### Snowflake

```sql
 -- Openstring from file
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0006 - OPEN STRING IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
OPENSTRING (FILE '/path/to/file.txt')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;

-- Openstring from value
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0006 - OPEN STRING IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
OPENSTRING (VALUE '1,test')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;

-- Openstring with options
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0006 - OPEN STRING IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
OPENSTRING (FILE '/path/to/file.csv')
WITH (Col1 INT, Col2 VARCHAR(20))
OPTION (DELIMITED BY ',' QUOTE '"') AS OS;
```

### DML Derived Table

In Sybase, during execution, the DML statement specified in the dml-derived table is executed first, and the rows affected by that DML materialize into a temporary table whose columns are described by the REFERENCING clause. The temporary table represents the result set of dml-derived-table. Snowflake does not support this behavior.

#### Input Code:

##### Sybase

```sql
 -- DML derived table with insert
SELECT * FROM (INSERT INTO TargetTable (Col1, Col2) VALUES (1, 'test')) REFERENCING (FINAL AS F);

-- DML derived table with update
SELECT * FROM (UPDATE TargetTable SET Col2 = 'updated' WHERE Col1 = 1) REFERENCING (OLD AS O FINAL AS F);

-- DML derived table with delete
SELECT * FROM (DELETE FROM TargetTable WHERE Col1 = 1) REFERENCING (OLD AS O);
```

#### Output Code:

##### Snowflake

```sql
 -- DML derived table with insert
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!! (INSERT INTO TargetTable (Col1, Col2) VALUES (1, 'test')) REFERENCING (FINAL AS F);

-- DML derived table with update
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!! (UPDATE TargetTable SET Col2 = 'updated' WHERE Col1 = 1) REFERENCING (OLD AS O FINAL AS F);

-- DML derived table with delete
SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!! (DELETE FROM TargetTable WHERE Col1 = 1) REFERENCING (OLD AS O);
```

### KEY JOIN

Snowflake does not support KEY join but when the ON CLAUSE is defined in the query the KEY keyword is removed; otherwise, an EWI is inserted.

#### Input Code:

##### Sybase

```sql
 SELECT * FROM Table1 KEY JOIN Table2;
SELECT * FROM Table1 KEY JOIN Table2 ON Table1.ID = Table2.ID;
```

#### Output Code:

##### Snowflake

```sql
 SELECT
*
FROM
Table1
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0009 - KEY JOIN NOT SUPPORTED IN SNOWFLAKE ***/!!!
KEY JOIN
Table2;

SELECT
*
FROM
Table1
JOIN
Table2
ON Table1.ID = Table2.ID;
```

### OUTER-CROSS APPLY

Snowflake transforms the clause the CROSS APPLY into LEFT OUTER JOIN and OUTER APPLY to INNER JOIN.

#### Input Code:

##### Sybase

```sql
 -- Apply cross apply
SELECT * FROM Table1 CROSS APPLY (SELECT Col2 FROM Table2 WHERE Table1.ID = Table2.ID) AS AP;

-- Apply outer apply
SELECT * FROM Table1 OUTER APPLY (SELECT Col2 FROM Table2 WHERE Table1.ID = Table2.ID) AS AP;
```

#### Output Code:

##### Snowflake

```sql
 -- Apply cross apply
SELECT
    *
FROM
    Table1
    LEFT OUTER JOIN (
        SELECT
            Col2
        FROM
            Table2
        WHERE
            Table1.ID = Table2.ID
    ) AS AP;

-- Apply outer apply
SELECT
    *
FROM
    Table1
    INNER JOIN LATERAL (
        SELECT
            Col2
        FROM
            Table2
        WHERE
            Table1.ID = Table2.ID
    ) AS AP;
```

### CONTAINS Clause

In Sybase the [CONTAINS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a7749cf084f21015b73b899c1520fb06.html) clause following a table name to filter the table and return only those rows matching the full text query specified with contains-query. Every matching row of the table is returned together with a score column that can be referred to using score-correlation-name. Snowflake does not support this behavior.

#### Input Code:

##### Sybase

```sql
 -- Contains clause
SELECT * FROM MyTable CONTAINS (TextColumn, 'search term') AS Score;

-- Contains clause with multiple columns.
SELECT * FROM MyTable CONTAINS (TextColumn,TextColumn2, 'search term') AS Score;
```

#### Output Code:

##### Snowflake

```sql
 -- Contains clause
SELECT
*
FROM
MyTable
        !!!RESOLVE EWI!!! /*** SSC-EWI-SY0008 - CONTAINS CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
        CONTAINS (TextColumn, 'search term') AS Score;

-- Contains clause with multiple columns.
SELECT
*
FROM
MyTable
        !!!RESOLVE EWI!!! /*** SSC-EWI-SY0008 - CONTAINS CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
        CONTAINS (TextColumn,TextColumn2, 'search term') AS Score;
```

## Related EWIs

[SSC-FDM-0009](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED.

[SSC-FDM-SY0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sybaseFDM.md): CALLING STORED PROCEDURE IN FROM CLAUSE MIGHT HAVE COMPILATION ERRORS

[SSC-FDM-SY0002](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/sybaseFDM.md): FORCE INDEX IS NOT SUPPORTED IN SNOWFLAKE

[SSC-EWI-SY0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - UNSUPPORTED SYNTAX TABLE FUNCTION CAN’T RECEIVE A QUERY AS PARAMETER

[SSC-EWI-SY0005](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - UNSUPPORTED SYNTAX TABLE FUNCTION CAN’T BE USED WITH OVER EXPRESSION

[SSC-EWI-SY0006](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - OPEN STRING IS NOT SUPPORTED IN SNOWFLAKE

[SSC-EWI-SY0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE

[SSC-EWI-SY0008](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - CONTAINS CLAUSE NOT SUPPORTED IN SNOWFLAKE

[SSC-EWI-SY0009](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md) - KEY JOIN NOT SUPPORTED IN SNOWFLAKE

---
title: SnowConvert AI - Sybase IQ Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/sybaseFDM.md
section: Migrations
---

# SnowConvert AI - Sybase IQ Functional Differences

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Sybase IQ focuses its assessment and translation capabilities primarily on TABLES, VIEWS, STORED PROCEDURES, and FUNCTIONS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

## SSC-FDM-SY0001

Calling stored procedure in FROM might have compilation errors

### Description

Snowflake supports calling a stored procedure in the FROM clause when the procedure meets certain [conditions](https://docs.snowflake.com/en/developer-guide/stored-procedure/stored-procedures-selecting-from#limitations-for-selecting-from-a-stored-procedure) otherwise the query fails.

#### Code Example

##### Input Code:

##### Sybase

```sql
 SELECT * FROM MyProcedure(1, 'test');
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
  *
FROM
  --** SSC-FDM-SY0001 - CALLING STORED PROCEDURE IN FROM CLAUSE MIGHT HAVE COMPILATION ERRORS **
  TABLE(MyProcedure(1, 'test'));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-SY0002

Calling stored procedure in FROM might have compilation errors

### Description

Snowflake does not contain indexes for query optimization.

#### Code Example

##### Input Code:

##### Sybase

```sql
 SELECT * FROM TABLE1 FORCE INDEX (MyIndex);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
  *
FROM
  TABLE1
--         --** SSC-FDM-SY0002 - FORCE INDEX IS NOT SUPPORTED IN SNOWFLAKE **
--         FORCE INDEX(MyIndex)
                             ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Sybase IQ Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/sybaseEWI.md
section: Migrations
---

# SnowConvert AI - Sybase IQ Issues

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Sybase IQ currently supports assessment and translation for TABLES, VIEWS, STORED PROCEDURES, and FUNCTIONS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Sybase IQ grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

## SSC-EWI-SY0001

Unsupported default value in Snowflake.

### Severity

High

#### Description

Snowflake does not support the use of the following default values.

* CURRENT REMOTE USER
* LAST USER
* CURRENT PUBLISHER

#### Code Examples

##### Input Code:

##### Sybase

```sql
 create table t1
(
  col1 varchar default current remote user,
  col2 varchar default last user,
  col3 varchar default current publisher
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE t1 (
  col1 VARCHAR default
                       !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE CURRENT REMOTE USER IN SNOWFLAKE ***/!!!
                       current remote user,
  col2 VARCHAR default
                       !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE LAST USER IN SNOWFLAKE ***/!!!
                       last user,
  col3 VARCHAR default
                       !!!RESOLVE EWI!!! /*** SSC-EWI-SY0001 - UNSUPPORTED DEFAULT VALUE CURRENT PUBLISHER IN SNOWFLAKE ***/!!!
                       current publisher
)
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0002

Unsupported remote table syntax in Snowflake.

### Severity

High

#### Description

Sybase IQ remote table syntax is not supported in Snowflake.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 CREATE TABLE remote_data(
    remote_id INT
)
AT 'remote_server;remote_db;owner;remote_object';
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE remote_data (
    remote_id INT
)
    !!!RESOLVE EWI!!! /*** SSC-EWI-SY0002 - UNSUPPORTED REMOTE TABLE SYNTAX ***/!!!
AT 'remote_server;remote_db;owner;remote_object'
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "sybase",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0003

Unsupported iq unique constraint in Snowflake.

### Severity

High

#### Description

The IQ UNIQUE constraint specifies an estimate of the number of distinct values in a column. Snowflake does not contain any constraint to emulate this functionality.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 CREATE TABLE T1 (
  DATA VARCHAR IQ UNIQUE(10)
)
;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE TABLE T1 (
  DATA VARCHAR
  !!!RESOLVE EWI!!! /*** SSC-EWI-SY0003 - UNSUPPORTED IQ UNIQUE CONSTRAINT ***/!!!
              IQ UNIQUE(10)
);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0004

Unsupported Syntax Table function can’t receive a query as a parameter.

### Severity

High

#### Description

Snowflake does not support passing RESULTSET as parameter in a table-value function call.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT
*
FROM
MyProcedure(TABLE (SELECT * FROM TABLE1));
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
*
FROM
TABLE(MyProcedure(
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0004 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T RECEIVE A QUERY AS PARAMETER ***/!!!
TABLE(SELECT * FROM TABLE1)));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0005

Unsupported Syntax Table function can’t be used with over expression

### Severity

High

#### Description

Snowflake does not support windows specification on a table-value function call.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT * FROM
MyProcedure(
TABLE (SELECT * FROM TABLE1)
OVER (PARTITION BY Col1 ORDER BY Col2 DESC));
```

##### Generated Code:

##### Snowflake

```sql
         SELECT
          *
        FROM
          TABLE(MyProcedure(
          !!!RESOLVE EWI!!! /*** SSC-EWI-SY0004 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T RECEIVE A QUERY AS PARAMETER ***/!!!
          TABLE(
            SELECT
              *
            FROM
              TABLE1
          )
          !!!RESOLVE EWI!!! /*** SSC-EWI-SY0005 - UNSUPPORTED SYNTAX TABLE FUNCTION CAN'T BE USED WITH OVER EXPRESSION ***/!!!
          OVER (
          PARTITION BY
            Col1
          ORDER BY Col2 DESC)));
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0006

Open string is not supported in Snowflake.

### Severity

High

#### Description

Snowflake does not support [OPENSTRING](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a7749cf084f21015b73b899c1520fb06.html#parameters) functionality.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT * FROM
OPENSTRING (FILE '/path/to/file.txt')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
*
FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-SY0006 - OPEN STRING IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
OPENSTRING (FILE '/path/to/file.txt')
WITH (Col1 INT, Col2 VARCHAR(20)) AS OS;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0007

DML Derived Table not supported in Snowflake.

### Severity

High

#### Description

In Sybase, during execution, the DML statement specified in the dml-derived table is executed first, and the rows affected by that DML materialize into a temporary table whose columns are described by the REFERENCING clause. The temporary table represents the result set of dml-derived-table. Snowflake does not support this behavior.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT * FROM (INSERT INTO TABLE1 (Col1, Col2) VALUES (1, 'test')) REFERENCING (FINAL AS F);
SELECT * FROM (DELETE FROM TABLE1) REFERENCING (FINAL AS F);
SELECT * FROM (UPDATE TABLE1 SET A = 1) REFERENCING (FINAL AS F);
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
  *
FROM
  !!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!!
  (
    INSERT INTO TABLE1 (Col1, Col2) VALUES (1, 'test')
  )
  REFERENCING
  (FINAL AS F);

SELECT
  *
FROM
  !!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!!
  (
    DELETE FROM TABLE1
  )
  REFERENCING
  (FINAL AS F);

SELECT
  *
FROM
  !!!RESOLVE EWI!!! /*** SSC-EWI-SY0007 - DML DERIVED TABLE NOT SUPPORTED IN SNOWFLAKE ***/!!!
  (
    UPDATE TABLE1
      SET
        A = 1
  )
  REFERENCING
  (FINAL AS F);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0008

Contains clause not supported in Snowflake.

### Severity

High

#### Description

In Sybase the [CONTAINS](https://help.sap.com/docs/SAP_IQ/a898e08b84f21015969fa437e89860c8/a7749cf084f21015b73b899c1520fb06.html) clause following a table name to filter the table and return only those rows matching the full text query specified with contains-query. Every matching row of the table is returned together with a score column that can be referred to using score-correlation-name. Snowflake does not support this behavior.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT * FROM TABLE1 CONTAINS (TextColumn, 'search term') AS Score;
```

##### Generated Code:

##### Snowflake

```sql
 SELECT
  *
FROM
  TABLE1
         !!!RESOLVE EWI!!! /*** SSC-EWI-SY0008 - CONTAINS CLAUSE NOT SUPPORTED IN SNOWFLAKE ***/!!!
         CONTAINS(TextColumn,'search term') AS Score;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0009

Key Join not supported in Snowflake.

### Severity

High

#### Description

Snowflake does not support KEY JOIN. When the ON CLAUSE is specified, the KEY keyword is removed and treated as an INNER JOIN.

#### Code Examples

##### Input Code:

##### Sybase

```sql
 SELECT * FROM TABLE1 KEY JOIN Table2 ON Table1.ID = Table2.ID;
SELECT * FROM TABLE1 KEY JOIN Table2;
```

##### Generated Code:

##### Snowflake

```sql
   SELECT
    *
  FROM
    TABLE1
    JOIN
      Table2
      ON Table1.ID = Table2.ID;

  SELECT
    *
  FROM
    TABLE1
    !!!RESOLVE EWI!!! /*** SSC-EWI-SY0009 - KEY JOIN NOT SUPPORTED IN SNOWFLAKE ***/!!!
    KEY JOIN
      Table2;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-SY0010

The temporary option is not supported in Snowflake.

### Severity

High

#### Description

Snowflake does not support the SET TEMPORARY OPTION statement used in Sybase IQ to configure session-level options.

#### Code Examples

##### Input Code:

##### Sybase

```sql
set temporary option chained = 'OFF'
```

##### Generated Code:

##### Snowflake

```sql
--        !!!RESOLVE EWI!!! /*** SSC-EWI-SY0010 - THE TEMPORARY OPTION 'chained' IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
--        set temporary option chained = 'OFF'
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - System Object Naming Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/system-object-naming-validation.md
section: Migrations
---

# SnowConvert AI - System Object Naming Validation

## Description

This validation step verifies the files and folder names that contain reserved words. These files and folders are marked as invalid or out of scope because they can potentially be built-in systems definitions that must be removed from the migration scope. When this behavior happens, the following warning is displayed:

Also, in the ScopeValidation report, you will find information about the failed file(s).

---
title: SnowConvert AI - System Requirements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/system-requirements.md
section: Migrations
---

# SnowConvert AI - System Requirements

Before you start, make sure that your system meets the minimum requirements list given here:

* MacOS

  + Ventura 13.3.1 or higher
  + 4 GB of RAM or higher\*
  + Python ≥ 3.10 or < 3.14 for the [Data Validation feature](../user-guide/data-validation.md)
* Windows

  + Windows 11 or later
  + 4 GB of RAM or higher\*
  + Python ≥ 3.10 or < 3.14 for the [Data Validation feature](../user-guide/data-validation.md)

\*The amount of RAM available affects the speed of assessment and conversion executions and the amount of code that can be processed at once (more is better).

Before you download, if you’re into the legalities of SnowConvert AI, you can view our [End User License Agreement (EULA)](../terms-and-conditions/README.md).

> **Note:**
>
> SnowConvert AI will run using .NET 9.0 and it’s shipped in a self-contained package so you don’t have to install any dependencies.

If you encounter any issues in the download, installation, or setup process, let us know! Send a message to [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com). We’ll get you up and running again.

---
title: SnowConvert AI - Technical Documentation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/README.md
section: Migrations
---

# SnowConvert AI - Technical Documentation

---
title: SnowConvert AI - Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/code-extraction/teradata.md
section: Migrations
---

# SnowConvert AI - Teradata

The first step for migration is getting the code that you need to migrate. There are many ways to extract the code from your database. However, we recommend using the extraction scripts provided by Snowflake.

All the source code for these scripts is open source and is available on [GitHub](https://github.com/Snowflake-Labs/SC.DDLExportScripts/).

## Prerequisites

* Access to a server with a Teradata database.
* Permission to run shell scripts with access to the server.
* Teradata utilities like`bteq / tpt`.

## Installing the scripts

Go to <https://github.com/Snowflake-Labs/SC.DDLExportScripts/>.

From the Code option, select the drop-down and use the **Download ZIP** option to download the code.

Decompress the ZIP file. The code for Teradata should be under the Teradata folder

Follow the [Usage instructions](https://github.com/Snowflake-Labs/SC.DDLExportScripts/blob/main/Teradata/README.md) to modify the files and run them on your system.

### Package the results

When the script is done, the output folder will contain all the DDLs for the migration. You can then compress this folder to use it with [SnowConvert AI](../../../overview.md).

E.g. run:

```none
zip -r output.zip ./output
```

---
title: SnowConvert AI - Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/teradata.md
section: Migrations
---

# SnowConvert AI - Teradata

## What is SnowConvert AI for Teradata?

SnowConvert AI is a software tool that understands [**Teradata SQL**](https://www.teradata.com/), BTEQ, and other Teradata-specific scripts (such as Fastload, Multiload, TPump, and TPT files) and converts this source code into functionally equivalent [**Snowflake**](https://www.snowflake.com/) code.

### Conversion Types

Specifically, SnowConvert AI for Teradata performs the following conversions:

### Teradata SQL to Snowflake SQL

SnowConvert AI understands the Teradata source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake. SnowConvert AI can migrate the source code in any of these three extensions **.sql, .dml, ddl**

### Teradata Stored Procedures to JavaScript Embedded in Snowflake SQL

SnowConvert AI takes Teradata stored procedures (usually written in SQL) and converts them to JavaScript embedded into Snowflake SQL. Teradata’s CREATE PROCEDURE and REPLACE PROCEDURE language is replaced by Snowflake’s CREATE OR REPLACE PROCEDURE language. JavaScript is called as a scripting language, and all of the inner statements are converted to JavaScript.

### Teradata BTEQ, Fastload, Multiload, and TPT to Python

Basic Teradata Query (BTEQ) is Teradata’s proprietary scripting language. All BTEQ script files will be converted to Python scripts. A helper class will be called from the converted scripts to create the functional equivalence between the source and the target. More information about the Python Helpers can be found on our [Translation Reference page](../../../../translation-references/teradata/README.md). BTEQ can be batch run from outside the Snowflake environment. Learn more about how [**you can connect Python scripts directly to Snowflake**](https://docs.snowflake.com/en/user-guide/python-connector.html).

BTEQ files are also the foundation for multiple other proprietary data types that Teradata has created:

* Fastload
* Multiload
* TPUMP

Each one of these file types are extensions of BTEQ. SnowConvert AI translates each one of these file types to Python.

Each of these conversions are optimized to give you the most functionally equivalent output ready for use in Snowflake. For more information on the power of the kind of conversion SnowConvert AI can provide, you can learn more about our tool by visiting our complete [**SQL reference guide**](../../../../translation-references/teradata/README.md).

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *BTEQ (Basic Teradata Query):* BTEQ was the first utility and query tool for Teradata.
* TPT (Teradata Parallel Transporter): TPT is a new generation utility tool that aims to create a one-stop tool for all the activities related to loading and exporting data from/to Teradata databases.
* *SnowConvert AI*: the software that converts securely and automatically your Teradata files to the Snowflake cloud data platform.
* *Conversion rule or transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure to process the conversion rules.

On the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Teradata is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation. If you’re interested in getting more information about SnowConvert AI in general, visit our [**SnowConvert AI for Teradata**](../../../../translation-references/teradata/README.md) information page.

## Translation Sample

*Teradata SQL Statement*

```sql
-- CREATE TABLE DDL
CREATE SET TABLE TABLE1,
    NO BEFORE JOURNAL,
    NO AFTER JOURNAL,
    CHECKSUM = DEFAULT,
    DEFAULT MERGEBLOCKRATIO
(
    COL1 VARCHAR(15) CHARACTER SET LATIN NOT CASESPECIFIC,
    Col2 BYTEINT CHECK ( CurrentFlag  IN (0 ,1 ) ) NOT NULL,
    COL3 DATE FORMAT 'yyyy-mm-dd',
    COL4 BLOB(2097088000),
    COL5 BYTEINT,
    COL7 INTEGER NOT NULL COMPRESS (1 ,2 ,3 ,4),
    COL8 INTERVAL HOUR(2) TO MINUTE
);

-- REPLACE VIEW DDL
REPLACE VIEW VIEW1 AS
SELECT * FROM TABLE1
UNION ALL
SELECT MAX(COL1) FROM TABLE1;
```

*The Converted Snowflake SQL Code*:

```sql
-- CREATE TABLE DDL
--** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
CREATE OR REPLACE TABLE TABLE1
(
    COL1 VARCHAR(15) COLLATE 'en-cs',
    Col2 BYTEINT
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!!
 CHECK ( CurrentFlag  IN (0 ,1 ) ) NOT NULL,
    COL3 DATE,
    COL4 BINARY /*** SSC-FDM-TD0001 - COLUMN CONVERTED FROM BLOB DATA TYPE ***/,
    COL5 BYTEINT,
    COL7 INTEGER NOT NULL,
    COL8 VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL HOUR(2) TO MINUTE DATA TYPE CONVERTED TO VARCHAR ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

-- REPLACE VIEW DDL
CREATE OR REPLACE VIEW VIEW1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
AS
SELECT
    * FROM
    TABLE1
    UNION ALL
    SELECT
    MAX(COL1) FROM
    TABLE1;
```

In this converted SQL you will notice that we are converting many things such as:

* Adding `PUBLIC` Schema by default for all the Table and view names if the user doesn’t specify one (see how to specify a Schema).
* `CREATE SET TABLE` to `CREATE TABLE`
* `REPLACE VIEW` to `CREATE OR REPLACE VIEW`
* Data Types: `BLOB` to `BINARY` and `INTERVAL` to `VARCHAR`
* Data Type Attributes: `NOT CASESPECIFIC` to `COLLATE`
* Removing pieces of the Teradata SQL that are not necessary in Snowflake due to Snowflake’s architecture such as `NO BEFORE JOURNAL`, `NO AFTER JOURNAL`, `CHECKSUM`, `COMPRESS`, and `DEFAULT MERGEBLOCKRATIO`.

---
title: SnowConvert AI - Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/considerations/teradata.md
section: Migrations
---

# SnowConvert AI - Teradata

## Numeric Data Operations

### Calculation Precision

Teradata and Snowflake handle calculations differently:

* Teradata rounds numbers after each calculation step based on the data type:

  + For decimal types, it maintains the larger precision
  + For NUMBER types, it keeps full precision
* Snowflake stores all numbers using the NUMBER data type, maintaining full precision throughout calculations. This can lead to different results compared to Teradata, especially when working with decimal, integer, or float types.

This difference in behavior is not adjusted during code conversion since it’s typically not what developers intend to change.

Teradata: SELECT (1.00/28) \* 15.00 /\* Returns 0.60 \*/

Snowflake will round the result of the division (1.00/28) \* 15.00 to two decimal places:
SELECT (1.00/28) \* 15.00 = 0.535710 = 0.54

### Integer-Integer Division

When dividing two integer values, Teradata performs truncation (floor), while Snowflake performs rounding. To maintain consistent behavior during migration, the automated code conversion automatically adds a TRUNC statement in these cases.

Teradata: SELECT (5/3) = 1 /\* Returns 1 since integer division rounds down \*/

Snowflake: When dividing 5 by 3, the result is 1.6666666, which rounds to 2

Truncated Division in Snowflake: SELECT TRUNC(5/3) returns 1

### Banker Rounding

Teradata offers Banker’s rounding through the ROUNDHALFWAYMAGUP parameter, while Snowflake uses standard rounding methods only.

| SQL | Teradata | Snowflake |
| --- | --- | --- |
| CAST( 1.05 AS DECIMAL(9,1)) | 1.0 | 1.1 |
| CAST( 1.15 AS DECIMAL(9,1)) | 1.2 | 1.2 |
| CAST( 1.25 AS DECIMAL(9,1)) | 1.2 | 1.3 |
| CAST( 1.35 AS DECIMAL(9,1)) | 1.4 | 1.4 |
| CAST( 1.45 AS DECIMAL(9,1)) | 1.4 | 1.5 |
| CAST( 1.55 AS DECIMAL(9,1)) | 1.6 | 1.6 |
| CAST( 1.65 AS DECIMAL(9,1)) | 1.6 | 1.7 |
| CAST( 1.75 AS DECIMAL(9,1)) | 1.8 | 1.8 |
| CAST( 1.85 AS DECIMAL(9,1)) | 1.8 | 1.9 |
| CAST( 1.95 AS DECIMAL(9,1)) | 2.0 | 2.0 |

### Decimal to Integer Conversion

Teradata and Snowflake handle decimal values differently. While Teradata truncates decimal values, Snowflake rounds them to the nearest integer. To maintain consistency with Teradata’s behavior, the conversion process automatically adds a TRUNC statement.

| SQL | Teradata | Snowflake |
| --- | --- | --- |
| CAST( 1.0 AS INTEGER) | 1 | 1 |
| CAST( 1.1 AS INTEGER) | 1 | 1 |
| CAST( 1.2 AS INTEGER) | 1 | 1 |
| CAST( 1.3 AS INTEGER) | 1 | 1 |
| CAST( 1.4 AS INTEGER) | 1 | 1 |
| CAST( 1.5 AS INTEGER) | 1 | 2 |
| CAST( 1.6 AS INTEGER) | 1 | 2 |
| CAST( 1.7 AS INTEGER) | 1 | 2 |
| CAST( 1.8 AS INTEGER) | 1 | 2 |
| CAST( 1.9 AS INTEGER) | 1 | 2 |

### Number without Precision/Scale

When a Teradata NUMBER column is defined without specifying scale or precision, it can store decimal values with varying scale (from 0 to 38), as long as the total precision stays within 38 digits. However, Snowflake requires fixed scale and precision values for NUMBER columns. Here’s an example of how numbers are defined in a Teradata table with this flexible format:

```sql
CREATE MULTISET TABLE DATABASEXYZ.TABLE_NUMS
     (NUM_COL1 NUMBER(*),
      NUM_COL2 NUMBER,
      NUM_COL3 NUMBER(38,*));
```

The following table shows two examples of values that exceed Snowflake’s column size limits. These values could appear in any of the previously shown Teradata columns.

Value 1: 123,345,678,901,234,567,891,012.0123456789

Value 2: 123.12345678901234567890

These numeric values would require a NUMBER(42, 20) data type, which exceeds Snowflake’s maximum precision limit of 38. Snowflake is currently working on implementing flexible precision and scale functionality.

### Truncation on INSERT for SQL DML Statements

Teradata automatically truncates string values that exceed the defined field length during insertion. While SnowConvert AI maintains the same field lengths during conversion (for example, VARCHAR(20) remains VARCHAR(20)), Snowflake does not automatically truncate oversized strings. If your data ingestion process depends on automatic truncation, you will need to manually modify it by adding a LEFT() function. SnowConvert AI intentionally does not add truncation automatically due to the potential impact across the entire codebase.

### Float Default Issue Example:

```sql
/* <sc-table> TABLE DUMMY.EXAMPLE </sc-table> */
/**** WARNING: SET TABLE FUNCTIONALITY NOT SUPPORTED ****/
CREATE TABLE DUMMY.PUBLIC.EXAMPLE (
LOGTYPE INTEGER,
OPERSEQ INTEGER DEFAULT 0,
RUNTIME FLOAT /**** ERROR: DEFAULT CURRENT_TIME NOT VALID FOR DATA TYPE ****/
);
```

### Float Data Aggregation

Floating-point numbers are approximate representations of decimal values. Due to these approximations, different database systems may produce slightly different results when performing calculations and aggregations with float data types. This variation occurs because each database system handles floating-point arithmetic and rounding in its own way.

## Other Considerations

### Join Elimination

Snowflake executes SQL queries exactly as written, including all specified joins, regardless of whether they affect the final results. Unlike Snowflake, Teradata can automatically remove unnecessary joins by using primary and foreign key relationships defined in the table structure. This feature in Teradata primarily helps prevent poorly written queries, and it’s usually only a concern when code was specifically written to use this capability. If your existing code was designed to take advantage of Teradata’s join elimination feature, automated code conversion tools cannot address this limitation. In such cases, you may need to redesign parts of your solution.

**Using Window Functions with max() and order by**

#### Teradata behavior and defaults:

**Default**: When an ORDER BY clause is present but no ROWS or ROWS BETWEEN clause is specified, Teradata SQL window aggregate functions automatically use **ROWS BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING**.

#### Snowflake behavior and defaults:

**Default**: When you use a window aggregate function with an ORDER BY clause but without specifying ROWS or ROWS BETWEEN, Snowflake automatically applies **ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW** as the window frame.

**Example:**

Below is a sample table named TEST_WIN that shows employee salary data across various departments.

| DEPT_NM | DEPT_NO | EMP_NO | SALARY |
| --- | --- | --- | --- |
| SALES | 10 | 11 | 5000 |
| SALES | 10 | 12 | 6000 |
| HR | 20 | 21 | 1000 |
| HR | 20 | 22 | 2000 |
| PS | 30 | 31 | 7000 |
| PS | 30 | 32 | 9000 |

The following code, when executed in Teradata, calculates the highest salary among all employees, grouped by department.

```sql
SELECT DEPT_NM, SALARY ,DEPT_NO,
MAX(SALARY) OVER ( ORDER BY DEPT_NO  ) AS MAX_DEPT_SALARY
FROM TEST_WIN;
```

| DEPT_NM | SALARY | DEPT_NO | MAX_DEPT_SALARY |
| --- | --- | --- | --- |
| SALES | 6000 | 10 | 9000 |
| SALES | 5000 | 10 | 9000 |
| HR | 2000 | 20 | 9000 |
| HR | 1000 | 20 | 9000 |
| PS | 7000 | 30 | 9000 |
| PS | 9000 | 30 | 9000 |

When executing the converted code using Snowflake-SnowConvert AI, you may notice different results (highlighted values). These differences are expected and align with Snowflake’s default settings.

```sql
SELECT DEPT_NM, SALARY ,DEPT_NO,
MAX(SALARY) OVER ( ORDER BY DEPT_NO  ) AS MAX_DEPT_SALARY
FROM TEST_WIN;
```

| DEPT_NM | SALARY | DEPT_NO | MAX_DEPT_SALARY |
| --- | --- | --- | --- |
| SALES | 5000 | 10 | 6000 |
| SALES | 6000 | 10 | 6000 |
| HR | 1000 | 20 | 6000 |
| HR | 2000 | 20 | 6000 |
| PS | 7000 | 30 | 9000 |
| PS | 9000 | 30 | 9000 |

To achieve identical results as in Teradata, you must specify the ROWS/RANGE value as shown in the code below.

```sql
SELECT DEPT_NM, SALARY ,DEPT_NO,
MAX(SALARY) OVER ( ORDER BY DEPT_NO RANGE BETWEEN UNBOUNDED PRECEDING AND UNBOUNDED FOLLOWING) AS MAX_DEPT_SALARY
FROM TEST WIN;
```

| DEPT_NM | SALARY | DEPT_NO | MAX_DEPT_SALARY |
| --- | --- | --- | --- |
| SALES | 5000 | 10 | 9000 |
| SALES | 6000 | 10 | 9000 |
| HR | 1000 | 20 | 9000 |
| HR | 2000 | 20 | 9000 |
| PS | 7000 | 30 | 9000 |
| PS | 9000 | 30 | 9000 |

The RANGE/ROWS clause explicitly defines how rows are ordered. You can achieve similar results by removing the ORDER BY clause completely.

## References

Snowflake: <https://docs.snowflake.com/en/sql-reference/functions-analytic.html>
Teradata: <https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/dIV_fAtkK3UeUIQ5_uucQw>

---
title: SnowConvert AI - Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/teradata.md
section: Migrations
---

# SnowConvert AI - Teradata

## Specific CLI arguments

The following CLI arguments are specific for executing migrations with **SnowConvert AI for Teradata**

### `--displaceDatabaseAsSchema`

This flag must be used with the `-s` parameter. When used it will maintain Teradata’s database name qualification as Snowflake’s data warehouse, contrary to its default behavior where it becomes a schema on Snowflake code. Let’s look at an example where `-s customSchema` is included:

```sql
SELECT * FROM databaseName.tableName;
```

```sql
-- Additional Params: -s customSchema
SELECT
* FROM
customSchema.tableName;
```

```sql
-- Additional Params: -s customSchema --displaceDatabaseAsSchema
SELECT
* FROM
databaseName.customSchema.tableName;
```

#### `--CharacterToApproximateNumber <NUMBER>`

An integer value for the CHARACTER to Approximate Number transformation (Default: `10`).

#### `--DefaultDateFormat <STRING>`

String value for the Default DATE format (Default: `"YYYY/MM/DD"`).

#### `--DefaultTimeFormat <STRING>`

String value for the Default TIME format (Default: `"HH:MI:SS.FF6"`).

#### `--DefaultTimestampFormat <STRING>`

String value for the Default TIMESTAMP format (Default: `"YYYY/MM/DD HH:MI:SS.FF6"`).

#### `--DefaultTimezoneFormat <STRING>`

String value for the Default TIMEZONE format (Default: `"GMT-5"`).

#### `-p, --scriptTargetLanguage <TARGET_LANGUAGE>`

The string value specifies the target language to convert Bteq and Mload script files. Currently supported values are **SnowScript** and **Python**. The default value is set to **Python**.

#### `-n, --SessionMode <SESSION_MODE>`

SnowConvert AI handles Teradata code in both TERA and ANSI modes. Currently, this is limited to the default case specification of character data and how it affects comparisons.

The string value specifies the Session Mode of the input code. Currently supported values are **TERA** and **ANSI**. The default value is set to **TERA**.

You can learn more about how SnowConvert AI handles and converts code depending on the session mode, check here.

#### `--replaceDeleteAllToTruncate`

Flag to indicate whether Delete All statements must be replaced with Truncate or not. This will generate SSC-EWI-TD0037 when the replacement is done. Example:

```sql
create table testTable(
    column1 varchar(30)
);

delete testTable all;

delete from testTable;
```

```sql
CREATE OR REPLACE TABLE testTable (
    column1 varchar(30)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

DELETE FROM testTable;

DELETE FROM
    testTable;
```

```sql
-- Additional Params: --replaceDeleteAllToTruncate
CREATE OR REPLACE TABLE testTable (
    column1 varchar(30)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

TRUNCATE TABLE testTable;

DELETE FROM
    testTable;
```

#### `--generateStoredProcedureTags`

Flag to indicate whether the SQL statements SELECT, INSERT, CREATE, DELETE, UPDATE, DROP, MERGE in Stored Procedures will be tagged on the converted code. This feature is used for easy statement identification on the migrated code. Wrapping these statements within these XML-like tags allows for other programs to quickly find and extract them. The decorated code looks like this:

```sql
//<SQL_DELETE
EXEC(DELETE FROM SB_EDP_SANDBOX_LAB.PUBLIC.USER_LIST,[])
//SQL_DELETE!>
```

#### `--splitPeriodDatatype`

This flag is used to indicate that the tool should migrate any use of the `PERIOD` datatype as two separate `DATETIME` fields that will hold the original period begin and end values, anytime a period field or function is migrated using this flag SSC-FDM-TD0004 will be added to warn about this change.

```sql
CREATE TABLE myTable(
   col1 PERIOD(DATE),
   col2 VARCHAR(50),
   col3 PERIOD(TIMESTAMP)
);
```

```sql
CREATE OR REPLACE TABLE myTable (
   col1 VARCHAR(24) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!,
   col2 VARCHAR(50),
   col3 VARCHAR(58) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

```sql
-- Additional Params: --splitPeriodDatatype
CREATE OR REPLACE TABLE myTable (
   col1_begin DATE,
   col1_end DATE /*** SSC-FDM-TD0004 - PERIOD DATA TYPES ARE HANDLED AS TWO DATA FIELDS ***/,
   col2 VARCHAR(50),
   col3_begin TIMESTAMP,
   col3_end TIMESTAMP /*** SSC-FDM-TD0004 - PERIOD DATA TYPES ARE HANDLED AS TWO DATA FIELDS ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### `--arrange`

Flag to indicate whether the input code should be processed before parsing and transformation.

#### `--RenamingFile`

The path to a .json file that specifies new names for certain objects such as Tables, Views, Procedures, Functions, and Macros. This parameter can’t be used with the `customSchema` argument. Navigate to the [Renaming Feature](renaming-feature.md) to learn more about this argument.

#### `--UseCollateForCaseSpecification`

This flag indicates whether to use COLLATE or UPPER to preserve Case Specification functionality, for example, CASESPECIFIC or NOT CASESPECIFIC. By default, it is turned off, meaning that the UPPER function will be used to emulate case insensitivity (NOT CASESPECIFIC). To learn more about how Case Specification is handled by SnowConvert AI check here.

---
title: SnowConvert AI - Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/README.md
section: Migrations
---

# SnowConvert AI - Teradata

Translation specification for Teradata grammar syntax

This page provides a comprehensive reference for how SnowConvert AI translates Teradata grammar elements to Snowflake equivalents. In this translation reference, you will find, code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Teradata - BTEQ
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/bteq-translation.md
section: Migrations
---

# SnowConvert AI - Teradata - BTEQ

Translation references to convert Teradata BTEQ files to Python

Basic Teradata Query (BTEQ) is a general-purpose, command-based program that enables users on a workstation to communicate with one or more Teradata Database systems, and to format reports for both print and screen output.

To simulate the BTEQ functionality for Teradata in Snowflake, BTEQ files and commands are transformed to Python code, similar to the transformations performed for MultiLoad and FastLoad scripts. The generated code uses the Snowflake Python project called [snowconvert.helpers](snowconvert-script-helpers.md) which contains the required functions to simulate the BTEQ statements in Snowflake.

## BTEQ Commands Translation

The following table presents the conversion for the BTEQ commands to Snowflake.

| Teradata | Snowflake | Notes |
| --- | --- | --- |
| ERRORCODE != 0 | snowconvert.helpers.error_code != 0 |  |
| .EXPORT DATA FILE=fileName | Export.report(“fileName”, “,”) | The function has no functionality |
| .EXPORT INDICDATA FILE=fileName | Export.report(“fileName”, “,”) | The function has no functionality |
| .EXPORT REPORT FILE=fileName | Export.report(“fileName”, “,”) | The function has no functionality |
| .EXPORT DIF FILE=fileName | Export.report(“fileName”, “,”) | The function has no functionality |
| .EXPORT RESET | Export.reset() | The function has no functionality |
| .IF ERRORCODE != 0 THEN .QUIT ERRORCODE | If snowconvert.helpers.error_code != 0: snowconvert.helpers.quit_application (snowconvert.helpers.error_code) |  |
| .IMPORT RESET | snowconvert.helpers.import_reset() | The function has no functionality |
| .LABEL newLabel | def NEWLABEL(): snowconvert.helpers.quit_application() |  |
| .LOGOFF | The statement is commented |  |
| .LOGON | The statement is commented |  |
| .LOGMECH | The statement is commented |  |
| .OS /fs/fs01/bin/filename.sh ‘load’ | snowconvert.helpers.os(“”/fs/fs01/bin/filename.sh ‘load’ “”) |  |
| .RUN FILE=newFile | for statement in snowconvert.helpers.readrun(“newFile”): eval(statement) |  |
| .SET DEFAULTS | Export.defaults() | The function has no functionality |
| .SET ERRORLEVEL 3807 SEVERITY 0 | snowconvert.helpers.set_error_level(3807, 0) |  |
| .SET RECORMODE OFF | Export.record_mode(False) |  |
| .SET RECORMODE ON | Export.record_mode(True) |  |
| .SET SEPARATOR ‘|’ | Export.separator_string(‘|’) | The function has no functionality |
| .SET WIDTH 120 | Export.width(120) | The function has no functionality |
| .Remark “”Hello world!””” | snowconvert.helpers.remark(r””””””Hello world!””””””) |  |
| .QUIT ERRORCODE | snowconvert.helpers.quit_application(  snowconvert.helpers.error_code  ) |  |
| .QUIT | snowconvert.helpers.quit_application() |  |
| SQL statements | exec(statement) |  |
| $(<$INPUT_SQL_FILE) | exec_file(“$INPUT_SQL_FILE”) |  |
| = (Repeat previous command) | snowconvert.helpers.repeat_previous_sql_statement(con) |  |

For more complicated statements presented in the previous table, subsections with detailed examples are provided below.

### .GOTO Conversion

Since we are converting BTEQ scripts to Python, certain structures that are valid in BTEQ are not inherently supported in Python. This is the case for the `.GOTO` command using the `.Label` commands.

For this reason, an alternative has been developed so that the functionality of these commands can be emulated, turning the `.Label` commands into functions with subsequent call statements.

Check the following code:

```sql
 .LABEL FIRSTLABEL
SELECT * FROM MyTable1;
.LABEL SECONDLABEL
SELECT * FROM MyTable2;
SELECT * FROM MyTable3;
```

In the example above, there were five commands. Two of them were`.Label`commands. The command`FIRSTLABEL`was transformed into a function with the statement(s) that follow it below until another`.LABEL`command is found. When another label is called (in this case, `SECONDLABEL`), that call ends the first function and starts a new one.

If we were to migrate the previous example, the result would be:

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  FIRSTLABEL()
  snowconvert.helpers.quit_application()
def FIRSTLABEL():
  exec("""
    SELECT
      *
    FROM
      MyTable1
    """)
  SECONDLABEL()
def SECONDLABEL():
  exec("""
    SELECT
      *
    FROM
      MyTable2
    """)
  exec("""
    SELECT
      *
    FROM
      MyTable3
    """)

if __name__ == "__main__":
  main()
```

*Notice there is a call to the function*`FIRSTLABEL`*, this function has only one statement, which would be the only non-label command that follows`FIRSTLABEL`in the original code. Before the`FIRSTLABEL`function ends, it calls* `SECONDLABEL`*, with the statements that followed it.*

> * *Notes:*
>
>   + Creating a connector variable `con = None`, and populating it in the `main()` function: `con = snowconvert.helpers.log_on()`.
>   + Setting up a log: `snowconvert.helpers.configure_log()`.

### Execute Query Statements

Every SQL statement found in a BTEQ file will be executed through the`exec`function provided by the [snowconvert.helpers](snowconvert-script-helpers.md). Take for example the following code:

```sql
 CREATE TABLE aTable (aColumn BYTEINT);
```

This is converted to:

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    CREATE OR REPLACE TABLE aTable (
      aColumn BYTEINT
    )
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### Execute Script Files

Files that contain a user’s BTEQ commands and Teradata SQL statements are called scripts, run files, macros, or stored procedures. For example, create a file called SAMPFILE, and enter the following BTEQ script:

```sql
    .LOGON tdpid/userid,password
   SELECT * FROM department;
   .LOGOFF
```

To execute the run file, enter either form of the BTEQ RUN command:

```none
.RUN FILE=sampfile
```

If you convert the second code, the result is the following:

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()

  for statement in snowconvert.helpers.readrun(fr"sampfile"):
    eval(statement)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

The `snowconvert.helpers.readrun("sampfile")` will return each line from the SAMPFILE and in the`FOR`statement, each one of the lines will be passed to the `eval` function, a method that parses the expression passed to it and runs python expression (the SAMPFILE should be converted to work) within the program.

### Execute SQL Files

In some instances during the execution of a BTEQ file a SQL file can be found, take for example the SQL file called NEWSQL:

```sql
 CREATE TABLE aTable (aColumn BYTEINT);
```

This can be executed during a script with the following line:

```sql
 $(<$NEWSQL)
```

And after the conversion of the script the result is:

```sql
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    CREATE OR REPLACE TABLE aTable (
      aColumn BYTEINT
    )
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

The `exec_file` helper function will read each line from the NEWSQL file and then use the exec function as explained in the section Execute query statement.

## Known Issues

No issues were found.

## Related EWIs

No issues were found.

## REPEAT

Translation specification for the REPEAT statement.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

As per Teradata’s [documentation](https://docs.teradata.com/r/Basic-Teradata-Query-Reference/October-2018/BTEQ-Commands/BTEQ-Command-Descriptions/REPEAT), the REPEAT statement enables users to specify the maximum number of times the next SQL request is to be submitted. Note that a SQL request can be a single or multi-statement request. This is defined by the position of the semicolons for each statement following the REPEAT statement.

### Syntax

```none
REPEAT [ n [ PACK p [ REQBUFLEN b ] ] | * | RECS r]
<sql_request>
```

### Sample Source Patterns

With this input data:

#### inputData.dat

```none
A B C
D E F
G H I
```

##### inputData2.dat

```none
* [
] *
```

#### Teradata:

##### Query

```sql
 .IMPORT DATA FILE = inputData.dat;
.REPEAT *
USING var_1 (CHARACTER), var_2 (CHARACTER), var_3 (CHARACTER)
INSERT INTO testtabu (c1) VALUES (:var_1)
;INSERT INTO testtabu (c1) VALUES (:var_2)
;INSERT INTO testtabu (c1) VALUES (:var_3)
;UPDATE testtabu
   SET c2 = 'X'
   WHERE c1 = :var_1
;UPDATE testtabu
   SET c2 = 'Y'
   WHERE c1 = :var_2
;UPDATE testtabu
   SET c2 = 'Z'
   WHERE c1 = :var_3
;INSERT INTO TESTTABU (c1, c2) VALUES ('?','_');

.REPEAT 10
INSERT INTO TESTTABU2 VALUES ('John Doe', 23);

.REPEAT RECS 5
INSERT INTO TESTTABU2 VALUES ('Bob Alice', 21);

.IMPORT DATA FILE = inputData2.dat;
USING (var_1 CHARACTER, var_2 CHARACTER)
INSERT INTO testtabu (c1) VALUES (:var_1)
;INSERT INTO testtabu (c1) VALUES (:var_2);
```

##### TESTTABU Result

| C1 | C2 |
| --- | --- |
| A | X |
| D | X |
| G | X |
| B | Y |
| E | Y |
| H | Y |
| C | Z |
| F | Z |
| I | Z |
| ? | _ |
| ? | _ |
| ? | _ |
| \* | null |
| [ | null |

##### TESTTABU2 Result

| MY_NAME | MY_AGE |
| --- | --- |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |

#### Snowflake:

##### Query

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  snowconvert.helpers.import_file(fr"inputData.dat")
  ssc_repeat_value = '*'
  ssc_max_iterations = 1

  for ssc_repeat_position in range(0, ssc_max_iterations):

    if ssc_repeat_position == 0:
      using = snowconvert.helpers.using("var_1", "CHARACTER", "var_2", "CHARACTER", "var_3", "CHARACTER", rows_to_read = ssc_repeat_value)
      exec("""
        INSERT INTO testtabu (c1)
        VALUES (:var_1)
        """, using = using)
      exec("""
        INSERT INTO testtabu (c1)
        VALUES (:var_2)
        """, using = using)
      exec("""
        INSERT INTO testtabu (c1)
        VALUES (:var_3)
        """, using = using)
      exec("""
        UPDATE testtabu
          SET
            c2 = 'X'
          WHERE
            c1 = :var_1
        """, using = using)
      exec("""
        UPDATE testtabu
          SET
            c2 = 'Y'
          WHERE
            c1 = :var_2
        """, using = using)
      exec("""
        UPDATE testtabu
          SET
            c2 = 'Z'
          WHERE
            c1 = :var_3
        """, using = using)
      exec("""
        INSERT INTO TESTTABU (c1, c2)
        VALUES ('?', '_')
        """, using = using)

  ssc_repeat_value = 10
  ssc_max_iterations = 10

  for ssc_repeat_position in range(0, ssc_max_iterations):
    exec("""
      INSERT INTO TESTTABU2
      VALUES ('John Doe', 23)
      """)
  ssc_repeat_value = 5
  ssc_max_iterations = 5

  for ssc_repeat_position in range(0, ssc_max_iterations):
    exec("""
      INSERT INTO TESTTABU2
      VALUES ('Bob Alice', 21)
      """)
  snowconvert.helpers.import_file(fr"inputData2.dat")
  using = snowconvert.helpers.using("var_1", "CHARACTER", "var_2", "CHARACTER", rows_to_read = 1)
  exec("""
    INSERT INTO testtabu (c1)
    VALUES (:var_1)
    """, using = using)
  exec("""
    INSERT INTO testtabu (c1)
    VALUES (:var_2)
    """, using = using)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

##### TESTTABU Result

| C1 | C2 |
| --- | --- |
| A | X |
| D | X |
| G | X |
| B | Y |
| E | Y |
| H | Y |
| C | Z |
| F | Z |
| I | Z |
| ? | _ |
| ? | _ |
| ? | _ |
| \* | null |
| [ | null |

##### TESTTABU2 Result

| MY_NAME | MY_AGE |
| --- | --- |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| John Doe | 23 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |
| Bob Alice | 21 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## USING REQUEST MODIFIER

Translation specification for the USING REQUEST MODIFIER query.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

As per Teradata’s [documentation](https://docs.teradata.com/r/SQL-Data-Manipulation-Language/September-2020/Statement-Syntax/USING-Request-Modifier/USING-Request-Modifier-Syntax), the USING REQUEST MODIFIER defines one or more variable parameter names to be used in the subsequent `SELECT`, `INSERT`, `UPDATE`, or `DELETE` statements to import or export data.

The syntax for this statement is as follows:

```sql
 USING ( <using_spec> [,...] ) SQL_request

<using_spec> ::= using_variable_name data_type [ data_type_attribute [...] ]
  [ AS { DEFERRED [BY NAME] | LOCATOR } ]
```

As stated in Teradata’s documentation, the USING REQUEST MODIFIER needs to be preceded by an .IMPORT statement for it to load the data into the defined parameters.

Thus, the transformation for this statement follows these steps:

1. Call the `import_file()` function from the SnowConvert AI Helpers. This loads the data into a temporary file.
2. Call the `using()` function from the SnowConvert AI Helpers to create a dictionary with the loaded data.
3. For each query, run the `exec()` function from the SnowConvert AI Helpers and pass the previously defined dictionary. This will use Snowflake Python Connector data binding capabilities.

With this input data:

```none
 A,B,C
```

**Teradata (MultiLoad)**

### Query

```xml
 .IMPORT DATA FILE = inputData.dat;
USING var_1 (CHARACTER), var_2 (CHARACTER), var_3 (CHARACTER)
INSERT INTO testtabu (c1) VALUES (:var_1)
;INSERT INTO testtabu (c1) VALUES (:var_2)
;INSERT INTO testtabu (c1) VALUES (:var_3)
;UPDATE testtabu
   SET c2 = 'X'
   WHERE c1 = :var_1
;UPDATE testtabu
   SET c2 = 'Y'
   WHERE c1 = :var_2
;UPDATE testtabu
   SET c2 = 'Z'
   WHERE c1 = :var_3;
```

#### Result

| ROW | C1 | C2 |
| --- | --- | --- |
| 1 | A | X |
| 2 | B | Y |
| 3 | C | Z |

**Snowflake (Python)**

##### Query

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  snowconvert.helpers.import_file(fr"inputData.dat")
  using = snowconvert.helpers.using("var_1", "CHARACTER", "var_2", "CHARACTER", "var_3", "CHARACTER", rows_to_read = 1)
  exec("""
    INSERT INTO testtabu (c1)
    VALUES (:var_1)
    """, using = using)
  exec("""
    INSERT INTO testtabu (c1)
    VALUES (:var_2)
    """, using = using)
  exec("""
    INSERT INTO testtabu (c1)
    VALUES (:var_3)
    """, using = using)
  exec("""
    UPDATE testtabu
      SET
        c2 = 'X'
      WHERE
        c1 = :var_1
    """, using = using)
  exec("""
    UPDATE testtabu
      SET
        c2 = 'Y'
      WHERE
        c1 = :var_2
    """, using = using)
  exec("""
    UPDATE testtabu
      SET
        c2 = 'Z'
      WHERE
        c1 = :var_3
    """, using = using)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

##### Result

| ROW | C1 | C2 |
| --- | --- | --- |
| 1 | A | X |
| 2 | B | Y |
| 3 | C | Z |

### Known Issues

**1. .REPEAT command is not yet supported**

The `.REPEAT` command is not yet supported. This means that the USING REQUEST MODIFIER will only use the data loaded from the first row of the input file. Thus, the queries will only run once.

This issue should be fixed when the .REPEAT command receives proper transformation support.

If you have any additional questions regarding this documentation, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - BTEQ
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/bteq.md
section: Migrations
---

# SnowConvert AI - Teradata - BTEQ

Translation references to convert Teradata BTEQ files to Snowflake SQL

## Description

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> Basic Teradata Query (BTEQ) is a general-purpose, command-based program that enables users on a workstation to communicate with one or more Teradata Database systems and to format reports for both print and screen output.

For more information, see the [Teradata BTEQ Reference](https://docs.teradata.com/r/Basic-Teradata-Query-Reference/October-2018).

### Sample Source Patterns

#### 1. Basic BTEQ Example

The BTEQ content is relocated within an `EXECUTE IMMEDIATE` block to transfer the BTEQ script functionality to Snowflake SQL executable code.

All the DML and DDL statements inside BTEQ scripts are supported by SnowConvert AI and successfully translated to Snowflake SQL. The commands that do not have support yet, or do not have support at all, are being marked with a warning message and commented out.

##### Teradata BTEQ

```sql
 -- Additional Params: -q SnowScript
.LOGON 0/dbc,dbc;
   DATABASE tduser;

   CREATE TABLE employee_bkup (
      EmployeeNo INTEGER,
      FirstName CHAR(30),
      LastName CHAR(30),
      DepartmentNo SMALLINT,
      NetPay INTEGER
   )
   Unique Primary Index(EmployeeNo);

   DROP TABLE employee_bkup;

   .IF ERRORCODE <> 0 THEN .EXIT ERRORCODE;
.LOGOFF;
```

##### Snowflake SQL

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    -- Additional Params: -q SnowScript
    --.LOGON 0/dbc,dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      USE DATABASE tduser;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      CREATE OR REPLACE TABLE employee_bkup (
        EmployeeNo INTEGER,
        FirstName CHAR(30),
        LastName CHAR(30),
        DepartmentNo SMALLINT,
        NetPay INTEGER,
        UNIQUE (EmployeeNo)
      );
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      DROP TABLE employee_bkup;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    IF (STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/ != 0) THEN
      RETURN STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/;
    END IF;
    --.LOGOFF
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'LogOff' NODE ***/!!!
    null;
  END
$$
```

#### 2. Bash Variable Placeholders Example

SnowConvert AI supports the migration of BTEQ code with Bash Variable Placeholders used for shell scripts, these placeholders will be migrated to its SnowSQL equivalent and [SSC-FDM-TD0003](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) will be added to the code. Please consider the following when migrating code with these placeholders:

* SnowConvert AI does **not** support the migration of shell scripts, to migrate the BTEQ code please isolate it in a BTEQ file and supply it as input for the tool.
* SnowSQL with variable substitution enabled is required to execute the migrated code, for more information on how to use SnowSQL please check [SSC-FDM-TD0003](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) and the [official documentation for SnowSQ](https://docs.snowflake.com/en/user-guide/snowsql-use.html#using-snowsql)L.

##### Teradata BTEQ

```sql
 -- Additional Params: -q SnowScript
.LOGON dbc, dbc;

DATABASE testing;

SELECT $columnVar FROM $tableVar WHERE col2 = $nameExprVar;
INSERT INTO $tableName values ('$myString', $numValue);
UPDATE $dbName.$tableName SET col1 = $myValue;
DELETE FROM $tableName;

.LOGOFF;
```

##### Snowflake SQL

```sql
 EXECUTE IMMEDIATE
$$
  --** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    -- Additional Params: -q SnowScript
    --.LOGON dbc, dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      USE DATABASE testing;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      SELECT
        &columnVar
      FROM
        &tableVar
      WHERE
        col2 = &nameExprVar;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      INSERT INTO &tableName
      VALUES ('&myString', &numValue);
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      UPDATE &dbName.&tableName
        SET
          col1 = &myValue
        ;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      DELETE FROM
        &tableName;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    --.LOGOFF
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'LogOff' NODE ***/!!!
    null;
  END
$$
```

#### 3. `.IF`, `.GOTO`, and `.LABEL` (Snowflake Scripting)

When the BTEQ script target is **Snowflake Scripting** (SnowScript), SnowConvert AI translates **`.IF`**, **`.GOTO`**, and **`.LABEL`** by modeling jumps as **nested procedure calls** and **early returns** inside a single **`EXECUTE IMMEDIATE $$ … $$`** block. Snowflake does not provide BTEQ-style goto/label semantics.

| BTEQ construct | Snowflake Scripting approach |
| --- | --- |
| Script body with labels | Wrapped in `EXECUTE IMMEDIATE $$ … $$` with `DECLARE`, nested procedures, and a top-level `BEGIN … END` |
| `.LABEL name` | Section `name` becomes a **nested procedure**; handoffs use `CALL name();` |
| `.GOTO name` | `CALL name();` followed by `RETURN 'PROCESS FINISHED';` |
| `.IF ERRORCODE …` | `IF (STATUS_OBJECT['SQLCODE'] …)` using a status object updated in `EXCEPTION` handlers |
| `.IF ACTIVITYCOUNT …` | `IF` on row count from `TABLE(RESULT_SCAN(LAST_QUERY_ID()))` |
| Error/status tracking | `STATUS_OBJECT` holds status fields; generated SQL may include **SSC-FDM-TD0013** (Snowflake `SQLCODE` is not the same as Teradata `ERRORCODE`) |

> **Note:**
>
> To produce output in this form, set the Teradata/BTEQ **script output** to **Snowflake Scripting** (for example `-- Additional Params: -q SnowScript` in the samples below). Option names vary by interface; see the SnowConvert AI user guide for your product version.

The following example shows **`.IF ERRORCODE`** with **`.GOTO`** and **`.LABEL`**. Teradata BTEQ often branches on **`ERRORCODE`** after DDL and jumps to a labeled cleanup or next step.

##### Teradata BTEQ

```sql
 -- Additional Params: -q SnowScript
DROP TABLE DP_DWEDW.TF035_PCOM_PROD_TRAT_SEL;
.IF ERRORCODE <> 0 THEN .GOTO CRIA_EXPRESS_1;
.LABEL CRIA_EXPRESS_1
DROP TABLE DP_DWEDW.TF035_DESCTO_EXPRESS;
.IF ERRORCODE <> 0 THEN .GOTO CRIA_EXPRESS_2;
.LABEL CRIA_EXPRESS_2
DROP TABLE DP_DWEDW.TF035_DEVOL_EXPRESS;
```

##### Snowflake SQL

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    SC_EXIT_CODE VARCHAR := 0;
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0) /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        DROP TABLE DP_DWEDW.TF035_PCOM_PROD_TRAT_SEL;
        IF (STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/ != 0) THEN
          CALL CRIA_EXPRESS_1();
          RETURN 'PROCESS FINISHED';
        END IF;
        CALL CRIA_EXPRESS_1();
      EXCEPTION
        WHEN OTHER CONTINUE THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
    CRIA_EXPRESS_1 PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        DROP TABLE DP_DWEDW.TF035_DESCTO_EXPRESS;
        IF (STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/ != 0) THEN
          CALL CRIA_EXPRESS_2();
          RETURN 'PROCESS FINISHED';
        END IF;
        CALL CRIA_EXPRESS_2();
      EXCEPTION
        WHEN OTHER CONTINUE THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
    CRIA_EXPRESS_2 PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        DROP TABLE DP_DWEDW.TF035_DEVOL_EXPRESS;
      EXCEPTION
        WHEN OTHER CONTINUE THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN SC_EXIT_CODE;
  END
$$
;
```

After each statement that can set SQL state, the generated `IF` checks `STATUS_OBJECT['SQLCODE']`. On error, the script **calls** the target label procedure and **returns** from the current procedure so later statements in that section do not run. On success, it **calls** the next section’s procedure to continue the original linear flow.

The next example shows **`.IF ActivityCount = 0 THEN .GOTO …`**, expressed in Snowflake Scripting using the **last query id** and **`RESULT_SCAN`**.

##### Teradata BTEQ

```sql
 -- Additional Params: -q SnowScript
.IF ActivityCount = 0 THEN .GOTO Continue_No_Rejects_00
DROP TABLE DROPTEDTABLE1;
.LABEL Continue_No_Rejects_00
SELECT A FROM AUDITORIA;
```

##### Snowflake SQL

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    SC_EXIT_CODE VARCHAR := 0;
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0) /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/;
    SC_PROCESS PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        IF ((SELECT
          COUNT(*)
        FROM
          TABLE(RESULT_SCAN(LAST_QUERY_ID()))) = 0) THEN
          CALL Continue_No_Rejects_00();
          RETURN 'PROCESS FINISHED';
        END IF;
        DROP TABLE DROPTEDTABLE1;
        CALL Continue_No_Rejects_00();
      EXCEPTION
        WHEN OTHER CONTINUE THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
    Continue_No_Rejects_00 PROCEDURE ()
    RETURNS VARCHAR
    AS
      BEGIN
        SELECT
          A
        FROM
          AUDITORIA;
      EXCEPTION
        WHEN OTHER CONTINUE THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
  BEGIN
    CALL SC_PROCESS();
    RETURN SC_EXIT_CODE;
  END
$$
;
```

If **activity count is zero**, the generated script jumps to the label section with **`CALL Continue_No_Rejects_00()`** and **`RETURN 'PROCESS FINISHED'`**, skipping `DROP TABLE DROPTEDTABLE1`. Otherwise it runs `DROP TABLE`, then **calls** the label procedure to run `SELECT A FROM AUDITORIA`.

### Known Issues

1. **There may be BTEQ commands that do not have an equivalent in Snowflake SQL**

Since BTEQ is a command-based program, there may be some commands in your input code that do not have a hundred percent functional equivalence in Snowflake SQL. Those particular cases are identified, marked with warnings in the output code, and documented in the further pages.

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-FDM-TD0003](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Bash variables found, using Snow SQL with variable substitution enabled is required to run this script.
3. [SSC-FDM-TD0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): The Snowflake error code mismatches the original Teradata error code.

---
title: SnowConvert AI - Teradata - Built-in Functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/teradata-built-in-functions.md
section: Migrations
---

# SnowConvert AI - Teradata - Built-in Functions

This page provides a description of the translation for the built-in functions in Teradata to Snowflake

> **Note:**
>
> This page only lists the functions that are already transformed by SnowConvert AI, if a function from the Teradata documentation is not listed there then it should be taken as unsupported.
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../../general/built-in-functions.md).

> **Note:**
>
> Some Teradata functions do not have a direct equivalent in Snowflake so they are transformed into a functional equivalent UDF, these can be easily spotted by the _UDF postfix in the name of the function. For more information on the UDFs SnowConvert AI uses check this [git repository](https://github.com/MobilizeNet/SnowConvert_Support_Library/tree/main/UDFs).

## Aggregate Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| AVG | AVG |  |
| CORR | CORR |  |
| COUNT | COUNT |  |
| COVAR_POP | COVAR_POP |  |
| COVAR_SAMP | COVAR_SAMP |  |
| GROUPING | GROUPING |  |
| KURTOSIS | KURTOSIS |  |
| MAXIMUM  MAX | MAX |  |
| MINIMUM  MIN | MIN |  |
| PIVOT | PIVOT | Check PIVOT. |
| REGR_AVGX | REGR_AVGX |  |
| REGR_AVGY | REGR_AVGY |  |
| REGR_COUNT | REGR_COUNT |  |
| REGR_INTERCEPT | REGR_INTERCEPT |  |
| REGR_R2 | REGR_R2 |  |
| REGR_SLOPE | REGR_SLOPE |  |
| REGR_SXX | REGR_SXX |  |
| REGR_SXY | REGR_SXY |  |
| REGR_SYY | REGR_SYY |  |
| SKEW | SKEW |  |
| STDDEV_POP | STDDEV_POP |  |
| STDDEV_SAMP | STDDEV_SAMP |  |
| SUM | SUM |  |
| UNPIVOT | UNPIVOT | Unpivot with multiple functions not supported in Snowflake |
| VAR_POP | VAR_POP |  |
| VAR_SAMP | VAR_SAMP |  |

> See [Aggregate functions​](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Aggregate-Functions)

## Arithmetic, Trigonometric, Hyperbolic Operators/Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| ABS | ABS |  |
| CEILING | CEIL |  |
| DEGREES | DEGREES |  |
| EXP | EXP |  |
| FLOOR | FLOOR |  |
| ***HYPERBOLIC***  ACOSH  ASINH  ATANH  COSH  SINH  TANH | ***HYPERBOLIC***  ACOSH  ASINH  ATANH  COSH  SINH  TANH |  |
| LOG | LOG |  |
| LN | LN |  |
| MOD | MOD |  |
| NULLIFZERO(param) | CASE WHEN param=0 THEN null ELSE param END |  |
| POWER | POWER |  |
| RANDOM | RANDOM |  |
| RADIANS | RADIANS |  |
| ROUND | ROUND |  |
| SIGN | SIGN |  |
| SQRT | SQRT |  |
| TRUNC | TRUNC_UDF |  |
| ***TRIGONOMETRIC***  ACOS  ASIN  ATAN  ATAN2  COS  SIN  TAN | ***TRIGONOMETRIC***  ACOS  ASIN  ATAN  ATAN2  COS  SIN  TAN |  |
| ZEROIFNULL | ZEROIFNULL |  |

> See [Arithmetic, Trigonometric, Hyperbolic Operators/Functions](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Arithmetic-Trigonometric-Hyperbolic-Operators/Functions)​

## Attribute Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| BIT_LENGTH | BIT_LENGTH |  |
| BYTE  BYTES | LENGTH |  |
| CHAR  CHARS  CHARACTERS | LEN |  |
| CHAR_LENGTH  CHARACTER_LENGTH | LEN |  |
| MCHARACTERS | LENGTH |  |
| OCTECT_LENGTH | OCTECT_LENGTH |  |

> See [Attribute functions](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Attribute-Functions)

## Bit/Byte Manipulation Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| BITAND | BITAND |  |
| BITNOT | BITNOT |  |
| BITOR | BITOR |  |
| BITXOR | BITXOR |  |
| GETBIT | GETBIT |  |

> See [Bit/Byte functions](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Bit/Byte-Manipulation-Functions)

## Built-In (System Functions)

| Teradata | Snowflake | Note |
| --- | --- | --- |
| ACCOUNT | CURRENT_ACCOUNT |  |
| CURRENT_DATE  CURDATE | CURRENT_DATE |  |
| CURRENT_ROLE | CURRENT_ROLE |  |
| CURRENT_TIME CURTIME | CURRENT_TIME |  |
| CURRENT_TIMESTAMP | CURRENT_TIMESTAMP |  |
| DATABASE | CURRENT_DATABASE |  |
| DATE | CURRENT_DATE |  |
| NOW | CURRENT_TIMESTAMP |  |
| PROFILE | CURRENT_ROLE | Check [SSC-EWI-TD0068](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) for more details on this transformation |
| SESSION | CURRENT_SESSION |  |
| TIME | CURRENT_TIME |  |
| USER | CURRENT_USER |  |

> See [Built-In Functions](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Built-In-Functions)

## Business Calendars

| Teradata | Snowflake | Note |
| --- | --- | --- |
| DAYNUMBER_OF_MONTH(DatetimeValue, ‘COMPATIBLE’) | DAYOFMONTH |  |
| DAYNUMBER_OF_MONTH(DatetimeValue, ‘ISO’) | DAYNUMBER_OF_MONTH_ISO_UDF |  |
| DAYNUMBER_OF_MONTH(DatetimeValue, ‘TERADATA’) | DAYOFMONTH |  |
| DAYNUMBER_OF_WEEK(DatetimeValue, ‘ISO’) | DAYOFWEEKISO |  |
| DAYNUMBER_OF_WEEK(DatetimeValue, ‘COMPATIBLE’) | DAY_OF_WEEK_COMPATIBLE_UDF |  |
| DAYNUMBER_OF_WEEK(DatetimeValue, ‘TERADATA’) DAYNUMBER_OF_WEEK(DatetimeValue) | TD_DAY_OF_WEEK_UDF |  |
| DAYNUMBER_OF_YEAR(DatetimeValue, ‘ISO’) | PUBLIC.DAY_OF_YEAR_ISO_UDF |  |
| DAYNUMBER_OF_YEAR(DatetimeValue) | DAYOFYEAR |  |
| QUARTERNUMBER_OF_YEAR | QUARTER |  |
| TD_SUNDAY(DateTimeValue) | PREVIOUS_DAY(DateTimeValue, ‘Sunday’) |  |
| WEEKNUMBER_OF_MONTH | WEEKNUMBER_OF_MONTH_UDF |  |
| WEEKNUMBER_OF_QUARTER(dateTimeValue) | WEEKNUMBER_OF_QUARTER_UDF |  |
| WEEKNUMBER_OF_QUARTER(dateTimeValue, ‘ISO’) | WEEKNUMBER_OF_QUARTER_ISO_UDF |  |
| WEEKNUMBER_OF_QUARTER(dateTimeValue, ‘COMPATIBLE’) | WEEKNUMBER_OF_QUARTER_COMPATIBLE_UDF |  |
| WEEKNUMBER_OF_YEAR(DateTimeValue, ‘ISO’) | WEEKISO |  |
| YEARNUMBER_OF_CALENDAR(DATETIMEVALUE, ‘COMPATIBLE’) | YEAR |  |
| YEARNUMBER_OF_CALENDAR(DATETIMEVALUE, ‘ISO’) | YEAROFWEEKISO |  |

> See [Business Calendars](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Date-and-Time-Functions-and-Expressions-17.20/Business-Calendars)

## Calendar Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| DAYNUMBER_OF_WEEK(DatetimeValue) | TD_DAY_OF_WEEK_UDF |  |
| DAYNUMBER_OF_WEEK(DatetimeValue, ‘COMPATIBLE’) | DAY_OF_WEEK_COMPATIBLE_UDF |  |
| QuarterNumber_Of_Year(DatetimeValue, ‘ISO’) | QUARTER_OF_YEAR_ISO_UDF(DatetimeValue) |  |
| TD_DAY_OF_CALENDAR | TD_DAY_OF_CALENDAR_UDF |  |
| TD_DAY_OF_MONTH DAYOFMONTH | DAYOFMONTH |  |
| TD_DAY_OF_WEEK DAYOFWEEK | TD_DAY_OF_WEEK_UDF |  |
| TD_DAY_OF_YEAR | DAYOFYEAR |  |
| TD_MONTH_OF_CALENDAR(DateTimeValue) MONTH_CALENDAR(DateTimeValue) | TD_MONTH_OF_CALENDAR_UDF(DateTimeValue) |  |
| TD_WEEK_OF_CALENDAR(DateTimeValue) WEEK_OF_CALENDAR(DateTimeValue) | TD_WEEK_OF_CALENDAR_UDF(DateTimeValue) |  |
| TD_WEEK_OF_YEAR | WEEK_OF_YEAR_UDF |  |
| TD_YEAR_BEGIN(DateTimeValue) | YEAR_BEGIN_UDF(DateTimeValue) |  |
| TD_YEAR_BEGIN(DateTimeValue, ‘ISO’) | YEAR_BEGIN_ISO_UDF(DateTimeValue) |  |
| TD_YEAR_END(DateTimeValue) | YEAR_END_UDF(DateTimeValue) |  |
| TD_YEAR_END(DateTimeValue, ‘ISO’) | YEAR_END_ISO_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_MONTH(DateTimeValue) | WEEKNUMBER_OF_MONTH_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_QUARTER(DateTimeValue) | WEEKNUMBER_OF_QUARTER_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_QUARTER(DateTimeValue, ‘ISO’) | WEEKNUMBER_OF_QUARTER_ISO_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_QUARTER(DateTimeValue, ‘COMPATIBLE’) | WEEKNUMBER_OF_QUARTER_COMPATIBLE_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_YEAR(DateTimeValue) | WEEK_OF_YEAR_UDF(DateTimeValue) |  |
| WEEKNUMBER_OF_YEAR(DateTimeValue, ‘COMPATIBLE’) | WEEK_OF_YEAR_COMPATIBLE_UDF(DateTimeValue) |  |

> See [Calendar Functions](https://docs.teradata.com/r/WX0vkeB8F3JQXZ0HTR~d0Q/~8TzAjUr3AFwohWtu8ndxQ)

## Case Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| COALESCE | COALESCE | Check Coalesce. |
| NULLIF | NULLIF |  |

> See [case functions](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/CASE-Expressions)

## Comparison Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| DECODE | DECODE |  |
| GREATEST | GREATEST |  |
| LEAST | LEAST |  |

> See [comparison functions](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Comparison-Operators-and-Functions)

## Data type conversions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| CAST | CAST |  |
| CAST(DatetimeValue AS INT) | DATE_TO_INT_UDF |  |
| CAST (VarcharValue AS INTERVAL) | INTERVAL_UDF | Check Cast to INTERVAL datatype |
| TRYCAST | TRY_CAST |  |
| FROM_BYTES | TO_NUMBER TO_BINARY | FROM_BYTES with ASCII parameter not supported in Snowflake. |

> See [Data Type Conversions](https://docs.teradata.com/reader/~_sY_PYVxZzTnqKq45UXkQ/iZ57TG_CtznEu1JdSbFNsQ)

## Data Type Conversion Functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| TO_BYTES(Input, ‘Base10’) | INT2HEX_UDF(Input) |  |
| TO_NUMBER | TO_NUMBER |  |
| TO_CHAR | TO_CHAR or equivalent expression | Check TO_CHAR. |
| TO_DATE | TO_DATE |  |
| TO_DATE(input, ‘YYYYDDD’) | JULIAN_TO_DATE_UDF |  |

> See [Data Type Conversion Functions](https://docs.teradata.com/r/Teradata-VantageTM-Data-Types-and-Literals/March-2019/Data-Type-Conversion-Functions)

## DateTime and Interval functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| ADD_MONTHS | ADD_MONTHS |  |
| EXTRACT | EXTRACT |  |
| LAST_DAY | LAST_DAY |  |
| MONTH | MONTH |  |
| MONTHS_BETWEEN | MONTHS_BETWEEN_UDF |  |
| NEXT_DAY | NEXT_DAY |  |
| OADD_MONTHS | ADD_MONTHS |  |
| ROUND(Numeric) | ROUND |  |
| ROUND(Date) | ROUND_DATE_UDF |  |
| TRUNC(Date) | TRUNC_UDF |  |
| YEAR | YEAR |  |

> See [DateTime and Interval Functions and Expressions](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/JhmMJqd9vWURvHYeTRgQLQ)

## Hash functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| HASH_MD5 | MD5 |  |
| HASHAMP  HASHBACKAM  HASHBUCKET  HASHROW | Not supported | Check notes on the architecture differences between Teradata and Snowflake |

> See [Hash functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/jslafnqlE8bGpg~wXQiEFw)

## JSON functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| NEW JSON | TO_JSON(PARSE_JSON()**)** | Check NEW JSON |
| JSON_CHECK | CHECK_JSON | Check JSON_CHECK |
| JSON_TABLE | Equivalent query | Check JSON_TABLE |
| JSONExtract  JSONExtractValue JSONExtractLargeValue | JSON_EXTRACT_UDF | Check JSON_EXTRACT |

> See [JSON documentation](https://docs.teradata.com/r/C8cVEJ54PO4~YXWXeXGvsA/_aeoMCG0XgMNegNj0oy5cg)

## Null-Handling functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| NVL | NVL |  |
| NVL2 | NVL2 |  |

> See [Null-Handling functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/4di35TY_6SqRNEGk4vv0ww)

## Ordered Analytical/Window Aggregate functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| CSUM(col1, col2) | SUM(col_1) OVER (PARTITION BY null ORDER BY col_2 ROWS UNBOUNDED PRECEDING) |  |
| CUME_DIST | CUME_DIST |  |
| DENSE_RANK | DENSE_RANK |  |
| FIRST_VALUE | FIRST_VALUE |  |
| LAG | LAG |  |
| LAST_VALUE | LAST_VALUE |  |
| LEAD | LEAD |  |
| MAVG(csales, 2, cdate, csales) | AVG(csales) OVER ( ORDER BY cdate, csales ROWS 1 PRECEDING) |  |
| MEDIAN | MEDIAN |  |
| MSUM(csales, 2, cdate, csales) | SUM(csales) OVER(ORDER BY cdate, csales ROWS 1 PRECEDING) |  |
| PERCENT_RANK | PERCENT_RANK |  |
| PERCENTILE_CONT | PERCENTILE_CONT |  |
| PERCENTILE_DISC | PERCENTILE_DISC |  |
| QUANTILE | QUANTILE |  |
| RANK | RANK |  |
| ROW_NUMBER | ROW_NUMBER |  |

> See [Window functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/qbFqalW6IF5Fryz47~iqJQ)

## Period functions and operators

| Teradata | Snowflake | Note |
| --- | --- | --- |
| BEGIN | PERIOD_BEGIN_UDF |  |
| END | PERIOD_END_UDF |  |
| INTERVAL | TIMESTAMPDIFF |  |
| LAST | PERIOD_LAST_UDF |  |
| LDIFF | PERIOD_LDIFF_UDF |  |
| OVERLAPS | PUBLIC.PERIOD_OVERLAPS_UDF |  |
| PERIOD | PERIOD_UDF |  |
| PERIOD(datetimeValue, UNTIL_CHANGED)  PERIOD(datetimeValue, UNTIL_CLOSED) | PERIOD_UDF(datetimeValue, ‘9999-12-31 23:59:59.999999’) | See notes about ending bound constants |
| RDIFF | PERIOD_RDIFF_UDF |  |

> See [Period Functions and Operators](https://docs.teradata.com/r/Teradata-Database-SQL-Functions-Operators-Expressions-and-Predicates/June-2017/Period-Functions-and-Operators)

## Query band functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| GETQUERYBANDVALUE | GETQUERYBANDVALUE_UDF | Check GETQUERYBANDVALUE |

> See [Query band functions](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/Teradata-VantageTM-Application-Programming-Reference-17.20/Workload-Management-Query-Band-APIs/Open-APIs-SQL-Interfaces)

## Regex functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| REGEXP_INSTR | REGEXP_INSTR | Check Regex functions |
| REGEXP_REPLACE | REGEXP_REPLACE | Check Regex functions |
| REGEXP_SIMILAR | REGEXP_LIKE | Check Regex functions |
| REGEXP_SUBSTR | REGEXP_SUBSTR | Check Regex functions |

> See [Regex functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/yL2xT~elOTehmwVmwVBRHA)

## String operators and functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| ASCII | ASCII |  |
| CHAR2HEXINT | CHAR2HEXINT_UDF |  |
| CHR | CHR/CHAR |  |
| CHAR_LENGTH | LEN |  |
| CONCAT | CONCAT |  |
| EDITDISTANCE | EDITDISTANCE |  |
| INDEX | CHARINDEX | Check notes about implicit conversion |
| INITCAP | INITCAP |  |
| INSTR | REGEXP_INSTR |  |
| INSTR(StringValue, StringValue ,NumericNegativeValue, NumericValue) | INSTR_UDF(StringValue, StringValue ,NumericNegativeValue, NumericValue) |  |
| LEFT | LEFT |  |
| LENGTH | LENGTH |  |
| LOWER | LOWER |  |
| LPAD | LPAD |  |
| LTRIM | LTRIM |  |
| OREPLACE | REPLACE |  |
| OTRANSLATE | TRANSLATE |  |
| POSITION | POSITION | Check notes about implicit conversion |
| REVERSE | REVERSE |  |
| RIGHT | RIGHT |  |
| RPAD | RPAD |  |
| RTRIM | RTRIM |  |
| SOUNDEX | SOUNDEX_P123 |  |
| STRTOK | STRTOK |  |
| STRTOK_SPLIT_TO_TABLE | STRTOK_SPLIT_TO_TABLE | Check Strtok_split_to_table |
| SUBSTRING | SUBSTR/SUBSTR_UDF | Check Substring |
| TRANSLATE_CHK | TRANSLATE_CHK_UDF |  |
| TRIM(LEADING ‘0’ FROM aTABLE) | LTRIM(aTABLE, ‘0’) |  |
| TRIM(TRAILING ‘0’ FROM aTABLE) | RTRIM(aTABLE, ‘0’) |  |
| TRIM(BOTH ‘0’ FROM aTABLE) | TRIM(aTABLE, ‘0’) |  |
| TRIM(CAST(numericValue AS FORMAT ‘999’)) | LPAD(numericValue, 3, 0) |  |
| UPPER | UPPER |  |

> See [String operators and functions](https://docs.teradata.com/reader/756LNiPSFdY~4JcCCcR5Cw/5nyfztBE7gDQVCVU2MFTnA)​​​

## St_Point functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| ST_SPHERICALDISTANCE | HAVERSINE ST_DISTANCE |  |

See [St_Point functions](https://docs.teradata.com/r/W1AEeHO2cxTi3Sn7dtj8hg/JDVMx04qe~mo1mIm2h7NWQ)

## Table operators

| Teradata | Snowflake | Note |
| --- | --- | --- |
| TD_UNPIVOT | Equivalent query | Check Td_unpivot |

> See [Table Operators](https://docs.teradata.com/r/Teradata-Database-SQL-Functions-Operators-Expressions-and-Predicates/June-2017/Table-Operators)

## XML functions

| Teradata | Snowflake | Note |
| --- | --- | --- |
| XMLAGG | LISTAGG | Check Xmlagg |
| XMLQUERY | Not Supported |  |

> See [XML functions](https://docs.teradata.com/r/JTydkOYDksSy26sxlEtMvg/GhlIYri~mxyncdX5BV3jWA)

## Extensibility UDFs

This section contains UDFs and other extensibility functions that are not offered as system built-in functions by Teradata but are transformed by SnowConvert AI

| Teradata | Snowflake | Note |
| --- | --- | --- |
| CHKNUM | CHKNUM_UDF | Check [this UDF download page](https://downloads.teradata.com/download/extensibility/isnumeric-udf) |

## Notes

### Architecture differences between Teradata and Snowflake

Teradata has a shared-nothing architecture with Access Module Processors (AMP) where each AMP manages their own share of disk storage and is accessed through hashing when doing queries. To take advantage of parallelism the stored information should be evenly distributed among AMPs and to do this Teradata offers a group of hash-related functions that can be used to determine how good the actual primary indexes are.

On the other hand, Snowflake architecture is different, and it manages how the data is stored on its own, meaning users do not need to worry about optimizing their data distribution.

### Ending bound constants (UNTIL_CHANGED and UNTIL_CLOSED)

Both UNTIL_CHANGED and UNTIL_CLOSED are Teradata constants that represent an undefined ending bound for periods. Internally, these constants are represented as the maximum value a timestamp can have i.e ‘9999-12-31 23:59:59.999999’. During the migration of the PERIOD function, the ending bound is checked if present to determine if it is one of these constants and to replace it with varchar of value ‘9999-12-31 23:59:59.999999’ in case it is, Snowflake then casts the varchar to date or timestamp depending on the type of the beginning bound when calling PERIOD___UDF.

### Implicit conversion

Some Teradata string functions like INDEX or POSITION accept non-string data types and implicitly convert them to string, this can cause inconsistencies in the results of those functions between Teradata and Snowflake. For example, the following Teradata code:

```sql
 SELECT INDEX(35, '5');
```

Returns 4, while the CHARINDEX equivalent in Snowflake:

```sql
 SELECT CHARINDEX('5', 35);
```

Returns 2, this happens because Teradata has its own [default formats](https://docs.teradata.com/r/S0Fw2AVH8ff3MDA0wDOHlQ/Xh8u4~A7KI46wOdMG9DSHQ) which are used during implicit conversion. In the above example, Teradata [interprets the numeric constant](https://docs.teradata.com/r/T5QsmcznbJo1bHmZT2KnFw/TEOJhlyP6az05SdTK9JHMg) 35 as BYTEINT and uses BYTEINT default format`'-999'` for the implicit conversion to string, causing the converted value to be `' 35'`. On the other hand, Snowflake uses its own [default formats](https://docs.snowflake.com/en/sql-reference/sql-format-models.html#default-formats-for-parsing), creating inconsistencies in the result.

To solve this, the following changes are done to those function parameters:

* If the parameter does **not** have a cast with format, then a Snowflake `TO_VARCHAR` function with the default Teradata format equivalent in Snowflake is added instead.
* If the parameter does have a cast with format, then the format is converted to its Snowflake equivalent and the`TO_VARCHAR`function is added.

  + As a side note, Teradata ignores the sign of a number if it is not explicitly put inside a format, while Snowflake always adds spaces to insert the sign even when not specified, for those cases a check is done to see if the sign was specified and to remove it from the Snowflake string in case it was not.

After these changes, the resulting code would be:

```sql
 SELECT CHARINDEX( '5', TO_VARCHAR(35, 'MI999'));
```

Which returns 4, the same as the Teradata code.

## Known Issues

No issues were found.

## Related EWIs

No related EWIs.

## COALESCE

### Description

The coalesce function is used to return the first non-null element in a list. For more information check [COALESCE](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/Wo3afkb7dFsUUAO5AwQOkQ).

```sql
COALESCE(element_1, element_2 [, element_3, ..., element_n])
```

Both Teradata and Snowflake COALESCE functions allow mixing numeric with string and date with timestamp parameters. However, they handle these two cases differently:

* Numeric along with string parameters: Teradata converts all numeric parameters to varchar while Snowflake does the opposite
* Timestamp along with date parameters: Teradata converts all timestamps to date while Snowflake does the opposite

To ensure functional equivalence in the first case, all numeric parameters are cast to`string`using`to_varchar`function, this takes the format of the numbers into account. In the second case, all timestamps are casted to date using `to_date`, Teradata ignores the format of timestamps when casting them so it is removed during transformation.

### Sample Source Patterns

#### Numeric mixed with string parameters

##### *Teradata*

**Query**

```sql
 SELECT COALESCE(125, 'hello', cast(850 as format '-999'));
```

**Result**

```sql
COLUMN1|
-------+
125    |
```

##### *Snowflake*

**Query**

```sql
SELECT
 COALESCE(TO_VARCHAR(125), 'hello', TO_VARCHAR(850, '9000'));
```

**Result**

```sql
COLUMN1|
-------+
125    |
```

#### Timestamp mixed with date parameters

##### *Teradata*

**Query**

```
 SELECT COALESCE(cast(TIMESTAMP '2021-09-14 10:14:59' as format 'HH:MI:SSBDD-MM-YYYY'), current_date);
```

**Result**

```sql
COLUMN1    |
-----------+
2021-09-14 |
```

##### *Snowflake*

**Query**

```sql
SELECT
 COALESCE(TO_DATE(TIMESTAMP '2021-09-14 10:14:59' !!!RESOLVE EWI!!! /*** SSC-EWI-TD0025 - OUTPUT FORMAT 'HH:MI:SSBDD-MM-YYYY' NOT SUPPORTED. ***/!!!), CURRENT_DATE());
```

**Result**

```sql
COLUMN1    |
-----------+
2021-09-14 |
```

### Known Issues

No known issues_._

### Related EWIs

* [SSC-EWI-TD0025](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Output format not supported.

## CURRENT_TIMESTAMP

### Severity

Low

### Description

Fractional seconds are only displayed if it is explicitly set in the TIME_OUTPUT_FORMAT session parameter.

#### Input code:

```sql
SELECT current_timestamp(4) at local;
```

#### Output code:

```sql
SELECT
CURRENT_TIMESTAMP(4);
```

### Recommendations

* Check if the TIME_OUTPUT___FORMAT session parameter is set to get the behavior that you want.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## DAYNUMBER_OF_MONTH

### Description

Returns the number of days elapsed from the beginning of the month to the given date. For more information check [DAYNUMBER_OF_MONTH](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/msvzanlVHZUHwFv5LYpqzg).

```sql
DAYNUMBER_OF_MONTH(expression [, calendar_name])
```

Both Teradata and Snowflake handle the DAYNUMBER_OF_MONTH function in the same way, except in one case:

* The ISO calendar: An ISO month has 4 or 5 complete weeks. For more information check [About ISO Computation](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/RdZQp3YPJ1WrBpj8b3uljA).

To ensure functional equivalence, a user-defined function (UDF) is added for the ISO calendar case.

### Sample Source Patterns

#### *Teradata*

**Query**

```sql
SELECT
    DAYNUMBER_OF_MONTH (DATE'2022-12-22'),
    DAYNUMBER_OF_MONTH (DATE'2022-12-22', NULL),
    DAYNUMBER_OF_MONTH (DATE'2022-12-22', 'Teradata'),
    DAYNUMBER_OF_MONTH (DATE'2022-12-22', 'COMPATIBLE');
```

**Result**

```sql
COLUMN1|COLUMN2|COLUMN3|COLUMN4|
-------+-------+-------+-------+
22     |22     |22     |22     |
```

#### *Snowflake*

**Query**

```sql
SELECT
    DAYOFMONTH(DATE'2022-12-22'),
    DAYOFMONTH(DATE'2022-12-22'),
    DAYOFMONTH(DATE'2022-12-22'),
    DAYOFMONTH(DATE'2022-12-22');
```

**Result**

```sql
COLUMN1|COLUMN2|COLUMN3|COLUMN4|
-------+-------+-------+-------+
22     |22     |22     |22     |
```

#### ISO calendar

##### *Teradata*

**Query**

```sql
SELECT DAYNUMBER_OF_MONTH (DATE'2022-12-22', 'ISO');
```

**Result**

```sql
COLUMN1|
-------+
25     |
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.DAYNUMBER_OF_MONTH_UDF(DATE'2022-12-22');
```

**Result**

```sql
COLUMN1|
-------+
25     |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## FROM_BYTES

Translation specification for transforming the TO_CHAR function into an equivalent function concatenation in Snowflake

### Description

The FROM_BYTES function encodes a sequence of bits into a sequence of characters representing its encoding. For more information check [FROM_BYTES(Encoding)](https://www.docs.teradata.com/r/Teradata-VantageTM-Data-Types-and-Literals/March-2019/Data-Type-Conversion-Functions/FROM_BYTES).

Snowflake does not have support for FROM_BYTES function, however, some workarounds can be done for the most common occurrences of this function.

### Sample Source Patterns

#### Teradata

##### Query

```sql
 SELECT
FROM_BYTES('5A1B'XB, 'base10'), --returns '23067'
FROM_BYTES('5A3F'XB, 'ASCII'), --returns 'Z\ESC '
FROM_BYTES('5A1B'XB, 'base16'); -- returns '5A1B'
```

##### Result

```none
COLUMN1    | COLUMN2    | COLUMN3 |
-----------+------------+---------+
23067      |  Z\ESC     | 5A1B    |
```

##### Snowflake

##### Query

```sql
 SELECT
--returns '23067'
TO_NUMBER('5A1B', 'XXXX'),
--returns 'Z\ESC '
!!!RESOLVE EWI!!! /*** SSC-EWI-0031 - FROM_BYTES FUNCTION NOT SUPPORTED ***/!!!
FROM_BYTES(TO_BINARY('5A3F'), 'ASCII'),
TO_BINARY('5A1B', 'HEX'); -- returns '5A1B'
```

##### Result

```none
COLUMN1    | COLUMN2    | COLUMN3 |
-----------+------------+---------+
23067      |  Z\ESC     | 5A1B    |
```

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Known Issues

1. TO_NUMBER format parameter must match with the digits on the input string.
2. There is no functional equivalent built-in function for FROM_BYTES when encoding to ANSI

### Related EWIs

1. [SSC-EWI-0031](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): FUNCTION NOT SUPPORTED

## GETQUERYBANDVALUE

Translation specification for the transformation of GetQueryBandValue to Snowflake

### Description

The GetQueryBandValue function searches a name key inside of the query band and returns its associated value if present. It can be used to search inside the transaction, session, profile, or any of the key-value pairs of the query band.

For more information on this function check [GetQueryBandValue](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/Teradata-VantageTM-Application-Programming-Reference-17.20/Workload-Management-Query-Band-APIs/Open-APIs-SQL-Interfaces/GetQueryBandValue) in the Teradata documentation.

```none
[SYSLIB.]GetQueryBandValue([QueryBandIn,] SearchType, Name);
```

### Sample Source Patterns

#### Setup data

##### Teradata

##### Query

```sql
 SET QUERY_BAND = 'hola=hello;adios=bye;' FOR SESSION;
```

##### *Snowflake*

##### Query

```sql
 ALTER SESSION SET QUERY_TAG = 'hola=hello;adios=bye;';
```

#### GetQueryBandValue with QueryBandIn parameter

##### *Teradata*

##### Query

```sql
 SELECT
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 0, 'account') as Example1,
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 1, 'account') as Example2,
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 2, 'account') as Example3,
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 3, 'account') as Example4,
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 0, 'role') as Example5,
GETQUERYBANDVALUE('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 1, 'role') as Example6;
```

##### Result

```none
+----------+----------+----------+----------+----------+----------+
| EXAMPLE1 | EXAMPLE2 | EXAMPLE3 | EXAMPLE4 | EXAMPLE5 | EXAMPLE6 |
+----------+----------+----------+----------+----------+----------+
| Mark200  | Mark200  | SaraDB   | Peter3   | DbAdmin  |          |
+----------+----------+----------+----------+----------+----------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 0, 'account') as Example1,
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 1, 'account') as Example2,
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 2, 'account') as Example3,
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 3, 'account') as Example4,
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 0, 'role') as Example5,
GETQUERYBANDVALUE_UDF('=T> user=Mark;account=Mark200; =S> user=Sara;account=SaraDB;role=DbAdmin =P> user=Peter;account=Peter3;', 1, 'role') as Example6;
```

##### Result

```none
+----------+----------+----------+----------+----------+----------+
| EXAMPLE1 | EXAMPLE2 | EXAMPLE3 | EXAMPLE4 | EXAMPLE5 | EXAMPLE6 |
+----------+----------+----------+----------+----------+----------+
| Mark200  | Mark200  | SaraDB   | Peter3   | DbAdmin  |          |
+----------+----------+----------+----------+----------+----------+
```

#### GetQueryBandValue without QueryBandIn parameter

##### *Teradata*

##### Query

```sql
 SELECT
GETQUERYBANDVALUE(2, 'hola') as Example1,
GETQUERYBANDVALUE(2, 'adios') as Example2;
```

##### Result

```none
+----------+----------+
| EXAMPLE1 | EXAMPLE2 |
+----------+----------+
| hello    | bye      |
+----------+----------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
GETQUERYBANDVALUE_UDF('hola') as Example1,
GETQUERYBANDVALUE_UDF('adios') as Example2;
```

##### Result

```none
+----------+----------+
| EXAMPLE1 | EXAMPLE2 |
+----------+----------+
| hello    | bye      |
+----------+----------+
```

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Known Issues

**1. GetQueryBandValue without QueryBandIn parameter only supported for session**

Teradata allows defining query bands at transaction, session or profile levels. If GetQueryBandValue is called without specifying an input query band Teradata will automatically check the transaction, session or profile query bands depending on the value of the SearchType parameter.

In Snowflake the closest equivalent to query bands are query tags, which can be specified for session, user and account.

Due to these differences, the implementation of GetQueryBandValue without QueryBandIn parameter only considers the session query tag and may not work as expected for other search types.

### Related EWIs

No related EWIs.

## HASHBUCKET

### Description

HASHBUCKET is a Teradata system function used to determine the hash bucket number for data distribution across Access Module Processors (AMPs). It is commonly used in combination with HASHROW to evaluate how evenly data is distributed across AMPs based on the primary index. For more information, see [HASHBUCKET](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/jslafnqlE8bGpg~wXQiEFw).

```sql
HASHBUCKET(HASHROW(column1 [, column2, ...]))
HASHBUCKET()
HASHBUCKET(byte_expression)
```

HASHBUCKET is not translated to Snowflake because it is specific to Teradata’s shared-nothing AMP architecture. While Snowflake provides a [HASH](https://docs.snowflake.com/en/sql-reference/functions/hash) function, the two are not functionally equivalent:

* **Different algorithms**: Teradata and Snowflake use completely different hashing algorithms.
* **Different value ranges**: HASHBUCKET returns values in the range 0–1,048,575, while Snowflake HASH returns signed 64-bit integers (a far larger and different value range).
* **Different NULL handling**: HASHBUCKET(HASHROW(NULL)) returns 0, while Snowflake HASH(NULL) returns a non-NULL hash value.

### Sample Source Patterns

#### HASHBUCKET with HASHROW (single column)

##### *Teradata*

```sql
 SELECT HASHBUCKET(HASHROW(col1)) FROM my_table;
```

##### *Snowflake*

```sql
 SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHBUCKET FUNCTION NOT SUPPORTED ***/!!!
   HASHBUCKET(
              !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHROW FUNCTION NOT SUPPORTED ***/!!!
              HASHROW(col1))
 FROM
   my_table;
```

#### HASHBUCKET with HASHROW (multiple columns)

##### *Teradata*

```sql
 SELECT HASHBUCKET(HASHROW(a, b, c)) FROM my_table;
```

##### *Snowflake*

```sql
 SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHBUCKET FUNCTION NOT SUPPORTED ***/!!!
   HASHBUCKET(
              !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHROW FUNCTION NOT SUPPORTED ***/!!!
              HASHROW(a, b, c))
 FROM
   my_table;
```

#### HASHBUCKET with no arguments

##### *Teradata*

```sql
 SELECT HASHBUCKET() + 1 FROM my_table;
```

##### *Snowflake*

```sql
 SELECT
   !!!RESOLVE EWI!!! /*** SSC-EWI-0031 - HASHBUCKET FUNCTION NOT SUPPORTED ***/!!!
   HASHBUCKET() + 1
 FROM
   my_table;
```

### Recommendations

The appropriate replacement depends on how HASHBUCKET is used in the source code:

* **Data distribution or skew analysis** — Teradata uses `HASHBUCKET(HASHROW(col))` to evaluate how evenly data is distributed across AMPs. In Snowflake, data distribution is managed automatically through [micro-partitioning](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions). To analyze clustering quality, use [SYSTEM$CLUSTERING_INFORMATION](https://docs.snowflake.com/en/sql-reference/functions/system_clustering_information) instead.
* **Custom bucketing or partitioning logic** — If the source code uses hash bucket numbers to route or partition data, consider using Snowflake’s [HASH](https://docs.snowflake.com/en/sql-reference/functions/hash) function with `MOD()` to distribute values across a fixed number of buckets: `ABS(MOD(HASH(col), num_buckets))`. Note that this will not produce the same bucket assignments as Teradata.
* **Max bucket number constant** — `HASHBUCKET()` with no arguments returns the maximum hash bucket number (1,048,575) in Teradata. If this value is used as a constant in calculations, replace it with the literal value `1048575`.
* **Code that needs any hash value** — If the specific bucket numbers do not matter and only the distribution property is needed (e.g., sampling, random assignment), Snowflake’s [HASH](https://docs.snowflake.com/en/sql-reference/functions/hash) function can be used directly.
* **Code that depends on exact Teradata bucket values** — If the logic relies on specific bucket numbers (e.g., hardcoded ranges, lookups against pre-computed buckets), there is no automated migration path. The hashing algorithm is proprietary to Teradata and cannot be reproduced in Snowflake. This code requires manual redesign.

### Known Issues

No known issues.

### Related EWIs

* [SSC-EWI-0031](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Function not supported.

## HASHROW

### Description

HASHROW is a Teradata system function that generates a hash value for one or more columns. It is typically used as the argument to HASHBUCKET for data distribution analysis. Like HASHBUCKET, HASHROW is tied to Teradata’s AMP architecture and has no Snowflake equivalent. For more information, see [Hash functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/jslafnqlE8bGpg~wXQiEFw).

See HASHBUCKET for conversion examples and recommendations.

### Related EWIs

* [SSC-EWI-0031](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Function not supported.

## HASHAMP

### Description

HASHAMP is a Teradata system function that returns the AMP number associated with a hash bucket value. Like HASHBUCKET and HASHROW, it is tied to Teradata’s AMP architecture and has no Snowflake equivalent. For more information, see [Hash functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/jslafnqlE8bGpg~wXQiEFw).

See HASHBUCKET for conversion examples and recommendations.

### Related EWIs

* [SSC-EWI-0031](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Function not supported.

## HASHBAKAMP

### Description

HASHBAKAMP is a Teradata system function that returns the fallback AMP number for a given hash bucket value. Like HASHBUCKET and HASHROW, it is tied to Teradata’s AMP architecture and has no Snowflake equivalent. For more information, see [Hash functions](https://docs.teradata.com/r/756LNiPSFdY~4JcCCcR5Cw/jslafnqlE8bGpg~wXQiEFw).

See HASHBUCKET for conversion examples and recommendations.

### Related EWIs

* [SSC-EWI-0031](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Function not supported.

## JSON_CHECK

### Description

The JSON_CHECK function checks a string for valid JSON.

For more information, see the [Teradata JSON_CHECK documentation](https://docs.teradata.com/r/Teradata-Database-JSON-Data-Type/June-2017/JSON-Functions-and-Operators/JSON_CHECK).

```javascript
[TD_SYSFNLIB.]JSON_CHECK(string_expr);
```

### Sample Source Pattern

#### Basic Source Pattern

##### Teradata

**Query**

```sql
SELECT JSON_CHECK('{"key": "value"}');
```

##### Snowflake Scripting

**Query**

```sql
SELECT
IFNULL(CHECK_JSON('{"key": "value"}'), 'OK');
```

#### JSON_CHECK inside CASE transformation

##### Teradata

**Query**

```sql
SELECT CASE WHEN JSON_CHECK('{}') = 'OK' then 'OKK' ELSE 'NOT OK' END;
```

##### Snowflake Scripting

**Query**

```sql
SELECT
CASE
WHEN UPPER(RTRIM(IFNULL(CHECK_JSON('{}'), 'OK'))) = UPPER(RTRIM('OK'))
THEN 'OKK' ELSE 'NOT OK'
END;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## JSON_EXTRACT

Translation reference to convert the Teradata functions JSONExtractValue, JSONExtractLargeValue and JSONExtract to Snowflake Scripting.

### Description

As per Teradata’s documentation, these functions use the [JSONPath Query Syntax](https://docs.teradata.com/r/Teradata-VantageTM-JSON-Data-Type/March-2019/Operations-on-the-JSON-Type/JSONPath-Request-Syntax) to request information about a portion of a JSON instance. The entity desired can be any portion of a JSON instance, such as a name/value pair, an object, an array, an array element, or a value.

For more information, see the [Teradata JSONExtract function comparison](https://docs.teradata.com/r/Teradata-VantageTM-JSON-Data-Type/March-2019/JSON-Methods/Comparison-of-JSONExtract-and-JSONExtractValue).

```javascript
 JSON_expr.JSONExtractValue(JSONPath_expr)

JSON_expr.JSONExtractLargeValue(JSONPath_expr)

JSON_expr.JSONExtract(JSONPath_expr)
```

The JSON_EXTRACT_UDF is a Snowflake implementation of the JSONPath specification that uses a modified version of the original JavaScript implementation developed by [Stefan Goessner](https://goessner.net/index.html).

#### Sample Source Pattern

##### Teradata

##### Query

```sql
 SELECT
    Store.JSONExtract('$..author') as AllAuthors,
    Store.JSONExtractValue('$..book[2].title') as ThirdBookTitle,
    Store.JSONExtractLargeValue('$..book[2].price') as ThirdBookPrice
FROM BookStores;
```

##### Snowflake Scripting

##### Query

```sql
 SELECT
    JSON_EXTRACT_UDF(Store, '$..author', FALSE) as AllAuthors,
    JSON_EXTRACT_UDF(Store, '$..book[2].title', TRUE) as ThirdBookTitle,
    JSON_EXTRACT_UDF(Store, '$..book[2].price', TRUE) as ThirdBookPrice
    FROM
    BookStores;
```

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Known Issues

#### 1. Elements inside JSONs may not retain their original order.

Elements inside a JSON are ordered by their keys when inserted in a table. Thus, the query results might differ. However, this does not affect the order of arrays inside the JSON.

For example, if the original JSON is:

```json
 {
   "firstName":"Peter",
   "lastName":"Andre",
   "age":31,
   "cities": ["Los Angeles", "Lima", "Buenos Aires"]
}
```

Using the Snowflake [PARSE_JSON()](https://docs.snowflake.com/en/sql-reference/functions/parse_json.html) that interprets an input string as a JSON document, producing a VARIANT value. The inserted JSON will be:

```json
 {
   "age": 31,
   "cities": ["Los Angeles", "Lima", "Buenos Aires"],
   "firstName": "Peter",
   "lastName": "Andre"
}
```

Note how “age” is now the first element. However, the array of “cities” maintains its original order.

### Related EWIs

No related EWIs.

## JSON_TABLE

Translation specification for the transformation of JSON_TABLE into a equivalent query in Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Creates a table based on the contents of a JSON document. See [JSON_TABLE documentation](https://docs.teradata.com/r/Teradata-VantageTM-JSON-Data-Type-17.20/JSON-Shredding/JSON_TABLE).

```none
[TD_SYSFNLIB.]JSON_TABLE(
  ON (json_documents_retrieving_expr)
  USING
      ROWEXPR (row_expr_literal)
      COLEXPR (column_expr_literal)
  [AS] correlation_name [(column_name [,...])]
)
```

The conversion of JSON_TABLE has the considerations shown below:

* ROW_NUMBER() is an equivalent of ordinal columns in Snowflake.
* In Teradata, the second column of JSON_TABLE must be JSON type because the generated columns replace the second column, for that reason, SnowConvert AI assumes that the column has the right type, and uses it for the transformation.

### Sample Source Patterns

#### Setup data

##### Teradata

##### Query

```sql
 create table myJsonTable(
 col1 integer,
 col2 JSON(1000)
 );

insert into myJsonTable values(1,
new json('{
"name": "Matt",
"age" : 30,
"songs" : [
	{"name" : "Late night", "genre" : "Jazz"},
	{"name" : "Wake up", "genre" : "Rock"},
	{"name" : "Who am I", "genre" : "Rock"},
	{"name" : "Raining", "genre" : "Blues"}
]
}'));
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE TABLE myJsonTable (
 col1 integer,
 col2 VARIANT
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO myJsonTable
VALUES (1, TO_JSON(PARSE_JSON('{
"name": "Matt",
"age" : 30,
"songs" : [
	{"name" : "Late night", "genre" : "Jazz"},
	{"name" : "Wake up", "genre" : "Rock"},
	{"name" : "Who am I", "genre" : "Rock"},
	{"name" : "Raining", "genre" : "Blues"}
]
}')));
```

#### Pattern code 1

##### *Teradata*

##### Query

```sql
 SELECT * FROM
JSON_TABLE(ON (SELECT COL1, COL2 FROM myJsonTable WHERE col1 = 1)
USING rowexpr('$.songs[*]')
colexpr('[ {"jsonpath" : "$.name",
            "type" : "CHAR(20)"},
            {"jsonpath" : "$.genre",
             "type" : "VARCHAR(20)"}]')) AS JT(ID, "Song name", Genre);
```

##### Result

```none
ID | Song name  | Genre |
---+------------+-------+
1  | Late night | Jazz  |
---+------------+-------+
1  | Wake up    | Rock  |
---+------------+-------+
1  | Who am I   | Rock  |
---+------------+-------+
1  | Raining    | Blues |
```

##### *Snowflake*

##### Query

```sql
 SELECT
* FROM
(
SELECT
COL1 AS ID,
rowexpr.value:name :: CHAR(20) AS "Song name",
rowexpr.value:genre :: VARCHAR(20) AS Genre
FROM
myJsonTable,
TABLE(FLATTEN(INPUT => COL2:songs)) rowexpr
WHERE col1 = 1
) JT;
```

##### Result

```none
ID | Song name  | Genre |
---+------------+-------+
1  | Late night | Jazz  |
---+------------+-------+
1  | Wake up    | Rock  |
---+------------+-------+
1  | Who am I   | Rock  |
---+------------+-------+
1  | Raining    | Blues |
```

### Known Issues

**1. The JSON path in COLEXPR can not have multiple asterisk accesses**

The columns JSON path cannot have multiple lists with asterisk access, for example: `$.Names[*].FullNames[*]`. On the other hand, the JSON path of ROWEXP can have it.

**2. JSON structure defined in the COLEXPR literal must be a valid JSON**

When it is not the case the user will be warned about the JSON being badly formed.

### Related EWIs

No related EWIs.

## NEW JSON

### Description

Allocates a new instance of a JSON datatype. For more information check [NEW JSON Constructor Expression.](https://docs.teradata.com/r/Teradata-Database-JSON-Data-Type/June-2017/The-JSON-Data-Type/About-JSON-Type-Constructor/NEW-JSON-Constructor-Expression)

```none
NEW JSON ( [ JSON_string_spec | JSON_binary_data_spec ] )

JSON_string_spec := JSON_String_literal [, { LATIN | UNICODE | BSON | UBJSON } ]

JSON_binary_data_spec := JSON_binary_literal [, { BSON | UBJSON } ]
```

The second parameter of the NEW JSON function is always omitted by SnowConvert AI since Snowflake works only with UTF-8.

### Sample Source Patterns

#### NEW JSON with string data

##### *Teradata*

**Query**

```sql
SELECT NEW JSON ('{"name" : "cameron", "age" : 24}'),
NEW JSON ('{"name" : "cameron", "age" : 24}', LATIN);
```

**Result**

| COLUMN1 | COLUMN2 |
| --- | --- |
| {“age”:24,”name”:”cameron”} | {“age”:24,”name”:”cameron”} |

##### *Snowflake*

**Query**

```sql
SELECT
TO_JSON(PARSE_JSON('{"name" : "cameron", "age" : 24}')),
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'LATIN' NOT SUPPORTED ***/!!!
TO_JSON(PARSE_JSON('{"name" : "cameron", "age" : 24}'));
```

**Result**

| COLUMN1 | COLUMN2 |
| --- | --- |
| {“age”:24,”name”:”cameron”} | {“age”:24,”name”:”cameron”} |

### Known Issues

**1. The second parameter is not supported**

The second parameter of the function used to specify the format of the resulting JSON is not supported because Snowflake only supports UTF-8, this may result in functional differences for some uses of the function.

**2. JSON with BINARY data is not supported**

Snowflake does not support parsing binary data to create a JSON value, the user will be warned when SnowConvert AI finds a NEW JSON with binary data.

### Related EWIs

1. [SSC-EWI-TD0039](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Input format not supported.

## NVP

### Description

Extracts the value of the key-value pair where the key matches the nth occurrence of the specified name to search. See [NVP](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/String-Operators-and-Functions/NVP).

```sql
[TD_SYSFNLIB.] NVP (
in_string,
name_to_search
[, name_delimiters ]
[, value_delimiters ]
[, occurrence ]
)
```

### Sample Source Patterns

#### NVP basic case

##### *Teradata*

**Query**

```sql
SELECT
NVP('entree=-orange chicken&entree+.honey salmon', 'entree', '&', '=- +.', 1),
NVP('Hello=bye|name=Lucas|Hello=world!', 'Hello', '|', '=', 2),
NVP('Player=Mario$Game&Tenis%Player/Susana$Game=Chess', 'Player', '% $', '= & /', 2);
```

**Result**

```sql
COLUMN1        | COLUMN2 | COLUMN3 |
---------------+---------+---------+
orange chicken | world!  | Susana  |
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.NVP_UDF('entree=-orange chicken&entree+.honey salmon', 'entree', '&', '=- +.', 1),
PUBLIC.NVP_UDF('Hello=bye|name=Lucas|Hello=world!', 'Hello', '|', '=', 2),
PUBLIC.NVP_UDF('Player=Mario$Game&Tenis%Player/Susana$Game=Chess', 'Player', '% $', '= & /', 2);
```

**Result**

```sql
COLUMN1        | COLUMN2 | COLUMN3 |
---------------+---------+---------+
orange chicken | world!  | Susana  |
```

#### NVP with optional parameters ignored

##### *Teradata*

**Query**

```sql
SELECT
NVP('City=Los Angeles&Color=Green&Color=Blue&City=San Jose', 'Color'),
NVP('City=Los Angeles&Color=Green&Color=Blue&City=San Jose', 'Color', 2),
NVP('City=Los Angeles#Color=Green#Color=Blue#City=San Jose', 'City', '#', '=');
```

**Result**

```sql
COLUMN1 | COLUMN2 | COLUMN3     |
--------+---------+-------------+
Green   | Blue    | Los Angeles |
```

##### *Snowflake*

**Query**

```sql
SELECT
    PUBLIC.NVP_UDF('City=Los Angeles&Color=Green&Color=Blue&City=San Jose', 'Color', '&', '=', 1),
    PUBLIC.NVP_UDF('City=Los Angeles&Color=Green&Color=Blue&City=San Jose', 'Color', '&', '=', 2),
    PUBLIC.NVP_UDF('City=Los Angeles#Color=Green#Color=Blue#City=San Jose', 'City', '#', '=', 1);
```

**Result**

```sql
COLUMN1 | COLUMN2 | COLUMN3     |
--------+---------+-------------+
Green   | Blue    | Los Angeles |
```

#### NVP with spaces in delimiters

##### *Teradata*

**Query**

```sql
SELECT
NVP('store = whole foods&&store: ?Bristol farms','store', '&&', '\ =\  :\ ?', 2),
NVP('Hello = bye|name = Lucas|Hello = world!', 'Hello', '|', '\ =\ ', 2);
```

**Result**

```sql
COLUMN1       | COLUMN2 |
--------------+---------+
Bristol farms | world!  |
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.NVP_UDF('store = whole foods&&store: ?Bristol farms', 'store', '&&', '\\ =\\  :\\ ?', 2),
PUBLIC.NVP_UDF('Hello = bye|name = Lucas|Hello = world!', 'Hello', '|', '\\ =\\ ', 2);
```

**Result**

```sql
COLUMN1       | COLUMN2 |
--------------+---------+
Bristol farms | world!  |
```

#### NVP with non-literal delimiters

##### *Teradata*

**Query**

```sql
SELECT NVP('store = whole foods&&store: ?Bristol farms','store', '&&', valueDelimiter, 2);
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.NVP_UDF('store = whole foods&&store: ?Bristol farms', 'store', '&&', valueDelimiter, 2) /*** SSC-FDM-TD0008 - WHEN NVP_UDF FOURTH PARAMETER IS NON-LITERAL AND IT CONTAINS A BACKSLASH, THAT BACKSLASH NEEDS TO BE ESCAPED ***/;
```

### Known Issues

**1. Delimiters with spaces (\ ) need to have the backslash escaped in Snowflake**

In Teradata, delimiters including space specify them using “\ “ (see NVP with spaces in delimiters), as shown in the examples, in Teradata it is not necessary to escape the backslash, however, it is necessary in Snowflake. Escaping the backslashes in the delimiter can be done automatically by SnowConvert AI but only if the delimiter values are literal strings, otherwise the user will be warned that the backslashes could not be escaped and that it may cause different results in Snowflake.

### Related EWIs

1. [SSC-FDM-TD0008](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Non-literal delimiters with spaces need their backslash escaped in Snowflake.

## OVERLAPS

### Description

According to Teradata’s documentation, the OVERLAPS operator compares two or more period expressions. If they overlap, it returns true.

For more information, see the [Teradata OVERLAPS documentation](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/3VIgdwHNVU~tsnNiIR1aEw).

```sql
period_expression
OVERLAPS
period_expression
```

The PERIOD_OVERLAPS_UDF is a Snowflake implementation of the OVERLAPS operator in Teradata.

### Sample Source Pattern

#### Teradata

**Query**

```sql
SELECT
    PERIOD(DATE '2009-01-01', DATE '2010-09-24')
    OVERLAPS
    PERIOD(DATE '2009-02-01', DATE '2009-06-24');
```

#### Snowflake Scripting

**Query**

```sql
SELECT
    PUBLIC.PERIOD_OVERLAPS_UDF(ARRAY_CONSTRUCT(PUBLIC.PERIOD_UDF(DATE '2009-01-01', DATE '2010-09-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!, PUBLIC.PERIOD_UDF(DATE '2009-02-01', DATE '2009-06-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!)) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!;
```

### Known Issues

#### 1. Unsupported Period Expressions

The *PERIOD(TIME WITH TIME ZONE)* and *PERIOD(TIMESTAMP WITH TIME ZONE)* expressions are not supported yet.

### Related EWIs

1. [SSC-EWI-TD0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Snowflake does not support the period datatype, all periods are handled as varchar instead

## P_INTERSECT

### Description

According to Teradata’s documentation, the P_INTERSECT operator compares two or more period expressions. If they overlap, it returns the common portion of the period expressions.

For more information, see the [Teradata P_INTERSECT documentation](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/iW6iefgeyOypFOMY2qGG_A).

```sql
period_expression
P_INTERSECT
period_expression
```

The PERIOD_INTERSECT_UDF is a Snowflake implementation of the P_INTERSECT operator in Teradata.

### Sample Source Pattern

#### Teradata

**Query**

```sql
SELECT
    PERIOD(DATE '2009-01-01', DATE '2010-09-24')
    P_INTERSECT
    PERIOD(DATE '2009-02-01', DATE '2009-06-24');
```

#### Snowflake Scripting

**Query**

```sql
SELECT
    PUBLIC.PERIOD_INTERSECT_UDF(ARRAY_CONSTRUCT(PUBLIC.PERIOD_UDF(DATE '2009-01-01', DATE '2010-09-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!, PUBLIC.PERIOD_UDF(DATE '2009-02-01', DATE '2009-06-24') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!)) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!;
```

### Known Issues

#### 1. Unsupported Period Expressions

The *PERIOD(TIME WITH TIME ZONE)* and *PERIOD(TIMESTAMP WITH TIME ZONE)* expressions are not supported yet.

### Related EWIs

1. [SSC-EWI-TD0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Snowflake does not support the period datatype, all periods are handled as varchar instead

## PIVOT

Translation specification for the PIVOT function from Teradata to Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The pivot function is used to transform rows of a table into columns. For more information check the [PIVOT Teradata documentation.](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/Aggregate-Functions/PIVOT)

```none
PIVOT ( pivot_spec )
  [ WITH with_spec [,...] ]
  [AS] derived_table_name [ ( cname [,...] ) ]

pivot_spec := aggr_fn_spec [,...] FOR for_spec

aggr_fn_spec := aggr_fn ( cname ) [ [AS] pvt_aggr_alias ]

for_spec := { cname IN ( expr_spec_1 [,...] ) |
( cname [,...] ) IN ( expr_spec_2 [,...] ) |
cname IN ( subquery )
}

expr_spec_1 := expr [ [AS] expr_alias_name ]

expr_spec_2 := ( expr [,...] ) [ [AS] expr_alias_name ]

with_spec := aggr_fn ( { cname [,...] | * } ) [AS] aggr_alias
```

### Sample Source Patterns

#### Setup data

##### Teradata

##### Query

```sql
 CREATE TABLE star1(
	country VARCHAR(20),
	state VARCHAR(10),
	yr INTEGER,
	qtr VARCHAR(3),
	sales INTEGER,
	cogs INTEGER
);

insert into star1 values ('USA', 'CA', 2001, 'Q1', 30, 15);
insert into star1 values ('Canada', 'ON', 2001, 'Q2', 10, 0);
insert into star1 values ('Canada', 'BC', 2001, 'Q3', 10, 0);
insert into star1 values ('USA', 'NY', 2001, 'Q1', 45, 25);
insert into star1 values ('USA', 'CA', 2001, 'Q2', 50, 20);
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE TABLE star1 (
	country VARCHAR(20),
	state VARCHAR(10),
	yr INTEGER,
	qtr VARCHAR(3),
	sales INTEGER,
	cogs INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO star1
VALUES ('USA', 'CA', 2001, 'Q1', 30, 15);

INSERT INTO star1
VALUES ('Canada', 'ON', 2001, 'Q2', 10, 0);

INSERT INTO star1
VALUES ('Canada', 'BC', 2001, 'Q3', 10, 0);

INSERT INTO star1
VALUES ('USA', 'NY', 2001, 'Q1', 45, 25);

INSERT INTO star1
VALUES ('USA', 'CA', 2001, 'Q2', 50, 20);
```

#### Basic PIVOT transformation

##### *Teradata*

##### Query

```sql
 SELECT *
FROM star1 PIVOT (
	SUM(sales) FOR qtr
    IN ('Q1',
    	'Q2',
        'Q3')
)Tmp;
```

##### Result

```none
Country | State | yr   | cogs | 'Q1' | 'Q2' | 'Q3' |
--------+-------+------+------+------+------+------+
Canada	| BC	| 2001 | 0    | null | null | 10   |
--------+-------+------+------+------+------+------+
USA 	| NY	| 2001 | 25   | 45   | null | null |
--------+-------+------+------+------+------+------+
Canada 	| ON 	| 2001 | 0    | null | 10   | null |
--------+-------+------+------+------+------+------+
USA 	| CA 	| 2001 | 20   | null | 50   | null |
--------+-------+------+------+------+------+------+
USA 	| CA 	| 2001 | 15   | 30   | null | null |
--------+-------+------+------+------+------+------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
	*
FROM
	star1 PIVOT(
	SUM(sales) FOR qtr IN ('Q1',
	   	'Q2',
	       'Q3'))Tmp;
```

##### Result

```none
Country | State | yr   | cogs | 'Q1' | 'Q2' | 'Q3' |
--------+-------+------+------+------+------+------+
Canada	| BC	| 2001 | 0    | null | null | 10   |
--------+-------+------+------+------+------+------+
USA 	| NY	| 2001 | 25   | 45   | null | null |
--------+-------+------+------+------+------+------+
Canada 	| ON 	| 2001 | 0    | null | 10   | null |
--------+-------+------+------+------+------+------+
USA 	| CA 	| 2001 | 20   | null | 50   | null |
--------+-------+------+------+------+------+------+
USA 	| CA 	| 2001 | 15   | 30   | null | null |
--------+-------+------+------+------+------+------+
```

#### PIVOT with aliases transformation

##### *Teradata*

##### Query

```sql
 SELECT *
FROM star1 PIVOT (
	SUM(sales) as ss1 FOR qtr
    IN ('Q1' AS Quarter1,
    	'Q2' AS Quarter2,
        'Q3' AS Quarter3)
)Tmp;
```

##### Result

```none
Country | State | yr   | cogs | Quarter1_ss1 | Quarter2_ss1 | Quarter3_ss1 |
--------+-------+------+------+--------------+--------------+--------------+
Canada	| BC	| 2001 | 0    | null 	     | null         | 10           |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| NY	| 2001 | 25   | 45 	     | null 	    | null         |
--------+-------+------+------+--------------+--------------+--------------+
Canada 	| ON 	| 2001 | 0    | null 	     | 10 	    | null 	   |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| CA 	| 2001 | 20   | null         | 50           | null         |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| CA 	| 2001 | 15   | 30           | null         | null         |
--------+-------+------+------+--------------+--------------+--------------+
```

##### *Snowflake*

##### Query

```sql
 SELECT
	*
FROM
	!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
	star1 PIVOT(
	SUM(sales) FOR qtr IN (
	                       !!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
	                       'Q1',
	!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
	   	'Q2',
	!!!RESOLVE EWI!!! /*** SSC-EWI-0015 - PIVOT/UNPIVOT RENAME COLUMN NOT SUPPORTED ***/!!!
	       'Q3'))Tmp;
```

##### Result

```sql
 Country | State | yr   | cogs | Quarter1_ss1 | Quarter2_ss1 | Quarter3_ss1 |
--------+-------+------+------+--------------+--------------+--------------+
Canada	| BC	| 2001 | 0    | null 	     | null         | 10           |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| NY	| 2001 | 25   | 45 	     | null 	    | null         |
--------+-------+------+------+--------------+--------------+--------------+
Canada 	| ON 	| 2001 | 0    | null 	     | 10 	    | null 	   |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| CA 	| 2001 | 20   | null         | 50           | null         |
--------+-------+------+------+--------------+--------------+--------------+
USA 	| CA 	| 2001 | 15   | 30           | null         | null         |
--------+-------+------+------+--------------+--------------+--------------+
```

### Known Issues

**1. WITH clause not supported**

Using the WITH clause is not currently supported.

**2. Pivot over multiple pivot columns not supported**

SnowConvert AI is transforming the PIVOT function into the PIVOT function in Snowflake, which only supports applying the function over a single column.

**3. Pivot with multiple aggregate functions not supported**

The PIVOT function in Snowflake only supports applying one aggregate function over the data.

**4. Subquery in the IN clause not supported**

The IN clause of the Snowflake PIVOT function does not accept subqueries.

**5. Aliases only supported if all IN clause elements have it and table specification is present**

For the column names with aliases to be equivalent, SnowConvert AI requires that all the values specified in the IN clause have one alias specified and the table specification is present in the input code, this is necessary so SnowConvert AI can successfully create the alias list for the resulting table.

### Related EWIs

1. [SSC-EWI-0015](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The input pivot/unpivot statement format is not supported

## RANK

Translation specification for the transformation of the RANK() function

### Description

RANK sorts a result set and identifies the numeric rank of each row in the result. The only argument for RANK is the sort column or columns, and the function returns an integer that represents the rank of each row in the result. ([RANK in Teradata](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Functions-Expressions-and-Predicates/Ordered-Analytical/Window-Aggregate-Functions/RANK-Teradata))

#### Teradata syntax

```sql
 RANK ( sort_expression [ ASC | DESC ] [,...] )
```

#### Snowflake syntax

```sql
 RANK() OVER
(
    [ PARTITION BY <expr1> ]
    ORDER BY <expr2> [ { ASC | DESC } ]
    [ <window_frame> ]
)
```

### Sample Source Pattern

#### Setup data

##### Teradata

##### Query

```sql
 CREATE TABLE Sales (
  Product VARCHAR(255),
  Sales INT
);

INSERT INTO Sales (Product, Sales) VALUES ('A', 100);
INSERT INTO Sales (Product, Sales) VALUES ('B', 150);
INSERT INTO Sales (Product, Sales) VALUES ('C', 200);
INSERT INTO Sales (Product, Sales) VALUES ('D', 150);
INSERT INTO Sales (Product, Sales) VALUES ('E', 120);
INSERT INTO Sales (Product, Sales) VALUES ('F', NULL);
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE TABLE Sales (
  Product VARCHAR(255),
  Sales INT
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO Sales (Product, Sales)
VALUES ('A', 100);

INSERT INTO Sales (Product, Sales)
VALUES ('B', 150);

INSERT INTO Sales (Product, Sales)
VALUES ('C', 200);

INSERT INTO Sales (Product, Sales)
VALUES ('D', 150);

INSERT INTO Sales (Product, Sales)
VALUES ('E', 120);

INSERT INTO Sales (Product, Sales)
VALUES ('F', NULL);
```

#### RANK() using ASC, DESC, and DEFAULT order

##### Teradata

> **Warning:**
>
> Notice that Teradata’s ordering default value when calling RANK() is DESC. However, the default in Snowflake is ASC. Thus, DESC is added in the conversion of RANK() when no order is specified.

##### Query

```sql
 SELECT
  Sales,
  RANK(Sales ASC) AS SalesAsc,
  RANK(Sales DESC) AS SalesDesc,
  RANK(Sales) AS SalesDefault
FROM
  Sales;
```

##### Result

| SALES | SALESASC | SALESDESC | SALESDEFAULT |
| --- | --- | --- | --- |
| NULL | 6 | 6 | 6 |
| 200 | 5 | 1 | 1 |
| 150 | 3 | 2 | 2 |
| 150 | 3 | 2 | 2 |
| 120 | 2 | 4 | 4 |
| 100 | 1 | 5 | 5 |

##### Snowflake

##### Query

```sql
 SELECT
  Sales,
  RANK() OVER (
  ORDER BY
    Sales ASC) AS SalesAsc,
    RANK() OVER (
    ORDER BY
    Sales DESC NULLS LAST) AS SalesDesc,
    RANK() OVER (
    ORDER BY
    Sales DESC NULLS LAST) AS SalesDefault
    FROM
    Sales;
```

##### Result

| SALES | SALESASC | SALESDESC | SALESDEFAULT |
| --- | --- | --- | --- |
| NULL | 6 | 6 | 6 |
| 200 | 5 | 1 | 1 |
| 150 | 3 | 2 | 2 |
| 150 | 3 | 2 | 2 |
| 120 | 2 | 4 | 4 |
| 100 | 1 | 5 | 5 |

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Regex functions

### Description

Both Teradata and Snowflake offer support for functions that apply regular expressions over varchar inputs. See the [Teradata documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates/March-2019/Regular-Expression-Functions) and [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/functions-regexp.html) for more details.

```sql
REGEXP_SUBSTR(source. regexp [, position, occurrence, match])
REGEXP_REPLACE(source. regexp [, replace_string, position, occurrence, match])
REGEXP_INSTR(source. regexp [, position, occurrence, return_option, match])
REGEXP_SIMILAR(source. regexp [, match])
REGEXP_SPLIT_TO_TABLE(inKey. source. regexp, match)
```

### Sample Source Patterns

#### Setup data

##### Teradata

**Query**

```sql
CREATE TABLE regexpTable
(
    col1 CHAR(35)
);

INSERT INTO regexpTable VALUES('hola');
```

##### *Snowflake*

**Query**

```sql
CREATE OR REPLACE TABLE regexpTable
(
    col1 CHAR(35)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO regexpTable
VALUES ('hola');
```

#### Regex transformation example

##### *Teradata*

**Query**

```sql
SELECT
REGEXP_REPLACE(col1,'.*(h(i|o))','ha', 1, 0, 'x'),
REGEXP_SUBSTR(COL1,'.*(h(i|o))', 2, 1, 'x'),
REGEXP_INSTR(COL1,'.*(h(i|o))',1, 1, 0, 'x'),
REGEXP_SIMILAR(COL1,'.*(h(i|o))', 'xl')
FROM regexpTable;
```

**Result**

```sql
COLUMN1|COLUMN2|COLUMN3|COLUMN4|
-------+-------+-------+-------+
hala   |null   |1      |0      |
```

##### *Snowflake*

**Query**

```sql
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "regexpTable" **
SELECT
REGEXP_REPLACE(col1, '.*(h(i|o))', 'ha', 1, 0),
REGEXP_SUBSTR(COL1, '.*(h(i|o))', 2, 1),
REGEXP_INSTR(COL1, '.*(h(i|o))', 1, 1, 0),
--** SSC-FDM-TD0016 - VALUE 'l' FOR PARAMETER 'match_arg' IS NOT SUPPORTED IN SNOWFLAKE **
REGEXP_LIKE(COL1, '.*(h(i|o))')
FROM
regexpTable;
```

**Result**

```sql
COLUMN1|COLUMN2|COLUMN3|COLUMN4|
-------+-------+-------+-------+
hala   |null   |1      |FALSE  |
```

### Known Issues

**1. Snowflake only supports POSIX regular expressions**

The user will be warned when SnowConvert AI finds a non-POSIX regular expression.

**2. Teradata “match_arg” option ‘l’ is unsupported in Snowflake**

The option ‘l’ has no counterpart in Snowflake and the user will be warned if SnowConvert AI finds them.

**3. Fixed size of the CHAR datatype may cause different behavior**

Some regex functions in Teradata will try to match the whole column of CHAR datatype in a table even if some of the characters in the column were left empty due to a smaller string being inserted. In Snowflake this does not happen because the CHAR datatype is of variable size.

**4. REGEXP_SPLIT_TO_TABLE not supported**

The function is currently not supported by Snowflake.

### Related EWIs

1. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
2. [SSC-FDM-TD0016](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Value ‘l’ for parameter ‘match_arg’ is not supported in Snowflake.

## STRTOK_SPLIT_TO_TABLE

### Description

Split a string into a table using the provided delimiters. For more information check [STRTOK_SPLIT_TO_TABLE](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Functions-Expressions-and-Predicates-17.20/String-Operators-and-Functions/STRTOK_SPLIT_TO_TABLE).

```sql
[TD_SYSFNLIB.] STRTOK_SPLIT_TO_TABLE ( inkey, instring, delimiters )
  RETURNS ( outkey, tokennum, token )
```

### Sample Source Patterns

#### Setup data

##### Teradata

**Query**

```sql
CREATE TABLE strtokTable
(
	col1 INTEGER,
	col2 VARCHAR(100)
);

INSERT INTO strtokTable VALUES(4, 'hello-world-split-me');
INSERT INTO strtokTable VALUES(1, 'string$split$by$dollars');
```

##### *Snowflake*

**Query**

```sql
CREATE OR REPLACE TABLE strtokTable
(
	col1 INTEGER,
	col2 VARCHAR(100)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO strtokTable
VALUES (4, 'hello-world-split-me');

INSERT INTO strtokTable
VALUES (1, 'string$split$by$dollars');
```

#### STRTOK_SPLIT_TO_TABLE transformation

##### *Teradata*

**Query**

```sql
SELECT outkey, tokennum, token FROM table(STRTOK_SPLIT_TO_TABLE(strtokTable.col1, strtokTable.col2, '-$')
RETURNS (outkey INTEGER, tokennum INTEGER, token VARCHAR(100))) AS testTable
ORDER BY outkey, tokennum;
```

**Result**

```sql
outkey |tokennum | token  |
-------+---------+--------+
1      |1        |string  |
-------+---------+--------+
1      |2        |split   |
-------+---------+--------+
1      |3        |by      |
-------+---------+--------+
1      |4        |dollars |
-------+---------+--------+
4      |1        |hello   |
-------+---------+--------+
4      |2        |world   |
-------+---------+--------+
4      |3        |split   |
-------+---------+--------+
4      |4        |me      |
```

##### *Snowflake*

**Query**

```sql
SELECT
CAST(strtokTable.col1 AS INTEGER) AS outkey,
CAST(INDEX AS INTEGER) AS tokennum,
CAST(VALUE AS VARCHAR) AS token
FROM
strtokTable,
table(STRTOK_SPLIT_TO_TABLE(strtokTable.col2, '-$')) AS testTable
ORDER BY outkey, tokennum;
```

**Result**

```sql
outkey |tokennum | token  |
-------+---------+--------+
1      |1        |string  |
-------+---------+--------+
1      |2        |split   |
-------+---------+--------+
1      |3        |by      |
-------+---------+--------+
1      |4        |dollars |
-------+---------+--------+
4      |1        |hello   |
-------+---------+--------+
4      |2        |world   |
-------+---------+--------+
4      |3        |split   |
-------+---------+--------+
4      |4        |me      |
```

### Known Issues

No known issues.

### Related EWIs

No related EWIs.

## SUBSTRING

### Description

Extracts a substring from a given input string. For more information check [SUBSTRING/SUBSTR.](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/lxOd~YrdVkJGt0_anAEXFQ)

```sql
SUBSTRING(string_expr FROM n1 [FOR n2])

SUBSTR(string_expr, n1, [, n2])
```

When the value to start getting the substring (n1) is less than one SUBSTR_UDF is inserted instead.

### Sample Source Patterns

#### SUBSTRING transformation

##### *Teradata*

**Query**

```sql
SELECT SUBSTR('Hello World!', 2, 6),
SUBSTR('Hello World!', -2, 6),
SUBSTRING('Hello World!' FROM 2 FOR 6),
SUBSTRING('Hello World!' FROM -2 FOR 6);
```

**Result**

```sql
COLUMN1 |COLUMN2 |COLUMN3 | COLUMN4 |
--------+--------+--------+---------+
ello W  |Hel     |ello W  |Hel      |
```

##### *Snowflake*

**Query**

```sql
SELECT
SUBSTR('Hello World!', 2, 6),
PUBLIC.SUBSTR_UDF('Hello World!', -2, 6),
SUBSTRING('Hello World!', 2, 6),
PUBLIC.SUBSTR_UDF('Hello World!', -2, 6);
```

**Result**

```sql
COLUMN1 |COLUMN2 |COLUMN3 | COLUMN4 |
--------+--------+--------+---------+
ello W  |Hel     |ello W  |Hel      |
```

### Related EWIs

No related EWIs.

## TD_UNPIVOT

Translation specification for the transformation of TD_UNPIVOT into an equivalent query in Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

`TD_UNPIVOT` in Teradata can unpivot multiple columns at once, while Snowflake `UNPIVOT` can only unpivot a single column\*\*.\*\* The *unpivot* functionality is used to transform columns of the specified table into rows. For more information see [TD_UNPIVOT](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Operators-and-User-Defined-Functions-17.20/Table-Operators/TD_UNPIVOT).

```none
[TD_SYSFNLIB.] TD_UNPIVOT (
  ON { tableName | ( query_expression ) }
  USING VALUE_COLUMNS ( 'value_columns_value' [,...] )
  UNPIVOT_COLUMN ( 'unpivot_column_value' )
  COLUMN_LIST ( 'column_list_value' [,...] )
  [ COLUMN_ALIAS_LIST ( 'column_alias_list_value' [,...] )
      INCLUDE_NULLS ( { 'No' | 'Yes' } )
  ]
)
```

The following transformation is able to generate a SQL query in Snowflake that unpivots multiple columns at the same time, the same way it works in Teradata.

### Sample Source Patterns

#### Setup data title

##### Teradata

##### Query

```sql
 CREATE TABLE superunpivottest (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
);

INSERT INTO superUnpivottest VALUES (2020, 15440, 25430.57, 10322.15, 12355.36);
INSERT INTO superUnpivottest VALUES (2018, 18325.25, 25220.65, 15560.45, 15680.33);
INSERT INTO superUnpivottest VALUES (2019, 23855.75, 34220.22, 14582.55, 24122);
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE TABLE superunpivottest (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO superUnpivottest
VALUES (2020, 15440, 25430.57, 10322.15, 12355.36);

INSERT INTO superUnpivottest
VALUES (2018, 18325.25, 25220.65, 15560.45, 15680.33);

INSERT INTO superUnpivottest
VALUES (2019, 23855.75, 34220.22, 14582.55, 24122);
```

#### TD_UNPIVOT transformation

##### *Teradata*

##### Query

```sql
 SELECT * FROM
 TD_UNPIVOT(
 	ON superunpivottest
 	USING
 	VALUE_COLUMNS('Income', 'Expenses')
 	UNPIVOT_COLUMN('Semester')
 	COLUMN_LIST('firstSemesterIncome, firstSemesterExpenses', 'secondSemesterIncome, secondSemesterExpenses')
    COLUMN_ALIAS_LIST('First', 'Second')
 )X ORDER BY mykey, Semester;
```

##### Result

```none
myKey |Semester |Income   | Expenses |
------+---------+---------+----------+
2018  |First    |18325.25 |15560.45  |
------+---------+---------+----------+
2018  |Second   |25220.65 |15680.33  |
------+---------+---------+----------+
2019  |First    |23855.75 |14582.55  |
------+---------+---------+----------+
2019  |Second   |34220.22 |24122.00  |
------+---------+---------+----------+
2020  |First    |15440.00 |10322.15  |
------+---------+---------+----------+
2020  |Second   |25430.57 |12355.36  |
```

##### *Snowflake*

##### Query

```sql
 SELECT
 * FROM
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0061 - TD_UNPIVOT TRANSFORMATION REQUIRES COLUMN INFORMATION THAT COULD NOT BE FOUND, COLUMNS MISSING IN RESULT ***/!!!
 (
  SELECT
   TRIM(GET_IGNORE_CASE(OBJECT_CONSTRUCT('FIRSTSEMESTERINCOME', 'First', 'FIRSTSEMESTEREXPENSES', 'First', 'SECONDSEMESTERINCOME', 'Second', 'SECONDSEMESTEREXPENSES', 'Second'), Semester), '"') AS Semester,
   Income,
   Expenses
  FROM
   superunpivottest UNPIVOT(Income FOR Semester IN (
    firstSemesterIncome,
    secondSemesterIncome
   )) UNPIVOT(Expenses FOR Semester1 IN (
    firstSemesterExpenses,
    secondSemesterExpenses
   ))
  WHERE
   Semester = 'FIRSTSEMESTERINCOME'
   AND Semester1 = 'FIRSTSEMESTEREXPENSES'
   OR Semester = 'SECONDSEMESTERINCOME'
   AND Semester1 = 'SECONDSEMESTEREXPENSES'
 ) X ORDER BY mykey, Semester;
```

##### Result

```none
myKey |Semester |Income   | Expenses |
------+---------+---------+----------+
2018  |First    |18325.25 |15560.45  |
------+---------+---------+----------+
2018  |Second   |25220.65 |15680.33  |
------+---------+---------+----------+
2019  |First    |23855.75 |14582.55  |
------+---------+---------+----------+
2019  |Second   |34220.22 |24122.00  |
------+---------+---------+----------+
2020  |First    |15440.00 |10322.15  |
------+---------+---------+----------+
2020  |Second   |25430.57 |12355.36  |
```

### Known Issues

1. **TD_UNPIVOT with INCLUDE_NULLS clause set to YES is not supported**

Snowflake UNPIVOT function used in the transformation will ignore null values always, and the user will be warned that the INCLUDE_NULLS clause is not supported when it is set to YES.

2. **Table information is required to correctly transform the function**

SnowConvert AI needs the name of the columns that are being used in the TD_UNPIVOT function; if the user does not include the columns list in the query_expression of the function but provides the name of the table being unpivoted, then it will try to retrieve the column names from the table definition. If the names can not be found then the user will be warned that the resulting query might be losing columns in the result.

### Related EWIs

1. [SSC-EWI-TD0061](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): TD_UNPIVOT transformation requires column information that could not be found, columns missing in result.

## TO_CHAR

### Description

The TO_CHAR function casts a DateTime or numeric value to a string. For more information check [TO_CHAR(Numeric)](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/4xbLyOA_385QLYctkj~hjw) and [TO_CHAR(DateTime)](https://docs.teradata.com/r/kmuOwjp1zEYg98JsB8fu_A/he2a_fFPveMN9cjMlF3Tqg).

```sql
-- Numeric version
[TD_SYSFNLIB.]TO_CHAR(numeric_expr [, format_arg [, nls_param]])

-- DateTime version
[TD_SYSFNLIB.]TO_CHAR(dateTime_expr [, format_arg])
```

Both Snowflake and Teradata have their own version of the TO_CHAR function, however, Teradata supports plenty of formats that are not natively supported by Snowflake. To support these format elements SnowConvert AI uses Snowflake built-in functions and custom UDFs to generate a concatenation expression that produces the same string as the original TO_CHAR function in Teradata.

### Sample Source Patterns

#### TO_CHAR(DateTime) transformation

##### *Teradata*

**Query**

```sql
SELECT
TO_CHAR(date '2012-12-23'),
TO_CHAR(date '2012-12-23', 'DS'),
TO_CHAR(date '2012-12-23', 'DAY DD, MON YY');
```

**Result**

```sql
COLUMN1    | COLUMN2    | COLUMN3           |
-----------+------------+-------------------+
2012/12/23 | 12/23/2012 | SUNDAY 23, DEC 12 |
```

##### *Snowflake*

**Query**

```sql
SELECT
TO_CHAR(date '2012-12-23') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/,
TO_CHAR(date '2012-12-23', 'MM/DD/YYYY') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/,
PUBLIC.DAYNAME_LONG_UDF(date '2012-12-23', 'uppercase') || TO_CHAR(date '2012-12-23', ' DD, ') || PUBLIC.MONTH_SHORT_UDF(date '2012-12-23', 'uppercase') || TO_CHAR(date '2012-12-23', ' YY') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;
```

**Result**

```sql
COLUMN1    | COLUMN2    | COLUMN3           |
-----------+------------+-------------------+
2012/12/23 | 12/23/2012 | SUNDAY 23, DEC 12 |
```

#### TO_CHAR(Numeric) transformation

##### *Teradata*

**Query**

```sql
SELECT
TO_CHAR(1255.495),
TO_CHAR(1255.495, '9.9EEEE'),
TO_CHAR(1255.495, 'SC9999.9999', 'nls_iso_currency = ''EUR''');
```

**Result**

```sql
COLUMN1  | COLUMN2 | COLUMN3       |
---------+---------+---------------+
1255.495 | 1.3E+03 | +EUR1255.4950 |
```

##### *Snowflake*

**Query**

```sql
SELECT
TO_CHAR(1255.495) /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/,
TO_CHAR(1255.495, '9.0EEEE') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/,
PUBLIC.INSERT_CURRENCY_UDF(TO_CHAR(1255.495, 'S9999.0000'), 2, 'EUR') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;
```

**Result**

```sql
COLUMN1  | COLUMN2 | COLUMN3       |
---------+---------+---------------+
1255.495 | 1.3E+03 | +EUR1255.4950 |
```

### Known Issues

**1. Formats with different or unsupported behaviors**

Teradata offers an extensive list of format elements that may show different behavior in Snowflake after the transformation of the TO_CHAR function. For the list of elements with different or unsupported behaviors check [SSC-EWI-TD0029](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md).

### Related EWIs

1. [SSC-FDM-TD0029](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Snowflake supported formats for TO_CHAR differ from Teradata and may fail or have different behavior.

## XMLAGG

### Description

Construct an XML value by performing an aggregation of multiple rows. For more information check [XMLAGG](https://docs.teradata.com/r/Teradata-VantageTM-XML-Data-Type/June-2020/Functions-for-XML-Type-and-XQuery/XMLAGG).

```none
XMLAGG (
  XML_value_expr
  [ ORDER BY order_by_spec [,...] ]
  [ RETURNING { CONTENT | SEQUENCE } ]
)

order_by_spec := sort_key [ ASC | DESC ] [ NULLS { FIRST | LAST } ]
```

### Sample Source Patterns

#### Setup data

##### Teradata

**Query**

```sql
create table orders (
	o_orderkey int,
	o_totalprice float);

insert into orders values (1,500000);
insert into orders values (2,100000);
insert into orders values (3,600000);
insert into orders values (4,700000);
```

##### *Snowflake*

**Query**

```sql
CREATE OR REPLACE TABLE orders (
	o_orderkey int,
	o_totalprice float)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO orders
VALUES (1,500000);

INSERT INTO orders
VALUES (2,100000);

INSERT INTO orders
VALUES (3,600000);

INSERT INTO orders
VALUES (4,700000);
```

#### XMLAGG transformation

##### *Teradata*

**Query**

```sql
select
    xmlagg(o_orderkey order by o_totalprice desc) (varchar(10000))
from orders
where o_totalprice > 5;
```

**Result**

```sql
COLUMN1 |
--------+
4 3 1 2 |
```

##### *Snowflake*

**Query**

```sql
SELECT
    LEFT(TO_VARCHAR(LISTAGG ( o_orderkey, ' ')
    WITHIN GROUP(
 order by o_totalprice DESC NULLS LAST)), 10000)
    from
    orders
    where o_totalprice > 5;
```

**Result**

```sql
COLUMN1 |
--------+
4 3 1 2 |
```

### Known Issues

**1. The RETURNING clause is currently not supported.**

The user will be warned that the translation of the returning clause will be added in the future.

### Related EWIs

No related EWIs.

## CAST

## Cast from Number Datatypes to Varchar Datatype

Teradata when casts to varchar uses default formats for each number datatype, so SnowConvert AI adds formats to keep the equivalence among platforms.

### Sample Source Patterns

#### BYTEINT

##### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(12 as BYTEINT) as varchar(10))||'"';
```

**Result**

```sql
(('"'||12)||'"')|
----------------+
"12"            |
```

##### *Snowflake*

**Query**

```sql
SELECT
'"'|| LEFT(TO_VARCHAR(cast(12 as BYTEINT), 'TM'), 10) ||'"';
```

**Result**

```sql
"'""'|| LEFT(TO_VARCHAR(CAST(12 AS BYTEINT), 'TM'), 10) ||'""'"
---------------------------------------------------------------
"12"
```

#### SMALLINT

##### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(123 as SMALLINT) as varchar(10))||'"';
```

**Result**

```sql
(('"'||123)||'"')|
-----------------+
"123"            |
```

##### *Snowflake*

**Query**

```sql
SELECT
'"'|| LEFT(TO_VARCHAR(CAST(123 AS SMALLINT), 'TM'), 10) ||'"';
```

**Result**

```
"'""'|| LEFT(TO_VARCHAR(CAST(123 AS SMALLINT), 'TM'), 10) ||'""'"
-----------------------------------------------------------------
"123"
```

#### INTEGER

##### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(12345 as INTEGER) as varchar(10))||'"';
```

**Result**

```sql
(('"'||12345)||'"')|
-------------------+
"12345"            |
```

##### *Snowflake*

**Query**

```sql
SELECT
'"'|| LEFT(TO_VARCHAR(CAST(12345 AS INTEGER), 'TM'), 10) ||'"';
```

**Result**

```sql
"'""'|| LEFT(TO_VARCHAR(CAST(12345 AS INTEGER), 'TM'), 10) ||'""'"
------------------------------------------------------------------
"12345"
```

#### BIGINT

##### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(12345 as BIGINT) as varchar(10))||'"';
```

**Result**

```sql
(('"'||12345)||'"')|
-------------------+
"12345"            |
```

##### *Snowflake*

**Query**

```sql
SELECT
       '"'|| LEFT(TO_VARCHAR(CAST(12345 AS BIGINT), 'TM'), 10) ||'"';
```

**Result**

```sql
"'""'|| LEFT(TO_VARCHAR(CAST(12345 AS BIGINT), 'TM'), 10) ||'""'"
-----------------------------------------------------------------
"12345"
```

#### DECIMAL[(n[,m])] or NUMERIC[(n[,m])]

##### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(12345 as DECIMAL) as varchar(10))||'"',
       '"'||cast(cast(12345 as DECIMAL(12, 2)) as varchar(10))||'"';
```

**Result**

```sql
(('"'||12345)||'"')|(('"'||12345)||'"')|
-------------------+-------------------+
"12345."           |"12345.00"         |
```

##### *Snowflake*

**Query**

```sql
SELECT
'"'|| LEFT(TO_VARCHAR(CAST(12345 AS DECIMAL), 'TM.'), 10) ||'"',
'"'|| LEFT(TO_VARCHAR(CAST(12345 AS DECIMAL(12, 2)), 'TM'), 10) ||'"';
```

**Result**

```
'"'|| LEFT(TO_VARCHAR(CAST(12345 AS DECIMAL), 'TM.'), 10) ||'"'	'"'|| LEFT(TO_VARCHAR(CAST(12345 AS DECIMAL(12, 2)), 'TM'), 10) ||'"'
"12345."	"12345.00"
```

### Known Issues

* Teradata treats the numbers between 0 and 1 differently than Snowflake. For those values, Teradata does not add the zero before the dot; meanwhile, Snowflake does.

#### *Teradata*

**Query**

```sql
SELECT '"'||cast(cast(-0.1 as DECIMAL(12, 2)) as varchar(10))||'"' AS column1,
       '"'||cast(cast(0.1 as DECIMAL(12, 2)) as varchar(10))||'"' AS column2;
```

**Result**

```sql
COLUMN1          |COLUMN2
-----------------+--------------+
"-.10"           |".10"         |
```

#### *Snowflake*

**Query**

```sql
SELECT
'"'|| LEFT(TO_VARCHAR(CAST(-0.1 AS DECIMAL(12, 2)), 'TM'), 10) ||'"' AS column1,
'"'|| LEFT(TO_VARCHAR(CAST(0.1 AS DECIMAL(12, 2)), 'TM'), 10) ||'"' AS column2;
```

**Result**

```sql
COLUMN1           |COLUMN2
------------------+---------------+
"-0.10"           |"0.10"         |
```

### Related EWIs

No related EWIs.

## Cast to DATE using { }

### Description

The following syntax casts a date-formatted string to DATE datatype by putting a d before the string definition inside curly braces.

```none
SELECT {d '1233-10-10'}
```

### Sample Source Patterns

#### Cast to DATE using curly braces

**Teradata**

**Cast to Date**

```none
SELECT * FROM RESOURCE_DETAILS where change_ts >= {d '2022-09-10'};
```

**Snowflake**

**Cast to Date**

```sql
SELECT
* FROM
PUBLIC.RESOURCE_DETAILS
where change_ts >= DATE('2022-09-10');
```

## Cast string expressions to TIMESTAMP/DATE

### Description

When a CAST expression converts a string-typed operand (column reference, concatenation, or expression) to a TIMESTAMP or DATE type without an explicit FORMAT clause, SnowConvert AI converts it to the appropriate Snowflake function:

* `TO_TIMESTAMP(expr)` — when the target is TIMESTAMP and no timezone offset is detected
* `TO_TIMESTAMP_TZ(expr)` — when the target is TIMESTAMP and the concatenation includes a timezone offset literal (e.g., `'+00:00'` or `'-05:00'`)
* `TO_DATE(expr, format)` — when the target is DATE

For TIMESTAMP targets with non-literal operands, the format argument is omitted to let Snowflake’s AUTO format detection handle the conversion at runtime. This avoids hardcoding a format that may not match the actual data.

When a timezone offset literal is detected in a concatenation expression, `TO_TIMESTAMP_TZ` is used instead of `TO_TIMESTAMP` to preserve timezone information. Per Snowflake documentation, using `TO_TIMESTAMP` (which produces `TIMESTAMP_NTZ`) with timezone data silently discards the timezone offset.

For string literal operands (e.g., `CAST('2022-11-01' AS TIMESTAMP)`), a default format is applied and dashes are replaced with slashes to match Snowflake conventions.

### Sample Source Patterns

#### Concatenation with timezone offset to TIMESTAMP

##### *Teradata*

**Concatenation with TZ offset**

```sql
SELECT CAST(SQ.EXTRACTION_DATE || '.000000' || '+00:00' AS TIMESTAMP(6)) AS EXTRACTION_DATE;
```

##### *Snowflake*

**Concatenation with TZ offset**

```sql
SELECT
  TO_TIMESTAMP_TZ(SQ.EXTRACTION_DATE || '.000000' || '+00:00') AS EXTRACTION_DATE;
```

#### Concatenation with negative timezone offset to TIMESTAMP

##### *Teradata*

**Concatenation with negative TZ offset**

```sql
SELECT CAST(COL1 || '.000000' || '-05:00' AS TIMESTAMP(6));
```

##### *Snowflake*

**Concatenation with negative TZ offset**

```sql
SELECT
  TO_TIMESTAMP_TZ(COL1 || '.000000' || '-05:00');
```

#### Concatenation without timezone offset to TIMESTAMP

##### *Teradata*

**Concatenation without TZ offset**

```sql
SELECT CAST(COL1 || '.000000' AS TIMESTAMP(6));
```

##### *Snowflake*

**Concatenation without TZ offset**

```sql
SELECT
  TO_TIMESTAMP(COL1 || '.000000');
```

#### Column reference to TIMESTAMP without FORMAT

##### *Teradata*

**Column ref to TIMESTAMP**

```sql
SELECT CAST(COL1 AS TIMESTAMP);
```

##### *Snowflake*

**Column ref to TIMESTAMP**

```sql
SELECT
  TO_TIMESTAMP(COL1);
```

#### Concatenation to DATE

##### *Teradata*

**Concatenation to DATE**

```sql
SELECT CAST(COL1 || '-01' AS DATE);
```

##### *Snowflake*

**Concatenation to DATE**

```sql
SELECT
  TO_DATE(COL1 || '-01', 'YYYY/MM/DD');
```

### Known Issues

No related issues.

### Related EWIs

No related EWIs.

## Cast to INTERVAL datatype

### Description

Snowflake does not support the Interval data type, but it has INTERVAL constants that can be used in DateTime operations and other uses can be emulated using VARCHAR, SnowConvert AI will transform CAST functions to the INTERVAL datatype into an equivalent depending on the case:

* When the value being casted is of type interval an UDF will be generated to produce the new interval equivalent as a string
* When the value is a literal, a Snowflake interval constant will be generated if the cast is used in a datetime operation, otherwise a literal string will be generated
* When the value is non-literal then a cast to string will be generated

### Sample Source Patterns

#### Non-interval literals

##### *Teradata*

**Query**

```sql
SELECT TIMESTAMP '2022-10-15 10:30:00' + CAST ('12:34:56.78' AS INTERVAL HOUR(2) TO SECOND(2)) AS VARCHAR_TO_INTERVAL,
TIMESTAMP '2022-10-15 10:30:00' + CAST(-5 AS INTERVAL YEAR(4)) AS NUMBER_TO_INTERVAL,
CAST('07:00' AS INTERVAL HOUR(2) TO MINUTE) AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
VARCHAR_TO_INTERVAL | NUMBER_TO_INTERVAL | OUTSIDE_DATETIME_OPERATION |
--------------------+--------------------+----------------------------+
2022-10-15 23:04:56 |2017-10-15 10:30:00 | 7:00                       |
```

##### *Snowflake*

**Query**

```sql
SELECT
TIMESTAMP '2022-10-15 10:30:00' + INTERVAL '12 HOUR, 34 MINUTE, 56 SECOND, 780000 MICROSECOND' AS VARCHAR_TO_INTERVAL,
TIMESTAMP '2022-10-15 10:30:00' + INTERVAL '-5 YEAR' AS NUMBER_TO_INTERVAL,
'07:00' AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
VARCHAR_TO_INTERVAL     | NUMBER_TO_INTERVAL     | OUTSIDE_DATETIME_OPERATION |
------------------------+------------------------+----------------------------+
2022-10-15 23:04:56.780 |2017-10-15 10:30:00.000 | 07:00                      |
```

#### Non-literal and non-interval values

##### *Teradata*

**Query**

```sql
SELECT TIMESTAMP '2022-10-15 10:30:00' + CAST('20 ' || '10' AS INTERVAL DAY TO HOUR) AS DATETIME_OPERATION,
CAST('20 ' || '10' AS INTERVAL DAY TO HOUR) AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
DATETIME_OPERATION  | OUTSIDE_DATETIME_OPERATION |
--------------------+----------------------------+
2022-11-04 20:30:00 | 20 10                      |
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.DATETIMEINTERVALADD_UDF(TIMESTAMP '2022-10-15 10:30:00', CAST('20 ' || '10' AS VARCHAR(21)), 'DAY', '+') AS DATETIME_OPERATION,
CAST('20 ' || '10' AS VARCHAR(21)) AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
DATETIME_OPERATION      | OUTSIDE_DATETIME_OPERATION |
------------------------+----------------------------+
2022-11-04 20:30:00.000 | 20 10                      |
```

#### Cast of interval to another interval

##### *Teradata*

**Query**

```sql
SELECT
TIMESTAMP '2022-10-15 10:30:00' + CAST(INTERVAL '5999' MINUTE AS INTERVAL DAY TO HOUR) AS DATETIME_OPERATION,
CAST(INTERVAL '5999' MINUTE AS INTERVAL DAY TO HOUR) AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
DATETIME_OPERATION  | OUTSIDE_DATETIME_OPERATION |
--------------------+----------------------------+
2022-10-19 13:30:00 | 4 03                       |
```

##### *Snowflake*

**Query**

```sql
SELECT
PUBLIC.DATETIMEINTERVALADD_UDF(
TIMESTAMP '2022-10-15 10:30:00', PUBLIC.INTERVALTOINTERVAL_UDF('5999', 'MINUTE', 'MINUTE', 'DAY', 'HOUR'), 'DAY', '+') AS DATETIME_OPERATION,
PUBLIC.INTERVALTOINTERVAL_UDF('5999', 'MINUTE', 'MINUTE', 'DAY', 'HOUR') AS OUTSIDE_DATETIME_OPERATION;
```

**Result**

```sql
DATETIME_OPERATION      | OUTSIDE_DATETIME_OPERATION |
------------------------+----------------------------+
2022-10-19 13:30:00.000 | 4 03                       |
```

### Known Issues

**No known issues.**

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - COMMON STATEMENTS
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/common-statements.md
section: Migrations
---

# SnowConvert AI - Teradata - COMMON STATEMENTS

Translation references to convert Teradata script statements that are in common among all scripts syntaxes to Snowflake SQL

## ERROR HANDLING

> The BTEQ error handling capabilities are based on the Teradata Database error codes. These are the standard error codes and messages produced in response to user-specified Teradata SQL statements. A BTEQ user cannot change, modify or delete these messages.

For more information, see the [Teradata BTEQ Error Handling documentation](https://docs.teradata.com/r/Basic-Teradata-Query-Reference/October-2018/Using-BTEQ/Error-Handling).

### Sample Source Patterns

#### Basic BTEQ Error Handling Example

The error conditions content is relocated in different statements in case ERRORCODE is different to zero, otherwise it can be located as the original code. First, the query above the if statement is relocated within a BEGIN - END block, where in case of an exception it will be caught in the EXCEPTION block. Additionally, the ERRORCODE variable will be changed to the variable declared indicating its SQLCODE with an EWI indicating that the exact number of the SQLCODE is not the same as the ERRORCODE in BTEQ.

##### Teradata BTEQ

```none
-- Additional Params: -q SnowScript
SELECT * FROM table1;

.IF ERRORCODE<>0 THEN .EXIT 1

.QUIT 0
```

##### Snowflake SQL

```none
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      -- Additional Params: -q SnowScript
      SELECT
        *
      FROM
        table1;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    IF (STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/ != 0) THEN
      RETURN 1;
    END IF;
    RETURN 0;
  END
$$
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-TD0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): The Snowflake error code mismatches the original Teradata error code.

## EXIT or QUIT

Logs off all database sessions and then exits BTEQ.

The highest severity value encountered during BTEQ’s execution will by default be used as BTEQ’s return code value unless an argument is explicitly supplied. ([Teradata Basic Query Reference EXIT or QUIT Command](https://docs.teradata.com/r/Basic-Teradata-Query-Reference/October-2018/BTEQ-Commands/BTEQ-Command-Descriptions/ERROROUT))

```sql
.<ExitCommand> [<Result>];
<ExitCommand> := EXIT | QUIT
<Result> := <Status_variable> | Number
<Status_variable> := ACTIVITY_COUNT | ERRORCODE | ERRORLEVEL
```

### Sample Source Patterns

#### Basic IF example

##### Teradata BTEQ

```sql
-- Additional Params: -q SnowScript
.QUIT ERRORCODE;
```

##### Snowflake SQL

```none
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    RETURN STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/;
  END
$$
```

### Known Issues

When the EXIT or QUIT command doesn’t have an input, it returns the ERRORLEVEL as default. However, SnowConvert AI transforms it to return 0.

### Related EWIs

1. [SSC-FDM-TD0013](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): The Snowflake error code mismatches the original Teradata error code.

## GOTO

### Description

> The BTEQ Goto command skips over all intervening BTEQ commands and SQL statements until a specified label is encountered, then resumes processing as usual. ([Teradata Basic Query Reference Goto Command](https://docs.teradata.com/r/1fdhoBglKXYl~W_OyMEtGQ/KzhGjSojGrSjKxfWnYYrMw))

```sql
.GOTO LabelName;
```

### Sample Source Patterns

#### Basic GOTO example

Snowflake scripting doesn’t have an equivalent statement for Teradata BTEQ Goto command, but fortunately it can be removed from the input code and get an equivalent code, due to the sequence of Goto and Labels commands always in reverse topological order. In other words, the definitions come after their uses. Thus, SnowConvert AI just needs to copy bottom-up all Label section code to its corresponding Goto statements.

##### Teradata BTEQ

```sql
-- Additional Params: -q SnowScript
.LOGON 0/dbc,dbc;
   DATABASE tduser;
.LOGON 127.0.0.1/dbc,dbc;

INSERT INTO TABLEB VALUES (1);
.IF activitycount = 0 then .GOTO SECTIONA
.IF activitycount >= 1 then .GOTO SECTIONB

.label SECTIONA
.REMARK 'Zero Hours on Account'
.GOTO SECTIONC

.label SECTIONB
.REMARK 'Total Hours on Account'

.label SECTIONC
.logoff
.exit
```

##### Snowflake

```none
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    -- Additional Params: -q SnowScript
    --.LOGON 0/dbc,dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      USE DATABASE tduser;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    --.LOGON
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    /*** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '4' COLUMN '8' OF THE SOURCE CODE STARTING AT '127.0'. EXPECTED 'STATEMENT' GRAMMAR. LAST MATCHING TOKEN WAS 'LOGON' ON LINE '4' COLUMN '2'. FAILED TOKEN WAS '127.0' ON LINE '4' COLUMN '8'. CODE '81'. ***/
    /*--127.0.0.1/dbc,dbc*/

    BEGIN
      INSERT INTO TABLEB
      VALUES (1);
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    IF (NOT (STATUS_OBJECT['SQLROWCOUNT'] = 0)) THEN
      --** SSC-FDM-TD0026 - GOTO SECTIONA WAS REMOVED DUE TO IF STATEMENT INVERSION **

      IF (STATUS_OBJECT['SQLROWCOUNT'] >= 1) THEN

        /*.label SECTIONB*/

        --.REMARK 'Total Hours on Account'
        null;
        /*.label SECTIONC*/

        --.logoff
        null;
        RETURN 0;
      END IF;
    END IF;
    /*.label SECTIONA*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    --.REMARK 'Zero Hours on Account'
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Remark' NODE ***/!!!
    null;

    /*.label SECTIONC*/

    --.logoff
    null;
    RETURN 0;
    /*.label SECTIONB*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    --.REMARK 'Total Hours on Account'
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'Remark' NODE ***/!!!
    null;
    /*.label SECTIONC*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    --.logoff
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'LogOff' NODE ***/!!!
    null;
    RETURN 0;
  END
$$
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0001](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Unrecognized token on the line of the source code.
2. [SSC-FDM-0027](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Removed next statement, not applicable in Snowflake.
3. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review
4. [SSC-FDM-TD0026](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): GOTO statement was removed due to if statement inversion.

## IF… THEN…

### Description

> The IF statement validates a condition and executes an action when the action is true. ([Teradata SQL Language reference IF…THEN…](https://docs.teradata.com/r/1fdhoBglKXYl~W_OyMEtGQ/92K64CKQxrkuO4Hm7P8IEA))

```sql
.IF <Condition> THEN <Action>;

<Condition> := <Status_variable> <Operator> Number
<Status_variable> := ACTIVITY_COUNT | ERRORCODE | ERRORLEVEL
<Operator> := ^= | != | ~= | <> | = | < | > | <= | >=
<Action> := BTEQ_command | SQL_request
```

### Sample Source Patterns

#### Basic IF example

##### Teradata BTEQ

```sql
-- Additional Params: -q SnowScript
.IF ACTIVITYCOUNT <> 0 THEN .GOTO InsertEmployee;
```

##### Snowflake SQL

```none
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    IF (STATUS_OBJECT['SQLROWCOUNT'] != 0) THEN

      RETURN 1;
    END IF;
  END
$$
```

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - CREATE TYPE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/teradata-create-type.md
section: Migrations
---

# SnowConvert AI - Teradata - CREATE TYPE

Teradata structured types in `SYSUDTLIB` (and similar) are translated when the definition includes a `CAST FROM` clause that allows reduction to a single Snowflake scalar type, or when the body is a simple attribute list without such a cast mapping to an alias. For background, see Teradata’s UDT documentation and [Snowflake `CREATE TYPE`](https://docs.snowflake.com/en/sql-reference/sql/create-type).

## Types with `CAST FROM` (scalar alias)

When Teradata defines a UDT with one attribute and a `CAST FROM` clause pointing at a built-in type, SnowConvert can emit a Snowflake `CREATE TYPE ... AS <scalar>` alias.

**Source (Teradata):**

```sql
CREATE TYPE SYSUDTLIB.MyUDT AS (value INTEGER) INSTANTIABLE NOT FINAL CAST FROM INTEGER;
```

**Snowflake equivalent:**

```sql
CREATE TYPE SYSUDTLIB.MyUDT AS INTEGER;
```

**Source (Teradata):**

```sql
CREATE TYPE SYSUDTLIB.EmailAddr AS (val VARCHAR(255)) INSTANTIABLE NOT FINAL CAST FROM VARCHAR(255);
```

**Snowflake equivalent:**

```sql
CREATE TYPE SYSUDTLIB.EmailAddr AS VARCHAR(255);
```

**Source (Teradata):**

```sql
CREATE TYPE SYSUDTLIB.Currency AS (val DECIMAL(15,2)) INSTANTIABLE NOT FINAL CAST FROM DECIMAL(15,2);
```

**Snowflake equivalent:**

```sql
CREATE TYPE SYSUDTLIB.Currency AS DECIMAL(15, 2);
```

## Composite type without scalar `CAST FROM`

Multi-attribute definitions without the scalar-alias pattern map to Snowflake `OBJECT(...)`.

**Source (Teradata):**

```sql
CREATE TYPE SYSUDTLIB.Person AS (FirstName VARCHAR(50), LastName VARCHAR(50), Age INTEGER) INSTANTIABLE NOT FINAL;
```

**Snowflake equivalent:**

```sql
CREATE TYPE SYSUDTLIB.Person AS OBJECT (FirstName VARCHAR(50), LastName VARCHAR(50), Age INTEGER);
```

**Notes:** Teradata-specific clauses such as `INSTANTIABLE` / `NOT FINAL` are not represented in Snowflake `CREATE TYPE`; the translation focuses on the usable Snowflake type shape.

---
title: SnowConvert AI - Teradata - Data Migration Considerations
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/data-migration-considerations.md
section: Migrations
---

# SnowConvert AI - Teradata - Data Migration Considerations

This section describe important consideration when migration data from Teradata to Snowflake.

> **Note:**
>
> Consider that this is a work in progress.

When migrating data from Teradata to Snowflake, it is crucial to consider the functional differences between the databases. This page showcases the best suggestions for migrating data.

Review the following information:

## UNION ALL Data Migration

Data migration considerations for UNION ALL.

UNION ALL is a SQL operator that allows the combination of multiple resultsets. The syntax is the following:

```sql
 query_expression_1 UNION [ ALL ] query_expression_2
```

For more information, please review the following [Teradata](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Manipulation-Language/Set-Operators/UNION-Operator/UNION-Operator-Syntax) documentation.

### Column Size differences

Even though the operator is translated into the same operator in Snowflake, there could be detailed differences in functional equivalence. For example, the union of different columns which have different column sizes. Teradata does truncate the values when the first SELECT statement contains less space in the columns.

#### Teradata behavior

> **Note:**
>
> **Same behavior in ANSI and TERA session modes.**

For this example, the following input will show the Teradata behavior.

##### Teradata setup data

```sql
 CREATE TABLE table1
(
col1 VARCHAR(20)
);

INSERT INTO table1 VALUES('value 1 abcdefghijk');
INSERT INTO table1 VALUES('value 2 abcdefghijk');

CREATE TABLE table2
(
col1 VARCHAR(10)
);

INSERT INTO table2 VALUES('t2 row 1 a');
INSERT INTO table2 VALUES('t2 row 2 a');
INSERT INTO table2 VALUES('t2 row 3 a');
```

##### Snowflake setup data

```sql
 CREATE OR REPLACE TABLE table1
(
col1 VARCHAR(20)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table1
VALUES ('value 1 abcdefghijk');

INSERT INTO table1
VALUES ('value 2 abcdefghijk');

CREATE OR REPLACE TABLE table2
(
col1 VARCHAR(10)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table2
VALUES ('t2 row 1 a');

INSERT INTO table2
VALUES ('t2 row 2 a');

INSERT INTO table2
VALUES ('t2 row 3 a');
```

#### **Case 1 - one single column: UNION ALL for a column VARCHAR(20) over a column VARCHAR(10)**

> **SuccessPlaceholder:**
>
> For this case, the functional equivalence is the same

##### Teradata input

```sql
 SELECT col1 FROM table1
UNION ALL
SELECT col1 FROM table2;
```

##### Teradata output

```none
value 1 abcdefghijk
t2 row 3 a
value 2 abcdefghijk
t2 row 1 a
t2 row 2 a
```

##### Snowflake input

```sql
 SELECT
col1 FROM
table1
UNION ALL
SELECT
col1 FROM
table2;
```

##### Snowflake output

```none
value 1 abcdefghijk
t2 row 3 a
value 2 abcdefghijk
t2 row 1 a
t2 row 2 a
```

#### **Case 2 - one single column: UNION ALL for a column VARCHAR(10) over a column VARCHAR(20)**

> **Danger:**
>
> In this case, the functional equivalence is not the same.

The following case does not show functional equivalence in Snowflake. The column values should be truncated as in the Teradata sample.

##### Teradata input

```sql
 SELECT col1 FROM table2
UNION ALL
SELECT col1 FROM table1;
```

##### Teradata output

```none
t2 row 3 a
value 1 ab --> truncated
t2 row 1 a
t2 row 2 a
value 2 ab --> truncated
```

##### Snowflake input

```sql
 SELECT
col1 FROM
table2
UNION ALL
SELECT
col1 FROM
table1;
```

##### Snowflake output

```none
t2 row 3 a
value 1 abcdefghijk --> NOT truncated
t2 row 1 a
t2 row 2 a
value 2 abcdefghijk --> NOT truncated
```

**Workaround to get the same functionality**

In this case, the size of the column of the `table2` is 10 and the `table1` is 20. So, the size of the first column in the query should be the element to complete the `LEFT()` function used here. For more information, see the [Snowflake LEFT function documentation](https://docs.snowflake.com/en/sql-reference/functions/left).

##### Snowflake input

```sql
 SELECT col1 FROM table2 -- size (10)
UNION ALL
SELECT LEFT(col1, 10) AS col1 FROM table1;
```

##### Snowflake output

```none
t2 row 1 a
t2 row 2 a
t2 row 3 a
value 1 ab
value 2 ab
```

#### **Case 3 - multiple columns - same size by table: UNION ALL for columns VARCHAR(20) over columns VARCHAR(10)**

For this case, it is required to set up new data as follows:

##### Teradata setup data

```sql
 CREATE TABLE table3
(
col1 VARCHAR(20),
col2 VARCHAR(20)
);

INSERT INTO table3 VALUES('value 1 abcdefghijk', 'value 1 abcdefghijk');
INSERT INTO table3 VALUES('value 2 abcdefghijk', 'value 2 abcdefghijk');

CREATE TABLE table4
(
col1 VARCHAR(10),
col2 VARCHAR(10)
);

INSERT INTO table4 VALUES('t2 row 1 a', 't2 row 1 b');
INSERT INTO table4 VALUES('t2 row 2 a', 't2 row 2 b');
INSERT INTO table4 VALUES('t2 row 3 a', 't2 row 3 b');
```

##### Snowflake setup data

```sql
 CREATE OR REPLACE TABLE table3
(
col1 VARCHAR(20),
col2 VARCHAR(20)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table3
VALUES ('value 1 abcdefghijk', 'value 1 abcdefghijk');

INSERT INTO table3
VALUES ('value 2 abcdefghijk', 'value 2 abcdefghijk');

CREATE OR REPLACE TABLE table4
(
col1 VARCHAR(10),
col2 VARCHAR(10)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table4
VALUES ('t2 row 1 a', 't2 row 1 b');

INSERT INTO table4
VALUES ('t2 row 2 a', 't2 row 2 b');

INSERT INTO table4
VALUES ('t2 row 3 a', 't2 row 3 b');
```

Once the new tables and data are created, the following query can be evaluated.

> **Note:**
>
> For this case, the functional equivalence is the same

##### Teradata input

```sql
 select col1, col2 from table3
union all
select col1, col2 from table4;
```

##### Teradata output

| col1 | col2 |
| --- | --- |
| value 1 abcdefghijk | value 1 abcdefghijk |
| t2 row 3 a | t2 row 3 b |
| value 2 abcdefghijk | value 2 abcdefghijk |
| t2 row 1 a | t2 row 1 b |
| t2 row 2 a | t2 row 2 b |

##### Snowflake input

```sql
 SELECT
col1, col2 FROM
table3
UNION ALL
SELECT
col1, col2 FROM
table4;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| value 1 abcdefghijk | value 1 abcdefghijk |
| value 2 abcdefghijk | value 2 abcdefghijk |
| t2 row 1 a | t2 row 1 b |
| t2 row 2 a | t2 row 2 b |
| t2 row 3 a | t2 row 3 b |

#### Case 4 - multiple columns - same size by table: UNION ALL for columns VARCHAR(10) over columns VARCHAR(20)

> **Warning:**
>
> In this case, the functional equivalence is not the same.

##### Teradata input

```sql
 select col1, col2 from table4
union all
select col1, col2 from table3;
```

##### Teradata output

| col1 | col2 |
| --- | --- |
| t2 row 3 a | t2 row 3 b |
| value 1 ab | value 1 ab |
| t2 row 1 a | t2 row 1 b |
| t2 row 2 a | t2 row 2 b |
| value 2 ab | value 2 ab |

##### Snowflake input

```sql
 SELECT
col1, col2 FROM
table4
UNION ALL
SELECT
col1, col2 FROM
table3;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| t2 row 1 a | t2 row 1 b |
| t2 row 2 a | t2 row 2 b |
| t2 row 3 a | t2 row 3 b |
| value 1 abcdefghijk | value 1 abcdefghijk |
| value 2 abcdefghijk | value 2 abcdefghijk |

**Workaround to get the same functionality**

Apply the column size to the second `SELECT` on the columns to get the same functionality.

##### Snowflake input

```sql
 SELECT col1, col2 FROM table4 -- size (10)
UNION ALL
SELECT LEFT(col1, 10) AS col1, LEFT(col2, 10) AS col2 FROM table3;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| t2 row 1 a | t2 row 1 b |
| t2 row 2 a | t2 row 2 b |
| t2 row 3 a | t2 row 3 b |
| value 1 ab | value 1 ab |
| value 2 ab | value 2 ab |

#### Case 5 - multiple columns - different sizes by table: UNION ALL for columns VARCHAR(10) over columns VARCHAR(20)

For this case, it is required to set up new data as follows:

##### Teradata setup data

```sql
 CREATE TABLE table5
(
col1 VARCHAR(20),
col2 VARCHAR(12)
);

INSERT INTO table5 VALUES('value 1 abcdefghijk', 'value 1 abcdefghijk');
INSERT INTO table5 VALUES('value 2 abcdefghijk', 'value 2 abcdefghijk');

CREATE TABLE table6
(
col1 VARCHAR(10),
col2 VARCHAR(5)
);

INSERT INTO table6 VALUES('t2 row 1 a', 't2 row 1 b');
INSERT INTO table6 VALUES('t2 row 2 a', 't2 row 2 b');
INSERT INTO table6 VALUES('t2 row 3 a', 't2 row 3 b');
```

##### Snowflake setup data

```sql
 CREATE OR REPLACE TABLE table5
(
col1 VARCHAR(20),
col2 VARCHAR(12)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table5
VALUES ('value 1 abcdefghijk', 'value 1 abcd');

INSERT INTO table5
VALUES ('value 2 abcdefghijk', 'value 2 abcd');

CREATE OR REPLACE TABLE table6
(
col1 VARCHAR(10),
col2 VARCHAR(5)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/14/2024" }}'
;

INSERT INTO table6
VALUES ('t2 row 1 a', 't2 1b');

INSERT INTO table6
VALUES ('t2 row 2 a', 't2 2b');

INSERT INTO table6
VALUES ('t2 row 3 a', 't2 3b');
```

Once the new tables and data are created, the following query can be evaluated.

> **Note:**
>
> For this case, the functional equivalence is the same

##### Teradata input

```sql
 select col1, col2 from table5
union all
select col1, col2 from table6;
```

##### Teradata output

| col1 | col2 |
| --- | --- |
| value 1 abcdefghijk | value 1 abcd |
| t2 row 3 a | t2 3b |
| value 2 abcdefghijk | value 2 abcd |
| t2 row 1 a | t2 1b |
| t2 row 2 a | t2 2b |

##### Snowflake input

```sql
 SELECT
col1, col2 FROM
table5
UNION ALL
SELECT
col1, col2 FROM
table6;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| value 1 abcdefghijk | value 1 abcd |
| value 2 abcdefghijk | value 2 abcd |
| t2 row 1 a | t2 1b |
| t2 row 2 a | t2 2b |
| t2 row 3 a | t2 3b |

#### Case 6 - multiple columns - different sizes by table: UNION ALL for columns VARCHAR(20), VARCHAR(10) over columns VARCHAR(10), VARCHAR(5)

> **Warning:**
>
> In this case, the functional equivalence is not the same.

##### Teradata input

```sql
 select col1, col2 from table6
union all
select col1, col2 from table5;
```

##### Teradata output

| col1 | col2 |
| --- | --- |
| t2 row 3 a | t2 3b |
| **value 1 ab** | **value** |
| t2 row 1 a | t2 1b |
| t2 row 2 a | t2 2b |
| **value 2 ab** | **value** |

##### Snowflake input

```sql
 SELECT
col1, col2 FROM
table6
UNION ALL
SELECT
col1, col2 FROM
table5;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| t2 row 1 a | t2 1b |
| t2 row 2 a | t2 2b |
| t2 row 3 a | t2 3b |
| **value 1 abcdefghijk** | **value 1 abcd** |
| **value 2 abcdefghijk** | **value 2 abcd** |

**Workaround to get the same functionality**

The column with the smallest size from the first `SELECT` is used to determine the size of the columns from the second `SELECT`.

##### Snowflake input

```sql
 SELECT
col1, col2 FROM
table6
UNION ALL
SELECT
LEFT(col1, 5) as col1, LEFT(col2, 5) AS col2 FROM
table5;
```

##### Snowflake output

| col1 | col2 |
| --- | --- |
| t2 row 3 a | t2 3b |
| **value 1 ab** | **value** |
| t2 row 1 a | t2 1b |
| t2 row 2 a | t2 2b |
| **value 2 ab** | **value** |

#### Case 7 - multiple columns *expression* - different sizes by table: UNION ALL for columns VARCHAR(20), VARCHAR(20) over columns VARCHAR(10), VARCHAR(10)

Use the data set up in Case 3 — Multiple columns — Same size by table. Once the new tables and data are created, the following query can be evaluated.

> **Note:**
>
> For this case, the functional equivalence is the same

##### Teradata input

```sql
 select col1 || col2 from table3
union all
select col1 || col2 from table4;
```

##### Teradata output

| col1 || col2 |
| --- |
| value 1 abcdefghijkvalue 1 abcdefghijk |
| t2 row 3 at2 row 3 b |
| value 2 abcdefghijkvalue 2 abcdefghijk |
| t2 row 1 at2 row 1 b |
| t2 row 2 at2 row 2 b |

##### Snowflake input

```sql
 SELECT
col1 || col2 FROM
table3
UNION ALL
SELECT
col1 || col2 FROM
table4;
```

##### Snowflake output

| col1 || col2 |
| --- |
| value 1 abcdefghijkvalue 1 abcdefghijk |
| value 2 abcdefghijkvalue 2 abcdefghijk |
| t2 row 1 at2 row 1 b |
| t2 row 2 at2 row 2 b |
| t2 row 3 at2 row 3 b |

#### Case 8 - multiple columns *expression* - different sizes by table: UNION ALL for columns VARCHAR(20), VARCHAR(20) over columns VARCHAR(10), VARCHAR(10)

> **Warning:**
>
> This case has functional differences.

##### Teradata input

```sql
 select col1 || col2 from table4
union all
select col1 || col2 from table3;
```

##### Teradata output

| col1 || col2 |
| --- |
| t2 row 1 at2 row 1 b |
| t2 row 2 at2 row 2 b |
| t2 row 3 at2 row 3 b |
| value 1 abcdefghijkv |
| value 2 abcdefghijkv |

##### Snowflake input

```sql
 SELECT
col1 || col2 FROM
table4
UNION ALL
SELECT
col1 || col2 FROM
table3;
```

##### Snowflake output

| col1 || col2 |
| --- |
| t2 row 1 at2 row 1 b |
| t2 row 2 at2 row 2 b |
| t2 row 3 at2 row 3 b |
| value 1 abcdefghijkvalue 1 abcdefghijk |
| value 2 abcdefghijkvalue 2 abcdefghijk |

**Workaround to get the same functionality**

The sum of the column sizes of the smaller column should be used in the `LEFT` function. For example, if the smaller column is VARCHAR(10), then the limit of the `LEFT` function should be 20 (10 + 10).

> **Warning:**
>
> If the first `SELECT` result is smaller, its sum would be used for the truncation of the values.

##### Snowflake input

```sql
 SELECT
col1 || col2 FROM
table4
UNION ALL
SELECT
LEFT(col1 || col2, 20) FROM
table3;
```

##### Snowflake output

| col1 || col2 |
| --- |
| t2 row 1 at2 row 1 b |
| t2 row 2 at2 row 2 b |
| t2 row 3 at2 row 3 b |
| value 1 abcdefghijkv |
| value 2 abcdefghijkv |

#### Other considerations about column size differences

* `CHAR` and `VARCHAR` behave the same.
* Number columns may behave differently. The numbers cannot be truncated, so there is an overflow in the Teradata environment. So, this is not applied to these data types. Review the following example:

```sql
-- Teradata number sample
CREATE TABLE table11
(
col1 NUMBER(2)
);

INSERT INTO table11 VALUES(10);
INSERT INTO table11 VALUES(10);

CREATE TABLE table12
(
col1 NUMBER(1)
);

INSERT INTO table12 VALUES(1);
INSERT INTO table12 VALUES(1);
INSERT INTO table12 VALUES(1);

-- ERROR!  Overflow occurred when computing an expression involving table11.col1
SELECT col1 FROM table12
UNION ALL
SELECT col1 FROM table11;
```

---
title: SnowConvert AI - Teradata - Data Types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/data-types.md
section: Migrations
---

# SnowConvert AI - Teradata - Data Types

This section shows equivalents between data types in Teradata and in Snowflake.

## Conversion Table

| Teradata | Snowflake | Notes |
| --- | --- | --- |
| `ARRAY` | `ARRAY` |  |
| `BIGINT` | `BIGINT` | `BIGINT`in Snowflake is an alias for `NUMBER(38,0).`[Check out note] |
| `BLOB` | `BINARY` | Limited to 8MB. `BLOB`is not supported, warning [SSC-FDM-TD0001](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) is generated |
| `BYTE` | `BINARY` |  |
| `BYTEINT` | `BYTEINT` |  |
| `CHAR` | `CHAR` |  |
| `CLOB` | `VARCHAR` | ​Limited to 16MB. `CLOB`is not supported, warning [SSC-FDM-TD0002](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) is generated |
| `DATE` | `DATE` |  |
| `DECIMAL` | `DECIMAL` |  |
| `DOUBLE PRECISION` | `DOUBLE PRECISION` |  |
| `FLOAT` | `FLOAT` |  |
| `INTEGER` | `INTEGER` | `INTEGER`in Snowflake is an alias for `NUMBER(38,0)`. [Check out note] |
| `INTERVAL DAY [TO HOUR | MINUTE | SECOND]` | `VARCHAR(20)` | ​Intervals are stored as`VARCHAR`in Snowflake except when used in addition/subtraction. [Check out note]. |
| `INTERVAL HOUR [TO MINUTE | SECOND]` | `VARCHAR(20)` | ​Intervals are stored as`VARCHAR`in Snowflake except when used in addition/subtraction. [Check out note]. |
| `INTERVAL MINUTE [TO SECOND]` | `VARCHAR(20)` | ​Intervals are stored as`VARCHAR`in Snowflake except when used in addition/subtraction. [Check out note]. |
| `INTERVAL SECOND` | `VARCHAR(20)` | ​Intervals are stored as`VARCHAR`in Snowflake except when used in addition/subtraction. [Check out note]. |
| `INTERVAL YEAR [TO SECOND]` | `VARCHAR(20)` | ​Intervals are stored as`VARCHAR`in Snowflake except when used in addition/subtraction. [Check out note]. |
| `JSON` | `VARIANT` | Elements inside a JSON are ordered by their keys when inserted in a table. [Check out [note](data-types.md)]. |
| `MBR` | `---` | Not supported |
| `NUMBER` | `NUMBER(38, 18)` |  |
| `PERIOD(DATE)` | `VARCHAR(24)` | Periods are stored as`VARCHAR`in Snowflake. [Check out note]. |
| `PERIOD(TIME)` | `VARCHAR(34)` | Periods are stored as`VARCHAR`in Snowflake. [Check out note]. |
| `PERIOD(TIME WITH TIME ZONE)` | `VARCHAR(46)` | Periods are stored as`VARCHAR`in Snowflake. [Check out note]. |
| `PERIOD(TIMESTAMP)` | `VARCHAR(58)` | Periods are stored as`VARCHAR`in Snowflake. [Check out note]. |
| `PERIOD(TIMESTAMP WITH TIME ZONE)` | `VARCHAR(58)` | Periods are stored as`VARCHAR`in Snowflake. [Check out note]. |
| `REAL` | `REAL` |  |
| `SMALLINT` | `​SMALLINT`​ | `SMALLINT` in Snowflake is an alias for `NUMBER(38,0).` [Check out note] |
| `ST_GEOMETRY` | `GEOGRAPHY` |  |
| `TIME` | `TIME` |  |
| `TIME WITH TIME ZONE` | `TIME` | Warning [SSC-FDM-0005](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) is generated. |
| `TIMESTAMP` | `TIMESTAMP` |  |
| `TIMESTAMP WITH TIME ZONE` | `TIMESTAMP_TZ` |  |
| `VARBYTE` | `BINARY` |  |
| `VARCHAR` | `VARCHAR` |  |
| `XML` | `VARIANT` | ​ |

## Notes

> **Note:**
>
> See the documentation on Teradata [data types](https://docs.teradata.com/reader/~_sY_PYVxZzTnqKq45UXkQ/I_xWuywcishQ9U3Xal6zjA)

### Integer Data Types

For the conversion of integer data types (`INTEGER`, `SMALLINT`, and `BIGINT`), each one is converted to the alias in Snowflake with the same name. Each of those aliases converts to `NUMBER(38,0)`, a data type that is considerably larger than the integer datatype. Below is a comparison of the range of values that can be present in each data type:

* Teradata `INTEGER`: -2,147,483,648 to 2,147,483,647
* Teradata `SMALLINT`: -32768 to 32767
* Teradata `BIGINT`: -9,223,372,036,854,775,808 to 9,223,372,036,854,775,807
* Snowflake `NUMBER(38,0)`: -99999999999999999999999999999999999999 to +99999999999999999999999999999999999999

Warning [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) is generated.

### Interval/Period Data Types

Intervals and Periods are stored as a string (`VARCHAR`) in Snowflake. When converting, SnowConvert AI creates a UDF that recreates the same expression as a string. Warning [SSC-EWI-TD0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) is generated.

You can see more of the UDFs in the public repository of UDFs currently created by Snowflake SnowConvert.

These UDFs assume that periods are stored in a `VARCHAR` where the data/time parts are separated by an `*`. For example for a Teradata period like `PERIOD('2018-01-01','2018-01-20')` it should be stored in Snowflake as a `VARCHAR` like `'2018-01-01`\*`2018-01-20'`.

> **Note:**
>
> **Preview Feature:** When the `--UseIntervalDatatype` [preview flag](../../../general/getting-started/running-snowconvert/conversion/preview-conversion-settings.md) is enabled, Teradata INTERVAL columns are preserved as native Snowflake INTERVAL types (for example, `INTERVAL DAY TO SECOND`, `INTERVAL YEAR TO MONTH`) instead of being converted to VARCHAR. Interval literals are also normalized and preserved. See the [Interval Data Types](../../general/interval-data-types.md) translation reference for complete transformation details.

The only exception to the `VARCHAR` transformation for intervals are interval literals used to add/subtract values from a Datetime expression, Snowflake does not have an `INTERVAL` datatype but [interval constants](https://docs.snowflake.com/en/sql-reference/data-types-datetime.html#interval-constants) exist for the specific purpose mentioned. Examples:

Input code:

```sql
 SELECT TIMESTAMP '2018-05-13 10:30:45' + INTERVAL '10 05:30' DAY TO MINUTE;
```

Output code:

```sql
 SELECT
TIMESTAMP '2018-05-13 10:30:45' + INTERVAL '10 DAY, 05 HOUR, 30 MINUTE';
```

Cases where the interval is being multiplied/divided by a numerical expression are transformed to equivalent `DATEADD` function calls instead:

Input code:

```sql
 SELECT TIME '03:45:15' - INTERVAL '15:32:01' HOUR TO SECOND * 10;
```

Output code:

```sql
 SELECT
DATEADD('SECOND', 10 * -1, DATEADD('MINUTE', 10 * -32, DATEADD('HOUR', 10 * -15, TIME '03:45:15')));
```

### JSON Data Type

Elements inside a JSON are ordered by their keys when inserted in a table. Thus, the query results might differ. However, this does not affect the order of arrays inside the JSON.

For example, if the original JSON is:

```json
 {
   "firstName":"Peter",
   "lastName":"Andre",
   "age":31,
   "cities": ["Los Angeles", "Lima", "Buenos Aires"]
}
```

Using the Snowflake [PARSE_JSON()](https://docs.snowflake.com/en/sql-reference/functions/parse_json.html) that interprets an input string as a JSON document, producing a VARIANT value. The inserted JSON will be:

```json
 {
   "age": 31,
   "cities": ["Los Angeles", "Lima", "Buenos Aires"],
   "firstName": "Peter",
   "lastName": "Andre"
}
```

Note how “age” is now the first element. However, the array of “cities” maintains its original order.

## Known Issues

No issues were found.

## Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - Database DBC
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/database-dbc.md
section: Migrations
---

# SnowConvert AI - Teradata - Database DBC

Equivalents for DBC objects and columns

> **Note:**
>
> The DBC database contains critical system tables that define the user databases in the Analytics Database / Teradata Database. In the next segments, you can see the **supported** objects and columns of DBC database, the ones missing are **not supported** yet.

## DBC database

| Teradata | Snowflake | Notes |
| --- | --- | --- |
| DBC | INFORMATION_SCHEMA |  |

> See [DBC database](https://docs.teradata.com/r/Teradata-DSA-User-Guide/November-2022/Database-DBC-Info/Database-DBC)

## DBC tables

| Teradata | Snowflake | Notes |
| --- | --- | --- |
| COLUMNS | COLUMNS |  |
| COLUMNSV | COLUMNS |  |
| DATABASES | DATABASES |  |
| DBQLOGTBL | TABLE(INFORMATION_SCHEMA.QUERY_HISTORY()) |  |
| TABLES | TABLES |  |

## DBC columns

| Teradata | Snowflake | Notes |
| --- | --- | --- |
| ALLRIGHTS | APPLICABLE_ROLES |  |
| COLUMNNAME | COLUMN_NAME |  |
| COLUMNUDTNAME | UDT_NAME |  |
| COMMENT_STRING | COMMENT |  |
| CREATETIMESTAMP | CREATED |  |
| COLUMNTYPE | DATA_TYPE |  |
| COLUMNLENGTH | CHARACTER_MAXIMUM_LENGTH |  |
| CONSTRAINTNAME | CONSTRAINT_NAME |  |
| CONSTRAINTTEXT | CONSTRAINT_TYPE |  |
| DATABASENAME | TABLE_SCHEMA |  |
| FINALWDNAME | SESSION_ID |  |
| FIRSTSTEPTIME | DATEADD(MILLISECOND, TOTAL_ELAPSED_TIME - EXECUTION_TIME, START_TIME) |  |
| LASTALTERTIMESTAMP | LAST_ALTERED |  |
| NULLABLE | IS_NULLABLE |  |
| STARTTIME | START_TIME |  |
| TABLEKIND | TABLE_TYPE |  |
| TABLE_LEVELCONSTRAINTS | TABLE_CONSTRAINTS |  |
| TABLENAME | TABLE_NAME |  |
| USER_NAME | GRANTEE |  |

> For more information about DBC tables and columns see the [Teradata documentation.](https://docs.teradata.com/r/hNI_rA5LqqKLxP~Y8vJPQg/jwOyftGqfH5vIH1ZRVNW6A)

---
title: SnowConvert AI - Teradata - DDL
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/ddl-teradata.md
section: Migrations
---

# SnowConvert AI - Teradata - DDL

In this section, you will find the documentation for the translation reference of Data Definition Language Elements.

## Index

Translation reference to convert INDEX statement to Snowflake

> **Warning:**
>
> Currently, ***Create Index*** statement is not being converted but it is being parsed. Also, if your source code has Create `index` statements, these are going to be accounted for in the ***Assessment Report.***

**Example of Create Index**

### Teradata input

```sql
 CREATE INDEX (col1, col2, col3) ORDER BY VALUES (col2) ON table1;

CREATE INDEX my_index_name ON my_table (column1, column2);
```

> **Note:**
>
> Due to architectural reasons, Snowflake does not support indexes so, SnowConvert AI will remove all the code related to the creation of indexes. Snowflake automatically creates micro-partitions for every table that help speed up the performance of DML operations, the user does not have to worry about creating or managing these micro-partitions.
>
> Usually, this is enough to have a very good query performance however, there are ways to improve it by creating data clustering keys. [Snowflake’s official page](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions.html) provides more information about micro-partitions and data clustering.

## Join Index

### Description

In SnowConvert AI, Teradata Join Indexes are transformed into Snowflake Dynamic Tables. To properly configure Dynamic Tables, two essential parameters must be defined: TARGET_LAG and WAREHOUSE. If these parameters are left unspecified in the configuration options, SnowConvert AI will default to preassigned values during the conversion, as demonstrated in the example below.

For more information, see the [Teradata Join Indexes documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/Database-Design/Join-and-Hash-Indexes/Join-Indexes).

For details on the necessary parameters, see the [Snowflake CREATE DYNAMIC TABLE documentation](https://docs.snowflake.com/en/sql-reference/sql/create-dynamic-table).

### Sample Source Patterns

**Teradata**

**Join Index**

```sql
CREATE JOIN INDEX Employee
AS
SELECT
  Employee_Id,
  First_Name,
  Last_Name,
  BirthDate,
  DepartmentNo
FROM Employee
PRIMARY INDEX (First_Name);
```

**Snowflake**

**Dynamic Table**

```sql
CREATE OR REPLACE DYNAMIC TABLE Employee
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
AS
SELECT
  Employee_Id,
  First_Name,
  Last_Name,
  BirthDate,
  DepartmentNo
FROM
  Employee;
```

### Known Issues

No known errors detected at this time.

### Related EWIs

1. [SSC-FDM-0031](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

## Schema

### Description

The translation of the `CREATE SCHEMA` statement from Teradata to Snowflake is simple, as the basic syntax remains the same.

### Sample Source Patterns

**Teradata**

**Join Index**

```sql
CREATE SCHEMA IF EXISTS schema_name;
```

**Snowflake**

**Dynamic Table**

```sql
CREATE SCHEMA IF EXISTS schema_name
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/23/2024" }}'
;
```

### Known Issues

#### WITH Properties of CREATE SCHEMA

The `WITH` properties associated with the `CREATE SCHEMA` statement in Teradata are not supported in Snowflake, as there is no equivalent functionality available.

**Teradata**

**Join Index**

```sql
CREATE SCHEMA IF EXISTS schema_name
WITH ( PROPERTY1 = PROPERTYNAME, PROPERTY2 = PROPERTTYNAME, PROPERTY3 = PROPERTTYNAME);
```

**Snowflake**

**Dynamic Table**

```sql
CREATE SCHEMA IF EXISTS schema_name
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/23/2024" }}'
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'SCHEMA WITH' NODE ***/!!!
WITH ( PROPERTY1 = PROPERTYNAME, PROPERTY2 = PROPERTTYNAME, PROPERTY3 = PROPERTTYNAME);
```

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.

## Views

Translation reference to convert Teradata VIEW statement to Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s VIEW statement is translated to Snowflake VIEW syntax.

For more information, see the [Teradata VIEW documentation](https://docs.teradata.com/r/scPHvjfglIlB8F70YliLAw/EXhAa7frdTDJwg2OZukLgQ).

### Sample Source Patterns

#### Create View Transformation

**Teradata**

##### View

```sql
 CREATE VIEW view1 (someTable.col1, someTable.col2) AS locking row for access
    SELECT
    my_table.col1, my_table.col2
    FROM table1 AS my_table
    WHERE my_table.col1 = 'SpecificValue'
    UNION ALL
    SELECT other_table.col2
    FROM table2 AS other_table
    WHERE my_table.col2 = other_table.col2
```

**Snowflake**

##### View

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "table1", "table2" **
CREATE OR REPLACE VIEW view1
(
    col1,
    col2)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
    my_table.col1,
    my_table.col2
    FROM
    table1 AS my_table
    WHERE
    UPPER(RTRIM( my_table.col1)) = UPPER(RTRIM('SpecificValue'))
    UNION ALL
    SELECT
    other_table.col2
       FROM
    table2 AS other_table
       WHERE my_table.col2 = other_table.col2;
```

#### Custom Schema Tag

The custom schema is specified in the comment section before the specification of the view, with an XML tag named “sc-view” that contains only the value of the schema and the view name separated with a period ‘.’ as shown below: `<sc-view>SCHEMANAME.VIEWNAME</sc-view>`

The custom schema will be used as a view qualifier, and then the name of the view and all the objects referred to in the FROM queries and inner queries will be using that custom schema. Therefore could be several views with the same name, but with different custom tags. **Example**: two views with the same name, will take the custom schema tag information to perform the translation.

##### Teradata

##### View

```sql
 /*<sc-view>RMSviews.EMPLOYEEB</sc-view>*/
REPLACE VIEW EMPLOYEEB AS
SELECT * FROM EMPLOYEE
WHERE AREA = "AREAB";

/*<sc-view>Views.EMPLOYEEB</sc-view>*/
REPLACE VIEW EMPLOYEEB AS
SELECT * FROM EMPLOYEE
WHERE AREA = "AREAB";
```

##### Snowflake

The transformation for Snowflake will vary depending on the customized schema name `MySchema`, customized database name `MyDatabase` or not selecting a customized database or schema in the conversion settings.

##### Custom Schema

```sql
 /*<sc-view>RMSviews.EMPLOYEEB</sc-view>*/
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "EMPLOYEE" **
CREATE OR REPLACE VIEW RMSviews.EMPLOYEEB
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
* FROM
RMSviews.EMPLOYEE
WHERE AREA = "AREAB";

/*<sc-view>Views.EMPLOYEEB</sc-view>*/
--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "EMPLOYEE" **
--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR Views.EMPLOYEEB. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE OR REPLACE VIEW Views.EMPLOYEEB
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
SELECT
* FROM
Views.EMPLOYEE
 WHERE AREA = "AREAB";
```

##### Custom Database

```sql
 /*<sc-view>RMSviews.EMPLOYEEB</sc-view>*/
CREATE OR REPLACE VIEW MyDatabase.RMSviews.EMPLOYEEB
AS
   SELECT * FROM MyDatabase.RMSviews.EMPLOYEE
   WHERE AREA = "AREAB";

/*<sc-view>Views.EMPLOYEEB</sc-view>*/
CREATE OR REPLACE VIEW MyDatabase.Views.EMPLOYEEB
AS
   SELECT * FROM MyDatabase.Views.EMPLOYEE
   WHERE AREA = "AREAB";
```

##### Non selected

```sql
 /*<sc-view>RMSviews.EMPLOYEEB</sc-view>*/
CREATE OR REPLACE VIEW RMSviews.PUBLIC.EMPLOYEEB
AS
   SELECT * FROM RMSviews.PUBLIC.EMPLOYEE
   WHERE AREA = "AREAB";

/*<sc-view>Views.EMPLOYEEB</sc-view>*/
CREATE OR REPLACE VIEW Views.PUBLIC.EMPLOYEEB
AS
   SELECT * FROM Views.PUBLIC.EMPLOYEE
   WHERE AREA = "AREAB";
```

### Known Issues

#### 1. Locking row for access logic difference

In Snowflake, access to objects and elements is based on users and privileges.

### Related EWIs

1. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
2. [SSC-FDM-0019](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Semantic information could not be loaded.

## Tables

Translation reference to convert Teradata TABLE statement to Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s TABLE statement is translated to Snowflake TABLE syntax.

For more information, see the [Teradata CREATE TABLE documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Data-Definition-Language-Syntax-and-Examples/March-2019/Table-Statements/CREATE-TABLE).

### Sample Source Patterns

#### **Simple Create​ Table**

**Teradata**

##### Table

```sql
 CREATE TABLE table1, no fallback,
no before journal,
no after journal (
  c1 INTEGER NOT NULL,
	f1 INTEGER NOT NULL,
	p1 INTEGER NOT NULL,
  DATE,
  TIME,
	FOREIGN KEY(f1) REFERENCES WITH CHECK OPTION table2 (d1)
)
UNIQUE PRIMARY INDEX(c1)
PARTITION BY COLUMN(p1);
```

**Snowflake**

##### Table

```sql
CREATE OR REPLACE TABLE table1 (
	c1 INTEGER NOT NULL,
	f1 INTEGER NOT NULL,
	p1 INTEGER NOT NULL,
	DATE,
	TIME,
	FOREIGN KEY(f1) REFERENCES table2 (d1) ,
	UNIQUE (c1)
)
----** SSC-FDM-0038 - MICRO-PARTITIONING IS AUTOMATICALLY HANDLED ON ALL SNOWFLAKE TABLES **
--PARTITION BY COLUMN(p1)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "09/19/2025",  "domain": "no-domain-provided" }}'
;
```

#### Table Kind Clause - SET and MULTISET

Teradata’s kind clause determines whether duplicate rows are permitted (MULTISET) or not (SET).

##### Teradata

##### Table

```sql
 -- Set semantics
CREATE SET TABLE table1 (
    column1 INTEGER
);

--Multiset semantics
CREATE MULTISET TABLE table2(
    column1 INTEGER
);
```

##### Snowflake

##### Table

```sql
 -- Set semantics
--** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
CREATE OR REPLACE TABLE table1 (
    column1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--Multiset semantics
CREATE OR REPLACE TABLE table2 (
    column1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Volatile and Global Temporary Tables

Teradata’s Volatile and Global Temporary tables are used for the temporary storage of data. Their difference lies in that the table definition (DDL) of Global Temporary tables is persisted in the Data Dictionary, while Volatile tables definition is not stored.

##### Teradata

##### Table

```sql
 --Global Temporary Table
CREATE MULTISET GLOBAL TEMPORARY TABLE table1 (
    column1 INTEGER
);

--Volatile Table
CREATE MULTISET VOLATILE TABLE table3 (
    column1 INTEGER
);
```

##### Snowflake

##### Table

```sql
 --Global Temporary Table
--** SSC-FDM-0009 - GLOBAL TEMPORARY TABLE FUNCTIONALITY NOT SUPPORTED. **
CREATE OR REPLACE TABLE table1 (
    column1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

--Volatile Table
CREATE OR REPLACE TEMPORARY TABLE table3 (
    column1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### With data and with no data option

**Teradata**

##### Table

```sql
 -- With data
CREATE TABLE table1 AS table2 WITH DATA

-- With no data
CREATE TABLE table1 AS table2 WITH NO DATA
```

**Snowflake**

##### Table

```sql
 -- With data
CREATE OR REPLACE TABLE table1 CLONE table2
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

-- With no data
--** SSC-FDM-0019 - SEMANTIC INFORMATION COULD NOT BE LOADED FOR table1. CHECK IF THE NAME IS INVALID OR DUPLICATED. **
CREATE OR REPLACE TABLE table1 LIKE table2
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Snowflake’s Reserved & Limited Keywords

SnowConvert AI facilitates seamless SQL migrations to Snowflake by addressing challenges associated with reserved keywords. As per Snowflake’s [reserved and limited keyword documentation](https://docs.snowflake.com/en/sql-reference/reserved-keywords), certain keywords cannot be used as column names, table names, or aliases without special handling. SnowConvert AI includes functionality to ensure SQL code compatibility in such cases.

**Reserved ANSI Keywords as Column Names**

For column names that match **ANSI or Snowflake** **reserved keywords**, SnowConvert AI automatically wraps the column name in double quotes (`"`) to comply with Snowflake’s syntax rules. This adjustment ensures that queries with these column names compile correctly in Snowflake without requiring manual intervention.

**Example:**

##### Table

```sql
 CREATE TABLE ReservedKeywords (
  "CREATE" VARCHAR(50),
  FOLLOWING VARCHAR(50),
  "ILIKE" VARCHAR(50),
  RLIKE VARCHAR(50)
);
```

**Snowflake**

##### Table

```sql
 CREATE OR REPLACE TABLE ReservedKeywords (
    "CREATE" VARCHAR(50),
    "FOLLOWING" VARCHAR(50),
    "ILIKE" VARCHAR(50),
    "RLIKE" VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/28/2024",  "domain": "test" }}'
;
```

**Snowflake-Specific Reserved Keywords**

Columns that match **Snowflake-specific reserved keywords** (for example, `CONSTRAINT`, `CURRENT_DATE`, `CURRENT_TIME`) may still cause compilation issues even when wrapped in quotes. SnowConvert AI detects these instances and generates a warning with code [`SSC-EWI-0045`](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md), prompting users to review and potentially rename these columns for compatibility.

**Example:**

##### Table

```sql
 CREATE TABLE ColumnReservedNames (
  "CONSTRAINT" VARCHAR(50),
  "CURRENT_DATE" VARCHAR(50),
  "CURRENT_TIME" VARCHAR(50)
);
```

**Snowflake**

##### Table

```sql
 CREATE OR REPLACE TABLE ColumnReservedNames (
    !!!RESOLVE EWI!!! /*** SSC-EWI-0045 - COLUMN NAME 'CONSTRAINT' IS A SNOWFLAKE RESERVED KEYWORD ***/!!!
    "CONSTRAINT" VARCHAR(50),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0045 - COLUMN NAME 'CURRENT_DATE' IS A SNOWFLAKE RESERVED KEYWORD ***/!!!
    "CURRENT_DATE" VARCHAR(50),
    !!!RESOLVE EWI!!! /*** SSC-EWI-0045 - COLUMN NAME 'CURRENT_TIME' IS A SNOWFLAKE RESERVED KEYWORD ***/!!!
    "CURRENT_TIME" VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "transact",  "convertedOn": "11/28/2024",  "domain": "test" }}'
;
```

### Known Issues

#### 1. Create table options not supported

As shown in the example “Simple Create Table”, Snowflake does not support Teradata create table options. They are removed.

##### 2. Partition by performance issues

In the example “Simple Create Table”, the `partition by` statement is removed due to performance considerations.

##### 3. Primary Index moved

In Teradata, the primary index constraint is declared outside of the `create table` statement, but in Snowflake it is required to be inside, as shown in the example “Simple Create Table”.

##### 4. SET semantics not supported

As shown in the example “Table Kind Clause - SET and MULTISET”, Snowflake does not support Teradata’s SET semantics. They are removed.

##### 5. Global Temporary table option not supported

As shown in the example “Volatile and Global Temporary Table”, Snowflake does not support Teradata’s Global Temporary table option. It will be removed.

##### 6. Compress unsupported

`COMPRESS (value1. value2, value3)` is removed due to being unsupported.

##### 7. On commit unsupported

`On commit` is removed due to being unsupported.

##### 8. Block compression unsupported

`Block compression` is removed due to being unsupported.

##### 9. Normalize unsupported

`Normalize` is removed due to being unsupported.

##### 10. FORMAT clause on column definitions

Teradata supports a `FORMAT` clause on column definitions to control how values are displayed or parsed. Snowflake does not support this clause. SnowConvert AI handles it differently depending on the column type and format pattern:

| Column Type | Format Pattern | Issue | DDL Result | DML Result |
| --- | --- | --- | --- | --- |
| Datetime (`DATE`, `TIMESTAMP`, `TIME`) | Snowflake standard (e.g., `'YYYY-MM-DD'`, `'HH:MI:SS'`, `'YYYY-MM-DDBHH:MI:SS'`) | None | Silently removed | No conversion needed |
| Datetime (`DATE`, `TIMESTAMP`, `TIME`) | Translatable non-standard (e.g., `'MM-DD-YYYY'`) | [SSC-FDM-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) | Commented out | Conversion functions added |
| Datetime | Not translatable (e.g., `'EEEE'`) | [SSC-EWI-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | Kept with EWI marker | No conversion; manual fix needed |
| Character (`VARCHAR`, `CHAR`) | Display-only `X(n)` | [SSC-FDM-TD0041](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) | Commented out | No conversion needed |
| Any other | Any | [SSC-EWI-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | Kept with EWI marker | No conversion; manual fix needed |

> **Important:**
>
> The FORMAT value and column type are read from the `CREATE TABLE` statement and stored internally so that DML statements referencing those columns can be converted correctly. If the `CREATE TABLE` is not included in the conversion input, SnowConvert AI cannot determine the format, and no conversion functions will be added to DML statements. Always include the relevant DDL files in the conversion scope and verify that the converted code behaves correctly when FORMAT clauses are present.

### Related EWIs

1. [SSC-FDM-0009](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): GLOBAL TEMPORARY TABLE functionality not supported.
2. [SSC-FDM-0019](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Semantic information could not be loaded.
3. [SSC-FDM-TD0024](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Set table functionality not supported.
4. [SSC-PRF-0007](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): CLUSTER BY performance review.
5. [SSC-EWI-0045](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Column Name is Snowflake Reserved Keyword.
6. [SSC-FDM-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Column-level FORMAT clause is not supported in Snowflake. Conversion functions are used in DML statements as a workaround.
7. [SSC-FDM-TD0041](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Column-level display-only FORMAT clause is not supported in Snowflake. No action needed.
8. [SSC-EWI-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Column-level FORMAT clause contains unsupported format elements.

## WITH DEFAULT

Translation reference to convert Teradata WITH DEFAULT clause in column definitions to Snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s `WITH DEFAULT` clause sets a system-default value to columns that are inserted with no values. This value is typically the equivalent of zero or empty.

#### Syntax:

```sql
 WITH DEFAULT
```

The following table shows Teradata’s data types, their corresponding type in Snowflake, and the default value to be set if supported.

| Teradata | Snowflake | Default Value |
| --- | --- | --- |
| BLOB[(n)] | BYTE | NOT SUPPORTED |
| BYTE[(n)] | BYTE | NOT SUPPORTED |
| VARBYTE[(n)] | BYTE | NOT SUPPORTED |
| BIGINT | BIGINT | 0 |
| BYTEINT | BYTEINT | 0 |
| DECIMAL [(n[,m])] | DECIMAL | 0 |
| DOUBLE PRECISION | DOUBLE PRECISION | 0 |
| FLOAT | FLOAT | 0 |
| INTEGER | INTEGER | 0 |
| NUMBER(n[,m]) | NUMBER | 0 |
| NUMBER[(\*[,m])] | NUMBER | 0 |
| NUMERIC [(n[,m])] | NUMERIC | 0 |
| REAL | REAL | 0 |
| SMALLINT | SMALLINT | 0 |
| DATE | DATE | CURRENT_DATE |
| TIME [(n)] | TIME | CURRENT_TIME |
| TIMESTAMP [(n)] | TIMESTAMP | CURRENT_TIMESTAMP |
| TIMESTAMP WITH TIME ZONE | TIMESTAMP_TZ | LOCALTIMESTAMP |
| INTERVAL DAY [(n)] | VARCHAR(21) | '0DAY' |
| INTERVAL DAY [(n)] TO HOUR | VARCHAR(21) | '0DAY' |
| INTERVAL DAY [(n)] TO MINUTE | VARCHAR(21) | '0DAY' |
| INTERVAL DAY [(n)] TO SECOND | VARCHAR(21) | '0DAY' |
| INTERVAL HOUR [(n)] | VARCHAR(21) | '0HOUR' |
| INTERVAL HOUR [(n)] TO MINUTE | VARCHAR(21) | '0HOUR' |
| INTERVAL HOUR [(n)] TO SECOND | VARCHAR(21) | '0HOUR' |
| INTERVAL MINUTE [(n)] | VARCHAR(21) | '0MINUTE' |
| INTERVAL MINUTE [(n)] TO SECOND [(m)] | VARCHAR(21) | '0MINUTE' |
| INTERVAL MONTH | VARCHAR(21) | '0MONTH' |
| INTERVAL SECOND [(n,[m])] | VARCHAR(21) | '0SECOND' |
| INTERVAL YEAR [(n)] | VARCHAR(21) | '0YEAR' |
| INTERVAL YEAR [(n)] TO MONTH | VARCHAR(21) | '0YEAR' |
| CHAR[(n)] | CHAR | '' |
| CHARACTER(n) CHARACTER SET GRAPHIC | - | NOT SUPPORTED |
| CLOB | - | NOT SUPPORTED |
| CHAR VARYING(n) | VARCHAR | '' |
| LONG VARCHAR | - | NOT SUPPORTED |
| LONG VARCHAR CHARACTER SET GRAPHIC | - | NOT SUPPORTED |
| VARCHAR(n) | VARCHAR | '' |
| VARCHAR(n) CHARACTER SET GRAPHIC | - | NOT SUPPORTED |
| PERIOD(DATE) | VARCHAR(24) | NOT SUPPORTED |
| PERIOD(TIME [(n)]) | VARCHAR(24) | NOT SUPPORTED |
| PERIOD(TIMESTAMP [(n)]) | VARCHAR(24) | NOT SUPPORTED |

### Sample Source Patterns

#### Teradata

##### Query

```sql
 CREATE TABLE SAMPLE_TABLE
(
    ID INT,

    -- Numeric Types
    big_integer_col BIGINT WITH DEFAULT,
    byteint_col BYTEINT WITH DEFAULT,
    decimal_col DECIMAL(10,2) WITH DEFAULT,
    double_precision_col DOUBLE PRECISION WITH DEFAULT,
    float_col FLOAT WITH DEFAULT,
    integer_col INTEGER WITH DEFAULT,
    number_col NUMBER WITH DEFAULT,
    numeric_col NUMERIC(10,2) WITH DEFAULT,
    real_col REAL WITH DEFAULT,
    smallint_col SMALLINT WITH DEFAULT,

    -- Character Types
    char_col CHAR(50) WITH DEFAULT,
    character_col CHARACTER(50) WITH DEFAULT,
    --clob_col CLOB,
    char_varying_col CHAR VARYING(100) WITH DEFAULT,
    --long_varchar_col LONG VARCHAR WITH DEFAULT,
    --long_varchar_graphic_col LONG VARCHAR CHARACTER SET GRAPHIC WITH DEFAULT,
    varchar_col VARCHAR(255) WITH DEFAULT,
    --varchar_graphic_col VARCHAR(255) CHARACTER SET GRAPHIC WITH DEFAULT,

    -- Date and Time Types
    date_col DATE WITH DEFAULT,
    time_col TIME WITH DEFAULT,
    time_precision_col TIME(6) WITH DEFAULT,
    timestamp_col TIMESTAMP WITH DEFAULT,
    timestamp_precision_col TIMESTAMP(6) WITH DEFAULT,
    tz_timestamp_col TIMESTAMP WITH TIME ZONE WITH DEFAULT,
    tz_timestamp_precision_col TIMESTAMP(6) WITH TIME ZONE WITH DEFAULT,
    interval_col INTERVAL DAY(4) WITH DEFAULT,
    interval_day_to_hour_col INTERVAL DAY(4) TO HOUR WITH DEFAULT,
    interval_hour_col INTERVAL HOUR(2) WITH DEFAULT,
    interval_minute_col INTERVAL MINUTE(2) WITH DEFAULT,
    interval_month_col INTERVAL MONTH WITH DEFAULT,
    interval_second_col INTERVAL SECOND(2) WITH DEFAULT,
    interval_year_col INTERVAL YEAR(4) WITH DEFAULT,

    -- Binary Types
    -- blob_col BLOB(1000),
    byte_col BYTE(1000) WITH DEFAULT,
    varbyte_col VARBYTE(1000) WITH DEFAULT
);
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE TABLE SAMPLE_TABLE
(
    ID INT,
    -- Numeric Types
    big_integer_col BIGINT DEFAULT 0,
    byteint_col BYTEINT DEFAULT 0,
    decimal_col DECIMAL(10,2) DEFAULT 0,
    double_precision_col DOUBLE PRECISION DEFAULT 0,
    float_col FLOAT DEFAULT 0,
    integer_col INTEGER DEFAULT 0,
    number_col NUMBER(38, 18) DEFAULT 0,
    numeric_col NUMERIC(10,2) DEFAULT 0,
    real_col REAL DEFAULT 0,
    smallint_col SMALLINT DEFAULT 0,
    -- Character Types
    char_col CHAR(50) DEFAULT '',
    character_col CHARACTER(50) DEFAULT '',
    --clob_col CLOB,
    char_varying_col CHAR VARYING(100) DEFAULT '',
    --long_varchar_col LONG VARCHAR WITH DEFAULT,
    --long_varchar_graphic_col LONG VARCHAR CHARACTER SET GRAPHIC WITH DEFAULT,
    varchar_col VARCHAR(255) DEFAULT '',
    --varchar_graphic_col VARCHAR(255) CHARACTER SET GRAPHIC WITH DEFAULT,

    -- Date and Time Types
    date_col DATE DEFAULT CURRENT_DATE,
    time_col TIME DEFAULT CURRENT_TIME,
    time_precision_col TIME(6) DEFAULT CURRENT_TIME(6),
    timestamp_col TIMESTAMP
--                            !!!RESOLVE EWI!!! /*** SSC-EWI-0013 - EXCEPTION THROWN WHILE CONVERTING ITEM: Mobilize.T12Data.Sql.Ast.TdWithDefaultAttribute. LINE: 31 OF FILE: /Users/hbadillabonilla/Documents/Workspace/migrations-snowconvert/Tools/DocVerifier/out/temp/CUebOYutwG1Dca8jb0Fo/8921d487/SOURCE/Teradata_01.sql ***/!!!
--                            WITH DEFAULT
                                        ,
    timestamp_precision_col TIMESTAMP(6)
--                                         !!!RESOLVE EWI!!! /*** SSC-EWI-0013 - EXCEPTION THROWN WHILE CONVERTING ITEM: Mobilize.T12Data.Sql.Ast.TdWithDefaultAttribute. LINE: 32 OF FILE: /Users/hbadillabonilla/Documents/Workspace/migrations-snowconvert/Tools/DocVerifier/out/temp/CUebOYutwG1Dca8jb0Fo/8921d487/SOURCE/Teradata_01.sql ***/!!!
-- WITH DEFAULT
             ,
    tz_timestamp_col TIMESTAMP_TZ
--                                  WITH DEFAULT
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - WITH DEFAULT FOR 'TIMESTAMP WITH TIME ZONE' NOT SUPPORTED IN SNOWFLAKE ***/!!!
                                                                                                                        ,
    tz_timestamp_precision_col TIMESTAMP_TZ(6)
--                                               WITH DEFAULT
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - WITH DEFAULT FOR 'TIMESTAMP(6) WITH TIME ZONE' NOT SUPPORTED IN SNOWFLAKE ***/!!!
                                                                                                                           ,
    interval_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY(4) DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0DAY',
    interval_day_to_hour_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL DAY(4) TO HOUR DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0DAY',
    interval_hour_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL HOUR(2) DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0HOUR',
    interval_minute_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL MINUTE(2) DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0MINUTE',
    interval_month_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL MONTH DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0MONTH',
    interval_second_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL SECOND(2) DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0SECOND',
    interval_year_col VARCHAR(21) !!!RESOLVE EWI!!! /*** SSC-EWI-0036 - INTERVAL YEAR(4) DATA TYPE CONVERTED TO VARCHAR ***/!!! DEFAULT '0YEAR',
    -- Binary Types
    -- blob_col BLOB(1000),
    byte_col BINARY
--                    WITH DEFAULT
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - WITH DEFAULT FOR 'BYTE(1000)' NOT SUPPORTED IN SNOWFLAKE ***/!!!
                                                                                                          ,
    varbyte_col BINARY(1000)
--                             WITH DEFAULT
--    !!!RESOLVE EWI!!! /*** SSC-EWI-0021 - WITH DEFAULT FOR 'VARBYTE(1000)' NOT SUPPORTED IN SNOWFLAKE ***/!!!
)
```

### Known Issues

#### 1. Unsupported types

As shown in the table in the description table, some types are not supported and no default value will be set when transforming the `WITH DEFAULT` clause.

### Related EWIs

1. [SSC-EWI-0021](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Not Supported in Snowflake.
2. [SSC-EWI-0036](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Data type converted to another data type.

## CREATE MACRO

Translation reference to convert Teradata CREATE MACRO to Snowflake Scripting

### Description

The Teradata `CREATE MACRO` defines one or more statements that are commonly used or that perform a complex operation, thus avoiding writing the same sequence of statements multiple times. The macro is executed when it is called by the EXECUTE statement.

For more information, see the [Teradata CREATE MACRO documentation](https://docs.teradata.com/r/Teradata-Database-SQL-Data-Definition-Language-Syntax-and-Examples/June-2017/Macro-Statements/CREATE-MACRO-and-REPLACE-MACRO).

```sql
 CREATE MACRO <macroname> [(parameter1, parameter2,...)] (
   <sql_statements>
);

[ EXECUTE | EXEC ] <macroname>;
```

### Sample Source Patterns

#### Setup data

The following code is necessary to execute the sample patterns present in this section.

##### Teradata

```sql
 CREATE TABLE DEPOSIT
(
    ACCOUNTNO NUMBER,
    ACCOUNTNAME VARCHAR(100)
);

INSERT INTO DEPOSIT VALUES (1, 'Account 1');
INSERT INTO DEPOSIT VALUES (2, 'Account 2');
INSERT INTO DEPOSIT VALUES (3, 'Account 3');
INSERT INTO DEPOSIT VALUES (4, 'Account 4');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE DEPOSIT
(
    ACCOUNTNO NUMBER(38, 18),
    ACCOUNTNAME VARCHAR(100)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO DEPOSIT
VALUES (1, 'Account 1');

INSERT INTO DEPOSIT
VALUES (2, 'Account 2');

INSERT INTO DEPOSIT
VALUES (3, 'Account 3');

INSERT INTO DEPOSIT
VALUES (4, 'Account 4');
```

#### Basic Macro

Since there is no macro object in Snowflake, the conversion tool transforms Teradata macros into Snowflake Scripting stored procedures. Besides, to replicate the functionality of the returned result set, in Snowflake Scripting, the query that is supposed to return a data set from a macro is assigned to a `RESULTSET` variable which will then be returned.

##### Teradata

##### Query

```sql
 REPLACE MACRO DEPOSITID (ID INT)
AS
(
  SELECT * FROM DEPOSIT WHERE ACCOUNTNO=:ID;
);

EXECUTE DEPOSITID(2);
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE DEPOSITID (ID FLOAT)
RETURNS TABLE ()
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        LET res RESULTSET := (SELECT * FROM DEPOSIT WHERE ACCOUNTNO=:ID);
        RETURN TABLE(res);
    END;
$$;

CALL DEPOSITID(2);
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

#### Macro Calls Another Macro

SnowConvert AI supports the scenario where a macro calls another macro and, by transitivity, a result set is returned by getting the results from Snowflake’s `RESULT_SCAN(LAST_QUERY_ID())`.

##### Teradata

##### Query

```sql
 REPLACE MACRO MacroCallOtherMacro (ID INT)
AS
(
    EXECUTE DEPOSITID(:ID);
);

EXECUTE MacroCallOtherMacro(2);
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE MacroCallOtherMacro (ID FLOAT)
RETURNS TABLE (
)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "09/09/2024" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CALL DEPOSITID(:ID);
        LET res RESULTSET :=
        (
            SELECT
                *
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        );
        RETURN TABLE(res);
    END;
$$;

CALL MacroCallOtherMacro(2);
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

#### Macro with no result set

Not all macros are intended to return a result set. The mentioned scenario is also supported.

##### Teradata

##### Query

```sql
 REPLACE MACRO MacroWithoutSelect (ACCOUNTNO NUMBER, ACCOUNTNAME VARCHAR(100))
AS
(
  INSERT INTO DEPOSIT VALUES (:ACCOUNTNO, :ACCOUNTNAME);
);

EXECUTE MacroWithoutSelect(5, 'Account 5');
SELECT * FROM DEPOSIT;
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 1            | Account 1    |
+--------------+--------------+
| 2            | Account 2    |
+--------------+--------------+
| 3            | Account 3    |
+--------------+--------------+
| 4            | Account 4    |
+--------------+--------------+
| 5            | Account 5    |
+--------------+--------------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE MacroWithoutSelect (ACCOUNTNO FLOAT, ACCOUNTNAME VARCHAR(100))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        INSERT INTO DEPOSIT
        VALUES (:ACCOUNTNO, :ACCOUNTNAME);
    END;
$$;

CALL MacroWithoutSelect(5, 'Account 5');
SELECT * FROM DEPOSIT;
```

##### Result

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 1            | Account 1    |
+--------------+--------------+
| 2            | Account 2    |
+--------------+--------------+
| 3            | Account 3    |
+--------------+--------------+
| 4            | Account 4    |
+--------------+--------------+
| 5            | Account 5    |
+--------------+--------------+
```

#### Macro returns multiple result sets

In Teradata, macros can return more than one result set from a single macro.

Snowflake Scripting procedures only allow one result set to be returned per procedure. To replicate Teradata behavior, when there are two or more result sets to return, they are stored in temporary tables. The Snowflake Scripting procedure will return an array containing the name of the temporary tables.

##### Teradata

##### Query

```sql
 REPLACE MACRO DEPOSITID (ID INT)
AS
(
  SELECT * FROM DEPOSIT WHERE ACCOUNTNO=4;
  SELECT * FROM DEPOSIT WHERE ACCOUNTNO=:ID;
  EXECUTE DEPOSITID(:ID);
);

EXECUTE DEPOSITID(2);
```

##### Result Set 1

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 4            | Account 4    |
+--------------+--------------+
```

##### Result Set 2

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE DEPOSITID (ID FLOAT)
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "09/09/2024" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    return_arr ARRAY := array_construct();
    tbl_nm VARCHAR;
  BEGIN
    tbl_nm := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
    CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_nm) AS
      SELECT
        * FROM
        DEPOSIT
      WHERE ACCOUNTNO=4;
    return_arr := array_append(return_arr, :tbl_nm);
    tbl_nm := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
    CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_nm) AS
      SELECT
        * FROM
        DEPOSIT
      WHERE ACCOUNTNO=:ID;
    return_arr := array_append(return_arr, :tbl_nm);
    CALL DEPOSITID(:ID);
    tbl_nm := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
    CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_nm) AS
      SELECT
        *
      FROM
        TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    return_arr := array_append(return_arr, :tbl_nm);
    --** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
    RETURN return_arr;
  END;
$$;

CALL DEPOSITID(2);
```

##### Result Set 1

```none
+-----------------------------------------------------+
| DEPOSIDID                                           |
|-----------------------------------------------------|
| [                                                   |
|  "RESULTSET_93D50CBB_F22C_418A_A88C_4E1DE101B500",  |
|  "RESULTSET_6BDE39D7_0554_406E_B52F_D9E863A3F15C"   |
| ]                                                   |
+-----------------------------------------------------+
```

##### Visualize Result Sets

Executing the above procedure on Snowflake, an array with temporary table names in it will be returned:

> [ “RESULTSET_93D50CBB_F22C_418A_A88C_4E1DE101B500”, “RESULTSET_6BDE39D7_0554_406E_B52F_D9E863A3F15C”]

It is necessary to execute the following queries to display the result sets just like in Teradata.

##### Query

```sql
 SELECT * FROM table('RESULTSET_93D50CBB_F22C_418A_A88C_4E1DE101B500');
SELECT * FROM table('RESULTSET_6BDE39D7_0554_406E_B52F_D9E863A3F15C');
```

##### Result Set 1

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 4            | Account 4    |
+--------------+--------------+
```

##### Result Set 2

```none
+--------------+--------------+
| ACCOUNTNO    | ACCOUNTNAME  |
|--------------+--------------|
| 2            | Account 2    |
+--------------+--------------+
```

### Known Issues

No issues were found.

### Related EWIs

## CREATE PROCEDURE

Translation reference to convert Teradata CREATE PROCEDURE to Snowflake Scripting

Description

The Teradata `CREATE PROCEDURE` and `REPLACE PROCEDURE` statement generates or replaces a stored procedure implementation and compiles it.

For more information, see the [Teradata CREATE PROCEDURE and REPLACE PROCEDURE documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Definition-Language-Syntax-and-Examples/Procedure-Statements/CREATE-PROCEDURE-and-REPLACE-PROCEDURE-SQL-Form).

```sql
 -- Create/replace procedure syntax
{CREATE | REPLACE} PROCEDURE [database_name. | user_name.] procedure_name
    ([<parameter_definition>[, ...n]])
[<SQL_data_access>]
[DYNAMIC RESULT SETS number_of_sets]
[SQL SECURITY <privilege_option>]
statement;

<parameter_definition> := [IN | OUT | INOUT] parameter_name data_type

<SQL_data_access> := {CONTAINS SQL | MODIFIES SQL DATA | READS SQL DATA}

<privilege_option> := {CREATOR | DEFINER | INVOKER | OWNER}
```

### Sample Source Patterns

#### Setup data

The following code is necessary to execute the sample patterns present in this section.

##### Teradata

```sql
 CREATE TABLE inventory (
    product_name VARCHAR(50),
    price INTEGER
);

INSERT INTO inventory VALUES ('Bread', 50);
INSERT INTO inventory VALUES ('Tuna', 150);
INSERT INTO inventory VALUES ('Gum', 20);
INSERT INTO inventory VALUES ('Milk', 80);
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE inventory (
    product_name VARCHAR(50),
    price INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO inventory
VALUES ('Bread', 50);

INSERT INTO inventory
VALUES ('Tuna', 150);

INSERT INTO inventory
VALUES ('Gum', 20);

INSERT INTO inventory
VALUES ('Milk', 80);
```

#### Basic Procedure

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE BasicProcedure(IN counterValue INTEGER)
BEGIN
    DECLARE productName VARCHAR(50);
    DECLARE productPrice INTEGER DEFAULT 0;
    DECLARE whileCounter INTEGER DEFAULT 0;
    SET productName = 'Salt';
    WHILE (whileCounter < counterValue) DO
        SET productPrice = 10 + productPrice;
        SET whileCounter = whileCounter + 1;
    END WHILE;
    INSERT INTO inventory VALUES (productName, productPrice);
END;

CALL BasicProcedure(5);
SELECT product_name, price FROM inventory WHERE product_name = 'Salt';
```

##### Result

```none
+--------------+--------------+
| product_name |    price     |
|--------------+--------------|
| Salt         | 50           |
+--------------+--------------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE BasicProcedure (COUNTERVALUE INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/10/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        productName VARCHAR(50);
        productPrice INTEGER DEFAULT 0;
        whileCounter INTEGER DEFAULT 0;
    BEGIN

        productName := 'Salt';
            WHILE (:whileCounter < :counterValue) LOOP
            productPrice := 10 + productPrice;
            whileCounter := whileCounter + 1;
        END LOOP;
        INSERT INTO inventory
        VALUES (:productName, :productPrice);
    END;
$$;

CALL BasicProcedure(5);

SELECT
    product_name,
    price FROM
    inventory
WHERE
    UPPER(RTRIM( product_name)) = UPPER(RTRIM('Salt'));
```

##### Result

```none
+--------------+--------------+
| product_name |    price     |
|--------------+--------------|
| Salt         | 50           |
+--------------+--------------+
```

#### Single out parameter

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE procedureLabelSingle(OUT Message VARCHAR(100))
BEGIN
    set Message = 'Assignment value. Thanks';
END;

CALL procedureLabelSingle(?);
```

##### Result

```none
Message                 |
------------------------+
Assignment value. Thanks|
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE procedureLabelSingle (MESSAGE OUT VARCHAR(100))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/23/2024" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        Message := 'Assignment value. Thanks';
    END;
$$;

CALL procedureLabelSingle(?);
```

##### Result

```none
+───────────────────────────────+
| PROCEDURELABELSINGLE          |
+───────────────────────────────+
| ""Assignment value. Thanks""  |
+───────────────────────────────+
```

#### Multiple out parameter

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE procedureLabelMultiple(OUT Message VARCHAR(100), OUT Message2 VARCHAR(100))
BEGIN
    set Message = 'Assignment value. Thanks';
    set Message2 = 'Assignment value2. Thanks';
END;

CALL procedureLabelSingle(?, ?);
```

##### Result

```none
1                       |2                        |
------------------------+-------------------------+
Assignment value. Thanks|Assignment value2. Thanks|
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE procedureLabelMultiple (MESSAGE OUT VARCHAR(100), MESSAGE2 OUT VARCHAR(100))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        Message := 'Assignment value. Thanks';
        Message2 := 'Assignment value2. Thanks';
    END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "procedureLabelSingle" **
CALL procedureLabelSingle(?, ?);
```

##### Result

```none
+─────────────────────────+────────────────────────────────+
| PROCEDURELABELMULTIPLE  |                                |
+─────────────────────────+────────────────────────────────+
| "{                      |                                |
| ""Message""             | ""Assignment value. Thanks"",  |
| ""Message2""            | ""Assignment value2. Thanks""  |
| }"                      |                                |
+─────────────────────────+────────────────────────────────+
```

#### Multiple out parameter with dynamic result sets

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE Procedure1(out product_name VARCHAR(50), out price integer)
DYNAMIC RESULT SETS 2
BEGIN
	DECLARE result_set CURSOR WITH RETURN ONLY FOR
	SELECT * FROM inventory;
    DECLARE result_set2 CURSOR WITH RETURN ONLY FOR
	SELECT * FROM inventory;
    SET price = 100;
    SET product_name = 'another2';
	OPEN result_set2;
	OPEN result_set;
END;

REPLACE PROCEDURE Procedure2()
BEGIN
 DECLARE price INTEGER;
 DECLARE productName varchar(10);
 CALL Procedure1(productName, price);
 INSERT INTO inventory VALUES(:productName, :price);
END;

CALL Procedure2();
```

##### Result

##### Snowflake Scripting

##### Query

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "inventory" **
CREATE OR REPLACE PROCEDURE Procedure1 (PRODUCT_NAME OUT VARCHAR(50), PRICE OUT integer)
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		tbl_result_set VARCHAR;
		tbl_result_set2 VARCHAR;
		return_arr ARRAY := array_construct();
	BEGIN
		tbl_result_set := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_result_set) AS
			SELECT
				* FROM
				inventory;
		LET result_set CURSOR
		FOR
			SELECT
				*
			FROM
				IDENTIFIER(?);
		tbl_result_set2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_result_set2) AS
			SELECT
				* FROM
				inventory;
		LET result_set2 CURSOR
		FOR
			SELECT
				*
			FROM
				IDENTIFIER(?);
				price := 100;
				product_name := 'another2';
				OPEN result_set2 USING (tbl_result_set2);
				return_arr := array_append(return_arr, :tbl_result_set2);
				OPEN result_set USING (tbl_result_set);
				return_arr := array_append(return_arr, :tbl_result_set);
				--** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
				RETURN return_arr;
	END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "inventory" **
CREATE OR REPLACE PROCEDURE Procedure2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
				price INTEGER;
				productName varchar(10);
	BEGIN

				CALL Procedure1(:productName, :price);
				INSERT INTO inventory
				VALUES (:productName, :price);
	END;
$$;

CALL Procedure2();
```

### Known Issues

**1. SQL Data Access**

By default, Snowflake procedures support the execution of any kind of SQL statements, including data reading or modification statements, making the SQL data access clause non-relevant. This clause will be ignored when converting the procedure.

**2. Top Level Objects in Assessment Report**

Elements (Temporal tables or Views) inside Stored Procedures are being counted in the Assessment report as Top Level Objects. The SnowConvert AI team is now working on a fix for this scenario.

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-FDM-0020](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.

---
title: SnowConvert AI - Teradata - DML
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/dml-teradata.md
section: Migrations
---

# SnowConvert AI - Teradata - DML

In this section, you will find the documentation for the translation reference of Data Manipulation Language Elements.

## Delete Statement

> See [Delete statement](https://docs.teradata.com/r/huc7AEHyHSROUkrYABqNIg/z8eO9bdxtjFRveHdDwwYPQ)

Teradata support calling more than one table in the`FROM`clause, Snowflake does not. Therefore, it is necessary to use the`USING`clause to refer to the extra tables involved in the condition.

**Teradata**

**Delete**

```sql
DEL FROM MY_TABLE ALL;
DEL FROM MY_TABLE_2 WHERE COL1 > 50;
DELETE T1 FROM TABLE1 T1, TABLE2 T2 WHERE T1.ID = T2.ID;
DELETE FROM TABLE1 T1, TABLE2 T2 WHERE T1.ID = T2.ID;
DELETE T1 FROM TABLE2 T2, TABLE1 T1 WHERE T1.ID = T2.ID;
DELETE FROM TABLE1 WHERE TABLE1.COLUMN1 = TABLE2.COLUMN2
```

**Snowflake**

**Delete**

```sql
DELETE FROM
MY_TABLE;

DELETE FROM
MY_TABLE_2
WHERE
COL1 > 50;

DELETE FROM
TABLE1 T1
USING TABLE2 T2
WHERE
T1.ID = T2.ID;

DELETE FROM
TABLE1 T1
USING TABLE2 T2
WHERE
T1.ID = T2.ID;

DELETE FROM
TABLE1 T1
USING TABLE2 T2
WHERE
T1.ID = T2.ID;

DELETE FROM
TABLE1
WHERE
TABLE1.COLUMN1 = TABLE2.COLUMN2;
```

### Known Issues

#### 1. DEL abbreviation unsupported

The abbreviation is unsupported in Snowflake but it is translated correctly by changing it to DELETE.

### Related EWIs

No related EWIs.

## Set Operators

The SQL set operators manipulate the result sets of several queries combining the results of each query into a single result set.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> See [Set operators](https://docs.teradata.com/r/b8dd8xEYJnxfsq4uFRrHQQ/Q8qU3AO1RXLNFCPOGTX73g)

Set Operators in both Teradata and Snowflake have the same syntax and supported scenarios `EXCEPT`, `INTERSECT`, and `UNION` except for the clause `ALL` in the `INTERSECT ALL`, which is not supported in Snowflake, resulting in the portion of the `ALL` as a commented code after the conversion.

**Teradata**

### Intersect

```sql
 SELECT LastName, FirstName FROM employees
INTERSECT
SELECT FirstName, LastName FROM contractors;

SELECT LastName, FirstName FROM employees
INTERSECT ALL
SELECT FirstName, LastName FROM contractors;
```

**Snowflake**

#### Intersect

```sql
 SELECT
LastName,
FirstName FROM
employees
INTERSECT
SELECT
FirstName,
LastName FROM
contractors;

SELECT
LastName,
FirstName FROM
employees
INTERSECT
        !!!RESOLVE EWI!!! /*** SSC-EWI-0040 - THE 'INTERSECT ALL QUANTIFIER' CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!! ALL
SELECT
FirstName,
LastName FROM
contractors;
```

### Known Issues

#### 1. INTERSECT ALL unsupported

The INTERSECT ALL is unsupported in Snowflake and then the part ALL will be commented.

### Related EWIs

1. [SSC-EWI-0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Statement Not Supported.

## Update Statement

### Description

> Modifies column values in existing rows of a table. ([Teradata SQL Language Reference UPDATE](https://docs.teradata.com/r/huc7AEHyHSROUkrYABqNIg/k6fC7ozmhIZZXa315VjJAw))

### Sample Source Patterns

#### Basic case

**Teradata**

**Update**

```sql
 UPDATE CRASHDUMPS.TABLE1 i
 SET COLUMN4 = CRASHDUMPS.TABLE2.COLUMN3
 WHERE i.COLUMN1 = CRASHDUMPS.TABLE2.COLUMN1
 AND i.COLUMN3 = 'L';
```

**Snowflake**

**Update**

```sql
UPDATE CRASHDUMPS.TABLE1 AS i
 SET
  i.COLUMN4 = CRASHDUMPS.TABLE2.COLUMN3
 FROM
  CRASHDUMPS.TABLE2
  WHERE i.COLUMN1 = CRASHDUMPS.TABLE2.COLUMN1
  AND UPPER(RTRIM( i.COLUMN3)) = UPPER(RTRIM('L'));
```

#### UPDATE with forward alias

Teradata supports referencing an alias before it is declared, but Snowflake does not. The transformation for this scenario is to take the referenced table and change the alias for the table name it references.

**Teradata**

**Update**

```sql
 UPDATE i
 FROM CRASHDUMPS.TABLE2, CRASHDUMPS.TABLE1 i
 SET COLUMN4 = CRASHDUMPS.TABLE2.COLUMN3
 WHERE i.COLUMN1 = CRASHDUMPS.TABLE2.COLUMN1
 AND i.COLUMN3 = 'L';
```

**Snowflake**

**Update**

```sql
UPDATE CRASHDUMPS.TABLE1 AS i
  SET
  i.COLUMN4 = CRASHDUMPS.TABLE2.COLUMN3
  FROM
  CRASHDUMPS.TABLE2
  WHERE i.COLUMN1 = CRASHDUMPS.TABLE2.COLUMN1
  AND UPPER(RTRIM( i.COLUMN3)) = UPPER(RTRIM('L'));
```

#### UPDATE with target table in the FROM clause

Teradata supports having the target table defined in the FROM clause, this is removed in Snowflake to avoid duplicate alias and ambiguous column reference errors.

**Teradata**

**Update**

```sql
UPDATE some_table
FROM some_table
SET Code = Code + 100
WHERE Name = 'A';
```

**Snowflake**

**Update**

```sql
UPDATE some_table
  SET Code = Code + 100
  WHERE
  UPPER(RTRIM( Name)) = UPPER(RTRIM('A'));
```

### Related EWIs

No related EWIs.

## With Modifier

Select statement that uses the WITH modifier with a list of several named queries also known as common table expressions (CTEs).

> See [With Modifier](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Data-Manipulation-Language/July-2021/SELECT-Statements/WITH-Modifier)

Snowflake supports Teradata’s `WITH` modifier on a SELECT statement that has several `CTEs` (Common Table Expressions). Teradata supports any order of CTE definition, regardless of whether it is referenced before it is declared or not, but Snowflake requires that if a CTE calls another CTE, it must be defined before it is called. Then the converted sequence of CTEs within the WITH will be reordered into the unreferenced CTEs, then the CTE that calls the next CTE, and so on.

Where there is a cycle detected in the WITH calling sequence, it will be left as the original, without any changes to the sequence as detailed in an example of the [SSC-EWI-TD0077](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md).

In the example below, there are two CTEs named n1 and n2, the n1 referring to n2. Then the n2 must be defined first in Snowflake as the corresponding converted code.

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

**Teradata**

### With Modifier

```sql
 WITH recursive n1(c1) as (select c1, c3 from t2, n1),
     n2(c2) as (select c2 from tablex)
     SELECT * FROM t1;
```

**Snowflake**

#### With Modifier

```sql
 WITH RECURSIVE n1(c1) AS
(
     SELECT
          c1,
          c3 from
          t2, n1
),
n2(c2) AS
(
     SELECT
          c2 from
          tablex
)
SELECT
     * FROM
     t1;
```

### Known Issues

#### 1. Impossible to reorder when cycles were found

When the CTEs references are analyzed and there is a cycle between the calls of the CTEs, the CTEs will not be ordered.

### Related EWIs

No related EWIs.

## Insert Statement

SQL statement that adds new rows to a table.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> See [Insert statement](https://docs.teradata.com/r/0I5vemahub4iSU2bk5WA1A/SQ4EQb1a8WMHn3tbrcvW9Q)

In Teradata, there is an alternate`INSERT`syntax that assigns the value for each table column inline. This alternate structure requires a special transformation to be supported in Snowflake. The inline assignment of the values is separated and placed inside the `VALUES(...)` part of the Snowflake `INSERT INTO` statement.

**Teradata**

### Insert

```sql
 INSERT INTO appDB.logTable (
    process_name = 'S2F_BOOKS_LOAD_NEW'
    , session_id = 105678989
    , message_txt = ''
    , message_ts = '2019-07-23 00:00:00'
    , Insert_dt = CAST((CURRENT_TIMESTAMP(0)) AS DATE FORMAT 'YYYY-MM-DD'));
```

**Snowflake**

#### Insert

```sql
 INSERT INTO appDB.logTable (
process_name, session_id, message_txt, message_ts, Insert_dt)
VALUES ('S2F_BOOKS_LOAD_NEW', 105678989, '', '2019-07-23 00:00:00', TO_DATE((CURRENT_TIMESTAMP(0))));
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## LOGGING ERRORS

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement is** **removed from the migration** **because it is a non-relevant syntax. It means that it is not required in Snowflake.**

### Description

Statement to log errors when using statements as `INSERT...SELECT.` Please review the following [documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Manipulation-Language/Statement-Syntax/INSERT/INSERT-...-SELECT/INSERT/INSERT-...-SELECT-Examples/Example-Logging-Errors-with-INSERT-...-SELECT).

### Sample Source Patterns

#### LOGGING ERRORS

In this example, notice that `LOGGING ERRORS` has been removed because it is not a relevant syntax. The syntax is not required in Snowflake.

##### Teradata

```sql
 INSERT INTO MY_TABLE
SELECT *
FROM MY_SAMPLE
LOGGING ERRORS;
```

##### Snowflake

```
INSERT INTO MY_TABLE SELECT
*
FROM
MY_SAMPLE;
```

#### LOGGING ALL ERRORS

In this example, notice that `LOGGING ALL ERRORS` has been removed because it is not a relevant syntax. The syntax is not required in Snowflake.

##### Teradata

```sql
 INSERT INTO MY_TABLE
SELECT *
FROM MY_SAMPLE
LOGGING ALL ERRORS;
```

##### Snowflake

```sql
 INSERT INTO MY_TABLE SELECT
*
FROM
MY_SAMPLE;
```

#### LOGGING ERRORS WITH NO LIMIT

In this example, notice that `LOGGING ERRORS WITH NO LIMIT` has been removed because it is not a relevant syntax. The syntax is not required in Snowflake.

##### Teradata

```sql
 INSERT INTO MY_TABLE
SELECT *
FROM MY_SAMPLE
LOGGING ERRORS WITH NO LIMIT;
```

##### Snowflake

```sql
 INSERT INTO MY_TABLE SELECT
*
FROM
MY_SAMPLE;
```

#### LOGGING ERRORS WITH LIMIT OF

In this example, notice that `LOGGING ERRORS WITH LIMIT OF` has been removed because it is not a relevant syntax. The syntax is not required in Snowflake.

##### Teradata

```sql
 INSERT INTO MY_TABLE
SELECT *
FROM MY_SAMPLE
LOGGING ERRORS WITH LIMIT OF 100;
```

##### Snowflake

```sql
 INSERT INTO MY_TABLE SELECT
*
FROM
MY_SAMPLE;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Select Statement

> See [Select statement](https://docs.teradata.com/reader/b8dd8xEYJnxfsq4uFRrHQQ/kH97CTRIXdd~i1yLemdvKw)

Snowflake supports Teradata’s `SELECT` syntax with a few exceptions. Primarily, it does not support the `SEL` abbreviation.​

**Teradata**

**Sel**

```sql
SEL DISTINCT col1, col2 FROM table1
```

**Snowflake**

**Select**

```sql
SELECT DISTINCT col1,
col2 FROM
table1;
```

Teradata supports referencing an alias before it is declared, but Snowflake does not. The transformation for this scenario is to take the referenced column and change the alias for the column name it references.

**Teradata**

**Alias**

```sql
SELECT
my_val, sum(col1),
col2 AS my_val FROM table1
```

**Snowflake**

**Alias**

```sql
SELECT
my_val,
SUM(col1),
col2 AS my_val FROM
table1;
```

### Removed clause options

The following clause options are not relevant to Snowflake, therefore they are removed during the migration.

| Teradata | Snowflake |
| --- | --- |
| Expand on | Unsupported |
| Normalize | Unsupported |
| With check option (Query) | Unsupported |

### Known Issues

#### 1. SEL abbreviation unsupported

The abbreviation is unsupported in Snowflake but it is translated correctly by changing it to SELECT.

### Related EWIs

No related EWIs.

## ANY Predicate

> **Warning:**
>
> This is a work in progress, changes may be applied in the future.

### Description

In Teradata enables quantification in a comparison operation or IN/NOT IN predicate. The comparison of expression and at least one value in the set of values returned by subquery is true. Please review the following [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Functions-Expressions-and-Predicates/Logical-Predicates/ANY/ALL/SOME) for more information.

**Teradata syntax**

```sql
 { expression quantifier ( literal [ {, | OR} ... ] ) |
  { expression | ( expression [,...] ) } quantifier ( subquery )
}
```

Where quantifier:

```sql
 { comparison_operator [ NOT ] IN } { ALL |ANY | SOME }
```

**Snowflake syntax**

> **SuccessPlaceholder:**
>
> In subquery form, IN is equivalent to `= ANY` and NOT IN is equivalent to `<> ALL`. Review the following [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/functions/in) for more information.

To compare individual values:

```sql
 <value> [ NOT ] IN ( <value_1> [ , <value_2> ...  ] )
```

To compare *row constructors* (parenthesized lists of values):

```sql
 ( <value_A> [, <value_B> ... ] ) [ NOT ] IN (  ( <value_1> [ , <value_2> ... ] )  [ , ( <value_3> [ , <value_4> ... ] )  ...  ]  )
```

To compare a value to the values returned by a subquery:

```sql
 <value> [ NOT ] IN ( <subquery> )
```

### Sample Source Patterns

#### Sample data

##### Teradata

##### Query

```sql
 CREATE TABLE Employee (
    EmpNo INT,
    Name VARCHAR(100),
    DeptNo INT
);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (1, 'Alice', 100);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (2, 'Bob', 300);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (3, 'Charlie', 500);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (4, 'David', 200);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (5, 'Eve', 100);
```

##### Snowflake

##### Query

```sql
 CREATE OR REPLACE TABLE Employee (
    EmpNo INT,
    Name VARCHAR(100),
    DeptNo INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "01/14/2025",  "domain": "test" }}'
;

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (1, 'Alice', 100);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (2, 'Bob', 300);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (3, 'Charlie', 500);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (4, 'David', 200);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (5, 'Eve', 100);
```

#### Equal ANY predicate in WHERE clause

**Teradata**

##### Input

```sql
 SELECT DeptNo
FROM Employee
WHERE DeptNo = ANY(100,300,500) ;
```

##### Output

| DeptNo |
| --- |
| 100 |
| 500 |
| 100 |
| 300 |

**Snowflake**

##### Input

```sql
 SELECT DeptNo
FROM Employee
WHERE DeptNo IN(100,300,500) ;
```

##### Output

| DeptNo |
| --- |
| 100 |
| 500 |
| 100 |
| 300 |

#### Other comparison operators in WHERE clause

When there are other comparison operators, there equivalent translation is to add a subquery with the required logic.

**Teradata**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo < ANY(100,300,500) ;
```

##### Output

| Name | DeptNo |
| --- | --- |
| Eve | 100 |
| Alice | 100 |
| David | 200 |
| Bob | 300 |

**Snowflake**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo < ANY
(SELECT DeptNo
FROM Employee
WHERE DeptNo > 100
OR DeptNo > 300
OR DeptNo > 500);
```

##### Output

| NAME | DEPTNO |
| --- | --- |
| Alice | 100 |
| Eve | 100 |
| Bob | 300 |
| David | 200 |

#### IN ANY in WHERE clause

**Teradata**

##### Input

```sql
 SELECT DeptNo
FROM Employee
WHERE DeptNo IN ANY(100,300,500) ;
```

##### Output

| DeptNo |
| --- |
| 100 |
| 500 |
| 100 |
| 300 |

**Snowflake**

##### Input

```sql
 SELECT DeptNo
FROM Employee
WHERE DeptNo IN(100,300,500) ;
```

##### Output

| DeptNo |
| --- |
| 100 |
| 500 |
| 100 |
| 300 |

#### NOT IN ALL in WHERE clause

**Teradata**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo NOT IN ALL(100, 200);
```

##### Output

| Name | DeptNo |
| --- | --- |
| Charlie | 500 |
| Bob | 300 |

**Snowflake**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo NOT IN (100, 200);
```

##### Output

| Name | DeptNo |
| --- | --- |
| Charlie | 500 |
| Bob | 300 |

### Known Issues

#### NOT IN ANY in WHERE clause

**Teradata**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo NOT IN ANY(100, 200);
```

##### Output

| Name | DeptNo |
| --- | --- |
| Eve | 100 |
| Charlie | 500 |
| Alice | 100 |
| David | 200 |
| Bob | 300 |

**Snowflake**

##### Input

```sql
 SELECT Name, DeptNo
FROM Employee
WHERE DeptNo IN (100, 200)
   OR DeptNo NOT IN (100, 200);
```

##### Output

| Name | DeptNo |
| --- | --- |
| Eve | 100 |
| Charlie | 500 |
| Alice | 100 |
| David | 200 |
| Bob | 300 |

### Related EWIs

No related EWIs.

## Expand On Clause

Translation reference to convert Teradata Expand On functionality to Snowflake

### Description

> The Expand On clause expands a column having a **period** data type, creating a regular time series of rows based on the period value in the input row. For more information about Expand On clause, see the [Teradata documentation](https://docs.teradata.com/r/huc7AEHyHSROUkrYABqNIg/542VMPPqGwHBhF98pnTz9w).

### Sample Source Patterns

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

#### Sample data

##### Teradata

```sql
 CREATE TABLE table1 (id INTEGER, pd PERIOD (TIMESTAMP));

INSERT INTO
    table1
VALUES
    (
        1,
        PERIOD(
            TIMESTAMP '2022-05-23 10:15:20.00009',
            TIMESTAMP '2022-05-23 10:15:25.000012'
        )
    );
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE table1 (
    id INTEGER,
    pd VARCHAR(58) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO table1
VALUES (
1, PUBLIC.PERIOD_UDF(
            TIMESTAMP '2022-05-23 10:15:20.00009',
            TIMESTAMP '2022-05-23 10:15:25.000012'
        ) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);
```

#### Expand On Clause

Suppose you want to expand the period column by seconds, for this Expand On clause has anchor period expansion and interval literal expansion.

##### Anchor Period Expansion

##### Teradata

```sql
 SELECT
    id,
    BEGIN(bg)
FROM
    table1 EXPAND ON pd AS bg BY ANCHOR ANCHOR_SECOND;
```

##### Result

| id | BEGIN (bg) |
| --- | --- |
| 1 | 2022-05-23 10:15:21.0000 |
| 1 | 2022-05-23 10:15:22.0000 |
| 1 | 2022-05-23 10:15:23.0000 |
| 1 | 2022-05-23 10:15:24.0000 |
| 1 | 2022-05-23 10:15:25.0000 |

Snowflake doesn’t support Expand On clause. To reproduce the same results and functionality, the Teradata SQL code will be contained in a CTE block, with an **EXPAND_ON_UDF** and **TABLE** function, using **FLATTEN** function to return multiple rows, **ROW_COUNT_UDF** and **DIFF_TTIME_PERIOD_UDF** to indicate how many rows are needed and returning **VALUE** to help the EXPAND_ON_UDF to calculate the different regular time series. This CTE block returns the same expand columns alias as in the Expand On clause, so the result can be used in any usage of period datatype.

##### Snowflake

```sql
 WITH ExpandOnCTE AS
(
    SELECT
        PUBLIC.EXPAND_ON_UDF('ANCHOR_SECOND', VALUE, pd) bg
    FROM
        table1,
        TABLE(FLATTEN(PUBLIC.ROW_COUNT_UDF(PUBLIC.DIFF_TIME_PERIOD_UDF('ANCHOR_SECOND', pd))))
)
SELECT
    id,
    PUBLIC.PERIOD_BEGIN_UDF(bg) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
FROM
    table1,
    ExpandOnCTE;
```

##### Result

| id | PERIOD_BEGIN_UDF(bg) |
| --- | --- |
| 1 | 2022-05-23 10:15:21.0000 |
| 1 | 2022-05-23 10:15:22.0000 |
| 1 | 2022-05-23 10:15:23.0000 |
| 1 | 2022-05-23 10:15:24.0000 |
| 1 | 2022-05-23 10:15:25.0000 |

### Known Issues

The Expand On clause can use interval literal expansion, for this case, SnowConvert AI will add an error that this translation is planned.

#### Interval literal expansion

##### Teradata

```sql
 SELECT
    id,
    BEGIN(bg)
FROM
    table1 EXPAND ON pd AS bg BY INTERVAL '1' SECOND;
```

##### Result

| id | BEGIN(bg) |
| --- | --- |
| 1 | 2022-05-23 10:15:20.0000 |
| 1 | 2022-05-23 10:15:21.0000 |
| 1 | 2022-05-23 10:15:22.0000 |
| 1 | 2022-05-23 10:15:23.0000 |
| 1 | 2022-05-23 10:15:24.0000 |

##### Snowflake

```sql
 SELECT
    id,
    PUBLIC.PERIOD_BEGIN_UDF(bg) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
FROM
    table1
!!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'EXPAND ON' NODE ***/!!!
EXPAND ON pd AS bg BY INTERVAL '1' SECOND;
```

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-EWI-TD0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Snowflake does not support the period datatype, all periods are handled as varchar instead.

## Normalize

Translation reference to convert Teradata Normalize functionality to Snowflake

### Description

> NORMALIZE specifies that period values in the first-period column that meet or overlap are combined to form a period that encompasses the individual period values. For more information about Normalize clause, see the [Teradata documentation](https://docs.teradata.com/r/2_MC9vCtAJRlKle2Rpb0mA/UuxiA0mklFgv~33X5nyKMA).

### Sample Source Patterns

> **Note:**
>
> Some parts in the output code are omitteed for clarity reasons.

#### Sample data

##### Teradata

```sql
 CREATE TABLE project (
    emp_id INTEGER,
    project_name VARCHAR(20),
    dept_id INTEGER,
    duration PERIOD(DATE)
);

INSERT INTO project
VALUES
    (
        10,
        'First Phase',
        1000,
        PERIOD(DATE '2010-01-10', DATE '2010-03-20')
    );

INSERT INTO project
VALUES
    (
        10,
        'First Phase',
        2000,
        PERIOD(DATE '2010-03-20', DATE '2010-07-15')
    );

INSERT INTO project
VALUES
    (
        10,
        'Second Phase',
        2000,
        PERIOD(DATE '2010-06-15', DATE '2010-08-18')
    );

INSERT INTO project
VALUES
    (
        20,
        'First Phase',
        2000,
        PERIOD(DATE '2010-03-10', DATE '2010-07-20')
    );

INSERT INTO project
VALUES
    (
        20,
        'Second Phase',
        1000,
        PERIOD(DATE '2020-05-10', DATE '2020-09-20')
    );
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE project (
    emp_id INTEGER,
    project_name VARCHAR(20),
    dept_id INTEGER,
    duration VARCHAR(24) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO project
VALUES (
10,
        'First Phase',
        1000, PUBLIC.PERIOD_UDF(DATE '2010-01-10', DATE '2010-03-20') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);

INSERT INTO project
VALUES (
10,
        'First Phase',
        2000, PUBLIC.PERIOD_UDF(DATE '2010-03-20', DATE '2010-07-15') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);

INSERT INTO project
VALUES (
10,
        'Second Phase',
        2000, PUBLIC.PERIOD_UDF(DATE '2010-06-15', DATE '2010-08-18') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);

INSERT INTO project
VALUES (
20,
        'First Phase',
        2000, PUBLIC.PERIOD_UDF(DATE '2010-03-10', DATE '2010-07-20') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);

INSERT INTO project
VALUES (
20,
        'Second Phase',
        1000, PUBLIC.PERIOD_UDF(DATE '2020-05-10', DATE '2020-09-20') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!);
```

#### Normalize Clause

Suppose you want to use Normalize clause with the employee id.

##### Teradata

```sql
 SELECT
    NORMALIZE emp_id,
    duration
FROM
    project;
```

##### Result

| EMP_ID | DURATION |
| --- | --- |
| 20 | (2010-03-10, 2010-07-20) |
| 10 | (2010-01-10, 2010-08-18) |
| 20 | (2020-05-10, 2010-09-20) |

##### Snowflake

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0079 - THE REQUIRED PERIOD TYPE COLUMN WAS NOT FOUND ***/!!!
WITH NormalizeCTE AS
(
    SELECT
        T1.*,
        SUM(GroupStartFlag)
        OVER (
        PARTITION BY
            emp_id, duration
        ORDER BY
            PeriodColumn_begin
        ROWS UNBOUNDED PRECEDING) GroupID
    FROM
        (
            SELECT
                emp_id,
                duration,
                PUBLIC.PERIOD_BEGIN_UDF(PeriodColumn) PeriodColumn_begin,
                PUBLIC.PERIOD_END_UDF(PeriodColumn) PeriodColumn_end,
                (CASE
                    WHEN PeriodColumn_begin <= LAG(PeriodColumn_end)
                    OVER (
                    PARTITION BY
                        emp_id, duration
                    ORDER BY
                        PeriodColumn_begin,
                        PeriodColumn_end)
                        THEN 0
                    ELSE 1
                END) GroupStartFlag
            FROM
                project
        ) T1
)
SELECT
    emp_id,
    duration,
    PUBLIC.PERIOD_UDF(MIN(PeriodColumn_begin), MAX(PeriodColumn_end))
FROM
    NormalizeCTE
GROUP BY
    emp_id,
    duration,
    GroupID;
```

##### Result

| EMP_ID | PUBLIC.PERIOD_UDF(MIN(START_DATE), MAX(END_DATE)) |
| --- | --- |
| 20 | 2020-05-10\*2010-09-20 |
| 20 | 2010-03-10\*2010-07-20 |
| 10 | 2010-01-10\*2010-08-18 |

### Known Issues

Normalize clause can use **ON MEETS OR OVERLAPS**, **ON OVERLAPS** or **ON OVERLAPS OR MEETS,** for these cases SnowConvert AI will add an error that this translation is planned for the future.

#### Teradata

```sql
 SELECT NORMALIZE ON MEETS OR OVERLAPS emp_id, duration FROM table1;
```

##### Snowflake

```sql
 SELECT
       !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'NORMALIZE SET QUANTIFIER' NODE ***/!!!
       NORMALIZE ON MEETS OR OVERLAPS emp_id,
duration FROM
table1;
```

### Related EWIs

1. [SSC-EWI-0073](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
2. [SSC-EWI-TD0079](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): The required period type column was not found.
3. [SSC-EWI-TD0053](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Snowflake does not support the period datatype, all periods are handled as varchar instead.

## Reset When

### Description

> Reset When determines the partition on which an SQL window function operates based on some specific condition. If the condition evaluates to True, a new dynamic sub partition is created within the existing window partition. For more information about Reset When, see the [Teradata documentation](https://docs.teradata.com/reader/1DcoER_KpnGTfgPinRAFUw/b7wL86OoMTPno6hrSPNdDg).

### Sample Source Patterns

#### Sample data

##### Teradata

**Query**

```sql
CREATE TABLE account_balance
(
  account_id INTEGER NOT NULL,
  month_id INTEGER,
  balance INTEGER
)
UNIQUE PRIMARY INDEX (account_id, month_id);

INSERT INTO account_balance VALUES (1, 1, 60);
INSERT INTO account_balance VALUES (1, 2, 99);
INSERT INTO account_balance VALUES (1, 3, 94);
INSERT INTO account_balance VALUES (1, 4, 90);
INSERT INTO account_balance VALUES (1, 5, 80);
INSERT INTO account_balance VALUES (1, 6, 88);
INSERT INTO account_balance VALUES (1, 7, 90);
INSERT INTO account_balance VALUES (1, 8, 92);
INSERT INTO account_balance VALUES (1, 9, 10);
INSERT INTO account_balance VALUES (1, 10, 60);
INSERT INTO account_balance VALUES (1, 11, 80);
INSERT INTO account_balance VALUES (1, 12, 10);
```

**Result**

| account_id | month_id | balance |
| --- | --- | --- |
| 1 | 1 | 60 |
| 1 | 2 | 99 |
| 1 | 3 | 94 |
| 1 | 4 | 90 |
| 1 | 5 | 80 |
| 1 | 6 | 88 |
| 1 | 7 | 90 |
| 1 | 8 | 92 |
| 1 | 9 | 10 |
| 1 | 10 | 60 |
| 1 | 11 | 80 |
| 1 | 12 | 10 |

##### Snowflake

**Query**

```sql
CREATE OR REPLACE TABLE account_balance (
  account_id INTEGER NOT NULL,
  month_id INTEGER,
  balance INTEGER,
  UNIQUE (account_id, month_id)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

INSERT INTO account_balance
VALUES (1, 1, 60);

INSERT INTO account_balance
VALUES (1, 2, 99);

INSERT INTO account_balance
VALUES (1, 3, 94);

INSERT INTO account_balance
VALUES (1, 4, 90);

INSERT INTO account_balance
VALUES (1, 5, 80);

INSERT INTO account_balance
VALUES (1, 6, 88);

INSERT INTO account_balance
VALUES (1, 7, 90);

INSERT INTO account_balance
VALUES (1, 8, 92);

INSERT INTO account_balance
VALUES (1, 9, 10);

INSERT INTO account_balance
VALUES (1, 10, 60);

INSERT INTO account_balance
VALUES (1, 11, 80);

INSERT INTO account_balance
VALUES (1, 12, 10);
```

**Result**

| account_id | month_id | balance |
| --- | --- | --- |
| 1 | 1 | 60 |
| 1 | 2 | 99 |
| 1 | 3 | 94 |
| 1 | 4 | 90 |
| 1 | 5 | 80 |
| 1 | 6 | 88 |
| 1 | 7 | 90 |
| 1 | 8 | 92 |
| 1 | 9 | 10 |
| 1 | 10 | 60 |
| 1 | 11 | 80 |
| 1 | 12 | 10 |

#### Reset When

For each account, suppose you want to analyze the sequence of consecutive monthly balance increases. When the balance of one month is less than or equal to the balance of the previous month, the requirement is to reset the counter to zero and restart.

To analyze this data, Teradata SQL uses a window function with a nested aggregate and a Reset When statement, as follows:

##### Teradata

**Query**

```sql
SELECT
   account_id,
   month_id,
   balance,
   (
     ROW_NUMBER() OVER (
       PARTITION BY account_id
       ORDER BY
         month_id RESET WHEN balance <= SUM(balance) OVER (
           PARTITION BY account_id
           ORDER BY month_id
           ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
         )
     ) -1
   ) AS balance_increase
FROM account_balance
ORDER BY 1, 2;
```

**Result**

| account_id | month_id | balance | balance_increase |
| --- | --- | --- | --- |
| 1 | 1 | 60 | 0 |
| 1 | 2 | 99 | 1 |
| 1 | 3 | 94 | 0 |
| 1 | 4 | 90 | 0 |
| 1 | 5 | 80 | 0 |
| 1 | 6 | 88 | 1 |
| 1 | 7 | 90 | 2 |
| 1 | 8 | 92 | 3 |
| 1 | 9 | 10 | 0 |
| 1 | 10 | 60 | 1 |
| 1 | 11 | 80 | 2 |
| 1 | 12 | 10 | 0 |

##### Snowflake

Snowflake does not support the Reset When clause in window functions. To reproduce the same result, the Teradata SQL code must be translated using native SQL syntax and nested subqueries, as follows:

**Query**

```sql
SELECT
   account_id,
   month_id,
   balance,
   (
     ROW_NUMBER() OVER (
   PARTITION BY
      account_id, new_dynamic_part
   ORDER BY
         month_id
     ) -1
   ) AS balance_increase
FROM
   (
      SELECT
   account_id,
   month_id,
   balance,
   previous_value,
   SUM(dynamic_part) OVER (
           PARTITION BY account_id
           ORDER BY month_id
   ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
         ) AS new_dynamic_part
      FROM
   (
      SELECT
         account_id,
         month_id,
         balance,
         SUM(balance) OVER (
                 PARTITION BY account_id
                 ORDER BY month_id
                 ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
               ) AS previous_value,
         (CASE
            WHEN balance <= previous_value
               THEN 1
            ELSE 0
         END) AS dynamic_part
      FROM
         account_balance
   )
   )
ORDER BY 1, 2;
```

**Result**

| account_id | month_id | balance | balance_increase |
| --- | --- | --- | --- |
| 1 | 1 | 60 | 0 |
| 1 | 2 | 99 | 1 |
| 1 | 3 | 94 | 0 |
| 1 | 4 | 90 | 0 |
| 1 | 5 | 80 | 0 |
| 1 | 6 | 88 | 1 |
| 1 | 7 | 90 | 2 |
| 1 | 8 | 92 | 3 |
| 1 | 9 | 10 | 0 |
| 1 | 10 | 60 | 1 |
| 1 | 11 | 80 | 2 |
| 1 | 12 | 10 | 0 |

Two nested sub-queries are needed to support the Reset When functionality in Snowflake.

In the inner sub-query, a dynamic partition indicator (dynamic_part) is created and populated. dynamic_part is set to 1 if one month’s balance is less than or equal to the preceding month’s balance; otherwise, it’s set to 0.

In the next layer, a new_dynamic_part attribute is generated as the result of a SUM window function.

Finally, a new_dynamic_part is added as a new partition attribute (dynamic partition) to the existing partition attribute (account_id) and applies the same ROW_NUMBER() window function as in Teradata.

After these changes, Snowflake generates the same output as Teradata.

#### Reset When when conditional window function is a column

Same example as above, except that now the window function used in the RESET WHEN condition is defined as a column called `previous`. This variation changes the transformation slightly since it is no longer necessary to define the `previous_value` as in the previous example. It is the same workaround.

##### Teradata

**Query**

```sql
SELECT
   account_id,
   month_id,
   balance,
   SUM(balance) OVER (
           PARTITION BY account_id
           ORDER BY month_id
           ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
         ) AS previous,
   (
     ROW_NUMBER() OVER (
       PARTITION BY account_id
       ORDER BY
         month_id RESET WHEN balance <= previous
     )
   ) AS balance_increase
FROM account_balance
ORDER BY 1, 2;
```

**Result**

| account_id | month_id | balance | previous | balance_increase |
| --- | --- | --- | --- | --- |
| 1 | 1 | 60 |  | 0 |
| 1 | 2 | 99 | 60 | 1 |
| 1 | 3 | 94 | 99 | 0 |
| 1 | 4 | 90 | 94 | 0 |
| 1 | 5 | 80 | 90 | 0 |
| 1 | 6 | 88 | 80 | 1 |
| 1 | 7 | 90 | 88 | 2 |
| 1 | 8 | 92 | 90 | 3 |
| 1 | 9 | 10 | 92 | 0 |
| 1 | 10 | 60 | 10 | 1 |
| 1 | 11 | 80 | 60 | 2 |
| 1 | 12 | 10 | 80 | 0 |

##### Snowflake

**Query**

```sql
SELECT
   account_id,
   month_id,
   balance,
   SUM(balance) OVER (
           PARTITION BY account_id
           ORDER BY month_id
           ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
         ) AS previous,
   (
     ROW_NUMBER() OVER (
   PARTITION BY
      account_id, new_dynamic_part
   ORDER BY
         month_id
     )
   ) AS balance_increase
FROM
   (
      SELECT
   account_id,
   month_id,
   balance,
   SUM(balance) OVER (
           PARTITION BY account_id
           ORDER BY month_id
           ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
         ) AS previous,
   SUM(dynamic_part) OVER (
           PARTITION BY account_id
           ORDER BY month_id
   ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
         ) AS new_dynamic_part
      FROM
   (
      SELECT
         account_id,
         month_id,
         balance,
         SUM(balance) OVER (
                 PARTITION BY account_id
                 ORDER BY month_id
                 ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
               ) AS previous,
         (CASE
            WHEN balance <= previous
               THEN 1
            ELSE 0
         END) AS dynamic_part
      FROM
         account_balance
   )
   )
ORDER BY 1, 2;
```

**Untitled**

| account_id | month_id | balance | previous | balance_increase |
| --- | --- | --- | --- | --- |
| 1 | 1 | 60 |  | 0 |
| 1 | 2 | 99 | 60 | 1 |
| 1 | 3 | 94 | 99 | 0 |
| 1 | 4 | 90 | 94 | 0 |
| 1 | 5 | 80 | 90 | 0 |
| 1 | 6 | 88 | 80 | 1 |
| 1 | 7 | 90 | 88 | 2 |
| 1 | 8 | 92 | 90 | 3 |
| 1 | 9 | 10 | 92 | 0 |
| 1 | 10 | 60 | 10 | 1 |
| 1 | 11 | 80 | 60 | 2 |
| 1 | 12 | 10 | 80 | 0 |

### Known Issues

The RESET WHEN clause could have some variations such as its condition. Currently, SnowConvert AI only supports binary conditions (<=, >=, <> or =), in any other type, as `IS NOT NULL`, SnowConvert AI will remove the RESET WHEN clause and add an error message since it is not supported in Snowflake, as shown in the following example.

#### Teradata

**Query**

```sql
SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
        ORDER BY month_id
        RESET WHEN balance IS NOT NULL
        ROWS UNBOUNDED PRECEDING
    ) as balance_increase
FROM account_balance
ORDER BY 1,2;
```

#### Snowflake

**Query**

```sql
SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0077 - RESET WHEN CLAUSE IS NOT SUPPORTED IN THIS SCENARIO DUE TO ITS CONDITION ***/!!!
        ORDER BY month_id
        ROWS UNBOUNDED PRECEDING
    ) as balance_increase
FROM
    account_balance
ORDER BY 1,2;
```

### Related EWIs

* [SSC-EWI-TD0077](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): RESET WHEN clause is not supported in this scenario due to its condition.

## SAMPLE clause

### Description

The SAMPLE clause in Teradata reduces the number of rows to be processed and it returns one or more samples of rows as a list of fractions or as a list of numbers of rows. The clause is used in the SELECT query. Please review the following [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Manipulation-Language/SELECT-Statements/SAMPLE-Clause) for more information.

**Teradata syntax**

```none
SAMPLE
  [ WITH REPLACEMENT ]
  [ RANDOMIZED LOCALIZATION ]
  { { fraction_description | count_description } [,...] |
    when_clause ]
  }
```

**Snowflake syntax**

Review the following [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/constructs/sample) for more information. `SAMPLE` and `TABLESAMPLE` are synonymous.

```none
SELECT ...
FROM ...
  { SAMPLE | TABLESAMPLE } [ samplingMethod ]
[ ... ]
```

Where:

```none
samplingMethod ::= {
{ BERNOULLI | ROW } ( { <probability> | <num> ROWS } ) |
{ SYSTEM | BLOCK } ( <probability> ) [ { REPEATABLE | SEED } ( <seed> ) ] }
```

* In Snowflake, the following keywords can be used interchangeably:

  > + `SAMPLE | TABLESAMPLE`
  > + `BERNOULLI | ROW`
  > + `SYSTEM | BLOCK`
  > + `REPEATABLE | SEED`

Review the following table to check on key differences.

| SAMPLE behavior | Teradata | Snowflake |
| --- | --- | --- |
| Sample by probability | Also known as fraction description. It must be a fractional number between 0,1 and 1. | Decimal number between 0 and 100. |
| Fixed number of rows | Also known as count description. It is a positive integer that determines the number of rows to be sampled. | It specifies the number of rows (up to 1,000,000) to sample from the table. Can be any integer between `0` (no rows selected) and `1000000` inclusive. |
| Repeated rows | It is known as `WITH REPLACEMENT.` This is used to query more samples than there are rows in the table. | It is known as `REPEATABLE` or `SEED`. This is used to make the query deterministic. It means that the same set of rows will be the same for each query run. |
| Sampling methods | *Proportional* and `RANDOMIZED ALLOCATION.` | `BERNOULLI` or `SYSTEM`. |

### Sample Source Patterns

#### Sample data

##### Teradata

**Query**

```sql
CREATE TABLE Employee (
    EmpNo INT,
    Name VARCHAR(100),
    DeptNo INT
);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (1, 'Alice', 100);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (2, 'Bob', 300);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (3, 'Charlie', 500);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (4, 'David', 200);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (5, 'Eve', 100);
```

##### Snowflake

**Query**

```sql
CREATE OR REPLACE TABLE Employee (
    EmpNo INT,
    Name VARCHAR(100),
    DeptNo INT
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "01/14/2025",  "domain": "test" }}'
;

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (1, 'Alice', 100);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (2, 'Bob', 300);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (3, 'Charlie', 500);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (4, 'David', 200);

INSERT INTO Employee (EmpNo, Name, DeptNo)
VALUES (5, 'Eve', 100);
```

#### SAMPLE clause

##### Fixed number of rows

Notice that for this example, the number of rows are a fixed number but not necessarily are the same result for each run.

**Teradata**

**Input**

```sql
SELECT * FROM Employee SAMPLE 2;
```

**Output**
2 rows.

**Snowflake**

**Input**

```sql
SELECT * FROM Employee SAMPLE (2 ROWS);
```

**Output**
2 rows.

##### Rows number based on probability

This option will return a variety of rows depending on the probability set.

**Teradata**

**Input**

```sql
SELECT * FROM Employee SAMPLE 0.25;
```

**Output**
25% of probability for each row: 1 output row.

**Snowflake**

**Input**

```sql
SELECT * FROM Employee SAMPLE (25);
```

**Output**
25% of probability for each row: 1 output row.

### Known Issues

#### Fixed number of rows with replacement

This option will return a fixed number of rows and will allows the repetition of the rows. In Snowflake, it is not possible to request more samples than rows in a table.

**Teradata sample**

**Input**

```sql
SELECT * FROM Employee SAMPLE WITH REPLACEMENT 8;
```

**Output**

| EmpNo | Name | DeptNo |
| --- | --- | --- |
| 5 | Eve | 100 |
| 5 | Eve | 100 |
| 5 | Eve | 100 |
| 4 | David | 200 |
| 4 | David | 200 |
| 3 | Charlie | 500 |
| 1 | Alice | 100 |
| 1 | Alice | 100 |

#### SAMPLEID related functionality

In Teradata, it is possible to assign a unique ID to each sample that is specified. It helps to identify which belongs to which sample. This is not ANSI grammar, instead it is an extension of Teradata.

**Teradata sample**

**Input**

```sql
SELECT name, SAMPLEID FROM employee SAMPLE 0.5, 0.25, 0.25;
```

**Output**

| Name | SampleId |
| --- | --- |
| Eve | 3 |
| Charlie | 1 |
| Alice | 1 |
| David | 2 |
| Bob | 1 |

In Snowflake, there is not a SAMPLEID function. A possible workaround may be the following, but it has to be adapted to each single case:

**Snowflake possible workaround**

**Input**

```sql
WITH sampled_data AS (
    -- Sample 100% of the rows from the Employee table
    SELECT *,
           ROW_NUMBER() OVER (ORDER BY EmpNo) AS row_num,
           COUNT(*) OVER () AS total_rows  -- Get the total row count to calculate sample size
    FROM Employee
)
SELECT Name,
       CASE
           -- First 50% of the rows
           WHEN row_num <= total_rows * 0.5 THEN 1
           -- Next 25% of the rows
           WHEN row_num <= total_rows * 0.75 THEN 2
           -- Remaining 25% of the rows
           ELSE 3
       END AS sample_id
FROM sampled_data
ORDER BY sample_id, row_num;  -- Order by sample_id and row_num for consistency
```

**Output**

| Name | SAMPLE_ID |
| --- | --- |
| Alice | 1 |
| Bob | 1 |
| Charlie | 2 |
| David | 3 |
| Eve | 3 |

#### Conditional sampling

In Snowflake there is not conditional sampling. This can be achieve by using CTE’s.

**Teradata sample**

**Input**

```sql
SELECT * FROM employee
SAMPLE WHEN DeptNo > 100 then 0.9
ELSE 0.1 END;
```

**Output**

| EmpNo | Name | DeptNo |
| --- | --- | --- |
| 3 | Charlie | 500 |
| 4 | David | 200 |
| 2 | Bob | 300 |

### Related EWIs

[SSC-EWI-0021](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Syntax not supported in Snowflake.

## Column-level FORMAT Clause in DML Statements

Translation reference for how Teradata column FORMAT clauses affect DML statement conversion to Snowflake.

### Description

In Teradata, the `FORMAT` clause on a column definition controls how date, timestamp, and time values are parsed from string literals. For example, if a column is defined as `DATE FORMAT 'MM-DD-YYYY'`, writing `WHERE hire_date = '03-30-2026'` means Teradata reads the string using that format.

Snowflake does not have this feature. To preserve the same behavior, SnowConvert AI:

1. **Comments out** the FORMAT clause in the `CREATE TABLE` output (see [DDL reference](ddl-teradata.md)).
2. **Adds conversion functions** (`TO_DATE`, `TO_TIMESTAMP`, or `TO_TIME`) around string literals in DML statements that reference the formatted column, using the equivalent Snowflake format string.

> **Note:**
>
> When the FORMAT matches Snowflake’s default output format for the column type (`'YYYY-MM-DD'` for `DATE`, `'HH:MI:SS'` for `TIME`, `'YYYY-MM-DDBHH:MI:SS'` for `TIMESTAMP`), the FORMAT clause is **silently removed** from the DDL and no conversion functions are added to DML statements. These formats are natively handled by Snowflake. The conversion functions described below only apply to non-standard formats.

> **Important:**
>
> For this to work, the `CREATE TABLE` that defines the `FORMAT` clause **must be included** in the conversion input. SnowConvert AI reads the FORMAT value and column type from the DDL and uses that information when converting DML statements. If the DDL is missing, the conversion functions will not be added. Always verify that the converted code behaves correctly when FORMAT clauses are present.

### Supported DML Contexts

The following contexts are handled when the target column has a translatable datetime FORMAT:

| Context | Teradata Pattern | Snowflake Result |
| --- | --- | --- |
| WHERE equality | `WHERE col = '03-30-2026'` | `WHERE col = TO_DATE('03-30-2026', 'MM-DD-YYYY')` |
| WHERE comparison | `WHERE col > '01-01-2026'` | `WHERE col > TO_DATE('01-01-2026', 'MM-DD-YYYY')` |
| WHERE BETWEEN | `WHERE col BETWEEN '...' AND '...'` | Both bounds converted |
| WHERE IN | `WHERE col IN ('...', '...')` | All values converted |
| INSERT VALUES | `VALUES ('03-30-2026')` | `VALUES (TO_DATE('03-30-2026', 'MM-DD-YYYY'))` |
| UPDATE SET | `SET col = '03-30-2026'` | `SET col = TO_DATE('03-30-2026', 'MM-DD-YYYY')` |
| MERGE UPDATE | `UPDATE SET col = '...'` | Conversion function added |
| MERGE INSERT | `INSERT (col) VALUES ('...')` | Conversion function added |
| JOIN ON | `ON col = '03-30-2026'` | `ON col = TO_DATE('03-30-2026', 'MM-DD-YYYY')` |

### Sample Source Patterns

#### WHERE Equality with Date FORMAT

**Teradata**

```sql
CREATE TABLE employee (
  id INTEGER,
  hire_date DATE FORMAT 'MM-DD-YYYY'
);

SELECT * FROM employee WHERE hire_date = '03-30-2026';
```

**Snowflake**

```sql
CREATE OR REPLACE TABLE employee (
  id INTEGER,
  hire_date DATE
--                 --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'MM-DD-YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                 FORMAT 'MM-DD-YYYY'
)
;

SELECT
  *
FROM
  employee
WHERE
  hire_date = TO_DATE('03-30-2026', 'MM-DD-YYYY');
```

#### WHERE with TIME FORMAT

**Teradata**

```sql
CREATE TABLE shift_log (
  id INTEGER,
  shift_start TIME FORMAT 'HH.MI.SS'
);

SELECT * FROM shift_log WHERE shift_start = '08.30.00';
```

**Snowflake**

```sql
CREATE OR REPLACE TABLE shift_log (
  id INTEGER,
  shift_start TIME
--                   --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'HH.MI.SS' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                   FORMAT 'HH.MI.SS'
)
;

SELECT
  *
FROM
  shift_log
WHERE
  shift_start = TO_TIME('08.30.00', 'HH.MI.SS');
```

#### JOIN ON with Date FORMAT

**Teradata**

```sql
CREATE TABLE event_log (
  id INTEGER,
  event_date DATE FORMAT 'MM-DD-YYYY'
);

CREATE TABLE event_source (
  id INTEGER
);

SELECT * FROM event_log e JOIN event_source s ON e.event_date = '03-30-2026' AND e.id = s.id;
```

**Snowflake**

```sql
CREATE OR REPLACE TABLE event_log (
  id INTEGER,
  event_date DATE
--                  --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'MM-DD-YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                  FORMAT 'MM-DD-YYYY'
)
;

CREATE OR REPLACE TABLE event_source (
  id INTEGER
)
;

SELECT
  *
FROM
  event_log e
  JOIN event_source s
    ON e.event_date = TO_DATE('03-30-2026', 'MM-DD-YYYY')
    AND e.id = s.id;
```

### Best Practices

* Always include the `CREATE TABLE` statements that define `FORMAT` clauses in the conversion input. Without them, SnowConvert AI cannot add conversion functions to DML statements.
* After conversion, verify that the converted code behaves correctly when these formats are present, especially for edge cases such as `INSERT ... SELECT` statements or columns with untranslatable format strings.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

### Related EWIs

1. [SSC-FDM-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Column-level FORMAT clause is not supported in Snowflake. Conversion functions are used in DML statements as a workaround.
2. [SSC-FDM-TD0041](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Column-level display-only FORMAT clause is not supported in Snowflake. No action needed.
3. [SSC-EWI-TD0040](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Column-level FORMAT clause cannot be automatically converted to Snowflake.

---
title: SnowConvert AI - Teradata - FLOAD
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/fastload-translation.md
section: Migrations
---

# SnowConvert AI - Teradata - FLOAD

Translation references to convert Teradata FLOAD files to Python

Teradata FastLoad is a command‑driven utility for quickly loading large amounts of data in an empty table on a Teradata Database.

To simulate the FastLoad functionality for Teradata in Snowflake, FastLoad files and commands are transformed to Python code, similar to the transformations performed for BTEQ and MultiLoad scripts. The generated code uses the Snowflake Python project called [snowconvert.helpers](snowconvert-script-helpers.md) which contains the required functions to simulate the FastLoad statements in Snowflake.

## FastLoad Commands Translation

Most of the [FastLoad commands](https://docs.teradata.com/r/vIWhrlrRPxEfMbR9H0qaTQ/GB0V~iGzwIASn~LiFWyAfA) are considered not relevant in Snowflake, these commands are commented out. Below is the summary list of FastLoad commands and their transformation status into Snowflake:

| Teradata FastLoad Command | Transformation Status | Note |
| --- | --- | --- |
| AXSMOD | Commented | ​ |
| BEGIN LOADING | **Transformed** | ​The node is commented out since the transformation occurs in the related INSERT statement instead. |
| CLEAR | Commented | ​ |
| DATEFORM | Commented | ​ |
| DEFINE | **Transformed** | ​ |
| END LOADING | **Transformed** | ​Commented out since is not necessary for the transformation of the BEGIN LOADING. |
| ERRLIMIT | Commented | ​ |
| HELP | Commented | ​ |
| HELP TABLE | Commented | ​ |
| INSERT | **Transformed** | Transformed as part of the BEGIN LOADING. |
| LOGDATA | Commented | ​ |
| LOGMECH | Commented | ​ |
| LOGOFF | Commented | ​ |
| LOGON | Commented | ​ |
| NOTIFY | Commented | ​ |
| OS | Commented | ​ |
| QUIT | Commented | ​ |
| RECORD | Commented | ​ |
| RUN | Commented | ​ |
| SESSIONS | Commented | ​ |
| SET RECORD | **Transformed** | ​ |
| SET SESSION CHARSET | Commented | ​ |
| SHOW | Commented | ​ |
| SHOW VERSIONS | Commented | ​ |
| SLEEP | Commented | ​ |
| TENACITY | Commented | ​ |

### Default Transformation

The default behavior of the ConversionTool for these statements is to comment them out. For example:

**Teradata (FastLoad)**

```sql
 SESSIONS 4;
ERRLIMIT 25;
```

**Snowflake (Python)**

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #SESSIONS 4

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #ERRLIMIT 25

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

Nonetheless, there are some exceptions that must be converted to specific Python statements to work as intended in Snowflake.

### BEGIN LOADING (And related commands)

The transformation for the command `BEGIN LOADING` is a multi-part transformation that requires the DEFINE, INSERT and (optionally) SET RECORD commands to simulate its behavior correctly.

This transformation is fully explained in this section.

#### SET RECORD

As stated above, this command is not required for the transformation of the BEGIN LOADING. If not found, the default delimiter will be set to ‘,’ (comma). Else, the defined delimiter will be used.

**Teradata (FastLoad)**

```sql
 BEGIN LOADING FastTable ERRORFILES Error1,Error2
   CHECKPOINT 10000;
```

**Snowflake (Python)**

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #BEGIN LOADING FastTable ERRORFILES Error1, Error2 CHECKPOINT 10000

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

In the example above, `FastTable` is the name of the table associated to the `BEGIN LOADING` command. Note the use of the python variable`inputDataPlaceholder`, that must be defined by the user in a previous step. The value represents the Snowflake stage that could be internal or external as shown in the following table or as [explained here](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html#examples).

| Stage | Input Data Place Holder |
| --- | --- |
| Stage | Input Data Place Holder |
| Internal stage | `@my_int_stage` |
| External stage | `@my_int_stage/path/file.csv` |
| Amazon S3 bucket | `s3://mybucket/data/files` |
| Google Cloud Storage | `gcs://mybucket/data/files` |
| Microsoft Azure | `azure://myaccount.blob.core.windows.net/mycontainer/data/files` |

### Embedded SQL

FastLoad scripts support Teradata statements inside the same file. The majority of these statements are converted just as if they were inside a BTEQ file, with some exceptions.

Dropping an error table is commented out if inside a FastLoad file.

**Teradata (FastLoad)**

```sql
 DROP TABLE Error1;
DROP TABLE Error2;
```

**Snowflake (Python)**

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    DROP TABLE Error1
    """)
  exec("""
    DROP TABLE Error2
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### Large Example

Given the transformations shown above for a variety of commands, consider the following example.

**Teradata (FastLoad)**

```sql
 SESSIONS 4;
ERRLIMIT 25;
DROP TABLE FastTable;
DROP TABLE Error1;
DROP TABLE Error2;
CREATE TABLE FastTable, NO FALLBACK
   ( ID INTEGER, UFACTOR INTEGER, MISC CHAR(42))
   PRIMARY INDEX(ID);
DEFINE ID (INTEGER), UFACTOR (INTEGER), MISC (CHAR(42))
   FILE=FileName;
SHOW;
BEGIN LOADING FastTable ERRORFILES Error1,Error2
   CHECKPOINT 10000;
INSERT INTO FastTable (ID, UFACTOR, MISC) VALUES
   (:ID, :MISC);
END LOADING;
```

**Snowflake (Python)**

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***
#** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "Error1", "Error2" **

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #SESSIONS 4

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #ERRLIMIT 25

  exec("""
    DROP TABLE FastTable
    """)
  exec("""
    CREATE OR REPLACE TABLE FastTable (
      ID INTEGER,
      UFACTOR INTEGER,
      MISC CHAR(42)
    )
    """)
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE ID (INTEGER), UFACTOR (INTEGER), MISC (CHAR(42)) FILE = FileName

  ssc_define_columns = "ID (INTEGER), UFACTOR (INTEGER), MISC (CHAR(42))"
  #Set file name manually if empty
  ssc_define_file = f"""FileName"""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #SHOW

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #BEGIN LOADING FastTable ERRORFILES Error1, Error2 CHECKPOINT 10000

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS PART OF THE BEGIN LOADING TRANSLATION **
  #INSERT INTO FastTable (ID, UFACTOR, MISC) VALUES (:ID, :MISC)

  ssc_begin_loading_columns = "(ID, UFACTOR, MISC)"
  ssc_begin_loading_values = [":ID", ":MISC"]
  BeginLoading.import_file_to_table(f"""FastTable""", ssc_define_columns, ssc_define_file, ssc_begin_loading_columns, ssc_begin_loading_values, ",")
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. END LOADING **
  #END LOADING

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

If you have any additional questions regarding this documentation, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## Known Issues

No issues were found.

## Related EWIs

1. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
2. [SSC-FDM-0027](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Removed next statement, not applicable in Snowflake.

## BEGIN LOADING

The transformation for the command `BEGIN LOADING` is a multi-part transformation that requires the `DEFINE`, `INSERT` and (optionally) `SET RECORD` commands to simulate its behavior correctly.

This transformation is fully explained in the following subsections.

### SET RECORD

As stated above, this command is not required for the transformation of the BEGIN LOADING. If not found, the default delimiter will be set to ‘,’ (comma). Else, the defined delimiter will be used. This value is stored in the `ssc_set_record` variable.

As of now only `SET RECORD VARTEXT`, `SET RECORD FORMATTED` and `SET RECORD UNFORMATTED` are supported. For the `BINARY` and `TEXT` keyword specification an error EWI is placed instead.

**Teradata (FastLoad)**

```sql
SET RECORD VARTEXT DELIMITER 'c' DISPLAY ERRORS 'efilename';
SET RECORD VARTEXT 'l' 'c' NOSTOP;
SET RECORD VARTEXT 'l' TRIM NONE LEADING 'p';
SET RECORD VARTEXT 'l' TRIM NONE TRAILING 'p';
SET RECORD VARTEXT 'l' TRIM NONE BOTH 'p';
SET RECORD FORMATTED TRIM NONE BOTH;
SET RECORD UNFORMATTED QUOTE NO OPTIONAL;
SET RECORD BINARY QUOTE NO YES 'q';
SET RECORD TEXT QUOTE OPTIONAL;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT DELIMITER 'c' DISPLAY ERRORS 'efilename'

  ssc_set_record = ""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT 'l' 'c' NOSTOP

  ssc_set_record = "'l'"
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT 'l' TRIM NONE LEADING 'p'

  ssc_set_record = "'l'"
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT 'l' TRIM NONE TRAILING 'p'

  ssc_set_record = "'l'"
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT 'l' TRIM NONE BOTH 'p'

  ssc_set_record = "'l'"
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD FORMATTED TRIM NONE BOTH

  ssc_set_record = ","
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD UNFORMATTED QUOTE NO OPTIONAL

  ssc_set_record = "UNFORMATTED"
  #** SSC-EWI-0021 - 'BINARY' KEYWORD SPECIFICATION FOR SET RECORD NOT SUPPORTED IN SNOWFLAKE **
  #SET RECORD BINARY QUOTE NO YES 'q'

  #** SSC-EWI-0021 - 'TEXT' KEYWORD SPECIFICATION FOR SET RECORD NOT SUPPORTED IN SNOWFLAKE **
  #SET RECORD TEXT QUOTE OPTIONAL

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### DEFINE

The transformation for the `DEFINE` command sets the `ssc_define_columns` and `ssc_define_file` variables with the value of the columns definition and the file path to be used in the `BEGIN LOADING` transformation respectively.

**Teradata (FastLoad)**

```sql
DEFINE
    id (INTEGER),
    first_name (VARCHAR(50)),
    last_name (VARCHAR(50)),
    salary (FLOAT)
FILE=/tmp/inputData.txt;

DEFINE
    id (INTEGER),
    first_name (VARCHAR(50)),
    last_name (VARCHAR(50)),
    salary (FLOAT)

DEFINE
FILE=/tmp/inputData.txt;

DEFINE;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE id (INTEGER), first_name (VARCHAR(50)), last_name (VARCHAR(50)), salary (FLOAT) FILE = /tmp/inputData.txt

  ssc_define_columns = "id (INTEGER), first_name (VARCHAR(50)), last_name (VARCHAR(50)), salary (FLOAT)"
  #Set file name manually if empty
  ssc_define_file = f"""/tmp/inputData.txt"""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE id (INTEGER), first_name (VARCHAR(50)), last_name (VARCHAR(50)), salary (FLOAT)

  ssc_define_columns = "id (INTEGER), first_name (VARCHAR(50)), last_name (VARCHAR(50)), salary (FLOAT)"
  #Set file name manually if empty
  ssc_define_file = f""""""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE FILE = /tmp/inputData.txt

  ssc_define_columns = ""
  #Set file name manually if empty
  ssc_define_file = f"""/tmp/inputData.txt"""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE

  ssc_define_columns = ""
  #Set file name manually if empty
  ssc_define_file = f""""""
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### BEGIN LOADING

The `BEGIN LOADING` command is commented out since the relevant information for the transformation is found in the associated `INSERT` statement instead.

`ERRORFILES`, `NODROP`, `CHECKPOINT`, `INDICATORS` and `DATAENCRYPTION` specifications are not necessary for the transformation and thus commented out.

**Teradata (FastLoad)**

```sql
BEGIN LOADING FastTable ERRORFILES Error1,Error2
   CHECKPOINT 10000;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #BEGIN LOADING FastTable ERRORFILES Error1, Error2 CHECKPOINT 10000

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### INSERT

The transformation for the associated `INSERT` statement sets the value for the `ssc_begin_loading_columns` and `ssc_begin_loading_values` variables, used to determine the order in which to insert the values to be loaded.

Finally, these variables and the ones described in the above sections are used to call the `BeginLoading.import_file_to_table` function part of the `SnowConvert.Helpers` module. This function simulates the behavior of the whole FastLoad `BEGIN LOADING` process. To learn more about this function check here.

**Teradata (FastLoad)**

```sql
SET RECORD VARTEXT """";
DEFINE
    _col1 (CHAR(10)),
    _col2 (CHAR(7)),
    _col3 (CHAR(2, NULLIF = 'V5'))
FILE=inputDataNoDel.txt;
BEGIN LOADING TESTS.EmpLoad4
ERRORFILES ${CPRDBName}.ET_${LOADTABLE},${CPRDBName}.UV_${LOADTABLE}
CHECKPOINT 1000;
INSERT INTO TESTS.EmpLoad4 (col2, col3, col1, col4)
VALUES
(
    :_col2,
    :_col3,
    :_col1,
    CURRENT_DATE
);
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***
#** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TESTS.EmpLoad4" **

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
#** SSC-FDM-TD0022 - SHELL VARIABLES FOUND, RUNNING THIS CODE IN A SHELL SCRIPT IS REQUIRED **
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS AN ASSIGNMENT STATEMENT **
  #SET RECORD VARTEXT "" ""

  ssc_set_record = ""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS ASSIGNMENT STATEMENTS **
  #DEFINE _col1 (CHAR(10)), _col2 (CHAR(7)), _col3 (CHAR(2, NULLIF = 'V5')) FILE = inputDataNoDel.txt

  ssc_define_columns = "_col1 (CHAR(10)), _col2 (CHAR(7)), _col3 (CHAR(2, NULLIF = 'V5'))"
  #Set file name manually if empty
  ssc_define_file = f"""inputDataNoDel.txt"""
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #BEGIN LOADING TESTS.EmpLoad4 ERRORFILES ${CPRDBName}.ET_${LOADTABLE}, ${CPRDBName}.UV_${LOADTABLE} CHECKPOINT 1000

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW AS PART OF THE BEGIN LOADING TRANSLATION **
  #INSERT INTO TESTS.EmpLoad4 (col2, col3, col1, col4) VALUES (:_col2, :_col3, :_col1, CURRENT_DATE)

  ssc_begin_loading_columns = "(col2, col3, col1, col4)"
  ssc_begin_loading_values = [":_col2", ":_col3", ":_col1", "CURRENT_DATE()"]
  BeginLoading.import_file_to_table(f"""TESTS.EmpLoad4""", ssc_define_columns, ssc_define_file, ssc_begin_loading_columns, ssc_begin_loading_values, ssc_set_record)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

Internally, the `import_file_to_table` function creates a temporary stage and puts the local file in the stage to load into the specified table. However, the file might be already stored in one the supported cloud provider by [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table#required-parameters):

| Stage | Input Data Place Holder |
| --- | --- |
| **Stage** | **Input Data Place Holder** |
| Internal stage | `@my_int_stage` |
| External stage | `@my_int_stage/path/file.csv` |
| Amazon S3 bucket | `s3://mybucket/data/files` |
| Google Cloud Storage | `gcs://mybucket/data/files` |
| Microsoft Azure | `azure://myaccount.blob.core.windows.net/mycontainer/data/files` |

If this is the case, please manually add the additional parameter `input_data_place_holder="<cloud_provider_path>"` in the `import_file_to_table` function. For example:

```python
BeginLoading.import_file_to_table(
  f"""TESTS.EmpLoad4""",
  ssc_define_columns,
  ssc_define_file,
  ssc_begin_loading_columns,
  ssc_begin_loading_values,
  ssc_set_record,
  input_data_place_holder="s3://mybucket/data/files")
```

### END LOADING

The `END LOADING` command is commented out since is not necessary for the transformation of the `BEGIN LOADING`.

**Teradata (FastLoad)**

```sql
END LOADING;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. END LOADING **
  #END LOADING

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### Known Issues

**1. BINARY and TEXT keyword specification not supported**

The `BINARY` and `TEXT` keyword specification for the `SET RECORD` command are not yet supported.

**2. Only base specification for VARTEXT is supported**

Extra specifications for the `SET RECORD VARTEXT` such as `TRIM` or `QUOTE` are not yet supported.

### Related EWIs

1. [SSC-FDM-0007](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
2. [SSC-FDM-0027](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Removed next statement, not applicable in Snowflake.
3. [SSC-EWI-0021](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Not supported.
4. [SSC-FDM-TD0022](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Shell variables found, running this code in a shell script is required.

---
title: SnowConvert AI - Teradata - Iceberg Tables Transformations
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/Iceberg-tables-transformations.md
section: Migrations
---

# SnowConvert AI - Teradata - Iceberg Tables Transformations

This section covers the transformation of tables into Snowflake-managed Iceberg tables, performed by SnowConvert AI when the conversion setting [Table Translation](../../../general/getting-started/running-snowconvert/conversion/teradata-conversion-settings.md) is used.

## Temporary tables

The temporary option is not supported in Iceberg tables, they will be preserved as temporary.

### Teradata

```sql
CREATE VOLATILE TABLE myTable
(
  column1 NUMBER(15,0)
);
```

### Snowflake

```sql
CREATE OR REPLACE TEMPORARY TABLE myTable
(
 column1 NUMBER(15,0)
)
;
```

## Other tables

Other table types are going to be transformed into Iceberg tables.

### Teradata

```sql
CREATE TABLE myTable
(
  column1 NUMBER(15,0)
);
```

### Snowflake

```sql
CREATE OR REPLACE ICEBERG TABLE myTable (
  column1 NUMBER(15, 0)
)
CATALOG = 'SNOWFLAKE'
;
```

## Data types

The following column data type conversions are applied to comply with the Iceberg tables type requirements and restrictions.

> **Note:**
>
> Data types in the first column are the **Snowflake** data types that would normally be created if the table target is not Iceberg, while second column shows the data type generated for Iceberg tables.

| Original target type | New target type |
| --- | --- |
| TIME(X)  TIMESTAMP(X)  DATETIME(X)  TIMESTAMP_LTZ(X)  TIMESTAMP_NTZ(X)  TIME  TIMESTAMP  DATETIME  TIMESTAMP_LTZ  TIMESTAMP_NTZ  where X != 6 | TIME(6)  TIMESTAMP(6)  DATETIME(6)  TIMESTAMP_LTZ(6)  TIMESTAMP_NTZ(6)  TIME(6)  TIMESTAMP(6)  DATETIME(6)  TIMESTAMP_LTZ(6)  TIMESTAMP_NTZ(6) |
| VARCHAR(X)  STRING(X)  TEXT(X)  NVARCHAR(X)  NVARCHAR2(X)  CHAR VARYING(X)  NCHAR VARYING(X) | VARCHAR  STRING  TEXT  NVARCHAR  NVARCHAR2  CHAR VARYING  NCHAR VARYING |
| CHAR[(n)]  CHARACTER[(n)]  NCHAR[(n)] | VARCHAR  VARCHAR  VARCHAR |
| NUMBER  DECIMAL  DEC  NUMERIC  INT  INTEGER  BIGINT  SMALLINT  TINYINT  BYTEINT | NUMBER(38,0)  DECIMAL(38,0)  DEC(38,0)  NUMERIC(38,0)  NUMBER(38,0)  NUMBER(38,0)  NUMBER(38,0)  NUMBER(38,0)  NUMBER(38,0)  NUMBER(38,0) |
| FLOAT  FLOAT4  FLOAT8 | DOUBLE  DOUBLE  DOUBLE |
| VARBINARY[(n)] | BINARY[(n)] |

## PARTITION BY

The following PARTITION BY cases are supported:

### PARTITION BY name

Left as is.

#### Teradata

```sql
CREATE TABLE myTable
(
  customerName VARCHAR(30),
  areaCode INTEGER
)
PARTITION BY areaCode;
```

#### Snowflake

```sql
CREATE OR REPLACE ICEBERG TABLE myTable (
  customerName VARCHAR,
  areaCode NUMBER(38, 0)
)
PARTITION BY (areaCode)
CATALOG = 'SNOWFLAKE'
;
```

### PARTITION BY CASE_N (equality over single column)

When the CASE_N function follows this pattern:

```sql
PARTITION BY CASE_N(
  column_name = value1,
  column_name = value2,
  ...
  column_name = valueN)
```

It will be transformed to a PARTITION BY column_name.

#### Teradata

```sql
CREATE TABLE myTable
(
  customerName VARCHAR(30),
  weekDay VARCHAR(20)
)
PARTITION BY CASE_N(
weekDay =  'Sunday',
weekDay =  'Monday',
weekDay =  'Tuesday',
weekDay =  'Wednesday',
weekDay =  'Thursday',
weekDay =  'Friday',
weekDay =  'Saturday',
 NO CASE OR UNKNOWN);
```

#### Snowflake

```sql
CREATE OR REPLACE ICEBERG TABLE myTable (
  customerName VARCHAR,
  weekDay VARCHAR
)
PARTITION BY (weekDay)
CATALOG = 'SNOWFLAKE'
;
```

### PARTITION BY RANGE_N

PARTITION BY RANGE_N is transformed when it matches one of these patterns:

#### Numeric range

Pattern:

```sql
RANGE_N(columnName BETWEEN x AND y EACH z) -- x, y and z must be numeric constants.
```

This case will be changed with a BUCKET partition transform.

##### Teradata

```sql
CREATE TABLE myTable
(
  customerName VARCHAR(30),
  totalPurchases INTEGER
)
PARTITION BY RANGE_N(totalPurchases BETWEEN 5 AND 200 EACH 10);
```

##### Snowflake

```sql
CREATE OR REPLACE ICEBERG TABLE myTable (
  customerName VARCHAR,
  totalPurchases NUMBER(38, 0)
)
PARTITION BY (BUCKET(20, totalPurchases))
CATALOG = 'SNOWFLAKE'
;
```

#### Datetime range

Pattern:

```sql
RANGE_N(columnName BETWEEN date_constant AND date_constant EACH interval_constant) -- Interval qualifier must be YEAR, MONTH, DAY or HOUR
```

This case will be changed with the YEAR, MONTH, DAY or HOUR partition transforms.

##### Teradata

```sql
CREATE TABLE myTable
(
  customerName VARCHAR(30),
  purchaseDate DATE
)
PARTITION BY RANGE_N(purchaseDate BETWEEN DATE '2000-01-01' AND '2100-12-31' EACH INTERVAL '1' MONTH);
```

##### Snowflake

```sql
CREATE OR REPLACE ICEBERG TABLE myTable (
  customerName VARCHAR,
  purchaseDate DATE
)
PARTITION BY (MONTH(purchaseDate))
CATALOG = 'SNOWFLAKE'
;
```

## CASESPECIFIC and NOT CASESPECIFIC

Case sensitivity can be handled in Snowflake using the COLLATE column option, however, Iceberg Tables currently do not support collation at the column level.

To handle this difference, SnowConvert AI will enforce the “Disable use of COLLATE for Case Specification” [general conversion setting](../../../general/getting-started/running-snowconvert/conversion/teradata-conversion-settings.md) for all transformations that specify Iceberg Tables as target. This will cause the case sensitive comparisons to be emulated at query level with the UPPER function, preserving the case sensitivity functionality for Iceberg.

### Teradata

```sql
CREATE TABLE my_table
(
    col1 VARCHAR(50) NOT CASESPECIFIC
);

SELECT * FROM my_table WHERE col1 = 'test';
```

### Snowflake

```sql
--** SSC-FDM-TD0039 - COLLATION HANDLED AT QUERY LEVEL FOR THIS TABLE, ANY NEW QUERY OVER THIS TABLE SHOULD APPLY COLLATION APPROPRIATELY **
CREATE OR REPLACE ICEBERG TABLE my_table
(
    col1 VARCHAR
)
CATALOG = 'SNOWFLAKE'
;

SELECT
    * FROM
    my_table
WHERE
    UPPER(RTRIM( col1)) = UPPER(RTRIM('test'));
```

## Known Issues

### 1. Unsupported data types

Current Snowflake support for Iceberg tables does not allow data types like VARIANT or GEOGRAPHY to be used, tables with these types will be marked with an EWI.

### 2. Unsupported PARTITION BY cases

PARTITION BY cases different than the ones shown in this documentation will not be transformed, instead, the PARTITION BY clause will be commented out with a PRF.

## Related EWIs

1. [SSC-EWI-0115](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Iceberg table contains unsupported datatypes
2. [SSC-PRF-0010](../../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Partition by removed, at least one of the specified expressions have no iceberg partition transform equivalent
3. [SSC-FDM-TD0039](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Collation handled at query level for this table, any new query over this table should apply collation appropriately

---
title: SnowConvert AI - Teradata - MLOAD
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/multiload-translation.md
section: Migrations
---

# SnowConvert AI - Teradata - MLOAD

Translation references to convert Teradata MLOAD files to Python

Teradata MultiLoad is a command-driven utility for fast, high-volume maintenance on multiple tables and views in Teradata Database.

To simulate the MultiLoad functionality for Teradata in Snowflake, MultiLoad files and commands are transformed to Python code, similar to the transformations performed for BTEQ and FastLoad scripts. The generated code uses the Snowflake Python project called [snowconvert.helpers](snowconvert-script-helpers.md) which contains the required functions to simulate the MultiLoad statements in Snowflake.

## MultiLoad Commands Translation

Most of the [MultiLoad Commands](https://docs.teradata.com/reader/u5g65Je3hpMChJXfDyt1hg/KRpA6tp8QD64m48~ng0PFw) are considered not relevant in Snowflake, these commands are commented out. Below is the summary list of MultiLoad commands and their transformation status into Snowflake:

| Commands | Transformation Status | Note |
| --- | --- | --- |
| ACCEPT | Commented | ​ |
| [BEGIN MLOAD](begin-mload.md) | **Transformed** | ​​The node is commented out since the transformation occurs in other related statements instead. |
| BEGIN DELETE MLOAD | Commented | ​ |
| DATEFORM | Commented | ​ |
| DELETE | **Partially transformed** | Check [known issues](begin-mload.md).​ |
| DISPLAY | Commented | ​ |
| [DML LABEL](begin-mload.md) | **Transformed** | ​ |
| END MLOAD | **Transformed** | ​​Commented out since is not necessary for the transformation of the BEGIN MLOAD. |
| EOC | Commented | ​ |
| [FIELD](begin-mload.md) | **Transformed** | ​ |
| [FILLER](begin-mload.md) | **Transformed** | This command needs to be with a FIELD and LAYOUT command to be converted. |
| IF, ELSE, and ENDIF | Commented | ​ |
| [IMPORT](begin-mload.md) | **Transformed** | ​ |
| INSERT | **Transformed** | This is taken as a Teradata Statement, so it doesn't appear in this chapter. |
| [LAYOUT](begin-mload.md) | **Transformed** | This command needs to be with a FIELD and FILLER command to be converted. |
| LOGDATA | Commented | ​ |
| LOGMECH | Commented | ​ |
| LOGOFF | Commented | ​ |
| LOGON | Commented | ​ |
| LOGTABLE | Commented | ​ |
| PAUSE ACQUISITION | Commented | ​ |
| RELEASE MLOAD | Commented | ​ |
| ROUTE MESSAGES | Commented | ​ |
| RUN FILE | Commented | ​ |
| SET | Commented | ​ |
| SYSTEM | Commented | ​ |
| TABLE | Commented | ​ |
| UPDATE | **Transformed** | This is taken as a Teradata Statement, so it doesn't appear in this chapter. |
| VERSION | Commented | ​ |

However, there are some exceptional commands that must be converted into Python-specific code for them to work as intended in Snowflake. See this section.

If you have any additional questions regarding this documentation, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

## BEGIN MLOAD

The transformation for the command `.BEGIN MLOAD` is a multi-part transformation that requires the `.LAYOUT`, `.FIELD`, `.FILLER`,`.DML LABEL`, and `.IMPORT` commands to simulate its behavior correctly.

This transformation is fully explained in the following subsections.

### .LAYOUT, .FIELD and .FILLER

The transformation for the commands `.LAYOUT`, `.FIELD`, and `.FILLER` will create variable definitions to be used in a future function call of the IMPORT of this layout.

**Teradata (MultiLoad)**

```sql
.LAYOUT INFILE_LAYOUT;
.FIELD TABLE_ID        * INTEGER;
.FIELD TABLE_DESCR     * CHAR(8);
.FILLER COL1           * CHAR(1);
.FIELD TABLE_NBR       * SMALLINT;
.FIELD TABLE_SOMEFIELD * SMALLINT;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  INFILE_LAYOUT_TableName = "INFILE_LAYOUT_TEMP_TABLE"
  INFILE_LAYOUT_Columns = """TABLE_ID INTEGER,
TABLE_DESCR CHAR(8),
COL1 CHAR(1),
TABLE_NBR SMALLINT,
TABLE_SOMEFIELD SMALLINT"""
  INFILE_LAYOUT_Conditions = """TABLE_ID AS TABLE_ID, TABLE_DESCR AS TABLE_DESCR, COL1 AS COL1, TABLE_NBR AS TABLE_NBR, TABLE_SOMEFIELD AS TABLE_SOMEFIELD"""
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### .DML LABEL

The transformation for the `.DML LABEL`command will create a function containing the statements after the label definition. Note that after the `.DML LABEL` command there is usually an `Insert`, `Update` or `Delete`.

**Teradata (MultiLoad)**

```sql
-- Example of .DML LABEL with INSERT:
.DML LABEL INSERT_TABLE;
INSERT INTO mydb.mytable( TABLE_ID,TABLE_DESCR,TABLE_NBR ) VALUES( :TABLE_ID,:TABLE_DESCR,:TABLE_NBR );

-- Example of .DML LABEL with DELETE:
.DML LABEL DELETE_TABLE;
DELETE FROM Employee WHERE EmpNo  = :EmpNo;

-- Example of .DML LABEL with an UPDATE, followed by an INSERT:
.DML LABEL UPSERT_TABLE DO INSERT FOR MISSING UPDATE ROWS;
UPDATE   mydb.mytable SET TABLE_ID = :TABLE_ID WHERE TABLE_DESCR = :somedescription
INSERT INTO mydb.mytable(TABLE_ID, TABLE_DESCR, TABLE_NBR) VALUES(:TABLE_ID, :TABLE_DESCR, :TABLE_NBR );
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  def INSERT_TABLE(tempTableName, queryConditions = ""):
    exec(f"""INSERT INTO mydb.mytable (TABLE_ID, TABLE_DESCR, TABLE_NBR)
SELECT
   :TABLE_ID,
   :TABLE_DESCR,
   :TABLE_NBR
FROM {tempTableName} SRC {queryConditions}""")
  exec("""
    DELETE FROM
      Employee
    WHERE
      EmpNo = :EmpNo
    """)
  def UPSERT_TABLE(tempTableName, queryConditions = ""):
    exec(f"""MERGE INTO mydb.mytable TGT USING (SELECT * FROM {tempTableName} {queryConditions}) SRC ON TABLE_DESCR = :somedescription
WHEN MATCHED THEN UPDATE SET
   TABLE_ID = :TABLE_ID
WHEN NOT MATCHED THEN INSERT (TABLE_ID, TABLE_DESCR, TABLE_NBR)
VALUES (:TABLE_ID, :TABLE_DESCR, :TABLE_NBR)""")
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### .IMPORT

The transformation of the `.IMPORT` command will create a call to the`import_file_to_temptable`helper to load the data from the file to a temporary table. Then, the calls to all the`APPLY`labels used in the original import will be created. Finally, the calls for an`INSERT`label will be transformed to a query parameter and optionally can have a query condition.

**Teradata (MultiLoad)**

```sql
.IMPORT INFILE INFILE_FILENAME
    LAYOUT INFILE_LAYOUT
    APPLY INSERT_TABLE
    APPLY UPSERT_TABLE
    Apply DELETE_TABLE;
```

**Snowflake (Python)**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #.IMPORT INFILE INFILE_FILENAME LAYOUT INFILE_LAYOUT APPLY INSERT_TABLE APPLY UPSERT_TABLE Apply DELETE_TABLE

  snowconvert.helpers.import_file_to_temptable(fr"INFILE_FILENAME", INFILE_LAYOUT_TableName, INFILE_LAYOUT_Columns, INFILE_LAYOUT_Conditions, ',')
  INSERT_TABLE(INFILE_LAYOUT_TableName)
  UPSERT_TABLE(INFILE_LAYOUT_TableName)
  DELETE_TABLE(INFILE_LAYOUT_TableName)
  exec(f"""DROP TABLE {INFILE_LAYOUT_TableName}""")
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### Large Example

Given the transformations shown above for a variety of commands, consider the following example.

With this input data:

```none
id,name,age
1,John,25
2,Maria,29
3,Carlos,31
4,Mike,40
5,Laura,27
```

**Teradata (MultiLoad)**

**Query**

```sql
.begin import mload
        tables
	mySampleTable1
sessions 20
ampcheck none;

.layout myLayOut;
 .field ID * VARCHAR(2) NULLIF ID = '1';
 .field NAME * VARCHAR(25);
 .field AGE * VARCHAR(10);
.dml label insert_data;

INSERT INTO mySampleTable1
 (
    ID,
    NAME,
    AGE
 )
VALUES
 (
    :ID,
    SUBSTRING(:NAME FROM 2),
    :AGE
 );

.import infile sampleData.txt
layout myLayOut
apply insert_data

.end mload;
.logoff;
```

**Result**

| ROW | ID | NAME | AGE |
| --- | --- | --- | --- |
| 1 | NULL | ohn | 25 |
| 2 | 2 | aria | 29 |
| 3 | 3 | arlos | 31 |
| 4 | 4 | ike | 40 |
| 5 | 5 | aura | 27 |

**Snowflake (Python)**

**Query**

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***
#** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "mySampleTable1" **

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
  #.begin import mload tables mySampleTable1 sessions 20 ampcheck none

  myLayOut_TableName = "myLayOut_TEMP_TABLE"
  myLayOut_Columns = """ID VARCHAR(2),
NAME VARCHAR(25),
AGE VARCHAR(10)"""
  myLayOut_Conditions = """CASE
   WHEN UPPER(RTRIM(ID)) = UPPER(RTRIM('1'))
      THEN NULL
   ELSE ID
END AS ID, NAME AS NAME, AGE AS AGE"""
  def insert_data(tempTableName, queryConditions = ""):
    exec(f"""INSERT INTO mySampleTable1 (ID, NAME, AGE)
SELECT
   SRC.ID,
   SUBSTRING(SRC.NAME, 2),
   SRC.AGE
FROM {tempTableName} SRC {queryConditions}""")
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. TRANSLATED BELOW **
  #.import infile sampleData.txt layout myLayOut apply insert_data

  snowconvert.helpers.import_file_to_temptable(fr"sampleData.txt", myLayOut_TableName, myLayOut_Columns, myLayOut_Conditions, ',')
  insert_data(myLayOut_TableName)
  exec(f"""DROP TABLE {myLayOut_TableName}""")

  if con is not None:
    con.close()
    con = None
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

**Result**

| ROW | ID | NAME | AGE |
| --- | --- | --- | --- |
| 1 | NULL | ohn | 25 |
| 2 | 2 | aria | 29 |
| 3 | 3 | arlos | 31 |
| 4 | 4 | ike | 40 |
| 5 | 5 | aura | 27 |

### Known Issues

**1. Delete statement is partially supported**

The `DELETE` statement is partially supported since the where conditions, when found, are not being converted correctly if pointing to a `LAYOUT` defined column.

In the example below, `:EmpNo` is pointing to a `LAYOUT` defined column. However, the transformation does not take this into account and thus the code will be referencing a column that does not exist.

```sql
  exec("""
    DELETE FROM
      Employee
    WHERE
      EmpNo = :EmpNo
    """)
```

If you have any additional questions regarding this documentation, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com).

### Related EWIs

1. [SSC-FDM-0027](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Removed next statement, not applicable in Snowflake.

---
title: SnowConvert AI - Teradata - MLOAD
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/mload.md
section: Migrations
---

# SnowConvert AI - Teradata - MLOAD

Translation references to convert Teradata MLOAD files to Snowflake Scripting.

## Description

The [Teradata MultiLoad (MLoad)](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Using-Teradata-MultiLoad) utility is designed for efficient batch maintenance of large databases, offering a command-driven approach for fast, high-volume data loading operations.

SnowConvert AI translates MLoad scripts into Snowflake Scripting using the [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) command with staged files.

### Translation Structure

The generated output is organized into two distinct sections due to Snowflake’s execution model:

**1. Stage and file upload (outside EXECUTE IMMEDIATE)**

The [`CREATE TEMPORARY STAGE`](https://docs.snowflake.com/en/sql-reference/sql/create-stage) and [`PUT`](https://docs.snowflake.com/en/sql-reference/sql/put) commands are placed **before** the `EXECUTE IMMEDIATE` block.

**Why?** The `PUT` command is a **client-side** operation—it transfers files from your local machine to a Snowflake stage. This file transfer happens on your machine, not on the Snowflake server. As a result, `PUT` can only run through [SnowSQL](https://docs.snowflake.com/en/user-guide/snowsql) (the command-line client) and cannot execute inside stored procedures, `EXECUTE IMMEDIATE` blocks, or the Snowflake web UI.

**2. Data loading (inside EXECUTE IMMEDIATE)**

The [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) statement and any additional logic are wrapped in an `EXECUTE IMMEDIATE` block with exception handling. This separation ensures the file upload completes first, and then the server-side data loading runs with proper error handling.

## Supported Commands

### .LOGTABLE

The [`.LOGTABLE`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/LOGTABLE) command stores checkpoint and restart information for MLoad sessions. Snowflake manages checkpointing internally, so this command is removed.

Since MLoad’s `IMPORT` command is translated to Snowflake’s `COPY INTO` statement, you can use the [`COPY_HISTORY`](https://docs.snowflake.com/en/sql-reference/functions/copy_history) table function to monitor and track your data loading operations. This function queries the loading history for a specified table within the last 14 days, returning details such as file names, load times, row counts, error messages, and statuses. For longer retention (up to 365 days), use the [`COPY_HISTORY` view](https://docs.snowflake.com/en/sql-reference/account-usage/copy_history) in the Account Usage schema.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LOGTABLE ${DATABASE}.LT_EMPLOYEES;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-TD0037 - REMOVED NEXT STATEMENT. USE COPY_HISTORY() FOR MONITORING **
-- .LOGTABLE ${DATABASE}.LT_EMPLOYEES;
```

### .SET Variables

The [`.SET`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/SET) command defines variables referenced with `&VARIABLE` throughout the script. These are translated to Snowflake Scripting variables using `DECLARE`.

#### Type Inference

Snowflake automatically infers the variable type from the assigned value, so explicit type declarations are **not required** in most cases. However, when the value involves **concatenation** (using the `||` operator), the `STRING` type must be explicitly declared.

| MLoad Source | Snowflake Translation |
| --- | --- |
| `.SET YEAR_VAL TO 2024;` | `YEAR_VAL := 2024;` |
| `.SET TBL TO 'MY_TABLE';` | `TBL STRING := 'MY_TABLE';` |
| `.SET DB_ALIAS TO &SRC_DB;` | `DB_ALIAS := :SRC_DB;` |
| `.SET KEY TO &A.&B;` | `KEY STRING := :A || :B;` |
| `.SET NAME TO 'ET_&TBL';` | `NAME STRING := 'ET_' || :TBL;` |

#### Bind Variables

Since the `.SET` command is translated to `DECLARE` variables in Snowflake Scripting, these variables must be referenced using [bind variable syntax](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/variables#using-a-variable-in-a-sql-statement-binding) when used within SQL statements. This is done by prefixing the variable name with a colon (`:`), which allows for dynamic substitution of values at execution time.

##### Teradata MLoad

```sql
INSERT INTO my_table VALUES (&my_variable, &another_var);
```

##### Snowflake Scripting

```sql
INSERT INTO my_table (column1, column2) VALUES (:my_variable, :another_var);
```

#### Using Variables as Object Identifiers

When a variable represents the name of a database object (table, schema, etc.), the [`IDENTIFIER`](https://docs.snowflake.com/en/sql-reference/identifier-literal) function must be used. This function tells Snowflake to interpret the variable’s value as an object identifier rather than a string literal. The function ensures the variable is treated strictly as an identifier, reducing security risks.

**Important:** The `IDENTIFIER()` function does not support concatenation expressions directly. You cannot write `IDENTIFIER(:schema || '.' || :table)`. To handle concatenated object names, an intermediate variable with the `sc_` prefix (SnowConvert) is generated to hold the pre-computed concatenation result.

##### Teradata MLoad

```sql
.SET table_name TO 'EMPLOYEES';
.SET schema_name TO 'HR';

SELECT * FROM &table_name;
DROP TABLE &schema_name..&table_name;
```

##### Snowflake Scripting

```sql
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    table_name STRING := 'EMPLOYEES';
    schema_name STRING := 'HR';
    sc_schema_name_dot_table_name STRING := :schema_name || '.' || :table_name;
  BEGIN
    SELECT
      *
    FROM
      IDENTIFIER(:table_name);
    DROP TABLE IDENTIFIER(:sc_schema_name_dot_table_name);
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$;
```

#### Variable Concatenation

MLoad supports concatenating variables using the dot (`.`) operator. A single dot joins values directly, while a double dot inserts a literal dot separator:

| MLoad Pattern | Result | Snowflake Translation |
| --- | --- | --- |
| `&DB.&TBL` | `DBTBL` | `:DB || :TBL` |
| `&DB..&TBL` | `DB.TBL` | `:DB || '.' || :TBL` |
| `&A..&B.&C` | `A.BC` | `:A || '.' || :B || :C` |

When concatenation is used within a string literal, embedded variables are extracted and concatenated:

| MLoad Pattern | Snowflake Translation |
| --- | --- |
| `'ET_&TABLE_NAME'` | `'ET_' || :TABLE_NAME` |
| `'sales_&COUNTRY_CODE'` | `'sales_' || :COUNTRY_CODE` |
| `'&SRC_DB..&TARGET'` | `:SRC_DB || '.' || :TARGET` |

#### Variable Reassignment

When a variable is reassigned after its initial declaration, the reassignment is placed inside the `BEGIN` block.

##### Teradata MLoad

```sql
.SET TBL TO 'TABLE1';
.SET TBL TO 'TABLE2';
```

##### Snowflake Scripting

```sql
EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    TBL STRING := 'TABLE1';
  BEGIN
    TBL := 'TABLE2';
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$;
```

#### Sample Source Patterns

##### Teradata MLoad

```sql
.SET DB_NAME TO '${DATABASE}';
.SET TABLE_NAME TO '${TABLE}';
.SET ERROR_TABLE TO 'ET_&TABLE_NAME';

DROP TABLE &DB_NAME..&ERROR_TABLE;
```

##### Snowflake Scripting

```sql
EXECUTE IMMEDIATE
$$
  --** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    DB_NAME STRING := '&{DATABASE}';
    TABLE_NAME STRING := '&{TABLE}';
    ERROR_TABLE STRING := 'ET_' || :TABLE_NAME;
    sc_DB_NAME_dot_ERROR_TABLE STRING := :DB_NAME || '.' || :ERROR_TABLE;
  BEGIN
    DROP TABLE IDENTIFIER(:sc_DB_NAME_dot_ERROR_TABLE);
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

### .BEGIN IMPORT MLOAD / .END MLOAD

These commands define the scope of an MLoad import operation. They are removed because Snowflake’s `COPY INTO` command is atomic and handles scope internally.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.BEGIN IMPORT MLOAD TABLES &DB_NAME..&TABLE_NAME
    WORKTABLES &DB_NAME..&WORK_TABLE
    ERRORTABLES &DB_NAME..&ERROR_TABLE
    CHECKPOINT 2000000;
...
.END MLOAD;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- .BEGIN IMPORT MLOAD TABLES &DB_NAME..&TABLE_NAME WORKTABLES ...
...
--** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **
-- .END MLOAD;
```

### .LOGOFF

The `.LOGOFF` command terminates the MLoad session. This is not applicable in Snowflake and is removed from the output.

## Data Import Translation

### VARTEXT Format (Delimited Files)

The `VARTEXT` format loads delimited files. If no delimiter is specified, the default is pipe (`|`).

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LAYOUT employee_layout;
.FIELD employee_id * CHAR(10);
.FIELD first_name * CHAR(50);
.FIELD last_name * CHAR(50);
.FIELD department * CHAR(30);
.FIELD salary * CHAR(15);

.DML LABEL insert_employees;
INSERT INTO &DB_NAME..&TABLE_NAME (
    employee_id,
    first_name,
    last_name,
    department,
    salary
) VALUES (
    :employee_id,
    :first_name,
    :last_name,
    :department,
    :salary
);

.IMPORT INFILE ${DATA_DIR}/${FILE_NAME}
    FROM 1
    FORMAT VARTEXT '|'
    LAYOUT employee_layout
    APPLY insert_employees;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://&{DATA_DIR}/&{FILE_NAME} @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    DB_NAME STRING := '&{DATABASE}';
    TABLE_NAME STRING := '&{TABLE}';
    sc_DB_NAME_dot_TABLE_NAME STRING := :DB_NAME || '.' || :TABLE_NAME;
  BEGIN
    BEGIN
      COPY INTO IDENTIFIER(:sc_DB_NAME_dot_TABLE_NAME) (
        employee_id,
        first_name,
        last_name,
        department,
        salary
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3,
          $4,
          $5
        FROM
          @sc_import_stage/&{FILE_NAME}
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

### TEXT/UNFORMAT Format (Fixed-Width Files)

The `TEXT` and `UNFORMAT` formats load fixed-width files. Field positions can use:

* **Asterisk (`*`)**: Automatic position calculation based on field length
* **Explicit number**: Specific byte position in the record

Fields are extracted using the `SUBSTRING` function.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LAYOUT employee_fixed_layout;
.FIELD employee_id 1 CHAR(10);
.FIELD first_name 11 CHAR(30);
.FILLER filler1 41 CHAR(20);
.FIELD last_name 61 CHAR(30);
.FILLER filler2 91 CHAR(20);
.FIELD department 111 CHAR(30);
.FIELD salary 141 CHAR(15);

.DML LABEL insert_employees;
INSERT INTO employees (employee_id, first_name, last_name, department, salary)
VALUES (:employee_id, :first_name, :last_name, :department, :salary);

.IMPORT INFILE employees.txt FORMAT TEXT LAYOUT employee_fixed_layout APPLY insert_employees;
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.txt @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name,
        last_name,
        department,
        salary
      )
      FROM
      (
        SELECT
          SUBSTRING($1, 1, 10),
          SUBSTRING($1, 11, 30),
          SUBSTRING($1, 61, 30),
          SUBSTRING($1, 111, 30),
          SUBSTRING($1, 141, 15)
        FROM
          @sc_import_stage/employees.txt
      )
      FILE_FORMAT = (TYPE = CSV)
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

> **Note:**
>
> `.FILLER` fields are excluded from the `SELECT` statement.

## Field Options

### Trim Options

The `.FIELD` command supports options for trimming data:

| MLoad Option | Snowflake Function |
| --- | --- |
| `DROP LEADING BLANKS` | `LTRIM($n)` |
| `DROP TRAILING NULLS` | `RTRIM($n)` |
| `DROP LEADING BLANKS AND TRAILING NULLS` | `TRIM($n)` |

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LAYOUT employee_layout;
.FIELD first_name * VARCHAR(50) DROP LEADING BLANKS;
.FIELD last_name * VARCHAR(50) DROP TRAILING NULLS;
.FIELD department * VARCHAR(30) DROP LEADING BLANKS AND TRAILING NULLS;

.DML LABEL insert_employees;
INSERT INTO employees (first_name, last_name, department)
VALUES (:first_name, :last_name, :department);

.IMPORT INFILE employees.csv
    FORMAT VARTEXT ','
    LAYOUT employee_layout
    APPLY insert_employees;
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.csv @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        first_name,
        last_name,
        department
      )
      FROM
      (
        SELECT
          LTRIM($1),
          RTRIM($2),
          TRIM($3)
        FROM
          @sc_import_stage/employees.csv
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

## Import Options

### FROM Clause (Skip Rows)

The `FROM n` clause specifies to start reading from row `n`. This translates to `SKIP_HEADER = n - 1`.

| MLoad | Snowflake |
| --- | --- |
| `FROM 1` | `SKIP_HEADER = 0` |
| `FROM 2` | `SKIP_HEADER = 1` |
| `FROM 5` | `SKIP_HEADER = 4` |

### WHERE Condition

The `WHERE` clause filters records during import. Since Snowflake’s [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) only supports `SELECT ... FROM ...` queries (without `WHERE`), the translation uses a **staging table pattern**:

1. Create a temporary staging table with the same structure as the target table
2. Load all data into the staging table using `COPY INTO`
3. Insert filtered records from the staging table into the target table using `INSERT INTO ... SELECT ... WHERE`
4. Drop the staging table

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LAYOUT employee_layout;
.FIELD employee_id * VARCHAR(10);
.FIELD first_name * VARCHAR(50);
.FIELD last_name * VARCHAR(50);
.FIELD department * VARCHAR(30);

.DML LABEL insert_employees;
INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (:employee_id, :first_name, :last_name, :department);

.IMPORT INFILE employees.csv
    FORMAT VARTEXT ','
    LAYOUT employee_layout
    APPLY insert_employees
    WHERE department = 'SALES';
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.csv @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      CREATE OR REPLACE TEMPORARY TABLE sc_employees_staging LIKE employees;
      COPY INTO sc_employees_staging (
        employee_id,
        first_name,
        last_name,
        department
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3,
          $4
        FROM
          @sc_import_stage/employees.csv
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',')
      ON_ERROR = 'CONTINUE';
      INSERT INTO employees
      SELECT
        *
      FROM
        sc_employees_staging
      WHERE
        department = 'SALES';
      DROP TABLE sc_employees_staging;
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

## File Path Handling

MLoad’s `.IMPORT INFILE` path is translated to Snowflake’s [`PUT`](https://docs.snowflake.com/en/sql-reference/sql/put) command, which requires the `file://` protocol prefix. The file is uploaded to a stage (`@sc_import_stage`) and then referenced in the `COPY INTO` statement.

### Bash Variables in Paths

Bash variables (`${VAR}`) are converted to [SnowSQL variable substitution](https://docs.snowflake.com/en/user-guide/snowsql-use#using-variables) syntax (`&{VAR}`). These require running the script through SnowSQL with variable substitution enabled.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.IMPORT INFILE ${DATA_DIR}/${FILE_NAME}
  FORMAT VARTEXT '|'
  LAYOUT employee_layout
  APPLY employee_insert;
```

##### Snowflake Scripting

```sql
--** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://&{DATA_DIR}/&{FILE_NAME} @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name,
        last_name
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3
        FROM
          @sc_import_stage/&{FILE_NAME}
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

### Local Variables in Paths (Not Supported)

Local MLoad variables (`&VAR`) are **not supported** in file paths for `PUT` and `COPY INTO` statements. An EWI marker is generated to indicate manual resolution is required.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.IMPORT INFILE &FILE_NAME
  FORMAT VARTEXT '|'
  LAYOUT local_input
  APPLY load_local;
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

!!!RESOLVE EWI!!! /*** SSC-EWI-TD0097 - LOCAL VARIABLES ARE CURRENTLY NOT SUPPORTED IN THE PUT STATEMENT. ***/!!!
--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://&FILE_NAME @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    FILE_NAME STRING := 'data.csv';
  BEGIN
    BEGIN
      COPY INTO employees (
        id,
        name
      )
      FROM
      (
        SELECT
          $1,
          $2
        FROM
          !!!RESOLVE EWI!!! /*** SSC-EWI-TD0097 - LOCAL VARIABLES ARE CURRENTLY NOT SUPPORTED IN THE COPY INTO STATEMENT. ***/!!!
          @sc_import_stage/&FILE_NAME
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

### Quoted Paths

Paths enclosed in single quotes (`'...'`) are handled differently. The `COPY INTO` statement uses the `FILES` clause to specify the file name instead of appending it to the stage path.

#### Sample Source Patterns

##### Teradata MLoad

```sql
.IMPORT INFILE '/data/employee records.csv'
  FORMAT VARTEXT '|'
  LAYOUT employee_layout
  APPLY employee_insert;
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT 'file:///data/employee records.csv' @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name,
        last_name
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3
        FROM
          @sc_import_stage
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE'
      FILES = ('employee records.csv');
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

### Windows Path Conversion

Quoted paths with Windows backslashes (`\`) are automatically converted to forward slashes (`/`) for compatibility with Snowflake’s `PUT` command.

| MLoad Path | Snowflake Output |
| --- | --- |
| `'C:\data\employees.csv'` | `'file://C:/data/employees.csv'` |

## Multiple Imports

When a script contains multiple `.IMPORT` commands:

* The `CREATE TEMPORARY STAGE` is executed once at the beginning
* All `PUT` commands for all imports are grouped together before the `EXECUTE IMMEDIATE` block
* Each import is translated to a separate `BEGIN...END` block inside the script

### Complete Example with Different Formats

The following example demonstrates two imports with different formats:

* First import: `VARTEXT` format (CSV with comma delimiter)
* Second import: `TEXT` format (fixed-width using `SUBSTRING`)

#### Sample Source Patterns

##### Teradata MLoad

```sql
.LAYOUT employees_insert_layout;
    .FIELD id * VARCHAR(10);
    .FIELD first_name * VARCHAR(50);
    .FIELD last_name * VARCHAR(50);
    .FIELD department * VARCHAR(50);
    .FIELD salary * VARCHAR(10);

.DML LABEL employees_insert_dml;

INSERT INTO employees_target (
    id,
    first_name,
    last_name,
    department,
    salary
) VALUES (
    :id,
    :first_name,
    :last_name,
    :department,
    :salary
);

.IMPORT INFILE employees.csv
    FORMAT VARTEXT ','
    LAYOUT employees_insert_layout
    APPLY employees_insert_dml;

.LAYOUT employees_text_asterisk;
    .FIELD id * CHAR(10);
    .FIELD first_name * CHAR(50);
    .FIELD last_name * CHAR(50);
    .FIELD department * CHAR(50);
    .FIELD salary * CHAR(10);

.DML LABEL employees_dml;

INSERT INTO employees_target (
    id,
    first_name,
    last_name,
    department,
    salary
) VALUES (
    :id,
    :first_name,
    :last_name,
    :department,
    :salary
);

.IMPORT INFILE employees_fixed.txt
    FORMAT TEXT
    LAYOUT employees_text_asterisk
    APPLY employees_dml;
```

##### Snowflake Scripting

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.csv @sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees_fixed.txt @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees_target (
        id,
        first_name,
        last_name,
        department,
        salary
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3,
          $4,
          $5
        FROM
          @sc_import_stage/employees.csv
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',')
      ON_ERROR = 'CONTINUE';
    END;

    BEGIN
      COPY INTO employees_target (
        id,
        first_name,
        last_name,
        department,
        salary
      )
      FROM
      (
        SELECT
          SUBSTRING($1, 1, 10),
          SUBSTRING($1, 11, 50),
          SUBSTRING($1, 61, 50),
          SUBSTRING($1, 111, 50),
          SUBSTRING($1, 161, 10)
        FROM
          @sc_import_stage/employees_fixed.txt
      )
      FILE_FORMAT = (TYPE = CSV)
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

## Translation Requirements

For the [`.IMPORT`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/IMPORT) command to be fully translated, all of the following conditions must be met. See Known Limitations for unsupported features.

### Required Components

| Component | Requirement | Error if Missing |
| --- | --- | --- |
| [`.LAYOUT`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/LAYOUT) definition | Must be defined before the `.IMPORT` command | `SSC-EWI-TD0094` |
| [`.DML LABEL`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/DML-LABEL) definition | Must contain at least one `INSERT INTO...VALUES` statement | `SSC-EWI-TD0094` |

### Supported Formats

The following file formats are fully translated to Snowflake’s `COPY INTO` command:

| MLoad Format | Description | Snowflake FILE_FORMAT |
| --- | --- | --- |
| `FORMAT VARTEXT ','` | Delimited file with separator | `TYPE = CSV FIELD_DELIMITER = ','` |
| `FORMAT VARTEXT` | Delimited file with default pipe separator | `TYPE = CSV FIELD_DELIMITER = '|'` |
| `FORMAT TEXT` | Fixed-width positional file | `TYPE = CSV` with `SUBSTRING` extraction |
| `FORMAT UNFORMAT` | Fixed-width positional file (binary-safe) | `TYPE = CSV` with `SUBSTRING` extraction |

### Supported Layout Definitions

Field definitions in `.LAYOUT` are translated based on the format type:

| MLoad Definition | Use Case | Snowflake Translation |
| --- | --- | --- |
| `.FIELD name * VARCHAR(n)` | Delimited files (VARTEXT) - auto position | `$1`, `$2`, `$3`… (positional columns) |
| `.FIELD name pos CHAR(n)` | Fixed-width files (TEXT/UNFORMAT) - explicit position | `SUBSTRING($1, pos, n)` |
| `.FILLER name pos CHAR(n)` | Skip unused bytes in fixed-width files | Excluded from SELECT |

### Supported DML Statements

The DML statement inside `.DML LABEL` determines how data is loaded:

| MLoad DML | Description | Snowflake Translation |
| --- | --- | --- |
| `INSERT INTO table (...) VALUES (...)` | Insert new records | `COPY INTO table (...) FROM (SELECT ...)` |

### Supported Field Modifiers

Field modifiers for trimming whitespace are translated to Snowflake string functions:

| MLoad Modifier | Description | Snowflake Function |
| --- | --- | --- |
| `DROP LEADING BLANKS` | Remove leading spaces | `LTRIM($n)` |
| `DROP TRAILING NULLS` | Remove trailing nulls/spaces | `RTRIM($n)` |
| `DROP LEADING BLANKS AND TRAILING NULLS` | Remove both leading and trailing | `TRIM($n)` |

### Supported IMPORT Clauses

The following `.IMPORT` options are translated to equivalent Snowflake functionality:

| MLoad Clause | Description | Snowflake Translation |
| --- | --- | --- |
| `FROM n` | Start reading from row n (skip header rows) | `SKIP_HEADER = n-1` in FILE_FORMAT |
| `WHERE condition` | Filter records during import | Uses staging table pattern (see WHERE Condition) |
| `NOSTOP` | Continue on errors | `ON_ERROR = 'CONTINUE'` |

## Known Limitations

The following MLoad features are not currently supported and will generate an EWI marker (`SSC-EWI-TD0094`) indicating manual resolution is required:

### Unsupported Formats

| Format | EWI Message |
| --- | --- |
| `FORMAT BINARY` | `BINARY FORMAT IS PENDING TRANSLATION` |
| `FORMAT FASTLOAD` | `FASTLOAD FORMAT IS PENDING TRANSLATION` |

### Unsupported Layout Types

| Layout Type | EWI Message |
| --- | --- |
| `.TABLE tablename` | `TABLE TYPE LAYOUT IS PENDING TRANSLATION` |

### Unsupported DML Statements in IMPORT

The following DML statements are not supported **when used within a `.DML LABEL` applied by an `.IMPORT` command**. Standalone DML statements outside of the import context are translated correctly.

| DML Statement in `.DML LABEL` | EWI Message |
| --- | --- |
| `UPDATE` statement only | `NON INSERT-VALUES DML STATEMENTS ARE PENDING TRANSLATION` |
| `DELETE` statement only | `NON INSERT-VALUES DML STATEMENTS ARE PENDING TRANSLATION` |

### Missing Required Components

| Missing Component | EWI Message |
| --- | --- |
| `.LAYOUT` definition not found | `LAYOUT DEFINITION WAS NOT FOUND IN THE SCRIPT` |
| `.DML LABEL` definition not found | `DML LABEL WAS NOT FOUND IN THE SCRIPT` |

## Related EWIs and FDMs

### Functional Difference Messages

| Code | Description |
| --- | --- |
| [SSC-FDM-TD0003](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) | Bash variables require SnowSQL variable substitution |
| [SSC-FDM-TD0037](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) | LOGTABLE removed; use `COPY_HISTORY()` for monitoring |
| [SSC-FDM-TD0038](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) | PUT command requires execution through SnowSQL |
| [SSC-FDM-0027](../../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md) | BEGIN/END MLOAD removed; not applicable in Snowflake |

### Issues

| Code | Description |
| --- | --- |
| [SSC-EWI-TD0094](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | IMPORT command not converted due to unsupported features |
| [SSC-EWI-TD0095](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | DML statement in IMPORT pending translation |
| [SSC-EWI-TD0096](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | COPY INTO requires explicit file name |
| [SSC-EWI-TD0097](../../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) | Local variables not supported in PUT or COPY INTO |

## Related Topics

* [COPY INTO (Snowflake Documentation)](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table)
* [PUT (Snowflake Documentation)](https://docs.snowflake.com/en/sql-reference/sql/put)
* [COPY_HISTORY Function](https://docs.snowflake.com/en/sql-reference/functions/copy_history)
* [Snowflake Scripting](https://docs.snowflake.com/en/developer-guide/snowflake-scripting/index)
* [SnowSQL](https://docs.snowflake.com/en/user-guide/snowsql)

---
title: SnowConvert AI - Teradata - Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/etl-bi-repointing/power-bi-teradata-repointing.md
section: Migrations
---

# SnowConvert AI - Teradata - Power BI Repointing

## Description

The Power BI repointing is a feature that provides an easy way to redefine the connections from the [M language in the Power Query Editor](https://learn.microsoft.com/en-us/powerquery-m/). This means that the connection parameters will be redefined to point to the Snowflake migration database context. For Teradata, the method in [M Language](https://learn.microsoft.com/en-us/powerquery-m/) that defined the connection is `Teradata.Database(...)`. In Snowflake, there is a connector that depends on some other parameters and the main connection is defined by `Snowflake.Database(...)` method. In addition, there is a limited support to `ODBC.Query` connector only for Teradata as a source language in the migration. This means that the source connection parameters (of Teradata connections) will be redefined to point to the Snowflake migration database context.

## Source Pattern Samples

### Entity Repointing Case: Table

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a table.

**Teradata Connection in the Power Query Editor**

```sql
let
    Source = Teradata.Database("the_teradata_server", [HierarchicalNavigation=true]),
    databaseTest = Source{[Schema="databaseTest"]}[Data],
    employees1 = databaseTest{[Name="employees"]}[Data]
in
    employees1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="databaseTest", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="EMPLOYEES", Kind="Table"]}[Data],
    Employees1 = Table.RenameColumns(SourceSfTbl, {{ "EMPLOYEEID", "EmployeeID"}, { "FIRSTNAME", "FirstName"}, { "LASTNAME", "LastName"}, { "HIREDATE", "HireDate"}, { "SALARY", "Salary"}, { "DEPARTMENTID", "DepartmentID"}})
in
    Employees1
```

### Entity Repointing Case: View

This case refers to connections that do not contain embedded SQL. This means that the user has established a connection from Power BI to a view.

**Teradata Connection in the Power Query Editor**

```sql
let
    Source = Teradata.Database("the_teradata_server", [HierarchicalNavigation=true]),
    databaseTest = Source{[Schema="databaseTest"]}[Data],
    EmployeeSalaryBonusView1 = databaseTest{[Name="EmployeeSalaryBonusView"]}[Data]
in
    EmployeeSalaryBonusView1
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="databaseTest", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="EMPLOYEESALARYBONUSVIEW", Kind="Table"]}[Data],
    EmployeeSalaryBonusView1 = Table.RenameColumns(SourceSfTbl, {{ "FIRSTNAME", "FirstName"}, { "LASTNAME", "LastName"}, { "HIREDATE", "HireDate"}})
in
    EmployeeSalaryBonusView1
```

### Embedded SQL Case

This case refers to connections that contains embedded SQL inside of them. This sample show a simple query but SnowConvert AI covers a range of more larger scenarios. Besides, there may be warning messages knows as EWI- PRF - FDM depending on the migrated query. This will help the user identifies patterns that needs extra attention.

**Teradata Connection in the Power Query Editor**

```sql
let
    Source = Teradata.Database("the_teradata_server", [HierarchicalNavigation=true, Query="SELECT *#(lf)FROM databaseTest.employees"])
in
    Source
```

**Snowflake Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "SELECT * FROM databaseTest.employees", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "EMPLOYEEID", "EmployeeID"}, { "FIRSTNAME", "FirstName"}, { "LASTNAME", "LastName"}, { "HIREDATE", "HireDate"}, { "SALARY", "Salary"}, { "DEPARTMENTID", "DepartmentID"}})
in
    Source
```

### ODBC.Query Case

At the moment it is supported only `ODBC.Query` connector. Other connectors as `ODBC.DataSource` are not supported.

This case refers to connections that contains embedded SQL inside of an `ODBC.Query` connector. Notice that all connections with `ODBC.Query` will be taken as Teradata source when migrating Teradata. Please, be aware of your report connection definitions.

**Teradata Connection in the Power Query Editor**

```sql
let
  Source = Odbc.Query("dsn=TERADATA_TEST", "SELECT * FROM TEST_TABLE")
in
  Source
```

**Snowflake Connection in the Power Query Editor**

```sql
let
   Source = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "SELECT * FROM TEST_TABLE", null, [EnableFolding=true])
in
   Source
```

---
title: SnowConvert AI - Teradata - Scripts To Python Translation Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/README.md
section: Migrations
---

# SnowConvert AI - Teradata - Scripts To Python Translation Reference

This section details how Snow Convert translates the Teradata Scripts (BTEQ, FastLoad, MultiLoad, TPUMP, etc.) into a scripting language compatible with Snowflake.

Browse through the following pages to find more information about specific topics.

* [BTEQ](bteq-translation.md), explore the translation reference for Basic Teradata Query syntax.
* [FastLoad](fastload-translation.md), explore the translation reference for FastLoad syntax.
* [MultiLoad](multiload-translation.md), explore the translation reference for MultiLoad syntax.
* [TPT](tpt-translation.md), explore the translation reference for TPT syntax.

---
title: SnowConvert AI - Teradata - Scripts to Snowflake SQL Translation Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/README.md
section: Migrations
---

# SnowConvert AI - Teradata - Scripts to Snowflake SQL Translation Reference

Translation reference to convert Teradata scripts files to Snowflake SQL

Browse through the following pages to find more information about specific topics.

---
title: SnowConvert AI - Teradata - Session Modes in Teradata
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/session-modes.md
section: Migrations
---

# SnowConvert AI - Teradata - Session Modes in Teradata

## Teradata session modes description

The Teradata database has different modes for running queries: ANSI Mode (rules based on the ANSI SQL: 2011 specifications) and TERA mode (rules defined by Teradata). Please review the following [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Request-and-Transaction-Processing/Transaction-Processing/Transaction-Semantics-Differences-in-ANSI-and-Teradata-Session-Modes) for more information.

### Teradata mode for strings informative table

For strings, the Teradata Mode works differently. As it is explained in the following table based on the [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Request-and-Transaction-Processing/Transaction-Processing/Comparison-of-Transactions-in-ANSI-and-Teradata-Session-Modes):

| Feature | ANSI mode | Teradata mode |
| --- | --- | --- |
| Default attribute for character comparisons | CASESPECIFIC | NOT CASESPECIFIC |
| Default TRIM behavior | TRIM(BOTH FROM) | TRIM(BOTH FROM) |

#### Translation specification summary

| Mode | Column constraint values | Teradata behavior | SC expected behavior |
| --- | --- | --- | --- |
| ANSI Mode | CASESPECIFIC | CASESPECIFIC | No constraint added. |
|  | NOT CASESPECIFIC | CASESPECIFIC | Add `COLLATE 'en-cs'` in column definition. |
| Teradata Mode | CASESPECIFIC | CASESPECIFIC | In most cases, do not add COLLATE, and convert its usages of string comparison to `RTRIM( expression )` |
|  | NOT CASESPECIFIC | NOT CASESPECIFIC | In most cases, do not add COLLATE, and convert its usages of string comparison to `RTRIM(UPPER( expression ))` |

### Available translation specification options

* TERA Mode For Strings Comparison - NO COLLATE
* TERA Mode For Strings Comparison - COLLATE
* ANSI Mode For Strings Comparison - NO COLLATE
* ANSI Mode For Strings Comparison - COLLATE

## ANSI Mode For Strings Comparison - COLLATE

This section defines the translation specification for a string in ANSI mode with the use of COLLATE.

### Description

#### ANSI mode for string comparison and COLLATE usage

The ANSI mode string comparison will apply the COLLATE constraint to the columns or statements as required. The default case specification trim behavior may be taken into account.

Notice that in Teradata, the default case specification is ‘`CASESPECIFIC`’, the same default as in Snowflake ‘`case-sensitive'`. Thus, these cases will not be translated with a `COLLATE` because it will be redundant.

### Sample Source Patterns

#### Setup data

##### Teradata

```sql
 CREATE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50) NOT CASESPECIFIC,
    last_name VARCHAR(50) CASESPECIFIC,
    department VARCHAR(50)
);

INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (1, 'George', 'Snow', 'Sales');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (2, 'John', 'SNOW', 'Engineering');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (5, 'Mary', '   ', 'SaleS  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (6, 'GEORGE', '  ', 'sales  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (7, 'GEORGE   ', '  ', 'salEs  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (9, 'JOHN', '   SnoW', 'IT');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50) NOT CASESPECIFIC,
    location VARCHAR(100) CASESPECIFIC,
    PRIMARY KEY (department_id)
);

INSERT INTO departments (department_id, department_name, location) VALUES (101, 'Information Technology', 'New York');
INSERT INTO departments (department_id, department_name, location) VALUES (102, 'Human Resources', 'Chicago');
INSERT INTO departments (department_id, department_name, location) VALUES (103, 'Sales', 'San Francisco');
INSERT INTO departments (department_id, department_name, location) VALUES (104, 'Finance', 'Boston');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    department VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (1, 'George', 'Snow', 'Sales');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (2, 'John', 'SNOW', 'Engineering');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (5, 'Mary', '   ', 'SaleS  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (6, 'GEORGE', '  ', 'sales  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (7, 'GEORGE   ', '  ', 'salEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (9, 'JOHN', '   SnoW', 'IT');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE OR REPLACE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50),
    location VARCHAR(100),
       PRIMARY KEY (department_id)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO departments (department_id, department_name, location)
VALUES (101, 'Information Technology', 'New York');

INSERT INTO departments (department_id, department_name, location)
VALUES (102, 'Human Resources', 'Chicago');

INSERT INTO departments (department_id, department_name, location)
VALUES (103, 'Sales', 'San Francisco');

INSERT INTO departments (department_id, department_name, location)
VALUES (104, 'Finance', 'Boston');
```

#### Comparison operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name = 'GEorge ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT
    *
FROM
    employees
WHERE
    COLLATE(first_name, 'en-cs-rtrim') = RTRIM('George');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name = 'SNOW ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
SELECT
 *
FROM
 employees
WHERE
 RTRIM(last_name) = RTRIM('SNOW ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Case 3: CAST NOT CASESPECIFIC column to CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'George   ' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

> **Note:**
>
> COLLATE ‘en-cs’ is required for functional equivalence.

##### Query

```sql
 SELECT
    *
FROM
    employees
WHERE
    COLLATE(first_name, 'en-cs-rtrim') = 'George   ' /*** SSC-FDM-TD0032 - CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'GEorge   ' (NOT CASESPECIFIC) ;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(first_name) = RTRIM('GEorge   ' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |

##### Case 5: CAST NOT CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name (NOT CASESPECIFIC)  = 'George    ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

> **Note:**
>
> It requires COLLATE.

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   COLLATE(first_name /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/, 'en-cs-rtrim') = 'George    ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

#### LIKE operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'George';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE COLLATE(first_name, 'en-cs-rtrim') LIKE 'George';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE RTRIM(last_name) LIKE RTRIM('Snow');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

##### Case 3: CAST NOT CASESPECIFIC column to CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'Mary' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 5 | Mary |  | SaleS |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   COLLATE(first_name, 'en-cs-rtrim') LIKE 'Mary' /*** SSC-FDM-TD0032 - CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 5 | Mary |  | SaleS |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'SNO%' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(last_name) LIKE RTRIM('SNO%' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

#### IN Operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name IN ('George   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

> **Note:**
>
> This case requires `COLLATE(`*`column_name`*`, 'en-cs-rtrim')`

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(first_name) IN (COLLATE('George   ', 'en-cs-rtrim'));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

> **Note:**
>
> For this case, the column does not have a column constraint, but the default constraint in Teradata ANSI mode is `CASESPECIFIC`.

##### Query

```sql
 SELECT *
FROM employees
WHERE department IN ('EngineerinG    ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(department) IN (RTRIM('EngineerinG    '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

#### ORDER BY clause

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
ORDER BY first_name;
```

##### Output

| first_name |
| --- |
| GeorgE |
| GEORGE |
| GEORGE |
| **George** |
| John |
| JOHN |
| JOHN |
| Marco |
| Mary |
| WIlle |

##### Snowflake

> **Warning:**
>
> Please review FDM. ***Pending to add.***

##### Query

```sql
 SELECT
   first_name
FROM
   employees
ORDER BY first_name;
```

##### Output

| first_name |
| --- |
| GeorgE |
| **George** |
| GEORGE |
| GEORGE |
| John |
| JOHN |
| JOHN |
| Marco |
| Mary |
| WIlle |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
ORDER BY last_name;
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| SalEs |
| SaleS |
| Sales |
| salEs |
| sales |

##### Snowflake

##### Query

```sql
 SELECT
   last_name
FROM
   employees
ORDER BY last_name;
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| SalEs |
| SaleS |
| Sales |
| salEs |
| sales |

#### GROUP BY clause

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| Mary |
| GeorgE |
| WIlle |
| **JOHN** |
| Marco |
| GEORGE |

##### Snowflake

> **Warning:**
>
> **The case or order may differ in output.**

> **Note:**
>
> `RTRIM` is required in selected columns.

##### Query

```sql
   SELECT
   first_name
  FROM
   employees
  GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| **John** |
| Marco |
| **George** |
| GeorgE |
| WIlle |
| Mary |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

##### Snowflake

> **Note:**
>
> *The order may differ.*

##### Query

```sql
 SELECT
   last_name
  FROM
   employees
  GROUP BY last_name;
```

##### Output

| first_name |
| --- |
| Snow |
| SNOW |
| SnoW |
|  |
| SnoW |
| snow |

#### HAVING clause

The HAVING clause will use the patterns in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name
HAVING first_name = 'Mary';
```

##### Output

```none
Mary
```

##### Snowflake

##### Query

```sql
 SELECT
  first_name
FROM
  employees
GROUP BY first_name
HAVING
   COLLATE(first_name, 'en-cs-rtrim') = 'Mary';
```

##### Output

```none
Mary
```

#### CASE WHEN statement

The `CASE WHEN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Teradata

##### Query

```sql
 SELECT first_name,
      last_name,
      CASE
          WHEN department = 'EngineerinG' THEN 'Information Technology'
          WHEN first_name = '    GeorgE   ' THEN 'GLOBAL SALES'
          ELSE 'Other'
      END AS department_full_name
FROM employees
WHERE last_name = '';
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | Other |
| Mary |  | Other |
| GeorgE |  | GLOBAL SALES |
| GEORGE |  | Other |

##### Snowflake

##### Query

```sql
    SELECT
   first_name,
   last_name,
   CASE
         WHEN RTRIM(department) = RTRIM('EngineerinG')
            THEN 'Information Technology'
         WHEN COLLATE(first_name, 'en-cs-rtrim')  = '    GeorgE   '
            THEN 'GLOBAL SALES'
       ELSE 'Other'
   END AS department_full_name
FROM
   employees
WHERE RTRIM(last_name) = RTRIM('');
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| Mary |  | Other |
| GEORGE |  | Other |
| GEORGE |  | Other |
| GeorgE |  | GLOBAL SALES |

#### JOIN clause

> **Warning:**
>
> Simple scenarios with evaluation operations are supported.

The `JOIN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT
    e.employee_id,
    e.first_name,
    e.last_name,
    d.department_name
FROM
    employees e
JOIN
    departments d
ON
    e.department = d.department_name;
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 10 | JOHN | snow | Finance |

##### Snowflake

> **Note:**
>
> `d.department_name` is `NOT CASESPECIFIC`, so it requires `COLLATE`.

##### Query

```sql
    SELECT
   e.employee_id,
   e.first_name,
   e.last_name,
   d.department_name
FROM
   employees e
JOIN
   departments d
ON COLLATE(e.department, 'en-cs-rtrim') = d.department_name;
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 10 | JOHN | snow | Finance |

#### Related EWIs

[SSC-EWI-TD0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): GROUP BY IS NOT EQUIVALENT IN TERADATA MODE

[SC-FDM-TD0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) : [NOT] CASESPECIFIC CLAUSE WAS REMOVED

## ANSI Mode For Strings Comparison - NO COLLATE

This section defines the translation specification for a string in ANSI mode without the use of COLLATE.

### Description

#### ANSI mode for string comparison and NO COLLATE usages.

The ANSI mode string comparison without the use of COLLATE will apply RTRIM and UPPER as needed. The default case specification trim behavior may be taken into account, so if a column does not have a case specification in Teradata ANSI mode, Teradata will have as default `CASESPECIFIC`.

### Sample Source Patterns

#### Setup data

##### Teradata

```sql
 CREATE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50) NOT CASESPECIFIC,
    last_name VARCHAR(50) CASESPECIFIC,
    department VARCHAR(50)
);

INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (1, 'George', 'Snow', 'Sales');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (2, 'John', 'SNOW', 'Engineering');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (5, 'Mary', '   ', 'SaleS  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (6, 'GEORGE', '  ', 'sales  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (7, 'GEORGE   ', '  ', 'salEs  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (9, 'JOHN', '   SnoW', 'IT');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50) NOT CASESPECIFIC,
    location VARCHAR(100) CASESPECIFIC,
    PRIMARY KEY (department_id)
);

INSERT INTO departments (department_id, department_name, location) VALUES (101, 'Information Technology', 'New York');
INSERT INTO departments (department_id, department_name, location) VALUES (102, 'Human Resources', 'Chicago');
INSERT INTO departments (department_id, department_name, location) VALUES (103, 'Sales', 'San Francisco');
INSERT INTO departments (department_id, department_name, location) VALUES (104, 'Finance', 'Boston');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    department VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/30/2024",  "domain": "test" }}'
;

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (1, 'George', 'Snow', 'Sales');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (2, 'John', 'SNOW', 'Engineering');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (5, 'Mary', '   ', 'SaleS  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (6, 'GEORGE', '  ', 'sales  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (7, 'GEORGE   ', '  ', 'salEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (9, 'JOHN', '   SnoW', 'IT');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE OR REPLACE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50),
    location VARCHAR(100),
       PRIMARY KEY (department_id)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/30/2024",  "domain": "test" }}'
;

INSERT INTO departments (department_id, department_name, location)
VALUES (101, 'Information Technology', 'New York');

INSERT INTO departments (department_id, department_name, location)
VALUES (102, 'Human Resources', 'Chicago');

INSERT INTO departments (department_id, department_name, location)
VALUES (103, 'Sales', 'San Francisco');

INSERT INTO departments (department_id, department_name, location)
VALUES (104, 'Finance', 'Boston');
```

#### Comparison operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name = 'George      ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT
 *
FROM
employees
WHERE
RTRIM(first_name) = RTRIM('George      ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name = 'SNOW ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
 SELECT
 *
FROM
employees
WHERE
 RTRIM(last_name) = RTRIM('SNOW ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Case 3: CAST NOT CASESPECIFIC column to CASESPECIFIC and database mode is ANSI Mode

> **Warning:**
>
> The (`CASESPECIFIC`) overwrite the column constraint in the table definition.

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'GEorge   ' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT * FROM workers
WHERE RTRIM(first_name) = RTRIM(UPPER('GEorge   '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 6 | GEORGE |  | sales |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT * FROM employees
WHERE last_name = 'SnoW   ' (NOT CASESPECIFIC) ;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

##### Snowflake

##### Query

```sql
 SELECT * FROM employees
WHERE RTRIM(last_name) = RTRIM('SnoW   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

#### LIKE operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'Georg%';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'Georg%';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 3: CAST NOT CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'George' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   first_name ILIKE 'George' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'SNO%' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   last_name LIKE 'SNO%' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |

#### IN Operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name IN ('GEORGE   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE RTRIM(first_name) IN (RTRIM('GEORGE   '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE department IN ('SaleS');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 5 | Mary |  | SaleS |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE RTRIM(department) IN (RTRIM('SaleS'));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 5 | Mary |  | SaleS |

#### ORDER BY clause

> **Note:**
>
> **Notice that this functional equivalence can differ.**

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT department_name
FROM departments
ORDER BY department_name;
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| SalEs |
| SaleS |
| Sales |
| salEs |
| sales |

##### Snowflake

> **Note:**
>
> **Please review FDM. The order differs in the order of insertion of data.**

##### Query

```sql
 SELECT
   department_name
FROM
   departments
ORDER BY
   UPPER(department_name);
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| SalEs |
| SaleS |
| Sales |
| salEs |
| sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
ORDER BY last_name;
```

##### Output

| department |
| --- |
| Finance |
| Human Resources |
| Information Technology |
| Sales |

##### Snowflake

##### Query

```sql
 SELECT last_name
FROM employees
ORDER BY last_name;
```

##### Output

| department |
| --- |
| Finance |
| Human Resources |
| Information Technology |
| Sales |

#### GROUP BY clause

> **Warning:**
>
> **To ensure a functional equivalence, it is required to use the COLLATE expression.**
>
> Please review the [SSC-EWI-TD0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md) for more information.

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| Mary |
| GeorgE |
| WIlle |
| John |
| Marco |
| GEORGE |

##### Snowflake

##### Query

```sql
 SELECT
   first_name
FROM
   employees
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0007 - GROUP BY IS NOT EQUIVALENT IN TERADATA MODE ***/!!!
GROUP BY first_name;
```

##### Output

| FIRST_NAME |
| --- |
| George |
| John |
| WIlle |
| Marco |
| Mary |
| GEORGE |
| GEORGE |
| GeorgE |
| JOHN |
| JOHN |

##### Case 2: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

##### Snowflake

##### Query

```sql
 SELECT
   last_name
FROM
   employees
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0007 - GROUP BY IS NOT EQUIVALENT IN TERADATA MODE ***/!!!
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

#### HAVING clause

The HAVING clause will use the patterns in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name
HAVING first_name = 'GEORGE';
```

##### Output

```none
GEORGE
```

##### Snowflake

##### Query

```sql
 SELECT
   first_name
FROM
   employees
GROUP BY first_name
HAVING
   RTRIM(first_name) = RTRIM('GEORGE');
```

##### Output

```none
GEORGE
```

#### CASE WHEN statement

The `CASE WHEN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Teradata

##### Query

```sql
 SELECT first_name,
      last_name,
      CASE
          WHEN department = 'SaleS  ' THEN 'GLOBAL SALES'
          WHEN first_name = 'GEORGE   ' THEN 'Department Full Name'
          ELSE 'Other'
      END AS department_full_name
FROM employees
WHERE last_name = '   ';
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | Department Full Name |
| Mary |  | GLOBAL SALES |
| GeorgE |  | Other |
| GEORGE |  | Department Full Name |

##### Snowflake

##### Query

```sql
 SELECT
      first_name,
      last_name,
      CASE
            WHEN UPPER(RTRIM(department)) = UPPER(RTRIM('SaleS  '))
                  THEN 'GLOBAL SALES'
            WHEN UPPER(RTRIM(first_name)) = UPPER(RTRIM('GEORGE   '))
                  THEN 'Department Full Name'
          ELSE 'Other'
      END AS department_full_name
FROM
      employees
WHERE
      UPPER(RTRIM( last_name)) = UPPER(RTRIM('   '));
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | Department Full Name |
| Mary |  | GLOBAL SALES |
| GeorgE |  | Other |
| GEORGE |  | Department Full Name |

#### JOIN clause

> **Warning:**
>
> Simple scenarios are supported.

The `JOIN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is CASESPECIFIC and database mode is ANSI Mode

##### Teradata

##### Query

```sql
 SELECT
    e.employee_id,
    e.first_name,
    e.last_name,
    d.department_name
FROM
    employees e
JOIN
    departments d
ON
    e.department = d.department_name;
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 10 | JOHN | snow | Finance |

##### Snowflake

##### Query

```sql
 SELECT
   e.employee_id,
   e.first_name,
   e.last_name,
   d.department_name
FROM
   employees e
JOIN
      departments d
ON RTRIM(e.department) = RTRIM(d.department_name);
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 10 | JOHN | snow | Finance |

### Related EWIs

[SSC-EWI-TD0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): GROUP BY IS NOT EQUIVALENT IN TERADATA MODE

## TERA Mode For Strings Comparison - COLLATE

This section defines the translation specification for string in Tera mode with the use of COLLATE.

### Description

#### Tera Mode for string comparison and COLLATE usage

The Tera Mode string comparison will apply the COLLATE constraint to the columns or statements as required. The default case specification trim behavior may be taken into account. The default case specification in Teradata for TERA mode is `NOT CASESPECIFIC`. Thus, the columns without case specification will have `COLLATE('en-ci')` constraints.

### Sample Source Patterns

#### Setup data

##### Teradata

```sql
 CREATE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50) NOT CASESPECIFIC,
    last_name VARCHAR(50) CASESPECIFIC,
    department VARCHAR(50)
);

INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (1, 'George', 'Snow', 'Sales');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (2, 'John', 'SNOW', 'Engineering');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (5, 'Mary', '   ', 'SaleS  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (6, 'GEORGE', '  ', 'sales  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (7, 'GEORGE   ', '  ', 'salEs  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (9, 'JOHN', '   SnoW', 'IT');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50) NOT CASESPECIFIC,
    location VARCHAR(100) CASESPECIFIC,
    PRIMARY KEY (department_id)
);

INSERT INTO departments (department_id, department_name, location) VALUES (101, 'Information Technology', 'New York');
INSERT INTO departments (department_id, department_name, location) VALUES (102, 'Human Resources', 'Chicago');
INSERT INTO departments (department_id, department_name, location) VALUES (103, 'Sales', 'San Francisco');
INSERT INTO departments (department_id, department_name, location) VALUES (104, 'Finance', 'Boston');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50) COLLATE 'en-ci',
    last_name VARCHAR(50),
    department VARCHAR(50) COLLATE 'en-ci'
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/01/2024",  "domain": "test" }}'
;

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (1, 'George', 'Snow', 'Sales');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (2, 'John', 'SNOW', 'Engineering');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (5, 'Mary', '   ', 'SaleS  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (6, 'GEORGE', '  ', 'sales  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (7, 'GEORGE   ', '  ', 'salEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (9, 'JOHN', '   SnoW', 'IT');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE OR REPLACE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50) COLLATE 'en-ci',
    location VARCHAR(100),
       PRIMARY KEY (department_id)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "11/01/2024",  "domain": "test" }}'
;

INSERT INTO departments (department_id, department_name, location)
VALUES (101, 'Information Technology', 'New York');

INSERT INTO departments (department_id, department_name, location)
VALUES (102, 'Human Resources', 'Chicago');

INSERT INTO departments (department_id, department_name, location)
VALUES (103, 'Sales', 'San Francisco');

INSERT INTO departments (department_id, department_name, location)
VALUES (104, 'Finance', 'Boston');
```

#### Comparison operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name = 'GEorge ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
 *
FROM
 employees
WHERE
 RTRIM(first_name) = RTRIM('GEorge ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name = 'SNOW ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
SELECT
 *
FROM
 employees
WHERE
 RTRIM(last_name) = RTRIM('SNOW ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Case 3: CAST NOT CASESPECIFIC column to CASESPECIFIC and database mode is TERA Mode

> **Note:**
>
> Notice that the following queries
>
> * `SELECT * FROM employees WHERE first_name = 'JOHN ' (CASESPECIFIC)`
> * `SELECT * FROM employees WHERE first_name (CASESPECIFIC) = 'JOHN '`
>
> will return the same values.

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'JOHN   ' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 9 | JOHN | SnoW | IT |
| 10 | JOHN | snow | Finance |

##### Snowflake

##### Query

```sql
 SELECT
    *
FROM
    employees
WHERE
    COLLATE(first_name, 'en-cs-rtrim') = 'JOHN   ' /*** SSC-FDM-TD0032 - CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 9 | JOHN | SnoW | IT |
| 10 | JOHN | snow | Finance |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is TERA Mode

> **Note:**
>
> CAST to a column on the left side of the comparison has priority.
>
> For example:
>
> * `SELECT * FROM employees WHERE last_name (NOT CASESPECIFIC) = 'snoW';` *will return **5 rows.***
> * `SELECT * FROM employees WHERE last_name = 'snoW' (NOT CASESPECIFIC);` *will return **0 rows** with this setup data.*

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE last_name (NOT CASESPECIFIC)  = 'snoW' ;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |
| 4 | Marco | SnoW | EngineerinG |
| 10 | JOHN | snow | Finance |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   COLLATE(last_name /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/, 'en-ci-rtrim') = 'snoW' ;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 2 | John | SNOW | Engineering |
| 3 | WIlle | SNOW | Human resources |
| 4 | Marco | SnoW | EngineerinG |
| 10 | JOHN | snow | Finance |

#### LIKE operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'GeorgE';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(first_name) LIKE RTRIM('GeorgE');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(last_name) LIKE RTRIM('Snow');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 3: CAST NOT CASESPECIFIC column to CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'George' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Snowflake

##### Query

```sql
 SELECT
    *
FROM
    employees
WHERE
    COLLATE(first_name, 'en-cs-rtrim') LIKE 'George';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |

##### Case 4: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'SNO%' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(last_name) LIKE RTRIM('SNO%' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

#### IN Operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name IN ('George   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(first_name) IN (RTRIM('George   '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is not defined and database mode is TERA Mode

> **Note:**
>
> In Tera mode, not defined case specification means `NOT CASESPECIFIC`.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE department IN ('Sales    ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 5 | Mary |  | SaleS |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |
| 8 | GeorgE |  | SalEs |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(department) IN (RTRIM('Sales    '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 5 | Mary |  | SaleS |
| 6 | GEORGE |  | sales |
| 7 | GEORGE |  | salEs |
| 8 | GeorgE |  | SalEs |

##### Case 3: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name IN ('SNOW   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(last_name) IN (RTRIM('SNOW   '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

#### ORDER BY clause

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT employee_id, first_name
FROM employees
ORDER BY employee_id, first_name;
```

##### Output

| employee_id | first_name |
| --- | --- |
| 1 | George |
| 2 | John |
| 3 | WIlle |
| 4 | Marco |
| 5 | Mary |
| 6 | GEORGE |
| 7 | GEORGE |
| 8 | GeorgE |
| 9 | JOHN |
| 10 | JOHN |

##### Snowflake

##### Query

```sql
 SELECT employee_id, first_name
FROM employees
ORDER BY employee_id, first_name;
```

##### Output

| employee_id | first_name |
| --- | --- |
| 1 | George |
| 2 | John |
| 3 | WIlle |
| 4 | Marco |
| 5 | Mary |
| 6 | GEORGE |
| 7 | GEORGE |
| 8 | GeorgE |
| 9 | JOHN |
| 10 | JOHN |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT employee_id, last_name
FROM employees
ORDER BY employee_id, last_name;
```

##### Output

| employee_id | last_name |
| --- | --- |
| 1 | Snow |
| 2 | SNOW |
| 3 | SNOW |
| 4 | SnoW |
| 5 |  |
| 6 |  |
| 7 |  |
| 8 |  |
| 9 | SnoW |
| 10 | snow |

##### Snowflake

##### Query

```sql
 SELECT employee_id, last_name
FROM employees
ORDER BY employee_id, last_name;
```

##### Output

| employee_id | last_name |
| --- | --- |
| 1 | Snow |
| 2 | SNOW |
| 3 | SNOW |
| 4 | SnoW |
| 5 |  |
| 6 |  |
| 7 |  |
| 8 |  |
| 9 | SnoW |
| 10 | snow |

#### GROUP BY clause

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| Mary |
| GeorgE |
| WIlle |
| **JOHN** |
| Marco |
| **GEORGE** |

##### Snowflake

> **Warning:**
>
> Case specification in output may vary depending on the number of columns selected.

##### Query

```sql
 SELECT
   first_name
FROM
   employees
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| **John** |
| Marco |
| **George** |
| GeorgE |
| WIlle |
| Mary |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

##### Snowflake

##### Query

```sql
 SELECT
   last_name
FROM
   employees
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

#### HAVING clause

The HAVING clause will use the patterns in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

> **Note:**
>
> Case specification in output may vary depending on the number of columns selected. This is also related to the `GROUP BY` clause.

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name
HAVING first_name = 'George  ';
```

##### Output

| employee_id | first_name |
| --- | --- |
| 7 | GEORGE |
| 1 | George |
| 6 | GEORGE |

##### Snowflake

##### Query

```sql
 SELECT
  employee_id,
  first_name
FROM
  employees
GROUP BY employee_id, first_name
HAVING
   RTRIM(first_name) = RTRIM('George  ');
```

##### Output

| employee_id | first_name |
| --- | --- |
| 7 | GEORGE |
| 1 | George |
| 6 | GEORGE |

#### CASE WHEN statement

The `CASE WHEN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Teradata

##### Query

```sql
 SELECT first_name,
      last_name,
      CASE
          WHEN department = 'Engineering' THEN 'Information Technology'
          WHEN first_name = 'GeorgE' THEN 'GLOBAL SALES'
          ELSE 'Other'
      END AS department_full_name
FROM employees
WHERE last_name = '';
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | GLOBAL SALES |
| Mary |  | Other |
| GeorgE |  | Other |
| GEORGE |  | GLOBAL SALES |

##### Snowflake

##### Query

```sql
 SELECT
   first_name,
   last_name,
   CASE
      WHEN RTRIM(department) = RTRIM('Engineering')
         THEN 'Information Technology'
      WHEN RTRIM(first_name) = RTRIM('GeorgE')
         THEN 'GLOBAL SALES'
      ELSE 'Other'
   END AS department_full_name
FROM
   employees
WHERE
   RTRIM( last_name) = RTRIM('');
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | GLOBAL SALES |
| Mary |  | Other |
| GeorgE |  | Other |
| GEORGE |  | GLOBAL SALES |

#### JOIN clause

> **Warning:**
>
> Simple scenarios with evaluation operations are supported.

The `JOIN` statement will use the patterns described in:

* Evaluation of comparison operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT
    e.employee_id,
    e.first_name,
    e.last_name,
    d.department_name
FROM
    employees e
JOIN
    departments d
ON
    e.department = d.department_name;
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 3 | WIlle | SNOW | Human Resources |
| 5 | Mary |  | Sales |
| 6 | GEORGE |  | Sales |
| 7 | GEORGE |  | Sales |
| 8 | GeorgE |  | Sales |
| 10 | JOHN | snow | Finance |

##### Snowflake

##### Query

```sql
 SELECT
   e.employee_id,
   e.first_name,
   e.last_name,
   d.department_name
FROM
   employees e
JOIN
   departments d
ON RTRIM(e.department) = RTRIM(d.department_name);
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 3 | WIlle | SNOW | Human Resources |
| 5 | Mary |  | Sales |
| 6 | GEORGE |  | Sales |
| 7 | GEORGE |  | Sales |
| 8 | GeorgE |  | Sales |
| 10 | JOHN | snow | Finance |

### Related EWIs

[SSC-EWI-TD0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): GROUP BY REQUIRED COLLATE FOR CASE INSENSITIVE COLUMNS

[SC-FDM-TD0032](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) : [NOT] CASESPECIFIC CLAUSE WAS REMOVED

## TERA Mode For Strings Comparison - NO COLLATE

This section defines the translation specification for string in Tera mode without using COLLATE.

### Description

#### Tera Mode for string comparison and NO COLLATE usages

The Tera Mode string comparison without the use of COLLATE will apply `RTRIM` and `UPPER` as needed. The default case specification trim behavior may be taken into account.

### Sample Source Patterns

#### Setup data

##### Teradata

```sql
 CREATE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50) NOT CASESPECIFIC,
    last_name VARCHAR(50) CASESPECIFIC,
    department VARCHAR(50)
);

INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (1, 'George', 'Snow', 'Sales');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (2, 'John', 'SNOW', 'Engineering');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (5, 'Mary', '   ', 'SaleS  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (6, 'GEORGE', '  ', 'sales  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (7, 'GEORGE   ', '  ', 'salEs  ');
INSERT INTO employees(employee_id, first_name, last_name, department) VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (9, 'JOHN', '   SnoW', 'IT');
INSERT INTO employees (employee_id, first_name, last_name, department) VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50) NOT CASESPECIFIC,
    location VARCHAR(100) CASESPECIFIC,
    PRIMARY KEY (department_id)
);

INSERT INTO departments (department_id, department_name, location) VALUES (101, 'Information Technology', 'New York');
INSERT INTO departments (department_id, department_name, location) VALUES (102, 'Human Resources', 'Chicago');
INSERT INTO departments (department_id, department_name, location) VALUES (103, 'Sales', 'San Francisco');
INSERT INTO departments (department_id, department_name, location) VALUES (104, 'Finance', 'Boston');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE employees (
    employee_id INTEGER NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    department VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/30/2024",  "domain": "test" }}'
;

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (1, 'George', 'Snow', 'Sales');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (2, 'John', 'SNOW', 'Engineering');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (3, 'WIlle', 'SNOW', 'Human resources   ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (4, 'Marco', 'SnoW   ', 'EngineerinG');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (5, 'Mary', '   ', 'SaleS  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (6, 'GEORGE', '  ', 'sales  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (7, 'GEORGE   ', '  ', 'salEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (8, '    GeorgE   ', '  ', 'SalEs  ');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (9, 'JOHN', '   SnoW', 'IT');

INSERT INTO employees (employee_id, first_name, last_name, department)
VALUES (10, 'JOHN    ', 'snow', 'Finance   ');

CREATE OR REPLACE TABLE departments (
    department_id INTEGER NOT NULL,
    department_name VARCHAR(50),
    location VARCHAR(100),
       PRIMARY KEY (department_id)
   )
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/30/2024",  "domain": "test" }}'
;

INSERT INTO departments (department_id, department_name, location)
VALUES (101, 'Information Technology', 'New York');

INSERT INTO departments (department_id, department_name, location)
VALUES (102, 'Human Resources', 'Chicago');

INSERT INTO departments (department_id, department_name, location)
VALUES (103, 'Sales', 'San Francisco');

INSERT INTO departments (department_id, department_name, location)
VALUES (104, 'Finance', 'Boston');
```

#### Comparison operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

This example demonstrates the usage of a column set up as `NOT CASESPECIFIC` as it is a `first_name` column. Even when asking for the string `'GEorge',` the query execution will retrieve results in Teradata because the case specification is not considered.

To emulate this scenario in Snowflake, there are implemented two functions: `RTRIM(UPPER(string_evaluation))`, `UPPER` is required in this scenario because the string does not review the case specification.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name = 'GEorge ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
 *
FROM
 employees
WHERE
 RTRIM(UPPER(first_name)) = RTRIM(UPPER('GEorge '));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

For this example, the column constraint is `CASESPECIFIC`, for which the example does not retrieve rows in Teradata because ‘`Snow`’ is not equal to ‘`SNOW`’.

In Snowflake, the resulting migration points only to the use of the `RTRIM` function since the case specification is important.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name = 'SNOW ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Snowflake

##### Query

```sql
SELECT
 *
FROM
 employees
WHERE
 RTRIM(last_name) = RTRIM('SNOW ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 3 | WIlle | SNOW | Human resources |
| 2 | John | SNOW | Engineering |

##### Case 3: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

> **Warning:**
>
> The (`CASESPECIFIC`) overrides the column constraint in the table definition.

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'GEORGE   ' (CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 6 | GEORGE |  | sales |

##### Snowflake

> **Note:**
>
> RTRIM is required on the left side, and RTRIM is required on the right side.

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(first_name) = RTRIM('GEORGE   ' /*** SSC-FDM-TD0032 - CASESPECIFIC CLAUSE WAS REMOVED ***/);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 6 | GEORGE |  | sales |

##### Case 4: CAST NOT CASESPECIFIC column to NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT * FROM employees WHERE first_name = 'GEorge   ' (NOT CASESPECIFIC) ;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   UPPER(RTRIM(first_name)) = UPPER(RTRIM('GEorge   ' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 5: Blank spaces case. Column constraint is NOT CASESPECIFIC, database mode is TERA Mode, and using equal operation

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name = '   ';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 5 | Mary |  | SaleS |
| 8 | GeorgE |  | SalEs |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   RTRIM(last_name) = RTRIM('   ');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 5 | Mary |  | SaleS |
| 8 | GeorgE |  | SalEs |
| 6 | GEORGE |  | sales |

#### LIKE operation

> **Note:**
>
> This operation works differently from another one. Blank spaces must be the same quantity to retrieve information.

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

This example is expected to display one row because the case specification is not relevant.

> **Note:**
>
> In Snowflake, the migration uses the [ILIKE](https://docs.snowflake.com/en/sql-reference/functions/ilike) operation. This performs a case-insensitive comparison.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'GeorgE';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name ILIKE 'GeorgE';
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| first_name | last_name | department |
| --- | --- | --- |
| George | Snow | Sales |
| Jonh | Snow | Engineering |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name LIKE 'Snow';
```

##### Output

| first_name | last_name | department |
| --- | --- | --- |
| George | Snow | Sales |
| Jonh | Snow | Engineering |

##### Case 3: CAST CASESPECIFIC column to NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'George' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   first_name ILIKE 'George' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 4: CAST NOT CASESPECIFIC column to NOT CASESPECIFIC and database mode is ANSI Mode

> **Note:**
>
> This case requires the translation to `ILIKE`.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name LIKE 'GE%' (NOT CASESPECIFIC);
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT
   *
FROM
   employees
WHERE
   first_name ILIKE 'GE%' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

#### IN Operation

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE first_name IN ('GeorgE');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE RTRIM(UPPER(first_name)) IN (RTRIM(UPPER('GeorgE')));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 7 | GEORGE |  | salEs |
| 1 | George | Snow | Sales |
| 6 | GEORGE |  | sales |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

For this example, the usage of the UPPER function is not required since, in the Teradata database, the case specification is relevant to the results.

##### Teradata

##### Query

```sql
 SELECT *
FROM employees
WHERE last_name IN ('SnoW');
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

##### Snowflake

##### Query

```sql
 SELECT *
FROM employees
WHERE RTRIM(last_name) IN (RTRIM('SnoW'));
```

##### Output

| employee_id | first_name | last_name | department |
| --- | --- | --- | --- |
| 4 | Marco | SnoW | EngineerinG |

#### ORDER BY clause

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

> **Danger:**
>
> **Notice that this output order can differ.**

##### Teradata

##### Query

```sql
 SELECT department
FROM employees
ORDER BY department;
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| sales |
| SalEs |
| Sales |
| SaleS |
| salEs |

##### Snowflake

##### Query

```sql
 SELECT department
FROM employees
ORDER BY UPPER(department);
```

##### Output

| department |
| --- |
| EngineerinG |
| Engineering |
| Finance |
| Human resources |
| IT |
| sales |
| SalEs |
| Sales |
| SaleS |
| salEs |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

> **Danger:**
>
> **Notice that this output can differ in order.**

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
ORDER BY last_name;
```

##### Output

| last_name |
| --- |
|  |
|  |
|  |
|  |
| SnoW |
| SNOW |
| SNOW |
| SnoW |
| Snow |
| snow |

##### Snowflake

##### Query

```sql
 SELECT last_name
FROM employees
ORDER BY last_name;
```

##### Output

| last_name |
| --- |
|  |
|  |
|  |
|  |
| SnoW |
| SNOW |
| SNOW |
| SnoW |
| Snow |
| snow |

#### GROUP BY clause

> **Warning:**
>
> **Notice that this output can differ. To ensure a functional equivalence, it is required to use the COLLATE expression.**
>
> Please review the SSC-EWI-TD0007 for more information.
>
> *The following might be a workaround without `collate`:*
>
> `SELECT RTRIM(UPPER(first_name))`
>
> `FROM employees`
>
> `GROUP BY RTRIM(UPPER(first_name));`

**About the column behavior**

> **Danger:**
>
> Please review the insertion of data in Snowflake. Snowflake does allow the insertion of values as ‘`GEORGE`’ and ‘`georges`’ without showing errors because the case specification is not bound explicitly with the column.

Assume a table and data as follows:

```sql
 CREATE TABLE students (
   first_name VARCHAR(50) NOT CASESPECIFIC
);

INSERT INTO students(first_name) VALUES ('George');
INSERT INTO students(first_name) VALUES ('   George');
```

Notice that this sample does not allow inserting values with upper and lower case letters in the `NOT CASESPECIFIC` column because it takes it as the same value. Because the column does not supervise the case specification, the ‘GEORGE’ and ‘george’ values are checked as the same information.

The following rows are taken as ***duplicated row errors***:

```sql
 INSERT INTO students(first_name) VALUES ('GEORGE');
INSERT INTO students(first_name) VALUES ('GeorGe');
INSERT INTO students(first_name) VALUES ('George  ');
INSERT INTO students(first_name) VALUES ('GeOrge');
INSERT INTO students(first_name) VALUES ('GEorge');
INSERT INTO students(first_name) VALUES ('George');
```

##### Case 1: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT first_name
FROM employees
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| Mary |
| GeorgE |
| WIlle |
| JOHN |
| Marco |
| GEORGE |

##### Snowflake

##### Query

```sql
 SELECT
   first_name
FROM
   employees
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0007 - GROUP BY IS NOT EQUIVALENT IN TERADATA MODE ***/!!!
GROUP BY first_name;
```

##### Output

| first_name |
| --- |
| George |
| John |
| WIlle |
| Marco |
| Mary |
| GEORGE |
| GEORGE |
| GeorgE |
| JOHN |
| JOHN |

##### Case 2: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
|  |
| SNOW |
| SnoW |
| Snow |
| snow |

##### Snowflake

##### Query

```sql
 SELECT
   last_name
FROM
   employees
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0007 - GROUP BY IS NOT EQUIVALENT IN TERADATA MODE ***/!!!
GROUP BY last_name;
```

##### Output

| last_name |
| --- |
| SnoW |
| SNOW |
| SnoW |
|  |
|  |
| Snow |
| snow |

#### HAVING clause

The HAVING clause will use the patterns in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name
HAVING last_name = 'Snow';
```

##### Output

| last_name |
| --- |
| Snow |

##### Snowflake

##### Query

```sql
 SELECT last_name
FROM employees
GROUP BY last_name
HAVING RTRIM(last_name) = RTRIM('Snow');
```

##### Output

| last_name |
| --- |
| Snow |

#### CASE WHEN statement

The `CASE WHEN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Teradata

##### Query

```sql
 SELECT first_name,
      last_name,
      CASE
          WHEN department = 'EngineerinG' THEN 'Information Technology'
          WHEN last_name = 'SNOW' THEN 'GLOBAL COOL SALES'
          ELSE 'Other'
      END AS department_full_name
FROM employees;
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | Other |
| JOHN | SnoW | Other |
| Mary |  | Other |
| JOHN | snow | Other |
| WIlle | SNOW | GLOBAL COOL SALES |
| George | Snow | Other |
| GeorgE |  | Other |
| GEORGE |  | Other |
| Marco | SnoW | Information Technology |
| John | SNOW | Information Technology |

##### Snowflake

##### Query

```sql
 SELECT
   first_name,
   last_name,
   CASE
      WHEN UPPER(RTRIM(department)) = UPPER(RTRIM('EngineerinG'))
         THEN 'Information Technology'
      WHEN RTRIM(last_name) = RTRIM('SNOW')
         THEN 'GLOBAL COOL SALES'
      ELSE 'Other'
   END AS department_full_name
FROM
   employees;
```

##### Output

| first_name | last_name | department_full_name |
| --- | --- | --- |
| GEORGE |  | Other |
| JOHN | SnoW | Other |
| Mary |  | Other |
| JOHN | snow | Other |
| WIlle | SNOW | GLOBAL COOL SALES |
| George | Snow | Other |
| GeorgE |  | Other |
| GEORGE |  | Other |
| Marco | SnoW | Information Technology |
| John | SNOW | Information Technology |

#### JOIN clause

> **Warning:**
>
> Simple scenarios are supported.

The `JOIN` statement will use the patterns described in:

* Evaluation operations.

  + For example: `=, !=, <, >.`
* LIKE operation.
* IN Operation.
* CAST to evaluation operation.
* CAST to LIKE operation.

The following sample showcases a pattern with evaluation operation.

##### Sample: Column constraint is NOT CASESPECIFIC and database mode is TERA Mode

##### Teradata

##### Query

```sql
 SELECT
    e.employee_id,
    e.first_name,
    e.last_name,
    d.department_name
FROM
    employees e
JOIN
    departments d
ON
    e.department = d.department_name;
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 3 | WIlle | SNOW | Human Resources |
| 5 | Mary |  | Sales |
| 6 | GEORGE |  | Sales |
| 7 | GEORGE |  | Sales |
| 8 | GeorgE |  | Sales |
| 10 | JOHN | snow | Finance |

##### Snowflake

##### Query

```sql
 SELECT
   e.employee_id,
   e.first_name,
   e.last_name,
   d.department_name
FROM
   employees e
JOIN
   departments d
ON UPPER(RTRIM(e.department)) = UPPER(RTRIM(d.department_name));
```

##### Output

| employee_id | first_name | last_name | department_name |
| --- | --- | --- | --- |
| 1 | George | Snow | Sales |
| 3 | WIlle | SNOW | Human Resources |
| 5 | Mary |  | Sales |
| 6 | GEORGE |  | Sales |
| 7 | GEORGE |  | Sales |
| 8 | GeorgE |  | Sales |
| 10 | JOHN | snow | Finance |

### Known Issues

1. there are some mode-specific SQL statement restrictions: `BEGIN TRANSACTION`, `END TRANSACTION`, `COMMIT [WORK]`.
2. Data insertion may differ in Snowflake since the case specification is not bound to the column declaration.
3. `GROUP BY` may differ in order, but group the correct values.
4. `ORDER BY` behaves differently in Snowflake.
5. If a function has a TRIM() from the source code, this workaround will add the required functions to the source code. So, RTRIM will be applied to the TRIM() source function.

### Related EWIs

[SSC-EWI-TD0007](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): GROUP BY IS NOT EQUIVALENT IN TERADATA MODE

---
title: SnowConvert AI - Teradata - SnowConvert AI Procedures Helpers
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/helpers-for-procedures.md
section: Migrations
---

# SnowConvert AI - Teradata - SnowConvert AI Procedures Helpers

In this section you will find the helper functions used inside procedures that are used to achieve functional equivalence of some Teradata features that are not supported natively in Snowflake.

## Cursor Helper

This section describes the usage of different functions to achieve functional equivalence for Teradata cursors in JavaScript.

The cursor helper is a function that contains the main four actions that Teradata cursors perform such as Open, Fetch, Next, and Close.

* *CURSOR(),* the main routine which declares the needed variables and other sub-routines.
* *OPEN(),* opens the cursor executing the given statement, and updates the necessary variables.
* *NEXT(),* moves the cursor to the next row (if any) of the statement and sets every column value to the current row.
* *FETCH(),* obtains the values (if any) from the response of the statement executed.
* *CLOSE(),* removes the temporary table from the _OUTQUERIES (if it was added in the EXEC helper) and unsets the necessary variables.

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Cursor Sample Usage

Teradata

```sql
 -- Additional Params: -t JavaScript
Replace procedure procedure1()
dynamic result sets 2
begin

    -------- Local variables --------
    declare sql_cmd varchar(20000) default ' ';
    declare num_cols integer;

    ------- Declare cursor with return only-------
    declare resultset cursor with return only for firststatement;

    ------- Declare cursor -------
    declare cur2 cursor for select count(columnname) from table1;

    -------- Set --------
    set sql_cmd='sel * from table1';

    -------- Prepare cursor --------
    prepare firststatement from sql_cmd;

    -------- Open cursors --------
    open resultset;
    open cur1;

    -------- Fetch -------------
    fetch cur1 into val1, val2;

    -------- Close cursor --------
    close cur1;
end;
```

Snowflake output

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    //------ Local variables --------
    var SQL_CMD = ` `;
    var NUM_COLS;
    var RESULTSET = new CURSOR(() => FIRSTSTATEMENT,[],true);
    //----- Declare cursor -------
    var CUR2 = new CURSOR(`SELECT
   COUNT(columnname)
from
   table1`,[],false);
    //------ Set --------
    SQL_CMD = `SELECT
   * from
   table1`;
    //------ Prepare cursor --------
    var FIRSTSTATEMENT = SQL_CMD;
    //------ Open cursors --------
    RESULTSET.OPEN();
    CUR1.OPEN();

        //------ Fetch -------------
    CUR1.FETCH() && ([val1,val2] = CUR1.INTO());
    //------ Close cursor --------
    CUR1.CLOSE();
    return PROCRESULTS();
$$;
```

#### Cursor Helper Function Definition

```javascript
 var CURSOR = function (stmt,binds,withReturn) {
	   var rs, rows, row_count, opened = false, resultsetTable = '', self = this;
	   this.CURRENT = new Object;
	   this.INTO = function () {
	         return self.res;
	      };
	   this.OPEN = function (usingParams) {
	         try {
	            if (usingParams) binds = usingParams;
	            if (binds instanceof Function) binds = binds();
	            var finalBinds = binds && binds.map(fixBind);
	            var finalStmt = stmt instanceof Function ? stmt() : stmt;
	            if (withReturn) {
	               resultsetTable = EXEC(finalStmt,finalBinds,true,null,{
	                     temp : true
	                  });
	               finalStmt = `SELECT * FROM TABLE(RESULT_SCAN('${resultsetTable}'))`;
	               finalBinds = [];
	            }
	            rs = snowflake.createStatement({
	                  sqlText : finalStmt,
	                  binds : finalBinds
	               });
	            rows = rs.execute();
	            row_count = rs.getRowCount();
	            ACTIVITY_COUNT = rs.getRowCount();
	            opened = true;
	            return this;
	         } catch(error) {
	            ERROR_HANDLERS && ERROR_HANDLERS(error);
	         }
	      };
	   this.NEXT = function () {
	         if (row_count && rows.next()) {
	            this.CURRENT = new Object;
	            for(let i = 1;i <= rs.getColumnCount();i++) {
	               (this.CURRENT)[rs.getColumnName(i)] = rows.getColumnValue(i);
	            }
	            return true;
	         } else return false;
	      };
	   this.FETCH = function () {
	         self.res = [];
	         self.res = fetch(row_count,rows,rs);
	         if (opened) if (self.res.length > 0) {
	            SQLCODE = 0;
	            SQLSTATE = '00000';
	         } else {
	            SQLCODE = 7362;
	            SQLSTATE = '02000';
	            var fetchError = new Error('There are not rows in the response');
	            fetchError.code = SQLCODE;
	            fetchError.state = SQLSTATE;
	            if (ERROR_HANDLERS) ERROR_HANDLERS(fetchError);
	         } else {
	            SQLCODE = 7631;
	            SQLSTATE = '24501';
	         }
	         return self.res && self.res.length > 0;
	      };
	   this.CLOSE = function () {
	         if (withReturn && _OUTQUERIES.includes(resultsetTable)) {
	            _OUTQUERIES.splice(_OUTQUERIES.indexOf(resultsetTable),1);
	         }
	         rs = rows = row_count = undefined;
	         opened = false;
	         resultsetTable = '';
	      };
	};
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Exec Helper

The exec helper is a function used to execute SQL statements in procedures.

### Syntax

EXEC(stmt)
EXEC(stmt, binds)
EXEC(stmt, binds, noCatch)
EXEC(stmt, binds, noCatch, catchFunction)
EXEC(stmt, binds, noCatch, catchFunction, opts)

### Parameters

#### stmt

The string of the SQL statement to execute.

#### binds (optional)

An array with the values or the variables to bind into the SQL statement.

#### NoCatch (optional)

Boolean to know if an error should not be catched.

#### catchFunction (optional)

A function to execute in case an error occurs during the execution of the exec function.

#### opts (optional)

A JSON object ({ temp : true }) to know if the query ID should be returned.

### FixBind And FormatDate Functions

The Exec helper uses a function defined in the helpers called FixBind. This function uses the FormatDate function when it encounters that one of the binding variables is a date type, this is done to manage properly the date types in Snowflake.
Both functions are defined as below.

```javascript
 var formatDate = (arg) => (new Date(arg - (arg.getTimezoneOffset() * 60000))).toISOString().slice(0,-1);
	var fixBind = function (arg) {
	   arg = arg == undefined ? null : arg instanceof Date ? formatDate(arg) : arg;
	   return arg;
	};
```

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

#### Exec Usage Sample

Teradata

```sql
 -- Additional Params: -t JavaScript
REPLACE PROCEDURE ProcedureSample ()
BEGIN

case value
when 0 then
  select * from table1
else
  update table1 set name = "SpecificValue" where id = value;
end case

END;
```

Snowflake output

```javascript
 CREATE OR REPLACE PROCEDURE ProcedureSample ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  switch(value) {
    case 0:EXEC(`SELECT * from table1`,[]);
    break;
    default:EXEC(`UPDATE table1
    set
      name = "SpecificValue"
    where
      id = value`,[]);
    break;
  }
$$;
```

#### Exec Helper Definition

```javascript
 var EXEC = function (stmt,binds,noCatch,catchFunction,opts) {
	   try {
	      binds = binds ? binds.map(fixBind) : binds;
	      _RS = snowflake.createStatement({
	            sqlText : stmt,
	            binds : binds
	         });
	      _ROWS = _RS.execute();
	      ROW_COUNT = _RS.getRowCount();
	      ACTIVITY_COUNT = _RS.getNumRowsAffected();
	      HANDLE_NOTFOUND && HANDLE_NOTFOUND(_RS);
	      if (INTO) return {
	         INTO : function () {
	            return INTO();
	         }
	      };
			  if (_OUTQUERIES.length < DYNAMIC_RESULTS) _OUTQUERIES.push(_ROWS.getQueryId());
	      if (opts && opts.temp) return _ROWS.getQueryId();
	   } catch(error) {
	      MESSAGE_TEXT = error.message;
	      SQLCODE = error.code;
	      SQLSTATE = error.state;
	      var msg = `ERROR CODE: ${SQLCODE} SQLSTATE: ${SQLSTATE} MESSAGE: ${MESSAGE_TEXT}`;
	      if (catchFunction) catchFunction(error);
	      if (!noCatch && ERROR_HANDLERS) ERROR_HANDLERS(error); else throw new Error(msg);
	   }
	};
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Functional Equivalence Helpers

A list of helpers functions in JavaScript that procedures in Snowflake can use, in order to better support several Teradata language features.

Depending on what is in each Stored Procedure in Teradata, SnowConvert AI will create one or more of the following javascript functions inside them.

### ***CompareDates***

A function that compares dates handling nullity. In Javascript, it is needed to call *.getTime()* for date comparisons.

```javascript
 var CompareDates = function(value1, value2) {
	var value1Time = value1 && value1.getTime() || null;
	var value2Time = value2 && value2.getTime() || null;
	if (value1Time == null && value2Time == null) return null; /*in SQL null == null is equal to null as well as any other comparison */
	return value1Time > value2Time? 1 : value1Time<value2Time? -1 : 0;
}
```

### ***BetweenFunc***

A function to handle the *BETWEEN* statement in Teradata.

```javascript
 var BetweenFunc = function (expression,startExpr,endExpr) {
	if ([expression,startExpr,endExpr].some((arg) => arg == null)) {
		  return false;
	}
	return expression >= startExpr && expression <= endExpr;
};
```

### ***LikeFunction()***

A function to handle the *LIKE* statement in Teradata.

```javascript
 var likeFunction = function (leftExpr,rightExpr) {
	RegExp.escape = function (text) {
		if (!arguments.callee.sRE) {
			var specials = ['/','.','*','+','?','|','(',')','[',']','{','}','\\'];
			arguments.callee.sRE = new RegExp('(\\' + specials.join('|\\') + ')','g');
		}
		return text.replace(arguments.callee.sRE,'\\$1');
	}
	var likeExpr = RegExp.escape(rightExpr);
	var likeResult = new RegExp(likeExpr.replace('%','.*').replace('_','.')).exec(leftExpr) != null;
	return likeResult;
};
```

### ***ERROR_HANDLERS()***

The main error-handling routine.

```javascript
 var continue_handler_1 = function (error) {
   {
	  V_SQL_VALUE = SQLSTATE;
	  V_EXCEPTION_FLAG = `Y`;
   }
};

// Main error-handling routine
var ERROR_HANDLERS = function (error) {
   switch(error.state) {
	  //Conversion Warning - handlers for the switch default (SQLWARNING/SQLEXCEPTION/NOT FOUND) can be the following
	  default:continue_handler_1(error);
   }
};
```

### ***INSERT_TEMP***

> **Warning:**
>
> ***This helper has been deprecated in stored procedures since version 2.0.15.***

A function to create a temporary table using the argument *query* with the given *parameters*.

```javascript
 var procname = `PUBLIC.Procedure1`;
var temptable_prefix, tablelist = [];
var INSERT_TEMP = function (query,parameters) {
		if (!temptable_prefix) {
	  		var sql_stmt = `select current_session() || '_' || to_varchar(current_timestamp, 'yyyymmddhh24missss')`;
	      var rs = snowflake.createStatement({
	         sqlText : sql_stmt,
	         binds : []
	      }).execute();
	      temptable_prefix = rs.next() && (procname + '_TEMP_' + rs.getColumnValue(1) + '_');
	  }
	  var tablename = temptable_prefix + tablelist.length;
	  tablelist.push(tablename);
	  var sql_stmt = `CREATE OR REPLACE TEMPORARY TABLE ${tablename} AS ${query}`;
	  snowflake.execute({
	  		sqlText : sql_stmt,
	      binds : parameters
	  });
	  return tablename;
};
```

### ***IS_NOT_FOUND()***

A function that validates when a SELECT returns no values or a sentence affects zero rows. This is done in order to emulate the same behavior as Teradata, when there are exits or continue handlers for NOT FOUND EXCEPTIONS.

```javascript
 let IS_NOT_FOUND = (stmt) => {
	   let n = -1;
	   let cmd = stmt.getSqlText().replace(new RegExp("\\/\\*.*\\*\\/","gsi"),"").replace(new RegExp("--.*?\\n","gsi"),"");
	   let matched = cmd.match(new RegExp("\\s*(\\w+)\\s+"),"");
	   if (matched) {
	      cmd = matched[1].toUpperCase();
	      switch(cmd) {
		       case "CALL":
	         case "DROP":
	         case "CREATE":
	         case "ALTER":
	         case "SELECT":
								n = stmt.getRowCount();
	         break;
	         default:n = stmt.getNumRowsAffected();
	         break;
	      }
	   }
	   return n == 0;
	};
```

### ***HANDLE_NOTFOUND()***

This function uses the above *IS_NOT*_FOUND function to validate when an artificial error ‘NOT FOUND’ is being thrown.

```javascript
  	let HANDLE_NOTFOUND = (stmt) => {
	   if (IS_NOT_FOUND(stmt) && (error = new Error('NOT_FOUND')) && (NOT_FOUND = true) && ([error.code,error.state] = ['020000','020000'])) throw error;
	};
```

### PROCRESULTS()

A function that takes zero or multiple output parameters and binds them with the _OUTQUERIES in an array in order to be returned.

```javascript
 let PROCRESULTS = (...OUTPARAMS) => JSON.stringify([...OUTPARAMS,[..._OUTQUERIES]]);
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## Into Helper

The into function is used to extract the resulting rows from a subquery or from a select into statement.

### Fetch Function

The INTO helper uses a fetch function to get the row from a resulting query. The definition of the Fetch Function is described below.

```javascript
 var fetch = (count,rows,stmt) =>
(count && rows.next() && Array.apply(null,Array(stmt.getColumnCount())).map((_,i)
=> rows.getColumnValue(i + 1))) || [];
```

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Into Sample Usage

Teradata

```sql
 -- Additional Params: -t JavaScript
REPLACE PROCEDURE SubQuerypoc ()
BEGIN

DECLARE monat INTEGER;
SET monat      = (SELECT column1
             FROM table1);
END;
```

Snowflake output

```sql
CREATE OR REPLACE PROCEDURE SubQuerypoc ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var MONAT;
    EXEC(`(SELECT column1 FROM table1)`,[]);
    var subQueryVariable0;
    [subQueryVariable0] = INTO();
    MONAT = subQueryVariable0;
$$;
```

### Into Helper function Definition

```javascript
 var INTO = () => fetch(ROW_COUNT,_ROWS,_RS);
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - SnowConvert AI Scripts Helpers
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/snowconvert-script-helpers.md
section: Migrations
---

# SnowConvert AI - Teradata - SnowConvert AI Scripts Helpers

SnowConvert AI Helpers is a set of classes with functions designed to facilitate the conversion of Teradata script files to Python files that Snowflake can interpret.

SnowConvert AI for Teradata can take in any Teradata SQL or scripts (BTEQ, FastLoad, MultiLoad, and TPump) and convert them to functionally equivalent Snowflake SQL, JavaScript embedded in Snowflake SQL, and Python. Any output Python code from SnowConvert AI will call functions from these helper classes to complete the conversion and create a functionally equivalent output in Snowflake.

The [Snowflake Connector for Python](https://pypi.org/project/snowflake-connector-python/) will also be called to connect to your Snowflake account and run the output Python code created by SnowConvert.

For the latest version information, see the [snowconvert-helpers PyPI page](https://pypi.org/project/snowconvert-helpers/).

> **Note:**
>
> The Python package`snowconvert-helpers` supports Python versions 3.6, 3.7, 3.8, and 3.9.

## Script Migration

### Source

Suppose you have the following BTEQ code to be migrated.

```sql
 insert into table1 values(1, 2);
insert into table1 values(3, 4);
insert into table1 values(5, 6);
```

### Output

You should get an output like the one below.

> **Note:**
>
> The `log_on`function parameters (‘user’, ‘password’, ‘account’, ‘database’, ‘warehouse’, ‘role’, ‘token’) should be defined by the user.

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    INSERT INTO table1
    VALUES (1, 2)
    """)
  exec("""
    INSERT INTO table1
    VALUES (3, 4)
    """)
  exec("""
    INSERT INTO table1
    VALUES (5, 6)
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

## Getting Started

To install the package, you should run the following command in your python environment. If you’re not familiar with installing packages in Python, visit the following page on python packages (<https://packaging.python.org/tutorials/installing-packages/>).

```bash
 pip install snowconvert-helpers
```

Once your package is installed, you will be able to run the script migrated code in Python.

## Run the code

To run the migrated code, you just have to open the `Command Prompt` or the `Terminal` and execute the following command.

```python
 python sample_BTEQ.py
```

If the script has no errors, you will get in your console an output like the one below.

```bash
 Executing: INSERT INTO PUBLIC.table1 VALUES (1, 2).
Printing Result Set:
number of rows inserted
1

Executing: INSERT INTO PUBLIC.table1 VALUES (3, 4).
Printing Result Set:
number of rows inserted
1

Executing: INSERT INTO PUBLIC.table1 VALUES (5, 6).
Printing Result Set:
number of rows inserted
1

Error Code 0
Script done >>>>>>>>>>>>>>>>>>>>
Error Code 0
```

### Passing connection parameters

There are several ways to pass the connection parameters to the connection of the database:

* As parameters in the function call snowconvert.helpers.log_on inside the python file.
* As positional parameters with the specific order of user, password, account, database, warehouse, and role when the python is being executed from the command line.
* As named parameters with no order restriction of SNOW_USER, SNOW_PASSWORD, SNOW_ACCOUNT, SNOW_DATABASE, SNOW_WAREHOUSE, SNOW_ROLE, SNOW_QUERYTAG, SNOWAUTHENTICATOR and SNOWTOKEN when the python is being executed from the command line and any of them are passed like –param-VARNAME=VALUE.
* As environment variables named SNOW_USER, SNOW_PASSWORD, SNOW_ACCOUNT, SNOW_DATABASE, SNOW_WAREHOUSE, SNOW_ROLE, SNOW_QUERYTAG, SNOWAUTHENTICATOR and SNOWTOKEN before python execution.

The previous order specified is the way to determine the precedence.

#### Parameters in the function call

They can be set as positional parameters in the function call as follows.

```python
    .....
   con = snowconvert.helpers.log_on(
     'myuser',
     'mypassword',
     'myaccount',
     'mydatabase',
     'mywarehouse',
     'myrole',
     5,
     'myauthenticator',
     'mytoken')
   .....
```

Or they can be set any of the named parameters in any order in the function call as follows.

```python
    .....
   con = snowconvert.helpers.log_on(
     account:'myaccount',
     password:'mypassword',
     user:'myuser',
     warehouse:'mywarehouse',
     login_timeout:5,
     authenticator:'myauthenticator',
     token:'mytoken')
   .....
```

#### Positional parameters

They need to be set in the specific order in the command line as follows.

```python
 python sample_BTEQ.py myuser mypassword myaccount mydatabase mywarehouse myrole myauthenticator mytokenr
```

Or they can be set only some of the parameters but always starting with the user parameter as follows.

```python
 python sample_BTEQ.py myuser mypassword myaccount
```

#### Named parameters

They can be set any of the named parameters in any order in the command line as follows (use a single line, multiline shown for readability reasons).

```none
python sample_BTEQ.py --param-SNOW_WAREHOUSE=mywarehouse
  --param-SNOW_ROLE=myrole
  --param-SNOW_PASSWORD=mypassword
  --param-SNOW_USER=myuser
  --param-SNOW_QUERYTAG=mytag
  --param-SNOW_ACCOUNT=myaccount
  --param-SNOW_DATABASE=mydatabase
  --param-SNOW_AUTHENTICATOR=myauthenticator
  --param-SNOW_TOKEN=mytoken
  --param-PRIVATE_KEY_PATH=myprivatekey
  --param-PRIVATE_KEY_PASSWORD=myprivatekeypassword
```

#### Environment variables

Before calling the python script, any of the following environment variables can be set:

* SNOW_USER
* SNOW_PASSWORD
* SNOW_ACCOUNT
* SNOW_DATABASE
* SNOW_WAREHOUSE
* SNOW_ROLE
* SNOW_QUERYTAG
* SNOW_AUTHENTICATOR
* SNOW_TOKEN
* PRIVATE_KEY_PATH
* PRIVATE_KEY_PASSWORD

#### Key Pair Authentication

The `log_on` function can also support the key pair authentication process. Review the following [Snowflake documentation](https://docs.snowflake.com/en/user-guide/key-pair-auth) for more information about the key creation . Please notice the required parameters:

`log_on(`

`user='YOUR_USER',`

`account='YOUR_ACCOUNT',`

`role = 'YOUR_ROLE',`

`warehouse = 'YOUR_WAREHOUSE',`

`database = 'YOUR_DATABASE',`

`private_key_path='/YOUR_PATH/rsa_key.p8',`

`private_key_password='YOUR_PASSWORD')`

### Example of passing environment variables

Here is an example of using SNOW_AUTHENTICATOR, SNOW_USER and SNOW_PASSWORD. They must be defined before running the output python file and then run the python generated file.

#### Windows

```shell
 SET SNOW_AUTHENTICATOR=VALUE
SET SNOW_USER=myuser
SET SNOW_PASSWORD=mypassword
python sample_BTEQ.py
```

##### Linux/Mac

```bash
 export SNOW_AUTHENTICATOR=VALUE
export SNOW_USER=myuser
export SNOW_PASSWORD=mypassword
python3 sample_BTEQ.py
```

### Enabling Logging

To enable logging, you should enable an environment variable called SNOW_LOGGING set as true.

Then, if you want to customize the logging configuration you can pass a parameter to the `snowconvert.helpers.configure_log()` method like this:

```python
 snowconvert.helpers.configure_log("SOMEPATH.conf")
```

The configuration file should contain the next structure. For more information, see the [Python logging configuration documentation](https://docs.python.org/es/3/library/logging.config.html)

```html
 [loggers]
keys=root

[handlers]
keys=consoleHandler

[formatters]
keys=simpleFormatter

[logger_root]
level=DEBUG
handlers=consoleHandler

[logger_simpleExample]
level=DEBUG
handlers=consoleHandler
qualname=simpleExample
propagate=0

[handler_consoleHandler]
class=FileHandler
level=DEBUG
formatter=simpleFormatter
args=('python2.log', 'w')

[formatter_simpleFormatter]
format=%(asctime)s -%(levelname)s - %(message)s
```

## Snowflake

Once any migrated code you have been executed, you can go to Snowflake and check your changes or deployments.

```sql
 select * from PUBLIC.table1;
```

You will be able to see the rows you have inserted in the example above.

## Local Helpers Documentation

First of all, it is required to install the python package named pydoc (Available since version 2.0.2 of snowconvert-helpers).

```bash
 pip install pydoc
```

Then to display the python documentation of the package snowconvert-helpers, you should go to a folder where you have the converted output code and you have a python output.

```none
D:\bteq\Output>dir

 Volume in drive D is Storage
 Volume Serial Number is 203C-168C

 Directory of D:\bteq\Output

05/25/2021  03:55 PM    <DIR>          .
05/25/2021  03:55 PM    <DIR>          ..
05/25/2021  03:55 PM               630 input_BTEQ.py
               1 File(s)            630 bytes
               2 Dir(s)  1,510,686,502,912 bytes free
```

Located in this directory you need to run:

```bash
 python -m pydoc -b
```

The console will open your preferred browser with the HTML help of the documentation for all the installed packages.

```none
D:\bteq\Output>python -m pydoc -b
Server ready at http://localhost:61355/
Server commands: [b]rowser, [q]uit
server>
```

This will open the browser with the documentation of your code like:

Scroll thru the end of the page to see the installed packages. And you will see something similar to:

Clicking in the SnowConvert AI(package) you will see something like:

Clicking in the module helpers will display a screen similar to:

Then you can scroll thru the functions and classes of the module.

## Known Issues

No issues were found.

## Related EWIs

No related EWIs.

## Technical Documentation

### Functions

All the functions defined in the project.

#### access

> **Note:**
>
> **`access`**`(path, mode, *, dir_fd=None, effective_ids=False, follow_symlinks=True)`

##### **Description:**

*Use the real uid/gid to test for access to a path.*

*dir_fd, effective_ids, and follow_symlinks may not be implemented on your platform. If they are unavailable, using them will raise a NotImplementedError.*

*Note that most operations will use the effective uid/gid, therefore this routine can be used in a suid/sgid environment to test if the invoking user has the specified access to the path.*

##### **Parameters:**

* **`path,`** Path to be tested; can be string, bytes, or a path-like [object](http://localhost:65458/builtins.html#object)
* **`mode,`** Operating-system mode bitfield. Can be F_OK to test existence, or the inclusive-OR of R_OK, W_OK, and X_OK
* **`dir_fd,`** If not None, it should be a file descriptor open to a directory, and path should be relative; path will then be relative to that directory
* **`effective_ids,`** If True, access will use the effective uid/gid instead of the real uid/gid
* **`follow_symlinks,`** If False, and the last element of the path is a symbolic link, access will examine the symbolic link itself instead of the file the link points to

#### at_exit_helpers

> **Note:**
>
> **`at_exit_helpers`**`()`

##### **Description:**

*Executes at the exit of the execution of the script.*

#### colored

> **Note:**
>
> **`colored`**`(text, color='blue')`

##### **Description:**

*Prints colored text from the specified color.*

##### **Parameters:**

* `text`**`,`** The text to be printed
* `color="blue"`**`,`** The color to print

#### configure_log

> **Note:**
>
> **`configure_log`**`(configuration_path)`

##### **Description:**

*Configures the logging that will be performed for any data-related execution on the snowflake connection. The log file is named ‘snowflake_python_connector.log’ by default.*

**Parameters:**

* `configuration_path`**`,`** The configuration path of the file that contains all the settings desired for the logging

#### drop_transient_table

> **Note:**
>
> **`drop_transient_table`**`(tempTableName, con=None)`

##### **Description:**

*Drops the transient table with the specified name.*

**Parameters:**

* `tempTableName`**`,`** The name of the temporary table
* `con=None`**`,`** The connection to be used, if None is passed it will use the last connection performed

#### exception_hook

> **Note:**
>
> **`exception_hook`**`(exctype, value, tback)`

##### **Description:**

**Parameters:**

* `exctype`
* `value`
* `tback`

#### exec

> **Note:**
>
> **`exec`**`(sql_string, using=None, con=None)`

##### **Description:**

*Executes a sql string using the last connection, optionally it uses arguments or an specific connection. Examples:*

* *`exec("SELECT * FROM USER")`*
* *`exec("SELECT * FROM USER", con)`*
* *`exec("SELECT * FROM CUSTOMER WHERE CUSTOMERID= %S", customer)`*

**Parameters:**

* `sql_string`**`,`** The definition of the sql
* `using=None`**`,`** The optional parameter that can be used in the sql passed
* `con=None`**`,`** The connection to be used, if None is passed it will use the last connection performed

#### exec_file

> **Note:**
>
> **`exec_file`**`(filename, con=None)`

##### **Description:**

*Reads the content of a file and executes the sql statements contained with the specified connection.*

**Parameters:**

* `filename`**`,`** The filename to be read and executed
* `con=None`**`,`** The connection to be used, if None is passed it will use the last connection performed

#### exec_os

> **Note:**
>
> **`exec_os`**`(command)`

##### **Description:**

*Executes a command in the operating system.*

#### exec_sql_statement

> **Note:**
>
> **`exec_sql_statement`**`(sql_string, con, using=None)`

##### **Description:**

*Executes a sql statement in the connection passed, with the optional arguments.*

**Parameters:**

* `sql_string`**`,`** The sql containing the string to be executed
* `con`**`,`** The connection to be used
* `using`**`,`** The optional parameters to be used in the sql execution

#### **expands_using_params**

> **Note:**
>
> **`expands_using_params`**`(statement, params)`

##### **Description:**

*Expands the statement passed with the parameters.*

**Parameters:**

* `statement`**`,`** The sql containing the string to be executed
* `params`**`,`** The parameters of the sql statement

#### **expandvar**

> **Note:**
>
> **`expandvar`**`(str)`

##### **Description:**

*Expands the variable from the string passed.*

**Parameters:**

* `str`**`,`** The string to be expanded with the variables

#### **expandvars**

> **Note:**
>
> **`expandvars`**`(path, params, skip_escaped=False)`

##### **Description:**

*Expand environment variables of form $var and ${var}. If parameter ‘skip_escaped’ is True, all escaped variable references (that is, preceded by backslashes) are skipped. Unknown variables are set to ‘default’. If ‘default’ is None, they are left unchanged.*

**Parameters:**

* `path`**`,`**
* `params`**`,`**
* `skip_escaped=False`**`,`**

#### **fast_load**

> **Note:**
>
> **`fast_load`**`(target_schema, filepath, stagename, target_table_name, con=None)`

##### **Description:**

*Executes the fast load with the passed parameters target_schema, filepath, stagename and target_table_name.*

**Parameters:**

* `target_schema`**`,`** The name of the schema to be used in the fast load
* `filepath`**`,`** The filename path to be loaded in the table
* `target_table_name`**`,`** The name of the table that will have the data loaded
* `con=None`**`,`** The connection to be used, if None is passed it will use the last connection performed

#### **file_exists_and_readable**

> **Note:**
>
> **`file_exists_and_readable`**`(filename)`

##### **Description:**

**Parameters:**

* `filename`**`,`**

#### **get_argkey**

> **Note:**
>
> **`get_argkey`**`(astr)`

##### **Description:**

*Gets the argument key value from the passed string. It must start with the string ‘–param-’*

**Parameters:**

* `astr`**`,`** The argument string to be used. The string should have a value similar to –param-column=32 and the returned string will be ‘32

#### **get_error_position**

> **Note:**
>
> **`get_error_position`**`()`

##### **Description:**

*Gets the error position from the file using the information of the stack of the produced error.*

#### **get_from_vars_or_args_or_environment**

> **Note:**
>
> **`get_from_vars_or_args_or_environment`**`(arg_pos, variable_name, vars, args)`

##### **Description:**

*Gets the argument from the position specified or gets the value from the table vars or gets the environment variable name passed.*

**Parameters:**

* `arg_pos`**`,`** The argument position to be used from the arguments parameter
* `variable_name`**`,`** The name of the variable to be obtained
* `vars`**`,`** The hash with the variables names and values
* `args`**`,`** The arguments array parameter

#### **import_data_to_temptable**

> **Note:**
>
> **`import_data_to_temptable`**`(tempTableName, inputDataPlaceholder, con)`

##### **Description:**

*Imports data to a temporary table using an input data place holder.*

**Parameters:**

* `tempTableName,` The temporary table name.
* `inputDataPlaceholder,` The input place holder used that is a stage in the snowflake database
* `con,` The connection to be used

#### **import_file**

> **Note:**
>
> **`import_file`**`(filename, separator=' ')`

##### **Description:**

*Imports the passed filename with the optional separator.*

**Parameters:**

* `filename,` The filename path to be imported
* `separator=' ',` The optional separator

#### **import_file_to_temptable**

> **Note:**
>
> **`import_file_to_temptable`**`(filename, tempTableName, columnDefinition)`

##### **Description:**

*Imports the file passed to a temporary table. It will use a public stage named as the temporary table with the prefix Stage_. At the end of the loading to the temporary table, it will delete the stage that was used in the process.*

**Parameters:**

* `filename,` The name of the file to be read
* `tempTableName,` The name of the temporary table
* `columnDefinition,` The definition of all the fields that will have the temporary table

#### **import_reset**

> **Note:**
>
> **`import_reset`**`()`

##### **Description:**

#### **log**

> **Note:**
>
> **`log`**`(*msg, level=20, writter=None)`

##### **Description:**

*Prints a message to the console (standard output) or to the log file, depending on if logging is enabled*

**Parameters:**

* `*msg,` The message to print or log
* `level=20,`
* `writter=None,`

#### **log_on**

> **Note:**
>
> **`log_on`**`(user=None, password=None, account=None, database=None, warehouse=None, role=None, login_timeout=10, authenticator=None)`

##### **Description:**

*Logs on the snowflake database with the credentials, database, warehouse, role, login_timeout and authenticator passed parameters.*

**Parameters:**

* `user,` The user of the database
* `password` The password of the user of the database
* `database,` The database to be connected
* `warehouse,` The warehouse of the database to be connected
* `role,` The role to be connected
* `login_timeout,` The maximum timeout before giving error if the connection is taking too long to connect
* `authenticator,` The authenticator supported value to use like SNOWFLAKE, EXTERNALBROWSER, SNOWFLAKE_JWT or OAUTH
* `token,` The OAUTH or JWT token

#### **os**

> **Note:**
>
> **`os`**`(args)`

##### **Description:**

**Parameters:**

* `args,`

#### **print_table**

> **Note:**
>
> **`print_table`**`(dictionary)`

##### **Description:**

*Prints the dictionary without exposing user and password values.*

**Parameters:**

* `dictionary,`

#### **quit_application**

> **Note:**
>
> **`quit_application`**`(code=None)`

##### **Description:**

*Quits the application and optionally returns the passed code.*

**Parameters:**

* `code=None,` The code to be returned after it quits

#### **read_params_args**

> **Note:**
>
> **`read_param_args`**`(args)`

##### **Description:**

*Reads the parameter arguments from the passed array.*

**Parameters:**

* `args,` The arguments to be used

#### **readrun**

> **Note:**
>
> **readrun**(line, skip=0)

##### **Description:**

*Reads the given filename lines and optionally skips some lines at the beginning of the file.*

**Parameters:**

* `line,` The filename to be read
* `skip=0,` The lines to be skipped

#### **remark**

> **Note:**
>
> **remark**(arg)

##### **Description:**

*Prints the argument.*

**Parameters:**

* `arg,` The argument to be printed

#### **repeat_previous_sql_statement**

> **Note:**
>
> **`repeat_previous_sql_statement`**`(con=None, n=1)`

##### **Description:**

*Repeats the previous executed sql statement(s).*

**Parameters:**

* `con=None,` Connection if specified. If it is not passed it will use the last connection performed
* `n=1,` The number of previous statements to be executed again

#### **set_default_error_level**

> **Note:**
>
> **`set_default_error_level`**`(severity_value)`

##### **Description:**

**Parameters:**

* `severity_value,`

#### **set_error_level**

> **Note:**
>
> **`set_error_level`**`(arg, severity_value)`

##### **Description:**

**Parameters:**

* `arg,`
* `severity_value,`

#### **simple_fast_load**

> **Note:**
>
> **`simple_fast_load`**`(con, target_schema, filepath, stagename, target_table_name)`

##### **Description:**

Executes a simple fast load in the connection and the passed parameter target_schema, filepath, stagename and target table name.

**Parameters:**

* `arg,` The connection to be used
* `target_schema,` The name of the schema to be used in the fast load
* `filepath,` The filename path to be loaded in the table
* `target_table_name,` The name of the table that will have the data loaded

#### **stat**

> **Note:**
>
> **`stat`**`(path, *, dir_fd=None, follow_symlinks=True)`

##### **Description:**

*Perform a stat system call on the given path.* dir_fd and follow_symlinks may not be implemented on your platform. If they are unavailable, using them will raise a NotImplementedError. It’s an error to use dir_fd or follow_symlinks when specifying path as an open file descriptor

**Parameters:**

* `path,` Path to be examined; can be string, bytes, a path-like [object](http://localhost:55262/builtins.html#object) or
  open-file-descriptor int
* `dir_fd,` If not None, it should be a file descriptor open to a directory, and path should be a relative string; path will then be relative to that directory
* `follow_symlinks,` If False, and the last element of the path is a symbolic link, stat will examine the symbolic link itself instead of the file the link points to

#### **system**

> **Note:**
>
> **`system`**`(command)`

##### **Description:**

*Execute the command in a subshell.*

**Parameters:**

* *`command`*`,`

#### **using**

> **Note:**
>
> **`using`**`(*argv)`

##### **Description:**

**Parameters:**

* *`*argv`*`,`

### Classes

All the classes defined in the project

#### BeginLoading Class

This class contains the `import_file_to_tab` static function which provides support for the BEGIN LOADING and associated commands in FastLoad.

##### `import_file_to_tab()`

Parameters:

1. `target_schema_table`

   * the target schema (optional) and table name
2. `define_file`

   * The name of the file to be read
3. `define_columns`

   * The definition of all the columns for the temporary table
4. `begin_loading_columns`

   * The column names to insert. Dictates the order in which values are inserted
5. `begin_loading_values`

   * The list of raw insert values to convert
6. `field_delimiter`

   * The field delimiter
7. *(optional)* `skip_header`

   * The number of rows to skip
8. *(optional)* `input_data_place_holder`

   * The location of the file in a supported cloud provider. Set parameter when the file is not stored locally
9. *(optional)* `con`

   * The connection to be used

#### Export Class

Static methods in the class

* `defaults()`
* `null(value=None)`
* `record_mode(value=None)`
* `report(file, separator=' ')`
* `reset()`
* `separator_string(value=None)`
* `separator_width(value=None)`
* `side_titles(value=None)`
* `title_dashes(value=None, withValue=None)`
* `title_dashes_with(value=None)`
* `width(value=None)`

Data and other attributes defined here

* `expandedfilename = None`
* `separator = ''` \

#### Import Class

Methods in the class

* `reset()`

Static methods in the class

* `file(file, separator=' ')`
* `using(globals, *argv)`

Data and other attributes defined in the class

* `expandedfilename = None`
* `no_more_rows = False`
* `read_obj = None`
* `reader = None`
* `separator = ' '`

#### `Parameters` Class

Data and other attributes defined in the class

* `passed_variables = {}`

#####

---
title: SnowConvert AI - Teradata - SQL to JavaScript (Procedures)
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/teradata-to-javascript-translation-reference.md
section: Migrations
---

# SnowConvert AI - Teradata - SQL to JavaScript (Procedures)

## GET DIAGNOSTICS EXCEPTION

Translation reference to convert Teradata GET DIAGNOSTICS EXCEPTION statement to Snowflake Scripting

### Description

> GET DIAGNOSTICS retrieves information about successful, exception, or completion conditions from the Diagnostics Area.

For more information, see the [Teradata GET DIAGNOSTICS documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/Condition-Handling/GET-DIAGNOSTICS).

```sql
 GET DIAGNOSTICS
{
  [ EXCEPTION < condition_number >
    [ < parameter_name | variable_name > = < information_item > ]...
  ]
  |
  [ < parameter_name | variable_name > = < information_item > ]...
}
```

> **Note:**
>
> Some parts of the output code are omitted for clarity reasons.

### Sample Source Patterns

#### Teradata

##### Query

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE getDiagnosticsSample ()
BEGIN
    DECLARE V_MESSAGE, V_CODE VARCHAR(200);
    DECLARE V_Result INTEGER;
    SELECT c1 INTO V_Result FROM tab1;
    GET DIAGNOSTICS EXCEPTION 1 V_MESSAGE = MESSAGE_TEXT;
END;
```

##### Snowflake

##### Javascript

```sql
 CREATE OR REPLACE PROCEDURE getDiagnosticsSample ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var V_MESSAGE;
    var V_CODE;
    var V_RESULT;
    EXEC(`SELECT c1 FROM tab1`,[]);
    [V_RESULT] = INTO();
    V_MESSAGE = MESSAGE_TEXT;
$$;
```

### Known Issues

1. **Unsupported condition attributes statements**

   1. CLASS_ORIGIN
   2. CONDITION_IDENTIFIER
   3. CONDITION_NUMBER
   4. MESSAGE_LENGTH
   5. RETURNED_SQLSTATE
   6. SUBCLASS_ORIGIN

### Related EWIs

No related EWIs.

## **If**

The transformation for the [IF statement](https://docs.teradata.com/reader/I5Vi6UNnylkj3PsoHlLHVQ/GOzyPogDqU7DWoFvg6YMCw) is:

**Teradata**

```sql
 IF value = 2 THEN
```

**Snowflake**

```javascript
 if(value == 2){
}
```

## **Case**

The transformation for the [Case statement](https://docs.teradata.com/reader/I5Vi6UNnylkj3PsoHlLHVQ/nuR4riyH6QmcdmMu01TQEw) is:

**Teradata**

```sql
 case value
when 0 then
  select * from table1
else
  update table1 set name = "SpecificValue" where id = value;
end case
```

**Snowflake**

```javascript
 switch(value) {
    case 0:EXEC(`SELECT * FROM PUBLIC.table1`,[]);
        break;
    default:EXEC(`UPDATE PUBLIC.table1 set name = "SpecificValue" where id = value`,[]);
        break;
}
```

## **Cursor Declare, OPEN, FETCH and CLOSE**

The transformation for [cursor statements](https://docs.teradata.com/reader/I5Vi6UNnylkj3PsoHlLHVQ/vLlfGRxfadgP4k0a~0jpkA) is:

**Teradata**

### Cursor

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE procedure1()
DYNAMIC RESULT SETS 2
BEGIN

    -------- Local variables --------
    DECLARE sql_cmd VARCHAR(20000) DEFAULT ' ';
    DECLARE num_cols INTEGER;

    ------- Declare cursor with return only-------
    DECLARE resultset CURSOR WITH RETURN ONLY FOR firststatement;

    ------- Declare cursor -------
    DECLARE cur2 CURSOR FOR SELECT COUNT(columnname) FROM table1;

    -------- Set --------
    SET sql_cmd='sel * from table1';

    -------- Prepare cursor --------
    PREPARE firststatement FROM sql_cmd;

    -------- Open cursors --------
    OPEN resultset;
    OPEN cur1;

    -------- Fetch -------------
    FETCH cur1 INTO val1, val2;

    -------- Close cursor --------
    CLOSE cur1;
END;
```

**Snowflake**

#### JavaScript Cursor

```sql
 CREATE OR REPLACE PROCEDURE procedure1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    //------ Local variables --------
    var SQL_CMD = ` `;
    var NUM_COLS;
    var RESULTSET = new CURSOR(() => FIRSTSTATEMENT,[],true);
    //----- Declare cursor -------
    var CUR2 = new CURSOR(`SELECT COUNT(columnname) FROM table1`,[],false);
    //------ Set --------
    SQL_CMD = `SELECT * from table1`;
    //------ Prepare cursor --------
    var FIRSTSTATEMENT = SQL_CMD;
    //------ Open cursors --------
    RESULTSET.OPEN();
    CUR1.OPEN();
    //------ Fetch -------------
    CUR1.FETCH() && ([val1,val2] = CUR1.INTO());
    //------ Close cursor --------
    CUR1.CLOSE();
    return PROCRESULTS();
$$;
```

## **While**

The transformation for [while statement](https://docs.teradata.com/reader/I5Vi6UNnylkj3PsoHlLHVQ/fTCuW3l9hT6vtPc7V3QpcA) is:

**Teradata**

### While

```sql
 while (counter < 10) do
    set counter = counter + 1;
```

### Snowflake

#### While

```sql
 while ( counter < 10) {
    counter = counter + 1;
}
```

## **Security**

The transformation for [security statements](https://docs.teradata.com/reader/zzfV8dn~lAaKSORpulwFMg/knEJa8MckUZrAYquDblFAA) is:

| Teradata | Snowflake |
| --- | --- |
| SQL SECURITY CREATOR | EXECUTE AS OWNER |
| SQL SECURITY INVOKER | EXECUTE AS CALLER |
| SQL SECURITY DEFINER | EXECUTE AS OWNER |

## **FOR-CURSOR-FOR loop**

The transformation for [FOR-CURSOR-FOR loop](https://docs.teradata.com/reader/scPHvjfglIlB8F70YliLAw/YyY70D3vVqnHSAIE30t78g) is:

**Teradata**

### For-Cursor-For-Loop

```sql
-- Additional Params: -t JavaScript
REPLACE PROCEDURE Database1.Proc1()
BEGIN
    DECLARE lNumber INTEGER DEFAULT 1;
    FOR class1 AS class2 CURSOR FOR
      SELECT COL0,
      TRIM(COL1) AS COL1ALIAS,
      TRIM(COL2),
      COL3
      FROM someDb.prefixCol
    DO
      INSERT INTO TempDB.Table1 (:lgNumber, :lNumber, (',' || :class1.ClassCD || '_Ind CHAR(1) NOT NULL'));
      SET lNumber = lNumber + 1;
    END FOR;
END;
```

**Snowflake**

#### JavaScript For-Cursor-For-Loop

```sql
 CREATE OR REPLACE PROCEDURE Database1.Proc1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var LNUMBER = 1;
    /*** SSC-EWI-0023 - PERFORMANCE REVIEW - THIS LOOP CONTAINS AN INSERT, DELETE OR UPDATE STATEMENT ***/
    for(var CLASS2 = new CURSOR(`SELECT
   COL0,
   TRIM(COL1) AS COL1ALIAS,
   TRIM(COL2),
   COL3
FROM
   someDb.prefixCol`,[],false).OPEN();CLASS2.NEXT();) {
        let CLASS1 = CLASS2.CURRENT;
        EXEC(`INSERT INTO TempDB.Table1
VALUES (:lgNumber, :1, (',' || :
!!!RESOLVE EWI!!! /*** SSC-EWI-0026 - THE  VARIABLE class1.ClassCD MAY REQUIRE A CAST TO DATE, TIME OR TIMESTAMP ***/!!!
:2 || '_Ind CHAR(1) NOT NULL'))`,[LNUMBER,CLASS1.CLASSCD]);
        LNUMBER = LNUMBER + 1;
    }
    CLASS2.CLOSE();
$$;
```

*Note: The FOR loop present in the Teradata procedure is transformed to a FOR block in javascript that emulates its functionality.*

## **Procedure parameters and variables referenced inside statements**

The transformation for the procedure parameters and variables that are referenced inside the statements of the procedure is:

**Teradata**

### Parameters and variables

```sql
 -- Additional Params: -t JavaScript
REPLACE PROCEDURE PROC1 (param1 INTEGER, param2 VARCHAR(30))
BEGIN
    DECLARE var1          VARCHAR(1024);
    DECLARE var2          SMALLINT;
    DECLARE weekstart date;
    set weekstart= '2019-03-03';
    set var1 = 'something';
    set var2 = 123;

    SELECT * FROM TABLE1 WHERE SOMETHING = :param1;
    SELECT * FROM TABLE1 WHERE var1 = var1 AND date1 = weekstart AND param2 = :param2;
    INSERT INTO TABLE2 (col1, col2, col3, col4, col5) VALUES (:param1, :param2, var1, var2, weekstart);
END;
```

**Snowflake**

#### JavaScript prameters and variables

```sql
 CREATE OR REPLACE PROCEDURE PROC1 (PARAM1 FLOAT, PARAM2 STRING)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var VAR1;
    var VAR2;
    var WEEKSTART;
    WEEKSTART = `2019-03-03`;
    VAR1 = `something`;
    VAR2 = 123;
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`SELECT * FROM TABLE1 WHERE SOMETHING = :1`,[PARAM1]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`SELECT * FROM TABLE1 WHERE :1 = :1 AND date1 = :2 AND param2 = :3`,[VAR1,WEEKSTART,PARAM2]);
    // ** SSC-EWI-0022 - ONE OR MORE IDENTIFIERS IN THIS STATEMENT WERE CONSIDERED PARAMETERS BY DEFAULT. REFERENCED TABLE NOT FOUND. **
    EXEC(`INSERT INTO TABLE2 (col1, col2, col3, col4, col5) VALUES (:1, :2, :3, :4, :5)`,[PARAM1,PARAM2,VAR1,VAR2,WEEKSTART]);
$$;
```

*Note: Whenever a procedure parameter or a variable declared inside the procedure is referenced inside a Teradata statement that has to be converted,* *this reference is escaped from the resulting text to preserve the original reference’s functionality.*

## **Leave**

In Javascript, it’s possible to use `break` with an additional parameter, thus emulating the behavior of a Teradata `LEAVE` jump.

Labels can also be emulated by using Javascript Labeled Statements.

The transformation for [LEAVE statement](https://docs.teradata.com/reader/I5Vi6UNnylkj3PsoHlLHVQ/60WkuZd8ir9NgHlhlJcxIA) is:

**Teradata**

### Leave

```sql
-- Additional Params: -t JavaScript
REPLACE PROCEDURE  PROC1 ()
BEGIN
  DECLARE v_propval            VARCHAR(1024);

 DECLARE Cur1 cursor for
   Select
      propID
   from viewName.viewCol
   where propval is not null;

LABEL_WHILE:
  WHILE (SQLCODE = 0)
  DO
      IF (SQLSTATE = '02000' )
       THEN LEAVE LABEL_WHILE;
      END IF;
      LABEL_INNER_WHILE:
      WHILE (SQLCODE = 0)
      DO
        IF (SQLSTATE = '02000' )
          THEN LEAVE LABEL_INNER_WHILE;
        END IF;
      END WHILE LABEL_INNER_WHILE;
      SELECT * FROM TABLE1;
  END WHILE L1;
END;
```

**Snowflake**

#### JavaScript Leave

```sql
 CREATE OR REPLACE PROCEDURE PROC1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
 // SnowConvert AI Helpers Code section is omitted.

 var V_PROPVAL;
 var CUR1 = new CURSOR(`SELECT propID from viewName.viewCol
  where
  propval is not null`,[],false);
  LABEL_WHILE: {
  while ( SQLCODE == 0 ) {
   if (SQLSTATE == `02000`) {
    break LABEL_WHILE;
   }
   LABEL_INNER_WHILE: {
    while ( SQLCODE == 0 ) {
     if (SQLSTATE == `02000`) {
      break LABEL_INNER_WHILE;
     }
    }
   }
   EXEC(`SELECT * FROM TABLE1`,[]);
  }
 }
$$;
```

## Getting Results from Procedures

### Description of the translation

In Teradata, there are two ways to return data from a procedure. The first is through output parameters and the second through *Dynamic Result Sets* and *Cursors.* Both are shown in the following example. Each important point is explained below.

### Example of returning data from a Stored Procedure

**Teradata**

#### Out parameter

```sql
-- Additional Params: -t JavaScript
REPLACE PROCEDURE Procedure1(OUT P1 INTEGER)
    DYNAMIC RESULT SETS 2
    BEGIN
        DECLARE SQL_CMD,SQL_CMD_1  VARCHAR(20000) DEFAULT ' ';
        DECLARE RESULTSET CURSOR WITH RETURN ONLY FOR FIRSTSTATEMENT;
        SET SQL_CMD = 'SEL * FROM EMPLOYEE';
        PREPARE FIRSTSTATEMENT FROM SQL_CMD;
        OPEN RESULTSET;
        SET P1 = (SEL CAST(AVG(AGE) AS INTEGER) FROM EMPLOYEE);
    END;
```

**Snowflake**

##### JavaScript out parameter

```sql
 CREATE OR REPLACE PROCEDURE Procedure1 (P1 FLOAT)
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var SQL_CMD = ` `;
    var SQL_CMD_1 = ` `;
    var RESULTSET = new CURSOR(() => FIRSTSTATEMENT,[],true);
    SQL_CMD = `SELECT * FROM EMPLOYEE`;
    var FIRSTSTATEMENT = SQL_CMD;
    RESULTSET.OPEN();
    EXEC(`(
   SELECT
      CAST(AVG(AGE) AS INTEGER)
   FROM
      EMPLOYEE
)`,[]);
    var subQueryVariable0;
    [subQueryVariable0] = INTO();
    P1 = subQueryVariable0;
    return PROCRESULTS(P1);
$$;
```

In this converted SQL, there are several conversions that take place:

* The `DYNAMIC RESULT SETS 2` definition is converted to a `DYNAMIC_RESULTS` variable.

```javascript
     var DYNAMIC_RESULTS = 2;
```

* When a cursor with an `WITH RETURN`attribute is opened (and therefore a query is executed), its query ID is stored in the`_OUTQUERIES`collection to be later returned. The query id is obtained by the`getQueryId()`function provided in the [JavaScript API for Snowflake stored procedures](https://docs.snowflake.com/en/sql-reference/stored-procedures-api.html#getQueryId).
* Only the first k-query-IDs are stored in the collection, where k is the value of the`DYNAMIC_RESULTS`variable. This is done to emulate Teradata’s behavior, which only returns the first k-opened-cursors, even if more are opened in the stored procedure.
* The combination of `DECLARE CURSOR WITH RETURN` with `PREPARE` is translated to:

```javascript
     var RESULTSET = new CURSOR(() => FIRSTSTATEMENT,[],true);
```

* The output parameters are supported through the return statement of the procedure. An array is created containing the value of each output parameter and the`_OUTQUERIES`collection. The`PROCRESULTS`function deals with the creation and filling of this array. See [PROCRESULTS() helper](helpers-for-procedures.md) for more information.

```javascript
     return PROCRESULTS(P1);
```

### Example of getting data from a Stored Procedure

If the output parameters and the query IDs are returned from a procedure, a second one could call the first one to get these values, as shown below:

**Teradata**

#### Call procedure

```sql
 -- Additional Params: -t JavaScript
CREATE PROCEDURE Procedure2()
BEGIN
    DECLARE x INTEGER;
    CALL Procedure1(x);
END;
```

**Snowflake**

##### JavaScript Call procedure

```sql
 CREATE OR REPLACE PROCEDURE Procedure2 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    // SnowConvert AI Helpers Code section is omitted.

    var X;
    EXEC(`CALL Procedure1(:1)`,[X]);
$$;
```

* The value of the`P1`argument from`Procedure1`is returned and stored in the`X`variable.
* The`_OUTQUERIES`returned from`Procedure1`are stored in the`resultset`variable.

> **Note:**
>
> This behavior also applies to the INOUT parameters.

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-EWI-0022](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): One or more identifiers in this statement were considered parameters by default.
2. [SSC-EWI-0023](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Performance Review - A loop contains an insert, delete, or update statement.
3. [SSC-EWI-0026](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The variable may require a cast to date, time, or timestamp.
4. [SSC-FDM-TD0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): This message is shown when SnowConvert AI finds a data type BLOB.

---
title: SnowConvert AI - Teradata - SQL to Snowflake Scripting (Procedures)
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/teradata-to-snowflake-scripting-translation-reference.md
section: Migrations
---

# SnowConvert AI - Teradata - SQL to Snowflake Scripting (Procedures)

## ABORT and ROLLBACK

Translation reference to convert Teradata ABORT and ROLLBACK statements to Snowflake Scripting

### Description

Teradata’s `ABORT` and `ROLLBACK` statements are replaced by a `ROLLBACK` statement in Snowflake Scripting.

For more information on Teradata [ABORT](https://docs.teradata.com/r/huc7AEHyHSROUkrYABqNIg/c6KYQ4ySu4QTCkKS4f5A2w) and for [ROLLBACK](https://docs.teradata.com/r/huc7AEHyHSROUkrYABqNIg/ZddbA8dTQ1LNcHwmCn8BVg).

```sql
 ABORT [abort_message] [FROM option] [WHERE abort_condition];

ROLLBACK [WORK] [abort_message] [FROM clause] [WHERE clause];
```

### Sample Source Patterns

#### Basic ABORT and ROLLBACK

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE procedureBasicAbort()
BEGIN
    ABORT;
    ROLLBACK;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE procedureBasicAbort ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        ROLLBACK;
        ROLLBACK;
    END;
$$;
```

#### Conditional ABORT and ROLLBACK

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE procedureWhereAbort(AnotherValueProc INTEGER)
BEGIN
    ABORT WHERE AValueProc > 2;

    ROLLBACK WHERE (AnotherValueProc > 2);
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE procedureWhereAbort (ANOTHERVALUEPROC INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/23/2024" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        IF (AValueProc > 2) THEN
            ROLLBACK;
        END IF;
        IF (:AnotherValueProc > 2) THEN
            ROLLBACK;
        END IF;
    END;
$$;
```

#### ABORT and ROLLBACK with table references and FROM clause

##### Teradata

##### Query

```sql
 CREATE TABLE  ReferenceTable
    (ColumnValue INTEGER);

CREATE TABLE  ReferenceTable2
    (ColumnValue INTEGER);

REPLACE PROCEDURE procedureFromAbort()
BEGIN
    ROLLBACK FROM ReferenceTable, ReferenceTable2
	WHERE ReferenceTable.ColumnValue = ReferenceTable2.ColumnValue;
    ABORT FROM ReferenceTable, ReferenceTable2
        WHERE ReferenceTable.ColumnValue = ReferenceTable2.ColumnValue;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE TABLE ReferenceTable
(
	ColumnValue INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

CREATE TABLE ReferenceTable2
(
	ColumnValue INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

CREATE OR REPLACE PROCEDURE procedureFromAbort ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
	BEGIN
		LET _ROW_COUNT FLOAT;
		SELECT
			COUNT(*)
		INTO
			_ROW_COUNT
			FROM
			ReferenceTable,
			ReferenceTable2
				WHERE
			ReferenceTable.ColumnValue = ReferenceTable2.ColumnValue;
			IF (_ROW_COUNT > 0) THEN
			ROLLBACK;
			END IF;
			SELECT
			COUNT(*)
			INTO
			_ROW_COUNT
			FROM
			ReferenceTable,
			ReferenceTable2
			        WHERE
			ReferenceTable.ColumnValue = ReferenceTable2.ColumnValue;
			IF (_ROW_COUNT > 0) THEN
			ROLLBACK;
			END IF;
	END;
$$;
```

#### ABORT and ROLLBACK with table references without FROM clause

##### Teradata

##### Query

```sql
 CREATE TABLE  ReferenceTable
    (ColumnValue INTEGER);

REPLACE PROCEDURE procedureFromTableAbort()
BEGIN
    ROLLBACK WHERE ReferenceTable.ColumnValue > 2;
    ABORT WHERE ReferenceTable.ColumnValue > 4;
END;
```

##### Snowflake Scripting

##### Abort and rollback

```sql
 CREATE OR REPLACE TABLE ReferenceTable
(
    ColumnValue INTEGER)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

CREATE OR REPLACE PROCEDURE procedureFromTableAbort ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        LET _ROW_COUNT FLOAT;
        SELECT
            COUNT(*)
        INTO
            _ROW_COUNT
        FROM
            ReferenceTable
            WHERE
            ReferenceTable.ColumnValue > 2;
            IF (_ROW_COUNT > 0) THEN
            ROLLBACK;
            END IF;
            SELECT
            COUNT(*)
            INTO
            _ROW_COUNT
            FROM
            ReferenceTable
            WHERE
            ReferenceTable.ColumnValue > 4;
            IF (_ROW_COUNT > 0) THEN
            ROLLBACK;
            END IF;
    END;
$$;
```

### Known Issues

#### 1. Custom Error Message

Even though the ROLLBACK AND ABORT are supported, using them with a custom error message is not supported.

##### Teradata

##### Error message

```sql
 ABORT 'Error message for abort';
ROLLBACK  'Error message for rollback';
```

##### Snowflake Scripting

##### Error message

```sql
 ABORT 'Error message for abort';
ROLLBACK  'Error message for rollback';
```

##### 2. Aggregate function

The use of the aggregate function combined with ABORT/ROLLBACK is not supported

##### Teradata

##### Aggregate function

```sql
 ROLLBACK WHERE SUM(ATable.AValue) < 2;
ABORT WHERE SUM(ATable.AValue) < 2;
```

##### Snowflake Scripting

##### Aggregate function

```sql
 ROLLBACK WHERE SUM(ATable.AValue) < 2;
ABORT WHERE SUM(ATable.AValue) < 2;
```

### Related EWIs

No related EWIs.

## ACTIVITY_COUNT

Translation specification for the ACTIVITY_COUNT status variable.

### Description

The `ACTIVITY_COUNT` status variable returns the number of rows affected by an SQL DML statement in an embedded SQL or stored procedure application. For more information, see the [Teradata ACTIVITY_COUNT documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Stored-Procedures-and-Embedded-SQL/Result-Code-Variables/ACTIVITY_COUNT).

There is no direct equivalent in Snowflake. However, there is a workaround to emulate the `ACTIVITY_COUNT`’s behavior. One must simply use the following query:

```sql
 SELECT $1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

This query retrieves and returns the first column of the result set from the last executed query in the current session. Furthermore, `$1` can be replaced by `"number of rows inserted"`, `"number of rows updated"` or `"number of rows deleted"` based on the query type.

As expected, this translation behaves like its Teradata counterpart only when no other queries besides the SQL DML statement are executed before calling `LAST_QUERY_ID`.

### Sample Source Patterns

#### Setup data

##### Teradata

##### Query

```sql
 CREATE TABLE employees (
    employee_id INT NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    department_id INT,
    salary DECIMAL(10,2),
    PRIMARY KEY (employee_id)
);

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (1, 'John', 'Doe', 10, 60000.00);

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (2, 'Johny', 'Doey', 10, 65000.00);

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (3, 'Max', 'Smith', 10, 70000.00);

DROP TABLE activity_log;
CREATE TABLE activity_log (
    log_id INT GENERATED ALWAYS AS IDENTITY (START WITH 1 INCREMENT BY 1) NOT NULL,
    operation VARCHAR(200),
    row_count INT,
    log_timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP,
    PRIMARY KEY (log_id)
);
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE TABLE employees (
    employee_id INT NOT NULL,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    department_id INT,
    salary DECIMAL(10,2),
    PRIMARY KEY (employee_id)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/11/2024" }}'
;

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (1, 'John', 'Doe', 10, 60000.00);

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (2, 'Johny', 'Doey', 10, 65000.00);

INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (3, 'Max', 'Smith', 10, 70000.00);

CREATE OR REPLACE TABLE activity_log (
    log_id INT DEFAULT activity_log_log_id.NEXTVAL NOT NULL,
    operation VARCHAR(200),
    row_count INT,
    log_timestamp TIMESTAMP DEFAULT CURRENT_TIMESTAMP(),
    PRIMARY KEY (log_id)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/11/2024" }}'
;
```

#### Supported usage

##### *Teradata*

##### Query

```sql
 REPLACE PROCEDURE UpdateEmployeeSalaryAndLog ()
BEGIN
    DECLARE row_count1 INT;

    UPDATE employees
    SET salary = 80000
    WHERE department_id = 10;

    -- Get the ACTIVITY_COUNT
    SET row_count1 = ACTIVITY_COUNT;

    -- Insert the ACTIVITY_COUNT into the activity_log table
    INSERT INTO activity_log (operation, row_count)
    VALUES ('UPDATE WHERE dept=10', row_count1);
END;

CALL UpdateEmployeeSalaryAndLog();

SELECT * FROM ACTIVITY_LOG;
```

##### Result

```none
LOG_ID | OPERATION    	      | ROW_COUNT | LOG_TIMESTAMP              |
-------+----------------------+-----------+----------------------------+
1      | UPDATE WHERE dept=10 |	3         | 2024-07-10 15:58:46.490000 |
```

##### *Snowflake*

##### Query

```sql
 CREATE OR REPLACE PROCEDURE UpdateEmployeeSalaryAndLog ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/11/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        row_count1 INT;
    BEGIN

        UPDATE employees
    SET salary = 80000
    WHERE department_id = 10;

    -- Get the ACTIVITY_COUNT
        row_count1 := (
    SELECT
        $1
    FROM
        TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;

        -- Insert the ACTIVITY_COUNT into the activity_log table
        INSERT INTO activity_log (operation, row_count)
        VALUES ('UPDATE WHERE dept=10', :row_count1);
    END;
$$;

CALL UpdateEmployeeSalaryAndLog();

SELECT
    * FROM
    ACTIVITY_LOG;
```

##### Result

```none
LOG_ID | OPERATION    	      | ROW_COUNT | LOG_TIMESTAMP            |
-------+----------------------+-----------+--------------------------+
102    | UPDATE WHERE dept=10 |	3         | 2024-07-11T12:42:35.280Z |
```

### Known Issues

1. If `ACTIVITY_COUNT` is called twice or more times before executing a DML statement, the transformation might not return the expected values. See [SSC-FDM-TD0033](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md).
2. If `ACTIVITY_COUNT` is called after a non DML statement was executed, the transformation will not return the expected values. See [SSC-FDM-TD0033](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md).
3. `ACTIVITY_COUNT` requires manual fixing when inside a `SELECT/SET INTO VARIABLE` statement and was not able to be identified as a column name. See [SSC-EWI-TD0003](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md).

### Related EWIs

1. [SSC-FDM-TD0033](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): ‘ACTIVITY_COUNT’ TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS.

## BEGIN END

Translation reference to convert Teradata BEGIN END clause to Snowflake Scripting

### BEGIN END TRANSACTION

#### Description

> Defines the beginning of an explicit logical transaction in Teradata session mode.

For more information, see the [Teradata BEGIN END Transaction documentation](https://docs.teradata.com/r/2_MC9vCtAJRlKle2Rpb0mA/EhQtM73NDooSYqTcaZEHzQ).

```sql
 [ BEGIN TRANSACTION | BT ]
     statement
     [ statement ]... ]
[ END TRANSACTION | ET ];
```

#### Sample Source Pattern

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE BeginEndProcedure()
BEGIN
    DECLARE HELLOSTRING VARCHAR(60);
    BEGIN TRANSACTION
        SET HELLOSTRING = 'HELLO WORLD';
    END TRANSACTION;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE BeginEndProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        HELLOSTRING VARCHAR(60);
    BEGIN

        BEGIN TRANSACTION;
        HELLOSTRING := 'HELLO WORLD';
        COMMIT;
    END;
$$;
```

### BEGIN END REQUEST

#### Description

> Delimits a SQL multistatement request

For more information, see the [Teradata BEGIN END Request documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Data-Definition-Language-Syntax-and-Examples/March-2019/Procedure-Statements/CREATE-PROCEDURE-and-REPLACE-PROCEDURE-SQL-Form/Syntax-Elements/Statement-Options/BEGIN-REQUEST).

```sql
 BEGIN REQUEST
     statement
     [ statement ]... ]
END REQUEST;
```

#### Sample Source Pattern

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE BeginEndProcedure()
BEGIN
    DECLARE HELLOSTRING VARCHAR(60);
    BEGIN REQUEST
        SET HELLOSTRING = 'HELLO WORLD';
    END REQUEST;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE BeginEndProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        HELLOSTRING VARCHAR(60);
    BEGIN

        BEGIN
            HELLOSTRING := 'HELLO WORLD';
            COMMIT;
        EXCEPTION
            WHEN OTHER THEN
                ROLLBACK;
        END;
    END;
$$;
```

### BEGIN END COMPOUND

#### Description

> Delimits a compound statement in a stored procedure.

For more information, see the [Teradata BEGIN END Compound documentation](https://docs.teradata.com/r/Teradata-Database-SQL-Stored-Procedures-and-Embedded-SQL/June-2017/SQL-Control-Statements/BEGIN-...-END).

```sql
 label_name: BEGIN
     statement
     [ statement ]... ]
END label_name;
```

#### Sample Source Pattern

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE BeginEndProcedure()
BEGIN
    DECLARE HELLOSTRING VARCHAR(60);
    label_name: BEGIN
        SET HELLOSTRING = 'HELLO WORLD';
    END label_name;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE BeginEndProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        HELLOSTRING VARCHAR(60);
    BEGIN

        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'label_name LABEL' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
        label_name:
        BEGIN
            HELLOSTRING := 'HELLO WORLD';
        END;
    END;
$$;
```

### Known Issues

#### 1. Labels not supported in outer BEGIN END blocks

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE procedureLabelSingle()
label_name: BEGIN
    DECLARE HELLOSTRING VARCHAR(60);
    SET HELLOSTRING = 'HELLO WORLD';
END label_name;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE procedureLabelSingle ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'label_name LABEL' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
    label_name:
    DECLARE
        HELLOSTRING VARCHAR(60);
    BEGIN

        HELLOSTRING := 'HELLO WORLD';
    END;
$$;
```

### Related EWIs

1. [SSC-EWI-0058](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.

## CASE

Translation reference to convert Teradata CASE statement to Snowflake Scripting

### Description

> Provides conditional execution of statements based on the evaluation of the specified conditional expression or equality of two operands.
>
> The CASE statement is different from the SQL CASE expression_,_ which returns the result of an expression.

For more information, see the [Teradata CASE documentation](https://docs.teradata.com/r/zzfV8dn~lAaKSORpulwFMg/3nWOY~VPjk9_5FJXaKQNFg).

```sql
 -- Simple CASE
CASE operant_1
[ WHEN operant_2 THEN
     statement
     [ statement ]... ]...
[ ELSE
     statement
     [ statement ]... ]
END CASE;

-- Searched CASE
CASE
[ WHEN conditional_expression THEN
     statement
     [ statement ]... ]...
[ ELSE
     statement
     [ statement ]... ]
END CASE;
```

### Sample Source Patterns

#### Sample auxiliary table

##### Teradata

```sql
 CREATE TABLE case_table(col varchar(30));
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE case_table (
col varchar(30))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Simple Case

##### Teradata

##### Query

```sql
 CREATE  PROCEDURE caseExample1 ( grade NUMBER )
BEGIN
    CASE grade
        WHEN 10 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Excellent');
        WHEN 9 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Very Good');
        WHEN 8 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Good');
        WHEN 7 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Fair');
        WHEN 6 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Poor');
        ELSE INSERT INTO CASE_TABLE(COL) VALUES ('No such grade');
    END CASE;
END;

CALL caseExample1(6);
CALL caseExample1(4);
CALL caseExample1(10);
SELECT * FROM CASE_TABLE;
```

##### Result

```none
|COL          |
|-------------|
|Poor         |
|No such grade|
|Excellent    |
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE caseExample1 (GRADE NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CASE (grade)
            WHEN 10 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Excellent');
            WHEN 9 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Very Good');
            WHEN 8 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Good');
            WHEN 7 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Fair');
            WHEN 6 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Poor');
            ELSE
                INSERT INTO CASE_TABLE (COL)
                VALUES ('No such grade');
        END CASE;
    END;
$$;

CALL caseExample1(6);
CALL caseExample1(4);
CALL caseExample1(10);
SELECT * FROM CASE_TABLE;
```

##### Result

```none
|COL          |
|-------------|
|Poor         |
|No such grade|
|Excellent    |
```

#### Searched Case

##### Teradata

##### Query

```sql
 CREATE PROCEDURE caseExample2 ( grade NUMBER )
BEGIN
    CASE
        WHEN grade = 10 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Excellent');
        WHEN grade = 9 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Very Good');
        WHEN grade = 8 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Good');
        WHEN grade = 7 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Fair');
        WHEN grade = 6 THEN INSERT INTO CASE_TABLE(COL) VALUES ('Poor');
        ELSE INSERT INTO CASE_TABLE(COL) VALUES ('No such grade');
    END CASE;
END;

CALL caseExample2(6);
CALL caseExample2(4);
CALL caseExample2(10);
SELECT * FROM CASE_TABLE;
```

##### Result

```none
|COL          |
|-------------|
|Poor         |
|No such grade|
|Excellent    |
```

##### Snowflake Scripting

##### Query

```sql
CREATE OR REPLACE PROCEDURE caseExample2 (GRADE NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        CASE
            WHEN :grade = 10 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Excellent');
            WHEN :grade = 9 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Very Good');
            WHEN :grade = 8 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Good');
            WHEN :grade = 7 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Fair');
            WHEN :grade = 6 THEN
                INSERT INTO CASE_TABLE (COL)
                VALUES ('Poor');
                ELSE
                INSERT INTO CASE_TABLE (COL)
                VALUES ('No such grade');
        END CASE;
    END;
$$;

CALL caseExample2(6);

CALL caseExample2(4);

CALL caseExample2(10);

SELECT
    * FROM
    CASE_TABLE;
```

##### Result

```none
|COL          |
|-------------|
|Poor         |
|No such grade|
|Excellent    |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## CURSOR

Translation reference to convert Teradata CURSOR statement to Snowflake Scripting

### Description

A cursor is a data structure that is used by stored procedures at runtime to point to a resultset returned by an SQL query. For more information, see the [Teradata SQL Cursor Control and DML Statements documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/SQL-Cursor-Control-and-DML-Statements).

```sql
 DECLARE cursor_name [ SCROLL | NO SCROLL ] CURSOR
     [
          WITHOUT RETURN
          |
          WITH RETURN [ ONLY ] [ TO [ CALLER | CLIENT ] ]
     ]
     FOR
     cursor_specification [ FOR [ READ ONLY | UPDATE ] ]
     |
     statement_name
;
```

```sql
 FETCH [ [ NEXT | FIRST ] FROM ] cursor_name INTO
    [ variable_name | parameter_name ] [ ,...n ]
;
```

```sql
 OPEN cursor_name
    [ USING [ SQL_identifier | SQL_paramenter ] [ ,...n ] ]
;
```

```sql
 CLOSE cursor_name ;
```

### Sample Source Patterns

#### Setup Data

The following code is necessary to execute the sample patterns present in this section.

##### Teradata

```sql
 CREATE TABLE vEmployee(
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255)
);

CREATE TABLE ResTable(
    Column1 VARCHAR(255)
);

INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (1, 'Smith', 'Christian');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (2, 'Johnson', 'Jhon');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (3, 'Brown', 'William');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (4, 'Williams', 'Gracey');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (5, 'Garcia', 'Julia');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (6, 'Miller', 'Peter');
INSERT INTO vEmployee(PersonID, LastName, FirstName) VALUES (7, 'Davis', 'Jannys');

CREATE TABLE TEST_TABLE (
    ColumnA NUMBER,
    ColumnB VARCHAR(8),
    ColumnC VARCHAR(8));

SELECT * FROM TEST_TABLE;
INSERT INTO TEST_TABLE VALUES (1, '1', '1');
INSERT INTO TEST_TABLE VALUES (2, '2', '2');
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE vEmployee (
    PersonID INT,
    LastName VARCHAR(255),
    FirstName VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

CREATE OR REPLACE TABLE ResTable (
    Column1 VARCHAR(255)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (1, 'Smith', 'Christian');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (2, 'Johnson', 'Jhon');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (3, 'Brown', 'William');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (4, 'Williams', 'Gracey');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (5, 'Garcia', 'Julia');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (6, 'Miller', 'Peter');

INSERT INTO vEmployee (PersonID, LastName, FirstName)
VALUES (7, 'Davis', 'Jannys');

CREATE OR REPLACE TABLE TEST_TABLE (
    ColumnA NUMBER(38, 18),
    ColumnB VARCHAR(8),
    ColumnC VARCHAR(8))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT
    * FROM
    TEST_TABLE;

    INSERT INTO TEST_TABLE
    VALUES (1, '1', '1');

    INSERT INTO TEST_TABLE
    VALUES (2, '2', '2');
```

#### Basic Cursor

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE CursorsTest()
BEGIN
    DECLARE val1 VARCHAR(255);
    DECLARE empcursor CURSOR FOR
        SELECT LastName
        FROM vEmployee
        ORDER BY PersonID;

    OPEN empcursor;
    FETCH NEXT FROM empcursor INTO val1;
    FETCH NEXT FROM empcursor INTO val1;
    INSERT INTO ResTable(Column1) VALUES (val1);
    CLOSE empcursor;
END;

CALL CursorsTest();
SELECT * FROM ResTable;
```

##### Result

```none
Johnson
```

##### Snowflake Scripting

##### Cursor Code

```sql
 CREATE OR REPLACE PROCEDURE CursorsTest ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        val1 VARCHAR(255);
    BEGIN

        LET empcursor CURSOR
        FOR
            SELECT
                LastName
                   FROM
                vEmployee
                   ORDER BY PersonID;
        OPEN empcursor;
        FETCH NEXT FROM empcursor INTO val1;
            FETCH NEXT FROM empcursor INTO val1;
        INSERT INTO ResTable (Column1)
        VALUES (:val1);
            CLOSE empcursor;
    END;
$$;

CALL CursorsTest();

SELECT
    * FROM
    ResTable;
```

##### Result

```none
Johnson
```

#### Single Returnable Cursor

The following procedure is intended to return one result set since it has the `DYNAMIC RESULT SETS 1` property in the header, the cursor has the `WITH RETURN` property and is being opened in the body.

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE spSimple ()
DYNAMIC RESULT SETS 1
BEGIN
    DECLARE result_set CURSOR WITH RETURN ONLY FOR
    SELECT *
    FROM vEmployee;

    OPEN result_set;
END;

CALL spSimple();
```

##### Result

```none
PersonID|LastName|FirstName|
--------+--------+---------+
       7|Davis   |Jannys   |
       5|Garcia  |Julia    |
       3|Brown   |William  |
       1|Smith   |Christian|
       6|Miller  |Peter    |
       4|Williams|Gracey   |
       2|Johnson |Jhon     |
```

##### Snowflake Scripting

##### Cursor Code

```sql
 CREATE OR REPLACE PROCEDURE spSimple ()
RETURNS TABLE (
)
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        LET result_set CURSOR FOR
            SELECT * FROM vEmployee;
        OPEN result_set;
        RETURN TABLE(resultset_from_cursor(result_set));
    END;
$$;

CALL spSimple();
```

##### Result

```none
PERSONID|LASTNAME|FIRSTNAME|
--------+--------+---------+
       1|Smith   |Christian|
       2|Johnson |Jhon     |
       3|Brown   |William  |
       4|Williams|Gracey   |
       5|Garcia  |Julia    |
       6|Miller  |Peter    |
       7|Davis   |Jannys   |
```

#### Multiple Returnable Cursors

The following procedure is intended to return multiple results when `DYNAMIC RESULT SETS` property in the header is greater than 1, the procedure has multiple cursors with the `WITH RETURN` property and these same cursors are being opened in the body.

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE spTwoOrMore()
DYNAMIC RESULT SETS 2
BEGIN
    DECLARE result_set CURSOR WITH RETURN ONLY FOR
        SELECT * FROM SampleTable2;

    DECLARE result_set2 CURSOR WITH RETURN ONLY FOR
	SELECT Column11 FROM SampleTable1;
    OPEN result_set2;
    OPEN result_set;
END;

CALL spTwoOrMore();
```

##### Result

```none
ColumnA|ColumnB|ColumnC|
-------+-------+-------+
      2|2      |2      |
      1|1      |1      |

PersonID|LastName|FirstName|
--------+--------+---------+
       7|Davis   |Jannys   |
       5|Garcia  |Julia    |
       3|Brown   |William  |
       1|Smith   |Christian|
       6|Miller  |Peter    |
       4|Williams|Gracey   |
       2|Johnson |Jhon     |
```

##### Snowflake Scripting

##### Cursor Code

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "SampleTable2", "SampleTable1" **
CREATE OR REPLACE PROCEDURE spTwoOrMore ()
RETURNS ARRAY
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		tbl_result_set VARCHAR;
		tbl_result_set2 VARCHAR;
		return_arr ARRAY := array_construct();
	BEGIN
		tbl_result_set := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_result_set) AS
			SELECT
				* FROM
				SampleTable2;
		LET result_set CURSOR
		FOR
			SELECT
				*
			FROM
				IDENTIFIER(?);
		tbl_result_set2 := 'RESULTSET_' || REPLACE(UPPER(UUID_STRING()), '-', '_');
		CREATE OR REPLACE TEMPORARY TABLE IDENTIFIER(:tbl_result_set2) AS
			SELECT
				Column11 FROM
				SampleTable1;
		LET result_set2 CURSOR
		FOR
			SELECT
				*
			FROM
				IDENTIFIER(?);
		OPEN result_set2 USING (tbl_result_set2);
		return_arr := array_append(return_arr, :tbl_result_set2);
		OPEN result_set USING (tbl_result_set);
		return_arr := array_append(return_arr, :tbl_result_set);
		--** SSC-FDM-0020 - MULTIPLE RESULT SETS ARE RETURNED IN TEMPORARY TABLES **
		RETURN return_arr;
	END;
$$;

CALL spTwoOrMore();
```

##### Results

```none
[
  "RESULTSET_B5B0005D_1602_48B7_9EE4_62E1A28B000C",
  "RESULTSET_1371794D_7B77_4DA9_B42E_7981F35CEA9C"
]

ColumnA|ColumnB|ColumnC|
-------+-------+-------+
      2|2      |2      |
      1|1      |1      |

PersonID|LastName|FirstName|
--------+--------+---------+
       7|Davis   |Jannys   |
       5|Garcia  |Julia    |
       3|Brown   |William  |
       1|Smith   |Christian|
       6|Miller  |Peter    |
       4|Williams|Gracey   |
       2|Johnson |Jhon     |
```

#### Cursors With Binding Variables

The following cursor uses binding variables as the were condition to perform the query.

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE TestProcedure (IN param1 NUMBER, param2 VARCHAR(8), param3 VARCHAR(8))
DYNAMIC RESULT SETS 1
BEGIN
    DECLARE cursorExample CURSOR WITH RETURN ONLY FOR
        SELECT * FROM  TEST_TABLE
   	WHERE ColumnA = param1 AND ColumnB LIKE param2 and ColumnC LIKE param3;

    OPEN cursorExample;
END;
```

##### Result

```none
|ColumnA|ColumnB|ColumnC|
+-------+-------+-------+
|      2|2      |2      |
```

##### Snowflake Scripting

##### Cursor Code

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "TEST_TABLE" **
CREATE OR REPLACE PROCEDURE TestProcedure (PARAM1 NUMBER(38, 18), PARAM2 VARCHAR(8), PARAM3 VARCHAR(8))
RETURNS TABLE (
)
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      LET cursorExample CURSOR
      FOR
         SELECT
            * FROM
            TEST_TABLE
           	WHERE ColumnA = ?
            AND ColumnB ILIKE ?
            and ColumnC ILIKE ?;
      OPEN cursorExample USING (param1, param2, param3);
      RETURN TABLE(resultset_from_cursor(cursorExample));
   END;
$$;
```

##### Result

```none
|ColumnA|ColumnB|ColumnC|
+-------+-------+-------+
|      2|2      |2      |
```

#### Cursor For Loop

It is a type of loop that uses a cursor to fetch rows from a SELECT statement and then performs some processing on each row.

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE TestProcedure ()
DYNAMIC RESULT SETS 1
BEGIN
    FOR fUsgClass AS cUsgClass CURSOR FOR
        SELECT columnA FROM  TEST_TABLE
    DO
        INSERT INTO ResTable(Column1) VALUES (fUsgClass.columnA);
    END FOR;
END;

CALL TestProcedure();
SELECT * FROM ResTable;
```

##### Result

```none
|Column1|
+-------+
|      1|
|      2|
```

##### Snowflake Scripting

##### Cursor Code

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "TEST_TABLE", "ResTable" **
CREATE OR REPLACE PROCEDURE TestProcedure ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        !!!RESOLVE EWI!!! /*** SSC-EWI-0110 - TRANSFORMATION NOT PERFORMED DUE TO MISSING DEPENDENCIES ***/!!!
        temp_fUsgClass_columnA;
    BEGIN
        LET cUsgClass CURSOR
        FOR
            SELECT
                columnA FROM
                TEST_TABLE;
        --** SSC-PRF-0004 - THIS STATEMENT HAS USAGES OF CURSOR FOR LOOP **
        FOR fUsgClass IN cUsgClass DO
            temp_fUsgClass_columnA := fUsgClass.columnA;
            INSERT INTO ResTable (Column1)
            VALUES (:temp_fUsgClass_columnA);
        END FOR;
    END;
$$;

CALL TestProcedure();

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "ResTable" **
SELECT
    * FROM
    ResTable;
```

##### Result

```none
|Column1|
+-------+
|      1|
|      2|
```

#### Cursor Fetch inside a Loop

It allows one to retrieve rows from a result set one at a time and perform some processing on each row.

##### Teradata

##### Cursor Code

```sql
 REPLACE PROCEDURE teradata_fetch_inside_loop()
DYNAMIC RESULT SETS 1
BEGIN
    DECLARE col_name VARCHAR(255);
    DECLARE col_int INTEGER DEFAULT 1;
    DECLARE cursor_var CURSOR FOR SELECT columnA FROM TEST_TABLE;
    WHILE (col_int <> 0) DO
        FETCH cursor_var INTO col_name;
        INSERT INTO ResTable(Column1) VALUES (cursor_var.columnA);
        SET col_int = 0;
    END WHILE;
END;

CALL teradata_fetch_inside_loop();
SELECT * FROM ResTable;
```

##### Result

```none
|Column1|
+-------+
|      2|
```

##### Snowflake Scripting

##### Cursor Code

```sql
 CREATE OR REPLACE PROCEDURE teradata_fetch_inside_loop ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        col_name VARCHAR(255);
        col_int INTEGER DEFAULT 1;
    BEGIN

        LET cursor_var CURSOR
        FOR
            SELECT
                columnA FROM
                TEST_TABLE;
                WHILE (:col_int <> 0) LOOP
            --** SSC-PRF-0003 - FETCH INSIDE A LOOP IS CONSIDERED A COMPLEX PATTERN, THIS COULD DEGRADE SNOWFLAKE PERFORMANCE. **
                    FETCH cursor_var INTO col_name;
            INSERT INTO ResTable (Column1)
            VALUES (cursor_var.columnA);
            col_int := 0;
                END LOOP;
    END;
$$;

CALL teradata_fetch_inside_loop();

SELECT
    * FROM
    ResTable;
```

##### Result

```none
|Column1|
+-------+
|      2|
```

### Known Issues

The following parameters are not applicable in Snowflake Scripting.

#### 1. Declare

[ SCROLL/NO SCROLL ] Snowflake Scripting only supports FETCH NEXT.

[ READ-ONLY ] This is the default in Snowflake Scripting.

[ UPDATE ].

##### 2. Fetch

[ NEXT ] This is the default behavior in Snowflake Scripting.

[ FIRST ].

### Related EWIs

1. [SSC-FDM-0020](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Multiple result sets are returned in temporary tables.
2. [SSC-PRF-0003](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): Fetch inside a loop is considered a complex pattern, this could degrade Snowflake performance.
3. [SSC-PRF-0004](../../general/technical-documentation/issues-and-troubleshooting/performance-review/generalPRF.md): This statement has usages of cursor for loop.

## DECLARE CONTINUE HANDLER

Translation reference to convert Teradata DECLARE CONTINUE handler to Snowflake Scripting

### Description

> Handle completion conditions and exception conditions not severe enough to affect the flow of control.

For more information, see the [Teradata DECLARE CONTINUE handler documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Stored-Procedures-and-Embedded-SQL/Condition-Handling/DECLARE-HANDLER-CONTINUE-Type).

```sql
 DECLARE CONTINUE HANDLER FOR
  {
    { sqlstate_state_spec | condition_name } [,...] |

    { SQLEXCEPTION | SQLWARNING | NOT FOUND } [,...]

  } handler_action_statement ;
```

### Sample Source Patterns

#### DECLARE CONTINUE HANDLER

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE PURGING_ADD_TABLE
(
 IN inDatabaseName     	VARCHAR(30),
 IN inTableName    		VARCHAR(30)
)
BEGIN
 DECLARE vCHAR_SQLSTATE CHAR(5);
 DECLARE vSUCCESS       CHAR(5);

  DECLARE CONTINUE HANDLER FOR SQLSTATE 'T5628'
  BEGIN
     SET vCHAR_SQLSTATE = SQLCODE;
     SET vSUCCESS    = SQLCODE;
  END;

  SELECT 1;

END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE PURGING_ADD_TABLE
(INDATABASENAME VARCHAR(30), INTABLENAME VARCHAR(30)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  vCHAR_SQLSTATE CHAR(5);
  vSUCCESS       CHAR(5);
 BEGIN

  BEGIN
   SELECT
    1;
  EXCEPTION
   WHEN statement_error THEN
    LET errcode := :sqlcode
    LET sqlerrmsg := :sqlerrm
    IF (errcode = '904'
    AND contains(sqlerrmsg, 'invalid value')) THEN
     BEGIN
      vCHAR_SQLSTATE := SQLCODE;
      vSUCCESS := SQLCODE;
     END;
    ELSE
     RAISE
    END IF
  END
 END;
$$;
```

### Known Issues

#### DECLARE CONTINUE HANDLER FOR SQLSTATE

The support of declaring continue handlers for some SQLSTATE values is not currently supported by Snowflake Scripting.

##### Teradata

##### Query

```sql
 CREATE PROCEDURE declareConditionExample2 ( )
BEGIN
   DECLARE CONTINUE HANDLER FOR SQLSTATE 'UNSUPPORTED'
     BEGIN
       SET vCHAR_SQLSTATE = SQLCODE;
       SET vSUCCESS    = SQLCODE;
    END;
END;
```

##### Snowflake Scripting

```sql
CREATE OR REPLACE PROCEDURE declareConditionExample2 ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TD0004 - NOT SUPPORTED SQL EXCEPTION ON CONTINUE HANDLER ***/!!!
      DECLARE CONTINUE HANDLER FOR SQLSTATE 'UNSUPPORTED'
      BEGIN
         vCHAR_SQLSTATE := SQLCODE;
         vSUCCESS := SQLCODE;
      END;
   END;
$$;
```

### Related EWIs

1. [SSC-EWI-TD0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Not supported SQL Exception on continue handler.

## DECLARE CONDITION HANDLER

Translation reference to convert Teradata DECLARE CONDITION handler to Snowflake Scripting

### Description

> Assign a name to an SQLSTATE code, or declare a user-defined condition.

For more information, see the [Teradata DECLARE CONDITION handler documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/Condition-Handling/DECLARE-CONDITION).

```sql
 DECLARE condition_name CONDITION
    [ FOR SQLSTATE [ VALUE ] sqlstate_code ] ;
```

### Sample Source Patterns

#### DECLARE CONDITION

##### Teradata

##### Query

```sql
 CREATE PROCEDURE declareConditionExample ( )
BEGIN
    DECLARE DB_ERROR CONDITION;
    ...
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE declareConditionExample ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        DB_ERROR EXCEPTION;
    BEGIN
    END;
$$;
```

### Known Issues

#### DECLARE CONDITION FOR SQLSTATE

The support of declaring conditions for SQLSTATE values is not currently supported by Snowflake Scripting.

##### Teradata

##### Query

```sql
 CREATE PROCEDURE declareConditionExample2 ( )
BEGIN
    DECLARE ERROR_EXISTS CONDITION FOR SQLSTATE VALUE '42000';
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE declareConditionExample2 ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ERROR_EXISTS EXCEPTION;
    BEGIN
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'SET EXCEPTION DETAILS' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
-- ERROR_EXISTS CONDITION FOR SQLSTATE VALUE '42000';
    END;
$$;
```

### Related EWIs

1. [SSC-EWI-0058:](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) Functionality is not currently supported by Snowflake Scripting.

## DECLARE

Translation reference to convert Teradata DECLARE statement to Snowflake Scripting

### Description

> Declares one or more local variables.

For more information, see the [Teradata DECLARE documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/SQL-Control-Statements/DECLARE).

```sql
 DECLARE variable_name [, variable_name ]... DATA_TYPE [ DEFAULT default_value]
```

### Sample Source Patterns

#### Teradata

##### Query

```sql
 CREATE PROCEDURE declareExample ( )
BEGIN
    DECLARE COL_NAME, COL_TYPE VARCHAR(200) DEFAULT '' ;
    DECLARE COL_COUNT, COL_LEN INTEGER;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE declareExample ( )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        COL_NAME VARCHAR(200) DEFAULT '';
        COL_TYPE VARCHAR(200) DEFAULT '';
        COL_COUNT INTEGER;
        COL_LEN INTEGER;
    BEGIN

        RETURN 1;
    END;
$$;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## DML and DDL Objects

### Description

DML and DDL objects are translated in the same way regardless of whether they are inside stored procedures or not. For further information check the following links.

### Translation References

* [data-types.md](sql-translation-reference/data-types.md): Compare Teradata data types and their equivalents in Snowflake.
* [ddl](sql-translation-reference/ddl-teradata.md): Explore the translation of the Data Definition Language.
* [dml](sql-translation-reference/dml-teradata.md): Explore the translation of the Data Manipulation Language.
* [built-in-functions](sql-translation-reference/teradata-built-in-functions.md): Compare functions included in the runtime of both languages.

## EXCEPTION HANDLERS

Translation reference to convert Teradata EXCEPTION HANDLERS clause to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s single and multiple Exception Handlers are replaced by its equivalent handlers in Snowflake Scripting.

For more information, see the [Teradata EXCEPTION HANDLERS documentation](https://docs.teradata.com/r/zzfV8dn~lAaKSORpulwFMg/gH3xxgeVDpIqVBjEQfppyQ).

```sql
 DECLARE < handler_type > HANDLER
  FOR  < condition_value_list > < handler_action > ;
```

### Sample Source Patterns

#### SQLEXCEPTION HANDLER

##### Teradata

##### Single handler

```sql
 CREATE PROCEDURE handlerSample ()
BEGIN
    DECLARE EXIT HANDLER FOR SQLEXCEPTION
        INSERT INTO Proc_Error_Table ('procSample', 'Failed SqlException');
    SELECT * FROM Proc_Error_Table;
END;
```

##### Multiple handlers

```sql
 CREATE PROCEDURE handlerSample ()
BEGIN
    DECLARE ConditionByUser1 CONDITION;
    DECLARE EXIT HANDLER FOR SQLEXCEPTION
        INSERT INTO Proc_Error_Table ('procSample', 'Failed SqlException');
    DECLARE EXIT HANDLER FOR ConditionByUser1
        INSERT INTO Proc_Error_Table ('procSample', 'Failed ConditionByUser1');
    SELECT * FROM Proc_Error_Table;
END;
```

##### Snowflake Scripting

##### Single handler

```sql
 CREATE OR REPLACE PROCEDURE handlerSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN

        SELECT
            * FROM
            Proc_Error_Table;
    EXCEPTION
            WHEN other THEN
            INSERT INTO Proc_Error_Table
            VALUES ('procSample', 'Failed SqlException');
    END;
$$;
```

##### Multiple handlers

```sql
 CREATE OR REPLACE PROCEDURE handlerSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        ConditionByUser1 EXCEPTION;
    BEGIN

        SELECT
            * FROM
            Proc_Error_Table;
    EXCEPTION
            WHEN ConditionByUser1 THEN
            INSERT INTO Proc_Error_Table
            VALUES ('procSample', 'Failed ConditionByUser1');
            WHEN other THEN
            INSERT INTO Proc_Error_Table
            VALUES ('procSample', 'Failed SqlException');
    END;
$$;
```

#### User-Defined Handlers

##### Teradata

##### Query

```sql
 CREATE PROCEDURE handlerSample ()
BEGIN
    DECLARE EXIT HANDLER FOR Custom1, Custom2, Custom3
      BEGIN
        SET Message1 = 'custom1 and custom2 and custom3';
      END;
    SELECT * FROM Proc_Error_Table;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE handlerSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN

        SELECT
            * FROM
            Proc_Error_Table;
    EXCEPTION
            WHEN Custom1 OR Custom2 OR Custom3 THEN
            BEGIN
                    Message1 := 'custom1 and custom2 and custom3';
            END;
    END;
$$;
```

### Known Issues

#### CONTINUE Handler

> **Danger:**
>
> A ‘CONTINUE’ handler in Teradata allows the execution to be resumed after executing a statement with errors. This is not supported by the exception blocks in Snowflake Scripting. [Condition Handler Teradata reference documentation.](https://docs.teradata.com/r/CeAGk~BNtx~axcR0ed~5kw/EN6T2zEDlgBRvSKjw7shUg)

##### Teradata

##### Query

```sql
 CREATE PROCEDURE handlerSample ()
BEGIN
    DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO Proc_Error_Table ('spSample4', 'Failed SqlException');
    SELECT * FROM Proc_Error_Table;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE handlerSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-TD0004 - NOT SUPPORTED SQL EXCEPTION ON CONTINUE HANDLER ***/!!!
        DECLARE CONTINUE HANDLER FOR SQLEXCEPTION
        INSERT INTO Proc_Error_Table
        VALUES ('spSample4', 'Failed SqlException');
        SELECT
            * FROM
            Proc_Error_Table;
    END;
$$;
```

#### Other not supported handlers

> **Danger:**
>
> Handlers for SQLSTATE, SQLWARNING, and NOT FOUND are not supported

##### Teradata

##### Query

```sql
 CREATE PROCEDURE handlerSample ()
BEGIN
    DECLARE EXIT HANDLER FOR SQLSTATE '42002', SQLWARNING, NOT FOUND
        INSERT INTO Proc_Error_Table ('procSample', 'Failed SqlState or SqlWarning or Not Found');
    SELECT * FROM Proc_Error_Table;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE handlerSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/04/2024" }}'
EXECUTE AS CALLER
AS
$$
    BEGIN
--        !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'SQLSTATE, SQLWARNING, NOT-FOUND TYPES HANDLER' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!
--        DECLARE EXIT HANDLER FOR SQLSTATE '42002', SQLWARNING, NOT FOUND
--            INSERT INTO Proc_Error_Table ('procSample', 'Failed SqlState or SqlWarning or Not Found');
        SELECT
            * FROM
            Proc_Error_Table;
    END;
$$;
```

### Related EWIs

1. [SSC-EWI-0058](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.
2. [SSC-EWI-TD0004](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): Not supported SQL Exception on continue handler.

## EXECUTE/EXEC

Translation reference to convert Teradata EXECUTE or EXEC statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The Teradata `EXECUTE`statement allows the execution prepared dynamic SQL or macros, on the other hand exec only allows macros.

For more information regarding Teradata EXECUTE/EXEC, check [Macro Form](https://docs.teradata.com/r/Teradata-Database-SQL-Data-Manipulation-Language/June-2017/Statement-Syntax/EXECUTE-Macro-Form) and [Dynamic SQL Form](https://docs.teradata.com/r/Teradata-Database-SQL-Stored-Procedures-and-Embedded-SQL/June-2017/Dynamic-Embedded-SQL-Statements/Dynamic-SQL-Statement-Syntax/EXECUTE-Dynamic-SQL-Form)

```sql
 -- EXECUTE macro syntax
{EXECUTE | EXEC } macro_identifier [ (<parameter_definition>[, ...n] ) ] [;]

<parameter_definition>:= {parameter_name = constant_expression | constant_expresion}

-- EXECUTE prepared dynamic SQL syntax
EXECUTE prepare_indentifier [<using>|<usingDescriptor>]

<using>:= USING < host_variable >[, ...n]
<host_variable>:= [:] host_variable_name [[INDICATOR] :host_indicator_name]
<usingDescriptor>:= USING DESCRIPTOR [:] descript_area
```

### Sample Source Patterns

#### Setup data

The following code is necessary to execute the sample patterns present in this section.

##### Teradata

```sql
 -- Additional Params: -t JavaScript
CREATE TABLE inventory (
    product_name VARCHAR(50),
    price INTEGER
);

CREATE MACRO dummyMacro AS(
  SELECT * FROM INVENTORY;
);
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE inventory (
  product_name VARCHAR(50),
  price INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

CREATE OR REPLACE PROCEDURE dummyMacro ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  INSERT_TEMP(`SELECT
   *
  FROM
   INVENTORY`,[]);
  return tablelist;
$$;
```

#### Execute prepared statement

##### Teradata

##### Execute

```sql
 CREATE PROCEDURE InsertProductInInventory(IN productName VARCHAR(50), IN price INTEGER)
BEGIN
    DECLARE dynamicSql CHAR(200);
    SET dynamicSql = 'INSERT INTO INVENTORY VALUES( ?, ?)';
    PREPARE preparedSql FROM dynamicSql;
    EXECUTE preparedSql USING productName, price;

END;

CALL InsertProductInInventory('''Chocolate''', 75);
CALL InsertProductInInventory('''Sugar''', 65);
CALL InsertProductInInventory('''Rice''', 100);
```

##### Snowflake Scripting

##### Execute

```sql
 CREATE OR REPLACE PROCEDURE InsertProductInInventory (PRODUCTNAME VARCHAR(50), PRICE INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/24/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        dynamicSql CHAR(200);
    BEGIN

        dynamicSql := 'INSERT INTO INVENTORY
VALUES (?, ?)';
        !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'PREPARE STATEMENT' NODE ***/!!!
            PREPARE preparedSql FROM dynamicSql;
        !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
        EXECUTE IMMEDIATE dynamicSql;
    END;
$$;

CALL InsertProductInInventory('''Chocolate''', 75);

CALL InsertProductInInventory('''Sugar''', 65);

CALL InsertProductInInventory('''Rice''', 100);
```

#### Execute macro statement

##### Teradata

##### Execute

```sql
 EXECUTE dummyMacro;
```

##### Result

```none
+---------------+-------+
| product_name  | price |
+---------------+-------+
| 'Chocolate'   | 75    |
+---------------+-------+
| 'Sugar'       | 65    |
+---------------+-------+
| 'Rice'        | 100   |
+---------------+-------+
```

##### Snowflake Scripting

##### Execute

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
EXECUTE IMMEDIATE dummyMacro;
```

#### PREPARE with cursor pattern and USING clause

When a `PREPARE` statement is used with a cursor that is opened with a `USING` clause, SnowConvert AI transforms the pattern into `EXECUTE IMMEDIATE` with the USING clause bound at execution time.

##### Teradata

##### Query

```sql
REPLACE PROCEDURE fetch_cursor_with_using(OUT result INTEGER)
BEGIN
    DECLARE SQL_string VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTable WHERE col1 = ?';
    DECLARE filter_value INTEGER DEFAULT 5;

    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string;
    OPEN C1 USING filter_value;
    FETCH C1 INTO result;
    CLOSE C1;
END;
```

##### Snowflake Scripting

##### Query

```sql
CREATE OR REPLACE PROCEDURE fetch_cursor_with_using (RESULT OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQL_string VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTable
WHERE col1 = ?';
    filter_value INTEGER DEFAULT 5;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN
    prepareQuery_aux_sql := SQL_string;
    --** SSC-FDM-TD0044 - USING VARIABLES BOUND AT EXECUTE IMMEDIATE TIME INSTEAD OF OPEN CURSOR TIME. CURSOR IS FIXED AT LET CURSOR FOR RESULTSET DECLARATION; REASSIGNING THE RESULTSET VARIABLE OR RE-EXECUTING PREPARE IN A LOOP WILL NOT UPDATE THE CURSOR. **
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql USING (filter_value)
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      result;
    CLOSE CURSOR_S1_INSTANCE_V0;
  END;
$$;
```

**Transformation details:**

1. **PREPARE statement** is converted to `EXECUTE IMMEDIATE` with the USING clause
2. **Cursor declaration** is transformed to `LET CURSOR ... FOR resultset`
3. **OPEN USING** clause is moved to the EXECUTE IMMEDIATE statement
4. **Cursor instances** receive unique names (e.g., `CURSOR_S1_INSTANCE_V0`) when reused in loops

**Behavioral note:** In Teradata, the USING variables are bound when `OPEN` is called. In Snowflake, they are bound when `EXECUTE IMMEDIATE` runs (at PREPARE time). See [SSC-FDM-TD0044](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md) for implications.

### Related EWIs

1. [SSC-EWI-0030:](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md) The statement below has usages of dynamic SQL.
2. [SSC-EWI-0073](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Pending Functional Equivalence Review.
3. [SSC-FDM-TD0044](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): PREPARE with USING variables bound at EXECUTE IMMEDIATE time instead of OPEN CURSOR time.
4. [SSC-EWI-TD0098](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md): PREPARE with USING clause containing non-variable expressions cannot be automatically migrated.

## EXECUTE IMMEDIATE

Translation reference to convert Teradata EXECUTE IMMENDIATE statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

The Teradata `EXECUTE IMMEDIATE` statement allows the execution of dynamic SQL contained on variables or string literals.

For more information, see the [Teradata EXECUTE IMMEDIATE documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/Dynamic-Embedded-SQL-Statements/Dynamic-SQL-Statement-Syntax/EXECUTE-IMMEDIATE).

```sql
 -- EXECUTE IMMEDIATE syntax
EXECUTE IMMEDIATE <dynamic_statement>

<dynamic_statement> := {string_literal | string_variable}
```

### Sample Source Patterns

#### Setup data

The following code is necessary to execute the sample patterns present in this section.

##### Teradata

```sql
 CREATE TABLE inventory (
    product_name VARCHAR(50),
    price INTEGER
);
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE inventory (
    product_name VARCHAR(50),
    price INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Execute Example

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE InsertProductInInventory(IN productName VARCHAR(50), IN price INTEGER)
BEGIN
	DECLARE insertStatement VARCHAR(100);
	SET insertStatement = 'INSERT INTO INVENTORY VALUES(' || productName || ', ' || price || ')';
    EXECUTE IMMEDIATE insertStatement;
END;

CALL InsertProductInInventory('''Chocolate''', 75);
CALL InsertProductInInventory('''Sugar''', 65);
CALL InsertProductInInventory('''Rice''', 100);

SELECT product_name, price FROM inventory;
```

##### Result

```none
+--------------+-------+
| product_name | price |
+--------------+-------+
| Chocolate    | 75    |
+--------------+-------+
| Sugar        | 65    |
+--------------+-------+
| Rice         | 100   |
+--------------+-------+
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE InsertProductInInventory (PRODUCTNAME VARCHAR(50), PRICE INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/24/2024" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		insertStatement VARCHAR(100);
	BEGIN

		insertStatement := 'INSERT INTO INVENTORY
VALUES (' || productName || ', ' || price || ')';
		!!!RESOLVE EWI!!! /*** SSC-EWI-0030 - THE STATEMENT BELOW HAS USAGES OF DYNAMIC SQL. ***/!!!
		EXECUTE IMMEDIATE insertStatement;
	END;
$$;

CALL InsertProductInInventory('''Chocolate''', 75);

CALL InsertProductInInventory('''Sugar''', 65);

CALL InsertProductInInventory('''Rice''', 100);

SELECT
	product_name,
	price FROM
	inventory;
```

##### Result

```none
+--------------+-------+
| PRODUCT_NAME | PRICE |
+--------------+-------+
| Chocolate    | 75    |
+--------------+-------+
| Sugar        | 65    |
+--------------+-------+
| Rice         | 100   |
+--------------+-------+
```

##### Result

```none
column1|column2                  |column3|
-------+-------------------------+-------+
      3|Mundo3                   |    3.3|
```

### Related EWIs

1. [SSC-EWI-0030](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): The statement below has usages of dynamic SQL.

## FUNCTION OPTIONS OR DATA ACCESS

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement is****removed from the migration****because it is a non-relevant syntax. It means that it is not required in Snowflake.**

### Description

Functions options or data access options are statements used in functions on the declaration part to specify certain characteristics. These can be:

* `CONTAINS SQL`
* `SQL SECURITY DEFINER`
* `COLLATION INVOKER`
* `SPECIFIC FUNCTION_NAME`

### Sample Source Patterns

#### Function Options

Notice that in this example the function options have been removed because they are not required in Snowflake.

##### Teradata

```sql
 CREATE FUNCTION sumValues(A INTEGER, B INTEGER)
   RETURNS INTEGER
   LANGUAGE SQL
   CONTAINS SQL
   SQL SECURITY DEFINER
   SPECIFIC sumTwoValues
   COLLATION INVOKER
   INLINE TYPE 1
   RETURN A + B;
```

##### Snowflake

```sql
 CREATE OR REPLACE FUNCTION sumValues (A INTEGER, B INTEGER)
   RETURNS INTEGER
   LANGUAGE SQL
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
   AS
   $$
      A + B
   $$;
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## GET DIAGNOSTICS EXCEPTION

Translation reference to convert Teradata GET DIAGNOSTICS EXCEPTION statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> GET DIAGNOSTICS retrieves information about successful, exception, or completion conditions from the Diagnostics Area.

For more information, see the [Teradata GET DIAGNOSTICS documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/Condition-Handling/GET-DIAGNOSTICS).

```sql
 GET DIAGNOSTICS
{
  [ EXCEPTION < condition_number >
    [ < parameter_name | variable_name > = < information_item > ]...
  ]
  |
  [ < parameter_name | variable_name > = < information_item > ]...
}
```

### Sample Source Patterns

#### Teradata

##### Query

```sql
 CREATE PROCEDURE getDiagnosticsSample ()
BEGIN
    DECLARE V_MESSAGE, V_CODE VARCHAR(200);
    DECLARE V_Result INTEGER;

    SELECT c1 INTO V_Result FROM tab1;
    GET DIAGNOSTICS EXCEPTION 1
        V_MESSAGE = Message_Text,
        V_CODE = RETURNED_SQLSTATE;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE getDiagnosticsSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        V_MESSAGE VARCHAR(200);
        V_CODE VARCHAR(200);
        V_Result INTEGER;
    BEGIN

        SELECT
            c1 INTO
            :V_Result
        FROM
            tab1;
            V_MESSAGE := SQLERRM;
            V_CODE := SQLSTATE;
    END;
$$;
```

### Known Issues

#### CLASS_ORIGIN, CONDITION_NUMBER

> **Danger:**
>
> The use of GET DIAGNOSTICS for CLASS_ORIGIN, CONDITION_NUMBER is not supported

##### Teradata

##### Query

```sql
 CREATE PROCEDURE getDiagnosticsSample ()
BEGIN
    DECLARE V_MESSAGE, V_CODE VARCHAR(200);
    DECLARE V_Result INTEGER;

    SELECT c1 INTO V_Result FROM tab1;
    GET DIAGNOSTICS EXCEPTION 5
        V_CLASS = CLASS_ORIGIN,
        V_COND = CONDITION_NUMBER;
END;
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE getDiagnosticsSample ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "06/18/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        V_MESSAGE VARCHAR(200);
        V_CODE VARCHAR(200);
        V_Result INTEGER;
    BEGIN

        SELECT
            c1 INTO
            :V_Result
        FROM
            tab1;
--            V_CLASS = CLASS_ORIGIN
                                  !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'GET DIAGNOSTICS DETAIL FOR CLASS_ORIGIN' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

--            V_COND = CONDITION_NUMBER
                                     !!!RESOLVE EWI!!! /*** SSC-EWI-0058 - FUNCTIONALITY FOR 'GET DIAGNOSTICS DETAIL FOR CONDITION_NUMBER' IS NOT CURRENTLY SUPPORTED BY SNOWFLAKE SCRIPTING ***/!!!

    END;
$$;
```

### Related EWIs

1. [SSC-EWI-0058](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Functionality is not currently supported by Snowflake Scripting.

## IF

Translation reference to convert Teradata IF statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> Provides conditional execution based on the truth value of a condition.

For more information, see the [Teradata IF documentation](https://docs.teradata.com/r/zzfV8dn~lAaKSORpulwFMg/BER5lYjGTnRKex5a8eznnA).

```sql
 IF conditional_expression THEN
     statement
     [ statement ]...
[ ELSEIF conditional_expression THEN
     statement
     [ statement ]... ]...
[ ELSE
     statement
     [ statement ]... ]
END IF;
```

### Sample Source Patterns

#### Sample auxiliary table

##### Teradata

```sql
 CREATE TABLE if_table(col1 varchar(30));
```

##### Snowflake

```sql
 CREATE OR REPLACE TABLE if_table (
col1 varchar(30))
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Possible IF variations

##### Teradata

##### Code 1

```sql
 CREATE PROCEDURE ifExample1 ( flag NUMBER )
BEGIN
   IF flag = 1 THEN
      INSERT INTO if_table(col1) VALUES ('one');
   END IF;
END;

CALL ifExample1(1);
SELECT * FROM if_table;
```

##### Code 2

```sql
 CREATE PROCEDURE ifExample2 ( flag NUMBER )
BEGIN
   IF flag = 1 THEN
      INSERT INTO if_table(col1) VALUES ('one');
   ELSE
      INSERT INTO if_table(col1) VALUES ('Unexpected input.');
   END IF;
END;

CALL ifExample2(2);
SELECT * FROM if_table;
```

##### Code 3

```sql
 CREATE PROCEDURE ifExample3 ( flag NUMBER )
BEGIN
   IF flag = 1 THEN
      INSERT INTO if_table(col1) VALUES ('one');
   ELSEIF flag = 2 THEN
      INSERT INTO if_table(col1) VALUES ('two');
   ELSEIF flag = 3 THEN
      INSERT INTO if_table(col1) VALUES ('three');
   END IF;
END;

CALL ifExample3(3);
SELECT * FROM if_table;
```

##### Code 4

```sql
 CREATE PROCEDURE ifExample4 ( flag NUMBER )
BEGIN
   IF flag = 1 THEN
      INSERT INTO if_table(col1) VALUES ('one');
   ELSEIF flag = 2 THEN
      INSERT INTO if_table(col1) VALUES ('two');
   ELSEIF flag = 3 THEN
      INSERT INTO if_table(col1) VALUES ('three');
   ELSE
      INSERT INTO if_table(col1) VALUES ('Unexpected input.');
   END IF;
END;

CALL ifExample4(4);
SELECT * FROM if_table;
```

##### Result 1

```none
|COL1|
|----|
|one |
```

##### Result 2

```none
|COL1             |
|-----------------|
|Unexpected input.|
```

##### Result 3

```none
|COL1 |
|-----|
|three|
```

##### Result 4

```none
|COL1             |
|-----------------|
|Unexpected input.|
```

##### Snowflake Scripting

##### Query 1

```sql
 CREATE OR REPLACE PROCEDURE ifExample1 (FLAG NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      IF (:flag = 1) THEN
         INSERT INTO if_table (col1)
         VALUES ('one');
      END IF;
   END;
$$;

CALL ifExample1(1);

SELECT
   * FROM
   if_table;
```

##### Query 2

```sql
 CREATE OR REPLACE PROCEDURE ifExample2 (FLAG NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      IF (:flag = 1) THEN
         INSERT INTO if_table (col1)
         VALUES ('one');
      ELSE
         INSERT INTO if_table (col1)
         VALUES ('Unexpected input.');
      END IF;
   END;
$$;

CALL ifExample2(2);

SELECT
   * FROM
   if_table;
```

##### Query 3

```sql
 CREATE OR REPLACE PROCEDURE ifExample3 (FLAG NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      IF (:flag = 1) THEN
         INSERT INTO if_table (col1)
         VALUES ('one');
      ELSEIF (:flag = 2) THEN
         INSERT INTO if_table (col1)
         VALUES ('two');
      ELSEIF (:flag = 3) THEN
         INSERT INTO if_table (col1)
         VALUES ('three');
      END IF;
   END;
$$;

CALL ifExample3(3);

SELECT
   * FROM
   if_table;
```

##### Query 4

```sql
 CREATE OR REPLACE PROCEDURE ifExample4 (FLAG NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
   BEGIN
      IF (:flag = 1) THEN
         INSERT INTO if_table (col1)
         VALUES ('one');
      ELSEIF (:flag = 2) THEN
         INSERT INTO if_table (col1)
         VALUES ('two');
      ELSEIF (:flag = 3) THEN
         INSERT INTO if_table (col1)
         VALUES ('three');
      ELSE
         INSERT INTO if_table (col1)
         VALUES ('Unexpected input.');
      END IF;
   END;
$$;

CALL ifExample4(4);

SELECT
   * FROM
   if_table;
```

##### Result 1

```none
|COL1|
|----|
|one |
```

##### Result 2

```none
|COL1             |
|-----------------|
|Unexpected input.|
```

##### Result 3

```none
|COL1 |
|-----|
|three|
```

##### Result 4

```none
|COL1             |
|-----------------|
|Unexpected input.|
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## LOCKING FOR ACCESS

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement is****removed from the migration****because it is a non-relevant syntax. It means that it is not required in Snowflake.**

### Description

The functionality of locking a row in Teradata is related to the access and the privileges. Revire the following [documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Manipulation-Language/Statement-Syntax/LOCKING-Request-Modifier/Usage-Notes/Using-LOCKING-ROW) to know more.

### Sample Source Patterns

#### Locking row

Notice that in this example the `LOCKING ROW FOR ACCESS` has been deleted. This is because Snowflake handles accesses with roles and privileges. The statement is not required.

##### Teradata

```sql
 REPLACE VIEW SCHEMA2.VIEW1
AS
LOCKING ROW FOR ACCESS
SELECT * FROM SCHEMA1.TABLE1;
```

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "SCHEMA1.TABLE1" **
CREATE OR REPLACE VIEW SCHEMA2.VIEW1
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
--** SSC-FDM-0001 - VIEWS SELECTING ALL COLUMNS FROM A SINGLE TABLE ARE NOT REQUIRED IN SNOWFLAKE AND MAY IMPACT PERFORMANCE. **
SELECT
* FROM
SCHEMA1.TABLE1;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Views selecting all columns from a single table are not required in Snowflake.
2. [SSC-FDM-0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.

## LOOP

Translation reference to convert Teradata LOOP statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s `LOOP` statement is translated to Snowflake Scripting `LOOP` syntax.

For more information, see the [Teradata LOOP documentation](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Stored-Procedures-and-Embedded-SQL/March-2019/SQL-Control-Statements/LOOP).

```sql
 [label_name:] LOOP
    { sql_statement }
END LOOP [label_name];
```

### Sample Source Patterns

#### Teradata

##### Loop

```sql
 CREATE PROCEDURE loopProcedure(OUT resultCounter INTEGER)
BEGIN
    DECLARE counter INTEGER DEFAULT 0;

    customeLabel: LOOP
    	SET counter = counter + 1;
	IF counter = 10 THEN
	    LEAVE customeLabel;
	END IF;
    END LOOP customeLabel;

    SET resultCounter = counter;
END;

CALL loopProcedure(:?);
```

##### Result

```sql
 |resultCounter|
|-------------|
|10           |
```

##### Snowflake Scripting

##### Loop

```sql
 CREATE OR REPLACE PROCEDURE loopProcedure (RESULTCOUNTER OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		counter INTEGER DEFAULT 0;
	BEGIN

		LOOP
			counter := counter + 1;
			IF (:counter = 10) THEN
				BREAK CUSTOMELABEL;
			END IF;
		END LOOP CUSTOMELABEL;
		resultCounter := counter;
	END;
$$;

CALL loopProcedure(:?);
```

##### Result

```sql
 |LOOPPROCEDURE|
|-------------|
|10           |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## OUTPUT PARAMETERS

This article is about the current transformation of the output parameters and how their functionality is being emulated.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

An **output parameter** is a parameter whose value is passed out of the stored procedure, back to the calling statement. Snowflake has direct support for output parameters.

### Sample Source Patterns

#### Single out parameter

##### Teradata

```sql
CREATE PROCEDURE demo.proc_with_single_output_parameters(OUT param1 NUMBER)
BEGIN
 SET param1 = 100;
END;

REPLACE PROCEDURE demo.proc_calling_proc_with_single_output_parameters ()
BEGIN
  DECLARE mytestvar NUMBER;
  CALL demo.proc_with_single_output_parameters(mytestvar);
  INSERT INTO demo.TABLE20 VALUES(mytestvar,432);
END;
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE demo.proc_with_single_output_parameters (PARAM1 OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/24/2024" }}'
EXECUTE AS CALLER
AS
$$
 BEGIN
  param1 := 100;
 END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "demo.TABLE20" **
CREATE OR REPLACE PROCEDURE demo.proc_calling_proc_with_single_output_parameters ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/24/2024" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  mytestvar NUMBER(38, 18);
 BEGIN

  CALL demo.proc_with_single_output_parameters(:mytestvar);
  INSERT INTO demo.TABLE20
  VALUES (:mytestvar,432);
 END;
$$;
```

#### Multiple out parameter

##### Teradata

```sql
 CREATE PROCEDURE demo.proc_with_multiple_output_parameters(OUT param1 NUMBER, INOUT param2 NUMBER)
BEGIN
  SET param1 = param2;
  SET param2 = 32;
END;

CREATE PROCEDURE demo.proc_calling_proc_with_multiple_output_parameters ()
BEGIN
    DECLARE var1  NUMBER;
    DECLARE var2  NUMBER;
    SET var2 = 34;
    CALL demo.proc_with_multiple_output_parameters(var1, var2);
    INSERT INTO demo.TABLE20 VALUES(var1,var2);
END;
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE demo.proc_with_multiple_output_parameters (PARAM1 OUT NUMBER(38, 18), PARAM2 OUT NUMBER(38, 18))
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  BEGIN
    param1 := param2;
    param2 := 32;
  END;
$$;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "demo.TABLE20" **
CREATE OR REPLACE PROCEDURE demo.proc_calling_proc_with_multiple_output_parameters ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    var1 NUMBER(38, 18);
    var2 NUMBER(38, 18);
  BEGIN

    var2 := 34;
    CALL demo.proc_with_multiple_output_parameters(:var1, :var2);
    INSERT INTO demo.TABLE20
    VALUES (:var1, :var2);
  END;
$$;
```

### Related EWIs

No related EWIs.

## PREPARE

Translation specification to convert Teradata PREPARE statement to Snowflake Scripting. This section review the PREPARE pattern related to a cursor logic.

### Description

> Prepares the dynamic DECLARE CURSOR statement to allow the creation of different result sets. Allows dynamic parameter markers.

For more information, please review the following [documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Stored-Procedures-and-Embedded-SQL/SQL-Cursor-Control-and-DML-Statements/PREPARE).

**Teradata syntax**:

```sql
 PREPARE statement_name FROM { 'statement_string' | statement_string_variable } ;
```

Where:

* **statement_name** is the same identifier as `statement_name` in a **DECLARE CURSOR** statement.
* **statement_string** is the SQL text that is to be executed dynamically.
* **statement_string_variable** is the name of an SQL local variable, or an SQL parameter or string variable, that contains the SQL text string to be executed dynamically.

> **Note:**
>
> **Important information**
>
> **For this transformation, the cursors are renamed since they cannot be dynamically updated.**

### Sample Source Patterns

#### Data setting for examples

For this example, please use the following complementary queries in the case that you want to run each case.

##### Teradata

```sql
 CREATE TABLE MyTemporaryTable(
    Col1  INTEGER
);

INSERT INTO MyTemporaryTable(col1) VALUES (1);
SELECT * FROM databaseTest.MyTemporaryTable;

CREATE TABLE MyStatusTable (
    Col1  VARCHAR(2)
);
SELECT * FROM MyStatusTable;
```

##### Snowflake

```sql
 CREATE TABLE MyTemporaryTable (
    Col1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

INSERT INTO MyTemporaryTable (col1) VALUES (1);

SELECT * FROM MyTemporaryTable;

    CREATE TABLE MyStatusTable (
    Col1 VARCHAR(2)
   )
    COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT * FROM MyStatusTable;
```

#### Simple scenario

This example reviews the functionality for the cases where a single cursor is being used one single time.

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE simple_scenario()
BEGIN
    --Variables for the example's procedure_results
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT * FROM MyTemporaryTable';
    DECLARE procedure_result INTEGER DEFAULT 0;

    -- Actual Cursor usage
    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    FETCH C1 INTO procedure_result;
    INSERT INTO databaseTest.MyStatusTable(Col1) VALUES (procedure_result);
    CLOSE C1;
END;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE simple_scenario ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ ""origin"": ""sf_sc"", ""name"": ""snowconvert"", ""version"": {  ""major"": 0,  ""minor"": 0,  ""patch"": ""0"" }, ""attributes"": {  ""component"": ""none"",  ""convertedOn"": ""01/01/0001"" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --Variables for the example's procedure_results
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   * FROM
   MyTemporaryTable';
    procedure_result INTEGER DEFAULT 0;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN

    -- Actual Cursor usage

    prepareQuery_aux_sql := SQL_string_sel;
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      procedure_result;
    INSERT INTO databaseTest.MyStatusTable (Col1)
    VALUES (procedure_result);
    CLOSE CURSOR_S1_INSTANCE_V0;
  END;
$$;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

#### Simple scenario with RETURN ONLY

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE simple_scenario()
DYNAMIC RESULT SETS 1
BEGIN
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT * FROM MyTemporaryTable';
    DECLARE procedure_result VARCHAR(100);
    DECLARE C1 CURSOR WITH RETURN ONLY FOR S1;

    SET procedure_result = '';
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
END;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE simple_scenario ()
RETURNS TABLE (
)
LANGUAGE SQL
COMMENT = '{ ""origin"": ""sf_sc"", ""name"": ""snowconvert"", ""version"": {  ""major"": 0,  ""minor"": 0,  ""patch"": ""0"" }, ""attributes"": {  ""component"": ""none"",  ""convertedOn"": ""01/01/0001"" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   * FROM
   MyTemporaryTable';
    procedure_result VARCHAR(100);
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN

    procedure_result := '';
    prepareQuery_aux_sql := SQL_string_sel;
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    RETURN TABLE(resultset_from_cursor(CURSOR_S1_INSTANCE_V0));
  END;
$$;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

#### Reused cursor case

##### Teradata

##### Query

```sql
 CREATE PROCEDURE fetch_simple_reused_cursor(OUT procedure_result INTEGER)
BEGIN
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTemporaryTable WHERE col1 = 1';

    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    FETCH C1 INTO procedure_result;
    CLOSE C1;

    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    FETCH C1 INTO procedure_result;
    CLOSE C1;
END;
```

##### Output

```none
No returning information.
```

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_simple_reused_cursor (
--                                                        OUT
                                                            PROCEDURE_RESULT INTEGER)
RETURNS VARIANT
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/24/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = 1';
        S1 RESULTSET;
        prepareQuery_aux_sql VARCHAR;
    BEGIN

        prepareQuery_aux_sql := SQL_string_sel;
        S1 := (
            EXECUTE IMMEDIATE prepareQuery_aux_sql
        );
        LET CURSOR_S1_INSTANCE_V0 CURSOR
        FOR
            S1;
        OPEN CURSOR_S1_INSTANCE_V0;
            FETCH
            CURSOR_S1_INSTANCE_V0
        INTO procedure_result;
            CLOSE CURSOR_S1_INSTANCE_V0;
        prepareQuery_aux_sql := SQL_string_sel;
        S1 := (
            EXECUTE IMMEDIATE prepareQuery_aux_sql
        );
        LET CURSOR_S1_INSTANCE_V1 CURSOR
        FOR
            S1;
        OPEN CURSOR_S1_INSTANCE_V1;
            FETCH
            CURSOR_S1_INSTANCE_V1
        INTO procedure_result;
            CLOSE CURSOR_S1_INSTANCE_V1;
        RETURN procedure_result;
    END;
$$;
```

##### Output

```none
No returning information.
```

#### Modified query before usage

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE fetch_modified_query_cursor()
BEGIN
    --Variables for the example's procedure_results
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTemporaryTable WHERE col1 = 1';
    DECLARE procedure_result INTEGER DEFAULT 0;
    -- Actual Cursor usages
    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;

    -- This modification does not take effect since S1 is already staged for the Cursor
    SET SQL_string_sel = 'SELECT col1 FROM MyTemporaryTable WHERE col1 = 0';
    OPEN C1;
    FETCH C1 INTO procedure_result;
    INSERT INTO databaseTest.MyStatusTable(Col1) VALUES (procedure_result);
    CLOSE C1;
END;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_modified_query_cursor ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ ""origin"": ""sf_sc"", ""name"": ""snowconvert"", ""version"": {  ""major"": 0,  ""minor"": 0,  ""patch"": ""0"" }, ""attributes"": {  ""component"": ""none"",  ""convertedOn"": ""01/01/0001"" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --Variables for the example's procedure_results
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = 1';
    procedure_result INTEGER DEFAULT 0;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN

    -- Actual Cursor usages

    prepareQuery_aux_sql := SQL_string_sel;
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    -- This modification does not take effect since S1 is already staged for the Cursor
    SQL_string_sel := 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = 0';
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      procedure_result;
    INSERT INTO databaseTest.MyStatusTable (Col1)
    VALUES (procedure_result);
    CLOSE CURSOR_S1_INSTANCE_V0;
  END;
$$;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

#### Simple cursor combined with no PREPARE pattern

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE fetch_cursor_ignored_query_cursor()
BEGIN
    --Variables for the example's procedure_results
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT * FROM MyTemporaryTable WHERE col1 = 1';
    DECLARE intermediate_result INTEGER;
    DECLARE procedure_result INTEGER DEFAULT 0;
    DECLARE C2 CURSOR FOR SELECT col1 FROM MyTemporaryTable WHERE col1 = 1;

    -- Actual Cursor usage
    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    FETCH C1 INTO intermediate_result;
    CLOSE C1;
    SET procedure_result = intermediate_result;
    INSERT INTO databaseTest.MyStatusTable(Col1) VALUES (procedure_result);

    OPEN C2;
    FETCH C2 INTO intermediate_result;
    CLOSE C2;
    SET procedure_result = procedure_result + intermediate_result;
END;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_cursor_ignored_query_cursor ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ ""origin"": ""sf_sc"", ""name"": ""snowconvert"", ""version"": {  ""major"": 0,  ""minor"": 0,  ""patch"": ""0"" }, ""attributes"": {  ""component"": ""none"",  ""convertedOn"": ""01/01/0001"" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --Variables for the example's procedure_results
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   * FROM
   MyTemporaryTable
WHERE col1 = 1';
    intermediate_result INTEGER;
    procedure_result INTEGER DEFAULT 0;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN

    -- Actual Cursor usage
    LET C2 CURSOR
    FOR
      SELECT
        col1
      FROM
        MyTemporaryTable
      WHERE
        col1 = 1;
    prepareQuery_aux_sql := SQL_string_sel;
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      intermediate_result;
    CLOSE CURSOR_S1_INSTANCE_V0;
    procedure_result := intermediate_result;
    INSERT INTO databaseTest.MyStatusTable (Col1)
    VALUES (procedure_result);
    OPEN C2;
    FETCH
      C2
    INTO
      intermediate_result;
    CLOSE C2;
    procedure_result := procedure_result + intermediate_result;
  END;
$$;

CALL databaseTest.simple_scenario();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| 1 |

#### Prepare combined with nested cursors

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE fetch_nested_cursor()
BEGIN
    --Variables for the example's procedure_results
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTemporaryTable WHERE col1 = 1';
    DECLARE intermediate_result INTEGER;
    DECLARE C2 CURSOR FOR SELECT col1 FROM MyTemporaryTable WHERE col1 = 1;

    -- Actual Cursor usage
    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    OPEN C2;
    FETCH C2 INTO intermediate_result;

    CLOSE C2;
    FETCH C1 INTO intermediate_result;
    CLOSE C1;
END;
```

##### Output

```none
No returning information.
```

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE fetch_nested_cursor ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ ""origin"": ""sf_sc"", ""name"": ""snowconvert"", ""version"": {  ""major"": 0,  ""minor"": 0,  ""patch"": ""0"" }, ""attributes"": {  ""component"": ""none"",  ""convertedOn"": ""01/01/0001"" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --Variables for the example's procedure_results
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = 1';
    intermediate_result INTEGER;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN

    -- Actual Cursor usage
    LET C2 CURSOR
    FOR
      SELECT
        col1
      FROM
        MyTemporaryTable
      WHERE
        col1 = 1;
    prepareQuery_aux_sql := SQL_string_sel;
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    OPEN C2;
    FETCH
      C2
    INTO
      intermediate_result;
    CLOSE C2;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      intermediate_result;
    CLOSE CURSOR_S1_INSTANCE_V0;
  END;
$$;
```

##### Output

```none
No returning information.
```

#### Variable markers without variable reordering

> **Warning:**
>
> **This case is not supported yet.**

##### Teradata

##### Query

```sql
 CREATE PROCEDURE PREPARE_ST_TEST()
BEGIN
    DECLARE ctry_list VARCHAR(100);
    DECLARE SQL_string_sel VARCHAR(255);
    DECLARE col_value NUMBER;

    DECLARE C1 CURSOR FOR S1;

    SET ctry_list = '';
    SET col_value = 1;
    SET SQL_string_sel = 'SELECT * FROM databaseTest.MyTemporaryTable where Col1 = ?';
    PREPARE S1 FROM SQL_string_sel;
    OPEN C1 USING col_value;
    FETCH C1 INTO ctry_list;
    IF (ctry_list <> '') THEN
        INSERT INTO databaseTest.MyStatusTable(col1) VALUES ('ok');
    END IF;
    CLOSE C1;
END;

CALL PREPARE_ST_TEST();
SELECT * FROM MyStatusTable;
```

##### Output

| Col1 |
| --- |
| ok |

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE PREPARE_ST_TEST_MARKERS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        p1 RESULTSET;
        p1_sql VARCHAR DEFAULT '';

    BEGIN
        LET ctry_list VARCHAR(100);
        LET SQL_string_sel VARCHAR(255);
        LET col_value NUMBER(38, 18);
        LET S1 RESULTSET;

        ctry_list := '';

        col_value := 1;

        SQL_string_sel := 'SELECT * FROM MyTemporaryTable WHERE Col1 = ?';

        p1_sql := SQL_string_sel;
        S1 := (
            EXECUTE IMMEDIATE p1_sql USING (col_value)
        );
        LET C1 CURSOR FOR S1;

        OPEN C1;
            FETCH C1 INTO ctry_list;
        IF (RTRIM(ctry_list) <> '') THEN
            INSERT INTO MyStatusTable (col1)
            VALUES ('ok');
        END IF;
            CLOSE C1;
    END;
$$;
```

##### Output

| Col1 |
| --- |
| ok |

#### Variable markers with variable reordering

> **Warning:**
>
> **This case is not supported yet.**

> **Note:**
>
> When there are variables setting the value into different ones between the `PREPARE` statement and `OPEN` cursor in Teradata, It is necessary to move this variable before the `EXECUTE IMMEDIATE` in Snowflake. So, the dynamic variable information is updated at the moment of running the dynamic query.

##### Teradata

##### Query

```sql
 CREATE PROCEDURE PREPARE_ST_TEST()
BEGIN
    DECLARE ctry_list VARCHAR(100);
    DECLARE SQL_string_sel VARCHAR(255);
    DECLARE col_name NUMBER;

    DECLARE C1 CURSOR FOR S1;

    SET ctry_list = '';
    SET col_name = 1;
    SET SQL_string_sel = 'SELECT * FROM databaseTest.MyTemporaryTable where Col1 = ?';
    PREPARE S1 FROM SQL_string_sel;
    SET col_name = 2; // change value before open cursor
    OPEN C1 USING col_name;
    FETCH C1 INTO ctry_list;
    IF (ctry_list <> '') THEN
        INSERT INTO databaseTest.MyStatusTable(col1) VALUES ('ok');
    END IF;
    CLOSE C1;
END;

CALL PREPARE_ST_TEST();
SELECT * FROM MyStatusTable;
```

##### Output

```none
"MyStatusTable" should be empty.
```

##### Snowflake Scripting

> **Note:**
>
> Usages for cursors must be renamed and declared again.

##### Query

```sql
 CREATE OR REPLACE PROCEDURE PREPARE_ST_TEST_MARKERS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        p1 RESULTSET;
        p1_sql VARCHAR DEFAULT '';

    BEGIN
        LET ctry_list VARCHAR(100);
        LET SQL_string_sel VARCHAR(255);
        LET col_value NUMBER(38, 18);
        LET S1 RESULTSET;

        ctry_list := '';

        col_value := 1;

        SQL_string_sel := 'SELECT * FROM MyTemporaryTable WHERE Col1 = ?';

        p1_sql := SQL_string_sel;

        col_value:= 2; // Move variable setting before the EXECUTE IMMEDIATE

        S1 := (
            EXECUTE IMMEDIATE p1_sql USING (col_value)
        );

        LET C1 CURSOR FOR S1;

        OPEN C1;
            FETCH C1 INTO ctry_list;
        IF (RTRIM(ctry_list) <> '') THEN
            INSERT INTO MyStatusTable (col1)
            VALUES ('ok');
        END IF;
            CLOSE C1;
    END;
$$;

CALL PREPARE_ST_TEST();
SELECT * FROM MyStatusTable;
```

##### Output

```none
"MyStatusTable" should be empty.
```

#### Anonymous blocks - Declaration outside the block

> **Warning:**
>
> **This case is not supported yet.**

##### Teradata

##### Query

```sql
 REPLACE PROCEDURE anonymous_blocks_case(OUT procedure_result INTEGER)
BEGIN
    --Variables for the example's procedure_results
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTemporaryTable WHERE col1 = 1';

    -- Actual Cursor usage
    DECLARE C1 CURSOR FOR S1;
    DECLARE C2 CURSOR FOR S2;

    PREPARE S1 FROM SQL_string_sel;
    OPEN C1;
    FETCH C1 INTO procedure_result;
    CLOSE C1;

    BEGIN
        PREPARE S2 FROM SQL_string_sel;
        OPEN C2;
        FETCH C2 INTO procedure_result;
        CLOSE C2;
    END;

    OPEN C1;
    CLOSE C1;
END;
```

##### Output

```none
No returning information.
```

##### Query

```sql
 CREATE OR REPLACE PROCEDURE anonymous_blocks_case (
--                                                   OUT
                                                       PROCEDURE_RESULT INTEGER)
RETURNS VARIANT
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "none",  "convertedOn": "01/01/0001" }}'
EXECUTE AS CALLER
AS
$$
  DECLARE
    --Variables for the example's procedure_results
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = 1';
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
    S2 RESULTSET;
  BEGIN
    -- Actual Cursor usage

    prepareQuery_aux_sql := SQL_string_sel
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      procedure_result;
    CLOSE CURSOR_S1_INSTANCE_V0;

    BEGIN
      prepareQuery_aux_sql := SQL_string_sel
      S2 := (
        EXECUTE IMMEDIATE prepareQuery_aux_sql
      );
      LET CURSOR_S2_INSTANCE_V# CURSOR
      FOR
        S1;
      OPEN CURSOR_S2_INSTANCE_V#;
      FETCH
        CURSOR_S2_INSTANCE_V#
      INTO
        procedure_result;
      CLOSE CURSOR_S2_INSTANCE_V#;
    END;

    OPEN CURSOR_S1_INSTANCE_V0; -- NAME REMAINS AS NEEDED IN LOGIC
    CLOSE CURSOR_S1_INSTANCE_V0;
    RETURN null;
  END;
$$;
```

##### Output

```none
No returning information.
```

### Known Issues

* Review carefully nested cursors and conditionals, if that is the case.

### Related EWIs

No related EWIs.

## REPEAT

Translation reference to convert Teradata REPEAT statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s `REPEAT` statement is translated to Snowflake Scripting `REPEAT` syntax.

For more information, see the [Teradata REPEAT documentation](https://docs.teradata.com/r/Teradata-Database-SQL-Stored-Procedures-and-Embedded-SQL/June-2017/SQL-Control-Statements/REPEAT).

```sql
 [label_name:] REPEAT
    { sql_statement }
    UNTIL conditional_expression
END REPEAT [label_name];
```

### Sample Source Patterns

#### Teradata

##### Repeat

```sql
 CREATE PROCEDURE repeatProcedure(OUT resultCounter INTEGER)
BEGIN
    DECLARE counter INTEGER DEFAULT 0;

    customeLabel: REPEAT
    	SET counter = counter + 1;
	UNTIL 10 < counter
    END REPEAT customeLabel;

    SET resultCounter = counter;
END;

CALL repeatProcedure(:?);
```

##### Result

```none
|resultCounter|
|-------------|
|11           |
```

##### Snowflake Scripting

##### Repeat

```sql
 CREATE OR REPLACE PROCEDURE repeatProcedure (RESULTCOUNTER OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
	DECLARE
		counter INTEGER DEFAULT 0;
	BEGIN

		REPEAT
			counter := counter + 1;
		UNTIL (10 < :counter)
		END REPEAT CUSTOMELABEL;
		resultCounter := counter;
	END;
$$;

CALL repeatProcedure(:?);
```

##### Result

```none
|REPEATPROCEDURE|
|---------------|
|1             |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## SET

Translation reference to convert Teradata SET statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

> Assigns a value to a local variable or parameter in a stored procedure.

For more information, see the [Teradata SET documentation](https://docs.teradata.com/r/zzfV8dn~lAaKSORpulwFMg/7wwpafjC_5JfF~I2zpTsQQ).

```sql
 SET assigment_target = assigment_source ;
```

### Sample Source Patterns

#### Teradata

##### Query

```sql
 CREATE PROCEDURE setExample ( OUT PARAM1 INTEGER )
BEGIN
    DECLARE COL_COUNT INTEGER;
    SET COL_COUNT = 3;
    SET PARAM1 = COL_COUNT + 1;
END;
```

##### Result

```none
|PARAM1 |
|-------|
|4      |
```

##### Snowflake Scripting

##### Query

```sql
 CREATE OR REPLACE PROCEDURE setExample (PARAM1 OUT INTEGER )
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        COL_COUNT INTEGER;
    BEGIN

        COL_COUNT := 3;
        PARAM1 := COL_COUNT + 1;
    END;
$$;
```

##### Result

```none
|PARAM1 |
|-------|
|4      |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

## SYSTEM_DEFINED

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> Non-relevant statement.

> **Warning:**
>
> **Notice that this statement is****removed from the migration****because it is a non-relevant syntax. It means that it is not required in Snowflake.**

### Description

Property in Teradata that can be after a `CREATE` statement in cases such as `JOIN INDEX`.

### Sample Source Patterns

Notice that SYSTEM_DEFINED has been removed from the source code because it is a non-relevant syntax in Snowflake.

#### Teradata

```sql
 CREATE SYSTEM_DEFINED JOIN INDEX MY_TESTS.MYPARTS_TJI004 ,FALLBACK ,CHECKSUM = DEFAULT, MAP = TD_MAP1 AS
CURRENT TRANSACTIONTIME
SELECT
    MY_TESTS.myParts.ROWID,
    MY_TESTS.myParts.part_id,
    MY_TESTS.part_duration
FROM MY_TESTS.myParts
UNIQUE PRIMARY INDEX (part_id);
```

##### Snowflake

```sql
 --** SSC-FDM-0007 - MISSING DEPENDENT OBJECT "MY_TESTS.myParts" **
CREATE OR REPLACE DYNAMIC TABLE MY_TESTS.MYPARTS_TJI004
--** SSC-FDM-0031 - DYNAMIC TABLE REQUIRED PARAMETERS SET BY DEFAULT **
TARGET_LAG='1 day'
WAREHOUSE=UPDATE_DUMMY_WAREHOUSE
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/01/2024" }}'
AS
--    --** SSC-FDM-TD0025 - TEMPORAL FORMS ARE NOT SUPPORTED IN SNOWFLAKE **
--    CURRENT TRANSACTIONTIME
                            SELECT
        MY_TESTS.myParts.ROWID,
        MY_TESTS.myParts.part_id,
        MY_TESTS.part_duration
    FROM
        MY_TESTS.myParts;
```

### Known Issues

No issues were found.

### Related EWIs

1. [SSC-FDM-0007](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Element with missing dependencies.
2. [SSC-FDM-TD0025](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md): Teradata Database Temporal Table is not supported in Snowflake.
3. [SSC-FDM-0031](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/generalFDM.md): Dynamic Table required parameters set by default

## WHILE

Translation reference to convert Teradata WHILE statement to Snowflake Scripting

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

Teradata’s `WHILE` statement is translated to Snowflake Scripting`WHILE` syntax.

For more information, see the [Teradata WHILE documentation](https://docs.teradata.com/r/Teradata-Database-SQL-Stored-Procedures-and-Embedded-SQL/June-2017/SQL-Control-Statements/WHILE).

```sql
 [label_name:] WHILE conditional_expression DO
    { sql_statement }
END WHILE [label_name];
```

### Sample Source Patterns

#### Teradata

##### While

```sql
 REPLACE PROCEDURE whileProcedure(OUT resultCounter INTEGER)
BEGIN
    DECLARE counter INTEGER DEFAULT 0;
    customeLabel: WHILE counter < 10 DO
        SET counter = counter + 1;
    END WHILE customeLabel;
    SET resultCounter = counter;
END;

CALL whileProcedure(:?);
```

##### Result

```sql
 |resultCounter|
|-------------|
|10           |
```

##### Snowflake Scripting

##### While

```sql
 CREATE OR REPLACE PROCEDURE whileProcedure (RESULTCOUNTER OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/16/2025",  "domain": "no-domain-provided" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        counter INTEGER DEFAULT 0;
    BEGIN

        WHILE (:counter < 10) LOOP
            counter := counter + 1;
        END LOOP CUSTOMELABEL;
        resultCounter := counter;
    END;
$$;

CALL whileProcedure(:?);
```

##### Result

```sql
 |WHILEPROCEDURE|
|--------------|
|10            |
```

### Known Issues

No issues were found.

### Related EWIs

No related EWIs.

---
title: SnowConvert AI - Teradata - SQL Translation Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/sql-translation-reference/README.md
section: Migrations
---

# SnowConvert AI - Teradata - SQL Translation Reference

This section provides information about the translation that SnowConvert AI performs, over Teradata SQL Syntax to Snowflake.

Use this as a guide to understand how the transformed code might look when migrating from [**Teradata**](https://docs.teradata.com/) to [**Snowflake**](https://docs.snowflake.net/manuals/index.html). SQL has a similar syntax between dialects, but each dialect can extend or add new functionalities.

For this reason, when running SQL in one environment (such as Teradata) vs. another (such as Snowflake), there are many statements that require transformation or even removal. These transformations are done by SnowConvert.

Browse through the following pages to find more information about specific topics.

* [Analytic](analytic.md), compare Teradata Analytics statements and their equivalents in Snowflake.
* [Data Types](data-types.md), compare Teradata data types and their equivalents in Snowflake.
* [DDL](ddl-teradata.md), explore the translation of the Data Definition Language.
* [CREATE TYPE](teradata-create-type.md), translation of Teradata UDT definitions to Snowflake `CREATE TYPE`.
* [DML](dml-teradata.md), explore the translation of the Data Manipulation Language.
* [Built-in Functions](teradata-built-in-functions.md), compare functions included in the runtime of both languages.

---
title: SnowConvert AI - Teradata - TPT
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/scripts-to-python/tpt-translation.md
section: Migrations
---

# SnowConvert AI - Teradata - TPT

This section illustrates TPT translation from Teradata to Snowflake.

## TPT statements transformation

All TPT statements, like other Teradata scripting languages, are being converted to python code. Here are some examples of transformations already supported.

### Define Job header transformation

The job statement is translated to a python class with all the statements like operators, schema definitions, and steps inside of it.

Source code

```sql
 /* Some comments on the job  */
DEFINE JOB LOADJOB
DESCRIPTION 'LOAD AC_SCHEMA TABLE FROM A FILE'
JobBody
```

Translated code

```python
 # Some comments on the job
class LOADJOB:
    # DESCRIPTION 'LOAD AC_SCHEMA TABLE FROM A FILE'
    JobBody
```

### Define Schema transformation

The schema statement is translated to an attribute in the class created for the job statement.

Source code

```sql
 DEFINE SCHEMA DCS_SCHEMA
DESCRIPTION 'DCS DATA'
(
PNRHEADER_ID   PERIOD(DATE),
PNRLOCPERIOD   PERIOD(TIMESTAMP(0)),
CRTDATE        CLOB,
REQTYP         JSON(100000),
seqno          INTEGER,
resdata        INTEGER
);
```

Translated code

```python
 class JOBNAME:
    DCS_SCHEMA = """(
    PNRHEADER_ID VARCHAR(24),
    PNRLOCPERIOD VARCHAR(58),
    CRTDATE VARCHAR /*** MSC-WARNING - SSC-FDM-TD0002 - COLUMN CONVERTED FROM CLOB DATA TYPE ***/,
    REQTYP VARIANT,
    seqno INTEGER,
    resdata INTEGER,
    );"""
```

### Define Operator transformation

The operators are translated to python functions inside the class generated for the job. The examples provided are the operators that SnowConvert AI currently supports

#### DDL Operator

Source code for DDL operator

```sql
 DEFINE OPERATOR DDL_OPERATOR()
DESCRIPTION 'TERADATA PARALLEL TRANSPORTER DDL OPERATOR'
TYPE DDL
ATTRIBUTES
(
  VARCHAR PrivateLogName ,
  VARCHAR TdpId          = @MyTdpId,
  VARCHAR UserName       = @MyUserName,
  VARCHAR UserPassword   = 'SomePassWord',
  VARCHAR AccountID,
  VARCHAR ErrorList      = ['3807','2580']
);
```

Translated code

```python
 class JobName:
    def DDL_OPERATOR(self):
        #'TERADATA PARALLEL TRANSPORTER DDL OPERATOR'
        global args
        self.con = log_on(user = args.MyUserName, password = 'SomePassWord')
```

#### UPDATE Operator

Source code for UPDATE operator

```sql
 DEFINE OPERATOR LOAD_OPERATOR()
DESCRIPTION 'TERADATA PARALLEL TRANSPORTER LOAD OPERATOR'
TYPE UPDATE
SCHEMA AC_MASTER_SCHEMA
ATTRIBUTES
(
    VARCHAR PrivateLogName ,
    INTEGER MaxSessions       =  32,
    INTEGER MinSessions       =  1,
    VARCHAR TargetTable       = '&TARGET_TABLE',
    VARCHAR TdpId             = @MyTdpId,
    VARCHAR UserName          = @MyUserName,
    VARCHAR UserPassword      = @MyPassword,
    VARCHAR AccountId,
    VARCHAR ErrorTable1       = '&LOG_DB_NAME.ERR1',
    VARCHAR ErrorTable2       = '&LOG_DB_NAME.ERR2',
    VARCHAR LogTable          = '&LOG_DB_NAME.LOG_TABLE'
);
```

Translated code

```python
 class JobName:
    def LOAD_OPERATOR(self, query):
        #'TERADATA PARALLEL TRANSPORTER LOAD OPERATOR'
        #USES SCHEMA AC_MASTER_SCHEMA
        operator_name = "LOAD_OPERATOR"
        return query
```

#### DATA CONNECTOR PRODUCER Operator

Source code for Data Connector Producer operator

```sql
 DEFINE OPERATOR FILE_READER()
DESCRIPTION 'TERADATA PARALLEL TRANSPORTER DATA CONNECTOR OPERATOR'
TYPE DATACONNECTOR PRODUCER
SCHEMA AC_MASTER_SCHEMA
ATTRIBUTES
(
  VARCHAR PrivateLogName ,
  VARCHAR DirectoryPath   = '&INPUTFILEPATH' ,
  VARCHAR FileName        = '&INPUTTEXTFILE' ,
  VARCHAR Format          = 'delimited',
  VARCHAR OpenMode        = 'Read',
  VARCHAR TextDelimiter     = '~',
  VARCHAR IndicatorMode   = 'N'
);
```

Translated code

```python
 class JobName:
    def FILE_READER(self):
        #'TERADATA PARALLEL TRANSPORTER DATA CONNECTOR OPERATOR'
        #USES SCHEMA AC_MASTER_SCHEMA
        operator_name = "FILE_READER"
        stage_name = f"{self.jobname}_{operator_name}"
        format_name = f"{self.jobname}_{operator_name}_FILEFORMAT"
        exec(f"""CREATE OR REPLACE FILE FORMAT {format_name} TYPE = 'CSV' FIELD_DELIMITER = '~' TRIM_SPACE = TRUE SKIP_HEADER = 0""")
        exec(f"""CREATE STAGE IF NOT EXISTS {self.jobname}_STAGE""")
        exec(f"""PUT file://{INPUTFILEPATH}/{INPUTTEXTFILE} @{stage_name} OVERWRITE = TRUE AUTO_COMPRESS = FALSE;""")
        temp_table_name = f"{self.jobname}_{operator_name}_TEMP"
        exec(f"""DROP TABLE IF EXISTS {temp_table_name}""")
        exec(f"""CREATE TEMPORARY TABLE {temp_table_name} {self.AC_MASTER_SCHEMA}""")
        exec(f"""COPY INTO {temp_table_name} FROM @{stage_name} FILE_FORMAT = (format_name = '{format_name}')""")
        return temp_table_name
```

### Define step transformation

Steps are too translated to python functions inside the class generated for the job, they will be called in the main function of the translated code.

Step source code

```sql
 STEP setup_tables
(
  APPLY
  ('DELETE FROM  &STAGE_DB_NAME.EMS_AC_MASTER_STG;')
   TO OPERATOR (DDL_OPERATOR() );
);

STEP stLOAD_FILE_NAME
(
  APPLY
  ('INSERT INTO CRASHDUMPS.EMP_NAME
  (EMP_NAME, EMP_YEARS, EMP_TEAM)
  VALUES
  (:EMP_NAME, :EMP_YEARS, :EMP_TEAM);')
  TO OPERATOR (ol_EMP_NAME() [1])
  SELECT * FROM OPERATOR(op_EMP_NAME);
);
```

Translated code

```python
 def setup_tables(self):
    self.DDL_OPERATOR()
    exec(f"""DELETE FROM DATABASE1.{STAGE_DB_NAME}.EMS_AC_MASTER_STG""")

def stLOAD_FILE_NAME(self):
    exec(f"""INSERT INTO DATABASE1.CRASHDUMPS.EMP_NAME (EMP_NAME, EMP_YEARS, EMP_TEAM)
SELECT EMP_NAME, EMP_YEARS, EMP_TEAM
FROM (
{self.ol_EMP_NAME('SELECT * FROM ' + self.op_EMP_NAME() )})""")
```

### Main function

The main function is always generated for any scripting language, for TPT the main function contains an instance of the job class and calls to the steps in the job

Main function sample code

```python
 def main():
  _LOADJOB = LOADJOB()
  _LOADJOB.setup_tables()
  _LOADJOB.stLOAD_FILE_NAME()
  snowconvert.helpers.quit_application()
```

---
title: SnowConvert AI - Teradata Conversion Settings
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/conversion/teradata-conversion-settings.md
section: Migrations
---

# SnowConvert AI - Teradata Conversion Settings

## General Conversion Settings

### General Result Settings

1. **Comment objects with missing dependencies:** Flag to indicate if the user wants to comment on nodes that have missing dependencies.
2. **Generate XML-tags for SQL statements in Stored Procedures:** Flag to indicate whether the SQL statements SELECT, INSERT, CREATE, DELETE, UPDATE, DROP, MERGE in Stored Procedures will be tagged on the converted code. This feature is used for easy statement identification on the migrated code. Wrapping these statements within these XML-like tags allows for other programs to quickly find and extract them. The decorated code looks like this:

   ```sql
   //<SQL_DELETE
   EXEC(DELETE FROM SB_EDP_SANDBOX_LAB.PUBLIC.USER_LIST,[])
   //SQL_DELETE!>
   ```
3. **Separate Period Data-type definitions and usages into begin and end Data-Time fields:** This flag is used to indicate that the tool should migrate any use of the PERIOD datatype as two separate DATETIME fields that will hold the original period begin and end values, anytime a period field or function is migrated using this flag SSC-EWI-TD0053 will be added to warn about this change.

   Input Code:

   ```sql
   CREATE TABLE myTable(
      col1 PERIOD(DATE),
      col2 VARCHAR(50),
      col3 PERIOD(TIMESTAMP)
   );
   ```

   Output Code:

   ```sql
   CREATE OR REPLACE TABLE myTable (
      col1 VARCHAR(24) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!,
      col2 VARCHAR(50),
      col3 VARCHAR(58) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0053 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/!!!
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
   ;
   ```
4. **Set encoding of the input files:** Check [General Conversion Settings](general-conversion-settings.md) for more details.
5. **Disable use of COLLATE for Case Specification**: This flag indicates whether to use COLLATE or UPPER to preserve Case Specification functionality, e.g. CASESPECIFIC or NOT CASESPECIFIC. By default, COLLATE will be used to emulate the case insensitive comparisons (NOT CASESPECIFIC), when turning on this flag SnowConvert will modify queries to use the UPPER function for case insensitive comparisons instead. To learn more about how Case Specification is handled by SnowConvert AI check [here](../../../../translation-references/teradata/session-modes.md).

   When the “Iceberg tables in Snowflake Horizon Catalog” option is selected in the Table translation setting, this setting will be enforced, this is done since Iceberg Tables do not support collation at the column level.

> **Note:**
>
> To review the Settings that apply to all supported languages, go to the following [article](general-conversion-settings.md).

### Session Mode Settings

This settings sub-page is used to indicate the Session Mode of the input code.

SnowConvert AI handles Teradata code in both TERA and ANSI modes. Currently, this is limited to the default case specification of character data and how it affects comparisons. By default, the Session Mode is TERA.

You can learn more about how SnowConvert AI handles and converts code depending on the session mode, check here.

## DB Objects Names Settings

1. **Schema:** The string value specifies the custom schema name to apply. If not specified, the original database name will be used. Example: DB1.**myCustomSchema**.Table1.
2. **Database:** The string value specifies the custom database name to apply. Example: **MyCustomDB**.PUBLIC.Table1.
3. **Default:** None of the above settings will be used in the object names.

## Prepare Code Settings

### **Description**

**Prepare my code:** Flag to indicate whether the input code should be processed before parsing and transformation. This can be useful to improve the parsing process. By default, it’s set to FALSE.

Splits the input code top-level objects into multiple files. The containing folders would be organized as follows:

```none
└───A new folder named ''[input_folder_name]_Processed''
    └───Top-level object type
        └───Schema name
```

### **Example**

#### **Input**

```none
├───in
│       DDL_Macros.sql
│       DDL_Procedures.sql
│       DDL_Tables.sql
```

#### **Output**

Assume that the name of the files is the name of the top-level objects in the input files.

```none
├───in_Processed
    ├───macro
    │   └───MY_DATABASE
    │           MY_FIRST_MACRO.sql
    │           ANOTHER_MACRO.sql
    │
    ├───procedure
    │   └───MY_DATABASE
    │           A_PROCEDURE.sql
    │           ANOTHER_PROCEDURE.sql
    │           YET_ANOTHER_PROCEDURE.sql
    │
    └───table
        └───MY_DATABASE
                MY_TABLE.sql
                ADDITIONAL_TABLE.sql
                THIRD_TABLE.sql
```

Inside the “schema name” folder, there should be as many files as top-level objects in the input code. Also, it is possible to have copies of some files when multiple same-type top-level objects have the same name. In this case, the file names will be enumerated in ascending order.

Only files with the “.sql”, “.ddl” and “.dml” extensions will be considered for splitting. Other kinds of files like “.bteq” scripts will be copied into the preprocessed folder and will be categorized depending on the script extension but they won’t be modified by the Split Task.

### Requirements

To identify top-level objects, a tag must be included in a comment before their declaration. Our [Extraction](../../code-extraction/teradata.md) scripts generate these tags.

The tag should follow the next format:

```none
<sc-top_level_object_type>top_level_object_name</sc-top_level_object_type>
```

You can follow the next example:

```sql
/* <sc-table> MY_DATABASE.MY_TABLE</sc-table> */
CREATE TABLE "MY_DATABASE"."MY_TABLE" (
    "MY_COLUMN" INTEGER
) ;
```

## Format Conversion Settings

1. **Character to Number default scale:** An integer value for the CHARACTER to Approximate Number transformation (Default: 10).
2. **Default TIMESTAMP format:** String value for the TIMESTAMP format (Default: “YYYY/MM/DD HH:MI:SS.FF6”).
3. **Default DATE format:** String value for the DATE format (Default: “YYYY/MM/DD”).
4. **Source TIMEZONE:** String value for the TIMEZONE format (Default: “GMT-5”).
5. **Default TIME format:** String value for the TIME format (Default: “HH:MI:SS.FF6”).

## Target Language for BTEQ, Procedures/Macros

Specifies the target language to convert Bteq and Mload script files. Currently supported values are **SnowScript** and **Python**. The default value is set to **Python**.

String value specifying the target language to convert Stored procedures and Macros. Currently supported are: **SnowScript** and **JavaScript**. The default value is set to **SnowScript**.

**Reset Settings:** The reset settings option appears on every page. If you’ve made changes, you can reset SnowConvert AI to its original default settings.

## Table translation

Used to specify the type of tables that SnowConvert AI will output for table transformations, currently:

1. Snowflake-native tables
2. [Iceberg tables in Snowflake Horizon Catalog](../../../../translation-references/teradata/sql-translation-reference/Iceberg-tables-transformations.md)

Default is Snowflake-native tables.

The selected table type will be generated unless the source table is considered not compatible, the following criteria is applied for incompatible tables generation:

| Table type | Not compatible tables |
| --- | --- |
| Iceberg tables in Snowflake Horizon Catalog | Temporary tables (VOLATILE) |

Any table not compatible with the specified table type will not be affected by the setting and transformed to its default table type.

---
title: SnowConvert AI - Teradata Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/teradataFDM.md
section: Migrations
---

# SnowConvert AI - Teradata Functional Differences

## SSC-FDM-TD0001

Column converted from Blob data type.

### Description

This message is shown when SnowConvert AI finds a data type BLOB. Since BLOB is not supported in Snowflake, the type is changed to Binary.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE TableExample
(
ColumnExample BLOB
);
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TableExample
(
ColumnExample BINARY /*** SSC-FDM-TD0001 - COLUMN CONVERTED FROM BLOB DATA TYPE ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0002

Column converted from Clob data type.

### Description

This message is shown when SnowConvert AI finds a data type CLOB. Since CLOB is not supported in SnowConvert AI, the type is changed to VARCHAR.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE TableExample
(
ColumnExample CLOB
)
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TableExample
(
ColumnExample VARCHAR /*** SSC-FDM-TD0002 - COLUMN CONVERTED FROM CLOB DATA TYPE ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0003

Bash variables found, using SnowSQL with variable substitution enabled is required to run this script

### Description

When the source code of a script file migrated to Snowflake Scripting contains Bash variables placeholders ($variable or ${variable}), SnowConvert AI transforms them into SnowSQL variables (&variable or &{variable}).

This warning is generated to point out that the execution of the migrated script now depends on SnowSQL to work, please consider the following when running the script in SnowSQL:

* Variable substitution [must be enabled](https://docs.snowflake.com/en/user-guide/snowsql-use.html#enabling-variable-substitution).
* All variables [must be defined](https://docs.snowflake.com/en/user-guide/snowsql-use.html#defining-variables).
* Run the file as a [batch script](https://docs.snowflake.com/en/user-guide/snowsql-use.html#running-batch-scripts).

#### Example Code

##### Input Code:

```sql
 .LOGON dbc, dbc;

select '$variable', '${variable}', '${variable}_concatenated';

select $colname from $tablename where info = $id;

select ${colname} from ${tablename} where info = ${id};

.LOGOFF;
```

##### Generated Code:

```sql
EXECUTE IMMEDIATE
$$
  --** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    --.LOGON dbc, dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      SELECT
        '&#x26;variable',
        '&#x26;{variable}',
        '&#x26;{variable}_concatenated';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      SELECT
        &#x26;colname
      from
        &#x26;tablename
      where
        info = &#x26;id;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    BEGIN
      SELECT
        &#x26;{colname}
      from
        &#x26;{tablename}
      where
        info = &#x26;{id};
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    --.LOGOFF
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'LogOff' NODE ***/!!!
    null;
  END
$$
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0004

Period types are handled as two data fields

### Description

Teradata has a period data type used to represent a time interval, with instances of this type having a beginning and ending bound of the same type (time, date or timestamp) along with a set of functions that allow initializing and manipulating period data such as PERIOD, BEGIN, END, and OVERLAPS.

Since the period type is not supported by Snowflake, SnowConvert AI transforms this type and its related functions using the following rules:

* Any period type declaration in column tables is migrated as a two column of the same type.
* The [period value constructor function](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Date-and-Time-Functions-and-Expressions/Period-Functions-and-Operators) is migrated into two different constructors of the period subtype one with the begin value and the other with the end value.
* Supported functions that expect period type parameters are migrated to UDFs as well, these UDFs expect almost two parameters for the begin value and the end value.

#### Example code

##### Input code:

```sql
 -- Additional Params: --SplitPeriodDatatype
CREATE TABLE DateTable
(
	COL1 PERIOD(DATE) DEFAULT PERIOD (DATE '2005-02-03', UNTIL_CHANGED)
);
```

##### Generated Code:

```sql
CREATE OR REPLACE TABLE DateTable
(
	COL1_begin DATE DEFAULT DATE '2005-02-03',
	COL1_end DATE DEFAULT DATE '9999-12-31' /*** SSC-FDM-TD0004 - PERIOD DATA TYPES ARE HANDLED AS TWO DATA FIELDS ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0005

Non-standard time zone offsets are not supported in Snowflake, rounded to nearest valid time zone

### Description

While Teradata provides the flexibility to define any time zone offset between `-12:59` and `+14:00` using the `SET TIME ZONE` query, Snowflake exclusively supports time zones listed in the [IANA Time Zone Database](https://www.iana.org/time-zones).

If the specified offset in the SET TIME ZONE query does not align with an IANA standard time zone, Snowflake will automatically round it to the nearest standard time zone with the closest offset. In such a case, a warning message will be generated.

#### Example Code

##### Input Code:

```sql
-- Will be rounded to Asia/Colombo (+05:30)
SET TIME ZONE '05:26';
```

##### Generated Code:

```sql
 -- Will be rounded to Asia/Colombo (+05:30)
--** SSC-FDM-TD0005 - NON-STANDARD TIME ZONE OFFSETS NOT SUPPORTED IN SNOWFLAKE, ROUNDED TO NEAREST VALID TIME ZONE **
ALTER SESSION SET TIMEZONE = 'Asia/Colombo';
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0006

View With Check Option Not Supported.

### Description

This message is shown when SnowConvert AI finds a view with the WITH CHECK OPTION clause. Which is not supported in Snowflake, so it is commented out from the code.

This clause works with updatable views that can be used to execute INSERT and UPDATE commands over the view and internally update the table associated with the view.

The clause is used to restrict the rows that will be affected by the command using the WHERE clause in the view.

For more details see the [documentation](https://docs.teradata.com/r/SQL-Data-Definition-Language-Syntax-and-Examples/July-2021/View-Statements/CREATE-VIEW-and-REPLACE-VIEW/CREATE-VIEW-and-REPLACE-VIEW-Syntax-Elements/WITH-CHECK-OPTION) about the clause functionality.

#### Example code

##### Input code:

```sql
REPLACE VIEW VIEWWITHOPTIONTEST AS
LOCKING ROW FOR ACCESS
SELECT
    *
FROM SOMETABLE
WHERE app_id = 'SUPPLIER'
WITH CHECK OPTION;
```

##### Generated Code:

```sql
 CREATE OR REPLACE VIEW VIEWWITHOPTIONTEST
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/02/2025",  "domain": "no-domain-provided" }}'
AS
SELECT
    *
FROM
    SOMETABLE
WHERE
    UPPER(RTRIM( app_id)) = UPPER(RTRIM('SUPPLIER'))
--    --** SSC-FDM-TD0006 - VIEW WITH OPTION NOT SUPPORTED IN SNOWFLAKE **
--    WITH CHECK OPTION
                     ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0007

Variant column does not support collation.

### Description

This message is shown when SnowConvert AI a Variant data type in the transformation of a code has a COLLATE clause. Since COLLATE is not supported with the data type VARIANT, it will be removed and a message will be added.

#### Example code

##### Input code:

```sql
-- Additional Params: --useCollateForCaseSpecification
CREATE TABLE TableExample
(
ColumnExample JSON(2500) NOT CASESPECIFIC
)
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TableExample
(
ColumnExample VARIANT
--                      NOT CASESPECIFIC /*** SSC-FDM-TD0007 - VARIANT COLUMN DOES NOT SUPPORT COLLATION ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

The data type JSON is converted to VARIANT, while NOT CASESPECIFIC is converted to a COLLATE clause.

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0008

When NVP_UDF fourth parameter is non-literal and it contains a backslash, that backslash needs to be escaped.

### Description

Non-literal delimiters with spaces need their backslash escaped in Snowflake.

#### Example code

##### Input code

```sql
SELECT NVP('store = whole foods&#x26;&#x26;store: ?Bristol farms','store', '&#x26;&#x26;', valueDelimiter, 2);
```

##### Generated Code

```sql
 SELECT
PUBLIC.NVP_UDF('store = whole foods&&store: ?Bristol farms', 'store', '&&', valueDelimiter, 2) /*** SSC-FDM-TD0008 - WHEN NVP_UDF FOURTH PARAMETER IS NON-LITERAL AND IT CONTAINS A BACKSLASH, THAT BACKSLASH NEEDS TO BE ESCAPED ***/;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0009

Converted from integer to varchar for current session default.

### Description

This message is shown when SnowConvert AI finds a DEFAULT SESSION and the data type is NOT a VARCHAR. If that is the case, the data type is changed to VARCHAR and a message is added.

#### Code Example

##### Input Code:

```sql
 CREATE TABLE TableExample
(
ColumnExample INTEGER DEFAULT SESSION,
ColumnExample2 VARCHAR DEFAULT SESSION
)
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE TableExample
(
ColumnExample VARCHAR DEFAULT CURRENT_SESSION() /*** SSC-FDM-TD0009 - CONVERTED FROM INTEGER TO VARCHAR FOR CURRENT_SESSION DEFAULT ***/,
ColumnExample2 VARCHAR DEFAULT CURRENT_SESSION()
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

Let’s look at the example. Note that ColumnExample has a data type INTEGER with DEFAULT SESSION. Since the data type is not VARCHAR, in the output it is transformed to VARCHAR.

The data type of ColumnExample2 hasn’t changed since it is already VARCHAR.

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0010

Table columns between tables (Teradata) DBC.COLUMNSV and INFORMATION_SCHEMA.COLUMNS (Snowflake). But some columns might not have an exact match in Snowflake.

### Description

Uses of the table `DBC.COLUMNSV` in Teradata are converted to `INFORMATION_SCHEMA.COLUMNS`, but some columns might not have an exact match in Snowflake. That means there are some columns in Teradata for which there is **no** equivalent in Snowflake, and there are others that do have a matching column but the content is not exactly the same.

Notice, for example, that there is no equivalent column for *“ColumnFormat*” in Snowflake and notice also that *“DATA_TYPE”* seems to be the match for the column *“ColumnType”* in Teradata, but their content greatly differ.

#### Code Example

##### Input Code:

```sql
 SELECT columnname FROM dbc.columnsV WHERE tablename = 'TableN';
```

##### Generated Code:

```sql
 SELECT
COLUMN_NAME AS COLUMNNAME
FROM
--** SSC-FDM-TD0010 - USES OF TABLE DBC.COLUMNSV ARE CONVERTED TO INFORMATION_SCHEMA.COLUMNS, BUT SOME COLUMNS MIGHT NOT HAVE AND EXACT MATCH IN SNOWFLAKE **
INFORMATION_SCHEMA.COLUMNS
WHERE
UPPER(RTRIM(TABLE_NAME)) = UPPER(RTRIM('TableN'));
```

#### Best Practices

* Review what columns were used in Teradata and check if the available content in Snowflake matches your needs.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0011

Unicode BMP escape is not supported.

### Description

Snowflake doesn’t support Unicode BMP, so this message is shown when SnowConvert AI transforms Teradata [Unicode Delimited Character Literal](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Types-and-Literals/Data-Literals/Unicode-Delimited-Character-Literals) with Unicode BMP escape to Snowflake.

#### Example code

##### Input Code:

```sql
 SELECT U&'hola #+005132 mundo' UESCAPE '#';
```

##### Generated Code:

```sql
 SELECT
--** SSC-FDM-TD0011 - UNICODE BMP IS NOT SUPPORTED IN SNOWFLAKE **
'hola \u+005132 mundo';
```

#### Best Practices

* Check if a Unicode equivalent exists.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0012

Invalid default value.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TD0006](../conversion-issues/teradataEWI.md) documentation

### Description

The **DEFAULT TIME** / **DEFAULT DATE** / **DEFAULT CURREN_DATE** */* **DEFAULT DEFAULT CURRENT_TIME** */* **DEFAULT CURRENT_TIMESTAMP** column specifications are not supported for the **FLOAT** data type.

#### Example Code

##### Teradata:

```sql
CREATE TABLE T_2004
(
    -- In the output code all of these columns will be FLOAT type
    -- and will include the SSC-FDM-TD0012 message.
    COL1 FLOAT DEFAULT TIME,
    COL2 FLOAT DEFAULT DATE,
    COL3 FLOAT DEFAULT CURRENT_DATE,
    COL4 FLOAT DEFAULT CURRENT_TIME,
    COL5 FLOAT DEFAULT CURRENT_TIMESTAMP
);
```

##### Snowflake Scripting:

```sql
 CREATE TABLE T_2004
(
    -- In the output code all of these columns will be FLOAT type
    -- and will include the SSC-FDM-TD0012 message.
    COL1 FLOAT DEFAULT TIME /*** SSC-FDM-TD0012 - DEFAULT CURRENT_TIME NOT VALID FOR DATA TYPE ***/,
    COL2 FLOAT DEFAULT DATE /*** SSC-FDM-TD0012 - DEFAULT CURRENT_DATE NOT VALID FOR DATA TYPE ***/,
    COL3 FLOAT DEFAULT CURRENT_DATE /*** SSC-FDM-TD0012 - DEFAULT CURRENT_DATE NOT VALID FOR DATA TYPE ***/,
    COL4 FLOAT DEFAULT CURRENT_TIME /*** SSC-FDM-TD0012 - DEFAULT CURRENT_TIME NOT VALID FOR DATA TYPE ***/,
    COL5 FLOAT DEFAULT CURRENT_TIMESTAMP /*** SSC-FDM-TD0012 - DEFAULT CURRENT_TIMESTAMP NOT VALID FOR DATA TYPE ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0013

The Snowflake error code doesn’t match the original Teradata error code.

### Description

This message is shown because the error code saved in the BTEQ ERRORCODE built-in variable could not be the same in Snowflake Scripting.

#### Example code

##### Input code:

```sql
SELECT * FROM table1;

.IF ERRORCODE<>0 THEN .EXIT 1

.QUIT 0
```

##### Generated Code:

```sql
 -- Additional Params: -q snowscript

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      SELECT
        *
      FROM
        table1;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    IF (STATUS_OBJECT['SQLCODE'] /*** SSC-FDM-TD0013 - THE SNOWFLAKE ERROR CODE MISMATCH THE ORIGINAL TERADATA ERROR CODE ***/ != 0) THEN
      RETURN 1;
    END IF;
    RETURN 0;
  END
$$
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0014

File execution inconsistency

### Description

This EWI appears when the migrated code is a BTEQ sentence executing an environment file with SQL statements E.g. $(<$INPUT_SQL_FILE). The difference between the BTEQ execution and the python generated code is that BTEQ continues with the other statements in the file when one of them fails but the python execution stops whenever an error occurs.

#### Example Code

##### Teradata BTEQ:

```sql
 .logmech LDAP;
.logon $LOGON_STR;
.SET DEFAULTS;

$(<$INPUT_SQL_FILE)

.export reset
.logoff
.quit
```

##### Python:

```python
#*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

from snowconvert.helpers import exec_file
import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
#** SSC-FDM-TD0022 - SHELL VARIABLES FOUND, RUNNING THIS CODE IN A SHELL SCRIPT IS REQUIRED **
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. LOGMECH **
  #.logmech LDAP;

  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. LOGON **
  #.logon $LOGON_STR

  #** SSC-EWI-TD0005 - THE STATEMENT WAS CONVERTED BUT ITS FUNCTIONALITY IS NOT IMPLEMENTED YET **
  Export.defaults()
  #** SSC-FDM-TD0014 - EXECUTION OF FILE WITH SQL STATEMENTS STOPS WHEN AN ERROR OCCURS **
  exec_file("$INPUT_SQL_FILE")
  #** SSC-EWI-TD0005 - THE STATEMENT WAS CONVERTED BUT ITS FUNCTIONALITY IS NOT IMPLEMENTED YET **
  Export.reset()
  #** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE. LOGOFF **
  #.logoff

  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0015

Regexp_Substr Function only supports POSIX regular expressions.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-0009](../conversion-issues/generalEWI.md) documentation

### Description

Currently, there is no support in Snowflake for extended regular expression beyond the POSIX Basic Regular Expression syntax.

This EWI is added every time a function call to *REGEX_SUBSTR, REGEX_REPLACE,* or *REGEX_INSTR* is transformed to Snowflake to warn the user about possible unsupported regular expressions. Some of the features **not supported** are lookahead, lookbehind, and non-capturing groups.

#### Example Code

##### Teradata:

```sql
 SELECT REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

##### Snowflake Scripting:

```sql
 SELECT
--** SSC-FDM-TD0015 - REGEXP_SUBSTR FUNCTION ONLY SUPPORTS POSIX REGULAR EXPRESSIONS **
REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

#### Best Practices

* Check the regular expression used in each case to determine whether it needs manual intervention. More information about expanded regex support and alternatives in Snowflake can be found [**here**](https://community.snowflake.com/s/question/0D50Z00007ENLKsSAP/expanded-support-for-regular-expressions-regex)**.**
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0016

Value ‘l’ for parameter ‘match_arg’ is not supported in Snowflake

### Description

In Teradata functions like *REGEX_SUBSTR, REGEX_REPLACE,* or *REGEX_INSTR* have a parameter called *“match_arg*”, a character argument with the following valid values:

* `'i'`: case-insensitive matching.
* `'c'`: case sensitive matching.
* `'n'`: the period character (match any character) can match the newline character.
* `'m'`: source string is treated as multiple lines instead of as a single line.
* **`'l'`**: if source_string exceeds the current maximum allowed source_string size (currently 16 MB), a NULL is returned instead of an error.
* `'x'`: ignore whitespace (only affects the pattern string).

The argument can contain more than one character.

In Snowflake, the equivalent argument for these functions is *`regexp_parameters.`*A *s*tring of one or more characters that specifies the regular expression parameters used for searching for matches. The supported values are:

* `c`: case-sensitive.
* `i`: case-insensitive.
* `m`: multi-line mode.
* `e`: extract sub-matches.
* `s`: the ‘.’ the wildcard also matches the newline character as well.

As it can be seen, values `'i', 'c', 'm'` are the same in both languages, and the `'n'` value in Teradata is mapped to `'s'`. However, values `'l', 'x'` don’t have an equivalent counterpart.

For the `'x'` value, the functionality is replicated by generating a call to the `REGEXP_REPLACE` function. However, the `'l'` parameter can not be replicated so this warning is generated for these cases.

#### Input Code:

```sql
 SELECT REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'i'),
       REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'c'),
       REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'm'),
       REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'n'),
       REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'l'),
       REGEXP_SUBSTR('Chip Chop','ch(i|o)p', 1, 1, 'x');
```

##### Generated Code:

```sql
 SELECT
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1, 'i'),
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1, 'c'),
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1, 'm'),
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1, 's'),
       --** SSC-FDM-TD0016 - VALUE 'l' FOR PARAMETER 'match_arg' IS NOT SUPPORTED IN SNOWFLAKE **
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1),
       REGEXP_SUBSTR('Chip Chop', 'ch(i|o)p', 1, 1);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0017

The use of foreign tables is not supported in Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TD0076](../conversion-issues/teradataEWI.md) documentation

### Description

[Foreign tables](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Data-Definition-Language-Syntax-and-Examples/September-2020/Table-Statements/CREATE-FOREIGN-TABLE) enable access to data in external object storage, such as semi-structured and unstructured data in Amazon S3, Azure Blob storage, and Google Cloud Storage. This syntax is not supported in Snowflake. However, there are other alternatives in Snowflake that can be used instead, such as external tables, iceberg tables, and standard tables.

#### Example code

##### Input code:

```sql
 SELECT cust_id, income, age FROM
FOREIGN TABLE (SELECT cust_id, income, age FROM twm_customer)@hadoop1 T1;
```

##### Generated Code:

```sql
 SELECT
cust_id,
income,
age FROM
--** SSC-FDM-TD0017 - THE USE OF FOREIGN TABLES IS NOT SUPPORTED IN SNOWFLAKE. **
 FOREIGN TABLE (SELECT cust_id, income, age FROM twm_customer)@hadoop1 T1;
```

#### Best Practices

* Instead of foreign tables in Teradata, you can use [Snowflake external tables](https://docs.snowflake.com/en/user-guide/tables-external.html). External tables reference data files located in a cloud storage (Amazon S3, Google Cloud Storage, or Microsoft Azure) data lake. This enables querying data stored in files in a data lake as if it were inside a database. External tables can access data stored in any format supported by [COPY INTO <table>](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html) statements.
* Another alternative is [Snowflake’s Iceberg tables](https://www.snowflake.com/blog/iceberg-tables-powering-open-standards-with-snowflake-innovations/?lang=es). So, you can think of Iceberg tables as tables that use open formats and customer-supplied cloud storage. This data is stored in Parquet files.
* Finally, there are the [standard Snowflake tables](https://docs.snowflake.com/en/sql-reference/sql/create-table.html) which can be an option to cover the functionality of foreign tables in Teradata
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0018

JSON path was not recognized

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TD0063](../conversion-issues/teradataEWI.md) documentation

### Description

This message is shown when SnowConvert AI cannot deserialize a JSON path, because the string does not have the expected format or is not supported in Snowflake.

#### Example code

##### Input Code:

```sql
 SELECT
    *
FROM
JSON_TABLE (
    ON (
        SELECT
            id,
            trainSchedule as ts
        FROM
            demo.PUBLIC.Train T
    ) USING rowexpr('$weekShedule.Monday[*]') colexpr(
        '[{"jsonpath"  "$.time",
              "type"" : "CHAR ( 12 )"}]'
    )
) AS JT(Id, Ordinal, Time, City);
```

##### Generated Code:

```sql
 SELECT
    *
FROM
    --** SSC-FDM-TD0018 - UNRECOGNIZED JSON PATH $weekShedule.Monday[*] **
JSON_TABLE (
    ON
       !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (
           SELECT
               id,
               trainSchedule as ts
FROM
               demo.PUBLIC.Train T
    ) USING rowexpr('$weekShedule.Monday[*]') colexpr(
        '[{"jsonpath"  "$.time",
              "type"" : "CHAR ( 12 )"}]'
    )
) AS JT(Id, Ordinal, Time, City);
```

#### Best Practices

* Check if the JSON path have an unexpected character, or do not have the right format.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0019

Transaction and profile level query tags not supported in Snowflake, referencing session query tag instead

### Description

Teradata allows users to define query bands at transaction, session, and profile levels, as well as consulting them with functions like GetQueryBandValue.

Snowflake equivalent for query bands is the query_tag parameter, which can be set for session, user or account. Also, Snowflake does not have profiles.

Due to these differences, this FDM is added to warn the user that transaction or profile-level query tags can not be defined nor consulted in Snowflake and that session-level query tags will be used as a replacement, which may cause functional differences in some cases.

#### Example Code

##### Input Code:

```sql
 SELECT GETQUERYBANDVALUE(3, 'account');
```

##### Generated Code

```sql
 SELECT
--** SSC-FDM-TD0019 - TRANSACTION AND PROFILE LEVEL QUERY TAGS NOT SUPPORTED IN SNOWFLAKE, REFERENCING SESSION QUERY TAG INSTEAD **
GETQUERYBANDVALUE_UDF('account');
```

#### Best Practices

* Modify your code logic to use query bands at the session level.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0020

JSON value was not recognized due to invalid format

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This message is shown when SnowConvert AI needs to deserialize JSON data for a transformation context, but the JSON value didn’t have the expected format or is not valid JSON.

#### Example code

##### Input Code:

```sql
 SELECT
*
FROM
 JSON_TABLE
(ON (SELECT id,
trainSchedule as ts
FROM demo.PUBLIC.Train T)
USING rowexpr('$.weekShedule.Monday[*]')
      colexpr('[ {"ordinal"  true},
                 {"jsonpath"  "$.time",
                  "type"" : "CHAR ( 12 )"},
                 {"jsonpath"  "$.city",
                  "type" : "VARCHAR ( 12 )"}]'))
AS JT(Id, Ordinal, Time, City);

SELECT
*
FROM
 JSON_TABLE
(ON (SELECT id,
trainSchedule as ts
FROM demo.PUBLIC.Train T)
USING rowexpr('$.weekShedule.Monday[*]')
      colexpr('{"jsonpath"  "$.time",
                  "type"" : "CHAR ( 12 )"}'))
AS JT(Id, Ordinal, Time, City);
```

##### Generated Code:

```sql
 SELECT
 *
 FROM
 (
  SELECT
   id
  --** SSC-FDM-TD0020 - UNRECOGNIZED JSON LITERAL [ {"ordinal" true}, {"jsonpath" "$.time", "type"" : "CHAR ( 12 )"}, {"jsonpath" "$.city", "type" : "VARCHAR ( 12 )"}] **
  FROM
   demo.PUBLIC.Train T,
   TABLE(FLATTEN(INPUT =>
   trainSchedule:weekShedule.Monday)) rowexpr
 ) JT;

 SELECT
 *
 FROM
 (
  SELECT
   id
  --** SSC-FDM-TD0020 - UNRECOGNIZED JSON LITERAL {"jsonpath" "$.time", "type"" : "CHAR ( 12 )"} **
  FROM
   demo.PUBLIC.Train T,
   TABLE(FLATTEN(INPUT =>
   trainSchedule:weekShedule.Monday)) rowexpr
 ) JT;
```

#### Best Practices

* Be sure the JSON has the expected format according to the Teradata grammar.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0021

Built-in reference to {0} is not supported in Snowflake.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-EWI-TD0046](../conversion-issues/teradataEWI.md) documentation

### Description

This error appears when a query referencing [DBC.DATABASES](https://www.docs.teradata.com/r/hNI_rA5LqqKLxP~Y8vJPQg/GqTx8VuBIkfaC4fso9f5cw) table is executed, and the selected column has no equivalence in Snowflake.

#### Example Code

##### Input:

```sql
 CREATE VIEW SAMPLE_VIEW
AS
SELECT PROTECTIONTYPE FROM DBC.DATABASES;
```

##### Output:

```sql
 CREATE OR REPLACE VIEW SAMPLE_VIEW
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "08/14/2024" }}'
AS
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0046 - BUILT-IN REFERENCE TO PROTECTIONTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
PROTECTIONTYPE FROM
INFORMATION_SCHEMA.DATABASES;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0022

Shell variables found, running this code in a shell script is required.

### Description

In Teradata scripts, shell variables are used to store temporary values that can be accessed and manipulated throughout the script. Shell variables are defined using the dollar sign ($) followed by a name (which can be enclosed by curly braces), and their values can be set using the assignment operator (=).

```none
#!/bin/bash

## define a shell variable
tablename="mytable"

## use the variable in a Teradata SQL query
bteq <<EOF
    .LOGON myhost/myuser,mypassword
    SELECT * FROM ${tablename};
    .LOGOFF
EOF
```

You can think of shell variables having the same or similar function as string interpolation. Thus, it is important to keep this functionality when transformed.

When converting Scripts to Python, shell variables keep their functionality by running the converted code in a shell script (.sh file). For this reason, these shell variables must keep the same format as the input code.

### Example Code

#### Input Code:

```sql
 SELECT $column FROM ${tablename}
```

##### Generated Code

```sql
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
#** SSC-FDM-TD0022 - SHELL VARIABLES FOUND, RUNNING THIS CODE IN A SHELL SCRIPT IS REQUIRED **
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  exec("""
    SELECT
      $column
    FROM
      ${tablename}
    """)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

#### Best Practices

* Running the converted code in a shell script is required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0023

String Similarity might have a different behavior.

### Description

This FDM is shown when SnowConvert AI transforms the Similarity Function from Teradata to Snowflake. It indicates the results might have different behavior.

#### Example Code

Given the following data as an example

| Id | a | b |
| --- | --- | --- |
| 1 |  |  |
| 2 | Gute nacht | Ich weis nicht |
| 3 | Ich weiß nicht | Ich wei? nicht |
| 4 | Ich weiß nicht | Ich wei? nicht |
| 5 | Ich weiß nicht | Ich weiss nicht |
| 6 | Snowflake | Oracle |
| 7 | święta | swieta |
| 8 | NULL |  |
| 9 | NULL | NULL |

##### Input Code:

##### Query

```sql
-- Additional Params: -q SnowScript
SELECT * FROM StringSimilarity (
  ON (
    SELECT id, CAST(a AS VARCHAR(200)) AS a, CAST(b AS VARCHAR(200)) AS b
    FROM table_1
  ) PARTITION BY ANY
  USING
  ComparisonColumnPairs ('jaro_winkler(a,b) AS sim_fn')
  Accumulate ('id')
) AS dt ORDER BY 1;
```

##### Result

| Id | sim_fn |
| --- | --- |
| 1 | 0 |
| 2 | 0.565079365 |
| 3 | 1 |
| 4 | 0.959047619 |
| 5 | 0 |
| 6 | 0.611111111 |
| 7 | 0.7777777777777777 |
| 8 | 0 |
| 9 | 0 |

##### Generated Code

##### Query

```sql
 SELECT
* FROM
--** SSC-FDM-TD0023 - STRING SIMILARITY MIGHT HAVE A DIFFERENT BEHAVIOR. **
(
   SELECT
     id,
     JAROWINKLER_UDF(a, b) AS sim_fn
   FROM table_1
 ) dt ORDER BY 1;
```

##### Result

| ID | SIM_FN |
| --- | --- |
| 1 | 0.000000 |
| 2 | 0.560000 |
| 3 | 0.970000 |
| 4 | 0.950000 |
| 5 | 0.000000 |
| 6 | 0.610000 |
| 7 | 0.770000 |
| 8 | 0.000000 |
| 9 | 0.000000 |

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0024

Set table functionality not supported.

### Description

This EWI is shown when SnowConvert AI finds a Create Table with the SET option. Since the SET TABLE is not supported in Snowflake, it is removed.

#### Example Code

##### Teradata:

```sql
 CREATE SET TABLE TableExample
(
ColumnExample Number
)
```

```sql
 CREATE SET VOLATILE TABLE SOMETABLE, LOG AS
(SELECT ColumnExample FROM TableExample);
```

##### Snowflake Scripting:

```sql
 --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
CREATE OR REPLACE TABLE TableExample
(
ColumnExample NUMBER(38, 18)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;
```

```sql
 --** SSC-FDM-TD0024 - SET TABLE FUNCTIONALITY NOT SUPPORTED. TABLE MIGHT HAVE DUPLICATE ROWS **
CREATE OR REPLACE TEMPORARY TABLE SOMETABLE
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
AS
(
SELECT
ColumnExample FROM
TableExample
);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0025

Teradata Database Temporal Table is not supported in Snowflake

### Description

The [Teradata Database Temporal Support](https://docs.teradata.com/r/0TSAVrLIwk23SLHbA4nUvQ/root) involves the creation of temporal tables and temporal DDL and DML objects. The support for temporal (time-aware) tables and data are not supported in Snowflake since there is not an absolute equivalent.

All these statements are recognized (parsed) by SnowConvert AI, but to execute the queries in Snowflake, these elements are removed in the translation process.

It is worth noting that in cases where an `abort` statement is encountered, it will be transformed into a `Delete` command to keep the equivalence functionality allows you to undo operations performed during a transaction and restore the database to the state it had at the beginning.

#### Example code

The following example shows a Temporal-form Select being translated to a usual Select.

##### Input code:

```sql
 SEQUENCED VALIDTIME
   SELECT
   Policy_ID,
   Customer_ID
   FROM Policy
      WHERE Policy_Type = 'AU';
```

##### Generated Code:

```sql
 ----** SSC-FDM-TD0025 - TEMPORAL FORMS ARE NOT SUPPORTED IN SNOWFLAKE **
--SEQUENCED VALIDTIME
SELECT
   Policy_ID,
   Customer_ID
   FROM
   Policy
      WHERE
   UPPER(RTRIM( Policy_Type)) = UPPER(RTRIM('AU'));
```

Case where the `Abort` command is used in the context of a transaction.

##### Input code:

```sql
 CREATE OR REPLACE PROCEDURE TEST.ABORT_STATS()
BEGIN
    CURRENT VALIDTIME AND NONSEQUENCED TRANSACTIONTIME ABORT
     FROM table_1
     WHERE table_1.x1 = 1;
END;
```

##### Generated Code:

```sql
 CREATE OR REPLACE PROCEDURE TEST.ABORT_STATS ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        --    CURRENT VALIDTIME AND NONSEQUENCED TRANSACTIONTIME
        --** SSC-FDM-TD0025 - TEMPORAL FORMS ARE NOT SUPPORTED IN SNOWFLAKE **
        LET _ROW_COUNT FLOAT;
        SELECT
            COUNT(*)
        INTO
            _ROW_COUNT
            FROM
            table_1
                 WHERE table_1.x1 = 1;
            IF (_ROW_COUNT > 0) THEN
            ROLLBACK;
            END IF;
    END;
$$;
```

####

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0026

GOTO statement was removed due to if statement inversion.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

It is common to use GOTO command with IF and LABEL commands to replicate the functionality of an SQL if statement. When used in this way, it is possible to transform them directly into an if, if-else, or even an if-elseif-else statement. However, in these cases, the GOTO commands become unnecessary and should be removed to prevent them from being replaced by a LABEL section.

#### Example Code

**Input Code:**

```sql
 -- Additional Params: --scriptsTargetLanguage SnowScript
.If ActivityCount = 0 THEN .GOTO endIf
DROP TABLE TABLE1;
.Label endIf
SELECT A FROM TABLE1;
```

**Output Code**

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    IF (NOT (STATUS_OBJECT['SQLROWCOUNT'] = 0)) THEN
      --** SSC-FDM-TD0026 - GOTO endIf WAS REMOVED DUE TO IF STATEMENT INVERSION **

      BEGIN
        DROP TABLE TABLE1;
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
      EXCEPTION
        WHEN OTHER THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
    END IF;
    /*.Label endIf*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    BEGIN
      SELECT
        A
      FROM
        TABLE1;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
  END
$$
```

##### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0027

TD_UNPIVOT transformation requires column information that could not be found, columns missing in result

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TD0061](../conversion-issues/teradataEWI.md) documentation.

### Description

SnowConvert AI supports and transforms the [TD_UNPIVOT](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Operators-and-User-Defined-Functions/Table-Operators/TD_UNPIVOT) function, which can be used to represent columns from a table as rows.

However, this transformation requires information about the table/tables columns to work, more specifically the names of the columns. When this information is not present the transformation may be left in an incomplete state where columns are missing from the result, this EWI is generated in these cases.

#### Example code

##### Input Code:

```sql
 CREATE TABLE unpivotTable  (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
);

SELECT * FROM
 TD_UNPIVOT(
 	ON unpivotTable
 	USING
 	VALUE_COLUMNS('Income', 'Expenses')
 	UNPIVOT_COLUMN('Semester')
 	COLUMN_LIST('firstSemesterIncome, firstSemesterExpenses', 'secondSemesterIncome, secondSemesterExpenses')
 	COLUMN_ALIAS_LIST('First', 'Second')
 )X ORDER BY mykey;

SELECT * FROM
 TD_UNPIVOT(
 	ON unknownTable
 	USING
 	VALUE_COLUMNS('MonthIncome')
 	UNPIVOT_COLUMN('Months')
 	COLUMN_LIST('januaryIncome', 'februaryIncome', 'marchIncome', 'aprilIncome')
 	COLUMN_ALIAS_LIST('January', 'February', 'March', 'April')
 )X ORDER BY yearKey;
```

##### Generated Code:

```sql
 CREATE TABLE unpivotTable (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},{"attributes":{"component":"teradata"}}'
;

SELECT
	* FROM
	(
		SELECT
			myKey,
			TRIM(GET_IGNORE_CASE(OBJECT_CONSTRUCT('FIRSTSEMESTERINCOME', 'First', 'FIRSTSEMESTEREXPENSES', 'First', 'SECONDSEMESTERINCOME', 'Second', 'SECONDSEMESTEREXPENSES', 'Second'), Semester), '"') AS Semester,
			Income,
			Expenses
		FROM
			unpivotTable UNPIVOT(Income FOR Semester IN (
				firstSemesterIncome,
				secondSemesterIncome
			)) UNPIVOT(Expenses FOR Semester1 IN (
				firstSemesterExpenses,
				secondSemesterExpenses
			))
		WHERE
			Semester = 'FIRSTSEMESTERINCOME'
			AND Semester1 = 'FIRSTSEMESTEREXPENSES'
			OR Semester = 'SECONDSEMESTERINCOME'
			AND Semester1 = 'SECONDSEMESTEREXPENSES'
	) X ORDER BY mykey;

	SELECT
	* FROM
	--** SSC-FDM-TD0027 - TD_UNPIVOT TRANSFORMATION REQUIRES COLUMN INFORMATION THAT COULD NOT BE FOUND, COLUMNS MISSING IN RESULT **
	(
		SELECT
			TRIM(GET_IGNORE_CASE(OBJECT_CONSTRUCT('JANUARYINCOME', 'January', 'FEBRUARYINCOME', 'February', 'MARCHINCOME', 'March', 'APRILINCOME', 'April'), Months), '"') AS Months,
			MonthIncome
		FROM
			unknownTable UNPIVOT(MonthIncome FOR Months IN (
				januaryIncome,
				februaryIncome,
				marchIncome,
				aprilIncome
			))
	) X ORDER BY yearKey;
```

#### Best Practices

* There are two ways of supplying the information about columns to the conversion tool: put the table specification in the same file as the TD_UNPIVOT call or specify a column list in the SELECT query of the ON expression instead of SELECT \* or the table name.
* This issue can be safely ignored if ALL the columns from the input table/tables are unpivoted, otherwise, the result will have missing columns.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0028

JSON_TABLE not transformed, column names could not be retrieved from semantic information

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-TD0060](../conversion-issues/teradataEWI.md) documentation.

### Description

The JSON_TABLE function can be transformed by SnowConvert AI, however, this transformation requires knowing the name of the columns that are being selected in the JSON_TABLE ON subquery.

This message is generated to warn the user that the column names were not explicitly put in the subquery (for example, a SELECT \* was used) and the semantic information of the tables being referenced was not found, meaning the column names could not be extracted.

#### Example code

##### Input Code:

```sql
 CREATE TABLE demo.Train (
    firstCol INT,
    jsonCol JSON(400),
    thirdCol VARCHAR(30)
);

SELECT * FROM JSON_TABLE
(ON (SELECT T.*
           FROM demo.Train T)
USING rowexpr('$.schools[*]')
               colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
)
AS JT;

SELECT * FROM JSON_TABLE
(ON (SELECT T.*
           FROM demo.missingTable T)
USING rowexpr('$.schools[*]')
               colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
)
AS JT;
```

##### Generated Code:

```sql
 CREATE TABLE demo.Train (
    firstCol INT,
    jsonCol VARIANT,
    thirdCol VARCHAR(30)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT
    * FROM
    (
        SELECT
            firstCol,
            rowexpr.value:name :: CHAR(20) AS Column_0,
            rowexpr.value:type :: VARCHAR(20) AS Column_1,
            thirdCol
        FROM
            demo.Train T,
            TABLE(FLATTEN(INPUT => jsonCol:schools)) rowexpr
    ) JT;

    SELECT
    * FROM
    --** SSC-FDM-TD0028 - JSON_TABLE NOT TRANSFORMED, COLUMN NAMES COULD NOT BE RETRIEVED FROM SEMANTIC INFORMATION **
    JSON_TABLE
   (ON
       !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (
        SELECT
            T.*
                  FROM
            demo.missingTable T)
   USING rowexpr('$.schools[*]')
                  colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
   )
   AS JT;
```

#### Best Practices

* Please check the code provided to SnowConvert AI is complete, if you did not provide the table definition please re-execute the code with the table definition present.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0029

Snowflake supported formats for TO_CHAR differ from Teradata and may fail or have different behavior

### Format elements that depend on session parameters

Some Teradata format elements are mapped to Snowflake functions that depend on the value of session parameters. To avoid functional differences in the results you should set these session parameters to the same values they have in Teradata. Identified format elements that are mapped to this kind of functions are:

* **D**: Mapped to `DAYOFWEEK` function, the results of this function depend on the `WEEK_START` session parameter, by default Teradata considers Sunday as the first day of the week, while in Snowflake it is Monday.
* **WW**: Mapped to `WEEK` function, this function depends on the session parameter `WEEK_OF_YEAR_POLICY` which by default is set to use the ISO standard (the first week of year is the first to contain at least four days of January) but in Teradata is set to consider January first as the start of the first week.

To modify session parameters, use `ALTER SESSION SET parameter_name = value`. For more information, see the [Snowflake session parameters reference](https://docs.snowflake.com/en/sql-reference/parameters.html).

#### Single parameter version of TO_CHAR

The single parameter version of `TO_CHAR(Datetime)` makes use of the default formats specified in the session parameters `TIMESTAMP_LTZ_OUTPUT_FORMAT`, `TIMESTAMP_NTZ_OUTPUT_FORMAT`, `TIMESTAMP_TZ_OUTPUT_FORMAT` and `TIME_OUTPUT_FORMAT`. To avoid differences in behavior please set them to the same values used in Teradata.

For `TO_CHAR(Numeric)` Snowflake generates the varchar representation using either the `TM9` or `TME` formats to get a compact representation of the number, Teradata also generates compact representations of the numbers so no action is required.

#### Example Code

##### Input Code:

```sql
 select to_char(date '2008-09-13', 'DD/RM/YYYY');

select to_char(date '2010-10-20', 'DS');

select to_char(1255.495, 'SC9999.9999', 'nls_iso_currency = ''EUR''');

select to_char(45620);
```

##### Generated Code:

```sql
 SELECT
TO_CHAR(date '2008-09-13', 'DD/') || PUBLIC.ROMAN_NUMERALS_MONTH_UDF(date '2008-09-13') || TO_CHAR(date '2008-09-13', '/YYYY') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;

SELECT
TO_CHAR(date '2010-10-20', 'MM/DD/YYYY') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;

SELECT
PUBLIC.INSERT_CURRENCY_UDF(TO_CHAR(1255.495, 'S9999.0000'), 2, 'EUR') /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;

SELECT
TO_CHAR(45620) /*** SSC-FDM-TD0029 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/;
```

### Best Practices

* When using FF either try to use DateTime types with the same precision that you use in Teradata or add a precision to the format element to avoid the different behavior.
* When using timezone-related format elements, use the first parameter of type `TIMESTAMP_TZ` to avoid different behavior. Also remember that the `TIME` type cannot have time zone information in Snowflake.
* Set the necessary session parameters with the default values from Teradata to avoid different behavior.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0030

A return statement was added at the end of the label section to ensure the same execution flow

### Description

When a Goto statement is replaced with a Label section and does not contain a return statement, one is added at the end of the section to ensure the same execution flow.

BTEQ after a Goto command is executed, the statements between the goto command and the label command with the same name are ignored. So, to avoid those statements being executed the label section should contain a return statement.

In addition, it is worth value mentioning the Goto command skips all the other statements except for the Label with the same name, which is when the execution resumes. Therefore, the execution will never resume in a label section defined before the Goto command.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: --scriptsTargetLanguage SnowScript
.LOGON dbc,dbc;
select 'STATEMENTS';
.GOTO LABEL_B
select 'IGNORED STATEMENTS';
.label LABEL_B
select 'LABEL_B STATEMENTS';
```

##### Generated Code

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    -- Additional Params: --scriptsTargetLanguage SnowScript
    --.LOGON dbc,dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      SELECT
        'STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;

    /*.label LABEL_B*/

    BEGIN
      SELECT
        'LABEL_B STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    --** SSC-FDM-TD0030 - A RETURN STATEMENT WAS ADDED AT THE END OF THE LABEL SECTION LABEL_B TO ENSURE THE SAME EXECUTION FLOW **
    RETURN 0;
    BEGIN
      SELECT
        'IGNORED STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    /*.label LABEL_B*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    BEGIN
      SELECT
        'LABEL_B STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
  END
$$
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0031

ST_DISTANCE results are slightly different from ST_SPHERICALDISTANCE

### Description

The Teradata function ST_SPHERICALDISTANCE calculates the distance between two spherical coordinates on the planet using the Haversine formula, on the other side, the Snowflake ST_DISTANCE function does not utilize the haversine formula to calculate the minimum distance between two geographical points.

#### Example Code

##### Input Code:

```sql
 --The distance between New York and Los Angeles
Select Cast('POINT(-73.989308 40.741895)' As ST_GEOMETRY) As location1,
	Cast('POINT(40.741895 34.053691)' As ST_GEOMETRY) As location2,
	location1.ST_SPHERICALDISTANCE(location2) As Distance_In_km;
```

##### Teradata Output

| location1 | location2 | Distance_In_Km |
| --- | --- | --- |
| POINT (-73.989308 40.741895) | POINT (40.741895 34.053691) | 9351139.978062356 |

##### Generated Code

```sql
 --The distance between New York and Los Angeles
SELECT
	TO_GEOGRAPHY('POINT(-73.989308 40.741895)') As location1,
	TO_GEOGRAPHY('POINT(40.741895 34.053691)') As location2,
	--** SSC-FDM-TD0031 - ST_DISTANCE RESULTS ARE SLIGHTLY DIFFERENT FROM ST_SPHERICALDISTANCE **
	ST_DISTANCE(
	location1, location2) As Distance_In_km;
```

##### Snowflake Output

| LOCATION1 | LOCATION2 | DISTANCE_IN_KM |
| --- | --- | --- |
| { “coordinates”: [ -73.989308, 40.741895 ], “type”: “Point” } | { “coordinates”: [ 40.741895, 34.053691 ], “type”: “Point” } | 9351154.65572674 |

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0032

CASESPECIFIC clause was removed from LIKE expression

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Description

This error appears when the `LIKE` expression is accompanied by the `[NOT] CASESPECIFIC` clause.

#### Example Code

##### Input Code:

```sql
 SELECT * FROM MY_TABLE
WHERE Name Like 'Marco%' (NOT CASESPECIFIC);
```

##### Generated Code

```sql
 SELECT
    * FROM
    MY_TABLE
WHERE Name ILIKE 'Marco%' /*** SSC-FDM-TD0032 - NOT CASESPECIFIC CLAUSE WAS REMOVED ***/;
```

#### Best Practices

* Case-Specific Behavior in TERADATA depends on TMODE system configuration.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0033

ACTIVITY_COUNT transformation might require manual adjustments

### Description

The `ACTIVITY_COUNT` status variable returns the number of rows affected by an SQL DML statement in an embedded SQL or stored procedure application. For more information, see the [Teradata ACTIVITY_COUNT documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Stored-Procedures-and-Embedded-SQL/Result-Code-Variables/ACTIVITY_COUNT).

As explained in its translation specification, there is a workaround to emulate `ACTIVITY_COUNT`’s behavior through:

```sql
 SELECT $1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

However, this presents some limitations listed below.

### Limitations

#### First case

If `ACTIVITY_COUNT` is called twice or more times before executing another DML statement, the transformation might not return the expected values.

##### Teradata

```sql
 REPLACE PROCEDURE InsertEmployeeSalaryAndLog_1 ()
BEGIN
    DECLARE row_count1 INT;

    INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
    VALUES (101, 'Alice', 'Smith', 10, 70000.00);

    -- Get the ACTIVITY_COUNT
    SET row_count1 = ACTIVITY_COUNT;
    SET row_count1 = ACTIVITY_COUNT;

    -- Insert the ACTIVITY_COUNT into the activity_log table
    INSERT INTO activity_log (operation, row_count)
    VALUES ('INSERT PROCEDURE', row_count1);
END;

REPLACE PROCEDURE InsertEmployeeSalaryAndLog_2 ()
BEGIN
    DECLARE row_count1 INT;
    DECLARE message VARCHAR(100);

    INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
    VALUES (101, 'Alice', 'Smith', 10, 70000.00);

    -- Get the ACTIVITY_COUNT
    SET row_count1 = ACTIVITY_COUNT + 1;
    SET row_count1 = ACTIVITY_COUNT;

    -- Insert the ACTIVITY_COUNT into the activity_log table
    INSERT INTO activity_log (operation, row_count)
    VALUES ('INSERT PROCEDURE', row_count1);
END;
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE InsertEmployeeSalaryAndLog_1 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/15/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        row_count1 INT;
    BEGIN

        INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
        VALUES (101, 'Alice', 'Smith', 10, 70000.00);

           -- Get the ACTIVITY_COUNT
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;

        -- Insert the ACTIVITY_COUNT into the activity_log table
        INSERT INTO activity_log (operation, row_count)
        VALUES ('INSERT PROCEDURE', :row_count1);
    END;
$$;

CREATE OR REPLACE PROCEDURE InsertEmployeeSalaryAndLog_2 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/15/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        row_count1 INT;
        message VARCHAR(100);
    BEGIN

        INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
        VALUES (101, 'Alice', 'Smith', 10, 70000.00);

           -- Get the ACTIVITY_COUNT
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/ + 1;
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;

        -- Insert the ACTIVITY_COUNT into the activity_log table
        INSERT INTO activity_log (operation, row_count)
        VALUES ('INSERT PROCEDURE', :row_count1);
    END;
$$;
```

In both procedures, `ACTIVITY_COUNT` is called twice before another DML statement is called. In Teradata, `ACTIVITY_COUNT` will return the number of rows in the `INSERT` statement above them, even when called twice. However, since the Snowflake transformation uses `LAST_QUERY_ID()`, the result depends on the result set held by `LAST_QUERY_ID()`.

`InsertEmployeeSalaryAndLog_1()` requires no manual adjustments. Check the Query History (bottom-up):

1. `INSERT` statement is executed. `LAST_QUERY_ID()` will point to this statement.
2. `SELECT` (first `ACTIVITY_COUNT`) is executed, and `$1` will be `1`. `LAST_QUERY_ID()` will point to this statement.
3. `SELECT` (second `ACTIVITY_COUNT`) is executed; since the last statement result was `1`, `$1` will be `1` for this `SELECT` as well.
4. Finally, `row_count1` holds the value `1`, which is inserted in `activity_log`.

On the other side, `InsertEmployeeSalaryAndLog_2()` does require manual adjustments. Check the Query History (bottom-up):

1. `INSERT` statement is executed. `LAST_QUERY_ID()` will point to this statement.
2. SELECT (first `ACTIVITY_COUNT`) is executed, and `$1` will be `1`. However, notice how `QUERY_TEXT` has the `+ 10`; this will affect the result that will be scanned. `LAST_QUERY_ID()` will point to this statement.
3. `SELECT` (second `ACTIVITY_COUNT`) is executed. The result for the last query is `11`; thus `$1` will hold `11` instead of the expected `1`.
4. Finally, `row_count1` holds the value `11`, which is inserted in `activity_log`.

These are the values inserted in `activity_log`:

| LOG_ID | OPERATION | ROW_COUNT | LOG_TIMESTAMP |
| --- | --- | --- | --- |
| 1 | INSERT PROCEDURE | 1 | 2024-07-15 09:22:21.725 |
| 101 | INSERT PROCEDURE | 11 | 2024-07-15 09:22:26.248 |

#### Adjustments for the first case

As per Snowflake’s documentation for [LAST_QUERY_ID](https://docs.snowflake.com/en/sql-reference/functions/last_query_id), you can specify the query to return, based on the position of the query. `LAST_QUERY_ID(-1)` returns the latest query, `(-2)` the second last query, and so on.

The fix for the problem in `InsertEmployeeSalaryAndLog_2()` will be to simply specify `LAST_QUERY_ID(-2)` in the second use of `ACTIVITY_COUNT` (second `SELECT`) so that it gets the results from the `INSERT` statement instead:

```sql
 ...
INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
        VALUES (101, 'Alice', 'Smith', 10, 70000.00);

           -- Get the ACTIVITY_COUNT
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/ + 1;
        row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID(-2)))
        ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;
...
```

#### Second case

If `ACTIVITY_COUNT` is called after a non DML statement was executed, the transformation will not return the expected values.

##### Teradata

```sql
REPLACE PROCEDURE InsertEmployeeSalaryAndLog_3 ()
BEGIN
    DECLARE row_count1 INT;
    DECLARE emp_id INT;
    DECLARE message VARCHAR(100);

    INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
    VALUES (101, 'Alice', 'Smith', 10, 70000.00);

    SELECT employee_id INTO emp_id FROM employees;
    -- Get the ACTIVITY_COUNT
    SET row_count1 = ACTIVITY_COUNT;
    SET message = 'EMPLOYEE INSERTED - ID: ' || emp_id;

    -- Insert the ACTIVITY_COUNT into the activity_log table
    INSERT INTO activity_log (operation, row_count)
    VALUES (message, row_count1);
END;
```

##### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE InsertEmployeeSalaryAndLog_3 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/15/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
        row_count1 INT;
        emp_id INT;
        message VARCHAR(100);
    BEGIN

        INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
        VALUES (101, 'Alice', 'Smith', 10, 70000.00);
        SELECT
            employee_id INTO
            :emp_id
        FROM
            employees;
               -- Get the ACTIVITY_COUNT
               row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID()))
               ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;
               message := 'EMPLOYEE INSERTED - ID: ' || emp_id;

               -- Insert the ACTIVITY_COUNT into the activity_log table
               INSERT INTO activity_log (operation, row_count)
               VALUES (:message, :row_count1);
    END;
$$;
```

Similar to the previous, `LAST_QUERY_ID` does not point to the correct query and thus returns an incorrect value, which is assigned to row_count1. Check the Query History (bottom-up):

1. `INSERT` statement is executed. `LAST_QUERY_ID()` will point to this statement.
2. `SELECT INTO` is executed, and $1 will be 101. `LAST_QUERY_ID()` will point to this statement.
3. `SELECT` (`ACTIVITY_COUNT`) is executed. The result for the last query is `101`; thus `$1` will hold `101` instead of the expected 1.
4. Finally, `row_count1` holds the value `101`, which is inserted in `activity_log`.

These are the values inserted in activity_log:

| LOG_ID | OPERATION | ROW_COUNT | LOG_TIMESTAMP |
| --- | --- | --- | --- |
| 1 | EMPLOYEE INSERTED - ID: 101 | 101 | 2024-07-15 11:00:38.000 |

#### Adjustments for the second case

1. One possible fix is to specify the correct query to return by `LAST_QUERY_ID`. For example, here `LAST_QUERY_ID(-2)` will be the correct query to point to.

```sql
 ...
row_count1 := (
            SELECT
                $1
            FROM
                TABLE(RESULT_SCAN(LAST_QUERY_ID(-2)))
               ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;
               ...
```

2. Another possible fix is to use `ACTIVITY_COUNT` (`SELECT`) immediately after executing the `INSERT` statement.

```sql
...
INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
VALUES (101, 'Alice', 'Smith', 10, 70000.00);
-- Get the ACTIVITY_COUNT
       row_count1 := (
    SELECT
        $1
    FROM
        TABLE(RESULT_SCAN(LAST_QUERY_ID()))
       ) /*** SSC-FDM-TD0033 - 'ACTIVITY_COUNT' TRANSFORMATION MIGHT REQUIRE MANUAL ADJUSTMENTS ***/;
SELECT
    employee_id INTO
    :emp_id
FROM
    employees;
       message := 'EMPLOYEE INSERTED - ID: ' || emp_id;
...
```

#### Best Practices

* Make sure to point to the correct query when using `LAST_QUERY_ID`.
* Make sure `ACTIVITY_COUNT` is used immediately after the DML statement to evaluate.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0034

Period contains transformed to user defined function.

### Description

The Teradata `CONTAINS` expression performs a validation indicating whether the element at the right is contained in the element at the left which is supposed to be of `PERIOD` type. The CONTAINS only applies for `DATE`, `TIME`, `TIMESTAMP` or `PERIOD`. Since `PERIOD` is not supported in Snowflake, an user-defined function will emulate the logic of the native `CONTAINS` behavior.

#### Example Code

##### Input Code:

```sql
  UPDATE TABLE1
  SET COL1 = CURRENT_TIMESTAMP
  WHERE COL3 CONTAINS CURRENT_TIMESTAMP;
```

##### Generated Code

```sql
  UPDATE TABLE1
  SET
    COL1 = CURRENT_TIMESTAMP()
  WHERE
    PUBLIC.PERIOD_CONTAINS_UDF(COL3, CURRENT_TIMESTAMP()) /*** SSC-FDM-TD0034 - PERIOD CONTAINS EXPRESSION TRANSFORMED TO USER DEFINED FUNCTION. ***/
```

#### Best Practices

* The `VARCHAR` used instead of `PERIOD` assumes `<PERIOD_BEGIN>*<PERIOD_END>` format in all the values. If the values are split by a token different than `*`, you can change the value returned from the `PUBLIC.GET_PERIOD_SEPARATOR` UDF provided by SnowConvert AI. Notice that the structure should have a token that marks the begin and end of a PERIOD, so the two dates, times or timestamps should be always separated with the same token.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0035

Statistics function not needed in Snowflake.

> **Note:**
>
> This FDM is deprecated, please refer to [SSC-EWI-0037](generalFDM.md) documentation

### Description

DROP, COLLECT, or HELP statistics are not needed in Snowflake. Snowflake already collects statistics used for automatic query optimization, which is why these statistics statements are used in Teradata.

#### Example Code

##### Input Code:

```sql
  HELP STATISTICS TestName;
```

##### Generated Code

```sql
  ----** SSC-FDM-TD0035 - HELP STATISTICS NOT NEEDED. SNOWFLAKE AUTOMATICALLY COLLECTS STATISTICS. **
  --HELP STATISTICS TestName
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0036

Snowflake does not support the period datatype, all periods are handled as varchar instead

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Precision of generated varchar representations

PERIOD_UDF generates the varchar representation of a period using the default formats for timestamps and time specified in Snowflake, this means timestamps will have three precision digits and time variables will have zero, because of this you may find that the results have a higher/lower precision from the expected, there are two options to modify how many precision digits are included in the resulting string:

* Use the three parameters version of PERIOD_UDF: This overload of the function takes the`PRECISIONDIGITS`parameter, an integer between 0 and 9 to control how many digits of the fractional time part will be included in the result. Note that even if Snowflake supports up to nine digits of precision the maximum in Teradata is six. Example:

| Call | Result |
| --- | --- |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 0)` | `'13:30:45*15:35:20'` |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 2)` | `'13:30:45.87*15:35:20.34'` |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 5)` | `'13:30:45.87055*15:35:20.34489'` |

* Alter the session parameters `TIMESTAMP_NTZ_OUTPUT_FORMAT` and `TIME_OUTPUT_FORMAT`: The commands `ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT = <format>` and`ALTER SESSION SET TIME_OUTPUT_FORMAT = <format>`

  can be used to modify the formats Snowflake uses by default for the current session, modifying them to include the desired number of precision digits changes the result of future executions of PERIOD_UDF for the current session.

#### Example code

##### Input code:

```sql
 create table vacations (
    employeeName varchar(50),
    duration period(date)
);

insert into vacations values ('Richard', period(date '2021-05-15', date '2021-06-15'));

select end(duration) from vacations;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE vacations (
    employeeName varchar(50),
    duration VARCHAR(24) /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

INSERT INTO vacations
VALUES ('Richard', PUBLIC.PERIOD_UDF(date '2021-05-15', date '2021-06-15') /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/);

SELECT
    PUBLIC.PERIOD_END_UDF(duration) /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/ from
    vacations;
```

#### Best Practices

* Since the behavior of`PERIOD`and its related functions is emulated using varchar, we recommend reviewing the results obtained to ensure its correctness.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0037

LOGTABLE removed.

### Description

The [`.LOGTABLE`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/LOGTABLE) command in [MLoad](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Using-Teradata-MultiLoad) is used for checkpoint and restart metadata, but Snowflake handles these features automatically. Instead of `.LOGTABLE`, you can monitor and audit your data loads in Snowflake using the [`COPY_HISTORY`](https://docs.snowflake.com/en/sql-reference/functions/copy_history) function and related [account usage views](https://docs.snowflake.com/en/sql-reference/account-usage).

#### Code Example

##### Input Code:

```sql
.LOGTABLE ${DATABASE}.LT_EMPLOYEES;
```

##### Generated Code:

```sql
--** SSC-FDM-TD0037 - REMOVED NEXT STATEMENT. USE COPY_HISTORY() FOR MONITORING **
-- .LOGTABLE ${DATABASE}.LT_EMPLOYEES;
```

#### Best Practices

* Use [`COPY_HISTORY`](https://docs.snowflake.com/en/sql-reference/functions/copy_history) and related Snowflake account usage views to monitor load history.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0038

PUT command requires SnowSQL execution.

### Description

The [`PUT`](https://docs.snowflake.com/en/sql-reference/sql/put) command lets you upload files to a Snowflake [stage](https://docs.snowflake.com/en/sql-reference/sql/create-stage), but it only works when you run your script with [SnowSQL](https://docs.snowflake.com/en/user-guide/snowsql). It won’t work inside scripts, procedures, or the web UI. If your script includes a `PUT` command, make sure to run it using SnowSQL.

#### Code Example

##### Generated Code:

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.csv @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name,
        last_name
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3
        FROM
          @sc_import_stage/employees.csv
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

#### Best Practices

* Run scripts containing [`PUT`](https://docs.snowflake.com/en/sql-reference/sql/put) commands using [SnowSQL](https://docs.snowflake.com/en/user-guide/snowsql).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0039

Collation handled at query level for this table, any new query over this table should apply collation appropriately.

### Description

When the “Disable use of COLLATE for Case Specification” [general conversion setting](../../../getting-started/running-snowconvert/conversion/teradata-conversion-settings.md) is enabled, SnowConvert AI will emulate the case insensitive behavior of the NOT CASESPECIFIC clause by modifying comparisons in queries with the UPPER function, this is performed at query level instead of using collation at the column level. This warning will be generated on any table whose case sensitivity is being emulated at the query level to remind the user that any new query over these tables will require to properly handle the case sensitivity behavior on comparisons.

#### Example code

##### Input code:

```sql
CREATE TABLE my_table
(
    col1 VARCHAR(50) NOT CASESPECIFIC
);

SELECT * FROM my_table WHERE col1 = 'test';
```

##### Generated Code:

```sql
--** SSC-FDM-TD0039 - COLLATION HANDLED AT QUERY LEVEL FOR THIS TABLE, ANY NEW QUERY OVER THIS TABLE SHOULD APPLY COLLATION APPROPRIATELY **
CREATE OR REPLACE TABLE my_table
(
    col1 VARCHAR(50)
);

SELECT
    * FROM
    my_table
WHERE
    UPPER(RTRIM( col1)) = UPPER(RTRIM('test'));
```

#### Best Practices

* If you provided all your queries over the table to SnowConvert as part of your transformation then no additional actions are required, this FDM is informational only.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0040

Column-level FORMAT clause is not supported in Snowflake. Conversion functions are used in DML statements as a workaround.

### Description

In Teradata, the `FORMAT` clause on a column definition tells the system how to display and parse datetime values. For example, a column defined as `DATE FORMAT 'MM-DD-YYYY'` expects date strings like `'03-30-2026'`.

Snowflake does not have an equivalent `FORMAT` clause on column definitions. To preserve the original behavior, SnowConvert AI:

1. **Comments out** the `FORMAT` clause in the `CREATE TABLE` output.
2. **Adds explicit conversion functions** (`TO_DATE`, `TO_TIMESTAMP`, or `TO_TIME`) around string literals in DML statements that reference the formatted column, using the Snowflake-equivalent format string.

This ensures that DML statements continue to parse string literals the same way Teradata did.

> **Note:**
>
> When the FORMAT matches Snowflake’s default output format for the column type (`'YYYY-MM-DD'` for `DATE`, `'HH:MI:SS'` for `TIME`, `'YYYY-MM-DDBHH:MI:SS'` for `TIMESTAMP`), the FORMAT clause is **silently removed** from the DDL without this FDM, and no conversion functions are added to DML statements. These formats are natively handled by Snowflake. This FDM only appears for non-standard formats that require explicit conversion.

> **Important:**
>
> For this transformation to work, the `CREATE TABLE` statement that defines the `FORMAT` clause **must be included** in the conversion input. SnowConvert AI reads the FORMAT value and column type from the DDL and uses that information when converting DML statements. If the DDL is not included, the tool has no way to know which format applies and the conversion functions will not be added.

#### Conversion function mapping

| Column Type | Conversion Function |
| --- | --- |
| `DATE` | `TO_DATE` |
| `TIMESTAMP`, `TIMESTAMP WITH TIME ZONE` | `TO_TIMESTAMP` |
| `TIME`, `TIME WITH TIME ZONE` | `TO_TIME` |

#### Example Code

##### Input code:

```sql
CREATE TABLE employee (
  id INTEGER,
  hire_date DATE FORMAT 'MM-DD-YYYY'
);

SELECT * FROM employee WHERE hire_date = '03-30-2026';
```

##### Generated Code:

```sql
CREATE OR REPLACE TABLE employee (
  id INTEGER,
  hire_date DATE
--                 --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'MM-DD-YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                 FORMAT 'MM-DD-YYYY'
)
;

SELECT
  *
FROM
  employee
WHERE
  hire_date = TO_DATE('03-30-2026', 'MM-DD-YYYY');
```

##### Example with BETWEEN:

```sql
CREATE TABLE event_range (
  id INTEGER,
  event_date DATE FORMAT 'MM-DD-YYYY'
);

SELECT * FROM event_range WHERE event_date BETWEEN '01-01-2026' AND '12-31-2026';
```

```sql
CREATE OR REPLACE TABLE event_range (
  id INTEGER,
  event_date DATE
--                  --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'MM-DD-YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                  FORMAT 'MM-DD-YYYY'
)
;

SELECT
  *
FROM
  event_range
WHERE
  event_date BETWEEN TO_DATE('01-01-2026', 'MM-DD-YYYY') AND TO_DATE('12-31-2026', 'MM-DD-YYYY');
```

##### Example with INSERT VALUES:

```sql
CREATE TABLE target_events (
  event_date DATE FORMAT 'DD/MM/YYYY'
);

INSERT INTO target_events (event_date) VALUES ('30/03/2026');
```

```sql
CREATE OR REPLACE TABLE target_events (
  event_date DATE
--                  --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'DD/MM/YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                  FORMAT 'DD/MM/YYYY'
)
;

INSERT INTO target_events (event_date)
VALUES (TO_DATE('30/03/2026', 'DD/MM/YYYY'));
```

##### Example with MERGE:

```sql
CREATE TABLE hr_target (
  id INTEGER,
  hire_date DATE FORMAT 'MM/DD/YYYY'
);

CREATE TABLE hr_source (
  id INTEGER
);

MERGE INTO hr_target AS t
USING hr_source AS s
ON t.id = s.id
WHEN MATCHED THEN UPDATE SET hire_date = '03/30/2026'
WHEN NOT MATCHED THEN INSERT (id, hire_date) VALUES (s.id, '03/30/2026');
```

```sql
CREATE OR REPLACE TABLE hr_target (
  id INTEGER,
  hire_date DATE
--                 --** SSC-FDM-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'MM/DD/YYYY' IS NOT SUPPORTED IN SNOWFLAKE. CONVERSION FUNCTIONS ARE USED IN DML STATEMENTS AS A WORKAROUND. **
--                 FORMAT 'MM/DD/YYYY'
)
;

CREATE OR REPLACE TABLE hr_source (
  id INTEGER
)
;

MERGE INTO hr_target AS t USING hr_source AS s ON t.id = s.id
WHEN MATCHED THEN
  UPDATE SET
    hire_date = TO_DATE('03/30/2026', 'MM/DD/YYYY')
WHEN NOT MATCHED THEN
  INSERT(id, hire_date)
  VALUES (s.id, TO_DATE('03/30/2026', 'MM/DD/YYYY'));
```

#### Best Practices

* Always include the `CREATE TABLE` statements that define `FORMAT` clauses in the conversion input. Without them, SnowConvert AI cannot determine the correct format for DML conversion.
* After conversion, verify that the converted code behaves correctly when these formats are present. In particular, check that the format string in the generated `TO_DATE` / `TO_TIMESTAMP` / `TO_TIME` calls matches the original Teradata FORMAT.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0041

Column-level display-only FORMAT clause is not supported in Snowflake. No action needed.

### Description

Teradata supports a display-only `FORMAT 'X(n)'` clause on character-type columns (`VARCHAR`, `CHAR`, `CLOB`, `STRING`). This format controls only the display width of the column and has no effect on data storage or query behavior. Snowflake does not support this clause, so SnowConvert AI comments it out.

Because the `X(n)` format is purely cosmetic, **no conversion functions are added to DML statements** and no manual intervention is required. This FDM is informational only.

#### Example Code

##### Input code:

```sql
CREATE TABLE customer (
  name VARCHAR(100) FORMAT 'X(50)',
  id INTEGER
);

SELECT * FROM customer WHERE name = 'John';
```

##### Generated Code:

```sql
--** SSC-FDM-TD0039 - COLLATION HANDLED AT QUERY LEVEL FOR THIS TABLE, ANY NEW QUERY OVER THIS TABLE SHOULD APPLY COLLATION APPROPRIATELY **
CREATE OR REPLACE TABLE customer (
  name VARCHAR(100)
--                    --** SSC-FDM-TD0041 - COLUMN-LEVEL DISPLAY-ONLY FORMAT CLAUSE 'X(50)' IS NOT SUPPORTED IN SNOWFLAKE. NO ACTION NEEDED. **
--                    FORMAT 'X(50)'
                                   ,
  id INTEGER
)
;

SELECT
  *
FROM
  customer
WHERE
  UPPER(RTRIM(name)) = UPPER(RTRIM('John'));
```

> **Note:**
>
> The `UPPER(RTRIM(...))` wrapping on the WHERE clause is due to the collation handling for `NOT CASESPECIFIC` columns (SSC-FDM-TD0039), not the FORMAT clause.

#### Best Practices

* No action is required for this FDM. The `X(n)` display format has no functional impact in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0042

SIGNAL condition information items other than MESSAGE_TEXT are not supported in Snowflake. RAISE is used as a workaround.

### Description

In Teradata, the `SIGNAL` statement can include a `SET` clause with multiple condition information items such as `MESSAGE_TEXT`, `CLASS_ORIGIN`, `SUBCLASS_ORIGIN`, `RETURNED_SQLSTATE`, and others. These items provide additional context when raising an error condition.

Snowflake’s `RAISE` statement only supports a single message through the `EXCEPTION` declaration. SnowConvert AI preserves the `MESSAGE_TEXT` value and uses it to declare a Snowflake exception, but any other condition information items (e.g., `CLASS_ORIGIN`, `SUBCLASS_ORIGIN`) are dropped because Snowflake has no equivalent mechanism.

This FDM is attached to the generated `RAISE` statement whenever unsupported condition information items are present in the original `SIGNAL` statement.

#### Example Code

##### Input code:

```sql
REPLACE PROCEDURE SignalSqlstateExtra(testValue INTEGER)
BEGIN
  IF (testValue > 5) THEN
    SIGNAL SQLSTATE VALUE '75001' SET MESSAGE_TEXT = 'Balance is too low', CLASS_ORIGIN = 'SP';
  ELSE
    INSERT INTO exampleTable VALUES ('testValue', testValue);
  END IF;
END;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE SignalSqlstateExtra (TESTVALUE INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SIGNAL_EXCEPTION_75001 EXCEPTION (-20001, 'Balance is too low');
  BEGIN
    IF (:testValue > 5) THEN
      --** SSC-FDM-TD0042 - SIGNAL CONDITION INFORMATION ITEMS OTHER THAN MESSAGE_TEXT ARE NOT SUPPORTED IN SNOWFLAKE. RAISE IS USED AS A WORKAROUND. **
      RAISE SIGNAL_EXCEPTION_75001;
    ELSE
      INSERT INTO exampleTable
      VALUES ('testValue', :testValue);
    END IF;
  END;
$$;
```

> **Note:**
>
> When all condition information items in the `SET` clause are supported (i.e., only `MESSAGE_TEXT` is present), the `SIGNAL` is converted to `RAISE` without this FDM.

#### Best Practices

* Review each occurrence of this FDM to determine if the dropped condition information items (`CLASS_ORIGIN`, `SUBCLASS_ORIGIN`, etc.) are critical for your error-handling logic. If so, consider adding custom logging to capture that information.
* The `MESSAGE_TEXT` value is always preserved in the Snowflake `EXCEPTION` declaration, so the primary error message remains intact.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0044

PREPARE with USING variables bound at EXECUTE IMMEDIATE time instead of OPEN CURSOR time.

### Description

In Teradata, the `PREPARE ... FROM query` statement stages a SQL query for execution, and the `OPEN cursor USING var1, var2` statement binds variable values **at OPEN time**, allowing the cursor to use the current values of those variables when the cursor is opened.

In Snowflake, SnowConvert AI transforms `PREPARE S1 FROM query` into `EXECUTE IMMEDIATE query USING (var1, var2)`, which binds the variable values **at EXECUTE IMMEDIATE time** (when the PREPARE is converted). The cursor is then fixed at the `LET CURSOR FOR RESULTSET` declaration. This means that:

* Variable values are captured earlier in the execution flow (at PREPARE/EXECUTE IMMEDIATE time, not OPEN time)
* Reassigning the resultset variable or re-executing PREPARE in a loop will **not** update the cursor

This functional difference marker indicates that the binding timing has changed. Review your code to ensure that variables contain the correct values at PREPARE time (EXECUTE IMMEDIATE time in Snowflake).

#### Example Code

##### Input Code:

```sql
REPLACE PROCEDURE fetch_simple_cursor_placeholder(OUT procedure_result INTEGER)
BEGIN
    DECLARE SQL_string_sel VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTemporaryTable WHERE col1 = ?';
    DECLARE column_value INTEGER DEFAULT 0;
    DECLARE intermediate_result INTEGER DEFAULT 0;

    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string_sel;
    SET column_value = 1;
    OPEN C1 USING column_value;
    FETCH C1 INTO intermediate_result;
    CLOSE C1;
END;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE fetch_simple_cursor_placeholder (PROCEDURE_RESULT OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQL_string_sel VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTemporaryTable
WHERE col1 = ?';
    column_value INTEGER DEFAULT 0;
    intermediate_result INTEGER DEFAULT 0;
    S1 RESULTSET;
    prepareQuery_aux_sql VARCHAR;
  BEGIN
    prepareQuery_aux_sql := SQL_string_sel;
    --** SSC-FDM-TD0044 - USING VARIABLES BOUND AT EXECUTE IMMEDIATE TIME INSTEAD OF OPEN CURSOR TIME. CURSOR IS FIXED AT LET CURSOR FOR RESULTSET DECLARATION; REASSIGNING THE RESULTSET VARIABLE OR RE-EXECUTING PREPARE IN A LOOP WILL NOT UPDATE THE CURSOR. **
    S1 := (
      EXECUTE IMMEDIATE prepareQuery_aux_sql USING (column_value)
    );
    LET CURSOR_S1_INSTANCE_V0 CURSOR
    FOR
      S1;
    column_value := 1;
    OPEN CURSOR_S1_INSTANCE_V0;
    FETCH
      CURSOR_S1_INSTANCE_V0
    INTO
      intermediate_result;
    CLOSE CURSOR_S1_INSTANCE_V0;
  END;
$$;
```

**Note:** In the generated code, `column_value` is bound when `EXECUTE IMMEDIATE` runs (where it still equals 0), not when `OPEN CURSOR_S1_INSTANCE_V0` executes (after it’s set to 1). In Teradata, the binding happens at OPEN time, so the cursor would use the value 1.

#### Best Practices

* **Review variable assignment order**: Ensure variables used in USING clauses have the correct values **before** the PREPARE statement is executed (which becomes EXECUTE IMMEDIATE in Snowflake).
* **Move assignments earlier**: If variables are assigned after PREPARE but before OPEN in Teradata, move those assignments to **before** the PREPARE statement.
* **Test cursor behavior**: Verify that cursors return the expected result sets, especially in loops or when variable values change between PREPARE and OPEN.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-FDM-TD0043

Dynamic MESSAGE_TEXT in SIGNAL is not supported by Snowflake exceptions. CUSTOM_SQLERRM is used as a workaround.

### Description

In Teradata, `SIGNAL ... SET MESSAGE_TEXT` can accept a variable or expression as the error message, allowing the message to be built dynamically at runtime. For example:

```sql
SET errorText = 'The given value ' || testValue || ' is greater than 5';
SIGNAL SQLSTATE VALUE '75001' SET MESSAGE_TEXT = errorText;
```

In Snowflake Scripting, the `EXCEPTION` declaration requires a compile-time literal for the message. There is no way to dynamically set the exception message at raise time using the `RAISE` statement.

As a workaround, SnowConvert AI:

1. Declares the exception with a static fallback message (e.g., `'Condition 75001 signaled'`).
2. Assigns the dynamic value to a `CUSTOM_SQLERRM` variable before the `RAISE`.

The `CUSTOM_SQLERRM` variable holds the intended dynamic message, but when the exception propagates, Snowflake reports the static message from the `EXCEPTION` declaration — not the dynamic one. Exception handlers that need the dynamic message must read `CUSTOM_SQLERRM` explicitly.

#### Example Code

##### Input code:

```sql
REPLACE PROCEDURE SignalSqlstateDynamic(testValue INTEGER)
BEGIN
  IF (testValue > 5) THEN
    SET errorText = 'The given value ' || testValue || ' is greater than 5';
    SIGNAL SQLSTATE VALUE '75001' SET MESSAGE_TEXT = errorText;
  ELSE
    INSERT INTO exampleTable VALUES ('testValue', testValue);
  END IF;
END;
```

##### Generated Code:

```sql
CREATE OR REPLACE PROCEDURE SignalSqlstateDynamic (TESTVALUE INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    CUSTOM_SQLERRM VARCHAR;
    SIGNAL_EXCEPTION_75001 EXCEPTION (-20001, 'Condition 75001 signaled');
  BEGIN
    IF (:testValue > 5) THEN
      errorText := 'The given value ' || testValue || ' is greater than 5';
      CUSTOM_SQLERRM := errorText;
      --** SSC-FDM-TD0043 - DYNAMIC MESSAGE_TEXT IN SIGNAL IS NOT SUPPORTED BY SNOWFLAKE EXCEPTIONS. CUSTOM_SQLERRM IS USED AS A WORKAROUND. **
      RAISE SIGNAL_EXCEPTION_75001;
    ELSE
      INSERT INTO exampleTable
      VALUES ('testValue', :testValue);
    END IF;
  END;
$$;
```

#### Best Practices

* In exception handlers, read `CUSTOM_SQLERRM` to retrieve the dynamic error message instead of relying on the exception’s static message.
* If the dynamic message is only used for logging purposes, consider moving the logging statement before the `RAISE` so it captures the dynamic value directly.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Teradata Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/teradataEWI.md
section: Migrations
---

# SnowConvert AI - Teradata Issues

## SSC-EWI-TD0001

Recursive forward alias error.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This EWI is shown whenever SnowConvert AI detects recursion within aliased expressions, therefore being unable to execute the Forward Alias transformation required for the correct functionality of aliases within Snowflake environment.

A recursive alias happens when an aliased expression contains another alias, and the second aliased expression contains the first alias. This may not be as trivial as the example shows, since the recursion can happen further down the line in a *transitive* way.

#### Example Code

**Note:** Recursive aliases are not supported in Snowflake, however, some simple instances are.

> **Note:**
>
> Note that recursive alias is not supported in Snowflake, however, some simple instances are. Check the examples below.

The following example code works in Snowflake after migration:

##### Teradata:

```sql
 SELECT
    COL1 AS COL2,
    COL2 AS COL1
FROM
    TABLE_EXAMPLE;
```

##### Snowflake Scripting:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
    COL1 AS COL2,
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0001 - 'COL1' HAS RECURSIVE REFERENCES. FORWARD ALIAS CONVERSION COULD NOT BE COMPLETED ***/!!!
    COL2 AS COL1
FROM
    TABLE_EXAMPLE;
```

However, the following example code does not work:

##### Teradata:

```sql
 SELECT
    A + B as C,
    COL2 + C AS A,
    COL3 AS B
FROM
    TABLE_EXAMPLE;
```

##### Snowflake Scripting:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0001 - 'A' HAS RECURSIVE REFERENCES. FORWARD ALIAS CONVERSION COULD NOT BE COMPLETED ***/!!!
    COL2 + C AS A,
    COL3 AS B,
    A + B as C
FROM
    TABLE_EXAMPLE;
```

#### Best Practices

* Review your code and make sure recursive forward aliases are not present. The EWI shows the name of the first instance of an alias that has recursive references, but that does not mean that is the only one that has them in your code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0002

Interval type not supported.

This EWI is deprecated since SnowConvert AI 28.1.100 release

### Severity

High

#### Description

When the selector of a column in a SQL statement is type INTERVAL, the EWI will be added and a Stub function will be created too. This is a type that is not supported in Snowflake and therefore implies pending work after SnowConvert AI finishes.

#### Example Code

##### Teradata:

```sql
 SELECT
     CAST('07:00' AS INTERVAL HOUR(2) TO MINUTE),
     CAST('08:00' AS INTERVAL HOUR(2) TO MINUTE) As Test_Interval;
```

##### Snowflake Scripting:

```sql
 SELECT
     !!!RESOLVE EWI!!! /*** SSC-EWI-TD0002 - INTERVAL TYPE NOT SUPPORTED IN SNOWFLAKE ***/!!!
     INTERVAL '07 hour, 00 min',
     !!!RESOLVE EWI!!! /*** SSC-EWI-TD0002 - INTERVAL TYPE NOT SUPPORTED IN SNOWFLAKE ***/!!!
     INTERVAL '08 hour, 00 min' As Test_Interval;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0003

Collation not supported in trim functions, add original collation to function result to preserve it.

### Severity

Low

#### Description

In Snowflake, trim functions (`LTRIM, RTRIM,` or `TRIM`) do not support collation unless the characters to trim are empty or white space characters.

If SnowConvert AI detects a `LTRIM, RTRIM` or `TRIM LEADING, TRAILING,` or both function with the scenario mentioned above, the `COLLATE` function will be automatically generated to create a copy without collation of the input column. This EWI is generated to point out that the column collation was removed before the trim function, meaning the result of the function will not have collation, and that this may change the results of further comparisons using the result.

#### Example Code

##### Teradata:

```sql
 CREATE TABLE collateTable (
	col1 VARCHAR(50) CHARACTER SET LATIN NOT CASESPECIFIC
);

SELECT
    TRIM(BOTH '0' FROM col1),
    TRIM(LEADING '  ' FROM col1),
    TRIM(TRAILING '0' FROM col1),
    LTRIM(col1, '0'),
    RTRIM(col1)
FROM
    collateTable;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE collateTable (
	col1 VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
;

SELECT
	TRIM(COLLATE(col1, ''), '0') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0003 - COLLATION NOT SUPPORTED IN TRIM FUNCTIONS, ADD ORIGINAL COLLATION TO FUNCTION RESULT TO PRESERVE IT ***/!!!,
	LTRIM(col1, '  '),
	RTRIM(COLLATE(col1, ''), '0') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0003 - COLLATION NOT SUPPORTED IN TRIM FUNCTIONS, ADD ORIGINAL COLLATION TO FUNCTION RESULT TO PRESERVE IT ***/!!!,
	LTRIM(COLLATE(col1, ''), '0') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0003 - COLLATION NOT SUPPORTED IN TRIM FUNCTIONS, ADD ORIGINAL COLLATION TO FUNCTION RESULT TO PRESERVE IT ***/!!!,
	RTRIM(col1)
	FROM
	collateTable;
```

#### Best Practices

* To avoid functional differences during comparisons, please add the original collation of the column to the `TRIM` function result string, this can be achieved using the `COLLATE` function and specifying the original column collation as the second argument, this argument has to be a literal string with the collation value.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0004

Not supported SQL Exception on continue handler.

### Severity

Low

#### Description

In Snowflake procedures there is no equivalent transformation for Teradata Continue Handler. For some supported Exception codes, SnowConvert AI does some treatment to emulate this behavior. This EWI is added to Continue Handler statements having an exception code that is not supported.

#### Example Code

##### Teradata:

```sql
 REPLACE PROCEDURE PURGING_ADD_TABLE
(
 IN inDatabaseName     	VARCHAR(30),
 IN inTableName    		VARCHAR(30)
)
BEGIN
 DECLARE vCHAR_SQLSTATE CHAR(5);
 DECLARE vSUCCESS       CHAR(5);

  DECLARE CONTINUE HANDLER FOR SQLSTATE 'UNSUPPORTED'
  BEGIN
     SET vCHAR_SQLSTATE = SQLCODE;
     SET vSUCCESS    = SQLCODE;
  END;

  SELECT 1;

END;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE PROCEDURE PURGING_ADD_TABLE
(INDATABASENAME VARCHAR(30), INTABLENAME VARCHAR(30)
)
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/04/2024" }}'
EXECUTE AS CALLER
AS
$$
 DECLARE
  vCHAR_SQLSTATE CHAR(5);
  vSUCCESS       CHAR(5);
 BEGIN

  !!!RESOLVE EWI!!! /*** SSC-EWI-TD0004 - NOT SUPPORTED SQL EXCEPTION ON CONTINUE HANDLER ***/!!!

  DECLARE CONTINUE HANDLER FOR SQLSTATE 'UNSUPPORTED'
  BEGIN
   vCHAR_SQLSTATE := SQLCODE;
   vSUCCESS := SQLCODE;
  END;
  SELECT
   1;
 END;
$$;
```

#### Best Practices

* Check the possible statements that can throw the exception code and encapsulate them in a similar code block as seen in [Continue Handler Translation Reference](../../../../translation-references/teradata/teradata-to-snowflake-scripting-translation-reference.md).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0005

The statement was converted but its functionality is not implemented yet.

### Severity

Critical

#### Description

The statement was recognized and it was converted but the converted code will not have the expected functionality because the implementation is not done yet.

The warning is added for the user to be aware that when the script uses this statement the script will not have the expected functionally equivalent.

#### Example source

##### BTEQ Input code:

```sql
 .SET SIDETITLES ON
```

##### Python Output code:

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  #** SSC-EWI-TD0005 - THE STATEMENT WAS CONVERTED BUT ITS FUNCTIONALITY IS NOT IMPLEMENTED YET **
  Export.side_titles(True)
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

#### Best Practices

* For more information please refer to [translation spec of BTEQ to Python](../../../../translation-references/teradata/scripts-to-snowflake-sql-translation-reference/bteq.md).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0006

Invalid default value.

### Severity

Low

#### Description

The **DEFAULT TIME** / **DEFAULT DATE** / **DEFAULT CURREN_DATE** */* **DEFAULT DEFAULT CURRENT_TIME** */* **DEFAULT CURRENT_TIMESTAMP** column specifications are not supported for the **FLOAT** data type.

#### Example Code

##### Teradata:

```
CREATE TABLE T_2004
(
    -- In the output code all of these columns will be FLOAT type
    -- and will include the SSC-EWI-TD0006 message.
    COL1 FLOAT DEFAULT TIME,
    COL2 FLOAT DEFAULT DATE,
    COL3 FLOAT DEFAULT CURRENT_DATE,
    COL4 FLOAT DEFAULT CURRENT_TIME,
    COL5 FLOAT DEFAULT CURRENT_TIMESTAMP
);
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE T_2004
(
    -- In the output code all of these columns will be FLOAT type
    -- and will include the SSC-EWI-TD0006 message.
    COL1 FLOAT DEFAULT TIME !!!RESOLVE EWI!!! /*** SSC-EWI-TD0006 - DEFAULT CURRENT_TIME NOT VALID FOR DATA TYPE ***/!!!,
    COL2 FLOAT DEFAULT DATE !!!RESOLVE EWI!!! /*** SSC-EWI-TD0006 - DEFAULT CURRENT_DATE NOT VALID FOR DATA TYPE ***/!!!,
    COL3 FLOAT DEFAULT CURRENT_DATE !!!RESOLVE EWI!!! /*** SSC-EWI-TD0006 - DEFAULT CURRENT_DATE NOT VALID FOR DATA TYPE ***/!!!,
    COL4 FLOAT DEFAULT CURRENT_TIME !!!RESOLVE EWI!!! /*** SSC-EWI-TD0006 - DEFAULT CURRENT_TIME NOT VALID FOR DATA TYPE ***/!!!,
    COL5 FLOAT DEFAULT CURRENT_TIMESTAMP !!!RESOLVE EWI!!! /*** SSC-EWI-TD0006 - DEFAULT CURRENT_TIMESTAMP NOT VALID FOR DATA TYPE ***/!!!
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0007

GROUP BY clause unsupported in Teradata Mode for string comparison

### Severity

Low

#### Description

This error message indicates a possible issue when migrating Teradata SQL queries to Snowflake, particularly related to differences in how the GROUP BY clause handles string comparison sensitivity in Teradata mode.

In Teradata mode, string comparisons in GROUP BY clauses are case-insensitive by default (NOT CASESPECIFIC), whereas Snowflake is case-sensitive unless columns are explicitly defined with a case-insensitive COLLATE clause. This difference can cause queries that rely on case-insensitive grouping in Teradata to produce different results in Snowflake.

#### Example Code

##### Teradata:

```sql
CREATE TABLE employees (
    employee_id INTEGER,
    first_name VARCHAR(50) NOT CASESPECIFIC,
    department VARCHAR(50)
);

INSERT INTO employees VALUES (1, 'John', 'Sales');
INSERT INTO employees VALUES (2, 'JOHN', 'sales');
INSERT INTO employees VALUES (3, 'john', 'SALES');

SELECT first_name, COUNT(*)
FROM employees
GROUP BY first_name;
```

##### Snowflake Scripting:

```sql
CREATE OR REPLACE TABLE employees (
    employee_id INTEGER,
    first_name VARCHAR(50),
    department VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "10/20/2025",  "domain": "no-domain-provided",  "migrationid": "kwOaAavBVnCx8OhdxEITfg==" }}'
;

INSERT INTO employees
VALUES (1, 'John', 'Sales');

INSERT INTO employees
VALUES (2, 'JOHN', 'sales');

INSERT INTO employees
VALUES (3, 'john', 'SALES');

SELECT
    first_name,
    COUNT(*)
FROM
    employees
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0007 - GROUP BY IS NOT EQUIVALENT IN TERADATA MODE ***/!!!
GROUP BY first_name;
```

#### Expected Behavior Differences

| Platform | Grouping Behavior | Example Result Rows |
| --- | --- | --- |
| **Teradata Mode** | Groups ‘John’, ‘JOHN’, and ‘john’ together | `John` (or `JOHN`/`john`), 3 |
| **Snowflake** | Treats ‘John’, ‘JOHN’, and ‘john’ as separate | `John`, 1 `JOHN`, 1 `john`, 1 |

#### Best Practices

* **Review GROUP BY clauses** involving string columns when migrating from Teradata mode to ensure expected grouping behavior.

**Note:** When using expressions like `RTRIM(UPPER(first_name))` or `RTRIM(first_name)` in the `GROUP BY` clause to achieve case-insensitive or trimmed grouping, you must apply the same expression consistently in all parts of the query where the column is referenced. For example:

```sql
SELECT RTRIM(UPPER(first_name))
FROM employees
WHERE RTRIM(UPPER(first_name)) = 'JOHN'
GROUP BY RTRIM(UPPER(first_name));
```

This ensures that filtering, selection, and grouping all use the same logic, avoiding mismatches or unexpected results.

* **Define columns with COLLATE** during table creation if consistent case-insensitive behavior is required:

  ```sql
  CREATE TABLE employees (
      first_name VARCHAR(50) COLLATE 'en-cs'
  );
  ```
* **Enable the –UseCollateForCaseSpecification CLI flag or Conversion Setting to use COLLATE for case specification** during conversion. This option ensures that case specification (such as CASESPECIFIC or NOT CASESPECIFIC) is handled using COLLATE functions instead of UPPER functions. For details, refer to the [CLI documentation](../../../user-guide/snowconvert/command-line-interface/teradata.md) or [conversion settings](../../../getting-started/running-snowconvert/conversion/teradata-conversion-settings.md).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0008

Function for comparing strings is not supported

### Severity

Low

#### Description

Currently, there is no equivalence for some string-comparing functions in Snowflake.

This EWI is added whenever the comparison type is *jaro*, *n_gram*, *LD*, *LDWS*, *OSA*, *DL*, *hamming*, *LCS*, *jaccard*, *cosine* and *soundexcode*.

#### Example Code

##### Teradata:

```sql
 SELECT * FROM StringSimilarity (
  ON (
    SELECT CAST(a AS VARCHAR(200)) AS a, CAST(b AS VARCHAR(200)) AS b
    FROM table_1
  ) PARTITION BY ANY
  USING
  ComparisonColumnPairs ('ld(a,b) AS sim_fn')
) AS dt ORDER BY 1;
```

##### Snowflake Scripting:

```sql
 SELECT
  * FROM
  !!!RESOLVE EWI!!! /*** SSC-EWI-TD0008 - FUNCTION FOR COMPARING STRINGS IS NOT SUPPORTED ***/!!! StringSimilarity (
   ON (
     SELECT CAST(a AS VARCHAR(200)) AS a, CAST(b AS VARCHAR(200)) AS b
     FROM table_1
   ) PARTITION BY ANY
   USING
   ComparisonColumnPairs ('ld(a,b) AS sim_fn')
 ) AS dt ORDER BY 1;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0009

TEMPORAL column not supported.

### Severity

Low

#### Description

Teradata provides temporal table support at the column level using derived period columns. These columns are not supported in Snowflake.

#### Example Code

##### Teradata:

```sql
 CREATE MULTISET TABLE Policy(
      Policy_ID INTEGER,
      Customer_ID INTEGER,
      Policy_Type CHAR(2) NOT NULL,
      Policy_Details CHAR(40),
      Policy_Start DATE NOT NULL,
      Policy_End DATE NOT NULL,
      PERIOD FOR Validity(Policy_Start,Policy_End) AS VALIDTIME
      )
   PRIMARY INDEX(Policy_ID);
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE Policy (
   Policy_ID INTEGER,
   Customer_ID INTEGER,
   Policy_Type CHAR(2) NOT NULL,
   Policy_Details CHAR(40),
   Policy_Start DATE NOT NULL,
   Policy_End DATE NOT NULL,
   !!!RESOLVE EWI!!! /*** SSC-EWI-TD0009 - TEMPORAL COLUMN NOT SUPPORTED ***/!!!
         PERIOD FOR Validity(Policy_Start,Policy_End) AS VALIDTIME
         )
         COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
         ;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0010

UPPERCASE not supported by Snowflake.

### Severity

Low

#### Description

The UPPERCASE column attribute is not supported in Snowflake.

#### Example Code

##### Teradata:

```sql
 CREATE TABLE T_2010
(
    col1 VARCHAR(1) UPPERCASE
);
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE T_2010 (
    col1 VARCHAR(1)
                    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0010 - UPPERCASE NOT SUPPORTED BY SNOWFLAKE ***/!!!
 UPPERCASE
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* Since the `UPPERCASE` clause indicates that characters typed as ‘aaa’ are stored as ‘AAA’, a possible workaround can be adding to all the insert references the [UPPER](https://docs.snowflake.com/en/sql-reference/functions/upper) function. However, external data loading by ETL processes would also have to be modified.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0012

Binary does not support default.

### Severity

Low

#### Description

This EWI is shown when SnowConvert AI finds a data type BINARY along with a DEFAULT value specification. Since default values are not allowed in BINARY columns, it is removed.

#### Example Code

##### Teradata:

```sql
 CREATE TABLE TableExample
(
ColumnExample BINARY DEFAULT '00000000'XB NOT NULL
)
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE TableExample (
ColumnExample BINARY DEFAULT NOT TO_BINARY('00000000') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0012 - BINARY DOES NOT SUPPORT DEFAULT NOT TO_BINARY('00000000') ***/!!! NULL
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0017

Global temporary table trace functionality not supported.

### Severity

Low

#### Description

This EWI is shown when SnowConvert AI finds a Create Table with the GLOBAL TEMPORARY TRACE option. Review the following Teradata documentation about the [TRACE functionality](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Data-Definition-Language-Syntax-and-Examples/Table-Statements/CREATE-GLOBAL-TEMPORARY-TRACE-TABLE). Since it is not supported in Snowflake, it is removed.

#### Example Code

##### Teradata:

```sql
 CREATE GLOBAL TEMPORARY TRACE TABLE TableExample
(
ColumnExample Number
)
```

##### Snowflake Scripting:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0017 - GLOBAL TEMPORARY TABLE TRACE FUNCTIONALITY NOT SUPPORTED ***/!!!
CREATE OR REPLACE TABLE TableExample (
ColumnExample NUMBER(38, 18)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* Note: It might be possible to replicate some tracing functionality in Snowflake by using an `EVENT TABLE`. Review the following Snowflake documentation about [Loggin and Tracing](https://docs.snowflake.com/en/developer-guide/logging-tracing/logging-tracing-overview).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0020

Regexp_Substr Function only supports POSIX regular expressions.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-EWI-0009](generalEWI.md) documentation

### Severity

Low

#### Description

Currently, there is no support in Snowflake for extended regular expression beyond the POSIX Basic Regular Expression syntax.

This EWI is added every time a function call to *REGEX_SUBSTR, REGEX_REPLACE,* or *REGEX_INSTR* is transformed to Snowflake to warn the user about possible unsupported regular expressions. Some of the features **not supported** are lookahead, lookbehind, and non-capturing groups.

#### Example Code

##### Teradata:

```sql
 SELECT REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

##### Snowflake Scripting:

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-0009 - REGEXP_SUBSTR FUNCTION ONLY SUPPORTS POSIX REGULAR EXPRESSIONS ***/!!!
REGEXP_SUBSTR('qaqequ','q(?=u)', 1, 1);
```

#### Best Practices

* Check the regular expression used in each case to determine whether it needs manual intervention. More information about expanded regex support and alternatives in Snowflake can be found [**here**](https://community.snowflake.com/s/question/0D50Z00007ENLKsSAP/expanded-support-for-regular-expressions-regex)**.**
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0023

ACTIVITY_COUNT inside SELECT/SET INTO VARIABLE requires manual fix

### Severity

Low

### Description

The `ACTIVITY_COUNT` status variable returns the number of rows affected by an SQL DML statement in an embedded SQL or stored procedure application. For more information, see the [Teradata ACTIVITY_COUNT documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Stored-Procedures-and-Embedded-SQL/Result-Code-Variables/ACTIVITY_COUNT).

As explained in its translation specification, there is a workaround to emulate `ACTIVITY_COUNT`’s behavior through:

```sql
 SELECT $1 FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
```

When using `ACTIVITY_COUNT` in a `SELECT/SET INTO VARIABLE` statement, it can not be simply replaced by the workaround mentioned above.

### Example Code

#### Teradata

```
REPLACE PROCEDURE InsertEmployeeSalaryAndLog_4 ()
BEGIN
    DECLARE rowCount INT;
    DECLARE message VARCHAR(100);

    INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
    VALUES (101, 'Alice', 'Smith', 10, 70000.00);

    SELECT ACTIVITY_COUNT INTO rowCount;
    SET message = 'ROWS INSERTED: ' || rowCount;

    -- Insert the ACTIVITY_COUNT into the activity_log table
    INSERT INTO activity_log (operation, row_count)
    VALUES (message, rowCount);
END;
```

#### Snowflake

```sql
 CREATE OR REPLACE PROCEDURE InsertEmployeeSalaryAndLog_4 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/15/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
               rowCount INT;
               message VARCHAR(100);
    BEGIN

               INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
               VALUES (101, 'Alice', 'Smith', 10, 70000.00);
               SELECT
            ACTIVITY_COUNT !!!RESOLVE EWI!!! /*** SSC-EWI-TD0023 - ACTIVITY_COUNT INSIDE SELECT/SET INTO VARIABLE REQUIRES MANUAL FIX ***/!!! INTO
            :rowCount;
            message := 'ROWS INSERTED: ' || rowCount;

            -- Insert the ACTIVITY_COUNT into the activity_log table
            INSERT INTO activity_log (operation, row_count)
            VALUES (:message, :rowCount);
    END;
$$;
```

#### Manual Fix

Part of the workaround presented above can be used to still get the number of rows inserted/updated/deleted like this:

```sql
 CREATE OR REPLACE PROCEDURE InsertEmployeeSalaryAndLog_4 ()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/15/2024" }}'
EXECUTE AS CALLER
AS
$$
    DECLARE
               rowCount INT;
               message VARCHAR(100);
    BEGIN

               INSERT INTO employees (employee_id, first_name, last_name, department_id, salary)
               VALUES (101, 'Alice', 'Smith', 10, 70000.00);
               SELECT $1 INTO :rowCount FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
            message := 'ROWS INSERTED: ' || rowCount;

            -- Insert the ACTIVITY_COUNT into the activity_log table
            INSERT INTO activity_log (operation, row_count)
            VALUES (:message, :rowCount);
    END;
$$;
```

Instead of using the complete query, it needs to be adapted manually to Snowflake’s [SELECT INTO VARIABLE](https://docs.snowflake.com/en/sql-reference/constructs/into) syntax.

Furthermore, if `RESULT_SCAN(LAST_QUERY_ID())` is giving incorrect results, check SSC-FDM-TD0033(../functional-difference/teradataFDM.md#ssc-fdm-td0033) for how to handle possible limitations of using `LAST_QUERY_ID`.

### Best Practices

* Manually adapt the proposed workaround.
* Check SSC-FDM-TD0033(../functional-difference/teradataFDM.md#ssc-fdm-td0033) for how to handle possible limitations of using `LAST_QUERY_ID`.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0024

Abort statement is not supported due to an aggregate function.

### Severity

Low

#### Description

This EWI appears when an `AGGREGATE` function is part of an `ABORT` statement inside of a stored procedure. The statement is commented out.

#### Example Code

##### Teradata:

```sql
 REPLACE PROCEDURE ABORT_SAMPLE()
BEGIN
    ABORT WHERE SUM(TABLE1.COL1) < 2;
END;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE PROCEDURE ABORT_SAMPLE()
RETURNS VARCHAR
LANGUAGE SQL
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
    BEGIN
        !!!RESOLVE EWI!!! /*** SSC-EWI-TD0024 - ABORT STATEMENT IS NOT SUPPORTED DUE TO AN AGGREGATE FUNCTION ***/!!!
        ABORT WHERE SUM(TABLE1.COL1) < 2;
    END;
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0025

Output format not supported.

### Severity

Low

#### Description

This EWI appears when a `CAST` function specifies an output format not supported by Snowflake scripting.

#### Code Example

##### Teradata:

```sql
 CREATE TABLE SAMPLE_TABLE
(
    VARCHAR_TYPE VARCHAR
);

REPLACE VIEW SAMPLE_VIEW
AS
SELECT
CAST(VARCHAR_TYPE AS FLOAT FORMAT 'ZZZ.ZZZZZ'),
CAST('01:02.030405' AS TIME(1) WITH TIME ZONE FORMAT 'MI:SS.S(6)')
FROM SAMPLE_TABLE;
```

##### Snowflake Scripting:

```sql
 CREATE OR REPLACE TABLE SAMPLE_TABLE
(
    VARCHAR_TYPE VARCHAR
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "03/03/2025",  "domain": "test" }}'
;

CREATE OR REPLACE VIEW SAMPLE_VIEW
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "03/03/2025",  "domain": "test" }}'
AS
SELECT
    TO_NUMBER(VARCHAR_TYPE, '999.00000', 38, 10) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0025 - OUTPUT FORMAT 'ZZZ.ZZZZZ' NOT SUPPORTED. ***/!!!,
    TO_TIME('01:02.030405', 'MI:SS.FF6') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0025 - OUTPUT FORMAT 'MI:SS.S(6)' NOT SUPPORTED. ***/!!!
    FROM
    SAMPLE_TABLE;
```

#### Best Practices

* Check if the output code has functional equivalence with the original code.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0027

Snowflake does not support Teradata built-in time dimensions column options

### Severity

Low

#### Description

The EWI is generated because Snowflake does not support the Teradata built-in time dimensions attributes like VALIDTIME or TRANSACTIONTIME.

#### Example Code

##### Teradata input:

```sql
 CREATE MULTISET TABLE SAMPLE_TABLE
(
    COL1 PERIOD(TIMESTAMP(6) WITH TIME ZONE) NOT NULL AS TRANSACTIONTIME
);
```

##### Snowflake output:

```sql
 CREATE OR REPLACE TABLE SAMPLE_TABLE (
       COL1 VARCHAR(68) NOT NULL !!!RESOLVE EWI!!! /*** SSC-EWI-TD0027 - SNOWFLAKE DOES NOT SUPPORT 'TRANSACTIONTIME' COLUMN OPTION ***/!!! /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/
   )
   COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* Manually create TIMESTAMP columns with default values such as CURRENT_TIMESTAMP.
* Leverage the use of table streams, they can record data manipulation changes made to tables as well as metadata about each change. ([Guide](https://docs.snowflake.com/en/user-guide/streams))
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0029

Queue table functionality is not supported.

### Severity

Low

#### Description

This warning appears when a `TABLE` with the [QUEUE](https://www.docs.teradata.com/r/rgAb27O_xRmMVc_aQq2VGw/tHvboDYXkHchWgJ2CD6Uig) attribute is migrated. The `QUEUE` keyword is removed because it is not supported in snowflake.

#### Example Code

##### Input:

```sql
 CREATE MULTISET TABLE SAMPLE_TABLE,
QUEUE,
NO FALLBACK
(
    COL1 INTEGER
);
```

##### Output:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0029 - QUEUE TABLE FUNCTIONALITY NOT SUPPORTED ***/!!!
CREATE OR REPLACE TABLE SAMPLE_TABLE
(
    COL1 INTEGER
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0031

The result may differ due to char type having a fixed length in Teradata

### Severity

Low

#### Description

Since Teradata CHAR data type has a fixed length, some functions such as LIKE will try to match the complete column instead of the word inserted into the column, resulting in false matches. However, Snowflake the CHAR type is of variable size, meaning that the LIKE functions will always try to match against the inserted values. Take the following code as an example:

#### Example Code

##### Input:

```sql
 CREATE TABLE table1
(
    col1 VARCHAR(36),
    col2 CHAR(36)
);

INSERT INTO table1 VALUES ('Gabriel', 'Gabriel');
INSERT INTO table1 VALUES ('Barnum', 'Barnum');
INSERT INTO table1 VALUES ('Sergio', 'Sergio');

SELECT col1 FROM table1 where col1 LIKE 'Barnum';
-- The result is a single row with 'Barnum'
SELECT col2 FROM table1 where col2 LIKE 'Barnum';
-- It does not return any row
```

##### Output:

```sql
 CREATE OR REPLACE TABLE table1
(
    col1 VARCHAR(36),
    col2 CHAR(36)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO table1
VALUES ('Gabriel', 'Gabriel');

INSERT INTO table1
VALUES ('Barnum', 'Barnum');

INSERT INTO table1
VALUES ('Sergio', 'Sergio');

SELECT
    col1 FROM
    table1
where col1 ILIKE 'Barnum';
-- The result is a single row with 'Barnum'
    SELECT
    col2 FROM
    table1
    where
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0031 - THE RESULT OF LIKE MAY DIFFER DUE TO CHAR TYPE HAVING A FIXED LENGTH IN TERADATA ***/!!! col2 ILIKE 'Barnum';
    -- It does not return any row
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0034

Multistatement SQL is not supported.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

Multistatement SQL execution is not supported. The request was handled as a transaction.

> **Note:**
>
> The following EWI is only generated when the PL Target Language flag is set to Javascript, like this: ‘–PLTargetLanguage Javascript’

#### Example Code

##### Input:

```
-- Additional Params: --PLTargetLanguage Javascript
REPLACE PROCEDURE proc1()
  BEGIN
    BEGIN REQUEST;
      SELECT* FROM TABLE1;
    END REQUEST;
END;
```

##### Output:

```sql
 CREATE OR REPLACE PROCEDURE proc1 ()
RETURNS STRING
LANGUAGE JAVASCRIPT
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
EXECUTE AS CALLER
AS
$$
  // SnowConvert AI Helpers Code section is omitted.

  var TRANSACTION_HANDLER = function (error) {
    throw error;
  };
  // ** SSC-EWI-TD0034 - MULTISTATEMENT SQL EXECUTION NOT SUPPORTED, REQUEST HANDLED AS TRANSACTION **
  try {
    EXEC(`BEGIN`);
    EXEC(`SELECT
   *
FROM
   TABLE1`,[],undefined,TRANSACTION_HANDLER);
    EXEC(`COMMIT`);
  } catch(error) {
    EXEC(`ROLLBACK`);
  }
$$;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0039

Input format not supported.

### Severity

Medium

#### Description

The specified input format is not supported in Snowflake.

#### Example Code

##### Input:

```sql
 SELECT
    CAST('02/032/25' AS DATE FORMAT 'MM/DDD/YY'),
    CAST('02/032/25' AS DATE FORMAT 'MM/D3/YY'),
    CAST('03-Thursday-2025' AS DATE FORMAT 'DD-EEEE-YYYY'),
    CAST('03-Thursday-2025' AS DATE FORMAT 'DD-E4-YYYY');
```

##### Output:

```sql
 SELECT
    TO_DATE('02/032/25', 'MM/DDD/YY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'MM/DDD/YY' NOT SUPPORTED ***/!!!,
    TO_DATE('02/032/25', 'MM/D3/YY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'MM/D3/YY' NOT SUPPORTED ***/!!!,
    TO_DATE('03-Thursday-2025', 'DD-EEEE-YYYY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'DD-EEEE-YYYY' NOT SUPPORTED ***/!!!,
    TO_DATE('03-Thursday-2025', 'DD-E4-YYYY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'DD-E4-YYYY' NOT SUPPORTED ***/!!!;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0040

The FORMAT clause on a column definition cannot be automatically converted to Snowflake.

### Severity

Low

#### Description

SnowConvert AI found a `FORMAT` clause on a column definition that it cannot translate to Snowflake. The FORMAT clause is preserved in the output and marked with this EWI so you can review it manually.

This issue is raised in two situations:

* **Datetime columns with unsupported format elements**: The format string contains elements that have no Snowflake equivalent (e.g., `'EEEE'` for day-of-week names). Because the format cannot be translated, no conversion functions are added to DML statements that reference this column.
* **Columns where the type could not be determined**: If SnowConvert AI cannot resolve the column type, it falls back to this EWI as a safety measure.

When the FORMAT can be fully translated, SnowConvert AI uses [SSC-FDM-TD0040](../functional-difference/teradataFDM.md) instead and adds conversion functions automatically. For character-type display-only formats like `X(n)`, see [SSC-FDM-TD0041](../functional-difference/teradataFDM.md).

#### Example Code

##### Input:

```sql
CREATE TABLE event_dayname (
  id INTEGER,
  event_date DATE FORMAT 'EEEE'
);

SELECT * FROM event_dayname WHERE event_date = '03-30-2026';
```

##### Output:

```sql
CREATE OR REPLACE TABLE event_dayname (
  id INTEGER,
  event_date DATE
                  !!!RESOLVE EWI!!! /*** SSC-EWI-TD0040 - COLUMN-LEVEL FORMAT CLAUSE 'EEEE' IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
                  FORMAT 'EEEE'
)
;

SELECT
  *
FROM
  event_dayname
WHERE
  event_date = '03-30-2026';
```

Notice that the string literal `'03-30-2026'` in the SELECT statement is left unchanged because the format could not be translated.

#### How FORMAT issues are classified

| Column Type | Format Pattern | Issue | DML Effect |
| --- | --- | --- | --- |
| `DATE`, `TIMESTAMP`, `TIME` | Snowflake standard (e.g., `'YYYY-MM-DD'`, `'HH:MI:SS'`) | None (silently removed) | No conversion needed |
| `DATE`, `TIMESTAMP`, `TIME` | Translatable non-standard (e.g., `'MM-DD-YYYY'`) | [SSC-FDM-TD0040](../functional-difference/teradataFDM.md) | Conversion functions added automatically |
| `DATE`, `TIMESTAMP`, `TIME` | Not translatable (e.g., `'EEEE'`) | **SSC-EWI-TD0040** | No conversion added; manual fix needed |
| `VARCHAR`, `CHAR`, `CLOB`, `STRING` | Display-only `X(n)` | [SSC-FDM-TD0041](../functional-difference/teradataFDM.md) | No conversion needed |
| Any other | Any | **SSC-EWI-TD0040** | No conversion added; manual fix needed |

#### Best Practices

* Review the format string and check whether it can be rewritten using [Snowflake-supported format elements](https://docs.snowflake.com/en/sql-reference/functions-conversion#date-and-time-formats-in-conversion-functions). If so, add the appropriate `TO_DATE`, `TO_TIMESTAMP`, or `TO_TIME` call yourself.
* If the format was used only for display purposes and does not affect how data is stored or queried, it can be safely removed.
* After conversion, verify that the converted code behaves correctly for any columns where this EWI appears.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0041

Trunc function was added to ensure integer.

### Severity

Low

#### Description

When migrating Teradata to Snowflake, you may encounter differences in how numeric conversions are handled. In Teradata, casting a value to `INTEGER` will implicitly truncate any decimal part, even if the original value is a floating-point number or a string representation of a number. However, in Snowflake, casting a non-integer numeric or a string directly to `INTEGER` can result in errors or unexpected results if the value is not already an integer.

To ensure compatibility, the `TRUNC()` function is applied before casting to `INTEGER`. This strips any decimal portion, allowing safe conversion to an integer. However, if the source value is not numeric or is a non-numeric string, errors may still occur and manual intervention may be required. For example, if SnowConvert AI cannot determine the column type due to missing references, you may need to manually adjust the conversion.

#### Example Code

##### Input:

```sql
 SELECT
    cast(date_column as integer);
```

##### Output:

```sql
 SELECT
    cast(TRUNC(date_column) as integer) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0041 - TRUNC FUNCTION WAS ADDED TO ENSURE INTEGER. MAY NEED CHANGES IF NOT NUMERIC OR STRING. ***/!!!;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0046

Built-in reference is not supported in Snowflake.

### Severity

Medium

#### Description

This error appears when there is a reference to a [DBC](https://docs.teradata.com/r/Teradata-Archive/Recovery-Utility-Reference/March-2019/Archive/Recovery-Operations/Database-DBC) table and the selected column has no equivalence in Snowflake.

#### Example Code

##### Input:

```sql
 CREATE VIEW SAMPLE_VIEW
AS
SELECT PROTECTIONTYPE FROM DBC.DATABASES;
```

##### Output:

```sql
 CREATE OR REPLACE VIEW SAMPLE_VIEW
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "08/14/2024" }}'
AS
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0046 - BUILT-IN REFERENCE TO PROTECTIONTYPE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
PROTECTIONTYPE FROM
INFORMATION_SCHEMA.DATABASES;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0049

TPT-Statement not processed.

### Severity

High

#### Description

A DML statement in TPT could not be processed and converted by the tool. This can happen for reasons like using concatenation with script variables or using escaping quotes inside the DML statement.

#### Example code

##### Input Code:

```sql
 -- Script1.tpt
DEFINE JOB load_job
DESCRIPTION 'LOAD TABLE FROM A FILE'
  (
     DEFINE SCHEMA schema_name
     DESCRIPTION 'define SCHEMA'
   (
       var1 VARCHAR (50)
   );

   STEP setup_tables
   (
      APPLY
       ('RELEASE MLOAD database_name.table_name;')
     TO OPERATOR (DDL_OPERATOR() );

   );
);
```

##### Generated Code:

```python
 #*** Generated code is based on the SnowConvert AI Python Helpers version 2.0.6 ***

import os
import sys
import snowconvert.helpers
from snowconvert.helpers import Export
from snowconvert.helpers import exec
from snowconvert.helpers import BeginLoading
import argparse
args = None
## Script1.tpt
class load_job:
    #'LOAD TABLE FROM A FILE'

  jobname = "load_job"
    #'define SCHEMA'

  schema_name = """(
var1 VARCHAR(50)
);"""
  def setup_tables(self):
    self.DDL_OPERATOR()
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0049 - THE FOLLOWING STATEMENT COULD NOT BE PROCESSED ***/!!!
      #'RELEASE MLOAD database_name.table_name;'

con = None
def main():
  snowconvert.helpers.configure_log()
  con = snowconvert.helpers.log_on()
  _load_job = load_job()
  _load_job.setup_tables()
  snowconvert.helpers.quit_application()

if __name__ == "__main__":
  main()
```

### Best Practices

* For this issue, you can type the insert statement manually, and/or since the DML statement is not being supported yet, ask the SnowConvert AI team to add support for that specific case.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0051

Teradata BYTES function results differs from Snowflake LENGTH function for byte columns

### Severity

Low

#### Description

Since Teradata byte datatype has a fixed length, BYTES function [will always count the trailing zeros](https://docs.teradata.com/r/1DcoER_KpnGTfgPinRAFUw/f7V55vW7OB1nU2WltjLxig) inserted to fit smaller byte type values into the column, returning the size of the column instead of the size of the value inserted originally. However, Snowflake binary type has variable size, meaning that the LENGTH function will always return the size of the inserted values. Take the following code as an example:

Teradata:

```sql
 create table exampleTable(
	bytecol byte(10)
);

insert into exampleTable values ('2B'XB);

select bytes(bytecol) from exampleTable;
-- Will return 10, the size of bytecol
```

Equivalent code in Snowflake:

```sql
 CREATE OR REPLACE TABLE exampleTable (
	bytecol BINARY
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

INSERT INTO exampleTable
VALUES (TO_BINARY('2B'));

SELECT
	LENGTH(bytecol) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0051 - TERADATA BYTES FUNCTION RESULTS DIFFER FROM SNOWFLAKE LENGTH FUNCTION FOR BYTE TYPE COLUMNS ***/!!! from
	exampleTable;
	-- Will return 10, the size of bytecol
```

#### Example code:

##### Input code:

```sql
 create table sampleTable(
    byteColumn byte(10),
    varbyteColumn varbyte(15)
);

select bytes(byteColumn), bytes(varbyteColumn) from sampleTable;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE sampleTable (
    byteColumn BINARY,
    varbyteColumn BINARY(15)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT
    LENGTH(byteColumn) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0051 - TERADATA BYTES FUNCTION RESULTS DIFFER FROM SNOWFLAKE LENGTH FUNCTION FOR BYTE TYPE COLUMNS ***/!!!,
    LENGTH(varbyteColumn) from
    sampleTable;
```

#### Best Practices

* Analyze the use given to the BYTES function results, the Snowflake LENGTH function behavior was the one desired from the start and no changes are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0052

Snowflake implicit conversion to numeric differs from Teradata and may fail for non-literal strings

### Severity

Low

#### Description

Both Teradata and Snowflake allow string values to function that expect numeric parameters, these strings are then parsed and converted to their numeric equivalent.

However, there are differences on what the two languages consider a valid numeric string, Teradata is more permissive and successfully parses cases like empty / whitespace-only strings, embedded dashes, having no digits in the mantissa or exponent, currency signs, digit separators or specifying the sign of the number after the digits. For example, the following strings are valid:

* `'1-2-3-4-5' -> 12345`
* `'$50' -> 50`
* `'5000-' -> -5000`
* `'1,569,284.55' -> 1569284.55`

Snowflake applies [automatic optimistic string conversion](https://docs.snowflake.com/en/sql-reference/sql-format-models.html#default-formats-for-parsing), expecting the strings to match either the TM9 or TME formats, so conversion fails for most of the cases mentioned. To solve these differences, SnowConvert AI processes string literals passed to functions that do an implicit conversion to numeric and generates equivalent strings that match TM9 or TME so they can be parsed by Snowflake. This only applies to literal string values, meaning non-literal values have no guarantee to be parsed by Snowflake.

#### Example code

##### Input code:

```sql
 create table myTable(
    stringCol varchar(30)
);

insert into myTable values ('   1,236,857.45-');

select cos('   1,236,857.45-');

select cos(stringCol) from myTable;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE myTable (
    stringCol varchar(30)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "07/14/2025",  "domain": "no-domain-provided" }}'
;

INSERT INTO myTable
VALUES ('   1,236,857.45-');

SELECT
    COS('-1236857.45');

    SELECT
    COS(stringCol !!!RESOLVE EWI!!! /*** SSC-EWI-TD0052 - SNOWFLAKE IMPLICIT CONVERSION TO NUMERIC DIFFERS FROM TERADATA AND MAY FAIL FOR NON-LITERAL STRING VALUES ***/!!!)
    from
    myTable;
```

#### Best Practices

* No additional user actions are required.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0053

Snowflake does not support the period datatype, all periods are handled as varchar instead

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TD0036](../functional-difference/teradataFDM.md) documentation

### Precision of generated varchar representations

PERIOD_UDF generates the varchar representation of a period using the default formats for timestamps and time specified in Snowflake, this means timestamps will have three precision digits and time variables will have zero, because of this you may find that the results have a higher/lower precision from the expected, there are two options to modify how many precision digits are included in the resulting string:

* Use the three parameters version of PERIOD_UDF: This overload of the function takes the`PRECISIONDIGITS`parameter, an integer between 0 and 9 to control how many digits of the fractional time part will be included in the result. Note that even if Snowflake supports up to nine digits of precision the maximum in Teradata is six. Example:

| Call | Result |
| --- | --- |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 0)` | `'13:30:45*15:35:20'` |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 2)` | `'13:30:45.87*15:35:20.34'` |
| `PUBLIC.PERIOD_UDF(time '13:30:45.870556', time '15:35:20.344891', 5)` | `'13:30:45.87055*15:35:20.34489'` |

* Alter the session parameters `TIMESTAMP_NTZ_OUTPUT_FORMAT` and `TIME_OUTPUT_FORMAT`: The commands `ALTER SESSION SET TIMESTAMP_NTZ_OUTPUT_FORMAT = <format>` and`ALTER SESSION SET TIME_OUTPUT_FORMAT = <format>`

  can be used to modify the formats Snowflake uses by default for the current session, modifying them to include the desired number of precision digits changes the result of future executions of PERIOD_UDF for the current session.

#### Example code

##### Input code:

```sql
 create table vacations (
    employeeName varchar(50),
    duration period(date)
);

insert into vacations values ('Richard', period(date '2021-05-15', date '2021-06-15'));

select end(duration) from vacations;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE vacations (
    employeeName varchar(50),
    duration VARCHAR(24) /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

INSERT INTO vacations
VALUES ('Richard', PUBLIC.PERIOD_UDF(date '2021-05-15', date '2021-06-15') !!!RESOLVE EWI!!! /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/);

SELECT
    PUBLIC.PERIOD_END_UDF(duration) /*** SSC-FDM-TD0036 - SNOWFLAKE DOES NOT SUPPORT THE PERIOD DATATYPE, ALL PERIODS ARE HANDLED AS VARCHAR INSTEAD ***/ from
    vacations;
```

#### Best Practices

* Since the behavior of`PERIOD`and its related functions is emulated using varchar, we recommend reviewing the results obtained to ensure its correctness.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0055

Snowflake supported formats for TO_CHAR differ from Teradata and may fail or have different behavior

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TD0029](../functional-difference/teradataFDM.md) documentation

### Format elements that depend on session parameters

Some Teradata format elements are mapped to Snowflake functions that depend on the value of session parameters. To avoid functional differences in the results you should set these session parameters to the same values they have in Teradata. Identified format elements that are mapped to this kind of functions are:

* **D**: Mapped to `DAYOFWEEK` function, the results of this function depend on the `WEEK_START` session parameter, by default Teradata considers Sunday as the first day of the week, while in Snowflake it is Monday.
* **WW**: Mapped to `WEEK` function, this function depends on the session parameter `WEEK_OF_YEAR_POLICY` which by default is set to use the ISO standard (the first week of year is the first to contain at least four days of January) but in Teradata is set to consider January first as the start of the first week.

To modify session parameters, use `ALTER SESSION SET parameter_name = value`. For more information, see the [Snowflake session parameters reference](https://docs.snowflake.com/en/sql-reference/parameters.html).

#### Single parameter version of TO_CHAR

The single parameter version of `TO_CHAR(Datetime)` makes use of the default formats specified in the session parameters `TIMESTAMP_LTZ_OUTPUT_FORMAT`, `TIMESTAMP_NTZ_OUTPUT_FORMAT`, `TIMESTAMP_TZ_OUTPUT_FORMAT` and `TIME_OUTPUT_FORMAT`. To avoid differences in behavior please set them to the same values used in Teradata.

For `TO_CHAR(Numeric)` Snowflake generates the varchar representation using either the `TM9` or `TME` formats to get a compact representation of the number, Teradata also generates compact representations of the numbers so no action is required.

#### Example Code

##### Input Code:

```sql
 select to_char(date '2008-09-13', 'DD/RM/YYYY');

select to_char(date '2010-10-20', 'DS');

select to_char(1255.495, 'SC9999.9999', 'nls_iso_currency = ''EUR''');

select to_char(45620);
```

##### Generated Code:

```sql
 SELECT
TO_CHAR(date '2008-09-13', 'DD/') || PUBLIC.ROMAN_NUMERALS_MONTH_UDF(date '2008-09-13') || TO_CHAR(date '2008-09-13', '/YYYY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0055 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/!!!;

SELECT
TO_CHAR(date '2010-10-20', 'MM/DD/YYYY') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0055 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/!!!;

SELECT
PUBLIC.INSERT_CURRENCY_UDF(TO_CHAR(1255.495, 'S9999.0000'), 2, 'EUR') !!!RESOLVE EWI!!! /*** SSC-EWI-TD0055 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/!!!;

SELECT
TO_CHAR(45620) !!!RESOLVE EWI!!! /*** SSC-EWI-TD0055 - SNOWFLAKE SUPPORTED FORMATS FOR TO_CHAR DIFFER FROM TERADATA AND MAY FAIL OR HAVE DIFFERENT BEHAVIOR ***/!!!;
```

### Best Practices

* When using FF either try to use DateTime types with the same precision that you use in Teradata or add a precision to the format element to avoid the different behavior.
* When using timezone-related format elements, use the first parameter of type `TIMESTAMP_TZ` to avoid different behavior. Also remember that the `TIME` type cannot have time zone information in Snowflake.
* Set the necessary session parameters with the default values from Teradata to avoid different behavior.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0057

Binary data in NEW JSON is not supported

### Severity

Low

### Description

The NEW JSON function accepts the JSON data represented as a string or in binary format. when the data is in its binary representation the function is not transformed since this binary format is not valid in Snowflake because it cannot interpret the metadata about the JSON object, for more information about this please see Teradata NEW JSON [documentation](https://docs.teradata.com/r/C8cVEJ54PO4~YXWXeXGvsA/QpXrJfufgZ4uyeXFz7Rtcg).

### Example Code

#### Input Code

```sql
 SELECT NEW JSON ('160000000268656C6C6F0006000000776F726C640000'xb, BSON);
```

##### Generated Code

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0057 - NEW JSON FUNCTION WITH BINARY DATA IS NOT SUPPORTED ***/!!!!!!RESOLVE EWI!!! /*** SSC-EWI-TD0039 - INPUT FORMAT 'BSON' NOT SUPPORTED ***/!!!
NEW JSON (TO_BINARY('160000000268656C6C6F0006000000776F726C640000'), BSON);
```

### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0059

Snowflake user default time zone may require configuration to match Teradata value

### Severity

Low

#### Description

Same as Teradata, setting a default time zone value to the user will make sessions start using that time zone until a new value is defined for the session.

This warning is generated to remind that the same time zone that was defined for the user in Teradata should be set for the Snowflake user, to do this please use the following query in Snowflake: `ALTER SESSION SET TIMEZONE = 'equivalent_timezone'`, remember that Snowflake only accepts [IANA Time Zone Database](https://www.iana.org/time-zones) standard time zones.

#### Example Code

##### Input Code:

```sql
 SET TIME ZONE USER;
```

##### Generated Code:

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0059 - SNOWFLAKE USER DEFAULT TIME ZONE MAY REQUIRE CONFIGURATION TO MATCH TERADATA VALUE ***/!!!
ALTER SESSION UNSET TIMEZONE;
```

#### Best Practices

* Remember to set the default time zone of the user to a time zone equivalent to the one set for the Teradata user.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0060

JSON_TABLE not transformed, column names could not be retrieved from semantic information

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

The JSON_TABLE function can be transformed by SnowConvert AI, however, this transformation requires knowing the name of the columns that are being selected in the JSON_TABLE ON subquery.

This message is generated to warn the user that the column names were not explicitly put in the subquery (for example, a SELECT \* was used) and the semantic information of the tables being referenced was not found, meaning the column names could not be extracted.

If you want know how to load JSON data into a table check this [page](https://docs.snowflake.com/en/user-guide/script-data-load-transform-json)

#### Example code

##### Input Code:

```sql
 CREATE TABLE demo.Train (
    firstCol INT,
    jsonCol JSON(400),
    thirdCol VARCHAR(30)
);

SELECT * FROM JSON_TABLE
(ON (SELECT T.*
           FROM demo.Train T)
USING rowexpr('$.schools[*]')
               colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
)
AS JT;

SELECT * FROM JSON_TABLE
(ON (SELECT T.*
           FROM demo.missingTable T)
USING rowexpr('$.schools[*]')
               colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
)
AS JT;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE demo.Train (
    firstCol INT,
    jsonCol VARIANT,
    thirdCol VARCHAR(30)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "teradata",  "convertedOn": "12/16/2024",  "domain": "test" }}'
;

SELECT
    * FROM
    (
        SELECT
            firstCol,
            rowexpr.value:name :: CHAR(20) AS Column_0,
            rowexpr.value:type :: VARCHAR(20) AS Column_1,
            thirdCol
        FROM
            demo.Train T,
            TABLE(FLATTEN(INPUT => jsonCol:schools)) rowexpr
    ) JT;

    SELECT
    * FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0060 - JSON_TABLE NOT TRANSFORMED, COLUMN NAMES COULD NOT BE RETRIEVED FROM SEMANTIC INFORMATION ***/!!! JSON_TABLE
   (ON (
        SELECT
            T.*
                  FROM
            demo.missingTable T)
   USING rowexpr('$.schools[*]')
                  colexpr('[ {"jsonpath" : "$.name",
                           "type" : "CHAR(20)"},
                          {"jsonpath" : "$.type",
                           "type" : "VARCHAR(20)"}]')
   )
   AS JT;
```

#### Best Practices

* Please check the code provided to SnowConvert AI is complete, if you did not provide the table definition please re-execute the code with the table definition present.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0061

TD_UNPIVOT transformation requires column information that could not be found, columns missing in result

### Severity

Low

#### Description

SnowConvert AI not supports and transforms the [TD_UNPIVOT](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Operators-and-User-Defined-Functions/Table-Operators/TD_UNPIVOT) function, which can be used to represent columns from a table as rows.

However, this transformation requires information about the table/tables columns to work, more specifically the names of the columns. When this information is not present the transformation may be left in an incomplete state where columns are missing from the result, this EWI is generated in these cases.

#### Example code

##### Input Code:

```sql
 CREATE TABLE unpivotTable  (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
);

SELECT * FROM
 TD_UNPIVOT(
 	ON unpivotTable
 	USING
 	VALUE_COLUMNS('Income', 'Expenses')
 	UNPIVOT_COLUMN('Semester')
 	COLUMN_LIST('firstSemesterIncome, firstSemesterExpenses', 'secondSemesterIncome, secondSemesterExpenses')
 	COLUMN_ALIAS_LIST('First', 'Second')
 )X ORDER BY mykey;

SELECT * FROM
 TD_UNPIVOT(
 	ON unknownTable
 	USING
 	VALUE_COLUMNS('MonthIncome')
 	UNPIVOT_COLUMN('Months')
 	COLUMN_LIST('januaryIncome', 'februaryIncome', 'marchIncome', 'aprilIncome')
 	COLUMN_ALIAS_LIST('January', 'February', 'March', 'April')
 )X ORDER BY yearKey;
```

##### Generated Code:

```sql
 CREATE OR REPLACE TABLE unpivotTable (
	myKey INTEGER NOT NULL PRIMARY KEY,
	firstSemesterIncome DECIMAL(10,2),
	secondSemesterIncome DECIMAL(10,2),
	firstSemesterExpenses DECIMAL(10,2),
	secondSemesterExpenses DECIMAL(10,2)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "VALUE_COLUMNS", "UNPIVOT_COLUMN", "COLUMN_LIST", "COLUMN_ALIAS_LIST" **
SELECT
	* FROM
	(
		SELECT
			myKey,
			TRIM(GET_IGNORE_CASE(OBJECT_CONSTRUCT('FIRSTSEMESTERINCOME', 'First', 'FIRSTSEMESTEREXPENSES', 'First', 'SECONDSEMESTERINCOME', 'Second', 'SECONDSEMESTEREXPENSES', 'Second'), Semester), '"') AS Semester,
			Income,
			Expenses
		FROM
			unpivotTable UNPIVOT(Income FOR Semester IN (
				firstSemesterIncome,
				secondSemesterIncome
			)) UNPIVOT(Expenses FOR Semester1 IN (
				firstSemesterExpenses,
				secondSemesterExpenses
			))
		WHERE
			Semester = 'FIRSTSEMESTERINCOME'
			AND Semester1 = 'FIRSTSEMESTEREXPENSES'
			OR Semester = 'SECONDSEMESTERINCOME'
			AND Semester1 = 'SECONDSEMESTEREXPENSES'
	) X ORDER BY mykey;

	--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS "VALUE_COLUMNS", "UNPIVOT_COLUMN", "COLUMN_LIST", "COLUMN_ALIAS_LIST" **
	SELECT
	* FROM
	!!!RESOLVE EWI!!! /*** SSC-EWI-TD0061 - TD_UNPIVOT TRANSFORMATION REQUIRES COLUMN INFORMATION THAT COULD NOT BE FOUND, COLUMNS MISSING IN RESULT ***/!!!
	(
		SELECT
			TRIM(GET_IGNORE_CASE(OBJECT_CONSTRUCT('JANUARYINCOME', 'January', 'FEBRUARYINCOME', 'February', 'MARCHINCOME', 'March', 'APRILINCOME', 'April'), Months), '"') AS Months,
			MonthIncome
		FROM
			unknownTable UNPIVOT(MonthIncome FOR Months IN (
				januaryIncome,
				februaryIncome,
				marchIncome,
				aprilIncome
			))
	) X ORDER BY yearKey;
```

#### Best Practices

* There are two ways of supplying the information about columns to the conversion tool: put the table specification in the same file as the TD_UNPIVOT call or specify a column list in the SELECT query of the ON expression instead of SELECT \* or the table name.
* This issue can be safely ignored if ALL the columns from the input table/tables are unpivoted, otherwise, the result will have missing columns.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0063

JSON path was not recognized

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

This message is shown when SnowConvert AI cannot deserialize a JSON path because the string does not have the expected JSON format.

#### Example code

##### Input Code:

```sql
 SELECT
    *
FROM
JSON_TABLE (
    ON (
        SELECT
            id,
            trainSchedule as ts
        FROM
            demo.PUBLIC.Train T
    ) USING rowexpr('$weekShedule.Monday[*]') colexpr(
        '[{"jsonpath"  "$.time",
              "type"" : "CHAR ( 12 )"}]'
    )
) AS JT(Id, Ordinal, Time, City);
```

##### Generated Code:

```sql
 SELECT
    *
FROM
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0063 - UNRECOGNIZED JSON PATH $weekShedule.Monday[*] ***/!!!
JSON_TABLE (
    ON
       !!!RESOLVE EWI!!! /*** SSC-EWI-0108 - THE FOLLOWING SUBQUERY MATCHES AT LEAST ONE OF THE PATTERNS CONSIDERED INVALID AND MAY PRODUCE COMPILATION ERRORS ***/!!! (
           SELECT
               id,
               trainSchedule as ts
FROM
               demo.PUBLIC.Train T
    ) USING rowexpr('$weekShedule.Monday[*]') colexpr(
        '[{"jsonpath"  "$.time",
              "type"" : "CHAR ( 12 )"}]'
    )
) AS JT(Id, Ordinal, Time, City);
```

#### Best Practices

* Check if the Json path have an unexpected character, or do not have the right format.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0066

The following identifier has one or more Unicode escape characters that are invalid in snowflake

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This message is shown when SnowConvert AI transforms a [Teradata Unicode Delimited Identifier](https://docs.teradata.com/r/Teradata-Database-SQL-Fundamentals/June-2017/Basic-SQL-Syntax/Working-with-Unicode-Delimited-Identifiers) with invalid characters in Snowflake.

#### Example code

##### Input Code:

```sql
 SELECT * FROM U&"#000f#ffff" UESCAPE '#';
```

##### Generated Code:

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
* FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0066 - THE FOLLOWING IDENTIFIER HAS ONE OR MORE UNICODE ESCAPE CHARACTERS THAT ARE INVALID IN SNOWFLAKE ***/!!!
"\u000f\uffff";
```

#### Best Practices

* Use identifiers with valid Unicode characters in Snowflake.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0068

Snowflake does not support profiles, referencing role instead

### Severity

Medium

#### Description

Teradata profiles allow defining of multiple common parameters related to storage space and password constraints management.

However, [Snowflake works with cloud architecture and automatically manages and optimizes storage](https://docs.snowflake.com/en/user-guide/intro-key-concepts.html#key-concepts-architecture), meaning no storage customization is done on the user side. Also, [Snowflake currently has a password policy](https://docs.snowflake.com/en/user-guide/admin-user-management.html#snowflake-password-policy) defined that applies to all user passwords and is not modifiable.

This error is generated when a reference to a Teradata profile is found to indicate that it was changed to a reference to the user’s role, which is the nearest approximation to a profile in Snowflake, although there might be differences in the query results unless the profile and role names of a user are the same.

#### Example code

##### Input Code:

```sql
 SELECT PROFILE;
```

##### Generated Code:

```sql
 SELECT
CURRENT_ROLE() !!!RESOLVE EWI!!! /*** SSC-EWI-TD0068 - SNOWFLAKE DOES NOT SUPPORT PROFILES, REFERENCING ROLE INSTEAD ***/!!!;
```

#### Best Practices

* Avoid referencing user profiles, they are not supported, and query results will be different unless the user has the same name for both its profile and role.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0069

ST_DISTANCE results are slightly different from ST_SPHERICALDISTANCE

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TD0031](../functional-difference/teradataFDM.md) documentation

### Severity

Low

#### Description

The Teradata function ST_SPHERICALDISTANCE calculates the distance between two spherical coordinates on the planet using the Haversine formula, on the other side, the Snowflake ST_DISTANCE function does not utilize the haversine formula to calculate the minimum distance between two geographical points.

#### Example Code

##### Input Code:

```sql
 --The distance between New York and Los Angeles
Select Cast('POINT(-73.989308 40.741895)' As ST_GEOMETRY) As location1,
	Cast('POINT(40.741895 34.053691)' As ST_GEOMETRY) As location2,
	location1.ST_SPHERICALDISTANCE(location2) As Distance_In_km;
```

##### Generated Code

```sql
 --The distance between New York and Los Angeles
SELECT
	Cast('POINT(-73.989308 40.741895)' As GEOGRAPHY) As location1,
	Cast('POINT(40.741895 34.053691)' As GEOGRAPHY) As location2,
	!!!RESOLVE EWI!!! /*** SSC-EWI-TD0069 - ST_DISTANCE RESULTS ARE SLIGHTLY DIFFERENT FROM ST_SPHERICALDISTANCE ***/!!!
	ST_DISTANCE(
	location1, location2) As Distance_In_km;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0070

A return statement was added at the end of the label section to ensure the same execution flow

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TD0030](../functional-difference/teradataFDM.md) documentation

### Severity

Medium

#### Description

When a Goto statement is replaced with a Label section and does not contain a return statement, one is added at the end of the section to ensure the same execution flow.

BTEQ after a Goto command is executed, the statements between the goto command and the label command with the same name are ignored. So, to avoid those statements being executed the label section should contain a return statement.

In addition, it is worth value mentioning the Goto command skips all the other statements except for the Label with the same name, which is when the execution resumes. Therefore, the execution will never resume in a label section defined before the Goto command.

#### Example Code

##### Input Code:

```sql
 -- Additional Params: --scriptsTargetLanguage SnowScript
.LOGON dbc,dbc;
select 'STATEMENTS';
.GOTO LABEL_B
select 'IGNORED STATEMENTS';
.label LABEL_B
select 'LABEL_B STATEMENTS';
```

##### Generated Code

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    --.LOGON dbc,dbc
    !!!RESOLVE EWI!!! /*** SSC-EWI-0073 - PENDING FUNCTIONAL EQUIVALENCE REVIEW FOR 'BTLogOn' NODE ***/!!!
    null;
    BEGIN
      SELECT
        'STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;

    /*.label LABEL_B*/

    BEGIN
      SELECT
        'LABEL_B STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0070 - A RETURN STATEMENT WAS ADDED AT THE END OF THE LABEL SECTION LABEL_B TO ENSURE THE SAME EXECUTION FLOW ***/!!!
    RETURN 0;
    BEGIN
      SELECT
        'IGNORED STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
    /*.label LABEL_B*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    BEGIN
      SELECT
        'LABEL_B STATEMENTS';
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
  END
$$
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0076

The use of foreign tables is not supported in Snowflake.

### Severity

Medium

#### Description

[Foreign tables](https://docs.teradata.com/r/Teradata-VantageTM-SQL-Data-Definition-Language-Syntax-and-Examples/September-2020/Table-Statements/CREATE-FOREIGN-TABLE) enable access to data in external object storage, such as semi-structured and unstructured data in Amazon S3, Azure Blob storage, and Google Cloud Storage. This syntax is not supported in Snowflake. However, there are other alternatives in Snowflake that can be used instead, such as external tables, iceberg tables, and standard tables.

#### Example code

##### Input code:

```sql
 SELECT cust_id, income, age FROM
FOREIGN TABLE (SELECT cust_id, income, age FROM twm_customer)@hadoop1 T1;
```

##### Generated Code:

```sql
 SELECT
cust_id,
income,
age FROM
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0076 - THE USE OF FOREIGN TABLES IS NOT SUPPORTED IN SNOWFLAKE. ***/!!!
 FOREIGN TABLE (SELECT cust_id, income, age FROM twm_customer)@hadoop1 T1;
```

#### Best Practices

* Instead of foreign tables in Teradata, you can use [Snowflake external tables](https://docs.snowflake.com/en/user-guide/tables-external.html). External tables reference data files located in a cloud storage (Amazon S3, Google Cloud Storage, or Microsoft Azure) data lake. This enables querying data stored in files in a data lake as if it were inside a database. External tables can access data stored in any format supported by [COPY INTO <table>](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table.html) statements.
* Another alternative is [Snowflake’s Iceberg tables](https://www.snowflake.com/blog/iceberg-tables-powering-open-standards-with-snowflake-innovations/?lang=es). So, you can think of Iceberg tables as tables that use open formats and customer-supplied cloud storage. This data is stored in Parquet files.
* Finally, there are the [standard Snowflake tables](https://docs.snowflake.com/en/sql-reference/sql/create-table.html) which can be an option to cover the functionality of foreign tables in Teradata
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0077

RESET WHEN clause is not supported in this scenario due to its condition

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

SnowConvert AI currently only supports `RESET WHEN` clauses with binary conditions (<=, >= or =). Any other type of condition, such as `IS NOT NULL`, the `RESET WHEN` clause will be removed and an error message will be added since it is not supported in Snowflake.

This error message also appears when the `RESET WHEN` condition references an expression whose definition was not found by the migration tool. Currently, the tool supports the alias references to a column that was defined in the same query.

#### Example Code

##### Condition is not binary

##### Input Code:

```sql
 SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
        ORDER BY month_id
        RESET WHEN balance IS NOT NULL
        ROWS UNBOUNDED PRECEDING
    ) as balance_increase
FROM account_balance
ORDER BY 1,2;
```

##### Generated Code

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0077 - RESET WHEN CLAUSE IS NOT SUPPORTED IN THIS SCENARIO DUE TO ITS CONDITION ***/!!!
        ORDER BY month_id
        ROWS UNBOUNDED PRECEDING
    ) as balance_increase
FROM
    account_balance
ORDER BY 1,2;
```

##### Condition expression was not found

##### Input Code:

```sql
 SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
        ORDER BY month_id
        RESET WHEN balance <= not_found_expresion
    ) as balance_increase
FROM account_balance
ORDER BY 1,2;
```

##### Generated Code

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
    account_id,
    month_id,
    balance,
    ROW_NUMBER() OVER (
        PARTITION BY account_id
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0077 - RESET WHEN CLAUSE IS NOT SUPPORTED IN THIS SCENARIO DUE TO ITS CONDITION ***/!!!
        ORDER BY month_id
    ) as balance_increase
FROM
    account_balance
ORDER BY 1,2;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0079

The required period type column was not found

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Low

#### Description

This warning is shown because the Period column necessary to replicate the functionality of Normalize clause was not found.

#### Example Code

##### Input Code:

```sql
 SELECT NORMALIZE emp_id, duration2 FROM project;
```

##### Generated Code

```sql
 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0079 - THE REQUIRED PERIOD TYPE COLUMN WAS NOT FOUND ***/!!!
// SnowConvert AI Helpers Code section is omitted.
WITH NormalizeCTE AS
(
SELECT
T1.*,
SUM(GroupStartFlag)
OVER (
PARTITION BY
emp_id, duration2
ORDER BY
PeriodColumn_begin
ROWS UNBOUNDED PRECEDING) GroupID
FROM
(
SELECT 
emp_id,
duration2,
PUBLIC.PERIOD_BEGIN_UDF(PeriodColumn) PeriodColumn_begin,
PUBLIC.PERIOD_END_UDF(PeriodColumn) PeriodColumn_end,
(CASE
WHEN PeriodColumn_begin <= LAG(PeriodColumn_end)
OVER (
PARTITION BY
emp_id, duration2
ORDER BY
PeriodColumn_begin,
PeriodColumn_end)
THEN 0
ELSE 1
END) GroupStartFlag FROM 
project
) T1
)
SELECT
emp_id,
duration2,
PUBLIC.PERIOD_UDF(MIN(PeriodColumn_begin), MAX(PeriodColumn_end))
FROM
NormalizeCTE
GROUP BY
emp_id,
duration2,
GroupID;
```

#### Best Practices

* To fix this warning manually you just need to find which was the first period column and remove all its references except where is defined, and then replace PeriodColumn with the column found.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0082

Translate function using the current encoding is not supported

### Severity

Medium

#### Description

The usage of the Translate function using the current encoding arguments is not supported in Snowflake. The function is commented out during translation.

#### Example Code

##### Input Code:

```sql
 SELECT Translate('abc' USING KANJISJIS_TO_LATIN);
```

##### Generated Code

```sql
 SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0082 - TRANSLATE FUNCTION USING KANJISJIS_TO_LATIN ENCODING IS NOT SUPPORTED ***/!!!
Translate('abc' USING KANJISJIS_TO_LATIN);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0083

Not able to transform two or more complex Select clauses at a time

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

SnowConvert AI is not able to transform two or more complex SELECT clauses, as it is necessary to map them to a CTE or composite FROM clause, which causes the mapped code to not compile or enter into a logical cycle.

##### What do we consider a SELECT complex clause?

Those that required to be mapped to a CTE or composite FROM clause such as NORMALIZE, EXPAND ON, or RESET WHEN.

#### Example Code

##### Input Code:

```sql
 SELECT
   NORMALIZE emp_id,
   duration,
   dept_id,
   balance,
   (
     ROW_NUMBER() OVER (
       PARTITION BY emp_id
       ORDER BY
         dept_id RESET WHEN balance <= SUM(balance) OVER (
           PARTITION BY emp_id
           ORDER BY dept_id
           ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
         )
     ) -1
   ) AS balance_increase
FROM project
EXPAND ON duration AS bg BY ANCHOR ANCHOR_SECOND
ORDER BY 1, 2;
```

##### Generated Code

```sql
 // SnowConvert AI Helpers Code section is omitted.
SELECT
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0083 - NOT ABLE TO TRANSFORM TWO OR MORE COMPLEX SELECT CLAUSES AT A TIME ***/!!!
NORMALIZE emp_id,
   duration,
   dept_id,
   balance,
   (
     ROW_NUMBER() OVER (
   PARTITION BY
      emp_id, new_dynamic_part
   ORDER BY
         dept_id
     ) -1
   ) AS balance_increase
FROM
   (
      SELECT
         emp_id,
         duration,
         dept_id,
         balance,
         previous_value,
         SUM(dynamic_part) OVER (
                 PARTITION BY emp_id
                 ORDER BY dept_id
         ROWS BETWEEN UNBOUNDED PRECEDING AND CURRENT ROW
               ) AS new_dynamic_part
      FROM
         (
            SELECT
               emp_id,
               duration,
               dept_id,
               balance,
               SUM(balance) OVER (
                       PARTITION BY emp_id
                       ORDER BY dept_id
                       ROWS BETWEEN 1 PRECEDING AND 1 PRECEDING
                     ) AS previous_value,
               (CASE
                  WHEN balance <= previous_value
                     THEN 1
                  ELSE 0
               END) AS dynamic_part
            FROM
               project
         )
   )
!!!RESOLVE EWI!!! /*** SSC-EWI-TD0083 - NOT ABLE TO TRANSFORM TWO OR MORE COMPLEX SELECT CLAUSES AT A TIME ***/!!!
EXPAND ON duration AS bg BY ANCHOR ANCHOR_SECOND
ORDER BY 1, 2;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0087

GOTO statement was removed due to if statement inversion.

> **Note:**
>
> This EWI is deprecated, please refer to [SSC-FDM-TD0026](../functional-difference/teradataFDM.md) documentation

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons.

### Severity

Medium

#### Description

It is common to use GOTO command with IF and LABEL commands to replicate the functionality of an SQL if statement. When used in this way, it is possible to transform them directly into an if, if-else, or even an if-elseif-else statement. However, in these cases, the GOTO commands become unnecessary and should be removed to prevent them from being replaced by a LABEL section.

#### Example Code

##### Input Code:

```
-- Additional Params: --scriptsTargetLanguage SnowScript
.If ActivityCount = 0 THEN .GOTO endIf
DROP TABLE TABLE1;
.Label endIf
SELECT A FROM TABLE1;
```

##### Generated Code

```sql
 EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    IF (NOT (STATUS_OBJECT['SQLROWCOUNT'] = 0)) THEN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TD0087 - GOTO endIf WAS REMOVED DUE TO IF STATEMENT INVERSION ***/!!!

      BEGIN
        DROP TABLE TABLE1;
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
      EXCEPTION
        WHEN OTHER THEN
          STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
      END;
    END IF;
    /*.Label endIf*/
    --** SSC-FDM-0027 - REMOVED NEXT STATEMENT, NOT APPLICABLE IN SNOWFLAKE.  **

    BEGIN
      SELECT
        A
      FROM
        TABLE1;
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLROWCOUNT', SQLROWCOUNT);
    EXCEPTION
      WHEN OTHER THEN
        STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
    END;
  END
$$
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0091

Expression converted as cast with possible errors due to missing dependencies.

> **Note:**
>
> Some parts in the output code are omitted for clarity reasons

### Severity

Medium

#### Description

In Teradata scripts, you can use the following syntax to CAST expressions:

```none
<expression> ( <DataType> )
```

Unfortunately, this syntax generates ambiguity when trying to convert a CAST to `DATE` or `TIME` since these keywords also behave as the `CURRENT_DATE` and `CURRENT_TIME` functions respectively.

Thus, without context about the expression to be CAST, there is no sure way to differentiate when we are dealing with an actual case of CAST or a function that accepts DATE or TIME as parameters.

In other words, it is required to know whether `<expression>` is a column or a user-defined function (UDF). To achieve this, when converting the code, one must add the `CREATE TABLE` or `CREATE FUNCTION` from which <expression> is dependant on.

E.g. check the following `SELECT` statement. With no context about `AMBIGUOUS_EXPR`, we have no way to determine if we are dealing with a function call or CAST to `DATE`. However, we do know that `COL1 (DATE)` is indeed a CAST since `COL1` is a column from the table `TAB`.

```none
CREATE TABLE TAB (
    COL1 VARCHAR(23)
)

SELECT
    COL1 (DATE),
    AMBIGUOUS_EXPR (DATE)
FROM TAB;
```

#### Example Code

##### Input Code:

```sql
 CREATE TABLE TAB (
    COL1 VARCHAR(23)
)

SELECT
    COL1 (DATE),
    AMBIGUOUS_EXPR (DATE)
FROM TAB;
```

##### Generated Code

```sql
 CREATE OR REPLACE TABLE TAB (
    COL1 VARCHAR(23)
)
COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"teradata"}}'
;

SELECT
    TO_DATE(
    COL1, 'YYYY/MM/DD') AS COL1,
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0091 - EXPRESSION CONVERTED AS CAST BY DEFAULT. CONVERSION MIGHT PRESENT ERRORS DUE TO MISSING DEPENDENCIES FOR 'AMBIGUOUS_EXPR'. ***/!!!
    AMBIGUOUS_EXPR :: DATE
    FROM
    TAB;
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0092

Translation for Teradata Built-In Table/View is not currently supported

### Severity

Low

#### Description

This EWI is added when SnowConvert AI finds a Teradata system table that is currently not translated.

#### Example Code

##### Input Code:

```sql
 SELECT
  CRLF ||
  TRIM(em.ErrorText) INTO :MsgText
FROM
  DBC.ErrorMsgs em
WHERE
  em.ErrorCode = SUBSTR(:SqlStateCode, 2, 4)
```

##### Generated Code

```sql
 SELECT
  CRLF ||
  TRIM(em.ErrorText) INTO :MsgText
FROM
  !!!RESOLVE EWI!!! /*** SSC-EWI-TD0092 - TRANSLATION FOR TERADATA BUILT-IN TABLE/VIEW DBC.ErrorMsgs IS NOT CURRENTLY SUPPORTED. ***/!!!
  DBC.ErrorMsgs em
WHERE
  UPPER(RTRIM(
  em.ErrorCode)) = UPPER(RTRIM(SUBSTR(:SqlStateCode, 2, 4)));
```

#### Best Practices

* Search in Snowflake’s internal tables, such as `Information_Schema` or `SNOWFLAKE.ACCOUNT_USAGE` for equivalents
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0093

Format not supported and must be updated in all its varchar casting uses.

### Severity

High

#### Description

This EWI is added when the CAST function is used to cast a numeric expression to another numeric type with a specified format. While the format does not impact the numeric value itself, if the result is subsequently cast to a string, the intended format will not be correctly applied. Therefore, it is necessary to update all instances where the result is cast to VARCHAR, ensuring the format defined in the EWI is used.

#### Example Code

##### Input Code:

```sql
SELECT
   CAST(245222.32 AS FORMAT '-(10)9.9(4)') AS FormattedAmount,
   CAST(FormattedAmount AS VARCHAR(30));
```

##### Generated Code

```sql
SELECT
   245222.32 !!!RESOLVE EWI!!! /*** SSC-EWI-TD0093 - FORMAT '-(10)9.9(4)' IS NOT SUPPORTED AND MUST BE UPDATED TO THE FOLLOWING FORMAT 'S9999999999.0000' IN ALL VARCHAR CAST USAGES. ***/!!! AS FormattedAmount,
   LEFT(LTRIM(TO_VARCHAR(FormattedAmount, 'MI0.00000000000000EEEEE')), 10);
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0094

The IMPORT command was not converted.

### Severity

High

#### Description

This issue indicates that an [`.IMPORT`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/IMPORT) command was not converted because it uses unsupported features. The original MLoad layout, DML, and import statements are commented out and each line is annotated with this EWI.

**Features pending translation:**

* `BINARY` format
* `FASTLOAD` format
* `.TABLE` type layout
* [`INMOD`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/IMPORT/INMOD-Specification) option
* `AXSMOD` option
* Non `INSERT-VALUES` DML statements

**Missing required definitions:**

* [`.LAYOUT`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/LAYOUT) definition was not found in the script
* [`.DML LABEL`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/DML-LABEL) was not found in the script

#### Example Code

##### Teradata:

```sql
.LAYOUT employee_layout;
.FIELD employee_id * CHAR(10);
.FIELD first_name * CHAR(50);

.DML LABEL insert_employees;
INSERT INTO employees (employee_id, first_name) VALUES (:employee_id, :first_name);

.IMPORT INFILE employees.dat FORMAT BINARY LAYOUT employee_layout APPLY insert_employees;
```

##### Snowflake Scripting:

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.dat @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- .LAYOUT employee_layout;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- .FIELD employee_id * CHAR(10) ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- .FIELD first_name * CHAR(50) ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- .DML LABEL insert_employees ;
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- INSERT INTO employees (employee_id, first_name) VALUES (:employee_id, :first_name);
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0094 - THE IMPORT COMMAND WAS NOT CONVERTED: BINARY FORMAT IS PENDING TRANSLATION. ***/!!!
    -- .IMPORT INFILE employees.dat FORMAT BINARY LAYOUT employee_layout APPLY insert_employees;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

#### Best Practices

* Convert the source file to a supported format (`VARTEXT`, `TEXT`, or `UNFORMAT`) before running SnowConvert AI.
* Manually rewrite the load using [Snowflake stages](https://docs.snowflake.com/en/user-guide/data-load-overview) and [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0095

DML statement in IMPORT command is pending translation.

### Severity

Medium

#### Description

This issue happens when a `.IMPORT` command uses a DML label that includes statements other than a basic `INSERT ... VALUES` (for example, `UPDATE`, `DELETE`, or more complex `INSERT` logic). In these cases, the converter will only transform the simple `INSERT ... VALUES` part into a [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) statement for Snowflake. Any other DML statements are left in the output with a warning annotation, and are not automatically converted. This means that important logic—like updates or deletes—will not be migrated, which can affect your results. Please review and update your script to handle these cases, such as by using a [`MERGE`](https://docs.snowflake.com/en/sql-reference/sql/merge) statement for upserts.

#### Example Code

##### Teradata:

```sql
.LAYOUT employee_layout;
.FIELD employee_id * VARCHAR(10);
.FIELD first_name * VARCHAR(50);
.FIELD salary * VARCHAR(10);

.DML LABEL upsert_employees;
UPDATE employees SET salary = :salary WHERE employee_id = :employee_id;
INSERT INTO employees (employee_id, first_name, salary) VALUES (:employee_id, :first_name, :salary);

.IMPORT INFILE employees.csv FORMAT VARTEXT ',' LAYOUT employee_layout APPLY upsert_employees;
```

##### Snowflake Scripting:

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://employees.csv @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      !!!RESOLVE EWI!!! /*** SSC-EWI-TD0095 - THE DML 'UPDATE STATEMENT' USED IN THE IMPORT COMMAND IS PENDING TRANSLATION. ***/!!!
      UPDATE employees SET
        salary = :salary WHERE
        employee_id = :employee_id;

      COPY INTO employees (
        employee_id,
        first_name,
        salary
      )
      FROM
      (
        SELECT
          $1,
          $2,
          $3
        FROM
          @sc_import_stage/employees.csv
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

#### Best Practices

* Implement the equivalent upsert logic in Snowflake using [`MERGE`](https://docs.snowflake.com/en/sql-reference/sql/merge).
* Load data into a [staging table](https://docs.snowflake.com/en/user-guide/data-load-overview) first, then merge into the target table.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0096

COPY INTO requires an explicit target file name.

### Severity

Medium

#### Description

When the `.IMPORT INFILE` path consists solely of a bash variable (for example, `${FILE_PATH}`) and no explicit file name can be inferred, this EWI is raised for the [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) source. The converter cannot determine the file name to use in the [stage](https://docs.snowflake.com/en/sql-reference/sql/create-stage) path.

#### Example Code

##### Teradata:

```sql
.LAYOUT employee_layout;
.FIELD employee_id * VARCHAR(10);
.FIELD first_name * VARCHAR(50);

.DML LABEL insert_employees;
INSERT INTO employees (employee_id, first_name) VALUES (:employee_id, :first_name);

.IMPORT INFILE ${FILE_PATH} FORMAT VARTEXT '|' LAYOUT employee_layout APPLY insert_employees;
```

##### Snowflake Scripting:

```sql
--** SSC-FDM-TD0003 - BASH VARIABLES FOUND, USING SNOWSQL WITH VARIABLE SUBSTITUTION ENABLED IS REQUIRED TO RUN THIS SCRIPT **
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://&{FILE_PATH} @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name
      )
      FROM
      (
        SELECT
          $1,
          $2
        FROM
          !!!RESOLVE EWI!!! /*** SSC-EWI-TD0096 - COPY INTO REQUIRES AN EXPLICIT TARGET FILE NAME. ***/!!!
          @sc_import_stage/&{FILE_PATH}
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

#### Best Practices

* Adjust the original [MLoad](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Using-Teradata-MultiLoad) script so that the file name is explicit (separate directory and file name).
* Use a literal file name with variable directory, for example, `.IMPORT INFILE ${DATA_DIR}/employees.csv ...`
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0097

Local variables not supported in PUT or COPY INTO.

### Severity

Medium

#### Description

This issue indicates the use of local MLoad variables, such as `&FILE_NAME`, defined with [`.SET`](https://docs.teradata.com/r/Enterprise_IntelliFlex_Lake_VMware/Teradata-MultiLoad-Reference-20.00/Teradata-MultiLoad-Commands/SET) in `INFILE` paths. These cannot be resolved in the generated [`PUT`](https://docs.snowflake.com/en/sql-reference/sql/put) or [`COPY INTO`](https://docs.snowflake.com/en/sql-reference/sql/copy-into-table) statements because Snowflake’s `PUT` command only supports literal paths or [SnowSQL session variables](https://docs.snowflake.com/en/user-guide/snowsql-use#using-variables) (`&{VAR}`), not Snowflake Scripting variables (`:var`).

#### Example Code

##### Teradata:

```sql
.SET FILE_NAME TO 'employees.csv';

.LAYOUT employee_layout;
.FIELD employee_id * VARCHAR(10);
.FIELD first_name * VARCHAR(50);

.DML LABEL insert_employees;
INSERT INTO employees (employee_id, first_name) VALUES (:employee_id, :first_name);

.IMPORT INFILE &FILE_NAME FORMAT VARTEXT '|' LAYOUT employee_layout APPLY insert_employees;
```

##### Snowflake Scripting:

```sql
CREATE TEMPORARY STAGE IF NOT EXISTS sc_import_stage;

!!!RESOLVE EWI!!! /*** SSC-EWI-TD0097 - LOCAL VARIABLES ARE CURRENTLY NOT SUPPORTED IN THE PUT STATEMENT. ***/!!!
--** SSC-FDM-TD0038 - PUT COMMAND REQUIRES EXECUTION THROUGH SNOWSQL. **
PUT file://&FILE_NAME @sc_import_stage;

EXECUTE IMMEDIATE
$$
  DECLARE
    STATUS_OBJECT OBJECT := OBJECT_CONSTRUCT('SQLCODE', 0);
    FILE_NAME STRING := 'employees.csv';
  BEGIN
    BEGIN
      COPY INTO employees (
        employee_id,
        first_name
      )
      FROM
      (
        SELECT
          $1,
          $2
        FROM
          !!!RESOLVE EWI!!! /*** SSC-EWI-TD0097 - LOCAL VARIABLES ARE CURRENTLY NOT SUPPORTED IN THE COPY INTO STATEMENT. ***/!!!
          @sc_import_stage/&FILE_NAME
      )
      FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = '|')
      ON_ERROR = 'CONTINUE';
    END;
  EXCEPTION
    WHEN OTHER CONTINUE THEN
      STATUS_OBJECT := OBJECT_CONSTRUCT('SQLCODE', SQLCODE, 'SQLERRM', SQLERRM, 'SQLSTATE', SQLSTATE);
  END
$$
```

#### Best Practices

* Replace local variables with bash variables (resolved by [SnowSQL](https://docs.snowflake.com/en/user-guide/snowsql) before execution).
* Alternatively, hard-code the file name directly.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-TD0098

PREPARE with USING clause containing non-variable expressions cannot be automatically migrated.

### Severity

Medium

### Description

This issue is raised when a `PREPARE` statement with an `OPEN ... USING` clause contains non-variable expressions such as function calls, arithmetic operations, or other complex expressions in the USING clause. SnowConvert AI can only automatically migrate USING clauses that contain simple variable references.

In Teradata, the `OPEN cursor USING expr1, expr2` statement allows any expression to be bound to the query’s parameter markers (`?`). However, SnowConvert AI’s transformation to `EXECUTE IMMEDIATE query USING (...)` requires simple variable names to ensure correct binding behavior.

When complex expressions are detected in the USING clause, the PREPARE statement is left untransformed and marked with this EWI for manual review.

### Example Code

#### Teradata:

```sql
REPLACE PROCEDURE fetch_complex_using(OUT result INTEGER)
BEGIN
    DECLARE SQL_string VARCHAR(200) DEFAULT 'SELECT col1 FROM MyTable WHERE col1 = ? AND col2 = ?';
    DECLARE base_value INTEGER DEFAULT 5;

    DECLARE C1 CURSOR FOR S1;
    PREPARE S1 FROM SQL_string;
    -- Using expressions: function call and arithmetic
    OPEN C1 USING UPPER('test'), base_value + 10;
    FETCH C1 INTO result;
    CLOSE C1;
END;
```

#### Snowflake Scripting:

```sql
CREATE OR REPLACE PROCEDURE fetch_complex_using (RESULT OUT INTEGER)
RETURNS VARCHAR
LANGUAGE SQL
EXECUTE AS CALLER
AS
$$
  DECLARE
    SQL_string VARCHAR(200) DEFAULT 'SELECT
   col1 FROM
   MyTable
WHERE col1 = ? AND col2 = ?';
    base_value INTEGER DEFAULT 5;
  BEGIN
    !!!RESOLVE EWI!!! /*** SSC-EWI-TD0098 - PREPARE STATEMENT WITH USING CLAUSE CONTAINING NON-VARIABLE EXPRESSIONS (E.G., FUNCTION CALLS, ARITHMETIC) CANNOT BE AUTOMATICALLY MIGRATED TO EXECUTE IMMEDIATE. MANUAL REVIEW REQUIRED TO PROPERLY BIND EXPRESSIONS. ***/!!!
    PREPARE S1 FROM SQL_string;
    OPEN C1 USING UPPER('test'), base_value + 10;
    FETCH
      C1
    INTO
      result;
    CLOSE C1;
  END;
$$;
```

### Best Practices

* **Extract expressions into variables**: Before the PREPARE statement, assign complex expressions to intermediate variables:

  ```sql
  DECLARE upper_value VARCHAR(50);
  DECLARE calculated_value INTEGER;

  upper_value := UPPER('test');
  calculated_value := base_value + 10;

  PREPARE S1 FROM SQL_string;
  OPEN C1 USING upper_value, calculated_value;
  ```
* **Manually transform to EXECUTE IMMEDIATE**: Convert the PREPARE-cursor pattern to use EXECUTE IMMEDIATE with simple variable references in the USING clause.
* **Test the conversion**: Ensure that the binding behavior matches the original Teradata logic.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Top-Level Code Units Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/top-level-code-units-report.md
section: Migrations
---

# SnowConvert AI - Top-Level Code Units Report

## What is a Top-Level Code Unit?

A Code Unit, as the name suggests, is the most atomic, standalone executable element. In most cases, these are statements, but they also include script files as well because those are executed as a single element.

Some Code Units can be nested inside other Code Units. When there is no other Code Unit above it in a hierarchy of Units, it’s called a Top-Level Code Unit.

## What is an out-of-scope Code Unit?

Out-of-scope Top-Level Code Units are Code Units that are out of the conversion scope of SnowConvert AI. Because of this, these code units are not considered when calculating the conversion rate. Each of these code units will not have a conversion rate (it will appear as `N/A`).

If the input code includes only out-of-scope Code Units, then the Lines of Code conversion rate of the entire migration will be 0%.

The following `CREATE TRIGGER` is considered an out-of-scope Code Unit.

```sql
CREATE OR REPLACE TRIGGER my_trigger
    AFTER
    UPDATE
    ON my_table
    FOR EACH ROW
BEGIN
   NULL;
END;
```

## Examples of Top-Level Code Units

In the following section, we can see some examples of Top-Level Code Units.

### Queries

In the following example, we have here a single SELECT statement. This statement is a single Top-Level Code Unit.

```sql
SELECT * FROM table1;
```

In this example, we have a nested `SELECT` statement nested inside another `SELECT` statement. The entire query counts as a single Top-Level Code Unit.

```sql
SELECT * FROM (SELECT * FROM table1);
```

### Objects

Objects created with a DDL count as a single Top-Level Code Unit, even if it contains other Code Units inside it.

The following statement creates a view with a query. In this case, the entire `CREATE VIEW` counts as a single Top-Level Code Unit.

```sql
CREATE VIEW view1 AS SELECT * FROM table1;
```

The following `CREATE PROCEDURE` statement counts as a single Top-Level Code Unit even if it contains multiple statements inside it.

```sql
CREATE PROCEDURE procedure1
AS
BEGIN
    DELETE FROM table1;
END;
```

### Commands

Independent commands in an SQL file are considered Top-Level Code Units.

A `COMMIT` statement counts as a single Top-Level Code Unit.

```sql
COMMIT;
```

### Package Bodies in Oracle

A package can define multiple elements inside its body. The package body is considered the Top-Level Code Unit because those elements cannot be created individually without creating the entire package body. Elements or code units inside a package body will not count as Top-Level Code Units.

The following code will be reported as a single `CREATE PACKAGE BODY` Code Unit.

```sql
CREATE PACKAGE package_body1 IS
    FUNCTION function1
    RETURN VARCHAR
    IS
    BEGIN
        RETURN 'HELLO'';
    END;
END;
```

### Teradata Script files

Teradata Script files like BTEQ or TPUMP are executed as standalone code units. Because of this, the entire file is considered a single Top-Level Code Unit. Other possible code units inside these files will not count as Top-Level Code Units.

The following BTEQ script file will be reported as a single BTEQ Top-Level Code Unit.

```sql
.LOGON e/fml,notebook
.COMPILE FILE = example.spl;
COMMIT;
CALL samplesp1 (8888, pAmount);
.LOGOFF
```

### Transact SQL batches with GOTO

Each statement of Transact-SQL can be executed independently. In most cases, each of these statements is considered a Top-Level Code Unit. However, when there is a batch that contains a GOTO statement to a label inside the same batch, the statements of the batch cannot be executed independently without ensuring that they work properly. Because of this, statements that are in a batch with a GOTO statement will not count as Top-Level Code Units, only the batch.

The following code example will be reported as a single GOTO/LABEL Code Unit:

```sql
DECLARE @Counter int;
SET @Counter = 1;
WHILE @Counter < 10
BEGIN
    SELECT @Counter
    SET @Counter = @Counter + 1
    IF @Counter = 4 GOTO Branch_One
    IF @Counter = 5 GOTO Branch_Two
END
Branch_One:
    SELECT 'Jumping To Branch One.'
    GOTO Branch_Three;
Branch_Two:
    SELECT 'Jumping To Branch Two.'
Branch_Three:
    SELECT 'Jumping To Branch Three.';
GO
```

## How is the Code Unit methodology represented in other reports?

The Code Unit methodology is also represented in other reports. This section explains how these values are shown or are related to other reports.

### Issues Report

<issues-report.md>

Each row of the Issues Report has some information about the Code Unit that is being impacted by the issue. The columns related to Code Units are the following:

* **Code Unit Database:** This is the Database of the Top-Level Code Unit where the issue was found. It only applies to Code Units that are objects.
* **Code Unit Schema:** This is the Schema of the Top-Level Code Unit where the issue was found. It only applies to Code Units that are objects.
* **Code Unit Package:** This is the Package of the Top-Level Code Unit where the issue was found. It only applies to Code Units that are objects.
* **Code Unit Name:** This is the name Top-Level Code Unit where the issue was found. It only applies to named Code Units like objects. This name is not qualified by database, schema, or package.
* **Code Unit ID:** This is the ID of the Top-Level Code Unit where the issue was found. This name has the name qualified and will add a number for code units with repeated names.
* **Code Unit:** This is the type of the Top-Level Code Unit where the issue was found.
* **Code Unit Size:** This is the size of the Top-Level Code Unit where the issue was found.

### Object References Report and Missing Objects Report

<object-references-report.md>

<missing-objects-report.md>

Each row of the Object References report has information about the Top-Level Code Unit that was referencing another element. These referenced elements may not be Top-Level, so those other values may not be included in the Top-Level Code Units report.

Similarly to the Object References report, the Missing Objects Report has information about the Top-Level Code Unit that was referencing an element that could not be found in the code.

* **Caller Code Unit:** This is the type of the Top-Level Code Unit that is referencing another element.
* **Caller Code Unit Database:** This is the database of the Top-Level Code Unit that is referencing another element.
* **Caller Code Unit Schema:** This is the schema of the Top-Level Code Unit that is referencing another element.
* **Caller Code Unit Name:** This is the name of the Top-Level Code Unit that is referencing another element.
* **Caller Code Unit Full Name:** This is the full name of the Top-Level Code Unit that is referencing another element.

## Information in the Top-Level Code Units Report

| Column | Description |
| --- | --- |
| Partition Key | The unique identifier of the conversion. |
| File Type | The type of the file that the Code Unit is in. (SQL, BTEQ, etc…) |
| Category | The broader class or type each code unit belongs to. |
| Code Unit | The type of Code Unit that this element belongs to. |
| Source Database | The database where the source code unit is located. |
| Source Schema | The schema where the source code unit is located. |
| Source Name | The original name of the source code unit as it appears in the source system. |
| Code Unit Id | The unique identifier of the Code Unit with qualified name and numbering for code units with repeated names. |
| File Name | The name of the file in which the object is located. Uses the relative path starting from the input directory. |
| Line Number | The line number inside the file where the code unit is located. |
| Lines of Code | The total lines of code that the code unit has. |
| EWI Count | The amount of EWIs found within the code unit. You can learn more about EWIs [here](../../../../technical-documentation/issues-and-troubleshooting/conversion-issues/README.md). |
| FDM Count | The amount of FDMs found within the code unit. You can learn more about FDMs [here](../../../../technical-documentation/issues-and-troubleshooting/functional-difference/README.md). |
| PRF Count | The amount of PRFs found within the code unit. You can learn more about PRFs [here](../../../../technical-documentation/issues-and-troubleshooting/performance-review/README.md). |
| Highest EWI Severity | The highest EWI severity found within the code unit. The severity order is the following:   * N/A (when there are not any EWIs) * Low * Medium * High * Critical |
| UDFs Used | The names of all the user defined functions found within the code unit. The name of the UDFs used are separated by a pipe if there is more than one. |
| EWI | The codes of all the EWIs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| FDM | The codes of all the FDMs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| PRF | The codes of all the PRFs found within the code unit. These codes are separated by pipes and do not include repeated codes. |
| Conversion Status | The final status of the conversion of the code unit.  The possible conversion statuses are:   * NotSupported: When the Code Unit has a 0% conversion rate. * Partial: When the conversion rate of the Code Unit is between 0% and 100%. * Success: When the Code Unit conversion rate is 100%. |
| LoC Conversion Percentage | The conversion percentage is based on Lines of Code. A single line of code may have supported and unsupported fragments depending on how the input code was formatted. In these cases, the entire line is considered as not supported. |
| Deployment Order | The deployment order is the topological level of each code unit based on its dependencies. It shows the right order in which the code units should be deployed to avoid missing dependencies during the deployment phase. |
| Language | The programming language or SQL dialect of the source code unit. |

## Example

Assume that the following `CREATE TABLE` in ORACLE SQL is located in its file called table_example.sql.

```sql
CREATE TABLE my_table (
  my_column DATE DEFAULT TO_DATE(CURRENT_DATE, 'J'),
  NOT A VALID COLUMN
);
```

```sql
CREATE OR REPLACE TABLE my_table (
   my_column TIMESTAMP /*** SSC-FDM-OR0042 - DATE TYPE COLUMN HAS A DIFFERENT BEHAVIOR IN SNOWFLAKE. ***/ DEFAULT PUBLIC.JULIAN_TO_GREGORIAN_DATE_UDF(CURRENT_DATE(), 'J')
--                                                                                                                                                                          ,
-- ** SSC-EWI-0001 - UNRECOGNIZED TOKEN ON LINE '3' COLUMN '3' OF THE SOURCE CODE STARTING AT 'NOT'. EXPECTED 'Column Definition' GRAMMAR. LAST MATCHING TOKEN WAS ',' ON LINE '2' COLUMN '52'. FAILED TOKEN WAS 'NOT' ON LINE '3' COLUMN '3'. CODE '15'. **
--  NOT A VALID COLUMN
 )
 COMMENT = '{"origin":"sf_sc","name":"snowconvert","version":{"major":1, "minor":0},"attributes":{"component":"oracle"}}'
 ;
```

The Top-Level Code Units report will have a single entry of the previously shown table.

Here are all the values that would be reported in the entry of this `CREATE TABLE` statement:

* The **Partition Key** value will depend on migration so the value here will vary.
* The **File Type** will be SQL because it was migrated on a file with the .sql extension.
* The **Category** will be TABLE because the `CREATE TABLE` statement is part of the TABLE Code Unit Category.
* The **Code Unit** itself will be `CREATE TABLE`.
* The **File Name** where this code unit was found would be table_example.sql.
* Assuming that the `CREATE TABLE` statement is at the beginning of the file, the **Line Number** will be 1.
* The **Lines of Code** number would be 4.
* The **EWI Count** column will report 1 because the output code has one parsing EWI.
* The **FDM Count** column will report 1 because the output code has an FDM issue related to data types.
* The **PRF Count** will report 0 because there are no PRF issues present in the output code.
* The **Highest EWI Severity** in this case would be “Critical” because this is the severity of the parsing EWI of the example. The other one has a “Low” severity.
* The **UDFs Used** column will be `JULIAN_TO_GREGORIAN_DATE_UDF` because this custom User Defined Function was added to convert the `TO_DATE` function of the input code.
* The **EWI** column will report “SSC-EWI-0001” because this was one of EWIs added in the output code.
* The **FDM** column will report “SSC-FDM-OR0042” because this was one of FDMs added in the output code.
* The **PRF** column will report “N/A” because there are no PRF issues present in the output code.
* The **Conversion Status** will be “Partial” because only some fragments of this Code Unit were able to be migrated without EWIs.
* The **LoC Conversion Percentage** is 50% because out of 4 lines, only 2 were converted successfully.

## Deployment Order

The deployment order column represents the correct order to deploy each code unit into Snowflake.

The following code exemplifies in depth how the deployment order is calculated.

```sql
CREATE TABLE TABLE1 ( -- level 0, no dependencies
   COL1 INT
);

CREATE TABLE TABLE2 ( -- level 0, no dependencies
   COL1 INT
);

CREATE VIEW VIEW1 -- level 4, depends on level-3 objects
AS SELECT * FROM VIEW2, VIEW3;

CREATE VIEW VIEW2 -- level 3, depends on level-2 objects
AS SELECT * FROM VIEW4, VIEW5, VIEW3;

CREATE VIEW VIEW4 -- level 1, depends on level-0 objects
AS SELECT * FROM TABLE1, TABLE2;

CREATE VIEW VIEW5 -- level 1, depends on level-0 objects
AS SELECT * FROM TABLE1;

CREATE VIEW VIEW3 -- level 2, depends on level-1 objects
AS SELECT * FROM VIEW6;

CREATE VIEW VIEW6 -- level 1, depends on level-0 objects
AS SELECT * FROM TABLE2;
```

The deployment order starts with `0`, so code units without any dependencies will start at this level. In the example above, `TABLE1` and `TABLE2` will have a level `0` .

For the next level, we will focus on code units that depend on code units of level `0`. `VIEW4`, `VIEW5`, and `VIEW6` depend directly on `TABLE1` and `TABLE2`, so their level will be `1`.

After identifying all the code units of level `1` , we will focus on code units of level `2`. In that particular scenario, just `VIEW3` depends on `VIEW6` , so `VIEW3` will be level `2`.

Once we identify all code units of level `2`, we will focus on level 3. In the example above, `VIEW2` depends on `VIEW4`, `VIEW5` and `VIEW3`, however, the highest dependency level is `2`, so, `VIEW2` will be of level `3`.

Finally, we got `VIEW1`, which depends on `VIEW2` and `VIEW3`. Since `VIEW2` is the dependency with higher level, `VIEW1` will get level `4`.

After making all the calculations, the top-level code units report will look something like the following table.

| Code Unit Id | Deployment Order |
| --- | --- |
| VIEW1 | 4 |
| VIEW2 | 3 |
| VIEW3 | 2 |
| VIEW4 | 1 |
| VIEW5 | 1 |
| VIEW6 | 1 |
| TABLE1 | 0 |
| TABLE2 | 0 |

### Limitations

There are some scenarios where the deployment order may not calculate the right level for a specific code unit.

#### Code Units with Missing Dependencies

Deployment of code units that depend (directly or indirectly) on missing objects is not possible. Although SnowConvert AI calculates the deployment order as best it can, a missing dependency will cause deployment errors. For code units with missing dependencies, SnowConvert AI adds an asterisk (\*) alongside the deployment order. E.g.

```sql
CREATE TABLE TABLE1 ( -- level 0, no dependencies
  COL1 INT
);

CREATE VIEW VIEW1 -- level 1*, depends on level-0 objects and has a missing dependency
AS SELECT * FROM TABLE1, TABLE2;

CREATE VIEW VIEW2 -- level 2*, depends on level-1* objects
AS SELECT * FROM VIEW1;
```

The example above shows `VIEW1` referencing a missing `TABLE2` and `VIEW2` referencing ,`VIEW1` which indirectly refers `TABLE2` . `VIEW1` has a direct missing reference and `VIEW2` an indirect missing reference. The top-level code units report will look something like the following table.

| Code Unit Id | Deployment Order |
| --- | --- |
| TABLE1 | 0 |
| VIEW1 | 1\* |
| VIEW2 | 2\* |

#### Code Units referencing Database Links (Oracle)

While SnowConvert AI can identify references to Database Links, it cannot get more information about the objects being referenced through the database link. This kind of reference may cause trouble during deployment as well, so it will be handled the same way as missing object references. E.g.

```sql
CREATE DATABASE LINK DBLINK1
CONNECT TO PUBLIC IDENTIFIED BY VALUES ':1'
USING 'TEST';

CREATE MATERIALIZED VIEW VIEW1 REFRESH WITH ROWID
AS SELECT * FROM TABLE1@DBLINK1;
```

`VIEW1` is referencing `TABLE1` through the database link `DBLINK1`. Since we don’t know where `TABLE1` is located, the deployment order of `VIEW1` will be handled like a deployment order with missing dependencies (\*).

| Code Unit Id | Deployment Order |
| --- | --- |
| DBLINK1 | 0 |
| VIEW1 | 1\* |

#### Code Units referencing DDLs defined inside Stored Procedures, Anonymous Blocks, etc

In some scenarios, the deployment order may not be correct because the referenced element was defined inside another code unit. E.g.

```sql
CREATE TABLE TABLE1 (
  COL1 INT
);

CREATE OR REPLACE PROCEDURE PROC1 (param1 NUMBER)
IS
BEGIN
    CREATE VIEW VIEW1
    AS
    SELECT * FROM TABLE1;
END;

CREATE VIEW VIEW2
AS SELECT * FROM VIEW1;
```

In the code above, `VIEW2` references `VIEW1`, which will be created after executing the stored procedure. `VIEW1` references `TABLE1`, so the procedure should be executed after creating the table. In that particular scenario, `VIEW1` will not be included in the top-level code units report since it is contained by the stored procedure. In that case, for `VIEW2` is not possible to know that `VIEW1` depends on PROC1 to be created, and the deployment order may not be correct because of that. The following table shows the deployment order for the code above.

| Code Unit Id | Deployment Order |
| --- | --- |
| TABLE1 | 0 |
| PROC1 | 1 |
| VIEW2 | 1 |

Despite `VIEW1` and `PROC1` having the same deployment order, `VIEW1` will fail if the procedure was not executed first.

> **Warning:**
>
> Deployment Order support for Sequences is going to be delivered in a future version. By default, Code Units referencing sequences are not considering them to calculate the deployment order.

---
title: SnowConvert AI - Training and Support
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/training-and-support.md
section: Migrations
---

# SnowConvert AI - Training and Support

## SnowConvert AI for Developers

We **highly** recommend that you go through the [SnowConvert AI](https://learn.snowflake.com/en/courses/OD-SC-AI/) training in order to get the most out of SnowConvert AI. This training provides participants with the core knowledge to recognize how SnowConvert AI fits into the migration process and the skills to prepare, assess, and execute a code conversion with SnowConvert AI to accelerate their journey to Snowflake.

###

---
title: SnowConvert AI - Transact - Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/etl-bi-repointing/power-bi-transact-repointing.md
section: Migrations
---

# SnowConvert AI - Transact - Power BI Repointing

Applies to

* SQL Server
* Azure Synapse Analytics

## Description

The Power BI repointing is a feature that provides an easy way to redefine the connections from the M language in the Power Query Editor. This means that the connection parameters will be redefined to point to the Snowflake migration database context. For SQL Server and Azure Synapse, the method in M Language that defined the connection is `Sql.Database(...)`. In Snowflake, there is a connector that depends on some other parameters and the main connection is defined by `Snowflake.Database(...)` method.

## Source Pattern Samples

This section will explain the cases currently addressed by SnowConvert AI.

### Simple Entity Repointing Case

Even a simple connection to a table from Power BI requires many transformations to be used with the implicit Power BI connector from Snowflake. In this case, SnowConvert AI adds new information variables such as database and schema, and calls the source table with the implicit type (it can also be a view).

Also, SnowConvert AI generates a mapping between the columns to match the text case with the database migration context or, if possible, with the Power BI report internal information.

**Transact-SQL | Azure Synapse Connection in the Power Query Editor**

```sql
let
    Source = Sql.Database("your_connection", "LibraryDatabase"),
    dbo_Authors = Source{[Schema="dbo",Item="Authors"]}[Data]
in
    dbo_Authors
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
    Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
    SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
    SourceSfSchema = SourceSfDb{[Name="DBO", Kind="Schema"]}[Data],
    SourceSfTbl = SourceSfSchema{[Name="BOOKS", Kind="Table"]}[Data],
    dbo_Books = Table.RenameColumns(SourceSfTbl, {{ "BOOKID", "BookID"}, { "TITLE", "Title"}, { "AUTHORID", "AuthorID"}, { "PUBLICATIONYEAR", "PublicationYear"}})
in
    dbo_Books
```

### Simple Entity With Multiple Lines Repointing Case

In this case “Filtered Rows” is an additional step into the logic of the query. In the repointing version, the additional logic is preserved as it is.

**Transact-SQL | Azure Synapse Connection in the Power Query Editor**

```sql
let
  Source = Sql.Database("your_connection", "mytestdb"),
  dbo_Employee = Source{[Schema="dbo",
  Item="Employee"]}[Data],
  #"Filtered Rows" = Table.SelectRows(dbo_Employee, each Text.StartsWith([name], "John"))
in
  #"Filtered Rows"
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
  Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
  SourceSfDb = Source{[Name=SF_DB_NAME, Kind="Database"]}[Data],
  SourceSfSchema = SourceSfDb{[Name="DBO", Kind="Schema"]}[Data],
  SourceSfTbl = SourceSfSchema{[Name="EMPLOYEE", Kind="Table"]}[Data],
  dbo_Employee = SourceSfTbl,
  #"Filtered Rows" = Table.SelectRows(dbo_Employee, each Text.StartsWith([name], "John"))
in
  #"Filtered Rows"
```

### Embedded SQL Query Repointing Case

For the SQL queries embedded inside the connections, SnowConvert AI will extract, migrate, and re-insert these queries. Warning messages in the migrated queries may require extra attention. In this case, the warning message does not stop the query from being run in the Snowflake database.

**Transact-SQL | Azure Synapse Connection in the Power Query Editor**

```sql
let
    Source = Sql.Database("your_connection", "LibraryDatabase", [Query="SELECT DISTINCT#(lf)    B.Title#(lf)FROM#(lf)    DBO.Books AS B#(lf)JOIN#(lf)    DBO.Authors AS A ON B.AuthorID = A.AuthorID#(lf)JOIN#(lf)    DBO.BookGenres AS BG ON B.BookID = BG.BookID#(lf)JOIN#(lf)    DBO.Genres AS G ON BG.GenreID = G.GenreID#(lf)WHERE#(lf)    A.Nationality = 'American' AND G.Origin = 'USA'#(lf)ORDER BY#(lf)    B.Title;", CreateNavigationProperties=false])
in
    Source
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "--** SSC-FDM-0007 - MISSING DEPENDENT OBJECTS ""DBO.Books"", ""DBO.Authors"", ""DBO.BookGenres"", ""DBO.Genres"" **
SELECT DISTINCT
    B.Title
FROM
    DBO.Books AS B
    JOIN
        DBO.Authors AS A
        ON B.AuthorID = A.AuthorID
    JOIN
        DBO.BookGenres AS BG
        ON B.BookID = BG.BookID
    JOIN
        DBO.Genres AS G
        ON BG.GenreID = G.GenreID
WHERE
    A.Nationality = 'American' AND G.Origin = 'USA'
ORDER BY B.Title", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "TITLE", "Title"}})
in
    Source
```

### Embedded SQL Query With Multiple Lines Repointing Case

This case showcases the connection with SQL queries and multiple lines of logic after the connection logic.

**Transact-SQL | Azure Synapse Connection in the Power Query Editor**

```sql
let
  Source = Sql.Database("your_connection", "mytestdb", [Query="SELECT DISTINCT#(lf)    P.ProductName,#(lf)    P.Category,#(lf)    P.StockQuantity#(lf)FROM#(lf)    Products AS P#(lf)WHERE#(lf)    P.StockQuantity > 0#(lf)ORDER BY#(lf)    P.Category ASC;"]),
  #"Filtered Rows" = Table.SelectRows(Source, each Text.StartsWith([Name], "Cards"))
in
 #"Filtered Rows"
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
  Source = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT ""Products"" **
SELECT DISTINCT
    P.ProductName,
    P.Category,
    P.StockQuantity
FROM
    Products AS P
WHERE
    P.StockQuantity > 0
ORDER BY P.Category ASC", null, [EnableFolding=true]),
  #"Filtered Rows" = Table.SelectRows(Source, each Text.StartsWith([Name], "Cards"))
in
  #"Filtered Rows"
```

### Embedded SQL Query With Column Renaming Repointing Case

At the moment, column renaming for SQL queries cases are only applied if the internal information of the provided Power BI report contains this information.

**Transact-SQL | Azure Synapse Connection in the Power Query Editor**

```sql
let
    Source = Sql.Database("your_connection", "SalesSampleDB", [Query="SELECT DISTINCT#(lf)    P.ProductName,#(lf)    P.Category,#(lf)    P.StockQuantity#(lf)FROM#(lf)    Products AS P#(lf)WHERE#(lf)    P.StockQuantity > 0#(lf)ORDER BY#(lf)    P.Category ASC;"])
in
    Source
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
    SfSource = Value.NativeQuery(Snowflake.Databases(SF_SERVER_LINK,SF_WAREHOUSE_NAME,[Implementation="2.0"]){[Name=SF_DB_NAME]}[Data], "--** SSC-FDM-0007 - MISSING DEPENDENT OBJECT ""Products"" **
SELECT DISTINCT
    P.ProductName,
    P.Category,
    P.StockQuantity
FROM
    Products AS P
WHERE
    P.StockQuantity > 0
ORDER BY P.Category ASC", null, [EnableFolding=true]),
    Source = Table.RenameColumns(SfSource, {{ "PRODUCTNAME", "ProductName"}, { "CATEGORY", "Category"}, { "STOCKQUANTITY", "StockQuantity"}})
in
    Source
```

### Function For Entity Case Repointing Case

Currently, the functions are only supported for entities import case, and Transact only.

**Transact-SQL Connection in the Power Query Editor**

```sql
let
  Source = Sql.Database("your_connection", "mytestdb"),
  dbo_MultiParam = Source{[Schema="dbo",Item="MultiParam"]}[Data],
  #"Invoked Functiondbo_MultiParam1" = dbo_MultiParam(1,"HELLO")
in
  #"Invoked Functiondbo_MultiParam1"
```

**Snowflake SQL Connection in the Power Query Editor**

```sql
let
  Source = Snowflake.Databases(SF_SERVER_LINK, SF_WAREHOUSE_NAME),
  SourceSfDb = Source{[Name="mytestdb, Kind="Database"]}[Data],
  SourceSfFunc = (x, y) => Value.NativeQuery(SourceSfDb, "SELECT DBO.MultiParam(" & Text.From(x) & "," &  (if y = null then null else ("'" & y & "'"))  & ")"),
  dbo_MultiParam = SourceSfFunc,
  #"Invoked Functiondbo_MultiParam1" = dbo_MultiParam(1,"HELLO")
in
  #"Invoked Functiondbo_MultiParam1"
```

---
title: SnowConvert AI - TypeMappings Report
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/review-results/reports/type-mappings-report.md
section: Migrations
---

# SnowConvert AI - TypeMappings Report

## What is the TypeMappings Report?

The TypeMappings report shows the data type transformations that were applied based on your [Data Type Customization](../../../../../translation-references/oracle/basic-elements-of-oracle-sql/data-types/README.md) file. This report only includes transformations specified in the customization file. Use this report to verify that your custom rules were applied correctly to the expected columns and objects.

## Where can I find it?

The TypeMappings report can be found in a folder named *“reports”*, in the output folder of your conversion. The file is named **TypeMappings.csv**.

> **Note:**
>
> This report is generated when data type customization is enabled (using the `--dataTypeCustomizationFile` argument). If no customization file is provided, this report may not be generated or may be empty.

## What information does it contain?

The TypeMappings report is presented in a CSV table format and contains the following columns:

| Column | Description |
| --- | --- |
| ObjectType | The type of object where the data type was found (e.g., `TABLE_COLUMN`, `PROCEDURE_PARAMETER`, `FUNCTION_PARAMETER`, `VARIABLE`). |
| ObjectId | The fully qualified identifier of the object (e.g., `Schema.Table.Column`). |
| FileName | The name of the source file where the data type was found. |
| LineNumber | The line number in the source file where the data type is defined. |
| OriginalType | The original data type in the source code (e.g., `NUMBER(10, 2)`). |
| TargetType | The resulting data type after transformation (e.g., `DECFLOAT`, `NUMBER(18, 2)`). |

## Example Output

Here is an example of what the TypeMappings report might contain:

| ObjectType | ObjectId | FileName | LineNumber | OriginalType | TargetType |
| --- | --- | --- | --- | --- | --- |
| TABLE_COLUMN | SALES.ORDERS.TOTAL_AMOUNT | orders.sql | 15 | NUMBER(15, 2) | DECFLOAT |
| TABLE_COLUMN | SALES.ORDERS.ORDER_ID | orders.sql | 12 | NUMBER(10, 0) | NUMBER(18, 0) |
| TABLE_COLUMN | HR.EMPLOYEES.SALARY | employees.sql | 8 | NUMBER | NUMBER(18, 2) |

## Using the Report

### Verifying Customization Rules

Use this report to verify that your data type customization rules in the JSON configuration file were applied as expected. Compare the `OriginalType` and `TargetType` columns to ensure the transformations match your requirements.

### Identifying Affected Objects

The report helps you identify all database objects affected by data type customizations, making it easier to:

* Review the scope of changes before deployment
* Plan testing strategies for affected tables and procedures
* Document the migration changes for compliance purposes

### UI Integration

In SnowConvert AI’s graphical interface, the TypeMappings report is integrated into the **Code Units Summary** tab of the conversion report. For Oracle conversions using data type customization, you will see a “Data type mappings” section that shows:

* The total count of affected data types
* A link to open the full TypeMappings.csv report

## Related Documentation

* [Data Type Customization](../../../../../translation-references/oracle/basic-elements-of-oracle-sql/data-types/README.md): Learn how to configure data type transformation rules.
* [Oracle CLI Arguments](../../../../user-guide/snowconvert/command-line-interface/oracle.md): Details on the `--dataTypeCustomizationFile` argument.
* [DB2 DECFLOAT](../../../../../translation-references/db2/db2-data-types.md): Information about DECFLOAT transformation in DB2.

---
title: SnowConvert AI - Understanding Converted Code
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/README.md
section: Migrations
---

# SnowConvert AI - Understanding Converted Code

## Definitions

SnowConvert AI produces messages in the converted code that highlight areas requiring additional work to ensure the code functions correctly in Snowflake.

### EWI (Errors, Warnings and Issues)

When SnowConvert AI cannot completely convert a code segment, it generates an Error, Warning, and Issue (EWI) message. Each EWI negatively affects the conversion rate of a code unit. Here are the common reasons why SnowConvert AI might not fully convert code:

* The required conversion rule has not been created yet
* Required dependent code is missing for the conversion rule to work
* An equivalent statement is not available in Snowflake, or a User-Defined Function (UDF) has not been developed to provide the needed functionality

SnowConvert AI adds a `!!!RESOLVE EWI!!!` marker before each EWI (Error, Warning, and Issue) message. This marker causes compilation to fail, ensuring that developers address these issues before deploying the converted code.

The SnowConvert AI team categorizes EWIs (Errors, Warnings, and Issues) into four severity levels based on the average effort required to fix the code:

* Low
* Medium
* High
* Critical

### FDM (Functional Difference Messages)

When converting code from legacy platforms (such as Teradata, Oracle, or SQL Server) to Snowflake, it’s important to understand that these systems have different features and capabilities. Due to these differences, automatic conversion may not provide complete functional equivalence, and manual intervention is often necessary to address these gaps.

A Functional Difference Message (FDM) appears when SnowConvert AI successfully converts code that compiles correctly but may not function exactly like the original source code. This can happen when Snowflake doesn’t support certain features from the source platform. Resolving these differences often requires business decisions or architectural changes beyond simple code conversion. Since FDMs are added as comments to working code, they don’t affect the code’s ability to compile.

### PRF (Performance Review)

These warning messages are added to the converted code to inform users that although the code has been correctly translated, it may not perform optimally in certain situations. To achieve the best performance in Snowflake, some code modifications may be necessary.

### OOS (Out Of Scope)

As explained in the [conversion scope page](../../getting-started/running-snowconvert/review-results/snowconvert-scopes.md) and the out of scope messages section, certain code units cannot be automatically converted. SnowConvert AI will generate messages to inform users when a code unit has not been converted.

---

## New Codes Format

We have updated the format of Error, Warning, and Issue (EWI) messages to be more user-friendly. The new format makes it easier to identify both the message type and the programming language it relates to. Below, we’ll explain these changes with examples.

### General

General Errors, Warnings, and Issues (EWIs) that do not correspond to any specific programming language supported by SnowConvert AI will no longer use the code 1000. Instead, these EWIs will be identified by the absence of a language-specific abbreviation.

#### Example

| New Code | Old Code |
| --- | --- |
| SSC-EWI-0001 | MSCEWI1001 |

### Teradata

Teradata Errors, Warnings, and Issues (EWIs) previously used codes starting with 2000. Now, these codes will begin with ‘TD’ followed by the numeric portion.

#### Example

| New Code | Old Code |
| --- | --- |
| SSC-EWI-TD0001 | MSCEWI2001 |

### Oracle

Oracle Errors, Warnings, and Issues (EWIs) previously used codes starting with 3000. Now, these codes will begin with ‘OR’ followed by the numeric portion of the code.

#### Example

| New Code | Old Code |
| --- | --- |
| SSC-EWI-OR0012 | MSCEWI3012 |

### SQL Server

SQL Server Errors, Warnings, and Issues (EWIs) codes have been updated. Previously, these codes started with 2000. Now, they begin with ‘TS’ followed by a numeric value.

#### Example

| New Code | Old Code |
| --- | --- |
| SSC-EWI-TS0003 | MSCEWI4003 |

## Changed Category To FDM

| Dialect | New code | Old code |
| --- | --- | --- |
| General | SSC-FDM-0001 | MSCINF0001 |
| General | SSC-FDM-0002 | MSCINF0002 |
| General | SSC-FDM-0003 | MSCINF0003 |
| General | SSC-FDM-0004 | MSC-GP0001 |
| General | SSC-FDM-0005 | MSCEWI1096 |
| General | SSC-FDM-0006 | MSCEWI1066 |
| General | SSC-FDM-0007 | MSCEWI1050 |
| General | SSC-FDM-0008 | MSCEWI1093 |
| General | SSC-FDM-0009 | MSCCP0002 |
| General | SSC-FDM-0010 | MSCEWI1044 |
| General | SSC-FDM-0011 | MSCEWI1045 |
| General | SSC-FDM-0012 | MSCEWI1097 |
| General | SSC-FDM-0013 | MSCEWI1008 |
| General | SSC-FDM-0014 | MSCEWI1035 |
| General | SSC-FDM-0015 | MSCEWI1064 |
| General | SSC-FDM-0016 | MSCEWI1076 |
| General | SSC-FDM-0022 | MSCEWI1072 |
| General | SSC-FDM-0023 | SSC-EWI-0049 |
| General | SSC-FDM-0024 | MSCEWI1058 |
| General | SSC-FDM-0026 | MSCEWI1028 |
| General | SSC-FDM-0029 | SSC-EWI-0068 |
| Teradata | SSC-FDM-TD0001 | MSCEWI2013 |
| Teradata | SSC-FDM-TD0002 | MSCEWI2014 |
| Teradata | SSC-FDM-TD0003 | MSCEWI2073 |
| Teradata | SSC-FDM-TD0004 | MSCEWI2074 |
| Teradata | SSC-FDM-TD0005 | MSCEWI2058 |
| Teradata | SSC-FDM-TD0006 | MSCEWI2045 |
| Teradata | SSC-FDM-TD0007 | MSCEWI2018 |
| Teradata | SSC-FDM-TD0008 | MSCEWI2080 |
| Teradata | SSC-FDM-TD0009 | MSCEWI2019 |
| Teradata | SSC-FDM-TD0010 | MSCEWI2042 |
| Teradata | SSC-FDM-TD0011 | MSCEWI2064 |
| Teradata | SSC-FDM-TD0012 | MSCEWI2004 |
| Teradata | SSC-FDM-TD0013 | MSCEWI2075 |
| Teradata | SSC-FDM-TD0014 | MSCEWI2023 |
| Teradata | SSC-FDM-TD0015 | MSCEWI2020 |
| Teradata | SSC-FDM-TD0016 | MSCEWI2021 |
| Teradata | SSC-FDM-TD0018 | MSCEWI2063 |
| Teradata | SSC-FDM-TD0019 | MSCEWI2084 |
| Teradata | SSC-FDM-TD0020 | MSCEWI2062 |
| Teradata | SSC-FDM-TD0021 | MSCEWI2030 |
| Teradata | SSC-FDM-TD0022 | MSCEWI2086 |
| Teradata | SSC-FDM-TD0025 | MSCEWI2054 |
| Teradata | SSC-FDM-TD0026 | SSC-EWI-TD0087 |
| Teradata | SSC-FDM-TD0027 | MSCEWI2061 |
| Teradata | SSC-FDM-TD0028 | MSCEWI2060 |
| Teradata | SSC-FDM-TD0029 | SSC-EWI-TD0055 |
| Teradata | SSC-FDM-TD0030 | SSC-EWI-TD0070 |
| Oracle | SSC-FDM-OR0001 | MSCINF0004 |
| Oracle | SSC-FDM-OR0002 | MSCEWI3068 |
| Oracle | SSC-FDM-OR0003 | MSCEWI3038 |
| Oracle | SSC-FDM-OR0004 | MSCEWI3022 |
| Oracle | SSC-FDM-OR0005 | MSCEWI3025 |
| Oracle | SSC-FDM-OR0006 | MSCEWI3041 |
| Oracle | SSC-FDM-OR0007 | MSCEWI3056 |
| Oracle | SSC-FDM-OR0008 | MSCEWI3071 |
| Oracle | SSC-FDM-OR0009 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0010 | MSCEWI3093 |
| Oracle | SSC-FDM-OR0011 | MSCEWI3066 |
| Oracle | SSC-FDM-OR0012 | MSCEWI3131 |
| Oracle | SSC-FDM-OR0013 | MSCEWI3039 |
| Oracle | SSC-FDM-OR0014 | MSCEWI3002 |
| Oracle | SSC-FDM-OR0015 | MSCEWI3091 |
| Oracle | SSC-FDM-OR0016 | MSCEWI3132 |
| Oracle | SSC-FDM-OR0017 | MSCEWI3017 |
| Oracle | SSC-FDM-OR0018 | MSCEWI3134 |
| Oracle | SSC-FDM-OR0019 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0020 | MSCEWI3051 |
| Oracle | SSC-FDM-OR0021 | MSCEWI3102 |
| Oracle | SSC-FDM-OR0022 | MSCEWI3100 |
| Oracle | SSC-FDM-OR0023 | MSCEWI3099 |
| Oracle | SSC-FDM-OR0024 | MSCEWI3114 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3021 |
| Oracle | SSC-FDM-OR0026 | MSCEWI1065 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3098 |
| Oracle | SSC-FDM-OR0028 | MSCEWI3031 |
| Oracle | SSC-FDM-OR0029 | MSCEWI3059 |
| Oracle | SSC-FDM-OR0030 | MSCEWI3094 |
| Oracle | SSC-FDM-OR0031 | MSCEWI3113 |
| Oracle | SSC-FDM-OR0037 | MSCEWI3004 |
| Oracle | SSC-FDM-OR0038 | SSC-EWI-OR0128 |
| Oracle | SSC-FDM-OR0040 | SSC-EWI-OR0062 |
| Oracle | SSC-FDM-OR0043 | SSC-EWI-OR0005 |
| Oracle | SSC-FDM-OR0044 | SSC-EWI-OR0089 |
| Oracle | SSC-EWI-OR0039 | SSC-FDM-OR0013 |
| Oracle | SSC-FDM-OR0045 | SSC-EWI-OR0006 |
| SQL Server | SSC-FDM-TS0001 | MSCEWI4005 |
| SQL Server | SSC-FDM-TS0002 | MSCEWI4004 |
| SQL Server | SSC-FDM-TS0003 | MSCEWI4064 |
| SQL Server | SSC-FDM-TS0004 | MSCEWI4022 |
| SQL Server | SSC-FDM-TS0005 | MSCEWI4074 |
| SQL Server | SSC-FDM-TS0006 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0007 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0008 | MSCEWI4065 |
| SQL Server | SSC-FDM-TS0009 | MSCEWI4003 |
| SQL Server | SSC-FDM-TS0010 | MSCEWI4069 |
| SQL Server | SSC-FDM-TS0011 | MSCEWI1088 |
| SQL Server | SSC-FDM-TS0017 | SSC-EWI-TS0071 |
| SQL Server | SSC-FDM-TS0020 | SSC-EWI-TS0055 |

| Dialect | New code | Old code |
| --- | --- | --- |
| General | SSC-FDM-0001 | MSCINF0001 |
| General | SSC-FDM-0002 | MSCINF0002 |
| General | SSC-FDM-0003 | MSCINF0003 |
| General | SSC-FDM-0004 | MSC-GP0001 |
| General | SSC-FDM-0005 | MSCEWI1096 |
| General | SSC-FDM-0006 | MSCEWI1066 |
| General | SSC-FDM-0007 | MSCEWI1050 |
| General | SSC-FDM-OR0039 | MSCEWI1057 |
| General | SSC-FDM-0008 | MSCEWI1093 |
| General | SSC-FDM-0009 | MSCCP0002 |
| General | SSC-FDM-0010 | MSCEWI1044 |
| General | SSC-FDM-0011 | MSCEWI1045 |
| General | SSC-FDM-0012 | MSCEWI1097 |
| General | SSC-FDM-0013 | MSCEWI1008 |
| General | SSC-FDM-0014 | MSCEWI1035 |
| General | SSC-FDM-0015 | MSCEWI1064 |
| General | SSC-FDM-0016 | MSCEWI1076 |
| General | SSC-FDM-0022 | MSCEWI1072 |
| General | SSC-FDM-0024 | MSCEWI1058 |
| General | SSC-FDM-0026 | MSCEWI1028 |
| Teradata | SSC-FDM-TD0001 | MSCEWI2013 |
| Teradata | SSC-FDM-TD0002 | MSCEWI2014 |
| Teradata | SSC-FDM-TD0003 | MSCEWI2073 |
| Teradata | SSC-FDM-TD0004 | MSCEWI2074 |
| Teradata | SSC-FDM-TD0005 | MSCEWI2058 |
| Teradata | SSC-FDM-TD0006 | MSCEWI2045 |
| Teradata | SSC-FDM-TD0007 | MSCEWI2018 |
| Teradata | SSC-FDM-TD0008 | MSCEWI2080 |
| Teradata | SSC-FDM-TD0009 | MSCEWI2019 |
| Teradata | SSC-FDM-TD0010 | MSCEWI2042 |
| Teradata | SSC-FDM-TD0011 | MSCEWI2064 |
| Teradata | SSC-FDM-TD0012 | MSCEWI2004 |
| Teradata | SSC-FDM-TD0013 | MSCEWI2075 |
| Teradata | SSC-FDM-TD0014 | MSCEWI2023 |
| Teradata | SSC-FDM-TD0015 | MSCEWI2020 |
| Teradata | SSC-FDM-TD0016 | MSCEWI2021 |
| Teradata | SSC-FDM-TD0018 | MSCEWI2063 |
| Teradata | SSC-FDM-TD0019 | MSCEWI2084 |
| Teradata | SSC-FDM-TD0020 | MSCEWI2062 |
| Teradata | SSC-FDM-TD0021 | MSCEWI2030 |
| Teradata | SSC-FDM-TD0022 | MSCEWI2086 |
| Teradata | SSC-FDM-TD0025 | MSCEWI2054 |
| Teradata | SSC-FDM-TD0027 | MSCEWI2061 |
| Teradata | SSC-FDM-TD0028 | MSCEWI2060 |
| Oracle | SSC-FDM-OR0001 | MSCINF0004 |
| Oracle | SSC-FDM-OR0002 | MSCEWI3068 |
| Oracle | SSC-FDM-OR0003 | MSCEWI3038 |
| Oracle | SSC-FDM-OR0004 | MSCEWI3022 |
| Oracle | SSC-FDM-OR0005 | MSCEWI3025 |
| Oracle | SSC-FDM-OR0006 | MSCEWI3041 |
| Oracle | SSC-FDM-OR0007 | MSCEWI3056 |
| Oracle | SSC-FDM-OR0008 | MSCEWI3071 |
| Oracle | SSC-FDM-OR0009 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0010 | MSCEWI3093 |
| Oracle | SSC-FDM-OR0011 | MSCEWI3066 |
| Oracle | SSC-FDM-OR0012 | MSCEWI3131 |
| Oracle | SSC-FDM-OR0013 | MSCEWI3039 |
| Oracle | SSC-FDM-OR0014 | MSCEWI3002 |
| Oracle | SSC-FDM-OR0015 | MSCEWI3091 |
| Oracle | SSC-FDM-OR0016 | MSCEWI3132 |
| Oracle | SSC-FDM-OR0017 | MSCEWI3017 |
| Oracle | SSC-FDM-OR0018 | MSCEWI3134 |
| Oracle | SSC-FDM-OR0019 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0020 | MSCEWI3051 |
| Oracle | SSC-FDM-OR0021 | MSCEWI3102 |
| Oracle | SSC-FDM-OR0022 | MSCEWI3100 |
| Oracle | SSC-FDM-OR0023 | MSCEWI3099 |
| Oracle | SSC-FDM-OR0024 | MSCEWI3114 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3021 |
| Oracle | SSC-FDM-OR0026 | MSCEWI1065 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3098 |
| Oracle | SSC-FDM-OR0027 | MSCEWI3029 |
| Oracle | SSC-FDM-OR0028 | MSCEWI3031 |
| Oracle | SSC-FDM-OR0029 | MSCEWI3059 |
| Oracle | SSC-FDM-OR0030 | MSCEWI3094 |
| Oracle | SSC-FDM-OR0031 | MSCEWI3113 |
| Oracle | SSC-FDM-OR0038 | SSC-EWI-OR0128 |
| Oracle | SSC-FDM-OR0041 | MSCEWI307 |
| Oracle | SSC-FDM-OR0040 | SSC-EWI-OR0062 |
| SQL Server | SSC-FDM-TS0001 | MSCEWI4005 |
| SQL Server | SSC-FDM-TS0002 | MSCEWI4004 |
| SQL Server | SSC-FDM-TS0003 | MSCEWI4064 |
| SQL Server | SSC-FDM-TS0004 | MSCEWI4022 |
| SQL Server | SSC-FDM-TS0005 | MSCEWI4074 |
| SQL Server | SSC-FDM-TS0006 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0007 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0008 | MSCEWI4065 |
| SQL Server | SSC-FDM-TS0009 | MSCEWI4003 |
| SQL Server | SSC-FDM-TS0010 | MSCEWI4069 |
| SQL Server | SSC-FDM-TS0011 | MSCEWI1088 |
| SQL Server | SSC-FDM-TS0017 | SSC-EWI-TS0071 |
| SQL Server | SSC-FDM-TS0020 | SSC-EWI-TS0055 |

## Changed Category To PRF

| Dialect | New code | Old code |
| --- | --- | --- |
| General | SSC-PRF-0001 | MSCCP0005 |
| General | SSC-PRF-0002 | MSCCP0007 |
| General | SSC-PRF-0003 | MSCCP0006 |
| General | SSC-PRF-0004 | MSCCP0003 |
| General | SSC-PRF-0005 | MSCCP0010 |
| General | SSC-PRF-0006 | MSCCP0011 |
| Teradata | SSC-PRF-TD0001 | MSCEWI2008 |
| SQL Server | SSC-PRF-TS0001 | MSCEWI4007 |

## Deprecated EWIs

| Dialect | Code |
| --- | --- |
| General | MSCEWI1008 |
| General | MSCEWI1016 |
| General | MSCEWI1017 |
| General | MSCEWI1019 |
| General | MSCEWI1029 |
| General | MSCEWI1055 |
| General | MSCEWI1037 |
| General | MSCEWI1042 |
| General | MSCEWI1043 |
| General | MSCEWI1048 |
| General | MSCEWI1059 |
| General | MSCEWI1069 |
| General | MSCEWI1074 |
| General | MSCEWI1079 |
| General | MSCEWI1081 |
| General | MSCEWI1082 |
| General | MSCEWI1083 |
| General | MSCEWI1085 |
| General | MSCEWI1087 |
| General | MSCEWI1089 |
| General | MSCEWI1090 |
| General | MSCEWI1091 |
| General | MSCEWI1097 |
| General | MSCEWI1098 |
| General | MSCEWI1099 |
| General | MSCEWI1108 |
| General | MSCINF0001 |
| General | MSCINF0002 |
| General | MSCINF0003 |
| Teradata | MSCEWI2002 |
| Teradata | MSCEWI2006 |
| Teradata | MSCEWI2007 |
| Teradata | MSCEWI2016 |
| Teradata | MSCEWI2018 |
| Teradata | MSCEWI2026 |
| Teradata | MSCEWI2028 |
| Teradata | MSCEWI2032 |
| Teradata | MSCEWI2033 |
| Teradata | MSCEWI2038 |
| Teradata | MSCEWI2044 |
| Teradata | MSCEWI2047 |
| Teradata | MSCEWI2050 |
| Teradata | MSCEWI2056 |
| Teradata | MSCEWI2065 |
| Teradata | MSCEWI2078 |
| Teradata | MSCEWI2081 |
| Teradata | MSCEWI2085 |
| Teradata | MSCEWI2088 |
| Teradata | MSCEWI2089 |
| Teradata | MSCEWI2090 |
| Oracle | MSCEWI3003 |
| Oracle | MSCEWI3007 |
| Oracle | MSCEWI3015 |
| Oracle | MSCEWI3019 |
| Oracle | MSCEWI3024 |
| Oracle | MSCEWI3027 |
| Oracle | MSCEWI3028 |
| Oracle | MSCEWI3037 |
| Oracle | MSCEWI3043 |
| Oracle | MSCEWI3044 |
| Oracle | MSCEWI3054 |
| Oracle | MSCEWI3058 |
| Oracle | MSCEWI3061 |
| Oracle | MSCEWI3063 |
| Oracle | MSCEWI3064 |
| Oracle | MSCEWI3065 |
| Oracle | MSCEWI3077 |
| Oracle | MSCEWI3083 |
| Oracle | MSCEWI3084 |
| Oracle | MSCEWI3085 |
| Oracle | MSCEWI3088 |
| Oracle | MSCEWI3096 |
| Oracle | MSCEWI3117 |
| Oracle | MSCEWI3119 |
| Oracle | MSCEWI3122 |
| Oracle | MSCEWI3125 |
| Oracle | SSC-EWI-OR0130 |
| SQL Server | MSCEWI4002 |
| SQL Server | MSCEWI4006 |
| SQL Server | MSCEWI4008 |
| SQL Server | MSCEWI4011 |
| SQL Server | MSCEWI4012 |
| SQL Server | MSCEWI4014 |
| SQL Server | MSCEWI4018 |
| SQL Server | MSCEWI4019 |
| SQL Server | MSCEWI4020 |
| SQL Server | MSCEWI4026 |
| SQL Server | MSCEWI4028 |
| SQL Server | MSCEWI4030 |
| SQL Server | MSCEWI4040 |
| SQL Server | MSCEWI4042 |
| SQL Server | MSCEWI4050 |
| SQL Server | MSCEWI4052 |
| SQL Server | MSCEWI4054 |
| SQL Server | MSCEWI4056 |
| SQL Server | MSCEWI4068 |
| SQL Server | SSC-EWI-TS0048 |

| Dialect | New code | Old code |
| --- | --- | --- |
| General | SSC-FDM-0001 | MSCINF0001 |
| General | SSC-FDM-0002 | MSCINF0002 |
| General | SSC-FDM-0003 | MSCINF0003 |
| General | SSC-FDM-0004 | MSC-GP0001 |
| General | SSC-FDM-0005 | MSCEWI1096 |
| General | SSC-FDM-0006 | MSCEWI1066 |
| General | SSC-FDM-0007 | MSCEWI1050 |
| General | SSC-FDM-OR0039 | MSCEWI1057 |
| General | SSC-FDM-0008 | MSCEWI1093 |
| General | SSC-FDM-0009 | MSCCP0002 |
| General | SSC-FDM-0010 | MSCEWI1044 |
| General | SSC-FDM-0011 | MSCEWI1045 |
| General | SSC-FDM-0012 | MSCEWI1097 |
| General | SSC-FDM-0013 | MSCEWI1008 |
| General | SSC-FDM-0014 | MSCEWI1035 |
| General | SSC-FDM-0015 | MSCEWI1064 |
| General | SSC-FDM-0016 | MSCEWI1076 |
| General | SSC-FDM-0022 | MSCEWI1072 |
| General | SSC-FDM-0023 | SSC-EWI-0049 |
| General | SSC-FDM-0024 | MSCEWI1058 |
| General | SSC-FDM-0026 | MSCEWI1028 |
| General | SSC-FDM-0029 | SSC-EWI-0068 |
| Teradata | SSC-FDM-TD0001 | MSCEWI2013 |
| Teradata | SSC-FDM-TD0002 | MSCEWI2014 |
| Teradata | SSC-FDM-TD0003 | MSCEWI2073 |
| Teradata | SSC-FDM-TD0004 | MSCEWI2074 |
| Teradata | SSC-FDM-TD0005 | MSCEWI2058 |
| Teradata | SSC-FDM-TD0006 | MSCEWI2045 |
| Teradata | SSC-FDM-TD0007 | MSCEWI2018 |
| Teradata | SSC-FDM-TD0008 | MSCEWI2080 |
| Teradata | SSC-FDM-TD0009 | MSCEWI2019 |
| Teradata | SSC-FDM-TD0010 | MSCEWI2042 |
| Teradata | SSC-FDM-TD0011 | MSCEWI2064 |
| Teradata | SSC-FDM-TD0013 | MSCEWI2075 |
| Teradata | SSC-FDM-TD0014 | MSCEWI2023 |
| Teradata | SSC-FDM-TD0016 | MSCEWI2021 |
| Teradata | SSC-FDM-TD0019 | MSCEWI2084 |
| Teradata | SSC-FDM-TD0020 | MSCEWI2062 |
| Teradata | SSC-FDM-TD0021 | MSCEWI2030 |
| Teradata | SSC-FDM-TD0022 | MSCEWI2086 |
| Teradata | SSC-FDM-TD0025 | MSCEWI2054 |
| Oracle | SSC-FDM-OR0001 | MSCINF0004 |
| Oracle | SSC-FDM-OR0004 | MSCEWI3022 |
| Oracle | SSC-FDM-OR0005 | MSCEWI3025 |
| Oracle | SSC-FDM-OR0006 | MSCEWI3041 |
| Oracle | SSC-FDM-OR0007 | MSCEWI3056 |
| Oracle | SSC-FDM-OR0009 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0010 | MSCEWI3093 |
| Oracle | SSC-FDM-OR0011 | MSCEWI3066 |
| Oracle | SSC-FDM-OR0012 | MSCEWI3131 |
| Oracle | SSC-FDM-OR0013 | MSCEWI3039 |
| Oracle | SSC-FDM-OR0014 | MSCEWI3002 |
| Oracle | SSC-FDM-OR0015 | MSCEWI3091 |
| Oracle | SSC-FDM-OR0016 | MSCEWI3132 |
| Oracle | SSC-FDM-OR0017 | MSCEWI3017 |
| Oracle | SSC-FDM-OR0018 | MSCEWI3134 |
| Oracle | SSC-FDM-OR0019 | MSCEWI3086 |
| Oracle | SSC-FDM-OR0020 | MSCEWI3051 |
| Oracle | SSC-FDM-OR0021 | MSCEWI3102 |
| Oracle | SSC-FDM-OR0022 | MSCEWI3100 |
| Oracle | SSC-FDM-OR0023 | MSCEWI3099 |
| Oracle | SSC-FDM-OR0024 | MSCEWI3114 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3021 |
| Oracle | SSC-FDM-OR0026 | MSCEWI1065 |
| Oracle | SSC-FDM-OR0025 | MSCEWI3098 |
| Oracle | SSC-FDM-OR0027 | MSCEWI3029 |
| Oracle | SSC-FDM-OR0029 | MSCEWI3059 |
| Oracle | SSC-FDM-OR0030 | MSCEWI3094 |
| Oracle | SSC-FDM-OR0031 | MSCEWI3113 |
| Oracle | SSC-FDM-OR0037 | MSCEWI3004 |
| Oracle | SSC-FDM-OR0041 | MSCEWI307 |
| Oracle | SSC-FDM-OR0040 | SSC-EWI-OR0062 |
| SQL Server | SSC-FDM-TS0001 | MSCEWI4005 |
| SQL Server | SSC-FDM-TS0002 | MSCEWI4004 |
| SQL Server | SSC-FDM-TS0003 | MSCEWI4064 |
| SQL Server | SSC-FDM-TS0004 | MSCEWI4022 |
| SQL Server | SSC-FDM-TS0005 | MSCEWI4074 |
| SQL Server | SSC-FDM-TS0006 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0007 | MSCEWI4066 |
| SQL Server | SSC-FDM-TS0008 | MSCEWI4065 |
| SQL Server | SSC-FDM-TS0009 | MSCEWI4003 |
| SQL Server | SSC-FDM-TS0010 | MSCEWI4069 |
| SQL Server | SSC-FDM-TS0011 | MSCEWI1088 |
| SQL Server | SSC-FDM-TS0017 | SSC-EWI-TS0071 |
| SQL Server | SSC-FDM-TS0019 | SSC-EWI-TS0047 |
| SQL Server | SSC-FDM-TS0020 | SSC-EWI-TS0055 |
| SQL Server | SSC-FDM-TS0021 | SSC-EWI-TS0056 |
| SQL Server | SSC-FDM-TS0022 | SSC-EWI-TS0057 |
| SQL Server | SSC-FDM-TS0023 | SSC-EWI-TS0073 |

---
title: SnowConvert AI - Validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/validation/README.md
section: Migrations
---

# SnowConvert AI - Validation

The Scope Validator step checks if the entry code meets the basic requirements to execute a successful conversion. These requirements are:

* Valid extension file
* Valid file encoding
* The entry code is extracted
* Valid entry file format
* Valid files and folder naming
* Valid Comments

> **Note:**
>
> If one of these validations is not met, it does not mean that the conversion can not be executed. These validations only warn that something is not okay with the entry code, and thus, something could be wrong with the migration results.

## How to execute the validation

The Scope Validator is a step that starts once the configuration for the [Conversion](../conversion/README.md) process is done. It means that it is always executed.

### View validation results

Once the validation step finishes, a window with information about which validations failed is displayed:

In this case, the validation that failed is to check if the entry code was extracted or not. Please visit their sections to get more information about each validation.

Also, a report called *FilesOutOfScope* is generated inside a folder called *Scope Validations* inside the output folder.

```none
Output Folder > Reports > SnowConvert AI > Scope Validations > ScopeValidation.{timeStamp}.csv
```

> **Note:**
>
> The Out of scope report link could redirect you to this report.

This report enumerates all the invalid files with their respective paths/names, sizes, and reasons. E.g.:

| File Path/Name | File Size | Reason |
| --- | --- | --- |
| file1.bat | 40.0 B | File Type |

The last one was an example of an invalid file because of the extension.

### How to proceed

As you can see in the *Scope Validator Window With Results* image, there are two options: one is to continue with the Assessment or Conversion Process, and the other is to cancel it. The recommendation is to cancel the process, check the warnings, and try to resolve them. But if you decide to continue with the conversion, you will also be able to find the information related to the Scope Validation in the Assessment.csv report and the AssessmentReport.docx. In the Assessment.csv, you will find the following fields: the total validated files, total invalid files, total of files with wrong encoding, total of files with wrong extension, total of files with invalid naming, and an out-of-scope percentage (a calculation of the pending work to do to have a valid entry code). In the docx report, you will find a section with the same information that is in the FilesOutOfScope report.

---
title: SnowConvert AI - Vertica
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/getting-started/running-snowconvert/supported-languages/vertica.md
section: Migrations
---

# SnowConvert AI - Vertica

## What is SnowConvert AI for Vertica?

SnowConvert AI is a software tool that understands SQL Vertica scripts and converts this source code into functionally equivalent Snowflake code.

## Conversion Types

Specifically, SnowConvert AI for Vertica performs the following conversions:

### Vertica to Snowflake SQL

SnowConvert AI recognizes the Vertica source code and converts the different statements into the appropriate SQL for the Snowflake target.

### Sample code

#### Input Code:

```sql
CREATE TABLE data_types_conversion (
    int8_col INT8,
    long_varbinary_col LONG VARBINARY(65000),
    uuid_col UUID
);
```

#### Output Code:

```sql
CREATE TABLE data_types_conversion (
    int8_col INTEGER,
    long_varbinary_col VARBINARY(65000),
    uuid_col VARCHAR(36)
);
```

As you can see, most of the structure remains the same, but some column properties have to be transformed into Snowflake equivalents. For more information please refer to [Vertica Translation References documentation.](../../../../translation-references/vertica/README.md)

### SnowConvert AI Terminology

Before we get lost in the magic of these code conversions, here are a few terms/definitions so you know what we mean when we start dropping them all over the documentation:

* *SQL (Structured Query Language):* the standard language for storing, manipulating, and retrieving data in most modern database architectures.
* *SnowConvert AI*: the software that converts securely and automatically your Vertica files to the Snowflake cloud data platform.
* *Conversion rule* or *transformation rule:* rules that allow SnowConvert AI to convert from a portion of source code to the expected target code.
* *Parse:* parse or parsing is an initial process done by SnowConvert AI to understand the source code and build up an internal data structure required for executing the conversion rules.

In the next few pages, you’ll learn more about the kind of conversions that SnowConvert AI for Vertica is capable of. If you’re ready to get started, visit the [**Getting Started**](../../README.md) page in this documentation.

---
title: SnowConvert AI - Vertica
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/README.md
section: Migrations
---

# SnowConvert AI - Vertica

> **Conversion Scope:**
>
> SnowConvert AI for Vertica currently supports assessment and translation for TABLES and VIEWS. Although SnowConvert AI can recognize other types of statements, they are not fully supported.

This page provides a comprehensive reference for how SnowConvert AI translates Vertica grammar elements to Snowflake equivalents. In this translation reference, you will find code examples, functional equivalence results, key differences, recommendations, known issues, and descriptions of each transformation.

---
title: SnowConvert AI - Vertica - Built-in functions
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-built-in-functions.md
section: Migrations
---

# SnowConvert AI - Vertica - Built-in functions

Functions return information from the database. This section describes functions that Vertica supports. Except for meta-functions, you can use a function anywhere an expression is allowed. ([Vertica SQL Language Reference built-in functions](https://docs.vertica.com/24.3.x/en/sql-reference/functions/)).

> **Note:**
>
> For more information about built-in functions and their Snowflake equivalents, also see [Common built-in functions](../general/built-in-functions.md).

---
title: SnowConvert AI - Vertica - CREATE TABLE
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-create-table.md
section: Migrations
---

# SnowConvert AI - Vertica - CREATE TABLE

## Description

Creates a table in the logical schema. ([Vertica SQL Language Reference Create Table](https://docs.vertica.com/23.3.x/en/sql-reference/statements/create-statements/create-table/)).

> **Warning:**
>
> This syntax is partially supported in Snowflake. Translation pending for these clauses:

```sql
DISK_QUOTA quota
SET USING expression
ENCODING encoding-type
ACCESSRANK integer
```

## Grammar Syntax

```sql
CREATE TABLE [ IF NOT EXISTS ] [[database.]schema.]table
   ( column-definition[,...] [, table-constraint [,...]] )
   [ ORDER BY column[,...] ]
   [ segmentation-spec ]
   [ KSAFE [safety] ]
   [ partition-clause]
   [ {INCLUDE | EXCLUDE} [SCHEMA] PRIVILEGES ]
   [ DISK_QUOTA quota ]

<column-definition> ::=
column-name data-type
    [ column-constraint ][...]
    [ ENCODING encoding-type ]
    [ ACCESSRANK integer ]

<column-constraint> ::=
[ { AUTO_INCREMENT | IDENTITY } [ (args) ] ]
[ CONSTRAINT constraint-name ] {
   [ CHECK (expression) [ ENABLED | DISABLED ] ]
   [ [ DEFAULT expression ] [ SET USING expression } | DEFAULT USING expression ]
   [ NULL | NOT NULL ]
   [ { PRIMARY KEY [ ENABLED | DISABLED ] REFERENCES table [( column )] } ]
   [ UNIQUE [ ENABLED | DISABLED ] ]
}

<table-constraint>::=
[ CONSTRAINT constraint-name ]
{
... PRIMARY KEY (column[,... ]) [ ENABLED | DISABLED ]
... | FOREIGN KEY (column[,... ] ) REFERENCES table [ (column[,...]) ]
... | UNIQUE (column[,...]) [ ENABLED | DISABLED ]
... | CHECK (expression) [ ENABLED | DISABLED ]
}
```

## Tables Options

### Order By

In Vertica, this `ORDER BY` clause specifies how data is physically sorted within a **superprojection**, an optimized storage structure for a table. This explicit physical ordering at table creation is not directly supported in Snowflake. For more information please refer to [SSC-EWI-VT0002.](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md)

#### Sample Source

##### Vertica

```sql
CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
ORDER BY measurement_date, business_unit, metric_category;
```

##### Snowflake

```sql
CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
!!!RESOLVE EWI!!! /*** SSC-EWI-VT0002 - ORDER BY TABLE OPTION IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
ORDER BY measurement_date, business_unit, metric_category
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

### Projections Clauses

Vertica’s projections are a mechanism to define and maintain the physical sort order of data on disk, thereby optimizing query performance for specific access patterns. Snowflake, however, utilizes a fundamentally different storage and optimization strategy. Data in Snowflake is automatically broken down into immutable **micro-partitions**, which are then organized and managed by the cloud service.

While an inherent order might exist within these micro-partitions due to insertion or the application of **clustering keys**, Snowflake’s query optimizer and its underlying architecture are designed to efficiently prune these micro-partitions during query execution, regardless of a pre-defined global sort order. This approach, combined with automatic caching and a columnar storage format, allows Snowflake to achieve high performance without requiring users to manually define and manage physical data structures like Vertica’s projections, thus simplifying data management and optimizing for a broader range of query patterns without explicit physical sort definitions.

Due to these reasons, the following clauses aren’t necessary in Snowflake and are removed from the original code:

```sql
[ segmentation-spec ]
[ KSAFE [safety] ]
[ partition-clause]
```

### Inherited Schema Privileges Clause

`INCLUDE SCHEMA PRIVILEGES` is a Vertica-specific feature that governs how privileges are inherited, in this case, potentially from the schema level. Snowflake does not have a direct equivalent for this clause within its `CREATE TABLE` syntax. Privileges in Snowflake are managed explicitly through `GRANT` statements.

> **Warning:**
>
> This syntax is not supported in Snowflake.

#### Sample Source

##### Vertica

```sql
CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
INCLUDE SCHEMA PRIVILEGES;
```

##### Snowflake

```sql
CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
!!!RESOLVE EWI!!! /*** SSC-EWI-VT0001 - INHERITED PRIVILEGES CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
INCLUDE SCHEMA PRIVILEGES
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

## Constraints

### IDENTITY - AUTO_INCREMENT

Creates a table column whose values are automatically generated by and managed by the database. You cannot change or load values in this column. You can set this constraint on only one table column.

> **Success:**
>
> This syntax is fully supported in Snowflake.

#### Sample Source

##### Vertica

```sql
CREATE TABLE customers (
  id AUTO_INCREMENT(1, 2),
  name VARCHAR(50)
);

CREATE TABLE customers2 (
  id IDENTITY(1, 2),
  name VARCHAR(50)
);
```

##### Snowflake

```sql
CREATE TABLE customers (
  id INT AUTOINCREMENT(1, 2) ORDER,
  name VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';

CREATE TABLE customers2 (
  id INT IDENTITY(1, 2) ORDER,
  name VARCHAR(50)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

### CHECK Constraint

The `CHECK` clause in Vertica requires new or updated rows to satisfy a Boolean expression. Snowflake doesn’t have an equivalent to this clause; therefore, SnowConvert AI will add an EWI. This will be applied as a `CHECK` attribute or table constraint in the converted code.

> **Danger:**
>
> This syntax is not supported in Snowflake.

#### Sample Source

##### Vertica

```sql
CREATE TABLE table1 (
    product_id INT PRIMARY KEY,
    quantity INT CHECK (quantity >= 0)
);
```

##### Snowflake

```sql
CREATE TABLE table1 (
    product_id INT PRIMARY KEY,
    quantity INT
                 !!!RESOLVE EWI!!! /*** SSC-EWI-0035 - CHECK STATEMENT NOT SUPPORTED ***/!!! CHECK (quantity >= 0)
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

### DEFAULT Constraint

> **Warning:**
>
> This syntax is partially supported in Snowflake.

The basic `DEFAULT` clause from Vertica is fully supported and translates directly to Snowflake. For Vertica’s `DEFAULT USING` clause, however, the translation is partial. Snowflake will correctly apply the `DEFAULT` value when new rows are inserted, but the deferred refresh capability from the `USING` portion has no direct equivalent and some expressions might not be supported in Snowflake. Therefore, a warning is added to highlight this functional difference.

#### Sample Source

##### Vertica

```sql
CREATE TABLE table1 (
    base_value INT,
    status_code INT DEFAULT 0,
    derived_value INT DEFAULT USING (base_value + 100)
);
```

##### Snowflake

```sql
CREATE TABLE table1 (
    base_value INT,
    status_code INT DEFAULT 0,
    derived_value INT DEFAULT (base_value + 100) /*** SSC-FDM-VT0001 - EXPRESSION IN USING CONSTRAINT MIGHT NOT BE SUPPORTED IN SNOWFLAKE ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

### PRIMARY KEY - UNIQUE - FOREIGN KEY

SnowConvert AI keeps the constraint definitions; however, in Snowflake, these properties are provided to facilitate migrating from other databases. They are not enforced or maintained by Snowflake. This means that the defaults can be changed for these properties, but changing the defaults results in Snowflake not creating the constraint.

> **Warning:**
>
> This syntax is partially supported in Snowflake.

#### Sample Source

##### Vertica

```sql
CREATE OR REPLACE TABLE employees (
    emp_id INTEGER,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    CONSTRAINT pk_employees_enabled PRIMARY KEY (emp_id) ENABLED
);
```

##### Snowflake

```sql
CREATE OR REPLACE TABLE employees (
    emp_id INTEGER,
    first_name VARCHAR(50),
    last_name VARCHAR(50),
    CONSTRAINT pk_employees_enabled PRIMARY KEY (emp_id) ENABLE
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

## Related EWIs

1. [SSC-EWI-0035](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/generalEWI.md): Check statement not supported.
2. [SSC-EWI-VT0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md): Inherited privileges clause is not supported in Snowflake.
3. [SSC-EWI-VT0002](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md): Order by table option is not supported in Snowflake.
4. [SSC-FDM-VT0001](../../general/technical-documentation/issues-and-troubleshooting/functional-difference/verticaFDM.md): Expression in USING constraint might not be supported in Snowflake.

## CREATE TABLE AS

### Description

Creates and loads a table from the [results of a query](https://docs.vertica.com/23.3.x/en/admin/working-with-native-tables/creating-table-from-other-tables/creating-table-from-query/). ([Vertica SQL Language Reference Create Table](https://docs.vertica.com/23.3.x/en/sql-reference/statements/create-statements/create-table/)).

> **Warning:**
>
> This syntax is partially supported in Snowflake. Translation pending for the following clauses

```sql
[ /*+ LABEL */ ]
[ AT epoch ]
[ ENCODED BY column-ref-list ]
[ ENCODING encoding-type ]
[ ACCESSRANK integer ]
[ GROUPED ( column-reference[,...] ) ]
```

### Grammar Syntax

```sql
CREATE TABLE [ IF NOT EXISTS ] [[database.]schema.]table
[ ( column-name-list ) ]
[ {INCLUDE | EXCLUDE} [SCHEMA] PRIVILEGES ]
AS  [ /*+ LABEL */ ] [ AT epoch ] query [ ENCODED BY column-ref-list ] [ segmentation-spec ]

<column-name-list> ::=
column-name-list
    [ ENCODING encoding-type ]
    [ ACCESSRANK integer ]
    [ GROUPED ( column-reference[,...] ) ]
```

### Tables Options

#### Segmentation Clause

This syntax isn’t required in Snowflake and is removed from the original code. For more information, please refer to **Projections Clauses**.

> **Note:**
>
> This syntax is not required in Snowflake.

#### Inherited Schema Privileges Clause

`INCLUDE SCHEMA PRIVILEGES` is a Vertica-specific feature that governs how privileges are inherited, in this case, potentially from the schema level. Snowflake does not have a direct equivalent for this clause within its `CREATE TABLE` syntax. For more information please refer to Inherited Schema Privileges Clause.

> **Warning:**
>
> This syntax is not supported in Snowflake.

### Related EWIs

1. [SSC-EWI-VT0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md): Inherited privileges clause is not supported in Snowflake.

## CREATE TABLE LIKE

### Description

Creates the table by [replicating an existing table](https://docs.vertica.com/23.3.x/en/admin/working-with-native-tables/creating-table-from-other-tables/replicating-table/). ([Vertica SQL Language Reference Create Table](https://docs.vertica.com/23.3.x/en/sql-reference/statements/create-statements/create-table/)).

> **Warning:**
>
> This syntax is partially supported in Snowflake. Translation pending for the following clause:
>
> ```sql
> DISK_QUOTA quota
> ```

### Grammar Syntax

```sql
CREATE TABLE [ IF NOT EXISTS ] [[database.]schema.]table
  LIKE [[database.]schema.]existing-table
  [ {INCLUDING | EXCLUDING} PROJECTIONS ]
  [ {INCLUDE | EXCLUDE} [SCHEMA] PRIVILEGES ]
  [ DISK_QUOTA quota ]
```

### Tables Options

#### Projections

This syntax isn’t required in Snowflake and is removed from the original code. For more information, please refer to **Projections Clauses**.

> **Warning:**
>
> This syntax is not required in Snowflake.

#### Inherited Schema Privileges Clause

`INCLUDE SCHEMA PRIVILEGES` is a Vertica-specific feature that governs how privileges are inherited, in this case, potentially from the schema level. Snowflake does not have a direct equivalent for this clause within its `CREATE TABLE` syntax. For more information please refer to Inherited Schema Privileges Clause.

> **Warning:**
>
> This syntax is not supported in Snowflake.

### Related EWIs

1. [SSC-EWI-VT0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md): Inherited privileges clause is not supported in Snowflake.

---
title: SnowConvert AI - Vertica - CREATE VIEW
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-create-view.md
section: Migrations
---

# SnowConvert AI - Vertica - CREATE VIEW

## Description

Creates a new view. ([Vertica SQL Language Reference Create view statement](https://docs.vertica.com/25.2.x/en/sql-reference/statements/create-statements/create-view/))

## Grammar Syntax

```sql
CREATE [ OR REPLACE ] VIEW [[database.]schema.]view [ (column[,...]) ]
  [ {INCLUDE|EXCLUDE} [SCHEMA] PRIVILEGES ] AS query
```

## Sample Source Patterns

> **Success:**
>
> This syntax is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-view).

### Vertica

```sql
CREATE OR REPLACE VIEW mySchema.myuser(
userlastname
)
AS
SELECT lastname FROM users;
```

#### Snowflake

```sql
CREATE OR REPLACE VIEW mySchema.myuser
(
userlastname
)
AS
SELECT lastname FROM
    users;
```

### Inherited Schema Privileges Clause

`INCLUDE SCHEMA PRIVILEGES` is a Vertica-specific feature that governs how privileges are inherited, in this case, potentially from the schema level. Snowflake does not have a direct equivalent for this clause within its `CREATE VIEW` syntax. Privileges in Snowflake are managed explicitly through `GRANT` statements.

> **Warning:**
>
> This syntax is not supported in Snowflake.

#### BigQuery

```sql
CREATE OR REPLACE VIEW mySchema.myuser(
userlastname
)
INCLUDE SCHEMA PRIVILEGES
AS
SELECT lastname FROM users;
```

#### Snowflake

```sql
CREATE OR REPLACE VIEW mySchema.myuser
(
userlastname
)
!!!RESOLVE EWI!!! /*** SSC-EWI-VT0001 - INHERITED PRIVILEGES CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
INCLUDE SCHEMA PRIVILEGES
AS
SELECT lastname FROM
    users;
```

### Known Issues

There are no known Issues.

### Related EWIs

1. [SSC-EWI-VT0001](../../general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md): Inherited privileges clause is not supported in Snowflake.

---
title: SnowConvert AI - Vertica - Data types
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-data-types.md
section: Migrations
---

# SnowConvert AI - Vertica - Data types

Snowflake supports most basic [SQL data types](https://docs.snowflake.com/en/sql-reference/intro-summary-data-types) (with some restrictions) for use in columns, local variables, expressions, parameters, and any other appropriate/suitable locations.

## Binary Data Type

| Vertica | Snowflake |
| --- | --- |
| [BINARY](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/binary-data-types-binary-and-varbinary/) | [BINARY](https://docs.snowflake.com/en/sql-reference/data-types-text#binary) |
| [VARBINARY (synonyms: BYTEA, RAW, BINARY VARYING)](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/binary-data-types-binary-and-varbinary/) | [BINARY (synonyms: VARBINARY, BINARY VARYING)](https://docs.snowflake.com/en/sql-reference/data-types-text#binary) |
| [LONG VARBINARY](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/long-data-types/) | [BINARY](https://docs.snowflake.com/en/sql-reference/data-types-text#binary)    *Notes: Vertica’s `LONG VARBINARY` supports up to 32,000,000 bytes (**~30.5MB)**, while Snowflake’s `BINARY` is limited to (8,388,608 bytes) **8MB**. This size difference means you might need an alternative solution for mapping larger `LONG VARBINARY` data.* |

## Boolean Data Type

| Vertica | Snowflake |
| --- | --- |
| [BOOLEAN](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/boolean-data-type/) | [BOOLEAN](https://docs.snowflake.com/en/sql-reference/data-types-logical#boolean) |

## Character Data Type

| Vertica | Snowflake |
| --- | --- |
| [CHAR](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/character-data-types-char-and-varchar/) | [CHAR](https://docs.snowflake.com/en/sql-reference/data-types-text#char-character-nchar) |
| [VARCHAR](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/character-data-types-char-and-varchar/) | [VARCHAR](https://docs.snowflake.com/en/sql-reference/data-types-text#varchar) |
| [LONG VARCHAR](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/long-data-types/) | [VARCHAR](https://docs.snowflake.com/en/sql-reference/data-types-text#varchar)    *Notes: Vertica’s `LONG VARCHAR` supports up to 32,000,000 bytes (**~30.5MB)**, while Snowflake’s `VARCHAR` is limited to 16,777,216 bytes (**16MB)**. This size difference means you might need an alternative solution for mapping larger `LONG VARCHAR` data.* |

## Date/Time Data Type

| Vertica | Snowflake |
| --- | --- |
| [DATE](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/date/) | [DATE](https://docs.snowflake.com/en/sql-reference/data-types-datetime#label-datatypes-date)    *Notes: Be aware of* [*Snowflake’s*](https://docs.snowflake.com/en/sql-reference/data-types-datetime#data-types) *recommended year range (1582-9999).* |
| [TIME](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/timetimetz/) | [TIME](https://docs.snowflake.com/en/sql-reference/data-types-datetime#label-datatypes-time) |
| [TIME WITH TIMEZONE (TIMETZ)](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/timestamptimestamptz/) | [TIME](https://docs.snowflake.com/en/sql-reference/data-types-datetime#label-datatypes-time)    *Notes: TIME data type in Snowflake does not persist this timezone attribute.* [*`SSC-FDM-0005`*](../general/technical-documentation/issues-and-troubleshooting/functional-difference/general/ssc-fdm-0005.md) *is added.* |
| [TIMESTAMP](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/timestamptimestamptz/) | [TIMESTAMP](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp) |
| [DATETIME](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/datetime/) | [DATETIME](https://docs.snowflake.com/en/sql-reference/data-types-datetime#datetime) |
| [SMALLDATETIME](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/smalldatetime/) | [TIMESTAMP_NTZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp-ltz-timestamp-ntz-timestamp-tz) |
| [TIMESTAMP WITH TIMEZONE (TIMESTAMPTZ)](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/timestamptimestamptz/) | [TIMESTAMP_TZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp-ltz-timestamp-ntz-timestamp-tz) |
| [TIMESTAMP WITHOUT TIME ZONE](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/datetime-data-types/timestamptimestamptz/) | [TIMESTAMP_NTZ](https://docs.snowflake.com/en/sql-reference/data-types-datetime#timestamp-ltz-timestamp-ntz-timestamp-tz) |

## Approximate Numeric Data Type

| Vertica | Snowflake |
| --- | --- |
| [DOUBLE PRECISION](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/double-precision-float/) | [DOUBLE PRECISION](https://docs.snowflake.com/en/sql-reference/data-types-numeric#double-double-precision-real) |
| [FLOAT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/double-precision-float/) | [FLOAT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#float-float4-float8) |
| [FLOAT8](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/double-precision-float/) | [FLOAT8](https://docs.snowflake.com/en/sql-reference/data-types-numeric#float-float4-float8) |
| [REAL](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/double-precision-float/) | [REAL](https://docs.snowflake.com/en/sql-reference/data-types-numeric#double-double-precision-real) |

## Exact Numeric Data Type

| Vertica | Snowflake |
| --- | --- |
| [INTEGER](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [INTEGER](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) |
| [INT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [INT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) |
| [BIGINT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [BIGINT](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) |
| [INT8](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [INTEGER](https://docs.snowflake.com/en/sql-reference/data-types-numeric#int-integer-bigint-smallint-tinyint-byteint) |
| [SMALLINT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [SMALLINT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) |
| [TINYINT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) | [TINYINT](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/integer/) |
| [DECIMAL](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/numeric/) | [DECIMAL](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) |
| [NUMERIC](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/numeric/) | [NUMERIC](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) |
| [NUMBER](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/numeric/) | [NUMBER](https://docs.snowflake.com/en/sql-reference/data-types-numeric#number) |
| [MONEY](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/numeric-data-types/numeric/) | [NUMERIC](https://docs.snowflake.com/en/sql-reference/data-types-numeric#decimal-dec-numeric) |

## Spatial Data Type

| Vertica | Snowflake |
| --- | --- |
| [GEOMETRY](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/spatial-data-types/) | [GEOMETRY](https://docs.snowflake.com/en/sql-reference/data-types-geospatial#label-data-types-geometry) |
| [GEOGRAPHY](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/spatial-data-types/) | [GEOGRAPHY](https://docs.snowflake.com/en/sql-reference/data-types-geospatial#label-data-types-geography) |

## UUID Data Type

| Vertica | Snowflake |
| --- | --- |
| [UUID](https://docs.vertica.com/25.1.x/en/sql-reference/data-types/uuid-data-type/) | [VARCHAR(36)](https://docs.snowflake.com/en/sql-reference/data-types-text#varchar)    *Notes: Snowflake doesn’t have a native UUID data type. Instead, UUIDs are usually stored as either **VARCHAR(36)** (for string format) or **BINARY(16)** (for raw byte format).*  *You can generate RFC 4122-compliant UUIDs in Snowflake using the built-in* [***`UUID_STRING()`***](https://docs.snowflake.com/en/sql-reference/functions/uuid_string) *function.* |

---
title: SnowConvert AI - Vertica - Identifier differences between Vertica and Snowflake
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-identifier-between-vertica-and-snowflake.md
section: Migrations
---

# SnowConvert AI - Vertica - Identifier differences between Vertica and Snowflake

## Quoted identifiers

In Vertica, quoted identifiers stick to the [case sensitivity rules](https://docs.vertica.com/25.1.x/en/sql-reference/language-elements/identifiers/#case-sensitivity), which means that, for example, column names are still case insensitive even when quoted. Thus, identifiers `"ABC"`, `"ABc"`, and `"aBc"` are synonymous, as are `ABC`, `ABc`, and `aBc` :

### Vertica

```sql
CREATE TABLE test.quotedIdentTable
(
  "col#1" INTEGER
);

SELECT "col#1" FROM test.quotedIdentTable;

SELECT "COL#1" FROM test.quotedIdentTable;
```

In Snowflake, case sensitivity of quoted identifiers depends on the session parameter [QUOTED_IDENTIFIERS_IGNORE_CASE](https://docs.snowflake.com/en/sql-reference/parameters#quoted-identifiers-ignore-case), by default quoted identifiers comparison is case sensitive, this means that the result code from migrating the above example:

### Snowflake

```sql
CREATE TABLE test.quotedIdentTable
(
  "col#1" INTEGER
);

SELECT
  "col#1"
FROM
  test.quotedIdentTable;

SELECT
  "COL#1"
FROM
  test.quotedIdentTable;
```

Will fail when executing the second select unless the session parameter is set to TRUE.

## How SnowConvert AI migrates quoted identifiers

SnowConvert AI will analyze quoted identifiers to determine if they contain non-alphanumeric characters or are reserved words in Snowflake, if they do SnowConvert AI will leave them as they are, alphanumeric identifiers will be left unquoted:

### Vertica

```sql
CREATE TABLE test.identsTable1
(
  "col#1" INTEGER,
  "col2" INTEGER
);

-- Group is a reserved word
SELECT
"col#1" AS "group",
"col2" AS "hello"
FROM
test.identsTable1;
```

### Snowflake

```sql
CREATE TABLE test.identsTable1
(
  "col#1" INTEGER,
  col2 INTEGER
);

-- Group is a reserved word
SELECT
  "col#1" AS "group",
  col2 AS hello
FROM
  test.identsTable1;
```

---
title: SnowConvert AI - Vertica - Operators
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-operators.md
section: Migrations
---

# SnowConvert AI - Vertica - Operators

## Operators

Vertica Operators

### Cast coercion operator

> Operator used to return:
>
> * NULL instead of an error for any non-date/time data types
> * NULL instead of an error after setting EnableStrictTimeCasts
>
> ( [Vertica SQL Language Reference Coercion operator](https://docs.vertica.com/24.1.x/en/sql-reference/language-elements/operators/data-type-coercion-operators-cast/cast-failures/#returning-all-cast-failures-as-null) )

To replicate this functionality SnowConvert AI translates this operator to the [**`TRY_CAST`**](https://docs.snowflake.com/en/sql-reference/functions/try_cast) function.

### Sample Source Patterns

#### Vertica

```sql
 SELECT
    measurement_id,
    reading::!FLOAT AS measurement_value
FROM raw_measurements;
```

#### Snowflake

```sql
 SELECT
    measurement_id,
    TRY_CAST(
    reading AS FLOAT) AS measurement_value
FROM
    raw_measurements;
```

---
title: SnowConvert AI - Vertica - Predicates
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/vertica/vertica-predicates.md
section: Migrations
---

# SnowConvert AI - Vertica - Predicates

## ALL & ANY array expressions

### Description

An expression used to **evaluate and compare** each element of an array against a specified expression. ([Vertica Language Reference ANY & ALL (array)](https://docs.vertica.com/23.4.x/en/sql-reference/language-elements/predicates/any-and-all/))

### Grammar Syntax

```sql
expression operator ANY (array expression)
expression operator ALL (array expression)
```

To support this expression SnowConvert AI translates the `<> ALL` to `NOT IN` and the `= ANY` to `IN`

### Sample Source Patterns

#### Input Code:

```sql
SELECT some_column <> ALL (ARRAY[1, 2, 3])
FROM some_table;

SELECT *
FROM someTable
WHERE column_name = ANY (ARRAY[1, 2, 3]);
```

#### Output Code:

```sql
SELECT some_column NOT IN (1, 2, 3)
FROM some_table;

SELECT *
 FROM someTable
 WHERE column_name IN (1, 2, 3);
```

#### Known Issues

There are no known issues

#### Related EWIs

There are no related EWIs.

## LIKE

LIKE Predicate

### Description

> Retrieves rows where a string expression—typically a column—matches the specified pattern or, if qualified by ANY or ALL, set of patterns ([Vertica SQL Language Reference Like Predicate](https://docs.vertica.com/23.4.x/en/sql-reference/language-elements/predicates/like/))

### Grammar Syntax

```sql
 string-expression [ NOT ] { LIKE | ILIKE | LIKEB | ILIKEB }
   { pattern | { ANY | SOME | ALL } ( pattern,... ) } [ ESCAPE 'char' ]
```

#### Vertica Substitute symbols

| Symbol | Vertica Equivalent | Snowflake Equivalent |
| --- | --- | --- |
| ~~ | LIKE | LIKE |
| ~# | LIKEB | LIKE |
| ~~\* | ILIKE | ILIKE |
| ~#\* | ILIKEB | ILIKE |
| !~~ | NOT LIKE | NOT LIKE |
| !~# | NOT LIKEB | NOT LIKE |
| !~~\* | NOT ILIKE | NOT ILIKE |
| !~#\* | NOT ILIKEB | NOT ILIKE |

In Vertica, the default escape character is the backslash (`\`). Snowflake doesn’t have a default escape character. SnowConvert AI will automatically add the `ESCAPE` clause when needed.

It’s important to know that Snowflake requires the backslash to be escaped (`\\`) when you use it as an escape character within both the expression and the `ESCAPE` clause. This means you’ll need two backslashes to represent a single literal backslash escape character in Snowflake queries. SnowConvert AI handles this by automatically escaping the backslash for you.

### Sample Source Patterns

> **Success:**
>
> This syntax is fully supported in [Snowflake](https://docs.snowflake.com/en/sql-reference/sql/create-view).

#### Vertica

```sql
 SELECT path_name
FROM file_paths
WHERE path_name ~~ '/report/sales_2025_q_.csv';

-- Find a path containing the literal '50%'
SELECT path_name
FROM file_paths
WHERE path_name LIKE '%50\%%';

-- Find a path starting with 'C:\'
SELECT path_name
FROM file_paths
WHERE path_name ILIKEB 'C:\\%' ESCAPE'\';
```

#### Snowflake

```sql
SELECT path_name
FROM file_paths
WHERE path_name LIKE '/report/sales_2025_q_.csv';

-- Find a path containing the literal '50%'
SELECT path_name
FROM file_paths
WHERE path_name LIKE '%50\\%%' ESCAPE'\\';

-- Find a path starting with 'C:\'
SELECT path_name
FROM file_paths
WHERE path_name ILIKE 'C:\\\\%' ESCAPE'\\';
```

#### Known Issues

While SnowConvert AI handles most backslash patterns, some **complex expressions** may still cause **query failures**. We recommend reviewing complex patterns to prevent these issues.

#### Related EWIs

There are no related EWIs.

---
title: SnowConvert AI - Vertica Functional Differences
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/functional-difference/verticaFDM.md
section: Migrations
---

# SnowConvert AI - Vertica Functional Differences

> **Note:**
>
> **Conversion Scope**
>
> SnowConvert AI for Vertica focuses its assessment and translation capabilities primarily on TABLES and VIEWS.
> While SnowConvert AI can recognize other types of ANSI-standard statements, these are not yet fully supported for conversion. This means that while the tool may identify them, it won’t perform a complete translation for these unsupported code units.

## SSC-FDM-VT0001

Expression in USING constraint might not be supported in Snowflake.

### Description

In Vertica, the `DEFAULT USING` clause offers a deferred refresh capability, which Snowflake doesn’t support. While Snowflake can apply the expression as a simple `DEFAULT` value when new rows are inserted, it won’t replicate Vertica’s deferred refresh logic.

Additionally, the expression itself might contain Vertica-specific functions or syntax that are incompatible with Snowflake. Because of these differences, a warning is added to your converted code. This highlights both the change in refresh behavior and the necessity to manually review the translated expression to ensure its syntax is compatible with Snowflake.

#### Code Example

##### Input Code:

##### Redshift

```sql
 CREATE TABLE table1 (
    base_value INT,
    derived_value INT DEFAULT USING (base_value + 100)
);
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE table1 (
    base_value INT,
    derived_value INT DEFAULT (base_value + 100) /*** SSC-FDM-VT0001 - EXPRESSION IN USING CONSTRAINT MIGHT NOT BE SUPPORTED IN SNOWFLAKE ***/
)
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - Vertica Issues
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/technical-documentation/issues-and-troubleshooting/conversion-issues/verticaEWI.md
section: Migrations
---

# SnowConvert AI - Vertica Issues

## SSC-EWI-VT0001

Inherited privileges clause is not supported in Snowflake.

### Description

Vertica’s **`INCLUDE SCHEMA PRIVILEGES`** allows views to inherit schema-level privileges, unlike Snowflake where view access is managed by explicit **`GRANT`** statements. Migrating these Vertica views to Snowflake requires manually translating these inherited permissions into specific **`GRANTs` .**

#### Code Example

##### Input Code:

##### Vertica

```sql
 CREATE OR REPLACE VIEW mySchema.myuser
INCLUDE SCHEMA PRIVILEGES
AS
SELECT lastname FROM users;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE OR REPLACE VIEW mySchema.myuser
!!!RESOLVE EWI!!! /*** SSC-EWI-VT0001 - INHERITED PRIVILEGES CLAUSE IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
INCLUDE SCHEMA PRIVILEGES
AS
SELECT lastname FROM
    users;
```

#### Best Practices

* For Snowflake, the recommendation is to translate these inherited Vertica permissions by using **`GRANT`** statements to assign the necessary privileges on the view directly to specific roles.
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

## SSC-EWI-VT0002

Order by table option is not supported in Snowflake

### Description

In Vertica, this `ORDER BY` clause specifies how data is physically sorted within a **superprojection**, an optimized storage structure for a table. This explicit physical ordering at table creation is not directly supported in Snowflake.

Snowflake handles data storage differently, utilizing **micro-partitions**. While the data within these micro-partitions can exhibit some natural order based on insertion or if **clustering keys** are defined, an `ORDER BY` clause is not used to dictate this physical arrangement during table creation in the same explicit manner as in Vertica’s superprojections. Instead, Snowflake employs clustering to optimize data layout for performance, providing a more automated approach to physical ordering.

#### Code Example

##### Input Code:

##### Vertica

```sql
 CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
ORDER BY measurement_date, business_unit, metric_category;
```

##### Generated Code:

##### Snowflake

```sql
 CREATE TABLE metrics
(
  metric_id INT,
  business_unit VARCHAR(100),
  metric_category VARCHAR(50) NOT NULL,
  measurement_date DATE NOT NULL
)
!!!RESOLVE EWI!!! /*** SSC-EWI-VT0002 - ORDER BY TABLE OPTION IS NOT SUPPORTED IN SNOWFLAKE ***/!!!
ORDER BY measurement_date, business_unit, metric_category
COMMENT = '{ "origin": "sf_sc", "name": "snowconvert", "version": {  "major": 0,  "minor": 0,  "patch": "0" }, "attributes": {  "component": "vertica",  "convertedOn": "06/17/2025",  "domain": "no-domain-provided" }}';
```

#### Best Practices

* For Snowflake, the recommendation is to add **clustering keys** to emulate this behavior, following [Snowflake’s own recommendations for clustering key implementation](https://docs.snowflake.com/en/user-guide/tables-clustering-keys).
* If you need more support, you can email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: SnowConvert AI - What is a Project?
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/what-is-a-snowconvert-project.md
section: Migrations
---

# SnowConvert AI - What is a Project?

## What is a SnowConvert AI Project (.snowct)?

This concept introduces the ability to execute the tool and persist the status of a project and all its configurations like Source Platform, Conversion Settings, Status of the latest successfully executed step, and others.

Each time that you click on “Save & Start Assessment” a project file (with the extension .snowct) will be created in the same folder that you chose as the input folder that contains the source code that you want to convert.

As a user, you will be able to:

1. Open the SnowConvert AI project by double-clicking the .snowct file
2. Open the SnowConvert AI project by clicking on “Open Project”
3. Click File -> Open Recents to see the list of projects that you recently opened in SnowConvert AI\

Once you execute any of these flows the tool will redirect you to the same state of the tool that you were executing when you closed the tool.

---
title: SnowConvert AI - Windows
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/how-to-install-the-tool/windows.md
section: Migrations
---

# SnowConvert AI - Windows

## Windows Installation

1. Click on the [downloaded](../../../getting-started/download-and-access.md) .exe file.
2. That will start the installation process on your computer.

3. Once the process is finished you can open SnowConvert AI from your applications menu.

4. After you start the application you can create a new assessment or conversion project or open an existing one.

### Setting up the CLI

Refer to [SnowConvert AI CLI](../command-line-interface/README.md).

---
title: SnowConvert AI CLI (scai) Command Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/SCAI_Command_Reference.md
section: Migrations
---

# SnowConvert AI CLI (scai) Command Reference

SnowConvert AI (`scai`) is a CLI tool for accelerated database migration to Snowflake. It manages end-to-end migration workflows including code extraction from source databases, automated conversion to Snowflake SQL, AI-powered code improvement, deployment, data migration, and validation.

---

## Global Options

These options are available on every `scai` command.

| Option | Description |
| --- | --- |
| `-h, --help` | Show help message |
| `-v, --version` | Display version information |
| `--log-debug` | Enable debug-level logging. Can also be set via the `SCAI_LOG_LEVEL` env var (accepts: `verbose`, `debug`, `information`, `warning`, `error`, `fatal`). |

---

## Quick Start

The basic workflow to get started with `scai`. For a more detailed walkthrough, see the quick start guide.

**1. Create a project** (use `-c` to set a default Snowflake connection):

```bash
scai init -n <name> -l <language> -c <connection>
```

**2. Add source code:**

For full migration (SQL Server, Redshift) – extract directly from the source database:

```bash
scai code extract
```

For other languages – add source files from disk:

```bash
scai code add -i <path>
```

**3. Convert to Snowflake SQL:**

```bash
scai code convert
```

**Optional additional steps:**

```bash
# Run AI-powered improvement
scai ai-convert start -w

# Accept AI fixes (after ai-convert completes)
scai ai-convert accept --all

# Deploy to Snowflake
scai code deploy --all
```

> **Tip:** Using `-c <connection>` during `scai init` saves the Snowflake connection in the project, so you don’t need to specify it for each command.
>
> Run `scai <command> -h` for detailed help on any command.

---

## Commands

### scai init

Create a new migration project in the specified directory (or the current directory if `PATH` is omitted).

```sql
scai init [PATH] -l <LANGUAGE> [-n <NAME>] [-i <INPUT_PATH>] [--skip-split] [-c <CONNECTION>]
```

**Prerequisites:**

* Target directory must not contain an existing project
* Valid source language must be specified (see Supported Languages)

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `[PATH]` | Directory to create the project in. If omitted, uses the current directory. | No |  |
| `-n, --name <NAME>` | Project name. If omitted, defaults to the target folder name. | No |  |
| `-l, --source-language <LANGUAGE>` | Source language for the project. | Yes |  |
| `-i, --input-code-path <PATH>` | Path to source code files to add during initialization. SQL Server and Redshift sources are processed through the arrange and assess pipeline; other languages are copied directly to `source/`. | No |  |
| `--skip-split` | Skip the arrange/split phase when source code is already split (SQL Server and Redshift only). Requires `--input-code-path`. | No | `false` |
| `-c, --connection <NAME>` | Snowflake connection name to save as project default. | No |  |

**Behavior notes:**

* Creates the project directory structure and configuration files.
* When `--input-code-path` is provided: SQL Server and Redshift run the arrange and parse-and-assess pipeline, promote processed files to `source/`, and generate a code unit registry. Other languages copy source directly to `source/`.
* When `--skip-split` is used with `--input-code-path` (SQL Server and Redshift only), skips the arrange/split phase, promotes raw source directly to `source/`, runs assessment only for code unit registry generation, and marks the project as type Full (new folder structure).
* Redshift source files require paired SC tags (e.g., `-- <sc-table> table_name </sc-table>`) for the arrange step. If validation or arrange fails, the project is still created but source code is not added. Recovery: fix source files and run `scai code add -i <path>`, or use `--skip-split` if the code is already split.

**Project folder structure:**

| Path | Description |
| --- | --- |
| `.scai/` | Project configuration |
| `.scai/config/` | `project.yml` and local settings |
| `artifacts/` | Intermediate processing artifacts |
| `source/` | Processed source code (populated by `--input-code-path` or `code add`) |
| `snowflake/` | Converted code, reports, and logs |

**Examples:**

```bash
# Create a project in a new folder (recommended)
scai init my-project -l Teradata

# Create a project in the current directory
scai init -l Teradata

# Create project with source code
scai init my-project -l Oracle -i /path/to/code

# Create project with pre-split source code (skip arrange phase)
scai init my-project -l SqlServer -i /path/to/code --skip-split

# Create project with a specific Snowflake connection
scai init my-project -l Oracle -c my-snowflake-conn
```

---

### scai project

View and manage project configuration.

#### scai project info

Display project details including name, source language, and status.

```sql
scai project info
```

**Prerequisites:**

* Must be run from within a migration project directory.

**Output fields:** Project Name, Project ID, Source Language, Snowflake Connection, Project Root.

**Examples:**

```bash
scai project info
```

#### scai project set-default-connection

Set the default Snowflake connection for the current project.

```sql
scai project set-default-connection -c <CONNECTION>
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Snowflake connection available in `connections.toml` or `config.toml`.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-c, --connection <NAME>` | Name of the Snowflake connection to set as the project default. | Yes |

**Connection precedence:**

1. `-c`/`--connection` option (per-command override)
2. Project connection (set by this command)
3. Default TOML connection

**Examples:**

```bash
# Set project default connection
scai project set-default-connection -c my-snowflake

# Change to production connection
scai project set-default-connection -c prod-snowflake
```

---

### scai connection

Manage source database connections (Redshift, SQL Server).

#### scai connection add-sql-server

Add a new SQL Server source database connection.

```sql
scai connection add-sql-server [OPTIONS]
```

**Prerequisites:**

* Network access to the SQL Server instance.
* For Windows auth: valid domain credentials.
* For standard auth: SQL Server username and password.

**Authentication methods:** Windows Authentication (Integrated Security), Standard Authentication (username/password).

**Operation modes:** Interactive (prompts for all required information – recommended) or Inline (command-line options for automation/CI).

Connections are saved to `~/.snowflake/connections.toml` (or project-local).

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-c, --connection <NAME>` | Name for this connection. | No |
| `--auth <AUTH>` | Authentication method (`windows`, `standard`). | No |
| `--user <USER>` | Username. | No |
| `--database <DATABASE>` | Database name. | No |
| `--connection-timeout <SECONDS>` | Connection timeout in seconds. | No |
| `--server-url <URL>` | SQL Server URL. | No |
| `--port <PORT>` | Port number. | No |
| `--password <PASSWORD>` | Password. | No |
| `--trust-server-certificate` | Trust server certificate. | No |
| `--encrypt` | Encrypt connection. | No |

**Examples:**

```bash
# Add connection interactively (recommended)
scai connection add-sql-server

# Windows Authentication
scai connection add-sql-server --connection my-sqlserver --auth windows --server-url localhost --database mydb

# Standard Authentication
scai connection add-sql-server --connection my-sqlserver --auth standard --server-url localhost --database mydb --username sa
```

#### scai connection add-redshift

Add a new Redshift source database connection.

```sql
scai connection add-redshift [OPTIONS]
```

**Prerequisites:**

* Network access to the Redshift cluster/serverless endpoint.
* For IAM auth: AWS credentials configured (AWS CLI or environment variables).
* For standard auth: username and password.

**Authentication methods:** IAM Serverless (AWS IAM with Redshift Serverless), IAM Provisioned (AWS IAM with Provisioned Cluster), Standard (username/password).

**Operation modes:** Interactive (recommended) or Inline (for automation/CI).

Connections are saved to `~/.snowflake/connections.toml` (or project-local).

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-c, --connection <NAME>` | Name for this connection. | No |
| `--auth <AUTH>` | Authentication method (`iam-serverless`, `iam-provisioned-cluster`, `standard`). | No |
| `--user <USER>` | Username. | No |
| `--database <DATABASE>` | Database name. | No |
| `--connection-timeout <SECONDS>` | Connection timeout in seconds. | No |
| `--workgroup <NAME>` | Redshift Serverless workgroup name. | No |
| `--cluster-id <ID>` | Redshift Provisioned Cluster ID. | No |
| `--region <REGION>` | AWS region. | No |
| `--access-key-id <KEY>` | AWS Access Key ID. | No |
| `--secret-access-key <KEY>` | AWS Secret Access Key. | No |
| `--host <HOST>` | Redshift host. | No |
| `--port <PORT>` | Port number. | No |
| `--password <PASSWORD>` | Password. | No |

**Examples:**

```bash
# Add connection interactively (recommended)
scai connection add-redshift

# IAM Serverless (inline)
scai connection add-redshift --connection my-redshift --auth iam-serverless \
  --workgroup my-workgroup --database mydb --region us-east-1
```

#### scai connection set-default

Set the default source connection for a database type.

```sql
scai connection set-default -l <LANGUAGE> -c <CONNECTION>
```

**Prerequisites:**

* Connection already added with `scai connection add-redshift` or `scai connection add-sql-server`.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-l, --source-language <LANGUAGE>` | Database type of the connection (`SqlServer`, `Redshift`). | Yes |
| `-c, --connection <NAME>` | Name of the source connection to set as default. | Yes |

**Examples:**

```bash
# Set default Redshift connection
scai connection set-default -l redshift --connection prod

# Set default SQL Server connection
scai connection set-default -l sqlserver --connection dev
```

#### scai connection list

List connections for a given source database.

```sql
scai connection list [-l <LANGUAGE>]
```

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-l, --source-language <LANGUAGE>` | Source language of the connection (`SqlServer`, `Redshift`). If omitted, shows a summary of all connections. | No |

**Output:** Table with columns: Name, Default, Host, Database.

**Examples:**

```bash
# List all connections summary
scai connection list

# List Redshift connections
scai connection list -l redshift

# List SQL Server connections
scai connection list -l sqlserver
```

#### scai connection test

Test a source database connection.

```sql
scai connection test -l <LANGUAGE> [-c <CONNECTION>]
```

**Prerequisites:**

* Connection already configured.
* Network access to the database.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-l, --source-language <LANGUAGE>` | Source language of the connection (`SqlServer`, `Redshift`). | Yes |
| `-c, --connection <NAME>` | Name of the connection to test. | No |

**Examples:**

```bash
# Test SQL Server connection
scai connection test -l sqlserver -c my-sqlserver

# Test Redshift connection
scai connection test -l redshift -c my-redshift
```

---

### scai code

Code operations: add, extract, convert, deploy, find, accept, where, resync.

#### scai code add

Add source code from an input file or directory to the project’s `source/` folder.

```sql
scai code add -i <INPUT_PATH> [--skip-split] [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Input must be a valid SQL source file or a directory containing SQL source files.

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-i, --input-path <PATH>` | Path to a source code file or directory to add to the project. | Yes |  |
| `--overwrite` | Overwrite existing files in the project’s `source/` folder if they conflict. | No | `false` |
| `--skip-split` | Skip the arrange/split phase when source code is already split (SQL Server and Redshift only). | No | `false` |
| `--source-id <SOURCE_ID>` | Identifier for the source system (e.g., server hostname). Recorded in the code unit registry under `codeStatus.registration.sourceId`. Defaults to the local machine name. | No |  |

**Behavior notes:**

* SQL Server and Redshift: runs arrange-only, produces `artifacts/source_raw_Processed/`, merges into `source/`.
* Other languages: copies source directly into `source/`.
* Checks for conflicting files when destination folders are non-empty (unless `--overwrite` is set).

**Examples:**

```bash
# Add source code to project
scai code add -i /path/to/source/code

# Add pre-split source code (skip arrange phase)
scai code add -i /path/to/source/code --skip-split

# Add a single file
scai code add -i /path/to/script.sql

# Add code overwriting existing files
scai code add -i /path/to/source/code --overwrite

# Add code with a source identifier for traceability
scai code add -i /path/to/source/code --source-id prod-sql-server-01
```

#### scai code extract

Extract code from the source database.

```sql
scai code extract [OPTIONS]
```

**Supported languages:** SQL Server, Redshift.

**Prerequisites:**

* A migration project initialized with `scai init`.
* Source database connection configured (use `scai connection add-redshift` or `scai connection add-sql-server`).
* Network access to the source database.

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-s, --source-connection <NAME>` | Name of the source connection to extract code from. | No |  |
| `--schema <SCHEMA>` | Schema name to extract code from. | No |  |
| `-t, --object-type <TYPES>` | Object types to extract (comma-separated, e.g., `TABLE,VIEW,PROCEDURE`). | No |  |
| `-n, --name <PATTERN>` | Filter objects by name. Supports substring match or wildcard patterns with `*` (e.g., `emp` or `Get*Data`). | No |  |
| `-i, --interactive` | Interactive mode: browse and select schemas, object types, and filter by name. | No | `false` |
| `--source-id <SOURCE_ID>` | Identifier for the source system. Recorded in the code unit registry under `codeStatus.registration.sourceId`. Defaults to the server hostname from the source connection. | No |  |

**Interactive mode:**

Requires an interactive terminal. In non-interactive or CI environments, use `--schema`, `--object-type`, and `--name` instead.

* **Pre-fetch phase:** prompt for schema (or leave empty for all) and multi-select object types to scope the catalog query.
* **Post-fetch phase:** multi-select schemas to include, optional name filter (wildcard `*` supported), summary table, then confirm extraction.

Options `--schema`, `-t`/`--object-type`, and `-n`/`--name` pre-fill the interactive prompts when used with `-i`.

**Output structure:**

```none
source/
  └── <database>/
      └── <schema>/
          └── <type>/
              └── *.sql
```

**Examples:**

```bash
# Interactive extraction (browse and select)
scai code extract -i

# Interactive with pre-filled schema
scai code extract -i --schema public

# Extract tables from a schema
scai code extract --schema public --object-type TABLE

# Extract tables and views
scai code extract --object-type TABLE,VIEW

# Extract from all schemas
scai code extract

# Extract code with a custom source identifier
scai code extract --source-id prod-redshift-cluster
```

#### scai code convert

Transform source database code to Snowflake SQL.

```bash
scai code convert [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Source code in the `source/` folder (from `scai code extract`, `scai code add`, or manual copy).

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-h, --help` | Display all conversion settings available for the project’s source language. | No |
| `-e, --etl-replatform-sources-path <PATH>` | Path to ETL replatform source files for cross-project code analysis. | No |
| `-p, --powerbi-repointing <PATH>` | Path to Power BI files for input repointing. | No |
| `-x, --show-ewis` | Show detailed EWI (Early Warning Issues) table instead of summary. | No |
| `--context-path <PATH>` | Path to read migration context from. Defaults to `.scai/conversion-context`. Generated context is always written to `.scai/conversion-context`. | No |
| `--overwrite-working-directory` | Overwrite the output files in the `snowflake/` directory and the Code Unit Registry files. | No |
| `--where <WHERE>` | SQL-like filter to select which code units to convert (see WHERE Clause Reference). Only matched units are transformed; dependencies are still parsed for symbol resolution. | No |

**Dialect-specific settings:**

Additional options are dynamically loaded based on the project’s source language. Run `scai code convert --help` within a project to see all available options for that dialect.

Common options available across multiple dialects:

| Option | Description | Dialects |
| --- | --- | --- |
| `-m, --comments` | Comment nodes that have missing dependencies. | SQL Server, Oracle, Teradata |
| `--encoding <ENCODING>` | File encoding for source files (default: UTF-8). | All |
| `-s, --customschema <SCHEMA>` | Custom schema name to apply. | SQL Server, Oracle, Teradata |
| `-d, --database <DATABASE>` | Custom database name to apply. | SQL Server, Oracle, Teradata |
| `--useexistingnamequalification` | Preserve existing name qualification from input code. Must be used with `-d` or `-s`. | SQL Server, Oracle, Teradata |
| `--renamingfile <PATH>` | Path to a file that specifies new names for objects. | Redshift, SQL Server, Teradata |
| `--arrange` | Arrange the code before translation. | SQL Server, Oracle, Teradata, Redshift |
| `-t, --pltargetlanguage <LANGUAGE>` | Target language for stored procedure transformation (`SnowScript` or `JavaScript`). Default: `SnowScript`. | SQL Server, Oracle, Teradata |
| `-w, --warehouse <NAME>` | Warehouse name for dynamic table refresh. Default: `UPDATE_DUMMY_WAREHOUSE`. | SQL Server, Oracle, Teradata, Databricks, Spark |
| `--targetlag <VALUE>` | Target lag for dynamic tables (e.g., `1 day`). | SQL Server, Oracle, Teradata, Databricks, Spark |
| `--previewflags <FLAGS>` | Feature flags to enable Snowflake preview features. | All |
| `--createestimationreports` | Generate estimation reports. | All |

**Output structure:**

```none
snowflake/
  ├── Output/                     Converted Snowflake SQL files
  │   └── <schema>/
  │       └── *.sql
  ├── Reports/
  │   ├── TopLevelCodeUnits.csv   List of all converted objects
  │   ├── Issues.csv              Conversion issues/warnings
  │   └── Summary.html            HTML conversion summary
  └── Logs/                       Conversion log files
```

**Examples:**

```bash
# Convert using project defaults
scai code convert

# Show all conversion settings for the project's dialect
scai code convert --help

# Convert with custom schema
scai code convert --customschema MY_SCHEMA

# Convert with comments on missing dependencies
scai code convert --comments

# Convert with object renaming file
scai code convert --renamingfile /path/to/renaming.json

# Convert with custom context path
scai code convert --context-path /path/to/context

# Convert only procedures
scai code convert --where "target.objectType = 'procedure'"
```

#### scai code deploy

Deploy converted SQL code to Snowflake.

```sql
scai code deploy [OPTIONS]
```

**Prerequisites:**

* Converted code in `snowflake/Output/` (from `scai code convert`).
* Snowflake connection configured (set with `scai init -c` or project settings).
* Appropriate Snowflake privileges (CREATE TABLE, CREATE VIEW, etc.).

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-c, --connection <NAME>` | The Snowflake connection to use. Uses default if not specified. | No |  |
| `-d, --database <NAME>` | Target database name for deployment. Also sets the connection database if not already configured. | No |  |
| `--warehouse <WAREHOUSE>` | Warehouse for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--schema <SCHEMA>` | Schema for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--role <ROLE>` | Role for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects to deploy (see WHERE Clause Reference). | No |  |
| `-a, --all` | Deploy all successfully converted objects without selection prompt. | No | `false` |
| `-r, --retry <N>` | Number of retry attempts for failed object deployments. | No | `1` |
| `--continue-on-error` | Continue deploying remaining objects even if some fail. | No | `true` |
| `--include-dependencies` | When used with `--where`, also deploy the dependencies of the filtered code units. No effect without `--where`. | No | `false` |

**Behavior notes:**

* `--warehouse`, `--schema`, and `--role` temporarily set missing connection fields (in-memory only; the TOML file is not modified).
* If the connection already has a value for an overridden field, an error is returned.

**Examples:**

```bash
# Deploy using default connection
scai code deploy

# Deploy all objects
scai code deploy --all

# Deploy with specific connection
scai code deploy --connection my-snowflake

# Deploy with temporary warehouse override
scai code deploy --warehouse MY_WH

# Deploy filtered objects and their dependencies
scai code deploy --where "target.objectType = 'procedure'" --include-dependencies
```

#### scai code find

Find code units from the project’s Code Unit Registry.

```sql
scai code find [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* An initialized Code Unit Registry (generated after `scai code convert`).

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects (see WHERE Clause Reference). | No |  |
| `--no-limit` | Disable the default 100-row limit on displayed objects. | No | `false` |

**Output:** Table with columns: Id, Fully Qualified Name, Object Type.

**Examples:**

```bash
# Find all code units
scai code find

# Find code units with a specific name
scai code find --where "source.name = 'my_table'"

# Find all code units without limit
scai code find --no-limit
```

#### scai code accept

Accept the latest converted artifact versions into the `snowflake/` output folder.

```sql
scai code accept [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Source code must be split and registry files must be generated (run `scai code add`).
* At least one code conversion run with `scai code convert`.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `--where <WHERE>` | Filter expression to select which objects to accept (see WHERE Clause Reference). | No |

**Behavior notes:**

* Scans the `artifacts/` directory for timestamped conversion outputs.
* For each code unit, selects the most recent version based on the timestamp folder name (`yyyyMMdd.HHmmss`).
* Copies the latest `.sql` files into the `snowflake/` folder, preserving directory structure.

**Examples:**

```bash
# Accept all latest artifacts
scai code accept
```

#### scai code where

Show the WHERE clause query reference for code unit filtering.

```sql
scai code where
```

This command displays all queryable fields, supported operators, and usage examples for WHERE clause filtering. It does not require a project directory or network access. The reference is generated at runtime from the Code Unit Registry schema.

**Field naming conventions:**

* Field names use **camelCase** with dot-notation: `source.objectType`, `target.objectType`, `codeStatus.conversion.status`, `codeStatus.aiVerification.status`, `codeStatus.registration.status`
* Enum values are **lowercase**: `'table'`, `'procedure'`, `'view'`, `'function'`, `'completed'`, `'failed'`, `'pending'`, `'excluded'`

**Commands that support `--where`:**

| Command | Usage |
| --- | --- |
| `scai code accept` | `--where <EXPRESSION>` |
| `scai code convert` | `--where <EXPRESSION>` |
| `scai code deploy` | `--where <EXPRESSION>` |
| `scai code find` | `--where <EXPRESSION>` |
| `scai ai-convert start` | `--where <EXPRESSION>` |
| `scai ai-convert accept` | `--where <EXPRESSION>` |
| `scai data migrate` | `--where <EXPRESSION>` |
| `scai data validate` | `--where <EXPRESSION>` |

**Examples:**

```bash
# Show WHERE clause reference
scai code where
```

#### scai code resync

Re-scan modified converted files and update issue metadata in the Code Unit Registry.

```sql
scai code resync
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Code converted with `scai code convert`.

**Behavior notes:**

* Detects code units whose converted files have been modified.
* Re-scans each modified file for SnowConvert issue codes (EWI, FDM, OOS, PRF).
* Updates the issue metadata in the registry.

**Examples:**

```bash
scai code resync
```

---

### scai ai-convert

AI-powered code improvement and test generation.

#### scai ai-convert start

Start AI-powered code conversion on converted code.

```bash
scai ai-convert start [OPTIONS]
```

**Prerequisites:**

* Code converted with `scai code convert` (generates TopLevelCodeUnits report).
* Snowflake connection configured with `snow connection add`.
* `CREATE MIGRATION` privilege granted on the Snowflake account.
* A warehouse configured in the Snowflake connection.
* Must accept AI disclaimers (interactive prompt or `-y` flag).

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-c, --connection <NAME>` | Snowflake connection for AI code conversion. | No |  |
| `--selector <PATH>` | Path to object selector file (YAML). Only for `code_conversion_only` projects. | No |  |
| `-i, --instructions <PATH>` | Path to instructions file with custom AI conversion configuration. | No |  |
| `-w, --watch` | Display job progress until completion (may take several minutes to hours). | No | `false` |
| `-y, --accept-disclaimers` | Accept all AI disclaimers without prompting (required for non-interactive use). | No | `false` |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects to convert (see WHERE Clause Reference). | No |  |
| `--warehouse <WAREHOUSE>` | Warehouse for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--schema <SCHEMA>` | Schema for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--role <ROLE>` | Role for the Snowflake connection. Only applied if the connection does not already have one. | No |  |
| `--database <DATABASE>` | Database for the Snowflake connection. Only applied if the connection does not already have one. | No |  |

**Testing modes:**

* **Default:** Tests converted code on Snowflake only.
* **Source system verification:** Also runs tests against the source database (requires an instructions file).

**Output structure:**

```none
ai-converted/
  └── JOB_<timestamp>_<id>/
      ├── fixed/           AI-improved SQL files by object type/schema
      └── tests_sql/       Generated regression tests by database/schema
```

**Examples:**

```bash
# Start AI code conversion
scai ai-convert start

# Start and wait for completion
scai ai-convert start -w

# Filter with selector (code_conversion_only)
scai ai-convert start --selector my-selector.yml

# Non-interactive (CI/CD)
scai ai-convert start -y -w

# Source system verification
scai ai-convert start -i config/instructions.yml

# Start with temporary warehouse override
scai ai-convert start --warehouse MY_WH

# Convert only tables using WHERE clause
scai ai-convert start --where "target.objectType = 'table'"
```

#### scai ai-convert status

Check the status of an AI code conversion job.

```sql
scai ai-convert status [JOB_ID] [OPTIONS]
```

**Prerequisites:**

* A job started with `scai ai-convert start`.
* Snowflake connection (uses the job’s connection if `--connection` not specified).

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `[JOB_ID]` | The job ID to check. If omitted, checks the last started job. | No |  |
| `-c, --connection <NAME>` | Override the Snowflake connection. | No |  |
| `-w, --watch` | Monitor progress until completion. For finished jobs, forces a server-side refresh and downloads detailed results. | No | `false` |
| `--warehouse <WAREHOUSE>` | Warehouse override (if not already configured on connection). | No |  |
| `--schema <SCHEMA>` | Schema override (if not already configured on connection). | No |  |
| `--role <ROLE>` | Role override (if not already configured on connection). | No |  |
| `--database <DATABASE>` | Database override (if not already configured on connection). | No |  |

**Examples:**

```bash
# Check last job status
scai ai-convert status

# Check specific job
scai ai-convert status JOB_20260112041123_XYZ

# Wait and download results
scai ai-convert status -w

# Use different connection
scai ai-convert status -c other-snowflake
```

#### scai ai-convert cancel

Cancel a running AI code conversion job.

```sql
scai ai-convert cancel [JOB_ID] [OPTIONS]
```

**Prerequisites:**

* A running job started with `scai ai-convert start`.
* Snowflake connection (uses the job’s connection if `--connection` not specified).

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `[JOB_ID]` | The job ID to cancel. If omitted, cancels the last started job. | No |
| `-c, --connection <NAME>` | Override the Snowflake connection. | No |
| `--warehouse <WAREHOUSE>` | Warehouse override (if not already configured on connection). | No |
| `--schema <SCHEMA>` | Schema override (if not already configured on connection). | No |
| `--role <ROLE>` | Role override (if not already configured on connection). | No |
| `--database <DATABASE>` | Database override (if not already configured on connection). | No |

**Examples:**

```bash
# Cancel last job
scai ai-convert cancel

# Cancel specific job
scai ai-convert cancel JOB_20260112041123_XYZ

# Use different connection
scai ai-convert cancel -c other-snowflake
```

#### scai ai-convert list

List AI code conversion jobs for the current project.

```sql
scai ai-convert list [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Snowflake connection for refreshing job status.

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-l, --limit <N>` | Maximum number of jobs to display. | No | `10` |
| `-a, --all` | Show all jobs (ignores limit). | No |  |
| `-c, --connection <NAME>` | Override the Snowflake connection for refreshing job status. | No |  |
| `--warehouse <WAREHOUSE>` | Warehouse override (if not already configured on connection). | No |  |
| `--schema <SCHEMA>` | Schema override (if not already configured on connection). | No |  |
| `--role <ROLE>` | Role override (if not already configured on connection). | No |  |
| `--database <DATABASE>` | Database override (if not already configured on connection). | No |  |

**Output:** Table with columns: Job ID, Status, Start Time, Duration, Objects. Possible status values: `PENDING`, `IN_PROGRESS`, `FINISHED`, `FAILED`, `CANCELLED`.

**Examples:**

```bash
# List recent jobs
scai ai-convert list

# Show all jobs
scai ai-convert list --all

# Refresh with different connection
scai ai-convert list -c other-snowflake
```

#### scai ai-convert accept

Review, compare, and accept AI-suggested fixes from a completed verification job.

```sql
scai ai-convert accept [JOB_ID] [OPTIONS]
```

**Prerequisites:**

* A completed AI code conversion job (run `scai ai-convert start` first).
* If using `--selector`: a selector file (`code_conversion_only`). Create with `scai object-selector create`.
* If using `--where`: full migration project. Run `scai code where` for syntax.

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `[JOB_ID]` | The job ID to accept changes for. If omitted, uses the last finished job. | No |  |
| `-i, --interactive` | Review each code unit one by one with options to accept, verify, or compare. | No | `false` |
| `-o, --selector <PATH>` | Path to object selector file (YAML). Only for `code_conversion_only`. | No |  |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter which objects to accept (see WHERE Clause Reference). Full migration projects only. | No |  |
| `--all` | Replace all converted files with their AI-fixed versions without prompting. | No | `false` |
| `--summary` | Show a summary of what would be affected without making changes. | No | `true` |
| `--json` | Output results in JSON format (for automation). Works with `--summary`. | No | `false` |

**Review modes:**

* **Summary** (`--summary`, default): Preview affected code units without making changes.
* **Interactive** (`-i`): Review each code unit with accept/verify/diff options.
* **All** (`--all`): Accept all AI-suggested fixes without prompting.

**Interactive actions:**

* `[d]` Diff – open diff tool to compare original and AI-fixed code
* `[v]` Verify – mark as verified (you applied changes manually)
* `[a]` Accept – overwrite converted file with AI fix
* `[s]` Skip – decide later
* `[q]` Quit – exit (progress is saved)

**Examples:**

```bash
# Interactive review
scai ai-convert accept -i

# Accept all fixes
scai ai-convert accept --all

# Accept from selector
scai ai-convert accept --all -o selector.yml

# Accept filtered by WHERE (full migration)
scai ai-convert accept --all --where "target.objectType = 'table'"

# Preview changes
scai ai-convert accept --summary

# JSON for automation
scai ai-convert accept --summary --json
```

---

### scai data

Data operations: migrate and validate.

#### scai data migrate

Migrate data from the source system into a Snowflake account.

```sql
scai data migrate [OPTIONS]
```

**Prerequisites:**

* Code converted with `scai code convert` (generates TopLevelCodeUnits report).
* Code deployed with `scai code deploy` (creates target tables in Snowflake).
* Source database connection configured.
* Snowflake connection configured with INSERT privileges.
* If using `--selector`: a selector file (`code_conversion_only`). Create with `scai object-selector create`.
* If using `--where`: full migration project; filter tables from Code Unit Registry.
* For Redshift: S3 bucket, Snowflake Storage Integration, and External Stage configured.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-s, --source-connection <NAME>` | Source connection to extract data from. Uses default if not specified. | No |
| `-c, --connection <NAME>` | Snowflake connection to migrate data to. Uses default if not specified. | No |
| `-o, --selector <PATH>` | Selector file for migration (`code_conversion_only`). If not provided, all tables are migrated. | No |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter tables from the Code Unit Registry (see WHERE Clause Reference). Full migration projects only. | No |
| `-b, --bucket-uri <URI>` | (Redshift only) S3 bucket URI for staging data (e.g., `s3://my-bucket/path`). | No |
| `--stage <STAGE_NAME>` | (Redshift only) Fully qualified Snowflake stage name for loading parquet files (e.g., `database.schema.stage_name`). | No |
| `-i, --iam-role-arn <ARN>` | (Redshift only) IAM role ARN to unload parquet files to S3. | No |
| `--warehouse <WAREHOUSE>` | Warehouse override (if not already configured on connection). | No |
| `--schema <SCHEMA>` | Schema override (if not already configured on connection). | No |
| `--role <ROLE>` | Role override (if not already configured on connection). | No |
| `--database <DATABASE>` | Database override (if not already configured on connection). | No |

**Examples:**

```bash
# Migrate all tables
scai data migrate --source-connection my-redshift --connection my-snowflake

# Migrate selected tables (selector)
scai data migrate --source-connection my-redshift --connection my-snowflake \
  --selector my-selector.yml

# Migrate filtered by WHERE (full migration)
scai data migrate --source-connection my-redshift --connection my-snowflake \
  --where "source.schema = 'public'"
```

#### scai data validate

Compare data between source and Snowflake to verify data integrity.

```sql
scai data validate [OPTIONS]
```

**Prerequisites:**

* Source database connection configured.
* Snowflake connection configured.
* Tables must exist in both source and target databases.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-s, --source-connection <SOURCE_CONNECTION>` | Source connection to use. Uses default if not specified. Ignored when `--snowflake-source` is specified. | No |
| `--snowflake-source <CONNECTION_NAME>` | Snowflake connection to use as the source (for Snowflake-to-Snowflake validation). | No |
| `-c, --connection <CONNECTION>` | Snowflake target connection. Uses default if not specified. | No |
| `-d, --target-database <DATABASE>` | Target Snowflake database for validation. | No |
| `-o, --selector <PATH>` | Selector file for validation (`code_conversion_only`). | No |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter tables from the Code Unit Registry (see WHERE Clause Reference). Full migration projects only. | No |
| `-m, --db-mapping <MAPPING>` | Database name mapping in format `source:target`. Can be specified multiple times. | No |
| `-e, --schema-mapping <MAPPING>` | Schema name mapping in format `source:target`. Can be specified multiple times. | No |
| `-f, --config-file <CONFIG_FILE>` | Path to an existing data validation config file (YAML). When provided, uses this config instead of generating one. | No |

**Output structure:**

```none
results/data-validation/run-YYYY-MM-DD-HH-mm-ss/
  ├── *_data_validation_report.csv   Main validation results
  ├── *_data_validation_report.html  HTML report for review
  └── logs/                          Detailed execution logs
```

**Examples:**

```bash
# Validate all tables from report
scai data validate

# Validate with selector file
scai data validate --selector my-tables.yml

# Validate filtered by WHERE (full migration)
scai data validate --where "source.schema = 'public'"

# With target database
scai data validate --target-database PROD_DB

# With name mappings
scai data validate --db-mapping "sourcedb:TARGETDB" --schema-mapping "dbo:PUBLIC"

# With explicit connections
scai data validate --source-connection my-sqlserver --connection my-snowflake
```

---

### scai test

Generate test cases for migrated stored procedures.

#### scai test seed

Generate YAML test case files from an execution log of stored procedure calls.

```sql
scai test seed --execution-log <EXECUTION_LOG> [--source-connection <SOURCE_CONNECTION>] [--connection <CONNECTION>] [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* An execution log file produced by running the original stored procedures.
* Code converted with `scai code convert`.

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `-e, --execution-log <EXECUTION_LOG>` | Path to the execution log file. | Yes |  |
| `-s, --source-connection <SOURCE_CONNECTION>` | Source connection to use. Uses default if not specified. | No |  |
| `-c, --connection <CONNECTION>` | Snowflake connection. Uses project/default if not specified. | No |  |
| `-m, --max-cases <MAX_CASES>` | Maximum number of test cases to generate per procedure. | No | `10` |
| `-a, --append` | Append test cases to existing test files instead of replacing them. | No |  |

**Output:** One YAML test case file per procedure at `artifacts/<target_db>/<target_schema>/<object_type>/.../<procedure_name>.yml`.

**Examples:**

```bash
# Generate test cases
scai test seed --execution-log artifacts/exec_log.csv

# Limit to 5 cases per procedure
scai test seed --execution-log artifacts/exec_log.csv --max-cases 5

# Append to existing test files
scai test seed --execution-log artifacts/new_exec_log.csv --append
```

#### scai test capture

Capture test baselines from the source database.

```sql
scai test capture [--source-connection <SOURCE_CONNECTION>] [--connection <CONNECTION>] [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Test YAML files in `artifacts/**/test/*.yml` (generated by `scai test seed`).
* A configured source database connection.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-s, --source-connection <SOURCE_CONNECTION>` | Source connection to use. Uses default if not specified. | No |
| `-c, --connection <CONNECTION>` | Snowflake connection (for baseline stage upload). Uses project/default if not specified. | No |
| `--baseline-dir <BASELINE_DIR>` | Directory to write baseline files to. Defaults to `{project}/.scai/baselines`. | No |

**Output:** JSON baseline files written to `.scai/baselines/` (or the directory specified by `--baseline-dir`).

**Examples:**

```bash
# Capture baselines
scai test capture

# With explicit connections
scai test capture --source-connection my-sqlserver --connection my-snowflake

# Custom baseline directory
scai test capture --baseline-dir ./my-baselines
```

#### scai test validate

Validate Snowflake procedures against captured baselines.

```sql
scai test validate [--connection <CONNECTION>] [OPTIONS]
```

**Prerequisites:**

* A migration project initialized with `scai init`.
* Baselines captured with `scai test capture`.
* A configured Snowflake connection.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-c, --connection <CONNECTION>` | Snowflake connection. Uses project/default if not specified. | No |
| `--baseline-dir <BASELINE_DIR>` | Directory containing baseline files. Falls back to Snowflake stage if not specified. | No |
| `--baseline-stage <BASELINE_STAGE>` | Snowflake stage containing baselines. Uses implicit default stage if not specified. | No |
| `--pattern <PATTERN>` | Regex pattern to filter test files by procedure name. Defaults to all procedures. | No |
| `--create-schema` | Create the VALIDATION schema and objects before running. | No |

**Examples:**

```bash
# Validate all procedures
scai test validate

# Filter by procedure name
scai test validate --pattern "my_proc.*"

# Create validation schema first
scai test validate --create-schema

# Use local baselines
scai test validate --baseline-dir ./my-baselines
```

---

### scai object-selector

Create selector files for filtering objects.

#### scai object-selector create

Create a selector file to filter objects for data migration and other operations.

```sql
scai object-selector create [OPTIONS]
```

**Prerequisites:**

* Code converted with `scai code convert` (generates TopLevelCodeUnits report).

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-d, --database <NAME>` | Filter objects by source database name. | No |
| `-s, --schema <NAME>` | Filter objects by source schema name. | No |
| `-t, --type <TYPES>` | Filter objects by type (comma-separated, e.g., `table,view,procedure`). | No |
| `-n, --name <NAME>` | Label for the selector file (becomes `<name>.<timestamp>.yml`). Defaults to `object-selector.<timestamp>.yml`. | No |

**Output:** A YAML selector file with the following structure:

```yaml
objects:
  - code_unit_id: <database>.<schema>.<name>
    type: TABLE | VIEW | PROCEDURE | ...
    source: { database, schema, name }
    target: { database, schema, name }
```

**Examples:**

```bash
# Create selector file
scai object-selector create

# Create with custom output path
scai object-selector create -o custom-selector.yml
```

---

### scai query

Execute SQL queries on source database systems.

```sql
scai query -q <QUERY> -s <CONNECTION> [-l <LANGUAGE>]
```

**Prerequisites:**

* Source database connection configured via `scai connection add-sql-server` or `scai connection add-redshift`.
* Network access to the source database.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-q, --query <QUERY>` | SQL query to execute on the source system. | Yes |
| `-s, --source-connection <CONNECTION>` | Source connection to use for query execution. | Yes |
| `-l, --source-language <LANGUAGE>` | Source database type (`SqlServer`, `Redshift`). Auto-detected from the connection name if omitted. | No |

**Output:** Query results printed as a formatted table (limited to 1000 rows).

**Examples:**

```bash
# Execute simple query
scai query -q "SELECT 1;" -s my-sqlserver

# Check table row count
scai query -q "SELECT COUNT(*) FROM customers" -s my-redshift

# Query with filter
scai query -q "SELECT * FROM orders WHERE status = 'pending'" -s my-connection

# Query with explicit source language
scai query -q "SELECT COUNT(*) FROM users" -s my-sqlserver -l SqlServer
```

---

### scai logs

Display the location of CLI log files and list recent entries.

```sql
scai logs [--last <COUNT>] [--open]
```

**Options:**

| Option | Description | Required | Default |
| --- | --- | --- | --- |
| `--last <COUNT>` | Number of recent log files to display. | No | `5` |
| `--open` | Open the log directory in the system file explorer. | No | `false` |

**Examples:**

```bash
# Show recent log files
scai logs

# Show the last 10 log files
scai logs --last 10

# Open log directory in file explorer
scai logs --open
```

---

### scai license

Install offline license for air-gapped environments.

#### scai license install

Install an offline license for running conversions without online activation.

```sql
scai license install -p <LICENSE_PATH>
```

**Prerequisites:**

* A valid offline license file (`.lic`) from Snowflake.

**Use cases:**

* Running in air-gapped environments without internet.
* CI/CD pipelines that cannot use online activation.
* Environments with restricted network access.

**Options:**

| Option | Description | Required |
| --- | --- | --- |
| `-p, --path <LICENSE_PATH>` | Path to the license file to install. | Yes |

**Examples:**

```bash
scai license install --path /path/to/license.lic
```

---

## Supported Languages

`scai` supports two project types depending on the source language.

### Full Migration

These languages support the complete migration workflow: code extraction from a live source database, conversion, AI improvement, deployment, data migration, and validation.

| Language |
| --- |
| SqlServer |
| Redshift |

### Code Conversion Only

These languages support code conversion from files on disk. Source code is added manually via `scai code add` or `scai init -i`.

| Language |
| --- |
| Oracle |
| Teradata |
| BigQuery |
| Databricks |
| Greenplum |
| Sybase |
| Postgresql |
| Netezza |
| Spark |
| Vertica |
| Hive |
| Db2 |

---

## Workflows

### Full Migration (SQL Server / Redshift)

Complete migration workflow for full project types with source database connectivity.

```bash
# 1. Create project
scai init my-migration -l SqlServer

# 2. Add source connection
scai connection add-sql-server

# 3. Extract from source
scai code extract

# 4. Convert to Snowflake
scai code convert

# 5. AI improvement (optional)
scai ai-convert start -w

# 6. Accept AI fixes (optional, after step 5)
scai ai-convert accept --all

# 7. Deploy to Snowflake
scai code deploy --all

# 8. Migrate data (optional)
scai data migrate --source-connection my-sqlserver --connection my-snowflake
```

### Code Conversion Only

Workflow for projects without source database connectivity. Source code is added from local files.

```bash
# 1. Create project
scai init my-migration -l Teradata

# 2. Add source code
scai code add -i /path/to/teradata/code

# 3. Convert to Snowflake
scai code convert

# 4. Deploy to Snowflake
scai code deploy --all
```

---

## Snowflake Connection

SnowConvert AI uses the Snowflake CLI (`snow`) for managing Snowflake connections. This is separate from the `scai` CLI.

**Configuration:**

```bash
# Add a Snowflake connection
snow connection add

# Set default Snowflake connection
snow connection set-default <connection-name>
```

**Usage in scai:**

```bash
# Deploy with specific connection
scai code deploy -c my-snowflake

# AI convert with specific connection
scai ai-convert start -c my-snowflake
```

**Connection precedence** (highest to lowest):

1. `-c`/`--connection` option on the `scai` command
2. Project connection (set by `scai project set-default-connection` or `scai init -c`)
3. Default TOML connection (set by `snow connection set-default`)

For more details on configuring Snowflake connections, see the [Snowflake CLI connection documentation](https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-connections).

---
title: SnowConvert AI: Data migration [Preview]
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/data-migration.md
section: Migrations
---

# SnowConvert AI: Data migration [Preview]

The Data Migration feature of SnowConvert provides a fault-tolerant, scalable solution for moving data from external sources into Snowflake. This feature is specifically designed for cases where you are moving data from a system you plan to decommission. For replication purposes, other solutions are available that might better fit your use case.

## Architecture overview

The Data Migration feature uses two main components: an **Orchestrator** and one or more **Workers**.

* The Orchestrator connects to the Snowflake account. It requires privileges to create and operate the `SNOWCONVERT_AI` database, where metadata is stored.
* One or more Workers connect to both the source system and the Snowflake account. Workers read data from the source system and upload it to a Snowflake stage. Workers pick up tasks created by the Orchestrator and process them in parallel.
* Files uploaded to the Snowflake stage are copied into the target tables using a `COPY INTO` statement submitted and monitored by the Orchestrator.

### Deployment options

The Orchestrator and Workers can be deployed in multiple ways:

* Both on Snowpark Container Services (in the Snowflake account).
* Both in the customer’s environment, including custom hardware, virtual machines, or containers.
* Orchestrator on Snowpark Container Services and Workers in the customer’s environment, or the other way around.

The following requirements apply to the environment where the Orchestrator and Workers run:

* The Orchestrator and Workers are Python packages, so Python 3.11 or higher must be installed.
* Workers typically require an ODBC driver to connect to the source system.
* The Orchestrator must be able to connect to the Snowflake account using a role that has privileges to create the `SNOWCONVERT_AI` database and create schemas and objects within it.

## Prerequisites

Before you use Data Migration, make sure the following are in place:

* **Python environment**: Python 3.11 or higher, with the `snowflake-data-migration-orchestrator` and `snowflake-data-exchange-agent` packages installed (see Installation).
* **Snowflake access**: Connections for the Orchestrator and Workers in your Snowflake `config.toml` or `connections.toml`, using a role that can create the `SNOWCONVERT_AI` database and its objects. The first time the Orchestrator starts, it creates that database and related resources if they do not exist yet. On later runs, use a role that can administer `SNOWCONVERT_AI` and its objects; sticking with the same role you used for the initial creation is the simplest way to avoid permission issues.
* **Source connectivity**: For typical source databases, an ODBC driver on the machine where Workers run. Programmatic Access Tokens (PATs) are recommended for Snowflake connections; see Connecting to Snowflake with a PAT.
* **Hybrid Tables**: Hybrid Tables must be enabled and available in your Snowflake account and region for this feature. Review [Hybrid tables](https://docs.snowflake.com/en/user-guide/tables-hybrid) and [Hybrid tables limitations](https://docs.snowflake.com/en/user-guide/tables-hybrid-limitations) so you understand relevant platform requirements.
* **Snowpark Container Services (optional)**: If you deploy the Orchestrator or Workers on Snowflake compute, your account needs SPCS. See the [Snowpark Container Services overview](https://docs.snowflake.com/en/developer-guide/snowpark-container-services/overview). Running both components outside Snowflake does not require SPCS.

## Setup

### Installation

The recommended approach is to install the SnowConvert AI (SCAI) CLI and use its commands directly. This requires Snowpark Container Services (SPCS) to be enabled on the Snowflake account.

Alternatively, you can run the Python packages directly (Python 3.11 or higher):

* `snowflake-data-migration-orchestrator`: For the Orchestrator.
* `snowflake-data-exchange-agent`: For Workers.

## Usage

To migrate data using this solution, complete the following high-level steps:

1. Start the Orchestrator.
2. Start the Workers.
3. Create a Data Migration Workflow.
4. Monitor the Data Migration Workflow until completion.

A Data Migration Workflow is essentially an action or goal for the system to complete, such as migrating a specific set of tables with a given configuration. You can submit multiple workflows simultaneously and monitor them. The Orchestrator breaks Data Migration Workflows into smaller tasks, which typically involves splitting a table into partitions before extracting its data and loading it to Snowflake.

* Using SCAI CLI
* Using Python packages

### Using SCAI CLI

#### Starting the Orchestrator

Use this command to start the Orchestrator on Snowpark Container Services. Remember to stop the service when it’s no longer needed.

```bash
# Set up with default connection:
scai data setup-cloud-migration --compute-pool MY_COMPUTE_POOL

# Set up with specific connection:
scai data setup-cloud-migration --compute-pool MY_COMPUTE_POOL --connection MY_SNOWFLAKE_CONNECTION

# Check help:
scai data setup-cloud-migration --help
```

#### Starting the Workers

Start a Worker by running the following command. See Worker configuration for the configuration file reference.

```bash
# Start with default connection:
scai data start-cloud-worker my-worker-config.toml

# Start with specific connection:
scai data start-cloud-worker my-worker-config.toml -c my-snowflake

# Check help:
scai data start-cloud-worker --help
```

#### Creating a Data Migration Workflow

Generate a Workflow Configuration based on the state of your SCAI project. You can also create one by hand or ask Cortex Code for help. See Workflow configuration reference for the full specification.

```bash
# Generate config for all tables:
scai data generate-cloud-migration-config

# Filter tables by schema:
scai data generate-cloud-migration-config --where "source.schema = 'public'"

# Custom output path:
scai data generate-cloud-migration-config -o my-config.yaml

# Check help:
scai data generate-cloud-migration-config --help
```

Once you have a Workflow Configuration, start a workflow. You can return immediately or wait for completion.

```bash
# Start migration (returns immediately):
scai data cloud-migrate --config my-data-migration-config.yaml --connection my-snowflake

# Start migration and wait for completion:
scai data cloud-migrate --config my-data-migration-config.yaml --connection my-snowflake --watch

# Start service and migrate:
scai data cloud-migrate --config my-data-migration-config.yaml --start-service \
  --compute-pool MY_COMPUTE_POOL --connection my-snowflake

# Check help:
scai data cloud-migrate --help
```

#### Monitoring a Data Migration Workflow

After a workflow starts, check its status with the following command. The same observability features, including the `DATA_MIGRATION_DASHBOARD` Streamlit dashboard, are also available when using the SCAI CLI.

```bash
# Check workflow status:
scai data cloud-migrate-status DATA_MIGRATION_WORKFLOW_xx_yy_zz

# Watch workflow progress:
scai data cloud-migrate-status DATA_MIGRATION_WORKFLOW_xx_yy_zz --watch

# Watch with custom interval:
scai data cloud-migrate-status --watch --poll-interval 10

# Check help:
scai data cloud-migrate-status --help
```

### Using Python packages

#### Starting the Orchestrator

After installing the Orchestrator, start it by running the following command:

```bash
python -m data_migration_orchestrator
```

Before running, make sure the `SNOWFLAKE_CONNECTION_NAME` environment variable is set to a value that matches one of the connection names in your Snowflake `config.toml` or `connections.toml`. That is the name of the connection used to connect to the target Snowflake account.

By default, workflow and task metadata objects are created under `SNOWCONVERT_AI.DATA_MIGRATION`. To use a different database or schema for that metadata, set the environment variables `CUSTOM_SNOWFLAKE_DATABASE_FOR_METADATA` (default `SNOWCONVERT_AI`) and `CUSTOM_SNOWFLAKE_SCHEMA_FOR_DATA_MIGRATION_METADATA` (default `DATA_MIGRATION`) before starting the Orchestrator. If you override these values, set the same database and schema in each Worker’s configuration using `snowflake_database_for_metadata` and `snowflake_schema_for_data_migration_metadata` under `[application]` (see Worker configuration).

The Orchestrator runs until you stop it. Data Migration Workflows require an active Orchestrator to complete. However, the Orchestrator can be safely stopped at any point and resumed later; ongoing Data Migration Workflows are resumed at that point.

#### Starting the Workers

After installing the Worker, start it by running the following command:

```bash
python -m data_exchange_agent -c <configuration-file-path>
```

The path to the configuration file can be omitted. In that case, the Worker looks for a file called `configuration.toml` in your current directory. For the Worker configuration specification, see Worker configuration.

Workers run until you stop them. Data Migration Workflows require at least one active Worker to complete. However, Workers can be safely stopped at any point and resumed later; ongoing Data Migration Workflows are resumed at that point.

#### Creating a Data Migration Workflow

After installing the Orchestrator, create workflows by running the following command:

```bash
python -m data_migration_orchestrator create-data-migration-workflow <workflow-config-file-path> \
  --name <workflow-name> \
  --connection-name <connection-name> \
  --source-platform <source-platform>
```

Keep the following in mind:

* The Workflow Configuration specification can be found in Workflow configuration reference.
* The workflow name must be composed of alphanumeric characters and cannot start with a digit.
* You must specify the name of the Snowflake connection you want to use, as it appears in your `config.toml` or `connections.toml` file.
* Supported source platforms are `sqlserver` and `redshift`.

#### Monitoring a Data Migration Workflow

Each workflow goes through different stages throughout its lifecycle:

* **Pending**: No tasks have been created for this workflow yet.
* **Executing**: Tasks have been created for this workflow and there are still tasks that have not reached a terminal state (`COMPLETED` or `FAILED`).
* **Completed**: All tasks have reached a terminal state (`COMPLETED` or `FAILED`).

In the `SNOWCONVERT_AI.DATA_MIGRATION` schema, the following tables and views can be queried to understand the status of one or more workflows:

* `WORKFLOW`: Contains one row per workflow, including start/end time, status, and configuration.
* `TABLE_PROGRESS_WITH_EXAMPLE_ERROR`: Contains one row per table being migrated as part of a workflow. Includes information about how many partitions are in each stage (extraction, loading, completed, or failed), as well as related errors. Can be filtered by `WORKFLOW_ID`.
* `DATA_MIGRATION_ERROR`: For each partition of a table being migrated, contains the first known error that affected the migration of that partition. Can be filtered by `WORKFLOW_ID`.

In the same schema, the `DATA_MIGRATION_DASHBOARD` Streamlit dashboard can be used to monitor workflows. This dashboard presents data from those tables and views.

### Redshift UNLOAD

For Redshift, it is recommended to use the `unload` extraction strategy. This works as follows:

* Large query results are written directly to an S3 bucket instead of being downloaded to the machine running the Worker.
* On the Snowflake side, an external stage is configured to reference the corresponding S3 bucket, so that `COPY INTO` statements can be executed directly from that stage.

For configuration details, see ExtractionStrategy model.

### Incremental synchronization

You can migrate tables and then re-migrate them in the future, moving only the data that has changed. For more details, see SynchronizationStrategy model.

## Considerations and recommendations

### Connecting to Snowflake with a PAT

It is recommended to use Programmatic Access Tokens (PAT) for connections used by the Orchestrator and Workers. This ensures there is no need to constantly authenticate through the browser or with an authenticator app. You will need to establish a Network Policy or temporarily bypass the requirement for a Network Policy (this can be done from Snowsight).

### Running Orchestrator and Workers on SPCS

To leverage Snowflake compute for these tasks:

1. Prepare Docker images that use the Python modules with the appropriate configuration.
2. Push those Docker images to an Image Repository in Snowflake.
3. Execute the Orchestrator and/or Worker images using Snowpark Container Services.

Keep the following in mind:

* It is recommended to execute them as Services, not Jobs.
* It is possible to run only one component (Orchestrator or Workers) in SPCS and the other on a different platform.
* It is a good practice to monitor the SPCS service and suspend it when it is not being used.
* Depending on the network configuration of the source system, you may need to configure an External Access Integration so that these services can connect to your source system.

### Initial testing

It is recommended to deploy the DDL for the tables you want to migrate before starting data migration. This ensures the target type matches the behavior you want to see in the table and its related views and procedures. You can convert DDL from your source dialect into Snowflake SQL using the code conversion capabilities of SnowConvert AI and/or Cortex Code.

> **Note:**
>
> If you don’t deploy the DDL for the tables before starting data migration, the types will be inferred, which may not be as accurate as required.

For an early test run, use a **separate workflow configuration** whose `tables` array lists only the table or small set of tables you want to validate. On each of those entries, set `whereClauseCriteria` to an SQL-like predicate (as you would in a `WHERE` clause) so only a subset of rows is migrated, for example a bounded primary-key range or a narrow date range in the source dialect. You can also set a small `partitionSize` (for example `maxRowsPerPartition`) to keep partitions tiny during the test. After you confirm connectivity, performance, and results, create your full workflow: remove or relax `whereClauseCriteria` and use `"auto"` or your production `partitionSize` settings.

### Managing Workers

The time it takes to complete a workflow depends on many variables, but the number of Workers (and threads per Worker) has the greatest impact, as it determines how many extraction tasks can be executed in parallel. Consider the following:

* It is not necessary to run two Workers on the same machine. If you want more parallelism on a single machine, increase the thread count instead.
* Network bandwidth greatly affects Worker speed and is shared between threads of a Worker.
* Even with many Workers and threads running in parallel, the source system might not have enough resources to handle the load.
* Keep a low Worker count to avoid overloading your source system.
* Consider stopping some or all Workers when the source system is already under heavy load from other operations.

## Configuration reference

### Workflow configuration reference

The Data Migration Workflow configuration file is a JSON object. The following sections describe its structure and properties.

> **Note:**
>
> Names that require quoting (or brackets) must be manually quoted as they would normally be in JSON. For example: `tableName: "\"MyCaseSensitiveTable\""`.

#### Top-level object

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `tables` | `TableConfiguration[]` |  | An array of table-specific configurations defining which tables to migrate and how. |
| `defaultTableConfiguration` | `TableConfiguration` |  | Shared settings that are inherited by all tables in the `tables` array. Table-specific values override these defaults. |
| `affinity` | `String` |  | An affinity group string. Ensures that only Orchestrator and Worker instances with a matching affinity process this workflow. |

When `defaultTableConfiguration` is present, each object in `tables` is merged with those defaults: shared fields apply to every table unless the same field is set again on a specific table entry, in which case the table-level value wins.

#### TableConfiguration model

Defines the settings for migrating a single table.

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `source` | `SourceTargetIdentifier` |  | Identifies the source table. |
| `target` | `SourceTargetIdentifier` |  | Identifies the target table in Snowflake. |
| `columnNamesToPartitionBy` | `String[]` |  | A list of columns used to partition data during the extraction phase. |
| `extraction` | `ExtractionStrategy` |  | Settings to configure how data is extracted from the source database. |
| `synchronization` | `SynchronizationStrategy` |  | Settings for incremental synchronization. |
| `columnTypeMappings` | `ColumnTypeMapping` |  | Type conversions applied during migration. |
| `columnNameMappings` | `ColumnNameMapping` |  | Column renaming mappings. |
| `primaryKeyColumns` | `String[]` |  | Primary key columns for the source table. Required for `trackModifications` under the `watermark` synchronization strategy. |
| `partitionSize` | `PartitionSize` |  | Configures the target size of each partition during extraction. Defaults to `"auto"`. See Partition size (`partitionSize`). |
| `whereClauseCriteria` | `String` |  | An SQL-like filter to select a subset of rows for migration (for example, `"is_deleted = 0"`). |

#### SourceTargetIdentifier model

A nested object used within `TableConfiguration` to specify a database object. For `source`, use only the properties in the following table. For `target`, you can also set the optional properties in Additional target properties.

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `databaseName` | `String` |  | The name of the source or target database. |
| `schemaName` | `String` |  | The name of the schema containing the table. |
| `tableName` | `String` |  | The name of the table to be migrated. |

#### Additional target properties

The following optional fields apply **only** to the `target` object (not to `source`).

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `tableType` | `String` |  | `"native"` for a standard Snowflake table (default if omitted) or `"iceberg"` for an Apache Iceberg™ table. |
| `icebergConfig` | `Object` | For Iceberg targets | Required when `tableType` is `"iceberg"`. Merged with `defaultTableConfiguration.target.icebergConfig` if present; table-level keys override defaults. See Iceberg configuration (`target.icebergConfig`). |

#### Iceberg configuration (`target.icebergConfig`)

Used when `target.tableType` is `"iceberg"`. Account setup (external volumes, catalog integrations, stages, and privileges) follows Snowflake’s Iceberg documentation; see [Apache Iceberg™ tables](https://docs.snowflake.com/en/user-guide/tables-iceberg), [Create an Iceberg table](https://docs.snowflake.com/en/user-guide/tables-iceberg-create), and [Configure an external volume](https://docs.snowflake.com/en/user-guide/tables-iceberg-configure-external-volume).

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `catalog` | `String` |  | Default `SNOWFLAKE` for Snowflake-managed Iceberg. Use a catalog integration name for externally cataloged tables (for example AWS Glue). |
| `externalVolume` | `String` | For `catalog` `SNOWFLAKE` | Snowflake external volume for Iceberg data and metadata. |
| `baseLocationPrefix` | `String` |  | Optional path prefix for `BASE_LOCATION` when using Snowflake-managed Iceberg (`catalog` `SNOWFLAKE`). |
| `catalogTableName` | `String` | For external `catalog` | Fully qualified name of the table in the external catalog (for example `glue_db.my_table`). |
| `catalogSync` | `String` |  | Optional catalog integration used to sync Snowflake-managed metadata back to an external catalog. |
| `sourceDataStage` | `String` |  | Stage path starting with `@` pointing at existing Parquet files; used for `copy_files`-style loads with Snowflake-managed Iceberg. |
| `migrationStrategy` | `String` |  | One of `catalog_link`, `convert_to_managed`, or `copy_files`. When omitted, the Orchestrator infers a strategy from `catalog` and `sourceDataStage`. |

#### Partition size (`partitionSize`)

Controls how large each partition should be during extraction. You can use a string or an object.

| Form | Description |
| --- | --- |
| `"auto"` (default) | The system chooses partition sizes from the source platform, extraction strategy, and table size. Auto mode uses larger partitions for Redshift UNLOAD (S3-friendly large files) and smaller partitions for ODBC-based extraction (SQL Server, Redshift regular), where data flows through the Worker. For very large tables (100+ GB), the maximum number of partitions can increase to allow more parallelism. |
| `{ "targetSizeMb": N }` | Each partition targets about `N` megabytes of data. |
| `{ "maxRowsPerPartition": N }` | Each partition contains at most `N` rows, regardless of data size. |

When you use the object form, specify only one of `targetSizeMb` or `maxRowsPerPartition`.

**auto** (default):

```json
"partitionSize": "auto"
```

**Target size in MB:**

```json
"partitionSize": { "targetSizeMb": 2048 }
```

**Maximum rows per partition:**

```json
"partitionSize": { "maxRowsPerPartition": 500000 }
```

#### ColumnTypeMapping model

A nested object used within `TableConfiguration` to specify type mappings for a column.

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `sourceType` | `String` |  | The name of the type in the source system. |
| `targetType` | `String` |  | The name of the target type in Snowflake. |

#### ColumnNameMapping model

A nested object used within `TableConfiguration` to specify column name mappings.

| Property | Type | Required | Description |
| --- | --- | --- | --- |
| `sourceName` | `String` |  | The name of the column in the source system. |
| `targetName` | `String` |  | The name of the target column in Snowflake. |

#### ExtractionStrategy model

Configures the method for data extraction.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `strategy` | `String` (`"regular"`, `"unload"`) |  | The extraction method. `"regular"` is the default. `"unload"` is available for Redshift sources only. |
| `externalStage` | `String` | UNLOAD only | The name of the Snowflake external stage to use when `strategy` is `"unload"`. |

**Extraction: regular (Default)**

```json
"extraction": {
  "strategy": "regular"
}
```

**Extraction: unload (Redshift only)**

```json
"extraction": {
  "strategy": "unload",
  "externalStage": "MY_DB.MY_SCHEMA.S3_EXTERNAL_STAGE"
}
```

#### SynchronizationStrategy model

Configures the approach for incremental data syncing on subsequent runs.

| Field | Type | Required | Description |
| --- | --- | --- | --- |
| `strategy` | `String` (`"none"`, `"checksum"`, `"watermark"`) |  | The synchronization method. |
| `watermarkColumn` | `String` | `watermark` only | Column name to track. Must be monotonically increasing. |
| `trackModifications` | `Boolean` |  | If `true`, the system uses the primary key to identify and deduplicate modified rows. Requires `primaryKeyColumns` to be specified in `TableConfiguration`. |

**Strategy: none (Default)**

Performs a full extraction of all partition data on every run. No synchronization metadata is stored.

```json
"synchronization": {
  "strategy": "none"
}
```

Use when data is small, changes are unpredictable, or guaranteed consistency is needed.

**Strategy: checksum**

Computes a hash of all column values for each partition on the source. Only changed partitions are cleared and re-extracted in the target.

```json
"synchronization": {
  "strategy": "checksum"
}
```

Use when you need to detect any change in a partition but lack a reliable monotonic column (for example, dimension tables). Note that this requires a checksum computation on the source for every partition on every run.

**Strategy: watermark**

Tracks a monotonic column (timestamp, ID, or version) to sync only rows where the watermark value is greater than the maximum observed in the previous sync.

```json
"synchronization": {
  "strategy": "watermark",
  "watermarkColumn": "UPDATED_AT"
}
```

Use when your table has a reliable monotonic column that increases on insert/update (for example, fact tables or event logs).

> **Note:**
>
> Watermark alone can’t currently track deletions. Support for this will be added in the future.

#### Example workflow: Redshift UNLOAD with Iceberg targets

The following workflow excerpt combines Redshift UNLOAD with Iceberg table targets, including Snowflake-managed Iceberg defaults and a per-table external catalog override:

```json
{
  "defaultTableConfiguration": {
    "source": {
      "schemaName": "public",
      "databaseName": "analytics_db"
    },
    "target": {
      "schemaName": "public",
      "databaseName": "TARGET_DB",
      "tableType": "iceberg",
      "icebergConfig": {
        "catalog": "SNOWFLAKE",
        "externalVolume": "my_iceberg_ext_vol",
        "baseLocationPrefix": "migrations/redshift",
        "sourceDataStage": "@TARGET_DB.PUBLIC.ICEBERG_SOURCE_STAGE"
      }
    },
    "extraction": {
      "strategy": "unload",
      "externalStage": "TARGET_DB.PUBLIC.S3_EXTERNAL_STAGE"
    },
    "partitionSize": "auto"
  },
  "tables": [
    {
      "source": { "tableName": "customers" },
      "target": { "tableName": "customers" },
      "columnNamesToPartitionBy": ["customer_id"]
    },
    {
      "source": { "tableName": "events" },
      "target": {
        "tableName": "events",
        "tableType": "iceberg",
        "icebergConfig": {
          "catalog": "my_glue_catalog_integration",
          "externalVolume": "my_iceberg_ext_vol",
          "catalogTableName": "glue_db.events"
        }
      },
      "columnNamesToPartitionBy": ["event_id"]
    }
  ]
}
```

#### Affinity

By specifying an affinity for a Workflow, you are indicating that you want specific workers to help with the execution of that Workflow. This can be particularly useful in cases in which you want to have some workers extract from one source and other workers extract from a different source. The rules for matching workers with tasks are:

* A task without affinity will be picked up by any worker, independently of the worker’s affinity.
* A worker without affinity will pick up any task, independently of the task’s affinity.
* A task with a given affinity will not be picked up by a worker with different affinity.

Affinity only needs to be a String; its format is defined by the user. For example, all of these are valid: `sql-server`, `DEV_SERVER`, `my_custom_server`, `::blue::`.

### Worker configuration

This file configures the behavior and connections for the Workers (`data_exchange_agent`). You must set `selected_task_source` to `"snowflake_stored_procedure"` as shown below.

| Section | Property | Type | Description |
| --- | --- | --- | --- |
| Top level | `selected_task_source` | `String` | Required. Must be `"snowflake_stored_procedure"`. |
| `[application]` | `max_parallel_tasks` | `Integer` | The maximum number of tasks the Worker will process in parallel using threads. |
| `[application]` | `task_fetch_interval` | `Integer` | The interval in seconds between attempts to fetch new tasks from the Orchestrator. |
| `[application]` | `affinity` | `String` | A user-defined affinity for the worker. |
| `[application]` | `snowflake_database_for_metadata` | `String` | Optional. Database where the Orchestrator created the task metadata objects (default `SNOWCONVERT_AI`). Must match the Orchestrator’s `CUSTOM_SNOWFLAKE_DATABASE_FOR_METADATA` if you override it. |
| `[application]` | `snowflake_schema_for_data_migration_metadata` | `String` | Optional. Schema for data migration task metadata (default `DATA_MIGRATION`). Must match the Orchestrator’s `CUSTOM_SNOWFLAKE_SCHEMA_FOR_DATA_MIGRATION_METADATA` if you override it. |
| `[connections.source.*]` | N/A | `Object` | Configuration for source systems. Workers typically require an ODBC driver to connect to the source system. |
| `[connections.target.snowflake_connection_name]` | `connection_name` | `String` | The name of the connection entry in the `~/.snowflake/config.toml` file to use. |

An example configuration file looks like this:

```toml
selected_task_source = "snowflake_stored_procedure"

[application]
max_parallel_tasks = 4
task_fetch_interval = 30
# Optional: only if the Orchestrator uses CUSTOM_SNOWFLAKE_* overrides for metadata location
# snowflake_database_for_metadata = "SNOWCONVERT_AI"
# snowflake_schema_for_data_migration_metadata = "DATA_MIGRATION"

# SQL Server connection (standard authentication)
[connections.source.sqlserver]
username = "username"
password = "password"
database = "database_name"
host = "127.0.0.1"
port = 1433

# Amazon Redshift connection (IAM authentication for provisioned cluster)
[connections.source.redshift]
username = "demo-user"
database = "snowconvert_demo"
auth_method = "iam-provisioned-cluster"
cluster_id = "migrations-aws"
region = "us-west-2"
access_key_id = "your-access-key-id"
secret_access_key = "your-secret-access-key"

# Amazon Redshift connection (standard authentication)
# [connections.source.redshift]
# username = "myuser"
# password = "mypassword"
# database = "mydatabase"
# host = "my-cluster.abcdef123456.us-west-2.redshift.amazonaws.com"
# port = 5439
# auth_method = "standard"

# Snowflake target connection
[connections.target.snowflake_connection_name]
connection_name = "connection_name"
```

> **Note:**
>
> Only one source connection is needed.

#### Source connection configuration examples

The following examples show the three main source connection configurations:

**1. SQL Server (Standard Authentication)**

```toml
[connections.source.sqlserver]
username = "username"
password = "password"
database = "database_name"
host = "127.0.0.1"
port = 1433
```

**2. Amazon Redshift (IAM Authentication)**

```toml
[connections.source.redshift-iam]
username = "demo-user"
database = "demo_db"
auth_method = "iam-provisioned-cluster"
cluster_id = "my-aws-cluster"
region = "us-west-2"
access_key_id = "your-access-key-id"
secret_access_key = "your-secret-access-key"
# Optional fields for UNLOAD strategy
# unload_s3_bucket = "my-migrations-bucket"
# unload_iam_role_arn = "arn:aws:iam::123456789012:role/MyRole"
```

**3. Amazon Redshift (Standard Authentication)**

```toml
[connections.source.redshift-standard]
username = "myuser"
password = "mypassword"
database = "mydatabase"
host = "my-cluster.abcdef123456.us-west-2.redshift.amazonaws.com"
port = 5439
auth_method = "standard"
# Optional fields for UNLOAD strategy
# unload_s3_bucket = "my-migrations-bucket"
# unload_iam_role_arn = "arn:aws:iam::123456789012:role/MyRole"
```

## Platform-specific Details

### Migrate Amazon Redshift data

In order to use the `UNLOAD` strategy for extraction of Amazon Redshift data, it will be necessary to set up multiple resources. This strategy enables the data to flow directly from Amazon Redshift into an S3 bucket and for Snowflake to execute COPY INTO operations directly from there (by creating an external stage that is mapped to that S3 bucket). This is faster than having the workers download the data and then upload it to a Snowflake stage.

#### Create a stage integration to S3

If you don’t have an existing stage configured, you need to create a Snowflake external stage that integrates with your S3 bucket. You can create the stage using the following SQL command in Snowflake:

```sql
CREATE OR REPLACE STAGE <stage_name>
  URL = 's3://<your_bucket_name>/<path>/'
  STORAGE_INTEGRATION = <your_storage_integration>
  FILE_FORMAT = (TYPE = 'PARQUET');
```

Alternatively, if you’re using AWS credentials directly:

```sql
CREATE OR REPLACE STAGE <stage_name>
  URL = 's3://<your_bucket_name>/<path>/'
  CREDENTIALS = (AWS_KEY_ID = '<your_aws_key_id>' AWS_SECRET_KEY = '<your_aws_secret_key>')
  FILE_FORMAT = (TYPE = 'PARQUET');
```

Replace the placeholders:

* `<stage_name>`: Your desired stage name (for example, `my_redshift_stage`)
* `<your_bucket_name>`: Your S3 bucket name
* `<path>`: Optional path within the bucket
* `<your_storage_integration>`: Your Snowflake storage integration name (recommended method)
* `<your_aws_key_id>` and `<your_aws_secret_key>`: Your AWS IAM user credentials (if not using storage integration)

> **Note:**
>
> Using a Snowflake storage integration is the recommended approach for better security and credential management. For more information about creating storage integrations, see the [Snowflake documentation](https://docs.snowflake.com/en/user-guide/data-load-s3-config-storage-integration).

#### Verify stage integration

After setting up your stage, verify that the integration is working correctly before proceeding with data migration. You can verify the stage integration by running the following command in Snowflake:

```sql
LIST @<stage_name>;
```

This command should execute successfully without errors. If the stage is newly created and empty, it may return no results, which is expected.

To perform a more thorough verification, you can test the stage by uploading a test file:

```sql
PUT file:///<local_test_file_path> @<stage_name>;
LIST @<stage_name>;
```

If the commands execute successfully and you can see the uploaded file, your stage integration is configured correctly.

> **Note:**
>
> You can also verify stage permissions by checking the stage description:
>
> ```sql
> DESCRIBE STAGE <stage_name>;
> ```
>
> This displays the stage configuration, including the URL, credentials type, and file format settings.

---
title: SnowConvert AI: Data validation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/data-validation.md
section: Migrations
---

# SnowConvert AI: Data validation

SnowConvert AI, as part of the end-to-end migration experience, provides the capability to validate your migrated data to ensure
that both the structure of the data and the data itself match the original source. This data validation feature is available for SQL Server databases.

## Data validation modes

To ensure that your data is successfully migrated to Snowflake, the data validation employs two distinct validation levels: schema validation and metrics validation.

### Schema validation

Schema validation confirms that the basic structure of your migrated table is preserved in Snowflake. It validates the following table attributes:

* Table name
* Column names
* Ordinal position of each column
* Data types
* Character maximum length for text columns
* Numeric precision and scale for numeric columns
* Row Count

### Metrics validation

Metrics validation confirms that the data itself matches the original source. Metrics validation compares aggregate metrics between each original table and the corresponding new Snowflake table. Although the specific metrics can vary by column data type, metrics validation evaluates the following items:

* Minimum value
* Maximum value
* Average
* Nulls count
* Distinct count
* Standard deviation
* Variance

## Validate migrated data

> **Warning:**
>
> For accurate validation and to avoid false negatives, don’t alter the migrated data during the validation process.

For SQL Server migrations, validation includes an optional step within the process. This step validates the data after you
use SnowConvert AI to move it.

### Prerequisites

This feature requires a version of Python that meets the following requirements to be installed and available in your PATH:

* Greater than or equal to 3.10.
* Lower than or equal to 3.13.

To verify that a supported Python version is available in your PATH:

1. In your terminal (or Command Prompt on Windows), run `python --version`.
2. Confirm that the Python version meets the requirements that are mentioned earlier.

Complete the following steps to validate your migrated data:

1. In SnowConvert AI, open **Validate data** in one of the following ways:

   * Complete the [data migration process](data-migration.md), and then select **Go to data validation**.
   * In your project, select **Data validation**.
2. On the **Connect to source database** page, complete the fields with the connection information for your source
   database, select **Test connection**, and then select **Continue**.
3. Select the objects that you want to validate.

   The following image is an example of the page:
4. Select **Validate data**.

   The validation process starts.

   When validation completes successfully and no differences are found, SnowConvert AI displays a message confirming that no
   differences were found.

   If differences are found in the migrated data, SnowConvert AI generates a report and displays a summary of the discrepancies in the tables.

   The following image is an example of a validation report:

   Also, a CSV file report is generated so you can visualize and share it.

   The validation results are classified into three categories:

   | Category | Description |
   | --- | --- |
   |  | Values match exactly between the source database and Snowflake. |
   |  | Snowflake table has minor differences that don’t affect the data, such as higher numeric precision. |
   |  | Values don’t match between the original database and the Snowflake database. |

   Finally, you can open the reports folder to access the generated CSV reports:

---
title: SnowConvert AI: Deployment
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/deployment.md
section: Migrations
---

# SnowConvert AI: Deployment

SnowConvert AI, as part of the end-to-end migration experience, offers the option to deploy converted database objects directly to your Snowflake environment. With this deployment feature, you can review conversion results, authenticate to Snowflake, and deploy selected objects with their proper dependencies and execution order. This deployment feature is available for SQL Server and Amazon Redshift databases.

## Conversion status indicators

Before deployment, SnowConvert AI provides visual indicators to help you understand the conversion status of each object:

### Ready for Deployment

Objects that have been successfully converted and are ready for deployment without any issues.

### Functional Data Model (FDM) Warnings

Objects with FDMs have been found in the conversion. It is recommended to review these before deployment, though they can still be deployed.

### Equivalent Work Item (EWI) Errors

Objects with EWIs have critical issues that must be fixed before deployment. These objects cannot be deployed until the issues are resolved.

> **Note:**
>
> For detailed information about FDMs and EWIs, please refer to the [SnowConvert AI Technical Documentation](../technical-documentation/README.md).

## Supported authentication methods

The deployment process supports two authentication methods to connect to your Snowflake environment:

### SSO (Single Sign-On)

Allows authentication using your organization’s Single Sign-On provider configured with Snowflake. This method provides seamless integration with your existing identity management system.

### Standard authentication

Traditional username and password authentication with the following security requirements:

* Multi-factor authentication (MFA) must be enabled for your Snowflake account
* Follows Snowflake’s security best practices and recommendations

> **Note:**
>
> The account identifier must use **-** for separation instead of **.** (for example, **orgname-account-name**).

> **Warning:**
>
> Ensure that you follow [Snowflake’s security recommendations](https://community.snowflake.com/s/article/Snowflake-Security-Overview-and-Best-Practices) and have [multi-factor authentication (MFA)](https://docs.snowflake.com/en/user-guide/ui-snowsight-profile.html#label-snowsight-set-up-mfa) enabled.

## Deployment execution order

The deployment process executes database objects in a specific order to maintain proper dependencies:

1. **Databases**: Created first to establish the container structure.
2. **Schemas**: Created within databases to organize objects.
3. **Tables**: Created to establish data structures.
4. **Views**: Created after tables because they depend on table structures.
5. **Functions**: Deployed to provide reusable logic.
6. **Stored Procedures**: Deployed last as they may reference other objects.

## Deploy converted database objects to Snowflake

You can deploy your converted database objects to Snowflake. After deployment, you can proceed with data migration
to complete the end-to-end migration process.

Ensure that you meet the following prerequisites before deploying converted objects:

* You completed conversion process with objects ready for deployment.
* You have a valid Snowflake account with appropriate permissions.
* You have multi-factor authentication (MFA) enabled (for Standard authentication).

Complete the following steps to deploy converted objects:

1. In SnowConvert AI, open the project, and then select **Deploy code**.
2. On the **Connect to Snowflake** page, complete the fields with your connection information, and then select **Sign in**.

   The **Select objects to deploy** page appears. The following image is an example of the page:
3. Review the conversion status and resolve any errors before proceeding.

   Examine the status indicators for each object in your project. Resolve any EWI errors before proceeding.
4. Select the objects that you want to deploy to Snowflake.

   Only select objects with successful conversion status or acceptable FDM warnings.

   > **Note:**
   >
   > If you make changes to object files, you can refresh the conversion status by selecting **Refresh files**.
5. Select **Deploy**.

   The deployment process starts. Objects are deployed automatically, in the proper dependency order. When deployment
   finishes, the **Deployment results** window appears.
6. In **Deployment results**, review the results for success confirmations or error messages.

   The following image is an example of a **Deployment results** window:

> **Note:**
>
> Only successfully converted objects are available for deployment. You must resolve objects with EWI errors before you can deploy them.

After completing the deployment process, your database objects are available in your Snowflake environment. You can then proceed
with [data migration](data-migration.md) to transfer your data to complete the full migration process and make everything ready for use in your
applications and workflows.

---
title: SnowConvert AI: Extraction
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/extraction.md
section: Migrations
---

# SnowConvert AI: Extraction

SnowConvert AI provides the following options for extracting database objects:

* Extract code: For SQL Server and Amazon Redshift databases, use this option if you don’t already have code for extracted database objects.
* Load existing code: For any source database, use this option if you already have code for extracted database objects.
  You might already have code if you previously extracted the code by using SnowConvert AI or if you generated the code yourself.

## Extract code

SnowConvert AI, as part of the end-to-end migration experience, offers the option to extract database objects from your source system to prepare them for the conversion process. This extraction feature is available for SQL Server and Amazon Redshift databases, letting you connect to your source database, browse the catalog, and select specific objects for migration.

The extraction process supports different database objects depending on your source database type:

**SQL Server Objects**

* Tables
* Views
* Functions
* Stored procedures

**Amazon Redshift Objects**

* Tables
* Views
* Materialized views
* Stored procedures

To extract database objects from a source system, complete the following steps:

1. [Create the project](project-creation.md), and then select **Continue**.
2. On the **Add code to your project** page,select **Extract code**.

   The **Set up code and ETL/BI projects** page appears. The following image is an example of the page
   for SQL Server:
3. In **Authentication Method**, select the authentication method that you want to use to connect to the source system.
   The following authentication methods are supported:

   * **SQL Server**

     + **Standard Authentication**
     + **Windows Authentication**
   * **Amazon Redshift**

     + **Standard Authentication**
     + **IAM Provisioned Cluster**
     + **IAM Serverless**

   When you select an authentication method, the other fields on the page are refreshed based on the method.
4. To provide appropriate information for your authentication method, complete the remaining fields.
5. For SQL Server migrations, both authentication methods require verification about whether the following security settings
   are configured for your source database:

   * **Trust server certificate**: Enable if the database requires trusted certificate validation.
   * **Encrypt connection**: Enable if the database requires encrypted connections.
6. In **Where should the converted code be saved?**, select **Browse**, and then choose a location for the code.
7. In **Have ETL projects or BI/reports?**, select the options that you want to use for
   [replatforming](etl-migration-replatform.md) or [repointing](power-bi-repointing-general.md), and then
   select the corresponding files.
8. Select **Continue**.
9. On the **Select objects to extract** page, choose the objects that you want to migrate.

   The system retrieves the metadata and structure information for the selected objects.

   The following image shows a sample **Select objects to extract** page:
10. Select **Extract objects**.
11. Review the extraction results.

    The following image shows a sample **Extraction resultst** window:

When extraction is complete, move on to Next steps.

## Load existing code

To load existing code for extracted database objects, complete the following steps:

1. [Create the project](project-creation.md), and select **Continue**.
2. On the **Add code to your project** page, select **Already have code**.

   The **Set up code and ETL/BI projects** page appears. The following image is an example of the page
   for SQL Server:
3. On the **Set up code and ETL/BI projects** page, specify the following information:

   * **Where is your source code?**: Select **Browse**, and then choose the location of your source code.
   * **Where should the converted code be saved?**: Select **Browse**, and then choose the location of your converted code.
4. In **Have ETL projects or BI/reports?**, select the options that you want to use for
   [replatforming](etl-migration-replatform.md) or [repointing](power-bi-repointing-general.md), and then
   select the corresponding files.
5. Select **Continue**.

When extraction is complete, move on to Next steps.

## Next steps

> **Note:**
>
> Only successfully extracted objects will be available for the subsequent conversion step.

After completing the extraction process, you can proceed to the [conversion process](../getting-started/running-snowconvert/conversion/README.md) to transform your database objects for Snowflake compatibility.

---
title: SnowConvert AI: Power BI Repointing
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/power-bi-repointing-general.md
section: Migrations
---

# SnowConvert AI: Power BI Repointing

This guide provides comprehensive instructions on utilizing Snowconvert AI for Power BI repointing to Snowflake. It details the process of migrating your existing Power BI reports and dashboards to leverage Snowflake as their underlying data source. You will learn how to prepare your Power BI reports, execute the Snowconvert AI tool, and validate the repointed reports to ensure seamless integration with Snowflake.

SnowConvert AI provides a new option to redefine their Power BI connections to the migrated databases in Snowflake. This redefinition of connections is called repointing. Repointing is executed inside the SnowConvert AI migration logic and uses the migration context to identify and migrate correctly embedded SQL queries.

## How To Use The Tool

> **Note:**
>
> Notice that this feature only supports Power BI reports with the extension **.pbit**. Before starting, please save your reports to **.pbit** extension.

### Prerequisites

Before you begin, ensure you have the following:
SnowConvert AI: You need to have the tool installed. You can access it on the [SnowConvert AI page](https://www.snowflake.com/en/migrate-to-the-cloud/snowconvert-ai/).
Power BI reports: You need to download your reports and save them with the .pbit format.

#### How To Save A .Pbit Correctly

1. Open your report (.pbix) file and allow it to load.
2. Click on “File”.

3. Then click on “Save as”.

4. Then click on “Browse this device”.

5. Select the location to be saved and the extension as .pbit.

6. Click on “Save”.

7. Optionally, add a description and click on “Ok”.

### Migration steps

1. Locate all Power BI reports with .pbit extension in a folder.
2. In the SnowConvert AI app, add the path of the Power BI projects in the “Where is your SSIS/Power BI project(s)?” section.
3. Continue the migration steps as normally.

4. Reports: In the output folder, you can review the report named ETLAndBiRepointing about the repointing transformation.
5. Access: In the output folder, you can review the “repointing_output” to access the Power BI repointing reports.
6. **Execution**: Before opening your reports, it is important to run all your migrated DDLs in your Snowflake account. Otherwise, the object will not be retrieved because they do not exist in the Snowflake account. So, follow the next steps:

   1. Run your migrated queries.
   2. Open your Power BI report.
   3. Fill in the Power BI parameters required: SF_SERVER_LINK, SF_DB_NAME, and SF_WAREHOUSE_NAME. For more information, please review the following [Power BI parameters documentation](https://learn.microsoft.com/en-us/power-query/power-query-query-parameters).

4. Click on load and wait until the report loads the information.
5. Provide your account credentials to the Power BI app. Additionally, if you have two-factor authentication, you may be asked to accept every connection request from Power BI. Be aware that there may be several pop-ups for authorization.
6. Review the ETLAndBiRepointing report and resolve every data entity with issues.
7. Double-check functionality.
8. Refresh the data and save your report in the format of your preference. It is now ready to be shared.

## Project structure

SnowConvert AI provides a new option to redefine their Power BI connections to the migrated databases in Snowflake. This redefinition of connections is referred to as repointing. Repointing is executed within the SnowConvert AI migration logic and utilizes the migration context to identify and migrate embedded SQL queries correctly.

Please refer to the specific source language Snowflake documentation that you are repointing:

1. [SQL Server](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/etl-bi-repointing/power-bi-transact-repointing)
2. [Oracle](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/oracle/etl-bi-repointing/power-bi-oracle-repointing)
3. [Teradata](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/teradata/etl-bi-repointing/power-bi-teradata-repointing)
4. [Redshift](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/redshift/etl-bi-repointing/power-bi-redshift-repointing)
5. [Azure Synapse](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/transact/etl-bi-repointing/power-bi-transact-repointing)
6. [PostgreSQL](https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/postgres/etl-bi-repointing/power-bi-postgres-repointing)

### Output structure overview

The output structure will resemble this and will include the repointed reports. The repointing output folder named repointing_output will contain the repointed reports.

Additionally, a dedicated folder containing the extracted queries will be provided, named power_bi_sql_queries. This folder serves a crucial purpose: to allow for a thorough double-check of all embedded SQL statements. These SQL statements will have been meticulously extracted from the applicable connectors within the Power BI environment.

```sql
Output/
├── repointing_output/
│   ├── report1.pbit
│   ├── report2.pbit
│   └── reportN.pbit
└── power_bi_sql_queries/
    ├── query1.sql
    ├── query2.sql
    └── queryN.sql
```

On the other hand, in the input folder will remain the non-migrated SQL files from every single connector. If there is a need review of these.

```sql
 Input/
└── power_bi_sql_queries/
    ├── query1.sql
    ├── query2.sql
    └── queryN.sql
```

## Support Capabilities

### The current version supports

1. Repointing of tables, views, and embedded SQL queries.
2. Maintain the remaining logic steps after the connection steps in the M Language (multiple lines).
3. Provides parameters inside Power BI to handle information correctly for Snowflake server link, warehouse and database name.
4. Convert queries saved as expressions (when the “Enable load” property has been disabled).
5. Renaming of columns based on related DDLs on the migration or by Power BI report references if DDLs are not provided.
6. Identification of views, if related DDLs are provided in the migration.
7. Multiple databases and schema repointing if these are using the selected platform connector in SnowConvert AI.

### Considerations

1. The schema name of the source connections is being used as the schema in the repointed connection. It is assumed that the Snowflake database objects were created under the same schema.
2. The database objects must be deployed in Snowflake before trying to open the repointed report.
3. If the column renaming step in the M Language is empty, it means that no information was found in the migration context or Power BI project references to create it.
4. Functions and procedures are not supported in connectors different from SQL Server and Azure Synapse, so these cases are not supported.
5. All found database connections related to the source language in the migration settings will be repointed, and [parameters](https://learn.microsoft.com/en-us/power-query/power-query-query-parameters) will be added.
6. Notice that other connections from other sources rather than the selected in the migration settings, are not being edited.

## Migration Reports

The ETLAndBiRepointing contains information about the repointing process. There are connectors that are not applicable for repointing, such as CSV files, JSON files, and SharePoint connections. These non-applicable connectors are unlikely to be edited, but it is recommended to double-check. It looks like the following sample:

## Troubleshooting

1. If the user does not enter the requested global parameters after repointing, the load of objects is not triggered by Power BI; therefore, ensure that the parameter information is added. If
2. If the user clicks Cancel and the reports do not load, it is recommended to close and reopen the report.
3. If a visualization does not load, it may be because a column definition does not match the text case. Notice that the Snowflake connector from Power BI retrieves the entities and columns always in uppercase.
4. If you experience issues with the credential cache, you can navigate to the settings in Power BI and clear the connection to enter new credentials.
5. There may be problems with complex SQL queries after migration. These cases may require extra work to solve warning messages from the migration process (EWI - PRF - FDM).

## Limitations

1. Dynamic SQL embedded in connectors.
2. Column renaming is crucial for visualization loading. This renaming is not guaranteed to be precise due to limitations in the processed information. If no columns are found during the repointing, the default is to rename the columns based on a predefined case sensitivity. The default is uppercase because the native Snowflake connector retrieves all information in uppercase.

---
title: SnowConvert AI: Project Creation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/project-creation.md
section: Migrations
---

# SnowConvert AI: Project Creation

SnowConvert AI manages migrations by using SnowConvert AI projects. A SnowConvert AI project contains metadata about migrating a data set into Snowflake, such as the source database platform, input files, and output location. You can save and reuse these settings for multiple conversion runs.

To create a new project, complete the following steps:

1. Open SnowConvert AI, and then select **New Project**.

   The **New Project** page appears:
2. Enter the following information:

   * **Project name**: The name of the project.
   * **Select source**: The type of the source database system; for example, SQL Server, Oracle, and so on.
3. To use Snowflake Cortex to help verify and fix migration objects, turn on **AI features**.

   If you don’t want to use Snowflake Cortex for the migration, turn off **AI features**.

   For more information about Snowflake Cortex, see [Snowflake AI and ML](https://docs.snowflake.com/en/guides-overview-ai-features).
4. For **Snowflake connection**, optionally enter the information required to connect to your Snowflake account.
5. Select **Save Project**.

After creating a project, you can proceed to the [extraction process](extraction.md) to extract database objects from your source system to prepare them for the conversion process.

---
title: Snowflake Commands Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/snowflake_commands.md
section: Migrations
---

# Snowflake Commands Reference

## Overview

This page provides comprehensive reference documentation for Snowflake-to-Snowflake validation commands in the Snowflake Data Validation CLI. This feature enables validation between different Snowflake accounts, regions, or databases—useful for cross-account migrations, region migrations, or verifying data replication.

For other source platforms, see [SQL Server Commands Reference](sqlserver_commands.md), [Teradata Commands Reference](teradata_commands.md), or [Redshift Commands Reference](redshift_commands.md).

---

## Command Structure

All Snowflake commands follow this consistent structure:

```bash
snowflake-data-validation snowflake <command> [options]

# Or use the shorter alias
sdv snowflake <command> [options]
```

Where `<command>` is one of:

* `run-validation` - Run synchronous validation
* `run-async-validation` - Run asynchronous validation
* `generate-validation-scripts` - Generate validation scripts
* `get-configuration-files` - Get configuration templates
* `auto-generated-configuration-file` - Interactive config generation
* `row-partitioning-helper` - Interactive row partitioning configuration
* `column-partitioning-helper` - Interactive column partitioning configuration
* `source-validate` - Execute validation on source only and save results as Parquet files

---

## Run Synchronous Validation

Validates data between source and target Snowflake databases in real-time.

### Syntax

```bash
snowflake-data-validation snowflake run-validation \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file containing validation settings
* **Example:** `--data-validation-config-file ./configs/snowflake_validation.yaml`

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Basic validation
sdv snowflake run-validation \
  --data-validation-config-file ./configs/snowflake_validation.yaml

# Validation with debug logging
sdv snowflake run-validation \
  --data-validation-config-file ./configs/snowflake_validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation snowflake run-validation \
  -dvf /opt/validations/prod_config.yaml \
  -ll INFO
```

### Use Cases

* Cross-account Snowflake migration validation
* Cross-region data replication verification
* Database copy validation within the same account
* Pre-cutover validation checks
* Post-migration verification
* Continuous validation in CI/CD pipelines

---

## Run Asynchronous Validation

Performs validation using pre-generated metadata files without connecting to databases.

### Syntax

```bash
snowflake-data-validation snowflake run-async-validation \
  --data-validation-config-file /path/to/config.yaml
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file
* **Note:** Configuration must specify paths to pre-generated metadata files

### Example Usage

```bash
# Run async validation
sdv snowflake run-async-validation \
  --data-validation-config-file ./configs/async_validation.yaml

# Using full command name
snowflake-data-validation snowflake run-async-validation \
  -dvf /data/validations/async_config.yaml
```

### Prerequisites

Before running async validation:

1. Generate validation scripts using `generate-validation-scripts`
2. Execute the generated scripts on source and target Snowflake databases
3. Save results to metadata files
4. Ensure metadata files are available in the configured paths

### Use Cases

* Validating in environments with restricted database access
* Separating metadata extraction from validation
* Batch validation workflows
* Scheduled validation jobs
* When database connections are intermittent

---

## Source Validate

Executes validation queries on the source Snowflake database only and saves results as Parquet files for later comparison without needing source database access.

### Syntax

```bash
snowflake-data-validation snowflake source-validate \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Run source validation
sdv snowflake source-validate \
  --data-validation-config-file ./configs/snowflake_validation.yaml

# Source validation with debug logging
sdv snowflake source-validate \
  --data-validation-config-file ./configs/snowflake_validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation snowflake source-validate \
  -dvf /opt/configs/validation.yaml \
  -ll INFO
```

### Output

The command generates Parquet files in the configured output directory containing:

* Schema metadata from source tables
* Metrics data (row counts, statistics)
* Row-level data for comparison (if row validation is enabled)

### Use Cases

* **Offline validation**: Extract source data once, validate multiple times
* **Network-restricted environments**: Export data when source is accessible, validate later
* **Performance optimization**: Separate data extraction from comparison
* **Archival purposes**: Keep point-in-time snapshots of source metadata
* **Cross-environment validation**: Extract from production, validate in development

---

## Generate Validation Scripts

Generates SQL scripts for Snowflake metadata extraction that can be executed separately.

### Syntax

```bash
snowflake-data-validation snowflake generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file

### Example Usage

```bash
# Generate scripts
sdv snowflake generate-validation-scripts \
  --data-validation-config-file ./configs/validation.yaml

# Using full command name
snowflake-data-validation snowflake generate-validation-scripts \
  -dvf /opt/configs/script_generation.yaml
```

### Output

The command generates SQL scripts in the output directory configured in your YAML file:

```text
<output_directory>/
├── source_schema_queries.sql
├── source_metrics_queries.sql
├── source_row_queries.sql
├── target_schema_queries.sql
├── target_metrics_queries.sql
└── target_row_queries.sql
```

### Use Cases

* Generating scripts for execution by DBAs
* Compliance requirements for query review
* Environments where direct CLI database access is restricted
* Manual execution and validation workflows
* Separating metadata extraction from validation

---

## Get Configuration Templates

Retrieves Snowflake configuration templates for validation setup.

### Syntax

```bash
snowflake-data-validation snowflake get-configuration-files \
  --templates-directory ./snowflake-templates \
  --query-templates
```

### Options

**`--templates-directory, -td`** (optional)

* **Type:** String (path)
* **Default:** Current directory
* **Description:** Directory to save template files
* **Example:** `--templates-directory ./templates`

**`--query-templates`** (optional)

* **Type:** Flag (no value required)
* **Description:** Include J2 (Jinja2) query template files for advanced customization
* **Example:** `--query-templates`

### Example Usage

```bash
# Get basic templates in current directory
sdv snowflake get-configuration-files

# Save templates to specific directory
sdv snowflake get-configuration-files \
  --templates-directory ./my-project/snowflake-templates

# Include query templates for customization
sdv snowflake get-configuration-files \
  --templates-directory ./templates \
  --query-templates

# Using short flags
sdv snowflake get-configuration-files -td ./templates --query-templates
```

### Output Files

**Without `--query-templates` flag:**

```text
<templates_directory>/
└── snowflake_validation_template.yaml
```

**With `--query-templates` flag:**

```text
<templates_directory>/
├── snowflake_validation_template.yaml
└── query_templates/
    ├── snowflake_columns_metrics_query.sql.j2
    ├── snowflake_row_count_query.sql.j2
    └── snowflake_compute_md5_sql.j2
```

### Use Cases

* Starting a new Snowflake-to-Snowflake validation project
* Learning Snowflake-specific configuration options
* Customizing validation queries
* Creating organization-specific templates

---

## Auto-Generate Configuration File

Interactive command to generate a configuration file by prompting for Snowflake connection parameters.

### Syntax

```bash
snowflake-data-validation snowflake auto-generated-configuration-file
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### Interactive Prompts

The command will prompt for the following information:

1. **Snowflake Named Connection name**

   * Name of pre-configured Snowflake connection
   * Default: `default`
   * Example: `my_snowflake_connection`
2. **Snowflake database**

   * Name of the database to validate
   * Example: `PRODUCTION_DB`
3. **Snowflake schema**

   * Schema name within the database
   * Example: `PUBLIC`
4. **Output path for configuration file**

   * Where to save the generated YAML file
   * Example: `./configs/snowflake_config.yaml`

### Example Session

```bash
$ sdv snowflake auto-generated-configuration-file

Generating basic configuration file for Snowflake validation...
Please provide the following connection information:

Snowflake Named Connection name [default]: prod_connection
Snowflake database: PRODUCTION_DB
Snowflake schema: PUBLIC
Output path for the configuration file: ./configs/snowflake_validation.yaml

Configuration file generated successfully!
```

### Generated Configuration

The command generates a basic YAML configuration file:

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./validation_results

source_connection:
  mode: name
  name: prod_connection

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables: []
```

### Next Steps After Generation

1. **Edit the configuration file** to add:

   * Target connection details (if not using default)
   * Tables to validate
   * Validation options
   * Column selections and mappings
2. **Review connection settings:**

   * Verify source and target connection names
   * Consider using environment variables for sensitive data
3. **Add table configurations:**

   * Specify fully qualified table names
   * Configure column selections
   * Set up filtering where clauses
4. **Test the configuration:**

   ```bash
   sdv snowflake run-validation \
     --data-validation-config-file ./configs/snowflake_validation.yaml
   ```

### Use Cases

* Quick setup for new Snowflake-to-Snowflake users
* Generating baseline configurations
* Testing connectivity during setup
* Creating template configurations for teams

---

## Row Partitioning Helper

Interactive command to generate partitioned table configurations for large tables. This helper divides tables into smaller row partitions based on a specified column, enabling more efficient validation of large datasets.

### Syntax

```bash
snowflake-data-validation snowflake row-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The row partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply partitioning
3. If partitioning is enabled, collects partition parameters
4. Queries the source Snowflake database to determine partition boundaries
5. Generates new table configurations with `WHERE` clauses for each partition
6. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/snowflake_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply partitioning?** (yes/no)

   * Whether to partition this specific table
   * Default: yes

   b. **Partition column** (if partitioning)

   * Column name used to divide the table
   * Should be indexed or clustered for performance
   * Example: `transaction_id`, `created_date`

   c. **Is partition column a string type?** (yes/no)

   * Determines quoting in generated WHERE clauses
   * Default: no (numeric)

   d. **Number of partitions**

   * How many partitions to create
   * Example: `10`, `50`, `100`

### Example Session

```bash
$ sdv snowflake row-partitioning-helper

Generate a configuration file for Snowflake table partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip partitioning or specify partitioning parameters for each table.

Configuration file path: ./configs/snowflake_validation.yaml

Apply partitioning for PROD_DB.PUBLIC.FACT_SALES? [Y/n]: y
Write the partition column for PROD_DB.PUBLIC.FACT_SALES: SALE_ID
Is 'SALE_ID' column a string type? [y/N]: n
Write the number of partitions for PROD_DB.PUBLIC.FACT_SALES: 10

Apply partitioning for PROD_DB.PUBLIC.DIM_CUSTOMER? [Y/n]: n

Apply partitioning for PROD_DB.PUBLIC.TRANSACTIONS? [Y/n]: y
Write the partition column for PROD_DB.PUBLIC.TRANSACTIONS: TRANSACTION_DATE
Is 'TRANSACTION_DATE' column a string type? [y/N]: n
Write the number of partitions for PROD_DB.PUBLIC.TRANSACTIONS: 5

Table partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with WHERE clauses:

```yaml
tables:
  # Original table partitioned into 10 segments
  - fully_qualified_name: PROD_DB.PUBLIC.FACT_SALES
    where_clause: "SALE_ID >= 1 AND SALE_ID < 100000"
    target_where_clause: "SALE_ID >= 1 AND SALE_ID < 100000"
    # ... other table settings preserved

  - fully_qualified_name: PROD_DB.PUBLIC.FACT_SALES
    where_clause: "SALE_ID >= 100000 AND SALE_ID < 200000"
    target_where_clause: "SALE_ID >= 100000 AND SALE_ID < 200000"
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: PROD_DB.PUBLIC.DIM_CUSTOMER
    # ... original configuration
```

### Use Cases

* **Large table validation**: Break multi-billion row tables into manageable chunks
* **Parallel processing**: Enable concurrent validation of different partitions
* **Memory optimization**: Reduce memory footprint by processing smaller data segments
* **Incremental validation**: Validate specific data ranges independently
* **Performance tuning**: Optimize validation for tables with uneven data distribution

### Best Practices

1. **Choose appropriate partition columns:**

   * Use clustered columns for better query performance
   * Prefer columns with sequential values (IDs, timestamps)
   * Avoid columns with highly skewed distributions
2. **Determine optimal partition count:**

   * Consider table size and available resources
   * Start with 10-20 partitions for tables with 10M+ rows
   * Increase partitions for very large tables (100M+ rows)
3. **String vs numeric columns:**

   * Numeric columns are generally more efficient
   * String columns work but may have uneven distribution
4. **After partitioning:**

   * Review generated WHERE clauses
   * Adjust partition boundaries if needed
   * Test with a subset before full validation

---

## Column Partitioning Helper

Interactive command to generate partitioned table configurations for wide tables with many columns. This helper divides tables into smaller column partitions, enabling more efficient validation of tables with a large number of columns.

### Syntax

```bash
snowflake-data-validation snowflake column-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The column partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply column partitioning
3. If partitioning is enabled, collects the number of partitions
4. Queries the source Snowflake database to retrieve all column names for the table
5. Divides the columns into the specified number of partitions
6. Generates new table configurations where each partition validates only a subset of columns
7. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/snowflake_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply column partitioning?** (yes/no)

   * Whether to partition this specific table by columns
   * Default: yes

   b. **Number of partitions** (if partitioning)

   * How many column partitions to create
   * Example: `3`, `5`, `10`

### Example Session

```bash
$ sdv snowflake column-partitioning-helper

Generate a configuration file for Snowflake column partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip column partitioning or specify column partitioning parameters for each table.

Configuration file path: ./configs/snowflake_validation.yaml

Apply column partitioning for PROD_DB.PUBLIC.WIDE_TABLE? [Y/n]: y
Write the number of partitions for PROD_DB.PUBLIC.WIDE_TABLE: 5

Apply column partitioning for PROD_DB.PUBLIC.SMALL_TABLE? [Y/n]: n

Apply column partitioning for PROD_DB.PUBLIC.REPORT_TABLE? [Y/n]: y
Write the number of partitions for PROD_DB.PUBLIC.REPORT_TABLE: 3

Column partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with column subsets:

```yaml
tables:
  # Original table with 100 columns partitioned into 5 segments (20 columns each)
  - fully_qualified_name: PROD_DB.PUBLIC.WIDE_TABLE
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - COLUMN_A
      - COLUMN_B
      - COLUMN_C
      # ... first 20 columns alphabetically

  - fully_qualified_name: PROD_DB.PUBLIC.WIDE_TABLE
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - COLUMN_D
      - COLUMN_E
      - COLUMN_F
      # ... next 20 columns alphabetically
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: PROD_DB.PUBLIC.SMALL_TABLE
    # ... original configuration
```

### Use Cases

* **Wide table validation**: Break tables with hundreds of columns into manageable chunks
* **Memory optimization**: Reduce memory footprint by validating fewer columns at a time
* **Parallel processing**: Enable concurrent validation of different column groups
* **Targeted validation**: Validate specific column groups independently
* **Performance tuning**: Optimize validation for tables with many VARIANT or complex columns

### Best Practices

1. **Determine optimal partition count:**

   * Consider the total number of columns in the table
   * For tables with 50+ columns, start with 3-5 partitions
   * For tables with 100+ columns, consider 5-10 partitions
2. **Column ordering:**

   * Columns are divided alphabetically
   * Related columns may end up in different partitions
3. **After partitioning:**

   * Review generated column lists
   * Verify all required columns are included
   * Test with a subset before full validation
4. **Combine with row partitioning:**

   * For very large, wide tables, consider using both row and column partitioning
   * First partition by columns, then apply row partitioning to each column partition if needed

---

## Snowflake Connection Configuration

Snowflake connections support multiple modes for both source and target databases.

### Connection Modes

#### Option 1: Named Connection

Use a pre-configured Snowflake connection saved in your Snowflake connections file.

```yaml
source_connection:
  mode: name
  name: "my_source_connection"

target_connection:
  mode: name
  name: "my_target_connection"
```

**Fields:**

* **`mode`** (required): Must be `"name"`
* **`name`** (required): Name of the saved Snowflake connection

#### Option 2: Default Connection

Use the default Snowflake connection from your environment.

```yaml
source_connection:
  mode: default

target_connection:
  mode: default
```

**Fields:**

* **`mode`** (required): Must be `"default"`

#### Option 3: Credentials Mode (IPC Only)

> **Note:** The `credentials` mode is only available when using IPC (Inter-Process Communication) commands directly via CLI parameters, not in YAML configuration files. This mode is exclusive to the SnowConvert UI.

### Connection Examples

**Same Account, Different Databases:**

```yaml
source_connection:
  mode: name
  name: prod_connection

target_connection:
  mode: name
  name: prod_connection  # Same connection, different database specified in tables
```

**Cross-Account Validation:**

```yaml
source_connection:
  mode: name
  name: source_account_connection

target_connection:
  mode: name
  name: target_account_connection
```

**Cross-Region Migration:**

```yaml
source_connection:
  mode: name
  name: us_east_connection

target_connection:
  mode: name
  name: eu_west_connection
```

**Development to Production Comparison:**

```yaml
source_connection:
  mode: name
  name: dev_connection

target_connection:
  mode: name
  name: prod_connection
```

### Setting Up Named Connections

Snowflake connections are typically configured using the Snowflake CLI or SnowSQL configuration files.

**SnowSQL Configuration Example (`~/.snowsql/config`):**

```ini
[connections.prod_connection]
accountname = myaccount.us-east-1
username = my_user
password = my_password
dbname = PRODUCTION_DB
schemaname = PUBLIC
warehousename = COMPUTE_WH

[connections.dev_connection]
accountname = myaccount.us-east-1
username = my_user
password = my_password
dbname = DEVELOPMENT_DB
schemaname = PUBLIC
warehousename = DEV_WH
```

**Snowflake CLI Configuration Example (`~/.snowflake/connections.toml`):**

```toml
[prod_connection]
account = "myaccount.us-east-1"
user = "my_user"
password = "my_password"
database = "PRODUCTION_DB"
schema = "PUBLIC"
warehouse = "COMPUTE_WH"

[dev_connection]
account = "myaccount.us-east-1"
user = "my_user"
password = "my_password"
database = "DEVELOPMENT_DB"
schema = "PUBLIC"
warehouse = "DEV_WH"
```

---

## Complete Snowflake Examples

### Example 1: Basic Snowflake-to-Snowflake Configuration

```yaml
# Global configuration
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./validation_results
max_threads: auto

# Source connection (development)
source_connection:
  mode: name
  name: dev_connection

# Target connection (production)
target_connection:
  mode: name
  name: prod_connection

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Tables to validate
tables:
  - fully_qualified_name: DEV_DB.PUBLIC.CUSTOMERS
    target_database: PROD_DB
    target_schema: PUBLIC
    target_name: CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - CUSTOMER_ID

  - fully_qualified_name: DEV_DB.PUBLIC.ORDERS
    target_database: PROD_DB
    target_schema: PUBLIC
    target_name: ORDERS
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - INTERNAL_NOTES
      - AUDIT_LOG
```

### Example 2: Cross-Account Migration Validation

```yaml
# Global configuration
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: /opt/validation/cross_account
max_threads: 16

# Source connection (Account A)
source_connection:
  mode: name
  name: account_a_connection

# Target connection (Account B)
target_connection:
  mode: name
  name: account_b_connection

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200

# Comparison configuration
comparison_configuration:
  tolerance: 0.01

# Logging configuration
logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

# Database mappings (if names differ between accounts)
database_mappings:
  ANALYTICS_A: ANALYTICS_B
  WAREHOUSE_A: WAREHOUSE_B

# Schema mappings
schema_mappings:
  RAW: RAW_DATA
  STAGING: STAGING_DATA

# Tables configuration
tables:
  - fully_qualified_name: ANALYTICS_A.RAW.FACT_SALES
    target_database: ANALYTICS_B
    target_schema: RAW_DATA
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - SALE_ID
    chunk_number: 50
    max_failed_rows_number: 500

  - fully_qualified_name: ANALYTICS_A.RAW.DIM_CUSTOMER
    target_database: ANALYTICS_B
    target_schema: RAW_DATA
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - INTERNAL_SCORE
      - RISK_RATING
    where_clause: "STATUS = 'ACTIVE'"
    target_where_clause: "STATUS = 'ACTIVE'"
    column_mappings:
      CUST_KEY: CUSTOMER_KEY
      CUST_NAME: CUSTOMER_NAME
```

### Example 3: Cross-Region Replication Validation

```yaml
# Global configuration
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: /data/validation/region_replication
max_threads: 24

# Source connection (US East)
source_connection:
  mode: name
  name: us_east_connection

# Target connection (EU West)
target_connection:
  mode: name
  name: eu_west_connection

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 150

# Comparison configuration
comparison_configuration:
  tolerance: 0.005

# Tables configuration
tables:
  # Large fact table with chunking
  - fully_qualified_name: GLOBAL_DB.REPLICATION.TRANSACTIONS
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - TRANSACTION_ID
      - CUSTOMER_ID
      - AMOUNT
      - TRANSACTION_DATE
      - STATUS
    index_column_list:
      - TRANSACTION_ID
    where_clause: "TRANSACTION_DATE >= DATEADD(day, -7, CURRENT_DATE())"
    target_where_clause: "TRANSACTION_DATE >= DATEADD(day, -7, CURRENT_DATE())"
    chunk_number: 30

  # Dimension table
  - fully_qualified_name: GLOBAL_DB.REPLICATION.PRODUCTS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - PRODUCT_ID

  # Reference table
  - fully_qualified_name: GLOBAL_DB.REPLICATION.CURRENCIES
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - CURRENCY_CODE
```

### Example 4: Database Copy Validation

```yaml
# Validate a database copy within the same account
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./db_copy_validation
max_threads: auto

# Use the same connection for both
source_connection:
  mode: name
  name: prod_connection

target_connection:
  mode: name
  name: prod_connection

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.001

# Tables to validate (source DB vs copied DB)
tables:
  - fully_qualified_name: ORIGINAL_DB.PUBLIC.USERS
    target_database: COPIED_DB
    target_schema: PUBLIC
    target_name: USERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - USER_ID

  - fully_qualified_name: ORIGINAL_DB.PUBLIC.EVENTS
    target_database: COPIED_DB
    target_schema: PUBLIC
    target_name: EVENTS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - EVENT_ID
    chunk_number: 20
```

### Example 5: Snowflake View Validation

Validate Snowflake views alongside tables for comprehensive data verification.

```yaml
source_platform: Snowflake
target_platform: Snowflake
output_directory_path: ./snowflake_view_validation
max_threads: auto

source_connection:
  mode: name
  name: source_connection

target_connection:
  mode: name
  name: target_connection

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 50

comparison_configuration:
  tolerance: 0.01

# Tables to validate
tables:
  - fully_qualified_name: ANALYTICS_DB.PUBLIC.CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

# Views to validate
views:
  # Basic view validation
  - fully_qualified_name: ANALYTICS_DB.PUBLIC.V_CUSTOMER_SUMMARY
    target_name: V_CUSTOMER_SUMMARY
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

  # View with specific columns
  - fully_qualified_name: ANALYTICS_DB.PUBLIC.V_SALES_METRICS
    target_name: V_SALES_METRICS
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - REGION
      - TOTAL_SALES
      - ORDER_COUNT
      - AVG_ORDER_VALUE
    index_column_list: [REGION, PERIOD]
    target_index_column_list: [REGION, PERIOD]

  # View with filtering
  - fully_qualified_name: ANALYTICS_DB.PUBLIC.V_ACTIVE_USERS
    target_name: V_ACTIVE_USERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [USER_ID]
    target_index_column_list: [USER_ID]
    where_clause: "LAST_LOGIN >= DATEADD(day, -30, CURRENT_DATE())"
    target_where_clause: "LAST_LOGIN >= DATEADD(day, -30, CURRENT_DATE())"

  # View with different target name
  - fully_qualified_name: ANALYTICS_DB.PUBLIC.V_LEGACY_REPORT
    target_database: MODERN_DB
    target_schema: REPORTS
    target_name: V_MODERNIZED_REPORT
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [REPORT_ID]
    target_index_column_list: [REPORT_ID]
    column_mappings:
      OLD_COL: NEW_COL
```

**Note:** View validation creates temporary tables internally to materialize view data for comparison between source and target Snowflake databases.

---

## Troubleshooting Snowflake Connections

### Issue: Connection Not Found

**Symptom:**

```sql
Connection 'connection_name' not found
```

**Solutions:**

1. Verify the connection name is correct:

   ```bash
   # List available connections using Snowflake CLI
   snow connection list
   ```
2. Check your Snowflake connections configuration file
3. Ensure the connection file has proper permissions
4. Verify the connection name matches exactly (case-sensitive)

### Issue: Authentication Failed

**Symptom:**

```sql
Authentication failed for user 'username'
```

**Solutions:**

1. Verify credentials are correct
2. Check if using correct authentication method:

   * Password authentication
   * Key pair authentication
   * SSO/OAuth
3. Verify user has necessary permissions:

   ```sql
   -- Grant read permissions
   GRANT USAGE ON DATABASE database_name TO ROLE my_role;
   GRANT USAGE ON SCHEMA database_name.schema_name TO ROLE my_role;
   GRANT SELECT ON ALL TABLES IN SCHEMA database_name.schema_name TO ROLE my_role;
   ```
4. Check if account is correct (including region suffix)

### Issue: Database/Schema Not Found

**Symptom:**

```sql
Database 'DATABASE_NAME' does not exist or not authorized
```

**Solutions:**

1. Verify database/schema names are correct (case-sensitive in Snowflake)
2. Check user has access to the database:

   ```sql
   USE DATABASE database_name;
   USE SCHEMA schema_name;
   SHOW TABLES;
   ```
3. Verify the warehouse is running:

   ```sql
   ALTER WAREHOUSE my_warehouse RESUME;
   ```

### Issue: Cross-Account Access Denied

**Symptom:**

```sql
Access denied to account 'account_name'
```

**Solutions:**

1. Verify both accounts have correct connection configurations
2. Check if data sharing is properly configured between accounts
3. Verify network policies allow cross-account connections
4. Ensure both connections use appropriate credentials

### Issue: Timeout Errors

**Symptom:**

```sql
Query timeout: Operation did not complete within the specified time
```

**Solutions:**

1. Increase warehouse size:

   ```sql
   ALTER WAREHOUSE my_warehouse SET WAREHOUSE_SIZE = 'LARGE';
   ```
2. Enable chunking for large tables:

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 50
   ```
3. Add WHERE clauses to limit data:

   ```yaml
   tables:
     - fully_qualified_name: large_table
       where_clause: "CREATED_DATE >= DATEADD(month, -1, CURRENT_DATE())"
   ```
4. Reduce thread count if warehouse is overloaded:

   ```yaml
   max_threads: 8
   ```

---

## Best Practices for Snowflake-to-Snowflake Validation

### Connection Management

1. **Use named connections:**

   ```yaml
   source_connection:
     mode: name
     name: source_account
   ```
2. **Store credentials securely:**

   * Use Snowflake CLI connection configuration
   * Leverage key pair authentication for production
   * Avoid hardcoding passwords
3. **Use appropriate roles:**

   ```sql
   -- Create a read-only role for validation
   CREATE ROLE validation_reader;
   GRANT USAGE ON DATABASE db_name TO ROLE validation_reader;
   GRANT USAGE ON ALL SCHEMAS IN DATABASE db_name TO ROLE validation_reader;
   GRANT SELECT ON ALL TABLES IN DATABASE db_name TO ROLE validation_reader;
   ```

### Performance Optimization

1. **Size warehouses appropriately:**

   ```sql
   -- Use larger warehouse for big validations
   ALTER WAREHOUSE validation_wh SET WAREHOUSE_SIZE = 'MEDIUM';
   ```
2. **Enable chunking for large tables:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 50
   ```
3. **Use WHERE clauses to filter data:**

   ```yaml
   tables:
     - fully_qualified_name: transactions
       where_clause: "TRANSACTION_DATE >= CURRENT_DATE() - 30"
   ```
4. **Optimize thread count:**

   ```yaml
   max_threads: 16  # Adjust based on warehouse capacity
   ```
5. **Consider time-based filtering for incremental validation:**

   ```yaml
   tables:
     - fully_qualified_name: events
       where_clause: "EVENT_TIMESTAMP >= '2024-01-01'"
       target_where_clause: "EVENT_TIMESTAMP >= '2024-01-01'"
   ```

### Data Quality

1. **Start with schema validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: false
     row_validation: false
   ```
2. **Progress to metrics validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: true
     row_validation: false
   ```
3. **Enable row validation selectively:**

   ```yaml
   validation_configuration:
     row_validation: true

   tables:
     - fully_qualified_name: critical_fact_table
       # Row validation enabled for critical tables
   ```

### Cross-Account/Region Considerations

1. **Account for replication lag:**

   * Allow time for replication to complete before validation
   * Use time-based filters that account for lag
2. **Handle naming differences:**

   ```yaml
   database_mappings:
     SOURCE_DB: TARGET_DB

   schema_mappings:
     SOURCE_SCHEMA: TARGET_SCHEMA
   ```
3. **Monitor costs:**

   * Cross-region data transfer incurs costs
   * Schedule validations during off-peak hours
   * Use sampling for initial validation
4. **Use appropriate tolerance:**

   ```yaml
   comparison_configuration:
     tolerance: 0.01  # Allow for minor differences
   ```

---

## See Also

* [Main CLI Usage Guide](CLI_USAGE_GUIDE.md)
* [SQL Server Commands Reference](sqlserver_commands.md)
* [Teradata Commands Reference](teradata_commands.md)
* [Redshift Commands Reference](redshift_commands.md)
* [Configuration Examples](CONFIGURATION_EXAMPLES.md)
* [Quick Reference Guide](CLI_QUICK_REFERENCE.md)

---
title: Snowflake Data Validation - Documentation Index
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/index.md
section: Migrations
---

# Snowflake Data Validation - Documentation Index

Welcome to the Snowflake Data Validation CLI documentation. The Snowflake Data Validation CLI (`snowflake-data-validation` or `sdv`) is a comprehensive command-line tool for validating data migrations between source databases (SQL Server, Teradata, Amazon Redshift, Snowflake) and Snowflake. It provides multi-level validation strategies to ensure data consistency and quality, including Snowflake-to-Snowflake validation for cross-account or cross-region migrations.

## Documentation roadmap

### 1. Command References by Database Dialect

**Choose your source database for dialect-specific commands:**

* **[SQL Server Commands Reference](sqlserver_commands.md)** - Complete SQL Server command documentation
* **[Teradata Commands Reference](teradata_commands.md)** - Complete Teradata command documentation
* **[Amazon Redshift Commands Reference](redshift_commands.md)** - Complete Redshift command documentation
* **[Snowflake Commands Reference](snowflake_commands.md)** - Complete Snowflake-to-Snowflake command documentation

Each command reference includes:

* Detailed syntax and options for all commands
* Connection configuration specifics
* Complete examples
* Troubleshooting tips
* Best practices for that platform

---

### 2. CLI Usage Guide - Comprehensive Reference

**[Start here for complete documentation.](CLI_USAGE_GUIDE.md)**

A comprehensive, customer-facing guide covering all aspects of the CLI tool.

**Contents:**

* Complete installation instructions
* Detailed command reference for all source databases
* In-depth configuration file reference with all options explained
* Complete configuration examples
* Advanced usage patterns
* Troubleshooting guide

**Best for:**

* First-time users getting started
* Users needing detailed explanations of configuration options
* Troubleshooting issues
* Understanding all available features

---

### 3. Quick Reference Guide - Fast Lookup

**[Use this for quick lookups and reminders.](CLI_QUICK_REFERENCE.md)**

A concise reference guide with essential information in an easy-to-scan format.

**Contents:**

* Command syntax at a glance
* Quick configuration templates
* Table configuration patterns
* Common CLI options reference
* Performance tips
* Common issues and quick fixes

**Best for:**

* Experienced users who need quick reminders
* Looking up specific syntax
* Quick configuration templates
* Performance optimization tips

---

### 4. Configuration Examples - Ready-to-Use configurations

**[Copy and adapt these real-world examples.](CONFIGURATION_EXAMPLES.md)**

A collection of ready-to-use configuration file examples for various scenarios.

**Contents:**

* 16+ complete configuration examples
* SQL Server configurations
* Teradata configurations
* Redshift configurations
* Snowflake-to-Snowflake configurations
* Scenario-based examples (dev, staging, production, PII-compliant, etc.)
* Tips for adapting examples
* Security best practices

**Best for:**

* Jump-starting your configuration
* Finding a configuration similar to your use case
* Learning by example
* Best practices for different scenarios

---

## Quick Navigation by Task

### The following sections provide quick references to the documentation for specific tasks.

#### Get Started

1. Follow installation instructions in [CLI Usage Guide](CLI_USAGE_GUIDE.md)
2. Copy an example from [Configuration Examples](CONFIGURATION_EXAMPLES.md)
3. Run your first validation using the [Quick Reference](CLI_QUICK_REFERENCE.md)

#### Understand All Options

→ [CLI Usage Guide - Configuration File Reference](CLI_USAGE_GUIDE.md)

#### Find a Command

→ [Quick Reference - Common Commands](CLI_QUICK_REFERENCE.md)

#### Create a Configuration File

→ [Configuration Examples](CONFIGURATION_EXAMPLES.md) (pick the closest match to your scenario)

#### Troubleshoot an Issue

→ [CLI Usage Guide - Troubleshooting](CLI_USAGE_GUIDE.md)

#### Optimize Performance

→ [Quick Reference - Performance Tips](CLI_QUICK_REFERENCE.md)

#### Validate Large Tables

→ [CLI Usage Guide - Working with Large Tables](CLI_USAGE_GUIDE.md)

#### Understand Connection Options

→ [CLI Usage Guide - Connection Configuration](CLI_USAGE_GUIDE.md)

#### Set Up Validation Levels

→ [CLI Usage Guide - Validation Configuration](CLI_USAGE_GUIDE.md)

#### Configure Table-Specific Settings

→ [CLI Usage Guide - Table Configuration](CLI_USAGE_GUIDE.md)

#### Configure View Validation

→ [CLI Usage Guide - View Configuration](CLI_USAGE_GUIDE.md)

#### View Validation Examples

→ [Configuration Examples - View Validation](CONFIGURATION_EXAMPLES.md)

---

## Documentation by Source Database

The following sections provide quick references to the documentation for specific source databases.

### SQL Server Users

**Essential Reading:**

1. **[SQL Server Commands Reference](sqlserver_commands.md)** - Complete command reference
2. [Quick Reference - SQL Server Connection](CLI_QUICK_REFERENCE.md)
3. [CLI Usage Guide - SQL Server Commands](CLI_USAGE_GUIDE.md)
4. [Configuration Examples - SQL Server Examples](CONFIGURATION_EXAMPLES.md)

**Key Examples:**

* [Example 1: Minimal SQL Server Configuration](CONFIGURATION_EXAMPLES.md)
* [Example 2: Production SQL Server with SSL/TLS](CONFIGURATION_EXAMPLES.md)
* [Example 3: SQL Server Incremental Validation](CONFIGURATION_EXAMPLES.md)
* [Example 4: SQL Server with Column Mappings](CONFIGURATION_EXAMPLES.md)

### Teradata Users

**Essential Reading:**

1. **[Teradata Commands Reference](teradata_commands.md)** - Complete command reference
2. [Quick Reference - Teradata Connection](CLI_QUICK_REFERENCE.md)
3. [CLI Usage Guide - Teradata Commands](CLI_USAGE_GUIDE.md)
4. [Configuration Examples - Teradata Examples](CONFIGURATION_EXAMPLES.md)

**Key Examples:**

* [Example 5: Basic Teradata Configuration](CONFIGURATION_EXAMPLES.md)
* [Example 6: Teradata Large-Scale Migration](CONFIGURATION_EXAMPLES.md)
* [Example 7: Teradata Multi-Schema Validation](CONFIGURATION_EXAMPLES.md)

### Amazon Redshift Users

**Essential Reading:**

1. **[Amazon Redshift Commands Reference](redshift_commands.md)** - Complete command reference
2. [Quick Reference - Redshift Connection](CLI_QUICK_REFERENCE.md)
3. [CLI Usage Guide - Redshift Commands](CLI_USAGE_GUIDE.md)
4. [Configuration Examples - Redshift Examples](CONFIGURATION_EXAMPLES.md)

**Key Examples:**

* [Example 8: Basic Redshift Configuration](CONFIGURATION_EXAMPLES.md)
* [Example 9: Redshift Data Lake Migration](CONFIGURATION_EXAMPLES.md)
* [Example 10: Redshift with Complex Filtering](CONFIGURATION_EXAMPLES.md)

---

## Documentation by Use Case

### Development Environment

* [Configuration Example 11: Development Environment - Fast Validation](CONFIGURATION_EXAMPLES.md)
* [Quick Reference - Common Commands](CLI_QUICK_REFERENCE.md)

### Staging Environment

* [Configuration Example 12: Staging Environment - Comprehensive Testing](CONFIGURATION_EXAMPLES.md)
* [CLI Usage Guide - Advanced Usage](CLI_USAGE_GUIDE.md)

### Production Environment

* [Configuration Example 13: Production - Maximum Performance](CONFIGURATION_EXAMPLES.md)
* [CLI Usage Guide - Working with Large Tables](CLI_USAGE_GUIDE.md)
* [Quick Reference - Performance Tips](CLI_QUICK_REFERENCE.md)

### PII/Compliance Requirements

* [Configuration Example 14: PII-Compliant Validation](CONFIGURATION_EXAMPLES.md)
* [CLI Usage Guide - Table Configuration](CLI_USAGE_GUIDE.md)

### Migration Cutover

* [Configuration Example 15: Migration Cutover Validation](CONFIGURATION_EXAMPLES.md)
* [CLI Usage Guide - Advanced Usage](CLI_USAGE_GUIDE.md)

### Continuous/Incremental Validation

* [Configuration Example 16: Continuous Validation - Daily Incremental](CONFIGURATION_EXAMPLES.md)
* [CLI Usage Guide - Advanced Usage](CLI_USAGE_GUIDE.md)

### View Validation

Validate database views alongside or separately from tables. Views are materialized into temporary tables for comparison.

* [CLI Usage Guide - View Configuration](CLI_USAGE_GUIDE.md)
* [Configuration Examples - View Validation](CONFIGURATION_EXAMPLES.md)
* [Quick Reference - View Configuration](CLI_QUICK_REFERENCE.md)

**Key Features:**

* Validate views with the same options as tables (column selection, filtering, column mappings)
* Support for target database/schema/name overrides
* Views are automatically materialized for accurate comparison

### Snowflake-to-Snowflake Validation

Validate data between different Snowflake accounts, regions, or databases.

**Essential Reading:**

1. **[Snowflake Commands Reference](snowflake_commands.md)** - Complete command reference
2. [CLI Usage Guide - Snowflake Commands](CLI_USAGE_GUIDE.md)
3. [Configuration Examples - Snowflake Examples](CONFIGURATION_EXAMPLES.md)

**Key Features:**

* Cross-account validation with separate source and target credentials
* IPC mode for direct connection parameter specification
* Source-only validation with Parquet file export for offline comparison
* Same validation capabilities as other source platforms (schema, metrics, row-level)

---

## Configuration Reference

The following sections provide quick references to the documentation for specific configuration scenarios.

### Quick Config Template

→ [Quick Reference - Configuration Template](CLI_QUICK_REFERENCE.md)

### Complete Field Reference

→ [CLI Usage Guide - Configuration File Reference](CLI_USAGE_GUIDE.md)

### Real-World Examples

→ [Configuration Examples](CONFIGURATION_EXAMPLES.md)

---

## Common Workflows

The following sections provide quick references to the documentation for common workflows.

### First-Time Setup Workflow

1. Install the CLI

   * → [CLI Usage Guide - Installation](CLI_USAGE_GUIDE.md)
2. Generate configuration template

   * → [Quick Reference - Get Templates](CLI_QUICK_REFERENCE.md)
3. Copy and modify an example

   * → [Configuration Examples](CONFIGURATION_EXAMPLES.md)
4. Run validation

   * → [Quick Reference - Run Validation](CLI_QUICK_REFERENCE.md)
5. Review results

   * → [CLI Usage Guide - Validation Reports](CLI_USAGE_GUIDE.md)

### Troubleshooting Workflow

1. Check error message

   * → [CLI Usage Guide - Troubleshooting](CLI_USAGE_GUIDE.md)
2. Review configuration

   * → [Quick Reference - Configuration Template](CLI_QUICK_REFERENCE.md)
3. Enable debug logging

   * → [CLI Usage Guide - Logging Configuration](CLI_USAGE_GUIDE.md)
4. Review logs

   * → [CLI Usage Guide - Troubleshooting](CLI_USAGE_GUIDE.md)
5. Adjust configuration

   * → [Configuration Examples](CONFIGURATION_EXAMPLES.md)

### Performance Optimization Workflow

1. Review performance tips

   * → [Quick Reference - Performance Tips](CLI_QUICK_REFERENCE.md)
2. Enable chunking

   * → [CLI Usage Guide - Working with Large Tables](CLI_USAGE_GUIDE.md)
3. Adjust thread count

   * → [CLI Usage Guide - Global Configuration](CLI_USAGE_GUIDE.md)
4. Add filters

   * → [CLI Usage Guide - Table Configuration](CLI_USAGE_GUIDE.md)
5. Test with examples

   * → [Configuration Examples - Production](CONFIGURATION_EXAMPLES.md)

---

## Feature Matrix

| Feature | Command Refs | Quick Reference | Usage Guide | Examples |
| --- | --- | --- | --- | --- |
| Installation |  | ✓ | ✓✓✓ |  |
| Command Syntax | ✓✓✓ | ✓✓✓ | ✓✓ |  |
| Configuration | ✓ | ✓✓ | ✓✓✓ | ✓✓✓ |
| Connection Setup | ✓✓✓ | ✓ | ✓✓✓ | ✓✓✓ |
| Table Config |  | ✓✓ | ✓✓✓ | ✓✓✓ |
| View Config |  | ✓✓ | ✓✓✓ | ✓✓✓ |
| Validation Levels |  | ✓ | ✓✓✓ | ✓✓ |
| Performance |  | ✓✓✓ | ✓✓ | ✓✓ |
| Troubleshooting | ✓✓✓ | ✓✓ | ✓✓✓ |  |
| Examples | ✓✓ | ✓ | ✓✓ | ✓✓✓ |

Legend: ✓ = Covered, ✓✓ = Good Coverage, ✓✓✓ = Comprehensive Coverage

---

## Learning Path

### Beginner Path

1. **Day 1: Understanding the Tool**

   * Read the [Main Project Repository](https://github.com/snowflake-eng/migrations-data-validation)
   * Skim [CLI Usage Guide - Overview](CLI_USAGE_GUIDE.md)
   * Review [Quick Reference](CLI_QUICK_REFERENCE.md)
2. **Day 2: First Validation**

   * Follow [CLI Usage Guide - Quick Start](CLI_USAGE_GUIDE.md)
   * Copy [Configuration Example 1 or 5 or 8](CONFIGURATION_EXAMPLES.md)
   * Run your first validation
3. **Day 3: Configuration Mastery**

   * Read [CLI Usage Guide - Configuration Reference](CLI_USAGE_GUIDE.md)
   * Review multiple [Configuration Examples](CONFIGURATION_EXAMPLES.md)
   * Customize configuration for your needs

### Intermediate Path

1. **Optimize Performance**

   * [CLI Usage Guide - Working with Large Tables](CLI_USAGE_GUIDE.md)
   * [Quick Reference - Performance Tips](CLI_QUICK_REFERENCE.md)
2. **Advanced Features**

   * [CLI Usage Guide - Advanced Usage](CLI_USAGE_GUIDE.md)
   * [Configuration Examples - Scenario-Based](CONFIGURATION_EXAMPLES.md)
3. **CI/CD Integration**

   * [CLI Usage Guide - CI/CD Integration](CLI_USAGE_GUIDE.md)
   * [Configuration Examples - Continuous Validation](CONFIGURATION_EXAMPLES.md)

### Expert Path

1. **Custom Templates**

   * [CLI Usage Guide - Using Custom Query Templates](CLI_USAGE_GUIDE.md)
2. **Async Workflows**

   * [CLI Usage Guide - Asynchronous Validation Workflow](CLI_USAGE_GUIDE.md)
3. **Production Deployment**

   * [Configuration Example 13 & 15](CONFIGURATION_EXAMPLES.md)

---

## Search Tips

### Finding Information Quickly

**For Commands:**

* Look in [Quick Reference](CLI_QUICK_REFERENCE.md) first
* For details, see [CLI Usage Guide - CLI Commands](CLI_USAGE_GUIDE.md)

**For Configuration:**

* Start with [Quick Reference - Configuration Template](CLI_QUICK_REFERENCE.md)
* For full details, see [CLI Usage Guide - Configuration Reference](CLI_USAGE_GUIDE.md)
* For examples, see [Configuration Examples](CONFIGURATION_EXAMPLES.md)

**For Errors:**

* Check [CLI Usage Guide - Troubleshooting](CLI_USAGE_GUIDE.md)
* Review [Quick Reference - Common Issues](CLI_QUICK_REFERENCE.md)

---

## Additional Support

If you cannot find what you need in these documents:

Email us at [snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: Snowflake Data Validation CLI - Complete Usage Guide
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/CLI_USAGE_GUIDE.md
section: Migrations
---

# Snowflake Data Validation CLI - Complete Usage Guide

## Overview

The Snowflake Data Validation CLI (`snowflake-data-validation` or `sdv`) is a comprehensive command-line tool for validating data migrations between source databases (SQL Server, Teradata, Amazon Redshift) and Snowflake. It provides multi-level validation strategies to ensure data consistency and quality.

### Key Features

* **Multi-Level Validation**: Schema, statistical metrics, and row-by-row data validation
* **Multiple Source Platforms**: SQL Server, Teradata, and Amazon Redshift
* **Tables and Views Validation**: Validate both tables and database views
* **Flexible Execution Modes**: Synchronous, asynchronous, and script generation
* **Comprehensive Configuration**: YAML-based configuration with extensive customization options
* **Detailed Reporting**: Comprehensive validation reports with mismatch information

---

## Prerequisites

Before installing the Snowflake Data Validation CLI, ensure you have the following prerequisites installed:

### System Requirements

* **Python**: Version 3.8 or higher
* **pip**: Latest version recommended
* **Operating System**: Linux, macOS, or Windows

### ODBC Drivers

The CLI requires appropriate ODBC drivers to be installed on your system for connecting to source databases. Install the ODBC driver that matches your source database dialect:

#### SQL Server ODBC Driver

For SQL Server as a source database, you need the **Microsoft ODBC Driver for SQL Server**.

**Recommended Version**: ODBC Driver 17 or 18 for SQL Server

**Installation Instructions:**

* **Linux**:

  ```bash
  # Ubuntu/Debian
  curl https://packages.microsoft.com/keys/microsoft.asc | apt-key add -
  curl https://packages.microsoft.com/config/ubuntu/$(lsb_release -rs)/prod.list > /etc/apt/sources.list.d/mssql-release.list
  apt-get update
  ACCEPT_EULA=Y apt-get install -y msodbcsql18
  ```
* **macOS**:

  ```bash
  # Using Homebrew
  brew tap microsoft/mssql-release https://github.com/Microsoft/homebrew-mssql-release
  brew update
  HOMEBREW_ACCEPT_EULA=Y brew install msodbcsql18
  ```
* **Windows**:
  Download and install from [Microsoft’s official download page](https://docs.microsoft.com/en-us/sql/connect/odbc/download-odbc-driver-for-sql-server)

**Verification:**

```bash
# List available drivers (should show ODBC Driver 17 or 18 for SQL Server)
odbcinst -q -d
```

**Documentation**: [Microsoft ODBC Driver for SQL Server](https://docs.microsoft.com/en-us/sql/connect/odbc/microsoft-odbc-driver-for-sql-server)

#### Teradata ODBC Driver

For Teradata as a source database, you need the **Teradata ODBC Driver**.

**Recommended Version**: Teradata ODBC Driver 17.20 or higher

**Installation Instructions:**

1. Download the Teradata Tools and Utilities (TTU) package from [Teradata Downloads](https://downloads.teradata.com/)
2. Select your operating system and download the appropriate installer
3. Run the installer and select “ODBC Driver” during installation
4. Configure the driver according to Teradata’s documentation

**Note**: You may need to create a Teradata account to access the download page.

**Configuration:**
After installation, you may need to configure the ODBC driver:

* **Linux/macOS**: Edit `/etc/odbc.ini` and `/etc/odbcinst.ini`
* **Windows**: Use the ODBC Data Source Administrator

**Documentation**: [Teradata ODBC Driver Documentation](https://downloads.teradata.com/download/connectivity/odbc-driver/linux)

#### Amazon Redshift ODBC Driver

For Amazon Redshift as a source database, you need the **Amazon Redshift ODBC Driver** or a **PostgreSQL ODBC Driver** (since Redshift is PostgreSQL-compatible).

**Option 1: Amazon Redshift ODBC Driver (Recommended)**

**Recommended Version**: Amazon Redshift ODBC Driver 2.x

**Installation Instructions:**

* Download from [Amazon Redshift ODBC Driver Download](https://docs.aws.amazon.com/redshift/latest/mgmt/configure-odbc-connection.html)
* Choose your operating system and architecture
* Follow the installation wizard

**Option 2: PostgreSQL ODBC Driver (Alternative)**

* **Linux**:

  ```bash
  # Ubuntu/Debian
  sudo apt-get install odbc-postgresql

  # RHEL/CentOS/Fedora
  sudo yum install postgresql-odbc
  ```
* **macOS**:

  ```bash
  # Using Homebrew
  brew install psqlodbc
  ```
* **Windows**:
  Download from [PostgreSQL ODBC Driver](https://www.postgresql.org/ftp/odbc/versions/)

**Verification:**

```bash
# List available drivers (should show Amazon Redshift or PostgreSQL drivers)
odbcinst -q -d
```

**Documentation**: [Amazon Redshift ODBC Driver Documentation](https://docs.aws.amazon.com/redshift/latest/mgmt/configure-odbc-connection.html)

### Additional Tools (Optional)

* **unixODBC** (Linux/macOS): Required for ODBC driver management

  ```bash
  # Ubuntu/Debian
  sudo apt-get install unixodbc unixodbc-dev

  # macOS
  brew install unixodbc

  # RHEL/CentOS/Fedora
  sudo yum install unixODBC unixODBC-devel
  ```

### Network Access

Ensure your environment has network access to:

* Source database (SQL Server, Teradata, or Redshift)
* Snowflake account
* Package repositories (for pip installation)

---

## Installation

### Base Installation

```bash
pip install snowflake-data-validation
```

### Source-Specific Installation

Install with the appropriate database driver for your source system:

```bash
# For SQL Server as source
pip install "snowflake-data-validation[sqlserver]"

# For Teradata as source
pip install "snowflake-data-validation[teradata]"

# For Amazon Redshift as source
pip install "snowflake-data-validation[redshift]"

# For development with all drivers
pip install "snowflake-data-validation[all]"
```

### Post-Installation Verification

After installation, verify the CLI is correctly installed:

```bash
# Check version
snowflake-data-validation --version

# Or using the alias
sdv --version

# Verify ODBC drivers are accessible
odbcinst -q -d
```

---

## Quick Start

### 1. Generate a Configuration Template

```bash
# Get SQL Server configuration templates
snowflake-data-validation sqlserver get-configuration-files

# Get Teradata configuration templates
snowflake-data-validation teradata get-configuration-files

# Get Redshift configuration templates
snowflake-data-validation redshift get-configuration-files
```

### 2. Auto-Generate Configuration from Connection

```bash
# Interactive configuration generation for SQL Server
snowflake-data-validation sqlserver auto-generated-configuration-file

# Interactive configuration generation for Teradata
snowflake-data-validation teradata auto-generated-configuration-file

# Interactive configuration generation for Redshift
snowflake-data-validation redshift auto-generated-configuration-file
```

### 3. Run Validation

```bash
# Run synchronous validation
snowflake-data-validation sqlserver run-validation \
  --data-validation-config-file ./config/validation_config.yaml
```

---

## Best Practices & Guidance

This section provides strategic guidance on how to approach data validation effectively, minimize resource consumption, and identify issues early.

### Incremental Validation Approach

> **Note:**
>
> Always start small and scale up incrementally. Running full validation on large datasets immediately can:
>
> * Consume significant compute resources on both source and target systems
> * Take hours or days to complete
> * Make troubleshooting difficult if issues are found
> * Impact production systems if run during business hours

### Recommended Validation Strategy

Follow this proven approach to ensure efficient and effective validation:

#### Phase 1: Start with a Sample Dataset

**Goal**: Verify configuration and establish baseline

**Approach**:

* Test with 1-2 small tables first (< 100,000 rows)
* Choose tables with diverse data types to validate type mapping
* Verify connectivity and authentication work correctly
* Confirm output format meets your needs

**Example Configuration**:

```yaml
tables:
  - source_table_name: "small_reference_table"
    target_table_name: "small_reference_table"
    source_schema_name: "dbo"
    target_schema_name: "PUBLIC"
    target_database_name: "MIGRATION_DB"
    where_clause: "reference_id <= 1000"  # Limit to first 1000 rows on source
    target_where_clause: "reference_id <= 1000"  # Limit to first 1000 rows on target
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"
```

**What to Verify**:

* ✅ Connection to both source and target successful
* ✅ Schema validation passes
* ✅ Row count matches
* ✅ Data types mapped correctly
* ✅ Validation report generated successfully

**Understanding `where_clause` vs `target_where_clause`**:

The tool provides two filtering options:

* **`where_clause`**: Applied **only** to the **source** table

  ```yaml
  where_clause: "ProductID <= 45"  # Filters only the source table
  ```
* **`target_where_clause`**: Applied **only** to the **target** table (Snowflake)

  ```yaml
  target_where_clause: "ProductID <= 45"  # Filters only the target table
  ```

**Common Usage Patterns**:

```yaml
# Pattern 1: Apply same filter to both source and target
tables:
  - source_table_name: "products"
    where_clause: "ProductID <= 100"           # Filter source
    target_where_clause: "ProductID <= 100"     # Filter target with same condition
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"

# Pattern 2: Different filters for source and target
# Useful when target has additional test data to exclude
tables:
  - source_table_name: "products"
    where_clause: "ProductID <= 100"                    # Filter source
    target_where_clause: "ProductID <= 100 AND ProductID != 5"  # Exclude test data in target
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"

# Pattern 3: Filter only source (validate all target data)
tables:
  - source_table_name: "products"
    where_clause: "ProductID <= 100"  # Only validate subset from source
    # No target_where_clause - validates all matching rows in target
    validations:
      - validation_type: "row_count"
```

#### Phase 2: Verify Small Subsets of Production Tables

**Goal**: Test against actual production data patterns with limited scope

**Approach**:

* Select a subset of rows from production tables (10,000 - 100,000 rows)
* Use `where_clause` and `target_where_clause` to restrict data
* Focus on recent data or specific partitions
* Validate critical business columns first

**Example Configuration**:

```yaml
tables:
  - source_table_name: "large_transactions_table"
    target_table_name: "large_transactions_table"
    source_schema_name: "sales"
    target_schema_name: "SALES"
    target_database_name: "MIGRATION_DB"
    where_clause: "transaction_date >= '2024-01-01' AND transaction_date < '2024-02-01'"
    target_where_clause: "transaction_date >= '2024-01-01' AND transaction_date < '2024-02-01'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"
        columns_to_validate:
          - "transaction_id"
          - "amount"
          - "customer_id"
          - "transaction_date"
```

**Key Points**:

* 🎯 Use date ranges or ID ranges to limit scope
* 🎯 Test different data patterns (recent vs. historical, high-volume dates)
* 🎯 Validate critical business columns before validating all columns

#### Phase 3: Partition-Based Validation for Large Tables

**Goal**: Validate large tables efficiently using partitioning strategy

**⚠️ IMPORTANT**: For tables with millions or billions of rows, validating the entire table at once is:

* Resource-intensive (high compute costs)
* Time-consuming (can take hours/days)
* Risky (harder to identify specific issue patterns)

**Recommended Approach for Large Tables**:

**Strategy 1: Date-Based Partitioning**

Validate data in chunks based on date ranges:

```yaml
# Validation 1: January 2024
tables:
  - source_table_name: "orders"
    target_table_name: "orders"
    source_schema_name: "dbo"
    target_schema_name: "PUBLIC"
    target_database_name: "MIGRATION_DB"
    where_clause: "order_date >= '2024-01-01' AND order_date < '2024-02-01'"
    target_where_clause: "order_date >= '2024-01-01' AND order_date < '2024-02-01'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"

# Validation 2: February 2024
# Create separate config with:
# where_clause: "order_date >= '2024-02-01' AND order_date < '2024-03-01'"
# target_where_clause: "order_date >= '2024-02-01' AND order_date < '2024-03-01'"
```

**Strategy 2: Modulo-Based Sampling**

Use modulo arithmetic to sample evenly distributed rows:

```yaml
tables:
  - source_table_name: "customers"
    target_table_name: "customers"
    source_schema_name: "dbo"
    target_schema_name: "PUBLIC"
    target_database_name: "MIGRATION_DB"
    where_clause: "customer_id % 100 = 0"  # 1% sample - evenly distributed
    target_where_clause: "customer_id % 100 = 0"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"
```

**Strategy 3: Statistical Sampling**

For very large tables (> 100M rows), validate representative samples:

```yaml
tables:
  - source_table_name: "clickstream_events"
    target_table_name: "clickstream_events"
    source_schema_name: "dbo"
    target_schema_name: "PUBLIC"
    target_database_name: "MIGRATION_DB"
    where_clause: "event_id % 10000 = 0"  # ~0.01% sample - statistically distributed
    target_where_clause: "event_id % 10000 = 0"
    validations:
      # First: Validate aggregates and row count (fast)
      - validation_type: "row_count"
      - validation_type: "aggregate_metrics"

      # Second: Validate a statistical sample of rows
      - validation_type: "column_level"
```

**Strategy 4: Progressive Partition Validation**

Validate multiple partitions progressively:

```yaml
# config_q1.yaml - Validate Q1 2024
tables:
  - source_table_name: "orders"
    target_table_name: "orders"
    where_clause: "order_date >= '2024-01-01' AND order_date < '2024-04-01'"
    target_where_clause: "order_date >= '2024-01-01' AND order_date < '2024-04-01'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"

# config_q2.yaml - Validate Q2 2024 (run after Q1 passes)
tables:
  - source_table_name: "orders"
    target_table_name: "orders"
    where_clause: "order_date >= '2024-04-01' AND order_date < '2024-07-01'"
    target_where_clause: "order_date >= '2024-04-01' AND order_date < '2024-07-01'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"
```

```bash
# Validate Q1 2024
sdv sqlserver run-validation --data-validation-config-file config_q1.yaml

# If Q1 passes, validate Q2 2024
sdv sqlserver run-validation --data-validation-config-file config_q2.yaml

# Continue for remaining quarters
```

#### Phase 4: Full Validation

**Goal**: Complete comprehensive validation after successful subset testing

**When to Run Full Validation**:

* ✅ Sample validations pass successfully
* ✅ Subset validations show no data quality issues
* ✅ You have allocated sufficient time and compute resources
* ✅ Preferably during off-peak hours

**Considerations**:

* Use **asynchronous validation** for large datasets to avoid timeouts
* Consider using **script generation mode** to run validations in parallel
* Monitor resource consumption on both source and target systems
* Plan for validation to run during maintenance windows

**Example**:

```bash
# Generate scripts for parallel execution
sdv sqlserver generate-validation-scripts \
  --data-validation-config-file full_validation_config.yaml \
  --output-directory ./validation_scripts

# Review generated scripts and execute in parallel or scheduled
```

### Performance Optimization Tips

#### 1. Use Appropriate Validation Types

Not all tables need all validation types:

```yaml
# For reference/lookup tables (small, static)
validations:
  - validation_type: "schema"
  - validation_type: "row_count"
  - validation_type: "column_level"  # Full validation OK

# For large transaction tables
validations:
  - validation_type: "schema"
  - validation_type: "row_count"
  - validation_type: "aggregate_metrics"  # Use aggregates instead of row-by-row
```

#### 2. Prioritize Critical Columns

For large tables, validate critical business columns first:

```yaml
validations:
  - validation_type: "column_level"
    columns_to_validate:
      # Start with business-critical columns
      - "customer_id"
      - "transaction_amount"
      - "transaction_date"
      # Add more columns after initial validation succeeds
```

#### 3. Leverage Partitioning Metadata

If your tables are partitioned, validate partition by partition:

```yaml
# Config 1: Validate partition 1
tables:
  - source_table_name: "orders"
    target_table_name: "orders"
    where_clause: "partition_key = '2024-01'"
    target_where_clause: "partition_key = '2024-01'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"

# Config 2: Validate partition 2
tables:
  - source_table_name: "orders"
    target_table_name: "orders"
    where_clause: "partition_key = '2024-02'"
    target_where_clause: "partition_key = '2024-02'"
    validations:
      - validation_type: "row_count"
      - validation_type: "column_level"
```

#### 4. Use Asynchronous Validation for Production

For production environments, use asynchronous validation:

```bash
# Async validation returns immediately, processes in Snowflake
sdv sqlserver run-async-validation \
  --data-validation-config-file config.yaml \
  --poll-interval 30 \
  --max-wait-time 3600
```

### Cost and Resource Management

#### Estimate Query Costs

**Before running validation on large tables**:

1. **Estimate row counts**:

   ```sql
   -- Source
   SELECT COUNT(*) FROM source_table WHERE <partition_filter>;

   -- Target
   SELECT COUNT(*) FROM target_table WHERE <partition_filter>;
   ```
2. **Estimate data volume**:

   ```sql
   -- Check table size
   SELECT
       table_name,
       row_count,
       bytes,
       bytes / (1024*1024*1024) as size_gb
   FROM information_schema.tables
   WHERE table_name = 'your_table';
   ```
3. **Start with small partitions**: If estimated size > 10 GB, break into smaller chunks

#### Compute Warehouse Sizing (Snowflake)

* **Small tables (< 1M rows)**: XS or S warehouse
* **Medium tables (1M - 10M rows)**: S or M warehouse
* **Large tables (> 10M rows)**: M or L warehouse, use partitioning
* **Very large tables (> 100M rows)**: L or XL warehouse, mandatory partitioning

### Common Pitfalls to Avoid

| ❌ Don’t Do This | ✅ Do This Instead |
| --- | --- |
| Validate entire 1B row table at once | Validate in partitions of 10M-100M rows |
| Run validation during business hours on production | Schedule during off-peak hours or use read replicas |
| Skip sample testing and go straight to full validation | Always validate samples first |
| Use same configuration for all table sizes | Tailor validation strategy to table size |
| Validate all columns for all tables | Prioritize critical columns, especially for large tables |
| Ignore resource consumption | Monitor and set appropriate compute resources |

### Validation Checklist

Use this checklist to ensure you’re following best practices:

**Before Starting**:

* [ ] ODBC drivers installed and tested
* [ ] Connectivity to source and target verified
* [ ] Configuration template generated
* [ ] Test credentials have appropriate read permissions

**Initial Testing** (Phase 1):

* [ ] Selected 1-2 small tables for initial test
* [ ] Configuration file created and reviewed
* [ ] Sample validation executed successfully
* [ ] Validation report reviewed and understood

**Subset Validation** (Phase 2):

* [ ] Identified subset of production data to validate
* [ ] Used `where_clause` and `target_where_clause` to restrict rows
* [ ] Validated 10,000 - 100,000 rows successfully
* [ ] Reviewed results for any data quality issues

**Large Table Strategy** (Phase 3):

* [ ] Identified tables > 10M rows
* [ ] Chosen partitioning strategy (date, ID range, modulo)
* [ ] Estimated compute costs for validation
* [ ] Tested validation on 1-2 partitions first
* [ ] Documented partition validation schedule

**Production Validation** (Phase 4):

* [ ] All subset validations passed
* [ ] Resource allocation planned (compute, time)
* [ ] Validation scheduled during maintenance window
* [ ] Using asynchronous or script generation mode
* [ ] Monitoring plan in place

### Example: Complete Validation Strategy

Here’s a complete example of validating a large e-commerce database:

**Day 1: Initial Setup and Small Tables**

```bash
# Test with small reference tables
sdv sqlserver run-validation --data-validation-config-file config_small_tables.yaml
# Tables: product_categories (1K rows), payment_types (50 rows)
```

**Day 2: Subset of Medium Tables**

```bash
# Test with recent data from medium tables
sdv sqlserver run-validation --data-validation-config-file config_subset_medium.yaml
# Tables: customers WHERE created_date >= '2024-01-01' (50K rows)
#         products WHERE product_id <= 10000 (10K rows)
```

**Day 3: Partition Strategy for Large Tables**

```bash
# Validate one month of orders
sdv sqlserver run-validation --data-validation-config-file config_orders_jan2024.yaml
# Table: orders WHERE order_date BETWEEN '2024-01-01' AND '2024-01-31' (500K rows)
```

**Day 4-5: Progressive Partition Validation**

```bash
# Generate scripts for all partitions
sdv sqlserver generate-validation-scripts \
  --data-validation-config-file config_all_partitions.yaml \
  --output-directory ./scripts

# Review and execute scripts for each partition
```

**Day 6: Full Validation (Off-Peak)**

```bash
# Run complete validation during weekend maintenance window
sdv sqlserver run-async-validation \
  --data-validation-config-file config_full_validation.yaml \
  --poll-interval 60 \
  --max-wait-time 28800  # 8 hours
```

---

## CLI Commands

### Command Structure

All commands follow this consistent structure:

```bash
snowflake-data-validation <source_dialect> <command> [options]

# Or use the shorter alias
sdv <source_dialect> <command> [options]
```

Where:

* `<source_dialect>` is one of: `sqlserver`, `teradata`, `redshift`, `snowflake`.
* `<command>` is one of:

  + `run-validation` - Run synchronous validation
  + `run-async-validation` - Run asynchronous validation
  + `generate-validation-scripts` - Generate validation scripts
  + `get-configuration-files` - Get configuration templates
  + `auto-generated-configuration-file` - Interactive config generation

### Global Options

These options can be used with the CLI without specifying a source dialect or command:

#### Check Version

Display the current installed version of the Snowflake Data Validation CLI:

```bash
# Using full command name
snowflake-data-validation --version

# Using the alias
sdv --version
```

**Output Example:**

```text
snowflake-data-validation 1.2.3
```

**Use Cases:**

* Verify successful installation
* Check which version is currently installed
* Confirm version before reporting issues
* Ensure compatibility with documentation

#### Help

Display general help information:

```bash
# General help
snowflake-data-validation --help
sdv --help

# Command-specific help
sdv sqlserver --help
sdv teradata run-validation --help
sdv redshift generate-validation-scripts --help
```

### Dialect-Specific Command References

For detailed command documentation specific to your source database, see the following pages:

* **[SQL Server Commands Reference](sqlserver_commands.md)** - Complete command reference for SQL Server migrations
* **[Teradata Commands Reference](teradata_commands.md)** - Complete command reference for Teradata migrations
* **[Amazon Redshift Commands Reference](redshift_commands.md)** - Complete command reference for Redshift migrations
* **[Snowflake Commands Reference](snowflake_commands.md)** - Complete command reference for Snowflake-to-Snowflake migrations

Each page provides:

* Detailed syntax for all commands
* Complete option descriptions with examples
* Connection configuration specifics
* Dialect-specific examples
* Troubleshooting tips for that platform
* Best practices for that database type

---

### SQL Server Commands

For complete SQL Server command documentation, see [SQL Server Commands Reference](sqlserver_commands.md).

**Quick Links:**

* [Run Synchronous Validation](sqlserver_commands.md)
* [Run Asynchronous Validation](sqlserver_commands.md)
* [Generate Validation Scripts](sqlserver_commands.md)
* [Get Configuration Templates](sqlserver_commands.md)
* [Auto-Generate Configuration File](sqlserver_commands.md)
* [Connection Configuration](sqlserver_commands.md)
* [Troubleshooting](sqlserver_commands.md)

#### Common Commands

**Run Validation:**

```bash
sdv sqlserver run-validation \
  --data-validation-config-file ./configs/sqlserver_validation.yaml
```

**Generate Scripts:**

```bash
sdv sqlserver generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml
```

**Get Templates:**

```bash
sdv sqlserver get-configuration-files \
  --templates-directory ./templates
```

For complete documentation, see [SQL Server Commands Reference](sqlserver_commands.md).

---

### Teradata Commands

For complete Teradata command documentation, see [Teradata Commands Reference](teradata_commands.md).

**Quick Links:**

* [Run Synchronous Validation](teradata_commands.md)
* [Run Asynchronous Validation](teradata_commands.md)
* [Generate Validation Scripts](teradata_commands.md)
* [Get Configuration Templates](teradata_commands.md)
* [Auto-Generate Configuration File](teradata_commands.md)
* [Connection Configuration](teradata_commands.md)
* [Troubleshooting](teradata_commands.md)

#### Common Commands

**Run Validation:**

```bash
sdv teradata run-validation \
  --data-validation-config-file ./configs/teradata_validation.yaml
```

**Generate Scripts:**

```bash
sdv teradata generate-validation-scripts \
  ./config.yaml \
  --output-directory ./scripts
```

**Get Templates:**

```bash
sdv teradata get-configuration-files \
  --templates-directory ./templates
```

For complete documentation, see [Teradata Commands Reference](teradata_commands.md).

---

### Amazon Redshift Commands

For complete Amazon Redshift command documentation, see [Redshift Commands Reference](redshift_commands.md).

**Quick Links:**

* [Run Synchronous Validation](redshift_commands.md)
* [Run Asynchronous Validation](redshift_commands.md)
* [Generate Validation Scripts](redshift_commands.md)
* [Get Configuration Templates](redshift_commands.md)
* [Auto-Generate Configuration File](redshift_commands.md)
* [Connection Configuration](redshift_commands.md)
* [Troubleshooting](redshift_commands.md)

#### Common Commands

**Run Validation:**

```bash
sdv redshift run-validation \
  --data-validation-config-file ./configs/redshift_validation.yaml
```

**Generate Scripts:**

```bash
sdv redshift generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml
```

**Get Templates:**

```bash
sdv redshift get-configuration-files \
  --templates-directory ./templates
```

For complete documentation, see [Redshift Commands Reference](redshift_commands.md).

---

### Snowflake Commands

For complete Snowflake-to-Snowflake command documentation, see [Snowflake Commands Reference](snowflake_commands.md).

**Quick Links:**

* [Run Synchronous Validation](snowflake_commands.md)
* [Run Asynchronous Validation](snowflake_commands.md)
* [Source Validate](snowflake_commands.md)
* [Generate Validation Scripts](snowflake_commands.md)
* [Get Configuration Templates](snowflake_commands.md)
* [Auto-Generate Configuration File](snowflake_commands.md)
* [Connection Configuration](snowflake_commands.md)
* [Troubleshooting](snowflake_commands.md)

#### Common Commands

**Run Validation:**

```bash
sdv snowflake run-validation \
  --data-validation-config-file ./configs/snowflake_validation.yaml
```

**Generate Scripts:**

```bash
sdv snowflake generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml
```

**Get Templates:**

```bash
sdv snowflake get-configuration-files \
  --templates-directory ./templates
```

For complete documentation, see [Snowflake Commands Reference](snowflake_commands.md).

---

## Configuration File Reference

### Global Configuration

The global configuration section defines the overall behavior of the validation process.

```yaml
# Platform configuration
source_platform: SqlServer  # Options: SqlServer, Teradata, Redshift, Snowflake
target_platform: Snowflake  # Currently only Snowflake is supported

# Output configuration
output_directory_path: /path/to/output/directory

# Threading configuration
max_threads: auto  # Options: "auto" or positive integer (1-32)

# Teradata-specific configuration (required only for Teradata)
target_database: TARGET_DB_NAME

# Directory path for source validation file
source_validation_files_path: /path/to/source_validation_file/directory

# Directory path for target validation file
target_validation_files_path: /path/to/target_validation_file/directory
```

#### Platform Configuration Options

**`source_platform`** (required)

* **Type:** String
* **Valid Values:** `SqlServer`, `Teradata`, `Redshift`, `Snowflake`
* **Description:** The source database platform for validation
* **Example:** `source_platform: SqlServer`

**`target_platform`** (required)

* **Type:** String
* **Valid Values:** `Snowflake`
* **Description:** The target database platform (currently only Snowflake is supported)
* **Example:** `target_platform: Snowflake`

**`output_directory_path`** (required)

* **Type:** String (path)
* **Description:** Directory where validation results, logs, and reports will be saved
* **Example:** `output_directory_path: /home/user/validation_output`

**`max_threads`** (optional)

* **Type:** String or Integer
* **Valid Values:** `"auto"` or positive integer (1-32)
* **Default:** `"auto"`
* **Description:** Controls parallelization for validation operations

  + `"auto"`: Automatically detects optimal thread count based on CPU cores
  + Integer value: Specifies exact number of threads to use
* **Examples:**

  ```yaml
  max_threads: auto        # Auto-detect optimal threads
  max_threads: 4           # Use exactly 4 threads
  max_threads: 16          # Use 16 threads
  ```

**`target_database`** (required for Teradata only)

* **Type:** String
* **Description:** Target database name in Snowflake for Teradata validations
* **Example:** `target_database: PROD_DB`

**`source_validation_files_path`** (optional)

* **Type:** String (path)
* **Description:** Path to the directory containing the source validation files.
* **Example:** `source_validation_files_path: /path/to/source_validation_file/directory`

**`target_validation_files_path`** (optional)

* **Type:** String (path)
* **Description:** Path to the directory containing the target validation files.
* **Example:** `target_validation_files_path: /path/to/targetvalidation_file/directory`

---

### Connection Configuration

Define how to connect to source and target databases.

#### Source Connection Configuration

##### SQL Server Source Connection

```yaml
source_connection:
  mode: credentials
  host: "sqlserver.company.com"
  port: 1433
  username: "sqlserver_user"
  password: "secure_password"
  database: "source_database"
  trust_server_certificate: "no"   # Optional: yes/no
  encrypt: "yes"                   # Optional: yes/no/optional
```

**Connection Fields:**

* **`mode`** (required)

  + **Type:** String
  + **Valid Values:** `credentials`
  + **Description:** Connection mode for SQL Server
* **`host`** (required)

  + **Type:** String
  + **Description:** SQL Server hostname or IP address
  + **Example:** `"sqlserver.company.com"` or `"192.168.1.100"`
* **`port`** (required)

  + **Type:** Integer
  + **Default:** 1433
  + **Description:** SQL Server port number
* **`username`** (required)

  + **Type:** String
  + **Description:** SQL Server authentication username
* **`password`** (required)

  + **Type:** String
  + **Description:** SQL Server authentication password
  + **Security Note:** Consider using environment variables or secret management
* **`database`** (required)

  + **Type:** String
  + **Description:** SQL Server database name
* **`trust_server_certificate`** (optional)

  + **Type:** String
  + **Valid Values:** `"yes"`, `"no"`
  + **Default:** `"no"`
  + **Description:** Whether to trust the server certificate for SSL/TLS connections
* **`encrypt`** (optional)

  + **Type:** String
  + **Valid Values:** `"yes"`, `"no"`, `"optional"`
  + **Default:** `"yes"`
  + **Description:** Connection encryption setting

##### Teradata Source Connection

```yaml
source_connection:
  mode: credentials
  host: "teradata.company.com"
  username: "teradata_user"
  password: "secure_password"
  database: "source_database"
```

**Connection Fields:**

* **`mode`** (required)

  + **Type:** String
  + **Valid Values:** `credentials`
* **`host`** (required)

  + **Type:** String
  + **Description:** Teradata hostname or IP address
* **`username`** (required)

  + **Type:** String
  + **Description:** Teradata authentication username
* **`password`** (required)

  + **Type:** String
  + **Description:** Teradata authentication password
* **`database`** (required)

  + **Type:** String
  + **Description:** Teradata database name

##### Amazon Redshift Source Connection

```yaml
source_connection:
  mode: credentials
  host: "redshift-cluster.region.redshift.amazonaws.com"
  port: 5439
  username: "redshift_user"
  password: "secure_password"
  database: "source_database"
```

**Connection Fields:**

* **`mode`** (required)

  + **Type:** String
  + **Valid Values:** `credentials`
* **`host`** (required)

  + **Type:** String
  + **Description:** Redshift cluster endpoint
* **`port`** (required)

  + **Type:** Integer
  + **Default:** 5439
  + **Description:** Redshift port number
* **`username`** (required)

  + **Type:** String
  + **Description:** Redshift authentication username
* **`password`** (required)

  + **Type:** String
  + **Description:** Redshift authentication password
* **`database`** (required)

  + **Type:** String
  + **Description:** Redshift database name

#### Target Connection (Snowflake)

Snowflake connections support three modes: `name`, `default`, and `credentials` (IPC only and SnowConvert exclusive).

##### Option 1: Named Connection

Use a pre-configured Snowflake connection saved in your Snowflake connections file.

```yaml
target_connection:
  mode: name
  name: "my_snowflake_connection"
```

**Fields:**

* **`mode`** (required): Must be `"name"`
* **`name`** (required): Name of the saved Snowflake connection

##### Option 2: Default Connection

Use the default Snowflake connection from your environment.

```yaml
target_connection:
  mode: default
```

**Fields:**

* **`mode`** (required): Must be `"default"`

##### Option 3: Credentials Mode (IPC Only)

> **Note:** The `credentials` mode is only available when using IPC (Inter-Process Communication) commands directly via CLI parameters, not in YAML configuration files. This mode is exclusive to the SnowConvert UI.

---

### Validation Configuration

Controls which validation levels are executed.

```yaml
validation_configuration:
  schema_validation: true          # Level 1: Schema validation
  metrics_validation: true         # Level 2: Statistical metrics validation
  row_validation: false            # Level 3: Row-level data validation
  max_failed_rows_number: 100      # Maximum failed rows to report (applies only for row validation)
  exclude_metrics: false           # Exclude statistical metrics from validation
  apply_metric_column_modifier: false  # Apply column modifiers for metrics
  custom_templates_path: /path/to/templates  # Optional: Custom query templates
```

**Validation Options:**

* **`schema_validation`** (optional)

  + **Type:** Boolean
  + **Default:** `true`
  + **Description:** Validates table and column schema consistency
  + **Checks:**

    - Column names match between source and target
    - Data types are compatible
    - Column nullability settings
    - Primary key definitions
* **`metrics_validation`** (optional)

  + **Type:** Boolean
  + **Default:** `true`
  + **Description:** Validates statistical metrics for each column
  + **Checks:**

    - Row counts
    - Distinct value counts
    - Null value counts
    - Min/max values
    - Average, sum, standard deviation (for numeric columns)
* **`row_validation`** (optional)

  + **Type:** Boolean
  + **Default:** `false`
  + **Description:** Validates data at the row level using hash-based comparison

    - **Note:** Requires index columns for row identification. If not specified in the configuration, the tool attempts to auto-detect them from primary keys.
  + **Warning:** This is the most resource-intensive validation level
  + **Checks:**

    - MD5 hash comparison of row chunks
    - Identifies specific rows with differences
* **`max_failed_rows_number`** (optional)

  + **Type:** Integer
  + **Default:** 100
  + **Minimum:** 1
  + **Description:** Maximum number of failed rows to report per table
  + **Example:** `max_failed_rows_number: 250`
* **`exclude_metrics`** (optional)

  + **Type:** Boolean
  + **Default:** `false`
  + **Description:** When `true`, excludes certain statistical metrics (avg, sum, stddev, variance) from validation
  + **Use Case:** Useful for large tables where statistical calculations might cause an overflow.
* **`apply_metric_column_modifier`** (optional)

  + **Type:** Boolean
  + **Default:** `true`
  + **Description:** Applies column modifiers defined in metric templates
  + **Use Case:** Advanced users with custom metric calculations
* **`custom_templates_path`** (optional)

  + **Type:** String (path)
  + **Description:** Path to directory containing custom Jinja2 query templates
  + **Example:** `custom_templates_path: /opt/validation/custom_templates`

---

### Table Configuration

Defines which tables to validate and how to validate them.

```yaml
tables:
  # Table 1: Include specific columns
  - fully_qualified_name: database.schema.table1
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - customer_id
      - customer_name
      - email
    index_column_list:
      - customer_id
    where_clause: "status = 'ACTIVE'"
    target_where_clause: "status = 'ACTIVE'"
    is_case_sensitive: false
    chunk_number: 10
    max_failed_rows_number: 50
    column_mappings:
      customer_id: cust_id
      customer_name: name

  # Table 2: Exclude specific columns
  - fully_qualified_name: database.schema.table2
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - audit_timestamp
      - created_by
      - modified_by
    index_column_list: []

  # Table 3: Simple configuration
  - fully_qualified_name: database.schema.table3
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

**Table Configuration Fields:**

* **`fully_qualified_name`** (required)

  + **Type:** String
  + **Format:** `database.schema.table` or `schema.table`
  + **Description:** Full table identifier in the source database
  + **Examples:**

    ```yaml
    fully_qualified_name: my_database.dbo.customers
    fully_qualified_name: public.orders  # For Redshift
    ```
* **`target_database`** (optional)

  + **Type:** String
  + **Default:** Source database name from fully_qualified_name field
  + **Description:** Target database name if different from source database name
  + **Example:**

    ```yaml
    target_database: target_database_name
    ```
* **`target_schema`** (optional)

  + **Type:** String
  + **Default:** Source schema name from fully_qualified_name field
  + **Description:** Target schema name if different from source schema name
  + **Example:**

    ```yaml
    target_schema: target_schema_name
    ```
* **`target_name`** (optional)

  + **Type:** String
  + **Default:** Source table name from fully_qualified_name field
  + **Description:** Target table name if different from source table name
  + **Example:**

    ```yaml
    target_name: customers_new
    ```
* **`use_column_selection_as_exclude_list`** (required)

  + **Type:** Boolean
  + **Description:** Determines how `column_selection_list` is interpreted

    - `false`: Include only the specified columns
    - `true`: Exclude the specified columns (include all others)
  + **Examples:**

    ```yaml
    # Include only these columns
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - id
      - name
      - email

    # Exclude these columns (include all others)
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_notes
      - audit_timestamp
    ```
* **`column_selection_list`** (required)

  + **Type:** List of strings
  + **Description:** List of column names to include or exclude
  + **Note:** Use an empty list `[]` to include all columns
* **`index_column_list`** (optional)

  + **Type:** List of strings
  + **Default:** Auto-detected from primary keys
  + **Description:** Columns to use as unique identifiers for row validation
  + **Use Case:** Specify when the table doesn’t have a primary key or you want to use different columns
  + **Example:**

    ```yaml
    index_column_list:
      - customer_id
      - order_date
    ```
* **`target_index_column_list`** (optional)

  + **Type:** List of strings
  + **Description:** Index columns in the target table (if different from source)
  + **Note:** Automatically derived from `column_mappings` if not specified
* **`where_clause`** (optional)

  + **Type:** String
  + **Default:** `""` (empty, no filter)
  + **Description:** SQL WHERE clause to filter source data (without “WHERE” keyword)
  + **Examples:**

    ```yaml
    where_clause: "created_date >= '2024-01-01'"
    where_clause: "status IN ('ACTIVE', 'PENDING') AND region = 'US'"
    where_clause: "amount > 1000 AND customer_type = 'PREMIUM'"
    ```
* **`target_where_clause`** (optional)

  + **Type:** String
  + **Default:** `""` (empty, no filter)
  + **Description:** SQL WHERE clause to filter target data
  + **Best Practice:** Should match `where_clause` to ensure consistent comparison
  + **Example:**

    ```yaml
    target_where_clause: "created_date >= '2024-01-01'"
    ```
* **`is_case_sensitive`** (optional)

  + **Type:** Boolean
  + **Default:** `false`
  + **Description:** Whether column name matching should be case-sensitive
* **`chunk_number`** (optional)

  + **Type:** Integer
  + **Default:** 0 (no chunking)
  + **Minimum:** 0
  + **Description:** Number of chunks to split row validation into
  + **Use Case:** Large tables benefit from chunking for better performance
  + **Example:**

    ```yaml
    chunk_number: 20  # Split into 20 chunks for parallel processing
    ```
* **`max_failed_rows_number`** (optional)

  + **Type:** Integer
  + **Minimum:** 1
  + **Description:** Maximum failed rows to report for this specific table
  + **Note:** Overrides the global `max_failed_rows_number` setting
* **`column_mappings`** (optional)

  + **Type:** Dictionary (key-value pairs)
  + **Description:** Maps source column names to target column names when they differ
  + **Format:** `source_column_name: target_column_name`
  + **Example:**

    ```yaml
    column_mappings:
      cust_id: customer_id
      cust_name: customer_name
      addr: address
    ```
* **`exclude_metrics`** (optional)

  + **Type:** Boolean
  + **Description:** Exclude metrics validation for this specific table
  + **Note:** Overrides the global `exclude_metrics` setting
* **`apply_metric_column_modifier`** (optional)

  + **Type:** Boolean
  + **Description:** Apply column modifiers for this specific table
  + **Note:** Overrides the global `apply_metric_column_modifier` setting

---

### View Configuration

Views are validated similarly to tables but are configured in a separate `views:` section. View validation creates temporary tables internally to materialize view schema for comparison between source and target systems.

```yaml
views:
  # View 1: Basic view validation
  - fully_qualified_name: database.schema.customer_summary_view
    target_name: customer_summary_view_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  # View 2: View with specific columns and filtering
  - fully_qualified_name: database.schema.sales_report_view
    target_name: sales_report_view_target
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - region
      - total_sales
      - order_count
    where_clause: "fiscal_year = 2024"
    target_where_clause: "fiscal_year = 2024"
    column_mappings:
      region_code: region
```

**View Configuration Fields:**

View configuration uses the same fields as table configuration, with the following key points:

* **`fully_qualified_name`** (required)

  + **Type:** String
  + **Format:** `database.schema.view` or `schema.view`
  + **Description:** Full view identifier in the source database
  + **Examples:**

    ```yaml
    fully_qualified_name: my_database.dbo.customer_summary_view
    fully_qualified_name: public.sales_report_view  # For Redshift
    ```
* **`target_database`** (optional)

  + **Type:** String
  + **Default:** Source database name from fully_qualified_name field
  + **Description:** Target database name if different from source database name
* **`target_schema`** (optional)

  + **Type:** String
  + **Default:** Source schema name from fully_qualified_name field
  + **Description:** Target schema name if different from source schema name
* **`target_name`** (optional)

  + **Type:** String
  + **Default:** Source view name from fully_qualified_name field
  + **Description:** Target view name if different from source view name
* **`use_column_selection_as_exclude_list`** (required)

  + **Type:** Boolean
  + **Description:** Determines how `column_selection_list` is interpreted

    - `false`: Include only the specified columns
    - `true`: Exclude the specified columns (include all others)
* **`column_selection_list`** (required)

  + **Type:** List of strings
  + **Description:** List of column names to include or exclude
  + **Note:** Use an empty list `[]` to include all columns
* **`index_column_list`** (optional)

  + **Type:** List of strings
  + **Description:** Columns to use as unique identifiers for row validation
  + **Use Case:** Required when row validation is enabled
* **`where_clause`** (optional)

  + **Type:** String
  + **Default:** `""` (empty, no filter)
  + **Description:** SQL WHERE clause to filter source data (without “WHERE” keyword)
* **`target_where_clause`** (optional)

  + **Type:** String
  + **Default:** `""` (empty, no filter)
  + **Description:** SQL WHERE clause to filter target data
* **`column_mappings`** (optional)

  + **Type:** Dictionary (key-value pairs)
  + **Description:** Maps source column names to target column names when they differ
* **`chunk_number`** (optional)

  + **Type:** Integer
  + **Default:** 0 (no chunking)
  + **Description:** Number of chunks to split row validation into
* **`max_failed_rows_number`** (optional)

  + **Type:** Integer
  + **Description:** Maximum failed rows to report for this specific view
* **`is_case_sensitive`** (optional)

  + **Type:** Boolean
  + **Default:** `false`
  + **Description:** Whether column name matching should be case-sensitive

**How View Validation Works:**

1. The CLI creates temporary tables from view definitions in both source and target systems
2. Data is extracted from the views into these temporary tables
3. Validation is performed on the temporary tables
4. Temporary tables are cleaned up after validation completes

**Best Practices for View Validation:**

1. **Use filtering for large views:** Apply `where_clause` and `target_where_clause` to limit data volume
2. **Test with small subsets first:** Start with filtered validation before full view validation
3. **Consider view complexity:** Complex views with many joins may take longer to validate
4. **Monitor resource usage:** Views that materialize large datasets consume significant memory

---

### Comparison Configuration

Controls comparison behavior and tolerance levels.

```yaml
comparison_configuration:
  tolerance: 0.01  # 1% tolerance for statistical comparisons
```

**Comparison Options:**

* **`tolerance`** (optional)

  + **Type:** Float
  + **Default:** 0.001 (0.1%)
  + **Description:** Acceptable tolerance for statistical metric differences
  + **Use Case:** Allows for small differences due to rounding or data type conversions
  + **Examples:**

    ```yaml
    tolerance: 0.001   # 0.1% tolerance (very strict)
    tolerance: 0.01    # 1% tolerance (recommended)
    tolerance: 0.05    # 5% tolerance (lenient)
    ```

---

### Logging Configuration

Controls logging behavior for validation operations.

```yaml
logging_configuration:
  level: INFO              # Overall logging level
  console_level: WARNING   # Console-specific level
  file_level: DEBUG        # File-specific level
```

**Logging Options:**

* **`level`** (optional)

  + **Type:** String
  + **Valid Values:** `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`
  + **Default:** `INFO`
  + **Description:** Default logging level for all loggers
  + **Level Descriptions:**

    - `DEBUG`: Detailed diagnostic information
    - `INFO`: General informational messages
    - `WARNING`: Warning messages for potentially problematic situations
    - `ERROR`: Error messages for serious issues
    - `CRITICAL`: Critical errors that may cause application failure
* **`console_level`** (optional)

  + **Type:** String
  + **Valid Values:** `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`
  + **Default:** Same as `level`
  + **Description:** Logging level for console output
  + **Use Case:** Set to `WARNING` or `ERROR` to reduce console noise
* **`file_level`** (optional)

  + **Type:** String
  + **Valid Values:** `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`
  + **Default:** Same as `level`
  + **Description:** Logging level for file output
  + **Use Case:** Set to `DEBUG` for detailed file logs while keeping console clean

**Example Configurations:**

```yaml
# Verbose logging for troubleshooting
logging_configuration:
  level: DEBUG
  console_level: DEBUG
  file_level: DEBUG

# Production-friendly logging
logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: INFO

# Minimal console output with detailed file logs
logging_configuration:
  level: INFO
  console_level: ERROR
  file_level: DEBUG
```

**Note:** CLI `--log-level` parameter overrides configuration file settings.

---

### Database and Schema Mappings

Map source database/schema names to target names when they differ.

```yaml
database_mappings:
  source_db1: target_db1
  source_db2: target_db2
  legacy_database: modern_database

schema_mappings:
  dbo: public
  source_schema: target_schema
  old_schema: new_schema
```

**Mapping Options:**

* **`database_mappings`** (optional)

  + **Type:** Dictionary
  + **Description:** Maps source database names to target database names
  + **Use Case:** When database names differ between source and Snowflake
  + **Example:**

    ```yaml
    database_mappings:
      PROD_SQL: PROD_SNOWFLAKE
      DEV_SQL: DEV_SNOWFLAKE
    ```
* **`schema_mappings`** (optional)

  + **Type:** Dictionary
  + **Description:** Maps source schema names to target schema names
  + **Use Case:** When schema names differ between source and Snowflake
  + **Example:**

    ```yaml
    schema_mappings:
      dbo: PUBLIC
      sales: SALES_DATA
      hr: HUMAN_RESOURCES
    ```

---

## Complete Configuration Examples

### Example 1: SQL Server to Snowflake - Basic Validation

```yaml
# Global configuration
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./validation_results
max_threads: auto

# Source connection
source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: sql_user
  password: sql_password
  database: production_db
  trust_server_certificate: "no"
  encrypt: "yes"

# Target connection
target_connection:
  mode: name
  name: snowflake_prod

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.01

# Tables to validate
tables:
  - fully_qualified_name: production_db.dbo.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: production_db.dbo.orders
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_notes
      - audit_log
    where_clause: "order_date >= '2024-01-01'"
    target_where_clause: "order_date >= '2024-01-01'"
```

### Example 2: Teradata to Snowflake - Comprehensive Validation

```yaml
# Global configuration
source_platform: Teradata
target_platform: Snowflake
output_directory_path: /opt/validation/results
max_threads: 8
target_database: PROD_SNOWFLAKE

# Source connection
source_connection:
  mode: credentials
  host: teradata.company.com
  username: teradata_user
  password: teradata_password
  database: prod_db

# Target connection
target_connection:
  mode: default

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100
  exclude_metrics: false
  apply_metric_column_modifier: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.005

# Logging configuration
logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

# Schema mappings
schema_mappings:
  prod_db: PUBLIC

# Tables configuration
tables:
  - fully_qualified_name: prod_db.sales_data
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - transaction_id
      - customer_id
      - amount
      - transaction_date
    index_column_list:
      - transaction_id
    chunk_number: 10
    max_failed_rows_number: 50

  - fully_qualified_name: prod_db.customer_master
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - credit_card
    where_clause: "status = 'ACTIVE'"
    target_where_clause: "status = 'ACTIVE'"
    column_mappings:
      cust_id: customer_id
      cust_name: customer_name
```

### Example 3: Redshift to Snowflake - Advanced Configuration

```yaml
# Global configuration
source_platform: Redshift
target_platform: Snowflake
output_directory_path: /data/validation/redshift_migration
max_threads: 16

# Source connection
source_connection:
  mode: credentials
  host: redshift-cluster.us-east-1.redshift.amazonaws.com
  port: 5439
  username: redshift_admin
  password: redshift_secure_password
  database: analytics_db

# Target connection
target_connection:
  mode: name
  name: snowflake_analytics

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200
  exclude_metrics: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.02

# Logging configuration
logging_configuration:
  level: INFO
  console_level: ERROR
  file_level: DEBUG

# Database mappings
database_mappings:
  analytics_db: ANALYTICS_PROD

# Schema mappings
schema_mappings:
  public: PUBLIC
  staging: STAGING

# Tables configuration
tables:
  # Large fact table with chunking
  - fully_qualified_name: analytics_db.public.fact_sales
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - sale_id
    chunk_number: 50
    max_failed_rows_number: 500

  # Dimension table with column mappings
  - fully_qualified_name: analytics_db.public.dim_customer
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - customer_key
      - customer_name
      - email
      - phone
      - address
    column_mappings:
      customer_key: cust_key
      customer_name: name
    is_case_sensitive: false

  # Filtered validation
  - fully_qualified_name: analytics_db.staging.incremental_load
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - load_timestamp
      - etl_batch_id
    where_clause: "load_date >= CURRENT_DATE - 7"
    target_where_clause: "load_date >= CURRENT_DATE - 7"
    chunk_number: 10
```

### Example 4: Minimal Configuration

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./output

source_connection:
  mode: credentials
  host: localhost
  port: 1433
  username: sa
  password: password
  database: test_db

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables:
  - fully_qualified_name: test_db.dbo.test_table
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### Example 5: View Validation Configuration

```yaml
# Global configuration
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./view_validation_results
max_threads: auto

# Source connection
source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: sql_user
  password: sql_password
  database: production_db
  trust_server_certificate: "no"
  encrypt: "yes"

# Target connection
target_connection:
  mode: name
  name: snowflake_prod

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Views to validate
views:
  # Basic view validation
  - fully_qualified_name: production_db.dbo.customer_summary_view
    use_column_selection_as_exclude_list: false
    column_selection_list: []

  # View with specific columns
  - fully_qualified_name: production_db.dbo.sales_metrics_view
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - region
      - total_sales
      - order_count
      - avg_order_value

  # View with filtering and column mappings
  - fully_qualified_name: production_db.dbo.active_orders_view
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "order_date >= '2024-01-01'"
    target_where_clause: "order_date >= '2024-01-01'"
    column_mappings:
      ord_id: order_id
      cust_id: customer_id
```

### Example 6: Combined Tables and Views Validation

```yaml
# Global configuration
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./combined_validation
max_threads: 16

# Source connection
source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: validator_user
  password: secure_password
  database: analytics_db
  trust_server_certificate: "no"
  encrypt: "yes"

# Target connection
target_connection:
  mode: name
  name: snowflake_analytics

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

# Comparison configuration
comparison_configuration:
  tolerance: 0.01

# Tables to validate
tables:
  - fully_qualified_name: analytics_db.dbo.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: analytics_db.dbo.orders
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id
    chunk_number: 20

# Views to validate
views:
  - fully_qualified_name: analytics_db.dbo.customer_order_summary
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: analytics_db.dbo.monthly_sales_report
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - month
      - year
      - total_sales
      - total_orders
```

---

## Advanced Usage

### Working with Large Tables

For large tables, consider these optimization strategies:

#### Enable Chunking

```yaml
tables:
  - fully_qualified_name: database.schema.large_table
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    chunk_number: 100  # Split into 100 chunks
    index_column_list:
      - primary_key_column
```

#### Increase Thread Count

```yaml
max_threads: 32  # Use maximum threads for parallel processing
```

#### 3. Filter Data

```yaml
tables:
  - fully_qualified_name: database.schema.large_table
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: "created_date >= '2024-01-01' AND region = 'US'"
    target_where_clause: "created_date >= '2024-01-01' AND region = 'US'"
```

#### Selective Column Validation

```yaml
tables:
  - fully_qualified_name: database.schema.large_table
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - large_text_column
      - large_binary_column
      - xml_column
```

### Using Custom Query Templates

For advanced users, you can provide custom Jinja2 templates:

```yaml
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  custom_templates_path: /opt/custom_templates
```

Custom template directory structure:

```text
/opt/custom_templates/
├── sqlserver_columns_metrics_query.sql.j2
├── sqlserver_row_count_query.sql.j2
├── sqlserver_compute_md5_sql.j2
└── snowflake_columns_metrics_query.sql.j2
```

### Asynchronous Validation Workflow

For environments with restricted database access or long-running validations:

#### Step 1: Generate Scripts

```bash
sdv sqlserver generate-validation-scripts \
  --data-validation-config-file config.yaml
```

This generates SQL scripts in the output directory.

#### Step 2: Execute Scripts Manually

Execute the generated scripts on source and target databases, saving results to CSV files.

#### Step 3: Run Async Validation

```bash
sdv sqlserver run-async-validation \
  --data-validation-config-file config.yaml
```

This compares the pre-generated metadata files.

### CI/CD Integration

Integrate validation into your deployment pipeline:

```bash
#!/bin/bash
# validate_migration.sh

CONFIG_FILE="./configs/validation_config.yaml"
LOG_LEVEL="INFO"

# Run validation
sdv sqlserver run-validation \
  --data-validation-config-file "$CONFIG_FILE" \
  --log-level "$LOG_LEVEL"

# Check exit code
if [ $? -eq 0 ]; then
  echo "✓ Validation passed successfully"
  exit 0
else
  echo "✗ Validation failed"
  exit 1
fi
```

### Handling Multiple Environments

Create separate configuration files for each environment:

```bash
configs/
├── dev_validation.yaml
├── staging_validation.yaml
└── prod_validation.yaml
```

Run validation for specific environment:

```bash
# Development
sdv sqlserver run-validation --data-validation-config-file configs/dev_validation.yaml

# Staging
sdv sqlserver run-validation --data-validation-config-file configs/staging_validation.yaml

# Production
sdv sqlserver run-validation --data-validation-config-file configs/prod_validation.yaml
```

---

## Validation Reports

### Overview

The Snowflake Data Validation tool generates comprehensive CSV reports that document the results of data migration validations between source and target databases. These reports help identify discrepancies in schema, metrics, and row-level data.

### Report Types

The validation tool generates different types of reports based on the validation levels configured:

#### 1. Main Validation Report (`validation_report.csv`)

This consolidated report contains results from schema and metrics validations for all tables.

#### 2. Row Validation Reports (per table)

Separate reports generated for each table when row validation is enabled, containing detailed row-level comparison results.

---

### Main Validation Report Structure

The main validation report contains the following columns:

| Column Name | Description |
| --- | --- |
| **VALIDATION_TYPE** | Type of validation performed: `SCHEMA VALIDATION` or `METRICS VALIDATION` |
| **TABLE** | Fully qualified name of the table being validated (e.g., `database.schema.table`) |
| **COLUMN_VALIDATED** | Name of the column being validated (or table-level attribute for schema validation) |
| **EVALUATION_CRITERIA** | The specific property being compared (e.g., `DATA_TYPE`, `NULLABLE`, `ROW_COUNT`, `min`, `max`) |
| **SOURCE_VALUE** | The value from the source database |
| **SNOWFLAKE_VALUE** | The value from the target (Snowflake) database |
| **STATUS** | Validation result status (see Status Values section below) |
| **COMMENTS** | Additional context or explanation for the validation result |

---

### Validation Types Explained

#### Schema Validation

Compares structural metadata between source and target tables:

* **Column existence**: Ensures columns present in source exist in target
* **Data types**: Validates column data types match (with configurable mappings)
* **Nullability**: Checks if NULL constraints match between source and target
* **Primary keys**: Verifies primary key definitions
* **Column precision/scale**: Validates numeric precision and scale values
* **Character length**: Compares VARCHAR/CHAR column lengths

**Example Schema Validation Row:**

```text
SCHEMA VALIDATION,mydb.myschema.customers,customer_id,DATA_TYPE,INT,NUMBER,FAILURE,"Data type mismatch detected"
```

#### Metrics Validation

Compares statistical metrics calculated on column data:

* **Row count**: Total number of rows in the table
* **min**: Minimum value in numeric/date columns
* **max**: Maximum value in numeric/date columns
* **count**: Count of non-null values
* **count_distinct**: Number of distinct values
* **avg**, **sum**, **stddev**, **variance**: Statistical measures (can be excluded via configuration)

**Example Metrics Validation Row:**

```text
METRICS VALIDATION,mydb.myschema.orders,order_date,max,2024-12-31,2024-12-31,SUCCESS,""
```

#### Row Validation

Generates separate per-table reports comparing actual row data using MD5 checksums to detect differences.

---

### Row Validation Report Structure

Row validation reports have a different structure focused on identifying specific rows with differences:

| Column Name | Description |
| --- | --- |
| **ROW_NUMBER** | Sequential row number in the report |
| **TABLE_NAME** | Fully qualified table name |
| **RESULT** | Outcome of the row comparison (see Result Values below) |
| **[INDEX_COLUMNS]_SOURCE** | Primary key/index column values from source |
| **[INDEX_COLUMNS]_TARGET** | Primary key/index column values from target |
| **SOURCE_QUERY** | SQL query to retrieve the row from source database |
| **TARGET_QUERY** | SQL query to retrieve the row from target database |

---

### Status Values

The **STATUS** column in the main validation report can have the following values:

| Status | Meaning |
| --- | --- |
| **SUCCESS** | Validation passed - values match between source and target |
| **FAILURE** | Validation failed - values differ between source and target |
| **WARNING** | Potential issue detected that may require attention |
| **NOT_FOUND_SOURCE** | Element exists in target but not in source |
| **NOT_FOUND_TARGET** | Element exists in source but not in target |

---

### Result Values (Row Validation)

The **RESULT** column in row validation reports can have the following values:

| Result | Meaning |
| --- | --- |
| **SUCCESS** | Row data matches between source and target |
| **FAILURE** | Row data differs between source and target (MD5 checksum mismatch) |
| **NOT_FOUND_SOURCE** | Row exists in target but not in source |
| **NOT_FOUND_TARGET** | Row exists in source but not in target |

---

### Understanding Validation Results

#### Interpreting Schema Validation Results

**Success Scenario:**

* All columns exist in both source and target
* Data types match (considering configured type mappings)
* Nullability constraints are consistent
* All structural attributes align

**Common Failure Scenarios:**

1. **Data Type Mismatch**

   * Source: `VARCHAR(50)`, Target: `VARCHAR(100)`
   * Status: May be SUCCESS if within tolerance, or FAILURE if strict matching is required
2. **Missing Column**

   * Source has column `phone_number`, target does not
   * Status: NOT_FOUND_TARGET
3. **Nullability Difference**

   * Source: `NOT NULL`, Target: `NULL`
   * Status: FAILURE

#### Interpreting Metrics Validation Results

**Success Scenario:**

* Row counts match exactly
* Statistical metrics are within configured tolerance (default: 0.1%)
* All calculated metrics align between source and target

**Common Failure Scenarios:**

1. **Row Count Mismatch**

   * Source: 10,000 rows, Target: 9,998 rows
   * Status: FAILURE
   * Action: Investigate missing rows
2. **Min/Max Value Differences**

   * Source max date: `2024-12-31`, Target max date: `2024-12-30`
   * Status: FAILURE
   * Action: Check for incomplete data migration
3. **Statistical Variance**

   * Source count_distinct: 1,000, Target count_distinct: 995
   * Status: FAILURE (if beyond tolerance)
   * Action: Investigate potential duplicates or missing values

#### Interpreting Row Validation Results

**Success Scenario:**

* All rows in source have matching rows in target (by MD5 checksum)
* No orphaned rows in either database
* Primary key values align correctly

**Common Failure Scenarios:**

1. **Row Content Mismatch**

   * Same primary key, different column values
   * Result: FAILURE
   * Action: Use provided SQL queries to investigate specific differences
2. **Missing Rows**

   * Row exists in source but not in target
   * Result: NOT_FOUND_TARGET
   * Action: Check migration completeness
3. **Extra Rows**

   * Row exists in target but not in source
   * Result: NOT_FOUND_SOURCE
   * Action: Investigate unexpected data in target

---

### Using the Reports

#### Quick Assessment

1. **Filter by STATUS column**: Focus on `FAILURE`, `WARNING`, `NOT_FOUND_SOURCE`, and `NOT_FOUND_TARGET` rows
2. **Group by VALIDATION_TYPE**: Assess schema issues separately from metrics issues
3. **Group by TABLE**: Identify which tables have the most issues

#### Investigating Failures

**For Schema Validation:**

1. Review the `EVALUATION_CRITERIA` to understand what attribute failed
2. Compare `SOURCE_VALUE` vs `SNOWFLAKE_VALUE`
3. Check if differences are acceptable (e.g., VARCHAR size increase)
4. Update type mappings or schema definitions if needed

**For Metrics Validation:**

1. Review the metric that failed (e.g., `row_count`, `max`, `min`)
2. Calculate the difference magnitude
3. Determine if within acceptable business tolerance
4. Use the detailed queries to investigate source of discrepancy

**For Row Validation:**

1. Open the table-specific row validation report
2. Identify rows with `FAILURE` status
3. Use the provided `SOURCE_QUERY` and `TARGET_QUERY` to retrieve actual row data
4. Compare column-by-column to identify specific field differences
5. Investigate why values differ (data type conversion, truncation, transformation)

---

### Configuration Options Affecting Reports

#### Tolerance Settings

The `comparison_configuration.tolerance` setting affects metrics validation:

```yaml
comparison_configuration:
  tolerance: 0.01  # 1% tolerance for numeric comparisons
```

* Values within tolerance are marked as SUCCESS
* Values beyond tolerance are marked as FAILURE

#### Validation Levels

Control which validations run and therefore which reports are generated:

```yaml
validation_configuration:
  schema_validation: true    # Validates table/column structure
  metrics_validation: true   # Validates statistical metrics
  row_validation: false      # Validates individual row data (resource intensive)
```

#### Excluded Metrics

Exclude specific metrics from validation:

```yaml
validation_configuration:
  exclude_metrics: true  # Excludes avg, sum, stddev, variance
```

#### Maximum Failed Rows

Limit the number of failed rows reported in row validation:

```yaml
validation_configuration:
  max_failed_rows_number: 100  # Report up to 100 failed rows per table
```

---

### Report File Locations

Reports are generated in the configured output directory:

```text
<output_directory_path>/
├── <timestamp>_validation_report.csv          # Main consolidated report
├── <timestamp>_database.schema.table1_1.csv   # Row validation for table1
├── <timestamp>_database.schema.table2_2.csv   # Row validation for table2
└── data_validation_<timestamp>.log            # Detailed execution log
```

**File naming convention:**

* Timestamp format: `YYYY-MM-DD_HH-MM-SS`
* Row validation reports include table name and unique ID to prevent collisions

---

### Best Practices

1. **Start with Schema Validation**: Ensure structural alignment before validating data
2. **Use Appropriate Tolerance**: Set realistic tolerance thresholds for metrics validation
3. **Selective Row Validation**: Enable row validation only for critical tables (resource intensive)
4. **Iterative Approach**: Fix schema issues first, then metrics, then row-level differences
5. **Document Acceptable Differences**: Some type conversions or value transformations may be expected
6. **Automate Report Analysis**: Use scripts to parse CSV reports and flag critical issues
7. **Preserve Reports**: Archive validation reports for audit trails and compliance

---

### Troubleshooting Report Issues

#### All Validations Showing FAILURE

**Possible Causes:**

* Incorrect database/schema mappings in configuration
* Type mapping file not loaded correctly
* Connection to wrong target database

**Solution:** Verify `database_mappings`, `schema_mappings`, and connection settings

#### Row Validation Shows All NOT_FOUND_TARGET

**Possible Causes:**

* Target table empty or not migrated yet
* Incorrect target table name
* Primary key/index columns mismatch

**Solution:** Verify target table exists and contains data, check column mappings

#### Metrics Validation Shows Large Differences

**Possible Causes:**

* Incomplete data migration
* Data type conversion issues causing value changes
* Filter/WHERE clause differences between source and target queries

**Solution:** Review migration logs, verify row counts first, check data transformations

#### Report File Not Generated

**Possible Causes:**

* Output directory doesn’t exist or lacks write permissions
* Validation configuration has all levels set to false
* Application crashed before report generation

**Solution:** Check output path permissions, review logs for errors, enable at least one validation level

---

## Troubleshooting

### Common Issues and Solutions

#### Issue: “Configuration file not found”

**Symptom:**

```sql
Configuration file not found: ./config.yaml
```

**Solution:**

* Verify the file path is correct
* Use absolute paths: `/home/user/configs/validation.yaml`
* Check file permissions

#### Issue: “Connection error”

**Symptom:**

```sql
Connection error: Unable to connect to database
```

**Solutions:**

1. **Verify connection parameters:**

   ```yaml
   source_connection:
     host: correct-hostname.com  # Verify hostname
     port: 1433                  # Verify port
     username: valid_user        # Verify username
     password: correct_password  # Verify password
   ```
2. **Check network connectivity:**

   ```bash
   # Test connection to SQL Server
   telnet sqlserver.company.com 1433

   # Test connection to Teradata
   telnet teradata.company.com 1025

   # Test connection to Redshift
   telnet redshift-cluster.amazonaws.com 5439
   ```
3. **Verify firewall rules** allow connections from your machine
4. **For SQL Server SSL issues:**

   ```yaml
   source_connection:
     trust_server_certificate: "yes"  # Try this if SSL errors occur
     encrypt: "optional"
   ```

#### Issue: “Invalid parameter”

**Symptom:**

```sql
Invalid parameter: max_threads must be 'auto' or a positive integer
```

**Solution:**

```yaml
# Correct
max_threads: auto
max_threads: 4

# Incorrect
max_threads: "4"      # Remove quotes
max_threads: 0        # Must be positive
max_threads: -1       # Must be positive
```

#### Issue: “Table not found”

**Symptom:**

```sql
Table not found: database.schema.table_name
```

**Solutions:**

1. **Verify fully qualified name:**

   ```yaml
   # For SQL Server and Teradata
   fully_qualified_name: database_name.schema_name.table_name

   # For Redshift (no database in FQN)
   fully_qualified_name: schema_name.table_name
   ```
2. **Check case sensitivity:**

   ```yaml
   tables:
     - fully_qualified_name: MyDatabase.MySchema.MyTable
       is_case_sensitive: true  # Match exact case
   ```
3. **Verify table exists in source database**

#### Issue: “YAML formatting error”

**Symptom:**

```sql
Error in the format of config.yaml
```

**Solutions:**

1. **Check indentation (use spaces, not tabs)**

   ```yaml
   # Correct
   tables:
     - fully_qualified_name: db.schema.table
       column_selection_list:
         - column1
         - column2
   ```

   Incorrect example (mixed indentation with tabs):

   ```text
   tables:
   	- fully_qualified_name: db.schema.table
   	  column_selection_list:
   	    - column1
   ```
2. **Quote special characters:**

   ```yaml
   password: "p@ssw0rd!"    # Quote passwords with special chars
   where_clause: "name = 'O''Brien'"  # Escape quotes
   ```
3. **Validate YAML syntax** using online validators or:

   ```bash
   python -c "import yaml; yaml.safe_load(open('config.yaml'))"
   ```

#### Issue: “Validation fails with tolerance errors”

**Symptom:**

```sql
Metrics validation failed: Difference exceeds tolerance
```

**Solution:**
Adjust tolerance in configuration:

```yaml
comparison_configuration:
  tolerance: 0.05  # Increase to 5% tolerance
```

#### Issue: “Out of memory errors with large tables”

**Solutions:**

1. **Enable chunking:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 100  # Process in smaller chunks
   ```
2. **Reduce thread count:**

   ```yaml
   max_threads: 4  # Use fewer threads
   ```
3. **Filter data:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       where_clause: "created_date >= '2024-01-01'"
   ```
4. **Exclude large columns:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       use_column_selection_as_exclude_list: true
       column_selection_list:
         - large_blob_column
         - large_text_column
   ```

### Getting Help

1. **Check logs:** Review log files in the output directory
2. **Enable debug logging:**

   ```bash
   sdv sqlserver run-validation \
     --data-validation-config-file config.yaml \
     --log-level DEBUG
   ```
3. **Review validation reports** in the output directory
4. **Consult documentation:** [Full Documentation](https://github.com/snowflake-eng/migrations-data-validation)
5. **Report issues:** Email us at:[snowconvert-support@snowflake.com](mailto:snowconvert-support%40snowflake.com)

---
title: Snowflake Data Validation CLI - Quick Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/CLI_QUICK_REFERENCE.md
section: Migrations
---

# Snowflake Data Validation CLI - Quick Reference

This quick reference guide provides a condensed overview of commands, configuration options, and common usage patterns for the Snowflake Data Validation CLI tool, designed for easy lookup during validation tasks.

---

## Installation

## Prerequisites

Before running the commands below, ensure that Python 3.10 or later and pip are installed on your system.

```bash
# Base installation
pip install snowflake-data-validation

# With source-specific drivers
pip install "snowflake-data-validation[sqlserver]"
pip install "snowflake-data-validation[teradata]"
pip install "snowflake-data-validation[redshift]"
```

---

## Command Structure

```bash
snowflake-data-validation <dialect> <command> [options]
# or
sdv <dialect> <command> [options]
```

**Dialects:** `sqlserver` | `teradata` | `redshift` | `snowflake`

---

## Common Commands

### `run-validation`

```bash
# SQL Server
sdv sqlserver run-validation --data-validation-config-file config.yaml

# Teradata
sdv teradata run-validation --data-validation-config-file config.yaml

# Redshift
sdv redshift run-validation --data-validation-config-file config.yaml

# Snowflake (Snowflake-to-Snowflake)
sdv snowflake run-validation --data-validation-config-file config.yaml
```

### `generate-validation-scripts`

```bash
sdv <dialect> generate-validation-scripts --data-validation-config-file config.yaml
```

### `run-async-validation`

```bash
sdv <dialect> run-async-validation --data-validation-config-file config.yaml
```

### `get-configuration-files`

```bash
sdv <dialect> get-configuration-files --templates-directory ./templates
```

### `auto-generated-configuration-file`

```bash
sdv <dialect> auto-generated-configuration-file
```

### `row-partitioning-helper`

```bash
sdv <dialect> row-partitioning-helper
```

Interactive command to partition large tables by rows for more efficient validation.

### `column-partitioning-helper`

```bash
sdv <dialect> column-partitioning-helper
```

Interactive command to partition wide tables by columns for more efficient validation.

---

## Configuration Template

This template provides the core structure for configuring data validation jobs, defining source and target connections, validation rules, and table-specific settings that control how data is compared between your source database and Snowflake.

```yaml
# GLOBAL
source_platform: SqlServer  # SqlServer | Teradata | Redshift | Snowflake
target_platform: Snowflake
output_directory_path: ./output
max_threads: auto  # "auto" or 1-32
target_database: teradataTargetDatabase # For Teradata sources only - specify target database

# SOURCE CONNECTION
source_connection:
  mode: credentials
  host: "hostname"
  port: 1433
  username: "user"
  password: "pass"
  database: "db"
  # SQL Server only:
  trust_server_certificate: "no"  # yes | no
  encrypt: "yes"  # yes | no | optional

# TARGET CONNECTION
target_connection:
  mode: name  # name | default
  name: "connection_name"  # if mode=name

# VALIDATION
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false
  max_failed_rows_number: 100
  exclude_metrics: false
  apply_metric_column_modifier: false

# COMPARISON
comparison_configuration:
  tolerance: 0.01  # 1% tolerance

# LOGGING (optional)
logging_configuration:
  level: INFO  # DEBUG | INFO | WARNING | ERROR | CRITICAL
  console_level: WARNING
  file_level: DEBUG

# MAPPINGS (optional)
database_mappings:
  source_db: target_db

schema_mappings:
  source_schema: target_schema

# TABLES
tables:
  - fully_qualified_name: database.schema.table1
    target_name: table1_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: []
    where_clause: ""
    target_where_clause: ""
    chunk_number: 0
    column_mappings: {}

# VIEWS
views:
  - fully_qualified_name: database.schema.view1
    target_name: view1_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [ID]
    target_index_column_list: [ID]
    where_clause: ""
    target_where_clause: ""
    chunk_number: 0
    column_mappings: {}
```

---

## Table Configuration Examples

### Include All Columns

```yaml
- fully_qualified_name: db.schema.table
  use_column_selection_as_exclude_list: false
  column_selection_list: []
```

### Include Specific Columns

```yaml
- fully_qualified_name: db.schema.table
  use_column_selection_as_exclude_list: false
  column_selection_list:
    - column1
    - column2
    - column3
```

### Exclude Specific Columns

```yaml
- fully_qualified_name: db.schema.table
  use_column_selection_as_exclude_list: true
  column_selection_list:
    - audit_timestamp
    - internal_notes
```

### With Filtering

```yaml
- fully_qualified_name: db.schema.table
  use_column_selection_as_exclude_list: false
  column_selection_list: []
  where_clause: "status = 'ACTIVE' AND created_date >= '2024-01-01'"
  target_where_clause: "status = 'ACTIVE' AND created_date >= '2024-01-01'"
```

### With Column Mappings

```yaml
- fully_qualified_name: db.schema.table
  use_column_selection_as_exclude_list: false
  column_selection_list: []
  column_mappings:
    source_col1: target_col1
    source_col2: target_col2
```

### Large Table with Chunking

```yaml
- fully_qualified_name: db.schema.large_table
  use_column_selection_as_exclude_list: false
  column_selection_list: []
  index_column_list:
    - primary_key
  chunk_number: 50
  max_failed_rows_number: 500
```

---

## View Configuration Examples

Views are validated similarly to tables but are configured in a separate `views:` section. View validation creates temporary tables internally to materialize view schema for comparison.

### Basic View Validation

```yaml
views:
  - fully_qualified_name: db.schema.customer_view
    target_name: customer_view_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

### View with Column Selection

```yaml
views:
  - fully_qualified_name: db.schema.sales_summary_view
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - region
      - total_sales
      - sales_count
```

### View with Filtering

```yaml
views:
  - fully_qualified_name: db.schema.active_users_view
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    where_clause: status = 'ACTIVE'
    target_where_clause: status = 'ACTIVE'
```

### View with Column Mappings

```yaml
views:
  - fully_qualified_name: db.schema.legacy_report_view
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    column_mappings:
      old_col_name: new_col_name
      legacy_id: id
```

### Combined Tables and Views Configuration

```yaml
tables:
  - fully_qualified_name: db.schema.customers
    target_name: customers_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []

views:
  - fully_qualified_name: db.schema.customer_summary_view
    target_name: customer_summary_view_target
    use_column_selection_as_exclude_list: false
    column_selection_list: []
```

---

## Connection Examples

### SQL Server

```yaml
source_connection:
  mode: credentials
  host: "sqlserver.company.com"
  port: 1433
  username: "sql_user"
  password: "sql_pass"
  database: "prod_db"
  trust_server_certificate: "no"
  encrypt: "yes"
```

### Teradata

```yaml
source_connection:
  mode: credentials
  host: "teradata.company.com"
  username: "td_user"
  password: "td_pass"
  database: "prod_db"
```

### Redshift

```yaml
source_connection:
  mode: credentials
  host: "cluster.region.redshift.amazonaws.com"
  port: 5439
  username: "rs_user"
  password: "rs_pass"
  database: "prod_db"
```

### Snowflake Target (Named) (See more info here: https://docs.snowflake.com/en/developer-guide/snowflake-cli/connecting/configure-connections)

```yaml
target_connection:
  mode: name
  name: "my_snowflake_connection"
```

### Snowflake Target (Default)

```yaml
target_connection:
  mode: default
```

### Snowflake Source (for Snowflake-to-Snowflake validation)

```yaml
source_connection:
  mode: name
  name: "my_source_snowflake_connection"
```

---

## Validation Levels

| Level | Type | Description | Cost |
| --- | --- | --- | --- |
| **1** | Schema | Column names, types, nullability | Low |
| **2** | Metrics | Row counts, distinct values, min/max, avg | Medium |
| **3** | Row | Hash-based row comparison | High |

---

## Common CLI Options

| Option | Short | Description | Default |
| --- | --- | --- | --- |
| `--data-validation-config-file` | `-dvf` | Config file path | Required |
| `--log-level` | `-ll` | Log level | INFO |
| `--templates-directory` | `-td` | Template output dir | Current dir |
| `--query-templates` |  | Include query templates | false |
| `--output-directory` |  | Results directory | From config |

---

## Log Levels

* **DEBUG**: Detailed diagnostic information
* **INFO**: General informational messages
* **WARNING**: Warning messages
* **ERROR**: Error messages
* **CRITICAL**: Critical errors

---

## Configuration Field Reference

### Required Fields

* `source_platform`
* `target_platform`
* `output_directory_path`
* `source_connection`
* `target_connection`
* `tables` or `views` (at least one must have entries)

### Optional Fields

* `max_threads` (default: “auto”)
* `target_database` (required for Teradata)
* `validation_configuration`
* `comparison_configuration`
* `logging_configuration`
* `database_mappings`
* `schema_mappings`

---

## Table Configuration Fields

| Field | Required | Type | Description |
| --- | --- | --- | --- |
| `fully_qualified_name` | ✓ | String | Full table identifier |
| `use_column_selection_as_exclude_list` | ✓ | Boolean | Include/exclude mode |
| `column_selection_list` | ✓ | List | Columns to include/exclude |
| `index_column_list` |  | List | Primary key columns |
| `where_clause` |  | String | Source filter |
| `target_where_clause` |  | String | Target filter |
| `chunk_number` |  | Integer | Number of chunks (0=off) |
| `max_failed_rows_number` |  | Integer | Max failures to report |
| `column_mappings` |  | Dict | Source→Target mappings |
| `is_case_sensitive` |  | Boolean | Case-sensitive matching |

---

## View Configuration Fields

View configuration uses the same fields as table configuration. Views are defined in a separate `views:` section.

| Field | Required | Type | Description |
| --- | --- | --- | --- |
| `fully_qualified_name` | ✓ | String | Full view identifier |
| `use_column_selection_as_exclude_list` | ✓ | Boolean | Include/exclude mode |
| `column_selection_list` | ✓ | List | Columns to include/exclude |
| `index_column_list` |  | List | Index columns for row validation |
| `where_clause` |  | String | Source filter |
| `target_where_clause` |  | String | Target filter |
| `chunk_number` |  | Integer | Number of chunks (0=off) |
| `max_failed_rows_number` |  | Integer | Max failures to report |
| `column_mappings` |  | Dict | Source→Target mappings |
| `is_case_sensitive` |  | Boolean | Case-sensitive matching |
| `target_database` |  | String | Override target database |
| `target_schema` |  | String | Override target schema |
| `target_name` |  | String | Override target view name |

**Note:** Views use temporary tables internally to materialize the schema of the view for validation.

---

## Performance Tips

### For Large Tables

1. Enable chunking:

   ```yaml
   chunk_number: 100
   ```
2. Increase threads:

   ```yaml
   max_threads: 32
   ```
3. Filter data:

   ```yaml
   where_clause: "date >= '2024-01-01'"
   ```
4. Exclude large columns:

   ```yaml
   use_column_selection_as_exclude_list: true
   column_selection_list:
     - large_blob
     - large_text
   ```
5. Skip row validation initially:

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: true
     row_validation: false  # Enable after initial validation
   ```

---

## Common Issues

### Connection Failed

```yaml
# SQL Server SSL issues
trust_server_certificate: "yes"
encrypt: "optional"
```

### Out of Memory

```yaml
# Reduce parallelism
max_threads: 4

# Enable chunking
chunk_number: 50
```

### Tolerance for Numerical Differences

```yaml
# Increase tolerance
comparison_configuration:
  tolerance: 0.05  # 5%
```

### YAML Syntax Errors

* Use spaces, not tabs
* Quote special characters in YAML: :code:`password: "p@ssw0rd!"`
* If a string value starts or ends with a double quote, escape the double quotes “table #1”
* Escape quotes: `name = 'O''Brien'`

---

## Asynchronous Workflow

The asynchronous workflow allows you to decouple script generation from execution, which is useful when you need to run validation queries manually on the source database or when you have restricted access that requires scheduled execution.

1. Generate the validation scripts:

   ```bash
   sdv sqlserver generate-validation-scripts --data-validation-config-file config.yaml
   ```
2. Execute the generated scripts manually on your source database and save the results to CSV files in the output directory.
3. Run the async validation to compare the saved results:

   ```bash
   sdv sqlserver run-async-validation --data-validation-config-file config.yaml
   ```

---

## Example Workflows

### Basic Validation

1. Get the configuration templates:

   ```bash
   sdv sqlserver get-configuration-files
   ```
2. Edit the generated `config.yaml` file to configure your source and target connections, validation settings, and tables.
3. Run the validation:

   ```bash
   sdv sqlserver run-validation --data-validation-config-file config.yaml
   ```

### Interactive Setup

1. Generate a configuration file interactively by answering prompts:

   ```bash
   sdv sqlserver auto-generated-configuration-file
   ```
2. Run the validation using the generated configuration:

   ```bash
   sdv sqlserver run-validation --data-validation-config-file generated_config.yaml
   ```

### Debug Mode

To troubleshoot issues or get detailed execution information, run validation with debug logging:

```bash
sdv sqlserver run-validation \
  --data-validation-config-file config.yaml \
  --log-level DEBUG
```

### Large Table Partitioning

For validating very large tables, use the partitioning helper to divide tables into smaller segments:

1. Create a configuration file with your table definitions
2. Run the partitioning helper:

   ```bash
   sdv sqlserver row-partitioning-helper
   ```
3. Follow the prompts to specify partition columns and counts
4. Run validation with the partitioned configuration

---

## Output Files

Generated in `output_directory_path`:

* **Validation reports:** Schema, metrics, row comparison results
* **Log files:** `data_validation_YYYY-MM-DD_HH-MM-SS.log`
* **Difference files:** `differencesL1.csv`, `differencesL2.csv`
* **Generated scripts:** (when using `generate-validation-scripts`)

---

## Environment Variables

For Snowflake connections using default mode, configure:

```bash
export SNOWFLAKE_ACCOUNT="account_name"
export SNOWFLAKE_USER="username"
export SNOWFLAKE_PASSWORD="password"
export SNOWFLAKE_DATABASE="database"
export SNOWFLAKE_SCHEMA="schema"
export SNOWFLAKE_WAREHOUSE="warehouse"
export SNOWFLAKE_ROLE="role"
```

---

## Help Commands

```bash
# Main help
sdv --help

# Dialect-specific help
sdv sqlserver --help
sdv teradata --help
sdv redshift --help
sdv snowflake --help

# Command-specific help
sdv sqlserver run-validation --help
sdv sqlserver generate-validation-scripts --help
sdv sqlserver row-partitioning-helper --help
sdv sqlserver column-partitioning-helper --help
```

---

## Resources

* **Full Documentation:** <CLI_USAGE_GUIDE.md>
* **SQL Server Commands:** <sqlserver_commands.md>
* **Teradata Commands:** <teradata_commands.md>
* **Redshift Commands:** <redshift_commands.md>
* **Snowflake Commands:** <snowflake_commands.md>
* **Configuration Examples:** <CONFIGURATION_EXAMPLES.md>

---
title: Snowflake Migration Plugin
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/migration-skill/skill.md
section: Migrations
---

# Snowflake Migration Plugin

The Snowflake Migration Plugin is an AI-powered plugin for [Cortex Code](https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code) that guides you through an end-to-end database migration to Snowflake. It provides a conversational, interactive workflow — from connecting to your source database through code conversion, deployment, data migration, and validation.

The plugin is organized as a **skill tree**: a hierarchy of AI instructions that detect where you are in the migration lifecycle and route you to the right action automatically. You can follow the prescribed path from start to finish, or jump directly to any capability at any time.

**Supported sources:** SQL Server, Amazon Redshift

---

## Download

Download the latest release from:

```text
https://snowconvert.snowflake.com/storage/linux/beta/plugins/migration-plugin-pr.zip
```

---

## Prerequisites

Before starting a migration, ensure you have:

* A Snowflake account with a connection configured in `~/.snowflake/connections.toml`.
* A source database (SQL Server or Redshift) accessible from your machine.
* [Cortex Code](https://docs.snowflake.com/en/user-guide/cortex-code/cortex-code) installed.

All other dependencies (`uv`, `scai`, Python packages) are installed automatically on first launch.

---

## Quick Start

**1. Launch Cortex Code with the plugin:**

```shell
cortex --plugin-dir /path/to/migration-plugin/
```

**2. Start migrating:**

```text
Migrate my database to Snowflake
```

The agent reads your project state, shows a progress checklist, and picks up where you left off. If no project exists, it starts from the beginning.

**Tip:** You can also combine the plugin with a profile:

```shell
cortex --plugin-dir /path/to/migration-plugin/ --profile <your-profile>
```

---

## Migration Workflow

The plugin guides you through six stages. At every session start, the agent detects your current progress and resumes from where you left off.

| Stage | Name | What happens |
| --- | --- | --- |
| 1 | **Connect** | Set up a connection to your source database (SQL Server or Redshift) |
| 2 | **Init** | Create a local migration project |
| 3 | **Register** | Extract DDL and code from the source database, or import local `.sql` files |
| 4 | **Convert** | Translate source SQL to Snowflake-compatible SQL via SnowConvert |
| 5 | **Assess** | Generate an interactive assessment report covering deployment waves, object exclusions, dynamic SQL patterns, and ETL/SSIS analysis |
| 6 | **Migrate** | Deploy objects, migrate data, validate output, and fix errors — wave by wave |

Each time you start a session, the agent presents a live progress checklist:

```text
✅  1. Connect             — Connected to SQL Server
✅  2. Init                — Project initialized
✅  3. Register            — 342 objects registered
◐   4. Initial Conv        — 280/342 converted
⬚   5. Assess              — Not run
⬚   6. Migrate Objects     — 0/120 tables deployed
```

---

## Skills Reference

The plugin is organized as a skill tree. The root skill detects your project state and delegates to the right sub-skill. You can also invoke any skill directly by describing what you want.

### Setup Skills (Stages 1–5)

#### connection

Walks you through connecting to your source database. The agent collects credentials, tests the connection, and saves it for reuse across sessions. Supports:

* **SQL Server** — configures ODBC driver, host, port, and authentication.
* **Amazon Redshift** — configures host, port, database, and IAM or password authentication.

#### register-code-units

Gets source code into the migration project. Two paths are available:

| Path | When to use |
| --- | --- |
| **Extract from database** | You have a live source connection and want the agent to pull DDL and object code directly |
| **Import local files** | You already have `.sql` files on disk and want to import them into the project |

#### convert

Runs SnowConvert to translate your source SQL (T-SQL or Redshift SQL) into Snowflake-compatible SQL. After conversion, the agent presents:

* Total objects converted successfully.
* EWI (Early Warning Issue) summary broken down by severity (errors, warnings, informational).
* A list of objects that require manual review.

#### assessment

Generates an interactive multi-tab HTML report. The assessment includes four analyses that can be run individually or together:

| Analysis | What it does |
| --- | --- |
| **Deployment Waves** | Analyzes object dependencies to produce an ordered deployment sequence. Objects within a wave have no inter-dependencies; waves are ordered so dependencies are always deployed first. |
| **Object Exclusion** | Identifies objects that do not need migration: temporary tables, staging objects, deprecated objects, and test artifacts. Reduces scope before deployment. |
| **Dynamic SQL Analysis** | Classifies and scores Dynamic SQL patterns in your converted code. Identifies patterns that Snowflake handles natively, patterns requiring manual rewrite, and patterns with elevated migration complexity. |
| **ETL/SSIS Assessment** | Analyzes SSIS packages individually: classifies each package (Ingestion, Transformation, Export, Orchestration, Hybrid), maps control and data flow, and estimates migration effort. |

The report is generated as a single self-contained HTML file. You can iterate on the wave plan interactively — for example, reprioritizing objects, adjusting wave sizes, or relocating specific objects — before locking it for deployment.

### Migration Skills (Stage 6)

#### migrate-objects

The main deploy loop. Processes all objects in the current wave in dependency order:

| Object type | What happens |
| --- | --- |
| **Tables** | Deployed to Snowflake, then data is migrated from the source. |
| **Views** | Deployed to Snowflake. Blocked views retry after their dependent functions/procedures pass. |
| **Functions & Procedures** | Deployed, tested against source output, and fixed if tests fail. The loop repeats until tests pass or the user decides to skip. |

After each wave completes, the agent automatically advances to the next wave.

#### baseline-capture

Captures the expected output of source stored procedures and functions for use as test baselines. Two approaches are supported:

| Approach | When to use |
| --- | --- |
| **Query Logs** | You have CSV logs of real `EXEC` or `CALL` statements from your source system. The agent parses these to extract parameters and expected outputs. |
| **AI-Assisted** | No logs are available. A swarm of specialized agents generates test cases covering business logic, data-driven scenarios, and edge cases by analyzing the source SQL. |

Baselines are stored locally and uploaded to Snowflake so they can be used for two-sided validation (source output vs. Snowflake output) during the migrate-objects loop.

#### rule-engine

Manages reusable migration rules stored in Snowflake. Rules encode known source-to-Snowflake fix patterns and are shared across all objects in the project. Each rule can operate in two modes:

| Mode | How it works |
| --- | --- |
| **Regex** | A regex find-and-replace applied mechanically to SQL files |
| **AI-guided** | The rule provides context and strategy; the AI interprets and applies it |

The rule engine has four sub-capabilities:

| Sub-skill | What it does |
| --- | --- |
| **search** | Scans a SQL file against all rules using regex pattern matching and Cortex semantic search. Returns matched rules ranked by relevance. |
| **apply** | Applies matched rules to local SQL files. Regex rules are applied automatically; AI-mode rules are shown for review before applying. Supports single-file and batch application. |
| **extract** | Creates a new reusable rule from a fix you just made. Works from an interactive before/after comparison or retroactively from git history. |
| **propagate** | Given a rule, finds every code unit in the project it applies to (via reverse regex + semantic search), then hands off to batch apply. |

Rules accumulate over the lifetime of the project. Every time the agent fixes an object and extracts a rule, that rule becomes available to all subsequent objects — reducing manual effort as the migration progresses.

---

## What You Can Ask

You do not need to follow the prescribed path. You can ask for any capability at any time.

### Status and navigation

| Prompt | What happens |
| --- | --- |
| `"What is the current state?"` | Shows the progress checklist |
| `"What should I work on next?"` | Returns the next dependency-ready object |
| `"Continue"` | Picks up the prescribed migration path |

### Setup

```text
"Connect to my SQL Server database"
"Extract objects from the source"
"Import SQL files from ./my-scripts/"
"Convert my source code"
```

### Assessment

```text
"Run a full assessment"
"Generate deployment waves"
"I want a maximum of 30 objects per wave"
"Prioritize all Payroll objects in Wave 1"
"Identify temporary and staging objects"
"Analyze dynamic SQL patterns"
"Assess my SSIS packages"
```

### Migration

```text
"Deploy tables"
"Migrate data"
"Deploy and test the next function"
"Capture baselines for dbo.GetCustomerOrders"
```

### Rule engine

```text
"Search rules for this file"
"Apply all matched rules"
"Extract a rule from my last fix"
"Propagate this rule across the project"
"Show me all rules"
```

---

## Example Workload

You can use [AdventureWorksDW](https://learn.microsoft.com/en-us/sql/samples/adventureworks-install-configure) as an example source database to try the plugin end-to-end. Substitute any SQL Server or Redshift database you have access to — the plugin adapts to whatever source you connect.

---

## Troubleshooting

| Problem | Resolution |
| --- | --- |
| **Plugin fails on startup** | Check `migration-plugin/logs/install-dependencies.log`. The install hook requires Homebrew (macOS/Linux) or winget (Windows). |
| **`scai` not found** | Run the install hook manually: `migration-plugin/hooks/install-dependencies`. Or install directly: `brew install --cask snowflakedb/snowconvert-ai/snowconvert-ai`. |
| **Snowflake connection errors** | Verify your connection in `~/.snowflake/connections.toml` and confirm the connection name matches what you provided during setup. |
| **Agent seems lost** | Say `"What is the current state?"` — the agent re-reads project status and resets context. |

---

## Support

For help with the migration plugin, contact: [**snowconvert-support@snowflake.com**](mailto:snowconvert-support%40snowflake.com)

---
title: Snowflake Migration Tools
source: https://docs.snowflake.com/en/migrations/README.md
section: Migrations
---

# Snowflake Migration Tools

Snowflake offers various migration tools to help organizations modernize their data platforms:

## [SnowConvert AI](snowconvert-docs/overview.md)

This desktop tool converts SQL code from legacy platforms to Snowflake SQL
Supports migrations from:

* Teradata
* Oracle
* SQL Server
* Redshift
* Azure Synapse
* Sybase IQ
* Spark SQL
* Databricks SQL
* BigQuery
* PostgreSQL
* Greenplum
* Netezza
* Vertica
* Hive

Enables automated SQL translation to accelerate database migrations

## [SnowConvert AI CLI](snowconvert-docs/general/user-guide/snowconvert/command-line-interface/README.md)

All of the desktop functionality, in a CLI tool that enables scripting and enables AI agents.

## [Snowconvert Migration Plugin](snowconvert-docs/general/user-guide/snowconvert/migration-skill/skill.md)

Downloadable skill to be used to manage migration projects for Cortext Code.

## [Snowpark Migration Accelerator (SMA)](sma-docs/README.md)

Converts code from Spark to Snowpark API
Supports:

* Python
* Scala
* Java

Helps modernize data processing applications to run natively in Snowflake

### Both tools offer:

* Easy-to-use interfaces
* Automated code conversion
* Accelerated migration timelines
* Path to leverage Snowflake’s modern data platform capabilities
* For more information:

SnowConvert AI: Email [snowconvert-info@snowflake.com](mailto:snowconvert-info%40snowflake.com)

SMA: Email [sma-info@snowflake.com](mailto:sma-info%40snowflake.com)

---
title: Snowflake SnowConvert AI Documentation
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/overview.md
section: Migrations
---

# Snowflake SnowConvert AI Documentation

Traditional data platforms and big data solutions often fail to meet their main goal: allowing users to work with data without restrictions on size, speed, or adaptability. Snowflake’s modern cloud-based data warehouse offers benefits for all data professionals, including analysts, scientists, engineers, and business users. Here are the key advantages of moving to Snowflake, the leading cloud data platform:

* *Multi-cluster and shared data*: enables fast and efficient processing of large data volumes using multiple clusters simultaneously.
* *Micro-Partitioning*: provides secure and efficient data storage with automatic organization.
* *Delivered as a service*: eliminates the need for manual administration and management of data infrastructure.
* *Data Platform built for any cloud*: separates storage from compute, allowing multiple compute clusters to work on the same data simultaneously.
* *Better Performance and throughput*: delivers faster data processing compared to traditional data platforms.
* *Support for all data*: processes both structured data and semi-structured formats (JSON, Avro, XML, Parquet) in a single platform.

*SnowConvert AI* is a user-friendly tool that helps you modernize your traditional data platform by migrating it to Snowflake’s Data Warehousing Architecture. *SnowConvert AI* analyzes your existing SQL code (from platforms like Oracle, SQL Server, and Teradata) and converts it to [**Snowflake SQL**](../../sql-reference-commands.md). After conversion, you can immediately take advantage of all Snowflake’s features and capabilities.

Currently, *SnowConvert AI* supports converting code from these source platforms:

* **Teradata**
* **Oracle**
* **SQL Server**
* **Sybase IQ**
* **Redshift**
* **Azure Synapse**
* **Spark SQL**
* **Databricks SQL**
* **BigQuery**
* **PostgreSQL & Based Languages**
* **Vertica**
* **Hive**
* **IBM DB2**

Looking to migrate to a new platform? [Please contact us](general/contact-us.md) to discuss your needs.

## Additional Resources

For more information, see our [Getting Started Guide](general/getting-started/README.md)

---
title: Snowpark Migration Accelerator Documentation
source: https://docs.snowflake.com/en/migrations/sma-docs/README.md
section: Migrations
---

# Snowpark Migration Accelerator Documentation

Traditional data platforms and big data solutions often fail to achieve their main goal: allowing users to work with data without restrictions on size, speed, or adaptability. Snowflake’s Data Cloud offers a complete solution that benefits everyone working with data, including:

* Data analysts
* Data scientists
* Data engineers
* Business professionals
* Technology professionals

Moving to Snowflake, the leading cloud-based data platform, can provide significant advantages.

* *Multi-Cluster and Shared Data*: Processes large amounts of data quickly and efficiently by using multiple clusters that can access the same data.
* *Micro-Partitioning*: Stores customer data in small, manageable chunks for better security and efficiency.
* *Delivered as a Service*: No need to manage or maintain the platform yourself - everything is handled for you.
* *Data Platform Built for Any Cloud*: Multiple teams can work with the same data simultaneously because storage and computing resources are separate.
* *Improved Performance and Throughput*: Works faster than traditional data processing methods.
* *Support for All Data*: Can work with both regular structured data and semi-structured data like JSON, Avro, XML, and Parquet files.
* *Build in Your Language of Choice*: Write code in Python, Scala, or Java using Snowpark’s secure libraries and runtime environments.

Snowflake’s Snowpark Migration Accelerator (SMA) is a user-friendly tool that helps you modernize your traditional data platform by moving it to the Snowflake Data Cloud. SMA scans your Python and Scala source code that contains Spark API calls, analyzes it, and creates an inventory. It then converts this Spark code into equivalent code that uses the [Snowpark API](https://docs.snowflake.com/en/developer-guide/snowpark/index). After the conversion is complete, you can start taking advantage of Snowflake’s features.

Please note: The Snowpark Migration Accelerator (SMA) is not an official Snowflake Inc. product and is not included in the Snowflake Service. SMA is provided as-is under its own terms. Snowflake’s support team does not provide assistance for SMA, and it is not covered by Snowflake’s standard support and service level agreements. For more information, please contact [sma-info@snowflake.com](mailto:sma-info%40snowflake.com).

---
title: Snowpark Migration Accelerator:  Additional Parameters
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/using-the-sma-cli/additional-parameters.md
section: Migrations
---

# Snowpark Migration Accelerator: Additional Parameters

The Snowpark Migration Accelerator (SMA) provides optional parameters that help you customize how your source code is assessed or converted. Here’s a detailed explanation of all additional parameters available in SMA:

## `--mapDirectory, -m <PATH>`

The path to the folder containing custom mapping files. Custom mapping functionality is not currently available in the Snowpark Migration Accelerator (SMA), but documentation will be provided once this feature is implemented.

### `--sqlDirectory, -f <PATH>`

The folder path where custom SQL extraction files are stored.

> **Danger:**
>
> This additional parameter is no longer supported and should not be used.

---
title: Snowpark Migration Accelerator:  Approach
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/approach.md
section: Migrations
---

# Snowpark Migration Accelerator: Approach

The Snowpark Migration Accelerator (SMA) helps you migrate code by identifying and reporting potential issues during the conversion process. It serves two main purposes: accelerating code migration and troubleshooting conversion problems. While SMA automatically converts compatible code, it also provides detailed information about any code segments it cannot convert.

## What is an issue?

An “issue” in the Snowpark Migration Accelerator (SMA) typically refers to either:

* A failure or crash during tool execution
* Problems that prevent the tool from completing its operation successfully

### Problem Executing the Tool

The first category of problems occurs when the tool encounters a failure or stops working.

### Indicator on the Issue Output

The tool identifies potential problems during code conversion. These problems are organized into three categories to help users successfully complete their migration process.

Further information about these issues will be provided soon.

---
title: Snowpark Migration Accelerator:  Assessment Output - In Application
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/interpreting-the-assessment-output/assessment-output-in-application.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment Output - In Application

When the Snowpark Migration Accelerator (SMA) finishes analyzing your code, it generates assessment artifacts and automatically takes you to the Assessment Results page.

For more information on how to interpret the assessment output, refer to the [Understanding the Assessment Summary](../../../user-guide/assessment/understanding-the-assessment-summary.md) section.

---
title: Snowpark Migration Accelerator:  Assessment zip file
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/assessment-zip-file.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment zip file

The assessment results are stored in a zip file named “AssessmentFiles.zip” located in the output directory.

If you run SMA without an internet connection, you can email the zip file to [sma-info@snowflake.com](mailto:sma-info%40snowflake.com). We will process the file and simulate an offline execution for you.

---
title: Snowpark Migration Accelerator:  Code Extraction
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/before-using-the-sma/code-extraction.md
section: Migrations
---

# Snowpark Migration Accelerator: Code Extraction

The Snowpark Migration Accelerator (SMA) processes all files within a specified directory. While it creates an inventory of every file, it specifically analyzes files with certain extensions to identify Spark API references.

There are several ways to add files to this directory.

Place all your relevant code files into a single directory before proceeding with the migration.

To extract notebooks from your existing environment (such as Databricks), you can use an extraction script to help with the migration process.

## Extraction Scripts

Snowflake provides publicly available extraction scripts that you can find on the [Snowflake Labs GitHub page](https://github.com/Snowflake-Labs/SC.DDLExportScripts/tree/main). For Spark migrations, these scripts support various platforms.

### Databricks

For Jupyter (.ipynb) or Databricks (.dbc) notebooks that run in Databricks, you can directly place them in a directory for SMA analysis without any extraction. To learn how to export your Databricks notebook files, visit the Databricks documentation here: <https://docs.databricks.com/en/notebooks/notebook-export-import.html#export-notebooks>.

For an alternative approach, you can follow the instructions and use the scripts available in the Databricks folder of the SC.DDLExportScripts repository:
<https://github.com/Snowflake-Labs/SC.DDLExportScripts/tree/main/Databricks>

Additional information about data extraction will be provided soon.

---
title: Snowpark Migration Accelerator:  Contact Us
source: https://docs.snowflake.com/en/migrations/sma-docs/support/contact-us.md
section: Migrations
---

# Snowpark Migration Accelerator: Contact Us

The Snowpark Migration Accelerator (previously known as SnowConvert for Spark) [is now integrated into Snowflake’s product suite](https://investors.snowflake.com/news/news-details/2023/Snowflake-Announces-Intent-to-Acquire-Mobilize.Nets-SnowConvert-to-Accelerate-Legacy-Migrations-to-the-Data-Cloud/default.aspx).

For additional information about this accelerator, please contact us at:

* For additional information, contact: [sma-info@snowflake.com](mailto:sma-info%40snowflake.com)
* For technical support, contact: [sma-support@snowflake.com](mailto:sma-support%40snowflake.com)

---
title: Snowpark Migration Accelerator:  Conversion Setup
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/conversion-setup.md
section: Migrations
---

# Snowpark Migration Accelerator: Conversion Setup

When you first launch the Snowpark Migration Accelerator (SMA), you need to either create a new project or open an existing one. Each project can store multiple SMA executions for both Assessment and Conversion phases. After completing the Assessment phase, you will need to configure your project for the Conversion phase.

## Conversion Setup Page

During the conversion process, you have several configuration options available, although most default settings should work well for most cases.

On the **Conversion settings** page, choose whether to run the conversion using **Default Settings** or to select **Customize settings** to configure advanced options.

If you select **Customize settings**, SMA opens a **Conversion settings** dialog where you can review and update settings and then click **Save settings**.

### Conversion Settings

With the following settings from the user interface, you can more finely control how the SMA performs conversion.

* **Pandas**

  **Convert Pandas API to Snowpark API** - Specifies to automatically convert Pandas code to the Snowpark equivalent Pandas API
  (Snowpark Pandas). When enabled, the tool transforms any Pandas operations it finds in your code into their Snowpark counterparts.
* **DBX**

  **Convert DBX notebooks to Snowflake notebooks** - Specifies to convert the .dbc into Jupyter files in a new folder with the .dbc name.

  > **Note:**
  >
  > When exporting notebooks, consider exporting them as Databricks, rather than Jupyter. When Jupyter files contain different sources than Python, SMA behavior may be unexpected.
* **Checkpoints**

  + **Identify and collect checkpoints** - Activates the feature.
  + **Collect checkpoints as active** - Specifies to execute the collected checkpoint in VS Code when running the workload.
  + **Collect user-defined functions returning data frame type** - Specifies to validate that dataframes should be collected if the user has their own functions that return DataFrames.
  + **Mode** - Specifies the mode type to validate (Schema or DataFrame).
  + **Sample** - Specifies the sampling percentage of each DataFrame to validate.
  + **Relevant PySpark functions to collect** - Specifies the PySpark packages to collect (by default, all of them are checked). You can also add more packages by adding the package’s full name.

## Setup Complete

Once your setup is complete, click the **Continue** button. This action will initiate the SMA Conversion processes. A progress screen will display the current status of your conversion.

After the conversion finishes, SMA automatically displays the Conversion Results screen.

---
title: Snowpark Migration Accelerator:  Conversion Software Terms of Use
source: https://docs.snowflake.com/en/migrations/sma-docs/general/conversion-software-terms-of-use/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Conversion Software Terms of Use

For the most current and authoritative version of the Conversion Software Terms of Use, please visit the official Snowflake legal site:

[Conversion Software Terms of Use](https://www.snowflake.com/en/legal/technical-services-and-education/conversion-software-terms/)

---
title: Snowpark Migration Accelerator:  Conversion Walkthrough
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/conversion-walkthrough.md
section: Migrations
---

# Snowpark Migration Accelerator: Conversion Walkthrough

The Snowpark Migration Accelerator (SMA) can be run in conversion mode to generate output code that is compatible with Snowflake. This lab will walk you through executing a conversion and help you better troubleshoot through the output.

The Conversion Walkthrough has been absorbed by [the Migration Lab in the next section](migration-lab/README.md).

---
title: Snowpark Migration Accelerator:  Create Table
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-ddl/create-table/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Create Table

Let’s examine how to create a table. The CREATE TABLE syntax in Spark closely resembles the syntax used in Snowflake.

Here are the key areas you should focus on:

* [USING](using.md)

Let’s examine each of these components in detail.

---
title: Snowpark Migration Accelerator:  DBC files explode
source: https://docs.snowflake.com/en/migrations/sma-docs/support/frequently-asked-questions-faq/dbc-files-explode.md
section: Migrations
---

# Snowpark Migration Accelerator: DBC files explode

Before migrating Databricks workloads, you need to complete two steps:

1. Extract the source code from your .dbc files using the explode process
2. Use SnowConvert AI to migrate the extracted source code

To run the explode process, you need Python installed on your computer. We recommend using [Python 3.7](https://www.python.org/downloads/release/python-370/).

## Run explode script

Run [dbcexplode.py](https://repo.bds.mobilize.net/snowflake/qualification-service-desk/-/blob/main/dbcexplode.py) and provide the path to your .dbc file as a command-line argument.

```bash
python dbcexplode.py <dbc_file_path>
```

The script creates a folder in the same directory as the **dbcexplode.py** script. The new folder’s name will be your DBC file’s name followed by **.dbc-exploded**.

This folder will contain a separate folder for each notebook found in the .dbc file. In this example, the .dbc file contains a single notebook named **SanFranciscoFireCallsAnalysis (1).python**.

Inside this folder, you will find separate files for each command from the processed notebook. Each file follows the naming pattern **<notebook_name>-<sequence_number>**. The **<sequence_number>** represents the order in which the commands appear in the notebook. For example, **SanFranciscoFireCallsAnalysis (1)-001.md** represents the first command found in the notebook.

Note: If a notebook code cell contains a magic string, the script will generate a file with a .magic extension.

---
title: Snowpark Migration Accelerator:  Deploying the Output Code
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/deploying-the-output-code.md
section: Migrations
---

# Snowpark Migration Accelerator: Deploying the Output Code

To run the output code generated by the Snowpark Migration Accelerator (SMA), follow these environment-specific recommendations based on your source platform.

## Spark Scala

Before executing your migrated Apache Spark code in Snowpark, please review these important considerations:

### Add snowpark and snowpark extensions library reference

The migrated project must include references to both the Snowpark library and its extensions.

### Snowpark Extensions

Snowpark Extensions is a library that adds Apache Spark features to the standard Snowpark library. These features are not currently available in Snowpark. This library helps developers migrate their projects from Apache Spark to Snowpark more easily.

Follow these steps to reference Snowpark and Snowpark Extensions libraries in your migrated code:

1. Add the Snowpark library reference to your project
2. Add the Snowpark Extensions library reference to your project
3. Update your code to use these libraries

### Step 1 - Add snowpark and snowpark extensions library references to the project configuration file

The tool automatically adds these dependencies to your project configuration file. After the dependencies are added, your build tool will handle resolving them.

Based on the file extension of your project configuration file, the tool automatically adds the appropriate references in the following way:

#### build.gradle

```groovy
dependencies {
    implementation 'com.snowflake:snowpark:1.6.2'
    implementation 'net.mobilize.snowpark-extensions:snowparkextensions:0.0.9'
    ...
}
```

#### build.sbt

```scala
...
libraryDependencies += "com.snowflake" % "snowpark" % "1.6.2"
libraryDependencies += "net.mobilize.snowpark-extensions" % "snowparkextensions" % "0.0.9"
...
```

#### pom.xml

```xml
<dependencies>
    <dependency>
        <groupId>com.snowflake</groupId>
        <artifactId>snowpark</artifactId>
        <version>1.6.2</version>
    </dependency>
    <dependency>
        <groupId>net.mobilize.snowpark-extensions</groupId>
        <artifactId>snowparkextensions</artifactId>
        <version>0.0.9</version>
    </dependency>
    ...
</dependencies>
```

### Step 2 - Add snowpark extensions library import statements

The tool automatically adds these two import statements to every generated .scala file.

```scala
import com.snowflake.snowpark_extensions.Extensions._
import com.snowflake.snowpark_extensions.Extensions.functions._
```

### Code example

The code below uses **hex** and **isin** functions, which are native to Spark but not to Snowpark. However, the code will still execute successfully because these functions are provided through Snowpark extensions.

#### Input code

```scala
package com.mobilize.spark

import org.apache.spark.sql._

object Main {

   def main(args: Array[String]) : Unit = {

      var languageArray = Array("Java");

      var languageHex = hex(col("language"));

      col("language").isin(languageArray:_*);
   }

}
```

#### Output code

```scala
package com.mobilize.spark

import com.snowflake.snowpark._
import com.snowflake.snowpark_extensions.Extensions._
import com.snowflake.snowpark_extensions.Extensions.functions._

object Main {

   def main(args: Array[String]) : Unit = {

      var languageArray = Array("Java");

      // hex does not exist on Snowpark. It is a extension.
      var languageHex = hex(col("language"));

      // isin does not exist on Snowpark. It is a extension.
      col("language").isin(languageArray :_*)

   }

}
```

## PySpark

Before running your migrated PySpark code in Snowpark, please review these important considerations:

### Install snowpark and snowpark extensions libraries

The migrated project must include references to both the Snowpark library and its extensions.

### Snowpark Extensions

Snowpark Extensions is a library that adds PySpark-like features to the standard Snowpark library. These features are currently not available in Snowpark. This library helps developers migrate their projects from PySpark to Snowpark more easily.

Follow these steps to reference Snowpark and Snowpark Extensions libraries in your migrated code:

1. Add Snowpark library references to your migrated code
2. Include Snowpark Extensions library references where needed

#### Step 1 - Install snowpark library

```bash
pip install snowpark-extensions
```

#### Step 2 - Install snowpark extensions library

```bash
pip install snowflake-snowpark-python
```

#### Step 3 - Add snowpark extensions library import statements

The tool automatically adds the PySpark import statement to every file that requires PySpark functionality.

```python
import snowpark_extensions
```

### Code example

The `create_map` function is not available in PySpark but is supported in Snowpark through its extensions. This means your code will work correctly in Snowpark without any modifications.

#### Input code

```python
import pyspark.sql.functions as df
df.select(create_map('name', 'age').alias("map")).collect()
```

#### Output code

```python
import snowpark_extensions
import snowflake.snowpark.functions as df
df.select(create_map('name', 'age').alias("map")).collect()
```

---
title: Snowpark Migration Accelerator:  Distinct
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/distinct.md
section: Migrations
---

# Snowpark Migration Accelerator: Distinct

## Description

Select all unique rows from the referenced tables. ([Databricks SQL Language Reference SELECT](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select.html))

`DISTINCT` removes duplicate rows from your query results. ([Snowflake SQL Language Reference SELECT](https://docs.snowflake.com/en/sql-reference/sql/select#parameters))

### Syntax

```bnf
SELECT [ DISTINCT ] { named_expression | star_clause } [, ...]
  FROM table_reference
```

```bnf
SELECT [ DISTINCT ]
       {
         [{<object_name>|<alias>}.]<col_name>
         | [{<object_name>|<alias>}.]$<col_position>
         | <expr>
       }
       [ [ AS ] <col_alias> ]
       [ , ... ]
[ ... ]
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
CREATE TEMPORARY VIEW number1(c) AS VALUES (3), (1), (2), (2), (3), (4);
```

#### Snowflake

```sql
CREATE TEMPORARY TABLE number1(c int);
INSERT INTO number1 VALUES (3), (1), (2), (2), (3), (4);
```

### Pattern code

#### Databricks

```sql
SELECT DISTINCT c FROM number1;
```

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 4 |

#### Snowflake

```sql
SELECT DISTINCT c FROM number1;
```

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 4 |

### Known Issues

No issues were found

### Related EWIs

No related EWIs

---
title: Snowpark Migration Accelerator:  Frequently Asked Questions (FAQ)
source: https://docs.snowflake.com/en/migrations/sma-docs/support/frequently-asked-questions-faq/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Frequently Asked Questions (FAQ)

Looking for help with Snowpark Migration Accelerator for Python? Here are the most common questions and answers about SMA for PySpark to help you get started.

* [Using SMA with Jupyter Notebooks](using-sma-with-jupyter-notebooks.md)
* [How to share results with Snowflake](sharing-the-output-with-snowflake.md)
* [Working with DBC file extraction](dbc-files-explode.md)

---
title: Snowpark Migration Accelerator:  General
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/general.md
section: Migrations
---

# Snowpark Migration Accelerator: General

Issue codes can be either platform-specific or generic. Generic issue codes, which apply to multiple source platforms, are listed below.

If no issue codes appear below, there are currently no generic issue codes to display.

---
title: Snowpark Migration Accelerator:  General Troubleshooting
source: https://docs.snowflake.com/en/migrations/sma-docs/support/general-troubleshooting/README.md
section: Migrations
---

# Snowpark Migration Accelerator: General Troubleshooting

Having trouble with SMA? This section will help you troubleshoot common issues and find solutions.

Here are the most common issues and their solutions:

* [How to grant SMA access to the configuration folder](how-do-i-give-sma-permission-to-the-config-folder.md)
* [How to grant SMA access to documents, desktop, and downloads folders](how-do-i-give-sma-permission-to-documents-desktop-and-downloads-folders.md)
* [How to make sure that .config is a folder instead of a file](how-do-I-make-sure-that-config-is-folder.md)

If you need help understanding specific issue codes generated by the Snowpark Migration Accelerator (SMA), please refer to the [Issue Analysis](../../issue-analysis/approach.md) section of this documentation.

---
title: Snowpark Migration Accelerator:  Generic Inventories
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/generic-inventories.md
section: Migrations
---

# Snowpark Migration Accelerator: Generic Inventories

When the Snowpark Migration Accelerator (SMA) analyzes your code, it performs two types of scans:

1. A language-specific scan that analyzes code in your source programming language
2. A general-purpose scan that collects basic information about files and keywords in your codebase

You can find details about the language-specific scan results in the [SMA Inventories](sma-inventories.md) section. This page describes the information collected by the general-purpose scan.

Although some files have a .pam extension, they are actually comma-separated files similar to .csv files. You may notice duplicate entries across these files because the data has been organized in different ways to facilitate various types of analysis.

## File Summary

The files.pam contains an inventory that lists all files processed during a tool execution. For each file, it records the file type and size. This file contains the same information as the [files.csv described in the SMA Inventories section](sma-inventories.md).

## Generic File Inventory

The **FilesInventory.csv** file contains categorization details and line counts for each source file.

* Filename: The complete path and name of the file from the root input directory
* Extension: The file type extension (e.g., .java, .py, .sql)
* Technology: The programming language or technology identified based on the file extension
* Status: Always shows “OK” for identified files (unidentified files are not listed)
* isBinary: Indicates if the file is binary (TRUE), text (FALSE), or unrecognized (UNKNOWN)
* Bytes: File size in bytes
* ContentType: Categorizes each line as either:

  + Code: Programming instructions
  + Comment: Documentation or notes
  + Blank: Empty lines
  + Other: Unrecognized content
* ContentLines: Total number of code lines in the file
* CommentLines: Total number of comment lines in the file
* BlankLines: Total number of empty lines in the file

## Keyword Counts

The **KeywordCounts.csv** file provides a comprehensive list of all keywords detected in each file, organized by technology type. This analysis includes keywords from any programming language that our generic scanner can process, not just the source languages officially supported by the Snowpark Migration Accelerator (SMA).

* FileId: The file path where the keyword was located
* Technology: The original technology used in the source file
* Keyword: The specific keyword found (examples: from, import, DataFrame)
* Count: The number of occurrences of the keyword in each line

## Lines Inventory

The **line_counts.pam** file analyzes each line in a scanned file and categorizes them as code, comments, or blank lines. It also provides a total count for each category.

* FileId: The name of the file being analyzed
* LineKind: The category of each line in the file (can be code, comment, or blank)
* Count: Total number of lines for each combination of FileId and LineKind

## Tool Execution Inventory

The tool_execution.pam file contains essential information about the current SMA tool execution. This file is identical to the [tool_execution.csv file described in the SMA Inventories section](sma-inventories.md) of this documentation.

## Word Counts

The **word_counts.pam** file displays how many times each keyword appears across all files in the scanned codebase.

* FileId: The file location and relative path where the keyword was found
* Keyword: The specific text identified as a keyword (examples: from, import, DataFrame)
* Count: The number of occurrences of the keyword in a single line of code

---
title: Snowpark Migration Accelerator:  Glossary
source: https://docs.snowflake.com/en/migrations/sma-docs/support/glossary.md
section: Migrations
---

# Snowpark Migration Accelerator: Glossary

The Snowpark Migration Accelerator (SMA) uses some technical terms that might be unfamiliar. Refer to the glossary page to learn more about these terms.

## Snowpark Migration Accelerator (SMA)

This software documentation explains how to automatically convert Spark API code written in Scala or Python to equivalent Snowflake Snowpark code. The conversion process is secure and maintains the functionality of your original code.

The Snowpark Migration Accelerator (SMA) was previously known as SnowConvert and SnowConvert for Spark. SnowConvert (SC) continues to be available as a tool for SQL conversions.

## Readiness Score

The Readiness Score helps you understand how ready your code is for migration to Snowpark. It calculates the percentage of Spark API references that can be converted to Snowpark API. For example, if 3413 out of 3748 Spark API references can be converted, the readiness score would be 91%.

However, it’s important to note that this score:

* Only considers Spark API references
* Does not evaluate third-party libraries
* Should be used as an initial assessment, not the final decision factor

While a higher score indicates better compatibility with Snowpark, you should also evaluate other factors, such as third-party library dependencies, before proceeding with the migration.

## Spark Reference Categories

The Snowpark Migration Accelerator (SMA) classifies Spark components according to how they map to Snowpark functionality. For each Spark reference, SMA provides:

* A categorization of how it translates to Snowpark
* A detailed description
* Example code
* Information about automatic conversion capability
* Details about Snowpark support

You can find the complete reference guide [on this page](../user-guide/scos-conversion/spark-reference-categories.md).

## SnowConvert Qualification Tool

SnowConvert for Spark’s assessment mode analyzes your codebase to automatically detect and identify all instances of Apache Spark Python code.

## File Inventory

A complete list of all files found in the tool’s input directory, regardless of file type. The inventory provides a detailed breakdown organized by file type, including:

* The original technology or platform
* Number of lines of code
* Number of comment lines
* File sizes of the source files

## Keyword Counts

A summary of keyword occurrences organized by technology type. For example, when analyzing a .py file containing PySpark code, the system tracks and counts each PySpark keyword. The report shows the total number of keywords found for each file extension.

## Spark Reference Inventory

After analyzing your code, you will receive a comprehensive list of all Spark API references found in your Python code.

## Readiness Score

The Spark code references will help determine how much of your codebase can be automatically converted.

## Conversion Score

The conversion score is calculated by dividing the number of automatically converted Spark operations by the total number of Spark references detected in the code.

## Conversion/Transformation Rule

Rules that define how SnowConvert transforms source code into the desired target code format.

## Parse

The parsing phase is the first step where SnowConvert analyzes the source code and creates an internal data structure. This structure is then used to apply conversion rules during the migration process.

---
title: Snowpark Migration Accelerator:  Group By
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/group-by.md
section: Migrations
---

# Snowpark Migration Accelerator: Group By

## Description

The `GROUP BY` clause groups rows based on specified expressions and calculates aggregate functions for each group. Databricks SQL provides advanced grouping options through `GROUPING SETS`, `CUBE`, and `ROLLUP` clauses, which allow multiple aggregations on the same dataset. You can combine regular grouping expressions with these advanced options in the `GROUP BY` clause, and nest them within `GROUPING SETS`. ([Databricks SQL Language Reference GROUP BY](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-groupby.html))

Groups rows that share the same values in specified columns and calculates aggregate functions (such as SUM, COUNT, or AVG) for each group. The GROUP BY clause can include:

* The name of a column
* A number that refers to a position in the [SELECT](https://docs.snowflake.com/en/sql-reference/sql/select) list
* Any valid expression

Extensions:

[GROUP BY CUBE](https://docs.snowflake.com/en/sql-reference/constructs/group-by-cube), [GROUP BY GROUPING SETS](https://docs.snowflake.com/en/sql-reference/constructs/group-by-grouping-sets), and [GROUP BY ROLLUP](https://docs.snowflake.com/en/sql-reference/constructs/group-by-rollup)

[Snowflake SQL Language Reference GROUP BY](https://docs.snowflake.com/en/sql-reference/constructs/group-by)

### Syntax

```text
GROUP BY ALL

GROUP BY group_expression [, ...] [ WITH ROLLUP | WITH CUBE ]

GROUP BY { group_expression | { ROLLUP | CUBE | GROUPING SETS } ( grouping_set [, ...] ) } [, ...]

grouping_set
   { expression |
     ( [ expression [, ...] ] ) }
```

```text
SELECT ...
  FROM ...
  [ ... ]
  GROUP BY groupItem [ , groupItem [ , ... ] ]
  [ ... ]

SELECT ...
  FROM ...
  [ ... ]
  GROUP BY ALL
  [ ... ]
groupItem ::= { <column_alias> | <position> | <expr> }

SELECT ...
FROM ...
[ ... ]
GROUP BY CUBE ( groupCube [ , groupCube [ , ... ] ] )
[ ... ]

groupCube ::= { <column_alias> | <position> | <expr> }

SELECT ...
FROM ...
[ ... ]
GROUP BY GROUPING SETS ( groupSet [ , groupSet [ , ... ] ] )
[ ... ]

groupSet ::= { <column_alias> | <position> | <expr> }

SELECT ...
FROM ...
[ ... ]
GROUP BY ROLLUP ( groupRollup [ , groupRollup [ , ... ] ] )
[ ... ]

groupRollup ::= { <column_alias> | <position> | <expr> }
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
CREATE TEMP VIEW dealer (id, city, car_model, quantity) AS
VALUES (100, 'Fremont', 'Honda Civic', 10),
       (100, 'Fremont', 'Honda Accord', 15),
       (100, 'Fremont', 'Honda CRV', 7),
       (200, 'Dublin', 'Honda Civic', 20),
       (200, 'Dublin', 'Honda Accord', 10),
       (200, 'Dublin', 'Honda CRV', 3),
       (300, 'San Jose', 'Honda Civic', 5),
       (300, 'San Jose', 'Honda Accord', 8);
```

#### Snowflake

```sql
CREATE TEMP TABLE dealer (id INT, city STRING, car_model STRING, quantity INT);
INSERT INTO dealer VALUES
        (100, 'Fremont', 'Honda Civic', 10),
        (100, 'Fremont', 'Honda Accord', 15),
        (100, 'Fremont', 'Honda CRV', 7),
        (200, 'Dublin', 'Honda Civic', 20),
        (200, 'Dublin', 'Honda Accord', 10),
        (200, 'Dublin', 'Honda CRV', 3),
        (300, 'San Jose', 'Honda Civic', 5),
        (300, 'San Jose', 'Honda Accord', 8);
```

### Pattern code

#### Databricks

```sql
-- 1. Sum of quantity per dealership. Group by `id`.
SELECT id, sum(quantity) FROM dealer GROUP BY id ORDER BY id;

-- 2. Use column position in GROUP by clause.
SELECT id, sum(quantity) FROM dealer GROUP BY 1 ORDER BY 1;

-- 3. Multiple aggregations.
-- 3.1. Sum of quantity per dealership.
-- 3.2. Max quantity per dealership.
SELECT id, sum(quantity) AS sum, max(quantity) AS max
    FROM dealer GROUP BY id ORDER BY id;

-- 4. Count the number of distinct dealers in cities per car_model.
SELECT car_model, count(DISTINCT city) AS count FROM dealer GROUP BY car_model;

-- 5. Count the number of distinct dealers in cities per car_model, using GROUP BY ALL
SELECT car_model, count(DISTINCT city) AS count FROM dealer GROUP BY ALL;

-- 6. Sum of only 'Honda Civic' and 'Honda CRV' quantities per dealership.
SELECT id,
         sum(quantity) FILTER (WHERE car_model IN ('Honda Civic', 'Honda CRV')) AS `sum(quantity)`
    FROM dealer
    GROUP BY id ORDER BY id;

-- 7. Aggregations using multiple sets of grouping columns in a single statement.
-- Following performs aggregations based on four sets of grouping columns.
-- 7.1. city, car_model
-- 7.2. city
-- 7.3. car_model
-- 7.4. Empty grouping set. Returns quantities for all city and car models.
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), ())
    ORDER BY city;

-- 8.Group by processing with `ROLLUP` clause.
-- Equivalent GROUP BY GROUPING SETS ((city, car_model), (city), ())
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY city, car_model WITH ROLLUP
    ORDER BY city, car_model;

-- 9. Group by processing with `CUBE` clause.
-- Equivalent GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), ())
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY city, car_model WITH CUBE
    ORDER BY city, car_model;
```

| id | sum(quantity) |
| --- | --- |
| 100 | 32 |
| 200 | 33 |
| 300 | 13 |

| id | sum(quantity) |
| --- | --- |
| 100 | 32 |
| 200 | 33 |
| 300 | 13 |

| id | sum | max |
| --- | --- | --- |
| 100 | 32 | 15 |
| 200 | 33 | 20 |
| 300 | 13 | 8 |

| car_model | count |
| --- | --- |
| Honda Civic | 3 |
| Honda CRV | 2 |
| Honda Accord | 3 |

| car_model | count |
| --- | --- |
| Honda Civic | 3 |
| Honda CRV | 2 |
| Honda Accord | 3 |

| id | sum(quantity) |
| --- | --- |
| 100 | 17 |
| 200 | 23 |
| 300 | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | Honda Civic | 35 |
| *NULL* | Honda Accord | 33 |
| *NULL* | *NULL* | 78 |
| *NULL* | Honda CRV | 10 |
| Dublin | Honda Civic | 20 |
| Dublin | *NULL* | 33 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Accord | 10 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda Civic | 10 |
| Fremont | *NULL* | 32 |
| Fremont | Honda CRV | 7 |
| San Jose | Honda Accord | 8 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Civic | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | *NULL* | 78 |
| Dublin | *NULL* | 33 |
| Dublin | Honda Accord | 10 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Civic | 20 |
| Fremont | *NULL* | 32 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda CRV | 7 |
| Fremont | Honda Civic | 10 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Accord | 8 |
| San Jose | Honda Civic | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | *NULL* | 78 |
| *NULL* | Honda Accord | 33 |
| *NULL* | Honda CRV | 10 |
| *NULL* | Honda Civic | 35 |
| Dublin | *NULL* | 33 |
| Dublin | Honda Accord | 10 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Civic | 20 |
| Fremont | *NULL* | 32 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda CRV | 7 |
| Fremont | Honda Civic | 10 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Accord | 8 |
| San Jose | Honda Civic | 5 |

#### Snowflake

```sql
-- 1. Sum of quantity per dealership. Group by `id`.
SELECT id, sum(quantity) FROM dealer GROUP BY id ORDER BY id;

-- 2. Use column position in GROUP by clause.
SELECT id, sum(quantity) FROM dealer GROUP BY 1 ORDER BY 1;

-- 3. Multiple aggregations.
-- 3.1. Sum of quantity per dealership.
-- 3.2. Max quantity per dealership.
SELECT id, sum(quantity) AS sum, max(quantity) AS max
    FROM dealer GROUP BY id ORDER BY id;

-- 4. Count the number of distinct dealers in cities per car_model.
SELECT car_model, count(DISTINCT city) AS count FROM dealer GROUP BY car_model;

-- 5. Count the number of distinct dealers in cities per car_model, using GROUP BY ALL
SELECT car_model, count(DISTINCT city) AS count FROM dealer GROUP BY ALL;

-- 6. Sum of only 'Honda Civic' and 'Honda CRV' quantities per dealership.
SELECT
    id,
    SUM(CASE WHEN car_model='Honda Civic' OR car_model='Honda CRV' THEN quantity ELSE NULL END) AS `sum(quantity)`
    FROM dealer
    GROUP BY id ORDER BY id;

-- 7. Aggregations using multiple sets of grouping columns in a single statement.
-- Following performs aggregations based on four sets of grouping columns.
-- 7.1. city, car_model
-- 7.2. city
-- 7.3. car_model
-- 7.4. Empty grouping set. Returns quantities for all city and car models.
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), ())
    ORDER BY city NULLS FIRST;

-- 8. Group by processing with `ROLLUP` clause.
-- Equivalent GROUP BY GROUPING SETS ((city, car_model), (city), ())
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY ROLLUP (city, car_model)
    ORDER BY city NULLS FIRST, car_model NULLS FIRST;

-- 9. Group by processing with `CUBE` clause.
-- Equivalent GROUP BY GROUPING SETS ((city, car_model), (city), (car_model), ())
SELECT city, car_model, sum(quantity) AS sum
    FROM dealer
    GROUP BY CUBE (city, car_model)
    ORDER BY city NULLS FIRST, car_model NULLS FIRST;
```

| id | sum(quantity) |
| --- | --- |
| 100 | 32 |
| 200 | 33 |
| 300 | 13 |

| id | sum(quantity) |
| --- | --- |
| 100 | 32 |
| 200 | 33 |
| 300 | 13 |

| id | sum | max |
| --- | --- | --- |
| 100 | 32 | 15 |
| 200 | 33 | 20 |
| 300 | 13 | 8 |

| car_model | count |
| --- | --- |
| Honda Civic | 3 |
| Honda CRV | 2 |
| Honda Accord | 3 |

| car_model | count |
| --- | --- |
| Honda Civic | 3 |
| Honda CRV | 2 |
| Honda Accord | 3 |

| id | sum(quantity) |
| --- | --- |
| 100 | 17 |
| 200 | 23 |
| 300 | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | Honda Civic | 35 |
| *NULL* | Honda Accord | 33 |
| *NULL* | *NULL* | 78 |
| *NULL* | Honda CRV | 10 |
| Dublin | Honda Civic | 20 |
| Dublin | *NULL* | 33 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Accord | 10 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda Civic | 10 |
| Fremont | *NULL* | 32 |
| Fremont | Honda CRV | 7 |
| San Jose | Honda Accord | 8 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Civic | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | *NULL* | 78 |
| Dublin | *NULL* | 33 |
| Dublin | Honda Accord | 10 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Civic | 20 |
| Fremont | *NULL* | 32 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda CRV | 7 |
| Fremont | Honda Civic | 10 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Accord | 8 |
| San Jose | Honda Civic | 5 |

| city | car_model | sum |
| --- | --- | --- |
| *NULL* | *NULL* | 78 |
| *NULL* | Honda Accord | 33 |
| *NULL* | Honda CRV | 10 |
| *NULL* | Honda Civic | 35 |
| Dublin | *NULL* | 33 |
| Dublin | Honda Accord | 10 |
| Dublin | Honda CRV | 3 |
| Dublin | Honda Civic | 20 |
| Fremont | *NULL* | 32 |
| Fremont | Honda Accord | 15 |
| Fremont | Honda CRV | 7 |
| Fremont | Honda Civic | 10 |
| San Jose | *NULL* | 13 |
| San Jose | Honda Accord | 8 |
| San Jose | Honda Civic | 5 |

### Known Issues

No issues were found

### Related EWIs

No related EWIs

---
title: Snowpark Migration Accelerator:  Hive
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/sql/hive/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Hive

## Issues Codes

All of the warnings, parsing errors, and conversion exceptions generated by the SMA when Hive is selected as the Database language to migrate SQL statements will appear below. If you have any concerns or see something that is not right, please reach out to the SMA support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

| Code | Description | Category | Deprecat |
| --- | --- | --- | --- |
| `SPRKHVSQL1001` | Unrecognized token | Parsing error |  |
| `SPRKHVSQL1002` | Unsupported statement in Snowflake. | Warning |  |
| `SPRKHVSQL1003` | Unsupported SET statement in Snowflake | Warning |  |
| `SPRKHVSQL1004` | Unsupported PURGE clause in DROP TABLE | Warning |  |
| `SPRKHVSQL1005` | Unsuported TBLPROPERTIES in ALTER Statements | Conversion Error |  |

---
title: Snowpark Migration Accelerator:  HiveSQL
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/hivesql/README.md
section: Migrations
---

# Snowpark Migration Accelerator: HiveSQL

## Supported functions

### Strings

| Function | Status |
| --- | --- |
| CONCAT | SUPPORTED |
| CONCAT_WS | SUPPORTED |
| LOWER | SUPPORTED |
| LPAD | SUPPORTED |
| REVERSE | SUPPORTED |
| SUBSTR/SUBSTRING | SUPPORTED |
| TRANSLATE | SUPPORTED |
| TRIM | SUPPORTED |
| UPPER | SUPPORTED |

### Numerics

| Function | Status |
| --- | --- |
| ABS | SUPPORTED |
| AVG | SUPPORTED |
| COUNT | SUPPORTED |
| DENSE_RANK | SUPPORTED |
| MAX | SUPPORTED |
| MIN | SUPPORTED |
| RANK | SUPPORTED |
| REGEXP_REPLACE | SUPPORTED |
| ROUND | SUPPORTED |
| SUM | SUPPORTED |

### Date

| Function | Status |
| --- | --- |
| ADD_MONTHS | SUPPORTED |
| CURRENT_DATE | SUPPORTED |
| CURRENT_TIMESTAMP | SUPPORTED |
| DATE_ADD | PENDING |
| DATEDIFF | PENDING |
| DATE_FORMAT | PENDING |
| FIRST_VALUE | PENDING |
| FROM_UNIXTIME | PENDING |
| LAST_DAY | SUPPORTED |
| MONTH | SUPPORTED |
| TO_DATE | SUPPORTED |
| TO_UTC_TIMESTAMP | PENDING |
| UNIX_TIMESTAMP | PENDING |
| YEAR | SUPPORTED |

### Advanced functions

| Function | Status |
| --- | --- |
| CAST | SUPPORTED |
| COALESCE | SUPPORTED |
| COLLECT_SET | PENDING |
| LAG | SUPPORTED |
| LEAD | PENDING |
| NVL | SUPPORTED |
| NTILE | SUPPORTED |
| PARTITION | PENDING |
| ROW_NUMBER | SUPPORTED |
| TRUNC | PENDING |

---
title: Snowpark Migration Accelerator:  How do I give SMA permission to Documents, Desktop, and Downloads folders?
source: https://docs.snowflake.com/en/migrations/sma-docs/support/general-troubleshooting/how-do-i-give-sma-permission-to-documents-desktop-and-downloads-folders.md
section: Migrations
---

# Snowpark Migration Accelerator: How do I give SMA permission to Documents, Desktop, and Downloads folders?

A known issue exists on macOS where SMA crashes if the project directory lacks proper `read` and `write` permissions.

Please follow these steps:

1. Verify that your project is located in one of these directories:

* {file}:code:`Documents`: Contains your document files
* {file}:code:`Desktop`: Contains files and shortcuts on your desktop
* {file}:code:`Downloads`: Contains downloaded files

To enable access in macOS, adjust your system settings:

1. On your Mac, click the Apple menu icon, select “System Settings,” and then click “Privacy & Security” in the left sidebar.
2. Click “Files and Folders” and grant SMA access to the folders where your project is located.

---
title: Snowpark Migration Accelerator:  How the Conversion Works
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/how-the-conversion-works.md
section: Migrations
---

# Snowpark Migration Accelerator: How the Conversion Works

The Snowpark Migration Accelerator (SMA) not only generates a comprehensive assessment of your code but can also convert specific elements from your source code into compatible formats for your target codebase. This conversion process follows the same steps as the initial assessment, with just one additional step.

## Conversion in the SMA

In both assessment and conversion modes, the Snowpark Migration Accelerator (SMA):

* Searches through all files within a specified directory
* Detects which files contain code
* Analyzes the code files according to their programming language
* Creates a structured representation of the code (Abstract Syntax Tree or AST)
* Creates and fills a Symbol Table with program information
* Identifies and classifies any errors found
* Creates detailed reports of the results

All of these processes are repeated when you run SMA in conversion mode, even if you previously ran it in assessment mode. However, conversion mode includes one additional final step.

* Format the generated code from the Abstract Syntax Tree (AST) to improve readability

The Abstract Syntax Tree (AST) is a model that represents how your source code works. When the same functionality exists in both the source and target languages, SMA can generate equivalent code in the target language. This code generation only happens during the actual conversion process.

## Types of Conversion in the SMA

The Snowpark Migration Accelerator (SMA) currently supports the following code conversions:

* Converts Python or Scala code from Spark API calls to equivalent Snowpark Connect calls

> **Note:**
>
> The SMA does not perform any SQL conversion. For SQL files or SQL-only assessments, the tool provides assessment only, without any automated conversion.

Let’s examine an example written in both Scala and Python programming languages.

## Examples of Conversion of References to the Spark API to the Snowpark Connect

### Example of Spark Scala to Snowpark

When using Scala as your source language, the Snowpark Migration Accelerator (SMA) automatically converts Spark API references in your Scala code to their equivalent Snowpark Connect references. Below is an example that demonstrates how a basic Spark application is converted. The example application performs several common data operations:

* Reading data
* Filtering records
* Joining datasets
* Calculating averages
* Displaying results

Apache Spark Code Written in Scala

```scala
import org.apache.spark.sql._
import org.apache.spark.sql.functions._
import org.apache.spark.sql.SparkSession

object SimpleApp {
  // This function calculates the average salary for jobs in a specific department
  def avgJobSalary(session: SparkSession, dept: String) {
    // Load employee data from CSV file
    val employees = session.read.csv("path/data/employees.csv")
    // Load job data from CSV file
    val jobs = session.read.csv("path/data/jobs.csv")

val jobsAvgSalary = employees
    .filter(column("Department") === dept)    // Filter employees by department
    .join(jobs)                              // Join with jobs table
    .groupBy("JobName")                      // Group results by job name
    .avg("Salary")                          // Calculate average salary for each job

// Calculate and display a list of all salaries in the department
jobsAvgSalary.select(collect_list("Salary")).show()

```scala
// Calculate and display the average salary
jobsAvgSalary.show()
}
```

The Code After Conversion to Snowflake:

```scala
import com.snowflake.snowpark._
import com.snowflake.snowpark.functions._
import com.snowflake.snowpark.Session

object SimpleApp {
  // This function calculates the average salary for jobs in a specific department
  def avgJobSalary(session: Session, dept: String) {
    // Load employee data from CSV file
    val employees = session.read.csv("path/data/employees.csv")
    // Load job data from CSV file
    val jobs = session.read.csv("path/data/jobs.csv")

val jobsAvgSalary = employees
    .filter(column("Department") === dept)    // Filter employees by department
    .join(jobs)                              // Join with jobs table
    .groupBy("JobName")                      // Group results by job name
    .avg("Salary")                           // Calculate average salary per job

```scala
// Calculate and display all salaries in the department
jobsAvgSalary.select(array_agg("Salary")).show()

// Display the average salary
jobsAvgSalary.show()
}
}
```

In this example, the code structure remains largely unchanged. However, the code has been updated to use Snowpark Connect references instead of Spark API references.

### Example of PySpark to Snowpark Connect

When you choose Python as your source language, SMA automatically converts PySpark API calls in your Python code to their equivalent Snowpark Connect calls. Below is an example script that demonstrates various PySpark functions:

```python
from datetime import date, datetime
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql import Row

Create a Spark session by building and initializing a new SparkSession object, or retrieve an existing one if already available.

df = spark_session.createDataFrame([
    Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
    Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0)),
    Row(a=4, b=5., c='string3', d=date(2000, 3, 1), e=datetime(2000, 1, 3, 12, 0))
])

# cube()
df.cube("name", df.age).count().orderBy("name", "age").show()

# take()
df_new1.take(2)

# describe()
df.describe(['age']).show()

# explain()
df.explain()
df.explain("simple") # Physical plan
df.explain(True)

# intersect()
df1 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3), ("c", 4)], ["C1", "C2"])
df2 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3)], ["C1", "C2"])

# where()
df_new1.where(F.col('Id2')>30).show()
```

The Code After Conversion to Snowflake:

```python
from snowflake import snowpark_connect
from datetime import date, datetime
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql import Row
conf = SparkConf()
conf.setAppName("test")

# Create a Spark session by building and initializing a new SparkSession object, or retrieve an existing one if already available.
#EWI: SPRKCNTPY3501 => The AppName method of pyspark.sql.session.SparkSession.Builder has been replaced with the SetName function to provide equivalent functionality in Snowpark Connect
#EWI: SPRKCNTPY1001 => The creation of the SparkSession has been replaced with the creation of an equivalent Snowpark Connect Session.
spark_session = snowpark_connect.server.init_spark_session(conf = conf)

df = spark_session.createDataFrame([
    Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
    Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0)),
    Row(a=4, b=5., c='string3', d=date(2000, 3, 1), e=datetime(2000, 1, 3, 12, 0))
])

# cube()
df.cube("name", df.age).count().orderBy("name", "age").show()

# take()
df_new1 = spark_session.createDataFrame([(1, "Alice"), (2, "Bob"), (3, "Charlie")], ["Id", "Name"])
df_new1.take(2)

# describe()
df.describe(['age']).show()

# explain()
df.explain()
df.explain("simple") # Physical plan
df.explain(True)

# intersect()
df1 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3), ("c", 4)], ["C1", "C2"])
df2 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3)], ["C1", "C2"])

# where()
df_new1.where(F.col('Id2')>30).show()
```

In this example, the code structure remains largely unchanged. However, the code has been updated to use Snowpark Connect calls instead of Spark API calls.

During the conversion process with the Snowpark Migration Accelerator (SMA), you can expect the following:

---
title: Snowpark Migration Accelerator:  How the Conversion Works
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/how-the-conversion-works.md
section: Migrations
---

# Snowpark Migration Accelerator: How the Conversion Works

The Snowpark Migration Accelerator (SMA) not only generates a comprehensive assessment of your code but can also convert specific elements from your source code into compatible formats for your target codebase. This conversion process follows the same steps as the initial assessment, with just one additional step.

## Conversion in the SMA

In both assessment and conversion modes, the Snowpark Migration Accelerator (SMA):

* Searches through all files within a specified directory
* Detects which files contain code
* Analyzes the code files according to their programming language
* Creates a structured representation of the code (Abstract Syntax Tree or AST)
* Creates and fills a Symbol Table with program information
* Identifies and classifies any errors found
* Creates detailed reports of the results

All of these processes are repeated when you run SMA in conversion mode, even if you previously ran it in assessment mode. However, conversion mode includes one additional final step.

* Format the generated code from the Abstract Syntax Tree (AST) to improve readability

The Abstract Syntax Tree (AST) is a model that represents how your source code works. When the same functionality exists in both the source and target languages, SMA can generate equivalent code in the target language. This code generation only happens during the actual conversion process.

## Types of Conversion in the SMA

The Snowpark Migration Accelerator (SMA) currently supports the following code conversions:

* Converts Python or Scala code from Spark API calls to equivalent Snowpark API calls

> **Note:**
>
> The SMA does not perform any SQL conversion. For SQL files or SQL-only assessments, the tool provides assessment only, without any automated conversion.

Let’s examine an example written in both Scala and Python programming languages.

## Examples of Conversion of References to the Spark API to the Snowpark API

### Example of Spark Scala to Snowpark

When using Scala as your source language, the Snowpark Migration Accelerator (SMA) automatically converts Spark API references in your Scala code to their equivalent Snowpark API references. Below is an example that demonstrates how a basic Spark application is converted. The example application performs several common data operations:

* Reading data
* Filtering records
* Joining datasets
* Calculating averages
* Displaying results

Apache Spark Code Written in Scala

```scala
import org.apache.spark.sql._
import org.apache.spark.sql.functions._
import org.apache.spark.sql.SparkSession

object SimpleApp {
  // This function calculates the average salary for jobs in a specific department
  def avgJobSalary(session: SparkSession, dept: String) {
    // Load employee data from CSV file
    val employees = session.read.csv("path/data/employees.csv")
    // Load job data from CSV file
    val jobs = session.read.csv("path/data/jobs.csv")

val jobsAvgSalary = employees
    .filter(column("Department") === dept)    // Filter employees by department
    .join(jobs)                              // Join with jobs table
    .groupBy("JobName")                      // Group results by job name
    .avg("Salary")                          // Calculate average salary for each job

// Calculate and display a list of all salaries in the department
jobsAvgSalary.select(collect_list("Salary")).show()

```scala
// Calculate and display the average salary
jobsAvgSalary.show()
}
```

The Code After Conversion to Snowflake:

```scala
import com.snowflake.snowpark._
import com.snowflake.snowpark.functions._
import com.snowflake.snowpark.Session

object SimpleApp {
  // This function calculates the average salary for jobs in a specific department
  def avgJobSalary(session: Session, dept: String) {
    // Load employee data from CSV file
    val employees = session.read.csv("path/data/employees.csv")
    // Load job data from CSV file
    val jobs = session.read.csv("path/data/jobs.csv")

val jobsAvgSalary = employees
    .filter(column("Department") === dept)    // Filter employees by department
    .join(jobs)                              // Join with jobs table
    .groupBy("JobName")                      // Group results by job name
    .avg("Salary")                           // Calculate average salary per job

```scala
// Calculate and display all salaries in the department
jobsAvgSalary.select(array_agg("Salary")).show()

// Display the average salary
jobsAvgSalary.show()
}
}
```

In this example, the code structure remains largely unchanged. However, the code has been updated to use Snowpark API references instead of Spark API references.

### Example of PySpark to Snowpark

When you choose Python as your source language, SMA automatically converts PySpark API calls in your Python code to their equivalent Snowpark API calls. Below is an example script that demonstrates various PySpark functions:

```python
from datetime import date, datetime
from pyspark.sql import SparkSession
from pyspark.sql import functions as F
from pyspark.sql import Row

Create a Spark session by building and initializing a new SparkSession object, or retrieve an existing one if already available.

df = spark_session.createDataFrame([
    Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
    Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0)),
    Row(a=4, b=5., c='string3', d=date(2000, 3, 1), e=datetime(2000, 1, 3, 12, 0))
])

# cube()
df.cube("name", df.age).count().orderBy("name", "age").show()

# take()
df_new1.take(2)

# describe()
df.describe(['age']).show()

# explain()
df.explain()
df.explain("simple") # Physical plan
df.explain(True)

# intersect()
df1 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3), ("c", 4)], ["C1", "C2"])
df2 = spark_session.createDataFrame([("a", 1), ("a", 1), ("b", 3)], ["C1", "C2"])

# where()
df_new1.where(F.col('Id2')>30).show()
```

The Code After Conversion to Snowflake:

```python
from datetime import date, datetime
from snowflake.snowpark import Session
from snowflake.snowpark import functions as F
from snowflake.snowpark import Row

Create a Spark session using the Session builder:

spark_session = Session.builder.create()

df = spark_session.create_dataframe([
    Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
    Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0)),
    Row(a=4, b=5., c='string3', d=date(2000, 3, 1), e=datetime(2000, 1, 3, 12, 0))
])

# cube()
df.cube("name", df.age).count().sort("name", "age").show()

# take()
df_new1.take(2)

# describe()
df.describe(['age']).show()

# explain()
df.explain()
df.explain("simple") # Physical plan
df.explain(True)

# intersect()
df1 = spark_session.create_dataframe([("a", 1), ("a", 1), ("b", 3), ("c", 4)], ["C1", "C2"])
df2 = spark_session.create_dataframe([("a", 1), ("a", 1), ("b", 3)], ["C1", "C2"])

# where()
df_new1.where(F.col('Id2')>30).show()
```

In this example, the code structure remains largely unchanged. However, the code has been updated to use Snowpark API calls instead of Spark API calls.

During the conversion process with the Snowpark Migration Accelerator (SMA), you can expect the following:

---
title: Snowpark Migration Accelerator:  Installation
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/installation/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Installation

When you [download the Snowpark Migration Accelerator (SMA)](../download-and-access.md), you’ll receive an installer package. The installation process varies depending on your operating system, as SMA runs locally on your machine. You can install SMA on Windows, MacOS, or Linux. The following pages provide step-by-step installation guides for each operating system.

## Installation Guides

Before installing, verify that your system meets the [requirements](../download-and-access.md) listed on the download page.

Follow the appropriate installation guide based on your operating system.

* [Windows Installation guide](windows-installation.md)
* [MacOS Installation guide](macos-installation.md)
* [Linux Installation guide](linux-installation.md)

After installing the Snowpark Migration Accelerator (SMA), please refer to the [user guide](../../../user-guide/overview.md) for detailed instructions on using the application.

> **Note:**
>
> When your installed version of the Snowpark Migration Accelerator needs updating, you’ll see an “Updates Available” notification in the top right corner. To get the latest version:
>
> 1. Click “**UPDATE NOW**” to download the update
> 2. Follow the installation prompt that appears after the download completes
>
> You can also update by using the [Check for updates option](../../../user-guide/project-overview/configuration-and-settings.md) in the menu.

---
title: Snowpark Migration Accelerator:  Interpreting the Assessment Output
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/interpreting-the-assessment-output/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Interpreting the Assessment Output

After the tool completes its analysis, you can review the output to make informed decisions. Let’s examine the results to understand how to interpret the data and determine its value for your project.

After the analysis finishes, you’ll see a summary in the application. The detailed assessment results will be saved in the output directory you specified during project creation. The most valuable information from the assessment can be found in the files (artifacts) within this directory.

This guide explains how to interpret and use both the application’s direct output and the complete output directory. Here’s what you need to know:

* **Readiness Score(s)** - SMA generates multiple scores that measure how compatible your code is with Snowflake. For example:

  + The Spark API Readiness Score shows the percentage of Spark API code that can be automatically converted to Snowpark API
  + The SQL Readiness Score indicates how much of your Spark SQL or HiveQL code can be automatically converted to Snowflake SQL

  These scores help you understand how much manual code conversion will be needed.
* **Size** - You can assess the workload size using the File Summary, which shows the total number of files and lines of code. When combined with readiness scores for each file, you can identify which files need more attention during migration.
* More to come, but in the meantime…

Let’s examine the summary page within the application.

---
title: Snowpark Migration Accelerator:  Issue Codes by Source
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue Codes by Source

During your work with the Snowpark Migration Accelerator (SMA), you’ll frequently encounter the term “issue.” Issues are diagnostic codes that appear in both reports and converted code. These codes serve as guidance to help you achieve a successful migration.

The issue codes can be either general or specific to your source platform. You can find a list of supported platforms and their corresponding issue codes in the following pages:

## Supported Sources

* [Pandas Issue Codes](pandas/README.md)
* [Python Issue Codes](python/README.md)
* [Spark-Scala Issue Codes](spark-scala/README.md)
* [SQL Issue Codes](sql/README.md)
* [Snowpark Connect Python Issue Codes](python/snowpark-connect-codes-python.md)
* [Snowpark Connect Scala Issue Codes](spark-scala/snowpark-connect-codes-scala.md)

Note: Each code should have a suggested next step. If you notice a missing or incorrect next step, please let us know! You can contact us through:

* The tool
* Our forums
* Email at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com)

---
title: Snowpark Migration Accelerator:  Join
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/join.md
section: Migrations
---

# Snowpark Migration Accelerator: Join

## Description

Merges rows from two [table references](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-table-reference.html) using specified join conditions. For more details, see the [Databricks SQL Language Reference JOIN](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-join.html).

A `JOIN` combines data from two sources (such as tables or views) into a single result set. Each row in the result contains columns from both sources based on a specified condition. For a detailed explanation of joins, see [Working with Joins](https://docs.snowflake.com/en/user-guide/querying-joins). ([Snowflake SQL Language Reference JOIN](https://docs.snowflake.com/en/sql-reference/constructs/join))

### Syntax

```text
left_table_reference { [ join_type ] JOIN right_table_reference join_criteria |
           NATURAL join_type JOIN right_table_reference |
           CROSS JOIN right_table_reference }

join_type
  { [ INNER ] |
    LEFT [ OUTER ] |
    [ LEFT ] SEMI |
    RIGHT [ OUTER ] |
    FULL [ OUTER ] |
    [ LEFT ] ANTI |
    CROSS }

join_criteria
  { ON boolean_expression |
    USING ( column_name [, ...] ) }
```

```bnf
SELECT ...
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                   ]
                   JOIN <object_ref2>
  [ ON <condition> ]
[ ... ]

SELECT *
FROM <object_ref1> [
                     {
                       INNER
                       | { LEFT | RIGHT | FULL } [ OUTER ]
                     }
                   ]
                   JOIN <object_ref2>
  [ USING( <column_list> ) ]
[ ... ]

SELECT ...
FROM <object_ref1> [
                     {
                       | NATURAL [ { LEFT | RIGHT | FULL } [ OUTER ] ]
                       | CROSS
                     }
                   ]
                   JOIN <object_ref2>
[ ... ]
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
-- Use employee and department tables to demonstrate different type of joins.
CREATE TEMP VIEW employee(id, name, deptno) AS
     VALUES(105, 'Chloe', 5),
           (103, 'Paul', 3),
           (101, 'John', 1),
           (102, 'Lisa', 2),
           (104, 'Evan', 4),
           (106, 'Amy', 6);

CREATE TEMP VIEW department(deptno, deptname) AS
    VALUES(3, 'Engineering'),
          (2, 'Sales'      ),
          (1, 'Marketing'  );
```

#### Snowflake

```sql
-- Use employee and department tables to demonstrate different type of joins.
CREATE TEMPORARY TABLE employee(id, name, deptno) AS
SELECT id, name, deptno
  FROM (VALUES (105, 'Chloe', 5),
           (103, 'Paul' , 3),
           (101, 'John' , 1),
           (102, 'Lisa' , 2),
           (104, 'Evan' , 4),
           (106, 'Amy'  , 6)) AS v1 (id, name, deptno);

CREATE TEMP VIEW department(deptno, deptname) AS
SELECT deptno, deptname
  FROM (VALUES(3, 'Engineering'),
          (2, 'Sales'      ),
          (1, 'Marketing'  )) AS v1 (deptno, deptname);
```

### Pattern code

#### Databricks

```sql
-- 1. Use employee and department tables to demonstrate inner join.
SELECT id, name, employee.deptno, deptname
   FROM employee
   INNER JOIN department ON employee.deptno = department.deptno;

-- 2. We will use the employee and department tables to show how a left join works. This example will help you understand how to combine data from two tables while keeping all records from the left (first) table.
SELECT id, name, employee.deptno, deptname
   FROM employee
   LEFT JOIN department ON employee.deptno = department.deptno;

-- 3. Demonstrate a RIGHT JOIN using employee and department tables. This query retrieves all departments and matching employees.
SELECT id, name, employee.deptno, deptname
    FROM employee
    RIGHT JOIN department ON employee.deptno = department.deptno;

-- 4. Demonstrate a FULL JOIN operation using the employee and department tables.
SELECT id, name, employee.deptno, deptname
    FROM employee
    FULL JOIN department ON employee.deptno = department.deptno;

-- 5. Demonstrate a cross join operation using the employee and department tables. This query returns all possible combinations of employees and departments.
SELECT id, name, employee.deptno, deptname
    FROM employee
    CROSS JOIN department;

-- 6. This example shows how to use a semi join between employee and department tables. A semi join returns records from the first table (employee) where there is a matching record in the second table (department).
```{code} sql
SELECT *
    FROM employee
    SEMI JOIN department ON employee.deptno = department.deptno;
```

1. We will use two sample tables - “employee” and “department” - to show how an inner join works. An inner join combines rows from both tables where there is a match between specified columns.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 103 | Paul | 3 | Engineering |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |

---

2. We will use the employee and department tables to show how a left join works. This example will help you understand how to combine data from two tables while keeping all records from the left (first) table.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 105 | Chloe | 5 | null |
| 103 | Paul | 3 | Engineering |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |
| 104 | Evan | 4 | null |
| 106 | Amy | 6 | null |

---

3. Let’s use the employee and department tables to show how a RIGHT JOIN works in SQL.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 103 | Paul | 3 | Engineering |
| 102 | Lisa | 2 | Sales |
| 101 | John | 1 | Marketing |

---

4. Let’s use the employee and department tables to show how a full join works. A full join combines all records from both tables, including unmatched rows from either table.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |
| 103 | Paul | 3 | Engineering |
| 104 | Evan | 4 | null |
| 105 | Chloe | 5 | null |
| 106 | Amy | 6 | null |

---

5. Create a cross join between the employee and department tables to show how to combine every row from one table with every row from another table.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 105 | Chloe | 5 | Engineering |
| 105 | Chloe | 5 | Sales |
| 105 | Chloe | 5 | Marketing |
| 103 | Paul | 3 | Engineering |
| 103 | Paul | 3 | Sales |
| 103 | Paul | 3 | Marketing |
| 101 | John | 1 | Engineering |
| 101 | John | 1 | Sales |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Engineering |
| 102 | Lisa | 2 | Sales |
| 102 | Lisa | 2 | Marketing |
| 104 | Evan | 4 | Engineering |
| 104 | Evan | 4 | Sales |
| 104 | Evan | 4 | Marketing |
| 106 | Amy | 6 | Engineering |
| 106 | Amy | 6 | Sales |
| 106 | Amy | 6 | Marketing |

---

6. Let’s use the employee and department tables to show how a semi join works. A semi join returns records from the first table where there is a matching record in the second table.

| id | name | deptno |
| --- | --- | --- |
| 103 | Paul | 3 |
| 101 | John | 1 |
| 102 | Lisa | 2 |

#### Snowflake

```sql
-- 1. Use employee and department tables to demonstrate inner join.
SELECT id, name, employee.deptno, deptname
   FROM employee
   INNER JOIN department ON employee.deptno = department.deptno;

-- 2. Use employee and department tables to demonstrate left join.
SELECT id, name, employee.deptno, deptname
   FROM employee
   LEFT JOIN department ON employee.deptno = department.deptno;

-- 3. Use employee and department tables to demonstrate right join.
SELECT id, name, employee.deptno, deptname
    FROM employee
    RIGHT JOIN department ON employee.deptno = department.deptno;

-- 4. Use employee and department tables to demonstrate full join.
SELECT id, name, employee.deptno, deptname
    FROM employee
    FULL JOIN department ON employee.deptno = department.deptno;

-- 5. Use employee and department tables to demonstrate cross join.
SELECT id, name, employee.deptno, deptname
    FROM employee
    CROSS JOIN department;

-- 6. Use employee and department tables to demonstrate semi join.
SELECT e.*
    FROM employee e, department d
    WHERE e.deptno = d.deptno;
```

1. We will use two sample tables - “employee” and “department” - to show how an inner join works. An inner join combines records from both tables where there is a matching value in the specified columns.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 103 | Paul | 3 | Engineering |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |

---

2. Use employee and department tables to demonstrate left join.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 105 | Chloe | 5 | null |
| 103 | Paul | 3 | Engineering |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |
| 104 | Evan | 4 | null |
| 106 | Amy | 6 | null |

---

3. Let’s use the employee and department tables to show how a right join works.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 103 | Paul | 3 | Engineering |
| 102 | Lisa | 2 | Sales |
| 101 | John | 1 | Marketing |

---

4. Let’s use the employee and department tables to show how a full join works. A full join combines all records from both tables, including unmatched rows from either table.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 105 | Chloe | 5 | null |
| 103 | Paul | 3 | Engineering |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Sales |
| 104 | Evan | 4 | null |
| 106 | Amy | 6 | null |

---

5. Create a cross join between the employee and department tables to show how each employee can be paired with every department. This example demonstrates how cross joins work by combining all possible combinations of rows from both tables.

| id | name | deptno | deptname |
| --- | --- | --- | --- |
| 105 | Chloe | 5 | Engineering |
| 105 | Chloe | 5 | Sales |
| 105 | Chloe | 5 | Marketing |
| 103 | Paul | 3 | Engineering |
| 103 | Paul | 3 | Sales |
| 103 | Paul | 3 | Marketing |
| 101 | John | 1 | Engineering |
| 101 | John | 1 | Sales |
| 101 | John | 1 | Marketing |
| 102 | Lisa | 2 | Engineering |
| 102 | Lisa | 2 | Sales |
| 102 | Lisa | 2 | Marketing |
| 104 | Evan | 4 | Engineering |
| 104 | Evan | 4 | Sales |
| 104 | Evan | 4 | Marketing |
| 106 | Amy | 6 | Engineering |
| 106 | Amy | 6 | Sales |
| 106 | Amy | 6 | Marketing |

---

6. Let’s use the employee and department tables to show how a semi join works. A semi join returns records from the first table where there is a matching record in the second table.

| id | name | deptno |
| --- | --- | --- |
| 103 | Paul | 3 |
| 101 | John | 1 |
| 102 | Lisa | 2 |

### Known Issues

No issues were found

### Related EWIs

No related EWIs

---
title: Snowpark Migration Accelerator:  Known Issues
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/old-version-release-notes/sc-spark-python-release-notes/known-issues.md
section: Migrations
---

# Snowpark Migration Accelerator: Known Issues

* No known issues

---
title: Snowpark Migration Accelerator:  Known Issues
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/old-version-release-notes/sc-spark-scala-release-notes/known-issues.md
section: Migrations
---

# Snowpark Migration Accelerator: Known Issues

* There are some scenarios that are not properly supported in order to resolve symbols for assessment reports and mappings.
* Partial support for Companions, Case Classes, Generic classes with multiple type parameters.
* Lack of support of Generic Functions, Lambdas with multiple parameters.
* Partial support for Equivalence of types, currently all primitive types are fully supported, but user types that involve inheritance don’t.
* There is a minor issue at the migrated code, related with formatting (pretty-printing) of the scala code. Some newlines or indentations are not preserved exactly as the original code.
* Lack of support of migration for Embedded Sql queries.
* Partial support of Spark symbols libraries. See documentation for detail of what is currently supported.
* There might be issues when using third party libraries, due to having not supported elements (described below).

---
title: Snowpark Migration Accelerator:  Linux Installation
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/installation/linux-installation.md
section: Migrations
---

# Snowpark Migration Accelerator: Linux Installation

The Snowpark Migration Accelerator (SMA) offers two installation options:

* Desktop application (available for Windows and macOS users)
* Command Line Interface (CLI, available for Windows, macOS, and Linux users)

Note: Linux users can only use the CLI version.

If you need the CLI files, please check the [Downloading and Accessing page](../download-and-access.md) for detailed instructions on how to get them.

## Installing the SMA CLI on Linux

Here’s how to install the Snowpark Migration Accelerator (SMA) Command Line Interface (CLI) on your Linux system:

1. **Verify the Download:** Download the SMA CLI installation file that matches your Linux system. You can find the correct download link in the [Downloading and Accessing page](../download-and-access.md).
2. **Open a Terminal:** Open a terminal window and navigate to the folder containing the `SMA-CLI-linux.tar` file.
3. **Run Installation Commands:** Enter the following commands in your terminal. Note: You may need administrator privileges (sudo) to execute these commands.

```bash
sudo mkdir /usr/local/share/.SMA-CLI-linux
sudo tar -xf SMA-CLI-linux.tar -C /usr/local/share/.SMA-CLI-linux
sudo ln -s /usr/local/share/.SMA-CLI-linux/orchestrator/sma /usr/local/bin/sma
```

After installation is complete, you can begin using the SMA Command Line Interface (CLI). For detailed instructions, please refer to the [SMA User Guide](../../../user-guide/overview.md).

**Important Note:**

When using SMA on Windows or Mac, you will see an “UPDATE NOW” notification in the top right corner if a newer version is available. Simply click the notification to download and install the latest version.

**Troubleshooting Guide:**

If you experience any problems while installing the software, please email our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Snowpark Migration Accelerator:  Locating Issues
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/troubleshooting-the-output-code/locating-issues.md
section: Migrations
---

# Snowpark Migration Accelerator: Locating Issues

The Snowpark Migration Accelerator (SMA) converts code where possible and identifies sections it cannot convert. When SMA encounters code it cannot convert, it generates an issue with a specific issue code. Let’s explore how to locate these issues.

Use the [Issues Inventory file](../../user-guide/scos-conversion/output-reports/sma-inventories.md) from your local output to review identified issues.

The Issues Inventory provides extensive information, but these key fields are essential for locating specific issues:

* Issue Code: The unique identifier assigned to the issue
* Description: A detailed explanation of the problem
* Issue Category: The type or classification of the issue
* Filename: The specific file where the issue was found
* Line Number: The exact location in the file where the issue occurred

The issue code and description will be included as comments in the generated code.

---
title: Snowpark Migration Accelerator:  MacOS Installation
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/installation/macos-installation.md
section: Migrations
---

# Snowpark Migration Accelerator: MacOS Installation

You can install the Snowpark Migration Accelerator (SMA) on macOS in two ways:

* As a desktop application
* As a Command Line Interface (CLI)

This guide explains both installation methods.

If you need the application or CLI files, please check the [Downloading and License Access page](../download-and-access.md) for detailed instructions on how to obtain them.

## Installing the SMA Application on macOS

Follow these steps to install the Snowpark Migration Accelerator (SMA) application on your Mac:

1. **Open the .dmg File:** Double-click the .dmg file you downloaded.
2. **Accept the Terms:** Review and accept the software’s terms of use by clicking the **Accept** button.

3. **Move to Applications Folder:** Move the SMA application to your Applications folder by either dragging the SMA icon or double-clicking the SMA logo.

Once you’ve completed the installation, you can begin using the SMA application. For detailed instructions on how to use the application, refer to the [SMA User Guide](../../../user-guide/overview.md).

**Important Note:**

When a new version of SMA is available, you will see an “UPDATE NOW” button in the top right corner of your screen. Select this button to download and install the latest version automatically.

If you experience any problems while installing the software, email our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

## Installing the SMA CLI on macOS

Here’s how to install the Snowpark Migration Accelerator (SMA) Command Line Interface on your Mac:

1. **Verify the Download:** Check that you have downloaded the correct SMA CLI installation file for macOS.
2. **Extract the Files:** Unzip the installation file contents into a folder on your computer. For example, you can use `/Users/<YourUsername>/Documents/dotnet-artifacts`.
3. **Create a Symbolic Link:** Open a terminal and execute the following command. Make sure to replace `/Users/<YourUsername>/Documents/dotnet-artifacts` with your actual extraction path. You may need administrator privileges (sudo) to run this command.

```bash
sudo ln -s /Users/<YourUsername>/Documents/dotnet-artifacts/orchestrator/sma /usr/local/bin/sma
```

**Verify Installation:** Check if you can use the `sma` command by running the version check command below. This will work if `/usr/local/bin` is included in your system’s `PATH` environment variable.

```bash
sma --version
```

**Check the Version:** After installation, verify the current version of the SMA CLI by running the version command.

After installing either the application or Command Line Interface (CLI), you can begin using the Snowpark Migration Accelerator (SMA). For detailed instructions, please refer to the [SMA User Guide](../../../user-guide/overview.md).

If you experience any problems while installing the software, email our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Snowpark Migration Accelerator:  Merge
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/merge.md
section: Migrations
---

# Snowpark Migration Accelerator: Merge

## Description

The `MERGE` statement combines data from one or more source tables with a target table, allowing you to perform updates and inserts in a single operation. Based on conditions you define, it determines whether to update existing rows or insert new ones in the target table. This makes it more efficient than using separate `INSERT`, `UPDATE`, and `DELETE` statements. The `MERGE` statement always produces consistent results when run multiple times with the same data.

In Spark, you can find the MERGE syntax in the [Spark documentation](https://docs.databricks.com/en/sql/language-manual/delta-merge-into.html).

```sql
MERGE INTO target_table_name [target_alias]
   USING source_table_reference [source_alias]
   ON merge_condition
   { WHEN MATCHED [ AND matched_condition ] THEN matched_action |
     WHEN NOT MATCHED [BY TARGET] [ AND not_matched_condition ] THEN not_matched_action |
     WHEN NOT MATCHED BY SOURCE [ AND not_matched_by_source_condition ] THEN not_matched_by_source_action } [...]

matched_action
 { DELETE |
   UPDATE SET * |
   UPDATE SET { column = { expr | DEFAULT } } [, ...] }

not_matched_action
 { INSERT * |
   INSERT (column1 [, ...] ) VALUES ( expr | DEFAULT ] [, ...] )

not_matched_by_source_action
 { DELETE |
   UPDATE SET { column = { expr | DEFAULT } } [, ...] }
```

In Snowflake, the MERGE statement follows this syntax (For additional details, refer to the [Snowflake documentation](https://docs.snowflake.com/en/sql-reference/sql/merge)):

```sql
MERGE INTO <target_table> USING <source> ON <join_expr> { matchedClause | notMatchedClause } [ ... ]

matchedClause ::=
  WHEN MATCHED [ AND <case_predicate> ] THEN { UPDATE SET <col_name> = <expr> [ , <col_name2> = <expr2> ... ] | DELETE } [ ... ]

notMatchedClause ::=
   WHEN NOT MATCHED [ AND <case_predicate> ] THEN INSERT [ ( <col_name> [ , ... ] ) ] VALUES ( <expr> [ , ... ] )
```

The key distinction is that Snowflake lacks a direct equivalent to the `WHEN NOT MATCHED BY SOURCE` clause. A workaround solution is required to achieve similar functionality in Snowflake.

## Sample Source Patterns

### Sample auxiliary data

> **Note:**
>
> The following code examples have been executed to help you better understand how they work:

```sql
CREATE OR REPLACE people_source (
  person_id  INTEGER NOT NULL PRIMARY KEY,
  first_name STRING NOT NULL,
  last_name  STRING NOT NULL,
  title      STRING NOT NULL,
);

CREATE OR REPLACE TABLE people_target (
  person_id  INTEGER NOT NULL PRIMARY KEY,
  first_name STRING NOT NULL,
  last_name  STRING NOT NULL,
  title      STRING NOT NULL DEFAULT 'NONE'
);

INSERT INTO people_target VALUES (1, 'John', 'Smith', 'Mr');
INSERT INTO people_target VALUES (2, 'alice', 'jones', 'Mrs');
INSERT INTO people_source VALUES (2, 'Alice', 'Jones', 'Mrs.');
INSERT INTO people_source VALUES (3, 'Jane', 'Doe', 'Miss');
INSERT INTO people_source VALUES (4, 'Dave', 'Brown', 'Mr');
```

```sql
CREATE OR REPLACE TABLE people_source (
    person_id  INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR(20) NOT NULL,
    last_name VARCHAR(20) NOT NULL,
    title VARCHAR(10) NOT NULL
);

CREATE OR REPLACE TABLE people_target (
    person_id  INTEGER NOT NULL PRIMARY KEY,
    first_name VARCHAR(20) NOT NULL,
    last_name VARCHAR(20) NOT NULL,
    title VARCHAR(10) NOT NULL DEFAULT 'NONE'
);

INSERT INTO people_target VALUES (1, 'John', 'Smith', 'Mr');
INSERT INTO people_target VALUES (2, 'alice', 'jones', 'Mrs');
INSERT INTO people_source VALUES (2, 'Alice', 'Jones', 'Mrs.');
INSERT INTO people_source VALUES (3, 'Jane', 'Doe', 'Miss');
INSERT INTO people_source VALUES (4, 'Dave', 'Brown', 'Mr');
```

### MERGE Statement - Insert and Update Case

#### Spark

```sql
MERGE INTO people_target pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
WHEN MATCHED THEN UPDATE
  SET pt.first_name = ps.first_name,
      pt.last_name = ps.last_name,
      pt.title = DEFAULT
WHEN NOT MATCHED THEN INSERT
  (pt.person_id, pt.first_name, pt.last_name, pt.title)
  VALUES (ps.person_id, ps.first_name, ps.last_name, ps.title);

SELECT * FROM people_target;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        1|John      |Smith    |Mr   |
        2|Alice     |Jones    |NONE |
        3|Jane      |Doe      |Miss |
        4|Dave      |Brown    |Mr   |
```

#### Snowflake

```sql
MERGE INTO people_target2 pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
WHEN MATCHED THEN UPDATE
  SET pt.first_name = ps.first_name,
      pt.last_name = ps.last_name,
      pt.title = DEFAULT
WHEN NOT MATCHED THEN INSERT
  (pt.person_id, pt.first_name, pt.last_name, pt.title)
  VALUES (ps.person_id, ps.first_name, ps.last_name, ps.title);

SELECT * FROM PUBLIC.people_target ORDER BY person_id;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        1|John      |Smith    |Mr   |
        2|Alice     |Jones    |NONE |
        3|Jane      |Doe      |Miss |
        4|Dave      |Brown    |Mr   |
```

The `INSERT` and `UPDATE` operations work the same way in Snowflake. In both SQL dialects, you can use `DEFAULT` as an expression to set a column to its default value.

Spark allows insert and update operations without explicitly listing the columns. When columns are not specified, the operation affects all columns in the table. For this to work correctly, the source and destination tables must have identical column structures. If the column structures don’t match, you will receive a parsing error.

```sql
UPDATE SET *
-- This is equivalent to UPDATE SET col1 = source.col1 [, col2 = source.col2 ...]

INSERT *
-- This command copies all columns from the source table to the target table, matching columns by name. It is the same as explicitly listing all columns in both the INSERT and VALUES clauses.

Since Snowflake doesn't support these options, the migration process will instead list all columns from the target table.

### MERGE Statement - Delete Case

```{code} sql
:force:
MERGE INTO people_target pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
WHEN MATCHED AND pt.person_id < 3 THEN DELETE
WHEN NOT MATCHED BY TARGET THEN INSERT *;

SELECT * FROM people_target;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        1|John      |Smith    |Mr   |
        3|Jane      |Doe      |Miss |
        4|Dave      |Brown    |Mr   |
```

#### Snowflake

```sql
MERGE INTO people_target pt
USING people_source ps
ON    (pt.person_id = ps.person_id)
WHEN MATCHED AND pt.person_id < 3 THEN DELETE
WHEN NOT MATCHED THEN INSERT
  (pt.person_id, pt.first_name, pt.last_name, pt.title)
  VALUES (ps.person_id, ps.first_name, ps.last_name, ps.title);

SELECT * FROM people_target;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        1|John      |Smith    |Mr   |
        3|Jane      |Doe      |Miss |
        4|Dave      |Brown    |Mr   |
```

The `DELETE` action in Snowflake works the same way as in other databases. You can also add additional conditions to the `MATCHED` and `NOT MATCHED` clauses.

`WHEN NOT MATCHED BY TARGET` and `WHEN NOT MATCHED` are equivalent clauses that can be used interchangeably in SQL merge statements.

### MERGE Statement - WHEN NOT MATCHED BY SOURCE

`WHEN NOT MATCHED BY SOURCE` clauses are triggered when a row in the target table has no matching rows in the source table. This occurs when both the `merge_condition` and the optional `not_match_by_source_condition` evaluate to true. For more details, see the [Spark documentation](https://docs.databricks.com/en/sql/language-manual/delta-merge-into.html).

Snowflake does not support this clause directly. To handle this limitation, you can use the following workaround for both `DELETE` and `UPDATE` actions.

```sql
MERGE INTO people_target pt
USING people_source ps
ON pt.person_id = ps.person_id
WHEN NOT MATCHED BY SOURCE THEN DELETE;

SELECT * FROM people_target;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        2|Alice     |Jones    |NONE |
```

#### Snowflake

```sql
MERGE INTO people_target pt
USING (
    SELECT
        pt.person_id
    FROM
        people_target pt LEFT
    JOIN people_source ps ON pt.person_id = ps.person_id
    WHERE
        ps.person_id is null
) s_src
    ON s_src.person_id = pt.person_id
WHEN MATCHED THEN DELETE;

SELECT * FROM people_target;
```

```text
PERSON_ID|FIRST_NAME|LAST_NAME|TITLE|
---------+----------+---------+-----+
        2|Alice     |Jones    |NONE |
```

The `DELETE` action in Snowflake works the same way as in other databases. You can also add additional conditions to the `MATCHED` and `NOT MATCHED` clauses.

## Known issues

### 1. MERGE is very similar in both languages

While Apache Spark offers additional features, you can achieve similar functionality in Snowflake using alternative approaches, as demonstrated in the previous examples.

## Related EWIs

No related Errors, Warnings, and Issues (EWIs) found.

---
title: Snowpark Migration Accelerator:  Open Source Libraries
source: https://docs.snowflake.com/en/migrations/sma-docs/general/conversion-software-terms-of-use/open-source-libraries.md
section: Migrations
---

# Snowpark Migration Accelerator: Open Source Libraries

The open-source libraries used in the Snowpark Migration Accelerator Include:

## .NET Open Source Libraries

| name | version | type | licenses | license urls |
| --- | --- | --- | --- | --- |
| Microsoft.Extensions.Logging.Console | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Configuration | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Collections.Immutable | 7.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Primitives | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Abstractions | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.FileExtensions | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.DependencyInjection.Abstractions | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.DependencyInjection | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Logging.Abstractions | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Options | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Bcl.AsyncInterfaces | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.IO.Pipelines | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Options.ConfigurationExtensions | 8.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.Encodings.Web | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileSystemGlobbing | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileProviders.Physical | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Text.Json | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.FileProviders.Abstractions | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Json | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Microsoft.Extensions.Configuration.Binder | 9.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Buffers | 4.5.1 | nuget | MIT | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| Microsoft.CodeCoverage | 17.2.1 | nuget | Microsoft-.NET-Library | <https://github.com/microsoft/vstest/blob/master/LICENSE> |
| H.Formatters.BinaryFormatter | 2.0.51 | nuget | MIT | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| H.Pipes | 2.0.51 | nuget | MIT | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| H.Pipes.AccessControl | 2.0.51 | nuget | MIT | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |
| TestableIO.System.IO.Abstractions | 21.0.26 | nuget | MIT | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| System.IO.Abstractions | 21.0.26 | nuget | MIT | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| TestableIO.System.IO.Abstractions.Wrappers | 21.0.26 | nuget | MIT | <https://github.com/TestableIO/System.IO.Abstractions/blob/master/LICENSE> |
| NamedPipeServerStream.NetFrameworkVersion | 1.1.11 | nuget | MIT | <https://github.com/HavenDV/NamedPipeServerStream.NetFrameworkVersion/blob/master/LICENSE.txt> |
| StyleCop.Analyzers | 1.1.118 | nuget | Apache-2.0 | <https://github.com/DotNetAnalyzers/StyleCopAnalyzers?tab=MIT-1-ov-file#readme> |
| System.Runtime.CompilerServices.Unsafe | 6.0.0 | nuget | MIT | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Diagnostics.EventLog | 6.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.ComponentModel.Composition | 6.0.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| NuGet.Frameworks | 5.11.0 | nuget | Apache-2.0 | <https://github.com/NuGet/NuGet.Client/blob/master/LICENSE.txt> |
| Castle.Core | 5.1.1 | nuget | Apache-2.0 | <https://github.com/castleproject/Core/blob/master/LICENSE> |
| System.Memory | 4.5.5 | nuget | MIT | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.Threading.Tasks.Extensions | 4.5.4 | nuget | MIT | <https://github.com/dotnet/maintenance-packages/blob/master/LICENSE> |
| System.ValueTuple | 4.5.0 | nuget | MIT | <https://github.com/dotnet/corefx/blob/master/LICENSE.TXT> |
| System.Configuration.ConfigurationManager | 4.4.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| System.Security.Cryptography.ProtectedData | 4.4.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| Moq | 4.18.4 | nuget | BSD-3-Clause | <https://github.com/devlooped/moq?tab=License-1-ov-file> |
| Newtonsoft.Json | 13.0.3 | nuget | MIT | <https://github.com/JamesNK/Newtonsoft.Json?tab=MIT-1-ov-file#readme> |
| System.Reflection.Metadata | 1.6.0 | nuget | MIT | <https://github.com/dotnet/runtime/blob/master/LICENSE.TXT> |
| H.Formatters | 2.0.51 | nuget | MIT | <https://github.com/HavenDV/H.Pipes/blob/master/LICENSE.txt> |

### Node Open Source Libraries

| name | version | type | licenses | license urls |
| --- | --- | --- | --- | --- |
| abbrev | 1.1.1 | npm | ISC | <https://github.com/npm/abbrev-js/blob/master/LICENSE> |
| ace-builds | 1.31.1 | npm | BSD-3-Clause | <https://github.com/ajaxorg/ace-builds/blob/master/LICENSE> |
| acorn | 8.11.2 | npm | MIT | <https://github.com/acornjs/acorn/blob/master/acorn/LICENSE> |
| acorn-import-assertions | 1.9.0 | npm | MIT | <https://github.com/xtuc/acorn-import-assertions/blob/master/LICENSE> |
| agent-base | 6.0.2 | npm | MIT | <https://github.com/TooTallNate/proxy-agents/blob/main/packages/agent-base/LICENSE> |
| ajv | 8.12.0 | npm | MIT | <https://github.com/ajv-validator/ajv/blob/master/LICENSE> |
| ajv-formats | 2.1.1 | npm | MIT | <https://github.com/ajv-validator/ajv-formats/blob/master/LICENSE> |
| ansi-styles | 3.2.1 | npm | MIT | <https://github.com/chalk/ansi-styles/blob/main/license> |
| applicationinsights | 2.9.0 | npm | MIT | <https://github.com/microsoft/ApplicationInsights-node.js/blob/master/LICENSE> |
| argparse | 2.0.1 | npm | Python-2.0 | <https://github.com/nodeca/argparse/blob/master/LICENSE> |
| async-hook-jl | 1.7.6 | npm | MIT | <https://github.com/Jeff-Lewis/async-hook-jl/blob/master/LICENSE.md> |
| async-listener | 0.6.10 | npm | BSD-2-Clause | <https://github.com/othiym23/async-listener/blob/master/LICENSE> |
| asynckit | 0.4.0 | npm | MIT | <https://github.com/alexindigo/asynckit/blob/master/LICENSE> |
| atomically | 1.7.0 | npm | MIT | <https://github.com/fabiospampinato/atomically/blob/master/license> |
| axios | 1.7.7 | npm | MIT | <https://github.com/axios/axios/blob/master/LICENSE> |
| azure/abort-controller | 1.1.0 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/core-auth | 1.5.0 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/core-rest-pipeline | 1.10.1 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/core-tracing | 1.0.1 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/core-util | 1.6.1 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/logger | 1.0.4 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| azure/opentelemetry-instrumentation-azure-sdk | 1.0.0-beta.5 | npm | MIT | <https://github.com/Azure/azure-sdk-for-js/blob/main/LICENSE> |
| babel-plugin-macros | 3.1.0 | npm | MIT | <https://github.com/kentcdodds/babel-plugin-macros/blob/master/LICENSE> |
| babel/code-frame | 7.22.13 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/helper-module-imports | 7.22.15 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/helper-string-parser | 7.22.5 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/helper-validator-identifier | 7.22.20 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/highlight | 7.22.20 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/runtime | 7.23.2 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| babel/types | 7.23.0 | npm | MIT | <https://github.com/babel/babel/blob/master/LICENSE> |
| binary | 0.3.0 | npm | MIT | <https://www.npmjs.com/package/binary?activeTab=code> |
| buffers | 0.1.1 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| builder-util-runtime | 9.2.5-alpha.4 | npm | MIT | <https://github.com/electron-userland/electron-builder/blob/master/LICENSE> |
| callsites | 3.1.0 | npm | MIT | <https://github.com/sindresorhus/callsites/blob/main/license> |
| chainsaw | 0.1.0 | npm | MIT, X11 | <https://www.npmjs.com/package/chainsaw?activeTab=code> |
| chalk | 2.4.2 | npm | MIT | <https://github.com/chalk/chalk/blob/main/license> |
| cjs-module-lexer | 1.2.3 | npm | MIT | <https://github.com/nodejs/cjs-module-lexer/blob/master/LICENSE> |
| classnames | 2.3.2 | npm | MIT | <https://github.com/JedWatson/classnames/blob/master/LICENSE> |
| cls-hooked | 4.2.2 | npm | BSD-2-Clause | <https://github.com/jeff-lewis/cls-hooked/blob/master/LICENSE> |
| clsx | 2.0.0 | npm | MIT | <https://github.com/lukeed/clsx/blob/master/license> |
| color-convert | 1.9.3 | npm | MIT | <https://github.com/Qix-/color-convert/blob/master/LICENSE> |
| color-name | 1.1.3 | npm | MIT | <https://github.com/colorjs/color-name/blob/master/LICENSE> |
| combined-stream | 1.0.8 | npm | MIT | <https://github.com/felixge/node-combined-stream/blob/master/License> |
| conf | 10.2.0 | npm | MIT | <https://github.com/sindresorhus/conf/blob/main/license> |
| config | 3.3.9 | npm | MIT | <https://github.com/node-config/node-config/blob/master/LICENSE> |
| continuation-local-storage | 3.2.1 | npm | BSD-2-Clause | <https://github.com/othiym23/node-continuation-local-storage/blob/master/LICENSE> |
| convert-source-map | 1.9.0 | npm | MIT | <https://github.com/thlorenz/convert-source-map/blob/master/LICENSE> |
| core-util-is | 1.0.3 | npm | MIT | <https://github.com/isaacs/core-util-is/blob/master/LICENSE> |
| cosmiconfig | 7.1.0 | npm | MIT | <https://github.com/cosmiconfig/cosmiconfig/blob/master/LICENSE> |
| css-mediaquery | 0.1.2 | npm | BSD-2-Clause | <https://github.com/ericf/css-mediaquery/blob/master/LICENSE> |
| css-vendor | 2.0.8 | npm | MIT | <https://github.com/cssinjs/css-vendor/blob/master/LICENSE> |
| csstype | 3.1.2 | npm | MIT | <https://github.com/frenic/csstype/blob/master/LICENSE> |
| debounce-fn | 4.0.0 | npm | MIT | <https://github.com/sindresorhus/debounce-fn/blob/main/license> |
| debug | 4.3.4 | npm | MIT | <https://github.com/debug-js/debug/blob/master/LICENSE> |
| decompress-zip | 0.3.3 | npm | MIT | <https://github.com/bower/decompress-zip/blob/master/license> |
| deepmerge | 2.2.1 | npm | MIT | <https://github.com/TehShrike/deepmerge/blob/master/license.txt> |
| delayed-stream | 1.0.0 | npm | MIT | <https://github.com/felixge/node-delayed-stream/blob/master/License> |
| diagnostic-channel | 1.1.1 | npm | MIT | <https://github.com/Microsoft/node-diagnostic-channel/blob/master/LICENSE> |
| diagnostic-channel-publishers | 1.0.7 | npm | MIT | <https://github.com/Microsoft/node-diagnostic-channel/blob/master/LICENSE> |
| diff-match-patch | 1.0.5 | npm | Apache-2.0 | <https://github.com/JackuB/diff-match-patch/blob/master/LICENSE> |
| dom-helpers | 5.2.1 | npm | MIT | <https://github.com/react-bootstrap/dom-helpers/blob/master/LICENSE> |
| dot-prop | 6.0.1 | npm | MIT | <https://github.com/sindresorhus/dot-prop/blob/main/license> |
| electron-cgi | 1.0.6 | npm | MIT | <https://github.com/ruidfigueiredo/electron-cgi/blob/master/LICENSE> |
| electron-debug | 3.2.0 | npm | MIT | <https://github.com/sindresorhus/electron-debug/blob/main/license> |
| electron-is-accelerator | 0.1.2 | npm | MIT | <https://github.com/brrd/electron-is-accelerator/blob/master/LICENSE> |
| electron-is-dev | 1.2.0 | npm | MIT | <https://github.com/sindresorhus/electron-is-dev/blob/main/license> |
| electron-localshortcut | 3.2.1 | npm | MIT | <https://github.com/parro-it/electron-localshortcut/blob/master/license> |
| electron-log | 4.4.8 | npm | MIT | <https://github.com/megahertz/electron-log/blob/master/LICENSE> |
| electron-store | 8.1.0 | npm | MIT | <https://github.com/sindresorhus/electron-store/blob/main/license> |
| electron-updater | 6.3.0-alpha.7 | npm | MIT | <https://github.com/electron-userland/electron-builder/blob/master/LICENSE> |
| electron/remote | 2.0.12 | npm | MIT | <https://github.com/electron/remote/blob/master/LICENSE> |
| emitter-listener | 1.1.2 | npm | BSD-2-Clause | <https://github.com/othiym23/emitter-listener/blob/master/package.json> |
| emotion/babel-plugin | 11.11.0 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/cache | 11.11.0 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/hash | 0.9.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/is-prop-valid | 1.2.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/memoize | 0.8.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/react | 11.11.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/serialize | 1.1.2 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/sheet | 1.2.2 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/styled | 11.11.0 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/unitless | 0.8.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/use-insertion-effect-with-fallbacks | 1.0.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/utils | 1.2.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| emotion/weak-memoize | 0.3.1 | npm | MIT | <https://github.com/emotion-js/emotion.git#main/blob/master/LICENSE.md> |
| env-paths | 2.2.1 | npm | MIT | <https://github.com/sindresorhus/env-paths/blob/main/license> |
| error-ex | 1.3.2 | npm | MIT | <https://github.com/qix-/node-error-ex/blob/master/LICENSE> |
| escape-string-regexp | 4.0.0 | npm | MIT | <https://github.com/sindresorhus/escape-string-regexp/blob/main/license> |
| fast-deep-equal | 3.1.3 | npm | MIT | <https://github.com/epoberezkin/fast-deep-equal/blob/master/LICENSE> |
| find-root | 1.1.0 | npm | MIT | <https://github.com/junosuarez/find-root/blob/master/LICENSE.md> |
| find-up | 3.0.0 | npm | MIT | <https://github.com/sindresorhus/find-up/blob/main/license> |
| floating-ui/core | 1.5.0 | npm | MIT | <https://github.com/floating-ui/floating-ui/blob/master/LICENSE> |
| floating-ui/dom | 1.5.3 | npm | MIT | <https://github.com/floating-ui/floating-ui/blob/master/LICENSE> |
| floating-ui/react-dom | 2.0.2 | npm | MIT | <https://github.com/floating-ui/floating-ui/blob/master/LICENSE> |
| floating-ui/utils | 0.1.6 | npm | MIT | <https://github.com/floating-ui/floating-ui/blob/master/LICENSE> |
| follow-redirects | 1.15.9 | npm | MIT | <https://github.com/follow-redirects/follow-redirects/blob/main/LICENSE> |
| form-data | 4.0.0 | npm | MIT | <https://github.com/form-data/form-data/blob/master/License> |
| formik | 2.4.5 | npm | Apache-2.0 | <https://github.com/jaredpalmer/formik/blob/master/LICENSE> |
| formik-mui | 4.0.0-alpha.3 | npm | MIT | <https://github.com/stackworx/formik-mui/blob/master/LICENSE> |
| fs-extra | 10.1.0 | npm | MIT | <https://github.com/jprichardson/node-fs-extra/blob/master/LICENSE> |
| function-bind | 1.1.2 | npm | MIT | <https://github.com/Raynos/function-bind/blob/master/LICENSE> |
| graceful-fs | 4.2.11 | npm | ISC | <https://github.com/isaacs/node-graceful-fs/blob/master/LICENSE> |
| has-flag | 3.0.0 | npm | MIT | <https://github.com/sindresorhus/has-flag/blob/main/license> |
| hasown | 2.0.0 | npm | MIT | <https://github.com/inspect-js/hasOwn/blob/master/LICENSE> |
| hoist-non-react-statics | 3.3.2 | npm | BSD-3-Clause | <https://github.com/mridgway/hoist-non-react-statics/blob/main/LICENSE.md> |
| html-parse-stringify | 3.0.1 | npm | MIT | <https://github.com/HenrikJoreteg/html-parse-stringify/blob/master/README.md> |
| http-proxy-agent | 5.0.0 | npm | MIT | <https://github.com/TooTallNate/proxy-agents/blob/main/packages/http-proxy-agent/LICENSE> |
| https-proxy-agent | 5.0.1 | npm | MIT | <https://github.com/TooTallNate/proxy-agents/blob/main/packages/https-proxy-agent/LICENSE> |
| hyphenate-style-name | 1.0.4 | npm | BSD-3-Clause | <https://github.com/rexxars/hyphenate-style-name/blob/main/LICENSE> |
| i18next | 22.5.1 | npm | MIT | <https://github.com/i18next/i18next/blob/master/LICENSE> |
| import-fresh | 3.3.0 | npm | MIT | <https://github.com/sindresorhus/import-fresh/blob/main/license> |
| import-in-the-middle | 1.4.2 | npm | Apache-2.0 | <https://github.com/nodejs/import-in-the-middle/blob/main/LICENSE> |
| inherits | 2.0.4 | npm | ISC | <https://github.com/isaacs/inherits/blob/main/LICENSE> |
| inversify | 6.0.2 | npm | MIT | <https://github.com/inversify/InversifyJS/blob/master/LICENSE> |
| inversify-react | 1.1.0 | npm | Apache-2.0 | <https://github.com/Kukkimonsuta/inversify-react/blob/master/LICENSE> |
| is-arrayish | 0.2.1 | npm | MIT | <https://github.com/qix-/node-is-arrayish/blob/master/LICENSE> |
| is-core-module | 2.13.1 | npm | MIT | <https://github.com/inspect-js/is-core-module/blob/master/LICENSE> |
| is-in-browser | 1.1.3 | npm | MIT | <https://github.com/tuxsudo/is-in-browser/blob/master/LICENSE> |
| is-obj | 2.0.0 | npm | MIT | <https://github.com/sindresorhus/is-obj/blob/main/license> |
| isarray | 0.0.1 | npm | MIT | <https://github.com/juliangruber/isarray/blob/master/LICENSE> |
| js-tokens | 4.0.0 | npm | MIT | <https://github.com/lydell/js-tokens/blob/master/LICENSE> |
| js-yaml | 4.1.0 | npm | MIT | <https://github.com/nodeca/js-yaml/blob/master/LICENSE> |
| json-parse-even-better-errors | 2.3.1 | npm | MIT | <https://github.com/npm/json-parse-even-better-errors/blob/master/LICENSE.md> |
| json-schema-traverse | 1.0.0 | npm | MIT | <https://github.com/epoberezkin/json-schema-traverse/blob/master/LICENSE> |
| json-schema-typed | 7.0.3 | npm | BSD-2-Clause | <https://github.com/jrylan/json-schema-typed/blob/master/LICENSE.md> |
| json5 | 2.2.3 | npm | MIT | <https://github.com/json5/json5/blob/master/LICENSE.md> |
| jsonfile | 6.1.0 | npm | MIT | <https://github.com/jprichardson/node-jsonfile/blob/master/LICENSE> |
| jss | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-camel-case | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-default-unit | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-global | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-nested | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-props-sort | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-rule-value-function | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| jss-plugin-vendor-prefixer | 10.10.0 | npm | MIT | <https://github.com/cssinjs/jss/blob/master/LICENSE> |
| keyboardevent-from-electron-accelerator | 2.0.0 | npm | MIT | <https://github.com/parro-it/keyboardevent-from-electron-accelerator/blob/master/license> |
| keyboardevents-areequal | 0.2.2 | npm | MIT | <https://github.com/parro-it/keyboardevents-areequal/blob/master/license> |
| lazy-val | 1.0.5 | npm | MIT | <https://github.com/develar/lazy-val/blob/master/package.json> |
| lines-and-columns | 1.2.4 | npm | MIT | <https://github.com/eventualbuddha/lines-and-columns/blob/master/LICENSE> |
| locate-path | 3.0.0 | npm | MIT | <https://github.com/sindresorhus/locate-path/blob/main/license> |
| lodash | 4.17.21 | npm | MIT | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash-es | 4.17.21 | npm | MIT | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.escaperegexp | 4.1.2 | npm | MIT | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.get | 4.4.2 | npm | MIT | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| lodash.isequal | 4.5.0 | npm | MIT | <https://github.com/lodash/lodash/blob/master/LICENSE> |
| loose-envify | 1.4.0 | npm | MIT | <https://github.com/zertosh/loose-envify/blob/master/LICENSE> |
| lru-cache | 6.0.0 | npm | ISC | <https://github.com/isaacs/node-lru-cache/blob/main/LICENSE> |
| markdown-to-jsx | 7.3.2 | npm | MIT | <https://github.com/quantizor/markdown-to-jsx/blob/master/LICENSE> |
| matchmediaquery | 0.3.1 | npm | MIT | <https://github.com/ncochard/matchmediaquery/blob/master/LICENSE> |
| microsoft/applicationinsights-web-snippet | 1.0.1 | npm | MIT | <https://github.com/microsoft/ApplicationInsights-JS/blob/main/tools/applicationinsights-web-snippet/LICENSE> |
| mime-db | 1.52.0 | npm | MIT | <https://github.com/jshttp/mime-db/blob/master/LICENSE> |
| mime-types | 2.1.35 | npm | MIT | <https://github.com/jshttp/mime-types/blob/master/LICENSE> |
| mimic-fn | 3.1.0 | npm | MIT | <https://github.com/sindresorhus/mimic-function/blob/main/license> |
| mkpath | 0.1.0 | npm | MIT | <https://github.com/jrajav/mkpath/blob/master/LICENSE> |
| module-details-from-path | 1.0.3 | npm | MIT | <https://github.com/watson/module-details-from-path/blob/master/LICENSE> |
| ms | 2.1.2 | npm | MIT | <https://github.com/vercel/ms/blob/main/license.md> |
| mui-markdown | 0.5.7 | npm | MIT | <https://github.com/HPouyanmehr/mui-markdown/blob/main/package/package.json> |
| mui/base | 5.0.0-beta.22 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/core-downloads-tracker | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/icons-material | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/lab | 5.0.0-alpha.151 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/material | 5.14.7 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/private-theming | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/styled-engine | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/styles | 5.14.7 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/system | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/types | 7.2.8 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/utils | 5.14.16 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/x-data-grid | 6.9.1 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| mui/x-tree-view | 6.0.0-alpha.1 | npm | MIT | <https://github.com/mui/material-ui/blob/master/LICENSE> |
| nanoclone | 0.2.1 | npm | MIT | <https://github.com/kelin2025/nanoclone/blob/master/LICENSE> |
| nopt | 3.0.6 | npm | ISC | <https://github.com/npm/nopt/blob/master/LICENSE> |
| object-assign | 4.1.1 | npm | MIT | <https://github.com/sindresorhus/object-assign/blob/main/license> |
| onetime | 5.1.2 | npm | MIT | <https://github.com/sindresorhus/onetime/blob/main/license> |
| opentelemetry/api | 1.6.0 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| opentelemetry/core | 1.17.1 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| opentelemetry/instrumentation | 0.41.2 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| opentelemetry/resources | 1.17.1 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| opentelemetry/sdk-trace-base | 1.17.1 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| opentelemetry/semantic-conventions | 1.17.1 | npm | Apache-2.0 | <https://github.com/open-telemetry/opentelemetry-js/blob/master/LICENSE> |
| p-limit | 2.3.0 | npm | MIT | <https://github.com/sindresorhus/p-limit/blob/main/license> |
| p-locate | 3.0.0 | npm | MIT | <https://github.com/sindresorhus/p-locate/blob/main/license> |
| p-timeout | 6.1.2 | npm | MIT | <https://github.com/sindresorhus/p-timeout/blob/main/license> |
| p-try | 2.2.0 | npm | MIT | <https://github.com/sindresorhus/p-try/blob/main/license> |
| parent-module | 1.0.1 | npm | MIT | <https://github.com/sindresorhus/parent-module/blob/main/license> |
| parse-json | 5.2.0 | npm | MIT | <https://github.com/sindresorhus/parse-json/blob/main/license> |
| path-exists | 3.0.0 | npm | MIT | <https://github.com/sindresorhus/path-exists/blob/main/license> |
| path-parse | 1.0.7 | npm | MIT | <https://github.com/jbgutierrez/path-parse/blob/master/LICENSE> |
| path-type | 4.0.0 | npm | MIT | <https://github.com/sindresorhus/path-type/blob/main/license> |
| pkg-up | 3.1.0 | npm | MIT | <https://github.com/sindresorhus/package-up/blob/main/license> |
| popperjs/core | 2.11.8 | npm | MIT | <https://github.com/popperjs/popper-core/blob/master/LICENSE> |
| prism-react-renderer | 1.3.5 | npm | MIT | <https://github.com/FormidableLabs/prism-react-renderer/blob/master/LICENSE> |
| prop-types | 15.8.1 | npm | MIT | <https://github.com/facebook/prop-types/blob/master/LICENSE> |
| property-expr | 2.0.6 | npm | MIT | <https://github.com/jquense/expr/blob/master/LICENSE.txt> |
| proxy-from-env | 1.1.0 | npm | MIT | [https://github.com/Rob–W/proxy-from-env/blob/master/LICENSE](https://github.com/Rob--W/proxy-from-env/blob/master/LICENSE) |
| pubsub-js | 1.9.4 | npm | MIT | <https://github.com/mroderick/PubSubJS/blob/master/LICENSE.md> |
| punycode | 2.3.1 | npm | MIT | <https://github.com/mathiasbynens/punycode.js/blob/main/LICENSE-MIT.txt> |
| q | 1.5.1 | npm | MIT | <https://github.com/kriskowal/q/blob/master/LICENSE> |
| react | 18.2.0 | npm | MIT | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-ace | 10.1.0 | npm | MIT | <https://github.com/securingsincity/react-ace/blob/main/LICENSE> |
| react-dom | 18.2.0 | npm | MIT | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-error-boundary | 3.1.4 | npm | MIT | <https://github.com/bvaughn/react-error-boundary/blob/master/LICENSE> |
| react-fast-compare | 2.0.4 | npm | MIT | <https://github.com/FormidableLabs/react-fast-compare/blob/master/LICENSE> |
| react-i18next | 12.3.1 | npm | MIT | <https://github.com/i18next/react-i18next/blob/master/LICENSE> |
| react-is | 18.2.0 | npm | MIT | <https://github.com/facebook/react/blob/master/LICENSE> |
| react-responsive | 9.0.2 | npm | MIT | <https://github.com/yocontra/react-responsive/blob/master/LICENSE> |
| react-router | 6.18.0 | npm | MIT | <https://github.com/remix-run/react-router/blob/master/LICENSE.md> |
| react-router-dom | 6.18.0 | npm | MIT | <https://github.com/remix-run/react-router/blob/master/LICENSE.md> |
| react-tabs | 6.0.2 | npm | MIT | <https://github.com/reactjs/react-tabs/blob/master/LICENSE> |
| react-transition-group | 4.4.5 | npm | BSD-3-Clause | <https://github.com/reactjs/react-transition-group/blob/master/LICENSE> |
| readable-stream | 1.1.14 | npm | MIT | <https://github.com/nodejs/readable-stream/blob/main/LICENSE> |
| regenerator-runtime | 0.14.0 | npm | MIT | <https://github.com/facebook/regenerator.git#main/blob/master/LICENSE.md> |
| remix-run/router | 1.11.0 | npm | MIT | <https://github.com/remix-run/react-router/blob/master/LICENSE.md> |
| require-from-string | 2.0.2 | npm | MIT | <https://github.com/floatdrop/require-from-string/blob/master/license> |
| require-in-the-middle | 7.2.0 | npm | MIT | <https://github.com/elastic/require-in-the-middle/blob/master/LICENSE> |
| reselect | 4.1.8 | npm | MIT | <https://github.com/reduxjs/reselect/blob/master/LICENSE> |
| resolve | 1.22.8 | npm | MIT | <https://github.com/browserify/resolve/blob/main/LICENSE> |
| resolve-from | 4.0.0 | npm | MIT | <https://github.com/sindresorhus/resolve-from/blob/main/license> |
| sax | 1.3.0 | npm | ISC | <https://github.com/isaacs/sax-js/blob/main/LICENSE> |
| scheduler | 0.23.0 | npm | MIT | <https://github.com/facebook/react/blob/master/LICENSE> |
| semver | 7.5.4 | npm | ISC | <https://github.com/npm/node-semver/blob/master/LICENSE> |
| shallow-equal | 1.2.1 | npm | MIT | <https://github.com/moroshko/shallow-equal/blob/master/LICENSE> |
| shimmer | 1.2.1 | npm | BSD-2-Clause | <https://github.com/othiym23/shimmer/blob/master/LICENSE> |
| source-map | 0.5.7 | npm | BSD-3-Clause | <https://github.com/mozilla/source-map/blob/master/LICENSE> |
| stack-chain | 1.3.7 | npm | MIT | <https://github.com/AndreasMadsen/stack-chain/blob/master/LICENSE.md> |
| string_decoder | 0.10.31 | npm | MIT | <https://github.com/nodejs/string_decoder/blob/main/LICENSE> |
| stylis | 4.2.0 | npm | MIT | <https://github.com/thysultan/stylis.js/blob/master/LICENSE> |
| supports-color | 5.5.0 | npm | MIT | <https://github.com/chalk/supports-color/blob/main/license> |
| supports-preserve-symlinks-flag | 1.0.0 | npm | MIT | <https://github.com/inspect-js/node-supports-preserve-symlinks-flag/blob/master/LICENSE> |
| testing-library/react-hooks | 8.0.1 | npm | MIT | <https://github.com/testing-library/react-hooks-testing-library/blob/master/LICENSE.md> |
| tiny-typed-emitter | 2.1.0 | npm | MIT | <https://github.com/binier/tiny-typed-emitter/blob/master/LICENSE> |
| tiny-warning | 1.0.3 | npm | MIT | <https://github.com/alexreardon/tiny-warning/blob/master/LICENSE> |
| to-fast-properties | 2.0.0 | npm | MIT | <https://github.com/sindresorhus/to-fast-properties/blob/main/license> |
| tootallnate/once | 2.0.0 | npm | MIT | <https://github.com/TooTallNate/once/blob/master/LICENSE> |
| toposort | 2.0.2 | npm | MIT | <https://github.com/marcelklehr/toposort/blob/master/License> |
| touch | 0.0.3 | npm | ISC | <https://github.com/isaacs/node-touch/blob/master/LICENSE> |
| traverse | 0.3.9 | npm | MIT, X11 | <https://github.com/ljharb/js-traverse/blob/main/LICENSE> |
| tslib | 2.6.2 | npm | 0BSD | <https://github.com/Microsoft/tslib/blob/master/LICENSE.txt> |
| type-fest | 2.19.0 | npm | MIT, CC0-1.0 | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/hoist-non-react-statics | 3.3.4 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/lodash | 4.14.200 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/parse-json | 4.0.1 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/prop-types | 15.7.9 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/react | 18.2.34 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/react-transition-group | 4.4.8 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/scheduler | 0.16.5 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| types/shimmer | 1.0.4 | npm | MIT | <https://github.com/DefinitelyTyped/DefinitelyTyped/blob/master/LICENSE> |
| underscore | 1.13.6 | npm | MIT | <https://github.com/jashkenas/underscore/blob/master/LICENSE> |
| universalify | 2.0.1 | npm | MIT | <https://github.com/RyanZim/universalify/blob/master/LICENSE> |
| uri-js | 4.4.1 | npm | BSD-2-Clause | <https://github.com/jfromaniello/url-join/blob/main/LICENSE> |
| url-join | 4.0.1 | npm | MIT | <https://github.com/uuidjs/uuid/blob/main/LICENSE.md> |
| uuid | 9.0.1 | npm | MIT | <https://github.com/uuidjs/uuid/blob/master/LICENSE.md> |
| void-elements | 3.1.0 | npm | MIT | <https://github.com/pugjs/void-elements/blob/master/LICENSE> |
| yallist | 4.0.0 | npm | ISC | <https://github.com/isaacs/yallist/blob/master/LICENSE.md> |
| yaml | 1.10.2 | npm | ISC | <https://github.com/eemeli/yaml/blob/master/LICENSE> |
| yup | 0.32.11 | npm | MIT | <https://github.com/jquense/yup/blob/master/LICENSE.md> |

---
title: Snowpark Migration Accelerator:  Output Code
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/output-code.md
section: Migrations
---

# Snowpark Migration Accelerator: Output Code

The SMA conversion process generates an Output folder containing the converted code. This folder becomes available immediately after the conversion is complete.

Before examining the code in detail, let’s understand how the output code is organized.

## Output Code File Structure

A project file stores information about your SMA project. You can run SMA multiple times within the same project, and each execution will be associated with that project.

When you run an assessment, SMA creates a folder in your specified output directory. This folder, named “Assessment,” contains all reports and logs generated during the assessment phase. Similarly, when you proceed to conversion, SMA creates another folder named “Conversion” in the output directory you specified during Conversion Setup. This second folder contains all reports and logs generated during the conversion phase.

When you click “View Output” in the Conversion Results Page, you will be directed to the Output folder within the Conversion-Date-Time directory. (While this folder also exists in the Assessment-Date-Time directory, it only contains demo code.

The output directory structure will be identical to the input directory structure. All folders, subfolders, and code files will be copied to the output directory with the same names and organization.

The converted code will maintain the same filenames as the source files. For example, if your source file is named “Notebook_1”, the converted file will also be named “Notebook_1”.

Additional information will be provided.

> **Note:**
>
> If checkpoints generation is enabled in the settings page (check [Conversion Settings](../project-overview/configuration-and-settings.md)), the conversion will create a file in the output named “checkpoints.json” which allows the user to run it using checkpoints feature within the VS Code Snowflake Extension.

---
title: Snowpark Migration Accelerator:  Output Reports
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Output Reports

The assessment phase of this accelerator generates multiple detailed reports, including:

Let’s organize these reports into three distinct categories to make them easier to understand.

* [Curated Reports](curated-reports.md) - Detailed, formatted reports that provide in-depth analysis of the information shown in the application.
* [SMA Inventories](sma-inventories.md) - Detailed spreadsheets that help you understand the analyzed codebase.
* [Assessment zip file](assessment-zip-file.md) - A compressed file containing all reports, useful for offline review or sharing.

These generated files provide detailed analysis and insights about your codebase after processing it through the tool.

To view the reports, click the “View Reports” button located at the bottom of the application.

Inside the “Reports” directory of your specified output folder, you will find several files. These files are similar to those shown in the first image above.

* AssessmentReport.json - Main assessment report in JSON format
* DetailedReport.docx - Comprehensive analysis report in Word format
* DetailedReport.html - HTML version of detailed report (deprecated since Spark Conversion Core V2.43.0)
* files.csv - List of processed files
* GenericScanner

  + GenericScannerOutput
    \* files.pam - Scanner output file for file analysis
    \* FilesInventory.csv - Complete inventory of scanned files
    \* KeywordCounts.csv - Statistics of keyword usage
    \* line_counts.pam - Line count analysis data
    \* tool_execution.pam - Tool execution logs
    \* word_counts.pam - Word frequency analysis
* ImportUsagesInventory.csv - Inventory of import statements
* InputFilesInventory.csv - List of input files processed
* IOFilesInventory.csv - Input/Output operations inventory
* Issues.csv - List of identified issues and problems
* JoinsInventory.csv - Analysis of join operations
* NotebookCellsInventory.csv - Inventory of notebook cells
* NotebookSizeInventory.csv - Size analysis of notebooks
* PandasUsagesInventory.csv - List of Pandas library usage
* SparkUsagesInventory.csv - Inventory of Spark operations
* SqlStatementsInventory.csv - List of SQL statements
* SummaryReport.docx - Brief overview report (deprecated since Spark Conversion Core V2.43.0)
* SummaryReport.html - HTML version of summary (deprecated since Spark Conversion Core V2.43.0)
* ThirdPartyUsagesInventory.csv - Inventory of third-party library usage
* tool_execution.csv - Execution logs and statistics

Let’s examine each category in detail.

---
title: Snowpark Migration Accelerator:  Pre-Processing Considerations
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/before-using-the-sma/pre-processing-considerations.md
section: Migrations
---

# Snowpark Migration Accelerator: Pre-Processing Considerations

When preparing source code for analysis with the Snowpark Migration Accelerator (SMA), please note that the tool can only process code located in the input directory. Before running SMA, ensure all relevant source files are placed in this directory.

## Size

The SMA tool analyzes source code and text files, not data files. When scanning large codebases or numerous files, the tool may experience memory limitations on your local machine. For example, if you include exported code from all dependent libraries as input files, the analysis will take significantly longer. Keep in mind that SMA will only identify Spark-specific code references, regardless of how much code you include in the scan.

We recommend collecting all code files that:

* Are executed regularly as part of an automated process
* Were used to create the process (if separate from regular execution)
* Are custom libraries developed by your organization that are referenced by either the process or its creation scripts

You do not need to include code that creates established third-party libraries (such as Pandas, Scikit-Learn, or others). The tool automatically catalogs these references without requiring their defining code.

## It should work

The Snowpark Migration Accelerator (SMA) requires complete and valid source code to function properly. It cannot process incomplete code fragments or snippets that don’t execute independently in Scala or Python. If you encounter numerous parsing errors while running SMA, it likely means the source code is incomplete or contains syntax errors. To ensure successful analysis, make sure your input directory contains only working, syntactically correct code from your source platform.

## Use Case

Understanding the SMA output goes beyond the tool itself. While SMA analyzes your codebase, it’s important to understand your specific use case to identify potential migration challenges. For example, if you have a notebook that uses SQL and a database connector without any Spark references, SMA will only report the third-party libraries used in that notebook. This information is useful, but the tool won’t provide a readiness score for such files. Having context about your application helps you interpret these findings more effectively.

## Code from Databricks Notebooks

Databricks notebooks allow you to write code in multiple programming languages (SQL, Scala, and PySpark) within the same notebook. When you export a notebook, the file extension will match the primary language category (.ipynb or .py for Python notebooks, .sql for SQL notebooks). Any code written in a different language than the notebook’s primary language will be automatically commented out during export. For example, if you write SQL code in a Python notebook, that SQL code will be commented out when you export the notebook.

Comments containing code are not analyzed by the SMA tool. If you want the code within comments to be analyzed, you must first preprocess it to expose the code in a file format that the tool can recognize.

When working with notebooks, SMA can analyze and recognize code written in languages different from the notebook’s file extension. For example, if you have SQL code in a Jupyter notebook (.ipynb file), SMA will detect and process it even if the code is not commented.

For non-notebook files, make sure your code is saved with the correct file extension that matches the source language (for example, save Python code with a .py extension). This ensures the code can be properly analyzed.

---
title: Snowpark Migration Accelerator:  Project Overview
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/project-overview/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Project Overview

The Snowpark Migration Accelerator (SMA) helps developers analyze and convert existing Spark code to Snowpark code. This tool simplifies the process of understanding your codebase and automatically translates Spark API references to their Snowpark API equivalents.

How Does SMA Work?

This section explains the core functionality and processes. You will learn about:

* [Project Creation and Setup](project-setup.md)
* [Configuration and Settings](configuration-and-settings.md)
* [Running the Tool](tool-execution.md)

Let’s define two important concepts you’ll encounter when using this tool:

1. Project: This represents a single execution or run of the tool. Each time you use the tool, it creates a new project.
2. Readiness Score: This is the main metric used to evaluate your results. It indicates how prepared your code is for migration.

## What is a SnowConvert Project?

To use this accelerator, you first need to create a project. A project links your tool executions with your configuration settings. When you create a project, the tool generates a .snowct file in your source code directory. This file stores all your project information on your local machine, including:

* The source platform you selected
* Your conversion settings
* The project status

## What is the Readiness Score?

The readiness score measures how well your Spark API code can be mapped to equivalent Snowpark API functions. While a high score indicates good compatibility between Spark and Snowpark elements, it does not guarantee that your entire codebase will run successfully in Snowflake. The readiness score serves as an initial assessment tool, but you should consider additional factors when evaluating if your application is suitable for Snowpark migration.

For additional technical terms and definitions, please refer to our [glossary](../../support/glossary.md).

Let’s begin with the project setup…

---
title: Snowpark Migration Accelerator:  Project Setup
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/project-overview/project-setup.md
section: Migrations
---

# Snowpark Migration Accelerator: Project Setup

This section explains how to create and manage projects with the Snowpark Migration Accelerator (SMA) tool.

## Project Page

When you first open the Snowpark Migration Accelerator (SMA), you will see the project page. On this page, you will find the following two options:

* **Open project** - Browse and select a previously created project file. For more details about opening an existing project, see more information on opening an existing project.
* **New project** - Select this option to create a new project. Learn more about creating a new project.

## Creating a New Project

Clicking the “New Project” button will open the project creation screen.

The Project Creation page contains multiple fields that you need to complete.

1. **Project Name**: Enter a name for your project. This name will be used to store your settings and track multiple executions. More details about project files are provided below.
2. **Email Address**: Enter your email address to identify yourself as a tool user. This should be the user of the tool, not the owner of the codebase being scanned.
3. **Company Name**: Enter the name of the organization whose code you are working with. If you are analyzing your own code, enter your organization’s name. If you are working with another organization’s code, enter their name. This helps organize projects by organization.
4. **Input Folder**: Select the folder containing your source code. Note that SMA will only analyze [supported file types](../before-using-the-sma/supported-filetypes.md).
5. **Output Folder Path**: Select the directory where SMA will store output files, including logs, reports, and converted code.
6. **SQL Language**: Select either SparkSQL, HiveSQL or Databricks based on your source code. (Optional)

All fields must be completed to run the tool.

After completing your project setup, you have two options:

* Click “Save” to save your project and continue to the next step
* Click “Cancel” to exit without saving

Clicking “Cancel” returns you to the main screen. If you choose “Save” your project settings will be saved in a `.snowct` file. This file allows you to reopen the project later with all your configured settings intact.

## Notes on the SMA Project File (.snowct)

The `.snowct` file is a project configuration file that stores your project settings and assessment history. This file allows you to:

* Rerun the tool using the same configuration settings
* Access and review assessment data from previous runs

Each time you click “Save”, SMA creates a project file (with a `.snowct` extension) in the root of your selected output folder.

As a user, you will have the following capabilities:

* Double-click the `.snowct` file to open an existing project
* Click “Open Project” on the main screen to open an existing project

## Open a Project

From the main screen, click **Open Project** to launch your file browser. Select a project file with a `.snowct` extension to open the project home page. This action works regardless of your project’s completion status.

The following section explains the available configuration options and settings when using the application.

---
title: Snowpark Migration Accelerator:  SC Spark Python Release Notes
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/old-version-release-notes/sc-spark-python-release-notes/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SC Spark Python Release Notes

## 2.14.0

2023-10-24 AddedAdd condensed ID for filenames and use it in the log.

Changed

Refactor output folder hierarchy of the TrialMode.

Generate Reports locally in Assessment mode when the score hits 90 or higher.

Generate Reports locally in Assessment mode when it’s a Snowflake user.

Create inventories as .csv files.

Move inventories to the Reports folder.

##

## 2.13.0

2023-10-19

Added

* Add a flag to enable more logging messages.
* Add a flag to disable the execution of the conversion.
* Add a timeout mechanism for Scala symbol table resolution.
* Add a timeout mechanism for Scala parsing phase.
* Add progress log messages in parsing phase for Scala.

Changed

* Adjustments to reports (HTML and docx): renaming readiness score and updating appendix and imports call table.
* Bump `AssessmentMode` from 8.1.6 to 9.0.4
* Bump `Common.AssessmentModel` from 3.1.12 to 3.1.14
* Add lock to avoid race condition

Fixed

* Fix an inconsistent number of SparkReferences between assessment and conversion modes.
* Fix issue causing .sql files to not be recognized as supported files.
* Fix parsing error when a backslash is between AtomElement and BracedSlices.
* Fix issue when parsing code with a big quantity of nested expressions took a lot of time.

## 2.12.0

2023-10-13

Added

* Add Trial Mode support.

Changed

* Bump `Snowflake.SnowConvert.Python` from 1.1.79 to 1.1.80
* Add a variant of ResolveType to avoid stack overflow at some scenarios.

Fixed

* Fix scenario when resolving a FullName causes stack overflow.

## 2.11.0

Added

* Add support for Snowpark API version 1.7.0 on Python.
* Add support for Snowpark API version 1.6.1 on Python.
* A new workaround added
* Four (4) new mappings added

Changed

* Update Scala integration test validations.
* Reduce Scala integration tests time.
* Update the remaining assembly name references in the internal code.
* Update source file headers to match company guidelines.

Fixed

* Fix multiple executions with same ExecutionId by adding SessionId and ExecutionId to inventories and reports.
* Fix failing CopyOtherFiles task with storage.lck file.
* Fix issue generating .HTML reports when some values are null.

## 2.09.0

2023-10-03

Added

* Add FilesInventory.pam
* Four (4) new mappings added

Changed

* Change assembly names.
* Bump `Snowflake.SnowConvert.Python` from 1.1.70 to 1.1.79
* Add a backslash in three different rules to solve parsing errors.
* Add a new spark reference symbol.
* Support two (2) new resolutions.
* Support empty commands in .sql DBX notebooks.
* Improve robustness in the StopIfDedent function.

Fixed

* Fix a parsing error in a backslash scenario with param and commas.
* Fix expression between parentheses symbol resolution issue.
* Fix parsing error with empty command in .sql DBX notebooks.
* Fix empty brackets symbol resolution issue.
* Fix Regex timeout error when collecting the SQL statements inventory.
* Fix parsing error related to mixed indentation.
* Fix false crash message when a parsing error was found.
* Fix an inconsistent number of SparkReferences between assessment and conversion modes.

## 2.8.0

2023-09-27

Added

* Add support for Snowpark API version 1.5.1 on Python.
* Add support for Python 3.10.10 syntax.
* Add CellId column in the inventories (for both notebooks, Databricks and Jupyter).
* Add four (4) new mappings

Changed

* Bump `Mobilize.Python` from 1.1.64 to 1.1.70
* Add support for Python 3.10.10 syntax.
* Add three (3) new backslash scenarios to solved a parsing error.
* Add an explicit return type to some Pandas symbols to avoid a loading error.

Fixed

* Fix a parsing error when a backslash in a square bracket, colon and param scenarios.
* Fix error loading Pandas symbols.

## 2.7.0

2023-09-20

Added

* Add support for Snowpark API version 1.5.0 on Python.
* 3 new mappings added

Changed

* Avoid processing hidden files
* Bump `Mobilize.SparkCommon.Utils` from 1.3.188 to 1.3.189
* Bump `Mobilize.Common.Utils` from 3.2.0 to 3.2.2

Fixed

* Fix PackageVersionInventory collection phase getting stuck.
* Fix incorrect percentage in Spark Usage Summary table in the detailed report when using DBC files.
* Fix File Sizing table in the detailed report shown empty or not shown at all.

## 2.6.0

2023-09-12

Added

* Add support of %SQL cells (from notebooks) to the SQL statements inventory.

Changed

* Bump `Mobilize.Python` from 1.1.62 to 1.1.64
* Adds support to magic sql.
* Avoid updating function parameter type when inferred type is `None`.

Fixed

* Fix issue causing infinite loading of symbols for specific files.
* Fix issue of GenericScanner files not being generated.

Security

* Secure test passwords in Python transformation tests.

##

## 2.5.0

2023-09-05

Added

* Add Notebook Sizing inventory.
* Add Snowflake.SparkCommon.MappingLoader project (uses the new Snowflake.SnowMapGrammar).

Changed

* Bump Mobilize.Python from 1.1.59 to 1.1.62

  + Add a timeout mechanism at Python symbol resolution for GetSymbol methods.
* Bump Mobilize.SparkCommon.Utils from 1.3.186 to 1.3.187

  + Update Mobilize.SparkCommon.Utils.FilesHelper.CopyFilesRecursively method to handle hidden files.

Fixed

* Fix the issue of not receiving the email after a run (decreasing the log file size by avoiding logging Debug messages by default).

Removed

* Remove Mobilize.SparkCommon.TransformationCore project (used the old Mobilize.MapGrammar).

## 2.4.0

2023-08-28

Added

* Add NotebookCells inventory.
* Collect the argument values of DataFrameReader.option and DataFrameWriter.option for Scala and Python.
* Add 2 new mappings and a better alias type info collection
* Encrypt output files when additional parameters are provided.
* Re-enable SQLStatements inventory.
* Re-enable parallelization for Collectors.

Changed

* Update File Type Summary section of the detailed report (docx and html). (SCT-3867)
* Update for 2 mappings
* Bump Mobilize.SparkCommon.Utils from 1.3.181 to 1.3.186.
* Improve support of sorting CSV files.
* Bump Mobilize.Common.Utils from 3.1.6 to 3.2.0.

  + Improve support of sorting CSV files.
  + Bump Mobilize.Common.Utils from 3.1.6 to 3.2.0.
  + Update NuGet package versions.
* Refactor on Load Mappings Task.
* Refacto on SparkCommon Utils project references.
* Group solution projects.
* Merge Scala integration tests JupyterTest, InventoryTests and TransformationTest.

Fixed

* Fix issue that caused the Python conversion tool to get stuck when collecting the SQL statements inventory items.
* Fix missing GenericScanner files in the output.
* Fix issue of migrated DBC files that were not loading in Databricks.
* Fix error at the end of the tool process.

Removed

* Remove InventoryStorageTemp.
* Remove redundant StyleCop.Analyzers project references.

## 2.2.001

2023-07-19

Added

* Adding six (6) new mappings

Changed

* Assessment Model update from 3.1.10 to 3.1.11

Fixed

* Fix Databricks processing not working in Assessment mode

Security

* Added subresource integrity to HTML links

## 2.1.161

2023-07-06

Fixed

* Fixing and enabling Scala Spark functional tests

## 2.1.160

2023-07-05

Changed

* Assessment Model update from 3.1.9 to 3.1.10

## 2.1.159

2023-07-05

Changed

* Assessment Model update from 3.1.7 to 3.1.9

## 2.1.158

2023-07-05

Added

* Added tool stability by improving the handling of the exceptions in tasks

## 2.1.157

2023-07-05

Changed

* Spark Common update from 1.3.178 to 1.3.181

## 2.1.155

2023-07-05

Changed

* Common Build update from 2.0.2 to 3.0.4
* Improvements building the solution in MacOs

## 2.1.148

2023-07-04

Changed

* Spark Common update from 1.3.177 to 1.3.178
* Common Utils update from 4.0.0-alpha.DevOps.9 to 3.1.6

## 2.1.147

2023-07-03

Security

* Remove non-licensed package references in `Spark Common` projects.

## 2.1.146

2023-07-03

Changed

* Bump `coverlet.collector` from 3.2.0 to 6.0.0
* Bump `FluentAssertions` from 6.9.0 to 6.11.0
* Bump `Scriban.Signed` from 5.5.2 to 5.7.0
* Bump `DocumentFormat.OpenXml` from 2.19.0 to 2.20.0

Security

* Remove non-licensed package references in `SparkCommon` projects.

## 2.1.145

2023-06-28

Changed

* `Mobilize.Python` update from 1.1.49 to 1.1.50
* Fix Databricks notebook whole file parsing issue when not parsing single cell

## 2.1.144

2023-06-27

Fixed

* Fix .dbc file extraction on MacOS

## 2.1.143

2023-06-26

Fixed

* Fix tests errors because of different data formats.

## 2.1.142

2023-06-26

Changed

* Refactor inventory storage.

## 2.1.141

2023-06-23

Changed

* `Mobilize.Python` update from 1.1.46 to 1.1.49
* Detecting and stopping recursive cycles while resolving a symbol
* Fix StackOverflow exception involving \_\\*init\\*\_.py files
* Fix PyArgExpr node with backslash

## 2.1.140

2023-06-22

Changed

* `Mobilize.Python` update from 1.1.44 to 1.1.46
* Fix PyTerm node with backslash

## 2.1.138

2023-06-22

Changed

* Spark Common update from 1.3.176 to 1.3.177

Fixed

* Fix building Scala code processor.

## 2.1.137

2023-06-22

Security

* Secure credentials in functional tests.
* Remove non-licensed package references.

## 2.1.136

2023-06-21

Changed

* `Snowflake.Data` update from 2.0.15 to 2.0.25
* Spark Common update from 1.3.175 to 1.3.176

Security

Upgrading references in the functional tests.

## 2.1.135

2023-06-21

Added

* Add .dbc extension as supported by Python and Scala code processor tools.
* Add tests for the Contracts project.

Security

* Remove non-licensed package references in `SparkCommon.Contracts.Test`.

## 2.1.132

2023-06-21

Removed

* Remove the `Supported` column from IOFiles inventory in assessment mode.

## 2.1.131

2023-06-20

Fixed

* Fix tests on Mac.

## 2.1.130

2023-06-19

Changed

* Merge SparkCommon repo with this repo.

## 2.1.126

2023-06-16

Fixed

* Fix building the repo.

## 2.1.124

2023-06-15

Fixed

* Fix building the repo.

## 2.1.123

2023-06-15

Changed

* `Mobilize.Scala` update from 0.2.34 to 0.2.37
* Fix parsing error involving generic type with underscore and restriction
* Fix parsing error involving expressions with quote marks and interpolation

Security

* Remove of unsecure package references.

## 2.1.121

2023-06-15

Security

* Remove credential files.

## 2.1.120

2023-06-15

Changed

* Minor change in the version configuration for both Scala and Python.

## 1.0.877

April 26th, 2023

Python 1.1.25

PythonSnowConvert Core 2.01.090

SparkCommon 1.3.151

Added

* Added support for Snowpark 1.3.0

  + Four new mappings
  + EWI [SPRKPY1048](../../../../issue-analysis/issue-codes-by-source/python/README.md) was deprecated
* Added transformations for

  + DataFrameReader chain
  + SparkSession.sparkContext
* Added Severity column to the Issues Summary table of the detailed report

Improvements

* Improved name of the Spark usages inventory file
* Improved readiness score displayed value when no Spark references were found

Fixed

* Fixed button URLs
* Fixed inconsistencies of the Spark usages inventory locally and in telemetry
* Fixed RDD metrics in the Spark Usage Summary table of the detailed report
* Fixed inconsistencies with zero and dash symbols in the reports

## 1.0.826

March 29th, 2023

Python 1.1.25

PythonSnowConvert Core 2.01.068

SparkCommon 1.3.131

Added

* Added support for convert DBC files

  Improvements
* Added transformation for DataFrameReader.format and DataFrameReader.load

Fixed

* Fixed SnowConvert/Snowpark version values transposed

## 1.0.725

February 15th, 2023

Python 1.1.11

PythonSnowConvert Core 2.01.022

SparkCommon 1.3.113

Added

* Added support for Databricks archive files (.dbc extension)
* Added support for Databricks notebook files (.python extension)
* Added parallelism to the Spark usages identification process
* Added support for SnowPark API version 1.1.0
* Added mapping elements:
* twelve direct mappings
* two conversions using helper

Improvements

* Improved SPRKPY1038 EWI message
* Improved registration of EWIs in conversion for columns using attribute access
* Improved local report names

## 1.0.691

February 1st, 2023

Python 1.1.3

PythonSnowConvert Core 2.1.4

SparkCommon 1.3.105

Added

* Added Net6 compatibility (internal)
* Added issues.csv report
* Added sizing table to the detailed report
* Added support for global variable declaration
* Added support for inherited symbol identification
* Added support for accessing columns using attribute access
* Added in telemetry the version of the mapping that was used
* Added support for Jupyter Notebooks in GenericScanner
* Added mapping elements:

  + one direct mapping
  + one conversion using helper
  + six workarounds
  + five not supported identification

Improvements

* Improved tool version format in reports, inventories and telemetry
* Improved syncing of local and remote HTML reports
* Improved HTML detailed report sync with DOCX detailed report
* Improved issues table grouping by EWI code
* Improved import table grouping by package
* Improved commented output code
* Improved UI progress phase titles

Bug Fixes

* Fixed location of EWI messages for complex statements
* Fixed UI wording when cancelling the execution
* Fixed typos on reports

## 1.0.594

December 28th, 2022

Python 1.0.457

PythonSnowConvert Core 2.0.280

Added

* Added support for Jupyter Notebooks in Generic Scanner
* Added conversion percentage in the reports
* Added ‘ElementPackage’ column to the import usages inventory
* Added one direct mapping
* Added four helpers
* Added two workarounds
* Added minor visual improvements to the detailed report

Improvements

* Improved one mapping from rename to direct
* Improved sorting of issues table in the detailed report

Bugs

* Fixed columns size of the issue table in the detailed report
* Fixed an error when adding EWI comment for Column.contains function usage
* Fixed six mapping statuses that didn’t match in the Spark usages inventory

## 1.0.555

December 21st, 2022

Python 1.0.457

PythonSnowConvert Core 2.0.259

New Features

* Added three new workarounds
* Added margin of error in the Detailed Report description

Improvements

* Improved two mapping from rename to direct
* Improved sorting of issues table in the detailed report
* Improved displaying of percentages in the detailed report
* Conversion stage logging messages improved

Bugs

* Fixed two mappings
* Fixed identification of a not supported element

## 1.0.515

December 14th, 2022

Python 1.0.457

PythonSnowConvert Core 2.0.241

New Features

* Support for ‘snowpark_extensions’
* Twelve conversions using the ‘snowpark_extensions’
* Two workarounds added
* A new spark reference added to the table reference database, including its status.
* Customer info added to the detailed report

Improvements

* EWI SPRKPY1038 wording improvement
* A spark reference status improved from *rename* to *direct*

Bug Fixes

* A bug in a mapping fixed
* A broken Spark Core Mapping table fixed

## 1.0.492

December 07th, 2022

Python 1.0.455

PythonSnowConvert Core 2.0.233

New Features

* Addd margin of error in the readiness score
* Added two new mappings
* Added EWI for PySpark elements that were not recognized

Improvements

* Improved appendix A wording in the detailed report
* Improved EWI message for PySpark elements that are not defined in the tool’s conversion database

Bug Fixes

* Fixed ‘alias’ column name in the inventory

## 1.0.457

December 01st, 2022

Python 1.0.452

Python SnowConvert Core 2.0.217

New Features

* Added support to SnowPark API version 1.0.0
* Added five new workarounds documentation
* Added execution info to telemetry
* Added margin of error to the readiness score

Improvements

* Improved accuracy in code symbols identification
* Improvement in the assessment step when logging messages.

## 1.0.441

November 23rd, 2022

Python 1.0.449

PythonSnowConvert Core 2.0.210

New Features

* Added EWI comments to the output code for not defined PySpark elements
* Added support for inherited symbols
* Three new mappings added
* One workaround added

Improvements

* Improved readiness score when all the files have errors
* Improved error message when loading the symbol table
* Improved handling of generic types
* One mapping status changed from rename to direct
* One conversion status changed from workaround to direct mapping

Bug Fixes

* Fixed markdown conversion issue
* Fixed syncing issues between PySpark_Mappings_Core table and the tool

## 1.0.425

November 17th, 2022

Python 1.0.445

PythonSnowConvert Core 2.0.203

Improvements

* Robustness at the loading symbol table

Bug Fixes

* Fixed detailed report summary table for spark usage values
* Fixed some parsing errors
* Fixed EWI code sync issues between the tool and PySpark_Mappings_Core Snowflake DB table and

## 1.0.415

November 15th, 2022

Python 1.0.441

PythonSnowConvert Core 2.0.199

New Features

* Added EWI record when an error is detected at loading the symbol table

Bug fixes

* Fixed new lines issue when converting Jupyter notebook files

## 1.0.404

November 11th, 2022

Python 1.0.436

PythonSnowConvert Core 2.0.195

New Features

* Added basic support to convert Jupyter notebook files
* Added a value for tracking import usages as an inventory
* Improve the detailed report (Spark usages grouped by support category and Python Import Call Summary)
* New mappings added
* New workarounds added for ‘SparkSession.Builder.appName’
* New EWIs added as comments in the output code
* Added support to copy non-Python files to the output directory
* Added PySpark usages identification for id expressions
* Added an error message when symbol table loading fails

Improvements

* Improved imports mapping
* Improved type hints mapping
* Improved rename mappings to direct mappings

Bug Fixes

* Parsing errors
* The output directory structure for files with parsing errors
* Fixed ‘pyspark.streaming’ full names
* Fixed CLI crashing

## 1.0.315

October 21st, 2022

Python 1.0.422

PythonSnowConvert Core 2.0.152

Added

* Added type inference
* 5 New mappings supported

Improvements

* Detailed report
* Import Statement conversion
* Transformation documentation

Fixed

* EWIs related to a Project ID logging
* 4 Pyspark elements conversion status

## 1.0.280

October 12th, 2022

Python 1.0.417

PythonSnowConvert Core 2.0.135

Added

* New transformations
* Handling unsupported Pyspark elements used in imports
* Improvements in logging message

## 1.0.271

October 05th, 2022

Python 1.0.417

PythonSnowConvert Core 2.0.132

Added

* Robustness to symbol identification
* Improving in type resolution

Fixed

* Settings button is not refreshing with license change
* Documentation link in Python version reference

## 1.0.247

September 27th, 2022

Python 1.0.410

PythonSnowConvert Core 2.0.126

Added

* Robustness when parsing Jupypter Notebook files
* Improvements in resolving symbols with Generics
* New transformations

Fixed

* Total Python files in the report

## 1.0.220

September 15th, 2022

Python 1.0.399

PythonSnowConvert Core 2.0.112

Added

* New support for imports
* Alias name in inventories for the imports

Fixed

* Wrong line number in the inventory for macOS files
* Identified usages table percentages in the html report
* Qualification tool showing zero PySpark references
* Update contact information in the email template

## 1.0.190

September 06th, 2022

Python 1.0.392

PythonSnowConvert Core 2.0.100

Added

* ‘SnowConvert Version’ and ‘Snowpark version’ columns to SparkUsagesInventory
* More functions from pyspark supported
* Improvements to speed analysis

Fixed

* Direct mapping updating

## 1.0.148

August 31st, 2022

Python 1.0.381

PythonSnowConvert Core 2.0.71

Added

* 10 new mappings supported
* 17 new workaround conversions detected
* Support for identification of PySpark usages in Jupyter notebook files
* Automated and Status columns added to SparkReferenceInventory.csv
* Summary and detailed html report uploading to snowflake

Fixed

* Summary and detailed report wordings fixes
* Email template wording fixes

## 1.0.107

August 24th, 2022

Python 1.0.380

**PythonSnowConvert Core 2.0.30**

Added

* 30 new mappings supported
* Identification of pyspark.streaming and pyspark.rdd packages
* Improvements in identifying imported symbols
* Email template update
* Adding “Version information” section to Summary Report
* Adding “Resources” section to Detailed Report
* Final screen UI changes
* Sort SparkReferenceInventory report file

Fixed

* Settings button removed
* Detailed report logos update
* Percentage values precision on summary and detailed assessment reports

## 1.0.66

August 17th, 2022

Python 1.0.377

PythonSnowConvert Core 1.0.61

Added

* 136 new mappings supported
* Supported status updated for all functions listed as “Corrected” in the shared spreadsheet
* Information collected from the requirements.txt file
* Improvements in identifying chained symbols

Fixed

* Line number in SparkReferenceInventory report

## 1.0.30

August 9th, 2022

Python 1.0.373

PythonSnowConvert Core 1.0.29

Added

* Collect all the import usages
* Improvements identifying PySpark usages (import without module, import with star)
* Identifying more DataFrame functions as supported

Fixed

Logging parsing errors

## 0.1.172

July 20th, 2022

Python 0.1.172

Added

* Command line interface.
* Python code Qualification tool feature.

---
title: Snowpark Migration Accelerator:  SC Spark Scala Release Notes
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/old-version-release-notes/sc-spark-scala-release-notes/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SC Spark Scala Release Notes

## 2.14.0

2023-10-24 AddedAdd condensed ID for filenames and use it in the log.

Changed

Refactor output folder hierarchy of the TrialMode.

Generate Reports locally in Assessment mode when the score hits 90 or higher.

Generate Reports locally in Assessment mode when it’s a Snowflake user.

Create inventories as .csv files.

Move inventories to the Reports folder.

## 2.13.0

2023-10-19

Added

* Add a flag to enable more logging messages.
* Add a flag to disable the execution of the conversion.
* Add a timeout mechanism for Scala symbol table resolution.
* Add a timeout mechanism for Scala parsing phase.
* Add progress log messages in parsing phase for Scala.

Changed

* Adjustments to reports (HTML and docx): renaming readiness score and updating appendix and imports call table.
* Bump `AssessmentMode` from 8.1.6 to 9.0.4
* Bump `Common.AssessmentModel` from 3.1.12 to 3.1.14
* Add lock to avoid race condition

Fixed

* Fix an inconsistent number of SparkReferences between assessment and conversion modes.
* Fix issue causing .sql files to not be recognized as supported files.
* Fix parsing error when a backslash is between AtomElement and BracedSlices.
* Fix issue when parsing code with a big quantity of nested expressions took a lot of time.

## 2.12.0

2023-10-13

Added

* Add Trial Mode support.

Changed

* Bump `Snowflake.SnowConvert.Python` from 1.1.79 to 1.1.80
* Add a variant of ResolveType to avoid stack overflow at some scenarios.

Fixed

* Fix scenario when resolving a FullName causes stack overflow.

## 2.11.0

Added

* Add support for Snowpark API version 1.7.0 on Python.
* Add support for Snowpark API version 1.6.1 on Python.
* A new workaround added
* Four (4) new mappings added

Changed

* Update Scala integration test validations.
* Reduce Scala integration tests time.
* Update the remaining assembly name references in the internal code.
* Update source file headers to match company guidelines.

Fixed

* Fix multiple executions with same ExecutionId by adding SessionId and ExecutionId to inventories and reports.
* Fix failing CopyOtherFiles task with storage.lck file.
* Fix issue generating .HTML reports when some values are null.

## 2.09.0

2023-10-03

Added

* Add FilesInventory.pam
* Four (4) new mappings added

Changed

* Change assembly names.
* Bump `Snowflake.SnowConvert.Python` from 1.1.70 to 1.1.79
* Add a backslash in three different rules to solve parsing errors.
* Add a new spark reference symbol.
* Support two (2) new resolutions.
* Support empty commands in .sql DBX notebooks.
* Improve robustness in the StopIfDedent function.

Fixed

* Fix a parsing error in a backslash scenario with param and commas.
* Fix expression between parentheses symbol resolution issue.
* Fix parsing error with empty command in .sql DBX notebooks.
* Fix empty brackets symbol resolution issue.
* Fix Regex timeout error when collecting the SQL statements inventory.
* Fix parsing error related to mixed indentation.
* Fix false crash message when a parsing error was found.
* Fix an inconsistent number of SparkReferences between assessment and conversion modes.

## 2.8.0

2023-09-27

Added

* Add support for Snowpark API version 1.5.1 on Python.
* Add support for Python 3.10.10 syntax.
* Add CellId column in the inventories (for both notebooks, Databricks and Jupyter).
* Add four (4) new mappings

Changed

* Bump `Mobilize.Python` from 1.1.64 to 1.1.70
* Add support for Python 3.10.10 syntax.
* Add three (3) new backslash scenarios to solved a parsing error.
* Add an explicit return type to some Pandas symbols to avoid a loading error.

Fixed

* Fix a parsing error when a backslash in a square bracket, colon and param scenarios.
* Fix error loading Pandas symbols.

## 2.7.0

2023-09-20

Added

* Add support for Snowpark API version 1.5.0 on Python.
* 3 new mappings added

Changed

* Avoid processing hidden files
* Bump `Mobilize.SparkCommon.Utils` from 1.3.188 to 1.3.189
* Bump `Mobilize.Common.Utils` from 3.2.0 to 3.2.2

Fixed

* Fix PackageVersionInventory collection phase getting stuck.
* Fix incorrect percentage in Spark Usage Summary table in the detailed report when using DBC files.
* Fix File Sizing table in the detailed report shown empty or not shown at all.

## 2.6.0

2023-09-12

Added

* Add support of %SQL cells (from notebooks) to the SQL statements inventory.

Changed

* Bump `Mobilize.Python` from 1.1.62 to 1.1.64
* Adds support to magic sql.
* Avoid updating function parameter type when inferred type is `None`.

Fixed

* Fix issue causing infinite loading of symbols for specific files.
* Fix issue of GenericScanner files not being generated.

Security

* Secure test passwords in Python transformation tests.

## 2.5.0

2023-09-05

Added

* Add Notebook Sizing inventory. (SCT-3876)
* Add Snowflake.SparkCommon.MappingLoader project (uses the new Snowflake.SnowMapGrammar). (SCT-4281)

Changed

* Bump Mobilize.Python from 1.1.59 to 1.1.62

  + Add a timeout mechanism at Python symbol resolution for GetSymbol methods.
* Bump Mobilize.SparkCommon.Utils from 1.3.186 to 1.3.187

  + Update Mobilize.SparkCommon.Utils.FilesHelper.CopyFilesRecursively method to handle hidden files.

Fixed

* Fix the issue of not receiving the email after a run (decreasing the log file size by avoiding logging Debug messages by default). (SCT-5320)

Removed

* Remove Mobilize.SparkCommon.TransformationCore project (used the old Mobilize.MapGrammar).

## 2.4.0

2023-08-28

Added

* Add NotebookCells inventory.
* Collect the argument values of DataFrameReader.option and DataFrameWriter.option for Scala and Python.
* Add 2 new mappings and a better alias type info collection
* Encrypt output files when additional parameters are provided.
* Re-enable SQLStatements inventory.
* Re-enable parallelization for Collectors.

Changed

* Update File Type Summary section of the detailed report (docx and html). (SCT-3867)
* Update for 2 mappings
* Bump Mobilize.SparkCommon.Utils from 1.3.181 to 1.3.186.
* Improve support of sorting CSV files.
* Bump Mobilize.Common.Utils from 3.1.6 to 3.2.0.

  + Improve support of sorting CSV files.
  + Bump Mobilize.Common.Utils from 3.1.6 to 3.2.0.
  + Update NuGet package versions.
* Refactor on Load Mappings Task.
* Refacto on SparkCommon Utils project references.
* Group solution projects.
* Merge Scala integration tests JupyterTest, InventoryTests and TransformationTest.

Fixed

* Fix issue that caused the Python conversion tool to get stuck when collecting the SQL statements inventory items.
* Fix missing GenericScanner files in the output.
* Fix issue of migrated DBC files that were not loading in Databricks.
* Fix error at the end of the tool process.

Removed

* Remove InventoryStorageTemp.
* Remove redundant StyleCop.Analyzers project references.

## 2.2.001

2023-07-19

Added

* Adding six (6) new mappings

Changed

* Assessment Model update from 3.1.10 to 3.1.11

Fixed

* Fix Databricks processing not working in Assessment mode

Security

* Added subresource integrity to HTML links

## 2.1.161

2023-07-06

Fixed

* Fixing and enabling Scala Spark functional tests

## 2.1.160

2023-07-05

Changed

* Assessment Model update from 3.1.9 to 3.1.10

## 2.1.159

2023-07-05

Changed

* Assessment Model update from 3.1.7 to 3.1.9

## 2.1.158

2023-07-05

Added

* Added tool stability by improving the handling of the exceptions in tasks

## 2.1.157

2023-07-05

Changed

* Spark Common update from 1.3.178 to 1.3.181

## 2.1.155

2023-07-05

Changed

* Common Build update from 2.0.2 to 3.0.4
* Improvements building the solution in MacOs

## 2.1.148

2023-07-04

Changed

* Spark Common update from 1.3.177 to 1.3.178
* Common Utils update from 4.0.0-alpha.DevOps.9 to 3.1.6

## 2.1.147

2023-07-03

Security

* Remove non-licensed package references in `Spark Common` projects.

## 2.1.146

2023-07-03

Changed

* Bump `coverlet.collector` from 3.2.0 to 6.0.0
* Bump `FluentAssertions` from 6.9.0 to 6.11.0
* Bump `Scriban.Signed` from 5.5.2 to 5.7.0
* Bump `DocumentFormat.OpenXml` from 2.19.0 to 2.20.0

Security

* Remove non-licensed package references in `SparkCommon` projects.

## 2.1.145

2023-06-28

Changed

* `Mobilize.Python` update from 1.1.49 to 1.1.50
* Fix Databricks notebook whole file parsing issue when not parsing single cell

## 2.1.144

2023-06-27

Fixed

* Fix .dbc file extraction on MacOS

## 2.1.143

2023-06-26

Fixed

* Fix tests errors because of different data formats.

## 2.1.142

2023-06-26

Changed

* Refactor inventory storage.

## 2.1.141

2023-06-23

Changed

* `Mobilize.Python` update from 1.1.46 to 1.1.49
* Detecting and stopping recursive cycles while resolving a symbol
* Fix StackOverflow exception involving \_\\*init\\*\_.py files
* Fix PyArgExpr node with backslash

## 2.1.140

2023-06-22

Changed

* `Mobilize.Python` update from 1.1.44 to 1.1.46
* Fix PyTerm node with backslash

## 2.1.138

2023-06-22

Changed

* Spark Common update from 1.3.176 to 1.3.177

Fixed

* Fix building Scala code processor.

## 2.1.137

2023-06-22

Security

* Secure credentials in functional tests.
* Remove non-licensed package references.

## 2.1.136

2023-06-21

Changed

* `Snowflake.Data` update from 2.0.15 to 2.0.25
* Spark Common update from 1.3.175 to 1.3.176

Security

Upgrading references in the functional tests.

## 2.1.135

2023-06-21

Added

* Add .dbc extension as supported by Python and Scala code processor tools.
* Add tests for the Contracts project.

Security

* Remove non-licensed package references in `SparkCommon.Contracts.Test`.

## 2.1.132

2023-06-21

Removed

* Remove the `Supported` column from IOFiles inventory in assessment mode.

## 2.1.131

2023-06-20

Fixed

* Fix tests on Mac.

## 2.1.130

2023-06-19

Changed

* Merge SparkCommon repo with this repo.

## 2.1.126

2023-06-16

Fixed

* Fix building the repo.

## 2.1.124

2023-06-15

Fixed

* Fix building the repo.

## 2.1.123

2023-06-15

Changed

* `Mobilize.Scala` update from 0.2.34 to 0.2.37
* Fix parsing error involving generic type with underscore and restriction
* Fix parsing error involving expressions with quote marks and interpolation

Security

* Remove of unsecure package references.

## 2.1.121

2023-06-15

Security

* Remove credential files.

## 2.1.120

2023-06-15

Changed

* Minor change in the version configuration for both Scala and Python.

## 1.0.306

February 14, 2023

Scala 0.2.13

SparkSnowConvert Core 1.1.27

New Features

* Jupyter notebooks (.ipynb) processing
* EWI generation when a dependency couldn’t be added to the project config file

Improvements

* Lambda scopes opening and closing

Bug Fixes

* Bug 680497: The remaning to full qualified for functions is not working fine
* Bug 681704: Unable to generate final report

## 1.0.273

February 2, 2023

Scala 0.2.4

SparkSnowConvert Core 1.1.8.0

Hotfix

* API endpoints update

## 1.0.263

January 31, 2023

Scala 0.2.4

SparkSnowConvert Core 1.1.8.0

Added

* .NET Core 6 Upgrade
* ElementPackage column added to imports inventory
* Sizing table added to assessment reports
* Add conversion percentage in the reports synced with BDS
* Add issues.csv file in the output
* Generate SummaryReport.html and DetailedReport.html (mirror docx html) locally on Reports folder
* Add ConversionStatus keywords to GenericScanner
* Support full name conversion

Improvements

* org.apache.spark.mllib mappings added to the core reference table
* [UI] Fix wording when cancelling the execution
* [UI] Change UI phase titles
* Group issues by EWI code
* Update TOOL_VERSION column value format on Execution info table
* Simplified the Issue summary table so it is not too big

Bug Fixes

* Resolved Issue with backslash
* Resolved BreakLine Issue
* Resolved Lambda blocks corner case
* Remove AssessmentReport.html generation (local html report)

## 1.0.191

December 27, 2022

Scala 0.1.493

SparkSnowConvert Core 1.0.117.0

Added

* Uploading packages inventory to cloud telemetry

Improvements

* Detailed report

  + Minor visual improvements
  + Sorting issue table by:
    \* Instances
    \* Code
    \* Description

## 1.0.166

December 21, 2022

Scala 0.1.492

SparkSnowConvert Core 1.0.105.0

Added

* Added a margin of error description in the detailed report

Improvements

* Improved sorting of issues table in the detailed report
* Improved display of percentages in the detailed report

Bug Fixes

* <#> character is showing issues
* Compose is not recognized as a keyword
* Parser is not working on ‘join’ argument
* Scala code processor throwing critical error

## 1.0.132

December 13, 2022

Scala 0.1.487

SparkSnowConvert Core 1.0.88

Improvements

* Customer information added to the detailed assessment report
* Transformation logging messages

Bug fixes

* An issue with expressions like (a, b) =>val c
* *compose* not being recognized as a keyword

## 1.0.107

December 7, 2022

Scala 0.1.484

SparkSnowConvert Core 1.0.77

Added

* Snowpark mappings update to 1.6.2 version
* Functions without parentheses collection improvements on assessment
* Maven project (pom.xml) file processing
* ClassName column renamed to ‘alias’ on SparkUsagesInventory.pam and ImportUsagesInventory.pam
* Added margin of error to the readiness score

Fixed

* Snowpark Python and Scala posted version update
* Issue with a new line after the name of functions

## 1.0.59

November 29, 2022

Scala 0.1.478

SparkSnowConvert Core 1.0.60

Added

* Basic companion object support
* org.apache.spark.sql.Column mappings update
* org.apache.spark.sql.Expression mappings update
* org.apache.spark.sql.functions mappings update
* Reference extensions dependency from project config file (SBT)
* Reference extensions dependency from project config file (Gradle)

Fixed

* “Script” code is not supported

## 1.0.17

November 23, 2022

Scala 0.1.472

SparkSnowConvert Core 1.0.44

Added

* Spark mappings update
* Trim “FileId” column value on all .pam files
* ConversionStatus and scala_spark_mappings_core.csv unification

## 1.0.1

November 17, 2022

Scala 0.1.472

SparkSnowConvert Core 1.0.37

Added

* SparkSession, DataFrameReader, and DataFrameWriter mappings update
* EWI Generation for unary and binary expressions

Fixed

* Writer replacer supports csv, parquet, json, and options
* Reader replacer is not supporting functions without parentheses
* Writer replacer is not supporting functions without parentheses
* Currently, the transformation of InsertInto is not a valid code.
* Writer replacer is not including all functions.

## 0.1.873

November 11, 2022

Scala 0.1.468

SparkSnowConvert Core 1.0.23

Added:

* Symbol resolution for function calls without parentheses
* Scopes opening/closing exceptions handling(at Replacers)
* EWI generation for not supported imports (complex cases)
* EWI generation for not defined imports
* SparkSession transformation improvements
* DataFrame reader/writer transformation improvements
* “Spark Usages by Support Category”, “Scala Import Call Summary” sections added to Detailed report
* RDD mappings update

Fixed:

* Stack Overflow, output files were not generated
* Expression without parentheses on Spark Session replacer transformation

## 0.1.770

October 21, 2022

Scala 0.1.458

SparkSnowConvert Core 0.1.530

Added:

* Updated helper/extension .jar to latest version
* Updated assessment .docx report template
* Import usages inventory generation
* Generating EWIs for not supported imports (simple case)

Fixed:

* Indeterminism issue on SymblTable
* Error when sorting spark usages inventory files
* SclSingleExprPath must not contain null members
* The collection was modified; the enumeration operation may not execute
* Parsing does not finish when there are multiple closing multi-line in a row
* Issue with expression
* Error FileNotGenerated

## 0.1.705

October 04, 2022

Scala 0.1.442

SparkSnowConvert Core 0.1.499

Fixed:

* The setting button is not refreshing when the license is changed.

## 0.1.702

September 28, 2022

Scala 0.1.442

SparkSnowConvert Core 0.1.498

Added:

* Symbol table built ins loading improvements
* Adding robustness to symbol table loaders

Fixed:

* Error in the total of Scala files in the AssessmentReport
* Symbol resolving for generic functions using an asterisk
* Comments inside comments and id prefix and interpolation parsing error
* The comma after identifier parsing error
* Parsing error of the expression when the first statement is taking the pattern of the second statement
* “and”, “::”,”++” and “or” operators parsing errors

## 0.1.687

September 20, 2022

Scala 0.1.430

SparkSnowConvert Core 0.1.491.0

Added

* Symbol loading/resolving - Add support for generic methods with asterisk params\*\*.\*\*
* Symbol loading/resolving - Add type inference for type defs.
* Symbol loading/resolving general improvements

Fixed

Issue related to the import usages not being stored if there are no Spark references.

## 0.1.677

September 15, 2022

Scala 0.1.427

SparkSnowConvert Core 0.1.486.0

Added

* Cloud telemetry and sending email mechanism now available in Conversion Mode
* Update contact information in the email template

## 0.1.653

September 06, 2022

Scala 0.1.426

SparkSnowConvert Core 0.1.476.0

Added

* ‘SnowConvert Version’ and ‘Snowpark version’ columns to SparkUsagesInventory
* Improvements to speed analysis

## 0.1.624

August 31st, 2022

Scala 0.1.422

SparkSnowConvert Core 0.1.454.0

Added

* Automated and Status columns added to SparkReferenceInventory.csv
* Summary and detailed html report uploading to Snowflake
* Mappings update

Fixed:

* Summary and detailed report wordings fixes
* Email template wording fixes.

## 0.1.579

August 23th, 2022

Scala 0.1.421 Spark

SnowConvert Core 0.1.414

Added

* Email template update
* Adding “Version information” section to Summary report
* Adding “Resources” section to Detailed report
* Final screen UI changes

Fixed

* Report missing spark functions on sparkUsagesInventory.pam
* Detailed report logos update
* Percentage values precision on summary and detailed assessment reports

## 0.1.595

August 17th, 2022

Scala 0.1.421

SparkSnowConvert Core 0.1.396

Added

* Spark read and write transformations improvement
* Session id column to spark usages inventory

## 0.1.479

June 30th, 2022

Scala 0.1.411

SparkSnowConvert Core 0.1.279

Added

* Spark read and write transformations
* Spark trim, rtrim and ltrim function transformations
* String interpolation parsing
* Increasing sql extraction match patterns

## 0.1.447

June 14th, 2022

Scala 0.1.402

SparkSnowConvert Core 0.1.274

Added

* File operations robustness
* Output folders reorganization
* SparkSession builder transformation
* Adding “Scala files with embedded sql” count in assessment reports

Fixed

* Cyclic dependencies issue on Symbol Table
* Empty case clause parsing
* Multiple statements on lambda block parsing
* Case clause pattern parsing

## 0.1.380

June 1st, 2022

Scala 0.1.391

SparkSnowConvert Core 0.1.229

Added

* Parsing robustness
* .sbt configuration files processing
* Issues breakdown section added in assessment html report
* Look and feel improvements in assessment html report
* Using RapidScanner inventories to calculate the spark usages assessment
* macOS CLI & UI support
* Improvements in import statements mappings

## 0.1.7

May 17th, 2022

Scala 0.1.380

Added

* Scala Parser

  + Double exclamation mark support
* Conversion tool

  + Sql extraction
  + object_struct function transformation
  + avg function transformation
  + Snowpark extensions .jar update
  + Lines of code report
  + Import mappings
  + Docx and html assessment reports
  + RapidScan integration
  + Linux OS support

**Fixed**

* Binary expressions special cases parsing

## 0.1.3

March 18th, 2022

Scala 0.1.358

Added

* Scala Parser

  + Support underscore followed by newline when parsing expressions
  + Improve parsing errors handling
* Symbols

  + Improve support of Unresolved Symbols
  + Improve creation of Generic Symbols to reuse existing ones
  + Support Loading and Resolution of Lambda Expressions
* Mappings:

  + Support custom mappings for functions and types via .map files
  + Added custom map directory parameter

**Fixed**

* Fill missing columns at notification .pam file.
* Generate metrics data files (.pam) to specified reports folder

## 0.1.2

March 4th, 2022

Scala 0.1.351

Added

* Updated logos and text in UI and Documentation
* Symbols

  + Support Generic Identifiers on Type Parameters for Generic Symbol
  + Exclusion of not required dependencies
* ScalaParser:

  + Backticks idents
  + ArgAssign expressions

Fixed

* ScalaParser:

  + ExprLambda with ColonType next to ident
  + Try expression when try is not referring a keyword
  + Empty lambda expr with args
  + Underscore (“_”) in TypeArgs
  + Files with all commented out source
  + New lines at SimpleExpr, SingleExpr, TailExpr nodes
* ConversionTool:

  + Fix Crash of conversion due to javap parsing errors (related with jar dependencies)

## 0.1.1

February 14th, 2022

Scala 0.1.333

Features

* Command line interface.
* Scala code assessment feature.
* Consume multiple files or single files with multiple objects.
* Conversion of basic Scala programs as defined by functions and syntax to be mutually agreed during the first 3 development sprints.
* Comments in Scala code are re-inserted inline.
* Insert comments in-line with any errors/warning/reviews.
* Basic reporting including

  + Number of spark elements processed
  + Summary of elements transformed, files and locations of
  + Summary of errors/warnings/reviews encountered.
  + Summary of unsupported Spark APIs
* Demonstrated inclusion of the following defined scenarios:

  + API mappings
  + Recreate project as SnowPark projects
    \* Setup Proper project structure
    \* Update to SnowPark supported Scala version
  + Helper Creation to reduce impedance mismatch
  + Define some pattern rewrite
  + Document guidelines for non-automatable concepts (e.g.: file usage patterns, data source configuration, or spark libraries without a direct equivalent, like Kafka stream reading)
* Greater than 90% successful conversion rate for initial two customer code bases (basis code for the above scenarios) to be provided to Mobilize by Snowflake on the Effective Date.

  + Measured based upon number of compilable objects in Snowflake
  + Objects with unsupported/untranslatable functions not counted
  + Conversion rate for code will be based upon a complete code base containing all dependent objects.
  + Snowflake will provide access to all available private preview features for Mobilize development benefit

---
title: Snowpark Migration Accelerator:  SIT Tagging
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/sit-tagging/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SIT Tagging

## Purpose

During Internal Consumption Tracking, SMA adds JSON-formatted comments to identify each processed element. These comments contain tracking information.

* The version number of the core processor
* The source language being migrated (HiveSQL, SparkSQL, Scala, or Python)
* The tool name (defaults to SMA)

### Statements identified

In the following sections, you will find statements organized by programming language.

---
title: Snowpark Migration Accelerator:  SMA Inventories
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/sma-inventories.md
section: Migrations
---

# Snowpark Migration Accelerator: SMA Inventories

The Snowpark Migration Accelerator (SMA) analyzes your codebase and produces detailed data, which is stored in the Reports folder as spreadsheets (inventories). This data is used to create two types of reports:

1. The [conversion summary](../understanding-the-conversion-summary.md)
2. The [curated reports](curated-reports.md)

Understanding the inventory files may seem daunting at first, but they provide valuable insights into both your source workload and the converted workload. Below, we explain each output file and its columns in detail.

These inventories are also shared through telemetry data collection. For more details, refer to the telemetry section of this documentation.

## Assessment Report Details

The **AssessmentReport.json** file stores data that is displayed in both the Detailed Report and Assessment Summary sections of the application. This file is primarily used to populate these reports and may contain information that is also available in other spreadsheets.

## DBX Elements Inventory

The **DbxElementsInventory.csv** lists the DBX elements found inside notebooks.

* **Element:** The DBX element name.
* **ProjectId:** Name of the project (root directory the tool was run on)
* **FileId:** File where the element was found and the relative path to that file.
* **Count:** The number of times that element shows up in a single line.
* **Category:** The element category.
* **Alias:** The alias of the element (applies just for import elements).
* **Kind:** A category for each element. These could include Function or Magic.
* **Line:** The line number in the source files where the element was found.
* **PackageName:** The name of the package where the element was found.
* **Supported:** Whether this reference is “supported” or not. Values: True/False.
* **Automated:** Whether or not the tool can automatically convert it. Values: True/False.
* **Status:** The categorization of each element. The options are Rename, Direct, Helper, Transformation, WorkAround, NotSupported, NotDefined.
* **Statement:** The code where the element was used. [NOTE: This column is not sent via telemetry.]
* **SessionId:** Unique identifier for each run of the tool.
* **SnowConvertCoreVersion:** The version number for the core code process of the tool.
* **SnowparkVersion:** The version of Snowpark API available for the specified technology and run of the tool.
* **CellId:** If this element was found in a notebook file, the numbered location of the cell where this element was in the file.
* **ExecutionId:** The unique identifier for this execution of the SMA.

## Execution Flow Inventory

The **ExecutionFlowInventory.csv** lists the relations between the different workload scopes, based on the function calls found. This inventory main purpose is to serve as the base for the entry points identification.

* **Caller:** The full name of the scope where the call was found.
* **CallerType:** The type of the scope where the call was found. This can be: Function, Class, or Module.
* **Invoked:** The full name of the element that was called.
* **InvokedType:** The type of the element. This can be: Function or Class.
* **FileId:** The relative path of the file. (Starting from the input folder the user chose in the SMA tool)
* **CellId:** The cell number where the call was found inside a notebook file, if applies.
* **Line:** The line number where the call was found.
* **Column:** The column number where the call was found.
* **ExecutionId:** The execution id.

## Checkpoints Inventory

The **Checkpoints.csv** lists the generated checkpoints for the user workload, these checkpoints are completely capable to be used in the Checkpoints Feature from the Snowflake Exentesion.

* **Name:** The checkpoint name (using the format described before).
* **FileId:** the relative path of the file (starting from the input folder the user chose in the SMA tool).
* **CellId:** the number of cell where the DataFrame operation was found inside a notebook file.
* **Line:** line number where the DataFrame operation was found.
* **Column:** the column number where the DataFrame operation was found.
* **Type:** the use case of the checkpoints (Collection or Validation).
* **DataFrameName:** The name of the DataFrame.
* **Location:** The assignment number of the DataFrame name.
* **Enabled:** Indicates whether the checkpoint is enabled (True or False).
* **Mode:** The mode number of the collection (Schema [1] or DataFrame [2]).
* **Sample:** The sample of the DataFrame.
* **EntryPoint:** The entry point that guide the flow to execute the checkpoint.
* **ExecutionId:** the execution id.

## DataFrames Inventory

The **DataFramesInventory.csv** lists the dataframes assignments found in order to be used to generate checkpoints for the user workload.

* **FullName:** The full name of the DataFrame.
* **Name:** The simple name of the variable of the DataFrame.
* **FileId:** The relative path of the file (starting from the input folder the user chose in the SMA tool).
* **CellId:** The number of cells where the DataFrame operation was found inside a notebook file.
* **Line:** The line number where the DataFrame operation was found.
* **Column:** The column number where the DataFrame operation was found.
* **AssignmentNumber:** The number of assignments for this particular identifier (not symbol) in the file.
* **RelevantFunction:** The relevant function why this was collected.
* **RelatedDataFrames:** The full qualified name of the DataFrame(s) involved in the operation (separated by semicolon).
* **EntryPoints:** it will be empty for this phase. In a later phase, it will be filled.
* **ExecutionId:** the execution id.

## Artifact Dependency Inventory

The **ArtifactDependencyInventory.csv** lists the artifact dependencies of each file analyzed by the SMA. This inventory allows the user to determine which artifacts are needed for the file to work properly in Snowflake.

The following are considered artifacts: a third-party library, SQL entity, source of a read or write operation, and another source code file in the workload.

* **ExecutionId:** the identifier of the execution.
* **FileId:** the identifier of the source code file.
* **Dependency:** the artifact dependency that the current file has.
* **Type:** the type of the artifact dependency.

  + *UserCodeFile:* source code or notebook.
  + *IOSources:* resource required for input and output operation.
  + *ThirdPartyLibraries:* a third-party library.
  + *UnknownLibraries\*\**:\*\* a library whose origin was not determined by SMA.
  + *SQLObjects:* an SQL entity: table or view, for example.
* **Success:** If the artifact needs any intervention, it shows FALSE; otherwise, it shows TRUE.
* **Status_Detail**: the status of the artifact dependency, based on the type.

  + *UserCodeFile:*
    \* Parsed: the file was parsed successfully.
    \* NotParsed: the file parsing failed.
  + *IOSources:*
    \* Exists: the resource of the operation is in the workload.
    \* DoesNotExists: the resource of the operation is not present in the input.
  + *ThirdPartyLibraries:*
    \* Supported: the library is supported by Snowpark Anaconda.
    \* NotSupported: the library is not supported by Snowpark Anaconda.
  + *UnknownLibraries:*
    \* NotSupported: since the origin was not determined by SMA.
  + **\*SQLObject\***
    \* DoesNotExists: the embedded statement that creates the entity is not in the input source code.
    \* Exists: the embedded statement that creates the entity is in the input source code.
* **Arguments**: an extra data of the artifact dependency, based on the type.
* **Location**: the collection of cell ID and line number where the artifact dependency is being used in the source code file.
* **IndirectDependencies:** A list of other files that this file relies on, even if not directly.
* **TotalIndirectDependencies:** The total count of these indirect dependencies.

* **DirectParents:** A list of files that directly use this file.
* **TotalDirectParents:** The total count of these direct parent files.

* **IndirectParents:** A list of files that use this file indirectly (through other files).
* **TotalIndirectParents:** The total count of these indirect parent files.

## Files Inventory

The **files.csv** contains a complete list of all files processed during tool execution, including their file types and sizes.

* Path: The file location relative to the root directory. For example, files in the root directory will show only their filename.
* Technology: The programming language of the source code (Python or Scala)
* FileKind: Identifies if the file contains source code or is another type (such as text or log files)
* BinaryKind: Indicates if the file is human-readable text or a binary file
* Bytes: The file size measured in bytes
* SupportedStatus: Always shows “DoesNotApply” as file support status is not applicable in this context

## Import Usages Inventory

The **ImportUsagesInventory.csv** file contains a list of all external library imports found in your codebase. An external library is any package or module that is imported into your source code files.

* Element: The unique identifier for the Spark reference
* ProjectId: The root directory name where the tool was executed
* FileId: The relative path and filename containing the Spark reference
* Count: Number of occurrences of the element in a single line
* Alias: Optional alternative name for the element
* Kind: Always empty/null as all elements are imports
* Line: Source code line number where the element appears
* PackageName: Package containing the element
* Supported: Indicates if the reference can be converted (True/False)
* Automated: Empty/null (deprecated column)
* Status: Always “Invalid” (deprecated column)
* Statement: The actual code using the element [Not included in telemetry]
* SessionId: Unique identifier for each tool execution
* SnowConvertCoreVersion: Version number of the tool’s core processing engine
* SnowparkVersion: Available Snowpark API version for the specific technology
* ElementPackage: Package name containing the imported element (when available)
* CellId: For notebook files, indicates the cell number containing the element
* ExecutionId: Unique identifier for this SMA execution
* Origin: Source type of the import (BuiltIn, ThirdPartyLib, or blank)

## Input Files Inventory

The **InputFilesInventory.csv** file contains a detailed list of all files, organized by their file types and sizes.

* Element: The filename, which is identical to FileId
* ProjectId: The name of the project, represented by the root directory where the tool was executed
* FileId: The complete path to the file containing the Spark reference, shown as a relative path
* Count: The number of files sharing this filename
* SessionId: A unique identifier assigned to each tool session
* Extension: The file extension type
* Technology: The programming language or technology type, determined by the file extension
* Bytes: The file size measured in bytes
* CharacterLength: The total number of characters in the file
* LinesOfCode: The total number of code lines in the file
* ParsingResult: Indicates whether the cell was successfully parsed (“Successful”) or encountered errors (“Error”)

## Input and Ouput Files Inventory

The **IOFilesInventory.csv** file contains a list of all external files and resources that your code reads from or writes to.

* Element: The specific item (file, variable, or component) being accessed for reading or writing operations
* ProjectId: The name of the root directory where the tool was executed
* FileId: The complete path and filename where Spark code was detected
* Count: The number of occurrences of this filename
* isLiteral: Indicates whether the read/write location is specified as a literal value
* Format: The detected file format (such as CSV, JSON) if SMA can identify it
* FormatType: Specifies if the identified format is explicit
* Mode: Indicates whether the operation is “Read” or “Write”
* Supported: Indicates if Snowpark supports this operation
* Line: The line number in the file where the read or write operation occurs
* SessionId: A unique identifier assigned to each tool session
* OptionalSettings: Lists any additional parameters defined for the element
* CellId: For notebook files, identifies the specific cell location (null for non-notebook files)
* ExecutionId: A unique identifier for each time the tool is run

## Issue Inventory

The **Issues.csv** file contains a detailed report of all conversion issues discovered in your codebase. For each issue, you will find:

* A description explaining the problem
* The precise location within the file where the issue occurs
* A unique code identifier for the issue type

For more detailed information about specific issues, refer to the [issue analysis](../../../issue-analysis/approach.md) section of our documentation.

* Code: A unique identifier assigned to each issue detected by the tool
* Description: A detailed explanation of the issue, including the Spark reference name when applicable
* Category: The type of issue found, which can be one of the following:

  + Warning
  + Conversion Error
  + Parser Error
  + Helper
  + Transformation
  + WorkAround
  + NotSupported
  + NotDefined
* NodeType: The syntax node identifier where the issue was detected
* FileId: The relative path and filename where the Spark reference was found
* ProjectId: The root directory name where the tool was executed
* Line: The specific line number in the source file where the issue occurs
* Column: The specific character position in the line where the issue occurs

## Joins Inventory

The **JoinsInventory.csv** file contains a comprehensive list of all dataframe join operations found in the codebase.

* Element: Line number indicating where the join starts (and ends, if spanning multiple lines)
* ProjectId: Name of the root directory where the tool was executed
* FileId: Path and name of the file containing the Spark reference
* Count: Number of files with the same filename
* isSelfJoin: TRUE if joining a table with itself, FALSE otherwise
* HasLeftAlias: TRUE if an alias is defined for the left side of the join, FALSE otherwise
* HasRightAlias: TRUE if an alias is defined for the right side of the join, FALSE otherwise
* Line: Starting line number of the join
* SessionId: Unique identifier assigned to each tool session
* CellId: Identifier of the notebook cell containing the element (null for non-notebook files)
* ExecutionId: Unique identifier for each tool execution

## Notebook Cells Inventory

The **NotebookCellsInventory.csv** file provides a detailed list of all cells within a notebook, including their source code content and the number of code lines per cell.

* Element: The programming language used in the source code (Python, Scala, or SQL)
* ProjectId: The name of the root directory where the tool was executed
* FileId: The complete path and filename where Spark code was detected
* Count: The number of files with this specific filename
* CellId: For notebook files, the unique identifier of the cell containing the code (null for non-notebook files)
* Arguments: This field is always empty (null)
* LOC: The total number of code lines in the cell
* Size: The total number of characters in the cell
* SupportedStatus: Indicates whether all elements in the cell are supported (TRUE) or if there are unsupported elements (FALSE)
* ParsingResult: Shows if the cell was successfully parsed (“Successful”) or if there were parsing errors (“Error”)

## Notebook Size Inventory

The **NotebookSizeInventory.csv** file provides a summary of code lines for each programming language found in notebook files.

* filename: The name of the spreadsheet file (identical to the FileId)
* ProjectId: The name of the root directory where the tool was executed
* FileId: The relative path and name of the file containing Spark references
* Count: The number of files with this specific filename
* PythonLOC: Number of Python code lines in notebook cells (zero for regular files)
* ScalaLOC: Number of Scala code lines in notebook cells (zero for regular files)
* SqlLOC: Number of SQL code lines in notebook cells (zero for regular files)
* Line: This field is always empty (null)
* SessionId: A unique identifier assigned to each tool session
* ExecutionId: A unique identifier assigned to each tool execution

## Pandas Usages Inventory

The **PandasUsagesInventory.csv** file contains a comprehensive list of all Pandas API references found in your Python codebase during the scanning process.

* Element: The unique identifier for the pandas reference
* ProjectId: The root directory name where the tool was executed
* FileId: The relative path to the file containing the spark reference
* Count: Number of occurrences of the element in a single line
* Alias: The alternative name used for the element (only applies to imports)
* Kind: The type of element, such as Class, Variable, Function, Import, etc.
* Line: The source file line number where the element was found
* PackageName: The package containing the element
* Supported: Indicates if the reference is supported (True/False)
* Automated: Indicates if the tool can automatically convert the element (True/False)
* Status: Element classification: Rename, Direct, Helper, Transformation, WorkAround, NotSupported, or NotDefined
* Statement: The context in which the element was used [Not included in telemetry]
* SessionId: A unique identifier for each tool execution
* SnowConvertCoreVersion: The version number of the tool’s core processing code
* SnowparkVersion: The Snowpark API version available for the specific technology and tool run
* PandasVersion: The pandas API version used to identify elements in the codebase
* CellId: The cell identifier in the FileId (only for notebooks, null otherwise)
* ExecutionId: A unique identifier for each tool execution

## Spark Usages Inventory

The **SparkUsagesInventory.csv** file identifies where and how Spark API functions are used in your code. This information helps calculate the [Readiness Score](../../../support/glossary.md), which indicates how ready your code is for migration.

* Element: The unique identifier for the Spark reference
* ProjectId: The root directory name where the tool was executed
* FileId: The relative path and filename containing the Spark reference
* Count: Number of occurrences of the element in a single line
* Alias: The element’s alias (only applies to import elements)
* Kind: The element’s category (e.g., Class, Variable, Function, Import)
* Line: The source file line number where the element was found
* PackageName: The package name containing the element
* Supported: Indicates if the reference is supported (True/False)
* Automated: Indicates if the tool can automatically convert the element (True/False)
* Status: Element categorization (Rename, Direct, Helper, Transformation, WorkAround, NotSupported, NotDefined)
* Statement: The actual code where the element was used [NOTE: This column is not sent via telemetry]
* SessionId: A unique identifier for each tool execution
* SnowConvertCoreVersion: The tool’s core process version number
* SnowparkVersion: The available Snowpark API version for the specific technology and tool run
* CellId: For notebook files, the cell’s numerical location where the element was found
* ExecutionId: A unique identifier for this specific SMA execution

The **SqlStatementsInventory.csv** file contains a count of SQL keywords found in Spark SQL elements.

* Element: Name of the code element containing the SQL statement
* ProjectId: Root directory name where the tool was executed
* FileId: Relative path to the file containing the Spark reference
* Count: Number of occurrences of the element in a single line
* InterpolationCount: Number of external elements inserted into this element
* Keywords: Dictionary containing SQL keywords and their frequency
* Size: Total character count of the SQL statement
* LiteralCount: Number of string literals in the element
* NonLiteralCount: Number of SQL components that are not string literals
* Line: Line number where the element appears
* SessionId: Unique identifier for each tool session
* CellId: Identifier of the notebook cell containing the element (null if not in a notebook)
* ExecutionId: Unique identifier for each tool execution

## SQL Elements Inventory

The SQLElementsInventory.csv file contains a count of SQL statements found within Spark SQL elements.

Here are the fields included in the SQL analysis report:

* Element: SQL code element type (Example: SqlSelect, SqlFromClause)
* ProjectId: Root directory name where the tool was executed
* FileId: Path to the file containing the SQL code
* Count: Number of occurrences of the element in a single line
* NotebookCellId: ID of the notebook cell
* Line: Line number where the element appears
* Column: Column number where the element appears
* SessionId: Unique ID for each tool session
* ExecutionId: Unique ID for each tool run
* SqlFlavor: Type of SQL being analyzed (Example: Spark SQL, Hive SQL)
* RootFullName: Complete name of the main code element
* RootLine: Line number of the main element
* RootColumn: Column number of the main element
* TopLevelFullName: Complete name of the highest-level SQL statement
* TopLevelLine: Line number of the highest-level statement
* TopLevelColumn: Column number of the highest-level statement
* ConversionStatus: Result of SQL conversion (Example: Success, Failed)
* Category: Type of SQL statement (Example: DDL, DML, DQL)
* EWI: Error Warning Information code
* ObjectReference: Name of the SQL object being referenced (Example: table name, view name)

## SQL Embedded Usage Inventory

The SqlEmbeddedUsageInventory.csv file contains a count of SQL keywords found within Spark SQL elements.

* Element: The type of SQL component found in the code (such as Select statement, From clause, or Numeric literal)
* ProjectId: The name of the root directory where the tool was executed
* FileId: The location and relative path of the file containing the SQL reference
* Count: How many times this element appears in a single line
* ExecutionId: A unique ID assigned to each tool execution
* LibraryName: The name of the library in use
* HasLiteral: Shows if the element contains literal values
* HasVariable: Shows if the element contains variables
* HasFunction: Shows if the element contains function calls
* ParsingStatus: The current parsing state (Success, Failed, or Partial)
* HasInterpolation: Shows if the element contains string interpolations
* CellId: The identifier for the notebook cell
* Line: The line number where the element is found
* Column: The column number where the element is found

## Third Party Usages Inventory

The **ThirdPartyUsagesInventory.csv** file contains

* Element: The unique identifier for the third-party reference
* ProjectId: The name of the project’s root directory where the tool was executed
* FileId: The relative path to the file containing the Spark reference
* Count: The number of occurrences of the element in a single line
* Alias: The alternative name assigned to the element (if applicable)
* Kind: The type classification of the element (variable, type, function, or class)
* Line: The source file line number where the element was found
* PackageName: The full package name (combination of ProjectId and FileId in Python)
* Statement: The actual code where the element was used [NOTE: Not included in telemetry data]
* SessionId: A unique identifier for each tool session
* CellId: The notebook cell identifier where the element was found (null for non-notebook files)
* ExecutionId: A unique identifier for each tool execution

## Packages Inventory

The **packagesInventory.csv** file contains

* Package Name: The name of the package being analyzed.
* Project Name: The name of the project, which corresponds to the root directory where the tool was executed.
* File Location: The file path where the package was found, shown as a relative path.
* Occurrence Count: The number of times this package appears on a single line of code.

## Tool Execution Summary

The **tool_execution.csv** file contains essential information about the current execution of the Snowpark Migration Accelerator (SMA) tool.

* ExecutionId: A unique identifier assigned to each time the tool runs.
* ToolName: The name of the tool being used. Can be either PythonSnowConvert or SparkSnowConvert (for Scala).
* Tool_Version: The version number of the software.
* AssemblyName: The complete name of the code processor (a more detailed version of ToolName).
* LogFile: Indicates if a log file was generated when an error or failure occurred.
* FinalResult: Indicates at which point the tool stopped if an error or failure occurred.
* ExceptionReport: Indicates if an error report was generated when a failure occurred.
* StartTime: The date and time when the tool began running.
* EndTime: The date and time when the tool finished running.
* SystemName: The machine’s serial number where the tool was run (used only for troubleshooting and verifying licenses).

---
title: Snowpark Migration Accelerator:  Spark Reference Categories
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/spark-reference-categories.md
section: Migrations
---

# Snowpark Migration Accelerator: Spark Reference Categories

SnowConvert for Spark categorizes Spark elements based on how they can be mapped to Snowpark. The following categories describe how each Spark reference is translated, including:

* Whether SnowConvert can automatically convert it
* Whether it’s possible to implement in Snowpark

Also it provides

* Examples of the translation
* Description of the mapping process

The sections below explain each status type and provide examples.

## Direct

Direct translation means the function works exactly the same way in both PySpark and Snowpark, requiring no modifications to the code.

* Snowpark Support: Available
* Tool Support: Available
* Spark Example:

```python
col("col1")
```

* Snowpark Example:

```python
col("col1")
```

## Rename

The PySpark function has an equivalent in Snowpark, but you need to use a different function name.

* Snowpark Support: Available
* Tool Support: Available
* Spark Example:

```python
orderBy("date")
```

* Snowpark Example:

```python
sort("date")
```

## Helper

> **Note:**
>
> Starting from Spark Conversion Core V2.40.0, the Python extensions library is no longer supported. Python Spark elements will not be classified as extensions from this version onward. However, helper classes in the Snowpark extensions library will continue to be available for Spark Scala.

To address the difference between Spark and Snowpark functionality, you can create a helper function in an extension file. This helper function will have the same signature as the original Spark function and can be called from any file where needed. The extension library will contain this function to resolve the compatibility issue.

For more information about the Snowpark extensions library, visit our GitHub repository at <https://github.com/Snowflake-Labs/snowpark-extensions>.

Examples include fixed additional parameters and changes to parameter order.

* Snowpark Support: Available
* Tool Support: Available
* Spark Example:

```python
instr(str, substr)
```

* Snowpark Example:

```python
# creating a helper function named instr with an
# identical signature as the pyspark function, like:

def instr(source: str, substr: str) -> str:
    """
    Returns the position of a substring within a source string.
    Similar to the CHARINDEX function in SQL.

    Args:
        source: The string to search in
        substr: The string to search for

    Returns:
        The position where the substring is found, or 0 if not found
    """
    return charindex(substr, str)
```

## Transformation

The function is rebuilt in Snowpark to achieve the same results as the original, though it may look different. The new version might use multiple functions or additional code lines to accomplish the same task.

* Snowpark Support: Yes
* Tool Support: Yes
* Spark Example:

```python
col1 = col("col1")
col2 = col("col2")
col1.contains(col2)
```

* Snowpark Example:

```python
col1 = col("col1")
col2 = col("col2")
from snowflake.snowpark.functions as f
f.contains(col, col2)
```

## WorkAround

This category applies when SMA cannot automatically convert a PySpark element, but there is a documented manual solution available in the tool’s documentation to help you complete the conversion.

* Snowpark Support: Available
* Tool Support: Not Available
* Spark Example:

```python
instr(str, substr)
```

* Snowpark Example:

```python
#EWI: SPRKPY#### => pyspark function has a workaround, see documentation for more info
charindex(substr, str)
```

## NotSupported

This category applies when a PySpark element cannot be converted because there is no matching equivalent in Snowflake.

* Snowpark Support: Not Available
* Tool Support: Not Available
* Spark Example:

```python
df:DataFrame = spark.createDataFrame(rowData, columns)
df.alias("d")
```

* Snowpark Example:

```python
df:DataFrame = spark.createDataFrame(rowData, columns)
# EWI: SPRKPY11XX => DataFrame.alias is not supported
# df.alias("d")
```

## NotDefined

This error occurs when the tool identifies a PySpark element but cannot convert it because the element is not included in the tool’s supported conversion database.

This category applies when a PySpark element cannot be converted because there is no corresponding feature or functionality in Snowflake.

* Snowpark Support: Not Available
* Tool Support: Not Available
* Spark Example: Not Applicable
* Snowpark Example: Not Applicable

The assessment results will classify all detected Spark API references into the following categories.

---
title: Snowpark Migration Accelerator:  Spark SQL
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Spark SQL

While Snowflake SQL and Spark SQL share many similarities, they have distinct differences that require careful translation. When migrating between these platforms, certain SQL statements and functions need to be converted to ensure compatibility.

Let’s examine how these tools compare across different categories.

| Category | Documentation |
| --- | --- |
| DDL | [spark-sql-ddl](spark-sql-ddl/README.md) |
| DML | [spark-sql-dml](spark-sql-dml/README.md) |
| Data Types | <spark-sql-data-types.md> |

---
title: Snowpark Migration Accelerator:  Spark SQL Data Types
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-data-types.md
section: Migrations
---

# Snowpark Migration Accelerator: Spark SQL Data Types

## Conversion Table

| Spark SQL | Snowflake | Notes |
| --- | --- | --- |
| `BIGINT` | `BIGINT` |  |
| `BOOLEAN` | `BOOLEAN` |  |
| `BYTE` | `BYTEINT` |  |
| `CHAR` | `CHAR` |  |
| `DATE` | `DATE` |  |
| `DECIMAL` | `DECIMAL` |  |
| `DOUBLE` | `DOUBLE` |  |
| `FLOAT` | `FLOAT` |  |
| `INTEGER` | `INTEGER` |  |
| `LONG` | `INT` | Check out note |
| `SHORT` | `INT` | Check out note |
| `STRING` | `STRING` |  |
| `TIMESTAMP` | `TIMESTAMP_TZ` |  |
| `TIMESTAMPNTZ` | `TIMESTAMP_NTZ` |  |
| `VARCHAR` | `VARCHAR` |  |

## Notes

> **Note:**
>
> For more information, refer to the Spark SQL [data types](https://spark.apache.org/docs/latest/sql-ref-datatypes.html#data-types) documentation.

### Integer types

When converting integer data types from the source system, both `LONG` and `SHORT` are mapped to Snowflake’s `INTEGER` data type, as `INTEGER` can accommodate the full range of values for both data types.

* SparkSQL LONG: Range from -32,768 to 32,767
* SparkSQL SHORT: Range from -9,223,372,036,854,775,808 to 9,223,372,036,854,775,807
* Snowflake INTEGER: Range from -9.9999999999999999999999999999999999999 x 10^38 to +9.9999999999999999999999999999999999999 x 10^38

---
title: Snowpark Migration Accelerator:  Spark SQL DML
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Spark SQL DML

While SELECT statements are common across different SQL databases, the way data manipulation (DML) works can vary significantly between database systems. Each database has its own unique features and limitations that affect how DML statements behave, making some operations more complex than they might appear.

Here’s an overview of essential DML (Data Manipulation Language) statements you should be familiar with:

* [MERGE](merge.md)
* [SELECT](select/README.md)

Let’s examine each of these components in detail.

---
title: Snowpark Migration Accelerator:  SparkSQL
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/sql/sparksql/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SparkSQL

## Issues Codes

All of the warnings, parsing errors, and conversion exceptions generated by the SMA when SparkSQL is selected as the Database language to migrate SQL statements will appear below. If you have any concerns or see something that is not right, please reach out to the SMA support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

| Code | Description | Category | Deprecated |
| --- | --- | --- | --- |
| `SPRKSQPQL1001` | Unrecognized token | Parsing Error |  |
| `SPRKSPSQL1002` | Unsupported statement in Snowflake | Conversion Error |  |
| `SPRKSPSQL1003` | The name expression is currently not supported in Snowflake. | Conversion Error |  |
| `SPRKSPSQL1005` | TblProperties is not supported in Snowflake | Conversion Error |  |

---
title: Snowpark Migration Accelerator:  SQL
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/sql/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SQL

The Snowpark Migration Accelerator (SMA) is primarily designed to analyze and convert code in scripts and notebooks. That can be from a variety of scripting languages, but also includes SQL. As the SMA analyzes and converts SQL code included in a workload, issue codes specific to SQL will be necessary to support the migration.

## Supported SQL

* [Hive](hive/README.md)
* [SparkSQL](sparksql/README.md)

---
title: Snowpark Migration Accelerator:  SQL Embedded code
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/sql-embedded-code.md
section: Migrations
---

# Snowpark Migration Accelerator: SQL Embedded code

> **Note:**
>
> Currently, SMA only supports the [\*pyspark.sql\*](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.sql.html) function.

SMA can transform SQL code that is embedded within Python or Scala files. It processes embedded SQL code in the following file extensions:

* Python source code files (with .py extension)
* Scala source code files (with .scala extension)
* Jupyter Notebook files (with .ipynb extension)
* Databricks source files (with .python or .scala extensions)
* Databricks Notebook archive files (with .dbc extension)

## Embedded SQL Code transformation Samples

### Supported Case

* Using the [\*spark.sql\*](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.sql.html) function in Python to execute SQL queries:

```python
# Original in Spark
spark.sql("""MERGE INTO people_target pt
USING people_source ps
ON (pt.person_id1 = ps.person_id2)
WHEN NOT MATCHED BY SOURCE THEN DELETE""")
```

```python
# SMA transformation
spark.sql("""MERGE INTO people_target pt
USING (
   SELECT
      pt.person_id1
   FROM
      people_target pt
      LEFT JOIN
         people_source ps
         ON pt.person_id1 = ps.person_id2
   WHERE
      ps.person_id2 IS NULL
) s_src
ON pt.person_id1 = s_src.person_id1
WHEN MATCHED THEN
   DELETE;""")
```

## Unsupported Cases

When SMA encounters code that it cannot convert, it generates an Error, Warning, and Issue (EWI) message in the output code. For more details about these messages, see EWI.

The following scenarios are not currently supported:

* When working with SQL code, you can incorporate string variables in the following way:

```python
query = "SELECT COUNT(COUNTRIES) FROM SALES"
dfSales = spark.sql(query)
```

```python
query = "SELECT COUNT(COUNTRIES) FROM SALES"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
dfSales = spark.sql(query)
```

* Combining strings to build SQL code using simple concatenation:

```sql
base = "SELECT "
criteria_1 = " COUNT(*) "
criteria_2 = " * "
fromClause = " FROM COUNTRIES"

df1 = spark.sql(bas + criteria_1 + fromClause)
df2 = spark.sql(bas + criteria_2 + fromClause)
```

```python
base = "SELECT "
criteria_1 = " COUNT(*) "
criteria_2 = " * "
fromClause = " FROM COUNTRIES"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.

df1 = spark.sql(bas + criteria_1 + fromClause)
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
df2 = spark.sql(bas + criteria_2 + fromClause)
```

* Using string interpolation to dynamically generate SQL statements:

```python
# Old Style interpolation
UStbl = "SALES_US"
salesUS = spark.sql("SELECT * FROM %s" % (UStbl))

# Using format function
COLtbl = "COL_SALES WHERE YEAR(saleDate) > 2023"
salesCol = spark.sql("SELECT * FROM {}".format(COLtbl))

# New Style
UKTbl = " UK_SALES_JUN_18"
salesUk = spark.sql(f"SELECT * FROM {UKTbl}")
```

```python
# Old Style interpolation
UStbl = "SALES_US"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
salesUS = spark.sql("SELECT * FROM %s" % (UStbl))

# Using format function
COLtbl = "COL_SALES WHERE YEAR(saleDate) > 2023"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
salesCol = spark.sql("SELECT * FROM {}".format(COLtbl))

# New Style
UKTbl = " UK_SALES_JUN_18"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
salesUk = spark.sql(f"SELECT * FROM {UKTbl}")
```

* Using functions that generate SQL queries dynamically:

```python
def ByMonth(month):
    query = f"SELECT * LOGS WHERE MONTH(access_date) = {month}"
    return spark.sql(query)
```

```python
def ByMonth(month):
query = f"SELECT * LOGS WHERE MONTH(access_date) = {month}"
    #EWI: SPRKPY1077 => SQL embedded code cannot be processed.
    return spark.sql(query)
```

## Unsupported Cases and EWI messages

* When analyzing Scala code, the error code SPRKSCL1173 indicates unsupported embedded SQL statements.

```scala
/*Scala*/
 class SparkSqlExample {
    def main(spark: SparkSession) : Unit = {
    /*EWI: SPRKSCL1173 => SQL embedded code cannot be processed.*/
    spark.sql("CREATE VIEW IF EXISTS My View AS Select * From my Table WHERE date < current_date() ")
    }
```

* When Python code contains unsupported embedded SQL statements, the error code SPRKPY1077 will be displayed.

```python
# Python Output
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
b = spark.sql("CREATE VIEW IF EXISTS My View AS Select * From my Table WHERE date < current_date() ")
```

---
title: Snowpark Migration Accelerator:  SQL statements
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/sit-tagging/sql-statements.md
section: Migrations
---

# Snowpark Migration Accelerator: SQL statements

## Tagged elements

SQL statements are tagged to monitor usage and consumption.

| Statements | HiveSQL | SparkSQL | SnowSQL |
| --- | --- | --- | --- |
| **CREATE TABLE** | SUPPORTED | SUPPORTED | FUNCTIONAL EQUIVALENT |
| **CREATE VIEW** | SUPPORTED | SUPPORTED | FUNCTIONAL EQUIVALENT |
| **CREATE FUNCTION** | NOT SUPPORTED | SUPPORTED | FUNCTIONAL EQUIVALENT |
| **ALTER TABLE** | SUPPORTED | SUPPORTED | FUNCTIONAL EQUIVALENT |
| **ALTER VIEW** | SUPPORTED | SUPPORTED | FUNCTIONAL EQUIVALENT |

> **Note:**
>
> When a comment is marked as “FUNCTIONAL EQUIVALENT,” it means that only the comment’s transformation to Snowflake has been validated. Any other statements within the comment are not included in this status assessment.

### Usages

The tool identifies and tags the following statements:

#### CREATE STATEMENTS

CREATE statements will include tags in two scenarios:

1. The SQL statement is missing the COMMENT property.
2. The SQL statement includes a `COMMENT` property, but no value has been assigned to it.

If a SQL statement includes a comment, the comment will be preserved during the conversion process.

##### Example

**Input (Apache SparkSQL)**

```sql
CREATE OR REPLACE VIEW some_view
AS
SELECT id, name FROM some_table WHERE some_column > 5;

CREATE OR REPLACE FUNCTION blue()
RETURNS STRING
LANGUAGE SQL
COMMENT ''
RETURN '0000FF';

CREATE TABLE my_varchar (
    COL1 VARCHAR(5)
) COMMENT 'The Table';
```

**Output (Snowflake SQL)**

```sql
CREATE OR REPLACE VIEW some_view
COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":1,"minor":2,"patch":3},"attributes":{"language":"HiveSql"}}'
AS
SELECT
   id,
   name
FROM
   some_table
WHERE
   some_column > 5;

CREATE OR REPLACE FUNCTION blue()
RETURNS STRING LANGUAGE SQL
COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}'
RETURN '0000FF';

CREATE TABLE my_varchar
(COL1 VARCHAR(5))
COMMENT = 'The Table';
```

The formatting of the generated code may appear different from the source code due to formatting differences in the original file.

---

##### Create Table

**Input code (SparkSQL)**

```sql
CREATE TABLE SOME_TABLE
(COL1 VARCHAR(5));
```

**Output code (Snowflake SQL)**

```sql
CREATE TABLE SOME_TABLEA
(COL1 VARCHAR(5))
COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}';
```

---

##### CREATE VIEW

**Source Code (HiveSQL)**

```sql
CREATE OR REPLACE VIEW experienced_employee
AS
SELECT id, name FROM all_employee
WHERE working_years > 5;
```

**Output code (Snowflake SQL)**

```sql
CREATE OR REPLACE VIEW experienced_employee
COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":1,"minor":2,"patch":3},"attributes":{"language":"HiveSql"}}'
AS
SELECT
   id,
   name
FROM
   all_employee
WHERE
   working_years > 5;
```

---

##### CREATE FUNCTION

**Input code (SparkSQL)**

```sql
CREATE OR REPLACE FUNCTION blue()
RETURNS STRING
LANGUAGE SQL RETURN '0000FF';
```

**Output (Snowflake SQL)**

```sql
CREATE OR REPLACE FUNCTION blue()
RETURNS STRING
LANGUAGE SQL
COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}'
RETURN '0000FF';
```

#### ALTER STATEMENTS

ALTER statements include a tag when the comment property is empty. This occurs in two scenarios in SparkSQL:

1. When using `SET TBLPROPERTIES` with an empty comment
2. When using `UNSET TBLPROPERTIES`

##### Examples

**SET TBLPROPERTIES (ALTER VIEW and ALTER TABLE)**

**Input (Apache Spark SQL)**

```sql
ALTER TABLE SOME_TABLE SET TBLPROPERTIES ('comment'= ' ');
-- ALTER VIEW
ALTER VIEW SOME_VIEW SET TBLPROPERTIES ('comment'= ' ');
```

**Output (Snowflake SQL)**

```sql
-- ALTER TABLE
ALTER TABLE SOME_TABLE
SET TBLPROPERTIES ('comment' = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}');

-- ALTER VIEW
ALTER VIEW SOME_VIEW
SET TBLPROPERTIES ('comment' = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}');

**Input (Apache HiveSQL)**

```{code} sql
:force:
-- ALTER TABLE
ALTER TABLE SOME_TABLE SET TBLPROPERTIES ('comment'= ' ');

-- ALTER VIEW
ALTER VIEW SOME_VIEW SET TBLPROPERTIES ('comment'= ' ');
```

**Output (Snowflake SQL)**

```sql
-- ALTER TABLE
ALTER TABLE SOME_TABLE
SET TBLPROPERTIES ('comment' = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"HiveSql"}}');

-- ALTER VIEW
ALTER VIEW SOME_VIEW
SET TBLPROPERTIES ('comment' = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"HiveSql"}}');
```

**UNSET TBLPROPERTIES (ALTER VIEW and ALTER TABLE)**

**Input (Apache Spark SQL)**

```sql
-- ALTER TABLE
ALTER TABLE SOME_TABLE UNSET TBLPROPERTIES ('comment');

-- ALTER VIEW
ALTER VIEW SOME_VIEW UNSET TBLPROPERTIES ('comment');

**Output (Snowflake SQL)**

```{code} sql
:force:
-- ALTER TABLE
ALTER TABLE SOME_TABLE
UNSET TBLPROPERTIES ('comment')
ALTER TABLE SOME_TABLE
SET COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}'

-- ALTER VIEW
ALTER VIEW SOME_VIEW
UNSET TBLPROPERTIES ('comment')
ALTER VIEW SOME_VIEW
SET COMMENT = '{"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"SparkSql"}}'
```

---
title: Snowpark Migration Accelerator:  Supported Filetypes
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/before-using-the-sma/supported-filetypes.md
section: Migrations
---

# Snowpark Migration Accelerator: Supported Filetypes

The Snowpark Migration Accelerator (SMA) scans files in your selected source directory during project creation. While some files are excluded based on their type, SMA generates a summary report showing the count of files by extension.

The SMA tool searches for specific file extensions when analyzing references to the Spark API, SQL Statements, and other elements that contribute to the [Readiness Scores](../assessment/readiness-scores.md). The tool can analyze both code files and notebooks located in any directory or subdirectory of your project.

## Code Files

The Snowpark Migration Accelerator scans the following file types to identify references to Spark API and other third-party APIs:

* Files with the extension .scala
* Files with the extension .py
* Files with the extension .python

SQL statements written in Spark SQL or HiveQL can be detected in the following file types:

* SQL files with the extension .sql
* Hive Query Language files with the extension .hql

## Notebooks

Both the Spark Scala and PySpark parsers in the Snowpark Migration Accelerator (SMA) automatically scan and process Jupyter Notebook files and exported Databricks files when they are present in the source code directory.

* Jupyter Notebook files (`*.ipynb`)
* Databricks Notebook files (`*.dbc`)

The SMA will analyze notebook files to identify:

* References to the Spark API
* References to other third-party APIs
* SQL statements

The analysis is performed based on the cell type within each notebook. Notebooks can contain a mix of SQL, Python, and Scala cells. The SMA will create an [inventory of all cell types](../scos-conversion/output-reports/sma-inventories.md) in its output report.

### Excluded Files and folders

By default, certain files and folders are excluded from scanning. These exclusions primarily consist of project configuration files and their associated directories.

#### Folders type excluded from the scanning:

* Python package installer (pip) - A tool for installing Python packages
* Distribution packages (dist) - A directory containing Python packages ready for distribution
* Virtual environment (venv) - An isolated Python environment for managing project dependencies
* Site-packages - A directory where Python packages are installed for use across the system

#### Files type excluded from the scanning:

* input.wsp - Workspace input file
* .DS_Store - macOS system file that stores custom folder attributes
* build.gradle - Gradle build configuration file
* build.sbt - Scala Build Tool configuration file
* pom.xml - Maven Project Object Model configuration file
* storage.lck - Storage lock file

---
title: Snowpark Migration Accelerator:  Supported functions
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/supported-functions.md
section: Migrations
---

# Snowpark Migration Accelerator: Supported functions

## Strings

| Function | Status |
| --- | --- |
| CONCAT | SUPPORTED |
| LEFT | SUPPORTED |
| LOWER | SUPPORTED |
| SPLIT | SUPPORTED |
| SUBSTRING | SUPPORTED |
| SUBSTRING_INDEX | PENDING |
| TRIM | SUPPORTED |
| UPPER | SUPPORTED |

### Numerics

| Function | Status |
| --- | --- |
| COUNT | SUPPORTED |
| LAST | PENDING |
| MAX | SUPPORTED |
| MIN | SUPPORTED |
| ROUND | SUPPORTED |
| SUM | SUPPORTED |

### Date

| Function | Status |
| --- | --- |
| CURRENT_DATE | SUPPORTED |
| CURRENT_TIMESTAMP | SUPPORTED |
| DATEDIFF | PENDING |
| DATE_SUB | PENDING |
| DAY | SUPPORTED |
| FIRST_VALUE | PENDING |
| FROM_UNIXTIME | PENDING |
| TO_DATE | SUPPORTED |
| WEEKOFYEAR | SUPPORTED |

### Advanced functions

| Function | Status |
| --- | --- |
| COALESCE | SUPPORTED |
| EXPLODE | PENDING |
| IFNULL | SUPPORTERD |
| MERGE | PENDING |
| POSEXPLODE | PENDING |
| ROW_NUMBER | SUPPORTED |

---
title: Snowpark Migration Accelerator:  Supported Platforms
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/before-using-the-sma/supported-platforms.md
section: Migrations
---

# Snowpark Migration Accelerator: Supported Platforms

## Supported Platforms

The Snowpark Migration Accelerator (SMA) currently supports the following programming languages as source code:

* Python
* Scala
* SQL

The SMA analyzes both code files and notebook files to identify any usage of Spark API and other third-party APIs. For a complete list of file types that SMA can analyze, please refer to [Supported Filetypes](supported-filetypes.md).

### SQL Dialects

The Snowpark Migration Accelerator (SMA) can analyze code files to identify SQL elements. Currently, SMA can detect SQL code written in the following formats:

* Spark SQL
* Hive QL
* Databricks SQL

### SQL Assessment and Conversion Guidelines

While Spark SQL and Snowflake SQL are highly compatible, some SQL code may not convert perfectly.

SQL analysis is only possible when the SQL is received in the following ways:

* A SQL cell within a supported notebook file
* A .sql or .hql file
* A complete string passed to a `spark.sql` statement.

  Some variable substitutions are not supported. Here are a few examples:

  + Parsed:

    ```python
    spark.sql("select * from TableA")
    ```

    New SMA scenarios supported include the following:

    ```python
    # explicit concatenation
    spark.sql("select * from TableA" + ' where col1 = 1')

    # implicit concatenation (juxtaposition)
    spark.sql("select * from TableA" ' where col1 = 1')

    # var initialized with sql in previous lines before execution on same scope
    sql = "select * from TableA"
    spark.sql(sql)

    # f-string interpolation:
    spark.sql(f"select * from {varTableA}")

    # format kindof interpolation
    spark.sql("select * from {}".format(varTableA))

    # mix var with concat and f-string interpolation
    sql = f"select * from {varTableA} " + f'where {varCol1} = 1'
    spark.sql(sql)
    ```
  + Not Parsed:

    ```python
    some_variable = "TableA"
    spark.sql("select * from" + some_variable)
    ```

  SQL elements are accounted for in the object inventories, and a readiness score is generated specifically for SQL.

---
title: Snowpark Migration Accelerator:  Tool Execution
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/project-overview/tool-execution.md
section: Migrations
---

# Snowpark Migration Accelerator: Tool Execution

After setting up your project, you can run the Snowpark Migration Accelerator (SMA).

## Assessment Process

The Assessment process performs an extended evaluation of your source code to determine which conversion type fits best.

This process is composed of three distinct phases:

* **Loading Source Code**: SMA scans all files in the input directory to create a file inventory. From this inventory, it builds a semantic model using code from the specified file extensions.
* **Analyzing Source Code**: SMA analyzes the source code to determine which conversion type fits best.
* **Generating Results**: SMA generates the output files needed to display the assessment report. The output varies depending on the conversion type selected.

After all three phases are complete, the Assessment Results page is automatically displayed.

## SCOS Conversion Process

The SCOS Conversion process converts your source code to Snowpark Connect (SCOS) code.

The application begins scanning all files in the input directory. The SCOS Conversion process consists of three distinct phases:

* **Loading Source Code**: SMA scans all files in the input directory to create a file inventory. From this inventory, it builds a semantic model using code from the specified file extensions.
* **Analyzing Source Code**: During this main phase, SMA creates an Abstract Syntax Tree (AST) to represent your source code’s functionality. While building the AST, it also creates a symbol table to track elements and functions throughout the conversion process. This symbol table helps generate all output reports. In conversion mode, SMA identifies elements in the AST that have Snowflake equivalents and maps them to their corresponding Snowflake functions.
* **Writing Results**: In the final step, SMA generates output files. For the SCOS Conversion process, SMA produces the converted code in the specified output folder.

After all three phases are complete, the SCOS Conversion Results page is automatically displayed.

## Snowpark API Conversion Process

The Snowpark API Conversion process converts your source code to Snowpark API code.

The application requires you to select whether to use default settings or customize the settings. For more information on customizing settings, refer to the [Conversion Settings](configuration-and-settings.md) section.

After configuration is complete, the tool begins scanning all files in the input directory. The Snowpark API Conversion process consists of three distinct phases:

* **Loading Source Code**: SMA scans all files in the input directory to create a file inventory. From this inventory, it builds a semantic model using code from the specified file extensions.
* **Analyzing Source Code**: During this main phase, SMA creates an Abstract Syntax Tree (AST) to represent your source code’s functionality. While building the AST, it also creates a symbol table to track elements and functions throughout the conversion process. This symbol table helps generate all output reports. In conversion mode, SMA identifies elements in the AST that have Snowflake equivalents and maps them to their corresponding Snowflake functions.
* **Writing Results**: In the final step, SMA generates output files. For the Snowpark API Conversion process, SMA produces the converted code in the specified output folder.

After all three phases are complete, the Snowpark API Conversion Results page is automatically displayed.

---
title: Snowpark Migration Accelerator:  Translation Reference Overview
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/translation-reference-overview.md
section: Migrations
---

# Snowpark Migration Accelerator: Translation Reference Overview

The Snowpark Migration Accelerator (SMA) converts source code into Snowflake-compatible formats. This section explains which elements and formats are compatible with Snowflake, helping you understand what can be successfully migrated.

This reference guide is continuously updated but may not reflect the most recent changes in the Snowpark Migration Accelerator (SMA). For the most accurate assessment of your specific workload’s readiness, please refer to the results generated by running SMA directly.

This reference guide is continuously updated, and your feedback is valuable for its improvement. You can help enhance the mappings or add comments by using the [report an issue option](../user-guide/project-overview/configuration-and-settings.md) in the SMA.

Additional information will be provided.

---
title: Snowpark Migration Accelerator:  Union
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/union.md
section: Migrations
---

# Snowpark Migration Accelerator: Union

## Description

Merges two subqueries into a single query. Databricks SQL provides three set operators that allow you to combine queries:

* `EXCEPT` - Retrieves all rows from the first query that do not appear in the second query
* `INTERSECT` - Returns only the rows that appear in both queries
* `UNION` - Combines the results of two or more queries into a single result set

[Databricks SQL Language Reference UNION](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-setops.html)

Set operators enable you to combine multiple queries into a single result. For more details, see [Snowflake SQL Language Reference UNION](https://docs.snowflake.com/en/sql-reference/operators-query).

### Syntax

```bnf
subquery1 { { UNION [ ALL | DISTINCT ] |
              INTERSECT [ ALL | DISTINCT ] |
              EXCEPT [ ALL | DISTINCT ] } subquery2 } [...] }
```

```bnf
[ ( ] <query> [ ) ] { INTERSECT | { MINUS | EXCEPT } | UNION [ ALL ] } [ ( ] <query> [ ) ]
[ ORDER BY ... ]
[ LIMIT ... ]
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
CREATE TEMPORARY VIEW number1(c) AS VALUES (3), (1), (2), (2), (3), (4);

CREATE TEMPORARY VIEW number2(c) AS VALUES (5), (1), (1), (2);
```

#### Snowflake

```sql
CREATE TEMPORARY TABLE number1(c int);
INSERT INTO number1 VALUES (3), (1), (2), (2), (3), (4);

CREATE TEMPORARY TABLE number2(c int);
INSERT INTO number2 VALUES (5), (1), (1), (2);
```

### Pattern code

#### Databricks

```sql
-- EXCEPT (MINUS) Operator:
SELECT c FROM number1 EXCEPT SELECT c FROM number2;

SELECT c FROM number1 MINUS SELECT c FROM number2;

-- EXCEPT ALL (MINUS ALL) Operator:
SELECT c FROM number1 EXCEPT ALL (SELECT c FROM number2);

SELECT c FROM number1 MINUS ALL (SELECT c FROM number2);

-- INTERSECT Operator:
(SELECT c FROM number1) INTERSECT (SELECT c FROM number2);

-- INTERSECT DISTINCT Operator:
(SELECT c FROM number1) INTERSECT DISTINCT (SELECT c FROM number2);

-- INTERSECT ALL Operator:
(SELECT c FROM number1) INTERSECT ALL (SELECT c FROM number2);

-- UNION Operator:
(SELECT c FROM number1) UNION (SELECT c FROM number2);

-- UNION DISTINCT Operator:
(SELECT c FROM number1) UNION DISTINCT (SELECT c FROM number2);

-- UNION ALL Operator:
SELECT c FROM number1 UNION ALL (SELECT c FROM number2);
```

**EXCEPT (MINUS) Operator:** The EXCEPT operator, also known as MINUS, removes rows from the first query that appear in the result set of the second query. It returns only unique rows from the first query that do not exist in the second query.

| c |
| --- |
| 3 |
| 4 |

**EXCEPT ALL (MINUS ALL) Operator: Removes Duplicate Records**

| c |
| --- |
| 3 |
| 3 |
| 4 |

**INTERSECT Operator:** Returns only the rows that appear in both result sets, eliminating duplicates. It compares the results of two or more SELECT statements and returns only the matching records. Returns only the rows that appear in both result sets, eliminating duplicates.

| c |
| --- |
| 1 |
| 2 |

**INTERSECT DISTINCT Operator:** Returns only unique rows that appear in both result sets, eliminating any duplicates. Returns only unique rows that appear in both queries, eliminating any duplicates from the result set.

| c |
| --- |
| 1 |
| 2 |

**INTERSECT ALL Operator:** Returns all matching rows from multiple queries, including duplicates. Unlike the standard INTERSECT operator, which removes duplicates, INTERSECT ALL preserves duplicate rows in the final result set. Returns all rows that appear in both result sets, including duplicates. Unlike INTERSECT, which removes duplicates, INTERSECT ALL preserves duplicate rows based on their frequency in both sets.

| c |
| --- |
| 1 |
| 2 |
| 2 |

**UNION Operator:** The UNION operator combines the results of two or more SELECT statements into a single result set. It removes duplicate rows from the combined result set by default. The UNION operator combines the results of two or more SELECT statements into a single result set. It removes duplicate rows from the combined results.

| c |
| --- |
| 1 |
| 3 |
| 5 |
| 4 |
| 2 |

**UNION DISTINCT Operator:** The UNION DISTINCT operator combines two or more result sets and removes any duplicate rows from the final output. It returns only unique rows from all the combined queries. The UNION DISTINCT operator combines rows from two or more queries while removing any duplicate rows from the final result set.

| c |
| --- |
| 1 |
| 3 |
| 5 |
| 4 |
| 2 |

**UNION ALL Operator:** The UNION ALL operator combines rows from two or more queries without removing duplicate records. Unlike the UNION operator, UNION ALL retains all rows, including duplicates, making it faster to execute since it doesn’t need to perform duplicate checking. This operator combines the results of two or more SELECT statements and includes all rows, including duplicates. Unlike UNION, which removes duplicate rows, UNION ALL retains all rows from all SELECT statements.

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 2 |
| 3 |
| 4 |
| 5 |
| 1 |
| 1 |
| 2 |

#### Snowflake

```sql
-- EXCEPT (MINUS) Operator
SELECT c FROM number1 EXCEPT SELECT c FROM number2;

SELECT c FROM number1 MINUS SELECT c FROM number2;

-- EXCEPT ALL (MINUS ALL) Operator:
SELECT number1.c FROM number1
LEFT JOIN number2
    ON number1.c = number2.c
WHERE number2.c IS NULL;
-- ** MSC-WARMING - MSC-S000# - EXCEPT ALL IS TRANSFORMED TO A LEFT JOIN. **

SELECT number1.c FROM number1
LEFT JOIN number2
    ON number1.c = number2.c
WHERE number2.c IS NULL;
-- ** MSC-WARMING - MSC-S000# - MINUS ALL IS TRANSFORMED TO A LEFT JOIN. **

-- INTERSECT Operator:
(SELECT c FROM number1) INTERSECT (SELECT c FROM number2);

-- INTERSECT DISTINCT Operator:
(SELECT c FROM number1) INTERSECT (SELECT c FROM number2);

-- INTERSECT ALL Operator:
SELECT DISTINCT number1.c FROM number1
INNER JOIN number2
    ON number1.c = number2.c;
-- ** MSC-WARMING - MSC-S000# - INTERSECT ALL IS TRANSFORMED TO A INNER JOIN. **

-- UNION Operator:
(SELECT c FROM number1) UNION (SELECT c FROM number2);

-- UNION DISTINCT Operator:
(SELECT c FROM number1) UNION DISTINCT (SELECT c FROM number2);

-- UNION ALL Operator:
SELECT c FROM number1 UNION ALL (SELECT c FROM number2);
```

**EXCEPT (MINUS) Operator: Removes Duplicate Records**

The EXCEPT operator, also known as MINUS, compares two queries and returns only the unique records from the first query that do not appear in the second query. It eliminates duplicate rows from the result set.

| c |
| --- |
| 3 |
| 4 |

**EXCEPT ALL (MINUS ALL) Operator: Removes Duplicate Rows**

| c |
| --- |
| 3 |
| 3 |
| 4 |

**INTERSECT Operator:**

| c |
| --- |
| 1 |
| 2 |

**INTERSECT DISTINCT Operator:**

| c |
| --- |
| 1 |
| 2 |

**INTERSECT ALL Operator:**

| c |
| --- |
| 1 |
| 2 |
| 2 |

**UNION Operator:**

| c |
| --- |
| 1 |
| 3 |
| 5 |
| 4 |
| 2 |

**UNION DISTINCT Operator:**

| c |
| --- |
| 1 |
| 3 |
| 5 |
| 4 |
| 2 |

**UNION ALL Operator:**

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 2 |
| 3 |
| 4 |
| 5 |
| 1 |
| 1 |
| 2 |

### Known Issues

No related EWIs

### Related EWIs

* MSC-S000#: A SET operator with the ALL keyword is converted into a JOIN operation.

---
title: Snowpark Migration Accelerator:  Using
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-ddl/create-table/using.md
section: Migrations
---

# Snowpark Migration Accelerator: Using

## Description

The USING command in Spark specifies which file format should be used when creating a table. Common formats include CSV, JSON, and AVRO. For more detailed information about the Create Table USING command, refer to the [Databricks documentation](https://docs.databricks.com/en/archive/spark-sql-2.x-language-manual/create-table.html).

## Syntax

```sql
CREATE TABLE [IF NOT EXISTS] [db_name.]table_name
  [(col_name1 col_type1 [COMMENT col_comment1], ...)]
  USING data_source
  [OPTIONS (key1 [ = ] val1, key2 [ = ] val2, ...)]
  [PARTITIONED BY (col_name1, col_name2, ...)]
  [CLUSTERED BY (col_name3, col_name4, ...) INTO num_buckets BUCKETS]
  [LOCATION path]
  [COMMENT table_comment]
  [TBLPROPERTIES (key1 [ = ] val1, key2 [ = ] val2, ...)]
  [AS select_statement]
```

## Sample source patterns

The `USING` data source statement is not supported in Snowflake. During migration, this statement will be commented out and marked with an Error, Warning, and Issue (EWI) message indicating that it is unsupported.

### Sample data

```sql
CREATE TABLE table1
(
id INTEGER
) USING DELTA;
```

```sql
CREATE TABLE table1
(
id INTEGER
) /*** MSC-WARNING - MSCEWI# - SNOWFLAKE DOES NOT SUPPORT USING STATEMENT ***/
-- USING DELTA;
```

## Known Issues

Snowflake does not support the USING data source clause in SQL statements.

## Related EWIs

* Snowflake does not support the USING statement in SQL queries.

---
title: Snowpark Migration Accelerator:  Using SMA with Docker
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/using-snowconvert-in-a-ubuntu-docker-image.md
section: Migrations
---

# Snowpark Migration Accelerator: Using SMA with Docker

Using the Linux Command Line Interface (CLI) for Snowpark Migration Accelerator with Docker: A Step-by-Step Guide

## Dependencies

The following software must be installed on your computer before proceeding:

* [Docker desktop](https://docs.docker.com/desktop/windows/install/)
* [Visual Code](https://code.visualstudio.com/download)
* [Docker Extension in Visual Code](https://marketplace.visualstudio.com/items?itemName=ms-azuretools.vscode-docker)

## Steps

### Create the image config file

Create a file named “Dockerfile” (without a file extension). This file will contain the configuration needed to build the Docker image.

```bash
FROM ubuntu
COPY snowCli /dockerDestinationFolder
ENV DOTNET_SYSTEM_GLOBALIZATION_INVARIANT=1
RUN apt-get update
RUN apt-get install -y ca-certificates openssl
```

When using the [Ubuntu](https://hub.docker.com/_/ubuntu) image to run the Snowpark Migration Accelerator CLI for Linux, you need to add two dependencies to the Dockerfile:

1. Enable [System.Globalization.Invariant](https://docs.microsoft.com/en-us/dotnet/core/run-time-config/globalization) setting
2. Install OpenSSL

These dependencies are required to activate the license and establish a secure HTTPS connection for license validation.

In addition to installing dependencies, the `COPY` command copies files from your local machine into the Docker image. For example, the *snowCLI* file (which must be in the same directory as the Dockerfile) will be copied to `/dockerDestinationFolder` within the Docker image.

### Build the image

Start the Docker Desktop application.

Open Visual Studio Code and locate the “Dockerfile”. If you have the Docker extension installed in Visual Studio Code, it will automatically recognize the Dockerfile as a Docker configuration file. To build the Docker image, right-click on the “Dockerfile” and select “Build image…”

Enter a name for the image when prompted at the top of Visual Studio Code.

Enter any name and press “Enter.” Docker will then create the container by downloading the Ubuntu image, installing required dependencies, and copying the specified files. Wait for the terminal to complete the process. A success message will appear when the image has been built correctly.

```bash
> Executing task: docker build --pull --rm -f "Dockerfile" -t release:Ubuntu "." <

[+] Build completed in 2.0 seconds. All 11 tasks finished successfully.
```

### Run the image

Launch the recently created image by navigating to the Images tab in Docker Desktop and clicking the Run button.

In Visual Studio Code, navigate to the Docker tab. Under the “Containers” section, you will find the recently executed image. Click the arrow next to it to expand and browse through its file directory structure.

### Connect to the container

Finally, to access the container’s command line interface, right-click on the running container and select “Attach shell”. This will open a Terminal window where you can execute any command you need.

You will find your personal files in this location. These files were previously selected for copying using the COPY command in the configuration file.

---
title: Snowpark Migration Accelerator:  Using SMA with Jupyter Notebooks
source: https://docs.snowflake.com/en/migrations/sma-docs/support/frequently-asked-questions-faq/using-sma-with-jupyter-notebooks.md
section: Migrations
---

# Snowpark Migration Accelerator: Using SMA with Jupyter Notebooks

## Can I use Python notebook (.ipynb files) in the tool?

**Yes**! Place your notebook files (.ipynb) in the source directory you select as input for the tool. The notebooks can be located in any subfolder within that directory. You can include both Python files (.py) and notebook files (.ipynb) in your source directory or its subfolders. The tool will process all compatible files regardless of their location in the directory structure.

*Converting notebook files (.ipynb) to Python (.py) files offers several advantages:*

1. Better version control: Python files are easier to track changes and manage in version control systems like Git
2. Improved collaboration: Team members can review and edit code more efficiently in standard Python files
3. Easier automation: Python files can be directly executed in automated pipelines and scheduled jobs
4. Cleaner code organization: Python files encourage better code structure and modularity
5. Reduced file size: Python files are typically smaller than notebook files, which contain additional metadata

You have two options:

1. Keep your notebooks as they are if you plan to continue using them in notebook format. SMA can analyze and convert notebooks directly.
2. Extract the Python code into .py files if you want to move away from using notebooks. While this is possible through a workaround, it’s not necessary since SMA can process both notebooks and Python files.

To extract only the Python code from Jupyter notebook files, you can use the nbconvert utility. Here’s how:

1. Install the [nbconvert](https://pypi.org/project/nbconvert/) package using one of these commands:

   * For Windows/Linux: `pip install nbconvert`
   * For MacOS: `pip3 install nbconvert` or `python3 -m pip install nbconvert`
2. Make a backup copy of your Jupyter notebook directory
3. Convert all Jupyter notebooks to Python scripts using the command line:

   * For Windows/Linux:
     `find /path/to/folder/with/notebooks -name '*.ipynb' | xargs python -m nbconvert --to script`
   * For MacOS:
     `find /path/to/folder/with/notebooks -name '*.ipynb' | xargs python3 -m nbconvert --to script`

   This will create Python script files in the same directory as your notebooks.
4. Process the converted Python files by running SMA for Python on the output directory.

---
title: Snowpark Migration Accelerator:  Using the SMA CLI
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/using-the-sma-cli/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Using the SMA CLI

## Description

The Snowpark Migration Accelerator (SMA) provides a Command Line Interface (CLI) that allows you to perform various operations. Using this CLI, you can execute the code processor, manage access codes (install or display them), and perform any other task that’s available in the SMA application.

The SMA uses a single code processor that works with all [supported source platforms](../before-using-the-sma/supported-platforms.md). You don’t need to provide any additional arguments for this processor.

## Installation

Before installing the Command Line Interface (CLI), you need to [download it](../../general/getting-started/download-and-access.md) to a location you can access. Choose the installation guide that matches your operating system:

* [Linux](../../general/getting-started/installation/linux-installation.md)
* [Windows](../../general/getting-started/installation/windows-installation.md)
* [MacOS](../../general/getting-started/installation/macos-installation.md)

## Commands

To run the tool, you need to set up a sequence of commands based on your requirements. You can use either the **long-command** or **short-command** options with the following syntax:

```bash
sma [command] [argument] [command] [argument] ...
```

The following commands are available. Click any command to view its detailed explanation.

| Long-command | Short-Command | Description |
| --- | --- | --- |
| –help | -h | Displays help documentation. |
| –version | -v | Displays current tool version. |
| install-access-code | install-ac | Installs a new access code. |
| show-access-code | show-ac | Displays all installed access codes. |
| workspace-estimator | we | Workspace Estimator commands. |
| –input | -i | Specifies the input folder location. |
| –output | -o | Specifies the output folder location. |
| –assessment | -a | Runs the tool in assessment mode. |
| [–mapDirectory](additional-parameters.md) | -m | Specifies the folder containing custom mapping files. |
| –enableJupyter | -j | Enables or disables conversion of Databricks notebooks to Jupyter format. |
| –sql | -f | Specifies which database engine syntax to use for SQL commands. |
| –customerEmail | -e | Sets the customer email address. |
| –customerCompany | -c | Sets the customer company name. |
| –projectName | -p | Sets the project name. |
| –yes | -y | Skips confirmation prompts during execution. |

### Installing an access code

To begin the code conversion process, you must first install an access code. You can do this in two ways:

1. Enter the access code directly
2. Provide the path to a file containing the access code (This method is helpful when you’re working offline or behind a restrictive firewall)

You can install the access code by running the following command:

```bash
sma install-access-code <access-code>
```

This command produces the same result as the previous command.

```bash
sma install-ac <access-code>
```

To install an access code from a file, use either the `--file` or `-f` option with your command, like this:

```bash
sma install-access-code --file <path-to-file>
or
sma install-access-code -f <path-to-file>
```

If an error occurs while installing the license, an error message will be displayed.

To request an access code, please contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com)

### Checking which access codes are installed

To check which access codes are currently installed on your computer, use this command:

```bash
sma show-access-code
```

This command displays details about all access codes that are currently installed on your computer.

### Converting

After installing a valid license, you can run the code processor to convert your code. To start the conversion process, you need to provide the following required arguments:

* **Input path:** The folder containing your original source code
* **Output path:** The folder where you want the converted code to be saved

#### Project Information

When you run the code processor for the first time, you need to provide certain arguments. These arguments will be saved and used for future executions. The required arguments are the same as those needed when [creating a new project in the application](../project-overview/project-setup.md).

* **Customer Email:** Enter a valid email address
* **Customer Company:** Enter your company name
* **Project Name:** Enter a name for your project

This example demonstrates how to execute the code processor using only the essential requirements:

```bash
sma -i <input-path> -o <output-path> -e <client email> -c <client company> -p <project name> <additional-parameters>
```

After entering the sequence of commands and pressing “Enter”, the tool will display your current settings and ask you to confirm before starting the process.

Would you like to add or modify any arguments? Type “n” to cancel or “y” to proceed.

#### Skipping the Project Confirmation

To bypass the confirmation prompt shown above, add either **–yes** or **-y** as an argument. This is particularly important when using the tool programmatically, as the confirmation prompt will appear every time without these parameters.

For more information about all available parameters, please refer to this [link](additional-parameters.md).

### Performing an Assessment

When performing an assessment, add the `--assessment` or `-a` option to the standard conversion commands. Here are examples of how the commands should look:

```bash
sma --input <input-path> --output <output-path> --assessment <additional-parameters>
```

Each of these commands can accept additional parameters. For more details, please refer to the “Converting” section.

### Checking the tool version

To check the tool version and code-processing engine, you can use any of these commands:

```bash
sma --version
sma -v
```

### Enabling conversion of Databricks notebooks to Jupyter Notebooks

This option converts Python (.python) and/or Scala (.scala) source files into Jupyter Notebook (.ipynb) files. The conversion works regardless of whether the original files were exported from notebooks or were regular code files.

To convert Jupyter notebooks, add either the `'--enableJupyter'` flag or its shorthand version `'-j'` to your command.

```bash
sma -i <input-path> -o <output-path> --enableJupyter
```

### Setting the SQL Flavor of the source code

You can specify which SQL syntax to use when a SQL command is detected. Use either the command `'--sql'` or its shortcut `'-f'`. The supported syntax options are ‘SparkSql’ (which is the default), ‘HiveSql’, and ‘Databricks’.

```bash
sma --input <input-path> --output <output-path> --sql SparkSql
sma --input <input-path> --output <output-path> --sql HiveSql
sma --input <input-path> --output <output-path> --sql Databricks
```

### Workspace Estimator

The `workspace-estimator` (or `we`) verb provides commands for estimating Databricks workspace usage. It connects to a Databricks workspace, extracts metadata, and uploads the results to Snowflake for analysis.

The following subcommands are available:

* `sma we dbx run` – Runs both extraction and upload against a Databricks workspace in a single invocation.
* `sma we dbx extract` – Extracts workspace metadata to a local `.zip` file without uploading.
* `sma we dbx upload` – Uploads a previously extracted `.zip` file to Snowflake.

For full option tables and usage examples, refer to [the Workspace Estimator section of the CLI walkthrough](../../use-cases/sma-cli-walkthrough.md).

### Need more help?

To view general help information for the Command Line Interface (CLI), you can use any of these commands:

```bash
sma --help
sma -h
```

To learn more about specific commands, you can execute this command:

```bash
sma <command> --help
```

To learn more about installing an access code, run the command `sma install-access-code --help`.

---
title: Snowpark Migration Accelerator:  Values
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/values.md
section: Migrations
---

# Snowpark Migration Accelerator: Values

## Description

Creates a temporary table within the query that can be used immediately. For more information, see [Databricks SQL Language Reference VALUES](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-values.html).

The VALUES sub-clause in a SELECT statement’s FROM clause lets you define a set of fixed values to create a specific number of rows. ([Snowflake SQL Language Reference VALUES](https://docs.snowflake.com/en/sql-reference/constructs/values))

### Syntax

```bnf
VALUES {expression | ( expression [, ...] ) } [, ...] [table_alias]

SELECT expression [, ...] [table_alias]
```

```bnf
SELECT ...
FROM ( VALUES ( <expr> [ , <expr> [ , ... ] ] ) [ , ( ... ) ] ) [ [ AS ] <table_alias> [ ( <column_alias> [, ... ] ) ] ]
[ ... ]
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
CREATE TEMPORARY VIEW number1(c) AS VALUES (3), (1), (2), (2), (3), (4);
```

#### Snowflake

```sql
CREATE TEMPORARY TABLE number1(c int);
INSERT INTO number1 VALUES (3), (1), (2), (2), (3), (4);
```

### Pattern code

#### Databricks

```sql
-- single row, without a table alias
> VALUES ("one", 1);
  one    1

-- Multiple rows, one column
> VALUES 1, 2, 3;
 1
 2
 3

-- three rows with a table alias
> SELECT data.a, b
    FROM VALUES ('one', 1),
                ('two', 2),
                ('three', NULL) AS data(a, b);
   one    1
   two    2
 three NULL

-- complex types with a table alias
> SELECT a, b
  FROM VALUES ('one', array(0, 1)),
              ('two', array(2, 3)) AS data(a, b);
 one [0, 1]
 two [2, 3]

-- Using the SELECT syntax
> SELECT 'one', 2
 one 2
```

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 4 |

#### Snowflake

```sql
-- single row, without a table alias
SELECT * FROM (VALUES ('one', 1));

-- Multiple rows, one column
SELECT * FROM (VALUES (1), (2), (3));

-- three rows with a table alias
SELECT a, b
    FROM (VALUES ('one', 1),
                ('two', 2),
                ('three', NULL)) AS data(a, b);

-- complex types with a table alias
SELECT a, b
    FROM
    (VALUES ('one', '[0, 1]'),
            ('two', '[2, 3]')
            ) AS data(a, b);

-- Using the SELECT syntax
SELECT 'one', 2
```

| c |
| --- |
| 3 |
| 1 |
| 2 |
| 4 |

### Known Issues

No issues were found

### Related EWIs

No related Enterprise Warehouse Integrations

---
title: Snowpark Migration Accelerator:  Where
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/where.md
section: Migrations
---

# Snowpark Migration Accelerator: Where

## Description

Filters the data returned by a query or subquery based on specified conditions. ([Databricks SQL Language Reference WHERE](https://docs.databricks.com/en/sql/language-manual/sql-ref-syntax-qry-select-where.html))

The `WHERE` clause filters data by defining specific conditions that must be met. ([Snowflake SQL Language Reference WHERE](https://docs.snowflake.com/en/sql-reference/constructs/where))

### Syntax

```sql
WHERE <boolean_expression>
```

```sql
...
WHERE <predicate>
[ ... ]
```

## Sample Source Patterns

### Setup data

#### Databricks

```sql
CREATE TABLE person (id INT, name STRING, age INT);
INSERT INTO person VALUES
    (100, 'John',   30),
    (200, 'Mary', NULL),
    (300, 'Mike',   80),
    (400, 'Dan' ,   50);
```

#### Snowflake

```sql
 CREATE TABLE person (id INT, name STRING, age INT);
 INSERT INTO person VALUES
    (100, 'John',   30),
    (200, 'Mary', NULL),
    (300, 'Mike',   80),
    (400, 'Dan' ,   50);
```

### Pattern code

#### Databricks

```sql
-- 1. Comparison operator in `WHERE` clause.
SELECT * FROM person WHERE id > 200 ORDER BY id;

-- 2. Comparison and logical operators in `WHERE` clause.
SELECT * FROM person WHERE id = 200 OR id = 300 ORDER BY id;

-- 3. IS NULL expression in `WHERE` clause.
SELECT * FROM person WHERE id > 300 OR age IS NULL ORDER BY id;

-- 4. Function expression in `WHERE` clause.
SELECT * FROM person WHERE length(name) > 3 ORDER BY id;

-- 5. `BETWEEN` expression in `WHERE` clause.
SELECT * FROM person WHERE id BETWEEN 200 AND 300 ORDER BY id;

-- 6. Scalar Subquery in `WHERE` clause.
SELECT * FROM person WHERE age > (SELECT avg(age) FROM person);

-- 7. Correlated Subquery in `WHERE` clause.
SELECT * FROM person AS parent
   WHERE EXISTS (SELECT 1 FROM person AS child
                  WHERE parent.id = child.id
                    AND child.age IS NULL);
```

1. Use comparison operators (such as =, >, <, >=, <=) in the `WHERE` clause to filter data.

| ID | NAME | AGE |
| --- | --- | --- |
| 300 | Mike | 80 |
| 400 | Dan | 50 |

---

2. Use comparison operators (=, <, >, <=, >=, !=) and logical operators (AND, OR, NOT) in the `WHERE` clause to filter data.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

3. Using `IS NULL` in the `WHERE` clause to check for null values.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 400 | Dan | 50 |

---

4. Using function expressions within a `WHERE` clause.

| ID | NAME | AGE |
| --- | --- | --- |
| 100 | John | 30 |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

5. Using the `BETWEEN` operator in a `WHERE` clause to filter data based on a range of values.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

6. Using a Scalar Subquery within a `WHERE` clause.

| ID | NAME | AGE |
| --- | --- | --- |
| 300 | Mike | 80 |

---

7. A subquery in the `WHERE` clause that references columns from the outer query.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |

#### Snowflake

```sql
-- 1. Comparison operator in `WHERE` clause.
SELECT * FROM person WHERE id > 200 ORDER BY id;

-- 2. Comparison and logical operators in `WHERE` clause.
SELECT * FROM person WHERE id = 200 OR id = 300 ORDER BY id;

-- 3. IS NULL expression in `WHERE` clause.
SELECT * FROM person WHERE id > 300 OR age IS NULL ORDER BY id;

-- 4. Function expression in `WHERE` clause.
SELECT * FROM person WHERE length(name) > 3 ORDER BY id;

-- 5. `BETWEEN` expression in `WHERE` clause.
SELECT * FROM person WHERE id BETWEEN 200 AND 300 ORDER BY id;

-- 6. Scalar Subquery in `WHERE` clause.
SELECT * FROM person WHERE age > (SELECT avg(age) FROM person);

-- 7. Correlated Subquery in `WHERE` clause.
SELECT * FROM person AS parent
   WHERE EXISTS (SELECT 1 FROM person AS child
                  WHERE parent.id = child.id
                    AND child.age IS NULL);
```

1. Use comparison operators (such as =, >, <, >=, <=) in the `WHERE` clause to filter data.

| ID | NAME | AGE |
| --- | --- | --- |
| 300 | Mike | 80 |
| 400 | Dan | 50 |

---

2. Using comparison operators (such as =, <, >, <=, >=) and logical operators (such as AND, OR, NOT) in the `WHERE` clause to filter data.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

3. Using `IS NULL` in the `WHERE` clause to check for null values.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 400 | Dan | 50 |

---

4. Using function expressions within a `WHERE` clause.

| ID | NAME | AGE |
| --- | --- | --- |
| 100 | John | 30 |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

5. Using the `BETWEEN` operator in a `WHERE` clause to filter data based on a range of values.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |
| 300 | Mike | 80 |

---

6. Using a Scalar Subquery within a `WHERE` clause.

| ID | NAME | AGE |
| --- | --- | --- |
| 300 | Mike | 80 |

---

7. Correlated Subquery in `WHERE` clause.

| ID | NAME | AGE |
| --- | --- | --- |
| 200 | Mary | null |

### Known Issues

No issues were found

### Related EWIs

No related EWIs

---
title: Snowpark Migration Accelerator:  Workarounds
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/workarounds.md
section: Migrations
---

# Snowpark Migration Accelerator: Workarounds

Some code elements and functions cannot be automatically converted at this time. We call these “Workarounds.” While we provide suggested solutions for each workaround, they will require either manual changes or specific decisions from the developer.

This page contains a list of available workarounds and their detailed descriptions.

Available soon!

---
title: Snowpark Migration Accelerator: Assessment
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment

The Snowpark Migration Accelerator (SMA) begins with a comprehensive assessment of your source code. Understanding the assessment process is essential to maximize its benefits and ensure a successful migration.

This section covers the following topics:

* [How the Assessment Works](how-the-assessment-works.md)
* [Getting Started with Assessment](assessment-quick-start.md)
* [Understanding Your Assessment Results](understanding-the-assessment-summary.md)
* [Understanding Readiness Scores](readiness-scores.md)

The following additional topics may be helpful for reference, although they are not directly related to the assessment:

* [Issue Analysis](../../issue-analysis/approach.md)
* [Glossary](../../support/glossary.md)

---
title: Snowpark Migration Accelerator: Assessment Output - Reports Folder
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/interpreting-the-assessment-output/assessment-output-reports-folder.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment Output - Reports Folder

A complete set of output files and reports will be generated when you use the Snowpark Migration Accelerator (SMA). To see the full list of generated files and reports, refer to the [Output Reports section of this documentation](../../../user-guide/scos-conversion/output-reports/README.md).

The assessment generates .csv files that can be opened with any spreadsheet software. The detailed report provides a summary of these files and serves as a starting point for evaluating the results. While we’ll examine some key .csv files to understand the migration requirements, we won’t cover all of them. For a complete list of inventory files generated by the Snowpark Migration Accelerator (SMA), refer to [the SMA Inventories section of this documentation](../../../user-guide/scos-conversion/output-reports/sma-inventories.md).

To view the reports, click the “VIEW REPORTS” button at the bottom of the screen. This will open your file explorer to the directory containing the reports.

Let’s examine what information we can gather from the Detailed Report.

> **Note:**
>
> The version of the detailed report and other inventories shown on this page may differ from what you see when running SMA. The report shown here reflects the tool version available when this walkthrough was created. If you notice significant differences or issues in your results, contact the SMA team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or [report an issue in the tool](../../../user-guide/project-overview/configuration-and-settings.md). You can also use the SMA tool to report documentation issues.

## The Detailed Report

The Detailed Report (.docx) provides a comprehensive summary of the information found in the inventory files. This report is essential for evaluating how well-suited your codebase is for Snowpark migration. While a complete description of the report’s contents is available in the [detailed documentation](../../../user-guide/scos-conversion/output-reports/curated-reports.md), this guide focuses on three key aspects:

1. Important elements to review
2. Their impact on readiness scores
3. How to interpret the results

Review all readiness scores available in your report to understand your migration readiness status.

### The Spark API Readiness Score

Let’s clarify what the Spark API Readiness score means and how it’s calculated. This score is the main readiness indicator produced by the Snowpark Migration Accelerator (SMA). It’s important to note that this score only considers Spark API usage and doesn’t account for third-party libraries or other factors in your code. While this limitation means the score might not tell the complete story, it still serves as a useful starting point for your migration assessment. For more information about third-party library compatibility, refer to the Third Party API Readiness section.

The Conversion Score represents the ratio of Spark API references that can be automatically converted to Snowpark API, compared to the total number of Spark API references found in your code. In this case, 3541 out of 3746 references can be converted. A higher score indicates that more of your code can be automatically migrated to Snowpark. While unconverted code can still be manually adapted to work with Snowpark, this score provides a reliable indication of how well-suited your workload is for automatic migration.

### Third Party Libraries Readiness Score

The Third Party Libraries Readiness Score helps you understand which external APIs are used in your code. This score provides a clear overview of all external dependencies in your codebase.

### Summary Page

The Summary page displays your readiness score and provides an overview of your execution results.

**What to Look For**
Check the readiness score to evaluate how prepared your codebase is for converting Spark API references to Snowpark API references. A high readiness score indicates that the Spark code is well-suited for migration to Snowpark.

### File Summary

The file summary provides an overview of your codebase, including:

* Total lines of code per file extension
* Notebook cell information (if notebooks were analyzed)
* Number of files containing embedded SQL queries

**What Should You Watch For?**

The number and content of files. When you find many files but only a few contain Spark API references, this could mean:

* The application uses Spark minimally (perhaps only for data extraction and loading)
* The source code includes external library dependencies
* The use case needs further investigation to understand how Spark is being utilized

In either scenario, it’s important to thoroughly analyze the use case before proceeding.

### Spark Usage Summary

The Spark Usage Summary provides a detailed breakdown of Spark API references found in your code and identifies which ones can be converted to the Snowpark API. The summary categorizes these references into different types, including DataFrame operations, column manipulations, SparkSession calls, and other API functions.

Each reference is classified into one of seven support statuses. These statuses indicate whether and how a reference can be supported in Snowpark. The detailed definitions of these statuses can be found in the report’s appendixes.

* **Direct:** The function exists in both PySpark and Snowpark and can be used without changes.
* **Rename**: The function exists in both frameworks, but requires a name change in Snowpark.
* **Helper**: The function requires a small modification in Snowpark that can be solved by creating an equivalent helper function.
* **Transformation**: The function needs to be completely rebuilt in Snowpark using different methods or multiple steps to achieve the same result.
* **Workaround**: The function cannot be automatically converted, but there is a documented manual solution available.
* **NotSupported**: The function cannot be converted because Snowflake does not have an equivalent feature. The tool will add an error message to the code.
* **NotDefined**: The PySpark element is not yet included in the conversion tool’s database and will be added in a future update.

What Should You Watch For?

The readiness score is displayed in this section. You can review how many code references will need workarounds versus direct translations. If your code requires many workarounds, helpers, and transformations, we recommend using Snowpark Migration Accelerator (SMA) to help migrate your codebase efficiently.

### Import Calls:

SMA tracks each package or library import as an individual import call. Common and recognized import calls are displayed in the import summary section of the detailed report page. All import calls are recorded in both the local output inventories folder and the assessment database. Note that these import calls have not yet been classified as supported or unsupported in Snowflake.

**What Should You Watch For?**

Third-party libraries not supported by Snowflake can significantly impact your migration readiness. If your code imports libraries like mllib, streaming, or third-party libraries such as graphs, subprocess, or smtplib, you may face migration challenges. While the presence of these libraries doesn’t automatically make migration impossible, it requires a deeper analysis of your use case. In such situations, we recommend consulting with the WLS team for a detailed assessment.

### Snowpark Migration Accelerator Issue Summary

This section provides an overview of potential issues and errors that may occur during workload migration. While detailed information about unconvertible elements is available elsewhere, this section is particularly valuable during the initial stages of the conversion process.

**Common Issues to Watch For**

To find elements that were not converted or have known workarounds, check the Spark reference inventory in your local inventories folder. You can compare these elements with existing mappings by querying the database.

---

## Summary:

The readiness score indicates how prepared your codebase is for Snowpark migration. A score of 80% or higher means your code is mostly ready for migration. If your score is below 60%, you will need to make additional modifications to your code before proceeding.

For this workload, the score exceeds 90%, which indicates excellent compatibility for migration.

The next indicator is **size**. A workload with extensive code but few Spark API references might suggest heavy reliance on third-party libraries. Even if a project has a low readiness score, it can be quickly converted manually if it contains only about 100 lines of code or 5 Spark API references, regardless of automation tools.

For this workload, the size is reasonable and easy to handle. The codebase contains more than 100 files with fewer than 5,000 Spark API references and under 10,000 lines of code. Approximately 98% of these files contain Spark API references, indicating that most of the Python code is Spark-related.

The third indicator to examine is **imported libraries**. The inventory of import statements helps identify which external packages the code uses. If the code relies heavily on third-party libraries, it may require additional analysis. In cases with numerous external dependencies, consult the Workload Services (WLS) team to better understand how these libraries are being used.

In this example, we have some referenced third-party libraries, but none of them are related to Machine Learning, Streaming, or other complex libraries that would be challenging to implement in Snowpark.

Since this workload is suitable for migration to Snowpark, proceed to the next step in the Spark migration process.

---
title: Snowpark Migration Accelerator: Assessment Quick Start
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/assessment-quick-start.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment Quick Start

The Snowpark Migration Accelerator (SMA) helps you analyze your source code and determine which kind of conversion fits best. This guide will show you how to begin the assessment process.

## How to Execute an Assessment

To begin assessing your code, create a new project in the Snowpark Migration Accelerator (SMA) tool.

1. Begin by selecting the **New Project** button.

Complete the required fields in the New Project dialog, including project name, email address, company name, input folder, and output folder. Once all required information is provided, click the **Save** button to create your project.

The project home page displays with an **Analyze Code** card.

The assessment execution screen displays, showing the progress of three stages: **Loading Source Code**, **Analyzing Source Code**, and **Generating Results**. Processing time varies depending on your project size.

Once the assessment is complete, the results screen will display. This screen provides detailed information to help you understand the best conversion option for your source code and a breakdown of the assessment results.

Note that while you can find basic information in the [assessment summary](understanding-the-assessment-summary.md) page above, the complete output folder contains much more detailed information, including a comprehensive multi-page report.

## Next Steps

After the tool completes its analysis, the application displays the assessment results page, showing the best conversion option for your source code. The following tips can help guide you:

* Take Time to Analyze the Assessment Results: The assessment provides valuable insights that can help you create an effective migration strategy. Carefully review the assessment data before starting the conversion process to avoid unnecessary rework and ensure a more efficient migration.

The assessment results guide you on which conversion option best suits your source code. The footer buttons allow you to proceed to the next step, highlighting the recommended option.

* **Primary option** - The recommended conversion option for your source code.
* **Secondary option** - An alternative conversion option for your source code.
* **View Reports** - Opens the folder containing assessment output reports. These include the detailed assessment report, Spark reference inventory, and other analyses of your source codebase. Each report type is explained in detail in this documentation.

---
title: Snowpark Migration Accelerator: Assessment Walkthrough
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Assessment Walkthrough

The Snowpark Migration Accelerator (SMA) analyzes Python, Scala, and SQL code in your project to identify what can be converted from Spark API to Snowpark API. This analysis provides detailed insights about your source code and highlights which components are ready to be migrated to Snowflake.

This guide will help you maximize the value of your assessment. You will learn:

* How to access and run the assessment tool
* How to review your source code to ensure optimal conversion results
* How to understand and analyze the assessment report
* What actions to take after completing the assessment

This walkthrough specifically covers Python code written for Spark. While the Snowpark Migration Accelerator (SMA) supports multiple programming languages and platforms (see [supported languages and platforms](../../user-guide/before-using-the-sma/supported-platforms.md)), we will focus exclusively on Python.

Before you begin, please review the prerequisites on [the next page](walkthrough-setup/README.md). Although we strongly recommend performing an assessment first, you can proceed directly to the conversion process by following our [conversion walkthrough](../conversion-walkthrough.md).

---
title: Snowpark Migration Accelerator: Before Using the SMA
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/before-using-the-sma/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Before Using the SMA

The Snowpark Migration Accelerator (SMA) helps developers analyze and convert code by providing automated tools and features.

> **Note:**
>
> NOTE: Before using the Snowpark Migration Accelerator (SMA), we recommend that you familiarize yourself with its functionality and how to interpret its results. Please review the training materials and resources available on [the getting started page](../../general/getting-started/README.md) to learn how to use the tool effectively.

The quality of your input code directly affects SMA’s performance and analysis capabilities. While SMA is a powerful tool, it can only analyze the code files you provide. Therefore, the better organized and structured your input code is, the more accurate and useful the results will be.

Before you begin using the Snowpark Migration Accelerator (SMA), this section will explain how to maximize its benefits. We will cover:

* [Supported File Types](supported-filetypes.md)
* [Code Extraction Process](code-extraction.md)
* [Pre-Processing Guidelines](pre-processing-considerations.md)

To achieve the best results, it’s important to properly prepare your input data. Here’s how to ensure optimal performance.

---
title: Snowpark Migration Accelerator: Collection
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/snowpark-checkpoints-execution-guide/collection.md
section: Migrations
---

# Snowpark Migration Accelerator: Collection

To follow the collection process, please proceed with the steps outlined below:

1. Open the collection workload in VS Code to begin the process.
2. Generate checkpoints using the `checkpoints.json` file.

To generate checkpoints, you can perform one of the following actions:

1. Generate checkpoints by accepting the suggested message:
2. Execute the “Snowflake: Load All Checkpoints” command:

   Once all checkpoints are successfully loaded, your files should appear as shown below:
3. Run the Python file to execute the checkpoints collection process.

When running a Python file that includes checkpoints, a folder named `snowpark-checkpoints-output` will be created, containing the collection results.

The `checkpoints_collection_results.json` file contains the consolidated results of the collection process.

```json
{
  "results": [
    {
      "timestamp": "2025-05-05 15:06:43",
      "file": "sample.py",
      "line_of_code": 57,
      "checkpoint_name": "sample$BBVOC7$df1$1",
      "result": "PASS"
    },
    {
      "timestamp": "2025-05-05 15:06:53",
      "file": "sample.py",
      "line_of_code": 57,
      "checkpoint_name": "sample$BBVOC7$df2$1",
      "result": "PASS"
    },
    {
      "timestamp": "2025-05-05 15:06:58",
      "file": "sample.py",
      "line_of_code": 57,
      "checkpoint_name": "sample$BBVOC7$df3$1",
      "result": "PASS"
    }
  ]
}
```

The `snowpark-checkpoints-output` folder should be copied into the validation workload to grant access to the collection results. For details on how to proceed with the validation process, refer to the [Validation Section](https://app.gitbook.com/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/499/use-cases/sma-checkpoints-walkthrough/snowpark-checkpoints-execution-guide/validation).

---
title: Snowpark Migration Accelerator: Conclusions
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/migration-lab/conclusions.md
section: Migrations
---

# Snowpark Migration Accelerator: Conclusions

By utilizing the SMA, we were able to accelerate the migration of both a data pipeline and a reporting notebook. The more of each that you have, the more value a tool like the SMA can provide.

And let’s go back to the assessment -> conversion -> validation flow that we have consistently come back to. In this migration, we:

* Setup out project in the SMA
* Ran SMA’s assessment and conversion engine on the code files
* Reviewed the output reporting from the SMA to better understand what we have
* Review what could not be converted by the SMA in VS Code
* Resolve issues and errors
* Resolve session references
* Resolve input/output references
* Run the code locally

  + And run the code in Snowflake
* Ran the newly migrated scripts and validated their success

Snowflake has spent a great deal of time improving its ingestion and data engineering capabilities, just as it has spent time improving migration tools like SnowConvert, the SnowConvert Migration Assistant, and the Snowpark Migration Accelerator. Each of these will continue to improve. Please feel free to reach out if you have any suggestions for migration tooling. These teams are always looking for additional feedback to improve the tools.

---
title: Snowpark Migration Accelerator: Configuration and Settings
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/project-overview/configuration-and-settings.md
section: Migrations
---

# Snowpark Migration Accelerator: Configuration and Settings

Setting up a new project with the Snowpark Migration Accelerator (SMA) is a simple process. This page explains all available settings, updates, and customization options within the application.

## Updating the Application

The Snowpark Migration Accelerator (SMA) automatically checks for new versions when you start the program. If a newer version is available, you’ll see a notification message in the upper-right corner of your screen.

If you select “Update Now,” the application will immediately download and install the latest version.

After the download completes, click “Close and Install” to update the application.

After the application restarts, you can verify the current version number in the top-right corner of the window.

### Check for Updates

To check for a new version of the application, click the “Check for Updates” option in the menu.

If your application is up to date, you will see a notification in the top right corner confirming this status.

If your application is not running the latest version, you will be prompted to update it.

### Conversion Settings

Before starting a conversion, you can change the conversion settings from the **Conversion Settings** page. On this page, choose one of the following options:

* **Customize settings**: Opens the **Conversion settings** dialog so you can adjust advanced conversion options.
* **Default Settings**: Uses the default conversion settings and proceeds with the conversion workflow.

When you open **Customize settings**, the dialog organizes settings by category (for example, **Pandas**, **DBX**, and **Checkpoints**):

For example, under **Pandas**, you can enable **Convert Pandas API to Snowpark API** to automatically convert supported Pandas API usage to the Snowpark Pandas API during conversion.

Select **Save settings** to apply your changes. Select **Reset settings** to restore defaults, or **Cancel** to close the dialog without saving.

## About

To view the version information for your Snowpark Migration Accelerator installation, click the **About Snowpark Migration Accelerator** option in the menu.

To review changes between different versions, check the release notes.

## File Menu

From the File menu, you can create a new project by selecting **New Project**

## Help Menu

The help menu provides several support options tailored to your specific needs.

The following sections describe each option available in the help menu.

### Documentation

To return to the main documentation page (Welcome!), click [here](../../README.md).

### Release Notes

The release notes can be found in the general section of our documentation site. Click [here](../../general/release-notes/README.md) to view them.

### Glossary

The Glossary option directs you to the [Glossary](../../support/glossary.md) section, where you can find definitions of terms used throughout this documentation.

### Contact Us

The Contact Us button opens your default email application. When clicked, it allows you to send an email to [sma-info@snowflake.com](mailto:sma-info%40snowflake.com) to discuss any questions or concerns about the Snowpark Migration Accelerator.

For additional ways to reach the Snowpark Migration Accelerator (SMA) team, visit the [contact us section](../../support/contact-us.md).

### Report an Issue

You can report issues at any time. This includes problems encountered while running the tool or any other concerns related to using the tool.

When you select this issue, a form will be displayed.

To help us resolve your issue quickly, provide:

1. A detailed description of the problem
2. Your email address so we can contact you
3. Any relevant files, such as screenshots showing the problem
4. Log files from when you ran the tool

After submitting the issue, the SMA support team will be notified. If you provided your email address, our team will contact you promptly.

### EULA

This link directs you to the [End User License Agreement (EULA) page](../../general/conversion-software-terms-of-use/README.md) where you can review the terms of use.

---

Let’s start using the Snowpark Migration Accelerator (SMA) tool to convert your code.

---
title: Snowpark Migration Accelerator: Conversion
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Conversion

The Snowpark Migration Accelerator (SMA) begins by evaluating your Spark workload through an assessment process. Beyond assessment, SMA can also transform specific components of your Spark application into Snowpark-compatible code.

This section covers the following topics:

* [How the conversion works](how-the-conversion-works.md)
* [Conversion Quick Start](conversion-quick-start.md)
* Understanding your conversion results
* Reviewing conversion outputs (including reports, logs, and converted code)
* Working with your converted code
* What to do next

The Snowpark Migration Accelerator (SMA) partially converts Spark API references to Snowpark API. While it cannot convert all references, it is designed to be transparent about its limitations. The tool provides clear issue and error codes to guide users on necessary manual interventions. This dual functionality - converting what’s possible while clearly identifying what requires manual attention - is one of SMA’s most valuable features.

When converting code, it’s important to understand how to interpret the issues reported by the Snowpark Migration Accelerator (SMA). We have a dedicated section that covers issue analysis in detail, which includes:

* [Approach to resolving issues](../../issue-analysis/approach.md)
* [Issue codes by Source](../../issue-analysis/issue-codes-by-source/README.md)
* [Troubleshooting through the issues](../../issue-analysis/troubleshooting-the-output-code/README.md)
* [Dealing with Workarounds](../../issue-analysis/workarounds.md)
* [Deploying the output code](../../issue-analysis/deploying-the-output-code.md)

Understanding these concepts is crucial for determining whether a conversion will be successful.

---

Let’s explore how the conversion process works.

---
title: Snowpark Migration Accelerator: Conversion Quick Start
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/conversion-quick-start.md
section: Migrations
---

# Snowpark Migration Accelerator: Conversion Quick Start

## How to Execute a Conversion

Run the conversion process from selecting the **Convert to the Snowpark API** card in the project home page.

## Conversion Setup

On the **Conversion settings** page, choose whether to run the conversion using **Default Settings** or to select **Customize settings** to configure advanced options.

If you select **Customize settings**, SMA opens a **Conversion settings** dialog where you can review and update settings (for example, Pandas conversion options) and then click **Save settings**.

Once your setup is complete, click the **Continue** button. A progress screen will display the current status of your conversion.

After completing this process, you will observe:

1. **Conversion Reports:** View the conversion results by clicking the “View Results” button.
2. **Conversion Output Code**: Access the converted code by clicking the “View Output” button on the conversion results screen.
3. **Retry Conversion**: If you need to convert updated source code, click the “Retry Conversion” button on the conversion results screen to run the conversion process again.

---
title: Snowpark Migration Accelerator: Curated Reports
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/curated-reports.md
section: Migrations
---

# Snowpark Migration Accelerator: Curated Reports

The Snowpark Migration Accelerator (SMA) generates comprehensive assessment reports by analyzing detailed data. The following section lists these available reports.

The assessment results, including detailed inventories of all elements, can be found in the [spreadsheets on the following pages](sma-inventories.md).

## Detailed Report

> **Danger:**
>
> The **DetailedReport.html** report has been deprecated and is no longer supported as of Spark Conversion Core **V2.43.0**

> **Note:**
>
> This page explains each section of the detailed report as shown in the document file.

The SMA Detailed Report is the main analysis report that provides comprehensive information across multiple sections.

The assessment report contains the following sections and their descriptions:

The first page of the detailed report provides a concise overview of the Snowpark Migration Accelerator (SMA) tool.

This page contains the following subsection:

The Execution Summary section displays:

* Your organization name and email address from your [project creation](../../project-overview/project-setup.md) settings
* A unique identification number for each SMA execution (this ID is referenced throughout the inventories section)
* The timestamp of execution
* Version details for both SMA and Snowpark API

### Readiness Scores Summary

The next page displays a summary of readiness scores. It includes scores for [Spark API](../readiness-scores.md) and [Third-Party libraries](../readiness-scores.md), along with guidance on how to interpret them. These scores help you understand how well-prepared your codebase is for migration to Snowflake.

This section provides detailed information about each readiness score.

### File Summary

The file summary section begins on the following page. This section may span multiple pages depending on how many different file types were processed during this tool execution.

This information is also available in the [conversion summary presented in the application](../understanding-the-conversion-summary.md).

* File Type Summary: Displays a breakdown of recognized technologies, including the number of files for each technology type, their total lines of code, and what percentage they represent of all analyzed files.
* File Extension Summary: Shows statistics for each recognized file extension, including the number of files with that extension, their total lines of code, and what percentage they represent of all analyzed files.

* Code File Size Analysis: Displays the distribution of code files by size category (“t-shirt” sizing). Each size category shows the number of files and their percentage of the total codebase.
* Notebook Language Statistics: Provides a breakdown of code lines and cells by programming language across all scanned notebooks.
* Notebook Size Classification by Language: Categorizes each notebook file by size based on its total lines of code. The notebook type (Python, Scala, or SQL) is determined by the predominant language used. Size categories are:

  + XS: Under 50 lines
  + S: 50-200 lines
  + M: 200-500 lines
  + L: 500-1,000 lines
  + XL: Over 1,000 lines

### Spark API Summary

The Spark API Summary provides a detailed analysis of the readiness score shown in the Readiness Score section. This section contains four tables:

1. A list of files containing Spark API references
2. A breakdown of supported and unsupported features
3. The readiness score organized by Spark API categories
4. The readiness score organized by Mapping Status

We will explain which Spark API references are supported and unsupported. Here’s what these terms mean:

* Supported: The Snowpark Migration Accelerator (SMA) can automatically convert this API element to the Snowpark API or provide a known workaround.
* Unsupported: The Snowpark Migration Accelerator (SMA) cannot automatically convert this API element to the Snowpark API. This does not mean conversion is impossible, but it will require manual intervention.

* Files with Spark References: This table shows a breakdown of Spark technology usage across your workload, categorized by technology type.
* Files with Spark Support Status: This table displays the number of supported and unsupported Spark features in your source code, organized by technology type.

* Spark API Usage Summary: A table showing how many Spark API functions are supported and not supported in Python and Scala. The table is organized by API category and includes a Spark API Readiness Score, which matches the score shown in the Readiness Score section.
* Spark API Usage Support Categories: A breakdown of how many times Spark API functions are used in your code, organized by their support status. For detailed descriptions of each support category, see [the Spark Reference Categories page](../spark-reference-categories.md).

### Pandas API Usage Summarycv

> **Note:**
>
> The Pandas API Usage Summary is only available for executions that contain Python files.

The Pandas API Summary provides a list of references to the Pandas API, similar to the Spark API Summary shown previously.

* Files with Pandas Usage: A breakdown showing the number of Pandas references found in each technology across your entire workload.
* Pandas API Usage Summary: A detailed list of Pandas library functions used in your source code, sorted by frequency of use.

### Import Reference Summary

The Import Analysis section displays all external dependencies imported into your codebase. This includes third-party libraries and other external components used across all files. Note that imports from files within your own codebase are not shown in this table.

The table displays Python package information with the following details:

* Package names that were imported
* Whether each package is supported in Snowpark’s Anaconda distribution
* Number of times each package appears in imports
* Percentage of files containing each import

Note that while the “Percent” column total equals 100%, individual percentages may sum to more than 100% since files often contain multiple package imports.

### SQL Reference Summary

* SQL Usage by File Type: This table categorizes SQL usage based on different technologies, showing the total number of SQL files and SQL cells found in your workload.
* SQL Usage by Support Status: This table organizes SQL elements based on whether they have an equivalent feature in Snowflake or not.

### Snowpark Migration Accelerator (SMA) Issue Summary

The Snowpark Migration Accelerator (SMA) creates issue reports whenever it detects warnings, conversion errors, or parsing errors in your code. Resolving these issues is essential for completing a successful code migration using SMA.

For a detailed guide on understanding and analyzing issues, refer to [the issue analysis section](../../../issue-analysis/approach.md) of our documentation.

The summary displays each issue with the following information:

* Issue code (with a link to detailed documentation)
* Number of occurrences in the workload
* Severity level

The report displays three severity levels (Warning, Conversion Error, and Parsing Error) along with a summary organized by each level.

When working with migration tools, follow these priorities for handling different types of issues:

1. Address parsing errors first, as they require immediate attention
2. Resolve conversion errors through programmatic solutions
3. Monitor and track warnings throughout the migration process

Appendices

Appendix A provides detailed descriptions of all mapping status categories.

---

This comprehensive report contains detailed information gathered from [the inventory files](sma-inventories.md) that the Snowpark Migration Accelerator (SMA) generates.

For detailed information about the report, contact the Snowpark Migration Accelerator (SMA) team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

> **Danger:**
>
> The **Summary Report** feature has been removed and is no longer available starting from Spark Conversion Core **V2.43.0**

---

The SMA generates several output reports, which include detailed spreadsheets in the results.

---
title: Snowpark Migration Accelerator: Default Settings
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/sma-execution-guide/feature-settings/default-settings.md
section: Migrations
---

# Snowpark Migration Accelerator: Default Settings

## Default Values

* On/Off the whole feature: Enabled.
* Collect user-defined methods returning DataFrame type: False.
* List of relevant PySpark functions to collect: (See table below).
* Sample: 100%.
* Mode: Schema.
* Enabled: Always True.

## **Default PySpark functions to collect**

| Type | PySpark Packages |
| --- | --- |
| Creation | pyspark.sql.session.SparkSession.createDataFrame<br>pyspark.sql.readwriter.DataFrameReader.csv<br>pyspark.sql.readwriter.DataFrameReader.jdbc<br>pyspark.sql.readwriter.DataFrameReader.json<br>pyspark.sql.readwriter.DataFrameReader.load<br>pyspark.sql.readwriter.DataFrameReader.orc<br>pyspark.sql.readwriter.DataFrameReader.parquet<br>pyspark.sql.readwriter.DataFrameReader.table<br>pyspark.sql.readwriter.DataFrameReader.text<br>pyspark.rdd.RDD.toDF |
| Transformation | pyspark.sql.dataframe.DataFrame.union<br>pyspark.sql.dataframe.DataFrame.intersect<br>pyspark.sql.dataframe.DataFrame.join<br>pyspark.sql.group.GroupedData.pivot |

---
title: Snowpark Migration Accelerator: Determining Compatibility with Snowpark Connect
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/snowpark-connect/identifying-fully-compatible-files.md
section: Migrations
---

# Snowpark Migration Accelerator: Determining Compatibility with Snowpark Connect

With [Snowpark Connect for Spark](../../../../developer-guide/snowpark-connect/snowpark-connect-overview.md), you can run your Spark code with
Snowflake.

You can determine if a Spark workload is a good fit for Snowpark Connect for Spark by using the following steps:

1. Take a moment to understand what you have and what you are looking to do.

   Snowpark Connect for Spark will be a great choice for many Spark workloads, but not all.
2. Run the Snowpark Migration Accelerator (SMA), as described below.

   Currently, the SMA will report on compatibility for Snowpark Connect for Spark for any code written in Python with references to the
   Spark API.
3. Use the SMA to identify all references to the Spark API that are present in the codebase you’ve scanned with it.

## Analyze compatibility with the SMA

### Accessing the SMA

The Snowpark Migration Accelerator (SMA) is a tool that accelerates the migration of pipelines written in or with Spark. The SMA assesses
the compatibility of references of the Spark API in Python and Scala code, and can convert some references to the Snowpark API.

### Installation

You can download the SMA from the Snowflake website as described in [Installation](../../general/getting-started/installation/README.md).

### Before running the SMA

Once you have installed the SMA, you can assess a codebase. As you do, keep in mind the following:

* The SMA can only analyze [certain extensions](../../user-guide/before-using-the-sma/supported-filetypes.md) for references
  to the Spark API. However, only Python code can be analyzed for Snowpark Connect.

  Notebooks and code files can be processed at the same time.
* The SMA reads files from a local directory. You will need to put them all in a root directory (there can be as many subdirectories as you
  like) for the SMA to analyze them. You can run many files or a single file through as much as you like.

For more SMA considerations, see [Before Using the SMA](../../user-guide/before-using-the-sma/README.md).

## Generating the assessment

1. Open the Snowpark Migration Accelerator (SMA).
2. Start a new project by selecting the **New Project** button.
3. Fill out the fields on the [project creation screen](../../user-guide/project-overview/project-setup.md), including the locally accessible directory where the source codebase is. The field **Company name** helps Snowflake identify other runs of the SMA that may be related to your codebase or other codebases (depending on how you name it), both in the past or the future. Once you have filled out all the fields, click the **Save** button in the bottom right corner.
4. Select the **Code Process** card to proceed to the assessment settings screen.
5. On the **Assessment Settings** screen, select **Start Assessment** to run the assessment on your code.
6. The SMA displays progress indicators while the assessment runs: **Loading Source Code**, **Analyzing Source Code**, and **Writing Results**. This may take several minutes depending on project size.
7. When assessment has finished, the SMA displays the **Assessment Results** page.
8. Determine a codebase’s compatibility with Snowpark Connect by looking at the Snowpark Connect Readiness Score.

   The number of [Readiness Scores](../../user-guide/assessment/readiness-scores.md) shown varies depending on the SMA version
   you run.

   The percentage shown is the count of references to the Spark API that are fully compatible with Snowpark Connect. Next to the
   percentage, you’ll find green (greater than 90 percent of references are supported), yellow (between 70 percent and 90 percent of references are
   supported), or red (less than 70 percent of references are supported) indicators.

   The following describes what the indicators mean:

   * **Green**: Good candidate for Snowpark Connect (less than 10 percent of references to the Spark API are not supported in Snowpark Connect).
   * **Yellow**: Possibly a good candidate.

     Determine if you can make what is not supported work with Snowpark Connect. You can use the SMA to convert to the Snowpark API
     and compare that result with the Snowpark Readiness Score to see which is a better fit.
   * **Red**: Could still work, but there’s a lot of incompatibility.

     Check the Snowpark API Readiness score. If that is high (greater than 90 percent), then it’s likely that Snowpark is the better route
     for migration for this workload.

   For the yellow and red options, reach out to [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) for more support, but for the green indicator you may want to
   see if you can take the next step and identify a POC.

> **Note:**
>
> The SMA converts only a few elements to Snowpark Connect. You can choose **Continue to Conversion** in the SMA user interface, but that will only
> convert the code to the Snowpark API.

## Determining what files are ready to run with Snowpark Connect

The score is a high level indicator, but the SMA allows you to see exactly what elements of the Spark API are supported and what files are
fully supported.

> **Note:**
>
> This guide shows you how to do this locally, but you can also use the [Interactive Assessment Application (IAA)](../../interactive-assessment-application/overview.md).

To determine what files are ready to run, you will have to use the `SparkUsagesInventory.csv` file generated in the
[local Reports output folder](../../user-guide/scos-conversion/output-reports/README.md) by the SMA. This file lists every
reference to the Spark API found by the SMA.

1. Navigate to the reports directory from the application by selecting **View Reports**.

   This will take you to a local directory that has a large number of inventories and other reports that the SMA generates.
2. Open the `SparkUsagesInventory.csv` file in a spreadsheet editor.
3. Locate the **IsSnowparkConnectSupported** field.

   This will give a TRUE or FALSE indicator of each element of the Spark API.
4. Pivot this spreadsheet to determine if there are any files that are fully supported.

   1. Insert a pivot table with the entire spreadsheet in the range for the pivot.
   2. Select **FileId** as the row, and **IsSnowparkConnectSupported** as the column and values.

      This will give you a result that looks like the following image:
   3. Sort the result by FALSE ascending (meaning, lowest to highest).
   4. If there are any files that have zero unsupported references in Snowpark Connect, they will show up at the top.

      There are none in this example (the lowest count of unsupported references in a specific file is 1).
   5. From this, you can use the artifact dependency output of the SMA to get the dependencies for the file list above.

      With those dependencies, you can see what inputs or outputs may be present in order to run the file, and ultimately build a POC.

---
title: Snowpark Migration Accelerator: Downloading and Getting Access
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/download-and-access.md
section: Migrations
---

# Snowpark Migration Accelerator: Downloading and Getting Access

The Snowpark Migration Accelerator (SMA) is a desktop application that helps you to convert your existing code to Snowflake’s Snowpark framework. The application runs on macOS and Windows operating systems.

## System Requirements

Before installing the Snowpark Migration Accelerator (SMA), verify that your system meets the following minimum requirements:

**MacOS:**

* macOS Ventura 13.3.1 or a newer version
* Minimum of 4 GB RAM

**Windows:**

* Windows 11
* Minimum 4 GB of RAM

> **Note:**
>
> * Available RAM size affects the speed of conversion process, and the amount of code that can be processed at once. (More RAM is better).
> * The Snowpark Migration Accelerator requires .NET and comes as a self-contained package, eliminating the need to install additional dependencies.

For detailed legal information, please review the [End User License Agreement (EULA)](../conversion-software-terms-of-use/README.md).

## Getting Support

If you experience any difficulties with downloading, installing, or configuring the Snowpark Migration Accelerator (SMA), please reach out to our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). Our team is ready to assist you!

## Downloading the SMA Application

The Snowpark Migration Accelerator (SMA) is a desktop application that runs locally on your computer. You can download the installer from the official Snowflake website:

<https://www.snowflake.com/en/data-cloud/snowpark/migration-accelerator/>

For step-by-step guidance on using the Snowpark Migration Accelerator (SMA) application, please refer to the [SMA User Guide](../../user-guide/overview.md).

## Downloading the SMA Command Line Interface (CLI)

The SMA Command Line Interface (CLI) provides the same functionality as the graphical application but operates through text commands in a terminal. This makes it ideal for automation and scripting tasks.

Download the version that matches your operating system:

* [LINUX X64](https://sitartifacts.z5.web.core.windows.net/linux/prod/cli/SMA-CLI-linux.tar)
* [LINUX ARM](https://sitartifacts.z5.web.core.windows.net/linux/prod/cli/SMA-CLI-arm64-linux.tar)
* [MAC OS X64](https://sitartifacts.z5.web.core.windows.net/darwin_x64/prod/cli/SMA-CLI-mac.tar)
* [MAC OS ARM](https://sitartifacts.z5.web.core.windows.net/darwin_arm64/prod/cli/SMA-CLI-arm64-mac.tar)
* [WINDOWS](https://sitartifacts.z5.web.core.windows.net/windows/prod/cli/SMA-CLI-windows.zip)

For detailed guidance on using the Snowpark Migration Accelerator (SMA) Command Line Interface, refer to the [SMA CLI user guide](../../user-guide/using-the-sma-cli/README.md).

> **Note:**
>
> To migrate from a SQL database to Snowflake, please refer to the [SnowConvert AI documentation](../../../snowconvert-docs/overview.md).

---
title: Snowpark Migration Accelerator: Feature Settings
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/sma-execution-guide/feature-settings/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Feature Settings

## CLI Commands

A new command has been added to the SMA CLI to disable the SMA-Checkpoints feature. Users can do this by using either of the following flags: `-d` or `--disableCheckpoints`.

### **Like follows:**

```sh
./sma -i inputPath -o outputPath -e user@company.com -c Company -p Project -d
```

Or

```sh
./sma -i inputPath -o outputPath -e user@company.com -c Company -p Project --disableCheckpoints
```

## UI Settings

The SMA application allows users to enable or disable the SMA-Checkpoints feature through the *Conversion Settings* modal, accessible from the conversion settings page.

**Configuring SMA-Checkpoints settings**

---
title: Snowpark Migration Accelerator: Getting Started
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Getting Started

Getting started with a migration project can be challenging. The Snowpark Migration Accelerator (SMA) simplifies this process by automatically analyzing your code and converting it to Snowpark, making your migration faster and easier. This documentation, created by Snowflake’s migration experts, will guide you through each step of the process.

The following section covers these topics:

* [Downloading and Getting Access](download-and-access.md)
* [Installation](installation/README.md)

Already installed? Here are some helpful resources to get you started:

* [User Guide](../../user-guide/overview.md) - Learn how to use the Snowpark Migration Accelerator.
* [Snowflake SMA Training](https://learn.snowflake.com/en/courses/OD-SC-SMA/) - Official Snowflake training course for SMA.
* [Assessment Walkthrough](../../use-cases/assessment-walkthrough/README.md) - Step-by-step guide for conducting a migration assessment. We recommend performing an assessment before starting conversion.

  + [Assessment Quick Start](../../user-guide/assessment/assessment-quick-start.md) - Quick guide to start your assessment immediately.
  + [Assessment Overview [Video]](https://www.youtube.com/watch?v=78Ks4kxj3KI) - Brief video tutorial on running an assessment.
* [Conversion Guide](../../user-guide/snowpark-api-conversion/README.md) - Complete guide for code conversion.

  + [Conversion Quick Start](../../user-guide/snowpark-api-conversion/conversion-quick-start.md) - Begin your code conversion process.
  + [Conversion Overview [Video]](https://www.youtube.com/watch?v=IDboYGQegOE) - Brief video tutorial on performing a conversion.

Welcome! Let’s begin.

---
title: Snowpark Migration Accelerator: How do I give SMA permission to the config folder?
source: https://docs.snowflake.com/en/migrations/sma-docs/support/general-troubleshooting/how-do-i-give-sma-permission-to-the-config-folder.md
section: Migrations
---

# Snowpark Migration Accelerator: How do I give SMA permission to the config folder?

SMA requires specific folder permissions to function correctly. It needs read, write, and execute access to:

* macOS: The `.config` folder
* Windows: The `AppData` folder

These folders store essential SMA files including:

* Temporary files
* Log files

Please ensure SMA has full access to the appropriate folder for your operating system.

## For macOS

1. Open the Terminal by pressing **cmd** + **spacebar**, typing `Terminal`, and pressing **enter**.
2. Navigate to your home directory by typing `cd ~` and pressing enter.
3. Change the permissions of the .config directory by typing `chmod 777 .config`. If you see “Operation not permitted,” use `sudo chmod 777 .config` instead.
4. Close the Terminal and restart the Snowpark Migration Accelerator (SMA).

### For Windows

1. Open the Run dialog window by pressing the Windows key and R key together.
2. Enter `%AppData%` in the Run dialog window and press Enter or click OK.
3. Find the “Snowflake Inc folder”, right-click on it, and verify that the Read-only checkbox under Attributes is unchecked.

---
title: Snowpark Migration Accelerator: How do I make sure that .config is a folder instead of a file?
source: https://docs.snowflake.com/en/migrations/sma-docs/support/general-troubleshooting/how-do-I-make-sure-that-config-is-folder.md
section: Migrations
---

# Snowpark Migration Accelerator: How do I make sure that .config is a folder instead of a file?

*This problem only affects macOS systems.*

SMA requires read, write, and execute permissions for the configuration folder (`.config` on macOS). This folder stores temporary files, log files, and license information.

The `.config` must be a directory (folder). If you find that `.config` exists as a file, you need to convert it to a directory and set the appropriate permissions.

To resolve this issue, follow these steps:

1. Locate the `.config` file in your home directory at `'/Users/[Username]/'`.
2. Delete the `.config` file.
3. Create a new folder called `.config` in the same location.
4. Launch a command terminal.
5. Navigate to your home directory by typing the following command, then pressing Enter:

   `cd ~`
6. Change folder permissions by typing the following command:

   `chmod 777 .config`

   If you see an `Operation not permitted` error, use `sudo chmod 777 .config` instead.
7. Exit the terminal.
8. Start SMA.

---
title: Snowpark Migration Accelerator: How the Assessment Works
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/how-the-assessment-works.md
section: Migrations
---

# Snowpark Migration Accelerator: How the Assessment Works

The Snowpark Migration Accelerator (SMA) analyzes your source code and creates a detailed inventory of all its components and dependencies.

[As mentioned previously](../../general/introduction.md), the Snowpark Migration Accelerator (SMA) is more sophisticated than a simple pattern-matching or text replacement tool. It analyzes your source code and creates a comprehensive semantic model that captures all the functionality of your codebase.

The SMA assessment provides a comprehensive inventory of your source code files and evaluates how well your existing Spark API code will work with the Snowpark API. This assessment helps you start your migration project and gives you a clear overview of all the code in your workload.

The assessment process generates the following outputs:

* [Readiness score](../../support/glossary.md)
* A complete list of all Spark API references and their compatibility with the Snowpark API
* A comprehensive list of all third-party library imports in your codebase
* A summary report containing detailed information from all collected inventories

To view all output files generated during assessment mode, refer to the [output reports](../scos-conversion/output-reports/README.md) section of this documentation.

---
title: Snowpark Migration Accelerator: Introduction
source: https://docs.snowflake.com/en/migrations/sma-docs/general/introduction.md
section: Migrations
---

# Snowpark Migration Accelerator: Introduction

## Overview of the Snowpark Migration Accelerator

The Snowpark Migration Accelerator (SMA), formerly *SnowConvert for Spark*, helps developers convert code from various platforms to Snowflake. It uses a proven migration framework with 30 years of development to analyze code that contains Spark API calls. The tool creates an Abstract Syntax Tree (AST) and Symbol Table to build a detailed model of how the code works. This model helps convert the original code into equivalent Snowflake code automatically, maintaining the same functionality as the source code.

The Snowpark Migration Accelerator (SMA) analyzes your source code by creating a detailed model that captures its meaning and purpose. This allows SMA to understand how your code works at a deeper level than basic tools that only search and replace text or match patterns.

The SMA scans your source code and notebook files to find all Spark API calls. It then converts these Spark API calls to their matching Snowpark API functions when possible.

## Assessment and Conversion

The Snowpark Migration Accelerator (SMA) has two operating modes:

1. *Assessment* (or *Qualification*) - A free analysis tool that evaluates your code before conversion
2. *Conversion* - Transforms your code to Snowpark

We strongly recommend running the Assessment mode first before starting any code conversion.

### Assessment Mode

Assessment mode helps users find and analyze Spark API usage in their code. SMA scans the source code and builds a *semantic model* using our specialized framework. This model helps SMA understand how the code works and what it does. As a result, SMA can generate detailed and accurate reports about the code’s components.

The SMA analyzes your code to help plan the migration process. It identifies Spark API dependencies and evaluates how ready your code is for migration. Once the assessment is complete, you can move forward with converting your code.

For more information about how SMA assesses your code, please see the [Assessment section of the SMA User Guide](../user-guide/assessment/README.md).

### Conversion Mode

During the conversion phase, SMA uses the semantic model created in the assessment phase to automatically generate Snowflake-compatible code. The tool replaces Spark API calls with equivalent Snowpark API calls whenever possible. When direct conversion isn’t possible, SMA adds detailed comments to the output code explaining why certain elements couldn’t be converted and provides helpful context for manual conversion.

## Outline

This section provides comprehensive guidance on the Snowpark Migration Accelerator (SMA), covering the following key areas:

* **Getting Started:**

  + Learn how to [Download and Access](getting-started/download-and-access.md) SMA.
  + Step-by-step [Installation](getting-started/installation/README.md) guide.
* **End User License Agreement (EULA):** Review the [Conversion Software Terms of Use](conversion-software-terms-of-use/README.md).
* **Release Notes:** View the latest [Release Notes](release-notes/README.md) to see recent updates and changes.

For assistance or questions, please [Contact Us](../support/contact-us.md).

We invite you to start exploring the features and functionalities of the Snowpark Migration Accelerator (SMA).

---
title: Snowpark Migration Accelerator: Issue Code Categorization
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-code-categorization.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue Code Categorization

The Snowpark Migration Accelerator (SMA) analyzes your codebase and generates issue codes. While these codes provide detailed information, they fall into three main categories.

## Parsing Error

A parsing error occurs when SMA cannot understand or process a section of your source code. This happens when SMA encounters code that it either doesn’t recognize or considers invalid. These errors typically stem from one of two sources:

1. An issue within the SMA tool itself
2. Problems in your source code

This type of error can occur for various reasons.

* **Invalid Source Code**: The code must be executable in the source platform. If you provide code snippets or partial code that cannot run independently in the source platform, SMA will not be able to parse them.
* **Circular Dependencies**: When analyzing large codebases, SMA may encounter circular references between code elements. This can cause the tool to skip or fail to parse some of these interdependent references.
* **New Code Patterns**: While SMA is regularly updated, source platforms also evolve continuously. There might be cases where newly introduced code patterns are not yet supported by the tool.
* **Encoding Issues**: If your source code contains inconsistent encoding or unexpected characters at the beginning or end of files, SMA may generate parsing errors, even if the code runs successfully in the source platform.

When parsing errors occur, they are identified by specific error codes. To understand what these codes mean and how they relate to parsing errors, refer to [the issue codes by source section](issue-codes-by-source/README.md) in our documentation.

## Conversion Error

A conversion error happens when SMA successfully identifies the code but is unable to convert it. Unlike parsing errors, conversion errors do not indicate problems with your source code. Instead, they show that SMA is working as intended by identifying code segments that are beyond its conversion capabilities.

There are several common reasons why code cannot be converted. These include:

* **The element from the source code cannot be implemented in Snowflake**. Currently, there is no equivalent functionality available in Snowflake for this source code element.
* **The specific usage of an element is not supported in Snowflake**. While Snowflake may support a particular element from the source platform, the way it’s being used in the source code is not compatible with Snowflake’s implementation.
* **Required parameters are not supported**. SMA creates a detailed functional model of the source code by analyzing how each element is used, rather than just identifying and categorizing elements. Sometimes, essential function parameters from the source code don’t have corresponding support in Snowflake.
* **Certain function combinations are incompatible**. SMA’s functional model analyzes how functions work together. Even when individual functions are supported in Snowflake, their combined usage might not be possible. In such cases, SMA will flag this as a conversion error.

Most error messages include specific recommendations or next steps to help you resolve the conversion issue. You can find these suggestions on the corresponding error page.

When SMA encounters a conversion error, it adds an EWI (Error, Warning, Info) comment in the converted code and records the error in [the issues inventory file](../user-guide/scos-conversion/output-reports/sma-inventories.md). The system will then:

* Add a comment symbol to the line containing the conversion error.
* Keep the line uncommented to prevent the file from executing.

When encountering conversion errors, each error has a unique error code. To understand what these codes mean and how to resolve them, refer to [the issue codes by source section](issue-codes-by-source/README.md) in our documentation.

## Warning

A warning differs from an error in SMA. Warnings appear when the tool detects changes that you should be aware of. While these changes won’t prevent your code from running, they indicate that certain aspects of your code may look or behave differently in the converted output compared to the source code.

Common reasons for warning messages:

* **The code appears different**. SMA performs transformations that generate an EWI (Error, Warning, or Information) message.
* **Some specific scenarios may not convert successfully**. The tool will generate a warning if a particular feature works in 99.9% of test cases but fails in certain parameter combinations. If your code uses these specific parameter combinations, you will receive a conversion error.
* **Elements were omitted**. This is the most frequent type of warning. Many functions or parameters that are essential in the source system are not required in Snowflake.

Warnings are informational messages that typically don’t require immediate action. However, we strongly recommend reviewing all warnings before deploying code to the target environment. These warnings should be considered during the testing phase of the converted code.

Warnings are identified by specific error codes. To understand what these codes mean, refer to [the issue codes by source section](issue-codes-by-source/README.md) in this documentation.

---
title: Snowpark Migration Accelerator: Issue codes for DBX
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/dbx/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue codes for DBX

## SPRKDBX1000

Message: < **\*Magic Name\*** > is not supported in Snowsight. It is necessary to rewrite code.

Category: Conversion Error.

### Description

This issue appears when the SMA detects a magic command in a DBX notebook, Snowsight does not support magic commands.

### Scenario

**Input**

The following is an example of a magic command in a DBX notebook.

```python
val df = spark.read.format("csv").load("path/to/file.csv")
df.show()
```

**Output**

The SMA adds the EWI `SPRKDBX1000` to the output code to let you know that this magic is not supported.

```python
# EWI: SPRKDBX1000 => Then %alias magic command is not supported in Snowsight. It is necessary to rewrite code.
# %alias myalias echo \"This is an alias\"
```

**Recommended fix**

This issue has no direct fix. You must rewrite the code.

## SPRKDBX1001

Message: The `%run` command has a partial mapping, because it has different behavior in Snowpark.

Category: Conversion Error.

### Description

This issue appears when the SMA detects a use of the `%run` command that does not have a direct equivalent in Snowpark.

The way the `%run` command works in DBX is similar to an import, meaning it includes the code
from another notebook into the current one in that it shares variables, functions, and states with the notebook
where the command was executed.

### Scenario

**Input**

Below is an example of the `%run` command.

```python
%run /Workspace/Users/path/to/Notebook/notebookName
```

**Output**

The SMA adds the EWI `SPRKDBX1001` on the output code to let you know that this element has a different behavior in Snowpark.

```python
EWI: SPRKDBX1001 => The %run command has a partial mapping, because it has a different behavior in Snowpark.
spark.sql("EXECUTE NOTEBOOK <DATABASE>.<SCHEMA>.notebookName()")
```

**Recommended fix**

Identify the used elements (functions, variables) from the executed notebook and encapsulate them into a file. Import that file into the executing notebook.

This fix would emulate the behavior of the original `%run` command.

## SPRKDBX1002

Message: Scala cells are not supported in Snowsight.

Category: Conversion Error.

### Description

This issue appears when the SMA detects a cell with Scala code in a DBX notebook; Snowsight does not support Scala cells. Only SQL, Python, and Markdown are available in Snowsight.

### Scenarios

The following scenarios are not supported.

#### Scenario 1

Scala cell in a DBX notebook.

**Input**

Below is an example of a Scala cell in a DBX notebook.

```scala
val df = spark.read.format("csv").load("path/to/file.csv")
df.show()
```

**Output**

The SMA adds the EWI `SPRKDBX1002` on the output code to let you know that this cell is not supported.

```python
# EWI: SPRKDBX1002 => Scala cells are not supported in Snowpark. It is necessary to rewrite the Scala code in Python.
#val df = spark.read.format("csv").load("path/to/file.csv")
#df.show()
```

**Recommended fix**

This issue has no direct fix. You must rewrite the Scala code in Python.

#### Scenario 2

The `%scala` command in a DBX notebook.

**Input**

Below is an example of a `%scala` command cell in a DBX notebook.

```scala
%scala
val df = spark.read.format("csv").load("path/to/file.csv")
df.show()
```

**Output**

The SMA adds the EWI `SPRKDBX1002` on the output code to let you know that this cell is not supported.

```python
# EWI: SPRKDBX1002 => Scala cells are not supported in Snowpark. It is necessary to rewrite the Scala code in Python.
#val df = spark.read.format("csv").load("path/to/file.csv")
#df.show()
```

**Recommended fix**

This issue has no direct fix. It is necessary to rewrite the Scala code in Python.

## SPRKDBX1003

Message: R cells are not supported in Snowsight.

Category: Conversion Error.

### Description

This issue appears when the SMA detects a cell with R code in a DBX notebook; Snowsight does not support R cells. Only SQL, Python, and Markdown are available in Snowsight.

### Scenario

**Input**

Below is an example of %r command.

```python
%r
my_vector <- c(1, 2, 3, 4, 5)
```

**Output**

The SMA adds the EWI `SPRKDBX1003` on the output code to let you know that this cell is not supported.

```python
# EWI: SPRKDBX1003 => R cells are not supported in Snowpark. It is necessary to rewrite the R code in Python.
# my_vector <- c(1, 2, 3, 4, 5)
```

**Recommended fix**

This issue has no direct fix. You must rewrite the R code in Python.

## SPRKDBX1004

Message: The method ‘< **\*element\*** >’ has no equivalence on Snowflake/Snowsight.

Category: Conversion Error.

### Description

This issue appears when the SMA detects the use of a DBX method that has no equivalent in Snowsight, and does not have its own error code associated with it. SMA uses this generic error code for an unsupported DBX element.

### Scenario

**Input**

Below is an example of DBX utility element.

```python
dbutils.data.summarize(df)
```

**Output**

The SMA adds the EWI `SPRKDBX1004` on the output code to let you know that the method has no equivalent in Snowsight.

```python
# EWI: SPRKPY1004 => The method 'dbutils.data.summarize ' has no equivalence on Snowflake/Snowsight.
dbutils.data.summarize(df)
```

**Recommended fix**

Because this is a generic error code that applies to a range of unsupported functions, there is not a single and specific fix. The appropriate action will depend on the particular element in use.

Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

## SPRKDBX1005

Message: The method <**\*element\***> has an equivalent in Snowflake/Snowsight; however, the element’s parameter, a URL path, is not supported.

Category: Warning.

### Description

This issue appears when the SMA identifies the use of a DBX method that has an equivalent in Snowsight; however, the element has a URL path as a parameter, which is not supported in Snowflake and Snowsight.

### Scenario

**Input**

The following example shows a `dbutils` method called with a URL path as a parameter:

```python
dbutils.fs.cp("s3://example.com/data.csv", "/mnt/data/")
```

**Output**

The SMA adds the EWI `SPRKDBX1005` on the output code to let you know that the method has URL as a parameter and is not supported.

```python
# EWI: SPRKDBX1005 => The method 'dbutils.fs.cp' has an equivalent in Snowflake/Snowsight; however, the element's parameter, a URL path, is not supported.
sfutils.fs.cp("s3://example.com/data.csv", "/mnt/data/")
```

**Recommended fix**

This generic warning is used for functions with URL path parameters, so there is no single recommended fix. Review the specific method and its usage to determine if an alternative approach or workaround is possible in Snowflake or Snowsight. Consider refactoring the code to avoid using URL paths. You can also implement custom logic to handle the data transfer outside the method.

Please note that even though the URL paths are not supported, it does not necessarily mean that a solution or workaround cannot be found. It only means that the SMA itself cannot find the solution.

Snowflake offers the ability to map URL-type paths by using an external stage, which facilitates the integration and access to external data. Consult the Snowflake documentation on [External Stage](../../../../../user-guide/data-load-s3-create-stage.md) for more information.

For this, you must have a storage integration ([STORAGE_INTEGRATION](../../../../../sql-reference/sql/create-storage-integration.md)) configured. Subsequently, you will need to copy your files to a table. You can find more details on how to copy data from an [S3 stage](../../../../../user-guide/data-load-s3-copy.md) in the Snowflake documentation.

---
title: Snowpark Migration Accelerator: Issue codes for pandas
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/pandas/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue codes for pandas

The following issue codes may appear when the Snowpark Migration Accelerator (SMA) processes pandas code. Select an issue code to view its description, examples, and recommended fix.

For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or [report an issue](../../../user-guide/project-overview/configuration-and-settings.md). If you have a support contract with Snowflake, contact your sales engineer for assistance.

* [PNDSPY1001](PNDSPY1001.md)
* [PNDSPY1002](PNDSPY1002.md)
* [PNDSPY1003](PNDSPY1003.md)
* [PNDSPY1004](PNDSPY1004.md)
* [PNDSPY1005](PNDSPY1005.md)
* [PNDSPY1006](PNDSPY1006.md)
* [PNDSPY1007](PNDSPY1007.md)
* [PNDSPY1008](PNDSPY1008.md)
* [PNDSPY1009](PNDSPY1009.md)
* [PNDSPY1010](PNDSPY1010.md)
* [PNDSPY1011](PNDSPY1011.md)
* [PNDSPY1012](PNDSPY1012.md)
* [PNDSPY1013](PNDSPY1013.md)
* [PNDSPY1014](PNDSPY1014.md)
* [PNDSPY1015](PNDSPY1015.md)
* [PNDSPY1016](PNDSPY1016.md)
* [PNDSPY1017](PNDSPY1017.md)
* [PNDSPY1018](PNDSPY1018.md)
* [PNDSPY1019](PNDSPY1019.md)
* [PNDSPY1020](PNDSPY1020.md)
* [PNDSPY1021](PNDSPY1021.md)
* [PNDSPY1022](PNDSPY1022.md)
* [PNDSPY1023](PNDSPY1023.md)
* [PNDSPY1024](PNDSPY1024.md)
* [PNDSPY1025](PNDSPY1025.md)
* [PNDSPY1026](PNDSPY1026.md)
* [PNDSPY1027](PNDSPY1027.md)
* [PNDSPY1028](PNDSPY1028.md)
* [PNDSPY1029](PNDSPY1029.md)
* [PNDSPY1030](PNDSPY1030.md)
* [PNDSPY1031](PNDSPY1031.md)
* [PNDSPY1032](PNDSPY1032.md)
* [PNDSPY1033](PNDSPY1033.md)
* [PNDSPY1034](PNDSPY1034.md)
* [PNDSPY1035](PNDSPY1035.md)
* [PNDSPY1036](PNDSPY1036.md)
* [PNDSPY1037](PNDSPY1037.md)
* [PNDSPY1038](PNDSPY1038.md)
* [PNDSPY1039](PNDSPY1039.md)
* [PNDSPY1040](PNDSPY1040.md)
* [PNDSPY1041](PNDSPY1041.md)
* [PNDSPY1042](PNDSPY1042.md)
* [PNDSPY1043](PNDSPY1043.md)
* [PNDSPY1044](PNDSPY1044.md)
* [PNDSPY1045](PNDSPY1045.md)
* [PNDSPY1046](PNDSPY1046.md)
* [PNDSPY1047](PNDSPY1047.md)
* [PNDSPY1048](PNDSPY1048.md)
* [PNDSPY1049](PNDSPY1049.md)
* [PNDSPY1050](PNDSPY1050.md)
* [PNDSPY1051](PNDSPY1051.md)
* [PNDSPY1052](PNDSPY1052.md)
* [PNDSPY1053](PNDSPY1053.md)
* [PNDSPY1054](PNDSPY1054.md)
* [PNDSPY1055](PNDSPY1055.md)
* [PNDSPY1056](PNDSPY1056.md)
* [PNDSPY1057](PNDSPY1057.md)
* [PNDSPY1058](PNDSPY1058.md)
* [PNDSPY1059](PNDSPY1059.md)
* [PNDSPY1060](PNDSPY1060.md)
* [PNDSPY1061](PNDSPY1061.md)
* [PNDSPY1062](PNDSPY1062.md)
* [PNDSPY1063](PNDSPY1063.md)
* [PNDSPY1064](PNDSPY1064.md)
* [PNDSPY1065](PNDSPY1065.md)
* [PNDSPY1066](PNDSPY1066.md)
* [PNDSPY1067](PNDSPY1067.md)
* [PNDSPY1068](PNDSPY1068.md)
* [PNDSPY1069](PNDSPY1069.md)
* [PNDSPY1070](PNDSPY1070.md)
* [PNDSPY1071](PNDSPY1071.md)
* [PNDSPY1072](PNDSPY1072.md)
* [PNDSPY1073](PNDSPY1073.md)
* [PNDSPY1074](PNDSPY1074.md)
* [PNDSPY1075](PNDSPY1075.md)
* [PNDSPY1076](PNDSPY1076.md)
* [PNDSPY1077](PNDSPY1077.md)
* [PNDSPY1078](PNDSPY1078.md)
* [PNDSPY1079](PNDSPY1079.md)
* [PNDSPY1080](PNDSPY1080.md)
* [PNDSPY1081](PNDSPY1081.md)
* [PNDSPY1082](PNDSPY1082.md)
* [PNDSPY1083](PNDSPY1083.md)
* [PNDSPY1084](PNDSPY1084.md)
* [PNDSPY1085](PNDSPY1085.md)
* [PNDSPY1086](PNDSPY1086.md)
* [PNDSPY1087](PNDSPY1087.md)
* [PNDSPY1088](PNDSPY1088.md)
* [PNDSPY1089](PNDSPY1089.md)
* [PNDSPY1090](PNDSPY1090.md)
* [PNDSPY1091](PNDSPY1091.md)
* [PNDSPY1092](PNDSPY1092.md)
* [PNDSPY1093](PNDSPY1093.md)
* [PNDSPY1094](PNDSPY1094.md)
* [PNDSPY1095](PNDSPY1095.md)
* [PNDSPY1096](PNDSPY1096.md)
* [PNDSPY1097](PNDSPY1097.md)
* [PNDSPY1098](PNDSPY1098.md)
* [PNDSPY1099](PNDSPY1099.md)
* [PNDSPY1100](PNDSPY1100.md)
* [PNDSPY1101](PNDSPY1101.md)
* [PNDSPY1102](PNDSPY1102.md)
* [PNDSPY1103](PNDSPY1103.md)
* [PNDSPY1104](PNDSPY1104.md)
* [PNDSPY1105](PNDSPY1105.md)
* [PNDSPY1106](PNDSPY1106.md)
* [PNDSPY1107](PNDSPY1107.md)
* [PNDSPY1108](PNDSPY1108.md)
* [PNDSPY1109](PNDSPY1109.md)
* [PNDSPY1110](PNDSPY1110.md)
* [PNDSPY1111](PNDSPY1111.md)
* [PNDSPY1112](PNDSPY1112.md)
* [PNDSPY1113](PNDSPY1113.md)
* [PNDSPY1114](PNDSPY1114.md)
* [PNDSPY1115](PNDSPY1115.md)
* [PNDSPY1116](PNDSPY1116.md)
* [PNDSPY1117](PNDSPY1117.md)
* [PNDSPY1118](PNDSPY1118.md)
* [PNDSPY1119](PNDSPY1119.md)
* [PNDSPY1120](PNDSPY1120.md)
* [PNDSPY1121](PNDSPY1121.md)
* [PNDSPY1122](PNDSPY1122.md)
* [PNDSPY1123](PNDSPY1123.md)
* [PNDSPY1124](PNDSPY1124.md)
* [PNDSPY1125](PNDSPY1125.md)
* [PNDSPY1126](PNDSPY1126.md)
* [PNDSPY1127](PNDSPY1127.md)
* [PNDSPY1128](PNDSPY1128.md)
* [PNDSPY1129](PNDSPY1129.md)
* [PNDSPY1130](PNDSPY1130.md)
* [PNDSPY1131](PNDSPY1131.md)
* [PNDSPY1132](PNDSPY1132.md)
* [PNDSPY1133](PNDSPY1133.md)
* [PNDSPY1134](PNDSPY1134.md)
* [PNDSPY1135](PNDSPY1135.md)
* [PNDSPY1136](PNDSPY1136.md)
* [PNDSPY1137](PNDSPY1137.md)
* [PNDSPY1138](PNDSPY1138.md)
* [PNDSPY1139](PNDSPY1139.md)
* [PNDSPY1140](PNDSPY1140.md)
* [PNDSPY1141](PNDSPY1141.md)
* [PNDSPY1142](PNDSPY1142.md)
* [PNDSPY1143](PNDSPY1143.md)
* [PNDSPY1144](PNDSPY1144.md)
* [PNDSPY1145](PNDSPY1145.md)
* [PNDSPY1146](PNDSPY1146.md)
* [PNDSPY1147](PNDSPY1147.md)
* [PNDSPY1148](PNDSPY1148.md)
* [PNDSPY1149](PNDSPY1149.md)
* [PNDSPY1150](PNDSPY1150.md)
* [PNDSPY1151](PNDSPY1151.md)
* [PNDSPY1152](PNDSPY1152.md)
* [PNDSPY1153](PNDSPY1153.md)
* [PNDSPY1154](PNDSPY1154.md)
* [PNDSPY1155](PNDSPY1155.md)
* [PNDSPY1156](PNDSPY1156.md)
* [PNDSPY1157](PNDSPY1157.md)
* [PNDSPY1158](PNDSPY1158.md)
* [PNDSPY1159](PNDSPY1159.md)
* [PNDSPY1160](PNDSPY1160.md)
* [PNDSPY1161](PNDSPY1161.md)
* [PNDSPY1162](PNDSPY1162.md)
* [PNDSPY1163](PNDSPY1163.md)
* [PNDSPY1164](PNDSPY1164.md)
* [PNDSPY1165](PNDSPY1165.md)
* [PNDSPY1166](PNDSPY1166.md)
* [PNDSPY1167](PNDSPY1167.md)
* [PNDSPY1168](PNDSPY1168.md)
* [PNDSPY1169](PNDSPY1169.md)
* [PNDSPY1170](PNDSPY1170.md)
* [PNDSPY1171](PNDSPY1171.md)
* [PNDSPY1172](PNDSPY1172.md)
* [PNDSPY1173](PNDSPY1173.md)
* [PNDSPY1174](PNDSPY1174.md)
* [PNDSPY1175](PNDSPY1175.md)
* [PNDSPY1176](PNDSPY1176.md)
* [PNDSPY1177](PNDSPY1177.md)
* [PNDSPY1178](PNDSPY1178.md)
* [PNDSPY1179](PNDSPY1179.md)
* [PNDSPY1180](PNDSPY1180.md)
* [PNDSPY1181](PNDSPY1181.md)
* [PNDSPY1182](PNDSPY1182.md)
* [PNDSPY1183](PNDSPY1183.md)
* [PNDSPY1184](PNDSPY1184.md)
* [PNDSPY1185](PNDSPY1185.md)
* [PNDSPY1186](PNDSPY1186.md)
* [PNDSPY1187](PNDSPY1187.md)
* [PNDSPY1188](PNDSPY1188.md)
* [PNDSPY1189](PNDSPY1189.md)
* [PNDSPY1190](PNDSPY1190.md)
* [PNDSPY1191](PNDSPY1191.md)
* [PNDSPY1192](PNDSPY1192.md)
* [PNDSPY1193](PNDSPY1193.md)
* [PNDSPY1194](PNDSPY1194.md)
* [PNDSPY1195](PNDSPY1195.md)
* [PNDSPY1196](PNDSPY1196.md)
* [PNDSPY1197](PNDSPY1197.md)
* [PNDSPY1198](PNDSPY1198.md)
* [PNDSPY1199](PNDSPY1199.md)
* [PNDSPY1200](PNDSPY1200.md)
* [PNDSPY1201](PNDSPY1201.md)
* [PNDSPY1202](PNDSPY1202.md)
* [PNDSPY1203](PNDSPY1203.md)
* [PNDSPY1204](PNDSPY1204.md)
* [PNDSPY1205](PNDSPY1205.md)
* [PNDSPY1206](PNDSPY1206.md)
* [PNDSPY1207](PNDSPY1207.md)
* [PNDSPY1208](PNDSPY1208.md)
* [PNDSPY1209](PNDSPY1209.md)
* [PNDSPY1210](PNDSPY1210.md)
* [PNDSPY1211](PNDSPY1211.md)
* [PNDSPY1212](PNDSPY1212.md)
* [PNDSPY1213](PNDSPY1213.md)
* [PNDSPY1214](PNDSPY1214.md)
* [PNDSPY1215](PNDSPY1215.md)
* [PNDSPY1216](PNDSPY1216.md)
* [PNDSPY1217](PNDSPY1217.md)
* [PNDSPY1218](PNDSPY1218.md)
* [PNDSPY1219](PNDSPY1219.md)
* [PNDSPY1220](PNDSPY1220.md)
* [PNDSPY1221](PNDSPY1221.md)
* [PNDSPY1222](PNDSPY1222.md)
* [PNDSPY1223](PNDSPY1223.md)
* [PNDSPY1224](PNDSPY1224.md)
* [PNDSPY1225](PNDSPY1225.md)
* [PNDSPY1226](PNDSPY1226.md)
* [PNDSPY1227](PNDSPY1227.md)
* [PNDSPY1228](PNDSPY1228.md)
* [PNDSPY1229](PNDSPY1229.md)
* [PNDSPY1230](PNDSPY1230.md)
* [PNDSPY1231](PNDSPY1231.md)
* [PNDSPY1232](PNDSPY1232.md)
* [PNDSPY1233](PNDSPY1233.md)
* [PNDSPY1234](PNDSPY1234.md)
* [PNDSPY1235](PNDSPY1235.md)
* [PNDSPY1236](PNDSPY1236.md)
* [PNDSPY1237](PNDSPY1237.md)
* [PNDSPY1238](PNDSPY1238.md)
* [PNDSPY1239](PNDSPY1239.md)
* [PNDSPY1240](PNDSPY1240.md)
* [PNDSPY1241](PNDSPY1241.md)
* [PNDSPY1242](PNDSPY1242.md)
* [PNDSPY1243](PNDSPY1243.md)
* [PNDSPY1244](PNDSPY1244.md)
* [PNDSPY1245](PNDSPY1245.md)
* [PNDSPY1246](PNDSPY1246.md)
* [PNDSPY1247](PNDSPY1247.md)
* [PNDSPY1248](PNDSPY1248.md)
* [PNDSPY1249](PNDSPY1249.md)
* [PNDSPY1250](PNDSPY1250.md)
* [PNDSPY1251](PNDSPY1251.md)
* [PNDSPY1252](PNDSPY1252.md)
* [PNDSPY1253](PNDSPY1253.md)
* [PNDSPY1254](PNDSPY1254.md)
* [PNDSPY1255](PNDSPY1255.md)
* [PNDSPY1256](PNDSPY1256.md)
* [PNDSPY1257](PNDSPY1257.md)
* [PNDSPY1258](PNDSPY1258.md)
* [PNDSPY1259](PNDSPY1259.md)
* [PNDSPY1260](PNDSPY1260.md)
* [PNDSPY1261](PNDSPY1261.md)
* [PNDSPY1262](PNDSPY1262.md)

---
title: Snowpark Migration Accelerator: Issue Codes for Python
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/python/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue Codes for Python

## SPRKPY1000

Message: Source project spark-core version is xx.xx:xx.x.x, the spark-core version supported by snowpark is 2.12:3.1.2 so there may be functional differences between the existing mappings.

Category: Warning.

### Description

This issue appears when the Pyspark version of your source code is not supported. This means that there may be functional differences between the existing mappings.

#### Additional recommendations

* The pyspark version scanned by the SMA for compatibility to Snowpark is from 2.12 to 3.1.2. If you are using a version outside this range, the tool may produce inconsistent results. You could alter the version of the source code you are scanning.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](https://snowflakecomputing.atlassian.net/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/371/user-guide/project-overview/configuration-and-settings#report-an-issue).

## SPRKPY1001

Message\*\*:\*\* This code section has parsing errors

Category\*\*:\*\* Parsing error.

### Description

A parsing error is reported by the Snowpark Migration Accelerator (SMA) when it cannot correctly read or understand the code in a file (it cannot correctly “parse” the file). This issue code appears when a file has one or more parsing error(s).

#### Scenario

**Input:** The EWI message appears when the code has invalid syntax, for example:

```python
def foo():
    x = %%%%%%1###1
```

**Output:** SMA find a parsing error and comment the parsing error adding the corresponding EWI message:

```python
def foo():
    x
## EWI: SPRKPY1101 => Unrecognized or invalid CODE STATEMENT @(2, 7). Last valid token was 'x' @(2, 5), failed token '=' @(2, 7)
##      = %%%%%%1###1
```

### Additional recommendations

* Check that the file contains valid Python code. (You can use the issues.csv file to find all files with this EWI code to determine which file(s) were not processed by the tool due to parsing error(s).) Many parsing errors occur because only part of the code is input into the tool, so it’s bets to ensure that the code will run in the source. If it is valid, report that you encountered a parsing error using the Report an Issue option [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md). Include the line of code that was causing a parsing error in the description when you file this issue.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1002

Message\*\*:\*\* < element > is not supported,Spark element is not supported.

Category\*\*:\*\* Conversion error.

### Description

This issue appears when the tool detects the usage of an element that is not supported in Snowpark, and does not have it’s own error code associated with it. This is the generic error code used by the SMA for an unsupported element.

#### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It only means that the tool itself cannot find the solution.
* If you have encountered an unsupported element from a pyspark.ml library, consider some alternative approached. There are additional guides available to walkthrough issues related to ml such as this one from Snowflake.
* Check if the source code has the correct syntax. (You can use the issues.csv file to determine where the conversion error(s) are occurring.) If the syntax is correct, report that you encountered a conversion error on a particular element using the Report an Issue option in the SMA. Include the line of code that was causing the error in the description when you file this issue.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1003

Message\*\*:\*\* An error occurred when loading the symbol table.

Category\*\*:\*\* Conversion error.

### Description

This issue appears when there is an error processing the symbols in the symbol table. The symbol table is part of the underlying architecture of the SMA allowing for more complex conversions. This error could be due to an unexpected statement in the source code.

#### Additional recommendations

* This is unlikely to be an error in the source code itself, but rather is an error in how the tool processes the source code. The best resolution would be to post an issue [in the SMA](https://snowflakecomputing.atlassian.net/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/371/user-guide/project-overview/configuration-and-settings#report-an-issue).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](https://snowflakecomputing.atlassian.net/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/371/user-guide/project-overview/configuration-and-settings#report-an-issue).

## SPRKPY1004

Message\*\*:\*\* The symbol table could not be loaded.

Category\*\*:\*\* Parsing error.

### Description

This issue appears when there is an unexpected error in the tool execution process. Since the symbol table cannot be loaded, the tool cannot start the assessment or conversion process.

#### Additional recommendations

* This is unlikely to be an error in the source code itself, but rather is an error in how the tool processes the source code. The best resolution would be to reach out to [the SMA support team](../../../support/contact-us.md). You can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](https://snowflakecomputing.atlassian.net/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/371/user-guide/project-overview/configuration-and-settings#report-an-issue).

## SPRKPY1005

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message\*\*:\*\* pyspark.conf.SparkConf is not required

Category\*\*:\*\* Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.conf.SparkConf](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkConf.html) which is not required.

#### Scenario

**Input**

SparkConf could be called without parameters or with loadDefaults.

```python
from pyspark import SparkConf

my_conf = SparkConf(loadDefaults=True)
```

**Output**

For both cases (with or without parameters) SMA creates a [Snowpark Session.builder](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.Session.SessionBuilder.configs) object:

```python
#EWI: SPRKPY1005 => pyspark.conf.SparkConf is not required
#from pyspark import SparkConf
pass

#EWI: SPRKPY1005 => pyspark.conf.SparkConf is not required
my_conf = Session.builder.configs({"user" : "my_user", "password" : "my_password", "account" : "my_account", "role" : "my_role", "warehouse" : "my_warehouse", "database" : "my_database", "schema" : "my_schema"}).create()
```

#### Additional recommendations

* This is an unnecessary parameter being removed with a warning comment being inserted. There should be no additional action from the user.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1006

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message\*\*:\*\* pyspark.context.SparkContext is not required

Category\*\*:\*\* Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.context.SparkContext](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.html), which is not required in Snowflake.

#### Scenario

**Input**

In this example there are two context to create a connections to an Spark Cluster

```python
from pyspark import SparkContext

sql_context1 = SparkContext(my_sc1)
sql_context2 = SparkContext(sparkContext=my_sc2)
```

**Output**

Because there are no clusters on Snowflake the Context is not required, note that the variables my_sc1 and my_sc2 that contains Spark properties may be not required or it will to be adapted to fix the code.

```python
from snowflake.snowpark import Session
#EWI: SPRKPY1006 => pyspark.sql.context.SparkContext is not required
sql_context1 = my_sc1
#EWI: SPRKPY1006 => pyspark.sql.context.SparkContext is not required

sql_context2 = my_sc2
```

#### Additional recommendations

* This is an unnecessary parameter being removed with a warning comment being inserted. There should be no action from the user.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1007

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message\*\*:\*\* pyspark.sql.context.SQLContext is not required

Category\*\*:\*\* Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.context.SQLContext](https://downloads.apache.org/spark/docs/1.6.1/api/python/pyspark.sql.html), which is not required.

#### Scenario

**Input**

Here we have an example with different SparkContext overloads.

```python
from pyspark import SQLContext
​
my_sc1 = SQLContext(myMaster, myAppName, mySparkHome, myPyFiles, myEnvironment, myBatctSize, mySerializer, my_conf1)
my_sc2 = SQLContext(conf=my_conf2)
my_sc3 = SQLContext()
```

**Output**

The output code has commented the line for pyspark.SQLContext, and replaces the scenarios with a reference to a configuration. Note that the variables my_sc1 and my_sc2 that contains Spark properties may be not required or it will to be adapted to fix the code.

```python
#EWI: SPRKPY1007 => pyspark.sql.context.SQLContext is not required
#from pyspark import SQLContext
pass

#EWI: SPRKPY1007 => pyspark.sql.context.SQLContext is not required
sql_context1 = my_sc1
#EWI: SPRKPY1007 => pyspark.sql.context.SQLContext is not required
sql_context2 = my_sc2
```

#### Additional recommendations

* This is an unnecessary parameter being and is removed with a warning comment inserted into the source code. There should be no action from the user.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1008

Message: pyspark.sql.context.HiveContext is not required

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.context.HiveContext](https://downloads.apache.org/spark/docs/1.6.1/api/python/pyspark.sql.html#pyspark.sql.HiveContext), which is not required.

#### Scenario

**Input**

In this example an example to create a connection to an Hive store.

```python
from pyspark.sql import HiveContext
hive_context = HiveContext(sc)
df = hive_context.table("myTable")
df.show()
```

**Output**

In Snowflake there are not Hive stores, so the Hive Context is not required, You can still use parquet files on Snowflake please check this [tutorial](https://docs.snowflake.com/en/user-guide/tutorials/script-data-load-transform-parquet) to learn how.

```python
#EWI: SPRKPY1008 => pyspark.sql.context.HiveContext is not required
hive_context = sc
df = hive_context.table("myTable")
df.show()
```

the sc variable refers to a [Snow Park Session Object](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.Session)

**Recommended fix**

For the output code in the example you should add the [Snow Park Session Object](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.Session) similar to this code:

```python
## Here manually we can add the Snowpark Session object via a json config file called connection.json
import json
from snowflake.snowpark import Session
jsonFile = open("connection.json")
connection_parameter = json.load(jsonFile)
jsonFile.close()
sc = Session.builder.configs(connection_parameter).getOrCreate()

hive_context = sc
df = hive_context.table("myTable")
df.show()
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1009

Message\*\*:\*\* pyspark.sql.dataframe.DataFrame.approxQuantile has a workaround

Category\*\*:\*\* Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.dataframe.DataFrame.approxQuantile](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.approxQuantile.html) which has a workaround.

#### Scenario

**Input**

It’s important understand that Pyspark uses two different approxQuantile functions, here we use the [DataFrame approxQuantile](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.approxQuantile.html) version

```python
from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()
data = [['Sun', 10],
        ['Mon', 64],
        ['Thr', 12],
        ['Wen', 15],
        ['Thu', 68],
        ['Fri', 14],
        ['Sat', 13]]

columns = ['Day', 'Ammount']
df = spark.createDataFrame(data, columns)
df.approxQuantile('Ammount', [0.25, 0.5, 0.75], 0)
```

**Output**

SMA returns the EWI SPRKPY1009 over the line where approxQuantile is used, so you can use to identify where to fix.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Sun', 10],
        ['Mon', 64],
        ['Thr', 12],
        ['Wen', 15],
        ['Thu', 68],
        ['Fri', 14],
        ['Sat', 13]]

columns = ['Day', 'Ammount']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1009 => pyspark.sql.dataframe.DataFrame.approxQuantile has a workaround, see documentation for more info
df.approxQuantile('Ammount', [0.25, 0.5, 0.75], 0)
```

**Recommended fix**

Use [Snowpark approxQuantile](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrame.approxQuantile) method. Some parameters don’t match so they require some manual adjustments. for the output code’s example a recommended fix could be:

```python
from snowflake.snowpark import Session
...
df = spark.createDataFrame(data, columns)

df.stat.approx_quantile('Ammount', [0.25, 0.5, 0.75])
```

pyspark.sql.dataframe.DataFrame.approxQuantile’s relativeError parameter does’t exist in SnowPark.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1010

Message: pyspark.sql.dataframe.DataFrame.checkpoint has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.dataframe.DataFrame.checkpoint](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.checkpoint.html) which has a workaround.

#### Scenario

**Input**

In PySpark Checkpoints are used to truncate the logical plan of a dataframe, this to avoid the growing of a logical plan.

```python
import tempfile
from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Score']
df = spark.createDataFrame(data, columns)
with tempfile.TemporaryDirectory() as d:
    spark.sparkContext.setCheckpointDir("/tmp/bb")
    df.checkpoint(False)
```

**Output**

SMA returns the EWI SPRKPY1010 over the line where approxQuantile is used, so you can use to identify where to fix. Note that also marks the [setCheckpointDir](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.setCheckpointDir.html) as unsupported, but a checpointed directory is not required for the fix.

```python
import tempfile
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Score']
df = spark.createDataFrame(data, columns)
with tempfile.TemporaryDirectory() as d:
    #EWI: SPRKPY1002 => pyspark.context.SparkContext.setCheckpointDir is not supported
    spark.setCheckpointDir("/tmp/bb")
    #EWI: SPRKPY1010 => pyspark.sql.dataframe.DataFrame.checkpoint has a workaround, see documentation for more info
    df.checkpoint(False)
```

**Recommended fix**

Snowpark eliminates the need for explicit checkpoints: this because Snowpark works with SQL-based operations that are optimized by Snowflake query optimization engine eliminating the need for unrequited computations or logical plans that grow out of control.

However there could be scenarios where you would require persist the result of a computation on a dataframe. In this scenarios you can save materialize the results by writing the dataframe on a [Snowflake Table or in a Snowflake Temporary Table](https://docs.snowflake.com/en/user-guide/tables-temp-transient).

* By the use of a permanent table or the computed result can be accessed in any moment even after the session end.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Score']
df = spark.createDataFrame(data, columns)
df.write.save_as_table("my_table", table_type="temporary") # Save the dataframe into Snowflake table "my_table".
df2 = Session.table("my_table") # Now I can access the stored result quering the table "my_table"
```

* An alternative fix, the use of a Temporary table have the advantage that the table is deleted after the session ends:

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Score']
df = spark.createDataFrame(data, columns)
df.write.save_as_table("my_temp_table", table_type="temporary") # Save the dataframe into Snowflake table "my_temp_table".
df2 = Session.table("my_temp_table") # Now I can access the stored result quering the table "my_temp_table"
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1011

Message: pyspark.sql.dataframe.DataFrameStatFunctions.approxQuantile has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.dataframe.DataFrameStatFunctions.approxQuantile](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameStatFunctions.approxQuantile.html#pyspark.sql.DataFrameStatFunctions.approxQuantile) which has a workaround.

#### Scenario

**Input**

It’s important understand that Pyspark uses two different approxQuantile functions, here we use the [DataFrameStatFunctions approxQuantile](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameStatFunctions.approxQuantile.html#pyspark.sql.DataFrameStatFunctions.approxQuantile) version.

```python
import tempfile
from pyspark.sql import SparkSession, DataFrameStatFunctions
spark = SparkSession.builder.getOrCreate()
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Gain']
df = spark.createDataFrame(data, columns)
aprox_quantille = DataFrameStatFunctions(df).approxQuantile('Gain', [0.25, 0.5, 0.75], 0)
print(aprox_quantille)
```

**Output**

SMA returns the EWI SPRKPY1011 over the line where approxQuantile is used, so you can use to identify where to fix.

```python
import tempfile
from snowflake.snowpark import Session, DataFrameStatFunctions
spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 300000],
        ['Q2', 60000],
        ['Q3', 500002],
        ['Q4', 130000]]

columns = ['Quarter', 'Gain']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1011 => pyspark.sql.dataframe.DataFrameStatFunctions.approxQuantile has a workaround, see documentation for more info
aprox_quantille = DataFrameStatFunctions(df).approxQuantile('Gain', [0.25, 0.5, 0.75], 0)
```

**Recommended fix**

You can use [Snowpark approxQuantile](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrame.approxQuantile) method. Some parameters don’t match so they require some manual adjustments. for the output code’s example a recommended fix could be:

```python
from snowflake.snowpark import Session # remove DataFrameStatFunctions because is not required
...
df = spark.createDataFrame(data, columns)

aprox_quantille = df.stat.approx_quantile('Ammount', [0.25, 0.5, 0.75])
```

pyspark.sql.dataframe.DataFrame.approxQuantile’s relativeError parameter does’t exist in SnowPark.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1012

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.dataframe.DataFrameStatFunctions.writeTo has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.dataframe.DataFrameStatFunctions.writeTo](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.writeTo.html) which has a workaround.

#### Scenario

**Input**

For this example the dataframe df is writed to a Spark table “table”.

```python
writer = df.writeTo("table")
```

**Output**

SMA returns the EWI SPRKPY1012 over the line where DataFrameStatFunctions.writeTo is used, so you can use to identify where to fix.

```python
#EWI: SPRKPY1012 => pyspark.sql.dataframe.DataFrameStatFunctions.writeTo has a workaround, see documentation for more info
writer = df.writeTo("table")
```

**Recomended fix**

Use df.write.SaveAsTable() instead.

```python
import df.write as wt
writer = df.write.save_as_table(table)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1013

Message: pyspark.sql.functions.acosh has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.acosh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.acosh.html) which has a workaround.

#### Scenario

**Input**

On this example pyspark calculates the acosh for a dataframe by using [pyspark.sql.functions.acosh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.acosh.html)

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import acosh
spark = SparkSession.builder.getOrCreate()
data = [['V1', 30],
        ['V2', 60],
        ['V3', 50],
        ['V4', 13]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_with_acosh = df.withColumn("acosh_value", acosh(df["value"]))
```

**Output**

SMA returns the EWI SPRKPY1013 over the line where acosh is used, so you can use to identify where to fix.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 30],
        ['V2', 60],
        ['V3', 50],
        ['V4', 13]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1013 => pyspark.sql.functions.acosh has a workaround, see documentation for more info
df_with_acosh = df.withColumn("acosh_value", acosh(df["value"]))
```

**Recommended fix**

There is no direct “acosh” implementation but “[call_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.3.0/api/snowflake.snowpark.functions.call_function)” can be used instead, using “acosh” as the first parameter, and colName as the second one.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.snowpark.functions import call_function, col

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 30],
        ['V2', 60],
        ['V3', 50],
        ['V4', 13]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_with_acosh = df.select(call_function('ACOSH', col('value')))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1014

Message: pyspark.sql.functions.asinh has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.asinh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.asinh.html) which has a workaround.

#### Scenario

**Input**

On this example pyspark calculates the asinh for a dataframe by using [pyspark.sql.functions.asinh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.asinh.html).

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import asinh
spark = SparkSession.builder.getOrCreate()
data = [['V1', 3.0],
        ['V2', 60.0],
        ['V3', 14.0],
        ['V4', 3.1]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_result = df.withColumn("asinh_value", asinh(df["value"]))
```

**Output**

SMA returns the EWI SPRKPY1014 over the line where asinh is used, so you can use to identify where to fix.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 3.0],
        ['V2', 60.0],
        ['V3', 14.0],
        ['V4', 3.1]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1014 => pyspark.sql.functions.asinh has a workaround, see documentation for more info
df_result = df.withColumn("asinh_value", asinh(df["value"]))
```

**Recomended fix**

There is no direct “asinh” implementation but “[call_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.3.0/api/snowflake.snowpark.functions.call_function)” can be used instead, using “asinh” as the first parameter, and colName as the second one.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.snowpark.functions import call_function, col

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 3.0],
        ['V2', 60.0],
        ['V3', 14.0],
        ['V4', 3.1]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_result = df.select(call_function('asinh', col('value')))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1015

Message: pyspark.sql.functions.atanh has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.atanh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.atanh.html) which has a workaround.

#### Scenario

**Input**

On this example pyspark calculates the atanh for a dataframe by using [pyspark.sql.functions.atanh](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.atanh.html).

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import atanh
spark = SparkSession.builder.getOrCreate()
data = [['V1', 0.14],
        ['V2', 0.32],
        ['V3', 0.4],
        ['V4', -0.36]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_result = df.withColumn("atanh_value", atanh(df["value"]))
```

**Output**

SMA returns the EWI SPRKPY1015 over the line where atanh is used, so you can use to identify where to fix.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 0.14],
        ['V2', 0.32],
        ['V3', 0.4],
        ['V4', -0.36]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1015 => pyspark.sql.functions.atanh has a workaround, see documentation for more info
df_result = df.withColumn("atanh_value", atanh(df["value"]))
```

**Recommended fix**

There is no direct “atanh” implementation but “[call_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.3.0/api/snowflake.snowpark.functions.call_function)” can be used instead, using “atanh” as the first parameter, and colName as the second one.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.snowpark.functions import call_function, col

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['V1', 0.14],
        ['V2', 0.32],
        ['V3', 0.4],
        ['V4', -0.36]]

columns = ['Paremeter', 'value']
df = spark.createDataFrame(data, columns)
df_result = df.select(call_function('atanh', col('value')))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1016

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 0.11.7](../../../general/release-notes/README.md)

Message: pyspark.sql.functions.collect_set has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.collect_set](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.collect_set.html) which has a workaround.

#### Scenario

**Input**

Using collect\*set to get the elements of _colname\* without duplicates:

```python
col = collect_set(colName)
```

**Output**

SMA returns the EWI SPRKPY1016 over the line where collect_set is used, so you can use to identify where to fix.

```python
#EWI: SPRKPY1016 => pyspark.sql.functions.collect_set has a workaround, see documentation for more info
col = collect_set(colName)
```

**Recommended fix**

Use function array_agg, and add a second argument with the value True.

```python
col = array_agg(col, True)
```

#### Additional recommendation

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1017

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

pyspark.sql.functions.date_add has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.date_add](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.date_add.html) which has a workaround.

#### Scenario

**Input**

In this example we use date_add to calculate the date 5 days after the current date for the dataframe df.

```python
col = df.select(date_add(df.colName, 5))
```

**Output**

SMA returns the EWI SPRKPY1017 over the line where date_add is used, so you can use to identify where to fix.

```python
#EWI: SPRKPY1017 => pyspark.sql.functions.date_add has a workaround, see documentation for more info
col = df.select(date_add(df.colName, 5))
```

**Recommended fix**

Import snowflake.snowpark.functions, which contains an implementation for [date_add](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.date_add) (and alias dateAdd) function.

```python
from snowflake.snowpark.functions import date_add

col = df.select(date_add(df.dt, 1))
```

#### Additional recommendation

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1018

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message: pyspark.sql.functions.date_sub has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.date_sub](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.date_sub.html) which has a workaround.

#### Scenario

**Input**

In this example we use date_add to calculate the date 5 days before the current date for the dataframe df.

```python
col = df.select(date_sub(df.colName, 5))
```

**Output**

SMA returns the EWI SPRKPY1018 over the line where date_sub is used, so you can use to identify where to fix.

```python
#EWI: SPRKPY1018 => pyspark.sql.functions.date_sub has a workaround, see documentation for more info
col = df.select(date_sub(df.colName, 5))
```

**Recommended fix**

Import snowflake.snowpark.functions, which contains an implementation for [date_sub](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.date_sub) function.

```python
from pyspark.sql.functions import date_sub
df.withColumn("date", date_sub(df.colName, 5))
```

#### Additional recommendation

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1019

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message: pyspark.sql.functions.datediff has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.datediff](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.datediff.html) which has a workaround.

#### Scenario

**Input**

In this example we use datediff to calculate the diference in day from ‘today’ and others dates.

```python
contacts = (contacts
            #days since last event
            .withColumn('daysSinceLastEvent', datediff(lit(today),'lastEvent'))
            #days since deployment
            .withColumn('daysSinceLastDeployment', datediff(lit(today),'lastDeploymentEnd'))
            #days since online training
            .withColumn('daysSinceLastTraining', datediff(lit(today),'lastTraining'))
            #days since last RC login
            .withColumn('daysSinceLastRollCallLogin', datediff(lit(today),'adx_identity_lastsuccessfullogin'))
            #days since last EMS login
            .withColumn('daysSinceLastEMSLogin', datediff(lit(today),'vms_lastuserlogin'))
           )
```

**Output**

SMA returns the EWI SPRKPY1019 over the line where datediff is used, so you can use to identify where to fix.

```python
from pyspark.sql.functions import datediff
#EWI: SPRKPY1019 => pyspark.sql.functions.datediff has a workaround, see documentation for more info
contacts = (contacts
            #days since last event
            .withColumn('daysSinceLastEvent', datediff(lit(today),'lastEvent'))
            #days since deployment
            .withColumn('daysSinceLastDeployment', datediff(lit(today),'lastDeploymentEnd'))
            #days since online training
            .withColumn('daysSinceLastTraining', datediff(lit(today),'lastTraining'))
            #days since last RC login
            .withColumn('daysSinceLastRollCallLogin', datediff(lit(today),'adx_identity_lastsuccessfullogin'))
            #days since last EMS login
            .withColumn('daysSinceLastEMSLogin', datediff(lit(today),'vms_lastuserlogin'))
           )
```

SMA convert pyspark.sql.functions.datediff onto [snowflake.snowpark.functions.daydiff](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.daydiff) that also calculates the diference in days between two dates.

**Recommended fix**

**datediff(part: string ,end: ColumnOrName, start: ColumnOrName)**

**Action:** Import snowflake.snowpark.functions, which contains an implementation for [datediff](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.daydiff) function that requires an extra parameter for [date time part](https://docs.snowflake.com/en/sql-reference/functions-date-time#label-supported-date-time-parts) and allows more versatility on calculate differences between dates.

```python
from snowflake.snowpark import Session
from snowflake.snowpark.functions import datediff
contacts = (contacts
            #days since last event
            .withColumn('daysSinceLastEvent', datediff('day', lit(today),'lastEvent'))
            #days since deployment
            .withColumn('daysSinceLastDeployment', datediff('day',lit(today),'lastDeploymentEnd'))
            #days since online training
            .withColumn('daysSinceLastTraining', datediff('day', lit(today),'lastTraining'))
            #days since last RC login
            .withColumn('daysSinceLastRollCallLogin', datediff('day', lit(today),'adx_identity_lastsuccessfullogin'))
            #days since last EMS login
            .withColumn('daysSinceLastEMSLogin', datediff('day', lit(today),'vms_lastuserlogin'))
           )
```

#### Recommendation

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1020

Message: pyspark.sql.functions.instr has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.instr](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.instr.html) which has a workaround.

#### Scenario

**Input**

Here is a basic example of usage of pyspark instr:

```python
from pyspark.sql import SparkSession
from pyspark.sql.functions import instr
spark = SparkSession.builder.getOrCreate()
df = spark.createDataFrame([('abcd',)], ['test',])
df.select(instr(df.test, 'cd').alias('result')).collect()
```

**Output:**

SMA returns the EWI SPRKPY1020 over the line where instr is used, so you can use to identify where to fix.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
df = spark.createDataFrame([('abcd',)], ['test',])
#EWI: SPRKPY1020 => pyspark.sql.functions.instr has a workaround, see documentation for more info
df.select(instr(df.test, 'cd').alias('result')).collect()
```

**Recommended fix**

Requires a manual change by using the function [charindex](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.charindex) and changing the order of the first two parameters.

```python
import snowflake.snowpark as snowpark
from snowflake.snowpark import Session
from snowflake.snowpark.functions import charindex, lit

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
df = spark.createDataFrame([('abcd',)], ['test',])
df.select(charindex(lit('cd'), df.test).as_('result')).show()
```

#### Additional recommendation

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1021

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.last has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.last](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.last.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.last` function that generates this EWI. In this example, the `last` function is used to get the last **value** for each name.

```python
df = spark.createDataFrame([("Alice", 1), ("Bob", 2), ("Charlie", 3), ("Alice", 4), ("Bob", 5)], ["name", "value"])
df_grouped = df.groupBy("name").agg(last("value").alias("last_value"))
```

**Output**

The SMA adds the EWI `SPRKPY1021` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([("Alice", 1), ("Bob", 2), ("Charlie", 3), ("Alice", 4), ("Bob", 5)], ["name", "value"])
#EWI: SPRKPY1021 => pyspark.sql.functions.last has a workaround, see documentation for more info
df_grouped = df.groupBy("name").agg(last("value").alias("last_value"))
```

**Recommended fix**

As a workaround, you can use the Snowflake [LAST_VALUE](https://docs.snowflake.com/en/sql-reference/functions/last_value) function. To invoke this function from Snowpark, use the [snowflake.snowpark.functions.call_builtin](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.call_builtin) function and pass the string `last_value` as the first argument and the corresponding column as the second argument. If you were using the name of the column in the `last` function, you should convert it into a column when calling the `call_builtin` function.

```python
df = spark.createDataFrame([("Alice", 1), ("Bob", 2), ("Charlie", 3), ("Alice", 4), ("Bob", 5)], ["name", "value"])
df_grouped = df.groupBy("name").agg(call_builtin("last_value", col("value")).alias("last_value"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---

description: >-
The `mode` parameter in the methods of CSV, JSON and PARQUET is transformed to
`overwrite`

---

## SPRKPY1022

Message: pyspark.sql.functions.log10 has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.log10](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.log10.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.log10` function that generates this EWI. In this example, the `log10` function is used to calculate the base-10 logarithm of the **value** column.

```python
df = spark.createDataFrame([(1,), (10,), (100,), (1000,), (10000,)], ["value"])
df_with_log10 = df.withColumn("log10_value", log10(df["value"]))
```

**Output**

The SMA adds the EWI `SPRKPY1022` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(1,), (10,), (100,), (1000,), (10000,)], ["value"])
#EWI: SPRKPY1022 => pyspark.sql.functions.log10 has a workaround, see documentation for more info
df_with_log10 = df.withColumn("log10_value", log10(df["value"]))
```

**Recommended fix**

As a workaround, you can use the [snowflake.snowpark.functions.log](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.log) function by passing the literal value `10` as the base.

```python
df = spark.createDataFrame([(1,), (10,), (100,), (1000,), (10000,)], ["value"])
df_with_log10 = df.withColumn("log10_value", log(10, df["value"]))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1023

Message: pyspark.sql.functions.log1p has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.log1p](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.log1p.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.log1p` function that generates this EWI. In this example, the `log1p` function is used to calculate the natural logarithm of the **value** column.

```python
df = spark.createDataFrame([(0,), (1,), (10,), (100,)], ["value"])
df_with_log1p = df.withColumn("log1p_value", log1p(df["value"]))
```

**Output**

The SMA adds the EWI `SPRKPY1023` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(0,), (1,), (10,), (100,)], ["value"])
#EWI: SPRKPY1023 => pyspark.sql.functions.log1p has a workaround, see documentation for more info
df_with_log1p = df.withColumn("log1p_value", log1p(df["value"]))
```

**Recommended fix**

As a workaround, you can use the [call_function](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.call_function) function by passing the string `ln` as the first argument and by adding `1` to the second argument.

```python
df = spark.createDataFrame([(0,), (1,), (10,), (100,)], ["value"])
df_with_log1p = df.withColumn("log1p_value", call_function("ln", lit(1) + df["value"]))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1024

Message: pyspark.sql.functions.log2 has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.log2](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.log2.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.log2` function that generates this EWI. In this example, the `log2` function is used to calculate the base-2 logarithm of the **value** column.

```python
df = spark.createDataFrame([(1,), (2,), (4,), (8,), (16,)], ["value"])
df_with_log2 = df.withColumn("log2_value", log2(df["value"]))
```

**Output**

The SMA adds the EWI `SPRKPY1024` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(1,), (2,), (4,), (8,), (16,)], ["value"])
#EWI: SPRKPY1024 => pyspark.sql.functions.log2 has a workaround, see documentation for more info
df_with_log2 = df.withColumn("log2_value", log2(df["value"]))
```

**Recommended fix**

As a workaround, you can use the [snowflake.snowpark.functions.log](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.log) function by passing the literal value `2` as the base.

```python
df = session.createDataFrame([(1,), (2,), (4,), (8,), (16,)], ["value"])
df_with_log2 = df.withColumn("log2_value", log(2, df["value"]))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1025

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.ntile has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.ntile](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.ntile.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.ntile` function that generates this EWI. In this example, the `ntile` function is used to divide the rows into 3 buckets.

```python
df = spark.createDataFrame([("Alice", 50), ("Bob", 30), ("Charlie", 60), ("David", 90), ("Eve", 70), ("Frank", 40)], ["name", "score"])
windowSpec = Window.orderBy("score")
df_with_ntile = df.withColumn("bucket", ntile(3).over(windowSpec))
```

**Output**

The SMA adds the EWI `SPRKPY1025` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([("Alice", 50), ("Bob", 30), ("Charlie", 60), ("David", 90), ("Eve", 70), ("Frank", 40)], ["name", "score"])
windowSpec = Window.orderBy("score")
#EWI: SPRKPY1025 => pyspark.sql.functions.ntile has a workaround, see documentation for more info
df_with_ntile = df.withColumn("bucket", ntile(3).over(windowSpec))
```

**Recommended fix**

Snowpark has an equivalent [ntile](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.ntile) function, however, the argument pass to it should be a column. As a workaround, you can convert the literal argument into a column using the [snowflake.snowpark.functions.lit](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.lit) function.

```python
df = spark.createDataFrame([("Alice", 50), ("Bob", 30), ("Charlie", 60), ("David", 90), ("Eve", 70), ("Frank", 40)], ["name", "score"])
windowSpec = Window.orderBy("score")
df_with_ntile = df.withColumn("bucket", ntile(lit(3)).over(windowSpec))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1026

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core 4.3.2](../../../general/release-notes/README.md)

Message: pyspark.sql.readwriter.DataFrameReader.csv has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.readwriter.DataFrameReader.csv](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.csv.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.readwriter.DataFrameReader.csv` function that generates this EWI. In this example, the `csv` function is used to read multiple `.csv` files with a given schema and uses some extra options such as **encoding**, **header** and **sep** to fine-tune the behavior of reading the files.

```python
file_paths = [
  "path/to/your/file1.csv",
  "path/to/your/file2.csv",
  "path/to/your/file3.csv",
]

df = session.read.csv(
  file_paths,
  schema=my_schema,
  encoding="UTF-8",
  header=True,
  sep=","
)
```

**Output**

The SMA adds the EWI `SPRKPY1026` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
file_paths = [
  "path/to/your/file1.csv",
  "path/to/your/file2.csv",
  "path/to/your/file3.csv",
]

#EWI: SPRKPY1026 => pyspark.sql.readwriter.DataFrameReader.csv has a workaround, see documentation for more info
df = session.read.csv(
  file_paths,
  schema=my_schema,
  encoding="UTF-8",
  header=True,
  sep=","
)
```

**Recommended fix**

In this section, we explain how to configure the `path` parameter, the `schema` parameter and some `options` to make them work in Snowpark.

**1. path parameter**

Snowpark requires the **path** parameter to be a stage location so, as a workaround, you can create a temporary stage and add each `.csv` file to that stage using the prefix `file://`.

**2. schema parameter**

Snowpark does not allow defining the **schema** as a parameter of the `csv` function. As a workaround, you can use the [snowflake.snowpark.DataFrameReader.schema](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.schema) function.

**3. options parameters**

Snowpark does not allow defining the **extra options** as parameters of the `csv` function. As a workaround, for many of them you can use the [snowflake.snowpark.DataFrameReader.option](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.option) function to specify those parameters as options of the DataFrameReader.

> **Note:**
>
> The following options are **not supported** by Snowpark:
>
> * columnNameOfCorruptRecord
> * emptyValue
> * enforceSchema
> * header
> * ignoreLeadingWhiteSpace
> * ignoreTrailingWhiteSpace
> * inferSchema
> * locale
> * maxCharsPerColumn
> * maxColumns
> * mode
> * multiLine
> * nanValue
> * negativeInf
> * nullValue
> * positiveInf
> * quoteAll
> * samplingRatio
> * timestampNTZFormat
> * unescapedQuoteHandling

Below is the full example of how the input code should look like after applying the suggestions mentioned above to make it work in Snowpark:

```python
stage = f'{session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {stage}')

session.file.put(f"file:///path/to/your/file1.csv", f"@{stage}")
session.file.put(f"file:///path/to/your/file2.csv", f"@{stage}")
session.file.put(f"file:///path/to/your/file3.csv", f"@{stage}")

df = session.read.schema(my_schema).option("encoding", "UTF-8").option("sep", ",").csv(stage)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1027

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core 4.5.2](../../../general/release-notes/README.md)

Message: pyspark.sql.readwriter.DataFrameReader.json has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.readwriter.DataFrameReader.json](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.json.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.readwriter.DataFrameReader.json` function that generates this EWI. In this example, the `json` function is used to read multiple `.json` files with a given schema and uses some extra options such as **primitiveAsString** and **dateFormat** to fine-tune the behavior of reading the files.

```python
file_paths = [
  "path/to/your/file1.json",
  "path/to/your/file2.json",
  "path/to/your/file3.json",
]

df = session.read.json(
  file_paths,
  schema=my_schema,
  primitiveAsString=True,
  dateFormat="2023-06-20"
)
```

**Output**

The SMA adds the EWI `SPRKPY1027` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
file_paths = [
  "path/to/your/file1.json",
  "path/to/your/file2.json",
  "path/to/your/file3.json",
]

#EWI: SPRKPY1027 => pyspark.sql.readwriter.DataFrameReader.json has a workaround, see documentation for more info
df = session.read.json(
  file_paths,
  schema=my_schema,
  primitiveAsString=True,
  dateFormat="2023-06-20"
)
```

**Recommended fix**

In this section, we explain how to configure the `path` parameter, the `schema` parameter and some `options` to make them work in Snowpark.

**1. path parameter**

Snowpark requires the **path** parameter to be a stage location so, as a workaround, you can create a temporary stage and add each `.json` file to that stage using the prefix `file://`.

**2. schema parameter**

Snowpark does not allow defining the **schema** as a parameter of the `json` function. As a workaround, you can use the [snowflake.snowpark.DataFrameReader.schema](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.schema) function.

**3. options parameters**

Snowpark does not allow defining the **extra options** as parameters of the `json` function. As a workaround, for many of them you can use the [snowflake.snowpark.DataFrameReader.option](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.option) function to specify those parameters as options of the DataFrameReader.

> **Note:**
>
> The following options are not supported by Snowpark:
>
> * allowBackslashEscapingAnyCharacter
> * allowComments
> * allowNonNumericNumbers
> * allowNumericLeadingZero
> * allowSingleQuotes
> * allowUnquotedControlChars
> * allowUnquotedFieldNames
> * columnNameOfCorruptRecord
> * dropFiledIfAllNull
> * encoding
> * ignoreNullFields
> * lineSep
> * locale
> * mode
> * multiline
> * prefersDecimal
> * primitiveAsString
> * samplingRatio
> * timestampNTZFormat
> * timeZone

Below is the full example of how the input code should look like after applying the suggestions mentioned above to make it work in Snowpark:

```python
stage = f'{session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {stage}')

session.file.put(f"file:///path/to/your/file1.json", f"@{stage}")
session.file.put(f"file:///path/to/your/file2.json", f"@{stage}")
session.file.put(f"file:///path/to/your/file3.json", f"@{stage}")

df = session.read.schema(my_schema).option("dateFormat", "2023-06-20").json(stage)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1028

Message: pyspark.sql.readwriter.DataFrameReader.orc has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.readwriter.DataFrameReader.orc](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.orc.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.readwriter.DataFrameReader.orc` function that generates this EWI. In this example, the `orc` function is used to read multiple `.orc` files and uses some extra options such as **mergeSchema** and **recursiveFileLookup** to fine-tune the behavior of reading the files.

```python
file_paths = [
  "path/to/your/file1.orc",
  "path/to/your/file2.orc",
  "path/to/your/file3.orc",
]

df = session.read.orc(
  file_paths,
  mergeSchema="True",
  recursiveFileLookup="True"
)
```

**Output**

The SMA adds the EWI `SPRKPY1028` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
file_paths = [
  "path/to/your/file1.orc",
  "path/to/your/file2.orc",
  "path/to/your/file3.orc",
]

#EWI: SPRKPY1028 => pyspark.sql.readwriter.DataFrameReader.orc has a workaround, see documentation for more info
df = session.read.orc(
  file_paths,
  mergeSchema="True",
  recursiveFileLookup="True"
)
```

**Recommended fix**

In this section, we explain how to configure the `path` parameter and the extra `options` to make them work in Snowpark.

**1. path parameter**

Snowpark requires the **path** parameter to be a stage location so, as a workaround, you can create a temporary stage and add each `.orc` file to that stage using the prefix `file://`.

**2. options parameters**

Snowpark does not allow defining the **extra options** as parameters of the `orc` function. As a workaround, for many of them you can use the [snowflake.snowpark.DataFrameReader.option](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.option) function to specify those parameters as options of the DataFrameReader.

> **Note:**
>
> The following options are not supported by Snowpark:
>
> * compression
> * mergeSchema

Below is the full example of how the input code should look like after applying the suggestions mentioned above to make it work in Snowpark:

```python
stage = f'{session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {stage}')

session.file.put(f"file:///path/to/your/file1.orc", f"@{stage}")
session.file.put(f"file:///path/to/your/file2.orc", f"@{stage}")
session.file.put(f"file:///path/to/your/file3.orc", f"@{stage}")

df = session.read.option(recursiveFileLookup, "True").orc(stage)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1029

Message: This issue appears when the tool detects the usage of pyspark.sql.readwriter.DataFrameReader.parquet. This function is supported, but some of the differences between Snowpark and the Spark API might require making some manual changes.

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.readwriter.DataFrameReader.parquet](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.parquet.html) function. This function is supported by Snowpark, however, there are some differences that would require some manual changes.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.readwriter.DataFrameReader.parquet` function that generates this EWI.

```python
file_paths = [
  "path/to/your/file1.parquet",
  "path/to/your/file2.parquet",
  "path/to/your/file3.parquet",
]

df = session.read.parquet(
  *file_paths,
  mergeSchema="true",
  pathGlobFilter="*file*",
  recursiveFileLookup="true",
  modifiedBefore="2024-12-31T00:00:00",
  modifiedAfter="2023-12-31T00:00:00"
)
```

**Output**

The SMA adds the EWI `SPRKPY1029` to the output code to let you know that this function is supported by Snowpark, but it requires some manual adjustments. Please note that the options supported by Snowpark are transformed into `option` function calls and those that are not supported are removed. This is explained in more detail in the next sections.

```python
file_paths = [
  "path/to/your/file1.parquet",
  "path/to/your/file2.parquet",
  "path/to/your/file3.parquet"
]

#EWI: SPRKPY1076 => Some of the included parameters are not supported in the parquet function, the supported ones will be added into a option method.
#EWI: SPRKPY1029 => This issue appears when the tool detects the usage of pyspark.sql.readwriter.DataFrameReader.parquet. This function is supported, but some of the differences between Snowpark and the Spark API might require making some manual changes.
df = session.read.option("PATTERN", "*file*").parquet(
  *file_paths
)
```

**Recommended fix**

In this section, we explain how to configure the `paths` and `options` parameters to make them work in Snowpark.

**1. paths parameter**

In Spark, this parameter can be a local or cloud location. Snowpark only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage). So, you can create a temporal stage and add each file into it using the prefix `file://`.

**2. options parameter**

Snowpark does not allow defining the different **options** as parameters of the `parquet` function. As a workaround, you can use the [option](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.option) or [options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.options) functions to specify those parameters as extra options of the DataFrameReader.

Please note that the Snowpark **options** are not exactly the same as the PySpark **options** so some manual changes might be needed. Below is a more detailed explanation of how to configure the most common PySpark options in Snowpark.

**2.1 mergeSchema option**

Parquet supports schema evolution, allowing users to start with a simple schema and gradually add more columns as needed. This can result in multiple parquet files with different but compatible schemas. In Snowflake, thanks to the [infer_schema](https://docs.snowflake.com/en/sql-reference/functions/infer_schema) capabilities you don’t need to do that and therefore the `mergeSchema` option can just be removed.

**2.2 pathGlobFilter option**

If you want to load only a subset of files from the stage, you can use the `pattern` option to specify a regular expression that matches the files you want to load. The SMA already automates this as you can see in the output of this scenario.

**2.3 recursiveFileLookupstr option**

This option is not supported by Snowpark. The best recommendation is to use a regular expression like with the `pathGlobFilter` option to achieve something similar.

**2.4 modifiedBefore / modifiedAfter option**

You can achieve the same result in Snowflake by using the `metadata` columns.

> **Note:**
>
> The following options are not supported by Snowpark:
>
> * compression
> * datetimeRebaseMode
> * int96RebaseMode
> * mergeSchema

Below is the full example of how the input code should be transformed in order to make it work in Snowpark:

```python
from snowflake.snowpark.column import METADATA_FILE_LAST_MODIFIED, METADATA_FILENAME

temp_stage = f'{session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}')

session.file.put(f"file:///path/to/your/file1.parquet", f"@{temp_stage}")
session.file.put(f"file:///path/to/your/file2.parquet", f"@{temp_stage}")
session.file.put(f"file:///path/to/your/file3.parquet", f"@{temp_stage}")

df = session.read \
  .option("PATTERN", ".*file.*") \
  .with_metadata(METADATA_FILENAME, METADATA_FILE_LAST_MODIFIED) \
  .parquet(temp_stage) \
  .where(METADATA_FILE_LAST_MODIFIED < '2024-12-31T00:00:00') \
  .where(METADATA_FILE_LAST_MODIFIED > '2023-12-31T00:00:00')
```

#### Additional recommendations

* In Snowflake, you can leverage other approaches for parquet data ingestion, such as:

  + Leveraging [native parquet ingestion capabilities](https://docs.snowflake.com/en/user-guide/tutorials/script-data-load-transform-parquet). Consider also [autoingest with snowpipe](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-auto-s3).
  + Parquet [external tables](https://docs.snowflake.com/en/user-guide/tables-external-intro) which can be pointed directly to cloud file locations.
  + Using [Iceberg tables](https://docs.snowflake.com/LIMITEDACCESS/iceberg/tables-iceberg-parquet-source).
* When doing a migration is a good practice to leverage the SMA reports to try to build an inventory of files and determine after modernization to which stages/tables will the data be mapped.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1030

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.session.SparkSession.Builder.appName has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.session.SparkSession.Builder.appName](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.builder.appName.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.session.SparkSession.Builder.appName` function that generates this EWI. In this example, the `appName` function is used to set **MyApp** as the name of the application.

```python
session = SparkSession.builder.appName("MyApp").getOrCreate()
```

**Output**

The SMA adds the EWI `SPRKPY1030` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
#EWI: SPRKPY1030 => pyspark.sql.session.SparkSession.Builder.appName has a workaround, see documentation for more info
session = Session.builder.appName("MyApp").getOrCreate()
```

**Recommended fix**

As a workaround, you can import the [snowpark_extensions](https://pypi.org/project/snowpark-extensions/) package which provides an extension for the `appName` function.

```python
import snowpark_extensions
session = SessionBuilder.appName("MyApp").getOrCreate()
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1031

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core 2.7.0](../../../general/release-notes/old-version-release-notes/sc-spark-python-release-notes/README.md)

Message: pyspark.sql.column.Column.contains has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.column.Column.contains](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.Column.contains.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.column.Column.contains` function that generates this EWI. In this example, the `contains` function is used to filter the rows where the ‘City’ column contains the substring ‘New’.

```python
df = spark.createDataFrame([("Alice", "New York"), ("Bob", "Los Angeles"), ("Charlie", "Chicago")], ["Name", "City"])
df_filtered = df.filter(col("City").contains("New"))
```

**Output**

The SMA adds the EWI `SPRKPY1031` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([("Alice", "New York"), ("Bob", "Los Angeles"), ("Charlie", "Chicago")], ["Name", "City"])
#EWI: SPRKPY1031 => pyspark.sql.column.Column.contains has a workaround, see documentation for more info
df_filtered = df.filter(col("City").contains("New"))
```

**Recommended fix**

As a workaround, you can use the [snowflake.snowpark.functions.contains](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.contains) function by passing the column as the first argument and the element to search as the second argument. If the element to search is a literal value then it should be converted into a column expression using the `lit` function.

```python
from snowflake.snowpark import functions as f
df = spark.createDataFrame([("Alice", "New York"), ("Bob", "Los Angeles"), ("Charlie", "Chicago")], ["Name", "City"])
df_filtered = df.filter(f.contains(col("City"), f.lit("New")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1032

Message: **\*spark element\*** is not defined

Category: Conversion error

### Description

This issue appears when the SMA could not determine an appropriate mapping status for the given element. This means, the SMA doesn’t know yet if this element is supported or not by Snowpark. Please note, this is a generic error code used by the SMA for any not defined element.

#### Scenario

**Input**

Below is an example of a function for which the SMA could not determine an appropriate mapping status. In this case, you should assume that `not_defined_function()` is a valid PySpark function and the code runs.

```python
sc.parallelize(["a", "b", "c", "d", "e"], 3).not_defined_function().collect()
```

**Output**

The SMA adds the EWI `SPRKPY1032` to the output code to let you know that this element is not defined.

```python
#EWI: SPRKPY1032 => pyspark.rdd.RDD.not_defined_function is not defined
sc.parallelize(["a", "b", "c", "d", "e"], 3).not_defined_function().collect()
```

**Recommended fix**

To try to identify the problem, you can perform the following validations:

* Check if the source code has the correct syntax, and it is spelled correctly.
* Check if you are using a PySpark version supported by the SMA. To know which PySpark version is supported by the SMA at the moment of running the SMA, you can review the first page of the `DetailedReport.docx` file.

If this is a valid PySpark element, please report that you encountered a conversion error on that particular element using the [Report an Issue](../../../user-guide/project-overview/configuration-and-settings.md) option of the SMA and include any additional information that you think may be helpful.

Please note that if an element is not defined, it does not mean that it is not supported by Snowpark. You should check the [Snowpark Documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index) to verify if an equivalent element exist.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1033

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.asc has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.asc](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.asc.html) function, which has a workaround.

#### Scenarios

The `pyspark.sql.functions.asc` function takes either a column object or the name of the column as a string as its parameter. Both scenarios are not supported by Snowpark so this EWI is generated.

##### Scenario 1

**Input**

Below is an example of a use of the `pyspark.sql.functions.asc` function that takes a column object as parameter.

```python
df.orderBy(asc(col))
```

**Output**

The SMA adds the EWI `SPRKPY1033` to the output code to let you know that the `asc` function with a column object parameter is not directly supported by Snowpark, but it has a workaround.

```python
#EWI: SPRKPY1033 => pyspark.sql.functions.asc has a workaround, see documentation for more info
df.orderBy(asc(col))
```

**Recommended fix**

As a workaround, you can call the [snowflake.snowpark.Column.asc](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Column.asc) function from the column parameter.

```python
df.orderBy(col.asc())
```

##### Scenario 2

**Input**

Below is an example of a use of the `pyspark.sql.functions.asc` function that takes the name of the column as parameter.

```python
df.orderBy(asc("colName"))
```

**Output**

The SMA adds the EWI `SPRKPY1033` to the output code to let you know that the `asc` function with a column name parameter is not directly supported by Snowpark, but it has a workaround.

```python
#EWI: SPRKPY1033 => pyspark.sql.functions.asc has a workaround, see documentation for more info
df.orderBy(asc("colName"))
```

**Recommended fix**

As a workaround, you can convert the string parameter into a column object using the [snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.col) function and then call the [snowflake.snowpark.Column.asc](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Column.asc) function.

```python
df.orderBy(col("colName").asc())
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1034

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.desc has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.desc](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.desc.html) function, which has a workaround.

#### Scenarios

The `pyspark.sql.functions.desc` function takes either a column object or the name of the column as a string as its parameter. Both scenarios are not supported by Snowpark so this EWI is generated.

##### Scenario 1

**Input**

Below is an example of a use of the `pyspark.sql.functions.desc` function that takes a column object as parameter.

```python
df.orderBy(desc(col))
```

**Output**

The SMA adds the EWI `SPRKPY1034` to the output code to let you know that the `desc` function with a column object parameter is not directly supported by Snowpark, but it has a workaround.

```python
#EWI: SPRKPY1034 => pyspark.sql.functions.desc has a workaround, see documentation for more info
df.orderBy(desc(col))
```

**Recommended fix**

As a workaround, you can call the [snowflake.snowpark.Column.desc](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Column.desc) function from the column parameter.

```python
df.orderBy(col.desc())
```

##### Scenario 2

**Input**

Below is an example of a use of the `pyspark.sql.functions.desc` function that takes the name of the column as parameter.

```python
df.orderBy(desc("colName"))
```

**Output**

The SMA adds the EWI `SPRKPY1034` to the output code to let you know that the `desc` function with a column name parameter is not directly supported by Snowpark, but it has a workaround.

```python
#EWI: SPRKPY1034 => pyspark.sql.functions.desc has a workaround, see documentation for more info
df.orderBy(desc("colName"))
```

**Recommended fix**

As a workaround, you can convert the string parameter into a column object using the [snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.col) function and then call the [snowflake.snowpark.Column.desc](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.Column.desc) function.

```python
df.orderBy(col("colName").desc())
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1035

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.reverse has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.reverse](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.reverse.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.reverse` function that generates this EWI. In this example, the `reverse` function is used to reverse each string of the **word** column.

```python
df = spark.createDataFrame([("hello",), ("world",)], ["word"])
df_reversed = df.withColumn("reversed_word", reverse(df["word"]))
df_reversed = df.withColumn("reversed_word", reverse("word"))
```

**Output**

The SMA adds the EWI `SPRKPY1035` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([("hello",), ("world",)], ["word"])
#EWI: SPRKPY1035 => pyspark.sql.functions.reverse has a workaround, see documentation for more info
df_reversed = df.withColumn("reversed_word", reverse(df["word"]))
#EWI: SPRKPY1035 => pyspark.sql.functions.reverse has a workaround, see documentation for more info
df_reversed = df.withColumn("reversed_word", reverse("word"))
```

**Recommended fix**

As a workaround, you can import the [snowpark_extensions](https://pypi.org/project/snowpark-extensions/) package which provides an extension for the `reverse` function.

```python
import snowpark_extensions

df = spark.createDataFrame([("hello",), ("world",)], ["word"])
df_reversed = df.withColumn("reversed_word", reverse(df["word"]))
df_reversed = df.withColumn("reversed_word", reverse("word"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1036

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.column.Column.getField has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.column.Column.getField](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.Column.getField.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.column.Column.getField` function that generates this EWI. In this example, the `getField` function is used to extract the **name** from the **info** column.

```python
df = spark.createDataFrame([(1, {"name": "John", "age": 30}), (2, {"name": "Jane", "age": 25})], ["id", "info"])
df_with_name = df.withColumn("name", col("info").getField("name"))
```

**Output**

The SMA adds the EWI `SPRKPY1036` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(1, {"name": "John", "age": 30}), (2, {"name": "Jane", "age": 25})], ["id", "info"])
#EWI: SPRKPY1036 => pyspark.sql.column.Column.getField has a workaround, see documentation for more info
df_with_name = df.withColumn("name", col("info").getField("name"))
```

**Recommended fix**

As a workaround, you can use the [Snowpark column indexer operator](https://docs.snowflake.com/ko/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.Column#snowflake.snowpark.Column) with the name of the field as the index.

```python
df = spark.createDataFrame([(1, {"name": "John", "age": 30}), (2, {"name": "Jane", "age": 25})], ["id", "info"])
df_with_name = df.withColumn("name", col("info")["name"])
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1037

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.sort_array has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.sort_array](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.sort_array.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.sort_array` function that generates this EWI. In this example, the `sort_array` function is used to sort the **numbers** array in ascending and descending order.

```python
df = spark.createDataFrame([(1, [3, 1, 2]), (2, [10, 5, 8]), (3, [6, 4, 7])], ["id", "numbers"])
df_sorted_asc = df.withColumn("sorted_numbers_asc", sort_array("numbers", asc=True))
df_sorted_desc = df.withColumn("sorted_numbers_desc", sort_array("numbers", asc=False))
```

**Output**

The SMA adds the EWI `SPRKPY1037` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(1, [3, 1, 2]), (2, [10, 5, 8]), (3, [6, 4, 7])], ["id", "numbers"])
#EWI: SPRKPY1037 => pyspark.sql.functions.sort_array has a workaround, see documentation for more info
df_sorted_asc = df.withColumn("sorted_numbers_asc", sort_array("numbers", asc=True))
#EWI: SPRKPY1037 => pyspark.sql.functions.sort_array has a workaround, see documentation for more info
df_sorted_desc = df.withColumn("sorted_numbers_desc", sort_array("numbers", asc=False))
```

**Recommended fix**

As a workaround, you can import the [snowpark_extensions](https://pypi.org/project/snowpark-extensions/) package which provides an extension for the `sort_array` function.

```python
import snowpark_extensions

df = spark.createDataFrame([(1, [3, 1, 2]), (2, [10, 5, 8]), (3, [6, 4, 7])], ["id", "numbers"])
df_sorted_asc = df.withColumn("sorted_numbers_asc", sort_array("numbers", asc=True))
df_sorted_desc = df.withColumn("sorted_numbers_desc", sort_array("numbers", asc=False))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1038

Message: **\*spark element\*** is not yet recognized

Category: Conversion error

### Description

This issue appears when there is a PySpark element in your source code that was not recognized by the SMA. This can occur for different reasons, such as:

* An element that does not exist in PySpark.
* An element that was added in a PySpark version that the SMA does not support yet.
* An internal error of the SMA when processing the element.

This is a generic error code used by the SMA for any not recognized element.

#### Scenario

**Input**

Below is an example of a use of a function that could not be recognized by the SMA because it does not exist in PySpark.

```python
from pyspark.sql import functions as F
F.unrecognized_function()
```

**Output**

The SMA adds the EWI `SPRKPY1038` to the output code to let you know that this element could not be recognized.

```python
from snowflake.snowpark import functions as F
#EWI: SPRKPY1038 => pyspark.sql.functions.non_existent_function is not yet recognized
F.unrecognized_function()
```

**Recommended fix**

To try to identify the problem, you can perform the following validations:

* Check if the element exists in PySpark.
* Check if the element is spelled correctly.
* Check if you are using a PySpark version supported by the SMA. To know which PySpark version is supported by the SMA at the moment of running the SMA, you can review the first page of the `DetailedReport.docx` file.

If it is a valid PySpark element, please report that you encountered a conversion error on that particular element using the [Report an Issue](../../../user-guide/project-overview/configuration-and-settings.md) option of the SMA and include any additional information that you think may be helpful.

Please note that if an element could not be recognized by the SMA, it does not mean that it is not supported by Snowpark. You should check the [Snowpark Documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/index) to verify if an equivalent element exist.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1039

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.column.Column.getItem has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.column.Column.getItem](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.Column.getItem.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.column.Column.getItem` function that generates this EWI. In this example, the `getItem` function is used to get an item by position and by key.

```python
df = spark.createDataFrame([(1, ["apple", "banana", "orange"]), (2, ["carrot", "avocado", "banana"])], ["id", "fruits"])
df.withColumn("first_fruit", col("fruits").getItem(0))

df = spark.createDataFrame([(1, {"apple": 10, "banana": 20}), (2, {"carrot": 15, "grape": 25}), (3, {"pear": 30, "apple": 35})], ["id", "fruit_quantities"])
df.withColumn("apple_quantity", col("fruit_quantities").getItem("apple"))
```

**Output**

The SMA adds the EWI `SPRKPY1039` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([(1, ["apple", "banana", "orange"]), (2, ["carrot", "avocado", "banana"])], ["id", "fruits"])
#EWI: SPRKPY1039 => pyspark.sql.column.Column.getItem has a workaround, see documentation for more info
df.withColumn("first_fruit", col("fruits").getItem(0))

df = spark.createDataFrame([(1, {"apple": 10, "banana": 20}), (2, {"carrot": 15, "grape": 25}), (3, {"pear": 30, "apple": 35})], ["id", "fruit_quantities"])
#EWI: SPRKPY1039 => pyspark.sql.column.Column.getItem has a workaround, see documentation for more info
df.withColumn("apple_quantity", col("fruit_quantities").getItem("apple"))
```

**Recommended fix**

As a workaround, you can use the **Snowpark column indexer operator** with the name or position of the field as the index.

```python
df = spark.createDataFrame([(1, ["apple", "banana", "orange"]), (2, ["carrot", "avocado", "banana"])], ["id", "fruits"])
df.withColumn("first_fruit", col("fruits")[0])

df = spark.createDataFrame([(1, {"apple": 10, "banana": 20}), (2, {"carrot": 15, "grape": 25}), (3, {"pear": 30, "apple": 35})], ["id", "fruit_quantities"])
df.withColumn("apple_quantity", col("fruit_quantities")["apple"])
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1040

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.functions.explode has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [pyspark.sql.functions.explode](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.explode.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.functions.explode` function that generates this EWI. In this example, the `explode` function is used to generate one row per array item for the **numbers** column.

```python
df = spark.createDataFrame([("Alice", [1, 2, 3]), ("Bob", [4, 5]), ("Charlie", [6, 7, 8, 9])], ["name", "numbers"])
exploded_df = df.select("name", explode(df.numbers).alias("number"))
```

**Output**

The SMA adds the EWI `SPRKPY1040` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```python
df = spark.createDataFrame([("Alice", [1, 2, 3]), ("Bob", [4, 5]), ("Charlie", [6, 7, 8, 9])], ["name", "numbers"])
#EWI: SPRKPY1040 => pyspark.sql.functions.explode has a workaround, see documentation for more info
exploded_df = df.select("name", explode(df.numbers).alias("number"))
```

**Recommended fix**

As a workaround, you can import the [snowpark_extensions](https://pypi.org/project/snowpark-extensions/) package which provides an extension for the `explode` function.

```python
import snowpark_extensions

df = spark.createDataFrame([("Alice", [1, 2, 3]), ("Bob", [4, 5]), ("Charlie", [6, 7, 8, 9])], ["name", "numbers"])
exploded_df = df.select("name", explode(df.numbers).alias("number"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1041

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.9.0](../../../general/release-notes/old-version-release-notes/sc-spark-python-release-notes/README.md)

Message: pyspark.sql.functions.explode_outer has a workaround

Category: Warning

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.explode_outer](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.explode_outer.html#pyspark.sql.functions.explode_outer) which has a workaround.

#### Scenario

**Input**

The example shows the use of the method **explode_outer** in a select call.

```python
df = spark.createDataFrame(
    [(1, ["foo", "bar"], {"x": 1.0}),
     (2, [], {}),
     (3, None, None)],
    ("id", "an_array", "a_map")
)

df.select("id", "an_array", explode_outer("a_map")).show()
```

**Output**

The tool adds the EWI `SPRKPY1041` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame(
    [(1, ["foo", "bar"], {"x": 1.0}),
     (2, [], {}),
     (3, None, None)],
    ("id", "an_array", "a_map")
)

#EWI: SPRKPY1041 => pyspark.sql.functions.explode_outer has a workaround, see documentation for more info
df.select("id", "an_array", explode_outer("a_map")).show()
```

**Recommended fix**

As a workaround, you can import the snowpark_extensions package, which contains a helper for the `explode_outer` function.

```python
import snowpark_extensions

df = spark.createDataFrame(
    [(1, ["foo", "bar"], {"x": 1.0}),
     (2, [], {}),
     (3, None, None)],
    ("id", "an_array", "a_map")
)

df.select("id", "an_array", explode_outer("a_map")).show()
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1042

Message: pyspark.sql.functions.posexplode has a workaround

Category: Warning

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.posexplode](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.posexplode.html?highlight=posexplode) which has a workaround.

#### Scenarios

There are a couple of scenarios that this method can handle depending on the type of column it is passed as a parameter, it can be a `list of values` or a `map/directory (keys/values)`.

##### Scenario 1

**Input**

Below is an example of the usage of `posexplode` passing as a parameter of a **list of values**.

```python
df = spark.createDataFrame(
    [Row(a=1,
         intlist=[1, 2, 3])])

df.select(posexplode(df.intlist)).collect()
```

**Output**

The tool adds the EWI `SPRKPY1042` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame(
    [Row(a=1,
         intlist=[100, 200, 300])])
#EWI: SPRKPY1042 => pyspark.sql.functions.posexplode has a workaround, see documentation for more info

df.select(posexplode(df.intlist)).show()
```

**Recommended fix**

For having the same behavior, use the method [functions.flatten](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.functions.flatten), drop extra columns, and rename index and value column names.

```python
df = spark.createDataFrame(
  [Row(a=1,
       intlist=[1, 2, 3])])

df.select(
    flatten(df.intlist))\
    .drop("DATA", "SEQ", "KEY", "PATH", "THIS")\
    .rename({"INDEX": "pos", "VALUE": "col"}).show()
```

##### Scenario 2

**Input**

Below is another example of the usage of `posexplode` passing as a parameter a **map/dictionary (keys/values)**

```python
df = spark.createDataFrame([
    [1, [1, 2, 3], {"Ashi Garami": "Single Leg X"}, "Kimura"],
    [2, [11, 22], {"Sankaku": "Triangle"}, "Coffee"]
],
schema=["idx", "lists", "maps", "strs"])

df.select(posexplode(df.maps)).show()
```

**Output**

The tool adds the EWI `SPRKPY1042` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame([
    [1, [1, 2, 3], {"Ashi Garami": "Single Leg X"}, "Kimura"],
    [2, [11, 22], {"Sankaku": "Triangle"}, "Coffee"]
],
schema=["idx", "lists", "maps", "strs"])
#EWI: SPRKPY1042 => pyspark.sql.functions.posexplode has a workaround, see documentation for more info

df.select(posexplode(df.maps)).show()
```

**Recommended fix**

As a workaround, you can use [functions.row_number](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/api/snowflake.snowpark.functions.row_number.html) to get the position and [functions.explode](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.explode) with the name of the field to get the value the key/value for dictionaries.

```python
df = spark.createDataFrame([
    [10, [1, 2, 3], {"Ashi Garami": "Single Leg X"}, "Kimura"],
    [11, [11, 22], {"Sankaku": "Triangle"}, "Coffee"]
],
    schema=["idx", "lists", "maps", "strs"])

window = Window.orderBy(col("idx").asc())

df.select(
    row_number().over(window).alias("pos"),
    explode(df.maps).alias("key", "value")).show()
```

**Note:** using row_number is not fully equivalent, because it starts with 1 (not zero as spark method)

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1043

Message: pyspark.sql.functions.posexplode_outer has a workaround

Category: Warning

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.posexplode_outer](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.posexplode_outer.html) which has a workaround.

#### Scenarios

There are a couple of scenarios that this method can handle depending on the type of column it is passed as a parameter, it can be a `list of values` or a `map/directory (keys/values)`.

##### Scenario 1

**Input**

Below is an example that shows the usage of `posexplode_outer` passing a **list of values**.

```python
df = spark.createDataFrame(
    [
        (1, ["foo", "bar"]),
        (2, []),
        (3, None)],
    ("id", "an_array"))

df.select("id", "an_array", posexplode_outer("an_array")).show()
```

**Output**

The tool adds the EWI `SPRKPY1043` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame(
    [
        (1, ["foo", "bar"]),
        (2, []),
        (3, None)],
    ("id", "an_array"))
#EWI: SPRKPY1043 => pyspark.sql.functions.posexplode_outer has a workaround, see documentation for more info

df.select("id", "an_array", posexplode_outer("an_array")).show()
```

**Recommended fix**

For having the same behavior, use the method [functions.flatten](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/api/snowflake.snowpark.functions.flatten) sending the `outer` parameter in True, drop extra columns, and rename index and value column names.

```python
df = spark.createDataFrame(
    [
        (1, ["foo", "bar"]),
        (2, []),
        (3, None)],
    ("id", "an_array"))

df.select(
    flatten(df.an_array, outer=True))\
    .drop("DATA", "SEQ", "KEY", "PATH", "THIS")\
    .rename({"INDEX": "pos", "VALUE": "col"}).show()
```

##### Scenario 2

**Input**

Below is another example of the usage of posexplode_outer passing a **map/dictionary (keys/values)**

```python
df = spark.createDataFrame(
    [
        (1, {"x": 1.0}),
        (2, {}),
        (3, None)],
    ("id", "a_map"))

df.select(posexplode_outer(df.a_map)).show()
```

**Output**

The tool adds the EWI `SPRKPY1043` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame(
    [
        (1, {"x": "Ashi Garami"}),
        (2, {}),
        (3, None)],
    ("id", "a_map"))
#EWI: SPRKPY1043 => pyspark.sql.functions.posexplode_outer has a workaround, see documentation for more info

df.select(posexplode_outer(df.a_map)).show()
```

**Recommended fix**

As a workaround, you can use [functions.row_number](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/api/snowflake.snowpark.functions.row_number.html) to get the position and [functions.explode_outer](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.explode_outer) with the name of the field to get the value of the key/value for dictionaries.

```python
df = spark.createDataFrame(
    [
        (1, {"x": "Ashi Garami"}),
        (2,  {}),
        (3, None)],
    ("id", "a_map"))

window = Window.orderBy(col("id").asc())

df.select(
    row_number().over(window).alias("pos"),
          explode_outer(df.a_map)).show()
```

**Note:** using row_number is not fully equivalent, because it starts with 1 (not zero as spark method)

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1044

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.4.0](../../../general/release-notes/old-version-release-notes/sc-spark-python-release-notes/README.md)

Message: pyspark.sql.functions.split has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.split](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.split.html) which has a workaround.

#### Scenarios

There are a couple of scenarios depending on the amount of parameters passed to the method.

##### Scenario 1

**Input**

Below is an example when the function `split` has just the *str* and *pattern* parameters

```python
F.split('col', '\\|')
```

**Output**

The tool shows the EWI `SPRKPY1044` indicating there is a workaround.

```python
#EWI: SPRKPY1044 => pyspark.sql.functions.split has a workaround, see the documentation for more info
F.split('col', '\\|')
```

**Recommended fix**

As a workaround, you can call the function [snowflake.snowpark.functions.lit](https://docs.snowflake.com/ko/developer-guide/snowpark/reference/python/api/snowflake.snowpark.functions.lit.html) with the pattern parameter and send it into the split.

```python
F.split('col', lit('\\|'))
## the result of lit will be sent to the split function
```

### Scenario 2

**Input**

Below is another example when the function `split` has the *str, pattern, and limit* parameters.

```python
F.split('col', '\\|', 2)
```

**Output**

The tool shows the EWI `SPRKPY1044` indicating there is a workaround.

```python
#EWI: SPRKPY1044 => pyspark.sql.functions.split has a workaround, see the documentation for more info
F.split('col', '\\|', 2)
```

**Recommended fix**

This specific scenario is not supported.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1045

Message: pyspark.sql.functions.map_values has a workaround

Category: Warning.

### Description

This function is used to extract the list of values from a column that contains a **map/dictionary (keys/values)**.

The issue appears when the tool detects the usage of [pyspark.sql.functions.map_values](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.map_values.html) which has a workaround.

#### Scenario

**Input**

Below is an example of the usage of the method `map_values`.

```python
df = spark.createDataFrame(
    [(1, {'Apple': 'Fruit', 'Potato': 'Vegetable'})],
    ("id", "a_map"))

df.select(map_values("a_map")).show()
```

**Output**

The tool adds the EWI `SPRKPY1045` indicating that a workaround can be implemented.

```python
df = spark.createDataFrame(
    [(1, {'Apple': 'Fruit', 'Potato': 'Vegetable'})],
    ("id", "a_map"))
#EWI: SPRKPY1045 => pyspark.sql.functions.map_values has a workaround, see documentation for more info

df.select(map_values("a_map")).show()
```

**Recommended fix**

As a workaround, you can create an [udf](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udf) to get the values for a column. The below example shows how to create the udf, then assign it to `F.map_values`, and then make use of it.

```python
from snowflake.snowpark import functions as F
from snowflake.snowpark.types import ArrayType, MapType

map_values_udf=None

def map_values(map):
    global map_values_udf
    if not map_values_udf:
        def _map_values(map: dict)->list:
            return list(map.values())
        map_values_udf = F.udf(_map_values,return_type=ArrayType(),input_types=[MapType()],name="map_values",is_permanent=False,replace=True)
    return map_values_udf(map)

F.map_values = map_values

df.select(map_values(colDict))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1046

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.1.22](../../../general/release-notes/README.md)

Message: pyspark.sql.functions.monotonically_increasing_id has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.monotonically_increasing_id](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.monotonically_increasing_id.html) which has a workaround.

#### Scenario

**Input**

Below is an example of the usage of the method `monotonically_increasing_id`.

```python
from pyspark.sql import functions as F

spark.range(0, 10, 1, 2).select(F.monotonically_increasing_id()).show()
```

**Output**

The tool adds the EWI `SPRKPY1046` indicating that a workaround can be implemented.

```python
from pyspark.sql import functions as F
#EWI: SPRKPY1046 => pyspark.sql.functions.monotonically_increasing_id has a workaround, see documentation for more info
spark.range(0, 10, 1, 2).select(F.monotonically_increasing_id()).show()
```

**Recommended fix**

Update the tool version.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1047

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.6.0](../../../general/release-notes/README.md)

### Description

This issue appears when the tool detects the usage of [pyspark.context.SparkContext.setLogLevel](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.setLogLevel.html?highlight=pyspark%20context%20sparkcontext%20setloglevel#pyspark.SparkContext.setLogLevel) which has a workaround.

#### Scenario

**Input**

Below is an example of the usage of the method `setLogLevel`.

```python
sparkSession.sparkContext.setLogLevel("WARN")
```

**Output**

The tool adds the EWI `SPRKPY1047` indicating that a workaround can be implemented.

```python
#EWI: SPRKPY1047 => pyspark.context.SparkContext.setLogLevel has a workaround, see documentation for more info
sparkSession.sparkContext.setLogLevel("WARN")
```

**Recommended fix**

Replace the `setLogLevel` function usage with `logging.basicConfig` that provides a set of convenience functions for simple logging usage. In order to use it, we need to import two modules, “logging” and “sys”, and the level constant should be replaced using the “Level equivalent table”:

```python
import logging
import sys
logging.basicConfig(stream=sys.stdout, level=logging.WARNING)
```

* Level equivalent table

| Level source parameter | Level target parameter |
| --- | --- |
| “ALL” | *<mark style=”color:red;”>\*\*This has no equivalent\*\*</mark>* |
| “DEBUG” | logging.DEBUG |
| “ERROR” | logging.ERROR |
| “FATAL” | logging.CRITICAL |
| “INFO” | logging.INFO |
| “OFF” | logging.NOTSET |
| “TRACE” | *<mark style=”color:red;”>\*\*This has no equivalent\*\*</mark>* |
| “WARN” | logging.WARNING |

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1048

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.4.0](../../../general/release-notes/README.md)

Message: pyspark.sql.session.SparkSession.conf has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.session.SparkSession.conf](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.conf.html) which has a workaround.

#### Scenario

**Input**

Below is an example of how to set a configuration into the property `conf` .

```python
spark.conf.set("spark.sql.crossJoin.enabled", "true")
```

**Output**

The tool adds the EWI `SPRKPY1048` indicating that a workaround can be implemented.

```python
#EWI: SPRKPY1048 => pyspark.sql.session.SparkSession.conf has a workaround, see documentation for more info
spark.conf.set("spark.sql.crossJoin.enabled", "true")
```

**Recommended fix**

SparkSession.conf is used to pass some specific settings only used by Pyspark and doesn’t apply to Snowpark. You can remove or comment on the code

```python
#spark.conf.set("spark.sql.crossJoin.enabled", "true")
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1049

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.1.9](../../../general/release-notes/README.md)

Message: pyspark.sql.session.SparkSession.sparkContext has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.session.SparkSession.sparkContext](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.sparkContext.html) which has a workaround.

#### Scenario

**Input**

Below is an example that creates a spark session and then uses the `SparkContext` property to print the appName.

```python
print("APP Name :"+spark.sparkContext.appName())
```

**Output**

The tool adds the EWI `SPRKPY1049` indicating that a workaround can be implemented.

```python
#EWI: SPRKPY1049 => pyspark.sql.session.SparkSession.sparkContext has a workaround, see documentation for more info
print("APP Name :"+spark.sparkContext.appName())
```

**Recommended fix**

SparkContext is not supported in SnowPark but you can access the methods and properties from SparkContext directly from the Session instance.

```python
## Pyspark
print("APP Name :"+spark.sparkContext.appName())
can be used in SnowPark removing the sparkContext as:
#Manual adjustment in SnowPark
print("APP Name :"+spark.appName());
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1050

Message: pyspark.conf.SparkConf.set has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.conf.SparkConf.set](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkConf.set.html) which has a workaround.

#### Scenario

**Input**

Below is an example that sets a variable using `conf.set`.

```python
conf = SparkConf().setAppName('my_app')

conf.set("spark.storage.memoryFraction", "0.5")
```

**Output**

The tool adds the EWI `SPRKPY1050` indicating that a workaround can be implemented.

```python
conf = SparkConf().setAppName('my_app')

#EWI: SPRKPY1050 => pyspark.conf.SparkConf.set has a workaround, see documentation for more info
conf.set("spark.storage.memoryFraction", "0.5")
```

**Recommended fix**

SparkConf.set is used to set a configuration setting only used by Pyspark and doesn’t apply to Snowpark. You can remove or comment on the code

```python
#conf.set("spark.storage.memoryFraction", "0.5")
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1051

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.4.0](../../../general/release-notes/README.md)

Message: pyspark.sql.session.SparkSession.Builder.master has a workaround

Category: Warning.

### Description

This issue appears when the tool detects [pyspark.sql.session.SparkSession.Builder.master](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.builder.master.html) usage which has a workaround.

#### Scenario

**Input**

Below is an example of the usage of the method `builder.master` to set the Spark Master URL to connect to local using 1 core.

```python
spark = SparkSession.builder.master("local[1]")
```

**Output**

The tool adds the EWI `SPRKPY1051` indicating that a workaround can be implemented.

```python
#EWI: SPRKPY1051 => pyspark.sql.session.SparkSession.Builder.master has a workaround, see documentation for more info
spark = Session.builder.master("local[1]")
```

**Recommended fix**

`pyspark.sql.session.SparkSession.Builder.master` is used to set up a Spark Cluster. Snowpark doesn’t use Spark Clusters so you can remove or comment the code.

```python
## spark = Session.builder.master("local[1]")
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1052

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.8.0](../../../general/release-notes/README.md)

Message: pyspark.sql.session.SparkSession.Builder.enableHiveSupport has a workaround

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.session.SparkSession.Builder.enableHiveSupport](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.SparkSession.builder.enableHiveSupport.html) which has a workaround.

#### Scenario

**Input**

Below is an example that configures the SparkSession and enables the hive support using the method `enableHiveSupport`.

```python
spark = Session.builder.appName("Merge_target_table")\
        .config("spark.port.maxRetries","100") \
        .enableHiveSupport().getOrCreate()
```

**Output**

The tool adds the EWI `SPRKPY1052` indicating that a workaround can be implemented.

```python
#EWI: SPRKPY1052 => pyspark.sql.session.SparkSession.Builder.enableHiveSupport has a workaround, see documentation for more info
spark = Session.builder.appName("Merge_target_table")\
        .config("spark.port.maxRetries","100") \
        .enableHiveSupport().getOrCreate()
```

**Recommended fix**

Remove the use of `enableHiveSupport` function because it is not needed in Snowpark.

```python
spark = Session.builder.appName("Merge_target_table")\
        .config("spark.port.maxRetries","100") \
        .getOrCreate()
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1053

Message: An error occurred when extracting the dbc files.

Category: Warning.

### Description

This issue appears when a dbc file cannot be extracted. This warning could be caused by one or more of the following reasons: Too heavy, inaccessible, read-only, etc.

#### Additional recommendations

* As a workaround, you can check the size of the file if it is too heavy to be processed. Also, analyze whether the tool can access it to avoid any access issues.
* For more support, you can email us at [snowconvert-info@snowflake.com](mailto:snowconvert-info%40snowflake.com). If you have a contract for support with Snowflake, reach out to your sales engineer and they can direct your support needs.

## SPRKPY1080

Message: The value of SparkContext is replaced with ‘session’ variable.

Category: Warning

### Description

Spark context is stored into a variable called session that creates a Snowpark Session.

#### Scenario

**Input**

This snippet describes a SparkContext

```python
## Input Code
from pyspark import SparkContext
from pyspark.sql import SparkSession

def example1():

    sc = SparkContext("local[*]", "TestApp")

    sc.setLogLevel("ALL")
    sc.setLogLevel("DEBUG")
```

**Output**

In this output code SMA has replaced the PySpark.SparkContext by a SparkSession , Note that SMA also add a template to replace the connection in the “connection.json” file and then load this configuration on the connection_parameter variable.

```python
## Output Code
import logging
import sys
import json
from snowflake.snowpark import Session
from snowflake.snowpark import Session

def example1():
    jsonFile = open("connection.json")
    connection_parameter = json.load(jsonFile)
    jsonFile.close()
    #EWI: SPRKPY1080 => The value of SparkContext is replaced with 'session' variable.
    sc = Session.builder.configs(connection_parameter).getOrCreate()
    sc.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
    logging.basicConfig(stream = sys.stdout, level = logging.NOTSET)
    logging.basicConfig(stream = sys.stdout, level = logging.DEBUG)
```

**Recommended fix**

The configuration file “connection.json” must be updated with the required connection information:

```json
{
  "user": "my_user",
  "password": "my_password",
  "account": "my_account",
  "role": "my_role",
  "warehouse": "my_warehouse",
  "database": "my_database",
  "schema": "my_schema"
}
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1054

Message: pyspark.sql.readwriter.DataFrameReader.format is not supported.

Category: Warning.

### Description

This issue appears when the [pyspark.sql.readwriter.DataFrameReader.format](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.format.html) has an argument that is not supported by Snowpark.

#### Scenarios

There are some scenarios depending on the type of format you are trying to load. It can be a `supported` , or `non-supported` format.

##### Scenario 1

**Input**

The tool analyzes the type of format that is trying to load, the supported formats are:

* Csv
* JSON
* Parquet
* Orc

The below example shows how the tool transforms the `format` method when passing a `Csv` value.

```python
from pyspark.sql import SparkSession
spark = SparkSession.builder.getOrCreate()

df1 = spark.read.format('csv').load('/path/to/file')
```

**Output**

The tool transforms the `format` method into a `Csv` method call.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()

df1 = spark.read.csv('/path/to/file')
```

**Recommended fix**

In this case, the tool does not show the EWI, meaning there is no fix necessary.

##### Scenario 2

**Input**

The below example shows how the tool transforms the `format` method when passing a `Jdbc` value.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()

df2 = spark.read.format('jdbc') \
    .option("driver", "com.mysql.cj.jdbc.Driver") \
    .option("url", "jdbc:mysql://localhost:3306/emp") \
    .option("dbtable", "employee") \
    .option("user", "root") \
    .option("password", "root") \
    .load()
```

**Output**

The tool shows the EWI `SPRKPY1054` indicating that the value “jdbc” is not supported.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()

#EWI: SPRKPY1054 => pyspark.sql.readwriter.DataFrameReader.format with argument value "jdbc" is not supported.
#EWI: SPRKPY1002 => pyspark.sql.readwriter.DataFrameReader.load is not supported

df2 = spark.read.format('jdbc') \
    .option("driver", "com.mysql.cj.jdbc.Driver") \
    .option("url", "jdbc:mysql://localhost:3306/emp") \
    .option("dbtable", "employee") \
    .option("user", "root") \
    .option("password", "root") \
    .load()
```

**Recommended fix**

For the `not supported` scenarios, there is no specific fix since it depends on the files that are trying to be read.

##### Scenario 3

**Input**

The below example shows how the tool transforms the `format` method when passing a `CSV`, but using a variable instead.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()

myFormat = 'csv'
df3 = spark.read.format(myFormat).load('/path/to/file')
```

**Output**

Since the tool can not determine the value of the variable in runtime, shows the EWI `SPRKPY1054` indicating that the value “” is not supported.

```python
from snowflake.snowpark import Session
spark = Session.builder.getOrCreate()

myFormat = 'csv'
#EWI: SPRKPY1054 => pyspark.sql.readwriter.DataFrameReader.format with argument value "" is not supported.
#EWI: SPRKPY1002 => pyspark.sql.readwriter.DataFrameReader.load is not supported
df3 = spark.read.format(myFormat).load('/path/to/file')
```

**Recommended fix**

As a workaround, you can check the value of the variable and add it as a string to the `format` call.

#### Additional recommendations

* The Snowpark location only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).
* The documentation of methods supported by Snowpark can be found in the [documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.20.0/snowpark/api/snowflake.snowpark.DataFrameReader#snowflake.snowpark.DataFrameReader)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1055

Message: pyspark.sql.readwriter.DataFrameReader.option key value is not supported.

Category: Warning.

### Description

This issue appears when the `pyspark.sql.readwriter.DataFrameReader.option` key value is not supported by SnowFlake.

The tool analyzes the option call parameters and depends on the method (CSV or JSON or PARQUET) the key value might have or not have an equivalent in Snowpark, if all the parameters have an equivalent, the tool does not add the EWI, and it replaces the key value for his equivalent, otherwise, the tool adds the EWI.

**List of equivalences:**

* Equivalences for CSV:

| Spark option keys | Snowpark Equivalences |
| --- | --- |
| sep | FIELD_DELIMITER |
| header | PARSE_HEADER |
| lineSep | RECORD_DELIMITER |
| pathGlobFilter | PATTERN |
| quote | FIELD_OPTIONALLY_ENCLOSED_BY |
| nullValue | NULL_IF |
| dateFormat | DATE_FORMAT |
| timestampFormat | TIMESTAMP_FORMAT |
| inferSchema | INFER_SCHEMA |
| delimiter | FIELD_DELIMITER |

* Equivalences for JSON:

| Spark option keys | Snowpark Equivalences |
| --- | --- |
| dateFormat | DATE_FORMAT |
| timestampFormat | TIMESTAMP_FORMAT |
| pathGlobFilter | PATTERN |

* Equivalences for PARQUET:

| Spark option keys | Snowpark Equivalences |
| --- | --- |
| pathGlobFilter | PATTERN |

Any other key option that’s not in one of the tables above, are not supported or doesn’t have an equivalent in Snowpark. If that’s the case, the tool adds the EWI with the parameter information and removes it from the chain.

#### Scenarios

The below scenarios apply for CSV, JSON, and PARQUET.

There are a couple of scenarios depending on the value of the key used in the `option` method.

##### Scenario 1

**Input**

Below is an example of a `option` call using a `equivalent key`.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

## CSV example:
spark.read.option("header", True).csv(csv_file_path)

## Json example:
spark.read.option("dateFormat", "dd-MM-yyyy").json(json_file_path)

## Parquet example:
spark.read.option("pathGlobFilter", "*.parquet").parquet(parquet_file_path)
```

**Output**

The tool transforms the key with the correct equivalent.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()

## CSV example:
spark.read.option("PARSE_HEADER", True).csv(csv_file_path)

## Json example:
spark.read.option("DATE_FORMAT", "dd-MM-yyyy").json(json_file_path)

## Parquet example:
spark.read.option("PATTERN", "*.parquet").parquet(parquet_file_path)
```

**Recommended fix**

Since the tool transforms the value of the key, there is no necessary fix.

### Scenario 2

**Input**

Below is an example of a `option` call using a `non-equivalent key`.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.getOrCreate()

## CSV example:
spark.read.option("anotherKeyValue", "myVal").csv(csv_file_path)

## Json example:
spark.read.option("anotherKeyValue", "myVal").json(json_file_path)

## Parquet example:
spark.read.option("anotherKeyValue", "myVal").parquet(parquet_file_path)
```

**Output**

The tool adds the EWI `SPRKPY1055` indicating the key is not supported and removes the `option` call.

```python
from snowflake.snowpark import Session

spark = Session.builder.getOrCreate()

## CSV example:
#EWI: SPRKPY1055 => pyspark.sql.readwriter.DataFrameReader.option with key value "anotherKeyValue" is not supported.
spark.read.csv(csv_file_path)

## Json example:
#EWI: SPRKPY1055 => pyspark.sql.readwriter.DataFrameReader.option with key value "anotherKeyValue" is not supported.
spark.read.json(json_file_path)

## Parquet example:
#EWI: SPRKPY1055 => pyspark.sql.readwriter.DataFrameReader.option with key value "anotherKeyValue" is not supported.
spark.read.parquet(parquet_file_path)
```

**Recommended fix**

It is recommended to check the behavior after the transformation.

### Additional recommendations

* When non-equivalent parameters are present, it is recommended to check the behavior after the transformation.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1056

> **Warning:**
>
> This issue code has been **deprecated**

Message: pyspark.sql.readwriter.DataFrameReader.option argument _\*\*<argument_name>\*\*_ is not a literal and can’t be evaluated

Category: Warning

### Description

This issue appears when the argument’s key or value of the [pyspark.sql.readwriter.DataFrameReader.option](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.option.html) function is not a literal value (for example a variable). The SMA does a static analysis of your source code, and therefore it is not possible to evaluate the content of the argument.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.readwriter.DataFrameReader.option` function that generates this EWI.

```python
my_value = ...
my_option = ...

df1 = spark.read.option("dateFormat", my_value).format("csv").load('filename.csv')
df2 = spark.read.option(my_option, "false").format("csv").load('filename.csv')
```

**Output**

The SMA adds the EWI `SPRKPY1056` to the output code to let you know that the argument of this function is not a literal value, and therefore it could not be evaluated by the SMA.

```python
my_value = ...
my_option = ...

#EWI: SPRKPY1056 => pyspark.sql.readwriter.DataFrameReader.option argument "dateFormat" is not a literal and can't be evaluated
df1 = spark.read.option("dateFormat", my_value).format("csv").load('filename.csv')
#EWI: SPRKPY1056 => pyspark.sql.readwriter.DataFrameReader.option argument key is not a literal and can't be evaluated
df2 = spark.read.option(my_option, "false").format("csv").load('filename.csv')
```

**Recommended fix**

Even though the SMA was unable to evaluate the argument, it does not mean that it is not supported by Snowpark. Please make sure that the value of the argument is valid and equivalent in Snowpark by checking the [documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameReader.option).

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1057

> **Warning:**
>
> This Issue Code has been **deprecated** since [Spark Conversion Core Version 4.8.0](../../../general/release-notes/README.md)

Message: [PySpark Dataframe Option](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.option.html#pyspark.sql.DataFrameReader.option) argument contains a value that is not a literal, therefore cannot be evaluated

Category: Warning.

### Description

This issue code is deprecated. If you are using an older version, please upgrade to the latest.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1058

Message: < method > with < key > Platform specific key is not supported.

Category: ConversionError

### Description

The `get` and `set` methods from [pyspark.sql.conf.RuntimeConfig](https://spark.apache.org/docs/3.5.3/api/python/reference/pyspark.sql/api/pyspark.sql.conf.RuntimeConfig.html#pyspark.sql.conf.RuntimeConfig) are not supported with a Platform specific key.

#### Scenarios

Not all usages of `get` or `set` methods are going to have an EWI in the output code. This EWI appears when the tool detects the usage of these methods with a Platform specific key which is not supported.

##### Scenario 1

**Input**

Below is an example of the `get` or `set` methods with supported keys in Snowpark.

```python
session.conf.set("use_constant_subquery_alias", False)
spark.conf.set("sql_simplifier_enabled", True)

session.conf.get("use_constant_subquery_alias")
session.conf.get("use_constant_subquery_alias")
```

**Output**

Since the keys are supported in Snowpark the tool does not add the EWI on the output code.

```python
session.conf.set("use_constant_subquery_alias", True)
session.conf.set("sql_simplifier_enabled", False)

session.conf.get("use_constant_subquery_alias")
session.conf.get("sql_simplifier_enabled")
```

**Recommended fix**

There is no recommended fix for this scenario.

##### Scenario 2

**Input**

Below is an example using not supported keys.

```python
data =
    [
      ("John", 30, "New York"),
      ("Jane", 25, "San Francisco")
    ]

session.conf.set("spark.sql.shuffle.partitions", "50")
spark.conf.set("spark.yarn.am.memory", "1g")

session.conf.get("spark.sql.shuffle.partitions")
session = spark.conf.get("spark.yarn.am.memory")

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
```

**Output**

The tool adds this EWI `SPRKPY1058` on the output code to let you know that these methods are not supported with a Platform specific key.

```python
data =
    [
      ("John", 30, "New York"),
      ("Jane", 25, "San Francisco")
    ]

#EWI: SPRKPY1058 => pyspark.sql.conf.RuntimeConfig.set method with this "spark.sql.shuffle.partitions" Platform specific key is not supported.
spark.conf.set("spark.sql.shuffle.partitions", "50")
#EWI: SPRKPY1058 => pyspark.sql.conf.RuntimeConfig.set method with this "spark.yarn.am.memory" Platform specific key is not supported.
spark.conf.set("spark.yarn.am.memory", "1g")

#EWI: SPRKPY1058 => pyspark.sql.conf.RuntimeConfig.get method with this "spark.sql.shuffle.partitions" Platform specific key is not supported.
spark.conf.get("spark.sql.shuffle.partitions")
#EWI: SPRKPY1058 => pyspark.sql.conf.RuntimeConfig.get method with this "spark.yarn.am.memory" Platform specific key is not supported.
spark.conf.get("spark.yarn.am.memory")

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
```

**Recommended fix**

The recommended fix is to remove these methods.

```python
data =
    [
      ("John", 30, "New York"),
      ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1059

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.45.1](../../../general/release-notes/README.md)

Message: [pyspark.storagelevel.StorageLevel](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.StorageLevel.html) has a workaround, see documentation.

Category: Warning

### Description

Currently, the use of StorageLevel is not required in Snowpark since [Snowflake controls the storage](https://docs.snowflake.com/en/user-guide/intro-key-concepts#database-storage). For more information, you can refer to the EWI SPRKPY1072

#### Additional recommendations

* Upgrade your application to the latest version.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1060

Message: The authentication mechanism is connection.json (template provided).

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.conf.SparkConf](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkConf.html).

#### Scenario

**Input**

Since the authentication mechanism is different in Snowpark, the tool removes the usages and creates a **connection configuration file (connection.json)** instead.

```python
from pyspark import SparkConf

my_conf = SparkConf(loadDefaults=True)
```

**Output**

The tool adds the EWI `SPRKPY1060` indicating that the authentication mechanism is different.

```python
#EWI: SPRKPY1002 => pyspark.conf.SparkConf is not supported
#EWI: SPRKPY1060 => The authentication mechanism is connection.json (template provided).
#my_conf = Session.builder.configs(connection_parameter).getOrCreate()

my_conf = None
```

**Recommended fix**

To create a connection it is necessary that you fill in the information in the `connection.json` file.

```python
{
  "user": "<USER>",
  "password": "<PASSWORD>",
  "account": "<ACCOUNT>",
  "role": "<ROLE>",
  "warehouse": "<WAREHOUSE>",
  "database": "<DATABASE>",
  "schema": "<SCHEMA>"
}
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1061

Message: Snowpark does not support unix_timestamp functions

Category: Warning

### Description

In Snowpark, the first parameter is mandatory; the issue appears when the tool detects the usage of [pyspark.sql.functions.unix_timestamp](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.unix_timestamp.html) with no parameters.

#### Scenario

**Input**

Below an example that calls the `unix_timestamp` method without parameters.

```python
data = [["2015-04-08", "10"],["2015-04-10", "15"]]

df = spark.createDataFrame(data, ['dt', 'val'])
df.select(unix_timestamp()).show()
```

**Output**

The Snowpark signature for this function `unix_timestamp(e: ColumnOrName, fmt: Optional["Column"] = None)`, as you can notice the first parameter it’s required.

The tool adds this EWI `SPRKPY1061` to let you know that function unix_timestamp with no parameters it’s not supported in Snowpark.

```python
data = [["2015-04-08", "10"],["2015-04-10", "15"]]

df = spark.createDataFrame(data, ['dt', 'val'])
#EWI: SPRKPY1061 => Snowpark does not support unix_timestamp functions with no parameters. See documentation for more info.
df.select(unix_timestamp()).show()
```

**Recommended fix**

As a workaround, you can add at least the name or column of the timestamp string.

```python
data = [["2015-04-08", "10"],["2015-04-10", "15"]]

df = spark.createDataFrame(data, ["dt", "val"])
df.select(unix_timestamp("dt")).show()
```

#### Additional recommendations

* You also can add the [current_timestamp()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.current_timestamp) as the first parameter.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1062

Message: Snowpark does not support GroupedData.pivot without parameter “values”.

Category: Warning

### Description

This issue appears when the SMA detects the usage of the [pyspark.sql.group.GroupedData.pivot](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.GroupedData.pivot.html) function without the “values” parameter *(the list of values to pivot on)*.

At the moment, the Snowpark Python pivot function requires you to explicitly specify the list of distinct values to pivot on.

#### Scenarios

##### Scenario 1

**Input**

The SMA detects an expression that matches the pattern `dataFrame.groupBy("columnX").pivot("columnY")` and the pivot does not have the **values** parameter.

```python
df.groupBy("date").pivot("category").sum("amount")
```

**Output**

The SMA adds an EWI message indicating that the pivot function without the “values” parameter is not supported.

In addition, it will add as a second parameter of the pivot function a list comprehension that calculates the list of values that will be translated into columns. Keep in mind that this operation is not efficient for large datasets, and it is advisable to indicate the values explicitly.

```python
#EWI: SPRKPY1062 => pyspark.sql.group.GroupedData.pivot without parameter 'values' is not supported. See documentation for more info.
df.groupBy("date").pivot("category", [v[0] for v in df.select("category").distinct().limit(10000).collect()]]).sum("amount")
```

**Recommended fix**

For this scenario the SMA add a second parameter of the pivot function a list comprehension that calculates the list of values that will be translated into columns, but you can a list of distinct values to pivot on, as follows:

```python
df = spark.createDataFrame([
      Row(category="Client_ID", date=2012, amount=10000),
      Row(category="Client_name",   date=2012, amount=20000)
  ])

df.groupBy("date").pivot("category", ["dotNET", "Java"]).sum("amount")
```

##### Scenario 2

**Input**

The SMA couldn’t detect an expression that matches the pattern `dataFrame.groupBy("columnX").pivot("columnY")` and the pivot does not have the **values** parameter.

```python
df1.union(df2).groupBy("date").pivot("category").sum("amount")
```

**Output**

The SMA adds an EWI message indicating that the pivot function without the “values” parameter is not supported.

```python
#EWI: SPRKPY1062 => pyspark.sql.group.GroupedData.pivot without parameter 'values' is not supported. See documentation for more info.
df1.union(df2).groupBy("date").pivot("category").sum("amount")
```

**Recommended fix**

Add a list of distinct values to pivot on, as follows:

```python
df = spark.createDataFrame([
      Row(course="dotNET", year=2012, earnings=10000),
      Row(course="Java",   year=2012, earnings=20000)
  ])

df.groupBy("year").pivot("course", ["dotNET", "Java"]).sum("earnings").show()
```

#### Additional recommendations

* Calculating the list of distinct values to pivot on is not an efficient operation on large datasets and could become a blocking call. Please consider indicating the list of distinct values to pivot on explicitly.
* If you don’t want to specify the list of distinct values to pivot on explicitly (not advisable), you can add the following code as the second argument of the pivot function to infer the values at runtime\*

```python
[v[0] for v in <df>.select(<column>).distinct().limit(<count>).collect()]]
```

\*\*\*\*Replace\*\*\* *:code:`<df>` with the corresponding DataFrame, with the column to pivot and with the number of rows to select.*

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1063

Message: pyspark.sql.pandas.functions.pandas_udf has workaround.

Category: Warning

### Description

This issue appears when the tool detects the usage of [pyspark.sql.pandas.functions.pandas_udf](https://spark.apache.org/docs/3.1.2/api/python/reference/api/pyspark.sql.functions.pandas_udf.html) which has a workaround.

#### Scenario

**Input**

The pandas_udf function is used to create a user defined functions that works with large amounts of data.

```python
@pandas_udf(schema, PandasUDFType.GROUPED_MAP)
def modify_df(pdf):
    return pd.DataFrame({'result': pdf['col1'] + pdf['col2'] + 1})
df = spark.createDataFrame([(1, 2), (3, 4), (1, 1)], ["col1", "col2"])
new_df = df.groupby().apply(modify_df)
```

**Output**

The SMA adds an EWI message indicating that the pandas_udf has a workaround.

```python
#EWI: SPRKPY1062 => pyspark.sql.pandas.functions.pandas_udf has a workaround, see documentation for more info
@pandas_udf(schema, PandasUDFType.GROUPED_MAP)

def modify_df(pdf):
    return pd.DataFrame({'result': pdf['col1'] + pdf['col2'] + 1})

df = spark.createDataFrame([(1, 2), (3, 4), (1, 1)], ["col1", "col2"])

new_df = df.groupby().apply(modify_df)
```

**Recommended fix**

Specify explicitly the parameters types as a new parameter `input_types`, and remove `functionType` parameter if applies. Created function must be called inside a select statement.

```python
@pandas_udf(
    return_type = schema,
    input_types = [PandasDataFrameType([IntegerType(), IntegerType()])]
)

def modify_df(pdf):
    return pd.DataFrame({'result': pdf['col1'] + pdf['col2'] + 1})

df = spark.createDataFrame([(1, 2), (3, 4), (1, 1)], ["col1", "col2"])

new_df = df.groupby().apply(modify_df) # You must modify function call to be a select and not an apply
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1064

Message: The **\*Spark element\*** does not apply since snowflake uses snowpipe mechanism instead.

Category: Warning

### Description

This issue appears when the tool detects the usage of any element from the pyspark.streaming library:

* [pyspark.streaming.DStream](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.streaming.DStream.html)
* [pyspark.streaming.StreamingContext](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.streaming.StreamingContext.html)
* pyspark.streaming.listener.StreamingListener.

#### Scenario

**Input**

Below is an example with one of the elements that trigger this EWI.

```python
from pyspark.streaming.listener import StreamingListener

var = StreamingListener.Java
var.mro()

df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])
df.show()
```

**Output**

The SMA adds the EWI `SPRKPY1064` on the output code to let you know that this function does not apply.

```python
#EWI: SPRKPY1064 => The element does not apply since snowflake uses snowpipe mechanism instead.

var = StreamingListener.Java
var.mro()

df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])
df.show()
```

**Recommended fix**

The SMA removes the import statement and adds the issue to the *Issues.csv* inventory, remove any usages of the Spark element.

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])
df.show()
```

#### Additional recommendations

* Check the documentation for [Snowpipe](https://docs.snowflake.com/en/user-guide/data-load-snowpipe-intro) to see how it fits to the current scenario.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1065

Message: The pyspark.context.SparkContext.broadcast does not apply since snowflake use data-clustering mechanism to compute the data.

Category: Warning

### Description

This issue appears when the tool detects the usage of element [pyspark.context.SparkContext.broadcast](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.broadcast.html), which is not necessary due to the use of [data-clustering](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions) of Snowflake.

**Input code**

In this example a broadcast variable is created, these variables allows data to be share more efficiently through all nodes.

```python
sc = SparkContext(conf=conf_spark)

mapping = {1: 10001, 2: 10002}

bc = sc.broadcast(mapping)
```

**Output code**

The SMA adds an EWI message indicating that the broadcast it’s not required.

```python
sc = conf_spark

mapping = {1: 10001, 2: 10002}
#EWI: SPRKPY1065 => The element does not apply since snowflake use data-clustering mechanism to compute the data.

bc = sc.broadcast(mapping)
```

**Recommended fix**

Remove any usages of pyspark.context.SparkContext.broadcast.

```python
sc = conf_spark

mapping = {1: 10001, 2: 10002}
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1066

Message: The Spark element does not apply since snowflake use micro-partitioning mechanism are created automatically.

Category: Warning

### Description

This issue appears when the tool detects the usage of elements related to partitions:

* [pyspark.sql.catalog.Catalog.recoverPartitions](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.Catalog.recoverPartitions.html)
* [pyspark.sql.dataframe.DataFrame.foreachPartition](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.foreachPartition.html)
* [pyspark.sql.dataframe.DataFrame.sortWithinPartitions](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.sortWithinPartitions.html)
* [pyspark.sql.functions.spark_partition_id](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.spark_partition_id.html)

Those elements do not apply due the use of [micro-partitions](https://docs.snowflake.com/en/user-guide/tables-clustering-micropartitions) of Snowflake.

**Input code**

In this example [sortWithinPartitions](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.sortWithinPartitions.html) it’s used to create a partition in a DataFrame sorted by the specified column.

```python
df = spark.createDataFrame([(2, "Alice"), (5, "Bob")], schema=["age", "name"])
df.sortWithinPartitions("age", ascending=False)
```

**Output code**

The SMA adds an EWI message indicating that Spark element is not required.

```python
df = spark.createDataFrame([(2, "Alice"), (5, "Bob")], schema=["age", "name"])
#EWI: SPRKPY1066 => The element does not apply since snowflake use micro-partitioning mechanism are created automatically.
df.sortWithinPartitions("age", ascending=False)
```

**Recommended fix**

Remove the usage of the element.

```python
df = spark.createDataFrame([(2, "Alice"), (5, "Bob")], schema=["age", "name"])
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1067

Message: The pyspark.sql.functions.split has parameters that are not supported in Snowpark.

Category: Warning

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.split](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.split.html) with more than two parameters or a regex pattern as a parameter; both cases are not supported.

#### Scenarios

##### Scenario 1

**Input code**

In this example the split function has more than two parameters.

```python
df.select(split(columnName, ",", 5))
```

**Output code**

The tool adds this EWI on the output code to let you know that this function is not supported when it has more than two parameters.

```python
#EWI: SPRKPY1067 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info.
df.select(split(columnName, ",", 5))
```

**Recommended fix**

Keep the split function with only two parameters.

```python
df.select(split(columnName, ","))
```

##### Scenario 2

**Input code**

In this example the split function has a regex pattern as a parameter.

```python
df.select(split(columnName, "^([\d]+-[\d]+-[\d])"))
```

**Output code**

The tool adds this EWI on the output code to let you know that this function is not supported when it has a regex pattern as a parameter.

```python
#EWI: SPRKPY1067 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info.
df.select(split(columnName, "^([\d]+-[\d]+-[\d])"))
```

**Recommended fix**

The spark signature for this method `functions.split(str: ColumnOrName, pattern: str, limit: int = - 1)` not exactly match with the method in Snowpark `functions.split(str: Union[Column, str], pattern: Union[Column, str])` so for now the scenario using regular expression does not have a recommended fix.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1068

Message: toPandas contains columns of type ArrayType that is not supported and has a workaround.

Category: Warning

### Description

[pyspark.sql.DataFrame.toPandas](https://spark.apache.org/docs/3.5.3/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.toPandas.html) doesn’t work properly If there are columns of type ArrayType. The workaround for these cases is converting those columns into a Python Dictionary by using json.loads method.

#### Scenario

**Input**

ToPandas returns the data of the original DataFrame as a Pandas DataFrame.

```python
sparkDF = spark.createDataFrame([
Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0))
])

pandasDF = sparkDF.toPandas()
```

**Output**

The tool adds this EWI to let you know that toPandas is not supported If there are columns of type ArrayType, but has workaround.

```python
sparkDF = spark.createDataFrame([
Row(a=1, b=2., c='string1', d=date(2000, 1, 1), e=datetime(2000, 1, 1, 12, 0)),
Row(a=2, b=3., c='string2', d=date(2000, 2, 1), e=datetime(2000, 1, 2, 12, 0))
])
#EWI: SPRKPY1068 => toPandas doesn't work properly If there are columns of type ArrayType. The workaround for these cases is converting those columns into a Python Dictionary by using json.loads method. example: df[colName] = json.loads(df[colName]).
pandasDF = sparkDF.toPandas()
```

**Recommended fix**

```python
pandas_df = sparkDF.toPandas()​
​
## check/convert all resulting fields from calling toPandas when they are of
## type ArrayType,
## they will be reasigned by converting them into a Python Dictionary
## using json.loads method​
​
for field in pandas_df.schema.fields:
    if isinstance(field.datatype, ArrayType):
        pandas_df[field.name] = pandas_df[field.name].apply(lambda x: json.loads(x) if x is not None else x)
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1069

Message: If partitionBy parameter is a list, Snowpark will throw an error.

Category: Warning

### Description

When there is a usage of [pyspark.sql.readwriter.DataFrameWriter.parquet](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.parquet.html) method where it comes to the parameter `partitionBy`, the tool shows the EWI.

This is because in Snowpark the [DataFrameWriter.parquet](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.parquet) only supports a `ColumnOrSqlExpr` as a partitionBy parameter.

#### Scenarios

##### Scenario 1

**Input code:**

For this scenario the partitionBy parameter is not a list.

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])

df.write.parquet(file_path, partitionBy="age")
```

**Output code:**

The tool adds the EWI `SPRKPY1069` to let you know that Snowpark throws an error if parameter is a list.

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])

#EWI: SPRKPY1069 => If partitionBy parameter is a list, Snowpark will throw and error.
df.write.parquet(file_path, partition_by = "age", format_type_options = dict(compression = "None"))
```

**Recommended fix**

There is not a recommended fix for this scenario because the tool always adds this EWI just in case the partitionBy parameter is a list. Remember that in Snowpark, only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])

stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
Session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {stage}').show()
Session.file.put(f"file:///path/to/data/file.parquet", f"@{stage}")

df.write.parquet(stage, partition_by = "age", format_type_options = dict(compression = "None"))
```

##### Scenario 2

**Input code:**

For this scenario the partitionBy parameter is a list.

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])

df.write.parquet(file_path, partitionBy=["age", "name"])
```

**Output code:**

The tool adds the EWI `SPRKPY1069` to let you know that Snowpark throws an error if parameter is a list.

```python
df = spark.createDataFrame([(25, "Alice", "150"), (30, "Bob", "350")], schema=["age", "name", "value"])

#EWI: SPRKPY1069 => If partitionBy parameter is a list, Snowpark will throw and error.
df.write.parquet(file_path, partition_by = ["age", "name"], format_type_options = dict(compression = "None"))
```

**Recommended fix**

If the value of the parameter is a `list`, then replace it with a `ColumnOrSqlExpr`.

```python
df.write.parquet(file_path, partition_by = sql_expr("age || name"), format_type_options = dict(compression = "None"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1070

Message: The `mode` argument is transformed to `overwrite`, check the variable value and set the corresponding bool value.

Category: Warning

### Description

When there is a usage of:

* [pyspark.sql.readwriter.DataFrameWriter.csv](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.csv.html)
* [pyspark.sql.readwriter.DataFrameWriter.json](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.json.html)
* [pyspark.sql.readwriter.DataFrameWriter.parquet](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.json.html)

The tool analyzes the parameter `mode` to determinate if the value is `overwrite`.

#### Scenarios

##### Scenario 1

**Input code**

For this scenario the tool detects that the mode parameter can set the corresponding bool value.

```python
df.write.csv(file_path, mode="overwrite")
```

**Output code:**

The SMA tool analyzes the mode parameter, determinate that the value is `overwrite` and set the corresponding bool value

```python
df.write.csv(file_path, format_type_options = dict(compression = "None"), overwrite = True)
```

**Recommended fix**

There is not a recommended fix for this scenario because the tool performed the corresponding transformation.

**Scenario 2:**

**Input code**

In this scenario the tool can not validate the value is `overwrite`.

```python
df.write.csv(file_path, mode=myVal)
```

**Output code:**

The SMA adds an EWI message indicating that the mode parameter was transformed to ‘overwrite’, but it’s also to let you know that it is better to check the variable value and set the correct bool value.

```python
#EWI: SPRKPY1070 => The 'mode' argument is transformed to 'overwrite', check the variable value and set the corresponding bool value.
df.write.csv(file_path, format_type_options = dict(compression = "None"), overwrite = myVal)
```

**Recommended fix**

Check for the value of the parameter `mode` and add the correct value for the parameter `overwrite`.

```python
df.write.csv(file_path, format_type_options = dict(compression = "None"), overwrite = True)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1071

Message: The function pyspark.rdd.RDD.getNumPartitions is not required in Snowpark. So, you should remove all references.

Category: Warning

### Description

This issue appears when the tool finds the use of the [pyspark.rdd.RDD.getNumPartitions](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.RDD.getNumPartitions.html) function. Snowflake uses micro-partitioning mechanism, so the use of this function is not required.

#### Scenario

**Input**

The getNumPartitions returns the quantity of partitions on a RDD.

```python
df = spark.createDataFrame([('2015-04-08',), ('5',), [Row(a=1, b="b")]], ['dt', 'num', 'row'])

print(df.getNumPartitions())
```

**Output**

The tool adds this EWI to let you know that getNumPartitions is not required.

```python
df = spark.createDataFrame([('2015-04-08',), ('5',), [Row(a=1, b="b")]], ['dt', 'num', 'row'])
#EWI: SPRKPY1071 => The getNumPartitions are not required in Snowpark. So, you should remove all references.

print(df.getNumPartitions())
```

**Recommended fix**

Remove all uses of this function.

```python
df = spark.createDataFrame([('2015-04-08',), ('5',), [Row(a=1, b="b")]], ['dt', 'num', 'row'])
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1072

Message: The use of StorageLevel is not required in Snowpark.

Category: Warning.

### Description

This issue appears when the tool finds the use of the [StorageLevel](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.StorageLevel.html) class, which works like “flags” to set the storage level. Since Snowflake controls the storage, the use of this function is not required.

#### Additional recommendations

* Remove all uses of this function.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1073

Message: pyspark.sql.functions.udf without parameters or return type parameter are not supported

Category: Warning.

### Description

This issue appears when the tool detects the usage of [pyspark.sql.functions.udf](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.functions.udf.html) as function or decorator and is not supported in two specifics cases, when it has no parameters or return type parameter.

#### Scenarios

##### Scenario 1

**Input**

In Pyspark you can create an User Defined Function without input or return type parameters:

```python
from pyspark.sql import SparkSession, DataFrameStatFunctions
from pyspark.sql.functions import col, udf

spark = SparkSession.builder.getOrCreate()
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)

my_udf = udf(lambda s: len(s))
df.withColumn('Len Value' ,my_udf(col('Value')) ).show()
```

**Output**

Snowpark requires the input and return types for Udf function. Because they are not provided and SMA cannot this parameters.

```python
from snowflake.snowpark import Session, DataFrameStatFunctions
from snowflake.snowpark.functions import col, udf

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)
#EWI: SPRKPY1073 => pyspark.sql.functions.udf function without the return type parameter is not supported. See documentation for more info.
my_udf = udf(lambda s: len(s))

df.withColumn('Len Value' ,my_udf(col('Value')) ).show()
```

**Recommended fix**

To fix this scenario is required to add the import for the returns types of the input and output, and then the parameters of return\*type and input_types[] on the [udf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udf) function _my_udf\*.

```python
from snowflake.snowpark import Session, DataFrameStatFunctions
from snowflake.snowpark.functions import col, udf
from snowflake.snowpark.types import IntegerType, StringType

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)

my_udf = udf(lambda s: len(s), return_type=IntegerType(), input_types=[StringType()])

df.with_column("result", my_udf(df.Value)).show()
```

##### Scenario 2

In PySpark you can use a @udf decorator without parameters

**Input**

```python
from pyspark.sql.functions import col, udf

spark = SparkSession.builder.getOrCreate()
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)

@udf()
def my_udf(str):
    return len(str)

df.withColumn('Len Value' ,my_udf(col('Value')) ).show()
```

**Output**

In Snowpark all the parameters of a [udf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udf) decorator are required.

```python
from snowflake.snowpark.functions import col, udf

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)

#EWI: SPRKPY1073 => pyspark.sql.functions.udf decorator without parameters is not supported. See documentation for more info.

@udf()
def my_udf(str):
    return len(str)

df.withColumn('Len Value' ,my_udf(col('Value')) ).show()
```

**Recommended fix**

To fix this scenario is required to add the import for the returns types of the input and output, and then the parameters of return_type and input_types[] on the [udf](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.functions.udf) @udf decorator.

```python
from snowflake.snowpark.functions import col, udf
from snowflake.snowpark.types import IntegerType, StringType

spark = Session.builder.getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})
data = [['Q1', 'Test 1'],
        ['Q2', 'Test 2'],
        ['Q3', 'Test 1'],
        ['Q4', 'Test 1']]

columns = ['Quadrant', 'Value']
df = spark.createDataFrame(data, columns)

@udf(return_type=IntegerType(), input_types=[StringType()])
def my_udf(str):
    return len(str)

df.withColumn('Len Value' ,my_udf(col('Value')) ).show()
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1074

Message: File has mixed indentation (spaces and tabs).

Category: Parsing error.

### Description

This issue appears when the tool detects the file has a mixed indentation. It means, file has a combination of spaces and tabs to indent code lines.

#### Scenario

**Input**

In Pyspark you can mix spaces and tabs for the identation level.

```python
def foo():
    x = 5 # spaces
    y = 6 # tab
```

**Output**

SMA cannot handle mixed indentation markers. When this is detected on a python code file SMA adds the EWI SPRKPY1074 on first line.

```python
## EWI: SPRKPY1074 => File has mixed indentation (spaces and tabs).
## This file was not converted, so it is expected to still have references to the Spark API
def foo():
    x = 5 # spaces
    y = 6 # tabs
```

**Recommended fix**

The solution is to make all the indentation symbols the same.

```python
def foo():
  x = 5 # tab
  y = 6 # tab
```

### Additional recommendations

* Useful indent tools [PEP-8](https://peps.python.org/pep-0008/) and [Reindent](https://pypi.org/project/reindent/).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1075

Category

Warning.

### Description

The parse_json does not apply schema validation, if you need to filter/validate based on schema you might need to introduce some logic.

### Example

**Input**

```python
df.select(from_json(df.value, Schema))
df.select(from_json(schema=Schema, col=df.value))
df.select(from_json(df.value, Schema, option))
```

**Output**

```python
#EWI: SPRKPY1075 => The parse_json does not apply schema validation, if you need to filter/validate based on schema you might need to introduce some logic.
df.select(parse_json(df.value))
#EWI: SPRKPY1075 => The parse_json does not apply schema validation, if you need to filter/validate based on schema you might need to introduce some logic.
df.select(parse_json(df.value))
#EWI: SPRKPY1075 => The parse_json does not apply schema validation, if you need to filter/validate based on schema you might need to introduce some logic.
df.select(parse_json(df.value))
```

For the function from_json the schema is not really passed for inference it is used for validation. See this examples:

```python
data = [
    ('{"name": "John", "age": 30, "city": "New York"}',),
    ('{"name": "Jane", "age": "25", "city": "San Francisco"}',)
]

df = spark.createDataFrame(data, ["json_str"])
```

**Example 1: Enforce Data Types and Change Column Names:**

```python
## Parse JSON column with schema
parsed_df = df.withColumn("parsed_json", from_json(col("json_str"), schema))

parsed_df.show(truncate=False)

## +------------------------------------------------------+---------------------------+
## |json_str                                              |parsed_json                |
## +------------------------------------------------------+---------------------------+
## |{"name": "John", "age": 30, "city": "New York"}       |{John, 30, New York}       |
## |{"name": "Jane", "age": "25", "city": "San Francisco"}|{Jane, null, San Francisco}|
## +------------------------------------------------------+---------------------------+
## notice that values outside of the schema were dropped and columns not matched are returned as null
```

**Example 2: Select Specific Columns:**

```python
## Define a schema with only the columns we want to use
partial_schema = StructType([
    StructField("name", StringType(), True),
    StructField("city", StringType(), True)
])

## Parse JSON column with partial schema
partial_df = df.withColumn("parsed_json", from_json(col("json_str"), partial_schema))

partial_df.show(truncate=False)

## +------------------------------------------------------+---------------------+
## |json_str                                              |parsed_json          |
## +------------------------------------------------------+---------------------+
## |{"name": "John", "age": 30, "city": "New York"}       |{John, New York}     |
## |{"name": "Jane", "age": "25", "city": "San Francisco"}|{Jane, San Francisco}|
## +------------------------------------------------------+---------------------+
## there is also an automatic filtering
```

### Recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). If you have a contract for support with Snowflake, reach out to your sales engineer and they can direct your support needs.
* Useful tools [PEP-8](https://peps.python.org/pep-0008/) and [Reindent](https://pypi.org/project/reindent/).

## SPRKPY1076

Message: Parameters in [pyspark.sql.readwriter.DataFrameReader](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.html) methods are not supported. This applies to CSV, JSON and PARQUET methods.

Category: Warning.

### Description

For the CSV, JSON and PARQUET methods on the [pyspark.sql.readwriter.DataFrameReader](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.html) object, the tool will analyze the parameters and add a transformation according to each case:

* All the parameters match their equivalent name in Snowpark: in this case, the tool will transform the parameter into a .option() call. For this case, the parameter won’t add this EWI.
* Some parameters do not match the equivalent in Snowpark: in this case, the tool will add this EWI with the parameter information and remove it from the method call.

**List of equivalences:**

* Equivalences for CSV:

| Spark keys | Snowpark Equivalences |
| --- | --- |
| sep | FIELD_DELIMITER |
| header | PARSE_HEADER |
| lineSep | RECORD_DELIMITER |
| pathGlobFilter | PATTERN |
| quote | FIELD_OPTIONALLY_ENCLOSED_BY |
| nullValue | NULL_IF |
| dateFormat | DATE_FORMAT |
| timestampFormat | TIMESTAMP_FORMAT |
| inferSchema | INFER_SCHEMA |
| delimiter | FIELD_DELIMITER |

* Equivalences for JSON:

| Spark keys | Snowpark Equivalences |
| --- | --- |
| dateFormat | DATE_FORMAT |
| timestampFormat | TIMESTAMP_FORMAT |
| pathGlobFilter | PATTERN |

* Equivalences for PARQUET:

| Spark keys | Snowpark Equivalences |
| --- | --- |
| pathGlobFilter | PATTERN |

#### Scenarios

##### Scenario 1

**Input**

For CVS here are some examples:

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName('myapp').getOrCreate()

spark.read.csv("path3", None,None,None,None,None,None,True).show()
```

**Output**

In the converted code the parameters are added as individual options to the cvs function

```python
from snowflake.snowpark import Session

spark = Session.builder.app_name('myapp', True).getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})

#EWI: SPRKPY1076 => Some of the included parameters are not supported in the csv function, the supported ones will be added into a option method.
spark.read.option("FIELD_DELIMITER", None).option("PARSE_HEADER", True).option("FIELD_OPTIONALLY_ENCLOSED_BY", None).csv("path3").show()
```

#### Scenario 2

**Input**

For JSON here are some example:

```python
from pyspark.sql import SparkSession
spark = SparkSession.builder.appName('myapp').getOrCreate()
spark.read.json("/myPath/jsonFile/", dateFormat='YYYY/MM/DD').show()
```

**Output**

In the converted code the parameters are added as individual options to the json function

```python
from snowflake.snowpark import Session
spark = Session.builder.app_name('myapp', True).getOrCreate()
#EWI: SPRKPY1076 => Some of the included parameters are not supported in the json function, the supported ones will be added into a option method.

spark.read.option("DATE_FORMAT", 'YYYY/MM/DD').json("/myPath/jsonFile/").show()
```

##### Scenario 3

**Input**

For PARQUET here are some examples:

```python
from pyspark.sql import SparkSession
spark = SparkSession.builder.appName('myapp').getOrCreate()

spark.read.parquet("/path/to/my/file.parquet", pathGlobFilter="*.parquet").show()
```

**Output**

In the converted code the parameters are added as individual options to the parquet function

```python
from snowflake.snowpark import Session

spark = Session.builder.app_name('myapp', True).getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":0,"minor":0,"patch":0},"attributes":{"language":"Python"}})

#EWI: SPRKPY1076 => Some of the included parameters are not supported in the parquet function, the supported ones will be added into a option method.
#EWI: SPRKPY1029 => The parquet function require adjustments, in Snowpark the parquet files needs to be located in an stage. See the documentation for more info.

spark.read.option("PATTERN", "*.parquet").parquet("/path/to/my/file.parquet")
```

#### Additional recommendations

* When non-equivalent parameters are present, it is recommended to check the behavior after the transformation.
* Also the documentation could be useful to find a better fit:

  + Options documentation for CSV:
    - [PySpark CSV Options](https://spark.apache.org/docs/latest/sql-data-sources-csv.html#data-source-option).
    - [Snowpark CSV Options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#type-csv).
  + Options documentation for JSON:
    - [PySpark JSON Options](https://spark.apache.org/docs/latest/sql-data-sources-json.html).
    - [Snowpark JSON Options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#type-json).
  + Options documentation for PARQUET:
    - [Pyspark PARQUET options](https://spark.apache.org/docs/latest/sql-data-sources-parquet.html#data-source-option).
    - [SnowPark PARQUET options.](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#type-parquet).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1077

Message: SQL embedded code cannot be processed.

Category: Warning.

### Description

This issue appears when the tool detects an SQL-embedded code that cannot be converted to Snowpark.

Check the SQL-embedded code section for more information.

#### Scenario

**Input**

In this example the SQL code is embedded on a variable called query that is used as parameter for the Pyspark.sql method.

```python
query = f"SELECT * from myTable"
spark.sql(query)
```

**Output**

SMA detects that the PySpark.sql parameter is a variable and not a SQL Code, so the EWI SPRKPY1077 message is added to the PySpark.sql line.

```python
query = f"SELECT * myTable"
#EWI: SPRKPY1077 => SQL embedded code cannot be processed.
spark.sql(query)
```

#### Additional recommendations

* For the transformation of SQL, this code must be directly inside as parameter of the method only as string values and without interpolation. Please check the SQL send to the PySpark.SQL function to validate it’s functionality on Snowflake.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1078

Message: The argument of the pyspark.context.SparkContext.setLogLevel function is not a literal value and therefore could not be evaluated

Category: Warning

### Description

This issue appears when the SMA detects the use of the [pyspark.context.SparkContext.setLogLevel](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.setLogLevel.html) function with an argument that is not a literal value, for example, when the argument is a variable.

The SMA does a static analysis of your source code and therefore it is not possible to evaluate the content of that argument and determine an equivalent in Snowpark.

#### Scenario

**Input**

In this example the logLevel is defined in the variable my_log_level, then my_log_level used as parameter by the setLogLevel method.

```python
my_log_level = "WARN"
sparkSession.sparkContext.setLogLevel(my_log_level)
```

**Output**

SMA is unable to evaluate the argument for the log level parameter, so the EWI SPRKPY1078 is added over the line of the transformed logging:

```python
my_log_level = "WARN"
#EWI: SPRKPY1078 => my_log_level is not a literal value and therefore could not be evaluated. Make sure the value of my_log_level is a valid level in Snowpark. Valid log levels are: logging.CRITICAL, logging.DEBUG, logging.ERROR, logging.INFO, logging.NOTSET, logging.WARNING
logging.basicConfig(stream = sys.stdout, level = my_log_level)
```

**Recommended fix**

Even though the SMA was unable to evaluate the argument, it will transform the `pyspark.context.SparkContext.setLogLevel` function into the Snowpark equivalent. Please make sure the value of the `level` argument in the generated output code is a valid and equivalent log level in Snowpark according to the table below:

| PySpark log level | Snowpark log level equivalent |
| --- | --- |
| ALL | logging.NOTSET |
| DEBUG | logging.DEBUG |
| ERROR | logging.ERROR |
| FATAL | logging.CRITICAL |
| INFO | logging.INFO |
| OFF | logging.WARNING |
| TRACE | logging.NOTSET |
| WARN | logging.WARNING |

Thus the recommended fix will looks like:

```python
my_log_level = logging.WARNING
logging.basicConfig(stream = sys.stdout, level = my_log_level)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1079

Message: The argument of the pyspark.context.SparkContext.setLogLevel function is not a valid PySpark log level

Category: Warning

### Description

This issue appears when the SMA detects the use of the [pyspark.context.SparkContext.setLogLevel](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.setLogLevel.html) function with an argument that is not a valid log level in PySpark, and therefore an equivalent could not be determined in Snowpark.

#### Scenario

**Input**

here the log level uses “INVALID_LOG_LEVEL” that is not a valid Pyspark log level.

```python
sparkSession.sparkContext.setLogLevel("INVALID_LOG_LEVEL")
```

**Output**

SMA can not recognize the log level “INVALID_LOG_LEVEL”, even though SMA makes the conversion the EWI SPRKPY1079 is added to indicate a possible problem.

```python
#EWI: SPRKPY1079 => INVALID_LOG_LEVEL is not a valid PySpark log level, therefore an equivalent could not be determined in Snowpark. Valid PySpark log levels are: ALL, DEBUG, ERROR, FATAL, INFO, OFF, TRACE, WARN
logging.basicConfig(stream = sys.stdout, level = logging.INVALID_LOG_LEVEL)
```

**Recommended fix**

Make sure that the log level used in the pyspark.context.SparkContext.setLogLevel function is a valid log level in [PySpark](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.setLogLevel.html) or in [Snowpark](https://docs.snowflake.com/en/developer-guide/snowpark/python/troubleshooting) and try again.

```python
logging.basicConfig(stream = sys.stdout, level = logging.DEBUG)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1081

This issue code has been **deprecated** since [Spark Conversion Core 4.12.0](../../../general/release-notes/README.md)

Message: pyspark.sql.readwriter.DataFrameWriter.partitionBy has a workaround.

Category: Warning

### Description

The [Pyspark.sql.readwriter.DataFrameWriter.partitionBy](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.partitionBy.html) function is not supported. The workaround is to use [Snowpark’s copy_into_location](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.copy_into_location) instead. See the documentation for more info.

#### Scenario

**Input**

This code will create a separate directories for each unique value in the `FIRST_NAME` column. The data is the same, but it’s going to be stored in different directories based on the column.

```python
df = session.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]], schema = ["FIRST_NAME", "LAST_NAME"])
df.write.partitionBy("FIRST_NAME").csv("/home/data")
```

This code will create a separate directories for each unique value in the `FIRST_NAME` column. The data is the same, but it’s going to be stored in different directories based on the column.

**Output code**

```python
df = session.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]], schema = ["FIRST_NAME", "LAST_NAME"])
#EWI: SPRKPY1081 => The partitionBy function is not supported, but you can instead use copy_into_location as workaround. See the documentation for more info.
df.write.partitionBy("FIRST_NAME").csv("/home/data", format_type_options = dict(compression = "None"))
```

**Recommended fix**

In Snowpark, [copy_into_location](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.copy_into_location) has a partition_by parameter that you can use instead of the partitionBy function, but it’s going to require some manual adjustments, as shown in the following example:

**Spark code:**

```python
df = session.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]], schema = ["FIRST_NAME", "LAST_NAME"])
df.write.partitionBy("FIRST_NAME").csv("/home/data")
```

**Snowpark code manually adjusted:**

```python
df = session.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]], schema = ["FIRST_NAME", "LAST_NAME"])
df.write.copy_into_location(location=temp_stage, partition_by=col("FIRST_NAME"), file_format_type="csv", format_type_options={"COMPRESSION": "NONE"}, header=True)
```

**copy_into_location** has the following parameters

* *location*: The Snowpark location only accepts cloud locations using an [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).
* _partition_by_: It can be a Column name or a SQL expression, so you will need to converted to a column or a SQL, using col or sql_expr.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1082

Message: The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.

Category: Warning

### Description

The [pyspark.sql.readwriter.DataFrameReader.load](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameReader.load.html) function is not supported. The workaround is to use Snowpark DataFrameReader methods instead.

#### Scenarios

The spark signature for this method `DataFrameReader.load(path, format, schema, **options)` does not exist in Snowpark. Therefore, any usage of the load function is going to have an EWI in the output code.

##### Scenario 1

**Input**

Below is an example that tries to load data from a `CSV` source.

```python
path_csv_file = "/path/to/file.csv"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.load(path_csv_file, "csv").show()
my_session.read.load(path_csv_file, "csv", schema=schemaParam).show()
my_session.read.load(path_csv_file, "csv", schema=schemaParam, lineSep="\r\n", dateFormat="YYYY/MM/DD").show()
```

**Output**

The SMA adds the EWI `SPRKPY1082` to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_csv_file = "/path/to/file.csv"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.

my_session.read.load(path_csv_file, "csv").show()
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_csv_file, "csv", schema=schemaParam).show()
#EWI: The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_csv_file, "csv", schema=schemaParam, lineSep="\r\n", dateFormat="YYYY/MM/DD").show()
```

**Recommended fix**

As a workaround, you can use [Snowpark DataFrameReader](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with `csv` method.
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and puts the file into it, then calls the `CSV` method.

```python
path_csv_file = "/path/to/file.csv"

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.csv", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.csv"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.csv(stage_file_path).show()
```

* Fixing `schema` parameter:

  + The schema can be set by using the [schema](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.schema) function as follows:

```python
schemaParam = StructType([
        StructField("name", StringType(), True),
        StructField("city", StringType(), True)
    ])

df = my_session.read.schema(schemaParam).csv(temp_stage)
```

* Fixing `options` parameter:

The [options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.options) between spark and snowpark are not the same, in this case `lineSep` and `dateFormat` are replaced with `RECORD_DELIMITER` and `DATE_FORMAT`, the **Additional recommendations** section has a table with all the Equivalences.

Below is an example that creates a dictionary with `RECORD_DELIMITER` and `DATE_FORMAT`, and calls the `options` method with that dictionary.

```python
optionsParam = {"RECORD_DELIMITER": "\r\n", "DATE_FORMAT": "YYYY/MM/DD"}
df = my_session.read.options(optionsParam).csv(stage)
```

### Scenario 2

**Input**

Below is an example that tries to load data from a `JSON` source.

```python
path_json_file = "/path/to/file.json"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.load(path_json_file, "json").show()
my_session.read.load(path_json_file, "json", schema=schemaParam).show()
my_session.read.load(path_json_file, "json", schema=schemaParam, dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3").show()
```

**Output**

The SMA adds the EWI `SPRKPY1082` to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_json_file = "/path/to/file.json"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.

my_session.read.load(path_json_file, "json").show()
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_json_file, "json", schema=schemaParam).show()
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_json_file, "json", schema=schemaParam, dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3").show()
```

**Recommended fix**

As a workaround, you can use [Snowpark DataFrameReader](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with `json` method
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and puts the file into it, then calls the `JSON` method.

```python
path_json_file = "/path/to/file.json"

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.json", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.json"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.json(stage_file_path).show()
```

* Fixing `schema` parameter:

  + The schema can be set by using the [schema](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.schema) function as follows:

```python
schemaParam = StructType([
        StructField("name", StringType(), True),
        StructField("city", StringType(), True)
    ])

df = my_session.read.schema(schemaParam).json(temp_stage)
```

* Fixing `options` parameter:

The [options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.options) between Spark and snowpark are not the same, in this case `dateFormat` and `timestampFormat` are replaced with `DATE_FORMAT` and `TIMESTAMP_FORMAT`, the **Additional recommendations** section has a table with all the Equivalences.

Below is an example that creates a dictionary with `DATE_FORMAT` and `TIMESTAMP_FORMAT`, and calls the `options` method with that dictionary.

```python
optionsParam = {"DATE_FORMAT": "YYYY/MM/DD", "TIMESTAMP_FORMAT": "YYYY-MM-DD HH24:MI:SS.FF3"}
df = Session.read.options(optionsParam).json(stage)
```

### Scenario 3

**Input**

Below is an example that tries to load data from a `PARQUET` source.

```python
path_parquet_file = "/path/to/file.parquet"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.load(path_parquet_file, "parquet").show()
my_session.read.load(path_parquet_file, "parquet", schema=schemaParam).show()
my_session.read.load(path_parquet_file, "parquet", schema=schemaParam, pathGlobFilter="*.parquet").show()
```

**Output**

The SMA adds the EWI `SPRKPY1082` to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_parquet_file = "/path/to/file.parquet"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.

my_session.read.load(path_parquet_file, "parquet").show()
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_parquet_file, "parquet", schema=schemaParam).show()
#EWI: SPRKPY1082 => The pyspark.sql.readwriter.DataFrameReader.load function is not supported. A workaround is to use Snowpark DataFrameReader format specific method instead (avro csv, json, orc, parquet). The path parameter should be a stage location.
my_session.read.load(path_parquet_file, "parquet", schema=schemaParam, pathGlobFilter="*.parquet").show()
```

**Recommended fix**

As a workaround, you can use [Snowpark DataFrameReader](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with `parquet` method
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and puts the file into it, then calls the `PARQUET` method.

```python
path_parquet_file = "/path/to/file.parquet"

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.parquet", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.parquet"

schemaParam = StructType([
        StructField("Name", StringType(), True),
        StructField("Superhero", StringType(), True)
    ])

my_session.read.parquet(stage_file_path).show()
```

* Fixing `schema` parameter:

  + The schema can be set by using the [schema](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.schema) function as follows:

```python
schemaParam = StructType([
        StructField("name", StringType(), True),
        StructField("city", StringType(), True)
    ])

df = my_session.read.schema(schemaParam).parquet(temp_stage)
```

* Fixing `options` parameter:

The [options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader.options) between Spark and snowpark are not the same, in this case `pathGlobFilter` is replaced with `PATTERN`, the **Additional recommendations** section has a table with all the Equivalences.

Below is an example that creates a dictionary with `PATTERN`, and calls the `options` method with that dictionary.

```python
optionsParam = {"PATTERN": "*.parquet"}
df = Session.read.options(optionsParam).parquet(stage)
```

### Additional recommendations

* Take into account that the options between spark and snowpark are not the same, but they can be mapped:

| Spark Options | Possible value | Snowpark equivalent | Description |
| --- | --- | --- | --- |
| header | True or False | SKIP_HEADER = 1 / SKIP_HEADER = 0 | To use the first line of a file as names of columns. |
| delimiter | Any single/multi character field separator | FIELD_DELIMITER | To specify single / multiple character(s) as a separator for each column/field. |
| sep | Any single character field separator | FIELD_DELIMITER | To specify a single character as a separator for each column/field. |
| encoding | UTF-8, UTF-16, etc… | ENCODING | To decode the CSV files by the given encoding type. Default encoding is UTF-8 |
| lineSep | Any single character line separator | RECORD_DELIMITER | To define the line separator that should be used for file parsing. |
| pathGlobFilter | File pattern | PATTERN | To define a pattern to read files only with filenames matching the pattern. |
| recursiveFileLookup | True or False | N/A | To recursively scan a directory to read files. Default value of this option is False. |
| quote | Single character to be quoted | FIELD_OPTIONALLY_ENCLOSED_BY | To quote fields/columns containing fields where the delimiter / separator can be part of the value. This character To quote all fields when used with quoteAll option. Default value of this option is double quote(“). |
| nullValue | String to replace null | NULL_IF | To replace null values with the string while reading and writing dataframe. |
| dateFormat | Valid date format | DATE_FORMAT | To define a string that indicates a date format. Default format is yyyy-MM-dd. |
| timestampFormat | Valid timestamp format | TIMESTAMP_FORMAT | To define a string that indicates a timestamp format. Default format is yyyy-MM-dd ‘T’HH:mm:ss. |
| escape | Any single character | ESCAPE | To set a single character as escaping character to override default escape character(\). |
| inferSchema | True or False | INFER_SCHEMA | Automatically detects the file schema |
| mergeSchema | True or False | N/A | Not needed in snowflake as this happens whenever the infer_schema determines the parquet file structure |

* For **modifiedBefore / modifiedAfter** option you can achieve the same result in Snowflake by using the metadata columns and then adding a filter like: `df.filter(METADATA_FILE_LAST_MODIFIED > ‘some_date’)`.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1083

Message: The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.

Category: Warning

### Description

The [pyspark.sql.readwriter.DataFrameWriter.save](https://spark.apache.org/docs/3.5.3/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.save.html) function is not supported. The workaround is to use Snowpark DataFrameWriter methods instead.

#### Scenarios

The spark signature for this method `DataFrameWriter.save(path, format, mode, partitionBy, **options)` does not exists in Snowpark. Therefore, any usage of the load function it’s going to have an EWI in the output code.

##### Scenario 1

**Input code**

Below is an example that tries to save data with `CSV` format.

```python
path_csv_file = "/path/to/file.csv"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = my_session.createDataFrame(data, schema=["Name", "Age", "City"])

df.write.save(path_csv_file, format="csv")
df.write.save(path_csv_file, format="csv", mode="overwrite")
df.write.save(path_csv_file, format="csv", mode="overwrite", lineSep="\r\n", dateFormat="YYYY/MM/DD")
df.write.save(path_csv_file, format="csv", mode="overwrite", partitionBy="City", lineSep="\r\n", dateFormat="YYYY/MM/DD")
```

**Output code**

The tool adds this EWI `SPRKPY1083` on the output code to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_csv_file = "/path/to/file.csv"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = my_session.createDataFrame(data, schema=["Name", "Age", "City"])

#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_csv_file, format="csv")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_csv_file, format="csv", mode="overwrite")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_csv_file, format="csv", mode="overwrite", lineSep="\r\n", dateFormat="YYYY/MM/DD")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_csv_file, format="csv", mode="overwrite", partitionBy="City", lineSep="\r\n", dateFormat="YYYY/MM/DD")
```

**Recommended fix**

As a workaround you can use [Snowpark DataFrameWriter](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with [csv](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) or [copy_into_location](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.copy_into_location) method.
  + If you are using `copy_into_location` method, you need to specify the format with the `file_format_type parameter`.
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and put the file into it, then calls one the methods mentioned above.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.csv", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.csv"

## Using csv method
df.write.csv(stage_file_path)

## Using copy_into_location method
df.write.copy_into_location(stage_file_path, file_format_type="csv")
```

* Fixing `mode` parameter:

  + Use the [mode](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.mode) function from [Snowpark DataFrameWriter](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter), as follows:

Below is an example that adds into the daisy chain the `mode` method with `overwrite` as a parameter.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using csv method
df.write.mode("overwrite").csv(temp_stage)

## Using copy_into_location method
df.write.mode("overwrite").copy_into_location(temp_stage, file_format_type="csv")
```

* Fixing `partitionBy` parameter:

  + Use the [partition_by](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

Below is an example that used the `partition_by` parameter from the `CSV` method.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using csv method
df.write.csv(temp_stage, partition_by="City")

## Using copy_into_location method
df.write.copy_into_location(temp_stage, file_format_type="csv", partition_by="City")
```

* Fixing `options` parameter:

  + Use the [format_type_options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

The options between spark and snowpark are not the same, in this case `lineSep` and `dateFormat` are replaced with `RECORD_DELIMITER` and `DATE_FORMAT`, the **Additional recommendations** section has table with all the Equivalences.

Below is an example that creates a dictionary with `RECORD_DELIMITER` and `DATE_FORMAT`, and calls the `options` method with that dictionary.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
optionsParam = {"RECORD_DELIMITER": "\r\n", "DATE_FORMAT": "YYYY/MM/DD"}

## Using csv method
df.write.csv(stage, format_type_options=optionsParam)

## Using copy_into_location method
df.write.csv(stage, file_format_type="csv", format_type_options=optionsParam)
```

### Scenario 2

**Input code**

Below is an example that tries to save data with `JSON` format.

```python
path_json_file = "/path/to/file.json"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

df.write.save(path_json_file, format="json")
df.write.save(path_json_file, format="json", mode="overwrite")
df.write.save(path_json_file, format="json", mode="overwrite", dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3")
df.write.save(path_json_file, format="json", mode="overwrite", partitionBy="City", dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3")
```

**Output code**

The tool adds this EWI `SPRKPY1083` on the output code to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_json_file = "/path/to/file.json"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_json_file, format="json")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_json_file, format="json", mode="overwrite")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_json_file, format="json", mode="overwrite", dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_json_file, format="json", mode="overwrite", partitionBy="City", dateFormat="YYYY/MM/DD", timestampFormat="YYYY-MM-DD HH24:MI:SS.FF3")
```

**Recommended fix**

As a workaround you can use [Snowpark DataFrameReader](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with [json](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.json) or [copy_into_location](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.copy_into_location) method
  + If you are using `copy_into_location` method, you need to specify the format with the `file_format_type parameter`.
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and put the file into it, then calls one the methods mentioned above.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.json", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.json"

## Using json method
df.write.json(stage_file_path)

## Using copy_into_location method
df.write.copy_into_location(stage_file_path, file_format_type="json")
```

* Fixing `mode` parameter:

  + Use the [mode](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.mode) function from [Snowpark DataFrameWriter](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter), as follows:

Below is an example that adds into the daisy chain the `mode` method with `overwrite` as a parameter.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using json method
df.write.mode("overwrite").json(temp_stage)

## Using copy_into_location method
df.write.mode("overwrite").copy_into_location(temp_stage, file_format_type="json")
```

* Fixing `partitionBy` parameter:

  + Use the [partition_by](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

Below is an example that used the `partition_by` parameter from the `CSV` method.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using json method
df.write.json(temp_stage, partition_by="City")

## Using copy_into_location method
df.write.copy_into_location(temp_stage, file_format_type="json", partition_by="City")
```

* Fixing `options` parameter:

  + Use the [format_type_options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

The options between spark and snowpark are not the same, in this case `dateFormat` and `timestampFormat` are replaced with `DATE_FORMAT` and `TIMESTAMP_FORMAT`, the **Additional recommendations** section has table with all the Equivalences.

Below is an example that creates a dictionary with `DATE_FORMAT` and `TIMESTAMP_FORMAT`, and calls the `options` method with that dictionary.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
optionsParam = {"DATE_FORMAT": "YYYY/MM/DD", "TIMESTAMP_FORMAT": "YYYY-MM-DD HH24:MI:SS.FF3"}

## Using json method
df.write.json(stage, format_type_options=optionsParam)

## Using copy_into_location method
df.write.copy_into_location(stage, file_format_type="json", format_type_options=optionsParam)
```

### Scenario 3

**Input code**

Below is an example that tries to save data with `PARQUET` format.

```python
path_parquet_file = "/path/to/file.parquet"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

df.write.save(path_parquet_file, format="parquet")
df.write.save(path_parquet_file, format="parquet", mode="overwrite")
df.write.save(path_parquet_file, format="parquet", mode="overwrite", pathGlobFilter="*.parquet")
df.write.save(path_parquet_file, format="parquet", mode="overwrite", partitionBy="City", pathGlobFilter="*.parquet")
```

**Output code**

The tool adds this EWI `SPRKPY1083` on the output code to let you know that this function is not supported by Snowpark, but it has a workaround.

```python
path_parquet_file = "/path/to/file.parquet"

data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_parquet_file, format="parquet")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_parquet_file, format="parquet", mode="overwrite")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_parquet_file, format="parquet", mode="overwrite", pathGlobFilter="*.parquet")
#EWI: SPRKPY1083 => The pyspark.sql.readwriter.DataFrameWriter.save function is not supported. A workaround is to use Snowpark DataFrameWriter copy_into_location method instead.
df.write.save(path_parquet_file, format="parquet", mode="overwrite", partitionBy="City", pathGlobFilter="*.parquet")
```

**Recommended fix**

As a workaround you can use [Snowpark DataFrameReader](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameReader) methods instead.

* Fixing `path` and `format` parameters:

  + Replace the `load` method with [parquet](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.parquet) or [copy_into_location](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrameWriter.copy_into_location) method.
  + If you are using `copy_into_location` method, you need to specify the format with the `file_format_type parameter`.
  + The first parameter `path` must be in a stage to make an equivalence with [Snowpark](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).

Below is an example that creates a temporal stage and put the file into it, then calls one the methods mentioned above.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Stage creation

temp_stage = f'{Session.get_fully_qualified_current_schema()}.{_generate_prefix("TEMP_STAGE")}'
my_session.sql(f'CREATE TEMPORARY STAGE IF NOT EXISTS {temp_stage}').show()
my_session.file.put(f"file:///path/to/file.parquet", f"@{temp_stage}")
stage_file_path = f"{temp_stage}file.parquet"

## Using parquet method
df.write.parquet(stage_file_path)

## Using copy_into_location method
df.write.copy_into_location(stage, file_format_type="parquet")
```

* Fixing `mode` parameter:

  + Use the [mode](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.mode) function from [Snowpark DataFrameWriter](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter), as follows:

Below is an example that adds into the daisy chain the `mode` method with `overwrite` as a parameter.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using parquet method
df.write.mode("overwrite").parquet(temp_stage)

## Using copy_into_location method
df.write.mode("overwrite").copy_into_location(stage, file_format_type="parquet")
```

* Fixing `partitionBy` parameter:

  + Use the [partition_by](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

Below is an example that used the `partition_by` parameter from the `parquet` method.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

## Using parquet method
df.write.parquet(temp_stage, partition_by="City")

## Using copy_into_location method
df.write.copy_into_location(stage, file_format_type="parquet", partition_by="City")
```

* Fixing `options` parameter:

  + Use the [format_type_options](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.csv) parameter from the `CSV` method, as follows:

The options between spark and snowpark are not the same, in this case `pathGlobFilter` is replaced with `PATTERN`, the **Additional recommendations** section has table with all the Equivalences.

Below is an example that creates a dictionary with `PATTERN`, and calls the `options` method with that dictionary.

```python
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]
df = spark.createDataFrame(data, schema=["Name", "Age", "City"])
optionsParam = {"PATTERN": "*.parquet"}

## Using parquet method
df.write.parquet(stage, format_type_options=optionsParam)

## Using copy_into_location method
df.write.copy_into_location(stage, file_format_type="parquet", format_type_options=optionsParam)
```

### Additional recommendations

* Take into account the options between spark and snowpark are not the same, but they can be mapped:

| Spark Options | Possible value | Snowpark equivalent | Description |
| --- | --- | --- | --- |
| header | True or False | SKIP_HEADER = 1 / SKIP_HEADER = 0 | To use the first line of a file as names of columns. |
| delimiter | Any single/multi character field separator | FIELD_DELIMITER | To specify single / multiple character(s) as a separator for each column/field. |
| sep | Any single character field separator | FIELD_DELIMITER | To specify a single character as a separator for each column/field. |
| encoding | UTF-8, UTF-16, etc… | ENCODING | To decode the CSV files by the given encoding type. Default encoding is UTF-8 |
| lineSep | Any single character line separator | RECORD_DELIMITER | To define the line separator that should be used for file parsing. |
| pathGlobFilter | File pattern | PATTERN | To define a pattern to read files only with filenames matching the pattern. |
| recursiveFileLookup | True or False | N/A | To recursively scan a directory to read files. Default value of this option is False. |
| quote | Single character to be quoted | FIELD_OPTIONALLY_ENCLOSED_BY | To quote fields/columns containing fields where the delimiter / separator can be part of the value. This character To quote all fields when used with quoteAll option. Default value of this option is double quote(“). |
| nullValue | String to replace null | NULL_IF | To replace null values with the string while reading and writing dataframe. |
| dateFormat | Valid date format | DATE_FORMAT | To define a string that indicates a date format. Default format is yyyy-MM-dd. |
| timestampFormat | Valid timestamp format | TIMESTAMP_FORMAT | To define a string that indicates a timestamp format. Default format is yyyy-MM-dd ‘T’HH:mm:ss. |
| escape | Any single character | ESCAPE | To set a single character as escaping character to override default escape character(\). |
| inferSchema | True or False | INFER_SCHEMA | Automatically detects the file schema |
| mergeSchema | True or False | N/A | Not needed in snowflake as this happens whenever the infer_schema determines the parquet file structure |

* For **modifiedBefore / modifiedAfter** option you can achieve the same result in Snowflake by using the metadata columns and then add a filter like: `df.filter(METADATA_FILE_LAST_MODIFIED > ‘some_date’)`.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1084

This issue code has been **deprecated** since [Spark Conversion Core 4.12.0](../../../general/release-notes/README.md)

Message: pyspark.sql.readwriter.DataFrameWriter.option is not supported.

Category: Warning

### Description

The [pyspark.sql.readwriter.DataFrameWriter.option](https://spark.apache.org/docs/3.5.3/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.option.html) function is not supported.

#### Scenario

**Input code**

Below is an example using the `option` method, this method is used to add additional configurations when writing the data of a DataFrame.

```python
path_csv_file = "/path/to/file.csv"
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

df.write.option("header", True).csv(csv_file_path)
df.write.option("sep", ";").option("lineSep","-").csv(csv_file_path)
```

**Output code**

The tool adds this EWI `SPRKPY1084` on the output code to let you know that this function is not supported by Snowpark.

```python
path_csv_file = "/path/to/file.csv"
data = [
        ("John", 30, "New York"),
        ("Jane", 25, "San Francisco")
    ]

df = spark.createDataFrame(data, schema=["Name", "Age", "City"])

#EWI: SPRKPY1084 => The pyspark.sql.readwriter.DataFrameWriter.option function is not supported.

df.write.option("header", True).csv(csv_file_path)
#EWI: SPRKPY1084 => The pyspark.sql.readwriter.DataFrameWriter.option function is not supported.
df.write.option("sep", ";").option("lineSep","-").csv(csv_file_path)
```

**Recommended fix**

The pyspark.sql.readwriter.DataFrameWriter.option method does not have a recommended fix.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1085

Message: pyspark.ml.feature.VectorAssembler is not supported.

Category: Warning

### Description

The [pyspark.ml.feature.VectorAssembler](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.ml.feature.VectorAssembler.html) is not supported.

#### Scenario

**Input code**

VectorAssembler is used to combine several columns into a single vector.

```python
data = [
        (1, 10.0, 20.0),
        (2, 25.0, 30.0),
        (3, 50.0, 60.0)
    ]

df = SparkSession.createDataFrame(data, schema=["Id", "col1", "col2"])
vector = VectorAssembler(inputCols=["col1", "col2"], output="cols")
```

**Output code**

The tool adds this EWI `SPRKPY1085` on the output code to let you know that this class is not supported by Snowpark.

```python
data = [
        (1, 10.0, 20.0),
        (2, 25.0, 30.0),
        (3, 50.0, 60.0)
    ]

df = spark.createDataFrame(data, schema=["Id", "col1", "col2"])
#EWI: SPRKPY1085 => The pyspark.ml.feature.VectorAssembler function is not supported.

vector = VectorAssembler(inputCols=["col1", "col2"], output="cols")
```

**Recommended fix**

The pyspark.ml.feature.VectorAssembler does not have a recommended fix.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1086

Message: pyspark.ml.linalg.VectorUDT is not supported.

Category: Warning

### Description

The [pyspark.ml.linalg.VectorUDT](https://spark.apache.org/docs/latest/api/python/_modules/pyspark/ml/linalg.html) is not supported.

#### Scenario

**Input code**

VectorUDT is a data type to represent vector columns in a DataFrame.

```python
data = [
        (1, Vectors.dense([10.0, 20.0])),
        (2, Vectors.dense([25.0, 30.0])),
        (3, Vectors.dense([50.0, 60.0]))
    ]

schema = StructType([
        StructField("Id", IntegerType(), True),
        StructField("VectorCol", VectorUDT(), True),
    ])

df = SparkSession.createDataFrame(data, schema=schema)
```

**Output code**

The tool adds this EWI `SPRKPY1086` on the output code to let you know that this function is not supported by Snowpark.

```python
data = [
        (1, Vectors.dense([10.0, 20.0])),
        (2, Vectors.dense([25.0, 30.0])),
        (3, Vectors.dense([50.0, 60.0]))
    ]

#EWI: SPRKPY1086 => The pyspark.ml.linalg.VectorUDT function is not supported.
schema = StructType([
        StructField("Id", IntegerType(), True),
        StructField("VectorCol", VectorUDT(), True),
    ])

df = spark.createDataFrame(data, schema=schema)
```

**Recommended fix**

The pyspark.ml.linalg.VectorUDT does not have a recommended fix.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1087

Message: The pyspark.sql.dataframe.DataFrame.writeTo function is not supported, but it has a workaround.

Category: Warning.

### Description

The [pyspark.sql.dataframe.DataFrame.writeTo](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrame.writeTo.html) function is not supported. The workaround is to use Snowpark DataFrameWriter [SaveAsTable](https://docs.snowflake.com/developer-guide/snowpark/reference/python/latest/snowpark/api/snowflake.snowpark.DataFrameWriter.saveAsTable) method instead.

#### Scenario

**Input**

Below is an example of a use of the `pyspark.sql.dataframe.DataFrame.writeTo` function, the dataframe `df` is written into a table name `Personal_info`.

```python
df = spark.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]],
                                 schema=["FIRST_NAME", "LAST_NAME"])

df.writeTo("Personal_info")
```

**Output**

The SMA adds the EWI `SPRKPY1087` to the output code to let you know that this function is not supported, but has a workaround.

```python
df = spark.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]],
                                 schema=["FIRST_NAME", "LAST_NAME"])

#EWI: SPRKPY1087 => pyspark.sql.dataframe.DataFrame.writeTo is not supported, but it has a workaround.
df.writeTo("Personal_info")
```

**Recommended fix**

The workaround is to use Snowpark DataFrameWriter SaveAsTable method instead.

```python
df = spark.createDataFrame([["John", "Berry"], ["Rick", "Berry"], ["Anthony", "Davis"]],
                                 schema=["FIRST_NAME", "LAST_NAME"])

df.write.saveAsTable("Personal_info")
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1088

Message: The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.

Category: Warning

### Description

The [pyspark.sql.readwriter.DataFrameWriter.option](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.option.html) values in Snowpark may be different, so validation might be needed to ensure that the behavior is correct.

#### Scenarios

There are some scenarios depending on the option it is supported or not, or the format used to write the file.

##### Scenario 1

**Input**

Below is an example of the usage of the method option, adding a `sep` option, which is currently `supported`.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])

df.write.option("sep", ",").csv("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1088` indicating that it is required validation.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.option("sep", ",").csv("some_path")
```

**Recommended fix**

The Snowpark API supports this parameter, so the only action can be to check the behavior after the migration. Please refer to the **Equivalences table** to see the supported parameters.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.option("sep", ",").csv("some_path")
```

##### Scenario 2

**Input**

Here the scenario shows the usage of option, but adds a `header` option, which is `not supported`.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])

df.write.option("header", True).csv("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1088` indicating that it is required validation is needed.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.option("header", True).csv("some_path")
```

**Recommended fix**

For this scenario it is recommended to evaluate the Snowpark [format type options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions) to see if it is possible to change it according to your needs. Also, check the behavior after the change.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.csv("some_path")
```

##### Scenario 3

**Input**

This scenario adds a `sep` option, which is `supported` and uses the `JSON` method.

* Note: this scenario also applies for `PARQUET`.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])

df.write.option("sep", ",").json("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1088` indicating that it is required validation is needed.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.option("sep", ",").json("some_path")
```

**Recommended fix**

The file format `JSON` does not support the parameter `sep`, so it is recommended to evaluate the snowpark [format type options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions) to see if it is possible to change it according to your needs. Also, check the behavior after the change.

```python
df = spark.createDataFrame([(100, "myVal")], ["ID", "Value"])
#EWI: SPRKPY1088 => The pyspark.sql.readwriter.DataFrameWriter.option values in Snowpark may be different, so required validation might be needed.
df.write.json("some_path")
```

#### Additional recommendations

* Since there are some `not supported` parameters, it is recommended to check the `table of equivalences` and check the behavior after the transformation.
* **Equivalences table:**

| PySpark Option | SnowFlake Option | Supported File Formats | Description |
| --- | --- | --- | --- |
| SEP | FIELD_DELIMITER | CSV | One or more single byte or multibyte characters that separate fields in an input file. |
| LINESEP | RECORD_DELIMITER | CSV | One or more characters that separate records in an input file. |
| QUOTE | FIELD_OPTIONALLY_ENCLOSED_BY | CSV | Character used to enclose strings. |
| NULLVALUE | NULL_IF | CSV | String used to convert to and from SQL NULL. |
| DATEFORMAT | DATE_FORMAT | CSV | String that defines the format of date values in the data files to be loaded. |
| TIMESTAMPFORMAT | TIMESTAMP_FORMAT | CSV | String that defines the format of timestamp values in the data files to be loaded. |

If the parameter used is not in the list, the API throws an error.

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1089

Message: The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.

Category: Warning

### Description

The [pyspark.sql.readwriter.DataFrameWriter.options](https://spark.apache.org/docs/latest/api/python/reference/pyspark.sql/api/pyspark.sql.DataFrameWriter.options.html) values in Snowpark may be different, so validation might be needed to ensure that the behavior is correct.

#### Scenarios

There are some scenarios, depending on whether the options are supported or not, or the format used to write the file.

##### Scenario 1

**Input**

Below is an example of the usage of the method options, adding the options `sep` and `nullValue`, which are currently `supported`.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])

df.write.options(nullValue="myVal", sep=",").csv("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1089` indicating that it is required validation.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.options(nullValue="myVal", sep=",").csv("some_path")
```

**Recommended fix**

The Snowpark API supports these parameters, so the only action can be to check the behavior after the migration. Please refer to the **Equivalences table** to see the supported parameters.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.options(nullValue="myVal", sep=",").csv("some_path")
```

##### Scenario 2

**Input**

Here the scenario shows the usage of options, but adds a `header` option, which is `not supported`.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])

df.write.options(header=True, sep=",").csv("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1089` indicating that it is required validation is needed.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.options(header=True, sep=",").csv("some_path")
```

**Recommended fix**

For this scenario it is recommended to evaluate the Snowpark [format type options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions) to see if it is possible to change it according to your needs. Also, check the behavior after the change.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.csv("some_path")
```

##### Scenario 3

**Input**

This scenario adds a `sep` option, which is `supported` and uses the `JSON` method.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])

df.write.options(nullValue="myVal", sep=",").json("some_path")
```

**Output**

The tool adds the EWI `SPRKPY1089` indicating that it is required validation is needed.

* Note: this scenario also applies for `PARQUET`.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.options(nullValue="myVal", sep=",").json("some_path")
```

**Recommended fix**

The file format `JSON` does not support the parameter `sep`, so it is recommended to evaluate the snowpark [format type options](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions) to see if it is possible to change it according to your needs. Also, check the behavior after the change.

```python
df = spark.createDataFrame([(1, "myVal")], [2, "myVal2"], [None, "myVal3" ])
#EWI: SPRKPY1089 => The pyspark.sql.readwriter.DataFrameWriter.options values in Snowpark may be different, so required validation might be needed.
df.write.json("some_path")
```

#### Additional recommendations

* Since there are some `not supported` parameters, it is recommended to check the `table of equivalences` and check the behavior after the transformation.
* **Equivalences table:**

Snowpark can support a list of **equivalences** for some parameters:

| PySpark Option | SnowFlake Option | Supported File Formats | Description |
| --- | --- | --- | --- |
| SEP | FIELD_DELIMITER | CSV | One or more single byte or multibyte characters that separate fields in an input file. |
| LINESEP | RECORD_DELIMITER | CSV | One or more characters that separate records in an input file. |
| QUOTE | FIELD_OPTIONALLY_ENCLOSED_BY | CSV | Character used to enclose strings. |
| NULLVALUE | NULL_IF | CSV | String used to convert to and from SQL NULL. |
| DATEFORMAT | DATE_FORMAT | CSV | String that defines the format of date values in the data files to be loaded. |
| TIMESTAMPFORMAT | TIMESTAMP_FORMAT | CSV | String that defines the format of timestamp values in the data files to be loaded. |

If the parameter used is not in the list, the API throws an error.

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKPY1101

### Category

Parsing error.

#### Description

When the tool recognizes a parsing error, it tries to recover from it and continues the process in the next line. In those cases, it shows the error and comments on the line.

This example shows how a mismatch error between spaces and tabs is handled.

**Input code**

```python
def foo():
    x = 5 # Spaces
     y = 6 # Tab

def foo2():
    x=6
    y=7
```

**Output code**

```python
def foo():
    x = 5 # Spaces
## EWI: SPRKPY1101 => Unrecognized or invalid CODE STATEMENT @(3, 2). Last valid token was '5' @(2, 9), failed token 'y' @(3, 2)
## y = 6 # Tab

def foo2():
    x=6
    y=7
```

### Recommendations

* Try fixing the commented line.
* For more support, email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). If you have a support contract with Snowflake, reach out to your sales engineer, who can direct your support needs.

---
title: Snowpark Migration Accelerator: Issue Codes for Spark - Scala
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/spark-scala/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Issue Codes for Spark - Scala

## SPRKSCL1126

Message: org.apache.spark.sql.functions.covar_pop has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.covar_pop](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

**Input**

Below is an example of the `org.apache.spark.sql.functions.covar_pop` function, first used with column names as the arguments and then with column objects.

```scala
val df = Seq(
  (10.0, 100.0),
  (20.0, 150.0),
  (30.0, 200.0),
  (40.0, 250.0),
  (50.0, 300.0)
).toDF("column1", "column2")

val result1 = df.select(covar_pop("column1", "column2").as("covariance_pop"))
val result2 = df.select(covar_pop(col("column1"), col("column2")).as("covariance_pop"))
```

**Output**

The SMA adds the EWI `SPRKSCL1126` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  (10.0, 100.0),
  (20.0, 150.0),
  (30.0, 200.0),
  (40.0, 250.0),
  (50.0, 300.0)
).toDF("column1", "column2")

/*EWI: SPRKSCL1126 => org.apache.spark.sql.functions.covar_pop has a workaround, see documentation for more info*/
val result1 = df.select(covar_pop("column1", "column2").as("covariance_pop"))
/*EWI: SPRKSCL1126 => org.apache.spark.sql.functions.covar_pop has a workaround, see documentation for more info*/
val result2 = df.select(covar_pop(col("column1"), col("column2")).as("covariance_pop"))
```

**Recommended fix**

Snowpark has an equivalent [covar_pop](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html#covar_pop(column1:com.snowflake.snowpark.Column,column2:com.snowflake.snowpark.Column):com.snowflake.snowpark.Column) function that receives two column objects as arguments. For that reason, the Spark overload that receives two column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives two string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html#col(colName:String):com.snowflake.snowpark.Column) function as a workaround.

```scala
val df = Seq(
  (10.0, 100.0),
  (20.0, 150.0),
  (30.0, 200.0),
  (40.0, 250.0),
  (50.0, 300.0)
).toDF("column1", "column2")

val result1 = df.select(covar_pop(col("column1"), col("column2")).as("covariance_pop"))
val result2 = df.select(covar_pop(col("column1"), col("column2")).as("covariance_pop"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1112

Message: **\*spark element\*** is not supported

Category: Conversion error

### Description

This issue appears when the SMA detects the use of a Spark element that is not supported by Snowpark, and it does not have its own error code associated with it. This is a generic error code used by the SMA for any unsupported Spark element.

#### Scenario

**Input**

Below is an example of a Spark element that is not supported by Snowpark, and therefore it generates this EWI.

```scala
val df = session.range(10)
val result = df.isLocal
```

**Output**

The SMA adds the EWI `SPRKSCL1112` to the output code to let you know that this element is not supported by Snowpark.

```scala
val df = session.range(10)
/*EWI: SPRKSCL1112 => org.apache.spark.sql.Dataset.isLocal is not supported*/
val result = df.isLocal
```

**Recommended fix**

Since this is a generic error code that applies to a range of unsupported functions, there is not a single and specific fix. The appropriate action will depend on the particular element in use.

Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It only means that the SMA itself cannot find the solution.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1143

Message: An error occurred when loading the symbol table

Category: Conversion error

### Description

This issue appears when there is an error loading the symbols of the SMA symbol table. The symbol table is part of the underlying architecture of the SMA allowing for more complex conversions.

#### Additional recommendations

* This is unlikely to be an error in the source code itself, but rather is an error in how the SMA processes the source code. The best resolution would be to post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1153

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.3.2](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.max has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.max](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.max` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(10, 12, 20, 15, 18).toDF("value")
val result1 = df.select(max("value"))
val result2 = df.select(max(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1153` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(10, 12, 20, 15, 18).toDF("value")
/*EWI: SPRKSCL1153 => org.apache.spark.sql.functions.max has a workaround, see documentation for more info*/
val result1 = df.select(max("value"))
/*EWI: SPRKSCL1153 => org.apache.spark.sql.functions.max has a workaround, see documentation for more info*/
val result2 = df.select(max(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [max](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(10, 12, 20, 15, 18).toDF("value")
val result1 = df.select(max(col("value")))
val result2 = df.select(max(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1102

This issue code has been **deprecated** since [Spark Conversion Core 2.3.22](../../../general/release-notes/README.md)

Message:Explode is not supported

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.explode](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which is not supported by Snowpark.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.explode` function used to get the consolidated information of the array fields of the dataset.

```scala
    val explodeData = Seq(
      Row("Cat", Array("Gato","Chat")),
      Row("Dog", Array("Perro","Chien")),
      Row("Bird", Array("Ave","Oiseau"))
    )

    val explodeSchema = StructType(
      List(
        StructField("Animal", StringType),
        StructField("Translation", ArrayType(StringType))
      )
    )

    val rddExplode = session.sparkContext.parallelize(explodeData)

    val dfExplode = session.createDataFrame(rddExplode, explodeSchema)

    dfExplode.select(explode(dfExplode("Translation").alias("exploded")))
```

**Output**

The SMA adds the EWI `SPRKSCL1102` to the output code to let you know that this function is not supported by Snowpark.

```scala
    val explodeData = Seq(
      Row("Cat", Array("Gato","Chat")),
      Row("Dog", Array("Perro","Chien")),
      Row("Bird", Array("Ave","Oiseau"))
    )

    val explodeSchema = StructType(
      List(
        StructField("Animal", StringType),
        StructField("Translation", ArrayType(StringType))
      )
    )

    val rddExplode = session.sparkContext.parallelize(explodeData)

    val dfExplode = session.createDataFrame(rddExplode, explodeSchema)

    /*EWI: SPRKSCL1102 => Explode is not supported */
    dfExplode.select(explode(dfExplode("Translation").alias("exploded")))
```

**Recommended Fix**

Since explode is not supported by Snowpark, the function [flatten](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/DataFrame.html) could be used as a substitute.

The following fix creates flatten of the dfExplode dataframe, then makes the query to replicate the result in Spark.

```scala
    val explodeData = Seq(
      Row("Cat", Array("Gato","Chat")),
      Row("Dog", Array("Perro","Chien")),
      Row("Bird", Array("Ave","Oiseau"))
    )

    val explodeSchema = StructType(
      List(
        StructField("Animal", StringType),
        StructField("Translation", ArrayType(StringType))
      )
    )

    val rddExplode = session.sparkContext.parallelize(explodeData)

    val dfExplode = session.createDataFrame(rddExplode, explodeSchema)

     var dfFlatten = dfExplode.flatten(col("Translation")).alias("exploded")
                              .select(col("exploded.value").alias("Translation"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1136

> **Warning:**
>
> This issue code is **deprecated** since [Spark Conversion Core 4.3.2](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.min has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.min](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.min` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
val result1 = df.select(min("value"))
val result2 = df.select(min(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1136` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
/*EWI: SPRKSCL1136 => org.apache.spark.sql.functions.min has a workaround, see documentation for more info*/
val result1 = df.select(min("value"))
/*EWI: SPRKSCL1136 => org.apache.spark.sql.functions.min has a workaround, see documentation for more info*/
val result2 = df.select(min(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [min](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that takes a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
val result1 = df.select(min(col("value")))
val result2 = df.select(min(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1167

Message: Project file not found on input folder

Category: Warning

### Description

This issue appears when the SMA detects that input folder do not have any project configuration file. The project configuration files supported by the SMA are:

* build.sbt
* build.gradle
* pom.xml

#### Additional recommendations

* Include a configuration project file on input folder.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1147

Message: org.apache.spark.sql.functions.tanh has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.tanh](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.tanh` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(-1.0, 0.5, 1.0, 2.0).toDF("value")
val result1 = df.withColumn("tanh_value", tanh("value"))
val result2 = df.withColumn("tanh_value", tanh(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1147` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(-1.0, 0.5, 1.0, 2.0).toDF("value")
/*EWI: SPRKSCL1147 => org.apache.spark.sql.functions.tanh has a workaround, see documentation for more info*/
val result1 = df.withColumn("tanh_value", tanh("value"))
/*EWI: SPRKSCL1147 => org.apache.spark.sql.functions.tanh has a workaround, see documentation for more info*/
val result2 = df.withColumn("tanh_value", tanh(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [tanh](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(-1.0, 0.5, 1.0, 2.0).toDF("value")
val result1 = df.withColumn("tanh_value", tanh(col("value")))
val result2 = df.withColumn("tanh_value", tanh(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1116

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 2.40.1](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.split has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.split](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.split` function that generates this EWI.

```scala
val df = Seq("apple,banana,orange", "grape,lemon,lime", "cherry,blueberry,strawberry").toDF("values")
val result1 = df.withColumn("split_values", split(col("values"), ","))
val result2 = df.withColumn("split_values", split(col("values"), ",", 0))
```

**Output**

The SMA adds the EWI `SPRKSCL1116` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("apple,banana,orange", "grape,lemon,lime", "cherry,blueberry,strawberry").toDF("values")
/*EWI: SPRKSCL1116 => org.apache.spark.sql.functions.split has a workaround, see documentation for more info*/
val result1 = df.withColumn("split_values", split(col("values"), ","))
/*EWI: SPRKSCL1116 => org.apache.spark.sql.functions.split has a workaround, see documentation for more info*/
val result2 = df.withColumn("split_values", split(col("values"), ",", 0))
```

**Recommended fix**

For the Spark overload that receives two arguments, you can convert the second argument into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

The overload that receives three arguments is not yet supported by Snowpark and there is no workaround.

```scala
val df = Seq("apple,banana,orange", "grape,lemon,lime", "cherry,blueberry,strawberry").toDF("values")
val result1 = df.withColumn("split_values", split(col("values"), lit(",")))
val result2 = df.withColumn("split_values", split(col("values"), ",", 0)) // This overload is not supported yet
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1122

Message: org.apache.spark.sql.functions.corr has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.corr](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.corr` function, first used with column names as the arguments and then with column objects.

```scala
val df = Seq(
  (10.0, 20.0),
  (20.0, 40.0),
  (30.0, 60.0)
).toDF("col1", "col2")

val result1 = df.select(corr("col1", "col2"))
val result2 = df.select(corr(col("col1"), col("col2")))
```

**Output**

The SMA adds the EWI `SPRKSCL1122` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  (10.0, 20.0),
  (20.0, 40.0),
  (30.0, 60.0)
).toDF("col1", "col2")

/*EWI: SPRKSCL1122 => org.apache.spark.sql.functions.corr has a workaround, see documentation for more info*/
val result1 = df.select(corr("col1", "col2"))
/*EWI: SPRKSCL1122 => org.apache.spark.sql.functions.corr has a workaround, see documentation for more info*/
val result2 = df.select(corr(col("col1"), col("col2")))
```

**Recommended fix**

Snowpark has an equivalent [corr](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives two column objects as arguments. For that reason, the Spark overload that receives column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives two string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  (10.0, 20.0),
  (20.0, 40.0),
  (30.0, 60.0)
).toDF("col1", "col2")

val result1 = df.select(corr(col("col1"), col("col2")))
val result2 = df.select(corr(col("col1"), col("col2")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1173

Message: SQL embedded code cannot be processed.

Category: Warning.

### Description

This issue appears when the SMA detects a SQL-embedded code that can not be processed. Then, the SQL-embedded code can not be converted to Snowflake.

#### Scenario

**Input**

Below is an example of a SQL-embedded code that can not be processed.

```scala
spark.sql("CREATE VIEW IF EXISTS My View" + "AS Select * From my Table WHERE date < current_date()")
```

**Output**

The SMA adds the EWI `SPRKSCL1173` to the output code to let you know that the SQL-embedded code can not be processed.

```scala
/*EWI: SPRKSCL1173 => SQL embedded code cannot be processed.*/
spark.sql("CREATE VIEW IF EXISTS My View" + "AS Select * From my Table WHERE date < current_date()")
```

**Recommended fix**

Make sure that the SQL-embedded code is a string without interpolations, variables or string concatenations.

#### Additional recommendations

* You can find more information about SQL-embedded [here](../../../translation-reference/sql-embedded-code.md).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1163

Message: The element is not a literal and can’t be evaluated.

Category: Conversion error.

### Description

This issue occurs when the current processing element is not a literal, then it can not be evaluated by SMA.

#### Scenario

**Input**

Below is an example when element to process is not a literal and it can not be evaluated by SMA.

```scala
val format_type = "csv"
spark.read.format(format_type).load(path)
```

**Output**

The SMA adds the EWI `SPRKSCL1163` to the output code to let you know that `format_type` parameter is not a literal and it can not be evaluated by the SMA.

```scala
/*EWI: SPRKSCL1163 => format_type is not a literal and can't be evaluated*/
val format_type = "csv"
spark.read.format(format_type).load(path)
```

**Recommended fix**

* Make sure that a value of the variable is a valid one in order to avoid unexpected behaviors.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1132

Message: org.apache.spark.sql.functions.grouping_id has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.grouping_id](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.grouping_id` function, first used with multiple column name as arguments and then with column objects.

```scala
val df = Seq(
  ("Store1", "Product1", 100),
  ("Store1", "Product2", 150),
  ("Store2", "Product1", 200),
  ("Store2", "Product2", 250)
).toDF("store", "product", "amount")

val result1 = df.cube("store", "product").agg(sum("amount"), grouping_id("store", "product"))
val result2 = df.cube("store", "product").agg(sum("amount"), grouping_id(col("store"), col("product")))
```

**Output**

The SMA adds the EWI `SPRKSCL1132` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Store1", "Product1", 100),
  ("Store1", "Product2", 150),
  ("Store2", "Product1", 200),
  ("Store2", "Product2", 250)
).toDF("store", "product", "amount")

/*EWI: SPRKSCL1132 => org.apache.spark.sql.functions.grouping_id has a workaround, see documentation for more info*/
val result1 = df.cube("store", "product").agg(sum("amount"), grouping_id("store", "product"))
/*EWI: SPRKSCL1132 => org.apache.spark.sql.functions.grouping_id has a workaround, see documentation for more info*/
val result2 = df.cube("store", "product").agg(sum("amount"), grouping_id(col("store"), col("product")))
```

**Recommended fix**

Snowpark has an equivalent [grouping_id](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives multiple column objects as arguments. For that reason, the Spark overload that receives multiple column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives multiple string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("Store1", "Product1", 100),
  ("Store1", "Product2", 150),
  ("Store2", "Product1", 200),
  ("Store2", "Product2", 250)
).toDF("store", "product", "amount")

val result1 = df.cube("store", "product").agg(sum("amount"), grouping_id(col("store"), col("product")))
val result2 = df.cube("store", "product").agg(sum("amount"), grouping_id(col("store"), col("product")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1106

> **Warning:**
>
> This issue code has been **deprecated**

Message: Writer option is not supported.

Category: Conversion error.

### Description

This issue appears when the tool detects, in writer statement, the usage of an option not supported by Snowpark.

#### Scenario

**Input**

Below is an example of the [org.apache.spark.sql.DataFrameWriter.option](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameWriter.html) used to add options to a writer statement.

```scala
df.write.format("net.snowflake.spark.snowflake").option("dbtable", tablename)
```

**Output**

The SMA adds the EWI `SPRKSCL1106` to the output code to let you know that the option method is not supported by Snowpark.

```scala
df.write.saveAsTable(tablename)
/*EWI: SPRKSCL1106 => Writer option is not supported .option("dbtable", tablename)*/
```

**Recommended fix**

There is no recommended fix for this scenario

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1157

Message: org.apache.spark.sql.functions.kurtosis has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.kurtosis](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.kurtosis` function that generates this EWI. In this example, the `kurtosis` function is used to calculate the kurtosis of selected column.

```scala
val df = Seq("1", "2", "3").toDF("elements")
val result1 = kurtosis(col("elements"))
val result2 = kurtosis("elements")
```

**Output**

The SMA adds the EWI `SPRKSCL1157` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("1", "2", "3").toDF("elements")
/*EWI: SPRKSCL1157 => org.apache.spark.sql.functions.kurtosis has a workaround, see documentation for more info*/
val result1 = kurtosis(col("elements"))
/*EWI: SPRKSCL1157 => org.apache.spark.sql.functions.kurtosis has a workaround, see documentation for more info*/
val result2 = kurtosis("elements")
```

**Recommended fix**

Snowpark has an equivalent [kurtosis](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq("1", "2", "3").toDF("elements")
val result1 = kurtosis(col("elements"))
val result2 = kurtosis(col("elements"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1146

Message: org.apache.spark.sql.functions.tan has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.tan](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.tan` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(math.Pi / 4, math.Pi / 3, math.Pi / 6).toDF("angle")
val result1 = df.withColumn("tan_value", tan("angle"))
val result2 = df.withColumn("tan_value", tan(col("angle")))
```

**Output**

The SMA adds the EWI `SPRKSCL1146` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(math.Pi / 4, math.Pi / 3, math.Pi / 6).toDF("angle")
/*EWI: SPRKSCL1146 => org.apache.spark.sql.functions.tan has a workaround, see documentation for more info*/
val result1 = df.withColumn("tan_value", tan("angle"))
/*EWI: SPRKSCL1146 => org.apache.spark.sql.functions.tan has a workaround, see documentation for more info*/
val result2 = df.withColumn("tan_value", tan(col("angle")))
```

**Recommended fix**

Snowpark has an equivalent [tan](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(math.Pi / 4, math.Pi / 3, math.Pi / 6).toDF("angle")
val result1 = df.withColumn("tan_value", tan(col("angle")))
val result2 = df.withColumn("tan_value", tan(col("angle")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1117

> **Warning:**
>
> This issue code is **deprecated** since [Spark Conversion Core 2.40.1](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.translate has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.translate](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.translate` function that generates this EWI. In this example, the `translate` function is used to replace the characters **‘a’**, **‘e’** and **‘o’** in each word with **‘1’**, **‘2’** and **‘3’**, respectively.

```scala
val df = Seq("hello", "world", "scala").toDF("word")
val result = df.withColumn("translated_word", translate(col("word"), "aeo", "123"))
```

**Output**

The SMA adds the EWI `SPRKSCL1117` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("hello", "world", "scala").toDF("word")
/*EWI: SPRKSCL1117 => org.apache.spark.sql.functions.translate has a workaround, see documentation for more info*/
val result = df.withColumn("translated_word", translate(col("word"), "aeo", "123"))
```

**Recommended fix**

As a workaround, you can convert the second and third argument into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq("hello", "world", "scala").toDF("word")
val result = df.withColumn("translated_word", translate(col("word"), lit("aeo"), lit("123")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1123

Message: org.apache.spark.sql.functions.cos has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.cos](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.cos` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(0.0, Math.PI / 4, Math.PI / 2, Math.PI).toDF("angle_radians")
val result1 = df.withColumn("cosine_value", cos("angle_radians"))
val result2 = df.withColumn("cosine_value", cos(col("angle_radians")))
```

**Output**

The SMA adds the EWI `SPRKSCL1123` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(0.0, Math.PI / 4, Math.PI / 2, Math.PI).toDF("angle_radians")
/*EWI: SPRKSCL1123 => org.apache.spark.sql.functions.cos has a workaround, see documentation for more info*/
val result1 = df.withColumn("cosine_value", cos("angle_radians"))
/*EWI: SPRKSCL1123 => org.apache.spark.sql.functions.cos has a workaround, see documentation for more info*/
val result2 = df.withColumn("cosine_value", cos(col("angle_radians")))
```

**Recommended fix**

Snowpark has an equivalent [cos](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(0.0, Math.PI / 4, Math.PI / 2, Math.PI).toDF("angle_radians")
val result1 = df.withColumn("cosine_value", cos(col("angle_radians")))
val result2 = df.withColumn("cosine_value", cos(col("angle_radians")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1172

Message: Snowpark does not support StructFiled with metadata parameter.

Category: Warning

### Description

This issue appears when the SMA detects that [org.apache.spark.sql.types.StructField.apply](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/types/StructField.html) with [org.apache.spark.sql.types.Metadata](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/types/Metadata.html) as parameter. This is because Snowpark does not supported the metadata parameter.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.types.StructField.apply` function that generates this EWI. In this example, the `apply` function is used to generate and instance of StructField.

```scala
val result = StructField("f1", StringType(), True, metadata)
```

**Output**

The SMA adds the EWI `SPRKSCL1172` to the output code to let you know that metadata parameter is not supported by Snowflake.

```scala
/*EWI: SPRKSCL1172 => Snowpark does not support StructFiled with metadata parameter.*/
val result = StructField("f1", StringType(), True, metadata)
```

**Recommended fix**

Snowpark has an equivalent [com.snowflake.snowpark.types.StructField.apply](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/types/StructField$.html) function that receives three parameters. Then, as workaround, you can try to remove the metadata argument.

```scala
val result = StructField("f1", StringType(), True, metadata)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1162

> **Note:**
>
> This issue code has been **deprecated**

Message: An error occurred when extracting the dbc files.

Category: Warning.

### Description

This issue appears when a dbc file cannot be extracted. This warning could be caused by one or more of the following reasons: Too heavy, inaccessible, read-only, etc.

#### Additional recommendations

* As a workaround, you can check the size of the file if it is too heavy to be processed. Also, analyze whether the tool can access it to avoid any access issues.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1133

Message: org.apache.spark.sql.functions.least has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.least](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.least` function, first used with multiple column name as arguments and then with column objects.

```scala
val df = Seq((10, 20, 5), (15, 25, 30), (7, 14, 3)).toDF("value1", "value2", "value3")
val result1 = df.withColumn("least", least("value1", "value2", "value3"))
val result2 = df.withColumn("least", least(col("value1"), col("value2"), col("value3")))
```

**Output**

The SMA adds the EWI `SPRKSCL1133` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq((10, 20, 5), (15, 25, 30), (7, 14, 3)).toDF("value1", "value2", "value3")
/*EWI: SPRKSCL1133 => org.apache.spark.sql.functions.least has a workaround, see documentation for more info*/
val result1 = df.withColumn("least", least("value1", "value2", "value3"))
/*EWI: SPRKSCL1133 => org.apache.spark.sql.functions.least has a workaround, see documentation for more info*/
val result2 = df.withColumn("least", least(col("value1"), col("value2"), col("value3")))
```

**Recommended fix**

Snowpark has an equivalent [least](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives multiple column objects as arguments. For that reason, the Spark overload that receives multiple column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives multiple string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq((10, 20, 5), (15, 25, 30), (7, 14, 3)).toDF("value1", "value2", "value3")
val result1 = df.withColumn("least", least(col("value1"), col("value2"), col("value3")))
val result2 = df.withColumn("least", least(col("value1"), col("value2"), col("value3")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1107

> **Warning:**
>
> This issue code has been **deprecated**

Message: Writer save is not supported.

Category: Conversion error.

### Description

This issue appears when the tool detects, in writer statement, the usage of a writer save method that is not supported by Snowpark.

#### Scenario

**Input**

Below is an example of the [org.apache.spark.sql.DataFrameWriter.save](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameWriter.html) used to save the DataFrame content.

```scala
df.write.format("net.snowflake.spark.snowflake").save()
```

**Output**

The SMA adds the EWI `SPRKSCL1107` to the output code to let you know that the save method is not supported by Snowpark.

```scala
df.write.saveAsTable(tablename)
/*EWI: SPRKSCL1107 => Writer method is not supported .save()*/
```

**Recommended fix**

There is no recommended fix for this scenario

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1156

Message: org.apache.spark.sql.functions.degrees has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.degrees](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.degrees` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(math.Pi, math.Pi / 2, math.Pi / 4, math.Pi / 6).toDF("radians")
val result1 = df.withColumn("degrees", degrees("radians"))
val result2 = df.withColumn("degrees", degrees(col("radians")))
```

**Output**

The SMA adds the EWI `SPRKSCL1156` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(math.Pi, math.Pi / 2, math.Pi / 4, math.Pi / 6).toDF("radians")
/*EWI: SPRKSCL1156 => org.apache.spark.sql.functions.degrees has a workaround, see documentation for more info*/
val result1 = df.withColumn("degrees", degrees("radians"))
/*EWI: SPRKSCL1156 => org.apache.spark.sql.functions.degrees has a workaround, see documentation for more info*/
val result2 = df.withColumn("degrees", degrees(col("radians")))
```

**Recommended fix**

Snowpark has an equivalent [degrees](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(math.Pi, math.Pi / 2, math.Pi / 4, math.Pi / 6).toDF("radians")
val result1 = df.withColumn("degrees", degrees(col("radians")))
val result2 = df.withColumn("degrees", degrees(col("radians")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1127

Message: org.apache.spark.sql.functions.covar_samp has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.covar_samp](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.covar_samp` function, first used with column names as the arguments and then with column objects.

```scala
val df = Seq(
  (10.0, 20.0),
  (15.0, 25.0),
  (20.0, 30.0),
  (25.0, 35.0),
  (30.0, 40.0)
).toDF("value1", "value2")

val result1 = df.select(covar_samp("value1", "value2").as("sample_covariance"))
val result2 = df.select(covar_samp(col("value1"), col("value2")).as("sample_covariance"))
```

**Output**

The SMA adds the EWI `SPRKSCL1127` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  (10.0, 20.0),
  (15.0, 25.0),
  (20.0, 30.0),
  (25.0, 35.0),
  (30.0, 40.0)
).toDF("value1", "value2")

/*EWI: SPRKSCL1127 => org.apache.spark.sql.functions.covar_samp has a workaround, see documentation for more info*/
val result1 = df.select(covar_samp("value1", "value2").as("sample_covariance"))
/*EWI: SPRKSCL1127 => org.apache.spark.sql.functions.covar_samp has a workaround, see documentation for more info*/
val result2 = df.select(covar_samp(col("value1"), col("value2")).as("sample_covariance"))
```

**Recommended fix**

Snowpark has an equivalent [covar_samp](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives two column objects as arguments. For that reason, the Spark overload that receives two column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives two string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  (10.0, 20.0),
  (15.0, 25.0),
  (20.0, 30.0),
  (25.0, 35.0),
  (30.0, 40.0)
).toDF("value1", "value2")

val result1 = df.select(covar_samp(col("value1"), col("value2")).as("sample_covariance"))
val result2 = df.select(covar_samp(col("value1"), col("value2")).as("sample_covariance"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1113

Message: org.apache.spark.sql.functions.next_day has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.next_day](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.next_day` function, first used with a string as the second argument and then with a column object.

```scala
val df = Seq("2024-11-06", "2024-11-13", "2024-11-20").toDF("date")
val result1 = df.withColumn("next_monday", next_day(col("date"), "Mon"))
val result2 = df.withColumn("next_monday", next_day(col("date"), lit("Mon")))
```

**Output**

The SMA adds the EWI `SPRKSCL1113` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("2024-11-06", "2024-11-13", "2024-11-20").toDF("date")
/*EWI: SPRKSCL1113 => org.apache.spark.sql.functions.next_day has a workaround, see documentation for more info*/
val result1 = df.withColumn("next_monday", next_day(col("date"), "Mon"))
/*EWI: SPRKSCL1113 => org.apache.spark.sql.functions.next_day has a workaround, see documentation for more info*/
val result2 = df.withColumn("next_monday", next_day(col("date"), lit("Mon")))
```

**Recommended fix**

Snowpark has an equivalent [next_day](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives two column objects as arguments. For that reason, the Spark overload that receives two column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives a column object and a string, you can convert the string into a column object using the [com.snowflake.snowpark.functions.lit](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function as a workaround.

```scala
val df = Seq("2024-11-06", "2024-11-13", "2024-11-20").toDF("date")
val result1 = df.withColumn("next_monday", next_day(col("date"), lit("Mon")))
val result2 = df.withColumn("next_monday", next_day(col("date"), lit("Mon")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1002

Message: This code section has recovery from parsing errors **\*statement\***

Category: Parsing error.

### Description

This issue appears when the SMA detects some statement that cannot correctly read or understand in the code of a file, it is called as **parsing error**, however the SMA can recovery from that parsing error and continue analyzing the code of the file. In this case, the SMA is able to process the code of the file without errors.

#### Scenario

**Input**

Below is an example of invalid Scala code where the SMA can recovery.

```scala
Class myClass {

    def function1() & = { 1 }

    def function2() = { 2 }

    def function3() = { 3 }

}
```

**Output**

The SMA adds the EWI `SPRKSCL1002` to the output code to let you know that the code of the file has parsing errors, however the SMA can recovery from that error and continue analyzing the code of the file.

```scala
class myClass {

    def function1();//EWI: SPRKSCL1002 => Unexpected end of declaration. Failed token: '&' @(3,21).
    & = { 1 }

    def function2() = { 2 }

    def function3() = { 3 }

}
```

**Recommended fix**

Since the message pinpoint the error in the statement you can try to identify the invalid syntax and remove it or comment out that statement to avoid the parsing error.

```scala
Class myClass {

    def function1() = { 1 }

    def function2() = { 2 }

    def function3() = { 3 }

}
```

```scala
Class myClass {

    // def function1() & = { 1 }

    def function2() = { 2 }

    def function3() = { 3 }

}
```

#### Additional recommendations

* Check that the code of the file is a valid Scala code.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1142

Message: **\*spark element\*** is not defined

Category: Conversion error

### Description

This issue appears when the SMA could not determine an appropriate mapping status for the given element. This means, the SMA doesn’t know yet if this element is supported or not by Snowpark. Please note, this is a generic error code used by the SMA for any not defined element.

#### Scenario

**Input**

Below is an example of a function for which the SMA could not determine an appropriate mapping status, and therefore it generated this EWI. In this case, you should assume that `notDefinedFunction()` is a valid Spark function and the code runs.

```scala
val df = session.range(10)
val result = df.notDefinedFunction()
```

**Output**

The SMA adds the EWI `SPRKSCL1142` to the output code to let you know that this element is not defined.

```scala
val df = session.range(10)
/*EWI: SPRKSCL1142 => org.apache.spark.sql.DataFrame.notDefinedFunction is not defined*/
val result = df.notDefinedFunction()
```

**Recommended fix**

To try to identify the problem, you can perform the following validations:

* Check if it is a valid Spark element.
* Check if the element has the correct syntax and it is spelled correctly.
* Check if you are using a Spark version supported by the SMA.

If this is a valid Spark element, please report that you encountered a conversion error on that particular element using the [Report an Issue](../../../user-guide/project-overview/configuration-and-settings.md) option of the SMA and include any additional information that you think may be helpful.

Please note that if an element is not defined by the SMA, it does not mean necessarily that it is not supported by Snowpark. You should check the [Snowpark Documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/index.html) to verify if an equivalent element exist.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1152

Message: org.apache.spark.sql.functions.variance has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.variance](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.variance` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(10, 20, 30, 40, 50).toDF("value")
val result1 = df.select(variance("value"))
val result2 = df.select(variance(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1152` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(10, 20, 30, 40, 50).toDF("value")
/*EWI: SPRKSCL1152 => org.apache.spark.sql.functions.variance has a workaround, see documentation for more info*/
val result1 = df.select(variance("value"))
/*EWI: SPRKSCL1152 => org.apache.spark.sql.functions.variance has a workaround, see documentation for more info*/
val result2 = df.select(variance(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [variance](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(10, 20, 30, 40, 50).toDF("value")
val result1 = df.select(variance(col("value")))
val result2 = df.select(variance(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1103

This issue code has been **deprecated**

Message: SparkBuilder method is not supported **\*method name\***

Category: Conversion Error

### Description

This issue appears when the SMA detects a method that is not supported by Snowflake in the SparkBuilder method chaining. Therefore, it might affects the migration of the reader statement.

The following are the not supported SparkBuilder methods:

* master
* appName
* enableHiveSupport
* withExtensions

#### Scenario

**Input**

Below is an example of a SparkBuilder method chaining with many methods are not supported by Snowflake.

```scala
val spark = SparkSession.builder()
           .master("local")
           .appName("testApp")
           .config("spark.sql.broadcastTimeout", "3600")
           .enableHiveSupport()
           .getOrCreate()
```

**Output**

The SMA adds the EWI `SPRKSCL1103` to the output code to let you know that master, appName and enableHiveSupport methods are not supported by Snowpark. Then, it might affects the migration of the Spark Session statement.

```scala
val spark = Session.builder.configFile("connection.properties")
/*EWI: SPRKSCL1103 => SparkBuilder Method is not supported .master("local")*/
/*EWI: SPRKSCL1103 => SparkBuilder Method is not supported .appName("testApp")*/
/*EWI: SPRKSCL1103 => SparkBuilder method is not supported .enableHiveSupport()*/
.create
```

**Recommended fix**

To create the session is required to add the proper Snowflake Snowpark configuration.

In this example a configs variable is used.

```scala
    val configs = Map (
      "URL" -> "https://<myAccount>.snowflakecomputing.com:<port>",
      "USER" -> <myUserName>,
      "PASSWORD" -> <myPassword>,
      "ROLE" -> <myRole>,
      "WAREHOUSE" -> <myWarehouse>,
      "DB" -> <myDatabase>,
      "SCHEMA" -> <mySchema>
    )
    val session = Session.builder.configs(configs).create
```

Also is recommended the use of a configFile (profile.properties) with the connection information:

```properties
## profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATEKEY = <unencrypted_private_key_from_the_private_key_file>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

And with the `Session.builder.configFile` the session can be created:

```scala
val session = Session.builder.configFile("/path/to/properties/file").create
```

### Additional recommendations

* [Developer guide for create a session.](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-session)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1137

Message: org.apache.spark.sql.functions.sin has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.sin](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.sin` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(Math.PI / 2, Math.PI, Math.PI / 6).toDF("angle")
val result1 = df.withColumn("sin_value", sin("angle"))
val result2 = df.withColumn("sin_value", sin(col("angle")))
```

**Output**

The SMA adds the EWI `SPRKSCL1137` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(Math.PI / 2, Math.PI, Math.PI / 6).toDF("angle")
/*EWI: SPRKSCL1137 => org.apache.spark.sql.functions.sin has a workaround, see documentation for more info*/
val result1 = df.withColumn("sin_value", sin("angle"))
/*EWI: SPRKSCL1137 => org.apache.spark.sql.functions.sin has a workaround, see documentation for more info*/
val result2 = df.withColumn("sin_value", sin(col("angle")))
```

**Recommended fix**

Snowpark has an equivalent [sin](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(Math.PI / 2, Math.PI, Math.PI / 6).toDF("angle")
val result1 = df.withColumn("sin_value", sin(col("angle")))
val result2 = df.withColumn("sin_value", sin(col("angle")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1166

> **Note:**
>
> This issue code has been **deprecated**

Message: org.apache.spark.sql.DataFrameReader.format is not supported.

Category: Warning.

### Description

This issue appears when the [org.apache.spark.sql.DataFrameReader.format](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameReader.html) has an argument that is not supported by Snowpark.

#### Scenarios

There are some scenarios depending on the type of format you are trying to load. It can be a `supported`, or `non-supported` format.

##### Scenario 1

**Input**

The tool analyzes the type of format that is trying to load, the supported formats are:

* `csv`
* `json`
* `orc`
* `parquet`
* `text`

The below example shows how the tool transforms the `format` method when passing a `csv` value.

```scala
spark.read.format("csv").load(path)
```

**Output**

The tool transforms the `format` method into a `csv` method call when load function has one parameter.

```scala
spark.read.csv(path)
```

**Recommended fix**

In this case, the tool does not show the EWI, meaning there is no fix necessary.

##### Scenario 2

**Input**

The below example shows how the tool transforms the `format` method when passing a `net.snowflake.spark.snowflake` value.

```scala
spark.read.format("net.snowflake.spark.snowflake").load(path)
```

**Output**

The tool shows the EWI `SPRKSCL1166` indicating that the value `net.snowflake.spark.snowflake` is not supported.

```scala
/*EWI: SPRKSCL1166 => The parameter net.snowflake.spark.snowflake is not supported for org.apache.spark.sql.DataFrameReader.format
  EWI: SPRKSCL1112 => org.apache.spark.sql.DataFrameReader.load(scala.String) is not supported*/
spark.read.format("net.snowflake.spark.snowflake").load(path)
```

**Recommended fix**

For the `not supported` scenarios there is no specific fix since it depends on the files that are trying to be read.

##### Scenario 3

**Input**

The below example shows how the tool transforms the `format` method when passing a `csv`, but using a variable instead.

```scala
val myFormat = "csv"
spark.read.format(myFormat).load(path)
```

**Output**

Since the tool can not determine the value of the variable in runtime, shows the EWI `SPRKSCL1163` indicating that the value is not supported.

```scala
/*EWI: SPRKSCL1163 => myFormat is not a literal and can't be evaluated
  EWI: SPRKSCL1112 => org.apache.spark.sql.DataFrameReader.load(scala.String) is not supported*/
spark.read.format(myFormat).load(path)
```

**Recommended fix**

As a workaround, you can check the value of the variable and add it as a string to the `format` call.

#### Additional recommendations

* The Snowpark location only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).
* The documentation of methods supported by Snowpark can be found in the [documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/DataFrameReader.html)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1118

Message: org.apache.spark.sql.functions.trunc has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.trunc](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.trunc` function that generates this EWI.

```scala
val df = Seq(
  Date.valueOf("2024-10-28"),
  Date.valueOf("2023-05-15"),
  Date.valueOf("2022-11-20"),
).toDF("date")

val result = df.withColumn("truncated", trunc(col("date"), "month"))
```

**Output**

The SMA adds the EWI `SPRKSCL1118` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  Date.valueOf("2024-10-28"),
  Date.valueOf("2023-05-15"),
  Date.valueOf("2022-11-20"),
).toDF("date")

/*EWI: SPRKSCL1118 => org.apache.spark.sql.functions.trunc has a workaround, see documentation for more info*/
val result = df.withColumn("truncated", trunc(col("date"), "month"))
```

**Recommended fix**

As a workaround, you can convert the second argument into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq(
  Date.valueOf("2024-10-28"),
  Date.valueOf("2023-05-15"),
  Date.valueOf("2022-11-20"),
).toDF("date")

val result = df.withColumn("truncated", trunc(col("date"), lit("month")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1149

Message: org.apache.spark.sql.functions.toRadians has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.toRadians](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.toRadians` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(0, 45, 90, 180, 270).toDF("degrees")
val result1 = df.withColumn("radians", toRadians("degrees"))
val result2 = df.withColumn("radians", toRadians(col("degrees")))
```

**Output**

The SMA adds the EWI `SPRKSCL1149` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(0, 45, 90, 180, 270).toDF("degrees")
/*EWI: SPRKSCL1149 => org.apache.spark.sql.functions.toRadians has a workaround, see documentation for more info*/
val result1 = df.withColumn("radians", toRadians("degrees"))
/*EWI: SPRKSCL1149 => org.apache.spark.sql.functions.toRadians has a workaround, see documentation for more info*/
val result2 = df.withColumn("radians", toRadians(col("degrees")))
```

**Recommended fix**

As a workaround, you can use the [radians](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function. For the Spark overload that receives a string argument, you additionally have to convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq(0, 45, 90, 180, 270).toDF("degrees")
val result1 = df.withColumn("radians", radians(col("degrees")))
val result2 = df.withColumn("radians", radians(col("degrees")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1159

Message: org.apache.spark.sql.functions.stddev_samp has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.stddev_samp](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.stddev_samp` function that generates this EWI. In this example, the `stddev_samp` function is used to calculate the sample standard deviation of selected column.

```scala
val df = Seq("1.7", "2.1", "3.0", "4.4", "5.2").toDF("elements")
val result1 = stddev_samp(col("elements"))
val result2 = stddev_samp("elements")
```

**Output**

The SMA adds the EWI `SPRKSCL1159` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("1.7", "2.1", "3.0", "4.4", "5.2").toDF("elements")
/*EWI: SPRKSCL1159 => org.apache.spark.sql.functions.stddev_samp has a workaround, see documentation for more info*/
val result1 = stddev_samp(col("elements"))
/*EWI: SPRKSCL1159 => org.apache.spark.sql.functions.stddev_samp has a workaround, see documentation for more info*/
val result2 = stddev_samp("elements")
```

**Recommended fix**

Snowpark has an equivalent [stddev_samp](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq("1.7", "2.1", "3.0", "4.4", "5.2").toDF("elements")
val result1 = stddev_samp(col("elements"))
val result2 = stddev_samp(col("elements"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1108

> **Note:**
>
> This issue code has been **deprecated.**

Message: org.apache.spark.sql.DataFrameReader.format is not supported.

Category: Warning.

### Description

This issue appears when the [org.apache.spark.sql.DataFrameReader.format](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameReader.html) has an argument that is not supported by Snowpark.

#### Scenarios

There are some scenarios depending on the type of format you are trying to load. It can be a `supported`, or `non-supported` format.

##### Scenario 1

**Input**

The tool analyzes the type of format that is trying to load, the supported formats are:

* `csv`
* `json`
* `orc`
* `parquet`
* `text`

The below example shows how the tool transforms the `format` method when passing a `csv` value.

```scala
spark.read.format("csv").load(path)
```

**Output**

The tool transforms the `format` method into a `csv` method call when load function has one parameter.

```scala
spark.read.csv(path)
```

**Recommended fix**

In this case, the tool does not show the EWI, meaning there is no fix necessary.

##### Scenario 2

**Input**

The below example shows how the tool transforms the `format` method when passing a `net.snowflake.spark.snowflake` value.

```scala
spark.read.format("net.snowflake.spark.snowflake").load(path)
```

**Output**

The tool shows the EWI `SPRKSCL1108` indicating that the value `net.snowflake.spark.snowflake` is not supported.

```scala
/*EWI: SPRKSCL1108 => The parameter net.snowflake.spark.snowflake is not supported for org.apache.spark.sql.DataFrameReader.format
  EWI: SPRKSCL1112 => org.apache.spark.sql.DataFrameReader.load(scala.String) is not supported*/
spark.read.format("net.snowflake.spark.snowflake").load(path)
```

**Recommended fix**

For the `not supported` scenarios there is no specific fix since it depends on the files that are trying to be read.

##### Scenario 3

**Input**

The below example shows how the tool transforms the `format` method when passing a `csv`, but using a variable instead.

```scala
val myFormat = "csv"
spark.read.format(myFormat).load(path)
```

**Output**

Since the tool can not determine the value of the variable in runtime, shows the EWI `SPRKSCL1163` indicating that the value is not supported.

```scala
/*EWI: SPRKSCL1108 => myFormat is not a literal and can't be evaluated
  EWI: SPRKSCL1112 => org.apache.spark.sql.DataFrameReader.load(scala.String) is not supported*/
spark.read.format(myFormat).load(path)
```

**Recommended fix**

As a workaround, you can check the value of the variable and add it as a string to the `format` call.

#### Additional recommendations

* The Snowpark location only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).
* The documentation of methods supported by Snowpark can be found in the [documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/DataFrameReader.html)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1128

Message: org.apache.spark.sql.functions.exp has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.exp](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.exp` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(1.0, 2.0, 3.0).toDF("value")
val result1 = df.withColumn("exp_value", exp("value"))
val result2 = df.withColumn("exp_value", exp(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1128` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(1.0, 2.0, 3.0).toDF("value")
/*EWI: SPRKSCL1128 => org.apache.spark.sql.functions.exp has a workaround, see documentation for more info*/
val result1 = df.withColumn("exp_value", exp("value"))
/*EWI: SPRKSCL1128 => org.apache.spark.sql.functions.exp has a workaround, see documentation for more info*/
val result2 = df.withColumn("exp_value", exp(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [exp](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(1.0, 2.0, 3.0).toDF("value")
val result1 = df.withColumn("exp_value", exp(col("value")))
val result2 = df.withColumn("exp_value", exp(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1169

Message: **\*Spark element\*** is missing on the method chaining.

Category: Warning.

### Description

This issue appears when the SMA detects that a Spark element call is missing on the method chaining. SMA needs to know that Spark element to analyze the statement.

#### Scenario

**Input**

Below is an example where load function call is missing on the method chaining.

```scala
val reader = spark.read.format("json")
val df = reader.load(path)
```

**Output**

The SMA adds the EWI `SPRKSCL1169` to the output code to let you know that load function call is missing on the method chaining and SMA can not analyze the statement.

```scala
/*EWI: SPRKSCL1169 => Function 'org.apache.spark.sql.DataFrameReader.load' is missing on the method chaining*/
val reader = spark.read.format("json")
val df = reader.load(path)
```

**Recommended fix**

Make sure that all function calls of the method chaining are in the same statement.

```scala
val reader = spark.read.format("json").load(path)
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1138

Message: org.apache.spark.sql.functions.sinh has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.sinh](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.sinh` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(0.0, 1.0, 2.0, 3.0).toDF("value")
val result1 = df.withColumn("sinh_value", sinh("value"))
val result2 = df.withColumn("sinh_value", sinh(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1138` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(0.0, 1.0, 2.0, 3.0).toDF("value")
/*EWI: SPRKSCL1138 => org.apache.spark.sql.functions.sinh has a workaround, see documentation for more info*/
val result1 = df.withColumn("sinh_value", sinh("value"))
/*EWI: SPRKSCL1138 => org.apache.spark.sql.functions.sinh has a workaround, see documentation for more info*/
val result2 = df.withColumn("sinh_value", sinh(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [sinh](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(0.0, 1.0, 2.0, 3.0).toDF("value")
val result1 = df.withColumn("sinh_value", sinh(col("value")))
val result2 = df.withColumn("sinh_value", sinh(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1129

Message: org.apache.spark.sql.functions.floor has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.floor](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.floor` function, first used with a column name as an argument, then with a column object and finally with two column objects.

```scala
val df = Seq(4.75, 6.22, 9.99).toDF("value")
val result1 = df.withColumn("floor_value", floor("value"))
val result2 = df.withColumn("floor_value", floor(col("value")))
val result3 = df.withColumn("floor_value", floor(col("value"), lit(1)))
```

**Output**

The SMA adds the EWI `SPRKSCL1129` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(4.75, 6.22, 9.99).toDF("value")
/*EWI: SPRKSCL1129 => org.apache.spark.sql.functions.floor has a workaround, see documentation for more info*/
val result1 = df.withColumn("floor_value", floor("value"))
/*EWI: SPRKSCL1129 => org.apache.spark.sql.functions.floor has a workaround, see documentation for more info*/
val result2 = df.withColumn("floor_value", floor(col("value")))
/*EWI: SPRKSCL1129 => org.apache.spark.sql.functions.floor has a workaround, see documentation for more info*/
val result3 = df.withColumn("floor_value", floor(col("value"), lit(1)))
```

**Recommended fix**

Snowpark has an equivalent [floor](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

For the overload that receives a column object and a scale, you can use the [callBuiltin](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function to invoke the Snowflake builtin [FLOOR](https://docs.snowflake.com/en/sql-reference/functions/floor) function. To use it, you should pass the string **“floor”** as the first argument, the column as the second argument and the scale as the third argument.

```scala
val df = Seq(4.75, 6.22, 9.99).toDF("value")
val result1 = df.withColumn("floor_value", floor(col("value")))
val result2 = df.withColumn("floor_value", floor(col("value")))
val result3 = df.withColumn("floor_value", callBuiltin("floor", col("value"), lit(1)))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1168

Message: **\*Spark element\*** with argument(s) value(s) **\*given arguments\*** is not supported.

Category: Warning.

### Description

This issue appears when the SMA detects that Spark element with the given parameters is not supported.

#### Scenario

**Input**

Below is an example of Spark element which parameter is not supported.

```scala
spark.read.format("text").load(path)
```

**Output**

The SMA adds the EWI `SPRKSCL1168` to the output code to let you know that Spark element with the given parameter is not supported.

```scala
/*EWI: SPRKSCL1168 => org.apache.spark.sql.DataFrameReader.format(scala.String) with argument(s) value(s) (spark.format) is not supported*/
spark.read.format("text").load(path)
```

**Recommended fix**

For this scenario there is no specific fix.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1139

Message: org.apache.spark.sql.functions.sqrt has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.sqrt](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.sqrt` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(4.0, 16.0, 25.0, 36.0).toDF("value")
val result1 = df.withColumn("sqrt_value", sqrt("value"))
val result2 = df.withColumn("sqrt_value", sqrt(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1139` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(4.0, 16.0, 25.0, 36.0).toDF("value")
/*EWI: SPRKSCL1139 => org.apache.spark.sql.functions.sqrt has a workaround, see documentation for more info*/
val result1 = df.withColumn("sqrt_value", sqrt("value"))
/*EWI: SPRKSCL1139 => org.apache.spark.sql.functions.sqrt has a workaround, see documentation for more info*/
val result2 = df.withColumn("sqrt_value", sqrt(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [sqrt](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(4.0, 16.0, 25.0, 36.0).toDF("value")
val result1 = df.withColumn("sqrt_value", sqrt(col("value")))
val result2 = df.withColumn("sqrt_value", sqrt(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1119

Message: org.apache.spark.sql.Column.endsWith has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.Column.endsWith](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/Column.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.Column.endsWith` function, first used with a literal string argument and then with a column object argument.

```scala
val df1 = Seq(
  ("Alice", "alice@example.com"),
  ("Bob", "bob@example.org"),
  ("David", "david@example.com")
).toDF("name", "email")
val result1 = df1.filter(col("email").endsWith(".com"))

val df2 = Seq(
  ("Alice", "alice@example.com", ".com"),
  ("Bob", "bob@example.org", ".org"),
  ("David", "david@example.org", ".com")
).toDF("name", "email", "suffix")
val result2 = df2.filter(col("email").endsWith(col("suffix")))
```

**Output**

The SMA adds the EWI `SPRKSCL1119` to the output code to let you know that this function is not directly supported by Snowpark, but it has a workaround.

```scala
val df1 = Seq(
  ("Alice", "alice@example.com"),
  ("Bob", "bob@example.org"),
  ("David", "david@example.com")
).toDF("name", "email")
/*EWI: SPRKSCL1119 => org.apache.spark.sql.Column.endsWith has a workaround, see documentation for more info*/
val result1 = df1.filter(col("email").endsWith(".com"))

val df2 = Seq(
  ("Alice", "alice@example.com", ".com"),
  ("Bob", "bob@example.org", ".org"),
  ("David", "david@example.org", ".com")
).toDF("name", "email", "suffix")
/*EWI: SPRKSCL1119 => org.apache.spark.sql.Column.endsWith has a workaround, see documentation for more info*/
val result2 = df2.filter(col("email").endsWith(col("suffix")))
```

**Recommended fix**

As a workaround, you can use the [com.snowflake.snowpark.functions.endswith](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function, where the first argument would be the column whose values will be checked and the second argument the suffix to check against the column values. Please note that if the argument of the Spark’s `endswith` function is a literal string, you should convert it into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df1 = Seq(
  ("Alice", "alice@example.com"),
  ("Bob", "bob@example.org"),
  ("David", "david@example.com")
).toDF("name", "email")
val result1 = df1.filter(endswith(col("email"), lit(".com")))

val df2 = Seq(
  ("Alice", "alice@example.com", ".com"),
  ("Bob", "bob@example.org", ".org"),
  ("David", "david@example.org", ".com")
).toDF("name", "email", "suffix")
val result2 = df2.filter(endswith(col("email"), col("suffix")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1148

Message: org.apache.spark.sql.functions.toDegrees has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.toDegrees](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.toDegrees` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(Math.PI, Math.PI / 2, Math.PI / 4).toDF("angle_in_radians")
val result1 = df.withColumn("angle_in_degrees", toDegrees("angle_in_radians"))
val result2 = df.withColumn("angle_in_degrees", toDegrees(col("angle_in_radians")))
```

**Output**

The SMA adds the EWI `SPRKSCL1148` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(Math.PI, Math.PI / 2, Math.PI / 4).toDF("angle_in_radians")
/*EWI: SPRKSCL1148 => org.apache.spark.sql.functions.toDegrees has a workaround, see documentation for more info*/
val result1 = df.withColumn("angle_in_degrees", toDegrees("angle_in_radians"))
/*EWI: SPRKSCL1148 => org.apache.spark.sql.functions.toDegrees has a workaround, see documentation for more info*/
val result2 = df.withColumn("angle_in_degrees", toDegrees(col("angle_in_radians")))
```

**Recommended fix**

As a workaround, you can use the [degrees](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function. For the Spark overload that receives a string argument, you additionally have to convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq(Math.PI, Math.PI / 2, Math.PI / 4).toDF("angle_in_radians")
val result1 = df.withColumn("angle_in_degrees", degrees(col("angle_in_radians")))
val result2 = df.withColumn("angle_in_degrees", degrees(col("angle_in_radians")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1158

Message: org.apache.spark.sql.functions.skewness has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.skewness](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.skewness` function that generates this EWI. In this example, the `skewness` function is used to calculate the skewness of selected column.

```scala
val df = Seq("1", "2", "3").toDF("elements")
val result1 = skewness(col("elements"))
val result2 = skewness("elements")
```

**Output**

The SMA adds the EWI `SPRKSCL1158` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("1", "2", "3").toDF("elements")
/*EWI: SPRKSCL1158 => org.apache.spark.sql.functions.skewness has a workaround, see documentation for more info*/
val result1 = skewness(col("elements"))
/*EWI: SPRKSCL1158 => org.apache.spark.sql.functions.skewness has a workaround, see documentation for more info*/
val result2 = skewness("elements")
```

**Recommended fix**

Snowpark has an equivalent [skew](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq("1", "2", "3").toDF("elements")
val result1 = skew(col("elements"))
val result2 = skew(col("elements"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1109

> **Note:**
>
> This issue code has been **deprecated**

Message: The parameter is not defined for org.apache.spark.sql.DataFrameReader.option

Category: Warning

### Description

This issue appears when the SMA detects that giving parameter of [org.apache.spark.sql.DataFrameReader.option](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameReader.html) is not defined.

#### Scenario

**Input**

Below is an example of undefined parameter for `org.apache.spark.sql.DataFrameReader.option` function.

```scala
spark.read.option("header", True).json(path)
```

**Output**

The SMA adds the EWI `SPRKSCL1109` to the output code to let you know that giving parameter to the org.apache.spark.sql.DataFrameReader.option function is not defined.

```scala
/*EWI: SPRKSCL1109 => The parameter header=True is not supported for org.apache.spark.sql.DataFrameReader.option*/
spark.read.option("header", True).json(path)
```

**Recommended fix**

Check the Snowpark documentation for reader format option [here](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions), in order to identify the defined options.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1114

Message: org.apache.spark.sql.functions.repeat has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.repeat](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.repeat` function that generates this EWI.

```scala
val df = Seq("Hello", "World").toDF("word")
val result = df.withColumn("repeated_word", repeat(col("word"), 3))
```

**Output**

The SMA adds the EWI `SPRKSCL1114` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("Hello", "World").toDF("word")
/*EWI: SPRKSCL1114 => org.apache.spark.sql.functions.repeat has a workaround, see documentation for more info*/
val result = df.withColumn("repeated_word", repeat(col("word"), 3))
```

**Recommended fix**

As a workaround, you can convert the second argument into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq("Hello", "World").toDF("word")
val result = df.withColumn("repeated_word", repeat(col("word"), lit(3)))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1145

Message: org.apache.spark.sql.functions.sumDistinct has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.sumDistinct](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.sumDistinct` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Alice", 10),
  ("Alice", 20),
  ("Bob", 15)
).toDF("name", "value")

val result1 = df.groupBy("name").agg(sumDistinct("value"))
val result2 = df.groupBy("name").agg(sumDistinct(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1145` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Alice", 10),
  ("Alice", 20),
  ("Bob", 15)
).toDF("name", "value")

/*EWI: SPRKSCL1145 => org.apache.spark.sql.functions.sumDistinct has a workaround, see documentation for more info*/
val result1 = df.groupBy("name").agg(sumDistinct("value"))
/*EWI: SPRKSCL1145 => org.apache.spark.sql.functions.sumDistinct has a workaround, see documentation for more info*/
val result2 = df.groupBy("name").agg(sumDistinct(col("value")))
```

**Recommended fix**

As a workaround, you can use the [sum_distinct](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function. For the Spark overload that receives a string argument, you additionally have to convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Alice", 10),
  ("Alice", 20),
  ("Bob", 15)
).toDF("name", "value")

val result1 = df.groupBy("name").agg(sum_distinct(col("value")))
val result2 = df.groupBy("name").agg(sum_distinct(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1171

Message: Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info.

Category: Warning.

### Description

This issue appears when the SMA detects that [org.apache.spark.sql.functions.split](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) has more than two parameters or containing regex pattern.

#### Scenarios

The `split` function is used to separate the given column around matches of the given pattern. This Spark function has three overloads.

##### Scenario 1

**Input**

Below is an example of the `org.apache.spark.sql.functions.split` function that generates this EWI. In this example, the `split` function has two parameters and the second argument is a string, not a regex pattern.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
val result = df.select(split(col("words"), "Snow"))
```

**Output**

The SMA adds the EWI `SPRKSCL1171` to the output code to let you know that this function is not fully supported by Snowpark.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
/* EWI: SPRKSCL1171 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info. */
val result = df.select(split(col("words"), "Snow"))
```

**Recommended fix**

Snowpark has an equivalent [split](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as a second argument. For that reason, the Spark overload that receives a string argument in the second argument, but it is not a regex pattern, can convert the string into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
val result = df.select(split(col("words"), lit("Snow")))
```

##### Scenario 2

**Input**

Below is an example of the `org.apache.spark.sql.functions.split` function that generates this EWI. In this example, the `split` function has two parameters and the second argument is a regex pattern.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
val result = df.select(split(col("words"), "^([\\d]+-[\\d]+-[\\d])"))
```

**Output**

The SMA adds the EWI `SPRKSCL1171` to the output code to let you know that this function is not fully supported by Snowpark because regex patterns are not supported by Snowflake.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
/* EWI: SPRKSCL1171 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info. */
val result = df.select(split(col("words"), "^([\\d]+-[\\d]+-[\\d])"))
```

**Recommended fix**

Since Snowflake does not supported regex patterns, try to replace the pattern by a not regex pattern string.

##### Scenario 3

**Input**

Below is an example of the `org.apache.spark.sql.functions.split` function that generates this EWI. In this example, the `split` function has more than two parameters.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
val result = df.select(split(df("words"), "Snow", 3))
```

**Output**

The SMA adds the EWI `SPRKSCL1171` to the output code to let you know that this function is not fully supported by Snowpark, because Snowflake does not have a split function with more than two parameters.

```scala
val df = Seq("Snowflake", "Snowpark", "Snow", "Spark").toDF("words")
/* EWI: SPRKSCL1171 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info. */
val result3 = df.select(split(df("words"), "Snow", 3))
```

**Recommended fix**

Since Snowflake does not supported split function with more than two parameters, try to use the split function supported by Snowflake.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1120

Message: org.apache.spark.sql.functions.asin has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.asin](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.asin` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(0.5, 0.6, -0.5).toDF("value")
val result1 = df.select(col("value"), asin("value").as("asin_value"))
val result2 = df.select(col("value"), asin(col("value")).as("asin_value"))
```

**Output**

The SMA adds the EWI `SPRKSCL1120` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(0.5, 0.6, -0.5).toDF("value")
/*EWI: SPRKSCL1120 => org.apache.spark.sql.functions.asin has a workaround, see documentation for more info*/
val result1 = df.select(col("value"), asin("value").as("asin_value"))
/*EWI: SPRKSCL1120 => org.apache.spark.sql.functions.asin has a workaround, see documentation for more info*/
val result2 = df.select(col("value"), asin(col("value")).as("asin_value"))
```

**Recommended fix**

Snowpark has an equivalent [asin](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(0.5, 0.6, -0.5).toDF("value")
val result1 = df.select(col("value"), asin(col("value")).as("asin_value"))
val result2 = df.select(col("value"), asin(col("value")).as("asin_value"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1130

Message: org.apache.spark.sql.functions.greatest has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.greatest](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.greatest` function, first used with multiple column names as arguments and then with multiple column objects.

```scala
val df = Seq(
  ("apple", 10, 20, 15),
  ("banana", 5, 25, 18),
  ("mango", 12, 8, 30)
).toDF("fruit", "value1", "value2", "value3")

val result1 = df.withColumn("greatest", greatest("value1", "value2", "value3"))
val result2 = df.withColumn("greatest", greatest(col("value1"), col("value2"), col("value3")))
```

**Output**

The SMA adds the EWI `SPRKSCL1130` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("apple", 10, 20, 15),
  ("banana", 5, 25, 18),
  ("mango", 12, 8, 30)
).toDF("fruit", "value1", "value2", "value3")

/*EWI: SPRKSCL1130 => org.apache.spark.sql.functions.greatest has a workaround, see documentation for more info*/
val result1 = df.withColumn("greatest", greatest("value1", "value2", "value3"))
/*EWI: SPRKSCL1130 => org.apache.spark.sql.functions.greatest has a workaround, see documentation for more info*/
val result2 = df.withColumn("greatest", greatest(col("value1"), col("value2"), col("value3")))
```

**Recommended fix**

Snowpark has an equivalent [greatest](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives multiple column objects as arguments. For that reason, the Spark overload that receives column objects as arguments is directly supported by Snowpark and does not require any changes.

For the overload that receives multiple string arguments, you can convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("apple", 10, 20, 15),
  ("banana", 5, 25, 18),
  ("mango", 12, 8, 30)
).toDF("fruit", "value1", "value2", "value3")

val result1 = df.withColumn("greatest", greatest(col("value1"), col("value2"), col("value3")))
val result2 = df.withColumn("greatest", greatest(col("value1"), col("value2"), col("value3")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---

description: >-
Snowpark and Snowpark Extensions were not added to the project configuration
file.

---

## SPRKSCL1161

Message: Failed to add dependencies.

Category: Conversion error.

### Description

This issue occurs when the SMA detects a Spark version in the project configuration file that is not supported by the SMA, therefore SMA could not add the Snowpark and Snowpark Extensions dependencies to the corresponding project configuration file. If Snowpark dependencies are not added, the migrated code will not compile.

#### Scenarios

There are three possible scenarios: sbt, gradle and pom.xml. The SMA tries to process the project configuration file by removing Spark dependencies and adding Snowpark and Snowpark Extensions dependencies.

##### Scenario 1

**Input**

Below is an example of the `dependencies` section of a `sbt` project configuration file.

```properties
...
libraryDependencies += "org.apache.spark" % "spark-core_2.13" % "3.5.3"
libraryDependencies += "org.apache.spark" % "spark-sql_2.13" % "3.5.3"
...
```

**Output**

The SMA adds the EWI `SPRKSCL1161` to the issues inventory since the Spark version is not supported and keeps the output the same.

```scala
...
libraryDependencies += "org.apache.spark" % "spark-core_2.13" % "3.5.3"
libraryDependencies += "org.apache.spark" % "spark-sql_2.13" % "3.5.3"
...
```

**Recommended fix**

Manually, remove the Spark dependencies and add Snowpark and Snowpark Extensions dependencies to the `sbt` project configuration file.

```scala
...
libraryDependencies += "com.snowflake" % "snowpark" % "1.14.0"
libraryDependencies += "net.mobilize.snowpark-extensions" % "snowparkextensions" % "0.0.18"
...
```

Make sure to use the Snowpark version that best meets your project’s requirements.

##### Scenario 2

**Input**

Below is an example of the `dependencies` section of a `gradle` project configuration file.

```groovy
dependencies {
    implementation group: 'org.apache.spark', name: 'spark-core_2.13', version: '3.5.3'
    implementation group: 'org.apache.spark', name: 'spark-sql_2.13', version: '3.5.3'
    ...
}
```

**Output**

The SMA adds the EWI `SPRKSCL1161` to the issues inventory since the Spark version is not supported and keeps the output the same.

```groovy
dependencies {
    implementation group: 'org.apache.spark', name: 'spark-core_2.13', version: '3.5.3'
    implementation group: 'org.apache.spark', name: 'spark-sql_2.13', version: '3.5.3'
    ...
}
```

**Recommended fix**

Manually, remove the Spark dependencies and add Snowpark and Snowpark Extensions dependencies to the `gradle` project configuration file.

```groovy
dependencies {
    implementation 'com.snowflake:snowpark:1.14.2'
    implementation 'net.mobilize.snowpark-extensions:snowparkextensions:0.0.18'
    ...
}
```

Make sure that dependencies version are according to your project needs.

##### Scenario 3

**Input**

Below is an example of the `dependencies` section of a `pom.xml` project configuration file.

```xml
<dependencies>
  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_2.13</artifactId>
    <version>3.5.3</version>
  </dependency>

  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-sql_2.13</artifactId>
    <version>3.5.3</version>
    <scope>compile</scope>
  </dependency>
  ...
</dependencies>
```

**Output**

The SMA adds the EWI `SPRKSCL1161` to the issues inventory since the Spark version is not supported and keeps the output the same.

```xml
<dependencies>
  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-core_2.13</artifactId>
    <version>3.5.3</version>
  </dependency>

  <dependency>
    <groupId>org.apache.spark</groupId>
    <artifactId>spark-sql_2.13</artifactId>
    <version>3.5.3</version>
    <scope>compile</scope>
  </dependency>
  ...
</dependencies>
```

**Recommended fix**

Manually, remove the Spark dependencies and add Snowpark and Snowpark Extensions dependencies to the `gradle` project configuration file.

```xml
<dependencies>
  <dependency>
    <groupId>com.snowflake</groupId>
    <artifactId>snowpark</artifactId>
    <version>1.14.2</version>
  </dependency>

  <dependency>
    <groupId>net.mobilize.snowpark-extensions</groupId>
    <artifactId>snowparkextensions</artifactId>
    <version>0.0.18</version>
  </dependency>
  ...
</dependencies>
```

Make sure that dependencies version are according to your project needs.

#### Additional recommendations

* Make sure that input has a project configuration file:

  + build.sbt
  + build.gradle
  + pom.xml
* Spark version supported by the SMA is 2.12:3.1.2
* You can check the latest Snowpark version [here](https://github.com/snowflakedb/snowpark-java-scala/releases/latest).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1155

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.3.2](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.countDistinct](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.countDistinct` function, first used with column names as arguments and then with column objects.

```scala
val df = Seq(
  ("Alice", 1),
  ("Bob", 2),
  ("Alice", 3),
  ("Bob", 4),
  ("Alice", 1),
  ("Charlie", 5)
).toDF("name", "value")

val result1 = df.select(countDistinct("name", "value"))
val result2 = df.select(countDistinct(col("name"), col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1155` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Alice", 1),
  ("Bob", 2),
  ("Alice", 3),
  ("Bob", 4),
  ("Alice", 1),
  ("Charlie", 5)
).toDF("name", "value")

/*EWI: SPRKSCL1155 => org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info*/
val result1 = df.select(countDistinct("name", "value"))
/*EWI: SPRKSCL1155 => org.apache.spark.sql.functions.countDistinct has a workaround, see documentation for more info*/
val result2 = df.select(countDistinct(col("name"), col("value")))
```

**Recommended fix**

As a workaround, you can use the [count_distinct](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function. For the Spark overload that receives string arguments, you additionally have to convert the strings into column objects using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val df = Seq(
  ("Alice", 1),
  ("Bob", 2),
  ("Alice", 3),
  ("Bob", 4),
  ("Alice", 1),
  ("Charlie", 5)
).toDF("name", "value")

val result1 = df.select(count_distinct(col("name"), col("value")))
val result2 = df.select(count_distinct(col("name"), col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1104

This issue code has been **deprecated**

Message: Spark Session builder option is not supported.

Category: Conversion Error.

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.SparkSession.Builder.config](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/SparkSession$$Builder.html) function, which is setting an option of the Spark Session and it is not supported by Snowpark.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.SparkSession.Builder.config` function used to set an option in the Spark Session.

```scala
val spark = SparkSession.builder()
           .master("local")
           .appName("testApp")
           .config("spark.sql.broadcastTimeout", "3600")
           .getOrCreate()
```

**Output**

The SMA adds the EWI `SPRKSCL1104` to the output code to let you know config method is not supported by Snowpark. Then, it is not possible to set options in the Spark Session via config function and it might affects the migration of the Spark Session statement.

```scala
val spark = Session.builder.configFile("connection.properties")
/*EWI: SPRKSCL1104 => SparkBuilder Option is not supported .config("spark.sql.broadcastTimeout", "3600")*/
.create()
```

**Recommended fix**

To create the session is require to add the proper Snowflake Snowpark configuration.

In this example a configs variable is used.

```scala
    val configs = Map (
      "URL" -> "https://<myAccount>.snowflakecomputing.com:<port>",
      "USER" -> <myUserName>,
      "PASSWORD" -> <myPassword>,
      "ROLE" -> <myRole>,
      "WAREHOUSE" -> <myWarehouse>,
      "DB" -> <myDatabase>,
      "SCHEMA" -> <mySchema>
    )
    val session = Session.builder.configs(configs).create
```

Also is recommended the use of a configFile (profile.properties) with the connection information:

```properties
## profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATEKEY = <unencrypted_private_key_from_the_private_key_file>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

And with the `Session.builder.configFile` the session can be created:

```scala
val session = Session.builder.configFile("/path/to/properties/file").create
```

### Additional recommendations

* [Developer guide for create a session.](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-session)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1124

Message: org.apache.spark.sql.functions.cosh has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.cosh](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.cosh` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(0.0, 1.0, 2.0, -1.0).toDF("value")
val result1 = df.withColumn("cosh_value", cosh("value"))
val result2 = df.withColumn("cosh_value", cosh(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1124` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(0.0, 1.0, 2.0, -1.0).toDF("value")
/*EWI: SPRKSCL1124 => org.apache.spark.sql.functions.cosh has a workaround, see documentation for more info*/
val result1 = df.withColumn("cosh_value", cosh("value"))
/*EWI: SPRKSCL1124 => org.apache.spark.sql.functions.cosh has a workaround, see documentation for more info*/
val result2 = df.withColumn("cosh_value", cosh(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [cosh](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(0.0, 1.0, 2.0, -1.0).toDF("value")
val result1 = df.withColumn("cosh_value", cosh(col("value")))
val result2 = df.withColumn("cosh_value", cosh(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1175

Message: The two-parameter `udf` function is not supported in Snowpark. It should be converted into a single-parameter `udf` function. Please check the documentation to learn how to manually modify the code to make it work in Snowpark.

Category: Conversion error.

### Description

This issue appears when the SMA detects an use of the two-parameter [org.apache.spark.sql.functions.udf](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function in the source code, because Snowpark does not have an equivalent two-parameter `udf` function, then the output code might not compile.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.udf` function that generates this EWI. In this example, the `udf` function has two parameters.

```scala
val myFuncUdf = udf(new UDF1[String, Integer] {
  override def call(s: String): Integer = s.length()
}, IntegerType)
```

**Output**

The SMA adds the EWI `SPRKSCL1175` to the output code to let you know that the `udf` function is not supported, because it has two parameters.

```scala
/*EWI: SPRKSCL1175 => The two-parameter udf function is not supported in Snowpark. It should be converted into a single-parameter udf function. Please check the documentation to learn how to manually modify the code to make it work in Snowpark.*/
val myFuncUdf = udf(new UDF1[String, Integer] {
  override def call(s: String): Integer = s.length()
}, IntegerType)
```

**Recommended fix**

Snowpark only supports the single-parameter `udf` function (without the return type parameter), so you should convert your two-parameter `udf` function into a single-parameter `udf` function in order to make it work in Snowpark.

For example, for the sample code mentioned above, you would have to manually convert it into this:

```scala
val myFuncUdf = udf((s: String) => s.length())
```

Please note that there are some caveats about creating `udf` in Snowpark that might require you to make some additional manual changes to your code. Please check this other recommendations here related with creating single-parameter `udf` functions in Snowpark for more details.

#### Additional recommendations

* To learn more about how to create user-defined functions in Snowpark, please refer to the following documentation: Creating User-Defined Functions (UDFs) for DataFrames in Scala
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1001

Message: This code section has parsing errors. The parsing error was found at: line **\*line number\***, column **\*column number\***. When trying to parse **\*statement\***. This file was not converted, so it is expected to still have references to the Spark API.

Category: Parsing error.

### Description

This issue appears when the SMA detects some statement that cannot correctly read or understand in the code of a file, it is called as **parsing error**. Besides, this issue appears when a file has one or more parsing error(s).

#### Scenario

**Input**

Below is an example of invalid Scala code.

```scala
/#/(%$"$%

Class myClass {

    def function1() = { 1 }

}
```

**Output**

The SMA adds the EWI `SPRKSCL1001` to the output code to let you know that the code of the file has parsing errors. Therefore, SMA is not able to process a file with this error.

```scala
// **********************************************************************************************************************
// EWI: SPRKSCL1001 => This code section has parsing errors
// The parsing error was found at: line 0, column 0. When trying to parse ''.
// This file was not converted, so it is expected to still have references to the Spark API
// **********************************************************************************************************************
/#/(%$"$%

Class myClass {

    def function1() = { 1 }

}
```

**Recommended fix**

Since the message pinpoint the error statement you can try to identify the invalid syntax and remove it or comment out that statement to avoid the parsing error.

```scala
Class myClass {

    def function1() = { 1 }

}
```

```scala
// /#/(%$"$%

Class myClass {

    def function1() = { 1 }

}
```

#### Additional recommendations

* Check that the code of the file is a valid Scala code.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1141

Message: org.apache.spark.sql.functions.stddev_pop has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.stddev_pop](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

Below is an example of the `org.apache.spark.sql.functions.stddev_pop` function, first used with a column name as an argument and then with a column object.

**Input**

```scala
val df = Seq(
  ("Alice", 23),
  ("Bob", 30),
  ("Carol", 27),
  ("David", 25),
).toDF("name", "age")

val result1 = df.select(stddev_pop("age"))
val result2 = df.select(stddev_pop(col("age")))
```

**Output**

The SMA adds the EWI `SPRKSCL1141` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Alice", 23),
  ("Bob", 30),
  ("Carol", 27),
  ("David", 25),
).toDF("name", "age")

/*EWI: SPRKSCL1141 => org.apache.spark.sql.functions.stddev_pop has a workaround, see documentation for more info*/
val result1 = df.select(stddev_pop("age"))
/*EWI: SPRKSCL1141 => org.apache.spark.sql.functions.stddev_pop has a workaround, see documentation for more info*/
val result2 = df.select(stddev_pop(col("age")))
```

**Recommended fix**

Snowpark has an equivalent [stddev_pop](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("Alice", 23),
  ("Bob", 30),
  ("Carol", 27),
  ("David", 25),
).toDF("name", "age")

val result1 = df.select(stddev_pop(col("age")))
val result2 = df.select(stddev_pop(col("age")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1110

> **Note:**
>
> This issue code has been **deprecated**

Message: Reader method not supported **\*method name\***.

Category: Warning

### Description

This issue appears when the SMA detects a method that is not supported by Snowflake in the DataFrameReader method chaining. Then, it might affects the migration of the reader statement.

#### Scenario

**Input**

Below is an example of a DataFrameReader method chaining where load method is not supported by Snowflake.

```scala
spark.read.
    format("net.snowflake.spark.snowflake").
    option("query", s"select * from $tablename")
    load()
```

**Output**

The SMA adds the EWI `SPRKSCL1110` to the output code to let you know that load method is not supported by Snowpark. Then, it might affects the migration of the reader statement.

```scala
session.sql(s"select * from $tablename")
/*EWI: SPRKSCL1110 => Reader method not supported .load()*/
```

**Recommended fix**

Check the Snowpark documentation for reader [here](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/DataFrameReader.html), in order to know the supported methods by Snowflake.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1100

This issue code has been **deprecated** since [Spark Conversion Core 2.3.22](../../../general/release-notes/README.md)

Message: Repartition is not supported.

Category: Parsing error.

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.DataFrame.repartition](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/Dataset.html) function, which is not supported by Snowpark. Snowflake manages the storage and the workload on the clusters making repartition operation inapplicable.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.DataFrame.repartition` function used to return a new `DataFrame` partitioned by the given partitioning expressions.

```scala
    var nameData = Seq("James", "Sarah", "Dylan", "Leila, "Laura", "Peter")
    var jobData = Seq("Police", "Doctor", "Actor", "Teacher, "Dentist", "Fireman")
    var ageData = Seq(40, 38, 34, 27, 29, 55)

    val dfName = nameData.toDF("name")
    val dfJob = jobData.toDF("job")
    val dfAge = ageData.toDF("age")

    val dfRepartitionByExpresion = dfName.repartition($"name")

    val dfRepartitionByNumber = dfJob.repartition(3)

    val dfRepartitionByBoth = dfAge.repartition(3, $"age")

    val joinedDf = dfRepartitionByExpresion.join(dfRepartitionByNumber)
```

**Output**

The SMA adds the EWI `SPRKSCL1100` to the output code to let you know that this function is not supported by Snowpark.

```scala
    var nameData = Seq("James", "Sarah", "Dylan", "Leila, "Laura", "Peter")
    var jobData = Seq("Police", "Doctor", "Actor", "Teacher, "Dentist", "Fireman")
    var ageData = Seq(40, 38, 34, 27, 29, 55)

    val dfName = nameData.toDF("name")
    val dfJob = jobData.toDF("job")
    val dfAge = ageData.toDF("age")

    /*EWI: SPRKSCL1100 => Repartition is not supported*/
    val dfRepartitionByExpresion = dfName.repartition($"name")

    /*EWI: SPRKSCL1100 => Repartition is not supported*/
    val dfRepartitionByNumber = dfJob.repartition(3)

    /*EWI: SPRKSCL1100 => Repartition is not supported*/
    val dfRepartitionByBoth = dfAge.repartition(3, $"age")

    val joinedDf = dfRepartitionByExpresion.join(dfRepartitionByNumber)
```

**Recommended Fix**

Since Snowflake manages the storage and the workload on the clusters making repartition operation inapplicable. This means that the use of repartition before the join is not required at all.

```scala
    var nameData = Seq("James", "Sarah", "Dylan", "Leila, "Laura", "Peter")
    var jobData = Seq("Police", "Doctor", "Actor", "Teacher, "Dentist", "Fireman")
    var ageData = Seq(40, 38, 34, 27, 29, 55)

    val dfName = nameData.toDF("name")
    val dfJob = jobData.toDF("job")
    val dfAge = ageData.toDF("age")

    val dfRepartitionByExpresion = dfName

    val dfRepartitionByNumber = dfJob

    val dfRepartitionByBoth = dfAge

    val joinedDf = dfRepartitionByExpresion.join(dfRepartitionByNumber)
```

#### Additional recommendations

* The [Snowflake’s architecture guide](https://docs.snowflake.com/en/user-guide/intro-key-concepts) provides insight about Snowflake storage management.
* Snowpark [Dataframe reference](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/dataframe) could be useful in how to adapt a particular scenario without the need of repartition.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1151

Message: org.apache.spark.sql.functions.var_samp has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.var_samp](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.var_samp` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(
  ("A", 10),
  ("A", 20),
  ("A", 30),
  ("B", 40),
  ("B", 50),
  ("B", 60)
).toDF("category", "value")

val result1 = df.groupBy("category").agg(var_samp("value"))
val result2 = df.groupBy("category").agg(var_samp(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1151` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("A", 10),
  ("A", 20),
  ("A", 30),
  ("B", 40),
  ("B", 50),
  ("B", 60)
).toDF("category", "value")

/*EWI: SPRKSCL1151 => org.apache.spark.sql.functions.var_samp has a workaround, see documentation for more info*/
val result1 = df.groupBy("category").agg(var_samp("value"))
/*EWI: SPRKSCL1151 => org.apache.spark.sql.functions.var_samp has a workaround, see documentation for more info*/
val result2 = df.groupBy("category").agg(var_samp(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [var_samp](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("A", 10),
  ("A", 20),
  ("A", 30),
  ("B", 40),
  ("B", 50),
  ("B", 60)
).toDF("category", "value")

val result1 = df.groupBy("category").agg(var_samp(col("value")))
val result2 = df.groupBy("category").agg(var_samp(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---

description: >-
The format of the reader on DataFrameReader method chaining is not one of the
defined by Snowpark.

---

## SPRKSCL1165

Message: Reader format on DataFrameReader method chaining can’t be defined

Category: Warning

### Description

This issue appears when the SMA detects that `format` of the reader in DataFrameReader method chaining is not one of the following supported for Snowpark: `avro`, `csv`, `json`, `orc`, `parquet` and `xml`. Therefore, the SMA can not determine if setting options are defined or not.

#### Scenario

**Input**

Below is an example of DataFrameReader method chaining where SMA can determine the format of reader.

```scala
spark.read.format("net.snowflake.spark.snowflake")
                 .option("query", s"select * from $tableName")
                 .load()
```

**Output**

The SMA adds the EWI `SPRKSCL1165` to the output code to let you know that `format` of the reader can not be determine in the giving DataFrameReader method chaining.

```scala
/*EWI: SPRKSCL1165 => Reader format on DataFrameReader method chaining can't be defined*/
spark.read.option("query", s"select * from $tableName")
                 .load()
```

**Recommended fix**

Check the Snowpark documentation [here](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions) to get more information about format of the reader.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1134

Message: org.apache.spark.sql.functions.log has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.log](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.log` function that generates this EWI.

```scala
val df = Seq(10.0, 20.0, 30.0, 40.0).toDF("value")
val result1 = df.withColumn("log_value", log(10, "value"))
val result2 = df.withColumn("log_value", log(10, col("value")))
val result3 = df.withColumn("log_value", log("value"))
val result4 = df.withColumn("log_value", log(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1134` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(10.0, 20.0, 30.0, 40.0).toDF("value")
/*EWI: SPRKSCL1134 => org.apache.spark.sql.functions.log has a workaround, see documentation for more info*/
val result1 = df.withColumn("log_value", log(10, "value"))
/*EWI: SPRKSCL1134 => org.apache.spark.sql.functions.log has a workaround, see documentation for more info*/
val result2 = df.withColumn("log_value", log(10, col("value")))
/*EWI: SPRKSCL1134 => org.apache.spark.sql.functions.log has a workaround, see documentation for more info*/
val result3 = df.withColumn("log_value", log("value"))
/*EWI: SPRKSCL1134 => org.apache.spark.sql.functions.log has a workaround, see documentation for more info*/
val result4 = df.withColumn("log_value", log(col("value")))
```

**Recommended fix**

Below are the different workarounds for all the overloads of the `log` function.

**1. def log(base: Double, columnName: String): Column**

You can convert the base into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function and convert the column name into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val result1 = df.withColumn("log_value", log(lit(10), col("value")))
```

**2. def log(base: Double, a: Column): Column**

You can convert the base into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function.

```scala
val result2 = df.withColumn("log_value", log(lit(10), col("value")))
```

**3.def log(columnName: String): Column**

You can pass `lit(Math.E)` as the first argument and convert the column name into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function and pass it as the second argument.

```scala
val result3 = df.withColumn("log_value", log(lit(Math.E), col("value")))
```

**4. def log(e: Column): Column**

You can pass `lit(Math.E)` as the first argument and the column object as the second argument.

```scala
val result4 = df.withColumn("log_value", log(lit(Math.E), col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1125

> **Warning:**
>
> This issue code is **deprecated** since [Spark Conversion Core 2.9.0](../../../general/release-notes/old-version-release-notes/sc-spark-scala-release-notes/README.md)

Message: org.apache.spark.sql.functions.count has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.count](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.count` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(
  ("Alice", "Math"),
  ("Bob", "Science"),
  ("Alice", "Science"),
  ("Bob", null)
).toDF("name", "subject")

val result1 = df.groupBy("name").agg(count("subject").as("subject_count"))
val result2 = df.groupBy("name").agg(count(col("subject")).as("subject_count"))
```

**Output**

The SMA adds the EWI `SPRKSCL1125` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Alice", "Math"),
  ("Bob", "Science"),
  ("Alice", "Science"),
  ("Bob", null)
).toDF("name", "subject")

/*EWI: SPRKSCL1125 => org.apache.spark.sql.functions.count has a workaround, see documentation for more info*/
val result1 = df.groupBy("name").agg(count("subject").as("subject_count"))
/*EWI: SPRKSCL1125 => org.apache.spark.sql.functions.count has a workaround, see documentation for more info*/
val result2 = df.groupBy("name").agg(count(col("subject")).as("subject_count"))
```

**Recommended fix**

Snowpark has an equivalent [count](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("Alice", "Math"),
  ("Bob", "Science"),
  ("Alice", "Science"),
  ("Bob", null)
).toDF("name", "subject")

val result1 = df.groupBy("name").agg(count(col("subject")).as("subject_count"))
val result2 = df.groupBy("name").agg(count(col("subject")).as("subject_count"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1174

Message: The single-parameter `udf` function is supported in Snowpark but it might require manual intervention. Please check the documentation to learn how to manually modify the code to make it work in Snowpark.

Category: Warning.

### Description

This issue appears when the SMA detects an use of the single-parameter [org.apache.spark.sql.functions.udf](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function in the code. Then, it might require a manual intervention.

The Snowpark API provides an equivalent [com.snowflake.snowpark.functions.udf](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that allows you to create a user-defined function from a lambda or function in Scala, however, there are some caveats about creating `udf` in Snowpark that might require you to make some manual changes to your code in order to make it work properly.

#### Scenarios

The Snowpark `udf` function should work as intended for a wide range of cases without requiring manual intervention. However, there are some scenarios that would requiere you to manually modify your code in order to get it work in Snowpark. Some of those scenarios are listed below:

##### Scenario 1

**Input**

Below is an example of creating UDFs in an object with the App Trait.

The Scala’s `App` trait simplifies creating executable programs by providing a `main` method that automatically runs the code within the object definition. Extending `App` delays the initialization of the fields until the `main` method is executed, which can affect the UDFs definitions if they rely on initialized fields. This means that if an object extends `App` and the `udf` references an object field, the `udf` definition uploaded to Snowflake will not include the initialized value of the field. This can result in `null` values being returned by the `udf`.

For example, in the following code the variable myValue will resolve to `null` in the `udf` definition:

```scala
object Main extends App {
  ...
  val myValue = 10
  val myUdf = udf((x: Int) => x + myValue) // myValue in the `udf` definition will resolve to null
  ...
}
```

**Output**

The SMA adds the EWI `SPRKSCL1174` to the output code to let you know that the single-parameter `udf` function is supported in Snowpark but it requires manual intervention.

```scala
object Main extends App {
  ...
  val myValue = 10
  /*EWI: SPRKSCL1174 => The single-parameter udf function is supported in Snowpark but it might require manual intervention. Please check the documentation to learn how to manually modify the code to make it work in Snowpark.*/
  val myUdf = udf((x: Int) => x + myValue) // myValue in the `udf` definition will resolve to null
  ...
}
```

**Recommended fix**

To avoid this issue, it is recommended to not extend `App` and implement a separate `main` method for your code. This ensure that object fields are initialized before `udf` definitions are created and uploaded to Snowflake.

```scala
object Main {
  ...
  def main(args: Array[String]): Unit = {
    val myValue = 10
    val myUdf = udf((x: Int) => x + myValue)
  }
  ...
}
```

For more details about this topic, see [Caveat About Creating UDFs in an Object With the App Trait](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-udfs#caveat-about-creating-udfs-in-an-object-with-the-app-trait).

##### Scenario 2

**Input**

Below is an example of creating UDFs in Jupyter Notebooks.

```scala
def myFunc(s: String): String = {
  ...
}

val myFuncUdf = udf((x: String) => myFunc(x))
df1.select(myFuncUdf(col("name"))).show()
```

**Output**

The SMA adds the EWI `SPRKSCL1174` to the output code to let you know that the single-parameter `udf` function is supported in Snowpark but it requires manual intervention.

```scala
def myFunc(s: String): String = {
  ...
}

/*EWI: SPRKSCL1174 => The single-parameter udf function is supported in Snowpark but it might require manual intervention. Please check the documentation to learn how to manually modify the code to make it work in Snowpark.*/
val myFuncUdf = udf((x: String) => myFunc(x))
df1.select(myFuncUdf(col("name"))).show()
```

**Recommended fix**

To create a `udf` in a Jupyter Notebook, you should define the implementation of your function in a class that extends `Serializable`. For example, you should manually convert it into this:

```scala
object ConvertedUdfFuncs extends Serializable {
  def myFunc(s: String): String = {
    ...
  }

  val myFuncAsLambda = ((x: String) => ConvertedUdfFuncs.myFunc(x))
}

val myFuncUdf = udf(ConvertedUdfFuncs.myFuncAsLambda)
df1.select(myFuncUdf(col("name"))).show()
```

For more details about how to create UDFs in Jupyter Notebooks, see [Creating UDFs in Jupyter Notebooks](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-udfs#creating-udfs-in-jupyter-notebooks).

#### Additional recommendations

* To learn more about how to create user-defined functions in Snowpark, please refer to the following documentation: [Creating User-Defined Functions (UDFs) for DataFrames in Scala](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-udfs)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1000

Message: Source project spark-core version is **\*version number\***, the spark-core version supported by snowpark is 2.12:3.1.2 so there may be functional differences between the existing mappings

Category: Warning

### Description

This issue appears when the SMA detects a version of the `spark-core` that is not supported by SMA. Therefore, there may be functional differences between the existing mappings and the output might have unexpected behaviors.

#### Additional recommendations

* The spark-core version supported by SMA is 2.12:3.1.2. Consider changing the version of your source code.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1140

Message: org.apache.spark.sql.functions.stddev has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.stddev](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.stddev` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Charlie", 20),
  ("David", 25),
).toDF("name", "score")

val result1 = df.select(stddev("score"))
val result2 = df.select(stddev(col("score")))
```

**Output**

The SMA adds the EWI `SPRKSCL1140` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Charlie", 20),
  ("David", 25),
).toDF("name", "score")

/*EWI: SPRKSCL1140 => org.apache.spark.sql.functions.stddev has a workaround, see documentation for more info*/
val result1 = df.select(stddev("score"))
/*EWI: SPRKSCL1140 => org.apache.spark.sql.functions.stddev has a workaround, see documentation for more info*/
val result2 = df.select(stddev(col("score")))
```

**Recommended fix**

Snowpark has an equivalent [stddev](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("Alice", 10),
  ("Bob", 15),
  ("Charlie", 20),
  ("David", 25),
).toDF("name", "score")

val result1 = df.select(stddev(col("score")))
val result2 = df.select(stddev(col("score")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1111

> **Note:**
>
> This issue code has been **deprecated**

Message: CreateDecimalType is not supported.

Category: Conversion error.

### Description

This issue appears when the SMA detects a usage [org.apache.spark.sql.types.DataTypes.CreateDecimalType](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/types/DecimalType.html) function.

#### Scenario

**Input**

Below is an example of usage of org.apache.spark.sql.types.DataTypes.CreateDecimalType function.

```scala
var result = DataTypes.createDecimalType(18, 8)
```

**Output**

The SMA adds the EWI `SPRKSCL1111` to the output code to let you know that CreateDecimalType function is not supported by Snowpark.

```scala
/*EWI: SPRKSCL1111 => CreateDecimalType is not supported*/
var result = createDecimalType(18, 8)
```

**Recommended fix**

There is not a recommended fix yet.

Message: Spark Session builder option is not supported.

Category: Conversion Error.

#### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.SparkSession.Builder.config](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/SparkSession$$Builder.html) function, which is setting an option of the Spark Session and it is not supported by Snowpark.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.SparkSession.Builder.config` function used to set an option in the Spark Session.

```scala
val spark = SparkSession.builder()
           .master("local")
           .appName("testApp")
           .config("spark.sql.broadcastTimeout", "3600")
           .getOrCreate()
```

**Output**

The SMA adds the EWI `SPRKSCL1104` to the output code to let you know config method is not supported by Snowpark. Then, it is not possible to set options in the Spark Session via config function and it might affects the migration of the Spark Session statement.

```scala
val spark = Session.builder.configFile("connection.properties")
/*EWI: SPRKSCL1104 => SparkBuilder Option is not supported .config("spark.sql.broadcastTimeout", "3600")*/
.create()
```

**Recommended fix**

To create the session is require to add the proper Snowflake Snowpark configuration.

In this example a configs variable is used.

```scala
    val configs = Map (
      "URL" -> "https://<myAccount>.snowflakecomputing.com:<port>",
      "USER" -> <myUserName>,
      "PASSWORD" -> <myPassword>,
      "ROLE" -> <myRole>,
      "WAREHOUSE" -> <myWarehouse>,
      "DB" -> <myDatabase>,
      "SCHEMA" -> <mySchema>
    )
    val session = Session.builder.configs(configs).create
```

Also is recommended the use of a configFile (profile.properties) with the connection information:

```properties
## profile.properties file (a text file)
URL = https://<account_identifier>.snowflakecomputing.com
USER = <username>
PRIVATEKEY = <unencrypted_private_key_from_the_private_key_file>
ROLE = <role_name>
WAREHOUSE = <warehouse_name>
DB = <database_name>
SCHEMA = <schema_name>
```

And with the `Session.builder.configFile` the session can be created:

```scala
val session = Session.builder.configFile("/path/to/properties/file").create
```

### Additional recommendations

* [Developer guide for create a session.](https://docs.snowflake.com/en/developer-guide/snowpark/scala/creating-session)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1101

This issue code has been **deprecated** since [Spark Conversion Core 2.3.22](../../../general/release-notes/README.md)

Message: Broadcast is not supported

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.broadcast](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which is not supported by Snowpark. This function is not supported because Snowflake does not support [broadcast variables](https://spark.apache.org/docs/latest/api/java/org/apache/spark/broadcast/Broadcast.html).

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.broadcast` function used to create a broadcast object to use on each Spark cluster:

```scala
    var studentData = Seq(
      ("James", "Orozco", "Science"),
      ("Andrea", "Larson", "Bussiness"),
    )

    var collegeData = Seq(
      ("Arts", 1),
      ("Bussiness", 2),
      ("Science", 3)
    )

    val dfStudent = studentData.toDF("FirstName", "LastName", "CollegeName")
    val dfCollege = collegeData.toDF("CollegeName", "CollegeCode")

    dfStudent.join(
      broadcast(dfCollege),
      Seq("CollegeName")
    )
```

**Output**

The SMA adds the EWI `SPRKSCL1101` to the output code to let you know that this function is not supported by Snowpark.

```scala
    var studentData = Seq(
      ("James", "Orozco", "Science"),
      ("Andrea", "Larson", "Bussiness"),
    )

    var collegeData = Seq(
      ("Arts", 1),
      ("Bussiness", 2),
      ("Science", 3)
    )

    val dfStudent = studentData.toDF("FirstName", "LastName", "CollegeName")
    val dfCollege = collegeData.toDF("CollegeName", "CollegeCode")

    dfStudent.join(
      /*EWI: SPRKSCL1101 => Broadcast is not supported*/
      broadcast(dfCollege),
      Seq("CollegeName")
    )
```

**Recommended fix**

Since Snowflake manages the storage and the workload on the clusters making broadcast objects inapplicable. This means that the use of broadcast could not be required at all, but each case should require further analysis.

The recommended approach is replace a Spark dataframe broadcast by a Snowpark regular dataframe or by using a dataframe method as [Join](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/api/snowflake.snowpark.DataFrame.join).

For the proposed input the fix is to adapt the join to use directly the dataframe `collegeDF` without the use of broadcast for the dataframe.

```scala
    var studentData = Seq(
      ("James", "Orozco", "Science"),
      ("Andrea", "Larson", "Bussiness"),
    )

    var collegeData = Seq(
      ("Arts", 1),
      ("Bussiness", 2),
      ("Science", 3)
    )

    val dfStudent = studentData.toDF("FirstName", "LastName", "CollegeName")
    val dfCollege = collegeData.toDF("CollegeName", "CollegeCode")

    dfStudent.join(
      dfCollege,
      Seq("CollegeName")
    ).show()
```

#### Additional recommendations

* The [Snowflake’s architecture guide](https://docs.snowflake.com/en/user-guide/intro-key-concepts) provides insight about Snowflake storage management.
* Snowpark [Dataframe reference](https://docs.snowflake.com/en/developer-guide/snowpark/reference/python/1.23.0/snowpark/dataframe) could be useful in how to adapt a particular broadcast scenario.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1150

Message: org.apache.spark.sql.functions.var_pop has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.var_pop](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.var_pop` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(
  ("A", 10.0),
  ("A", 20.0),
  ("A", 30.0),
  ("B", 40.0),
  ("B", 50.0),
  ("B", 60.0)
).toDF("group", "value")

val result1 = df.groupBy("group").agg(var_pop("value"))
val result2 = df.groupBy("group").agg(var_pop(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1150` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(
  ("A", 10.0),
  ("A", 20.0),
  ("A", 30.0),
  ("B", 40.0),
  ("B", 50.0),
  ("B", 60.0)
).toDF("group", "value")

/*EWI: SPRKSCL1150 => org.apache.spark.sql.functions.var_pop has a workaround, see documentation for more info*/
val result1 = df.groupBy("group").agg(var_pop("value"))
/*EWI: SPRKSCL1150 => org.apache.spark.sql.functions.var_pop has a workaround, see documentation for more info*/
val result2 = df.groupBy("group").agg(var_pop(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [var_pop](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(
  ("A", 10.0),
  ("A", 20.0),
  ("A", 30.0),
  ("B", 40.0),
  ("B", 50.0),
  ("B", 60.0)
).toDF("group", "value")

val result1 = df.groupBy("group").agg(var_pop(col("value")))
val result2 = df.groupBy("group").agg(var_pop(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---

description: >-
The parameter of org.apache.spark.sql.DataFrameReader.option function is not
defined.

---

## SPRKSCL1164

> **Note:**
>
> This issue code has been **deprecated**

Message: The parameter is not defined for org.apache.spark.sql.DataFrameReader.option

Category: Warning

### Description

This issue appears when the SMA detects that giving parameter of [org.apache.spark.sql.DataFrameReader.option](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameReader.html) is not defined.

#### Scenario

**Input**

Below is an example of undefined parameter for `org.apache.spark.sql.DataFrameReader.option` function.

```scala
spark.read.option("header", True).json(path)
```

**Output**

The SMA adds the EWI `SPRKSCL1164` to the output code to let you know that giving parameter to the org.apache.spark.sql.DataFrameReader.option function is not defined.

```scala
/*EWI: SPRKSCL1164 => The parameter header=True is not supported for org.apache.spark.sql.DataFrameReader.option*/
spark.read.option("header", True).json(path)
```

**Recommended fix**

Check the Snowpark documentation for reader format option [here](https://docs.snowflake.com/en/sql-reference/sql/create-file-format#format-type-options-formattypeoptions), in order to identify the defined options.

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1135

> **Warning:**
>
> This issue code is **deprecated** since [Spark Conversion Core 4.3.2](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.mean has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.mean](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.mean` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
val result1 = df.select(mean("value"))
val result2 = df.select(mean(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1135` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
/*EWI: SPRKSCL1135 => org.apache.spark.sql.functions.mean has a workaround, see documentation for more info*/
val result1 = df.select(mean("value"))
/*EWI: SPRKSCL1135 => org.apache.spark.sql.functions.mean has a workaround, see documentation for more info*/
val result2 = df.select(mean(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [mean](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(1, 3, 10, 1, 3).toDF("value")
val result1 = df.select(mean(col("value")))
val result2 = df.select(mean(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1115

> **Warning:**
>
> This issue code has been **deprecated** since [Spark Conversion Core Version 4.6.0](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.round has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.round](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.round` function that generates this EWI.

```scala
val df = Seq(3.9876, 5.673, 8.1234).toDF("value")
val result1 = df.withColumn("rounded_value", round(col("value")))
val result2 = df.withColumn("rounded_value", round(col("value"), 2))
```

**Output**

The SMA adds the EWI `SPRKSCL1115` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(3.9876, 5.673, 8.1234).toDF("value")
/*EWI: SPRKSCL1115 => org.apache.spark.sql.functions.round has a workaround, see documentation for more info*/
val result1 = df.withColumn("rounded_value", round(col("value")))
/*EWI: SPRKSCL1115 => org.apache.spark.sql.functions.round has a workaround, see documentation for more info*/
val result2 = df.withColumn("rounded_value", round(col("value"), 2))
```

**Recommended fix**

Snowpark has an equivalent [round](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a column object and a scale, you can convert the scale into a column object using the [com.snowflake.snowpark.functions.lit](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(3.9876, 5.673, 8.1234).toDF("value")
val result1 = df.withColumn("rounded_value", round(col("value")))
val result2 = df.withColumn("rounded_value", round(col("value"), lit(2)))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1144

Message: The symbol table could not be loaded

Category: Parsing error

### Description

This issue appears when there is a critical error in the SMA execution process. Since the symbol table cannot be loaded, the SMA cannot start the assessment or conversion process.

#### Additional recommendations

* This is unlikely to be an error in the source code itself, but rather is an error in how the SMA processes the source code. The best resolution would be to post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1170

> **Note:**
>
> This issue code has been **deprecated**

Message: sparkConfig member key is not supported with platform specific key.

Category: Conversion error

### Description

If you are using an older version, please upgrade to the latest.

#### Additional recommendations

* Upgrade your application to the latest version.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1121

Message: org.apache.spark.sql.functions.atan has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.atan](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.atan` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(1.0, 0.5, -1.0).toDF("value")
val result1 = df.withColumn("atan_value", atan("value"))
val result2 = df.withColumn("atan_value", atan(col("value")))
```

**Output**

The SMA adds the EWI `SPRKSCL1121` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(1.0, 0.5, -1.0).toDF("value")
/*EWI: SPRKSCL1121 => org.apache.spark.sql.functions.atan has a workaround, see documentation for more info*/
val result1 = df.withColumn("atan_value", atan("value"))
/*EWI: SPRKSCL1121 => org.apache.spark.sql.functions.atan has a workaround, see documentation for more info*/
val result2 = df.withColumn("atan_value", atan(col("value")))
```

**Recommended fix**

Snowpark has an equivalent [atan](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(1.0, 0.5, -1.0).toDF("value")
val result1 = df.withColumn("atan_value", atan(col("value")))
val result2 = df.withColumn("atan_value", atan(col("value")))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1131

Message: org.apache.spark.sql.functions.grouping has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.grouping](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.grouping` function, first used with a column name as an argument and then with a column object.

```scala
val df = Seq(("Alice", 2), ("Bob", 5)).toDF("name", "age")
val result1 = df.cube("name").agg(grouping("name"), sum("age"))
val result2 = df.cube("name").agg(grouping(col("name")), sum("age"))
```

**Output**

The SMA adds the EWI `SPRKSCL1131` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(("Alice", 2), ("Bob", 5)).toDF("name", "age")
/*EWI: SPRKSCL1131 => org.apache.spark.sql.functions.grouping has a workaround, see documentation for more info*/
val result1 = df.cube("name").agg(grouping("name"), sum("age"))
/*EWI: SPRKSCL1131 => org.apache.spark.sql.functions.grouping has a workaround, see documentation for more info*/
val result2 = df.cube("name").agg(grouping(col("name")), sum("age"))
```

**Recommended fix**

Snowpark has an equivalent [grouping](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq(("Alice", 2), ("Bob", 5)).toDF("name", "age")
val result1 = df.cube("name").agg(grouping(col("name")), sum("age"))
val result2 = df.cube("name").agg(grouping(col("name")), sum("age"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1160

> **Note:**
>
> This issue code has been **deprecated** since [Spark Conversion Core 4.1.0](../../../general/release-notes/README.md)

Message: org.apache.spark.sql.functions.sum has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.sum](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.sum` function that generates this EWI. In this example, the `sum` function is used to calculate the sum of selected column.

```scala
val df = Seq("1", "2", "3", "4", "5").toDF("elements")
val result1 = sum(col("elements"))
val result2 = sum("elements")
```

**Output**

The SMA adds the EWI `SPRKSCL1160` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq("1", "2", "3", "4", "5").toDF("elements")
/*EWI: SPRKSCL1160 => org.apache.spark.sql.functions.sum has a workaround, see documentation for more info*/
val result1 = sum(col("elements"))
/*EWI: SPRKSCL1160 => org.apache.spark.sql.functions.sum has a workaround, see documentation for more info*/
val result2 = sum("elements")
```

**Recommended fix**

Snowpark has an equivalent [sum](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

```scala
val df = Seq("1", "2", "3", "4", "5").toDF("elements")
val result1 = sum(col("elements"))
val result2 = sum(col("elements"))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1154

Message: org.apache.spark.sql.functions.ceil has a workaround, see documentation for more info

Category: Warning

### Description

This issue appears when the SMA detects a use of the [org.apache.spark.sql.functions.ceil](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/functions$.html) function, which has a workaround.

#### Scenario

**Input**

Below is an example of the `org.apache.spark.sql.functions.ceil` function, first used with a column name as an argument, then with a column object and finally with a column object and a scale.

```scala
val df = Seq(2.33, 3.88, 4.11, 5.99).toDF("value")
val result1 = df.withColumn("ceil", ceil("value"))
val result2 = df.withColumn("ceil", ceil(col("value")))
val result3 = df.withColumn("ceil", ceil(col("value"), lit(1)))
```

**Output**

The SMA adds the EWI `SPRKSCL1154` to the output code to let you know that this function is not fully supported by Snowpark, but it has a workaround.

```scala
val df = Seq(2.33, 3.88, 4.11, 5.99).toDF("value")
/*EWI: SPRKSCL1154 => org.apache.spark.sql.functions.ceil has a workaround, see documentation for more info*/
val result1 = df.withColumn("ceil", ceil("value"))
/*EWI: SPRKSCL1154 => org.apache.spark.sql.functions.ceil has a workaround, see documentation for more info*/
val result2 = df.withColumn("ceil", ceil(col("value")))
/*EWI: SPRKSCL1154 => org.apache.spark.sql.functions.ceil has a workaround, see documentation for more info*/
val result3 = df.withColumn("ceil", ceil(col("value"), lit(1)))
```

**Recommended fix**

Snowpark has an equivalent [ceil](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function that receives a column object as an argument. For that reason, the Spark overload that receives a column object as an argument is directly supported by Snowpark and does not require any changes.

For the overload that receives a string argument, you can convert the string into a column object using the [com.snowflake.snowpark.functions.col](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function as a workaround.

For the overload that receives a column object and a scale, you can use the [callBuiltin](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/functions$.html) function to invoke the Snowflake builtin [CEIL](https://docs.snowflake.com/en/sql-reference/functions/ceil) function. To use it, you should pass the string **“ceil”** as the first argument, the column as the second argument and the scale as the third argument.

```scala
val df = Seq(2.33, 3.88, 4.11, 5.99).toDF("value")
val result1 = df.withColumn("ceil", ceil(col("value")))
val result2 = df.withColumn("ceil", ceil(col("value")))
val result3 = df.withColumn("ceil", callBuiltin("ceil", col("value"), lit(1)))
```

#### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKSCL1105

This issue code has been **deprecated**

Message: Writer format value is not supported.

Category: Conversion Error

### Description

This issue appears when the [org.apache.spark.sql.DataFrameWriter.format](https://spark.apache.org/docs/latest/api/scala/org/apache/spark/sql/DataFrameWriter.html) has an argument that is not supported by Snowpark.

#### Scenarios

There are some scenarios depending on the type of format you are trying to save. It can be a `supported`, or `non-supported` format.

##### Scenario 1

**Input**

The tool analyzes the type of format that is trying to save, the supported formats are:

* `csv`
* `json`
* `orc`
* `parquet`
* `text`

```scala
    dfWrite.write.format("csv").save(path)
```

**Output**

The tool transforms the `format` method into a `csv` method call when save function has one parameter.

```scala
    dfWrite.write.csv(path)
```

**Recommended fix**

In this case, the tool does not show the EWI, meaning there is no fix necessary.

##### Scenario 2

**Input**

The below example shows how the tool transforms the `format` method when passing a `net.snowflake.spark.snowflake` value.

```scala
dfWrite.write.format("net.snowflake.spark.snowflake").save(path)
```

**Output**

The tool shows the EWI `SPRKSCL1105` indicating that the value `net.snowflake.spark.snowflake` is not supported.

```scala
/*EWI: SPRKSCL1105 => Writer format value is not supported .format("net.snowflake.spark.snowflake")*/
dfWrite.write.format("net.snowflake.spark.snowflake").save(path)
```

**Recommended fix**

For the `not supported` scenarios there is no specific fix since it depends on the files that are trying to be read.

##### Scenario 3

**Input**

The below example shows how the tool transforms the `format` method when passing a `csv`, but using a variable instead.

```scala
val myFormat = "csv"
dfWrite.write.format(myFormat).save(path)
```

**Output**

Since the tool can not determine the value of the variable in runtime, shows the EWI `SPRKSCL1163` indicating that the value is not supported.

```scala
val myFormat = "csv"
/*EWI: SPRKSCL1163 => format_type is not a literal and can't be evaluated*/
dfWrite.write.format(myFormat).load(path)
```

**Recommended fix**

As a workaround, you can check the value of the variable and add it as a string to the `format` call.

#### Additional recommendations

* The Snowpark location only accepts cloud locations using a [snowflake stage](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage).
* The documentation of methods supported by Snowpark can be found in the [documentation](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/2.12/com/snowflake/snowpark/DataFrameWriter.html)
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---
title: Snowpark Migration Accelerator: Migration Lab
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/migration-lab/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Migration Lab

> **Note:**
>
> This is also part of [the Snowflake End-to-End Migration Quickstart](https://quickstarts.snowflake.com/guide/end2endmigration/index.html?index=..%2F..index#0) available in the Snowflake quickstarts.

Moving the logic and data in a data warehouse is essential to getting an operational database on a new platform. But to take advantage of the new platform in a functional way, any pipelines running moving data in or out of that data platform need to be repointed or replatformed as well. This can often be challenging as there are usually a variety of pipelines being used. This HoL will focus on just one for which Snowflake can provide some acceleration. But note that new ETL and pipeline accelerators are constantly being developed.

Let’s talk about the pipeline and the notebook we are moving in this scenario. As a reminder, this a SQL Server database migration, but scoped to a Proof of Concept. A small data mart in SQL Server has been moved by AdventureWorks to Snowflake. There is a basic pipeline script and a reporting notebook that AdventureWorks has included as part of this POC. Here is a summary of each artifact:

* The pipeline script is written in Python using Spark. This script is reading an accessible file generated by an older POS system in a local directory at regular intervals run by an orchestration tool. (Something like Airflow, but the orchestration is not part of the POC, so we’re not 100% sure what it is.)
* The notebook is a reporting notebook that reads from the existing SQL Server database and reports on a few summary metrics.

Neither of these are too complex, but both are just the tip of the iceberg. There are hundreds more pipeline scripts and notebooks related to other data marts. This POC will just move these two.

Both of these use Spark and access the SQL Server database. So our goal is essentially to move the operations in Spark into Snowpark. Let’s see how we would do this using [the Snowpark Migration Accelerator (SMA)](https://www.snowflake.com/en/migrate-to-the-cloud/migration-accelerator/). The SMA is a sister tool to SnowConvert and is built on the same foundation. We are going to walk through many steps (most of which will be similar to what we did with SnowConvert), but note that we are still essentially working through the same assessment -> conversion -> validation flow that we have already walked through.

## Notes on this Lab Environment

This lab uses the Snowpark Migration Accelerator and the Snowflake VS Code Extension. But to make the most of this, you will need to run Python with a PySpark. The simplest way to start this would be to start an environment with [the anaconda distribution](https://www.anaconda.com/docs/getting-started/anaconda/main). This will have most of the packages needed to run the code in this lab.

You will still need to make available the following resources:

* Python Libraries

  + [PySpark](https://pypi.org/project/pyspark/)
  + [Snowpark Python](https://pypi.org/project/snowflake-snowpark-python/)
  + [Snowflake](https://pypi.org/project/snowflake/)
* VS Code Extensions

  + [Snowflake](https://marketplace.visualstudio.com/items?itemName=snowflake.snowflake-vsc)
  + [Python](https://marketplace.visualstudio.com/items?itemName=ms-python.python)
  + [Jupyter](https://marketplace.visualstudio.com/items?itemName=ms-toolsai.jupyter)
* Other

  + [PySpark JDBC Driver of SQL Server](https://learn.microsoft.com/en-us/sql/connect/jdbc/download-microsoft-jdbc-driver-for-sql-server?view=sql-server-ver17)

Having said all of this, you can still run this lab with just a Snowflake account, the SMA, and the Snowflake VS Code Extension. You will not be able to run everything (particularly, the source code), but you will be able to use all of the converted elements in Snowflake.

Now let’s get started by assessing what we have.

---
title: Snowpark Migration Accelerator: Notebook Conversion
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/migration-lab/notebook-conversion.md
section: Migrations
---

# Snowpark Migration Accelerator: Notebook Conversion

Let’s step over to the Reporting Notebook in our codebase: **Basic Reporting Notebook - SqlServer Spark.ipynb**. We’re going to walk through a similar set of steps as we did with the pipeline script.

* **Resolve All Issues**: “Issues” here means the issues generated by the SMA. Take a look at the output code. Resolve parsing errors and conversion errors, and investigate warnings.
* **Resolve the session calls**: How the session call is written in the output code depends on where we are going to run the file. We will resolve this for running the code file(s) in the same location as they were originally going to be run, and then for running them in Snowflake.
* **Resolve the Input/Outputs**: Connections to different sources cannot be resolved entirely by the SMA. There are differences in the platforms, and the SMA will usually disregard this. This also is affected by where the file is going to be run.
* **Clean up and Test**! Let’s run the code. See if it works. We will be smoke testing in this lab, but there are tools to do more extensive testing and data validation including Snowpark Python Checkpoints.

Let’s get started.

## Resolve All Issues

Let’s go ahead and look at the issues present in the notebook.

(Note that you can open the notebook in VS Code, but to view it appropriately, you may want to install the Jupyter extension for VS Code. Alternatively, you could open this in Jupyter, but Snowflake still recommends VS Code with the Snowflake extension installed).

You can use the compare feature to view both of these side by side as we did with the pipeline file, though it will look more like a json if you do so:

Not that there are only two unique EWI’s in this notebook. You can return to the search bar to find them, but since this is so short, you could also just… scroll down. These are the unique issues:

* **SPRKPY1002** => *pyspark.sql.readwriter.DataFrameReader.jdbc is not supported*. This is a similar issue to the one we saw in the pipeline file, but that was a write call. This is a read call to the SQL Server database. We will resolve this in a bit.
* **SPRKPY1068** => *“pyspark.sql.dataframe.DataFrame.toPandas is not supported if there are columns of type ArrayType, but it has a workaround. See documentation for more info.* This is another warning. If we pass an array to this function in Snowpark, it may not work. Let’s keep an eye on this when we test it.

And that’s it for the notebook… and our issues. We resolved a parsing error, recognized that we will have to fix the input/outputs, and there’s a couple of potential functional differences we should keep an eye on. Let’s move on to the next step: resolving any session calls.

## Resolve the Session Calls

To update the session calls in the reporting notebook, we need to locate the cell with the session call in it. That looks like this:

Now let’s do what we already did for our pipeline file:

* Change all references to the “spark” session variable to “session” (note that this is throughout the notebook)
* Remove the config function with the spark driver.

The before and after on this will look like this:

```python
# Old Session
spark = Session.builder.config('spark.driver.extraClassPath', driver_path).app_name("AdventureWorksSummary", True).getOrCreate()
spark.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":7,"minor":4,"patch":10},"attributes":{"language":"Python"}})

# New Session
# Session
session = Session.builder.app_name("AdventureWorksSummary", True).getOrCreate()
session.update_query_tag({"origin":"sf_sit","name":"sma","version":{"major":7,"minor":4,"patch":10},"attributes":{"language":"Python"}})
```

Note that there is other code in this cell. This code:

```python
url = sql_server_url
properties = {'user' : sql_server_user, 'password' : sql_server_password}
# Spark dataframe.
#EWI: SPRKPY1002 => pyspark.sql.readwriter.DataFrameReader.jdbc is not supported
df = session.read.jdbc(url = url, table = 'dbo.DimCustomer', properties = properties)
print('Session successfully setup.')
```

We’re almost ready to take on the read statement, but we’re not there yet. Let’s just move all of this to another cell. Create a new cell below this one, and move this code to that cell. It will look like this:

Is this all we need for the session call? No. Recall (and possibly review) the previous page under [Notes on Session Calls](pipeline-conversion.md). You will either need to make sure that your connection.toml file has your connection information or you will need to explicitly specify the connection parameters you intend to use in the session.

## Resolving the Inputs/Outputs

So let’s resolve our inputs and outputs now. Note that this is going to diverge based on whether you’re running the files locally or Snowflake, but for the notebook, everything can be run locally or in Snowflake. The code will be a bit simpler as we won’t even need to call a session. We’ll just… get the active session. As with the pipeline file, we’ll do this in two parts: to be run/orchestrated locally, and to be run in Snowflake.

Working through the inputs and outputs in the reporting notebook will be considerably simpler than it was for the pipeline. There is no reading from a local file or moving data between files. There is simply a read from a table in SQL Server that is now a read from a table in Snowflake. Since we will not be accessing SQL Server, we can ditch any reference to the SQL Server properties. And the read statement can be replaced by a table statement in Snowflake. The before and after for this cell should look like this:

```python
# Before
url = sql_server_url
properties = {'user' : sql_server_user, 'password' : sql_server_password}
# Spark dataframe.
#EWI: SPRKPY1002 => pyspark.sql.readwriter.DataFrameReader.jdbc is not supported
df = session.read.jdbc(url = url, table = 'dbo.DimCustomer', properties = properties)
print('Session successfully setup.')
```

```python
# After
# New table call
# Snowpark Dataframe table.
df = session.table('ADVENTUREWORKS.DBO.DIMCUSTOMER')
print('Table loaded successfully.')
df.show()
```

That’s actually… it. Let’s move on to the Clean up and test part of the notebook file.

## Clean Up and Test

Let’s do some clean up (like we did previously for the pipeline file). We never looked at our import calls and we have config files that are not necessary at all. Let’s start by removing the references to the config files. This will be each of the cells between the import statements and the session call.

Now let’s look at our imports. The reference to the os can be deleted. (Seems like that wasn’t used in the original file either…) There is a pandas reference. There does not appear to be any usages of pandas in this notebook anymore now that the config files are referenced. There is a toPandas reference as part of the Snowpark dataframe API in the reporting section, but that’s not part of the pandas library.

You can optionally replace all of the import calls to pandas with the modin pandas library. This library will optimize pandas dataframes to take advantage of Snowflake’s powerhouse computing. This change would look like this:

```python
# Old
import pandas as pd

# New
import modin.pandas as pd
import snowflake.snowpark.modin.plugin
```

Having said that, we can delete that one as well. Note that the SMA has replaced any spark specific import statements with those related to Snowpark. The final import cell would look like this:

And that’s it for our cleanup. We still have a couple of EWIs in the reporting and visualization cells, but it looks like we should make it. Let’s run this one and see if we get an output.

And we did. The reports seem to match what was output by the Spark Notebook. Even though the reporting cells seemed complex, Snowpark is able to work with them. The SMA let us know there could be an issue, but there doesn’t appear to be any problems. More testing would help, but our first round of smoke testing has passed.

Now let’s look at this notebook in Snowsight. Unlike the pipeline file, we can do this entirely in Snowsight.

## Running the Notebook in Snowsight

Let’s take the version of the notebook that we have right now (having worked through the issues, the session calls, and the inputs and outputs) and load it into Snowflake. To do this, go to the notebooks section in SnowSight:

And select down arrow next to the +Notebook button in the top right, and select “Import .ipynb file” (shown above).

Once this has been imported, choose the notebook file that we have been working with in the output directory created by the SMA in your project folder.

There will be a create notebook dialog window that opens. For this upload, we will choose the following options:

* Notebook location:

  + Database: **ADVENTUREWORKS**
  + Schema: **DBO**
* Python environment: **Run on warehouse**

  + This is not a large notebook with a bunch of ml. This is a basic reporting notebook. We can run this on a warehouse.
* Query warehouse: **DEFAULT_WH**
* Notebook warehouse: **DEFAULT_WH** (you can leave it as the system chosen warehouse (will be a streamlit warehouse)… for this notebook, it will not matter)

You can see these selections below:

This should load your notebook into Snowflake and it will look something like this:

There are a couple of quick checks/changes we need to make from the version we just tested locally in order to ensure that the notebook runs in Snowsight:

* Change the session calls to retrieve the active session
* Ensure any dependent libraries we need to install are available

Let’s start with the first one. It may seem odd to alter the session call again after we spent so much time on it in the first place, but we’re running inside of Snowflake now. You can remove anything associated with reading the session call and replacing it with the “get_active_session” call that is standard at the top of most Snowflake notebooks:

```python
//# Old for Jupyter
session = Session.builder.app_name("AdventureWorksSummary", True).getOrCreate()

# New for Snowsight
from snowflake.snowpark.context import get_active_session
session = get_active_session()
```

We don’t need to specify connection parameters or update a .toml file because we are already connected. we are in Snowflake.

Let’s replace the old code in the cell with the new code. That will look something like this:

Now let’s address the available packages for this run, but instead of us figuring out what we need to add. Let’s let Snowflake. One of the better parts of using a notebook is that we can run individual cells and see what the results are. Let’s run our import library cell.

If you haven’t already, go ahead and start the session by clicking in the top right corner of the screen where it says “Start”:

If you run the topmost cell in the notebook, and you will likely discover that matplotlib is not loaded into the session:

This is a pretty important one for this notebook. You can add that library to your notebook/session by using the “Packages” option in the top right of the notebook:

Search for **matplotlib**, and select it. This will make this package available in the session.

Once you load this library, you will have to restart the session. Once you have restarted the session, run that first cell again. You will likely be told that it was a success this time.

With the packages loaded, the session fixed, and the rest of the issues in the code already resolved, what can we do to check the rest of the notebook? Run it! You can run all the cells in the notebook by selecting “Run all” in the top right corner of the screen, and see if we get any errors.

It looks like there was a successful run:

If you compare the two notebooks execution, it looks like the only difference is that the Snowflake version put all of the output datasets first followed by the images, whereas they are intermixed in the Spark Jupyter Notebook:

Note that this difference is not an API difference, but rather a difference in how notebooks in Snowflake orchestrate this. This is likely a difference AdventureWorks is willing to accept!

## Conclusions

By utilizing the SMA, we were able to accelerate the migration of both a data pipeline and a reporting notebook. The more of each that you have, the more value a tool like the SMA can provide.

And let’s go back to the assessment -> conversion -> validation flow that we have consistently come back to. In this migration, we:

* Setup out project in the SMA
* Ran SMA’s assessment and conversion engine on the code files
* Reviewed the output reporting from the SMA to better understand what we have
* Review what could not be converted by the SMA in VS Code
* Resolve issues and errors
* Resolve session references
* Resolve input/output references
* Run the code locally
* And run the code in Snowflake
* Ran the newly migrated scripts and validated their success

Snowflake has spent a great deal of time improving its ingestion and data engineering capabilities, just as it has spent time improving migration tools like SnowConvert, the SnowConvert Migration Assistant, and the Snowpark Migration Accelerator. Each of these will continue to improve. Please feel free to reach out if you have any suggestions for migration tooling. These teams are always looking for additional feedback to improve the tools.

---
title: Snowpark Migration Accelerator: Notes on Code Preparation
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/walkthrough-setup/notes-on-code-preparation.md
section: Migrations
---

# Snowpark Migration Accelerator: Notes on Code Preparation

Before running Snowpark Migration Accelerator (SMA), make sure all your source code files are located on the computer where you installed SMA. You don’t need to connect to any source database or Spark environment since SMA only performs code analysis.

The source code must be in a readable format for SMA to process it correctly, as the tool relies completely on the source code you provide.

## Extraction

Before running the Snowpark Migration Accelerator (SMA), organize all your source code files into a single main folder. You can maintain your existing subfolder structure within this main folder, but all code files must be located under this one directory. This requirement applies to:

The following file types are supported:

* GitHub repositories (downloaded as ZIP files and extracted to your local machine)
* Python script files
* Scala project files
* Databricks notebook files
* Jupyter notebooks run on your local computer

Before starting your migration, gather all source code files into a single main folder. While your source code may come from different locations, having it organized in one place will make the migration process more efficient. If you already have an established file organization structure, keep it intact within the main folder.

[Export GitHub repositories to ZIP files](https://docs.github.com/en/repositories/working-with-files/using-files/downloading-source-code-archives)

To generate accurate and complete reports using the Snowpark Migration Accelerator (SMA), scan only the code that is relevant to your migration project. Rather than scanning all available code, identify and include only the essential code files that you plan to migrate. For more information, refer to Size in the Considerations section.

## Considerations

Let’s review which file types are compatible with Snowpark Migration Accelerator (SMA) and understand the key considerations when preparing your source code for analysis with SMA.

### Filetypes

The Snowpark Migration Accelerator (SMA) examines all files in your source directory, but only processes files with specific extensions that may contain Spark API code. This includes both regular code files and Jupyter notebooks.

You can find a list of file types that SMA supports in the [Supported Filetypes section of this documentation](../../../user-guide/before-using-the-sma/supported-filetypes.md).

### Exported Files

If your code is stored in a source control platform instead of local files, you need to export it into a format that SMA can process. Here’s how you can export your code:

For Databricks users: To use the Snowpark Migration Accelerator (SMA), you need to export your notebooks to .dbc format. You can find detailed instructions on how to export notebooks in [the Databricks documentation on exporting notebooks](https://docs.databricks.com/en/notebooks/notebook-export-import.html#export-notebooks.).

Need help exporting files? Visit [the export scripts in the Snowflake Labs Github repo](https://github.com/Snowflake-Labs/SC.DDLExportScripts/tree/main), where Snowflake Professional Services maintains scripts for Databricks, Hive, and other platforms.

* If you are using a different platform, please refer to the [Code Extraction page](../../../user-guide/before-using-the-sma/code-extraction.md) for specific instructions for your platform. If you need assistance converting your code into a format that works with SMA, please contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

### Size

The Snowpark Migration Accelerator (SMA) is designed to analyze source code, not data. To ensure optimal performance and prevent system resource exhaustion, we recommend:

1. Only include the specific code files you want to migrate
2. Avoid including unnecessary library dependencies

While you can include dependent library code files, doing so will significantly increase processing time without adding value, since SMA specifically focuses on identifying Spark code that requires migration.

We recommend gathering all code files that…

* Run automatically as part of a scheduled process
* Were used to create or configure that process (if they are separate)
* Are custom libraries created by your organization that are used in either of the above scenarios

You don’t need to include code for common third-party libraries such as Pandas or Sci-Kit Learn. The tool will automatically detect and catalog these library references without requiring their source code.

### Does it run?

The Snowpark Migration Accelerator (SMA) can only process complete and syntactically correct source code. Your code must be able to run successfully in [a supported source platform](../../../user-guide/before-using-the-sma/supported-platforms.md). If the SMA reports multiple [parsing errors](../../../issue-analysis/issue-code-categorization.md), this usually indicates that your source code contains syntax errors. To achieve the best results, ensure that your input directory contains only valid code that can be executed on the source platform.

### Use Case

Understanding your codebase’s purpose is essential when reviewing scan results. It will help you:

1. Determine which applications or processes may not work well with Snowpark
2. Understand and analyze readiness assessment results more effectively
3. Check if your existing code and systems are compatible with Snowflake

When scanning a notebook that uses an unsupported SQL dialect and a database connector without Spark, the SMA will only display imported third-party libraries. While this information is helpful, the notebook will not receive a Spark API Readiness Score. Understanding how you plan to use your code will help you better understand these limitations and make better decisions during migration.

### Exports from Databricks Notebooks

Databricks notebooks support multiple programming languages such as SQL, Scala, and PySpark in a single notebook. When you export a notebook, the file extension will reflect its primary language:

* Python notebooks: .ipynb or .py
* SQL notebooks: .sql

Any code written in a language different from the notebook’s primary language will be automatically converted to comments during export. For instance, if you include SQL code in a Python notebook, the SQL code will appear as comments in the exported file.

Code comments are excluded from SMA analysis. To ensure your code is properly analyzed, place it in a file with the correct file extension matching the source language. For example:

* Python code should be in .py files
* SQL code should be in .sql files

Note that even uncommented code will not be analyzed if it’s in a file with the wrong extension (such as Python code in a .sql file).

Before using the tool, please read the [Pre-Processing Considerations](../../../user-guide/before-using-the-sma/pre-processing-considerations.md) section in our documentation. This section contains essential information that you need to know before proceeding.

## Walkthrough Codebase

Select one of the extracted sample codebase directories as the input for the Snowpark Migration Accelerator (SMA).

When migrating code, maintain your original folder structure. This preserves file organization and helps developers understand the code architecture. Both the code conversion process and assessment analysis are performed one file at a time.

For this tutorial, we will work with small, functional Spark code samples (each less than 1MB). These samples showcase different scenarios and functions that can be converted. Although these examples are simplified versions and not production code, they effectively demonstrate various conversion possibilities.

The source directory can contain Jupyter notebooks (.ipynb), Python scripts (.py), and text files. While SMA examines all files in your codebase, it only searches for Spark API references in Python (.py) files and Jupyter notebook (.ipynb) files.

---
title: Snowpark Migration Accelerator: Old Version Release Notes
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/old-version-release-notes/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Old Version Release Notes

The Snowpark Migration Accelerator (SMA) has published release notes, but before the SMA existed, SnowConvert for Spark existed. You can find release notes and version information for old versions of the tool on the following pages.

* [SnowConvert for Spark Scala Release Notes](sc-spark-scala-release-notes/README.md)
* [SnowConvert for Spark Python Release Notes](sc-spark-python-release-notes/README.md)

These release notes may be a bit difficult to navigate, so please utilize the release notes from the current version of the SMA.

As always, if you have any comments, please reach out to [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Snowpark Migration Accelerator: Optional Technical Discovery
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/project-overview/optional-technical-discovery.md
section: Migrations
---

# Snowpark Migration Accelerator: Optional Technical Discovery

## What is Technical Discovery?

*Technical discovery* is an optional questionnaire within the Snowpark Migration Accelerator (SMA). With it, you can gather high-level information about your workload, which is a type of information that code analysis alone cannot detect.

## Why is it important?

While code analysis can identify transformations and logic, it lacks operational context. To provide accurate migration recommendations and a relevant suggested architecture, the tool needs to understand details like:

* Your technology stack (cloud platform, regions, tool versions).
* Your data ecosystem (external sources, governance tools, ETL processes, etc.).

By providing this information, you can allow the AI assistant to offer a much more complete and accurate assessment.

## How is this information used?

The answers you provide are linked directly to your assessment results and are used to:

* Categorize your workload.
* Inform the AI assistant.

## Is it mandatory?

No. This feature is optional. If you choose to skip the questionnaire, the assessment and recommendation features will still function, based solely on the data collected from the code analysis.

---
title: Snowpark Migration Accelerator: Output Logs
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-logs.md
section: Migrations
---

# Snowpark Migration Accelerator: Output Logs

The Snowpark Migration Accelerator (SMA) creates detailed log files during its execution. These logs track the tool’s operations and are valuable resources for troubleshooting when issues occur.

The SMA generates three log files during execution. Each log file has a .log extension and includes the date in its filename. These files are continuously updated while SMA is running.

* **Controller-Log**: Displays basic information about the SMA execution, including the session ID and summary metrics. This log is completed when the tool finishes running.
* **Generic-Scanner-Log**: Shows basic scanning information from the initial tool execution, including session ID, execution ID, and scanner completion status.
* **SparkSnowConvert-Log**: The main log file produced by the SMA (previously known as SnowConvert for Spark). It records detailed information about:

  + Tool execution steps
  + Error messages (both execution and conversion errors)
  + Troubleshooting information for failed executions

The log is completed when the tool finishes running.

If you are experiencing issues with the Snowpark Migration Accelerator (SMA) and need to analyze the logs, visit [the Support section](../../support/general-troubleshooting/README.md) of our documentation or contact our support team directly at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Snowpark Migration Accelerator: Overview
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/overview.md
section: Migrations
---

# Snowpark Migration Accelerator: Overview

If you have already [downloaded](../general/getting-started/download-and-access.md) and [installed](../general/getting-started/installation/README.md) the Snowpark Migration Accelerator (SMA), this section will show you how to use it effectively.

If you haven’t downloaded and installed the software yet, or if you’re unsure where to begin, visit the [Getting Started page](../general/getting-started/README.md) for step-by-step instructions.

The following sections will explain these topics:

* [Before using the SMA](before-using-the-sma/README.md) - Learn about the prerequisites and requirements needed to ensure the tool works effectively.
* [Project Overview](project-overview/README.md) - Understand how projects work in SMA and learn how to create and configure them.
* [Optional Technical Discovery](project-overview/optional-technical-discovery.md) - Gather high-level information about your workload.
* [Assessment Overview](assessment/README.md) - Learn how to use the tool to analyze your source code and generate an assessment report.
* [Conversion Overview](snowpark-api-conversion/README.md) - Learn how to use the tool to convert your source code into the target format.

Let’s explore the key requirements needed for successful execution.

---
title: Snowpark Migration Accelerator: Pipeline Conversion
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/migration-lab/pipeline-conversion.md
section: Migrations
---

# Snowpark Migration Accelerator: Pipeline Conversion

The SMA has “converted” our scripts, but has it really? What it has actually done is converted all references from the Spark API to the Snowpark API, but what it has not done is to replace the connections that may exist in your pipelines.

The SMA’s power is in the assessment reporting that it does as the conversion is tied to converting references from the Spark API to the Snowpark API. Note that the conversion of these references will not be enough to run any data pipeline. You will have to ensure that the pipeline’s connections are resolved manually. The SMA cannot assume to know connection parameters or other elements that are likely not available to be run through it.

As with any conversion, dealing with the converted code can be done in a variety of ways. The following steps are how we would **recommend** that you approach the output of the conversion tool. Like SnowConvert, the SMA requires attention to be paid to the output. No conversion will ever be 100% automated. This is particularly true for the SMA. Since the SMA is converting references from the Spark API to the Snowpark API, you will always need to check how those references are being run. It does not attempt to orchestrate the successful execution of any script or notebook run through it.

So we’ll follow these steps to work through the output of the SMA that will be slightly different than SnowConvert:

* **Resolve All Issues**: “Issues” here means the issues generated by the SMA. Take a look at the output code. Resolve parsing errors and conversion errors, and investigate warnings.
* **Resolve the session calls**: How the session call is written in the output code depends on where we are going to run the file. We will resolve this for running the code file(s) in the same location as they were originally going to be run, and then for running them in Snowflake.
* **Resolve the Input/Outputs**: Connections to different sources cannot be resolved entirely by the SMA. There are differences in the platforms, and the SMA will usually disregard this. This also is affected by where the file is going to be run.
* **Clean up and Test**! Let’s run the code. See if it works. We will be smoke testing in this lab, but there are tools to do more extensive testing and data validation including Snowpark Python Checkpoints.

So let’s take a look at what this looks like. We’re going to do this with two approaches: the first approach is to run this in Python on the local machine (as the source script is running). The second would be to do everything in Snowflake… in Snowsight, but for a data pipeline reading from a local source, this will not be 100% possible in Snowsight. That’s ok though. We are not converting the orchestration of this script in this POC.

Let’s start with the pipeline script file, and get to the notebook in the next section.

## Resolve Issues

Let’s open our source and our output code in a code editor. You can use any code editor of your choice, but as has been mentioned multiple times, Snowflake would recommend using **VS Code with the Snowflake Extension**. Not only does the Snowflake Extension help navigate through the issues from SnowConvert, but can also run **Snowpark Checkpoints** for Python, which would help with testing and root cause analysis (though just barely out of scope for this lab).

Let’s open the directory that we originally created in the project creation screen (Spark ADW Lab) in VS Code:

Note that the **Output** directory structure will be the same as the input directory. Even the data file will be copied over despite no conversion taking place. There will also be a couple of **checkpoints.json** files that will be created by the SMA. These are json files that contain instructions for the Snowpark Checkpoints extension. The Snowflake extension can load checkpoints into both the source and output code based on the data in those files. We will ignore them for now.

Finally, let’s compare the input python script with the converted one in the output script.

This is a very basic side-by-side comparison with the original Spark code on the left and the output Snowpark compatible code on the right. Looks like some imports were converted as well as the session call(s). We can see an EWI at the bottom of the image above, but let’s not start there. We need to find the parsing error before we do anything else.

We can search the document for the error code for that parsing error that was shown in both the UI and the issues.csv: **SPRKPY1101**.

Since I have not filtered the results, the listing of this error code in the **issues.csv** also comes up in the search and the **AssessmentReport.json** that is used to build the **AssessmentReport.docx** summary assessment report. This is the main report that users will navigate through to understand a large workload, but we did not look at it in this lab. ([More info on the this report can be found in the SMA documentation](https://docs.snowconvert.com/sma/user-guide/assessment/output-reports/curated-reports).) Let’s choose where this EWI shows up in the **pipeline_dimcustomer.py** file as shown above.

You can see that this line of code was present at the bottom of the source code.

```python
# Conversion Input.
some rogue code that doesn't make any sense!

# Conversion Output.
some
# EWI: SPRKPY1101 => Unrecognized or invalid CODE STATEMENT @(131, 6). Last valid token was 'some' @(131, 1), failed token 'rogue' @(131, 6)
#     rogue code that doesn't make any sense!
```

Looks like this parsing error was because of… “some rogue code that doesn’t make any sense!”. This line of code is at the bottom of the pipeline file. This is not unusual to have extra characters or other elements in a code file as part of an extraction from a source. Note have the SMA detected that this was not valid Python code, and it generated the parsing error.

You can also see how the SMA inserts both the error code and the description into the output code as a comment where the error occurred. This is how all error messages will appear in the output.

Since this is not valid code, it is at the end of the file, and there is nothing else that was removed as a result of this error, the original code and the comment can safely be removed from the output code file.

And now we’ve resolved our first and most serious issue. Get excited.

Let’s work through the rest of our EWIs in this file. We can search for “EWI” because we now know that text will appear in the comment every time there is an error code. (Alternatively, we could sort the issues.csv file and order the issues by severity… but that’s not really necessary here.)

The next one is actually just a warning, not an error. It’s telling us that there was a function used that isn’t always equivalent in Spark and Snowpark:

```python
#EWI: SPRKPY1067 => Snowpark does not support split functions with more than two parameters or containing regex pattern. See documentation for more info.
split_col = split(df_uppercase['NAME'], '.first:')
```

The description here though gives away that we probably don’t have to worry about this. There are only two parameters being passed. Let’s leave this EWI as a comment in the file, so we know to check for it when we are running the file later.

The last one for this file is a conversion error saying that something is not supported:

This is the write call to the spark jdbc driver to write the output dataframe into SQL Server. Since this is part of the “resolve all inputs/outputs” step that we are going to deal with after we address our issues, we’ll leave this for later. Note, however, that this error must be resolved. The previous one was just a warning and may still work with no change being made.

## Resolving the Session Calls

The session calls are converted by the SMA, but you should pay special attention to them to make sure they are functional. In our pipeline script, this is the before and after code:

The SparkSession reference was changed to Session. You can see that reference change near the top of this file in the import statement as well:

Note in the image above, the variable assignment of the session call to “spark” is not changed. This is because this is a variable assignment. It is not necessary to change this, but if you’d like to change the “spark” decorator to “session”, that would be more in line with what Snowpark recommends. (Note that the VS Code Extension “SMA Assistant” will suggest these changes as well.)

This is a simple exercise, but it’s worth doing. You can do a find and replace using VS Code’s own search ability to find the references to “spark” in this file and replace them with session. You can see the result of this in the image below. The references to the “spark” variable in the converted code have been replaced with “session”:

We also can remove something else from this session call. Since we are not going to be running “spark” anymore, we do not need to specify the driver path for the spark driver. So we can remove the config function entirely from the session call like this:

```python
# Old Converted output.
# Spark Session
session = Session.builder.config('spark.driver.extraClassPath', driver_path) \
                    .app_name('SparkSQLServerExample', True) \
                    .getOrCreate()

# New Converted Output
# Snowpark Session
session = Session.builder.app_name('SparkSQLServerExample', True).getOrCreate()
```

Might as well convert it to a single line. The SMA couldn’t be sure we didn’t need that driver (although that seems logical), so it did not remove it. But now that we have our session call is complete.

(Note that the SMA also adds a “query tag” to the session. This is to help troubleshoot issues with this session or query later on, but this is completely optional to leave or remove.)

### Notes on the Session Calls

Believe it or not that is all that we need to change in the code for the session call, but that’s not all we need to do to create the session. This refers back to the original question that a lot of this depends on where you want to run these files. These original spark session calls used a configuration that was setup elsewhere. If you look at the original Spark session call it’s looking for a config file that is being read into a pandas dataframe location at the start of this script file (this is actually true for our notebook file as well).

Snowpark can function the same way, and this conversion assumes that is how this user will run this code. However, for the existing session call to work, the user would have to load all of the information for their Snowflake account into the local (or at least accessible) connections.toml file on this machine, and that the account they are attempting to connect to is set as the default. [You can learn more about updating the connections.toml file in the Snowflake/Snowpark documentation](https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-session#connect-by-using-the-connections-toml-file), but the idea behind it is that there is an accessible location that has the credentials. When a snowpark session is created, it is going to check this… unless the connection parameters are explicitly passed to the session call.

The standard way to do this is to input the connection parameters directly as strings and call them with the session:

```python
# Parameters in a dictionary.
connection_parameters = {
  "account": "<your snowflake account>",
  "user": "<your snowflake user>",
  "password": "<your snowflake password>",
  "role": "<your snowflake role>",  # optional
  "warehouse": "<your snowflake warehouse>",  # optional
  "database": "<your snowflake database>",  # optional
  "schema": "<your snowflake schema>",  # optional
}

# The session call
session = Session.builder.configs(connection_parameters).app_name("AdventureWorksSummary", True).getOrCreate()
```

AdventureWorks appears to have referenced a file with these credentials and called it. Assuming there is a similar file called ‘snowflake_credentials.txt’ that is accessible, then the syntax that would match that could look something like:

```python
# Load into a dataframe.
snow_creds = pd.read_csv('snowflake_credentials.txt', index_col=None, header=0)

# Build the parameters.
connection_parameters = {
  "account": snow_creds.loc[snow_creds['Specific_Element'] == 'Account', 'Value'].item(),
  "user": snow_creds.loc[snow_creds['Specific_Element'] == 'Username', 'Value'].item(),
  "password": snow_creds.loc[snow_creds['Specific_Element'] == 'Password', 'Value'].item(),
  "role": "<your snowflake role>",  # optional
  "warehouse": snow_creds.loc[snow_creds['Specific_Element'] == 'Warehouse', 'Value'].item(),  # optional
  "database": snow_creds.loc[snow_creds['Specific_Element'] == 'Database', 'Value'].item(),  # optional
  "schema": snow_creds.loc[snow_creds['Specific_Element'] == 'Schema', 'Value'].item(),  # optional
}

# Then pass the parameters to the configs function of the session builder.
session = Session.builder.configs(connection_parameters).app_name("AdventureWorksSummary", True).getOrCreate()
```

For the purpose of the time limit on this lab, the first option may make more sense. [There’s more on this in the Snowpark documentation](https://docs.snowflake.com/en/developer-guide/snowpark/python/creating-session#connect-by-specifying-connection-parameters).

Note that for our notebook file to run inside of Snowflake using Snowsight, you wouldn’t need to do any of this. You would just call the active session and run it.

Now it’s time for the most critical component of this migration, resolving any input/output references.

## Resolving the Inputs and Outputs

So let’s resolve our inputs and outputs now. Note that this is going to diverge based on whether you’re running the files locally or Snowflake. for the python script, Let’s make sure what we gain/lose by running directly inside of Snowsight: **you cannot run the whole operation in Snowsight** (at least not currently). The local csv file is not accessible from Snowsight. You will have to load the .csv file into a stage manually. This will likely not be an ideal solution, but we can test the conversion by doing this.

So we’ll first prep this file to be run/orchestrated locally, and then to be run in Snowflake.

To get the pipeline script’s inputs and output resolved, we need to first identify them. They are pretty simple. This script seems to:

* access a local file
* load the result into SQL Server (but now Snowflake)
* moves the file to make way for the next one

Simple enough. So we need to replace each component of the code that does those things. Let’s start with accessing the local file.

As was mentioned at the start of this, it would be strongly suggested to rearchitect the Point of Sale System and the orchestration tools used to run this python script, to put the output file into a cloud storage location. Then you could turn that location into an External Table, and voila… you are in Snowflake. However, the current architecture says that this file is not in a cloud storage location and will stay where it is, so we need to create a way for Snowflake to access this file preserving the existing logic.

We have options to do this, but we will create an internal stage and move the file into the stage with the script. We would then need to move the file in the local file system, and also move it in the stage. This can all be done with Snowpark. Let’s break it down:

* accessing a local file: Create an internal stage (it one doesn’t exist already) -> Load the file into the stage -> Read the file into a dataframe
* loading the result into SQL Server: Load the transformed data into a table in Snowflake
* moves the file to make way for the next one: Move the local file -> Move the file in the stage.

Let’s look at code that can do each of these things.

### Access a Locally Accessible File

This source code in Spark looks like this:

```python
# Spark read from a local csv file.
df = spark.read.csv('customer_update.csv', header=True, inferSchema=True)
```

And the transformed snowpark code (by the SMA) looks like this:

```python
# Snowpark read from a local csv file.
df = session.read.option("PARSE_HEADER", True).option("INFER_SCHEMA", True).csv('customer_update.csv')
```

We can replace that with this with code that does the steps above:

1. Create an internal stage (if one does not exist already). We will create a stage called ‘LOCAL_LOAD_STAGE’ and go through a few steps to make sure that the stage is r

```python
# Create a stage if one does not already exist.
# name the stage we're going to use.
target_stage_name = "LOCAL_LOAD_STAGE"

# Check to see if this stage already exists.
stages = session.sql("SHOW STAGES").collect()
target_stages = [stage for stage in stages if stage['name'] == target_stage_name]

# Create the stage if it does not already exist.
if(len(target_stages) < 1):
    from snowflake.core import Root
    from snowflake.core.stage import Stage, StageEncryption, StageResource
    root = Root(session)
    my_stage = Stage(name="LOCAL_LOAD_STAGE",encryption=StageEncryption(type="SNOWFLAKE_SSE"))
    root.databases["ADVENTUREWORKS"].schemas["DBO"].stages.create(my_stage)
    print('%s created.'%(target_stage_name))
else:
    print('%s already exists.'%(target_stage_name))
```

2. Load the file into the stage.

```python
# Move the file.
put_results = session.file.put(local_file_name="customer_update.csv",
                    stage_location="ADVENTUREWORKS.DBO.LOCAL_LOAD_STAGE",
                    overwrite=False,
                    auto_compress=False)

# Read the results.
for r in put_results:
    str_output = ("File {src}: {stat}").format(src=r.source,stat=r.status)
    print(str_output)
```

3. Read the file into a dataframe. This is the part that the SMA actually converted. We need to specify that the location of the file is now the internal stage.

```python
# Location of the file in the stage.
csv_file_path = "@LOCAL_LOAD_STAGE/customer_update.csv"

# Spark read from a local csv file.
df = session.read.option("PARSE_HEADER", True).option("INFER_SCHEMA", True).csv(csv_file_path)
```

The result of that would look like this:

Let’s move on to the next step.

### Load the result into Snowflake

The original script wrote the dataframe into SQL Server. Now we are going to load into Snowflake. This is a much simpler conversion. The dataframe is already a Snowpark dataframe. This is one of the advantages of Snowflake. Now that the data is accessible to Snowflake, everything happens inside Snowflake.

```python
# Original output from the conversion tool.
# Write the DataFrame to SQL Server.
#EWI: SPRKPY1002 => pyspark.sql.readwriter.DataFrameWriter.jdbc is not supported
df_transformed.write.jdbc(url=sql_server_url,
              table='dbo.DimCustomer',
              mode="append",
              properties={
                  "user": sql_server_user,
                  "password": sql_server_password,
                  "driver": driver_path
              })

# Corrected Snowflake/Snowpark code.
df_transformed.write.save_as_table("ADVENTUREWORKS.DBO.DIMCUSTOMER", mode="append")
```

Note that we may want to write to a temp table to do some testing/validation, but this is the behavior in the original script.

### Move the file to make way for the next one

This is the behavior in the orginal script. We don’t really need to make this happen in Snowflake, but we can to showcase the exact same functionality in the stage. This is done with an os command in the original file system. That does not depend on Spark and will remain the same. But to emulate this behavior in snowpark, we would need to move this file in the stage to a new directory.

This can be done simply enough with the following python code:

```python
# New filename.
original_filepath = '@LOCAL_LOAD_STAGE/customer_update.csv'
new_filepath = '@LOCAL_LOAD_STAGE/old_versions/customer_update_%s.csv'%(today_time)

copy_sql = f"COPY FILES INTO {new_filepath} FROM {original_filepath}"
session.sql(copy_sql).collect()
print(f"File copied from {original_filepath} to {new_filepath}")

remove_sql = f"REMOVE {original_filepath}"
session.sql(remove_sql).collect()
print(f"Original file {original_filepath} removed.")
```

Note that this would not replace any of the existing code. Since we already want to keep the existing motion of moving the spark code to snowpark, we will leave the os reference. The final version will look like this:

Now we have the same motion completely done. Now let’s do our final cleanup, and test this script out.

## Clean up and Test

We never looked at our import calls and we have config files that are not necessary at all. We could leave the references to the config files and run the script. In fact, assuming those config files are still accessible, then the code will still run. But if we’re taking a close look at our import statements, we might as well remove them. These files are represented by all of the code between the import statements and the session call:

There’s a few other things we should do:

* Check that all of our imports are still necessary. We can leave them for now. If there is an erorr, we can address it.
* We also have one EWI that we left in there as a warning to check. So we want to make sure we inspect that output.
* We need to make sure that our file system behavior mirrors that of the expected file system for the POS system. To do this, we should move the customer_update.csv file into the root folder you chose when first launching VS Code.
* Create a directory called “old_versions” in that same directory. This should allow the os operations to run.

Finally, if you are not comfortable running the code directly into the production table, you can create a copy of that table for this test, and point the load to that copy. Replace the load statement with the one below. Since this is a lab, feel free to write to the “production” table:

```python
# In case we want to test.
create_sql = """
                CREATE OR REPLACE TABLE ADVENTUREWORKS.DBO.DIMCUSTOMER_1
                AS select * from ADVENTUREWORKS.DBO.DIMCUSTOMER;
                """
session.sql(create_sql).collect()

# Write the DataFrame to SQL Server.
df_transformed.write.save_as_table("ADVENTUREWORKS.DBO.DIMCUSTOMER_1", mode="append")
```

Now we’re finally ready to test this out. We can run this script in Python to a testing table and see if it will fail. So run it!

Tragic! The script failed with the following error:

It looks like the way we are referencing an identifier is not the way that Snowpark wanted it. The code that failed is in the exact spot where the remaining EWI is:

You could reference the documentation on the link provided by the error, but in the interest of time, Snowpark needs this variable to expressly be a literal. We need to make the following replacement:

```python
# Old
split_col = split(df_uppercase['NAME'], '.first:')

# New
split_col = split(df_uppercase['NAME'], lit('.first:'))
```

This should take care of this error. Note that there are always going to be some functional differences between source and a target platforms. Conversion tools like the SMA like to make these differences as obvious as possible. But note that no conversion is 100% automated.

Let’s run it again. This time… success!

We can write some queries in python to validate this, but why don’t we just go into Snowflake (because that’s what we’re about to do anyways).

Navigate to your snowflake account that you have been using to run these scripts. This should be the same one you used to load the database from SQL Server (and if you haven’t done that, the above scripts won’t work anyways beecause the data has not yet been migrated).

You can quickly check this by seeing if the stage was created with the file:

Enable the directory table view to see if the old_versions folder is in there:

And it is:

Since that was the last element of our script, it looks like we’re good!

We can also simply validate that the data was loaded by simply querying the table for the data we uploaded. You can open a new worksheet and simply write this query:

```sql
select * from ADVENTUREWORKS.DBO.DIMCUSTOMER
where FIRSTNAME like '%Brandon%'
AND LASTNAME like '%Carver%'
```

This is one of the names that was just loaded. And it looks like our pipeline has worked:

## Running the Pipeline Script in Snowsight

Let’s take a quick look back at the flow we are attempting to convert was doing in Spark:

* accessing a local file
* loading the result into SQL Server
* moving the file to make way for the next one

This flow is not possible to run entirely from within Snowsight. Snowsight does not have access to a local file system. The recommendation here would be to move the export from the POS to a data lake… or any number of other options that would be accessible via Snowsight.

We can, however, take a closer look at how Snowpark handles the transformation logic by running the Python script in Snowflake. If you have already made the changes recommended above, you can run the body of the script in a Python Worksheet in Snowflake.

To do this, first login to your Snowflake account and navigate to the worksheets section. In this worksheet, create a new Python worksheet:

Specify the database, schema, role, and warehouse you’d like to use:

Now we do not have to deal with our session call. You will see a template generated in the worksheet window:

Let’s start by bringing over our import calls. After making the previous script ready to use, we should have the following set of imports:

```python
# General Imports
import pandas as pd
import os
import shutil
import datetime

# Snowpark Imports
from snowflake.snowpark import Session
from snowflake.snowpark.functions import col
from snowflake.snowpark.functions import upper
from snowflake.snowpark.functions import lower
from snowflake.snowpark.functions import split
from snowflake.snowpark.functions import trim
from snowflake.snowpark.functions import when
from snowflake.snowpark.functions import lit
from snowflake.snowpark.functions import expr
from snowflake.snowpark.functions import regexp_replace
```

We only need the snowpark imports. We will not be moving files around a file system. We could keep the datetime reference if we want to move the file in the stage. (Let’s do it.)

Paste the Snowpark imports (plus datetime) in the python worksheet below the other imports that are already present. Note that ‘col’ is already imported, so you can remove one of those:

Under the “def main” call, let’s paste in all of our transformation code. This will include everything from the assignment of the csv location to the writing of the dataframe to a table.

From here:

To here:

We can also add back in the code that moves the files around in the stage. This part:

Before you can run the code though, you will have to manually create the stage and move the file into the stage. We can add the create stage statement into the script, but we would still need to manually load the file into the stage.

So if you open another worksheet (this time… a sql worksheet), you can run a basic SQL statement that will create the stage:

```sql
CREATE STAGE my_int_stage
  ENCRYPTION = (TYPE = 'SNOWFLAKE_SSE');
```

Make sure to select the correct database, schema, role, and warehouse:

You can also [create an internal stage directly in the Snowsight UI](https://docs.snowflake.com/en/user-guide/data-load-local-file-system-create-stage#create-a-named-stage-using-snowsight). Now that the stage exists, we can manually load the file of interest into the stage. Navigate to the Databases section of the Snowsight UI, and find the stage we just created in the appropriate database.schema:

Let’s add our csv file by selecting the +Files option in the top right corner of the window. This will launch the Upload Your Files menu:

Drag and drop or browse to our project directory and load the customer_update.csv file into the stage:

Select Upload in the bottom right corner of the screen. You will be taken back to the stage screen. To view the files, you will need to select Enable Directory Table:

And now… our file appears in the stage:

This is not really a pipeline anymore, of course. But at least we can run the login in Snowflake. Run the rest of the code that you moved into the worksheet. This user had success the first time, but that’s no guarantee of success the second time:

Note that once you’ve defined this function in Snowflake, you can call it in other ways. If AdventureWorks is 100% replacing their POS, then it may make sense to have the transformation logic in Snowflake, especially if orchestration and file movement will be handled somewhere else entirely. This allows Snowpark to focus on where it excels with the transformation logic.

## Conclusion

And that’s it for the script file. It’s not the best example of a pipeline, but it does hit hard on how to deal with the output from the SMA:

* Resolve All Issues
* Resolve the session calls
* Resolve the Input/Outputs
* Clean up and Test!

Let’s move on to the reporting notebook.

---
title: Snowpark Migration Accelerator: Pipeline Lab - Assessment
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/migration-lab/compatibility-and-assessment.md
section: Migrations
---

# Snowpark Migration Accelerator: Pipeline Lab - Assessment

As with SnowConvert, we will run code through the SMA, evaluate the result, resolve any issues, and run it on the new platform. However, unlike SnowConvert, the SMA does NOT connect to any source platform, nor does it connect to Snowflake. It is a local application that can be run completely offline. But its power is in its assessment. Most of the heavy lifting on conversion has been done by building compatibility between the Spark API and the Snowpark API.

## Extraction / Code Availability

The files we will use for the AdventureWorks Lab are here:

[`end_to_end_lab_source_code.zip`](../../../../_downloads/62b8c258bb35353bbc7914de3d0095d2/end_to_end_lab_source_code.zip)

For the purpose of this lab, we will assume that the notebook and script file that we are converting are already accessible as files. In general, the SMA takes in files as an input and does not connect to any source platform. If the files are being orchestrated by a specific tool, you may need to export them. If you are using notebooks as part of databricks or EMR, you can export those as .ipynb files just as the jupyter notebook we are going to run through the SMA today.

This lab only has a few files, but it’s common in a large migration to have hundreds or thousands of files. Extract what you can and run those files through the SMA. The good thing about using a tool like this is that it can tell you what you might be missing.

Note that there is also a data file as well: ‘customer_update.csv’. This is a sample of the file being generated locally by the Point of Sale (POS) system that Adventure Works is currently using. While that system is also being updated, this Proof of Concept (POC) is focused on making the existing pipeline work with Snowpark instead of Spark.

Let’s take each of these files, and drop them into a single directory on our local machine:

It would be recommended to create a project directory. This can be called whatever you like, but as a suggestion for this lab, let’s go with **spark_adw_lab**. This means we would create a folder with the name spark_adw_lab, then create another folder in that directory called source_files (the path being something like **/your/accessible/directory/spark_adw_lab/source_files**). This isn’t required, but will help keep things organized. The SMA will scan any set of subdirectories as well, so you could add specific pipelines in a folder and notebooks in another.

## Access

Now that we have our source files in an accessible directory, it is time to run the SMA.

If you have not already downloaded it, the SMA is accessible from [the Snowflake website](https://www.snowflake.com/en/migrate-to-the-cloud/migration-accelerator/). It is also accessible from the Migrations page in SnowSight in your Snowflake account:

Once you download the tool, install it! There is more information on [installing the SMA](https://docs.snowconvert.com/sma/general/getting-started/installation) in the SMA documentation.

## Using the Snowpark Migration Accelerator

Once you have installed the tool, open it! When you launch the SMA, it will look very similar to its partner tool, SnowConvert. Both of these tools are built on a similar concept where you input code files into the tool and it runs. As a reminder, we have seen that SnowConvert can take the DDL and data directly from the source and input it directly into Snowflake. The SMA does not do this. It only takes in code files as a source and outputs those files to something that is compatible with Snowflake. This is primarily because the tool does not know how a user will orchestrate their spark code, but also to make it more secure to use.

Once you have launched the tool, It will ask you if you would like to create a new project or open an already existing one:

This will take you to the project creation screen:

On this screen, you will enter the relevant details for your project. Note that all fields are required. For this project, you could enter something similar to:

* Project Name: **Spark ADW Lab**
* Email Address: **your.name@your_domain.com**
* Company name: **Your Organization**
* Input Folder Path: **/your/accessible/directory/spark_adw_lab/source_files**
* Output Folder Path (the SMA will auto generate a directory for the output, but you can modify this): **/your/accessible/directory/spark_adw_lab/source_files_output**

A couple of notes about this project creation screen:

* The email and company fields are to help you track projects that may be ongoing. For example, at any large SI, there may be multiple email addresses and multiple organizations on behalf of whom a single user may run the SMA. This information is stored in the project file created by the SMA.
* There is a hidden field for SQL. Note that the SMA can scan/analyze SQL, but it does not convert any SQL.It also can only identify SQL in the following circumstances:

  + SQL that is in .sql files
  + SQL that is in SQL cells in a Jupyter Notebook
  + SQL that is passed as a single string to a spark.sql statement.
* While this SQL capability can be helpful to determine where there is incompatible SQL with Snowflake, it is not the primary use for the SMA. More support for Spark SQL and HiveQL are coming soon.

Once you’ve entered all of your project information, for this HoL, we are going to **skip** the assessment phase. (What… aren’t we building an assessment?) If you do not want to convert any code, running an assessment can be helpful as it will allow you to get the full set of reports generated by the SMA. You can then navigate through those or share them with others in your organization while not creating extra copies of the converted code. However, all of these same assessment reports are also generated during a conversion. So we will skip assessment mode for now and go to conversion.

On the **Conversion settings** page, select **Skip Assessment**, and then click **Continue** in the bottom right corner.

Note that what you are “saving” is a local project file. All of the information that you entered on the project creation screen will be saved to this local text file with the extension ‘.snowma’ in the directory you just specified above.

This will take you to the **Conversion settings** page. From here, you can choose **Default Settings** to proceed with conversion, or select **Customize settings** to review and adjust advanced options.

There is one setting that will simplify the output of this hands on lab, which would be to disable the attempted conversion of pandas dataframes to the Snowpark API:

This one setting is currently being updated, so there will be a lot of additional warnings added if this option is not deselected. Most of the pandas dataframe can be used as part of the modin implementation of pandas, so a simple import call change should suffice for now. Look for an issue on this issue by the end of June 2025. You can look at the other settings, but we will leave them as is. It’s important to note that there is a testing library that the output code is compatible with called Snowpark Checkpoints. There are settings related to this, but we will not alter them in this lab.

Select “Save settings” to save and close your settings.

To start the conversion, click **Continue** in the bottom right corner of the application.

The next screen will show the progress of the conversion:

Like SnowConvert, the SMA is building a semantic model of the entire codebase in the input directory. It is building relationships between code elements, sql objects, and other referenced artifacts, and creating the closest output it can to a functional equivalent for Snowflake. This primarily means converting references from the Spark API to the Snowpark API. The SMA’s engineering team is a part of the Snowpark engineering team, so most transformations that take place have been built into the Snowpark API, so the changes may seem minor. But the wealth of assessment information that is generated by the SMA allows a migration project to really get moving forward. An in-depth look at all of the generated assessment information will have to take place elsewhere because the SMA has likely finished this conversion in the time it took to read this paragraph.

When the SMA has finished, the results page will show the… results.

The results page has some “Readiness Scores” that are very simplified metrics on how “ready” this codebase is for Snowflake. We will review the results next, but note that running the Snowpark Migration Accelerator is the easy part. Note that this is just an “accelerator”. It is not a silver bullet or a hands-off automation tool. Pipelines that connect to one data source and output to another are not fully migrated by this tool will always need more attention than a straight SQL-to-SQL migration of DDL as is done by SnowConvert. But Snowflake is continuously working towards making this as simple as possible.

## Interpreting the Output

The SMA, even more so than SnowConvert, generates a large amount of assessment information. It can be difficult to parse through the results. There are many different directions you could go depending on what you want to achieve.

Note that this is an extremely simple scenario, so some of the steps we are going to take will look like overkill. (I mean, do we really need to analyze the dependencies present in this project when there are only two files and we could just… look?) The goal is to still walk through what we normally recommend even in this small POC. But let’s be clear… that the scope is clear, and there are only two files. We just need both of them to work as they do in the source.

### **Readiness Scores**

With that in mind, let’s take a look at the first part of the output that you will see in the application: the readiness scores. There will be multiple readiness scores and you can expand on each one of them to better understand what is captured by that readiness score.

Each readiness score is a very basic calculation of the count of functions or elements in an API that are supported in Snowpark/Snowflake divided by the count of all functions or elements related to that API for this execution. The calculation showing you how the score is calculated is shown when you expand the window. You can also learn more about how to interpret the readiness scores by selecting “How to read through the scores” near the top left corner of this window.

This execution has a Snowpark API Readiness Score of 96.02%. (Please note that yours may be different! These tools are updated on a biweekly basis and there may be a change as compatibility between the two platforms is ever evolving.) This means that 96.02% of the references to the Spark API that the tool identified are supported in Snowflake. “Supported” in this case means that there could be a similar function that already exists or that the SMA has created a functionally equivalent output. The higher this score is, the more likely this code can quickly run in Snowflake.

(Note that this 96.02% of references are either supported directly by the Snowpark API or they are converted by the SMA. Most of them are likely supported directly, but you can find out exactly what was converted and what was passed through by reviewing the **SparkUsagesInventory.csv** report in the output Reports folder generated by the SMA. We will not walk through that in this lab as we will see what is NOT supported in the **Issues.csv** file, but you can use this information for reference.)

There are other readiness scores and you may see more than what is shown in the lab as the readiness scores do change over time. This lab won’t walk through each of them, but note that a low score will always be worth investigating.

### **Code Analyzed**

Just below each of the readiness scores, will be a small indicator that lets you know if there was any code that could not be processed:

This number represents the **percentage of files** that were fully parsed. If this number is less than 100%, then there is some code that the SMA could not parse or process. This is the first place you should start looking to resolve problems. If it’s less than 100%, you should see where the parsing errors occurred by looking at the issue summary. This is the first place you should look when working through the SMA’s output because it’s the only one where it might make sense to run the tool again if a large amount of code was not able to be scanned.

---
title: Snowpark Migration Accelerator: Prerequisites
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/prerequisites.md
section: Migrations
---

# Snowpark Migration Accelerator: Prerequisites

**Minimum requirements:**

1. Snowpark Migration Accelerator **2.6.8** version or higher.
2. Checkpoints Extension installed and configured.

---
title: Snowpark Migration Accelerator: Readiness Scores
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/readiness-scores.md
section: Migrations
---

# Snowpark Migration Accelerator: Readiness Scores

The Snowpark Migration Accelerator (SMA) evaluates your code and produces detailed assessment data. To make this information more accessible, SMA calculates Readiness Scores that measure how easily your code can be migrated to Snowflake. These scores act as compatibility indicators - the higher the score, the more compatible your code is with Snowflake’s platform. You can obtain these scores by simply running the SMA tool.

The SMA generates the following Readiness Scores:

* Snowpark API Readiness Score
* Snowpark Connect Readiness Score
* Third Party API Readiness Score
* SQL Readiness Score

The Readiness Scores indicate how compatible your code is with Snowflake, not how much work remains to be done. Even with a high readiness score, the remaining incompatible code might still require significant effort to migrate. To accurately estimate the work needed for migration, review the complete assessment report. If you need help creating a migration plan or estimating the effort required, [reach out](../../support/contact-us.md) to our team.

## Levels

The Snowpark Migration Accelerator (SMA) uses a color-coded scoring system similar to a traffic light:

* **Red** - Critical issue detected. Stop immediately and resolve the problem, as it significantly impacts the migration process or prevents accurate code analysis. Follow the provided action steps before proceeding.
* **Yellow** - Warning detected. Review the action steps carefully and understand the potential impact on your migration. Once you understand the implications, you may continue to the next step.
* **Green** - No major issues detected. While this indicates there are no significant blockers for migration, the code may still need adjustments. Review the action steps and continue with the migration process.

## How to Interpret the Scores

For each score, you will receive:

* A numerical value
* A status indicator (red, yellow, or green as explained earlier)
* A recommended next action

We strongly recommend that you:

* **Review scores sequentially** - When you encounter a red score, investigate and address that issue right away
* **Review all recommended actions for every score** - Check the suggested next steps for all results, including green scores, as they contain important action items

Let’s examine the readiness scores currently available in the system.

## Snowpark API Readiness Score

The Snowpark Migration Accelerator (SMA) generates a Snowpark API Readiness Score, which indicates how ready your code is for migration. It’s important to note that this score only evaluates the usage of Spark API components and does not assess other elements such as third-party libraries or external dependencies in your code.

When SMA analyzes your code, it identifies all Spark API references, including both import statements and function calls. These references are documented in [the Spark API Usages Inventory](../scos-conversion/output-reports/sma-inventories.md), which you can find in your local output directory. Each reference is classified as either “supported” or “not supported” according to [the Spark Reference Categories](../scos-conversion/spark-reference-categories.md). The readiness score is calculated by dividing the number of supported references by the total number of references found in your code.

This score is displayed as a percentage, indicating how well Snowflake supports the Spark API references found in your code. A higher percentage means better compatibility with Snowflake. You can view this score in both the [detailed report](../scos-conversion/output-reports/curated-reports.md) and the [assessment summary](understanding-the-assessment-summary.md) sections of the application.

The Readiness Score shown here is the original score generated by the SMA. For newer SMA versions that display only one Readiness Score, this score specifically measures Spark API compatibility.

### Snowpark API Readiness Levels

Based on the calculated score, the result will be classified into one of three categories: green, yellow, or red. The application and output report will provide specific recommendations based on your score category.

The Snowpark API Readiness Score will be assigned one of these levels:

* Green: Most Spark API references are supported, making this workload a strong candidate for migration. If other indicators are also green, consider proceeding with a Proof of Concept.
* Yellow: Some Spark API references are not supported, which will require additional migration effort. Next steps should include creating an inventory of unsupported items and estimating the conversion effort needed.

## Snowpark Connect Readiness Score

The Snowpark Connect Readiness Score measures the percentage of Spark API references in your codebase that are supported by Snowpark Connect. This score provides an assessment of your existing Spark API code’s readiness for execution within the Snowpark Connect environment.

### How It’s Calculated

During its execution, the SMA scans your codebase to identify all references to the Spark API. Examples of such references include import statements, function calls, and class instantiations. All discovered references are then logged in the [Spark API Usages Inventory](../scos-conversion/output-reports/sma-inventories.md). This inventory is generated as a file in your local output directory. For every reference listed in the inventory, the SMA populates the `IsSnowparkConnectToolSupported` column, setting it to `True` if the API usage is supported by Snowpark Connect, or `False` if it is not.

To calculate the readiness score, the SMA takes all of the supported references and divides them by the total references found in the codebase:

For example, if your codebase has 100 Spark API references and 90 of them are supported by Snowpark Connect, your Snowpark Connect Readiness Score would be 90%.

A higher percentage for the Snowpark Connect Readiness Score indicates a greater degree of compatibility with Snowpark Connect, suggesting that a larger portion of your Spark code aligns with functionalities supported by Snowpark Connect.

### Readiness Levels

The compatibility analysis yields a readiness score, which is categorized into one of three distinct levels: **Green**, **Yellow**, or **Red**. Both the application’s [assessment summary](understanding-the-assessment-summary.md) and the generated [detailed report](../scos-conversion/output-reports/curated-reports.md) will display this readiness level, accompanied by specific guidance tailored to the findings:

* Green - This workload is highly compatible with Snowpark Connect as the majority of references to the Spark API are supported without any code changes. [Files that are fully compatible](../../use-cases/snowpark-connect/identifying-fully-compatible-files.md) can be run immediately, though some files will still require [issue resolution](../../issue-analysis/approach.md).

  A good next step would be to try to [run some of the files in Snowflake](../../../../developer-guide/snowpark-connect/snowpark-connect-overview.md). View the [reports](../scos-conversion/output-reports/README.md) generated by the SMA and select a file that might be [ready to run with Snowpark Connect](../../use-cases/snowpark-connect/README.md).
* Yellow - There are some elements of the Spark API in this workload that are not supported in Snowpark Connect or are incompatible with [Snowpark Connect for Spark](../../../../developer-guide/snowpark-connect/snowpark-connect-compatibility.md). This workload may still be able to run with Snowpark Connect, but there are elements that will require [issue resolution](../../issue-analysis/approach.md) or even re-architecture.

  The recommended next step would be to evaluate if code conversion makes more sense. You can do this by [converting this workload to the Snowpark API](../snowpark-api-conversion/README.md), and reviewing the Snowpark API Readiness Score. You can also dive deeper into this workload’s compatibility with Snowpark Connect by viewing the [reports](../scos-conversion/output-reports/README.md) generated by the SMA. You can explore the compatibility with Snowpark by [understanding which files may be ready to run](../../use-cases/snowpark-connect/README.md) and working through the Spark elements that have [issues that need resolution](../../issue-analysis/approach.md).
* Red - This workload has a significant number of references to the Spark API that are not supported in Snowpark Connect. However, this workload may still be a good candidate for conversion to the Snowpark API. The recommended next step would be to [convert this workload](../snowpark-api-conversion/README.md), and take a look at the Snowpark API Readiness Score. If you need help, feel free to reach out to [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

  If you would still like to further explore the compatibility with Snowpark Connect, you can view the [reports](../scos-conversion/output-reports/README.md) generated by the SMA. A good place to start would be to [understand which files may be ready to run](../../use-cases/snowpark-connect/README.md).

## Third-Party API Readiness Score

The Third-Party Readiness Score shows how many of your imported libraries can be used in Snowflake. To better understand this score, let’s first explain what we mean by “Third Party”:

**Third Party Library**: Any software package or library that is not developed, maintained, or controlled by Snowflake (or Snowpark in Snowflake).

The readiness score indicates the percentage of external libraries and packages that are compatible with Snowflake. For Python code, compatibility means the package is available through the Anaconda package collection in Snowpark. For Scala or Java code, compatibility means the package is already included in Snowpark’s core functionality.

The readiness score is calculated by dividing the number of supported third-party library imports by the total number of third-party library imports in your code.

Important Information About the Readiness Score:

* **Supported Third-Party Libraries in Snowpark**: This includes all libraries that Snowpark supports (including org.apache.spark)
* **Total Third-Party Library Calls**: The sum of all third-party library calls found in the code, including both Spark and non-Spark libraries, whether supported or unsupported by Snowpark.
* Only imports marked as “ThirdPartyLib” in the Import Usages Inventory are counted. Internal dependencies and imports from within the codebase are excluded.
* This metric counts the total number of calls, not unique library references. For example, if your code has 100 library calls total, with 80 calls to an unsupported library and 20 calls to a supported library, the support score would be 20%. This shows the actual usage frequency of supported vs. unsupported libraries in the code, rather than the ratio of unique library references.

### Third Party API Readiness Levels

Based on the calculated score, the result will be classified into one of three categories: green, yellow, or red. The application and output report will provide specific recommendations based on your score category.

The Third Party API Readiness Score will be assigned one of these levels:

* Green - The codebase uses Python libraries that are fully supported in Snowflake. No additional configuration is required.
* Yellow - The codebase contains at least one Python package or library that is not currently supported in Snowpark. You can add unsupported third-party packages using several methods described in the third-party package documentation. To identify unsupported packages, review the [Import Usages Inventory](../scos-conversion/output-reports/sma-inventories.md) generated by SMA. Then analyze how these packages are used in your code and plan their implementation in Snowflake.
* Red - The codebase heavily relies on packages or libraries not supported in Snowpark. This could mean either a single unsupported library is used extensively throughout the code, or multiple unsupported libraries are used across different parts of the codebase. A thorough assessment of these import statements is necessary to understand their impact. For guidance or assistance with package support, contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

## SQL Readiness Score

The SQL Readiness Score indicates what percentage of SQL elements in your source code can be automatically converted to Snowflake SQL using the Snowpark Migration Accelerator (SMA). A higher score means more of your code can be converted automatically, which makes the migration process easier and faster.

The readiness score is calculated by dividing the number of SQL elements that can be converted by the total number of SQL elements found in the source code.

### SQL Readiness Score Levels

The SQL Readiness Score will be assigned one of these levels:

* Green - Most SQL in this codebase is either directly supported by Snowflake or can be automatically converted by the SMA. While no conversion is perfect, this workload requires minimal manual adjustments for Snowflake migration.
* Yellow - Some SQL elements in this codebase are not supported by Snowflake, requiring additional effort for migration. Review the SQL Element Inventory for unsupported features and check the EWI’s in the issues output to create an action plan. You may need to make minor code adjustments or partially redesign some components.
* Red - A large portion of SQL in this codebase is not compatible with Snowflake, suggesting significant redesign may be necessary. To proceed, review the SQL Element Inventory for unsupported features and examine the EWI’s in the issues output to develop a migration strategy. For assistance, contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---

While readiness scores provide valuable insights, they should not be the only factor in determining a workload’s migration readiness. Consider multiple aspects of your migration plan alongside these scores, as they serve as an initial assessment rather than a complete evaluation. If you notice any readiness metrics that could be improved or aren’t accurately represented in the tool, [let us know](../../support/contact-us.md). The SMA team continuously works to enhance and refine these readiness measurements.

---
title: Snowpark Migration Accelerator: Readiness Scores
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/readiness-scores.md
section: Migrations
---

# Snowpark Migration Accelerator: Readiness Scores

The Snowpark Migration Accelerator (SMA) evaluates your code and produces detailed assessment data. To make this information more accessible, SMA calculates Readiness Scores that measure how easily your code can be migrated to Snowflake. These scores act as compatibility indicators - the higher the score, the more compatible your code is with Snowflake’s platform. You can obtain these scores by simply running the SMA tool.

The SMA generates the following Readiness Scores:

* Snowpark API Readiness Score
* Snowpark Connect Readiness Score
* Third Party API Readiness Score
* SQL Readiness Score

The Readiness Scores indicate how compatible your code is with Snowflake, not how much work remains to be done. Even with a high readiness score, the remaining incompatible code might still require significant effort to migrate. To accurately estimate the work needed for migration, review the complete assessment report. If you need help creating a migration plan or estimating the effort required, [reach out](../../support/contact-us.md) to our team.

## Levels

The Snowpark Migration Accelerator (SMA) uses a color-coded scoring system similar to a traffic light:

* **Red** - Critical issue detected. Stop immediately and resolve the problem, as it significantly impacts the migration process or prevents accurate code analysis. Follow the provided action steps before proceeding.
* **Yellow** - Warning detected. Review the action steps carefully and understand the potential impact on your migration. Once you understand the implications, you may continue to the next step.
* **Green** - No major issues detected. While this indicates there are no significant blockers for migration, the code may still need adjustments. Review the action steps and continue with the migration process.

## How to Interpret the Scores

For each score, you will receive:

* A numerical value
* A status indicator (red, yellow, or green as explained earlier)
* A recommended next action

We strongly recommend that you:

* **Review scores sequentially** - When you encounter a red score, investigate and address that issue right away
* **Review all recommended actions for every score** - Check the suggested next steps for all results, including green scores, as they contain important action items

Let’s examine the readiness scores currently available in the system.

## Snowpark API Readiness Score

The Snowpark Migration Accelerator (SMA) generates a Snowpark API Readiness Score, which indicates how ready your code is for migration. It’s important to note that this score only evaluates the usage of Spark API components and does not assess other elements such as third-party libraries or external dependencies in your code.

When SMA analyzes your code, it identifies all Spark API references, including both import statements and function calls. These references are documented in [the Spark API Usages Inventory](output-reports/sma-inventories.md), which you can find in your local output directory. Each reference is classified as either “supported” or “not supported” according to [the Spark Reference Categories](spark-reference-categories.md). The readiness score is calculated by dividing the number of supported references by the total number of references found in your code.

This score is displayed as a percentage, indicating how well Snowflake supports the Spark API references found in your code. A higher percentage means better compatibility with Snowflake. You can view this score in both the [detailed report](output-reports/curated-reports.md) and the [conversion summary](understanding-the-conversion-summary.md) sections of the application.

The Readiness Score shown here is the original score generated by the SMA. For newer SMA versions that display only one Readiness Score, this score specifically measures Spark API compatibility.

### Snowpark API Readiness Levels

Based on the calculated score, the result will be classified into one of three categories: green, yellow, or red. The application and output report will provide specific recommendations based on your score category.

The Snowpark API Readiness Score will be assigned one of these levels:

* Green: Most Spark API references are supported, making this workload a strong candidate for migration. If other indicators are also green, consider proceeding with a Proof of Concept.
* Yellow: Some Spark API references are not supported, which will require additional migration effort. Next steps should include creating an inventory of unsupported items and estimating the conversion effort needed.

## Snowpark Connect Readiness Score

The Snowpark Connect Readiness Score measures the percentage of Spark API references in your codebase that are supported by Snowpark Connect. This score provides an assessment of your existing Spark API code’s readiness for execution within the Snowpark Connect environment.

### How It’s Calculated

During its execution, the SMA scans your codebase to identify all references to the Spark API. Examples of such references include import statements, function calls, and class instantiations. All discovered references are then logged in the [Spark API Usages Inventory](output-reports/sma-inventories.md). This inventory is generated as a file in your local output directory. For every reference listed in the inventory, the SMA populates the `IsSnowparkConnectToolSupported` column, setting it to `True` if the API usage is supported by Snowpark Connect, or `False` if it is not.

To calculate the readiness score, the SMA takes all of the supported references and divides them by the total references found in the codebase:

For example, if your codebase has 100 Spark API references and 90 of them are supported by Snowpark Connect, your Snowpark Connect Readiness Score would be 90%.

A higher percentage for the Snowpark Connect Readiness Score indicates a greater degree of compatibility with Snowpark Connect, suggesting that a larger portion of your Spark code aligns with functionalities supported by Snowpark Connect.

### Readiness Levels

The compatibility analysis yields a readiness score, which is categorized into one of three distinct levels: **Green**, **Yellow**, or **Red**. Both the application’s [conversion summary](understanding-the-conversion-summary.md) and the generated [detailed report](output-reports/curated-reports.md) will display this readiness level, accompanied by specific guidance tailored to the findings:

* Green - This workload is highly compatible with Snowpark Connect as the majority of references to the Spark API are supported without any code changes. [Files that are fully compatible](../../use-cases/snowpark-connect/identifying-fully-compatible-files.md) can be run immediately, though some files will still require [issue resolution](../../issue-analysis/approach.md).

  A good next step would be to try to [run some of the files in Snowflake](../../../../developer-guide/snowpark-connect/snowpark-connect-overview.md). View the [reports](output-reports/README.md) generated by the SMA and select a file that might be [ready to run with Snowpark Connect](../../use-cases/snowpark-connect/README.md).
* Yellow - There are some elements of the Spark API in this workload that are not supported in Snowpark Connect or are incompatible with [Snowpark Connect for Spark](../../../../developer-guide/snowpark-connect/snowpark-connect-compatibility.md). This workload may still be able to run with Snowpark Connect, but there are elements that will require [issue resolution](../../issue-analysis/approach.md) or even re-architecture.

  The recommended next step would be to evaluate if code conversion makes more sense. You can do this by [converting this workload to the Snowpark API](../snowpark-api-conversion/README.md), and reviewing the Snowpark API Readiness Score. You can also dive deeper into this workload’s compatibility with Snowpark Connect by viewing the [reports](output-reports/README.md) generated by the SMA. You can explore the compatibility with Snowpark by [understanding which files may be ready to run](../../use-cases/snowpark-connect/README.md) and working through the Spark elements that have [issues that need resolution](../../issue-analysis/approach.md).
* Red - This workload has a significant number of references to the Spark API that are not supported in Snowpark Connect. However, this workload may still be a good candidate for conversion to the Snowpark API. The recommended next step would be to [convert this workload](../snowpark-api-conversion/README.md), and take a look at the Snowpark API Readiness Score. If you need help, feel free to reach out to [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

  If you would still like to further explore the compatibility with Snowpark Connect, you can view the [reports](output-reports/README.md) generated by the SMA. A good place to start would be to [understand which files may be ready to run](../../use-cases/snowpark-connect/README.md).

## Third-Party API Readiness Score

The Third-Party Readiness Score shows how many of your imported libraries can be used in Snowflake. To better understand this score, let’s first explain what we mean by “Third Party”:

**Third Party Library**: Any software package or library that is not developed, maintained, or controlled by Snowflake (or Snowpark in Snowflake).

The readiness score indicates the percentage of external libraries and packages that are compatible with Snowflake. For Python code, compatibility means the package is available through the Anaconda package collection in Snowpark. For Scala or Java code, compatibility means the package is already included in Snowpark’s core functionality.

The readiness score is calculated by dividing the number of supported third-party library imports by the total number of third-party library imports in your code.

Important Information About the Readiness Score:

* **Supported Third-Party Libraries in Snowpark**: This includes all libraries that Snowpark supports (including org.apache.spark)
* **Total Third-Party Library Calls**: The sum of all third-party library calls found in the code, including both Spark and non-Spark libraries, whether supported or unsupported by Snowpark.
* Only imports marked as “ThirdPartyLib” in the Import Usages Inventory are counted. Internal dependencies and imports from within the codebase are excluded.
* This metric counts the total number of calls, not unique library references. For example, if your code has 100 library calls total, with 80 calls to an unsupported library and 20 calls to a supported library, the support score would be 20%. This shows the actual usage frequency of supported vs. unsupported libraries in the code, rather than the ratio of unique library references.

### Third Party API Readiness Levels

Based on the calculated score, the result will be classified into one of three categories: green, yellow, or red. The application and output report will provide specific recommendations based on your score category.

The Third Party API Readiness Score will be assigned one of these levels:

* Green - The codebase uses Python libraries that are fully supported in Snowflake. No additional configuration is required.
* Yellow - The codebase contains at least one Python package or library that is not currently supported in Snowpark. You can add unsupported third-party packages using several methods described in the third-party package documentation. To identify unsupported packages, review the [Import Usages Inventory](output-reports/sma-inventories.md) generated by SMA. Then analyze how these packages are used in your code and plan their implementation in Snowflake.
* Red - The codebase heavily relies on packages or libraries not supported in Snowpark. This could mean either a single unsupported library is used extensively throughout the code, or multiple unsupported libraries are used across different parts of the codebase. A thorough assessment of these import statements is necessary to understand their impact. For guidance or assistance with package support, contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

## SQL Readiness Score

The SQL Readiness Score indicates what percentage of SQL elements in your source code can be automatically converted to Snowflake SQL using the Snowpark Migration Accelerator (SMA). A higher score means more of your code can be converted automatically, which makes the migration process easier and faster.

The readiness score is calculated by dividing the number of SQL elements that can be converted by the total number of SQL elements found in the source code.

### SQL Readiness Score Levels

The SQL Readiness Score will be assigned one of these levels:

* Green - Most SQL in this codebase is either directly supported by Snowflake or can be automatically converted by the SMA. While no conversion is perfect, this workload requires minimal manual adjustments for Snowflake migration.
* Yellow - Some SQL elements in this codebase are not supported by Snowflake, requiring additional effort for migration. Review the SQL Element Inventory for unsupported features and check the EWI’s in the issues output to create an action plan. You may need to make minor code adjustments or partially redesign some components.
* Red - A large portion of SQL in this codebase is not compatible with Snowflake, suggesting significant redesign may be necessary. To proceed, review the SQL Element Inventory for unsupported features and examine the EWI’s in the issues output to develop a migration strategy. For assistance, contact [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---

While readiness scores provide valuable insights, they should not be the only factor in determining a workload’s migration readiness. Consider multiple aspects of your migration plan alongside these scores, as they serve as an initial assessment rather than a complete evaluation. If you notice any readiness metrics that could be improved or aren’t accurately represented in the tool, [let us know](../../support/contact-us.md). The SMA team continuously works to enhance and refine these readiness measurements.

---
title: Snowpark Migration Accelerator: Release Notes
source: https://docs.snowflake.com/en/migrations/sma-docs/general/release-notes/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Release Notes

Note that the release notes below are organized by release date. Version numbers for both the application and the conversion core will appear below.

## Version 3.3.0 (Mar 30, 2026)

### Application & CLI Version: 3.3.0

#### Included SMA Core Version

* Snowpark Conversion Core: 8.1.88

### Engine Release Notes

#### Changed

* Included `FileName` column in the SnowConvert `Elements.csv` inventory.
* Updated Snowpark Scala library version from `1.17.0` to `1.18.0`, adding support for new overloads with `Any` parameter types on existing `Column` methods (`cast`, `equal_to`, `not_equal`, `gt`, `lt`, `leq`, `geq`, `equal_null`, `plus`, `minus`, `multiply`, `divide`, `mod`) and functions (`try_to_date`, `try_to_timestamp` with format parameter).
* `pyspark.sql.functions.array` with multiple arguments is now correctly mapped to `snowflake.snowpark.functions.array_construct` instead of `to_array`. Single-argument calls continue mapping to `to_array`.
* Session creation is now converted to the standard SCOS pattern.

  + E.g. `session.server.init_spark_session()` → `session.init_spark_session()`

### Desktop Release Notes

#### Added

* **.config folder validation:** The SMA app now detects and reports issues related to the dedicated `.config` folder configuration.

## Version 3.2.0 (Mar 13, 2026)

### Application & CLI Version: 3.2.0

### Engine Release Notes

#### Changed

* Updated .NET version to `v10.0.0`.
* Bumped Python AST and Parser version to `v149.1.23`.
* The SMA now correctly identifies and reports usages of `org.apache.spark.sql.functions.to_number` and `org.apache.spark.sql.functions.try_to_number` as unsupported elements within the Snowpark API.

### Desktop Release Notes

#### Added

* **Output folder validation:** The output folder selection now validates for existing project files (`.snowct`). If the selected folder already contains a project, the selection is blocked and an error is displayed, preventing accidental overwrites.
* **Dual-assessment workflow:** Automatically evaluates both SCOS and Snowpark API conversion paths, recommending the best fit for the user’s workload.
* **Two conversion modes:** Snowpark Connect (SCOS) and Snowpark API are now available as distinct conversion targets.
* **Readiness score section:** Displays the recommended conversion target with a color-coded compatibility score badge.
* **File compatibility breakdown:** KPI cards showing fully compatible files, files requiring changes, and files with unsupported APIs.
* **Data distribution charts:** Stacked bar charts showing input data sources and output data targets, grouped by platform.
* **Code dependencies:** Interactive donut chart categorizing dependencies as Supported, Internal, or Unknown.
* **Issues by category:** AI-enriched table grouping conversion issues into human-readable categories with file counts and key issue summaries. Displayed when a Snowflake session is active.
* **Execution summary:** Project metadata, engine version information, and input/output folder references.
* **Performance:** Assessment report data is cached for the duration of the session, enabling instant page loads when navigating back to results.

#### Changed

* **Card layout improvements:** Assessment and conversion cards were refactored for improving the user experience.
* **Assessment workflow shortcut:** If an assessment has already been completed, clicking “Analyze code” navigates directly to the results page instead of re-running the assessment.
* **Renamed connection action:** “Activate Assistant” is now “Connect to Snowflake” across the application header and connection dialog for clearer terminology.

## Version 3.1.0 (Feb 27, 2026)

### Application & CLI Version: 3.1.0

#### Included SMA Core Version

* Snowpark Conversion Core: 8.1.60

#### Included SnowConvert AI Version

* SnowConvert AI Version 2.2.0 ([Release Notes](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/release-notes/release-notes/README#version-2-2-0-jan-07-2026))

### Engine Release Notes

#### Added

* Added support for processing files located in a hidden folder (such as `.databricks` when exported from the source). These files are now correctly processed by the SMA.
* Added 245 new PySpark elements to the SMA mapping table with a NotSupported status. These entries correspond to functions and methods introduced in PySpark 3.3.0 through 4.1.x:

  + 219 functions (`pyspark.sql.functions`)
  + 4 DataFrame methods
  + 3 Column methods
  + 5 Session methods
  + 2 ReadWriter methods
  + 12 Types classes
* Added new EWIs for the following Pandas elements:

  + PNDSPY1019: pandas.core.arrays.datetimelike.DatelikeOps.strftime partial support
  + PNDSPY1020: pandas.core.arrays.datetimelike.TimelikeOps.ceil partial support
  + PNDSPY1021: pandas.core.arrays.datetimelike.TimelikeOps.floor partial support
  + PNDSPY1022: pandas.core.arrays.datetimelike.TimelikeOps.round partial support
  + PNDSPY1023: pandas.core.arrays.datetimes.DatetimeArray.day_name partial support
  + PNDSPY1024: pandas.core.arrays.datetimes.DatetimeArray.month_name partial support
  + PNDSPY1025: pandas.core.arrays.datetimes.DatetimeArray.tz_convert partial support
  + PNDSPY1026: pandas.core.arrays.datetimes.DatetimeArray.tz_localize partial support
  + PNDSPY1027: pandas.core.base.IndexOpsMixin.argmax partial support
  + PNDSPY1028: pandas.core.base.IndexOpsMixin.argmin partial support
  + PNDSPY1029: pandas.core.base.IndexOpsMixin.value_counts partial support
  + PNDSPY1030: pandas.core.frame.DataFrame.T partial support
  + PNDSPY1031: pandas.core.frame.DataFrame._\*dataframe\*_ partial support
  + PNDSPY1032: pandas.core.frame.DataFrame.add partial support
  + PNDSPY1033: pandas.core.frame.DataFrame.align partial support
  + PNDSPY1034: pandas.core.frame.DataFrame.all partial support
  + PNDSPY1035: pandas.core.frame.DataFrame.any partial support
  + PNDSPY1036: pandas.core.frame.DataFrame.applymap partial support
  + PNDSPY1037: pandas.core.frame.DataFrame.asfreq partial support
  + PNDSPY1038: pandas.core.frame.DataFrame.astype partial support
  + PNDSPY1039: pandas.core.frame.DataFrame.at partial support
  + PNDSPY1040: pandas.core.frame.DataFrame.backfill partial support
  + PNDSPY1041: pandas.core.frame.DataFrame.bfill partial support
  + PNDSPY1042: pandas.core.frame.DataFrame.compare partial support
  + PNDSPY1043: pandas.core.frame.DataFrame.corr partial support
  + PNDSPY1044: pandas.core.frame.DataFrame.cumsum partial support
  + PNDSPY1045: pandas.core.frame.DataFrame.div partial support
  + PNDSPY1046: pandas.core.frame.DataFrame.divide partial support
  + PNDSPY1047: pandas.core.frame.DataFrame.dropna partial support
  + PNDSPY1048: pandas.core.frame.DataFrame.eq partial support
  + PNDSPY1049: pandas.core.frame.DataFrame.eval partial support
  + PNDSPY1050: pandas.core.frame.DataFrame.expanding partial support
  + PNDSPY1051: pandas.core.frame.DataFrame.ffill partial support
  + PNDSPY1052: pandas.core.frame.DataFrame.fillna partial support
  + PNDSPY1053: pandas.core.frame.DataFrame.floordiv partial support
  + PNDSPY1054: pandas.core.frame.DataFrame.from_records partial support
  + PNDSPY1055: pandas.core.frame.DataFrame.ge partial support
  + PNDSPY1056: pandas.core.frame.DataFrame.groupby partial support
  + PNDSPY1057: pandas.core.frame.DataFrame.gt partial support
  + PNDSPY1058: pandas.core.frame.DataFrame.idxmax partial support
  + PNDSPY1059: pandas.core.frame.DataFrame.idxmin partial support
  + PNDSPY1060: pandas.core.frame.DataFrame.info partial support
  + PNDSPY1061: pandas.core.frame.DataFrame.join partial support
  + PNDSPY1062: pandas.core.frame.DataFrame.le partial support
  + PNDSPY1063: pandas.core.frame.DataFrame.loc partial support
  + PNDSPY1064: pandas.core.frame.DataFrame.lt partial support
  + PNDSPY1065: pandas.core.frame.DataFrame.map partial support
  + PNDSPY1066: pandas.core.frame.DataFrame.mask partial support
  + PNDSPY1067: pandas.core.frame.DataFrame.melt partial support
  + PNDSPY1068: pandas.core.frame.DataFrame.merge partial support
  + PNDSPY1069: pandas.core.frame.DataFrame.mod partial support
  + PNDSPY1070: pandas.core.frame.DataFrame.mul partial support
  + PNDSPY1071: pandas.core.frame.DataFrame.multiply partial support
  + PNDSPY1072: pandas.core.frame.DataFrame.ne partial support
  + PNDSPY1073: pandas.core.frame.DataFrame.nlargest partial support
  + PNDSPY1074: pandas.core.frame.DataFrame.nsmallest partial support
  + PNDSPY1075: pandas.core.frame.DataFrame.nunique partial support
  + PNDSPY1076: pandas.core.frame.DataFrame.pad partial support
  + PNDSPY1077: pandas.core.frame.DataFrame.pct_change partial support
  + PNDSPY1078: pandas.core.frame.DataFrame.pivot partial support
  + PNDSPY1079: pandas.core.frame.DataFrame.pivot_table partial support
  + PNDSPY1080: pandas.core.frame.DataFrame.pow partial support
  + PNDSPY1081: pandas.core.frame.DataFrame.quantile partial support
  + PNDSPY1082: pandas.core.frame.DataFrame.radd partial support
  + PNDSPY1083: pandas.core.frame.DataFrame.rank partial support
  + PNDSPY1084: pandas.core.frame.DataFrame.rdiv partial support
  + PNDSPY1085: pandas.core.frame.DataFrame.reindex partial support
  + PNDSPY1086: pandas.core.frame.DataFrame.rename partial support
  + PNDSPY1087: pandas.core.frame.DataFrame.replace partial support
  + PNDSPY1088: pandas.core.frame.DataFrame.resample partial support
  + PNDSPY1089: pandas.core.frame.DataFrame.rfloordiv partial support
  + PNDSPY1090: pandas.core.frame.DataFrame.rmod partial support
  + PNDSPY1091: pandas.core.frame.DataFrame.rmul partial support
  + PNDSPY1092: pandas.core.frame.DataFrame.rolling partial support
  + PNDSPY1093: pandas.core.frame.DataFrame.round partial support
  + PNDSPY1094: pandas.core.frame.DataFrame.rpow partial support
  + PNDSPY1095: pandas.core.frame.DataFrame.rsub partial support
  + PNDSPY1096: pandas.core.frame.DataFrame.rtruediv partial support
  + PNDSPY1097: pandas.core.frame.DataFrame.sample partial support
  + PNDSPY1098: pandas.core.frame.DataFrame.shift partial support
  + PNDSPY1099: pandas.core.frame.DataFrame.skew partial support
  + PNDSPY1100: pandas.core.frame.DataFrame.sort_index partial support
  + PNDSPY1101: pandas.core.frame.DataFrame.sort_values partial support
  + PNDSPY1102: pandas.core.frame.DataFrame.stack partial support
  + PNDSPY1103: pandas.core.frame.DataFrame.std partial support
  + PNDSPY1104: pandas.core.frame.DataFrame.sub partial support
  + PNDSPY1105: pandas.core.frame.DataFrame.subtract partial support
  + PNDSPY1106: pandas.core.frame.DataFrame.to_csv partial support
  + PNDSPY1107: pandas.core.frame.DataFrame.transform partial support
  + PNDSPY1108: pandas.core.frame.DataFrame.transpose partial support
  + PNDSPY1109: pandas.core.frame.DataFrame.truediv partial support
  + PNDSPY1110: pandas.core.frame.DataFrame.tz_convert partial support
  + PNDSPY1111: pandas.core.frame.DataFrame.tz_localize partial support
  + PNDSPY1112: pandas.core.frame.DataFrame.unstack partial support
  + PNDSPY1113: pandas.core.frame.DataFrame.var partial support
  + PNDSPY1114: pandas.core.frame.DataFrame.where partial support
  + PNDSPY1115: pandas.core.generic.NDFrame.shift partial support
  + PNDSPY1116: pandas.core.groupby.generic.DataFrameGroupBy.agg partial support
  + PNDSPY1117: pandas.core.groupby.generic.DataFrameGroupBy.aggregate partial support
  + PNDSPY1118: pandas.core.groupby.generic.DataFrameGroupBy.fillna partial support
  + PNDSPY1119: pandas.core.groupby.generic.DataFrameGroupBy.idxmax partial support
  + PNDSPY1120: pandas.core.groupby.generic.DataFrameGroupBy.idxmin partial support
  + PNDSPY1121: pandas.core.groupby.generic.DataFrameGroupBy.transform partial support
  + PNDSPY1122: pandas.core.groupby.generic.DataFrameGroupBy.value_counts partial support
  + PNDSPY1123: pandas.core.groupby.groupby.BaseGroupBy.get_group partial support
  + PNDSPY1124: pandas.core.groupby.groupby.GroupBy.all partial support
  + PNDSPY1125: pandas.core.groupby.groupby.GroupBy.any partial support
  + PNDSPY1126: pandas.core.groupby.groupby.GroupBy.apply partial support
  + PNDSPY1127: pandas.core.groupby.groupby.GroupBy.bfill partial support
  + PNDSPY1128: pandas.core.groupby.groupby.GroupBy.ffill partial support
  + PNDSPY1129: pandas.core.groupby.groupby.GroupBy.first partial support
  + PNDSPY1130: pandas.core.groupby.groupby.GroupBy.last partial support
  + PNDSPY1131: pandas.core.groupby.groupby.GroupBy.pct_change partial support
  + PNDSPY1132: pandas.core.groupby.groupby.GroupBy.quantile partial support
  + PNDSPY1133: pandas.core.groupby.groupby.GroupBy.resample partial support
  + PNDSPY1134: pandas.core.groupby.groupby.GroupBy.rolling partial support
  + PNDSPY1135: pandas.core.groupby.groupby.GroupBy.shift partial support
  + PNDSPY1136: pandas.core.groupby.groupby.GroupBy.std partial support
  + PNDSPY1137: pandas.core.groupby.groupby.GroupBy.var partial support
  + PNDSPY1138: pandas.core.indexes.base.Index.all partial support
  + PNDSPY1139: pandas.core.indexes.base.Index.any partial support
  + PNDSPY1140: pandas.core.indexes.base.Index.nlevels partial support
  + PNDSPY1141: pandas.core.indexes.base.Index.reindex partial support
  + PNDSPY1142: pandas.core.indexes.base.Index.sort_values partial support
  + PNDSPY1143: pandas.core.indexes.datetimes.DatetimeIndex.ceil partial support
  + PNDSPY1144: pandas.core.indexes.datetimes.DatetimeIndex.day_name partial support
  + PNDSPY1145: pandas.core.indexes.datetimes.DatetimeIndex.floor partial support
  + PNDSPY1146: pandas.core.indexes.datetimes.DatetimeIndex.month_name partial support
  + PNDSPY1147: pandas.core.indexes.datetimes.DatetimeIndex.round partial support
  + PNDSPY1148: pandas.core.indexes.datetimes.DatetimeIndex.std partial support
  + PNDSPY1149: pandas.core.indexes.datetimes.DatetimeIndex.tz_convert partial support
  + PNDSPY1150: pandas.core.indexes.datetimes.DatetimeIndex.tz_localize partial support
  + PNDSPY1151: pandas.core.indexes.datetimes.bdate_range partial support
  + PNDSPY1152: pandas.core.indexes.datetimes.date_range partial support
  + PNDSPY1153: pandas.core.resample.Resampler.asfreq partial support
  + PNDSPY1154: pandas.core.resample.Resampler.bfill partial support
  + PNDSPY1155: pandas.core.resample.Resampler.ffill partial support
  + PNDSPY1156: pandas.core.resample.Resampler.fillna partial support
  + PNDSPY1157: pandas.core.resample.Resampler.first partial support
  + PNDSPY1158: pandas.core.resample.Resampler.last partial support
  + PNDSPY1159: pandas.core.resample.Resampler.quantile partial support
  + PNDSPY1160: pandas.core.resample.Resampler.std partial support
  + PNDSPY1161: pandas.core.resample.Resampler.var partial support
  + PNDSPY1162: pandas.core.reshape.concat.concat partial support
  + PNDSPY1163: pandas.core.reshape.melt.melt partial support
  + PNDSPY1164: pandas.core.reshape.merge.merge partial support
  + PNDSPY1165: pandas.core.reshape.merge.merge_asof partial support
  + PNDSPY1166: pandas.core.reshape.pivot.crosstab partial support
  + PNDSPY1167: pandas.core.reshape.pivot.pivot partial support
  + PNDSPY1168: pandas.core.reshape.pivot.pivot_table partial support
  + PNDSPY1169: pandas.core.reshape.tile.cut partial support
  + PNDSPY1170: pandas.core.reshape.tile.qcut partial support
  + PNDSPY1171: pandas.core.series.Series.add partial support
  + PNDSPY1172: pandas.core.series.Series.all partial support
  + PNDSPY1173: pandas.core.series.Series.any partial support
  + PNDSPY1174: pandas.core.series.Series.case_when partial support
  + PNDSPY1175: pandas.core.series.Series.compare partial support
  + PNDSPY1176: pandas.core.series.Series.cumsum partial support
  + PNDSPY1177: pandas.core.series.Series.div partial support
  + PNDSPY1178: pandas.core.series.Series.divide partial support
  + PNDSPY1179: pandas.core.series.Series.dropna partial support
  + PNDSPY1180: pandas.core.series.Series.eq partial support
  + PNDSPY1181: pandas.core.series.Series.flags partial support
  + PNDSPY1182: pandas.core.series.Series.floordiv partial support
  + PNDSPY1183: pandas.core.series.Series.ge partial support
  + PNDSPY1184: pandas.core.series.Series.groupby partial support
  + PNDSPY1185: pandas.core.series.Series.gt partial support
  + PNDSPY1186: pandas.core.series.Series.le partial support
  + PNDSPY1187: pandas.core.series.Series.lt partial support
  + PNDSPY1188: pandas.core.series.Series.map partial support
  + PNDSPY1189: pandas.core.series.Series.mod partial support
  + PNDSPY1190: pandas.core.series.Series.mul partial support
  + PNDSPY1191: pandas.core.series.Series.multiply partial support
  + PNDSPY1192: pandas.core.series.Series.ne partial support
  + PNDSPY1193: pandas.core.series.Series.nlargest partial support
  + PNDSPY1194: pandas.core.series.Series.nsmallest partial support
  + PNDSPY1195: pandas.core.series.Series.pow partial support
  + PNDSPY1196: pandas.core.series.Series.quantile partial support
  + PNDSPY1197: pandas.core.series.Series.radd partial support
  + PNDSPY1198: pandas.core.series.Series.rdiv partial support
  + PNDSPY1199: pandas.core.series.Series.reindex partial support
  + PNDSPY1200: pandas.core.series.Series.rename partial support
  + PNDSPY1201: pandas.core.series.Series.rfloordiv partial support
  + PNDSPY1202: pandas.core.series.Series.rmod partial support
  + PNDSPY1203: pandas.core.series.Series.rmul partial support
  + PNDSPY1204: pandas.core.series.Series.rpow partial support
  + PNDSPY1205: pandas.core.series.Series.rsub partial support
  + PNDSPY1206: pandas.core.series.Series.rtruediv partial support
  + PNDSPY1207: pandas.core.series.Series.skew partial support
  + PNDSPY1208: pandas.core.series.Series.sort_index partial support
  + PNDSPY1209: pandas.core.series.Series.sort_values partial support
  + PNDSPY1210: pandas.core.series.Series.std partial support
  + PNDSPY1211: pandas.core.series.Series.sub partial support
  + PNDSPY1212: pandas.core.series.Series.subtract partial support
  + PNDSPY1213: pandas.core.series.Series.truediv partial support
  + PNDSPY1214: pandas.core.series.Series.unstack partial support
  + PNDSPY1215: pandas.core.series.Series.var partial support
  + PNDSPY1216: pandas.core.strings.accessor.StringMethods._\*getitem\*_ partial support
  + PNDSPY1217: pandas.core.strings.accessor.StringMethods.contains partial support
  + PNDSPY1218: pandas.core.strings.accessor.StringMethods.endswith partial support
  + PNDSPY1219: pandas.core.strings.accessor.StringMethods.get partial support
  + PNDSPY1220: pandas.core.strings.accessor.StringMethods.isdigit partial support
  + PNDSPY1221: pandas.core.strings.accessor.StringMethods.len partial support
  + PNDSPY1222: pandas.core.strings.accessor.StringMethods.lstrip partial support
  + PNDSPY1223: pandas.core.strings.accessor.StringMethods.replace partial support
  + PNDSPY1224: pandas.core.strings.accessor.StringMethods.rstrip partial support
  + PNDSPY1225: pandas.core.strings.accessor.StringMethods.slice partial support
  + PNDSPY1226: pandas.core.strings.accessor.StringMethods.split partial support
  + PNDSPY1227: pandas.core.strings.accessor.StringMethods.startswith partial support
  + PNDSPY1228: pandas.core.strings.accessor.StringMethods.strip partial support
  + PNDSPY1229: pandas.core.strings.accessor.StringMethods.translate partial support
  + PNDSPY1230: pandas.core.tools.datetimes.to_datetime partial support
  + PNDSPY1231: pandas.core.tools.numeric.to_numeric partial support
  + PNDSPY1232: pandas.core.tools.timedeltas.to_timedelta partial support
  + PNDSPY1233: pandas.core.window.ewm.ExponentialMovingWindow.corr partial support
  + PNDSPY1234: pandas.core.window.ewm.ExponentialMovingWindow.mean partial support
  + PNDSPY1235: pandas.core.window.ewm.ExponentialMovingWindow.std partial support
  + PNDSPY1236: pandas.core.window.ewm.ExponentialMovingWindow.sum partial support
  + PNDSPY1237: pandas.core.window.ewm.ExponentialMovingWindow.var partial support
  + PNDSPY1238: pandas.core.window.expanding.Expanding.corr partial support
  + PNDSPY1239: pandas.core.window.expanding.Expanding.count partial support
  + PNDSPY1240: pandas.core.window.expanding.Expanding.max partial support
  + PNDSPY1241: pandas.core.window.expanding.Expanding.mean partial support
  + PNDSPY1242: pandas.core.window.expanding.Expanding.min partial support
  + PNDSPY1243: pandas.core.window.expanding.Expanding.sem partial support
  + PNDSPY1244: pandas.core.window.expanding.Expanding.std partial support
  + PNDSPY1245: pandas.core.window.expanding.Expanding.sum partial support
  + PNDSPY1246: pandas.core.window.expanding.Expanding.var partial support
  + PNDSPY1247: pandas.core.window.rolling.Rolling.corr partial support
  + PNDSPY1248: pandas.core.window.rolling.Rolling.count partial support
  + PNDSPY1249: pandas.core.window.rolling.Rolling.max partial support
  + PNDSPY1250: pandas.core.window.rolling.Rolling.mean partial support
  + PNDSPY1251: pandas.core.window.rolling.Rolling.min partial support
  + PNDSPY1252: pandas.core.window.rolling.Rolling.sem partial support
  + PNDSPY1253: pandas.core.window.rolling.Rolling.std partial support
  + PNDSPY1254: pandas.core.window.rolling.Rolling.sum partial support
  + PNDSPY1255: pandas.core.window.rolling.Rolling.var partial support
  + PNDSPY1256: pandas.core.window.rolling.Window.mean partial support
  + PNDSPY1257: pandas.core.window.rolling.Window.std partial support
  + PNDSPY1258: pandas.core.window.rolling.Window.sum partial support
  + PNDSPY1259: pandas.core.window.rolling.Window.var partial support
  + PNDSPY1260: pandas.io.json._json.read_json partial support
  + PNDSPY1261: pandas.io.parquet.read_parquet partial support
  + PNDSPY1262: pandas.io.parsers.readers.read_csv partial support

#### Changed

* Updated the sfutils library implementation to support multiple levels of notebooks calls
* Upgraded supported Snowpark Python version from `v1.41.0` to `v1.43.0`. This upgrade includes the following mapping status changes:

  **NotSupported → Direct (8 functions):**

  + `pyspark.sql.functions.bool_and` → `snowflake.snowpark.functions.booland_agg`
  + `pyspark.sql.functions.bucket` → `snowflake.snowpark.functions.bucket`
  + `pyspark.sql.functions.cot` → `snowflake.snowpark.functions.cot`
  + `pyspark.sql.functions.day` → `snowflake.snowpark.functions.day`
  + `pyspark.sql.functions.every` → `snowflake.snowpark.functions.booland_agg`
  + `pyspark.sql.functions.pi` → `snowflake.snowpark.functions.pi`
  + `pyspark.sql.functions.width_bucket` → `snowflake.snowpark.functions.width_bucket`
  + `pyspark.sql.functions.zeroifnull` → `snowflake.snowpark.functions.zeroifnull`

**NotSupported → Rename (1 function):**

> * `pyspark.sql.functions.uuid` → `snowflake.snowpark.functions.uuid_string`

* Upgraded supported Snowpark Pandas version from `v1.41.0` to `v1.43.0`.
* The mapping status of the following Pandas elements were updated:

  **NotSupported → Direct (56 functions):**

  + `pandas.core.arrays.datetimes.DatetimeArray.date`
  + `pandas.core.arrays.datetimes.DatetimeArray.normalize`
  + `pandas.core.arrays.datetimes.DatetimeArray.time`
  + `pandas.core.base.IndexOpsMixin.T`
  + `pandas.core.base.IndexOpsMixin.empty`
  + `pandas.core.base.IndexOpsMixin.is_monotonic_decreasing`
  + `pandas.core.base.IndexOpsMixin.is_monotonic_increasing`
  + `pandas.core.base.IndexOpsMixin.is_unique`
  + `pandas.core.base.IndexOpsMixin.item`
  + `pandas.core.base.IndexOpsMixin.ndim`
  + `pandas.core.base.IndexOpsMixin.nunique`
  + `pandas.core.base.IndexOpsMixin.shape`
  + `pandas.core.base.IndexOpsMixin.size`
  + `pandas.core.base.IndexOpsMixin.to_list`
  + `pandas.core.base.IndexOpsMixin.to_numpy`
  + `pandas.core.base.IndexOpsMixin.tolist`
  + `pandas.core.base.IndexOpsMixin.transpose`
  + `pandas.core.generic.NDFrame.abs`
  + `pandas.core.generic.NDFrame.add_prefix`
  + `pandas.core.generic.NDFrame.add_suffix`
  + `pandas.core.generic.NDFrame.attrs`
  + `pandas.core.generic.NDFrame.copy`
  + `pandas.core.generic.NDFrame.describe`
  + `pandas.core.generic.NDFrame.dtypes`
  + `pandas.core.generic.NDFrame.equals`
  + `pandas.core.generic.NDFrame.first`
  + `pandas.core.generic.NDFrame.first_valid_index`
  + `pandas.core.generic.NDFrame.get`
  + `pandas.core.generic.NDFrame.head`
  + `pandas.core.generic.NDFrame.keys`
  + `pandas.core.generic.NDFrame.last`
  + `pandas.core.generic.NDFrame.last_valid_index`
  + `pandas.core.generic.NDFrame.ndim`
  + `pandas.core.generic.NDFrame.size`
  + `pandas.core.generic.NDFrame.squeeze`
  + `pandas.core.generic.NDFrame.tail`
  + `pandas.core.generic.NDFrame.take`
  + `pandas.core.generic.NDFrame.to_excel`
  + `pandas.core.groupby.groupby.BaseGroupBy.groups`
  + `pandas.core.groupby.groupby.GroupBy.count`
  + `pandas.core.groupby.groupby.GroupBy.cumcount`
  + `pandas.core.groupby.groupby.GroupBy.cummax`
  + `pandas.core.groupby.groupby.GroupBy.cummin`
  + `pandas.core.groupby.groupby.GroupBy.cumsum`
  + `pandas.core.groupby.groupby.GroupBy.head`
  + `pandas.core.groupby.groupby.GroupBy.max`
  + `pandas.core.groupby.groupby.GroupBy.mean`
  + `pandas.core.groupby.groupby.GroupBy.median`
  + `pandas.core.groupby.groupby.GroupBy.min`
  + `pandas.core.groupby.groupby.GroupBy.rank`
  + `pandas.core.groupby.groupby.GroupBy.size`
  + `pandas.core.groupby.groupby.GroupBy.tail`
  + `pandas.core.indexes.datetimes.DatetimeIndex.year`
  + `pandas.core.indexing.IndexingMixin.iat`
  + `pandas.core.indexing.IndexingMixin.iloc`
  + `pandas.core.series.Series.first`

**NotSupported → Partial (70 functions):**

> * `pandas.core.arrays.datetimelike.DatelikeOps.strftime` (PNDSPY1019)
> * `pandas.core.arrays.datetimelike.TimelikeOps.ceil` (PNDSPY1020)
> * `pandas.core.arrays.datetimelike.TimelikeOps.floor` (PNDSPY1021)
> * `pandas.core.arrays.datetimelike.TimelikeOps.round` (PNDSPY1022)
> * `pandas.core.arrays.datetimes.DatetimeArray.day_name` (PNDSPY1023)
> * `pandas.core.arrays.datetimes.DatetimeArray.month_name` (PNDSPY1024)
> * `pandas.core.arrays.datetimes.DatetimeArray.tz_convert` (PNDSPY1025)
> * `pandas.core.arrays.datetimes.DatetimeArray.tz_localize` (PNDSPY1026)
> * `pandas.core.base.IndexOpsMixin.argmax` (PNDSPY1027)
> * `pandas.core.base.IndexOpsMixin.argmin` (PNDSPY1028)
> * `pandas.core.base.IndexOpsMixin.value_counts` (PNDSPY1029)
> * `pandas.core.frame.DataFrame.eval` (PNDSPY1049)
> * `pandas.core.frame.DataFrame.expanding` (PNDSPY1050)
> * `pandas.core.frame.DataFrame.melt` (PNDSPY1067)
> * `pandas.core.frame.DataFrame.pct_change` (PNDSPY1077)
> * `pandas.core.frame.DataFrame.quantile` (PNDSPY1081)
> * `pandas.core.frame.DataFrame.std` (PNDSPY1103)
> * `pandas.core.generic.NDFrame.asfreq` (PNDSPY1037)
> * `pandas.core.generic.NDFrame.fillna` (PNDSPY1052)
> * `pandas.core.generic.NDFrame.mask` (PNDSPY1066)
> * `pandas.core.generic.NDFrame.pct_change` (PNDSPY1077)
> * `pandas.core.generic.NDFrame.rank` (PNDSPY1083)
> * `pandas.core.generic.NDFrame.replace` (PNDSPY1087)
> * `pandas.core.generic.NDFrame.shift` (PNDSPY1115)
> * `pandas.core.generic.NDFrame.to_csv` (PNDSPY1106)
> * `pandas.core.generic.NDFrame.tz_convert` (PNDSPY1110)
> * `pandas.core.generic.NDFrame.tz_localize` (PNDSPY1111)
> * `pandas.core.generic.NDFrame.where` (PNDSPY1114)
> * `pandas.core.groupby.generic.DataFrameGroupBy.transform` (PNDSPY1121)
> * `pandas.core.groupby.generic.DataFrameGroupBy.value_counts` (PNDSPY1122)
> * `pandas.core.groupby.groupby.BaseGroupBy.get_group` (PNDSPY1123)
> * `pandas.core.groupby.groupby.GroupBy.bfill` (PNDSPY1127)
> * `pandas.core.groupby.groupby.GroupBy.first` (PNDSPY1129)
> * `pandas.core.groupby.groupby.GroupBy.last` (PNDSPY1130)
> * `pandas.core.groupby.groupby.GroupBy.quantile` (PNDSPY1132)
> * `pandas.core.groupby.groupby.GroupBy.resample` (PNDSPY1133)
> * `pandas.core.groupby.groupby.GroupBy.rolling` (PNDSPY1134)
> * `pandas.core.groupby.groupby.GroupBy.shift` (PNDSPY1135)
> * `pandas.core.groupby.groupby.GroupBy.std` (PNDSPY1136)
> * `pandas.core.groupby.groupby.GroupBy.var` (PNDSPY1137)
> * `pandas.core.indexes.base.Index.nlevels` (PNDSPY1140)
> * `pandas.core.indexes.base.Index.sort_values` (PNDSPY1142)
> * `pandas.core.indexing.IndexingMixin.at` (PNDSPY1039)
> * `pandas.core.indexing.IndexingMixin.loc` (PNDSPY1063)
> * `pandas.core.resample.Resampler.ffill` (PNDSPY1155)
> * `pandas.core.resample.Resampler.first` (PNDSPY1157)
> * `pandas.core.resample.Resampler.last` (PNDSPY1158)
> * `pandas.core.resample.Resampler.std` (PNDSPY1160)
> * `pandas.core.resample.Resampler.var` (PNDSPY1161)
> * `pandas.core.reshape.merge.merge_asof` (PNDSPY1165)
> * `pandas.core.reshape.pivot.pivot` (PNDSPY1167)
> * `pandas.core.series.Series.expanding` (PNDSPY1050)
> * `pandas.core.series.Series.pct_change` (PNDSPY1077)
> * `pandas.core.window.ewm.ExponentialMovingWindow.corr` (PNDSPY1233)
> * `pandas.core.window.ewm.ExponentialMovingWindow.mean` (PNDSPY1234)
> * `pandas.core.window.ewm.ExponentialMovingWindow.std` (PNDSPY1235)
> * `pandas.core.window.ewm.ExponentialMovingWindow.sum` (PNDSPY1236)
> * `pandas.core.window.ewm.ExponentialMovingWindow.var` (PNDSPY1237)
> * `pandas.core.window.expanding.Expanding.corr` (PNDSPY1238)
> * `pandas.core.window.expanding.Expanding.max` (PNDSPY1240)
> * `pandas.core.window.expanding.Expanding.mean` (PNDSPY1241)
> * `pandas.core.window.expanding.Expanding.min` (PNDSPY1242)
> * `pandas.core.window.expanding.Expanding.sem` (PNDSPY1243)
> * `pandas.core.window.expanding.Expanding.std` (PNDSPY1244)
> * `pandas.core.window.expanding.Expanding.sum` (PNDSPY1245)
> * `pandas.core.window.expanding.Expanding.var` (PNDSPY1246)
> * `pandas.core.window.rolling.Window.mean` (PNDSPY1256)
> * `pandas.core.window.rolling.Window.std` (PNDSPY1257)
> * `pandas.core.window.rolling.Window.sum` (PNDSPY1258)
> * `pandas.core.window.rolling.Window.var` (PNDSPY1259)

**(new) → Direct (74 functions):**

> * `pandas.core.arrays.datetimes.DatetimeArray.day`
> * `pandas.core.arrays.datetimes.DatetimeArray.day_of_week`
> * `pandas.core.arrays.datetimes.DatetimeArray.day_of_year`
> * `pandas.core.arrays.datetimes.DatetimeArray.dayofweek`
> * `pandas.core.arrays.datetimes.DatetimeArray.dayofyear`
> * `pandas.core.arrays.datetimes.DatetimeArray.days_in_month`
> * `pandas.core.arrays.datetimes.DatetimeArray.daysinmonth`
> * `pandas.core.arrays.datetimes.DatetimeArray.hour`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_leap_year`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_month_end`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_month_start`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_quarter_end`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_quarter_start`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_year_end`
> * `pandas.core.arrays.datetimes.DatetimeArray.is_year_start`
> * `pandas.core.arrays.datetimes.DatetimeArray.isocalendar`
> * `pandas.core.arrays.datetimes.DatetimeArray.microsecond`
> * `pandas.core.arrays.datetimes.DatetimeArray.minute`
> * `pandas.core.arrays.datetimes.DatetimeArray.month`
> * `pandas.core.arrays.datetimes.DatetimeArray.nanosecond`
> * `pandas.core.arrays.datetimes.DatetimeArray.quarter`
> * `pandas.core.arrays.datetimes.DatetimeArray.second`
> * `pandas.core.arrays.datetimes.DatetimeArray.weekday`
> * `pandas.core.arrays.datetimes.DatetimeArray.year`
> * `pandas.core.arrays.timedeltas.TimedeltaArray.days`
> * `pandas.core.arrays.timedeltas.TimedeltaArray.microseconds`
> * `pandas.core.arrays.timedeltas.TimedeltaArray.nanoseconds`
> * `pandas.core.arrays.timedeltas.TimedeltaArray.seconds`
> * `pandas.core.frame.DataFrame.flags`
> * `pandas.core.generic.NDFrame.flags`
> * `pandas.core.generic.NDFrame.rename_axis`
> * `pandas.core.groupby.groupby.BaseGroupBy.__iter__`
> * `pandas.core.groupby.groupby.BaseGroupBy.__len__`
> * `pandas.core.groupby.groupby.GroupBy.sum`
> * `pandas.core.indexes.base.Index.T`
> * `pandas.core.indexes.datetimes.DatetimeIndex.date`
> * `pandas.core.indexes.datetimes.DatetimeIndex.day`
> * `pandas.core.indexes.datetimes.DatetimeIndex.day_of_week`
> * `pandas.core.indexes.datetimes.DatetimeIndex.day_of_year`
> * `pandas.core.indexes.datetimes.DatetimeIndex.dayofweek`
> * `pandas.core.indexes.datetimes.DatetimeIndex.dayofyear`
> * `pandas.core.indexes.datetimes.DatetimeIndex.hour`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_month_end`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_month_start`
> * `pandas.core.indexes.datetimes.DatetimeIndex.mean`
> * `pandas.core.indexes.datetimes.DatetimeIndex.microsecond`
> * `pandas.core.indexes.datetimes.DatetimeIndex.minute`
> * `pandas.core.indexes.datetimes.DatetimeIndex.month`
> * `pandas.core.indexes.datetimes.DatetimeIndex.nanosecond`
> * `pandas.core.indexes.datetimes.DatetimeIndex.normalize`
> * `pandas.core.indexes.datetimes.DatetimeIndex.quarter`
> * `pandas.core.indexes.datetimes.DatetimeIndex.second`
> * `pandas.core.indexes.timedeltas.TimedeltaIndex.total_seconds`
> * `pandas.core.series.Series.info` (PNDSPY1018)
> * `pandas.core.series.Series.tolist`
> * `pandas.core.strings.accessor.StringMethods.capitalize`
> * `pandas.core.strings.accessor.StringMethods.center`
> * `pandas.core.strings.accessor.StringMethods.count`
> * `pandas.core.strings.accessor.StringMethods.islower`
> * `pandas.core.strings.accessor.StringMethods.istitle`
> * `pandas.core.strings.accessor.StringMethods.isupper`
> * `pandas.core.strings.accessor.StringMethods.ljust`
> * `pandas.core.strings.accessor.StringMethods.lower`
> * `pandas.core.strings.accessor.StringMethods.match`
> * `pandas.core.strings.accessor.StringMethods.pad`
> * `pandas.core.strings.accessor.StringMethods.rjust`
> * `pandas.core.strings.accessor.StringMethods.title`
> * `pandas.core.strings.accessor.StringMethods.upper`
> * `snowpark_pandas.read_snowflake`
> * `snowpark_pandas.to_dynamic_table`
> * `snowpark_pandas.to_iceberg`
> * `snowpark_pandas.to_pandas`
> * `snowpark_pandas.to_snowflake`
> * `snowpark_pandas.to_view`

**(new) → Partial (47 functions):**

> * `pandas.core.frame.DataFrame.__dataframe__` (PNDSPY1031)
> * `pandas.core.frame.DataFrame.pad` (PNDSPY1076)
> * `pandas.core.generic.NDFrame.align` (PNDSPY1033)
> * `pandas.core.generic.NDFrame.astype` (PNDSPY1038)
> * `pandas.core.generic.NDFrame.expanding` (PNDSPY1050)
> * `pandas.core.generic.NDFrame.ffill` (PNDSPY1051)
> * `pandas.core.generic.NDFrame.interpolate` (PNDSPY1015)
> * `pandas.core.generic.NDFrame.pad` (PNDSPY1076)
> * `pandas.core.generic.NDFrame.resample` (PNDSPY1088)
> * `pandas.core.generic.NDFrame.rolling` (PNDSPY1092)
> * `pandas.core.generic.NDFrame.sample` (PNDSPY1097)
> * `pandas.core.groupby.groupby.GroupBy.all` (PNDSPY1124)
> * `pandas.core.groupby.groupby.GroupBy.any` (PNDSPY1125)
> * `pandas.core.groupby.groupby.GroupBy.apply` (PNDSPY1126)
> * `pandas.core.indexes.base.Index.all` (PNDSPY1138)
> * `pandas.core.indexes.base.Index.any` (PNDSPY1139)
> * `pandas.core.indexes.base.Index.reindex` (PNDSPY1141)
> * `pandas.core.indexes.base.Index.value_counts` (PNDSPY1029)
> * `pandas.core.indexes.datetimes.DatetimeIndex.tz_convert` (PNDSPY1149)
> * `pandas.core.indexes.datetimes.DatetimeIndex.tz_localize` (PNDSPY1150)
> * `pandas.core.series.Series.backfill` (PNDSPY1040)
> * `pandas.core.series.Series.bfill` (PNDSPY1041)
> * `pandas.core.series.Series.flags` (PNDSPY1181)
> * `pandas.core.series.Series.pad` (PNDSPY1076)
> * `pandas.core.strings.accessor.StringMethods.__getitem__` (PNDSPY1216)
> * `pandas.core.strings.accessor.StringMethods.contains` (PNDSPY1217)
> * `pandas.core.strings.accessor.StringMethods.endswith` (PNDSPY1218)
> * `pandas.core.strings.accessor.StringMethods.get` (PNDSPY1219)
> * `pandas.core.strings.accessor.StringMethods.isdigit` (PNDSPY1220)
> * `pandas.core.strings.accessor.StringMethods.len` (PNDSPY1221)
> * `pandas.core.strings.accessor.StringMethods.lstrip` (PNDSPY1222)
> * `pandas.core.strings.accessor.StringMethods.replace` (PNDSPY1223)
> * `pandas.core.strings.accessor.StringMethods.rstrip` (PNDSPY1224)
> * `pandas.core.strings.accessor.StringMethods.slice` (PNDSPY1225)
> * `pandas.core.strings.accessor.StringMethods.split` (PNDSPY1226)
> * `pandas.core.strings.accessor.StringMethods.startswith` (PNDSPY1227)
> * `pandas.core.strings.accessor.StringMethods.strip` (PNDSPY1228)
> * `pandas.core.strings.accessor.StringMethods.translate` (PNDSPY1229)
> * `pandas.core.window.rolling.Rolling.corr` (PNDSPY1247)
> * `pandas.core.window.rolling.Rolling.max` (PNDSPY1249)
> * `pandas.core.window.rolling.Rolling.mean` (PNDSPY1250)
> * `pandas.core.window.rolling.Rolling.min` (PNDSPY1251)
> * `pandas.core.window.rolling.Rolling.sem` (PNDSPY1252)
> * `pandas.core.window.rolling.Rolling.std` (PNDSPY1253)
> * `pandas.core.window.rolling.Rolling.sum` (PNDSPY1254)
> * `pandas.core.window.rolling.Rolling.var` (PNDSPY1255)
> * `pandas.io.json._json.read_json` (PNDSPY1260)

**Direct → Partial (12 functions):**

> * `pandas.core.frame.DataFrame.T` (PNDSPY1030)
> * `pandas.core.frame.DataFrame.any` (PNDSPY1035)
> * `pandas.core.frame.DataFrame.where` (PNDSPY1114)
> * `pandas.core.groupby.generic.DataFrameGroupBy.agg` (PNDSPY1116)
> * `pandas.core.indexes.datetimes.DatetimeIndex.round` (PNDSPY1147)
> * `pandas.core.reshape.tile.qcut` (PNDSPY1170)
> * `pandas.core.series.Series.astype` (PNDSPY1038)
> * `pandas.core.series.Series.groupby` (PNDSPY1184)
> * `pandas.core.series.Series.le` (PNDSPY1186)
> * `pandas.core.series.Series.loc` (PNDSPY1063)
> * `pandas.io.parquet.read_parquet` (PNDSPY1261)
> * `pandas.io.parsers.readers.read_csv` (PNDSPY1262)

**Partial → Direct (5 functions):**

> * `pandas.core.indexes.datetimes.DatetimeIndex.is_leap_year`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_quarter_end`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_quarter_start`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_year_end`
> * `pandas.core.indexes.datetimes.DatetimeIndex.is_year_start`

**Rename → Partial (4 functions):**

> * `pandas.core.frame.DataFrame.divide` (PNDSPY1046)
> * `pandas.core.frame.DataFrame.multiply` (PNDSPY1071)
> * `pandas.core.frame.DataFrame.subtract` (PNDSPY1105)
> * `pandas.core.series.Series.divide` (PNDSPY1178)

#### Fixed

* Fixed the “How to read through the scores” link on the assessment and conversion results page to ensure it correctly opens the readiness score documentation.

## Version 3.0.0 (Feb 12, 2026)

### Application & CLI Version: 3.0.0

#### Included SMA Core Version

* Snowpark Conversion Core: 8.1.55

### Engine Release Notes

#### Improvements

* **License-Free Conversion Mode:** A license or access code is no longer required to run SMA in Conversion mode.
* **Project Options Page:** A new Project Options page has been introduced to present the available workflows in the application, including “Code Analysis and Conversion”.
* **Technical Discovery Relocation:** The Technical Discovery section has been moved to the Project Creation page for a more streamlined project setup experience.
* **Simplified Conversion Setup:** The Conversion Setup page has been updated and no longer requires a license or access code.
* **Project File Extension:** The project file extension has changed from `.snowma` to `.snowct`.
* **Updated User Interface:** The user interface has been refreshed to align with the SnowConvert AI look and feel.

## Version 2.11.1 (Jan 30, 2026)

### Application & CLI Version: 2.11.1

#### Included SMA Core Version

* Snowpark Conversion Core: 8.1.55

### Engine Release Notes

#### Added

* Added SQL Language to the DetailedReport doc file.
* Added SQL configuration cell at the beginning of a converted Databricks-to-Jupyter transformation to be compatible with Snowflake notebooks.

#### Changed

* Updated the `%run` magic command transformation to append `.ipynb` extension to notebook paths.

  + For unquoted paths: `%run ./myNotebook` transforms to `%run ./myNotebook.ipynb`
  + For quoted paths: `%run "./myNotebook"` transforms to `%run "./myNotebook.ipynb"`
* Scala code in notebook cells will now be commented in a python cell during a notebook migration.
* Updated the conversion of `dbutils.run` to the `sfutils.notebook.run` function to handle notebook execution calls.
* Bumped the supported versions of Snowpark Python API and Snowpark Pandas API from `1.40.0` to `1.41.0`.
* Updated the mapping status for the following Pandas functions from NotSupported to Partial:

  + `pandas.core.frame.DataFrame.agg` → `modin.pandas.DataFrame.agg`
  + `pandas.core.frame.DataFrame.interpolate` → `modin.pandas.DataFrame.interpolate`
  + `pandas.core.reshape.encoding.get_dummies` → `modin.pandas.general.get_dummies`
  + `pandas.core.series.Series.agg` → `modin.pandas.Series.agg`
  + `pandas.core.series.Series.interpolate` → `modin.pandas.Series.interpolate`

#### Fixed

* SMA now will rename `.hql` (Hive SQL) files to `.sql` after conversion.
* The implicit cell for a DBX Scala Notebook when converting to Snowflake will be a python cell with an EWI. The Scala code will be commented out.
* Python cells from DBX SQL Notebooks will preserve the language metadata.

#### Removed

* Removed the previous `%run` transformation in DBX notebooks that generated `spark.sql("EXECUTE NOTEBOOK ...")` SQL statements.
* The SnowConvert MissingObjects report was absorbed by the MissingObjectReference report. The MissingObjects report will no longer be generated.

## Version 2.11.0 (Jan 9, 2026)

### Application & CLI Version: 2.11.0

#### Included SMA Core Version

* Snowpark Conversion Core: 8.1.43

#### Included SnowConvert AI Version

* SnowConvert AI Version 2.2.0 ([Release Notes](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/release-notes/release-notes/README#version-2-2-0-jan-07-2026))

### Engine Release Notes

#### Added

* **Enhanced Notebook Setup for Assessment:** When running an assessment on Databricks notebooks, a Snowpark Connect session is now automatically added to the first cell to simplify your setup.
* **Automatic Snowpark Connect Conversion:** The tool now automatically converts both `SparkSession` and `SparkContext` initializations in Python code to their equivalent Snowpark Connect sessions.
* **Improved Error Identification:**

  + Added a new warning code, `SPRKCNTPY4000`, to clearly flag any `SparkContext` elements that are not yet supported by Snowpark Connect.
  + The tool now automatically detects and flags unsupported Databricks utility calls (`dbutils` API) with the new warning code `SPRKDBX1004` during conversion.
* **More Detailed Reporting:**

  + The SparkUsagesInventory.csv report now includes a new column called `IS_SNOWPARK_CONNECT_TOOL_SUPPORTED`
  + This new column is to clearly indicate if a Spark element is supported directly by Snowpark Connect, or supported throught an SMA transformation.
  + The Snowpark Connect readiness score calculation has been updated to use the new `IS_SNOWPARK_CONNECT_TOOL_SUPPORTED` column in the SparkUsagesInventory.csv report.
* **Next-Generation Notebook Support:** Enhanced support for the VNext Snowflake Notebooks format when converting Databricks or Jupyter notebooks.

  + **Full VNext Compatibility:** The SMA can now generate output files that fully adhere to the VNext Snowflake Notebooks standard, regardless of whether the source was a Databricks or a previous-generation Jupyter notebook.
  + **Smarter Language Handling:** The conversion engine has been updated with enhanced logic to accurately detect and manage the specific language (such as Python or Scala) within each individual notebook cell. This allows for more precise and reliable cell-by-cell conversion.
  + **Enhanced Metadata for Cells:** The process now correctly incorporates necessary language and type metadata at the cell level during generation, which is essential for VNext Notebooks to function as expected.

#### Changed

* **Simplified Python Code:** For Snowpark Connect, unnecessary `.sparkContext` references in Python method calls are now removed to streamline your code.
* **Clearer Warning Codes:** Snowpark Connect warning codes are now renamed to include language-specific prefixes (e.g., `SPRKCNTPY` for Python, `SPRKCNTSCL` for Scala) for easier error identification.
* **More Accurate Notebook Conversions:** The conversion process for notebooks has been improved to correctly distinguish between Databricks and Jupyter formats, preventing incorrect modifications.

#### Fixed

* Fixed a bug in the artifact dependency inventory that incorrectly reported `.options()` configuration as a data source.

### Desktop Release Notes

#### Added

* **Technical Discovery View:** A new Technical Discovery View is now available in the desktop application.
* **SMA Assessment AI:** SMA desktop application is now directly integrated with an optional LLM interface.

  + Ask questions about your assessment results
  + Get help with how to approach the migration
  + Connect and deploy your assessment results directly into your Snowflake account.

#### Changed

* The Command Line Interface (CLI) parameter for controlling Jupyter conversion has been updated from `--enableJupyter` to `--disableJupyterConversion` for clearer functionality.

## Version 2.10.5 (Dec 3rd, 2025)

### Application & CLI Version: 2.10.5

#### Included SMA Core Versions

* Snowpark Conversion Core: 8.1.26

#### Included SnowConvert AI Version

* SnowConvert AI Version 2.0.57 (Release Notes: [SnowConvert AI - Recent Release Notes | Snowflake Documentation](https://docs.snowflake.com/en/migrations/snowconvert-docs/general/release-notes/release-notes/README))

### Engine Release Notes

#### Added

* The **Execution Summary** section of the `DetailedReport.docx` now indicates whether the SMA was run in Assessment or Conversion mode.

#### Changed

* Bumped the supported versions of Snowpark Python API and Snowpark Pandas API from `1.39.0` to `1.40.0`.

**PySpark Function Mapping Updates:**

**NotSupported** to **Rename**:

* `pyspark.sql.functions.unhex` → `snowflake.snowpark.functions.hex_decode_binary`

**Direct** to **Rename**:

* `pyspark.sql.functions.greatest` → `snowflake.snowpark.functions.greatest_ignore_nulls`
* `pyspark.sql.functions.least` → `snowflake.snowpark.functions.least_ignore_nulls`

**NotDefined** to **Rename**:

* `pyspark.sql.functions.bool_or` → `snowflake.snowpark.functions.boolor_agg`
* `pyspark.sql.functions.char` → `snowflake.snowpark.functions.chr`

**NotDefined** to **Direct**:

* `pyspark.sql.functions.nullif` → `snowflake.snowpark.functions.nullif`
* `pyspark.sql.functions.nvl2` → `snowflake.snowpark.functions.nvl2`

**Snowpark Pandas Function Mapping Updates:**

**NotSupported** to **Partial**:

* `modin.pandas.DataFrame.query` → `snowflake.snowpark.pandas.core.frame.DataFrame.query`
* Added a new EWI `PNDSPY1012` to indicate that `modin.pandas.DataFrame.query` does not support MultiIndex. The following example scenario illustrating this limitation is also included in the EWI documentation.

  ```python
  from snowflake.snowpark.modin import plugin
  import modin.pandas as pd # Snowpark pandas

  # Create a DataFrame with single-level index
  data = {
      'name': ['Alice', 'Bob', 'Charlie', 'David', 'Eve', 'Frank'],
      'age': [25, 30, 35, 28, 32, 45],
      'salary': [50000, 60000, 75000, 55000, 80000, 90000],
      'department': ['Sales', 'IT', 'HR', 'Sales', 'IT', 'HR']
  }
  df = pd.DataFrame(data)

  # Set a single-level index
  df = df.set_index('name')
  print("DataFrame with single-level index:")
  print(df)

  # Use query() - This works fine!
  #EWI: PNDSPY1012 => pandas.core.frame.DataFrame.query does not support DataFrames that have a row MultiIndex. Check Snowpark Pandas documentation for more details.
  result = df.query("age > 30 and salary < 85000")

  # Create a DataFrame with MultiIndex on rows
  data = {
      'A': [1, 2, 3, 4, 5, 6],
      'B': [10, 20, 30, 40, 50, 60],
      'C': ['x', 'y', 'x', 'y', 'x', 'y']
  }
  df = pd.DataFrame(data)

  # Create MultiIndex
  df = df.set_index([
      pd.Index(['group1', 'group1', 'group2', 'group2', 'group3', 'group3']),
      pd.Index(['a', 'b', 'a', 'b', 'a', 'b'])
  ])
  df.index.names = ['group', 'subgroup']

  # This will ERROR in Snowpark pandas!
  #EWI: PNDSPY1012 => pandas.core.frame.DataFrame.query does not support DataFrames that have
  ```

  **Recommended fix:** If the DataFrame contains a MultiIndex, it is necessary to validate the behavior of the `query()` method in Snowpark pandas. Ensure that the DataFrame structure is compatible with Snowpark pandas’ limitations, as MultiIndex rows are not supported. Consider restructuring the DataFrame to use a single-level index or alternative filtering methods.
* Updated all documentation links in the `DetailedReport.docx` to point to the official Snowflake documentation, replacing the legacy Snowpark Migration Accelerator site.
* Updated the Snowpark Connect readiness score descriptions in the `DetailedReport.docx` to match the SMA UI.
* Usages of `pyspark.sql.window.WindowSpec.orderBy` are now reported as supported by Snowpark Connect.

#### Fixed

* Fixed broken internal links in the `DetailedReport.docx` to ensure proper navigation between document sections.
* Added a `CellId` column to the issues inventory to easily identify the location of EWIs within notebook files.

## Version 2.10.4 (Nov 18, 2025)

### Application & CLI Version: 2.10.4

#### Included SMA Core Versions

* Snowpark Conversion Core: 8.1.8

### Engine Release Notes

#### Fixed

* Fixed an issue where the SMA generated corrupted Databricks notebook files in the output directory during Assessment mode execution.
* Fixed an issue where the SMA would crash if the input directory contained folders named “SMA_ConvertedNotebooks”.

## Version 2.10.3 (Oct 30, 2025)

### Application & CLI Version: 2.10.3

#### Included SMA Core Versions

* Snowpark Conversion Core: 8.1.7

### Engine Release Notes

#### Added

* Added the [Snowpark Connect readiness score](https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/readiness-scores#snowpark-connect-readiness-score). This new score measures the percentage of Spark API references in your codebase that are supported by [Snowpark Connect for Spark](https://docs.snowflake.com/en/developer-guide/snowpark-connect/snowpark-connect-overview).

  + This will now be the **only** score shown in assessment mode. To generate the [Snowpark API Readiness Score](https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/readiness-scores#snowpark-api-readiness-score), run the SMA in [conversion mode](https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/README).
* Added support for SQL embedded migration for literal string concatenations assigned to a local variable in the same scope of execution.

  + Included scenarios now include:
    .. code-block:: python

    > sqlStat = “SELECT colName “ + “FROM myTable”
    > session.sql(sqlStat)

#### Changed

* Updated the EWI URLs in the [Issues.csv](https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/output-reports/sma-inventories#issue-inventory) inventory to point to the [main Snowflake documentation site](https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/approach).

#### Fixed

* Fixed a code issue that caused inner project configuration files (e.g., pom.xml, build.sbt, build.gradle) to be incorrectly placed in the root of the output directory instead of the correct inner directories after migration.

### Desktop Release Notes

#### Added

* Added the Snowpark Connect readiness score and updated the assessment execution flow.

  + When running the application in assessment mode, **only** the Snowpark Connect readiness score is now displayed.
  + When running the application in conversion mode, the Snowpark API readiness score is displayed (the Snowpark Connect Readiness will **not** be shown).

#### Changed

Updated all in-application documentation links to point to the official [Snowflake documentation](https://docs.snowflake.com/en/migrations/sma-docs/README), replacing the legacy [SnowConvert](https://docs.snowconvert.com/sma) site.

## Version 2.10.2 (Oct 27, 2025)

### Application & CLI Version 2.10.2

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.73

#### Fixed

* Fixed an issue where the Snowpark Migration Accelerator failed converting DBC files into Jupyter Notebooks properly.

## Version 2.10.1 (Oct 23, 2025)

### Application & CLI Version 2.10.1

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.72

#### Added

* Added support for Snowpark Scala v1.17.0:

**From Not Supported to Direct:**

**Dataset:**

* `org.apache.spark.sql.Dataset.isEmpty` → `com.snowflake.snowpark.DataFrame.isEmpty`

**Row:**

* `org.apache.spark.sql.Row.mkString` → `com.snowflake.snowpark.Row.mkString`

**StructType:**

* `org.apache.spark.sql.types.StructType.fieldNames` → `com.snowflake.snowpark.types.StructType.fieldNames`

**From Not Supported to Rename:**

**Functions:**

* `org.apache.spark.functions.flatten` → `com.snowflake.snowpark.functions.array_flatten`

**From Direct to Rename:**

**Functions:**

* `org.apache.spark.functions.to_date` → `com.snowflake.snowpark.functions.try_to_date`
* `org.apache.spark.functions.to_timestamp` → `com.snowflake.snowpark.functions.try_to_timestamp`

**From Direct Helper to Rename:**

**Functions:**

* `org.apache.spark.sql.functions.concat_ws` → `com.snowflake.snowpark.functions.concat_ws_ignore_nulls`

**From Not Defined to Direct:**

**Functions:**

* `org.apache.spark.functions.try_to_timestamp` → `com.snowflake.snowpark.functions.try_to_timestamp`
* Embedded SQL is now migrated when a SQL statement literal is assigned to a local variable.

Example:
sqlStat = “SELECT colName FROM myTable”
session.sql(sqlStat)

* Embedded SQL is now supported for literal strings concatenations.

Example:
session.sql(“SELECT colName “ + “FROM myTable”)

#### Changed

* Updated the supported versions of Snowpark Python API and Snowpark Pandas API from 1.36.0 to 1.39.0.
* Updated the mapping status for the following PySpark xpath functions from NotSupported to Direct with EWI SPRKPY1103:

  + `pyspark.sql.functions.xpath`
  + `pyspark.sql.functions.xpath_boolean`
  + `pyspark.sql.functions.xpath_double`
  + `pyspark.sql.functions.xpath_float`
  + `pyspark.sql.functions.xpath_int`
  + `pyspark.sql.functions.xpath_long`
  + `pyspark.sql.functions.xpath_number`
  + `pyspark.sql.functions.xpath_short`
  + `pyspark.sql.functions.xpath_string`
* Updated the mapping status for the following PySpark elements from NotDefined to Direct:

  + `pyspark.sql.functions.bit_and` → `snowflake.snowpark.functions.bitand_agg`
  + `pyspark.sql.functions.bit_or` → `snowflake.snowpark.functions.bitor_agg`
  + `pyspark.sql.functions.bit_xor` → `snowflake.snowpark.functions.bitxor_agg`
  + `pyspark.sql.functions.getbit` → `snowflake.snowpark.functions.getbit`
* Updated the mapping status for the following Pandas elements from NotSupported to Direct:

  + `pandas.core.indexes.base.Index` → `modin.pandas.Index`
  + `pandas.core.indexes.base.Index.get_level_values` → `modin.pandas.Index.get_level_values`
* Updated the mapping status for the following PySpark functions from NotSupported to Rename:

  + `pyspark.sql.functions.now` → `snowflake.snowpark.functions.current_timestamp`

#### Fixed

* Fixed Scala not migrating imports when there’s a rename.

  **Example:**

  **Source code:**

  ```scala
  package com.example.functions
  import org.apache.spark.sql.functions.{to_timestamp, lit}
  object ToTimeStampTest extends App {
     to_timestamp(lit("sample"))
     to_timestamp(lit("sample"), "yyyy-MM-dd")
   }
  ```

  **Output code:**

  ```scala
  package com.example.functions
  import com.snowflake.snowpark.functions.{try_to_timestamp, lit}
  import com.snowflake.snowpark_extensions.Extensions._
  import com.snowflake.snowpark_extensions.Extensions.functions._
  object ToTimeStampTest extends App {
     try_to_timestamp(lit("sample"))
     try_to_timestamp(lit("sample"), "yyyy-MM-dd")
   }
  ```

## Version 2.10.0 (Sep 24, 2025)

### Application & CLI Version 2.10.0

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.62

#### Added

* Added functionality to migrate SQL embedded with Python format interpolation.
* Added support for `DataFrame.select` and `DataFrame.sort` transformations for greater data processing flexibility.

#### Changed

* Bumped the supported versions of Snowpark Python API and Snowpark Pandas API to 1.36.0.
* Updated the mapping status of `pandas.core.frame.DataFrame.boxplot` from Not Supported to Direct.
* Updated the mapping status of `DataFrame.select`, `Dataset.select`, `DataFrame.sort` and `Dataset.sort` from Direct to Transformation.
* Snowpark Scala allows a sequence of columns to be passed directly to the select and sort functions, so this transformation changes all the usages such as `df.select(cols: _*)` to `df.select(cols)` and `df.sort(cols: _*)` to `df.sort(cols)`.
* Bumped Python AST and Parser version to 149.1.9.
* Updated the status to Direct for pandas functions:

  + `pandas.core.frame.DataFrame.to_excel`
  + `pandas.core.series.Series.to_excel`
  + `pandas.io.feather_format.read_feather`
  + `pandas.io.orc.read_orc`
  + `pandas.io.stata.read_stata`
* Updated the status for `pyspark.sql.pandas.map_ops.PandasMapOpsMixin.mapInPandas` to workaround using the EWI SPRKPY1102.

#### Fixed

* Fixed issue that affected SqlEmbedded transformations when using chained method calls.
* Fixed transformations involving PySqlExpr using the new PyLiteralSql to avoid losing Tails.
* Resolved internal stability issues to improve tool robustness and reliability.

## Version 2.7.7 (Aug 28, 2025)

### Application & CLI Version 2.7.7

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.46

#### Added

* Added new Pandas EWI documentation PNDSPY1011.
* Added support to the following Pandas functions:

  + pandas.core.algorithms.unique
  + pandas.core.dtypes.missing.isna
  + pandas.core.dtypes.missing.isnull
  + pandas.core.dtypes.missing.notna
  + pandas.core.dtypes.missing.notnull
  + pandas.core.resample.Resampler.count
  + pandas.core.resample.Resampler.max
  + pandas.core.resample.Resampler.mean
  + pandas.core.resample.Resampler.median
  + pandas.core.resample.Resampler.min
  + pandas.core.resample.Resampler.size
  + pandas.core.resample.Resampler.sum
  + pandas.core.arrays.timedeltas.TimedeltaArray.total_seconds
  + pandas.core.series.Series.get
  + pandas.core.series.Series.to_frame
  + pandas.core.frame.DataFrame.assign
  + pandas.core.frame.DataFrame.get
  + pandas.core.frame.DataFrame.to_numpy
  + pandas.core.indexes.base.Index.is_unique
  + pandas.core.indexes.base.Index.has_duplicates
  + pandas.core.indexes.base.Index.shape
  + pandas.core.indexes.base.Index.array
  + pandas.core.indexes.base.Index.str
  + pandas.core.indexes.base.Index.equals
  + pandas.core.indexes.base.Index.identical
  + pandas.core.indexes.base.Index.unique

Added support to the following Spark Scala functions:

* org.apache.spark.sql.functions.format_number
* org.apache.spark.sql.functions.from_unixtime
* org.apache.spark.sql.functions.instr
* org.apache.spark.sql.functions.months_between
* org.apache.spark.sql.functions.pow
* org.apache.spark.sql.functions.to_unix_timestamp
* org.apache.spark.sql.Row.getAs

#### Changed

* Bumped the version of Snowpark Pandas API supported by the SMA to 1.33.0.
* Bumped the version of Snowpark Scala API supported by the SMA to 1.16.0.
* Updated the mapping status of pyspark.sql.group.GroupedData.pivot from Transformation to Direct.
* Updated the mapping status of org.apache.spark.sql.Builder.master from NotSupported to Transformation. This transformation removes all the identified usages of this element during code conversion.
* Updated the mapping status of org.apache.spark.sql.types.StructType.fieldIndex from NotSupported to Direct.
* Updated the mapping status of org.apache.spark.sql.Row.fieldIndex from NotSupported to Direct.
* Updated the mapping status of org.apache.spark.sql.SparkSession.stop from NotSupported to Rename. All the identified usages of this element are renamed to com.snowflake.snowpark.Session.close during code conversion.
* Updated the mapping status of org.apache.spark.sql.DataFrame.unpersist and org.apache.spark.sql.Dataset.unpersist from NotSupported to Transformation. This transformation removes all the identified usages of this element during code conversion.

#### Fixed

* Fixed continuation backslash on removed tailed functions.
* Fix the LIBRARY_PREFIX column in the ConversionStatusLibraries.csv file to use the right identifier for scikit-learn library family (scikit-\*).
* Fixed bug not parsing multiline grouped operations.

## Version 2.9.0 (Sep 09, 2025)

### Included SMA Core Versions

* Snowpark Conversion Core 8.0.53

#### Added

* The following mappings are now performed for `org.apache.spark.sql.Dataset[T]`:

  + `org.apache.spark.sql.Dataset.union` is now `com.snowflake.snowpark.DataFrame.unionAll`
  + `org.apache.spark.sql.Dataset.unionByName` is now `com.snowflake.snowpark.DataFrame.unionAllByName`
* Added support for `org.apache.spark.sql.functions.broadcast` as a transformation.

#### Changed

* Increased the supported Snowpark Python API version for SMA from `1.27.0` to `1.33.0`.
* The status for the `pyspark.sql.function.randn` function has been updated to Direct.

#### Fixed

* Resolved an issue where `org.apache.spark.SparkContext.parallelize` was not resolving and now supports it as a transformation.
* Fixed the `Dataset.persist` transformation to work with any type of Dataset, not just `Dataset[Row]`.

## Version 2.7.6 (Jul 17, 2025)

### Included SMA Core Versions

* Snowpark Conversion Core 8.0.30

#### Added

* Adjusted mappings for spark.DataReader methods.
* `DataFrame.union` is now `DataFrame.unionAll`.
* `DataFrame.unionByName` is now `DataFrame.unionAllByName`.
* Added multi-level artifact dependency columns in artifact inventory
* Added new Pandas EWIs documentation, from `PNDSPY1005` to `PNDSPY1010`.
* Added a specific EWI for `pandas.core.series.Series.apply`.

#### Changed

* Bumped the version of Snowpark Pandas API supported by the SMA from `1.27.0` to `1.30.0`.

#### Fixed

* Fixed an issue with missing values in the formula to get the SQL readiness score.
* Fixed a bug that was causing some Pandas elements to have the default EWI message from PySpark.

## Version 2.7.5 (Jul 2, 2025)

### Application & CLI Version 2.7.5

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.19

#### Changed

* **Refactored Pandas Imports:** Pandas imports now use `modin.pandas` instead of `snowflake.snowpark.modin.pandas`.
* **Improved dbutils and Magic Commands Transformation:**

  + A new `sfutils.py` file is now generated, and all `dbutils` prefixes are replaced with `sfutils`.
  + For Databricks (DBX) notebooks, an implicit import for `sfutils` is automatically added.
  + The `sfutils` module simulates various `dbutils` methods, including file system operations (`dbutils.fs`) via a defined Snowflake FileSystem (SFFS) stage, and handles notebook execution (`dbutils.notebook.run`) by transforming it to `EXECUTE NOTEBOOK` SQL functions.
  + `dbutils.notebook.exit` is removed as it is not required in Snowflake.

#### Fixed

* **Updates in SnowConvert Reports:** SnowConvert reports now include the *CellId* column when instances originate from SMA, and the *FileName* column displays the full path.
* **Updated Artifacts Dependency for SnowConvert Reports:** The SMA’s artifact inventory report, which was previously impacted by the integration of SnowConvert, has been restored. This update enables the SMA tool to accurately capture and analyze *Object References* and *Missing Object References* directly from SnowConvert reports, thereby ensuring the correct retrieval of SQL dependencies for the inventory.

## Version 2.7.4 (Jun 26, 2025)

### Application & CLI Version 2.7.4

**Desktop App**

#### Added

* Added telemetry improvements.

#### Fixed

* Fix documentation links in conversion settings pop-up and Pandas EWIs.

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.16

#### Added

* Transforming Spark XML to Snowpark
* Databricks SQL option in the SQL source language
* Transform JDBC read connections.

#### Changed

* All the SnowConvert reports are copied to the backup Zip file.
* The folder is renamed from `SqlReports` to `SnowConvertReports`.
* `SqlFunctionsInventory` is moved to the folder `Reports`.
* All the SnowConvert Reports are sent to Telemetry.

#### Fixed

* Non-deterministic issue with SQL Readiness Score.
* Fixed a false-positive critical result that made the desktop crash.
* Fixed issue causing the Artifacts dependency report not to show the SQL objects.

## Version 2.7.2 (Jun 10, 2025)

### Application & CLI Version 2.7.2

### Included SMA Core Versions

* Snowpark Conversion Core 8.0.2

#### Fixed

* Addressed an issue with SMA execution on the latest Windows OS, as previously reported. This fix resolves the issues encountered in version 2.7.1.

## Version 2.7.1 (Jun 9, 2025)

### Application & CLI Version 2.7.1

#### Included SMA Core Versions

* Snowpark Conversion Core 8.0.1

#### Added

The Snowpark Migration Accelerator (SMA) now orchestrates` SnowConvert <<https://docs.snowconvert.com/sc/general/about>>`_ to process SQL found in user workloads, including embedded SQL in Python / Scala code, Notebook SQL cells, `.sql` files, and `.hql` files.

The SnowConvert now enhances the previous SMA capabilities:

* [Spark SQL](https://docs.snowconvert.com/sc/translation-references/spark-dbx)

A new folder in the Reports called SQL Reports contains the reports generated by SnowConvert.

#### Known Issues

The previous SMA version for SQL reports will appear empty for the following:

* For `Reports/SqlElementsInventory.csv`, partially covered by the `Reports/SqlReports/Elements.yyyymmdd.hhmmss.csv.`
* For `Reports/SqlFunctionsInventory.csv` refer to the new location with the same name at `Reports/SqlReports/SqlFunctionsInventory.csv`

The artifact dependency inventory:

* In the `ArtifactDependencyInventory` the column for the SQL Object will appear empty

## Version 2.6.10 (May 5, 2025)

### Application & CLI Version 2.6.10

#### Included SMA Core Versions

* Snowpark Conversion Core 7.4.0

#### Fixed

* Fixed wrong values in the ‘checkpoints.json’ file.

  + The ‘sample’ value was without decimals (for integer values) and quotes.
  + The ‘entryPoint’ value had dots instead of slashes and was missing the file extension.
* Updated the default value to TRUE for the setting ‘Convert DBX notebooks to Snowflake notebooks’

## Version 2.6.8 (Apr 28, 2025)

### Application & CLI Version 2.6.8

#### Desktop App

* Added checkpoints execution settings mechanism recognition.
* Added a mechanism to collect DBX magic commands into DbxElementsInventory.csv
* Added ‘checkpoints.json’ generation into the input directory.
* Added a new EWI for all not supported magic command.
* Added the collection of dbutils into DbxElementsInventory.csv from scala source notebooks

#### Included SMA Core Versions

* Snowpark Conversion Core 7.2.53

#### Changed

* Updates made to handle transformations from DBX Scala elements to Jupyter Python elements, and to comment the entire code from the cell.
* Updates made to handle transformations from dbutils.notebook.run and “r” commands, for the last one, also comment out the entire code from the cell.
* Updated the name and the letter of the key to make the conversion of the notebook files.

#### Fixed

* Fixed the bug that was causing the transformation of DBX notebooks into .ipynb files to have the wrong format.
* Fixed the bug that was causing .py DBX notebooks to not be transformable into .ipynb files.
* Fixed a bug that was causing comments to be missing in the output code of DBX notebooks.
* Fixed a bug that was causing raw Scala files to be converted into ipynb files.

## Version 2.6.7 (Apr 21, 2025)

### Application & CLI Version 2.6.7

#### Included SMA Core Versions

* Snowpark Conversion Core 7.2.42

#### Changed

Updated DataFramesInventory to fill EntryPoints column

## Version 2.6.6 (Apr 7, 2025)

### Application & CLI Version 2.6.6

#### Desktop App

#### Added

* Update DBx EWI link in the UI results page

#### Included SMA Core Versions

* Snowpark Conversion Core 7.2.39

#### Added

* Added Execution Flow inventory generation.
* Added implicit session setup in every DBx notebook transformation

#### Changed

* Renamed the DbUtilsUsagesInventory.csv to DbxElementsInventory.csv

#### Fixed

* Fixed a bug that caused a Parsing error when a backslash came after a type hint.
* Fixed relative imports that do not start with a dot and relative imports with a star.

## Version 2.6.5 (Mar 27, 2025)

### Application & CLI Version 2.6.5

#### Desktop App

#### Added

* Added a new conversion setting toggle to enable or disable Sma-Checkpoints feature.
* Fix report issue to not crash when post api returns 500

#### Included SMA Core Versions

* Snowpark Conversion Core 7.2.26

#### Added

* Added generation of the checkpoints.json file into the output folder based on the DataFramesInventory.csv.
* Added “disableCheckpoints” flag into the CLI commands and additional parameters of the code processor.
* Added a new replacer for Python to transform the dbutils.notebook.run node.
* Added new replacers to transform the magic %run command.
* Added new replacers (Python and Scala) to remove the dbutils.notebook.exit node.
* Added Location column to artifacts inventory.

#### Changed

* Refactored the normalized directory separator used in some parts of the solution.
* Centralized the DBC extraction working folder name handling.
* Updated Snowpark and Pandas version to v1.27.0
* Updated the artifacts inventory columns to:

  + Name -> Dependency
  + File -> FileId
  + Status -> Status_detail
* Added new column to the artifacts inventory:

  + Success

#### Fixed

* Dataframes inventory was not being uploaded to the stage correctly.

## Version 2.6.4 (Mar 12, 2025)

### Application & CLI Version 2.6.4

#### Included SMA Core Versions

* Snowpark Conversion Core 7.2.0

#### Added

* An Artifact Dependency Inventory
* A replacer and EWI for pyspark.sql.types.StructType.fieldNames method to snowflake.snowpark.types.StructType.fieldNames attribute.
* The following **PySpark** functions with the status:

Direct Status

* `pyspark.sql.functions.bitmap_bit_position`
* `pyspark.sql.functions.bitmap_bucket_number`
* `pyspark.sql.functions.bitmap_construct_agg`
* `pyspark.sql.functions.equal_null`
* `pyspark.sql.functions.ifnull`
* `pyspark.sql.functions.localtimestamp`
* `pyspark.sql.functions.max_by`
* `pyspark.sql.functions.min_by`
* `pyspark.sql.functions.nvl`
* `pyspark.sql.functions.regr_avgx`
* `pyspark.sql.functions.regr_avgy`
* `pyspark.sql.functions.regr_count`
* `pyspark.sql.functions.regr_intercept`
* `pyspark.sql.functions.regr_slope`
* `pyspark.sql.functions.regr_sxx`
* `pyspark.sql.functions.regr_sxy`
* `pyspark.sql.functions.regr`

NotSupported

* `pyspark.sql.functions.map_contains_key`
* `pyspark.sql.functions.position`
* `pyspark.sql.functions.regr_r2`
* `pyspark.sql.functions.try_to_binary`

The following **Pandas** functions with status

* `pandas.core.series.Series.str.ljust`
* `pandas.core.series.Series.str.center`
* `pandas.core.series.Series.str.pad`
* `pandas.core.series.Series.str.rjust`

Update the following **Pyspark** functions with the status

From WorkAround to Direct

* `pyspark.sql.functions.acosh`
* `pyspark.sql.functions.asinh`
* `pyspark.sql.functions.atanh`
* `pyspark.sql.functions.instr`
* `pyspark.sql.functions.log10`
* `pyspark.sql.functions.log1p`
* `pyspark.sql.functions.log2`

From NotSupported to Direct

* `pyspark.sql.functions.bit_length`
* `pyspark.sql.functions.cbrt`
* `pyspark.sql.functions.nth_value`
* `pyspark.sql.functions.octet_length`
* `pyspark.sql.functions.base64`
* `pyspark.sql.functions.unbase64`

Updated the folloing **Pandas** functions with the status

From NotSupported to Direct

* `pandas.core.frame.DataFrame.pop`
* `pandas.core.series.Series.between`
* `pandas.core.series.Series.pop`

## Version 2.6.3 (Mar 6, 2025)

### Application & CLI Version 2.6.3

#### Included SMA Core Versions

* Snowpark Conversion Core 7.1.13

#### Added

* Added csv generator class for new inventory creation.
* Added “full_name” column to import usages inventory.
* Added transformation from pyspark.sql.functions.concat_ws to snowflake.snowpark.functions._concat_ws_ignore_nulls.
* Added logic for generation of checkpoints.json.
* Added the inventories:

  + DataFramesInventory.csv.
  + CheckpointsInventory.csv

## Version 2.6.0 (Feb 21, 2025)

### Application & CLI Version 2.6.0

#### Desktop App

* Updated the licensing agreement, acceptance is required.

#### Included SMA Core Versions

* Snowpark Conversion Core 7.1.2

Added

Updated the mapping status for the following PySpark elements, from `NotSupported` to `Direct`

* `pyspark.sql.types.ArrayType.json`
* `pyspark.sql.types.ArrayType.jsonValue`
* `pyspark.sql.types.ArrayType.simpleString`
* `pyspark.sql.types.ArrayType.typeName`
* `pyspark.sql.types.AtomicType.json`
* `pyspark.sql.types.AtomicType.jsonValue`
* `pyspark.sql.types.AtomicType.simpleString`
* `pyspark.sql.types.AtomicType.typeName`
* `pyspark.sql.types.BinaryType.json`
* `pyspark.sql.types.BinaryType.jsonValue`
* `pyspark.sql.types.BinaryType.simpleString`
* `pyspark.sql.types.BinaryType.typeName`
* `pyspark.sql.types.BooleanType.json`
* `pyspark.sql.types.BooleanType.jsonValue`
* `pyspark.sql.types.BooleanType.simpleString`
* `pyspark.sql.types.BooleanType.typeName`
* `pyspark.sql.types.ByteType.json`
* `pyspark.sql.types.ByteType.jsonValue`
* `pyspark.sql.types.ByteType.simpleString`
* `pyspark.sql.types.ByteType.typeName`
* `pyspark.sql.types.DecimalType.json`
* `pyspark.sql.types.DecimalType.jsonValue`
* `pyspark.sql.types.DecimalType.simpleString`
* `pyspark.sql.types.DecimalType.typeName`
* `pyspark.sql.types.DoubleType.json`
* `pyspark.sql.types.DoubleType.jsonValue`
* `pyspark.sql.types.DoubleType.simpleString`
* `pyspark.sql.types.DoubleType.typeName`
* `pyspark.sql.types.FloatType.json`
* `pyspark.sql.types.FloatType.jsonValue`
* `pyspark.sql.types.FloatType.simpleString`
* `pyspark.sql.types.FloatType.typeName`
* `pyspark.sql.types.FractionalType.json`
* `pyspark.sql.types.FractionalType.jsonValue`
* `pyspark.sql.types.FractionalType.simpleString`
* `pyspark.sql.types.FractionalType.typeName`
* `pyspark.sql.types.IntegerType.json`
* `pyspark.sql.types.IntegerType.jsonValue`
* `pyspark.sql.types.IntegerType.simpleString`
* `pyspark.sql.types.IntegerType.typeName`
* `pyspark.sql.types.IntegralType.json`
* `pyspark.sql.types.IntegralType.jsonValue`
* `pyspark.sql.types.IntegralType.simpleString`
* `pyspark.sql.types.IntegralType.typeName`
* `pyspark.sql.types.LongType.json`
* `pyspark.sql.types.LongType.jsonValue`
* `pyspark.sql.types.LongType.simpleString`
* `pyspark.sql.types.LongType.typeName`
* `pyspark.sql.types.MapType.json`
* `pyspark.sql.types.MapType.jsonValue`
* `pyspark.sql.types.MapType.simpleString`
* `pyspark.sql.types.MapType.typeName`
* `pyspark.sql.types.NullType.json`
* `pyspark.sql.types.NullType.jsonValue`
* `pyspark.sql.types.NullType.simpleString`
* `pyspark.sql.types.NullType.typeName`
* `pyspark.sql.types.NumericType.json`
* `pyspark.sql.types.NumericType.jsonValue`
* `pyspark.sql.types.NumericType.simpleString`
* `pyspark.sql.types.NumericType.typeName`
* `pyspark.sql.types.ShortType.json`
* `pyspark.sql.types.ShortType.jsonValue`
* `pyspark.sql.types.ShortType.simpleString`
* `pyspark.sql.types.ShortType.typeName`
* `pyspark.sql.types.StringType.json`
* `pyspark.sql.types.StringType.jsonValue`
* `pyspark.sql.types.StringType.simpleString`
* `pyspark.sql.types.StringType.typeName`
* `pyspark.sql.types.StructType.json`
* `pyspark.sql.types.StructType.jsonValue`
* `pyspark.sql.types.StructType.simpleString`
* `pyspark.sql.types.StructType.typeName`
* `pyspark.sql.types.TimestampType.json`
* `pyspark.sql.types.TimestampType.jsonValue`
* `pyspark.sql.types.TimestampType.simpleString`
* `pyspark.sql.types.TimestampType.typeName`
* `pyspark.sql.types.StructField.simpleString`
* `pyspark.sql.types.StructField.typeName`
* `pyspark.sql.types.StructField.json`
* `pyspark.sql.types.StructField.jsonValue`
* `pyspark.sql.types.DataType.json`
* `pyspark.sql.types.DataType.jsonValue`
* `pyspark.sql.types.DataType.simpleString`
* `pyspark.sql.types.DataType.typeName`
* `pyspark.sql.session.SparkSession.getActiveSession`
* `pyspark.sql.session.SparkSession.version`
* `pandas.io.html.read_html`
* `pandas.io.json._normalize.json_normalize`
* `pyspark.sql.types.ArrayType.fromJson`
* `pyspark.sql.types.MapType.fromJson`
* `pyspark.sql.types.StructField.fromJson`
* `pyspark.sql.types.StructType.fromJson`
* `pandas.core.groupby.generic.DataFrameGroupBy.pct_change`
* `pandas.core.groupby.generic.SeriesGroupBy.pct_change`

Updated the mapping status for the following Pandas elements, from `NotSupported` to `Direct`

* `pandas.io.html.read_html`
* `pandas.io.json._normalize.json_normalize`
* `pandas.core.groupby.generic.DataFrameGroupBy.pct_change`
* `pandas.core.groupby.generic.SeriesGroupBy.pct_change`

Updated the mapping status for the following PySpark elements, from `Rename` to `Direct`

* `pyspark.sql.functions.collect_list`
* `pyspark.sql.functions.size`

#### Fixed

* Standardized the format of the version number in the inventories.

## Version 2.5.2 (Feb 5, 2025)

### Hotfix: Application & CLI Version 2.5.2

### Desktop App

* Fixed an issue when converting in the sample project option.

### Included SMA Core Versions

* Snowpark Conversion Core 5.3.0

## Version 2.5.1 (Feb 4, 2025)

### Application & CLI Version 2.5.1

### Desktop App

* Added a new modal when the user does not have write permission.
* Updated the licensing aggrement, acceptance is required.

### CLI

* Fixed the year in the CLI screen when showing “–version” or “-v”

### Included SMA Core Versions included-sma-core-versions

* Snowpark Conversion Core 5.3.0

#### Added

Added the following Python Third-Party libraries with Direct status:

* `about-time`
* `affinegap`
* `aiohappyeyeballs`
* `alibi-detect`
* `alive-progress`
* `allure-nose2`
* `allure-robotframework`
* `anaconda-cloud-cli`
* `anaconda-mirror`
* `astropy-iers-data`
* `asynch`
* `asyncssh`
* `autots`
* `autoviml`
* `aws-msk-iam-sasl-signer-python`
* `azure-functions`
* `backports.tarfile`
* `blas`
* `bottle`
* `bson`
* `cairo`
* `capnproto`
* `captum`
* `categorical-distance`
* `census`
* `clickhouse-driver`
* `clustergram`
* `cma`
* `conda-anaconda-telemetry`
* `configspace`
* `cpp-expected`
* `dask-expr`
* `data-science-utils`
* `databricks-sdk`
* `datetime-distance`
* `db-dtypes`
* `dedupe`
* `dedupe-variable-datetime`
* `dedupe_lehvenshtein_search`
* `dedupe_levenshtein_search`
* `diff-cover`
* `diptest`
* `dmglib`
* `docstring_parser`
* `doublemetaphone`
* `dspy-ai`
* `econml`
* `emcee`
* `emoji`
* `environs`
* `eth-abi`
* `eth-hash`
* `eth-typing`
* `eth-utils`
* `expat`
* `filetype`
* `fitter`
* `flask-cors`
* `fpdf2`
* `frozendict`
* `gcab`
* `geojson`
* `gettext`
* `glib-tools`
* `google-ads`
* `google-ai-generativelanguage`
* `google-api-python-client`
* `google-auth-httplib2`
* `google-cloud-bigquery`
* `google-cloud-bigquery-core`
* `google-cloud-bigquery-storage`
* `google-cloud-bigquery-storage-core`
* `google-cloud-resource-manager`
* `google-generativeai`
* `googlemaps`
* `grapheme`
* `graphene`
* `graphql-relay`
* `gravis`
* `greykite`
* `grpc-google-iam-v1`
* `harfbuzz`
* `hatch-fancy-pypi-readme`
* `haversine`
* `hiclass`
* `hicolor-icon-theme`
* `highered`
* `hmmlearn`
* `holidays-ext`
* `httplib2`
* `icu`
* `imbalanced-ensemble`
* `immutabledict`
* `importlib-metadata`
* `importlib-resources`
* `inquirerpy`
* `iterative-telemetry`
* `jaraco.context`
* `jaraco.test`
* `jiter`
* `jiwer`
* `joserfc`
* `jsoncpp`
* `jsonpath`
* `jsonpath-ng`
* `jsonpath-python`
* `kagglehub`
* `keplergl`
* `kt-legacy`
* `langchain-community`
* `langchain-experimental`
* `langchain-snowflake`
* `langchain-text-splitters`
* `libabseil`
* `libflac`
* `libgfortran-ng`
* `libgfortran5`
* `libglib`
* `libgomp`
* `libgrpc`
* `libgsf`
* `libmagic`
* `libogg`
* `libopenblas`
* `libpostal`
* `libprotobuf`
* `libsentencepiece`
* `libsndfile`
* `libstdcxx-ng`
* `libtheora`
* `libtiff`
* `libvorbis`
* `libwebp`
* `lightweight-mmm`
* `litestar`
* `litestar-with-annotated-types`
* `litestar-with-attrs`
* `litestar-with-cryptography`
* `litestar-with-jinja`
* `litestar-with-jwt`
* `litestar-with-prometheus`
* `litestar-with-structlog`
* `lunarcalendar-ext`
* `matplotlib-venn`
* `metricks`
* `mimesis`
* `modin-ray`
* `momepy`
* `mpg123`
* `msgspec`
* `msgspec-toml`
* `msgspec-yaml`
* `msitools`
* `multipart`
* `namex`
* `nbconvert-all`
* `nbconvert-core`
* `nbconvert-pandoc`
* `nlohmann_json`
* `numba-cuda`
* `numpyro`
* `office365-rest-python-client`
* `openapi-pydantic`
* `opentelemetry-distro`
* `opentelemetry-instrumentation`
* `opentelemetry-instrumentation-system-metrics`
* `optree`
* `osmnx`
* `pathlib`
* `pdf2image`
* `pfzy`
* `pgpy`
* `plumbum`
* `pm4py`
* `polars`
* `polyfactory`
* `poppler-cpp`
* `postal`
* `pre-commit`
* `prompt-toolkit`
* `propcache`
* `py-partiql-parser`
* `py_stringmatching`
* `pyatlan`
* `pyfakefs`
* `pyfhel`
* `pyhacrf-datamade`
* `pyiceberg`
* `pykrb5`
* `pylbfgs`
* `pymilvus`
* `pymoo`
* `pynisher`
* `pyomo`
* `pypdf`
* `pypdf-with-crypto`
* `pypdf-with-full`
* `pypdf-with-image`
* `pypng`
* `pyprind`
* `pyrfr`
* `pysoundfile`
* `pytest-codspeed`
* `pytest-trio`
* `python-barcode`
* `python-box`
* `python-docx`
* `python-gssapi`
* `python-iso639`
* `python-magic`
* `python-pandoc`
* `python-zstd`
* `pyuca`
* `pyvinecopulib`
* `pyxirr`
* `qrcode`
* `rai-sdk`
* `ray-client`
* `ray-observability`
* `readline`
* `rich-click`
* `rouge-score`
* `ruff`
* `scikit-criteria`
* `scikit-mobility`
* `sentencepiece-python`
* `sentencepiece-spm`
* `setuptools-markdown`
* `setuptools-scm`
* `setuptools-scm-git-archive`
* `shareplum`
* `simdjson`
* `simplecosine`
* `sis-extras`
* `slack-sdk`
* `smac`
* `snowflake-sqlalchemy`
* `snowflake_legacy`
* `socrata-py`
* `spdlog`
* `sphinxcontrib-images`
* `sphinxcontrib-jquery`
* `sphinxcontrib-youtube`
* `splunk-opentelemetry`
* `sqlfluff`
* `squarify`
* `st-theme`
* `statistics`
* `streamlit-antd-components`
* `streamlit-condition-tree`
* `streamlit-echarts`
* `streamlit-feedback`
* `streamlit-keplergl`
* `streamlit-mermaid`
* `streamlit-navigation-bar`
* `streamlit-option-menu`
* `strictyaml`
* `stringdist`
* `sybil`
* `tensorflow-cpu`
* `tensorflow-text`
* `tiledb-ptorchaudio`
* `torcheval`
* `trio-websocket`
* `trulens-connectors-snowflake`
* `trulens-core`
* `trulens-dashboard`
* `trulens-feedback`
* `trulens-otel-semconv`
* `trulens-providers-cortex`
* `tsdownsample`
* `typing`
* `typing-extensions`
* `typing_extensions`
* `unittest-xml-reporting`
* `uritemplate`
* `us`
* `uuid6`
* `wfdb`
* `wsproto`
* `zlib`
* `zope.index`

Added the following Python BuiltIn libraries with Direct status:

* `aifc`
* `array`
* `ast`
* `asynchat`
* `asyncio`
* `asyncore`
* `atexit`
* `audioop`
* `base64`
* `bdb`
* `binascii`
* `bitsect`
* `builtins`
* `bz2`
* `calendar`
* `cgi`
* `cgitb`
* `chunk`
* `cmath`
* `cmd`
* `code`
* `codecs`
* `codeop`
* `colorsys`
* `compileall`
* `concurrent`
* `contextlib`
* `contextvars`
* `copy`
* `copyreg`
* `cprofile`
* `crypt`
* `csv`
* `ctypes`
* `curses`
* `dbm`
* `difflib`
* `dis`
* `distutils`
* `doctest`
* `email`
* `ensurepip`
* `enum`
* `errno`
* `faulthandler`
* `fcntl`
* `filecmp`
* `fileinput`
* `fnmatch`
* `fractions`
* `ftplib`
* `functools`
* `gc`
* `getopt`
* `getpass`
* `gettext`
* `graphlib`
* `grp`
* `gzip`
* `hashlib`
* `heapq`
* `hmac`
* `html`
* `http`
* `idlelib`
* `imaplib`
* `imghdr`
* `imp`
* `importlib`
* `inspect`
* `ipaddress`
* `itertools`
* `keyword`
* `linecache`
* `locale`
* `lzma`
* `mailbox`
* `mailcap`
* `marshal`
* `math`
* `mimetypes`
* `mmap`
* `modulefinder`
* `msilib`
* `multiprocessing`
* `netrc`
* `nis`
* `nntplib`
* `numbers`
* `operator`
* `optparse`
* `ossaudiodev`
* `pdb`
* `pickle`
* `pickletools`
* `pipes`
* `pkgutil`
* `platform`
* `plistlib`
* `poplib`
* `posix`
* `pprint`
* `profile`
* `pstats`
* `pty`
* `pwd`
* `py_compile`
* `pyclbr`
* `pydoc`
* `queue`
* `quopri`
* `random`
* `re`
* `reprlib`
* `resource`
* `rlcompleter`
* `runpy`
* `sched`
* `secrets`
* `select`
* `selectors`
* `shelve`
* `shlex`
* `signal`
* `site`
* `sitecustomize`
* `smtpd`
* `smtplib`
* `sndhdr`
* `socket`
* `socketserver`
* `spwd`
* `sqlite3`
* `ssl`
* `stat`
* `string`
* `stringprep`
* `struct`
* `subprocess`
* `sunau`
* `symtable`
* `sysconfig`
* `syslog`
* `tabnanny`
* `tarfile`
* `telnetlib`
* `tempfile`
* `termios`
* `test`
* `textwrap`
* `threading`
* `timeit`
* `tkinter`
* `token`
* `tokenize`
* `tomllib`
* `trace`
* `traceback`
* `tracemalloc`
* `tty`
* `turtle`
* `turtledemo`
* `types`
* `unicodedata`
* `urllib`
* `uu`
* `uuid`
* `venv`
* `warnings`
* `wave`
* `weakref`
* `webbrowser`
* `wsgiref`
* `xdrlib`
* `xml`
* `xmlrpc`
* `zipapp`
* `zipfile`
* `zipimport`
* `zoneinfo`

Added the following Python BuiltIn libraries with NotSupported status:

* `msvcrt`
* `winreg`
* `winsound`

#### Changed

* Update .NET version to v9.0.0.
* Improved EWI SPRKPY1068.
* Bumped the version of Snowpark Python API supported by the SMA from 1.24.0 to 1.25.0.
* Updated the detailed report template, now has the Snowpark version for Pandas.
* Changed the following libraries from **ThirdPartyLib** to **BuiltIn**.

  + `configparser`
  + `dataclasses`
  + `pathlib`
  + `readline`
  + `statistics`
  + `zlib`

Updated the mapping status for the following Pandas elements, from Direct to Partial:

* `pandas.core.frame.DataFrame.add`
* `pandas.core.frame.DataFrame.aggregate`
* `pandas.core.frame.DataFrame.all`
* `pandas.core.frame.DataFrame.apply`
* `pandas.core.frame.DataFrame.astype`
* `pandas.core.frame.DataFrame.cumsum`
* `pandas.core.frame.DataFrame.div`
* `pandas.core.frame.DataFrame.dropna`
* `pandas.core.frame.DataFrame.eq`
* `pandas.core.frame.DataFrame.ffill`
* `pandas.core.frame.DataFrame.fillna`
* `pandas.core.frame.DataFrame.floordiv`
* `pandas.core.frame.DataFrame.ge`
* `pandas.core.frame.DataFrame.groupby`
* `pandas.core.frame.DataFrame.gt`
* `pandas.core.frame.DataFrame.idxmax`
* `pandas.core.frame.DataFrame.idxmin`
* `pandas.core.frame.DataFrame.inf`
* `pandas.core.frame.DataFrame.join`
* `pandas.core.frame.DataFrame.le`
* `pandas.core.frame.DataFrame.loc`
* `pandas.core.frame.DataFrame.lt`
* `pandas.core.frame.DataFrame.mask`
* `pandas.core.frame.DataFrame.merge`
* `pandas.core.frame.DataFrame.mod`
* `pandas.core.frame.DataFrame.mul`
* `pandas.core.frame.DataFrame.ne`
* `pandas.core.frame.DataFrame.nunique`
* `pandas.core.frame.DataFrame.pivot_table`
* `pandas.core.frame.DataFrame.pow`
* `pandas.core.frame.DataFrame.radd`
* `pandas.core.frame.DataFrame.rank`
* `pandas.core.frame.DataFrame.rdiv`
* `pandas.core.frame.DataFrame.rename`
* `pandas.core.frame.DataFrame.replace`
* `pandas.core.frame.DataFrame.resample`
* `pandas.core.frame.DataFrame.rfloordiv`
* `pandas.core.frame.DataFrame.rmod`
* `pandas.core.frame.DataFrame.rmul`
* `pandas.core.frame.DataFrame.rolling`
* `pandas.core.frame.DataFrame.round`
* `pandas.core.frame.DataFrame.rpow`
* `pandas.core.frame.DataFrame.rsub`
* `pandas.core.frame.DataFrame.rtruediv`
* `pandas.core.frame.DataFrame.shift`
* `pandas.core.frame.DataFrame.skew`
* `pandas.core.frame.DataFrame.sort_index`
* `pandas.core.frame.DataFrame.sort_values`
* `pandas.core.frame.DataFrame.sub`
* `pandas.core.frame.DataFrame.to_dict`
* `pandas.core.frame.DataFrame.transform`
* `pandas.core.frame.DataFrame.transpose`
* `pandas.core.frame.DataFrame.truediv`
* `pandas.core.frame.DataFrame.var`
* `pandas.core.indexes.datetimes.date_range`
* `pandas.core.reshape.concat.concat`
* `pandas.core.reshape.melt.melt`
* `pandas.core.reshape.merge.merge`
* `pandas.core.reshape.pivot.pivot_table`
* `pandas.core.reshape.tile.cut`
* `pandas.core.series.Series.add`
* `pandas.core.series.Series.aggregate`
* `pandas.core.series.Series.all`
* `pandas.core.series.Series.any`
* `pandas.core.series.Series.cumsum`
* `pandas.core.series.Series.div`
* `pandas.core.series.Series.dropna`
* `pandas.core.series.Series.eq`
* `pandas.core.series.Series.ffill`
* `pandas.core.series.Series.fillna`
* `pandas.core.series.Series.floordiv`
* `pandas.core.series.Series.ge`
* `pandas.core.series.Series.gt`
* `pandas.core.series.Series.lt`
* `pandas.core.series.Series.mask`
* `pandas.core.series.Series.mod`
* `pandas.core.series.Series.mul`
* `pandas.core.series.Series.multiply`
* `pandas.core.series.Series.ne`
* `pandas.core.series.Series.pow`
* `pandas.core.series.Series.quantile`
* `pandas.core.series.Series.radd`
* `pandas.core.series.Series.rank`
* `pandas.core.series.Series.rdiv`
* `pandas.core.series.Series.rename`
* `pandas.core.series.Series.replace`
* `pandas.core.series.Series.resample`
* `pandas.core.series.Series.rfloordiv`
* `pandas.core.series.Series.rmod`
* `pandas.core.series.Series.rmul`
* `pandas.core.series.Series.rolling`
* `pandas.core.series.Series.rpow`
* `pandas.core.series.Series.rsub`
* `pandas.core.series.Series.rtruediv`
* `pandas.core.series.Series.sample`
* `pandas.core.series.Series.shift`
* `pandas.core.series.Series.skew`
* `pandas.core.series.Series.sort_index`
* `pandas.core.series.Series.sort_values`
* `pandas.core.series.Series.std`
* `pandas.core.series.Series.sub`
* `pandas.core.series.Series.subtract`
* `pandas.core.series.Series.truediv`
* `pandas.core.series.Series.value_counts`
* `pandas.core.series.Series.var`
* `pandas.core.series.Series.where`
* `pandas.core.tools.numeric.to_numeric`

Updated the mapping status for the following Pandas elements, from NotSupported to Direct:

* `pandas.core.frame.DataFrame.attrs`
* `pandas.core.indexes.base.Index.to_numpy`
* `pandas.core.series.Series.str.len`
* `pandas.io.html.read_html`
* `pandas.io.xml.read_xml`
* `pandas.core.indexes.datetimes.DatetimeIndex.mean`
* `pandas.core.resample.Resampler.indices`
* `pandas.core.resample.Resampler.nunique`
* `pandas.core.series.Series.items`
* `pandas.core.tools.datetimes.to_datetime`
* `pandas.io.sas.sasreader.read_sas`
* `pandas.core.frame.DataFrame.attrs`
* `pandas.core.frame.DataFrame.style`
* `pandas.core.frame.DataFrame.items`
* `pandas.core.groupby.generic.DataFrameGroupBy.head`
* `pandas.core.groupby.generic.DataFrameGroupBy.median`
* `pandas.core.groupby.generic.DataFrameGroupBy.min`
* `pandas.core.groupby.generic.DataFrameGroupBy.nunique`
* `pandas.core.groupby.generic.DataFrameGroupBy.tail`
* `pandas.core.indexes.base.Index.is_boolean`
* `pandas.core.indexes.base.Index.is_floating`
* `pandas.core.indexes.base.Index.is_integer`
* `pandas.core.indexes.base.Index.is_monotonic_decreasing`
* `pandas.core.indexes.base.Index.is_monotonic_increasing`
* `pandas.core.indexes.base.Index.is_numeric`
* `pandas.core.indexes.base.Index.is_object`
* `pandas.core.indexes.base.Index.max`
* `pandas.core.indexes.base.Index.min`
* `pandas.core.indexes.base.Index.name`
* `pandas.core.indexes.base.Index.names`
* `pandas.core.indexes.base.Index.rename`
* `pandas.core.indexes.base.Index.set_names`
* `pandas.core.indexes.datetimes.DatetimeIndex.day_name`
* `pandas.core.indexes.datetimes.DatetimeIndex.month_name`
* `pandas.core.indexes.datetimes.DatetimeIndex.time`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.ceil`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.days`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.floor`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.microseconds`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.nanoseconds`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.round`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.seconds`
* `pandas.core.reshape.pivot.crosstab`
* `pandas.core.series.Series.dt.round`
* `pandas.core.series.Series.dt.time`
* `pandas.core.series.Series.dt.weekday`
* `pandas.core.series.Series.is_monotonic_decreasing`
* `pandas.core.series.Series.is_monotonic_increasing`

Updated the mapping status for the following Pandas elements, from NotSupported to Partial:

* `pandas.core.frame.DataFrame.align`
* `pandas.core.series.Series.align`
* `pandas.core.frame.DataFrame.tz_convert`
* `pandas.core.frame.DataFrame.tz_localize`
* `pandas.core.groupby.generic.DataFrameGroupBy.fillna`
* `pandas.core.groupby.generic.SeriesGroupBy.fillna`
* `pandas.core.indexes.datetimes.bdate_range`
* `pandas.core.indexes.datetimes.DatetimeIndex.std`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.mean`
* `pandas.core.resample.Resampler.asfreq`
* `pandas.core.resample.Resampler.quantile`
* `pandas.core.series.Series.map`
* `pandas.core.series.Series.tz_convert`
* `pandas.core.series.Series.tz_localize`
* `pandas.core.window.expanding.Expanding.count`
* `pandas.core.window.rolling.Rolling.count`
* `pandas.core.groupby.generic.DataFrameGroupBy.aggregate`
* `pandas.core.groupby.generic.SeriesGroupBy.aggregate`
* `pandas.core.frame.DataFrame.applymap`
* `pandas.core.series.Series.apply`
* `pandas.core.groupby.generic.DataFrameGroupBy.bfill`
* `pandas.core.groupby.generic.DataFrameGroupBy.ffill`
* `pandas.core.groupby.generic.SeriesGroupBy.bfill`
* `pandas.core.groupby.generic.SeriesGroupBy.ffill`
* `pandas.core.frame.DataFrame.backfill`
* `pandas.core.frame.DataFrame.bfill`
* `pandas.core.frame.DataFrame.compare`
* `pandas.core.frame.DataFrame.unstack`
* `pandas.core.frame.DataFrame.asfreq`
* `pandas.core.series.Series.backfill`
* `pandas.core.series.Series.bfill`
* `pandas.core.series.Series.compare`
* `pandas.core.series.Series.unstack`
* `pandas.core.series.Series.asfreq`
* `pandas.core.series.Series.argmax`
* `pandas.core.series.Series.argmin`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.microsecond`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.nanosecond`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.day_name`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.month_name`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.month_start`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.month_end`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.is_year_start`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.is_year_end`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.is_quarter_start`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.is_quarter_end`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.is_leap_year`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.floor`
* `pandas.core.indexes.accessors.CombinedDatetimelikeProperties.ceil`
* `pandas.core.groupby.generic.DataFrameGroupBy.idxmax`
* `pandas.core.groupby.generic.DataFrameGroupBy.idxmin`
* `pandas.core.groupby.generic.DataFrameGroupBy.std`
* `pandas.core.indexes.timedeltas.TimedeltaIndex.mean`
* `pandas.core.tools.timedeltas.to_timedelta`

#### Known Issue

* **This version includes an issue when converting the sample project will not work on this version, it** will be fixed on the next release

## Version 2.4.3 (Jan 9, 2025)

### Application & CLI Version 2.4.3

#### Desktop App

* Added link to the troubleshooting guide in the crash report modal.

#### Included SMA Core Versions

* Snowpark Conversion Core 4.15.0

#### Added

* Added the following PySpark elements to ConversionStatusPySpark.csv file as `NotSupported:`

  + `pyspark.sql.streaming.readwriter.DataStreamReader.table`
  + `pyspark.sql.streaming.readwriter.DataStreamReader.schema`
  + `pyspark.sql.streaming.readwriter.DataStreamReader.options`
  + `pyspark.sql.streaming.readwriter.DataStreamReader.option`
  + `pyspark.sql.streaming.readwriter.DataStreamReader.load`
  + `pyspark.sql.streaming.readwriter.DataStreamReader.format`
  + `pyspark.sql.streaming.query.StreamingQuery.awaitTermination`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.partitionBy`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.toTable`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.trigger`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.queryName`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.outputMode`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.format`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.option`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.foreachBatch`
  + `pyspark.sql.streaming.readwriter.DataStreamWriter.start`

#### Changed

* Updated Hive SQL EWIs format.

  + SPRKHVSQL1001
  + SPRKHVSQL1002
  + SPRKHVSQL1003
  + SPRKHVSQL1004
  + SPRKHVSQL1005
  + SPRKHVSQL1006
* Updated Spark SQL EWIs format.

  + SPRKSPSQL1001
  + SPRKSPSQL1002
  + SPRKSPSQL1003
  + SPRKSPSQL1004
  + SPRKSPSQL1005
  + SPRKSPSQL1006

#### Fixed

* Fixed a bug that was causing some PySpark elements not identified by the tool.
* Fixed the mismatch in the ThirdParty identified calls and the ThirdParty import Calls number.

## Version 2.4.2 (Dec 13, 2024)

### Application & CLI **Version 2.4.2**

#### Included SMA Core Versions

* Snowpark Conversion Core 4.14.0

#### Added added

* Added the following Spark elements to ConversionStatusPySpark.csv:

  + `pyspark.broadcast.Broadcast.value`
  + `pyspark.conf.SparkConf.getAll`
  + `pyspark.conf.SparkConf.setAll`
  + `pyspark.conf.SparkConf.setMaster`
  + `pyspark.context.SparkContext.addFile`
  + `pyspark.context.SparkContext.addPyFile`
  + `pyspark.context.SparkContext.binaryFiles`
  + `pyspark.context.SparkContext.setSystemProperty`
  + `pyspark.context.SparkContext.version`
  + `pyspark.files.SparkFiles`
  + `pyspark.files.SparkFiles.get`
  + `pyspark.rdd.RDD.count`
  + `pyspark.rdd.RDD.distinct`
  + `pyspark.rdd.RDD.reduceByKey`
  + `pyspark.rdd.RDD.saveAsTextFile`
  + `pyspark.rdd.RDD.take`
  + `pyspark.rdd.RDD.zipWithIndex`
  + `pyspark.sql.context.SQLContext.udf`
  + `pyspark.sql.types.StructType.simpleString`

#### Changed

* Updated the documentation of the Pandas EWIs, `PNDSPY1001`, `PNDSPY1002` and `PNDSPY1003` `SPRKSCL1137` to align with a standardized format, ensuring consistency and clarity across all the EWIs.
* Updated the documentation of the following Scala EWIs: `SPRKSCL1106` and `SPRKSCL1107`. To be aligned with a standardized format, ensuring consistency and clarity across all the EWIs.

#### Fixed

* Fixed the bug the was causing the UserDefined symbols showing in the third party usages inventory.

## Version 2.4.1 (Dec 4, 2024)

### Application & CLI **Version 2.4.1**

#### Included SMA Core Versions

* Snowpark Conversion Core 4.13.1

#### Command Line Interface

**Changed**

* Added timestamp to the output folder.

### Snowpark Conversion Core 4.13.1

#### Added

* Added ‘Source Language’ column to Library Mappings Table
* Added `Others` as a new category in the Pandas API Summary table of the DetailedReport.docx

#### Changed

* Updated the documentation for Python EWI `SPRKPY1058`.
* Updated the message for the pandas EWI `PNDSPY1002` to show the relate pandas element.
* Updated the way we created the .csv reports, now are overwritten after a second run .

#### Fixed

* Fixed a bug that was causing Notebook files not being generated in the output.
* Fixed the replacer for `get` and `set` methods from `pyspark.sql.conf.RuntimeConfig`, the replacer now match the correct full names.
* Fixed query tag incorrect version.
* Fixed UserDefined packages reported as ThirdPartyLib.

## Version 2.3.1 (Nov 14, 2024)

### Application & CLI **Version 2.3.1**

#### Included SMA Core Versions

* Snowpark Conversion Core 4.12.0

#### Desktop App

**Fixed**

* Fix case-sensitive issues in –sql options.

**Removed**

* Remove platform name from show-ac message.

### Snowpark Conversion Core 4.12.0

#### Added

* Added support for Snowpark Python 1.23.0 and 1.24.0.
* Added a new EWI for the `pyspark.sql.dataframe.DataFrame.writeTo` function. All the usages of this function will now have the EWI SPRKPY1087.

#### Changed

* Updated the documentation of the Scala EWIs from `SPRKSCL1137` to `SPRKSCL1156` to align with a standardized format, ensuring consistency and clarity across all the EWIs.
* Updated the documentation of the Scala EWIs from `SPRKSCL1117` to `SPRKSCL1136` to align with a standardized format, ensuring consistency and clarity across all the EWIs.
* Updated the message that is shown for the following EWIs:

  + SPRKPY1082
  + SPRKPY1083
* Updated the documentation of the Scala EWIs from `SPRKSCL1100` to `SPRKSCL1105`, from `SPRKSCL1108` to `SPRKSCL1116`; from `SPRKSCL1157` to `SPRKSCL1175`; to align with a standardized format, ensuring consistency and clarity across all the EWIs.
* Updated the mapping status of the following PySpark elements from **NotSupported** to **Direct** with EWI:

  + `pyspark.sql.readwriter.DataFrameWriter.option` => `snowflake.snowpark.DataFrameWriter.option`: All the usages of this function now have the EWI SPRKPY1088
  + `pyspark.sql.readwriter.DataFrameWriter.options` => `snowflake.snowpark.DataFrameWriter.options`: All the usages of this function now have the EWI SPRKPY1089
* Updated the mapping status of the following PySpark elements from **Workaround** to **Rename**:

  + `pyspark.sql.readwriter.DataFrameWriter.partitionBy` => `snowflake.snowpark.DataFrameWriter.partition_by`
* Updated EWI documentation: SPRKSCL1000, SPRKSCL1001, SPRKSCL1002, SPRKSCL1100, SPRKSCL1101, SPRKSCL1102, SPRKSCL1103, SPRKSCL1104, SPRKSCL1105.

#### Removed

* Removed the `pyspark.sql.dataframe.DataFrameStatFunctions.writeTo` element from the conversion status, this element does not exist.

#### Deprecated

* Deprecated the following EWI codes:

  + SPRKPY1081
  + SPRKPY1084

## Version 2.3.0 (Oct 30, 2024)

### Application & CLI Version 2.3.0

* Snowpark Conversion Core 4.11.0

### Snowpark Conversion Core 4.11.0

#### Added

* Added a new column called `Url` to the `Issues.csv` file, which redirects to the corresponding EWI documentation.
* Added new EWIs for the following Spark elements:

  + [SPRKPY1082] pyspark.sql.readwriter.DataFrameReader.load
  + [SPRKPY1083] pyspark.sql.readwriter.DataFrameWriter.save
  + [SPRKPY1084] pyspark.sql.readwriter.DataFrameWriter.option
  + [SPRKPY1085] pyspark.ml.feature.VectorAssembler
  + [SPRKPY1086] pyspark.ml.linalg.VectorUDT
* Added 38 new Pandas elements:

  + pandas.core.frame.DataFrame.select
  + andas.core.frame.DataFrame.str
  + pandas.core.frame.DataFrame.str.replace
  + pandas.core.frame.DataFrame.str.upper
  + pandas.core.frame.DataFrame.to_list
  + pandas.core.frame.DataFrame.tolist
  + pandas.core.frame.DataFrame.unique
  + pandas.core.frame.DataFrame.values.tolist
  + pandas.core.frame.DataFrame.withColumn
  + pandas.core.groupby.generic._SeriesGroupByScalar
  + pandas.core.groupby.generic._SeriesGroupByScalar[S1].agg
  + pandas.core.groupby.generic._SeriesGroupByScalar[S1].aggregate
  + pandas.core.indexes.datetimes.DatetimeIndex.year
  + pandas.core.series.Series.columns
  + pandas.core.tools.datetimes.to_datetime.date
  + pandas.core.tools.datetimes.to_datetime.dt.strftime
  + pandas.core.tools.datetimes.to_datetime.strftime
  + pandas.io.parsers.readers.TextFileReader.apply
  + pandas.io.parsers.readers.TextFileReader.astype
  + pandas.io.parsers.readers.TextFileReader.columns
  + pandas.io.parsers.readers.TextFileReader.copy
  + pandas.io.parsers.readers.TextFileReader.drop
  + pandas.io.parsers.readers.TextFileReader.drop_duplicates
  + pandas.io.parsers.readers.TextFileReader.fillna
  + pandas.io.parsers.readers.TextFileReader.groupby
  + pandas.io.parsers.readers.TextFileReader.head
  + pandas.io.parsers.readers.TextFileReader.iloc
  + pandas.io.parsers.readers.TextFileReader.isin
  + pandas.io.parsers.readers.TextFileReader.iterrows
  + pandas.io.parsers.readers.TextFileReader.loc
  + pandas.io.parsers.readers.TextFileReader.merge
  + pandas.io.parsers.readers.TextFileReader.rename
  + pandas.io.parsers.readers.TextFileReader.shape
  + pandas.io.parsers.readers.TextFileReader.to_csv
  + pandas.io.parsers.readers.TextFileReader.to_excel
  + pandas.io.parsers.readers.TextFileReader.unique
  + pandas.io.parsers.readers.TextFileReader.values
  + pandas.tseries.offsets

## Version 2.2.3 (Oct 24, 2024)

### Application Version 2.2.3

#### Included SMA Core Versions

* Snowpark Conversion Core 4.10.0

#### Desktop App

#### Fixed

* Fixed a bug that caused the SMA to show the label **SnowConvert** instead of **Snowpark Migration Accelerator** in the menu bar of the Windows version.
* Fixed a bug that caused the SMA to crash when it did not have read and write permissions to the `.config` directory in macOS and the `AppData` directory in Windows.

#### Command Line Interface

**Changed**

* Renamed the CLI executable name from `snowct` to `sma`.
* Removed the source language argument so you no longer need to specify if you are running a Python or Scala assessment / conversion.
* Expanded the command line arguments supported by the CLI by adding the following new arguments:

  + `--enableJupyter` | `-j`: Flag to indicate if the conversion of Databricks notebooks to Jupyter is enabled or not.
  + `--sql` | `-f`: Database engine syntax to be used when a SQL command is detected.
  + `--customerEmail` | `-e`: Configure the customer email.
  + `--customerCompany` | `-c`: Configure the customer company.
  + `--projectName` | `-p`: Configure the customer project.
* Updated some texts to reflect the correct name of the application, ensuring consistency and clarity in all the messages.
* Updated the terms of use of the application.
* Updated and expanded the documentation of the CLI to reflect the latests features, enhancements and changes.
* Updated the text that is shown before proceeding with the execution of the SMA to improve
* Updated the CLI to accept **“Yes”** as a valid argument when prompting for user confirmation.
* Allowed the CLI to continue the execution without waiting for user interaction by specifying the argument `-y` or `--yes`.
* Updated the help information of the `--sql` argument to show the values that this argument expects.

### Snowpark Conversion Core Version 4.10.0

#### Added

* Added a new EWI for the `pyspark.sql.readwriter.DataFrameWriter.partitionBy` function. All the usages of this function will now have the EWI SPRKPY1081.
* Added a new column called `Technology` to the `ImportUsagesInventory.csv` file.

#### Changed

* Updated the Third-Party Libraries readiness score to also take into account the `Unknown` libraries.
* Updated the `AssessmentFiles.zip` file to include `.json` files instead of `.pam` files.
* Improved the CSV to JSON conversion mechanism to make processing of inventories more performant.
* Improved the documentation of the following EWIs:

  + SPRKPY1029
  + SPRKPY1054
  + SPRKPY1055
  + SPRKPY1063
  + SPRKPY1075
  + SPRKPY1076
* Updated the mapping status of the following Spark Scala elements from `Direct` to `Rename`.

  + `org.apache.spark.sql.functions.shiftLeft` => `com.snowflake.snowpark.functions.shiftleft`
  + `org.apache.spark.sql.functions.shiftRight` => `com.snowflake.snowpark.functions.shiftright`
* Updated the mapping status of the following Spark Scala elements from `Not Supported` to `Direct`.

  + `org.apache.spark.sql.functions.shiftleft` => `com.snowflake.snowpark.functions.shiftleft`
  + `org.apache.spark.sql.functions.shiftright` => `com.snowflake.snowpark.functions.shiftright`

#### Fixed

* Fixed a bug that caused the SMA to incorrectly populate the `Origin` column of the `ImportUsagesInventory.csv` file.
* Fixed a bug that caused the SMA to not classify imports of the libraries `io`, `json`, `logging` and `unittest` as Python built-in imports in the `ImportUsagesInventory.csv` file and in the `DetailedReport.docx` file.

## Version 2.2.2 (Oct 11, 2024)

### Application Version 2.2.2

Features Updates include:

* Snowpark Conversion Core 4.8.0

### Snowpark Conversion Core Version 4.8.0

#### Added

* Added `EwiCatalog.csv` and .md files to reorganize documentation
* Added the mapping status of `pyspark.sql.functions.ln` Direct.
* Added a transformation for `pyspark.context.SparkContext.getOrCreate`

  + Check the EWI SPRKPY1080 for further details.
* Added an improvement for the SymbolTable, infer type for parameters in functions.
* Added SymbolTable supports static methods and do not assume the first parameter will be self for them.
* Added documentation for missing EWIs

  + SPRKHVSQL1005
  + SPRKHVSQL1006
  + SPRKSPSQL1005
  + SPRKSPSQL1006
  + SPRKSCL1002
  + SPRKSCL1170
  + SPRKSCL1171
  + SPRKPY1057
  + SPRKPY1058
  + SPRKPY1059
  + SPRKPY1060
  + SPRKPY1061
  + SPRKPY1064
  + SPRKPY1065
  + SPRKPY1066
  + SPRKPY1067
  + SPRKPY1069
  + SPRKPY1070
  + SPRKPY1077
  + SPRKPY1078
  + SPRKPY1079
  + SPRKPY1101

#### Changed

* Updated the mapping status of:

  + `pyspark.sql.functions.array_remove` from `NotSupported` to `Direct`.

#### Fixed

* Fixed the Code File Sizing table in the Detail Report to exclude .sql and .hql files and added the Extra Large row in the table.
* Fixed missing the `update_query_tag` when `SparkSession` is defined into multiple lines on `Python`.
* Fixed missing the `update_query_tag` when `SparkSession` is defined into multiple lines on `Scala`.
* Fixed missing EWI `SPRKHVSQL1001` to some SQL statements with parsing errors.
* Fixed keep new lines values inside string literals
* Fixed the Total Lines of code showed in the File Type Summary Table
* Fixed Parsing Score showed as 0 when recognize files successfully
* Fixed LOC count in the cell inventory for Databricks Magic SQL Cells

## Version 2.2.0 (Sep 26, 2024)

### Application Version 2.2.0

Feature Updates include:

* Snowpark Conversion Core 4.6.0

### Snowpark Conversion Core Version 4.6.0

#### Added

* Add transformation for `pyspark.sql.readwriter.DataFrameReader.parquet`.
* Add transformation for `pyspark.sql.readwriter.DataFrameReader.option` when it is a Parquet method.

#### Changed

* Updated the mapping status of:

  + `pyspark.sql.types.StructType.fields` from `NotSupported` to `Direct`.
  + `pyspark.sql.types.StructType.names` from `NotSupported` to `Direct`.
  + `pyspark.context.SparkContext.setLogLevel` from `Workaround` to `Transformation`.
    - More detail can be found in EWIs SPRKPY1078 and SPRKPY1079
  + `org.apache.spark.sql.functions.round` from `WorkAround` to `Direct`.
  + `org.apache.spark.sql.functions.udf` from `NotDefined` to `Transformation`.
    - More detail can be found in EWIs SPRKSCL1174 and SPRKSCL1175
* Updated the mapping status of the following Spark elements from `DirectHelper` to `Direct`:

  + `org.apache.spark.sql.functions.hex`
  + `org.apache.spark.sql.functions.unhex`
  + `org.apache.spark.sql.functions.shiftleft`
  + `org.apache.spark.sql.functions.shiftright`
  + `org.apache.spark.sql.functions.reverse`
  + `org.apache.spark.sql.functions.isnull`
  + `org.apache.spark.sql.functions.unix_timestamp`
  + `org.apache.spark.sql.functions.randn`
  + `org.apache.spark.sql.functions.signum`
  + `org.apache.spark.sql.functions.sign`
  + `org.apache.spark.sql.functions.collect_list`
  + `org.apache.spark.sql.functions.log10`
  + `org.apache.spark.sql.functions.log1p`
  + `org.apache.spark.sql.functions.base64`
  + `org.apache.spark.sql.functions.unbase64`
  + `org.apache.spark.sql.functions.regexp_extract`
  + `org.apache.spark.sql.functions.expr`
  + `org.apache.spark.sql.functions.date_format`
  + `org.apache.spark.sql.functions.desc`
  + `org.apache.spark.sql.functions.asc`
  + `org.apache.spark.sql.functions.size`
  + `org.apache.spark.sql.functions.locate`
  + `org.apache.spark.sql.functions.ntile`

#### Fixed

* Fixed value showed in the Percentage of total Pandas Api
* Fixed Total percentage on ImportCalls table in the DetailReport

### Deprecated

* Deprecated the following EWI code:

  + SPRKSCL1115

## Version 2.1.7 (Sep 12, 2024)

### Application Version 2.1.7

Feature Updates include:

* Snowpark Conversion Core 4.5.7
* Snowpark Conversion Core 4.5.2

### Snowpark Conversion Core Version 4.5.7

#### Hotfixed

* Fixed Total row added on Spark Usages Summaries when there are not usages
* Bumped of Python Assembly to Version=:code:1.3.111

  + Parse trail comma in multiline arguments

### Snowpark Conversion Core Version 4.5.2

#### Added

* Added transformation for `pyspark.sql.readwriter.DataFrameReader.option`:

  + When the chain is from a CSV method call.
  + When the chain is from a JSON method call.
* Added transformation for `pyspark.sql.readwriter.DataFrameReader.json`.

#### Changed

* Executed SMA on SQL strings passed to Python/Scala functions

  + Create AST in Scala/Python to emit temporary SQL unit
  + Create SqlEmbeddedUsages.csv inventory
  + Deprecate SqlStatementsInventroy.csv and SqlExtractionInventory.csv
  + Integrate EWI when the SQL literal could not be processed
  + Create new task to process SQL-embedded code
  + Collect info for SqlEmbeddedUsages.csv inventory in Python
  + Replace SQL transformed code to Literal in Python
  + Update test cases after implementation
  + Create table, views for telemetry in SqlEmbeddedUsages inventory
  + Collect info for SqlEmbeddedUsages.csv report in Scala
  + Replace SQL transformed code to Literal in Scala
  + Check line number order for Embedded SQL reporting
* Filled the `SqlFunctionsInfo.csv` with the SQL functions documented for SparkSQL and HiveSQL
* Updated the mapping status for:

  + `org.apache.spark.sql.SparkSession.sparkContext` from NotSupported to Transformation.
  + `org.apache.spark.sql.Builder.config` from `NotSupported` to `Transformation`. With this new mapping status, the SMA will remove all the usages of this function from the source code.

## Version 2.1.6 (Sep 5, 2024)

### Application Version 2.1.6

* Hotfix change for Snowpark Engines Core version 4.5.1

### Spark Conversion Core Version 4.5.1

**Hotfix**

* Added a mechanism to convert the temporal Databricks notebooks generated by SMA in exported Databricks notebooks

## Version 2.1.5 (Aug 29, 2024)

### Application Version 2.1.5

Feature Updates include:

* Updated Spark Conversion Core: 4.3.2

### Spark Conversion Core Version 4.3.2

#### Added

* Added the mechanism (via decoration) to get the line and the column of the elements identified in notebooks cells
* Added an EWI for pyspark.sql.functions.from_json.
* Added a transformation for pyspark.sql.readwriter.DataFrameReader.csv.
* Enabled the query tag mechanism for Scala files.
* Added the Code Analysis Score and additional links to the Detailed Report.
* Added a column called OriginFilePath to InputFilesInventory.csv

#### Changed

* Updated the mapping status of pyspark.sql.functions.from_json from Not Supported to Transformation.
* Updated the mapping status of the following Spark elements from Workaround to Direct:

  + org.apache.spark.sql.functions.countDistinct
  + org.apache.spark.sql.functions.max
  + org.apache.spark.sql.functions.min
  + org.apache.spark.sql.functions.mean

#### Deprecated

* Deprecated the following EWI codes:

  + SPRKSCL1135
  + SPRKSCL1136
  + SPRKSCL1153
  + SPRKSCL1155

#### Fixed

* Fixed a bug that caused an incorrect calculation of the Spark API score.
* Fixed an error that avoid copy SQL empty or commented files in the output folder.
* Fixed a bug in the DetailedReport, the notebook stats LOC and Cell count is not accurate.

## Version 2.1.2 (Aug 14, 2024)

### Application Version 2.1.2

Feature Updates include:

* Updated Spark Conversion Core: 4.2.0

### Spark Conversion Core Version 4.2.0

#### Added

* Add technology column to SparkUsagesInventory
* Added an EWI for not defined SQL elements .
* Added SqlFunctions Inventory
* Collect info for SqlFunctions Inventory

#### Changed

* The engine now processes and prints partially parsed Python files instead of leaving original file without modifications.
* Python notebook cells that have parsing errors will also be processed and printed.

#### Fixed

* Fixed `pandas.core.indexes.datetimes.DatetimeIndex.strftime` was being reported wrongly.
* Fix mismatch between SQL readiness score and SQL Usages by Support Status.
* Fixed a bug that caused the SMA to report `pandas.core.series.Series.empty` with an incorrect mapping status.
* Fix mismatch between Spark API Usages Ready for Conversion in DetailedReport.docx is different than UsagesReadyForConversion row in Assessment.json.

## Version 2.1.1 (Aug 8, 2024)

### Application Version 2.1.1

Feature Updates include:

* Updated Spark Conversion Core: 4.1.0

### Spark Conversion Core Version 4.1.0

#### Added

* Added the following information to the `AssessmentReport.json` file

  + The third-party libraries readiness score.
  + The number of third-party library calls that were identified.
  + The number of third-party library calls that are supported in Snowpark.
  + The color code associated with the third-party readiness score, the Spark API readiness score, and the SQL readiness score.
* Transformed `SqlSimpleDataType` in Spark create tables.
* Added the mapping of `pyspark.sql.functions.get` as direct.
* Added the mapping of `pyspark.sql.functions.to_varchar` as direct.
* As part of the changes after unification, the tool now generates an execution info file in the Engine.
* Added a replacer for `pyspark.sql.SparkSession.builder.appName`.

#### Changed

* Updated the mapping status for the following Spark elements

  + From Not Supported to Direct mapping:
    - `pyspark.sql.functions.sign`
    - `pyspark.sql.functions.signum`
* Changed the Notebook Cells Inventory report to indicate the kind of content for every cell in the column Element
* Added a `SCALA_READINESS_SCORE` column that reports the readiness score as related only to references to the Spark API in Scala files.
* Partial support to transform table properties in `ALTER TABLE` and `ALTER VIEW`
* Updated the conversion status of the node `SqlSimpleDataType` from Pending to Transformation in Spark create tables
* Updated the version of the Snowpark Scala API supported by the SMA from `1.7.0` to `1.12.1`:

  + Updated the mapping status of:
    - `org.apache.spark.sql.SparkSession.getOrCreate` from Rename to Direct
    - `org.apache.spark.sql.functions.sum` from Workaround to Direct
* Updated the version of the Snowpark Python API supported by the SMA from `1.15.0` to `1.20.0`:

  + Updated the mapping status of:
    - `pyspark.sql.functions.arrays_zip` from Not Supported to Direct
* Updated the mapping status for the following Pandas elements:

  + Direct mappings:
    - `pandas.core.frame.DataFrame.any`
    - `pandas.core.frame.DataFrame.applymap`
* Updated the mapping status for the following Pandas elements:

  + From Not Supported to Direct mapping:
    - `pandas.core.frame.DataFrame.groupby`
    - `pandas.core.frame.DataFrame.index`
    - `pandas.core.frame.DataFrame.T`
    - `pandas.core.frame.DataFrame.to_dict`
  + From Not Supported to Rename mapping:
    - `pandas.core.frame.DataFrame.map`
* Updated the mapping status for the following Pandas elements:

  + Direct mappings:
    - `pandas.core.frame.DataFrame.where`
    - `pandas.core.groupby.generic.SeriesGroupBy.agg`
    - `pandas.core.groupby.generic.SeriesGroupBy.aggregate`
    - `pandas.core.groupby.generic.DataFrameGroupBy.agg`
    - `pandas.core.groupby.generic.DataFrameGroupBy.aggregate`
    - `pandas.core.groupby.generic.DataFrameGroupBy.apply`
  + Not Supported mappings:
    - `pandas.core.frame.DataFrame.to_parquet`
    - `pandas.core.generic.NDFrame.to_csv`
    - `pandas.core.generic.NDFrame.to_excel`
    - `pandas.core.generic.NDFrame.to_sql`
* Updated the mapping status for the following Pandas elements:

  + Direct mappings:
    - `pandas.core.series.Series.empty`
    - `pandas.core.series.Series.apply`
    - `pandas.core.reshape.tile.qcut`
  + Direct mappings with EWI:
    - `pandas.core.series.Series.fillna`
    - `pandas.core.series.Series.astype`
    - `pandas.core.reshape.melt.melt`
    - `pandas.core.reshape.tile.cut`
    - `pandas.core.reshape.pivot.pivot_table`
* Updated the mapping status for the following Pandas elements:

  + Direct mappings:
    - `pandas.core.series.Series.dt`
    - `pandas.core.series.Series.groupby`
    - `pandas.core.series.Series.loc`
    - `pandas.core.series.Series.shape`
    - `pandas.core.tools.datetimes.to_datetime`
    - `pandas.io.excel._base.ExcelFile`
  + Not Supported mappings:
    - `pandas.core.series.Series.dt.strftime`
* Updated the mapping status for the following Pandas elements:

  + From Not Supported to Direct mapping:
    - `pandas.io.parquet.read_parquet`
    - `pandas.io.parsers.readers.read_csv`
* Updated the mapping status for the following Pandas elements:

  + From Not Supported to Direct mapping:
    - `pandas.io.pickle.read_pickle`
    - `pandas.io.sql.read_sql`
    - `pandas.io.sql.read_sql_query`
* Updated the description of Understanding the SQL Readiness Score.
* Updated `PyProgramCollector` to collect the packages and populate the current packages inventory with data from Python source code.
* Updated the mapping status of `pyspark.sql.SparkSession.builder.appName` from Rename to Transformation.
* Removed the following Scala integration tests:

  + `AssesmentReportTest_AssessmentMode.ValidateReports_AssessmentMode`
  + `AssessmentReportTest_PythonAndScala_Files.ValidateReports_PythonAndScala`
  + `AssessmentReportTestWithoutSparkUsages.ValidateReports_WithoutSparkUsages`
* Updated the mapping status of `pandas.core.generic.NDFrame.shape` from Not Supported to Direct.
* Updated the mapping status of `pandas.core.series` from Not Supported to Direct.

#### Deprecated

* Deprecated the EWI code `SPRKSCL1160` since `org.apache.spark.sql.functions.sum` is now a direct mapping.

#### Fixed

* Fixed a bug by not supporting Custom Magics without arguments in Jupyter Notebook cells.
* Fixed incorrect generation of EWIs in the issues.csv report when parsing errors occur.
* Fixed a bug that caused the SMA not to process the Databricks exported notebook as Databricks notebooks.
* Fixed a stack overflow error while processing clashing type names of declarations created inside package objects.
* Fixed the processing of complex lambda type names involving generics, e.g., `def func[X,Y](f: (Map[Option[X], Y] => Map[Y, X]))...`
* Fixed a bug that caused the SMA to add a PySpark EWI code instead of a Pandas EWI code to the Pandas elements that are not yet recognized.
* Fixed a typo in the detailed report template: renaming a column from “Percentage of all Python Files” to “Percentage of all files”.
* Fixed a bug where `pandas.core.series.Series.shape` was wrongly reported.

---
title: Snowpark Migration Accelerator: Roadmap
source: https://docs.snowflake.com/en/migrations/sma-docs/general/roadmap.md
section: Migrations
---

# Snowpark Migration Accelerator: Roadmap

The SMA team continuously enhances and updates the tool. You can track these improvements in the [Release Notes](release-notes/README.md).

The SMA team continuously improves the tool based on user feedback. This roadmap outlines current and future enhancements, and will be updated regularly.

## Coming Soon

* Initial support for converting SparkML code to SnowparkML
* Initial support for converting Pandas and PySpark Pandas code to Snowpark
* Better organization of Python import statements
* Step-by-step conversion guide in the documentation

## Planned

* Enhanced assessment metrics beyond the Readiness Score, including:

  + Compatibility indicators for third-party libraries
  + Code parsing accuracy measurement
  + SQL conversion accuracy measurement
* Comprehensive Airflow analysis and inventory tools
* Automated conversion of Spark SQL to Snowflake SQL within Python and Scala code and notebooks
* Automated conversion of HiveQL to Snowflake SQL within Python and Scala code and notebooks
* Integration with Workspace Estimator (WE) calculator
* EMR and Cloudera analysis capabilities in Workspace Estimator (WE)
* SMA deployment option within your Snowflake account
* Interactive Assessment Application for analyzing SMA output

## Feature Requests

We welcome your suggestions for improving the Snowpark Migration Accelerator (SMA). Here’s how you can share your feedback:

* Post your question in the [Spark Migration forum in the Snowflake Community](https://community.snowflake.com/s/topic/0TO3r000000bskWGAQ/spark-migrations).
* Report an issue or request a new feature [in the SMA](../user-guide/project-overview/configuration-and-settings.md).
* Send an email to [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

We welcome your feedback on how we can enhance this tool. Please share your suggestions and comments.

---
title: Snowpark Migration Accelerator: Running the Tool
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/running-the-tool.md
section: Migrations
---

# Snowpark Migration Accelerator: Running the Tool

Now that you have installed the Snowpark Migration Accelerator (SMA) and prepared your codebase, you can begin the execution process. Return to the SMA application if it’s still open, or launch it if you’ve closed it.

## Project Setup

When you first open the SMA, the project page is shown.

From the menu, select “New Project” to begin. If you have already created a project for this walkthrough, you can access it by selecting “Open Project” instead.

The “Project Creation” page allows you to create a new project file, which is essential for both assessment and code conversion tasks in SMA. The project file (with a `.snowct` extension) is stored in your selected output directory and keeps track of all your SMA executions. If you want to link multiple executions together, you can reopen an existing project file. All project information is saved both on your local machine and in the shared database. For more details about projects, see [the “project” file](../../user-guide/project-overview/project-setup.md).

All fields shown are required for configuring the assessment tool and managing the project after running the analysis.

1. **Project name**: This is the name for your project file. Multiple executions can be connected to a single project as well as any settings you save. You can learn more about the project file below.
2. **Email address**: This email address identifies the user of the tool. This should be the user of the tool, not the owner of the codebase being scanned.
3. **Company name**: This is to help you specify the organization’s code you are working with. If you are running your own code, then put your own organization here. If you are working with another organization, then put that organization name here.
4. **Input folder**: Specify the directory where your source codebase is located.
5. **Output folder**: The directory where the output files (logs, reports, code) will be placed.

For this walkthrough, we will use the “Spark Data Engineering Examples” codebase. You can find it in the [sample codebases section](walkthrough-setup/README.md). Follow these steps:

1. Download and unzip the codebase
2. Locate the root directory containing all files - this will be your input directory
3. Choose any project name you prefer
4. Select an output directory (the tool will suggest a default location, but you can change it as needed)

Before starting the assessment, make sure your input directory contains the correct source code files with the proper file extensions, as explained in the [code preparation](walkthrough-setup/notes-on-code-preparation.md) section.

When you are ready to begin, click **Save** to save your project.

After you save, the SMA takes you to the project home page. Select the **Code Process** tile to start the guided assessment or conversion workflow:

### Execution and Assessment Output

When you start the assessment process, SMA analyzes your source code in three steps:

1. First, it performs a basic scan to create an inventory of all files and keywords in your codebase.
2. Then, it parses the code according to your source language and creates a semantic model that represents the code’s functionality.
3. Finally, it uses this model to generate detailed information, including the [Spark Reference Inventory](../../user-guide/scos-conversion/output-reports/sma-inventories.md) and [Import Library Analysis](../../user-guide/scos-conversion/output-reports/sma-inventories.md). It also produces the converted code.

During this process, you will see three progress indicators on the screen:

* Loading Source Code
* Analyzing Source Code
* Writing Results

These indicators will light up as each step is completed.

After the analysis is complete, the SMA automatically shows the Assessment Results page where you can see the analysis output.

---
title: Snowpark Migration Accelerator: SCOS Conversion
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SCOS Conversion

The Snowpark Migration Accelerator (SMA) can convert specific elements from your source code into Snowpark Connect (SCOS) compatible formats. Understanding the conversion process is essential to maximize its benefits and ensure successful migration.

This section covers the following topics:

* [How the Conversion Works](how-the-conversion-works.md)
* [Conversion Quick Start](conversion-quick-start.md)
* [Understanding the Conversion Summary](understanding-the-conversion-summary.md)
* [Understanding Readiness Scores](readiness-scores.md)
* [Detailed Guide to Output Reports](output-reports/README.md)
* [Spark Reference Categories Guide](spark-reference-categories.md)
* [Understanding Output Logs](output-logs.md)

The following additional topics may be helpful for reference:

* [Issue Analysis](../../issue-analysis/approach.md)
* [Glossary](../../support/glossary.md)

---
title: Snowpark Migration Accelerator: SCOS Conversion Quick Start
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/conversion-quick-start.md
section: Migrations
---

# Snowpark Migration Accelerator: SCOS Conversion Quick Start

The Snowpark Migration Accelerator (SMA) helps you convert your source code to Snowpark Connect (SCOS) compatible formats. This guide will show you how to begin the conversion process.

## How to Execute a Conversion

Run the conversion process by selecting the **Convert to Snowpark Connect** card on the project home page.

## Next Steps

After the tool completes its analysis, review the results and determine your next steps. The following tips can help guide you:

* Consider the Readiness Score as an Initial Guide: While the readiness score evaluates Snowpark Connect compatibility, it is important to understand that successful migration depends on multiple factors. These include compatibility with third-party libraries and whether Snowpark Connect is the optimal solution for your specific workload.
* Take Time to Analyze the Conversion Results: The conversion results provide valuable insights that can help you create an effective migration strategy. Carefully review the data before proceeding to avoid unnecessary rework and ensure a more efficient migration.

Additional options are available in the application menu, as shown in the image below:

* **Retry Conversion** - You can run the conversion again by clicking the **Retry Conversion** button on the Conversion Results page. This is useful when you’ve made changes to your source code and want to see updated results.
* **View Reports** - Opens the folder containing conversion output reports. These include the detailed conversion report, Spark reference inventory, and other analyses of your source codebase. Each report type is explained in detail in this documentation.

For a detailed review of the conversion summary information, continue reading.

---
title: Snowpark Migration Accelerator: SELECT
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-dml/select/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SELECT

A SELECT statement offers multiple options to enhance your query results. While some of these options are directly compatible with Snowflake, others may require conversion or modification to work properly.

Here’s a list of available parameters you can use:

* [DISTINCT](distinct.md)
* [VALUES](values.md)
* [JOIN](join.md)
* [WHERE](where.md)
* [GROUP BY](group-by.md)
* [UNION](union.md)

Let’s examine each of these items in detail.

---
title: Snowpark Migration Accelerator: Sharing the Output with Snowflake
source: https://docs.snowflake.com/en/migrations/sma-docs/support/frequently-asked-questions-faq/sharing-the-output-with-snowflake.md
section: Migrations
---

# Snowpark Migration Accelerator: Sharing the Output with Snowflake

The Snowpark Migration Accelerator (SMA) is a standalone tool created by Snowflake. While it operates independently from the main Snowflake Service, it gathers basic usage statistics about how the tool is used. Rest assured, SMA does not collect or transmit any details about your source code.

## Sharing a Report

To share your locally generated code or reports with the SMA team (for troubleshooting, product improvement, or detailed analysis), you can use these methods:

* **In the tool** - Use the [Report an Issue](../../user-guide/project-overview/configuration-and-settings.md) feature to notify the SMA support team. You can select specific information to share, attach files, and provide a description of your issue or inquiry.
* **Explicitly (by email)** - Send an email directly with your files and information. This is the most straightforward method.

Before sharing any report, you must complete some initial steps. Let’s demonstrate this process using the [PandasUsagesInventory.csv](../../user-guide/scos-conversion/output-reports/sma-inventories.md) report as an example.

1. **Run the SMA Assessment or Conversion** - If you need help with code conversion or want to analyze your code, visit the [getting started](../../general/getting-started/README.md) guide to learn how to set up and use the Snowpark Migration Accelerator (SMA).
2. **Locate Your Files** - Find the files you want to share. These are typically in the Reports directory for assessment results, or in the Output directory for converted code.

3. **Select the file(s) you want to share** - For multiple files, we recommend creating a zip file instead of sharing individual files separately.
4. **Share the file(s)** - You can share files in two ways:
   1. Using the tool: Access the [Report an Issue](../../user-guide/project-overview/configuration-and-settings.md) option from the help menu. Add a brief description explaining the shared content and attach your file(s).

   2. Via Email: Send your file(s) to [sma-info@snowflake.com](mailto:sma-info%40snowflake.com) along with an explanation of why you’re sharing them.

And that’s all there is to it - it’s as straightforward as browsing through folders on your computer.

---
title: Snowpark Migration Accelerator: SMA Checkpoints walkthrough
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SMA Checkpoints walkthrough

The Snowpark Migration Accelerator provides the SMA-Checkpoints feature, this feature goal is to be used within the checkpoints feature of the Snowflake Extension. SMA Checkpoints can be either enabled or disabled in order to generate a file named checkpoints.json which can be used by the extension to insert the checkpoints in the corresponding location based on the analysis made by the SMA.

## Feature limitations

1. Dbx notebooks are not supported by checkpoints.

## Known Issues

1. Jupyter notebooks checkpoints are being generated in wrong locations, it is possible that those checkpoints do not work as expected.
2. Entry points by imports are not being identified.

**Note:** This guide is closely related to the Snowpark Checkpoints framework. For more information, see [Snowpark Checkpoints](../../../../developer-guide/snowpark/python/snowpark-checkpoints-library.md).

---
title: Snowpark Migration Accelerator: SMA CLI Walkthrough
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-cli-walkthrough.md
section: Migrations
---

# Snowpark Migration Accelerator: SMA CLI Walkthrough

The [Snowpark Migration Accelerator (SMA)](https://www.snowflake.com/en/data-cloud/snowpark/migration-accelerator/) helps developers migrate their Python or Scala Spark code to Snowpark. It analyzes your code and:

1. Evaluates compatibility with Snowpark
2. Automatically converts compatible Spark API calls to Snowpark API
3. Identifies code that cannot be automatically converted
4. Creates an inventory of third-party library imports from scripts and notebooks
5. Generates an editable compatibility report comparing Spark and Snowpark code

Snowflake has released a Command Line Interface (CLI) for the Snowpark Migration Accelerator (SMA). This guide will demonstrate how to use the CLI both as a standalone tool and within a script.

## Using the CLI

You can download the Command Line Interface (CLI) from [the Download and Access section](../general/getting-started/download-and-access.md). Select the version that matches your operating system. You can store the CLI in any accessible location on your machine or container.

> **Note:**
>
> **NOTE**: While this walkthrough uses screenshots from a Mac computer, the process is similar for Windows and Linux users.

After downloading the package file (.zip or .tar format), extract its contents. The Command Line Interface (CLI) tool is located in the “orchestrator” folder within the extracted files.

Open a terminal or command prompt in the installation folder and verify the CLI installation by running the following command to check its version:

./sma –version

You will see results that look like this:

The SMA Command Line Interface (CLI) is a local application that runs on your computer, similar to the SMA desktop application. To analyze your code files using the SMA CLI, these files must be stored on your local machine where the CLI can access them. The CLI supports the same file types as the regular SMA application. For a complete list of supported file types, refer to [the supported filetypes in the SMA documentation](../user-guide/before-using-the-sma/supported-filetypes.md).

> **Note:**
>
> **NOTE**: To test the CLI functionality, you can use the sample codebase provided in [the Assessment](assessment-walkthrough/walkthrough-setup/README.md) section or refer to the Conversion walkthroughs in the SMA documentation.

The SMA documentation contains a complete list of CLI arguments. Let’s explore the most important ones in this section.

The SMA CLI runs in [Conversion mode](../user-guide/snowpark-api-conversion/README.md) by default, rather than [Assessment mode](../user-guide/assessment/README.md). To run the CLI in assessment mode, use [the -a argument](assessment-walkthrough/README.md). For conversion operations, you’ll need a valid access code. To verify if you have a valid access code, use the following command:

```bash
./sma show-ac
```

To run a conversion, you need to provide:

1. Input directory (required)
2. Output directory (required)

If you haven’t created a project file before, you’ll also need to provide:

* User email
* Organization name
* Project name

Once you’ve set up these parameters for the first time, you only need to specify the input and output directories for future conversions.

```bash
./sma -i '/your/INput/directory/path/here' -o '/your/OUTput/directory/path/here' -e your@email.com -c Your-Organization -p Your-Project-Name
```

This screen displays a summary of your execution settings and prompts you to confirm whether you want to proceed.

To skip the confirmation prompt, add the –yes or -y parameter. This is particularly important when running the CLI from automated scripts.

The tool provides detailed progress information during its execution.

While the tool is running, it will continuously print output to the screen. When the process is complete, you will see the prompt again. The tool generates detailed output that includes all processes, issues, and completed or failed steps. You don’t need to read through all of this information while it’s running, as you can review it later in [the Logs output folder](../user-guide/scos-conversion/output-logs.md).

## Viewing the Output

The SMA CLI produces the same output as the SMA application. When you run the tool, it creates three folders in your specified output directory:

* [Reports](../user-guide/scos-conversion/output-reports/README.md)
* [Logs](../user-guide/scos-conversion/output-logs.md)
* Output (contains the converted code)

For detailed guidance on working with code that has been converted by the Snowpark Migration Accelerator (SMA), refer to [the conversion walkthrough](conversion-walkthrough.md).

## Using the Workspace Estimator

The SMA CLI includes a [Workspace Estimator](../workspace-estimator/overview.md) verb (`we` or `workspace-estimator`) that connects to a Databricks workspace, extracts metadata such as clusters, jobs, and runs, and optionally uploads the results to Snowflake for analysis.

### Command hierarchy

The Workspace Estimator currently supports Databricks workspaces through the `dbx` subcommand. Running `sma we dbx` without a subcommand displays help listing the available subcommands:

* `sma we dbx run` – Runs both extraction and upload in a single invocation.
* `sma we dbx extract` – Extracts workspace metadata to a local `.zip` file only.
* `sma we dbx upload` – Uploads a previously extracted `.zip` file to Snowflake.

### Authentication

A Databricks Personal Access Token (PAT) is required for extraction. You can supply it in one of two ways:

1. Pass it directly with `-t` / `--token`.
2. Set the `SMA_DBX_TOKEN` environment variable. If `--token` is omitted, the CLI defaults to this variable.

### Running extraction and upload together

When you provide all required options to `sma we dbx run`, the CLI extracts workspace metadata and uploads the resulting `.zip` file in a single step.

```bash
./sma we dbx run \
  -w https://adb-1234567890.azuredatabricks.net/ \
  -o ~/output/workspace-estimator \
  -n analytics-workspace \
  -c Example-Inc \
  -e user@example.com
```

The following table lists the available options for `sma we dbx run`:

| Option | Short | Required | Default | Description |
| --- | --- | --- | --- | --- |
| `--workspace-url` | `-w` | Yes | – | Databricks workspace URL (e.g. `https://adb-1234.azuredatabricks.net`). |
| `--token` | `-t` | No | `SMA_DBX_TOKEN` env var | Databricks Personal Access Token (PAT). |
| `--output-dir` | `-o` | No | Current directory | Directory where the extraction `.zip` file will be written. |
| `--workspace-name` | `-n` | Yes | – | Logical name for the workspace. Cannot be empty or whitespace. |
| `--lookback-days` | `-l` | No | 30 | Number of days to look back for cluster events (15, 30, or 60). |
| `--log-level` | – | No | Information | Minimum log level for diagnostic output (Trace, Debug, Information, Warning, Error, Critical). |
| `--company-name` | `-c` | Yes | – | Company name for this estimation. Cannot be empty or whitespace. |
| `--email` | `-e` | Yes | – | Email of the person performing the estimation. Must be a valid email address. |

### Running extraction only

To extract workspace metadata without uploading, use the `extract` subcommand. This produces a `.zip` file in the specified output directory.

```bash
./sma we dbx extract \
  -w https://adb-1234567890.azuredatabricks.net/ \
  -n analytics-workspace \
  -o ~/output/workspace-estimator
```

The `extract` subcommand accepts the same options listed in the `run` table above except `--company-name` and `--email`, which are only required for upload.

### Uploading a previously extracted file

If you have already extracted workspace metadata to a `.zip` file, you can upload it separately using the `upload` subcommand.

```bash
./sma we dbx upload \
  -i ~/output/workspace-estimator/workspace-extraction.zip \
  -n analytics-workspace \
  -c Example-Inc \
  -e user@example.com
```

The following table lists the available options for `sma we dbx upload`:

| Option | Short | Required | Default | Description |
| --- | --- | --- | --- | --- |
| `--input-zip` | `-i` | Yes | – | Path to an existing `.zip` file to upload. The file must have a `.zip` extension. |
| `--company-name` | `-c` | Yes | – | Company name for this estimation. Cannot be empty or whitespace. |
| `--email` | `-e` | Yes | – | Email of the person performing the estimation. Must be a valid email address. |
| `--workspace-name` | `-n` | Yes | – | Logical name for the workspace. Cannot be empty or whitespace. |
| `--log-level` | – | No | Information | Minimum log level for diagnostic output (Trace, Debug, Information, Warning, Error, Critical). |

## Running the CLI Programmatically

Coming soon! The SMA team will provide a script that enables you to run the SMA Command Line Interface (CLI) automatically across multiple directories.

---

Try out the Command Line Interface (CLI) today. If you need help or have questions, contact the Snowpark Migration Accelerator team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Snowpark Migration Accelerator: SMA Execution Guide
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/sma-execution-guide/README.md
section: Migrations
---

# Snowpark Migration Accelerator: SMA Execution Guide

## PySpark Input

The SMA-Checkpoints feature requires a PySpark workload as its entry point, since it depends on detecting the use of PySpark DataFrames. This walkthrough will guide you through the feature using a single Python script, providing a straightforward example of how checkpoints are generated and utilized within a typical PySpark workflow.

**Input workload**

**Sample.py file content**

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("SparkFunctionsExample2").getOrCreate()

df1 = spark.createDataFrame([("Alice", "NY"), ("Bob", "LA")], ["name", "city"])
df2 = spark.createDataFrame([(10,), (20,)], ["number"])

df1_with_index = df1.withColumn("index", F.monotonically_increasing_id())
df2_with_index = df2.withColumn("index", F.monotonically_increasing_id())

df3 = df1_with_index.join(df2_with_index, on="index").drop("index")
df3.show()
```

## **Migrating Workload**

### Feature Enabled

If the SMA-Checkpoints feature is enabled, a `checkpoints.json` file will be generated. If the feature is disabled, this file will not be created in either the input or output folders. Regardless of whether the feature is enabled, the following inventory files will always be generated: `DataFramesInventory.csv` and `CheckpointsInventory.csv`. These files provide metadata essential for analysis and debugging.

### Conversion Process

To create a convert your own project please follow up the following guide: [SMA User Guide](https://docs.snowconvert.com/sma/user-guide/overview).

#### SMA-Checkpoints Feature Settings

As part of the conversion process you can customize your conversion settings, take a look on the [SMA-Checkpoints](https://app.gitbook.com/o/-MB4z_O8Sl--Tfl3XVml/s/6on4bNAZUZGzMpdEum8X/~/changes/499/use-cases/sma-checkpoints-walkthrough/usage-guide/feature-settings) feature settings.

**Note:** This user guide used the default conversion settings.

### Conversion Results

Once the migration process is complete, the SMA-Checkpoints feature should have created two new inventory files and added a `checkpoints.json` file to both the input and output folders.

Take a look on [SMA-Checkpoints inventories](https://app.gitbook.com/o/-MB4z_O8Sl--Tfl3XVml/s/t950HWwa5FvNA71Qes8u/) to review the related inventories.

#### Input Folder

**checkpoints.json file content**

```json
{
  "createdBy": "Snowpark Migration Accelerator",
  "comment": "This file was automatically generated by the SMA tool as checkpoints collection was enabled in the tool settings. This file may also be modified or deleted during SMA execution.",
  "type": "Collection",
  "pipelines": [
    {
      "entryPoint": "sample.py",
      "checkpoints": [
        {
          "name": "sample$BBVOC7$df1$1",
          "file": "sample.py",
          "df": "df1",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        },
        {
          "name": "sample$BBVOC7$df2$1",
          "file": "sample.py",
          "df": "df2",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        },
        {
          "name": "sample$BBVOC7$df3$1",
          "file": "sample.py",
          "df": "df3",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        }
      ]
    }
  ]
}
```

#### Output Folder

**checkpoints.json file content**

```json
{
  "createdBy": "Snowpark Migration Accelerator",
  "comment": "This file was automatically generated by the SMA tool as checkpoints collection was enabled in the tool settings. This file may also be modified or deleted during SMA execution.",
  "type": "Validation",
  "pipelines": [
    {
      "entryPoint": "sample.py",
      "checkpoints": [
        {
          "name": "sample$BBVOC7$df1$1",
          "file": "sample.py",
          "df": "df1",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        },
        {
          "name": "sample$BBVOC7$df2$1",
          "file": "sample.py",
          "df": "df2",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        },
        {
          "name": "sample$BBVOC7$df3$1",
          "file": "sample.py",
          "df": "df3",
          "location": 1,
          "enabled": true,
          "mode": 1,
          "sample": "1.0"
        }
      ]
    }
  ]
}
```

Once the SMA execution flow is complete and both the input and output folders contain their respective `checkpoints.json` files, you are ready to begin the Snowpark-Checkpoints execution process.

---
title: Snowpark Migration Accelerator: SMA-Checkpoints inventories
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/sma-execution-guide/sma-checkpoints-inventories.md
section: Migrations
---

# Snowpark Migration Accelerator: SMA-Checkpoints inventories

The SMA-Checkpoints feature introduces two new inventory files: `CheckpointsInventory.csv` and `DataFramesInventory.csv`. These files are generated regardless of whether the feature is enabled.

**Checkpoints Inventory Sample**

```markdown
Name: sample$BBVOC7$df1$1
FileId: sample.py
CellId: 0
Line: 6
Column: 1
Type: Collection
DataFrameName: df1
Location: 1
Enabled: True
Mode: Schema
Sample: 1
EntryPoint: sample.py
ExecutionId: 00000000-0000-0000-0000-000000000000
```

**DataFrames Inventory Sample**

```markdown
FullName: TestingCheckpoints.sample.df1
Name: df1
FileId: sample.py
CellId: 0
Line: 6
Column: 1
AssignmentNumber: 1
RelevantFunction: pyspark.sql.session.SparkSession.createDataFrame
RelatedDataFrames:
EntryPoints: sample.py
ExecutionId: 00000000-0000-0000-0000-000000000000
```

---
title: Snowpark Migration Accelerator: Snowpark Connect
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/snowpark-connect/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Snowpark Connect

[Snowpark Connect for Spark](../../../../developer-guide/snowpark-connect/snowpark-connect-overview.md) allows you to run some Spark workflows in Snowflake with minimal changes.

---
title: Snowpark Migration Accelerator: Snowpark Connect Issue Codes for Python
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/python/snowpark-connect-codes-python.md
section: Migrations
---

# Snowpark Migration Accelerator: Snowpark Connect Issue Codes for Python

## SPRKCNTPY1000

**Message** The element < **element** > is not supported for Snowpark Connect

**Category** Conversion Error

### Description

This issue appears when the tool detects the usage of an element that is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for an unsupported element.

### Scenario

#### Input

The tool found an unidentified element of the pyspark library.

```python
from pyspark import NotSupportedElement
sc.addFile("data.txt")
print(NotSupportedElement.get("data.txt"))
```

#### Output

The tool adds the comment to the statement pointing to the unsupported element.

```python
from pyspark import NotSupportedElement
sc.addFile("data.txt")
# EWI SPRKCNTPY1000: The element 'NotSupportedElement' is not supported for Snowpark Connect
print(NotSupportedElement.get("data.txt"))
```

### Recommended fix

Since this is a generic error code that applies to a range of unsupported functions, there is not a single and specific fix. The appropriate action will depend on the particular element in use.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is some kind of workaround, please report that you encountered a conversion error on that particular element using the [Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY1001

`SparkSession` has been replaced with `Session` in Snowpark Connect.

**Message** The `SparkSession` creation has been transformed to use a Snowpark `Session` equivalent.

**Category** Warning.

### Description

This issue appears when the SMA detects the creation of a `SparkSession` object in the input code. Snowpark Connect uses a different object, called `Session`, to manage the connection to Snowflake and to create DataFrames.

When the SMA encounters the creation of a `SparkSession`, it adds this EWI to inform you that it has transformed the code to use a Snowpark Session instead.

### Scenario

#### Input

Below is an example of a Python `SparkSession` initialization which will be replaced for a Snowpark Connect `Session` initialization, and therefore it generates this EWI.

```python
spark = SparkSession.builder.getOrCreate()
```

#### Output

The SMA adds the EWI `SPRKCNTPY1001` to the output code to let you know that your `SparkSession` initialization has been replaced for a Snowpark Connect `Session` initialization.

```python
#EWI: SPRKCNTPY1001 => The creation of the SparkSession has been replaced with the creation of an equivalent Snowpark Connect Session.
spark = snowpark_connect.server.init_spark_session()
```

### Additional recommendations

* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). If you have a contract for support with Snowflake, reach out to your sales engineer, and they can direct your support needs.

## SPRKCNTPY1500

**Message** The element < **element** > of the library RDD is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of an RDD element is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for RDD unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `pyspark.rdd` library.

```python
from pyspark import SparkContext

sc = SparkContext("local", "Simple App")
data = [1, 2, 3, 4, 5]
rdd = sc.parallelize(data)
result = rdd.NotSupportedElement(withReplacement=False, fraction=0.5).collect()
print(result)
```

#### Output

The SMA adds the EWI `SPRKCNTPY1500` to the output code to let you know that this RDD element is not supported by Snowpark Connect.

```python
from pyspark import SparkContext

sc = SparkContext("local", "Simple App")
data = [1, 2, 3, 4, 5]
rdd = sc.parallelize(data)
# EWI SPRKCNTPY1500: The element 'NotSupportedElement' of the library RDD is not supported for Snowpark Connect
result = rdd.NotSupportedElement(withReplacement=False, fraction=0.5).collect()
print(result)
```

### Recommended fix

* Convert RDD operations to DataFrame operations.

  ```python
  # PySpark RDD approach (NOT supported)
  rdd = spark.sparkContext.parallelize(data)
  result = rdd.map(lambda x: x * 2).collect()

  # Snowpark DataFrame approach (Supported)
  df = session.create_dataframe(data, schema=["value"])
  result = df.select(col("value") * 2).collect()
  ```
* Process data locally before sending it to Snowflake if the RDD operations are simple.
* Use pandas DataFrames for local processing, then convert to Snowpark DataFrames.
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action will depend on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY2000

**Message** The element < **element** > of the library Streaming is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of a Streaming element is not supported in Snowpark Connect and does not have its own error code associated with it. This is the generic error code used by the SMA for Streaming unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `pyspark.streaming` library.

```python
from pyspark import SparkContext
from pyspark.streaming import NotSupportedElement

sc = SparkContext("local[2]", "NetworkWordCount")
ssc = NotSupportedElement(sc, 1)
```

#### Output

The SMA adds the EWI `SPRKCNTPY2000` to the output code to let you know that this Streaming element is not supported by Snowpark Connect.

```python
from pyspark import SparkContext
from pyspark.streaming import NotSupportedElement

sc = SparkContext("local[2]", "NetworkWordCount")
# EWI SPRKCNTPY2000: The element 'NotSupportedElement' of the library Streaming is not supported for Snowpark Connect
ssc = NotSupportedElement(sc, 1)
```

### Recommended fix

* Use Snowflake Streams to capture table changes.
* Process changes with Snowpark Connect in scheduled jobs.
* Ideal for maintaining derived tables.
* Since this is a generic error code that applies to a range of unsupported functions, there no single and specific fix. The appropriate action will depend on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY2500

**Message** The element < **element** > of the library ML is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines the usage instance of an ML element that is not supported in Snowpark Connect and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `pyspark.ml` library.

```python
from pyspark.ml.classification import NotSupportedElement

lr = NotSupportedElement(maxIter=10, regParam=0.01)
model = lr.fit(trainingData)
```

#### Output

The SMA adds the EWI `SPRKCNTPY2500` to the output code to let you know that this ML element is not supported by Snowpark Connect.

```python
from pyspark.ml.classification import NotSupportedElement

# EWI SPRKCNTPY2500: The element 'NotSupportedElement' of the library ML is not supported for Snowpark Connect
lr = NotSupportedElement(maxIter=10, regParam=0.01)
model = lr.fit(trainingData)
```

### Recommended fix

* Use the Snowpark ML library.

  ```python
  # Instead of PySpark ML
  from snowflake.ml.modeling.linear_model import LinearRegression
  from snowflake.ml.modeling.ensemble import RandomForestRegressor

  # Snowpark ML approach
  model = LinearRegression()
  model.fit(train_df)
  predictions = model.predict(test_df)
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action will depend on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is some kind of workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY3000

**Message** The element < **element** > of the library MLLIB is not supported for Snowpark Connect

**Category** Conversion Error

### Description

This issue appears when the tool determines the usage instance of an MLLIB element that is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `pyspark.mllib` library.

```python
from pyspark.mllib.recommendation import NotSupportedElement, Rating
ratings = [
    Rating(0, 0, 4.0),
    Rating(0, 1, 2.0),
    Rating(1, 1, 5.0)
]
model = NotSupportedElement.train(ratings, 10)
```

#### Output

The SMA adds the EWI `SPRKCNTPY3000` to the output code to let you know that this MLLIB element is not supported by Snowpark Connect.

```python
from pyspark.mllib.recommendation import NotSupportedElement, Rating

# EWI SPRKCNTPY3000: The element 'NotSupportedElement' of the library MLLIB is not supported for Snowpark Connect
ratings = [
    Rating(0, 0, 4.0),
    Rating(0, 1, 2.0),
    Rating(1, 1, 5.0)
]
model = NotSupportedElement.train(ratings, 10)
```

### Recommended fix

* Use the Snowpark ML library.

  ```python
  # Instead of MLlib's LinearRegressionWithSGD
  from snowflake.ml.modeling.linear_model import LinearRegression

  # MLlib approach (NOT supported)
  # from pyspark.mllib.regression import LinearRegressionWithSGD
  # model = LinearRegressionWithSGD.train(rdd_data)

  # Snowpark ML approach
  model = LinearRegression(input_cols=['feature1', 'feature2'], label_cols=['target'])
  model.fit(train_df)
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action will depend on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY3500

**Message** The element < **element** > of the library Spark Session is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of a Spark Session element is not supported in Snowpark Connect and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `pyspark.SparkSession` library.

```python
from pyspark.sql import SparkSession
from pyspark.sql.SparkSession import NotSupportedElement

spark = SparkSession.builder.appName("Example").getOrCreate()
new_spark = spark.NotSupportedElement()
```

#### Output

The SMA adds the EWI `SPRKCNTPY3500` to the output code to let you know that this Spark Session element is not supported by Snowpark Connect.

```python
from pyspark.sql import SparkSession
from pyspark.sql.SparkSession import NotSupportedElement

spark = SparkSession.builder.appName("Example").getOrCreate()
# EWI SPRKCNTPY3500: The element 'NotSupportedElement' of the library Spark Session is not supported for Snowpark Connect
new_spark = spark.NotSupportedElement()
```

### Recommended fix

* The key is to replace `SparkSession` patterns with equivalent Snowpark `Session` operations while leveraging Snowflake’s unique capabilities like warehouses, stages, and native SQL functions.

  ```python
  # PySpark SparkSession
  from pyspark.sql import SparkSession
  spark = SparkSession.builder.appName("MyApp").getOrCreate()

  # Snowpark Session
  from snowflake.snowpark import Session
  session = Session.builder.configs({
      "account": "your_account",
      "user": "your_user",
      "password": "your_password",
      "role": "your_role",
      "warehouse": "your_warehouse",
      "database": "your_database",
      "schema": "your_schema"
  }).create()
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action will depend on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is some kind of workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY3501

The `AppName` method of `pyspark.sql.session.SparkSession.Builder` has been replaced with the `SetName` function to provide equivalent functionality in Snowpark Connect when creating a session.

**Message** The `AppName` method of `pyspark.sql.session.SparkSession.Builder` has been replaced with the `SetName` function to provide equivalent functionality in Snowpark Connect.

**Category** Warning.

### Description

This issue occurs when the SMA detects the use of `AppName` Spark function while creating a `SparkSession` instance. The tool replaces this function with Snowpark initialization statements to achieve equivalent functionality.

### Scenario

#### Input

Below is an example of a Python `AppName` Spark function that will be replaced by the `SetAppName` Snowpark Connect function, and therefore it will add this EWI.

```python
spark = (
  SparkSession
    .builder
    .appName("MyApp")
    .getOrCreate()
)
```

#### Output

The SMA adds the EWI `SPRKCNTPY3501` to the output code to let you know that this element has been transformed.

```python
conf = SparkConf()
conf.setAppName("MyApp")
#EWI: SPRKCNTPY3501 => The AppName method of pyspark.sql.session.SparkSession.Builder has been replaced with the SetAppName function to provide equivalent functionality in Snowpark Connect

spark = ( snowpark_connect.server.init_spark_session(conf = conf)
)
```

### Recommended fix

Review the output code and ensure that the Snowpark Connect session is configured correctly with the desired application name.

### Additional recommendations

* Please review the Snowpark Connect documentation to understand how to configure and use Snowpark Connect sessions effectively.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). If you have a contract for support with Snowflake, reach out to your sales engineer, and they can direct your support needs.

## SPRKCNTPY3502

The `Master` method of `pyspark.sql.session.SparkSession.Builder` has been replaced with the `SetName` function to provide equivalent functionality in Snowpark Connect when creating a session.

**Message** The `Master` method of `pyspark.sql.session.SparkSession.Builder` has been replaced with the `SetName` function to provide equivalent functionality in Snowpark Connect.

**Category** Warning.

### Description

This issue occurs when the SMA detects the use of `Master` Spark function while creating a `SparkSession` instance. The tool replaces this function with Snowpark initialization statements to achieve equivalent functionality.

### Scenario

#### Input

Below is an example of a Python `Master` Spark function that will be replaced by the `SetAppName` Snowpark Connect function, and therefore it will add this EWI.

```python
spark = (
  SparkSession
    .builder
    .master("local[1]")
    .getOrCreate()
)
```

#### Output

The SMA adds the EWI `SPRKCNTPY3502` to the output code to let you know that this element has been transformed.

```python
conf = SparkConf()
conf.setMaster("local[1]")
#EWI: SPRKCNTPY3502 => The Master method of pyspark.sql.session.SparkSession.Builder has been replaced with the SetMaster function to provide equivalent functionality in Snowpark Connect

spark = ( snowpark_connect.server.init_spark_session(conf = conf)
)
```

### Recommended fix

Review the output code and ensure that the Snowpark Connect session is configured correctly with the desired master setting.

### Additional recommendations

* Please review the Snowpark Connect documentation to understand how to configure and use Snowpark Connect sessions effectively.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com). If you have a contract for support with Snowflake, reach out to your sales engineer, and they can direct your support needs.

## SPRKCNTPY4000

SparkContext element is not supported by Snowpark Connect.

**Message** The element **<element full name>** of the library `SparkContext` is not supported by Snowpark Connect.

**Category** Conversion error

### Description

This issue appears when the SMA detects an usage of a Python `SparkContext` element that is not supported by Snowpark Connect and does not have its own specific error code associated with it. This is a generic error code used by the SMA for unsupported `SparkContext` elements.

### Scenario

#### Input

Below is an example of an usage of a `SparkContext` element that triggers this EWI:

```python
from pyspark import SparkContext

sc = SparkContext()
sc.not_supported_element()
```

#### Output

The SMA adds the EWI `SPRKCNTPY4000` indicating that the `SparkContext` element is not supported by Snowpark Connect.

```python
from pyspark import SparkContext

sc = SparkContext()
#EWI: SPRKCNTPY4000 => The element 'pyspark.context.SparkContext.not_supported_element' of the library SparkContext is not supported by Snowpark Connect
sc.not_supported_element()
```

### Recommended fix

Snowpark Connect uses a DataFrame-based architecture and doesn’t support `SparkContext` or RDD operations. As a workaround, you could refactor your code to use Snowpark Connect `Session` and `DataFrame` APIs instead.

### Additional recommendations

* Consult the [Snowpark Connect documentation](../../../../../developer-guide/snowpark-connect/snowpark-connect-overview.md) for available alternatives to your specific use case.
* Note that some SparkContext functionality has no direct equivalent and may require application redesign.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTPY4001

**Message** `SparkContext` instantiation has been converted to a Snowpark Connect session.

**Category** Warning

### Description

This issue appears when the SMA detects [pyspark.context.SparkContext](https://spark.apache.org/docs/latest/api/python/reference/api/pyspark.SparkContext.html) constructor calls in your code. The SMA automatically transforms these instantiations into equivalent Snowpark Connect session calls, enabling your Spark applications to run on Snowflake’s infrastructure.

The transformation process involves the following steps:

* Replacing `SparkContext` instantiation with `snowpark_connect.server.init_spark_session()`
* Preserving any existing `SparkConf` configuration parameters

> **Important**: Before running the converted code, you **must** configure your connection details in a `connections.toml` or `config.toml` file. This configuration file should contain your Snowflake account credentials, warehouse information, and other connection parameters required for Snowpark Connect to establish a connection to your Snowflake account.
>
> For comprehensive setup instructions, please refer to the [official Snowpark Connect documentation](../../../../../developer-guide/snowpark-connect/snowpark-connect-workloads-jupyter.md).

### Scenarios

#### Scenario 1

##### Input code

`SparkContext` instantiated with default parameters:

```python
sc = SparkContext()
```

##### Output code

The SMA sets the environment variable, starts the Snowpark Connect session, and retrieves the session without additional configuration:

```python
#EWI: SPRKCNTPY4001 => SparkContext instantiation has been converted to a Snowpark Connect session
sc = snowpark_connect.server.init_spark_session()
```

#### Scenario 2

##### Input code

SparkContext instantiated with master and `appName` parameters:

```python
sc = SparkContext(master="local[*]", appName="MyApp")
# or
sc = SparkContext("local[*]", "MyApp")
```

##### Output code

The SMA sets the environment variable, starts the Snowpark Connect session, and passes the parameters via a `SparkConf` object:

```python
conf = SparkConf()
conf.setAppName("MyApp")
conf.setMaster("local[*]")
#EWI: SPRKCNTPY4001 => SparkContext instantiation has been converted to a Snowpark Connect session
sc = snowpark_connect.server.init_spark_session(conf = conf)
```

#### Scenario 3

##### Input code

`SparkContext` instantiated using an existing `SparkConf` object.

```python
my_conf = SparkConf()
sc = SparkContext(conf=my_conf)
```

##### Output code

The SMA preserves the existing `SparkConf` object and passes it directly to the `snowpark_connect.server.init_spark_session()` method:

```python
my_conf = SparkConf()
#EWI: SPRKCNTPY4001 => SparkContext instantiation has been converted to a Snowpark Connect session
sc = snowpark_connect.server.init_spark_session(conf = my_conf)
```

### Additional Recommendations

* While the SMA preserves your `SparkConf` settings, not all Spark configurations may be supported in Snowpark Connect. Review your configurations to ensure compatibility.
* Ensure that downstream operations using the `SparkContext` object are compatible with Snowpark Connect, as some Spark-specific functionalities may not have direct equivalents.
* For more support, you can email us at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---
title: Snowpark Migration Accelerator: Snowpark Connect Issue Codes for Scala
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/issue-codes-by-source/spark-scala/snowpark-connect-codes-scala.md
section: Migrations
---

# Snowpark Migration Accelerator: Snowpark Connect Issue Codes for Scala

## SPRKCNTSCL1000

**Message** The element < **element** > is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool detects the usage of an element that is not supported in Snowpark Connect and does not have its own error code associated with it. This is the generic error code used by the SMA for an unsupported element.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark` library.

```scala
import org.apache.spark.NotSupportedElement
import org.apache.spark.rdd.RDD
import org.apache.spark.{SparkConf, SparkContext}

val conf = new SparkConf().setAppName("GraphXExample").setMaster("local")
val sc = new SparkContext(conf)
val vertices: RDD[(VertexId, String)] = sc.parallelize(Seq((1L, "A"), (2L, "B")))
val edges: RDD[Edge[String]] = sc.parallelize(Seq(Edge(1L, 2L, "edge")))
val graph = NotSupportedElement(vertices, edges)
```

#### Output

The tool adds the comment to the statement pointing to the unsupported element.

```scala
import org.apache.spark.NotSupportedElement
import org.apache.spark.rdd.RDD
import org.apache.spark.{SparkConf, SparkContext}

val conf = new SparkConf().setAppName("GraphXExample").setMaster("local")
val sc = new SparkContext(conf)
val vertices: RDD[(VertexId, String)] = sc.parallelize(Seq((1L, "A"), (2L, "B")))
val edges: RDD[Edge[String]] = sc.parallelize(Seq(Edge(1L, 2L, "edge")))
// EWI SPRKCNTSCL1000: The element 'NotSupportedElement' is not supported for Snowpark Connect
val graph = NotSupportedElement(vertices, edges)
```

### Recommended fix

Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTSCL1500

**Message** The element < **element** > of the library RDD is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of an RDD element is not supported in Snowpark Connect and does not have its own error code associated with it. This is the generic error code used by the SMA for RDD unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark.rdd` library.

```scala
import org.apache.spark.rdd.RDD

val rdd: RDD[Int] = ???
rdd.NotSupportedElement()
```

#### Output

The SMA adds the EWI `SPRKCNTSCL1500` to the output code to let you know that this RDD element is not supported by Snowpark Connect.

```scala
import snowflake.snowpark.snowpark_connect
import org.apache.spark.rdd.RDD

val rdd: RDD[Int] = ???
/*EWI: SPRKCNTSCL1500 => The element 'org.apache.spark.rdd.RDD.NotSupportedElement' of the library RDD is not supported for Snowpark Connect*/
rdd.NotSupportedElement()
```

### Recommended fix

* Convert RDD operations to DataFrame operations.
* The key recommendation is to completely abandon RDD patterns and redesign your data processing logic using Snowpark’s DataFrame API and SQL capabilities, leveraging Snowflake’s distributed computing architecture rather than trying to replicate RDD functionality.
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTSCL2000

**Message** The element < **element** > of the library Streaming is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of a Streaming element is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for Streaming unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark.streaming` library.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, NotSupportedElement}

val conf = new SparkConf().setAppName("NetworkWordCount").setMaster("local[2]")
val ssc = new NotSupportedElement(conf, Seconds(1))
```

#### Output

The SMA adds the EWI `SPRKCNTSCL2000` to the output code to let you know that this Streaming element is not supported by Snowpark Connect.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, NotSupportedElement}

val conf = new SparkConf().setAppName("NetworkWordCount").setMaster("local[2]")
// EWI SPRKCNTSCL2000: The element 'NotSupportedElement' of the library Streaming is not supported for Snowpark Connect
val ssc = new NotSupportedElement(conf, Seconds(1))
```

### Recommended fix

* The key recommendation is to completely redesign streaming architecture to use external streaming platforms for real-time processing combined with Snowpark Connect for analytical batch processing, rather than trying to replicate Spark Streaming functionality.
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTSCL2500

**Message** The element < **element** > of the library ML is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of an ML element is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark.ml` library.

```scala
import org.apache.spark.ml.feature.NotSupportedElement
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder.appName("Example").getOrCreate()
val data = spark.read.format("libsvm").load("data.txt")
val scaler = new NotSupportedElement().setInputCol("features").setOutputCol("scaledFeatures")
```

#### Output

The SMA adds the EWI `SPRKCNTSCL2500` to the output code to let you know that this ML element is not supported by Snowpark Connect.

```scala
import org.apache.spark.ml.feature.NotSupportedElement
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder.appName("Example").getOrCreate()
val data = spark.read.format("libsvm").load("data.txt")
// EWI SPRKCNTSCL2500: The element 'NotSupportedElement' of the library ML is not supported for Snowpark Connect
val scaler = new NotSupportedElement().setInputCol("features").setOutputCol("scaledFeatures")
```

### Recommended fix

* Use the Snowpark ML library.

  ```scala
  // Spark ML Pipeline (NOT supported)
  import org.apache.spark.ml.Pipeline
  import org.apache.spark.ml.classification.LogisticRegression
  import org.apache.spark.ml.feature.{HashingTF, Tokenizer}

  // Snowpark ML equivalent
  import com.snowflake.snowpark.ml.modeling.linear_model.LogisticRegression
  import com.snowflake.snowpark.ml.preprocessing.StandardScaler

  val lr = new LogisticRegression()
      .setInputCols(Array("feature1", "feature2"))
      .setLabelCols(Array("label"))
  val model = lr.fit(trainingData)
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTSCL3000

**Message** The element < **element** > of the library MLLIB is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines the the usage instance of an MLLIB element is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark.mllib` library.

```scala
import org.apache.spark.mllib.recommendation.NotSupportedElement
import org.apache.spark.mllib.recommendation.Rating
import org.apache.spark.{SparkConf, SparkContext}

val conf = new SparkConf().setAppName("ALSExample").setMaster("local")
val sc = new SparkContext(conf)
val data = sc.textFile("data.txt")
val ratings = data.map(_.split(',') match { case Array(user, item, rate) =>
  Rating(user.toInt, item.toInt, rate.toDouble)
})
val model = NotSupportedElement.train(ratings, 10, 10, 0.01)
```

#### Output

The SMA adds the EWI `SPRKCNTSCL3000` to the output code to let you know that this MLLIB element is not supported by Snowpark Connect.

```scala
import org.apache.spark.mllib.recommendation.NotSupportedElement
import org.apache.spark.mllib.recommendation.Rating
import org.apache.spark.{SparkConf, SparkContext}

val conf = new SparkConf().setAppName("ALSExample").setMaster("local")
val sc = new SparkContext(conf)
val data = sc.textFile("data.txt")
val ratings = data.map(_.split(',') match { case Array(user, item, rate) =>
  Rating(user.toInt, item.toInt, rate.toDouble)
})
// EWI SPRKCNTSCL3000: The element 'NotSupportedElement' of the library MLLIB is not supported for Snowpark Connect
val model = NotSupportedElement.train(ratings, 10, 10, 0.01)
```

### Recommended fix

* The key recommendation is to completely abandon MLlib patterns and redesign machine learning workflows using Snowpark ML, SQL-based ML functions, or hybrid approaches that leverage Snowflake distributed architecture rather than trying to replicate RDD-based MLlib functionality.

  ```scala
  // MLlib approach (NOT supported)
  import org.apache.spark.mllib.classification.LogisticRegressionWithSGD
  import org.apache.spark.mllib.regression.LabeledPoint

  val training = rdd.map { line =>
      val parts = line.split(',')
      LabeledPoint(parts(0).toDouble, Vectors.dense(parts(1).split(' ').map(_.toDouble)))
  }
  val model = LogisticRegressionWithSGD.train(training, numIterations)

  // Snowpark ML equivalent
  import com.snowflake.snowpark.ml.modeling.linear_model.LogisticRegression

  val model = new LogisticRegression()
      .setInputCols(Array("feature1", "feature2", "feature3"))
      .setLabelCols(Array("target"))
  val trainedModel = model.fit(trainingDataFrame)
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

## SPRKCNTSCL3500

**Message** The element < **element** > of the library Spark Session is not supported for Snowpark Connect.

**Category** Conversion Error

### Description

This issue appears when the tool determines that the usage instance of a Spark Session element is not supported in Snowpark Connect, and does not have its own error code associated with it. This is the generic error code used by the SMA for ML unsupported elements.

### Scenario

#### Input

The tool found an unidentified element of the `org.apache.spark.sql.SparkSession` library.

```scala
import org.apache.spark.sql.SparkSession.NotSupportedElement

val spark = SparkSession.builder.appName("Example").getOrCreate()
SparkSession.NotSupportedElement()
```

#### Output

The SMA adds the EWI `SPRKCNTSCL3500` to the output code to let you know that this Spark Session element is not supported by Snowpark Connect.

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder.appName("Example").getOrCreate()
// EWI SPRKCNTSCL3500: The element 'NotSupportedElement' of the library Spark Session is not supported for Snowpark Connect
SparkSession.NotSupportedElement()
```

### Recommended fix

* The key is to replace `SparkSession` patterns with equivalent Snowpark `Session` operations while leveraging Snowflake’s unique capabilities like warehouses, stages, and native SQL functions.

  ```scala
  // Spark SparkSession
  import org.apache.spark.sql.SparkSession

  val spark = SparkSession.builder()
      .appName("MySparkApp")
      .master("local[*]")
      .getOrCreate()

  // Snowpark Session
  import com.snowflake.snowpark.Session

  val session = Session.builder
      .configs(Map(
          "URL" -> "https://account.snowflakecomputing.com",
          "USER" -> "username",
          "PASSWORD" -> "password",
          "ROLE" -> "role_name",
          "WAREHOUSE" -> "warehouse_name",
          "DB" -> "database_name",
          "SCHEMA" -> "schema_name"
      ))
      .create
  ```
* Since this is a generic error code that applies to a range of unsupported functions, there is no single and specific fix. The appropriate action depends on the particular element in use.
* Please note that even though the element is not supported, it does not necessarily mean that a solution or workaround cannot be found. It means only that the SMA itself cannot find the solution.

### Additional recommendations

* Even though the option or the element on the message is not supported, this does not mean that a solution cannot be found. It means only that the tool itself cannot find the solution.
* If you believe that Snowpark Connect already supports this element or that there is a workaround, please report that you encountered a conversion error on that particular element using [the Report an Issue option](../../../user-guide/project-overview/configuration-and-settings.md) in the SMA and include any additional information that you think may be helpful.
* For more support, email support at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com) or post an issue [in the SMA](../../../user-guide/project-overview/configuration-and-settings.md).

---
title: Snowpark Migration Accelerator: Snowpark-Checkpoints Execution Guide
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/snowpark-checkpoints-execution-guide/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Snowpark-Checkpoints Execution Guide

The SMA generates a file named `checkpoints.json`, which is placed in both the input and output folders. The file in the input folder is used for data collection, while the one in the output folder is employed for the validation process. These two files are essential for maintaining a smooth and accurate workflow throughout the process.

---
title: Snowpark Migration Accelerator: Spark SQL DDL
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/spark-sql/spark-sql-ddl/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Spark SQL DDL

While DDL (Data Definition Language) may seem straightforward at first glance, each database platform has its own specific parameters and syntax. These platform-specific differences can create significant challenges when migrating between different database systems.

Let’s examine some basic DDL (Data Definition Language) statements that we’ll be working with:

* [CREATE TABLE](create-table/README.md)

Let’s examine each of these components in detail.

---
title: Snowpark Migration Accelerator: Troubleshooting the Output Code
source: https://docs.snowflake.com/en/migrations/sma-docs/issue-analysis/troubleshooting-the-output-code/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Troubleshooting the Output Code

Resolving Common Issues with Code Generated by the Snowpark Migration Accelerator (SMA)

Here are some helpful troubleshooting tips while this section is being developed.

## For Spark Scala

| Problem observed | Possible reason | Solution |
| --- | --- | --- |
| Spark and snowpark import statements do not compile on output scala code | Snowpark and Snowpark extensions library references were not added to the project configuration file | Add snowpark and snowpark extensions references to the project configuration file |

## For PySpark

| Problem observed | Possible reason | Solution |
| --- | --- | --- |
| Snowpark import statement do not compile on output python code or snowpark references do not compile | Snowpark and Snowpark extensions library references were not installed |  |

---
title: Snowpark Migration Accelerator: Understanding the Assessment Summary
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/assessment/understanding-the-assessment-summary.md
section: Migrations
---

# Snowpark Migration Accelerator: Understanding the Assessment Summary

After running an assessment, you can view the initial results and summary in the Assessment Summary Report.

Keep in mind that this report summarizes the assessment output results for the best conversion option for your source code.

The Assessment Results section of the application contains several components, which are explained in detail below.

## Analysis Summary

Displays a high-level summary of the codebase analyzed for the best conversion option. The header shows the overall readiness score as a percentage badge. Below, three KPI cards present: the total number of input files broken down by type (Python, Jupyter Notebook, Other), the total line count across all files, and the number of files that contain Spark API references.

## File compatibility breakdown

Categorizes all analyzed files into three groups: fully compatible (no changes needed), files that require changes (partial support), and files with unsupported APIs (significant rework needed). Each category shows its count with a severity icon. A stacked bar chart below visualizes the proportional distribution as percentages, with a color-coded legend.

## Data distribution

Shows how the codebase reads and writes data, split into two stacked bar charts: Data sources (reads) and Data targets (writes). The X-axis represents the platform or location (e.g., Local Directory, S3, JDBC), and the Y-axis stacks the file formats used (e.g., CSV, Json, Undefined). Each chart has its own independent legend showing only the formats present in its data. An informational banner alerts when external data connections are detected, prompting the user to validate that those sources will be accessible from the migrated codebase.

## Code dependencies

Displays a donut chart summarizing all libraries and packages referenced in the codebase, grouped into four categories: Supported third-party libraries (known to work in Snowflake), Unsupported third-party libraries (third-party but not confirmed as supported), Internal (project-internal modules), and Unknown (libraries the SMA could not identify). The center of the chart shows the total dependency count. A warning banner appears when unknown or unsupported libraries are detected, advising the user to review their code dependencies to ensure they are available in Snowflake.

## Issues by category

Groups all issues found during the assessment into human-readable categories (e.g., Unsupported API, Redundant Elements, SparkContext Usage). Each row shows the category name, the number of affected files, and a key issue description summarizing the most representative problem in that group. This section requires an active Snowflake connection, as it uses Snowflake Cortex AI to analyze the raw issue codes and generate meaningful category names and descriptions.

## Execution summary

Displays metadata about the assessment run. Includes the analysis coverage percentage and any parsing warnings at the top. Below, it shows the customer information (company, email, project name, and project ID), the engine and library versions used during the assessment, and the input/output folder paths with clickable links to open them directly.

## Next Steps

The application provides several additional features, which can be accessed through the interface shown in the image below.

* **Primary option** - The recommended conversion option for your source code.
* **Secondary option** - An alternative conversion option for your source code.
* **View Reports** - Opens the folder containing assessment output reports. These include the detailed assessment report, Spark reference inventory, and other analyses of your source codebase. Each report type is explained in detail in this documentation.

---
title: Snowpark Migration Accelerator: Understanding the Conversion Assessment and Reporting
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/snowpark-api-conversion/understanding-the-conversion-assessment-and-reporting.md
section: Migrations
---

# Snowpark Migration Accelerator: Understanding the Conversion Assessment and Reporting

When you run a conversion using the Snowpark Migration Accelerator (SMA), it generates detailed information similar to what you get in assessment mode. The process is identical in both cases (as explained in [How the Conversion Works](how-the-conversion-works.md)), but during conversion, the tool also produces the converted code as part of its output.

To ensure accurate code conversion, SMA generates the same information twice because source code can be modified between the initial assessment and the final conversion. This approach provides the most reliable way to verify that the conversion matches the current state of your codebase.

> **Note:**
>
> The assessment and conversion modes produce identical log files, reports, and assessment summary screens. The only difference between these modes is the converted code that is generated when running in conversion mode.

## Conversion Summary

The conversion results will be displayed in the Results panel of the application.

This section provides the same information as the Assessment Summary. For more details about interpreting the conversion assessment results, refer to the [Understanding the Assessment Summary](../assessment/understanding-the-assessment-summary.md) section.

## Conversion Output Reports

To access the reports generated by the Snowpark Migration Accelerator (SMA), click the “View Reports” button on the Results page.

The tool creates a directory containing all reports. These reports are identical to those generated during the Assessment phase. For detailed information about each report, refer to the [Output Reports](../scos-conversion/output-reports/README.md) section in the documentation.

## Conversion Logs

The logs are generated automatically during the assessment process, similar to how summaries and reports are created. To access these logs, click the “View Log Folder” option.

To view detailed information about available logs, refer to the [Output Logs](../scos-conversion/output-logs.md) section in the documentation.

---

Let’s explore where the converted code will be stored after the conversion process is complete.

To see the converted code, click **View Output** on the Results page.

---
title: Snowpark Migration Accelerator: Understanding the Conversion Summary
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/scos-conversion/understanding-the-conversion-summary.md
section: Migrations
---

# Snowpark Migration Accelerator: Understanding the Conversion Summary

After running a conversion, you can view the initial results and summary in the Conversion Summary Report.

Keep in mind that this report summarizes the information from the inventory files created in the [Output Reports](output-reports/README.md) folder during the SMA execution. For a comprehensive analysis, review the [Detailed Report](output-reports/README.md) in the output directory.

The Conversion Results section of the application contains several components, which are explained in detail below.

## Standard Conversion Summary

The summary will appear as shown below:

In the top-right corner of the report, the **Execution date** indicates when the analysis was run.

### Snowpark Connect Readiness Score

The Snowpark Connect Readiness Score will look something like this:

1. **Readiness Score** - It will show you the readiness score you obtained. The Snowpark Connect readiness score indicates the proportion of Spark API references that are supported by Snowpark Connect. This score is calculated by dividing the number of supported Spark API references by the total Spark API references. You can learn more about this score in the [Snowpark Connect Readiness Score](readiness-scores.md) section.
2. **Score Explanation** - An explanation of what the Snowpark Connect Readiness score is and how to interpret it.
3. **Next Steps** - Depending on the readiness score obtained, the SMA will advise you on what actions you should take before proceeding to the next step.
4. **Score Breakdown** - A detailed explanation of how the Snowpark Connect Readiness Score was calculated. In this case, it will show you the number of Spark API references supported by Snowpark Connect divided by the total number of Spark API references.

**Supported Usages** refers to the number of Spark API references in a workload that are supported by Snowpark Connect. In contrast, **identified usages** represents the total count of Spark API references found within that workload.

### Spark API Usages

> **Danger:**
>
> The **Spark API Usages** section has been deprecated since version **2.0.2**. You can now find:
>
> * A summary of Spark API usage in [the Detailed Report](output-reports/curated-reports.md)
> * A complete list of all Spark API usage instances in [the Spark API Usages Inventory](output-reports/sma-inventories.md)

The report contains three main sections displayed as tabs:

1. Overall Usage Classification
2. Spark API Usage Categorization
3. Spark API Usages By Status

We will examine each section in detail below.

#### Overall Usage Classification

This tab displays a table containing three rows that show:

* Supported operations
* Unsupported operations
* Total usage statistics

Additional details are provided in the following section:

1. **Usages Count** - The total number of times Spark API functions are referenced in your code. Each reference is classified as either supported or unsupported, with totals shown at the bottom.
2. **Files with at least 1 usage** - The number of files that contain at least one Spark API reference. If this number is less than your total file count, it means some files don’t use Spark API at all.
3. **Percentage of All Files** - Shows what portion of your files use Spark API. This is calculated by dividing the number of files with Spark API usage by the total number of code files, expressed as a percentage.

#### Spark API Usage Categorization

This tab displays the different types of Spark references detected in your codebase. It shows the overall Readiness Score (which is the same score shown at the top of the page) and provides a detailed breakdown of this score by category.

You can find all available categorizations in the [Spark Reference Categories](spark-reference-categories.md) section.

#### Spark API Usages By Status

The final tab displays a categorical breakdown organized by mapping status.

The SMA tool uses seven main mapping statuses, which indicate how well Spark code can be converted to Snowpark. For detailed information about these statuses, refer to the [Spark Reference Categories](spark-reference-categories.md) section.

### Import Calls

> **Danger:**
>
> The **Import Calls** section has been removed since version **2.0.2**. You can now find:
>
> * A summary of import statements in [the Detailed Report](output-reports/curated-reports.md)
> * A complete list of all import calls in [the Import Usages Inventory](output-reports/sma-inventories.md)

The “Import Calls” section displays frequently used external library imports found in your codebase. Note that Spark API imports are excluded from this section, as they are covered separately in the “Spark API” section.

This table contains the following information:

The report displays the following information:

1. A table with 5 rows showing:

   * The 3 most frequently imported Python libraries
   * An “Other” row summarizing all remaining packages
   * A “Total” row showing the sum of all imports
2. A “Supported in Snowpark” column indicating whether each library is included in Snowflake’s [list of supported packages in Snowpark](https://repo.anaconda.com/pkgs/snowflake/).
3. An “Import Count” column showing how many times each library was imported across all files.
4. A “File Coverage” column showing the percentage of files that contain at least one import of each library. For example:

   * If ‘sys’ appears 29 times in the import statements but is only used in 28.16% of files, this suggests it’s typically imported once per file where it’s used.
   * The “Other” category might show 56 imports occurring across 100% of files.

For detailed import information per file, refer to the ImportUsagesInventory.csv file in the [Output Reports](output-reports/README.md).

### File Summary

> **Danger:**
>
> The **File Summary** section has been removed since version **2.0.2**. You can now find:
>
> * A summary of files and file types in [the Detailed Report](output-reports/curated-reports.md)
> * A complete list of all files (both analyzed and not analyzed) in [the File Inventory](output-reports/sma-inventories.md)

The summary report contains multiple tables displaying metrics organized by file type and size. These metrics provide insights into the codebase’s volume and help estimate the required effort for the migration project.

The Snowpark Migration Accelerator analyzes all files in your source codebase, including both code and non-code files. You can find detailed information about the scanned files in the [files.csv](output-reports/README.md) report.

The File Summary contains multiple sections. Let’s examine each section in detail.

#### File Type Summary

The File Type Summary displays a list of all file extensions found in your scanned code repository.

The file extensions listed indicate which types of code files SMA can analyze. For each file extension, you will find the following information:

* **Lines of Code** - The total number of executable code lines across all files with this extension. This count excludes comments and empty lines.
* **File Count** - The total number of files found with this extension.
* **Percentage of Total Files** - The percentage that files with this extension represent out of all files in the project.

To analyze your workload, you can easily identify whether it primarily consists of script files (such as Python or R), notebook files (like Jupyter notebooks), or SQL files. This information helps determine the main types of code files in your project.

#### Notebook Sizing by Language

The tool evaluates notebooks in your codebase and assigns them a “t-shirt” size (S, M, L, XL) based on the number of code lines they contain. This sizing helps estimate the complexity and scope of each notebook.

The notebook sizes are categorized according to the main programming language used within each notebook.

#### Notebook Stats By Language

This table displays the total number of code lines and cells in all notebooks, organized by programming language.

These notebooks are organized by the primary programming language used within them.

#### Code File Content

When running SMA, the tab name will change based on your source language:

* For Python source files, the tab will display “Python File Content”
* For Scala source files, the tab will display “Scala File Content”

This row shows how many files contain Spark API references. The “Spark Usages” row displays:

1. The number of files that use Spark APIs
2. What percentage these files represent of the total codebase files analyzed

This metric helps identify what percentage of files do not contain Spark API references. A low percentage suggests that many code files lack Spark dependencies, which could mean the migration effort might be smaller than initially estimated.

#### Code File Sizing

The File Sizing tab name changes based on your source language:

* For Python source files, it displays as “Python File Sizing”
* For Scala source files, it displays as “Scala File Sizing”

The codebase files are categorized using “t-shirt” sizes (S, M, L, XL). Each size has specific criteria described in the “Size” column. The table also shows what percentage of all Python files falls into each size category.

Understanding the file size distribution in your codebase can help assess workload complexity. A high percentage of small files typically suggests simpler, less complex workloads.

### Issues Summary

The Issues Summary provides critical information about potential problems found during code scanning. During conversion, you’ll see a list of EWIs (Errors, Warnings, and Issues) detected in your codebase. For a detailed explanation of these issues, refer to the Issue Analysis section in the documentation.

At the top of the issue summary, you will find a table that provides an overview of all identified issues.

The table contains two rows.

* The “Number of issues” represents the total count of all issue codes found in each category.
* The “Number of unique issues” represents the count of distinct error codes found in each category.

The problems are divided into three main categories:

* **Warnings** indicate potential differences between source and target platforms that may not require immediate action but should be considered during testing. These could include slight variations in behavior for edge cases or notifications about changes in appearance compared to the source platform.
* **Conversion issues** highlight elements that either failed to convert or need additional configuration to work properly in the target platform.
* **Parsing issues** occur when the tool cannot interpret specific code elements. These are critical issues requiring immediate attention, typically caused by non-compiling source code or incorrect code extraction. If you believe your source code is correct but still receive parsing errors, it may be due to an unrecognized pattern in SMA. In such cases, [report an issue](../project-overview/configuration-and-settings.md) and include the problematic source code section.

The table summarizes the total count for each item.

Below this table, you will find a list of unique issue codes and their descriptions.

Each issue code entry provides:

* The unique issue identifier
* A description of the issue
* The number of occurrences
* The severity level (Warning, Conversion Error, or Parsing Error)

You can click any issue code to view detailed documentation that includes:

* A full description of the issue
* Example code
* Recommended solutions

For instance, clicking the first issue code shown above (SPRKPY1002) will take you to its dedicated documentation page.

By default, the table displays only the top 5 issues. To view all issues, click the SHOW ALL ISSUES button located below the table. You can also use the search bar above the table to find specific issues.

Understanding the remaining conversion work is crucial. You can find detailed information about each issue and its location in the issue inventory within the [Reports folder](output-reports/README.md).

### Execution Summary

The execution summary provides a comprehensive overview of the tool’s recent analysis. It includes:

* The code analysis score
* User details
* The unique execution ID
* Version information for both SMA and Snowpark API
* Project folder locations that were specified during [Project Creation](../project-overview/project-setup.md)

### Appendixes

The appendixes contain additional reference information that can help you better understand the output generated by the SMA tool.

This guide contains general reference information about using the Snowpark Migration Accelerator (SMA). While the content may be updated periodically, it focuses on universal SMA usage rather than details about specific codebases.

---

This is what most users will see when they run the Snowpark Migration Accelerator (SMA). If you are using an older version, you might see the Abbreviated Conversion Summary instead, which is shown below.

## Abbreviated Conversion Summary [Deprecated]

If your readiness score is low, your migration summary might appear as follows:

This summary contains the following information:

* **Execution Date**: Shows when your analysis was performed. You can view results from any previous execution for this project.
* **Result**: Indicates if your workload is suitable for migration based on the [readiness score](../../support/glossary.md). The readiness score is a preliminary assessment tool and does not guarantee migration success.
* **Input Folder**: Location of the source files that were analyzed.
* **Output Folder**: Location where analysis reports and converted code files are stored.
* **Total Files**: Number of files analyzed.
* **Execution Time**: Duration of the analysis process.
* **Identified Spark References**: Number of Spark API calls found in your code.
* **Count of Python (or Scala) Files**: Number of source code files in the specified programming language.

---

## Next Steps

The application provides several additional features, which can be accessed through the interface shown in the image below.

* **Retry Conversion** - You can run the conversion again by clicking the **Retry Conversion** button on the Conversion Results page. This is useful when you make changes to the source code and want to see updated results.
* **View Reports** - Opens the folder containing conversion output reports. These include the detailed conversion report, Spark reference inventory, and other analyses of your source codebase. Each report type is explained in detail in this documentation.

The following pages provide detailed information about the reports generated each time the tool runs.

---
title: Snowpark Migration Accelerator: Validation
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/sma-checkpoints-walkthrough/snowpark-checkpoints-execution-guide/validation.md
section: Migrations
---

# Snowpark Migration Accelerator: Validation

To proceed with the validation process, follow the steps outlined below:

1. Copy the `snowpark-checkpoints-output` folder, generated during the collection process, into the validation workload.
2. Open the validation workload in VS Code to begin the validation process.
3. Generate checkpoints using the `checkpoints.json` file.

To generate checkpoints you can do one of the following actions:

* Generate them by accepting the suggesting message:

* Execution “Snowflake: Load All Checkpoints command”

Once all checkpoints are loaded, your files should appear as follows:

4. Run the Python file to execute the checkpoints validation process.

When running a python file that contains validation checkpoints, the validation results are going to be shown in the copied “snowpark-checkpoints-output” folder as “checkpoints_validation_results.json”:

The “checkpoints_validation_results.json” contains the unified results of the collection process

```json
{
    "results": [
        {
            "checkpoint_name": "sample$BBVOC7$df1$1",
            "file": "sample.py",
            "line_of_code": 10,
            "result": "PASS",
            "timestamp": "2025-05-05T15:32:29.248917"
        },
        {
            "checkpoint_name": "sample$BBVOC7$df2$1",
            "file": "sample.py",
            "line_of_code": 12,
            "result": "PASS",
            "timestamp": "2025-05-05T15:32:31.137536"
        },
        {
            "checkpoint_name": "sample$BBVOC7$df3$1",
            "file": "sample.py",
            "line_of_code": 17,
            "result": "PASS",
            "timestamp": "2025-05-05T15:32:33.133002"
        }
    ]
}
```

The validation results, as seen above, will contain the comparison result between the PySpark and Snowpark DataFrames.

---
title: Snowpark Migration Accelerator: Walkthrough Setup
source: https://docs.snowflake.com/en/migrations/sma-docs/use-cases/assessment-walkthrough/walkthrough-setup/README.md
section: Migrations
---

# Snowpark Migration Accelerator: Walkthrough Setup

This guide offers practical experience with the Snowpark Migration Accelerator (SMA). Through real-world examples, you will learn how to evaluate code and interpret assessment results, giving you a clear understanding of the tool’s capabilities.

## Materials

To complete this tutorial, you will need the following:

* A computer that has Snowpark Migration Accelerator (SMA) software installed
* Access to the sample code files on the same computer

To begin, you will need two items on your computer:

1. The Snowpark Migration Accelerator (SMA) tool
2. Code samples

Let’s walk through how to obtain these essential resources.

### SMA Application

The Snowpark Migration Accelerator (SMA) helps developers convert their PySpark and Spark Scala applications to run on Snowflake. It automatically detects Spark API calls in your Python or Scala code and transforms them into equivalent Snowpark API calls. This guide will demonstrate basic SMA functionality by analyzing sample Spark code and showing how it assists with migration projects.

During the initial assessment phase, Snowpark Migration Accelerator (SMA) examines your source code and builds a detailed model that captures all the functionality in your code. Based on this analysis, SMA creates several reports, including a detailed assessment report that we’ll review in this walkthrough. These reports help you understand how ready your code is for migration to Snowpark and estimate the effort needed for the transition. We’ll look at these findings in more detail as we continue through this lab.

#### Download and Installation

To begin an assessment with the Snowpark Migration Accelerator (SMA), you only need to complete the installation process. While Snowflake provides optional [helpful training on using the SMA](https://learn.snowflake.com/en/courses/spark-to-snowpark-sma/), you can proceed without it. No special access codes are needed. Simply:

1. Visit our [Download and Access](../../../general/getting-started/download-and-access.md) section
2. [Download the installer](https://www.snowflake.com/en/data-cloud/snowpark/migration-accelerator/)
3. Follow our [Installation instructions](../../../general/getting-started/installation/README.md) to set up the application on your computer

### Sample Codebase

This guide uses Python code examples to demonstrate the migration process. We have selected two publicly available sample codebases from third-party Git repositories as unbiased, real-world examples. You can access these codebases at:

* PySpark Data Engineering Examples: <https://github.com/spark-examples/pyspark-examples>
* Apache Spark Machine Learning Examples: <https://github.com/apache/spark/tree/master/examples/src/main/python>

To analyze codebases using the Snowpark Migration Accelerator (SMA), follow these steps:

1. Download the codebases as zip files from GitHub. You can find instructions on how to do this in the [GitHub documentation](https://docs.github.com/en/repositories/working-with-files/using-files/downloading-source-code-archives).
2. Create separate folders on your computer for each codebase.
3. Extract each zip file into its designated folder, as shown in the image below:

These sample codebases demonstrate how SMA evaluates Spark API references to calculate the [Spark API Readiness Score](../../../user-guide/assessment/readiness-scores.md). Let’s look at two scenarios:

1. A codebase that received a high score, indicating it is highly compatible with Snowpark and ready for migration
2. A codebase that received a low score, indicating it requires additional review and potential modifications before migration

While the readiness score provides valuable insight, it should not be the only factor considered when planning a migration. A comprehensive evaluation of all aspects is necessary for both high and low scoring assessments to ensure a successful migration.

After unzipping the directories, SMA will analyze only files that use supported code formats and notebook formats. These files are checked for references to Spark API and other Third Party APIs. To see which file types are supported, please check the list [here](../../../user-guide/before-using-the-sma/supported-filetypes.md).

Throughout the rest of this walkthrough, we will analyze how these two codebases execute.

## Support

For help with installation or to get access to the code, please email [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---

After downloading and unzipping the codebases into separate directories, you can either:

* Move on to [running the tool](../running-the-tool.md)
* Review [the code preparation notes](notes-on-code-preparation.md)

---
title: Snowpark Migration Accelerator: Windows Installation
source: https://docs.snowflake.com/en/migrations/sma-docs/general/getting-started/installation/windows-installation.md
section: Migrations
---

# Snowpark Migration Accelerator: Windows Installation

You can install the Snowpark Migration Accelerator (SMA) on Windows in two ways:

* As a desktop application with a graphical interface
* As a Command Line Interface (CLI)

Instructions for both installation methods are provided below.

If you need the application or CLI files, please check the [Downloading and License Access page](../download-and-access.md) for detailed instructions on how to obtain them.

## Installing the SMA Application on Windows

Follow these steps to install the Snowpark Migration Accelerator (SMA) application on Windows:

1. **Run the Installer:** Double-click the downloaded installer file (with .exe extension).
2. **Follow the Installation Wizard:** A setup wizard will appear. Simply follow the prompts to install the software.
3. **Open the SMA:** After installation completes, launch the Snowpark Migration Accelerator (SMA) from your Windows Start menu.

After launching the application, you have two options:

* Create a new assessment or conversion project
* Open a project you have previously created

Great! Now that you’ve completed the installation, you can start using the Snowpark Migration Accelerator (SMA) application. For detailed instructions on how to use SMA, please refer to the [SMA User Guide](../../../user-guide/overview.md).

**Important Note:**

When a newer version of SMA is available, you will see an “UPDATE NOW” button in the top right corner of your screen. Simply click this button to download and install the latest version.

If you experience any problems while installing the software, please email our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

## Installing the SMA CLI on Windows

Here’s how to install the Snowpark Migration Accelerator (SMA) Command Line Interface on Windows:

1. **Download the .zip File:** Download the SMA CLI `.zip` file from the [Downloading and License Access page](../download-and-access.md).
2. **Extract the Files:** Unzip the downloaded file to a location on your computer (for example, `C:sma-cli`).
3. **Copy the Orchestrator Path:** Find and copy the full path to the `orchestrator` folder in the extracted files.
4. **Open Environment Variables:** Type “environment variables for your account” in the Windows search bar and select **Edit environment variables for your account**.
5. **Edit the Path Variable:** Find and select the “Path” variable in the “User variables” section, then click **Edit**.
6. **Add the Orchestrator Path:** Click **New** and paste the copied orchestrator path.
7. **Save Changes:** Click **OK** twice to save and close both windows.
8. **Open a Command Prompt:** Launch a new command prompt window.
9. **Verify Installation:** Run `sma --version` to confirm the installation was successful.

After installing either the application or Command Line Interface (CLI), you can begin using the Snowpark Migration Accelerator (SMA). For detailed instructions, please refer to the [SMA User Guide](../../../user-guide/overview.md).

If you experience any problems while installing the software, please email our support team at [sma-support@snowflake.com](mailto:sma-support%40snowflake.com).

---
title: Spark to Snowpark Connect with the Cortex Code migration skill
source: https://docs.snowflake.com/en/migrations/sma-docs/migrating-with-cortex-code/spark-to-snowpark-connect.md
section: Migrations
---

# Spark to Snowpark Connect with the Cortex Code migration skill

You can migrate PySpark code to something compatible with Snowpark Connect by using the snowpark-connect migration
skill. You can learn more by reading the `skill.md` file associated with the skill. This skill includes two primary
components:

* **Conversion**: The skill reads source code files, assesses their compatibility with Snowpark Connect, and makes
  changes to ensure compatibility.
* **Validation**: The skill then asks if you want to validate the converted code. The validation skill takes a sampling
  of the files, generates some synthetic data, and runs them in Snowflake. Consider this a “smoke test” to ensure
  the code runs. This isn’t data validation.

## Accessing and invoking the skill with the Cortex Code CLI

The **snowpark-connect** Cortex Code skill is bundled with the Cortex Code CLI. You can
[check the skill list](https://docs.snowflake.com/en/user-guide/cortex-code/extensibility#using-skills)
to validate that the skill is available.

Using the skill is straightforward. You can either ask specific questions about migrating a directory to Snowpark or
invoke the skill directly.

### Invoking the skill directly

Specify the name of the skill followed by the directory or file where you want to apply the skill:

```text
/snowpark-connect "path/to/your/pyspark_file.py"
```

### Asking questions

You can ask any question related to Snowpark Connect or Spark migration:

* `I want to migrate this spark data pipeline to snowflake: path/to/your/spark_files_directory.`
* `Show me how compatible this file is with snowpark connect: path/to/your/pyspark_file.py.`
* `Convert this set of notebooks to be compatible with snowpark connect: path/to/your/notebooks_directory.`

## Using the skill with the Cortex Code CLI

When you invoke the skill, point it at the source codebase you want to migrate. You can also point it at the output
of an [SMA](../general/getting-started/installation/README.md) run (see SMA issue resolution with the Cortex Code CLI later). The skill needs to do two setup tasks:

* Access a knowledge base that contains many sample code patterns that work well with Snowpark and Snowpark Connect.
* Access the files on your local machine. This might include a series of prompts requesting permissions.

### Conversion

The skill gives you a basic analysis, including:

* The number of files found.
* The number of critical issues found.

Then it generates a file manifest, a migration copy of the code, and converts all the issues it finds. It does this by
referencing a set of code samples in an internal knowledge base that help inform each issue or error that it encounters.
Because this uses Cortex Code, all the issues have a generated solution.

The skill runs a syntax check when it has migrated all of the files, then reports its results. This prompts you to
start the validation step.

### Validation

If you choose to execute the validation, the skill takes a sampling of the files, generates some synthetic data, and
runs them in Snowflake. Consider this a “smoke test” to ensure the code runs. This isn’t data validation. Instead, a
set of synthetic data is generated and passed to a set of tests generated from the converted code.

As part of this, Cortex Code:

1. Sets up a Snowpark Connect session.
2. Sets up a test directory and an entrypoint.
3. Runs the files from the entrypoints.
4. Reports the results and makes recommendations based on the results.

You can accept or reject the recommendations.

Once the validation is complete, you should move on to testing these files with actual data.

## SMA issue resolution with the Cortex Code CLI

You can also use the snowpark-connect skill on the output from an SMA run. Point the skill at the entire directory
you migrated with the SMA, and ask it to make the code compatible with Snowpark Connect.

Whether it’s the SMA output or the input directory, the skill starts processing all the files and resolving issues.

It might ask questions at times and summarizes what it does at the end.

At the end, a prompt suggests you run the validation step.

---
title: SQL magic cell transformation
source: https://docs.snowflake.com/en/migrations/sma-docs/translation-reference/notebooks/databricks/magic-sql.md
section: Migrations
---

# SQL magic cell transformation

This document describes how the Snowpark Migration Accelerator (SMA) handles the transformation of SQL magic cells during notebook migration.

## Magic SQL cell transformation

When the SMA processes a notebook and detects a magic cell that begins with `%sql`, it automatically transforms the cell into a standard Jupyter notebook (`.ipynb`) cell with the appropriate SQL metadata configuration.

### How it works

In Databricks notebooks, SQL code is commonly written using magic commands:

```python
%sql
SELECT * FROM my_table
WHERE status = 'active'
```

During migration, the SMA recognizes this pattern and converts it into a native notebook cell with the cell metadata set to `"sql"`. This ensures that the following occurs:

* The SQL code is properly recognized and executed as SQL in the target environment.
* Syntax highlighting is correctly applied for SQL.
* The notebook maintains its intended execution behavior.

### Before migration (Databricks)

A cell with the `%sql` magic command in the notebook JSON structure:

```json
{
  "cell_type": "code",
  "source": [
    "%sql\n",
    "SELECT COUNT(*) FROM customers"
  ],
  "metadata": {},
  "outputs": []
}
```

### After migration (Snowflake)

The same content is converted to a notebook cell with the language metadata set to `sql`, as shown in the following example. Note that the `%sql` magic command is removed from the source, and the cell metadata now includes `"language": "sql"` to indicate the cell should be executed as SQL.

```json
{
  "cell_type": "code",
  "source": [
    "SELECT COUNT(*) FROM customers"
  ],
  "metadata": {
    "language": "sql"
  },
  "outputs": []
}
```

### Benefits

* **Native SQL support**: The migrated notebook uses native SQL cell types instead of magic commands.
* **Better tooling integration**: SQL cells are recognized by IDEs and notebook environments for enhanced features like auto-completion and validation.
* **Cleaner code**: Removal of magic command prefixes results in cleaner, more portable SQL code.

---
title: SQL Server Commands Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/sqlserver_commands.md
section: Migrations
---

# SQL Server Commands Reference

## Overview

This page provides comprehensive reference documentation for SQL Server-specific commands in the Snowflake Data Validation CLI. For Teradata commands, see [Teradata Commands Reference](teradata_commands.md). For Amazon Redshift commands, see [Redshift Commands Reference](redshift_commands.md). For Snowflake-to-Snowflake commands, see [Snowflake Commands Reference](snowflake_commands.md).

---

## Command Structure

All SQL Server commands follow this consistent structure:

```bash
snowflake-data-validation sqlserver <command> [options]

# Or use the shorter alias
sdv sqlserver <command> [options]
```

Where `<command>` is one of:

* `run-validation` - Run synchronous validation
* `run-async-validation` - Run asynchronous validation
* `generate-validation-scripts` - Generate validation scripts
* `get-configuration-files` - Get configuration templates
* `auto-generated-configuration-file` - Interactive config generation
* `row-partitioning-helper` - Interactive row partitioning configuration
* `column-partitioning-helper` - Interactive column partitioning configuration

---

## Run Synchronous Validation

Validates data between SQL Server and Snowflake in real-time.

### Syntax

```bash
snowflake-data-validation sqlserver run-validation \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file containing validation settings
* **Example:** `--data-validation-config-file ./configs/sqlserver_validation.yaml`

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Basic validation
sdv sqlserver run-validation \
  --data-validation-config-file ./configs/sqlserver_validation.yaml

# Validation with debug logging
sdv sqlserver run-validation \
  --data-validation-config-file ./configs/sqlserver_validation.yaml \
  --log-level DEBUG

# Using full command name
snowflake-data-validation sqlserver run-validation \
  -dvf /opt/validations/prod_config.yaml \
  -ll INFO
```

### Use Cases

* Real-time validation during migration
* Pre-cutover validation checks
* Post-migration verification
* Continuous validation in CI/CD pipelines

---

## Run Asynchronous Validation

Performs validation using pre-generated metadata files without connecting to databases.

### Syntax

```bash
snowflake-data-validation sqlserver run-async-validation \
  --data-validation-config-file /path/to/config.yaml
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file
* **Note:** Configuration must specify paths to pre-generated metadata files

### Example Usage

```bash
# Run async validation
sdv sqlserver run-async-validation \
  --data-validation-config-file ./configs/async_validation.yaml

# Using full command name
snowflake-data-validation sqlserver run-async-validation \
  -dvf /data/validations/async_config.yaml
```

### Prerequisites

Before running async validation:

1. Generate validation scripts using `generate-validation-scripts`
2. Execute the generated scripts on source and target databases
3. Ensure metadata files are available in the configured paths

### Use Cases

* Validating in environments with restricted database access
* Separating metadata extraction from validation
* Batch validation workflows
* Scheduled validation jobs

---

## Generate Validation Scripts

Generates SQL scripts for extracting metadata that can be executed separately.

### Syntax

```bash
snowflake-data-validation sqlserver generate-validation-scripts \
  --data-validation-config-file /path/to/config.yaml
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file

### Example Usage

```bash
# Generate scripts
sdv sqlserver generate-validation-scripts \
  --data-validation-config-file ./configs/validation.yaml

# Using full command name
snowflake-data-validation sqlserver generate-validation-scripts \
  -dvf /opt/configs/script_generation.yaml
```

### Output

The command generates SQL scripts in the output directory configured in your YAML file:

```text
<output_directory>/
├── source_schema_queries.sql
├── source_metrics_queries.sql
├── source_row_queries.sql
├── target_schema_queries.sql
├── target_metrics_queries.sql
└── target_row_queries.sql
```

### Use Cases

* Generating scripts for execution by DBAs
* Compliance requirements for query review
* Environments where direct CLI database access is restricted
* Manual execution and validation workflows

---

## Get Configuration Templates

Retrieves example configuration files and optional query templates.

### Syntax

```bash
snowflake-data-validation sqlserver get-configuration-files \
  --templates-directory ./my-templates \
  --query-templates
```

### Options

**`--templates-directory, -td`** (optional)

* **Type:** String (path)
* **Default:** Current directory
* **Description:** Directory to save template files
* **Example:** `--templates-directory ./templates`

**`--query-templates`** (optional)

* **Type:** Flag (no value required)
* **Description:** Include J2 (Jinja2) query template files for advanced customization
* **Example:** `--query-templates`

### Example Usage

```bash
# Get basic templates in current directory
sdv sqlserver get-configuration-files

# Save templates to specific directory
sdv sqlserver get-configuration-files \
  --templates-directory ./my-project/templates

# Include query templates for customization
sdv sqlserver get-configuration-files \
  --templates-directory ./templates \
  --query-templates

# Using short flags
sdv sqlserver get-configuration-files -td ./templates --query-templates
```

### Output Files

**Without `--query-templates` flag:**

```text
<templates_directory>/
└── sqlserver_validation_template.yaml
```

**With `--query-templates` flag:**

```text
<templates_directory>/
├── sqlserver_validation_template.yaml
└── query_templates/
    ├── sqlserver_columns_metrics_query.sql.j2
    ├── sqlserver_row_count_query.sql.j2
    ├── sqlserver_compute_md5_sql.j2
    └── snowflake_columns_metrics_query.sql.j2
```

### Use Cases

* Starting a new validation project
* Learning configuration options
* Customizing validation queries for specific needs
* Creating organization-specific templates

---

## Auto-Generate Configuration File

Interactive command to generate a configuration file by prompting for connection parameters.

### Syntax

```bash
snowflake-data-validation sqlserver auto-generated-configuration-file
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### Interactive Prompts

The command will prompt for the following information:

1. **SQL Server host**

   * Hostname or IP address of SQL Server
   * Example: `sqlserver.company.com`
2. **SQL Server port** (default: 1433)

   * Port number for SQL Server connection
   * Press Enter to accept default
3. **SQL Server username**

   * Authentication username
   * Example: `migration_user`
4. **SQL Server password**

   * Authentication password (hidden input)
   * Not displayed on screen for security
5. **SQL Server database**

   * Name of the database to validate
   * Example: `production_db`
6. **SQL Server schema**

   * Schema name within the database
   * Example: `dbo`
7. **Trust server certificate** (default: no)

   * Options: yes/no
   * Set to “yes” for self-signed certificates
8. **Encrypt connection** (default: yes)

   * Options: yes/no/optional
   * Controls SSL/TLS encryption
9. **Output path for configuration file**

   * Where to save the generated YAML file
   * Example: `./configs/my_validation.yaml`

### Example Session

```bash
$ sdv sqlserver auto-generated-configuration-file

SQL Server host: sqlserver.company.com
SQL Server port [1433]:
SQL Server username: migration_user
SQL Server password: ********
SQL Server database: production_db
SQL Server schema: dbo
Trust server certificate (yes/no) [no]: no
Encrypt connection (yes/no/optional) [yes]: yes
Output path for configuration file: ./configs/sqlserver_config.yaml

Configuration file generated successfully: ./configs/sqlserver_config.yaml
```

### Generated Configuration

The command generates a basic YAML configuration file:

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./validation_results

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: migration_user
  password: "<hidden>"
  database: production_db
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables: []
```

### Next Steps After Generation

1. **Edit the configuration file** to add:

   * Target connection details
   * Tables to validate
   * Validation options
   * Column selections and mappings
2. **Review security settings:**

   * Consider using environment variables for passwords
   * Update trust certificate and encryption settings as needed
3. **Add table configurations:**

   * Specify fully qualified table names
   * Configure column selections
   * Set up filtering where clauses
4. **Test the configuration:**

   ```bash
   sdv sqlserver run-validation \
     --data-validation-config-file ./configs/sqlserver_config.yaml
   ```

### Use Cases

* Quick setup for new users
* Generating baseline configurations
* Testing connectivity during setup
* Creating template configurations for teams

---

## Row Partitioning Helper

Interactive command to generate partitioned table configurations for large tables. This helper divides tables into smaller row partitions based on a specified column, enabling more efficient validation of large datasets.

### Syntax

```bash
snowflake-data-validation sqlserver row-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The table partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply partitioning
3. If partitioning is enabled, collects partition parameters
4. Queries the source database to determine partition boundaries
5. Generates new table configurations with `WHERE` clauses for each partition
6. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/sqlserver_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply partitioning?** (yes/no)

   * Whether to partition this specific table
   * Default: yes

   b. **Partition column** (if partitioning)

   * Column name used to divide the table
   * Should be indexed for performance
   * Example: `transaction_id`, `created_date`

   c. **Is partition column a string type?** (yes/no)

   * Determines quoting in generated WHERE clauses
   * Default: no (numeric)

   d. **Number of partitions**

   * How many partitions to create
   * Example: `10`, `50`, `100`

### Example Session

```bash
$ sdv sqlserver row-partitioning-helper

Generate a configuration file for SQL Server table partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip partitioning or specify partitioning parameters for each table.

Configuration file path: ./configs/sqlserver_validation.yaml

Apply partitioning for production_db.dbo.fact_sales? [Y/n]: y
Write the partition column for production_db.dbo.fact_sales: sale_id
Is 'sale_id' column a string type? [y/N]: n
Write the number of partitions for production_db.dbo.fact_sales: 10

Apply partitioning for production_db.dbo.dim_customer? [Y/n]: n

Apply partitioning for production_db.dbo.transactions? [Y/n]: y
Write the partition column for production_db.dbo.transactions: transaction_date
Is 'transaction_date' column a string type? [y/N]: n
Write the number of partitions for production_db.dbo.transactions: 5

Table partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with WHERE clauses:

```yaml
tables:
  # Original table partitioned into 10 segments
  - fully_qualified_name: production_db.dbo.fact_sales
    where_clause: "sale_id >= 1 AND sale_id < 100000"
    target_where_clause: "sale_id >= 1 AND sale_id < 100000"
    # ... other table settings preserved

  - fully_qualified_name: production_db.dbo.fact_sales
    where_clause: "sale_id >= 100000 AND sale_id < 200000"
    target_where_clause: "sale_id >= 100000 AND sale_id < 200000"
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: production_db.dbo.dim_customer
    # ... original configuration
```

### Use Cases

* **Large table validation**: Break multi-billion row tables into manageable chunks
* **Parallel processing**: Enable concurrent validation of different partitions
* **Memory optimization**: Reduce memory footprint by processing smaller data segments
* **Incremental validation**: Validate specific data ranges independently
* **Performance tuning**: Optimize validation for tables with uneven data distribution

### Best Practices

1. **Choose appropriate partition columns:**

   * Use indexed columns for better query performance
   * Prefer columns with sequential values (IDs, timestamps)
   * Avoid columns with highly skewed distributions
2. **Determine optimal partition count:**

   * Consider table size and available resources
   * Start with 10-20 partitions for tables with 10M+ rows
   * Increase partitions for very large tables (100M+ rows)
3. **String vs numeric columns:**

   * Numeric columns are generally more efficient
   * String columns work but may have uneven distribution
4. **After partitioning:**

   * Review generated WHERE clauses
   * Adjust partition boundaries if needed
   * Test with a subset before full validation

---

## Column Partitioning Helper

Interactive command to generate partitioned table configurations for wide tables with many columns. This helper divides tables into smaller column partitions, enabling more efficient validation of tables with a large number of columns.

### Syntax

```bash
snowflake-data-validation sqlserver column-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The column partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply column partitioning
3. If partitioning is enabled, collects the number of partitions
4. Queries the source database to retrieve all column names for the table
5. Divides the columns into the specified number of partitions
6. Generates new table configurations where each partition validates only a subset of columns
7. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/sqlserver_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply column partitioning?** (yes/no)

   * Whether to partition this specific table by columns
   * Default: yes

   b. **Number of partitions** (if partitioning)

   * How many column partitions to create
   * Example: `3`, `5`, `10`

### Example Session

```bash
$ sdv sqlserver column-partitioning-helper

Generate a configuration file for SQL Server column partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip column partitioning or specify column partitioning parameters for each table.

Configuration file path: ./configs/sqlserver_validation.yaml

Apply column partitioning for production_db.dbo.wide_table? [Y/n]: y
Write the number of partitions for production_db.dbo.wide_table: 5

Apply column partitioning for production_db.dbo.small_table? [Y/n]: n

Apply column partitioning for production_db.dbo.report_table? [Y/n]: y
Write the number of partitions for production_db.dbo.report_table: 3

Column partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with column subsets:

```yaml
tables:
  # Original table with 100 columns partitioned into 5 segments (20 columns each)
  - fully_qualified_name: production_db.dbo.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_a
      - column_b
      - column_c
      # ... first 20 columns alphabetically

  - fully_qualified_name: production_db.dbo.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_d
      - column_e
      - column_f
      # ... next 20 columns alphabetically
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: production_db.dbo.small_table
    # ... original configuration
```

### Use Cases

* **Wide table validation**: Break tables with hundreds of columns into manageable chunks
* **Memory optimization**: Reduce memory footprint by validating fewer columns at a time
* **Parallel processing**: Enable concurrent validation of different column groups
* **Targeted validation**: Validate specific column groups independently
* **Performance tuning**: Optimize validation for tables with many LOB or complex columns

### Best Practices

1. **Determine optimal partition count:**

   * Consider the total number of columns in the table
   * For tables with 50+ columns, start with 3-5 partitions
   * For tables with 100+ columns, consider 5-10 partitions
2. **Column ordering:**

   * Columns are divided alphabetically
   * Related columns may end up in different partitions
3. **After partitioning:**

   * Review generated column lists
   * Verify all required columns are included
   * Test with a subset before full validation
4. **Combine with row partitioning:**

   * For very large, wide tables, consider using both row and column partitioning
   * First partition by columns, then apply row partitioning to each column partition if needed

---

## SQL Server Connection Configuration

SQL Server connections require specific configuration in the YAML file.

### Connection Example

```yaml
source_connection:
  mode: credentials
  host: "sqlserver.company.com"
  port: 1433
  username: "sqlserver_user"
  password: "secure_password"
  database: "source_database"
  trust_server_certificate: "no"
  encrypt: "yes"
```

### Connection Fields

**`mode`** (required)

* **Type:** String
* **Valid Values:** `credentials`
* **Description:** Connection mode for SQL Server

**`host`** (required)

* **Type:** String
* **Description:** SQL Server hostname or IP address
* **Examples:**

  + `"sqlserver.company.com"`
  + `"192.168.1.100"`
  + `"sql-prod-01.internal.company.net"`

**`port`** (required)

* **Type:** Integer
* **Default:** 1433
* **Description:** SQL Server port number
* **Common Values:**

  + 1433 (default)
  + 1434 (SQL Server Browser)

**`username`** (required)

* **Type:** String
* **Description:** SQL Server authentication username
* **Example:** `"migration_admin"`

**`password`** (required)

* **Type:** String
* **Description:** SQL Server authentication password
* **Security Note:** Consider using environment variables

**`database`** (required)

* **Type:** String
* **Description:** SQL Server database name
* **Example:** `"production_database"`

**`trust_server_certificate`** (optional)

* **Type:** String
* **Valid Values:** `"yes"`, `"no"`
* **Default:** `"no"`
* **Description:** Whether to trust the server certificate for SSL/TLS connections
* **Use Case:** Set to “yes” for self-signed certificates

**`encrypt`** (optional)

* **Type:** String
* **Valid Values:** `"yes"`, `"no"`, `"optional"`
* **Default:** `"yes"`
* **Description:** Connection encryption setting
* **Recommendations:**

  + Use “yes” for production
  + Use “optional” for development/testing
  + Use “no” only in secure internal networks

### Connection Examples

**Production Connection with SSL/TLS:**

```yaml
source_connection:
  mode: credentials
  host: "sql-prod.company.com"
  port: 1433
  username: "prod_reader"
  password: "${SQL_SERVER_PASSWORD}"  # From environment variable
  database: "production_db"
  trust_server_certificate: "no"
  encrypt: "yes"
```

**Development Connection:**

```yaml
source_connection:
  mode: credentials
  host: "localhost"
  port: 1433
  username: "dev_user"
  password: "dev_password"
  database: "dev_database"
  trust_server_certificate: "yes"
  encrypt: "optional"
```

**Self-Signed Certificate Connection:**

```yaml
source_connection:
  mode: credentials
  host: "internal-sql.company.local"
  port: 1433
  username: "internal_user"
  password: "secure_password"
  database: "internal_db"
  trust_server_certificate: "yes"  # Required for self-signed certs
  encrypt: "yes"
```

---

## Complete SQL Server Examples

### Example 1: Basic SQL Server Validation

```yaml
# Global configuration
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./validation_results
max_threads: auto

# Source connection
source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: sql_user
  password: sql_password
  database: production_db
  trust_server_certificate: "no"
  encrypt: "yes"

# Target connection
target_connection:
  mode: name
  name: snowflake_prod

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Tables to validate
tables:
  - fully_qualified_name: production_db.dbo.customers
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  - fully_qualified_name: production_db.dbo.orders
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_notes
      - audit_log
    where_clause: "order_date >= '2024-01-01'"
    target_where_clause: "order_date >= '2024-01-01'"
```

### Example 2: SQL Server with Column Mappings

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: /opt/validation/sqlserver
max_threads: 16

source_connection:
  mode: credentials
  host: sql-prod.company.com
  port: 1433
  username: migration_user
  password: secure_password
  database: legacy_db
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 100

comparison_configuration:
  tolerance: 0.01

tables:
  - fully_qualified_name: legacy_db.dbo.customer_master
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - cust_id
      - cust_name
      - email_addr
      - phone_num
    column_mappings:
      cust_id: customer_id
      cust_name: customer_name
      email_addr: email
      phone_num: phone
    index_column_list:
      - cust_id
    chunk_number: 20
```

### Example 3: SQL Server Large Table Optimization

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./large_table_validation
max_threads: 32

source_connection:
  mode: credentials
  host: bigdata-sql.company.com
  port: 1433
  username: bigdata_reader
  password: readonly_password
  database: analytics_db
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_analytics

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 500
  exclude_metrics: false

comparison_configuration:
  tolerance: 0.005

tables:
  - fully_qualified_name: analytics_db.dbo.fact_transactions
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - large_blob_column
      - xml_metadata
    index_column_list:
      - transaction_id
    chunk_number: 100
    where_clause: "transaction_date >= '2024-01-01' AND amount > 0"
    target_where_clause: "transaction_date >= '2024-01-01' AND amount > 0"
    max_failed_rows_number: 1000
```

### Example 4: SQL Server View Validation

Validate SQL Server views alongside tables for comprehensive migration verification.

```yaml
source_platform: SqlServer
target_platform: Snowflake
output_directory_path: ./view_validation
max_threads: auto

source_connection:
  mode: credentials
  host: sqlserver.company.com
  port: 1433
  username: view_validator
  password: secure_password
  database: ReportingDB
  trust_server_certificate: "no"
  encrypt: "yes"

target_connection:
  mode: name
  name: snowflake_reporting

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true

# Tables to validate
tables:
  - fully_qualified_name: ReportingDB.dbo.CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

# Views to validate
views:
  # Basic view validation with index columns
  - fully_qualified_name: ReportingDB.dbo.CUSTOMER_SUMMARY_VIEW
    target_name: CUSTOMER_SUMMARY_VIEW
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

  # View with specific columns
  - fully_qualified_name: ReportingDB.dbo.SALES_METRICS_VIEW
    target_name: SALES_METRICS_VIEW
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - REGION
      - TOTAL_SALES
      - ORDER_COUNT
    index_column_list: [REGION, PERIOD]
    target_index_column_list: [REGION, PERIOD]

  # View with filtering and column mappings
  - fully_qualified_name: ReportingDB.dbo.ACTIVE_ORDERS_VIEW
    target_name: ACTIVE_ORDERS_VIEW
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [ORDER_ID]
    target_index_column_list: [ORDER_ID]
    where_clause: "order_date >= '2024-01-01'"
    target_where_clause: "order_date >= '2024-01-01'"
    column_mappings:
      ORD_ID: ORDER_ID
      CUST_ID: CUSTOMER_ID

  # View with different target name
  - fully_qualified_name: ReportingDB.dbo.LEGACY_REPORT_VIEW
    target_database: MODERN_DB
    target_schema: ANALYTICS
    target_name: MODERNIZED_REPORT_VIEW
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [REPORT_ID]
    target_index_column_list: [REPORT_ID]
```

**Note:** View validation creates temporary tables internally to materialize view data for comparison between SQL Server and Snowflake.

---

## Troubleshooting SQL Server Connections

### Issue: SSL/TLS Certificate Errors

**Symptom:**

```sql
SSL certificate verification failed
```

**Solution:**

Set `trust_server_certificate` to “yes”:

```yaml
source_connection:
  trust_server_certificate: "yes"
  encrypt: "yes"
```

### Issue: Connection Timeout

**Symptom:**

```sql
Connection timeout: Unable to connect to SQL Server
```

**Solutions:**

1. Verify the host and port:

   ```bash
   telnet sqlserver.company.com 1433
   ```
2. Check firewall rules
3. Verify SQL Server is running and accepting connections
4. Test with SQL Server Management Studio (SSMS)

### Issue: Authentication Failed

**Symptom:**

```sql
Login failed for user 'username'
```

**Solutions:**

1. Verify credentials are correct
2. Check SQL Server authentication mode (mixed mode required)
3. Ensure user has necessary permissions:

   ```sql
   -- Grant read permissions
   GRANT SELECT ON SCHEMA::dbo TO migration_user;
   GRANT VIEW DEFINITION ON SCHEMA::dbo TO migration_user;
   ```

### Issue: Database Not Found

**Symptom:**

```sql
Cannot open database "database_name"
```

**Solutions:**

1. Verify database name is correct
2. Check user has access to the database:

   ```sql
   USE database_name;
   SELECT * FROM sys.tables;
   ```
3. Ensure database is online and accessible

---

## Best Practices for SQL Server

### Security

1. **Use encrypted connections** in production:

   ```yaml
   source_connection:
     encrypt: "yes"
     trust_server_certificate: "no"
   ```
2. **Store passwords securely:**

   * Use environment variables
   * Use secret management systems
   * Avoid hardcoding passwords
3. **Use read-only accounts:**

   ```sql
   CREATE USER migration_reader WITH PASSWORD = 'secure_password';
   GRANT SELECT ON SCHEMA::dbo TO migration_reader;
   ```

### Performance

1. **Enable chunking for large tables:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 50
   ```
2. **Use WHERE clauses to filter data:**

   ```yaml
   tables:
     - fully_qualified_name: transactions
       where_clause: "date >= '2024-01-01'"
   ```
3. **Optimize thread count:**

   ```yaml
   max_threads: 16  # Adjust based on server capacity
   ```

### Data Quality

1. **Start with schema validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: false
     row_validation: false
   ```
2. **Add metrics validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: true
     row_validation: false
   ```
3. **Enable row validation selectively:**

   ```yaml
   validation_configuration:
     row_validation: true

   tables:
     - fully_qualified_name: critical_table
       # Row validation enabled for this table
   ```

---

## See Also

* [Main CLI Usage Guide](CLI_USAGE_GUIDE.md)
* [Teradata Commands Reference](teradata_commands.md)
* [Redshift Commands Reference](redshift_commands.md)
* [Snowflake Commands Reference](snowflake_commands.md)
* [Configuration Examples](CONFIGURATION_EXAMPLES.md)
* [Quick Reference Guide](CLI_QUICK_REFERENCE.md)

---
title: SQL Server to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/sqlserver.md
section: Migrations
---

# **SQL Server to Snowflake Migration Guide**

## **Snowflake Migration Framework**

A typical SQL Server-to-Snowflake migration can be broken into nine key steps:

1. **Planning and design** are often overlooked steps in the migration process. The main reason is that companies typically want to show progress quickly, even if they haven’t fully understood the scope of the project. That is why, this phase is critical to understand and prioritize the migration project.
2. **Environment and security** with a plan, a clear timeline, a RACI matrix, and buy-in from all stakeholders, it’s time to move into execution mode.
   Setting up the necessary environments and security measures to begin the migration is very important before starting the migration phase given that there are many moving parts, and will be more impactful for the migration project if all your setup is ready before moving forward.
3. **Database code conversion** process involves extracting code directly from the source systems’ database catalog, such as table definitions, views, stored procedures and functions. Once extracted, you migrate all this code to equivalent data definition languages (DDLs) in Snowflake. This step also includes migrating data manipulation language (DML) scripts, which may be used by business analysts to build reports or dashboards.
   All this code needs to be migrated and adjusted to work in Snowflake. The adjustments can range from simple changes, such as naming conventions and data type mappings, to more complex differences in syntax, platform semantics and other factors. To assist with this, Snowflake offers a powerful solution called SnowConvert AI, which automates much of the database code conversion process.
4. **Data migration** Data migration involves transferring data between different storage systems, formats, or computer systems. In the context of a SQL Server to Snowflake migration, it specifically refers to moving data from your SQL Server environment to your new Snowflake environment.

   There are two main types discussed in this guide:

* **Historical data migration:** Taking a snapshot of your SQL Server data at a specific point in time and transferring it to Snowflake. This is often done as an initial, bulk transfer.
* **Incremental data migration:** Moving new or changed data from SQL Server to Snowflake on an ongoing basis after the initial historical migration. This ensures that your Snowflake environment stays up-to-date with your source systems.

5. **Data ingestion:** After migrating the historical data, the next step is migrating the data ingestion process, bringing in live data from various sources. Typically, this process follows an extract, transform, load (ETL) or extract, load, transform (ELT) model, depending on when and where the data transformation occurs before it becomes available to business users.
6. **Reporting and analytics,** now that the database has both historical data and live pipelines continuously importing new data, the next step is to extract value from this data through BI. Reporting can be done using standard BI tools or custom queries. In both cases, the SQL sent to the database may need to be adjusted to meet Snowflake’s requirements. These adjustments can range from simple name changes (common during migration) to syntax and more complex semantic differences. All these need to be identified and addressed.
7. **Data validation and testing:** The goal is to have the data as clean as possible before entering this phase.
   Every organization has its own testing methodologies and requirements for moving data into production. These must be fully understood from the start of the project.
8. **Deployment.** At this stage, the data is validated, an equivalent system is set up, all the ETLs have been migrated, and reports have been verified. Are you ready to go live?
   Not so fast — there are still a few critical considerations before final promotion to production. First, your legacy application may consist of multiple components or services. Ideally, you should migrate these applications one by one (although parallel migration is possible) and promote them to production in the same order. During this process, ensure your bridging strategy is in place so business users don’t have to query both Snowflake and the legacy system. Data synchronization for applications that haven’t been migrated yet should happen behind the scenes through the bridging mechanism. If this isn’t done, business users will have to work in a hybrid environment, and they must understand the implications of this setup.
9. **Optimize and run** once a system has been migrated to Snowflake, it enters normal maintenance mode. All software systems are living organisms requiring ongoing maintenance. This phase, after migration, is referred to as optimize and run, and it is not part of the migration itself.

## **Key Phases**

A successful migration from SQL Server to Snowflake is a modernization project that unfolds over a sequence of well-defined phases. Following this structured nine-phase approach ensures a comprehensive and methodical transition, addressing everything from initial strategy to long-term operational excellence.

### **Phase 1: Planning and Design**

This initial phase is the most critical for the success of the entire migration project, as it lays the groundwork for accurate scoping, realistic timelines, and stakeholder alignment. A rushed or incomplete planning phase is the leading cause of budget overruns, missed deadlines, and project failure. The objective is not just to catalog the existing system but to strategically decide what assets are valuable enough to move to the new platform. A “lift and shift everything” approach is a recipe for migrating years of accumulated technical debt and inflating cloud costs from day one.

**Key Activities:**

* **Conducting a Comprehensive Inventory:** The first step is to create a detailed and exhaustive manifest of every asset within the scope of the migration. This inventory should be created using a combination of automated discovery tools, system catalog queries, and interviews with application owners. The inventory must include:

  + **Database Objects:** All databases, schemas, tables, and views. For tables, document row counts and raw data size.
  + **Procedural Code:** All stored procedures, user-defined functions (UDFs), triggers, and any logic using cursors.
  + **Automation and ETL:** All SQL Server Agent jobs, their schedules, and their dependencies. A complete catalog of SQL Server Integration Services (SSIS) packages is especially critical.
  + **Downstream Consumers:** All applications and BI tools that connect to the database, such as SSRS reports, Power BI dashboards, and Tableau workbooks.
  + **Security Principals:** All users, roles, and granular permissions.
  + **Excluding System Databases:** It is a critical mistake to attempt to migrate SQL Server’s internal system databases (`master`, `msdb`, `tempdb`, `model`). These are integral to a SQL Server instance but have no function or equivalent in Snowflake and must be explicitly excluded from all migration plans.
* **Defining Migration Objectives, Scope, and Success Metrics:** With a complete inventory, the team can define clear and measurable goals tied to business outcomes. Examples include:

  + **Objective:** Improve performance of month-end financial reporting.
  + **Metric:** Reduce the runtime of the “MonthEnd_Consolidation” report suite by 50%.
  + **Objective:** Reduce data warehousing total cost of ownership (TCO).
  + **Metric:** Decrease annual TCO by 30% compared to the previous year’s costs.
* **Stakeholder Alignment and Assembling the Migration Team (RACI):** A data platform migration is a business transformation. Early and continuous engagement with all stakeholders is critical. The migration team should include representatives from business users, data engineering, finance, security, and legal. A RACI (Responsible, Accountable, Consulted, Informed) matrix should be established to formalize roles and responsibilities.
* **Introducing FinOps:** The shift to Snowflake’s consumption-based cost model must be planned from the beginning. The migration team must coordinate with the finance department to understand the pricing model, establish budgets, and define how costs will be tracked and attributed, often using Snowflake’s object tagging features.
* **Initial Assessment and Triage:** The inventory provides the data needed for a critical triage process. The team should analyze usage logs to identify redundant or obsolete data, unused objects, and temporary staging data that can be decommissioned or archived instead of migrated.

### **Phase 2: Environment and Security**

With a strategic plan in place, this phase involves building the foundational Snowflake environment. This is a “greenfield” opportunity to design a clean, secure, and governable data platform from first principles, rather than simply mapping the legacy security model 1:1. Most mature SQL Server environments suffer from “security debt” like overly broad access and inconsistent roles, which this phase aims to resolve.

**Key Activities:**

* **Architecting Your Snowflake Account Structure:** For most enterprises, a multi-account strategy is recommended to ensure complete data and metadata isolation. This typically includes separate accounts for:

  + **Production Account:** Houses all production data and workloads with the strictest security controls.
  + **Development/QA Account:** A separate account for all development and testing activities.
  + **Sandbox Account (Optional):** An account for experimental work by data scientists or analysts.
* **Implementing a Robust Security Model:** Security should be implemented in layers:

  + **Network Policies:** As the first line of defense, create network policies to restrict access to the Snowflake account to a whitelist of trusted IP addresses.
  + **Authentication:** Enforce Multi-Factor Authentication (MFA) for all users. For a seamless and secure user experience, integrate Snowflake with a corporate Single Sign-On (SSO) provider like Azure Active Directory (Azure AD) or Okta.
  + **Designing a Role-Based Access Control (RBAC) Hierarchy:** This is the cornerstone of Snowflake security. All privileges on objects are granted exclusively to roles, which are then granted to users. A best-practice hierarchy involves creating distinct types of roles:

    - **System-Defined Roles:** `ACCOUNTADMIN`, `SYSADMIN`, etc., used for administrative tasks only.
    - **Functional Roles:** Custom roles that map to business functions (e.g., `FINANCE_ADMIN`, `MARKETING_ANALYST`).
    - **Access Roles:** Granular roles that define specific permissions (e.g., `READ_ONLY`, `WRITE_ACCESS`). These roles are then granted in a hierarchy to simplify administration.
* **Configuring Resource Monitors and Cost Controls:** Resource monitors are the primary tool within Snowflake for implementing cost controls. They should be configured as part of the initial environment setup to track credit consumption at both the account and warehouse levels. For each monitor, set notification and suspension triggers (e.g., send an email at 75% of quota, suspend the warehouse at 100%) to prevent budget overruns.

### **Phase 3: Database Code Conversion**

This phase focuses on the technical translation of the database’s physical structure and procedural logic from SQL Server’s T-SQL to Snowflake’s ANSI-compliant SQL. This is often the most complex and time-consuming part of the migration. The process is a catalyst for modernizing data processing logic, forcing a fundamental shift away from imperative, stateful logic toward declarative, set-based processing.

**Key Activities:**

* **Translating Data Definition Language (DDL):** This involves extracting and converting `CREATE TABLE` and `CREATE VIEW` statements. Automated code conversion tools like Snowflake’s SnowConvert AI are highly recommended to parse T-SQL DDL and generate the equivalent Snowflake SQL, handling syntax differences and data type mapping.
* **Data Type Mapping:** Accurate data type mapping is foundational. While many types map directly (e.g., `INT` to `NUMBER`), several key differences require careful attention, especially with date/time types. SQL Server’s `DATETIME` and `DATETIME2` are time zone-unaware and must be mapped to Snowflake’s `TIMESTAMP_NTZ`. Conversely, `DATETIMEOFFSET` contains a time zone offset and must be mapped to `TIMESTAMP_TZ` to preserve this information.
* **Handling Constraints (Enforced vs. Unenforced):** This represents a significant conceptual shift. In SQL Server, constraints like Primary Keys and Foreign Keys are **enforced** by the database engine. In Snowflake, these constraints can be defined but are **not enforced**. They exist purely as metadata. The responsibility for maintaining data integrity shifts entirely from the database to the data pipeline (ETL/ELT process).
* **Stored Procedure and T-SQL Conversion:** Migrating T-SQL stored procedures is a significant undertaking.

  + **SQL Dialect Discrepancies:** Numerous T-SQL functions and syntax constructs require conversion (e.g., `GETDATE()` becomes `CURRENT_TIMESTAMP()`, `ISNULL()` becomes `COALESCE()`).
  + **Refactoring Logic:** The preferred path is to rewrite T-SQL procedures using Snowflake Scripting, a SQL-based procedural language. The overarching goal is to eliminate row-by-row processing (like cursors) in favor of set-based SQL statements wherever possible.
  + **Replacing Cursors and Triggers:** Cursors are a severe performance anti-pattern in Snowflake and must be eliminated. Snowflake does not support triggers; their functionality must be re-implemented using a cloud-native pattern of **Streams and Tasks**, where a stream captures table changes and a scheduled task consumes those changes to apply business logic.

### **Phase 4: Data Migration**

This phase focuses on the initial, one-time bulk transfer of historical data from the source SQL Server system to the target Snowflake environment. The fundamental architecture for loading data into Snowflake is a “three-box” model: **Source -> Stage -> Target**. Data is not moved directly from source to target but is first landed in an intermediate cloud object storage location (the stage).

**Key Activities:**

* **Data Extraction from SQL Server:** For the initial migration of historical data, SQL Server’s native **Bulk Copy Program (BCP)** command-line utility is a highly efficient option. It can export large tables to flat files (e.g., CSV) at high speed. These files can then be uploaded to the cloud stage (e.g., Amazon S3, Azure Blob Storage).
* **Loading Data into Snowflake from the Stage:** Once data files are present in the cloud stage, the primary mechanism for ingestion is the **`COPY INTO <table>`** command. This is the workhorse SQL command for high-performance, bulk data loading. It is designed to work in a massively parallel fashion. For optimal performance, it is a best practice to split large data sets into multiple files of a moderate size (100-250MB is a common recommendation) to maximize this parallelism.

### **Phase 5: Data Ingestion**

After migrating the historical data, this phase focuses on migrating the ongoing data ingestion processes to bring live, incremental data from various sources into Snowflake. This typically involves migrating logic from legacy ETL tools like SSIS and scheduling from SQL Server Agent.

**Key Activities:**

* **Incremental Data Replication:** For replicating ongoing changes after the initial load, SQL Server’s native **Change Data Capture (CDC)** feature is the preferred method. CDC works by reading the database’s transaction log to capture all `INSERT`, `UPDATE`, and `DELETE` operations as they occur, providing a low-impact, near real-time stream of changes.
* **Continuous Ingestion with Snowpipe:** **Snowpipe** is Snowflake’s continuous data ingestion service, designed for streaming and micro-batch use cases. You create a `PIPE` object that “subscribes” to a stage. When new change files generated by a CDC process arrive in the stage, Snowpipe is automatically triggered to load the data.
* **Applying Changes with MERGE:** After change data has been loaded into a temporary staging table in Snowflake (via Snowpipe), the **`MERGE`** command is used to apply those changes to the final production table. It can handle inserts, updates, and deletes in a single, atomic statement.
* **Modernizing SSIS and SQL Server Agent Jobs:**

  + **SSIS Migration:** Simply pointing an existing SSIS package at Snowflake is not a viable strategy. The recommended approach is to **re-architect SSIS logic with cloud-native tools**, embracing the **ELT (Extract, Load, Transform)** pattern. This involves decommissioning SSIS and rebuilding the business logic using tools like **dbt (data build tool)** for in-warehouse transformations, with orchestration managed by a tool like **Apache Airflow**.
  + **SQL Server Agent Migration:** The scheduling functionality of SQL Server Agent must be migrated. Simple, non-dependent jobs can be scheduled using native **Snowflake Tasks**. Complex workflows with dependencies require a more powerful external orchestrator like Apache Airflow or Azure Data Factory.

### **Phase 6: Reporting and Analytics**

A data warehouse migration is not truly complete until the end-users are successfully using the new platform through their preferred analytics tools. This “last mile” of the project is often underestimated and requires meticulous planning to manage user acceptance, performance, and cost.

**Key Activities:**

* **Connecting BI Tools (Tableau, Power BI):** Both Tableau and Power BI are first-class citizens in the Snowflake ecosystem and provide native, high-performance connectors. For both tools, a critical decision must be made on a per-dashboard basis between a **live connection** (e.g., Tableau Live, Power BI DirectQuery) and an **imported/extracted model**.

  + **Live/DirectQuery:** Provides real-time data but sends queries directly to Snowflake for every user interaction, which can lead to significant compute costs.
  + **Extract/Import:** Provides excellent performance by serving queries from an in-memory copy of the data, but the data is only as fresh as the last refresh.
* **The SSRS Challenge and Replacement:** Connecting SQL Server Reporting Services (SSRS) to Snowflake is notoriously challenging and not a recommended long-term strategy. The migration to Snowflake should serve as the catalyst for a strategic plan to **decommission SSRS**. Critical SSRS reports should be assessed and rebuilt in a modern, cloud-native BI platform like Power BI or Tableau.
* **Workload Isolation:** To govern the performance and cost impact of these BI tools, it is a best practice to create dedicated, appropriately-sized virtual warehouses in Snowflake specifically for BI workloads. This isolates BI queries from other workloads like ETL.

### **Phase 7: Data Validation and Testing**

This phase is where the newly built Snowflake platform is rigorously tested and validated against the legacy system to build business trust and ensure a successful deployment. Data validation cannot be an afterthought and must go far beyond simple row counts.

**Key Activities:**

* **A Multi-Layered Data Validation Strategy:**

  + **Level 1: File and Object Validation:** Use checksums or hash functions to verify that data files transferred from the source system to the cloud stage have not been corrupted in transit.
  + **Level 2: Reconciliation and Aggregate Validation:** Run queries on both the source SQL Server database and the target Snowflake tables to compare key metrics like row counts and aggregate functions (`SUM`, `AVG`, `MIN`, `MAX`) for all key numeric columns.
  + **Level 3: Cell-Level Validation (Data Diff):** For the most business-critical tables, a more granular, cell-by-cell comparison of a statistically significant sample of rows is required to catch subtle data type conversion errors or transformation logic bugs.
* **Performance Testing and User Acceptance Testing (UAT):**

  + **Performance Testing:** The migrated ETL/ELT pipelines and BI reports must be tested against the performance SLAs defined in the planning phase.
  + **User Acceptance Testing (UAT):** This is where business users get hands-on with the new system. They must be given the time and resources to run their reports, execute their queries, and validate that the migrated system meets their functional requirements and produces the same results as the legacy system. UAT is the final gate before production deployment.

### **Phase 8: Deployment**

This phase is the culmination of all preceding efforts, where the validated system is promoted to production and the formal switch, or “cutover,” from the legacy SQL Server system to Snowflake occurs. The strategy should be chosen to minimize risk and business disruption.

**Key Activities:**

* **Developing a Cutover Plan:** Instead of a single “big bang” cutover, a phased approach is recommended to limit the “blast radius” of any potential issues.

  + **Phased Rollout (Recommended):** Migrate applications, reports, or business units one by one over a period of time.
  + **Parallel Run:** For a period, run both the legacy SQL Server and the new Snowflake systems in parallel, feeding data to both and comparing outputs to ensure 100% consistency before decommissioning the legacy system.
  + **Bridging Strategy:** During a phased rollout or parallel run, it is critical to implement a bridging strategy so that users do not have to query two different systems. The goal is to present a single, unified view to the business.
* **Final Deployment Checklist and Stakeholder Sign-off:** Before the final cutover, the team should conduct a final readiness review. This includes verifying all permissions and roles, ensuring all service accounts are in place, and confirming that monitoring and alerting are active. Obtain formal, written sign-off from all key business and technical stakeholders before going live.

### **Phase 9: Optimize and Run**

The completion of the cutover marks the end of the migration project but the beginning of the platform’s operational life. A data platform is a living system that requires ongoing maintenance, governance, and optimization. In the Snowflake paradigm, performance tuning and cost optimization are two sides of the same coin: applying the right amount of compute, for the right amount of time, to meet a business SLA at the lowest possible cost.

**Key Activities:**

* **Performance Tuning:**

  + **Virtual Warehouse Sizing and Management:** This is the primary lever for both performance and cost. Continuously monitor and right-size warehouses, create separate warehouses for different workloads (workload isolation), and ensure all warehouses have an aggressive auto-suspend policy.
  + **Query Optimization:** Use Snowflake’s **Query Profile** tool to visually analyze and debug slow-running queries.
  + **Clustering Keys:** For very large tables (typically over 1 terabyte), defining a clustering key can significantly improve query performance by physically co-locating related data.
* **Implementing Long-Term FinOps:**

  + **Continuous Monitoring:** Regularly review cost and usage data from the `ACCOUNT_USAGE` schema.
  + **Showback and Chargeback:** Implement a model to attribute costs back to the business units or projects that incur them to drive accountability.
  + **Object Tagging:** Use Snowflake’s tagging feature to apply metadata tags to objects to simplify cost allocation and governance.
* **Establishing Data Governance and Security:**

  + **RBAC Refinement:** Continuously update the RBAC hierarchy and perform regular audits to remove unused roles or excessive permissions.
  + **Advanced Security Features:** For highly sensitive data, implement Snowflake’s advanced data governance features like **Dynamic Data Masking** and **Row-Access Policies**.

---
title: SQL Statements
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/translation-references/hive/ddls/README.md
section: Migrations
---

# SQL Statements

Translation reference for all the supported statements by SnowConvert AI for Hive, Spark and Databricks SQL.

---
title: Teradata Commands Reference
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/data-validation-cli/teradata_commands.md
section: Migrations
---

# Teradata Commands Reference

## Overview

This page provides comprehensive reference documentation for Teradata-specific commands in the Snowflake Data Validation CLI. For SQL Server commands, see [SQL Server Commands Reference](sqlserver_commands.md). For Amazon Redshift commands, see [Redshift Commands Reference](redshift_commands.md). For Snowflake-to-Snowflake commands, see [Snowflake Commands Reference](snowflake_commands.md).

---

## Command Structure

All Teradata commands follow this consistent structure:

```bash
snowflake-data-validation teradata <command> [options]

# Or use the shorter alias
sdv teradata <command> [options]
```

Where `<command>` is one of:

* `run-validation` - Run synchronous validation
* `run-async-validation` - Run asynchronous validation
* `generate-validation-scripts` - Generate validation scripts
* `get-configuration-files` - Get configuration templates
* `auto-generated-configuration-file` - Interactive config generation
* `row-partitioning-helper` - Interactive row partitioning configuration
* `column-partitioning-helper` - Interactive column partitioning configuration

---

## Run Synchronous Validation

Validates data between Teradata and Snowflake in real-time.

### Syntax

```bash
snowflake-data-validation teradata run-validation \
  --data-validation-config-file /path/to/config.yaml \
  --log-level INFO
```

### Options

**`--data-validation-config-file, -dvf`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file containing validation settings
* **Example:** `--data-validation-config-file ./configs/teradata_validation.yaml`

**`--teradata-host`** (optional)

* **Type:** String
* **Description:** Teradata server hostname (overrides config file)
* **Example:** `--teradata-host teradata.company.com`

**`--teradata-username`** (optional)

* **Type:** String
* **Description:** Teradata username (overrides config file)
* **Example:** `--teradata-username my_user`

**`--teradata-password`** (optional)

* **Type:** String
* **Description:** Teradata password (overrides config file)
* **Example:** `--teradata-password my_password`

**`--teradata-database`** (optional)

* **Type:** String
* **Description:** Teradata database name (overrides config file)
* **Example:** `--teradata-database prod_db`

**`--snowflake-connection-name`** (optional)

* **Type:** String
* **Description:** Snowflake connection name
* **Example:** `--snowflake-connection-name prod_connection`

**`--output-directory`** (optional)

* **Type:** String (path)
* **Description:** Directory for validation results
* **Example:** `--output-directory ./validation_results`

**`--log-level, -ll`** (optional)

* **Type:** String
* **Valid Values:** DEBUG, INFO, WARNING, ERROR, CRITICAL
* **Default:** INFO
* **Description:** Logging level for validation execution
* **Example:** `--log-level DEBUG`

### Example Usage

```bash
# Basic validation using config file
sdv teradata run-validation \
  --data-validation-config-file ./configs/teradata_validation.yaml

# Validation with connection override
sdv teradata run-validation \
  --data-validation-config-file ./config.yaml \
  --teradata-host teradata.company.com \
  --teradata-username my_user \
  --teradata-password my_password \
  --output-directory ./validation_results

# Validation with debug logging
sdv teradata run-validation \
  --data-validation-config-file ./config.yaml \
  --log-level DEBUG

# Using full command name with all options
snowflake-data-validation teradata run-validation \
  -dvf /opt/validations/prod_config.yaml \
  --teradata-host td-prod.company.com \
  --teradata-database production_db \
  --snowflake-connection-name snowflake_prod \
  --output-directory /data/validation_results \
  -ll INFO
```

### Use Cases

* Real-time validation during Teradata migration
* Pre-cutover validation checks
* Post-migration verification
* Continuous validation in CI/CD pipelines
* Testing with temporary credentials

---

## Generate Validation Scripts

Generates SQL scripts for Teradata and Snowflake metadata extraction.

### Syntax

```bash
snowflake-data-validation teradata generate-validation-scripts \
  /path/to/config.yaml \
  --output-directory ./scripts
```

### Positional Arguments

**`config_file`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file
* **Example:** `./configs/validation.yaml`

### Options

**`--teradata-host`** (optional)

* **Type:** String
* **Description:** Teradata server hostname (overrides config file)
* **Example:** `--teradata-host teradata.company.com`

**`--teradata-username`** (optional)

* **Type:** String
* **Description:** Teradata username (overrides config file)
* **Example:** `--teradata-username script_generator`

**`--teradata-password`** (optional)

* **Type:** String
* **Description:** Teradata password (overrides config file)
* **Example:** `--teradata-password secure_password`

**`--teradata-database`** (optional)

* **Type:** String
* **Description:** Teradata database name (overrides config file)
* **Example:** `--teradata-database analytics_db`

**`--output-directory`** (optional)

* **Type:** String (path)
* **Description:** Directory for generated scripts
* **Example:** `--output-directory ./generated_scripts`

### Example Usage

```bash
# Basic script generation
sdv teradata generate-validation-scripts \
  ./configs/validation.yaml

# Script generation with connection override
sdv teradata generate-validation-scripts \
  ./config.yaml \
  --teradata-host teradata.company.com \
  --teradata-username script_user \
  --teradata-password script_password \
  --output-directory ./scripts

# Script generation to specific directory
sdv teradata generate-validation-scripts \
  /opt/configs/prod_validation.yaml \
  --output-directory /data/validation_scripts

# Using full command name
snowflake-data-validation teradata generate-validation-scripts \
  ./config.yaml \
  --teradata-database production_db \
  --output-directory ./generated_scripts
```

### Output

The command generates SQL scripts in the specified output directory:

```text
<output_directory>/
├── teradata_schema_queries.sql
├── teradata_metrics_queries.sql
├── teradata_row_queries.sql
├── snowflake_schema_queries.sql
├── snowflake_metrics_queries.sql
└── snowflake_row_queries.sql
```

### Use Cases

* Generating scripts for execution by DBAs
* Compliance requirements for query review
* Environments where direct CLI database access is restricted
* Manual execution and validation workflows
* Separating metadata extraction from validation

---

## Run Asynchronous Validation

Performs validation using pre-generated metadata files without connecting to databases.

### Syntax

```bash
snowflake-data-validation teradata run-async-validation \
  /path/to/config.yaml \
  --output-directory ./validation_results
```

### Positional Arguments

**`config_file`** (required)

* **Type:** String (path)
* **Description:** Path to YAML configuration file
* **Example:** `./configs/async_validation.yaml`

### Options

**`--output-directory`** (optional)

* **Type:** String (path)
* **Description:** Directory containing metadata files generated from scripts
* **Example:** `--output-directory ./metadata_files`

### Example Usage

```bash
# Run async validation
sdv teradata run-async-validation \
  ./configs/async_validation.yaml

# Run async validation with specific metadata directory
sdv teradata run-async-validation \
  ./config.yaml \
  --output-directory ./validation_metadata

# Using full command name
snowflake-data-validation teradata run-async-validation \
  /opt/configs/validation.yaml \
  --output-directory /data/validation_metadata
```

### Prerequisites

Before running async validation:

1. Generate validation scripts using `generate-validation-scripts`
2. Execute the generated scripts on Teradata and Snowflake databases
3. Save results to CSV/metadata files
4. Ensure metadata files are available in the configured paths

### Use Cases

* Validating in environments with restricted database access
* Separating metadata extraction from validation
* Batch validation workflows
* Scheduled validation jobs
* When database connections are intermittent

---

## Get Configuration Templates

Retrieves Teradata configuration templates.

### Syntax

```bash
snowflake-data-validation teradata get-configuration-files \
  --templates-directory ./teradata-templates \
  --query-templates
```

### Options

**`--templates-directory, -td`** (optional)

* **Type:** String (path)
* **Default:** Current directory
* **Description:** Directory to save template files
* **Example:** `--templates-directory ./templates`

**`--query-templates`** (optional)

* **Type:** Flag (no value required)
* **Description:** Include J2 (Jinja2) query template files for advanced customization
* **Example:** `--query-templates`

### Example Usage

```bash
# Get basic templates in current directory
sdv teradata get-configuration-files

# Save templates to specific directory
sdv teradata get-configuration-files \
  --templates-directory ./my-project/teradata-templates

# Include query templates for customization
sdv teradata get-configuration-files \
  --templates-directory ./templates \
  --query-templates

# Using short flags
sdv teradata get-configuration-files -td ./templates --query-templates
```

### Output Files

**Without `--query-templates` flag:**

```text
<templates_directory>/
└── teradata_validation_template.yaml
```

**With `--query-templates` flag:**

```text
<templates_directory>/
├── teradata_validation_template.yaml
└── query_templates/
    ├── teradata_columns_metrics_query.sql.j2
    ├── teradata_row_count_query.sql.j2
    ├── teradata_compute_md5_sql.j2
    └── snowflake_columns_metrics_query.sql.j2
```

### Use Cases

* Starting a new Teradata validation project
* Learning Teradata-specific configuration options
* Customizing validation queries for Teradata
* Creating organization-specific templates

---

## Auto-Generate Configuration File

Interactive command for Teradata configuration generation.

### Syntax

```bash
snowflake-data-validation teradata auto-generated-configuration-file
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### Interactive Prompts

The command will prompt for the following information:

1. **Teradata host**

   * Hostname or IP address of Teradata server
   * Example: `teradata.company.com`
2. **Teradata username**

   * Authentication username
   * Example: `migration_user`
3. **Teradata password**

   * Authentication password (hidden input)
   * Not displayed on screen for security
4. **Teradata database**

   * Name of the database to validate
   * Example: `production_db`
5. **Output directory path**

   * Where to save validation results
   * Example: `./validation_results`

### Example Session

```bash
$ sdv teradata auto-generated-configuration-file

Teradata host: teradata.company.com
Teradata username: migration_user
Teradata password: ********
Teradata database: production_db
Output directory path: ./validation_results

Configuration file generated successfully: ./teradata_validation_config.yaml
```

### Generated Configuration

The command generates a basic YAML configuration file:

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./validation_results
target_database: PRODUCTION_DB

source_connection:
  mode: credentials
  host: teradata.company.com
  username: migration_user
  password: "<hidden>"
  database: production_db

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

tables: []
```

### Next Steps After Generation

1. **Edit the configuration file** to add:

   * Target connection details (if not using default)
   * Tables to validate
   * Validation options
   * Column selections and mappings
2. **Add table configurations:**

   * Specify fully qualified table names
   * Configure column selections
   * Set up filtering where clauses
3. **Review Teradata-specific settings:**

   * Verify target_database is correctly set
   * Check schema mappings if needed
4. **Test the configuration:**

   ```bash
   sdv teradata run-validation \
     --data-validation-config-file ./teradata_validation_config.yaml
   ```

### Use Cases

* Quick setup for new Teradata users
* Generating baseline configurations
* Testing connectivity during setup
* Creating template configurations for teams

---

## Row Partitioning Helper

Interactive command to generate partitioned table configurations for large tables. This helper divides tables into smaller row partitions based on a specified column, enabling more efficient validation of large datasets.

### Syntax

```bash
snowflake-data-validation teradata row-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The table partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply partitioning
3. If partitioning is enabled, collects partition parameters
4. Queries the source Teradata database to determine partition boundaries
5. Generates new table configurations with `WHERE` clauses for each partition
6. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/teradata_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply partitioning?** (yes/no)

   * Whether to partition this specific table
   * Default: yes

   b. **Partition column** (if partitioning)

   * Column name used to divide the table
   * Should be indexed for performance
   * Example: `transaction_id`, `created_date`

   c. **Is partition column a string type?** (yes/no)

   * Determines quoting in generated WHERE clauses
   * Default: no (numeric)

   d. **Number of partitions**

   * How many partitions to create
   * Example: `10`, `50`, `100`

### Example Session

```bash
$ sdv teradata row-partitioning-helper

Generate a configuration file for Teradata table partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip partitioning or specify partitioning parameters for each table.

Configuration file path: ./configs/teradata_validation.yaml

Apply partitioning for enterprise_db.fact_sales? [Y/n]: y
Write the partition column for enterprise_db.fact_sales: sale_id
Is 'sale_id' column a string type? [y/N]: n
Write the number of partitions for enterprise_db.fact_sales: 10

Apply partitioning for enterprise_db.dim_customer? [Y/n]: n

Apply partitioning for enterprise_db.transactions? [Y/n]: y
Write the partition column for enterprise_db.transactions: transaction_date
Is 'transaction_date' column a string type? [y/N]: n
Write the number of partitions for enterprise_db.transactions: 5

Table partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with WHERE clauses:

```yaml
tables:
  # Original table partitioned into 10 segments
  - fully_qualified_name: enterprise_db.fact_sales
    where_clause: "sale_id >= 1 AND sale_id < 100000"
    target_where_clause: "sale_id >= 1 AND sale_id < 100000"
    # ... other table settings preserved

  - fully_qualified_name: enterprise_db.fact_sales
    where_clause: "sale_id >= 100000 AND sale_id < 200000"
    target_where_clause: "sale_id >= 100000 AND sale_id < 200000"
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: enterprise_db.dim_customer
    # ... original configuration
```

### Use Cases

* **Large table validation**: Break multi-billion row tables into manageable chunks
* **Parallel processing**: Enable concurrent validation of different partitions
* **Memory optimization**: Reduce memory footprint by processing smaller data segments
* **Incremental validation**: Validate specific data ranges independently
* **Performance tuning**: Optimize validation for tables with uneven data distribution

### Best Practices

1. **Choose appropriate partition columns:**

   * Use indexed columns for better query performance
   * Prefer columns with sequential values (IDs, timestamps)
   * Avoid columns with highly skewed distributions
2. **Determine optimal partition count:**

   * Consider table size and available resources
   * Start with 10-20 partitions for tables with 10M+ rows
   * Increase partitions for very large tables (100M+ rows)
3. **String vs numeric columns:**

   * Numeric columns are generally more efficient
   * String columns work but may have uneven distribution
4. **After partitioning:**

   * Review generated WHERE clauses
   * Adjust partition boundaries if needed
   * Test with a subset before full validation

---

## Column Partitioning Helper

Interactive command to generate partitioned table configurations for wide tables with many columns. This helper divides tables into smaller column partitions, enabling more efficient validation of tables with a large number of columns.

### Syntax

```bash
snowflake-data-validation teradata column-partitioning-helper
```

### Options

This command has no command-line options. All input is provided through interactive prompts.

### How It Works

The column partitioning helper:

1. Reads an existing configuration file with table definitions
2. For each table, prompts whether to apply column partitioning
3. If partitioning is enabled, collects the number of partitions
4. Queries the source Teradata database to retrieve all column names for the table
5. Divides the columns into the specified number of partitions
6. Generates new table configurations where each partition validates only a subset of columns
7. Saves the partitioned configuration to a new file

### Interactive Prompts

The command will prompt for the following information:

1. **Configuration file path**

   * Path to existing YAML configuration file
   * Example: `./configs/teradata_validation.yaml`
2. **For each table in the configuration:**

   a. **Apply column partitioning?** (yes/no)

   * Whether to partition this specific table by columns
   * Default: yes

   b. **Number of partitions** (if partitioning)

   * How many column partitions to create
   * Example: `3`, `5`, `10`

### Example Session

```bash
$ sdv teradata column-partitioning-helper

Generate a configuration file for Teradata column partitioning. This interactive
helper function processes each table in the configuration file, allowing users to
either skip column partitioning or specify column partitioning parameters for each table.

Configuration file path: ./configs/teradata_validation.yaml

Apply column partitioning for enterprise_db.wide_table? [Y/n]: y
Write the number of partitions for enterprise_db.wide_table: 5

Apply column partitioning for enterprise_db.small_table? [Y/n]: n

Apply column partitioning for enterprise_db.report_table? [Y/n]: y
Write the number of partitions for enterprise_db.report_table: 3

Column partitioning configuration file generated successfully!
```

### Generated Output

The command generates partitioned table configurations with column subsets:

```yaml
tables:
  # Original table with 100 columns partitioned into 5 segments (20 columns each)
  - fully_qualified_name: enterprise_db.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_a
      - column_b
      - column_c
      # ... first 20 columns alphabetically

  - fully_qualified_name: enterprise_db.wide_table
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - column_d
      - column_e
      - column_f
      # ... next 20 columns alphabetically
    # ... continues for each partition

  # Non-partitioned table preserved as-is
  - fully_qualified_name: enterprise_db.small_table
    # ... original configuration
```

### Use Cases

* **Wide table validation**: Break tables with hundreds of columns into manageable chunks
* **Memory optimization**: Reduce memory footprint by validating fewer columns at a time
* **Parallel processing**: Enable concurrent validation of different column groups
* **Targeted validation**: Validate specific column groups independently
* **Performance tuning**: Optimize validation for tables with many LOB or complex columns

### Best Practices

1. **Determine optimal partition count:**

   * Consider the total number of columns in the table
   * For tables with 50+ columns, start with 3-5 partitions
   * For tables with 100+ columns, consider 5-10 partitions
2. **Column ordering:**

   * Columns are divided alphabetically
   * Related columns may end up in different partitions
3. **After partitioning:**

   * Review generated column lists
   * Verify all required columns are included
   * Test with a subset before full validation
4. **Combine with row partitioning:**

   * For very large, wide tables, consider using both row and column partitioning
   * First partition by columns, then apply row partitioning to each column partition if needed

---

## Teradata Connection Configuration

Teradata connections require specific configuration in the YAML file.

### Connection Example

```yaml
source_connection:
  mode: credentials
  host: "teradata.company.com"
  username: "teradata_user"
  password: "secure_password"
  database: "source_database"
```

### Connection Fields

**`mode`** (required)

* **Type:** String
* **Valid Values:** `credentials`
* **Description:** Connection mode for Teradata

**`host`** (required)

* **Type:** String
* **Description:** Teradata hostname or IP address
* **Examples:**

  + `"teradata.company.com"`
  + `"td-prod.internal.company.net"`
  + `"192.168.1.50"`

**`username`** (required)

* **Type:** String
* **Description:** Teradata authentication username
* **Example:** `"migration_admin"`

**`password`** (required)

* **Type:** String
* **Description:** Teradata authentication password
* **Security Note:** Consider using environment variables

**`database`** (required)

* **Type:** String
* **Description:** Teradata database name
* **Example:** `"production_database"`

### Teradata-Specific Global Configuration

**`target_database`** (required for Teradata)

* **Type:** String
* **Description:** Target database name in Snowflake for Teradata validations
* **Example:** `target_database: PROD_DB`
* **Note:** This is required in the global configuration section, not the connection section

### Connection Examples

**Production Connection:**

```yaml
source_connection:
  mode: credentials
  host: "td-prod.company.com"
  username: "prod_reader"
  password: "${TERADATA_PASSWORD}"  # From environment variable
  database: "production_db"

target_database: PROD_SNOWFLAKE_DB
```

**Development Connection:**

```yaml
source_connection:
  mode: credentials
  host: "td-dev.company.local"
  username: "dev_user"
  password: "dev_password"
  database: "dev_database"

target_database: DEV_SNOWFLAKE_DB
```

**Multi-Database Setup:**

```yaml
source_connection:
  mode: credentials
  host: "teradata.company.com"
  username: "migration_user"
  password: "secure_password"
  database: "primary_db"

target_database: ENTERPRISE_DATA_DB

database_mappings:
  primary_db: ENTERPRISE_DATA_DB
  secondary_db: ANALYTICS_DB
```

---

## Complete Teradata Examples

### Example 1: Basic Teradata Configuration

```yaml
# Global configuration
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./validation_results
max_threads: auto
target_database: PROD_DB

# Source connection
source_connection:
  mode: credentials
  host: teradata.company.com
  username: teradata_user
  password: teradata_password
  database: prod_db

# Target connection
target_connection:
  mode: default

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

# Schema mappings
schema_mappings:
  prod_db: PUBLIC

# Tables configuration
tables:
  - fully_qualified_name: prod_db.sales_data
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - transaction_id

  - fully_qualified_name: prod_db.customer_master
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - credit_card
```

### Example 2: Teradata Large-Scale Migration

```yaml
# Global configuration
source_platform: Teradata
target_platform: Snowflake
output_directory_path: /opt/validation/results
max_threads: 16
target_database: PROD_SNOWFLAKE

# Source connection
source_connection:
  mode: credentials
  host: td-prod.company.com
  username: migration_admin
  password: secure_password
  database: enterprise_db

# Target connection
target_connection:
  mode: name
  name: snowflake_enterprise

# Validation configuration
validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true
  max_failed_rows_number: 200
  exclude_metrics: false

# Comparison configuration
comparison_configuration:
  tolerance: 0.01

# Logging configuration
logging_configuration:
  level: INFO
  console_level: WARNING
  file_level: DEBUG

# Schema mappings
schema_mappings:
  enterprise_db: PUBLIC

# Tables configuration
tables:
  - fully_qualified_name: enterprise_db.fact_sales
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - sale_id
      - customer_id
      - product_id
      - sale_amount
      - sale_date
    index_column_list:
      - sale_id
    chunk_number: 50
    max_failed_rows_number: 500

  - fully_qualified_name: enterprise_db.dim_customer
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_notes
      - audit_fields
    where_clause: "status = 'ACTIVE'"
    target_where_clause: "status = 'ACTIVE'"
    column_mappings:
      cust_key: customer_key
      cust_name: customer_name
```

### Example 3: Teradata Multi-Schema Validation

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: /data/validation/multi_schema
max_threads: 24
target_database: MULTI_SCHEMA_DB

source_connection:
  mode: credentials
  host: teradata.company.com
  username: multi_schema_user
  password: password123
  database: main_db

target_connection:
  mode: default

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: false

comparison_configuration:
  tolerance: 0.005

schema_mappings:
  sales_schema: SALES
  finance_schema: FINANCE
  hr_schema: HUMAN_RESOURCES

tables:
  # Sales schema tables
  - fully_qualified_name: main_db.sales_schema.orders
    target_schema: SALES
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - order_id

  - fully_qualified_name: main_db.sales_schema.customers
    target_schema: SALES
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list:
      - customer_id

  # Finance schema tables
  - fully_qualified_name: main_db.finance_schema.transactions
    target_schema: FINANCE
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - internal_ref_number
    index_column_list:
      - transaction_id
    where_clause: "fiscal_year >= 2024"
    target_where_clause: "fiscal_year >= 2024"

  # HR schema tables
  - fully_qualified_name: main_db.hr_schema.employees
    target_schema: HUMAN_RESOURCES
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - ssn
      - salary
      - bank_account
    index_column_list:
      - employee_id
```

### Example 4: Teradata View Validation

Validate Teradata views alongside tables for comprehensive migration verification.

```yaml
source_platform: Teradata
target_platform: Snowflake
output_directory_path: ./teradata_view_validation
target_database: SNOWFLAKE_DW
max_threads: auto

source_connection:
  mode: credentials
  host: teradata.company.com
  username: td_validator
  password: TeradataPass123!
  database: DW_DB

target_connection:
  mode: name
  name: snowflake_dw

validation_configuration:
  schema_validation: true
  metrics_validation: true
  row_validation: true

schema_mappings:
  DW_DB: PUBLIC

# Tables to validate
tables:
  - fully_qualified_name: DW_DB.CUSTOMERS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

# Views to validate
views:
  # Basic view validation with sensitive column exclusion
  - fully_qualified_name: DW_DB.V_CUSTOMER_360
    target_name: V_CUSTOMER_360
    use_column_selection_as_exclude_list: true
    column_selection_list:
      - SSN
      - CREDIT_SCORE
    index_column_list: [CUSTOMER_ID]
    target_index_column_list: [CUSTOMER_ID]

  # View with specific columns
  - fully_qualified_name: DW_DB.V_SALES_DASHBOARD
    target_name: V_SALES_DASHBOARD
    use_column_selection_as_exclude_list: false
    column_selection_list:
      - REGION
      - QUARTER
      - TOTAL_SALES
      - ORDER_COUNT
      - AVG_ORDER_VALUE
    index_column_list: [REGION, QUARTER]
    target_index_column_list: [REGION, QUARTER]

  # View with filtering
  - fully_qualified_name: DW_DB.V_INVENTORY_STATUS
    target_name: V_INVENTORY_STATUS
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [INVENTORY_ID]
    target_index_column_list: [INVENTORY_ID]
    where_clause: "status = 'ACTIVE'"
    target_where_clause: "status = 'ACTIVE'"

  # View with different target name
  - fully_qualified_name: DW_DB.V_LEGACY_REPORT
    target_database: MODERN_DW
    target_schema: ANALYTICS
    target_name: V_MODERNIZED_REPORT
    use_column_selection_as_exclude_list: false
    column_selection_list: []
    index_column_list: [REPORT_ID]
    target_index_column_list: [REPORT_ID]
    column_mappings:
      OLD_COL: NEW_COL
```

**Note:** View validation creates temporary tables internally to materialize view data for comparison between Teradata and Snowflake.

---

## Troubleshooting Teradata Connections

### Issue: Connection Timeout

**Symptom:**

```sql
Connection timeout: Unable to connect to Teradata
```

**Solutions:**

1. Verify the host and network connectivity:

   ```bash
   telnet teradata.company.com 1025
   ```
2. Check firewall rules allow Teradata connections
3. Verify Teradata server is running
4. Test connection with Teradata SQL Assistant or other client tools

### Issue: Authentication Failed

**Symptom:**

```sql
Authentication failed for user 'username'
```

**Solutions:**

1. Verify credentials are correct
2. Check user has necessary permissions:

   ```sql
   -- Grant read permissions
   GRANT SELECT ON database_name TO migration_user;
   ```
3. Verify user account is not locked
4. Check password hasn’t expired

### Issue: Database Not Found

**Symptom:**

```sql
Database 'database_name' does not exist
```

**Solutions:**

1. Verify database name is correct (case-sensitive)
2. Check user has access to the database:

   ```sql
   DATABASE database_name;
   SHOW TABLES;
   ```
3. Ensure database exists and is accessible

### Issue: Target Database Configuration Missing

**Symptom:**

```sql
target_database configuration is required for Teradata validations
```

**Solution:**

Add `target_database` to global configuration:

```yaml
source_platform: Teradata
target_platform: Snowflake
target_database: TARGET_DB_NAME  # Add this line
```

### Issue: Schema Mapping Errors

**Symptom:**

```sql
Schema not found in target
```

**Solution:**

Add schema mappings in configuration:

```yaml
schema_mappings:
  source_schema: TARGET_SCHEMA
  prod_db: PUBLIC
```

---

## Best Practices for Teradata

### Configuration

1. **Always specify target_database:**

   ```yaml
   target_database: SNOWFLAKE_DB_NAME
   ```
2. **Use schema mappings:**

   ```yaml
   schema_mappings:
     teradata_schema: snowflake_schema
   ```
3. **Handle case sensitivity:**

   ```yaml
   tables:
     - fully_qualified_name: db.schema.TABLE_NAME
       is_case_sensitive: true
   ```

### Security

1. **Use environment variables for passwords:**

   ```yaml
   source_connection:
     password: "${TERADATA_PASSWORD}"
   ```
2. **Use read-only accounts:**

   ```sql
   CREATE USER migration_reader AS PASSWORD = secure_password;
   GRANT SELECT ON database_name TO migration_reader;
   ```
3. **Restrict column access for sensitive data:**

   ```yaml
   tables:
     - fully_qualified_name: sensitive_table
       use_column_selection_as_exclude_list: true
       column_selection_list:
         - ssn
         - credit_card
         - salary
   ```

### Performance

1. **Enable chunking for large tables:**

   ```yaml
   tables:
     - fully_qualified_name: large_table
       chunk_number: 100
   ```
2. **Use WHERE clauses to filter data:**

   ```yaml
   tables:
     - fully_qualified_name: transactions
       where_clause: "transaction_date >= DATE '2024-01-01'"
   ```
3. **Optimize thread count:**

   ```yaml
   max_threads: 16  # Adjust based on Teradata server capacity
   ```
4. **Exclude unnecessary metrics for very large tables:**

   ```yaml
   validation_configuration:
     exclude_metrics: true  # Excludes avg, sum, stddev, variance
   ```

### Data Quality

1. **Start with schema validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: false
     row_validation: false
   ```
2. **Progress to metrics validation:**

   ```yaml
   validation_configuration:
     schema_validation: true
     metrics_validation: true
     row_validation: false
   ```
3. **Enable row validation for critical tables:**

   ```yaml
   tables:
     - fully_qualified_name: critical_fact_table
       # Row validation will be performed

   validation_configuration:
     row_validation: true
   ```

---

## See Also

* [Main CLI Usage Guide](CLI_USAGE_GUIDE.md)
* [SQL Server Commands Reference](sqlserver_commands.md)
* [Redshift Commands Reference](redshift_commands.md)
* [Snowflake Commands Reference](snowflake_commands.md)
* [Configuration Examples](CONFIGURATION_EXAMPLES.md)
* [Quick Reference Guide](CLI_QUICK_REFERENCE.md)

---
title: Teradata to Snowflake Migration Guide
source: https://docs.snowflake.com/en/migrations/guides/teradata.md
section: Migrations
---

# Teradata to Snowflake Migration Guide

## Snowflake Migration Framework

A typical Teradata-to-Snowflake migration can be broken into five key steps:

1. **Planning and design** are often overlooked steps in the migration process. The main reason is that companies typically want to show progress quickly, even if they haven’t fully understood the scope of the project. That is why, this phase is critical to understand and prioritize the migration project.
2. **Environment and security** with a plan, a clear timeline, a RACI matrix, and buy-in from all stakeholders, it’s time to move into execution mode.
   Setting up the necessary environments and security measures to begin the migration is very important before starting the migration phase given that there are many moving parts, and will be more impactful for the migration project if all your setup is ready before moving forward.
3. **Database code conversion** process involves extracting code directly from the source systems’ database catalog, such as table definitions, views, stored procedures and functions. Once extracted, you migrate all this code to equivalent data definition languages (DDLs) in Snowflake. This step also includes migrating data manipulation language (DML) scripts, which may be used by business analysts to build reports or dashboards.
   All this code needs to be migrated and adjusted to work in Snowflake. The adjustments can range from simple changes, such as naming conventions and data type mappings, to more complex differences in syntax, platform semantics and other factors. To assist with this, Snowflake offers a powerful solution called SnowConvert AI, which automates much of the database code conversion process.
4. **Data migration** Data migration involves transferring data between different storage systems, formats, or computer systems. In the context of a Teradata to Snowflake migration, it specifically refers to moving data from your Teradata environment to your new Snowflake environment.
   There are two main types discussed in this guide:

* **Historical data migration:** Taking a snapshot of your Teradata data at a specific point in time and transferring it to Snowflake. This is often done as an initial, bulk transfer.
* **Incremental data migration:** Moving new or changed data from Teradata to Snowflake on an ongoing basis after the initial historical migration. This ensures that your Snowflake environment stays up-to-date with your source systems.

5. **Data ingestion:** After migrating the historical data, the next step is migrating the data ingestion process, bringing in live data from various sources. Typically, this process follows an extract, transform, load (ETL) or extract, load, transform (ELT) model, depending on when and where the data transformation occurs before it becomes available to business users.
6. **Reporting and analytics,** now that the database has both historical data and live pipelines continuously importing new data, the next step is to extract value from this data through BI. Reporting can be done using standard BI tools or custom queries. In both cases, the SQL sent to the database may need to be adjusted to meet Snowflake’s requirements. These adjustments can range from simple name changes (common during migration) to syntax and more complex semantic differences. All these need to be identified and addressed.
7. **Data validation and testing:** The goal is to have the data as clean as possible before entering this phase.
   Every organization has its own testing methodologies and requirements for moving data into production. These must be fully understood from the start of the project.
8. **Deployment.** At this stage, the data is validated, an equivalent system is set up, all the ETLs have been migrated, and reports have been verified. Are you ready to go live?
   Not so fast — there are still a few critical considerations before final promotion to production. First, your legacy application may consist of multiple components or services. Ideally, you should migrate these applications one by one (although parallel migration is possible) and promote them to production in the same order. During this process, ensure your bridging strategy is in place so business users don’t have to query both Snowflake and the legacy system. Data synchronization for applications that haven’t been migrated yet should happen behind the scenes through the bridging mechanism. If this isn’t done, business users will have to work in a hybrid environment, and they must understand the implications of this setup.
9. **Optimize and run** once a system has been migrated to Snowflake, it enters normal maintenance mode. All software systems are living organisms requiring ongoing maintenance. This phase, after migration, is referred to as optimize and run, and it is not part of the migration itself.

---

## Migration Phases

### Phase 1: Planning and design

This phase is the crucial first step in a successful Snowflake migration. It lays the groundwork for the entire migration process by defining the scope, objectives, and requirements. This phase involves a deep understanding of the current environment and a clear vision for the future state in Snowflake.

During this phase, organizations identify the key business drivers and technical objectives for migrating to Snowflake by executing the following tasks:

#### Conduct a Thorough Assessment of your Teradata Environment

To conduct a thorough assessment of the current environment, it is crucial to start by **inventorying existing data assets**. This involves documenting not only databases and files but also any external systems, while carefully noting data types, schemas, and any prevalent data quality issues. Simultaneously, **analyzing query workloads** is essential to pinpoint frequently executed and resource-intensive queries, which will shed light on data access patterns and user behavior. Lastly, **assessing security and compliance requirements** is non-negotiable, requiring the identification of sensitive data, regulatory obligations, and potential vulnerabilities within the existing system.

### Phase 2: Environment and security

One of the first steps we recommend is setting up the necessary environments and security measures to begin the migration. There are many moving parts, so let’s start with security. As with any cloud platform, Snowflake operates under a shared security model between the platform and administrators.

####

#### Setting Up Environments

First, you need to decide how many accounts you will need. In legacy platforms, you typically had database instances, but in Snowflake, the setup revolves around accounts. At a minimum, you should set up a production environment and a development environment. Depending on your testing strategy, you may also need additional environments for different stages of testing.

#### Security Measures

Once the environments are set up, it’s crucial to implement the right security measures. Start with the network policy to ensure that only authorized users within your VPN can access the Snowflake system.

Snowflake’s user access control is role-based, so administrators must define roles according to the business needs. Once the roles are defined, create the user accounts and enforce Multi-Factor Authentication (MFA) and/or Single Sign-On (SSO) for all users. Additionally, you’ll need to set up service accounts and ensure that you’re not relying on traditional username/password authentication for these accounts.

#### Roles During Migration

During the migration, you’ll also need to define specific roles for the users executing the migration itself. Although the roles for non-production environments may differ, remember that during migration, you will be dealing with real data. Don’t skimp on security, even for non-production environments.

In development, the migration team will generally have more freedom when deploying changes to the structure or code. These are active development environments, and you don’t want to block the migration team with excessive security restrictions. However, it’s still important to maintain a robust security model, even in non-production environments.

#### Rethinking the Access Model

Since the security model in Snowflake differs from that of many legacy platforms, this migration is a good opportunity to rethink your access model. Clean up the hierarchy of users who need access to your system and ensure that only the necessary users have access to specific resources.

#### Coordinating with Finance

Snowflake uses a consumption-based pricing model, meaning costs are tied to usage. As you define roles, it’s a good idea to coordinate with your finance team to track which departments are using Snowflake and how. Snowflake also allows you to tag database objects, which can be used to track ownership at the business level, helping you align usage with departmental cost allocation.

Security and environment setup are complex tasks, and they need to be planned upfront. You may even need to consider a redesign of your access model to ensure the new platform is manageable in the long run. Taking the time to set this up correctly will lay a strong foundation for a secure and efficient migration to Snowflake.

### Phase 3: Database code conversion

SnowConvert AI understands the Teradata source code and converts the Data Definition Language (DDL), Data Manipulation Language (DML), and functions in the source code to the corresponding SQL in the target: Snowflake. SnowConvert AI can migrate the source code in any of these three extensions .sql, .dml, ddl

This phase involves extracting code directly from the source systems’ database catalog, such as table definitions, views, stored procedures, and functions. Once extracted, you migrate all of this code to equivalent DDLs (Data Definition Language) in Snowflake. This step also includes migrating DML (Data Manipulation Language) scripts, which may be used by business analysts to build reports or dashboards.

Please review our recommended extraction scripts [here](https://docs.snowconvert.com/sc/general/getting-started/code-extraction/teradata)

Teradata DDL typically includes references to **primary indexes**, **fallback**, or **partitioning**. In Snowflake, these structures do not exist in the same way:

* [**Use SnowConvert AI for Teradata**](https://www.snowflake.com/en/migrate-to-the-cloud/snowconvert/) that significantly streamlines the Data Definition Language (DDL) conversion process, especially when dealing with numerous tables. It automates the translation of Teradata’s specific DDL constructs, such as primary index definitions and fallback options, into Snowflake’s equivalent structures. This automation reduces manual effort and minimizes the risk of errors, allowing teams to focus on higher-level migration strategy and validation.
  Beyond basic DDL conversion, SnowConvert AI also addresses nuances like data type mapping and schema reorganization. It can automatically adjust data types to align with Snowflake’s offerings and facilitate decisions on whether to consolidate or break down schemas for optimal performance and manageability. This comprehensive approach ensures that the migrated database structure is not only functional but also optimized for Snowflake’s architecture.
* Adjust data types where needed or use the Migrations AI assistant to fix any Error or Warning (EWI).
* Decide whether to reorganize schemas (e.g., breaking large monolithic schemas into multiple Snowflake databases).

#### Teradata Migration Considerations

When migrating data from Teradata to Snowflake, it is crucial to consider the functional differences between the databases.

##### Session Modes in Teradata

The Teradata database has different modes for running queries: ANSI Mode (rules based on the ANSI SQL: 2011 specifications) and TERA mode (rules defined by Teradata). Please review the following [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Request-and-Transaction-Processing/Transaction-Processing/Transaction-Semantics-Differences-in-ANSI-and-Teradata-Session-Modes) for more information.

**Teradata mode for strings informative table**

For strings, the Teradata Mode works differently. As it is explained in the following table based on the [Teradata documentation](https://docs.teradata.com/r/Enterprise_IntelliFlex_VMware/SQL-Request-and-Transaction-Processing/Transaction-Processing/Comparison-of-Transactions-in-ANSI-and-Teradata-Session-Modes):

| Feature | ANSI mode | Teradata mode |
| --- | --- | --- |
| Default attribute for character comparisons | CASESPECIFIC | NOT CASESPECIFIC |
| Default TRIM behavior | TRIM(BOTH FROM) | TRIM(BOTH FROM) |

**Translation specification summary**

| Mode | Column constraint values | Teradata behavior | SC expected behavior |
| --- | --- | --- | --- |
| ANSI Mode | CASESPECIFIC | CASESPECIFIC | No constraint added. |
|  | NOT CASESPECIFIC | CASESPECIFIC | Add COLLATE ‘en-cs’ in column definition. |
| Teradata Mode | CASESPECIFIC | CASESPECIFIC | In most cases, do not add COLLATE, and convert its usages of string comparison to RTRIM( expression ) |
|  | NOT CASESPECIFIC | NOT CASESPECIFIC | In most cases, do not add COLLATE, and convert its usages of string comparison to RTRIM(UPPER( expression )) |

**Available translation specification options**

* [TERA Mode For Strings Comparison - NO COLLATE](https://docs.snowconvert.com/sc/translation-references/teradata/session-modes-in-teradata/tera-mode-for-strings-comparison-no-collate)
* [TERA Mode For Strings Comparison - COLLATE](https://docs.snowconvert.com/sc/translation-references/teradata/session-modes-in-teradata/tera-mode-for-strings-comparison-collate)
* [ANSI Mode For Strings Comparison - NO COLLATE](https://docs.snowconvert.com/sc/translation-references/teradata/session-modes-in-teradata/ansi-mode-for-strings-comparison-no-collate)
* [ANSI Mode For Strings Comparison - COLLATE](https://docs.snowconvert.com/sc/translation-references/teradata/session-modes-in-teradata/ansi-mode-for-strings-comparison-collate)

##### SQL Translation Reference

Use this as a guide to understand how the transformed code might look when migrating from Teradata to Snowflake. SQL has a similar syntax between dialects, but each dialect can extend or add new functionalities.

For this reason, when running SQL in one environment (such as Teradata) vs. another (such as Snowflake),
there are many statements that require transformation or even removal. These transformations are done
by SnowConvert AI.

Browse through the following pages to find more information about specific topics.

* [Data Types](https://docs.snowconvert.com/sc/translation-references/teradata/sql-translation-reference/data-types), compare Teradata data types and their equivalents in Snowflake.
* [DDL](https://docs.snowconvert.com/sc/translation-references/teradata/sql-translation-reference/ddl), explore the translation of the Data Definition Language.
* [DML](https://docs.snowconvert.com/sc/translation-references/teradata/sql-translation-reference/dml), explore the translation of the Data Manipulation Language.
* [Built-in Functions](https://docs.snowconvert.com/sc/translation-references/teradata/sql-translation-reference/built-in-functions), compare functions included in the runtime of both languages.

##### SQL to JavaScript (Procedures)

##### Scripts to Snowflake SQL Translation Reference

Translation reference to convert Teradata scripts files to Snowflake SQL

* [Common Statements](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/common-statements)
* [BTEQ](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/bteq)
* [MLOAD](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-to-snowflake-sql-translation-reference/mload)

##### Scripts To Python Translation Reference

This section details how SnowConvert AI translates the Teradata Scripts (BTEQ, FastLoad, MultiLoad, TPUMP, etc.) into a scripting language compatible with Snowflake.

Browse through the following pages to find more information about specific topics.

* [BTEQ](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-translation-reference/bteq-translation), explore the translation reference for Basic Teradata Query syntax.
* [FastLoad](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-translation-reference/fastload-translations), explore the translation reference for FastLoad syntax.
* [MultiLoad](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-translation-reference/multiload-translation), explore the translation reference for MultiLoad syntax.
* [TPT](https://docs.snowconvert.com/sc/translation-references/teradata/scripts-translation-reference/tpt-translation), explore the translation reference for TPT syntax.

### Phase 4: Data Migration

First, it’s important to differentiate between historical data migration and new data addition. Historical data migration refers to taking a snapshot of the data at a specific point in time and transferring it to Snowflake. Our recommendation is to first perform an exact copy of the data without any transformation into Snowflake. This initial copy will put some load on the legacy platform, so you’ll want to do it only once and store it in Snowflake.

Your Actionable Steps:

* **Perform Historical Data Migration**: Take a snapshot of your Teradata data at a specific point in time and transfer it to Snowflake, often as an initial bulk transfer. The recommendation is to perform an exact copy without transformation initially.
* **Plan Incremental Data Migration**: After the initial historical migration, set up processes to move new or changed data from Teradata to Snowflake on an ongoing basis to keep your Snowflake environment up-to-date.

### Phase 5: Data Ingestion

Pipeline migration to Snowflake involves moving or rewriting Teradata-based logic, such as BTEQ scripts, stored procedures, macros, or specialized ETL flows. This includes an Orchestration Transition, which replaces BTEQ or scheduled Teradata jobs with Streams and Tasks inside Snowflake for incremental transformations. It also requires Source/Sink Realignment, which redirects multiple inbound data sources landing in Teradata to Snowflake ingestion patterns (COPY, Snowpipe).

During the Query Conversion and Optimization stage, Teradata SQL is converted to Snowflake SQL, which may include replacing macros with stored procedures or views, rewriting QUALIFY logic, and adjusting stored procedures and join indexes. SnowConvert AI for Teradata can automate much of this translation.

With the data itself in Snowflake, you now shift to **migrating or rewriting Teradata-based logic**—BTEQ scripts, stored procedures, macros, or specialized ETL flows.

#### Orchestration Transition

1. **Native Snowflake**: Replace BTEQ or scheduled Teradata jobs with **Streams and Tasks** inside Snowflake for incremental transformations.
2. **External Orchestrators**: If you used third-party schedulers (Airflow, Control-M, etc.), point them to Snowflake and rewrite any embedded Teradata SQL.

#### Source/Sink Realignment

* If you had multiple inbound data sources landing in Teradata, redirect them to Snowflake ingestion patterns (COPY, Snowpipe).
* If downstream systems read from Teradata, plan to repoint them to Snowflake once the pipeline has stabilized.

**SnowConvert AI for Teradata** is recommended for automated translation. It can handle macros, stored procedures, and BTEQ scripts, outputting Snowflake-compatible code.

### Phase 6: Reporting and analytics

Now that we have a database with both historical data and live pipelines continuously importing new data, the next step is to extract value from this data through **Business Intelligence** (BI). Reporting can be done using standard BI tools or custom queries. In both cases, the SQL sent to the database may need to be adjusted to meet Snowflake’s requirements. These adjustments can range from simple name changes (which are common during migration) to syntax differences and more complex semantic differences. All of these need to be identified and addressed.

As with the ingestion process, it’s crucial to review all legacy platform usage and incorporate those findings into the migration plan. There are generally two types of reports to consider: IT-owned reports and business-owned reports. It’s usually easier to track down IT-owned reports, but business-owned reports and complex queries created by business users require a different approach.

Business users are a key stakeholder in the migration process and should be included in the **RACI matrix** during the planning phase. They need to be trained on how Snowflake operates and should clearly understand the platform differences. This will enable them to modify their custom queries and reports as needed. We typically recommend a parallel training track for business users, followed by **office hours** with migration experts who can help address platform differences and guide users through the adjustments they need to make.

Business users are ultimately the ones who “accept” the migration. You might have completed the technical migration from an IT perspective, but if business users aren’t involved, they may still rely on thousands of reports that are crucial for running the business. If these reports are not updated to work with Snowflake, the business cannot fully transition away from the legacy platform.

Teradata SQL has some constructs not in Snowflake, and vice versa. Key differences include:

* **Macros**: Not supported in Snowflake; typically replaced by stored procedures or views.
* **QUALIFY**: Snowflake does not support `QUALIFY` directly; rewrite logic using a subquery or an outer SELECT.
* **Stored Procedures**: Teradata SP vs. Snowflake SP (SQL or JavaScript-based). The procedural language differs.
* **Join Indexes**: Have no direct equivalent; rely on micro-partition pruning and clustering keys.
* **COLLECT STATISTICS**: Teradata uses explicit stats, while Snowflake does this automatically.

**SnowConvert AI for Teradata** is recommended for automated translation. It can handle macros, stored procedures, and BTEQ scripts, outputting Snowflake-compatible code.

### Phase 7: Data validation and testing

This brings us to data validation and testing, two often underestimated steps in the migration planning process. Of course, the goal is to have the data as clean as possible before entering this phase.

Every organization has its own testing methodologies and requirements for moving data into production. These must be fully understood from the start of the project. So, what are some useful strategies for data validation?

* **Conduct Comprehensive Testing in Snowflake Migration:** During the Snowflake migration process, comprehensive testing must be conducted, including:

  1. Functional testing: To verify that all migrated applications and functionalities work as expected within the new environment, ensuring data integrity and accuracy.
  2. Performance testing: To evaluate query performance, data loading speed, and overall system responsiveness, which helps identify and address any performance bottlenecks.
  3. User acceptance testing (UAT): To involve end-users in the testing process to ensure that the migrated system meets their requirements and gather feedback for potential improvements.
* **Provide Training and Documentation for Snowflake Migration:**

  + Provide comprehensive training to end-users on Snowflake’s features, functionalities, and best practices, covering topics like data access, query optimization, and security.
  + Create comprehensive documentation, including system architecture diagrams, data flow diagrams, operational procedures, user guides, troubleshooting guides, and FAQs for easy reference.

### Phase 8: Deployment

When you’re finally ready for the cutover, ensure that all stakeholders are aligned and understand that from this point forward, **Snowflake** will be the system of record, not the legacy platform. You’ll need final and formal sign-offs from all stakeholders before proceeding. Any reports that were not migrated are now the responsibility of the business users. This is why it’s crucial not to involve users at the last minute—**they should be part of the process from the start** and should be aware of the migration timeline.

Additionally, verify that all permissions have been properly granted. For example, if you are using Active Directory-based roles, ensure these are created and configured in Snowflake.

A few additional scenarios are typically left to the end, but they shouldn’t be overlooked:

* **Surrogate keys**: If you are using surrogate keys, be aware that their lifecycle may differ between the legacy and Snowflake systems. These keys need to be synchronized during the cutover.
* **Cutover timing**: Depending on your industry, there may be more or less favorable times during the year for performing a cutover. Consider the timing carefully.
* **Legacy platform licensing**: Don’t forget that you may face hard deadlines related to the licensing of the legacy platform. Be sure to plan your cutover around any such deadlines.

### Phase 9: Optimize and run

Once a system has been migrated to Snowflake, it enters normal maintenance mode. All software systems are living organisms that require ongoing maintenance. We refer to this phase after migration as Optimize and Run, and we emphasize that it is not part of the migration itself.

Optimization and continuous improvement are ongoing processes that happen after migration. At this point, your team takes full ownership of the system in Snowflake. The system will continue to evolve, and optimization will be driven by usage patterns.

In general, we find that jobs in Snowflake tend to run faster than on the original platforms. If performance doesn’t meet expectations, you may need to run some optimizations to fully leverage Snowflake’s unique architecture. Snowflake provides various query analysis tools that can help identify bottlenecks, enabling you to optimize specific parts of the workflow.

During the optimization phase, you may need to revisit different aspects of the system. The advantage is that you are already benefiting from Snowflake’s capabilities, and optimization tasks will become part of your regular maintenance routine.

As a recommendation, you should focus on addressing only critical performance issues during the migration phase. Optimization is best treated as a post-migration effort.

---

## Need Migration Assistance?

For complex migration scenarios, addressing specific functional differences, or general assistance, Snowflake provides dedicated support channels, such as `snowconvert-support@snowflake.com`. Furthermore, leveraging Snowflake’s extensive migration resources, including master classes, webinars, and detailed reference guides specifically for Teradata migrations, can substantially enhance the likelihood of migration success.

A successful data platform migration from Teradata is not solely dependent on the conversion tool itself. Instead, it relies on a holistic strategy that integrates the efficiency of automation (provided by SnowConvert AI), the critical judgment and problem-solving capabilities of human experts (such as data architects), and the comprehensive support and resources offered by the target platform’s ecosystem (including Snowflake’s documentation, support services, and best practices). This implies that organizations should strategically invest not only in the migration tool but also in upskilling their teams in Snowflake-native capabilities and establishing robust validation processes. The ultimate goal is not merely to *move* the data, but to *modernize* the entire data operation, leading to a more resilient, performant, and future-ready cloud data platform.

---

## Appendix

### Appendix 1: Teradata databases to exclude when migrating to Snowflake

The following list of databases are needed for Teradata only and shouldn’t be migrated to Snowflake:

| DBC Crashdumps Dbcmngr External_AP EXTUSER LockLogShredder QCD SQLJ Sys_Calendar | SysAdmin SYSBAR SYSJDBC SYSLIB SYSSPATIAL SystemFE SYSUDTLIB SYSUIF TD_SERVER_DB | TD_SYSFNLIB TD_SYSGPL TD_SYSXML TDPUSER TDQCD TDStats tdwm |
| --- | --- | --- |

### Appendix 2: Teradata types to Snowflake data types

| Teradata Column Type | Teradata Data Type | Snowflake Data Type |
| --- | --- | --- |
| ++ | TD_ANYTYPE | TD_ANYTYPE data type isn’t supported in Snowflake. |
| A1 | ARRAY | ARRAY |
| AN | ARRAY | ARRAY |
| AT | TIME | TIME |
| BF | BYTE | BINARY |
| BO | BLOB | BLOB data type isn’t directly supported but can be replaced with BINARY (limited to 8MB). |
| BV | VARBYTE | BINARY |
| CF | CHAR | VARCHAR |
| CO | CLOB | CLOB data type isn’t directly supported but can be replaced with VARCHAR (limited to 16MB). |
| CV | VARCHAR | VARCHAR |
| D | DECIMAL | NUMBER |
| DA | DATE | DATE |
| DH | INTERVAL DAY TO HOUR | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD) |
| DM | INTERVAL DAY TO MINUTE | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD) |
| DS | INTERVAL DAY TO SECOND | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| DT | DATASET | DATASET data type isn’t supported in Snowflake. |
| DY | INTERVAL DAY | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| F | FLOAT | FLOAT |
| HM | INTERVAL HOUR TO MINUTE | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| HR | INTERVAL HOUR | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| HS | INTERVAL HOUR TO SECOND | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| I1 | BYTEINT | NUMBER |
| I2 | SMALLINT | NUMBER |
| I8 | BIGINT | NUMBER |
| I | INTEGER | NUMBER |
| JN | JSON | VARIANT |
| LF | CHAR | This data type is in DBC only and can’t be converted to Snowflake. |
| LV | VARCHAR | This data type is in DBC only and can’t be converted to Snowflake. |
| MI | INTERVAL MINUTE | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| MO | INTERVAL MONTH | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD) |
| MS | INTERVAL MINUTE TO SECOND | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| N | NUMBER | NUMBER |
| PD | PERIOD(DATE) | Can be converted to VARCHAR or split into 2 separate dates. |
| PM | PERIOD(TIMESTAMP WITH TIME ZONE) | Can be converted to VARCHAR or split into 2 separate timestamps (TIMESTAMP_TZ). |
| PS | PERIOD(TIMESTAMP) | Can be converted to VARCHAR or split into 2 separate timestamps (TIMESTAMP_NTZ). |
| PT | PERIOD(TIME) | Can be converted to VARCHAR or split into 2 separate times. |
| PZ | PERIOD(TIME WITH TIME ZONE) | Can be converted to VARCHAR or split into 2 separate times but WITH TIME ZONE isn’t supported for TIME. |
| SC | INTERVAL SECOND I | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |
| SZ | TIMESTAMP WITH TIME ZONE | TIMESTAMP_TZ |
| TS | TIMESTAMP | TIMESTAMP_NTZ |
| TZ | TIME WITH TIME ZONE | TIME WITH TIME ZONE isn’t supported because TIME is stored using “wall clock” time only without a time zone offset. |
| UF | CHAR | This data type is in DBC only and can’t be converted to Snowflake. |
| UT | UDT | UDT data type isn’t supported in Snowflake |
| UV | VARCHAR | This data type is in DBC only and can’t be converted to Snowflake |
| XM | XML | VARIANT |
| YM | INTERVAL YEAR TO MONTH | INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD) |
| YR | INTERVAL YEAR | YR INTERVAL YEAR INTERVAL data types aren’t supported in Snowflake but date calculations can be done with the date comparison functions (e.g. DATEDIFF and DATEADD). |

---
title: Using SnowConvert AI
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/README.md
section: Migrations
---

# Using SnowConvert AI

Use this guide to learn how SnowConvert AI can accelerate your migration to Snowflake:

* [How to install the tool.](how-to-install-the-tool/README.md)
* [How to update the tool.](how-to-update-the-tool.md)
* [What is a SnowConvert AI Project?](what-is-a-snowconvert-project.md)
* [Conversion.](../../getting-started/running-snowconvert/conversion/README.md)

---
title: Using the SMA AI assistant
source: https://docs.snowflake.com/en/migrations/sma-docs/user-guide/chatbot.md
section: Migrations
---

# Using the SMA AI assistant

You can use the SMA AI assistant to analyze and answer questions based exclusively on assessment data and documentation produced during the SMA
migration assessment process.

## How the assistant works

The assistant works on top of the assessment reports generated by the SMA (Snowflake Migration Accelerator). The process is divided into
four main phases:

* Activation
* Processing questions and generating SQL
* Retrieving answers
* Formatting and delivering response

### Phase 1: Activation and data ingestion

1. Activate the assistant.

   The assistant is launched directly from the SMA application. Upon launch, you’ll be prompted to enter your Snowflake account credentials.
2. Configure authentication.

   Select the appropriate authentication method for connecting to the Snowflake account. The assistant supports multiple authentication
   types to accommodate different security requirements.
3. Test the connection.

   Before proceeding, test the connection to ensure proper access to the Snowflake account. A successful connection test is required to
   activate the assistant, which then creates the necessary infrastructure (tables, schemas, etc.) in your Snowflake environment.
4. Provision infrastructure.

   Once the connection is validated, the assistant automatically provisions a dedicated table structure within your Snowflake
   account for storing and managing documentation.
5. Ingest assessment data.

   > Upload all relevant assessment reports generated by the SMA. The assistant accepts both .csv and .docx files. Each uploaded file is
   > mapped to a separate table in Snowflake, maintaining a one-to-one relationship between source files and database tables.

### Phase 2: Question processing and SQL generation (RAG)

Once the infrastructure is ready, you can begin asking questions through the chat interface. Each question triggers a sophisticated retrieval-augmented generation (RAG) process powered by Snowflake Cortex.

#### LLM selection

The assistant interface allows you to select from different Large Language Models (LLMs) to power the question-answering process. The available LLM options vary depending on the **region where your Snowflake account is hosted**, as Snowflake Cortex model availability differs across geographic regions.

> **Note:**
>
> The list of available models is dynamically populated based on your Snowflake account’s region. Some advanced models might
> only be available in specific regions due to Snowflake’s regional deployment of Snowflake Cortex AI services.

1. Semantic documentation search

   The assistant performs a semantic search against a vectorized documentation table in the Snowflake account to identify
   most relevant documents and data sources that can potentially answer your query.
2. Context-aware SQL generation

   Using the identified documentation as context, Snowflake Cortex generates a targeted SQL query designed to extract the precise
   information needed to answer your question.
3. SQL validation (self-correction loop 1)

   The generated SQL query undergoes validation by passing it back to Snowflake Cortex along with the original question to ensure the query
   logically produces an answer that addresses the user’s intent.

### Phase 3: Answer retrieval and refinement

The answer retrieval process adapts based on the SQL validation results, ensuring the most accurate response possible.

The following scenarios describe what might happen.

#### Scenario A: Valid SQL query

* The validated SQL query is executed against the assessment data tables.
* The query results are extracted and prepared as the preliminary answer.

#### Scenario B: Invalid or insufficient SQL query

* When SQL generation is not viable, the assistant falls back to the summary .docx file as context.
* Snowflake Cortex generates a preliminary answer based on the documentation content rather than structured data.

#### Final answer verification (self-correction loop 2)

Before presenting the answer, the assistant performs a final quality check:

* The preliminary answer is evaluated by Snowflake Cortex to verify it logically addresses the original question.

  + **If validated**: The answer is accepted and moves to formatting.
  + **If not validated**: The question is posed directly to Snowflake Cortex as a general inquiry (without specific documentation context), and this
    response becomes the final answer.

### Phase 4: Response formatting and delivery

#### Natural language formatting

The final answer is processed one last time by Snowflake Cortex to transform it into a clear, conversational response that’s easy to understand and
properly formatted for the chat interface.

The formatted response is then displayed in the chat window, complete with proper structure, context, and any relevant details extracted from the assessment data.

## Assessment report files and artifacts

The following files are uploaded to your account and serve as the data sources for the assistant:

| File Type | File Name | Purpose |
| --- | --- | --- |
| **CSV** | `DbxElementsInventory.csv` | Lists Databricks (DBX) elements found inside notebooks. |
| **CSV** | `ExecutionFlowInventory.csv` | Lists the relations between different workload scopes based on function calls. |
| **CSV** | `Checkpoints.csv` | Lists generated checkpoints for the user workload. |
| **CSV** | `DataFramesInventory.csv` | Lists the DataFrames assignments found for generating checkpoints. |
| **CSV** | `ArtifactDependencyInventory.csv` | Lists the artifact dependencies of each file analyzed by the SMA. |
| **CSV** | `Files.csv` | Inventory of each file’s type and size present in that execution. |
| **CSV** | `ImportUsagesInventory.csv` | Lists all referenced import calls in the codebase. |
| **CSV** | `InputFilesInventory.csv` | Lists every file by filetype and size. |
| **CSV** | `IOFilesInventory.csv` | Lists all external elements being read from or written to. |
| **CSV** | `Issues.csv` | Lists every conversion issue found, including description and location. |
| **CSV** | `JoinsInventory.csv` | Inventory of all DataFrame joins done in that codebase. |
| **CSV** | `NotebookCellsInventory.csv` | Inventory of all cells in a notebook. |
| **CSV** | `NotebookSizeInventory.csv` | Lists the size in lines of code of different source languages in notebook files. |
| **CSV** | `PandasUsagesInventory.csv` | Lists every reference to the Pandas API (Python Only). |
| **CSV** | `SparkUsagesInventory.csv` | Shows the exact location and usage for each reference to the Spark API. |
| **CSV** | `SqlStatementsInventory.csv` | Count of SQL keywords present in SQL Spark elements. |
| **CSV** | `SQLElementsInventory.csv` | Count of SQL elements present in SQL Spark elements. |
| **CSV** | `SqlEmbeddedUsageInventory.csv` | Count of embedded SQL present in SQL Spark elements. |
| **CSV** | `ThirdPartyUsagesInventory.csv` | Lists the third-party references in the codebase. |

## Database schema reference

This section describes the database tables created in your Snowflake account when using the assistant. These tables
store migration assessment data, code inventories, and metadata used by the assistant to provide context-aware responses.

### Schema overview

The assistant system creates tables to store:

* **Migration assessment data**: Results from code analysis including dependencies, issues, and usage patterns
* **Code inventories**: Detailed tracking of various code elements (imports, functions, DataFrames, etc.)
* **Documentation metadata**: Vector embeddings for semantic search capabilities
* **Execution tracking**: Records of tool executions and their results

Most tables include an `EXECUTIONID` column to associate records with specific migration assessment runs.

### Table reference

#### DOCUMENTATION_METADATA

Stores documentation text with vector embeddings for semantic search capabilities. Used by the assistant to find relevant context based on user questions using vector similarity.

| Column | Type | Description |
| --- | --- | --- |
| TABLE_NAME | VARCHAR | Name of the table the documentation describes |
| DOCUMENTATION_TEXT | VARCHAR | The documentation text content |
| EMBEDDING | VECTOR(FLOAT, 768) | Vector embedding for semantic search |

#### ARTIFACTDEPENDENCYINVENTORIES

Lists the artifact dependencies of each file analyzed by the SMA. This inventory allows the user to determine which artifacts are needed for the file to work properly in Snowflake.

The following are considered artifacts: a third-party library, SQL entity, source of a read or write operation, and another source code file in the workload.

| Column | Type | Description |
| --- | --- | --- |
| EXECUTIONID | VARCHAR(16777216) | The identifier of the execution |
| FILEID | VARCHAR(16777216) | The identifier of the source code file |
| DEPENDENCY | VARCHAR(16777216) | The artifact dependency that the current file has |
| TYPE | VARCHAR(16777216) | The type of the artifact dependency |
| SUCCESS | BOOLEAN | If the artifact needs any intervention, it shows FALSE; otherwise, it shows TRUE |
| STATUSDETAIL | VARCHAR(16777216) | The status of the artifact dependency, based on the type |
| ARGUMENTS | VARCHAR(16777216) | Extra data of the artifact dependency, based on the type |
| LOCATION | VARCHAR(16777216) | The collection of cell ID and line number where the artifact dependency is being used in the source code file |
| INDIRECTDEPENDENCIES | VARCHAR(16777216) | A list of other files that this file relies on, even if not directly |
| TOTALINDIRECTDEPENDENCIES | NUMBER(38,0) | The total count of these indirect dependencies |
| DIRECTPARENTS | VARCHAR(16777216) | A list of files that directly use this file |
| TOTALDIRECTPARENTS | NUMBER(38,0) | The total count of these direct parent files |
| INDIRECTPARENTS | VARCHAR(16777216) | A list of files that use this file indirectly (through other files) |
| TOTALINDIRECTPARENTS | NUMBER(38,0) | The total count of these indirect parent files |

#### CHECKPOINTSINVENTORIES

Lists the generated checkpoints for the user workload. These checkpoints are completely capable of being used in the Checkpoints Feature from the Snowflake Extension.

| Column | Type | Description |
| --- | --- | --- |
| NAME | VARCHAR(16777216) | The checkpoint name (using the format described before) |
| FILEID | VARCHAR(16777216) | The relative path of the file (starting from the input folder the user chose in the SMA tool) |
| CELLID | NUMBER(38,0) | The number of cell where the DataFrame operation was found inside a notebook file |
| LINE | NUMBER(38,0) | Line number where the DataFrame operation was found |
| COLUMNLINE | NUMBER(38,0) | The column number where the DataFrame operation was found |
| TYPECHECKPOINT | VARCHAR(16777216) | The use case of the checkpoints (Collection or Validation) |
| DATAFRAMENAME | VARCHAR(16777216) | The name of the DataFrame |
| LOCATIONASSIGNMENT | NUMBER(38,0) | The assignment number of the DataFrame name |
| ENABLED | BOOLEAN | Indicates whether the checkpoint is enabled (True or False) |
| MODENUMBER | VARCHAR(16777216) | The mode number of the collection (Schema [1] or DataFrame [2]) |
| SAMPLEDATAFRAME | NUMBER(38,0) | The sample of the DataFrame |
| ENTRYPOINT | VARCHAR(16777216) | The entry point that guides the flow to execute the checkpoint |
| EXECUTIONID | VARCHAR(16777216) | The execution ID |

#### DATAFRAMESINVENTORIES

Lists the DataFrame assignments to use when generating checkpoints for the user workload

| Column | Type | Description |
| --- | --- | --- |
| FULLNAME | VARCHAR(16777216) | The full name of the DataFrame |
| NAME | VARCHAR(16777216) | The simple name of the variable of the DataFrame |
| FILEID | VARCHAR(16777216) | The relative path of the file (starting from the input folder the user chose in the SMA tool) |
| CELLID | NUMBER(38,0) | The number of cells where the DataFrame operation was found inside a notebook file |
| LINE | NUMBER(38,0) | The line number where the DataFrame operation was found |
| COLUMNLINE | NUMBER(38,0) | The column number where the DataFrame operation was found |
| ASSIGNMENTNUMBER | NUMBER(38,0) | The number of assignments for this particular identifier (not symbol) in the file |
| RELEVANTFUNCTION | VARCHAR(16777216) | The relevant function why this was collected |
| RELATEDDATAFRAMES | VARCHAR(16777216) | The fully-qualified name of the DataFrame(s) involved in the operation (separated by semicolons) |
| ENTRYPOINTS | VARCHAR(16777216) | Empty for this phase |
| EXECUTIONID | VARCHAR(16777216) | The execution ID |

#### DBXELEMENTSINVENTORIES

Lists the DBX (Databricks) elements found inside notebooks.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The DBX element name |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the element was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| CATEGORY | VARCHAR(16777216) | The element category |
| KIND | VARCHAR(16777216) | A category for each element. These could include Function or Magic |
| LINE | NUMBER(38,0) | The line number in the source files where the element was found |
| PACKAGENAME | VARCHAR(16777216) | The name of the package where the element was found |
| SUPPORTED | BOOLEAN | Whether this reference is “supported” or not (`True`/`False`) |
| AUTOMATED | BOOLEAN | Whether or not the tool can automatically convert it (`True`/`False`) |
| STATUS | VARCHAR(16777216) | The categorization of each element (Rename, Direct, Helper, Transformation, WorkAround, NotSupported, or NotDefined) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| SNOWCONVERTCOREVERSION | VARCHAR(16777216) | The version number for the core code process of the tool |
| CELLID | NUMBER(38,0) | If this element was found in a notebook file, the numbered location of the cell where this element was in the file |
| EXECUTIONID | VARCHAR(16777216) | The unique identifier for this execution of the SMA |
| TECHNOLOGY | VARCHAR(16777216) | Source technology platform |

#### DETAILEDREPORTS

Stores detailed migration assessment reports for each execution. Used by the assistant to provide context-specific answers about migration assessments.

| Column | Type | Description |
| --- | --- | --- |
| ID | NUMBER(38,0) | Auto-incrementing primary key |
| EXECUTION_ID | VARCHAR(16777216) | Unique identifier for the execution run |
| REPORT_TEXT | VARCHAR(16777216) | Full text of the detailed report |

#### EXECUTIONFLOWINVENTORIES

Lists the relations between the different workload scopes, based on the function calls found. This inventory’s main purpose is to serve as the base for the entry points identification.

| Column | Type | Description |
| --- | --- | --- |
| CALLER | VARCHAR(16777216) | The full name of the scope where the call was found |
| CALLERTYPE | VARCHAR(16777216) | The type of the scope where the call was found. This can be: Function, Class, or Module |
| INVOKED | VARCHAR(16777216) | The full name of the element that was called |
| INVOKEDTYPE | VARCHAR(16777216) | The type of the element. This can be: Function or Class |
| FILEID | VARCHAR(16777216) | The relative path of the file (starting from the input folder the user chose in the SMA tool) |
| CELLID | NUMBER(38,0) | The cell number where the call was found inside a notebook file, if applicable |
| LINE | NUMBER(38,0) | The line number where the call was found |
| COLUMNLINE | NUMBER(38,0) | The column number where the call was found |
| EXECUTIONID | VARCHAR(16777216) | The execution ID |

#### IMPORTUSAGESINVENTORIES

Contains all the referenced import calls in the codebase. An import is classified as an external library that gets imported at any point in the file.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The unique name for the actual Spark reference |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| ALIAS | VARCHAR(16777216) | The alias of the element (if any) |
| KIND | VARCHAR(16777216) | Null/empty value because all elements are imports |
| LINE | NUMBER(38,0) | The line number in the source files where the element was found |
| PACKAGENAME | VARCHAR(16777216) | The name of the package where the element was found |
| ISSNOWPARKANACONDASUPPORTED | BOOLEAN | Whether this reference is “supported” or not. Values: `True`/`False` |
| AUTOMATED | VARCHAR(16777216) | Null/empty. This column is deprecated |
| STATUS | VARCHAR(16777216) | Value Invalid. This column is deprecated |
| STATEMENT | VARCHAR(16777216) | The code where the element was used. (Note: This column is not sent via telemetry) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| SNOWCONVERTCOREVERSION | VARCHAR(16777216) | The version number for the core code process of the tool |
| SNOWPARKVERSION | VARCHAR(16777216) | The version of Snowpark API available for the specified technology and run of the tool |
| ELEMENTPACKAGE | VARCHAR(16777216) | The package name where the imported element is declared (when available) |
| CELLID | NUMBER(38,0) | If this element was found in a notebook file, the numbered location of the cell where this element was in the file |
| EXECUTIONID | VARCHAR(16777216) | The unique identifier for this execution of the SMA |
| ORIGIN | VARCHAR(16777216) | Category of the import reference. Possible values are BuiltIn, ThirdPartyLib, or blank |
| TECHNOLOGY | VARCHAR(16777216) | Source technology platform |
| FULLNAME | VARCHAR(16777216) | It represents the correct full path for the current element |

#### INPUTFILESINVENTORIES

Similar to the files inventory, lists every file by filetype and size.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | Filename (same as FileId) |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | Count of files with that filename |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each session of the tool |
| EXTENSION | VARCHAR(16777216) | The file’s extension |
| TECHNOLOGY | VARCHAR(16777216) | The source file’s technology based on extension |
| BYTES | NUMBER(38,0) | Size of the file in bytes |
| CHARACTERLENGTH | NUMBER(38,0) | Count of characters in the file |
| LINESOFCODE | NUMBER(38,0) | Lines of code in the file |
| PARSERESULT | VARCHAR(16777216) | “Successful” if the cell was fully parsed, “Error” if it was not parsed |
| IGNORED | BOOLEAN | Whether file was ignored |
| ORIGINFILEPATH | VARCHAR(16777216) | Original file path |
| EXECUTIONID | VARCHAR(16777216) | The unique identifier for this execution of the SMA |

#### IOFILESINVENTORIES

Lists all external elements that are being read from or written to in the codebase.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The file, variable, or other element being read or written |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | Count of files with that filename |
| ISLITERAL | BOOLEAN | If the read/write location was in a literal |
| FORMAT | VARCHAR(16777216) | If the SMA can determine the format of the element (such as csv, json, etc.) |
| FORMATTYPE | VARCHAR(16777216) | If the format above is specific |
| MODE | VARCHAR(16777216) | Value will be `Read` or `Write` depending on whether there is a reader or writer |
| SUPPORTED | BOOLEAN | Whether this operation is supported in Snowpark |
| LINE | NUMBER(38,0) | The line in the file where the read or write occurs |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each session of the tool |
| OPTIONSETTINGS | VARCHAR(16777216) | If a parameter is defined in the element, it will be listed here |
| CELLID | NUMBER(38,0) | Cell ID where that element was in that FileId (if in a notebook, null otherwise) |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |

#### ISSUES

Lists every conversion issue found in the codebase. A description, the exact location of the issue in the file, and a code associated with that issue will be reported.

| Column | Type | Description |
| --- | --- | --- |
| CODE | VARCHAR(16777216) | The unique code for the issues reported by the tool |
| DESCRIPTION | VARCHAR(16777216) | The text describing the issue and the name of the Spark reference when applies |
| CATEGORY | VARCHAR(16777216) | The classification of each issue. Options: Warning, Conversion Error, Parser Error, Helper, Transformation, WorkAround, NotSupported, NotDefined |
| NODETYPE | VARCHAR(16777216) | The name associated to the syntax node where the issue was found |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| LINE | NUMBER(38,0) | The line number in the source file where the issue was found |
| COLUMNLINE | NUMBER(38,0) | The column position in the source file where the issue was found |
| URL | VARCHAR(16777216) | URL to documentation or more info |
| EXECUTIONID | VARCHAR(16777216) | The unique identifier for this execution of the SMA |

#### JOINSINVENTORIES

Contains an inventory of all DataFrame joins done in the codebase.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | Line number where the join begins (and ends, if not on a single line) |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | Count of files with that filename |
| ISSELFJOIN | BOOLEAN | TRUE if the join is a self join, FALSE if not |
| HASLEFTALIAS | BOOLEAN | TRUE if the join has a left alias, FALSE if not |
| HASRIGHTALIAS | BOOLEAN | TRUE if the join has a right alias, FALSE if not |
| LINE | NUMBER(38,0) | Line number where the join begins |
| KIND | VARCHAR(16777216) | Join type (INNER, LEFT, RIGHT, etc.) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each session of the tool |
| CELLID | NUMBER(38,0) | Cell ID where that element was in that FileId (if in a notebook, null otherwise) |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |

#### NOTEBOOKCELLSINVENTORIES

Gives an inventory of all cells in a notebook based on the source code for each cell and the lines of code in that cell.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | Source language (Python, Scala, or SQL) |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | Count of files with that filename |
| CELLID | NUMBER(38,0) | Cell ID where that element was in that FileId (if in a notebook, null otherwise) |
| ARGUMENTS | VARCHAR(16777216) | Null (this field will be empty) |
| LOC | NUMBER(38,0) | Lines of code in that cell |
| SIZE | NUMBER(38,0) | Count of characters in that cell |
| SUPPORTEDSTATUS | BOOLEAN | `TRUE` unless the element (source language) is not supported by the SMA tool (`FALSE`) |
| PARSINGRESULT | VARCHAR(16777216) | “Successful” if the cell was fully parsed; “Error” if it was not parsed |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |

#### NOTEBOOKSIZEINVENTORIES

Lists the size in lines of code of different source languages present in notebook files.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | Filename (for this spreadsheet, it is the same as the FileId) |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | Count of files with that filename |
| PYTHONLOC | NUMBER(38,0) | Python lines of code present in notebook cells (will be 0 for non-notebook files) |
| SCALALOC | NUMBER(38,0) | Scala lines of code present in notebook cells (will be 0 for non-notebook files) |
| SQLLOC | NUMBER(38,0) | SQL lines of code present in notebook cells (will be 0 for non-notebook files) |
| LINE | VARCHAR(16777216) | Null (this field will be empty) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each session of the tool |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |

#### PACKAGESINVENTORIES

Tracks package usage in the codebase.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The name of the package |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where package was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| ALIAS | VARCHAR(16777216) | Package alias if used |
| KIND | VARCHAR(16777216) | Type of package |
| LINE | VARCHAR(16777216) | Line reference |
| PACKAGENAME | VARCHAR(16777216) | Full package name |
| SUPPORTED | VARCHAR(16777216) | Support status |
| AUTOMATED | VARCHAR(16777216) | Automation status |
| STATUS | VARCHAR(16777216) | Migration status |
| STATEMENT | VARCHAR(16777216) | Full import statement |
| SESSIONID | VARCHAR(16777216) | Session identifier |
| SNOWCONVERTCOREVERSION | VARCHAR(16777216) | SnowConvert core version used |
| SNOWPARKVERSION | VARCHAR(16777216) | Target Snowpark version |
| CELLID | VARCHAR(16777216) | Cell identifier (for notebooks) |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| PARAMETERSINFO | VARCHAR(16777216) | Parameter information |
| TECHNOLOGY | VARCHAR(16777216) | Source technology platform |

#### PANDASUSAGESINVENTORIES

**[Python Only]** Lists every reference to the Pandas API present in the scanned codebase.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The unique name for the actual pandas reference |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| ALIAS | VARCHAR(16777216) | The alias of the element (applies just for import elements) |
| KIND | VARCHAR(16777216) | A category for each element. These could include Class, Variable, Function, Import and others |
| LINE | NUMBER(38,0) | The line number in the source files where the element was found |
| PACKAGENAME | VARCHAR(16777216) | The name of the package where the element was found |
| SUPPORTED | BOOLEAN | Whether this reference is “supported” or not. Values: `True`/`False` |
| AUTOMATED | BOOLEAN | Whether or not the tool can automatically convert it. Values: `True`/`False` |
| STATUS | VARCHAR(16777216) | The categorization of each element. Options: Rename, Direct, Helper, Transformation, WorkAround, NotSupported, NotDefined |
| STATEMENT | VARCHAR(16777216) | How that element was used. (Note: This column is not sent via telemetry) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| SNOWCONVERTCOREVERSION | VARCHAR(16777216) | The version number for the core code process of the tool |
| PANDASVERSION | VARCHAR(16777216) | Version number of the pandas API that was used to identify elements in this codebase |
| CELLID | VARCHAR(16777216) | Cell ID where that element was in that FileId (if in a notebook, null otherwise) |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| PARAMETERSINFO | VARCHAR(16777216) | Parameter information |
| TECHNOLOGY | VARCHAR(16777216) | Source technology platform |

#### SPARKUSAGESINVENTORIES

Shows the exact location and usage for each reference to the Spark API. This information is used to build the Readiness Score.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The unique name for the actual Spark reference |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| ALIAS | VARCHAR(16777216) | The alias of the element (applies just for import elements) |
| KIND | VARCHAR(16777216) | A category for each element. These could include Class, Variable, Function, Import and others |
| LINE | NUMBER(38,0) | The line number in the source files where the element was found |
| PACKAGENAME | VARCHAR(16777216) | The name of the package where the element was found |
| SUPPORTED | BOOLEAN | Whether this reference is “supported” or not. Values: `True`/`False` |
| AUTOMATED | BOOLEAN | Whether or not the tool can automatically convert it. Values: `True`/`False` |
| STATUS | VARCHAR(16777216) | The categorization of each element. Options: Rename, Direct, Helper, Transformation, WorkAround, NotSupported, NotDefined |
| STATEMENT | VARCHAR(16777216) | The code where the element was used. (Note: This column is not sent via telemetry) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| SNOWCONVERTCOREVERSION | VARCHAR(16777216) | The version number for the core code process of the tool |
| SNOWPARKVERSION | VARCHAR(16777216) | The version of Snowpark API available for the specified technology and run of the tool |
| CELLID | NUMBER(38,0) | If this element was found in a notebook file, the numbered location of the cell where this element was in the file |
| EXECUTIONID | VARCHAR(16777216) | The unique identifier for this execution of the SMA |
| PARAMETERSINFO | VARCHAR(16777216) | Parameter information |
| TECHNOLOGY | VARCHAR(16777216) | Source technology platform |

#### SQLEMBEDDEDUSAGEINVENTORIES

Contains a count of SQL keywords present in SQL Spark elements.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | Name for the code element where the SQL was found (such as SqlFromClause, SqlSelect, SqlSelectBody, SqlSignedNumericLiteral) |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the SQL reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| LIBRARYNAME | VARCHAR(16777216) | Name of the library being used |
| HASLITERAL | BOOLEAN | Indicates whether the element contains literals |
| HASVARIABLE | BOOLEAN | Indicates whether the element contains variables |
| HASFUNCTION | BOOLEAN | Indicates whether the element contains functions |
| PARSINGSTATUS | VARCHAR(16777216) | Indicates the parsing status (such as Success, Failed, Partial) |
| HASINTERPOLATION | BOOLEAN | Indicates whether the element contains interpolations |
| CELLID | NUMBER(38,0) | The notebook cell ID |
| LINE | NUMBER(38,0) | The line number where that element occurs |
| COLUMNLINE | NUMBER(38,0) | The column number where that element occurs |

#### SQLFUNCTIONSINVENTORIES

Inventories SQL functions used in the code with their categories and migration status.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | SQL function name |
| PROJECTID | VARCHAR(16777216) | Project identifier |
| FILEID | VARCHAR(16777216) | Identifier for the source file |
| COUNT | NUMBER(38,0) | Number of occurrences |
| CATEGORY | VARCHAR(16777216) | Function category |
| MIGRATIONSTATUS | VARCHAR(16777216) | Migration status for this function |
| CELLID | NUMBER(38,0) | Cell identifier (for notebooks) |
| LINE | NUMBER(38,0) | Line number in the source file |
| COLUMNLINE | NUMBER(38,0) | Column position in the line |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for the execution run |

#### THIRDPARTYUSAGESINVENTORIES

Tracks third-party library and package usage found during code analysis.

| Column | Type | Description |
| --- | --- | --- |
| ELEMENT | VARCHAR(16777216) | The unique name for the third party reference |
| PROJECTID | VARCHAR(16777216) | Name of the project (root directory the tool was run on) |
| FILEID | VARCHAR(16777216) | File where the Spark reference was found and the relative path to that file |
| COUNT | NUMBER(38,0) | The number of times that element shows up in a single line |
| ALIAS | VARCHAR(16777216) | The alias of the element (if any) |
| KIND | VARCHAR(16777216) | Categorization of the element such as variable, type, function, or class |
| LINE | NUMBER(38,0) | The line number in the source files where the element was found |
| PACKAGENAME | VARCHAR(16777216) | Package name for the element (concatenation of ProjectId and FileId in Python) |
| STATEMENT | VARCHAR(16777216) | The code where the element was used. (Note: This column is not sent via telemetry) |
| SESSIONID | VARCHAR(16777216) | Unique identifier for each session of the tool |
| CELLID | NUMBER(38,0) | Cell ID where that element was in that FileId (if in a notebook, null otherwise) |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each execution of the tool |
| PARAMETERSINFO | VARCHAR(16777216) | Parameter information |

#### TOOLEXECUTIONS

Tracks tool execution metadata including timing, results, and version information.

| Column | Type | Description |
| --- | --- | --- |
| EXECUTIONID | VARCHAR(16777216) | Unique identifier for each run of the tool |
| TOOLNAME | VARCHAR(16777216) | The name of the tool. Values: PythonSnowConvert, SparkSnowConvert (Scala tool) |
| TOOLVERSION | VARCHAR(16777216) | The version number of the tool |
| ASSEMBLYNAME | VARCHAR(16777216) | The name of the code processor (essentially, a longer version of the ToolName) |
| LOGFILE | VARCHAR(16777216) | Whether a log file was sent on an exception/failure |
| FINALRESULT | VARCHAR(16777216) | Where the tool stopped if there was an exception/failure |
| EXCEPTIONREPORT | VARCHAR(16777216) | If an exception report was sent on an exception/failure |
| STARTTIME | NUMBER(38,0) | The timestamp for when the tool started executing |
| ENDTIME | NUMBER(38,0) | The timestamp for when the tool stopped executing |
| SYSTEMNAME | VARCHAR(16777216) | The serial number of the machine where the tool was executing (this is only used for troubleshooting and license validation purposes) |

#### Assessment files to table mapping

The following CSV files from assessment exports map to their corresponding database tables:

| CSV File | Database Table |
| --- | --- |
| ArtifactDependencyInventory.csv | ARTIFACTDEPENDENCYINVENTORIES |
| CheckpointsInventory.csv | CHECKPOINTSINVENTORIES |
| DataFramesInventory.csv | DATAFRAMESINVENTORIES |
| DbxElementsInventory.csv | DBXELEMENTSINVENTORIES |
| ExecutionFlowInventory.csv | EXECUTIONFLOWINVENTORIES |
| ImportUsagesInventory.csv | IMPORTUSAGESINVENTORIES |
| InputFilesInventory.csv | INPUTFILESINVENTORIES |
| IOFilesInventory.csv | IOFILESINVENTORIES |
| Issues.csv | ISSUES |
| JoinsInventory.csv | JOINSINVENTORIES |
| NotebookCellsInventory.csv | NOTEBOOKCELLSINVENTORIES |
| NotebookSizeInventory.csv | NOTEBOOKSIZEINVENTORIES |
| PackagesInventory.csv | PACKAGESINVENTORIES |
| PandasUsagesInventory.csv | PANDASUSAGESINVENTORIES |
| SparkUsagesInventory.csv | SPARKUSAGESINVENTORIES |
| SqlEmbeddedUsageInventory.csv | SQLEMBEDDEDUSAGEINVENTORIES |
| SqlFunctionsInventory.csv | SQLFUNCTIONSINVENTORIES |
| ThirdPartyUsagesInventory.csv | THIRDPARTYUSAGESINVENTORIES |
| tool_execution.csv | TOOLEXECUTIONS |
| DetailedReport.docx | DETAILEDREPORTS |

### Schema notes

* The `DOCUMENTATION_METADATA` table uses vector embeddings (768 dimensions) for semantic search.
* Tables are created with `CREATE OR REPLACE` to ensure clean setup on each initialization.

## Stored procedures

The assistant creates several stored procedures in your Snowflake account to power the natural language query processing capabilities.
These procedures leverage Snowflake Cortex AI to interpret questions, generate SQL, and format responses.

### GET_CHATBOT_RESPONSE

The primary stored procedure that processes natural language questions and returns human-readable answers using Snowflake Cortex AI.

#### Signature

```sql
GET_CHATBOT_RESPONSE(IA_MODEL VARCHAR, QUESTION VARCHAR, EXECUTION_ID VARCHAR)
RETURNS VARCHAR
```

#### Parameters

| Parameter | Type | Description |
| --- | --- | --- |
| IA_MODEL | VARCHAR | The Snowflake Cortex AI model to use (such as ‘llama3.1-70b’, ‘mistral-large2’) |
| QUESTION | VARCHAR | The natural language question from the user |
| EXECUTION_ID | VARCHAR | The execution ID to filter data to a specific assessment run |

#### Returns

VARCHAR - A human-readable answer to the user’s question

#### Workflow

The procedure implements a multi-step process:

1. **Context Retrieval (RAG)**

   * Performs vector similarity search on `DOCUMENTATION_METADATA` table.
   * Uses `SNOWFLAKE.CORTEX.EMBED_TEXT_768` with the ‘e5-base-v2’ model.
   * Retrieves the most relevant documentation using `VECTOR_COSINE_SIMILARITY`.
2. **SQL Generation**

   * Constructs a prompt with the retrieved context and user question.
   * Calls `SNOWFLAKE.CORTEX.COMPLETE` to generate a SQL query.
   * The generated query is scoped to the specific `EXECUTION_ID`.
3. **SQL Validation (Self-Correction Loop 1)**

   * Validates whether the generated SQL logically answers the question.
   * Uses Snowflake Cortex to perform a yes/no validation check.
4. **Fallback Path (if SQL is invalid)**

   * Retrieves full report context from `DETAILEDREPORTS` table.
   * Attempts to answer using the detailed report as context.
   * Performs another validation check on the answer quality.
   * Falls back to general knowledge if context-based answer is insufficient.
5. **SQL Execution (if SQL is valid)**

   * Executes the generated SQL query dynamically.
   * Stores results in a temporary table for processing.
   * Handles single results, multiple results (aggregates up to 10), and empty results.
6. **Response Formatting**

   * Creates a final prompt combining the question and query results.
   * Calls Snowflake Cortex again to format the answer as a friendly, natural sentence.
   * Returns the human-readable response.

#### Error Handling

* Returns `"I could not find any data that matched your request."` when query returns zero rows.
* Automatically falls back through multiple strategies to ensure a meaningful answer.

#### Example

```sql
CALL GET_CHATBOT_RESPONSE(
    'llama3.1-70b',
    'How many Python files were analyzed?',
    'ABC123XYZ'
);
```

### GET_CURRENT_REGION_DETAILS

Retrieves information about the current Snowflake account’s region, including cloud provider and region display name.

#### Signature

```sql
GET_CURRENT_REGION_DETAILS()
RETURNS TABLE (
    CLOUD_PROVIDER VARCHAR,
    CLOUD_REGION_NAME VARCHAR,
    CLOUD_REGION_DISPLAY_NAME VARCHAR
)
```

#### Parameters

None

#### Returns

A table with three columns:

| Column | Type | Description |
| --- | --- | --- |
| CLOUD_PROVIDER | VARCHAR | The cloud provider (such as ‘AWS’, ‘Azure’, ‘GCP’) |
| CLOUD_REGION_NAME | VARCHAR | The technical region identifier (such as ‘us-east-1’) |
| CLOUD_REGION_DISPLAY_NAME | VARCHAR | The human-readable region name (such as ‘US East (N. Virginia)’) |

#### Workflow

1. Executes `SHOW REGIONS` to populate the result set cache.
2. Queries the result using `RESULT_SCAN(LAST_QUERY_ID())`.
3. Filters to only the region matching `CURRENT_REGION()`.
4. Returns the filtered result as a table.

#### Purpose

This procedure is used to determine which Snowflake Cortex AI models are available, as model availability varies by region. The assistant uses this information to populate the LLM selection dropdown with region-specific options.

#### Example

```sql
CALL GET_CURRENT_REGION_DETAILS();
```

##### Output

| CLOUD_PROVIDER | CLOUD_REGION_NAME | CLOUD_REGION_DISPLAY_NAME |
| --- | --- | --- |
| AWS | us-west-2 | US West (Oregon) |

### Stored procedure notes

* Both procedures are created with `CREATE OR REPLACE` to ensure clean setup.
* `GET_CHATBOT_RESPONSE` executes as `OWNER` to access the necessary tables and Snowflake Cortex functions.
* The procedures use Snowflake’s dynamic SQL capabilities (`EXECUTE IMMEDIATE`) for flexible query execution.
* Vector embeddings use the ‘e5-base-v2’ model with 768 dimensions for semantic search.
* The multi-step validation process ensures high-quality, contextually relevant answers.

## Troubleshooting

### Resetting the assistant infrastructure

If you need to re-create the assistant infrastructure from scratch, you must first delete the local configuration file.

#### Configuration file location

* **macOS/Linux**: `~/.smachatbot/config.json`
* **Windows**: `%USERPROFILE%.smachatbotconfig.json`

This JSON configuration file contains a composite key consisting of:

* `snowflake_identifier`
* `snowflake_user`
* `snowflake_role`

**To reset the assistant**:

1. Delete the `config.json` file from the appropriate directory:

   * **macOS/Linux**: `~/.smachatbot/config.json`
   * **Windows**: `%USERPROFILE%.smachatbotconfig.json`
2. Re-run the assistant initialization process

   > **Note:**
   >
   > Deleting this file will remove all stored connection settings and require you to reconfigure the assistant with your Snowflake credentials.

## Known issues / FAQs

### Snowflake connection caching with VPN

#### Issue

When testing the connection to Snowflake while a VPN is required but not yet connected, the Snowflake driver caches the failed connection state. Even after successfully connecting to the VPN, subsequent connection attempts may still fail due to this cached state.

#### Workaround

If you experience connection failures after connecting to your VPN:

1. Close and restart the SMA application.
2. Attempt the connection test again with the VPN active from the start.

   > **Tip:**
   >
   > Always ensure your VPN is connected *before* initiating any Snowflake connection tests to avoid this caching behavior.

---
title: What is SnowConvert CLI?
source: https://docs.snowflake.com/en/migrations/snowconvert-docs/general/user-guide/snowconvert/command-line-interface/README.md
section: Migrations
---

# What is SnowConvert CLI?

SnowConvert AI CLI (scai) encapsulates all SnowConvert functions into a single command line tool dedicated to increasing the speed of migrations from various source platforms into Snowflake.

With the SnowConvert AI CLI, migration engineers can:

* extract code from their source platform
* run a deterministic conversion on that code
* further advance their migration using ai-conversion to cover objects that the deterministic engine could not translate
* deploy that code to Snowflake
* migrate data from the source system to Snowflake
* validate that data between the two systems

The CLI will also allow developers to create skills and agents that utilize the tool to automate their process.

## Prerequisites

* macOS, Windows, or Linux
* SnowflakeCLI: recommended for Snowflake connection configuration [SnowCLI Install Guide](https://docs.snowflake.com/en/developer-guide/snowflake-cli/installation/installation)
* A source database to extract from, or a set of code to use

## Snowflake Connection Setup

The SnowConvert AI CLI (scai) reuses your Snowflake CLI connection configuration. The connection is used for functionality in ai-convert, deploy, and the cloud versions of data migration and validation. Your Snowflake account also authenticates you for SnowConvert, skipping the need for an access code that was necessary in prior versions.\*

**Snowflake Account Requirement**

Before using scai init, scai code convert, scai code extract, scai ai-convert, or scai code deploy, ensure that you:

Can connect to Snowflake with snow connection test
Your Snowflake CLI has a default connection configured (this is used when you don’t specify a name).

`To configure a Snowflake connection:`

```shell
# Add a new connection using Snowflake CLI
snow connection add

# Set it as the default
snow connection set-default <connection_name>

# Test your connection
snow connection test
```

Once this is configured, commands needing a connection to Snowflake will use your Snowflake CLI connection automatically.

## Installation

Homebrew Installation

If you do not have homebrew installed, follow the [instructions here](https://brew.sh/).

There are two public channels for builds, Preview and GA.

Stable Version (recommended)

Install the stable production (GA) release:

```shell
brew tap snowflakedb/snowconvert-ai
brew install --cask snowconvert-ai
```

Preview Version

Install the Preview (pr) version with pre-release features from the beta/staging environment:

```shell
brew tap snowflakedb/snowconvert-ai
brew install --cask snowconvert-ai-pr
```

Usage

After installation, you can use the SnowConvert CLI:

```shell
scai --help
```

Managing Installations

View installed Version

```shell
brew info --cask snowconvert-ai
# or
brew info --cask snowconvert-ai-pr
```

Switch between versions

```shell
# Uninstall current version
brew uninstall --cask snowconvert-ai #or snowconvert-ai-pr

# Install another version
brew install --cask snowconvert-ai
# or
brew install --cask snowconvert-ai-pr
```

Update to latest version
Important: you must run `brew update` first to sync the tap with the latest cask definitions:

```shell
# Update tap definitions and upgrade to latest version
brew update && brew upgrade --cask snowconvert-ai

# For preview version
brew update && brew upgrade --cask snowconvert-ai-pr
```

Why both commands? `brew update` synchronizes your local tap with the latest cask definitions from GitHub. Without it, `brew upgrade` won’t see new versions even if they exist on the server.

## Installer Packages:

*GA Releases*

| OS | Installer |
| --- | --- |
| macOS | [Apple Silicon](https://snowconvert.snowflake.com/storage/darwin_arm64/prod/cli/snowflake-scai-cli-darwin-arm64.pkg) |
| macOS | [Intel](https://snowconvert.snowflake.com/storage/darwin_x64/prod/cli/snowflake-scai-cli-darwin-x64.pkg) |
| Linux | [arm64 .pkg](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-arm64.rpm) |
| Linux | [arm64 .deb](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-arm64.deb) |
| Linux | [x64 .rpm](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-x64.rpm) |
| Linux | [x64 .deb](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-x64.deb) |
| Linux | [x64 .tar.gz](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-x64.tar.gz) |
| Linux | [arm64 .tar.gz](https://snowconvert.snowflake.com/storage/linux/prod/cli/snowflake-scai-cli-linux-arm64.tar.gz) |
| Windows | [arm64 .msi](https://snowconvert.snowflake.com/storage/windows_arm64/prod/cli/snowflake-scai-cli-windows-arm64.msi) |
| Windows | [x64 .msi](https://snowconvert.snowflake.com/storage/windows/prod/cli/snowflake-scai-cli-windows-x64.msi) |

*Preview Releases*

| OS | Installer |
| --- | --- |
| macOS | [Apple Silicon](https://snowconvert.snowflake.com/storage/darwin_arm64/beta/cli/snowflake-scai-cli-darwin-arm64-beta.pkg) |
| macOS | [Intel](https://snowconvert.snowflake.com/storage/darwin_x64/beta/cli/snowflake-scai-cli-darwin-x64-beta.pkg) |
| Linux | [arm64 .pkg](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-arm64-beta.rpm) |
| Linux | [arm64 .deb](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-arm64-beta.deb) |
| Linux | [x64 .rpm](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-x64-beta.rpm) |
| Linux | [x64 .deb](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-x64-beta.deb) |
| Linux | [x64 .tar.gz](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-x64-beta.tar.gz) |
| Linux | [arm64 .tar.gz](https://snowconvert.snowflake.com/storage/linux/beta/cli/snowflake-scai-cli-linux-arm64-beta.tar.gz) |
| Windows | [arm64 .msi](https://snowconvert.snowflake.com/storage/windows_arm64/beta/cli/snowflake-scai-cli-windows-arm64-beta.msi) |
| Windows | [x64 .msi](https://snowconvert.snowflake.com/storage/windows/beta/cli/snowflake-scai-cli-windows-x64-beta.msi) |

## Accept Terms and Conditions

Issusing the following command will display the license terms for using the SnowConvert AI CLI. It is required that you do this in order to use the product.

```shell
# display the scai terms and conditions
scai terms

#displays terms and allows you to accept them
scai terms accept
```

## Understanding Projects

A project is required before you can use any other scai command. This is similar to how Git requires you to run git init before using other Git commands.

A project:

* Organizes your migration work in a dedicated folder structure
* Tracks your source dialect (Oracle, SQL Server, Teradata, etc.)
* Stores configuration, source code, converted code, and reports

When you run scai init, it creates this folder structure in the target directory.

* If you pass a PATH, scai will create that folder (if it doesn’t exist) and initialize the project inside it.
* If you omit PATH, scai initializes the current directory (which must be empty).

The following folder structure:

```none
project/
├── .git/
├── .gitignore
├── .scai/
│   ├── config/
│   │   ├── project.yml                        ← Team-shared config (Git)
│   │   ├── project.local.yml                  ← Personal config (gitignored)
│   │   └── conversion-context/
│   │       └── MigrationContext.json
│   └── registry/
│       ├── {uuid1}.json
│       ├── {uuid2}.json
│       ├── {uuid3}.json
│       └── .locks/                            ← SCRIPT STATE (Git)
│           └── registry.lock
├── settings/                ← User-managed settings (not created by default)
│   ├── extraction.yml
│   ├── deployment.yml
│   └── ai-verification.yml
├── source/                                    ← Source code Database objects
│   └── db1/
│       └── retail/                            -- schema1
│           ├── Tables/
│           │   ├── table_customers.sql
│           │   └── table_orders.sql
│           └── Stored Procedures/
│               └── proc_calculate.sql
├── snowflake/                                ← Working directory of converted code (may have manual edits, should not get overwritten)
│   ├── db1/
│   │   └── retail/                            -- schema1
│   │       ├── Tables/
│   │       │   ├── table_customers.sql
│   │       │   └── table_orders.sql
│   │       └── Stored Procedures/
│   │           └── proc_calculate.sql
│   └── dbt/                                   ← Converted scripts (optional)
│       └── models/
│           └── staging/
│               └── stg_customer_daily.sql
├── artifacts/                                 ← All artifacts related to objects
│   ├── source_raw/                            ← Original source code without changes
│   ├── db1/
│   │   ├── retail/                            -- schema1
│   │   │   └── table/             -- one folder per object kind
│   │   │       └── products/
│   │   │           ├── deterministic/         -- different runs from deterministic engine
│   │   │           │   ├── 20261201.350956/
│   │   │           │   │   └── proc_calculate.sql
│   ├── UDF Helpers/
│   └── ETL/
│   │     ├── DWH_EXAMPLE
│   │     │     ├── DWH_EXAMPLE.sql
├── reports/                                   ← Generated reports (optional in Git)
│   ├── SnowConvert/
│   │   ├── ObjectReferences.<timestamp>.csv
│   │   ├── TopLevelCodeUnits.csv
│   │   └── Issues.csv
│   ├── GenericScanner/
│   │   └── GenericScannerOutput/
│   │       ├── line_counts.pam
│   │       ├── files.pam
│   │       ├── FilesInventory.csv
│   │       ├── word_counts.pam
│   │       ├── KeywordCounts.csv
│   │       └── tool_execution.pam
│   └── ...
├── logs/
│   ├── GenericInfrastructureController/
│   ├── GenericScanner/
│   └── Snowconvert/
└── results/
    └── DataValidation/
```

Important: after creating a project, run all subsequent scai commands from within the project folder (where the .scai directory is).

CLI Logs are written to ~/.scai/logs/jobs.log by default

## Quick Start: Code Conversion Only

Use this workflow when you have existing SQL files to convert. Works with all supported dialects.

> [!IMPORTANT]
> You must already be in an empty directory to create a project.

#1. Create a project folder and initialize it (project name is inferred from folder name)

```shell
scai init my-project -l Oracle
cd my-project
```

#2. Add your source code

```shell
scai code add -i /path/to/your/sql/files
```

#3. Convert to Snowflake SQL

```shell
scai code convert
```

#4. Deploy to Snowflake

```shell
scai code deploy
```

Your converted code will be in the snowflake/ folder, and conversion reports in reports/.

## Quick Start: End-to-End Migration

Use this workflow to extract code directly from your source database. Only available for SQL Server and Redshift.

Step 1: Create Project
# Create a project folder and initialize it (project name is inferred from folder name)

```shell
scai init my-project -l SqlServer # or Redshift
cd my-project
```

Step 2: Configure Source Connection

```shell
#SQL Server (interactive mode - recommended)
scai connection add-sql-server
```

```shell
#Redshift (interactive mode - recommended)
scai connection add-redshift
```

The interactive mode will prompt you for connection details.

Set a default source connection (used with scai code extract runs without -source-connection)

```shell
scai connection set-default -l sqlserver --connection <NAME>
```

Or

```shell
scai connection set-default -l redshift --connection <NAME>
```

Step 3: Extract, Convert, Deploy

#extract code from source database

```shell
scai code extract
```

#convert to Snowflake SQL

```shell
scai code convert
```

#deploy to Snowflake

```shell
scai code deploy
```

---

## Filtering Objects with the `--where` Clause

Many `scai` commands operate on the **Code Unit Registry** – a local index of every code unit (table, view, procedure, function, etc.) in your project. The `--where` flag lets you filter which objects a command acts on, using a SQL-like expression against that registry.

This section covers how the WHERE clause works, which commands support it, and how to use it effectively.

### How the Code Unit Registry works

1. When you run `scai code add` or `scai code extract`, source code is added to your project and split into individual code units.
2. When you run `scai code convert`, the CLI builds a registry entry for every code unit, tracking its source/target metadata, object type, and conversion status.
3. When you pass `--where` to a supported command, the CLI queries that registry and applies the operation only to matching objects.

**The registry must exist before `--where` can be used.** If you haven’t run at least `scai code add` (or `scai code extract`) yet, `--where` will fail with a “registry not found” error.

### Discovering queryable fields: `scai code where`

Run `scai code where` to see the full, up-to-date reference of all queryable fields, supported operators, and usage examples. The output is generated from the actual registry library, so it is always current.

```bash
scai code where
```

The most commonly used fields (all field names are **camelCase**, all enum values are **lowercase**):

| Field | Description | Example values |
| --- | --- | --- |
| `source.name` | Object name in the source database | `'my_procedure'` |
| `source.objectType` | Object type in source | `'table'`, `'procedure'`, `'view'`, `'function'` |
| `source.database` | Source database name | `'my_db'` |
| `source.schema` | Source schema name | `'dbo'`, `'public'` |
| `target.name` | Object name in Snowflake | `'MY_PROCEDURE'` |
| `target.objectType` | Object type in Snowflake | `'table'`, `'procedure'`, `'view'`, `'function'` |
| `target.database` | Target database name | `'MY_DB'` |
| `target.schema` | Target schema name | `'PUBLIC'` |
| `codeStatus.conversion.status` | Conversion result | `'pending'`, `'completed'`, `'failed'`, `'excluded'` |
| `codeStatus.registration.status` | Registration/extraction result | `'pending'`, `'completed'`, `'failed'`, `'excluded'` |
| `codeStatus.aiVerification.status` | AI verification result | `'pending'`, `'completed'`, `'failed'`, `'excluded'` |

### Previewing results: `scai code find`

Before running a destructive or long-running operation, use `scai code find` to test your filter and see which objects match:

```bash
# Show all code units in the registry
scai code find

# Test a WHERE filter
scai code find --where "target.objectType = 'table'"

# Show all results (default caps at 100)
scai code find --where "source.schema = 'dbo'" --no-limit
```

`scai code find` accepts the same `--where` syntax as all other commands that support it. Use it as a dry-run before committing to an operation.

### Commands that support `--where`

The following table summarizes every `scai` command that accepts the `--where` flag:

| Command | Purpose | `--where` notes |
| --- | --- | --- |
| `scai code find` | Preview/query code units in the registry | Primary tool for testing filters before using them elsewhere |
| `scai code convert` | Convert source code to Snowflake SQL | Only matched units are transformed; dependencies are still parsed for symbol resolution |
| `scai code deploy` | Deploy converted code to Snowflake | Also supports `--include-dependencies` to automatically include objects that filtered objects depend on |
| `scai code accept` | Accept latest artifact versions into the snowflake folder |  |
| `scai ai-convert start` | Start AI-powered conversion improvement | Cannot be combined with `--selector` (pick one) |
| `scai ai-convert accept` | Accept AI-suggested fixes | Filters which suggested fixes to review/accept |
| `scai data migrate` | Migrate data from source to Snowflake | Full migration projects only |
| `scai data validate` | Validate data between source and Snowflake | Full migration projects only |

Each of these commands uses the same WHERE clause syntax. The recommended workflow is:

```bash
scai code where          (learn the fields)
        ↓
scai code find --where   (preview what matches)
        ↓
scai <command> --where   (run the operation)
```

---

### `scai code find --where`

Query the Code Unit Registry and display matching objects. This is the safest way to test a filter before using it with a command that makes changes.

```bash
scai code find --where <WHERE_CLAUSE> [--no-limit]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects |
| `--no-limit` | Show all results (default limit is 100) |

**Examples:**

```bash
# Find all code units (no filter)
scai code find

# Find a specific object by name
scai code find --where "source.name = 'my_table'"

# Find all procedures in a schema
scai code find --where "source.schema = 'dbo' AND source.objectType = 'procedure'"

# Find objects that failed conversion
scai code find --where "codeStatus.conversion.status = 'failed'"

# Find objects not yet AI-verified
scai code find --where "codeStatus.aiVerification.status = 'pending'"
```

---

### `scai code convert --where`

Convert only a filtered subset of code units to Snowflake SQL. Objects that don’t match the filter are still parsed for dependency and symbol resolution, but only matched units produce converted output.

```bash
scai code convert --where <WHERE_CLAUSE> [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like filter to select which code units to convert |
| `--overwrite-working-directory` | Overwrite existing output files in the snowflake/ directory |
| `-x, --show-ewis` | Show detailed EWI table instead of summary |

**Examples:**

```bash
# Convert only procedures
scai code convert --where "source.objectType = 'procedure'"

# Convert objects in a single schema
scai code convert --where "source.schema = 'dbo'"

# Convert only tables and views
scai code convert --where "source.objectType IN ('table', 'view')"
```

> **Note:** Even when filtering, the converter still parses all source files for symbol resolution. This ensures that cross-object references (e.g., a procedure referencing a table) are resolved correctly, even if the referenced object is not in the `--where` filter.

---

### `scai code deploy --where`

Deploy a filtered subset of converted objects to Snowflake, instead of deploying everything.

```bash
scai code deploy --where <WHERE_CLAUSE> [--include-dependencies] [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects to deploy |
| `--include-dependencies` | Also deploy the dependencies of the filtered code units. Has no effect without `--where`, since all code units are already included. |
| `-c, --connection <CONNECTION>` | Snowflake connection to use |
| `-d, --database` | Target database name for deployment |
| `-a, --all` | Deploy all successfully converted objects without selection prompt |
| `-r, --retry` | Number of retry attempts for failed deployments (default: 1) |
| `--continue-on-error` | Continue deploying remaining objects even if some fail (default: True) |
| `--warehouse <WAREHOUSE>` | Warehouse override (in-memory only, applied if connection has none) |
| `--schema <SCHEMA>` | Schema override (in-memory only) |
| `--role <ROLE>` | Role override (in-memory only) |

**Examples:**

```bash
# Deploy only tables
scai code deploy --where "target.objectType = 'table'"

# Deploy procedures and their dependencies (e.g. tables they reference)
scai code deploy --where "target.objectType = 'procedure'" --include-dependencies

# Deploy objects from a single schema
scai code deploy --where "source.schema = 'sales'"

# Deploy a specific object by name
scai code deploy --where "source.name = 'calculate_totals'"
```

**Why `--include-dependencies` matters:** When you filter with `--where`, you may select procedures that depend on tables or views. Without `--include-dependencies`, those dependent objects won’t be deployed, and the procedures may fail at runtime. Use this flag to automatically pull in everything the filtered objects need.

---

### `scai ai-convert start --where`

Send a filtered subset of objects for AI-powered conversion improvement, instead of processing everything.

```bash
scai ai-convert start --where <WHERE_CLAUSE> [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter objects. Requires the code unit registry. |
| `-c, --connection <CONNECTION>` | Snowflake connection to use |
| `--selector <PATH>` | Path to object selector file (code-conversion-only projects). **Cannot be combined with `--where`.** |
| `-i, --instructions <PATH>` | Path to instructions file |
| `-w, --watch` | Wait for completion and show progress |
| `-y, --accept-disclaimers` | Skip disclaimer prompt |
| `--warehouse <WAREHOUSE>` | Warehouse override (in-memory only) |
| `--schema <SCHEMA>` | Schema override (in-memory only) |
| `--role <ROLE>` | Role override (in-memory only) |
| `--database <DATABASE>` | Database override (in-memory only) |

**Examples:**

```bash
# AI-convert only tables
scai ai-convert start --where "target.objectType = 'table'"

# AI-convert objects from a specific schema
scai ai-convert start --where "source.schema = 'dbo'"

# AI-convert a single object by name
scai ai-convert start --where "source.name = 'calculate_totals'"

# Combine with other flags
scai ai-convert start --where "target.objectType = 'procedure'" -w -y
scai ai-convert start --where "source.schema = 'reporting'" -i config/instructions.yml
```

> **`--where` and `--selector` cannot be combined.** Use `--selector` when you have a selector file for a code-conversion-only project. Use `--where` for expressive filtering by type, schema, status, or any other registry field.

---

### `scai code accept --where`

Accept the latest artifact versions into the snowflake output folder for a filtered subset of objects. Without `--where`, all objects are accepted.

```bash
scai code accept --where <WHERE_CLAUSE>
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | Filter expression to select which objects to accept |

**Examples:**

```bash
# Accept only tables
scai code accept --where "source.objectType = 'table'"

# Accept objects from a specific schema
scai code accept --where "source.schema = 'dbo'"

# Accept only successfully converted objects
scai code accept --where "codeStatus.conversion.status = 'completed'"
```

---

### `scai ai-convert accept --where`

Review and accept AI-suggested fixes for a filtered subset of objects, instead of reviewing everything.

```bash
scai ai-convert accept [JOB_ID] --where <WHERE_CLAUSE> [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter which suggested fixes to accept. Full migration projects only. |
| `--all` | Accept all matching AI-suggested fixes without prompting |
| `-i, --interactive` | Review each matching code unit one by one |
| `--summary` | Preview affected code units without making changes (default) |
| `--json` | Output results in JSON format (for automation) |

**Examples:**

```bash
# Preview AI fixes for tables only
scai ai-convert accept --summary --where "source.objectType = 'table'"

# Accept all AI fixes for a specific schema
scai ai-convert accept --all --where "source.schema = 'sales'"

# Interactively review AI fixes for procedures
scai ai-convert accept -i --where "source.objectType = 'procedure'"
```

---

### `scai data migrate --where`

Migrate data from the source system to Snowflake for a filtered subset of tables. Available only for full migration projects (SQL Server, Redshift).

```bash
scai data migrate --where <WHERE_CLAUSE> [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter tables from the Code Unit Registry |
| `-s, --source-connection <NAME>` | Source connection to extract data from |
| `-c, --connection <NAME>` | Snowflake connection to migrate data to |

**Examples:**

```bash
# Migrate only tables in the 'public' schema
scai data migrate --where "source.schema = 'public'"

# Migrate a specific table
scai data migrate --where "source.name = 'customers'"

# Migrate tables from a specific database
scai data migrate --where "source.database = 'retail_db'" --source-connection my-redshift
```

> **Note:** `--where` and `--selector` serve the same purpose (filtering tables) but for different project types. Use `--where` for full migration projects that have a Code Unit Registry. Use `--selector` for code-conversion-only projects.

---

### `scai data validate --where`

Compare data between source and Snowflake for a filtered subset of tables. Available only for full migration projects (SQL Server, Redshift).

```bash
scai data validate --where <WHERE_CLAUSE> [OPTIONS]
```

| Flag | Description |
| --- | --- |
| `--where <WHERE_CLAUSE>` | SQL-like WHERE clause to filter tables from the Code Unit Registry |
| `-s, --source-connection <NAME>` | Source connection for validation |
| `-c, --connection <NAME>` | Snowflake connection for validation |
| `-d, --target-database <DATABASE>` | Target Snowflake database for validation |
| `-m, --db-mapping <MAPPING>` | Database name mapping (`source:target`) |
| `-e, --schema-mapping <MAPPING>` | Schema name mapping (`source:target`) |

**Examples:**

```bash
# Validate tables in the 'public' schema
scai data validate --where "source.schema = 'public'"

# Validate a specific table after migration
scai data validate --where "source.name = 'customers'"

# Validate with name mappings
scai data validate --where "source.schema = 'dbo'" --db-mapping "mydb:MY_DB" --schema-mapping "dbo:PUBLIC"
```

---

### Common `--where` scenarios

#### Re-process objects that failed conversion

After running `scai code convert`, some objects might have failed. Target just those for AI conversion:

```bash
# First, see which objects failed
scai code find --where "codeStatus.conversion.status = 'failed'"

# Send only the failed ones for AI conversion
scai ai-convert start --where "codeStatus.conversion.status = 'failed'" -w
```

#### Run AI conversion on objects that haven’t been AI-verified yet

If you’ve already run AI conversion on some objects but not others, target the ones still pending:

```bash
# Find objects that haven't gone through AI verification
scai code find --where "codeStatus.aiVerification.status = 'pending'"

# Convert just those
scai ai-convert start --where "codeStatus.aiVerification.status = 'pending'" -w
```

#### Focus on a specific object type in a specific schema

```bash
# Preview: all procedures in the dbo schema
scai code find --where "source.schema = 'dbo' AND source.objectType = 'procedure'"

# Run AI conversion on them
scai ai-convert start --where "source.schema = 'dbo' AND source.objectType = 'procedure'" -w
```

#### Deploy only tables, then only procedures with dependencies

```bash
# Deploy tables first
scai code deploy --where "target.objectType = 'table'"

# Then deploy procedures, pulling in any remaining dependencies
scai code deploy --where "target.objectType = 'procedure'" --include-dependencies
```

#### Incremental deployment of a single schema

```bash
# Deploy everything in the 'sales' schema
scai code deploy --where "source.schema = 'sales'"
```

### Things to keep in mind

* **`--where` and `--selector` can’t be combined** (on `scai ai-convert start` and `scai ai-convert accept`). Use `--selector` for code-conversion-only projects with a short list, or `--where` for full migration projects with expressive filtering.
* **`--where` and `--selector` serve the same purpose** on `scai data migrate` and `scai data validate`. Use `--where` for full migration projects; use `--selector` for code-conversion-only projects.
* **The registry must exist.** You need to have run at least `scai code add` or `scai code extract` before `--where` will work. Otherwise you’ll get a “registry not found” error.
* **Use `scai code find` to preview.** Always test your filter with `scai code find --where "..."` before running a deployment or AI conversion job.
* **`scai code where` is the definitive reference.** The field list in this document covers the most common fields. Run `scai code where` for the full, always-up-to-date list of fields, operators, and examples.

---

## AI Convert Quick Guide

Use `scai ai-convert` to improve your converted code with AI. It uploads your converted SQL to Snowflake, analyzes it for functional equivalence issues, generates regression tests, and produces improved code.

**Supported languages:** SQL Server, Redshift, BigQuery, PostgreSQL

### Before you start

* A project initialized with `scai init`
* Code already converted with `scai code convert`
* A Snowflake connection configured via `snow connection add`
* `CREATE MIGRATION` privilege on your Snowflake account
* A warehouse set in the connection

### Basic usage

```bash
# Convert all objects (default behavior)
scai ai-convert start

# Convert and wait for it to finish (can take minutes to hours depending on code size)
scai ai-convert start -w

# Skip the disclaimer prompt (handy for CI/CD)
scai ai-convert start -y -w

# Convert only specific objects by name
scai ai-convert start -o MY_PROC,MY_VIEW

# Use an instructions file for source system verification
scai ai-convert start -i config/instructions.yml
```

For filtering objects with `--where`, see the Filtering Objects with the `--where` Clause section above.

### Managing jobs

Once a job is running, you’ve got a few commands to work with:

```bash
# Check the last job's status
scai ai-convert status

# Check a specific job
scai ai-convert status JOB_20260310_ABC

# Wait for a job to finish and download results
scai ai-convert status -w

# List all jobs for this project
scai ai-convert list

# Cancel a running job
scai ai-convert cancel
```

### Accepting AI fixes

After a job completes, review what the AI suggested and decide what to keep:

```bash
# Preview what changed (default -- no files modified)
scai ai-convert accept --summary

# Review each fix interactively (accept, skip, or diff)
scai ai-convert accept -i

# Accept everything at once
scai ai-convert accept --all

# JSON output for automation
scai ai-convert accept --summary --json
```

### Output structure

Results land in the `ai-converted/` directory inside your project:

```none
ai-converted/
  └── JOB_<timestamp>_<id>/
      ├── fixed/           AI-improved SQL files organized by object type/schema
      └── tests_sql/       Generated regression tests organized by database/schema
```

### All `ai-convert start` options

| Flag | Short | Description |
| --- | --- | --- |
| `--connection` | `-c` | Snowflake connection to use |
| `--objects` | `-o` | Comma-separated object names, or `'all'` (default). Cannot be combined with `--where`. |
| `--where` |  | SQL-like WHERE clause to filter objects (see Filtering Objects) |
| `--instructions` | `-i` | Path to instructions file for custom config |
| `--watch` | `-w` | Wait for completion and show progress |
| `--accept-disclaimers` | `-y` | Skip disclaimer prompt |
| `--warehouse` |  | Override warehouse (in-memory only) |
| `--schema` |  | Override schema (in-memory only) |
| `--role` |  | Override role (in-memory only) |
| `--database` |  | Override database (in-memory only) |

---

## Workflow Examples

### Example 1: Migrate Oracle Stored Procedures

```shell
# Create project
scai init oracle-migration -l Oracle
cd oracle-migration

# Add your PL/SQL files
scai code add -i ./oracle-procs/

# Convert
scai code convert

# Review converted code in converted/ folder, then deploy
scai code deploy --all
```

### Example 2: SQL Server End-to-End with Specific Schema

```shell
# Create project
scai init sqlserver-migration -l SqlServer
cd sqlserver-migration

# Add connection
scai connection add-sql-server

# Extract only the 'sales' schema
scai code extract --schema sales

# Convert
scai code convert

# Deploy
scai code deploy
```

### Example 3: AI Convert After Code Conversion

```shell
# Create project
scai init ai-convert-demo -l SqlServer
cd ai-convert-demo

# Add connection
scai connection add-sql-server

# Extract and convert
scai code extract
scai code convert

# Start AI code conversion and wait for completion
scai ai-convert start -w

# Review last executed job results
scai ai-convert status

# Review executed job list
scai ai-convert list
```

### Example 4: Selective Migration Using `--where`

```shell
# Create project and convert
scai init selective-demo -l SqlServer
cd selective-demo
scai connection add-sql-server
scai code extract
scai code convert

# Preview what failed conversion
scai code find --where "issues IS NOT NULL AND issues != '[]'"

# Send failed objects through AI conversion
scai ai-convert start --where "issues IS NOT NULL AND issues != '[]'" -w -y

# Deploy only tables first
scai code deploy --where "target.objectType = 'table'"

# Deploy procedures with their dependencies
scai code deploy --where "target.objectType = 'procedure'" --include-dependencies
```

---

## Getting Help

Use –help with any command to see available options:

```shell
scai --help
scai init --help
scai code convert --help
scai code where
scai connection add-redshift --help
```

## Troubleshooting

“Project file not found”
You must run commands from within a project directory. Navigate to your project folder (where the .scai/ directory exists) before running commands:

```shell
cd <project-folder>
scai code convert
```

“Connection not found” (source database)

1. List your connections: scai connection list -l <language>
2. Add a connection if needed: scai connection add-sql-server or scai connection add-redshift
3. Or set a default: scai connection set-default -l <language> –connection-name <name>

“Authentication failed” for Snowflake

The SCAI CLI uses your Snowflake CLI configuration. Ensure your connection is working:

Make sure you have a default Snowflake connection configured in the Snowflake CLI (used when no connection name is specified).

```shell
# List available Snowflake connections
snow connection list

# Test your connection
snow connection test

# Add a new connection if needed
snow connection add
```

“Registry not found” when using `--where`

The `--where` flag requires a Code Unit Registry, which is created when you run `scai code add` or `scai code extract`. Make sure you’ve run one of those commands before using `--where`:

```shell
# For file-based projects
scai code add -i /path/to/source

# For SQL Server / Redshift extraction
scai code extract
```

## Supported Source Dialects

| Dialect | Extract | Convert | Deploy |
| --- | --- | --- | --- |
| SQL Server | X | X | X |
| Redshift | X | X | X |
| Oracle |  | X |  |
| Teradata |  | X |  |
| BigQuery |  | X |  |
| Databricks |  | X |  |
| Greenplum |  | X |  |
| Sybase |  | X |  |
| PostgreSQL |  | X |  |
| Netezza |  | X |  |
| Spark |  | X |  |
| Vertica |  | X |  |
| Hive |  | X |  |
| DB2 |  | X |  |

## Complete CLI Reference

For quick reference, here is every top-level command and subcommand available in the `scai` CLI:

| Command | Subcommands | Description |
| --- | --- | --- |
| `scai init` |  | Create a new migration project |
| `scai project` | `info`, `set-default-connection` | View and manage project configuration |
| `scai connection` | `add-sql-server`, `add-redshift`, `set-default`, `list`, `test` | Manage source database connections |
| `scai code` | `add`, `extract`, `convert`, `deploy`, `find`, `accept`, `where`, `resync` | Manage code migration operations |
| `scai ai-convert` | `start`, `status`, `cancel`, `list`, `accept` | AI-powered conversion improvement |
| `scai data` | `migrate`, `validate` | Data migration and validation |
| `scai test` | `seed`, `capture`, `validate` | Generate and run test cases for stored procedures |
| `scai object-selector` | `create` | Generate selector files for filtering objects |
| `scai query` |  | Execute SQL queries on source database systems |
| `scai license` | `install` | Manage offline license operations |
| `scai terms` | `accept` | View and accept terms and conditions |
| `scai logs` |  | Show log directory and recent log files |

---
title: Workspace Estimator
source: https://docs.snowflake.com/en/migrations/sma-docs/workspace-estimator/overview.md
section: Migrations
---

# Workspace Estimator

The Workspace Estimator connects to your Databricks workspace, collects usage data, and generates a cost comparison for running the same workloads on Snowflake. It analyzes the following areas:

1. **Infrastructure inventory** — node types, cluster configurations
2. **Usage patterns** — cluster events, lifecycle data
3. **Workload analysis** — job definitions, execution history
4. **Performance metrics** — run statistics, resource consumption
5. **SQL analytics** — warehouse configurations, query history
6. **Data pipelines** — DLT pipeline configurations and performance

You can run the Workspace Estimator in two ways:

## SMA CLI

The SMA CLI includes the Workspace Estimator as a built-in command. This is the recommended way to run it. Use the `sma we dbx run` command to connect to your Databricks workspace, extract metadata, and upload the results to Snowflake in a single step.

For full usage instructions, command options, and examples, see [the SMA CLI walkthrough](../use-cases/sma-cli-walkthrough.md).

## Jupyter notebook

The Workspace Estimator is also available as a Jupyter notebook hosted on the [Snowflake Labs GitHub repository](https://github.com/Snowflake-Labs/Workspace-Estimator). This version is in maintenance mode and receives bug fixes only. The SMA CLI is the recommended path for new users.

## Programmatic Access

REST APIs and programmatic interfaces for Snowflake.

---
title: About managing listings using SQL
source: https://docs.snowflake.com/en/progaccess/listing-progaccess-about.md
section: Programmatic Access
---

# About managing listings using SQL

Providers can use listings to share data products with accounts in any Snowflake region. To learn more about listings,
see [About sharing with listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about).

Providers can use SQL commands to create and manage listings and offer them to specific consumers. To share a listing using SQL, providers complete the following tasks:

* (Optional) Create a Provider Profile to offer listings. See [Use listings as a provider](../collaboration/provider-becoming.md).
* Define a listing manifest.
* Create a listing using SQL.
* Publish a listing using SQL.

> **Note:**
>
> Providers can’t offer paid, personalized listings, or listings on private data exchanges.

## Prerequisites for working with listings using SQL

* [Review and accept the Snowflake Provider and Consumer Terms](../collaboration/provider-becoming.md)

  You don’t need to accept the Snowflake Provider and Consumer Terms if you’re creating free private listings and
  you’ve accepted the [Snowflake Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/legal/data-sharing-terms/).
* Prepare the data for your listing. See [Prepare data for a listing](../collaboration/provider-listings-preparing.md).
* Review the [Provider Policies](https://www.snowflake.com/provider-policies/).
* Configure account privileges.

## Listing and application owner roles

When you create a listing, you create it from the account that has the data or application package in it. The role that attaches a data
product to a listing and publishes the listing must be the same role that created, and therefore owns, the application package or share.
You cannot transfer the OWNERSHIP privilege for a share.

If you use a different role to create and manage the listing, grant the MODIFY privilege on the listing to the role
that owns the application package or share. For example:

Share or application package owner role:
:   OWNERSHIP privilege on the share or application package.
    MODIFY privilege on the listing.

Listing owner role:
:   OWNERSHIP privilege on the listing.

    Global CREATE LISTING privilege.

Within the provider account, you can use one of the following to create and manage listings:

ACCOUNTADMIN:
:   If you use the ACCOUNTADMIN role to create and manage listings, the ORGADMIN role must first
    [Delegate privileges to set up auto-fulfillment](../collaboration/provider-listings-auto-fulfillment-manage-privileges.md).

Custom role:
:   If you use a custom role, the ORGADMIN role must first [Delegate privileges to set up auto-fulfillment](../collaboration/provider-listings-auto-fulfillment-manage-privileges.md)
    to the ACCOUNTADMIN role, which can then be used to grant the relevant privileges to the custom role.

For more information about granting sharing privileges, see [Granting Privileges to Other Roles:](../user-guide/data-exchange-marketplace-privileges.md).

## Define a listing manifest

To create a listing you must first create a listing manifest. Manifests are written in YAML
(<https://yaml.org/spec/>), and include a prefix and required and optional fields.

For example, to create a simple titled listing with listing terms, define a manifest similar to:

```yaml
title: A title for the listing.
subtitle: An optional subtitle.
description: A general description.
profile: Provider profile reference.
listing_terms: ...
targets: ...
```

Each manifest then includes additional sections, such as:

```yaml
auto_fulfillment: ...
```

And a number of optional fields, such as `data_dictionary`, `business_needs`, and more.

A simple manifest would include:

```yaml
title: "MyFirstListing"
subtitle: "Example listing"
description: "This is my first listing!"
listing_terms:
  type: "OFFLINE"
targets:
   accounts: ["Org1.Account1"]
```

For more information, see [Listing manifest reference](listing-manifest-reference.md).

For additional examples and use cases associated with managing listings using SQL see [Manage listings with SQL as a provider - examples](listing-progaccess-examples.md).

## Create a listing using SQL

To create a listing, you use the [CREATE LISTING](../sql-reference/sql/create-listing.md) command, specifying a name and the listing details inline in a YAML manifest that describes the listing.
Listings created using CREATE LISTING … are automatically published.

After a listing is created, you can alter it using [ALTER LISTING](../sql-reference/sql/alter-listing.md), which includes unpublish and publish
support. Additionally, listings can be [described](../sql-reference/sql/desc-listing.md), [shown](../sql-reference/sql/show-listings.md),
[published and unpublished](../sql-reference/sql/alter-listing.md), and [dropped](../sql-reference/sql/drop-listing.md).

> **Note:**
>
> Creating a listing using SQL is conceptually similar to [Share data or apps with specific consumers using a private listing](https://other-docs.snowflake.com//collaboration/provider-listings-creating-publishing#label-listings-create). You should be familiar and comfortable
> with creating, viewing, and publishing listings using [Snowsight](../user-guide/ui-snowsight-gs.md) and Provider Studio before creating listings using SQL.
> For more information, see [Share data or apps with specific consumers using a private listing](https://other-docs.snowflake.com//collaboration/provider-listings-creating-publishing#label-listings-create).

Before you create your listing, ensure you have completed all prerequisites.

For example, if you want to create a DRAFT listing `my1stlisting` from share `myshare` with title “My first SQL listing”, execute the following command:

```sqlexample
CREATE EXTERNAL LISTING my1stlisting
SHARE myshare AS
$$
 title: "My first SQL listing"
 description: "This is my first listing"
 listing_terms:
   type: "OFFLINE"
 targets:
   accounts: ["Org1.Account1"]
$$ PUBLISH=FALSE REVIEW=FALSE;
```

> **Note:**
>
> Listings are identified using the listing’s *NAME*. A listing *NAME* is the identifier used when initially creating the listing.
> In the example above, the listing name is MY1STLISTING. While title, subtitle and other listing characteristics can be altered,
> *NAME* cannot be altered by specifying a new name in yaml. Use [ALTER LISTING … RENAME TO](../sql-reference/sql/alter-listing.md) to rename a listing.
> Commands such as [ALTER LISTING](../sql-reference/sql/alter-listing.md), [SHOW LISTINGS](../sql-reference/sql/show-listings.md), [DESCRIBE LISTING](../sql-reference/sql/desc-listing.md), and [DROP LISTING](../sql-reference/sql/drop-listing.md)
> all use *NAME* to identify a listing.
> Listing *NAME* is not shown in Snowsight, which identifies listings by title.

For additional examples and use-cases associated with managing listings using SQL see [Manage listings with SQL as a provider - examples](listing-progaccess-examples.md).

## Publish a listing using SQL

You can publish and un-publish listings using [ALTER LISTING … PUBLISH](../sql-reference/sql/alter-listing.md) and ALTER LISTING … UNPUBLISH.

For more information about publishing listings using Snowsight, see [Publish a listing](https://other-docs.snowflake.com//collaboration/provider-listings-creating-publishing#label-listings-create-publish).

Note that listings can be automatically published when created using [CREATE LISTING](../sql-reference/sql/create-listing.md).

For example, to publish the previously unpublished listing, execute the following command:

```sqlexample
ALTER LISTING MY1STLISTING PUBLISH;
```

Additionally, before a listing can be dropped, it must be un-published. To
un-publish a previously published listing, execute a command similar to:

```sqlexample
ALTER LISTING MY1STLISTING UNPUBLISH;
```

For additional examples and use-cases associated with managing listings using SQL see [Manage listings with SQL as a provider - examples](listing-progaccess-examples.md).

## Expand the definition of a listing using SQL

The previous example did not include targets or usage examples. You can use the [ALTER LISTING](../sql-reference/sql/alter-listing.md) to alter a listing’s characteristics.
In this example, we update an existing listing to add targets and example SQL. Note that the original YAML manifest is extended to include
new content.

To alter a listing to include additional fields execute a command similar to:

```sqlexample
ALTER LISTING MY1STLISTING AS
$$
   title: "My First SQL Listing"
   description: "This is my first listing"
   listing_terms:
     type: "OFFLINE"
   targets:
     accounts: ["Org1.Account1"]
   usage_examples:
     - title: "this is a test sql"
       description: "Simple example"
       query: "select *"
$$;
```

For additional examples and use-cases associated with managing listings using SQL see [Manage listings with SQL as a provider - examples](listing-progaccess-examples.md).

## Examine listings using SQL

Much like tables and other SQL elements, listings can be described and shown. [DESCRIBE LISTING](../sql-reference/sql/desc-listing.md) takes a single listing name
as a parameter and provides details about that listing. [SHOW LISTINGS](../sql-reference/sql/show-listings.md) can provide information about a group of listings, using a
`LIKE` filter, or all listings created by a given account if no filter is provided.

To show the details of the MY1STLISTING listing, execute a command similar to:

```sqlexample
SHOW LISTINGS LIKE 'MY1STLISTING';
```

To show all listings that your role has access to, execute a command similar to:

```sqlexample
SHOW LISTINGS;
```

To [describe](../sql-reference/sql/desc-listing.md) the listing MY1STLISTING, execute a command similar to:

```sqlexample
DESC LISTING MY1STLISTING;
```

## Drop listings using SQL

To remove a listing, you must first un-publish the listing. You should be familiar with removing listings using Snowsight
before dropping listings using SQL. For more information about removing listings using Snowsight,
see [Removing listings as a provider](https://other-docs.snowflake.com/en/collaboration/provider-listings-removing).

To un-publish a listing, execute a command similar to:

```sqlexample
ALTER LISTING MY1STLISTING UNPUBLISH;
```

To drop a listing, execute a command similar to:

```sqlexample
DROP LISTING IF EXISTS MY1STLISTING
```

---
title: Listing manifest reference
source: https://docs.snowflake.com/en/progaccess/listing-manifest-reference.md
section: Programmatic Access
---

# Listing manifest reference

Creating Snowflake listings programmatically requires a manifest, written in YAML
(<https://yaml.org/spec/>). Use the information provided here to learn about the manifest format and its individual
sections.

See also:
:   [CREATE LISTING](../sql-reference/sql/create-listing.md), [ALTER LISTING](../sql-reference/sql/alter-listing.md), [DESCRIBE LISTING](../sql-reference/sql/desc-listing.md), [SHOW LISTINGS](../sql-reference/sql/show-listings.md), [DROP LISTING](../sql-reference/sql/drop-listing.md)

> **Note:**
>
> Fields can be any of:
>
> * Optional - Optional for either marketplace listings or private listings.
> * Required - Required for either marketplace listings or private listings.
> * Qualified - requirements differ for marketplace listings or private listings and optional vs required is qualified by listing type.
>   For example optional for private listings, but required for marketplace listings.

The general format of a listing manifest is:

```yaml
#
# Listing prefix
#
title: <listing title>
subtitle: <Optional listing subtitle>
description: <listing description>
profile : <Optional name of the provider profile>

listing_terms:
  - # Required listing terms that the consumer must sign.
targets:
  - # Required <List> Consumer accounts to target with this private listing.
auto_fulfillment:
  - # Required when the target accounts are outside the provider's region, otherwise optional.
resharing:
  # Optional; Controls whether the listing can be reshared by consumers.
business_needs:
  - # Optional <List> BusinessNeed elements; maximum 6.
categories:
  - # Optional <List> The category or area the listing belongs to, maximum 1.
cke_content_protection:
  - # Optional <List> CKE content protection elements; maximum 1.
compliance_badges:
  - # Optional <List> Compliance badges; maximum 6.
data_attributes:
  - # Optional <Name Value pairs> DataAttributes elements; maximum 1.
data_dictionary:
  - # Required for public listings and optional for all other listing types.
data_preview:
  - # Required for public listings and optional for all other listing types.
draft_access_type:
  - # Required <String> for "by request" listings.
locations:
  - # Optional list of regions to share into.
monetization_display_order:
  - # Optional <List> MonetizationDisplayOrder elements.
offers:
  - # Optional <List> Offer elements; maximum 100.
pricing_plans:
  - # Optional <List> PricingPlan elements; maximum 100.
resources:
  - # Optional for private listings; required for marketplace listings, <Name Value pairs> Resources elements such as documentation and media.
trial_details:
  - # Optional <Name Value pairs> Provides details about a trial listing.
usage_examples:
  - # Optional <List> UsageExample elements; maximum 10.
```

The following sections detail each listing manifest field and child field and provide associated examples.

## Listing prefix

Each listing manifest starts with the following fields:

* `title` (String, required, maximum length 110): Listing title.
* `subtitle` (String, optional for private, required for marketplace listings, maximum length 110): Listing subtitle.
* `description` (String, required, maximum length 7500): Listing description. Markdown syntax is supported.
* `custom_contact` (String, optional): Email. Must be a valid, well formed email address.
* `profile` (String, optional for private listings, required for marketplace listings): Name of an approved provider profile.

For more information, refer to: [Provider basic information](https://other-docs.snowflake.com/collaboration/provider-listings-reference#label-configuring-metadata-for-data-listing).

> **Note:**
>
> Values for `profile` can be found by executing `show profiles in data exchange SNOWFLAKE_DATA_MARKETPLACE;`.

### Listing prefix example

```yaml
title: Weather information
subtitle: Historical weather by postcode.
description: This listing includes historical weather data by post code.
profile: My provider profile
```

## `listing_terms`

The **required** `listing_terms` (required) field contains the following name value pairs:

* `listing_terms.type` (enum, required): must be one of:

  + `STANDARD` - Refers to the Standard Agreement for Marketplace Products.
  + `OFFLINE` - Indicates that terms are negotiated offline by the parties.
  + `CUSTOM` - When specified, must specify a value for `listing_terms.link`.
* `listing_terms.link` (required when type is CUSTOM): A fully qualified link to the provider’s listing terms, must start with `http` or `https`.

For more information, refer to **Terms of Service** in the table in [Basic information](https://other-docs.snowflake.com/collaboration/provider-listings-reference#label-configuring-metadata-for-data-listing).

> **Note:**
>
> Consumers can accept listing terms programmatically. For more information contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

### `listing_terms` example

```yaml
. . .
listing_terms:
  type: "CUSTOM"
  link: "http://example.com/my/listing/terms"
. . .
```

## `targets`

> **Note:**
>
> This field can only be used with [V1 listings](../collaboration/collaboration-listings-about.md).

The `targets` field is required for marketplace and private listings.

Contains a list, maximum of 100 elements:

* `targets.accounts` (Required if `targets.region` not present): List of accounts with which to share the listing.

  Each target account must be in `<OrgName>.<AccountName>` format, where:

  + `OrgName` can be obtained using [SELECT CURRENT_ORGANIZATION_NAME();](../sql-reference/functions/current_organization_name.md).
  + `AccountName` can be obtained from account_name using [SHOW ACCOUNTS](../sql-reference/sql/show-accounts.md) or using Snowsight.

or

* `targets.regions` (Required if `targets.accounts` not present):

  List of regions with which to share the listing.

  Each target region must be of the form “region_groups_type.snowflake_region”.
  In addition, “ALL” is supported for including all regions.

  For example “PUBLIC.AWS_US_EAST_1”.

  For a complete list of region group types and Snowflake regions execute:

  ```sqlexample
  SHOW REGIONS IN DATA EXCHANGE SNOWFLAKE_DATA_MARKETPLACE;
  ```

For more information, see [Business needs](../collaboration/provider-listings-reference.md).

### `targets` examples

Define a set of target accounts for this listing.

```yaml
. . .
targets:
   accounts: ["Org1.Account1", "Org2.Account2"]
. . .
```

Define a set of target regions for this listing.

```yaml
. . .
targets:
   regions: ["PUBLIC.AWS_US_EAST_1", "PUBLIC.AZURE_WESTUS2"]
. . .
```

## `auto_fulfillment`

Cross-Cloud Auto-fulfillment allows the data product associated with a listing
to be automatically fulfilled to other Snowflake regions.
The `auto_fulfillment` field defines how that auto-fulfillment takes place.

For more information on Cross-Cloud Auto-fulfillment, see [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md).

Auto-fulfillment is only required if you’re sharing data to multiple regions.
Do not enable it if you are sharing to accounts in the same region.

If you share data across multiple regions, the `auto_fulfillment` is:

* Required if your data product is an application package.
* Required if your data product is shared through a private listing.
* Recommended if your data product is shared through a public listing.

Contains the following name value pairs:

* `auto_fulfillment.refresh_schedule`

  + `<num> MINUTE` - Number of minutes. Minimum 10 minutes, maximum 8 days, or 11520 minutes.

    If `refresh_type` is specified as `SUB_DATABASE_WITH_REFERENCE_USAGE`, do not include this setting.
    The refresh schedule for application packages must be defined at the account level and cannot specified at the listing level.

    For more information see [Set the account-level refresh interval](../collaboration/provider-listings-auto-fulfillment-set-refresh-interval.md).

* `USING CRON <expression>` - Defines the data product auto-fulfillment refresh schedule.

  > The syntax for `USING CRON` and `REPLICATION SCHEDULE` are the same. See [Parameters](../sql-reference/sql/create-replication-group.md).
* `auto_fulfillment.refresh_type` (required when using `auto_fulfillment`): Must be one of -

  + `SUB_DATABASE` - database replication (object level) - recommended.
  + `SUB_DATABASE_WITH_REFERENCE_USAGE` - application package.
  + `FULL_DATABASE` - database replication (for the entire database). (Deprecated.)
* `auto_fulfillment.refresh_schedule_override` (optional): Overrides the defined update refresh frequency for all listings that use the same database. When this value is `FALSE`, listing updates fail when multiple listings sharing the same database have different refresh frequencies.

  + `TRUE` - enables the refresh frequency override.
  + `FALSE` - (default) disables the refresh frequency override.
* `auto_fulfillment.warehouse` (optional): The name of the warehouse used to create and refresh hidden dynamic tables for
  cross-region resharing. This warehouse is used only for resharing maintenance operations. Required when the listing is a reshared
  listing. Can be omitted for non-reshared listings.

See also [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md).

### `auto_fulfillment.refresh_schedule` examples

The following example refreshes the data product associated with a listing every 10 minutes:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: 10 MINUTE
  refresh_type: SUB_DATABASE
. . .
```

The following example refreshes the data product associated with a listing on specific days and times in specific regions:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: USING CRON  0 17 * * MON-FRI Europe/London
  refresh_type: SUB_DATABASE
. . .
```

The following example enables the refresh frequency override for listings that share the same database but have different refresh frequencies:

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_schedule: 10 MINUTE
  refresh_type: SUB_DATABASE
  refresh_schedule_override: TRUE
. . .
```

### Snowflake Native App `auto_fulfillment` example

`SUB_DATABASE_WITH_REFERENCE_USAGE` can only be used with application packages
and cannot be combined with `auto_fulfillment.refresh_schedule`.

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_type: SUB_DATABASE_WITH_REFERENCE_USAGE
. . .
```

### Object level `auto_fulfillment` example

```yaml
. . .
listing_terms: . . .
. . .
auto_fulfillment:
  refresh_type: SUB_DATABASE
. . .
```

## `resharing`

The `resharing` field controls whether consumers of the listing can reshare the data product with other accounts.

For more information on resharing, see [Resharing listings](../collaboration/reshare-listings.md).

Contains the following name value pairs:

* `resharing.enabled` (optional): Enables or disables resharing for the listing.

  + `true` - allows consumers to reshare the listing data.
  + `false` - (default) prevents consumers from resharing the listing data.

> **Note:**
>
> Only external listings support the `resharing` property. You can’t enable resharing on internal (organizational) listings.

### `resharing` example

The following example enables resharing on a listing:

```yaml
. . .
listing_terms: . . .
. . .
resharing:
  enabled: true
. . .
```

## `business_needs`

Listings are grouped by business needs for easy discovery.
Business need is a describes how a specific listing meets a given business need.
For more information, see [Business needs](../collaboration/provider-listings-reference.md).

### STANDARD business needs

* `business_needs.name` (required when using `business_needs`):

  Valid values include:

  + “360-Degree Customer View”
  + “Supply Chain”
  + “Personalize Customer Experiences”
  + “Inventory Management”
  + “Accelerating Advertising Revenue”
  + “Attribution Analysis”
  + “Contact Data Enrichment”
  + “Foot Traffic Analytics”
  + “Audience Segmentation”
  + “Sentiment Analysis”
  + “ESG Investment Analysis”
  + “Fundamental Analysis”
  + “Quantitative Analysis”
  + “Risk Analysis”
  + “Fraud Remediation”
  + “Customer Onboarding”
  + “Identity Resolution”
  + “Asset Valuation”
  + “Economic Impact Analysis”
  + “Demand Forecasting”
  + “Population Health Management”
  + “Real World Data (RWD)”
  + “Location Planning”
  + “Regulatory Reporting”
  + “Subscriber Acquisition and Retention”
  + “Life Sciences Commercialization”
  + “Patient 360”
  + “Blockchain Analysis”
  + “Customer Acquisition”
  + “Data Quality and Cleansing”
  + “Location Data Enrichment”
  + “Location Geocoding”
  + “Machine Learning”
  + “Market Analysis”
  + “Pricing Analysis”
  + “Audience Activation”

`business_needs[].description` (required when using `business_needs`): Description of associated business_needs.name, maximum length 1000.

`business_needs[].type`: STANDARD (optional).

### CUSTOM business needs

Custom business needs include a user defined `name`, `description`, and required `type` field with value CUSTOM.

`business_needs.name` (required when using `business_needs`): User defined name.

`business_needs[].description` (required when using `business_needs`): Description of associated business_needs.name, maximum length 1000.

`business_needs[].type`: CUSTOM (required when defining custom business needs).

### `business_needs` examples

Standard without optional `type`

```yaml
. . .
business_needs:
 - name: "Real World Data (RWD)"
   description: "Global weather data"
. . .
```

Standard with optional `type`

```yaml
. . .
business_needs:
 - name: "Real World Data (RWD)"
   description: "Global weather data"
   type: STANDARD
. . .
```

Custom with required `type`

```yaml
. . .
business_needs:
 - name: "Machinery Maintenance"
   description: "Repair and maintenance data for machinery"
   type: CUSTOM
. . .
```

## `categories`

The `categories` field specifies the category or area the listing belongs to.
Categories are optional for private listings but required for marketplace listings.

Categories are used in Snowflake Marketplace to browse listings by area and help consumers find your data.

The `categories` field is a list, but can only contain a single entry, from the set below:

* BUSINESS
* CONNECTORS
* DEMOGRAPHICS
* ECONOMY
* ENERGY
* ENVIRONMENT
* FINANCIAL
* GOVERNMENT
* HEALTH
* IDENTITY
* LEGAL
* LOCAL
* LOOKUP_TABLES
* MARKETING
* MEDIA
* SECURITY
* SPORTS
* TRANSPORTATION
* TRAVEL
* WEATHER

### `categories` example

```yaml
. . .
categories:
 - ECONOMY
. . .
```

## `cke_content_protection`

The `cke_content_protection` field is used to protect the content of a Cortex Knowledge Extension (CKE). Using this field, providers can restrict the amount of content a consumer can access. The threshold limits the percentage of the indexed corpus that a consumer can retrieve within a rolling 24-hour period. When a consumer exceeds the configured threshold, subsequent queries to the CKE are blocked until the window resets, and the consumer receives an error indicating that they’ve reached the content protection threshold.

The `cke_content_protection` field contains the following entries:

* `enable`: Indicates whether content protection is enabled.

  + `TRUE` - Content protection is enabled.
  + `FALSE` - Content protection is disabled.
* `threshold`: The threshold for content protection when content protection is enabled. This indicates the percentage of the indexed corpus that any one consumer can retrieve within a rolling 24-hour period. This can be a value between 0 and 1.

### `cke_content_protection` example

```yaml
. . .
cke_content_protection:
  enable: TRUE
  threshold: 0.2
. . .
```

## `compliance_badges`

The `compliances_badges` field is used to indicate that a listing was reviewed by a third-party auditor and was certified as compliant with a specific standard or regulation.

When you configure a compliance badge, you can specify up to six types. Include the expiration date for each badge and the accompanying third-party certification documentation.

The following fields are used to configure a compliance badge:

* `compliance_badges`: Used to declare and configure a compliance badge for a listing. Providers can declare multiple compliance certifications within the `compliance_badges` property.

  + `type`: The compliance certification being requested. The following list shows the possible values:

    - `SOC2`
    - `FEDRAMP`
    - `GDPR`
    - `HIPAA`
    - `ISO27001`
    - `PCIDSS`
  + `expiry`: The date when the compliance certification expires.
  + `files`: The list of files that are used to verify the compliance certification.

For more information, see [Listing compliance badges](../collaboration/provider-becoming.md).

### `compliance_badges` example

```yaml
. . .
compliance_badges:
  - type: SOC2
    expiry: 12-25-2026
    files:
      - soc2_compliance_verification.pdf
  - type: HIPAA
    expiry: 06-07-2026
    files:
      - hipaa_compliance_verification.pdf
  - type: FEDRAMP
    expiry: 03-15-2027
    files:
      - fedramp_compliance_verification.pdf
. . .
```

## `data_attributes`

Data attributes provide consumers insight into information about the listing such as refresh rate and other characteristics.

The `data_attributes` field is optional for private listings but required for marketplace listings.

For additional information on data product attributes, see [Data product - attributes](../collaboration/provider-listings-reference.md).

Contains the following name value pairs:

* `data_attributes.refresh_rate` (required for data listings; optional for app listings)

  Specifies how often your data product is updated in Snowflake.

  One of:

  + CONTINUOUSLY
  + HOURLY
  + DAILY
  + WEEKLY
  + MONTHLY
  + QUARTERLY
  + ANNUALLY
  + STATIC
* `data_attributes.geography` (required), containing:

  Specifies the geographic regions for which your data product has coverage.

  > + `granularity` (string, required)
  >
  >   Geographic coverage of your dataset.
  >
  >   One of:
  >
  >   - LATITUDE_LONGITUDE
  >   - ADDRESS
  >   - POSTAL_CODE
  >   - CITY
  >   - COUNTY
  >   - STATE
  >   - COUNTRY
  >   - REGION_CONTINENT
  > + `geo_option` (string, required)
  >
  >   One of:
  >
  >   - NOT_APPLICABLE
  >   - GLOBAL
  >   - COUNTRIES
  > + `coverage` (required based on selection of `geo_option`), containing either :
  >
  >   - `states` (list of states) containing any list of valid U.S. state names.
  >
  >   Or
  >
  >   - `continents` (list of continents):
  >
  >     Any of:
  >
  >     * ASIA
  >     * EUROPE
  >     * AFRICA
  >     * NORTH AMERICA
  >     * SOUTH AMERICA
  >     * OCEANIA
  >     * ANTARCTICA
  > + `time` (required) containing:
  >
  >   Specifies the time period that your data product covers.
  >
  >   - `granularity` (required)
  >
  >   One of:
  >
  >   - EVENT_BASED
  >   - HOURLY
  >   - DAILY
  >   - WEEKLY
  >   - MONTHLY
  >   - YEARLY
  >   - `time_range` (required) containing the following name/value pairs:
  >
  >     * `time_frame` (required)
  >
  >       One of:
  >
  >       + NEXT
  >       + LAST
  >       + BETWEEN
  >     * `unit` (required)
  >
  >       > One of:
  >       >
  >       > + DAYS
  >       > + WEEKS
  >       > + MONTHS
  >       > + YEARS
  >     > * `value` (required when `time_frame` is NEXT/LAST, integer), range 1-100.
  >     > * `start_time` (required when `time_frame` is BETWEEN, String date), format MM-DD-YYYY.
  >     > * `end_time` (required when `time_frame` is BETWEEN, String date), format MM-DD-YYYY.

### `data_attributes` example

```yaml
. . .
data_attributes:
  refresh_rate: DAILY
  geography:
    granularity:
      - REGION_CONTINENT
    geo_option: COUNTRIES
    coverage:
      continents:
        ASIA:
          - INDIA
          - CHINA
        NORTH AMERICA:
          - UNITED STATES
          - CANADA
        EUROPE:
          - UNITED KINGDOM
    time:
      granularity: MONTHLY
      time_range:
        time_frame: LAST
        unit: MONTHS
        value: 6
```

## `data_dictionary`

The `data_dictionary` field provides consumers insight into the contents and structure of a listing before they install it into their account.
Required for public listings, optional for all other listing types.

The `data_dictionary` field contains a list of up to five data dictionary entries:

* `data_dictionary.featured` (required when using `data_dictionary`): must be ‘featured’.
* `data_dictionary.featured.database` (required when using `data_dictionary`): database name.
* `data_dictionary.featured.objects` (required when using `data_dictionary`): list of name value pairs -

  + `name` (string, required): object name
  + `schema` (string, required): schema
  + `domain` (required):

    One of:

    - DATABASE
    - SCHEMA
    - TABLE
    - VIEW
    - EXTERNAL_TABLE
    - MATERIALIZED_VIEW
    - DIRECTORY_TABLE
    - FUNCTION
    - COLUMN

See also [Data product - data dictionary](https://other-docs.snowflake.com/collaboration/provider-listings-reference#label-listings-data-dictionaries).

### `data_dictionary` example

```yaml
. . .
data_dictionary:
 featured:
    database: "WEATHERDATA"
    objects:
       - name: "GLOBAL_WEATHER"
         schema: "PUBLIC"
         domain: "TABLE"
       - name: "GLOBAL_WEATHER_REPORT"
         schema: "PUBLIC"
         domain: "TABLE"
. . .
```

## `data_preview`

The `data_preview` field allows providers to identify and hide Personally identifiable information (PII) in the data preview samples generated from listing data. PII data is data that could directly or indirectly reveal an individual’s identity. Required for public listings, and optional for all other listing types.

The `data_preview` field includes the following entries:

* `data_preview.has_pii` (required when using `data_preview`): indicates whether PII is included in the listing data.

  + `TRUE` - PII is included in the listing data.
  + `FALSE` - PII is not included in the listing data.
* `data_preview.metadata_overrides` (recommended when `data_preview.has_pii` is `TRUE` ): identifies the location of the PII listing data and the objects within that dataset containing PII to hide or expose.

  + `database` (string, required): Database name.
  + `objects` (list, required): The objects to hide or expose columns from in the data preview samples:

    - `schema` (string, required): Schema name.
    - `domain` (string, required): Domain name.
    - `name` (string, required): Object name
    - `pii_columns` (list, optional): The columns containing PII.
    - `overridden_pii_columns` (list, optional): The columns Snowflake classification identified as containing PII, but should be available in the data preview samples shared with consumers.

      Periodically, Snowflake runs classifications on generated data previews. Any columns containing PII are defined in `classified_pii_columns` when `SHOW` commands are run.

      Columns identified by Snowflake as containing PII are hidden from consumers of the listing only in the data preview samples. If a provider of a listing determines the columns are erroneously identified as containing PII, they can specify the specific columns they want included in the data preview samples using `overridden_pii_columns`.

### `data_preview` example

```yaml
. . .
data_preview:
 has_pii: TRUE
 metadata_overrides:
    database: WEATHERDATA
    objects:
       - schema: PUBLIC
         domain: TABLE
         name: GLOBAL_WEATHER
         pii_columns: [ADDRESS, PHONE]
         overridden_pii_columns: [FIRST_NAME, LAST_NAME]
. . .
```

## `draft_access_type`

Specifies how access to a draft listing is controlled.

This field determines the access model for the listing while it’s in draft status. This is especially relevant for [compliance badging](../collaboration/provider-becoming.md), as providers await approval of a listing’s badge or badges by the Snowflake compliance team.

The allowed values for `draft_access_type` are:

* UNKNOWN
* FREE
* PAID
* LIMITED_TRIAL

### `draft_access_type` examples

> ```yaml
> . . .
> draft_access_type: "PAID"
> . . .
> ```

## `external_targets`

The `external_targets` field is used to share public or private V2 listings.

> **Note:**
>
> This field can only be used with [V2 listings](../collaboration/collaboration-listings-about.md).

The `access` field is **required** when `external_targets` is specified, and it must include one of the following sub-fields:

* `organization`: When creating a private listing, specify the organization name and accounts that can access the listing.
* `account`: When creating a private listing, specify the organization name and accounts that can access the listing.
* `all_organizations`: When creating a public listing, set this to `true`.

### `external_targets` examples

The follow example shows how to use `external_targets` to share private listings.

> ```yaml
> . . .
> external_targets:
>   access:
>     - organization: OrgName2
>       accounts: [acc1, acc2]
> . . .
> ```

The follow example shows how to use `external_targets` to share public listings.

> ```yaml
> . . .
> external_targets:
>   access:
>     - all_organizations: true
> . . .
> ```

## `locations`

Specifies the **optional** `locations` that can discover or access the listing.

> **Note:**
>
> This field can only be used with [V2 listings](../collaboration/collaboration-listings-about.md).

The `access_regions` field is **required** when `locations` is specified, and it must include one of the following sub-fields:

* `ALL`: All regions can discover or access the listing.
* `name`: An array of regions of the form “region_groups_type.snowflake_region” that can discover or access the listing; for example, `access_regions: - name: PUBLIC.AWS_US_WEST_2`.

Available region groupings for VPS deployments include the following:

* AWS_US_EAST_1 (“US East (N. Virginia)”)
* AWS_US_EAST_2 (“US East (Ohio)”)
* AWS_US_WEST_2 (“US West (Oregon)”)
* AWS_EU_WEST_1 (“EU (Ireland)”)
* AWS_EU_WEST_2 (“EU (London)”)
* AZURE_EASTUS2 (“East US 2 (Virginia)”)
* AZURE_CENTRALUS (“Central US (Iowa)”)

### `locations` example

> ```yaml
> . . .
> locations:
>   access_regions:
>     - name: "PUBLIC.AWS_US_WEST_2"
> . . .
> ```

For a complete list of regions, see [SHOW REGIONS](../sql-reference/sql/show-regions.md).

## `monetization_display_order`

The optional `monetization_display_order` field specifies the order in which pricing plans are displayed to consumers in Snowflake Marketplace.

> **Note:**
>
> This field can only be used with [V2 listings](../collaboration/collaboration-listings-about.md).

### `monetization_display_order` example

```yaml
. . .
monetization_display_order:
  - offer_id_1
  - offer_id_2
  - offer_id_3
. . .
```

## `offers`

> **Note:**
>
> This field can only be used with [V2 listings](../collaboration/collaboration-listings-about.md).

The optional `offers` field includes a list of up to eight offers that are associated with the listing. The `offers` field includes the following name value pairs:

* `name` (String, required ): The user-defined name of the offer. The name must be formatted as all uppercase.
* `type` (String, required): Must be one of the following types:

  + `FILE`: Indicates that the offer is defined in a local YAML file.
  + `URL`: Indicates that the offer is defined in a remote URL.
* `path` (String, required): The path to the local or remote [offers YAML](../user-guide/collaboration/listings/pricing-plans-offers/offer-manifest-reference.md).

### `offers` example

```yaml
. . .
offers:
  - name: PRICING_PLAN_1_DEFAULT_OFFER
    type: FILE
    path: offers/PRICING_PLAN_1_DEFAULT_OFFER.yaml
. . .
```

## `pricing_plans`

> **Note:**
>
> This field can only be used with [V2 listings](../collaboration/collaboration-listings-about.md).

The optional `pricing_plans` field includes a list of pricing plans that are associated with the listing. The `pricing_plans` field includes the following name value pairs:

* `name` (String, required): The user-defined name of the pricing plan. The name must be formatted as all uppercase.
* `type` (String, required): Must be one of the following types:

  + `FILE`: Indicates that the offer is defined in a local YAML file.
  + `URL`: Indicates that the offer is defined in a remote URL.
* `path` (String, required): The path to the local or remote [pricing plan YAML](../user-guide/collaboration/listings/pricing-plans-offers/pricing-plan-manifest-reference.md).

### `pricing_plans` example

```yaml
. . .
pricing_plans:
  - name: PRICING_PLAN_1
    type: FILE
    path: pricingPlans/PRICING_PLAN_1.yaml
. . .
```

## `resources`

Resources contain information about the listing, including links to documentation and a video.

The `resources` field is optional for private listing but required for marketplace listings.

Contains the following name value pairs:

* `resources.documentation` (String, required ): A fully qualified link to a page on your website with more detailed documentation for the listing.
  Must start with `http` or `https`.
* `resources.media` (String, optional): A fully qualified link to an unlisted or public YouTube video for the listing.

For more information see [Details](../collaboration/provider-listings-reference.md).

### `resources` example

```yaml
. . .
resources:
  documentation: https://www.example.com/documentation/
  media: https://www.youtube.com/watch?v=MEFlT3dc3uc
. . .
```

## `trial_details`

The optional `trial_details` field captures trial details associated with the listing and includes the following name value pairs:

* `trial_type` (String, required ): Specifies the type of the trial. Must be one of the following types:

  + `TIME`
  + `USAGE`
  + `LIMITED`
  + `LIMITED_TIME`
* `trial_time_limit` (Integer, optional): Specifies the number of days that the listing will be allowed as a trial, after which the consumer would need to request the full product. A null value indicates that the listing is an unlimited time trial. Either `trial_time_limit` or `trial_usage_limit` must be specified.
* `trial_usage_limit` (Integer, optional): Specifies the number of allowed free uses with this listing, after which the consumer would need to upgrade. Either `trial_time_limit` or `trial_usage_limit` must be specified.
* `trial_usage_unit` (Long, optional): Specifies the unit (such as queries or rows) for the trial usage. Depending on this usage unit, the usage count is incremented accordingly. This field can only be used with `trial-usage_limit`.
* `description` (String, optional): A string describing the trial details. The maximum length is 4,096 characters.

### `trial_details` example

```yaml
. . .
trial_details:
  trial_type: TIME
  trial_time_limit: 30
  description: "This is a 30-day free trial"
. . .
```

## `usage_examples`

The `usage_examples` field is optional for private listings but required for marketplace listings.

Contains a list of the following name value pairs:

* `usage.title` (String, required): Usage example title; maximum length 110 characters.
* `usage.description` (String, optional): Associated description; maximum length 300 characters.
* `usage.query` (String, required): Query associated with the usage example; maximum length 30,000 characters.

For more information, see [Sample SQL queries](../collaboration/provider-listings-reference.md).

### `usage_examples` example

```yaml
. . .
usage_examples:
  - title: "Return all weather for the US"
    description: "Example of how to select weather information for the United States"
    query: "select * from weather where country_code='USA'";
. . .
```

## Complete YAML example for a V1 data share listing

[V1 listings](../collaboration/collaboration-listings-about.md) use `targets` to define the accounts that can access the listing.

```yaml
title: "Covid data listing"
subtitle: "Listing about covid"
description: "Example covid manifest"
profile: "MyProfile"
listing_terms:
  type: "STANDARD"
targets:
  accounts: ["Org1.Account1", "Org2.Account2"]
auto_fulfillment:
  refresh_schedule: "120 MINUTE"
  refresh_type: "SUB_DATABASE"
business_needs:
  - name: "Life Sciences Commercialization"
    description: "COVID-19 Epidemiological Data"
usage_examples:
  - title: "Get total case count by country"
    description: "Calculates the total number of cases by country, aggregated over time."
    query: "SELECT  COUNTRY_REGION, SUM(CASES) AS Cases FROM ECDC_GLOBAL GROUP BY COUNTRY_REGION;"
data_attributes:
  refresh_rate: HOURLY
  geography:
    granularity:
      - ADDRESS
    geo_option: COUNTRIES
    coverage:
      continents:
        ASIA:
          - INDIA
          - CHINA
        NORTH AMERICA:
          - UNITED STATES
          - CANADA
        EUROPE:
          - UNITED KINGDOM
    time:
      granularity: MONTHLY
      time_range:
      time_frame: BETWEEN
      start_date: 12-24-2020
      end_date: 12-25-2021
data_preview:
  has_pii: TRUE
  metadata_overrides:
    database: WEATHERDATA
    objects:
      schema: PUBLIC
      domain: TABLE
      name: GLOBAL_WEATHER
      pii_columns: [ADDRESS, PHONE]
      overridden_pii_columns: [FIRST_NAME, LAST_NAME]
resources:
  documentation: https://www.example.com/documentation/
  media: https://www.youtube.com/watch?v=MEFlT3dc3uc
categories:
  - HEALTH
compliance_badges:
  - type: SOC2
    expiry: 12-25-2026
    files:
      - soc2_compliance_verification.pdf
  - type: HIPAA
    expiry: 06-07-2026
    files:
      - hipaa_compliance_verification.pdf
  - type: FEDRAMP
    expiry: 03-15-2027
    files:
      - fedramp_compliance_verification.pdf
cke_content_protection:
  enable: TRUE
  threshold: 0.2
trial_details:
  trial_type: TIME
  trial_time_limit: 30
  description: "This is a 30-day free trial"
```

## Complete YAML example for a V2 data share listing

[V2 listings](../collaboration/collaboration-listings-about.md) use `external_targets` to define the organizations and roles that can access the listing. V2 listings also allow users to define pricing plans and offers.

```yaml
title: "Covid data listing"
subtitle: "Listing about covid"
description: "Example covid manifest"
profile: "MyProfile"
listing_terms:
  type: "STANDARD"
external_targets:
  access:
    - organization: OrgName2
      accounts: [acc1, acc2]
    - account: acc2
      roles: [role1, role2]
auto_fulfillment:
  refresh_schedule: "120 MINUTE"
  refresh_type: "SUB_DATABASE"
business_needs:
  - name: "Life Sciences Commercialization"
    description: "COVID-19 Epidemiological Data"
usage_examples:
  - title: "Get total case count by country"
    description: "Calculates the total number of cases by country, aggregated over time."
    query: "SELECT  COUNTRY_REGION, SUM(CASES) AS Cases FROM ECDC_GLOBAL GROUP BY COUNTRY_REGION;"
data_attributes:
  refresh_rate: HOURLY
  geography:
    granularity:
      - ADDRESS
    geo_option: COUNTRIES
    coverage:
      continents:
        ASIA:
          - INDIA
          - CHINA
        NORTH AMERICA:
          - UNITED STATES
          - CANADA
        EUROPE:
          - UNITED KINGDOM
    time:
      granularity: MONTHLY
      time_range:
      time_frame: BETWEEN
      start_date: 12-24-2020
      end_date: 12-25-2021
data_preview:
  has_pii: TRUE
  metadata_overrides:
    database: WEATHERDATA
    objects:
      schema: PUBLIC
      domain: TABLE
      name: GLOBAL_WEATHER
      pii_columns: [ADDRESS, PHONE]
      overridden_pii_columns: [FIRST_NAME, LAST_NAME]
locations:
  access_regions:
    - name: "PUBLIC.AWS_US_WEST_2"
monetization_display_order:
  - offer_id_1
pricing_plans:
  - name: PRICING_PLAN_1
    type: FILE
    path: pricingPlans/PRICING_PLAN_1.yaml
offers:
  - name: PRICING_PLAN_1_DEFAULT_OFFER
    type: FILE
    path: offers/PRICING_PLAN_1_DEFAULT_OFFER.yaml
resources:
  documentation: https://www.example.com/documentation/
  media: https://www.youtube.com/watch?v=MEFlT3dc3uc
categories:
  - HEALTH
compliance_badges:
  - type: SOC2
    expiry: 12-25-2026
    files:
      - soc2_compliance_verification.pdf
  - type: HIPAA
    expiry: 06-07-2026
    files:
      - hipaa_compliance_verification.pdf
  - type: FEDRAMP
    expiry: 03-15-2027
    files:
      - fedramp_compliance_verification.pdf
draft_access_type: "LIMITED_TRIAL"
cke_content_protection:
  enable: TRUE
  threshold: 0.2
trial_details:
  trial_type: TIME
  trial_time_limit: 30
  description: "This is a 30-day free trial"
```

---
title: Manage listings with SQL as a provider - examples
source: https://docs.snowflake.com/en/progaccess/listing-progaccess-examples.md
section: Programmatic Access
---

# Manage listings with SQL as a provider - examples

The following are examples of the common tasks that providers can complete programmatically with SQL commands:

* Share data with another Snowflake account
* Share private listing and replicate
* Share publicly in the Marketplace
* Create a draft private listing ready for sharing with another account

## Share data with another Snowflake account

Create a private listing for MySHARE and publish immediately.

| Description | Notes |
| --- | --- |
| Create a listing targeted to another account. | Submit the listing for immediate approval (`REVIEW=TRUE` default but shown for clarity).  Publish on approval (`PUBLISH=TRUE` default but shown for clarity). |

```sqlexample
CREATE EXTERNAL LISTING SHARED_WITH_ANOTHER_ACCOUNT
SHARE MySHARE AS
$$
   title: "weather data"
   description: "Listing of weather data for all zipcodes in America"
   listing_terms:
     type: "OFFLINE"
   targets:
     accounts: ["targetorg.targetaccount"]
$$ PUBLISH=TRUE REVIEW=TRUE;
```

## Share private listing and replicate

Create a private listing which is automatically replicated to other regions.

| Description | Notes |
| --- | --- |
| Create a replicated private listing. | Replicate the listing and refresh every 10 minutes.  Submit the listing for immediate approval (`REVIEW=TRUE` by default).  Publish on approval (`PUBLISH=TRUE` by default). |

```sqlexample
CREATE EXTERNAL LISTING SHARED_AND_REPLICATED
SHARE MySHARE AS
$$
   title: "weather data"
   description: "Listing containing weather data for all zipcodes in America"
   listing_terms:
     type: "OFFLINE"
   targets:
     accounts: ["targetorg.targetaccount"]
   auto_fulfillment:
     refresh_type: SUB_DATABASE
     refresh_schedule: '10 MINUTE'
$$;
```

For more information on cross-cloud auto fulfillment see [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md).

## Share publicly in the Marketplace

Create a public listing in the Snowflake marketplace.

| Description | Notes |
| --- | --- |
| Create a replicated public listing in Marketplace.  Replicate the listing into multiple regions. | Replicate the listing and refresh every 10 minutes.  Submit the listing for immediate approval (`REVIEW=TRUE` by default, not shown).  Publish on approval (`PUBLISH=TRUE` by default, not shown). |

```sqlexample
CREATE EXTERNAL LISTING PUB_SHARE_AND_REPLICATE
SHARE MySHARE AS
$$
 title: "Weather Data"
 subtitle: "Weather Data on Snowflake"
 description: "This listing contains weather data for all zipcodes in America"
 terms_of_service:
   type: "STANDARD"
 targets:
   regions: ["PUBLIC.US_WEST", "PUBLIC.AWS_US_EAST_1"]
 auto_fulfillment:
   refresh_schedule: "10 MINUTE"
   refresh_type: "SUB_DATABASE"
 profile: "VERY_STARK_INDUSTRIES_PUBLIC_PROFILE"
 categories: ["BUSINESS"]
 data_dictionary:
   featured:
     database: "DATABASE_NAME"
     objects:
       - schema: "SCHEMA_NAME"
         domain: TABLE
         name: "TABLE_NAME"
 business_needs:
   - name: "Data Quality and Cleansing"
     description: "Test listing for data cleansing"
 usage_examples:
   - title: "Aggregate Weather data for a location"
     description: "Calculate the minimum and maximum temperatures over a year"
     query: "SELECT 1"
 data_attributes:
   refresh_rate: "HOURLY"
   geography:
     geo_option: "NOT_APPLICABLE"
 resources:
   documentation: "https://snowflake.com/doc"
   media: "https://www.youtube.com/watch?v=AR88dZG-hwo"
 $$;
```

## Create a draft private listing ready for sharing with another account

Create a draft listing which is automatically replicated to other regions.

This example is identical to Share data with another Snowflake account but creates a draft listing.
For a complete description of all combinations of the REVIEW and PUBLISH properties,
and their meanings, see [CREATE LISTING](../sql-reference/sql/create-listing.md).

| Description | Notes |
| --- | --- |
| Create a replicated private listing. | Replicate the listing and refresh every 10 minutes.  Do not submit the listing for approval (`REVIEW=FALSE`).  Do not publish (`PUBLISH=FALSE`). |

```sqlexample
CREATE EXTERNAL LISTING DRAFT_PRIVATE_REPLICATED
SHARE MySHARE AS
$$
   title: "weather data"
   description: "Listing containing weather data for all zipcodes in America"
   listing_terms:
     type: "OFFLINE"
   targets:
     accounts: ["targetorg.targetaccount"]
   auto_fulfillment:
     refresh_type: SUB_DATABASE
     refresh_schedule: '10 MINUTE'
$$ PUBLISH=FALSE REVIEW=FALSE;
```

## Optional

Release notes, behavior changes, and deprecation notices.

---
title: .NET Driver release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet.md
section: Release Notes
---

# .NET Driver release notes

The .NET Driver release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](dotnet-2026.md)
* [2025 releases](dotnet-2025.md)
* [2024 releases](dotnet-2024.md)
* [2023 releases](dotnet-2023.md)
* [2022 releases](dotnet-2022.md)

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

---
title: .NET Driver release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet-2022.md
section: Release Notes
---

# .NET Driver release notes for 2022

This article contains the release notes for the .NET Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for .NET Driver updates.

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

## Version 2.0.19 (November 16, 2022)

### New features

* Updated the `System.Text.RegularExpressions` library to version 4.3.1.

## Version 2.0.18 (November 02, 2022)

### BCR (Behavior Change Release) change

> **Caution:**
>
> Version 2.0.18 of the Snowflake .NET driver changed the way it handles escaping the equal sign (=) in
> connection strings to match the .NET specification. Specifically, if a password contained an equal sign, you had to
> escape the character by using double equal signs (==). If your projects are affected by breaking changes related
> specifically to special characters, Snowflake recommends that you do not install this version into a production
> environment before testing.

### New features

* Improved PUT and GET command queries:
* Query strings are case-insensitive.
* White space is allowed at the start and end of query strings.
* White space is permitted in file paths for PUT queries.
* Added the `CLIENT_SESSION_KEEP_ALIVE` configuration property to prevent a session from timing out.
* Added ability to execute a batch of SQL statements (multi-statement support).
* Added support for connecting to proxy servers.

### Bug fixes

* Changed special character handling in connection strings to match the Microsoft .NET specifications.

## Version 2.0.17 (October 3, 2022)

### Bug fixes

* Added the `SetPooling()` function to enable and disable connection pooling.

## Version 2.0.16 (August 24, 2022)

### Behavior Change Release (BCR) change

> **Caution:**
>
> Version 2.0.16 of the Snowflake .NET driver includes an update that replaces targeting .NET Standard 2.0
> with .NET 6.0. If your projects are affected by breaking changes related specifically to .NET 6.0, you must update your
> framework or project to use the new version. Snowflake recommends that you do not install this version into a
> production environment before testing.

### Bug fixes

* Fixed an issue where unicode characters appended an extra “u” for large streams (e.g “/u007f” becomes “/u007fu”).

## Version 2.0.15 (July 19, 2022)

### Bug fixes

* Updated the exception thrown for incorrect private key.

## Version 2.0.14 (June 23, 2022)

### New features

* Updated `SnowflakeDbException.ToString` to include more error details.
* Added support for bulk array binding.
* Added support for connection pools.

## Version 2.0.13 (May 18, 2022)

### New features

* Added option to disable automatically retrying to connect when a connection fails or drops.
* Added byte encryption bytes to read and write chunks for the PUT command.

### Bug fixes

* Resolved an issue where DEL characters displayed incorrectly.

## Version 2.0.12 (May 06, 2022)

### New Feature

* Added support for the GET command.

## Version 2.0.11 (Mar 15, 2022)

### New Feature

* Added support for the PUT command.

## Version 2.0.10 (Feb 16, 2022)

### Bug fixes

* Resolved issues with asynchronous warning messages returned by the Snowflake ChunkDownloader.

## Version 2.0.9 / 1.2.9 (Jan 18, 2022)

### Bug fixes

* Fixed an issue with external browser authentication on non-Windows platforms.
* Returned `TIMESTAMP` values now defaults to `DateTimeKind.Unspecified` instead of DateTimeKind.Utc
* Made the chunk downloader’s parser run asynchronously.

---
title: .NET Driver release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet-2023.md
section: Release Notes
---

# .NET Driver release notes for 2023

This article contains the release notes for the .NET Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for .NET Driver updates.

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

## Version 2.1.5 (December 18, 2023)

### New features and updates

* None

### Bug fixes

* Fixed an issue with enabling certificate revocation checks.

## Version 2.1.4 (December 05, 2023)

### New features and updates

* Added documentation on how to enable Arrow format.

### Bug fixes

* Implemented the validation of the account name format in connection parameters.
* Added synchronization of accessing query context cache.

## Version 2.1.3 (November 15, 2023)

### New features and updates

* Added support for managing the frequency of retries for unsuccessful connection requests:

  + Added the `RETRY_TIMEOUT` parameter with a default value of 300 seconds.
  + Updated how the driver uses the `CONNECTION_TIMEOUT` and `maxHttpRetries` connection parameters and changed the default value of `CONNECTION_TIMEOUT` to 300 seconds.
* Arrow format is now available as a [preview feature](../preview-features.md) (will be enabled by default in the future)

### Bug fixes

* Fixed an issue relating to failures with unexpected exceptions with HTAP metadata optimization.
* Fixed an issue with HTAP that could occur when changing databases or schemas.
* Implemented asynchronous cleanup while destroying a pool to avoid potential deadlocks.
* Removed confusing error information for PUT commands for GCP.
* Fixed incorrect `SnowflakeDbConnection.Dispose` behavior.

## Version 2.1.2 (September 27, 2023)

### New features and updates

* Added support for hybrid transactional and analytical processing:

  + Added retry context in retries for query requests.
  + Added query context caching.
* Added the `GetQueryId()` method to `SnowflakeDbCommand` to retrieve the query ID of the most
  recent executed query to match the existing functionality in `SnowflakeDbDataReader`.

### Bug fixes

* Fixed an issue where PUT/GET commands could fail with internal stages on Azure government cloud accounts.
* Decreased memory usage in PUT/GET operations.
* Fixed an issue that could occur while uploading and downloading data when source files differed from destination files,
  such as might occur due to automatic file compression.

## Version 2.1.1 (August 22, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an issue where test connections were not reused when created successfully.
* Fixed an issue where the `*` and `?` wildcards did not work correctly in file paths.
* Fixed an issue where the driver incorrectly required a username and password for external browser authentication.

## Version 2.1.0 (July 27, 2023)

### BCR (Behavior Change Release) change

Fixed an issue where, under certain conditions, the .NET driver could retry HTTP requests indefinitely.
Previously, during an outage the .NET driver would retry the failed HTTP call continuously until the request
succeeds or until someone force kills the operation.

With this change, disables infinite HTTP retries originating from execute and executeQuery calls. Now, the .NET driver
limits HTTP retries to seven, by default. Customers can set the `MAXHTTPRETRIES` connection parameter to
customize the maximum number of retries. Customers can set `MAXHTTPRETRIES=0` to remove the retry limit,
but doing so runs the risk of the .NET driver infinitely retrying failed HTTP calls.

### New features and updates

* Improved handling of remote paths containing a subdirectory in a GET command.

### Bug fixes

* Fixed an issue with connection pools that could occur when a dirty connection is closed and the
  `BeginTransaction` method is called explicitly.
* Fixed an issue with `UseProxy` in `HTTPClientHandler`.
* Added the `BROWSER_RESPONSE_TIMEOUT` connection parameter to fix an issue with authentication in an external browser.
  The default is 120 seconds.
* Fixed an issue with connection pool timeouts during daylight saving time transitions.

## Version 2.0.25 (June 16, 2023)

### New features and updates

* None

### Bug fixes

* Fixed an issue where the proxy password could be visible in the Snowflake log file.
* Fixed an issue where `SnowflakeDbDataReader.HasRows()` always returned true for some query types (e.g. SELECT)
  regardless whether there are valid rows in query result.
* Fixed an intermittent “Authentication token has expired” or “Session no longer exists” issue when connection pooling
  enabled.
* Removed use of `WinHttpHandler`.
* Fixed an issue where retries on chunk downloading would occasionally fail, such as when a network error occurred
  after the data was partially downloaded.
* Fixed problem in chunk retry downloading process and improved testing of those retries.

## Version 2.0.24 (May 23, 2023)

### New features and updates

* Added session ID logging to get better tracking of the activity for each session in cases where multiple connections are used in parallel.

### Bug fixes

* Fixed an issue where a .NET application throw an unauthorized error when connection pooling was enabled.
* Fixed an issue with 401 errors caused by empty session tokens.

## Version 2.0.23 (April 19, 2023)

### New features and updates

* Changed the legacy supported version to version 4.7.1.

### Bug fixes

* Fixed an issue where a .NET application would terminate for an unhandled exception
  when `client_session_keep_alive=true`.
* Fixed an issue where a COMMIT could be interrupted by an unnecessary rollback.
* Fixed an issue where a connection could not terminate a session when connection pooling is enabled.
* Fixed an issue where calling `Close()` before `Dispose()` resulted in duplication connections in a pool.
* Fixed an issue where errors were thrown then a mandatory USER property was not provided.
* Fixed the WinHttpHandler `PlatformNotSupportedException`; the .NET driver now uses WinHttpHandler only
  for .NET framework applications.
* Fixed an issue where an error incorrectly occurred when passing an empty USER property in the connection
  string for SSO logins.
* Fixed an issue where database names that contained spaces that were enclosed in double quotes (e.g. “My DB”)
  were not treated properly.

## Version 2.0.22 (March 22, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an issue that caused applications that set `CLIENT_SESSION_KEEP_ALIVE=true` to hang when it closed the connection.
* Fixed an issue where query execution would intermittently fail after a timeout occurred.
* Fixed an issue where the .NET driver would fail to execute PUT commands in an FIPS-enabled deployment.
* Fixed the .NET connector throwing error: “System.Net.Http.WinHttpException (80072EE2, 12002): Error 12002 calling WINHTTP_CALLBACK_STATUS_REQUEST_ERROR”.
* Started adding the **https:** prefix to AWS endpoints that do not include the prefix.
* Updated the **Specify an unencrypted private key (read from a file)** example in the `README.md` file
  to remove the `Replace()` function call.

## Version 2.0.21 (February 22, 2023)

### New features and updates

* Added support for using GCS access tokens for PUT and GET queries (#585).

### Bug fixes

* Improved exception handling to preserve stack traces.

## Version 2.0.20 (January 24, 2023)

### New features and updates

* Added support for the new Okta OIE.
* Improved error logging for JSON parsing by including the `queryid` in the log message.

### Bug fixes

* Fixed issue with PUT/GET not determining the correct compression type of files to be uploaded.
* Fixed issue with PUT/GET result values not being mapped to the appropriate field.
* Fixed an out of bounds issue when trimming SQL queries that contained a closing comment.
* Fixed an issue where using Okta authentication failed when receiving an HTTP 429 error.
* Fixed an issue with session timeouts by adding the `DEFAULT_TIMEOUT_IN_SECOND` session parameter.

---
title: .NET Driver release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet-2024.md
section: Release Notes
---

# .NET Driver release notes for 2024

This article contains the release notes for the .NET Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for .NET Driver updates.

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

## Version 4.2.0 (November 05, 2024)

### New features and improvements

* Added a [signature on the driver package](https://github.com/snowflakedb/snowflake-connector-net/blob/43924472903467ad741f8d1ea3ee777c02e21829/README.md#verifying-the-package-signature) to verify its authenticity and integrity.
* Added support for reading vector types.
  For more information, see [VectorType.md](https://github.com/snowflakedb/snowflake-connector-net/blob/d69e2cb42ebd0f076476b979dcd249a0470f4e71/doc/VectorType.md).
* Added support for reading structured types for the JSON result format.
  For more information, see [StructeredType.md](https://github.com/snowflakedb/snowflake-connector-net/blob/d69e2cb42ebd0f076476b979dcd249a0470f4e71/doc/StructuredTypes.md).
* Added logging for the client environment configuration.
* Implemented `SnowflakeDbDataReader.GetEnumerator()`.

### Bug fixes

* Changed the `SnowflakeDbConnection` finalizer to be non-blocking.
* Fixed an issue where some disposable objects were not properly disposed.
* Improved memory management for reading large query results.
* Increased the log level of messages for failed HTTP responses.
* Stopped retrying non-recoverable authentication exceptions.
* Fixed a concurrency issue with initializing a connection pool.
* Changed `DateTime.Kind` to `Unspecified` for reading DATE, TIME and TIMESTAMP_NTZ Snowflake types.
  Version 2.1.3 of the driver introduced an undesired change of setting the `DateTime.Kind` to `Utc`.
* Fixed null response handling for PUT/GET operations in the GCS client.
* Fixed exception handling for PUT/GET operations in the S3 client.
* Fixed very large or very small timestamps handling.
* Improved the logic for calculating when the next retry will happen.
* Fixed the returning rows count for COPY statements from multiple files.
* Fixed support of PUT/GET files without client side encryption.

## Version 4.1.0 (August 05, 2024)

### New features and improvements

* Added log messages about the domain destination to which the driver is connecting.
* Updated `DbCommand.Prepare()` to do nothing instead of throwing an exception.

### Bug fixes

* Fixed a issue where a cancel exception was lost when canceling a `OpenAsync` operation.

## Version 4.0.0 (July 08, 2024)

### BCR (Behavior Change Release) changes

Beginning with version 4.0.0, the .NET driver introduced the following breaking changes:

* Connection pool behavior changes:

  + The driver now uses separate pools for each unique connection string. Previously, the driver use only one pool for all connection strings.
  + The `maxPoolSize` parameter change:

    - Previously, it represented the number of connections to store in the pool. Now it defines the maximum number of connections allowed to open for a given pool (for each unique connection string there is a different pool so you can set it differently for each of them).
    - If `maxPoolSize` is reached, the thread requesting a new connection waits until any connection from the pool is available to reuse without exceeding the limit. An exception is thrown in case of a timeout.
    - You can configure the waiting time in the connection string by setting the `waitingForIdleSessionTimeout` property. The default value for the timeout is 30 seconds. You can change it to 0 to disable waiting.
    - The default value for `maxPoolSize` is 10. Make sure your `maxPoolSize` value is properly set for your needs to avoid hanging your threads or timeout exceptions.
    - The `maxPoolSize` property should be greater than or equal to `minPoolSize`.
  + Added a new `minPoolSize` parameter with a default value of 2 that makes the driver open two connections (the second one in background) when you open the first connection for a given connection string. You can set this value to 0 in the connection string if you want to disable the `minPoolSize` feature.
  + The configuration of the pool has been changed to a connection string driven approach. All properties that control the connection pool behavior can be now passed in the connection string. It is no longer possible to set connection pool properties by `SnowflakeDbConnectionPool` setters, such as `SnowflakeDbConnectionPool.SetTimeout`, `SetPooling`, or `SetMaxPoolSize`. Using `SnowflakeDbConnectionPool` setters now throws exceptions.
  + Previously, connections that altered the database, schema, role, or warehouse (for example, by executing the ALTER SESSION SET command) parameters, were pooled. The new default behavior for such cases destroys altered connections when closing does not return them to the pool. If you want to use altered connections in the pool you need to add `ChangedSession=OriginalPool` to your connection string.
  + Connections with external browser authentication or, some cases of KeyPair/JWT token authentication, are no longer stored in the pool by default. To enable pooling such connections you must add `poolingEnabled=true` to the connection string. For other authentication methods pooling is enabled by default.
  + For more information about using connection pool, see [Using Connection Pools](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/ConnectionPooling.md).
* `NONPROXYHOSTS` parameter behavior change:

  The behavior for `NONPROXYHOSTS` parameter has changed. Previously a host would not be proxied if its name contained the value specified in this parameter. Now the host is not proxied when it is exactly the value specified in the parameter. For instance, previously `NONPROXYHOSTS=c` would match any host containing “c”, such as “your-account.snowflakecomputing.com” as well. After the change, you have to specify the whole host, such as `NONPROXYHOSTS=your-account.snowflakecomputing.com`, to make it non-proxied.

### New features and improvements

* Changed connection pool behavior with multiple pools (one per each unique connection string) and connection string driven configuration. For more information about using connection pool, see [Using Connection Pools](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/ConnectionPooling.md).
* Targeting to .netstandard 2.0.
* Added a strong name signature to the driver assembly.
* Added ability to set the `QueryTag` parameter in the connection string and on the `SnowflakeDbCommand` object to mark the connection and command queries with a tag.
* Bumped the `BouncyCastle.Cryptography` dependency.
* Bumped the `Google.Cloud.Storage.V1` dependency.
* Introduced a new `DISABLE_SAML_URL_CHECK` parameter that disables checking if the SAML postback URL matches the host URL when authenticating with Okta.

### Bug fixes

* Fixed the handling of date and time values passed with bindings for queries with very large amount of bindings (more than `CLIENT_STAGE_ARRAY_BINDING_THRESHOLD`).
* Fixed sending SQL queries to the server by sending original queries instead of trimmed queries that resulted in errors for queries ending with comments.
* Implemented a more reliable way of providing hosts in `NONPROXYHOSTS` parameter.
* Fixed support of double quotes in DB, SCHEMA, WAREHOUSE, ROLE connection string parameters.
* Fixed S3 clients by adding “<https://>” to `ServiceUrl` if it is missing.
* Updated the secret detector to better mask secrets when logging.
* Added setting a proper `SnowflakeDbParameter.DbType` value.
* Fixed the logic of shortening the `connectionTimeout` by `retryTimeout` in case of an infinite `retryTimeout` value.
* Applied the logic of increasing `maxHttpRetries` to the default value for HTTP clients. Previously it was applied only to Okta authentication.

## Version 3.1.0 (March 27, 2024)

### New features and improvements

* Added support for running asynchronous queries.

### Bug fixes

* Improved exceptions thrown from the Okta authenticator.
* Fixed an issue with validating very short (1-2 character) account names.
* Fixed an issue related to retrieving the `WAREHOUSE` property from a connection string with quoted content, such as `"WAREHOUSE=\"two words\""`.

## Version 3.0.0 (February 29, 2024)

### BCR (Behavior Change Release) changes

* To enhance security, the driver no longer searches a temporary directory for easy logging configurations. Additionally, the driver now requires the logging configuration file on Unix-style systems to limit file permissions to allow only the file owner to modify the files (such as `chmod 0600`, `chmod 0644`).
* The driver now throws a `SnowflakeDbException` with a `QueryID` for PUT/GET failures. Previously, the driver returned different types of exceptions, such as `FileNotFound` and `DirectoryNotFound`. If your application checked for any of these exceptions, you must update your code to handle only `SnowflakeDbException` for PUT/GET failures.
* The driver no longer supports older versions, such as V1 and V2, of the chunk parser/downloader. As part of the upgrade to version V3, the driver no longer supports the `SFConfiguration.UseV2JsonParser` or `SFConfiguration.UseV2ChunkDownloader` configuration options. If you used commands similar to the following, you should remove them:

  + `SFConfiguration.Instance().ChunkParserVersion = 1;` or `SFConfiguration.Instance().ChunkParserVersion = 2;`
  + `SFConfiguration.Instance().ChunkDownloaderVersion = 1;` or `SFConfiguration.Instance().ChunkDownloaderVersion = 2;`
  + `SFConfiguration.Instance().UseV2JsonParser`
  + `SFConfiguration.Instance().UseV2ChunkDownloader`

### New features and improvements

* Added support for multiple SAML integrations.

### Bug fixes

* Improved security in the easy logging feature, including:

  + Using a more reliable way of determining which driver directory to use when searching for client configuration files.
  + No longer using a temporary directory for configuration search.
  + Enforcing additional file permissions checks under Unix for increased security.
  + Adding more verbose logging.
* Fixed an Okta retry issue for SSO/SAML endpoints.
* Added fast failing for commands without text execution.
* Fixed exceptions thrown from PUT/GET failed executions to contain `QueryId` if possible.
* Replaced the `Portable.BouncyCastle` library with `BouncyCastle.Cryptography`.

## Version 2.2.0 (January 17, 2024)

### BCR (Behavior Change Release) changes

* Beginning with version 2.2.0, the .NET driver automatically replaces underscores (`_`) in account names with hyphens (`-`) when constructing a host name based on an account name. This change impacts PrivateLink customers whose account names contain underscores. In this situation, you must override the default value by setting `allowUnderscoresInHost` to `true`. You can override this behavior by setting `allowUnderscoresInHost=true` in the `ConnectionString`.

  This change was made to fix the DNS resolution errors that occurred when connecting over the public link with Snowflake accounts that had underscores in their account names.

### New features and updates

* Improved Arrow performance.
* Automatically replaces underscores (`_`) in account names with hyphens (`-`) when constructing a host name based on an account name.
* Added an `allowUnderscoresInHost` configuration parameter to allow underscores (_) in account names to be maintained in the constructed host name. This parameter lets you override the behavior change associated with this release.

### Bug fixes

* To fix an issue with connection timeouts, the driver now closes expired sessions asynchronously when connecting.

---
title: .NET Driver release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet-2025.md
section: Release Notes
---

# .NET Driver release notes for 2025

This article contains the release notes for the .NET Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for .NET Driver updates.

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

## Version 5.2.1 (December 10, 2025)

### New features and improvements

* None.

### Bug fixes

* Fixed the extremely rare case where intermittent network issues during uploads to Azure Blob Storage prevented metadata updates.

## Version 5.2.0 (December 03, 2025)

### New features and improvements

* Added multi-targeting support. NuGet now selects the appropriate build based on the target framework and OS.
* Added support for native Arrow structured types.

### Bug fixes

* Fixed CRL validation to reject newly downloaded CRLs when their `NextUpdate` value has expired.
* Added exception handling to the session heartbeat to prevent network errors from disrupting background heartbeat checks.
* Added retry support for HTTP 307/308 status codes.
* Added the ability to specify non-string values in TOML configuration files. For example, `port` can now be specified as an integer.

## Version 5.1.0 (November 04, 2025)

### New features and improvements

* Added the `APPLICATION_PATH` to the `CLIENT_ENVIRONMENT` sent during authentication to identify the application connecting to Snowflake.
* AWS WIF (Workload Identity Federation) now also checks the application configuration and AWS profile credentials store when determining the current AWS region.
* Added ability for users to configure the maximum number of connections by setting the `SERVICE_POINT_CONNECTION_LIMIT` property.
* Added the `CRLDOWNLOADMAXSIZE` connection parameter to limit the maximum size of CRL (certificate revocation list) files downloaded during certificate revocation checks.

### Bug fixes

* Renew idle sessions in the pool if keep alive is enabled.

## Version 5.0.0 (October 16, 2025)

### BCR (Behavior Change Release) changes

* Removed the `log4net` dependency and enabled delegated logging.
* Upgraded the AWS SDK library to v4.
* Removed some internal classes from the public API.

### New features and improvements

* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [Switching on/off certificate revocation checks (CRL)](https://github.com/snowflakedb/snowflake-connector-net/blob/master/doc/CertficateValidation.md#switching-onoff-certificate-revocation-checks-crl). We recommend you test this feature in advisory mode before enabling it in production.
* Added support for TLS 1.3. The default negotiated version of TLS is either TLS 1.2 or TLS 1.3, and the server decides which one to establish.
* Removed noisy log messages.

### Bug fixes

* None.

## Version 4.8.0 (August 13, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `WORKLOAD_IDENTITY_PROVIDER` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the `authenticator` connection parameter.
* Added support of single use refresh tokens during the OAuth flow.

### Bug fixes

* Removed trailing slash from the default `RedirectUri` within the OAuth Authorization process.
* Fixed a problem with ignoring `endpoint` override in AWS FIPS deployments.

## Version 4.7.0 (July 01, 2025)

### Private Preview (PrPr) features

Added support for workload identity federation in the AWS, Azure, GCP, and Kubernetes platforms.

Disclaimer:

* This feature can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use this feature only with non-production data.
* This PrPr feature is not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and improvements

* None.

### Bug fixes

* Set `ConfigureAwait(false)` for asynchronous Programmatic Access Token authentications.
* Fixed an issue with the missing `OAuthClientSecret` parameter provided externally to a connection string when creating sessions that use the `MinPoolSize` feature.

## Version 4.6.0 (June 18, 2025)

### New features and improvements

* Added support for virtual style domains in Google Cloud Storage (GCS).
* Added a time duration to the logs for HTTPS calls.
* Added a cleaning query context cache before pooling a connection.

### Bug fixes

* Enabled returning result sets for DML operations.
* Added refreshing of expired sessions when fetching operation results.

## Version 4.5.0 (May 09, 2025)

### New features and improvements

* Added OAuth 2.0 Authorization Code flow authentication:

  + Added the `oauth_authorization_code` authenticator.
  + Added the `oauthScope`, `oauthClientId`, `oauthClientSecret`, `oauthAuthorizationUrl`, `oauthTokenRequestUrl`, and `oauthRedirectUri` connection parameters to configure the authentication.
  + Added the ability to provide `oauthClientSecret` by setting the `SnowflakeDbConnection.OAuthClientSecret` property instead of providing it in a connection string.
  + Added a cache for OAuth 2.0 tokens.
* Added OAuth 2.0 Client Credential flow authentication:

  + Added the `oauth_client_credentials` authenticator.
  + Added `oauthScope`, `oauthClientId`, `oauthClientSecret`, and `oauthTokenRequestUrl` connection parameters to configure the authentication.
  + Added the ability to provide `oauthClientSecret` by setting the `SnowflakeDbConnection.OAuthClientSecret` property instead of providing it in a connection string.
* Added Programmatic Access Token authentication:

  + Added the `programmatic_access_token` authenticator.
  + Added the ability to specify the `token` parameter either in a connection string or by setting the `SnowflakeDbConnection.Token` property.
* Added validations for the `scheme`, `port`, and `host` connection properties.
* Added the ability to provide tokens by setting the `SnowflakeDbConnection.Token` property instead of providing them in a connection string.

### Bug fixes

* None.

## Version 4.4.1 (April 28, 2025)

### New features and improvements

* None.

### Bug fixes

* Fixed a Time-of-check Time-of-use (TOCTOU) race condition when checking access to Easy Logging configuration file. For more information, see [CVE-2025-46326](https://github.com/snowflakedb/snowflake-connector-net/security/advisories/GHSA-c82r-c9f7-f5mj).
* Fixed an issue with cancelling connecting with `CancellationTokenSource.CancelAsync()` that did not decrease the pool usage counter.

## Version 4.4.0 (April 10, 2025)

### New features and improvements

* Added an SSO token cache for external browser authentication and the `client_store_temporary_credential` parameter to indicate whether to use the SSO cache.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* Fixed case insensitivity for authenticators. Before the fix, the logic for `username_password_mfa` and `oauth` was not properly applied if authenticators used uppercase characters.
* Fixed an issue with passing null into a query parameter.
* Fixed an issue with reading tokens from the Windows Credential Manager, which was used for `username_password_mfa` authenticator. In some cases the value read from the credential manager could be too long.
* Made some small changes to credential manager implementations, such as changing some log levels and issuing a warning for too permissive cache directory permissions on Unix instead of changing the permissions automatically.
* Fixed the binding of `AnsiString` parameters to the `TEXT` type.
* Fixed loading structured or semi-structured data to a `DataTable`.

## Version 4.3.0 (January 29, 2025)

### New features and improvements

* Added support for configuring connection parameters in TOML files.
* Added an MFA token cache.
* Added support for GCP region-specific endpoints.
* Made encryption headers for files downloaded by GET be case insensitive.
* The driver was tested with .net9 framework.
* Extended documentation for checking CRL endpoints for Windows users.

### Bug fixes

* Improved security of intermediary files placed in OS temporary directories, which makes the files no longer world-readable. For more information, see [CVE-2025-24788](https://github.com/snowflakedb/snowflake-connector-net/security/advisories/GHSA-2mqw-rq5m-8hc8).
* Fixed an issue with handling null data in failed responses.
* Fixed an issue with logging diagnostic information.
* Fixed an issue with handling of spaces in the file path for PUT command with GCS (Google Cloud Storage).
* Fixed an issue with handling GCS endpoints without `https://` prefix.
* Fixed an issue with downloading files with a GET operation that don’t have the `SFC_DIGEST` property in their metadata.
* Fixed the ability to use `STDOUT` as the log path in Easy Logging feature.

---
title: .NET Driver release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/dotnet-2026.md
section: Release Notes
---

# .NET Driver release notes for 2026

This article contains the release notes for the .NET Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for .NET Driver updates.

See [.NET Driver](../../developer-guide/dotnet/dotnet-driver.md) for documentation.

## Version 5.5.0 (April 13, 2026)

### New features and improvements

* The driver now includes `SPCS_TOKEN` in login requests when running inside a Snowpark Container Services (SPCS) container (detected via the `SNOWFLAKE_RUNNING_INSIDE_SPCS` environment variable).
* Extended login-request telemetry with cloud platform and environment detection (AWS Lambda, EC2, Azure VM/Functions, GCE/Cloud Run, GitHub Actions). Detection runs once at startup in the background within a 200ms timeout. You can disable this feature by setting the `SNOWFLAKE_DISABLE_PLATFORM_DETECTION` environment variable.
* Added the `workloadIdentityImpersonationPath` connection parameter for `authenticator=WORKLOAD_IDENTITY`, which allows workloads to authenticate as a different identity through transitive service account impersonation.
* Added the `HonorSessionTimezone` connection parameter (default: `false`). When set to `true`, `TIMESTAMP_LTZ` values honor the session TIMEZONE parameter (set using ALTER SESSION SET TIMEZONE) instead of the local machine timezone. This will become the default behavior in a future major release.

### Bug fixes

* Fixed an issue where idle sessions were not evicted from the connection pool when closing them fails.
* Fixed an issue where sessions that receive HTTP 401 during query execution were returned to the connection pool.
* Fixed `GetResultsFromQueryIdAsync` not aborting queries on the server when a `CancellationToken` is cancelled. Previously, only client-side polling stopped while queries continued running on Snowflake.
* Fixed Azure GET (download) operations incorrectly reporting an `UPLOADED` result status instead of `DOWNLOADED` when the server returns presigned URLs for an encrypted stage.
* Fixed query context cache not being updated when the server returns `queryContext` in a failed query response.
* Improved CRL issuer validation: issuer names are now compared using DER encoding (avoiding string-form mismatches such as `S=` vs `ST=`), and the CRL’s Authority Key Identifier is verified against the issuing CA’s Subject Key Identifier when both extensions are present.

## Version 5.4.1 (February 17, 2026)

### New features and improvements

* Extended login-request telemetry with Linux distribution details parsed from `/etc/os-release`.

### Bug fixes

* Fixed `IndexOutOfRangeException` in Arrow result chunk processing by adding retry state cleanup, batch integrity validation, and defensive bounds checking in `ExtractCell()`.
* Fixed `IndexOutOfRangeException` when reading `NUMBER`/`DECIMAL` columns with scale greater than 9 in Arrow result format.

## Version 5.4.0 (February 05, 2026)

### New features and improvements

* Added support for Red Hat Enterprise Linux (RHEL) 9.
* Added support for the [DECFLOAT](../../sql-reference/data-types-numeric.md) data type (returned as string to preserve full precision).

### Bug fixes

* Fixed `IndexOutOfRangeException` in Arrow result processing when empty batches are returned by the Snowflake backend.

## Version 5.3.0 (January 07, 2026)

### New features and improvements

* Introduced a shared library for extended telemetry to identify and prepare the testing platform for native Rust extensions.

### Bug fixes

* None.

---
title: 10.0 Release Notes: Jan 12, 2026-Jan 15, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_0.md
section: Release Notes
---

# 10.0 Release Notes: Jan 12, 2026-Jan 15, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Search optimization: Support for structured data types

With this release, the search optimization service can improve the performance of point lookup and substring queries on structured data in
Snowflake tables. You can store structured data in ARRAY, OBJECT, and MAP columns in standard Snowflake tables and Iceberg tables.

For more information, see [Speeding up queries of structured data with search optimization](../../user-guide/search-optimization/structured-queries.md).

## Data governance updates

### Copy tags when running a CREATE OR REPLACE TABLE command (*Preview*)

A new COPY TAGS parameter of the CREATE OR REPLACE TABLE command allows you to copy tags that are associated with the original table and
its columns. The newly created table and its columns are associated with the same tags.

For more information, see [CREATE TABLE](../../sql-reference/sql/create-table.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jan 09, 2026 |

---
title: 10.1 Release Notes (with behavior changes): Jan 19, 2026-Jan 23, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_1.md
section: Release Notes
---

# 10.1 Release Notes (with behavior changes): Jan 19, 2026-Jan 23, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2026_01](../bcr-bundles/2026_01_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_07](../bcr-bundles/2025_07_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_06](../bcr-bundles/2025_06_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change again in the following behavior change release, planned for March 2-5, 2026; however, this
schedule is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### Retrieve bind variable values (*General availability*)

The ability to retrieve bind variable values is now generally available and is no longer in Preview.

You can retrieve the values of the bind variables by using the BIND_VALUES table function in the INFORMATION_SCHEMA
schema. Using this function, you can retrieve bind variable values from any code that supports bind variables,
including JavaScript and Snowflake Scripting code.

Bind variable values for past queries are also visible in the `bind_values` column in the output for the
[QUERY_HISTORY Account Usage view](../../sql-reference/account-usage/query_history.md), the
[QUERY_HISTORY Organization Usage view](../../sql-reference/organization-usage/query_history.md), or the
[QUERY_HISTORY function](../../sql-reference/functions/query_history.md) in the INFORMATION_SCHEMA schema.

For more information, see [Retrieve bind variable values](../../sql-reference/bind-variables.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jan 16, 2026 |

---
title: 10.10 Release Notes: Mar 22, 2026-Mar 25, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_10.md
section: Release Notes
---

# 10.10 Release Notes: Mar 22, 2026-Mar 25, 2026

> **Attention:**
>
> This release has completed. These release notes may be updated as additional announcements
> are finalized. For updates, see Release notes change log.

## SQL updates

### Interval data types (*Preview*)

Interval data types store values that represent a duration of time. You can calculate an interval as the difference
between two dates or times. An interval only defines a duration, so it doesn’t have a start or end point in time.
For example, you might define an interval as three years and seven months.

This release adds support for several new interval data types, including INTERVAL YEAR TO MONTH and INTERVAL DAY
TO SECOND.

For more information, see [Interval data types](../../sql-reference/data-types-datetime.md).

## Snowflake Cortex updates

### Batch Cortex Search (*Preview*)

The Batch Cortex Search function is now available in public preview. The `CORTEX_SEARCH_BATCH` table function lets you
submit a batch of queries to a Cortex Search service for offline use cases with high throughput requirements, such as entity
resolution, deduplication, or clustering tasks.

Batch search leverages dedicated compute resources to provide significantly higher throughput than the interactive
Cortex Search API. Unlike interactive queries, batch search can also query services that are currently suspended.

For more information, see [Batch Cortex Search](../../user-guide/snowflake-cortex/cortex-search/batch-cortex-search.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication | Mar 25, 2026 |

---
title: 10.11 Release Notes (no announcements): Mar 30, 2026-Apr 1, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_11.md
section: Release Notes
---

# 10.11 Release Notes (no announcements): Mar 30, 2026-Apr 1, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

This release contains no significant features, updates, or enhancements to announce.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Mar 27, 2026 |
| Release notes | Final publication | Apr 2, 2026 |

---
title: 10.12 Release Notes (with behavior changes): Apr 03, 2026-Apr 08, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_12.md
section: Release Notes
---

# 10.12 Release Notes (with behavior changes): Apr 03, 2026-Apr 08, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2026_03](../bcr-bundles/2026_03_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2026_02](../bcr-bundles/2026_02_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2026_01](../bcr-bundles/2026_01_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change again in the following behavior change release, planned for May 2026; however, this
schedule is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### CHECK constraints for standard tables (*General availability*)

CHECK constraints for standard tables are now generally available. A CHECK constraint enforces a
condition on the values that can be inserted into or updated in one or more columns of a table.
The condition is a SQL expression that you define when you create or alter the table.

CHECK constraints are enforced, which means that any INSERT or UPDATE operation that violates the
constraint results in an error. You can define CHECK constraints inline as part of a column
definition or out-of-line in a separate clause.

For more information, see [CHECK constraints](../../sql-reference/constraints-overview.md).

## New features

### Dynamic table refresh boundaries

You can now use `DYNAMIC_TABLE_REFRESH_BOUNDARY()` in a dynamic table definition to prevent an upstream dynamic table from being refreshed
together with the downstream dynamic table. This lets you decouple dynamic table pipelines so that each pipeline refreshes independently.
Cascading refreshes and snapshot isolation do not apply across the boundary.

For more information, see [Dynamic table refresh boundary](../../user-guide/dynamic-tables-refresh-boundary.md).

### Access history improvements

[Access history](../../user-guide/access-history.md) lets you monitor the SQL statements executed in Snowflake. It keeps track of the
following types of statements:

* Data Manipulation Language (DML) statements. For example, statements used to insert data into a table.
* Data Query Language (DQL) statements. For example, statements that use a SELECT statement to project data.
* Data Definition Language (DDL) statements. For example, statements that create or alter a Snowflake object.

Snowflake is expanding which SQL statements are included in the access history. This release adds support for the following:

* CREATE STREAM statements.
* SHOW AGGREGATION POLICIES, SHOW AUTHENTICATION POLICIES, SHOW NETWORK POLICIES, and SHOW PASSWORD POLICIES.
* DESCRIBE JOIN POLICY, DESCRIBE NETWORK POLICY, DESCRIBE AUTHENTICATION POLICY, and DESCRIBE PASSWORD POLICY (DESCRIBE can be abbreviated to DESC).
* DDL related to MCP servers.
* DDL related to Postgres instances.
* DDL related to Cortex agents.

For a complete list of objects and commands that appear in your access history, see [Supported Objects](../../user-guide/access-history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Mar 31, 2026 |
| CHECK constraints for standard tables | **Added** to SQL updates section | Apr 09, 2026 |
| Access history improvements | **Added** to New features section | Apr 06, 2026 |
| Release notes | Final publication | Apr 08, 2026 |

---
title: 10.13 Release Notes (no announcements): Apr 11, 2026-Apr 16, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_13.md
section: Release Notes
---

# 10.13 Release Notes (no announcements): Apr 11, 2026-Apr 16, 2026

This release contains no significant features, updates, or enhancements to announce.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Apr 08, 2026 |
| Release notes | Final publication | Apr 16, 2026 |

---
title: 10.2 Release Notes: Jan 26, 2026-Jan 30, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_2.md
section: Release Notes
---

# 10.2 Release Notes: Jan 26, 2026-Jan 30, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New UUID data type

This release adds support for the UUID data type. The UUID data type stores universally unique identifiers (UUIDs).

For more information, see [UUID data type](../../sql-reference/data-types-uuid.md).

## Data loading / unloading updates

### Support for Microsoft Fabric OneLake (*General availability*)

You can now use Microsoft Fabric OneLake as an external stage location for loading and unloading data. OneLake is the unified data lake for
Microsoft Fabric that provides a single storage layer across all Fabric workloads.

For more information, see [CREATE STAGE](../../sql-reference/sql/create-stage.md) and [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jan 23, 2026 |

---
title: 10.3 Release Notes: Feb 02, 2026-Feb 05, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_3.md
section: Release Notes
---

# 10.3 Release Notes: Feb 02, 2026-Feb 05, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Extensibility updates

### Owner’s rights contexts: Allow INFORMATION_SCHEMA, SHOW, and DESCRIBE

We have updated the permission models for owner’s rights contexts — including owner’s rights stored procedures,
Native Apps, and Streamlit — to support a wider range of introspection commands.

* **SHOW and DESCRIBE**: Most SHOW and DESCRIBE commands are now permitted.

  + *Exceptions*: Commands that read specific domains related to the current session or user remain blocked.
* **Information Schema**: INFORMATION_SCHEMA views and table functions are now accessible.

  + *Exceptions*: The following history functions remain restricted: QUERY_HISTORY, QUERY_HISTORY_BY_\*, and LOGIN_HISTORY_BY_USER.

For more information, see [Owner’s rights stored procedures](../../developer-guide/stored-procedure/stored-procedures-rights.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jan 30, 2026 |

---
title: 10.4 Release Notes: Feb 09, 2026-Feb 13, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_4.md
section: Release Notes
---

# 10.4 Release Notes: Feb 09, 2026-Feb 13, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL functions

The following function is now available with this release:

| Function subcategory | New function | Description |
| --- | --- | --- |
| Context | [IS_DATABASE_ROLE_ACTIVATED (SYS_CONTEXT function)](../../sql-reference/functions/is_database_role_activated.md) | Returns the VARCHAR value ‘TRUE’ if a database role is activated in the current session. |

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Feb 06, 2026 |
| New SYS_CONTEXT function: IS_DATABASE_ROLE_ACTIVATED | **Added** to SQL updates | Feb 10, 2026 |

---
title: 10.5 Release Notes: Feb 16, 2026-Feb 19, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_5.md
section: Release Notes
---

# 10.5 Release Notes: Feb 16, 2026-Feb 19, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### SAML2 federated authentication: Support for metadata URL

You can now specify the identity provider’s (IdP’s) metadata URL when creating a SAML2 security integration,
instead of providing four separate parameters for IdP information. Snowflake obtains the information directly
from the metadata URL, which is less error-prone and allows IdP changes to be dynamically updated without
changing any parameter values.

For more information, see [Configuring Snowflake to use federated authentication](../../user-guide/admin-security-fed-auth-security-integration.md).

### Tri-Secret Secure supports secure share area accounts

When you publish a listing and enable [auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md), Snowflake manages the
provisioning for secure share area (SSA) accounts in consumer regions. With this release, you can now protect your SSA accounts with
Tri-Secret Secure (TSS). This feature allows you to register a customer-managed key (CMK) for use with TSS and activate TSS for your
SSA accounts. You can also optionally pass in the account name when calling the [SYSTEM$GET_CMK_INFO](../../sql-reference/functions/system_get_cmk_info.md) function or the [SYSTEM$VERIFY_CMK_INFO](../../sql-reference/functions/system_verify_cmk_info.md) function to get or verify CMK information.

For more information, see [Tri-Secret Secure with secure share area accounts in Snowflake](../../user-guide/security-encryption-tss-ssa.md).

## Data governance updates

### DUPLICATE_COUNT DMF: Ability to specify multiple columns

You can now associate the DUPLICATE_COUNT data metric function (DMF) with a table to find the number of rows where a combination of
specified columns is duplicated. Previously, you could only return the number of duplicates in a single column.

For more information, see [DUPLICATE_COUNT (system data metric function)](../../sql-reference/functions/dmf_duplicate_count.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Feb 13, 2026 |
| Tri-Secret Secure supports secure share area accounts | **Added** to Security updates | Feb 19, 2026 |

---
title: 10.6 Release Notes: Feb 23, 2026-Feb 27, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_6.md
section: Release Notes
---

# 10.6 Release Notes: Feb 23, 2026-Feb 27, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Data lake updates

### Apache Iceberg™ tables: Partitioned writes with hierarchical paths (*Preview*)

You can now use Snowflake to write to partitioned Iceberg tables by using a hierarchical path layout. This layout is also called
“Hive-style” partitioning.

This update lets Snowflake write to Iceberg tables by using the same hierarchical path layout for partitions that some external engines support.

This feature is supported for both Snowflake-managed and externally managed Iceberg tables. To use a hierarchical path layout for
partitioned writes, set the new PATH_LAYOUT parameter to HIERARCHICAL when you create a table. If you don’t specify this parameter,
Snowflake uses the existing layout for partitioned writes, which is a flat file structure.

For more information, see [Partitioning with hierarchical paths](../../user-guide/tables-iceberg-metadata.md).

## Data governance updates

### Data quality: Non-owners can associate a data metric function with an object (*General availability*)

Users with the SELECT privilege on a table or view can now associate it with a data metric function (DMF) to set up a data quality check.
Previously, only the owner of the table or view could associate a DMF.

As part of this change, an association between a DMF and an object has a new property: EXECUTE AS ROLE. This property specifies which role
the DMF runs with.

For more information, see [Required privilege on the table or view](../../user-guide/data-quality-access-control.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Feb 20, 2026 |

---
title: 10.7 Release Notes (with behavior changes): Mar 02, 2026-Mar 05, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_7.md
section: Release Notes
---

# 10.7 Release Notes (with behavior changes): Mar 02, 2026-Mar 05, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2026_02](../bcr-bundles/2026_02_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2026_01](../bcr-bundles/2026_01_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_07](../bcr-bundles/2025_07_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change again in the following behavior change release, planned for April 2026; however, this
schedule is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## Data lake updates

### Apache Iceberg™ tables: Support for fixed(L) data type

Snowflake now supports the `fixed(L)` primitive data type as defined by the Apache Iceberg™ specification. Snowflake maps the Iceberg
`fixed(L)` data type to the Snowflake `BINARY(L)` data type. In addition, Snowflake enforces that values inserted in Iceberg
`fixed(L)` columns are exactly `L` bytes in length.

This enforcement ensures consistent behavior across engines in the Iceberg ecosystem.

For more information, see [Other data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).

> **Note:**
>
> To use the Iceberg `fixed(L)` primitive data type, you must enable the `2026_02` behavior change bundle in your account. For
> more information, see [Enabling a behavior change bundle in your account](../bcr-bundles/managing-behavior-change-releases.md).

To enable this bundle in your account, execute the following statement:

> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2026_02');
> ```

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Feb 27, 2026 |

---
title: 10.8 Release Notes: Mar 08, 2026-Mar 12, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_8.md
section: Release Notes
---

# 10.8 Release Notes: Mar 08, 2026-Mar 12, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### User-defined types

With this release, you can define user-defined types, which are new data types that are based on existing [Snowflake data types](../../sql-reference-data-types.md). User-defined
types can simplify schema maintenance and improve data quality. You can define a user-defined type once, and then use it in multiple objects.

For example, you can define a data type named `age` that corresponds to NUMBER(3,0) or a data type named `address` that is a
structured OBJECT type with fields for the street address, city, state, and postal code.

For more information, see [User-defined types](../../sql-reference/data-types-user-defined.md).

## Data collaboration updates

### Business Continuity and Disaster Recovery (BCDR) for listings

Providers can now include listings and their dependencies — such as shares and databases — in
[account replication and failover groups](../../user-guide/account-replication-intro.md). With failover groups, in the event
of a service degradation or outage, [auto-fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) relies on
failover groups for data replication and disaster recovery.

For more information, see [Listing support in Business Continuity and Disaster Recovery](../../collaboration/listings-bcdr.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Mar 06, 2026 |
| CKE document access history | **Removed** from Snowflake Cortex updates | Mar 09, 2026 |
| Business Continuity and Disaster Recovery (BCDR) for listings | **Added** to Data collaboration updates | Mar 10, 2026 |

---
title: 10.9 Release Notes: Mar 17, 2026-Mar 20, 2026
source: https://docs.snowflake.com/en/release-notes/2026/10_9.md
section: Release Notes
---

# 10.9 Release Notes: Mar 17, 2026-Mar 20, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Snowflake supports directory, root stage, and SnowGit imports

Snowflake now supports the following import types for Python, Java, and Scala UDFs, UDTFs, UDAFs, and stored procedures:

* Directory imports: Import an entire directory from a stage.
* Root stage imports: Import from the root of a stage.
* SnowGit imports: Import folders from Git-backed refs such as `@repo/branches/main/...`.

For more information, see [Python UDF handler examples](../../developer-guide/udf/python/udf-python-examples.md),
[Scala UDF handler examples](../../developer-guide/udf/scala/udf-scala-examples.md), and
[Java UDF handler examples](../../developer-guide/udf/java/udf-java-cookbook.md).

## Security updates

### TSS history account usage view (*General availability*)

Customers can now view information about the registration and activation of customer-managed keys (CMKs) with Tri-Secret Secure
by using the TRI_SECRET_SECURE_HISTORY account usage view.

For more information, see [TRI_SECRET_SECURE_HISTORY view](../../sql-reference/account-usage/tri-secret-secure-history.md).

## SQL updates

### DML error logging for tables

This release adds support for DML error logging for tables. When this feature is turned on for a table, data manipulation language
(DML) statements can continue when they perform an operation on the table and a supported error occurs. Errors are logged in an
*error table* that is associated with the base table.

For more information, see [DML error logging](../../user-guide/data-load-overview.md).

### Additional date and time formats

Date and time formats are standardized ways of representing dates and times. This release adds the following additional date and
time formats: `Y`, `MO`, `D`, `H24`, `H12`, `HH`, `H`, `ME`, `S`, and `P`.

For more information, see [date and time formats](../../sql-reference/data-types-datetime.md).

### Additional fixed-position numeric format models

SQL format models are used to specify how numeric values are converted to text strings and vice versa. This release adds the
following additional fixed-position numeric format models: `%` and parameterized `TM9`.

For more information, see [fixed-position numeric format models](../../sql-reference/sql-format-models.md).

## Snowflake Cortex updates

### CKE document access history

Providers can now track which documents are being accessed in their Cortex Knowledge Extensions (CKEs) using access history data
in the [LISTING_ACCESS_HISTORY](../../sql-reference/data-sharing-usage/listing-access-history.md) view.

Two new system functions help providers map hashed document IDs back to their original primary key columns:

* `SYSTEM$ENCODE_CKE_PRIMARY_KEY`: Transform and anonymize the primary key from the set of selected columns.

  For more information, see [SYSTEM$ENCODE_CKE_PRIMARY_KEY](../../sql-reference/functions/system_encode_cke_primary_key.md).
* `SYSTEM$CKE_HASH_FUNCTION`: Hash the primary key.

  For more information, see [SYSTEM$CKE_HASH_FUNCTION](../../sql-reference/functions/system_cke_hash_function.md).

For more information, see [CKE document access history](../../user-guide/snowflake-cortex/cortex-knowledge-extensions/cke-access-history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Mar 12, 2026 |
| Release notes | Final publication | Mar 20, 2026 |

---
title: 2022 Performance Improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements-2022.md
section: Release Notes
---

# 2022 Performance Improvements

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

The following performance improvements were introduced in 2022:

| Released | Description | Impact |
| --- | --- | --- |
| November 2022 | Improved performance and resilience of Snowflake applications and content. | Improved response times and availability for Snowsight pages. |
| November 2022 | Ability to use scalar [Vectorized Python UDFs](../developer-guide/udf/python/udf-python-batch.md) in Snowpark. | Performance improvements for Python code that operates efficiently on batches of rows. Also requires less transformation logic when using Pandas DataFrames and arrays. |
| November 2022 | Improvements for metadata queries. | Improved query execution time for small/metadata queries. |
| October 2022 | Ability to enable [Search Optimization](../user-guide/search-optimization-service.md) for [specific columns](../user-guide/search-optimization/enabling.md). (Preview) | Point lookup queries that act upon a column can be improved without incurring the expense of enabling Search Optimization for the entire table. |
| October 2022 | Support for [substring operations](../user-guide/search-optimization/substring-queries.md) when using Search Optimization. (Preview) | Improves the performance of point lookup queries that use substring operations such as LIKE and ENDSWITH. |
| October 2022 | Support for [VARIANT data](../user-guide/search-optimization/semi-structured-queries.md) when using Search Optimization. (Preview) | Improves the performance of point lookup queries that act upon VARIANT data (such as JSON). |
| October 2022 | Support for [geospatial functions with GEOGRAPHY objects](../user-guide/search-optimization/geospatial-queries.md) when using Search Optimization. (Preview) | Improves the performance of point lookup queries that use a geospatial function in a predicate. |
| October 2022 | Improvements for Collation and BINARY columns. | Improved pruning for collations and BINARY columns, which means fewer [micro-partitions](../user-guide/tables-clustering-micropartitions.md) must be scanned to return results. |
| October 2022 | Improvements for hash table joins. | Improved query performance by reducing memory I/O latency in hash table equality checks. |
| October 2022 | Improvements for DateTrunc range derivations. | Improved execution time for queries that use DateTrunc range derivation when [TIMESTAMP-TZ](../sql-reference/data-types-datetime.md) data is a constant. |
| August 2022 | Improvements related to Data Governance features. | More responsive Data Governance UI pages in Snowsight as well as improved query latency for [tag-based masking policies](../user-guide/tag-based-masking-policies.md). |
| August 2022 | Improvements for [window functions](../sql-reference/functions-window.md). | Improved rule-based optimization as well as improved query execution for outer join and filter pushdown in window functions. |
| August 2022 | Scheduling improvements for high-concurrency workloads. | Improved query scheduling for high concurrency and lower latency workloads. |
| July 2022 | Improved query performance using Join Elimination. | Optimized query performance through the automatic elimination of unnecessary joins, which are identified by automatically evaluating query logic. |

---
title: 2023 Performance Improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements-2023.md
section: Release Notes
---

# 2023 Performance Improvements

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

The following performance improvements were introduced in 2023:

| Released | Description | Impact |
| --- | --- | --- |
| December 2024 | Improved column replication. | Reduces the time spent in the SECONDARY_DOWNLOADING_METADATA phase of a refresh operation for table columns. Improvements scale linearly with the number of columns replicated. |
| November 2023 | Improved execution times for some SHOW commands. | Reduces the execution time for the [SHOW TABLES](../sql-reference/sql/show-tables.md), [SHOW SCHEMAS](../sql-reference/sql/show-schemas.md), and [SHOW DATABASES](../sql-reference/sql/show-databases.md) commands. Improvements are most significant for queries that return large result sets. |
| November 2023 | Search Optimization: Support for [substring search in semi-structured data](../user-guide/search-optimization/semi-structured-queries.md). (General Availability) | Improves the performance of point lookup queries that use substring and regular expression functions against semi-structured data, including ARRAY, OBJECT, and VARIANT types. Previously, only equality searches on such columns could be optimized. |
| October 2023 | Reduced maintenance costs for materialized views. | Reduces materialized view maintenance credits by improving the utilization of service resources. |
| October 2023 | Improved compile times for SQL expressions. | Reduces the compilation time of queries that contain many SQL expressions. |
| September 2023 | Improved compile times. | Reduces compilation times by skipping optimizations that will not result in performance improvements. |
| August 2023 | Ability to use a query hash to identify patterns and trends in query execution. | Helps to [monitor and analyze recurring queries](../user-guide/query-hash.md) by including a query hash of each query in ACCOUNT_USAGE views and INFORMATION_SCHEMA table functions. Can be used to determine the effects of performance improvements like choosing a new cluster key. |
| August 2023 | Improved execution times for non-clustered tables. | Reduces execution time for SELECT and DML operations against non-clustered tables with micro-partitions that are smaller than average. |
| August 2023 | Ability to call the [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md) function to obtain query profile statistics. (General Availability) | Helps to programmatically debug queries and gain insights into query performance. |
| August 2023 | Improved execution times for joins on wide build-side rows. | Reduces execution time and improves memory management for queries matching wide rows on the build side of a join (for example, rows that include columns with long strings). |
| July 2023 | Improved compile times for materialized views. | Reduces compilation times for materialized views based on tables that have 100s or 1000s of micro-partitions. |
| July 2023 | Ability to use [Snowpipe Streaming](../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). (General Availability) | Enables low-latency streaming data pipelines to support writing data rows directly into Snowflake. |
| July 2023 | Improved selectivity and cardinality estimation. | Uses improved plan selection to reduce execution time for queries with low selectivity. |
| July 2023 | Search Optimization Update: Support for [substring search in VARIANT types](../user-guide/search-optimization/semi-structured-queries.md). (Preview) | Improves the performance of point lookup queries that use substring and regular expression functions against semi-structured data, including ARRAY, OBJECT, and VARIANT types. |
| July 2023 | Improved compile times for simple queries and DML statements. | Reduces compilation times and improves memory management for simple DML statements and single-table queries with simple equality or range predicates. |
| June 2023 | Improved execution times for SELECT statements with LIMIT and ORDER BY clauses. | Reduces execution time of some queries with long-running SELECT statements containing both LIMIT and ORDER BY clauses. |
| June 2023 | Improved execution times against [secure views](../user-guide/views-secure.md). | Uses predicate pushdown to reduce execution time for queries against secure views. |
| May 2023 | Improved compile times for queries with numerous extraction expressions. | Reduces compilation times for queries with many extraction expressions (such as those used for processing JSON). |
| May 2023 | Improved compile times for queries with numerous subqueries. | Reduces compilation time for queries with 100+ subqueries. |
| April 2023 | Search Optimization Update: Ability to enable Search Optimization for [specific columns](../user-guide/search-optimization/enabling.md). (General Availability) | Point lookup queries that act upon a column can be improved without incurring the expense of enabling Search Optimization for the entire table. |
| April 2023 | Search Optimization Update: Support for [substring operations](../user-guide/search-optimization/substring-queries.md). (General Availability) | Improves the performance of point lookup queries that use substring operations such as LIKE and ENDSWITH. |
| April 2023 | Search Optimization Update: Support for [VARIANT data](../user-guide/search-optimization/semi-structured-queries.md). (General Availability) | Improves the performance of point lookup queries that act upon VARIANT data (such as JSON). |
| April 2023 | Search Optimization Update: Support for [geospatial functions with GEOGRAPHY objects](../user-guide/search-optimization/geospatial-queries.md). (General Availability) | Improves the performance of point lookup queries that use a geospatial function in a predicate. |
| April 2023 | Ability to use the [query acceleration service](../user-guide/query-acceleration-service.md) to speed up queries against tables that have [Search Optimization](../user-guide/search-optimization-service.md) enabled. (General Availability) | Additional compute power provided by the query acceleration service can be combined with performance boost provided by search optimization. |
| March 2023 | Ability to use [Snowpipe Streaming](../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). (Preview) | Enables low-latency streaming data pipelines to support writing data rows directly into Snowflake. |
| February 2023 | Ability to use the [query acceleration service](../user-guide/query-acceleration-service.md). (General Availability) | Improves overall warehouse performance by reducing the impact of outlier queries. |
| February 2023 | Ability to call the [GET_QUERY_OPERATOR_STATS](../sql-reference/functions/get_query_operator_stats.md) function to obtain programmatic Query Profile statistics. (Preview) | Helps debug queries and gain insights into query performance. |
| February 2023 | Ability to use [memory-optimized warehouses](../user-guide/warehouses-snowpark-optimized.md). | Memory-intensive queries can be run on Snowpark-optimized warehouses that provide 16x more memory per node and 10x the local cache compared with standard warehouses. |

---
title: 2023_01 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01_bundle.md
section: Release Notes
---

# 2023_01 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.2 release (Jan 19-20) with status **Disabled by Default**.
2. Status changed in the 7.7 release (Mar 6-7) to **Enabled by Default**.
3. Status changed in the 7.13 release (Apr 20-24) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

| **Security Changes** | **Additional Notes** |
| --- | --- |
| [SHOW INTEGRATIONS Command: USAGE Privilege Required to View Output](2023_01/bcr-930.md) |  |
| [System Functions: MONITOR SECURITY Privilege Required to Execute Certain System Functions](2023_01/bcr-805.md) |  |
| [Query History: Redacted SQL Upon Syntax Error](2023_01/bcr-936.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [Materialized Views: Using Time Travel to Query Historical Data Produces Expected Error Message](2023_01/bcr-923.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [SHOW Commands: New Column Added to Output for Certain Commands](2023_01/bcr-865.md) |  |
| [SHOW ORGANIZATION ACCOUNTS Command: New Column in Output](2023_01/bcr-942.md) |  |
| [ARRAY_CAT Function: Changes to NULL Handling](2023_01/bcr-940.md) |  |
| [ARRAY_POSITION Function: Changes to Finding the Position of a NULL Value](2023_01/bcr-882.md) |  |
| [GET_DDL Function: Tags Set on Streams, Tasks, and Pipes Included in Output](2023_01/bcr-924.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Account Usage: New and Changed Columns in Certain Views](2023_01/bcr-732.md) |  |
| [QUERY_HISTORY View (Account Usage): New Columns Added](2023_01/bcr-921.md) |  |
| [REPLICATION_DATABASES View (Information Schema): Changes to Column Values](2023_01/bcr-892.md) |  |
| [TABLES View (Account Usage): Changes to the RETENTION_TIME Column](2023_01/bcr-853.md) |  |
| [TABLES, VIEWS, and EXTERNAL_TABLES Views (Account Usage, Information Schema): New Columns Added](2023_01/bcr-891.md) |  |
| [TASK_HISTORY View (Account Usage): Change to Status for Failed and Auto-suspended Tasks](2023_01/bcr-899.md) |  |
| **Snowflake CLI, Connectors, Drivers, and SQL API Changes** | **Additional Notes** |
| [Some Unused Data No Longer Sent to Drivers, Connectors, and Clients](2023_01/bcr-916.md) |  |
| **Data Pipeline Changes** | **Additional Notes** |
| [Streams: CREATE STREAM with INSERT_ONLY = TRUE Not Allowed on Non-external Tables](2023_01/bcr-795.md) |  |
| [Temporary Tables: Changes to Table Creation in Schemas (Pending)](2023_01/bcr-934.md) |  |
| [Streams: Joins on Views for Append-only Streams No Longer Produce Unexpected Results](2023_01/bcr-920.md) |  |
| [Task Parameters Preserved When Cloning Tasks](2023_01/bcr-912.md) |  |
| [Task Parameters Preserved When Cloning Databases, Schemas, and Tables](2023_01/bcr-913.md) |  |
| **Data Governance Changes** | **Additional Notes** |
| [Tag Must Exist When Calling System Functions](2023_01/bcr-938.md) |  |
| [Query History: Redacted SQL Upon Syntax Error](2023_01/bcr-936.md) |  |
| **Replication Changes** | **Additional Notes** |
| [Failover Groups: Change to GRANTED_ON Column in SHOW GRANTS Output](2023_01/bcr-895.md) |  |
| [Integrations: Read-only Secondary Integrations Enforced](2023_01/bcr-906.md) |  |

---
title: 2023_02 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02_bundle.md
section: Release Notes
---

# 2023_02 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.7 release (Mar 6-7) with status **Disabled by Default**.
2. Status changed in the 7.13 release (Apr 20-24) to **Enabled by Default**.
3. Status changed in the 7.19 release (Jun 7-8) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Access Control: Granting REFERENCE_USAGE on a Database to a Role No Longer Allowed](2023_02/bcr-944.md) |  |
| [SCIM Security Integrations: Using the ENABLED Parameter to Enable or Disable an Integration](2023_02/bcr-937.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [Time Travel: Data Retention Disabled for a Database Created from a Share](2023_02/bcr-945.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [SHOW Commands: New OWNER_ROLE_TYPE Column in Output](2023_02/bcr-747.md) |  |
| **SQL — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [REPLICATION_GROUPS View (Information Schema): New Column in View](2023_02/bcr-950.md) |  |
| [USAGE_IN_CURRENCY_DAILY View (Org Usage): New Usage Types](2023_02/bcr-965.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [Snowpark: Creation Time Validation Disabled for Python and Java Temporary UDFs](2023_02/bcr-922.md) |  |
| [Stored Procedures: put_stream Uses Different Way to Get the File Name](2023_02/bcr-943.md) |  |

---
title: 2023_03 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03_bundle.md
section: Release Notes
---

# 2023_03 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.13 release (Apr 20-24) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 7.19 release (June 7-8) to **Enabled by Default**; however, account admins can disable for opt-out.
3. Status changed in the 7.23 release (July 10-11) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **SQL Changes — General** | **Additional Notes** |
| [Cloned Tables: Default Value for Columns Not Allowed](2023_03/bcr-948.md) |  |
| [Search Optimization: Removing Search Optimization from a Table Requires the ADD SEARCH OPTIMIZATION Privilege](2023_03/bcr-1046.md) |  |
| [Temporary Tables: Changes to Table Creation in Schemas](2023_03/bcr-427.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [RESULT_SCAN Table Function: Changes to Duplicate Column Names](2023_03/bcr-1039.md) |  |
| [SHOW DATABASES Command: New Column in Output](2023_03/bcr-1021.md) |  |
| [SHOW ORGANIZATION ACCOUNTS Command: New Columns in Output](2023_03/bcr-803.md) |  |
| [SHOW TERSE DATABASES Command: Values Populated in the KIND Column](2023_03/bcr-1022.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [COPY_HISTORY View (Account Usage): “Load in progress” No Longer Shown in STATUS Column](2023_03/bcr-986.md) |  |
| [Account Usage: Changes to Columns in DATABASES View](2023_03/bcr-949.md) |  |
| [Account Usage: New Column in DATABASES View](2023_03/bcr-1033.md) |  |
| [Information Schema: New Column in DATABASES View](2023_03/bcr-1032.md) |  |
| [GRANT_TO_ROLES View (Account Usage): Privileges Added to View](2023_03/bcr-1040.md) |  |
| PIPES View (Account Usage): New Column in View (Pending) | This change has been removed. |
| [SESSIONS and LOGIN_HISTORY Views (Account Usage): Events from Internal Users Removed from Views](2023_03/bcr-1053.md) |  |
| [SESSIONS View (Account Usage): New Column in View](2023_03/bcr-991.md) |  |
| [TASK_HISTORY Table Function (Information Schema): Consistent Values for Failed and Auto-suspended Tasks in ERROR_ONLY Output](2023_03/bcr-990.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [Parquet Files: Statistics Included for Decimal Columns Unloaded as FixedLengthByteArrays](2023_03/bcr-976.md) |  |
| **Replication Changes** | **Additional Notes** |
| [Roles and Privileges: Changes to Secondary Roles and the REPLICATE Privilege](2023_03/bcr-1042.md) |  |
| [Stream and Task Replication: Changes for GA](2023_03/bcr-1048.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [Stored Procedures and UDTFs: Argument Names Respected in Calls](2023_03/bcr-1017.md) |  |
| [Stored Procedures: Calls to BUILD_SCOPED_FILE_URL Function Allowed Within Owner’s Rights Procedures](2023_03/bcr-1007.md) |  |
| [UDFs: Functions with Handler Code That Reads Files from a Stage Execute in the Owner’s Context](2023_03/bcr-1008.md) |  |
| **Web Interface Changes** | **Additional Notes** |
| [Query Profile: Changes to the Update, Delete, and Insert Operators](2023_03/bcr-978.md) |  |
| [Snowsight: Roles Removed from Worksheet Folders](2023_03/bcr-1025.md) |  |

---
title: 2023_04 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04_bundle.md
section: Release Notes
---

# 2023_04 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.19 release (June 7-8) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in 7.23 release (July 10-11) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in 7.29 release (August 22-23) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [IS_GRANTED_TO_INVOKER_ROLE Function: Change to the Output](2023_04/bcr-984.md) |  |
| [Private Connectivity Functions: OCSP Connection URLs Added to Output (Pending)](2023_04/bcr-1111.md) |  |
| [Roles: Changes to How Regrants Are Recorded in the GRANTS_TO_USERS View](2023_04/bcr-1132.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [Automatic Clustering: SYSTEM$CLUSTERING_INFORMATION Syntax and Output Changes](2023_04/bcr-985.md) |  |
| [Materialized Views: MINUS, EXCEPT, and INTERSECT No Longer Allowed](2023_04/bcr-757.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [SHOW Commands: New OWNER_ROLE_TYPE Column in Output](2023_04/bcr-747.md) |  |
| [SHOW GRANTS ON Command: New GRANTED_BY_ROLE_TYPE Column in Output](2023_04/bcr-754.md) |  |
| [Table Functions (Except SQL UDTFs): Restrictions With Lateral Table Functions and Outer Lateral Joins](2023_04/bcr-1057.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Account Usage Views: New INSTANCE_ID Column in Select Views](2023_04/bcr-1100.md) |  |
| [Account Usage Views: New OPTIONS Column in Select Views](2023_04/bcr-1091.md) |  |
| [Data Sharing Usage Views: New CONSUMER_NAME Column in Select Views](2023_04/bcr-1097.md) |  |
| [Metering Views (Account Usage): Additional Replication and Snowpipe Streaming Credit Usage Information Included in Views](2023_04/bcr-1121.md) |  |
| [Observability Views (Account Usage and Information Schema): New Columns in Views](2023_04/bcr-1070.md) |  |
| [DATABASE_STORAGE_USAGE_HISTORY View (Organization Usage): New Columns in View](2023_04/bcr-1036.md) |  |
| [LISTING_TELEMETRY_DAILY View (Data Sharing Usage): New Value for EVENT_TYPE Column](2023_04/bcr-1084.md) |  |
| [PACKAGES View (Information Schema): New RUNTIME_VERSION Column in View](2023_04/bcr-1094.md) |  |
| [TABLES and SCHEMATA Views (Account Usage): Changes to RETENTION_TIME Column](2023_04/bcr-928.md) |  |
| [VIEWS View (Information Schema): New Columns in View](2023_04/bcr-1127.md) |  |
| **Data Pipeline Changes** | **Additional Notes** |
| [Tasks: New Column in Views and SQL Command Output](2023_04/bcr-1080.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [SNOWPIPE Commands: New INVALID_REASON Column in Output](2023_04/bcr-1085.md) |  |
| [Snowpipe Streaming Invalidates Older Versions of Snowflake Ingest SDK and the Kafka Connector](2023_04/bcr-1102.md) |  |
| **Data Sharing Changes** | **Additional Notes** |
| [Privileges: WITH GRANT OPTION No Longer Allowed When Granting Privileges to Shares](2023_04/bcr-1096.md) |  |
| **Data Governance Changes** | **Additional Notes** |
| [Object Tagging: Tag Assignment Not Allowed When Creating Secondary Databases](2023_04/bcr-961.md) |  |
| **Replication Changes** | **Additional Notes** |
| [Alerts: Support for Account and Database Replication](2023_04/bcr-1023.md) |  |
| [Users and Groups: Changes to Initial Replication](2023_04/bcr-1044.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [UDFs: Input to JavaScript UDTFs Grouped Into a Single Partition](2023_04/bcr-595.md) |  |
| **Web Interface Changes** | **Additional Notes** |
| [Snowsight: Default Interface for All Users of Snowflake On Demand™](2023_04/bcr-969.md) |  |
| [Snowsight: Default Interface for New Users](2023_04/bcr-1113.md) |  |

---
title: 2023_05 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05_bundle.md
section: Release Notes
---

# 2023_05 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.23 release (July 11-12) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in 7.29 release (August 22-23) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in 7.34 release (September 27-28) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Reader Accounts: ORGADMIN Role Removed from Accounts](2023_05/bcr-1045.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [Database Roles: Sharing Database Roles with Future Grants Not Allowed](2023_05/bcr-1144.md) |  |
| [Materialized Views: Failed Refresh Invalidates a Materialized View](2023_05/bcr-1178.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [New Functions: ARRAY_SORT, ARRAY_MIN, and ARRAY_MAX May Conflict With Similarly Named UDFs](2023_05/bcr-1135.md) |  |
| [GET_QUERY_OPERATOR_STATS and EXPLAIN Functions and Commands: Parent Operators Represented by Arrays](2023_05/bcr-1175.md) |  |
| [SHOW PARAMETERS: Changes to Retention Time Values for Databases Created From a Share](2023_05/bcr-1146.md) |  |
| [Managed Account Commands: Changes to Output](2023_05/bcr-1193.md) |  |
| [EXTRACT_SEMANTIC_CATEGORIES Function: International Tag Values](2023_05/bcr-1110.md) |  |
| [GRANT and REVOKE Commands: Changes to the Output for a Failed Grant](2023_05/bcr-515.md) |  |
| [SHOW TABLES Command: Event Tables Listed and New Columns Added to Output](2023_05/bcr-1006-1157.md) |  |
| [SHOW Commands: Pagination Support](2023_05/bcr-1080.md) |  |
| [SHOW SHARES Command: Changes to Output and New OWNER_ACCOUNT Column in Output](2023_05/bcr-1180.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [File Formats: Validation of Format Options](2023_05/bcr-1134.md) |  |
| **Data Governance Changes** | **Additional Notes** |
| [SHOW TAGS: Shared Tags Require the READ Privilege on the Tag](2023_05/bcr-1196.md) |  |

---
title: 2023_06 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06_bundle.md
section: Release Notes
---

# 2023_06 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.29 release (August 22-23) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in 7.34 release (September 27-28) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in 7.41 release (November 11-14, 2023) to **Generally Enabled**; account admins can no longer enable or disable the bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [WRITE/USAGE stage privileges: Directory table refreshes allowed](2023_06/bcr-1222.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [Cloning: Alerts cloned when cloning databases or schemas](2023_06/bcr-1211.md) |  |
| [Sequences and columns: Changes to SHOW command, view, and GET_DDL function output](2023_06/bcr-1225.md) |  |
| [Query History: Queries for alert conditions and actions included in history](2023_06/bcr-1233.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [SHOW USERS command: Output filtered based on privileges granted to active role](2023_06/bcr-975.md) |  |
| [SHOW STAGES command: New columns](2023_06/bcr-1131.md) |  |
| [SYSTEM$GET_PRIVATELINK_CONFIG function: OCSP account identifier URL added to output](2023_06/bcr-1212.md) |  |
| [DYNAMIC_TABLE_REFRESH_HISTORY function: DATA_TIMESTAMP value in output displayed in new format](2023_06/bcr-1231.md) |  |
| [New function: ARRAY_FLATTEN may conflict with similarly named UDFs](2023_06/bcr-1239.md) |  |
| [CREATE ALERT and ALTER ALERT commands: Some validation checks no longer performed on individual statements in conditions and actions](2023_06/bcr-1246.md) |  |
| [SHOW commands: New column](2023_06/bcr-1281.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Query and task history views and functions: New columns](2023_06/bcr-1147.md) |  |
| [ALERT_HISTORY view and function (Account Usage and Information Schema): Changes to output when action contains RETURN statement](2023_06/bcr-1183.md) |  |
| [ROLES view (Account Usage): New columns and new value for ROLE_TYPE column](2023_06/bcr-1229-1240.md) |  |
| [GRANTS_TO_ROLES View: New Column in the Output and New Values for Existing Columns](2023_06/bcr-1240.md) |  |
| [TABLES view (Account Usage and Information Schema): New column and column values](2023_06/bcr-1260.md) |  |
| [Task views and functions (Account Usage and Information Schema): New column](2023_06/bcr-1279.md) |  |
| [Task views and functions (Account Usage and Information Schema): ATTEMPT_NUMBER value changed to 1-based](2023_06/bcr-1280.md) |  |
| [METERING_HISTORY view (Account Usage): New column](2023_06/bcr-1282.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [Snowpipe: Modification of auto-ingest notification integration queue for Azure and GCP not allowed](2023_06/bcr-1186.md) |  |
| [PUT command on GCP: OVERWRITE parameter must be set to TRUE to overwrite files](2023_06/bcr-1253.md) |  |
| **Data Pipelines Changes** | **Additional Notes** |
| [Dynamic tables: TARGET_LAG parameter set to less than 1 minute for new or modified tables results in error](2023_06/bcr-1247.md) |  |
| [Tasks: Graph completion time differs from final task](2023_06/bcr-1251.md) |  |
| **Data Sharing Changes** | **Additional Notes** |
| [GRANT OWNERSHIP command: Ownership transfer not allowed for shared databases](2023_06/bcr-1181.md) |  |
| [Database roles: Updated error messages when granting to a share](2023_06/bcr-1220.md) |  |
| [Reader accounts: DROP ACCOUNT command not supported](2023_06/bcr-1271.md) |  |
| **Data Governance Changes** | **Additional Notes** |
| [SEMANTIC_CATEGORY system tag: Allowed values constraint removed](2023_06/bcr-1295.md) |  |
| **Replication Changes** | **Additional Notes** |
| [BLOCK_NON_READLIST_OPERATIONS_ON_STAGES_IN_SECONDARY: Parameter set to TRUE by default](2023_06/bcr-1234.md) |  |
| **Developer and Extensibility Changes** | **Additional Notes** |
| [TABLES views (Account Usage and Information Schema): TABLE_TYPE column value shows correct value for event tables](2023_06/bcr-1169.md) |  |
| [Native Apps: GET_DDL error message updated on APPLICATION PACKAGE](2023_06/bcr-1228.md) |  |
| [Procedures (caller’s rights): SQL statements that include PUT and GET commands produce a compiler error](2023_06/bcr-1244.md) |  |
| [Native Apps: Different privileges required to rename APPLICATION and APPLICATION PACKAGE](2023_06/bcr-1249.md) |  |
| [UDTFs: Default column names updated for Python vectorized UDTFs](2023_06/bcr-1275.md) |  |

---
title: 2023_07 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07_bundle.md
section: Release Notes
---

# 2023_07 Bundle (Generally Enabled)

## Bundle History

1. Introduced in the 7.34 release (September 27-28) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in 7.41 release (November 11-14, 2023) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.2 release (January 15-17, 2024) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **SQL Changes — General** | **Additional Notes** |
| [Table Aliases: Changes to Name Resolution for Quoted Column Identifiers](2023_07/bcr-881.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DESC SHARE command: Specify “DATABASE ROLE” in the KIND column](2023_07/bcr-1285.md) |  |
| [Network policy commands: Cannot drop active network policies](2023_07/bcr-1337.md) |  |
| [SHOW EVENT TABLES command: Add owner_role_type column](2023_07/bcr-1294.md) |  |
| [SHOW REGIONS command: Changes to region names in output](2023_07/bcr-1335.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [ALTER TABLE and ALTER VIEW commands: Enable drop operation when a row access policy is not set](2023_07/bcr-1327.md) |  |
| [DATABASE_STORAGE_USAGE_HISTORY and STORAGE_USAGE views: New column](2023_07/bcr-1333.md) |  |
| [PASSWORD_POLICIES view: New columns](2023_07/bcr-1309.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [Cloning: Table history not preserved on clone (Postponed)](un-bundled/unbundled-behavior-changes.md) | This behavior change was originally in the 2023_07 bundle and intended to become enabled by default in the 2023_08 bundle. However, it has been postponed and a new release date has not been determined. This change is not available for testing. |
| **Data Governance** | **Additional Notes** |
| [Grant on Native Applications: Must grant access to tags and policies](2023_07/bcr-1274.md) |  |
| **Snowflake CLI, Connectors, Drivers, and SQL API** | **Additional Notes** |
| [PUT command: Drivers affected by upcoming Google authentication method changes](2023_07/bcr-1345.md) |  |
| **Web Interface** | **Additional Notes** |
| [Snowsight worksheets and dashboards: Changes to formatting of query results](2023_07/bcr-1314.md) |  |
| [Snowsight worksheets: Changes to working with versions](2023_07/bcr-1313.md) |  |

---
title: 2023_08 Bundle (Generally Enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08_bundle.md
section: Release Notes
---

# 2023_08 Bundle (Generally Enabled)

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 7.41 release (November 11-14, 2023) with status **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.2 release (January 15-17, 2024) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.7 release (February 19-21, 2024) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

|  |  |
| --- | --- |
| **SQL Changes — General** | **Additional Notes** |
| [ACCESS_HISTORY View: New parent_query_id and root_query_id columns](2023_08/bcr-1265.md) |  |
| [DIV0 and DIV0NULL: Change to results exceeding the output scale](2023_08/bcr-1400.md) |  |
| [Native Apps: Queries that use a reference removed from an app’s manifest file fail](2023_08/bcr-1218.md) |  |
| [SHOW commands for objects owned by an application: New column owner_role_type](2023_08/bcr-1370.md) |  |
| [SHOW ORGANIZATION ACCOUNTS command / ACCOUNTS view (Organization Usage): New Column](2023_08/bcr-1358.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [ALTER TABLE: Incompatible Default Values No Longer Allowed in New Columns](2023_08/bcr-1425.md) |  |
| [Bind variables: No longer ignored as parameters for some built-in table functions](2023_08/bcr-1410.md) |  |
| [DESCRIBE TABLE: New column](2023_08/bcr-1350.md) |  |
| [FUNCTIONS view (Account Usage): New columns](2023_08/bcr-1362.md) |  |
| [New function: MAP_KEYS may conflict with similarly named UDFs](2023_08/bcr-1430.md) |  |
| [Replication: Add support for secret object](2023_08/bcr-1278.md) |  |
| [SHOW GRANTS command: Updates for managed access schema](2023_08/bcr-1397.md) |  |
| [SHOW RELEASE DIRECTIVES command: new columns](2023_08/bcr-1376.md) |  |
| [SHOW TABLES command: New is_hybrid column](2023_08/bcr-1415.md) |  |
| [SHOW TASKS and DESCRIBE TASK commands: New columns](2023_08/bcr-1385-1414.md) |  |
| [SYSTEM$REFERENCE function: Creating a Reference with Mismatched Object Types Fails](2023_08/bcr-1315.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [DESC TABLE command, SHOW COLUMNS command, and COLUMNS views: Add new SchemaEvolutionRecord column](2023_08/bcr-1377.md) |  |
| [FUNCTIONS and PROCEDURES views (INFORMATION_SCHEMA): Corrections to columns When name contains special characters](2023_08/bcr-1404.md) |  |
| [QUERY_HISTORY view and table functions: New MULTI_STATEMENT value in the query_type column for multi-statement queries](2023_08/bcr-1214.md) |  |
| [SHOW TABLES command / TABLES view: New is_iceberg column](2023_08/bcr-1448.md) |  |
| [SYSTEM$ALLOWLIST function: Fail query when socket connection hangs](2023_08/bcr-1357.md) |  |
| **Data Loading / Unloading Changes** | **Additional Notes** |
| [Snowpipe: Multiple auto-ingest notification integrations with the same URL not allowed for Azure and GCP](2023_08/bcr-1394.md) |  |
| **Data Pipelines** | **Additional Notes** |
| [Dynamic tables: Added support for MONITOR privileges](2023_08/bcr-1373.md) |  |
| [Dynamic tables: OPERATE privilege on upstream dynamic tables required for initial refresh](2023_08/bcr-1371.md) |  |
| [Tasks: Automatically suspend failed task runs](2023_08/bcr-1412.md) |  |
| [Tasks: New BACKFILL_INFO column in views](2023_08/bcr-1375.md) |  |
| **Data Governance** | **Additional Notes** |
| [SHOW FUNCTIONS commands: New is_data_metric column](2023_08/bcr-1248.md) |  |
| [TABLE_STORAGE_METRICS view (Account Usage): New column](2023_08/bcr-1361.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [Logging and tracing: Logging of unhandled exceptions in handler code on by default](2023_08/bcr-1428.md) |  |
| [Snowflake Native App Framework: Block creating event tables and temporary stages within an application package](2023_08/bcr-1366.md) |  |
| **Snowflake CLI, Connectors, Drivers, and SQL API** | **Additional Notes** |
| [Snowflake Native App Framework: Enforce REFERENCE usage on databases containing tags and policies](2023_08/bcr-1367.md) |  |
| **Web Interface** | **Additional Notes** |
| [Snowsight: Default interface for all users in Standard Edition accounts](2023_08/bcr-1338.md) |  |

## Bundle Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Postponed | BCR-1325 | 19-Jan-24 |
| Updated | BCR-1325 is now included in the 2024_06 bundle and listed as [CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs](2024_06/bcr-1722.md) | 23-Jul-24 |

---
title: 2024 Performance Improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements-2024.md
section: Release Notes
---

# 2024 Performance Improvements

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

The following performance improvements were introduced in 2024:

| Released | Description | Impact |
| --- | --- | --- |
| December 2024 | Improved sharing of common or similar parts of a query. | Reduces query execution time for queries with multiple WITH clauses. |
| December 2024 | Improved scaling of document pre-processing and inference in Document AI. | Decreases processing time of documents. |
| November 2024 | Top-K pruning for queries that contain aggregate functions. | Expands [top-K pruning](../user-guide/querying-top-k-pruning-optimization.md) to include queries that contain [aggregate functions](../sql-reference/functions-aggregation.md). |
| October 2024 | Improved performance for queries that have equivalent (or similar) subqueries or sub-expressions. | Reduces query execution time by eliminating duplicate parts of a query plan. |
| October 2024 | Improved handling of skew. | Reduces query execution time by automatically detecting and resolving skew in the build side of joins. |
| October 2024 | Search Optimization Update: Support for [join queries](../user-guide/search-optimization/join-queries.md). (General Availability) | Improves the performance of join queries that have a small number of distinct values on the build side of the join. |
| October 2024 | Improved metadata replication. | Reduces the time spent in the SECONDARY_UPLOADING_INVENTORY, PRIMARY_UPLOADING_METADATA, and SECONDARY_DOWNLOADING_METADATA phases of a replication refresh by optimizing serverless compute allocation. This improvement targets refreshes with larger metadata sizes. |
| September 2024 | Improved cloning operations through parallelization. | Reduces the time it takes to clone objects, especially for databases and schemas with extensive metadata. |
| September 2024 | Improved replication refreshes through parallelization. | Reduces the overall refresh time when replicating large volumes of data. |
| August 2024 | Improved performance for LIMIT queries. | Reduces compilation and execution time for queries that use a [LIMIT](../sql-reference/constructs/limit.md) clause to return `n` rows from a table. This optimization shrinks the partitions that are scanned to cover only the first `n` rows. |
| July 2024 | Improved table column synchronization for replication. | Reduces the time spent in the SECONDARY_DOWNLOADING_METADATA phase of a refresh operation. |
| July 2024 | Improved warehouse utilization for queries that scan only a small amount of micro-partitions when compared to the compute resources that are available to the virtual warehouse. | Faster execution for queries with expensive operations when scanning data from a small number of micro-partitions, which is common in BI and dashboard use cases. |
| July 2024 | Improved query processing that:   * Pushes down LIMIT clauses into aggregation nodes that do not contain any aggregations besides the ANY_VALUE function. * Eliminates redundant grouping keys when PRIMARY KEY or UNIQUE constraints are enforced by validation, or when the RELY constraint   property is used. | Faster execution for some queries with LIMIT clauses and GROUP BY statements. |
| June 2024 | Improved single instruction, multiple data (SIMD) processing. | * Reduces query execution time and improves scan performance for queries that access columns that contain NULL values. * Provides better scan performance by decoding numbers more efficiently when reading data from remote storage. |
| May 2024 | Improved efficiency of [Automatic Clustering](../user-guide/tables-auto-reclustering.md). | Reduces the cost of Automatic Clustering because it works more efficiently. |
| May 2024 | Improved object replication. | Reduces the time spent in the SECONDARY_UPLOADING_INVENTORY and SECONDARY_DOWNLOADING_METADATA phases of a refresh operation by optimizing the synchronization of some objects and the authorization mechanism for replication operations. |
| May 2024 | Reduced the latency for loading most Parquet files by up to 50% when the file format option, [USE_VECTORIZED_SCANNER](../sql-reference/sql/copy-into-table.md), is set to `TRUE`. | The vectorized scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file and reduces the ingestion latency by downloading only relevant sections of the Parquet file into memory, such as the subset of selected columns. |
| May 2024 | Improved evaluation of aggregations so they are made at more intermediate join trees. | Reduces query execution time for complex queries with aggregations by reducing the amount of data that needs to be processed at the earliest point possible. |
| May 2024 | Improved query execution times for queries that spend a significant amount of time communicating across virtual warehouse nodes. | Increases throughput between compute resources in a warehouse. Each warehouse is a cluster of compute resources. |
| May 2024 | Improved top-k pruning for LIMIT and ORDER BY queries. | Reduces execution time for top-k queries due to fewer scanned files and file header reads. Expands existing top-k improvements to include STRING/BINARY support in ORDER BY columns. Further increases pruning efficiency by sorting the scan set in order of largest/smallest files with respect to the value domain. |
| May 2024 | Improved join order decisions by calculating selectivity estimates with more granularity. | Reduces compilation time and query execution time by calculating selectivity estimates at the micro-partition level. |
| May 2024 | Faster loading time for Python. | Improves performance for Streamlit in Snowflake apps (including Streamlit apps within a Snowflake Native App), Python worksheets, Python UDFs, and stored procedures in Python. |
| April 2024 | Reduced lock/mutex contention. | Reduces query execution times by improving scan performance in a variety of scenarios such as highly concurrent queries running on a warehouse. |
| April 2024 | Improved broadcast join decisions. | Reduces query execution time and improves memory management by optimizing broadcast joins in scenarios like right-deep join trees. |
| April 2024 | Faster query results in Snowsight. | Reduces the time it takes for query results to appear when run in Snowsight. Improvements are most noticeable for queries that return result sets larger than 10,000 rows. |
| March 2024 | Improved metadata replication. | Reduces the time spent in the PRIMARY_UPLOADING_METADATA, SECONDARY_DOWNLOADING_METADATA, and SECONDARY_UPLOADING_INVENTORY phases for metadata. |
| March 2024 | Improved query performance as a result of more accurately calculating selectivity estimates in order to optimize the order of joins. | Reduces execution time when there are mismatches between partition metadata and actual cardinality from join filters. |
| March 2024 | Improved performance for loading JSON files. | Results in lower ingestion latency of up to 25% for many JSON loading scenarios. |
| February 2024 | Improved object replication. | Reduces the time spent in the PRIMARY_UPLOADING_METADATA, SECONDARY_DOWNLOADING_METADATA, and SECONDARY_UPLOADING_INVENTORY phases of a refresh operation by optimizing portions of the snapshot operation and the way some objects are added to the replication inventory. |
| February 2024 | Support for the `upper` and `lower` collation specifications added to some functions. | Ability to set the `upper` and `lower` collation specifications for some functions. The `upper` and `lower` collation specifications perform better than the `ci` specification for some use cases. The `upper` and `lower` collation specifications are now supported for the following functions: [CHARINDEX](../sql-reference/functions/charindex.md), [CONTAINS](../sql-reference/functions/contains.md), [ENDSWITH](../sql-reference/functions/endswith.md), [POSITION](../sql-reference/functions/position.md), [SPLIT](../sql-reference/functions/split.md), [SPLIT_PART](../sql-reference/functions/split_part.md), and [STARTSWITH](../sql-reference/functions/startswith.md). For more information, see [Differences between ci and upper / lower](../sql-reference/collation.md). |
| January 2024 | Improved execution time for LIMIT 0 queries. | Reduces execution time for queries that use a count of `0` with [LIMIT](../sql-reference/constructs/limit.md), which is often used by applications to return column headings and data types for query results. |
| January 2024 | General Availability of [larger warehouses](../user-guide/warehouses-overview.md) (5X-LARGE and 6X-LARGE) in Microsoft Azure regions, excluding Azure Government regions. | Ability to use larger compute resources for memory-intensive queries compared to smaller warehouses. |

---
title: 2024_01 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01_bundle.md
section: Release Notes
---

# 2024_01 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.2 release (January 15-17) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.7 release (February 19-21) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.12 release (March 26-27) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see the
> Bundle Notes Change Log.

|  |  |
| --- | --- |
| **SQL Changes — General** | **Additional Notes** |
| [ASOF JOIN syntax: Restricted use of keywords](2024_01/bcr-1138.md) |  |
| [Sequences and columns: New sequences and columns use NOORDER by default](2024_01/bcr-1483.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DESC SECRET command: Add INTEGRATION_NAME column](2024_01/bcr-1494.md) |  |
| [GRANT PRIVILEGES … TO ROLE command: Creating instances and privilege format](2024_01/bcr-1462.md) |  |
| [IS_DATABASE_ROLE_IN_SESSION: Name resolution with policy and UDF evaluation](2024_01/bcr-1499.md) |  |
| [SHOW commands: Update OWNER column for objects owned by Snowflake](2024_01/bcr-1475.md) |  |
| [SHOW FUNCTIONS and SHOW PROCEDURES commands: Changes to output](2024_01/bcr-1508.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Account Usage views: Add support for versioned schemas in Snowflake Native Apps](2024_01/bcr-1463.md) |  |
| [Account Usage views: Column updates to support the Snowflake Native App Framework](2024_01/bcr-1379.md) |  |
| [GRANTS_TO_ROLES View (Account Usage): Match SHOW GRANTS TO ROLE command](2024_01/bcr-1481.md) |  |
| [LOAD_HISTORY and COPY_HISTORY Information Schema views: Showing only post-truncate load history](2024_01/bcr-1493.md) |  |
| [NOTIFICATION_HISTORY table function: New column in output](2024_01/bcr-1470.md) |  |
| **Data Loading** | **Additional Notes** |
| [Snowpipe and Tasks: Updates to Azure Event Grid notifications](2024_01/bcr-1421.md) |  |
| **Data Pipelines** | **Additional Notes** |
| [Dynamic tables: disallow using SQL UDFs and UDTFs in new dynamic tables](2024_01/bcr-1489.md) |  |
| **Data Governance** | **Additional Notes** |
| [Versioned schemas: Disallow policy assignments across schemas](2024_01/bcr-1453.md) |  |
| [Versioned schemas: Disallow tag propagation](2024_01/bcr-1401.md) |  |
| **Replication** | **Additional Notes** |
| [Replication Groups: New column in output of SHOW command and Information Schema View](2024_01/bcr-1490.md) |  |
| **Extensibility & Developer** | **Additional Notes** |
| [Snowflake Native Apps: Introduce maximum number of scanned patches](2024_01/bcr-1466.md) |  |

## Bundle Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Moved | [Replication: Stages, pipes, storage integrations, and load history](2024_02/bcr-1461.md)  Moved to the [2024_02 BCR Bundle](2024_02_bundle.md). | 15-Feb-24 |
| Rewrite | [SHOW commands: Update OWNER column for objects owned by Snowflake](2024_01/bcr-1475.md)  Specify the commands and values that are changing more clearly. | 12-Feb-24 |
| Update | [GRANT PRIVILEGES … TO ROLE command: Creating instances and privilege format](2024_01/bcr-1462.md)  Remove SNOWFLAKE.CORE.COMPARE from the list of classes that are changing. | 09-Feb-24 |
| Update | * [Account Usage views: Column updates to support the Snowflake Native App Framework](2024_01/bcr-1379.md) * Retitle to “Account Usage views: Column updates to support the Snowflake Native App Framework (Pending)” * Specify `role_type` column and `APPLICATION` value for the AGGREGATE_QUERY_HISTORY and QUERY_HISTORY views. * Remove the reference to the RESOURCE_GROUPS view. | 07-Feb-24 |

---
title: 2024_02 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02_bundle.md
section: Release Notes
---

# 2024_02 Bundle

> **Note:**
>
> This bundle contains a replication behavior change that might result in objects being dropped in target accounts. For more
> information, see [Replication: Stages, pipes, storage integrations, and load history](2024_02/bcr-1461.md).

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.7 release (February 19-21) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.12 release (March 26-27) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.17 release (April 30 - May 7) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **SQL — General** | **Additional Notes** |
| [CREATE and ALTER DATABASE commands: Database names starting with “datacloud$” no longer allowed](2024_02/bcr-1549.md) |  |
| [Snowflake Native App Framework: Update error message when an app is disabled](2024_02/bcr-1551.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DESCRIBE ICEBERG TABLE command: New column](2024_02/bcr-1554.md) |  |
| [SHOW/DESC SERVICE commands and Information Schema and Account Usage SERVICES views: New IS_JOB column](2024_02/bcr-1516.md) |  |
| [SQL functions: Passing in columns that have the upper, lower, or trim collation specifier](2024_02/bcr-1535.md) |  |
| [SHOW APPLICATIONS command: New UPGRADE_STATUS column](2024_02/bcr-1521.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Information Schema: New columns in output for QUERY_HISTORY, QUERY_HISTORY_BY_\* functions](2024_02/bcr-1431-1524-1540.md) |  |
| [QUERY_HISTORY view (Account Usage): Changes to columns and new columns](2024_02/bcr-1497-1524-1540.md) |  |
| [Account Usage views: Additional rows added to support versioned schemas](2024_02/bcr-1544.md) |  |
| **Data Loading** | **Additional Notes** |
| [Transforming data during a load: Disallow using MATCH_BY_COLUMN_NAME with a SELECT statement](2024_02/bcr-1514.md) |  |
| **Data Pipelines** |  |
| [Dynamic tables: Return value changes and new columns added to DYNAMIC_TABLE_GRAPH_HISTORY, DYNAMIC_TABLE_REFRESH_HISTORY, and SHOW DYNAMIC TABLES](2024_02/bcr-1543.md) |  |
| **Data Lake** | **Additional Notes** |
| [GET_DDL function: Return source Apache Iceberg™ data types](2024_02/bcr-1553.md) |  |
| [Apache Iceberg™ tables: Updates to metadata retention period](2024_02/bcr-1519.md) |  |
| **Replication** | **Additional Notes** |
| [Replication: Stages, pipes, storage integrations, and load history](2024_02/bcr-1461.md) |  |
| [Replication: Skip external and Apache Iceberg™ tables during refresh operation](2024_02/bcr-1528.md) |  |
| [Replication: Changes to refresh operations that fail with dangling reference errors](2024_02/bcr-1555.md) |  |
| **Extensibility & Developer** | **Additional Notes** |
| [Python Snowpark Stored Procedures and UDFs: Tracing improvements in Event table](2024_02/bcr-1520.md) |  |
| [Snowpark Container Services: Error if image is not found when creating service or job](2024_02/bcr-1550.md) |  |
| **Virtual Warehouses** | **Additional Notes** |
| [Query Acceleration Service: Expanded support for INSERT statements](2024_02/bcr-1487.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| [Python UDFs: Changes to return value types for semi-structured data](2024_03/bcr-1546.md) | **Moved** to [2024_03 Bundle](2024_03_bundle.md) | 26-Mar-2023 |
| [Replication: Skip external and Apache Iceberg™ tables during refresh operation](2024_02/bcr-1528.md) | **Added** to the 2024_02 BCR Bundle | 20-Feb-24 |
| [Query Acceleration Service: Expanded support for INSERT statements](2024_02/bcr-1487.md) | **Added** to the 2024_02 BCR Bundle | 20-Feb-24 |
| 2024_02 BCR Bundle notes | Pending release (preview) | 19-Feb-24 |
| [Replication: Stages, pipes, storage integrations, and load history](2024_02/bcr-1461.md) | **Moved** to the 2024_02 BCR Bundle | 15-Feb-24 |
| ALTER USER command: Case sensitivity when specifying a default role (Pending) | **Removed** from *SQL Changes — Commands & Functions* | 08-Feb-24 |
| 2024_02 BCR Bundle notes | Initial publication (preview) | 06-Feb-24 |

---
title: 2024_03 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03_bundle.md
section: Release Notes
---

# 2024_03 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.12 release (March 26-27) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.17 release (April 30 - May 7) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.22 release (June 11-15, 2024) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Network policies: Apply network policy to presigned URL](2024_03/bcr-1558.md) |  |
| [Snowflake Native Apps Framework: Changes to the MANAGE EVENT SHARING privilege](2024_03/bcr-1576.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [New SQL functions: GREATEST_IGNORE_NULLS and LEAST_IGNORE_NULLS may conflict with similarly named UDFs](2024_03/bcr-1354.md) |  |
| [SHOW OBJECTS command: New column and changes to output](2024_03/bcr-1529.md) |  |
| [SHOW TABLES command: New column is_dynamic](2024_03/bcr-1580.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Account Usage and Information Schema functions views: New column(s) in output](2024_03/bcr-1406.md) |  |
| [Account Usage views: Add support for versioned schemas in Snowflake Native App](2024_03/bcr-1570.md) |  |
| [FUNCTIONS view (Information Schema): Add support for data metric function](2024_03/bcr-1569.md) |  |
| **Data Pipelines** | **Additional Notes** |
| [Apache Iceberg™ tables: New write location for empty string BASE_LOCATION](2024_03/bcr-1534.md) |  |
| **Replication** | **Additional Notes** |
| [Replication: Skip event tables and hybrid tables during refresh operation](2024_03/bcr-1560-1582.md) |  |
| **Extensibility & Developer** | **Additional Notes** |
| [Account Usage QUERY_HISTORY View: Change to QUERY_TAG](2024_03/bcr-1571.md) |  |
| [Python UDFs: Changes to return value types for semi-structured data](2024_03/bcr-1546.md) |  |
| [SHOW ENDPOINTS command: Output column name change](2024_03/bcr-1563.md) |  |
| **Web Interface** | **Additional Notes** |
| [Retirement window for free listings published on the Snowflake Marketplace](2024_03/bcr-1574.md) |  |
| [Snowsight: Default all users in all accounts to Snowsight](2024_03/bcr-1511.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| 2024_03 BCR Bundle notes | Pending release (preview) | 25-Mar-24 |
| *QUERY_ACCELERATION_ELIGIBLE View (ACCOUNT_USAGE): Changes to columns* | **Removed** from *Virtual Warehouse* | 08-May-24 |
| *CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs* | **Removed** from *Data Governance* | 01-Jul-24 |
| Updated | BCR-1325 is now included in the 2024_06 bundle and listed as [CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs](2024_06/bcr-1722.md) | 23-Jul-24 |

---
title: 2024_04 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04_bundle.md
section: Release Notes
---

# 2024_04 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.17 release (April 30 - May 7) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.22 release (June 11-15) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.27 release (July 22-25) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Snowflake Native App Framework: Roles with the ATTACH LISTING privilege can run the DESCRIBE APPLICATION PACKAGE command](2024_04/bcr-1603.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DESC FUNCTION command: New column IS_AGGREGATE in output](2024_04/bcr-1609.md) |  |
| [DESCRIBE SESSION POLICY: Add allowed_secondary_roles column](2024_04/bcr-1621.md) |  |
| [SHOW ENDPOINTS command: New column and changes to output](2024_04/bcr-1586.md) |  |
| [SHOW SHARES command: New column (SECURE_OBJECTS_ONLY)](2024_04/bcr-1600.md) |  |
| [SHOW TABLES command, TABLES view, and GET_DDL command: Changes related to the READ ONLY property for tables](2024_04/bcr-1572.md) |  |
| [SHOW VERSIONS IN MODEL command: New column aliases](2024_04/bcr-1620.md) |  |
| [SHOW/DESC COMPUTE POOL command: New columns](2024_04/bcr-1602.md) |  |
| [SHOW/DESC SERVICE commands and Information Schema view: New STATUS column](2024_04/bcr-1596.md) |  |
| **SQL Changes - Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Account Usage views: Add support for versioned schemas in Snowflake Native Apps](2024_04/bcr-1610.md) |  |
| [NOTIFICATION_HISTORY table function: Changes to output](2024_04/bcr-1593.md) |  |
| [LISTING_CONSUMPTION_DAILY view: New columns](2024_04/bcr-1601.md) |  |
| [STAGES View (ACCOUNT_USAGE): New Column STORAGE_INTEGRATION](2024_04/bcr-1547.md) |  |
| **Virtual Warehouse** | **Additional Notes** |
| [WAREHOUSE_EVENTS_HISTORY view (ACCOUNT_USAGE): New columns and changes to events](2024_04/bcr-1616.md) |  |
| **Data Pipelines** | **Additional Notes** |
| [Dynamic tables: Updates to dynamic table default refresh mode](2024_04/bcr-1614.md) |  |
| **Data Governance** | **Additional Notes** |
| [Masking policy: Comply with the scale and precision of a column](2024_04/bcr-1355.md) |  |
| **Replication** | **Additional Notes** |
| [Replication support for CREATE <class_name> privilege](2024_04/bcr-1607.md) |  |
| **Extensibility & Developer** | **Additional Notes** |
| [Event tracing: Trace IDs propagated from parent to child through procedure calls](2024_04/bcr-1592.md) |  |
| [Snowpark Container Services: Changes to access control for services and endpoints](2024_04/bcr-1611.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| [Masking policy: Comply with the scale and precision of a column](2024_04/bcr-1355.md) | Remove references to length of a column | 28-Jul-24 |
| Logging and tracing: Default event table included (Postponed) | Behavior change postponed and announced in [Logging and tracing: Default event table included](2024_06/bcr-1598.md) | 10-May-24 |
| 2024_04 BCR Bundle notes | Pending release (preview) | 29-Apr-24 |

---
title: 2024_05 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05_bundle.md
section: Release Notes
---

# 2024_05 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.22 release (June 11-15) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.27 release (July 22-25) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.32 release (August 26-30) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [CREATE <object> commands: Changes to error messages when creating an object in a share](2024_05/bcr-1623.md) |  |
| [DESCRIBE ICEBERG TABLE command: SOURCE ICEBERG TYPE column value change](2024_05/bcr-1638.md) |  |
| [Full-text search: TOKEN_SEARCH function renamed to SEARCH](2024_05/bcr-1633.md) |  |
| [SHOW MODELS Command: New columns in output](2024_05/bcr-1653.md) |  |
| [SHOW REGIONS command: Changes to region display names in output](2024_05/bcr-1635.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [LISTING_ACCESS_HISTORY view: New columns](2024_05/bcr-1641.md) |  |
| [SNOWPARK_CONTAINER_SERVICES_HISTORY View (ACCOUNT_USAGE): New Columns](2024_05/bcr-1649.md) |  |
| **Data Lake Changes** | **Additional Notes** |
| [Apache Iceberg™ tables: Metadata file naming convention change](2024_05/bcr-1645.md) |  |
| [Apache Iceberg™ tables: version-hint.text file no longer generated](2024_05/bcr-1658.md) |  |
| **Extensibility & Developer Changes** | **Additional Notes** |
| [ALTER APPLICATION PACKAGE command: Expanded validation](2024_05/bcr-1627.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| [Snowpark Container Services: New default values and validation of resource requirements for a service](2024_06/bcr-1648.md) | Behavior change postponed | 13-Jun-24 |
| 2024_05 BCR Bundle notes | Pending release (preview) | 10-Jun-24 |

---
title: 2024_06 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06_bundle.md
section: Release Notes
---

# 2024_06 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.27 release (July 22-25) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 8.32 release (August 26-30) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 8.38 release (October 7-9) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DESC COMPUTE POOL command: New columns in output and deprecation of SYSTEM$GET_COMPUTE_POOL_STATUS](2024_06/bcr-1594.md) |  |
| [Extend SHOW USERS, SHOW TERSE USERS, and SELECT FROM account_usage.users: New columns](2024_06/bcr-1690.md) |  |
| [SHOW COMPUTE POOLS and DESC COMPUTE POOL commands: New column in output](2024_06/bcr-1595-1652.md) |  |
| [SHOW FUNCTIONS IN MODEL Command: New column in output](2024_06/bcr-1678.md) |  |
| [SHOW ORGANIZATION ACCOUNTS command: Repurposed for new functionality](2024_06/bcr-1712.md) |  |
| [SHOW/DESC SERVICE commands and Information Schema SERVICES view: New columns](2024_06/bcr-1717-1723.md) |  |
| [SHOW SERVICES and DESCRIBE SERVICE commands: New format for the DNS name of a service](2024_06/bcr-1656.md) |  |
| [Snowflake Native App Framework: Enable event sharing for all apps in an account](2024_06/bcr-1697.md) |  |
| [TASK_HISTORY function: Consistent FAILED state for timed-out tasks](2024_06/bcr-1681.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [FUNCTIONS view (Account Usage and Information Schema): New column IS_AGGREGATE](2024_06/bcr-1617.md) |  |
| [FUNCTIONS view (ACCOUNT_USAGE): Stored procedures are no longer included](2024_06/bcr-1671.md) |  |
| [NOTIFICATION_HISTORY function: New source type BUDGET in MESSAGE_SOURCE column](2024_06/bcr-1687.md) |  |
| **Data Pipeline Changes** | **Additional Notes** |
| [Tasks: Reducing the number of SKIPPED tasks](2024_06/bcr-1710.md) |  |
| **Data Lake Changes** | **Additional Notes** |
| [Apache Iceberg™ tables: Writing data files to subdirectories in Amazon S3](2024_06/bcr-1706.md) |  |
| **Data Governance Changes** | **Additional Notes** |
| [CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs](2024_06/bcr-1722.md) |  |
| [Custom Classification: Replicate and clone instances](2024_06/bcr-1688.md) |  |
| [NETWORK POLICIES and NETWORK RULES views (Account Usage): New columns](2024_06/bcr-1661.md) |  |
| [Security: Update dangling network policy references](2024_06/bcr-1622.md) |  |
| [SHOW commands: Privilege updates](2024_06/bcr-1665.md) |  |
| [Snowflake Native App Framework: Enable event sharing for all apps in an account](2024_06/bcr-1697.md) |  |
| **Extensibility & Developer Changes** | **Additional Notes** |
| [Logging and tracing: Default event table included](2024_06/bcr-1598.md) |  |
| [Snowflake Native App Framework: Event sharing continues after consumer disables event table](2024_06/bcr-1724.md) |  |
| [Snowpark Container Services: New default values and validation of resource requirements for a service](2024_06/bcr-1648.md) |  |
| [Snowpark Container Services: New stage mount allotment limit per compute pool node](2024_06/bcr-1698.md) |  |
| [Tracing: Span and trace IDs propagated from parent to child through procedure calls](2024_06/bcr-1683.md) |  |
| **Web Interface** | **Additional Notes** |
| *Snowsight: Default all users, including VPS and Private Link, to Snowsight (Postponed)* |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| *Extend SHOW USERS, SHOW TERSE USERS, and SELECT FROM account_usage.users: New columns (Pending)* | **Added** to *SQL Changes — Usage Views & Information Schema Views / Table Functions* | 24-Jul-24 |
| *Snowsight: Default all users, including VPS and Private Link, to Snowsight* | **Postponed** | 15-Aug-2024 |

---
title: 2024_07 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07_bundle.md
section: Release Notes
---

# 2024_07 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.32 release (August 26-30, 2024) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 8.38 release (October 7-9, 2024) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.2 release (January 22-February 13, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Snowflake Information Schema views: New column GRANTED_TO](2024_07/bcr-1753.md) |  |
| **SQL Changes - Commands & Functions** | **Additional Notes** |
| [SHOW <class_name> INSTANCES commands: Changes in output](2024_07/bcr-1735.md) |  |
| [SHOW VERSIONS IN MODEL command: module_name column in output renamed model_name](2024_07/bcr-1700.md) |  |
| [SHOW WAREHOUSES command: New column in output](2024_07/bcr-1725.md) |  |
| [SHOW/DESC PIPE[S] commands: New column in output](2024_07/bcr-1659.md) |  |
| [Snowflake Native App Framework: Changes to the SHOW APPLICATION and DESC APPLICATION commands](2024_07/bcr-1729.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [APPLICATION_STATE view: Add columns to provide additional information about app](2024_07/bcr-1716.md) |  |
| [LISTING_EVENTS_DAILY: new columns in output](2024_07/bcr-1566.md) |  |
| [NOTIFICATION_HISTORY table function: Removal of the message column from the output](2024_07/bcr-1742.md) |  |
| [Snowpark Container Services container logs: Changes to resource attributes](2024_07/bcr-1682.md) |  |
| **Virtual Warehouse Changes** | **Additional Notes** |
| [Query Acceleration Service: Expanded support for COPY statements](2024_07/bcr-1749.md) |  |
| **Data Sharing Changes** | **Additional Notes** |
| [New privilege MANAGE SHARE TARGET replaces CREATE SHARE to add accounts to shares](2024_07/bcr-1734.md) |  |
| **Extensibility & Developer Changes** | **Additional Notes** |
| [Addition of the uses_gpu parameter](2024_07/bcr-1704.md) |  |
| [Telemetry: Event table attribute name and value changes](2024_07/bcr-1668.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| BCR-1692: Change default value of DEFAULT_SECONDARY_ROLES to ALL | Moved from 2024_07 bundle to 2024_08 bundle. | 03-Oct-2024 |
| Updated as complete | Bundle 2024_07 announcements released. | 30-Aug-2024 |
| Updated as pending | Bundle 2024_07 announcements released as pending. | 26-Aug-2024 |
| Released as preview | Bundle 2024_07 announcements released as preview. | 23-Aug-2024 |

---
title: 2024_08 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08_bundle.md
section: Release Notes
---

# 2024_08 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 8.38 release (October 7-9, 2024) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 9.2 release (January 22-February 13, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.7 release (March 17-27, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Default value for the SYNC_PASSWORD parameter on SCIM security integrations has changed](2024_08/bcr-1768.md) |  |
| [Default value of DEFAULT_SECONDARY_ROLES object property on users changed to (‘ALL’)](2024_08/bcr-1692.md) |  |
| [Multi-factor authentication enrollment enforced by default for new Snowflake accounts](2024_08/bcr-1784.md) |  |
| [Network security: Cannot activate an empty network policy](2024_08/bcr-1761.md) |  |
| [Network security: Cannot attach egress network rule to a network policy](2024_08/bcr-1760.md) |  |
| [Stronger Default Password Policies](2024_08/bcr-1776.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [SQL data types: Changes to maximum length, output, and error messages](2024_08/bcr-1779.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [CREATE … CLONE command: Cloning databases and schemas that contain hybrid tables](2024_08/bcr-1792.md) |  |
| [CREATE DYNAMIC ICEBERG TABLE command: Write data types to table files](2024_08/bcr-1773.md) |  |
| [Object tagging commands, functions, and views: New column and property](2024_08/bcr-1777.md) |  |
| [SHOW and DESCRIBE commands for listings: New columns in output](2024_08/bcr-1756.md) |  |
| [SHOW DYNAMIC TABLES command and DYNAMIC_TABLES function: New changes to output](2024_08/bcr-1796.md) |  |
| [SHOW ICEBERG TABLES command: New column in output](2024_08/bcr-1745.md) |  |
| [SHOW MANAGED ACCOUNTS command: New and modified columns in output](2024_08/bcr-1738.md) |  |
| [SHOW SCHEMAS command: New columns in output](2024_08/bcr-1757.md) |  |
| [SHOW SERVICE CONTAINERS IN SERVICE command: New columns in output](2024_08/bcr-1787.md) |  |
| [SHOW TASKS/DESC TASK commands: New columns in output](2024_08/bcr-1719-1807.md) |  |
| [SHOW USERS command: NULL values replace default values in output](2024_08/bcr-1798.md) |  |
| [SHOW VERSIONS IN MODEL: New columns in output](2024_08/bcr-1778.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [DATABASES and SCHEMATA views, and SHOW DATABASES and SHOW SCHEMAS commands: New column in output](2024_08/bcr-1759.md) |  |
| [LISTING_ACCESS_HISTORY view (DATA_SHARING_USAGE): Deprecate LISTING_OBJECTS_ACCESSED column](2024_08/bcr-1758.md) |  |
| [METERING_HISTORY view (ACCOUNT_USAGE): Database-level output for COPY_FILES](2024_08/bcr-1728.md) |  |
| [SHOW/DESC SERVICE commands and SERVICES view (ACCOUNT_USAGE and INFORMATION SCHEMA): New column MIN_READY_INSTANCES](2024_08/bcr-1793.md) |  |
| [TABLES views and SHOW OBJECTS command: New column IS_HYBRID](2024_08/bcr-1732.md) |  |
| [TASK_HISTORY view (ACCOUNT_USAGE): task usage history restricted to 1 year](2024_08/bcr-1806.md) |  |
| [USERS and QUERY_HISTORY views (ACCOUNT_USAGE) and QUERY_HISTORY function: New columns](2024_08/bcr-1771.md) |  |
| [WAREHOUSE_EVENTS_HISTORY view: New columns in output](2024_08/bcr-1770.md) |  |
| [WAREHOUSE_METERING_HISTORY view (ACCOUNT_USAGE): New column](2024_08/bcr-1714.md) |  |
| **Replication** | **Additional Notes** |
| [Replication: Support for replicating machine learning models](2024_08/bcr-1746.md) |  |
| **Extensibility & Developer Changes** | **Additional Notes** |
| [Streamlit in Snowflake: Default Python version for Streamlit in Snowflake apps changes from 3.8 to 3.11](2024_08/bcr-1804.md) |  |
| [Snowflake Native App Framework: Apps with containers share events with provider when consumer event table not set](2024_08/bcr-1800.md) |  |
| [Telemetry: Event table attribute name and value changes](2024_08/bcr-1767.md) |  |
| **Web Interface** | **Additional Notes** |
| [Deprecation of worksheet results sharing and secondary roles in dashboards](2024_08/bcr-1801.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2024_08 announcements released as preview. | 04-Oct-2024 |
| [Default value for the SYNC_PASSWORD parameter on SCIM security integrations has changed](2024_08/bcr-1768.md) | **Added** to the 2024_08 BCR Bundle | 07-Oct-24 |
| [Stronger Default Password Policies](2024_08/bcr-1776.md) | **Added** to the 2024_08 BCR Bundle | 07-Oct-24 |
| [Use primary role for authorizing view and materialized view creation (Canceled)](un-bundled/bcr-1782.md) | **Added** to the 2024_08 BCR Bundle | 07-Oct-24 |
| [Multi-factor authentication enrollment enforced by default for new Snowflake accounts](2024_08/bcr-1784.md) | **Added** to the 2024_08 BCR Bundle | 07-Oct-24 |
| [WAREHOUSE_METERING_HISTORY view (ACCOUNT_USAGE): New column](2024_08/bcr-1714.md) | Column name correction. `CREDITS_USED_COMPUTE_QUERIES` replaced by `CREDITS_ATTRIBUTED_COMPUTE_QUERIES`. | 07-Oct-24 |
| [SQL data types: Changes to maximum length, output, and error messages](2024_08/bcr-1779.md) | Updated to now include information about [changes to UDF metadata in the event table](2024_08/bcr-1779.md) | 08-Oct-24 |
| [Deprecation of worksheet results sharing and secondary roles in dashboards](2024_08/bcr-1801.md) | Updated to now include Knowledge Base articles | 08-Oct-24 |
| Updated as complete | Bundle 2024_08 announcements released. | 09-Oct-2024 |
| [Multi-factor authentication enrollment enforced by default for new Snowflake accounts](2024_08/bcr-1784.md) | Updated the title and clarified the description. | 14-Oct-2024 |
| [Use primary role for authorizing view and materialized view creation (Canceled)](un-bundled/bcr-1782.md) | **Removed** from the 2024_08 BCR Bundle | 12-Nov-24 |
| [GRANT OWNERSHIP ON ROLE command: Restrict transfer of role ownership to itself (Canceled)](un-bundled/bcr-1781.md) | **Removed** from the 2024_08 BCR Bundle | 17-Dec-24 |
| [CREATE … CLONE command: Cloning databases and schemas that contain hybrid tables](2024_08/bcr-1792.md) | Updated the description; cloning is now supported for hybrid tables. | 25-Feb-25 |

---
title: 2025 Performance improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements-2025.md
section: Release Notes
---

# 2025 Performance improvements

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

The following performance improvements were introduced in 2025:

| Released | Description | Impact |
| --- | --- | --- |
| November 2025 | Enhanced [Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md) to intelligently determine when queries with LIMIT clauses can be accelerated. | More queries with LIMIT clauses (including those without ORDER BY) are now eligible for acceleration. QAS automatically determines when accelerating these queries improves performance, broadening the scope of queries that benefit from QAS. |
| November 2025 | More accurate cardinality estimation for expressions and functions, leading to better join ordering and distribution decisions. | Improves query plan quality for queries involving function expressions in join or group-by clauses. Reduces execution time for queries affected by previous estimation inaccuracies. |
| November 2025 | Improved cardinality estimation in the query optimizer using probabilistic data structures for faster and more accurate estimates during grouping operations. | Reduces compilation time and improves grouping selectivity estimation, leading to better query plans. Particularly benefits queries with complex grouping and aggregation patterns. |
| November 2025 | Improved bloom filter pushdown, enabling earlier data elimination during both SELECT and DML (UPDATE, DELETE, MERGE) query processing. | Improves execution time for join queries and DML operations by applying filters earlier in the query plan, reducing the volume of data processed. |
| October 2025 | [Insights about query performance provided in Snowsight](../user-guide/query-insights.md) (General Availability). | You can use these [query insights](../user-guide/query-insights.md) to identify queries where you can improve performance. |
| October 2025 | Enhanced predicate derivation logic to increase the coverage of filter propagation across more query patterns, generating additional derived predicates that enable earlier data elimination. | Improves execution time for queries with complex join and filter patterns by propagating more filters earlier in the query plan. |
| October 2025 | Improved probe-side join pruning on numeric columns using larger approximation structures that better represent the build-side data distribution. | Improves performance for join queries by skipping more irrelevant data files, reducing I/O and execution time. Particularly benefits joins with clustered or low-cardinality build-sides. |
| October 2025 | Simplifies the query plan by removing redundant grouping expressions based on functional dependencies. | Improves execution time for queries containing GROUP BY clauses with functionally dependent columns, reducing unnecessary computation. |
| October 2025 | Enhanced filter pushdown through window functions, allowing filters to be applied before window function computation. | Improves execution time for queries where window functions previously blocked filter pushdown, reducing the volume of data processed by the window operation. |
| October 2025 | Derives additional filter predicates from join key constraints to enable early data elimination. When a join key has a single constant value, that value is propagated as a filter to the opposite side of the join. | Improves performance for join queries by applying derived filters earlier, reducing the data processed in subsequent operations. |
| October 2025 | Improved join ordering using cost-based optimization. The optimizer now evaluates join order alternatives using a cost model for queries with multiple joins. | Improves query execution times for complex queries with multiple joins. Queries that previously had suboptimal join orders may see significant performance improvements. |
| October 2025 | Improved simplification of join conditions and HAVING clauses by factoring out common expressions. | Improves execution time for queries with complex join conditions or HAVING clauses that contain redundant or factorable expressions. |
| October 2025 | Simplifies aggregates containing simple arithmetic expressions to enable further optimizations, including reduced table scans. | Improves execution time for queries with aggregations over computed expressions. |
| October 2025 | Improved LIKE predicate selectivity estimation for more accurate query plan costing. | Improves join ordering and plan decisions for queries with LIKE predicates. |
| September 2025 | More efficient workload distribution. | Improves query execution time by detecting and adaptively redistributing workloads across nodes in the warehouse, without user intervention. |
| September 2025 | [Insights about query performance provided in Snowsight](../user-guide/query-insights.md) (Preview). | You can use these [query insights](../user-guide/query-insights.md) to identify queries where you can improve performance. |
| September 2025 | Enhanced predicate derivation to propagate filter conditions across outer join boundaries. | Improves performance for queries with outer joins by applying filters earlier in the query plan, reducing the volume of data processed in subsequent operations. |
| August 2025 | More efficient and accurate NDV estimations that lead to more effective query plans. | Improves query compilation and execution times, especially for DML statements. |
| August 2025 | Improved filters that eliminate irrelevant data early, thereby reducing the volume of data that needs to be buffered to memory or storage. These filters reduce the amount of data processed before it’s used in a sub-query or Common Table Expression (CTE). | Improves query performance for complex queries where the same data is needed across different parts of the query plan. Subsequent filter operations are more efficient, saving time and compute resources. |
| August 2025 | Improved query performance with [Snowflake Optima Indexing](../user-guide/snowflake-optima.md), which continuously analyzes your workload patterns and automatically creates and maintains search optimization indexes in the background. Snowflake Optima is only available on [Snowflake generation 2 standard warehouses (Gen2)](../user-guide/warehouses-gen2.md). | Improves performance of queries that include frequently used selective predicate patterns, such as repetitive point-lookup queries on a table. No user configuration or additional cost required. |
| August 2025 | Adaptive redistribution of skewed data during query execution. Snowflake now detects when data is unevenly distributed across processing nodes during joins and automatically redistributes work to prevent bottlenecks. Applies to both SELECT and DML operations. | Improves execution time for queries and DML operations that encounter data skew during join operations. Applies broadly across workloads. |
| July 2025 | Insights about query performance provided in the [QUERY_INSIGHTS view](../sql-reference/account-usage/query_insights.md). | You can use these [query insights](../user-guide/query-insights.md) to identify queries where you can improve performance. |
| July 2025 | Enhanced predicate derivation to propagate filter conditions above projections, generating additional derived filters for downstream operations. | Improves execution time for queries with projections above filtered joins by deriving more filters earlier. |
| July 2025 | Improved aggregation folding for queries with LIMIT clauses, reducing unnecessary table scans. | Improves performance for aggregation queries with LIMIT by skipping full table scans when possible. |
| July 2025 | Improved probe-side skew handling in hash joins, expanding coverage to more query patterns. | Improves execution time for queries that encounter probe-side data skew during join operations. |
| July 2025 | Faster execution through sharing plan fragments to avoid redundant computation in DML operations using common table expressions. | Improves execution time for DML operations (INSERT, UPDATE, MERGE) that reference the same data multiple times through CTEs. |
| July 2025 | Improved selectivity estimation for derived filter predicates, leading to better join ordering decisions. | Improves execution time for queries with derived predicates by providing more accurate cost estimates to the optimizer. |
| June 2025 | Expands coverage of the [Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md) to [Apache Iceberg™ tables](../user-guide/tables-iceberg.md). | QAS can now improve the performance of queries on Iceberg tables. |
| June 2025 | Runtime pruning for queries with geospatial predicates using bounding box filtering. | Improves performance for geospatial queries by pruning micro-partitions based on spatial intersection, reducing scan volume. |
| June 2025 | Faster execution when querying with prefix match patterns (LIKE clauses) by rewriting predicates for more effective pruning. | Improves execution time for queries using LIKE with prefix patterns by enabling micro-partition pruning that was previously not possible. |
| June 2025 | Improved bloom filter derivation for outer joins, enabling early filtering opportunities on the null-extended side that were previously not viable. | Improves execution time for queries where outer joins are a bottleneck, by enabling earlier data elimination. |
| June 2025 | More efficient cost-based search space exploration during query optimization, reducing the optimizer’s overhead for complex queries. | Improves both compilation and execution time for queries with many possible plan alternatives. |
| June 2025 | Faster execution by removing unnecessary grouping columns in DML operations based on functional dependency analysis. | Improves execution time for DML operations with GROUP BY clauses that contain redundant columns. |
| May 2025 | Search optimization update: [Support for Apache Iceberg™ tables](../user-guide/search-optimization/queries-that-benefit.md). | Improves the performance of queries on Iceberg tables. |
| May 2025 | Improved performance of dynamic table refreshes that contain top-level QUALIFY clauses with RANK or ROW_NUMBER ranking window functions, specifically when the rank value is 1. | Dynamic tables using `QUALIFY RANK() = 1` or `ROW_NUMBER = 1` now refresh more quickly, improving performance for common deduplication and top-N use cases. |
| May 2025 | Enhanced vectorized scanner availability for improved performance | Previously, [the vectorized scanner](../sql-reference/sql/copy-into-table.md) could only be used with specific `ON_ERROR` settings (`ABORT_STATEMENT` or `SKIP_FILE`). This restriction has been removed. Now, you can enable the vectorized scanner with any `ON_ERROR` option, including `CONTINUE`, `SKIP_FILE_num`, and `'SKIP_FILE_num%'`. This change allows the performance-enhancing vectorized scanner to be used in more situations. You may see faster data processing as a result. |
| May 2025 | More accurate cardinality estimation for window functions with ROW_NUMBER = 1 filtering patterns, leading to better join ordering decisions. | Improves execution time for queries using common deduplication patterns by providing better cardinality estimates to the optimizer. |
| May 2025 | Enhanced predicate derivation for queries on secure views, propagating filters more effectively. | Improves execution time for queries against secure views by applying filters earlier, reducing data processed downstream. |
| April 2025 | Expands coverage of the [Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md) to more queries. | Improves the heuristics that QAS uses to determine whether or not a query will benefit from acceleration. As a result, more queries are eligible for acceleration by QAS. |
| April 2025 | Improved cardinality estimation for foreign key join relationships. | Improves join ordering for queries involving foreign key joins by providing more accurate cardinality estimates. |
| April 2025 | Improved extraction pushdown through functions, enabling more efficient scan and metadata usage for subcolumns. | Improves execution time for queries that extract subcolumns through function expressions. |
| March 2025 | Improves the batching of files during replication refresh operations. | Replication refresh jobs that replicate up to 8 GB of data will have less variance and more predictability. |
| March 2025 | Improves performance for dynamic tables with incremental refresh mode using left outer joins. | Provides faster incremental refresh performance for dynamic tables that contain one or more left outer joins. Performance gains can be substantial depending on the workload. |
| March 2025 | Adaptively optimizes compute and I/O resources for queries executed against Apache Iceberg™ tables. | Improves Apache Iceberg™ query performance and memory efficiency in high-concurrency scenarios. |
| February 2025 | [Tasks](../user-guide/tasks-intro.md) can be scheduled to run as frequently as every 10 seconds. | Reduces the time required between scheduled task executions. |

---
title: 2025_01 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01_bundle.md
section: Release Notes
---

# 2025_01 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.2 release (January 22-February 13, 2025) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 9.7 release (March 17-27, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.12 release (May 5-12, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Changes to the PREVENT_UNLOAD_TO_INLINE_URL and PREVENT_UNLOAD_TO_INTERNAL_STAGES parameters](2025_01/bcr-1841.md) |  |
| [Expanded task view capabilities for MONITOR, OPERATE, MONITOR EXECUTION, and OWNER privileges](2025_01/bcr-1799.md) |  |
| [Fix future grant materialization precedence](2025_01/bcr-1870.md) |  |
| [Multi-factor authentication: New Duo interface](2025_01/bcr-1875.md) |  |
| **SQL Changes — General** | **Additional Notes** |
| [GENERATE_SYNTHETIC_DATA: Join key column type change in output](2025_01/bcr-1868.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [Changes to XML parsing and emitting behavior](2025_01/bcr-1862.md) |  |
| [Compute pools: Deprecated instance types cannot be resumed](2025_01/bcr-1824.md) |  |
| [DESCRIBE LISTING command: New columns in output](2025_01/bcr-1891.md) |  |
| [DROP ROLE command: Restriction on dropping the current primary role](2025_01/bcr-1843.md) |  |
| [SHOW APPLICATIONS command: New column in output](2025_01/bcr-1879.md) |  |
| [SHOW GRANTS TO USER and SHOW GRANTS commands: New columns in output](2025_01/bcr-1803.md) |  |
| [SHOW NOTIFICATION INTEGRATIONS and DESC NOTIFICATION INTEGRATION commands: Changes to output](2025_01/bcr-1846.md) |  |
| [SHOW OBJECTS command: New IS_ICEBERG column in output](2025_01/bcr-1842.md) |  |
| [SHOW/DESCRIBE APPLICATION PACKAGE command: New column in output](2025_01/bcr-1838.md) |  |
| [SYSTEM$GET_COMPUTE_POOL_STATUS function: Deprecated](2025_01/bcr-1830.md) |  |
| [SHOW/DESC AVAILABLE LISTING: New column in output](2025_01/bcr-1865.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [Changes to Apache Iceberg™ tables created from Delta files](2025_01/bcr-1852.md) |  |
| [DESC SERVICE and SHOW SERVICES command: New column(s) in output](2025_01/bcr-1849-1867.md) |  |
| [LOGIN_HISTORY view and functions: New column CLIENT_PRIVATE_LINK_ID](2025_01/bcr-1847.md) |  |
| [Replication Groups: New column in output of SHOW command and Information Schema view](2025_01/bcr-1273.md) |  |
| [SHOW SERVICE INSTANCES IN SERVICE command: New column in output](2025_01/bcr-1883.md) |  |
| [Snowpark Container Services: Changes to the compute pool maintenance window](2025_01/bcr-1856.md) |  |
| [TASK_VERSIONS view (ACCOUNT_USAGE): New columns](2025_01/bcr-1882.md) |  |
| **Data Pipelines** | **Additional Notes** |
| [Streams on views: Changes to column behavior when selecting from a stream](2025_01/bcr-1834.md) |  |
| **Data Lake** | **Additional Notes** |
| [Apache Iceberg™: New write paths for Snowflake-managed tables](2025_01/bcr-1873.md) |  |
| **Data Governance** | **Additional Notes** |
| [CLONE and CREATE … LIKE commands: Cloning and propagating DMF entity mappings](2025_01/bcr-1854.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [Deprecate previous syntax for working with SQL classes](2025_01/bcr-1829.md) |  |
| [PROCEDURES view (Account Usage and Information Schema): New columns](2025_01/bcr-1786.md) |  |
| [Providers must explicitly authorize event sharing when testing apps that include mandatory event definitions](2025_01/bcr-1851.md) |  |
| [Snowflake Scripting: Changes to global variables](2025_01/bcr-1850.md) |  |
| [Update the default Streamlit version for Snowflake Native Apps](2025_01/bcr-1857.md) |  |
| [Streamlit in Snowflake: Enable Git integration and multi-file editing for Streamlit in Snowflake apps](2025_01/bcr-1888.md) |  |
| **Web Interface** | **Additional Notes** |
| [Administrator-owned warehouse for Snowflake Notebooks](2025_01/bcr-1871.md) |  |
| [Default warehouse for Snowflake Notebooks](2025_01/bcr-1887.md) |  |
| [Personal databases and private notebooks (Private notebooks deprecated)](2025_01/bcr-1872.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2025_01 announcements released as preview. | 17-Jan-2025 |
| [Streamlit in Snowflake: Enable Git integration and multi-file editing for Streamlit in Snowflake apps](2025_01/bcr-1888.md) | **Added** to the 2025_01 BCR Bundle | 22-Jan-25 |
| Updated as complete | Bundle 2025_01 announcements released. | 13-Feb-2025 |
| DATABASES and SCHEMATA views: New column in output | **Removed** from *SQL Changes — Usage Views & Information Schema Views / Table Functions* | 05-Mar-2025 |
| Personal databases and private notebooks | Added notice that BCR-1872 remains disabled in BCR bundle 2025_02. | 03-Apr-2025 |

---
title: 2025_02 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02_bundle.md
section: Release Notes
---

# 2025_02 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.7 release (March 17-27, 2025) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 9.12 release (May 5-12, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.17 release (June 23-30, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.

|  |  |
| --- | --- |
| **Security Changes** | **Additional Notes** |
| [Access control: Privileges can be granted to users](2025_02/bcr-1924.md) |  |
| **SQL Changes — Commands & Functions** | **Additional Notes** |
| [DYNAMIC_TABLES function: New default for maximum number of rows returned](2025_02/bcr-1928.md) |  |
| [SHOW <objects> commands: Remove BUDGET column from output](2025_02/bcr-1913.md) |  |
| [SHOW GRANTS TO ROLE command: SNOWFLAKE database role grants to PUBLIC included in output](2025_02/bcr-1925.md) |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Additional Notes** |
| [DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): schemaId column name changed to schema_id](2025_02/bcr-1885.md) |  |
| [Snowflake Native Apps: changes to hash values](2025_02/bcr-1901.md) |  |
| **Data Governance** | **Additional Notes** |
| [Sensitive Data Classification: Preserving user-specified tag values](2025_02/bcr-1929.md) |  |
| **Developer / Extensibility Changes** | **Additional Notes** |
| [Snowpark Container Services: Changes to failed batch retry logic and new columns in the DESC FUNCTION command output](2025_02/bcr-1938.md) |  |
| **Web Interface** | **Additional Notes** |
| [Snowsight: Default all users, including VPS and Private Link, to Snowsight](2025_02/bcr-1930.md) |  |
| **New column in view or command** | **Additional Notes** |
| [DATABASES and SCHEMATA views: New columns and rows to include personal databases and replication information](2025_02/bcr-1869-1880.md) |  |
| [ALERT_HISTORY view and table function: New SCHEDULED_FROM column](2025_02/bcr-1894.md) |  |
| [APPLICATION_STATE view: New columns in output](2025_02/bcr-1876.md) |  |
| [BLOCK_STORAGE_HISTORY view (Account Usage): New columns ADDITIONAL_IOPS and ADDITIONAL_THROUGHPUT](2025_02/bcr-1921.md) |  |
| [DESCRIBE SECRET command: New columns in output](2025_02/bcr-1890.md) |  |
| [SHOW ICEBERG TABLES command: New column ICEBERG_TABLE_AUTO_REFRESH_STATUS in output](2025_02/bcr-1941.md) |  |
| [SHOW IMAGE REPOSITORIES command: New column in output and removal of image repositories from the SHOW STAGES command output](2025_02/bcr-1825.md) |  |
| [SHOW RELEASE DIRECTIVES command: New column in output](2025_02/bcr-1906.md) |  |
| [SHOW SERVICE INSTANCES IN SERVICE and SHOW SERVICE CONTAINERS IN SERVICE commands: New columns in output](2025_02/bcr-1915.md) |  |
| [SHOW VERSIONS IN APPLICATION command: New column in output](2025_02/bcr-1900.md) |  |
| [TAGS view (ACCOUNT_USAGE): New columns](2025_02/bcr-1937.md) |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2025_02 announcements released as preview. | 14-Mar-2025 |
| *DATABASES and SCHEMATA views: new column and rows to include personal databases and schemas (Preview)* | **Removed** from *Web Interface Changes* | 24-Mar-25 |
| *DATABASES and SCHEMATA views: new column and rows to include personal databases and schemas (Preview)* | **Added** to *New column in view or command* | 24-Mar-25 |
| BCR-1880 *DATABASES and SCHEMATA views: new column and rows to include personal databases and schemas (Preview)* and BCR-1869 *DATABASES and SCHEMATA views: New column in output (Preview)* were combined | **Added** to *New column in view or command* | 24-Mar-25 |
| *DATABASES and SCHEMATA views: new column and rows to include personal databases and schemas (Preview)*” | **Changed** to *DATABASES and SCHEMATA views: New columns and rows to include personal databases and replication information (Preview)*” | 24-Mar-25 |
| *BCR-1942: New maximum size limits for database objects* | **Moved** from 2025_02 bundle to 2025_03 bundle. | 12-May-2025 |
| Updated as complete | Bundle 2025_02 announcements released. | 27-Mar-25 |

---
title: 2025_03 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03_bundle.md
section: Release Notes
---

# 2025_03 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.12 release (May 5-12, 2025) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 9.17 release (June 23-30, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.22 release (August 4-8, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Rename the CREATE DATA EXCHANGE LISTING privilege to CREATE LISTING](2025_03/bcr-1926.md) | Low |  |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [New maximum size limits for database objects](2025_03/bcr-1942.md) | Medium |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [DML and CTAS commands: Potential for wrong results when the RELY property is set](2025_03/bcr-1902.md) | High |  |
| [DROP TABLE command: Changes to CASCADE behavior for hybrid tables](2025_03/bcr-1741.md) | High |  |
| [CREATE ORGANIZATION LISTING and ALTER LISTING organization_targets field cannot be empty](2025_03/bcr-1963.md) | Medium |  |
| [SHOW GIT REPOSITORIES and DESC GIT REPOSITORY commands: LAST_OPERATION_STATUS column removed from output](2025_03/bcr-1949.md) | Medium |  |
| [SHOW USERS and DESCRIBE USER commands: Changes to output](2025_03/bcr-1951.md) | Low |  |
| **Data Pipeline Changes** | **Impact Score** | **Additional Notes** |
| [Dynamic tables: New behavior for cloned dynamic tables](2025_03/bcr-1943.md) | High |  |
| **Data Lake Changes** | **Impact Score** | **Additional Notes** |
| [Apache Iceberg™ tables: ABFS write paths for Azure external volumes](2025_03/bcr-1935.md) | High |  |
| **Developer / Extensibility Changes** | **Impact Score** | **Additional Notes** |
| [Java and Python UDFs and stored procedures: Changes to handling of // when resolving file paths in file access APIs](2025_03/bcr-1810.md) | High |  |
| [Snowpark Container Services: Ingress and web app security updates for Azure](2025_03/bcr-1953.md) | High |  |
| [Python UDFs and stored procedures: Stop implicit auto-injection of the psutil package](2025_03/bcr-1948.md) | Medium |  |
| **New column in view or command** | **Impact Score** | **Additional Notes** |
| [DESCRIBE AVAILABLE LISTING and DESCRIBE LISTING commands: New column in output](2025_03/bcr-1962.md) | Low |  |
| [SHOW CHANNELS command: New columns in output](2025_03/bcr-1950.md) | Low |  |
| [SHOW SNAPSHOTS and DESCRIBE SNAPSHOT commands: New column in output](2025_03/bcr-1836.md) | Low |  |
| [SHOW WAREHOUSES command: New columns in output](2025_03/bcr-1889.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2025_03 announcements released as preview. | 25-Apr-2025 |
| Added impact scores | BCRs in Bundle 2025_03 have been rated according to the definitions in [Impact score](../behavior-change-policy.md). | 30-Apr-2025 |
| Updated as complete | Bundle 2025_03 announcements released. | 12-May-25 |
| *BCR-1942: New maximum size limits for database objects* | **Moved** from 2025_02 bundle to 2025_03 bundle. | 12-May-2025 |
| *BCR-1960: Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns* | **Removed** from 2025_03 bundle. For more information, see [Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns (Postponed)](un-bundled/bcr-1960.md). | 16-Jun-2025 |
| *BCR-1525* SHOW GIT REPOSITORIES and DESC GIT REPOSITORY commands: LAST_OPERATION_STATUS column removed from output (Pending) | **Removed** from 2025_03 bundle. | 27-Jun-2025 |
| *BCR-1944* SHOW FUNCTIONS and SHOW PROCEDURES commands: The complete data type for arguments is displayed in output (Pending) | **Removed** from 2025_03 bundle. | 07-Jul-2025 |

---
title: 2025_04 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04_bundle.md
section: Release Notes
---

# 2025_04 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.17 release (June 23-30, 2025) as **Disabled by Default**; however, account admins can enable for testing.
2. Status changed in the 9.22 release (August 4-8, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.27 release (September 8-10, 2025) to **Generally Enabled**; account admins can no longer disable or enable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [DESCRIBE SESSION POLICY command: convert output from row-oriented to column-oriented](2025_04/bcr-1985.md) | High |  |
| [MFA_AUTHENTICATION_METHODS in authentication policy now only includes PASSWORD by default](2025_04/bcr-1971.md) | High |  |
| [DESCRIBE SECRET command: key length for symmetric keys](2025_04/bcr-2000.md) | Medium |  |
| [Disable external OAuth session closure](2025_04/bcr-1991.md) | Medium |  |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [Deprecation of the SNOWFLAKE user](2025_04/bcr-1976.md) | Low |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [CREATE EXTERNAL TABLE command: Primary role requires stage access](2025_04/bcr-1993.md) | Medium |  |
| [SHOW AVAILABLE LISTINGS and DESCRIBE LISTING commands: Reformat the regions column in output](2025_04/bcr-1986.md) | Medium |  |
| [SHOW USERS and DESCRIBE USER commands: Changes to output](2025_04/bcr-1999.md) | Medium |  |
| **Data Lake Changes** | **Impact Score** | **Additional Notes** |
| [Apache Iceberg™ tables: Refreshing Delta-based tables fails if the UUID changes](2025_04/bcr-2006.md) | Medium |  |
| **Developer / Extensibility Changes** | **Impact Score** | **Additional Notes** |
| [Snowflake Native App Framework: New application packages enable release channels by default](2025_04/bcr-1977.md) | Medium |  |
| **New Column in View or Command** | **Impact Score** | **Additional Notes** |
| [DESCRIBE LISTING command: New column in output](2025_04/bcr-1979.md) | Low |  |
| [LOGIN_HISTORY view and table function (Account Usage / Information Schema): New columns in output](2025_04/bcr-1966.md) | Low |  |
| [MODEL MONITOR METRIC functions: New columns in output](2025_04/bcr-1982.md) | Low |  |
| [SHOW DYNAMIC TABLES command: New columns added to output](2025_04/bcr-2001.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2025_04 announcements released as preview. | 20-Jun-2025 |
| *Authentication policy commands: Deprecate MFA_AUTHENTICATION_METHODS property (Pending)* | **Added** to *Security* | 23-Jun-2025 |
| *Mandatory multi-factor authentication on Snowsight login* | Temporarily disabled | 18-Jul-2025 |
| *Authentication policy commands: Deprecate MFA_AUTHENTICATION_METHODS property* | Behavior change postponed | 21-Jul-2025 |
| *Mandatory multi-factor authentication on Snowsight login* | Re-enabled in bundle | 05-Aug-2025 |
| *MFA_AUTHENTICATION_METHODS in authentication policy now only includes PASSWORD by default* | **Added** to *Security* | 07-Aug-2025 |
| [Mandatory multi-factor authentication on Snowsight login (Replaced)](un-bundled/bcr-1972.md) | Behavior change postponed | 07-Aug-2025 |
| [USERS view (ACCOUNT_USAGE and ORGANIZATION_USAGE): New columns and changes to has_mfa column](2025_06/bcr-2102.md) | Behavior change postponed | 12-Aug-2025 |
| [USERS view (ACCOUNT_USAGE and ORGANIZATION_USAGE): New columns and changes to has_mfa column](2025_06/bcr-2102.md) | Behavior change moved to the 2025_06 bundle | 05-Sep-2025 |
| [Snowpark Python: Eliminate repeated subqueries in Snowpark-generated queries (Canceled)](un-bundled/bcr-1995.md) | Behavior change canceled | 28-August-2025 |

---
title: 2025_05 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05_bundle.md
section: Release Notes
---

# 2025_05 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.22 release (August 4-8, 2025) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 9.27 release (September 8-10, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 9.32 release (October 13-15, 2025) to **Generally Enabled**; account admins can no longer disable or enable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Key-pair authentication for Google Cloud accounts in the us-central1 region](2025_05/bcr-2055.md) | Medium |  |
| [User types: TYPE property is set to PERSON instead of NULL](2025_05/bcr-2067.md) | Medium |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [CREATE IMAGE REPOSITORY command: Change in the default encryption type](2025_05/bcr-2036.md) | Low |  |
| **Data Loading / Unloading Changes** | **Impact Score** | **Additional Notes** |
| [File formats and stages: Enforce dependency checks](2025_05/bcr-1989.md) | Medium |  |
| **Replication Changes** | **Impact Score** | **Additional Notes** |
| [Cortex Search Services: Replication of existing services](2025_05/bcr-2053.md) | Medium |  |
| [Replication views and functions: New refresh phase SECONDARY_COMMITTING](2025_05/bcr-2043.md) | Low |  |
| **Developer / Extensibility Changes** | **Impact Score** | **Additional Notes** |
| [Snowflake Cortex: Model deprecation](2025_05/bcr-1984.md) | Low |  |
| [Snowpark stored procedures: Enable billing for PUT calls to external stages](2025_05/bcr-2002.md) | Low |  |
| **New Column in View or Command** | **Impact Score** | **Additional Notes** |
| [GET_LINEAGE function: New column in output](2025_05/bcr-2059.md) | Low |  |
| [Pipe usage history and COPY history views: New column](2025_05/bcr-2045.md) | Low |  |
| [SHOW APPLICATIONS and SHOW APPLICATION PACKAGES commands: New column in output: TYPE](2025_05/bcr-2065.md) | Low |  |
| [SHOW TASKS and DESCRIBE TASKS commands: New column in output](2025_05/bcr-2051.md) | Low |  |
| [SHOW USERS and DESCRIBE USER commands: New column/property in output](2025_05/bcr-2066.md) | Low |  |
| [SYSTEM$ESTIMATE_QUERY_ACCELERATION function: New property in result value](2025_05/bcr-2044.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Released as preview | Bundle 2025_05 announcements released as preview. | 01-Aug-25 |
| Updated as complete | Bundle 2025_05 announcements released. | 08-Aug-25 |
| *Snowsight Templates learning environment* | **Removed** from *Web Interface Changes* | 12-Aug-25 |
| *Snowflake Notebooks: replication* | **Removed** from *Replication Changes* | 11-Sep-25 |

---
title: 2025_06 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06_bundle.md
section: Release Notes
---

# 2025_06 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.27 release (September 8-10, 2025) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 9.32 release (October 13-15, 2025) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 10.1 release (January 19-23, 2026) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Multi-factor authentication: MFA_AUTHENTICATION_METHODS parameter deprecation](2025_06/bcr-2086.md) | High |  |
| [Multi-factor authentication: MFA_ENROLLMENT parameter values change](2025_06/bcr-2097.md) | High |  |
| [JWT Subject Claim Validation](2025_06/bcr-2077.md) | Low |  |
| **Data Pipelines** | **Impact Score** | **Additional Notes** |
| [Streams: Changes to replication support in replication groups](2025_06/bcr-2079.md) | Medium |  |
| **Snowflake CLI, Connectors, Drivers, and SQL API** | **Impact Score** | **Additional Notes** |
| [Snowpark Container Services job service: Retention-time increase](2025_06/bcr-2093.md) | Medium |  |
| **Web Interface Changes** | **Impact Score** | **Additional Notes** |
| [Snowsight is the only interface available for all users](2025_06/bcr-2080.md) | Medium |  |
| **New Column in View or Command** | **Impact Score** | **Additional Notes** |
| [QUERY_HISTORY views and function: New columns in output](2025_06/bcr-1980-2050.md) | Medium |  |
| [USERS view (ACCOUNT_USAGE and ORGANIZATION_USAGE): New columns and changes to has_mfa column](2025_06/bcr-2102.md) | Medium |  |
| [METERING_HISTORY view (ACCOUNT_USAGE): Change in columns](2025_06/bcr-2073.md) | Medium |  |
| [APPLICATION_STATE view : New columns in output](2025_06/bcr-2071.md) | Low |  |
| [DESCRIBE ORGANIZATION PROFILE: New column, updated_on, in output](2025_06/bcr-2089.md) | Low |  |
| [ROLES view: New column is_from_organization_user_group](2025_06/bcr-2104.md) | Low |  |
| [SHOW ICEBERG TABLES command: New columns in output](2025_06/bcr-2076.md) | Low |  |
| [SHOW ROLES command: New column in output](2025_06/bcr-2095.md) | Low |  |
| [STORAGE_USAGE view (ACCOUNT_USAGE): New columns](2025_06/bcr-2098.md) | Low |  |
| [USERS view: New column is_from_organization_user](2025_06/bcr-2105.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Preview of upcoming bundle | Bundle 2025_06 preview announcements available | 05-Sep-25 |
| *Snowsight is the only interface available for all users* | **Added** to *Web Interface Changes* | 09-Sept-25 |
| Bundle updated as complete | Bundle 2025_06 announcements released | 10-Sep-25 |
| *OAuth authentication: Change in network policy behavior* | **Removed** from *Security Changes* | 15-Oct-25 |

---
title: 2025_07 Bundle
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07_bundle.md
section: Release Notes
---

# 2025_07 Bundle

## Bundle History

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 9.32 release (October 13-15, 2025) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 10.1 release (January 19-23, 2026) to **Enabled by Default**; account admins can disable for opt-out.
3. Status changed in the 10.7 release (March 2-5, 2026) to **Generally Enabled**; account admins can no longer enable or disable this bundle.

## List of Changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [Catalog-linked database: USAGE privilege on CATALOG INTEGRATION and EXTERNAL VOLUME required for database owner role for all operations](2025_07/bcr-2114.md) | High |  |
| [VARIANT data: Casting some of the fixed numeric values to floating-point results in different approximate values](2025_07/bcr-2106.md) | Medium |  |
| [SQL parameters: Disallow setting date and time output formats to AUTO](2025_07/bcr-2115.md) | Medium |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [Access control: Disallow GRANT REFERENCE_USAGE if GRANT USAGE isn’t set first](2025_07/bcr-2136.md) | Medium |  |
| [Python telemetry library automatically installed](2025_07/bcr-2120.md) | Medium |  |
| **Data Pipelines** | **Impact Score** | **Additional Notes** |
| [Streams: Updates to stream replication](2025_07/bcr-2112.md) | Low |  |
| **Data Governance** | **Impact Score** | **Additional Notes** |
| [Data quality: Default schedule for data metric functions](2025_07/bcr-2101.md) | Medium |  |
| **Web Interface Changes** | **Impact Score** | **Additional Notes** |
| [Snowflake Notebooks: replication](2025_07/bcr-2058.md) | Medium |  |
| **New Column in View or Command** | **Impact Score** | **Additional Notes** |
| [COLUMNS view (multiple schemas): New column](2025_07/bcr-2061.md) | Low |  |
| [DATABASE_STORAGE_USAGE_HISTORY_VIEW (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns](2025_07/bcr-2129.md) | Low |  |
| [LOGIN_HISTORY view and functions: New LOGIN_DETAILS column](2025_07/bcr-2052.md) | Low |  |
| [SHOW COMPUTE POOL and DESCRIBE COMPUTE POOL commands: New column in output](2025_07/bcr-2119.md) | Low |  |
| [SHOW WAREHOUSES command: New column in output](2025_07/bcr-2110.md) | Low |  |
| [TABLE_STORAGE_METRICS and TABLES views (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns](2025_07/bcr-2127.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Preview of upcoming bundle | Bundle 2025_07 preview announcements available | 10-Oct-25 |
| Bundle updated as complete | Bundle 2025_07 announcements released | 15-Oct-25 |
| *Python telemetry library automatically installed* | **Added** to *SQL Changes — Commands & Functions* | 16-Oct-25 |
| *DATABASE_STORAGE_USAGE_HISTORY_VIEW (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns* | **Added** to *New Column in View or Command* | 21-Oct-25 |
| *New error code for function-related errors* | **Removed** | 16-Dec-25 |
| *SQL general: New default column sizes for string and binary data types* | **Behavior change postponed** | 14-Jan-26 |
| *Virtual warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses* | **Behavior change postponed** | 6-Feb-26 |

---
title: 2026 Performance improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements-2026.md
section: Release Notes
---

# 2026 Performance improvements

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

The following performance improvements were introduced in 2026:

| Released | Description | Impact |
| --- | --- | --- |
| April 2026 | Improved runtime pruning for expressions with TIMESTAMP_TZ data types. Snowflake now prunes micro-partitions more effectively for timestamp-based filter predicates. | Improves performance for time-series queries with timestamp filters by skipping significantly more irrelevant micro-partitions, reducing I/O and execution time. |
| April 2026 | Continued improvements to skew handling in hash joins, further reducing processing bottlenecks from unevenly distributed data. | Improves execution time for join queries with data skew by dynamically adjusting redistribution based on warehouse configuration. |
| March 2026 | Enhanced parallel scanning for queries accelerated by the [Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md). | Improves execution time for QAS-accelerated queries by enabling more parallel I/O during scan operations. |
| March 2026 | Dynamically adjusts network message sizes based on the execution plan to optimize data transfer between processing nodes. | Improves execution time for interactive and latency-sensitive workloads by reducing network overhead. Particularly benefits short-running queries and high-concurrency scenarios. |
| March 2026 | Improved scanset construction to reduce lock contention during parallel query execution. | Improves execution time for scan-heavy queries, especially on larger warehouses with high concurrency. Reduces CPU overhead during parallel scan coordination. |
| March 2026 | Identifies opportunities to push aggregations earlier in the query plan when common table expressions (CTEs) are present. | Improves execution time for complex queries with CTEs by reducing the volume of data processed in later stages of the query plan. |
| March 2026 | Improved extraction pushdown through view columns, enabling more efficient scan and metadata usage for subcolumns accessed through views. | Improves execution time for queries that access subcolumns through views. |
| February 2026 | Performance improvements to the file pruner, reducing per-file pruning overhead for queries scanning many files. | Faster pruning decisions during compilation and execution, especially for queries that scan tables with many micro-partitions. |
| February 2026 | Improved range-based micro-partition pruning for more query patterns. | Reduces both compilation and execution time for queries with range predicates by skipping more irrelevant micro-partitions. |
| February 2026 | More efficient aggregation processing when data fits on a single server node, avoiding unnecessary distributed processing overhead. | Improves performance for aggregation queries where the data volume doesn’t require distributed computation. |
| January 2026 | Improved query performance with [Snowflake Optima Metadata](../user-guide/snowflake-optima.md), which continuously analyzes your workload patterns and creates metadata to optimize pruning of unused micro-partitions. Snowflake Optima is only available on [Snowflake generation 2 standard warehouses (Gen2)](../user-guide/warehouses-gen2.md). | Improves performance of queries by creating metadata for more efficient pruning. |
| January 2026 | Improved pruning for [join queries](../sql-reference/constructs/join.md) with inequality predicates. For example, the following join query uses the `>` operator in an inequality predicate:  ```sqlexample SELECT *   FROM employees e, managers m   WHERE e.employee_id = m.employee_id AND         e.salary > m.salary AND         m.level = 'M5'; ```  For this query, Snowflake prunes micro-partitions from the `employees` table where all salaries are below the lowest `M5` manager salary. | Improves the performance of join queries that have inequality predicates. |
| January 2026 | Faster JSON parsing for PARSE_JSON operations. | Improves execution time for queries that parse JSON data. Queries with heavy JSON processing may see significant speedups. |
| January 2026 | Improved compilation performance for queries with deeply nested CASE expressions by keeping them in a simplified form throughout the compilation process. | Reduces compilation time for queries with large CASE expressions, especially those with many branches. |

---
title: 2026_01 Bundle (Generally enabled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01_bundle.md
section: Release Notes
---

# 2026_01 Bundle (Generally enabled)

## Bundle history

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 10.1 release (January 19-23, 2026) as **Disabled by Default**; account admins can enable for testing.
2. Status changed in the 10.7 release (March 2-5, 2026) to **Enabled by Default**; account admins can disable for opt-out.
3. Status planned to change in April 2026 to **Generally Enabled**; however, this schedule is subject to change.

## List of changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Snowsight session timeout parameter: Change default value](2026_01/bcr-2139.md) | Medium |  |
| [DROP ROLE command: No longer requires the MANAGE GRANTS privilege](2026_01/bcr-2167.md) | Medium |  |
| [ENFORCE_SESSION_POLICY parameter: Always set = TRUE](2026_01/bcr-2164.md) | Medium |  |
| [OAuth: Proper normalization of explicit mixed case role names](2026_01/bcr-2192.md) | Medium |  |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [SQL general: Changes to error messages for subqueries](2026_01/bcr-2140.md) | Low |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [ALTER USER and DESCRIBE USER commands: LOGIN_NAME mapped to SCIM_USER_NAME](2026_01/bcr-2158.md) | Medium |  |
| [New error code for function-related errors](2026_01/bcr-2124.md) | Medium |  |
| [SHOW GRANTS: Changes to output for grants on functions and procedures](2026_01/bcr-2190.md) | Medium |  |
| [SHOW STREAMS command: Change in output for streams on directory tables](2026_01/bcr-2170.md) | Medium |  |
| [CREATE INTEGRATION commands: ENABLED parameter defaults to TRUE](2026_01/bcr-2166.md) | Low |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Impact Score** | **Additional Notes** |
| [ACCESS_HISTORY view (Account Usage and Organization Usage): Simpler format for values](2026_01/bcr-2179.md) | Medium |  |
| [ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified format for tag creation](2026_01/bcr-2180.md) | Medium |  |
| [ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified records for policy creation](2026_01/bcr-2181.md) | Medium |  |
| [ACCESS_HISTORY view (Account Usage and Organization Usage): User-defined values in the objects_modified_by_ddl column](2026_01/bcr-2178.md) | Medium |  |
| [BACKUP_OPERATION_HISTORY views: TIMESTAMP_LTZ always used for the start_time column](2026_01/bcr-2200.md) | Medium |  |
| **Replication Changes** | **Impact Score** | **Additional Notes** |
| [ALTER REPLICATION GROUP or ALTER FAILOVER GROUP with SUSPEND IMMEDIATE clause: Synchronously cancel active replication job](2026_01/bcr-2202.md) | Medium |  |
| **Developer / Extensibility Changes** | **Impact Score** | **Additional Notes** |
| [Snowpark Container Services job service: Retention time increase](2026_01/bcr-2206.md) | Medium |  |
| **Web Interface Changes** | **Impact Score** | **Additional Notes** |
| [Snowflake Support page: Access requirements changes](2026_01/bcr-2188.md) | High |  |
| **New Column in View or Command** | **Impact Score** | **Additional Notes** |
| [ACCESS_HISTORY view (Account Usage and Organization Usage): New columns](2026_01/bcr-2177.md) | Medium |  |
| [DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): New columns](2026_01/bcr-2189.md) | Low |  |
| [DYNAMIC_TABLE_REFRESH_HISTORY function: New columns in output](2026_01/bcr-2183.md) | Low |  |
| [DYNAMIC_TABLE_REFRESH_HISTORY view and SHOW DYNAMIC TABLES command: New columns in output](2026_01/bcr-2163.md) | Low |  |
| [PIPES views and commands: New column](2026_01/bcr-2137.md) | Low |  |
| [SESSIONS views: New columns and a behavior change for the closed_reason column](2026_01/bcr-2149.md) | Low |  |
| [SHOW AVAILABLE LISTINGS command: New columns in output](2026_01/bcr-2201.md) | Low |  |
| [SHOW DATABASES command: New column in output](2026_01/bcr-2199.md) | Low |  |
| [SHOW TABLES and SHOW WAREHOUSES commands: New columns in output](2026_01/bcr-2165.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Preview of upcoming bundle | Bundle 2026_01 preview announcements available | 16-Jan-26 |
| Bundle updated as complete | Bundle 2026_01 announcements released | 23-Jan-26 |

---
title: 2026_02 Bundle (Enabled by default)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02_bundle.md
section: Release Notes
---

# 2026_02 Bundle (Enabled by default)

## Bundle history

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Introduced in the 10.7 release (March 2-5, 2026) as **Disabled by Default**; account admins can enable for testing.
2. Status planned to change in April 2026 to **Enabled by Default**; however, this schedule is subject to change.
3. Status planned to change in May 2026 to **Generally Enabled**; however, this schedule is subject to change.

## List of changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Authentication policies: Changes to the NETWORK_POLICY_EVALUATION property no longer affect existing sessions](2026_02/bcr-2191.md) | Medium |  |
| [New CREATE HYBRID TABLE privilege](2026_02/bcr-2157.md) | Medium |  |
| [External OAuth security integrations: EXTERNAL_OAUTH_JWS_KEYS_URL parameter requires HTTPS](2026_02/bcr-2218.md) | Low |  |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [CREATE, ALTER, and CREATE OR ALTER WAREHOUSE commands: Behavior change with new columns in output](2026_02/bcr-2225.md) | Medium |  |
| [New LOG_EVENT_LEVEL parameter to control events](2026_02/bcr-2229.md) | Low |  |
| **SQL Changes — Commands & Functions** | **Impact Score** | **Additional Notes** |
| [Dynamic tables: New column in SHOW DYNAMIC TABLES and DDL fix](2026_02/bcr-2248.md) | Low |  |
| **SQL Changes — Usage Views & Information Schema Views / Table Functions** | **Impact Score** | **Additional Notes** |
| [CLIENT_APPLICATION_ID field in SESSIONS view: Trim trailing whitespace in return value](2026_02/bcr-2226.md) | Low |  |
| [DBT_PROJECT_EXECUTION_HISTORY function: New columns in output](2026_02/bcr-2233.md) | Low |  |
| **Data Pipelines** | **Impact Score** | **Additional Notes** |
| [Snowpipe: Disable pipe role drop prevention](2026_02/bcr-2216.md) | Low |  |
| **Data Lake** | **Impact Score** | **Additional Notes** |
| [Enforce exact length on inserts into Apache Iceberg™ fixed[L] columns](2026_02/bcr-2246.md) | Medium |  |
| [Restrict Apache Iceberg™ binary columns to maximum size](2026_02/bcr-2244.md) | Medium |  |
| [Full lifecycle management for converted Apache Iceberg™ tables](2026_02/bcr-2219.md) | Low |  |
| **Data Sharing** | **Impact Score** | **Additional Notes** |
| [Fast listing auto-fulfillment enabled by default](2026_02/bcr-2221.md) | Medium |  |
| **Data Governance** | **Impact Score** | **Additional Notes** |
| [Snowflake Cortex AI Functions Model RBAC Rollout](2026_02/bcr-2220.md) | High |  |
| **Developer / Extensibility** | **Impact Score** | **Additional Notes** |
| [Snowflake Cortex AI Function: Multirow error handling improvements](2026_02/bcr-2184.md) | Low |  |
| **New Column in View, Command, or Function** | **Impact Score** | **Additional Notes** |
| [APPLICATION_STATE view (Data Sharing Usage): New columns in output](2026_02/bcr-2231.md) | Low |  |
| [DESCRIBE and SHOW commands for Apache Iceberg™ tables: New columns in output](2026_02/bcr-2210.md) | Low |  |
| [DESCRIBE LISTING and DESCRIBE AVAILABLE LISTING commands: New column in output](2026_02/bcr-2258.md) | Low |  |
| [Dynamic tables: SHOW command and functions: new columns in output](2026_02/bcr-2208.md) | Low |  |
| [DYNAMIC_TABLE_REFRESH_HISTORY function (Information Schema): New columns in output](2026_02/bcr-2254.md) | Low |  |
| [LOGIN_HISTORY view (Account Usage, Organization Usage): New column in output](2026_02/bcr-2209.md) | Low |  |
| [SHOW ACCOUNT command: New column in output](2026_02/bcr-2174.md) | Low |  |
| [SHOW CONTACTS command and CONTACTS view (Account Usage): New column in output](2026_02/bcr-2222.md) | Low |  |
| [SHOW VERSIONS IN APPLICATION PACKAGE command: New column in output](2026_02/bcr-2232.md) | Low |  |
| [TABLES view and SHOW TABLES command: New columns in output](2026_02/bcr-2168.md) | Low |  |
| [Task views, functions, and commands: New columns in output](2026_02/bcr-2212.md) | Low |  |
| [TASK_HISTORY views: New column in output](2026_02/bcr-2182.md) | Low |  |
| [USERS view (Account Usage / Organization Usage): New column](2026_02/bcr-2217.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Preview of upcoming bundle | Bundle 2026_02 preview announcements available | 27-Feb-26 |
| Bundle updated as complete | Bundle 2026_02 announcements released | 05-Mar-26 |
| *Dynamic tables: New column in SHOW DYNAMIC TABLES and DDL fix* | **Added** to *SQL Changes — Commands & Functions* | 03-12-26 |

---
title: 2026_03 Bundle (Disabled by default)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03_bundle.md
section: Release Notes
---

# 2026_03 Bundle (Disabled by default)

## Bundle history

> **Attention:**
>
> To determine the current status of bundles in your account,
> see [SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md).
>
> For more information, see
> [Behavior change management](managing-behavior-change-releases.md).

1. Status planned to be introduced in the 10.12 release (April 2026) as **Disabled by Default**; account admins can enable for testing. However, this schedule is subject to change.
2. Status planned to change in May 2026 to **Enabled by Default**; however, this schedule is subject to change.
3. Status planned to change in June 2026 to **Generally Enabled**; however, this schedule is subject to change.

## List of changes

> **Important:**
>
> This change list has been compiled using reasonable efforts. We are not always able to determine the full customer impact of a
> behavior change beforehand. The change list may not include all changes in a release, for example, last minute or emergency changes.
> In addition, behavior changes that are determined to have minimal to no user impact may not be pre-announced.
>
> If you have any questions about the changes in this bundle, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).
>
> For differences between the in-advance and final versions of these notes, see Behavior Change Bundle change log.
>
> For information and definitions of impact scores, see [Impact score](../behavior-change-policy.md).

|  |  |  |
| --- | --- | --- |
| **Security Changes** | **Impact Score** | **Additional Notes** |
| [Enable ability to grant entities in Personal Databases to account roles (Pending)](2026_03/bcr-2290.md) | Low |  |
| [63-character limit for account identifiers (Pending)](2026_03/bcr-2215.md) | Low |  |
| **SQL Changes — General** | **Impact Score** | **Additional Notes** |
| [SQL Changes: Add new date and time format elements (Pending)](2026_03/bcr-2281.md) | Medium |  |
| [SQL Changes — General: Correctly set byteLength for VARCHAR string columns (Pending)](2026_03/bcr-2286.md) | Medium |  |
| [SHOW MANAGED ACCOUNTS command: New column tenant_type in output (Pending)](2026_03/bcr-2265.md) | Low |  |
| **Ecosystem Changes** | **Impact Score** | **Additional Notes** |
| [SCIM: List API returns paginated results; unsupported filters rejected (Pending)](2026_03/bcr-2276.md) | Medium |  |
| **Virtual Warehouse Changes** | **Impact Score** | **Additional Notes** |
| [Standard warehouses: Gen2 is the default generation (Pending)](2026_03/bcr-2250.md) | Medium |  |
| [Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Pending)](2026_03/bcr-2269.md) | Medium |  |
| **Data Lake** | **Impact Score** | **Additional Notes** |
| [Merge-on-read with positional delete files for Snowflake-managed Apache Iceberg™ v2 tables (Pending)](2026_03/bcr-2279.md) | Medium |  |
| [Data Lake: Apache Iceberg™ string column length in CREATE TABLE AS SELECT (Pending)](2026_03/bcr-2285.md) | Medium |  |
| **Native Apps** | **Impact Score** | **Additional Notes** |
| [Application package version drop: Error when consumers are still using the version (Pending)](2026_03/bcr-2273.md) | Medium |  |
| [Application package version drop: Immediate replication refresh for version drops (Pending)](2026_03/bcr-2274.md) | Medium |  |
| [SHOW VERSIONS command: New column in output (Pending)](2026_03/bcr-2283.md) | Low |  |
| **SQL Changes — Usage Views & Information Schema Views** | **Impact Score** | **Additional Notes** |
| [New ERROR_LOGGING column in TABLES views and commands (Pending)](2026_03/bcr-2185.md) | Low |  |
| [DATABASES view (Account Usage): New column DATA_QUALITY_MONITORING_SETTINGS (Pending)](2026_03/bcr-2266.md) | Low |  |
| [SHOW TAGS command and TAGS view (Account Usage): New columns in output (Pending)](2026_03/bcr-2291.md) | Low |  |

## Behavior Change Bundle change log

| Announcement | Update | Date |
| --- | --- | --- |
| Preview of upcoming bundle | Bundle 2026_03 preview announcements available | 30-Mar-26 |

---
title: 63-character limit for account identifiers (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2215.md
section: Release Notes
---

# 63-character limit for account identifiers (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

Before the change:
:   Snowflake doesn’t enforce a maximum length or trailing-underscore restriction on
    [Format 1](https://docs.snowflake.com/en/user-guide/admin-account-identifier#format-1-preferred-account-name-in-your-organization)
    of the account identifier (the account name prefixed by its organization name, for example `myorg-myaccount`). Account creation and
    renaming succeed regardless of identifier length.

After the change:
:   When the 2026_03 behavior change bundle is enabled in your account, Snowflake enforces the following restrictions on Format 1 of the
    account identifier:

    * The combined `<orgname>-<account_name>` identifier can’t exceed **63 characters**.
    * The account name can’t end with an **underscore** (`_`).

    If an `ALTER ACCOUNT ... RENAME TO` or `CREATE ACCOUNT` statement violates either restriction, the statement fails with one of the
    following error codes:

    * `ORG_ACCOUNT_NAME_EXCEEDS_DNS_LIMIT` if the identifier exceeds 63 characters.
    * `ACCOUNT_NAME_INVALID_FOR_DNS` if the account name ends with an underscore.

    These restrictions ensure that the account identifier complies with the limits for a DNS label defined in
    [RFC 1035](https://www.rfc-editor.org/rfc/rfc1035) and the LDH label defined in the CA/Browser Forum Baseline Requirements,
    preventing possible DNS lookup and account reachability failures.

    **Existing accounts** that don’t comply with these limits continue to function. However, Snowflake recommends renaming
    them to ensure smooth operation with current and future features. Once you rename an account to a compliant name, you
    can’t rename it back to a non-compliant name.

    **To find non-compliant accounts**, run the following queries:

    ```sqlexample
    SHOW ACCOUNTS;

    SELECT
      CURRENT_ORGANIZATION_NAME() || '-' || "account_name" AS identifier,
      LENGTH(identifier) AS len
    FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
    WHERE len > 63 OR ENDSWITH("account_name", '_');
    ```

    ```sqlexample
    SHOW MANAGED ACCOUNTS;

    SELECT
      CURRENT_ORGANIZATION_NAME() || '-' || "account_name" AS identifier,
      LENGTH(identifier) AS len
    FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()))
    WHERE len > 63 OR ENDSWITH("account_name", '_');
    ```

    For more information about account identifiers, see [Account identifiers](../../../user-guide/admin-account-identifier.md).
    For more information about renaming accounts, see [Renaming an account](../../../user-guide/organizations-manage-accounts-rename.md).

Ref: 2215

---
title: 9.0 Release notes: Jan 07, 2025-Jan 09, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_00.md
section: Release Notes
---

# 9.0 Release notes: Jan 07, 2025-Jan 09, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### External key store integration for Tri-Secret Secure (*General availability*)

With this release, Snowflake is pleased to announce the general availability of integration support for Tri-Secret Secure (TSS) with AWS external
key stores to securely store and manage a customer-managed key (CMK) outside AWS. Snowflake currently only tests and supports Thales HSM and
Thales CCKM data encryption products.

For more information about setting up and configuring TSS with Thales’ solutions, see
[How to use Thales External Key Store for Tri-Secret Secure on an AWS Snowflake account](https://community.snowflake.com/s/article/thales-xks-for-tss-aws#e3).

### Pinning private endpoints (*General availability*)

With this release, Snowflake is pleased to announce the general availability of pinning private endpoints. After configuring inbound AWS PrivateLink
or Azure Private Link, pinning your private endpoint ensures that only an authorized private endpoint is used to send traffic from the
customer network to an authorized Snowflake account. This helps you harden your security posture by reducing the network attack surface to
your Snowflake account.

For more information, see [Pinning private connectivity endpoints for inbound traffic](../../user-guide/pin-private-endpoints.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 07-Jan-25 |
| *EKS for TSS - GA announcement* | **Moved** to *Security updates* section | 09-Jan-25 |
| *Pinning private endpoints - GA announcement* | **Moved** to *Security updates* section | 09-Jan-25 |

---
title: 9.1 Release notes: Jan 13, 2025-Jan 16, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_01.md
section: Release Notes
---

# 9.1 Release notes: Jan 13, 2025-Jan 16, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Outbound private connectivity for Snowflake features

Outbound private connectivity lets you create private endpoints in Snowflake to access a cloud platform using the platform’s private
connectivity solution rather than the Internet. This lets you access cloud platform services privately and securely from Snowflake.

For general information about using outbound private connectivity, see [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md).

#### External stages using Azure Private Link (*General availability*)

You can configure an external stage and create a private endpoint so bulk loading from Azure storage occurs over Azure Private Link.

For more information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

#### External volumes using Azure Private Link (*General availability*)

You can configure an external volume and create a private endpoint so you can connect Snowflake to your external cloud storage for Apache Iceberg™
tables using Azure Private Link instead of the public Internet.

For more information, see [Private connectivity to external volumes for Microsoft Azure](../../user-guide/tables-iceberg-configure-external-volume-azure-private.md).

#### Snowpipe automation using Azure Private Link (*General availability*)

You can configure an external stage and notification integration, and create a private endpoint, so that automatic Snowpipe data loads that
are triggered by Microsoft Azure Event Grid use Azure Private Link instead of the public Internet.

For more information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

#### External stages using AWS PrivateLink (*General availability*)

You can configure an external stage and create a private endpoint so bulk loading from Amazon S3 storage occurs over AWS PrivateLink.

For more information, see [Private connectivity to external stages for Amazon Web Services](../../user-guide/data-load-aws-private.md).

#### External volumes using AWS PrivateLink (*General availability*)

You can configure an external volume and create a private endpoint so you can connect Snowflake to your external cloud storage for Apache Iceberg™
tables using AWS PrivateLink instead of the public Internet.

For more information, see [Private connectivity to external volumes for Amazon Web Services](../../user-guide/tables-iceberg-configure-external-volume-s3-private.md).

## SQL updates

### ARRAY_AGG function support for window frames (*General availability*)

With this release, we are pleased to announce that the [ARRAY_AGG](../../sql-reference/functions/array_agg.md) window function now supports row-based and
range-based window frames.

## Data pipelines updates

### CREATE DYNAMIC TABLE command: New REQUIRE USER parameter added

With this release, we are pleased to announce support for the REQUIRE USER parameter, which enables users to ensure that a dynamic table cannot
refresh unless a user is specified via COPY SESSION.

For more information, see [CREATE DYNAMIC TABLE](../../sql-reference/sql/create-dynamic-table.md).

### ALTER DYNAMIC TABLE command: New COPY SESSION parameter added

With this release, we are pleased to announce support for the COPY SESSION parameter, which enables you to run a refresh operation in a copy
of the current session, using the same user and warehouse.

For more information, see [ALTER DYNAMIC TABLE](../../sql-reference/sql/alter-dynamic-table.md).

## Data lake updates

### External stage and external volume support for Amazon S3 access points (*General availability*)

With this release, we are pleased to announce support for using an Amazon S3 access point to connect Snowflake to Amazon S3 using an external
stage or external volume.

For more information, see [CREATE STAGE](../../sql-reference/sql/create-stage.md) and [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md).

### Apache Iceberg™ tables: Automated refresh (*General availability*)

With this release, we are pleased to announce general availability support for automated refreshes of Apache Iceberg™ tables that use an external
catalog. With automated refreshes, Snowflake polls your external Iceberg catalog in a continuous and serverless fashion to synchronize the
metadata with the most recent remote changes.

For more information, see [Automatically refresh Apache Iceberg™ tables](../../user-guide/tables-iceberg-auto-refresh.md).

## Data governance updates

### Data metric functions: Support for referential integrity checks

With the release, we are pleased to announce that a user-defined data metric function (DMF) can accept multiple tables as arguments, which
simplifies referential integrity, matching and comparison, or conditional checking across different datasets.

For more information, including an example of using a DMF with multiple table arguments to perform referential integrity checks, see
[Create a custom DMF](../../user-guide/data-quality-custom-dmfs.md).

## Privacy updates

### Join policies (*Preview*)

With this release, we are pleased to announce the preview of [Join policies](../../user-guide/join-policies.md).

A join policy is a means of protecting tables and views. When a join policy is applied to a table, queries either require or do not require a
join. In addition, when joins are required, they can be restricted to certain joining columns.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 10-Jan-25 |

---
title: 9.10 Release notes: Apr 14, 2025-Apr 22, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_10.md
section: Release Notes
---

# 9.10 Release notes: Apr 14, 2025-Apr 22, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Extensibility updates

### Support for public custom Git repository URLs (*General availability*)

With this release, we are pleased to announce the general availability of support for custom URLs for connecting with a Git repository.
For example, you can specify a custom URL to a public corporate Git server within your own domain.

For more information, see the ORIGIN parameter of [CREATE GIT REPOSITORY](../../sql-reference/sql/create-git-repository.md).

## Data loading / unloading updates

### Automated refresh for internal named stages (*Preview*)

For more information, see [Automated directory table refreshes for internal stages](../../user-guide/data-load-dirtables-auto.md).

### Auto-ingest pipes for internal named stages (*Preview*)

## Data lake updates

### Apache Iceberg™ tables: Automated refresh table names now appear in the ACCOUNT_USAGE.PIPE_USAGE_HISTORY view

With this release, you can now see the table name in the ACCOUNT_USAGE.PIPE_USAGE_HISTORY view
for Iceberg tables that use automated refresh. The view displays information to help you estimate charges incurred when you use [automated refresh](../../user-guide/tables-iceberg-auto-refresh.md).

For more information, see [Automatically refresh Apache Iceberg™ tables](../../user-guide/tables-iceberg-auto-refresh.md) and [PIPE_USAGE_HISTORY view](../../sql-reference/account-usage/pipe_usage_history.md).

## Privacy updates

### Synthetic data generation (*General availability*)

With this release, synthetic data generation is now generally available. This feature enables you to generate synthetic data with
characteristics of the source data that can be used for testing or public release without risking exposure of the source data. For more
information, read [Using synthetic data in Snowflake](../../user-guide/synthetic-data.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 11-Apr-25 |
| Support for public custom Git repository URLs (General availability) | **Added** to *Extensibility updates* section | 14-Apr-25 |
| Synthetic data generation GA | **Added** to *Privacy updates* section | 15-Apr-25 |

---
title: 9.11 Release notes: Apr 28, 2025-May 02, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_11.md
section: Release Notes
---

# 9.11 Release notes: Apr 28, 2025-May 02, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Snowflake Native Apps session debugging (*General availability*)

Session debugging for Snowflake Native Apps is now generally available. Session debug mode allows providers to view and modify all of the
objects within the app and execute statements using the same privileges that the app has when installed in the consumer account.

See [Session debug mode](../../developer-guide/native-apps/installing-testing-application.md) for more information.

### Writing files from Snowpark Python UDFs and UDTFs (*General availability*)

Snowflake supports writing files from Snowpark Python UDFs and UDTFs.

For more information, see [Writing files from Snowpark Python UDFs and UDTFs](../../developer-guide/snowpark/python/creating-udfs.md).

## Extensibility updates

### Support for allowing requests to all outbound endpoints from functions and procedures (*General availability*)

With this release, we are pleased to announce the general availability of support for allowing requests to all outbound endpoints
with external access from a function or procedure. You can do this by specifying `0.0.0.0` as the domain in a value of the CREATE
NETWORK RULE command’s VALUE_LIST parameter.

For more information, see the VALUE_LIST parameter of [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md).

## Data lake updates

### Support for Iceberg tables in the People’s Republic of China (*General availability*)

With this release, we are pleased to announce the general availability of support for
Apache Iceberg™ tables in the People’s Republic of China.

For more information, see [Apache Iceberg™ tables](https://docs.snowflake.cn/en/user-guide/tables-iceberg).

## Decommissioned runtimes

### Python 3.8 decommissioned

Snowflake no longer supports Python 3.8. For more information on runtime deprecation and decommission schedules, see [Deprecating and decommissioning runtimes (end of support)](../../developer-guide/python-runtime-support-policy.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 18-Apr-25 |
| Writing files from Snowpark Python UDFs and UDTFs (General availability) | **Added** to *New features* section | 23-Apr-25 |
| Python 3.8 decommissioned | **Added** to *Decommissioned runtimes* section | 23-Apr-25 |
| Support for allowing requests to all outbound endpoints from functions and procedures (General availability) | **Added** to *Extensibility updates* section | 24-Apr-25 |
| Support for Iceberg tables in the People’s Republic of China (General availability) | **Added** to *Data lake updates* section | 01-May-25 |

---
title: 9.12 Release notes (with behavior changes): May 05, 2025-May 12, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_12.md
section: Release Notes
---

# 9.12 Release notes (with behavior changes): May 05, 2025-May 12, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_03](../bcr-bundles/2025_03_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_02](../bcr-bundles/2025_02_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_01](../bcr-bundles/2025_01_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for June 2025; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New features

### Release channels for Snowflake Native Apps (*General availability*)

With this release, the release channels feature in Snowflake Native Apps is generally available.

Release channels allow providers to publish apps at different stages of the app development lifecycle. For example, a provider can use release channels to perform the following tasks for a version or patch of an app:

* Test an app.
* Publish an app to consumers as a preview or for UAT (user acceptance testing).
* Publish the app to a production environment.

For more information, see [Publish an app using release channels](../../developer-guide/native-apps/release-channels.md).

## SQL Updates

### Improved error messages for Data Manipulation Language (DML) commands

In past releases, error messages for [DML commands](../../sql-reference/sql-dml.md) didn’t include the column name for errors that
involved a specific column. With this release, some error messages for DML commands include the column name. Note that the column
name isn’t included in all DML error messages.

For example, the following SQL statements return a DML error message:

```sqlexample
CREATE OR REPLACE TABLE demo_dml_error_message (v VARCHAR);

INSERT INTO demo_dml_error_message (v) VALUES
  (3),
  ('d');
```

In past releases, the following error message was returned:

```output
100038 (22018): Numeric value 'd' is not recognized
```

With this release, the following error message is returned:

```output
100038 (22018): DML operation to table DEMO_INSERT_TYPE_MISMATCH failed on
column V with error: Numeric value 'd' is not recognized
```

### New SQL functions

The following functions are now available with this release:

| Function subcategory | New function | Description |
| --- | --- | --- |
| Cardinality estimation | DATASKETCHES_HLL (Preview) | Returns an approximation of the distinct cardinality of the input (that is, `DATASKETCHES_HLL(col1)` returns an approximation of `COUNT(DISTINCT col1)`). |
| Cardinality estimation | DATASKETCHES_HLL_ACCUMULATE (Preview) | Returns the sketch at the end of aggregation. |
| Cardinality estimation | DATASKETCHES_HLL_COMBINE (Preview) | Combines (merges) input sketches into a single output sketch. |
| Cardinality estimation | DATASKETCHES_HLL_ESTIMATE (Preview) | Returns the cardinality estimate for the given sketch. |

## Extensibility updates

### Built-in code profiler for Python stored procedures (*General availability*)

With this release, we are pleased to announce the general availability of built-in code profiling for stored procedure handler
code written in Python. Using the profiler, you can discover how much time or memory was spent executing your handler code.
The profiler generates information describing how much time or memory was spent executing each line of the procedure handler.

For procedures written in SQL, see [Profiling Python procedure handler code](../../developer-guide/stored-procedure/python/procedure-python-profiler.md).

For procedures written with the Snowpark API, see [Profiling Snowpark Python stored procedure handlers](../../developer-guide/snowpark/python/profiling-procedure-handlers.md).

## Data loading / unloading updates

### Support for internal stage cloning (*General availability*)

With this release, we are pleased to announce the general availability of support for internal stage cloning when you clone a database or schema.

For more information, see [CREATE <object> … CLONE](../../sql-reference/sql/create-clone.md).

### Vectorized scanner now available without ON_ERROR restrictions

Previously, enabling the vectorized scanner required the `ON_ERROR` option to be set to either `ABORT_STATEMENT` or
`SKIP_FILE`. This limitation has been removed.

You can now leverage the performance benefits of the vectorized scanner regardless of the `ON_ERROR` setting you choose,
including `CONTINUE`, `SKIP_FILE_num`, and `'SKIP_FILE_num%'`. This provides greater flexibility in configuring
your data loading processes while still taking advantage of optimized scanning.

For more information, see [USE_VECTORIZED_SCANNER](../../sql-reference/sql/copy-into-table.md).

## Data governance updates

### Sensitive data classification: New classifiers for India

The following [sensitive data classifiers](../../user-guide/classify-intro.md) now support the protection of sensitive data in India:

* NATIONAL_IDENTIFIER (Permanent account number (PAN), Aadhaar, and Voter ID)
* DRIVERS_LICENSE
* TAX_IDENTIFIER (Goods and Service Tax Identification Number (GSTIN))

## Snowpark Container Services updates

### Using caller’s rights to connect to Snowflake (*General availability*)

With this release, we are pleased to announce the general availability of
[connecting to Snowflake from inside a container using caller’s rights](../../developer-guide/snowpark-container-services/spcs-execute-sql.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 25-Apr-25 |
| New authentication methods for multi-factor authentication (MFA) (General availability) | **Added** to Security section | 28-Apr-25 |
| Using caller’s rights to connect to Snowflake (General availability) | **Added** to Snowpark Container Services section | 30-Apr-25 |
| Snowflake Scripting output (OUT) arguments (General availability) | **Removed** from SQL Updates section | 30-Apr-25 |
| Vectorized scanner now available without ON_ERROR restrictions | **Added** to Data loading / unloading section | 08-May-25 |
| New maximum size limits for database objects (General availability) | **Removed** from SQL Updates section | 12-May-25 |
| New authentication methods for multi-factor authentication (MFA) (General availability) | **Removed** from Security section | 12-May-25 |

---
title: 9.13 Release notes: May 19, 2025-May 20, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_13.md
section: Release Notes
---

# 9.13 Release notes: May 19, 2025-May 20, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Outbound private connectivity for AWS Government regions

You can use AWS PrivateLink for outbound network traffic originating from AWS Government regions.

For information about using AWS PrivateLink for outbound network traffic, see [Manage private connectivity endpoints: AWS](../../user-guide/private-manage-endpoints-aws.md).

## SQL updates

### Pipe operator

With this release, you can use the new pipe operator (`->>`) to chain SQL statements together. In the chain of SQL statements, the results of
one statement can serve as the input to another statement. The pipe operator can simplify the execution of dependent SQL statements and
improve the readability and flexibility of complex SQL operations.

For more information, see [Flow operators](../../sql-reference/operators-flow.md).

## Data loading/unloading updates

### INFER_SCHEMA function: Support for Apache Iceberg™ data types

With this release, you can now automatically retrieve the column definitions for Apache Iceberg data types from a set of staged data files
that contain semi-structured data. To retrieve these column definitions, set the new `KIND` argument for the INFER_SCHEMA function to
`ICEBERG`.

With this new feature, you can use the CREATE ICEBERG TABLE … USING TEMPLATE variant syntax to create an Iceberg table with the column
definitions derived from INFER_SCHEMA using the `KIND=>'ICEBERG'` output.

To learn more, see [INFER_SCHEMA](../../sql-reference/functions/infer_schema.md).

## Data lake updates

### Cross-cloud/cross-region support for Snowflake-managed Apache Iceberg™ tables

With this release, Snowflake supports cross-cloud/cross-region writes (and reads) to Iceberg tables that use Snowflake as the catalog. In
the DATA_TRANSFER_HISTORY view, the cross-region data transfer charges for these tables appear as a TRANSFER_TYPE of DATA_LAKE.
For more information, see the following:

> * [Cross-cloud/cross-region support](../../user-guide/tables-iceberg.md)
> * [DATA_TRANSFER_HISTORY view](../../sql-reference/organization-usage/data_transfer_history.md) in the ORGANIZATION_USAGE schema
> * [DATA_TRANSFER_HISTORY view](../../sql-reference/account-usage/data_transfer_history.md) in the ACCOUNT_USAGE schema

In addition, you can now convert a cross-cloud/cross-region Iceberg table that uses an external catalog to use Snowflake as the catalog.
For more information, see [Convert an Apache Iceberg™ table to use Snowflake as the catalog](../../user-guide/tables-iceberg-conversion.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 09-May-25 |

---
title: 9.14 Release notes: May 23, 2025-May 28, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_14.md
section: Release Notes
---

# 9.14 Release notes: May 23, 2025-May 28, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Trust Center: In-app notifications (*Preview*)

With this release, you receive Trust Center notifications in Snowsight about accounts that haven’t enrolled in
multi-factor authentication (MFA). For information about enabling, viewing, and disabling Trust Center notifications for your account in
Snowsight, see [Enable notifications from Trust Center](../../user-guide/ui-snowsight-profile.md).

### Trust Center: New Abnormal Failure Rate Detection scanners

The Trust Center now has two new scanners in the Threat Intelligence scanner package. These two new scanners help you find users with a high
number of authentication failures or job errors, which can indicate attempted takeovers of an account, misconfigurations, exceeded quotas,
or permission issues.

For more information, see [Threat Intelligence scanner package](../../user-guide/trust-center/overview.md).

## Snowpark Python version updates

With this release, we are pleased to announce general availability of support for Python version 3.12.

## SQL updates

### Search optimization: Support for Apache Iceberg™ tables

With this release, search optimization can improve the performance of queries on Iceberg tables. To configure search optimization for an Iceberg table, use ALTER ICEBERG TABLE … ADD SEARCH OPTIMIZATION.

For more information, see [Support for Apache Iceberg™ tables](../../user-guide/search-optimization/queries-that-benefit.md) in the search optimization documentation.

### Query Acceleration Service: Support for Apache Iceberg™ tables

With this release, we are pleased to announce support for the Query Acceleration Service (QAS) for Iceberg tables.
The QAS accelerates scan performance and inserts on Iceberg tables. When you use a QAS-enabled warehouse to
query an Iceberg table, QAS accelerates the query if it’s eligible for acceleration.

As a result of this change, QAS might incur additional credits.

For more information, see [Using the Query Acceleration Service (QAS)](../../user-guide/query-acceleration-service.md).

### Data types: Structured types support for standard Snowflake tables (*Preview*)

With this release, you can define a structured type column in a standard Snowflake table.

> **Note:**
>
> Structured types are generally available for Apache Iceberg™ tables. Structured types aren’t supported for dynamic, hybrid, or external tables.

For more information, see [Structured data types](../../sql-reference/data-types-structured.md)

## Data pipeline updates

### Triggered tasks: Support for streams hosted on directory tables and data shares

With this release, you can now use Triggered Tasks to call stored procedures whenever there are changes in a stream hosted on a directory table or data share.

If the directory table is configured to auto-refresh, the triggered task runs automatically. Otherwise, in order to trigger the task, you must also refresh the directory table metadata manually, for example: ALTER STAGE my_stage REFRESH.

For more information, see [Triggered tasks](../../user-guide/tasks-triggered.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-May-25 |
| Trust Center: In-app notifications (Preview) | **Added** to New features section | 28-May-25 |
| Snowpark Python version updates | **Added** | 30-May-25 |
| Row-level deletes for externally managed Iceberg tables | **Removed** from Data lake updates section | 20-Aug-25 |
| Trust Center: New Abnormal Failure Rate Detection scanners | **Added** to New features section | 04-Sep-25 |

---
title: 9.15 Release notes: Jun 09, 2025-Jun 11, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_15.md
section: Release Notes
---

# 9.15 Release notes: Jun 09, 2025-Jun 11, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Artifact Repository (*General availability*)

With this release, we are pleased to announce general availability of support for Artifact Repository. Artifact Repository allows you to directly use Python packages from the Python Package Index ([PyPI](https://pypi.org)) within Snowpark Python user-defined functions (UDFs) and stored procedures.

For more information, see [Artifact Repository overview](../../developer-guide/udf/python/udf-python-packages.md).

## Security updates

### Malicious IP Protection

Malicious IP Protection service automatically protects all types of Snowflake accounts by blocking network access and login attempts that
originate from known, malicious IP addresses. It is enabled by default and does not require any configuration by administrators.

For more information, see [Malicious IP Protection](../../user-guide/malicious-ip-protection.md).

### Findings Lifecycle Management

Snowflake announces the Findings Lifecycle Management (FLM), a Trust Center feature that lets you manage, filter, and proactively respond to violations with auditable controls.

FLM lets you do the following tasks:

* Track and resolve security violations reported by scanners.
* Filter violations based on “Open” or “Resolved” status.
* Focus on active violations that require attention.
* Attach evidence or notes to violations for audit logs.

For more information, see [Manage the violation findings lifecycle](../../user-guide/trust-center/using-the-trust-center.md).

> **Note:**
>
> This feature is being rolled out gradually and will be available to all accounts within three weeks.

## SQL updates

### UNION BY NAME operator

This release introduces the UNION BY NAME operator. You can use this operator to combine rows by name instead of by position. When rows are
combined with UNION BY NAME, and a column exists in one input but not the other, it is filled with NULL values in the combined result set
for each row where it’s missing.

For more information, see [Set operators](../../sql-reference/operators-query.md).

## Data pipeline updates

### Support for streams on externally managed Apache Iceberg™ tables with row-level deletes

You can now create streams on externally managed Iceberg tables with row-level deletes.

For more information, see [Insert-only streams](../../user-guide/streams-intro.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 06-Jun-25 |
| Snowflake Scripting output (OUT) arguments (General availability) | **Removed** from SQL Updates section | 10-Jun-25 |
| Support for streams on externally managed Apache Iceberg™ tables with row-level deletes | **Added** to Data pipeline updates section | 11-Jun-25 |
| Artifact Repository (General availability) | **Added** to *New features* section | 12-Jun-25 |
| Findings Lifecycle Management (General availability) | **Added** to *Security updates* section | 20-Jun-25 |

---
title: 9.16 Release notes (no announcements): Jun 16, 2025-Jun 23, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_16.md
section: Release Notes
---

# 9.16 Release notes (no announcements): Jun 16, 2025-Jun 23, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

This release contains no significant features, updates, or enhancements to announce.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 17-Jun-25 |
| *Account Usage: New CREDENTIALS view* | **Removed** from *Security updates*; will be released separately from 9.16 | 23-Jun-25 |

---
title: 9.17 Release notes (with behavior changes): Jun 24, 2025-Jun 30, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_17.md
section: Release Notes
---

# 9.17 Release notes (with behavior changes): Jun 24, 2025-Jun 30, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_04](../bcr-bundles/2025_04_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_03](../bcr-bundles/2025_03_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_02](../bcr-bundles/2025_02_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for July 2025; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### New maximum size limits for database objects (*General availability*)

With this release, the new maximum allowed length for columns of type VARCHAR, VARIANT, ARRAY, and OBJECT is 128 MB, and the new maximum
allowed length for columns of type BINARY, GEOGRAPHY, and GEOMETRY is 64 MB.

To use this feature, the [2025_03 bundle](../bcr-bundles/2025_03_bundle.md) must be enabled. This bundle is enabled by default with this release.

For more information, see [Size limits for database objects](../../user-guide/data-load-considerations-prepare.md).

### Snowflake Scripting supports nested stored procedures (*General availability*)

With this release, you can define nested stored procedures in Snowflake Scripting anonymous blocks and stored procedures. A nested stored
procedure only exists in the scope of its block and can be called in from any section of its block (DECLARE, BEGIN … END, and EXCEPTION).

Nested stored procedures can enhance and simplify security, keep code more modular, and improve maintainability.

For more information, see [Using nested stored procedures](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

## Data pipeline updates

### Snowsight: Task Overview and Graph Run History updates (*General availability*)

With this release, you can see more in Snowsight task history:

* Account level overview of all task graphs based on task monitoring privileges
* See status and duration stats for recent runs
* Manually run, retry, edit or suspend tasks in one place
* Access task graph run details from the overview or the task-level

For more information, see [View tasks and task graphs in Snowsight](../../user-guide/ui-snowsight-tasks.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 20-Jun-25 |

---
title: 9.18 Release notes: Jul 02, 2025-Jul 08, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_18.md
section: Release Notes
---

# 9.18 Release notes: Jul 02, 2025-Jul 08, 2025

> **Attention:**
>
> Content in this page is available in advance of the completion of the 9.18 release, which is currently either pending or
> in progress.
>
> The release is scheduled to complete in early July (subject to change).
>
> Features, updates, or behavior changes described in this page might not become available in your account(s) until the completion of the
> release.
>
> For updates to these release notes, see Release notes change log.

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Snowflake Scripting output (OUT) arguments (*General availability*)

With this release, Snowflake Scripting supports output (OUT) arguments. When an output argument is specified in the definition of a Snowflake Scripting stored procedure, the stored procedure can return the current value of the output argument to a calling program, such as an anonymous block or a different stored procedure.

For more information, see [Using arguments passed to a stored procedure](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

## Data pipeline updates

### Dynamic tables: Support for externally managed Apache Iceberg™ tables (*General availability*)

You can now create dynamic tables that read from Iceberg tables that are managed by [external catalogs](../../user-guide/tables-iceberg.md). This is useful for processing data from external
data lakes, without duplicating or ingesting the data into Snowflake.

For more information, see [Create dynamic tables that read from Snowflake-managed or externally managed Apache Iceberg™ tables](../../user-guide/dynamic-tables-create.md)

## Data governance updates

### Data Quality: New system data metric function

A new system data metric function, ACCEPTED_VALUES, validates whether values in a column match a Boolean expression. When the ACCEPTED_VALUES function runs, it returns the number of records where the column value didn’t match the Boolean value, which can indicate a data quality issue.

For more information, see [ACCEPTED_VALUES](../../sql-reference/functions/dmf_accepted_values.md).
For general information about system data metric functions, see [Introduction to data quality and data metric functions](../../user-guide/data-quality-intro.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 30-Jun-25 |
| Dynamic tables: Support for externally managed Apache Iceberg™ tables | **Added** to Data pipeline updates section | 03-Jul-2025 |

---
title: 9.19 Release notes: Jul 14, 2025-Jul 17, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_19.md
section: Release Notes
---

# 9.19 Release notes: Jul 14, 2025-Jul 17, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Data types: Structured types support for standard Snowflake tables (*General availability*)

With this release, you can define a structured type column in a standard Snowflake table.

For more information, see [Structured data types](../../sql-reference/data-types-structured.md).

> **Note:**
>
> Structured types aren’t supported for dynamic, hybrid, or external tables.

## Data loading / unloading updates

### Optimize data ingestion with pre-clustering for Snowpipe Streaming - high-performance architecture (*Preview*)

With this release, we are pleased to announce a new feature for Snowpipe Streaming with high-performance architecture (Preview) that
enables pre-clustering of your data directly during ingestion. This helps to significantly improve query performance on your target
tables by ensuring data is sorted before it’s committed.

To use this feature, your target table must be configured with clustering keys defined. You can then enable this behavior within your
Snowpipe Streaming definition using the new CLUSTER_AT_INGEST_TIME option in your COPY INTO statement, as the following example shows.

```sqlexample
CREATE OR REPLACE PIPE TEST_PRECLUSTERED_PIPE
AS
    COPY INTO TEST_PRECLUSTERED_TABLE (num) FROM (
            SELECT $1:num::number as num FROM TABLE(
                DATA_SOURCE(
                    TYPE => 'STREAMING')
        ))
        CLUSTER_AT_INGEST_TIME=TRUE;
```

For more information, see [CLUSTER_AT_INGEST_TIME](../../sql-reference/sql/copy-into-table.md).

### COPY FILES (*General availability*)

Use the COPY FILES command to move files from a source location to an output stage.

For details, see [COPY FILES](../../sql-reference/sql/copy-files.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 09-Jul-25 |
| COPY FILES (General availability) | **Added** to Security section | 09-Jul-25 |

---
title: 9.2 Release notes (with behavior changes): Jan 22, 2025-Feb 13, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_02.md
section: Release Notes
---

# 9.2 Release notes (with behavior changes): Jan 22, 2025-Feb 13, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_01](../bcr-bundles/2025_01_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_08](../bcr-bundles/2024_08_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_07](../bcr-bundles/2024_07_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for March 2025; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## Non-bundled behavior changes

The following changes aren’t part of a bundle, and therefore can’t be disabled:

* [Secure objects: Redaction of information in error messages](../bcr-bundles/un-bundled/bcr-1858.md)

## New features

### Triggered tasks now can operate as Serverless Tasks (*General availability*)

Snowflake can now automatically manage the compute resources for triggered tasks to be completed within a target interval that you specify.
To convert an existing triggered task to a Serverless Task:

1. Suspend the task.
2. Remove the `WAREHOUSE` parameter and add the `TARGET_COMPLETION_INTERVAL` parameter.
3. Resume the task.

For more information, see [Triggered tasks](../../user-guide/tasks-triggered.md) and [Serverless tasks](../../user-guide/tasks-intro.md).

### Trust Center: Manage individual scanners

Trust Center scanners are scheduled background processes that check your account for security risks based on how you configured your account.
With this release, you can manage individual scanners in a scanner package. When a scanner package is enabled, you can manage the scanners in
the scanner package in the following ways:

* Enable or disable individual scanners
* Change the schedule of individual scanners
* Manually start individual scanners

For more information, see [Managing scanners](../../user-guide/trust-center/using-the-trust-center.md).

## Security updates

### Outbound private connectivity for Microsoft Azure Government regions

You can use Azure Private Link for outbound network traffic originating from Microsoft Azure Government regions. This allows you to harden
your security posture for Snowflake features like Snowpark Container Services and external volumes for Apache Iceberg™ tables.

For information about using Azure Private Link for outbound network traffic, see [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md).

## SQL updates

### New SQL functions

The following function is now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| System (Information) | [SYSTEM$SHOW_BUDGETS_FOR_RESOURCE](../../sql-reference/functions/system_show_budgets_for_resource.md) | Returns a string containing a list of the budgets that track a specified resource (for example, a table or a schema). |

### Additional CREATE OR ALTER commands (*Preview*)

With this release, we are pleased to announce the preview of additional CREATE OR ALTER commands. These commands combine the functionality of
the CREATE command and the ALTER command. A CREATE OR ALTER statement executes as a CREATE statement if the object doesn’t exist. If it does
exist, it transforms the object according to the object definition in the statement.

CREATE OR ALTER TABLE provides a declarative and idempotent approach to defining your Snowflake objects. When used together with the Git
integration, this enables an Infrastructure-as-Code (IaC) approach to database change management.

With this preview, the following additional objects are supported:

* [CREATE OR ALTER AUTHENTICATION POLICY](../../sql-reference/sql/create-authentication-policy.md): Creates an authentication policy if it doesn’t exist or alters an existing authentication
  policy.
* [CREATE OR ALTER FILE FORMAT](../../sql-reference/sql/create-file-format.md): Creates a new named file format if it doesn’t exist or alters an existing file format.
* [CREATE OR ALTER TAG](../../sql-reference/sql/create-tag.md): Creates a tag if it doesn’t exist or alters an existing tag.

For more information, see [CREATE OR ALTER <object>](../../sql-reference/sql/create-or-alter.md).

## Data lake updates

### Apache Iceberg™ tables: Support for writing Apache Iceberg metadata for Delta-based tables

With this release, we are pleased to announce support for writing metadata for Delta-based Iceberg tables to your external cloud storage.

For more information, see [Changes to Apache Iceberg™ tables created from Delta files](../bcr-bundles/2025_01/bcr-1852.md).

> **Note:**
>
> In order to write Iceberg metadata for Delta-based tables, you must [enable the 2025_01 bundle in your account](../bcr-bundles/managing-behavior-change-releases.md).
>
> To enable this bundle in your account, execute the following statement:
>
> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_01');
> ```

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 17-Jan-25 |
| Release notes | Added new feature: Triggered tasks can operate as Serverless Tasks | 29-Jan-25 |
| Release notes | Correction: Triggered tasks do not yet support Data Shares | 14-Feb-25 |
| Release notes | Added new feature: Additional CREATE OR ALTER commands (Preview) | 25-Feb-25 |

---
title: 9.20 Release notes: Jul 21, 2025-Jul 25, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_20.md
section: Release Notes
---

# 9.20 Release notes: Jul 21, 2025-Jul 25, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### CREATE INDEX command supports INCLUDE columns

With this release, you can use the CREATE INDEX command to create secondary indexes with INCLUDE columns. In previous releases, INCLUDE
columns were supported only for secondary indexes defined within CREATE HYBRID TABLE statements.

### Semantic views: Listing dimensions and metrics in a view, schema, database, or account

To list the dimensions and metrics in a semantic view, schema, database, or account, run the following commands:

* [SHOW SEMANTIC DIMENSIONS](../../sql-reference/sql/show-semantic-dimensions.md)
* [SHOW SEMANTIC METRICS](../../sql-reference/sql/show-semantic-metrics.md)

You can also list the dimensions that you can specify when querying for a specific metric. When you specify a dimension and metric in a
query, the base table for the dimension must be related to the base table for the metric. In addition, the base table for the dimension must
have an equal or lower level of granularity than the base table for the metric.

For example, the following example queries the `tpch_analysis` view and returns the `customer_order_count` metric and the
`order_date` dimension:

```sqlexample
SELECT * FROM SEMANTIC_VIEW (
  tpch_analysis
  DIMENSIONS orders.order_date
  METRICS customer.customer_order_count
);
```

This query fails because the `orders` table for the dimension has a higher level of granularity than the `customer` table for
the metric:

```output
010234 (42601): SQL compilation error:
Invalid dimension specified: The dimension entity 'ORDERS' must be related to and
have an equal or lower level of granularity compared to the base metric or dimension entity 'CUSTOMER'.
```

To list the dimensions that have base tables that are related to and are at an equal or lower level of granularity than the base table for a
metric, run the [SHOW SEMANTIC DIMENSIONS FOR METRIC](../../sql-reference/sql/show-semantic-dimensions-for-metric.md) command. For example:

```sqlexample
SHOW SEMANTIC DIMENSIONS IN tpch_analysis FOR METRIC customer_order_count;
```

### New query insights about join performance and optimization

The QUERY_INSIGHTS view now includes insights about the following conditions that might have affected query performance:

* A query or subquery has no WHERE clause, which means that the query scans an entire table and might return more rows than intended.
* A join that includes the output of at least one other join is returning many more rows than are in the tables being joined.
* A join of two data sets (for example, tables, views, or output from table function calls) is returning many more rows than are in the tables
  being joined.
* The performance of a query has been improved through search optimization.

Each insight includes a message that explains how query performance might have been affected and provides a general recommendation for next
steps.

For information, see [Using query insights to improve performance](../../user-guide/query-insights.md).

## Data pipeline updates

### Tasks: New EXECUTE AS USER option and IMPERSONATE privilege for user objects

With this release, organizations that assign Snowflake security privileges by user can allow users to run team tasks by using their existing user accounts.

As a best practice, we recommend that teams create a service user that represents a team, and assign required privileges to
that user. You can then use GRANT IMPERSONATE ON USER <user_name> TO ROLE <role_name> to grant users privileges to create or modify tasks
based on the team user account. Individual users can then run tasks on behalf of the team user to use their privileges with the new
parameters: CREATE TASK … EXECUTE AS USER <user_name> and ALTER TASK … EXECUTE AS USER <user_name>.

For more information, see [Run tasks with user privileges](../../user-guide/tasks-intro.md).

### Dynamic tables: Disallowed use of the COPY_SESSION attribute while manually refreshing dynamic tables on a serverless warehouse

Using COPY_SESSION with a dynamic table in a serverless context causes the refresh to inherit the serverless warehouse, leading to
unsupported and undefined behavior. This configuration now results in an error.

For more information, see [REFRESH [ COPY SESSION ]](../../sql-reference/sql/alter-dynamic-table.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jul 24, 2025 |
| Tasks: New EXECUTE AS USER option and IMPERSONATE privilege for user objects | Announcement removed temporarily until the supporting documentation is available. | Jul 28, 2025 |
| Tasks: New EXECUTE AS USER option and IMPERSONATE privilege for user objects | Announcement restored. | Jul 28, 2025 |

---
title: 9.21 Release notes: Jul 29, 2025-Aug 01, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_21.md
section: Release Notes
---

# 9.21 Release notes: Jul 29, 2025-Aug 01, 2025

> **Attention:**
>
> This release has been completed. For differences between the in-advance and final versions of these release notes, see the Release notes change log.

## Security updates

### GENERATE_SYNTHETIC_DATA: Consistency secret now optional in most cases

Previously, when you called [GENERATE_SYNTHETIC_DATA](../../sql-reference/stored-procedures/generate_synthetic_data.md) with a replace column property,
you needed to provide a SECRET for `consistency_secret`. With this change, `consistency_secret` is now optional. **However**,
if you run GENERATE_SYNTHETIC_DATA in an owner’s rights stored procedure, you still must provide a value to `consistency_secret`.

## SQL updates

### Account Usage: TABLE_QUERY_PRUNING_HISTORY and COLUMN_QUERY_PRUNING_HISTORY views (*General availability*)

You can monitor data access patterns at the table and column level by querying two new Account Usage views:

* [TABLE_QUERY_PRUNING_HISTORY](../../sql-reference/account-usage/table_query_pruning_history.md) provides a breakdown
  of query execution time and pruning by table, query-hash, and warehouse.
* [COLUMN_QUERY_PRUNING_HISTORY](../../sql-reference/account-usage/column_query_pruning_history.md) returns an equivalent
  pruning summary that is aggregated by column name.

### The SEARCH_IP function supports searching for IPv6 addresses

You can use the SEARCH_IP function to search for IPv6 addresses in data. Previously, the function only supported
searching for IPv4 addresses.

For more information, see [SEARCH_IP](../../sql-reference/functions/search_ip.md).

### Generating YAML for a semantic view and creating a semantic view from YAML

To generate the [YAML specification](../../user-guide/views-semantic/sql.md) for a semantic view, you can call the
SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW function. For example:

```sqlexample
SELECT SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW(
  'my_db.my_schema.tpch_rev_analysis'
);
```

You can also create a semantic view from a YAML specification by calling the SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML stored
procedure. For example:

```sqlexample-yaml
CALL SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML(
  'my_db.my_schema',
  $$
  name: TPCH_REV_ANALYSIS
  description: Semantic view for revenue analysis
  ...
  $$
);
```

For information, see:

* [SYSTEM$READ_YAML_FROM_SEMANTIC_VIEW](../../sql-reference/functions/system_read_yaml_from_semantic_view.md)
* [SYSTEM$CREATE_SEMANTIC_VIEW_FROM_YAML](../../sql-reference/stored-procedures/system_create_semantic_view_from_yaml.md)

## Data loading / unloading updates

### Simplified Snowpipe pricing

Starting August 1, 2025, we’re rolling out a new, simplified pricing model for Snowpipe for all Business Critical and VPS accounts.
Instead of a per-second/per-core compute charge and a per-1,000-files fee, you’ll now be charged a fixed credit amount per gigabyte (GB) of data ingested. This change provides more predictability for your data ingestion costs.

* Text files (e.g., CSV, JSON) are billed on their uncompressed size.
* Binary files (e.g., Parquet, Avro) are billed on their observed size.

This new model is being applied automatically to all Business Critical and VPS accounts. Enterprise and Standard editions will be updated in the future.

For more information, see [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) and [Snowpipe costs](../../user-guide/data-load-snowpipe-billing.md).

## Data pipeline updates

### Snowpark Connect for Spark and Snowpark Submit (*Preview*)

With Snowpark Connect for Spark, you can run Spark DataFrame, SQL, and UDF APIs directly on the Snowflake platform using the same Spark
code you use today. You can develop using client tools such as Snowflake Notebooks, Jupyter Notebooks, and others. With Snowpark Submit,
you can run Spark workloads in a non-interactive, asynchronous way directly on Snowflake’s infrastructure while you use familiar Spark
semantics.

Snowpark Connect for Spark and Snowpark Submit are in [Preview](../preview-features.md).

For more information, see [Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark](../../developer-guide/snowpark-connect/snowpark-connect-overview.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Jul 25, 2025 |
| Creating semantic views from YAML and reading YAML for semantic views | **Added** to SQL updates | Jul 29, 2025 |
| Tracing SQL statements run from handler code (General availability) | **Added** to Extensibility updates | Aug 01, 2025 |
| *Tracing SQL statements run from handler code (General availability)* | **Moved** to 9.22 release notes | Aug 06, 2025 |
| Simplified Snowpipe Pricing | **Added** to Data loading / unloading updates | Aug 12, 2025 |

---
title: 9.22 Release notes (with behavior changes): Aug 04, 2025-Aug 08, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_22.md
section: Release Notes
---

# 9.22 Release notes (with behavior changes): Aug 04, 2025-Aug 08, 2025

> **Attention:**
>
> This release has been completed. For differences between the in-advance and final versions of these release notes, see the Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_05](../bcr-bundles/2025_05_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_04](../bcr-bundles/2025_04_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_03](../bcr-bundles/2025_03_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for September 2025; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New features

### Data quality: Using expectations to define quality checks (*General availability*)

You can use an expectation to define the threshold for a data metric function (DMF). After you add the expectation to an association between
a DMF and an object, if a value returned by the DMF does not match the Boolean expression of the expectation, it is flagged as a violation
of the data quality check.

For more information, see [Use SQL to work with expectations](../../user-guide/data-quality-expectations.md).

## Extensibility updates

### Tracing SQL statements run from handler code (*General availability*)

When you have enabled tracing, Snowflake traces SQL statements executed in conjunction with other traced code,
such as within the handler for a stored procedure or user-defined function.

For more information, see [SQL statement tracing](../../developer-guide/logging-tracing/tracing.md).

## Data pipeline updates

### Dynamic tables: Support for immutability constraints

Immutability constraints give you finer control over dynamic table updates by allowing parts of the table to remain unchanged instead of
always reflecting the latest query results.

By marking specific regions as immutable, you can:

* Prevent propagation of updates or deletions to existing data.
* Restrict inserts, updates, and deletes for rows matching a condition.
* Limit future modifications while still allowing incremental updates to other parts of the table.

To define immutability constraints, use the `IMMUTABLE WHERE` parameter in the [CREATE DYNAMIC TABLE](../../sql-reference/sql/create-dynamic-table.md) or
[ALTER DYNAMIC TABLE](../../sql-reference/sql/alter-dynamic-table.md) command.

For more information, see [Understanding immutability constraints](../../user-guide/dynamic-tables-immutability-constraints.md).

### Dynamic tables: Support for backfill

You can now create a dynamic table with its initial data backfilled from a regular table. Backfilling is a zero-copy, low-cost operation that
makes the source data immediately available in the dynamic table.

You can backfill data into a dynamic table while still defining a custom refresh query for future updates. With immutability constraints,
backfilled data remains unchanged even if it no longer matches the upstream source, ensuring it persists over time.

For more information, see [Backfill examples](../../user-guide/dynamic-tables-performance-optimize-immutability.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Aug 01, 2025 |
| *Data quality: Using expectations to define quality checks (General availability)* | **Added** to *New features* | Aug 05, 2025 |
| *Tracing SQL statements run from handler code (General availability)* | **Added** to *Extensibility updates* (moved from 9.21 release notes) | Aug 06, 2025 |
| Security updates: Private Service Connect Endpoints for internal stages on Google Cloud (General availability) | **Removed** section and its announcement(s): | Aug 07, 2025 |

---
title: 9.23 Release notes: Aug 11, 2025-Aug 15, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_23.md
section: Release Notes
---

# 9.23 Release notes: Aug 11, 2025-Aug 15, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Snowflake Scripting user-defined functions (UDFs)

With this release, you can create SQL UDFs that contain [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) procedural language. Snowflake Scripting UDFs can be called in
a SQL statement, such as a SELECT statement or INSERT statement. Therefore, they are more flexible than a Snowflake Scripting stored
procedure, which can only be called in a SQL [CALL](../../sql-reference/sql/call.md) command.

For more information, see [Snowflake Scripting UDFs](../../developer-guide/udf/sql/udf-sql-procedural-functions.md).

### Private facts and metrics in semantic views

If you are defining a fact or metric only for use in calculations in the semantic view and you don’t want the fact or metric to
be returned in a query, you can specify the PRIVATE keyword to mark the fact or metric as private.

Facts and metrics that are marked as private cannot be queried or used in a query condition.

For information, see [Marking a fact or metric as private](../../user-guide/views-semantic/sql.md).

## Data loading / unloading updates

### Apache Arrow library upgrade to version 21.0.0

Snowflake 9.23 upgrades to Apache Arrow 21.0.0 for unloading Apache Parquet data.

If your data processing pipeline uses third-party tools to read Parquet files written by Snowflake, we recommend that you verify
compatibility with the updated Apache Arrow version.

## Data pipeline updates

### Dynamic tables: Support for UNION in incremental refresh mode

The UNION set operator is now supported with dynamic table incremental refresh, which works like the combination of the UNION ALL and SELECT
DISTINCT operators.

Other set operators are not currently supported. For a complete list, see [Supported queries in incremental and full refresh modes](../../user-guide/dynamic-tables-supported-queries.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Aug 08, 2025 |
| Dynamic tables: Support for UNION in incremental refresh mode | **Added** to Data pipeline updates | Aug 12, 2025 |
| Private facts and metrics in semantic views | **Added** to SQL updates | Aug 26, 2025 |

---
title: 9.24 Release notes: Aug 18, 2025-Aug 20, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_24.md
section: Release Notes
---

# 9.24 Release notes: Aug 18, 2025-Aug 20, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Self-service activation of Tri-Secret Secure (*General availability*)

Self-service activation of Tri-Secret Secure is now generally available and is no longer in [Preview](../preview-features.md).

Customers with the ACCOUNTADMIN role can activate Tri-Secret Secure for their Snowflake account using system functions without requiring
Snowflake support. The self-service procedure also supports deactivating Tri-Secret Secure and simplifies working with your customer managed
key (CMK).

For more information, see [Tri-Secret Secure self-service in Snowflake](../../user-guide/security-encryption-tss-self-serve.md).

## SQL updates

### ALTER LISTING command to simplify adding and removing targets (*General availability*)

You can now add or remove targets (accounts, roles, and organizations) from a listing without passing all of the existing targets. With this release, the [ALTER LISTING](../../sql-reference/sql/alter-listing.md) command allows you to provide a manifest section that contains only the targets you want to add or remove. This partial manifest reuses the familiar structures `targets`, `external_targets`, and `organization_targets`, which are already defined in the [listing manifest reference](../../progaccess/listing-manifest-reference.md).

## New features

### Snowflake Native App Framework - MONITOR privilege support for apps (*General availability*)

The Snowflake Native App Framework supports granting the MONITOR privilege on an app to a user role or to another app. To grant the MONITOR privilege, run the following commands:

```sqlexample
GRANT MONITOR ON APPLICATION <app_name> TO ROLE <app_user>;
GRANT MONITOR ON APPLICATION <app_name> TO APPLICATION <other_app_name>;
```

The MONITOR privilege allows the grantee to run the following commands:

```sqlexample
DESC APPLICATION <app_name>;
DESC SPECIFICATION SPEC IN APPLICATION <app_name>;
SHOW APPROVED SPECIFICATIONS IN APPLICATION <app_name>;
SHOW OBJECTS OWNED BY APPLICATION <app_name>;
SHOW REFERENCES IN APPLICATION <app_name>;
SHOW SPECIFICATIONS IN APPLICATION <app_name>;
```

The owner of the app has privileges to run these commands by default. The MONITOR privilege allows the app owner to delegate monitoring of
an app to a user role or another app.

For more information, see [Monitor an app](../../developer-guide/native-apps/ui-consumer-managing-applications.md).

## Data lake updates

### Set a target file size for Apache Iceberg™ tables (*Preview*)

You can now set a target Parquet file size for Iceberg tables. Doing so improves cross-engine query performance when you use an external
Iceberg engine such as Apache Spark, Delta, or Trino that’s optimized for larger file sizes. You can set the target file size when you
create a table, or update it later by using the [ALTER ICEBERG TABLE](../../sql-reference/sql/alter-iceberg-table.md) command.
For more information, see [Set a target file size](../../user-guide/tables-iceberg-manage.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Aug 15, 2025 |
| Snowflake Native App Framework: MONITOR privilege support for apps | **Added** to New features | Aug 21, 2025 |
| Set a target file size for Apache Iceberg™ tables (Preview) | **Added** to Data lake updates | Aug 21, 2025 |
| Self-service activation of Tri-Secret Secure (General availability) | **Added** to Security updates | Aug 21, 2025 |
| ALTER LISTING command to simplify adding and removing targets (General availability) | **Added** to SQL updates | Aug 21, 2025 |

---
title: 9.25 Release notes: Aug 25, 2025-Aug 28, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_25.md
section: Release Notes
---

# 9.25 Release notes: Aug 25, 2025-Aug 28, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Sensitive data classification: Automatic classification of a database (*General availability*)

You can now set a classification profile on a database so that all tables and views within the database are
automatically classified for sensitive data.

For more information, see [Set a classification profile on a database](../../user-guide/classify-auto.md).

## Security updates

### Support for keys generated with Elliptic Curve Digital Signature Algorithms (ECDSA)

For Snowflake authentication methods that use a cryptographic key (key-pair authentication and External OAuth), you can now generate keys using Elliptic Curve Digital Signature Algorithms (ECDSA) algorithms ES256(P-256), ES384 (P-384), and ES512 (P-512). These signatures use the SHA-256, SHA-384, and SHA-512 hash algorithms, respectively.

## SQL updates

### Querying semantic views (*General availability*)

The ability to query semantic views is now generally available and is no longer in
[Preview](../preview-features.md).

You can use a SELECT statement to query a semantic view by specifying the SEMANTIC_VIEW clause. In this clause, you specify the
dimensions and metrics that you want to retrieve. You can also filter the results based on dimensions.

For information, see [Querying semantic views](../../user-guide/views-semantic/querying.md).

### Semantic views: Listing facts in a view, schema, database, or account

To list the facts in a semantic view, schema, database, or account, run the SHOW SEMANTIC FACTS command.

For information, see [SHOW SEMANTIC FACTS](../../sql-reference/sql/show-semantic-facts.md).

### Semantic views: Support for renaming views

You can use ALTER SEMANTIC VIEW … RENAME TO … to rename a semantic view.

For information, see [ALTER SEMANTIC VIEW](../../sql-reference/sql/alter-semantic-view.md).

## Data lake updates

### Apache Iceberg™ tables: Row-level deletes for externally managed tables (*General availability*)

Snowflake supports row-level deletes with positional delete files for externally managed Iceberg tables. Iceberg engines can perform update, delete, and merge operations on these tables using both copy-on-write and merge-on-read modes. This expands interoperability between Snowflake and external tools that manage Iceberg table data, and ensures consistent behavior across different compute engines.

For more information, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

## Data governance updates

### Data quality: Updated privilege model allows non-owners to associate a data metric function with an object (*Preview*)

Users with the SELECT privilege on a table or view can associate it with a data metric function (DMF) to set up a data quality check. Previously, only the owner of the table or view could associate a DMF.

As part of this change, an association between a DMF and an object has a new property: EXECUTE AS ROLE. This property specifies which role the DMF runs with.

For more information, see [Required privilege on the table or view](../../user-guide/data-quality-access-control.md).

### Object tags: New limit for allowed values

The ALLOWED_VALUES property of a tag controls which values someone can associate with the tag when they set it on an object. This list of allowed values can now include 5,000 values. Previously, the limit was 300.

For more information, see [Set a list of allowed tag values](../../user-guide/object-tagging/work.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Aug 22, 2025 |
| Sensitive data classification: Automatic classification of a database (General availability) | **Added** to New features | Aug 25, 2025 |
| Querying semantic views (General availability) | **Added** to SQL updates | Aug 26, 2025 |
| Semantic views: Listing facts in a view, schema, database, or account | **Added** to SQL updates | Aug 26, 2025 |
| Semantic views: Support for renaming views | **Added** to SQL updates | Aug 26, 2025 |

---
title: 9.26 Release notes: Sep 01, 2025-Sep 04, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_26.md
section: Release Notes
---

# 9.26 Release notes: Sep 01, 2025-Sep 04, 2025

## SQL updates

### Filling gaps in time-series data (*Preview*)

You can use the following new features to fill gaps in time-series data sets:

* You can use the RESAMPLE clause within the FROM clause of a SELECT statement to “upsample” rows to a specific time interval and fill in missing rows.
* You can call one of the following [interpolation functions](../../sql-reference/functions/interpolate_bfill.md) to update columns for the generated rows that you resampled. You can also use these functions independently to gap-fill rows in an existing data set:

  + INTERPOLATE_BFILL
  + INTERPOLATE_FFILL
  + INTERPOLATE_LINEAR

For more information, see [Filling gaps in time-series data](../../user-guide/querying-time-series-data.md).

### Account Usage: New INGRESS_NETWORK_ACCESS_HISTORY view

This Account Usage view can be used to query any network access attempts to your Snowflake account within the last year.

For more information, see [INGRESS_NETWORK_ACCESS_HISTORY view](../../sql-reference/account-usage/ingress_network_access_history.md).

### Account Usage: New INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view

This Account Usage view can be used to query any network access attempts to your internal stage within the last year.

For more information, see [INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](../../sql-reference/account-usage/internal_stage_network_access_history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Aug 29, 2025 |
| Data governance updates | **Removed** section and its announcements:   * Sensitive data classification: Classifying views automatically * Sensitive data classification: Excluding objects from automatic classification (*Preview*) | Sep 04, 2025 |
| Sensitive data classification: Classifying views automatically | **Removed** from Data governance updates | Sep 04, 2025 |
| Sensitive data classification: Excluding objects from automatic classification (\*Preview\*) | **Removed** from Data governance updates | Sep 04, 2025 |

---
title: 9.27 Release Notes (with behavior changes): Sep 08, 2025-Sep 10, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_27.md
section: Release Notes
---

# 9.27 Release Notes (with behavior changes): Sep 08, 2025-Sep 10, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_06](../bcr-bundles/2025_06_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_05](../bcr-bundles/2025_05_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_04](../bcr-bundles/2025_04_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change again in the following behavior change release, planned for October 13-15, 2025; however, this schedule is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### Retrieve bind variable values (*Preview*)

With this release, you can retrieve the values of the bind variables for queries that have been executed by using
the BIND_VALUES table function in the INFORMATION_SCHEMA schema. Using this function, you can retrieve bind variable
values from any code that supports bind variables, including Javascript and Snowflake Scripting code.

You can also access these bind variable values from the `bind_values` column in the output for the QUERY_HISTORY
Account Usage view, the QUERY_HISTORY Organization Usage view, or the QUERY_HISTORY function in the INFORMATION_SCHEMA.

> **Note:**
>
> To use this feature, the [2025_06 bundle](../bcr-bundles/2025_06_bundle.md) must be enabled.

For more information, see [Retrieve bind variable values](../../sql-reference/bind-variables.md).

## Data pipeline updates

### Dynamic tables: Support for base tables with zero data retention

You can now create dynamic tables on base tables with zero data retention (`DATA_RETENTION_TIME_IN_DAYS = 0`). This feature doesn’t
apply to shared base tables.

For more information, see [Data retention period](../../user-guide/data-time-travel.md).

## Data lake updates

### New system function to replace the catalog integration for an externally managed Apache Iceberg™ table

Use the new SYSTEM$SET_CATALOG_INTEGRATION system function to replace the catalog integration associated with an externally managed
Apache Iceberg™ table.

You might use this function to access the latest Iceberg features for your tables, such as
[write support for externally managed Iceberg tables](../../user-guide/tables-iceberg-externally-managed-writes.md). You might also use this function to roll back to
the original Glue catalog integration, if needed.

You can also use this function to migrate your tables to a different [Iceberg REST catalog integration](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md).

For more information, see [SYSTEM$SET_CATALOG_INTEGRATION](../../sql-reference/functions/system_set_catalog_integration.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Sep 05, 2025 |
| New system function for replacing the catalog integration associated with an externally managed Apache Iceberg™ table | **Added** to Data lake updates | Sep 19, 2025 |
| Dynamic tables: Support for base tables with zero data retention | **Added** to Data pipeline updates | Oct 21, 2025 |

---
title: 9.28 Release Notes: Sep 15, 2025-Sep 17, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_28.md
section: Release Notes
---

# 9.28 Release Notes: Sep 15, 2025-Sep 17, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Query insights in Snowsight (*Preview*)

You can now view [query insights](../../user-guide/query-insights.md) in Snowsight. The
[Query Profile](../../user-guide/ui-snowsight-activity.md) tab under Query History now displays insights about conditions
that affect query performance. Each insight includes a message that explains how query performance might be affected and provides
a general recommendation for next steps.

Query insights in Snowsight is in [Preview](../preview-features.md).

For more information, see [Viewing the query insights in Snowsight](../../user-guide/query-insights.md).

## Data pipeline updates

### dbt Projects on Snowflake: Support for dbt retry (*Preview*)

dbt Projects on Snowflake now support the `dbt retry` command. For information about using dbt commands, see the [dbt Command reference](https://docs.getdbt.com/reference/dbt-commands).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Sep 12, 2025 |
| Query insights in Snowsight | **Added** to SQL updates | Sep 17, 2025 |
| dbt Projects on Snowflake: Support for dbt retry | **Added** to Data pipeline updates | Oct 17, 2025 |

---
title: 9.29 Release Notes: Sep 24, 2025-Sep 26, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_29.md
section: Release Notes
---

# 9.29 Release Notes: Sep 24, 2025-Sep 26, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Declarative Shared Native Apps (*Preview*)

Declarative Sharing allows providers to share and sell data products, enhanced by Snowflake Notebooks to help Snowflake consumers visualize and
explore the data.

Declarative Shared Native Apps is in Preview.

Declarative Sharing’s simplified development experience makes it easy to get started quickly.

Key features include:

* **Streamlined Development**: Providers can define shared objects, including notebooks, using a straightforward YAML file format, with
  automatic version control.
* **Live Notebook Development**: You can interactively develop notebooks, edit notebook content and share it, all from within Snowsight.
* **Controlled Data Visibility**: Application roles enable providers to categorize data, giving consumers easy control over data visibility.
* **Consumer-managed Resources**: The application runs in the consumer’s account, allowing them to manage resource usage and costs.
* **Secure Execution**: Declaratively shared applications operate within a tightly controlled environment, ensuring strict limitations on
  their actions and data access.

For more information, see [About Declarative Sharing in the Native Application Framework](../../developer-guide/declarative-sharing/about.md).

### Cortex Agent Monitoring (*Preview*)

Cortex Agent Monitoring gives you access to detailed logs and tracing for your agents, accessible through Snowsight. Your agent’s logs include details on LLM planning, tool execution, SQL generation and execution, and more.

For more information, see [Monitor Cortex Agent requests](../../user-guide/snowflake-cortex/cortex-agents-monitor.md).

## Data collaboration updates

### Cross-Cloud Auto-Fulfillment support for open table formats

You can now share open table formats, such as Apache Iceberg and Delta Lake, in listings across regions and clouds and enable auto-fulfillment on those listings.

For more information, see [Using auto-fulfillment with open table formats](../../collaboration/use-auto-fulfillment-with-open-table-formats.md).

## Data pipeline updates

### CREATE OR ALTER DYNAMIC TABLE (*Preview*)

The CREATE OR ALTER DYNAMIC TABLE command combines the functionality of the CREATE DYNAMIC TABLE command and the ALTER DYNAMIC TABLE command.
It executes as a CREATE statement if the dynamic table doesn’t exist. If it does exist, it transforms the dynamic table according to the
object definition in the statement.

For more information, see [CREATE OR ALTER <object>](../../sql-reference/sql/create-or-alter.md) and [CREATE OR ALTER DYNAMIC TABLE](../../sql-reference/sql/create-dynamic-table.md).

## Data governance updates

### Data quality: FRESHNESS data metric function improvement

You can now associate the FRESHNESS data metric function (DMF) with a table without specifying a column argument, which lets you determine
the last time a DML command acted on the table. Previously, you needed to associate the FRESHNESS with a timestamp column to determine the
last time the table was modified.

For more information, see the [FRESHNESS DMF](../../sql-reference/functions/dmf_freshness.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Sep 19, 2025 |
| Support for Scala version 2.13 (Preview) | **Removed** from Extensibility updates | Sep 22, 2025 |
| Cortex Agent Monitoring (Preview) | **Added** to New Features | Sep 24, 2025 |
| CREATE OR ALTER DYNAMIC TABLE (Preview) | **Added** to Data pipeline updates | Sep 25, 2025 |
| Cross-Cloud Auto-Fulfillment support for open table formats | **Added** to New Features | Sep 26, 2025 |

---
title: 9.3 Release notes: Feb 18, 2025-Feb 21, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_03.md
section: Release Notes
---

# 9.3 Release notes: Feb 18, 2025-Feb 21, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Tasks now support lower scheduling intervals (*General availability*)

Tasks now support lower scheduling executions as frequent as every 10 seconds. (The previous minimum scheduling interval was 1 minute.)

### Data lineage (*General availability*)

Data lineage automatically tracks the flow of data between Snowflake objects in real-time, for example, from a table to a view. You can track
the lineage of table-like objects, columns, and stages as well as the lineage between data objects and machine learning objects like datasets,
feature views, and models. You can use lineage information to assist in impact analysis, monitoring, troubleshooting, and compliance efforts.
Lineage can also help you propagate knowledge of sensitive data elements using tags.

> Lineage information is available through Snowsight, SQL, and Python. For more information, see:
> :   * [Data Lineage](../../user-guide/ui-snowsight-lineage.md)
>     * [GET_LINEAGE SQL function](../../sql-reference/functions/get_lineage-snowflake-core.md)
>     * [Snowpark ML lineage API](../../developer-guide/snowflake-ml/ml-lineage.md)

## SQL updates

### SEARCH function: Support for conjunctive semantics

With this release, the [SEARCH](../../sql-reference/functions/search.md) function supports conjunctive (AND) semantics. Before this release, the function only supported disjunctive (OR)
semantics. To specify the semantics for a full-text search using the function, set the new SEARCH_MODE argument to ‘AND’ or ‘OR’.

When you specify ‘AND’ for the SEARCH_MODE argument, there is a match if the tokens extracted from at least one of the columns or fields being
searched match all of the tokens extracted from the search string. The matching tokens must all be in one column or field; they can’t be
spread across multiple columns or fields.

## Extensibility updates

### Support for a wildcard character in network rule network identifiers (*General availability*)

With this release, support for using an asterisk as a wildcard character when specifying a network identifier for a network rule in its
VALUE_LIST parameter is generally available.

For more information, see [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md).

### Support for telemetry metrics and custom spans, with visualizations in Snowsight (*General availability*)

With this release, support for collecting metrics data, implementing custom spans, and viewing log, trace, and metric data in Snowsight is
generally available. Metrics data provides signs of resource consumption by using CPU and memory metrics. With custom spans, you can get
finer-grained tracing within the handler for a procedure or function. Snowsight provides visualizations for collected data with which you can
analyze and optimize your code. This telemetry can be found under the Monitoring tab from the [Snowsight](../../user-guide/ui-snowsight-gs.md) home page.

For more information, see [Adding custom spans to a trace](../../developer-guide/logging-tracing/tracing-custom-spans.md), [Collecting metrics data](../../developer-guide/logging-tracing/metrics.md), [Viewing log messages](../../developer-guide/logging-tracing/logging-accessing-messages.md), and [Viewing trace data](../../developer-guide/logging-tracing/tracing-accessing-events.md).

For an introduction to observability in Snowflake, see [Observability in Snowflake apps](../../developer-guide/builders/observability.md).

## Data pipeline updates

### Dynamic tables: Support for UNION ALL

Dynamic tables now support the following use cases for [UNION ALL](../../sql-reference/operators-query.md) for both full and incremental refresh:

> * UNION ALL of a table and itself or a clone of itself.
> * UNION ALL between a GROUP BY or DISTINCT and another GROUP BY or DISTINCT.

## Data lake updates

### Cloning support for Snowflake-managed Apache Iceberg™ tables (*General availability*)

With this release, support for cloning Snowflake-managed Iceberg tables is available.

For more information, see [CREATE <object> … CLONE](../../sql-reference/sql/create-clone.md) and [Cloning and Apache Iceberg™ tables](../../user-guide/object-clone.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 17-Feb-25 |
| Dynamic tables: Support for multiple window functions with different PARTITION BY clauses | Removed support | 24-Feb-25 |

---
title: 9.30 Release Notes: Sep 29, 2025-Oct 01, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_30.md
section: Release Notes
---

# 9.30 Release Notes: Sep 29, 2025-Oct 01, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Hybrid table support for Tri-Secret Secure

Tri-Secret Secure (TSS) is now supported for hybrid tables. Enabling TSS support for hybrid tables requires a storage configuration known as
Dedicated Storage Mode.

For more information, see [Dedicated Storage Mode for TSS](../../user-guide/tables-hybrid-dedicated-storage-mode.md).

## SQL updates

### Update to the 2025b release of the TZDB

Snowflake uses the Time Zone Database (TZDB) for timezone information (for example, for the list of timezone names and aliases
for the [CONVERT_TIMEZONE](../../sql-reference/functions/convert_timezone.md) function).

With this release, Snowflake now uses the 2025b release of the TZDB. Snowflake previously used the 2024a release of the TZDB.

For a list of the changes made up to the 2025b release of the TZDB, see
[News for the tz database](https://data.iana.org/time-zones/tzdb/NEWS).

### MERGE ALL BY NAME

When the target table and source must have the same number of columns and the same names for all of the columns, you can simplify MERGE
operations by using MERGE ALL BY NAME.

MERGE statements can update each column in the target table with the values of the column with the same name from the source table. MERGE
statements can also insert rows from the source table into the target table based on column names when there is no match. These MERGE ALL
BY NAME operations are supported even when the column order is different in the target and source tables.

For more information, see [MERGE](../../sql-reference/sql/merge.md).

### Aliases for PIVOT and UNPIVOT columns

In PIVOT queries, you can use the AS clause to specify aliases for the pivot column names.

In UNPIVOT queries, you can use the AS clause to specify aliases for column names that appear in the result of the UNPIVOT operation.

For more information, see [PIVOT](../../sql-reference/constructs/pivot.md) and [UNPIVOT](../../sql-reference/constructs/unpivot.md).

### New SQL parameter: ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS

The new ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS parameter specifies whether the output returned by the [GET_DDL](../../sql-reference/functions/get_ddl.md) function contains data type synonyms
specified in the original DDL statement. This parameter is set to FALSE by default.

For more information, see [ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS](../../sql-reference/parameters.md).

### Reference table columns in lambda expressions when calling higher-order functions

You can now reference table columns in lambda expressions when calling higher-order functions such as [FILTER](../../sql-reference/functions/filter.md), [REDUCE](../../sql-reference/functions/reduce.md), and [TRANSFORM](../../sql-reference/functions/transform.md).

For example, you can specify the following lambda expression in a higher-order function that subtracts the value of table1.col2 from elements:

```sqlexample
a -> a - table1.col2
```

For more information, see [Use lambda functions on data with Snowflake higher-order functions](../../user-guide/querying-semistructured.md).

### SEARCH function supports PHRASE and EXACT search modes

The [SEARCH](../../sql-reference/functions/search.md) function now supports two new search modes in addition to the existing `OR` and `AND` modes:

* `PHRASE`: The search semantics find a match if the tokens extracted from at least one of the columns or fields being searched match all of the tokens extracted from the search string, including the order and adjacency of the tokens.
* `EXACT`: The search semantics are the same as ‘PHRASE’ search semantics, except that the delimiter strings between the tokens must match exactly.

These new search modes provide more flexibility than the existing disjunctive `OR` and conjunctive `AND` search semantics.

For more information, see [SEARCH](../../sql-reference/functions/search.md).

### Snowflake Scripting CONTINUE handlers

A CONTINUE handler can catch and handle exceptions without ending the Snowflake Scripting statement block that raised the exception. With the
default EXIT handler, when an error occurs in a block, the flow is interrupted and the error is returned to the caller. You can use a CONTINUE
handler when the error condition isn’t severe enough to warrant interrupting the flow.

For more information, see [Handling exceptions](../../developer-guide/snowflake-scripting/exceptions.md) and [EXCEPTION (Snowflake Scripting)](../../sql-reference/snowflake-scripting/exception.md).

### Snowflake Scripting user-defined functions (UDFs) (*General availability*)

Snowflake Scripting UDFs are now generally available and are no longer in [preview](../preview-features.md).

You can create SQL UDFs that contain [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) procedural language. Snowflake Scripting UDFs can be called in a SQL statement, such
as a SELECT or INSERT statement. They are more flexible than a Snowflake Scripting stored procedure, which can only be called in a SQL CALL
command.

### Semantic views: Support for dimensions that use a Cortex Search Service

In a semantic view, you can now define a dimension that uses a
[Cortex Search Service](../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md). To do this, set the
WITH CORTEX SEARCH SERVICE clause to the name of the Cortex Search Service.

For information, see [Defining a dimension that uses a Cortex Search Service](../../user-guide/views-semantic/sql.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Sep 26, 2025 |
| Hybrid table support for Tri-Secret Secure | **Added** to Security updates | Sep 30, 2025 |
| Update to the 2025b release of the TZDB | **Added** to SQL updates | Sep 30, 2025 |
| Support for Scala version 2.13 (Preview) | **Removed** from Extensibility updates | Oct 01, 2025 |
| Semantic views: Support for dimensions that use a Cortex Search Service | **Added** to SQL updates | Oct 17, 2025 |

---
title: 9.31 Release Notes: Oct 06, 2025-Oct 08, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_31.md
section: Release Notes
---

# 9.31 Release Notes: Oct 06, 2025-Oct 08, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Tri-Secret Secure supports private connectivity

Snowflake system functions now support privately connecting Tri-Secret Secure with your key management service.
You can now create a private endpoint for your customer-managed key (CMK).

For the complete self-service procedures, see [Tri-Secret Secure self-service with private connectivity in Snowflake](../../user-guide/security-encryption-tss-self-serve-private.md).

## Data lake updates

### Query data compaction jobs for Apache Iceberg™ tables

You can use the new ICEBERG_STORAGE_OPTIMIZATION_HISTORY view to query data compaction jobs for Apache Iceberg™ tables within the last
year. This view includes a CREDITS_USED column, which you can use to monitor the cost of data compaction. We will start billing for
data compaction of data files for Snowflake-managed Iceberg tables on October 20th, 2025.

For more information, see [ICEBERG_STORAGE_OPTIMIZATION_HISTORY view](../../sql-reference/account-usage/iceberg_storage_optimization_history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Oct 03, 2025 |
| Tri-Secret Secure supports private connectivity stated “Activate Tri-Secret Secure with Private Connectivity” | **Changed** to “Tri-Secret Secure supports private connectivity” | Oct 07, 2025 |

---
title: 9.32 Release Notes (with behavior changes): Oct 13, 2025-Oct 15, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_32.md
section: Release Notes
---

# 9.32 Release Notes (with behavior changes): Oct 13, 2025-Oct 15, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_07](../bcr-bundles/2025_07_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_06](../bcr-bundles/2025_06_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2025_05](../bcr-bundles/2025_05_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change again in the following behavior change release, planned for January 19-23, 2026; however, this
schedule is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## Data lake updates

### Catalog-linked databases: Auto-refresh for Apache Iceberg™ table creation

We now leverage auto-refresh for Iceberg table creation in catalog-linked databases to improve metadata consistency. Note that this
change will cause table creation to contribute to your auto-refresh service billing.

### Table optimization for Snowflake-managed Apache Iceberg™ tables (*General availability*)

Table optimization for Snowflake-managed Apache Iceberg™ tables is now generally available. This update includes enabling billing for the data
compaction. We will start billing for these features on October 20, 2025.

For more information, see [Table optimization for Snowflake-managed Iceberg tables](../../user-guide/tables-iceberg-manage.md).

## Replication updates

### Snowflake Notebooks replication (*General availability*)

With this release, Snowflake introduces replication for Snowflake Notebooks. Notebooks will now be replicated when they are part of a database
included in a replication or failover group. When a secondary failover group is promoted to primary, all contained objects, including notebooks,
become writable in the new primary account.

> **Note:**
>
> Notebooks are not replicated unless you have enabled the [2025_07 behavior change bundle](../bcr-bundles/2025_07_bundle.md).

For more information, see [Notebook replication](../../user-guide/ui-snowsight/notebooks-replication.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Oct 10, 2025 |
| Catalog-linked databases: Auto-refresh for Apache Iceberg™ table creation | **Added** to Data lake updates | Oct 13, 2025 |
| Snowflake Notebooks replication (General availability) | **Added** to Replication updates | Oct 15, 2025 |

---
title: 9.33 Release Notes: Oct 21, 2025-Oct 23, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_33.md
section: Release Notes
---

# 9.33 Release Notes: Oct 21, 2025-Oct 23, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### AWS cross-region support for PrivateLink (*General availability*)

Snowflake now supports using PrivateLink to privately connect a VPC endpoint in one AWS region to your Snowflake account in another supported
AWS region. You can also connect your Snowflake account to hosted VPC endpoint services across different AWS regions.

For more information, see [AWS PrivateLink and Snowflake](../../user-guide/admin-security-privatelink.md).

### Outbound network traffic to stages and volumes on Google Cloud Storage supports private connectivity (*General availability*)

You can now route outbound traffic to the Google Cloud Storage service through Google Cloud Private Service Connect.

For more information, see [Private connectivity to external stages for Google Cloud](../../user-guide/data-load-gcs-private.md)
and [Private connectivity to external volumes for Google Cloud](../../user-guide/tables-iceberg-configure-external-volume-gcs-private.md).

### Snowflake-managed network rules (*General availability*)

Snowflake provides the SNOWFLAKE.NETWORK_SECURITY schema that contains a suite of Snowflake-managed (built-in) network rules. These network
rules provide a secure, consistent, fast, and low-maintenance way to manage network security for popular SaaS and partner applications.

For more information, see [Snowflake-managed network rules](../../user-guide/network-rules.md).

## SQL updates

### Semantic views: Support for ASOF JOIN

In a semantic view, you can now use an ASOF JOIN to join two logical tables on a date, time, timestamp, or numeric range, where
the values in one column must be in the same range as the values in the other column. When you define the relationship between the
two tables, specify the ASOF keyword with the referenced column name. For example:

```sqlexample
RELATIONSHIPS(
  my_relationship AS
    logical_table_1(
      col_table_1
    )
    REFERENCES
    logical_table_2(
      ASOF col_table_2
    )
)
```

For information, see [Using a date, time, timestamp, or numeric range to join logical tables](../../user-guide/views-semantic/sql.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Oct 17, 2025 |
| Semantic views: Support for ASOF JOIN | **Added** to SQL updates | Oct 22, 2025 |
| AWS cross-region support for PrivateLink | **Added** to Security updates | Oct 24, 2025 |
| Outbound network traffic to stages and volumes on Google Cloud Storage supports private connectivity | **Added** to Security updates | Oct 24, 2025 |
| Snowflake-managed network rules | **Added** to Security updates | Oct 24, 2025 |

---
title: 9.34 Release Notes (no announcements): Oct 27, 2025-Oct 29, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_34.md
section: Release Notes
---

# 9.34 Release Notes (no announcements): Oct 27, 2025-Oct 29, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

This release contains no significant features, updates, or enhancements to announce.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Oct 24, 2025 |

---
title: 9.35 Release Notes: Nov 03, 2025-Nov 07, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_35.md
section: Release Notes
---

# 9.35 Release Notes: Nov 03, 2025-Nov 07, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Interval data types (*Preview*)

Interval data types store values that represent a duration of time. You can calculate an interval as the difference between two dates or
times. An interval only defines a duration, so it doesn’t have a start or end point in time. For example, you might define an interval as
three years and seven months.

This release adds support for several new interval data types, including INTERVAL YEAR TO MONTH and INTERVAL DAY TO SECOND.

For more information, see [Interval data types](../../sql-reference/data-types-datetime.md).

## Data lake updates

### Replicate Snowflake-managed Apache Iceberg™ tables (*Preview*)

You can now replicate Snowflake-managed Apache Iceberg™ tables from a source account to one or more target accounts in the same
organization. This replication is integrated seamlessly with Snowflake replication and failover groups to provide point-in-time consistency
for the objects on the target account.

For more information, see [Configure replication for Snowflake-managed Apache Iceberg™ tables](../../user-guide/tables-iceberg-replication.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Oct 31, 2025 |

---
title: 9.36 Release Notes: Nov 10, 2025-Nov 16, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_36.md
section: Release Notes
---

# 9.36 Release Notes: Nov 10, 2025-Nov 16, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Enhanced SQL functionality

This update enhances your ability to modify properties on both functions and stored procedures:

| Function category | Function | Description |
| --- | --- | --- |
| Function | [CREATE OR ALTER FUNCTION](../../sql-reference/sql/create-function.md) | Updated to support changing function definition. For example, RUNTIME_VERSION, ARTIFACT_REPOSITORY (Python), PACKAGES, IMPORTS, return type, and function body. |
| Procedure | [CREATE OR ALTER PROCEDURE](../../sql-reference/sql/create-procedure.md) | Updated to support changing procedure definition. For example, RUNTIME_VERSION, IMPORTS, PACKAGES, return type, procedure body, and ARTIFACT_REPOSITORY for Python stored procedures. |

## Extensibility updates

### Support for OAuth when authenticating with GitHub (*General availability*)

You can authenticate using OAuth when you’re integrating a repository on [GitHub](https://github.com/about) with Snowflake.

For more information, see [Configure for authenticating with OAuth](../../developer-guide/git/git-setting-up.md).

### Run Apache Spark™ workloads on Snowflake (*General availability*)

You can connect your existing Spark workloads directly to Snowflake and run them on the Snowflake compute engine. As a result, you can run your PySpark dataframe code with all the benefits of the Snowflake engine.

For more information, see [Apache Spark™ workloads on Snowflake with Snowpark Connect](../../developer-guide/snowpark-connect/snowpark-connect-overview.md).

### Support for connecting Scala applications to Snowpark Connect for Spark (*Preview*)

You can now connect your Scala applications to the Snowpark Connect for Spark server. After you configure a connection to authenticate with Snowflake and start the Snowpark Connect for Spark server, you can run Scala code to connect to Snowpark Connect for Spark.

For more information, see [Getting Started with Snowpark Connect for Scala Applications](../../developer-guide/snowpark-connect/snowpark-connect-workloads-jupyter.md).

## Data governance updates

### Anomaly detection for Data Quality Monitoring (*Preview*)

Set up anomaly detection for data quality monitoring so that Snowflake automatically detects unexpected changes in the following dimensions:

* Volume of data in a table.
* Frequency with which a table is being updated.

For more information, see [Detecting anomalies in data quality](../../user-guide/data-quality-anomaly.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Nov 07, 2025 |

---
title: 9.37 Release Notes: Nov 17, 2025-Nov 20, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_37.md
section: Release Notes
---

# 9.37 Release Notes: Nov 17, 2025-Nov 20, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Preparation for renaming Snapshots feature to Backups

The billing line item for the WORM snapshots feature, which is currently in preview, changes
from `Snapshot` to `Backup`.

The METERING_HISTORY Account Usage view now contains data for the BACKUP service type. The entity
type is BACKUP SET. Previously, the service type was SNAPSHOT and the entity type was SNAPSHOT SET.

For the METERING_DAILY_HISTORY Account Usage and Organization Usage views, the value of the
SERVICE_TYPE column changes from SNAPSHOT to BACKUP.

> **Note:**
>
> This billing change is in advance of a broad set of syntax changes that will happen in coming
> weeks, before this feature becomes generally available. Syntax that mentions SNAPSHOT or
> SNAPSHOTS will change to BACKUP and BACKUPS. For example, the CREATE SNAPSHOT SET and CREATE
> SNAPSHOT POLICY commands will change to CREATE BACKUP SET and CREATE BACKUP POLICY. The changes
> will apply to all references to WORM snapshots, such as in view names. Syntax and naming related
> to block storage volume snapshots aren’t affected.

For more information about the WORM snapshots feature, see [Backups for disaster recovery and immutable storage](../../user-guide/backups.md).

### New DECFLOAT data type

This release adds support for the decimal float (DECFLOAT) data type. The DECFLOAT data type stores numbers
exactly, with up to 38 significant digits of precision, and uses a dynamic base-10 exponent to represent very large
or small values. In contrast to the FLOAT data type, which represents values as approximations, the DECFLOAT data
type represents exact values in the specified precision.

You can use the DECFLOAT data type when you need exact decimal results and a wide, variable scale in the same column.

For more information, see [DECFLOAT](../../sql-reference/data-types-numeric.md).

## Documentation and learning resources

### New topic that provides an overview of Snowflake authentication methods

A new topic introduces the authentication methods that users and applications can use to access Snowflake. It also lists key considerations
and recommendations to help you select the best authentication method for your use case.

See [Overview of Snowflake authentication](../../user-guide/security-authentication-overview.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Nov 14, 2025 |
| Overview of Snowflake authentication methods | **Added** to Documentation and learning resources | Nov 17, 2025 |
| Security updates | **Removed** section and its announcement(s):   * Google Private Service Connect endpoints for internal stages (General availability) | Nov 19, 2025 |
| New DECFLOAT data type | **Added** to SQL updates | Nov 19, 2025 |
| Preparation for renaming Snapshots feature to Backups | **Added** to SQL updates | Nov 20, 2025 |

---
title: 9.38 Release Notes: Dec 03, 2025-Dec 05, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_38.md
section: Release Notes
---

# 9.38 Release Notes: Dec 03, 2025-Dec 05, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Query insights: Support for queries that benefit from the query acceleration service

[Query insights](../../user-guide/query-insights.md) are now produced for queries that are accelerated by the
[query acceleration service](../../user-guide/query-acceleration-service.md).

> **Note:**
>
> Snowflake does not produce the [“filter not selective” insight](../../user-guide/query-insights.md) for these queries.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Nov 21, 2025 |
| Query insights: Support for queries that benefit from the query acceleration service | **Added** to SQL updates | Jan 14, 2026 |

---
title: 9.39 Release Notes: Dec 08, 2025-Dec 12, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_39.md
section: Release Notes
---

# 9.39 Release Notes: Dec 08, 2025-Dec 12, 2025

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Trust Center: Detection findings and event-driven scanners (*Preview*)

You can now use Trust Center to view a new type of findings — detections that scanners find in your account. This preview also adds a new
type of scanners — event driven, which constantly monitor your account for specific events, to the existing type of schedule-based
scanners.

For more information, see [Detections](../../user-guide/trust-center/overview.md) and [Event-driven scanners](../../user-guide/trust-center/overview.md).

### Programmatic access tokens: Removing the single-role restriction for service users

For service users (users with TYPE=SERVICE or TYPE=LEGACY_SERVICE), you can now generate a
[programmatic access token](../../user-guide/programmatic-access-tokens.md) that is not restricted to a single role.

To bypass this restriction, create or alter an authentication policy that sets the REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS
property to FALSE in the PAT_POLICY clause. For example:

```sqlexample
CREATE AUTHENTICATION POLICY my_authentication_policy
  PAT_POLICY = (
    REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
  );
```

```sqlexample
ALTER AUTHENTICATION POLICY my_authentication_policy
  SET PAT_POLICY = (
    REQUIRE_ROLE_RESTRICTION_FOR_SERVICE_USERS = FALSE
  );
```

After creating or altering the authentication policy, apply the policy to a service user.

> **Note:**
>
> The restriction is lifted only when you use the [ALTER USER … ADD PROGRAMMATIC ACCESS TOKEN (PAT)](../../sql-reference/sql/alter-user-add-programmatic-access-token.md) command to
> generate the programmatic access token.
>
> Currently, the restriction is not lifted if you are using Snowsight to generate the programmatic access token, but
> support will be added in the future.

For information, see [Removing the role restriction for service users](../../user-guide/programmatic-access-tokens.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Dec 05, 2025 |
| Programmatic access tokens: Removing the single-role restriction for service users | **Added** to Security updates | Dec 10, 2025 |

---
title: 9.4 Release notes: Feb 24, 2025-Mar 01, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_04.md
section: Release Notes
---

# 9.4 Release notes: Feb 24, 2025-Mar 01, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Additional information returned for objects bound to references (*General availability*)

Snowflake Native App providers can now use the following to fetch the object name, schema name, and database name of an object bound to a reference:

* The [SYSTEM$GET_ALL_REFERENCES](../../sql-reference/functions/system_get_all_references.md) system function.
* The [snowflake.permissions.get_detailed_reference_associations](../../developer-guide/native-apps/requesting-permission-sdk-ref.md) method of the Python Permission SDK.

### More granular control for log, trace, and metric levels in an app (*General availability*)

Within a Snowflake Native App, you can now override the log, trace, and metric levels for specific objects within an app, including:

* database schemas
* versioned schemas
* stored procedures
* functions

This allows for precise monitoring and analysis of these objects and gives providers more granular control over telemetry data collection.
App-level log, trace, and metric levels are used as the default and are applied only when specific object or schema overrides are
not defined. You can set the default app-level log, trace, and metric levels in the manifest file of the app.
See [Configure event definitions for an app](../../developer-guide/native-apps/event-definition.md) for more information.
Object-specific overrides in the `setup.sql` take precedence over application-level defaults.

To get the override values for the logging, metric, and tracing levels, use the following system functions:

* SYSTEM$APPLICATION_GET_LOG_LEVEL
* SYSTEM$APPLICATION_GET_TRACE_LEVEL
* SYSTEM$APPLICATION_GET_METRIC_LEVEL

## SQL updates

### Cloning databases that contain hybrid tables (*Preview*)

With this release, we are pleased to announce the preview of cloning support for databases that
contain hybrid tables. You can create cloned databases to set up a backup and restore solution
for Unistore applications.

For more information, see:

* [Clone databases that contain hybrid tables](../../user-guide/tables-hybrid-clone.md)
* [CREATE <object> … CLONE](../../sql-reference/sql/create-clone.md)
* [AT | BEFORE](../../sql-reference/constructs/at-before.md)

### New SQL functions

The following function is now available with this release:

| Function Category | New function | Description |
| --- | --- | --- |
| System | [SYSTEM$TRIGGER_LISTING_REFRESH](../../sql-reference/functions/system_trigger_listing_refresh.md) | Triggers an immediate, one-time data refresh for a provider’s database or listing for all consumers who have access to it. |

## Extensibility updates

### Support for associating an event table with a database (*General availability*)

With this release, support for associating an event table with a database is generally available. When you assign an event table to a database,
the scope of objects for which events are collected in the event table is limited to objects in the database.

Previously, an event table could be associated only with the account.

For more information, see [Event table overview](../../developer-guide/logging-tracing/event-table-setting-up.md).

## Data loading updates

### Dynamic tables and tasks: Events logged for refreshes and task executions

You can now configure Snowflake to log events for dynamic table refreshes and task executions. These events are stored in the
[active event table](../../developer-guide/logging-tracing/event-table-setting-up.md) associated with the dynamic table or task.

When a dynamic table is refreshed, Snowflake logs an event to indicate if:

* The refresh succeeded.
* The refresh failed. In this case, the event also includes the error message.
* The refresh failed due to a failure with refreshing an upstream dynamic table.

Similarly, when a task executes, Snowflake logs an event to indicate if the task completed successfully or an error occurred. If an error occurred, the event includes the error message.

You can query these events to identify refreshes that have failed or task executions that resulted in errors.

For example, the following query gets the timestamp, dynamic table name, query ID, and error message for errors with dynamic
tables in the database `my_db`:

```sqlexample
SELECT
    timestamp,
    resource_attributes:"snow.executable.name"::VARCHAR AS dt_name,
    resource_attributes:"snow.query.id"::VARCHAR AS query_id,
    value:message::VARCHAR AS error
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'DYNAMIC_TABLE' AND
    resource_attributes:"snow.database.name" = 'MY_DB' AND
    value:state = 'FAILED'
  ORDER BY timestamp DESC;
```

The following query gets the timestamp, task name, query ID, and error message for errors with tasks in the database `my_db`:

```sqlexample
SELECT
    timestamp,
    resource_attributes:"snow.executable.name"::VARCHAR AS task_name,
    resource_attributes:"snow.query.id"::VARCHAR AS query_id,
    value:message::VARCHAR AS error
  FROM my_event_table
  WHERE
    resource_attributes:"snow.executable.type" = 'TASK' AND
    resource_attributes:"snow.database.name" = 'MY_DB' AND
    value:state = 'FAILED'
  ORDER BY timestamp DESC;
```

For more information, see:

* [Query an event table to monitor refreshes](../../user-guide/dynamic-tables-monitor-event-table-alerts.md) (for dynamic tables)
* [Monitor events for task executions](../../user-guide/tasks-events.md)

## Data lake updates

### CATALOG_NAMESPACE parameter for catalog integrations is now optional

With this release, the CATALOG_NAMESPACE parameter for catalog integrations is now optional instead of required:

* If you create a catalog integration to [sync a Snowflake-managed Iceberg table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md), you no longer need to specify the
  CATALOG_NAMESPACE parameter. Snowflake syncs the Apache Iceberg™ table to the external catalog in Open Catalog that you specify in the catalog
  integration.
* If you create a catalog integration for externally managed Iceberg tables and you don’t specify a CATALOG_NAMESPACE with the catalog integration, you
  must specify it at the table level. You can alternatively specify it with the catalog integration and then override it at the table level.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 21-Feb-25 |
| *Additional information returned for objects bound to references — GA announcement* | **Added** to *New features* section | 24-Feb-25 |
| *More granular control for log, trace, and metric levels in an app — GA announcement* | **Added** to *New features* section | 24-Feb-25 |
| *Cloning databases that contain hybrid tables - Preview* | **Added** to *SQL updates* section | 25-Feb-25 |
| *Automatic tag propagation - GA announcement* | **Removed** from *New features* section | 28-Feb-25 |
| *Dynamic tables and tasks: Events logged for refreshes and task executions* | **Added** to *Data loading updates* section | 01-Mar-25 |
| *New SQL functions* (SYSTEM$TRIGGER_LISTING_REFRESH) | **Added** to *SQL updates* section | 10-Mar-25 |

---
title: 9.40 Release Notes: Dec 15, 2025-Jan 09, 2026
source: https://docs.snowflake.com/en/release-notes/2025/9_40.md
section: Release Notes
---

# 9.40 Release Notes: Dec 15, 2025-Jan 09, 2026

> **Attention:**
>
> This release has completed. For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Notifications for data quality incidents (*Preview*)

Snowflake can automatically send a notification when there is a data quality incident in a database. A data quality incident occurs when the return value of a data metric function (DMF) violates an expectation or constitutes an anomaly.

Notifications can be sent via email or through an external system like Slack, Teams, and PagerDuty.

For more information, see [Sending notifications for data quality issues](../../user-guide/data-quality-notifications.md).

## Deprecated features

### Deprecation of external OpenAI model routing for Cortex Analyst

Snowflake has deprecated the `ENABLE_CORTEX_ANALYST_MODEL_AZURE_OPENAI` account parameter that routes Cortex Analyst requests to external OpenAI GPT models using Azure OpenAI outside the Snowflake secure perimeter.

Snowflake deprecated this legacy configuration for the following reasons:

* It sends data outside Snowflake’s trusted environment.
* It uses older model versions.
* Newer GPT and Claude models now run fully within the Snowflake secure perimeter, offering better text-to-SQL accuracy and stronger security.

Snowflake no longer honors this parameter.

#### Recommended action

Disable the external model parameter, so that Cortex Analyst automatically uses Snowflake-hosted models:

```sqlexample
USE ROLE ACCOUNTADMIN;

ALTER ACCOUNT SET ENABLE_CORTEX_ANALYST_MODEL_AZURE_OPENAI = FALSE;
```

After this change, Cortex Analyst uses the latest models available in your Snowflake region.

#### Cross-region inference (optional)

We strongly recommend enabling cross-region inference to access the full set of LLMs:

```sqlexample
ALTER ACCOUNT SET CORTEX_ENABLED_CROSS_REGION = 'ANY_REGION';
```

Enabling this parameter allows Snowflake to serve requests from the best available LLMs, which might reside in another Snowflake region, while keeping all processing within the Snowflake secure perimeter.

If you prefer to keep inference within your current region, the LLMs must be available in-region. If your region does not have LLMs available, you must enable cross-region inference to use Cortex Analyst.

For more information about the models used by Cortex Analyst, see [Control models used by Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md).

## SQL updates

### Semantic views: Using standard SQL clauses to query semantic views (*Preview*)

You can now use standard SQL clauses in a SELECT statement to query a semantic view.

This feature is in [Preview](../preview-features.md).

You can just specify the name of the semantic view in the FROM clause, rather than specifying the SEMANTIC_VIEW clause. For
example, the following query specifies the SEMANTIC_VIEW clause:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_market_segment
    METRICS orders.order_average_value
  )
  ORDER BY customer_market_segment;
```

The following statement demonstrates how to execute the same query without specifying the SEMANTIC_VIEW clause:

```sqlexample
SELECT customer_market_segment, AGG(order_average_value)
  FROM tpch_analysis
  GROUP BY customer_market_segment
  ORDER BY customer_market_segment;
```

For information, see [Specifying the name of the semantic view in the FROM clause](../../user-guide/views-semantic/querying.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | Dec 15, 2025 |
| Semantic views: Using standard SQL clauses to query semantic views (Preview) | **Added** to SQL updates | Dec 16, 2025 |
| Copy tags when running a CREATE OR REPLACE TABLE command (Preview) | **Removed** from Data governance updates | Jan 06, 2026 |
| Deprecation of external OpenAI model routing for Cortex Analyst | **Added** to Deprecated features | Jan 06, 2026 |

---
title: 9.5 Release notes: Mar 03, 2025-Mar 06, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_05.md
section: Release Notes
---

# 9.5 Release notes: Mar 03, 2025-Mar 06, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Automatic sensitive data classification (*General availability*)

You can now use Snowflake to automatically detect sensitive data using native and custom classifiers. With automatic
sensitive data classification, user-defined tags and masking policies can be automatically applied to columns when sensitive data
is detected.

For information, see [Use SQL to set up sensitive data classification](../../user-guide/classify-auto.md).

## SQL updates

### Snowflake Scripting: Asynchronous child jobs (*General availability*)

With this release, Snowflake Scripting (SQL) support for asynchronous child jobs in stored procedures is generally available. Stored procedures run asynchronous child jobs concurrently. A child job can be any valid SQL statement, including SELECT statements and DML statements, such as INSERT or UPDATE.

To run a query as an asynchronous child job, add the ASYNC keyword to the query. When this keyword is omitted, the stored procedure runs child jobs sequentially, and each child job waits for the running child job to finish before it starts.

Running multiple child jobs concurrently can improve efficiency
and reduce overall run time.

For more information, see [Working with asynchronous child jobs](../../developer-guide/snowflake-scripting/asynchronous-child-jobs.md).

### Snowflake Scripting: Improved error messages

With this release, Snowflake Scripting error messages have been improved to provide more accurate information about the error and about the line number in the code that caused the error.

For example, the following Snowflake Scripting code returns an error:

```sqlexample
EXECUTE IMMEDIATE $$
BEGIN
  LET c1 := 0;
  IF (c1 = 0) THEN
    INSERT invalid_text VALUES (1);
  END IF;
END;
$$
;
```

In past releases, the following error message was returned:

```output
001003 (42000): SQL compilation error:
syntax error line 4 at position 5 unexpected '('.
syntax error line 4 at position 9 unexpected '='.
```

With this release, the following error message is returned:

```output
001003 (42000): SQL compilation error:
syntax error line 5 at position 11 unexpected 'invalid_text'.
```

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 28-Feb-25 |
| Automatic sensitive data classification — GA announcement | **Added** to *New features* section | 03-Mar-25 |

---
title: 9.6 Release notes: Mar 10, 2025-Mar 12, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_06.md
section: Release Notes
---

# 9.6 Release notes: Mar 10, 2025-Mar 12, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Search optimization: Support for column collations

With this release, Search optimization can improve the performance of queries on columns defined with a [COLLATE clause](../../sql-reference/collation.md).
For more information, see [Support for collation](../../user-guide/search-optimization/queries-that-benefit.md) in the search optimization documentation.

## Data pipeline updates

### Dynamic tables: Maximum number of dynamic tables in an account increased to 50,000

With this release, your account can now hold a maximum of 50,000 dynamic tables. Previously, the limit was 10,000 dynamic tables in a single
account.

For more information, see [General limitations](../../user-guide/dynamic-tables-limitations.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 07-Mar-25 |
| Apache Iceberg™ tables: Row-level deletes for externally managed tables | **Removed** from *Data lake updates* section | 10-Mar-25 |
| Dynamic tables: Maximum number of dynamic tables in an account increased to 50,000 | **Added** to *Data pipeline updates* section | 12-Mar-25 |

---
title: 9.7 Release notes (with behavior changes): Mar 17, 2025-Mar 27, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_07.md
section: Release Notes
---

# 9.7 Release notes (with behavior changes): Mar 17, 2025-Mar 27, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2025_02](../bcr-bundles/2025_02_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2025_01](../bcr-bundles/2025_01_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_08](../bcr-bundles/2024_08_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for April 2025; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New features

### Grant database roles to a Snowflake Native App — *Preview*

With this release, providers may grant a database role to a Snowflake Native App. This includes database roles in a database imported
from a data share or the SNOWFLAKE database.

For example, to allow an app named `hello_snowflake_app` to access all tables in a database named `db1`:

```sqlexample
GRANT SELECT ON ALL TABLES IN DATABASE DB1 TO DATABASE ROLE db1.viewer;
GRANT DATABASE ROLE db1.viewer TO APPLICATION hello_snowflake_app;
```

### DISABLE_UI_DOWNLOAD_BUTTON object parameter for Snowsight and the Classic Console (*General availability*)

With this release, the DISABLE_UI_DOWNLOAD_BUTTON object parameter is now available in Snowsight and the Classic Console.

You can set this parameter for accounts or users to hide or display the button for downloading data in Snowsight or the
Classic Console, such as a table returned from running a query in a worksheet.

To hide the download button in Snowsight and the Classic Console from all users in an account, execute the following SQL
statements:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER ACCOUNT SET DISABLE_UI_DOWNLOAD_BUTTON = TRUE;
```

To hide the download button in Snowsight and the Classic Console from a specific user, execute the following SQL statements:

```sqlexample
USE ROLE ACCOUNTADMIN;
ALTER USER <username> SET DISABLE_UI_DOWNLOAD_BUTTON =  TRUE;
```

By default, the DISABLE_UI_DOWNLOAD_BUTTON object parameter is set to FALSE, which displays the download button for all users in an
account.

For more information, see [DISABLE_UI_DOWNLOAD_BUTTON](../../sql-reference/parameters.md).

## Replication updates

### Schema-level replication for failover groups (*General availability*)

With this release, you can choose a subset of schemas for replication for databases in failover groups. To do so, you use the ALTER DATABASE and ALTER SCHEMA commands to set the REPLICABLE_WITH_FAILOVER_GROUPS property on a database and/or specific schemas within that database.

For more information, see [Schema-level replication for failover groups](../../user-guide/account-replication-config.md).

## SQL updates

### Semi-structured data: XML format (*General availability*)

Snowflake support for the XML format is now generally available.

For more information, see [About XML](../../user-guide/semistructured-data-formats.md) and [Introduction to loading semi-structured data](../../user-guide/semistructured-intro.md).

### Spread operator

With this release, you can use the new spread operator (`**`) to expand an array into a list of individual values.

For more information, see [Expansion operators](../../sql-reference/operators-expansion.md).

### New maximum size limits for database objects (*Preview*)

With this release, the new maximum allowed length for columns of type VARCHAR, VARIANT, ARRAY, and OBJECT is 128 MB, and the new maximum allowed length for columns of type BINARY, GEOGRAPHY, and GEOMETRY is 64 MB.

To use this feature, you must [enable the 2025_02 bundle](../bcr-bundles/2025_02_bundle.md).

For more information, see [Size limits for database objects](../../user-guide/data-load-considerations-prepare.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 14-Mar-25 |
| Grant database roles to a Snowflake Native App — Preview | **Added** to *New features* section | 27-Mar-25 |
| DISABLE_UI_DOWNLOAD_BUTTON object parameter for Snowsight and the Classic Console — GA announcement | **Added** to *New features* section | 27-Mar-25 |

---
title: 9.8 Release notes: Mar 31, 2025-Apr 04, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_08.md
section: Release Notes
---

# 9.8 Release notes: Mar 31, 2025-Apr 04, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Trust Center: Risky Human and Service User scanners

With this release, Snowflake is pleased to announce two additional scanners: Human User MFA Readiness and Service User Passwordless Readiness.
As part of the Threat Intelligence scanner package, the new scanners allow you to check for risky human and service users to further reduce
security vulnerabilities.

* **Human User MFA Readiness Scanners** identify human users who have signed in with just a password in the last 90 days and haven’t yet set up
  multi-factor authentication (MFA). It also flags human users who haven’t logged in in 90 days but still have a password set.
* **Service User Passwordless Readiness** looks for Legacy service users who have recently logged in with a password and haven’t removed it. It
  also flags service users who haven’t logged in in 90 days but still have a password set.

For more information, see the [Threat Intelligence scanner package](../../user-guide/trust-center/overview.md).

## SQL updates

### Asynchronous refresh for failover groups and replication groups

With this release, you can call the function SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH
to perform the same refresh as the command ALTER FAILOVER GROUP … REFRESH or ALTER REPLICATION GROUP … REFRESH.
The refresh operations from this function happen asynchronously, so you can continue doing work while the refreshes
are in progress.

For more information, see [SYSTEM$SCHEDULE_ASYNC_REPLICATION_GROUP_REFRESH](../../sql-reference/functions/system_schedule_async_replication_group_refresh.md).

### Bind variables in SHOW commands

With this release, you can use [bind variables](../../sql-reference/bind-variables.md) with the LIKE and LIMIT keywords in
[SHOW](../../sql-reference/sql/show.md) commands. For example, the following SHOW command,
which could be included in a [Javascript](../../developer-guide/stored-procedure/stored-procedures-javascript.md) stored procedure, uses bind variables:

```sqlexample
SHOW TABLES LIKE ? LIMIT ?;
```

The following example uses bind variables in a SHOW command in a [Snowflake Scripting](../../developer-guide/snowflake-scripting/index.md) block:

```sqlexample
BEGIN
  LET a INT := 10;
  LET p STRING := 'mytable';
  LET res RESULTSET := (SHOW TABLES LIKE :p LIMIT :a);
  RETURN TABLE(res);
END;
```

## Data lake updates

### Apache Iceberg™ tables: Row-level deletes for externally managed tables (*Preview*)

With this release, we are pleased to announce the preview of [row-level deletes](https://iceberg.apache.org/spec/?#row-level-deletes)
support with positional delete files when external engines perform update, delete, and merge operations on externally managed Iceberg tables
in Snowflake.

For more information, see [Use row-level deletes](../../user-guide/tables-iceberg-manage.md).

### Apache Iceberg™ tables: Delta table support (*General availability*)

With this release, we are pleased to announce general availability support for creating read-only Iceberg tables from Delta Lake tables stored
in object storage. Creating Iceberg tables sourced from Delta Lake delta logs provides the ability to perform efficient Lakehouse analytics
in Snowflake and generate Iceberg metadata for consumption in an Iceberg engine ecosystem.

For more information, see [CREATE ICEBERG TABLE (Delta files in object storage)](../../sql-reference/sql/create-iceberg-table-delta.md).

### New database properties: CATALOG_SYNC_NAMESPACE_MODE and CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER

With this release, Snowflake is pleased to announce the release of two new database properties:

* CATALOG_SYNC_NAMESPACE_MODE
* CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER

This allows you to query Snowflake-managed Apache Iceberg™ tables in Open Catalog by using a third-party engine that can only query tables
located up to the second namespace level in a catalog, such as Trino.

Use the `FLATTEN` setting for the CATALOG_SYNC_NAMESPACE_MODE property to sync a Snowflake-managed Iceberg table to Snowflake Open Catalog with
one parent namespace by flattening the table’s two parent namespaces into one namespace. Use the CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER
property to insert a delimiter in the resulting namespace to avoid conflicts that could arise from flattening the two parent namespaces.
You specify these properties when you create a database.

For more information, see [CREATE DATABASE](../../sql-reference/sql/create-database.md) and [Sync a Snowflake-managed table with Snowflake Open Catalog](../../user-guide/tables-iceberg-open-catalog-sync.md).

For example, when you set the CATALOG_SYNC_NAMESPACE_MODE property to `FLATTEN` and specify a hyphen (`-`) for the
CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER property, Snowflake syncs the `customer.data.table1` and `custom.erdata.table1` Snowflake-managed
Iceberg tables to the `catalog1` external catalog in Open Catalog with the following fully qualified names:

* `catalog1.customer-data.table1`
* `catalog1.custom-erdata.table1`

If you use the default for the CATALOG_SYNC_NAMESPACE_MODE property (`NEST`), Snowflake continues to sync the table to Open Catalog with two
parent namespaces and the CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER property isn’t required.

## Snowpark Container Services updates

### Automatic suspension of a Snowpark Container Services service (*Preview*)

With this release, we are pleased to announce a preview of support for the AUTO_SUSPEND_SECS service property to define the inactivity duration after which Snowflake automatically suspends the service.

For more information, see [CREATE SERVICE](../../sql-reference/sql/create-service.md) and [ALTER SERVICE](../../sql-reference/sql/alter-service.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 21-Mar-25 |
| Automatic suspension of a Snowpark Container Services service — Preview | **Added** to *Snowpark Container Services updates* section | 27-Mar-25 |

---
title: 9.9 Release notes: Apr 07, 2025-Apr 09, 2025
source: https://docs.snowflake.com/en/release-notes/2025/9_09.md
section: Release Notes
---

# 9.9 Release notes: Apr 07, 2025-Apr 09, 2025

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New Snowflake parameter: DEFAULT_NULL_ORDERING

The new Snowflake parameter
DEFAULT_NULL_ORDERING controls the
default ordering of NULL values in a result set. When this parameter is
set to `FIRST`, NULL values are lower than any non-NULL values.
When this parameter is set to `LAST`, NULL values are higher than any
non-NULL values. The default value is `LAST`, which is the same behavior
as past releases.

For more information, see [DEFAULT_NULL_ORDERING](../../sql-reference/parameters.md).

## Extensibility updates

### Artifact Repository (*Preview*)

Artifact Repository allows you to directly use Python packages from the Python Package Index (PyPI) within Snowpark Python user-defined functions (UDFs) and stored procedures. This capability significantly simplifies development workflows, making it easier to build and scale Python-powered applications in Snowflake.

For more information, see [Artifact Repository overview](../../developer-guide/udf/python/udf-python-packages.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 04-Apr-25 |
| Artifact Repository — Preview | **Added** to *Extensibility updates* section | 09-Apr-25 |

---
title: About Behavior Changes
source: https://docs.snowflake.com/en/release-notes/intro-bcr-releases.md
section: Release Notes
---

# About Behavior Changes

Each month (except for December), Snowflake selects one of the weekly full releases for the month to introduce behavior changes.
The weekly release selected for the behavior changes varies, but is typically the 4th or 5th release after the previous behavior change release.

A behavior change is defined as any change to existing behavior that returns different results from before and may impact customer code or
workloads.

## Behavior Change Bundles

Behavior changes are provided in bundles that utilize the following naming convention:

`YYYY_NN`

Where `YYYY` is the year and `NN` is the ordinal number of the release within the year. For example, `2022_06` would be the 6th behavior
change bundle introduced in 2022.

For more details about working with behavior change bundles, see [Behavior change management](bcr-bundles/managing-behavior-change-releases.md).

## Bundle Lifecycle

The behavior change bundle lifecycle consists of the following two periods:

Testing period (1st month):
:   The bundle is introduced **Disabled by Default**. During this period, you can choose to *enable* the bundle in
    one or more accounts. Typically, you would choose accounts designated for development or QA (quality assurance) so that you can test the
    changes without impacting your production accounts.

Opt-out period (2nd month):
:   The bundle moves from **Disabled by Default** to **Enabled by Default**. During this period, you can choose to
    *disable* the bundle in your accounts. This allows you to postpone the changes in the bundle, typically for production accounts, while
    making any necessary adjustments to mitigate the impact of the changes.

You may choose to explicitly enable or disable the behavior change bundle anytime during these two periods. Once explicitly set, the bundle is
changed from its default state and Snowflake does not override the setting for the above periods.
For example, if you disable a bundle during the testing period, we do not enable it at the beginning of the opt-out period.

At end of the opt-out period, Snowflake enables the behavior changes in the bundle across all accounts, at which time the bundle is considered
**Generally Enabled**. From this time onwards, any overrides are cleared and you are unable to explicitly enable or disable the bundle.

## Behavior Change Documentation

A release that contains behavior change bundles includes the following documentation (in addition to the Release Notes for the
release):

* Summary of each bundle in the release.
* Detailed descriptions of the behavior changes in each bundle.

---
title: Access control: Disallow GRANT REFERENCE_USAGE if GRANT USAGE isn’t set first
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2136.md
section: Release Notes
---

# Access control: Disallow GRANT REFERENCE_USAGE if GRANT USAGE isn’t set first

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, users will not be able to set GRANT REFERENCE_USAGE on a database without first setting GRANT USAGE.

Before the change:
:   Users could run GRANT REFERENCE_USAGE on a database to a share without running GRANT USAGE, and Snowflake would apply the grant on the database as GRANT USAGE.

After the change:
:   Users must run GRANT USAGE before running GRANT REFERENCE_USAGE.

Before this change, if a user ran the following command without running GRANT USAGE, Snowflake also applied GRANT USAGE on the same database to the same share:

```sqlexample
GRANT REFERENCE_USAGE ON DATABASE database2 TO SHARE share1;
```

After the change, if a user runs GRANT REFERENCE_USAGE without first running GRANT USAGE, Snowflake will return the following error:

```output
Cannot grant REFERENCE_USAGE on database {db_name} to share {share_name}. Grant USAGE on a database to share prior to granting REFERENCE_USAGE.
```

Ref: 2136

---
title: Access Control: Granting REFERENCE_USAGE on a Database to a Role No Longer Allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-944.md
section: Release Notes
---

# Access Control: Granting REFERENCE_USAGE on a Database to a Role No Longer Allowed

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The behavior of granting the REFERENCE_USAGE privilege has changed as follows:

Previously:
:   The REFERENCE_USAGE privilege could be granted on the database either individually, in a series of privileges, or with all privileges
    to a role object. For example:

    ```sqlexample
    grant reference_usage on database mydb to role r1;
    grant modify, reference_usage on database mydb to role r1;
    grant all privileges on database mydb to role r1;
    ```

    The output of the SHOW GRANTS command included a row for the REFERENCE_USAGE privilege for each of its grants.

Currently:
:   The REFERENCE_USAGE privilege cannot be granted on a database to a role object. This privilege can only be granted to a share object.

    If a user tries to grant the REFERENCE_USAGE privilege individually, Snowflake returns the following error message:

    `REFERENCE_USAGE ON DATABASE can only be granted to share(s).`

    If a user specifies the REFERENCE_USAGE privilege in a series of privileges or tries to grant all privileges on a database, Snowflake
    returns the follow message:

    `Grant partially executed: privileges [REFERENCE_USAGE] not granted.`

Snowflake allows privileges that can be granted and prevents granting the REFERENCE_USAGE privilege.

The output of the SHOW GRANTS command does not include a row for the grant of the REFERENCE_USAGE privilege on a database to a role object.

Ref: 944

---
title: Access control: Privileges can be granted to users
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1924.md
section: Release Notes
---

# Access control: Privileges can be granted to users

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

The ability to grant privileges is changing as follows:

Before the change:
:   You can grant privileges only to roles (RBAC).

After the change:
:   You can also grant privileges directly to users (UBAC).
    Privileges granted to a user allow that user access only when ALL secondary roles are activated in the current session.

This change extends the Snowflake access control framework to include user-based access control (UBAC).

> **Note:**
>
> To use UBAC, you must enable the 2025_02 behavior change bundle in your account.
>
> To [enable this bundle in your account](../managing-behavior-change-releases.md),
> execute the following statement:
>
> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2025_02');
> ```

For example, when bundle 2025_02 is enabled, the following command syntax will be supported:

```sqlsyntax
GRANT <privileges> ... TO USER;
```

To grant the USAGE privilege on a Streamlit application to a specific user, `joe`:

```sqlexample
GRANT USAGE ON STREAMLIT streamlit_db.streamlit_schema.streamlit_app TO USER joe;
```

If you need to disable UBAC in your account *after* Bundle 2025_02 becomes enabled by default, set the account parameter
`DISABLE_USER_PRIVILEGE_GRANTS = TRUE`. For example:

```sqlexample
ALTER ACCOUNT SET DISABLE_USER_PRIVILEGE_GRANTS = TRUE;
```

Ref: 1924

---
title: ACCESS_HISTORY view (Account Usage and Organization Usage): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2177.md
section: Release Notes
---

# ACCESS_HISTORY view (Account Usage and Organization Usage): New columns

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the ACCOUNT_USAGE.ACCESS_HISTORY and ORGANIZATION_USAGE.ACCESS_HISTORY views include the
following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `event_source` | VARCHAR | Indicates the source of the event that resulted in an access history record. |
| `additional_properties` | VARIANT | Provides operational metadata for the source of the event. |

Ref: 2177

---
title: ACCESS_HISTORY view (Account Usage and Organization Usage): Simpler format for values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2179.md
section: Release Notes
---

# ACCESS_HISTORY view (Account Usage and Organization Usage): Simpler format for values

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The format of values in the ACCOUNT_USAGE.ACCESS_HISTORY and
ORGANIZATION_USAGE.ACCESS_HISTORY views behaves as follows:

Before the change:
:   A value is specified as a JSON object with a `value` key. For example:

    ```json
    {
      "columns": [
        {
          "objectName": "A",
          "objectId": {
            "value": 2
          },
          "subOperationType": "ADD"
        },
        {
          "objectName": "B",
          "objectId": {
            "value": 3
          },
          "subOperationType": "ADD"
        }
      ]
    }
    ```

After the change:
:   A key-value pair shows the value as a string, number, or Boolean directly. The `value` key is no longer needed. For example:

    ```json
    {
      "columns": [
        {
          "objectName": "A",
          "objectId": 2,
          "subOperationType": "ADD"
        },
        {
          "objectName": "B",
          "objectId": 3,
          "subOperationType": "ADD"
        }
      ]
    }
    ```

This change simplifies the record structure and improves readability.

Ref: 2179

---
title: ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified format for tag creation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2180.md
section: Release Notes
---

# ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified format for tag creation

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When you create a tag, you can specify a list of allowed values. The format of these allowed values in the `object_modified_by_ddl` column of the ACCOUNT_USAGE.ACCESS_HISTORY and ORGANIZATION_USAGE.ACCESS_HISTORY views behaves as follows:

Before the change:
:   Allowed values are formatted as keys of a JSON object. For example, when you create a tag with allowed values
    `A`, `B`, and `C`, the format in access history is shown in the following JSON object:

    ```json
    {
      "allowedValues": {
        "A": {
          "subOperationType": "ADD"
        },
        "B": {
          "subOperationType": "ADD"
        },
        "C": {
          "subOperationType": "ADD"
        }
      }
    }
    ```

After the change:
:   Allowed values are formatted as an array of strings. This array is the value of the `allowedValues` key. For example:

    ```json
    {
      "allowedValues": [
        "A",
        "B",
        "C"
      ]
    }
    ```

This change simplifies the record and more clearly reflects the command that was executed.

Ref: 2180

---
title: ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified records for policy creation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2181.md
section: Release Notes
---

# ACCESS_HISTORY view (Account Usage and Organization Usage): Simplified records for policy creation

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The `object_modified_by_ddl` column in the ACCOUNT_USAGE.ACCESS_HISTORY and ORGANIZATION_USAGE.ACCESS_HISTORY
views contains records that correspond to creating a policy. This behavior change affects what is included in a record that corresponds to creating any of the following policies:

* Aggregation policies
* Join policies
* Masking policies
* Privacy policies
* Projection policies
* Row access policies
* Storage lifecycle policies

Before the change:
:   Records include system-generated fields that weren’t specified when creating the policy.
    For example, if someone created a row access policy, the record includes detailed type information with default values:

    ```json
    {
      "policyReturnType": {
        "value": {
          "byteLength": null,
          "collation": null,
          "fixed": null,
          "length": null,
          "nullable": true,
          "precision": null,
          "scale": null,
          "template": null,
          "type": "BOOLEAN"
        }
      },
      "policySignature": {
        "value": {
          "arguments": [
            {
              "datatype": {
                "byteLength": 134217728,
                "collation": null,
                "fixed": false,
                "isMaxLength": true,
                "length": 134217728,
                "nullable": true,
                "precision": null,
                "scale": null,
                "template": null,
                "type": "TEXT"
              },
              "defaultVal": null,
              "hasDefaultValue": false,
              "identifier": "EMAIL",
              "parameterType": "NONE"
            }
          ]
        }
      }
    }
    ```

After the change:
:   Records include only information that was specified by the user when creating the policy. For example:

    ```json
    {
      "policyReturnType": {
        "value": {
          "type": "BOOLEAN"
        }
      },
      "policySignature": {
        "value": {
          "arguments": [
            {
              "datatype": {
                "type": "TEXT"
              },
              "identifier": "EMAIL"
            }
          ]
        }
      }
    }
    ```

Ref: 2181

---
title: ACCESS_HISTORY view (Account Usage and Organization Usage): User-defined values in the objects_modified_by_ddl column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2178.md
section: Release Notes
---

# ACCESS_HISTORY view (Account Usage and Organization Usage): User-defined values in the `objects_modified_by_ddl` column

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

With this behavior change, the `objects_modified_by_ddl` column of the ACCOUNT_USAGE.ACCESS_HISTORY and ORGANIZATION_USAGE.ACCESS_HISTORY views never uses a user-defined value as the key of a JSON key-value pair. The change behaves as follows:

Before the change:
:   User-specified values like column names and tag names are keys in the JSON object. For example:

    ```json
    {
      "columns": {
        "A": {
          "objectId": {"value": 0},
          "subOperationType": "ADD",
          "tags": {
            "DB1.SCH.TAG1": {
              "objectId": {"value": 0},
              "subOperationType": "ADD",
              "tagValue": {"value": "v1"}
            }
          }
        }
      }
    }
    ```

After the change:
:   User-specified information is captured as a value of a key-value pair. For example:

    ```json
    {
      "columns": [
        {
          "objectName": "A",
          "objectId": {"value": 0},
          "subOperationType": "ADD",
          "tags": [
            {
              "objectName": "DB1.SCH.TAG1",
              "objectId": {"value": 0},
              "subOperationType": "ADD",
              "tagValue": {"value": "v1"}
            }
          ]
        }
      ]
    }
    ```

Ref: 2178

---
title: ACCESS_HISTORY View: New parent_query_id and root_query_id columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1265.md
section: Release Notes
---

# ACCESS_HISTORY View: New `parent_query_id` and `root_query_id` columns

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The Account Usage ACCESS_HISTORY view behaves as follows:

Before the change
:   A query on the view does not include the `parent_query_id` and `root_query_id` columns.

After the change
:   A query on the view includes the `parent_query_id` and `root_query_id` columns, which are defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | `parent_query_id` | TEXT | The query ID of the parent job or NULL if the job does not have a parent. |
    | `root_query_id` | TEXT | The query ID of the top most job in the chain or NULL if the job does not have a parent. |

    These columns start to record data when the 2023_08 bundle is enabled and are the last two columns in the view. The query ID corresponds
    to a query in the `query_id` column in the Account Usage ACCESS_HISTORY view. The columns record query IDs for these kinds of
    queries:

    * A query performs a read or write operation on another object.

      The [read or write operation](../../../sql-reference/account-usage/access_history.md) must be an operation that the ACCESS_HISTORY view
      currently supports.
    * A query performs a read or write operation on an object that calls a stored procedure. Nested stored procedure calls are also supported.

    For example, if you have these statements run in order:

    ```sqlexample
    CREATE OR REPLACE PROCEDURE myproc_child()
    RETURNS INTEGER
    LANGUAGE SQL
    AS
    $$
      BEGIN
      SELECT * FROM mydb.mysch.mytable;
      RETURN 1;
      END
    $$;

    CREATE OR REPLACE PROCEDURE myproc_parent()
    RETURNS INTEGER
    LANGUAGE SQL
    AS
    $$
      BEGIN
      CALL myproc_child();
      RETURN 1;
      END
    $$;

    CALL myproc_parent();
    ```

    A query on the ACCESS_HISTORY view records the information as follows:

    ```sqlexample
    USE ROLE GOVERNANCE_VIEWER;

    SELECT
      query_id,
      parent_query_id,
      root_query_id,
      direct_objects_accessed
    FROM
      SNOWFLAKE.ACCOUNT_USAGE.ACCESS_HISTORY;
    ```

    ```output
    +----------+-----------------+---------------+-----------------------------------+
    | QUERY_ID | PARENT_QUERY_ID | ROOT_QUERY_ID |      DIRECT_OBJECTS_ACCESSED      |
    +----------+-----------------+---------------+-----------------------------------+
    |  1       | NULL            | NULL          | [{"objectName": "myproc_parent"}] |
    |  2       | 1               | 1             | [{"objectName": "myproc_child"}]  |
    |  3       | 2               | 1             | [{"objectName": "mytable"}]       |
    +----------+-----------------+---------------+-----------------------------------+
    ```

    * The first row corresponds to calling the second procedure named `myproc_parent` as shown in the `direct_objects_accessed`
      column.

      The `parent_query_id` and `root_query_id` columns return NULL because you called this stored procedure directly.
    * The second row corresponds to the query that calls the first procedure named `myproc_child` as shown in the
      `direct_objects_accessed` column.

      The `parent_query_id` and `root_query_id` columns return the same query ID because the query calling `myproc_child`
      was initiated by the query calling `myproc_parent`, which you called directly.
    * The third row corresponds to the query that accessed the table named `mytable` in the `myproc_child` procedure as shown in
      the `direct_objects_accessed` column.

      The `parent_query_id` column returns the query ID of the query that accessed `mytable`, which corresponds to calling
      `myproc_child`. That stored procedure was initiated by the query calling `myproc_parent`, which is shown in the
      `root_query_id` column.

Ref: 1265

---
title: Account Usage and Information Schema functions views: New column(s) in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1406.md
section: Release Notes
---

# Account Usage and Information Schema functions views: New column(s) in output

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

When this behavior change bundle is enabled, the output of the
[Account Usage Functions view](../../../sql-reference/account-usage/functions.md)
and [Information Schema Functions view](../../../sql-reference/info-schema/functions.md) includes the following new columns:

> | Column name | Data type | Description |
> | --- | --- | --- |
> | `EXTERNAL_ACCESS_INTEGRATIONS` | STRING array | Names of [external access integrations](../../../developer-guide/external-network-access/external-network-access-overview.md) specified by the function’s EXTERNAL_ACCESS_INTEGRATION parameter.  For example: [“TEST_EXTERNAL_ACCESS_INTEGRATION”] |
> | `SECRETS` | JSON map | Map of [secrets](../../../sql-reference/sql/create-secret.md) specified by the function’s SECRETS parameter, where map keys are secret variable names and map values are secret object names.  For example: {“cred”:”SECRET_OAUTH”,”cre2”:”SECRET_GENERIC_STRING”} |

Ref: 1406

---
title: Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1960.md
section: Release Notes
---

# Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns (Postponed)

> **Attention:**
>
> This behavior change was originally in the [2025_03 Bundle](../2025_03_bundle.md) and intended to become enabled by default in the 2025_04 bundle. However, it has been postponed and a new release date has not been determined.

When this behavior change bundle is enabled, the output for the DATA_TYPE column in Account Usage and Snowflake Information Schema
views changes for string columns:

Before the change:
:   In the output of a query on an Account Usage or Information Schema view, the DATA_TYPE column showed `TEXT` for a string column.

After the change:
:   In the output of a query on an Account Usage or Information Schema view, the DATA_TYPE column shows `VARCHAR` for a string column.

The following Account Usage views include a DATA_TYPE column:

* [COLUMNS view](../../../sql-reference/account-usage/columns.md)
* [ELEMENT_TYPES view](../../../sql-reference/account-usage/element_types.md)
* [FIELDS view](../../../sql-reference/account-usage/fields.md)

The following Information Schema views include a DATA_TYPE column:

* [COLUMNS view](../../../sql-reference/info-schema/columns.md)
* [ELEMENT_TYPES view](../../../sql-reference/info-schema/element_types.md)
* [FIELDS view](../../../sql-reference/info-schema/fields.md)

When you query these views, the DATA_TYPE column shows the data type of a column in a table. When this behavior change bundle is enabled,
the output for a column of any [text string type](../../../sql-reference/data-types-text.md) changes. For example, create a table with columns of
various text string types:

```sqlexample
CREATE TABLE text_string_columns_test(
  col1 VARCHAR,
  col2 CHAR,
  col3 TEXT,
  col4 STRING);
```

Execute a query on the INFORMATION_SCHEMA.COLUMNS view:

```sqlexample
SELECT column_name, data_type
  FROM INFORMATION_SCHEMA.COLUMNS
  WHERE table_name ILIKE 'text_string_columns_test'
  ORDER BY column_name;
```

Before the change, the query shows `TEXT` for these columns:

```output
+-------------+-----------+
| COLUMN_NAME | DATA_TYPE |
|-------------+-----------|
| COL1        | TEXT      |
| COL2        | TEXT      |
| COL3        | TEXT      |
| COL4        | TEXT      |
+-------------+-----------+
```

After the change, the query shows `VARCHAR` for these columns:

```output
+-------------+-----------+
| COLUMN_NAME | DATA_TYPE |
|-------------+-----------|
| COL1        | VARCHAR   |
| COL2        | VARCHAR   |
| COL3        | VARCHAR   |
| COL4        | VARCHAR   |
+-------------+-----------+
```

Ref: 1960

---
title: Account Usage QUERY_HISTORY View: Change to QUERY_TAG
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1571.md
section: Release Notes
---

# Account Usage QUERY_HISTORY View: Change to QUERY_TAG

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The output of the [QUERY_HISTORY view](../../../sql-reference/account-usage/query_history.md), when returning information related to a Streamlit in Snowflake query, displays as follows:

Before the change:
:   The `QUERY_TAG` column contains a free form value resembling:

    ```none
    ExecuteStreamlit,streamlitName: STREAMLIT_DB.STREAMLIT_SCHEMA.OBJECT_NAME,streamlitId:123456789
    ```

    With a child query tag content resembling:

    ```none
    File "/usr/lib/python_udf/ed2bb26281494c8405804a3281315153bd4c74b8d05d7de038bb8ce6fe8796d5/lib/python3.8/site-packages/streamlit/runtime/scriptrunner/script_runner.py", line 552, in _run_script
    exec(code, module.dict)
    File "/home/udf/10380937708282/streamlit_app.py", line 29, in <module>
    df = session.sql(sql).collect()
    File "/usr/lib/python_udf/ed2bb26281494c8405804a3281315153bd4c74b8d05d7de038bb8ce6fe8796d5/lib/python3.8/site-packages/snowflake/snowpark/_internal/telemetry.py", line 139, in wrap
    result = func(*args, **kwargs)
    ```

After the change:
:   The `QUERY_TAG` column contains a JSON value resembling:

    ```json
    {
      "StreamlitEngine": "ExecuteStreamlit",
      "StreamlitName": "STREAMLIT_DB.STREAMLIT_SCHEMA.OBJECT_NAME"
    }
    ```

    With a child query tag content resembling:

    ```json
    {
      "StreamlitEngine": "ExecuteStreamlit",
      "StreamlitName": "STREAMLIT_DB.STREAMLIT_SCHEMA.OBJECT_NAME",
      "ChildQuery": "true"
    }
    ```

    When parsing query history or tracking the Streamlit app, refer to the fully-qualified name of the Streamlit app, for example `"StreamlitName": "STREAMLIT_DB.STREAMLIT_SCHEMA.OBJECT_NAME"`.

Ref: 1571

---
title: Account Usage views: Add support for versioned schemas in Snowflake Native App
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1570.md
section: Release Notes
---

# Account Usage views: Add support for versioned schemas in Snowflake Native App

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The output of certain [Account Usage](../../../sql-reference/account-usage.md) views, with respect to versioned schemas, behaves as follows.

Before the change:
:   Some Account Usage views may not include additional rows if the schema is a versioned schema:

After the change:
:   The following views can include additional rows if the schema is a versioned schema:

    * [COMPLETE_TASK_GRAPHS view](../../../sql-reference/account-usage/complete_task_graphs.md)
    * [FUNCTIONS view](../../../sql-reference/account-usage/functions.md)
    * [TABLES view](../../../sql-reference/account-usage/tables.md)
    * [COLUMNS view](../../../sql-reference/account-usage/columns.md)
    * [TABLE_STORAGE_METRICS view](../../../sql-reference/account-usage/table_storage_metrics.md)
    * [TASK_HISTORY view](../../../sql-reference/account-usage/task_history.md)
    * [OBJECT_DEPENDENCIES view](../../../sql-reference/account-usage/object_dependencies.md)
    * [POLICY_REFERENCES view](../../../sql-reference/account-usage/policy_references.md)

> **Note:**
>
> This behavior change is a follow-up to the following behavior changes:
>
> * [BCR-1544: Add support for versioned schemas in Snowflake Native Apps](../2024_02/bcr-1544.md).
> * [BCR-1463: Add support for versioned schemas in Snowflake Native Apps](../2024_01/bcr-1463.md)

Ref: 1570

---
title: Account Usage views: Add support for versioned schemas in Snowflake Native Apps
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1463.md
section: Release Notes
---

# Account Usage views: Add support for versioned schemas in Snowflake Native Apps

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

This behavior change alters the output of [Account Usage views](../../../sql-reference/account-usage/views.md) to include information about versioned schemas.

Before the change:

The Account Usage views may not display information about versioned schemas.

After the change:

The Account Usage views contain information about versioned schemas as described.

**Changes to the SCHEMATA view**

This behavior change adds the following columns to the [SCHEMATA view](../../../sql-reference/account-usage/schemata.md):

* `SCHEMA_TYPE` indicates the type of schema.
  Possible values are:

  > + VERSIONED
  > + STANDARD
* `VERSION_NAME` indicates the name of the schema if it is a versioned schema, or NULL otherwise.
* `VERSIONED_SCHEMA_ID` indicates the internal/system-generated identifier of the schema if it is a versioned schema or NULL otherwise.

This behavior change modifies the following columns in the [Account Usage](../../../sql-reference/account-usage.md):

* SCHEMA_NAME
* CATALOG_NAME
* CATALOG_ID

**Changes to other Account Usage views**

This behavior change modifies the following views to include additional rows if the schema is a versioned schema:

* ALERT_HISTORY
* AUTOMATIC_CLUSTERING_HISTORY
* CLASS_INSTANCES
* CLASSES
* COPY_HISTORY
* FILE_FORMATS
* LOAD_HISTORY
* MASKING_POLICIES
* MATERIALIZED_VIEW_REFRESH_HISTORY
* PIPES
* PROCEDURES
* REFERENTIAL_CONSTRAINTS
* ROLES
* ROW_ACCESS_POLICIES
* SEARCH_OPTIMIZATION_HISTORY
* SEQUENCES
* SERVERLESS_TASK_HISTORY
* SESSION_POLICIES
* STAGES
* TABLE_CONSTRAINTS
* TAGS
* VIEWS

The [ALERT_HISTORY view](../../../sql-reference/account-usage/alert_history.md) also contains the following new columns:

* DATABASE_ID
* SCHEMA_ID

The [PROCEDURES view](../../../sql-reference/account-usage/procedures.md) also contains the following new columns:

* PROCEDURE_SCHEMA_ID
* PROCEDURE_CATALOG_ID

Ref: 1463

---
title: Account Usage views: Add support for versioned schemas in Snowflake Native Apps
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1610.md
section: Release Notes
---

# Account Usage views: Add support for versioned schemas in Snowflake Native Apps

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

This behavior change alters the output of [Account Usage views](../../../sql-reference/account-usage.md) to include information about versioned schemas.

> **Note:**
>
> This behavior change is a follow-up to the following behavior changes:
>
> * [Account Usage views: Add support for versioned schemas in Snowflake Native App](../2024_03/bcr-1570.md)
> * [Account Usage views: Additional rows added to support versioned schemas](../2024_02/bcr-1544.md)
> * [Account Usage views: Add support for versioned schemas in Snowflake Native Apps](../2024_01/bcr-1463.md)

Before the change:
:   Some Account Usage views may not display information about versioned schemas.

After the change:
:   The Account Usage views contain information about versioned schemas as described.

**View affected by this change**

> This behavior change modifies the following views to include additional rows if the schema is a versioned schema:
>
> * [GRANTS_TO_ROLES view](../../../sql-reference/account-usage/grants_to_roles.md)

Ref: 1610

---
title: Account Usage views: Additional rows added to support versioned schemas
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1544.md
section: Release Notes
---

# Account Usage views: Additional rows added to support versioned schemas

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

> **Note:**
>
> This is a follow-up to [this behavior change](../2024_01/bcr-1463.md)
> that added new columns to views to support versioned schema.

Before the change:
:   Some Account Usage views may not display information about versioned schemas.

After the change:
:   The Account Usage views contain information about versioned schemas as described.

**Changes to Account Usage views**

This behavior change modifies the following views to include additional rows if the schema is a versioned schema:

* ELEMENT_TYPES
* FIELDS
* PASSWORD_POLICIES
* DATA_CLASSIFICATION_LATEST
* SERVICES

Ref: 1544

---
title: Account Usage views: Column updates to support the Snowflake Native App Framework
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1379.md
section: Release Notes
---

# Account Usage views: Column updates to support the Snowflake Native App Framework

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of these Account Usage views is as follows:

* CLASSES
* CLASS_INSTANCES

* FILE_FORMATS
* MASKING_POLICIES
* PIPES
* ROW_ACCESS_POLICIES
* SCHEMATA
* SEQUENCES
* SESSION_POLICIES
* STAGES
* TAGS
* VIEWS

* DATABASES
* PROCEDURES
* ROLES

* AGGREGATE_QUERY_HISTORY
* QUERY_HISTORY

* GRANTS_TO_ROLES

Before the change:
:   Regarding the `owner_role_type` column:

    * Some of the views do not include the column.
    * Some of the views include the column but:

      + Do not include support for the application object by specifying `APPLICATION` as the owner object type.
      + Are not consistent with how other Account Usage views specify the column.
    * In the GRANTS_TO_ROLES view:

      + The `grantee_name` column specifies the name of the application object, and the `granted_to` column specifies
        `APPLICATION`.

After the change:
:   The changes to the views are grouped as follows:

    * The CLASSES and CLASS_INSTANCES views update the column to return the identifier of the role that owns the class or the instance of the
      class.
    * The following views already include the `owner_role_type` column and add support for `APPLICATION` as a possible value:

      + FILE_FORMATS
      + MASKING_POLICIES
      + PIPES
      + ROW_ACCESS_POLICIES
      + SCHEMATA
      + SEQUENCES
      + SESSION_POLICIES
      + STAGES
      + TAGS
      + VIEWS
    * The following views add the column as the last column in the view and add support for `APPLICATION` as a possible value:

      + DATABASES
      + PROCEDURES
      + ROLES
    * The following views already include the `role_type` column and add support for `APPLICATION` as a possible value:

      + AGGREGATE_QUERY_HISTORY
      + QUERY_HISTORY
    * In the GRANTS_TO_ROLES view:

      + The `grantee_name` column specifies the name of the application object and the `granted_to` column specifies
        `APPLICATION`.
      + The `granted_by` column specifies the name of the application object when there are grants to application roles.
      + The `granted_by_role_type` column specifies `APPLICATION`.

Ref: 1379

---
title: Account Usage Views: New INSTANCE_ID Column in Select Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1100.md
section: Release Notes
---

# Account Usage Views: New INSTANCE_ID Column in Select Views

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following views in the ACCOUNT_USAGE schema include the column INSTANCE_ID:

* AUTOMATIC_CLUSTERING_HISTORY
* SERVERLESS_TASK_HISTORY
* STAGES
* TABLES
* TASK_HISTORY
* VIEWS

The INSTANCE_ID column was added to support future functionality.

Ref: 1100

---
title: Account Usage Views: New OPTIONS Column in Select Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1091.md
section: Release Notes
---

# Account Usage Views: New OPTIONS Column in Select Views

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following ACCOUNT_USAGE views include the OPTIONS column:

* MASKING_POLICIES
* ROW_ACCESS_POLICIES

The OPTIONS column indicates whether a policy has the EXEMPT_OTHER_POLICIES property set to TRUE or FALSE:

* If the property is set to TRUE, the OPTIONS column returns {“EXEMPT_OTHER_POLICIES”: “TRUE”}.
* If the property is set to FALSE or the property is not specified in the policy, the OPTIONS column returns NULL.

Ref: 1091

---
title: Account Usage: Changes to Columns in DATABASES View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-949.md
section: Release Notes
---

# Account Usage: Changes to Columns in DATABASES View

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

In a future release, the RETENTION_TIME column in the DATABASES view will change. In addition, a new RESOURCE_GROUP column will be added to the view.

## RETENTION_TIME Column

The data retention period for a database is determined by the retention time parameter settings set on the database and account. These parameters are [DATA_RETENTION_TIME _IN_DAYS](../../../sql-reference/parameters.md) and [MIN_DATA_RETENTION_TIME_IN_DAYS](../../../sql-reference/parameters.md).

* If the retention time is not explicitly set for a database, it inherits the setting for the account.
* If there is no retention time set at the account level, the default retention time for a database is 1 day.
* The maximum retention time for a transient database is 1 day, regardless of the account-level setting.
* If there is a minimum retention time set for the account, and a retention time explicitly set on a database, the effective retention time for the database is the greater of the two: MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

The RETENTION_TIME column in the Account Usage [DATABASES View](../../../sql-reference/account-usage/databases.md) might display the incorrect value in the following scenarios:

* If there is no explicit retention time set for transient databases, and the retention time for the account is set to 7 days, the RETENTION_TIME column value is 7 days. This is incorrect. The maximum data retention time for a transient database is 1 day.
* If the minimum retention time for an account is 7 days, and the retention time setting for a database is 4 days, the RETENTION_TIME column value is 4 days. This is incorrect. The minimum account retention time is longer and therefore overrides the retention time explicitly set for the database.
* If the retention time is set to 10 days for a database, then unset, the RETENTION_TIME column value is the unset value (in this case 10). This might be incorrect.

The RETENTION_TIME column value behaves as follows:

Previously:
:   In some cases, the RETENTION_TIME column displays an incorrect data retention time for databases.

Currently:
:   The RETENTION_TIME column will display the correct data retention time for databases.

For more information about setting the data retention period, refer to [Specifying the Data Retention Period for an Object](../../../user-guide/data-time-travel.md).

## RESOURCE_GROUP Column (New)

In a future release, the Account Usage DATABASES view will include the following new column:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RESOURCE_GROUP | TEXT | Reserved for future use. |

Ref: 949

---
title: Account Usage: New and Changed Columns in Certain Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-732.md
section: Release Notes
---

# Account Usage: New and Changed Columns in Certain Views

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

To differentiate between account-level roles and database roles, the following changes have been made to the specified Account Usage
views in the shared SNOWFLAKE database:

* New column, OWNER_ROLE_TYPE, has been added to the following views:

  > + [FILE_FORMATS](../../../sql-reference/account-usage/file_formats.md)
  > + [FUNCTIONS](../../../sql-reference/account-usage/functions.md)
  > + [MASKING_POLICIES](../../../sql-reference/account-usage/masking_policies.md)
  > + [PIPES](../../../sql-reference/account-usage/pipes.md)
  > + [ROLES](../../../sql-reference/account-usage/roles.md)
  > + [ROW_ACCESS_POLICIES](../../../sql-reference/account-usage/row_access_policies.md)
  > + [SCHEMATA](../../../sql-reference/account-usage/schemata.md)
  > + [SEQUENCES](../../../sql-reference/account-usage/sequences.md)
  > + [STAGES](../../../sql-reference/account-usage/stages.md)
  > + [TABLES](../../../sql-reference/account-usage/tables.md)
  > + [TAGS](../../../sql-reference/account-usage/tags.md)
  > + [VIEWS](../../../sql-reference/account-usage/views.md)

  The new column specifies the type of role (ROLE or DATABASE_ROLE) that owns the object.
* [GRANTS_TO_ROLES](../../../sql-reference/account-usage/grants_to_roles.md) view:

  > + Existing column, GRANTED_TO, now differentiates between ROLE and DATABASE_ROLE. Previously, it was always ROLE,
  > + New column, GRANTED_BY_ROLE_TYPE, that displays ROLE or DATABASE_ROLE depending on whether the grantor is an account-level
  >   or database role.
* [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) view:

  > + New column, ROLE_TYPE, that displays ROLE or DATABASE_ROLE depending on whether the job was executed by an account-level or
  >   database role.

Ref: 732

---
title: Account Usage: New Column in DATABASES View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1033.md
section: Release Notes
---

# Account Usage: New Column in DATABASES View

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

A column was added to the DATABASES view in the ACCOUNT_USAGE schema in the shared SNOWFLAKE database as follows:

| Column Name | Data Type | Description |
| --- | --- | --- |
| TYPE | VARCHAR | Specifies the type of database. Valid values are: . - STANDARD: Specifies a normal database. . - IMPORTED DATABASE: Specifies a database that is created from a share. |

Ref: 1033

---
title: Addition of the uses_gpu parameter
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1704.md
section: Release Notes
---

# Addition of the `uses_gpu` parameter

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, providers must indicate in the manifest that a
Snowflake Native App requires a compute pool with a graphics processing unit (GPU).

Before the change:
:   Providers can configure a Snowflake Native App to use a graphic processing unit (GPU) by setting the INSTANCE_FAMILY
    parameter of the [CREATE COMPUTE POOL](../../../sql-reference/sql/create-compute-pool.md) in the setup script of the app.

After the change:
:   In addition to setting the INSTANCE_FAMILY parameter when [creating a compute pool](../../../sql-reference/sql/create-compute-pool.md),
    providers must also add the `uses_gpu` property to the manifest file of the app. This property is used during the automated
    security scan and to enable the app to create compute pools that use GPUs.

Ref: 1704

---
title: Administrator-owned warehouse for Snowflake Notebooks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1871.md
section: Release Notes
---

# Administrator-owned warehouse for Snowflake Notebooks

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

In release 8.26, Snowflake introduced a new default warehouse, SYSTEM$STREAMLIT_NOTEBOOK_WH, as a Snowflake-managed object in customer accounts.
With this behavior change, the ownership of this warehouse transitions from system-owned to administrator-owned.

After this change:

1. The ACCOUNTADMIN role will have the OWNERSHIP privilege on this warehouse.
2. Users can run Python workloads as well as execute any SQL queries on this warehouse.
3. The ACCOUNTADMIN role can manage this warehouse and perform operations on it.
4. In alignment with [BCR-1887](bcr-1887.md), the SYSTEM$STREAMLIT_NOTEBOOK_WH warehouse is now available for use by [Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks.md) users.

Ref: 1871

---
title: ALERT_HISTORY view and function (Account Usage and Information Schema): Changes to output when action contains RETURN statement
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1183.md
section: Release Notes
---

# ALERT_HISTORY view and function (Account Usage and Information Schema): Changes to output when action contains RETURN statement

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the values in the `state` and `action_query_id` columns of the output of the
[ALERT_HISTORY table function](../../../sql-reference/functions/alert_history.md) and
[ALERT_HISTORY Account Usage view](../../../sql-reference/account-usage/alert_history.md) have changed, if the action of the query
contains a [Snowflake Scripting RETURN statement](../../../sql-reference/snowflake-scripting/return.md):

Previously:
:   The state column contains the value `SUCCEEDED`, and the `action_query_id` column contains the incorrect query ID.

Currently:
:   The `state` column contains the value `TRIGGERED` (if the action successfully executed) or `ACTION_FAILED` (if the
    action failed to execute), and the `action_query_id` column contains the correct query ID.

Ref: 1183

---
title: ALERT_HISTORY view and table function: New SCHEDULED_FROM column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1894.md
section: Release Notes
---

# ALERT_HISTORY view and table function: New `SCHEDULED_FROM` column

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the [ALERT_HISTORY view](../../../sql-reference/account-usage/alert_history.md) (in ACCOUNT_USAGE) and the
output of the [ALERT_HISTORY](../../../sql-reference/functions/alert_history.md) table function (in INFORMATION_SCHEMA) include the following new
column:

| Column name | Data type | Description |
| --- | --- | --- |
| SCHEDULED_FROM | TEXT | Specifies what initiated the alert. The column contains one of the following values:   * `SCHEDULE`: The alert was scheduled to run normally, as described in SCHEDULE clause of   [CREATE ALERT](../../../sql-reference/sql/create-alert.md). * `EXECUTE ALERT`: The alert was scheduled to run using [EXECUTE ALERT](../../../sql-reference/sql/execute-alert.md). * `TRIGGER`: The [alert on new data](../../../user-guide/alerts.md) was run because the underlying table or view   contains new data. |

Ref: 1894

---
title: Alerts: Support for Account and Database Replication
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1023.md
section: Release Notes
---

# Alerts: Support for Account and Database Replication

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

Alert objects are replicated when you replicate the objects in an account or database:

Previously:
:   When you replicate objects in your account or database, alert objects are not replicated.

Currently:
:   Alert objects will be replicated. This includes alerts that were created before this behavior change takes effect.

> **Note:**
>
> This behavior change is enabled by default for the 2023_05 bundle, the replication behavior changes may take
> effect up to 1 business day in advance, due to a known limitation.

Ref: 1023

---
title: All Release Notes
source: https://docs.snowflake.com/en/release-notes/all-release-notes.md
section: Release Notes
---


---
title: ALTER APPLICATION PACKAGE command: Expanded validation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1627.md
section: Release Notes
---

# ALTER APPLICATION PACKAGE command: Expanded validation

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

The [ALTER APPLICATION PACKAGE … [ADD VERSION | ADD PATCH FOR VERSION] command](../../../sql-reference/sql/alter-application-package-version.md) command behaves as follows:

Before the change:
:   When altering an application package to either add a new version, or patch an existing version,
    the operation could succeed even though there were SQL syntax errors in the associated setup script.

    Such syntax errors would cause errors later in the application lifecycle, such as when a user attempted to install the app.

After the change:
:   When altering an app to either add a new version, or patch an existing version, static validation is performed for setup script syntax errors.

    When a error is detected the command fails with error:

    ```output
    Application package <pkg> failed validation during version creation: ....<details of error>
    ```

Ref: 1627

---
title: ALTER REPLICATION GROUP or ALTER FAILOVER GROUP with SUSPEND IMMEDIATE clause: Synchronously cancel active replication job
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2202.md
section: Release Notes
---

# ALTER REPLICATION GROUP or ALTER FAILOVER GROUP with SUSPEND IMMEDIATE clause: Synchronously cancel active replication job

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The SUSPEND IMMEDIATE clause of the [ALTER REPLICATION GROUP](../../../sql-reference/sql/alter-replication-group.md) command or
[ALTER FAILOVER GROUP](../../../sql-reference/sql/alter-failover-group.md) command behaves as follows:

Before the change:
:   The `ALTER { REPLICATION | FAILOVER GROUP } name SUSPEND IMMEDIATE` command begins to cancel
    the ongoing refresh job but returns immediately, regardless of whether the refresh job is canceled yet.

After the change:
:   The `ALTER { REPLICATION | FAILOVER GROUP } name SUSPEND IMMEDIATE` command waits
    and returns only after the refresh job has been successfully canceled.

This behavior change is being made to provide more predictable and deterministic behavior for the command.
The change removes the need for you to manually verify when the refresh job is fully canceled.

Ref: 2202

---
title: ALTER TABLE and ALTER VIEW commands: Enable drop operation when a row access policy is not set
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1327.md
section: Release Notes
---

# ALTER TABLE and ALTER VIEW commands: Enable drop operation when a row access policy is not set

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

The [ALTER TABLE](../../../sql-reference/sql/alter-table.md) … DROP ALL ROW ACCESS POLICIES command and [ALTER VIEW](../../../sql-reference/sql/alter-view.md) … DROP ALL
ROW ACCESS POLICIES command behave as follows:

Before the change:
:   For example, if a row access policy is not set on the table and you try to run an ALTER TABLE … DROP ALL ROW ACCESS POLICIES command,
    Snowflake returns the following error message:

    ```none
    Any policy of kind ROW_ACCESS_POLICY is not attached to TABLE T1.
    ```

After the change:
:   If a row access policy is not set on the table and you try to run an ALTER TABLE … DROP ALL ROW ACCESS POLICIES command, Snowflake
    returns a successful status message:

    ```none
    +----------------------------------+
    | status                           |
    |----------------------------------|
    | Statement executed successfully. |
    +----------------------------------+
    ```

    This change can simplify your workflow scripts because you no longer need to have a workaround when Snowflake returns the error message.

Ref: 1327

---
title: ALTER TABLE: Incompatible Default Values No Longer Allowed in New Columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1425.md
section: Release Notes
---

# ALTER TABLE: Incompatible Default Values No Longer Allowed in New Columns

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The ALTER TABLE command behaves as follows:

Before the change:
:   ALTER TABLE ADD COLUMN DEFAULT commands allowed adding a column with a default value that was
    incompatible with the data type of the column. The resulting default value was unusable.
    For example, although the default value of 1 is incompatible with the TIMESTAMP_TZ data type,
    the following command would succeed:

    `ALTER TABLE t ADD COLUMN x TIMESTAMP_TZ DEFAULT 1;`

After the change:
:   ALTER TABLE ADD COLUMN DEFAULT commands no longer allow adding a column with a default value that
    is incompatible with the data type of the column. An attempt to set an incompatible default value
    for a column fails with an error. For example:

    `SQL compilation error: Expression type does not match column data type, expecting DATE but got NUMBER(1,0) for column Y`

    The following specific combinations fail:

    | Data type of column | Data type of DEFAULT value |
    | --- | --- |
    | VARCHAR | BOOLEAN |
    | DATE | BOOLEAN |
    | TIME | BOOLEAN |
    | TIMESTAMP_LTZ | BOOLEAN |
    | TIMESTAMP_NTZ | BOOLEAN |
    | TIMESTAMP_TZ | BOOLEAN |
    | FLOAT | BOOLEAN |
    | NUMBER | BOOLEAN |
    | BOOLEAN | VARCHAR |
    | DATE | FLOAT |
    | TIME | FLOAT |
    | TIMESTAMP_LTZ | FLOAT |
    | TIMESTAMP_NTZ | FLOAT |
    | TIMESTAMP_TZ | FLOAT |
    | DATE | NUMBER |
    | TIME | NUMBER |
    | TIMESTAMP_LTZ | NUMBER |
    | TIMESTAMP_NTZ | NUMBER |
    | TIMESTAMP_TZ | NUMBER |

Ref: 1425

---
title: ALTER USER and DESCRIBE USER commands: LOGIN_NAME mapped to SCIM_USER_NAME
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2158.md
section: Release Notes
---

# ALTER USER and DESCRIBE USER commands: LOGIN_NAME mapped to SCIM_USER_NAME

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

## ALTER USER command: LOGIN_NAME mapped to SCIM_USER_NAME

The [ALTER USER](../../../sql-reference/sql/alter-user.md) command behaves in the following manner:

Before the change:
:   Running the ALTER USER SET LOGIN_NAME command never updates the SCIM_USER_NAME field.

After the change:
:   Running the ALTER USER SET LOGIN_NAME command updates the SCIM_USER_NAME field
    *if and only if* it is previously populated. If the user’s SCIM_USER_NAME field is
    not populated, the field remains blank.

## DESCRIBE USER command: New column in output

The [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) command behaves in the following manner:

Before the change:
:   The SCIM_USER_NAME field is not visible in the output of DESCRIBE USER.

After the change:
:   The SCIM_USER_NAME field is visible in the output of DESCRIBE USER.

When this behavior change bundle is enabled, the output of the DESCRIBE USER command
includes the following new column:

| Column name | Data Type | Description |
| --- | --- | --- |
| SCIM_USER_NAME | VARCHAR | LOGIN_NAME defined for a user in a Security Identification Module (SCIM). |

### Displaying SCIM_USER_NAME in DESCRIBE USER

For all accounts, running DESCRIBE USER outputs a new row displaying the
SCIM_USER_NAME. Only users that were provisioned or updated with a SCIM
integration have that field set. Other users don’t have that field set.

### Updating SCIM_USER_NAME on ALTER USER SET LOGIN_NAME

If the target user has SCIM_USER_NAME set, that field is updated to the raw value
provided in the ALTER USER request. For example:

```sqlexample
ALTER USER user1 RENAME TO "user2"
```

This updates the user to have the following values:

* NAME: `user2`
* LOGIN_NAME: `USER2`
* SCIM_USER_NAME: `"user2"`

This matches the behavior in the SCIM API.

If the target user doesn’t have the SCIM_USER_NAME set, the field remains blank.

#### Examples: Valid requests

```sqlexample
ALTER USER user SET LOGIN_NAME='user1'
```

After this valid request, the user has LOGIN_NAME set to `USER1` and the SCIM_USER_NAME
set to `USER1`.

```sqlexample
ALTER USER user SET LOGIN_NAME='user1' SCIM_USER_NAME='User1'
```

After this valid request, the user has LOGIN_NAME set to `USER1` and the SCIM_USER_NAME
set to `User1`.

```sqlexample
ALTER USER user SET LOGIN_NAME='user1' SCIM_USER_NAME='"User1"'
```

After this valid request, the user has LOGIN_NAME set to `USER1` and the SCIM_USER_NAME
set to `"User1"`.

#### Examples: Invalid requests

```sqlexample
ALTER USER user SET SCIM_USER_NAME='value'
```

This request is invalid. SCIM_USER_NAME can only be provided when LOGIN_NAME is
present in the ALTER USER request.

```sqlexample
ALTER USER user SET LOGIN_NAME='user1' SCIM_USER_NAME='user2'
```

This request is invalid. SCIM_USER_NAME `user2` isn’t a case insensitive match against
the LOGIN_NAME `user1`.

Ref: 2158

---
title: Amazon Virtual Private Cloud ID for external stage, external function, and external volume
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-vpc-change-2025-02-03.md
section: Release Notes
---

# Amazon Virtual Private Cloud ID for external stage, external function, and external volume

As part of Snowflake’s continued commitment to enhance control over data leaving Snowflake,
we are migrating our egress control points to a new Amazon Virtual Private Cloud (VPC).
This will result in a change to the Amazon VPC ID used by Snowflake when
making outbound connections for external functions, external stages, and external volumes.

> **Note:**
>
> Customers using [Amazon S3-compatible storage](../../../user-guide/data-load-s3-compatible-storage.md)
> should confirm if they have any policies that may be affected by this change, and update their policies accordingly.

Customers who filter traffic coming into their API Gateways or S3 stages based on the published VPC ID,
will need to update their policies to include the new VPC ID.

To obtain the full list of VPC IDs that need to be allowlisted, customers should
run the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md) function.

## Behavior change

Before the change:
:   The output of [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md) does not contain the `snowflake-egress-vpc-ids` property.

After the change:
:   The output of the SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO function contains a new property `snowflake-egress-vpc-ids` which includes
    `id`, `expires`, and `purpose` child properties.

The output of the function resembles:

```output
{
  "snowflake-vpc-id": ["<existing VPC ID>"],
  "snowflake-egress-vpc-ids": [
    {
      "id": "<existing VPC ID>",
      "expires": "2025-03-01T00:00:00",
      "purpose": "generic"
    },
    {
      "id": "<new VPC ID>",
      "expires": "2025-03-01T00:00:00",
      "purpose": "generic"
    }
  ]
}
```

Customers should examine the `id` field within the `snowflake-egress-vpc-ids`
property and note `id` values marked as `"purpose":"generic"`.
`generic` IDs are VPC IDs which will need to be allowlisted to support core Snowflake functionality.

This change becomes effective during the week of February 24 2025.

> **Note:**
>
> The function returns a list of VPC IDs: the currently used VPC ID and new VPC ID(s).
> VPC IDs from `snowflake-vpc-id` will be duplicated in `snowflake-egress-vpc-ids` but marked as `"purpose":"generic"`.
> All VPC IDs with the generic purpose must be allowlisted in the policies.
>
> The `expires` property specifies the date and time until which the associated VPC ID is guaranteed to remain valid.
> Customers should update any automation or processes to query the function before the
> expiration date to ensure they have the latest information about the current VPC IDs.
>
> Output VPC IDs are stable and their expiration dates are automatically updated and extended.
>
> While Snowflake may need to change VPC IDs in the future, there are no plans to change before March 31, 2025.
> This information is primarily for future reference.

The following changes must be made to continue to access the following Snowflake features:

External stage and volumes:
:   Follow the instructions in [Allowing the Virtual Private Cloud IDs](../../../user-guide/data-load-s3-allow.md) to specify VPC IDs for external stages or external volumes.

External functions:
:   Follow the instructions in [Secure your Amazon API Gateway endpoint](../../../sql-reference/external-functions-creating-aws-ui-proxy-service.md) to specify VPC IDs for external functions.

## Timeline

Stage one:
:   Starting the week of February 24, 2025, [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md)
    function will be updated across all AWS deployments to include the new egress VPC IDs under the
    `snowflake-egress-vpc-ids` element. Customers can begin updating their S3 and API Gateway policies to allowlist these new VPC IDs.

Stage two:
:   Starting the week of June 9, 2025 (previously May 24, 2025), Snowflake will start a gradual
    transition to using the new VPCs for external stages, external functions, and external volumes.
    Customers must ensure their S3 and API Gateway policy updates are completed by this date.

Stage three:
:   Starting the week of February 15, 2026, old VPC-IDs will no longer be available.
    If you encounter any issues while connecting, update to the new VPC-ID for external stages, external functions, and external volumes.

    Additionally, you must ensure their S3 and API Gateway policies are updated with the new VPC-ID.

For any issues, or concerns, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Frequently asked questions

### How can I find S3 buckets for external stages that can be impacted by this change?

Using the ACCOUNTADMIN role, execute the following query to identify the external stages that are affected by the change:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT stage_url, stage_region, stage_owner, stage_catalog, stage_schema
  FROM SNOWFLAKE.ACCOUNT_USAGE.STAGES
  WHERE STARTSWITH(stage_url, 's3')
    AND stage_url IS NOT NULL
    AND deleted IS NULL;
```

### How can I find API Gateways that can be impacted by this change?

Using the ACCOUNTADMIN role, execute the following query to identify the S3 gateways that are affected by the change:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT function_name, function_definition, function_owner, function_catalog, function_schema
  FROM SNOWFLAKE.ACCOUNT_USAGE.FUNCTIONS
  WHERE function_language = 'EXTERNAL'
    AND function_definition ILIKE '%.execute-api.%.amazonaws.com%'
    AND deleted IS NULL;
```

### How can I find S3 buckets for external volume that can be impacted by this change?

Using the ACCOUNTADMIN role, execute the following query to identify the S3 buckets that are affected by the change:

```sqlexample
use role accountadmin;

DECLARE
res1 RESULTSET;
res2 RESULTSET;
sql_vol VARCHAR;
rpt VARIANT;
rpt_int VARIANT;
BEGIN
  rpt := object_construct();
  sql_vol := 'SELECT PROPERTY, VALUE:"NAME"::VARCHAR as NAME, VALUE:"STORAGE_ALLOWED_LOCATIONS"::VARCHAR as S3_PATH FROM (
SELECT PARSE_JSON(T."property_value") AS VALUE, T."property" as PROPERTY
FROM TABLE(RESULT_SCAN(last_query_id())) T
WHERE T."property_type" = \'String\'
AND T."property" != \'ACTIVE\'
AND VALUE:"STORAGE_PROVIDER"=\'S3\')
;';
  show external volumes;
  LET c1 CURSOR FOR SELECT * FROM TABLE(RESULT_SCAN(last_query_id()));
  OPEN c1;
  FOR record IN c1 DO
    res1 := (execute immediate 'describe external volume ' || record."name");
    res2 := (execute immediate :sql_vol);
    rpt_int := object_construct();
    let c2 CURSOR for res2;
    open c2;
    for inner_record in c2 do
        rpt_int := object_insert( rpt_int, inner_record.NAME, inner_record.S3_PATH);
    end for;

    rpt := object_insert( rpt, record."name", rpt_int );
  END FOR;
  RETURN rpt;
END;
```

### How can I find AWS policies that may contain the current Snowflake VPC ID?

To identify policies potentially affected by the change, use the [AWS CLI](https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html).

1. Run [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md) command and note VPC ID returned in snowflake-vpc-id.
2. Run the following script to list the ARNs of IAM policies and the API IDs of API Gateways that contain resource policies with the current Snowflake VPC ID. Please note, running the script may take time.
3. Review the listed policies and determine which ones need to be updated to include additional VPC ID(s), as per the instructions above.

```bash
SNOWFLAKE_VPC_ID="<VPC ID returned in snowflake-vpc-id>"

# List ARNs of IAM policies that contain the current Snowflake VPC ID.
aws iam list-policies --scope Local --query 'Policies[*].Arn' --output text | tr '\t' '\n' | while read -r policy_arn; do
  version_id=$(aws iam get-policy --policy-arn "${policy_arn}" --query 'Policy.DefaultVersionId' --output text)
  aws iam get-policy-version --policy-arn "${policy_arn}" --version-id "${version_id}" | grep -q "${SNOWFLAKE_VPC_ID}" && echo "${policy_arn}"
done

# List API IDs of API Gateways that contain resource policies with the current Snowflake VPC ID.
aws apigateway get-rest-apis --query 'items[*].id' --output text --profile | tr '\t' '\n' | while read -r api_id; do
  aws apigateway get-rest-api --rest-api-id "${api_id}" --query 'policy' --output text | grep -q "${SNOWFLAKE_VPC_ID}" && echo "${api_id}"
done
```

Ref: 1910

---
title: Apache Iceberg™ tables:  Refreshing Delta-based tables fails if the UUID changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-2006.md
section: Release Notes
---

# Apache Iceberg™ tables: Refreshing Delta-based tables fails if the UUID changes

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

For Delta-based Iceberg tables:

Before the change:
:   If the universally unique identifier (UUID) of the underlying Delta table changes,
    Snowflake doesn’t block refreshing the table.

After the change:
:   If the UUID changes, Snowflake recognizes the table as entirely separate and you can’t refresh it.
    If this occurs, recreate the Delta-based table.

Ref: 2006

---
title: Apache Iceberg™ tables: ABFS write paths for Azure external volumes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1935.md
section: Release Notes
---

# Apache Iceberg™ tables: ABFS write paths for Azure external volumes

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

For Snowflake-managed [Apache Iceberg™ tables](../../../user-guide/tables-iceberg.md) that use an Azure external volume, Snowflake behaves as follows:

Before the change:
:   Snowflake uses the Windows Azure Storage Blob (WASB) protocol to create paths when writing Iceberg metadata files.

    For example: `wasbs://<azure_container>@<azure_storage_account>.blob.core.windows.net/<file_path>/`

After the change:
:   Snowflake uses the Azure Blob File System (ABFS) protocol to create paths when writing Iceberg metadata files.

    For example: `abfss://<azure_container>@<azure_storage_account>.blob.core.windows.net/<file_path>/`

    Ensure that any external tools that read your Snowflake-managed Iceberg tables can read the ABFS paths.

Ref: 1935

---
title: Apache Iceberg™ tables: Metadata file naming convention change
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1645.md
section: Release Notes
---

# Apache Iceberg™ tables: Metadata file naming convention change

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

Snowflake names metadata files for new and existing Snowflake-managed Apache Iceberg™ tables as follows:

Before the change:
:   Snowflake uses a deterministic naming convention based on the table version.

    For example: `v1715886514322000000.metadata.json`

After the change:
:   Snowflake uses a non-deterministic naming convention based on a universally unique identifier (UUID).

    For example: `00001-8a14161c-65ad-45fc-b665-ec16dcbf647e.metadata.json`

    Snowflake doesn’t rename existing metadata files, but applies the new naming convention
    to new metadata files created when the table is updated.

Ref: 1645

---
title: Apache Iceberg™ tables: New write location for empty string BASE_LOCATION
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1534.md
section: Release Notes
---

# Apache Iceberg™ tables: New write location for empty string BASE_LOCATION

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

For [Apache Iceberg™ tables](../../../user-guide/tables-iceberg.md) that use Snowflake as the catalog,
the write location is as follows when you specify an empty string as the relative path
from your external volume (`BASE_LOCATION = ''`):

Before the change:
:   Snowflake creates a directory under your external volume location (`STORAGE_BASE_URL`)
    using the table name and entity ID, and writes to subdirectories named `data` and `metadata` in the new directory.

    For example:

    * `s3://my/storage/base/url/table_name_entity_id/data`
    * `s3://my/storage/base/url/table_name_entity_id/metadata`

After the change:
:   Snowflake writes to subdirectories named `data` and `metadata` that appear directly under your external volume location.

    For example:

    * `s3://my/storage/base/url/data`
    * `s3://my/storage/base/url/metadata`

    You can still access any data that was written before the behavior change in the previous locations
    under the `table_name_entity_id` directory.

This behavior change also applies to converted Iceberg tables.

Ref: 1534

---
title: Apache Iceberg™ tables: Updates to metadata retention period
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1519.md
section: Release Notes
---

# Apache Iceberg™ tables: Updates to metadata retention period

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

Snowflake determines the metadata retention period for [Apache Iceberg™ tables
that use a catalog integration](../../../user-guide/tables-iceberg.md) as follows:

Before the change:
:   Snowflake retrieves the value of `history.expire.max-snapshot-age-ms` from the current
    metadata file, converts the value to days (rounding down), and stores it in the
    DATA_RETENTION_TIME_IN_DAYS parameter.

    If Snowflake doesn’t find `history.expire.max-snapshot-age-ms` in the metadata file,
    or can’t parse the value, it sets DATA_RETENTION_TIME_IN_DAYS at the table level to a
    default value of 5 days (the default Apache Iceberg value).

    You can also change the value of DATA_RETENTION_TIME_IN_DAYS manually.

After the change:
:   Snowflake sets DATA_RETENTION_TIME_IN_DAYS at the table level to whichever of
    the following values is *smaller*:

    * `history.expire.max-snapshot-age-ms`
    * The following value, depending on your Snowflake account edition:

      + Standard Edition: 1 day.
      + Enterprise Edition or higher: 5 days.

    You can’t change the value of DATA_RETENTION_TIME_IN_DAYS manually. Instead, you must update
    `history.expire.max-snapshot-age-ms` and refresh the table.

Ref: 1519

---
title: Apache Iceberg™ tables: version-hint.text file no longer generated
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1658.md
section: Release Notes
---

# Apache Iceberg™ tables: version-hint.text file no longer generated

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

For new and existing Snowflake-managed Apache Iceberg™ tables:

Before the change:
:   Snowflake generates a `version-hint.text` file in the metadata file location for a table.

After the change:
:   Snowflake no longer generates the `version-hint.text` file.

Ref: 1658

---
title: Apache Iceberg™ tables: Writing data files to subdirectories in Amazon S3
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1706.md
section: Release Notes
---

# Apache Iceberg™ tables: Writing data files to subdirectories in Amazon S3

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

Snowflake writes Parquet data files in Amazon S3 for Snowflake-managed tables as follows:

Before the change:
:   Snowflake writes all table data files to a single directory named `data/` in your external cloud storage.

    Example path for each data file:

    `s3://externalVolumeStorageLocation/tableBaseLocation/data/snow_externalFileId.parquet`

After the change:
:   Snowflake supports writing data files for new or existing tables to randomly-named subdirectories
    under the `data/` directory. This helps you avoid S3 throttling and optimize query performance.

    Example path for each data file:

    `s3://externalVolumeStorageLocation/tableBaseLocation/data/randomPrefix/snow_externalFileId.parquet`

Ref: 1706

---
title: Apache Iceberg™: New write paths for Snowflake-managed tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1873.md
section: Release Notes
---

# Apache Iceberg™: New write paths for Snowflake-managed tables

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

For Snowflake-managed Iceberg tables (including [converted tables](../../../user-guide/tables-iceberg-conversion.md) and [dynamic Iceberg tables](../../../user-guide/dynamic-tables-create-iceberg.md)):

Before the change:
:   Snowflake requires that you specify a `BASE_LOCATION` for the table,
    and writes Parquet data files and table metadata to the following paths in your external cloud storage:

    * `STORAGE_BASE_URL/BASE_LOCATION/data/`
    * `STORAGE_BASE_URL/BASE_LOCATION/metadata/`

After the change:
:   Snowflake no longer requires a `BASE_LOCATION`,
    and constructs paths using a random 8-character string or the value of a new schema-level string parameter called
    `BASE_LOCATION_PREFIX`.

    If you specify a `BASE_LOCATION`, Snowflake ignores and does not use the `BASE_LOCATION_PREFIX`.

    Snowflake constructs paths using the following patterns, depending on the values specified
    for `BASE_LOCATION` or `BASE_LOCATION_PREFIX`:

    * No BASE_LOCATION, no BASE_LOCATION_PREFIX: `STORAGE_BASE_URL/<database>/<schema>/<table_name>.<randomId>/<data | metadata>/`
    * No BASE_LOCATION, BASE_LOCATION_PREFIX = ‘my_prefix’: `STORAGE_BASE_URL/my_prefix/<table_name>.<randomId>/<data | metadata>/`
    * BASE_LOCATION = ‘my_base_loc’: `STORAGE_BASE_URL/my_base_loc.<randomId>/<data | metadata>/`
    * BASE_LOCATION = ‘’ (empty string): `STORAGE_BASE_URL/<randomId>/<data | metadata>/`

Ref: 1873

---
title: Application package version drop: Error when consumers are still using the version (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2273.md
section: Release Notes
---

# Application package version drop: Error when consumers are still using the version (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

The version drop process for application packages and application package release channels behaves as follows:

Before the change:
:   When a provider dropped a version from an application package or an application package release channel,
    the command succeeded even when there were applications still using the version. The version entered the
    dropping state and the drop completed once no applications were using that version.

After the change:
:   When a provider drops a version from an application package or an application package release channel,
    if there are applications still using that version, the provider receives an error indicating the consumer
    account that is still using the version. The provider must move all consumers off the version before it
    can enter the dropping state.

Ref: 2273

---
title: Application package version drop: Immediate replication refresh for version drops (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2274.md
section: Release Notes
---

# Application package version drop: Immediate replication refresh for version drops (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

The version drop process for application packages and application package release channels behaves as follows:

Before the change:
:   When a provider marked a version as dropping from an application package or application package release channel,
    the version entered the dropping state in the primary region. However, the provider had to wait for the next
    replication attempt to transition the version to the dropping state in replicated regions. This caused delays
    in the version drop process.

After the change:
:   When a provider marks a version as dropping from an application package or application package release channel,
    the version enters the dropping state in the primary region and a new replication refresh task starts immediately
    to update the secondary regions. The version enters the dropping state in all replicated regions without waiting
    for the next scheduled replication attempt.

    The LISTING_AUTO_REFRESH property on the application package controls this behavior and is enabled by default.

Ref: 2274

---
title: APPLICATION_STATE view (Data Sharing Usage): New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2231.md
section: Release Notes
---

# APPLICATION_STATE view (Data Sharing Usage): New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the [APPLICATION_STATE view](../../../sql-reference/data-sharing-usage/application-state-view.md) view in the [Data Sharing Usage](../../../sql-reference/data-sharing-usage.md) schema includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| UPGRADE_DEADLINE | TIMESTAMP_LTZ | The deadline by which an upgrade must be completed. After this time, the system will automatically upgrade the application regardless of the consumer’s maintenance policy. |
| MAINTENANCE_STATE | VARCHAR | The current state of the application’s maintenance process. Possible values: COMPLETED, QUEUED, INITIALIZING, IN_PROGRESS. |
| MAINTAIN_AFTER | TIMESTAMP_LTZ | The earliest time when the system will begin maintenance on the application. Maintenance will not start before this timestamp. |

Ref: 2231

---
title: APPLICATION_STATE view : New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2071.md
section: Release Notes
---

# APPLICATION_STATE view : New columns in output

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the [APPLICATION_STATE view](../../../sql-reference/data-sharing-usage/application-state-view.md) in the [DATA_SHARING_USAGE schema](../../../sql-reference/data-sharing-usage.md) includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| UPGRADE_AFTER | TIMESTAMP | Timestamp when the application will be upgraded. |
| UPGRADE_IN_MAINTENANCE_WINDOW | BOOLEAN | TRUE if the application upgrade is scheduled to happen during the Snowpark Container Services maintenance window, otherwise FALSE. |
| UPCOMING_VERSION | VARCHAR | Information about the state of the upcoming version for the application in JSON format.  The `holds` field reflects application state. Other fields describe incoming version information.  A hold value of PENDING_REPLICATION indicates that this specific version and release directive has not yet been replicated.  For example:  ```json  {    "holds": [      "PENDING_REPLICATION"     ],     "upcomingPatchId": 2,     "upcomingReleaseDirectiveName": "CUSTOM_RD" ,     "upcomingVersion": "V1",     "upcomingVersionEffectiveOn": "Thu, 19 Jun 2025 21:52:37 Z" } ```  For more information on application upgrade see [Upgrade an app (Legacy)](../../../developer-guide/native-apps/update-app-upgrade.md). |

Ref: 2071

---
title: APPLICATION_STATE view: Add columns to provide additional information about app
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1716.md
section: Release Notes
---

# APPLICATION_STATE view: Add columns to provide additional information about app

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the output of the [APPLICATION_STATE view](../../../sql-reference/data-sharing-usage/application-state-view.md) includes
additional information about the health status of an app, upgrades, and event sharing.

This change adds the following new columns at the end of the output of the APPLICATION_STATE view:

| Column name | Data type | Description |
| --- | --- | --- |
| LAST_HEALTH_STATUS | VARCHAR | The last reported health status of the app. Possible values are:   * OK * FAILED * PAUSED |
| LAST_HEALTH_STATUS_UPDATED_ON | VARCHAR | The timestamp when the health status was last reported. |
| ENABLED_TELEMETRY_EVENT_DEFINITIONS | VARCHAR | A list of event definitions that the consumer has enabled. See [About event definitions](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#about-event-sharing) for more information. |
| UPGRADE_STATE_UPDATED_ON | TIMESTAMP_LTZ | The timestamp when the app entered its current upgrade state. This value is automatically set by Snowflake. Upgrade state is already present. |
| DISABLEMENT_REASONS | VARCHAR | An array containing the reasons why the Snowflake Native App was disabled. See Possible statuses for a disabled app for the list of reasons. |

The following table lists the possible values for the DISABLEMENT_REASONS column:

| Value | Status description | Is recoverable? |
| --- | --- | --- |
| MANUALLY_DISABLED | The app is disabled by Snowflake | Yes. To re-enable the app, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_INACTIVE | The account becomes inactive by being locked or suspended causing the app to be unavailable. In this state a consumer cannot execute any SQL queries in their account and the app cannot be upgraded. | Yes. The app is automatically re-enabled if the account lock or suspension is removed |
| PACKAGE_VERSION_IS_MISSING | The application package version for the app was dropped by the provider. | Possibly. This can be caused by a temporary platform outage, in which case the app may recover automatically. Otherwise, the provider can work with Snowflake Support to attempt version recovery. Contact the application provider for more details. |
| CMK_ACCESS_DENIED | The consumer manages the encryption key themselves (ENCRYPT_USE_CMK_KMS is enabled) and Snowflake doesn’t have access to this key. | Yes. To re-enable the app, ensure that the cloud provider configuration to retrieve the CMK is correct and that Snowflake has access to the key. |
| LISTING_ACCESS_REVOKED | The listing used to create the app is no longer available. Possible reasons for this status include:   * The provider deleted the listing * The provider manually removed access to the private listing from the consumer account | Possibly. Recoverability depends on the reason why access was revoked.  For example, if the listing was deleted it is not recoverable. If a consumer account was manually removed from the private listing, access to the listing and app can be restored. |
| LISTING_TRIAL_USAGE_EXCEEDED | The application has exceeded the usage limit for a usage-based trial listing. | No |
| LISTING_PAYMENT_REQUIRED | The listing used to install the app is a paid listing and requires payment for further usage. | Yes. The consumer must correctly set up payment for the app. |
| LISTING_TRIAL_TIME_EXCEEDED | The application exceeded the trial duration. | No |
| APPLICATION_PACKAGE_NOT_AVAILABLE | The application package used to create the app no longer exists. The provider may have dropped the corresponding application package. | No |
| APPLICATION_PACKAGE_DISABLED | The application package used to create the app is disabled by the Snowflake. | Yes. The app is re-enabled, if Snowflake re-enables the application package. |
| APPLICATION_SUSPENDED | The app resources for example, tasks, services, and compute pools, are suspended due to the app being disabled.  The suspended objects remain suspended until the app is re-enabled and there are no other reasons the app was disabled. | Yes |
| APPLICATION_SUSPEND_RESUME_IN_PROGRESS | The app resources, for example tasks, services, and compute pools, are currently resuming. | Yes |

Ref: 1716

---
title: APPLICATION_STATE view: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1876.md
section: Release Notes
---

# APPLICATION_STATE view: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the
[DATA_SHARING_USAGE.APPLICATION_STATE](../../../sql-reference/data-sharing-usage/application-state-view.md)
view will contain the following new columns:

| Column name | Description |
| --- | --- |
| CURRENT_RELEASE_CHANNEL_NAME | Displays the name of the current [release channel](../../../developer-guide/native-apps/release-channels.md) of the app. |
| LAST_UPGRADED_ON | Displays the date when the app was last upgraded. |

Ref: 1876

---
title: Apr 02, 2026: AI_COMPLETE document intelligence (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-ai-complete-document-intelligence-ga.md
section: Release Notes
---

# Apr 02, 2026: AI_COMPLETE document intelligence (*General availability*)

We are announcing General Availability of AI_COMPLETE document intelligence in Snowflake.

The multimodal AI_COMPLETE function now supports document inputs, including PDFs and Microsoft Word files, stored in
internal and external stages. This enables reasoning over text, charts, tables, and structured data within documents.

With this release, you can:

* Answer questions about charts and diagrams embedded in PDFs.
* Compare information across multiple documents in a single prompt.
* Generate summaries tailored to a specific audience or perspective.
* Extract entities and structured insights from reports, contracts, spreadsheets, and technical documentation.

This expands AI_COMPLETE beyond text and image inputs, bringing full document understanding into your data workflows.

To get started, see [AI_COMPLETE with documents](../../../user-guide/snowflake-cortex/ai-complete-document-intelligence.md).

---
title: Apr 02, 2026: AI_FUNCTIONS_USER database role (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-ai-functions-user-db-role.md
section: Release Notes
---

# Apr 02, 2026: AI_FUNCTIONS_USER database role (*General availability*)

Snowflake has added an AI_FUNCTIONS_USER database role in the SNOWFLAKE database to more granularly manage access
to Cortex AI functions. With this role, you can independently control access to AI functions without requiring
the CORTEX_USER database role. For example, an account administrator can disable CORTEX_USER for a role but still
allow that role to use AI functions by doing the following:

1. Revoke SNOWFLAKE.CORTEX_USER from the role.
2. Grant SNOWFLAKE.AI_FUNCTIONS_USER to the role.
3. Verify that the role also has the USE AI FUNCTIONS account-level privilege, which is granted to PUBLIC by default.

For more information about this role, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: Apr 02, 2026: AI_PARSE_DOCUMENT now available in AWS Europe West 2 (London)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-ai-parse-document-london-region.md
section: Release Notes
---

# Apr 02, 2026: AI_PARSE_DOCUMENT now available in AWS Europe West 2 (London)

AI_PARSE_DOCUMENT is now available in the AWS Europe West 2 (London) region, expanding regional support for customers
with in-region requirements for document processing workloads.

This addition enables customers in the London region to use AI_PARSE_DOCUMENT for text extraction, layout analysis, and
document parsing without requiring cross-region inference.

For the full list of supported regions and more information, see
[AI_PARSE_DOCUMENT regional availability](../../../user-guide/snowflake-cortex/parse-document.md).

---
title: Apr 04, 2025: Cortex AI Observability (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-04-cortex-ai-observability.md
section: Release Notes
---

# Apr 04, 2025: Cortex AI Observability (*Preview*)

Snowflake announces the preview of AI Observability in Snowflake Cortex. You can use it to evaluate and trace your generative AI applications. AI Observability provides tools to systematically measure application performance, debug execution traces, and optimize configurations for production deployments. Key features include:

* **Evaluations**: Assess application performance using metrics such as accuracy, latency, usage, and cost.
* **Comparison**: Compare multiple evaluations side-by-side to identify the best configurations.
* **Tracing**: Debug application executions by tracing inputs, outputs, and intermediate steps.

AI Observability supports task types such as Retrieval Augmented Generation (RAG) and Summarization. It provides metrics that you can use to evaluate context relevance, groundedness, and factual correctness.

For more information, see [AI Observability in Snowflake Cortex](../../../user-guide/snowflake-cortex/ai-observability.md).

---
title: Apr 04, 2025: Cortex COMPLETE Structured Outputs (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-04-cortex-complete-structured-outputs.md
section: Release Notes
---

# Apr 04, 2025: Cortex COMPLETE Structured Outputs (*General availability*)

Snowflake announces the general availability of support for structured outputs in the Snowflake Cortex
COMPLETE function, where completion results conform to a user-specified JSON schema. Defining the desired
outputs and their formats via a schema simplifies prompting, reduces the need for post-processing COMPLETE results in
your AI data pipelines, and enables seamless integration with systems that require deterministic responses. COMPLETE
Structured Outputs is available in SQL and via Python and REST APIs.

For more information, see [Cortex COMPLETE Structured Outputs](../../../user-guide/snowflake-cortex/complete-structured-outputs.md).

---
title: Apr 10, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-10-dcr.md
section: Release Notes
---

# Apr 10, 2025: Snowflake Data Clean Rooms updates

> **Note:**
>
> **Clean rooms UI users** must sign out and back in to the clean rooms UI for these updates to take effect.
>
> **Clean rooms API users** must run the following code for these updates to take effect:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL samooha_by_snowflake_local_db.library.apply_patch();
> ```
>
> **To enable auto-upgrades for API users,** run the following code:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL samooha_by_snowflake_local_db.library.enable_local_db_auto_upgrades();
> ```

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms:

**Column policies are optional for provider-run analysis:** This applies only to clean rooms API users.

**Google Ads activation connector is now available from the DCR.** Google Ads now requires customers to check two check
boxes affirming that the DCR collaborator activating the analysis output with Google Ads have obtained data subject consents for
advertising purposes and for ad personalization.

**Additional procedures supported by grant_run_on_cleanrooms_to_role.** Additional procedures are now supported by roles granted clean room privileges using [consumer.grant_run_on_cleanrooms_to_role](../../../user-guide/cleanrooms/consumer.md).

> **Important:**
>
> To support the additional procedures in `grant_run_on_cleanrooms_to_role`, the provider and consumer must run the following
> procedures in this order:
>
> 1. **Provider:**
>
>    ```sqlexample
>    CALL samooha_by_snowflake_local_db.provider.patch_cleanroom($cleanroom_name, TRUE);
>    ```
> 2. **Consumer:**
>
>    ```sqlexample
>    CALL samooha_by_snowflake_local_db.consumer.patch_cleanroom($cleanroom_name);
>    ```

**Clean rooms now support Snowpark Container Services.** You can run a Snowpark container within a clean room for large jobs in custom
Snowpark environments. [Learn more.](../../../user-guide/cleanrooms/demo-flows/snowpark.md)

---
title: Apr 10, 2026: Budgets for AI features (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-10-budgets-ai-features-ga.md
section: Release Notes
---

# Apr 10, 2026: Budgets for AI features (*General availability*)

You can now use custom budgets to track and control credit consumption for AI features,
including AI Functions, Cortex Code, Cortex Agents, and Snowflake Intelligence, broken down
by the team or cost center consuming them. Add an AI feature to a budget as a *shared resource*,
tag the users who belong to a business unit, and the budget tracks only the credits consumed by
those tagged users.

For more information, see [Using budgets for AI features (shared resources)](../../../user-guide/budgets/budget-shared-resources.md).

---
title: Apr 11, 2025: Snowsight replication configuration and monitoring (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-11-snowsight-replication-setup-and-monitoring.md
section: Release Notes
---

# Apr 11, 2025: Snowsight replication configuration and monitoring (*General availability*)

With this release, configuring and monitoring replication with Snowsight is generally available. You can perform
the following actions with Snowsight:

* [Create a replication or failover group](../../../user-guide/account-replication-config.md).
* [Modify a replication or failover group in a source account](../../../user-guide/account-replication-config.md).
* [Pause or resume a replication schedule in a target account](../../../user-guide/account-replication-config.md).
* [Drop a replication or failover group](../../../user-guide/account-replication-config.md).
* [Promote a target account to serve as the source account](../../../user-guide/account-replication-failover-failback.md).
* [Monitor replication, including the progress of refresh operations and replication history](../../../user-guide/account-replication-monitor.md).
* [Create a primary and secondary connection](../../../user-guide/client-redirect.md).
* [Modify target accounts for a connection](../../../user-guide/client-redirect.md).
* [Monitor Client Redirect connection details](../../../user-guide/client-redirect.md).
* [Drop a connection](../../../user-guide/client-redirect.md).

For more information about replication, see [Introduction to replication and failover across multiple accounts](../../../user-guide/account-replication-intro.md).

---
title: Apr 14, 2025: EMBED Function Added to Cortex REST API (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-cortex-offers-embed-rest-api.md
section: Release Notes
---

# Apr 14, 2025: EMBED Function Added to Cortex REST API (*General availability*)

Snowflake announces the availability of the EMBED function in the Cortex REST API. You can use it to create embeddings for text using the `/api/v2/cortex/inference:embed` endpoint. The EMBED function supports multiple models and regions, enabling a wide range of use cases, including:

* Text similarity and clustering.
* Semantic search and recommendation systems.
* Multilingual text analysis.

For more information, see [Vector embedding REST API](../../../user-guide/snowflake-cortex/cortex-rest-api/embed-api.md).

---
title: Apr 14, 2025: FILE data type to create tables for multimodal analysis (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-file-data-type.md
section: Release Notes
---

# Apr 14, 2025: FILE data type to create tables for multimodal analysis (*Preview*)

Snowflake announces the preview of FILE, a new unstructured data type that makes it easy to create tables as inputs to
unstructured data analytics. FILE lets you create references to files stored in internal or external stages, which you
can then store in tables for multimodal analytics using Cortex AI functions like COMPLETE. In addition to the new data
type, this release also includes a suite of utility functions to work with FILE objects. For more information, see
[Unstructured data types](../../../sql-reference/data-types-unstructured.md).

---
title: Apr 14, 2025: Mistral AI’s multimodal Pixtral Large now available for Snowflake Cortex AI (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-cortex-offers-pixtral-large.md
section: Release Notes
---

# Apr 14, 2025: Mistral AI’s multimodal Pixtral Large now available for Snowflake Cortex AI (*General availability*)

Snowflake announces the immediate available to all customers of Mistral AI’s Pixtral Large model for multimodal
analytics via Cortex AI COMPLETE. Mistral AI’s Pixtral Large combines advanced vision capabilities with language
understanding and is ideal for the following use cases:

* Visual reasoning tasks like image question answering and document and chart analysis.
* Multilingual tasks that may require English, French, German, Spanish, Italian, Chinese, Japanese, Korean, Portuguese, Dutch, or Polish.

---
title: Apr 14, 2025: PROMPT helper function (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-prompt-helper-function.md
section: Release Notes
---

# Apr 14, 2025: PROMPT helper function (*Preview*)

Snowflake announces the preview of PROMPT, a new helper function designed to streamline the creation and use of template strings for Snowflake AI functions, such as Cortex COMPLETE.
PROMPT facilitates dynamic message formatting and structured prompt creation.
Use it to templatize prompts that incorporate image files, so that image AI functions can process those prompts more efficiently.

For more information, see [PROMPT](../../../sql-reference/functions/prompt.md).

---
title: Apr 14, 2025: Snowflake Cortex AI COMPLETE multimodal support (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-cortex-complete-multimodal.md
section: Release Notes
---

# Apr 14, 2025: Snowflake Cortex AI COMPLETE multimodal support (*Preview*)

Snowflake announces the preview of Cortex AI COMPLETE multimodal support, enabling the processing and analysis of images alongside text using powerful
vision AI models, all through simple SQL. This feature allows image processing from Snowflake or external stages without complex integrations or external APIs.

Snowflake’s new multimodal capabilities include:

* Advanced vision models, including Claude 3.5 Sonnet and Pixtral Large
* Comprehensive visual analysis with image comparison, captioning, classification, entity extraction, and visual question answering
* Flexible image analysis using intuitive SQL syntax, for single or multiple images
* Support for common image formats (.jpg, .jpeg, .png, .webp, .gif)
* Efficient batch image processing integrated with existing data workflows through table-based approaches

For more information, see [AI_COMPLETE](../../../sql-reference/functions/ai_complete.md).

---
title: Apr 14, 2025: Support for Streamlit 1.42.0 (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-14-sis.md
section: Release Notes
---

# Apr 14, 2025: Support for Streamlit 1.42.0 (General availability)

Version 1.42.0 of the Streamlit open-source library is now supported in Streamlit in Snowflake.

---
title: Apr 14, 2026: Cortex Search Service replication (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-14-cortex-search-replication-ga.md
section: Release Notes
---

# Apr 14, 2026: Cortex Search Service replication (*General availability*)

Replication of Cortex Search Services is now generally available. You can replicate
Cortex Search Services from a source account to one or more target accounts in the
same organization using replication or failover groups.

For more information, see [Replicate a Cortex Search Service](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-replication.md).

---
title: Apr 14, 2026: Monitor Cortex Search requests (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-14-cortex-search-monitoring.md
section: Release Notes
---

# Apr 14, 2026: Monitor Cortex Search requests (*Preview*)

Cortex Search request monitoring is now available in public preview.

You can enable request logging on a Cortex Search Service to collect detailed
information about search requests for monitoring and debugging purposes. With
request logging enabled, you can review query patterns, response times, and
request details for a Cortex Search Service.

Request logs are stored in the `SNOWFLAKE.LOCAL.AI_OBSERVABILITY_EVENTS` event
table and are accessible using the `snowflake.local.get_ai_observability_events`
function or by querying the event table directly as ACCOUNTADMIN.

For more information, see [Monitor Cortex Search requests](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-monitor.md).

---
title: Apr 14, 2026: Snowflake storage for Apache Iceberg™ tables (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-14-iceberg-snowflake-storage.md
section: Release Notes
---

# Apr 14, 2026: Snowflake storage for Apache Iceberg™ tables (*Preview*)

With this preview release, you can create Apache Iceberg™ tables that use Snowflake storage.
This option lets Snowflake store and manage the Iceberg table files for you, so you don’t
need to set up access to external cloud storage.

Just like standard Snowflake tables, Iceberg tables with Snowflake storage support Fail-safe
data protection for permanent tables and can be transient to reduce storage costs. You can also
use an external query engine through the Snowflake Horizon Catalog to access these tables.

For more information, see [Snowflake storage for Apache Iceberg™ tables](../../../user-guide/tables-iceberg-internal-storage.md).

---
title: Apr 15, 2025: Search optimization improves the performance of queries containing scalar functions
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-15-search-optimization-scalar-functions.md
section: Release Notes
---

# Apr 15, 2025: Search optimization improves the performance of queries containing scalar functions

The search optimization service can now improve the performance of queries containing scalar functions.
A scalar function returns a single value for each invocation. The search optimization service can improve the
performance of queries that use scalar functions in equality predicates. The scalar function can be a
[system-defined scalar function](../../../sql-reference/functions.md) or a
[user-defined scalar SQL function](../../../developer-guide/udf/sql/udf-sql-introduction.md).

For more information, see [Speeding up queries with scalar functions using search optimization](../../../user-guide/search-optimization/scalar-functions.md).

---
title: Apr 15, 2025: Snowflake Cortex AI state-of-the-art Entity Sentiment (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-15-cortex-entity-sentiment-function.md
section: Release Notes
---

# Apr 15, 2025: Snowflake Cortex AI state-of-the-art Entity Sentiment (*Preview*)

Snowflake announces the preview of Cortex AI Entity Sentiment, a powerful new
task-specific function that provides high-quality overall and granular aspect-based sentiment analysis. It delivers
nuanced insights by analyzing the sentiment directed at specific entities, helping organizations understand what aspects
of their offerings customers love – or are dissatisfied with.

Entity Sentiment capabilities include:

* Comprehensive analysis delivering both overall and granular entity-specific sentiment.
* Customized sentiment analysis by defining the specific entities that matter most to your business.
* Advanced classification that identifies Positive, Negative, Neutral, and Mixed emotions, returning Unknown only when sentiment cannot be determined.
* Overall sentiment is detected in addition to granular entity-level sentiment.
* Human-like contextual understanding that interprets subtle signals, such as implicit complaints (“I had to contact support three times”) and
  figurative language, that typically confuse other AI systems.

Entity Sentiment is designed for enterprise workloads that demand highly accurate zero-shot sentiment detection to guide
decisions that will improve products and services. For more information, see [ENTITY_SENTIMENT (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/entity_sentiment-snowflake-cortex.md).

---
title: Apr 15, 2025: Snowflake Egress Cost Optimizer (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-15-eco-ga.md
section: Release Notes
---

# Apr 15, 2025: Snowflake Egress Cost Optimizer (*General availability*)

Snowflake would like to announce the general availability of Snowflake Egress Cost Optimizer for Cross-Cloud Auto-Fulfillment.
Egress Cost Optimizer minimizes egress costs when sharing data or apps to multiple regions, helping providers on Snowflake (of both public and private listings) reduce costs of sharing, cost of service, and as a result maximize their return on investment (ROI).

> **Note:**
>
> Snowflake Egress Cost Optimizer is being rolled out to all regions during the week of April 14th, 2025
> and may not be available in a specific region until the roll out is complete.

For more information see [Optimizing data transfer costs with Egress Cost Optimizer](../../../collaboration/provider-listings-auto-fulfillment-eco.md).

---
title: Apr 15, 2026: Openflow Connector for HubSpot (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-15-openflow-hubspot-pupr.md
section: Release Notes
---

# Apr 15, 2026: Openflow Connector for HubSpot (*Preview*)

The Openflow Connector for HubSpot is now available in preview. The connector ingests HubSpot
CRM data into Snowflake using the HubSpot API. It performs an initial
full load followed by incremental updates that merge new and changed
records into the destination table using timestamps from previous runs.

For more information, see
[About Openflow Connector for HubSpot](../../../user-guide/data-integration/openflow/connectors/hubspot/about.md).

---
title: Apr 15, 2026: Snowflake documentation for AI agents and LLMs
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-15-agent-friendly-docs.md
section: Release Notes
---

# Apr 15, 2026: Snowflake documentation for AI agents and LLMs

Snowflake documentation is now easier for AI coding assistants, agents, and large language models
(LLMs) to discover and consume.

**Hierarchical llms.txt**

[docs.snowflake.com/llms.txt](https://docs.snowflake.com/llms.txt) now follows a hierarchical
structure. Instead of one large file containing every page, the root file links to section-level
indexes (for example, [SQL Commands](/sql-reference/sql/llms.txt)). Agents and tools can
fetch only the sections they need, reducing token usage and improving relevance.

**Markdown versions of every page**

Every documentation page is also available in Markdown by appending `.md` to the URL. For
example:

* [CREATE TABLE](/sql-reference/sql/create-table) has a corresponding Markdown page
  [create-table.md](/sql-reference/sql/create-table.md).
* [Virtual warehouses](/user-guide/warehouses-overview) has a corresponding Markdown page
  [warehouses-overview.md](/user-guide/warehouses-overview.md).

Markdown is smaller than the equivalent HTML and strips away navigation, scripts, and other
elements that aren’t useful for LLMs. Tools such as Cortex Code, Cursor, and Claude Code can use these
URLs directly as context.

---
title: Apr 16, 2025: Document AI multi-language support
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-16-document-ai.md
section: Release Notes
---

# Apr 16, 2025: Document AI multi-language support

The release of a new version of the foundational Arctic-TILT model in Document AI includes
improvements in the following areas:

* Multiple languages support: You can now upload documents and ask questions in Spanish, French, German, Portuguese, Italian, and Polish.
* Language-specific diacritics: The model can now read and extract language-specific diacritics, such as Ñ.
* Overall model quality.

These improvements are available to new Document AI model builds.

---
title: Apr 16, 2025: Snowflake ML Jobs (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-16-snowflake-ml-jobs.md
section: Release Notes
---

# Apr 16, 2025: Snowflake ML Jobs (*Preview*)

Snowflake announces the preview of Snowflake ML Jobs, a new capability that allows you to run machine learning (ML) workflows from your local environment.

Snowflake ML Jobs enable you to:

* Run ML workloads on Snowflake Compute Pools, leveraging GPU and high-memory CPU instances.
* Use your preferred development environment, such as VS Code or Jupyter notebooks, without requiring Snowflake worksheets or notebooks.
* Install and use custom Python packages within your runtime environment.
* Optimize data loading, training, and hyperparameter tuning with Snowflake’s distributed APIs.
* Integrate with orchestration tools, such as Apache Airflow.
* Monitor and manage jobs programmatically using Snowflake’s APIs.

Key benefits of Snowflake ML Jobs include:

* **Scalability**: Execute large-scale ML training on datasets requiring significant compute resources or GPU acceleration.
* **Flexibility**: Retain your existing development environment while leveraging Snowflake’s compute resources.
* **Efficiency**: Work directly with large Snowflake datasets to reduce data movement and avoid expensive data transfers.
* **Productionization**: Move ML code from development to production with minimal changes, enabling programmatic execution through pipelines.
* **Compatibility**: Lift and shift open-source ML workflows with minimal code modifications.

To get started with Snowflake ML Jobs, see [Snowflake ML Jobs](../../../developer-guide/snowflake-ml/ml-jobs/overview.md).

> **Important:**
>
> Snowflake ML Jobs are available in Snowpark ML Python package (`snowflake-ml-python`) version 1.8.2 and later.

---
title: Apr 16, 2026: Consumer-controlled maintenance policies: Provider support (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-16-native-apps-consumer-maintenance-policies-provider.md
section: Release Notes
---

# Apr 16, 2026: Consumer-controlled maintenance policies: Provider support (*Preview*)

Provider-side support for consumer-controlled maintenance policies is now in public preview for
Snowflake Native Apps.

Providers can now configure release directives to respect consumer maintenance policies by setting
the UPGRADE_IN_MAINTENANCE_WINDOW parameter. Providers can also align Snowpark Container Services compute pool node
maintenance with the consumer’s maintenance window by setting the AUTOMATIC_APPLICATION_MAINTENANCE
property on the application package.

For more information, see [Consumer-controlled maintenance policies: Provider guide](../../../developer-guide/native-apps/consumer-maintenance-policies-provider.md).

---
title: Apr 16, 2026: Primary key support in dynamic tables (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-16-dynamic-table-primary-keys.md
section: Release Notes
---

# Apr 16, 2026: Primary key support in dynamic tables (*General availability*)

Snowflake can now use primary keys in dynamic tables to track row-level changes and enable incremental refresh downstream of full-refresh
dynamic tables. This release includes the following capabilities:

* **Base table-defined primary keys**: When a base table has a primary key with the `RELY` property, Snowflake uses it for change tracking
  in downstream dynamic tables. This is especially useful when the base table is periodically rewritten through INSERT OVERWRITE, which
  normally prevents change tracking across table versions.
* **Query-derived primary keys**: Snowflake automatically derives primary keys from the query definition of a dynamic table. Queries with
  GROUP BY clauses or QUALIFY ROW_NUMBER() = 1 filters produce unique constraints that Snowflake relies on for change tracking.
* **Incremental refresh on full-refresh dynamic tables**: Dynamic tables in incremental refresh mode can now read from upstream dynamic
  tables that use full refresh mode, as long as the upstream table has a system-derived primary key. To use this capability, set
  `REFRESH_MODE = INCREMENTAL` explicitly on the downstream dynamic table.

To check whether a dynamic table has a derived primary key, run `SHOW UNIQUE KEYS IN <dt_name>`.

For more information, see [Understanding primary keys in dynamic tables](../../../user-guide/dynamic-tables-primary-keys.md). To try this feature with a
step-by-step example, see [Tutorial: Use primary keys to optimize dynamic table pipelines](../../../user-guide/tutorials/dynamic-table-primary-keys.md).

---
title: Apr 16, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-16-dcr.md
section: Release Notes
---

# Apr 16, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 14.4

The following updates are now available in Snowflake Data Clean Rooms:

* General performance improvements and bug fixes.
* Updates to private preview features.

---
title: Apr 17, 2025: Semantic views (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-17-semantic-views.md
section: Release Notes
---

# Apr 17, 2025: Semantic views (*Preview*)

With this preview release, you can store
[semantic models](../../../user-guide/views-semantic/sql.md) (for use by
[Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md)) as Snowflake objects in a database schema. These Snowflake
objects are *semantic views* and are schema-level objects that correspond to semantic models.

To create and manage semantic views, you can use SQL commands (such as CREATE SEMANTIC VIEW) and the Cortex Analyst Semantic View Generator, which is a wizard in Snowsight that guides you through the process of creating a semantic view.

For more information, see [Overview of semantic views](../../../user-guide/views-semantic/overview.md).

---
title: Apr 17, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-17-dcr.md
section: Release Notes
---

# Apr 17, 2025: Snowflake Data Clean Rooms updates

> **Note:**
>
> **Clean rooms UI users** must sign out and back in to the clean rooms UI for these updates to take effect.
>
> **Clean rooms API users** must run the following code for these updates to take effect:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL samooha_by_snowflake_local_db.library.apply_patch();
> ```
>
> **To enable auto-upgrades for API users,** run the following code:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL samooha_by_snowflake_local_db.library.enable_local_db_auto_upgrades();
> ```

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms:

**Segmentation and attribute columns are now optional in the Audience Overlap template:** Segmentation and attribute columns in the
Audience Overlap template are no longer a required feature. This means that a clean room creator (provider) can use the Audience Overlap
Analysis template for a simple overlap analysis if segmentation and activation are not needed.

---
title: Apr 17, 2026: Performance Explorer tabs, filter presets, CSV export, and side-panel search
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-17-performance-explorer-usability.md
section: Release Notes
---

# Apr 17, 2026: Performance Explorer tabs, filter presets, CSV export, and side-panel search

This release updates the Performance Explorer experience in Snowsight:

* A tabbed layout with Queries, Warehouses, and Tables pages so you can easily navigate
  between metric areas.
* Filter presets to save your current filter settings (time period, warehouse, database, and role filters)
  and set a default preset for future visits.
* Export the data in the detail tables in the side panels as a CSV file.
* Search the side panel detail tables for keywords of interest (for example, a user name or a substring in
  query text).

For more information, see [Analyzing query workloads with Performance Explorer](../../../user-guide/performance-explorer.md).

---
title: Apr 18, 2025: Support for st.query_params (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-18-sis.md
section: Release Notes
---

# Apr 18, 2025: Support for `st.query_params` (General availability)

[st.query_params](https://docs.streamlit.io/develop/api-reference/caching-and-state/st.query_params)
is now generally available in Streamlit in Snowflake, with some considerations. For more information, see [Query parameters](../../../developer-guide/streamlit/limitations.md).

---
title: Apr 2, 2026: Copy tags when running a CREATE OR REPLACE TABLE command (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-create-table-copy-tags-ga.md
section: Release Notes
---

# Apr 2, 2026: Copy tags when running a CREATE OR REPLACE TABLE command (*General availability*)

With this release, the COPY TAGS parameter for the CREATE OR REPLACE TABLE command is generally available.

* When you use CREATE OR REPLACE TABLE … COPY TAGS without LIKE, CLONE, or a WITH TAG clause, tags from the replaced table
  and its columns are retained on the new table.
* When you use COPY TAGS with CREATE OR REPLACE TABLE … LIKE, CREATE OR REPLACE TABLE … CLONE, or together with
  a WITH TAG clause, Snowflake combines tags from the applicable sources. If both sources set the same tag, the value
  from the replaced table takes precedence.

For more information, see the COPY TAGS parameter and usage notes in [CREATE TABLE](../../../sql-reference/sql/create-table.md).

---
title: Apr 2, 2026: Performance Explorer granular access aligned with your privileges
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-performance-explorer-granular-access.md
section: Release Notes
---

# Apr 2, 2026: Performance Explorer granular access aligned with your privileges

Performance Explorer now applies granular access control that aligns visibility with your privileges on
warehouses, databases, and [Snowflake database roles](../../../sql-reference/snowflake-db-roles.md) in the
shared `SNOWFLAKE` database. Snowflake grants the `SNOWFLAKE.PERFORMANCE_EXPLORER_PUBLIC_USER`
application role to the `PUBLIC` role so that more users can open Performance Explorer; charts and tables
show account activity that your roles are allowed to see, and some sections require elevated privileges
(such as `GOVERNANCE_VIEWER` for table-level metrics).

Users who have full account visibility today keep it if **any** role granted to them is
[ACCOUNTADMIN](../../../user-guide/security-access-control-overview.md), has `IMPORTED PRIVILEGES` on the
`SNOWFLAKE` database, or has the `SNOWFLAKE.PERFORMANCE_EXPLORER_USER` application role.

Privilege changes can take a few hours to appear in Performance Explorer.

For more information, see [Analyzing query workloads with Performance Explorer](../../../user-guide/performance-explorer.md) and
[Required privileges for Performance Explorer](../../../user-guide/performance-explorer.md).

---
title: Apr 2, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-02-dcr.md
section: Release Notes
---

# Apr 2, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 14.0

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Case-Insensitive Identifiers:** Data offering IDs, Template IDs, and Collaborator Aliases are now case-insensitive. You no longer need to match the exact casing used during registration when referencing a data offering or template by ID. Original casing is preserved in all displays.
* **Cross-Registry Resource Discovery:** The [VIEW_REGISTERED_DATA_OFFERINGS](/user-guide/cleanrooms/v2/v2-api-reference), [VIEW_REGISTERED_TEMPLATES](/user-guide/cleanrooms/v2/v2-api-reference), and [VIEW_REGISTERED_CODE_SPECS](/user-guide/cleanrooms/v2/custom-functions) procedures now include a `REGISTRY` field in their responses and returns resources across all registries by default.
* Updates to private preview features.

---
title: Apr 22, 2025: Trust Center email notifications (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-22-trust-center-email-notifications.md
section: Release Notes
---

# Apr 22, 2025: Trust Center email notifications (*Preview*)

With this preview release, you can configure the Trust Center to send email notifications when it finds violations.
You can specify that the Trust Center sends notifications for all of the enabled scanners in a scanner package or
for individual scanners. You can also specify the severity of the violations for which email notifications are sent.

For more information, see [Sending email notifications about Trust Center findings](../../../user-guide/trust-center/notifications-trust-center.md).

---
title: Apr 24, 2025: Container Runtime for ML on multi-node clusters (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-24-container-runtime-multi-node.md
section: Release Notes
---

# Apr 24, 2025: Container Runtime for ML on multi-node clusters (*Preview*)

Snowflake announces the preview of Container Runtime for ML on multi-node clusters, a new capability that allows you to scale your ML workloads across multiple compute nodes in Snowflake Notebooks.

Container Runtime for ML on multi-node clusters enables you to:

* **Scale ML workloads**: Dynamically adjust the number of nodes in your compute pool to match the resource needs of your ML tasks.
* **Run distributed training**: Train ML models on larger datasets using distributed frameworks like PyTorch, LightGBM, and XGBoost.
* **Manage cluster resources**: Easily scale up for resource-intensive tasks and scale down when fewer resources are needed.
* **Control scaling operations**: Configure asynchronous scaling, timeout thresholds, and minimum node requirements to match your workflow needs.

Key benefits of Container Runtime for ML on multi-node clusters include:

* **Improved performance**: Process larger datasets and accelerate training of complex models through parallelization.
* **Resource efficiency**: Scale resources up or down based on workload requirements without provisioning new compute pools.
* **Flexibility**: Support for synchronous or asynchronous scaling operations to match your development workflow.
* **Simplicity**: Straightforward APIs for scaling clusters and monitoring active nodes with minimal configuration.

To get started with Container Runtime for ML on multi-node clusters, see [Container Runtime on multi-node clusters](../../../developer-guide/snowflake-ml/container-runtime-multi-node.md).

---
title: Apr 24, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-24-dcr.md
section: Release Notes
---

# Apr 24, 2025: Snowflake Data Clean Rooms updates

> **Note:**
>
> **Clean rooms UI users** must sign out and back in to the clean rooms UI for these updates to take effect.
>
> **Clean rooms API users** must run the following SWL commands for these updates to take effect:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.apply_patch();
> ```
>
> **To enable auto-upgrades for API users,** run the following SQL commands:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
> ```

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

**Consumer template requests for cross-cloud auto-fulfillment:** Collaborators in different regions can now use
[consumer-defined templates](../../../user-guide/cleanrooms/demo-flows/custom-templates.md). Previously, consumer-defined templates were
supported only for consumers in the same cloud region as the provider.

**Provider-run warehouse selection using the clean rooms API:** Providers can now specify the warehouse size used in a
[provider-run analysis](../../../user-guide/cleanrooms/demo-flows/provider-run-analysis.md) using the clean rooms API. Until now, clean rooms used automatic scaling logic based on dataset sizes to determine the warehouse
size.

---
title: Apr 28, 2025: Boost Cortex Search results based on metadata signals (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-28-boost-decay.md
section: Release Notes
---

# Apr 28, 2025: Boost Cortex Search results based on metadata signals (*General availability*)

With this release, Cortex Search can now boost results based on metadata fields, such as the number of likes or comments
on a document or its recency based on a timestamp. When making a query, you can specify the metadata fields you want to
boost on and the weight you want to assign to each field. Recency signals decay over time.

For more information, see [Numeric boosts and time decays](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

---
title: Apr 28, 2025: Disable reranker in Cortex Search queries (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-28-reranking.md
section: Release Notes
---

# Apr 28, 2025: Disable reranker in Cortex Search queries (*General availability*)

With this release, you can now disable reranking in any Cortex Search query. The Cortex Search reranker aims to elevate
results with higher relevance to the query. However, the reranking step can noticeably increase query latency. Disabling
reranking can improve search performance without penalty if you’ve found that reranking does not improve search quality for
your use case.

For more information, see [Reranking](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

---
title: Apr 28, 2025: Role-Based Access Control for Cortex LLM Models
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-28-cortex-llm-model-rbac.md
section: Release Notes
---

# Apr 28, 2025: Role-Based Access Control for Cortex LLM Models

Snowflake announces the general availability of Role-Based Access Control (RBAC) for Cortex LLM Models, a new feature that provides fine-grained access control for managing model access in Snowflake Cortex.

RBAC for Cortex LLM Models enables you to:

* **Control model access**: Use application roles to grant or revoke access to specific models.
* **Combine access methods**: Leverage both model allowlists and RBAC for a mix of broad and fine-grained access control.
* **Enhance security**: Ensure that only authorized users can access sensitive models.

Key benefits of RBAC for Cortex LLM Models include:

* **Granular access control**: Manage access to individual models using application roles.
* **Flexibility**: Combine allowlists and RBAC to meet your organization’s access control requirements.
* **Ease of use**: Straightforward commands to configure and manage model access.

To get started with RBAC for Cortex LLM Models, see [Role-based access control (RBAC)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: Apr 3, 2026: Medical and health data classifiers in sensitive data classification (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-03-sensitive-data-classification-medical-health-ga.md
section: Release Notes
---

# Apr 3, 2026: Medical and health data classifiers in sensitive data classification (*General availability*)

With this release, native semantic classifiers for medical and health-related data in [sensitive data classification](../../../user-guide/classify-intro.md) are generally available.
Snowflake adds the `MEDICAL_DATA` and `MEDICAL_SPECIALTY` semantic categories and related subcategories so that personal health information aligned with the Health Insurance Portability and Accountability Act (HIPAA) can be detected and tagged consistently.

The new classifiers cover, for example:

* ICD (International Classification of Diseases) codes
* Laboratory and blood-test terminology
* Medical conditions and impairments
* Medical procedures
* Medication names and types (including brand and generic)

For a full list of native semantic categories and subcategories, see [Native semantic categories of sensitive data classification](../../../user-guide/classify-native.md).

---
title: Apr 30, 2025: Programmatic access tokens
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-30-programmatic-access-tokens.md
section: Release Notes
---

# Apr 30, 2025: Programmatic access tokens

You can now generate and use programmatic access tokens to authenticate to the following Snowflake endpoints:

* [Snowflake REST APIs](../../../developer-guide/snowflake-rest-api/snowflake-rest-api.md).
* [The Snowflake SQL API](../../../developer-guide/sql-api/index.md).
* [The Snowflake Catalog SDK](../../../user-guide/tables-iceberg-catalog.md).

> **Note:**
>
> Using programmatic access tokens to authenticate to
> [Snowpark Container Services](../../../developer-guide/snowpark-container-services/working-with-services.md) endpoints is not yet
> supported.

You can also use a programmatic access token as a replacement for a password in:

* [Snowflake drivers](../../../developer-guide/drivers.md).
* [Third-party applications that connect to Snowflake](../../../user-guide/ecosystem.md) (such as Tableau and PowerBI).
* Snowflake APIs and libraries (such as the [Snowpark API](../../../developer-guide/snowpark/index.md) and the
  [Snowflake Python API](../../../developer-guide/snowflake-python-api/snowflake-python-overview.md).
* Snowflake command-line clients (such as the [Snowflake CLI](../../../developer-guide/snowflake-cli/index.md) and
  [SnowSQL](../../../user-guide/snowsql.md).

You can generate programmatic access tokens for human users (users with TYPE=PERSON) as well as for service users (users with
TYPE=SERVICE).

For more information, see [Using programmatic access tokens for authentication](../../../user-guide/programmatic-access-tokens.md).

---
title: Apr 6, 2026: AI_SERVICES billing breakout for implemented AI Credits services
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-06-ai-services-billing-breakout.md
section: Release Notes
---

# Apr 6, 2026: AI_SERVICES billing breakout for implemented AI Credits services

Snowflake introduces **more granular billing service types** for a subset of services currently
billed under AI_SERVICES as part of the transition to **AI Credits**. These changes will impact
both the [METERING_HISTORY](../../../sql-reference/account-usage/metering_history.md) and
[METERING_DAILY_HISTORY](../../../sql-reference/account-usage/metering_daily_history.md) views.
This change **improves customer clarity** and supports more flexible pricing and packaging over time.

The following services are now being broken out of AI_SERVICES as separate service types:

| Feature | Previous SERVICE_TYPE | Future SERVICE_TYPE |
| --- | --- | --- |
| Cortex Agents | AI_SERVICES | CORTEX_AGENTS |
| Cortex Code CLI | AI_SERVICES | CORTEX_CODE_CLI |
| Cortex Code UI | AI_SERVICES | CORTEX_CODE_SNOWSIGHT |
| Snowflake Intelligence | AI_SERVICES | SNOWFLAKE_INTELLIGENCE |

**AI Functions**, **Search Serving**, **Batch Search Serving**, **Cortex Analyst**, **Cortex Fine Tuning**, and
**Provisioned Throughput** remain in AI_SERVICES.

---
title: Apr 6, 2026: Apache Iceberg™ tables: Write support for Databricks Unity Catalog on Azure (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-06-iceberg-write-support-azure-unity-catalog.md
section: Release Notes
---

# Apr 6, 2026: Apache Iceberg™ tables: Write support for Databricks Unity Catalog on Azure (*General availability*)

Previously, write support for externally managed Apache Iceberg™ tables managed by Databricks Unity Catalog was limited to
workspaces where the underlying storage was on AWS. With this release, you can also write to externally managed
Iceberg tables managed by Unity Catalog when the underlying storage is on Azure.

This is made possible by the support for Azure Data Lake Storage Gen2 with external volumes. To write to Unity Catalog
tables on Azure, configure an external volume that connects to Data Lake Storage Gen2, then configure a catalog integration
for Unity Catalog.

For more information, see the following topics:

* [Configure an external volume for Azure](../../../user-guide/tables-iceberg-configure-external-volume-azure.md)
* [Configure a catalog integration for Unity Catalog](../../../user-guide/tables-iceberg-configure-catalog-integration-rest-unity.md)
* [Write support for externally managed Apache Iceberg™ tables](../../../user-guide/tables-iceberg-externally-managed-writes.md)

---
title: Apr 7, 2025: Google Cloud Private Service Connect in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-04-07-sis.md
section: Release Notes
---

# Apr 7, 2025: Google Cloud Private Service Connect in Streamlit in Snowflake (Preview)

Google Cloud Private Service Connect is now supported in Streamlit in Snowflake.

For more information, see [Private connectivity for Streamlit in Snowflake](../../../developer-guide/streamlit/object-management/privatelink.md).

---
title: Apr 7, 2026: Workspaces replication (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-07-workspace-replication-ga.md
section: Release Notes
---

# Apr 7, 2026: Workspaces replication (*General availability*)

Workspaces replication, which allows user workspaces to be included in database replication and failover operations, is now generally available.
When a workspace or its owning user is part of a replication or failover group, the workspace is copied to secondary accounts to support business
continuity and disaster recovery.

Replicated workspaces in secondary accounts are read-only. Files can be executed but not modified. When a secondary failover group is promoted
to primary, all contained workspaces become writable.

> **Note:**
>
> Workspaces replication and failover require Business Critical Edition or higher.

For more information, see [Workspaces replication](../../../user-guide/ui-snowsight/workspaces-replication.md).

---
title: Apr 8, 2026: Error logging for Snowpipe Streaming (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-08-snowpipe-streaming-error-tables.md
section: Release Notes
---

# Apr 8, 2026: Error logging for Snowpipe Streaming (*General availability*)

With this release, error logging for Snowpipe Streaming with high-performance architecture is now generally
available. When error logging is turned on for a target table, rows that fail server-side processing are
automatically captured in a dedicated error table instead of being silently dropped. This feature includes
the following capabilities:

* Row-level error capture with full original payloads and detailed error metadata.
* Filtering by Snowpipe Streaming errors using the `error_metadata:service` field.
* Querying, analyzing, and reprocessing failed rows using standard SQL.

Turning on error logging doesn’t change your Snowpipe Streaming ingestion costs. Snowflake charges for
data stored in the error table at the standard storage rate.

For more information, see [Error logging in Snowpipe Streaming with high-performance architecture](../../../user-guide/snowpipe-streaming/snowpipe-streaming-error-tables.md).

---
title: Apr 9, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-09-dcr.md
section: Release Notes
---

# Apr 9, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 14.3

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Collaboration API - Generally Available:** The new [Collaboration API](../../../user-guide/cleanrooms/overview.md) is now generally available. The new architecture supports fully symmetric, multi-party collaboration with flexible roles and fine-grained data access controls for any number of participants. Read the [overview](../../../user-guide/cleanrooms/overview.md) and [try out building a new collaboration yourself](../../../user-guide/cleanrooms/tutorials/collaboration-basic-api-tutorial.md).
* **Collaboration Configuration APIs:** Two new stored procedures are now available for managing collaboration configuration flags:

  + [GET_CONFIGURATION](../../../user-guide/cleanrooms/collaboration-api-reference.md) returns a table of configuration key-value pairs for the collaboration. Currently exposes the TEMPLATE_AUTO_APPROVAL setting per collaborator.
  + [SET_CONFIGURATION](../../../user-guide/cleanrooms/collaboration-api-reference.md) sets a configuration value for the collaboration.
* **Case-Insensitive Code Spec IDs:** Code spec IDs are now case-insensitive, extending the case-insensitive identifier support [introduced in the previous release](/release-notes/2026/other/2026-04-02-dcr). You no longer need to match the exact casing used during registration when referencing a code spec by ID.
* **Improved join and review reliability:** Join and review operations now automatically retry when a collaboration listing isn’t immediately available, reducing transient failures.
* Updates to private preview features.

---
title: April 01-03, 2024 — 8.13 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_13.md
section: Release Notes
---

# April 01-03, 2024 — 8.13 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Snowflake Cortex updates

### Evaluation Metrics for Forecasting and Anomaly Detection

With this release, we are pleased to introduce training-time evaluation metrics for Anomaly Detection, a member of the
[Snowflake Cortex ML Functions](../../guides-overview-ml-functions.md) suite, similar to those supported for the
Forecasting and Classification ML Functions. The Forecasting and Anomaly Detection functions can now also calculate
evaluation metrics on new data, so that you can easily compare forecasts to actual observations. These tools allow you
to easily determine how well the model predicts real data. For more information, see:

* Anomaly Detection: [Show Evaluation Metrics](../../sql-reference/classes/anomaly-detection/methods/show_evaluation_metrics.md)
* Forecasting: [Show Evaluation Metrics](../../sql-reference/classes/forecast/methods/show_evaluation_metrics.md)

## SQL updates

### Fixed an issue with the PARSE_IP function

Previously, the PARSE_IP function would parse the following types of invalid IP addresses:

* IPv4 addresses with less than 4 parts
* IPv6 addresses with more than 4 hex digits in a single part

For example, the PARSE_IP function returned results for the following queries:

> ```sqlexample
> SELECT PARSE_IP('1.1.1', 'inet');
> SELECT PARSE_IP('1::abcde', 'inet');
> ```

This issue has been fixed, and the PARSE_IP function now returns an error for these types of invalid IP addresses.

### Fixed an issue with the SPLIT_PART function

Previously, the SPLIT_PART function ignored trailing spaces when all of the following conditions were met:

* All inputs were constants.
* The string to split ended with the delimiter.
* The `partNumber` value was negative.

For example, the SPLIT_PART function returned the following results:

> > ```sqlexample
> > SELECT SPLIT_PART('/a/b/c/', '/', -1);
> >
> > +--------------------------------+
> > | SPLIT_PART('/A/B/C/', '/', -1) |
> > |--------------------------------|
> > | c                              |
> > +--------------------------------+
> >
> > SELECT SPLIT_PART('/a/b/c/', '/', -2);
> >
> > +--------------------------------+
> > | SPLIT_PART('/A/B/C/', '/', -2) |
> > |--------------------------------|
> > | b                              |
> > +--------------------------------+
> > ```
>
> This issue has been fixed, and the SPLIT_PART function now returns the correct results under these conditions. For example, the SPLIT_PART function now returns the following results:
>
> > ```sqlexample
> > SELECT SPLIT_PART('/a/b/c/', '/', -1);
> >
> > +--------------------------------+
> > | SPLIT_PART('/A/B/C/', '/', -1) |
> > |--------------------------------|
> > |                               |
> > +--------------------------------+
> >
> > SELECT SPLIT_PART('/a/b/c/', '/', -2);
> >
> > +--------------------------------+
> > | SPLIT_PART('/A/B/C/', '/', -2) |
> > |--------------------------------|
> > | c                              |
> > +--------------------------------+
> > ```

## Extensibility updates

### Access to Git repositories from Snowflake — *Preview*

With this release, we are pleased to announce the public preview of access to Git repositories from within Snowflake. After you configure Snowflake to act as a client of your Git repository, you can fetch a full clone of your remote repository to a Snowflake Git repository clone, which represents a local repository. You can reference these fetched files in procedure and function handler code, execute SQL and Python code in Snowflake, copy file contents into Snowflake worksheets, and more.

For more information, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 01-Apr-24 |

---
title: April 01-03, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-04-01.md
section: Release Notes
---

# April 01-03, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced
in this update to Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## No limit on rows returned in worksheet results –— *Preview*

With this release, we are pleased to announce the preview of no limit on rows returned in worksheet results in Snowsight.

Before this preview, if you ran a query in a worksheet in Snowsight, the results were limited to 10,000 rows of results or 16MB.
With this release, there is no longer a limit on the rows displayed in worksheet results.

For more details, see [Exploring the worksheet results](../../../user-guide/ui-snowsight-query.md).

---
title: April 08-15, 2024 — 8.14 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_14.md
section: Release Notes
---

# April 08-15, 2024 — 8.14 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New regions

Effective immediately, Snowflake accounts can be provisioned in the following new region(s):

| Cloud Platform | Region |
| --- | --- |
| Amazon Web Services (AWS) | EU (Zurich) |

The new region(s) support all [Snowflake editions](../../user-guide/intro-editions.md). You can provision initial accounts in the region through
[self-service](https://signup.snowflake.com/) or a Snowflake representative.

## Extensibility updates

### Python UDTFs with vectorized process methods — *General Availability*

With this release, we are pleased to announce the general availability of Python UDTFs (user-defined table functions) with vectorized
process methods. This new feature provides a convenient way to operate over rows in batches when the method returns one output row for
each input row.

For more information, see [Vectorized Python UDTFs](../../developer-guide/udf/python/udf-python-tabular-vectorized.md).

## Snowflake Cortex updates

### Forecasting improvements in Snowflake Cortex ML Functions

With this release, we are pleased to announce that we have improved the algorithm behind
[Time-Series Forecasting](../../user-guide/ml-functions/forecasting.md), part of the
Snowflake Cortex ML Functions suite of analysis tools. The underlying approach remains the same; rather, we have changed
some parameters that, for most users, should:

* Reduce the incidence of trend extrapolation errors (incorrect trends)
* Improve accuracy on “noisy” time series

These improvements apply only to forecasting models trained with Snowflake 8.14 or later. Forecasting models trained
with previous versions continue to use the earlier model parameters.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 08-Apr-24 |
| *Forecasting improvements in Snowflake Cortex ML Functions* | **Added** to *Snowflake Cortex updates* | 24-Apr-24 |

---
title: April 09, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-09-dcr.md
section: Release Notes
---

# April 09, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, Snowflake is pleased to announce the following improvements to Snowflake Data Clean Rooms:

* **Audience Overlap & Segmentation template**: The Audience Overlap & Segmentation template allows analysts to create segmentation groups
  for audience overlap results. Note that this template replaces the Audience Overlap template; clean rooms that used the Audience Overlap
  template need to be re-created with the new template.

  For a tutorial that uses the new template, see [Get started with the web app of a Snowflake Data Clean Room](https://other-docs.snowflake.com/en/cleanrooms/tutorials/cleanroom-web-app-tutorial).
* **Provider Activation with Enrichment**: When consumers activate matched IDs back to the provider, they can now enrich the results with
  data from additional columns of their table or the provider’s table. The provider controls which of their columns can be used when
  configuring the analyses template for the clean room, and the consumer controls which of their columns can be used when installing the
  clean room.

---
title: April 11, 2024 — Budgets Release Notes — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-11-budgets.md
section: Release Notes
---

# April 11, 2024 — Budgets Release Notes — *General Availability*

With this release, we are pleased to announce the general availability of Budgets which enables account-level monitoring and
notification of Snowflake credit usage for a group of specific Snowflake objects. You can define a monthly spending limit on the
compute costs for [supported objects](../../../user-guide/budgets/custom-budget.md) in your account. In addition to your account budget,
you can create custom budgets to monitor credit usage for a specified group of objects. Budgets sends you a notification when
your credit usage is on track to exceed your monthly limit.

For more information, see [Monitor credit usage with budgets](../../../user-guide/budgets.md).

---
title: April 11-25, 2024 — Snowflake Copilot — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-11-snowflake-copilot-in-snowsight.md
section: Release Notes
---

# April 11-25, 2024 — Snowflake Copilot — *Preview*

We are pleased to announce the preview of Snowflake Copilot.

Snowflake Copilot is an LLM-powered assistant that simplifies data analysis while maintaining robust data governance and seamlessly
integrates into your existing Snowflake workflow. You can ask open-ended questions about your data structure, send follow-up
inquiries, or even use it to refine and improve your own SQL queries.

> **Note:**
>
> With this preview release, Snowflake Copilot will be made available to accounts in the following regions:
>
> * AWS us-east-1
> * AWS us-west-2

For more details, see [Using Snowflake Copilot](../../../user-guide/snowflake-copilot.md).

---
title: April 12, 2024 — Cost Management Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-12-cost.md
section: Release Notes
---

# April 12, 2024 — Cost Management Release Notes

## Account Overview Page — *General Availability*

With this release, we are pleased to announce the general availability of the Account Overview page in Snowsight that
allows you to gain high-level insights into the cost of using Snowflake. It improves visibility into incurred costs and provides information
that can be a starting off point for reporting and optimizing your spend. For example, you can view your total spend for a time period in
dollars and credits, and discover what is contributing to your costs, such as top warehouses by spend and most expensive queries.

For more details about using the Account Overview page, see [Overview of account-level costs](../../../user-guide/cost-exploring-overall.md).

---
title: April 12, 2024 — Snowflake Cortex LLM Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-12-snowflake-cortex-llm-update.md
section: Release Notes
---

# April 12, 2024 — Snowflake Cortex LLM Release Notes

We’re pleased to announce three changes that will make it easier for you to build generative AI workflows using
[Cortex LLM Functions](../../../user-guide/snowflake-cortex/aisql.md).

* The `reka-flash` model is now available for text completion tasks. `reka-flash` is a high-quality model
  optimized for fast processing. The model is now available is AWS US East (N. Virginia) and AWS US West (Oregon).
  For more information about this model, see [reka.ai](https://www.reka.ai/).
* The `mixtral-8x7b` model is now 56% more cost-efficient, with the credits per million tokens cut to 0.22 from 0.50.
* The `mixtral-8x7b` model is now available for use in AWS Europe (Frankfurt) and Azure West Europe (Netherlands).

---
title: April 13, 2026: Dynamic Apache Iceberg™ tables now support PARTITION BY, TARGET_FILE_SIZE, and PATH_LAYOUT (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-04-13-dynamic-iceberg-tables-partition-file-path.md
section: Release Notes
---

# April 13, 2026: Dynamic Apache Iceberg™ tables now support PARTITION BY, TARGET_FILE_SIZE, and PATH_LAYOUT (*General availability*)

Dynamic Apache Iceberg™ tables now support the following table properties:

* `PARTITION BY`: Partition the table using Iceberg partition expressions such as identity, bucket, truncate, year, month, day, and hour transforms.
* `TARGET_FILE_SIZE`: Control the target Parquet file size for table writes. Defaults to `AUTO`, which lets Snowflake choose the optimal file size.
* `PATH_LAYOUT`: Choose between a flat or hierarchical (Hive-style) directory layout for data files. Use `HIERARCHICAL` together with `PARTITION BY` to write data to partition-aware paths.

For more information, see [Configure partitioning, file size, and path layout](../../../user-guide/dynamic-tables-create-iceberg.md) and [CREATE DYNAMIC ICEBERG TABLE](../../../sql-reference/sql/create-dynamic-table.md).

---
title: April 17, 2024 — Snowpark Container Services Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-17.md
section: Release Notes
---

# April 17, 2024 — Snowpark Container Services Release Notes

[Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) now provides metrics about nodes in the compute pool on which Snowflake runs services, as well as metrics about the services themselves running on the compute pool. For more information, see
[Snowpark Container Services: Monitoring Services](../../../developer-guide/snowpark-container-services/monitoring-services.md).

---
title: April 17-19, 2024 — 8.15 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_15.md
section: Release Notes
---

# April 17-19, 2024 — 8.15 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Geospatial | [H3_COMPACT_CELLS](../../sql-reference/functions/h3_compact_cells.md) (Preview) | Returns an array of VARIANT values that contain the INTEGER IDs of fewer, larger H3 cells that cover the same area as the H3 cells in the input. |
| Geospatial | [H3_COMPACT_CELLS_STRINGS](../../sql-reference/functions/h3_compact_cells_strings.md) (Preview) | Returns an array of VARIANT values that contain the VARCHAR hexadecimal IDs of fewer, larger H3 cells that cover the same area as the H3 cells in the input. |
| Geospatial | [H3_IS_PENTAGON](../../sql-reference/functions/h3_is_pentagon.md) (Preview) | Returns TRUE if the boundary of an H3 cell represents a pentagon. |
| Geospatial | [H3_IS_VALID_CELL](../../sql-reference/functions/h3_is_valid_cell.md) (Preview) | Returns TRUE if the input represents a valid H3 cell. |
| Geospatial | [H3_TRY_GRID_DISTANCE](../../sql-reference/functions/h3_try_grid_distance.md) (Preview) | A special version of H3_GRID_DISTANCE that returns NULL if an error occurs when it attempts to return the distance between two H3 cells. |
| Geospatial | [H3_TRY_GRID_PATH](../../sql-reference/functions/h3_try_grid_path.md) (Preview) | A special version of H3_GRID_PATH that returns NULL if an error occurs when it attempts to return an array of VARIANT values that contain the IDs of the H3 cells that represent the line between two cells. |
| Geospatial | [H3_UNCOMPACT_CELLS](../../sql-reference/functions/h3_uncompact_cells.md) (Preview) | Returns an array of VARIANT values that contain the INTEGER IDs of H3 cells at the specified resolution that cover the same area as the H3 cells in the input. |
| Geospatial | [H3_UNCOMPACT_CELLS_STRINGS](../../sql-reference/functions/h3_uncompact_cells_strings.md) (Preview) | Returns an array of VARIANT values that contain the VARCHAR hexadecimal IDs of H3 cells at the specified resolution that cover the same area as the H3 cells in the input. |

## Data loading / unloading updates

### Support for granting the READ and WRITE privileges on external stages

With this release, Snowflake is pleased to announce support for granting the READ and WRITE privileges on an external stage.
Previously, you could grant only the USAGE privilege on an external stage.

For more information, see [Stage privileges](../../user-guide/security-access-control-privileges.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 16-Apr-24 |
| *New FAILOVER privilege for client redirect* | **Removed** from *Replication updates* section (and section removed) | 18-Apr-24 |
| *Support for granting the WRITE privilege on external stages* | **Changed** to include the READ privilege as reflected in the new title, *Support for granting the READ and WRITE privileges on external stages* | 22-Apr-24 |

---
title: April 2023
source: https://docs.snowflake.com/en/release-notes/2023-04.md
section: Release Notes
---

# April 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Account Replication — *General Availability*

With this release, we are pleased to announce the general availability of
[Account Replication](../user-guide/account-replication-intro.md). This feature uses
[replication groups and failover groups](../user-guide/account-replication-intro.md) to replicate objects with point-in-time consistency
from a source account to one or more target accounts. A replication group allows customers to specify which account objects to replicate, to
which regions or cloud platforms, at customizable scheduled intervals. A failover group provides the same functionality as a replication
group and can additionally failover the objects in a group.

Account objects can include warehouses, users, and roles, along with databases and shares (refer to
[Replicated Objects](../user-guide/account-replication-intro.md) for the full list of objects that can be included in a replication or failover group).
Account objects can be grouped in one or multiple groups.

In the case of failover, account replication enables the failover of your entire account to a different region or cloud platform. Each
replication and failover group has its own replication schedule, allowing you to set the frequency for replication at different intervals
for different groups of objects. In the case of failover groups, it also enables failover of groups individually. You can choose to failover
all failover groups, or only select failover groups.

For more information, refer to [Introduction to replication and failover across multiple accounts](../user-guide/account-replication-intro.md).

### Support for Scala User-Defined Function Handlers — *Preview*

With this release, Snowflake is pleased to announce a preview of user-defined functions (UDFs) with a handler written in Scala.

For more information, refer to [Introduction to Scala UDFs](../developer-guide/udf/scala/udf-scala-introduction.md).

### Tabular Return Values from Python Stored Procedures — *Preview*

With this release, we are pleased to announce a preview of tabular stored procedures with a handler written in Python. You can write a
procedure that returns data in tabular form. To do this, you specify the procedure’s return type as TABLE (specifying columns for the return
value), then have your handler code return the tabular value in a Snowpark dataframe.

For more information, refer to [Python](../developer-guide/stored-procedure/python/procedure-python-tabular-data.md).

## SQL Updates

### Encryption Enhancements

With this release, we are pleased to announce upgraded encryption functions, ENCRYPT() and ENCRYPT_RAW(), which can be used by customers
to provide an additional layer of protection to user-provided values. The upgrades to both function were implemented as part of our
commitment to continuous improvement for our customers. Customers will get the benefit of these enhancements in all subsequent invocations
of the encryption functions.

For more information, refer to [Encryption functions](../sql-reference/functions-encryption.md).

### ALTER *<policy_kind>* POLICY Command: Support for Setting and Unsetting Tags

With this release, Snowflake adds support to set or unset a tag on a masking, password, row access, and session policy using an ALTER
statement.

Set a tag using the ALTER statement:

```sqlsyntax
ALTER <policy_kind> POLICY <name> SET TAG <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' ... ]
```

The ALTER … SET statement specifies the tag name (i.e. the key) and the tag value.

The tag value is always a string, and the maximum number of characters for the tag value is 256.

For more information, refer to [Tag quotas](../user-guide/object-tagging/introduction.md).

Unset a tag using the ALTER statement:

```sqlsyntax
ALTER <policy_kind> POLICY <name> UNSET TAG <tag_name> [ , <tag_name> ... ]
```

The ALTER … UNSET statement specifies the tag name only.

Set `policy_kind` to one of the following policies:

* MASKING
* PASSWORD
* ROW ACCESS
* SESSION

### SRID Argument Now Supported in GEOMETRY Constructor Functions

In the following GEOMETRY constructor functions, you can now specify the SRID as an argument:

* [TO_GEOMETRY](../sql-reference/functions/to_geometry.md)
* [TRY_TO_GEOMETRY](../sql-reference/functions/try_to_geometry.md)
* [ST_GEOMETRYFROMWKB](../sql-reference/functions/st_geometryfromwkb.md)
* [ST_GEOMETRYFROMWKT](../sql-reference/functions/st_geometryfromwkt.md)

The following example passes the SRID 4326 to the TO_GEOMETRY function:

```sqlexample
SELECT TO_GEOMETRY('POINT(1820.12 890.56)', 4326);
```

### Search Optimization and Query Acceleration Compatibility — *General Availability*

With this release, we are pleased to announce that search optimization and query acceleration can work together to optimize query
performance.

Search optimization can prune the micro-partitions not needed for a query. For eligible queries, query acceleration can offload parts of the
remaining work to shared compute resources provided by the service.

The performance of queries accelerated by both services varies depending on workload and available resources.

For more information, refer to:

* [Search optimization service](../user-guide/search-optimization-service.md)
* [Using the Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md)

### Search Optimization Service: Column-Specific Enablement and Substring, Geospatial, and Variant Support - *General Availability*

With this release, we are pleased to announce the general availability of enabling the Search Optimization Service for a specific column of
a table. Two related features that estimate the cost of enabling search optimization on a column and that display the search optimization
configuration for a specified table and its columns are also now generally available.

In addition, the Search Optimization Service now supports the following types of predicates:

* Predicates that use string patterns (e.g. LIKE, ILIKE, etc.) and POSIX regular expressions (e.g. RLIKE, REGEXP).
* Predicates that use fields in VARIANT, ARRAY, and OBJECT columns.
* Predicates that use geospatial functions such as ST_INTERSECTS, ST_CONTAINS, ST_WITHIN, ST_DWITHIN, ST_COVERS, and ST_COVEREDBY against
  GEOGRAPHY columns.

For more information, refer to:

* [Enabling search optimization for specific columns](../user-guide/search-optimization/enabling.md)
* [Displaying the search optimization configuration for a table](../user-guide/search-optimization/enabling.md)
* [Estimating the costs of search optimization](../user-guide/search-optimization/cost-estimation.md)
* [Speeding up substring and regular expression queries with search optimization](../user-guide/search-optimization/substring-queries.md)
* [Speeding up queries of semi-structured data with search optimization](../user-guide/search-optimization/semi-structured-queries.md)
* [Speeding up geospatial queries with search optimization](../user-guide/search-optimization/geospatial-queries.md)

## Data Loading Updates

### Cross-platform Support for Snowpipe Auto-Ingest — *Preview*

With this release, we are pleased to complete the cross-platform support for Snowpipe auto-ingest. Triggering automated Snowpipe data loads
using S3 event messages, GCS Pub/Sub event messages, and Azure Event Grid messages are now supported by Snowflake accounts hosted on
[any supported cloud platforms](../user-guide/intro-cloud-platforms.md).

For more information, refer to [Automating Continuous Data Loading Using Cloud Messaging](../user-guide/data-load-snowpipe-auto.md).

### Amazon EventBridge Support for Snowpipe Auto-Ingest — *Preview*

With this release, we are pleased to announce the Amazon EventBridge support for Snowpipe auto-ingest. You can set up Amazon EventBridge for
Snowpipe auto-ingest by following the steps in
[Automating Snowpipe for Amazon S3 with SNS](../user-guide/data-load-snowpipe-auto-s3.md).

### Snowpipe Auto-Ingest Supports the SftpCommit API for Azure

With this release, Snowpipe auto-ingest now supports the SftpCommit API for Microsoft.Storage.BlobCreated events to automatically retrieve
and load files created through SFTP. For more information, refer to
[Automating Snowpipe for Microsoft Azure Blob Storage](../user-guide/data-load-snowpipe-auto-azure.md).

## Data Collaboration Updates

### Timed Trials for Paid Listings — *General Availability*

With this release, we are pleased to announce the general availability of timed trials for paid listings offered on the Snowflake
Marketplace.

Providers who offer a paid listing on the Snowflake Marketplace can set up a timed trial to allow consumers to explore the entire data
product in a listing for a limited period of time, or combine a limited functionality trial with a timed trial and offer access to a
subset of data for a limited time.

For more information, refer to
[Configure listings](../collaboration/provider-listings-reference.md).

## Data Governance Updates

### Object Tagging: Support Added for Policy Objects

With this release, Snowflake is pleased to announce newly supported objects that can be tagged:

* [ALTER MASKING POLICY](../sql-reference/sql/alter-masking-policy.md)
* [ALTER PASSWORD POLICY](../sql-reference/sql/alter-password-policy.md)
* [ALTER ROW ACCESS POLICY](../sql-reference/sql/alter-row-access-policy.md)
* [ALTER SESSION POLICY](../sql-reference/sql/alter-session-policy.md)

You can set a tag or unset tag using the corresponding ALTER `policy_kind` POLICY statement.

For more information, refer to ALTER <policy_kind> POLICY Command: Support for Setting and Unsetting Tags.

## Web Interface Updates

### Secondary Roles Support in Snowsight — *General Availability*

With this release, we are pleased to announce the general availability of using secondary roles to access functionality in
Snowsight.

If you set the DEFAULT_SECONDARY_ROLES user property to `ALL`, secondary roles are activated when the user signs in to Snowflake.

When secondary roles are active, you do not need to switch roles or manually activate secondary roles to access pages in Snowsight
that your primary role, or a role in its hierarchy, cannot access. You can use your primary role to perform actions such as using worksheets
with a specific role, but still easily access other pages in Snowsight.

For more information, refer to [Active roles](../user-guide/security-access-control-overview.md) and [CREATE USER](../sql-reference/sql/create-user.md).

### Upload Files onto Stages Using Snowsight — *Preview*

With this release, we are pleased to announce the preview of loading files into stages using Snowsight.

With Snowsight, you can upload files onto named internal stages so that you can, for example, prepare to load data from the files
into tables or load dependencies for Python worksheets.

For more information, refer to [Staging files using Snowsight](../user-guide/data-load-local-file-system-stage-ui.md).

### Load Data into Tables using Snowsight — *Preview*

With this release, we are pleased to announce the preview of loading data into tables using Snowsight.

Load structured data files such as CSV or TSV-formatted files, or semi-structured data files such as JSON, Avro, or XML-formatted files into
tables using Snowsight.

You can load a file from your local machine into a table using Snowsight. You can specify an existing file format that you created
with the [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md) command, or you can define a temporary file format when you load the file.

For more information, refer to [Load data using Snowsight](../user-guide/data-load-web-ui.md).

### Snowsight Worksheet Tabs — *Preview*

With this release, we are pleased to announce the preview of tabs for opening worksheets in Snowsight.

Opening Snowsight worksheets in tabs lets you mimic the experience in Classic Console. You can use tabs to refer to multiple
active worksheets and explore the databases and schemas in Snowflake while writing SQL or Python.

For more information, refer to [Opening worksheets in tabs](../user-guide/ui-snowsight-worksheets-gs.md).

---
title: April 22, 2024 — Snowpark Container Services release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-22.md
section: Release Notes
---

# April 22, 2024 — Snowpark Container Services release notes

[Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) now supports block storage volumes for your services. For more information, see
[Using block storage volumes with services](../../../developer-guide/snowpark-container-services/block-storage-volume.md).

---
title: April 22-24, 2024 — 8.16 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_16.md
section: Release Notes
---

# April 22-24, 2024 — 8.16 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL command(s)

The following commands(s) are now available with this release:

| Command Category | New Command | Description |
| --- | --- | --- |
| Organization DDL | [SHOW ACCOUNTS](../../sql-reference/sql/show-accounts.md) | Lists all accounts in an organization, with the exclusion of managed accounts.  Currently duplicates the functionality of [SHOW ORGANIZATION ACCOUNTS](../../sql-reference/sql/show-organization-accounts.md). In a future release, the purpose and output of SHOW ORGANIZATION ACCOUNTS will change. |

### SQL API support for hybrid tables

With this release, Snowflake is pleased to announce support for the Snowflake SQL API when queries and other operations access data
in hybrid tables. The SQL API is a REST API that you can use to access and update data in a Snowflake database.

## Extensibility updates

### Asynchronous job support in Snowpark stored procedures

With this release, Snowflake is pleased to announce support for running concurrent asynchronous child jobs using Snowpark APIs
within stored procedure handler code written in Java, Python, or Scala. You can run an asynchronous query, as well as access its
status and result, or cancel the query.

## Data Lake Updates

### Apache Iceberg™ tables: Support for un-materialized identity partition columns

With this release, Snowflake is pleased to announce that you can create an Apache Iceberg™ table from an Iceberg data source that contains
un-materialized identity partition columns.

An un-materialized identity partition column is created when a table defines an identity transform
using a source column that doesn’t exist in a Parquet file.

Before release 8.16, this scenario was not supported.

For more information, see [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 22-Apr-24 |
| *Asynchronous job support in Snowpark stored procedures* | **Added** to *Extensibility updates* section | 23-Apr-24 |
| *Support for granting the WRITE privilege on external stages* | **Removed** from *Data loading / unloading updates* section (and section removed) because this update was announced in the previous release | 23-Apr-24 |
| *Apache Iceberg™ tables: Support for un-materialized identity partition columns* | **Added** to *Data Lake Updates* | 26-Apr-24 |

---
title: April 23, 2024 — Snowflake Connector for ServiceNow® V2  — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-23-svnc.md
section: Release Notes
---

# April 23, 2024 — Snowflake Connector for ServiceNow® V2 — *General Availability*

With this release, we are pleased to announce the general availability of the Snowflake Connector for ServiceNow® V2.

The Snowflake Connector for ServiceNow® provides instant access to up-to-date ServiceNow®
data without needing to manually integrate against API endpoints or manage third-party solutions.
Built on the Snowflake Native App Framework, the connector ingests data from ServiceNow® into
Snowflake automatically leveraging built-in security and reliability capabilities.

The connector supports both the initial load of historical data as well as incremental updates.
The latest data is regularly pulled from ServiceNow® and you control how frequently it is refreshed.

ServiceNow® is a cloud-based platform that delivers workflows for Service Management
including Incident Management, Change Management, Asset Management, Configuration Management,
Service Catalog, Request Fulfillment, and more.

For more information, see [About the Snowflake Connector for ServiceNow®](https://other-docs.snowflake.com/connectors/servicenow/about).

---
title: April 24, 2024 — Managing Listings using SQL
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-24-pl.md
section: Release Notes
---

# April 24, 2024 — Managing Listings using SQL

Snowflake is pleased to announce the preview of Managing listings using SQL.

You can now [create](../../../sql-reference/sql/create-listing.md),
[alter](../../../sql-reference/sql/alter-listing.md),
[describe](../../../sql-reference/sql/desc-listing.md),
[show](../../../sql-reference/sql/show-listings.md), and
[drop](../../../sql-reference/sql/drop-listing.md) the contents of a listing using SQL commands.

> **Note:**
>
> You cannot use SQL commands to offer paid, personalized listings, or listings on private data exchanges.

For a complete set of limitations and restrictions see [About managing listings using SQL](../../../progaccess/listing-progaccess-about.md).

For details, see [About managing listings using SQL](../../../progaccess/listing-progaccess-about.md)

---
title: April 24, 2024 — New FAILOVER privilege for Client Redirect
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-24-failover-privilege.md
section: Release Notes
---

# April 24, 2024 — New FAILOVER privilege for Client Redirect

With this release, we are pleased to announce the FAILOVER privilege for connection objects. The FAILOVER privilege enables
promoting a secondary connection to serve as the primary connection. The privilege is granted to the ACCOUNTADMIN role by default.
Account administrators can grant this privilege to other roles to facilitate failover in a disaster recover scenario.

For more information, see [Redirecting client connections](../../../user-guide/client-redirect.md).

---
title: April 29, 2024 — Dynamic Tables — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-29-dynamic-tables.md
section: Release Notes
---

# April 29, 2024 — Dynamic Tables — *General Availability*

With this release, we are pleased to announce the general availability of dynamic
tables, which are a new table type for continuous processing pipelines. Whether
you’re processing batch data daily or real time data in minutes, dynamic tables
allow you to create data pipelines that are easy to build, operate, and evolve.

With general availability, the following new enhancements are added:

* **Sharing and collaboration**: Dynamic tables can now be
  [shared](../../../user-guide/dynamic-tables-data-sharing.md) across regions and clouds using
  Snowflake’s sharing and collaboration features. This makes it easy to share
  clean, enriched, and transformed data products with consumers in your
  organization, partner organizations, or the broader data cloud community,
  ensuring they stay updated according to your specified cadence.
* **Disaster recovery and replication**: Dynamic tables now support high
  availability through Snowflake’s
  [replication infrastructure](../../../user-guide/account-replication-considerations.md). You can
  build your production pipelines with peace of mind knowing that you’re supported
  with Snowflake’s disaster recovery solutions.
* **Observability**: New functionality added for better observability via Snowsight
  and programmatic interfaces. In Snowsight, there are new account-level views,
  visibility into [warehouse consumption](../../../user-guide/dynamic-tables-cost.md),
  improved [graph](../../../sql-reference/functions/dynamic_table_graph_history.md) and
  [refresh history](../../../sql-reference/functions/dynamic_table_refresh_history.md),
  and the ability to [suspend and resume refreshes](../../../sql-reference/sql/alter-dynamic-table.md).
  Observability functions now include new account usage views, extended retention of
  information schema functions and added support for consistent metadata across
  Snowflake observability interfaces.
* **Data Cloud integrations**: Added support for [clustering](../../../sql-reference/sql/alter-dynamic-table.md),
  [transient](../../../sql-reference/sql/create-dynamic-table.md) dynamic tables, and governance policies (on
  sources of dynamic tables and dynamic tables themselves), allowing you to benefit
  from the best features of the Snowflake Data Cloud.
* **Scalability**: You can now create four times more dynamic tables in your account,
  and ten times more dynamic table sources feeding into another dynamic table. There
  are no longer any limits on the depth of a directed acyclic graph (DAG) that you can
  create.
* **Query evolution support**: Dynamic tables now automatically filter out new columns
  added to base tables without needing to rebuild the dynamic table, as long as the
  definition of the dynamic table does not use `SELECT *`.
* **New documentation**: We’ve added new articles to our documentation covering
  development best practices,
  [performance optimization guides](../../../user-guide/dynamic-tables-performance.md),
  [troubleshooting](../../../user-guide/dynamic-tables-troubleshooting.md) pipeline issues,
  and other improvements.

Additionally, Snowflake has made numerous under-the-hood refinements to enhance refresh
performance, system stability and scalability.

For more information, see [Dynamic tables](../../../user-guide/dynamic-tables-about.md).

---
title: April 29, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-04-29.md
section: Release Notes
---

# April 29, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced
in this update to Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Snowflake Provider Studio —– *Preview*

Snowflake is pleased to announce the preview of Provider Studio.

To use Provider Studio, you no longer need to be an account administrator or a user with create listings privileges.

For more information, see [Accessing Provider Studio](https://other-docs.snowflake.com/collaboration/provider-studio-accessing.html).

---
title: April 30, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-30-dcr.md
section: Release Notes
---

# April 30, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, Snowflake is pleased to announce the following improvements to Snowflake Data Clean Rooms:

* **Service Agreement**: Providers and consumers who are Snowflake Service customers can now use Snowflake Data Clean Rooms under their
  existing Service Agreement with Snowflake.
* **Billing for clean room managed accounts**: The consumer, not the provider, is billed for the consumer’s use of a
  [clean room managed account](../../../user-guide/cleanrooms/managed-accounts.md). A managed account allows someone who is not a Snowflake
  Service customer to collaborate with a provider.
* **Additional regions**: This release adds support for the following additional regions:

  | Cloud platform | Supported region | Cloud region ID |
  | --- | --- | --- |
  | **Amazon Web Services** | US East (Ohio) | us-east-2 |
  |  | Canada (Central) | ca-central-1 |
  |  | Asia Pacific (Mumbai) | ap-south-1 |
  |  | Asia Pacific (Singapore) | ap-southeast-1 |
  |  | Asia Pacific (Sydney) | ap-southeast-2 |
  | **Microsoft Azure** | Central US (Iowa) | centralus |
  |  | South Central US (Texas) | southcentralus |
  |  | East US 2 (Virginia) | eastus2 |
  |  | Canada Central (Toronto) | canadacentral |
  |  | Central India (Pune) | centralindia |
  |  | Southeast Asia (Singapore) | southeastasia |
  |  | Australia East (New South Wales) | australiaeast |

---
title: April 30, 2024 — Snowflake Google connectors
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-04-30-gaad-gard-ga.md
section: Release Notes
---

# April 30, 2024 — Snowflake Google connectors

## Snowflake Connector for Google Analytics Raw Data

With this release, we are pleased to announce the general availability of Snowflake Connector for Google Analytics Raw Data.
Google Analytics is a cloud-based tool that provides insight into how users interact with your website.
You can use it to analyze user actions, track the number of visitors and page views, and analyze bounce rates for a page.

The Snowflake Connector for Google Analytics Raw Data enables you to automatically ingest event-level Google Analytics 4 (GA4) data into your Snowflake account.

For more details, see [Snowflake Connector for Google Analytics Raw Data](../../../connectors/google/gard/gard-connector-about.md).

## Snowflake Connector for Google Analytics Aggregate Data

With this release, we are pleased to announce the general availability of Snowflake Connector for Google Analytics Aggregate Data.

The Snowflake Connector for Google Analytics Aggregate Data enables you to automatically ingest Google Analytics 4 (GA4) data into your Snowflake account.
The connector extracts aggregated data using the [GA4 Reporting API](https://developers.google.com/analytics/devguides/reporting/data/v1).

For more details, see [Snowflake Connector for Google Analytics Aggregate Data](https://other-docs.snowflake.com/connectors/google/gaad/gaad-connector-about.html).

See also:

* [Snowflake Connector for Google Analytics Aggregate Data release notes](../../connectors/gaad.md)
* [Snowflake Connector for Google Analytics Raw Data release notes](../../connectors/gard.md)

---
title: April 30-May 07, 2024 — 8.17 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_17.md
section: Release Notes
---

# April 30-May 07, 2024 — 8.17 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_04](../bcr-bundles/2024_04_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_03](../bcr-bundles/2024_03_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_02](../bcr-bundles/2024_02_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for June 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## Security updates

### Authentication enhancements — *General Availability*

With this release, we are pleased to announce the general availability of several authentication enhancements:

* Authentication policies
* Identifier-first login flow
* New properties for SAML2 security integrations
* Multiple identity providers support

#### Authentication policies

Authentication policies provide you with control over how users authenticate by allowing you to specify which clients can authenticate and
which authentication methods can be used with SAML2 and External OAuth security integrations.

For more information, see [Authentication policies](../../user-guide/authentication-policies.md) and [Limitations](../../user-guide/authentication-policies.md).

#### Identifier-first login flow

Identifier-first login allows Snowflake to identify a user before presenting authentication options. In this flow, Snowflake prompts the
user for their email address or username only, then displays authentication options based on the identity of the user.

For more information about this feature and how to enable it, see [Identifier-first login](../../user-guide/identifier-first-login.md).

#### New properties for SAML2 security integrations

A SAML2 security integration for a federated authentication configuration contains two new properties: ALLOWED_USER_DOMAINS and
ALLOWED_EMAIL_PATTERNS. When the user logs in, the user’s email address must match the values specified in these properties in order to
authenticate with the identifier provider associated with the security integration. This feature requires the Identifier-first login to be
enabled.

For more information, see [CREATE SECURITY INTEGRATION (SAML2)](../../sql-reference/sql/create-security-integration-saml2.md).

#### Multiple identity providers support

Snowflake now supports using multiple identity providers for federated authentication using SAML2 security integrations, which allows
different users to authenticate with different identity providers. This feature requires the identity-first login flow to be enabled.

For more information, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

## SQL updates

### READ ONLY property available for tables

With this release, you can create tables with a new READ ONLY property. The READ ONLY property is valid only for a temporary table that is
being created with the [CREATE TABLE … CLONE](../../sql-reference/sql/create-table.md) variant of the CREATE TABLE command. A read-only table does not allow DML
operations and only allows a subset of DDL operations.

When the 2024_04 behavior change bundle is enabled, information about the READ ONLY property is included in the output when you execute the
SHOW TABLES command, query the TABLES view, and call the GET_DDL function.

### ST_INTERSECTION_AGG and ST_UNION_AGG functions — *General Availability*

The following functions are now generally available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Geospatial | [ST_INTERSECTION_AGG](../../sql-reference/functions/st_intersection_agg.md) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the shape containing the combined set of points that are common to the shapes represented by the objects in the column (i.e. the intersection of the shapes). |
| Geospatial | [ST_UNION_AGG](../../sql-reference/functions/st_union_agg.md) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the combined set of points that are in at least one of the shapes represented by the objects in the column (i.e. the union of the shapes). |

## Data loading /unloading updates

### New copy option: INCLUDE_METADATA

With this release, we are pleased to announce a new copy option `INCLUDE_METADATA` for COPY INTO <table>. This copy option provides a
user-defined mapping between target table columns to [METADATA columns](../../user-guide/querying-metadata.md) and can only be used with the
`MATCH_BY_COLUMN_NAME` copy option.

By using these two copy options, `INCLUDE_METADATA` with `MATCH_BY_COLUMN_NAME`, data ingestion is simplified allowing for the
inclusion of file metadata into target tables columns while also loading file data columns.

In the following example, a mapping is defined with INCLUDE_METADATA. The existing columns, `ingestdate` and `filename`, are
populated with corresponding metadata columns alongside the file data columns.

```sqlexample
COPY INTO table1 FROM @stage1
MATCH_BY_COLUMN_NAME = CASE_INSENSITIVE
INCLUDE_METADATA = (
    ingestdate = METADATA$START_SCAN_TIME, filename = METADATA$FILENAME);
```

```output
+-----+-----------------------+---------------------------------+-----+
| ... | FILENAME              | INGESTDATE                      | ... |
|---------------------------------------------------------------+-----|
| ... | example_file.json.gz  | Thu, 22 Feb 2024 19:14:55 +0000 | ... |
+-----+-----------------------+---------------------------------+-----+
```

> **Note:**
>
> For CSV only, there is a known issue when the `INCLUDE_METADATA` copy option is used with `MATCH_BY_COLUMN_NAME`. Do not use
> this copy option when loading CSV files until the known issue is resolved.
>
> **Update**: This issue is resolved with [the 8.19 release](8_19.md).

For more information, see [Copy options (copyOptions)](../../sql-reference/sql/copy-into-table.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 29-Apr-24 |
| *New copy option: INCLUDE_METADATA* stated a known issue with CSV | **Changed** to the known issue is resolved | 15-May-24 |

---
title: Archived implemented unbundled behavior changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/unbundled-behavior-changes-implemented-archive.md
section: Release Notes
---

# Archived implemented unbundled behavior changes

Archived implemented unbundled behavior changes are unbundled behavior changes with an implementation date older than two years.
Snowflake periodically moves older but still relevant implemented unbundled behavior changes to this page.

For more information about unarchived BCRs, see:

* [Recently implemented changes](unbundled-behavior-changes.md) that were previously pending/disabled, were not part of a behavior change bundle, and cannot be disabled.
* [Upcoming pending changes](unbundled-behavior-changes.md) that will not be part of a behavior change bundle and cannot be enabled in advance.
* [Canceled behavior changes](unbundled-cancelled-behavior-changes.md) that have been removed from BCR bundles and will not be implemented.

If you have questions about any of these behavior changes, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Archived implemented behavior changes

The following table lists behavior changes that were implemented but archived after a certain period of time, typically two years.

| Release Date | Functional Area | Implemented Behavior Change | Additional Notes |
| --- | --- | --- | --- |
| **Nov 30, 2023** | Snowflake Native App Framework | [Snowflake Native App Framework: Need to recreate or update some APPLICATION objects](bcr-update-app-dev-mode.md) |  |
| **Nov 14, 2023** | Snowflake Native App Framework | [Snowflake Native App Framework: Providers must accept terms of service to set the DISTRIBUTION property to EXTERNAL](bcr-enforce-provider-tos.md) |  |
| **Nov 14, 2023** | Snowflake Native App Framework | [Snowflake Native App Framework Changes to the version output for the SHOW APPLICATIONS and DESC APPLICATION commands](bcr-add-unversioned-status.md) |  |
| **Nov 7, 2023** | Snowflake Native App Framework | [Snowflake Native App Framework Cannot use “UNVERSIONED” as the prefix of a version label](bcr-prevent-unversioned-in-version-name.md) |  |
| **October 23, 2023** | SQL Changes — Usage Views & Information Schema Views / Table Functions | [WAREHOUSE_EVENTS_HISTORY view: Change to the CLUSTER_NUMBER column output](bcr-warehouse-events-history-cluster-number.md) |  |
| **Sep 28, 2023** | Data Loading and Unloading | [Stronger UTF-8 validation for external files](bcr-1013-1014.md) |  |
| **Sep 19, 2023** | SQL Changes — Commands & Functions | [SHOW APPLICATIONS command: Changes to the LABEL column output](bcr-show-applications-output-change.md) | This change is enabled by default and cannot be disabled. |
| **Aug 23, 2023** | SQL Changes — Security | [CREATE USER command: NETWORK_POLICY parameter must specify a valid network policy](bcr-non-existing-network-policy.md) |  |
| **Sep 27, 2022** | Snowflake CLI, Connectors, Drivers, and SQL API Changes | [Snowflake Connector for Python: Empty results of fetch_arrow and fetch_pandas are typed](bcr-812.md) |  |
| **Aug 24, 2022** | Snowflake CLI, Connectors, Drivers, and SQL API Changes | [Snowflake .NET driver update - August 2022](dot-net-driver-relnotes.md) | Snowflake .NET driver 2.0.16: Replaces .NET Standard 2.0 with .NET 6.0 |
| **2021 and 2022** | Infrastructure Changes | [Microsoft Azure subnet expansion (Pending for selected accounts)](bcr-MSAzure-2021-11-29.md) | This change only impacts accounts hosted on Azure that are using the functionality documented in the provided article. |

---
title: ARRAY_CAT Function: Changes to NULL Handling
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-940.md
section: Release Notes
---

# ARRAY_CAT Function: Changes to NULL Handling

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The way in which the [ARRAY_CAT](../../../sql-reference/functions/array_cat.md) function handles NULL input values has changed:

Previously:
:   If you passed NULL as an input argument to ARRAY_CAT, the function reported the following error:

    `NULL result in a non-nullable column`

Currently:
:   If you pass NULL as an input argument to ARRAY_CAT, the function returns NULL, rather than reporting an error.

Ref: 940

---
title: ARRAY_POSITION Function: Changes to Finding the Position of a NULL Value
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-882.md
section: Release Notes
---

# ARRAY_POSITION Function: Changes to Finding the Position of a NULL Value

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

When you call the [ARRAY_POSITION](../../../sql-reference/functions/array_position.md) function and pass in a value as the first argument, the
function returns the position of the first ARRAY element with that value.

The ARRAY_POSITION function has changed when you specify NULL as the first argument:

Previously:
:   The function returned NULL. For example:

    ```sqlexample
    SELECT ARRAY_POSITION(NULL, [10, NULL, 30]);

    +--------------------------------------+
    | ARRAY_POSITION(NULL, [10, NULL, 30]) |
    |--------------------------------------|
    |                                 NULL |
    +--------------------------------------+
    ```

Currently:
:   The function returns the position of the first NULL in the ARRAY. For example:

    ```sqlexample
    SELECT ARRAY_POSITION(NULL, [10, NULL, 30]);
    +--------------------------------------+
    | ARRAY_POSITION(NULL, [10, NULL, 30]) |
    |--------------------------------------|
    |                                    1 |
    +--------------------------------------+
    ```

This change was implemented for consistency with the [ARRAY_CONTAINS](../../../sql-reference/functions/array_contains.md) function. When you use the
ARRAY_CONTAINS function to determine if an ARRAY contains NULL, the function returns TRUE.

Ref: 882

---
title: ASOF JOIN syntax: Restricted use of keywords
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1138.md
section: Release Notes
---

# ASOF JOIN syntax: Restricted use of keywords

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The ASOF JOIN feature behaves as follows:

Before the change:
:   The use of ASOF and MATCH_CONDITION as object names, object aliases, session variables,
    and bind variables is not restricted.

After the change:
:   ASOF and MATCH_CONDITION are new keywords. The use of these keywords in SELECT commands
    and commands that set or use session or bind variables is restricted.

    **Names of objects**

    If a SELECT statement uses ASOF or MATCH_CONDITION as the name of a table, view, or inline view, you must
    identify it as follows:

    * If the object was created with double quotes in the name, use the same double-quoted name.
    * If the object was created without double quotes in the name, use double quotes and capital letters.

    For example, the following statements are no longer allowed and return errors:

    ```sqlexample
    SELECT * FROM asof;
    WITH match_condition AS (SELECT * FROM T1)
      SELECT * FROM match_condition;
    ```

    If you created the objects with double quotes, fix the problem by using double quotes:

    ```sqlexample
    SELECT * FROM "asof";
    WITH "match_condition" AS (SELECT * FROM T1)
      SELECT * FROM "match_condition";
    ```

    If you created the objects without double quotes, fix the problem by using double quotes and capital letters:

    ```sqlexample
    SELECT * FROM "ASOF";
    WITH "MATCH_CONDITION" AS (SELECT * FROM T1)
      SELECT * FROM "MATCH_CONDITION";
    ```

    > **Note:**
    >
    > Snowflake recommends that you discontinue the use of these object names in your applications.

    **Names of aliases**

    If a SELECT statement uses ASOF or MATCH_CONDITION as an alias, you must use AS before the alias or double-quote
    the alias. For example, the following statements are no longer allowed and return errors:

    ```sqlexample
    SELECT * FROM t1 asof;
    SELECT * FROM t2 match_condition;
    ```

    Fix the problem in one of the following ways:

    ```sqlexample
    SELECT * FROM t1 AS asof;
    SELECT * FROM t1 "asof";
    SELECT * FROM t2 AS match_condition;
    SELECT * FROM t2 "match_condition";
    ```

    **Names of variables**

    If you are using session variables or bind variables with the name ASOF or MATCH_CONDITION,
    and their names were not double-quoted when they were created, they must be renamed or removed.

    For example, you can no longer set a session variable named `asof`:

    ```sqlexample
    set asof ='2024/01/15';
    ```

    ```output
    001003 (42000): SQL compilation error:
    syntax error line 1 at position 4 unexpected 'asof'.
    ```

    However, you can set a variable that is explicitly double-quoted and named `"asof"` or `"ASOF"`:

    ```sqlexample
    set "asof" ='2024/01/15';
    ```

    ```output
    +----------------------------------+
    | status                           |
    |----------------------------------|
    | Statement executed successfully. |
    +----------------------------------+
    ```

    The same rules apply to bind variables, such as `:asof` and `:match_condition`.

Ref: 1138

---
title: Aug 01, 2025: Snowflake Intelligence (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-01-snowflake-intelligence.md
section: Release Notes
---

# Aug 01, 2025: Snowflake Intelligence (*Preview*)

Gain insights and take action based on data in your organization with agents using Snowflake Intelligence. Agents can answer questions, provide insights, and show visualizations.

With Snowflake Intelligence, you can:

* Create charts and get instant answers using natural language. You can discover trends and analyze data without technical expertise or waiting for custom dashboards.
* Access and analyze thousands of data sources, including structured and unstructured data together. You can connect insights from spreadsheets, documents, images, and databases simultaneously.

For more information, see [Overview of Snowflake Intelligence](../../../user-guide/snowflake-cortex/snowflake-intelligence.md).

---
title: Aug 01, 2025: Snowpark Container Services in Google Cloud (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-01-spcs-google-cloud-ga.md
section: Release Notes
---

# Aug 01, 2025: Snowpark Container Services in Google Cloud (*General availability*)

[Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) is now generally available to Snowflake accounts in Google Cloud commercial regions. For more
information, see [Available Regions](../../../developer-guide/snowpark-container-services/overview.md).

---
title: Aug 04, 2025: Hybrid table storage for Time Travel data
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-04-hybrid-tables-time-travel-billing.md
section: Release Notes
---

# Aug 04, 2025: Hybrid table storage for Time Travel data

Consumption for hybrid table storage now takes into account the data that is retained by
[Time Travel](../../../user-guide/data-time-travel.md).
Data retained by Time Travel is included in the following storage metrics:

* STORAGE_BYTES column in the [STORAGE_USAGE view](../../../sql-reference/account-usage/storage_usage.md)
* AVERAGE_DATABASE_BYTES column in:

  + The Account Usage [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/account-usage/database_storage_usage_history.md)
  + The Organization Usage [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/organization-usage/database_storage_usage_history.md)
  + The Information Schema [DATABASE_STORAGE_USAGE_HISTORY](../../../sql-reference/functions/database_storage_usage_history.md) function

Time Travel data is stored in object storage, not the row store, and is charged at the standard table rate,
not the higher hybrid table rate.

---
title: Aug 05, 2025: Document AI table extraction (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-05-document-ai.md
section: Release Notes
---

# Aug 05, 2025: Document AI table extraction (*General availability*)

With Document AI, you can extract tables from documents of various formats.
Additionally, you can now export and import CSV files to review answers
for table extraction more easily.

To extract tables, select the document processing type before you begin defining values
for your Document AI model build.

---
title: Aug 06, 2025: Cortex Agents: admin configuration UI (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-06-cortex-agents-admin-ui.md
section: Release Notes
---

# Aug 06, 2025: Cortex Agents: admin configuration UI (*Preview*)

Create an agent from the Agent admin page in the Snowsight UI to answer questions and provide insights using a semantic view, semantic model, a Cortex Search service, or a combination of these.

For more information, see [Configure and interact with Agents](../../../user-guide/snowflake-cortex/cortex-agents-manage.md).

---
title: Aug 06, 2025: Support for custom components in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-06-sis.md
section: Release Notes
---

# Aug 06, 2025: Support for custom components in Streamlit in Snowflake (Preview)

Custom components are now supported in Streamlit in Snowflake. Currently, Streamlit in Snowflake only supports custom components that don’t
require making calls to external services.

For more information about custom components, see
[Streamlit documentation](https://docs.streamlit.io/develop/concepts/custom-components/intro).

---
title: Aug 07, 2025: Cortex AI_TRANSCRIBE (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-07-cortex-aisql-ai-transcribe.md
section: Release Notes
---

# Aug 07, 2025: Cortex AI_TRANSCRIBE (*Preview*)

The AI_TRANSCRIBE function, now in available in preview in [select regions](../../../user-guide/snowflake-cortex/aisql.md),
provides SQL-native speech-to-text AI processing at scale. With AI_TRANSCRIBE, you can extract insights from customer
care interactions, healthcare consultations, and business meeting recordings. Files can be processed directly from
object storage, avoiding data movement, and no infrastructure management is required, so you can get started right away.

AI_TRANSCRIBE lets you:

* Perform simple text transcription for basic needs, extract word-level timestamps for precise navigation, or
  automatically identify speakers for multi-speaker content analysis.
* Build comprehensive customer intelligence pipelines that transcribe support calls and combine with AI_SENTIMENT for
  instant sentiment analysis.
* Streamline compliance and quality monitoring by transcribing customer service calls with speaker identification,
  enabling automated labeling of issues using AI_CLASSIFY.
* Generate executive meeting summaries that automatically identify key speakers, extract decision points, and create
  structured reports from board meetings or stakeholder calls using AI_TRANSCRIBE and AI_AGG.
* Build multilingual content processing workflows that transcribe international customer interactions and combine with
  other AI Functions for comprehensive global customer experience analysis.

For more information, see [Cortex AI Functions: Audio](../../../user-guide/snowflake-cortex/ai-audio.md) and [AI_TRANSCRIBE](../../../sql-reference/functions/ai_transcribe.md).

---
title: Aug 07, 2025: Enforced join order with directed joins (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-07-directed-join.md
section: Release Notes
---

# Aug 07, 2025: Enforced join order with directed joins (*Preview*)

When you run join queries, you can now enforce the join order of the tables using the `DIRECTED` keyword.
When you run a query with a directed join, the first, or left, table is scanned before the second, or right, table.
For example, `o1 INNER DIRECTED JOIN o2` scans the `o1` table before the `o2` table.

Directed joins are useful in the following situations:

> * You are migrating workloads into Snowflake that have join order directives.
> * You want to improve performance by scanning join tables in a specific order.

For more information, see [JOIN](../../../sql-reference/constructs/join.md).

---
title: Aug 07, 2025: Snowpark Container Services batch jobs (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-07-spcs-batch-jobs-pupr.md
section: Release Notes
---

# Aug 07, 2025: Snowpark Container Services batch jobs (*Preview*)

Support for running [multiple replicas of a Snowpark Container Services job service](../../../developer-guide/snowpark-container-services/working-with-services.md) is available in preview.

---
title: Aug 08, 2025: Contacts (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-08-contacts.md
section: Release Notes
---

# Aug 08, 2025: Contacts (*General availability*)

You can now associate contacts with objects such as databases and tables so users can reach the right person for assistance with those
objects. Each contact is a schema-level object that contains details about how to communicate with the user or group of users,
for example, whether to use an email address or access a URL. An object can have multiple contacts as long as the purpose of
each contact is different. For example, a table might have one contact for access approval and another contact for general
support.

For more information, see [Using Contacts](../../../user-guide/contacts-using.md).

---
title: Aug 11, 2025: CORS configuration to enable cross-origin requests to a Snowpark Container Services service (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-11-spcs-cors-ga.md
section: Release Notes
---

# Aug 11, 2025: CORS configuration to enable cross-origin requests to a Snowpark Container Services service (*General availability*)

Using CORS configuration to enable cross-origin requests to a Snowpark Container Services service is now generally available.
For more information, see [Ingress and web app security](../../../developer-guide/snowpark-container-services/service-network-communications.md).

---
title: Aug 12, 2025: Snowflake ML Jobs (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-12-distributed-ml-jobs.md
section: Release Notes
---

# Aug 12, 2025: Snowflake ML Jobs (*General availability*)

Support for ML jobs is now generally available and is no longer in [Preview](../../preview-features.md).

Snowflake ML Jobs is a framework which lets you leverage the Container Runtime from any environment. You can use the ML Jobs SDK to:

* Submit and manage jobs using Snowpark Container Services
* Leverage GPU and high-memory CPU instances for resource-intensive tasks
* Use your preferred development environment (VS Code, external notebooks, etc.)

For more information, see: - [Snowflake Multi-Node ML Jobs](../../../developer-guide/snowflake-ml/ml-jobs/distributed-ml-jobs.md)

---
title: Aug 12, 2025: Support for Streamlit 1.46 (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-12-sis.md
section: Release Notes
---

# Aug 12, 2025: Support for Streamlit 1.46 (General availability)

Version 1.46 of the Streamlit open-source library is now supported in Streamlit in Snowflake.

---
title: Aug 14, 2025: Support for stored procedures in data lineage (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-14-lineage.md
section: Release Notes
---

# Aug 14, 2025: Support for stored procedures in data lineage (*Preview*)

Snowflake is extending its lineage capabilities beyond data and ML lineage to capture processes connecting source and
target objects. As you view the lineage graph in Snowsight, you can now obtain details about a stored procedure that resulted in a
downstream object. If this stored procedure is nested within other stored procedures, you can also view details about the stored procedure
that is at the top of the hierarchy of nested procedures.

For more information, see [Lineage created by a stored procedure or task](../../../user-guide/ui-snowsight-lineage.md).

---
title: Aug 14, 2025: Using SQL for Cortex Powered Object Descriptions (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-14-sql-object-descriptions.md
section: Release Notes
---

# Aug 14, 2025: Using SQL for Cortex Powered Object Descriptions (*Preview*)

You can now call a stored procedure, AI_GENERATE_TABLE_DESC, to programmatically generate Cortex Powered Object Descriptions. The Cortex
Powered Object Descriptions feature uses the [Snowflake Cortex COMPLETE function](../../../sql-reference/functions/complete-snowflake-cortex.md)
to automatically generate descriptions for tables, views, and columns.

The AI_GENERATE_TABLE_DESC stored procedure is in preview. Using Snowsight to generate object descriptions is generally available.

For more information, see [Using SQL to automatically generate object descriptions](../../../user-guide/sql-cortex-descriptions.md).

---
title: Aug 14, 2025: Workload identity federation (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-14-wif.md
section: Release Notes
---

# Aug 14, 2025: Workload identity federation (*General availability*)

Workload identity federation lets your workloads — such as services, applications, and containers — authenticate to Snowflake without
managing or storing long-lived credentials. It provides similar security benefits to using an identity provider like in External OAuth, but
can be much simpler to implement.

Implementing workload identity federation consists of configuring the workload to use its native identity provider, creating a Snowflake
service user for the workload, and making sure the workload uses a Snowflake driver that is capable of sending an attestation or security
token from the native identity provider to Snowflake.

When Snowflake’s [deprecation of single-factor password sign-ins](../../../user-guide/security-mfa-rollout.md) is complete, workloads that
authenticate to Snowflake without human interaction won’t be able to use a password. Workload identity federation provides a
straightforward, secure authentication method for these workloads.

For more information, see [Workload identity federation](../../../user-guide/workload-identity-federation.md).

---
title: Aug 18, 2025: Snowsight navigation menu updates (Gradual rollout)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-18-snowsight-navigation.md
section: Release Notes
---

# Aug 18, 2025: Snowsight navigation menu updates (Gradual rollout)

Snowflake is gradually rolling out updates to the Snowsight navigation menu. During this transition, some accounts already have the new
navigation while others still have the previous version. This is expected, as the rollout is occurring in phases. Eventually, all accounts
will be transitioned to the new navigation experience.

The updated navigation menu is organized by feature groups under key categories to help you find the tools you need more quickly.

For more information, see [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md).

---
title: Aug 18, 2025: Write Once, Read Many (WORM) snapshots (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-18-worm-snapshots.md
section: Release Notes
---

# Aug 18, 2025: Write Once, Read Many (WORM) snapshots (*Preview*)

The Write Once, Read Many (WORM) snapshots feature is now in [Preview](../../preview-features.md).

WORM snapshots represent backups of specific Snowflake tables, schemas, or databases.
These backups are *immutable*: they can’t be changed after being created.
Snowflake manages all the snapshots of a specific object within a single container object, the snapshot set.
You can establish snapshot policies that determine how often to automatically take new snapshots, when to automatically
delete old snapshots, and when to prevent important snapshots from being deleted, even by privileged users.

For more information, see: [Backups for disaster recovery and immutable storage](../../../user-guide/backups.md).

---
title: Aug 19, 2025: Trust Center email notifications (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-19-trust-center-email-notifications-ga.md
section: Release Notes
---

# Aug 19, 2025: Trust Center email notifications (*General availability*)

Trust Center email notifications are now generally available and are no longer in
[Preview features](../../preview-features.md).

You can configure the Trust Center to send email notifications when it finds violations.
You can specify that the Trust Center sends notifications for all of the enabled scanners
in a scanner package or for individual scanners. You can also specify the severity of the
violations for which email notifications are sent.

For more information, see [Sending email notifications about Trust Center findings](../../../user-guide/trust-center/notifications-trust-center.md).

---
title: Aug 20, 2025: Cortex Search Service replication (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-20-cortex-search-service-replication.md
section: Release Notes
---

# Aug 20, 2025: Cortex Search Service replication (*Preview*)

Cortex supports the replication of Cortex Search Services from a source account to one or more target accounts in the same organization. This replication is integrated seamlessly with Snowflake replication and failover groups to provide point-in-time consistency for the objects on the target account.

For more information, see [Replicate a Cortex Search Service](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-replication.md).

---
title: Aug 20, 2025: Distributed processing in Snowflake ML: Many Model Training and Distributed Partition Function
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-20-snowflake-ml-distributed-processing.md
section: Release Notes
---

# Aug 20, 2025: Distributed processing in Snowflake ML: Many Model Training and Distributed Partition Function

Snowflake ML now supports distributed processing capabilities for training multiple models and processing data across partitions.

You can use Many Model Training (MMT) to train multiple machine learning models efficiently across data partitions. MMT partitions your Snowpark DataFrame by a column that you specify and trains separate models on each partition in parallel.

You can use the Distributed Partition Function (DPF) to process data in parallel across one or more nodes in a compute pool. DPF partitions your Snowpark DataFrame by a column that you specify and executes your Python function on each partition in parallel.

Both features help you handle infrastructure complexity and scale automatically.

For more information, see [Train models across data partitions](../../../developer-guide/snowflake-ml/train-models-across-partitions.md) and [Process data with custom logic across partitions](../../../developer-guide/snowflake-ml/process-data-across-partitions.md).

---
title: Aug 20, 2025: New stage volume implementation in Snowpark Container Services (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-20-spcs-stage-volume-new.md
section: Release Notes
---

# Aug 20, 2025: New stage volume implementation in Snowpark Container Services (*Preview*)

A new stage volume implementation for your application containers is available in preview.
For more information, see [Using Snowflake stage volumes with services](../../../developer-guide/snowpark-container-services/snowflake-stage-volume.md).

---
title: Aug 21, 2025: AI Parse Document layout mode (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-21-aisql-ai-parse-document-layout-ga.md
section: Release Notes
---

# Aug 21, 2025: AI Parse Document layout mode (*General availability*)

The Snowflake Cortex AI_PARSE_DOCUMENT document is now generally available with advanced layout extraction capabilities. This fully managed SQL function extracts the layout of the page in Markdown format, preserving text, tables, and structural elements from documents with enterprise-grade accuracy and scale.

> **Note:**
>
> The AI_PARSE_DOCUMENT function is the new version of SNOWFLAKE.CORTEX.PARSE_DOCUMENT.
> The old function is still supported, but Snowflake recommends using the new function.

Key capabilities of AI_PARSE_DOCUMENT include:

* **Complex layout mastery:** Accurately process multi-column research papers, financial reports, and technical
  documentation while preserving reading order and document hierarchy.
* **Precise table extraction:** Maintains table structure, headers, and relationships from financial statements,
  regulatory filings, and data-heavy documents for downstream analysis
* **Advanced Layout Preservation** Handles mixed content including embedded images, pull quotes, and complex
  formatting without losing context or meaning

For more information, see [Parsing documents with AI_PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/parse-document.md).

---
title: Aug 21, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-21-dcr.md
section: Release Notes
---

# Aug 21, 2025: Snowflake Data Clean Rooms updates

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Simplified flow for cross-region sharing.** Previously a provider needed to call both `request_laf_cleanroom_requests` and
  `mount_laf_cleanroom_requests_share` to receive requests from the consumer when the consumer account is in a different region. Now the
  provider can simply call `mount_request_logs_for_all_consumers` instead.
* **Simplified installation.** Instead of requiring the user to create and verify a service user, clean rooms now automatically creates and
  verifies a service user, or reuses an existing service user. This greatly simplifies the installation process.

---
title: Aug 22, 2025: AI_EXTRACT function (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-22-ai-extract.md
section: Release Notes
---

# Aug 22, 2025: AI_EXTRACT function (*Preview*)

Extract information from an input string or file with the new
[Snowflake Cortex AI Functions](../../../user-guide/snowflake-cortex/aisql.md) function,
[AI_EXTRACT](../../../sql-reference/functions/ai_extract.md), now available in preview.

The AI_EXTRACT function enables extracting information from unstructured data sources, such as text, images,
and documents, for example, financial and tax statements, contracts, invoices, medical reports, marketing materials,
and regulatory or business records.

With AI_EXTRACT, you can:

* Simplify complex extraction tasks into a single operation that offers predictable pricing and reduced orchestration requirements.
* Process files directly from a stage, eliminating the need for data transfer, duplication, or infrastructure management.
* Define output schemas with the `responseFormat` argument.

For more information, see [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md).

---
title: Aug 22, 2025: Organization profile updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-22-org-profiles.md
section: Release Notes
---

# Aug 22, 2025: Organization profile updates

The following new features and enhancements are now available for organization profiles:

* **Support for creating organization profiles in Snowsight.** Previously, creating organization profiles was only available using SQL commands.
* **Allow specific roles in an account to access a profile.** When creating organization profiles, you can now assign specific roles within an account that can access the profile.

For more information, see the following documents:

* [Create and manage organization profiles](../../../user-guide/collaboration/organization-profiles/org-profiles-create-manage.md)
* [Organization profile manifest reference](../../../user-guide/collaboration/organization-profiles/org-profile-manifest-reference.md)

---
title: Aug 25, 2025: Snowflake Connectors for Microsoft Power Apps (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-25-mspowerapps.md
section: Release Notes
---

# Aug 25, 2025: Snowflake Connectors for Microsoft Power Apps (*General availability*)

The Snowflake Connector for Microsoft Power Platform allows you to connect to Snowflake from Microsoft Power Apps,
Power Automate, Copilot Studio, and other Microsoft applications.

The Microsoft Power Platform allows you to create flows and add actions to execute and return results of custom SQL statements executed within Snowflake.

For more information about the connector, see [About the Snowflake Connector for Microsoft Power Platform](../../../connectors/microsoft/powerapps/about.md).

---
title: Aug 26, 2025: Using the database object explorer in Snowsight to create and manage semantic views (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-26-semantic-views-in-snowsight.md
section: Release Notes
---

# Aug 26, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*Preview*)

In Snowsight, you can use the database object explorer to create and manage semantic views.

This is a [preview feature](../../preview-features.md).

For information, see [Using Snowsight to create and manage semantic views](../../../user-guide/views-semantic/ui.md).

---
title: Aug 28, 2025: Hybrid table support for periodic rekeying (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-28-hybrid-tables-periodic-rekeying.md
section: Release Notes
---

# Aug 28, 2025: Hybrid table support for periodic rekeying (*General availability*)

Hybrid tables now support [periodic rekeying](../../../user-guide/security-encryption-manage.md).

Accounts that contain hybrid tables can enable and use periodic rekeying without any
additional configuration. The command to enable periodic rekeying is the same as for
standard tables:

```sqlexample
ALTER ACCOUNT SET PERIODIC_DATA_REKEYING = true;
```

---
title: Aug 28, 2025: Model Registry model deployment UI (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-28-model-deployment-ui.md
section: Release Notes
---

# Aug 28, 2025: Model Registry model deployment UI (*Preview*)

Deploy models directly to SPCS Model Serving from the Model Registry UI. You can also view the details of the deployed inference service, as well as suspend the service directly from the Model Registry UI.

For more information, see [Deploy user models](../../../developer-guide/snowflake-ml/model-registry/snowsight-ui.md).

---
title: Aug 28, 2025: Monitoring events for Snowpipe
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-28-monitoring-events-for-snowpipe.md
section: Release Notes
---

# Aug 28, 2025: Monitoring events for Snowpipe

You can now configure Snowflake to record detailed events, providing valuable insights into the status and progress of your data pipelines
and tables. This new capability helps you monitor your data ingestion processes more effectively.

## Snowpipe: data ingestion events

Snowpipe now records events that provide detailed information about the status of your pipes. These events are captured in the active event
table associated with the pipe. By monitoring these events, you can gain insights into the following areas:

* Pipe status changes: Track the operational state of your Snowpipes.
* File processing progress: Understand the journey of files through the Snowpipe system.
* Periodic, aggregated, ingestion statistics digest: Get summarized statistics on data ingestion.

For more information, see [Monitor events for Snowpipe](../../../user-guide/data-load-snowpipe-monitor-events.md).

## Externally managed Apache Iceberg™ tables: automated refresh events

As part of the Snowpipe event monitoring feature, Snowflake now records events that provide information about the status of automated
refresh for externally managed Iceberg tables. These events, which are a component of the new Snowpipe events, can help you gain insights
into automated refresh progress and aggregated statistics. Note that this feature does not support events for manual refreshes.

For more information, see [Monitor automated refresh events](../../../user-guide/tables-iceberg-auto-refresh.md).

---
title: Aug 28, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-28-dcr.md
section: Release Notes
---

# Aug 28, 2025: Snowflake Data Clean Rooms updates

**Clean Rooms API Version: 9.7**

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Single-account testing** - You can now act as both provider and consumer in the single account for a test clean room. This should
  enable easier testing of clean room code for users with a single Snowflake account. [Learn more.](../../../user-guide/cleanrooms/v1/developer-introduction.md)
* **Configurable refresh rates for Cross-Cloud Auto-Fulfillment.** The default refresh rate for provider clean room data to consumers
  located on other cloud hosting regions has been shortened from 24 hours to 30 minutes. The refresh rate for this data is
  [configurable](../../../user-guide/cleanrooms/provider.md).

---
title: Aug 29, 2025: Snowflake Native Apps: Restricted caller’s rights (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-08-29-native-apps-rcr-ga.md
section: Release Notes
---

# Aug 29, 2025: Snowflake Native Apps: Restricted caller’s rights (*General availability*)

Snowflake Native App support for restricted caller’s rights is now generally available. Restricted caller’s rights allow an app’s stored procedures and Snowpark Container Services (SPCS) services to execute with caller’s rights. However, these executables can only use a select subset of
available privileges. These privileges must be requested by the app and
granted by the admin of the consumer account when configuring the app.

For information on using restricted caller’s rights in an app, see
[Grant restricted caller’s rights to an executable in an app](../../../developer-guide/native-apps/ui-consumer-restricted-callers-rights.md).

---
title: August 01, 2024 — Snowpark Container Services release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-01-spcs.md
section: Release Notes
---

# August 01, 2024 — Snowpark Container Services release notes

With this [Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) release, we are pleased to announce the following:

* General availability to Snowflake accounts in all commercial AWS regions.
* Preview availability to Snowflake accounts in all commercial Azure regions.

---
title: August 01, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-08-01.md
section: Release Notes
---

# August 01, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## New stage explorer in Snowsight — *General availability*

With this release, we are pleased to announce the general availability of the new stage explorer in Snowsight. When you load a staged file into a table, you can select the files directly from a stage explorer. In the stage explorer, you can select the stage root, or folders and files within the stage. You don’t need to navigate to the stage in the object explorer anymore, and you don’t need to copy and paste the path of the staged file when loading a staged file into a table.

For more information, see [Load data using Snowsight](../../../user-guide/data-load-web-ui.md).

## Schema detection and visual column mapping for loading files to existing tables in Snowsight –— *Preview*

With this release, we are pleased to announce the preview of schema detection and visual column mapping for loading files into existing tables in Snowsight.

When you load files into an existing table in Snowsight, you can now visualize the column mapping between the source file and the target table. Use the UI to make adjustments as needed before loading.

For more information, see [Load data using Snowsight](../../../user-guide/data-load-web-ui.md).

---
title: August 01, 2024 — Support for Streamlit 1.35.0 in Streamlit in Snowflake
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-01-sis.md
section: Release Notes
---

# August 01, 2024 — Support for Streamlit 1.35.0 in Streamlit in Snowflake

With this release, we are pleased to announce support for version 1.35.0 of the Streamlit open-source
library in Streamlit in Snowflake.

For more information, see [About Streamlit in Snowflake](../../../developer-guide/streamlit/about-streamlit.md).

---
title: August 01-02, 2023 — 7.26 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_26.md
section: Release Notes
---

# August 01-02, 2023 — 7.26 Release Notes

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## SQL Updates

### SELECT \*: Selecting Columns Matching a SQL Pattern and Replacing Column Values

With this release, in a SELECT \* statement, you can specify ILIKE with a pattern containing SQL wildcards (`_` to match a
single character and `%` to match any sequence of zero or more characters) to select only the columns that match that pattern.
The ILIKE keyword performs case-insensitive matching.

```sqlexample
SELECT * ILIKE '<pattern>' ...
```

For example, to select only the columns with names containing `id`:

```sqlexample
SELECT * ILIKE '%id%' ...
```

In addition, you can replace the values of specific columns in a SELECT \* statement by specifying REPLACE with an expression that evaluates
to the new value:

```sqlexample
SELECT * REPLACE (<expr> AS <col_name> [ , <expr> AS <col_name> , ... ])
```

For example, to prepend the string `'DEPT-'` to each value in the `department_id` column:

```sqlexample
SELECT * REPLACE ('DEPT-' || department_id AS department_id) ...
```

For more information, see [SELECT](../../sql-reference/sql/select.md).

### Transforming a GEOMETRY Object to a Different Spatial Reference System (ST_TRANSFORM) — *General Availability*

With this release, we are pleased to announce the general availability of the ST_TRANSFORM function, which you can use to transform a
GEOMETRY object from one spatial reference system (SRS) to another.

The following example creates a POINT GEOMETRY object that uses EPSG:32633 (WGS 84 / UTM zone 33N) as the SRS. The example transforms this
GEOMETRY object to use EPSG:3857 (Web Mercator).

```sqlexample
-- Set the output format to EWKT
ALTER SESSION SET GEOMETRY_OUTPUT_FORMAT='EWKT';

SELECT
  ST_TRANSFORM(
    ST_GEOMFROMWKT('POINT(389866.35 5819003.03)', 32633),
    3857
  ) AS transformed_geom;
```

```output
+---------------------------------------------------------------+
| transformed_geom                                              |
|---------------------------------------------------------------|
| SRID=3857;POINT(1489140.093765644 6892872.198680112)          |
+---------------------------------------------------------------+
```

For more information, see [ST_TRANSFORM](../../sql-reference/functions/st_transform.md).

### Vectorized Python UDTFs — *Preview*

With this release, we are pleased to announce the preview of Vectorized Python UDTFs (user-defined table functions).

Vectorized Python UDTFs enable seamless partition-by-partition processing by operating on partitions as pandas DataFrames and returning
results as pandas DataFrames or lists of pandas Series or arrays. Vectorized Python UDTFs allow for easy integration with libraries that
operate on pandas DataFrames or pandas arrays.

For more information, see [Vectorized Python UDTFs](../../developer-guide/udf/python/udf-python-tabular-vectorized.md).

## Data Collaboration Updates

### Recurring Subscription-based Pricing Plans for Paid Listings — *Preview*

With this release, we are pleased to announce the preview of recurring subscription-based pricing plans for paid listings. With this plan,
you can bill consumers upfront on a recurring basis for access to your listing.

For more information, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).

### Non-Recurring Subscription-based Pricing Plans for Paid Listings — *General Availability*

With this release, we are pleased to announce the general availability of one time subscription-based pricing plans for paid listings. With
this plan, you only need to bill consumers once for access to your listing, with no option to repurchase or renew.

For more information, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).

## Documentation and Learning Resources

### Weekly Release Notes in the Snowflake Documentation

With this release, we are pleased to announce an update to the format of the Release Notes in the Snowflake Documentation:

* Historically, we have published the details for each weekly release, including the version and release dates, only in the
  [Release Notes](https://community.snowflake.com/s/articles?tId=0TO0Z000000kHxAWAU) and
  [Announcements](https://community.snowflake.com/s/announcements) (in the Snowflake Community).
* Starting in August 2023:

  + We will no longer aggregate the [Release Notes](../new-features.md) (in the Snowflake Documentation) by month. Instead,
    we will document each weekly release separately by version and release dates, effectively replicating the format of the Release Notes
    in the Snowflake Community.
  + Additionally, we have backported this change to the monthly Release Notes for June 2023 and July 2023. The Release Notes prior to June
    2023 remain unchanged.
* Through the month of August, we will continue publishing the Release Notes weekly in the Snowflake Community.
* Planned for September 2023; however, this schedule is subject to change:

  + We will no longer publish detailed weekly Release Notes in the Snowflake Community.
  + We will publish detailed weekly Release Notes only in the Snowflake Documentation.

---
title: August 02, 2024 — Custom UI in Streamlit in Snowflake –— General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-02-sis.md
section: Release Notes
---

# August 02, 2024 — Custom UI in Streamlit in Snowflake –— *General Availability*

With this release, we are pleased to announce the general availability of Custom UI in Streamlit in Snowflake.

Custom UI enables customization of the look, feel, and front-end behavior of Streamlit in Snowflake apps.
This feature supports the following:

* Custom HTML and CSS using `unsafe_allow_html=True` in [st.markdown](https://docs.streamlit.io/library/api-reference/text/st.markdown).
* Iframed HTML, CSS, and JavaScript using [st.components.v1.html](https://docs.streamlit.io/develop/api-reference/custom-components/st.components.v1.html).

---
title: August 02, 2024 — ML Functions: Improved Error Messages in Classification
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-02-classification-errors.md
section: Release Notes
---

# August 02, 2024 — ML Functions: Improved Error Messages in Classification

We are pleased to announce improved error messages for the [Classification](../../../user-guide/ml-functions/classification.md)
ML Function. These new messages more clearly communicate the cause of the error and better suggest how to address the core
issue. The following table shows the most common error messages and their improved versions.

| Previous message | New message |
| --- | --- |
| Error: Evaluation Data is Not Available. Please train with evaluation enabled. | Evaluation data is not available. Please create a new model with the EVALUATE parameter set to TRUE. Try adding the following line when calling the model: CONFIG_OBJECT => {‘evaluate’: “TRUE”}. |
| {col} has a different type from training. Make sure each column has the same SQL type between training and prediction. | Your column {col} has a different SQL type in this dataset than in your training dataset. Make sure each column in this dataset matches the corresponding column’s type from your training data. Try casting the column to the type it was in the training data. |
| All values in the label column are NULL. | All values in your target column are NULL. The model requires non-NULL values in your target column to train successfully. Try picking a different target column. |
| test_fraction must be a number; test_fraction must be greater than 0 and less than 1 | Your test_fraction value is not valid. test_fraction must be greater than 0 and less than 1. Try entering a decimal value between 0 and 1 (i.e. 0.2). |
| evaluation test_fraction is too large to generate an evaluation dataset | Your test_fraction is too high. Try a smaller fraction. The test_fraction is likely really close to 1. Try something lower (i.e. 0.2). |
| Unable to create an evaluation dataset - only one class present in the evaluation training set. This may be because of a large class imbalance or because of test_fraction is too large. | Evaluation data is not available. If one class (or category) significantly outweighs the rest, the model may not be able to complete training. Check that each distinct class in your target column appears many times. If the above is not an issue, try lowering your test_fraction. |

---
title: August 02, 2024 — Snowflake Native App Framework release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-02-na-spcs-laf.md
section: Release Notes
---

# August 02, 2024 — Snowflake Native App Framework release notes

We are pleased to announce preview availability for Cross-Cloud Auto-Fulfillment in a Snowflake Native App with Snowpark Container Services.
See [Auto-fulfillment for listings](../../../collaboration/provider-listings-auto-fulfillment.md) for more information.

---
title: August 02, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-08-02.md
section: Release Notes
---

# August 02, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Cortex Search Service –— *Preview*

With this release, we are pleased to announce the preview of the Cortex Search in Snowsight.
Cortex Search enables low-latency, high-quality “fuzzy” search over your Snowflake data. Now you can create your own search service
through Snowsight without writing any SQL.

For more details, see [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

---
title: August 05, 2024 — Data Dictionary Data Preview — Generally available
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-08-05.md
section: Release Notes
---

# August 05, 2024 — Data Dictionary Data Preview — *Generally available*

We are pleased to announce the general availability of Data Preview in Data Dictionary.

With this release, Snowflake periodically runs [data classification](../../../user-guide/classify-intro.md) on data previews to identify and
mask any column with a high likelihood of containing PII and other sensitive data. Once Snowflake identifies and masks a PII column, an
email is sent to the technical contact listed in the provider profile to review the details.

For more information, see [Mask PII and other data in data previews](https://other-docs.snowflake.com/en/collaboration/provider-listings-reference#label-mask-pii-and-other-data-in-data-previews).

If you have additional questions, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

---
title: August 06, 2024 — Snowflake Native App Framework: Support for VPS on AWS
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-06-na-vps-aws.md
section: Release Notes
---

# August 06, 2024 — Snowflake Native App Framework: Support for VPS on AWS

We are pleased to announce general availability for Snowflake Native App Framework support for Virtual Private Snowflake (VPS)
on Amazon Web Services. See [Limitations on Virtual Private Snowflake (VPS)](../../../developer-guide/native-apps/limitations.md)
for more information.

---
title: August 06, 2024: Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-06-document-ai.md
section: Release Notes
---

# August 06, 2024: Document AI release notes

With this release, we are pleased to announce the availability of a new version of the Arctic-TILT model.
The new model version includes the following improvements:

* Doubling length of the answers provided by the model. The model can now return answers that are up to 256 tokens long (about 160 words).
* Improving training time.

You can now use the new model by creating a new Document AI model build.

---
title: August 07-08, 2023 — 7.27 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_27.md
section: Release Notes
---

# August 07-08, 2023 — 7.27 Release Notes

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Account Usage: New CLASS_INSTANCES View

With this release, we are pleased to announce the CLASS_INSTANCES view in the Account Usage schema of the shared SNOWFLAKE database. This
view returns one row for each class instance in the account.

A [class](../../sql-reference/snowflake-db-classes.md) is an extensible object type that serves as a blueprint for creating instances. You
can create an instance of a class and execute its methods (procedures and functions) to take advantage of the advanced functionality that
classes provide. For a list of available classes, see the [SQL class reference](../../sql-reference-classes.md).

For more information, see [CLASS_INSTANCES view](../../sql-reference/account-usage/class_instances.md).

## SQL Updates

### New System Stored Procedure for Sending Email Notifications — *General Availability*

With this release, we are pleased to announce the general availability of the SYSTEM$SEND_EMAIL() system stored procedure for sending email
notifications. You can call this stored procedure to send an email notification from a task, your own stored procedure, or an interactive
session.

For more information, see [Using SYSTEM$SEND_EMAIL to send email notifications](../../user-guide/notifications/email-stored-procedures.md).

## Web Interface Updates

### Sharing: Improved UI Messaging

We are pleased to announce that the Snowsight Share dialog for worksheets, folders, and dashboards now provides information about
which roles can view the latest worksheet or dashboard tile results. This update does not introduce any behavior changes.

Previously:
:   The Share dialog used the following description: “Viewers without the role can duplicate and run under their own roles.”

Currently:
:   The Share dialog displays the following information to help you understand who has access to results when you share a worksheet,
    folder, or dashboard.

    * The role most recently used to run the worksheet or dashboard tile is required to view the latest results.
    * When you use a secondary role to generate results, the results are visible to any recipient with the role.
    * Viewers without the role can duplicate and run the worksheet or dashboard tile using their own roles.

---
title: August 07-08, 2024 — 8.29 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_29.md
section: Release Notes
---

# August 07-08, 2024 — 8.29 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Session policies: Support added for secondary roles

With this release, we are pleased to announce support for specifying secondary roles in a session policy. The `ALLOWED_SECONDARY_ROLES`
property of a session policy enables you to scope the set of secondary roles available to a user for the duration of the session. After
you set the session policy on the account or a user in the account, the enforcement of the secondary roles occurs when the Snowflake
session begins.

Depending on how you configure the property and whether you set the session policy on your account or a user in the account, you can do
the following:

* Allow secondary roles in the session.
* Disallow secondary roles in the session.
* Allow only specific secondary roles.

The user can activate all of the allowed secondary roles or a list of secondary roles that are specified by the session policy with the
USE SECONDARY ROLES command.

For more information, see:

* [Using session policies](../../user-guide/session-policies-using.md)
* [CREATE SESSION POLICY](../../sql-reference/sql/create-session-policy.md)
* [USE SECONDARY ROLES](../../sql-reference/sql/use-secondary-roles.md)

## Extensibility updates

### Python user-defined aggregate functions — *General availability*

With this release, Snowflake is pleased to announce the general availability of support for writing user-defined aggregate functions (UDAFs)
with a Python handler.

You can use Snowpark Python APIs to create and call user-defined aggregate functions (UDAFs), which take one or more rows as input and produce a
single row of output. A UDAF operates on values across multiple rows to perform mathematical calculations such as sum, average, counting, finding
minimum or maximum values, standard deviation, and estimation, as well as some non-mathematical operations.

For more information, see:

* [Python user-defined aggregate functions](../../developer-guide/udf/python/udf-python-aggregate-functions.md) (for SQL- and Python-based instructions)
* [Creating User-Defined Aggregate Functions (UDAFs) for DataFrames in Python](../../developer-guide/snowpark/python/creating-udafs.md) (for a Snowpark Python-based guide)

### Access to Git repositories from Snowflake — *General availability*

With this release, we are pleased to announce the general availability of access to Git repositories from within Snowflake. Once you configure
Snowflake to act as a client of your Git repository, you can fetch a full clone of your remote repository to a Snowflake Git repository clone, which
represents a local repository. You can reference these fetched files in procedure and function handler code, execute SQL and Python code in
Snowflake, copy file contents into Snowflake worksheets, and more.

For more information, see [Using a Git repository in Snowflake](../../developer-guide/git/git-overview.md).

## Data lake updates

### Apache Iceberg™ tables: Support for government regions — *General availability*

With this release, we are pleased to announce the general availability of support for
connecting Snowflake to external storage for Apache Iceberg™ tables in [government regions](../../user-guide/intro-regions.md).

For more information, see [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 02-Aug-24 |
| *Iceberg tables: Support for government regions* | **Added** to *Data lake updates* section | 05-Aug-24 |

---
title: August 08, 2024 — Cross-region inference for Snowflake AI & ML features — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-08-cross-region-llm.md
section: Release Notes
---

# August 08, 2024 — Cross-region inference for Snowflake AI & ML features — *General Availability*

With this release, we are pleased to announce the availability of cross-region inferencing for Snowflake AI & ML features. A new
parameter enables processing inference requests in a different region if the request cannot be processed in the region where the inference
is originally requested.

The parameter is used to determine the inference behavior for any Snowflake feature supported by cross-region inference, including
Cortex LLM Functions.

To learn more about the parameter, see [Cross-region inference](../../../user-guide/snowflake-cortex/cross-region-inference.md).

---
title: August 08, 2024 — RANGE BETWEEN window frames with explicit offsets — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-08-range-between-ga.md
section: Release Notes
---

# August 08, 2024 — RANGE BETWEEN window frames with explicit offsets — *General Availability*

With this release, we are pleased to announce the general availability of RANGE BETWEEN window frames with explicit
offsets. A range-based window frame consists of a logically computed set of rows. By using a range-based frame with
explicit offsets, such as `RANGE BETWEEN 3 PRECEDING AND 3 FOLLOWING`, you can easily compute rolling
calculations, such as moving sums and averages, over time-series data. Because of the range-based frame, these
calculations are not disrupted by gaps in the data set.

For details about using this feature, see [Window function syntax and usage](../../../sql-reference/functions-window-syntax.md).

---
title: August 08, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-08-dcr.md
section: Release Notes
---

# August 08, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Support for external tables and Apache Iceberg™ tables

Providers and consumers can now include external tables and Apache Iceberg™ tables in a clean room.

This support also allows Snowflake to use external tables when a collaborator uses a third-party connector to access data from external
cloud storage, which eliminates problems associated with materializing large datasets.

For more information about including external tables and Iceberg tables in a clean room, see [Registering data](../../../user-guide/cleanrooms/register-data.md).

## Integration with TransUnion TruAudience Identity

Providers and consumers can now use TransUnion’s latest TruAudience Identity solution when creating or installing a clean room in the web
app, which allows them to use the TransUnion identity graph to match records based on a collaboration ID.

For more information, see the following:

* If you are an administrator who is configuring the connector so clean room users can leverage TruAudience Identity, see
  [TransUnion TruAudience Identity connector](../../../user-guide/cleanrooms/connector-identity.md).
* If you are a clean room user who is using the Identity Hub during clean room creation or installation, see
  [TransUnion TruAudience Identity connector](../../../user-guide/cleanrooms/connector-identity.md).

---
title: August 09, 2024 — Streamlit in Snowflake on AWS GovCloud –— General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-09-sis.md
section: Release Notes
---

# August 09, 2024 — Streamlit in Snowflake on AWS GovCloud –— *General Availability*

With this release, we are pleased to announce the general availability of Streamlit in Snowflake on AWS GovCloud.

For more information, see [About Streamlit in Snowflake](../../../developer-guide/streamlit/about-streamlit.md).

---
title: August 11-14, 2024 — 8.30 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_30.md
section: Release Notes
---

# August 11-14, 2024 — 8.30 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Outbound private connectivity with Azure External Network Access and External Functions — *Preview*

With this release, we are pleased to announce support for Snowflake accounts on Microsoft Azure to use outbound private
connectivity with two features:

* External Network Access
* External Functions

Outbound private connectivity enables you to use Azure Private Link from the VNet that hosts your Snowflake account to
connect to an Azure resource using Azure Private Link.

You can configure external network access to use Azure Private Link to connect to external service from UDF/UDTF or stored procedures
within Snowpark when you call the stored procedure to connect to the external location. The hostname of the external service is used
to provision a private endpoint. The network rule of type `PRIVATE_HOST_PORT` enables the external access integration to use
Azure Private Link. The hostname and the external access integration are then specified in the stored procedure that you create. This
allows you to call the stored procedure in Snowflake and use Azure Private Link to connect to the external service.

You can configure external functions in Snowflake to use Azure Private Link to connect to the external service via Azure API Management,
using both the Azure Portal and the Azure ARM template. Your Azure subscription and hostname for the API Management service are used to
map your external service to the private endpoint that you provision. These are the same values that you specify in the API integration
for the external function. This allows you to call an external function in Snowflake and use Azure Private Link to connect to the
external service.

For more information, see:

* [Private connectivity with external functions: Azure ARM template](../../developer-guide/external-network-access/creating-using-private-azure.md)
* [Private connectivity with external functions: Azure Portal](../../sql-reference/external-functions-creating-azure-template-private-connect.md)
* [External network access and private connectivity on Microsoft Azure](../../sql-reference/external-functions-creating-azure-ui-private-connect.md)
* [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md)

### Full-text search - *Preview*

With this release, we are pleased to announce the preview of a new full-text search feature which is now available. To use full-text search, call a new [SEARCH](../../sql-reference/functions/search.md) function to find character data
(text) in specified columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. In most cases, you call the
SEARCH function by specifying it in the SELECT list or the WHERE clause of a SELECT statement.

The SEARCH function supports token-based text search across multiple columns (or all columns) of one or more tables, which is a good
solution for the following use cases:

* Searching for text in data with an inherent structure, where tokens naturally correspond to words, fields, or message components.
  Token searches can exactly match the specified text in a large amount of data, which results in fewer false positives and simpler queries.
  For example, a token search for “unauthorized access” in the system logs finds case-insensitive instances of “unauthorized” and “access”
  but does not find instances of “authorized” or “accessible.”
* Searching for text without knowing the exact location of relevant data. Because full-text search supports wildcard searches, you can search
  for relevant text in a set of columns or entire tables without writing complex SQL queries. For example, you can use full-text search to
  search for a list of email addresses and usernames in a table.

To improve the performance of full-text search queries, you can optionally enable FULL_TEXT search optimization on a specific column or set of
columns in a table. To do so, run an ALTER TABLE…ADD SEARCH OPTIMIZATION ON FULL_TEXT statement.

For more information about full-text search, see [Using full-text search](../../user-guide/querying-with-search-functions.md). For more information
about search optimization for full-text search queries, see [Enabling and disabling search optimization](../../user-guide/search-optimization/enabling.md).

## SQL updates

### Setting users as SNOWFLAKE_SUPPORT users no longer supported

With this release, you can no longer set a user’s SUPPORT_USER attribute using the CREATE USER or ALTER USER commands.

Users with SNOWFLAKE_SUPPORT set to TRUE remain support users until you drop them. Snowflake can access these users through support
processes.

### RANGE BETWEEN with explicit offsets: Additional window functions supported

With this release, we are pleased to announce that the following additional window functions support RANGE BETWEEN window frames with
explicit offsets:

* [STDDEV, STDDEV_SAMP](../../sql-reference/functions/stddev.md), [STDDEV_POP](../../sql-reference/functions/stddev_pop.md) (and aliases)
* [VARIANCE , VARIANCE_SAMP](../../sql-reference/functions/variance.md), [VARIANCE_POP](../../sql-reference/functions/variance_pop.md) (and aliases)
* [COUNT_IF](../../sql-reference/functions/count_if.md)

For example, you can calculate standard deviation values for a column and specify a
`RANGE BETWEEN 3 PRECEDING AND 3 FOLLOWING` window frame.

For more information about window frame syntax, see [Syntax](../../sql-reference/functions-window-syntax.md).

### UNDROP command: Support for restoring objects using ID

With this release, we are pleased to announce support for the UNDROP command to restore tables, schemas, and databases using an object ID. For
example, if you have dropped multiple tables with the same name, you can use this feature to restore a specific table using the table ID. The
table is restored with its original name.

For more information, see the following topics:

* [UNDROP TABLE](../../sql-reference/sql/undrop-table.md)
* [UNDROP SCHEMA](../../sql-reference/sql/undrop-schema.md)
* [UNDROP DATABASE](../../sql-reference/sql/undrop-database.md)

### Wildcard filtering for functions

When you specify a wildcard (`*`) as an argument in a call to a function, you can now use the ILIKE and EXCLUDE keywords for filtering in a
SELECT list or GROUP BY clause.

For example, the following call to the [COUNT](../../sql-reference/functions/count.md) function is now valid:

```sqlexample
SELECT COUNT(* ILIKE 'col1%') FROM mytable;
```

The following call to the [OBJECT_CONSTRUCT](../../sql-reference/functions/object_construct.md) function is also valid:

```sqlexample
SELECT OBJECT_CONSTRUCT(* EXCLUDE col1) AS oc FROM mytable;
```

The ILIKE and EXCLUDE keywords are now also valid in object constants. For example:

```sqlexample
SELECT {* ILIKE 'col1%'} FROM mytable;

SELECT {* EXCLUDE col1} FROM mytable;
```

For more information, see [OBJECT constants](../../sql-reference/data-types-semistructured.md).

## Data loading / unloading updates

### Loading unstructured data with Document AI — *Preview*

With this release, we are pleased to announce the preview of loading unstructured data with Document AI. By integrating with Document AI,
Snowflake now supports loading unstructured data, similar to loading structured and semi-structured data. To load unstructured data with this
preview feature, you can run the same COPY INTO table command with a new copy option `file_processor`.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 09-Aug-24 |
| *RANGE BETWEEN with explicit offsets: Additional window functions supported* | **Added** to *SQL updates* section | 12-Aug-24 |
| *Setting users as SNOWFLAKE_SUPPORT users no longer supported* | **Added** to *SQL updates* section | 15-Aug-24 |

---
title: August 14, 2024 — Cortex Analyst –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-14-cortex-analyst.md
section: Release Notes
---

# August 14, 2024 — Cortex Analyst –— *Preview*

With this release, we are pleased to announce the preview of Cortex Analyst, a fully-managed
[Snowflake Cortex](https://www.snowflake.com/en/data-cloud/cortex/) feature that enables you to create applications
capable of reliably answering business questions based on your structured data in Snowflake. With Cortex Analyst,
business users can ask questions in natural language and receive direct answers without writing SQL.

Available as a convenient REST API, Cortex Analyst can be seamlessly integrated into any application, empowering data
teams and developers to customize how and where users interact with results, while still benefiting from Snowflake’s
integrated security and governance features, including role-based access controls (RBAC), to protect valuable data.

To deliver high text-to-SQL accuracy, Cortex Analyst uses an agentic AI setup powered by state-of-the-art LLMs and a
user-defined [semantic model](../../../user-guide/views-semantic/sql.md). With
Cortex Analyst, you can eliminate the complexities of model selection, architecture maintenance, GPU capacity planning,
and more, accelerating the deployment of reliable conversational self-serve analytics solutions and significantly
lowering total cost of ownership (TCO).

For more information, see [Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md).

---
title: August 16, 2024 — Snowflake Native App Framework: Support for government regions on AWS
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-16-na-gov-cloud.md
section: Release Notes
---

# August 16, 2024 — Snowflake Native App Framework: Support for government regions on AWS

We are pleased to announce general availability for GovCloud
on Amazon Web Services. See [Limitations on government regions](../../../developer-guide/native-apps/limitations.md)
for more information.

---
title: August 16-17, 2023 — 7.28 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_28.md
section: Release Notes
---

# August 16-17, 2023 — 7.28 Release Notes

## New Features

### Blocking Public Access to Azure Internal Stages — *General Availability*

With this release, we are pleased to announce the general availability of a new function, SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS, that
allows an account administrator to block all public network traffic from accessing the internal stage of an Azure account. By blocking all
public IP addresses, the administrator ensures that all successful requests to the internal stage originate from a Private Endpoint and
traverse Azure Private Link rather than a public network.

Two additional functions complement this feature:

* The SYSTEM$INTERNAL_STAGES_PUBLIC_ACCESS_STATUS function determines whether public access to the Azure internal stage is currently
  blocked.
* The SYSTEM$UNBLOCK_INTERNAL_STAGES_PUBLIC_ACCESS allows public access to an internal stage that was previously blocked.

For more information about using these functions, see [Blocking public access — Recommended](../../user-guide/private-internal-stages-azure.md).

## SQL Updates

### Python Package Version Range Support — *Preview*

When you create a Python UDF or stored procedure, you can now specify a range of package versions in the PACKAGES section of the handler
definition. You can specify a particular package version or a range of versions by using version specifiers such as: `==`, `<=`,
`>=`, `<`, or `>`.

For more information, see [CREATE FUNCTION](../../sql-reference/sql/create-function.md) and [CREATE PROCEDURE](../../sql-reference/sql/create-procedure.md).

## Data Loading Updates

### New File Format Option: USE_LOGICAL_TYPE

With this release, we are pleased to announce that the COPY INTO command supports a new file format option USE_LOGICAL_TYPE for Parquet.

With this new file format option, Snowflake can interpret Parquet logical types during data loading. To enable Parquet logical types, set
USE_LOGICAL_TYPE to TRUE when you create a new file format option.

For more information, see [Parquet Logical Type Definitions](https://github.com/apache/parquet-format/blob/master/LogicalTypes.md) and
[CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md).

## Web Interface Updates

### Snowsight Worksheet Tabs — *General Availability*

With this release, we are pleased to announce the general availability of opening worksheets in tabs in Snowsight.

Opening Snowsight worksheets in tabs lets you mimic the experience in Classic Console. You can use tabs to refer to multiple active
worksheets and explore the databases and schemas in Snowflake while writing SQL or Snowpark Python.

For more information, see [Opening worksheets in tabs](../../user-guide/ui-snowsight-worksheets-gs.md).

---
title: August 19-21, 2024 — 8.31 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_31.md
section: Release Notes
---

# August 19-21, 2024 — 8.31 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Data lake updates

### Snowflake Open Catalog: New system function for troubleshooting issues with syncing Snowflake-managed Apache Iceberg™ tables - *Preview*

With this release, we are pleased to announce the public preview of the system function [SYSTEM$SEND_NOTIFICATIONS_TO_CATALOG](../../sql-reference/functions/system_send_notifications_to_catalog.md).
This function sends a notification to Snowflake Open Catalog, and if the send fails it returns an error message explaining why.
This error message is helpful for diagnosing why a Snowflake-managed Apache Iceberg™ table isn’t syncing to a Open Catalog.

### Apache Iceberg™ tables: Support for time travel queries using third-party engines — *General availability*

With this release, we are pleased to announce the general availability of support for time travel queries on Snowflake-managed
Apache Iceberg™ tables when you use a third-party compute engine like Apache Spark.

For more information, see [Time travel](../../user-guide/tables-iceberg-metadata.md) for Iceberg tables.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 16-Aug-24 |
| *Iceberg tables: Support for time travel queries using third-party engines* | **Added** to *Data lake updates* section | 23-Aug-24 |

---
title: August 20, 2024 — Cortex LLM Functions — Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-20-new-region-llama-405b.md
section: Release Notes
---

# August 20, 2024 — Cortex LLM Functions — Release Notes

With this release, we are pleased to announce the availability of the `llama3.1-405b` model
in the following additional Amazon Web Services (AWS) region:

| Cloud Region | Cloud Region ID |
| --- | --- |
| US East 1 (N. Virginia) | us-east-1 |

For a full list of regions where the `llama3.1-405b` and other models are available, see
[Regional availability](../../../user-guide/snowflake-cortex/aisql.md).

---
title: August 20, 2024 — Differential Privacy — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-16-diff-privacy.md
section: Release Notes
---

# August 20, 2024 — Differential Privacy — *Preview*

With this release, we are pleased to announce the preview of differential privacy in Snowflake.

Differential privacy is a widely recognized standard for data privacy that limits the risk that someone could leak sensitive information
from a sensitive dataset, even if they are carrying out a targeted privacy attack. Data providers implement differential privacy by
assigning privacy policies to their sensitive tables and views. As analysts query the protected data, Snowflake uses rigorous mathematics to
ensure that they cannot identify individuals and entities in the dataset to an unacceptable degree of certainty.

For more information, see [Differential privacy in Snowflake](../../../user-guide/diff-privacy/differential-privacy-overview.md).

---
title: August 22-23, 2023 – 7.29 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2023/7_29.md
section: Release Notes
---

# August 22-23, 2023 – 7.29 Release Notes (with behavior changes)

## Behavior Changes Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2023_06](../bcr-bundles/2023_06_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_05](../bcr-bundles/2023_05_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_04](../bcr-bundles/2023_04_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for September; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## Non-bundled Pending Behavior Changes

The following changes are pending, but are not part of a bundle, and therefore cannot be enabled for testing:

* [Stronger UTF-8 validation for external files](../bcr-bundles/un-bundled/bcr-1013-1014.md)

## SQL Updates

### GET_QUERY_OPERATOR_STATS Function — *General Availability*

With this release, we are pleased to announce the general availability of the GET_QUERY_OPERATOR_STATS system function, which provides
programmatic access to the query profile. For more information, see
[GET_QUERY_OPERATOR_STATS](../../sql-reference/functions/get_query_operator_stats.md).

### Using the Query Hash to Identify Patterns and Trends in Queries

Snowflake now provides the hash of the query text in the views and table functions that provide historical data on queries. You can use the
hash of the query text to identify, group, and analyze similar queries in the query history.

The following Account Usage views and Information Schema table functions now include a hash of the canonicalized SQL text of the query:

* The following Account Usage views:

  + [QUERY_HISTORY view](../../sql-reference/account-usage/query_history.md)
  + [QUERY_ACCELERATION_ELIGIBLE view](../../sql-reference/account-usage/query_acceleration_eligible.md)
  + [TASK_HISTORY view](../../sql-reference/account-usage/task_history.md)
* The following Information Schema table functions:

  + [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../../sql-reference/functions/query_history.md)
  + [TASK_HISTORY](../../sql-reference/functions/task_history.md)

Note that these columns are present only when the 2023_06 bundle is enabled.

For more information, see [Using the Query Hash to Identify Patterns and Trends in Queries](../../user-guide/query-hash.md).

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_MAX](../../sql-reference/functions/array_max.md) | Given an input ARRAY, returns the element with the highest value that is not a SQL NULL. |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_MIN](../../sql-reference/functions/array_min.md) | Given an input ARRAY, returns the element with the lowest value that is not a SQL NULL. |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_SORT](../../sql-reference/functions/array_sort.md) | Returns an ARRAY that contains the elements of the input ARRAY sorted in ascending or descending order. |

---
title: August 26, 2024 — Easier Training of Forecasting Models from Real-World Data
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-26-forecasting-preprocessing.md
section: Release Notes
---

# August 26, 2024 — Easier Training of Forecasting Models from Real-World Data

We are pleased to announce that the Time-Series Forecasting ML Function now includes preprocessing features that allow
you to successfully train a forecasting model even when your training data has missing, duplicate, or misaligned time
steps. In the past, such issues, which are common in real-world data, typically prevented the model from being trained.
These features are:

* You can manually specify an event cadence in case the model fails to infer it or infers it incorrectly
* The model can interpolate missing target values from nearby time steps.
* The model can aggregate dimensional values from events occurring outside the canonical event cadence in a number of
  ways, and you can specify aggregation behaviors for the type of value or per column.

A relatively small number of such corrections does not noticeably affect prediction accuracy.

For more information, see [Dealing with real-world data in Time-Series Forecasting](../../../user-guide/ml-functions/preprocessing.md).

---
title: August 26, 2024 — Time Series ML Functions — Error Message Improvements
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-26-time-series-error-message.md
section: Release Notes
---

# August 26, 2024 — Time Series ML Functions — Error Message Improvements

We are pleased to announce improved error messages for the [Forecasting](../../../user-guide/ml-functions/forecasting.md) and
[Anomaly Detection](../../../user-guide/ml-functions/anomaly-detection.md) ML Functions. These error messages previously
included internal debug traces that were not relevant to end users. Error messages from these functions now contain only
actionable information.

---
title: August 26-30, 2024 — 8.32 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_32.md
section: Release Notes
---

# August 26-30, 2024 — 8.32 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_07](../bcr-bundles/2024_07_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_06](../bcr-bundles/2024_06_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_05](../bcr-bundles/2024_05_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for September 2024; however, this schedule
is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| String & binary | SEARCH_IP (Preview) | Searches for valid IPv4 addresses in specified character-string columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. |

## Data pipeline updates

### Tasks: A new option for ALTER TASK

With this release, the ALTER TASK command supports a new option, `REMOVE WHEN`. You can use this option to easily remove the task’s condition. The syntax will be as follows:

```sqlsyntax
ALTER TASK [ IF EXISTS ] <name> REMOVE WHEN
```

For more information, see [ALTER TASK](../../sql-reference/sql/alter-task.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 23-Aug-24 |

---
title: August 28, 2024 — Snowflake ML Functions: Top Insights Preview Update
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-28-top-insights-preview-refresh.md
section: Release Notes
---

# August 28, 2024 — Snowflake ML Functions: Top Insights Preview Update

We are pleased to announce updates to the Top Insights ML Function for key driver analysis, which was already in
preview. Top Insights lets you easily identify drivers of a metric’s change over time or explain differences in a
metric among various verticals. With a few lines of SQL, you can integrate Top Insights into your business intelligence
workflows to automatically monitor segments responsible for changes in any metric.

The changes in this release make using Top Insights more like using the other ML Functions while simplifying output and
improving its explanatory ability. You can also now use Top Insights to analyze non-time-series data.

For more information, see [Top Insights](../../../user-guide/ml-functions/top-insights.md).

---
title: August 28-29, 2023 — 7.30 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_30.md
section: Release Notes
---

# August 28-29, 2023 — 7.30 Release Notes

## New Features

### Data Pipelines Replication Support — *Preview*

With this release, we are pleased to announce the preview of Data Pipelines replication support, including the replications of stages,
storage integrations, pipes, and load history. You can replicate these objects to configure failover for data pipelines across
[regions](../../user-guide/intro-regions.md) and [cloud platforms](../../user-guide/intro-cloud-platforms.md).

Before you can replicate data pipeline objects, you must set at the replication/failover group or account level the
`enable_etl_replication` parameter to TRUE. To replicate any external stages that use a storage integration, you must also
configure your replication/failover group to replicate STORAGE INTEGRATIONS.

You can use an [ALTER REPLICATION GROUP](../../sql-reference/sql/alter-replication-group.md) or [ALTER FAILOVER GROUP](../../sql-reference/sql/alter-failover-group.md) statement to modify
these properties for an existing group.

For more information, see [Stage, pipe, and load history replication](../../user-guide/account-replication-stages-pipes-load-history.md).

## Security Updates

### Password policies: Add support for password history and time to wait to change a password

With this release, we are pleased to announce support for password history values and the minimum number of days before you can change
a password in a password policy:

* The PASSWORD_HISTORY property in a password policy specifies the number of passwords that Snowflake stores. When a user changes their
  password, the new password cannot match any of the values in the history. If you increase the history value, such as changing the value
  from 3 to 6, Snowflake stores the three existing values. If you decrease the history value, such as changing from 6 to 3, Snowflake stores
  the three most recent values and deletes the three oldest values.
* The PASSWORD_MIN_AGE_DAYS property in a password policy specifies the number of days the user must wait before a recently changed password
  can be changed again. This value helps to ensure that the password history is not exhausted too soon.

These two properties should be set together in a password policy with values that align with your internal security practices. You can
specify these property values when you create a password policy or modify an existing password policy.

For details, see [CREATE PASSWORD POLICY](../../sql-reference/sql/create-password-policy.md) and [ALTER PASSWORD POLICY](../../sql-reference/sql/alter-password-policy.md).

## SQL Updates

### EXECUTE IMMEDIATE FROM File — *Preview*

With this release, we are pleased to announce the preview of the EXECUTE IMMEDIATE FROM command. This command executes the SQL statements
in a file on a stage. The file must contain syntactically valid SQL statements.

This feature provides a mechanism to control the deployment and management of your Snowflake objects and code. You can use the EXECUTE
IMMEDIATE FROM command to execute scripts in any session.

For more information, see [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).

### Organizations & Accounts: Dropping an account URL — *Preview*

With this release, we are pleased to announce that organization administrators can use the
[ALTER ACCOUNT … DROP OLD ORGANIZATION URL](../../sql-reference/sql/alter-account.md)
command to drop an old [account URL](../../user-guide/organizations-connect.md) that was saved when Snowflake Customer Support performed any of the
following actions:

* Renamed the organization.
* Merged two organizations.
* Moved an account from one organization to another.

An old account URL is dropped automatically 90 days after Snowflake Customer Support performs one of these actions, but the organization
administrator can now drop it sooner.

## Developer and Extensibility Updates

### Support for Python 3.9 and 3.10 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*

With this release, we are pleased to announce the general availability of support for Python 3.9 and 3.10 in Snowpark Python, Python UDFs,
Python UDTFs and Python stored procedures.

For more information, see:

* [Setting up your development environment for Snowpark Python](../../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)

### Tabular Return Values from Python Stored Procedures — *General Availability*

With this release, we are pleased to announce the general availability of tabular stored procedures with a handler written in Python. You
can write a procedure that returns data in tabular form. To do this, you specify the procedure’s return type as TABLE (specifying columns for
the return value), then have your handler code return the tabular value in a Snowpark dataframe.

For more information, see [Python](../../developer-guide/stored-procedure/python/procedure-python-tabular-data.md).

## Data Governance Updates

### Set a masking policy on a virtual column — *Preview*

With this release, we are pleased to announce that you can set a masking policy on a virtual column in an external table. This update
allows the masking policy on the virtual column to override the masking policy that the virtual column inherits from the VALUE column. This
update simplifies external table management because data administrators no longer need to create a view from the semi-structured data in the
VALUE column and protect the view, and provides consistent data management and protection of the external table data because the protected
virtual column does not expose data unnecessarily.

For details, see [Masking policies and external tables](../../user-guide/security-column-intro.md).

## Web Interface Updates

### Governance area supports GOVERNANCE_VIEWER and OBJECT_VIEWER database roles

With this release, we are pleased to announce that an account role can access the Governance area of Snowsight if the role has been
granted the GOVERNANCE_VIEWER and OBJECT_VIEWER database roles. These database roles exist in the shared SNOWFLAKE database. By granting
these database roles to an account role, it is no longer necessary to use the ACCOUNTADMIN role to access the Governance area of Snowsight.
This update simplifies the management to access the Governance area of Snowsight.

### Provider Studio Onboarding — *General Availability*

With this release, we are pleased to announce the general availability of self-service onboarding to become a provider of listings
using Provider Studio.

For more details, see [Use listings as a provider](../../collaboration/provider-becoming.md).

---
title: August 29, 2024 — Cortex Analyst: New regions
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-29-cortex-analyst-new-regions.md
section: Release Notes
---

# August 29, 2024 — Cortex Analyst: New regions

We’re pleased to announce that [Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md) is now available in
the following additional regions:

* AWS ap-northeast-1 (Tokyo)
* Azure West Europe (Netherlands)

---
title: August 29, 2024 — New Mistral Large 2 model available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-29-mistral-large2.md
section: Release Notes
---

# August 29, 2024 — New Mistral Large 2 model available in Snowflake Cortex AI

We’re pleased to announce that Mistral AI’s Mistral Large 2 large language model (LLM) is now available for serverless inference in
[Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/).

Mistral Large 2 is a significantly more capable model than Mistral Large in math, reasoning, and coding with an increased context window
of 128K. It also has strong multilingual capabilities to simplify text processing and analytics of multi-language content.

For details, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: August 29, 2024 — New multilingual embedding models available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-29-multilingual-embed-models.md
section: Release Notes
---

# August 29, 2024 — New multilingual embedding models available in Snowflake Cortex AI

We’re pleased to announce that the text embedding functions in [Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/)
now support the following multilingual model:

* `multilingual-e5-large`

For additional details, see [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text_1024-snowflake-cortex.md).

---
title: August 29, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-29-dcr.md
section: Release Notes
---

# August 29, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## RSA authentication for the service account user

The service account user that the web app uses to interact with the Snowflake account now authenticates using Snowflake’s key-pair
authentication instead of username/password authentication, which provides a more secure method of authentication.

If the Snowflake account uses authentication policies to control which methods of authentication can be used to access the account, an
administrator must ensure that these authentication policies allow the service account user to authenticate with key-pair authentication.
For more information, see [Allow key-pair authentication](../../../user-guide/cleanrooms/admin-tasks.md).

## Activation for provider-run analyses

Providers can now push the results of a provider-run analysis to their own Snowflake account, where it can be used for activation. A
consumer who has shared data in the clean room can control whether the provider can push the results of the provider-run analysis.

For more information, see [Run an analysis as a provider](../../../user-guide/cleanrooms/v1/web-app-working.md).

---
title: August 30, 2024 — Query attribution costs
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-08-30-per-query-cost-attribution.md
section: Release Notes
---

# August 30, 2024 — Query attribution costs

## Account Usage: New QUERY_ATTRIBUTION_HISTORY view

The new QUERY_ATTRIBUTION_HISTORY view in the ACCOUNT_USAGE schema provides information about the warehouse cost for queries and
enables the attribution of query costs by tag, user, or query hash.

For more information, see [Viewing cost by tag in SQL](../../../user-guide/cost-attributing.md) and
[QUERY_ATTRIBUTION_HISTORY](../../../sql-reference/account-usage/query_attribution_history.md) view.

---
title: Authentication for local applications: Built-in security integration for Snowflake OAuth
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2056.md
section: Release Notes
---

# Authentication for local applications: Built-in security integration for Snowflake OAuth

Applications that use [Snowflake OAuth](../../../user-guide/oauth-snowflake-overview.md) to authenticate need a security integration that defines
the interface between Snowflake and the application. This change introduces a built-in, system-owned security integration that simplifies
how local desktop applications authenticate with Snowflake OAuth.

> **Note:**
>
> When this behavior change is enabled, the built-in security integration will be rolled out to accounts slowly, and might not be available
> immediately.

Before the change:
:   You must always create and configure a security integration if you want an application to use Snowflake OAuth to authenticate to
    Snowflake.

After the change:
:   Local, desktop applications that want to authenticate with Snowflake OAuth don’t have to create a security integration. The built-in
    security integration `SNOWFLAKE$LOCAL_APPLICATION` exists in all accounts. It is a security integration of type OAUTH.

    Because you don’t need to create a security integration, application developers can implement Snowflake OAuth without the assistance of a
    Snowflake administrator. But a security administrator can still configure the built-in security integration to control things like the
    validity duration of access tokens and whether to use single-use refresh tokens.

    Only local, desktop applications can authenticate using the built-in security integration. Other types of applications — for example,
    third-party web applications — must still create and configure a security integration if they want to authenticate with Snowflake OAuth.

## Benefits of the built-in integration

Using the `SNOWFLAKE$LOCAL_APPLICATION` security integration to authenticate local applications has the following benefits:

* Provides a straightforward authentication method that is an alternative to password authentication, helping you conform to the upcoming
  Snowflake requirement that human users use multi-factor authentication (MFA) if they authenticate with a password.
* Reduces administrative friction; no initial administrator action is required to use Snowflake OAuth.
* Improves the user experience for developers, especially those using the Snowflake CLI.
* Enables local applications to use OAuth as a singular authentication method, eliminating the need for complex configurations and making
  the authentication process mostly opaque to the application.
* Supports in-role session switching like other authentication methods. A user-defined Snowflake OAuth security integration doesn’t support
  in-role session switching.
* Isolates local applications from user credentials. This eliminates long-living credentials on disk, meaning sensitive authentication data,
  such as passwords or personal access tokens, aren’t persisted in an insecure manner.

Having a built-in security integration doesn’t weaken the security posture of your account, but rather combines an enhanced user
experience with the most secure form of local authentication. Creating new sessions within the window of OAuth refresh token validity is
equivalent to the existing pattern of using saved user credentials to create new sessions. In addition, administrators retain control over
authentication by using [authentication policies](../../../user-guide/authentication-policies.md) to dictate which credentials are allowed for
users.

## Opt out of the change

An account administrator can disable the `SNOWFLAKE$LOCAL_APPLICATION` security integration for an account. This action prevents local
applications from using Snowflake OAuth to authenticate unless an administrator creates their own security integration.

To opt out of this change so that security administrators can’t enable the `SNOWFLAKE$LOCAL_APPLICATION` security integration, run the
following commands:

```sqlexample
USE ROLE ACCOUNTADMIN;

ALTER ACCOUNT SET DISABLE_SNOWFLAKE_LOCAL_APPLICATION_INTEGRATION = TRUE;
```

Using the DISABLE_SNOWFLAKE_LOCAL_APPLICATION_INTEGRATION parameter to opt out doesn’t prevent the `SNOWFLAKE$LOCAL_APPLICATION`
integration from being created. The integration will still exist in your account, but its ENABLED property will be FALSE, and a security
administrator can’t change the property to TRUE.

The account administrator can set the DISABLE_SNOWFLAKE_LOCAL_APPLICATION_INTEGRATION parameter before the `SNOWFLAKE$LOCAL_APPLICATION`
integration is created in an account so that the integration is never enabled.

If the account administrator doesn’t use the account parameter to opt out, a security administrator can still disable the
`SNOWFLAKE$LOCAL_APPLICATION` integration after it is created. To disable the built-in security integration after it exists in the
account, run the following commands:

```sqlexample
USE ROLE SECURITYADMIN;

ALTER SECURITY INTEGRATION SNOWFLAKE$LOCAL_APPLICATION SET ENABLED = FALSE;
```

Ref: 2056

---
title: Authentication policies: Changes to the NETWORK_POLICY_EVALUATION property no longer affect existing sessions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2191.md
section: Release Notes
---

# Authentication policies: Changes to the NETWORK_POLICY_EVALUATION property no longer affect existing sessions

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

In an [authentication policy](../../../user-guide/authentication-policies.md), the NETWORK_POLICY_EVALUATION property determines how
network policies are evaluated when a [programmatic access token](../../../user-guide/programmatic-access-tokens.md) is used for
authentication. For more information, see [Network policy requirements](../../../user-guide/programmatic-access-tokens.md).

The way in which this property is read is changing:

Before the change:
:   The value of the NETWORK_POLICY_EVALUATION property is read when the session is first created and with each action performed.

    If you change the value of the NETWORK_POLICY_EVALUATION property, this change affects existing sessions.

After the change:
:   The value of the NETWORK_POLICY_EVALUATION property is read only when the session is first created.

    This value is kept in memory and is used for the entire session.

    If you change the value of the NETWORK_POLICY_EVALUATION property, this change does not affect existing sessions.

    The change only affects new sessions created after the behavior change bundle was enabled.

This change is being made for consistency. Other properties of an authentication policy are checked when a session is created,
not with each action performed.

Ref: 2191

---
title: Automatic Clustering: SYSTEM$CLUSTERING_INFORMATION Syntax and Output Changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-985.md
section: Release Notes
---

# Automatic Clustering: SYSTEM$CLUSTERING_INFORMATION Syntax and Output Changes

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

Currently you can use the SYSTEM$CLUSTERING_INFORMATION function to view Automatic Clustering errors that have
occurred in the last 14 days.

Previously:
:   Users cannot obtain descriptive messages for errors encountered during Automatic Clustering.

Currently:
:   * The JSON output of the SYSTEM$CLUSTERING_INFORMATION function includes a new field, `clustering_errors`, which contains an array of errors.
      Each error contains a timestamp and descriptive message.

      For example, the new output of the function might be:

      ```sqljson
      {
      "cluster_by_keys" : "LINEAR(i)",
      "notes" : "Clustering key columns contain high cardinality key I which
      might result in expensive re-clustering. Consider reducing the
      cardinality of clustering keys. Please refer to
      https://docs.snowflake.net/manuals/user-guide/tables-clustering-keys.html
      for more information.",
      "total_partition_count" : 0,
      "total_constant_partition_count" : 0,
      "average_overlaps" : 0.0,
      "average_depth" : 0.0,
      "partition_depth_histogram" : {
          "00000" : 0,
          // omitted for brevity
      },
      "clustering_errors" : [ {
          "timestamp" : "2023-04-03 17:50:42 +0000",
          "error" : "(003325) Clustering service has been disabled.\n"
      } ]
      }
      ```
    * By default, the 10 most recent messages are returned by the function. New function syntax allows you to specify an integer as the
      optional second argument in order to return more or fewer messages. For example, the following returns the 25 most recent errors:

      ```sqlexample
      SELECT SYSTEM$CLUSTERING_INFORMATION( 'my_table' , 25);
      ```

Ref: 985

---
title: Azure access: New VNET subnet IDs required for rules that filter based on subnet ID (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1955-2078.md
section: Release Notes
---

# Azure access: New VNET subnet IDs required for rules that filter based on subnet ID (Pending)

This behavior change applies only to customers who use Azure Virtual Network (VNet) subnet IDs in virtual network, policy, or firewall rules that filter traffic in Azure regions. If you don’t use the [VNET subnet IDs](../../../user-guide/data-load-azure-allow.md) feature offered in Snowflake Azure deployments, you can ignore this change.

Snowflake is expanding its support to include additional Azure VNet subnet IDs in some regions. We are doing this by setting up additional subnets and migrating customers to them after verifying readiness. We are verifying that customers have updated their subnet IDs before migrating them. We are doing this verification and migration through dedicated engagement with customers.

However, if you try to update your subnet IDs in these regions, you might encounter an error similar to `vnet-******** cannot have more than 200 tagged traffic consumers of service`. This is because, per Azure limits, a virtual network can be associated with a maximum of 200 different subscriptions and regions per supported service. This means that Snowflake customers can use a subnet ID queried from the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md) function in 200 Azure subscription/region combinations in aggregate. After a total of 200 subscriptions across all customers have used the subnet ID in a network rule, new attempts to use the subnet ID for another Azure subscription will fail.

To avoid encountering these errors, consider taking the following actions:

* If you are already a Business Critical customer, consider using [Private connectivity for outbound network traffic](../../../user-guide/private-connectivity-outbound.md).
* If you have an Azure Blob Storage account that has allowlisted Snowflake subnets in the firewall, you can use the same subscription and region to create a new storage account. You can then allowlist Snowflake subnets on this new storage account.
* Consider not [Allowing VNET subnet IDs](../../../user-guide/data-load-azure-allow.md). For more detailed information, see [Network security for Azure Key Vault](https://learn.microsoft.com/en-us/azure/key-vault/general/network-security) in Azure documentation.

Ref: 1995, 2078

---
title: BACKUP_OPERATION_HISTORY views: TIMESTAMP_LTZ always used for the start_time column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2200.md
section: Release Notes
---

# BACKUP_OPERATION_HISTORY views: TIMESTAMP_LTZ always used for the `start_time` column

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

In the [ACCOUNT_USAGE.BACKUP_OPERATION_HISTORY](../../../sql-reference/account-usage/backup_operation_history.md) and [ORGANIZATION_USAGE.BACKUP_OPERATION_HISTORY](../../../sql-reference/organization-usage/backup_operation_history.md) views, the `start_time` column behaves as follows:

Before the change:
:   The `start_time` column has the type [TIMESTAMP_LTZ](../../../sql-reference/data-types-datetime.md) or another type returned by the [TO_TIMESTAMP](../../../sql-reference/functions/to_timestamp.md) function, depending on the value of the [TIMESTAMP_TYPE_MAPPING](../../../sql-reference/parameters.md) parameter.

After the change:
:   The `start_time` column will have the type TIMESTAMP_LTZ, regardless of the setting of the TIMESTAMP_TYPE_MAPPING parameter.

This change is being made to ensure consistent behavior across these views and to align with standard Snowflake practices for TIMESTAMP columns in system views.

The `start_time` column in the following deprecated views will also have this change:

* [ACCOUNT_USAGE.SNAPSHOT_OPERATION_HISTORY](../../../sql-reference/account-usage/snapshot_operation_history.md)
* [ORGANIZATION_USAGE.SNAPSHOT_OPERATION_HISTORY](../../../sql-reference/organization-usage/snapshot_operation_history.md)

Ref: 2200

---
title: Behavior change announcements
source: https://docs.snowflake.com/en/release-notes/behavior-changes.md
section: Release Notes
---

# Behavior change announcements

To help you manage your operations and minimize disruption to your Snowflake service, we document behavior changes that may impact your usage,
including:

* Upcoming pending changes that can be enabled (for testing), unless otherwise noted.
* Recently implemented changes that were previously pending/disabled.

For behavior changes that are not associated with a specific behavior change bundle, see [Unbundled behavior changes](bcr-bundles/un-bundled/unbundled-behavior-changes.md).

If you have questions about any of these behavior changes, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Upcoming pending changes

The following table lists behavior changes that are pending. If the change is in a monthly behavior change bundle, the bundle is currently
disabled, but can be enabled (for testing purposes).

To enable a bundle that is currently disabled by default, use the
[SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](../sql-reference/functions/system_enable_behavior_change_bundle.md) system function.
This is typically done in your non-production accounts (for development and testing purposes).

> **Important:**
>
> All information in this table, including planned versions and dates, is subject to change; the information is provided only as a guideline
> for any updates you need to make to accommodate the changes.
>
> If a link is not provided to the individual pending behavior changes, the release in which the bundle was introduced has not started or is
> still in progress.

| Bundle | Status / History | Pending Changes | Additional Notes |
| --- | --- | --- | --- |
| **2026_03** | Introduced in the 10.12 release (April 2026) as **Disabled by Default**; account admins can enable for testing.  Status planned to change in May 2026 to **Enabled by Default**; however, this schedule is subject to change.  Status planned to change in June 2026 to **Generally Enabled**; however, this schedule is subject to change. | For detailed descriptions of each change, grouped by functional area, see:  [2026_03 Bundle (Disabled by default)](bcr-bundles/2026_03_bundle.md) |  |
| **2026_02** | Introduced in the 10.7 release (March 2-5, 2026) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 10.12 release (April 2026) to **Enabled by Default**; account admins can disable for opt-out.  Status planned to change in May 2026 to **Generally Enabled**; however, this schedule is subject to change. | For detailed descriptions of each change, grouped by functional area, see:  [2026_02 Bundle (Enabled by default)](bcr-bundles/2026_02_bundle.md) |  |

## Recently implemented changes

The following table lists behavior changes that were previously pending, but have been implemented in a recent release. If the change is in a monthly behavior change bundle that is currently enabled by default, the bundle can be disabled.

To disable a bundle that is currently enabled by default, use the [SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](../sql-reference/functions/system_disable_behavior_change_bundle.md)
system function. This is typically done in your production accounts to opt-out of the changes in the bundle while you continue testing the changes in your non-production accounts.

| Bundle | Status / History | Implemented Changes | Additional Notes |
| --- | --- | --- | --- |
| **2026_01** | Introduced in the 10.1 release (January 19-23, 2026) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 10.7 release (March 2-5, 2026) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 10.12 release (April 2026) to **Generally Enabled**; account admins can no longer disable or enable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2026_01 Bundle (Generally enabled)](bcr-bundles/2026_01_bundle.md) |  |
| **2025_07** | Introduced in the 9.32 release (October 13-15, 2025) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 10.1 release (January 19-23, 2026) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 10.7 release (March 2-5, 2026) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_07 Bundle](bcr-bundles/2025_07_bundle.md) |  |
| **2025_06** | Introduced in the 9.27 release (September 8-10, 2025) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 9.32 release (October 13-15, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 10.1 release (January 19-23, 2026) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_06 Bundle](bcr-bundles/2025_06_bundle.md) |  |
| **2025_05** | Introduced, **Disabled by Default**, in the 9.22 release (August 4-8, 2025), account admins can enable for testing.  Status changed in the 9.27 release (September 8-10, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.32 release (October 13-15, 2025) to **Generally Enabled**; account admins can no longer disable or enable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_05 Bundle](bcr-bundles/2025_05_bundle.md) |  |
| **2025_04** | Introduced, **Disabled by Default**, in the 9.17 release (June 23-30, 2025), account admins can enable for testing.  Status changed in the 9.22 release (August 4-8, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.27 release (September 8-10, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_04 Bundle](bcr-bundles/2025_04_bundle.md) |  |
| **2025_03** | Introduced, **Disabled by Default**, in the 9.12 release (May 5-12, 2025), account admins can enable for testing.  Status changed in the 9.17 release (June 23-30, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.22 release (August 4-8, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_03 Bundle](bcr-bundles/2025_03_bundle.md) |  |
| **2025_02** | Introduced, **Disabled by Default**, in the 9.7 release (March 17-27, 2025); account admins can enable for testing.  Status changed in the 9.12 release (May 5-12, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.17 release (June 23-30, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_02 Bundle](bcr-bundles/2025_02_bundle.md) |  |
| **2025_01** | Introduced, **Disabled by Default**, in the 9.2 release (January 22-February 13, 2025); account admins can enable for testing.  Status changed in the 9.7 release (March 17-27, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.12 release (May 5-12, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2025_01 Bundle](bcr-bundles/2025_01_bundle.md) |  |
| **2024_08** | Introduced, **Disabled by Default**, in the 8.38 release (October 7-9, 2024), account admins can enable for testing.  Status changed in the 9.2 release (January 22-February 13, 2025) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.7 release (March 17-27, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2024_08 Bundle](bcr-bundles/2024_08_bundle.md) |  |
| **2024_07** | Introduced, **Disabled by Default**, in the 8.32 release (August 26-30); account admins can enable for testing.  Status changed in the 8.38 release (October 7-9) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 9.2 release (January 22-February 13, 2025) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, see:  [2024_07 Bundle](bcr-bundles/2024_07_bundle.md) |  |
| **2024_06** | Introduced in the 8.27 release as **Disabled by Default**, (July 22-25); account admins can enable for testing.  Status changed to **Enabled by Default**, in the 8.32 release (scheduled for August 26-30); account admins can disable for opt-out.  Status changed in the 8.38 release (October 7-9) to **Generally Enabled**. | For detailed descriptions of each change, grouped by functional area, see:  [2024_06 Bundle](bcr-bundles/2024_06_bundle.md) |  |
| **2024_05** | Introduced in the 8.22 release (June 11-15) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 8.27 release (July 22-25) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 8.32 release (August 26-30) to **Generally Enabled**. | For detailed descriptions of each change, grouped by functional area, see:  [2024_05 Bundle](bcr-bundles/2024_05_bundle.md) |  |
| **2024_04** | Introduced in the 8.17 release (April 30 - May 7) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 8.22 release (June 11-15) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 8.27 release (July 22-25) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2024_04 Bundle](bcr-bundles/2024_04_bundle.md) |  |
| **2024_03** | Introduced in the 8.12 release (March 26-27) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 8.17 release (April 30 - May 7) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 8.22 release (June 11-15) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2024_03 Bundle](bcr-bundles/2024_03_bundle.md) |  |
| **2024_02** | Introduced in the 8.7 release (February 19-21) as **Disabled by Default**; account admins can enable for testing.  Status changed in the 8.12 release (March 26-27) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 8.17 release (April 30 - May 7) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2024_02 Bundle](bcr-bundles/2024_02_bundle.md) |  |
| **2024_01** | Introduced, **Disabled by Default**, in the 8.2 release (scheduled for January 15-17); account admins can enable for testing.  Status changed in the 8.7 release (February 19-21) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in then 8.12 release (March 26-27) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2024_01 Bundle](bcr-bundles/2024_01_bundle.md) |  |
| **2023_08** | Introduced, **Disabled by default**, in the 7.41 release (November 11-14, 2023); account admins can enable for testing.  Status changed in the 8.2 release (January 15-17, 2024) to **Enabled by Default**; account admins can disable for opt-out.  Status changed in the 8.7 release (February 19-21, 2024) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_08 Bundle (Generally Enabled)](bcr-bundles/2023_08_bundle.md) |  |
| **2023_07** | Introduced, **Disabled by default**, in the 7.34 release (September 27-28); account admins can enable for testing.  Status changed to **Enabled by Default** in the 7.41 (November 11-14, 2023); account admins can disable for opt-out.  Status changed in the 8.2 release (January 15-17, 2024) to **Generally Enabled**; account admins can no longer enable or disable this bundle. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_07 Bundle (Generally Enabled)](bcr-bundles/2023_07_bundle.md) |  |
| **2023_06** | Introduced, **disabled by default**, in the 7.29 release (August 22-23); account admins can enable for testing.  Status changed to **Enabled by Default** in the 7.34 release (September 25-26); account admins can disable for out-out.  Status changed to **Generally Enabled** in the 7.41 release (November 11-14, 2023). Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_06 Bundle (Generally Enabled)](bcr-bundles/2023_06_bundle.md) |  |
| **2023_05** | Introduced, **disabled by default**, in the 7.23 release (July 10-11).  Status changed in the 7.29 release (August 22-23) to **Enabled by Default**; account admins can disable for opt-out.  Status changed to **Generally Enabled** in the 7.34 release (September 25-26). Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_05 Bundle (Generally Enabled)](bcr-bundles/2023_05_bundle.md) |  |
| **2023_04** | Introduced, **disabled by default**, in the 7.19 release (Jun 7-8); account admins can enable for testing.  **Enabled by default** in the 7.23 release (July 10-11); account admins can disable to opt-out.  **Generally enabled** in the 7.29 release (August 22-23). Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_04 Bundle (Generally Enabled)](bcr-bundles/2023_04_bundle.md) |  |
| **2023_03** | Introduced, **disabled by default**, in the 7.13 release (Apr 20-24); account admins can enable for testing.  **Enabled by default** in the 7.19 release (Jun 7-8); account admins can disable to opt-out.  **Generally enabled** in the 7.23 release (July 10-11). Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_03 Bundle (Generally Enabled)](bcr-bundles/2023_03_bundle.md) |  |
| **2023_02** | Introduced, **disabled by default**, in the 7.7 release (Mar 6-7).  **Enabled by default** in the 7.13 release (Apr 20-23).  **Generally enabled** in the 7.19 release (Jun 7-8); Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_02 Bundle (Generally Enabled)](bcr-bundles/2023_02_bundle.md) |  |
| **2023_01** | Introduced, **disabled by default**, in the 7.2 release (Jan 19-20).  **Enabled by default** in the 7.7 release (Mar 6-7).  **Generally enabled** in the 7.13 release (Apr 20-24); Account admins can no longer enable or disable. | For detailed descriptions of each change, grouped by functional area, refer to:  [2023_01 Bundle (Generally Enabled)](bcr-bundles/2023_01_bundle.md) |  |

---
title: Behavior change management
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/managing-behavior-change-releases.md
section: Release Notes
---

# Behavior change management

This document explains how to check whether a particular
[behavior change bundle](../behavior-change-policy.md) is enabled in your account and how to enable or disable it.

## Overview

Snowflake implements behavior changes monthly in bundles included in regularly-scheduled
[releases](../../user-guide/intro-releases.md). During the testing period and opt-out period for each behavior change bundle,
you can enable or disable the bundle in your account. This document explains how to check whether a particular bundle is enabled
in your account and how to enable or disable it.

In this document, the name of the behavior change bundle is in the form `YYYY_NN`. For the names of the
currently available behavior change bundles, see [Behavior change announcements](../behavior-changes.md).

> **Note:**
>
> Behavior changes in bundles cannot be enabled/disabled individually. To enable/disable a behavior change, you must
> enable/disable the bundle containing the change.

## Checking the status of a behavior change bundle in your account

To check whether a specific behavior change bundle is enabled in your account, call the
[SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS](../../sql-reference/functions/system_behavior_change_bundle_status.md) function. For example, to check the status of the bundle
named `2024_02`:

```sqlexample
SELECT SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS('2024_02');
```

```output
+-------------------------------------------------+
| SYSTEM$BEHAVIOR_CHANGE_BUNDLE_STATUS('2024_02') |
|-------------------------------------------------|
| DISABLED                                        |
+-------------------------------------------------+
```

To check the status of all currently available behavior change bundles, call the
[SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES](../../sql-reference/functions/system_show_active_behavior_change_bundles.md) function:

```sqlexample
SELECT SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES();
```

```output
+--------------------------------------------------------------------------------------------------------------+
| SYSTEM$SHOW_ACTIVE_BEHAVIOR_CHANGE_BUNDLES()                                                                 |
|--------------------------------------------------------------------------------------------------------------|
| [{"name":"2023_08","isDefault":true,"isEnabled":true},{"name":"2024_01","isDefault":false,"isEnabled":true}] |
+--------------------------------------------------------------------------------------------------------------+
```

## Enabling a behavior change bundle in your account

To enable a particular behavior change in your account, call the
[SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE](../../sql-reference/functions/system_enable_behavior_change_bundle.md) function. For example, to enable the bundle
named `2024_02`:

```sqlexample
SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2024_02');
```

```output
+-------------------------------------------------+
| SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2024_02') |
|-------------------------------------------------|
| ENABLED                                         |
+-------------------------------------------------+
```

## Disabling a behavior change bundle in your account

To disable a particular behavior change in your account, call the
[SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE](../../sql-reference/functions/system_disable_behavior_change_bundle.md). For example, to disable the bundle
named `2024_02`:

```sqlexample
SELECT SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE('2024_02');
```

```output
+-------------------------------------------------+
| SYSTEM$DISABLE_BEHAVIOR_CHANGE_BUNDLE('2024_02')|
|-------------------------------------------------|
| DISABLED                                        |
+-------------------------------------------------+
```

## Determining the current version of your account

To check the current version of Snowflake that is in your account, call the
[CURRENT_VERSION](../../sql-reference/functions/current_version.md) function. For example:

> ```sqlexample
> SELECT CURRENT_VERSION();
> ```
>
> ```output
> +-------------------+
> | CURRENT_VERSION() |
> |-------------------|
> | 8.5.1             |
> +-------------------+
> ```

## Mitigating masking policy return value updates

In the `2024_04` [bundle](2024_04/bcr-1355.md), there are changes to the values for precision and
scale in masking policy conditions (collectively: “return value updates”). A query on a column protected by a masking policy fails when the
following are true:

* The bundle is enabled.
* The masking policy conditions return a value whose precision is greater than the precision of the column to which the masking
  policy is assigned.

If the scale of the return value is larger than the scale of the column, the value is truncated to match the scale of the column.

If you want to apply the new behavior to a pre-existing policy, create a new masking policy and replace the pre-existing policy using the
`FORCE` [keyword](../../user-guide/security-column-intro.md).

When the bundle is enabled, you can test the behavior as follows:

1. Create a policy:

   ```sqlexample
   CREATE MASKING POLICY MP AS (n NUMBER)
   RETURNS NUMBER -> 12345;
   ```
2. Assign the policy:

   ```sqlexample
   CREATE TABLE t(col1 NUMBER(2,0));

   ALTER TABLE t MODIFY COLUMN col1 SET MASKING POLICY mp;
   INSERT INTO t VALUES (10);
   ```
3. Query the column (fails):

   ```sqlexample
   SELECT * FROM t;
   ```

> **Note:**
>
> The changes to the values for precision and scale are not applicable to the string data type.

To determine the impact of this change and provide enough time to update the masking policy conditions to protect data, query the
SNOWFLAKE.BCR_ROLLOUT.BCR_2024_03_DDM_ROLLOUT view to understand how the future return value updates affect your account.

The BCR_2024_03_DDM_ROLLOUT view is temporary. Snowflake will remove the view when the return value updates are generally enabled
in a future behavior change bundle. At this point, you will not be able to query the view to determine affected columns and policies or
prevent column query or masking policy assignment operation failures due to return value updates.

The view records data starting from March 2024. If a query on the view takes a long time to complete, you can specify the start date and
end date session variables using a [SET](../../sql-reference/sql/set.md) command. These variables help to reduce the number of rows to evaluate when
you query the view. For example:

> ```sqlexample
> SET DDM_CASTING_BCR_START_DATE = '2024-03-01';
> SET DDM_CASTING_BCR_END_DATE = '2024-04-03';
> ```

### Identify masking policy & column associations

To query the view and mitigate the upcoming return value changes, do the following:

1. Query the SNOWFLAKE.BCR_ROLLOUT.BCR_2024_03_DDM_ROLLOUT view. For example:

   ```sqlexample
   USE ROLE ACCOUNTADMIN;
   SET DDM_CASTING_BCR_START_DATE = '2024-03-01';
   SET DDM_CASTING_BCR_END_DATE = '2024-04-03';
   SELECT * FROM SNOWFLAKE.BCR_ROLLOUT.BCR_2024_03_DDM_ROLLOUT;
   ```
2. Evaluate the REASON column in the BCR_2024_03_DDM_ROLLOUT View Reference section to determine what update needs to be made to the
   masking policy conditions.
3. Update the masking policy conditions with an [ALTER MASKING POLICY](../../sql-reference/sql/alter-masking-policy.md) statement to ensure the column data remains
   protected and that policy assignment operations or protected column queries do not fail.
4. Test the new policy conditions by querying the table columns to which the masking policies are assigned.

### BCR_2024_03_DDM_ROLLOUT view reference

The BCR_2024_03_DDM_ROLLOUT view (in the SNOWFLAKE.BCR_ROLLOUT schema) records information starting on July 15, 2022 and contains the
following columns:

| Column | Data type | Description |
| --- | --- | --- |
| `policy_name` | VARCHAR | The name of the policy. |
| `policy_id` | NUMBER | Internal/system-generated identifier for the policy. |
| `policy_schema` | VARCHAR | The parent schema of the policy. |
| `policy_database` | VARCHAR | The parent database of the policy. |
| `policy_body` | VARIANT | The conditions of the policy to mask or unmask the column data. |
| `column_name` | VARCHAR | The name of the column that has the policy. |
| COLUMN_TYPE | VARCHAR | The data type of the column. |
| COLUMN_LENGTH | NUMBER | The length of the column that has the policy or `[NULL]` if not set for the column. |
| COLUMN_PRECISION | NUMBER | The precision of the column that has the policy or `[NULL]` if not set for the column. |
| COLUMN_SCALE | NUMBER | The scale of the column that has the policy or `[NULL]` if not set for the column. |
| TABLE_NAME | VARCHAR | The name of the table. |
| `table_id` | NUMBER | Internal/system-generated identifier for the table. |
| `table_schema` | VARCHAR | The parent schema of the table. |
| `table_database` | VARCHAR | The parent database of the table. |
| `table_kind` | VARCHAR | The type of table. One of the following: `TABLE`, `LOCAL TEMPORARY`, `VIEW`, `MATERIALIZED VIEW`, `EXTERNAL TABLE`, or `DYNAMIC TABLE`. |
| `reason` | VARCHAR | Possible reason for the mismatch. One of the following: `precision` or `scale`. |
| LARGEST_MASKED_SIZE | NUMBER | The maximum length, scale, or precision a masked value can have based on the masking policy assigned to the column. |

---
title: Behavior change policy
source: https://docs.snowflake.com/en/release-notes/behavior-change-policy.md
section: Release Notes
---

# Behavior change policy

The behavior change release process at Snowflake lets Snowflake users control a bundle of product or feature
changes that may affect existing functionality for at least eight weeks before the changes are generally enabled across
all Snowflake accounts. During this period of time, account administrators can selectively disable or enable
each bundle of behavior changes. New behavior change bundles are introduced on a monthly basis.

## Introduction

To provide the best experience and value to our customers, Snowflake is continually improving and enhancing our service offerings.
As part of these ongoing efforts, Snowflake must sometimes make changes to products or features that may affect existing functionality.
To minimize the impact of these behavior changes on production accounts, and to ensure consistent, timely customer communication,
behavior changes are typically released on a monthly basis as bundles introduced in designated regularly-scheduled weekly releases.
As a general rule, one month elapses between the start of each of these releases. Once a new bundle is released, account administrators
can enable or disable the bundle for their accounts for eight weeks before the behavior changes in the bundle become generally enabled
for all accounts.

## Monthly behavior change bundles

**Snowflake releases behavior changes in monthly behavior change bundles, with each behavior change bundle typically containing multiple behavior changes.**

Details about each behavior change are published to the [Snowflake Documentation site](behavior-changes.md) and email notifications of
behavior change releases are sent to the [Product Notification contact](../user-guide/ui-snowsight-contacts.md) and a mailing list of users
who are authorized to submit support cases.
If Snowflake identifies specific customers who are likely to be directly affected by the behavior changes,
Snowflake Customer Support may also send email notifications to the designated support contacts for those customers.
Behavior change bundles are named by the year and the ordinal number of the bundle within the year. For example, bundle 2023_03 would be
the third behavior change bundle released in the year 2023.

For information about current and past behavior change releases, refer to the [Behavior change announcements](behavior-changes.md).

## Testing period

**For at least four weeks following release, account administrators can opt in to a behavior change bundle..**

The first four-week period after a behavior change bundle is released is called the **Testing Period**.
During this time, the behavior changes in the bundle are disabled by default.
Account administrators can [enable the entire behavior change bundle](bcr-bundles/managing-behavior-change-releases.md),
but cannot enable or disable individual changes in the bundle.
To test the changes during this period, Snowflake recommends enabling the bundle in one or more accounts dedicated to development or
quality assurance purposes.
If more time is required to test the changes in the bundle and to mitigate
their impact on a production account, the account administrator can proactively disable the entire bundle in the account prior to the
beginning of the Opt-out Period.

## Opt-out period

**For at least four weeks following the end of the Testing Period, account administrators can opt out of a behavior change bundle.**

The next four-week period is the Opt-out Period. At the beginning of the **Opt-out Period**, the behavior change bundle status changes
from disabled by default to enabled by default. If the behavior change bundle status was explicitly modified at any point during the previous Testing Period,
it will remain in its specified state.
As with the Testing Period, individual changes cannot be disabled, but account administrators can disable the entire behavior change bundle at any time.

## Generally enabled

**After these 8 weeks, the behavior changes in the bundle are generally enabled.**

After the Testing and Opt-Out periods, the bundle is generally enabled and the behavior change release process is complete. The behavior change bundle is fully released, meaning all the changes in the bundle are now in effect in production for all accounts across all deployments.

At this point, you can no longer disable the behavior changes from your accounts.

## Enabling and disabling behavior change bundles

As described above, account administrators can enable or disable behavior change bundles any time during the Testing or Opt-Out periods.
To learn how to check the status of a behavior change bundle for an account, enable a bundle, or disable a bundle, refer to
[Behavior Change Management](bcr-bundles/managing-behavior-change-releases.md).
When an account administrator (or a Snowflake representative) explicitly enables or disables a behavior change bundle for an account,
Snowflake will not override or reverse that setting. However, at the end of the Opt-out Period, behavior change bundles become
generally enabled and are in production for all accounts.

## Multiple behavior change release processes overlap

New behavior change bundles are typically released on a monthly basis, and take at least eight weeks to complete.
Therefore, only two behavior change bundles may be available for your Snowflake account at any given time,
with each behavior change bundle in different periods of the release process.
Specifically, the Opt-out Period of a bundle will overlap with the Testing Period of the next bundle.
In some instances, Snowflake may postpone or cancel the release of a new behavior change bundle in a given month,
resulting in the two available behavior change bundles to exist for longer than the normal 8 week period.

## Impact score

BCRs are ranked from highest to lowest potential technical impact. While we recommend testing higher-ranked BCRs first, the actual impact depends on how you use our services.

> For example:

* A high-ranked BCR might not affect you if you don’t use that feature.
* A lower-ranked BCR might be more disruptive based on your specific usage.

Use this ranking as a general guide, but prioritize testing based on both the BCR’s rank and its relevance to your account.

Impact Score [LOW]:
:   This signifies minimal change to existing structures or processes.
    An example would be adding a new column to a current view or table, which would not disrupt existing queries or functionality.

Impact Score [MEDIUM]:
:   Indicates minor changes, primarily aimed at increasing awareness or requiring slight adjustments from users.

Impact Score [HIGH]:
:   Represents major changes that necessitate substantial adjustments from users.
    These changes often have a high impact due to significant alterations to use cases and workloads.

---
title: Bind variables: No longer ignored as parameters for some built-in table functions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1410.md
section: Release Notes
---

# Bind variables: No longer ignored as parameters for some built-in table functions

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

In [Snowflake Scripting](../../../developer-guide/snowflake-scripting/index.md), a [driver](../../../developer-guide/drivers.md), or the
[SQL REST API](../../../developer-guide/sql-api/index.md), you can use bind variables in SQL statements. (For examples of using bind
variables, see [Using a variable in a SQL statement (binding)](../../../developer-guide/snowflake-scripting/variables.md), [Binding data](../../../developer-guide/python-connector/python-connector-example.md), and
[Using bind variables in a statement](../../../developer-guide/sql-api/submitting-requests.md).

This behavior change affects cases in which you pass a bind variable directly as one of the
built-in table function arguments listed below. The behavior changes in the
following way:

Before the change:
:   The bind variable is ignored, and the argument is not passed to the table function.

AFter the change:
:   The bind variable is passed as an argument to the table function.

Note that this does not affect cases in which you pass a bind variable to another function before passing the result to a table
function argument. For example, if you are calling the COPY_HISTORY function, this change affects cases in which you pass a
bind variable directly as the START_TIME argument:

```sqlexample
COPY_HISTORY( START_TIME=> ?, ...
```

This does not affect cases in which you pass the bind variable to another built-in function first:

```sqlexample
COPY_HISTORY( START_TIME=> DATEADD('days', ?, ...
```

If you want to preserve the behavior before the change, you can rewrite your code to avoid passing the argument that uses the
bind variable. For example, if you are calling the TASK_HISTORY function and you do not want the results filtered by a specific
task, omit the TASK_NAME argument from the call.

The following table function arguments are affected by this change:

| Table Function | Arguments Affected |
| --- | --- |
| [AUTO_REFRESH_REGISTRATION_HISTORY](../../../sql-reference/functions/auto_refresh_registration_history.md) | OBJECT_TYPE |
|  | OBJECT_NAME |
| [COPY_HISTORY](../../../sql-reference/functions/copy_history.md) | TABLE_NAME |
|  | START_TIME |
| [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md) | ROOT_TASK_NAME |
| [DYNAMIC_TABLE_REFRESH_HISTORY](../../../sql-reference/functions/dynamic_table_refresh_history.md) | RESULT_LIMIT |
| [EXTERNAL_TABLE_FILE_REGISTRATION_HISTORY](../../../sql-reference/functions/external_table_registration_history.md) | TABLE_NAME |
| [INFER_SCHEMA](../../../sql-reference/functions/infer_schema.md) | LOCATION |
|  | FILE_FORMAT |
|  | FILES |
| [POLICY_REFERENCES](../../../sql-reference/functions/policy_references.md) | POLICY_NAME |
|  | REF_ENTITY_NAME |
|  | REF_ENTITY_DOMAIN |
| [QUERY_HISTORY](../../../sql-reference/functions/query_history.md) | END_TIME_RANGE_START |
|  | END_TIME_RANGE_END |
|  | RESULT_LIMIT |
| [QUERY_HISTORY_BY_SESSION](../../../sql-reference/functions/query_history.md) | SESSION_ID |
|  | RESULT_LIMIT |
| [QUERY_HISTORY_BY_USER](../../../sql-reference/functions/query_history.md) | USER_NAME |
| [QUERY_HISTORY_BY_WAREHOUSE](../../../sql-reference/functions/query_history.md) | WAREHOUSE_NAME |
|  | END_TIME_RANGE_START |
|  | END_TIME_RANGE_END |
|  | RESULT_LIMIT |
| [TAG_REFERENCES](../../../sql-reference/functions/tag_references.md) | OBJECT_NAME (the `object_name` argument) |
| [TASK_DEPENDENTS](../../../sql-reference/functions/task_dependents.md) | TASK_NAME |
| [TASK_HISTORY](../../../sql-reference/functions/task_history.md) | RESULT_LIMIT |
|  | TASK_NAME |
| [WAREHOUSE_LOAD_HISTORY](../../../sql-reference/functions/warehouse_load_history.md) | DATE_RANGE_START |
|  | DATE_RANGE_END |
|  | WAREHOUSE_NAME |
| [WAREHOUSE_METERING_HISTORY](../../../sql-reference/functions/warehouse_metering_history.md) | DATE_RANGE_START |
|  | DATE_RANGE_END |

Ref: 1410

---
title: BLOCK_NON_READLIST_OPERATIONS_ON_STAGES_IN_SECONDARY: Parameter set to TRUE by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1234.md
section: Release Notes
---

# BLOCK_NON_READLIST_OPERATIONS_ON_STAGES_IN_SECONDARY: Parameter set to TRUE by default

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

Previously:
:   The BLOCK_NON_READLIST_OPERATIONS_ON_STAGES_IN_SECONDARY parameter has a default value of `FALSE`.
    This means that the following write operations are allowed on stages in read-only database replicas by default:

    * [PUT](../../../sql-reference/sql/put.md)
    * [REMOVE](../../../sql-reference/sql/remove.md)
    * [ALTER STAGE … REFRESH](../../../sql-reference/sql/alter-stage.md)

Currently:
:   To be consistent with replication semantics where there is one primary (writeable)
    and one or more secondary replicas (readable), the BLOCK_NON_READLIST_OPERATIONS_ON_STAGES_IN_SECONDARY
    parameter will have a default value of `TRUE`.

    Write operations on replicated stages will fail unless you set the value of the parameter to `FALSE`.

Ref: 1234

---
title: BLOCK_STORAGE_HISTORY view (Account Usage): New columns ADDITIONAL_IOPS and ADDITIONAL_THROUGHPUT
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1921.md
section: Release Notes
---

# BLOCK_STORAGE_HISTORY view (Account Usage): New columns ADDITIONAL_IOPS and ADDITIONAL_THROUGHPUT

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the Account Usage [BLOCK_STORAGE_HISTORY view](../../../sql-reference/account-usage/block_storage_history.md) includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ADDITIONAL_IOPS | NUMBER | Average number of additional IOPS used on the given date. |
| ADDITIONAL_THROUGHPUT | NUMBER | Average amount of additional throughput (MiB per second) used on the given date. |

Additional IOPS and throughput refer to block configuration values exceeding the default settings (see [Specifying block storage in service specification](../../../developer-guide/snowpark-container-services/block-storage-volume.md)). For example, on AWS, the block configuration default IOPS is 3,000, and the default throughput is 125 MiB/second. If you configure an AWS block device with 4,000 IOPS and 225 MiB/second throughput, the additional IOPS would be 1,000 (4,000 - 3,000), and the additional throughput would be 100 MiB/second (225 - 125).

The following three examples illustrate how you can get this information from the BLOCK_STORAGE_HISTORY view. Suppose that your account is set up with the following:

* Your account provisioned a 10 GB block volume (as part of a service) with 1000 additional IOPS and 100 MiB/second additional throughput for 6 hours on 2025-02-01 for compute pool `pool_1`. If you query the view, you can get the following information from the `additional_iops` and `additional_throughput` columns:

  + Using 10 GB for 6 hours equals 2.5 GB per day (10 GB x 6/24 hours = 2.5 GB = 2684354560 bytes per day).
  + Using 1000 additional IOPS for 6 hours equals 250 IOPS per day (1000 IOPS \* 6/24 hours = 250 IOPS per day).
  + Using 100 additional MiB/second for 6 hours equals average 25 MiB/second per day (100 MiB \* 6/24 hours = 25 MiB per day).
* Your account is provisioned a 10 GB block volume (as part of a service) with 1 additional IOPS and 1 MiB/s additional throughput for 12 hours on 2025-02-01 for compute pool `POOL_2`.

  + Using 10 GB for 12 hours equals 5 GB per day (10 GB \* 12/24 hours = 5 GB = 5368709120 bytes per day).
  + 1 additional IOPS used for 12 hours equals 0.5 IOPS per day (1 IOPS \* 12/24 =hours 0.5 IOPS per day).
  + 1 additional MiB/second throughput MiB/s used for 12 hours equals 0.5 MiB/second per day (1 MiB \* 12/24 hours = 0.5 MiB per day)
* You use a 20 GB snapshot for 24 hours on 2025-02-01. Using 20 GB for 24 hours is equivalent to 20 GB (21474836480 bytes) per day.

When you query the view:

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.BLOCK_STORAGE_HISTORY;
```

The `bytes`, `additional_iops`, and `additional_throughput` columns in the query output provide this information:

```output
+-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
| USAGE_DATE                    | STORAGE_TYPE       | COMPUTE_POOL_NAME       |       BYTES    |       ADDITIONAL_IOPS |       ADDITIONAL_THROUGHPUT |
|-------------------------------+--------------------+-------------------------+----------------|-----------------------|-----------------------------|
| 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_1                  | 2,684,354,560  | 250.000000000         | 25.000000000                |
| 2025-02-01 00:00:00.000 -0700 | BLOCK_STORAGE      | POOL_2                  | 5,368,709,120  | 0.50000000            | 0.500000000                 |
| 2025-02-01 00:00:00.000 -0700 | SNAPSHOT           | NULL                    | 21,474,836,480 | 0.000000000           | 0.000000000                 |
+-------------------------------+--------------------+-------------------------+----------------+-----------------------+-----------------------------+
```

Ref: 1921

---
title: Canceled unbundled behavior changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/unbundled-cancelled-behavior-changes.md
section: Release Notes
---

# Canceled unbundled behavior changes

Canceled unbundled behavior changes are changes that were originally intended to be released but have been canceled and will not be implemented.

To help you manage your operations and minimize disruption to your Snowflake service, we document behavior changes that may impact your usage,
including:

* [Recently implemented changes](unbundled-behavior-changes.md) that were previously pending/disabled, were not part of a behavior change bundle, and cannot be disabled.
* [Upcoming pending changes](unbundled-behavior-changes.md) that will not be part of a behavior change bundle and cannot be enabled in advance.

If you have questions about any of these behavior changes, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Canceled unbundled behavior changes

The following table lists canceled behavior changes. Canceled behavior changes are behavior changes that have been previously planned but will not be implemented.

| Originally Planned BCR Bundle | Functional Area | Canceled Behavior Change | Additional Notes |
| --- | --- | --- | --- |
| [2025_06 Bundle](../2025_06_bundle.md) | Security | [OAuth authentication: Change in network policy behavior (Canceled)](bcr-2094.md) |  |
| [2025_04 Bundle](../2025_04_bundle.md) | Security | [Mandatory multi-factor authentication on Snowsight login (Replaced)](bcr-1972.md) | Replaced by [Multi-factor authentication: MFA_ENROLLMENT parameter values change](../2025_06/bcr-2097.md) |
| [2025_04 Bundle](../2025_04_bundle.md) | Snowpark Python | [Snowpark Python: Eliminate repeated subqueries in Snowpark-generated queries (Canceled)](bcr-1995.md) |  |
| [2024_08 Bundle](../2024_08_bundle.md) | Data Governance | [Use primary role for authorizing view and materialized view creation (Canceled)](bcr-1782.md) |  |
| [2024_08 Bundle](../2024_08_bundle.md) | Security | [GRANT OWNERSHIP ON ROLE command: Restrict transfer of role ownership to itself (Canceled)](bcr-1781.md) |  |

---
title: Catalog-linked database: USAGE privilege on CATALOG INTEGRATION and EXTERNAL VOLUME required for database owner role for all operations
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2114.md
section: Release Notes
---

# Catalog-linked database: USAGE privilege on CATALOG INTEGRATION and EXTERNAL VOLUME required for database owner role for all operations

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

This behavior change alters the requirements for accessing tables in a catalog-linked database. It affects the following operations inside a catalog-linked database, which require access to the catalog integration and external volume for the database:

* Any DML operations on an object inside the database
* Any DDL operations on an object inside the database
* Automatic table discovery
* Automatically or manually refreshing a table in the database
* Reading a table in the database

Before the change:
:   To access tables in a catalog-linked database, any of the following roles must have the USAGE privilege on the external volume for the
    database and the USAGE privilege on the catalog integration for the database:

    > * The role that has the OWNERSHIP privilege for the catalog-linked database.
    > * The role that has the OWNERSHIP privilege for a table within the catalog-linked database.
    > * Any role that is active for the session.

After the change:
:   To access tables in a catalog-linked database, the following role must have the USAGE privilege on the external volume for the database
    and the USAGE privilege on the catalog integration for the database:

    > * The role that has the OWNERSHIP privilege for the catalog-linked database.

    For example, the ALTER command only succeeds if the database owner role has access to the catalog integration and external volume. If you
    try to run the ALTER command but the database owner role doesn’t have access to the catalog integration, you’ll receive the following
    error:

    ```none
    SQL access control error: Insufficient privileges to operate on integration '<name of catalog integration>'.
    ```

    If you try to run the ALTER command but the database owner role doesn’t have access to the external volume, you’ll receive the following
    error:

    ```none
    SQL access control error: Insufficient privileges to operate on external volume '<name of external volume>').
    ```

    If needed, grant the required USAGE privileges to the role that owns the catalog-linked database.

    In the following example, the data_engineer role, which has the OWNERSHIP privilege for the catalog-linked database, is
    granted the necessary USAGE privileges to provide access the tables in the catalog-linked database:

    ```sqlexample
    GRANT USAGE ON INTEGRATION glueCatalogInt TO ROLE data_engineer;
    GRANT USAGE ON EXTERNAL VOLUME exvol TO ROLE data_engineer;
    ```

This change makes access management for Apache Iceberg™ tables in catalog-linked databases more efficient by routing all of this management
through the owner of the catalog-linked database.

> **Note:**
>
> If you’re using catalog-vended credentials, the requirement to have the USAGE privilege for the external volume doesn’t apply to before
> or after the change.

Ref: 2114

---
title: Change in HTTP error code for URL not found error
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1669.md
section: Release Notes
---

# Change in HTTP error code for URL not found error

The error code returned to customers, when using an incorrect or invalid URL to access their Snowflake account, behaves as follows:

Before the change:
:   Anyone using an incorrect (non-existent org/account or malformed) URL to access their Snowflake account was returned a 403 error.

After the change:
:   Anyone using an incorrect (non-existent org/account or malformed) URL to access their Snowflake account can see
    403, 404, or 513 return codes.

    **What you need to do**
    If you have hard-coded the error code 403 in your error handling logic/code,
    Snowflake recommends updating it to include error codes 403, 404, and 513.

If you have any questions regarding this change, please open a case with [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Ref: 1669

---
title: Change of Certificate Authority and OCSP Allowlist for AWS Customers
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1657.md
section: Release Notes
---

# Change of Certificate Authority and OCSP Allowlist for AWS Customers

> **Note:**
>
> The changes mentioned in this BCR affect only customers using Snowflake on AWS (including AWS PrivateLink).

## Changes

As part of Snowflake’s continued commitment to providing best-in-class transport-layer-security (TLS)
we are migrating all endpoints used by connectors, drivers, SQL API clients and all PrivateLink Endpoints
to a new load balancing stack. A final step in this migration moves TLS session termination from Amazon Elastic
Load Balancers to Snowflake-managed Envoy proxies.

As a result, Snowflake is changing which TLS Certificate Authority (CA) signs the certificates
used to terminate TLS connections to its API endpoints from Amazon Trust Services to Digicert.

> **Note:**
>
> Digicert is already used for Snowflake’s Azure & GCP regions.

Since Digicert CA certificates are present in the default trust stores of all major operating systems, browsers
and client environments, and allowlisting egress to OCSP responders is a rare configuration, this migration will be
transparent and **require no changes for the majority of Snowflake customers**.

For the small fraction of customers who allowlist network egress or customize their CA trust stores to exclude Digicert, configuration updates may be required:

1. An update to operating system or application level trust stores to include the Digicert CA root certificate,
   or intermediates (applies to PrivateLink and non-PrivateLink connectivity).
2. An update to client firewalls and egress proxies to allow requests to the `ocsp.digicert.com` OCSP responder endpoint (applies only to non-PrivateLink connectivity).

## Validation

### CA Trust Store

Your Operating System, Browser or Application level TLS Certificate Authority trust store must contain the certificate
for Digicert Global Root G2, serial `03:3A:F1:E6:A7:11:A9:A0:BB:28:64:B1:1D:09:FA:E5`.

Operating system trust stores are implemented by the OS provider, and all recently patched operating systems
contain the Digicert Global Root G2 certification in their default trust stores. Please reach out to your OS vendor
for additional assistance.

For more information see the following:

* [Windows](https://learn.microsoft.com/en-us/windows-hardware/drivers/install/certificate-stores?source=recommendations)
* [macOS](https://support.apple.com/en-us/HT209143)
* [RedHat Linux](https://www.redhat.com/sysadmin/configure-ca-trust-list)
* [Ubuntu](https://ubuntu.com/server/docs/security-trust-store)

If you access Snowflake from a Java application with a custom trust store, you can validate that Digicert Global Root G2 appears in the output of:

```bash
keytool -list -keystore <path_to_keystore_file>
```

### OCSP Allowlist

> **Note:**
>
> This BCR does not require any OCSP allowlist changes for customers using Snowflake drivers to access AWS PrivateLink endpoints.

Non-privatelink customers should validate that their clients have outbound network connectivity to `ocsp.digicert.com` on port `80`. Note the `curl` url must use the `http` protocol and not `https`. Use of `https` will result in a TLS error.

```bash
curl -I 'http://ocsp.digicert.com'
HTTP/1.1 200 OK
...
```

For general instructions on Firewall allowlist requirements and validation using the SnowCD tool, see [SYSTEM$ALLOWLIST](../../../sql-reference/functions/system_allowlist.md).

### Privatelink Early Adopter Opt-In Validation

Privatelink customers who have opted in to the early adopter program have several options to validate:

1. Use Snowsight with private connectivity. For additional details see the
   [private connectivity instructions](../../../user-guide/ui-snowsight-gs.md). If you are able to connect to Snowsight,
   you have the correct configuration for Digicert CA update.
2. Use any Snowflake driver with a privatelink URL. For additional details see the
   [Snowflake driver with private link instructions](../../../user-guide/admin-security-privatelink.md). If you are able to run queries,
   you have the correct configuration for Digicert CA update.

If you have opted an account in for Privatelink testing, you can run the following commands to confirm that the account has been migrated to Digicert CA:

```bash
curl -v 'https://<privatelink hostname>/console'
...
Server certificate:
...
issuer: C=US; O=DigiCert Inc; CN=DigiCert Global G2 TLS RSA SHA256 2020 CA1
...
HTTP/1.1 200 OK
...
```

In the response, you should see a server certificate section with the Digicert Global G2 included. If you do, your account is currently on Digicert CA.
In the server certificate section, if you see Amazon as the issuer, your account is still on ACM certificates.

Alternatively, you can run the following command:

```bash
openssl s_client -connect <privatelink hostname>:443 -showcerts
...
Certificate chain:
...
issuer: C=US; O=DigiCert Inc; CN=DigiCert Global G2 TLS RSA SHA256 2020 CA1
...
Verification: OK
...
```

If you see the Digicert G2 cert in the chain, your account is currently on Digicert CA.
If you see Amazon certificates in the chain, your account is still on ACM certificates.

> **Note:**
>
> Due to some limitations with our NLB infrastructure, we are unable to support the use of underscores in the PrivateLink hostname. If you are still seeing Amazon certificates during validation, please try replacing the underscores with hyphens.
> This issue only affect early opt-in testing (getting the Digicert certificate during the early opt-in phase). Once the migration completes, you will be able to continue using underscores in hostnames.

## Timeline

> **Important:**
>
> This BCR is an [Unbundled Change](unbundled-behavior-changes.md). This infrastructure update will be
> executed by Snowflake on the timeline below, and is not coupled to the Snowflake release cycle or
> [Behavior Change Management](../managing-behavior-change-releases.md) tooling. There is no self-service mechanism to
> opt in or opt out of this change. For validation and testing, please reach out to the support team to opt in individual accounts for non-PrivateLink connectivity testing.
> For PrivateLink validation, Snowflake will provide support for account level early adopter opt-in.

For non-PrivateLink traffic, this change has been applied to all AWS regions in January 2025 with some exceptions. We will gradually remove these exceptions starting in July 2025.
For PrivateLink traffic, we are offering early adopter opt-in testing starting on June 23rd 2025. We will give everyone 2 months for testing and validation. Deployment wide changes will be rolled out across all AWS regions starting in September 2025.

Ref: 1657

---
title: Changes in TLS Cipher Suite Requirements
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1727.md
section: Release Notes
---

# Changes in TLS Cipher Suite Requirements

As part of Snowflake’s continued commitment to providing best-in-class Transport Layer Security (TLS) we are
migrating all endpoints used by connectors, drivers, or SQL API clients to a new
load balancing stack based on the open source Envoy Proxy,
as described in [Snowflake’s migration to Envoy for traffic management](https://medium.com/snowflake/snowflakes-migration-to-envoy-for-traffic-management-fc957e50bc6f).

While this migration is expected be transparent for most customers, there are two changes which may require action,
depending on client-specific configuration:

1. Change of TLS server-side implementation.
2. Enabling of TLS 1.3 and deprecation of weak TLS 1.2 cipher suites.

Details follow:

1. When a region is switched over to terminating TLS using Envoy, some Snowflake Java-based clients
   with custom Security Provider configurations may experience issues establishing TLS connections.
   Configuration of this kind is very uncommon. To reduce connectivity problems,
   Java clients should verify that a security provider that supports Elliptic Curve Cryptography (ECC), such as `SunEC` or `BouncyCastleProvider`,
   is enabled in their `java.security` file. `SunEC` is enabled by default.

   For more details on how to ensure your clients are configured for compatibility see Snowflake knowledge base article [Envoy migration updates](https://community.snowflake.com/s/article/FAQ-Updates-on-Migration-of-Traffic-Serving-Proxy-Load-Balancing-Infrastructure).
2. Once a region is migrated to Envoy, TLS 1.3 is automatically available, and capable clients will begin negotiating connections using this most up-to-date version of TLS. TLS 1.2 will continue to be supported.

   Initially, to maintain backwards compatibility for as many customers as possible, the following TLS 1.2 cipher suites will continue to be supported in multi-tenant regions:

   Strong ciphersuites (preferred, and always used if supported by the client):

   * `TLS_ECDHE_RSA_WITH_CHACHA20_POLY1305_SHA256`
   * `TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256`
   * `TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384`

   Weak ciphersuites (for backwards-compatibility, not available in US Gov regions)

   * `TLS_RSA_WITH_AES_128_GCM_SHA256`
   * `TLS_RSA_WITH_AES_256_GCM_SHA384`
   * `TLS_ECDHE_RSA_WITH_AES_256_CBC_SHA`
   * `TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA`
   * `TLS_ECDHE_RSA_WITH_AES_128_CBC_SHA256`
   * `TLS_RSA_WITH_AES_256_CBC_SHA`
   * `TLS_RSA_WITH_AES_128_CBC_SHA`
   * `TLS_RSA_WITH_AES_128_CBC_SHA256`
   * `TLS_RSA_WITH_AES_256_CBC_SHA256`

   Approximately one month after TLS 1.3 is enabled:

   1. For accounts observed to be connecting exclusively via TLS 1.3 or the preferred TLS 1.2 cipher suites above
      during this period, and for all newly created accounts, TLS 1.3 or TLS 1.2 with strong ciphersuites
      will be made mandatory when connecting to Snowflake using public IP addresses. Weak TLS 1.2 cipher suites will no longer be an option.
   2. For accounts identified to be connecting with TLS 1.2 cipher suites on the weak list above,
      targeted communications will be sent recommending a client-side upgrade to move to TLS 1.3 or
      strong TLS 1.2 ciphers. These accounts will continue to be able to use legacy cipher suites
      for up to 3 months following the targeted notification.
   3. After the 3 month notification period, a choice of strong TLS 1.2 cipher suites or TLS 1.3
      will be mandatory for both Public IP and Private Link / Private Service Connect access.

No action is required at this time, but this information is being provided now so
that clients may proactively upgrade their TLS client implementations ahead of the
targeted communication requiring an upgrade.

TLS 1.3 offers multiple improvements, including faster TLS handshakes and simpler, more secure cipher suites.
These changes provide better performance and stronger security.
Upgrading to a TLS 1.3 compatible client is highly recommended.

When will these changes be occurring?

The change in TLS server implementation has been rolling out gradually across all clouds and regions from July 2024 and is ongoing.

For customers using weak ciphersuites when connecting to Snowflake, communication will be sent out on the following timelines:

* May 2025 for customers using weak ciphersuites for public traffic.
* July 2025 for customers using weak ciphersuites over Private Link / Private Service Connect.

Customers have 6 months after receiving the notification to update their ciphersuite usage, after which support for weak ciphersuites will be dropped. As a result:

* On or about the first week of November 2025, weak ciphersuite support will be dropped for public traffic. Subject to change.
* On or about the first week of January 2026, weak ciphersuite support will be dropped for Private Link / Private Service Connect. Subject to change.

Ref: 1727

---
title: Changes to Apache Iceberg™ tables created from Delta files
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1852.md
section: Release Notes
---

# Changes to Apache Iceberg™ tables created from Delta files

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

For Iceberg tables created from Delta files in object storage:

Before the change:
:   Snowflake does not generate or write Iceberg metadata.

After the change:
:   * Snowflake generates Iceberg metadata and writes the metadata to the table’s storage location if you configure the external volume associated with the table to allow write access.

      If you don’t want Snowflake to write Iceberg metadata for the table, you can set the ALLOW_WRITES parameter to FALSE on your external volume as long as there are no Snowflake-managed Iceberg tables that use the same external volume.
    * The DESC ICEBERG TABLE command returns the `NAME_MAPPING` column if you configure Iceberg Compatibility V2 ([icebergCompatV2](https://github.com/delta-io/delta/blob/master/PROTOCOL.md#iceberg-compatibility-v2)) for the Delta table that your Iceberg table is based on.

Ref: 1852

---
title: Changes to the PREVENT_UNLOAD_TO_INLINE_URL and PREVENT_UNLOAD_TO_INTERNAL_STAGES parameters
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1841.md
section: Release Notes
---

# Changes to the PREVENT_UNLOAD_TO_INLINE_URL and PREVENT_UNLOAD_TO_INTERNAL_STAGES parameters

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

The following parameters behave as follows:

Before the change:
:   * PREVENT_UNLOAD_TO_INLINE_URL is an account-level parameter. Only users with the ACCOUNTADMIN role can set the parameter.
    * PREVENT_UNLOAD_TO_INTERNAL_STAGES is a user-level parameter. Any Snowflake user can set the parameter at the user level.

After the change:
:   * PREVENT_UNLOAD_TO_INLINE_URL is a user-level parameter. Only users with the ACCOUNTADMIN role can set the parameter (same as before the change).
    * PREVENT_UNLOAD_TO_INTERNAL_STAGES is an account and user-level parameter (user level takes precedence).
      However, only users with the ACCOUNTADMIN role can set the parameter.

Ref: 1841

---
title: Changes to XML parsing and emitting behavior
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1862.md
section: Release Notes
---

# Changes to XML parsing and emitting behavior

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, parsing and emitting XML content changes when using the
[COPY INTO <table>](../../../sql-reference/sql/copy-into-table.md) command with the XML file format and when calling the following
functions:

* [CHECK_XML](../../../sql-reference/functions/check_xml.md)
* [PARSE_XML](../../../sql-reference/functions/parse_xml.md)
* [TO_XML](../../../sql-reference/functions/to_xml.md)
* [XMLGET](../../../sql-reference/functions/xmlget.md)

Before the change:
:   XML parsing and emitting behavior:

    * Some queries that call the CHECK_XML function return a string with an error message.
    * Some queries that call the PARSE_XML function fail.

After the change:
:   XML parsing and emitting behavior:

    * Some queries that returned a string with an error message when calling the CHECK_XML function before the
      change now return NULL.
    * Some queries that failed when calling the PARSE_XML function before the change now succeed, and the function
      returns the parsed XML.
    * Queries with XML strings containing angle brackets or apostrophes return different results after the change.
    * Queries with XML strings containing white space or XML attributes relating to preserving white space return
      different results after the change.

The following sections provide more details about the changes.

## Parsing XML content that contains processing instructions

The following example uses the PARSE_XML function to parse XML content with question marks in the processing instructions:

```sqlexample
SELECT PARSE_XML('<?PITarget PIContent ??><mytag />') AS mytag;
```

Returned before the change::
:   ```output
    100100 (22P02): Error parsing XML: prematurely terminated XML document in processing instructions, pos 33
    ```

Returned after the change::
:   ```output
    +-----------------+
    | MYTAG           |
    |-----------------|
    | <mytag></mytag> |
    +-----------------+
    ```

## Parsing XML content that contains angle brackets or apostrophes

The following example uses the PARSE_XML function to parse XML content that contains angle brackets and
apostrophes in XML attribute values. After the change, apostrophes and angle brackets in the XML attribute
values are properly escaped in the return value and in the emitted XML:

```sqlexample
SELECT PARSE_XML('<mytag myattr="&lt;&gt;\'"/>') AS mytag;
```

Returned before the change::
:   ```output
    +------------------------------+
    | MYTAG                        |
    |------------------------------|
    | <mytag myattr="<>'"></mytag> |
    +------------------------------+
    ```

Returned after the change::
:   ```output
    +-----------------------------------------+
    | MYTAG                                   |
    |-----------------------------------------|
    | <mytag myattr="&lt;&gt;&apos;"></mytag> |
    +-----------------------------------------+
    ```

## Parsing XML content that contains user-defined entities

The following example uses the PARSE_XML function to parse XML content that contains user-defined
entities:

```sqlexample
SELECT PARSE_XML('<!DOCTYPE doc [<!ENTITY placeholder "some text">]><doc>&placeholder;</doc>')
  AS placeholder;
```

Returned before the change::
:   ```output
    100100 (22P02): Error parsing XML: unknown entity &placeholder;, pos 68
    ```

Returned after the change::
:   ```output
    +-------------------------------------------------------------+
    | PLACEHOLDER                                                 |
    |-------------------------------------------------------------|
    | <!DOCTYPE doc [<!ENTITY placeholder "some                   |
    | text">]><doc>some text</doc>                                |
    +-------------------------------------------------------------+
    ```

## Parsing XML content that preserves white space

This change was made so the behavior in Snowflake matches the
[XML specification](https://www.w3.org/TR/xml11/#sec-white-space) regarding preservation of whitespace:

* Before the change, whitespace is preserved for the `xsl:space="preserve"` attribute. After the change, whitespace
  isn’t preserved for the `xsl:space="preserve"` attribute.
* Before the change, whitespace isn’t preserved for the `xml:space="preserve"` attribute. After the change,
  whitespace is preserved for the `xml:space="preserve"` attribute.

The following example uses the PARSE_XML function to parse XML content and specifies the `xsl:space="preserve"`
attribute:

```sqlexample
SELECT PARSE_XML('<mytag xsl:space="preserve"> my content </mytag>')
  AS space_preserve;
```

Returned before the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xsl:space="preserve"> my content </mytag> |
    +--------------------------------------------------+
    ```

Returned after the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xsl:space="preserve">my content</mytag>   |
    +--------------------------------------------------+
    ```

The following example uses the PARSE_XML function to parse XML content and specifies the `xml:space="preserve"` attribute:

```sqlexample
SELECT PARSE_XML('<mytag xml:space="preserve"> my content </mytag>')
  AS space_preserve;
```

Returned before the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xml:space="preserve">my content</mytag>   |
    +--------------------------------------------------+
    ```

Returned after the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xml:space="preserve"> my content </mytag> |
    +--------------------------------------------------+
    ```

## Loading XML content that preserves white space

The following example loads data into a table using the COPY INTO <table> command. The PRESERVE_SPACE
parameter is set to TRUE to preserve white space:

```sqlexample
COPY INTO mytable
  FROM @my_xml_stage
  FILE_FORMAT = (TYPE = 'XML' PRESERVE_SPACE = TRUE);
```

Loaded content before the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xsl:space="preserve"> my content </mytag> |
    +--------------------------------------------------+
    ```

Loaded content after the change::
:   ```output
    +--------------------------------------------------+
    | SPACE_PRESERVE                                   |
    |--------------------------------------------------|
    | <mytag xml:space="preserve"> my content </mytag> |
    +--------------------------------------------------+
    ```

Before and after the change, the content preserves the white space, but the attribute changes from
`xsl:space="preserve"` to `xml:space="preserve"`.

Ref: 1862

---
title: Client Changes by Version
source: https://docs.snowflake.com/en/release-notes/client-change-log.md
section: Release Notes
---

# Client Changes by Version

These topics list the major changes made in each released version of SnowSQL (Snowflake CLI client) and the JDBC and ODBC drivers provided by Snowflake.

You can use this information to determine whether to upgrade to the latest version of a particular client; however, we highly recommend that you always use the latest versions
of any client software provided by Snowflake.

These topics do not include changes made to the other connectors and drivers provided by Snowflake (JDBC, Python, Spark, Node.js, etc.). For links to these changes, see the
**Related Info** sidebar (in this topic).

**Next Topics:**

* [SnowSQL Change Log (Prior to January 2022)](client-change-log-snowsql.md)
* [ODBC Driver Change Log (Prior to January 2022)](client-change-log-odbc.md)

---
title: Client versions & support policy
source: https://docs.snowflake.com/en/release-notes/requirements.md
section: Release Notes
---

# Client versions & support policy

Snowflake provides a CLI (command-line interface) as well as other client software (drivers, connectors, etc.) for connecting to Snowflake and using certain
Snowflake features (e.g. Apache Kafka for loading data, Apache Hive metadata for external tables). The clients must be installed on each local workstation or system from which
you wish to connect.

As needed, we release new versions of the clients to fix bugs, and introduce enhancements and new features. New versions are backward-compatible with existing Snowflake
features, but we do not guarantee that earlier versions are forward-compatible. As such, we recommend actively monitoring and maintaining the versions of your installed
clients; if they are not in-sync with the current version of Snowflake, you may encounter issues when connecting to and using Snowflake.

> **Attention:**
>
> For critical or important client changes (especially required security updates), Snowflake might require you to upgrade to the latest version. Please make sure to always check the [Release Notes](clients-drivers/monthly-releases.md) for the client drivers you’re using to see if there’s an important security fix in a particular version, and plan your driver upgrades accordingly.

For more information about determining the current version of a client or driver, refer to the following:

* [View the Snowflake client version](../user-guide/snowflake-client-version-check.md)
* [How to report on the Clients connecting to a Snowflake account?](https://community.snowflake.com/s/article/how-to-report-on-the-clients-connecting-to-a-snowflake-account)

All downloads on this page are considered “Client Software” as defined in your agreement for use of the Snowflake Service.

> **Attention:**
>
> Customers who use GCP (Google Cloud Platform) for authentication must update their clients and drivers to new
> minimum versions due to upcoming changes by Google for signing request headers and payloads.
> Snowflake recommends affected customers read the
> [FAQ: 2023 Client Driver deprecation for GCP customers](https://community.snowflake.com/s/article/faq-2023-client-driver-deprecation-for-GCP-customers) knowledge
> base article for more information.

## Recommended client versions

As a policy, Snowflake recommends that you always install the latest (i.e. most recent) version of each client,
if possible.

Snowflake uses semantic versioning for client and driver updates, excluding Snowpark APIs.

> **Note:**
>
> Snowflake’s support policy generally provides a minimum two-year window for clients and drivers, after which support might be dropped.
> To help you track supported versions, the following table includes the minimum version of clients and drivers Snowflake currently
> supports. If you use a version older than the minimum, Snowflake makes no commitment to provide support.

Once a client is installed, you are not required to upgrade each time a new version is released; however, to stay current with the
latest fixes, updates, and features, we recommend monitoring for new versions and upgrading at regular intervals (e.g. monthly,
quarterly, semiannually).

| Type | Client | Recommended Version | Minimum Supported Version (as of Feb 01, 2026) [1] [2] | Release Information | Where to Download the Installers [3] |
| --- | --- | --- | --- | --- | --- |
| CLI (Command-line Interface) | [Snowflake CLI](../developer-guide/snowflake-cli/index.md) | 3.16.0 (or later) | 1.2.5 | [Release Notes](clients-drivers/snowflake-cli.md) | [Snowflake CLI Download](https://sfc-repo.snowflakecomputing.com/snowflake-cli/index.html) page |
|  | [SnowSQL](../user-guide/snowsql.md) | 1.5.0 (or later) | 1.3.0 | [Release Notes](clients-drivers/snowsql.md) | [SnowSQL Download](https://developers.snowflake.com/snowsql/) page |
| Connectors and Drivers | [.NET Driver](../developer-guide/dotnet/dotnet-driver.md) | 5.5.0 (or later) | 2.2.0 | [Release Notes](clients-drivers/dotnet.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Go Snowflake Driver](../developer-guide/golang/go-driver.md) | 2.0.0 (or later) | 1.7.2 | [Release Notes](clients-drivers/golang.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Ingest Java SDK](../user-guide/snowpipe-streaming/snowpipe-streaming-classic-overview.md) | 4.4.2 (or later) | 2.2.0 | [Release Notes](clients-drivers/ingest-java-sdk.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | Ingest Python SDK | 1.0.10 (or later) | 1.0.5 | [Release Notes](https://github.com/snowflakedb/snowflake-ingest-python/releases) (in GitHub) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowpipe Streaming SDK (for high-performance architecture)](../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md) | 1.2.0 (or later) | 1.0.0 | [Release Notes](clients-drivers/snowpipe-streaming-sdk.md) | [Java SDK](https://central.sonatype.com/artifact/com.snowflake/snowpipe-streaming) | [Python SDK](https://pypi.org/project/snowpipe-streaming/) |
|  | [JDBC Driver](../developer-guide/jdbc/jdbc.md) | 4.1.0 (or later) | 3.14.5 | [Release Notes](clients-drivers/jdbc.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Node.js Driver](../developer-guide/node-js/nodejs-driver.md) | 2.3.6 (or later) | 1.9.3 | [Release Notes](clients-drivers/nodejs.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page [3] |
|  | [ODBC Driver](../developer-guide/odbc/odbc.md) | 3.16.0 (or later) | 3.2.0 | [Release Notes](clients-drivers/odbc.md) | [ODBC Download](https://developers.snowflake.com/odbc/) page |
|  | [PHP PDO Driver](../developer-guide/php-pdo/php-pdo-driver.md) | 3.6.0 (or later) | 2.0.1 | [Release Notes](clients-drivers/php-pdo.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowflake Connector for Kafka](../user-guide/kafka-connector.md) | 3.3.0 (or later) | 2.1.2 | [Release Notes](clients-drivers/kafka-connector.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowflake Connector for Python](../developer-guide/python-connector/python-connector.md) | 4.4.0 (or later) | 3.7.0 | [Release Notes](clients-drivers/python-connector.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page [3] |
|  | [Snowflake Connector for Spark](../user-guide/spark-connector.md) | 3.1.8 (or later) | 2.14.0 | [Release Notes](clients-drivers/spark-connector.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowflake SQLAlchemy (for Python)](../developer-guide/python-connector/sqlalchemy.md) | 1.9.0 (or later) | 1.5.1 | [Release Notes](clients-drivers/sqlalchemy.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page [3] |
| Snowpark | [Snowpark Library for Java](../developer-guide/snowpark/java/index.md) | 1.18.0 (or later) | 1.8.0 | [Release Notes](clients-drivers/snowpark-scala-java.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowpark Library for Python](../developer-guide/snowpark/python/index.md) | 1.44.0 (or later) | 1.0.0 | [Release Notes](clients-drivers/snowpark-python.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page [3] |
|  | [Snowpark Library for Scala](../developer-guide/snowpark/scala/index.md) | 1.18.0 (or later) | 1.8.0 | [Release Notes](clients-drivers/snowpark-scala-java.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowpark Connect for Spark](../developer-guide/snowpark-connect/snowpark-connect-overview.md) | 1.17.0 (or later) | 0.25.0 | [Release Notes](clients-drivers/snowpark-connect.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page |
|  | [Snowpark ML](../developer-guide/snowflake-ml/overview.md) | 1.27.0 (or later) | 0.3.0 | [Release Notes](clients-drivers/snowpark-ml.md) | [Drivers and Libraries](https://developers.snowflake.com/drivers-and-libraries/) page [3] |
| Other | [Snowflake Metadata Connector for Hive](../user-guide/tables-external-hive.md) | Latest | None (preview) | [Release Notes](https://github.com/snowflakedb/snowflake-hive-metastore-connector/releases) (in GitHub) | See [Integrate Apache Hive metastores with Snowflake](../user-guide/tables-external-hive.md). |

[1]

Minimum versions include updates that enable environment resilience to OCSP-related connectivity issues.

[2]

Minimum versions are not intended for installation.

[3]

Digitally signed to ensure the downloadable image is authentic.

> **Tip:**
>
> You can also use the [SYSTEM$CLIENT_VERSION_INFO](../sql-reference/functions/system_client_version_info.md) system function to retrieve this information programmatically.

## Minimum client versions

The minimum version for a client identifies the earliest supported version of the client. Any client versions lower than the documented minimum are no longer
covered by our support policy (see below) and may encounter issues when connecting to Snowflake.

> **Attention:**
>
> As stated in the Client Support Policy, Snowflake fixes issues on the latest client versions only.
> As such, the minimum versions might contain issues that have
> been fixed in later versions. Therefore, you should not install the minimum versions.
>
> The versions documented in the table above serve only as guidelines for managing your installed clients relative to
> the support policy.

## Client support policy

Snowflake maintains the following support policy for all clients provided by Snowflake:

* For all clients listed on this page, Snowflake generally supports each client version for at least two years, except in cases where a more recent version introduces
  critical fixes (e.g. for security or performance issues).

  Client versions that are below the minimum supported version might be blocked from connecting to Snowflake. Note that Snowflake
  will provide advance notification before blocking access for a particular client version.
* Unsupported versions might be removed from distribution (i.e. they may no longer be available for download/install).
* Snowflake provides bug fixes, new features, and required security updates only on the latest client versions. Likewise, when troubleshooting client issues,
  Snowflake verifies only against the latest client versions only.
* Snowflake ensures backward compatibility for APIs across all supported client versions.

> **Note:**
>
> This policy does not cover client connectors provided by third-party partners (Informatica, Tableau, etc.); please
> consult directly with the partners providing the
> connectors for information about their support policies.
>
> For more details about Snowflake’s third-party partners, see [Snowflake Ecosystem](../user-guide/ecosystem.md).

## Operating system support

> **Attention:**
>
> Snowflake plans to drop support for the following operating systems for all clients beginning April 1, 2026:
>
> * CentOS 7
> * macOS 11, 12, and 13
> * Ubuntu 16.04
>
> Additionally, Snowflake plans to drop Ubuntu 18.04 support specifically for the ODBC driver on x86.

The latest versions of most Snowflake clients are supported on the following operating systems:

| Operating System | Supported Versions |
| --- | --- |
| AIX | AIX 7.2 (JDBC only) |
| Linux | CentOS 7, 8 |
|  | Red Hat Enterprise Linux (RHEL) 7, 8, and, for selected clients, version 9 |
|  | Ubuntu 16.04, 18.04, 20.04 or later |
| macOS | 10.14 or later |
| Microsoft Windows | Microsoft Windows 8 or later |
|  | Microsoft Windows Server 2012, 2016, 2019, 2022 |

> **Note:**
>
> The supported version numbers change over time, based largely on the evolving support policies of the
> operating system vendors.

The following table shows which clients are available on which operating systems:

|  | Linux | macOS | Microsoft Windows | Notes |
| --- | --- | --- | --- | --- |
| .NET Driver | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 5.4.0. |
| Go Snowflake Driver | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 1.17.1. |
| Ingest Java SDK | ✔ | ✔ | ✔ |  |
| Ingest Python SDK | ✔ | ✔ | ✔ |  |
| Snowpipe Streaming SDK (for high-performance architecture) | ✔ | ✔ | ✔ | Supported architectures: ARM64 Mac, Windows, ARM64-Linux, and x86_64-Linux. Linux requires glibc version 2.26 or later. |
| Node.js Driver | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 2.3.2. |
| JDBC Driver | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 3.27.1. |
| ODBC Driver | ✔ | ✔ | ✔ | Linux support is based on the architecture, as follows:   * x86:    + Red Hat Enterprise Linux (RHEL) 7, 8, and 9 (starting with version 3.14.0). Note that ODBC v3.15.0 and later do not support RHEL 7 or earlier versions.   + CentOS 7. Note that ODBC v3.15.0 and later do not support CentOS 7 or earlier versions.   + Ubuntu versions 16.04 [4], 18.04 [5], and 20.04 or later * ARM64 (aarch64)    + Red Hat Enterprise Linux (RHEL) 8, and 9 (starting with version 3.14.0)   + CentOS 8   + Ubuntu 20.04   ODBC supports macOS 11.0 [4] and later.  ODBC does not support ARM64 architectures for Windows.  For Linux, ODBC v3.15.0 and later requires glibc 2.28+ and is incompatible with an OS that has an earlier version of glibc, such as Ubuntu 18.04 and earlier, RHEL 7 and earlier. Before upgrading to ODBC driver v3.15.0 or later, consult your operating system documentation to confirm it supports glibc version 2.28 or later. |
| PHP PDO Driver | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 3.5.0. |
| Snowflake Connector for Kafka | ✔ | ✔ | ✔ |  |
| Snowflake Connector for Python | ✔ | ✔ | ✔ | Red Hat Enterprise Linux (RHEL) 9 is supported starting with version 4.0.0. |
| Snowflake Connector for Spark | ✔ | ✔ | ✔ |  |
| Snowflake Library for Java | ✔ | ✔ | ✔ |  |
| Snowflake Library for Python | ✔ | ✔ | ✔ |  |
| Snowflake Library for Scala | ✔ | ✔ | ✔ |  |
| Snowflake ML | ✔ | ✔ | ✔ |  |
| SnowSQL | ✔ | ✔ | ✔ | Versions 1.3.3 and later require at least glibc version 2.25 on Linux, which might not be available on older operating systems, such as RHEL7. Consult your operating system documentation to confirm it supports glibc version 2.25 or later. |

[4]

Support for these operating systems will be dropped beginning April 1, 2026.

[5]

For ODBC driver v3.15.0 and later, support for these operating systems is not available.

## Operating system support policy

Snowflake typically obsoletes support for an operating system version in accordance with the support timeline stated
by the operating system vendor.

Snowflake typically provides three months’ notice before dropping support for a particular version of an operating system.

---
title: CLIENT_APPLICATION_ID field in SESSIONS view: Trim trailing whitespace in return value
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2226.md
section: Release Notes
---

# CLIENT_APPLICATION_ID field in SESSIONS view: Trim trailing whitespace in return value

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the return value of the `CLIENT_APPLICATION_ID`
field in the [SESSIONS](../../../sql-reference/account-usage/sessions.md) view in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema
changes as follows:

Before the change:
:   The `CLIENT_APPLICATION_ID` field may include trailing whitespace.

After the change:
:   The `CLIENT_APPLICATION_ID` field does not include trailing whitespace.

Ref: 2226

---
title: CLONE and CREATE … LIKE commands: Cloning and propagating DMF entity mappings
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1854.md
section: Release Notes
---

# CLONE and CREATE … LIKE commands: Cloning and propagating DMF entity mappings

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

DMF entity mappings are an association between a data metric function and a table column.
When this behavior change bundle is enabled, the behavior of the CLONE and CREATE … LIKE commands changes as follows:

Before the change:
:   When users clone a table, db, or schema, the new table has no data metric function associations. Users cannot clone the
    DMF entity mappings when executing the CLONE command to clone a database, schema, or table.
    When users create a table using the LIKE command and the source table has data metric associations, the new table has no
    DMF associations. Users cannot propagate the DMF entity mappings to the newly created table by executing the
    CREATE TABLE … LIKE command.

After the change:
:   When users clone a table, the new table will have DMF associations. Users can clone the DMF entity mappings from the
    source to the target object by executing the CLONE command.
    When users create a table using the LIKE command and the source table has data metric associations, the new table will
    have DMF associations. Users can materialize the DMF entity mappings from the original table to the new table by
    executing the CREATE … LIKE command.

Ref: 1854

---
title: Cloned Tables: Default Value for Columns Not Allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-948.md
section: Release Notes
---

# Cloned Tables: Default Value for Columns Not Allowed

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The default value for a column cannot be dropped if the column was added to a table using the ALTER TABLE command. This restriction prevents inconsistency between values in rows inserted before the column was added and rows inserted after the column was added.

If you create a clone of that table, the column with the DEFAULT value does not inherit the restriction in some cases.

Columns in cloned tables behave as follows:

Previously:
:   If a source table has a column with a default value that was added after table creation time (that is, using the [ALTER TABLE](../../../sql-reference/sql/alter-table.md) command), dropping the default value for that column is blocked.

    If a table is cloned from that source table, it might not inherit the restriction on dropping the DEFAULT value in some cases.

Currently:
:   The columns in the cloned table will inherit the intended behavior from its source table.

Ref: 948

---
title: Cloning: Alerts cloned when cloning databases or schemas
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1211.md
section: Release Notes
---

# Cloning: Alerts cloned when cloning databases or schemas

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, when you clone a database or schema, any [Snowflake alerts](../../../user-guide/alerts.md) in that database or
schema will also be cloned:

Previously:
:   When you clone a database or schema, any alerts in that database or schema are not cloned.

Currently:
:   When you clone a database or schema, any alerts in that database or schema will be cloned.

    The cloned alerts are [suspended](../../../user-guide/alerts.md).

Ref: 1211

---
title: Cloning: Table history not preserved on clone (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-908.md
section: Release Notes
---

# Cloning: Table history not preserved on clone (Postponed)

> **Attention:**
>
> This behavior change was originally in the 2023_07 bundle and intended to become enabled by default in the 2023_08 bundle. However, it has been postponed and a new release date has not been determined. This change is not available for testing.

Cloning a table also clones the load history associated with the table.

Before the change:
:   The load history is not cloned when a table is cloned. Files previously loaded into the source table could be reloaded into the cloned table.

After the change:
:   The load history is also cloned when a table is cloned. As a result, files are not reloaded and data is not duplicated in the cloned table. Note that truncating the table and using the `FORCE = TRUE` COPY option override the load history in the cloned table.

It’s recommended to check any existing workloads that require table cloning. If the load history information is not required and you want to bypass this behavior, use the `FORCE = TRUE` COPY option.

Ref: 908

---
title: COLUMNS view (multiple schemas): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2061.md
section: Release Notes
---

# COLUMNS view (multiple schemas): New column

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the COLUMNS view in Account Usage, Organization Usage, and
Information Schema includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `data_type_alias` | VARCHAR | The data type alias or synonym specified for the column when the table was created or when the column was last altered.  For columns in tables that were created before this behavior change and not altered after the behavior change, the value in this column is NULL. |

The `data_type` column shows the standard Snowflake data type of the column. The `data_type_alias`
column displays the original data type name that was specified for a column when a table was created, or when
the column was altered with an `ALTER TABLE table ALTER COLUMN column SET DATA TYPE data_type` statement.

For example, NUMBER is a standard Snowflake data type. BIGINT is synonymous with NUMBER. If you specify BIGINT
as the data type for a column, the `data_type` column in the COLUMNS view shows NUMBER for this column,
but the `data_type_alias` column shows BIGINT:

```sqlexample
CREATE TABLE test_data_type_alias (b BIGINT);

SELECT data_type, data_type_alias
  FROM INFORMATION_SCHEMA.COLUMNS
  WHERE table_name = 'TEST_DATA_TYPE_ALIAS';
```

```output
+-----------+-----------------+
| DATA_TYPE | DATA_TYPE_ALIAS |
|-----------+-----------------|
| NUMBER    | BIGINT          |
+-----------+-----------------+
```

When a standard, unqualified Snowflake data type is specified for a column, the values in the `data_type` and
`data_type_alias` columns are the same. For example, if NUMBER is specified as the data type of a column,
then `data_type` and `data_type_alias` in the COLUMNS view both show NUMBER as the data type:

```sqlexample
CREATE TABLE test_data_type_alias_2 (n NUMBER);

SELECT data_type, data_type_alias
  FROM INFORMATION_SCHEMA.COLUMNS
  WHERE table_name = 'TEST_DATA_TYPE_ALIAS_2';
```

```output
+-----------+-----------------+
| DATA_TYPE | DATA_TYPE_ALIAS |
|-----------+-----------------|
| NUMBER    | NUMBER          |
+-----------+-----------------+
```

The `data_type_alias` column shows the exact name that was specified for a data type. For example,
the following statement creates a table with a fully-qualified NUMBER column:

```sqlexample
CREATE TABLE test_data_type_alias_3 (n NUMBER(16, 2));

SELECT data_type, data_type_alias
  FROM INFORMATION_SCHEMA.COLUMNS
  WHERE table_name = 'TEST_DATA_TYPE_ALIAS_3';
```

```output
+-----------+-----------------+
| DATA_TYPE | DATA_TYPE_ALIAS |
|-----------+-----------------|
| NUMBER    | NUMBER(16, 2)   |
+-----------+-----------------+
```

The only exceptions are [data types for text strings](../../../sql-reference/data-types-text.md). The standard Snowflake data
type for text strings is VARCHAR, but the `data_type` column displays TEXT for these columns:

```sqlexample
CREATE TABLE test_data_type_alias_4 (
  c CHAR,
  s STRING,
  t TEXT,
  v VARCHAR,
  vq VARCHAR(25));

SELECT column_name, data_type, data_type_alias
  FROM INFORMATION_SCHEMA.COLUMNS
  WHERE table_name = 'TEST_DATA_TYPE_ALIAS_4'
  ORDER BY column_name;
```

```output
+-------------+-----------+-----------------+
| COLUMN_NAME | DATA_TYPE | DATA_TYPE_ALIAS |
|-------------+-----------+-----------------|
| C           | TEXT      | CHAR            |
| S           | TEXT      | STRING          |
| T           | TEXT      | TEXT            |
| V           | TEXT      | VARCHAR         |
| VQ          | TEXT      | VARCHAR(25)     |
+-------------+-----------+-----------------+
```

This behavior change affects any scripts or data loading processes that use `SELECT *` to query these views
and depend on a fixed number of columns. To avoid any disruptions, review your scripts and applications. Update any
queries on ACCOUNT_USAGE.COLUMNS, ORGANIZATION_USAGE.COLUMNS, and INFORMATION_SCHEMA.COLUMNS to specify the exact
columns needed, instead of using `SELECT *`.

Ref: 2061

---
title: Completed rollout of BYTES_BILLED column in history views (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2241.md
section: Release Notes
---

# Completed rollout of BYTES_BILLED column in history views (Pending)

In the [2025_05 bundle](../2025_05/bcr-2045.md), Snowflake introduced the `BYTES_BILLED` column to provide visibility into Snowpipe costs. This rollout is now being completed to ensure that all remaining accounts have access to this billing metadata.

## Details of the change

The following views now consistently include the `BYTES_BILLED` column across all accounts:

* [ACCOUNT_USAGE.PIPE_USAGE_HISTORY](../../../sql-reference/account-usage/pipe_usage_history.md)
* [ORGANIZATION_USAGE.PIPE_USAGE_HISTORY](../../../sql-reference/organization-usage/pipe_usage_history.md)
* [INFORMATION_SCHEMA.PIPE_USAGE_HISTORY](../../../sql-reference/functions/pipe_usage_history.md)
* [ACCOUNT_USAGE.COPY_HISTORY](../../../sql-reference/account-usage/copy_history.md)
* [ORGANIZATION_USAGE.COPY_HISTORY](../../../sql-reference/organization-usage/copy_history.md)
* [INFORMATION_SCHEMA.COPY_HISTORY](../../../sql-reference/functions/copy_history.md)

| Column name | Data type | Description |
| --- | --- | --- |
| `BYTES_BILLED` | NUMBER | The number of bytes Snowpipe uses for billing purposes. This provides visibility into Snowpipe’s cost implications within these history views. |

## Impact

The `BYTES_BILLED` column provides a direct metric for the data volume that is considered for billing, making it easier to monitor and manage Snowpipe-related expenses.

Ref: 2241

---
title: Compute pools: Deprecated instance types cannot be resumed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1824.md
section: Release Notes
---

# Compute pools: Deprecated instance types cannot be resumed

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

If you have a compute pool using a deprecated instance type, the following changes apply. Deprecated instance types have names ending with “_<number>”, such as “STANDARD_2” or “GPU_7”.

## Compute pools that use deprecated instance types cannot be resumed

Before the change:
:   You are able to resume a compute pool that uses a deprecated instance type.

After the change:
:   You are not able to resume a compute pool that uses a deprecated instance type. Any scenarios that rely on auto-resume of such compute pools will start to fail.

## Changes to billing for compute pools using deprecated instance types

Before the change:
:   If you were using these deprecated instance families, you have been billed for these deprecated instances under the pricing terms from the Private Preview phase.

After the change:
:   You are billed according to the standard rates for the equivalent supported instance families as outlined in the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

Ref: 1824

---
title: COPY_HISTORY View (Account Usage): “Load in progress” No Longer Shown in STATUS Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-986.md
section: Release Notes
---

# COPY_HISTORY View (Account Usage): “Load in progress” No Longer Shown in STATUS Column

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The [Account Usage COPY_HISTORY view](../../../sql-reference/account-usage.md) no longer shows the **Load_in_progress** status:

Previously:
:   The **Load_in_progress** status shown in the Account Usage COPY_HISTORY **view** causes confusion due to the 2-hour latency and append-only nature of the view.

    For example, when you load large files using Snowpipe or COPY INTO, even after the file is loaded, the status of the file in COPY_HISTORY view may show **Load_in_progress** and **Loaded** status at the same time.

Currently:
:   The Account Usage COPY_HISTORY view will no longer show **Load_in_progress** status.

Ref: 986

---
title: Cortex ML Functions - New column in single-series Forecasting and Anomaly Detection results
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-cortex-forecast-anomaly-detection-series-column.md
section: Release Notes
---

# Cortex ML Functions - New column in single-series Forecasting and Anomaly Detection results

The SERIES column now appears in all Time-Series Forecasting and Anomaly Detection results, instead of just multi-series
results. This change was rolled out in phases and completed on May 10, 2024.

|  |  |
| --- | --- |
| Before the change | The SERIES column appears only in multi-series Forecasting and Anomaly Detection results. It does not appear in single-series results. |
| After the change | The SERIES column appears in all Forecasting and Anomaly Detection results. In single-series results, this column is NULL in all rows. |

For more information on the affected functions, see:

> * [Anomaly Detection](../../../user-guide/ml-functions/anomaly-detection.md)
> * [Time-Series Forecasting](../../../user-guide/ml-functions/forecasting.md)

---
title: Cortex model deprecations for April 2026
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-april-model-deprecations.md
section: Release Notes
---

# Cortex model deprecations for April 2026

The following model deprecations are planned for April 2026. Customers using these models in
AI_COMPLETE, Agents API, or Snowflake Intelligence are expected to transition to an
alternative model in advance of the deprecation date to avoid disruption.

| Model | Deprecation date |
| --- | --- |
| OpenAI GPT o4-mini | April 16, 2026 |
| Claude Sonnet 3.7 | April 28, 2026 |
| Snowflake Arctic | April 28, 2026 |

If the named model parameter is not updated, queries or API calls that reference the deprecated
model will fail after the deprecation date.

The following models were deprecated in March.

| Model | Deprecation date |
| --- | --- |
| Claude Sonnet 3.5 | March 31, 2026 |
| Gemini Pro 3 | March 26, 2026 |

Ref: April-Model-Deprecations

---
title: Cortex Search Services: Replication of existing services
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2053.md
section: Release Notes
---

# Cortex Search Services: Replication of existing services

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

Before the change:
:   [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md) Services created before July 1, 2025 are not
    [replicated](../../../user-guide/account-replication-intro.md) even when they are included in a [replication group](../../../user-guide/account-replication-intro.md).

After the change:
:   When this behavior change bundle is enabled, Cortex Search Services created before July 1, 2025 are
    replicated when they are in a replication group.

This behavior change does not affect Cortex Search Services created on or after July 1, 2025.

Ref: 2053

---
title: CREATE <object> commands: Changes to error messages when creating an object in a share
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1623.md
section: Release Notes
---

# CREATE *<object>* commands: Changes to error messages when creating an object in a share

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

When you execute a [CREATE <object>](../../../sql-reference/sql/create.md) command to create an object in a
[share](../../../user-guide/data-sharing-intro.md), if an error occurs during the type-checking phase of the command, the command
prints an error message.

This error message is changing in the following way:

Before the change:
:   The command fails with the following error message, regardless of the cause of the error:

    ```output
    003540 (42501): SQL execution error:
      Creating table on shared database '<database_name>'
      is not allowed.
    ```

After the change:
:   The command fails with an error message that describes the specific problem that occurred.

    For example, if you do not have the privilege to operate on the schema, the command fails with the following error message:

    ```output
    003001 (42501): SQL access control error:
      Insufficient privileges to operate on schema '<schema_name>'
    ```

Ref: 1623

---
title: CREATE ALERT and ALTER ALERT commands: Some validation checks no longer performed on individual statements in conditions and actions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1246.md
section: Release Notes
---

# CREATE ALERT and ALTER ALERT commands: Some validation checks no longer performed on individual statements in conditions and actions

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, when you execute a [CREATE ALERT](../../../sql-reference/sql/create-alert.md) or [ALTER ALERT](../../../sql-reference/sql/alter-alert.md) statement,
some validation checks are no longer performed on the individual statements in the condition and action, including:

* The resolution of the identifiers for objects.
* The resolution of the data types of expressions.
* The verification of the number and types of arguments in a function call.

If a SQL statement for a condition or action specifies an invalid identifier, incorrect data type, incorrect number and types of
function arguments, etc., the statement will fail when the alert executes, as opposed to when you execute CREATE ALERT or ALTER
ALERT.

Previously:
:   When you execute the CREATE ALERT or ALTER ALERT command, some validation checks are performed on the condition and action.

    For example, if a statement in the condition or action specifies a non-existent table, the CREATE ALERT or ALTER ALERT command
    fails with an “Object does not exist” error.

Currently:
:   When you execute the CREATE ALERT or ALTER ALERT command, these validation checks will no longer be performed on the condition
    and action.

    For example, if a statement in the condition or action specifies a non-existent table, the CREATE ALERT or ALTER ALERT command
    will succeed.

    When you resume the alert, the condition or action will fail due to the reference to the non-existent table.

    To check for failures in the alert, use the [ALERT_HISTORY](../../../sql-reference/functions/alert_history.md) table function.

    You should verify the SQL expressions and statements for the condition and action before you specify these in an alert.

Ref: 1246

---
title: CREATE and ALTER DATABASE commands: Database names starting with “datacloud$” no longer allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1549.md
section: Release Notes
---

# CREATE and ALTER DATABASE commands: Database names starting with “datacloud$” no longer allowed

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The [CREATE DATABASE](../../../sql-reference/sql/create-database.md) and [ALTER DATABASE](../../../sql-reference/sql/alter-database.md) commands are changing as follows:

Before the change:
:   You can use the CREATE DATABASE command to create a database with a name that starts with `datacloud$`.

    You can also use the ALTER DATABASE command to change the name of a database to a name that starts with `datacloud$`.

After the change:
:   If you execute the CREATE DATABASE command to create a database with a name that starts with `datacloud$`, the following
    error occurs:

    ```output
    090841 (0A000): Database cannot have "DATACLOUD$" as prefix in its name.
    ```

    The same error occurs if you use the ALTER DATABASE command to change the name of a database to a name that starts with
    `datacloud$`.

    Note that the case of `datacloud$` does not matter. The error occurs if this prefix is in uppercase, lowercase, or mixed
    case.

Ref: 1549

---
title: CREATE DYNAMIC ICEBERG TABLE command: Write data types to table files
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1773.md
section: Release Notes
---

# CREATE DYNAMIC ICEBERG TABLE command: Write data types to table files

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

With this release, the [CREATE DYNAMIC ICEBERG TABLE](../../../sql-reference/sql/create-dynamic-table.md) command behaves as follows:

Before the change:
:   Snowflake-managed dynamic Apache Iceberg™ tables created with explicit column definitions do not write iceberg data types to table files.

After the change:
:   Snowflake-managed dynamic Apache Iceberg™ tables created with explicit column definitions write iceberg data types to table files.

    The following is an example of a Snowflake-managed dynamic Apache Iceberg™ table with explicit column definitions:

    ```sqlexample
    CREATE OR REPLACE DYNAMIC ICEBERG TABLE iceberg_dt (id int)
      WAREHOUSE = mywh
      TARGET_LAG = 'downstream'
      EXTERNAL_VOLUME = 'iceberg_default_volume'
      BASE_LOCATION = 'my_base_location'
      CATALOG = 'snowflake'
      AS
        SELECT id FROM base_table;
    ```

Ref: 1773

---
title: CREATE EXTERNAL TABLE command: Primary role requires stage access
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1993.md
section: Release Notes
---

# CREATE EXTERNAL TABLE command: Primary role requires stage access

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

Before the change:
:   Creating an external table succeeds if a user’s primary or secondary roles have the
    USAGE privilege on the stage referenced in the [CREATE EXTERNAL TABLE](../../../sql-reference/sql/create-external-table.md) command.

After the change:
:   Creating an external table succeeds only if a user’s primary role has the USAGE privilege on the
    stage referenced in the CREATE EXTERNAL TABLE command.

Ref: 1993

---
title: CREATE IMAGE REPOSITORY command: Change in the default encryption type
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2036.md
section: Release Notes
---

# CREATE IMAGE REPOSITORY command: Change in the default encryption type

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the default value of the ENCRYPTION parameter for the [CREATE IMAGE REPOSITORY](../../../sql-reference/sql/create-image-repository.md) command changes:

Before the change:
:   The default value is `SNOWFLAKE_SSE`.
    If you omit the ENCRYPTION parameter, Snowflake uses `SNOWFLAKE_SSE` encryption for binaries stored in the image repositories.

After the change:
:   The default value is `SNOWFLAKE_FULL`.
    If you omit the ENCRYPTION parameter, Snowflake uses `SNOWFLAKE_FULL` encryption for binaries stored in the image repositories.

> **Note:**
>
> The default value is changing for Snowflake accounts on AWS and Azure. This change does not apply to Snowflake accounts on Google Cloud.

Ref: 2036

---
title: CREATE INTEGRATION commands: ENABLED parameter defaults to TRUE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2166.md
section: Release Notes
---

# CREATE INTEGRATION commands: ENABLED parameter defaults to TRUE

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When you [create a Snowflake integration of any kind](../../../sql-reference/sql/create-integration.md), such as a security
integration, a catalog integration, or a storage integration, the behavior is as follows:

Before the change:
:   The ENABLED parameter defaults to FALSE if it is not explicitly specified. Integrations are disabled by default.

After the change:
:   The ENABLED parameter defaults to TRUE if it is not explicitly specified. Integrations are enabled by default.

This change is being made to align the default behavior with user expectations and to reduce the need for
[ALTER INTEGRATION](../../../sql-reference/sql/alter-integration.md) commands to enable newly created integrations.

Ref: 2166

---
title: CREATE ORGANIZATION LISTING and ALTER LISTING organization_targets field cannot be empty
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1963.md
section: Release Notes
---

# CREATE ORGANIZATION LISTING and ALTER LISTING `organization_targets` field cannot be empty

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

The `organization_targets` field specifies the roles and accounts that can access or discover the listing. To ensure consistency with the Snowsight, the following behavior is being changed:

Before the change:
:   Providers can enter empty values for the `organization_targets` field when publishing or altering an organization listing programmatically with the [CREATE ORGANIZATION LISTING](../../../sql-reference/sql/create-organization-listing.md) and [ALTER LISTING](../../../sql-reference/sql/alter-listing.md) commands.

After the change:
:   When this behavior change bundle is enabled, providers must specify a value for the `organization_targets` field when publishing or altering an organization listing programmatically with the [CREATE ORGANIZATION LISTING](../../../sql-reference/sql/create-organization-listing.md) and [ALTER LISTING](../../../sql-reference/sql/alter-listing.md) commands unless they are creating a draft organization listing.

Ref: 1963

---
title: CREATE USER command: NETWORK_POLICY parameter must specify a valid network policy
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-non-existing-network-policy.md
section: Release Notes
---

# CREATE USER command: NETWORK_POLICY parameter must specify a valid network policy

Trying to add a network policy when creating a user behaves as follows:

Before the change:
:   When setting the `NETWORK_POLICY` parameter with the CREATE USER command, a user could:

    * Specify a non-existent network policy.
    * Execute the CREATE USER command using a role that did not have or inherit the OWNERSHIP privilege on the network policy.

After the change:
:   The following must be true to set the `NETWORK_POLICY` parameter when executing the CREATE USER command:

    * The network policy must exist.
    * The user executing the CREATE USER command must use a role that has or inherits the OWNERSHIP privilege on the specified network policy.

Ref: n/a

---
title: CREATE … CLONE command: Cloning databases and schemas that contain hybrid tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1792.md
section: Release Notes
---

# CREATE … CLONE command: Cloning databases and schemas that contain hybrid tables

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

Given that [hybrid tables](../../../user-guide/tables-hybrid.md) have limited support for [cloning](../../../user-guide/object-clone.md), note the
following behavior when you attempt to clone a database or a schema that contains hybrid tables:

Before the change:
:   In general, [CREATE DATABASE … CLONE and CREATE SCHEMA … CLONE commands](../../../sql-reference/sql/create-clone.md) silently skip hybrid tables if any exist in the specified database or schema.

    CREATE DATABASE … CLONE commands do clone hybrid tables if no Time Travel parameters are specified in the command, or if an AT TIMESTAMP value is specified.

    For example, the following commands succeed but skip hybrid tables:

    ```sqlexample
    CREATE SCHEMA dst CLONE src;
    CREATE DATABASE dst CLONE src
      BEFORE (STATEMENT => '01b7676a-0002-d908-0000-a99500f6e00e');
    ```

    The following command succeeds and includes hybrid tables in the cloned database:

    ```sqlexample
    CREATE DATABASE dst CLONE src;
    ```

After the change:
:   CREATE SCHEMA … CLONE commands return an error if any hybrid tables exist in the specified schema. For example, the following command fails:

    ```sqlexample
    CREATE SCHEMA dst CLONE src;
    ```

    ```output
    392105 (0A000): SQL execution error: Cloning a SCHEMA which contains a HYBRID TABLE is unsupported. To perform the clone while skipping HYBRID TABLES, append the `IGNORE HYBRID TABLES` syntax to your DDL.
    ```

    The error prompts you to run the command using the [IGNORE HYBRID TABLES parameter](../../../sql-reference/sql/create-clone.md). When you use this parameter, the command will create the cloned schema but skip any hybrid tables. For example:

    ```sqlexample
    CREATE SCHEMA dst CLONE src IGNORE HYBRID TABLES;
    ```

    The behavior of CREATE DATABASE … CLONE commands that do not specify Time Travel parameters *does not change*. For example, the following command succeeds and includes hybrid tables in the cloned database:

    ```sqlexample
    CREATE DATABASE dst CLONE src;
    ```

    CREATE DATABASE … CLONE commands that use [Time Travel](../../../user-guide/data-time-travel.md) and specify the time with the STATEMENT parameter return an error if any hybrid tables exist in the specified database. For example, the following command fails:

    ```sqlexample
    CREATE DATABASE dst CLONE src
      BEFORE (STATEMENT => '01b7676a-0002-d908-0000-a99500f6e00e');
    ```

    ```output
    392106 (0A000): SQL execution error: Time Travel cloning a DATABASE which contains a HYBRID TABLE, when specifying the time via a `STATEMENT` is unsupported. To perform the clone while skipping HYBRID TABLES, append the `IGNORE HYBRID TABLES` syntax to your DDL.
    ```

    The error prompts you to run the command using the IGNORE HYBRID TABLES parameter. When you use this parameter, the command will create the cloned database but skip any hybrid tables. For example:

    ```sqlexample
    CREATE DATABASE dst CLONE src
      BEFORE (STATEMENT => '01b7676a-0002-d908-0000-a99500f6e00e')
      IGNORE HYBRID TABLES;
    ```

    Other CREATE DATABASE … CLONE commands that specify Time Travel parameters and do not use AT TIMESTAMP on a target database that contains hybrid tables either return an error or silently skip the hybrid tables:

    * If the bundle is enabled (either explicitly or by default), these CREATE DATABASE … CLONE commands return an error.
    * If the bundle is explicitly disabled, these CREATE DATABASE … CLONE commands silently skip hybrid tables.

    For more information, see [Clone databases that contain hybrid tables](../../../user-guide/tables-hybrid-clone.md).

Ref: 1792

---
title: CREATE, ALTER, and CREATE OR ALTER WAREHOUSE commands: Behavior change with new columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2225.md
section: Release Notes
---

# CREATE, ALTER, and CREATE OR ALTER WAREHOUSE commands: Behavior change with new columns in output

When this behavior change bundle is enabled, warehouse DDL commands use the GENERATION parameter to identify the generation of a warehouse.
You can no longer use the RESOURCE_CONSTRAINT parameter values ‘STANDARD_GEN_1’ or ‘STANDARD_GEN_2’ to identify the generation of a
warehouse.

Before the change:
:   Using any of [CREATE WAREHOUSE](../../../sql-reference/sql/create-warehouse.md), [ALTER WAREHOUSE](../../../sql-reference/sql/alter-warehouse.md), or [CREATE OR ALTER WAREHOUSE](../../../sql-reference/sql/create-warehouse.md)
    to SET the RESOURCE_CONSTRAINT property to STANDARD_GEN_1 or STANDARD_GEN_2, or to UNSET it from those values, was allowed.

After the change:
:   Warehouses that used the RESOURCE_CONSTRAINT parameter to identify the warehouse generation retain their values and settings. Use the GENERATION parameter
    to create new warehouses or alter existing warehouses. Existing records are not affected.

When enabled, this behavior change also adds new columns to the output of the [WAREHOUSE_EVENTS_HISTORY view](../../../sql-reference/account-usage/warehouse_events_history.md) in
the [ACCOUNT_USAGE schema](../../../sql-reference/account-usage.md) and the [QUERY_HISTORY view](../../../sql-reference/organization-usage/query_history.md) in the
[ORGANIZATION_USAGE schema](../../../sql-reference/organization-usage.md) and [ACCOUNT_USAGE schema](../../../sql-reference/account-usage.md):

WAREHOUSE_EVENTS_HISTORY in the ORGANIZATION_USAGE and ACCOUNT_USAGE schemas:

| Column name | Data type | Description |
| --- | --- | --- |
| `GENERATION` | TEXT | The type of warehouse generation.   * `1` if the warehouse is a generation 1 warehouse * `2` if the warehouse is a generation 2 warehouse |

QUERY_HISTORY in the ORGANIZATION_USAGE and ACCOUNT_USAGE schemas:

| Column name | Data type | Description |
| --- | --- | --- |
| `RESOURCE_CONSTRAINT` | TEXT | One of:   * `MEMORY_1X` * `MEMORY_1X_x86` * `MEMORY_16X` * `MEMORY_16X_x86` * `MEMORY_64X` * `MEMORY_64X_x86`   This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |

The following behavior change is introduced:

> Using CREATE WAREHOUSE, ALTER WAREHOUSE, or CREATE OR ALTER WAREHOUSE to SET the RESOURCE_CONSTRAINT property to STANDARD_GEN_1 or STANDARD_GEN_2, or to UNSET it from those values, generates an SQL error similar to:
>
> ```output
> Cannot set resource constraint to 'STANDARD_GEN_[12]'. Use the GENERATION property to set warehouse hardware generation.
>
> Cannot unset resource constraint from 'STANDARD_GEN_[12]'. Use the GENERATION property to unset warehouse hardware generation.
> ```

Ref: 2225

---
title: CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1722.md
section: Release Notes
---

# CURRENT_DATABASE and CURRENT_SCHEMA functions: Ensure deterministic outputs with policies, views, and UDFs

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The behavior of the [CURRENT_DATABASE](../../../sql-reference/functions/current_database.md) and [CURRENT_SCHEMA](../../../sql-reference/functions/current_schema.md) functions is as
follows:

Before the change:
:   The return values when calling the CURRENT_DATABASE or CURRENT_SCHEMA function are not deterministic:

    When you call the function inside of a data access policy, such as a masking or row access policy, the functions return one of two values:

    * The database or schema that contains the policy.
    * The database or schema in use in the session.

    When you call the function in the definition of a view or a UDF and the SELECT keyword does not precede the function, the function
    returns one of two values:

    * The database or schema in use in the session.
    * The database or schema that contains the UDF or the view.

After the change:
:   The return values when calling the CURRENT_DATABASE or CURRENT_SCHEMA function are deterministic:

    * When you call the function inside of a data access policy, such as a masking or row access policy, the functions return the database or
      schema that contains the protected table or view.
    * When you call the function in the definition of a view or a UDF, the function returns the database or schema that contains the UDF or
      the view.

    To minimize the impact of these changes, do the following:

    * If your view definition or UDF uses either of these functions and the SELECT keyword does not precede the function, double-check to
      ensure that the UDF definition is correct for how the function should be used.
    * If your policy calls either of these functions, double check to ensure that the body of the policy is written for the database or
      schema that contains the protected table and not the database or schema in use for the session.

> **Note:**
>
> The updates in this announcement were previously announced in the 2024_03 bundle. The behavior change process for these
> updates has restarted, beginning with the 2024_06 bundle.

Ref: 1722

---
title: Custom Classification: Replicate and clone instances
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1688.md
section: Release Notes
---

# Custom Classification: Replicate and clone instances

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The behavior of instances of the CUSTOM_CLASSIFIER class with respect to replication and cloning is as follows:

Before the change:
:   Instances are neither replicated nor cloned.

After the change:
:   Instances are replicated when you replicate the database that contains the instances. The instances are visible when you refresh the
    target account.

    Instances are cloned when you clone the schema that contains the instances.

    Instances of other [classes](../../../sql-reference-classes.md) are not affected.

Ref: 1688

---
title: Data Lake: Apache Iceberg™ string column length in CREATE TABLE AS SELECT (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2285.md
section: Release Notes
---

# Data Lake: Apache Iceberg™ string column length in CREATE TABLE AS SELECT (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

When you create an Apache Iceberg™ table string column using CREATE TABLE AS SELECT (CTAS), the column will always be created with the
maximum allowed length (128M). This change also extends to string type fields within structured types.

Before the change:
:   Apache Iceberg™ table string columns could be created with lengths less than 128M using CREATE TABLE AS SELECT, for example, when the source
    was a standard Snowflake table or an expression.

After the change:
:   Apache Iceberg™ table string columns will always be of length 128M when created using CREATE TABLE AS SELECT.

The Iceberg specification defines string types to be of arbitrary length, without a ceiling. In Snowflake, this is implemented as a
string type with a maximum length of 128M. Previously, it was possible to bypass this maximum length restriction during CREATE TABLE AS
SELECT. This behavior change now ensures that all string columns created in Iceberg tables via CREATE TABLE AS SELECT uniformly adhere to
the maximum length.

Ref: 2285

---
title: Data Lineage: VIEW LINEAGE privilege granted to the PUBLIC role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1933.md
section: Release Notes
---

# Data Lineage: VIEW LINEAGE privilege granted to the PUBLIC role

Users must use a role that has the VIEW LINEAGE privilege to execute the GET_LINEAGE function or use Snowsight to view the lineage
of data and machine learning pipelines. This change affects which roles have the VIEW LINEAGE privilege by default.

Before the change:
:   By default, only the ACCOUNTADMIN role has the VIEW LINEAGE privilege. The account administrator must grant the privilege to other roles
    to allow users to execute the GET_LINEAGE function and view lineage in Snowsight.

After the change:
:   The PUBLIC role has the VIEW LINEAGE privilege, which means a user can use any role to execute the GET_LINEAGE function and view lineage
    in Snowsight.

    This doesn’t mean that all roles and users can view lineage for all objects; users must still have privileges to access an object in
    order to view the lineage of that object.

After the change, if you want to limit who can access lineage, you’ll need to revoke the VIEW LINEAGE privilege from the PUBLIC role, then
grant the privilege to other, more specific roles.

* For information about revoking the VIEW LINEAGE privilege, see [REVOKE <privileges> … FROM ROLE](../../../sql-reference/sql/revoke-privilege.md).
* For information about granting the VIEW LINEAGE privilege to other roles, see [Access control for lineage information](../../../user-guide/ui-snowsight-lineage.md).

Ref: 1933

---
title: Data loading, data unloading, and file staging DML commands: Single-character pattern matches (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-209969.md
section: Release Notes
---

# Data loading, data unloading, and file staging DML commands: Single-character pattern matches (Postponed)

This behavior change was originally planned for **February, 2021**; however, it has been postponed and a new release date has not been determined.

For the most up-to-date details about the release date, as well as other release-related details, see the
[Behavior Change Log](../../behavior-changes.md).

The PATTERN parameter filters the set of staged files in the output of the following DML commands using a regular expression:

COPY INTO *<location>*

COPY INTO *<table>*

GET

LIST

REMOVE

In a future release, the behavior of the PATTERN parameter will change as follows:

Currently:
:   When the regular expression is matched against the file path, an additional internal path is incorrectly prepended to the file
    path. As a result, some regular expressions incorrectly match characters not included in the specified internal path.

    For example, the LIST command could filter filenames against a PATTERN regular expression that matches the letter “t”:

    ```sqlexample
    LIST @mystage pattern='.*t.*';
    ```

    This LIST statement returns all filenames in the stage, even if the files don’t contain the letter ‘t’, because the incorrectly
    prepended path contains the letter ‘t’.

    The source of the issue is an internal/hidden path that the commands apply to all files in a stage. The PATTERN regular
    expression includes this path when evaluating the filenames in the command output.

Pending:
:   The PATTERN parameter ignores the internal/hidden path when evaluating the filenames in the command output.
    The regular expression only matches the customer-created paths and filenames in the stage.

Ref: 209969

---
title: Data quality: DATA_METRIC_USER database role granted to the PUBLIC role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2155.md
section: Release Notes
---

# Data quality: DATA_METRIC_USER database role granted to the PUBLIC role

Users must use a role that has the SNOWFLAKE.DATA_METRIC_USER database role to work with system data metric functions (DMFs), which are used
to set up data quality checks. This change affects which roles have the SNOWFLAKE.DATA_METRIC_USER database role by default.

Before the change:
:   By default, only the ACCOUNTADMIN role has the SNOWFLAKE.DATA_METRIC_USER database role. The account administrator must
    grant the database role to other roles to allow users to work with system DMFs.

After the change:
:   The PUBLIC role is granted the SNOWFLAKE.DATA_METRIC_USER database role, which means a user can use any role to work with system DMFs,
    including associating a system DMF with an object.

Ref: 2155

---
title: Data quality: DATA_QUALITY_MONITORING_LOOKUP application role granted to the PUBLIC role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2068.md
section: Release Notes
---

# Data quality: DATA_QUALITY_MONITORING_LOOKUP application role granted to the PUBLIC role

Users must use a role that has the SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role to call the
[DATA_QUALITY_MONITORING_RESULTS](../../../sql-reference/functions/data_quality_monitoring_results.md) function, which lets you view data quality metrics and trends. This change
affects which roles have the SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role by default.

Before the change:
:   By default, only the ACCOUNTADMIN role has the SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role. The account administrator must
    grant the application role to other roles
    to allow users to call the DATA_QUALITY_MONITORING_RESULTS function.

After the change:
:   The PUBLIC role is granted the SNOWFLAKE.DATA_QUALITY_MONITORING_LOOKUP application role, which means a user can use any role to call the
    DATA_QUALITY_MONITORING_RESULTS function.

    This doesn’t mean that all roles and users can view data quality metrics for all objects; users must still have privileges to access an
    object in order to return results from the DATA_QUALITY_MONITORING_RESULTS function.

Ref: 2068

---
title: Data quality: Default schedule for data metric functions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2101.md
section: Release Notes
---

# Data quality: Default schedule for data metric functions

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

[Data metric functions](../../../user-guide/data-quality-intro.md) (DMFs) are associated with tables and views to run data quality checks on a
regular schedule. This change introduces the concept of a default schedule, and behaves as follows:

Before the change:
:   You must modify a table or view to set a schedule before you can associate a DMF with the object. There is no default schedule.

    If you run an ALTER <object> UNSET DATA_METRIC_SCHEDULE command, the DMF schedule is set to an empty string, which suspends all DMF
    evaluations on the table or view.

After the change:
:   Every table and view has a default schedule of one hour. All you need to do to run data quality checks at one-hour intervals is associate
    a DMF with an object. You can change the default at any time.

    If you run an ALTER <object> UNSET DATA_METRIC_SCHEDULE command, the DMF schedule is reset to the default of one hour.

    If you want to suspend all DMFs associated with an object, set the schedule to an empty string.

Ref: 2101

---
title: Data Sharing Usage Views: New CONSUMER_NAME Column in Select Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1097.md
section: Release Notes
---

# Data Sharing Usage Views: New CONSUMER_NAME Column in Select Views

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following views in the DATA_SHARING_USAGE schema include the CONSUMER_NAME column:

* LISTING_ACCESS_HISTORY
* LISTING_CONSUMPTION_DAILY
* LISTING_EVENTS_DAILY

Added column:

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONSUMER_NAME | VARCHAR | Contains the name of the consumer account that accessed, used, or requested a listing. If no name is available, such as for trial accounts, the value is NULL. |

To help minimize the impact of this addition, the column was added as the last column in the output.

Ref: 1097

---
title: DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2189.md
section: Release Notes
---

# DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled,
the [DATA_CLASSIFICATION_LATEST](../../../sql-reference/account-usage/data_classification_latest.md) view
in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema includes the
following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `last_classification_attempt` | TIMESTAMP_LTZ | Timestamp of the last sensitive data classification attempt. If greater than `last_classified_on`, the last attempt resulted in a failure. |
| `error_message` | VARCHAR | Error message from the last sensitive data classification attempt, if it resulted in a failure. |

Ref: 2189

---
title: DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): schemaId column name changed to schema_id
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1885.md
section: Release Notes
---

# DATA_CLASSIFICATION_LATEST view (ACCOUNT_USAGE): `schemaId` column name changed to `schema_id`

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

With this behavior change, the name of the `schemaId` column in the ACCOUNT_USAGE
[DATA_CLASSIFICATION_LATEST view](../../../sql-reference/account-usage/data_classification_latest.md) is changing to `schema_id`.

If you have SQL statements that refer to the column name `schemaId`, you must change those statements to use `schema_id` as
the column name.

Before the change:
:   Referring to the column name `schemaId` does not cause a SQL statement to fail. For example:

    ```sqlexample
    SELECT schemaId FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_LATEST LIMIT 1;
    ```

    ```output
    +----------+
    | SCHEMAID |
    |----------|
    | ...      |
    +----------+
    ```

After the change:
:   Referring to the column name `schemaId` causes a SQL statement to fail. For example:

    ```sqlexample
    SELECT schemaId FROM SNOWFLAKE.ACCOUNT_USAGE.DATA_CLASSIFICATION_LATEST LIMIT 1;
    ```

    ```output
    000904 (42000): SQL compilation error: error line 1 at position 7
    invalid identifier 'SCHEMAID'
    ```

Ref: 1885

---
title: Database Roles: Sharing Database Roles with Future Grants Not Allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1144.md
section: Release Notes
---

# Database Roles: Sharing Database Roles with Future Grants Not Allowed

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The behavior of future grants and database roles is as follows:

Previously:
:   You can grant future privileges to a database role and grant the database role to a share. There are two scenarios:

    1. Grant the privileges to the database role, and then grant the database role to the share.

       ```sqlexample
       GRANT SELECT ON FUTURE TABLES IN SCHEMA sh TO DATABASE ROLE dbr1;
       GRANT DATABASE ROLE dbr1 TO SHARE myshare;
       ```
    2. Grant the database role to a share, and then grant the future privileges to the database role.

       ```sqlexample
       GRANT DATABASE ROLE dbr1 TO SHARE myshare;
       GRANT SELECT ON FUTURE TABLES IN SCHEMA sh TO DATABASE ROLE dbr1;
       ```

    You can use the following commands to identify whether you have database roles that are affected by the pending changes:

    ```sqlexample
    SHOW FUTURE GRANTS IN DATABASE parent_db;
    SHOW FUTURE GRANTS IN shared_schema;
    ```

Currently:
:   You will not be able to grant future grants on objects when the database role is granted to a share. Snowflake returns a unique error
    message depending on the scenario that you try:

    1. With scenario one, the error message is:

       ```output
       Cannot share a database role with future grants to it.
       ```

       Use a [REVOKE <privileges> … FROM ROLE](../../../sql-reference/sql/revoke-privilege.md) statement to revoke the future grant from the database role. If necessary, update the
       [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md) statement so that it does not specify future grants. Finally, grant the database role to the
       share.
    2. With scenario two, the error message is:

       ```output
       Cannot grant future grants to a database role that is granted to a share.
       ```

       Modify the [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md) statement so that it does not specify future grants.

Ref: 1144

---
title: Database roles: Updated error messages when granting to a share
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1220.md
section: Release Notes
---

# Database roles: Updated error messages when granting to a share

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The error messages associated with granting a database role to a share using the [GRANT DATABASE ROLE … TO SHARE](../../../sql-reference/sql/grant-database-role-share.md)
command have changed.

In these tables, the word “resolve” means that the owner role of the database role (executing role) has the appropriate privilege to access
the object granted to the database role. For example, the owner role can resolve a table if the database role has the SELECT privilege on a
table with the USAGE privilege on the database and schema that stores the table and the owner role has the same privileges granted to it.

This table lists the error message replacements when granting a database role to a share:

| Behavior | Previously | Currently |
| --- | --- | --- |
| The executing role can resolve the object but the object cannot be shared. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object: SQL compilation error: A view can only be shared if it is created as a SECURE view, or marked SECURE using ALTER VIEW V SET SECURE. | Cannot share a database role that is granted privilege ‘SELECT’ on VIEW ‘DB.SCH.V’: SQL compilation error: A view can only be shared if it is created as a SECURE view, or marked SECURE using ALTER VIEW V SET SECURE. |
| The database role cannot resolve the object and the object is not shared. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object. | Cannot share a database role that is granted non-shareable privileges. Use role with MANAGE GRANTS to fix it. |
| The database role can resolve a dropped object that was not shared. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object: SQL compilation error: A view can only be shared if it is created as a SECURE view, or marked SECURE using ALTER VIEW VD SET SECURE. | Cannot share a database role that is granted privilege ‘SELECT’ on DROPPED View ‘DB.DSCH.V’. Use roles with MANAGE GRANTS to call the CLEANUP_DATABASE_ROLE_GRANTS(‘database_role_name’, ‘share_name’) to revoke the privileges and then grant the database role to the share. |
| The database role cannot resolve a dropped object that was not shared. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object. | Cannot share a database role that is granted non-shareable privileges. Use role with MANAGE GRANTS to fix it. |

Additionally, the system function [SYSTEM$CLEANUP_DATABASE_ROLE_GRANTS](../../../sql-reference/functions/system_cleanup_database_role_grants.md) helps to address the scenario when a
database role can resolve a dropped object that was not shared.

This table lists the error messages that are being removed when you try to grant a database role to a share.

| Behavior | Previous error message | Current result |
| --- | --- | --- |
| The database role cannot resolve the shared object. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object. | You can grant the database role to the share. Snowflake returns a successful status message. |
| The database role cannot resolve a dropped object that was shared. | Cannot share a database role that is granted privilege ‘SELECT’ on ‘Table’ object. | You can grant the database role to the share. Snowflake returns a successful status message. |

Ref: 1220

---
title: DATABASE_STORAGE_USAGE_HISTORY and STORAGE_USAGE views: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1333.md
section: Release Notes
---

# DATABASE_STORAGE_USAGE_HISTORY and STORAGE_USAGE views: New column

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

When this bundle is enabled, a new column is added to the following views:

* [DATABASE_STORAGE_USAGE_HISTORY (Account Usage)](../../../sql-reference/account-usage/database_storage_usage_history.md)
* [DATABASE_STORAGE_USAGE_HISTORY (ORGANIZATION_USAGE)](../../../sql-reference/organization-usage/database_storage_usage_history.md)
* [STORAGE_USAGE (Account Usage)](../../../sql-reference/account-usage/storage_usage.md)

Before the change:
:   The DATABASE_STORAGE_USAGE_HISTORY (Account Usage and ORGANIZATION_USAGE) and STORAGE_USAGE (Account Usage) views do not include a column for hybrid table storage bytes.

After the change:
:   The DATABASE_STORAGE_USAGE_HISTORY (Account Usage and ORGANIZATION_USAGE) view includes a new column:

    | Column Name | Data type | Description |
    | --- | --- | --- |
    | average_hybrid_table_storage_usage | FLOAT | Number of bytes of hybrid storage used. |

    The STORAGE_USAGE (Account Usage) view includes a new column:

    | Column Name | Data type | Description |
    | --- | --- | --- |
    | hybrid_table_storage_usage | FLOAT | Number of bytes of hybrid storage used. |

Ref: 1333

---
title: DATABASE_STORAGE_USAGE_HISTORY View (Organization Usage): New Columns in View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1036.md
section: Release Notes
---

# DATABASE_STORAGE_USAGE_HISTORY View (Organization Usage): New Columns in View

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following columns were added as the last columns of the DATABASE_STORAGE_USAGE_HISTORY view in the ORGANIZATION_USAGE
schema:

| Column Name | Data Type | Description |
| --- | --- | --- |
| DATABASE_ID | NUMBER | Internal/system-generated identifier for the database. |
| DELETED | TIMESTAMP_LTZ | Date and time when the database was dropped; NULL for active databases. |

These columns already exist in the DATABASE_STORAGE_USAGE_HISTORY view in the ACCOUNT_USAGE schema.

Ref: 1036

---
title: DATABASE_STORAGE_USAGE_HISTORY_VIEW (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2129.md
section: Release Notes
---

# DATABASE_STORAGE_USAGE_HISTORY_VIEW (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/account-usage/database_storage_usage_history.md) in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema and the [DATABASE_STORAGE_USAGE_HISTORY view](../../../sql-reference/organization-usage/database_storage_usage_history.md) in the [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) schema include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| AVERAGE_ARCHIVE_STORAGE_COOL_BYTES | FLOAT | Reserved for future use. |
| AVERAGE_ARCHIVE_STORAGE_COLD_BYTES | FLOAT | Reserved for future use. |
| AVERAGE_ARCHIVE_STORAGE_COOL_FAILSAFE_BYTES | FLOAT | Reserved for future use. |
| AVERAGE_ARCHIVE_STORAGE_COLD_FAILSAFE_BYTES | FLOAT | Reserved for future use. |

Ref: 2129

---
title: DATABASES and SCHEMATA views, and SHOW DATABASES and SHOW SCHEMAS commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1759.md
section: Release Notes
---

# DATABASES and SCHEMATA views, and SHOW DATABASES and SHOW SCHEMAS commands: New column in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, a new column, OBJECT_VISIBILITY, will be added to the following
[ACCOUNT_USAGE](../../../sql-reference/account-usage.md) views:

* [DATABASES](../../../sql-reference/account-usage/databases.md)
* [SCHEMATA](../../../sql-reference/account-usage/schemata.md)

OBJECT_VISIBILITY will also be available in the output of the [SHOW DATABASES](../../../sql-reference/sql/show-databases.md) and
[SHOW SCHEMAS](../../../sql-reference/sql/show-schemas.md) commands.

| Column Name | Data Type | Description |
| --- | --- | --- |
| OBJECT_VISIBILITY | OBJECT | The new column is appended to the existing views and is reserved for future use. |

Ref: 1759

---
title: DATABASES and SCHEMATA views: New columns and rows to include personal databases and replication information
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1869-1880.md
section: Release Notes
---

# DATABASES and SCHEMATA views: New columns and rows to include personal databases and replication information

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When personal databases are enabled and you query [ACCOUNT_USAGE.DATABASES](../../../sql-reference/account-usage/databases.md), [ACCOUNT_USAGE.SCHEMATA](../../../sql-reference/account-usage/schemata.md),
[INFORMATION_SCHEMA.DATABASES](../../../sql-reference/info-schema/databases.md), and [INFORMATION_SCHEMA.SCHEMATA](../../../sql-reference/info-schema/schemata.md)
views, you will see new information in the OWNER and TYPE columns, as well as a new columns OWNER_ROLE_TYPE and REPLICABLE_WITH_FAILOVER_GROUPS.

In addition, REPLICABLE_WITH_FAILOVER_GROUPS will indicate whether a database is replicable as part of a failover group.

| Column name | New or existing | Description |
| --- | --- | --- |
| OWNER | Existing | The owner for personal objects. |
| OWNER_ROLE_TYPE | Existing in ACCOUNT_USAGE and SCHEMATA views, new in INFORMATION_SCHEMA view | The owner role of the personal object. |
| TYPE | Existing | The type of personal object, such as PERSONAL DATABASE. |
| REPLICABLE_WITH_FAILOVER_GROUPS | New | Whether the database is replicable as part of a failover group.  Returns one of:   * NO * YES * UNSET |

For more information on personal databases, see [Personal Databases](../../../user-guide/personal-databases.md).

Ref: 1869, 1880

---
title: DATABASES view (Account Usage): New column DATA_QUALITY_MONITORING_SETTINGS (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2266.md
section: Release Notes
---

# DATABASES view (Account Usage): New column DATA_QUALITY_MONITORING_SETTINGS (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

When this behavior change bundle is enabled, the [DATABASES](../../../sql-reference/account-usage/databases.md) view in the
[ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `DATA_QUALITY_MONITORING_SETTINGS` | VARIANT | Data quality monitoring notification settings for the database, as set with the `DATA_QUALITY_MONITORING_SETTINGS` property in [ALTER DATABASE](../../../sql-reference/sql/alter-database.md). For more information, see [Configure database settings for data quality notifications](../../../user-guide/data-quality-notifications.md). |

Ref: 2266

---
title: DBT_PROJECT_EXECUTION_HISTORY function: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2233.md
section: Release Notes
---

# DBT_PROJECT_EXECUTION_HISTORY function: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [DBT_PROJECT_EXECUTION_HISTORY](../../../sql-reference/functions/dbt_project_execution_history.md) table function
in the [Snowflake Information Schema](../../../sql-reference/info-schema.md) includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| DBT_VERSION | INTEGER | The DBT project version. |
| DBT_SNOWFLAKE_VERSION | INTEGER | The Snowflake dbt project version. |

Ref: 2233

---
title: Dec 01, 2025: CORTEX_AISQL_USAGE_HISTORY Account Usage view (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-01-cortex-aisql-usage-history.md
section: Release Notes
---

# Dec 01, 2025: CORTEX_AISQL_USAGE_HISTORY Account Usage view (*General availability*)

The CORTEX_AISQL_USAGE_HISTORY view in the SNOWFLAKE.ACCOUNT_USAGE schema is now generally available. This view provides
detailed information about the usage of Cortex AI Functions in your SQL queries, providing finer-grained insights into
how AI features are being used in your account. For more information, see
[CORTEX_AISQL_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_aisql_usage_history.md).

---
title: Dec 02, 2025: Auto-fulfillment for listings that span databases (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-02-laf-listings-span-databases.md
section: Release Notes
---

# Dec 02, 2025: Auto-fulfillment for listings that span databases (*General availability*)

Providers can create listings on databases that reference views or tables across multiple other databases. By granting reference usage to a share, a single listing can span databases, eliminating the need to create one combined database per listing. This provides greater flexibility with listings, simplifies integration into existing architectures, and ensures that all listings associated with a database are auto-fulfilled together

For more information, see [Set up auto-fulfillment for a listing that spans databases](../../../collaboration/provider-listings-auto-fulfillment-setup-steps.md).

---
title: Dec 02, 2025: Optimize existing semantic views or models with verified queries (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-02-cortex-analyst-optimization.md
section: Release Notes
---

# Dec 02, 2025: Optimize existing semantic views or models with verified queries (*Preview*)

With Snowflake’s optimization feature, you can optimize existing semantic views and models using only verified queries. Snowflake automatically analyzes your verified queries to find useful information to add to the rest of the semantic layer. This optimization helps Cortex Analyst answer a broader range of questions correctly in addition to those that match with existing verified queries.

For more information, see [Optimize an existing semantic view or model with verified queries](../../../user-guide/snowflake-cortex/cortex-analyst/analyst-optimization.md).

---
title: Dec 02, 2025: Private connectivity for Apache Iceberg™ REST catalog integrations (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-02-iceberg-rest-catalog-private-connectivity.md
section: Release Notes
---

# Dec 02, 2025: Private connectivity for Apache Iceberg™ REST catalog integrations (*General availability*)

You can now set up an Apache Iceberg™ REST catalog integration to use outbound private connectivity.
This feature lets you connect to external Iceberg REST catalogs, such as generic Iceberg REST, Amazon Web Services (AWS) Glue Data Catalog,
and Databricks Unity Catalog, through private endpoints instead of the public internet.
This enhances security by keeping your network traffic within your cloud provider’s private network.

Private connectivity is only supported for catalog integrations on AWS that use AWS PrivateLink and
Microsoft Azure that use Azure Private Link.

For more information, see [Configure an Apache Iceberg™ REST catalog integration with outbound private connectivity](../../../user-guide/tables-iceberg-configure-catalog-integration-rest-private.md).

---
title: Dec 03, 2025: Access history improvements
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-03-access-history.md
section: Release Notes
---

# Dec 03, 2025: Access history improvements

[Access history](../../../user-guide/access-history.md) lets you monitor the SQL statements executed in Snowflake. It keeps track of the
following types of statements:

* Data Manipulation Language (DML) statements. For example, statements used to insert data into a table.
* Data Query Language (DQL) statements. For example, statements that use a SELECT statement to project data.
* Data Definition Language (DDL) statements. For example, statements that create or alter a Snowflake object.

Snowflake is expanding which SQL statements are included in the access history. Recent improvements include the following:

* Added support for the following objects: listing, role, share, and session.
* Added DQL command support for externally managed Apache Iceberg™ tables.
* Enhanced support for database DDL commands, including the ALTER DATABASE command and commands related to database replication.
* Enhanced DDL support for tables, including variations of ALTER TABLE and variations of ALTER TABLE…MODIFY COLUMN.
* Enhanced support for [file staging commands](../../../sql-reference/commands-file.md) like GET and PUT.

For a complete list of objects and commands that appear in your access history, see [Supported Objects](../../../user-guide/access-history.md).

---
title: Dec 04, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-04-dcr.md
section: Release Notes
---

# Dec 04, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 12.2

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Updates to private preview features.

---
title: Dec 08, 2025: AI_REDACT for automated redaction of PII (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-08-ai-redact-ga.md
section: Release Notes
---

# Dec 08, 2025: AI_REDACT for automated redaction of PII (*General availability*)

The AI_REDACT function, now generally available, detects and redacts personally identifiable information (PII) from
unstructured text data using a large language model (LLM). AI_REDACT automatically recognizes various categories of PII
(name, address, and so on, including partial PII such as first or last name) and replaces them with placeholders.

For example, passing the following string to AI_REDACT:

> “John Smith’s email is [jsmith@example.com](mailto:jsmith%40example.com) and he lives in San Francisco.”

Results in the following output:

> “[NAME]’s email is [EMAIL] and he lives in [ADDRESS].”

For more information, see [Detect and redact personally identifiable information (PII)](../../../user-guide/snowflake-cortex/redact-pii.md) and [AI_REDACT](../../../sql-reference/functions/ai_redact.md).

---
title: Dec 08, 2025: Dynamic tables: Support for dual warehouses
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-08-dynamic-tables-dual-warehouses.md
section: Release Notes
---

# Dec 08, 2025: Dynamic tables: Support for dual warehouses

Dynamic tables support dual warehouses to optimize performance and cost for different types of refresh operations. You can specify a dedicated
warehouse for [initializations and reinitializations](../../../user-guide/dynamic-tables-refresh.md), which are typically more resource-intensive,
while you use another warehouse for all other refreshes.

For more information, see [Understand warehouse usage for dynamic tables](../../../user-guide/dynamic-tables-warehouses.md).

---
title: Dec 08, 2025: Snowpipe simplified pricing
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-08-snowpipe-simplified-pricing.md
section: Release Notes
---

# Dec 08, 2025: Snowpipe simplified pricing

Snowflake is extending a significant Snowpipe enhancement to Enterprise and Standard Edition Snowflake accounts. Starting December 8, 2025, you’ll benefit from a simpler, more predictable Snowpipe pricing model that may significantly lower your data ingestion costs for most types of workloads. This update is applied automatically to your account.

Instead of a per-second/per-core compute charge and a per-1,000-files fee, you are now charged a fixed credit amount per gigabyte (0.0037 credits per GB) of data ingested with Snowpipe. This pricing makes it easier for you to estimate your Snowpipe costs.

Data ingested is calculated in the following ways:

* Text files, such as CSV and JSON, are billed on their uncompressed size.
* Binary files, such as Parquet and Avro, are billed on their observed size.

For a complete breakdown of the updated billing documentation and cost verification guidance, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf) and [Snowpipe costs](../../../user-guide/data-load-snowpipe-billing.md).

> **Note:**
>
> This simplified pricing model was previously rolled out to all Business Critical and VPS accounts on August 1, 2025. For more information, see [the previous release note](../9_21.md).

---
title: Dec 10, 2025: Cost anomalies (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-02-cost-anomalies-ga.md
section: Release Notes
---

# Dec 10, 2025: Cost anomalies (*General availability*)

Snowflake can automatically detect cost anomalies based on prior levels of consumption, which simplifies the process of
identifying spikes or dips in costs so you can find ways to optimize your spend. You can use this feature to identify both account-level and
organization-level cost anomalies.

For more information, see [Introduction to cost anomalies](../../../user-guide/cost-anomalies.md).

---
title: Dec 10, 2025: General availability of WORM backups
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-10-worm-backups.md
section: Release Notes
---

# Dec 10, 2025: General availability of WORM backups

WORM (Write Once, Read Many) backups are now generally available to all accounts.

Backups help organizations protect critical data against modification or deletion. Backups represent discrete
point-in-time copies of Snowflake objects. You choose which objects to back up (tables, schemas, or databases),
how frequently to back them up, how long to keep the backups, and whether to add a retention lock so that they
can’t be deleted prematurely.

Key use cases for backups include:

* **Regulatory compliance**: Backups with retention lock help organizations, financial institutions, and related
  industries address regulations that require records to be retained in an immutable format.
* **Recovery**: Backups help organizations create discrete copies to protect and recover business-critical data
  in case of accidental modifications or deletions.
* **Cyber resilience**: Backups with retention lock are part of an overall cyber-resilience strategy. They help
  organizations protect business-critical data during cyber attacks, especially ransomware attacks. The retention
  lock ensures that this data can’t be deleted by the attacker, even if they gain access to the account by using
  the ACCOUNTADMIN or ORGADMIN roles.

For more information, see [Backups for disaster recovery and immutable storage](../../../user-guide/backups.md).

## Terminology change

The feature is now called **backups** instead of snapshots. All SQL commands, views, and privileges use
**BACKUP** terminology:

* CREATE BACKUP POLICY, CREATE BACKUP SET
* ALTER BACKUP POLICY, ALTER BACKUP SET
* DROP BACKUP POLICY, DROP BACKUP SET
* SHOW BACKUP POLICIES, SHOW BACKUP SETS, SHOW BACKUPS IN BACKUP SET
* BACKUPS, BACKUP_POLICIES, BACKUP_SETS views in Account Usage, Organization Usage, and Information Schema
* APPLY BACKUP POLICY, APPLY BACKUP RETENTION LOCK privileges

The former SNAPSHOT/SNAPSHOTS names are still present but deprecated in favor of their BACKUP/BACKUPS equivalents.
For example:

* CREATE SNAPSHOT POLICY is deprecated; use CREATE BACKUP POLICY instead.
* SNAPSHOTS view is deprecated; use BACKUPS view instead.
* APPLY SNAPSHOT POLICY privilege is deprecated; use APPLY BACKUP POLICY privilege instead.

The deprecated commands, views, and privileges continue to work, but Snowflake intends to remove them in a future release.

---
title: Dec 11, 2025: Default pipe for Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-11-default-pipe.md
section: Release Notes
---

# Dec 11, 2025: Default pipe for Snowpipe Streaming with high-performance architecture

We are enhancing the high-performance Snowpipe Streaming architecture by introducing the default pipe capability.

This feature simplifies the data ingestion process by eliminating the need for you to create a pipe manually by using CREATE PIPE DDL statements. With this change, users can initiate streaming immediately against a target table. The default pipe is implicitly available for any table designed to receive streaming data.

The default pipe is system-generated and follows a specific naming convention that is derived from its target table:

| Attribute | Format | Example |
| --- | --- | --- |
| Default Pipe Name | `<TABLE_NAME>-STREAMING` | If your target table is named `MY_TABLE`, the default pipe is named `MY_TABLE-STREAMING`. |

The default pipe is entirely managed by Snowflake and has system-defined metadata; for example, its OWNER attribute is NULL.

When you configure the Snowpipe Streaming Client SDK, or REST API, you can reference this default pipe name in your client configuration to target the destination table directly.

For more information, see [Snowpipe Streaming key concepts](../../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

---
title: Dec 11, 2025: Interactive tables and interactive warehouses (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-11-interactive-tables-ga.md
section: Release Notes
---

# Dec 11, 2025: Interactive tables and interactive warehouses (*General availability*)

Interactive tables and interactive warehouses are now generally available. Together,
interactive tables and interactive warehouses deliver low-latency query performance for
high-concurrency, interactive workloads such as real-time dashboards and data-powered APIs.

Interactive tables are a new type of Snowflake table optimized for low-latency, interactive
queries. Interactive warehouses are a new type of warehouse that’s optimized for low-latency,
interactive workloads. You get the best performance when you query interactive tables using
interactive warehouses.

Currently, this feature is available in select Amazon Web Services (AWS)
[regions](../../../user-guide/interactive.md).

For more information, see the following topics:

* [Snowflake interactive tables and interactive warehouses](../../../user-guide/interactive.md)
* [CREATE INTERACTIVE TABLE](../../../sql-reference/sql/create-interactive-table.md)
* [CREATE INTERACTIVE WAREHOUSE](../../../sql-reference/sql/create-interactive-warehouse.md)

---
title: Dec 11, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-11-dcr.md
section: Release Notes
---

# Dec 11, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 12.3

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Lookalike audience modeling template removed from the clean rooms UI. The lookalike audience modeling template
  [is now available as a custom template](../../../user-guide/cleanrooms/lookalike-audience-modeling-template.md) for you to add, modify, and
  run in your account using the clean rooms API.
  Column Policy Optimization: Reduced latency when adding a column policy with multiple columns.
* Updates to private preview features.

---
title: Dec 11, 2025: Support for Streamlit in Snowflake container runtime (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-11-sis.md
section: Release Notes
---

# Dec 11, 2025: Support for Streamlit in Snowflake container runtime (Preview)

You can now run your Streamlit in Snowflake apps on containers.

For more information, see [Runtime environments for Streamlit apps](../../../developer-guide/streamlit/app-development/runtime-environments.md).

---
title: Dec 12, 2025: Private connectivity for internal stages on Google Cloud (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-12-gcp-pl-internal-stages.md
section: Release Notes
---

# Dec 12, 2025: Private connectivity for internal stages on Google Cloud (*General availability*)

Support for private connectivity to internal stages, which was previously supported on Amazon Web Services and Microsoft Azure is now generally available for Business
Critical accounts on Google Cloud.

For more information, see [Google Private Service Connect endpoints for internal stages](../../../user-guide/private-internal-stages-gcp.md).

---
title: Dec 15, 2025: Account Usage: New CATALOG_LINKED_DATABASE_USAGE_HISTORY view
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-15-catalog-linked-db-usage-history.md
section: Release Notes
---

# Dec 15, 2025: Account Usage: New CATALOG_LINKED_DATABASE_USAGE_HISTORY view

The new CATALOG_LINKED_DATABASE_USAGE_HISTORY view displays the credit usage for
[catalog-linked databases](../../../user-guide/tables-iceberg-catalog-linked-database.md).
It includes compute and cloud services credit usage for each entity during an operation.

For more information, see [CATALOG_LINKED_DATABASE_USAGE_HISTORY view](../../../sql-reference/account-usage/catalog_linked_database_usage_history.md).

> **Note:**
>
> Billing for catalog-linked databases started on December 15, 2025.

---
title: Dec 15, 2025: Vector aggregate functions
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-15-vector-aggregate-functions.md
section: Release Notes
---

# Dec 15, 2025: Vector aggregate functions

Snowflake now has vector aggregate functions that enable element-wise mathematical operations across multiple [VECTOR](../../../sql-reference/data-types-vector.md) values. These functions perform aggregation operations on columns of vectors, computing element-wise results across all vectors in a group.

Vector aggregate functions are essential for machine learning and data science workflows that require statistical operations on vector embeddings, such as computing centroids, finding ranges, or calculating averages across vector datasets. These functions ignore NULL in aggregation, preserve data types where possible, and are optimized for handling vector data.

The newly offered vector aggregation functions are:

* [VECTOR_SUM](../../../sql-reference/functions/vector_sum.md) – Compute the element-wise sum of vectors, preserving type.
* [VECTOR_MIN](../../../sql-reference/functions/vector_min.md) – Compute the element-wise minimum of vectors, preserving type.
* [VECTOR_MAX](../../../sql-reference/functions/vector_max.md) – Compute the element-wise maximum of vectors, preserving type.
* [VECTOR_AVG](../../../sql-reference/functions/vector_avg.md) – Compute the element-wise average of vectors, returning a vector containing [FLOAT](../../../sql-reference/data-types-numeric.md) elements.

For more information, see [Vector functions](../../../sql-reference/functions-vector.md).

---
title: Dec 16, 2024: Azure Private Link in Streamlit in Snowflake (General Availability)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-16-sis.md
section: Release Notes
---

# Dec 16, 2024: Azure Private Link in Streamlit in Snowflake (General Availability)

With this release, we are pleased to announce the general availability of Azure Private Link in Streamlit in Snowflake.

For more information, see [Private connectivity for Streamlit in Snowflake](../../../developer-guide/streamlit/object-management/privatelink.md).

---
title: Dec 16, 2025: Cortex Search multi-indexing and custom vector embedding (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-16-cortex-search-multi-index-preview.md
section: Release Notes
---

# Dec 16, 2025: Cortex Search multi-indexing and custom vector embedding (*Preview*)

Cortex Search services now support both multi-indexing and customized vector embeddings in preview. These features allow for more refined results from a Cortex Search service by allowing searches over multiple columns of data, in addition to customized vector embeddings. Embeddings are provided either automatically through a Snowflake-provided model, or you can bring your own model or pre-computed embeddings.

For more information on multi-indexed Cortex Search services, see [Multi-index Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

---
title: Dec 16, 2025: Notebooks in Workspaces (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-16-notebooks-in-workspaces.md
section: Release Notes
---

# Dec 16, 2025: Notebooks in Workspaces (*Preview*)

Snowflake Notebooks in Workspaces is now available in [preview](../../preview-features.md). This new notebook experience provides
a fully-managed, end-to-end environment for data science and machine learning development on Snowflake data, combining the familiar Jupyter
notebook interface with enterprise-grade compute, governance, and collaboration capabilities.

Notebooks in Workspaces runs on a Container Runtime powered by Snowpark Container Services, offering preconfigured containers optimized for AI/ML workloads with
access to CPUs and GPUs, parallel data loading, and distributed training APIs for popular ML packages.

## Key features

**Integration with Workspaces**

* Notebooks are files in Workspaces, enabling easy file management and organization.
* Git integration provides version control and collaboration across development environments.

**Updates to compute and cost management**

* CPU or GPU compute pools match your workload requirements.
* Shared container service connections reduce start-up time and improve resource utilization.
* Background kernel persistence ensures uninterrupted execution of long-running processes.
* Simplified idle time configuration prevents unused compute resources from running indefinitely.
* Service-level External Access Integration (EAI) management applies to all notebooks in the workspace.

**Jupyter compatibility**

* Standard Jupyter magic commands for familiar development experience.
* Pre-installed data science and machine learning packages.
* Install additional packages via `pip`, PyPI, or file upload.

**Enhanced editing experience**

* Bidirectional SQL and Python cell referencing for seamless language switching.
* Interactive datagrid and automated chart builder for data visualization.
* Enhanced minimap with cell status tracking and table of contents.

For details, see [Snowflake Notebooks in Workspaces](../../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md).

---
title: Dec 17, 2025 — Snowflake High Performance connector for Kafka (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-17-kafkahp-pupr.md
section: Release Notes
---

# Dec 17, 2025 — Snowflake High Performance connector for Kafka (*Preview*)

This release marks the public preview of Snowflake High Performance connector for Kafka.

The Snowflake High Performance connector for Kafka is a high-performance connector for Kafka that allows you to ingest data from Kafka topics into Snowflake tables.

The connector leverages Snowflake’s high-performance Snowpipe Streaming architecture to deliver multiple GB/s throughput with little latency. Key features include transparent billing, Rust-based performance improvements, in-flight transformations, server-side validation, and pre-clustering capabilities. PIPE objects serve as the central entry point for managing and configuring the streaming data ingestion process.

For more details, see [Snowflake High Performance Connector for Kafka](../../../connectors/kafkahp/about.md).

---
title: Dec 17, 2025: Schema evolution support for Snowpipe Streaming with high-performance architecture
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-17-schema-evolution-snowpipe-streaming.md
section: Release Notes
---

# Dec 17, 2025: Schema evolution support for Snowpipe Streaming with high-performance architecture

Snowflake announces support for automatic table schema evolution within the Snowpipe Streaming high-performance architecture. This feature lets your streaming pipelines seamlessly adapt to schema drift in near real-time, which eliminates the need for manual DDL intervention when new data attributes are introduced at the source.

To enable this feature, set `ENABLE_SCHEMA_EVOLUTION = TRUE` on your target table.

Key Features:

* Automatic column addition: New fields detected in the incoming stream are automatically added to the target table.
* Constraint management: Automatically drops NOT NULL constraints if incoming records are missing specific values.
* Seamless ingestion: Reduces pipeline failures caused by schema mismatches, ensuring continuous data availability.

Limitations:

* Table type: Support is limited to standard (native) Snowflake tables. External tables and Iceberg tables aren’t supported.
* Column modifications: Automatic column widening — increasing the precision, scale, or text length — isn’t supported.
* Data types: Schema evolution isn’t currently supported for structured types, which are structured OBJECT, ARRAY, or MAP columns. However, new columns that contain structured types are inferred as VARIANT, which enables JSON objects and arrays to be supported.

For more information, see:

* [Table schema evolution](../../../user-guide/data-load-schema-evolution.md)
* [Snowpipe Streaming with high-performance architecture](../../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md)

---
title: Dec 17, 2025: Snowflake Postgres (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-17-snowflake-postgres.md
section: Release Notes
---

# Dec 17, 2025: Snowflake Postgres (*Preview*)

Snowflake Postgres is now available in public preview. Snowflake Postgres lets you create, manage,
and use Postgres instances directly from Snowflake. Each instance runs a Postgres database server
on a dedicated virtual machine managed by Snowflake. You connect directly to your instances using
any Postgres client. Snowflake Postgres brings the reliable and trusted transactional database
capabilities of Postgres to the Snowflake data platform.

For more information, see the following topics:

* [Snowflake Postgres](../../../user-guide/snowflake-postgres/about.md)
* [Creating a Snowflake Postgres Instance](../../../user-guide/snowflake-postgres/postgres-create-instance.md)

---
title: Dec 18, 2025: Network rules and policies support Google Cloud Private Service Connect IDs (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-12-18-gcp-pscid-network-rules-and-policies.md
section: Release Notes
---

# Dec 18, 2025: Network rules and policies support Google Cloud Private Service Connect IDs (*General availability*)

You can now create Snowflake network rules and policies using Google Cloud Private Service Connect IDs.

> For more information, see the following topics:
>
> > * [Supported network identifiers](../../../user-guide/network-rules.md)
> > * [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md)
> > * [ALTER NETWORK RULE](../../../sql-reference/sql/alter-network-rule.md)

---
title: Dec 20, 2024: Support for Streamlit 1.39.0 (Preview)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-20-sis.md
section: Release Notes
---

# Dec 20, 2024: Support for Streamlit 1.39.0 (Preview)

With this release, we are pleased to announce the preview of support for version 1.39.0 of the Streamlit open-source library in Streamlit in Snowflake.

---
title: Dec 4, 2024: Azure Private Link in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-04-sis.md
section: Release Notes
---

# Dec 4, 2024: Azure Private Link in Streamlit in Snowflake (Preview)

With this release, we are pleased to announce the preview of Azure Private Link in Streamlit in Snowflake.

For more information, see [Private connectivity for Streamlit in Snowflake](../../../developer-guide/streamlit/object-management/privatelink.md).

---
title: December 01, 2023 — Streamlit in Snowflake Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/other/2023-12-01.md
section: Release Notes
---

# December 01, 2023 — Streamlit in Snowflake Release Notes

Streamlit in Snowflake — General Availability on AWS

With this release, we are pleased to announce the general availability of Streamlit in Snowflake on AWS, which was previously available as a preview feature.

For more information, see [About Streamlit in Snowflake](../../../developer-guide/streamlit/about-streamlit.md).

---
title: December 03, 2024 — Snowflake Native Apps in Azure Government regions — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-03-na-ga-gov-azure.md
section: Release Notes
---

# December 03, 2024 — Snowflake Native Apps in Azure Government regions — *Preview*

With this release, Snowflake is pleased to announce the general availability of Snowflake Native App
Framework support for Azure Government regions.

Providers publishing apps from government regions can only share listings within the same organization,
while consumers in government regions can install apps from the public marketplace.

For more information see [Understand limitations in the Snowflake Native App Framework](../../../developer-guide/native-apps/limitations.md).

---
title: December 03-05, 2024 — 8.45 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_45.md
section: Release Notes
---

# December 03-05, 2024 — 8.45 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Snowflake Scripting: Asynchronous child jobs — *Preview*

With this release, Snowflake Scripting (SQL) stored procedures can run queries concurrently as asynchronous child jobs. The query can be any
valid SQL statement, including SELECT statements and DML statements, such as INSERT or UPDATE.

To run a query as an asynchronous child job, add the ASYNC keyword to the query for a [RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md).
When this keyword is omitted, the stored procedure runs child jobs sequentially, and each child job waits for the running child job to finish
before it starts.

Running multiple child jobs concurrently can improve efficiency and reduce overall run time.

For more information, see [Assigning a query to a declared RESULTSET](../../developer-guide/snowflake-scripting/resultsets.md).

## Extensibility updates

### Profiling Python stored procedure handlers — *Preview*

With this release, you can discover how much time or memory was spent executing your stored procedure handler code by using the built-in code
profiler. The profiler generates information describing how much time or memory was spent executing each line of Python code.

For more information, see [Profiling Python procedure handler code](../../developer-guide/stored-procedure/python/procedure-python-profiler.md) (for SQL API) and
[Profiling Snowpark Python stored procedure handlers](../../developer-guide/snowpark/python/profiling-procedure-handlers.md) (for Python API).

### Java 17 support — *General Availability*

With this release, we are pleased to announce general availability of Java 17 in Snowpark. You can now create and run stored procedures and
UDFs using Java 17. The Snowpark API and JDBC Driver have also been updated to support Java 17.

For more information, see [Snowflake Java Runtime Support](../../developer-guide/java-runtime-support-policy.md).

## Data pipeline updates

### Dynamic tables: Unlimited inputs

With this release, you can define dynamic tables that read from an unlimited number of tables or dynamic tables. Previously, dynamic table
definitions were limited to querying up to 100 tables or dynamic tables.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-Nov-24 |
|  |  |  |

---
title: December 04-05, 2023 — 7.43 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_43.md
section: Release Notes
---

# December 04-05, 2023 — 7.43 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### Finalizer Task — *General Availability*

With this release, we are pleased to announce the general availability of the finalizer task. A finalizer task handles the release and cleanup of resources that a DAG uses. You can create a finalizer task that is associated with a root task or change an existing standalone task to a finalizer task.

The finalizer task is guaranteed to run regardless of the DAG’s success or failure and ensures proper resource cleanup and completion of necessary steps in all scenarios. For example, if a DAG run uses intermediate tables to track data for processing and fails before the table rows are consumed, the next run will encounter duplicate rows and reprocess data resulting in longer execution time or wasting compute resources. The finalizer task can address this issue by dropping the rows or truncating the table as needed.

For more information, see [Finalizer Task](../../user-guide/tasks-graphs.md).

## SQL Updates

### New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Semi-structured Data Functions (Array/Object) | [ARRAYS_TO_OBJECT](../../sql-reference/functions/arrays_to_object.md) | Returns an object that contains the keys specified by one input array and the values specified by another input array. |

## Extensibility Updates

### Python Snowpark Local Testing Framework — *Preview*

With this release, we are pleased to announce the Snowpark Python local testing framework as a preview feature to all accounts. The Snowpark Python local testing framework allows you to create and operate on Snowpark Python DataFrames locally without connecting to a Snowflake account. You can use this to test your DataFrame operations locally, on your development machine or in a CI (continuous integration) pipeline, before deploying code changes to your account. The API is the same, so you can either run your tests locally or against a Snowflake account, without making code changes.

For more information, see [Local testing framework](../../developer-guide/snowpark/python/testing-locally.md).

## Web Interface Updates

### Load Files onto Stages and Managed Staged Files using Snowsight — *General Availability*

With this release, we are pleased to announce the general availability of the following Snowsight features:

* Loading files onto internal stages.
* Browsing files on an internal or external stage.

With Snowsight, you can load files onto internal named stages and prepare to load data into tables or load dependencies for Python worksheets. You can also use Snowsight to view and manage staged files.

For more information, see [Staging files using Snowsight](../../user-guide/data-load-local-file-system-stage-ui.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 04-Dec-23 |
| *Load Files onto Stages and Managed Staged Files using Snowsight* | **Added** to *Web Interface Updates* | 05-Dec-23 |
| *New SQL Functions* | **Updated** to include ARRAYS_TO_OBJECT | 07-Dec-23 |
| *Paused SQL Functions* | **Paused** the `SYSTEM$CLIENT_VERSION_INFO` system function from general availability until a future release. | 19-Dec-23 |

---
title: December 05, 2024 — Private Notebooks in a Personal Database — Deprecated
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-05-personal-db-private-nb.md
section: Release Notes
---

# December 05, 2024 — Private Notebooks in a Personal Database — *Deprecated*

With this release, we are pleased to announce the preview of private notebooks in a personal database.

In Snowsight, you can create a private, user-owned notebook. This private notebook is stored in your personal database: a dedicated
workspace where you can create, modify, and manage your private notebooks.

---
title: December 05, 2024 — Snowflake Cortex Powered Descriptions — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-05-cortex-descriptions.md
section: Release Notes
---

# December 05, 2024 — Snowflake Cortex Powered Descriptions — *General Availability*

We are pleased to announce the general availability of being able to sign in to Snowsight and use Snowflake Cortex to generate
descriptions for tables, views, and columns. Snowflake Cortex leverages large language models to generate the descriptions based on object
metadata and, optionally, sample data.

For more information, see [Generate descriptions with Snowflake Cortex](../../../user-guide/ui-snowsight-cortex-descriptions.md).

---
title: December 09, 2024 — Organizational listings: Discovery and access — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-09-dbna.md
section: Release Notes
---

# December 09, 2024 — Organizational listings: Discovery and access — *Preview*

Snowflake would like to announce the preview of enhancements to Organizational listing: discovery and access.

With this release, Listing creators can now configure both the access and discovery of [organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md).
Listing creators can now define how various accounts and roles in their organization can discover and access the organizational listings.
For example, to specify who can discover and request access to a listing, the Listing owner can select the entire organization,
a list of accounts, or a specific role(s) in an account.
Similarly, the listing creator can also specify who can access the Listing directly in the Internal Marketplace.
The possible values are the entire organization, a list of accounts, or a specific role(s) in an account.

For more information see [Create an organizational listing](../../../user-guide/collaboration/listings/organizational/org-listing-create.md).

> **Note:**
>
> The rollout of this feature is scheduled to begin on December 9th and continue for approximately 6 weeks.
> For questions, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) or your Snowflake representative.

---
title: December 09, 2024 — Snowflake Native Apps with Azure Private Link support —– Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-09-na-az-privatelink.md
section: Release Notes
---

# December 09, 2024 — Snowflake Native Apps with Azure Private Link support —– *Preview*

Snowflake is pleased to announce the preview of Azure Private link in the
Snowflake Native Framework. Snowflake Native Apps can be deployed and operated with Private Link
connectivity in Azure, enabling secure network isolation.

For more information see [Understand limitations in the Snowflake Native App Framework](../../../developer-guide/native-apps/limitations.md).

---
title: December 09, 2024 — Using block storage with Snowpark Container Services job services — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-09-spcs-block-storage-for-jobs-in-preview.md
section: Release Notes
---

# December 09, 2024 — Using block storage with Snowpark Container Services job services — *Preview*

With this release, we are pleased to announce a preview of support for using block storage volumes with Snowpark Container Services job services.

For more information, see [Using block storage volumes with services](../../../developer-guide/snowpark-container-services/block-storage-volume.md).

---
title: December 09-13, 2024 — 8.46 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_46.md
section: Release Notes
---

# December 09-13, 2024 — 8.46 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Restricted caller’s rights — *Preview*

With this release, we are pleased to announce the preview of restricted caller’s rights. Previously, when an executable like a stored procedure
ran with the privileges of the caller of the executable (caller’s rights), the executable could run with all of the caller’s privileges.
Restricted caller’s rights allows an executable to run with caller’s rights, but restricts which of the caller’s privileges the executable
runs with. Administrators use caller grants to specify which of the caller’s privileges the executable can use to perform an operation.

For more information, see [Restricted caller’s rights](../../developer-guide/restricted-callers-rights.md).

## Snowsight updates

### New login screen version

With this release, a new version of the login screen is now available, offering improved performance and enhanced security. The updated login
screen retains the same behavior and appearance as before, requiring no configuration or routing updates.

For more information, see the [Snowflake Knowledge Base](https://community.snowflake.com/s/article/New-SIgn-In-Screen).

## SQL Updates

### New SQL functions

The following function(s) are now generally available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Information Schema, Table Functions | [AVAILABLE_LISTING_REFRESH_HISTORY](../../sql-reference/functions/available_listing_refresh_history.md) | Returns the past 14 days of refresh history for a database mounted from a listing using cross-cloud auto-fulfillment. |

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 06-Dec-24 |
| AVAILABLE_LISTING_REFRESH_HISTORY | New function | 13-Dec-24 |

---
title: December 12, 2024 — Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-12-document-ai.md
section: Release Notes
---

# December 12, 2024 — Document AI release notes

With this release, we are pleased to announce the following improvements in Document AI:

* Doubling the length limit of the answers provided by the model. The model can now return answers that are up to 512 tokens long (about 320 words) per question.
* Re-ordering values and answers in the Document AI user interface by dragging them is now possible. This makes the evaluation process easier, as the defined values can follow a certain order.
* Performance improvements.

  For more information, see [2024 Performance Improvements](../../performance-improvements-2024.md).

These improvements are available both to new Document AI model builds as well as the already existing ones.

---
title: December 12–14, 2023 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/ui/2023-12-12.md
section: Release Notes
---

# December 12–14, 2023 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Recover worksheets for dropped users — *Preview*

With this release, we are pleased to announce the preview of recovering Snowsight worksheets for users that have been dropped
from Snowflake. You can recover up to 500 worksheets for each dropped user.

For more details, see [Recover worksheets owned by a dropped user](../../../user-guide/ui-snowsight-worksheets.md).

## View Query History in worksheets —– *General Availability*

With this release, we are pleased to announce the general availability of Query History in worksheets in Snowsight.
When you view Query History for a worksheet, you can review the queries run in a Snowsight worksheet, as well as the query results.

For more information, see [View query history](../../../user-guide/ui-snowsight-query.md).

---
title: December 14-15, 2023 — 7.44 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_44.md
section: Release Notes
---

# December 14-15, 2023 — 7.44 Release Notes

> **Attention:**
>
> The release has completed. There are no differences between the in-advance and final versions of the release notes.

## New Features

### Organization Usage: Improved views for billing reconciliation — *General Availability*

With this release, we are pleased to announce that the following views in the Organization Usage schema have been improved to make it easier to reconcile Snowflake usage with monthly billing statements:

* CONTRACT_ITEMS
* RATE_SHEET_DAILY
* REMAINING_BALANCE_DAILY
* USAGE_IN_CURRENCY_DAILY

During a transition period, only new accounts have these upgraded views available by default. To inquire about upgrading your account, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

* [CONTRACT_ITEMS view](../../sql-reference/organization-usage/contract_items.md)
* [RATE_SHEET_DAILY view](../../sql-reference/organization-usage/rate_sheet_daily.md)
* [REMAINING_BALANCE_DAILY view](../../sql-reference/organization-usage/remaining_balance_daily.md)
* [USAGE_IN_CURRENCY_DAILY view](../../sql-reference/organization-usage/usage_in_currency_daily.md)

## SQL Updates

### Snowflake Cortex ML-Based Time-Series Functions — *General Availability*

With this release, we are pleased to announce the general availability of the Snowflake Cortex ML-Based functions
Forecasting (SNOWFLAKE.ML.FORECAST) and Anomaly Detection (SNOWFLAKE.ML.ANOMALY_DETECTION), which were previously
available as preview features. These functions use a machine learning model trained on your historical data to make
predictions and detect unexpected events. You can also obtain evaluation metrics and feature importance data for these
models to learn what factors are driving trends and causing anomalies.

For more information, see [Forecasting](../../user-guide/ml-functions/anomaly-detection.md) and
[Anomaly Detection](../../user-guide/ml-functions/anomaly-detection.md).

## Ecosystem Updates

### Snowpark ML Modeling API — *General Availability*

With this release, we are pleased to announce the general availability of the Snowpark ML Modeling API, which was
previously available as a preview feature. Snowpark ML Modeling lets you train Python models inside Snowflake using APIs similar to those provided by Scikit-Learn,
LightGBM, and XGBoost. Many preprocessing classes run in distributed fashion, on as many nodes as are available in
your warehouse, cutting runtime significantly.

This feature is available in Snowpark ML 1.1.1 and later. For more information, see
[Snowpark ML Modeling](../../developer-guide/snowflake-ml/modeling.md).

### Snowpark ML Distributed Hyperparameter Optimization — *Preview*

With this release, we are pleased to announce the preview of distributed hyperparameter optimization in the
Snowpark ML Modeling API. Hyperparameter optimization allows you to find the best parameters for your models and can now be run in
distributed fashion, cutting runtime significantly. Distributed processing is enabled by default, but may be disabled.

This feature is available in Snowpark ML 1.1.1 and later. For more information, see
[Snowpark ML Modeling](../../developer-guide/snowflake-ml/modeling.md).

## Data Lake Updates

### Cross-Cloud/Cross-Region Support for Apache Iceberg™ Tables — *Preview*

With this release, we are pleased to announce cross-cloud/cross-region support for Apache Iceberg™ tables in Snowflake that use an external Iceberg catalog.

For more information, see [Cross-cloud/cross-region support](../../user-guide/tables-iceberg.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 11-Dec-23 |
| *Snowflake Cortex ML-Based Time-Series Functions* | **Added** to *SQL Updates* | 19-Dec-23 |
| *Snowpark ML Modeling API*  *Snowpark ML Distributed Hyperparameter Optimization* | **Added** to *Ecosystem Updates* | 19-Dec-23 |
| *Cross-Cloud/Cross-Region Support for Apache Iceberg™ Tables* | **Added** to *Data Lake Updates* | 12-Jan-24 |

---
title: December 15, 2023 — Cost Management Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/other/2023-12-15.md
section: Release Notes
---

# December 15, 2023 — Cost Management Release Notes

This document provides an overview of the new features related to cost management in Snowsight.

## Cost Management: Account Overview Page — *Preview*

With this release, we are pleased to announce the preview of a new Account Overview page in Snowsight that allows you to
gain high-level insights into the cost of using Snowflake. It improves visibility into incurred costs and provides information that can be
a starting off point for reporting and optimizing your spend. For example, you can view your total spend for a time period in dollars and
credits, and discover what is contributing to your costs, such as top warehouses by spend and most expensive queries.

Only users with the ACCOUNTADMIN role can view the Account Overview page.

For more details about using the Account Overview page, see [Overview of account-level costs](../../../user-guide/cost-exploring-overall.md).

As part of this change, all Snowsight pages related to cost have been grouped together under Admin » Cost Management
in the left navigation bar. This changes:

* Where you work with budgets and resource monitors. Both pages are found as tabs under Admin » Cost Management. Previously,
  resource monitors were accessed from Admin » Resource Monitors.
* Where to find usage information that allows you to drill down into incurred costs. Previously, this information was found on the
  Usage page, but now this same information is found by selecting the Consumption tab under Admin »
  Cost Management.

---
title: December 16-18, 2024  — 8.47 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_47.md
section: Release Notes
---

# December 16-18, 2024 — 8.47 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Table | [ICEBERG_TABLE_FILES](../../sql-reference/functions/iceberg_table_files.md) | Information schema table function that returns information about the data files registered to an externally managed Apache Iceberg™ table. You can also view information about files associated with historical snapshots. |
| Table | [ICEBERG_TABLE_SNAPSHOT_REFRESH_HISTORY](../../sql-reference/functions/iceberg_table_snapshot_refresh_history.md) | Information schema table function that returns metadata and snapshot information about the most recent refresh history for a specified externally managed Apache Iceberg™ table. |

## Extensibility updates

### Support for a wildcard character in network rule network identifiers —– *Preview*

With this release, you can use an asterisk as a wildcard character when specifying a network identifier for a network rule in its VALUE_LIST
parameter.

For more information, see [CREATE NETWORK RULE](../../sql-reference/sql/create-network-rule.md).

## Data pipeline updates

### Dynamic tables: Maximum number of dynamic tables in an account increased to 10,000

With this release, your account can now hold a maximum of 10,000 dynamic tables. Previously, the limit was 4,000 dynamic tables in a single
account.

For more information, see [General limitations](../../user-guide/dynamic-tables-limitations.md).

## Data governance updates

### OBJECT_DEPENDENCIES view: Support for dynamic tables

With this release, we are pleased to announce that the OBJECT_DEPENDENCIES view in the ACCOUNT_USAGE schema now includes dependencies
involving dynamic tables. The dynamic table can be the referencing object or the referenced object.

For more information, see [OBJECT_DEPENDENCIES view](../../sql-reference/account-usage/object_dependencies.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 13-Dec-24 |
| *OBJECT_DEPENDENCIES view: Support for dynamic tables* | **Added** to *Data governance* section | 17-Dec-24 |

---
title: December 18, 2024 — Inbound private connectivity to Snowpark Container Services for accounts on AWS — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-18-spcs-aws-inbound-private-connectivity.md
section: Release Notes
---

# December 18, 2024 — Inbound private connectivity to Snowpark Container Services for accounts on AWS — *Preview*

With this release, we are pleased to announce a preview of support for inbound private connectivity to Snowpark Container Services for accounts on AWS.

For more information, see [Inbound connectivity](../../../developer-guide/snowpark-container-services/private-connectivity.md).

---
title: December 19, 2024 — New homepage for Snowsight —– General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-19-snowsight-homepage-ga.md
section: Release Notes
---

# December 19, 2024 — New homepage for Snowsight —– *General Availability*

With this release, we are pleased to announce the general availability of the Snowsight homepage.

The Snowsight homepage has been updated to include:

* New navigation menu - updated to include menu items for creating and managing data products, notebooks, worksheets, databases and all related artifacts.
* Updated search - updated to allow for easier discovery of content.
* Quick actions - quickly and easily perform operations specific to your current role. For example, examine tables or create worksheets to
  execute python code.
* Recently viewed tab, which includes recent operations and associated content.

For more information, see [Exploring the Snowsight user interface](../../../user-guide/ui-snowsight-homepage.md).

---
title: December 19, 2024 — Snowflake Native Apps with Azure Private Link support — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-19-na-az-gov-ga.md
section: Release Notes
---

# December 19, 2024 — Snowflake Native Apps with Azure Private Link support — *General Availability*

Snowflake is pleased to announce general availability of Azure Private Link support
in the Snowflake Native App Framework. Snowflake Native Apps can be deployed and operated with
Private Link connectivity in Azure, enabling secure network isolation.

For more information see [Understand limitations in the Snowflake Native App Framework](../../../developer-guide/native-apps/limitations.md).

---
title: December 19, 2024 — Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-12-19-notebooks-wh-aws-azure-pl.md
section: Release Notes
---

# December 19, 2024 — Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link — *Preview*

With this release, Snowflake Notebooks now supports AWS PrivateLink and Microsoft Azure Private Link.

Snowflake Notebooks is a development interface in Snowsight that offers an interactive, cell-based programming environment for Python and SQL. In Snowflake Notebooks, you can perform exploratory data analysis, develop machine learning models, and perform other data science and data engineering tasks all in one place.

For more information, see [Private connectivity for Notebooks](../../../user-guide/ui-snowsight/notebooks-privatelink.md).

---
title: December 20, 2023 — Snowpark Container Services Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/other/2023-12-20.md
section: Release Notes
---

# December 20, 2023 — Snowpark Container Services Release Notes

This document provides an introduction to the new feature Snowpark Container Services.

With this release, we are pleased to announce the preview of Snowpark Container Services.
Snowpark Container Services is a fully managed container offering that helps you easily deploy, manage, and scale containerized
applications without having to move data out of Snowflake.
Snowpark Container Services provides an OCI runtime execution environment for OCI images that is similar to Docker or Kubernetes.
As a fully managed service, Snowpark Container Services includes support for security, configuration, and operational best practices.

For more details, see [Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md).

---
title: Default value for the SYNC_PASSWORD parameter on SCIM security integrations has changed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1768.md
section: Release Notes
---

# Default value for the SYNC_PASSWORD parameter on SCIM security integrations has changed

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the default value of the SYNC_PASSWORD parameter on SCIM security integrations behaves as
follows:

Before the change:
:   If not set, SYNC_PASSWORD is set to TRUE.

After the change:
:   If not set, SYNC_PASSWORD is set to FALSE.

Ref: 1768

---
title: Default value of DEFAULT_SECONDARY_ROLES object property on users changed to (‘ALL’)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1692.md
section: Release Notes
---

# Default value of DEFAULT_SECONDARY_ROLES object property on users changed to (‘ALL’)

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

> **Attention:**
>
> Attention: This change will only be visible in the SNOWFLAKE.ACCOUNT_USAGE.USERS view in the next behavior change release.

This behavior change was originally introduced in the 2024_07 bundle. To give users additional time to evaluate the change, this
behavior change remains disabled in 2024_08.

This BCR affects all users. If a new or existing user has their DEFAULT_SECONDARY_ROLES object property unset, or set to NULL, then their
DEFAULT_SECONDARY_ROLES object property changes to `('ALL')`.

If a new or existing user has their DEFAULT_SECONDARY_ROLES object property explicitly set, then their DEFAULT_SECONDARY_ROLES object
property does not change.

Setting a user’s DEFAULT_SECONDARY_ROLES object property to `()` specifies that a user does not have secondary roles. If you want to
preserve the existing behavior of the DEFAULT_SECONDARY_ROLES object property in your account, you can use the following procedure to
explicitly set DEFAULT_SECONDARY_ROLES to an empty list:

```sqlexample-javascript
CREATE OR REPLACE PROCEDURE update_default_secondary_roles()
RETURNS VARIANT NOT NULL
LANGUAGE JAVASCRIPT
EXECUTE AS CALLER
AS
$$
let updated_users = [];
let users = snowflake.execute({sqlText: "SHOW USERS"});
while (users.next()) {
  let username = users.getColumnValue("name");
  let dsr = users.getColumnValue("default_secondary_roles");
  if (dsr !== "") {
    continue;
  }
  snowflake.execute({
    sqlText: "alter user identifier(?) set default_secondary_roles=()",
    binds: ["\"" + username + "\""],
  });
  updated_users.push(username);
}
return updated_users;
$$;

CALL update_default_secondary_roles();
```

For more information, see the
[community article](https://community.snowflake.com/s/article/default-secondary-roles-all-overview-and-additional-explanations)

Before the change:
:   The default value of the DEFAULT_SECONDARY_ROLES object property on users is NULL.

After the change:
:   The default value of the DEFAULT_SECONDARY_ROLES object property on users is (‘ALL’).

Ref: 1692

---
title: Default warehouse for Snowflake Notebooks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1887.md
section: Release Notes
---

# Default warehouse for Snowflake Notebooks

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

With this behavior change, Snowflake Notebooks users can now specify separate warehouses for running the notebook kernel (Python code) and
executing SQL queries in the notebook. The notebook warehouse remains active for the duration of the notebook session, while the SQL warehouse
is utilized only on demand.

This separation enables users to assign a smaller warehouse for the kernel while optionally assigning a larger warehouse to handle heavy SQL
queries. When creating a notebook, users can choose to specify two separate warehouses or use a single warehouse for running both the Notebook
kernel and any pushdown SQL.

By default, the notebook warehouse is set to SYSTEM$STREAMLIT_NOTEBOOK_WH. However, users can specify a different warehouse at the time of
notebook creation by choosing one from the dropdown list. After notebook creation, users can choose a different warehouse from the notebook
settings.

Before the change:

> ```sqlsyntax
> CREATE [ OR REPLACE ] NOTEBOOK [ IF NOT EXISTS ] <name>
>   [ FROM '<source_location>' ]
>   [ MAIN_FILE = '<main_file_name>' ]
>   [ COMMENT = '<string_literal>' ]
>   [ QUERY_WAREHOUSE = <warehouse_to_run_nb_and_sql_queries_in> ]
>   [ IDLE_AUTO_SHUTDOWN_TIME_SECONDS = <number_of_seconds> ]
> ```

After the change:

> ```sqlsyntax
> CREATE [ OR REPLACE ] NOTEBOOK [ IF NOT EXISTS ] <name>
>   WAREHOUSE = <notebook_kernel_warehouse_name>
>   [ FROM '<source_location>' ]
>   [ MAIN_FILE = '<main_file_name>' ]
>   [ COMMENT = '<string_literal>' ]
>   [ QUERY_WAREHOUSE = <warehouse_to_run_sql_queries> ]
>   [ IDLE_AUTO_SHUTDOWN_TIME_SECONDS = <number_of_seconds> ]
> ```

A new parameter, WAREHOUSE, has been introduced as a required parameter to specify the warehouse used to run the Notebook kernel and Python
code. If this parameter is not explicitly set, it defaults to the value of the schema-level parameter DEFAULT_STREAMLIT_NOTEBOOK_WAREHOUSE,
which determines the default warehouse to be used.

For details, see [Default warehouse for notebooks](../../../user-guide/warehouses-overview.md).

Ref: 1887

---
title: Defaulting accounts from Worksheets to Workspaces
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2117.md
section: Release Notes
---

# Defaulting accounts from Worksheets to Workspaces

Starting in September 2025, Snowflake will gradually default accounts from Worksheets to Workspaces.

| Cohort | Timing of change |
| --- | --- |
| New organizations | Currently available to some new customers |
| On Demand accounts | Gradual rollout begins in September |
| Organizations with only Standard Edition accounts | Gradual rollout begins in September |
| Organizations with only Enterprise Edition and below accounts | Gradual rollout begins in late October |
| Organizations with only Business Critical Edition and below accounts | Gradual rollout begins in November |
| Organizations with Virtual Private Snowflake (VPS) and below accounts | Gradual rollout begins in January 2026 |

> **Note:**
>
> Classic Console users will be upgraded to Workspaces starting in September 2025.

Before the change:
:   Worksheets is the default SQL editing experience in Snowflake.

After the change:
:   Workspaces is the default SQL editing experience in Snowflake. Opening a worksheet will open it in the Workspaces editor, and navigation
    options to Worksheets will be replaced with Workspaces. For now, users can still navigate back to the original Worksheets editor from within
    Workspaces or revert the default themselves.

To set Workspaces as the account-wide default editor for all users from Snowsight, follow these steps:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md) as ACCOUNTADMIN.
2. In the lower-left corner, select your name » Settings.
3. Under Account, choose General.
4. Enable the Set Workspaces as default SQL editor for the account option.

   Administrators can revert to Worksheets as the default editor by disabling this option. If users want to revert to Worksheets, they can also
   select Go to Worksheets from the Workspaces UI:

   Or toggle the user setting in the Workspaces editor:

Administrators have several options for managing this transition by setting the `USE_WORKSPACES_FOR_SQL` parameter.

To set the account-wide default editor to be Workspaces for all users:

```sqlexample
ALTER ACCOUNT SET USE_WORKSPACES_FOR_SQL = 'always';
```

To revert this setting and use the previous default editor, but respect any Snowflake-managed BCR that makes Workspaces the default:

```sqlexample
ALTER ACCOUNT UNSET USE_WORKSPACES_FOR_SQL;
```

To revert to the previous editor and temporarily ignore any Snowflake-managed BCR that makes Workspaces the default:

```sqlexample
ALTER ACCOUNT SET USE_WORKSPACES_FOR_SQL = 'never';
```

> **Note:**
>
> Worksheets will eventually become deprecated and the command above will no longer work. If you had previously set this parameter, it will
> be automatically cleared once Worksheets is deprecated. Snowflake will provide advance notice when a deprecation date is available.

For more information on Workspaces, see [Workspaces](../../../user-guide/ui-snowsight/workspaces.md).

Ref: 2117, 2075

---
title: Deprecate previous syntax for working with SQL classes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1829.md
section: Release Notes
---

# Deprecate previous syntax for working with SQL classes

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the previous syntax used with SQL classes will
no longer work.

Before this change:
:   The prior, deprecated syntax for working with SQL classes is still supported. For example:

    ```sqlexample
    CREATE INSTANCE INST OF CLASS test_class();
    SHOW INSTANCES OF CLASS test_class;
    ```

After this change:
:   The prior, deprecated syntax for working with SQL classes will no longer work.

    Use the newer “native” syntax instead, for example:

    ```sqlexample
    CREATE TEST_CLASS inst();
    SHOW TEST_CLASS instances;
    SHOW test_class;
    ```

Ref: 1829

---
title: Deprecation of Legacy Worksheets and Dashboards
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2260.md
section: Release Notes
---

# Deprecation of Legacy Worksheets and Dashboards

Snowflake is deprecating two legacy surfaces in Snowsight: **Legacy Worksheets** and
**Legacy Dashboards**.

* **Legacy Worksheets** are replaced by [Workspaces](../../../user-guide/ui-snowsight/workspaces.md), the modern SQL
  editing experience that supports file-and-folder organization, sharing, and Git integration.
* **Legacy Dashboards** are retired, with migration paths to Streamlit apps or third-party BI tools.

> **Important:**
>
> On **June 22, 2026**, Legacy Worksheets and Dashboards will be permanently removed from
> Snowsight. Migrate your worksheets and dashboards before this date.

## Timeline

| Date | Worksheets | Dashboards |
| --- | --- | --- |
| **April 20, 2026** | Workspaces becomes universal and the default editor for all accounts. Workspaces can no longer be disabled. Account administrators can still temporarily revert to Legacy Worksheets as the default editor until June 22. Self-service worksheet migration tooling available.  Starting June 1, creation of new Legacy Worksheets is disabled. Users receive in-product announcements with reminders to migrate before June 22. | Creation of new dashboards is disabled across all accounts. |
| **June 22, 2026** | Legacy Worksheets UI fully removed from Snowsight. Remaining worksheets are automatically migrated to Workspaces files. | Legacy Dashboards UI fully removed from Snowsight. Dashboards are no longer accessible. |

## Legacy Worksheets

Legacy Worksheets are replaced by [Workspaces](../../../user-guide/ui-snowsight/workspaces.md), a modern SQL editing
experience that supports file-and-folder organization, sharing, and Git integration. Most accounts are already
using Workspaces as their default editor.

For the previous behavior change that defaulted accounts to Workspaces, see
[Defaulting accounts from Worksheets to Workspaces](bcr-2117.md).

Before the change:
:   Legacy Worksheets is available as a SQL editing experience in Snowsight. Account administrators and
    individual users can revert from Workspaces to Legacy Worksheets as the default editor using the
    `USE_WORKSPACES_FOR_SQL` parameter or the corresponding controls in the Snowsight settings.

After the change:
:   Legacy Worksheets is removed from Snowsight.

    * **April 20, 2026**: Workspaces can no longer be disabled. The feature flag used to disable Workspaces
      (documented in [Disable Workspaces](../../../user-guide/ui-snowsight/workspaces.md)) is ignored. All remaining accounts
      are defaulted to Workspaces as the default SQL editor.

      Account administrators can still temporarily revert the default editor to Legacy Worksheets using the
      `USE_WORKSPACES_FOR_SQL` parameter:

      ```sqlexample
      ALTER ACCOUNT SET USE_WORKSPACES_FOR_SQL = 'never';
      ```

      This revert capability is available until June 22, 2026.
    * **June 22, 2026**: The Legacy Worksheets UI is permanently removed. All entry points to Legacy Worksheets
      are removed from Snowsight. Setting `USE_WORKSPACES_FOR_SQL` to `'never'` is no longer
      supported. Any remaining legacy worksheets are automatically migrated to Workspaces files.

### Shared worksheets

Worksheets previously shared with other users remain accessible to all shared users from the Legacy Worksheets
section in the Workspaces sidebar.

When worksheets are automatically migrated to Workspaces files, a copy is created for each user who had access.
The shared status is not preserved after migration.

For ongoing collaboration, use [shared workspaces](../../../user-guide/ui-snowsight/workspaces-shared.md), which
allow teams to collaborate in dedicated spaces with role-based access control and a wiki-style draft and
publish model. Shared workspaces are the recommended replacement for shared worksheets.

## Legacy Dashboards

Snowsight Dashboards are retired. Snowflake provides a Dashboard-to-Streamlit conversion tool to migrate
existing dashboards to Streamlit apps running natively on Snowflake. You can also migrate to any
third-party BI or visualization tool.

Before the change:
:   Snowsight Dashboards are available in Snowsight for creating and viewing dashboards.

After the change:
:   Snowsight Dashboards are removed from Snowsight.

    * **April 20, 2026**: Creation of new dashboards is disabled across all accounts. Any action that would
      create a new dashboard is blocked.
    * **June 22, 2026**: Dashboards are entirely removed from Snowsight. Existing dashboards are
      no longer accessible after this date.

### Dashboard migration options

* **Streamlit**: Open any existing dashboard and select Generate Streamlit app to convert it to a
  Streamlit app running natively on Snowflake.
* **Third-party BI tools**: Migrate to any BI or visualization tool of your choice.

> **Important:**
>
> Migrate your dashboards before **June 22, 2026**. After this date, dashboards are no longer accessible.

## Actions required

### For Legacy Worksheets

1. **Verify Workspaces access**: Ensure your users can access
   [Workspaces](../../../user-guide/ui-snowsight/workspaces.md) and that it functions as expected for
   your workflows.
2. **Review existing worksheets**: Open the **Legacy Worksheets** section in the Workspaces sidebar to
   verify that your worksheets are accessible.
3. **Migrate worksheets to Workspaces files**: Before June 22, 2026, convert your worksheets to
   Workspaces files by dragging them into your workspace, or use the self-service bulk migration tool
   when it becomes available.
4. **Update automation or bookmarks**: If you have bookmarks, automation, or documentation that
   references Legacy Worksheets URLs or entry points, update them to use Workspaces.

### For Snowsight Dashboards

1. **Migrate dashboards before June 22, 2026**: After this date, dashboards are no longer accessible.
2. **Use the conversion tool**: Open any dashboard and select Generate Streamlit app to convert it
   to a Streamlit app.
3. **Consider alternative tools such as Snowflake Intelligence**: You can also migrate to third-party BI or visualization tools.

## Additional notes

* **Temporary revert controls**: Between April 20 and June 22, 2026, account administrators can
  temporarily revert the default editor to Legacy Worksheets using the `USE_WORKSPACES_FOR_SQL`
  parameter. Individual users can also revert their own default. These controls are removed on June 22,
  2026. For details on these controls, see
  [Defaulting accounts from Worksheets to Workspaces](bcr-2117.md).

## Change log

| Update | Date |
| --- | --- |
| Initial publication | 23-Mar-26 |

Ref: 2260

---
title: Deprecation of the SNOWFLAKE user
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1976.md
section: Release Notes
---

# Deprecation of the `SNOWFLAKE` user

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

The deprecation of the `SNOWFLAKE` user behaves as follows:

Before the change:
:   A system-defined user called `SNOWFLAKE` exists in every account. This `SNOWFLAKE` user has the SNOWFLAKE_SUPPORT property set to
    TRUE.

After the change:
:   New accounts no longer have a `SNOWFLAKE` user automatically added to the account. The SNOWFLAKE_SUPPORT property is set to FALSE for
    all users.

When this change becomes Generally Enabled, Snowflake will transfer ownership of all existing `SNOWFLAKE` users to the ACCOUNTADMIN
role. At that point, you can drop the `SNOWFLAKE` user from your account at your convenience.

Ref: 1976

---
title: Deprecation of worksheet results sharing and secondary roles in dashboards
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1801.md
section: Release Notes
---

# Deprecation of worksheet results sharing and secondary roles in dashboards

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, worksheet results sharing will be deprecated and transition to code-only. Recipients will only
be able to view the worksheet code, not the query results. In addition, dashboards can no longer be run with secondary roles.

> **Note:**
>
> If an account enables this bundle, and then later disables the bundle, users of that account will return to the pre-BCR behavior where
> cached results are again displayed. This could result in the following unexpected behavior:
>
> 1. Account enables the bundle.
> 2. User 1 shares a worksheet, believing they are only sharing code, with User 2.
> 3. Account disables the bundle.
> 4. User 2 can now see results of the shared worksheet, which could include sensitive data.

## Prepare for the change in dashboards

Snowflake recommends testing your dashboards without secondary roles to ensure queries function correctly. To test the dashboards, run
[USE SECONDARY ROLES](../../../sql-reference/sql/use-secondary-roles.md) `'NONE'` for each tile, followed by executing the main query.
Alternatively, you can request the administrator to temporarily set your user’s DEFAULT_SECONDARY_ROLES to `'NULL'` during the testing
process.

If the queries fail, you may need to rebuild the dashboard:

* A dashboard owner or editor can split the dashboard into multiple dashboards, running in different roles, to gather the required data.
* An administrator can create a new role that is a superset of the necessary permissions and grant this to the dashboard users as their
  primary role.
* For any broken dashboard tiles, the administrator can grant the primary role access to the necessary source objects to restore dashboard
  functionality.

For more information, see the following Knowledge Base articles:

* [Changes to worksheet sharing](https://community.snowflake.com/s/article/BCR-1801-Changes-to-Worksheet-sharing---Overview-and-additional-explanations)
* [Changes to secondary roles in dashboards](https://community.snowflake.com/s/article/BCR-1801-Changes-to-Secondary-Roles-in-Dashboards-Overview-and-additional-explanations)

Ref: 1801

---
title: DESC COMPUTE POOL command: New columns in output and deprecation of SYSTEM$GET_COMPUTE_POOL_STATUS
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1594.md
section: Release Notes
---

# DESC COMPUTE POOL command: New columns in output and deprecation of SYSTEM$GET_COMPUTE_POOL_STATUS

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

## DESC COMPUTE POOL command: New columns in output

When this behavior change bundle is enabled, the output of the [DESCRIBE COMPUTE POOL](../../../sql-reference/sql/desc-compute-pool.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| ERROR_CODE | Error code, if any, relevant to the STATUS_MESSAGE. Otherwise, this field is empty.  For example, when you resize a compute pool:   * If Snowflake encounters a capacity error (new nodes can’t be provisioned), Snowflake returns the error code 392507.  Note that the capacity error indicates the instance type you requested for your compute pool node is currently not available with the cloud provider. You can either wait for the capacity to become available or choose another instance type. * If you have pending services (including job services) and Snowflake cannot scale up your compute pool, Snowflake returns the error code 392508. |
| STATUS_MESSAGE | Optional message about the status of the compute pool. For example:   * After creating a compute pool, if you run the DESC COMPUTE POOL command, the output might include the status message: “Compute pool is starting for last 1 minute” * If Snowflake encounters a capacity error when provisioning a node, the output might include the status message: “Compute pool is starting for the last 3 minutes. We have observed CAPACITY_ERROR.” * If you have pending services (including job services) and Snowflake can’t scale up your compute pool, the output might include the status message: “Compute Pool has reached the maximum node limit. Consider increasing `max_nodes` using the ALTER COMPUTE POOL command.” |

## Deprecation of the SYSTEM$GET_COMPUTE_POOL_STATUS function

The SYSTEM$GET_COMPUTE_POOL_STATUS function returns a JSON object that contains the same information that is in the two new columns (the compute pool status and a message relevant to the status.). Because the output of the DESC COMPUTE POOL command provides this information, the SYSTEM$GET_COMPUTE_POOL_STATUS function will be deprecated in the near future.

Ref: 1594

---
title: DESC FUNCTION command: New column IS_AGGREGATE in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1609.md
section: Release Notes
---

# DESC FUNCTION command: New column IS_AGGREGATE in output

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the output of the [DESC FUNCTION](../../../sql-reference/sql/desc-function.md) command includes
the following new column(s) when the function has a Python handler:

| Column name | Data type | Description |
| --- | --- | --- |
| IS_AGGREGATE | Boolean | TRUE if the function is a user-defined aggregate function (UDAF); otherwise, FALSE. |

Ref: 1609

---
title: DESC SECRET command: Add INTEGRATION_NAME column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1494.md
section: Release Notes
---

# DESC SECRET command: Add INTEGRATION_NAME column

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of the [DESCRIBE SECRET](../../../sql-reference/sql/desc-secret.md) command is as follows:

Before the change:
:   The output of the command did not include the `integration_name` column

After the change:
:   The output of the command includes the `integration_name` column as the last column in the output.

    | Column name | Data type | Description |
    | --- | --- | --- |
    | `integration_name` | VARCHAR | Specifies the name of the External API Authentication integration that references the secret or NULL if the integration does not specify a secret. |

Ref: 1494

---
title: DESC SERVICE and SHOW SERVICES command: New column(s) in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1849-1867.md
section: Release Notes
---

# DESC SERVICE and SHOW SERVICES command: New column(s) in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) and [SHOW SERVICES](../../../sql-reference/sql/show-services.md) commands include the following new columns:

| Column name | Description |
| --- | --- |
| SUSPENDED_ON | Timestamp when the service was last suspended. SUSPENDED_ON is set when Snowflake suspends a service and remains unchanged even after the service is resumed. If SUSPENDED_ON is NULL, the service was never suspended. |
| AUTO_SUSPEND_SECS | Number of seconds of inactivity after which Snowflake automatically suspends the service. If AUTO_SUSPEND_SECS is set to 0 or never set, Snowflake does not automatically suspend the service. |
| IS_ASYNC_JOB | If TRUE, the job service is running asynchronously. By default, Snowflake executes the job services synchronously.  This column is included in the output of the DESC SERVICE, SHOW SERVICES, and SHOW JOB SERVICES commands but not in the output of the SHOW SERVICES EXCLUDING JOBS command. |

> **Note:**
>
> * The SUSPENDED_ON and AUTO_SUSPEND_SECS columns appear after the existing RESUMED_ON column.
> * The IS_ASYNC_JOB column appears after the existing IS_JOB column.

Ref: 1849, 1867

---
title: DESC SHARE command: Specify “DATABASE ROLE” in the KIND column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1285.md
section: Release Notes
---

# DESC SHARE command: Specify “DATABASE ROLE” in the KIND column

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

The [DESCRIBE SHARE](../../../sql-reference/sql/desc-share.md) command behaves as follows:

Before the change:
:   When a database role is granted to a share and you run a DESC SHARE command on the share, the KIND column specifies `ROLE` for the
    database role.

After the change:
:   When a database role is granted to a share and you run a DESC SHARE command on the share, the KIND column specifies `DATABASE ROLE` for
    the database role.

Ref: 1285

---
title: DESC TABLE command, SHOW COLUMNS command, and COLUMNS views: Add new SchemaEvolutionRecord column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1377.md
section: Release Notes
---

# DESC TABLE command, SHOW COLUMNS command, and COLUMNS views: Add new SchemaEvolutionRecord column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

When this bundle is enabled, a new SchemaEvolutionRecord column is added to the output of the following commands and views:

* [DESC TABLE command](../../../sql-reference/sql/desc-table.md)
* [SHOW COLUMNS command](../../../sql-reference/sql/show-columns.md)
* [COLUMNS View (Information Schema)](../../../sql-reference/info-schema/columns.md)
* [COLUMNS View (Account Usage)](../../../sql-reference/account-usage/columns.md)

Before the change:
:   The output of the aforementioned commands and views does not have the SchemaEvolutionRecord column.

After the change:
:   The output of the aforementioned commands and views adds a new SchemaEvolutionRecord column.

    The DESC TABLE command displays the SchemaEvolutionRecord column for tables that have [Table Schema Evolution](../../../user-guide/data-load-schema-evolution.md) enabled (that is, the ENABLE_SCHEMA_EVOLUTION parameter is set to TRUE). In the case that no evolutions have occurred for the table, the column shows all NULLs.

    The SHOW COLUMNS command and COLUMNS views (Information_schema and Account_usage) always display the SchemaEvolutionRecord column. In the case that no tables have schema evolution enabled or no evolutions have occurred, the column shows all NULLs.

    This new column will be set to NULL when the user manually modifies the table column after an evolution has occurred. The record will be reinstated if another schema evolution occurs on the column.

    | Column name | Description |
    | --- | --- |
    | SchemaEvolutionRecord | Records information about the latest triggered Schema Evolution for a given table column. This column contains the following subfields:   * EvolutionType: The type of the triggered schema evolution (ADD_COLUMN or DROP_NOT_NULL). * EvolutionMode: The triggering ingestion mechanism (COPY or SNOWPIPE). * FileName: The file name that triggered the evolution. * TriggeringTime: The approximate time when the column was evolved. * QueryId or PipeID: A unique identifier of the triggering query or pipe (QUERY ID for COPY or PIPE ID for SNOWPIPE). |

Ref: 1377

---
title: DESCRIBE and SHOW commands for Apache Iceberg™ tables: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2210.md
section: Release Notes
---

# DESCRIBE and SHOW commands for Apache Iceberg™ tables: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the following commands includes the following new column:

* [DESCRIBE ICEBERG TABLE](../../../sql-reference/sql/desc-iceberg-table.md) command
* [DESCRIBE TABLE](../../../sql-reference/sql/desc-table.md) command
* [DESCRIBE DYNAMIC TABLE](../../../sql-reference/sql/desc-dynamic-table.md) command
* [SHOW COLUMNS](../../../sql-reference/sql/show-columns.md) command

| Column name | Data type | Description |
| --- | --- | --- |
| write default | string | The write default for the column. This column is appended to the end of the output.  **Note:** For the DESCRIBE DYNAMIC TABLE command, when you base a dynamic table on an Iceberg table that has a write default defined for a column, the dynamic table doesn’t inherit the default value. However, the write default column returns in the output for the DESCRIBE DYNAMIC TABLE. command. |

When this behavior change bundle is enabled, the output of the [SHOW ICEBERG TABLES](../../../sql-reference/sql/show-iceberg-tables.md) command includes the following
new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| iceberg_table_format_version | integer | The version of the Apache IcebergTM table specification that the table conforms to. This column is appended to the end of the output. |

Ref: 2210

---
title: DESCRIBE AVAILABLE LISTING and DESCRIBE LISTING commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1962.md
section: Release Notes
---

# DESCRIBE AVAILABLE LISTING and DESCRIBE LISTING commands: New column in output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the
[DESCRIBE AVAILABLE LISTING](../../../sql-reference/sql/desc-available-listing.md) and [DESCRIBE LISTING](../../../sql-reference/sql/desc-listing.md)
commands will contain the following new column:

| Column name | Description |
| --- | --- |
| REQUEST_APPROVAL_TYPE | Displays the organization listing access request type. The access request type defines how discovery targets of a listing submit access requests to the listing approver.  Returned values include `NULL`, `REQUEST_AND_APPROVE_IN_SNOWFLAKE`, and `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE`. `REQUEST_AND_APPROVE_IN_SNOWFLAKE` indicates access requests are submitted and approved within the Snowflake environment. `REQUEST_AND_APPROVE_OUTSIDE_SNOWFLAKE` indicates the provider manages access request submissions and approvals independently. The value for external listings is always `NULL`. |

Ref: 1962

---
title: DESCRIBE ICEBERG TABLE command: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1554.md
section: Release Notes
---

# DESCRIBE ICEBERG TABLE command: New column

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The [DESCRIBE ICEBERG TABLE](../../../sql-reference/sql/desc-iceberg-table.md) command behaves as follows:

Before the change:
:   The command output does not contain the `source iceberg type` column.

After the change:
:   The command output contains the `source iceberg type` column. The new
    column appears immediately after the `type` column.

    For the Snowflake data type that is used to process and return table data, see the `type` column.

Ref: 1554

---
title: DESCRIBE ICEBERG TABLE command: SOURCE ICEBERG TYPE column value change
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1638.md
section: Release Notes
---

# DESCRIBE ICEBERG TABLE command: SOURCE ICEBERG TYPE column value change

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

The [DESCRIBE ICEBERG TABLE](../../../sql-reference/sql/desc-iceberg-table.md) command returns the following in the SOURCE ICEBERG TYPE column:

Before the change:
:   For some table columns, the command output displays the Snowflake data type or NULL instead of the expected Iceberg data type.

After the change:
:   For all table columns, the command output contains the source Iceberg data type.

Ref: 1638

---
title: DESCRIBE LISTING and DESCRIBE AVAILABLE LISTING commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2258.md
section: Release Notes
---

# DESCRIBE LISTING and DESCRIBE AVAILABLE LISTING commands: New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE LISTING](../../../sql-reference/sql/desc-listing.md) and [DESCRIBE AVAILABLE LISTING](../../../sql-reference/sql/desc-available-listing.md) commands include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `resharing` | VARCHAR | This column indicates whether providers and consumers can reshare a listing. The output is one of:   * `{"enabled": true}` * `{"enabled": false}` |

Ref: 2258

---
title: DESCRIBE LISTING command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1979.md
section: Release Notes
---

# DESCRIBE LISTING command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE LISTING](../../../sql-reference/sql/desc-listing.md) command includes the following new column.

| Column name | Description |
| --- | --- |
| `legacy_uniform_listing_locator` | String. Specifies the legacy Uniform Listing Locator (ULL).  If an existing organizational listing profile is updated to use a custom organization profile this column will include the ULL associated with the previous default profile which continues to be valid.  If no profile updates have been made, this column is `NULL`.  For more information about ULLs, see [Configure organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-configure.md). |

Ref: 1979

---
title: DESCRIBE LISTING command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1891.md
section: Release Notes
---

# DESCRIBE LISTING command: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE LISTING](../../../sql-reference/sql/desc-listing.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| LIVE_VERSION_URI | String. Full URI of live version of the listing, against which stage operations can be performed. NULL if no live version exists for the listing. |
| LAST_COMMITTED_VERSION_URI | String. Full URI of the last committed version of the listing. |
| LAST_COMMITTED_VERSION_NAME | String. System-generated name for the last committed version of the listing. |
| LAST_COMMITTED_VERSION_ALIAS | String. User-specified alias for the last committed version of the listing. |
| PUBLISHED_VERSION_URI | String. Full URI of the current published version of the listing. |
| PUBLISHED_VERSION_NAME | String. System-generated name of the published version of the listing. |
| PUBLISHED_VERSION_ALIAS | String. User-specified alias for the last published version of the listing. |
| IS_SHARE | Boolean. Indicates whether the listing was created based on a share. Either TRUE or FALSE. |

Ref: 1891

---
title: DESCRIBE ORGANIZATION PROFILE: New column, updated_on, in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2089.md
section: Release Notes
---

# DESCRIBE ORGANIZATION PROFILE: New column, `updated_on`, in output

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE AVAILABLE ORGANIZATION PROFILE](../../../sql-reference/sql/desc-available-organization-profile.md) command includes a new column, `updated_on`, in the result set.

| Column name | Description |
| --- | --- |
| `updated_on` | The date and time when the organization profile was last updated. |

Ref: 2089

---
title: DESCRIBE SECRET command: key length for symmetric keys
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-2000.md
section: Release Notes
---

# DESCRIBE SECRET command: key length for symmetric keys

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

The `key_length` column in the output of the [DESCRIBE SECRET](../../../sql-reference/sql/desc-secret.md) command is changing as follows:

Before the change:
:   The `key_length` value for symmetric keys is shown in bytes.

After the change:
:   The `key_length` value for symmetric keys is shown in bits. For example, if the value before the change was shown as `32`, it is now shown as `256`.

    This change does not affect other secret types, only symmetric keys.

Ref: 2000

---
title: DESCRIBE SECRET command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1890.md
section: Release Notes
---

# DESCRIBE SECRET command: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE SECRET](../../../sql-reference/sql/desc-secret.md) command and
[SECRETS view](../../../sql-reference/account-usage/secrets.md) include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| algorithm | string | The algorithm used to generate the secret. |
| key_length | number | The length of the key used to generate the secret. |

Ref: BCR-1890

---
title: DESCRIBE SESSION POLICY command: convert output from row-oriented to column-oriented
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1985.md
section: Release Notes
---

# DESCRIBE SESSION POLICY command: convert output from row-oriented to column-oriented

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

To provide consistency with DESC [SESSION, AUTHENTICATION, PASSWORD] POLICY, and enable future expansion of [DESCRIBE SESSION POLICY](../../../sql-reference/sql/desc-session-policy.md) without
breaking changes, the DESC SESSION POLICY and [USE SECONDARY ROLES](../../../sql-reference/sql/use-secondary-roles.md) commands behave as follows:

Before the change:
:   DESC SESSION POLICY returns a single row, with one column per property.

    USE SECONDARY ROLES fails if one of the requested roles is disallowed by session policy or is not granted to the user.

After the change:
:   DESC SESSION POLICY returns one row per property. This change aligns how Snowflake describes session policies with how Snowflake describes
    authentication and password policies.

    USE SECONDARY ROLES fails only if the user is not granted one of the requested roles.

Ref: 1985

---
title: DESCRIBE SESSION POLICY: Add allowed_secondary_roles column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1621.md
section: Release Notes
---

# DESCRIBE SESSION POLICY: Add allowed_secondary_roles column

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The behavior of the DESC SESSION POLICY command is as follows:

Before the change:
:   The output of the command does not include the `allowed_secondary_roles` column.

After the change:
:   The output of the command includes the `allowed_secondary_roles` column, which is placed between the
    `session_ui_idle_timeout_mins` and `comment` columns. This column is a placeholder for future functionality.

Ref: 1621

---
title: DESCRIBE TABLE: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1350.md
section: Release Notes
---

# DESCRIBE TABLE: New column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The output of the DESCRIBE TABLE command is as follows:

Before the change:
:   The DESCRIBE TABLE output does not contain the PRIVACY_DOMAIN column.

After the change:
:   The last column of the DESCRIBE TABLE output is the PRIVACY_DOMAIN column, which is reserved for future use.

Ref: 1350

---
title: Disable external OAuth session closure
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1991.md
section: Release Notes
---

# Disable external OAuth session closure

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

To provide consistency with other security integrations, BCR-1991 changes the behavior of external OAuth security integrations, as follows:

Before the change:
:   Sessions that were created with an external OAuth security integration close when you take the following actions:

    * Dropping the integration.
    * Disabling the integration by setting ENABLED=FALSE.
    * Changing the EXTERNAL_OAUTH_ANY_ROLE_MODE property.
    * Removing roles form the EXTERNAL_OAUTH_ALLOWED_ROLES_LIST property.
    * Adding roles to the EXTERNAL_OAUTH_BLOCKED_ROLES_LIST property.

After the change:
:   The actions in the preceding list do not close related sessions created using that integration.

Ref: 1991

---
title: DIV0 and DIV0NULL: Change to results exceeding the output scale
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1400.md
section: Release Notes
---

# DIV0 and DIV0NULL: Change to results exceeding the output scale

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

For the [DIV0](../../../sql-reference/functions/div0.md) and [DIV0NULL](../../../sql-reference/functions/div0null.md) functions, when the result of the
division operation exceeds the output scale:

Before the change:
:   DIV0 and DIV0NULL truncate the output.

    This is inconsistent with the results from using the division operator (`/`).

After the change:
:   DIV0 and DIV0NULL round the
    [output half away from zero](https://en.wikipedia.org/wiki/Rounding#Rounding_half_away_from_zero).

    This is consistent with the results from using the division operator (`/`).

For example, suppose that you are dividing 5 by 9:

```sqlexample
SELECT DIV0(5, 9), DIV0NULL(5, 9), 5/9;
```

Before the change:
:   DIV0 and DIV0NULL truncate the result to 0.555555, while the division operator rounds up the result to 0.555556.

    ```output
    +------------+----------------+----------+
    | DIV0(5, 9) | DIV0NULL(5, 9) |      5/9 |
    |------------+----------------+----------|
    |   0.555555 |       0.555555 | 0.555556 |
    +------------+----------------+----------+
    ```

After the change:
:   DIV0, DIV0NULL, and the division operator round up the result to 0.555556.

    ```output
    +------------+----------------+----------+
    | DIV0(5, 9) | DIV0NULL(5, 9) |      5/9 |
    |------------+----------------+----------|
    |   0.555556 |       0.555556 | 0.555556 |
    +------------+----------------+----------+
    ```

In addition, passing in more than two arguments to the DIV0 and DIV0NULL functions results in an exception.

Before the change:
:   Although the DIV0 and DIV0NULL functions only support two arguments, you can pass in additional arguments.

    The functions ignore the additional arguments.

After the change:
:   Passing more than two arguments to the DIV0 and DIV0NULL functions results in the following exception:

    ```output
    000939 (22023): SQL compilation error: ...
      too many arguments for function [DIV0] expected 2, got 3
    ```

Ref: 1400

---
title: DML and CTAS commands: Potential for wrong results when the RELY property is set
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1902.md
section: Release Notes
---

# DML and CTAS commands: Potential for wrong results when the RELY property is set

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, an optimization on certain [DML](../../../sql-reference/sql-dml.md) and
[CTAS statements](../../../sql-reference/sql/create-table.md) might produce wrong results when the
[RELY property](../../../sql-reference/sql/create-table-constraint.md) is set on the target table:

Before the change:
:   DML and CTAS statements on tables with [UNIQUE and PRIMARY KEY constraints](../../../sql-reference/constraints-overview.md)
    do not qualify for an optimization that is applied to DISTINCT and GROUP BY operations. (The optimizer prunes redundant grouping
    columns.)

    Because the optimization in question is currently applied to SELECT statements only, the optimization presents no risk that
    incorrect data might be inserted if the RELY property is set for a constraint and a violation of referential integrity occurs.

After the change:
:   DML and CTAS statements on tables with UNIQUE and PRIMARY KEY constraints now qualify for the same optimization that is
    applied to SELECT statements.

    If the RELY property is set for a constraint and a violation of referential integrity occurs, incorrect data might be inserted.

## Background information

Snowflake does not enforce [referential integrity](../../../user-guide/table-considerations.md) on standard
tables, regardless of the constraints and properties that you define. (The exception to this rule is
[hybrid tables](../../../user-guide/tables-hybrid.md), where certain constraints are required and enforced.)

The RELY constraint property declares that you believe your table data has referential integrity (or that another application is
enforcing it before the data is loaded into Snowflake). The RELY property must be set explicitly and is not enabled by default.
It is already documented that setting this property might lead to unintended behavior and/or unexpected results.

## Identifying tables with the RELY constraint property

To find out which tables have the RELY property set for PRIMARY KEY or UNIQUE constraints, run this query:

```sqlexample
SELECT table_schema, table_name
  FROM INFORMATION_SCHEMA.TABLE_CONSTRAINTS
  WHERE RELY ='YES'
    AND (constraint_type = 'PRIMARY KEY' OR constraint_type = 'UNIQUE');
```

Consider dropping constraints or not setting the RELY property if you anticipate any issues when the
optimization takes effect.

Ref: 1902

---
title: Document AI decommission
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2156.md
section: Release Notes
---

# Document AI decommission

Snowflake is decommissioning the Document AI UI experience and
the `<model_build_name>!PREDICT` method, and transitioning to the [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md)
function for document extraction, which is a next-generation document extraction solution powered by the `arctic-extract` model. This model offers better
scalability and accuracy, faster inference, and enhanced features such as longer output token limits and the ability to extract entities, lists, and tables
in a single API call.

> **Important:**
>
> The Document AI UI and the `<model_build_name>!PREDICT` method were decommissioned on **March 16th, 2026**.
> After the decommission you no longer have access to your existing model builds in the Document AI UI. The Document AI UI and the
> `<model_build_name>!PREDICT` method are not accessible after the decommission date, and existing Document AI models that
> were not migrated to the Snowflake Model Registry are not available for inference after the decommission date.

## Behavior change

### Before the change

The workflow for extracting data from documents is as follows:

1. Create a model build.

   You create model builds in a dedicated Document AI UI, where you also upload documents, define values for extraction, and verify the answers that the model provides.
2. Optional: Fine-tune the model.

   You can fine-tune the model in the Document AI UI if the results provided by the Snowflake Arctic-TILT model are not satisfactory.
3. Run inference.

   You use the `<model_build_name>!PREDICT` method and the model build created in the Document AI UI to extract information from documents.

> **Note:**
>
> You can view the Document AI models in the Document AI UI and in the Snowflake Model Registry.

### After the change

The workflow for extracting data from documents is as follows:

* You use the [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md) function to define values for extraction and extract information from documents. You no longer create model builds in the Document AI UI.

Previously, you had to complete a three-step workflow that involved creating model builds, but now this process is simplified to a single step which is using the
[AI_EXTRACT](../../../sql-reference/functions/ai_extract.md) function.

> **Note:**
>
> The models that were previously created and/or fine-tuned in the Document AI UI can be viewed in the Snowflake Model Registry if you migrated the models (see Actions required).
>
> You can use the fine-tuned model for inference, but you can’t fine-tune the new version of the model.

> **Important:**
>
> AI_EXTRACT uses token-based billing. For more information, see the [Snowflake Service Consumption Table](https://www.snowflake.com/legal-files/CreditConsumptionTable.pdf).

## Actions required

To continue running inference on the existing models that you created in Document AI, you must complete the following steps before **March 16, 2026**:

1. Migrate your existing Document AI models (both published and trained) to the Snowflake Model Registry. When prompted in the Document AI UI, to integrate the existing
   models into the Model Registry, follow the instructions on the integration banner.
2. Update your extraction pipelines to use the [AI_EXTRACT](../../../sql-reference/functions/ai_extract-document-ai.md) function for inference of Document AI legacy models.

   This ensures uninterrupted inference in production. For more information on using AI_EXTRACT with legacy Document AI models, see [AI_EXTRACT (Document AI legacy models)](../../../sql-reference/functions/ai_extract-document-ai.md).
3. Recommended: To continue using the Document AI data, export all of your existing Document AI model builds (which contain documents, prompts, and annotations) to a target internal stage.

## Change log

| Update | Date |
| --- | --- |
| Initial publication | 21-Nov-25 |
| Updated decommission date. | 20-Feb-26 |

Ref: 2156

---
title: Document AI: CREATE MODEL privilege required to create, publish, and train model builds
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1904.md
section: Release Notes
---

# Document AI: CREATE MODEL privilege required to create, publish, and train model builds

As part of Snowflake’s continued commitment to enable object sharing across Snowflake, Document AI now
stores any published or trained models within the model registry to enable replicating or cloning the
models between schemas, databases, and accounts. This change requires you to grant the CREATE MODEL privilege
on the schema to the account role to continue using the `<model_build_name>!PREDICT` method.

> **Note:**
>
> You can’t replicate a *model build* (which includes the model, the data values to be extracted,
> and the documents uploaded to test and train the model). This change only affects the *model*
> that was either published or trained out of a model build.

## Behavior change

Before the change:
:   To create, publish, and train a Document AI model build, you must use a role that has the following privileges:

    ```sqlexample
    GRANT CREATE SNOWFLAKE.ML.DOCUMENT_INTELLIGENCE ON SCHEMA doc_ai_db.doc_ai_schema TO ROLE doc_ai_role;
    ```

After the change:
:   To create, publish, and train a Document AI model build, you must use a role that has the following privileges:

    ```sqlexample
    GRANT CREATE SNOWFLAKE.ML.DOCUMENT_INTELLIGENCE ON SCHEMA doc_ai_db.doc_ai_schema TO ROLE doc_ai_role;
    GRANT CREATE MODEL ON SCHEMA doc_ai_db.doc_ai_schema TO ROLE doc_ai_role;
    ```

## Actions required

1. To continue working with the existing Document AI models, grant the additional privilege to the account role, as shown in the following example:

   ```sqlexample
   GRANT CREATE MODEL ON SCHEMA doc_ai_db.doc_ai_schema TO ROLE doc_ai_role;
   ```

> To create a new Document AI model build, you must use a role that has the required privileges.

## Additional notes

* New Document AI models are automatically integrated into the model registry.
* Existing Document AI models must be manually integrated into the model registry. See Actions required.
* Only published or trained models can be integrated into the model registry.
* You can’t integrate a model into the model registry during training. Wait until the training is completed.
* You must grant the CREATE MODEL privilege on both the schema that stores the source model and the schema to which you replicate a model.
* There are no changes to the `<model_build_name>!PREDICT` method itself.

## Change log

| Section | Update | Date |
| --- | --- | --- |
|  | Initial publication | 04-Mar-25 |
| * Actions required * Additional notes | Added information about integrating the existing Document AI models into the model registry. | 09-Jul-25 |

Ref: 1904

---
title: DROP ROLE command: No longer requires the MANAGE GRANTS privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2167.md
section: Release Notes
---

# DROP ROLE command: No longer requires the MANAGE GRANTS privilege

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The [DROP ROLE](../../../sql-reference/sql/drop-role.md) command behaves as follows:

Before the change:
:   Dropping a role that is granted future grants requires a role with OWNERSHIP of that role and the MANAGE GRANTS privilege.

After the change:
:   Dropping a role that is granted future grants requires OWNERSHIP of that role and no additional privileges.

This BCR simplifies the introduction of container-scoped MANAGE GRANTS, and simplifies SCIM role management.

Ref: 2167

---
title: DROP ROLE command: Restriction on dropping the current primary role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1843.md
section: Release Notes
---

# DROP ROLE command: Restriction on dropping the current primary role

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the behavior of the [DROP ROLE](../../../sql-reference/sql/drop-role.md) command
changes as follows:

Before the change:
:   Users can execute a DROP ROLE command that drops the current primary role. The current primary role is the
    role that is currently active for a session. For example, this role can be set with the USE ROLE command,
    defined as part of a user connection, or defined as the default role for a user.

After the change:
:   Users can no longer execute a DROP ROLE command that drops the current primary role. An attempt to drop this
    role returns the following error:

    ```output
    SQL execution error: Cannot drop role <x> as it is the current primary role.
    ```

## Reasons for this change

This change is being made because the current behavior has the following consequences:

* Object ownership metadata is left in an inconsistent state when the current primary role is dropped.
* Sessions that were using the dropped primary role are interrupted.

## Preparation for this change

To prepare for this change, check all of your automated processes for use of the DROP ROLE command and make sure
that these commands do not attempt to drop the current primary role. Also check your query history for any past
instances of this behavior.

Ref: 1843

---
title: DROP TABLE command: Changes to CASCADE behavior for hybrid tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1741.md
section: Release Notes
---

# DROP TABLE command: Changes to CASCADE behavior for hybrid tables

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the default setting of the CASCADE/RESTRICT parameter is
different for [hybrid tables](../../../user-guide/tables-hybrid.md). This change does not apply to standard tables.

Before the change:
:   When you drop a hybrid table without specifying RESTRICT or CASCADE, and the hybrid table has a
    primary-key/foreign-key or unique-key/foreign-key relationship with another table, the DROP TABLE command succeeds.

    The default behavior is CASCADE.

    ```sqlexample
    CREATE OR REPLACE HYBRID TABLE ht1(
      col1 NUMBER(38,0) NOT NULL,
      col2 NUMBER(38,0) NOT NULL,
      CONSTRAINT pkey_ht1 PRIMARY KEY (col1, col2));

    CREATE OR REPLACE HYBRID TABLE ht2(
      cola NUMBER(38,0) NOT NULL,
      colb NUMBER(38,0) NOT NULL,
      colc NUMBER(38,0) NOT NULL,
      CONSTRAINT pkey_ht2 PRIMARY KEY (cola),
      CONSTRAINT fkey_ht1 FOREIGN KEY (colb, colc) REFERENCES ht1(col1,col2));

    DROP TABLE ht1;
    ```

    The DROP TABLE command succeeds without any error.

After the change:
:   When you drop a hybrid table without specifying the RESTRICT or CASCADE option, and the hybrid table
    has a primary-key/foreign-key or unique-key/foreign-key relationship with another table, the DROP TABLE
    command fails with an error.

    The default behavior is RESTRICT.

    For example:

    ```sqlexample
    DROP TABLE ht1;
    ```

    ```output
    SQL compilation error:
    Cannot drop the table because of dependencies
    ```

    The DROP TABLE command fails in this case. If necessary, you can override the default behavior by specifying
    CASCADE in the DROP TABLE command.

    ```sqlexample
    DROP TABLE ht1 CASCADE;
    ```

    Alternatively in this case, you could drop the dependent table `ht2` first, then drop table `ht1`.

Ref: 1741

---
title: Dynamic tables: Added support for MONITOR privileges
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1373.md
section: Release Notes
---

# Dynamic tables: Added support for MONITOR privileges

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

To view the metadata of a dynamic table, such as the graph history, refresh history, or attached warehouse, specific privileges are required:

Before the change:
:   Only SELECT privilege is required.

After the change:
:   MONITOR privilege is required.

Ref: 1373

---
title: Dynamic tables: Changes to ACCOUNT_USAGE.TABLES and INFORMATION_SCHEMA.TABLES
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-account-usage-and-info-schema-changes.md
section: Release Notes
---

# Dynamic tables: Changes to ACCOUNT_USAGE.TABLES and INFORMATION_SCHEMA.TABLES

## New column added

The ACCOUNT_USAGE.TABLES and INFORMATION_SCHEMA.TABLES views include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `is_dynamic` | Text | Indicates whether the table is a dynamic table. Valid values are YES or NO. |

## Changes to ACCOUNT_USAGE.TABLES

Beginning with the **8.9** release, the following changes to ACCOUNT_USAGE.TABLES are enabled:

Before the change:
:   Dynamic tables are not included in this view. For rows that represent dynamic tables, the
    value for the `is_insertable_into` column is `YES`.

    The ACCOUNT_USAGE.TABLES view doesn’t include the `is_dynamic` column.

After the change:
:   Dynamic tables are included in this view. For rows that represent dynamic tables, the values
    of the `table_type` and `is_insertable_into` columns are `BASE TABLE` and `NO`,
    respectively.

    The ACCOUNT_USAGE.TABLES view includes the `is_dynamic` column, defined above.

## Changes to INFORMATION_SCHEMA.TABLES

Beginning with the **8.9** release, the following changes to INFORMATION_SCHEMA.TABLES are enabled:

Before the change:
:   For rows that represent dynamic tables, the values of the `table_type` and `is_insertable_into`
    columns are `NULL` and `YES`, respectively.

    The INFORMATION_SCHEMA.TABLES view doesn’t include the `is_dynamic` column.

After the change:
:   For rows that represent dynamic tables, the values of the `table_type` and `is_insertable_into`
    columns are `BASE TABLE` and `NO`, respectively.

    The INFORMATION_SCHEMA.TABLES view includes the `is_dynamic` column, defined above.

---
title: Dynamic tables: disallow using SQL UDFs and UDTFs in new dynamic tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1489.md
section: Release Notes
---

# Dynamic tables: disallow using SQL UDFs and UDTFs in new dynamic tables

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

SQL user-defined functions (UDF) and user-defined tabular functions (UDTF) in dynamic tables
can cause unexpected behavior, resulting in confusing errors or unexpected behavior.

Before the change:
:   You can use SQL UDFs and/or UDTFs in new and existing dynamic tables.

After the change:
:   You cannot use SQL UDFs or UDTFs in new dynamic tables.

    Existing dynamic tables that use SQL UDFs and/or UDTFs are not affected by this behavior change,
    but they may be subject to unexpected behavior.

    UDFs and UDTFs written in other [supported programming languages](../../../developer-guide/udf/udf-overview.md)
    are not affected by this behavior change.

Ref: 1489

---
title: Dynamic tables: New behavior for cloned dynamic tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1943.md
section: Release Notes
---

# Dynamic tables: New behavior for cloned dynamic tables

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

Cloned dynamic tables behave as follows:

Before the change:
:   Cloned dynamic tables assume the same state (ACTIVE or SUSPENDED) as the source dynamic tables.

After the change:
:   Cloned dynamic tables are suspended by default, whether you clone a dynamic table directly
    or clone a database or schema that contains dynamic tables.

    In the output of the [DYNAMIC_TABLE_GRAPH_HISTORY](../../../sql-reference/functions/dynamic_table_graph_history.md) table function,
    their `SCHEDULING_STATE` column shows CLONED_AUTO_SUSPENDED as the `reason_code`. Any
    dynamic tables created downstream to these cloned dynamic tables are also suspended, with a
    `reason_code` of UPSTREAM_CLONED_AUTO_SUSPENDED.

    For more information, see [Automatic dynamic table suspension](../../../user-guide/dynamic-tables-suspend-resume.md).

Ref: 1943

---
title: Dynamic tables: New column in SHOW DYNAMIC TABLES and DDL fix
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2248.md
section: Release Notes
---

# Dynamic tables: New column in SHOW DYNAMIC TABLES and DDL fix

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) command includes a
new BACKFILL_FROM column, and the generated Data Definition Language (DDL) for dynamic tables preserves the user-provided table from the
BACKFILL_FROM attribute.

## SHOW DYNAMIC TABLES command: New column in output

When this behavior change bundle is enabled, the output of the [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) command
includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| BACKFILL_FROM | VARCHAR | The backfill source table, if specified. For example, `table`, `schema.table`, or `db.schema.table`. |

Before the change:
:   BACKFILL_FROM is only visible inside the DDL string in the `description` column. Detecting backfill configuration changes requires
    parsing raw SQL.

After the change:
:   BACKFILL_FROM is a standalone column, giving you direct programmatic access to the backfill configuration.

## DDL consistency fix

When this behavior change bundle is enabled, the generated DDL for dynamic tables behaves in the following manner:

Before the change:
:   If you create a dynamic table with a fully or partially qualified table name—for example,
    `CREATE DYNAMIC TABLE ... BACKFILL FROM my_schema.my_table`—the generated DDL might strip the schema and display only `my_table`.
    This inconsistency can cause issues during redeployments.

After the change:
:   If you create a dynamic table with a fully or partially qualified table name—for example,
    `CREATE DYNAMIC TABLE ... BACKFILL FROM my_schema.my_table`—the generated DDL preserves the exact qualification you provided. If you
    specified `my_schema.my_table`, the DDL reflects `my_schema.my_table`.

Ref: 2248

---
title: Dynamic tables: OPERATE privilege on upstream dynamic tables required for initial refresh
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1371.md
section: Release Notes
---

# Dynamic tables: OPERATE privilege on upstream dynamic tables required for initial refresh

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

When you create a dynamic table that depends on upstream dynamic tables, you must have the
following privileges on the upstream dynamic tables in order for the initial refresh to succeed:

Before the change:
:   Only SELECT privilege on the upstream dynamic tables is required.

After the change:
:   You must have OPERATE privilege on the upstream dynamic tables.

    If you do not have OPERATE privilege, the initial refresh will fail with the following error:

    ```output
    OPERATE privilege is required on all upstream Dynamic Tables of '<table_name>' to perform a synchronous INITIAL refresh. Please acquire the right privileges.
    ```

Ref: 1371

---
title: Dynamic tables: Return value changes and new columns added to DYNAMIC_TABLE_GRAPH_HISTORY, DYNAMIC_TABLE_REFRESH_HISTORY, and SHOW DYNAMIC TABLES
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1543.md
section: Release Notes
---

# Dynamic tables: Return value changes and new columns added to DYNAMIC_TABLE_GRAPH_HISTORY, DYNAMIC_TABLE_REFRESH_HISTORY, and SHOW DYNAMIC TABLES

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

## Return value behavior

The behavior of the DYNAMIC_TABLE_GRAPH_HISTORY, DYNAMIC_TABLE_REFRESH_HISTORY, and SHOW DYNAMIC
TABLES functions has changed. Return values for these functions are displayed as follows.

### Return value behavior for the DYNAMIC_TABLE_GRAPH_HISTORY function

When this behavior change bundle is enabled, the output of the DYNAMIC_TABLE_GRAPH_HISTORY function
includes the following return value changes:

Before the change:
:   * The SCHEDULING_STATE column returns `"state": "RUNNING"` or `"state": "SUSPENDED"` to
      describe the state of the dynamic table.
    * The SCHEDULING_STATE column returns RUNNING if an upstream table was suspended and you lacked the
      MONITOR privilege on that upstream table.

After the change:
:   * The SCHEDULING_STATE column returns `"state": "ACTIVE"` or `"state": "SUSPENDED"` to
      describe the state of the dynamic table.
    * The SCHEDULING_STATE column returns SUSPENDED, even if you don’t have the MONITOR privilege on
      upstream tables.

### Return value behavior for the DYNAMIC_TABLE_REFRESH_HISTORY function

When this behavior change bundle is enabled, the output of the DYNAMIC_TABLE_REFRESH_HISTORY function
includes the following return value changes:

Before the change:
:   * The LAST_COMPLETED_DEPENDENCY column might incorrectly return NULL values.
    * The STATE column returned SKIPPED for refresh jobs that were skipped due to an upstream failure.
    * Refresh histories were displayed for all dynamic table states.

After the change:
:   * The LAST_COMPLETED_DEPENDENCY column now returns accurate values.
    * The STATE column now returns UPSTREAM_FAILED for refresh jobs that are skipped due to an upstream
      failure.
    * Refresh histories are no longer displayed if the STATE column returns QUEUED or SKIPPED. (If your
      dynamic table refresh was skipped due to upstream failure, the STATE column now returns UPSTREAM_FAILED
      instead.)

### Return value behavior for the SHOW DYNAMIC TABLE function

When this behavior change bundle is enabled, the output of the SHOW DYNAMIC TABLES command includes the
following return value changes:

Before the change:
:   * The SCHEDULING_STATE column returns RUNNING or SUSPENDED to describe the state of the dynamic table.

After the change:
:   * The SCHEDULING_STATE column returns ACTIVE or SUSPENDED to describe the state of the dynamic table.

## Column changes

When enabled, the following additional columns are added to the DYNAMIC_TABLE_GRAPH_HISTORY and
DYNAMIC_TABLE_REFRESH_HISTORY functions.

### DYNAMIC_TABLE_GRAPH_HISTORY function: New column in output

When this behavior change bundle is enabled, the output of the DYNAMIC_TABLE_GRAPH_HISTORY function
includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| ALTER_TRIGGER | ARRAY | Describes why a new entry is created in the DYNAMIC_TABLE_GRAPH_HISTORY function. Can be one of the following:   * NONE (backwards-compatible) * CREATE_DYNAMIC_TABLE * ALTER_TARGET_LAG * SUSPEND * RESUME * REPLICATION_REFRESH * ALTER_WAREHOUSE |

### DYNAMIC_TABLE_REFRESH_HISTORY function: New columns in output

When this behavior change bundle is enabled, the output of the DYNAMIC_TABLE_REFRESH_HISTORY function
includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| TARGET_LAG | TEXT | This column describes the TARGET_LAG value for the dynamic table at the time the refresh occurred. |
| GRAPH_HISTORY_VALID_FROM | TIMESTAMP_NTZ | Encodes the VALID_FROM timestamp of the DYNAMIC_TABLE_GRAPH_HISTORY table function when the refresh occurred. |

Ref: 1543

---
title: Dynamic tables: SHOW command and functions: new columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2208.md
section: Release Notes
---

# Dynamic tables: SHOW command and functions: new columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the following dynamic table command and Information Schema table functions
include new columns that provide visibility into the execution context of dynamic table refreshes.

## SHOW DYNAMIC TABLES command: New columns in output

When this behavior change bundle is enabled, the output of the
[SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) command includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| EXECUTE_AS_USER | VARCHAR | The user that the dynamic table refresh runs as. |
| SECONDARY_ROLE_NAMES | VARCHAR | The secondary roles used during dynamic table refresh execution. |

## DYNAMIC_TABLES and DYNAMIC_TABLE_GRAPH_HISTORY functions (Information Schema): New columns

When this behavior change bundle is enabled, the [DYNAMIC_TABLES](../../../sql-reference/functions/dynamic_tables.md) and
[DYNAMIC_TABLE_GRAPH_HISTORY](../../../sql-reference/functions/dynamic_table_graph_history.md) [Information Schema](../../../sql-reference/info-schema.md) table
functions include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| EXECUTE_AS_USER | VARCHAR | The user that the dynamic table refresh runs as. |
| SECONDARY_ROLE_NAMES | VARCHAR | The secondary roles used during dynamic table refresh execution. |

For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../../user-guide/dynamic-tables-privileges.md).

Ref: 2208

---
title: Dynamic tables: TARGET_LAG parameter set to less than 1 minute for new or modified tables results in error
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1247.md
section: Release Notes
---

# Dynamic tables: TARGET_LAG parameter set to less than 1 minute for new or modified tables results in error

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

Dynamic tables, when TARGET_LAG is defined as < 1 minute, behave as follows:

Previously:
:   In a CREATE DYNAMIC TABLE or ALTER DYNAMIC TABLE statement, you can specify a value of
    less than 1 minute for the TARGET_LAG parameter.

    Dynamic tables with TARGET_LAG defined as less than 1 minute function as if their
    TARGET_LAG was set to 1 minute.

Currently:
:   Executing a CREATE DYNAMIC TABLE or ALTER DYNAMIC TABLE statement with the TARGET_LAG set
    to less than 1 minute results in an error.

    Any existing dynamic tables with TARGET_LAG defined as less than 1 minute continue to
    function as if their TARGET_LAG was set to 1 minute. Replacing an existing dynamic table
    with a new dynamic table that has TARGET_LAG defined as less than 1 minute results in an
    error.

Ref: 1247

---
title: Dynamic tables: Updates to dynamic table default refresh mode
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1614.md
section: Release Notes
---

# Dynamic tables: Updates to dynamic table default refresh mode

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

A dynamic table’s actual [refresh mode](../../../user-guide/dynamic-tables-refresh.md) is
determined at creation time and is immutable afterward. If not specified explicitly, the
refresh mode defaults to `AUTO`. When the 2024_04 behavior change bundle is enabled,
Snowflake chooses the refresh mode that’s likely to perform best depending on your query
definition.

To determine the best mode for your use case, experiment with refresh modes and automatic
recommendations. For consistent behavior across Snowflake releases, Snowflake recommends that
you explicitly set the refresh mode on all production dynamic tables.

For more information, see [best practices](../../../user-guide/dynamic-tables-performance.md) and
[limitations around using incremental refresh](../../../user-guide/dynamic-tables-limitations.md).

Before the change:
:   Snowflake chooses an incremental refresh of the dynamic table by default. If the dynamic
    table’s definition doesn’t support the incremental refresh mode, the dynamic table is
    automatically created with the full refresh mode.

After the change:
:   Snowflake chooses the refresh mode that is likely to perform best depending on your query
    definition.

    For consistent behavior across Snowflake releases, you should explicitly set the refresh mode
    on all production dynamic tables.

Ref: 1614

---
title: DYNAMIC_TABLE_REFRESH_HISTORY function (Information Schema): New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2254.md
section: Release Notes
---

# DYNAMIC_TABLE_REFRESH_HISTORY function (Information Schema): New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the
[DYNAMIC_TABLE_REFRESH_HISTORY](../../../sql-reference/functions/dynamic_table_refresh_history.md)
Information Schema table function includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| REINIT_REASON | STRING | The reason for the reinitialization of the dynamic table refresh. |
| EXECUTE_AS_USER | VARCHAR | The user that the dynamic table refresh runs as. |
| SECONDARY_ROLE_NAMES | VARCHAR | The secondary roles used during dynamic table refresh execution. |

For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../../user-guide/dynamic-tables-privileges.md).

Ref: 2254

---
title: DYNAMIC_TABLE_REFRESH_HISTORY function: DATA_TIMESTAMP value in output displayed in new format
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1231.md
section: Release Notes
---

# DYNAMIC_TABLE_REFRESH_HISTORY function: DATA_TIMESTAMP value in output displayed in new format

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

After manually refreshing a dynamic table using the [ALTER DYNAMIC TABLE … REFRESH](../../../sql-reference/sql/alter-dynamic-table.md) command,
when you call the DYNAMIC_TABLE_REFRESH_HISTORY function (in INFORMATION_SCHEMA),
the DATA_TIMESTAMP returned result behaves as follows:

Previously:
:   DATA_TIMESTAMP is shown as a raw value (seconds since epoch). Documentation lists the column data type incorrectly as TIMESTAMP_LTZ.

Currently:
:   DATA_TIMESTAMP will be shown as a human readable timestamp.

Ref: 1231

---
title: DYNAMIC_TABLE_REFRESH_HISTORY function: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2183.md
section: Release Notes
---

# DYNAMIC_TABLE_REFRESH_HISTORY function: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the output of the [DYNAMIC_TABLE_REFRESH_HISTORY](../../../sql-reference/functions/dynamic_table_refresh_history.md) table function
includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| inputs_with_changed_data | OBJECT | Contains partition and row level information about changed dynamic table inputs. |
| warehouse | TEXT | Name of the warehouse that was used to execute the refresh operation. |

Ref: 2183

---
title: DYNAMIC_TABLE_REFRESH_HISTORY view and SHOW DYNAMIC TABLES command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2163.md
section: Release Notes
---

# DYNAMIC_TABLE_REFRESH_HISTORY view and SHOW DYNAMIC TABLES command: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) command includes the following
new column:

| Column name | Data type | Description |
| --- | --- | --- |
| initialization_warehouse | TEXT | The warehouse used for initializations and reinitializations.  Null if no initialization warehouse is set on the dynamic table. For more information, see [Understand warehouse usage for dynamic tables](../../../user-guide/dynamic-tables-warehouses.md). |

When this behavior change bundle is enabled, the output of the DYNAMIC_TABLE_REFRESH_HISTORY [table function](../../../sql-reference/functions/dynamic_table_refresh_history.md)
and [view](../../../sql-reference/account-usage/dynamic_table_refresh_history.md) includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| warehouse | TEXT | The warehouse used for the dynamic table refresh.  If the INITIALIZATION_WAREHOUSE property is SET, the specified warehouse is used for all initializations and reinitializations; otherwise, the dynamic table uses the warehouse that is specified by the WAREHOUSE property for all refreshes. For more information, see [Understand warehouse usage for dynamic tables](../../../user-guide/dynamic-tables-warehouses.md). |

Ref: 2163

---
title: DYNAMIC_TABLES function: New default for maximum number of rows returned
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1928.md
section: Release Notes
---

# DYNAMIC_TABLES function: New default for maximum number of rows returned

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

The DYNAMIC_TABLES function behaves as follows:

Before the change:
:   By default, the function returns all rows in an unsorted order when RESULT_LIMIT is not specified. For example, if an account has 10,000
    dynamic tables, the function returns 10,000 rows.

After the change:
:   By default, the function returns 100 rows and the results are sorted by the dynamic table’s last completed refresh state in the following
    order, unless specified otherwise using the RESULT_LIMIT argument.

    1. FAILED
    2. UPSTREAM_FAILED
    3. SKIPPED
    4. SUCCEEDED
    5. CANCELED

    To sort by a different order, you must provide a large enough RESULT_LIMIT value (for example, the maximum value of a signed integer). As
    long as RESULT_LIMIT exceeds the total number of dynamic tables in the account, the results can be sorted using an ORDER BY clause.

    To apply a filter on the results, also specify a large enough RESULT_LIMIT value for the filter to be applied on all dynamic tables.

    **Examples**:

    The following example sorts by a different order of `name` and returns 100 rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) ORDER BY name ASC LIMIT 100 ;
    ```

    The following example sorts by a different order of `name` and returns all rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) ORDER BY name ASC ;
    ```

    The following example filters for all dynamic tables with 1-minute target lag, uses the default sort, and returns all rows:

    ```sqlsyntax
    SELECT * FROM TABLE(INFORMATION_SCHEMA.DYNAMIC_TABLES(result_limit => <max_value>)) WHERE TARGET_LAG_SEC = 60 ;
    ```

Ref: 1928

---
title: Enable ability to grant entities in Personal Databases to account roles (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2290.md
section: Release Notes
---

# Enable ability to grant entities in Personal Databases to account roles (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

Before the change:
:   Objects created in a [Personal Database](../../../user-guide/personal-databases.md) could be granted to users. For example:

    ```sqlexample
    GRANT READ ON WORKSPACE USER$.PUBLIC.DEFAULT$ TO USER OTHER_USER;
    ```

    Granting privileges on Personal Database entities to account roles failed for any privilege, **including CREATE**. For example, Snowflake
    returned an error for the following statement:

    ```sqlexample
    GRANT READ ON WORKSPACE USER$.PUBLIC.DEFAULT$ TO ROLE some_role;
    ```

After the change:
:   When the 2026_03 behavior change bundle is enabled in your account, granting privileges on Personal Database entities to account roles succeeds
    for the same privileges you can grant to users, **except** CREATE. Grants of CREATE on Personal Database entities to roles continue to fail.

    For example, the following statement succeeds when the bundle is enabled in your account:

    ```sqlexample
    GRANT READ ON WORKSPACE USER$.PUBLIC.DEFAULT$ TO ROLE some_role;
    ```

    Granting other users or roles the ability to CREATE objects in a Personal Database remains blocked.

This change expands how you can share Personal Database objects **within your account**. Personal Databases support a frictionless experience for
building tools such as UIs and dashboards; before this change, sharing those artifacts with others in the account effectively relied on direct
user-to-user grants because grants to account roles were not allowed. User-to-user grants were already permitted, so this change does not introduce
a new category of cross-user access; it adds role-based sharing so you can reach a wider audience through roles you already use elsewhere in the
account. It does not expand [data sharing](../../../user-guide/data-sharing-intro.md) or other cross-account sharing of PDB content.

This capability is generally available. Snowflake does not require additional customer readiness steps or diagnostics for this change.

Ref: 2290

---
title: Enforce exact length on inserts into Apache Iceberg™ fixed[L] columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2246.md
section: Release Notes
---

# Enforce exact length on inserts into Apache Iceberg™ fixed[L] columns

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

Snowflake will enforce exact length inserts into Iceberg `fixed[L]` column types.

Before the change:
:   Columns that use the Iceberg `fixed[L]` data type accept binary values up to the defined length ‘L’ in bytes. Values shorter than L are allowed.

    Values longer than L cause the query to fail.

After the change:
:   Columns that use the Iceberg `fixed[L]` data type accept binary values that exactly match the defined length ‘L’ in bytes. INSERT statements
    that attempt to insert a binary value shorter than L result in the following error message: `Binary value has length <len1>, but Iceberg fixed[<len2>] type requires exactly <len2> bytes for column '<column name>'.`

    This change aligns Snowflake with the Iceberg specification and other Iceberg-compatible engines, such as Apache Spark™.

    The following list summarizes the behavior after this change is enabled:

    * This change applies to both new and existing Iceberg tables.
    * This change applies to both Snowflake-managed and externally managed Iceberg tables.
    * Regarding reads, this change has no impact on older snapshots or time travel. Existing snapshots will remain readable, as Snowflake only
      validates the length of the Iceberg `fixed[L]` data type during DML operations.
    * Regarding writes, with this change enabled, Snowflake will always enforce the correct length of the Iceberg `fixed[L]` data type during write
      operations, including during time travel. Writes that attempt to insert a binary value that doesn’t exactly match the defined length ‘L’
      in bytes will always fail. Values longer than L still cause the query to fail.

    What you need to do to avoid any service interruption: Identify and resolve impacted columns.

## Identify and resolve impacted columns

This section shows how to identify whether a column is impacted by this BCR and resolve the impacted columns.

### Step 1: Identify impacted columns

To identify if you have any impacted columns, follow these steps:

1. Run the DESCRIBE ICEBERG TABLE command.

   In the output, look for columns with source Iceberg type `fixed[L]`. If you find a column of Iceberg type `fixed[L]`,
   proceed to the next step.
2. Run the following query:

   ```sqlexample
   SELECT BOOLOR_AGG(OCTET_LENGTH(<column_name>) != L) from <table_name>
   ```

   If the query returns true, the column contains values of an incorrect length so the column is impacted.

### Step 2: Resolve impacted columns

To resolve the impacted columns, do one of the following

* Allow inserting binary values of arbitrary size into the impacted column, up to a maximum length
* Allow inserting fixed-length binary values of length exactly L into the impacted column

#### Allow inserting binary values of arbitrary size into the impacted column, up to a maximum length

To resolve the impacted columns, you can allow inserting binary values of arbitrary size into the impacted column, up to a maximum length.

This resolution ensures that the metadata and physical files for the table are aligned with the Iceberg specification and are
therefore compatible with external engines.

To allow inserting binary values of arbitrary size into the impacted column, up to a maximum length, follow these instructions:

* If your table is accessed by external engines such as Spark, you must recreate the table by using column type BINARY.

  To recreate the table by using column type BINARY, follow these steps:

  1. To create and populate a new table that is based on the table with the impacted columns, run the CREATE ICEBERG TABLE … AS SELECT
     (also referred to as CTAS) command and specify binary as the data type for the impacted columns.

     The following example shows a CTAS statement where the data type for column b in the new table is specified as binary:

     ```sqlexample
     CREATE ICEBERG TABLE my_table (..., b binary) AS SELECT * FROM my_old_table
     ```
  2. Use the DROP ICEBERG TABLE command to remove the table with the impacted columns.
* Alternatively, you can evolve the table schema by running the ALTER ICEBERG TABLE command to set the data type for the column to BINARY.

  > **Important:**
  >
  > Before you run an ALTER ICEBERG TABLE statement to change a column data type to BINARY, you must first contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support)
  > to enable this functionality for your account.

  For example, the following statement evolves the schema for table `t` by changing the data type for column `c` to `BINARY`:

  ```sqlexample
  ALTER ICEBERG TABLE t ALTER COLUMN c SET DATA TYPE BINARY;
  ```

  This approach is a temporary, supported solution for affected customers during the BCR period. The advantage of this approach is that
  you only need to update the table schema instead of recreating the entire table.

  > **Important:**
  >
  > This approach is not a valid Iceberg type promotion, so external engines might detect an invalid type promotion and fail to refresh
  > the table. Therefore, you should only use this approach with Snowflake-managed Iceberg tables that aren’t read by external engines
  > or written to by external engines.

#### Allow inserting fixed-length binary values of length exactly L into the impacted column

To allow inserting fixed-length binary values of length exactly L into the impacted column, ensure that the size of the value that you input
matches the target size by adjusting your workflow, such as by using trimming or padding. We also recommend that you recreate the Iceberg
table with Iceberg column type `fixed(L)` to ensure that the size of any values you previously inserted in the table exactly match the
defined length L.

Ref: 2246

---
title: ENFORCE_SESSION_POLICY parameter: Always set = TRUE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2164.md
section: Release Notes
---

# ENFORCE_SESSION_POLICY parameter: Always set = TRUE

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The ENFORCE_SESSION_POLICY parameter behaves in the following manner:

Before the change:
:   ENFORCE_SESSION_POLICY is set = TRUE by default.
    ENFORCE_SESSION_POLICY can be set to FALSE by ACCOUNTADMIN.
    When ENFORCE_SESSION_POLICY=FALSE, session policies are not enforced.

After the change:
:   ENFORCE_SESSION_POLICY is set = TRUE by default.
    ENFORCE_SESSION_POLICY can’t be set = FALSE.
    Session policies are always enforced.

Ref: 2164

---
title: Event tracing: Trace IDs propagated from parent to child through procedure calls
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1592.md
section: Release Notes
---

# Event tracing: Trace IDs propagated from parent to child through procedure calls

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

Before the change:
:   The `trace_id` of each of the spans created by chained Java and Scala stored procedures or UDFs is unique.

    The `parent_span_id` field does not exist in the RECORD column of the event table.

    Native apps providers and consumers see different `trace_id` values for shared events. The provider sees the hashed version.

After the change:
:   Spans generated by chained Java and Scala stored procedures or UDFs have the same `trace_id`. The RECORD column has a
    `parent_span_id` attribute.

    Spans generated by chained Java and Scala stored procedures or UDFs have a parent-child relationship between `parent_span_id` and
    `span_id`. Java and Scala stored procedures can call other stored procedures in a chain of any length. (UDFs can’t execute
    SQL statements, so calling a UDF ends the chain. However, the trace info is still propagated to the UDF’s spans.)

    If the Java or Scala stored procedure or UDF was called by the user directly (the root), then the `trace_id` will be a random ID
    and there will be no `parent_span_id`. If tracing is disabled for a stored procedure and it calls another stored procedure or UDF,
    then the `trace_id` of the child’s spans will be random and they will have no `parent_span_id`. In other words, the trace is
    restarted at the child.

    Native apps providers and consumers see the same `trace_id` for shared Java or Scala stored procedure or UDF events, so they can be
    debugged more easily.

Ref: 1592

---
title: Expanded task view capabilities for MONITOR, OPERATE, MONITOR EXECUTION, and OWNER privileges
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1799.md
section: Release Notes
---

# Expanded task view capabilities for MONITOR, OPERATE, MONITOR EXECUTION, and OWNER privileges

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

Enabling this behavior change bundle expands the ability of roles with these privileges to view task information.

Before the change:
:   Some task information was unavailable to roles with MONITOR, OPERATE, MONITOR EXECUTION, and OWNERSHIP privileges.

After the change:
:   Task information available with MONITOR, OPERATE, MONITOR EXECUTION, and OWNERSHIP privileges is expanded.

    The following tables describe the task information that is now available when this behavior change bundle is enabled.

## MONITOR and OPERATE privileges

|  | Before the change | After the change |
| --- | --- | --- |
| [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) | ✔ | ✔ |
| [DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) | ✔ | ✔ |
| **INFORMATION_SCHEMA table functions** |  |  |
| [TASK_HISTORY](../../../sql-reference/functions/task_history.md) | ❌ | ✔ |
| [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md) | ❌ | ✔ |
| [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md) | ❌ | ✔ |
| [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../../../sql-reference/functions/query_history.md) | **For serverless tasks:**  ✔ All queries  ✔ Stored-procedure child queries that ran with caller’s rights.  ❌ Stored-procedure child queries that ran with owner’s rights.  **For user-managed tasks:**  ❌ Task privileges do not determine what a role can view in QUERY_HISTORY. | **For serverless tasks:**  ✔ All queries  ✔ Stored-procedure child queries that ran with caller’s rights.  ❌ Stored-procedure child queries that ran with owner’s rights.  **For user-managed tasks:**  ✔ Task queries and stored-procedure child queries.  ❌ Stored-procedure child queries that ran with owner’s rights require at least MONITOR privileges on the warehouse. Task privileges do not grant this visibility. |
| **Snowsight** |  |  |
| Task details > **Run History**, and **Transformation** > **Tasks** > **Task Runs** | ❌ | ✔ |
| Task details > **Graph**, and **Transformation** > **Tasks** > **Task Graphs** | ✔ | ✔ |

## MONITOR EXECUTION privilege

|  | Before the change | After the change |
| --- | --- | --- |
| [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) | ✔ | ✔ |
| [DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) | ✔ | ✔ |
| **INFORMATION_SCHEMA table functions** |  |  |
| [TASK_HISTORY](../../../sql-reference/functions/task_history.md) | ✔ | ✔ |
| [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md) | ✔ | ✔ |
| [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md) | ✔ | ✔ |
| [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../../../sql-reference/functions/query_history.md) | ❌ Task privileges do not determine what a role can view in QUERY_HISTORY. | **For serverless tasks:**  ✔ All queries  ✔ Stored-procedure child queries that ran with caller’s rights.  ❌ Stored-procedure child queries that ran with owner’s rights.  **For user-managed tasks:**  ✔ Task queries and stored-procedure child queries.  ❌ Stored-procedure child queries that ran with owner’s rights require at least MONITOR privileges on the warehouse. Task privileges do not grant this visibility. |
| **Snowsight** |  |  |
| Task details > **Run History**, and **Transformation** > **Tasks** > **Task Runs** | ✔ | ✔ |
| Task details > **Graph**, and **Transformation** > **Tasks** > **Task Graphs** | ✔ | ✔ |

## OWNERSHIP privilege

|  | Before the change | After the change |
| --- | --- | --- |
| [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) | ✔ | ✔ |
| [DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) | ✔ | ✔ |
| **INFORMATION_SCHEMA table functions** |  |  |
| [TASK_HISTORY](../../../sql-reference/functions/task_history.md) | ✔ | ✔ |
| [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md) | ✔ | ✔ |
| [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md) | ✔ | ✔ |
| [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../../../sql-reference/functions/query_history.md) | **For serverless tasks:**  ✔ All queries  ✔ Stored-procedure child queries that ran with caller’s rights.  ❌ Stored-procedure child queries that ran with owner’s rights.  **For user-managed tasks:**  ❌ Task privileges do not determine what a role can view in QUERY_HISTORY. | **For serverless tasks:**  ✔ All queries  ✔ Stored-procedure child queries that ran with caller’s rights.  ❌ Stored-procedure child queries that ran with owner’s rights.  **For user-managed tasks:**  ✔ Task queries and stored-procedure child queries.  ❌ Stored-procedure child queries that ran with owner’s rights require at least MONITOR privileges on the warehouse. Task privileges do not grant this visibility. |
| **Snowsight** |  |  |
| Task details > **Run History**, and **Transformation** > **Tasks** > **Task Runs** | ✔ | ✔ |
| Task details > **Graph**, and **Transformation** > **Tasks** > **Task Graphs** | ✔ | ✔ |

Ref: 1799

---
title: Extend SHOW USERS, SHOW TERSE USERS, and SELECT FROM account_usage.users: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1690.md
section: Release Notes
---

# Extend SHOW USERS, SHOW TERSE USERS, and SELECT FROM account_usage.users: New columns

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the SHOW USERS, SHOW TERSE USERS, and SELECT FROM account_usage.users includes the following
new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| TYPE | VARCHAR | Specifies the type of user. PERSON, SERVICE, LEGACY_SERVICE, or NULL. Default: NULL. For more information about types of users, see [the TYPE object property](../../../sql-reference/sql/create-user.md) of users. |
| HAS_MFA | BOOLEAN | Specifies whether the user is enrolled for multi-factor authentication. |

This change also changes the order of the returned columns, so if you use ordinals to select from these columns, then you need to change the
ordinals.

Ref: 1690

---
title: External OAuth security integrations: EXTERNAL_OAUTH_JWS_KEYS_URL parameter requires HTTPS
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2218.md
section: Release Notes
---

# External OAuth security integrations: EXTERNAL_OAUTH_JWS_KEYS_URL parameter requires HTTPS

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

The EXTERNAL_OAUTH_JWS_KEYS_URL parameter of an External OAuth security integration specifies the
endpoint from which Snowflake retrieves public keys to validate OAuth access tokens. This behavior change strengthens security by ensuring
that public keys used to validate OAuth access tokens are always retrieved over an encrypted connection.

Before the change:
:   The EXTERNAL_OAUTH_JWS_KEYS_URL parameter accepts both HTTP and HTTPS URLs. HTTP transmits data
    in plain text, which means the keys retrieved over HTTP are vulnerable to interception and
    man-in-the-middle attacks.

After the change:
:   The EXTERNAL_OAUTH_JWS_KEYS_URL parameter requires an HTTPS URL. HTTPS encrypts data in transit
    using TLS, which protects against attacks. HTTP URLs are no longer accepted.

Ref: 2218

---
title: EXTRACT_SEMANTIC_CATEGORIES Function: International Tag Values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1110.md
section: Release Notes
---

# EXTRACT_SEMANTIC_CATEGORIES Function: International Tag Values

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The [EXTRACT_SEMANTIC_CATEGORIES](../../../sql-reference/functions/extract_semantic_categories.md) function behaves as follows:

Previously:
:   The output of the function takes the following form:

    > ```sqljson
    > {
    >     "<col1_name>": {
    >     "extra_info" : {
    >         "alternates" : [<semantic_categories>],
    >         "probability" : "<number>"
    >     },
    >     "privacy_category" : "<value>",
    >     "semantic_category" : "<value>"
    >     },
    > ...
    > ...
    >     "<colN_name>": {
    >     "extra_info" : {
    >         "alternates" : [<semantic_categories>],
    >         "probability" : "<number>"
    >     },
    >     "privacy_category" : "<value>",
    >     "semantic_category" : "<value>"
    >     }
    > }
    > ```

Currently:
:   The output of the function will change in its formatting, and the output will include support for SEMANTIC_CATEGORY tag values
    that pertain to Australia, Canada, the United Kingdom, and the United States. To support these countries, the tag values correspond to
    certain *parent category groups*. A parent category contains information about the classification result, including whether the column
    consists of values largely from one country or another.

    The formatting changes are:

    * Remove the `extra_info` and `probability` fields.
    * Move the `alternates` field to a different position in the output.
    * Add these new fields:

      + `valid_value_ratio`, which specifies the ratio of valid values in the sample size. Invalid values include NULL, an empty string,
        and a string with more than 256 characters.
      + `recommendation`, which includes information about each tag and value.
      + `confidence`, where the possible values are either `HIGH`, `MEDIUM`, or `LOW`.
      + `coverage`, which indicates the percent of sampled cell values that match the rules for a particular category.
      + `details`, which contains fields and values that can specify a geographical tag value for the SEMANTIC_CATEGORY tag.

    For example:

    ```sqljson
    {
      "valid_value_ratio": 1.0,
      "recommendation": {
        "semantic_category": "PASSPORT",
        "privacy_category": "IDENTIFIER",
        "confidence": "HIGH",
        "coverage": 0.7,
        "details": [
          {
            "semantic_category": "US_PASSPORT",
            "coverage": 0.7
          },
          {
            "semantic_category": "CA_PASSPORT",
            "coverage": 0.1
          }
        ]
      },
      "alternates": [
        {
          "semantic_category": "NATIONAL_IDENTIFIER",
          "privacy_category": "IDENTIFIER",
          "confidence": "LOW",
          "coverage": 0.3,
          "details": [
            {
              "semantic_category": "US_SSN",
              "privacy_category": "IDENTIFIER",
              "coverage": 0.3
            }
          ]
        }
      ]
    }
    ```

    The following table summarizes the relationship between the classification tags, new category groups and group members, and
    supported countries. The country codes are based on the [ISO-3166-1 alpha-2](https://en.wikipedia.org/wiki/ISO_3166-1_alpha-2)
    standard. Other semantic categories, such as EMAIL and GENDER, are not affected.

    > | PRIVACY_CATEGORY Tag Values | SEMANTIC_CATEGORY Tag Values (Parent Group) | Group Members | Country Code |
    > | --- | --- | --- | --- |
    > | `IDENTIFIER` | `BANK_ACCOUNT` | `CA_BANK_ACCOUNT` . `US_BANK_ACCOUNT` . `IBAN` | CA . US |
    > |  | `ORGANIZATION_IDENTIFIER` | `AU_BUSINESS_NUMBER` . `AU_COMPANY_NUMBER` | AU |
    > |  | `DRIVERS_LICENSE` | `AU_DRIVERS_LICENSE` . `CA_DRIVERS_LICENSE` . `US_DRIVERS_LICENSE` | AU . CA . US |
    > |  | `MEDICARE_NUMBER` | `AU_MEDICARE_NUMBER` | AU |
    > |  | `PASSPORT` | `AU_PASSPORT` . `CA_PASSPORT` . `US_PASSPORT` | AU . CA . US |
    > |  | `PHONE_NUMBER` | `AU_PHONE_NUMBER` . `CA_PHONE_NUMBER` . `UK_PHONE_NUMBER` . `US_PHONE_NUMBER` | AU . CA . GB . US |
    > |  | `STREET_ADDRESS` | `CA_STREET_ADDRESS` . `US_STREET_ADDRESS` | CA . US |
    > |  | `TAX_IDENTIFIER` | `AU_TAX_NUMBER` | AU |
    > |  | `NATIONAL_IDENTIFIER` | `CA_SOCIAL_INSURANCE_NUMBER` . `UK_NATIONAL_INSURANCE_NUMBER` . `US_SSN` | CA . GB . US |
    > | `QUASI_IDENTIFIER` | `CITY` | `US_CITY` . `CA_CITY` . | US . CA . |
    > |  | `POSTAL_CODE` | `AU_POSTAL_CODE` . `CA_POSTAL_CODE` . `UK_POSTAL_CODE` . `US_POSTAL_CODE` | AU . CA . GB . US |
    > |  | `ADMINISTRATIVE_AREA_1` | `CA_PROVINCE_OR_TERRITORY` . `US_STATE_OR_TERRITORY` | CA . US |
    > |  | `ADMINISTRATIVE_AREA_2` | `US_COUNTY` | US |

    The data engineer can use the pending tag values by manually specifying the tag value in the [ALTER TABLE](../../../sql-reference/sql/alter-table.md) or
    [ALTER VIEW](../../../sql-reference/sql/alter-view.md) statement. Alternatively, the data engineer can call the
    [ASSOCIATE_SEMANTIC_CATEGORY_TAGS](../../../sql-reference/stored-procedures/associate_semantic_category_tags.md) stored procedure to set the tag.

    For example, use an ALTER TABLE statement to set the `PASSPORT` tag value on the PASSPORT table column manually.

    ```sqlexample
    ALTER TABLE mydb.myschema.mytable
      MODIFY COLUMN passport
      SET TAG SNOWFLAKE.CORE.SEMANTIC_CATEGORY = 'PASSPORT';
    ```

    There are no changes to the overall classification process or the
    steps to classify a table, all tables in a schema, or all tables in a database.

    > **Tip:**
    >
    > If you pass the EXTRACT_SEMANTIC_CATEGORIES function as an argument to the ASSOCIATE_SEMANTIC_CATEGORY_TAGS stored procedure, be sure
    > to double-check any custom handling that you might have configured to ensure that your workflows do not break due to the pending
    > formatting changes.

Ref: 1110

---
title: Failover Groups: Change to GRANTED_ON Column in SHOW GRANTS Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-895.md
section: Release Notes
---

# Failover Groups: Change to GRANTED_ON Column in SHOW GRANTS Output

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The [SHOW GRANTS ON <object_type> <object_name>](../../../sql-reference/sql/show-grants.md) command lists all the privileges granted on a
specific object. The value in the GRANTED_ON column is the type of that object. For failover groups, this value is REPLICATION_GROUP.

The value in the GRANTED_ON column for SHOW GRANTS ON FAILOVER GROUP <failover_group_name> has changed as follows:

Previously:
:   The GRANTED_ON column value for failover groups was REPLICATION_GROUP.

Currently:
:   The GRANTED_ON column value for failover groups is now FAILOVER_GROUP.

Ref: 895

---
title: Fast listing auto-fulfillment enabled by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2221.md
section: Release Notes
---

# Fast listing auto-fulfillment enabled by default

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

This behavior change applies to the [listing auto-fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md)
setting for all existing and new application packages.

Listing auto-fulfillment behaves as follows:

Before the change:
:   `LISTING_AUTO_REFRESH` is not enabled by default. For more information, see [ALTER APPLICATION PACKAGE](../../../sql-reference/sql/alter-application-package.md).

After the change:
:   `LISTING_AUTO_REFRESH` is enabled by default.

Ref: 2221

---
title: Feature updates earlier in 2026
source: https://docs.snowflake.com/en/release-notes/feature-releases-2026.md
section: Release Notes
---

# Feature updates earlier in 2026

This topic lists the feature updates that occurred earlier in 2026.

For more recent feature updates, see [Snowflake server release notes and feature updates](new-features.md).

* [Mar 16, 2026: Metering disabled for hybrid table requests](2026/other/2026-03-16-hybrid-tables-metering-disabled.md)
* [Mar 16, 2026: Snowflake Notebooks renamed to Legacy Snowflake Notebooks](2026/other/2026-03-16-legacy-notebooks.md)
  + [Migration to Notebooks in Workspaces](2026/other/2026-03-16-legacy-notebooks.md)
* [Mar 16, 2026: Apache Iceberg™ tables: Write support by using an external query engine (*Preview*)](2026/other/2026-03-16-tables-iceberg-query-using-external-query-engine-snowflake-horizon-writes-feature.md)
* [Mar 13, 2026: Cortex Agent evaluations (*General availability*)](2026/other/2026-03-13-cortex-agent-evaluations.md)
* [Mar 13, 2026: Time distribution information added to STATISTICS column in dynamic table refresh history](2026/other/2026-03-13-dt-time-distribution.md)
* [Mar 13, 2026: Network Policy Advisor — *General availability*](2026/other/2026-03-13-network-policy-advisor-ga.md)
* [Mar 13, 2026: Support for specifying relationship paths in semantic views (*Preview*)](2026/other/2026-03-13-semantic-views-multi-path.md)
* [Mar 13, 2026: New OVERLAP_POLICY parameter for task graphs](2026/other/2026-03-13-tasks-overlap-policy.md)
* [Mar 12, 2026: AI_EXTRACT scale factor parameter (*General availability*)](2026/other/2026-03-12-ai-extract.md)
* [Mar 12, 2026: AI code suggestions in Workspaces (*Preview*)](2026/other/2026-03-12-cortex-code-ai-suggestions-preview.md)
* [Mar 12, 2026: Investigate cost anomalies using hourly consumption by service type](2026/other/2026-03-12-cost-anomaly-hourly-consumption-by-service-type.md)
* [Mar 12, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-03-12-dcr.md)
  + [Clean Rooms API Version: 13.6](2026/other/2026-03-12-dcr.md)
* [Mar 12, 2026: Multi-Location Resilience for Data Pipelines (General availability)](2026/other/2026-03-12-multi-location-resilience-data-pipelines-ga.md)
* [Mar 12, 2026: Recent Cortex Search updates (*Generally Available*)](2026/other/2026-03-12-recent-cortex-search.md)
  + [Multi-index search](2026/other/2026-03-12-recent-cortex-search.md)
  + [Custom vector embeddings](2026/other/2026-03-12-recent-cortex-search.md)
  + [Enhanced Cortex Search tool for Cortex Agents and Snowflake Intelligence](2026/other/2026-03-12-recent-cortex-search.md)
* [Mar 11, 2026: Resource budgets for Cortex Agents](2026/other/2026-03-11-cortex-agents-resource-budgets.md)
* [Mar 11, 2026: Resource budgets for Snowflake Intelligence](2026/other/2026-03-11-snowflake-intelligence-resource-budgets.md)
* [Mar 9, 2026: Cortex Code in Snowsight - *General availability*](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
  + [Why this matters](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
  + [Legal notices](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
* [Mar 09, 2026: Streamlit in Snowflake container runtime and secrets support (*General availability*)](2026/other/2026-03-09-sis-container-runtime-ga.md)
* [Mar 06, 2026: SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG function (*General availability*)](2026/other/2026-03-06-system-get-catalog-linked-database-config.md)
* [Mar 05, 2026: AI_COMPLETE document intelligence (*Preview*)](2026/other/2026-03-05-ai-complete-document-intelligence.md)
* [Mar 05, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-03-05-dcr.md)
  + [Clean Rooms API Version: 13.5](2026/other/2026-03-05-dcr.md)
* [Mar 05, 2026: Preventing a semantic view metric from being aggregated across specific dimensions](2026/other/2026-03-05-semantic-views-semi-additive-metrics.md)
* [Mar 05, 2026: Exporting a semantic view to a Tableau Data Source (TDS) file (*Preview*)](2026/other/2026-03-05-semantic-views-tableau-tds.md)
* [Mar 04, 2026: Support for Apache Iceberg™ version 3 (*Preview*)](2026/other/2026-03-04-iceberg-v3-support-preview.md)
* [Mar 2, 2026: Monitor and control Cortex AI Functions spending (*General availability*)](2026/other/2026-02-25-ai-functions-cost-management.md)
* [Mar 02, 2026: No limit on the number of backup sets per object](2026/other/2026-03-02-backups-no-limit-backup-sets.md)
* [Mar 02, 2026: Support for new dbt Core versions for dbt Projects on Snowflake](2026/other/2026-03-02-dbt-core-versions.md)
* [Mar 02, 2026: Simplified pricing for hybrid tables](2026/other/2026-03-02-hybrid-tables-pricing.md)
* [Mar 02, 2026: Query Delta-based Apache Iceberg™ tables with deletion vectors](2026/other/2026-03-02-iceberg-delta-deletion-vectors.md)
* [Mar 02, 2026: Using standard SQL clauses to query semantic views (*General availability*)](2026/other/2026-03-02-semantic-views-standard-sql.md)
* [Feb 27, 2026: Openflow Connector for Oracle (*General availability*)](2026/other/2026-02-27-openflow-oracle-ga.md)
* [Feb 27, 2026: Restricted caller’s rights in Streamlit in Snowflake (*Preview*)](2026/other/2026-02-27-sis-restricted-callers-rights.md)
* [Feb 26, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-26-dcr.md)
  + [Clean Rooms API Version: 13.4](2026/other/2026-02-26-dcr.md)
* [Feb 25, 2026: Account Usage CORTEX_AGENT_USAGE_HISTORY view (*General availability*)](2026/other/2026-02-25-cortex-agent-usage-history-view.md)
* [Feb 25, 2026: Joining logical tables that contain ranges of values in a semantic view (*Preview*)](2026/other/2026-02-25-semantic-views-range-joins.md)
* [Feb 25, 2026: Account Usage SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*General availability*)](2026/other/2026-02-25-snowflake-intelligence-usage-history-view.md)
* [Feb 24, 2026: View invoices in Snowsight](2026/other/2026-02-24-billing-invoices.md)
* [Feb 24, 2026: User-defined actions for budgets](2026/other/2026-02-24-budget-user-defined-actions.md)
* [Feb 24, 2026: Enforcement of privatelink-only access (*General availability*)](2026/other/2026-02-24-enforce-privatelink-access-only.md)
* [Feb 24, 2026: Snowflake Postgres (*General availability*)](2026/other/2026-02-24-snowflake-postgres-ga.md)
* [Feb 23, 2026: Simplified setup for Data Quality Monitoring](2026/other/2026-02-23-data-quality-monitoring-setup.md)
  + [Cortex Data Quality (*Preview*)](2026/other/2026-02-23-data-quality-monitoring-setup.md)
  + [User interface for creating data quality checks (*Preview*)](2026/other/2026-02-23-data-quality-monitoring-setup.md)
* [Feb 23, 2026: Grouped Query History in Snowsight (*General availability*)](2026/other/2026-02-23-grouped-query-history-ui.md)
* [Feb 20, 2026: Snowflake Native Apps: Configuration (*Preview*)](2026/other/2026-02-20-nativeapps-configuration.md)
* [Feb 20, 2026: USE AI FUNCTIONS account privilege for Cortex AI Functions](2026/other/2026-02-20-use-ai-functions-privilege.md)
* [Feb 19, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-19-dcr.md)
  + [Clean Rooms API Version: 13.3](2026/other/2026-02-19-dcr.md)
* [Feb 19, 2026: Machine learning experiments (*General availability*)](2026/other/2026-02-19-ml-experiments-ga.md)
* [Feb 18, 2026: Snowflake Container Runtime versioning for ML Jobs (*Preview*)](2026/other/2026-02-18-container-runtime-versions-preview.md)
* [Feb 18, 2026: Account Usage New CORTEX_AGENT_USAGE_HISTORY view (*Preview*)](2026/other/2026-02-18-cortex-agent-usage-history-view.md)
* [Feb 18, 2026: Support for changing refresh user and secondary roles](2026/other/2026-02-18-dynamic-tables-execute-as-user.md)
* [Feb 18, 2026: Row timestamps for pipeline latency and event tracking (*General availability*)](2026/other/2026-02-18-row-timestamps.md)
* [Feb 18, 2026: Account Usage New SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*Preview*)](2026/other/2026-02-18-snowflake-intelligence-usage-history-view.md)
* [Feb 17, 2026: Access history improvements](2026/other/2026-02-17-access-history.md)
* [Feb 16, 2026: Sharing Streamlit in Snowflake apps (*Preview*)](2026/other/2026-02-16-sis.md)
* [Feb 13, 2026: Run Security Essentials scanners on demand](2026/other/2026-02-13-adhoc-security-essentials.md)
* [Feb 13, 2026: Snowflake Native Apps: Inter-App Communication (*Preview*)](2026/other/2026-02-13-nativeapps-iac.md)
* [Feb 12, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-12-dcr.md)
  + [Clean Rooms API Version: 13.2](2026/other/2026-02-12-dcr.md)
* [Feb 12, 2026: New checkout experience for private offers with flat-fee pricing (*General availability*)](2026/other/2026-02-12-marketplace-checkout-experience-ga.md)
* [Feb 12, 2026: Strong Authentication Hub (*Preview*)](2026/other/2026-02-12-strong-authentication-hub.md)
* [Feb 10, 2026: Snowflake Native Apps: Shareback (*General Availability*)](2026/other/2026-02-10-nativeapps-shareback.md)
* [Feb 09, 2026: Performance Explorer enhancements (*Preview*)](2026/other/2026-02-09-performance-explorer-enhancements-preview.md)
* [Feb 06, 2026: Cortex Code data science and machine learning skill (*Preview*)](2026/other/2026-02-06-cortex-code-data-science-preview.md)
* [Feb 06, 2026: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*General availability*)](2026/other/2026-02-06-tables-iceberg-query-using-external-query-engine-snowflake-horizon-ga.md)
* [Feb 06, 2026: Trust Center Overview tab (*Preview*)](2026/other/2026-02-06-trust-center-overview-tab-preview.md)
* [Feb 05, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-05-dcr.md)
  + [Clean Rooms API Version: 12.9](2026/other/2026-02-05-dcr.md)
* [Feb 05, 2026: Notebooks in Workspaces (*General Availability*)](2026/other/2026-02-05-notebooks-in-workspaces.md)
  + [Key features](2026/other/2026-02-05-notebooks-in-workspaces.md)
* [Feb 05, 2026: Sensitive data classification: Support for semi-structured data (*General availability*)](2026/other/2026-02-05-sensitive-data-classification-json.md)
* [Feb 04, 2026: Cortex Search Component Scores (*General availability*)](2026/other/2026-02-04-cortex-search-component-scores-ga.md)
* [Feb 4, 2026: Object tagging support for interactive tables](2026/other/2026-02-04-interactive-tagging.md)
* [Feb 04, 2026: Sensitive data classification: Classify a subset of native semantic categories (*Preview*)](2026/other/2026-02-04-sensitive-data-classification-subset-categories.md)
* [Feb 02, 2026: Cortex Code CLI (*General availability*)](2026/other/2026-02-02-cortex-code-cli.md)
* [Feb 02, 2026: Cortex Code in Snowsight (*Preview*)](2026/other/2026-02-02-cortex-code-snowsight.md)
  + [Key capabilities](2026/other/2026-02-02-cortex-code-snowsight.md)
* [Feb 02, 2026: Support for listing and share observability (*General availability*)](2026/other/2026-02-02-listing-observability-ga.md)
  + [New views and functions in the INFORMATION_SCHEMA schema](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.LISTINGS view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.SHARES view (for providers and consumers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.AVAILABLE_LISTINGS table function (for consumers)](2026/other/2026-02-02-listing-observability-ga.md)
  + [New and updated views in the ACCOUNT_USAGE schema](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.LISTINGS view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.SHARES view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.GRANTS_TO_SHARES view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [Updates to ACCOUNT_USAGE.ACCESS_HISTORY view](2026/other/2026-02-02-listing-observability-ga.md)
* [Feb 02, 2026: Use Snowsight to manage external volumes (*Preview*)](2026/other/2026-02-02-manage-external-volumes-by-using-snowsight.md)
* [Feb 02, 2026: Share Connected Apps in Snowflake Marketplace listings (*General availability*)](2026/other/2026-02-02-share-connected-apps-in-sfmarketplace-listings-ga.md)
* [Feb 01, 2026: New ORGANIZATION_USAGE premium views](2026/other/2026-02-01-organization-usage-new-views.md)
* [Jan 30, 2026: Support for bi-directional data access with Microsoft Fabric (*General availability*)](2026/other/2026-01-30-iceberg-microsoft-fabric-bidirectional-data-access-ga.md)
* [Jan 30, 2026: New regions](2026/other/2026-01-30-new-regions.md)
* [Jan 29, 2026: Apache DataSketches functions (*General availability*)](2026/other/2026-01-29-datasketches-functions-ga.md)
* [Jan 29, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-01-29-dcr.md)
  + [Clean Rooms API Version: 12.8](2026/other/2026-01-29-dcr.md)
* [Jan 28, 2026: Fine-tuning `arctic-extract` models (*Preview*)](2026/other/2026-01-28-fine-tuning-arctic-extract-models.md)
* [Jan 28, 2026: Private connectivity for TSS on Google Cloud (*General availability*)](2026/other/2026-01-28-tss-private-connectivity-gcp.md)
* [Jan 27, 2026: Estimate token usage with AI_COUNT_TOKENS (*General availability*)](2026/other/2026-01-27-ai-count-tokens-function-ga.md)
* [Jan 27, 2026: Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](2026/other/2026-01-27-iceberg-enforce-access-policies-on-tables-queried-from-apache-spark.md)
* [Jan 26, 2026: Extract images from documents using AI_PARSE_DOCUMENT (Preview)](2026/other/2026-01-26-ai-parse-document-images-preview.md)
* [Jan 26, 2026: Specify a dynamic task configuration with EXECUTE TASK](2026/other/2026-01-26-dynamic-task-config.md)
* [Jan 23, 2026: Malicious IP Protection updates](2026/other/2026-01-23-malicious-ip-protection.md)
* [Jan 23, 2026: Consumer-controlled maintenance policies for Snowflake Native Apps (*Preview*)](2026/other/2026-01-23-native-apps-consumer-maintenance-policies.md)
* [Jan 23, 2026: Network policies for External OAuth](2026/other/2026-01-23-network-policies-external-oauth.md)
* [Jan 23, 2026: Organization users (*General availability*)](2026/other/2026-01-23-org-users-ga.md)
* [Jan 23, 2026: Storage lifecycle policies: Expanded support](2026/other/2026-01-23-storage-lifecycle-policies-azure.md)
* [Jan 22, 2026: AI_AGG and AI_SUMMARIZE_AGG (*General availability*)](2026/other/2026-01-22-ai-agg-ai-summarize-agg-ga.md)
* [Jan 22, 2026: AI_FILTER for filtering with natural language predicates (*General availability*)](2026/other/2026-01-22-ai-filter-ga.md)
* [Jan 22, 2026: Document Processing Playground (*General availability*)](2026/other/2026-01-22-document-processing-playground.md)
* [Jan 22, 2026: European Union categories for sensitive data classification](2026/other/2026-01-22-sensitive-data-classification-eu-india.md)
* [Jan 21, 2026: Snowflake OAuth for local applications](2026/other/2026-01-21-snowflake-oauth-local-applications.md)
* [Jan 20, 2026: Shared Workspaces (*General availability*)](2026/other/2026-01-20-shared-workspaces.md)
  + [Key features](2026/other/2026-01-20-shared-workspaces.md)
* [Jan 16, 2026: External lineage (*Preview*)](2026/other/2026-01-16-external-lineage.md)
* [Jan 16, 2026: Sensitive data classification in the Trust Center (*Preview*)](2026/other/2026-01-16-trust-center-sensitive-data-classification.md)
* [Jan 15, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-01-15-dcr.md)
  + [Clean Rooms API Version: 12.5](2026/other/2026-01-15-dcr.md)
* [Jan 14, 2026: Workspaces replication (*Preview*)](2026/other/2026-01-14-workspace-replication.md)
* [Jan 12, 2026: Specifying custom instructions in semantic views](2026/other/2026-01-12-semantic-views-custom-instructions.md)
* [Jan 08, 2026: Tri-Secret Secure in China (*General availability*)](2026/other/2026-01-08-tss-available-china.md)
* [Jan 07, 2026: Reorganized UI for listings (*General availability*)](2026/other/2026-01-07-listings-ui-reorganization.md)
  + [Changes to Provider Studio](2026/other/2026-01-07-listings-ui-reorganization.md)
  + [Changes to Internal Marketplace listings](2026/other/2026-01-07-listings-ui-reorganization.md)

---
title: Feature updates in 2023
source: https://docs.snowflake.com/en/release-notes/feature-releases-2023.md
section: Release Notes
---

# Feature updates in 2023

This topic lists the feature updates that occurred in 2023.

For more recent feature updates, see [Snowflake server release notes and feature updates](new-features.md).

* [December 20, 2023 — Snowpark Container Services Release Notes](2023/other/2023-12-20.md)
* [December 15, 2023 — Cost Management Release Notes](2023/other/2023-12-15.md)
  + [Cost Management: Account Overview Page — *Preview*](2023/other/2023-12-15.md)
* [December 01, 2023 — Streamlit in Snowflake Release Notes](2023/other/2023-12-01.md)

---
title: Feature updates in 2024
source: https://docs.snowflake.com/en/release-notes/feature-releases-2024.md
section: Release Notes
---

# Feature updates in 2024

This topic lists the feature updates that occurred in 2024.

For more recent feature updates, see [Snowflake server release notes and feature updates](new-features.md).

* [Dec 20, 2024: Support for Streamlit 1.39.0 (Preview)](2024/other/2024-12-20-sis.md)
* [December 19, 2024 — Snowflake Native Apps with Azure Private Link support — *General Availability*](2024/other/2024-12-19-na-az-gov-ga.md)
* [December 19, 2024 — Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link — *Preview*](2024/other/2024-12-19-notebooks-wh-aws-azure-pl.md)
* [December 19, 2024 — New homepage for Snowsight —– *General Availability*](2024/other/2024-12-19-snowsight-homepage-ga.md)
* [December 18, 2024 — Inbound private connectivity to Snowpark Container Services for accounts on AWS — *Preview*](2024/other/2024-12-18-spcs-aws-inbound-private-connectivity.md)
* [Dec 16, 2024: Azure Private Link in Streamlit in Snowflake (General Availability)](2024/other/2024-12-16-sis.md)
* [December 12, 2024 — Document AI release notes](2024/other/2024-12-12-document-ai.md)
* [December 09, 2024 — Organizational listings: Discovery and access — *Preview*](2024/other/2024-12-09-dbna.md)
* [December 09, 2024 — Snowflake Native Apps with Azure Private Link support —– *Preview*](2024/other/2024-12-09-na-az-privatelink.md)
* [December 09, 2024 — Using block storage with Snowpark Container Services job services — *Preview*](2024/other/2024-12-09-spcs-block-storage-for-jobs-in-preview.md)
* [December 05, 2024 — Snowflake Cortex Powered Descriptions — *General Availability*](2024/other/2024-12-05-cortex-descriptions.md)
* [December 05, 2024 — Private Notebooks in a Personal Database — *Deprecated*](2024/other/2024-12-05-personal-db-private-nb.md)
* [Dec 4, 2024: Azure Private Link in Streamlit in Snowflake (Preview)](2024/other/2024-12-04-sis.md)
* [December 03, 2024 — Snowflake Native Apps in Azure Government regions — *Preview*](2024/other/2024-12-03-na-ga-gov-azure.md)
* [November 27, 2024 — Snowflake Native Apps: Multiple app installs — *General Availability*](2024/other/2024-11-27-na-mult-install.md)
* [November 25, 2024 — Snowflake Cortex AI TRANSLATE — Updates](2024/other/2024-11-25-cortex-translate-update.md)
* [November 25, 2024 — Data governance release notes](2024/other/2024-11-26-data-governance.md)
  + [Governance for organization listings through access history](2024/other/2024-11-26-data-governance.md)
* [November 21, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-11-21-dcr.md)
  + [Non-overlap metrics](2024/other/2024-11-21-dcr.md)
  + [Unlink datasets API](2024/other/2024-11-21-dcr.md)
  + [Dynamic table support](2024/other/2024-11-21-dcr.md)
  + [Custom Python code in consumer templates](2024/other/2024-11-21-dcr.md)
  + [Merkury Identity connector](2024/other/2024-11-21-dcr.md)
  + [Google Display & Video 360 - Customer Match activation connector](2024/other/2024-11-21-dcr.md)
* [November 21, 2024 — Logical replication of clones — *General Availability*](2024/other/2024-11-21-logical-repl-clones.md)
* [November 20, 2024 — Snowsight rate limits — *General Availability*](2024/other/2024-11-20-snowsight-rate-limits.md)
* [November 18, 2024 — S3-compatible storage for externally managed Apache Iceberg™ tables — *General Availability*](2024/other/2024-11-18-s3-compatible-externally-managed-iceberg-ga.md)
* [November 18, 2024 — Sensitive data classification](2024/other/2024-11-18-sensitive-data-classification.md)
  + [Automatic Sensitive Data Classification — *Preview*](2024/other/2024-11-18-sensitive-data-classification.md)
  + [Classifier improvements](2024/other/2024-11-18-sensitive-data-classification.md)
* [November 15, 2024 — Apache Iceberg™ tables: Efficient bulk loading, continuous ingestion, and data streaming — *General Availability*](2024/other/2024-11-15-iceberg-tables-loading.md)
  + [COPY INTO <table> and Snowpipe continuous file ingestion](2024/other/2024-11-15-iceberg-tables-loading.md)
  + [Snowpipe Streaming](2024/other/2024-11-15-iceberg-tables-loading.md)
* [November 14, 2024 — Cortex Analyst](2024/other/2024-11-14-cortex-analyst.md)
  + [Multi-turn conversation in Cortex Analyst — *Preview*](2024/other/2024-11-14-cortex-analyst.md)
  + [Joins support in Cortex Analyst — *Preview*](2024/other/2024-11-14-cortex-analyst.md)
* [November 14, 2024 — Manage account preview features — *General Availability*](2024/other/2024-11-14-manage-preview.md)
* [November 13, 2024 — Hybrid tables support extended to additional AWS regions](2024/other/2024-11-13-hybrid-tables-ga-regions.md)
* [November 12, 2024 — Budgets: Support for cloud provider queue and webhook notifications](2024/other/2024-11-12-budget-notification-queue-webhook.md)
* [November 12, 2024 — Additional CREATE OR ALTER commands — *Preview*](2024/other/2024-11-12-create-or-alter-pupr.md)
* [November 12, 2024 — Dynamic tables: Support for reading from Snowflake-managed Iceberg tables and creating dynamic Apache Iceberg™ tables –— *General Availability*](2024/other/2024-11-12-dynamic-iceberg-tables.md)
* [November 12, 2024 — Classification (Snowflake ML Function) — *General Availability*](2024/other/2024-11-12-ml-functions-classification-ga.md)
* [November 12, 2024 — Organizational listings and the Internal Marketplace — *General Availability*](2024/other/2024-11-12-organizational-listings.md)
* [November 12, 2024 — Snowflake ML: Distributed Hyperparameter Optimization on Snowpark Container Services — *Preview*](2024/other/2024-11-12-snowflake-ml-hpo-spcs.md)
* [Nov 11, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-11-11-dcr.md)
  + [All developer API clean rooms now available in the web app](2024/other/2024-11-11-dcr.md)
  + [Provider run for custom web app templates](2024/other/2024-11-11-dcr.md)
  + [Provider and consumer activation in custom web app templates](2024/other/2024-11-11-dcr.md)
  + [SQL policy configuration updates](2024/other/2024-11-11-dcr.md)
  + [Sync and naming support for data connectors](2024/other/2024-11-11-dcr.md)
* [November 11, 2024 — Snowflake Native App Framework release notes](2024/other/2024-11-11-native-apps.md)
  + [Snowflake Native Apps with Snowpark Container Services in AWS — *General availability*](2024/other/2024-11-11-native-apps.md)
  + [Snowflake Native Apps with Snowpark Container Services in Azure — *Preview*](2024/other/2024-11-11-native-apps.md)
  + [Native App Framework support for Budgets](2024/other/2024-11-11-native-apps.md)
* [November 11, 2024 — Snowflake Notebooks Warehouse Runtime — *General Availability*](2024/other/2024-11-11-notebooks-wh-ga.md)
  + [Updates](2024/other/2024-11-11-notebooks-wh-ga.md)
* [November 08, 2024 — Grouped Query History — *Preview*](2024/other/2024-11-08-grouped-query-history.md)
* [November 08, 2024 — Snowflake Microsoft Sharepoint connector](2024/other/2024-11-08.md)
  + [Snowflake Connector for SharePoint](2024/other/2024-11-08.md)
* [November 06, 2024 — SPLIT_TEXT_RECURSIVE_CHARACTER Cortex function — *Preview*](2024/other/2024-11-06-split-text-recursive-character.md)
* [November 04, 2024 — Classify Text (Snowflake Cortex LLM Function) — *General Availability*](2024/other/2024-11-04-classify-text-ga.md)
* [November 04, 2024 — Data Lineage — *Preview*](2024/other/2024-11-04-data-lineage.md)
* [November 04, 2024 — Replication error notifications — *General Availability*](2024/other/2024-11-04-error-notifications.md)
* [November 04, 2024 — Snowflake Native Apps support for AWS Private Link — *General Availability*](2024/other/2024-11-04-na-aws-pl-ga.md)
* [November 04, 2024 — Top Insights (Snowflake ML Function) — *General Availability*](2024/other/2024-11-04-top-insights-ga.md)
* [Oct 31, 2024: Custom themes in Streamlit in Snowflake (Preview)](2024/other/2024-10-31-sis-custom-themes.md)
* [Oct 31, 2024: AWS PrivateLink in Streamlit in Snowflake (General Availability)](2024/other/2024-10-31-sis-privatelink.md)
* [October 30, 2024 — Hybrid tables — *General Availability*](2024/other/2024-10-30-hybrid-tables-ga.md)
* [October 29, 2024 — Universal Search in Virtual Private Snowflake (VPS)](2024/other/2024-10-29-snowsight-vps.md)
* [October 21, 2024 — Document AI — *General Availability*](2024/other/2024-10-21-document-ai.md)
* [October 18, 2024 —Apache Iceberg™ tables: Support for Snowflake Open Catalog — *General Availability*](2024/other/2024-10-18-snowflake-open-catalog-ga.md)
* [October 14, 2024 — Cortex Analyst: New regions](2024/other/2024-10-14-new-regions-cortex-analyst.md)
* [October 14, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Clean room overlap stats](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Provider-initiated activation for third-party connectors](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Security scans for custom templates](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
* [October 10, 2024 — CORTEX_FINE_TUNING_USAGE_HISTORY view — *General Availability*](2024/other/2024-10-10-cortex-finetuning-usage-history.md)
* [October 10, 2024 — CORTEX_SEARCH_SERVING_USAGE_HISTORY view — *General Availability*](2024/other/2024-10-10-cortex-search-serving-usage-history.md)
* [October 08, 2024 — Native App support for AWS PrivateLink — *Preview*](2024/other/2024-10-08-na-aws-pl.md)
* [October 07, 2024 — Updated event sharing for Snowflake Native Apps — *General Availability*](2024/other/2024-10-07-na-event-sharing.md)
* [Oct 07, 2024: AWS PrivateLink in Streamlit in Snowflake (Preview)](2024/other/2024-10-07-sis.md)
* [October 04, 2024 — Cortex Analyst integration with Cortex Search — *Preview*](2024/other/2024-10-04-cortex-analyst-search-integration.md)
* [October 04, 2024 — Suggested Questions for Cortex Analyst — *Preview*](2024/other/2024-10-04-cortex-analyst-suggested-questions.md)
* [October 04, 2024 — Cortex Search — *General Availability*](2024/other/2024-10-04-cortex-search-ga.md)
* [October 04, 2024 — Differential Privacy — *General Availability*](2024/other/2024-10-04-differential-privacy.md)
* [October 03, 2024 — New Cortex LLM Function - PARSE_DOCUMENT — *Preview*](2024/other/2024-10-03-parse-document.md)
* [October 02, 2024 — Organization accounts — *Preview*](2024/other/2024-10-01-organization-account.md)
* [October 02, 2024 — Notebooks on Container Runtime — *Preview*](2024/other/2024-10-02-notebooks-on-spcs.md)
* [October 01, 2024 — Cortex Fine-tuning Sharing — *Preview*](2024/other/2024-10-01-cortex-finetuning-sharing.md)
* [September 26, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-09-26-dcr.md)
  + [Branded clean room tiles](2024/other/2024-09-26-dcr.md)
  + [Consumer direct activation](2024/other/2024-09-26-dcr.md)
  + [Activation Hub column policies](2024/other/2024-09-26-dcr.md)
  + [Schedule analyses as a consumer](2024/other/2024-09-26-dcr.md)
  + [Clean room data stats](2024/other/2024-09-26-dcr.md)
  + [LiveRamp activation](2024/other/2024-09-26-dcr.md)
  + [The Trade Desk CRM activation](2024/other/2024-09-26-dcr.md)
  + [Managed account credit limit and monitoring](2024/other/2024-09-26-dcr.md)
  + [Audience Overlap, SQL Query and Custom template update](2024/other/2024-09-26-dcr.md)
* [September 26, 2024 — Snowpark-optimized Warehouse RESOURCE_CONSTRAINT — *Preview*](2024/other/2024-09-26-sow-resource-constraints.md)
* [September 25, 2024 — Snowflake Feature Store — *General Availability*](2024/other/2024-09-25-feature-store-ga.md)
* [September 25, 2024 — New models available in Snowflake Cortex AI](2024/other/2024-09-25-new-cortex-models.md)
* [September 24, 2024 — DOCUMENT_AI_USAGE_HISTORY view — *General Availability*](2024/other/2024-09-24-document-ai.md)
* [September 12, 2024 — New Cortex LLM Function - CLASSIFY_TEXT — *Preview*](2024/other/2024-09-12-classify-text-function.md)
* [September 12, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-09-12-dcr.md)
  + [Integration with Yahoo DSP](2024/other/2024-09-12-dcr.md)
  + [Integration with Google PAIR and Google DV 360](2024/other/2024-09-12-dcr.md)
* [September 12, 2024 — New multilingual embedding model available in Snowflake Cortex AI](2024/other/2024-09-12-voyage-embed-model.md)
* [September 09, 2024 — New AI21 model available in Snowflake Cortex AI](2024/other/2024-09-09-jamba-mini-model.md)
* [September 04, 2024 — Easier Training of Anomaly Detection Models from Real-World Data](2024/other/2024-09-04-anomaly-detection-preprocessing.md)
* [September 04, 2024 — Calling stored procedures in the FROM clause of SELECT statements](2024/other/2024-09-04-call-stored-procedure-in-from-clause.md)
* [September 01, 2024 — New Snowflake region](2024/other/2024-09-01-new-region.md)
  + [China (Ningxia) region - *General Availability*](2024/other/2024-09-01-new-region.md)
* [August 30, 2024 — Query attribution costs](2024/other/2024-08-30-per-query-cost-attribution.md)
  + [Account Usage: New QUERY_ATTRIBUTION_HISTORY view](2024/other/2024-08-30-per-query-cost-attribution.md)
* [August 29, 2024 — Cortex Analyst: New regions](2024/other/2024-08-29-cortex-analyst-new-regions.md)
* [August 29, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-08-29-dcr.md)
  + [RSA authentication for the service account user](2024/other/2024-08-29-dcr.md)
  + [Activation for provider-run analyses](2024/other/2024-08-29-dcr.md)
* [August 29, 2024 — New Mistral Large 2 model available in Snowflake Cortex AI](2024/other/2024-08-29-mistral-large2.md)
* [August 29, 2024 — New multilingual embedding models available in Snowflake Cortex AI](2024/other/2024-08-29-multilingual-embed-models.md)
* [August 28, 2024 — Snowflake ML Functions: Top Insights Preview Update](2024/other/2024-08-28-top-insights-preview-refresh.md)
* [August 26, 2024 — Easier Training of Forecasting Models from Real-World Data](2024/other/2024-08-26-forecasting-preprocessing.md)
* [August 26, 2024 — Time Series ML Functions — Error Message Improvements](2024/other/2024-08-26-time-series-error-message.md)
* [August 20, 2024 — Differential Privacy — *Preview*](2024/other/2024-08-16-diff-privacy.md)
* [August 20, 2024 — Cortex LLM Functions — Release Notes](2024/other/2024-08-20-new-region-llama-405b.md)
* [August 16, 2024 — Snowflake Native App Framework: Support for government regions on AWS](2024/other/2024-08-16-na-gov-cloud.md)
* [August 14, 2024 — Cortex Analyst –— *Preview*](2024/other/2024-08-14-cortex-analyst.md)
* [August 09, 2024 — Streamlit in Snowflake on AWS GovCloud –— *General Availability*](2024/other/2024-08-09-sis.md)
* [August 08, 2024 — Cross-region inference for Snowflake AI & ML features — *General Availability*](2024/other/2024-08-08-cross-region-llm.md)
* [August 08, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-08-08-dcr.md)
  + [Support for external tables and Apache Iceberg™ tables](2024/other/2024-08-08-dcr.md)
  + [Integration with TransUnion TruAudience Identity](2024/other/2024-08-08-dcr.md)
* [August 08, 2024 — RANGE BETWEEN window frames with explicit offsets — *General Availability*](2024/other/2024-08-08-range-between-ga.md)
* [August 06, 2024: Document AI release notes](2024/other/2024-08-06-document-ai.md)
* [August 06, 2024 — Snowflake Native App Framework: Support for VPS on AWS](2024/other/2024-08-06-na-vps-aws.md)
* [August 02, 2024 — ML Functions: Improved Error Messages in Classification](2024/other/2024-08-02-classification-errors.md)
* [August 02, 2024 — Snowflake Native App Framework release notes](2024/other/2024-08-02-na-spcs-laf.md)
* [August 02, 2024 — Custom UI in Streamlit in Snowflake –— *General Availability*](2024/other/2024-08-02-sis.md)
* [August 01, 2024 — Support for Streamlit 1.35.0 in Streamlit in Snowflake](2024/other/2024-08-01-sis.md)
* [August 01, 2024 — Snowpark Container Services release notes](2024/other/2024-08-01-spcs.md)
* [July 31, 2024 — Context functions and row access policies in Streamlit in Snowflake –— *General Availability*](2024/other/2024-07-31-sis.md)
* [July 31, 2024 — Snowflake VS Code Extension Release Notes](2024/other/2024-07-31.md)
  + [Edit Snowflake `connections.toml` files](2024/other/2024-07-31.md)
  + [Work with the Snowflake Native App Framework](2024/other/2024-07-31.md)
* [July 25, 2024 — Cortex Search — *Preview*](2024/other/2024-07-25-cortex-search-preview.md)
* [July 25, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-07-25-dcr.md)
  + [Acxiom Real ID integration](2024/other/2024-07-25-dcr.md)
  + [Using developer APIs for provider activation](2024/other/2024-07-25-dcr.md)
  + [User interface for custom templates enhancement](2024/other/2024-07-25-dcr.md)
  + [SQL Query template enhancement](2024/other/2024-07-25-dcr.md)
* [July 25, 2024 — New AI21 model available in Snowflake Cortex AI](2024/other/2024-07-25-new-llm-model-jamba.md)
* [July 24, 2024 — Cortex Guard for Snowflake Cortex AI — *General Availability*](2024/other/2024-07-24-cortex-llm-updates.md)
* [July 24, 2024 — Document AI release notes](2024/other/2024-07-24-document-ai.md)
* [July 23, 2024 — New Meta AI models available in Snowflake Cortex AI](2024/other/2024-07-23-new-llm-models.md)
* [July 23, 2024 — Managing Listings using SQL — **Generally Available**](2024/other/2024-07-23-pl.md)
* [July 19, 2024 — CORTEX_FUNCTIONS_USAGE_HISTORY view — *General Availability*](2024/other/2024-07-19-cortex-functions-usage-history.md)
* [July 18, 2024 — Snowflake Native App Framework - Support for shared external table and Apache Iceberg™ tables — *Preview*](2024/other/2024-07-18-native-app-external-table.md)
* [July 15, 2024 — Snowflake Copilot — *Generally available*](2024/other/2024-07-15-snowflake-copilot-ga.md)
* [July 11, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-07-11-dcr.md)
  + [Sequenced template execution](2024/other/2024-07-11-dcr.md)
  + [Multi-factor authentication](2024/other/2024-07-11-dcr.md)
  + [Register objects in a managed access schema](2024/other/2024-07-11-dcr.md)
  + [Support for additional region](2024/other/2024-07-11-dcr.md)
  + [Single-party SQL query](2024/other/2024-07-11-dcr.md)
* [July 11, 2024 — Snowflake connectors](2024/other/2024-07-11.md)
  + [Snowflake Connector for PostgreSQL](2024/other/2024-07-11.md)
  + [Snowflake Connector for MySQL](2024/other/2024-07-11.md)
* [July 03, 2024 — Data pipelines: Support for Apache Iceberg™ tables with dynamic tables and streams –— *Preview*](2024/other/2024-07-03-dynamic-iceberg-tables.md)
* [July 03, 2024 — External network access in Streamlit in Snowflake –— *General Availability*](2024/other/2024-07-03-sis.md)
* [June 28, 2024 — New geospatial H3 functions — *General Availability*](2024/other/2024-06-28-geospatial-h3-functions-ga.md)
* [June 28, 2024 — Custom UI in Streamlit in Snowflake –— *Preview*](2024/other/2024-06-28-sis.md)
* [June 27, 2024 — Document AI release notes](2024/other/2024-06-27-document-ai.md)
* [June 26, 2024 — Cost Management Release Notes](2024/other/2024-06-26-cost.md)
  + [Organization Overview Page —– *General Availability*](2024/other/2024-06-26-cost.md)
* [June 25, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-06-25-dcr.md)
  + [Provider-run analyses](2024/other/2024-06-25-dcr.md)
  + [Consumer-defined templates](2024/other/2024-06-25-dcr.md)
  + [Granular access controls for tables and templates](2024/other/2024-06-25-dcr.md)
  + [Activating results across regions](2024/other/2024-06-25-dcr.md)
  + [SQL Template enhancement](2024/other/2024-06-25-dcr.md)
* [June 25, 2024 — New TO_QUERY table function](2024/other/2024-06-25-to-query-function.md)
* [June 24, 2024: Time Travel for hybrid tables –— *Preview*](2024/other/2024-06-24-time-travel-hybrid-tables.md)
* [June 21, 2024: Document AI release notes](2024/other/2024-06-21-document-ai.md)
* [June 17, 2024 — New LLM helper functions - TRY_COMPLETE and COUNT_TOKENS](2024/other/2024-06-17-new-llm-functions.md)
  + [New SQL function](2024/other/2024-06-17-new-llm-functions.md)
* [June 15, 2024 — Anomaly Detection](2024/other/2024-06-15-anomaly-detection.md)
* [June 11, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-06-11-dcr.md)
  + [Additional supported regions — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Granular access management for Snowflake data — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Choosing a warehouse when running an analysis — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Support for multiple custom templates in web app — *General Availability*](2024/other/2024-06-11-dcr.md)
* [Jun 11, 2024 — Sharing data in non-secure views –— *Preview*](2024/other/2024-06-11-sharing-non-secure-views.md)
* [June 10, 2024 — Apache Iceberg™ tables — *General Availability*](2024/other/2024-06-10-iceberg-tables.md)
* [June 05, 2024 — New geospatial functions in preview](2024/other/2024-06-05.md)
  + [New geospatial functions available –— *Preview*](2024/other/2024-06-05.md)
* [June 03, 2024 — New EMBED_TEXT_1024 function for 1024 dimensional output vectors](2024/other/2024-06-03-embed-text-1024.md)
  + [New SQL function](2024/other/2024-06-03-embed-text-1024.md)
* [June 3, 2024 — Entity-Level Privacy Release Notes](2024/other/2024-06-03-entity-level.md)
  + [Aggregation policies with entity-level privacy — *General Availability*](2024/other/2024-06-03-entity-level.md)
* [May 31, 2024 — Snowflake ML Classification Update –— *Preview*](2024/other/2024-05-31-classification-update.md)
* [May 31, 2024 — Structured data types — *General Availability*](2024/other/2024-05-31-structured-types-ga.md)
* [May 28, 2024 — ML Functions Release Notes](2024/other/2024-05-28-call-method-in-from-clause.md)
  + [Simpler SQL for storing results from ML functions](2024/other/2024-05-28-call-method-in-from-clause.md)
* [May 28, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-05-28-dcr.md)
  + [Multi-provider clean rooms via Developer APIs — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Additional supported regions — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Support for views in the web app — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Clean room customizations for identity & activation — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Custom template enhancements — *General Availability*](2024/other/2024-05-28-dcr.md)
* [May 22, 2024 — SQL Release Notes](2024/other/2024-05-22-table-references.md)
  + [Using the TABLE keyword as an alternative to SYSTEM$REFERENCE and SYSTEM$QUERY_REFERENCE](2024/other/2024-05-22-table-references.md)
* [May 20, 2024 — Cost Management Release Notes](2024/other/2024-05-20-cost.md)
  + [Cost Insights — *General Availability*](2024/other/2024-05-20-cost.md)
* [May 17, 2024 — Document AI Release Notes](2024/other/2024-05-17-document-ai.md)
  + [Document AI —– *Preview*](2024/other/2024-05-17-document-ai.md)
* [May 16, 2024 — Vector data type and vector similarity functions — *General Availability*](2024/other/2024-05-16-vector-data-type-ga.md)
  + [New SQL data type](2024/other/2024-05-16-vector-data-type-ga.md)
  + [New SQL functions](2024/other/2024-05-16-vector-data-type-ga.md)
* [May 14, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-05-14-dcr.md)
  + [Tracing user activity in the web app — *General Availability*](2024/other/2024-05-14-dcr.md)
* [May 14, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-05-14-sis.md)
* [May 13, 2024 — ASOF JOIN Release Notes](2024/other/2024-05-13-asof-join.md)
  + [ASOF JOIN — *General Availability*](2024/other/2024-05-13-asof-join.md)
* [May 08, 2024 — New model for vector embedding — *Preview*](2024/other/2024-05-08-embed-text-model.md)
* [May 08, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-05-08-sis.md)
  + [Streamlit in Snowflake: Custom sleep timer —– *Preview*](2024/other/2024-05-08-sis.md)
* [May 08, 2024 — Snowflake Notifications Release Notes](2024/other/2024-05-08.md)
  + [New SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure for sending notifications](2024/other/2024-05-08.md)
* [May 07, 2024 — Cortex LLM Functions — *General Availability*](2024/other/2024-05-07-llm-functions-ga.md)
* [May 06, 2024 — Vector data type and vector similarity functions — *Preview*](2024/other/2024-05-06-vector-data-type.md)
  + [New SQL data type](2024/other/2024-05-06-vector-data-type.md)
  + [New SQL functions](2024/other/2024-05-06-vector-data-type.md)
* [May 03, 2024 — Aggregation and Projection Policies Release Notes](2024/other/2024-05-03-policies.md)
  + [Aggregation Policies — *General Availability*](2024/other/2024-05-03-policies.md)
  + [Projection Policies — *General Availability*](2024/other/2024-05-03-policies.md)
* [May 03, 2024 — Snowflake Model Registry – General Availability](2024/other/2024-05-03-snowflake-model-registry.md)
* [May 02, 2024 — Cost Management Release Notes](2024/other/2024-05-02-cost.md)
  + [Organization Overview Page —– *Preview*](2024/other/2024-05-02-cost.md)
* [May 02, 2024 — Snowsight Release Notes](2024/other/2024-05-02-snowsight-dd-preview.md)
  + [Data Dictionary with masked PII –— *Preview*](2024/other/2024-05-02-snowsight-dd-preview.md)
* [April 30, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-04-30-dcr.md)
* [April 30, 2024 — Snowflake Google connectors](2024/other/2024-04-30-gaad-gard-ga.md)
  + [Snowflake Connector for Google Analytics Raw Data](2024/other/2024-04-30-gaad-gard-ga.md)
  + [Snowflake Connector for Google Analytics Aggregate Data](2024/other/2024-04-30-gaad-gard-ga.md)
* [April 29, 2024 — Dynamic Tables — *General Availability*](2024/other/2024-04-29-dynamic-tables.md)
* [April 24, 2024 — New FAILOVER privilege for Client Redirect](2024/other/2024-04-24-failover-privilege.md)
* [April 24, 2024 — Managing Listings using SQL](2024/other/2024-04-24-pl.md)
* [April 23, 2024 — Snowflake Connector for ServiceNow® V2 — *General Availability*](2024/other/2024-04-23-svnc.md)
* [April 22, 2024 — Snowpark Container Services release notes](2024/other/2024-04-22.md)
* [April 17, 2024 — Snowpark Container Services Release Notes](2024/other/2024-04-17.md)
* [April 12, 2024 — Cost Management Release Notes](2024/other/2024-04-12-cost.md)
  + [Account Overview Page — *General Availability*](2024/other/2024-04-12-cost.md)
* [April 12, 2024 — Snowflake Cortex LLM Release Notes](2024/other/2024-04-12-snowflake-cortex-llm-update.md)
* [April 11, 2024 — Budgets Release Notes — *General Availability*](2024/other/2024-04-11-budgets.md)
* [April 11-25, 2024 — Snowflake Copilot — *Preview*](2024/other/2024-04-11-snowflake-copilot-in-snowsight.md)
* [April 09, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-04-09-dcr.md)
* [March 29, 2024 — Data Quality Monitoring Release Notes](2024/other/2024-03-29-dmf.md)
  + [Data Quality Monitoring and data metric functions — *Preview*](2024/other/2024-03-29-dmf.md)
* [March 28, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-03-28-snowflake-data-clean-rooms.md)
* [March 18-20, 2024 — Limit functionality of your Snowflake Native App —– *Preview*](2024/other/2024-03-18-limit-app-functionality.md)
* [March 15, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-03-15.md)
  + [Streamlit in Snowflake: Support for Streamlit 1.26.0 —– *Preview*](2024/other/2024-03-15.md)
* [March 13, 2024 — Hybrid Tables Release Notes](2024/other/2024-03-13-hybrid-tables.md)
* [March 12, 2024 — Snowflake Cortex Classification Release Notes –— *Preview*](2024/other/2024-03-12-snowflake-cortex-classification.md)
* [March 08, 2024 — Geospatial Functions Release Notes](2024/other/2024-03-08.md)
  + [New Geospatial Functions Available](2024/other/2024-03-08.md)
* [March 05, 2024 — Hybrid Tables Release Notes](2024/other/2024-03-05-hybrid-tables.md)
* [March 05, 2024 — Snowflake Cortex LLM Functions Release Notes –— *Preview*](2024/other/2024-03-05-snowflake-cortex-llm-functions.md)
* [February 28, 2024 — ASOF JOIN Release Notes](2024/other/2024-02-28.md)
  + [ASOF JOIN –— *Preview*](2024/other/2024-02-28.md)
* [February 26, 2024 — Snowpark Container Services Release Notes](2024/other/2024-02-26.md)
* [February 22, 2024 — Snowflake Extension for Visual Studio Code Release Notes](2024/other/2024-02-22.md)
  + [Visual Studio Code extension for Snowpark Python — *Preview*](2024/other/2024-02-22.md)
* [February 21, 2024 — Data sharing & collaboration for accounts in U.S. government regions](2024/other/2024-02-21.md)
* [February 20, 2024 — Hybrid Tables Release Notes](2024/other/2024-02-20-hybrid-tables.md)
* [February 20 - March 5, 2024 — Universal Search in Snowsight –— *Preview*](2024/other/2024-02-20.md)
* [February 15, 2024 — Aggregation and Projection Policies Release Notes](2024/other/2024-02-15-policies.md)
  + [Aggregation Policies — *Preview*](2024/other/2024-02-15-policies.md)
  + [Projection Policies — *Preview*](2024/other/2024-02-15-policies.md)
* [February 15, 2024 — Geospatial Functions Release Notes](2024/other/2024-02-15.md)
  + [H3 Functions for GEOGRAPHY Objects — *General Availability*](2024/other/2024-02-15.md)
* [February 12-14, 2024 — New navigation for Snowsight —– *Preview*](2024/other/2024-02-12.md)
* [January 31, 2024 — Snowflake Native Apps Framework Release Notes](2024/other/2024-01-31.md)
* [January 29, 2024 — Snowflake Google connectors](2024/other/2024-01-29.md)
  + [Snowflake Connector for Google Analytics Raw Data](2024/other/2024-01-29.md)
  + [Snowflake Connector for Google Analytics Aggregate Data](2024/other/2024-01-29.md)
* [January 25, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-01-25.md)
* [January 18, 2024 — Snowflake Native Apps Framework Release Notes](2024/other/2024-01-18.md)

---
title: Feature updates in 2025
source: https://docs.snowflake.com/en/release-notes/feature-releases-2025.md
section: Release Notes
---

# Feature updates in 2025

This topic lists the feature updates that occurred in 2025.

For more recent feature updates, see [Snowflake server release notes and feature updates](new-features.md).

* [Dec 18, 2025: Network rules and policies support Google Cloud Private Service Connect IDs (*General availability*)](2025/other/2025-12-18-gcp-pscid-network-rules-and-policies.md)
* [Dec 17, 2025 — Snowflake High Performance connector for Kafka (*Preview*)](2025/other/2025-12-17-kafkahp-pupr.md)
* [Dec 17, 2025: Schema evolution support for Snowpipe Streaming with high-performance architecture](2025/other/2025-12-17-schema-evolution-snowpipe-streaming.md)
* [Dec 17, 2025: Snowflake Postgres (*Preview*)](2025/other/2025-12-17-snowflake-postgres.md)
* [Dec 16, 2025: Cortex Search multi-indexing and custom vector embedding (*Preview*)](2025/other/2025-12-16-cortex-search-multi-index-preview.md)
* [Dec 16, 2025: Notebooks in Workspaces (*Preview*)](2025/other/2025-12-16-notebooks-in-workspaces.md)
  + [Key features](2025/other/2025-12-16-notebooks-in-workspaces.md)
* [Dec 15, 2025: Account Usage: New CATALOG_LINKED_DATABASE_USAGE_HISTORY view](2025/other/2025-12-15-catalog-linked-db-usage-history.md)
* [Dec 15, 2025: Vector aggregate functions](2025/other/2025-12-15-vector-aggregate-functions.md)
* [Dec 12, 2025: Private connectivity for internal stages on Google Cloud (*General availability*)](2025/other/2025-12-12-gcp-pl-internal-stages.md)
* [Dec 11, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-12-11-dcr.md)
  + [Clean Rooms API Version: 12.3](2025/other/2025-12-11-dcr.md)
* [Dec 11, 2025: Default pipe for Snowpipe Streaming with high-performance architecture](2025/other/2025-12-11-default-pipe.md)
* [Dec 11, 2025: Interactive tables and interactive warehouses (*General availability*)](2025/other/2025-12-11-interactive-tables-ga.md)
* [Dec 11, 2025: Support for Streamlit in Snowflake container runtime (Preview)](2025/other/2025-12-11-sis.md)
* [Dec 10, 2025: Cost anomalies (*General availability*)](2025/other/2025-12-02-cost-anomalies-ga.md)
* [Dec 10, 2025: General availability of WORM backups](2025/other/2025-12-10-worm-backups.md)
  + [Terminology change](2025/other/2025-12-10-worm-backups.md)
* [Dec 08, 2025: AI_REDACT for automated redaction of PII (*General availability*)](2025/other/2025-12-08-ai-redact-ga.md)
* [Dec 08, 2025: Dynamic tables: Support for dual warehouses](2025/other/2025-12-08-dynamic-tables-dual-warehouses.md)
* [Dec 08, 2025: Snowpipe simplified pricing](2025/other/2025-12-08-snowpipe-simplified-pricing.md)
* [Dec 04, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-12-04-dcr.md)
  + [Clean Rooms API Version: 12.2](2025/other/2025-12-04-dcr.md)
* [Dec 03, 2025: Access history improvements](2025/other/2025-12-03-access-history.md)
* [Dec 02, 2025: Optimize existing semantic views or models with verified queries (*Preview*)](2025/other/2025-12-02-cortex-analyst-optimization.md)
* [Dec 02, 2025: Private connectivity for Apache Iceberg™ REST catalog integrations (*General availability*)](2025/other/2025-12-02-iceberg-rest-catalog-private-connectivity.md)
* [Dec 02, 2025: Auto-fulfillment for listings that span databases (*General availability*)](2025/other/2025-12-02-laf-listings-span-databases.md)
* [Nov 21, 2025: AI_COMPLETE function (*General availability*)](2025/other/2025-11-21-ai-complete-ga.md)
* [Nov 21, 2025: Import models from Hugging Face to Snowflake (*Preview*)](2025/other/2025-11-21-hugging-face-model-import-preview.md)
* [Nov 21, 2025: Tri-Secret Secure data protection for Snowpark Container Services block volumes (General availability)](2025/other/2025-11-21-spcs-tri-secret-secure.md)
* [Nov 21, 2025: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*Preview*)](2025/other/2025-11-21-tables-iceberg-query-using-external-query-engine-snowflake-horizon-preview.md)
* [Nov 21, 2025: Trust Center notifications in Snowsight (*General availability*)](2025/other/2025-11-21-trust-center-in-app-notifications.md)
* [Nov 20, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-20-dcr.md)
  + [Clean Rooms API Version: 11.9](2025/other/2025-11-20-dcr.md)
* [Nov 20, 2025: New versions of Streamlit supported in Streamlit in Snowflake (General availability)](2025/other/2025-11-20-sis.md)
* [Nov 20, 2025: SnowConvert AI interface improvements](2025/other/2025-11-20-snowconvert-ai-interface-improvements.md)
* [Nov 18, 2025: Apache Iceberg™ tables: Support for bi-directional data access with Microsoft Fabric (*Preview*)](2025/other/2025-11-18-iceberg-microsoft-fabric-bidirectional-data-access.md)
* [Nov 17, 2025: Access control enhancements for cost anomalies](2025/other/2025-11-17-cost-anomaly.md)
* [Nov 17, 2025: Document Processing Playground (*Preview*)](2025/other/2025-11-17-document-processing-playground.md)
* [Nov 17, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*General availability*)](2025/other/2025-11-17-native-apps-spcs-aws-gov-fedram-ga.md)
* [Nov 14, 2025: Cortex Analyst Routing Mode (*Preview*)](2025/other/2025-11-14-cortex-analyst-routing-mode.md)
* [Nov 13, 2025: Excluding objects from sensitive data classification (*General availability*)](2025/other/2025-11-13-data-classification.md)
* [Nov 13, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-13-dcr.md)
  + [Clean Rooms API Version: 11.8](2025/other/2025-11-13-dcr.md)
* [Nov 13, 2025: Improved stage volume implementation in Snowpark Container Services (*General availability*)](2025/other/2025-11-13-spcs-improved-storage-mount-ga.md)
* [Nov 10, 2025: Snowpipe Streaming with high-performance architecture on Google Cloud Platform (GCP) (*General availability*)](2025/other/2025-11-10-snowpipe-streaming-gcp-ga.md)
* [Nov 07, 2025: AI_REDACT function (*Preview*)](2025/other/2025-11-07-aisql-redact-pii.md)
* [Nov 07, 2025: Pricing plans and offers (*General availability*)](2025/other/2025-11-07-pricing-plans-offers.md)
* [Nov 07, 2025: Storage lifecycle policies (*General availability*)](2025/other/2025-11-07-storage-lifecycle-policies-ga.md)
* [Nov 07, 2025: Trust Center extensions (*Preview*)](2025/other/2025-11-07-trust-center-extensions.md)
* [Dec 01, 2025: CORTEX_AISQL_USAGE_HISTORY Account Usage view (*General availability*)](2025/other/2025-12-01-cortex-aisql-usage-history.md)
* [Nov 06, 2025: dbt Projects on Snowflake (*General availability*)](2025/other/2025-11-06-dbt-projects-on-snowflake-ga.md)
  + [What’s new since preview](2025/other/2025-11-06-dbt-projects-on-snowflake-ga.md)
* [Nov 06, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-06-dcr.md)
  + [Clean Rooms API Version: 11.2](2025/other/2025-11-06-dcr.md)
* [Nov 05, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*General availability*)](2025/other/2025-11-05-cortex-agents-teams-ga.md)
* [Nov 05, 2025: Shared Workspaces (*Preview*)](2025/other/2025-11-05-shared-workspaces.md)
  + [Key features](2025/other/2025-11-05-shared-workspaces.md)
* [Nov 05, 2025: Snowpipe Streaming with high-performance architecture on Azure (*General availability*)](2025/other/2025-11-05-snowpipe-streaming-azure-ga.md)
* [Nov 05, 2025: Support for paid listings in the Kingdom of Saudi Arabia (KSA) (*General availability*)](2025/other/2025-11-05-support-for-paid-listings-ksa.md)
* [Nov 04, 2025: Snowflake-managed MCP server (*General availability*)](2025/other/2025-11-04-cortex-agents-mcp.md)
* [Nov 04, 2025: Cortex Agents (*General availability*)](2025/other/2025-11-04-cortex-agents.md)
* [Nov 04, 2025: Interactive tables and interactive warehouses (*Preview*)](2025/other/2025-11-04-interactive-tables-and-interactive-warehouses.md)
* [Nov 04, 2025: Snowflake Machine Learning Experiments (*Preview*)](2025/other/2025-11-04-ml-experiment-tracking.md)
* [Nov 04, 2025: Snowflake Openflow - Snowflake Deployments (*General availability*)](2025/other/2025-11-04-openflow.md)
* [Nov 04, 2025: Performance Explorer (*General availability*)](2025/other/2025-11-04-performance-explorer-ga.md)
* [Nov 04, 2025: Sharing semantic views](2025/other/2025-11-04-sharing-semantic-views.md)
* [Nov 04, 2025: Snowflake Intelligence (*General availability*)](2025/other/2025-11-04-snowflake-intelligence.md)
* [Nov 03, 2025: Semantic views support for account replication](2025/other/2025-11-03-semantic-views-replication.md)
* [Oct 31, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*General availability*)](2025/other/2025-10-31-na-spcs-gcp-ga.md)
* [Oct 31, 2025: Organization-level findings in the Trust Center](2025/other/2025-10-31-trust-center-org-findings.md)
* [Oct 30, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-30-dcr.md)
  + [Clean Rooms API Version: 11.0](2025/other/2025-10-30-dcr.md)
* [Oct 29, 2025: CLIENT_POLICY parameter for authentication policies](2025/other/2025-10-29-client-version-policies.md)
* [Oct 29, 2025: Guided account failover in Snowsight (*General availability*)](2025/other/2025-10-29-guided-account-failover-snowsight.md)
* [Oct 29, 2025: Snowflake Native Apps: Shareback (*Preview*)](2025/other/2025-10-29-nativeapps-shareback.md)
* [Oct 23, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-23-dcr.md)
  + [Clean Rooms API Version: 10.6](2025/other/2025-10-23-dcr.md)
* [Oct 20, 2025: Performance Explorer (*Preview*)](2025/other/2025-10-20-performance-explorer.md)
* [Oct 17, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*General availability*)](2025/other/2025-10-17-iceberg-external-writes-cld-ga.md)
* [Oct 17, 2025: Partitioned writes for Apache Iceberg™ tables (*General availability*)](2025/other/2025-10-17-iceberg-partitioned-writes-ga.md)
* [Oct 17, 2025: Set a target file size for Apache Iceberg™ tables (*General availability*)](2025/other/2025-10-17-set-target-file-size-ga.md)
* [Oct 16, 2025: AI_EXTRACT function (*General availability*)](2025/other/2025-10-16-ai-extract.md)
* [Oct 16, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-16-dcr.md)
  + [Clean Rooms API Version: 10.5](2025/other/2025-10-16-dcr.md)
* [Oct 16, 2025: Organization account in a hybrid organization](2025/other/2025-10-16-hybrid-orgs.md)
* [Oct 15, 2025: Enforced join order with directed joins (*General availability*)](2025/other/2025-10-15-directed-join.md)
* [Oct 16, 2025: Cross-region inference for US Commercial Gov](2025/other/2025-10-16-aisql-cross-region-gov-preview.md)
* [Oct 13, 2025: CORTEX_EMBED_USER database role (*General availability*)](2025/other/2025-10-13-cortex-embed-user-db-role.md)
* [Oct 10, 2025: Cortex Search Component Scores (*Preview*)](2025/other/2025-10-10-cortex-search-component-scores.md)
* [Oct 09, 2025: dbt Projects on Snowflake: Recent improvements (*Preview*)](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [dbt Project failures show up as failed queries](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Compile on create](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Install deps on compile](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [MONITOR privilege](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Accessing execution results is easier](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
* [Oct 09, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-09-dcr.md)
  + [Clean Rooms API Version: 10.4](2025/other/2025-10-09-dcr.md)
* [Oct 09, 2025: Organization user groups with organizational listings (*Preview*)](2025/other/2025-10-09-org-user-groups-with-org-listings.md)
* [Oct 09, 2025: Verified query suggestions (*Preview*)](2025/other/2025-10-09-verified-query-suggestions.md)
* [Oct 07, 2025: Query insights in Snowsight (*General availability*)](2025/other/2025-10-07-query-insights-in-snowsight-ga.md)
* [Oct 06, 2025: Hybrid table support for Microsoft Azure (*General availability*)](2025/other/2025-10-06-hybrid-tables-azure-ga.md)
* [Oct 03, 2025: Named scoring profiles for Cortex Search Services (*General availability*)](2025/other/2025-10-03-cortex-search-named-scoring-profiles.md)
* [Oct 03, 2025: Lineage for stored procedures and tasks (*General availability*)](2025/other/2025-10-03-process-lineage.md)
* [Oct 02, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-02-dcr.md)
* [Oct 02, 2025: Snowflake-managed MCP server (*Preview*)](2025/other/2025-10-02-mcp-server.md)
* [Oct 02, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*General availability*)](2025/other/2025-10-02-semantic-views-in-snowsight.md)
* [Oct 01, 2025: New OBJECT_VISIBILITY property (*Preview*)](2025/other/2025-10-01-object-visibility.md)
* [Sep 30, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)](2025/other/2025-09-30-cortex-agents-teams-ga.md)
* [Sep 30, 2025: Declarative Sharing (*Preview*)](2025/other/2025-09-30-declarative-sharing.md)
* [Sep 30, 2025: GRANT OWNERSHIP ON NOTEBOOK (*General availability*)](2025/other/2025-09-30-grant-ownership-on-notebook.md)
* [Sep 30, 2025: Support for derived metrics in semantic views](2025/other/2025-09-30-semantic-view-derived-metrics.md)
* [Nov 04, 2025: Cortex AI_TRANSCRIBE function (*General availability*)](2025/other/2025-11-04-cortex-ai-transcribe-ga.md)
* [Nov 04, 2025: Cortex AI Functions (*General availability*)](2025/other/2025-11-04-cortex-aisql-operators-ga.md)
* [Sep 29, 2025: External OAuth support for Snowflake Open Catalog catalog integration (*General availability*)](2025/other/2025-09-29-open-catalog-support-external-oauth.md)
* [Sep 29, 2025: Using SQL for Cortex Powered Object Descriptions (*General availability*)](2025/other/2025-09-29-sql-object-descriptions.md)
* [Sep 26, 2025: AI_COUNT_TOKENS function (*Preview*)](2025/other/2025-09-26-ai-count-tokens-function.md)
* [Sep 25, 2025: Page filtering for AI_PARSE_DOCUMENT](2025/other/2025-09-25-ai-parse-document-page-filter.md)
* [Sep 25, 2025: Cortex AI Functions – AI_TRANSLATE (*General availability*)](2025/other/2025-09-25-ai-translate-updates.md)
* [Sep 25, 2025: Cost management — Updating budgets more frequently](2025/other/2025-09-25-budget-refresh-interval.md)
* [Sep 25, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-09-25-dcr.md)
* [Sep 25, 2025: FILE data type (*General availability*)](2025/other/2025-09-25-file-data-type-ga.md)
* [Sep 23, 2025: AI_FILTER Performance Optimization (*Preview*)](2025/other/2025-09-23-ai-filter-optimization.md)
* [Sep 23, 2025: Snowpipe Streaming with high-performance architecture (*General availability*)](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Key features and benefits](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Key difference from classic architecture](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Recommended use cases](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
* [Sep 22, 2025: Prevent data compaction on Snowflake-managed Apache Iceberg™ tables](2025/other/2025-09-22-enable-data-compaction-parameter.md)
* [Sep 19, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*Preview*)](2025/other/2025-09-19-native-apps-spcs-aws-fedramp-ga.md)
* [Sep 19, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Azure (*Preview*)](2025/other/2025-09-19-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-azure.md)
* [Sep 19, 2025: Read consistency mode for sessions with near-concurrent changes](2025/other/2025-09-19-read-consistency-mode.md)
* [Sep 19, 2025: SnowConvert AI Verification (*Preview*)](2025/other/2025-09-19-snowconvert-ai-verification.md)
* [Sep 17, 2025: Snowflake Openflow - Snowflake Deployments (*Preview*)](2025/other/2025-09-17-openflow.md)
* [Sep 17, 2025: New SYS_CONTEXT function for getting context about applications, sessions, and organizations](2025/other/2025-09-17-sys_context-function.md)
* [Sep 17, 2025: Data lineage for tasks](2025/other/2025-09-17-task-lineage.md)
* [Sep 16, 2025: Support for Streamlit in Snowflake in the People’s Republic of China (Preview)](2025/other/2025-09-16-sis.md)
* [Sep 15, 2025: Billing views for Snowflake resellers and distributors](2025/other/2025-09-15-billing-schema.md)
* [Sep 15, 2025: Snowflake Native Apps updates](2025/other/2025-09-15-native-app-ga.md)
  + [Automated granting of privileges (General availability)](2025/other/2025-09-15-native-app-ga.md)
  + [App specifications (General availability)](2025/other/2025-09-15-native-app-ga.md)
  + [Feature policies (General availability)](2025/other/2025-09-15-native-app-ga.md)
* [Sep 15, 2025: Multi-factor authentication — Support for one-time passcodes](2025/other/2025-09-15-otp.md)
* [Sep 12, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Amazon S3 or Google Cloud (*Preview*)](2025/other/2025-09-12-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-s3-google-cloud.md)
* [Sep 11, 2025: Support for Snowflake Cortex AI Functions in incremental dynamic table refresh](2025/other/2025-09-11-dynamic-tables-cortex-aisql-support.md)
* [Sep 11, 2025: Workspaces (*General availability*)](2025/other/2025-09-11-workspaces-ga.md)
* [Sep 09, 2025: Sensitive data classification](2025/other/2025-09-09-data-classification.md)
  + [Classifying views automatically (*General availability*)](2025/other/2025-09-09-data-classification.md)
  + [Excluding objects from automatic classification (*Preview*)](2025/other/2025-09-09-data-classification.md)
* [Sep 09, 2025: Hybrid table support for Microsoft Azure (*Preview*)](2025/other/2025-09-09-hybrid-tables-azure-pupr.md)
* [Sep 09, 2025: Using Snowsight to monitor data quality (*Preview*)](2025/other/2025-09-11-dq-ui.md)
* [Sep 02, 2025: Cortex Agents: Admin object REST API (*Preview*)](2025/other/2025-09-02-cortex-agents-rest-api-object.md)
* [Sep 02, 2025: Document AI models in the model registry](2025/other/2025-09-02-document-ai.md)
* [Sep 02, 2025: Partitioned writes for Apache Iceberg™ tables (*Preview*)](2025/other/2025-09-02-iceberg-partitioned-writes.md)
* [Aug 29, 2025: Snowflake Native Apps: Restricted caller’s rights (*General availability*)](2025/other/2025-08-29-native-apps-rcr-ga.md)
* [Aug 28, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-08-28-dcr.md)
* [Aug 28, 2025: Hybrid table support for periodic rekeying (*General availability*)](2025/other/2025-08-28-hybrid-tables-periodic-rekeying.md)
* [Aug 28, 2025: Monitoring events for Snowpipe](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
  + [Snowpipe: data ingestion events](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
  + [Externally managed Apache Iceberg™ tables: automated refresh events](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
* [Aug 26, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*Preview*)](2025/other/2025-08-26-semantic-views-in-snowsight.md)
* [Aug 25, 2025: Snowflake Connectors for Microsoft Power Apps (*General availability*)](2025/other/2025-08-25-mspowerapps.md)
* [Aug 22, 2025: AI_EXTRACT function (*Preview*)](2025/other/2025-08-22-ai-extract.md)
* [Aug 22, 2025: Organization profile updates](2025/other/2025-08-22-org-profiles.md)
* [Aug 21, 2025: AI Parse Document layout mode (*General availability*)](2025/other/2025-08-21-aisql-ai-parse-document-layout-ga.md)
* [Aug 21, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-08-21-dcr.md)
* [Aug 20, 2025: Cortex Search Service replication (*Preview*)](2025/other/2025-08-20-cortex-search-service-replication.md)
* [Aug 20, 2025: Distributed processing in Snowflake ML: Many Model Training and Distributed Partition Function](2025/other/2025-08-20-snowflake-ml-distributed-processing.md)
* [Aug 20, 2025: New stage volume implementation in Snowpark Container Services (*Preview*)](2025/other/2025-08-20-spcs-stage-volume-new.md)
* [Aug 28, 2025: Model Registry model deployment UI (*Preview*)](2025/other/2025-08-28-model-deployment-ui.md)
* [Aug 19, 2025: Trust Center email notifications (*General availability*)](2025/other/2025-08-19-trust-center-email-notifications-ga.md)
* [Aug 18, 2025: Snowsight navigation menu updates (Gradual rollout)](2025/other/2025-08-18-snowsight-navigation.md)
* [Aug 18, 2025: Write Once, Read Many (WORM) snapshots (*Preview*)](2025/other/2025-08-18-worm-snapshots.md)
* [Aug 14, 2025: Support for stored procedures in data lineage (*Preview*)](2025/other/2025-08-14-lineage.md)
* [Aug 14, 2025: Using SQL for Cortex Powered Object Descriptions (*Preview*)](2025/other/2025-08-14-sql-object-descriptions.md)
* [Aug 14, 2025: Workload identity federation (*General availability*)](2025/other/2025-08-14-wif.md)
* [Aug 12, 2025: Snowflake ML Jobs (*General availability*)](2025/other/2025-08-12-distributed-ml-jobs.md)
* [Aug 12, 2025: Support for Streamlit 1.46 (General availability)](2025/other/2025-08-12-sis.md)
* [Aug 11, 2025: CORS configuration to enable cross-origin requests to a Snowpark Container Services service (*General availability*)](2025/other/2025-08-11-spcs-cors-ga.md)
* [Aug 08, 2025: Contacts (*General availability*)](2025/other/2025-08-08-contacts.md)
* [Aug 07, 2025: Cortex AI_TRANSCRIBE (*Preview*)](2025/other/2025-08-07-cortex-aisql-ai-transcribe.md)
* [Aug 07, 2025: Enforced join order with directed joins (*Preview*)](2025/other/2025-08-07-directed-join.md)
* [Aug 07, 2025: Snowpark Container Services batch jobs (*Preview*)](2025/other/2025-08-07-spcs-batch-jobs-pupr.md)
* [Aug 06, 2025: Cortex Agents: admin configuration UI (*Preview*)](2025/other/2025-08-06-cortex-agents-admin-ui.md)
* [Aug 06, 2025: Support for custom components in Streamlit in Snowflake (Preview)](2025/other/2025-08-06-sis.md)
* [Aug 05, 2025: Document AI table extraction (*General availability*)](2025/other/2025-08-05-document-ai.md)
* [Aug 04, 2025: Hybrid table storage for Time Travel data](2025/other/2025-08-04-hybrid-tables-time-travel-billing.md)
* [Aug 01, 2025: Snowflake Intelligence (*Preview*)](2025/other/2025-08-01-snowflake-intelligence.md)
* [Aug 01, 2025: Snowpark Container Services in Google Cloud (*General availability*)](2025/other/2025-08-01-spcs-google-cloud-ga.md)
* [Jul 31, 2025: AI Observability in Snowflake Cortex (*General availability*)](2025/other/2025-07-31-ai-observability-ga.md)
* [Jul 30, 2025: External network access with private connectivity: Google Cloud](2025/other/2025-07-30-outbound-network-access-private-gcp.md)
* [Jul 29, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)](2025/other/2025-07-29-cortex-agents-for-ms-teams.md)
* [Jul 28, 2025: Single-use refresh tokens for Snowflake OAuth](2025/other/2025-07-28-oauth.md)
* [Jul 28, 2025: Cortex Powered Object Descriptions](2025/other/2025-07-28-object-descriptions.md)
  + [Ability to generate descriptions without being the owner](2025/other/2025-07-28-object-descriptions.md)
* [Jul 25, 2025: Cortex AI Functions AI_SENTIMENT (*General availability*)](2025/other/2025-07-25-cortex-aisql-ai-sentiment.md)
* [Jul 25, 2025: Snowflake Native App Framework support for Snowflake machine learning models (*General availability*)](2025/other/2025-07-25-na-ml-ga.md)
* [Jul 24, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-07-24-dcr.md)
* [Jul 21, 2025: Billing contact information updates (*General availability*)](2025/other/2025-07-21-billing-contact-info-updates.md)
* [Jul 21, 2025: CREATE_BILLING_EVENT and CREATE_BILLING_EVENTS system functions (*General availability*)](2025/other/2025-07-21-create-billing-events-ga.md)
* [Jul 18, 2025: Alerts on new data (*General availability*)](2025/other/2025-07-18-alerts-on-new-data.md)
* [Jul 18, 2025: Sensitive data classification](2025/other/2025-07-18-database-classification.md)
  + [Automatic classification of a database (*Preview*)](2025/other/2025-07-18-database-classification.md)
  + [Determine which databases and schemas are monitored by automatic sensitive data classification (*Preview*)](2025/other/2025-07-18-database-classification.md)
* [Jul 18, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*Preview*)](2025/other/2025-07-18-iceberg-external-writes-cld.md)
* [Jul 17, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-07-17-dcr.md)
* [Jul 16, 2025: Data governance release notes](2025/other/2025-07-16-tag-propagation-log.md)
  + [Automatic tag propagation: Event table to monitor conflicts (*General availability*)](2025/other/2025-07-16-tag-propagation-log.md)
* [Jul 15, 2025: Support for Streamlit 1.45.1 (General availability)](2025/other/2025-07-15-sis.md)
* [Jul 08, 2025: Snowflake AI_EMBED multimodal embeddings (*Preview*)](2025/other/2025-07-08-aisql-image-ai-embed.md)
* [Jul 08, 2025: ML Explainability visualizations (*General availability*)](2025/other/2025-07-08-ml-explainability-visualizations.md)
* [Jul 07, 2025: Account Usage: New CREDENTIALS view](2025/other/2025-07-07-credentials-view.md)
* [Jul 04, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*Preview*)](2025/other/2025-07-04-na-spcs-gcp-ga.md)
* [Jul 03, 2025: Query insights](2025/other/2025-07-03-query-insights.md)
* [Jul 01, 2025: Snowflake Multi-Node ML Jobs (*Preview*)](2025/other/2025-07-01-distributed-ml-jobs.md)
* [Jun 27, 2025: dbt Projects on Snowflake (*Preview*)](2025/other/2025-06-27-dbt-projects-on-snowflake.md)
* [Jun 26, 2025: Clone dynamic tables as tables (*General availability*)](2025/other/2025-06-26-clone-dt-as-table.md)
* [Jun 24, 2025: Premium views in the organization account (*General availability*)](2025/other/2025-06-24-premium-views.md)
* [Jun 23, 2025: Snowflake Native App Framework updates](2025/other/2025-06-23-auto-privs-app-spec.md)
  + [Automated granting of privileges (*Preview*)](2025/other/2025-06-23-auto-privs-app-spec.md)
  + [App specifications (*Preview*)](2025/other/2025-06-23-auto-privs-app-spec.md)
* [Jun 19, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-06-19-dcr.md)
* [Jun 18, 2025: Customized runtime environments in Warehouse notebooks (*Preview*)](2025/other/2025-06-18-preconfigured-nb-wh-runtime.md)
* [Jun 16, 2025: Cost Management release notes](2025/other/2025-06-16-budget.md)
  + [Budgets: Using tags to add objects](2025/other/2025-06-16-budget.md)
* [Jun 12, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-06-12-dcr.md)
* [Jun 03, 2025: Snowflake Copilot inline (*Preview*)](2025/other/2025-06-03-copilot-inline.md)
* [Jun 03, 2025: Workspaces in Snowsight (*Preview*)](2025/other/2025-06-03-workspaces.md)
* [Jun 02, 2025: AI_CLASSIFY supports up to 500 labels and multi-label classification](2025/other/2025-06-02-ai-classify-label-increase.md)
* [Jun 02, 2025: Snowflake Cortex AI Functions (*Preview*)](2025/other/2025-06-02-cortex-aisql-public-preview.md)
  + [AI capability meets SQL operators across multimodal data](2025/other/2025-06-02-cortex-aisql-public-preview.md)
* [Jun 01, 2025: Snowsight templates in trial accounts (*General availability*)](2025/other/2025-06-01-snowsight-templates.md)
* [May 30, 2025: Additional model support for Cortex AISQL Images](2025/other/2025-05-30-complete-multimodal-new-models.md)
* [May 30, 2025: Snowflake Openflow (*General availability*)](2025/other/2025-05-30-openflow.md)
* [May 30, 2025: Request Approval Workflow (*General availability*)](2025/other/2025-05-30-raw.md)
* [May 30, 2025: Data Governance release notes](2025/other/2025-05-30-tags.md)
  + [Object tags available in Standard Edition](2025/other/2025-05-30-tags.md)
* [May 29, 2025: Table extraction in Document AI (*Preview*)](2025/other/2025-05-29-document-ai.md)
* [May 29, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-30-dcr.md)
* [May 28, 2025: Organization users (*Preview*)](2025/other/2025-05-29-org-users.md)
* [May 27, 2025: Data sharing & collaboration for accounts in Kingdom of Saudi Arabia region](2025/other/2025-05-27-KSA-regions.md)
* [May 27, 2025: Snowflake Native App with Snowpark Container Services support for Azure Private Link (*General availability*)](2025/other/2025-05-27-na-spcs-azure-pl-ga.md)
* [May 27, 2025: Security release notes](2025/other/2025-05-23-mfa.md)
  + [New authentication methods for multi-factor authentication (MFA) (*General availability*)](2025/other/2025-05-23-mfa.md)
* [May 23, 2025: Notebooks `st.secrets` support for Warehouse and Container Runtimes (*General availability*)](2025/other/2025-05-23-st-secrets-support.md)
* [May 22, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-22-dcr.md)
* [May 22, 2025: New Snowsight navigation menu (*Preview*)](2025/other/2025-05-22-snowsight-navigation-menu.md)
* [May 21, 2025: Snowflake Openflow updates](2025/other/2025-05-21-openflow.md)
* [May 20, 2025: Data Governance release notes](2025/other/2025-05-20-contacts.md)
  + [Contacts for objects (*Preview*)](2025/other/2025-05-20-contacts.md)
* [May 20, 2025: Snowflake Copilot model level RBAC](2025/other/2025-05-20-model-level-rbac.md)
* [May 20, 2025: Snowflake Openflow (*Preview*)](2025/other/2025-05-20-openflow.md)
* [May 20, 2025: Snowpark Container Services preview available in Google Cloud (*Preview*)](2025/other/2025-05-20-spcs-preview-available-in-gcp.md)
* [May 19, 2025: Cortex COMPLETE Structured Output schema references](2025/other/2025-05-19-complete-structured-output-json-refs.md)
* [May 19, 2025: Snowflake ML Data Connector release notes](2025/other/2025-05-19-data-connector-container-runtime.md)
  + [Snowflake ML Data Connector for Container Runtime (*General availability*)](2025/other/2025-05-19-data-connector-container-runtime.md)
* [May 19, 2025: Snowflake Notebooks Container Runtime - Support for Azure and Azure Private Link (*General availability*)](2025/other/2025-05-19-nb-spcs-azure-pl-ga.md)
* [May 16, 2025: Cost Management release notes](2025/other/2025-05-16-cost.md)
  + [Cost anomalies (*Preview*)](2025/other/2025-05-16-cost.md)
* [May 16, 2025: Universal Search support for pipes, tasks, and streams (*General availability*)](2025/other/2025-05-16-universal-search-pipes-tasks-streams.md)
* [May 15, 2025: Organizational listings: discovery and access](2025/other/2025-05-15-dna.md)
* [May 14, 2025: Data Governance release notes](2025/other/2025-05-14-tag-propagation.md)
  + [Automatic propagation of user-defined tags (*General availability*)](2025/other/2025-05-14-tag-propagation.md)
* [May 13, 2025: Support for Streamlit 1.44.0 (General availability)](2025/other/2025-05-13-sis.md)
* [May 08, 2025: Document AI updates](2025/other/2025-05-08-document-ai.md)
* [May 08, 2025: Dynamic tables: Support for IS_ROLE_IN_SESSION in access policies (*General availability*)](2025/other/2025-05-08-dynamic-tables-is-role-in-session.md)
* [May 05, 2025: Generation 2 standard warehouses (*General availability*)](2025/other/2025-05-05-gen2-standard-warehouses.md)
* [May 05, 2025: Snowflake Cortex Provisioned Throughput (*General availability*)](2025/other/2025-05-05-provisioned-throughput.md)
* [May 01, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-01-dcr.md)
* [May 01, 2025: Dynamic tables: Support for filtering by current time and date for incremental refresh (*General availability*)](2025/other/2025-05-01-dynamic-tables-current-timestamp.md)
* [Apr 30, 2025: Programmatic access tokens](2025/other/2025-04-30-programmatic-access-tokens.md)
* [Apr 28, 2025: Boost Cortex Search results based on metadata signals (*General availability*)](2025/other/2025-04-28-boost-decay.md)
* [Apr 28, 2025: Role-Based Access Control for Cortex LLM Models](2025/other/2025-04-28-cortex-llm-model-rbac.md)
* [Apr 28, 2025: Disable reranker in Cortex Search queries (*General availability*)](2025/other/2025-04-28-reranking.md)
* [Apr 24, 2025: Container Runtime for ML on multi-node clusters (*Preview*)](2025/other/2025-04-24-container-runtime-multi-node.md)
* [Apr 24, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-24-dcr.md)
* [Apr 22, 2025: Trust Center email notifications (*Preview*)](2025/other/2025-04-22-trust-center-email-notifications.md)
* [Apr 18, 2025: Support for `st.query_params` (General availability)](2025/other/2025-04-18-sis.md)
* [Apr 17, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-17-dcr.md)
* [Apr 17, 2025: Semantic views (*Preview*)](2025/other/2025-04-17-semantic-views.md)
* [Apr 16, 2025: Document AI multi-language support](2025/other/2025-04-16-document-ai.md)
* [Apr 16, 2025: Snowflake ML Jobs (*Preview*)](2025/other/2025-04-16-snowflake-ml-jobs.md)
* [Apr 15, 2025: Snowflake Cortex AI state-of-the-art Entity Sentiment (*Preview*)](2025/other/2025-04-15-cortex-entity-sentiment-function.md)
* [Apr 15, 2025: Snowflake Egress Cost Optimizer (*General availability*)](2025/other/2025-04-15-eco-ga.md)
* [Apr 15, 2025: Search optimization improves the performance of queries containing scalar functions](2025/other/2025-04-15-search-optimization-scalar-functions.md)
* [Apr 14, 2025: Snowflake Cortex AI COMPLETE multimodal support (*Preview*)](2025/other/2025-04-14-cortex-complete-multimodal.md)
* [Apr 14, 2025: EMBED Function Added to Cortex REST API (*General availability*)](2025/other/2025-04-14-cortex-offers-embed-rest-api.md)
* [Apr 14, 2025: Mistral AI’s multimodal Pixtral Large now available for Snowflake Cortex AI (*General availability*)](2025/other/2025-04-14-cortex-offers-pixtral-large.md)
* [Apr 14, 2025: FILE data type to create tables for multimodal analysis (*Preview*)](2025/other/2025-04-14-file-data-type.md)
* [Apr 14, 2025: PROMPT helper function (*Preview*)](2025/other/2025-04-14-prompt-helper-function.md)
* [Apr 14, 2025: Support for Streamlit 1.42.0 (General availability)](2025/other/2025-04-14-sis.md)
* [Apr 11, 2025: Snowsight replication configuration and monitoring (*General availability*)](2025/other/2025-04-11-snowsight-replication-setup-and-monitoring.md)
* [Apr 10, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-10-dcr.md)
* [Apr 7, 2025: Google Cloud Private Service Connect in Streamlit in Snowflake (Preview)](2025/other/2025-04-07-sis.md)
* [Apr 04, 2025: Cortex AI Observability (*Preview*)](2025/other/2025-04-04-cortex-ai-observability.md)
* [Apr 04, 2025: Cortex COMPLETE Structured Outputs (*General availability*)](2025/other/2025-04-04-cortex-complete-structured-outputs.md)
* [Mar 31, 2025: Data Governance release notes](2025/other/2025-03-31-ah-joins.md)
  + [Access history: Support for joins](2025/other/2025-03-31-ah-joins.md)
* [Mar 27, 2025: Snowflake Data Clean Rooms release notes](2025/other/2025-03-27-dcr.md)
  + [Simplified onboarding](2025/other/2025-03-27-dcr.md)
  + [Analysis Error Messaging in the clean rooms UI](2025/other/2025-03-27-dcr.md)
  + [Obfuscated provider templates](2025/other/2025-03-27-dcr.md)
  + [Cross-cloud collaboration support for multiple accounts](2025/other/2025-03-27-dcr.md)
  + [Update to default caching behavior in consumer run analysis](2025/other/2025-03-27-dcr.md)
  + [New limited API access role for developers](2025/other/2025-03-27-dcr.md)
  + [LiveRamp Identity & Translation integration update](2025/other/2025-03-27-dcr.md)
* [Mar 27, 2025: Git integration and multi-file editing in Streamlit in Snowflake (Preview)](2025/other/2025-03-27-sis.md)
* [Mar 26, 2025: Support for multiple semantic models in Cortex Analyst queries (*General availability*)](2025/other/2025-03-26-multiple-models-cortex-analyst.md)
* [Mar 24, 2025: Support for `st.experimental_audio_input` and `st.camera_input` (General availability)](2025/other/2025-03-24-sis.md)
* [Mar 20, 2025: Data Governance release notes](2025/other/2025-03-20-cortex-descriptions.md)
  + [Cortex Powered Object Descriptions: Support for additional table types](2025/other/2025-03-20-cortex-descriptions.md)
* [Mar 20, 2025: Snowflake Datasets (*General availability*)](2025/other/2025-03-20-snowflake-ml-datasets.md)
* [Mar 19, 2025: Alerts on new data (*Preview*)](2025/other/2025-03-19-alerts-on-new-data.md)
* [Mar 19, 2025: Additional file format support for Cortex AI Parse Document](2025/other/2025-03-19-parse-document-more-file-formats.md)
* [Mar 17, 2025: Document AI release notes](2025/other/2025-03-17-document-ai.md)
* [Mar 17, 2025: Snowflake Notebooks on Container Runtime for AWS (*General availability*)](2025/other/2025-03-17-notebooks-on-spcs-aws.md)
  + [New features](2025/other/2025-03-17-notebooks-on-spcs-aws.md)
* [Mar 12, 2025: Support for `st.file_uploader` (General availability)](2025/other/2025-03-12-sis.md)
* [Mar 07, 2025: RESOURCE_CONSTRAINT clause for Snowpark-optimized warehouses (*General availability*)](2025/other/2025-03-07-snowpark-optimized-warehouses-resource_constraint.md)
* [Mar 06, 2025: Cortex AI PARSE_DOCUMENT function for OCR (*General availability*)](2025/other/2025-03-06-ocr-mode-parse-document.md)
* [Mar 05, 2025: Search optimization improves the performance of queries containing scalar subqueries](2025/other/2025-03-05-search-optimization-scalar-subqueries.md)
* [Mar 05, 2025: Snowpark Container Services support for application metrics](2025/other/2025-03-05-spcs-application-metrics.md)
* [Mar 04, 2025: Universal Search ML model support (*General availability*)](2025/other/2025-03-04-universal-search-ml-models.md)
* [Mar 03, 2025: Snowflake Cortex Document Processing Usage History](2025/other/2025-03-03-cortex-document-processing-usage-history.md)
* [Mar 03, 2025: Native Apps with Snowpark Container Services - Support for AWS PrivateLink (*General availability*)](2025/other/2025-03-03-na-spcs-aws-pl-ga.md)
* [Mar 03, 2025: Native Apps with Snowpark Container Services - Support for Azure Private Link (*Preview*)](2025/other/2025-03-03-na-spcs-azure-pl-pupr.md)
* [Mar 03, 2025: Collapsible navigation bar in Snowsight (*General availability*)](2025/other/2025-03-03-snowsight-collapsible-nav-bar.md)
* [Feb 28, 2025: Increased max_cluster_count limits for multi-cluster warehouses](2025/other/2025-02-28-increased-max_cluster_count-limits.md)
* [Feb 27, 2025: Snowflake Data Clean Rooms release notes](2025/other/2025-02-27-dcr.md)
  + [UI loading improvements](2025/other/2025-02-27-dcr.md)
  + [External and Apache Iceberg™ table support in SQL templates](2025/other/2025-02-27-dcr.md)
  + [Data Clean Rooms available with data sharing terms](2025/other/2025-02-27-dcr.md)
  + [Improvements to provider-linked views in the API](2025/other/2025-02-27-dcr.md)
  + [Multi-template approval](2025/other/2025-02-27-dcr.md)
  + [Change in UI form handling with custom templates](2025/other/2025-02-27-dcr.md)
* [Feb 27, 2025: Snowflake Native Apps release channels (*Preview*)](2025/other/2025-02-27-na-release-channels.md)
* [Feb 26, 2025: Generating connection settings for a client, driver, library, or third-party application](2025/other/2025-02-26-connect-to-snowflake.md)
* [Feb 24, 2025: Changes to the app toolbar in Snowsight](2025/other/2025-02-24-na-toolbar-change.md)
* [Feb 19, 2025: Snowflake ML Model Serving Automatic Suspension (*Preview*)](2025/other/2025-02-19-spcs-model-serving-auto-suspend.md)
* [Feb 14, 2025: Document AI release notes](2025/other/2025-02-14-document-ai.md)
* [Feb 14, 2025: Support for `st.file_uploader` (Preview)](2025/other/2025-02-14-sis.md)
* [Feb 13, 2025: Snowpark Container Services (SPCS) Model Serving on Azure (*Preview*)](2025/other/2025-02-13-spcs-model-serving-azure.md)
* [Feb 11, 2025: Snowflake Cortex COMPLETE Structured Outputs (*Preview*)](2025/other/2025-02-11-cortex-complete-structured-outputs.md)
* [Feb 07, 2025: Snowflake Cortex Fine-tuning (*General availability*)](2025/other/2025-02-07-cortex-finetuning.md)
* [Feb 7, 2025: Support for material icons (General Availability)](2025/other/2025-02-07-sis.md)
* [Feb 03, 2025: Snowflake Native Apps with Snowpark Container Services support for Azure (*General availability*)](2025/other/2025-02-03-na-spcs-azure-ga.md)
* [Jan 31, 2025: Support for future grants in Streamlit in Snowflake (General Availability)](2025/other/2025-01-31-sis.md)
* [Jan 27, 2025: Organization account (*General availability*)](2025/other/2025-01-27-org-account.md)
* [Jan 23, 2025: Document AI on GCP (*General availability*)](2025/other/2025-01-23-document-ai.md)
* [Jan 20, 2025: Snowflake Native Apps with Snowpark Container Services support for AWS PrivateLink (*Preview*)](2025/other/2025-01-20-na-spcs-aws-pl-pupr.md)
* [Jan 16, 2025: Snowsight enhancements to contact email management (*General availability*)](2025/other/2025-01-16-snowsight-contact-email-update.md)
* [Jan 15, 2025: Custom instructions in Cortex Analyst (*Preview*)](2025/other/2025-01-15-cortex-analyst-custom-instructions.md)
* [Jan 15, 2025: Optimized COPY and INSERT bulk loads on empty hybrid tables (*General availability*)](2025/other/2025-01-15-ht-optimized-bulk-load.md)
* [Jan 07, 2025: Snowflake Cortex Playground (*Preview*)](2025/other/2025-01-07-cortex-llm-playground.md)
* [Jan 06, 2025: Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link (*General availability*)](2025/other/2025-01-06-notebooks-wh-aws-azure-pl.md)

---
title: Feb 01, 2026: New ORGANIZATION_USAGE premium views
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-01-organization-usage-new-views.md
section: Release Notes
---

# Feb 01, 2026: New ORGANIZATION_USAGE premium views

The ORGANIZATION_USAGE schema now includes three new [premium views](../../../user-guide/organization-accounts-premium-views.md)
that are available in the [organization account](../../../user-guide/organization-accounts.md). These views provide
visibility into usage across all accounts in your organization.

The new views are:

* [METERING_HISTORY](../../../sql-reference/organization-usage/metering_history.md) — Returns hourly credit usage for each account in your organization.
* [QUERY_ATTRIBUTION_HISTORY](../../../sql-reference/organization-usage/query_attribution_history.md) — Attributes compute costs to specific queries run on warehouses in your organization.

For more information about accessing premium views in the organization account, see [Access schema in the organization account](../../../sql-reference/organization-usage.md).

> **Note:**
>
> The new premium views are being rolled out slowly and should be available to all accounts by February 9, 2026.

---
title: Feb 02, 2026: Cortex Code CLI (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-02-cortex-code-cli.md
section: Release Notes
---

# Feb 02, 2026: Cortex Code CLI (*General availability*)

The Cortex Code CLI is a command-line interface for interacting with [Cortex Code](../../../user-guide/cortex-code/cortex-code.md),
Snowflake’s AI-driven development assistant. With the Cortex Code CLI, you can query your Snowflake data, build Streamlit apps,
and create agents that interact with your Snowflake data, all with natural language prompts.

The Cortex Code CLI is extensible, letting you add custom commands, tools, subagents, and hooks to tailor its functionality to your needs.

For more information, see [Cortex Code CLI](../../../user-guide/cortex-code/cortex-code-cli.md).

---
title: Feb 02, 2026: Cortex Code in Snowsight (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-02-cortex-code-snowsight.md
section: Release Notes
---

# Feb 02, 2026: Cortex Code in Snowsight (*Preview*)

Snowsight now includes Cortex Code, available in [preview](../../preview-features.md). Cortex Code is an agentic
assistant that helps with tasks such as SQL and Python notebook development, data exploration, and account administration.

## Key capabilities

* Generate, modify, and explain SQL files and Python Notebooks in Workspaces, including reviewing suggested edits in a diff view before applying changes.
* Discover data assets and documentation using Snowflake context such as object metadata, tags, and lineage (when available).
* Assist with administrative workflows such as governance/security questions and cost investigation.

For details, see [Cortex Code in Snowsight](../../../user-guide/cortex-code/cortex-code-snowsight.md).

---
title: Feb 02, 2026: Share Connected Apps in Snowflake Marketplace listings (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-02-share-connected-apps-in-sfmarketplace-listings-ga.md
section: Release Notes
---

# Feb 02, 2026: Share Connected Apps in Snowflake Marketplace listings (*General availability*)

Providers can now share Connected Apps in Snowflake Marketplace listings. Connected Apps are applications that live in the provider’s own
environment. Connected Apps connect to a consumer’s Snowflake account using supported clients/credentials to work with their data.

For more information on how to attach a Connected App to a listing, see [Share data or apps publicly on Snowflake Marketplace](../../../collaboration/provider-listings-creating-publishing.md).

---
title: Feb 02, 2026: Support for listing and share observability (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-02-listing-observability-ga.md
section: Release Notes
---

# Feb 02, 2026: Support for listing and share observability (*General availability*)

Enhanced observability for listings and shares through new Information Schema views, table functions, and Account Usage views is now generally available.

## New views and functions in the INFORMATION_SCHEMA schema

The following Information Schema views and table functions are now available:

### INFORMATION_SCHEMA.LISTINGS view (for providers)

The LISTINGS view displays all listings for which the current role has been granted access privileges.
This view provides real-time information with no data latency. It doesn’t capture deleted objects.

Usage example:

```sqlexample
SELECT * FROM <database_name>.INFORMATION_SCHEMA.LISTINGS;
```

For the complete list of columns, see [LISTINGS view](../../../sql-reference/info-schema/listings.md).

### INFORMATION_SCHEMA.SHARES view (for providers and consumers)

The SHARES view lists all shares available in the system, consistent with the output of the [SHOW SHARES](../../../sql-reference/sql/show-shares.md) command. This includes:

* Outbound shares (to consumers) that have been created in your account as a provider
* Inbound shares (from providers) that are available for your account to consume

Usage example:

```sqlexample
SELECT * FROM <database_name>.INFORMATION_SCHEMA.SHARES;
```

For more information, see [SHARES view](../../../sql-reference/info-schema/shares.md).

### INFORMATION_SCHEMA.AVAILABLE_LISTINGS table function (for consumers)

The AVAILABLE_LISTINGS table function in the Information Schema returns all listings that are available for the consumer to discover or
access. The function supports optional filters for imported listings, organization listings, and directly shared listings.

Usage example:

```sqlexample
SELECT * FROM TABLE(<database_name>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS());

-- Filter for imported listings only
SELECT * FROM TABLE(<database_name>.INFORMATION_SCHEMA.AVAILABLE_LISTINGS(IS_IMPORTED => TRUE));
```

For more information, see [AVAILABLE_LISTINGS](../../../sql-reference/functions/available_listings.md).

## New and updated views in the ACCOUNT_USAGE schema

The following Account Usage views are now available for historical analysis with up to three hours of data latency:

### ACCOUNT_USAGE.LISTINGS view (for providers)

This view displays a row for each listing in the provider account, including listings that have been dropped.

For more information, see [LISTINGS view](../../../sql-reference/account-usage/listings.md).

### ACCOUNT_USAGE.SHARES view (for providers)

This view displays a row for each share in the provider account, including shares that have been dropped.

For more information, see [SHARES view](../../../sql-reference/account-usage/shares.md).

### ACCOUNT_USAGE.GRANTS_TO_SHARES view (for providers)

This view can be used to query access control privileges that have been granted to a share, including historical grant and revoke operations.

For more information, see [GRANTS_TO_SHARES view](../../../sql-reference/account-usage/grants_to_shares.md).

### Updates to ACCOUNT_USAGE.ACCESS_HISTORY view

The ACCESS_HISTORY view now captures the following DDL operations on listings and shares:

* `CREATE`, `ALTER`, and `DROP` operations on listings.
* `CREATE`, `ALTER`, and `DROP` operations on shares.
* Detailed property changes in the `OBJECT_MODIFIED_BY_DDL` JSON column.

These additions enable comprehensive auditing of listing lifecycle and share lifecycle events.

For more information, see [ACCESS_HISTORY view](../../../sql-reference/account-usage/access_history.md) in the ACCOUNT_USAGE schema and
[ACCESS_HISTORY view](../../../sql-reference/organization-usage/access_history.md) in the ORGANIZATION_USAGE schema.

---
title: Feb 02, 2026: Use Snowsight to manage external volumes (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-02-manage-external-volumes-by-using-snowsight.md
section: Release Notes
---

# Feb 02, 2026: Use Snowsight to manage external volumes (*Preview*)

You can now use Snowsight to manage external volumes for Apache Iceberg™ tables by performing the following
tasks:

* Create an external volume, which includes optionally setting the external volume as the default at the account, database, or schema level
* Grant USAGE privileges to an external volume
* Add a storage location to an external volume
* Verify an external volume to check that Snowflake can successfully authenticate to your storage provider
* Drop an external volume

For more information, see the following topics:

* [Configure an external volume](../../../user-guide/tables-iceberg-configure-external-volume.md)
* [Drop an external volume by using Snowsight](../../../user-guide/tables-iceberg-drop-external-volume.md)

---
title: Feb 03, 2025: Snowflake Native Apps with Snowpark Container Services support for Azure (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-03-na-spcs-azure-ga.md
section: Release Notes
---

# Feb 03, 2025: Snowflake Native Apps with Snowpark Container Services support for Azure (*General availability*)

We are pleased to announce the general availability of support for Snowflake Native Apps with Snowpark Container Services on Microsoft Azure.

Providers can now build container apps on Azure that are deployed and monetized for Snowflake customers through the
Snowflake Marketplace. Using Snowpark Container Services, existing containerized workloads can be leveraged in an accelerated
development cycle.

Developers can write app code in any programming language, package it as a container, and run it in on multiple
configurable hardware options, including GPUs, all within a Snowflake Native App.

See [Workflow: Develop an app with containers](../../../developer-guide/native-apps/container-workflow.md) for more information on developing container apps.

---
title: Feb 04, 2026: Cortex Search Component Scores (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-04-cortex-search-component-scores-ga.md
section: Release Notes
---

# Feb 04, 2026: Cortex Search Component Scores (*General availability*)

Cortex Search Component Scores are now generally available. Component scores
allow developers to access detailed scoring information for search results, understand how search rankings are determined, and debug search performance.

For more information, see
[Customizing Cortex Search scoring](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

---
title: Feb 04, 2026: Sensitive data classification: Classify a subset of native semantic categories (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-04-sensitive-data-classification-subset-categories.md
section: Release Notes
---

# Feb 04, 2026: Sensitive data classification: Classify a subset of native semantic categories (*Preview*)

You can now configure sensitive data classification to limit which types of data are classified as sensitive.
By default, Snowflake classifies data into all [native semantic categories](../../../user-guide/classify-native.md) whenever Snowflake identifies
sensitive data. You can now specify a subset of semantic categories so Snowflake classifies data only if it belongs
to the categories you specify.

You can specify a subset of semantic categories in the following ways:

* **Based on the semantic category**: For example, configure sensitive data classification so tax identifiers (the TAX_IDENTIFIER semantic
  category) are classified as sensitive, but other semantic categories (for example, POSTAL_CODE) are not.
* **Based on a country**: For example, configure Snowflake to classify identifiers in the United States
  as sensitive data, but not identifiers in other countries.

For more information, see [Classify data using a subset of native semantic categories](../../../user-guide/classify-auto.md).

---
title: Feb 05, 2026: Notebooks in Workspaces (General Availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-05-notebooks-in-workspaces.md
section: Release Notes
---

# Feb 05, 2026: Notebooks in Workspaces (*General Availability*)

Snowflake Notebooks in Workspaces is now generally available. This new notebook experience provides
a fully-managed, end-to-end environment for data science and machine learning development on Snowflake data, combining the familiar Jupyter
notebook interface with enterprise-grade compute, governance, and collaboration capabilities.

Notebooks in Workspaces runs on a Container Runtime powered by Snowpark Container Services, offering preconfigured containers optimized for AI/ML workloads with
access to CPUs and GPUs, parallel data loading, and distributed training APIs for popular ML packages.

## Key features

**Integration with Workspaces**

* Notebooks are files in Workspaces, enabling easy file management and organization.
* Git integration provides version control and collaboration across development environments.

**Updates to compute and cost management**

* CPU or GPU compute pools match your workload requirements.
* Shared container service connections reduce start-up time and improve resource utilization.
* Background kernel persistence ensures uninterrupted execution of long-running processes.
* Simplified idle time configuration prevents unused compute resources from running indefinitely.
* Service-level External Access Integration (EAI) management applies to all notebooks in the workspace.

**Jupyter compatibility**

* Standard Jupyter magic commands for familiar development experience.
* Pre-installed data science and machine learning packages.
* Install additional packages via `pip`, PyPI, or file upload.

**Enhanced editing experience**

* Bidirectional SQL and Python cell referencing for seamless language switching.
* Interactive datagrid and automated chart builder for data visualization.
* Enhanced minimap with cell status tracking and table of contents.

For details, see [Snowflake Notebooks in Workspaces](../../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md).

---
title: Feb 05, 2026: Sensitive data classification: Support for semi-structured data (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-05-sensitive-data-classification-json.md
section: Release Notes
---

# Feb 05, 2026: Sensitive data classification: Support for semi-structured data (*General availability*)

Sensitive data classification now supports the VARIANT, ARRAY, and OBJECT data types, which means Snowflake can classify fields in
semi-structured data into native semantic categories as long as the data is in JSON format. For example, if a VARIANT column contains JSON
objects with email addresses and phone numbers, Snowflake can classify the email field as EMAIL and the phone field as PHONE_NUMBER.

For more information, including an example of classification results for JSON data, see [View classification results for JSON columns](../../../user-guide/classify-results.md).

---
title: Feb 05, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-05-dcr.md
section: Release Notes
---

# Feb 05, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 12.9

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Public preview of new clean room architecture.** Snowflake Data Clean Rooms is previewing a new clean room architecture called Collaboration Data Clean Rooms. With Collaboration Data Clean Rooms, you can collaborate in a fully symmetric, multi-party environment. Unlike traditional provider-consumer models, which limit the roles and number of collaborators, the Collaboration API supports flexible roles and fine-grained data access controls for any number of participants. [Read the overview](../../../user-guide/cleanrooms/about.md) and [try out building a new collaboration data clean room yourself](../../../user-guide/cleanrooms/demo-flows/basic-multiparty-collab.md).
* **Updates to various private preview features.**

---
title: Feb 06, 2026: Cortex Code data science and machine learning skill (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-06-cortex-code-data-science-preview.md
section: Release Notes
---

# Feb 06, 2026: Cortex Code data science and machine learning skill (*Preview*)

The Cortex Code CLI now offers the *Data Science and Machine Learning skill*, allowing it to more accurately detect when you want to perform data science or machine learning operations as part of your request. This skill defines how Cortex Code should interact with Snowflake components used for machine learning and data science, including interactions with a Model Registry and running inferrence on Snowpark Container Services.

This skill is automatically loaded into Cortex Code’s context when needed, allowing you to focus on your agent requests rather than determining up front what information the agent might need to succeed at your task.

The data science and machine learning skill is built in to Cortex Code. [Install the Cortex Code CLI](../../../user-guide/cortex-code/cortex-code-cli.md) or run `cortex update` from the command line to update to the latest version. If you’d like to inspect the prompts that make up the data science skill, use the `/skill` command or ask Cortex Code `What does the machine learning and data science skill do?`.

---
title: Feb 06, 2026: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-06-tables-iceberg-query-using-external-query-engine-snowflake-horizon-ga.md
section: Release Notes
---

# Feb 06, 2026: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*General availability*)

Support for querying Snowflake-managed Apache Iceberg™ tables by using any external query engine that supports the open Iceberg REST protocol,
such as Apache Spark™, is now generally available. To ensure this interoperability with external engines,
[Apache Polaris™ (incubating)](https://github.com/apache/polaris) is integrated into Horizon Catalog. You can query these tables in a
Snowflake account by using a single Horizon Catalog endpoint and you can use your existing users, roles, policies, and authentication
in Snowflake.

For more information, see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](../../../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

---
title: Feb 06, 2026: Trust Center Overview tab (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-06-trust-center-overview-tab-preview.md
section: Release Notes
---

# Feb 06, 2026: Trust Center Overview tab (*Preview*)

You can use the Overview tab in the Trust Center to analyze the security posture of your account at a high level.

For more information, see [Trust Center Overview](../../../user-guide/trust-center/overview.md).

---
title: Feb 07, 2025: Snowflake Cortex Fine-tuning (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-07-cortex-finetuning.md
section: Release Notes
---

# Feb 07, 2025: Snowflake Cortex Fine-tuning (*General availability*)

We are pleased to announce the general availability of [Cortex Fine-tuning](../../../user-guide/snowflake-cortex/cortex-finetuning.md),
a fully managed service that lets you fine-tune popular large language models using your data all within Snowflake, in the following regions:

> * AWS US West 2 (Oregon)
> * AWS US East 1 (N. Virginia)
> * AWS Europe Central 1 (Frankfurt)
> * Azure East US 2 (Virginia)

Cortex Fine-tuning allows users to adapt pre-trained models to more specialized tasks. If you don’t want the high cost of training a large model
from scratch but need better latency and results than you’re getting from prompt engineering or even retrieval augmented generation (RAG)
methods, fine-tuning an existing large model is an option. Fine-tuning allows you to use examples to adjust the behavior of the model and
improve the model’s knowledge of domain-specific tasks.

---
title: Feb 09, 2026: Performance Explorer enhancements (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-09-performance-explorer-enhancements-preview.md
section: Release Notes
---

# Feb 09, 2026: Performance Explorer enhancements (*Preview*)

This preview release makes the following enhancements to Performance Explorer:

* Use the new By grouped queries tab in side panels to quickly identify the recurring queries that are
  driving the metrics.
* Narrow investigations with filtering by hour.
* Interactively analyze top contributors to a metric change by dragging over side-panel charts to select
  the time window of interest.
* Compare metrics across time with a new PREVIOUS PERIOD column in side-panel tables.

For more information, see [Analyzing query workloads with Performance Explorer](../../../user-guide/performance-explorer.md).

---
title: Feb 10, 2026: Snowflake Native Apps: Shareback (General Availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-10-nativeapps-shareback.md
section: Release Notes
---

# Feb 10, 2026: Snowflake Native Apps: Shareback (*General Availability*)

Snowflake Native Apps can now securely request permission from consumers to share data back with the provider or designated third parties.

This powerful capability supports essential business needs such as compliance reporting, telemetry and analytics sharing, and data preprocessing by providing a secure, governed channel for data exchange. This feature is now generally available.

For more information, see [Request data sharing with app specifications](../../../developer-guide/native-apps/requesting-app-specs-listing.md).

---
title: Feb 11, 2025: Snowflake Cortex COMPLETE Structured Outputs (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-11-cortex-complete-structured-outputs.md
section: Release Notes
---

# Feb 11, 2025: Snowflake Cortex COMPLETE Structured Outputs (*Preview*)

Snowflake is pleased to announce the preview release of support for structured outputs in the Snowflake Cortex COMPLETE
function, which forces completion results to conform to a user-specified JSON schema. Defining the desired outputs and
their formats via a schema simplifies prompting, reduces the need for post-processing COMPLETE results in your AI data
pipelines, and enables seamless integration with systems that require deterministic responses. COMPLETE Structured
Outputs is available for customers using the SQL and REST API Interfaces.

For more information, see [AI_COMPLETE structured outputs](../../../user-guide/snowflake-cortex/complete-structured-outputs.md).

---
title: Feb 12, 2026: New checkout experience for private offers with flat-fee pricing (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-12-marketplace-checkout-experience-ga.md
section: Release Notes
---

# Feb 12, 2026: New checkout experience for private offers with flat-fee pricing (*General availability*)

When consumers accept a private, flat-fee offer, they now use a new checkout experience to complete their purchase. This new checkout
experience includes any available [Marketplace Capacity Drawdown (MCD)](../../../collaboration/marketplace-capacity-drawdown.md) funds that can
be applied toward the purchase as well as the sales tax amount that will be collected at checkout.

> **Note:**
>
> Sales tax information is only available for consumers in the U.S. and Canada.

For more information, see [View and accept a private offer with flat-fee pricing](../../../user-guide/collaboration/listings/pricing-plans-offers/consumers-manage-offers.md).

---
title: Feb 12, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-12-dcr.md
section: Release Notes
---

# Feb 12, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.2

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Custom registries:** Collaboration Data Clean Rooms users can now add custom registries to organize and manage access to resources in their collaborations. To learn more, read [Registries](../../../user-guide/cleanrooms/registries.md).
* Updates to private preview features.

---
title: Feb 12, 2026: Strong Authentication Hub (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-12-strong-authentication-hub.md
section: Release Notes
---

# Feb 12, 2026: Strong Authentication Hub (*Preview*)

The Strong Authentication Hub is an essential tool that helps you implement strong authentication for all of your users
and prepare your account for the [deprecation of single-factor passwords](../../../user-guide/security-mfa-rollout.md).

The Strong Authentication Hub provides you with:

* **Complete visibility** into your account’s readiness for strong authentication enforcement deadlines, with a clear path to 100% compliance.
* **Risk identification** that pinpoints specific users who need to migrate to stronger authentication methods, including users who have
  logged in with only a password via applications like Power BI, users with inactive accounts, and legacy service users.
* **Step-by-step remediation guidance** that makes it easy to bring users into conformance, with the ability to prioritize by issue type or
  by individual user.
* **Timeline management** that displays enforcement phases and allows you to extend deadlines if you need more time to complete your migration.

The Strong Authentication Hub significantly simplifies the process of meeting Snowflake’s strong authentication requirements. Instead of
manually identifying and remediating non-compliant users, the hub automatically identifies issues and provides targeted instructions to
resolve them. This makes meeting the enforcement deadlines associated with the deprecation of single-factor passwords straightforward and
manageable.

For more information, see [Strong Authentication Hub](../../../user-guide/strong-authentication-hub.md).

> **Note:**
>
> The Strong Authentication Hub is being rolled out slowly and should be available to all accounts by February 20, 2026.

---
title: Feb 13, 2025: Snowpark Container Services (SPCS) Model Serving on Azure (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-13-spcs-model-serving-azure.md
section: Release Notes
---

# Feb 13, 2025: Snowpark Container Services (SPCS) Model Serving on Azure (*Preview*)

Snowflake is pleased to announce that Snowpark Container Services (SPCS) Model Serving is now available in Azure
commercial regions in addition to AWS commercial regions.

SPCS Model Serving lets you deploy machine learning models to a SPCS compute pool, providing scalable high-performance
inference on CPU or GPU. Models can use libraries from any source, and knowledge of container technologies is required.

For more information, see [Deploy models for Real time Inference (REST API)](../../../developer-guide/snowflake-ml/inference/real-time-inference-rest-api.md).

---
title: Feb 13, 2026: Run Security Essentials scanners on demand
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-13-adhoc-security-essentials.md
section: Release Notes
---

# Feb 13, 2026: Run Security Essentials scanners on demand

The Security Essentials scanner package is the baseline security monitoring solution offered within each Snowflake account. This package currently
contains a set of scanners that run regularly on a fixed schedule at no cost to customers. This feature update allows you to run any or all
of the scanners in the Security Essentials package on demand in addition to the regular free runs. Your account will incur serverless compute
cost for additional runs. For more information, see [Security Essentials scanner package](../../../user-guide/trust-center/overview.md).

---
title: Feb 13, 2026: Snowflake Native Apps: Inter-App Communication (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-13-nativeapps-iac.md
section: Release Notes
---

# Feb 13, 2026: Snowflake Native Apps: Inter-App Communication (*Preview*)

Snowflake Native Apps can now securely communicate with other apps in the same account.

This capability allows a Snowflake Native App to enhance the functionality of multiple Snowflake Native Apps in the same consumer account by enabling the sharing and merging of data.

For more information, see [Inter-app Communication](../../../developer-guide/native-apps/inter-app-communication.md).

---
title: Feb 14, 2025: Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-14-document-ai.md
section: Release Notes
---

# Feb 14, 2025: Document AI release notes

The release of a new version of the foundational Arctic-TILT model in Document AI includes
improvements in the following areas:

* Checkbox identification
* Answers to yes/no questions
* Overall model quality across internal and external question-answering benchmarks

These improvements are available to new Document AI model builds.

---
title: Feb 14, 2025: Support for st.file_uploader (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-14-sis.md
section: Release Notes
---

# Feb 14, 2025: Support for `st.file_uploader` (Preview)

You can now use [st.file_uploader](https://docs.streamlit.io/develop/api-reference/widgets/st.file_uploader) in Streamlit in Snowflake.

---
title: Feb 16, 2026: Sharing Streamlit in Snowflake apps (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-16-sis.md
section: Release Notes
---

# Feb 16, 2026: Sharing Streamlit in Snowflake apps (*Preview*)

You can now share your Streamlit in Snowflake apps using app-builder URLs or app-viewer URLs. You can also restrict users
to access only Streamlit in Snowflake apps, preventing them from accessing other parts of Snowflake.

For more information, see [Sharing Streamlit in Snowflake apps](../../../developer-guide/streamlit/features/sharing-streamlit-apps.md).

---
title: Feb 17, 2026: Access history improvements
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-17-access-history.md
section: Release Notes
---

# Feb 17, 2026: Access history improvements

[Access history](../../../user-guide/access-history.md) lets you monitor the SQL statements executed in Snowflake. It keeps track of the
following types of statements:

* Data Manipulation Language (DML) statements. For example, statements used to insert data into a table.
* Data Query Language (DQL) statements. For example, statements that use a SELECT statement to project data.
* Data Definition Language (DDL) statements. For example, statements that create or alter a Snowflake object.

Previously, records that were too large were excluded from the ACCESS_HISTORY view. Now, Snowflake truncates enough data for the record to
fit in the view. Truncated records contain indicators where values have been truncated.

For more information, see [Usage notes: Truncation](../../../sql-reference/account-usage/access_history.md).

---
title: Feb 18, 2026: Account Usage New CORTEX_AGENT_USAGE_HISTORY view (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-18-cortex-agent-usage-history-view.md
section: Release Notes
---

# Feb 18, 2026: Account Usage New CORTEX_AGENT_USAGE_HISTORY view (*Preview*)

The ACCOUNT_USAGE schema now includes a new [CORTEX_AGENT_USAGE_HISTORY](../../../sql-reference/account-usage/cortex_agent_usage_history.md)
view that provides visibility into the usage history of Cortex Agents.

The information in the view includes the number of credits consumed each time a user interacts
with Cortex Agents. A request results in one or more calls to underlying tools (for example, Cortex Analyst and Cortex Search). Each row in the view represents a call to the agent and provides detail about
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, and the agent ID.

For more information, see [CORTEX_AGENT_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_agent_usage_history.md).

---
title: Feb 18, 2026: Account Usage New SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-18-snowflake-intelligence-usage-history-view.md
section: Release Notes
---

# Feb 18, 2026: Account Usage New SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*Preview*)

The ACCOUNT_USAGE schema now includes a new [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY](../../../sql-reference/account-usage/snowflake_intelligence_usage_history_view.md)
view that provides visibility into the usage history of Snowflake Intelligence.

The information in the view includes the number of credits consumed each time a user interacts
with Snowflake Intelligence. A request results in one or more calls to underlying agents and any
tools (for example, Cortex Analyst and Cortex Search). Each row in the view represents a call to the agent and provides detail on
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, Snowflake Intelligence ID, and the agent ID.

For more information, see [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view](../../../sql-reference/account-usage/snowflake_intelligence_usage_history_view.md).

---
title: Feb 18, 2026: Row timestamps for pipeline latency and event tracking (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-18-row-timestamps.md
section: Release Notes
---

# Feb 18, 2026: Row timestamps for pipeline latency and event tracking (*General availability*)

Row timestamps provide a precise, chronological record of when each row in a table was last updated. Rows modified in the
same transaction share the exact same timestamp and rows modified in different transactions are ordered by when they were
committed.

This feature eliminates the need to rely on unpredictable client-side timestamps, empowering data teams to accurately
measure data latency, manage incremental processing without missing out of order events, and establish an audit trail of
changes.

Key use cases include the following:

* **Pipeline observability:** Measure end-to-end latency and data freshness for streaming ingest, CDC, and ETL workloads
  with higher accuracy than client-side timestamps.
* **Reliable incremental processing:** Capture delayed or backfilled records that event timestamps might skip by using
  definitive commit times.
* **Definitive audit trails:** Establish a chronological order of events for regulatory compliance or SCD2-style
  milestoning.

You can enable row timestamps for individual tables, set it as a default at the account/database/schema level, or use a
system function to bulk-enable it for existing tables.

For more information, see [Use row timestamps to measure latency in your pipelines](../../../user-guide/data-engineering/row-timestamps.md).

---
title: Feb 18, 2026: Snowflake Container Runtime versioning for ML Jobs (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-18-container-runtime-versions-preview.md
section: Release Notes
---

# Feb 18, 2026: Snowflake Container Runtime versioning for ML Jobs (*Preview*)

Snowflake Container Runtimes are now versioned releases, giving you the ability to maintain a stable environment for your ML Jobs while upgrading at your own pace. This change starts with the release of Container Runtime version 2.3.

Snowflake uses the latest version of Container Runtime by default when running an ML Job, and with the change to versioned releases, you can now run your ML Job under any versioned Snowflake-provided Container Runtime release. Snowflake still recommends using the latest Container Runtime version to ensure that you have important security updates.

For the full list of available container runtime releases, see [Snowflake Container Runtime release notes](../../../developer-guide/snowflake-ml/container-runtime/releases.md). For more information on runtime versioning with ML Jobs, see [Snowflake ML Jobs](../../../developer-guide/snowflake-ml/ml-jobs/overview.md).

---
title: Feb 18, 2026: Support for changing refresh user and secondary roles
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-18-dynamic-tables-execute-as-user.md
section: Release Notes
---

# Feb 18, 2026: Support for changing refresh user and secondary roles

You can now configure dynamic tables to refresh with the privileges of a specific user, in addition to privileges of the owner role.
Dynamic tables that specify EXECUTE AS USER run on behalf of the named user, instead of the system user.

For example, you can grant a user a primary role that provides access to a table and a secondary role that provides access to a virtual
warehouse. The user can then create a dynamic table that operates with the combined privileges of both roles, simplifying permissions
management and enhancing the flexibility of your data operations.

Key use cases include:

* Unified privileges: Enables access to resources spread across multiple roles within a single refresh session.
* Enhanced accountability: Attributes all refresh activity to a specific individual for compliance and auditing.
* Governance control: Supports granular security through the IMPERSONATE privilege and ensures data policies, like masking, are evaluated against the correct user context.

For more information, see [Refresh dynamic tables with specific user privileges and secondary roles](../../../user-guide/dynamic-tables-privileges.md) and [CREATE DYNAMIC TABLE](../../../sql-reference/sql/create-dynamic-table.md).

---
title: Feb 19, 2025: Snowflake ML Model Serving Automatic Suspension (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-19-spcs-model-serving-auto-suspend.md
section: Release Notes
---

# Feb 19, 2025: Snowflake ML Model Serving Automatic Suspension (*Preview*)

Snowflake is pleased to announce that model inference services created by Snowflake ML Model Serving now automatically
suspend after thirty minutes of inactivity. They are restarted upon the next incoming request. To allow models to be
available at all times to service requests from the public Internet, automatic suspension is disabled for services with
HTTP ingress enabled.

For more information, see [Deploy models for Real time Inference (REST API)](../../../developer-guide/snowflake-ml/inference/real-time-inference-rest-api.md).

---
title: Feb 19, 2026: Machine learning experiments (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-19-ml-experiments-ga.md
section: Release Notes
---

# Feb 19, 2026: Machine learning experiments (*General availability*)

Snowflake ML Experiments, which allows you to set up organized evaluations of model training results,
is now generally available. With experiments, you can quickly compare results of different training runs and select the best model
for your needs.

For more information, see [Run an experiment to compare and select models](../../../developer-guide/snowflake-ml/experiments.md).

---
title: Feb 19, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-19-dcr.md
section: Release Notes
---

# Feb 19, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.3

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Additional statuses returned by `GET_STATUS`:

  + INSTALLING: Installing the application package and preparing the collaboration details for review.
  + IN_REVIEW: The collaboration is installed and ready for review.
  + INSTALLATION_FAILED: Installation failed; application package not installed, and can’t be reviewed.
* Updates to private preview features.

---
title: Feb 20, 2026: Snowflake Native Apps: Configuration (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-20-nativeapps-configuration.md
section: Release Notes
---

# Feb 20, 2026: Snowflake Native Apps: Configuration (*Preview*)

Snowflake Native Apps can now request configuration values from consumers using application configurations.

This capability allows an app to define configuration keys that request specific information from the consumer, such as the name of a server app for inter-app communication, or an arbitrary string value like an external URL or account identifier. Configurations can be marked as sensitive to protect values such as API keys or access tokens from exposure in query history and command output.

For more information, see [Application configuration](../../../developer-guide/native-apps/app-configuration.md).

---
title: Feb 20, 2026: USE AI FUNCTIONS account privilege for Cortex AI Functions
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-20-use-ai-functions-privilege.md
section: Release Notes
---

# Feb 20, 2026: USE AI FUNCTIONS account privilege for Cortex AI Functions

Snowflake now provides the USE AI FUNCTIONS account privilege to control access to Cortex AI Functions. This privilege allows administrators to manage which roles can use AI Functions across the account.

By default, the USE AI FUNCTIONS privilege is granted to the PUBLIC role, allowing all users to access Cortex AI Functions. Account administrators can revoke this privilege from PUBLIC and grant it to specific roles for more granular access control.

Users need both the USE AI FUNCTIONS account privilege and the CORTEX_USER database role to use all Snowflake Cortex AI Functions. Note that AI_AGG and AI_SUMMARIZE_AGG functions remain accessible to users with only the CORTEX_USER role.

For more information, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: Feb 23, 2026: Grouped Query History in Snowsight (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-23-grouped-query-history-ui.md
section: Release Notes
---

# Feb 23, 2026: Grouped Query History in Snowsight (*General availability*)

You can use the Grouped Query History view in Snowsight to monitor usage
and performance of critical and frequently run queries. This graphical view is based on information
that is recorded in the [AGGREGATE_QUERY_HISTORY view](../../../sql-reference/account-usage/aggregate_query_history.md).

The Grouped Query History view is particularly useful for monitoring and analyzing
[Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/)
that execute a small number of distinct statements repeatedly at high throughput.

For more information, see [Use the Grouped Query History view in Snowsight](../../../user-guide/ui-snowsight-activity.md).

---
title: Feb 23, 2026: Simplified setup for Data Quality Monitoring
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-23-data-quality-monitoring-setup.md
section: Release Notes
---

# Feb 23, 2026: Simplified setup for Data Quality Monitoring

Data quality checks in Snowflake continuously validate the health of your data, helping you comply with regulatory standards, meet
service-level agreements, and build credibility in data-driven decisions through automated, consistent data validation.

This preview simplifies the setup of data quality checks.

## Cortex Data Quality (*Preview*)

Cortex Data Quality uses AI to intelligently suggest data quality checks based on characteristics of your metadata and usage patterns.
Leveraging the [Snowflake Cortex AI_COMPLETE function](../../../sql-reference/functions/ai_complete.md), it eliminates the need to manually define quality checks, which accelerates the
setup process and allows someone without deep domain expertise to implement checks.

For more information, see [Set up quality checks using Cortex Data Quality](../../../user-guide/data-quality-ui-setup.md).

## User interface for creating data quality checks (*Preview*)

You can now set up and manage data quality checks directly in [Snowsight](../../../user-guide/ui-snowsight-gs.md). The user interface lets you create quality checks
using two strategies: accept AI-suggested checks from Cortex Data Quality, or manually define checks based on your knowledge of your data.
You can define rules, monitor results, and investigate quality issues without writing SQL.

For more information, see [Use Snowsight to set up data quality checks](../../../user-guide/data-quality-ui-setup.md).

---
title: Feb 24, 2025: Changes to the app toolbar in Snowsight
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-24-na-toolbar-change.md
section: Release Notes
---

# Feb 24, 2025: Changes to the app toolbar in Snowsight

With this release, the configuration information previously accessed from the Readme, Security,
and Manage Access controls of a Snowflake Native App is consolidated into the Settings
page of the app.

Providers may need to update any custom documentation they have sent to consumers.

> **Note:**
>
> This change is only applicable to Snowsight. There are no underlying changes to the functionality
> of an app.

---
title: Feb 24, 2026: Enforcement of privatelink-only access (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-24-enforce-privatelink-access-only.md
section: Release Notes
---

# Feb 24, 2026: Enforcement of privatelink-only access (*General availability*)

Enforcement of privatelink-only access, which disables public access to your accounts, is now generally available.
For more information, see [Enforcement of privatelink-only access](../../../user-guide/security-disable-public-access-privatelink.md).

---
title: Feb 24, 2026: Snowflake Postgres (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-24-snowflake-postgres-ga.md
section: Release Notes
---

# Feb 24, 2026: Snowflake Postgres (*General availability*)

Snowflake Postgres is now generally available. Snowflake Postgres lets you create, manage, and use
Postgres instances directly from Snowflake. Each instance runs a Postgres database server on a
dedicated virtual machine managed by Snowflake. You connect directly to your instances using any
Postgres client. Snowflake Postgres brings the reliable and trusted transactional database
capabilities of Postgres to the Snowflake data platform.

Snowflake Postgres is available for selected AWS and Azure regions. For information about supported
cloud service providers and regions, see [Regional availability](../../../user-guide/snowflake-postgres/about.md).

For more information, see [Snowflake Postgres](../../../user-guide/snowflake-postgres/about.md).

---
title: Feb 24, 2026: User-defined actions for budgets
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-24-budget-user-defined-actions.md
section: Release Notes
---

# Feb 24, 2026: User-defined actions for budgets

You can now configure budgets to automatically call stored procedures at key points during a budget cycle, such as when
a spending threshold is reached or when the cycle restarts.

**Custom actions:** You can now configure a budget to automatically call a stored procedure when a spending threshold is
reached. This lets you take automated actions in response to credit consumption, such as suspending warehouses, sending
custom alerts, or logging spending events to a table.

When you define a custom action, you specify whether it triggers based on projected or actual credit consumption, then
you set the threshold percentage. You can add up to 10 custom actions per budget.

For more information, see [Custom actions for budgets](../../../user-guide/budgets/custom-actions.md).

**Cycle-start actions:** You can now configure a budget to automatically call a stored procedure when the budget cycle
restarts at the beginning of its monthly period. This lets you run automated actions at the start of each cycle, such as
re-enabling warehouses or sending notifications. Cycle-start actions are particularly useful for reversing actions that
were triggered by custom actions during the previous budget cycle.

For more information, see [Cycle-start actions for budgets](../../../user-guide/budgets/cycle-start-actions.md).

---
title: Feb 24, 2026: View invoices in Snowsight
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-24-billing-invoices.md
section: Release Notes
---

# Feb 24, 2026: View invoices in Snowsight

You can now use Snowsight to view and download billing invoices for an On Demand account. An On Demand account is one that doesn’t
have a capacity contract with Snowflake.

For more information, see [Access billing invoices](../../../user-guide/billing-invoices.md).

---
title: Feb 25, 2026: Account Usage CORTEX_AGENT_USAGE_HISTORY view (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-25-cortex-agent-usage-history-view.md
section: Release Notes
---

# Feb 25, 2026: Account Usage CORTEX_AGENT_USAGE_HISTORY view (*General availability*)

The [CORTEX_AGENT_USAGE_HISTORY](../../../sql-reference/account-usage/cortex_agent_usage_history.md)
view in the ACCOUNT_USAGE schema is now generally available. This view provides visibility into the usage history of Cortex Agents.

The information in the view includes the number of credits consumed each time a user interacts
with Cortex Agents. A request results in one or more calls to underlying tools (for example, Cortex Analyst and Cortex Search). Each row in the view represents a call to the agent and provides detail about
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, and the agent ID.

For more information, see [CORTEX_AGENT_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_agent_usage_history.md).

---
title: Feb 25, 2026: Account Usage SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-25-snowflake-intelligence-usage-history-view.md
section: Release Notes
---

# Feb 25, 2026: Account Usage SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*General availability*)

The [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY](../../../sql-reference/account-usage/snowflake_intelligence_usage_history_view.md)
view in the ACCOUNT_USAGE schema is now generally available. This view provides visibility into the usage history of Snowflake Intelligence.

The information in the view includes the number of credits consumed each time a user interacts
with Snowflake Intelligence. A request results in one or more calls to underlying agents and any
tools (for example, Cortex Analyst and Cortex Search). Each row in the view represents a call to the agent and provides detail about
the aggregated tokens and credits in the call as well as granular detail. The view also includes
relevant metadata, such as the user ID, request ID, Snowflake Intelligence ID, and the agent ID.

For more information, see [SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view](../../../sql-reference/account-usage/snowflake_intelligence_usage_history_view.md).

---
title: Feb 25, 2026: Joining logical tables that contain ranges of values in a semantic view (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-25-semantic-views-range-joins.md
section: Release Notes
---

# Feb 25, 2026: Joining logical tables that contain ranges of values in a semantic view (*Preview*)

You can use a *range join* when you want to join a table with another table that defines a range of possible values in the
first table. For example, suppose that one table represents sales orders and has a column with the timestamp when the order
was placed. Suppose that another table represents fiscal quarters and contains the distinct ranges of time that represent
these quarters. You can create a semantic view that joins the two tables so that the row for an order includes the fiscal
quarter in which the order was placed.

Support for range joins is in [Preview](../../preview-features.md).

For information, see [Joining logical tables that contain ranges of values](../../../user-guide/views-semantic/sql.md).

---
title: Feb 26, 2025: Generating connection settings for a client, driver, library, or third-party application
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-26-connect-to-snowflake.md
section: Release Notes
---

# Feb 26, 2025: Generating connection settings for a client, driver, library, or third-party application

We are pleased to announce that you can now use Snowsight to generate the connection settings for a client, driver,
library, or third-party application.

You can use the new Account Details dialog to generate a connection string for the ODBC or JDBC driver or settings in
TOML file format for Snowflake CLI, Snowflake Python APIs, or the Snowflake Connector for Python. The generated configuration information
includes the following:

* Your account identifier
* Settings for authenticating to Snowflake
* The warehouse, database, and schema to use for the session

The Account Details dialog also provides quick access to the SQL commands that you can execute to get the settings yourself
(for example, if you need to specify the settings in a format other than a connection string or a TOML file).

For more information, see [Configuring a client, driver, library, or third-party application to connect to Snowflake](../../../user-guide/gen-conn-config.md).

---
title: Feb 26, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-26-dcr.md
section: Release Notes
---

# Feb 26, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.4

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* The INITIALIZE procedure now can automatically join the caller to the collaboration when you provide the optional `auto_join_warehouse`
  parameter. If using this parameter with a custom role that was granted the CREATE COLLABORATION privilege, the role must also be granted the EXECUTE TASK account-level privilege.
* Updates to private preview features.

---
title: Feb 27, 2025: Snowflake Data Clean Rooms release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-27-dcr.md
section: Release Notes
---

# Feb 27, 2025: Snowflake Data Clean Rooms release notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

> **Note:**
>
> You must sign out and back in to the web app (UI) for these updates to take effect.

## UI loading improvements

UI load times have been improved in key user flows in clean room and analysis listing pages as well as when navigating across steps within
the clean room creation and installation flows.

## External and Apache Iceberg™ table support in SQL templates

Privacy policies used in the SQL template within the UI are now supported on external and Apache Iceberg tables. Users can now leverage
these objects in scenarios where they would like to enable free-form querying on their data while enforcing necessary privacy protection on
their datasets.

## Data Clean Rooms available with data sharing terms

Previously, customers were required to accept our Provider and Consumer Terms to onboard and use Snowflake Data Clean Rooms. Now customers
can onboard and use Snowflake Data Clean Rooms under our [Customer-Controlled Data Sharing Functionality Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/data-sharing-terms/), which are included within our standard Service terms. If these
terms have not yet been accepted, please reach out to [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) to accept these terms.

## Improvements to provider-linked views in the API

Previously, a provider linking a view using the developer APIs needed to explicitly grant reference usage to all underlying databases
referenced by the view. Now reference usage grants are applied automatically to underlying referenced objects when linking a view. Please
note that the underlying object must still be registered for usage in clean rooms.

## Multi-template approval

Previously, providers could only approve one template at a time submitted by a consumer to be used in the clean room. With this release,
providers can approve multiple templates in a single request with the `provider.approve_multiple_template_requests` procedure.

## Change in UI form handling with custom templates

If you providing a custom web form, any UI elements that have a `references` field that returns column names auto-populated by Snowflake will now return properly P/C aliased column names. Values accessed in the template should either be processed by IDENTIFIER or the `sqlsafe` filter in the template, and should not be aliased explicitly in the template.

For example, the following two elements passed in to `provider.add_ui_form_customizations` use `references` to auto-populate column names into template variables `reference_provider_join` and `reference_consumer_column`:

```text
  'reference_provider_join': {
    'display_name': 'Provider join column',
    'description': 'Which provider col do you want to join on',
    'references': ['PROVIDER_JOIN_POLICY'],
    'provider_parent_table_field': 'source_table',
    'type': 'dropdown'
  },
  'reference_consumer_column': {
    'display_name': 'Consumer join column',
    'description': 'Which consumer col do you want to join on',
    'references': ['CONSUMER_COLUMNS'],
    'consumer_parent_table_field': 'my_table',
    'type': 'dropdown'
  }
```

Previously a custom template would have needed to qualify these values with `p.` or `c.` as shown here:

```sqlexample
SELECT COUNT(*) AS cnt_agg FROM identifier({{ source_table[0] }}) AS P
  JOIN IDENTIFIER ({{ my_table[0] }}) AS C
  ON p.{{ reference_provider_join | sqlsafe }} = c.{{ reference_consumer_join | sqlsafe }};
```

With this change you should omit the `p.` and `c.` qualifiers from the template, as they will be provided directly to the variable:

```sqlexample
SELECT COUNT(*) AS cnt_agg FROM identifier({{ source_table[0] }}) AS P
  JOIN IDENTIFIER ({{ my_table[0] }}) AS C
  ON {{ reference_provider_join | sqlsafe }} = {{ reference_consumer_join | sqlsafe }};
```

---
title: Feb 27, 2025: Snowflake Native Apps release channels (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-27-na-release-channels.md
section: Release Notes
---

# Feb 27, 2025: Snowflake Native Apps release channels (*Preview*)

In this release, the Snowflake Native App Framework adds support for release channels. Release channels allow providers to publish apps preview versions of the app to consumers.

By installing preview versions of an app, consumers can preview the app and perform user acceptance testing. Release channels simplify the testing environment for both providers and consumers.

See [Publish an app using release channels](../../../developer-guide/native-apps/release-channels.md) for more information.

---
title: Feb 27, 2026: Openflow Connector for Oracle (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-27-openflow-oracle-ga.md
section: Release Notes
---

# Feb 27, 2026: Openflow Connector for Oracle (*General availability*)

The Openflow Connector for Oracle is now generally available. The connector replicates
Oracle database tables into Snowflake in near real-time or on a
specified schedule using change data capture (CDC).

The connector supports Oracle 12cR2 and later on on-premises servers,
Oracle Exadata, OCI VM/Bare Metal, and AWS RDS for Oracle
(Custom and Standard Single-tenant).

Two licensing models are available:

* **Embedded License**: Snowflake provides the Oracle XStream license
  for a fee, drawn from your Snowflake capacity. Includes a 60-day
  free trial, after which a non-cancelable 36-month billing term begins
  automatically.
* **Independent License (BYOL)**: Use your own Oracle license that
  includes XStream entitlements. No additional licensing fees from
  Snowflake.

For more information, see
[About Openflow Connector for Oracle](../../../user-guide/data-integration/openflow/connectors/oracle/about.md).

---
title: Feb 27, 2026: Restricted caller’s rights in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-27-sis-restricted-callers-rights.md
section: Release Notes
---

# Feb 27, 2026: Restricted caller’s rights in Streamlit in Snowflake (*Preview*)

You can now configure Streamlit in Snowflake apps using container runtimes to use restricted caller’s rights, allowing
apps to run with the viewer’s privileges instead of the owner’s. In container runtimes, you can
mix owner’s rights and restricted caller’s rights connections in the same app.

For more information, see [Restricted caller’s rights and Streamlit in Snowflake](../../../developer-guide/streamlit/features/restricted-callers-rights.md).

---
title: Feb 28, 2025: Increased max_cluster_count limits for multi-cluster warehouses
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-28-increased-max_cluster_count-limits.md
section: Release Notes
---

# Feb 28, 2025: Increased max_cluster_count limits for multi-cluster warehouses

You now have more flexibility when specifying upper limits for the MAX_CLUSTER_COUNT property
in [Multi-cluster warehouses](../../../user-guide/warehouses-multicluster.md).

The upper limit for MAX_CLUSTER_COUNT is no longer restricted to 10. Instead, the upper limit
varies depending on the warehouse size.
Currently, you must use a SQL command, not Snowsight, to specify an upper limit higher than 10.

The scaling policies for multi-cluster warehouses now can increase or decrease the capacity
of a warehouse by more than one cluster at a time.

For more information, see [Upper limit on number of clusters for a multi-cluster warehouse](../../../user-guide/warehouses-multicluster.md).

---
title: Feb 4, 2026: Object tagging support for interactive tables
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-04-interactive-tagging.md
section: Release Notes
---

# Feb 4, 2026: Object tagging support for interactive tables

Interactive tables now support object tagging. You can set tags in CREATE INTERACTIVE TABLE commands, and
in ALTER TABLE commands for interactive tables.

Snowflake already supported object tagging for interactive warehouses.

By using Snowflake object tagging features, you can apply data governance and cost tracking practices to
your interactive tables and interactive warehouses.

For more information about using object tagging with interactive warehouses and interactive tables, see the
following topics:

* [CREATE INTERACTIVE TABLE](../../../sql-reference/sql/create-interactive-table.md)
* [ALTER TABLE](../../../sql-reference/sql/alter-table.md)
* [CREATE INTERACTIVE WAREHOUSE](../../../sql-reference/sql/create-interactive-warehouse.md)
* [ALTER WAREHOUSE](../../../sql-reference/sql/alter-warehouse.md)
* [Introduction to object tagging](../../../user-guide/object-tagging/introduction.md)

---
title: Feb 7, 2025: Support for material icons (General Availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-02-07-sis.md
section: Release Notes
---

# Feb 7, 2025: Support for material icons (General Availability)

You can now use material icons in Streamlit in Snowflake.

---
title: February 05-06, 2024 — 8.5 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_05.md
section: Release Notes
---

# February 05-06, 2024 — 8.5 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Security Updates

### External API authentication and secrets — *General Availability*

With this release, we are pleased to announce the general availability of external API authentication and the secret object.
A secret is a schema-level object that can store sensitive information, such as but not limited to OAuth refresh token values,
authentication credentials such as a username and password, and other sensitive string values.

The advantage of using a secret is that it enhances your security posture because only Snowflake itself can access the sensitive
information the secret stores. For example, when Snowflake authenticates to an external service, such as ServiceNow, it accesses
the credentials in the secret programmatically. Similarly, if the handler code in a stored procedure references a secret,
Snowflake accesses the secret programmatically when you call that stored procedure. After you create the secret, users cannot
access sensitive information that the secret stores.

For details, see [External API authentication and secrets](../../user-guide/api-authentication.md).

## Extensibility Updates

### External network access — *General Availability*

With this release, we are pleased to announce the general availability of external network access, with which you can access network
locations external to Snowflake from within procedure and UDF handler code. This GA release is available on AWS and Azure except in
the Gov region. External network access remains in preview for accounts using GCP.

When setting up external network access, you create a network rule that represents the external network location. If your handler
code will need to authenticate with the external location, you create a secret containing the credentials needed. In handler code,
you can use APIs to retrieve credential values from the secret.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

### Python packages policies — *General Availability*

With this release, we are pleased to announce the general availability of Python packages policies.

Using a packages policy enables you to set allowlists and blocklists for third-party Python packages from Anaconda at the account
level. This lets you meet stricter auditing and security requirements and gives you more fine-grained control over which packages
are available or blocked in your environment.

For more information, see [Packages policies](../../developer-guide/udf/python/packages-policy.md).

## Data Loading / Unloading Updates

### COPY FILES — *Preview*

With this release, we are pleased to announce the preview of the COPY FILES command. You can use COPY FILES to copy files
from one named stage to another.

For details, see [COPY FILES](../../sql-reference/sql/copy-files.md).

## Data Governance Updates

### Data Classification: Asynchronous tag assignments for columns of tables in a schema and automate tagging for a single classification event — *Preview*

With this release, we are pleased to announce the preview of asynchronous classification of columns for tables in a schema
using SQL and Snowsight. This update enables the option for the classification and tag assignment actions to take place at
different times and by different personas: for example, a data steward initiates the classification process and a tag administrator
assigns the tags to columns later.

Additionally, you can choose to automate the tag assignments for a single classification event. The automation speeds up the data
classification process by removing the need to manually interpret the classification results and assign tags to columns.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 05-Feb-24 |

---
title: February 07-08, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-02-07.md
section: Release Notes
---

# February 07-08, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Write Snowpark code in Python worksheets —– *General Availability*

With this release, we are pleased to announce the general availability of Python worksheets in Snowsight. Python worksheets let
you write and run Snowpark Python code in a worksheet in Snowsight. You can now use Python version 3.11 or another
supported version.

In a Python worksheet, you can do the following:

* Write a Python script to read data from a stage, transform it, and save it to a table, all without leaving Snowsight.
* Use included packages from Anaconda or import packages from a stage to write code more easily.
* Automate your Python code by deploying it as a stored procedure and scheduling it as a task.

For more information, see [Writing Snowpark Code in Python Worksheets](../../../developer-guide/snowpark/python/python-worksheets.md).

## Recover worksheets for dropped users —– *General Availability*

With this release, we are pleased to announce the general availability of recovering Snowsight worksheets for users that have
been dropped from Snowflake. You can recover up to 500 worksheets for each dropped user.

For more details, see [Recover worksheets owned by a dropped user](../../../user-guide/ui-snowsight-worksheets.md).

## Get Started page for some accounts —– *Removed*

With this release, the Get Started page that was available to some trial and Snowflake On Demand accounts has been removed.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| New navigation menu (Preview) | *Removed* | 02-08-24 |

---
title: February 12-14, 2024 — 8.6 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_06.md
section: Release Notes
---

# February 12-14, 2024 — 8.6 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## SQL Updates

### New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Geospatial | [ST_INTERSECTION_AGG](../../sql-reference/functions/st_intersection_agg.md) (Preview) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the shape containing the combined set of points that are common to the shapes represented by the objects in the column, such as the intersection of the shapes. |
| Geospatial | [ST_UNION_AGG](../../sql-reference/functions/st_union_agg.md) (Preview) | Given a GEOGRAPHY column, returns a GEOGRAPHY object that represents the combined set of points that are in at least one of the shapes represented by the objects in the column, such as the union of the shapes. |

## Data Loading / Unloading Updates

### Specify an external ID for AWS storage access

With this release, we are pleased to announce support for specifying an external ID when you create an external volume or storage
integration. Specifying an external ID lets you use the same ID to configure a trust relationship between Snowflake and AWS for
multiple objects.

For more information, see:

* [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md)
* [CREATE STORAGE INTEGRATION](../../sql-reference/sql/create-storage-integration.md)

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 12-Feb-24 |

---
title: February 12-14, 2024 — New navigation for Snowsight —– Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-12.md
section: Release Notes
---

# February 12-14, 2024 — New navigation for Snowsight —– *Preview*

With this release, we are pleased to announce the preview of a new navigation menu in Snowsight. The new menu organizes various
features into categories for more intuitive navigation.

When working with worksheets, Streamlit apps, or dashboards, you can quickly navigate to other pages in Snowsight with the new
collapsed format of the navigation menu.

The most noticeable changes include:

* Worksheets, dashboards, and Streamlit are part of Projects.
* Sharing and app development resources like Provider Studio, Apps, Marketplace, and Partner Connect are part of Data Products.
* The Activity section is renamed Monitoring.
* Change your role, update your profile, get your account URL and identifier, and access support from one centralized user menu, located
  at the bottom of the navigation menu.

See [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md) for more details.

---
title: February 15, 2024 — Aggregation and Projection Policies Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-15-policies.md
section: Release Notes
---

# February 15, 2024 — Aggregation and Projection Policies Release Notes

## Aggregation Policies — *Preview*

With this release, we are pleased to announce the preview of aggregation policies, which protect the privacy of individual rows by requiring
analysts to run queries that aggregate data rather than retrieving individual rows. When defining an aggregation policy, you specify a
minimum group size, which determines how many rows must be included in each aggregation group. Once you assign the aggregation policy to a
table or view, queries must aggregate data into groups that contain enough rows to meet the minimum group size requirement.

For more information, see [Aggregation policies](../../../user-guide/aggregation-policies.md).

## Projection Policies — *Preview*

With this release, we are pleased to announce the preview of projection policies, which prevent queries from using a SELECT statement to
project a column. The projection policy defines which users should be blocked from projecting columns and which users should be allowed.
You then assign the projection policy to a column in a table or view to control who can project the column.

For more information, see [Projection policies](../../../user-guide/projection-policies.md).

---
title: February 15, 2024 — Geospatial Functions Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-15.md
section: Release Notes
---

# February 15, 2024 — Geospatial Functions Release Notes

## H3 Functions for GEOGRAPHY Objects — *General Availability*

H3 functions for GEOGRAPHY objects are now generally available. [H3](https://h3geo.org/docs/) is a
[hierarchical geospatial index](https://h3geo.org/docs/highlights/indexing/) that partitions the world into hexagonal cells in
a [discrete global grid system](https://en.wikipedia.org/wiki/Discrete_global_grid). Snowflake provides the SQL functions that
enable you to use H3 with geospatial data that are either stored in a [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) object or as
discrete latitude and longitude columns.

For more information, see [Using GEOGRAPHY objects with H3](../../../sql-reference/data-types-geospatial.md).

---
title: February 15, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-02-14.md
section: Release Notes
---

# February 15, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## New tutorials for trial accounts

Snowflake has added new Snowsight worksheets that are available when you create a 30-day trial account. These worksheets contain
sample SQL statements or Python code. You can execute the statements or run the code in these worksheets to learn how to accomplish a
specific task.

The following new tutorials in the documentation walk you through these worksheets:

* [Create users and grant roles](../../../user-guide/tutorials/users-and-roles-tutorial.md) - Create a user and grant a role to it by using SQL commands.
* [Load and query sample data using SQL](../../../user-guide/tutorials/tasty-bytes-sql-load.md) - Load and query data for a fictitious food truck brand named Tasty Bytes in Snowflake using SQL.
* Load data from cloud storage into Snowflake using SQL. The tutorial for this worksheet is available in two versions:

  + [Load data from cloud storage: Amazon S3](../../../user-guide/tutorials/load-from-cloud-tutorial.md)
  + [Load data from cloud storage: Microsoft Azure](../../../user-guide/tutorials/load-from-cloud-tutorial-azure.md)

  The worksheet itself also covers the steps and commands for Google Cloud Storage (GCS).

## Larger maximum file size —– *General Availability*

Snowflake has increased the maximum file size from 50 MB to 250 MB for the following use cases:

* [Load data using Snowsight](../../../user-guide/data-load-web-ui.md).
* [Upload files onto a named internal stage](../../../user-guide/data-load-local-file-system-stage-ui.md).

---
title: February 19-21, 2024 — 8.7 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_07.md
section: Release Notes
---

# February 19-21, 2024 — 8.7 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Behavior Change Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_02](../bcr-bundles/2024_02_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_01](../bcr-bundles/2024_01_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_08](../bcr-bundles/2023_08_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for March 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL Updates

### SQL Functions Add Support for the `upper`, `lower`, and `trim` Collation Specifiers

Collation allows you to specify alternative rules for comparing strings, which can be used to compare and sort data according to a particular language or other user-specified rules. Snowflake supports different [collation specifiers](../../sql-reference/collation.md).

The `upper` specifier converts strings to uppercase before performing comparisons. The `lower` specifier converts strings to lowercase before performing comparisons. The `trim` specifier removes leading and trailing spaces before performing comparisons. The 2024_02 behavior change bundle includes changes that add support for the upper, lower, and trim specifiers to the following functions:

* [CHARINDEX](../../sql-reference/functions/charindex.md)
* [CONTAINS](../../sql-reference/functions/contains.md)
* [ENDSWITH](../../sql-reference/functions/endswith.md)
* [ILIKE](../../sql-reference/functions/ilike.md)
* [ILIKE ANY](../../sql-reference/functions/ilike_any.md)
* [LIKE](../../sql-reference/functions/like.md)
* [LIKE ALL](../../sql-reference/functions/like_all.md)
* [LIKE ANY](../../sql-reference/functions/like_any.md)
* [POSITION](../../sql-reference/functions/position.md)
* [REPLACE](../../sql-reference/functions/replace.md)
* [SPLIT](../../sql-reference/functions/split.md)
* [SPLIT_PART](../../sql-reference/functions/split_part.md)
* [STARTSWITH](../../sql-reference/functions/startswith.md)

Combinations with `upper`, `lower`, and `trim` are also supported (for example, `upper-trim` and `lower-trim`), except for locale combinations (for example, `en-upper`).

> **Note:**
>
> In order to use these functions with the `upper`, `lower`, and `trim` specifiers, you must [enable the 2024_02 bundle in your account](../bcr-bundles/managing-behavior-change-releases.md).
>
> To enable this bundle in your account, execute the following statement:
>
> ```sqlexample
> SELECT SYSTEM$ENABLE_BEHAVIOR_CHANGE_BUNDLE('2024_02');
> ```

## Data Lake Updates

### GET_DDL for external tables supports fully-qualified location names

With this release, we are pleased to announce that the [GET_DDL](../../sql-reference/functions/get_ddl.md) function for an external table now returns the fully-qualified name of the `LOCATION` value when you specify `TRUE` for the `use_fully_qualified_names_for_recreated_objects` argument.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 19-Feb-24 |
| *ASOF JOIN — Preview* | **Added** to *SQL Updates* | 28-Feb-24 |
| *ASOF JOIN — Preview* | **Moved** from *SQL Updates* to [February 28, 2024 — ASOF JOIN Release Notes](other/2024-02-28.md) | 28-Feb-24 |

---
title: February 20 - March 5, 2024 — Universal Search in Snowsight –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-20.md
section: Release Notes
---

# February 20 - March 5, 2024 — Universal Search in Snowsight –— *Preview*

With this release, we are pleased to announce the preview of Universal Search in Snowsight.
With Universal Search, you can quickly and easily find database objects in your account, data products available to you in the
Snowflake Marketplace, relevant Snowflake Documentation topics, and relevant Snowflake Community Knowledge Base articles.

You can use natural language to search because Universal Search understands your query and information about your database objects and
can find objects with names that differ from your search terms.

For more details, see [Search Snowflake objects and resources](../../../user-guide/ui-snowsight-universal-search.md).

> **Note:**
>
> With this preview release, Universal Search is available to accounts in the following Snowflake regions:
>
> * AWS EU (Zurich)
> * GCP Europe West2 (London)
> * GCP Europe West4 (Netherlands)
> * GCP US Central1 (Iowa)
> * GCP US East4 (N. Virginia)

---
title: February 20, 2024 — Hybrid Tables Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-20-hybrid-tables.md
section: Release Notes
---

# February 20, 2024 — Hybrid Tables Release Notes

This document provides an introduction to the new feature **Hybrid Tables**.

With this release, we are pleased to announce the preview of hybrid tables.
Hybrid tables is a new Snowflake table type that provides optimized
performance on row-oriented read and write operations in transactional and
hybrid workloads. Hybrid tables features include the availability of indexes
for faster access to data, and the enforcement of primary, unique, and foreign
key constraints.

With this preview release, hybrid tables is available to accounts in the
following Amazon Web Services (AWS) regions:

> | Cloud Region | Cloud Region ID |
> | --- | --- |
> | US West (Oregon) | us-west-2 |
> | EU (Ireland) | eu-west-1 |
> | Asia Pacific (Sydney) | ap-southeast-2 |

Hybrid tables will be made available in additional regions in the near future.

For more information, see [Hybrid tables](../../../user-guide/tables-hybrid.md).

---
title: February 2023
source: https://docs.snowflake.com/en/release-notes/2023-02.md
section: Release Notes
---

# February 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Access History: Track Masking & Row Access Policy References — *Preview*

With this release, Snowflake is pleased to announce that for queries on a table or view protected by a row access policy and a column
protected by a masking policy, the enforced masking and row access policies are tracked in the Account Usage ACCESS_HISTORY view. Policy
references are tracked in the new column, `policies_referenced`. This new column includes support for intermediate objects and
columns that are policy protected. Audits on policy protected objects and columns are easier because auditors have a more unified view of
how protected data is referenced without having to do complex joins on multiple Account Usage views.

For details, refer to [Access History](../user-guide/access-history.md) and
[ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md).

### Error Notifications for Snowpipe and Tasks — *General Availability*

With this release, Snowflake is pleased to announce the general availability of error notifications for Snowpipe and Tasks. Both Snowpipe
and Snowflake Tasks can push error notifications to the cloud messaging services when errors are encountered.

* Snowpipe notifications describe the errors encountered in each file as they are loaded, enabling further analysis of the data in the
  files.
* Snowflake task error notifications describe errors encountered when executing a task or its dependent tasks.

Previously, task error notifications were available only for Snowflake Accounts hosted on Amazon Web Services. With this release, this
feature is now available for Snowflake Accounts hosted on Google Cloud Platform and Microsoft Azure.

For more information, refer to [Snowpipe error notifications](../user-guide/data-load-snowpipe-errors.md) and [Set up error notifications for tasks](../user-guide/tasks-errors.md).

### Snowflake Alerts — *Preview*

With this release, we are pleased to announce a preview of Snowflake Alerts. A Snowflake Alert is a schema-level object that you
can use to send a notification or perform an action when data in Snowflake meets certain conditions.

For example, you can set up a Snowflake Alert to send a notification or perform an action when:

* The warehouse credit usage increases by a specified percentage of your current quota.
* The resource consumption for your pipelines, tasks, materialized views, etc. increases beyond a specified amount.
* A data access request is received from an unauthorized user.
* Your data fails to comply with a particular business rule that you have set up.

For more information, refer to [Setting up alerts based on data in Snowflake](../user-guide/alerts.md).

## Security Updates

### Deprecated SAML SSO Parameters

With this release, the SAML_IDENTITY_PROVIDER and SSO_LOGIN_PAGE parameters used for SAML SSO configuration and management are deprecated.

All Snowflake configurations should use a [SAML2 security integration](../user-guide/admin-security-fed-auth-security-integration.md) instead
of the SAML_IDENTITY_PROVIDER and SSO_LOGIN_PAGE parameters.

If you have an existing SSO implementation that uses the SAML_IDENTITY_PROVIDER account parameter, refer to
[Migrating to a SAML2 security integration](../user-guide/admin-security-fed-auth-configure-snowflake.md).

These deprecated parameters still work, but will be removed in a future release. Migrating to a SAML2 security integration also
provides additional features that are not available when using the deprecated account parameters.

### Improved Error Messages for SSO Login Failures — *General Availability*

With this release, we are pleased to announce the general availability of improved error messages for SAML and External OAuth SSO login
failures.

Improved error messages SAML and External OAuth SSO login failures now provide a UUID in error messages associated with failed login
attempts. Administrators can use the UUID as an argument to a new [SYSTEM$GET_LOGIN_FAILURE_DETAILS](../sql-reference/functions/system_get_login_failure_details.md) function
to return a JSON object containing the error associated with the failed login attempt.

For more information, refer to [SYSTEM$GET_LOGIN_FAILURE_DETAILS](../sql-reference/functions/system_get_login_failure_details.md).

## SQL Updates

### ROUND Function: New Argument for Specifying the Rounding Mode

By default, when you specify the *<scale_expr>* argument in the [ROUND](../sql-reference/functions/round.md) function, the function
[rounds the value half away from zero](https://en.wikipedia.org/wiki/Rounding%23Rounding_half_away_from_zero). For example:

```sqlexample
SELECT ROUND(2.5, 0);

+---------------+
| ROUND(2.5, 0) |
|---------------|
|             3 |
+---------------+

SELECT ROUND(-2.5, 0);

+----------------+
| ROUND(-2.5, 0) |
|----------------|
|             -3 |
+----------------+
```

In this release, Snowflake provides a new, optional argument to change the rounding mode to [round the value half to
even](https://en.wikipedia.org/wiki/Rounding%23Rounding_half_to_even) :

```sqlsyntax
ROUND( <input_expr> [ , <scale_expr>  [ , <rounding_mode> ] ] )
```

If you want to round the value half to even, pass ‘HALF_TO_EVEN’ as the third argument (after specifying the scale as the second argument).
For example:

```sqlexample
SELECT ROUND(2.5, 0, 'HALF_TO_EVEN');

+-------------------------------+
| ROUND(2.5, 0, 'HALF_TO_EVEN') |
|-------------------------------|
|                             2 |
+-------------------------------+

SELECT ROUND(-2.5, 0, 'HALF_TO_EVEN');

+--------------------------------+
| ROUND(-2.5, 0, 'HALF_TO_EVEN') |
|--------------------------------|
|                             -2 |
+--------------------------------+
```

For more information, see the documentation on [ROUND](../sql-reference/functions/round.md).

### Search Optimization Service: Support for Tables with Masking Policies and Row Access Policies — *General Availability*

With this release, we are pleased to announce the general availability of Search Optimization Service support for tables that use masking
policies and row access policies. This can help improve performance of queries on such tables.

For more information, refer to [Search optimization service](../user-guide/search-optimization-service.md).

## Virtual Warehouse Updates

### Query Acceleration Service — *General Availability*

With this release, we are pleased to announce the general availability of Query Acceleration Service.

The query acceleration service can accelerate parts of the query workload in a warehouse by offloading portions of the query processing to
dynamic compute resources provided by the service. It can improve overall performance by reducing the impact of outlier queries, which are
queries that use more resources than the typical query.

This feature is available to Snowflake accounts on Enterprise Edition (or higher).

For more information, refer to [Using the Query Acceleration Service (QAS)](../user-guide/query-acceleration-service.md).

### Snowpark-optimized Warehouses — *General Availability*

With this release, we are pleased to announce the general availability of Snowpark-optimized warehouses in Amazon Web Services (AWS),
Microsoft Azure, and Google Cloud Platform regions.

For more information, refer to [Snowpark-optimized warehouses](../user-guide/warehouses-snowpark-optimized.md).

## Data Loading Updates

### ON_ERROR Copy Option Supports All File Formats

With this release, the ON_ERROR copy option for the COPY INTO <table> command consistently supports all file formats with either parsing or
transformation errors.

Previously, the ON_ERROR values worked as expected only for structured data files (CSV, TSV, etc.) with either parsing or transformation
errors. However, semi-structured data files (JSON, Avro, ORC, Parquet, or XML) did not support the same behavior semantics as structured
data files for the following ON_ERROR values: CONTINUE, SKIP_FILE_<*num>*, or ‘SKIP_FILE_<*num>*%’.

Currently, the ON_ERROR values work as expected and are consistent for all structured and semi-structured files, including, CSV, TSV, JSON,
Avro, ORC, Parquet, or XML.

For more information, refer to [Copy options (copyOptions)](../sql-reference/sql/copy-into-table.md).

### New Metadata Columns for Staged Files

With this release, Snowflake automatically generates the following new metadata columns for staged files, which can be queried or copied
into tables.

METADATA$FILE_CONTENT_KEY
:   Checksum of the staged data file the current row belongs to.

METADATA$FILE_LAST_MODIFIED
:   Last modified timestamp of the staged data file the current row belongs to. Returned as TIMESTAMP_NTZ.

METADATA$START_SCAN_TIME
:   Start timestamp of operation for each record in the staged data file. Returned as TIMESTAMP_LTZ.

These new metadata columns provide more detailed information about the staged files. For example, you can query METADATA$START_SCAN_TIME to
get an accurate time value of record loading.

For more information, refer to [Query metadata for staged files](../user-guide/querying-metadata.md).

## Data Collaboration Updates

### Listing Discovery Controls — *General Availability*

With this release, we are pleased to announce the general availability of listing discovery controls, which let you offer listings that can
only be discovered by specific consumers, similar to a direct share.

Using privately discoverable listings instead of direct shares lets you auto-fulfill your data product across clouds and Snowflake regions,
gather metrics about consumer usage of the data, and include metadata with your data share, such as a title and description, and usage
examples to help consumers use the data quickly.

For more information, refer to [About listings](../collaboration/collaboration-listings-about.md).

## Web Interface Updates

### SQL Editor Improvements —– *General Availability*

With this release, we are pleased to announce the general availability of improvements to the SQL editor in Snowsight,
including the following:

* Improved find and replace functionality.
* Redesigned autocomplete for commands, columns, and objects.
* Updated function autocomplete, including suggestions for function arguments to make it easier to write user-defined functions.
* Added highlighting for selected keywords, so that when you select a term in the SQL editor, all other instances of the term appear highlighted.

---
title: February 21, 2024 — Data sharing & collaboration for accounts in U.S. government regions
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-21.md
section: Release Notes
---

# February 21, 2024 — Data sharing & collaboration for accounts in U.S. government regions

With this release, we are pleased to announce that data sharing and collaboration with listings is now available for accounts in
U.S. government regions.

After accepting the cross-region disclaimer, anyone in your account with the required privileges can do the following:

* Publish free and limited trial listings on the Snowflake Marketplace.
* Share free listings directly with consumers.
* Access and install free and limited trial listings from the Snowflake Marketplace.
* Install free listings shared directly with your account.

Some limitations apply. For more details, see:

* [Prepare to provide listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/provider-becoming#label-listings-setup-gov-provider)
* [Prepare to access listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-becoming#label-listings-setup-gov-consumer)
* [Limitations for accessing listings from accounts in U.S. government regions](https://other-docs.snowflake.com/en/collaboration/consumer-listings-access#label-listings-gov-consumer-limitations)

---
title: February 22, 2024 — Snowflake Extension for Visual Studio Code Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-22.md
section: Release Notes
---

# February 22, 2024 — Snowflake Extension for Visual Studio Code Release Notes

## Visual Studio Code extension for Snowpark Python — *Preview*

With this release, we are pleased to announce that the Snowflake Extension for Visual Studio Code now integrates with Snowpark Python to provide authoring and debugging
features for Snowpark Python code.

These new features include:

* Inline debugging of Snowpark Python functions.
* Syntax highlighting and autocomplete suggestions for Snowflake SQL in Python strings within Python files or notebook cells.
* Syntax highlighting and bracket autocomplete of Jinja templates in Snowflake SQL.

For more information, see [Snowflake Extension for Visual Studio Code](../../../user-guide/vscode-ext.md).

---
title: February 26, 2024 — Snowpark Container Services Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-26.md
section: Release Notes
---

# February 26, 2024 — Snowpark Container Services Release Notes

[Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md) is now available in all AWS commercial [regions](../../../user-guide/intro-regions.md).

---
title: February 26-28, 2024 — 8.8 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_08.md
section: Release Notes
---

# February 26-28, 2024 — 8.8 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Data Lake Updates

### Secure Data Sharing for Apache Iceberg™ tables

With this release, we are pleased to announce Secure Data Sharing support for Apache Iceberg™ tables using shares. Before this release, only
secure views on Iceberg tables could be shared. You can now directly share an Iceberg table without having to create a secure view.

For more information, see:

* [Secure Data Sharing](../../user-guide/data-sharing-intro.md)
* [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md)

### Query an Apache Iceberg™ table without granting the USAGE privilege on related objects

With this release, we are pleased to announce that a role can now query an Apache Iceberg™ table in Snowflake
without being granted or inheriting the USAGE privilege on the external volume or catalog integration associated with the table.

For more information, see [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

## Extensibility Updates

### External network access — *General Availability*

With this release, we are pleased to announce the general availability of external network access, with which you can access network
locations external to Snowflake from within procedure and UDF handler code. This GA release is available on Google Cloud. External
network access was already generally available for accounts using AWS and Azure (except in the Gov region).

When setting up external network access, you create a network rule that represents the external network location. If your handler code
will need to authenticate with the external location, you create a secret containing the credentials needed. In handler code, you can
use APIs to retrieve credential values from the secret.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-Feb-24 |
| *Query an Apache Iceberg™ table without granting the USAGE privilege on related objects* | **Added** to *Data Lake Updates* | 29-Feb-24 |

---
title: February 28, 2024 — ASOF JOIN Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-02-28.md
section: Release Notes
---

# February 28, 2024 — ASOF JOIN Release Notes

## ASOF JOIN –— *Preview*

We are pleased to announce the preview of ASOF JOIN.

ASOF JOIN is a means of joining tables with time-series data when corresponding timestamp columns contain values that do not match exactly. For each row in the left table, the join finds the closest matching value from the right table.

ASOF JOIN syntax is specified in the FROM clause of a SELECT statement. Although ASOF JOIN queries can be emulated through the use of complex SQL, other types of joins, and window functions, queries are easier to write (and are often more performant) if you use the ASOF JOIN syntax.

The key capability of this join method is the analysis of two or more time series to find the closest preceding or following record that matches a given criterion. Therefore, ASOF JOIN is useful for analyzing various data sets, such as financial trading data, weather observations, readings from sensors, and audit trails. In all of these use cases, ASOF JOIN may be used to associate data when records from different sources have timestamps that are not exactly the same.

For more details, see [Analyzing time-series data](../../../user-guide/querying-time-series-data.md) and [ASOF JOIN](../../../sql-reference/constructs/asof-join.md).

> **Note:**
>
> The ASOF JOIN feature does not work if the 2024_01 BCR bundle is explicitly disabled by an account administrator. The 2024_01 BCR bundle is enabled by default.

---
title: February 28-29, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-02-21.md
section: Release Notes
---

# February 28-29, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## New navigation for Snowsight —– *General Availability*

With this release, we are pleased to announce the general availability of a new navigation menu in Snowsight. The new menu
organizes various features into categories for more intuitive navigation.

When working with worksheets, Streamlit apps, or dashboards, you can quickly navigate to other pages in Snowsight with the new
collapsed format of the navigation menu.

The most noticeable changes include:

* Worksheets, Streamlit, dashboards, and developing app packages are part of Projects.
* Resources for sharing data and apps, like Provider Studio, Apps, Marketplace, and Partner Connect, are part of Data Products.
* The Activity section is renamed Monitoring.
* Change your role, update your profile, get your account URL and identifier, and access support from one centralized user menu, located
  at the bottom of the navigation menu.

For more details, see [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md).

---
title: File formats and stages: Enforce dependency checks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-1989.md
section: Release Notes
---

# File formats and stages: Enforce dependency checks

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

File formats and stages behave in the following ways:

Before the change:
:   You can drop or recreate a file format or stage even when external tables depend on it.
    You can also alter a stage location. These actions invalidate the dependent external tables.

After the change:
:   You can’t drop or recreate a file format or stage that has dependent external tables.
    You also can’t alter the location of a stage with dependent external tables.
    To perform these operations, first drop the dependent external tables.

Ref: 1989

---
title: File Formats: Validation of Format Options
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1134.md
section: Release Notes
---

# File Formats: Validation of Format Options

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, Snowflake has strict validation against invalid file format options and will return an error on use of invalid stages and file format options. Creation of new invalid stage or file formats are not allowed.

Previously:
:   When a stage, table, or file format is created with a named file format and some other invalid file format type options, there is currently no validation error thrown. This results in stages or file formats to be created with invalid file format options.

Currently:
:   Error messages occur when a stage, table, or file format is created with a named file format and some other invalid file format options. Queries that use such invalid stages, table stages, or file formats will fail.

We recommend the following:

* Check your stages and file formats to ensure you don’t have any unsupported options. For example, an invalid stage may have a named CSV file format with a parquet type option, such as BINARY_AS_TEXT.
* Recreate the stage or file format with only the supported file format options.

Note that existing invalid stages or file formats used by Snowpipe will not be affected by this change.

Ref: 1134

---
title: Fix future grant materialization precedence
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1870.md
section: Release Notes
---

# Fix future grant materialization precedence

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

If schema-level future grant(s) for a particular object (such as a table, view, or materialized view) exist, when creating a new, table-like object the database-level OWNERSHIP future grant behaves as follows:

Before the change:
:   OWNERSHIP future grant applies.

After the change:
:   OWNERSHIP future grant does not apply. The schema-level future grant(s) correctly take precedence.

Ref: 1870

---
title: Full lifecycle management for converted Apache Iceberg™ tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2219.md
section: Release Notes
---

# Full lifecycle management for converted Apache Iceberg™ tables

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When you convert an externally managed Apache Iceberg™ table into a table that uses Snowflake as the Iceberg catalog, Snowflake handles
full lifecycle management for the converted table. This management includes managing the data and metadata files that were created before
you converted the table.

Before the change:
:   When you convert an externally managed Iceberg table to use Snowflake as the catalog, Snowflake doesn’t delete the data and metadata files
    that were created for the table before you converted the table.

After the change:
:   When you convert an externally managed Iceberg table to use Snowflake as the catalog, Snowflake manages deleting the data and metadata
    files that were created for the table before you converted the table. Snowflake deletes these files when they expire, which is when they
    exceed their retention period.

    The metadata files that Snowflake deletes include metadata.json, manifest files, and manifest list files.

    This change affects both existing converted tables and tables that are converted after the change is enabled.

    Before the change takes effect, do the following:

    1. Check whether any workloads or pipelines rely on any Iceberg files under the table’s storage location that were created before you
       converted the table. Also, check any Iceberg snapshots and files that are older than the retention period set on the table.
    2. If you find any of these files and you still need them outside of Snowflake after the change takes effect, copy them to a separate
       storage location that you manage.

    This change allows Snowflake to manage the full lifecycle for converted Iceberg tables. As a result, this change prevents unnecessary
    storage consumption resulting from pre-conversion data and metadata files that remain in storage after they expire.

Ref: 2219

---
title: Full-text search: TOKEN_SEARCH function renamed to SEARCH
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1633.md
section: Release Notes
---

# Full-text search: TOKEN_SEARCH function renamed to SEARCH

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

The full-text search SQL function named TOKEN_SEARCH behaves as follows:

Before the change:
:   The full-text search SQL function is named TOKEN_SEARCH.

After the change:
:   The full-text search SQL function is named SEARCH. However, the current function name, TOKEN_SEARCH,
    will continue to work for a period of time.

    Customers who have user-defined functions (UDFs) or stored procedures named SEARCH must rename those
    UDFs or stored procedures. Attempting to call a UDF or a procedure named SEARCH without qualification
    will result in a SQL compilation error. It is also possible that the SEARCH function could be executed
    instead of the UDF or procedure if the function signatures match exactly.

Ref: 1633

---
title: FUNCTIONS and PROCEDURES views (INFORMATION_SCHEMA): Corrections to columns When name contains special characters
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1404.md
section: Release Notes
---

# FUNCTIONS and PROCEDURES views (INFORMATION_SCHEMA): Corrections to columns When name contains special characters

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

This behavior change affects UDFs and stored procedures that have names or argument names containing any of the following
characters:

* A colon ( `:` )
* An open parenthesis ( `(` )
* A close parenthesis ( `)` )

In the INFORMATION_SCHEMA [FUNCTIONS](../../../sql-reference/info-schema/functions.md) and
[PROCEDURES](../../../sql-reference/info-schema/procedures.md) views, the `argument_signature` and `data_type` columns contain
the following values for these functions and procedures:

Before the change:
:   `argument_signature` and `data_type` do not contain the correct argument signature and return data type.

    The value in the `argument_signature` column might contain an open parenthesis or the part of the function or procedure name
    that starts with an open parenthesis.

    The value in the `data_type` column might contain the prefix TABLE.

After the change:
:   `argument_signature` and `data_type` contain the correct argument signature and return data type.

For example, suppose that a UDF name contains a colon:

```sqlexample
CREATE OR REPLACE FUNCTION "passthrough:function"(arg VARCHAR)
  RETURNS VARCHAR
  ...
```

The `argument_signature` and `data_type` columns contain the following values:

Before the change:
:   ```output
    +--------------------+------------------------+
    | ARGUMENT_SIGNATURE | DATA_TYPE              |
    |--------------------+------------------------|
    | (                  | TABLEVARCHAR(16777216) |
    +--------------------+------------------------+
    ```

After the change:
:   ```output
    +--------------------+-------------------+
    | ARGUMENT_SIGNATURE | DATA_TYPE         |
    |--------------------+-------------------|
    | (ARG VARCHAR)      | VARCHAR(16777216) |
    +--------------------+-------------------+
    ```

Note that this change just addresses the problem in the FUNCTIONS and PROCEDURES views in INFORMATION_SCHEMA. The fix for the
[FUNCTIONS](../../../sql-reference/account-usage/functions.md) and [PROCEDURES](../../../sql-reference/account-usage/procedures.md) views
in ACCOUNT_USAGE will be made available in a future behavior change release.

Ref: 1404

---
title: FUNCTIONS view (Account Usage and Information Schema): New column IS_AGGREGATE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1617.md
section: Release Notes
---

# FUNCTIONS view (Account Usage and Information Schema): New column IS_AGGREGATE

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the Account Usage and Information Schema [FUNCTIONS view](../../../sql-reference/account-usage/functions.md) includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| IS_AGGREGATE | VARCHAR(3) | `YES` if the function is an [aggregate function](../../../developer-guide/udf/python/udf-python-aggregate-functions.md); otherwise, `NO`. |

Ref: 1617

---
title: FUNCTIONS view (Account Usage): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1362.md
section: Release Notes
---

# FUNCTIONS view (Account Usage): New columns

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the Account Usage FUNCTIONS view is as follows:

Before the change:
:   The functions view does not contain the IS_MEMOIZABLE and IS_DATA_METRIC columns.

After the change:
:   The functions view contains the IS_MEMOIZABLE and IS_DATA_METRIC columns, which are defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | IS_MEMOIZABLE | Boolean | Y (yes) if the function is memoizable, N (no) otherwise. |
    | IS_DATA_METRIC | Boolean | This column is a placeholder for future functionality. |

Ref: 1362

---
title: FUNCTIONS view (ACCOUNT_USAGE): Stored procedures are no longer included
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1671.md
section: Release Notes
---

# FUNCTIONS view (ACCOUNT_USAGE): Stored procedures are no longer included

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

With this behavior change, [the FUNCTIONS view in the ACCOUNT_USAGE schema](../../../sql-reference/account-usage/functions.md)
changes as follows:

Before the change:
:   The view includes rows for stored procedures.

After the change:
:   The view does not include rows for stored procedures.

This behavior change makes the FUNCTIONS view in ACCOUNT_USAGE consistent with
[the FUNCTIONS view in INFORMATION_SCHEMA](../../../sql-reference/info-schema/functions.md).

If you need information about the stored procedures in your account, use
[the PROCEDURES view in the ACCOUNT_USAGE schema](../../../sql-reference/account-usage/procedures.md) instead.

Ref: 1671

---
title: FUNCTIONS view (Information Schema): Add support for data metric function
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1569.md
section: Release Notes
---

# FUNCTIONS view (Information Schema): Add support for data metric function

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The behavior of the Information Schema [FUNCTIONS](../../../sql-reference/info-schema/functions.md) view is as follows:

Before the change:
:   The output of the view does not include the `is_data_metric` column and the `argument_signature` column does not support table arguments that are associated with a data metric function.

After the change:
:   The output of the view includes the is_data_metric column, which is defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | `is_data_metric` | VARCHAR(3) | YES, if the function is a data metric function; otherwise, NO. |

    When the output row corresponds to a data metric function, the `argument_signature` column includes the signature of the data
    metric function.

Ref: 1569

---
title: GCP PSC propagated connection limit set to 0
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2193.md
section: Release Notes
---

# GCP PSC propagated connection limit set to 0

> **Note:**
>
> This change is planned for February 2026.

Propagated connections to the Google Cloud Private Service Connect (GC PSC) network allow customers to access the
Snowflake service from multiple Google projects through a single authorized project. Unlimited propagated connections
can undermine the per-account and per-project budgeting, and operational predictability of your Snowflake account.
Newly propagated connections, if left unlimited, can exhaust service-attachment limits and risk outage or necessitate the
denial of legitimate direct connections.

Disabling newly propagated connections in the Snowflake PSC service attachment configuration prevents further uncontrolled
growth while leaving existing propagated connections intact. This solution prevents immediate customer disruption. For more
information about how Google reconciles propagated connections, see
[About propagated connections](https://cloud.google.com/vpc/docs/about-propagated-connections#connection-reconciliation)
in the Google Cloud documentation.

Snowflake service connections to the GC PSC network behave as follows:

Before the change:
:   Snowflake enforced no limit on propagated connections in the GCP Inbound Privatelink PSC service attachment configuration.

After the change:
:   Snowflake sets a limit of 0 for newly propagated connections in the GCP Inbound Privatelink PSC service attachment
    configuration. Existing propagated connections remain. Only newly propagated connections are denied.

This change affects Snowflake accounts that use Google Cloud Network Connectivity Center hub-and-spoke or similar
topologies to reach Snowflake via propagated connections.

No customer action is required due to this change.

Ref: 2193

---
title: GENERATE_SYNTHETIC_DATA: Join key column type change in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1868.md
section: Release Notes
---

# GENERATE_SYNTHETIC_DATA: Join key column type change in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, any string columns that you have designated as
join keys in [GENERATE_SYNTHETIC_DATA](../../../sql-reference/stored-procedures/generate_synthetic_data.md) will be generated as UUID strings:

Before the change:
:   Join keys (columns listed in the `columns` field with `join_key=True` in GENERATE_SYNTHETIC_DATA) are generated as simple numeric strings in the output data.

After the change:
:   Join keys are in UUID format.

Ref: 1868

---
title: GET_DDL function: Return source Apache Iceberg™ data types
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1553.md
section: Release Notes
---

# GET_DDL function: Return source Apache Iceberg™ data types

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The GET_DDL function behaves as follows:

Before the change:
:   When you execute the function for an [Iceberg table](../../../user-guide/tables-iceberg.md), the return value displays the Snowflake data type
    that is used to process and return table data.

After the change:
:   When you execute the function for an Iceberg table, the return value displays the source Iceberg data type
    that is associated with the column.

Ref: 1553

---
title: GET_DDL Function: Tags Set on Streams, Tasks, and Pipes Included in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-924.md
section: Release Notes
---

# GET_DDL Function: Tags Set on Streams, Tasks, and Pipes Included in Output

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The output of the [GET_DDL](../../../sql-reference/functions/get_ddl.md) function now include any tag objects that are set on a stream, task,
or pipe. The tag associations are specified in the WITH clause of the CREATE OR REPLACE <object> command and allow creating or
replacing an object with tags already assigned to the object.

Ref: 924

---
title: GET_LINEAGE function: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2059.md
section: Release Notes
---

# GET_LINEAGE function: New column in output

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the output of the
[SNOWFLAKE.CORE.GET_LINEAGE](../../../sql-reference/functions/get_lineage-snowflake-core.md) function includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `process` | VARIANT | Provides details about how lineage between the source object and target object was established. For example, it might include the query ID of a SQL query or the name of a stored procedure that moved data from the source object to the target object. |

Ref: 2059

---
title: GET_QUERY_OPERATOR_STATS and EXPLAIN Functions and Commands: Parent Operators Represented by Arrays
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1175.md
section: Release Notes
---

# GET_QUERY_OPERATOR_STATS and EXPLAIN Functions and Commands: Parent Operators Represented by Arrays

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The following commands and functions provide details about the execution of a query:

* [EXPLAIN](../../../sql-reference/sql/explain.md), EXPLAIN USING TABULAR, and EXPLAIN USING JSON (this does not affect EXPLAIN USING TEXT)
* [EXPLAIN_JSON](../../../sql-reference/functions/explain_json.md)
* [SYSTEM$EXPLAIN_PLAN_JSON](../../../sql-reference/functions/system_explain_plan_json.md)
* [GET_QUERY_OPERATOR_STATS](../../../sql-reference/functions/get_query_operator_stats.md)

The output of these commands and functions includes information about each operator node in the tree of operator nodes that
comprise a query.

## Background

Currently, this information includes a column or property that identifies a single parent node. This does not handle situations
in which an operator node has multiple parent nodes.

For example, suppose that you are producing a profile of the following query:

```sqlexample
WITH wv AS (
    SELECT a FROM t WHERE a % 2 = 1
  )
  SELECT a FROM wv WHERE a % 3 = 1
  UNION ALL
  SELECT a FROM wv WHERE a % 5 = 1;
```

In the profile of this query, the `WithClause [4]` node has multiple parent nodes:

For the query above, the output of the GET_QUERY_OPERATOR_STATS and EXPLAIN commands and functions have a column or property that
identifies `WithReference [3]` as the parent node of `WithClause [4]`. However, there are two parent nodes:
`WithReference [3]` and `WithReference [8]`.

## Changes in Output

In the current release, the existing column or property for the parent node is replaced by one of the following columns or
properties that contains an array of the IDs of the parent nodes:

| SQL Command or Function | Existing Column or Property | New Column or Property |
| --- | --- | --- |
| EXPLAIN and SYSTEM$EXPLAIN_PLAN_JSON | `parent` | `parentOperators` |
| GET_QUERY_OPERATOR_STATS | `PARENT_OPERATOR_ID` | `PARENT_OPERATORS` |

### Changes in the Tabular Output of the EXPLAIN Command and the EXPLAIN_JSON Function

Suppose that you execute the following statement:

```sqlexample
EXPLAIN WITH wv AS (
    SELECT a FROM t WHERE a % 2 = 1
  )
  SELECT a FROM wv WHERE a % 3 = 1
  UNION ALL
  SELECT a FROM wv WHERE a % 5 = 1;
```

The output changed as described below:

Previously:
:   The output includes the parent column, which contains a single parent node ID:

    ```output
    +------+------+--------+---------------+ ...
    | step | id   | parent | operation     | ...
    +------+------+--------+---------------+ ...
    | NULL | NULL |   NULL | GlobalStats   | ...
    |    1 |    0 |   NULL | Result        | ...
    |    1 |    1 |      0 | UnionAll      | ...
    |    1 |    2 |      1 | Filter        | ...
    |    1 |    3 |      2 | WithReference | ...
    |    1 |    4 |      3 | WithClause    | ...
    ...
    ```

Currently:
:   The output includes the parentOperators column, which contains an array of parent node IDs:

    ```output
    +------+------+-----------------+---------------+ ...
    | step | id   | parentOperators | operation     | ...
    |------+------+-----------------+---------------+ ...
    | NULL | NULL | NULL            | GlobalStats   | ...
    |    1 |    0 | NULL            | Result        | ...
    |    1 |    1 | [0]             | UnionAll      | ...
    |    1 |    2 | [1]             | Filter        | ...
    |    1 |    3 | [2]             | WithReference | ...
    |    1 |    4 | [3, 8]          | WithClause    | ...
    ...
    ```

For the EXPLAIN_JSON function, if the plan passed to the function does not include information about the parent operators, the
`parentOperators` column will be NULL.

### Changes in the JSON Output of the EXPLAIN Command and SYSTEM$EXPLAIN_PLAN_JSON

Suppose that you execute a statement that produces the JSON output of the query plan. For example:

```sqlexample
EXPLAIN USING JSON WITH wv AS (
    SELECT a FROM t WHERE a % 2 = 1
  )
  SELECT a FROM wv WHERE a % 3 = 1
  UNION ALL
  SELECT a FROM wv WHERE a % 5 = 1;
```

The output changed as described below:

Previously:
:   The output includes the parent property, which contains a single parent node ID:

    ```sqljson
    {
      ...
      "Operations": [[
        ...
        {"id":1,"parent":0,"operation":"UnionAll"},
        {"id":2,"parent":1,"operation":"Filter", ...},
        {"id":3,"parent":2,"operation":"WithReference"},
        {"id":4,"parent":3,"operation":"WithClause", ...},
        ...
    ```

Currently:
:   The output includes the parentOperators property, which contains an array of parent node IDs:

    ```sqljson
    {
      ...
      "Operations":[[
        ...
        {"id":1,"operation":"UnionAll","parentOperators":[0]},
        {"id":2,"operation":"Filter",... , "parentOperators":[1]},
        {"id":3,"operation":"WithReference","parentOperators":[2]},
        {"id":4,"operation":"WithClause",... ,"parentOperators":[3,8]},
        ...
    ```

### Changes in the Output of the GET_QUERY_OPERATOR_STATS Function

Suppose that you execute the following statements:

```sqlexample
WITH wv AS (
    SELECT a FROM t WHERE a % 2 = 1
  )
  SELECT a FROM wv WHERE a % 3 = 1
  UNION ALL
  SELECT a FROM wv WHERE a % 5 = 1;
```

```sqlexample
SET lid = LAST_QUERY_ID();
```

```sqlexample
SELECT * FROM TABLE(GET_QUERY_OPERATOR_STATS($lid));
```

The output changed as described below:

Previously:
:   The output includes the PARENT_OPERATOR_ID column, which contains a single parent node ID:

    ```output
    +-----+---------+-------------+--------------------+---------------+ ...
    | ... | STEP_ID | OPERATOR_ID | PARENT_OPERATOR_ID | OPERATOR_TYPE | ...
    +-----+---------+-------------+--------------------+---------------+ ...
    | ... |       1 |           0 |               NULL | Result        | ...
    | ... |       1 |           1 |                  0 | UnionAll      | ...
    | ... |       1 |           2 |                  1 | Filter        | ...
    | ... |       1 |           3 |                  2 | WithReference | ...
    | ... |       1 |           4 |                  3 | WithClause    | ...
    ...
    ```

Currently:
:   The output includes the PARENT_OPERATORS column, which contains an array of parent node IDs:

    ```output
    |-----+---------+-------------+------------------+---------------+ ...
    | ... | STEP_ID | OPERATOR_ID | PARENT_OPERATORS | OPERATOR_TYPE | ...
    |-----+---------+-------------+------------------+---------------+ ...
    | ... |       1 |           0 | NULL             | Result        | ...
    | ... |       1 |           1 | [                | UnionAll      | ...
    |     |         |             |   0              |               | ...
    |     |         |             | ]                |               | ...
    | ... |       1 |           2 | [                | Filter        | ...
    |     |         |             |   1              |               | ...
    |     |         |             | ]                |               | ...
    | ... |       1 |           3 | [                | WithReference | ...
    |     |         |             |   2              |               | ...
    |     |         |             | ]                |               | ...
    | ... |       1 |           4 | [                | WithClause    | ...
    |     |         |             |   3,             |               | ...
    |     |         |             |   8              |               | ...
    |     |         |             | ]                |               | ...
    ...
    ```

Ref: 1175

---
title: Go Snowflake Driver release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang.md
section: Release Notes
---

# Go Snowflake Driver release notes

The Go Snowflake Driver release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](golang-2026.md)
* [2025 releases](golang-2025.md)
* [2024 releases](golang-2024.md)
* [2023 releases](golang-2023.md)
* [2022 releases](golang-2022.md)

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

---
title: Go Snowflake Driver release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2022.md
section: Release Notes
---

# Go Snowflake Driver release notes for 2022

This article contains the release notes for the Go Snowflake Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Go Snowflake Driver updates.

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

## Version 1.6.16 (December 14, 2022)

### New features

* Fixed an issue where file decryption was causing a panic.
* Reverted the `go-ieproxy` library back to version 0.0.1.

## Version 1.6.15 (November 16, 2022)

### New features

* Added MultiFactor Authentication mechanism and caching for MFA/Id token.
* Fixed an issue where 405 error is thrown when S3 bucket acceleration is disabled.

## Version 1.6.14 (October 28, 2022)

### New features

* Removed the requirement to provide the original SQL query in addition to the requestId when resubmitting requests.
* Updated mocha to version 10.1.0.

## Version 1.6.14 (September 21, 2022)

### New features

* Removed support for Go 1.7 and added support for Go 1.17.
* Changed the format for float and numeric values when converting arrow types.
* Added the following functions to access data in arrow.Record format directly from queries:

  + `GetArrowBatches()`, which is a blocking call
  + `GetQueryID()`
  + `GetStatus()`
* Updated Go vendors.

## Version 1.6.13 (August 22, 2022)

### New features

* Added an example to show how to use key-pair authentication.
* Added the tracing connection parameter to enable logging in the connection string and DSN.
* Improved logging details for chunk downloads.
* Added support for using interface slice `[]interface{}` to insert NULL values via array binding for
  the time.Time data types.

### Bug fixes

* Fixed the “Failed to decrypt. Check file key and master key” error that occurred when binding large data
  files via array binding.

## Version 1.6.12 (July 29, 2022)

### New features

* Added support for using interface slice `[]interface{}` to insert `NULL` values via array binding.

### Bug fixes

* Fixed an issue where setting `DisableTelemetry` to TRUE did not disable telemetry.
* Fixed an issue with encrypted SAML assertions when authenticating with an external browser.

## Version 1.6.11 (June 23, 2022)

### Bug fixes

* Created a temporary workaround to avoid the “Failed to decrypt. Check file key and master key” error that
  occurred when binding large data files via array binding. Determining the root cause of the issue is currently
  under investigation.

## Version 1.6.10 (May 25, 2022)

### Bug fixes

* Removed redundant calls that impacted performance for `PrepareContext()`.

## Version 1.6.8 (March 15, 2022)

### New features

* Added support for exporting unique universal IDs (UUIDs).

### Bug fixes

* Fixed a default server side error.

## Version 1.6.7 (February 16, 2022)

### Bug fixes

* Fixed an issue where multi-statement queries were missing result IDs.
* Implemented Universally Unique Identifier version 4 (UUIDv4).
* Fixed and issue with `GetQueryStatus`.
* Fixed an issue in PUT Memory Enhancements performance tests.
* Fixed an issue with arrow record result batches.
* Made the port parameter optional.

---
title: Go Snowflake Driver release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2023.md
section: Release Notes
---

# Go Snowflake Driver release notes for 2023

This article contains the release notes for the Go Snowflake Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Go Snowflake Driver updates.

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

## Version 1.7.1 (December 07, 2023)

### New features and updates

* Upgraded the `crypto` and `net` libraries.
* Added support to run S3 clients on the new AWS SDK library while preserving compatibility with the previous library version.
* Improved the OCSP response cache performance by changing the key from an `x509.Certificate` to a string.
* Implemented separate retry strategies for authentication endpoints and other types of endpoints.

### Bug fixes

* The driver now retries `getQueryStatus` queries that fail when backend errors occur.
* The driver now provides a `QueryId` for failed queries invoked by statements.

## Version 1.7.0 (November 15, 2023)

### BCR (Behavior Change Release) change

* Changed the default PUT behavior for the `OVERWRITE` parameter. Previously, the default was `OVERWRITE=true`. With this change, the default is `OVERWRITE=false`, so you must explicitly enable overwrite PUT behavior.

### New features and updates

* Added the `IncludeRetryReason` configuration parameter to enable or disable whether to send the HTTP status code for query retry requests.
* Added a new `WithOriginalTimestamp` context to allow arrow batches to use nanosecond precision in the full year range supported by Snowflake.
* Added support for setting the log level in a configuration file.
* Improved performance by caching parsed OCSP responses.

### Bug fixes

* Fixed an issue related to concurrent access to an HTAP query context cache.
* Fixed an issue related to improper connection handling in the asynchronous demo example.

## Version 1.6.25 (September 26, 2023)

### New features and updates

* Added support for hybrid transactional and analytical processing.
* Implemented the `GetQueryId` function on the statement level, which allows you to get the last query id on this statement.
* Added the retry reason to query requests.
* Updated the `cacert` bundle used for SSL connections.

### Bug fixes

* Fixed an issue with OCSP fallback requests in PrivateLink environments.
* Removed QueryID from the snowflakeConn struct to address some race conditions when the same connection was reused among threads.
* Fixed an issue where the driver showed an error for successful queries.

## Version 1.6.24 (August 22, 2023)

### New features and updates

* Added support specifying a temporary directory for encryption and compression.
* Improved performance by checking location data once per query instead of for each row and column separately.
* Added support specifying a custom context when fetching an arrow batch.

### Bug fixes

* None.

## Version 1.6.23 (July 25, 2023)

### New features and updates

* Added support for binding named parameters.
* Added support for `sql.Null` types for query bind mapping.
* Allowed setting separate authentication timeout for key pair authentication.
* Added sample application providing example of distributed fetch feature.
* Added external browser timeout.
* Provided easier way to configure Snowflake connection (see `/cmd` examples).
* Upgraded arrow library to better handle 32bit systems.
* Provided sample app demonstrating how to use Arrow batches.

### Bug fixes

* Fixed error messages from race conditions with multiple threads.
* Fixed an issue with retry async requests if a query is still in progress.
* Added null checks before accessing connection config during chunk downloading.
* Fixed an issue with handling JSON result sets returned from server when driver expected Arrow.
* Recreate new JWT token (with new expiration) on key pair authentication retry.
* Added timeout for authentication in external browser to prevent infinite waiting when user closed browser tab.
* Fixed driver panic when temporary file system is in readonly mode.
* Fixed an authentication issue by requiring username and password only for authentication modes in which it is required.

## Version 1.6.22 (June 14, 2023)

### New features and updates

* Added a sample app, `async.go,` within the cmd folder to demonstrate how to use asynchronous API calls within
  the Golang driver.
* Added a sample app, `multistatement.go`, within the cmd folder to demonstrate how to send multiple statements
  within the Golang driver.

### Bug fixes

* Fixed an issue where `Commit()` and `Rollback()` did not use the same context set
  in `BeginTx()`, which could cause locks to occur.

## Version 1.6.21 (May 23, 2023)

### New features and updates

* Added to check to see whether the context deadline was exceeded when retrying in the `snowflakeChunkDownloader`.
* Upgraded the arrow library to version v12.
* Added the ability to expose the direct arrow IPC streams from the Snowflake Go driver.
* Included the Arrow Database Connectivity (ADBC) version 0.4.0 release, which uses the updated Snowflake library
  to provide a Snowflake ADBC driver that can be consumed by anything that access a C interface, in
  addition to the native Go bindings.

### Bug fixes

* Fixed an int64 overflow issue with large or small `datetime` values.

## Version 1.6.20 (April 18, 2023)

### New features and updates

* Added support for Okta Identity Engine (OIE) logins.
* Improved memory usage by cleaning up the first data chunk before reading the next data chunk.

### Bug fixes

* Fixed the interface conversion panic when the context has been cancelled while monitoring an asynchronous
  query and passing a cancelable context to `WithFetchResultByID`.
* Updated log messages for OCSP file lock errors.
* Now logs an error when a single file upload fails.

## Version 1.6.19 (March 21, 2023)

### New features and updates

* Added support for Go version 1.20 and dropped support for Go version 1.18.
* Migrated from azure-storage-blob-go v0.15.0 to azure-sdk-for-go v1.0.0.
* The Go driver now supports retrying on an HTTP 429 error code.
* Upgraded the Arrow library to version v10.

### Bug fixes

* Fixed an issue where the Go driver failed to validate an SSO URL before executing it. Now, the driver uses the URLValidator and URLEncoder utilities to validate and encode the URL.
* Fixed the Pointer datatype `*time.Time` returns `<nil>` value from version GO Driver 1.6.13.

## Version 1.6.18 (February 22, 2023)

### New features and updates

* None.

### Bug fixes

* Added support for disabling connection caching for multi-factor authentication and external browsers, which
  are enabled by default, but setting either of the following configuration parameters.

  + `ClientStoreTemporaryCredential=ConfigBoolFalse`
  + `ClientRequestMfaToken=ConfigBoolFalse`

## Version 1.6.17 (January 26, 2023)

### New features and updates

* Updated `golang.org/x/net/http2` to version 0.5.0.

### Bug fixes

* Improved performance of multi-statement queries by skipping queries that return no update count.
* Fixed connection caching for MFA and external browser authentication.
* Added a mutex lock to the configuration parameters map to avoid concurrent read/writes when using multiple go routines.

---
title: Go Snowflake Driver release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2024.md
section: Release Notes
---

# Go Snowflake Driver release notes for 2024

This article contains the release notes for the Go Snowflake Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Go Snowflake Driver updates.

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

## Version 1.12.1 (December 05, 2024)

### New features and updates

* Introduced `golangci-lint` and fixed some errors found by this tool.
* Introduced S3 logging configuration.
* Changed `RaisePutGetError` to be `true` by default.
* Implemented the `DriverContext` interface to reduce the time for creating new connections and to use context when connections are created using `db.Conn(ctx)`.
* Introduced the `disableOCSPChecks` configuration option, which replaces `insecureMode`. Deprecated `insecureMode`.
* Provided the ability to set custom OCSP cache server URLs.

### Bug fixes

* Propagated context to authentication processes.
* HTTP headers used to communicate with Azure are now case insensitive.
* Use correct length of IV for GCM encryption.

## Version 1.12.0 (October 30, 2024)

### New features and updates

* Added support for Golang 1.23, dropped support for Golang 1.20.
* Added support for configuring connections using `connections.toml`.
* Bumped logrus to version 1.9.3.
* Extended logging when querying with `QueryArrowStream`.

### Bug fixes

* Fixed an issue with duplicate requestIDs and requestGUIDs on session renewal.
* Fixed the proxy configuration for Azure.
* Removed the `*.okta.com` native Okta authenticator URL restriction.
* Fixed the `filestransfer` example that failed with an incorrect file path.

## Version 1.11.2 (October 03, 2024)

### New features and updates

* Changed `GetFileToStream` to an exported member of the `SnowflakeFileTransferOptions` `struct` so GET operations can read files using streams to reduce memory usage.

### Bug fixes

* Fixed error handling while getting accelerated configurations from S3 bucket.

## Version 1.11.1 (August 29, 2024)

### New features and updates

* Added support for downloading files into an in-memory stream when using the GET command.
* Added context propagation to `snowflakeFileTransferAgent` to support cancel for file transfer process.

### Bug fixes

* Removed context propagation in `snowflakeConn`, which is used only for dialing purposes.
* Prevent panic in the `arrayToString` method for Golang slices.
* Prevent panic in the `decodeChunk` method when a download is canceled.

## Version 1.11.0 (July 31, 2024)

### New features and updates

* Added support for Go 1.22 and dropped support for Go 1.19.
* Adjusted driver configuration for China deployments.
* Added the ability to bind structured types in queries.
* Added support for using a passcode with MFA token caching enabled.
* Added support for setting session variables in DSN.
* Provided a simpler solution to define structured objects using tags.
* Provided a mechanism to wrap each goroutine in custom code.

### Bug fixes

* Fixed an issue with handling session expiration when executing long-running queries.
* Fixed an issue OCSP failures when OCSP cache is disabled.
* Fixed an issue with reading arrow batches that contained integer columns whose size is smaller than 64b.

## Version 1.10.1 (May 29, 2024)

### New features and updates

* Upgraded AWS SDK dependencies.
* Added automatic password masking in logs.
* Added the `DisableSamlURLCheck` parameter to disable SAML URL checks.
* Added support for binding semi-structured types.
* Decreased the number of retries to OCSP.
* Added the `OcspMaxRetryCount` and `OcspResponderTimeout` variables to define the OCSP maximum retry count and timeout, respectively.

### Bug fixes

* Fixed an issue with exposed objects in Arrow batches mode.
* Fixed an issue with extracting account names when using key-pair authentication.

## Version 1.10.0 (May 8, 2024)

### New features and updates

* Implemented support for structured types (structured objects, arrays, and maps).
* Added an option to skip driver registration during startup.
* Added the `SECURITY.md` file so customers can review Snowflake’s security policy.
* Added the ability to set custom logger fields.

### Bug fixes

* Fixed an issue with closing the error channel twice when using async mode.
* Fixed a race condition when accessing temporal credentials.

## Version 1.9.0 (March 28, 2024)

### New features and updates

* Upgraded to Arrow version 15.
* Added support for the `WithHigherPrecision` context in Arrow batches mode.
* Added date and time converter from the Snowflake format to the Golang format.
* Added a context that replaces UTF-8 characters in Arrow responses.

### Bug fixes

* Fixed an issue with handling unavailable Amazon S3 accelerated configuration when transferring files.
* Fixed an issue with dividing big numbers in Arrow mode.
* Fixed a data racing issue during logging initialization.
* Fixed an issue where results were not downloaded when the first batch was missing in a response.
* Fixed an issue with the backoff retry period for non-authenticated requests.
* Fixed an issue where zombie DBus processes were not terminated when a program ended.

## Version 1.8.0 (February 21, 2024)

### New features and updates

* Added support for multiple SAML integrations.
* Added support for second, millisecond, and microsecond precision for arrow batch timestamps.

### Bug fixes

* Fixed an issue with `WithFetchResultByID` by checking for the `queryInProgressAsyncCode` response code when fetching results.
* Fixed an issue where OKTA authentication failed when receiving an HTTP 429 error.
* Fixed an issue where the driver incorrectly returned an error for empty arrow batches.

## Version 1.7.2 (January 17, 2024)

### New features and updates

* Added support for Go version 1.21.
* Upgraded the `arrow` library to version v14.
* Updated the `jose2go` and `crypto` dependencies.
* Allow clients to set the QUERY_TAG parameter via context.
* Standardized using the same `http.Transport` for all cloud providers.
* Added an example showing how to insert data into VARIANT and OBJECT columns using variable binding.

### Bug fixes

* Fixed the following issues relating to error handling:

  + The driver now propagates errors when file upload errors occur.
  + The driver now propagates errors that occur during chunk downloading.
  + The driver does not start chunk downloading when an error occurs with the first chunk download.
* Fixed an issue where the driver tried to read an empty chunk, when `arrow_batches` mode is enabled.
* Removed retry attempts for HTTP 400 and 405 statuses.
* Fixed an issue with unexpected errors that occurred during S3 HEAD calls.
* Fixed the GET example in documentation.

---
title: Go Snowflake Driver release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2025.md
section: Release Notes
---

# Go Snowflake Driver release notes for 2025

This article contains the release notes for the Go Snowflake Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Go Snowflake Driver updates.

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

## Version 1.18.1 (Dec 15, 2025)

### New features and updates

* Included a shared library to collect telemetry to identify and prepare testing platforms for native Rust extensions.

### Bug fixes

* Handled HTTP 307 and 308 responses in drivers to achieve better resiliency to backend errors.
* Created a temporary directory only if needed during file transfers.
* Fixed unnecessary user expansion for file paths during file transfers.

## Version 1.18.0 (Nov 20, 2025)

### New features and updates

* Added validation of CRL `NextUpdate` for freshly downloaded CRLs.
* Added logging of query text and parameters.

### Bug fixes

* Fixed a data race error in tests caused by the platform detection `init()` function.
* Made secrets detector initialization thread safe and more maintainable.

## Version 1.17.1 (Nov 4, 2025)

### New features and updates

* Added telemetry for login requests to supported platforms (such as EC2, Lambda, Azure function, and so on). You can disable the telemetry by setting the `SNOWFLAKE_DISABLE_PLATFORM_DETECTION` environment variable (`SNOWFLAKE_DISABLE_PLATFORM_DETECTION=true`).
* Exposed `QueryStatus` from `SnowflakeResult` and `SnowflakeRows` in the `GetStatus()` function.
* Added the `CrlDownloadMaxSize` parameter to limit the size of CRL downloads.
* Added official support for RHEL9 (Red Hat Enterprise Linux 9).
* Improved log messages.
* Deprecated several configuration options and functions. For more information, see the [Upcoming Gosnowflake v2 changes](https://github.com/snowflakedb/gosnowflake/issues/1586).

### Bug fixes

* Fixed a bug where GCP PUT/GET operations would fail when the connection context was canceled.
* Fixed unsafe reflection of `nil` pointers on `DECFLOAT` function in the bind uploader.
* Added temporary download files cleanup.
* Added a small clarification in the `oauth.go` example on token escaping.
* Ensured proper permissions for CRL cache directory.
* Bypassed proxy settings for WIF metadata requests.
* Fixed `nil` pointer dereferences while calling long-running queries.
* Moved the keyring-based secure storage manager into a separate file to avoid the need to initialize keyring on Linux.

## Version 1.17.0 (Sep 29, 2025)

### New features and updates

* Added support for Go 1.25, dropped support for Go 1.22.
* Added ability to configure OCSP for each individual connection.
* Added `DECFLOAT` support. See the [gosnowflake documentation](https://pkg.go.dev/github.com/snowflakedb/gosnowflake) for details.
* Added proxy options to connection parameters.
* Added `client_session_keep_alive_heartbeat_frequency` connection paramameter.
* Added support for multi-part downloads for S3, Azure and Google Cloud.
* Added `Config.singleAuthenticationPrompt` to control authentication flow. When `true`, only one authentication is performed at a time, allowing for manual interactions such as MFA or OAuth. Default is `true`.

### Bug fixes

* Fixed missing `DisableTelemetry` option in `Config`.
* Fixed multistatements in large result sets.
* Fixed unnecessary retries when a context is cancelled.
* Fixed a regression in loading TOML connection files.
* Fixed race conditions in stage downloads.

## Version 1.16.0 (Aug 14, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `WorkloadIdentityProvider` connection parameter.
  + Added `AuthTypeWorkloadIdentityFederation` to the values for the `authenticator` connection parameter.
* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [CertRevocationCheckMode](https://pkg.go.dev/github.com/snowflakedb/gosnowflake#CertRevocationCheckMode). We recommend you test this feature in advisory mode before enabling it in production.
* Added support for opt-in single-use refresh tokens in the OAuth flow.
* Implemented a connectivity diagnostic tool.
* Added a session ID to logs produced by the connection and heartbeat modules.
* Added the `RegisterTLSConfig` function that lets you pass your own `TLSConfig` for the driver to use. Please use this function instead of modifying `SnowflakeTransport` directly.
* Removed the dependency to static list of root CAs for OCSP checking. Now, the default list of root CAs is used.

### Bug fixes

* Fixed an issue where error messages were not displayed while reading in structured types.
* Fixed a memory leak in the arrow batches example.
* Fixed issues with query cancellation.
* Removed the trailing slash from the default `RedirectUri` within the OAuth Authorization process.
* Fixed an issue with ignoring the maximum retry count when the timeout is not set.

## Version 1.15.0 (Jul 01, 2025)

### Private Preview (PrPr) features

Added support for workload identity federation in the AWS, Azure, GCP, and Kubernetes platforms.

Disclaimer:

* This feature can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use this feature only with non-production data.
* This PrPr feature is not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Added support for snake-case connection parameters.
* Optimized memory consumption during execution of PUT commands.

### Bug fixes

* Fixed an issue with permission handling for the `configuration.toml` file.

## Version 1.14.1 (May 28, 2025)

### New features and updates

* Added support for propagating OpenTelemetry contexts to GS.
* Added support for default client credentials in the OAuth authorization code flow.
* Moved OCSP initialization to the first HTTPS call.

### Bug fixes

* Aligned scan types and actually returned types for NUMBERs.
* Fixed an issue with `nil` dereferencing when an internal timeout happened (for instance for cloud provider call) when the original context was still valid.
* Fixed an issue with `nil` dereferencing during time out or canceling context race.
* Fixed encryption bugs where errors were never returned.
* Fixed downcast `smkId` to `int`, which caused decryption problems for very large stages.
* Fixed support for virtual style domains on GCP.
* Fixed the validation of the owner of the secure storage lock directory.

## Version 1.14.0 (Apr 30, 2025)

### New features and updates

* Implemented support for OAuth2 authorization code and client credential flows.
* Added support for PAT (programmatic access token):

  + Added the PROGRAMMATIC_ACCESS_TOKEN parameter for the parameter authenticator.
* Added support for virtual endpoints for GCP stages.

### Bug fixes

* Fixed the scan type for NUMBER columns when higher precision was enabled.

## Version 1.13.3 (Apr 28, 2025)

### Private Preview (PrPr) features

* Implemented support for OAuth2 authorization code and client credential flows.

Disclaimer:

* These features can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use these features only with non-production data.
* These PrPr features are not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* None.

### Bug fixes

* Fixed an issue with re-encrypting files for each request retry.
* Fixed Time-of-check Time-of-use (TOCTOU) race condition when checking access to Easy Logging configuration file. For more information, see [CVE-2025-46327](https://github.com/snowflakedb/gosnowflake/security/advisories/GHSA-6jgm-j7h2-2fqg).

## Version 1.13.2 (Mar 31, 2025)

### New features and updates

* Bumped the JWT library version from 5.2.1 to 5.2.2.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* Fixed PUT/GET handling when the query begins with a newline.
* Added more logging to certificate chain verification.
* Falling back to OCSP GET request only if the response for POST request was malformed.
* Fixed a memory leak related to not clearing OCSP cache.

## Version 1.13.1 (Mar 05, 2025)

### Private Preview (PrPr) features

Added support for PAT (programmatic access token) in Private Preview.

* Added the `PROGRAMMATIC_ACCESS_TOKEN` parameter for the parameter authenticator.

Disclaimer:

* This feature can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use these features only with non-production data.
* These PrPr features are not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Dropped support for Go 1.21 and added support for Go 1.24.
* Upgraded Arrow to v18.
* Added a log for JWT claims.

### Bug fixes

* Fixed error messages for HTTP retries.

## Version 1.13.0 (Jan 29, 2025)

### New features and updates

* The driver now handles UUID as varchars.
* The driver honors `driver.Valuer/fmt.Stringer` interfaces when binding parameters.
* The driver detects when a response is JSON-based and runs a regular chunk downloader when Arrow batches mode is enabled to allow fetching response as rows.
* Added a timeout configuration for cloud providers calls.
* Added support for GCS region-specific endpoints.
* Fixed minor documentation formatting.
* Added retry when calling HEAD requests to GCP.
* Bumped the x/crypto library to version v0.31.0.

### Bug fixes

* Fixed a memory leak in handling Arrow responses that caused leakage of 64 bytes of memory.
* Fixed an issue with ignoring the region when us-west-2 is used.
* Added a check for empty private key before trying to generate JWT from it.
* The driver uses the correct transport for cloud providers calls.
* The driver no longer performs OCSP calls for cloud providers when OCSP is disabled.

---
title: Go Snowflake Driver release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/golang-2026.md
section: Release Notes
---

# Go Snowflake Driver release notes for 2026

This article contains the release notes for the Go Snowflake Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Go Snowflake Driver updates.

See [Go Snowflake Driver](../../developer-guide/golang/go-driver.md) for documentation.

## Version 2.0.1 (Apr 08, 2026)

### Bug fixes

* Reduced the default `CrlDownloadMaxSize` setting from 200 MB to 20 MB to prevent potential out-of-memory errors.
* Fixed an issue where parameter values could change across connections in the same connection pool.
* Fixed Azure multi-part file uploads to properly populate the blob content-MD5 property.
* Fixed 403 errors from Google Cloud Storage PUT queries on versioned stages.
* Fixed the query context cache not being updated for failed queries, which could result in stale session data.
* Improved connection handling performance by optimizing parameter synchronization.

### Internal changes

* Moved configuration to a dedicated internal package.
* Modernized Go syntax idioms throughout the codebase.
* Added libc family, version, and dynamic linking marker to client environment telemetry.
* Updated dependencies to address security vulnerabilities:

  + `golang.org/x/crypto` from v0.41.0 to v0.46.0
  + `golang.org/x/net` from v0.43.0 to v0.48.0
  + `golang.org/x/oauth2` from v0.30.0 to v0.34.0
  + `golang.org/x/sys` from v0.35.0 to v0.40.0
  + `golang.org/x/mod` from v0.27.0 to v0.30.0
  + `golang.org/x/sync` from v0.16.0 to v0.19.0
  + `golang.org/x/term` from v0.34.0 to v0.38.0
  + `golang.org/x/text` from v0.28.0 to v0.32.0
  + `golang.org/x/tools` from v0.36.0 to v0.39.0
  + `google.golang.org/grpc` from v1.73.0 to v1.79.3
  + `google.golang.org/protobuf` from v1.36.6 to v1.36.10
  + OpenTelemetry packages from v1.37.0 to v1.40.0
* Removed pointer indirection from query context cache in `snowflakeConn`.

## Version 1.9.1 (Apr 08, 2026)

### New features and updates

* Added support for Go 1.26 and dropped support for Go 1.23.

### Bug fixes

* Fixed minicore crashes (SIGFPE) on fully statically linked Linux binaries by detecting static linking via ELF PT_INTERP inspection and skipping `dlopen` gracefully.

### Internal changes

* Added libc family, version, and dynamic linking marker to client environment telemetry.

## Version 2.0.0 (Mar 03, 2026)

### BCR (Behavior Change Release) changes

* Removed `RaisePutGetError` from `SnowflakeFileTransferOptions` to ensure errors are raised for PUT/GET operations.
* Removed `GetFileToStream` from `SnowflakeFileTransferOptions`. Use `WithFileGetStream` to automatically enable file streaming for GET operations.
* Removed `WithOriginalTimestamp`. Use `WithArrowBatchesTimestampOption(UseOriginalTimestamp)` instead.
* Removed the `ClientIP` field from the `Config` struct. This field was never used and is not needed for any functionality.
* Removed the `InsecureMode` field from `Config` struct. Use `DisableOCSPChecks` instead.
* Removed the `DisableTelemetry` field from the `Config` struct. Use the `CLIENT_TELEMETRY_ENABLED` session parameter instead.
* Removed the stream chunk downloader. Use the default downloader instead.
* Removed `SnowflakeTransport`. Use `Config.Transporter`, or simply register your own TLS configuration with `RegisterTLSConfig` if you just need a custom root certificates set.
* Renamed `WithFileStream` to `WithFilePutStream` for consistency.
* Renamed the `KeepSessionAlive` field in the `Config` struct to `ServerSessionKeepAlive` for consistency with other drivers.
* The `Array` function now returns an error for unsupported types.
* `WithMultiStatement` no longer returns an error.
* Combined `WithMapValuesNullable` and `WithArrayValuesNullable` into the single `WithEmbeddedValuesNullable` option.
* Hid the streaming chunk downloader. It will be removed completely in a future release.
* The maximum number of chunk download goroutines is now configured with the `CLIENT_PREFETCH_THREADS` session parameter.
* Fixed a typo in the `GOSNOWFLAKE_SKIP_REGISTRATION` environment variable.
* Unexported `MfaToken` and `IdToken`.
* Arrow batches changes:

  + Arrow batches have been extracted to a separate package, which should significantly reduce the compilation size for those who don’t need arrow batches (~34MB -> ~18MB).
  + Removed `GetArrowBatches` from `SnowflakeRows` and `SnowflakeResult`. Use `arrowbatches.GetArrowBatches(rows.(SnowflakeRows))` instead.
  + Migrated the following functions:

    - `sf.WithArrowBatchesTimestampOption` to `arrowbatches.WithTimestampOption`
    - `sf.WithArrowBatchesUtf8Validation` to `arrowbatches.WithUtf8Validation`
    - `sf.ArrowSnowflakeTimestampToTime` to `arrowbatches.ArrowSnowflakeTimestampToTime`
* Logging changes:

  + Removed the Logrus logger and migrated to slog.
  + Simplified the `SFLogger` interface.
  + Added the `SFSlogLogger` interface for setting a custom slog handler.

### New features and updates

* Added support for Go 1.26, and dropped support for Go 1.23.
* Added support for FIPS-only mode.

### Bug fixes

* Added a panic recovery block for stage file upload and download operations.
* Fixed a WIF metadata request from an Azure container that manifested as an HTTP 400 error.
* Fixed a SAML authentication port validation bypass in `isPrefixEqual` where the second URL’s port was never checked.
* Fixed a race condition in the OCSP cache clearer.
* The `context.Context` query is now propagated to cloud storage operations for PUT and GET queries, allowing for better cancellation handling.
* Fixed minicore crashes (SIGFPE) on fully statically linked Linux binaries by detecting static linking via ELF PT_INTERP inspection and skipping `dlopen` gracefully.

## Version 1.19.0 (Feb 03, 2026)

### New features and updates

* Exposed `tokenFilePath` in the `Config` struct, in addition to the existing DSN option.
* `tokenFilePath` is now read for every new connection, not only once at driver startup.
* Added support for identity impersonation when using workload identity federation.
* Added the ability to disable minicore from loading at compile time using the `-tags minicore_disabled` parameter.

### Bug fixes

* Fixed an issue with getting files from an unencrypted stage.
* Fixed the minicore file name gathering in client environment.
* Fixed path escaping for GCS URLs that manifested in 403 responses from GCS when a file or directory contained spaces.
* Fixed leaking file descriptors when uploading files to stages (especially in GCS).

---
title: GRANT and REVOKE Commands: Changes to the Output for a Failed Grant
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-515.md
section: Release Notes
---

# GRANT and REVOKE Commands: Changes to the Output for a Failed Grant

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The behavior of these commands is as follows:

* [GRANT APPLICATION ROLE](../../../sql-reference/sql/grant-application-role.md)
* [REVOKE APPLICATION ROLE](../../../sql-reference/sql/revoke-application-role.md)
* [GRANT <privileges> … TO APPLICATION ROLE](../../../sql-reference/sql/grant-privilege-application-role.md)
* [REVOKE <privileges> … FROM APPLICATION ROLE](../../../sql-reference/sql/revoke-privilege-application-role.md)
* [GRANT <privileges> … TO ROLE](../../../sql-reference/sql/grant-privilege.md)
* [REVOKE <privileges> … FROM ROLE](../../../sql-reference/sql/revoke-privilege.md)
* [GRANT <privilege> … TO SHARE](../../../sql-reference/sql/grant-privilege-share.md)
* [REVOKE <privilege> … FROM SHARE](../../../sql-reference/sql/revoke-privilege-share.md)
* [GRANT ROLE](../../../sql-reference/sql/grant-role.md)
* [REVOKE ROLE](../../../sql-reference/sql/revoke-role.md)
* [GRANT OWNERSHIP](../../../sql-reference/sql/grant-ownership.md)

Previously:
:   When you execute any of these commands and the operation does not work for one or more privileges or roles that you specify in the
    command, Snowflake formats the response as a “successful status message” (i.e. table) and indicates the relevant information.
    For example:

    ```sqlexample
    GRANT ALL ON ACCOUNT TO ROLE r1;
    ```

    ```output
    +--------------------------------------------------------------------------------------------------------------------------+
    | status                                                                                                                   |
    |--------------------------------------------------------------------------------------------------------------------------|
    | Grant partially executed: privileges [MANAGE LISTING AUTO FULFILLMENT, MANAGE ORGANIZATION SUPPORT CASES] not granted.   |
    +--------------------------------------------------------------------------------------------------------------------------+
    ```

    This output is a representative example of one of many possible messages when executing any of these commands.

Currently:
:   When you execute either of these commands and the operation does not work for one or more privileges or roles that you specify in the
    command, Snowflake formats the response as an error message, with the error code, and indicates the relevant information. For
    example:

    ```output
    003011 (42501): Grant partially executed: privileges [MANAGE LISTING AUTO FULFILLMENT, MANAGE ORGANIZATION SUPPORT CASES] not granted.
    ```

    The actual message text does not change.

    > **Tip:**
    >
    > If you have workflows that depend on the result of either of these commands, update your scripts to parse the error code information,
    > which is `003011 (42501)` in this example.
    >
    > The list of error codes that are affected by this change are:
    >
    > ```none
    > 003011: Grant partially executed: [ one or more privileges ] not granted.
    > 003012: Revoke partially executed: [ one or more privileges ] not revoked.
    > 003102: Grant not executed: Insufficient privileges.
    > 003103: Revoke not executed: Insufficient privileges.
    > 003104: Grant not executed: Operation not supported on a SHARE object.
    > 003105: Revoke not executed: Operation not supported on a SHARE object.
    > ```
    >
    > The value `(42501)` in the example reflects the SQL client the user chose to execute the command, which is the
    > [Snowflake Connector for Python](../../../developer-guide/python-connector/python-connector.md) in this example. This value might not show up depending on how you execute
    > the command (e.g. Snowsight does not return this value or the error code value `003011`).
    >
    > The `[ one or more privileges ]` value is a placeholder to return information about the statement that caused the error. In the
    > example, these placeholder shows that REFERENCE_USAGE privilege was not granted.

Ref: 515

---
title: Grant on Native Applications: Must grant access to tags and policies
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1274.md
section: Release Notes
---

# Grant on Native Applications: Must grant access to tags and policies

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

Using tags and policies on a reference database requires the REFERENCE_USAGE privilege on the reference
database.

Before the change:
:   If a database containing only tag or policy dependencies is used by objects shared with an
    application package, the provider is not required to explicitly grant the REFERENCE_USAGE privilege on
    the databases to the application package.

After the change:
:   If a database containing only tag or policy dependencies is used indirectly by objects shared with an
    application package, the provider must explicitly grant the REFERENCE_USAGE privilege on
    the databases to the application package.

    To grant the REFERENCE_USAGE privilege on a database to the application package, run the following command:

    ```sqlexample
    GRANT REFERENCE_USAGE ON DATABASE <name> TO SHARE IN APPLICATION PACKAGE <app_package>
    ```

    Where:

    `name`
    :   Specifies the identifier of the referenced database that contains a tag or policy.

    `app_package`
    :   Specifies the identifier for the application package to which the REFERENCE_USAGE privilege is
        being granted.

Ref: 1274

---
title: GRANT OWNERSHIP command: Ownership transfer not allowed for shared databases
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1181.md
section: Release Notes
---

# GRANT OWNERSHIP command: Ownership transfer not allowed for shared databases

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The [GRANT OWNERSHIP](../../../sql-reference/sql/grant-ownership.md) and [DROP ROLE](../../../sql-reference/sql/drop-role.md) commands behave as follows:

Previously:
:   When you grant the USAGE privilege on a database to a share, you can execute the GRANT OWNERSHIP command to transfer the OWNERSHIP
    privilege on the database to a different role. For example:

    ```sqlexample
    GRANT USAGE ON DATABASE mydb TO SHARE myshare;
    GRANT OWNERSHIP ON DATABASE mydb TO ROLE r2 REVOKE CURRENT GRANTS;
    ```

    Additionally, you can drop the role that has the OWNERSHIP privilege on the shared database:

    ```sqlexample
    DROP ROLE r2;
    ```

Currently:
:   You can transfer ownership of the shared database to a different role and use the COPY CURRENT GRANTS clause, however you cannot transfer
    ownership on the shared database to a different role and use the REVOKE CURRENT GRANTS clause. If you try to do this, Snowflake returns
    the following error message:

    ```output
    Cannot transfer ownership on a database that is granted to a share
    ```

    To avoid this error message and transfer the OWNERSHIP privilege to a different role, revoke the USAGE privilege on the database from the
    share, transfer the OWNERSHIP privilege on the database to a different role, and grant the USAGE privilege on the database to the share.
    For example:

    ```sqlexample
    REVOKE USAGE ON DATABASE mydb FROM SHARE myshare;
    GRANT OWNERSHIP ON DATABASE mydb TO ROLE r2;
    GRANT USAGE ON DATABASE mydb TO SHARE r2;
    ```

    Additionally, if you try to drop the role that has the OWNERSHIP privilege on the shared database, Snowflake returns the following error
    message with instructions on the actions to take:

    ```output
    Cannot drop a role that is the owner of one or more shared databases. Run 'SHOW GRANTS TO ROLE <role_name>' to find these shared
    databases and transfer their ownership to appropriate role using 'GRANT OWNERSHIP ON DATABASE <database_name> TO ROLE
    <target_role_name> COPY CURRENT GRANTS'.
    ```

Ref: 1181

---
title: GRANT OWNERSHIP ON ROLE command: Restrict transfer of role ownership to itself (Canceled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1781.md
section: Release Notes
---

# GRANT OWNERSHIP ON ROLE command: Restrict transfer of role ownership to itself (Canceled)

> **Attention:**
>
> This BCR is canceled and removed from the [2024_08 Bundle](../2024_08_bundle.md).

When this behavior change bundle is enabled, the transfer of role ownership will be restricted as follows:

Before the change:
:   Users can grant ownership of a role to the role itself. For example, the following GRANT statement is allowed:

    ```sqlexample
    GRANT OWNERSHIP ON ROLE my_role TO ROLE my_role;
    ```

After the change:
:   Users can no longer grant ownership of a role to the role itself. For example, the following GRANT statement returns an error:

    ```sqlexample
    GRANT OWNERSHIP ON ROLE my_role TO ROLE my_role;
    ```

    ```output
    003645 (42501): SQL execution error: Transferring OWNERSHIP of a role to itself is not allowed.
    ```

Ref: 1781

---
title: GRANT PRIVILEGES … TO ROLE command: Creating instances and privilege format
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1462.md
section: Release Notes
---

# GRANT PRIVILEGES … TO ROLE command: Creating instances and privilege format

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of the GRANT PRIVILEGES … TO ROLE command and the following classes has changed.

* ANOMALY_DETECTION
* BUDGET
* COMPARE
* FORECAST

Before the change:
:   * If you grant a privilege to create an instance of one class to a role, the role is automatically granted the privileges to create
      instances of other classes.
    * The name of the privilege to create an instance of a class is as follows:

      + CREATE ANOMALY_DETECTION
      + CREATE BUDGET
      + CREATE COMPARE
      + CREATE FORECAST
    * If you specify the `ALL` keyword to grant all privileges on a schema, such as
      `GRANT ALL PRIVILEGE ON SCHEMA db.sch TO ROLE r1`, the role is granted privileges on each class and allowed to create instances
      of each class.

After the change:
:   * The command only grants privileges on the class that is specified in the command. If you specify the `ALL` keyword to grant
      privileges on a schema, class privileges are not granted to the specified role.

      To allow a role to create an instance of a class, grant the corresponding privilege manually.
    * The format of the privilege to create an instance of a class is as follows:

      + CREATE SNOWFLAKE.ML.ANOMALY_DETECTION
      + CREATE SNOWFLAKE.CORE.BUDGET
      + CREATE SNOWFLAKE.ML.FORECAST

Ref: 1462

---
title: GRANT_TO_ROLES View (Account Usage): Privileges Added to View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1040.md
section: Release Notes
---

# GRANT_TO_ROLES View (Account Usage): Privileges Added to View

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The behavior of the GRANTS_TO_ROLES Account Usage view in the shared SNOWFLAKE database behaves as follows:

Previously:
:   Grants to a role that include the account-level APPLY MASKING POLICY or APPLY ROW ACCESS policy privilege are not recorded in this view.

Currently:
:   Grants to a role that include the account-level APPLY MASKING POLICY or APPLY ROW ACCESS policy privilege are recorded in this view.

Ref: 1040

---
title: GRANTS_TO_ROLES View (Account Usage): Match SHOW GRANTS TO ROLE command
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1481.md
section: Release Notes
---

# GRANTS_TO_ROLES View (Account Usage): Match SHOW GRANTS TO ROLE command

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of the Account Usage GRANTS_TO_ROLES view is as follows:

Before the change:
:   The privileges displayed in the output of the view is not consistent with a SHOW GRANTS TO ROLE command: the view does not list all of the grants to each role.

After the change:
:   The privileges displayed in the output of the view is consistent with a SHOW GRANTS TO ROLE command based on the set of enabled features in your Snowflake account.

Ref: 1481

---
title: GRANTS_TO_ROLES View: New Column in the Output and New Values for Existing Columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1240.md
section: Release Notes
---

# GRANTS_TO_ROLES View: New Column in the Output and New Values for Existing Columns

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The Account Usage GRANTS_TO_ROLES view in the shared SNOWFLAKE database behaves as follows:

Previously:
:   The output from a query on the GRANTS_TO_ROLES view does not include the OBJECT_INSTANCE column.

    Additionally, the following columns do not support the listed values:

    * The GRANTED_ON column does not support INSTANCE_ROLE as a possible value.
    * The TABLE_CATALOG column does not support the database that stores the instance of a class.
    * The TABLE_SCHEMA column does not support the schema that stores the instance of the class.

Currently:
:   The output from a query on the GRANTS_TO_ROLES view includes the OBJECT_INSTANCE column as the last ordinal column in the output.

    | Column | Data type | Description |
    | --- | --- | --- |
    | OBJECT_INSTANCE | STRING | The fully-qualified name of the object that contains the instance role for a particular class in the format database.schema.class. |

    Additionally, the following columns support the listed values:

    * The GRANTED_ON column supports INSTANCE_ROLE as a possible value to indicate the role associated with a particular class.
    * The TABLE_CATALOG column supports the database that stores the instance of a class.
    * The TABLE_SCHEMA column supports the schema that stores the instance of the class.

    For details about classes, instances, and the associated roles, see [Snowflake classes](../../../sql-reference/snowflake-db-classes.md).

Ref: 1240

---
title: Handling new columns in SHOW command output and Snowflake views
source: https://docs.snowflake.com/en/release-notes/behavior-changes-new-columns.md
section: Release Notes
---

# Handling new columns in SHOW command output and Snowflake views

Periodically, new columns will be introduced in the output of [SHOW <objects>](../sql-reference/sql/show.md) commands and in Snowflake views
(such as the views in the [ACCOUNT_USAGE schema](../sql-reference/account-usage.md) in the
[SNOWFLAKE database](../sql-reference/snowflake-db.md) and the views in the
[INFORMATION_SCHEMA schema](../sql-reference/info-schema.md)).

If you have a script or code that depends on the result set including a specific number of columns or that depend on the order
of the columns, the introduction of a new column might affect that script or code.

## Temporarily working around a problem introduced by a new column

If your script or code encounters problems due to the introduction of new columns, your Snowflake administrator (a user who has
been granted the ACCOUNTADMIN role) can change the columns that are returned for executions of a specific SHOW command or SELECT \*
queries of a Snowflake view. These columns are referred to as the *default columns*.

### Overriding the default columns for a SHOW command

To exclude newly introduced columns from the output of a SHOW command, call the
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](../sql-reference/functions/system_set_default_columns_override_for_show_command.md) function, specifying the type of object and
the list of columns that should be returned.

Suppose that a new `direction` column has been introduced in the output of the
[SHOW NOTIFICATION INTEGRATIONS](../sql-reference/sql/show-notification-integrations.md) command. To prevent the new `direction` column from being included in
the output of the command, call SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND, specifying `'NOTIFICATION INTEGRATIONS'`
as the type of object. Pass in a comma-separated list of the columns that should be returned in the output (a list that excludes
`direction`):

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'NOTIFICATION INTEGRATIONS',
  'name, type, category, enabled, comment, created_on'
);
```

When anyone in your account runs the SHOW NOTIFICATION INTEGRATIONS command, the new `direction` column will not be returned in
the output.

```sqlexample
SHOW NOTIFICATION INTEGRATIONS;
```

```output
+--------------------------------+---------+--------------+---------+---------+-------------------------------+
| name                           | type    | category     | enabled | comment | created_on                    |
|--------------------------------+---------+--------------+---------+---------+-------------------------------|
| SLACK_NOTIFICATION_INTEGRATION | WEBHOOK | NOTIFICATION | true    | NULL    | 2025-07-02 06:14:53.859 -0700 |
+--------------------------------+---------+--------------+---------+---------+-------------------------------+
```

### Resetting the default columns for a SHOW command

If you need to undo a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND call and return all columns in the SHOW
command for a specific object type, call the
[SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](../sql-reference/functions/system_unset_default_columns_override_for_show_command.md) function, specifying the type of object.
For example:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'NOTIFICATION INTEGRATIONS'
);
```

### Getting the list of default columns for a SHOW command

If you need to determine if SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was called for a specific object type and you
want the list of columns that will be returned in the output of the command, call the
[SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND](../sql-reference/functions/system_get_default_columns_override_for_show_command.md) function, specifying the type of object. For
example:

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'NOTIFICATION INTEGRATIONS'
);
```

```output
+-------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND( |
|   'NOTIFICATION INTEGRATIONS'                         |
| )                                                     |
|-------------------------------------------------------|
| name,type,category,enabled,comment,created_on         |
+-------------------------------------------------------+
```

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was not previously called or if
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND was called, the function returns an empty string.

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'NOTIFICATION INTEGRATIONS'
);

SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND(
  'NOTIFICATION INTEGRATIONS'
);
```

```output
+-------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SHOW_COMMAND( |
|   'NOTIFICATION INTEGRATIONS'                         |
| )                                                     |
|-------------------------------------------------------|
|                                                       |
+-------------------------------------------------------+
```

### Overriding the default columns for a Snowflake view

To exclude newly introduced columns from the results of a `SELECT *` query of a Snowflake view, call the
[SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](../sql-reference/functions/system_set_default_columns_override_for_system_object.md) function, specifying the type of object, the
database and schema containing the view, the name of the view, and the list of columns that should be returned.

Suppose that a new `replicable_with_failover_groups` column has been introduced in the
[DATABASES view in the ACCOUNT_USAGE schema](../sql-reference/account-usage/databases.md). To prevent the new
`replicable_with_failover_groups` column from being returned in the results of a `SELECT *` query of the view,
call SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT, specifying `'VIEW'` as the type of object, `'SNOWFLAKE'` as the
database, `'ACCOUNT_USAGE'` as the schema, and `'TABLES'` as the view. Pass in a comma-separated list of the columns that
should be returned in the output (a list that excludes `replicable_with_failover_groups`):

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'DATABASES',
  'database_id, database_name, database_owner, is_transient, ' ||
  'comment, created, last_altered, deleted, retention_time, '  ||
  'resource_group, type, owner_role_type, object_visibility'
);
```

The example uses the [||](../sql-reference/functions/concat.md) operator to construct a string that contains the comma-separated
list of columns.

When anyone in your account performs a `SELECT *` query of the DATABASES view, the new `replicable_with_failover_groups`
column will not be returned in the output.

```sqlexample
SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.DATABASES;
```

```output
+-------------+---------------+----------------+--------------+---------+-------------------------------+-------------------------------+-------------------------------+----------------+----------------+----------+-----------------+-------------------+
| DATABASE_ID | DATABASE_NAME | DATABASE_OWNER | IS_TRANSIENT | COMMENT | CREATED                       | LAST_ALTERED                  | DELETED                       | RETENTION_TIME | RESOURCE_GROUP | TYPE     | OWNER_ROLE_TYPE | OBJECT_VISIBILITY |
|-------------+---------------+----------------+--------------+---------+-------------------------------+-------------------------------+-------------------------------+----------------+----------------+----------+-----------------+-------------------|
|          55 | MY_DATABASE   | NULL           | NO           | NULL    | 2025-07-16 15:17:55.990 -0700 | 2025-07-17 15:19:52.305 -0700 | 2025-07-16 15:18:32.973 -0700 |              1 | NULL           | STANDARD | NULL            | NULL              |
+-------------+---------------+----------------+--------------+---------+-------------------------------+-------------------------------+-------------------------------+----------------+----------------+----------+-----------------+-------------------+
```

If you need to call this function for an INFORMATION_SCHEMA view, pass in an empty string for the database name. For example, to
exclude the `replicable_with_failover_groups` column from the results of `SELECT *` queries of the
[DATABASES view in the INFORMATION_SCHEMA schema](../sql-reference/info-schema/databases.md):

```sqlexample
SELECT SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'INFORMATION_SCHEMA',
  'DATABASES',
  'database_name, database_owner, is_transient, comment, ' ||
  'created, last_altered, retention_time, type, '          ||
  'owner_role_type'
);
```

### Resetting the default columns for a Snowflake view

If you need to undo a previous SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT call and return all columns in a
`SELECT *` query of a Snowflake view, call the
[SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](../sql-reference/functions/system_unset_default_columns_override_for_system_object.md) function, specifying the type of object,
the database and schema that contain the view, and the name of the view. For example:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'DATABASES'
);
```

If you need to call this function for an INFORMATION_SCHEMA view, pass in an empty string for the database name. For example:

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'INFORMATION_SCHEMA',
  'DATABASES'
);
```

### Getting the list of default columns for a Snowflake view

If you need to determine if SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was called for a specific view and you
want the list of columns that will be returned in a `SELECT *` query of that view, call the
[SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT](../sql-reference/functions/system_get_default_columns_override_for_system_object.md) function, specifying the type of object, the
database and schema containing the view, and the name of the view. For example:

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'DATABASES'
);
```

```output
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(                                                                                                          |
|   'VIEW',                                                                                                                                                       |
|   'SNOWFLAKE',                                                                                                                                                  |
|   'ACCOUNT_USAGE',                                                                                                                                              |
|   'DATABASES'                                                                                                                                                   |
| )                                                                                                                                                               |
|-----------------------------------------------------------------------------------------------------------------------------------------------------------------|
| DATABASE_ID,DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,DELETED,RETENTION_TIME,RESOURCE_GROUP,TYPE,OWNER_ROLE_TYPE,OBJECT_VISIBILITY |
+-----------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

If you need to call this function for an INFORMATION_SCHEMA view, pass in an empty string for the database name. For example:

```sqlexample
SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  '',
  'INFORMATION_SCHEMA',
  'DATABASES'
);
```

```output
+------------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(                                                     |
|   'VIEW',                                                                                                  |
|   '',                                                                                                      |
|   'INFORMATION_SCHEMA',                                                                                    |
|   'DATABASES'                                                                                              |
| )                                                                                                          |
|------------------------------------------------------------------------------------------------------------|
| DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,RETENTION_TIME,TYPE,OWNER_ROLE_TYPE |
+------------------------------------------------------------------------------------------------------------+
```

If SYSTEM$SET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was not previously called or if
SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT was called, the function returns an empty string.

```sqlexample
SELECT SYSTEM$UNSET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'DATABASES'
);

SELECT SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT(
  'VIEW',
  'SNOWFLAKE',
  'ACCOUNT_USAGE',
  'DATABASES'
);
```

```output
+--------------------------------------------------------+
| SYSTEM$GET_DEFAULT_COLUMNS_OVERRIDE_FOR_SYSTEM_OBJECT( |
|   'VIEW',                                              |
|   'SNOWFLAKE',                                         |
|   'ACCOUNT_USAGE',                                     |
|   'DATABASES'                                          |
| )                                                      |
|--------------------------------------------------------|
|                                                        |
+--------------------------------------------------------+
```

### Getting the list of columns from all previous calls for SHOW commands and Snowflake views

To get the list of columns that are overridden for all SHOW commands and Snowflake views, call the
[SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](../sql-reference/functions/system_get_all_default_columns_overrides.md) function. For example:

```sqlexample
SELECT SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES();
```

The function returns a string containing a JSON array of objects. Each object represents the list of columns for a specific SHOW
command or Snowflake view. For example:

```output
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES()                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| [{"domain":"VIEW","isShowCommand":false,"dbName":"","schemaName":"INFORMATION_SCHEMA","objectName":"DATABASES","serializedDefaultColumns":"DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,RETENTION_TIME,TYPE,OWNER_ROLE_TYPE"},{"domain":"VIEW","isShowCommand":false,"dbName":"SNOWFLAKE","schemaName":"ACCOUNT_USAGE","objectName":"DATABASES","serializedDefaultColumns":"DATABASE_ID,DATABASE_NAME,DATABASE_OWNER,IS_TRANSIENT,COMMENT,CREATED,LAST_ALTERED,DELETED,RETENTION_TIME,RESOURCE_GROUP,TYPE,OWNER_ROLE_TYPE,OBJECT_VISIBILITY"},{"isShowCommand":true,"showCommandType":"NOTIFICATION INTEGRATIONS","serializedDefaultColumns":"name,type,category,enabled,comment,created_on"}] |
+--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
```

For an explanation of the name/value pairs in each object, see
[SYSTEM$GET_ALL_DEFAULT_COLUMNS_OVERRIDES](../sql-reference/functions/system_get_all_default_columns_overrides.md).

## Updating scripts and code to prevent problems when new columns are introduced

To prevent problems from occurring due to the introduction of new columns, your scripts and code should select specific columns
from the output of SHOW commands and when querying Snowflake views.

To select specific columns from the output of SHOW commands, you can use the
[pipe operator](../sql-reference/operators-flow.md). See the example in [Select a list of columns for the output of a SHOW command](../sql-reference/operators-flow.md).

---
title: Information Schema: New Column in DATABASES View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1032.md
section: Release Notes
---

# Information Schema: New Column in DATABASES View

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

A column has been added to the DATABASES view in the INFORMATION_SCHEMA schema:

| Column Name | Data Type | Description |
| --- | --- | --- |
| TYPE | VARCHAR | Specifies the type of database. Valid values are: . - STANDARD: Specifies a normal database. . - IMPORTED DATABASE: Specifies a database that is created from a share. |

Ref: 1032

---
title: Information Schema: New columns in output for QUERY_HISTORY, QUERY_HISTORY_BY_* functions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1431-1524-1540.md
section: Release Notes
---

# Information Schema: New columns in output for QUERY_HISTORY, QUERY_HISTORY_BY_\* functions

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

When this behavior change bundle is enabled, the output of the [QUERY_HISTORY , QUERY_HISTORY_BY_\*](../../../sql-reference/functions/query_history.md) functions
includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| TRANSACTION_ID | NUMBER | [ID of the transaction](../../../sql-reference/transactions.md) that contains the statement or `0` if the statement is not executed within a transaction. |
| QUERY_ACCELERATION_BYTES_SCANNED | NUMBER | Number of bytes scanned by the [query acceleration service](../../../user-guide/query-acceleration-service.md). |
| QUERY_ACCELERATION_PARTITIONS_SCANNED | NUMBER | Number of partitions scanned by the query acceleration service. |
| QUERY_ACCELERATION_UPPER_LIMIT_SCALE_FACTOR | NUMBER | Upper limit [scale factor](../../../user-guide/query-acceleration-service.md) that a query would have benefited from. |
| BYTES_WRITTEN_TO_RESULT | NUMBER | Number of bytes written to a result object. For example, `select * from ...` would produce a set of results in tabular format representing each field in the selection.  In general, the results object represents whatever is produced as a result of the query, and BYTES_WRITTEN_TO_RESULT represents the size of the returned result. |
| ROWS_WRITTEN_TO_RESULT | NUMBER | Number of rows written to a result object. For CREATE TABLE AS SELECT (CTAS) and all DML operations, this result is `1`. The values in the ROWS_INSERTED, ROWS_UPDATED, and ROWS_DELETED columns reflect the number of rows actually inserted, updated, or deleted.  For more information, see [ROWS_PRODUCED column deprecated](bcr-1497-1524-1540.md). |
| ROWS_INSERTED | NUMBER | Number of rows inserted by the query. |
| QUERY_RETRY_TIME | NUMBER | Total execution time (in milliseconds) for query retries caused by actionable errors. For more information, see [Query retry columns](bcr-1497-1524-1540.md). |
| QUERY_RETRY_CAUSE | VARCHAR | Error that caused the query to retry. If there is no query retry, the field is NULL. For more information, see [Query retry columns](bcr-1497-1524-1540.md). |
| FAULT_HANDLING_TIME | NUMBER | Total execution time (in milliseconds) for query retries caused by errors that are *not* actionable. For more information, see [Query retry columns](bcr-1497-1524-1540.md). |

These columns are added as the last (right-most) columns in the output.

For more information, see also [QUERY_HISTORY view (Account Usage): Changes to columns and new columns](bcr-1497-1524-1540.md).

Ref: 1431, 1524, 1540

---
title: Ingest Java SDK release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/ingest-java-sdk.md
section: Release Notes
---

# Ingest Java SDK release notes

The Ingest Java SDK release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](ingest-java-sdk-2026.md)
* [2025 releases](ingest-java-sdk-2025.md)
* [2024 releases](ingest-java-sdk-2024.md)
* [2023 releases](ingest-java-sdk-2023.md)

---
title: Ingest Java SDK release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/ingest-java-sdk-2023.md
section: Release Notes
---

# Ingest Java SDK release notes for 2023

This article contains the release notes for the Ingest Java SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Ingest Java SDK updates.

## Version 2.0.4 (October 31, 2023)

### New features and updates

* Supported a new ON_ERROR option SKIP_BATCH, which skips the entire batch if there is any issue and returns all errors as part of the response.
* Added row index information to all exceptions.
* Upgraded snappy-java dependency.
* Added a new interface to return the table schema information for a channel.
* Added a new configuration option MAX_CLIENT_LAG that specifies the flush frequency, in seconds (default: 1).

### Bug fixes

* Fixed an issue with using `snowflake-jdbc-fips`.
* Fixed a rare `ConcurrentModificationException` issue.
* Fixed two issues in `insertRows` API that might cause wrong results in a very rare case.
* Limited the maximum allowed number of chunks in blob to avoid the case when the request is too large.

## Version 2.0.3 (August 31, 2023)

### New features and updates

* Supported OAuth authentication.
* Removed exactly-once related code for Snowpipe.
* Supported publishing unshaded snapshot release to the Nexus repo.
* Added retry logic for invalid JWT tokens.
* Added a warning for large batches in `insertRows`.

### Bug fixes

* Fixed a NPE issue caused by race condition.

## Version 2.0.2 (July 25, 2023)

### New features and updates

* Updated dependencies based on Wiz and Snyk vulnerability scan results.
* Improved retry logic on exceptions like `SSLException`.
* Made the role as an optional input and supported using the default role associated with the user.
* Sent uncompressed chunk lengths to server side for tracking purpose.

### Bug fixes

* None.

## Version 2.0.1 (June 14, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an unexpected dependency behavior for Snowflake JDBC.

## Version 2.0.0 (June 13, 2023)

### New features and updates

* Supported Snowpipe Streaming GA release.
* Improved the dependencies for shading and relocating logic.
* Made a few parameters to configure channel/chunk/file size limits.
* Added more telemetries to track end-to-end latency.
* Supported GCS downscoped token.
* Cleaned up all Arrow related code.
* Added an attribution notice.
* Enforced allowed DATE and TIMESTAMP range.
* Exposed more error messages for server-side channel invalidation for customers to self-mitigate.

### Bug fixes

* Fixed an issue where some background threads are not stopped during exception.

---
title: Ingest Java SDK release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/ingest-java-sdk-2024.md
section: Release Notes
---

# Ingest Java SDK release notes for 2024

This article contains the release notes for the Ingest Java SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Ingest Java SDK updates.

## Version 3.0.1 (December 4, 2024)

### Bug fixes

* Fixed an issue with schema evolution for structured data type.
* Fixed an issue with loading files greater than 16 MB using the MD5 hash.
* Upgraded io.netty to fix a security vulnerability.

## Version 3.0.0 (November 12, 2024)

### New features and updates

* With this release, [Snowpipe Streaming](../../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) can ingest data into Snowflake-managed [Apache Iceberg](../../user-guide/tables-iceberg.md) tables.

### Bug fixes

* Fixed dependency issues and error messages in the SDK.

## Version 2.3.0 (October 11, 2024)

### BCR (Behavior Change Release) changes

* Beginning with release 2.3.0, numeric values preserve their format. The numeric values will not be converted to and from scientific notation.

### New features and updates

* Made updates to support a new table format.

### Bug fixes

* Fixed vulnerable dependencies.
* Upgraded Hadoop to fix vulnerability issues.
* Removed unnecessary dependencies to reduce JAR size.

## Version 2.2.2 (September 12, 2024)

### Bug fixes

* Fixed a critical issue by updating the location for the file name in metadata.

## Version 2.2.1 (September 05, 2024)

### New features and updates

* Added `ExternalVolumeManager` to support multiple stages for a new table format.
* Upgraded dependency versions.
* Updated parameters to support a new table format.

## Version 2.2.0 (August 09, 2024)

### New features and updates

* Improved code logic to support different storage volumes.

### Bug fixes

* Fixed a critical issue that could potentially cause conflicts when `change_tracking` is enabled for streams and dynamic tables.

> **Note:**
>
> For all Snowpipe Streaming usage, Snowflake recommends using the Ingest Java SDK version 2.2.0 or later.

## Version 2.1.2 (July 29, 2024)

### New features and updates

* Improved `InsertRows` performance.
* Added or improved various logs for better observability.
* Fine-tuned channel and chunk sizes.

### Bug fixes

* Fixed an issue with failover across deployments.

## Version 2.1.1 (May 09, 2024)

### New features and updates

* Returned more detailed error messages for the `INVALID_CHANNEL` error.
* Added support for external OAuth 2.0.

### Bug fixes

* Upgraded several dependencies, including vulnerability fixes.
* Fixed an issue where HTTP connections are leaked due to error responses.
* Relaxed the file size constraints to deal with issues where longer client flush lags produce larger files.

## Version 2.1.0 (February 28, 2024)

### BCR (Behavior Change Release) changes

* Set Zstandard as the default compression algorithm.

### New features and updates

* Allowed clients to drop channels.
* Upgraded JDBC to 3.14.5.
* Implemented a change for sending the start and end offset tokens for a channel.
* Implemented a change for sending the column ordinal data to the server side for cross-checking table schema changes.
* Added support to pass verification logic for a user-defined offset token as part of channel creation.

### Bug fixes

* Fixed an overflow issue that caused silent data issue.

## Version 2.0.5 (January 22, 2024)

### New features and updates

* Added an optional offset token parameter for `openChannel`.
* Added support for specifying compression algorithm to be used for BDEC Parquet files.
* Updated to support customized URL and added Snowflake account name in request header.
* Implemented a change to send `spansMixedTables` flag in blob registration requests.
* Deprecated BUFFER_FLUSH_INTERVAL_IN_MILLIS parameter, instead use the MAX_CLIENT_LAG parameter.
* Implemented the refresh of downscoped GCS tokens.

### Bug fixes

* Reverted one change that updated public API for internal use case.
* Fixed the end-to-end JAR test so it can run on all cloud platforms.

---
title: Ingest Java SDK release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/ingest-java-sdk-2025.md
section: Release Notes
---

# Ingest Java SDK release notes for 2025

This article contains the release notes for the Ingest Java SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Ingest Java SDK updates.

## Version 4.4.0 (November 19, 2025)

### New features and updates

* Data ingestion reliability update: To prevent failed data ingestion after system infrastructure changes, Snowflake updated the client to be more responsive to deployment mismatches. If the client detects that the primary deployment URL, storage location, or encryption key has changed, it now automatically closes the connection. This immediately forces you to recreate your client, ensuring that you use the correct, updated deployment credentials and location, thereby guaranteeing successful and reliable data ingestion.

### Bug fixes

* Security: Upgraded the Bouncy Castle FIPS (bc-fips) dependency to v2.1.1 to resolve potential security vulnerabilities.

## Version 4.3.2 (November 03, 2025)

### Bug fixes

* Fixed an issue that blocked Iceberg table ingestion when the external volume encryption is set to `NONE`.

## Version 4.3.1 (October 08, 2025)

### New features and updates

* Enhanced cloud security: Snowpipe Streaming now fully supports server-side encryption with Amazon Web Services (AWS) Key Management Service (SSE-KMS) configured on your external AWS S3 and Google Cloud Storage volumes. This enhancement ensures that data uploaded during ingestion uses your required, higher-grade KMS encryption policy, moving beyond the previously hardcoded default encryption.

### Bug fixes

* Fixed vulnerable dependencies and cleaned up internal dependency workarounds.

## Version 4.3.0 (August 21, 2025)

### Bug fixes

* Fixed vulnerable dependencies.

## Version 4.2.0 (August 18, 2025)

### New features and updates

* Improved the reliability of streaming ingest into Iceberg tables, ensuring that your data is consistently uploaded to the correct location.
* Improved how the SDK manages table keys, which ensures that our system stays in sync and helps maintain the stability and security of your tables.
* Improved system stability for high-volume data by allowing connections to retry for up to five minutes, preventing immediate closures.

## Version 4.1.0 (June 11, 2025)

### BCR (Behavior Change Release) changes

* Beginning with release 4.1.0, the Ingest Java SDK includes a behavior change to JSON handling to improve data integrity and performance. See the following list for details:

  + Added robust validation to detect and prevent duplicate JSON object fields, including those with trailing null terminators.
  + All JSON keys and values are now strictly enforced to be valid UTF-8, which improves data integrity and compatibility.
  + Optimized the JSON serialization process to directly convert objects into JSON strings, bypassing an intermediate conversion step. This results in improved performance and reduced memory usage.

## Version 4.0.1 (June 06, 2025)

### New features and updates

* Upgraded the JDBC version to 3.24.2.

## Version 4.0.0 (April 14, 2025)

### BCR (Behavior Change Release) changes

* Beginning with release 4.0.0, the Ingest Java SDK now uses the Snowflake JDBC thin JAR instead of the fat JAR.

### New features and updates

* Updated dependencies and imports for Snowflake JDBC thin JAR.
* Removed unnecessary dependencies.
* Enhanced Channel Invalidation Handling. The `channel` object now automatically invalidates itself upon receiving a response from the server indicating an invalid channel state. This improvement enhances error handling and resource management within the SDK.

## Version 3.1.2 (March 17, 2025)

### Bug fixes

* Fixed issues with the filename mismatch for Iceberg ingestion.

## Version 3.1.1 (February 27, 2025)

### New features and updates

* Made updates to silence the exception log in the JDBC driver.

### Bug fixes

* Fixed issues with the Jenkins job to push artifacts to Maven.
* Fixed the proxy settings for the OAuth HTTP client.
* Fixed a Java formatter script and its dependencies.

## Version 3.1.0 (February 24, 2025)

### BCR (Behavior Change Release) changes

* Beginning with release 3.1.0, any duplicate keys in variant columns result in client-side errors with the `INVALID_VALUE_ROW` error code.

### New features and updates

* Upgraded the JDBC version to 3.22.0.
* Upgraded the Netty version to 4.1.118.

---
title: Ingest Java SDK release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/ingest-java-sdk-2026.md
section: Release Notes
---

# Ingest Java SDK release notes for 2026

This article contains the release notes for the Ingest Java SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Ingest Java SDK updates.

## Version 4.4.2 (January 12, 2026)

### Bug fixes

* Security update: Updated core networking libraries to resolve a known vulnerability in the netty-codec-http component.
* System stability: Refreshed several internal dependencies to ensure compatibility and improve overall application reliability.

## Version 4.4.1 (January 06, 2026)

### Bug fixes

* Fixed an issue where ingesting repeated fields (arrays) containing multiple null entries would cause a validation error. The ingestion process now correctly handles these structures, ensuring data flows smoothly without unnecessary failures.

---
title: Integrations: Read-only Secondary Integrations Enforced
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-906.md
section: Release Notes
---

# Integrations: Read-only Secondary Integrations Enforced

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

A replication or failover group can include integrations by adding the INTEGRATIONS type to its OBJECT_TYPES list and specifying the
integration type in the ALLOWED_INTEGRATION_TYPES list. Object types that are included in a replication or failover group are read-only
in target accounts. For example, if the USERS type is included, users cannot be created, modified, or deleted in a target account.

This behavior is now enforced for integration types included in replication or failover groups as follows:

Previously:
:   Integration types that were replicated from a source account to a target account in a replication or failover group could
    be modified in the target account.

    For example, if INTEGRATIONS was included in the OBJECT_TYPES list for a replication or failover group, and API INTEGRATIONS was
    included in the ALLOWED_INTEGRATION_TYPES list, API integrations could be created, modified, or deleted in target accounts.

Currently:
:   Integration types that are replicated from a source account to a target account in a replication or failover group are now
    read-only in the target account.

    For example, if INTEGRATIONS is included in the OBJECT_TYPES list for a replication or failover group, and API INTEGRATIONS is included
    in the ALLOWED_INTEGRATION_TYPES list, API integrations are read-only in target accounts. Consequently, API integrations cannot be
    created, modified, or deleted in target accounts.

Ref: 906

---
title: IS_DATABASE_ROLE_IN_SESSION: Name resolution with policy and UDF evaluation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1499.md
section: Release Notes
---

# IS_DATABASE_ROLE_IN_SESSION: Name resolution with policy and UDF evaluation

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of the [IS_DATABASE_ROLE_IN_SESSION](../../../sql-reference/functions/is_database_role_in_session.md) function with a masking policy, row access policy, and
UDF is as follows:

Before the change:
:   * You cannot use the fully-qualified name of the database role as an argument to the function, in the format
      `database_name.database_role_name`, unless the database name is the same database that contains the policy or UDF.
    * The function evaluation depends on whether the database role exists in the specified database. If you specify a relative name as an
      argument to the function, the function always evaluates to the database that contains the policy or UDF; the database role must be in
      the same database as the policy or UDF.

After the change:
:   * You can use the fully-qualified name of the database role as an argument, however, the function always evaluates to `False`.
    * When you specify the relative name of the database role as an argument, the function checks to see if the database role is in the same
      database as the protected table or the database that contains the UDF.

    If your UDF or policy conditions call the function, confirm that the database roles exist in the same database as the UDF or protected
    table. If necessary, recreate the database roles in the database that contains the UDF or protected table.

    > **Important:**
    >
    > If you are using this function with Secure Data Sharing, it is important that both the provider and consumer either enable the
    > bundle or disable the bundle to ensure consistent behavior.

Ref: 1499

---
title: IS_GRANTED_TO_INVOKER_ROLE Function: Change to the Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-984.md
section: Release Notes
---

# IS_GRANTED_TO_INVOKER_ROLE Function: Change to the Output

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The behavior of the IS_GRANTED_TO_INVOKER_ROLE function is as follows:

Previously:
:   The function evaluates to FALSE when the argument corresponds to a role in the following context:

    * Calling the function by itself: the role hierarchy of the active, primary role (i.e. CURRENT_ROLE()).
    * Stored procedure caller’s rights.
    * Stored procedure owner’s rights.
    * Task owner.

    If the function is used in a masking policy that protects a table, the function evaluates the policy owner’s role hierarchy. If the role in
    the function argument is in the role hierarchy of the role that owns the policy, the function evaluates to TRUE. Otherwise, the function
    evaluates to FALSE.

Currently:
:   The function evaluation uses the primary role hierarchy for the session in the following contexts:

    * Calling the function by itself.
    * Table
    * Stored procedure with caller’s rights.

    If the function is used in a stored procedure with owner’s rights, the function evaluates the role hierarchy of the role that owns the
    stored procedure with owner’s rights.

    Similarly, if the function is used with a task, the function evaluates the role hierarchy of the role that owns the task.

To identify existing masking policies that use the IS_GRANTED_TO_INVOKER_ROLE function, execute the following statements:

```sqlexample
USE ROLE ACCOUNTADMIN;

SELECT *
    FROM snowflake.account_usage.masking_policies
    WHERE policy_body ilike '%is_granted_to_invoker_role%';
```

Ref: 984

---
title: Jan 06, 2025: Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-06-notebooks-wh-aws-azure-pl.md
section: Release Notes
---

# Jan 06, 2025: Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link (*General availability*)

We are pleased to announce the general availability of support for AWS PrivateLink and Microsoft Azure Private Link in Snowflake Notebooks.

Snowflake Notebooks is a development interface in Snowsight that offers an interactive, cell-based programming environment for Python and SQL. In Snowflake Notebooks, you can perform exploratory data analysis, develop machine learning models, and perform other data science and data engineering tasks all in one place.

For more information, see [Private connectivity for Notebooks](../../../user-guide/ui-snowsight/notebooks-privatelink.md).

---
title: Jan 07, 2025: Snowflake Cortex Playground (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-07-cortex-llm-playground.md
section: Release Notes
---

# Jan 07, 2025: Snowflake Cortex Playground (*Preview*)

Snowflake is pleased to announce the preview of the [Cortex LLM Playground](../../../user-guide/snowflake-cortex/cortex-playground.md),
a no-code chat interface for LLM prompt experimentation.

The Cortex LLM Playground lets you perform side-by-side comparisons of text completions between any two language models
available in Cortex AI or different settings of a single model. You can easily connect a Snowflake table to compare
language model responses for an individual or multiple records (each in a separate row) across different prompt
configurations. After you are satisfied with the output of the LLM for a given prompt, you can easily copy the generated
SQL code to a worksheet or integrate it into your pipeline, like any other SQL statement.

---
title: Jan 07, 2026: Reorganized UI for listings (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-07-listings-ui-reorganization.md
section: Release Notes
---

# Jan 07, 2026: Reorganized UI for listings (*General availability*)

The Snowsight UI for listings has been reorganized to make it easier to manage your data products.

## Changes to Provider Studio

Provider Studio is now located under the **Marketplace** section. In Provider Studio, you can create and manage Snowflake Marketplace listings and listings for specified consumers. You no longer create Internal Marketplace (organizational) listings in Provider Studio.

For more information, see [About Snowflake Marketplace](../../../collaboration/collaboration-marketplace-about.md).

## Changes to Internal Marketplace listings

Internal Marketplace (organizational) listings remain in the **Data sharing** section.

* **To create an organizational listing on the Internal Marketplace**: In the navigation menu, select Data sharing » Internal sharing.
* **(Providers) To view organizational listings that you’re sharing**: In the navigation menu, select Data sharing » External sharing, and then select Shared by you.
* **(Consumers) To view organizational listings that are shared with you** (consumers): In the navigation menu, select Data sharing » External sharing, and then
  select Shared with you.

For more information, see [About organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md)

---
title: Jan 08, 2026: Tri-Secret Secure in China (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-08-tss-available-china.md
section: Release Notes
---

# Jan 08, 2026: Tri-Secret Secure in China (*General availability*)

Tri-Secret Secure is now generally available in the China region.
Please note that external key stores are not supported in the China region.

For more information, see [Tri-Secret Secure in Snowflake](../../../user-guide/security-encryption-tss.md).

---
title: Jan 12, 2026: Specifying custom instructions in semantic views
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-12-semantic-views-custom-instructions.md
section: Release Notes
---

# Jan 12, 2026: Specifying custom instructions in semantic views

When defining a [semantic view](../../../user-guide/views-semantic/overview.md), you can now provide
[instructions for Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst/custom-instructions.md) that explain how to:

* Generate the SQL statement
* Classify questions and prompt for additional information

In the [CREATE SEMANTIC VIEW](../../../sql-reference/sql/create-semantic-view.md) command, you can use the AI_SQL_GENERATION and AI_QUESTION_CATEGORIZATION
clauses to specify instructions for generating the SQL statement and classifying questions.

For more information, see [Providing custom instructions for Cortex Analyst](../../../user-guide/views-semantic/sql.md).

---
title: Jan 14, 2026: Workspaces replication (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-14-workspace-replication.md
section: Release Notes
---

# Jan 14, 2026: Workspaces replication (*Preview*)

Workspaces replication, which allows user workspaces to be included in database replication and failover operations, is now available in [preview](../../preview-features.md).
When a workspace or its owning user is part of a replication or failover group, the workspace is copied to secondary accounts to support business
continuity and disaster recovery.

Replicated workspaces in secondary accounts are read-only. Files can be executed but not modified. When a secondary failover group is promoted
to primary, all contained workspaces become writable.

> **Note:**
>
> Workspaces replication and failover require Business Critical Edition or higher.

For more information, see [Workspaces replication](../../../user-guide/ui-snowsight/workspaces-replication.md).

---
title: Jan 15, 2025: Custom instructions in Cortex Analyst (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-15-cortex-analyst-custom-instructions.md
section: Release Notes
---

# Jan 15, 2025: Custom instructions in Cortex Analyst (*Preview*)

Snowflake is pleased to announce the preview of custom instructions in [Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md).

Custom instructions let you have greater control over SQL generation. Using natural language, you can tell Cortex Analyst
exactly how to generate SQL queries from within your semantic model YAML file. For example, use custom instructions
to tell Cortex Analyst what you mean by *performance* or *financial year*. In this way, you can improve the accuracy of the generated SQL
by incorporating custom logic or additional elements.

For more information, see [Custom instructions in Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst/custom-instructions.md).

---
title: Jan 15, 2025: Optimized COPY and INSERT bulk loads on empty hybrid tables (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-15-ht-optimized-bulk-load.md
section: Release Notes
---

# Jan 15, 2025: Optimized COPY and INSERT bulk loads on empty hybrid tables (*General availability*)

We are pleased to announce the general availability of extended support for optimized bulk loading into hybrid tables.

When a hybrid table is empty, COPY and INSERT INTO … SELECT commands benefit from optimized bulk loading, which is a
fast execution model for inserting data that previously applied only to CREATE HYBRID TABLE … AS SELECT loads.

For more information, see [Loading data](../../../user-guide/tables-hybrid-create.md).

---
title: Jan 15, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-15-dcr.md
section: Release Notes
---

# Jan 15, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 12.5

> **Important:**
>
> If you have enabled automatic updates, you must re-enable automatic updates one time
> by running the following SQL code. Automatic updating should then be enabled for this and future releases.
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
> ```

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Updates to private preview features.

---
title: Jan 16, 2025: Snowsight enhancements to contact email management (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-16-snowsight-contact-email-update.md
section: Release Notes
---

# Jan 16, 2025: Snowsight enhancements to contact email management (*General availability*)

With this release, we are pleased to introduce enhancements to how you manage your contact emails, ensuring you never miss important updates
about your account. When logging into Snowsight, users with the ACCOUNTADMIN or ORGADMIN role will receive a prompt to add any missing
critical contact emails or update outdated ones. Keeping this information updated ensures that users receive all essential notifications,
from security alerts to product updates, without delay. Until all missing critical contacts are provided, users will continue to see the
prompt at the start of their Snowsight sessions.

For details about contact notifications, see [Set up and manage notification contacts for Snowflake](../../../user-guide/ui-snowsight-contacts.md).

---
title: Jan 16, 2026: External lineage (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-16-external-lineage.md
section: Release Notes
---

# Jan 16, 2026: External lineage (*Preview*)

External lineage extends Snowflake’s lineage capabilities by including external data sources and destinations, providing visibility into
data flows across your entire data ecosystem.

External lineage leverages the [OpenLineage](https://openlineage.io) framework, accepting OpenLineage-compatible
events through a REST endpoint. External tools like dbt and Apache Airflow can send lineage metadata to Snowflake,
which then incorporates this information into the native lineage graph displayed in Snowsight.

For more information, see [External lineage](../../../user-guide/external-lineage.md).

---
title: Jan 16, 2026: Sensitive data classification in the Trust Center (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-16-trust-center-sensitive-data-classification.md
section: Release Notes
---

# Jan 16, 2026: Sensitive data classification in the Trust Center (*Preview*)

You can now use the Trust Center to set up and review the results of sensitive data classification.

The Data Security tab in the Trust Center user interface lets you start classifying your sensitive data into semantic categories like
name and age without writing SQL code. When classification completes, use the Data Security tab to find the objects that contain
sensitive data and gain insights about your sensitive data, including the number of objects that might be subject to regulation, and
whether sensitive data is currently protected by a masking policy.

* If you’re new to sensitive data classification, see [Use the Trust Center to set up sensitive data classification](../../../user-guide/classify-ui-trust-center.md).
* If you’re currently using sensitive data classification and want to see your results in the Trust Center, see
  [Use the Trust Center to view classification results](../../../user-guide/classify-results.md).

> **Note:**
>
> Support for sensitive data classification in the Trust Center is being rolled out slowly, and should be available in all accounts by
> February 1, 2026.

---
title: Jan 20, 2025: Snowflake Native Apps with Snowpark Container Services support for AWS PrivateLink (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-20-na-spcs-aws-pl-pupr.md
section: Release Notes
---

# Jan 20, 2025: Snowflake Native Apps with Snowpark Container Services support for AWS PrivateLink (*Preview*)

We are pleased to announce the preview of support for AWS PrivateLink in Snowflake Native Apps with Snowpark Container Services.

See [Understand limitations in the Snowflake Native App Framework](../../../developer-guide/native-apps/limitations.md) for more information.

---
title: Jan 20, 2026: Shared Workspaces (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-20-shared-workspaces.md
section: Release Notes
---

# Jan 20, 2026: Shared Workspaces (*General availability*)

Shared workspaces are now generally available. Shared workspaces introduce collaborative development
directly within Snowsight. Multiple users can now work together on the same set of files and folders in a governed, role-based environment.
This enables teams to collaborate more effectively while maintaining Snowflake’s existing governance and security model.

## Key features

* Create shared workspaces within a selected database and schema for team collaboration.
* Share files or folders from private workspaces into shared workspaces for team access.
* Manage access and permissions through Snowflake roles.
* Work together on shared files and folders with visibility into updates made by other users.

For details, see [Shared workspaces](../../../user-guide/ui-snowsight/workspaces-shared.md).

---
title: Jan 21, 2026: Snowflake OAuth for local applications
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-21-snowflake-oauth-local-applications.md
section: Release Notes
---

# Jan 21, 2026: Snowflake OAuth for local applications

Snowflake OAuth for local applications is now the preferred authentication method for local applications, including desktop applications and
local scripts. It is a strong authentication method that can be implemented without an administrator, requires minimal setup, and doesn’t
require the application to store secrets.

Snowflake OAuth uses a security integration to establish the interface between a client and Snowflake. Every Snowflake account now has a
built-in security integration `SNOWFLAKE$LOCAL_APPLICATION` that creates the interface between local applications and Snowflake. You can
adjust the parameters of the integration to perform administrative tasks like specifying how long OAuth access tokens and
refresh tokens are valid.

For more information, see [Using Snowflake OAuth for local applications](../../../user-guide/oauth-local-applications.md).

---
title: Jan 22, 2026: AI_AGG and AI_SUMMARIZE_AGG (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-22-ai-agg-ai-summarize-agg-ga.md
section: Release Notes
---

# Jan 22, 2026: AI_AGG and AI_SUMMARIZE_AGG (*General availability*)

AI_AGG and AI_SUMMARIZE_AGG, AI-powered aggregation functions that help you to analyze and summarize large volumes
of text data efficiently at scale, are now generally available.

Unlike row-by-row functions, such as AI_COMPLETE, these functions are optimized for set-based aggregation, making them significantly more efficient for mass processing. Based on internal performance testing, AI_AGG delivers up to twice the throughput compared to AI_COMPLETE when aggregating large datasets.

* [AI_AGG](../../../sql-reference/functions/ai_agg.md) reduces a column of text based on a natural language instruction, such as
  extracting common themes or issues across thousands of records.
* [AI_SUMMARIZE_AGG](../../../sql-reference/functions/ai_summarize_agg.md) produces an overall summary of a text column as a whole, such as
  summarizing customer feedback at the product or business level.

Suggested use cases for these AI aggregation functions include:

* Customer feedback and survey analysis: extract top themes, complaints, or praise.
* Support ticket and incident review: summarize frequent issues and outcomes.
* Topic discovery and modeling: automatically surface dominant topics or themes in text collections (e.g., product
  reviews, forum posts) using natural language prompts.
* Executive summaries: generate high-level narratives from large unstructured text sources.

Both functions support GROUP BY, enabling summarization and aggregation by product, region, time period, or customer
segment.

For more information, see [Cortex AI Functions](../../../user-guide/snowflake-cortex/aisql.md).

---
title: Jan 22, 2026: AI_FILTER for filtering with natural language predicates (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-22-ai-filter-ga.md
section: Release Notes
---

# Jan 22, 2026: AI_FILTER for filtering with natural language predicates (*General availability*)

The AI_FILTER function, now generally available, provides a new AI-powered semantic filtering capability that lets you express complex filters in plain, natural language. No rigid keywords, regex, or brittle rules are required.

With AI_FILTER, business users and data teams can filter data based on meaning and intent, not just text matches. This
unlocks faster analysis and more intuitive queries, and dramatically reduces the effort required to translate business
questions into SQL logic.

AI_FILTER use cases include:

* Customer feedback analysis: Filter reviews, surveys, or support tickets by sentiment or intent (for example,
  “customers who were frustrated with delivery delays”).
* Trust & safety and moderation: Identify content that violates policies or expresses risky behavior without enumerating
  endless keywords.
* Sales and CRM insights: Surface deal notes or call transcripts that indicate buying intent, objections, or churn risk.
* Marketing and brand monitoring: Find mentions that express positive or negative brand perception, campaign reactions,
  or competitor comparisons.
* Operational review and quality analysis: Filter internal reports or incident notes based on root cause, severity, or
  outcome described in natural language.

For example, the following query filters the REVIEWS table to include only reviews where the reviewer enjoyed the restaurant:

```sqlexample
SELECT * FROM reviews
   WHERE AI_FILTER(PROMPT('The reviewer enjoyed the restaurant: {0}', review));
```

For more information, see [AI_FILTER](../../../sql-reference/functions/ai_filter.md).

---
title: Jan 22, 2026: Document Processing Playground (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-22-document-processing-playground.md
section: Release Notes
---

# Jan 22, 2026: Document Processing Playground (*General availability*)

The Document Processing Playground helps both technical and business users explore Snowflake’s AI-powered document processing capabilities.

The playground provides a user interface where you can complete the following tasks:

* Upload documents from a stage and experiment with the AI_EXTRACT and AI_PARSE_DOCUMENT functions
* Ask questions to extract information using AI_EXTRACT
* Extract lists and tables
* Preview the layout and OCR results generated by AI_PARSE_DOCUMENT
* Copy generated SQL queries for use in workspaces
* Copy generated Python code for use in your Notebooks

For more information, see [Document Processing Playground](../../../user-guide/snowflake-cortex/document-processing-playground.md).

---
title: Jan 22, 2026: European Union categories for sensitive data classification
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-22-sensitive-data-classification-eu-india.md
section: Release Notes
---

# Jan 22, 2026: European Union categories for sensitive data classification

Snowflake now classifies your sensitive data into the following categories when it finds information related to the European Union:

* DRIVERS_LICENSE
* NATIONAL_IDENTIFIER
* PASSPORT
* PAYMENT_CARD
* TAX_IDENTIFIER

With these new native categories, Snowflake can automatically identify which data is subject to General Data Protection Regulation (GDPR).

For a list of all native categories, see [Native semantic categories of sensitive data classification](../../../user-guide/classify-native.md).

---
title: Jan 23, 2025: Document AI on GCP (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-23-document-ai.md
section: Release Notes
---

# Jan 23, 2025: Document AI on GCP (*General availability*)

Snowflake is pleased to announce the general availability of Document AI on
Google Cloud Platform (GCP).

---
title: Jan 23, 2026: Consumer-controlled maintenance policies for Snowflake Native Apps (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-23-native-apps-consumer-maintenance-policies.md
section: Release Notes
---

# Jan 23, 2026: Consumer-controlled maintenance policies for Snowflake Native Apps (*Preview*)

Consumer-controlled maintenance policies are now in public preview for Snowflake Native Apps.

With Snowflake Native Apps, consumers can set a maintenance policy for an upgrade so that apps don’t
update during specific time periods. When an upgrade is ready and a new release
directive is set, the upgrade begins. However, if the consumer has set a
maintenance policy, the upgrade is delayed until the start date and time
specified in the maintenance policy.

For more information, see [Consumer-controlled maintenance policies](../../../developer-guide/native-apps/consumer-maintenance-policies.md).

To create and set a maintenance policy, the consumer uses the following SQL commands:

* [CREATE MAINTENANCE POLICY](../../../sql-reference/sql/create-maintenance-policy.md): Creates a new maintenance policy. The customer sets a schedule for the maintenance policy to allow upgrades to begin at a specific time.
* [ALTER MAINTENANCE POLICY](../../../sql-reference/sql/alter-maintenance-policy.md): Applies or removes a maintenance policy.

---
title: Jan 23, 2026: Malicious IP Protection updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-23-malicious-ip-protection.md
section: Release Notes
---

# Jan 23, 2026: Malicious IP Protection updates

These updates enhance Malicious IP Protection to provide visibility of blocked login attempts and the option to disable blocking for IP
addresses that are categorized as low-risk.

View blocked login attempts:
:   You can now use the new LOGIN_DETAILS column in the Account Usage [LOGIN_HISTORY view](../../../sql-reference/account-usage/login_history.md) to see details of
    network access attempts that the Malicious IP Protection service has blocked.

Manage opt-out for low-risk categories:
:   If you determine that blocking certain low-risk categories blocks legitimate users, you can opt out of blocking for specific
    categories by using the new [SYSTEM$OPT_OUT_MALICIOUS_IP_PROTECTION_BY_CATEGORY](../../../sql-reference/functions/system_opt_out_malicious_ip_protection_by_category.md) function.

For more information, see [Malicious IP Protection](../../../user-guide/malicious-ip-protection.md).

---
title: Jan 23, 2026: Network policies for External OAuth
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-23-network-policies-external-oauth.md
section: Release Notes
---

# Jan 23, 2026: Network policies for External OAuth

You can now associate a network policy with an External OAuth security integration to restrict network traffic from the OAuth client to Snowflake as the resource server. This network policy governs login requests and queries against Snowflake.

For more information, see [Restricting network traffic for External OAuth](../../../user-guide/oauth-ext-overview.md).

---
title: Jan 23, 2026: Organization users (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-23-org-users-ga.md
section: Release Notes
---

# Jan 23, 2026: Organization users (*General availability*)

Multi-account organizations that need the same person to be a user in more than one account
can now create an organization user for the person. Each organization user acts as a global user entity that can be imported into regular
accounts by account administrators, simplifying the process of creating a user object for the same person in multiple accounts.

Organization users are grouped into logical units called organization user groups. When an account administrator imports an organization user
group into a regular account, all of its organization users are added to the account. The organization user group becomes an access control
role in the account, allowing you to have consistent roles across the organization.

If an existing user needs to be an organization user, you can import the organization group into each account, then link the
existing local user object to the new organization user.

Organization users and organization user groups require an organization account.

For more information, see [Organization users](../../../user-guide/organization-users.md).

---
title: Jan 23, 2026: Storage lifecycle policies: Expanded support
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-23-storage-lifecycle-policies-azure.md
section: Release Notes
---

# Jan 23, 2026: Storage lifecycle policies: Expanded support

Storage lifecycle policies now support additional cloud providers and regions:

* **Azure COOL tier**: You can now create archival policies that use the COOL storage tier
  on accounts hosted on Microsoft Azure.
* **Government regions**: You can now use storage lifecycle policies in
  Amazon Web Services (AWS) and Microsoft Azure government regions.

With this update, you can use the COOL archive tier on both AWS and Azure. The COLD archive tier
continues to support AWS only.

For more information, see [Storage lifecycle policies](../../../user-guide/storage-management/storage-lifecycle-policies.md).

---
title: Jan 26, 2026: Extract images from documents using AI_PARSE_DOCUMENT (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-26-ai-parse-document-images-preview.md
section: Release Notes
---

# Jan 26, 2026: Extract images from documents using AI_PARSE_DOCUMENT (Preview)

The AI_PARSE_DOCUMENT AI Function can now extract images embedded in PDF and Word documents, alongside text, data, and
layout elements. Extracted images can be written to stages or
passed directly to other Cortex AI Functions for further analysis.

This new capability unlocks a number of advanced use cases, including:

* *Enrich data*: Extract images from documents to add visual context for deeper insights.
* *Multimodal RAG*: Combine images and text for retrieval-augmented generation (RAG) to improve model responses.
* *Image classification*: Use extracted images with AI_EXTRACT or AI_COMPLETE for automatic tagging and analysis.
* *Knowledge bases*: Build richer repositories by including both text and images for better search and reasoning.
* *Compliance*: Extract and analyze images (e.g., charts, signatures) for regulatory and audit workflows.

There is no additional cost for image extraction beyond the standard page-based billing for AI_PARSE_DOCUMENT.

For more information, see [Cortex AI Functions: Image extraction with AI_PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/image-extraction.md).

---
title: Jan 26, 2026: Specify a dynamic task configuration with EXECUTE TASK
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-26-dynamic-task-config.md
section: Release Notes
---

# Jan 26, 2026: Specify a dynamic task configuration with EXECUTE TASK

You can now dynamically override a task configuration for a single execution with the
EXECUTE TASK command. Use the `USING CONFIG` clause to run ad-hoc executions
without changing the task definition.

For more information, see [EXECUTE TASK](../../../sql-reference/sql/execute-task.md).

---
title: Jan 27, 2025: Organization account (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-27-org-account.md
section: Release Notes
---

# Jan 27, 2025: Organization account (*General availability*)

With this release, we are pleased to announce the general availability of the organization account, which is a new type of account that
global organization administrators use to perform tasks that affect the entire organization. In the future, global
organization administrators will also use the organization account to manage organization-level objects across all of their Snowflake
accounts.

Before the introduction of the organization account, administrators used the ORGADMIN role to perform organization-level tasks. After
advance notice is given, the ORGADMIN role will be phased out for multi-account organizations in a future release. Organization
administrators are strongly encouraged to start using the new GLOBALORGADMIN role in the organization account to perform organization-level
tasks.

For more information, see [Organization accounts](../../../user-guide/organization-accounts.md).

---
title: Jan 27, 2026: Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-27-iceberg-enforce-access-policies-on-tables-queried-from-apache-spark.md
section: Release Notes
---

# Jan 27, 2026: Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™

You can now enforce the following policies on Apache Iceberg tables that you query from Apache Spark™ through Snowflake
Horizon Catalog:

* [Masking policies](../../../user-guide/security-column-intro.md)
* [Tag-based masking policies](../../../user-guide/tag-based-masking-policies.md)
* [Row access policies](../../../user-guide/security-row-intro.md)

To enforce these policies, you first define the policy in Snowflake as you normally would, and then you configure Spark to enforce the
policy. This configuration uses version 3.1.6 of the Snowflake Connector for Spark to connect to Snowflake and evaluate policies.

For more information, see [Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](../../../user-guide/tables-iceberg-query-using-external-query-engine-snowflake-horizon-enforce-access-policies.md).

---
title: Jan 27, 2026: Estimate token usage with AI_COUNT_TOKENS (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-27-ai-count-tokens-function-ga.md
section: Release Notes
---

# Jan 27, 2026: Estimate token usage with AI_COUNT_TOKENS (*General availability*)

AI_COUNT_TOKENS, a Cortex AI helper function that helps users estimate token consumption and understand how prompt
context impacts cost, is now generally available. AI_COUNT_TOKENS takes into account the function, the LLM model (if
applicable), and any additional inputs that affect token count, such as categories/labels for classification tasks.

In general, token usage increases as prompts become more descriptive and complex. Minimal prompts with limited context
consume fewer tokens, while deeper context, task descriptions, and examples increase token counts. With AI_COUNT_TOKENS,
users can evaluate how these tradeoffs affect token usage and therefore cost while developing their AI workloads.

This capability is especially useful for establishing best practices around:

* How much context to include in prompts
* When richer prompts meaningfully improve accuracy
* When examples are worth the additional token cost
* How best to standardize prompt design across teams and workloads

The supported functions include:

* [AI_CLASSIFY](../../../sql-reference/functions/ai_classify.md)
* [AI_COMPLETE](../../../sql-reference/functions/ai_complete.md)
* [AI_EMBED](../../../sql-reference/functions/ai_embed.md)
* [AI_SENTIMENT](../../../sql-reference/functions/ai_sentiment.md)
* [AI_SIMILARITY](../../../sql-reference/functions/ai_similarity.md)
* [AI_TRANSLATE](../../../sql-reference/functions/ai_translate.md)

For more information, see [AI_COUNT_TOKENS](../../../sql-reference/functions/ai_count_tokens.md).

---
title: Jan 28, 2026: Fine-tuning arctic-extract models (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-28-fine-tuning-arctic-extract-models.md
section: Release Notes
---

# Jan 28, 2026: Fine-tuning `arctic-extract` models (*Preview*)

Snowflake Cortex now supports fine-tuning `arctic-extract` models to improve document extraction accuracy for your specific
formats and domains. Fine-tuning adapts the model to your data, delivering more consistent, structured outputs than zero-shot
extraction. Fine-tuned models integrate seamlessly with AI_EXTRACT for production workflows.

For more information, see [Fine-tuning arctic-extract models](../../../user-guide/snowflake-cortex/arctic-extract-finetuning.md).

---
title: Jan 28, 2026: Private connectivity for TSS on Google Cloud (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-28-tss-private-connectivity-gcp.md
section: Release Notes
---

# Jan 28, 2026: Private connectivity for TSS on Google Cloud (*General availability*)

Support for private connectivity to Tri-Secret Secure, which was previously supported on Amazon Web Services and Microsoft Azure, is now generally available for Business
Critical accounts on Google Cloud.

For more information, see [Understanding Tri-Secret Secure self-service with private connectivity](../../../user-guide/security-encryption-tss-self-serve-private.md).

---
title: Jan 29, 2026: Apache DataSketches functions (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-29-datasketches-functions-ga.md
section: Release Notes
---

# Jan 29, 2026: Apache DataSketches functions (*General availability*)

The following Apache Datasketches functions are now generally available and are no longer in Preview:

| Function subcategory | New function | Description |
| --- | --- | --- |
| Cardinality estimation | [DATASKETCHES_HLL](../../../sql-reference/functions/datasketches_hll.md) | Returns an approximation of the distinct cardinality of the input (that is, `DATASKETCHES_HLL(col1)` returns an approximation of `COUNT(DISTINCT col1)`). |
| Cardinality estimation | [DATASKETCHES_HLL_ACCUMULATE](../../../sql-reference/functions/datasketches_hll_accumulate.md) | Returns the sketch at the end of aggregation. |
| Cardinality estimation | [DATASKETCHES_HLL_COMBINE](../../../sql-reference/functions/datasketches_hll_combine.md) | Combines (merges) input sketches into a single output sketch. |
| Cardinality estimation | [DATASKETCHES_HLL_ESTIMATE](../../../sql-reference/functions/datasketches_hll_estimate.md) | Returns the cardinality estimate for the given sketch. |

---
title: Jan 29, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-29-dcr.md
section: Release Notes
---

# Jan 29, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 12.8

> **Important:**
>
> If you are running a release version earlier than 12.5 and have enabled automatic updates, you must re-enable automatic updates
> by running the following SQL code. This re-enables automatic updates for all future releases.
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
> ```

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Updates to private preview features.

---
title: Jan 30, 2026: New regions
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-30-new-regions.md
section: Release Notes
---

# Jan 30, 2026: New regions

The following new regions are now available:

| Cloud platform | Region |
| --- | --- |
| Microsoft Azure | East US (Virginia) |
| Amazon Web Services (AWS) | Middle East (UAE) |
| Google Cloud Platform (GCP) | Australia Southeast 2 (Melbourne) |

For more information, see [Supported cloud regions](../../../user-guide/intro-regions.md).

---
title: Jan 30, 2026: Support for bi-directional data access with Microsoft Fabric (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-01-30-iceberg-microsoft-fabric-bidirectional-data-access-ga.md
section: Release Notes
---

# Jan 30, 2026: Support for bi-directional data access with Microsoft Fabric (*General availability*)

Querying Apache Iceberg™ tables between Snowflake and Microsoft Fabric in both directions by using a REST API endpoint is now generally available:

* Query Snowflake-managed Iceberg tables from Fabric. To query Snowflake-managed Iceberg tables in Fabric, connect a
  Snowflake database to Fabric. You can select an existing database or create a new one. After connecting, Fabric creates an item that lets
  you access your Snowflake-managed tables. For more information, see [Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric](../../../user-guide/tables-iceberg-query-using-microsoft-fabric.md).
* Query OneLake tables with Iceberg metadata from Snowflake. To query Fabric Iceberg tables registered in Snowflake, configure a REST
  catalog integration for OneLake table APIs, which provides table information from Fabric.

  For more information, see the following topics:

  > + [Configure a catalog integration for OneLake REST](../../../user-guide/tables-iceberg-configure-catalog-integration-rest-onelake.md)
  > + [Overview of OneLake table APIs](https://learn.microsoft.com/fabric/onelake/table-apis/table-apis-overview) in the Microsoft Fabric documentation
  > + [Getting started with OneLake table APIs for Iceberg](https://learn.microsoft.com/en-us/fabric/onelake/table-apis/iceberg-table-apis-get-started#snowflake)

---
title: Jan 31, 2025: Support for future grants in Streamlit in Snowflake (General Availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-01-31-sis.md
section: Release Notes
---

# Jan 31, 2025: Support for future grants in Streamlit in Snowflake (General Availability)

With this release, Snowflake is pleased to announce the general availability of future grants in Streamlit in Snowflake.

---
title: January 03-04, 2024 — 8.0 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_00.md
section: Release Notes
---

# January 03-04, 2024 — 8.0 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Extensibility Updates

### Account Usage: New EXTERNAL_ACCESS_HISTORY View

In a recent release, the [EXTERNAL_ACCESS_HISTORY view](../../sql-reference/account-usage/external_access_history.md) was added to the account usage views (in the SNOWFLAKE shared
database) to provide information about access to external network locations from procedure and UDF handlers. In the
EXTERNAL_ACCESS_HISTORY view, each row represents a query made to a procedure or UDF that makes external access
requests.

## Data Collaboration Updates

### Organization Usage: New LISTING_AUTO_FULFILLMENT_USAGE_HISTORY View

With this release, we are pleased to announce the general availability of the [LISTING_AUTO_FULFILLMENT_USAGE_HISTORY view](../../sql-reference/organization-usage/listing_auto_fulfillment_usage_history.md) view
added to the organization usage schema (in the SNOWFLAKE shared database) to provide information to help manage the cost of
[Cross-Cloud Auto-Fulfillment](../../collaboration/provider-listings-auto-fulfillment.md) in your organization.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 02-Jan-24 |
| *Organization Usage: New LISTING_AUTO_FULFILLMENT_USAGE_HISTORY View* | **Added** to *Data Collaboration Updates* | 03-Jan-24 |

---
title: January 08-10, 2024 — 8.1 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_01.md
section: Release Notes
---

# January 08-10, 2024 — 8.1 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### EXECUTE IMMEDIATE FROM File — *General Availability*

With this release, we are pleased to announce the general availability of the EXECUTE IMMEDIATE FROM command. This command executes the SQL statements in a file on a stage. The file must contain syntactically valid SQL statements.

This feature provides a mechanism to control the deployment and management of your Snowflake objects and code. You can use the EXECUTE IMMEDIATE FROM command to execute scripts in any session.

For more information, see [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).

## SQL Updates

### CREATE <object> … CLONE command: New parameter

With Time Travel, you can create a clone of a database, schema, or table at a specified point in the object’s history. However, if a database or schema contains any child objects that have a shorter [data retention period](../../user-guide/data-time-travel.md) than the parent object being cloned, the cloning operation fails if the child object’s historical data has been purged from Time Travel. The IGNORE TABLES WITH INSUFFICIENT DATA RETENTION parameter of the [CREATE <object> … CLONE](../../sql-reference/sql/create-clone.md) command enables cloning a database or schema by ignoring those child tables that no longer have historical data available in Time Travel.

For more information, see [Child Objects and Data Retention Time](../../user-guide/object-clone.md).

## New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| System Functions (Information) | [SYSTEM$CLIENT_VERSION_INFO](../../sql-reference/functions/system_client_version_info.md) | Returns version information for Snowflake clients and drivers. |

## Extensibility Updates

### Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*

With this release, we are pleased to announce the general availability of support for Python 3.11 in Snowpark Python, Python UDFs, Python UDTFs and Python stored procedures.

For more information, see

* [Setting up your development environment for Snowpark Python](../../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 08-Jan-24 |
| *System function: New SYSTEM$CLIENT_VERSION_INFO function* | **Added** to *New SQL Functions* | 10-Jan-24 |

---
title: January 15-17, 2024 — 8.2 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_02.md
section: Release Notes
---

# January 15-17, 2024 — 8.2 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Behavior Change Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_01](../bcr-bundles/2024_01_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_08](../bcr-bundles/2023_08_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_07](../bcr-bundles/2023_07_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for February 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New Features

### Access History: Support added for stored procedure ancestor queries — Preview

With this release, Snowflake is pleased to announce the ability to track the chain of queries that call a stored procedure by using the
`parent_query_id` and `root_query_id` columns. These columns allow you to see the query ID that performs a read or write
operation on another object, and the query ID for the query that calls a stored procedure, respectively. The columns support calling a
stored procedure directly and nested stored procedure calls, such as when one stored procedure calls another stored procedure.

For details, see [Ancestor queries in Access History](../../user-guide/access-history.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 15-Jan-24 |
| *New Features* | Access History: Support added for stored procedure ancestor queries — Preview | 15-Jan-24 |

---
title: January 18, 2024 — Snowflake Native Apps Framework Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-01-18.md
section: Release Notes
---

# January 18, 2024 — Snowflake Native Apps Framework Release Notes

Snowflake Native App Framework: Support for Azure — Preview

With this release, we are pleased to announce the preview of support for the Snowflake Native App Framework on Azure.

For more information, see [About the Snowflake Native App Framework](../../../developer-guide/native-apps/native-apps-about.md).

---
title: January 2023
source: https://docs.snowflake.com/en/release-notes/2023-01.md
section: Release Notes
---

# January 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### OBJECT_DEPENDENCIES View: Support Added for Shared Objects

With this release, Snowflake is pleased to announce the support for shared objects for the Account Usage OBJECT_DEPENDENCIES view in the
shared SNOWFLAKE database. For example, when a consumer creates a view from a shared table, the view is dependent on the table the provider
shares. The dependencies related to data sharing enable data officers to ensure greater data integrity, comply with each regulatory standard
more fully, and generate more detailed impact analysis.

For details, refer to [Object Dependencies](../user-guide/object-dependencies.md) and the [Usage Notes](../sql-reference/account-usage/object_dependencies.md).

### Memoizable Functions — *Preview*

With this release, Snowflake is pleased to announce the preview of memoizable functions. A memoizable function caches the result of calling a
user-defined function (UDF) and then returns the cached result when the output is needed at a later time. Using memoizable
functions improves performance for complex queries, such as multiple column lookups in mapping tables referenced within a row access
policy or masking policy. Currently, memoizable functions are available for scalar SQL UDFs only.

For details, refer to [Memoizable UDFs](../developer-guide/udf/sql/udf-sql-scalar-functions.md).

### Working with Amazon S3-compatible Storage — *Preview*

With this release, Snowflake is pleased to announce the preview of support for accessing data in Amazon S3-compatible storage. You can create external
stages and external tables on software and devices, on premises or in a private cloud, that is highly compliant with Amazon S3 API. By using
this feature, you can manage, govern, and analyze your data more easily and efficiently, regardless of where the data is physically stored.

Note that Amazon S3-compatible endpoints are not automatically enabled for all accounts. To request this feature, contact the Snowflake
account team or [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). Make sure that you verify the endpoints by using our
[public test suite](https://github.com/snowflakedb/snowflake-s3compat-api-test-suite) (in GitHub) before sending the request.

For details, refer to [Work with Amazon S3-compatible storage](../user-guide/data-load-s3-compatible-storage.md).

### Account Usage: New PASSWORD_POLICIES View

With this release, Snowflake adds a new view, PASSWORD_POLICIES, in the Account Usage schema of the shared SNOWFLAKE database. This view
returns one row for each password policy in the account. Note that access to this view can be granted through the Snowflake-provided
SECURITY_VIEWER database role.

For details, refer to [PASSWORD_POLICIES view](../sql-reference/account-usage/password_policies.md) and [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md).

### Account Usage: New SESSION_POLICIES View

With this release, Snowflake adds a new view, SESSION_POLICIES, in the Account Usage schema of the shared SNOWFLAKE database. This view
returns one row for each session policy in the account. Note that access to this view can be granted through the Snowflake-provided
SECURITY_VIEWER database role.

For details, refer to [SESSION_POLICIES view](../sql-reference/account-usage/session_policies.md) and [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md).

## SQL Updates

### Setting a Snowflake Scripting Variable to the Scalar Return Value from a Stored Procedure

With this release, you can use the new `INTO :snowflake_scripting_variable` clause in a [CALL](../sql-reference/sql/call.md) statement
to capture a scalar return value from a stored procedure in a Snowflake Scripting variable. For example:

```sqlexample
DECLARE
  ret1 NUMBER;
BEGIN
  CALL my_procedure('Manitoba', 127.4) into :ret1;
  RETURN ret1;
END;
```

Note: If you are using SnowSQL or the classic web interface, use this example instead (refer to
[Using Snowflake Scripting in Snowflake CLI, SnowSQL, and Python Connector](../developer-guide/snowflake-scripting/running-examples.md)):

```sqlexample
EXECUTE IMMEDIATE $$
DECLARE
  ret1 NUMBER;
BEGIN
  CALL my_procedure('Manitoba', 127.4) into :ret1;
  RETURN ret1;
END;
$$
;
```

### New SQL Functions

The following function(s) were introduced in recent releases:

| Function Category | New Function | Description |
| --- | --- | --- |
| Aggregate Functions (General) | [MIN_BY](../sql-reference/functions/min_by.md) and [MAX_BY](../sql-reference/functions/max_by.md) | Finds the row(s) containing the minimum or maximum value for a specified column and returns the value of a second specified column for that row. |

## Data Governance Updates

### Column Lineage — *General Availability*

With this release, Snowflake is pleased to announce the general availability of column lineage. Column lineage (i.e. Access History for
columns) extends the `objects_modified` column in the Account Usage ACCESS_HISTORY view to specify how data flows from the source
column to the target column in a write operation. Snowflake tracks the data from the source columns through all subsequent table objects
that reference data from the source columns (e.g. INSERT, MERGE, CTAS).

This feature was announced in preview in October 2022. For details, refer to [Access History](../user-guide/access-history.md) and the
[ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md).

## Web Interface Updates

### Snowsight Worksheet Version History Retention

To improve Snowsight performance, worksheet version history older than 90 days will be removed on a regular basis.
The stored query results for those versions will also be removed.

---
title: January 22-23, 2024 — 8.3 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_03.md
section: Release Notes
---

# January 22-23, 2024 — 8.3 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Security Updates

### Network rules — *General Availability*

With this release, we are pleased to announce the general availability of network rules, which group related network identifiers
into logical units. When a Snowflake feature needs to restrict network traffic based on the origin or destination of a request,
it can allow or block a network rule that contains the identifiers that should be permitted or denied.

Network rules make possible the following features:

* Enhanced network security using [network policies](../../user-guide/network-policies.md). All new network policies should use
  network rules.
* [External network access](../../developer-guide/external-network-access/external-network-access-overview.md).

This release includes a new Snowsight page for network policies, which includes the ability to manage the lifecycle of a network
rule. Using SQL to work with network rules is generally available, but this new Snowsight page is a Preview feature.

### Enhanced network security — *General Availability*

With this release, we are pleased to announce the general availability of enhanced security when using network policies to
restrict access to Snowflake. When combined with network rules, network policies can now restrict access based on the identifier
of an AWS S3 endpoint or Azure private endpoint.

### Network isolation to internal stages using AWS PrivateLink — *General Availability*

With this release, we are pleased to announce the general availability of the ability to isolate network traffic to Snowflake
internal stages when connecting to them over AWS PrivateLink for Amazon S3. Snowflake recommends this approach for organizations
that use AWS PrivateLink to access the internal stages of multiple Snowflake accounts.

Benefits of isolating private connectivity traffic include:

* Simplified DNS management.
* Support for charging back costs to a specific Snowflake account.
* Support for implementing different security requirements for each Snowflake account.

For more details, see [Accessing Internal stages with dedicated interface endpoints](../../user-guide/private-internal-stages-aws.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 22-Jan-24 |

---
title: January 25, 2024 — Streamlit in Snowflake Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-01-25.md
section: Release Notes
---

# January 25, 2024 — Streamlit in Snowflake Release Notes

Streamlit in Snowflake: Support for Azure — General Availability

With this release, we are pleased to announce the general availability of Streamlit in Snowflake in Azure, which was previously available as a preview feature.

For more information, see [About Streamlit in Snowflake](../../../developer-guide/streamlit/about-streamlit.md).

---
title: January 29, 2024 — Snowflake Google connectors
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-01-29.md
section: Release Notes
---

# January 29, 2024 — Snowflake Google connectors

## Snowflake Connector for Google Analytics Raw Data

With this release, we are pleased to announce the preview of Snowflake Connector for Google Analytics Raw Data.
Google Analytics is a cloud-based tool that provides insight into how users interact with your website.
You can use it to analyze user actions, track the number of visitors and page views, and analyze bounce rates for a page.

The Snowflake Connector for Google Analytics Raw Data enables you to automatically ingest event-level Google Analytics 4 (GA4) data into your Snowflake account.

For more details, see [Snowflake Connector for Google Analytics Raw Data](../../../connectors/google/gard/gard-connector-about.md).

## Snowflake Connector for Google Analytics Aggregate Data

With this release, we are pleased to announce the preview of Snowflake Connector for Google Analytics Aggregate Data.

The Snowflake Connector for Google Analytics Aggregate Data enables you to automatically ingest Google Analytics 4 (GA4) data into your Snowflake account.
The connector extracts aggregated data using the [GA4 Reporting API](https://developers.google.com/analytics/devguides/reporting/data/v1).

For more details, see [Snowflake Connector for Google Analytics Aggregate Data](https://other-docs.snowflake.com/connectors/google/gaad/gaad-connector-about.html).

Release notes:

* [Snowflake Connector for Google Analytics Aggregate Data release notes](../../connectors/gaad.md)
* [Snowflake Connector for Google Analytics Raw Data release notes](../../connectors/gard.md)

---
title: January 29-30, 2024 — 8.4 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_04.md
section: Release Notes
---

# January 29-30, 2024 — 8.4 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Security Updates

### Authentication enhancements — *Preview*

With this release, we are pleased to announce the preview of several enhancements related to authentication, including:

* Authentication policies.
* Identifier-first login flow.
* Support for implementing federated authentication using multiple identity providers.
* New SAML2 security integration parameters for federated authentication.

#### Authentication policies

Authentication policies provide control over how a client or user authenticates. They can be used to restrict logins based
on the type of client trying to connect (for example, allowing Snowsight while blocking SnowSQL), as well as regulating which
authentication methods can be used (for example, allowing passwords while blocking key pair authentication).

For more information, see [Authentication policies](../../user-guide/authentication-policies.md).

#### Identifier-first login flow

The identifier-first login flow first prompts the user for a username or email address before presenting authentication options.
These authentication options are based on rules defined in authentication policies and SAML2 security integrations. For example,
the combination of an authentication policy and the identifier-first login flow can hide the password option from a user who
needs to be authenticating with an identity provider, which reduces confusion and improves the user experience.

For more information about this feature and how to enable it, see [Identifier-first login](../../user-guide/identifier-first-login.md).

#### Multiple identity providers support

Snowflake now supports multiple identity providers, which allows different users to authenticate with different identity providers.
This feature requires the identifier-first login flow to be enabled.

For more information, see [Using multiple identity providers for federated authentication](../../user-guide/admin-security-fed-auth-security-integration-multiple.md).

#### New properties for SAML2 security integrations

A SAML2 security integration for a federated authentication configuration can include two new properties:

* ALLOWED_USER_DOMAINS
* ALLOWED_EMAIL_PATTERNS

When the user logs in, the user’s email address must match the values specified in these properties in order to authenticate with
the identity provider associated with the security integration.

This feature requires the identifier-first login flow to be enabled.

For more information, see [CREATE SECURITY INTEGRATION (SAML2)](../../sql-reference/sql/create-security-integration-saml2.md).

## Virtual Warehouse Updates

### Larger warehouses — *General Availability in Microsoft Azure Regions*

With this release, we are pleased to announce the general availability of larger (5X-LARGE and 6X-LARGE) warehouses in
Microsoft Azure regions, excluding Azure Government regions.

Before provisioning a 5X-LARGE or 6X-LARGE warehouse, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

For more information, see [Overview of warehouses](../../user-guide/warehouses-overview.md).

## Extensibility Updates

### External network access — *Preview*

With this release, we are pleased to announce the addition of preview support for external network access on Google Cloud.
You can use external access to access network locations external to Snowflake from within procedure and UDF handler code.
In addition to AWS and Azure, this preview is now available on GCP except in the Gov region.

When setting up external network access, you create a network rule that represents the external network location. If your
handler code will need to authenticate with the external location, you create a secret containing the credentials needed.
In handler code, you can use APIs to retrieve credential values from the secret.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

### Java 17 support — *Preview*

For more information on Java support, see [Snowflake Java Runtime Support](../../developer-guide/java-runtime-support-policy.md).

## Data Loading / Unloading Updates

### Snowpipe update: a new pipe status

With this release, a new pipe status STOPPED_BY_SNOWFLAKE_ADMIN is available in the output of SYSTEM$PIPE_STATUS function. The pipe can only be set to this state by [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). When a pipe is in this state, it means the pipe will not accept new files for ingestion.

For more information, see [SYSTEM$PIPE_STATUS](../../sql-reference/functions/system_pipe_status.md).

## Data Pipeline Updates

### Automatic task graph retry — *General Availability*

With this release, we are pleased to announce the general availability of automatic task graph retry. If any task graphs complete in a FAILED state, Snowflake can automatically retry the task graphs. This feature is disabled by default. To enable this feature, you need to set `AUTO_RETRY_ATTEMPTS` to a value greater than `0` on the root task of a task graph.

For more information, see [CREATE TASK](../../sql-reference/sql/create-task.md) and [ALTER TASK](../../sql-reference/sql/alter-task.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 29-Jan-24 |
| *Snowpipe update: a new pipe status* | **Added** to *Data Loading / Unloading Updates* | 30-Jan-24 |
| *Automatic task graph retry* | **Added** to *Data Pipeline Updates* | 31-Jan-24 |

---
title: January 30 - February 1, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-01-30.md
section: Release Notes
---

# January 30 - February 1, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Creating a table when loading a file in Snowsight —– *General Availability*

With this release, we are pleased to announce the general availability of creating a new table when loading a file in Snowsight.
Snowsight uses the [INFER_SCHEMA](../../../sql-reference/functions/infer_schema.md) table function to automatically detect the file metadata schema,
retrieve the column definitions, and generate a new table. This feature doesn’t support XML files.

For more information, see [Create a new table using Snowsight](../../../user-guide/data-load-web-ui.md).

---
title: January 31, 2024 — Snowflake Native Apps Framework Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-01-31.md
section: Release Notes
---

# January 31, 2024 — Snowflake Native Apps Framework Release Notes

Snowflake Native App Framework: General availability on AWS and Azure.

We are pleased to announce the general availability of the Snowflake Native App Framework on
AWS and Azure. The Snowflake Native App Framework allows you to create data applications
that leverage core Snowflake functionality. With a Snowflake Native App you can:

* Expand the capabilities of other Snowflake features by sharing data and related business logic
  with other Snowflake accounts. The business logic of an application can include a Streamlit app,
  stored procedures, and functions written using Snowpark API, JavaScript, and SQL.
* Share an application with consumers through listings. A listing can be either free or paid.
  You can distribute and monetize your apps in the Snowflake Marketplace or distribute them to specific
  consumers using private listings.
* Include rich visualizations in your app using Streamlit.

For more information, see [About the Snowflake Native App Framework](../../../developer-guide/native-apps/native-apps-about.md).

---
title: Java and Python UDFs and stored procedures: Changes to handling of // when resolving file paths in file access APIs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1810.md
section: Release Notes
---

# Java and Python UDFs and stored procedures: Changes to handling of `//` when resolving file paths in file access APIs

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

Snowflake currently removes `//` when you provide it between the stage and path name. However this is ambiguous because cloud storage allows forward slashes (`/`) in path names. The original goal was to help users who mis-concatenate their names (for example, `build_scoped_file_url(@stage, '/file.txt')`) which would result in `@stage//file.txt` when the user really wanted `@stage/file.txt`. When this behavior change bundle is enabled, Snowflake removes this behavior to avoid ambiguity.

So with this behavior change, resolutions to `@stage//file.txt` will fail unless `/file.txt` exists on cloud storage.

Before the change:
:   For files resolved inside a UDF or stored procedure:

    * `@stage//file.txt` resolves to `stage-location/file.txt`
    * `build_scoped_url(@stage, '//file.txt')` resolves to `stage-location/file.txt`

After the change:
:   For files resolved inside a UDF or stored procedure:

    * `@stage//file.txt` resolves to `stage-location//file.txt`
    * `build_scoped_url(@stage, '//file.txt')` resolves to `stage-location//file.txt`

Ref: 1810

---
title: JDBC Driver release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc.md
section: Release Notes
---

# JDBC Driver release notes

The JDBC Driver release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](jdbc-2026.md)
* [2025 releases](jdbc-2025.md)
* [2024 releases](jdbc-2024.md)
* [2023 releases](jdbc-2023.md)
* [2022 releases](jdbc-2022.md)

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

---
title: JDBC Driver release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc-2022.md
section: Release Notes
---

# JDBC Driver release notes for 2022

This article contains the release notes for the JDBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for JDBC Driver updates.

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

## Version 3.13.26 (December 14, 2022)

### New features and updates

* Upgraded the arrow library from version 9.0.0 to 10.0.1.
* Relocated files in `META-INF/versions` to `META-INF/versions/<version_number>/net/snowflake/client/jdbc/internal`.
* Added the getNano() and getOffset() methods to the SnowflakeTimeWithTimezone object to return the number of nanoseconds and the time zone offset, respectively.

## Version 3.13.25 (November 16, 2022)

### BCR (Behavior Change Release) change

> **Caution:**
>
> Version 3.13.25 of the Snowflake JDBC driver changes the default value of the `allowUnderscoresInHost` parameter
> to `false`. This change impacts PrivateLink customers whose account names contain underscores.
> In this situation, you must override the default value by setting `allowUnderscoresInHost` to `true`.

### New features and updates

* Set the default value of the `allowUnderscoresInHost` parameter to `false`, which converts underscores in account names to hyphens to avoid Apache `httpclient` connection errors with underscores. This behavior can be turned off by setting `allowUnderscoresInHost` to `true`.
* Updated the aws-java-sdk-bom library version from 1.11.394 to 1.12.327.
* Added the `enableReturnTimestampWithTimeZone` parameter to set whether to include the timezone in a timestamp.
* Added log warnings for each of the error return paths while parsing a `SnowflakeConnectString`.
* Added commas to the `SnowflakeDatabaseMetaData.getColumn()` arguments to improve readability.
* Added support for stored procedures.

### Bug fixes

* Fixed an issue related to using the GET command when `GCS_USE_DOWNSCOPED_CREDENTIAL` is true.
* Fixed an issue related to returning result types when the session handle is `NULL`.

## Version 3.13.24 (October 28, 2022)

### BCR (Behavior Change Release) change

> **Caution:**
>
> Version 3.13.24 of the Snowflake JDBC driver changes the return values for the `Statement.getMoreResults()`
> and `Statement.getupdateCount()`, as described below.If your projects are affected by breaking changes
> related to these functions, Snowflake recommends that you do not install this version into a production
> environment before testing.

### New features and updates

* Upgraded the following libraries:

  + arrow from version 8.0.0 to 9.0.0
  + jacksondatabind from version 2.13.2.2 to 2.13.4.2
  + google-cloud-storage from version 2.5.0 to 2.6.2
* The `Statement.getMoreResults()` function now returns TRUE when more statements are available to iterate
  through in a multi-statement query.
* The `Statement.getupdateCount()` function now returns 0 instead of -1 for non-DML queries.

## Version 3.13.23 (September 30, 2022)

### New features and updates

* Enabled the parallelism parameter for PUT/GET commands when using Azure.

### Bug fixes

* Fixed an issue with `NoClassDefFoundError` in Google libraries in the FIPs driver.
* Fixed error that occurred when getting procedures with a reader account.

## Version 3.13.22 (August 23, 2022)

### New features and updates

* Updated the tika-core library to version 2.4.1.
* Added support for the new Okta OIE (Okta Identity Engine).

### Bug fixes

* Fixed an issue where `getColumnClassName()` threw an exception when the column type is `timestamp_tz`.
* Fixed an issue where calling `getSQLStateType()` throws an exception while retrieving database metadata.
* Fixed an issue where calling `executeLargeBatch()` for prepared statements might result in no rows being inserted.
* Fixed an issue where `QueryStatus` could return invalid error codes and messages.
* Fixed a null pointer exception that sometimes occurred for session-less clients.

## Version 3.13.21 (July 13, 2022)

### New features and updates

* Added the `getStreams` function to the `SnowflakeDatabaseMetaData` object to list active streams.
* Updated the prefetch memory maximum retry value to improve chunk download performance.

### Bug fixes

* Fixed a memory leak issue with the statement object in the `snowflakeConnectionV1::createResultSet` function.
* Fixed a memory leak issue with arrow result sets.
* Fixed an issue with missing data in the JDBC chunk downloader.

## Version 3.13.20 (June 23, 2022)

### New features

* Implemented fast fail functionality for 404 errors returned from Amazon S3.
* Updated the following dependency in the JDBC driver:

  + arrow version 7.0.0 to 8.0.0
* Upgraded the following Google library versions:

  + google-auth-library from 0.9.0 to 1.5.3
  + google-cloud-storage from 1.82.0 to 2.5.0
  + google api client versions 1.30.10 to 1.33.2
  + google http client versions 1.36.0 to 1.41.4

## Version 3.13.19 (May 25, 2022)

### New features

* Updated the `isValid()` function to send a heartbeat call instead of a SELECT 1 to validate the session connection.
* Added support for setting `VARBINARY byte[]` arrays in the `SnowflakePreparedStatement.setObject()` function.
* Updated the following dependencies in the JDBC driver:

  + arrow version 0.15.1 to 7.0.0
  + jackson version 2.11.0 to 2.13.2
  + bouncy version 1.64 to 1.70

### Bug fixes

* Fixed an issue with `TIMESTAMP_INPUT_FORMAT` for stage binding.

## Version 3.13.18 (May 18, 2022)

### New features

* Upgraded the arrow and jackson libraries.

### Bug fixes

Stopped appending `retryCount` to a scoped URL for chunk downloading.

## Version 3.13.17 (April 14, 2022)

### New features

* Added getters for the `timezone` and `ZonedDateTime` for the `SnowflakeTimestampWithTimezone` object.

### Bug fixes

* Created a patch for driver release v3.13.16 that fixes incorrect behavior for `getSchemas()` function.
* Fixed the setting of invalid JVM parameters `proxyHost` and `proxyPort`.

## Version 3.13.16 (March 17, 2022)

### Bug fixes

* Fixed an issue where the `nonProxyHosts` parameter setting was not honored.

## Version 3.13.15 (February 21, 2022)

### Bug fixes

* Refactored the `isFileTransfer` function into the base class.
* Refactored the `FileTransferAgent` facade classes into base class
* Fixed a segmentation fault issue within Graal VM Native Image applications.
* Fixed and issue that cause the `ChunkDownloader` to hang.

## Version 3.13.14 (January 21, 2022)

### Bug fixes

* Added streaming ingest related metadata for streaming ingest billing.
* Updated BC FIPS version in the public POM.

## Version 3.13.13 (January 18, 2022)

### Bug fixes

* Fixed an issue where the JDBC driver was not updating `stageInfo` for information related to s3RegionalURL.
* Fixed an issue with account names containing underscores.
* Fixed an issue where an empty result set was returned for schemas containing double quotes when
  calling `getTables()` or `getColumns()`.
* Fixed an issue where `getProcedureColumns()` was not working with wildcards.

---
title: JDBC Driver release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc-2023.md
section: Release Notes
---

# JDBC Driver release notes for 2023

This article contains the release notes for the JDBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for JDBC Driver updates.

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

## Version 3.14.4 (December 07, 2023)

### BCR (Behavior Change Release) change

* Fixed a security issue with logging related to the `temp` directory:

  + Logging no longer accesses the temp directory.
  + The default `logpath` value changed from the `temp` directory to the `home` directory.

### New features and updates

* None.

### Bug fixes

* Fixed a `NullPointerException` that occurred when the `gzipDisabled` property key had no value.
* Fixed an issue where the driver failed on JDK v21 with Arrow.

## Version 3.14.3 (November 07, 2023)

### New features and updates

* Updated the following libraries:

  + org.codehaus.plexus:plexus-archiver from 2.4.4 to 4.8.0
  + org.codehaus.plexus:plexus-archiver from 2.6 to 4.8.0
  + org.bouncycastle:bc-fips from 1.0.2.1 to 1.0.2.4
  + aws-java-sdk to 1.12.501
  + jackson to 2.15.3
  + netty to 4.1.100.Final
  + grpc to 1.59.0
* Added the `enablePutGet` connection property to determine whether to allow PUT and GET commands access to local file systems.
* Added support for managing the frequency of retries for unsuccessful connection requests:

  + Added the `retryTimeout` parameter with a default value of 300 seconds.
  + Updated how the driver uses the `loginTimeout` and `maxHttpRetries` connection parameters and changed the default value of `loginTimeout` to 300 seconds.

### Bug fixes

* Fixed an issue relating to `NoSuchMethodError` when using snowflake-ingest-sdk 2.0.3.
* Fix an issue with handling `NULL` to `DATE` data type conversions.
* Fixed a GCP downscope token issue.

## Version 3.13.34 (October 25, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an issue relating to failing PUT commands with a GCP Downscope token in `snowflake-jdbc-fips`.

## Version 3.14.2 (October 02, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an issue where the driver did not honor the `useS3RegionUrl` from `JsonNode` in `getStageInfo`.

## Version 3.14.1 (August 24, 2023)

### New features and updates

* Added the ability to send optional headers from the `util` methods.
* Moved `getQueryStatus` function to `SfBaseSession` to support asynchronous calls in stored procedures.

### Bug fixes

* Fixed an issue where the driver did not send the entire OSCP URL for private links.

## Version 3.14.0 (July 27, 2023)

### BCR (Behavior Change Release) change

* Fixed an issue where, under certain conditions, the JDBC driver could retry HTTP requests indefinitely.

  > Previously, during an outage the JDBC driver would retry the failed HTTP call continuously until the request
  > succeeds or until someone force kills the operation.
  >
  > With this change, disables infinite HTTP retries originating from `execute` and `executeQuery` calls. Now, the JDBC
  > driver limits HTTP retries to seven, by default. Customers can set the `maxHttpRetries` session parameter to
  > customize the maximum number of retries. Customers can set `maxHttpRetries=0` to remove the retry limit,
  > but doing so runs the risk of the JDBC driver infinitely retrying failed HTTP calls.

### New features and updates

* Added the `CLIENT_OUT_OF_BAND_TELEMETRY_ENABLED` session property to allow you to disable OOB telemetry.
* Improved handling for `locatorsUpdateCopy()` function calls. Now, the driver returns `FALSE` instead of throwing an
  exception.
* Updated handling for 400 Bad Request errors for S3 clients and added the `putGetMaxRetries` connection property to
  configure the maximum number of retries for PUT/GET exceptions for storage clients (default: 7).
* Added support for `httpMaxRetries` in `DefaultResultStreamProvider.getResultChunk()` to improve chunk downloading
  performance.

### Bug fixes

* Fixed an issue where the driver incorrectly threw null pointer exceptions (NPEs) when calling `driver.getPropertyInfo()`.
* Fixed an issue where `reader.LoadNextBatch()` would occasionally throw a `ClosedByInterruptException` when reading
  from the arrow stream.
* Fixed an issue where the JDBC driver used the wrong proxy settings for S3 clients.
* Fixed an issue where the `downloadStream()` function disallowed filenames containing Japanese characters.
* Fixed an issue where “~” was not allowed in PUT/GET file paths.
* Fixed an issue where the driver would throw an InvalidPathException when a Windows file path included
  the `file://` prefix for logging configuration files.

## Version 3.13.33 (June 14, 2023)

### New features and updates

* None.

### Bug fixes

* Gracefully handle `MessageFormat.format` exceptions.

## Version 3.13.32 (May 26, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed a bug introduced in 3.13.31 that affects Java Runtime 8.

## Version 3.13.31 (May 25, 2023)

> **Note:**
>
> Please update to newer versions, especially if you see a bug affecting Java Runtime 8.

### New features and updates

* Enhanced hybrid transactional/analytical processing (HTAP).
* Upgraded the `org.apache.httpcomponents:httpclient` library to version 4.5.13 to pick up a security update.

### Bug fixes

* Fixed an issue where authentication attempts would time out for chunk download requests.
* Fixed an issue regarding parsing the configuration file on Windows.
* Fixed an int64 overflow issue with large or small `datetime` values.
* Improved the error message shown when a connection aborted due to SSL/TLS errors.
* Fixed an issue where the `getTime()` function returned a time based on the wrong time zone
  when `useSessionTimezone` is enabled.
* Fixed an issue where ASCII Null characters and control characters were randomly dropped from a `resultset`
  with `jdbc_query_result_format=JSON`.

## Version 3.13.30 (April 18, 2023)

### New features and updates

* Upgraded the following software libraries:

  + slf4j-api from version 1.7.25 to version 2.0.6.
  + logback-classic from version 1.2.3 to version 1.3.6.
* Changed the non-critical “SEVERE: HTTP request took longer than 5 min” from an error message to a warning message.
* Added the `http.proxyProtocol` property for JVM proxy settings.

### Bug fixes

* Fixed an issue where authentication attempts would time out for chunk download requests.
* Fixed an issue where login credentials were visible in exceptions when a connection URL failed to part.
* Fixed a memory leak cause by checking `isClosed()` before adding a `resultset` to `openResultSets`.
* Fixed an issue where a misleading SAML2 assertion error message was sent when `hostnames` mismatched.
* Fixed an issue with URL-encoded OSCP requests.
* Fixed an issue the `SnowflakeFileTransferAgent.uploadStream()` function incorrectly handled the `overwrite=false` option.
* Fixed an issue where the `metadata.getTableTypes()` method returned the wrong table types.
* Updated the driver to expose the SQL error message in an exception message triggered when asynchronous query calls
  resulted in a failed query and exception.
* Added a check for rare cases when get procedure column calls return an empty result set.
* Changed the warning level to debug/info for log messages related to `SnowflakeConnectionString` parse errors.
* Fixed an issue where the JDBC driver would retry requests that failed with `SSLHandshakeException`.
* Added support for the `snowflake.jdbc.enable.illegalAccessWarning` system property to allow users to disable
  illegal access warnings.
* Fixed an issue whether gsc upload file error messages would display the wrong information.
* Changed the default TTL value to close an idle connection after 60 seconds.
* Fixed a prepared statement ID issue by removing extra describe calls.

## Version 3.13.29 (March 17, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed an issue where incorrect column type names were returned for stored procedure column metadata
  when `USE_STATEMENT_TYPE_CALL_FOR_STORED_PROC_CALLS=true`.
* Fixed an issue where the JDBC would retry a GET request when a file could not be downloaded due to a lack of
  space on the target filesystem. Now, the driver throws an exception in this situation.
* Fixed an issue where the JDBC would retry requests on Azure clients when a 404 resource error occurred. Now,
  the driver throws an exception in this situation.
* To protect against SQL injection attacks, the JDBC driver now escapes quotes in the pattern search
  arguments of the DatabaseMetadata API.
* Fixed an issue where getClob() calls raised a NullPointerException when a column contained a NULL value.
  Now, the driver returns `NULL` when column holds a `SQL NULL` value.
* Fixed an issue where the JDBC driver failed to validate an SSO URL before executing it. Now, the driver
  uses the `URLValidator` and `URLEncoder` utilities to validate and encode the URL.

## Version 3.13.28 (February 22, 2023)

### New features and updates

* None.

### Bug fixes

* Added support for the GEOMETRY data type in the `SnowflakeType` enum to fix an issue that occurred when calling
  the `metaData.getColumns()` function to return metadata that included GEOMETRY data.
* Fixed a retry issue in GCP uploadStream that caused partial file uploads when JDBC incorrectly attempted to retry
  uploading an input stream.
* Fixed an issue with stored functions and procedures that returned a `resultset` for `getProcedureColumns()`
  and `getFunctionColumns()` function calls.
* Fixed an issue that caused StreamLoader to generate excessive log messages.

## Version 3.13.27 (January 30, 2023)

### New features and updates

* None

### Bug fixes

* Fixed a race condition that occasionally occurred during GET and PUT operations.
* Fixed an issue where using Okta authentication failed when receiving an HTTP 429 error.

---
title: JDBC Driver release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc-2024.md
section: Release Notes
---

# JDBC Driver release notes for 2024

This article contains the release notes for the JDBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for JDBC Driver updates.

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

## Version 3.21.0 (December 11, 2024)

### New features and updates

* Added support for regional Google Cloud Storage endpoints.
* Added the `SKIP_TOKEN_FILE_PERMISSIONS_VERIFICATION` option to skip token file permission verification.
* Added the `JAVA_LOGGING_CONSOLE_STD_OUT` option to override default `java.util.logging.ConsoleHandler` to write to `stderr` or `stdout` with a specific threshold.
* Added array bind supported log for prepared statements.
* Removed the experimental label from `snowflake-jdbc-thin artifact`.
* Changed the levels for some log messages.
* Updated the Google and Netty dependencies.
* Updated the javadoc documentation.

### Bug fixes

* Replaced raw asserts with exceptions.
* Changed the IV length to 12 bytes for GCM.
* Changed initialization of `SecureRandom` to use a default JVM random number generator.
* Fixed an issue with merging `io.netty.versions.properties` during shade.
* Fixed uncontrolled logging from Arrow library.
* Fixed native libraries relocation for Netty and Conscrypt during shade.
* Fixed get object and get bytes support for the native arrow structured type.

## Version 3.20.0 (October 30, 2024)

### New features and updates

* Added support for ZSTD decompression.
* Bumped the commons IO dependency to version 2.17.0.

### Bug fixes

* Fixed an issue affecting JDBC drivers where files pushed to Azure and GCP stages were uploaded without client-side encryption when the `CLIENT_ENCRYPTION_KEY_SIZE` parameter was set to 256-bit rather than the default 128-bit. For more information, see the [Snowflake JDBC Security Advisory](https://github.com/snowflakedb/snowflake-jdbc/security/advisories/GHSA-f686-hw9c-xw9c).

## Version 3.19.1 (October 25, 2024)

### New features and updates

* Updated the protobuf-java dependency to version 3.25.5.
* Added log message for canceled query reasons.
* Updated bouncy castle dependencies.
* Added troubleshooting guide link to the messages for SSL exceptions.

### Bug fixes

* Unified the structured types string representation.
* Fixed downloading the stream from the git repository.
* Fixed an issue with the connection timeout parameter.
* Fixed issues with Arrow logging.
* Changed the custom cloud storage header metadata handling to be case-insensitive.

## Version 3.19.0 (August 29, 2024)

### New features and updates

* Added support for disabling connection caching.
* Added the `PRIVATE_KEY_BASE64` connection parameter to support base64-encoded private keys.
* Added the following connection properties to support setting timeouts:

  + `HTTP_CLIENT_CONNECTION_TIMEOUT` and `HTTP_CLIENT_SOCKET_TIMEOUT` connection properties.
  + `BROWSER_RESPONSE_TIMEOUT` connection property to specify a browser timeout.
* Upgraded the following dependencies:

  + `Arrow` to version 17.0.0
  + `threeten-bp` to version 1.6.9

### Bug fixes

* Fixed an issue where the `getDate` method was missing an expected parameter.
* Fixed an issue with a `class not found` problem related to `LoggerFactory`.

## Version 3.18.0 (July 24, 2024)

### New features and updates

* Updated the `netty` library to version 4.1.111.Final.
* Added missing property setters in `SnowflakeBasicDataSource`.
* Added the following connection parameters to support backward compatibility for handling timezones:

  + `JDBC_DEFAULT_FORMAT_DATE_WITH_TIMEZONE` determines whether to use the previously hardcoded value for the formatter (default: `true`).
  + `JDBC_GET_DATE_USE_NULL_TIMEZONE` determines whether to use the previously null timezone value for the getDate method (default: `true`).
* Picked a top-level domain for Snowflake hosts.
* Set the last query ID for all failed statements.

### Bug fixes

* Fixed an issue where the retry backoff time could fall outside the minimum and maximum range.
* Fixed an issue relating to converting nested fields metadata in OBJECT columns.
* Fixed an issue where the date files returned the wrong day when using the `getString` or `getDate` method.
* Added a user permission check for a token file.

## Version 3.17.0 (July 08, 2024)

### New features and updates

* Improved logging.
* Exposed the vector dimension in column metadata.
* Added support for `getObject` on vector columns.
* Added support for reading the connection information from a file.
* Added support for Java version 21.
* Added support for dynamic Max LOB size in metadata.
* Improved logging configuration.
* Added JDBC connectivity diagnostics mode.

### Bug fixes

* Fixed an issue with inserting and reading timestamps assymetrically if a batch inserts a large number of columns.
* Fixed an issue with returning inconsistent `timestamps_ltz` between JSON and ARROW result sets.
* Fixed an issue where the driver failed file pattern expansion on file not found in a different pattern.

## Version 3.16.1 (May 27, 2024)

### New features and updates

* Added the `disableSamlURLCheck` parameter to disable SAML URL checks.

### Bug fixes

* Fixed an issue with choosing S3 regional URL domain base on the region name.
* Fixed an issue related to nested paths in Windows when parsing client configurations.
* Fixed an issue where the `getObject` method for arrays in JSON worked incorrectly in versions 3.15.1 and 3.16.0.
* Fixed a casting issue with a `MapVector`.

## Version 3.16.0 (April 29, 2024)

### New features and updates

* Added support for structured types.
* Added support for vector types.
* Improved support for encrypted private keys.
* Updated the security policy notice.

### Bug fixes

* Fixed an issue with native OKTA retry logic.
* Fixed an issue with unsupported reserved keywords.
* Fixed an issue with retry attempts for GET query metadata requests.

## Version 3.15.1 (April 05, 2024)

### New features and updates

* Added support for missing proxy and user password JVM parameters: `http.proxyUser`, `http.proxyPassword`, `https.proxyUser`, `https.proxyPassword`.
* Bumped the `nimbus-jose-jwt` dependency to version 9.37.3.

### Bug fixes

* Moved the public suffix list to an internal package when shading.
* Fixed an issue with ignoring default GCS credentials.
* Fixed an issue with returning decimal or integer values in ARROW format.
* Fixed an issue where the driver returned `java.util.ConcurrentModificationException` while calling `SFAsyncResultSet.next`.
* Fixed an `InvalidPathException` issue on Windows due to nested file paths.

## Version 3.15.0 (February 20, 2024)

### New features and updates

* Added a marker annotation for the internal API.
* Added two new java properties, `net.snowflake.jdbc.http_client_connection_timeout_in_ms` and `net.snowflake.jdbc.http_client_socket_timeout_in_ms`, to let you configure connection and socket timeouts.
* Added a new `enablePatternSearch` connection parameter to enable or disable pattern search for `getCrossReference`, `getExportedKeys`, `getImportedKeys`, and `getPrimaryKeys` metadata operations that should not use their parameters as patterns. Default: `true`.

### Bug fixes

* Fixed an issue with multi-release jar entries.
* Made dependency optional on `com.amazonaws.Protocol` in `HttpClientSettingsKey`.
* Deprecated `com.snowflake.client.jdbc.SnowflakeDriver`.
* Fixed an issue with parsing large responses (greater than 16MB).
* Updated the JDBC specification to version 4.2.

## Version 3.14.5 (January 24, 2024)

### New features and updates

* Added support for AIX 7.2.
* Added support for multiple SAML integrations.
* Updated the `grpc-netty-shaded` dependency to 1.60.0.
* Created a thin jar as a separate maven artifact `snowflake-jdbc-thin` (JDBC thin jar is an experimental feature).
* Implemented `toString()` in `SnowflakePreparedStatementV1`.
* Added `getQueryStatusV2` as replacement for deprecated `getQueryStatus`.

### Bug fixes

* Set the last query ID for failed statements.
* Fixed OOB telemetry initialization when using connectionless mode.
* Fixed an issue with handling GCP token expiration correctly when using connectionless mode.
* Fixed arrow format on AIX.

---
title: JDBC Driver release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc-2025.md
section: Release Notes
---

# JDBC Driver release notes for 2025

This article contains the release notes for the JDBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for JDBC Driver updates.

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

## Version 3.28.0 (December 15, 2025)

### New features and updates

* Introduced a shared library for extended telemetry to identify and prepare the testing platform for native Rust extensions.
* Added the ability to choose the connection configuration in the auto configuration file by specifying the `aws-oauth-file` parameter in the JDBC URL.
* Updated grpc-java to 1.77.0 to address CVE-2025-58057 from transient dependency.
* Updated netty to 4.1.128.Final to address CVE-2025-59419.

### Bug fixes

* Fixed an issue where connection and socket timeout were not propagated to the HTTP client.
* Fixed Azure 503 retries and configured it with the `putGetMaxRetries` parameter.

## Version 3.27.1 (October 30, 2025)

### New features and updates

* Upgraded aws-sdk to 1.12.792 and added STS dependency.
* Added RHEL 9 support.
* Added support for identity impersonation when using workload identity federation.

  + For Google Cloud Platform, added the `workloadIdentityImpersonationPath` connection parameter for `authenticator=WORKLOAD_IDENTITY` allowing workloads to authenticate as a different identity through transitive service account impersonation.
  + For AWS, added the `workloadIdentityImpersonationPath` connection parameter for `authenticator=WORKLOAD_IDENTITY` allowing workloads to authenticate through transitive IAM role impersonation.
* Bumped grpc-java to 1.76.0 to address CVE-2025-58056 from transient dependency.

### Bug fixes

* Fixed exponential backoff retry time for non-auth requests.

## Version 3.27.0 (October 6, 2025)

### New features and updates

* Added retries for HTTP responses 307 and 308 to handle internal IP redirects.
* PAT creation with the `execute` method now returns a `ResultSet`.
* Bumped netty to 4.1.127.Final to address CVE-2025-58056 and CVE-2025-58057.
* Added support for Interval Year-Month and Day-Time types in JDBC.
* Added support for Decfloat types in JDBC.
* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [Certificate revocation list (CRL) options](../../developer-guide/jdbc/jdbc-parameters.md). We recommend you test this feature in advisory mode before enabling it in production.

### Bug fixes

* Fixed permission check of the `.toml` configuration file.
* Fixed pattern search for file when `QUOTED_IDENTIFIERS_IGNORE_CASE` is enabled.

## Version 3.26.1 (August 29, 2025)

### New features and updates

* Added support for TLS version 1.3, including the following parameter:

  + `MIN_TLS_VERSION` specifies the minimum SSL/TLS version to use when initiating a TLS handshake.
  + `MAX_TLS_VERSION` specifies the maximum SSL/TLS version to use when initiating a TLS handshake.

### Bug fixes

* Fixed an issue with a `NullPointerException` when MFA is enabled in Okta and native Okta authentication is used.
* Fixed an issue with `CloseableHttpClient` being cached indefinitely.
* Increased netty to version 4.1.124.Final to address [CVE-2025-3823](https://github.com/netty/netty/security/advisories/GHSA-4v5m-5c5m-5c5m).

## Version 3.26.0 (August 13, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `workloadIdentityProvider` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the `authenticator` connection parameter.

### Bug fixes

* Fixed the OAuth Authorization Code’s default value for redirect URI removing a trailing / (slash) to be compliant with RFC 6749 Section 3.1.2.
* Fixed a bug resulting in `NullPointerException` when using `SnowflakeChunkDownloader` with connection pooling.
* Fixed a bug preventing the use of `auto-config` with connection pooling.
* Fixed a bug preventing application termination immediately because of telemetry threads.
* Forced proxy basic authentication for an S3 client.
* Removed the requirement for `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable in order to use workload identity federation.
* Fixed array binding for the `Date` datatype.

## Version 3.25.1 (July 21, 2025)

### New features and updates

* Added the `ENABLE_WILDCARDS_IN_SHOW_METADATA_COMMANDS` parameter to enable using patterns in `DatabaseMetaData` SHOW … IN … commands.
* Added the `OWNER_ONLY_STAGE_FILE_PERMISSIONS_ENABLED` parameter which forces the directory that contains the stage files to have owner only permissions (0600).

### Bug fixes

* Fixed unnecessary exception wrapping during network retries.
* Added retries for protocol_version error during TLS negotiation.
* Fixed an issue with the default trust manager not extending `X509ExtendedTrustManager`.
* Added a missing log parameter to the Session logs.

## Version 3.25.0 (July 09, 2025)

### New features and updates

* Added support for sovereign clouds and removed obsolete issuer checks for Workload Identity Federation.

### Bug fixes

* Fixed a bug that prevented `TelemetryThreadPool` from scaling based on the workload.
* Fixed access token expiration handling for the legacy OAuth flow.
* Removed an obsolete error log on HTTP response checks.

## Version 3.18.1 (June 05, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.17.1 (June 05, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.21.1 (June 04, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.20.1 (June 04, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.22.1 (June 03, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.24.2 (May 31, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with initializing a trust manager with the default JVM algorithm for trust managers.

## Version 3.24.1 (May 28, 2025)

### New features and updates

* Added the `HttpHeadersCustomizer` interface to provide a flexible way to inject custom HTTP headers into various requests initiated by the Snowflake JDBC driver
* Added the `LOCAL_APPLICATION` default for the `clientId` and `clientSecret` OAUTH parameters.

### Bug fixes

* Fixed handling of timestamps before 04.10.1582 (Gregorian reform) when inserting with `BindUploader`.
* Fixed NPE handling of writing to the cache file when the file is not accessible.
* Fixed the Workload Identity Federation request signature for AWS.

## Version 3.24.0 (April 30, 2025)

### Private Preview (PrPr) features

Added support for Workload Identity Federation in the AWS, Azure, GCP, and Kubernetes platforms.

Disclaimer:

* This feature can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use this feature only with non-production data.
* This PrPr feature is not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Added support for PAT, OAuth 2.0 Authorization Code Flow, OAuth 2.0 Client Credentials Flow, and OAuth Token caching.

  + For PAT: Added the `PROGRAMMATIC_ACCESS_TOKEN` parameter for the parameter authenticator.
  + For OAuth 2.0 Authorization Code Flow:

    - Added the `oauthClientId`, `oauthClientSecret`, `oauthAuthorizationUrl`, `oauthTokenRequestUrl`, and `oauthScope` parameters.
    - Added the `OAUTH_AUTHORIZATION_CODE` parameter for the parameter authenticator.
  + For OAuth 2.0 Client Credentials Flow:

    - Added the `oauthClientId`, `oauthClientSecret`, `oauthTokenRequestUrl`, and `oauthScope` parameters.
    - Added the `OAUTH_CLIENT_CREDENTIALS` parameter for the parameter authenticator.
  + For OAuth Token caching: Passing a username to driver configuration is required, and the `clientStoreTemporaryCredential` property cannot be set to `false`.
* Removed dependencies on the `joda-time` and `google-http-client` libraries.

### Bug fixes

* Fixed the OCSP cache server URL when using a proxy.
* Fixed an issue where binding execution for TIMESTAMP_LTZ type caused incorrect binding for other date time types.
* Fixed the handling of dates before 04.10.1582 (Gregorian reform) when inserting with `BindUploader`.
* Fixed the handling of the TIME type as wall clock time by adding the `CLIENT_TREAT_TIME_AS_WALL_CLOCK_TIME` parameter.

## Version 3.23.2 (April 3, 2025)

### New features and updates

* None

### Bug fixes

* Fixed a null pointer exception that occurred when the cache folder is inaccessible.

## Version 3.23.1 (March 13, 2025)

### New features and updates

* None

### Bug fixes

* Fixed a missing dependency version declaration for the nimbusds library.
* Fixed an issue with creating the file used for caching on Windows environment.
* Fixed an issue with logging on the debug level when the client-side encryption master key of the target stage during the execution of GET/PUT commands was logged locally. The key by itself does not grant access to any sensitive data. For more information, see [CVE-2025-27496](https://github.com/snowflakedb/snowflake-jdbc/security/advisories/GHSA-q298-375f-5q63).
* Fixed an issue with prioritizing GCS credentials over the Snowflake credentials during communication with storage. Changed the default value of parameter `disableGcsDefaultCredentials` to `true`.
* Fixed the retry mechanism used in the authentication process using OKTA.

## Version 3.23.0 (February 27, 2025)

### Private Preview (PrPr) features

Added support for PAT, OAuth 2.0 Authorization Code Flow, OAuth 2.0 Client Credentials Flow, and OAuth Token caching in Private Preview.

* For PAT: Added the `PROGRAMMATIC_ACCESS_TOKEN` parameter for the parameter authenticator.
* For OAuth 2.0 Authorization Code Flow:

  + Added the `oauthClientId`, `oauthClientSecret`, `oauthAuthorizationUrl`, `oauthTokenRequestUrl`, and `oauthScope` parameters.
  + Added the `OAUTH_AUTHORIZATION_CODE` parameter for the parameter authenticator.
* For OAuth 2.0 Client Credentials Flow:

  + Added the `oauthClientId`, `oauthClientSecret`, `oauthTokenRequestUrl` and `oauthScope` parameters.
  + Added the `OAUTH_CLIENT_CREDENTIALS` parameter for the parameter authenticator.
* For OAuth Token caching: Passing a username to driver configuration is required, and the `clientStoreTemporaryCredential` property cannot be set to `false`.

Disclaimer:

* These features can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use these features only with non-production data.
* These PrPr features are not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Improved the exception message when getting query metadata.
* Added the `ENABLE_EXACT_SCHEMA_SEARCH_ENABLED` parameter to enable exact schema searches in some `DatabaseMetaData` methods.
* Added more explicit error messages when a username or password is missing in the DataSource.
* Bumped the following dependencies:

  + netty to version 4.1.118.Final
  + json-smart to version 2.5.2
  + asm to version 9.7.1
* Added the ability to convert the `CLIENT_REQUEST_MFA_TOKEN` flag from `string` to `boolean`.
* Added the ability to set the query timeout for the server side or client side, not both.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* Fixed wrong behavior of setting proxy in global request configurations.
* Fixed non-empty logs when the log level is set to `OFF`.
* Fixed file paths allowing triple slash file prefix (`file:///`) in the PUT command.
* Exceptions thrown by `uploadFileCallable` are now propagated to the main thread instead of failing silently.

## Version 3.22.0 (January 29, 2025)

### New features and updates

* Added the following connection parameters:

  + `CLEAR_BATCH_ONLY_AFTER_SUCCESSFUL_EXECUTION` parameter to clear batches only after successful execution.
  + `disableOCSPChecks` parameter to replace the deprecated `insecureMode` parameter.
  + `IMPLICIT_SERVER_SIDE_QUERY_TIMEOUT` parameter to set timeouts for synchronous queries on both the client and server.
* Added the `SnowflakeStatement.setAsyncQueryTimeout` method to timeout asynchronous queries on the server.
* Added the `net.snowflake.jdbc.commons_logging_wrapper` java property to configure handling logs from `commons-logging`.

### Bug fixes

* Fixed handling endpoints without protocol in PUT/GET operations in GCS (Google Cloud Storage).
* Fixed a performance issue with too frequent calls of `toString` when fetching results containing structured types.
* Fixed an issue with `createArrayOf` case-insensitivity.
* Fixed an issue where `downloadStream` could download different files with the same prefix.
* Fixed the possibility of `%PATH%` privilege escalation when authentication is set as `EXTERNALBROWSER` and used in a Windows environment. For more information, see [CVE-2025-24789](https://github.com/snowflakedb/snowflake-jdbc/security/advisories/GHSA-7hpq-3g6w-pvhf).
* Fixed the verification of the file permissions and owner created in Linux environments and used for caching tokens when authentication is set to `EXTERNALBROWSER` or `USERNAME_PASSWORD_MFA`. For more information, see [CVE-2025-24790](https://github.com/snowflakedb/snowflake-jdbc/security/advisories/GHSA-33g6-495w-v8j2).

---
title: JDBC Driver release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/jdbc-2026.md
section: Release Notes
---

# JDBC Driver release notes for 2026

This article contains the release notes for the JDBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for JDBC Driver updates.

See [JDBC Driver](../../developer-guide/jdbc/jdbc.md) for documentation.

## Version 4.1.0 (Apr 08, 2026)

### New features and updates

* Added warning when using plain HTTP endpoints for OAuth authentication.
* Added `getRole`, `getWarehouse`, and `getDatabase` API extension methods.
* Removed the `io.netty.tryReflectionSetAccessible` system property setting, as it is no longer needed with modern Arrow/Netty versions.

### Bug fixes

* Fixed `ObjectMapper` initialization when `DATE_OUTPUT_FORMAT` is specified.
* Fixed Netty native library conflict in thin JAR packaging.
* Bumped Netty to 4.1.132.Final to address [CVE-2026-33870](https://nvd.nist.gov/vuln/detail/CVE-2026-33870) and [CVE-2026-33871](https://nvd.nist.gov/vuln/detail/CVE-2026-33871).
* Fixed driver failure when a security manager prohibits access to system properties, environment variables, or security provider modifications.
* Fixed crash in `getColumns` when a table contained an unrecognized column type.
* Fixed session expiration when multiple sessions have different heartbeat intervals.
* Fixed query context not being merged from failed query responses.

## Version 4.0.2 (Mar 12, 2026)

### New features and updates

* Bumped the `commons-compress` dependency to version 1.28.0 to address [CVE-2024-25710](https://nvd.nist.gov/vuln/detail/CVE-2024-25710) and [CVE-2024-26308](https://nvd.nist.gov/vuln/detail/CVE-2024-26308).

### Bug fixes

* Fixed expired session token renewal when polling results.
* Fixed missing minicore async initialization that was dropped during public API restructuring in v4.0.0.
* Adjusted the level of logging during driver initialization.
* Added sanitization for `nonProxyHosts` regex patterns.
* Fixed a bug with a malformed file during S3 upload.
* Added periodic closure of sockets closed by the remote end.
* Restored the S3 client’s multipart threshold to 16 MB.
* Fixed the fat jar with S3 iteration where the `software.amazon.awssdk.transfer.s3.internal.ApplyUserAgentInterceptor` class could not be found.
* Removed Conscrypt from shading to prevent a `failed to find class org/conscrypt/CryptoUpcalls` native error.
* Fixed a `NullPointerException` when the HOME directory cache is not available.
* Fixed proxy authentication when connecting to GCP.
* Fixed a bug where a caller-provided schema was ignored in `getStreams()`.
* Fixed S3 error handling that manifested with a `NullPointerException`.

## Version 4.0.1 (Feb 09, 2026)

### New features and updates

* None.

### Bug fixes

* Fixed incorrect encryption algorithm selection when uploading a file to S3 with the `client_encryption_key_size` account parameter set to 256.
* Fixed a `software.amazon.awssdk.transfer.s3.internal.ApplyUserAgentInterceptor` class could not be found issue in the fat jar.
* Removed Conscrypt from shading to prevent a native error when the `org/conscrypt/CryptoUpcalls` class could not be found.
* Fixed external browser authentication after an enum name change that caused an “Invalid connection URL: Invalid SSOUrl found” error.
* Rolled back the external browser authenticator name to `externalbrowser`.
* Updated BouncyCastle dependencies to address [CVE-2025-8916](https://nvd.nist.gov/vuln/detail/CVE-2025-8916) and [CVE-2025-8885](https://nvd.nist.gov/vuln/detail/CVE-2025-8885).

## Version 4.0.0 (Jan 27, 2026)

> **Important:**
>
> Due to some underlying issues, Snowflake recommends that AWS and Azure customers do not upgrade to this version if you use PUT or GET queries. Instead, Snowflake recommends that you upgrade directly to version 4.0.1. If you have already upgraded to this version, please upgrade to version 4.0.1 as soon as possible.

### BCR (Behavior Change Release) changes

* The public API was restructured, and all public APIs were moved to the `net.snowflake.client.api.*` package hierarchy:

  + Deprecated `net.snowflake.client.jdbc.SnowflakeDriver`. You should now use `net.snowflake.client.api.driver.SnowflakeDriver` instead.
  + Added a unified `QueryStatus` class in public API that replaces the deprecated `QueryStatus` enum and `QueryStatusV2` class.
  + Added new `DownloadStreamConfig` and `UploadStreamConfig` public API interfaces for stream upload/download configuration.
  + Added `SnowflakeDatabaseMetaData` interface to public API for database metadata operations.
  + Added `SnowflakeAsyncResultSet` interface to public API for async query operations.
  + Added `SnowflakeResultSetSerializable` interface to public API.
  + Moved internal classes to `net.snowflake.client.internal.*` package hierarchy.

  For more information, see [Migrating from JDBC Driver 3.x to JDBC Driver 4.x](../../developer-guide/jdbc/jdbc-migration.md).
* Renamed BouncyCastle JVM property from `net.snowflake.jdbc.enableBouncyCastle` to `net.snowflake.jdbc.useBundledBouncyCastleForPrivateKeyDecryption`.
* Removed previously deprecated classes and methods:

  + Removed the deprecated `com.snowflake.client.jdbc.SnowflakeDriver` class.
  + Removed the deprecated `QueryStatus` enum from the `net.snowflake.client.core` package.
  + Removed the deprecated `QueryStatusV2` class from the `net.snowflake.client.jdbc` package.
  + Removed the deprecated `SnowflakeType` enum from the `net.snowflake.client.jdbc` package.

### New features and updates

* Migrated from AWS SDK v1 to AWS SDK v2 for improved performance and modern API support.
* Upgraded Azure Storage SDK from version 5 to version 12.
* Upgraded nimbus-jose-jwt OAuth2 dependency to version 11.30.1.
* Bumped netty to version 4.1.130.Final to address [CVE-2025-67735](https://nvd.nist.gov/vuln/detail/CVE-2025-67735).

### Bug fixes

* Fixed the `column_size` value in database metadata commands to match the JDBC specification.
* Fixed a `NullPointerException` when in-band telemetry is sent without an HTTP response.

---
title: Jul 01, 2025: Snowflake Multi-Node ML Jobs (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-01-distributed-ml-jobs.md
section: Release Notes
---

# Jul 01, 2025: Snowflake Multi-Node ML Jobs (*Preview*)

Snowflake Multi-Node ML Jobs are now available in preview. This feature enables you to run distributed machine learning (ML) workflows inside Snowflake ML container runtimes across multiple compute nodes. With multi-node ML jobs, you can distribute work across several nodes to process large datasets and complex models with improved performance and scalability.

For more information, see:

* [Snowflake Multi-Node ML Jobs](../../../developer-guide/snowflake-ml/ml-jobs/distributed-ml-jobs.md)

---
title: Jul 03, 2025: Query insights
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-03-query-insights.md
section: Release Notes
---

# Jul 03, 2025: Query insights

If there are conditions that affect query performance, Snowflake provides insights about these conditions. Each insight includes
a message that explains how query performance might be affected and provides a general recommendation for next steps.

You can access these insights by querying
[the QUERY_INSIGHTS view](../../../sql-reference/account-usage/query_insights.md).

For information, see [Using query insights to improve performance](../../../user-guide/query-insights.md).

---
title: Jul 04, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-04-na-spcs-gcp-ga.md
section: Release Notes
---

# Jul 04, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*Preview*)

With this release, support for Snowflake Native App with Snowpark Container Services on Google Cloud is available for preview. Apps with containers can be
deployed and operated on Google Cloud.

See [About Snowflake Native Apps with Snowpark Container Services](../../../developer-guide/native-apps/native-apps-about.md) for information on Snowflake Native App with Snowpark Container Services.
See [Support for private connectivity, VPS, and government regions](../../../developer-guide/native-apps/limitations.md) for more information on platforms supported by the Snowflake Native App Framework.

---
title: Jul 07, 2025: Account Usage: New CREDENTIALS view
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-07-credentials-view.md
section: Release Notes
---

# Jul 07, 2025: Account Usage: New CREDENTIALS view

You can now view information about the following types of credentials in the new CREDENTIALS view in the ACCOUNT_USAGE schema:

* [Programmatic access tokens](../../../user-guide/programmatic-access-tokens.md)
* [Passkeys](../../../user-guide/security-mfa-second-factor.md)
* [Time-based one-time passcodes (TOTPs)](../../../user-guide/security-mfa-second-factor.md)

For information, see [CREDENTIALS view](../../../sql-reference/account-usage/credentials.md).

---
title: Jul 08, 2025: ML Explainability visualizations (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-08-ml-explainability-visualizations.md
section: Release Notes
---

# Jul 08, 2025: ML Explainability visualizations (*General availability*)

We are pleased to announce the general availability of ML Explainability visualizations.

These visualizations provide insights into the ways features influence model behavior and predictions.

For more information, see [Model explainability visualizations](../../../developer-guide/snowflake-ml/model-registry/model-explainability-visualization.md).

---
title: Jul 08, 2025: Snowflake AI_EMBED multimodal embeddings (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-08-aisql-image-ai-embed.md
section: Release Notes
---

# Jul 08, 2025: Snowflake AI_EMBED multimodal embeddings (*Preview*)

The AI_EMBED function is now available in preview, enabling customers to generate high-quality image and text embedding vectors
directly within Snowflake using simple SQL. Embedding vectors allow text and images to be compared and searched based on their features.

AI_EMBED allows organizations to:

* **Build sophisticated image search and similarity systems:** Find visually similar products, medical images, or design assets across massive datasets.
* **Convert complex documents and images into searchable vector representations:** Transform unstructured visual content into queryable data.
* **Enhance content moderation workflows:** Automatically detect and flag inappropriate visual content across user-generated media.
* **Streamline digital asset management:** Organize and retrieve marketing materials, brand assets, and creative content through semantic image search.
* **Support manufacturing quality control:** Identify defects and anomalies in product images by comparing against reference standards.
* **Enable intelligent document processing:** Extract insights from invoices, contracts, and forms by embedding both text and visual layout information.

To learn more, see [Cortex AI Functions: Images](../../../user-guide/snowflake-cortex/ai-images.md) and [AI_EMBED](../../../sql-reference/functions/ai_embed.md).

---
title: Jul 15, 2025: Support for Streamlit 1.45.1 (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-15-sis.md
section: Release Notes
---

# Jul 15, 2025: Support for Streamlit 1.45.1 (General availability)

Version 1.45.1 of the Streamlit open-source library is now supported in Streamlit in Snowflake.

---
title: Jul 16, 2025: Data governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-16-tag-propagation-log.md
section: Release Notes
---

# Jul 16, 2025: Data governance release notes

## Automatic tag propagation: Event table to monitor conflicts (*General availability*)

Use an event table to collect telemetry data related to automatic tag propagation, especially data related to conflicts encountered during
propagation and how they were resolved. After Snowflake starts collecting data in the event table, you can query the table, create a stream
to track changes, or set alerts to send notifications when certain events occur.

For more information, see [Using an event table to monitor tag propagation](../../../user-guide/object-tagging/propagation.md).

---
title: Jul 17, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-17-dcr.md
section: Release Notes
---

# Jul 17, 2025: Snowflake Data Clean Rooms updates

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Analysis reports are now retained** for Audience Overlap, SQL, and custom templates when you edit or delete a clean room.
  Previously, editing or deleting a clean room would delete the analysis reports for that clean room.
* **Cross-Cloud Auto-Fulfillment updates.** Consumers of Cross-Cloud Auto-Fulfillment clean rooms now must enable
  Cross-Cloud Auto-Fulfillment on their account before installing a cross-cloud clean room. The new requirement simplifies the sharing flow
  in the API and improves the clean room experience. [Read more here.](../../../user-guide/cleanrooms/v1/enabling-laf.md)

---
title: Jul 18, 2025: Alerts on new data (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-18-alerts-on-new-data.md
section: Release Notes
---

# Jul 18, 2025: Alerts on new data (*General availability*)

Snowflake announces the general availability of alerts on new data, which was previously available as a preview feature.

An [alert on new data](../../../user-guide/alerts.md) is executed when new rows are added to a specified table or view.
Snowflake evaluates the condition against the new rows.

You can set up an alert on new data to notify you when new rows for error messages are inserted into the
[event table](../../../developer-guide/logging-tracing/event-table-setting-up.md) for your account. Because dynamic table refreshes
and task executions log events to the event table, you can set up an alert on new data to:

* [Monitor dynamic table refreshes](../../../user-guide/dynamic-tables-monitor-event-table-alerts.md).
* [Monitor task executions](../../../user-guide/tasks-events.md).

For more information, see [Alerts on new data](../../../user-guide/alerts.md).

---
title: Jul 18, 2025: Sensitive data classification
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-18-database-classification.md
section: Release Notes
---

# Jul 18, 2025: Sensitive data classification

## Automatic classification of a database (*Preview*)

You can now set a classification profile on a database rather than a schema so that all tables and views within the database are
automatically classified for sensitive data.

For more information, see [Set a classification profile on a database](../../../user-guide/classify-auto.md).

## Determine which databases and schemas are monitored by automatic sensitive data classification (*Preview*)

You can now call a system function to determine which tables and views are being automatically classified by sensitive data classification.
This function, SYSTEM$SHOW_SENSITIVE_DATA_MONITORED_ENTITIES, returns the databases and schemas that are associated with a classification
profile, which indicates that objects within the databases and schemas are being automatically classified at the interval specified by the
profile.

For more information, see [Determine which databases are being classified](../../../user-guide/classify-intro.md).

---
title: Jul 18, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-18-iceberg-external-writes-cld.md
section: Release Notes
---

# Jul 18, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*Preview*)

Snowflake now supports write operations for externally managed Iceberg tables and catalog-linked databases that connect
to external Iceberg REST catalogs. These features expand data engineering workflows between Snowflake and the broader Iceberg ecosystem.

Key capabilities:

* Create new Iceberg tables directly in your remote catalog using Snowflake.
* Perform full DML operations (INSERT, UPDATE, DELETE, MERGE) on externally managed tables.
* Create a Snowflake database that’s linked to your remote Iceberg REST catalog (AWS Glue, Snowflake Open Catalog, and others).
* Discover and access multiple remote Iceberg tables without individually defining them in Snowflake.

For more information, see [Write support for externally managed Apache Iceberg™ tables](../../../user-guide/tables-iceberg-externally-managed-writes.md)
and [Use a catalog-linked database for Apache Iceberg™ tables](../../../user-guide/tables-iceberg-catalog-linked-database.md).

---
title: Jul 21, 2025: Billing contact information updates (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-21-billing-contact-info-updates.md
section: Release Notes
---

# Jul 21, 2025: Billing contact information updates (*General availability*)

With this release, Snowflake introduces billing contact information updates for on-demand, self-service customers. Trial account
holders who add a payment method to their account can now update their billing contact information.

For information, see [Update billing contact information](../../../user-guide/billing-contacts.md).

The new navigation menu will roll out gradually and might not be available to all on-demand self-service accounts at the same time.

---
title: Jul 21, 2025: CREATE_BILLING_EVENT and CREATE_BILLING_EVENTS system functions (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-21-create-billing-events-ga.md
section: Release Notes
---

# Jul 21, 2025: CREATE_BILLING_EVENT and CREATE_BILLING_EVENTS system functions (*General availability*)

With this release, the SYSTEM$CREATE_BILLING_EVENT and SYSTEM$CREATE_BILLING_EVENTS functions are now generally available. These functions enable you to create one or more billing events for your Snowflake account to track consumer usage of installed monetized applications.

For more information, see [SYSTEM$CREATE_BILLING_EVENTS](../../../sql-reference/functions/system_create_billing_events.md) and [SYSTEM$CREATE_BILLING_EVENT](../../../sql-reference/functions/system_create_billing_event.md).

---
title: Jul 24, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-24-dcr.md
section: Release Notes
---

# Jul 24, 2025: Snowflake Data Clean Rooms updates

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **See provider activation task history for your account.** You can now see a history of provider activation calls for a clean room by
  calling `dcr_health.provider_run_provider_activation_history`.
* **See clean rooms tasks running or recently stopped in your account.** The new procedure `dcr_health.dcr_tasks_health_check` shows
  information about running or recently stopped clean room tasks in your account.

---
title: Jul 25, 2025: Cortex AI Functions AI_SENTIMENT (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-25-cortex-aisql-ai-sentiment.md
section: Release Notes
---

# Jul 25, 2025: Cortex AI Functions AI_SENTIMENT (*General availability*)

AI_SENTIMENT, an industry-leading task-specific function that delivers state-of-the-art sentiment classification across
diverse content and languages, is now generally available as part of Snowflake Cortex AI Functions. AI_SENTIMENT
allows organizations to understand not just how customers feel, but precisely what aspects of their products, services,
or brand drive customer satisfaction or concern.

AI_SENTIMENT’s capabilities include:

* **Industry-Leading Accuracy:** Achieves 92% accuracy on aspect-based sentiment benchmarks and 83% on overall sentiment
  analysis, significantly outperforming popular alternatives such as Claude 4 Sonnet, GPT 4.1, and AWS DetectSentiment.
* **Comprehensive Multilingual Analysis:** Delivers both overall and granular entity-specific sentiment across seven
  languages without requiring translation.
* **Customized Sentiment Analysis:** Analyzes sentiment for up to ten specific entities, aspects, or categories that
  matter most to your business.
* **Advanced Classification Spectrum:** Identifies positive, negative, neutral, and mixed emotions with sophistication
  and nuance, returning “unknown” only when sentiment cannot be reliably determined from the content.
* **Human-like Contextual Understanding:** Interprets subtle signals such as implicit complaints (“I had to contact
  support three times”), figurative language, and cultural context that typically confuse other AI systems.
* **Dual-Function Flexibility:** Choose between AI_SENTIMENT for comprehensive aspect-based analysis or SENTIMENT for
  quick overall sentiment scoring, depending on your requirements.

Whether you’re monitoring social media sentiment, analyzing product feedback, or tracking brand perception, AI_SENTIMENT
provides the intelligence you need to make data-driven decisions that improve customer satisfaction — in multiple
markets, venues, and languages. To get started, see [AI_SENTIMENT](../../../user-guide/snowflake-cortex/ai-sentiment.md).

---
title: Jul 25, 2025: Snowflake Native App Framework support for Snowflake machine learning models (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-25-na-ml-ga.md
section: Release Notes
---

# Jul 25, 2025: Snowflake Native App Framework support for Snowflake machine learning models (*General availability*)

With this release, Snowflake introduces machine learning models in the Snowflake Native App Framework. This allows you
to use [Snowflake ML](../../../developer-guide/snowflake-ml/overview.md) models in a Snowflake Native App.

For information, see [Use Snowflake machine learning models in a Snowflake Native App](../../../developer-guide/native-apps/snowflake-ml-na-about.md).

---
title: Jul 28, 2025: Cortex Powered Object Descriptions
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-28-object-descriptions.md
section: Release Notes
---

# Jul 28, 2025: Cortex Powered Object Descriptions

## Ability to generate descriptions without being the owner

You can now use Snowflake Cortex to generate descriptions for Snowflake objects if you have the SELECT privilege on the object. Previously, you needed the OWNERSHIP privilege on the object. You still need the OWNERSHIP privilege if you want to save the generated description.

For the steps you take in Snowsight to generate a description, see [Generate descriptions without saving](../../../user-guide/ui-snowsight-cortex-descriptions.md).

For a complete list of roles and privileges you need to generate object descriptions, see [Cortex descriptions access control requirements](../../../user-guide/ui-snowsight-cortex-descriptions.md).

---
title: Jul 28, 2025: Single-use refresh tokens for Snowflake OAuth
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-28-oauth.md
section: Release Notes
---

# Jul 28, 2025: Single-use refresh tokens for Snowflake OAuth

Use single-use refresh tokens to increase the security posture of your Snowflake OAuth security integrations.

Single-use refresh tokens prevent stolen refresh tokens from being reused in your Snowflake account. For more information, see
[Single-use refresh tokens for Snowflake OAuth security integrations](../../../user-guide/single-use-refresh-tokens.md).

---
title: Jul 29, 2025: Cortex Agents integration for Microsoft Teams and Copilot (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-29-cortex-agents-for-ms-teams.md
section: Release Notes
---

# Jul 29, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)

Cortex Agents, which allow natural language queries of structured and unstructured data, now support integration with
Microsoft Teams and Microsoft 365 Copilot. This feature is available in preview in the Azure US East 2 (Virginia) region.

The integration lets users interact conversationally with a Cortex Agent within the Teams interface, or in Copilot,
making your Snowflake data more useful by bringing it closer to where users work. For more information, see
[Cortex Agents for Microsoft Teams and Microsoft 365 Copilot](../../../user-guide/snowflake-cortex/cortex-agents-teams-integration.md).

---
title: Jul 30, 2025: External network access with private connectivity: Google Cloud
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-30-outbound-network-access-private-gcp.md
section: Release Notes
---

# Jul 30, 2025: External network access with private connectivity: Google Cloud

You can now create and manage Google Cloud Private Service Connect endpoints for external network access.

For more information, see [External network access and private connectivity on Google Cloud](../../../developer-guide/external-network-access/creating-using-private-gcp.md) and [Manage private connectivity endpoints: Google Cloud](../../../user-guide/private-manage-endpoints-gcp.md).

---
title: Jul 31, 2025: AI Observability in Snowflake Cortex (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-07-31-ai-observability-ga.md
section: Release Notes
---

# Jul 31, 2025: AI Observability in Snowflake Cortex (*General availability*)

With this release, we are pleased to announce the general availability of AI Observability in Snowflake Cortex, which was previously available as a preview feature. AI Observability enables you to evaluate and trace your generative AI applications, making them more trustworthy and transparent.

AI Observability allows you to systematically measure the performance of your AI applications by running evaluations, logging application traces for debugging, and benchmarking performance for production deployments. Key features include:

* **Evaluations:** Systematically assess generative AI applications and agents using the LLM-as-a-judge technique, leveraging metrics such as accuracy, latency, usage, and cost.
* **Comparison:** Compare multiple evaluations side by side to identify the best configuration for production.
* **Tracing:** Trace every step of application executions to debug and refine your applications.

AI Observability supports a variety of task types, such as retrieval-augmented generation (RAG) and summarization, and provides detailed metrics to help you optimize your applications.

For more information, see [AI Observability in Snowflake Cortex](../../../user-guide/snowflake-cortex/ai-observability.md).

---
title: July 01, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-07-01.md
section: Release Notes
---

# July 01, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## New homepage for Snowsight —– *Preview*

With this release, we are pleased to announce the preview of the Snowsight homepage.

The Snowsight homepage has been updated to include:

* New navigation menu - updated to include menu items for creating and managing data products, notebooks, worksheets, databases and all related artifacts.
* Updated search - updated to allow for easier discovery of content.
* Quick actions - Quickly and easily perform operations specific to your current role. For example examine tables,
  create worksheets to execute python code and more.
* Recently viewed tab, which includes recent operations and associated content.

For more information, see [Exploring the Snowsight user interface](../../../user-guide/ui-snowsight-homepage.md).

---
title: July 01-03, 2024 — 8.24 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_24.md
section: Release Notes
---

# July 01-03, 2024 — 8.24 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Trust Center: CIS Benchmarks scanner package — *General availability*

With this release, we are pleased to announce the general availability of the CIS Benchmarks scanner package in the Trust Center.

For more information, see [CIS Benchmarks scanner package](../../user-guide/trust-center/overview.md).

### Authentication policies: New multi-factor authentication parameters

With this release, we are pleased to announce the following new parameters for authentication policies:

* MFA_AUTHENTICATION_METHODS: A list of authentication methods that enforce multi-factor authentication (MFA) during login.
* MFA_ENROLLMENT: Determines whether a user must enroll in MFA.

For more information, see the following topics:

* [Hardening user or account authentication using MFA](../../user-guide/authentication-policies.md)
* [Authentication policy DDL commands reference](../../user-guide/authentication-policies.md)

> **Note:**
>
> If your network administrator enables these parameters, any installed clients, drivers, and the Snowflake CLI must have their
> connection settings reconfigured to enable MFA.

## Virtual warehouse updates

### Hybrid tables: Changes to capacity quotas — *Preview*

Default capacity quotas for hybrid table storage and requests are now enforced at the database level for all Snowflake accounts. Different databases within the same account are isolated instead of sharing the same quota. This release also introduces a new quota that limits the number of databases within a single Snowflake account that can contain hybrid tables.

For more information, see [Quotas and throttling](../../user-guide/tables-hybrid-limitations.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview)  Note that, at the time of publication, the 8.23 release was still in progress. | 24-Jun-24 |
| *Trust Center: CIS Benchmarks scanner package — General availability* | **Added** to *Security updates* section | 02-Jul-24 |
| *Hybrid tables: changes to capacity quotas — Preview* | **Added** to *Virtual warehouse updates* section | 08-Jul-24 |

---
title: July 02, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-07-02.md
section: Release Notes
---

# July 02, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Snowflake Notebooks external access —– *Preview*

With this release, we are pleased to announce the preview of the Snowflake Notebooks external access support.

By default, Snowflake restricts network traffic requests from external endpoints.
With this release how you can set up external network access for your notebook.

For more information, see [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md).

---
title: July 03, 2024 — Data pipelines: Support for Apache Iceberg™ tables with dynamic tables and streams –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-03-dynamic-iceberg-tables.md
section: Release Notes
---

# July 03, 2024 — Data pipelines: Support for Apache Iceberg™ tables with dynamic tables and streams –— *Preview*

We are pleased to announce the preview of the following two new capabilities for dynamic tables with
Snowflake-managed Apache Iceberg™ tables:

* Creating a dynamic table that reads from a Snowflake-managed Iceberg table as the source, just like
  regular tables.
* Creating a dynamic Iceberg table, a new dynamic table type that stores query results as a
  Snowflake-managed Iceberg table.

Additionally, this release supports using streams on Snowflake-managed and externally managed Iceberg tables.

For more information, see [Create dynamic Apache Iceberg™ tables](../../../user-guide/dynamic-tables-create-iceberg.md) and
[Introduction to streams](../../../user-guide/streams-intro.md).

---
title: July 03, 2024 — External network access in Streamlit in Snowflake –— General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-03-sis.md
section: Release Notes
---

# July 03, 2024 — External network access in Streamlit in Snowflake –— *General Availability*

With this release, we are pleased to announce the general availability of external network access in Streamlit in Snowflake.

You can now create secure access to specific network locations external to Snowflake, and you can use that access
from within the Streamlit app code. You can enable access through an external access integration.

For more information, see [External network access in Streamlit in Snowflake](../../../developer-guide/streamlit/features/external-access.md).

---
title: July 05-06, 2023 — 7.22 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_22.md
section: Release Notes
---

# July 05-06, 2023 — 7.22 Release Notes

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Deleting an Account (Self-service) — *Preview*

With this release, we are pleased to announce the preview of self-service account deletion. An organization administrator can now delete an account without contacting Snowflake Support.

An organization administrator starts the process of deleting an account by dropping it. Once dropped, the account enters a grace period
during which the account can be restored (“undropped”). Snowflake automatically deletes the account when the grace period expires.

To support the process for deleting an account, this release also introduces the preview of a new syntax for the SHOW ORGANIZATION ACCOUNTS command. When the HISTORY keyword is appended to the command, the output contains dropped accounts along with additional columns such as
scheduled deletion time.

For more information, see [Dropping an account](../../user-guide/organizations-manage-accounts-delete.md).

### Organization Usage: New REPLICATION_GROUP_USAGE_HISTORY View

With this release, we are pleased to announce the REPLICATION_GROUP_USAGE_HISTORY view in the Organization Usage schema. The
REPLICATION_GROUP_USAGE_HISTORY view allows an organization administrator to obtain details about the replication usage in an organization.

For more information, see [REPLICATION_GROUP_USAGE_HISTORY view](../../sql-reference/organization-usage/replication_group_usage_history.md).

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Context Functions (Session) | [CURRENT_ORGANIZATION_NAME](../../sql-reference/functions/current_organization_name.md) | Returns the name of the organization to which the current account belongs. |

### GROUP BY: New ALL Keyword

The [GROUP BY](../../sql-reference/constructs/group-by.md) clause now supports the ALL keyword, which specifies that all expressions in the SELECT list that do
not use aggregate functions should be used for grouping.

For example, the following two statements yield the same result:

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY state, city;
```

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY ALL;
```

---
title: July 08-12, 2024  — 8.25 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_25.md
section: Release Notes
---

# July 08-12, 2024 — 8.25 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Wildcards are now supported in OBJECT constants

You can now use wildcards in OBJECT constants. When you construct OBJECT data, you can use a wildcard by specifying `{*}`. The SQL statement constructs the OBJECT value from the specified data using the attribute names as keys and the associated values as values. Specifying `{*}` is equivalent to specifying `OBJECT_CONSTRUCT(*)`.

For example, the following SQL statement uses a wildcard to construct OBJECT data:

```sqlexample
SELECT {*} FROM my_table;
```

For more information, see [OBJECT constants](../../sql-reference/data-types-semistructured.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 05-Jul-24 |
| *New TYPE property for USER* | **Removed** from *SQL updates* section | 09-Jul-24 |

---
title: July 10-12, 2023 — 7.23 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2023/7_23.md
section: Release Notes
---

# July 10-12, 2023 — 7.23 Release Notes (with behavior changes)

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## Behavior Changes Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2023_05](../bcr-bundles/2023_05_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_04](../bcr-bundles/2023_04_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_03](../bcr-bundles/2023_03_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for August; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New Features

### Schema Detection and Evolution for Kafka Connector With Snowpipe Streaming — *Preview*

With this release, we are pleased to announce that the Kafka connector with Snowpipe Streaming now supports schema detection and evolution.
The structure of tables in Snowflake can be defined and evolved automatically to support the structure of new Snowpipe streaming data loaded
by the Kafka connector.

To use this feature, you need to enable the [behavior changes in bundle 2023_05](../bcr-bundles/2023_05_bundle.md).

For more information, see [Schema detection and evolution for Kafka connector with Snowpipe Streaming classic](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md).

## SQL Updates

### SYSTEM$CLUSTERING_INFORMATION Returns Error Messages

With this release, we are pleased to announce that the SYSTEM$CLUSTERING_INFORMATION function now returns recent errors associated with
Automatic Clustering. These errors, returned as JSON objects in an array, explain why Automatic Clustering could not recluster data. By
default, the 10 most recent errors are returned by the function. To allow users to return more or fewer messages, the
SYSTEM$CLUSTERING_INFORMATION function now accepts a number as its second argument. This number specifies how many errors should be
returned.

For more information, see [SYSTEM$CLUSTERING_INFORMATION](../../sql-reference/functions/system_clustering_information.md).

## Web Interface Updates

### Snowsight Set as Default Web Interface

With this release, [behavior changes in bundle 2023_04](../bcr-bundles/2023_04_bundle.md) are enabled by default. As a result,
all customers of Snowflake On Demand have Snowsight set as the default web interface for all users in the account and new users of Snowflake
have Snowsight set as their default web interface.

For more information, see [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

---
title: July 11, 2024 — Snowflake connectors
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-11.md
section: Release Notes
---

# July 11, 2024 — Snowflake connectors

## Snowflake Connector for PostgreSQL

With this release, we are pleased to announce the preview of the Snowflake Connector for PostgreSQL.

The Snowflake Connector for PostgreSQL allows you to:

* Load data into Snowflake from a PostgreSQL database.
* Configure replication so that changes in your PostgreSQL database are replicated to Snowflake.

For more details, see [Snowflake connector for PostgreSQL](https://other-docs.snowflake.com/en/connectors/postgres6/about).

See also [Snowflake Connector for PostgreSQL release notes](../../connectors/postgres6.md).

## Snowflake Connector for MySQL

With this release, we are pleased to announce the preview of the Snowflake Connector for MySQL.

The Snowflake Connector for MySQL allows you to:

* Load data into Snowflake from a MySQL database.
* Configure replication so that changes in your MySQL database are replicated to Snowflake.

For more details, see [Snowflake connector for MySQL](https://other-docs.snowflake.com/en/connectors/mysql6/about).

See also [Snowflake Connector for MySQL release notes](../../connectors/mysql6.md).

---
title: July 11, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-11-dcr.md
section: Release Notes
---

# July 11, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Sequenced template execution

A provider can use a *template chain* to define a sequence of templates to be executed in a particular order. The results of one template
within the sequence can be used as input for a subsequent template in the chain. A clean room user executes a template chain to perform an
analysis that runs the templates in their predefined order.

For more information, see [Using the developer APIs to execute templates sequentially](../../../user-guide/cleanrooms/developer-template-chains.md).

## Multi-factor authentication

For increased security, users are required to use multi-factor authentication (MFA) when signing in to the web app. Existing
Snowflake Data Clean Room customers will be sent an email explaining how to enable MFA.

For information about enabling and authenticating with MFA, see [Sign in to the clean rooms UI](../../../user-guide/cleanrooms/v1/web-app-introduction.md).

## Register objects in a managed access schema

Collaborators can now register individual tables and views in a managed access schema (that is, a schema created with the WITH MANAGE
ACCESS clause) without registering other objects in the schema. Objects must be registered before they can be linked into a clean room.

For more information, see [Registering data](../../../user-guide/cleanrooms/register-data.md).

## Support for additional region

Snowflake Data Clean Rooms are now available in the following region:

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Amazon Web Service | EU (Zurich) | eu-central-2 |

## Single-party SQL query

Consumers can use the SQL Query template to run analyses without adding tables or defining joins when installing the clean room, which lets
a provider share a clean room designed for SQL queries against the provider’s data only.

---
title: July 15, 2024 — Snowflake Copilot — Generally available
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-15-snowflake-copilot-ga.md
section: Release Notes
---

# July 15, 2024 — Snowflake Copilot — *Generally available*

We are pleased to announce the general availability of Snowflake Copilot.

Snowflake Copilot is an LLM-powered assistant that simplifies data analysis while maintaining robust data governance and seamlessly
integrates into your existing Snowflake workflow. You can ask open-ended questions about your data structure, send follow-up
inquiries, or even use it to refine and improve your own SQL queries.

> **Note:**
>
> Support for this feature is available to accounts in the following regions:
>
> * AWS us-east-1
> * AWS us-west-2
> * AWS eu-central-1

For more details, see [Using Snowflake Copilot](../../../user-guide/snowflake-copilot.md).

---
title: July 15-17, 2024 — 8.26 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_26.md
section: Release Notes
---

# July 15-17, 2024 — 8.26 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Estimate the cost of Automatic Clustering — *Preview*

With this release, we are pleased to announce a new system function, SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS, that estimates
the cost of enabling Automatic Clustering for a table and maintaining the table in a well-clustered state. It can also estimate
the cost of changing the clustering key for a clustered table.

For more information, see [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](../../sql-reference/functions/system_estimate_automatic_clustering_costs.md).

## SQL updates

### New TYPE property for USER —— *General availability*

With this release, we are pleased to announce the new TYPE property for USER. The TYPE property lets you differentiate between
service and human users.

For more information, see [CREATE USER](../../sql-reference/sql/create-user.md).

## Extensibility updates

### Access to external network locations on AWS in the Gov region — *General availability*

With this release, Snowflake is pleased to announce the general availability of access to external network locations from function
and procedure handlers for code deployed in the AWS Gov region. External network access in the Azure Gov region is not supported.

When setting up external network access, you create a network rule that represents the external network location. If your handler
code will need to authenticate with the external location, you create a secret containing the credentials needed. In handler code,
you can use APIs to retrieve credential values from the secret.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

## Cost management updates

### Support for Snowpark Container Services in custom budgets — *General availability*

With this release, we are pleased to announce that [custom budgets](../../user-guide/budgets.md) can now be used to monitor the cost of
compute pools associated with [Snowpark Container Services](../../developer-guide/snowpark-container-services/overview.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview)  Note that, at the time of publication, the 8.25 release was still in progress. | 12-Jul-24 |
| *Support for Snowpark Container Services in custom budgets — General availability* | **Added** to *Cost Management Updates* section | 16-Jul-24 |

---
title: July 17, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-07-17.md
section: Release Notes
---

# July 17, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced
in this update to Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Network rules and network policies –— *Generally available*

With this release, we are pleased to announce the general availability of network rules and network policies in Snowsight. Since
the preview release in Jan 2024, we streamlined the user experience to simplify managing network access to your Snowflake account.

For more details, see [Network rules](../../../user-guide/network-rules.md) and [Controlling network traffic with network policies](../../../user-guide/network-policies.md).

---
title: July 18, 2024 —  Snowflake Native App Framework - Support for shared external table and Apache Iceberg™ tables — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-18-native-app-external-table.md
section: Release Notes
---

# July 18, 2024 — Snowflake Native App Framework - Support for shared external table and Apache Iceberg™ tables — *Preview*

We are pleased to announce the preview support for shared external and Apache Iceberg™ tables
in a Snowflake Native App. This feature allows providers to share
[external tables](../../../user-guide/tables-external-intro.md) and
[iceberg tables](../../../user-guide/tables-iceberg.md) with consumers.

For information on how providers can share an external or Iceberg table in an app, see
[Support for external and Iceberg tables](../../../developer-guide/native-apps/preparing-data-content.md).

---
title: July 19, 2024 — CORTEX_FUNCTIONS_USAGE_HISTORY view — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-19-cortex-functions-usage-history.md
section: Release Notes
---

# July 19, 2024 — CORTEX_FUNCTIONS_USAGE_HISTORY view — *General Availability*

With this release, we are pleased to announce the general availability of the CORTEX_FUNCTIONS_USAGE_HISTORY view in the
Account Usage schema, giving
you the ability to query the usage history for the [Cortex AI Large Language Model (LLM) Functions](../../../user-guide/snowflake-cortex/aisql.md).

This new view allows
you to track usage for all Cortex AI LLM Functions, individual models like `mistral-large` or `Llama3-70b` used in the
COMPLETE function, and task-specific functions like TRANSLATE or SUMMARIZE.

For more information, see [CORTEX_FUNCTIONS_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_functions_usage_history.md).

---
title: July 19-20, 2023 — 7.24 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_24.md
section: Release Notes
---

# July 19-20, 2023 — 7.24 Release Notes

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### SQL Syntax for Enabling the ORGADMIN Role — *Preview*

With this release, we are pleased to announce the preview of a new ALTER ACCOUNT … SET IS_ORG_ADMIN syntax that allows an organization administrator to enable the ORGADMIN role within a specific account, without contacting Snowflake Support.

Once the ORGADMIN role is enabled for an account, organization administrators can log in to the account and use the role to perform organization-focused tasks like listing and creating accounts. Enabling the ORGADMIN role in an account also allows queries to access data
in the ORGANIZATION_USAGE schema.

For more information, see [Enabling the ORGADMIN role in an account](../../user-guide/organization-administrators.md).

---
title: July 2023
source: https://docs.snowflake.com/en/release-notes/2023-07.md
section: Release Notes
---

# July 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Snowpipe Streaming — *General Availability*

With this release, Snowflake is pleased to announce the general availability of Snowpipe Streaming, the latest addition to Snowflake ingestion offerings. The Snowpipe Streaming API writes rows of data directly to Snowflake tables without the requirement of staging files. This architecture results in lower load latencies with corresponding lower costs for loading any volume of data, which makes it a powerful tool for handling near real-time data streams.

Snowpipe Streaming is also available for the Snowflake Connector for Kafka, which offers an easy upgrade path to take advantage of the lower latency and lower cost loads.

For more information, see [Snowpipe Streaming](../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md) and [Snowflake Connector for Kafka with Snowpipe Streaming classic](../user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka.md).

### Organization Usage: New QUERY_ACCELERATION_HISTORY View

With this release, we are pleased to announce the QUERY_ACCELERATION_HISTORY view in the Organization Usage schema of the shared SNOWFLAKE
database. This view returns the query acceleration usage for warehouses across the accounts in your organization.

For more information, see [QUERY_ACCELERATION_HISTORY view](../sql-reference/organization-usage/query_acceleration_history.md).

### SQL Syntax for Enabling the ORGADMIN Role — *Preview*

With this release, we are pleased to announce the preview of a new ALTER ACCOUNT … SET IS_ORG_ADMIN syntax that allows an organization administrator to enable the ORGADMIN role within a specific account, without contacting Snowflake Support.

Once the ORGADMIN role is enabled for an account, organization administrators can log in to the account and use the role to perform organization-focused tasks like listing and creating accounts. Enabling the ORGADMIN role in an account also allows queries to access data
in the ORGANIZATION_USAGE schema.

For more information, see [Enabling the ORGADMIN role in an account](../user-guide/organization-administrators.md).

### Schema Detection and Evolution for Kafka Connector With Snowpipe Streaming — *Preview*

With this release, we are pleased to announce that the Kafka connector with Snowpipe Streaming now supports schema detection and evolution.
The structure of tables in Snowflake can be defined and evolved automatically to support the structure of new Snowpipe streaming data loaded
by the Kafka connector.

To use this feature, you need to enable the [behavior changes in bundle 2023_05](bcr-bundles/2023_05_bundle.md).

For more information, see [Schema detection and evolution for Kafka connector with Snowpipe Streaming classic](../user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md).

### Deleting an Account (Self-service) — *Preview*

With this release, we are pleased to announce the preview of self-service account deletion. An organization administrator can now delete an account without contacting Snowflake Support.

An organization administrator starts the process of deleting an account by dropping it. Once dropped, the account enters a grace period
during which the account can be restored (“undropped”). Snowflake automatically deletes the account when the grace period expires.

To support the process for deleting an account, this release also introduces the preview of a new syntax for the SHOW ORGANIZATION ACCOUNTS command. When the HISTORY keyword is appended to the command, the output contains dropped accounts along with additional columns such as
scheduled deletion time.

For more information, see [Dropping an account](../user-guide/organizations-manage-accounts-delete.md).

### Organization Usage: New REPLICATION_GROUP_USAGE_HISTORY View

With this release, we are pleased to announce the REPLICATION_GROUP_USAGE_HISTORY view in the Organization Usage schema. The
REPLICATION_GROUP_USAGE_HISTORY view allows an organization administrator to obtain details about the replication usage in an organization.

For more information, see [REPLICATION_GROUP_USAGE_HISTORY view](../sql-reference/organization-usage/replication_group_usage_history.md).

## SQL Updates

### Snowflake Alerts: Support for Future Grants and Object Tagging

With this release, Snowflake alerts now support future grants and object tagging.

* You can use the FUTURE keyword in the [GRANT <privileges> … TO ROLE](../sql-reference/sql/grant-privilege.md) command to
  [define an initial set of privileges that should be granted](../user-guide/security-access-control-configure.md)
  on new alerts created in a specified database or schema.
* You can use the [CREATE ALERT](../sql-reference/sql/create-alert.md) and [ALTER ALERT](../sql-reference/sql/alter-alert.md) commands to
  [assign tags](../user-guide/object-tagging/introduction.md) to Snowflake alerts.
* In the CREATE ALERT command, you can use WITH TAG or TAG to assign a tag to a newly created alert.
* In the ALTER ALERT command, you can use SET TAG or UNSET TAG to assign or remove a tag from an existing alert.

### Search Optimization: Support for Substring Search in Semi-Structured Data — *Preview*

With this release, we are pleased to announce the preview of [Search Optimization](../user-guide/search-optimization-service.md) support
for substring and regular expression search in [semi-structured data](../sql-reference/data-types-semistructured.md), including ARRAY,
OBJECT, and VARIANT columns. Previously, only equality searches on such columns could be optimized.

Substring queries include predicates that use the following keywords:

* LIKE, ILIKE, LIKE ANY, LIKE ALL, ILIKE ANY
* STARTSWITH, ENDSWITH, CONTAINS
* RLIKE, REGEXP, REXEP_LIKE
* SPLIT_PART

To enable search optimization of substring searches on semi-structured columns, use an
[ALTER TABLE … ADD SEARCH OPTIMIZATION](../sql-reference/sql/alter-table.md) command like one of those below.

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column);
```

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column:field);
```

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column:field.nested_field);
```

The second and third commands illustrate enabling search optimization for a field within a column. Field names must be separated from the
column name with a colon. Nested fields may be specified by including additional field names separated by periods, as shown in the third
example.

For more information on this search optimization improvement, including its capabilities and limitations, see [Search Optimization -
Substring Search in VARIANT Types](../user-guide/search-optimization/semi-structured-queries.md).

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Context Functions (Session) | [CURRENT_ORGANIZATION_NAME](../sql-reference/functions/current_organization_name.md) | Returns the name of the organization to which the current account belongs. |

### SYSTEM$CLUSTERING_INFORMATION Returns Error Messages

With this release, we are pleased to announce that the SYSTEM$CLUSTERING_INFORMATION function now returns recent errors associated with
Automatic Clustering. These errors, returned as JSON objects in an array, explain why Automatic Clustering could not recluster data. By
default, the 10 most recent errors are returned by the function. To allow users to return more or fewer messages, the
SYSTEM$CLUSTERING_INFORMATION function now accepts a number as its second argument. This number specifies how many errors should be
returned.

For more information, see [SYSTEM$CLUSTERING_INFORMATION](../sql-reference/functions/system_clustering_information.md).

### GROUP BY: New ALL Keyword

The [GROUP BY](../sql-reference/constructs/group-by.md) clause now supports the ALL keyword, which specifies that all expressions in the SELECT list that do
not use aggregate functions should be used for grouping.

For example, the following two statements yield the same result:

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY state, city;
```

```sqlexample
SELECT state, city, SUM(retail_price * quantity) AS gross_revenue
  FROM sales
  GROUP BY ALL;
```

## Data Governance Updates

### Access History: Track Masking & Row Access Policy References — *General Availability*

With this release, we are pleased to announce the general availability of the `policies_referenced` column in the Account Usage
ACCESS_HISTORY view. This column allows for the monitoring of queries on a table or view protected by a row access policy and a column
protected by a masking policy and the enforced masking and row access policies. The column includes support for intermediate objects and
columns that are policy protected. Audits on policy protected objects and columns are easier because auditors have a more unified view of
how protected data is referenced without having to do complex joins on multiple Account Usage views. This column was introduced in preview
in [February 2023](2023-02.md).

For details, see [Access History](../user-guide/access-history.md) and the [ACCESS_HISTORY view](../sql-reference/account-usage/access_history.md).

## Web Interface Updates

### Create Named Stages using Snowsight — *General Availability*

With this release, we are pleased to announce the general availability of creating and editing named stages using Snowsight without writing
SQL.

To create or edit named stages, you can enter details into Snowsight including information about authentication or encryption for the stage.

For more information, see [Staging files using Snowsight](../user-guide/data-load-local-file-system-stage-ui.md).

### Create Named Stages using Snowsight — *General Availability*

With this release, we are pleased to announce the general availability of creating and editing named stages using Snowsight without writing SQL.

To create or edit named stages, you can enter details into Snowsight including information about authentication or encryption for the stage.

For more information, see [Staging files using Snowsight](../user-guide/data-load-local-file-system-stage-ui.md).

### Snowsight Set as Default Web Interface

With this release, [behavior changes in bundle 2023_04](bcr-bundles/2023_04_bundle.md) are enabled by default. As a result,
all customers of Snowflake On Demand have Snowsight set as the default web interface for all users in the account and new users of Snowflake
have Snowsight set as their default web interface.

For more information, see [Snowsight: The Snowflake web interface](../user-guide/ui-snowsight.md).

---
title: July 22-25, 2024 — 8.27 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_27.md
section: Release Notes
---

# July 22-25, 2024 — 8.27 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_06](../bcr-bundles/2024_06_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_05](../bcr-bundles/2024_05_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_04](../bcr-bundles/2024_04_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for August 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New features

### Support for sending webhook notifications to Slack, Microsoft Teams, and PagerDuty

With this release, we are pleased to announce support for sending webhook notifications to the following systems:

* Slack
* Microsoft Teams
* PagerDuty

> **Note:**
>
> Snowflake does not send webhook notifications to external systems other than the ones listed above.

For more information, see [Sending webhook notifications](../../user-guide/notifications/webhook-notifications.md).

### Triggered tasks — *General availability*

With this release, we are pleased to announce the general availability of triggered tasks.

With triggered tasks, your tasks only run when the related stream has new data. This simplifies a common customer use case for frequently polling a source with unpredictable availability of new data, and reduces latency by immediately processing when there is new data.

For more information, see [Triggered tasks](../../user-guide/tasks-triggered.md).

## SQL updates

### GET_DDL function: Support for warehouses

With this release, we are pleased to announce support in the GET_DDL function for warehouses. You can call the GET_DDL function to get the DDL statement for recreating a warehouse.

For more information, see [GET_DDL](../../sql-reference/functions/get_ddl.md).

## Data governance updates

### Custom Data Classification — *General availability*

With this release, we are pleased to announce the general availability of Custom Classification.

Snowflake provides the CUSTOM_CLASSIFIER class in the SNOWFLAKE.DATA_PRIVACY schema to enable data engineers to extend Snowflake’s data classification capabilities based on their own knowledge of the data. You can define your own semantic category, specify the privacy category, and specify regular expressions along with a threshold to match column value patterns while optionally matching the column name.

For more information, see [Create custom categories for sensitive data](../../user-guide/classify-custom.md).

### Data Classification of tables in a schema with Snowsight — *General availability*

With this release, we are pleased to announce the general availability of using Snowsight to classify tables in a schema. You can also select custom classifiers (instances) that you can access to classify your data and determine whether to automatically assign tags to columns after classifying data.

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 19-Jul-24 |
| *Asynchronous Data Classification* | **Removed** from *Data Governance* section | 24-Jul-24 |

---
title: July 23, 2024 — Managing Listings using SQL — Generally Available
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-23-pl.md
section: Release Notes
---

# July 23, 2024 — Managing Listings using SQL — **Generally Available**

Snowflake is pleased to announce the general availability of Managing listings using SQL.

You can now [create](https://other-docs.snowflake.com/en/sql-reference/sql/create-listing),
[alter](https://other-docs.snowflake.com/en/sql-reference/sql/alter-listing),
[describe](https://other-docs.snowflake.com/en/sql-reference/sql/desc-listing),
[show](https://other-docs.snowflake.com/en/sql-reference/sql/show-listings), and
[drop](https://other-docs.snowflake.com/en/sql-reference/sql/drop-listing) the contents of a listing using SQL commands.

For details, see [About managing listings using SQL](../../../progaccess/listing-progaccess-about.md).

> **Note:**
>
> You cannot use SQL commands to offer paid, personalized listings, or listings on private data exchanges.

For a complete set of limitations and restrictions see [About managing listings using SQL](../../../progaccess/listing-progaccess-about.md)

---
title: July 23, 2024 — New Meta AI models available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-23-new-llm-models.md
section: Release Notes
---

# July 23, 2024 — New Meta AI models available in Snowflake Cortex AI

We’re pleased to announce that the Llama 3.1 collection of multilingual large language models (LLMs) are now available in
[Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/), providing enterprises with secure, serverless access
to Meta’s most advanced open source models.

* `llama3.1-405b`, the largest openly available model.
* 128K context length, improved reasoning & coding capabilities.
* Improved and upgraded multilingual `llama3.1-8b` and `llama3.1-70b` models.

For details, see the [Snowflake Cortex LLM Functions](../../../user-guide/snowflake-cortex/aisql.md) topic.

---
title: July 24, 2024 — Cortex Guard for Snowflake Cortex AI — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-24-cortex-llm-updates.md
section: Release Notes
---

# July 24, 2024 — Cortex Guard for Snowflake Cortex AI — *General Availability*

We’re happy to announce the general availability of Cortex Guard for Snowflake Cortex AI, a new feature that enables enterprises to easily
implement safeguards that filter out potentially inappropriate or unsafe large language model (LLM) responses. Cortex Guard introduces a
new safety filter you can specify in a call to the [COMPLETE](../../../sql-reference/functions/complete-snowflake-cortex.md) function so that
language model responses associated with harmful content - such as violent crimes, hate, sexual content, self-harm and more - are
automatically filtered out. We are excited to introduce Cortex Guard, the next iteration of our gen AI platform, to help enterprises
seamlessly introduce safety into the end-to-end gen AI application development process.

---
title: July 24, 2024 — Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-24-document-ai.md
section: Release Notes
---

# July 24, 2024 — Document AI release notes

With this release, we are pleased to announce deleting documents in Document AI.
You can now delete documents that were not used for training from a Document AI model build.

---
title: July 25, 2024 — Cortex Search — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-25-cortex-search-preview.md
section: Release Notes
---

# July 25, 2024 — Cortex Search — *Preview*

We are pleased to announce the preview of Cortex Search.

Cortex Search is an LLM-powered assistant that simplifies data analysis while maintaining robust data governance and seamlessly
integrating into your existing Snowflake workflow. You can ask open-ended questions about your data structure, send follow-up
inquiries, or even use it to refine and improve your own SQL queries.

> **Note:**
>
> With this preview release, Cortex Search will be made available to accounts in the following regions:
>
> * AWS us-east-1
> * AWS us-west-2

For more details, see [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

---
title: July 25, 2024 — New AI21 model available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-25-new-llm-model-jamba.md
section: Release Notes
---

# July 25, 2024 — New AI21 model available in Snowflake Cortex AI

We’re pleased to announce that AI21’s foundational model, `jamba-instruct`, is now available for serverless inference in
[Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/).

* The `jamba-Instruct` model is built by AI21 Labs to efficiently meet enterprise requirements. It is optimized to offer a 256k token
  context window with low cost and latency, making it ideal for tasks like summarization, Q&A, and entity extraction on lengthy documents
  and extensive knowledge bases.

For details, see [Snowflake Cortex LLM Functions](../../../user-guide/snowflake-cortex/aisql.md).

---
title: July 25, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-25-dcr.md
section: Release Notes
---

# July 25, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Acxiom Real ID integration

A new third-party connector integrates Acxiom Real ID, an identity solution, with the clean room environment, allowing multiple entities to
collaborate without needing to directly expose or share sensitive data.

For information about how an administrator configures the new connector so it becomes available in clean rooms, see
[Snowflake Data Clean Rooms: Identity and data provider connectors](../../../user-guide/cleanrooms/connector-identity.md).

## Using developer APIs for provider activation

Collaborators can now use the developer APIs to send a consumer’s analysis results back to the provider for activation. For more
information, see [Activating query results](../../../user-guide/cleanrooms/activation.md).

## User interface for custom templates enhancement

When creating a user interface for a custom template, the provider can now create a drop-down list of tables in the clean room without
having to individually specify them. In addition, the provider can create a drop-down list of all available columns that can be used in
filters and joins without specifying them individually.

For more information about these enhancements, see the `provider.add_ui_form_customizations` command in the
[Snowflake Data Clean Rooms: Provider API reference guide](../../../user-guide/cleanrooms/provider.md).

## SQL Query template enhancement

The SQL Query template now allows the consumer to select a warehouse when running an analysis.

---
title: July 25-26, 2023 — 7.25 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_25.md
section: Release Notes
---

# July 25-26, 2023 — 7.25 Release Notes

The following new features and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Organization Usage: New QUERY_ACCELERATION_HISTORY View

With this release, we are pleased to announce the QUERY_ACCELERATION_HISTORY view in the Organization Usage schema of the shared SNOWFLAKE
database. This view returns the query acceleration usage for warehouses across the accounts in your organization.

For more information, see [QUERY_ACCELERATION_HISTORY view](../../sql-reference/organization-usage/query_acceleration_history.md).

## SQL Updates

### Snowflake Alerts: Support for Future Grants and Object Tagging

With this release, Snowflake alerts now support future grants and object tagging.

* You can use the FUTURE keyword in the [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md) command to
  [define an initial set of privileges that should be granted](../../user-guide/security-access-control-configure.md)
  on new alerts created in a specified database or schema.
* You can use the [CREATE ALERT](../../sql-reference/sql/create-alert.md) and [ALTER ALERT](../../sql-reference/sql/alter-alert.md) commands to
  [assign tags](../../user-guide/object-tagging/introduction.md) to Snowflake alerts.
* In the CREATE ALERT command, you can use WITH TAG or TAG to assign a tag to a newly created alert.
* In the ALTER ALERT command, you can use SET TAG or UNSET TAG to assign or remove a tag from an existing alert.

### Search Optimization: Support for Substring Search in Semi-Structured Data — *Preview*

With this release, we are pleased to announce the preview of [Search Optimization](../../user-guide/search-optimization-service.md) support
for substring and regular expression search in [semi-structured data](../../sql-reference/data-types-semistructured.md), including ARRAY,
OBJECT, and VARIANT columns. Previously, only equality searches on such columns could be optimized.

Substring queries include predicates that use the following keywords:

* LIKE, ILIKE, LIKE ANY, LIKE ALL, ILIKE ANY
* STARTSWITH, ENDSWITH, CONTAINS
* RLIKE, REGEXP, REXEP_LIKE
* SPLIT_PART (in equality predicates)

To enable search optimization of substring searches on semi-structured columns, use an
[ALTER TABLE … ADD SEARCH OPTIMIZATION](../../sql-reference/sql/alter-table.md) command like one of those below.

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column);
```

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column:field);
```

```sqlexample
ALTER TABLE mytable ADD SEARCH OPTIMIZATION ON SUBSTRING(semi_structured_column:field.nested_field);
```

The second and third commands illustrate enabling search optimization for a field within a column. Field names must be separated from the
column name with a colon. Nested fields may be specified by including additional field names separated by periods, as shown in the third
example.

For more information on this search optimization improvement, including its capabilities and limitations, see [Search Optimization -
Substring Search in VARIANT Types](../../user-guide/search-optimization/semi-structured-queries.md).

## Data Governance Updates

### Access History: Track Masking & Row Access Policy References — *General Availability*

With this release, we are pleased to announce the general availability of the `policies_referenced` column in the Account Usage
ACCESS_HISTORY view. This column allows for the monitoring of queries on a table or view protected by a row access policy and a column
protected by a masking policy and the enforced masking and row access policies. The column includes support for intermediate objects and
columns that are policy protected. Audits on policy protected objects and columns are easier because auditors have a more unified view of
how protected data is referenced without having to do complex joins on multiple Account Usage views. This column was introduced in preview
in [February 2023](../2023-02.md).

For details, see [Access History](../../user-guide/access-history.md) and the [ACCESS_HISTORY view](../../sql-reference/account-usage/access_history.md).

## Web Interface Updates

### Create Named Stages using Snowsight — *General Availability*

With this release, we are pleased to announce the general availability of creating and editing named stages using Snowsight without writing
SQL.

To create or edit named stages, you can enter details into Snowsight including information about authentication or encryption for the stage.

For more information, see [Staging files using Snowsight](../../user-guide/data-load-local-file-system-stage-ui.md).

---
title: July 29, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-07-29-wh.md
section: Release Notes
---

# July 29, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Default warehouse for Notebooks workloads — *Public Preview*

With this release, we are pleased to announce a new default warehouse for Notebooks. SYSTEM$STREAMLIT_NOTEBOOK_WH is a
new, Snowflake-managed warehouse that is provisioned in each account for running Notebooks. This default warehouse helps in reducing cluster fragmentation and optimizing your overall costs on Notebooks.

ACCOUNTADMINS can grant or revoke USAGE privileges on this warehouse. By default, the PUBLIC role is granted USAGE privileges on this warehouse.

For more details, see [Overview of warehouses](../../../user-guide/warehouses-overview.md), [Warehouse considerations](../../../user-guide/warehouses-considerations.md), and [Set up Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-setup.md).

---
title: July 29-August 01, 2024 — 8.28 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_28.md
section: Release Notes
---

# July 29-August 01, 2024 — 8.28 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Semi-structured and structured | [ARRAYS_ZIP](../../sql-reference/functions/arrays_zip.md) | Returns an array of objects, each of which contains key-value pairs for an nth element in the input arrays. |

### CREATE and ALTER commands for replication and failover groups: Support added for tags

With this release, Snowflake adds support to set a tag on replication and failover groups as follows:

```sqlsyntax
ALTER { REPLICATION | FAILOVER } GROUP <name>
    SET TAG <tag_name> = '<tag_value>' [ , <tag_name>= '<tag_value>' … ]

ALTER { REPLICATION | FAILOVER } GROUP <name>
    UNSET TAG <tag_name> [ , <tag_name> … ]

CREATE [ OR REPLACE ] { REPLICATION | FAILOVER } GROUP <name>
    ...
    ...
    [ [ WITH ] TAG ( <tag_name> = '<tag_value>' [ , <tag_name> = '<tag_value>' , ... ] ) ]
```

Where `<tag_name>` and `<tag_value>` specify the tag name (i.e. the key) and the tag value to set/unset for the
replication or failover group:

* The maximum number of unique tag keys that can be set on an object is 20.
* The tag value is always a string, and the maximum number of characters for the tag value is 256.

> **Note:**
>
> * Tags are not set on any objects in the replication or failover group because these groups are not parents of other objects;
>   tag lineage is not applicable.
> * You cannot set tags on the secondary replication or failover group because these objects are read-only.
> * If tags are set on the primary replication or failover group, these tags are set on the secondary failover or replication group
>   when you refresh the secondary group.

For more information, see:

> * [ALTER FAILOVER GROUP](../../sql-reference/sql/alter-failover-group.md)
> * [ALTER REPLICATION GROUP](../../sql-reference/sql/alter-replication-group.md)
> * [CREATE FAILOVER GROUP](../../sql-reference/sql/create-failover-group.md)
> * [CREATE REPLICATION GROUP](../../sql-reference/sql/create-replication-group.md)

### Account Usage: New SEARCH_OPTIMIZATION_BENEFITS view

With this release, we are pleased to announce the new SEARCH_OPTIMIZATION_BENEFITS view in the ACCOUNT_USAGE schema.

This view provides information about the number of partitions pruned specifically due to search optimization. This view is
similar to the [TABLE_PRUNING_HISTORY](../../sql-reference/account-usage/table_pruning_history.md) view but provides information
about pruning due to search optimization.

For more information, see [SEARCH_OPTIMIZATION_BENEFITS view](../../sql-reference/account-usage/search_optimization_benefits.md).

## Data governance updates

### Object Tagging: Support added for replication and failover groups

With this release, Snowflake is pleased to announce that you can set tags on replication and failover groups.

For more information, see CREATE and ALTER commands for replication and failover groups: Support added for tags (in this topic).

### Data Quality and data metric functions (DMFs) — *General Availability*

With this release, Snowflake is pleased to announce the general availability of Data Quality Monitoring with data metric functions (DMFs).
Data Quality Monitoring uses DMFs to continuously monitor data quality metrics such as completeness, accuracy, uniqueness, and validity.
You can use Snowflake provided system DMFs for common metrics such as row count, duplicates, and freshness. Alternatively, you can create
your own custom DMFs to define metrics that are specific to your own data.

You can either use the DMF in a query to test the quality of data in your pipeline or associate the DMF to desired tables to continuously
monitor its quality. The continuous monitoring can either be schedule-based for periodic measurement or trigger-based to measure only
when the underlying table is modified.

Since announcing the preview availability in [March](other/2024-03-29-dmf.md), we’ve made the following updates:

* New schema privilege: CREATE DATA METRIC FUNCTION. This is a change from the preview where you needed to use the CREATE FUNCTION
  privilege.

  Now, your role must have the CREATE DATA METRIC FUNCTION privilege to create a DMF.
* New table function: [DATA_QUALITY_MONITORING_RESULTS](../../sql-reference/functions/data_quality_monitoring_results.md)
* Access control for the new table function.
* Support added for new kinds of tables: dynamic table, materialized view, Apache Iceberg™ table, external table, event table, temporary table,
  and transient table.
* Number of DMF associations increased to 10,000 per account.
* System DMFs for statistics, which was announced in [June](8_23.md).

For more information, see [Introduction to data quality checks](../../user-guide/data-quality-intro.md).

## Data loading/unloading updates

### Snowpipe: New output in SYSTEM$PIPE_STATUS

With this release, the output of the PIPE_STATUS system function includes a new field, `loadHistoryRemainingEntriesToSync`. When a pipe fails over, load history entries might continue to be replicated for the pipe, ensuring that changes from the last refresh operation are up to date. This new field can help you monitor the progress of load history replication for a pipe.

For more information, see [SYSTEM$PIPE_STATUS](../../sql-reference/functions/system_pipe_status.md).

## Data pipelines updates

### Dynamic tables: Support for incremental lateral flatten

With this release, you can now use lateral flatten with incremental refresh by setting the refresh mode to INCREMENTAL. Selecting the
flatten SEQ column from a lateral flatten join is not supported for incremental refresh.

For more information, see [Supported queries in incremental and full refresh modes](../../user-guide/dynamic-tables-supported-queries.md).

## Data lake updates

### Apache Iceberg™ tables: Support for Snowflake Open Catalog — *Preview*

With this release, Snowflake is pleased to announce the preview of support for integrating Apache Iceberg™ tables
in Snowflake with [Snowflake Open Catalog](https://other-docs.snowflake.com/en/opencatalog/overview).

Using a catalog integration configure for Open Catalog, you can do the following:

* Query a table in Open Catalog using Snowflake.
* Sync a Snowflake-managed Iceberg table with Open Catalog.

For more information, see [Use Apache Iceberg™ tables with Snowflake Open Catalog in Snowflake](../../user-guide/tables-iceberg-open-catalog.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-Jul-24 |
| *Snowpipe: New output in SYSTEM$PIPE_STATUS* | **Added** to *Data loading / unloading updates* section | 30-Jul-24 |
| *Iceberg tables: Support for Snowflake Open Catalog* | **Added** to *Data lake updates* section | 31-Jul-24 |

---
title: July 30, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-07-30.md
section: Release Notes
---

# July 30, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Cortex Fine-tuning –— *Preview*

With this release, we are pleased to announce the preview of the Snowflake Cortex Fine-tuning function in Snowsight.
Snowflake Cortex Fine-tuning offers a way to customize large language models for your specific task. Now you can fine-tune models through
Snowsight without writing any SQL.

For more details, see [Fine-tuning (Snowflake Cortex)](../../../user-guide/snowflake-cortex/cortex-finetuning.md).

---
title: July 31, 2024 — Context functions and row access policies in Streamlit in Snowflake –— General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-31-sis.md
section: Release Notes
---

# July 31, 2024 — Context functions and row access policies in Streamlit in Snowflake –— *General Availability*

With this release, we are pleased to announce the general availability of context functions and row access policies in Streamlit in Snowflake.

For more information, see [Context functions and row access policies in Streamlit in Snowflake](../../../developer-guide/streamlit/features/row-access.md).

---
title: July 31, 2024 — Snowflake VS Code Extension Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-07-31.md
section: Release Notes
---

# July 31, 2024 — Snowflake VS Code Extension Release Notes

With this release, we are pleased to announce the availability of the following new features in this update to the Snowflake
VS Code extension.

## Edit Snowflake `connections.toml` files

You can add and modify connection definitions in Snowflake `connections.toml` configuration files.

For more information, see [Edit the Snowflake connections.toml file](../../../user-guide/vscode-ext.md).

## Work with the Snowflake Native App Framework

You can now use the VS Code extension to create and manage a Snowflake Native App.

For more information, see [Work with the Snowflake Native App Framework](../../../user-guide/vscode-ext.md).

---
title: Jun 01, 2025: Snowsight templates in trial accounts (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-01-snowsight-templates.md
section: Release Notes
---

# Jun 01, 2025: Snowsight templates in trial accounts (*General availability*)

Snowsight templates provide trial users with a series of interactive introductions where they can discover and test Snowflake features and use cases.
Templates can be worksheets, notebooks or Streamlit apps and are pre-configured with sample data to get up and running quickly.

For details, see [Snowsight templates](../../../user-guide/ui-snowsight/snowsight-templates.md).

---
title: Jun 02, 2025: AI_CLASSIFY supports up to 500 labels and multi-label classification
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-02-ai-classify-label-increase.md
section: Release Notes
---

# Jun 02, 2025: AI_CLASSIFY supports up to 500 labels and multi-label classification

Use AI_CLASSIFY to classify text into multiple categories and define up to 500 labels.

Instead of using the SNOWFLAKE.CORTEX.CLASSIFY_TEXT function, which is limited to a single label and 100 categories, you can use AI_CLASSIFY instead.

For more information about AI_CLASSIFY, see [AI_CLASSIFY](../../../sql-reference/functions/ai_classify.md).

---
title: Jun 02, 2025: Snowflake Cortex AI Functions (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-02-cortex-aisql-public-preview.md
section: Release Notes
---

# Jun 02, 2025: Snowflake Cortex AI Functions (*Preview*)

## AI capability meets SQL operators across multimodal data

Snowflake announces the preview of Cortex AI Functions, bringing powerful AI capabilities directly into Snowflake’s SQL engine.
AI Functions help you build scalable AI pipelines across multimodal enterprise data with familiar SQL
commands. You can now process text and images faster and more cost effectively while gaining deeper insights from
structured and unstructured data.

The available Cortex AI Functions include:

* [AI_FILTER](../../../sql-reference/functions/ai_filter.md): Evaluates a plain-language yes-or-no question against text or
  image input, allowing you to filter results in SELECT, WHERE, and JOIN clauses using AI capabilities.
* [AI_CLASSIFY](../../../sql-reference/functions/ai_classify.md): Classifies a text or image input into a single or multiple
  user-defined categories based on plain-language category definitions.
* [AI_AGG](../../../sql-reference/functions/ai_agg.md): Aggregates a text column and returns insights across multiple rows
  based on a user-defined prompt. This function is not subject to context window limitations.
* [AI_SUMMARIZE_AGG](../../../sql-reference/functions/ai_summarize_agg.md): Aggregates a text column and returns a summary
  across multiple rows. This function is not subject to context window limitations.
* [AI_SIMILARITY](../../../sql-reference/functions/ai_similarity.md): Calculates the embedding similarity between two inputs
  without needing to explicitly create the embedding vectors.
* [AI_COMPLETE](../../../sql-reference/functions/ai_complete.md): Generates a completion for a given text string or image
  using one of several available LLMs. Use this function for generative AI tasks that aren’t covered by other functions.

Key benefits of Cortex AI Functions include:

* **Expressive and Composable AI Operators**: A new suite of AI-powered operators integrates seamlessly with familiar
  SQL primitives like FILTER and AGGREGATE, enabling more intuitive and powerful data manipulation.
* **Simplified AI Pipelines**: Build advanced, multi-step AI pipelines with greater ease and efficiency using standard
  SQL.
* **Unified Analytics for All Data**: Run analytics across structured and unstructured data within the same SQL query,
  breaking down data silos.
* **Native Multimodal Data Support**: Cortex AI Functions are designed to work fluidly across diverse modalities,
  eliminating the need for separate processing systems for text and image data.
* **Performance Enhancements**: Improved query engine performance and scalability within Snowflake.

To get started, see [Cortex AI Functions](../../../user-guide/snowflake-cortex/aisql.md).

---
title: Jun 03, 2025: Snowflake Copilot inline (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-03-copilot-inline.md
section: Release Notes
---

# Jun 03, 2025: Snowflake Copilot inline (*Preview*)

Snowflake announces the preview of Snowflake Copilot inline, an expansion of the existing Snowflake Copilot experience that gives you the ability to query Snowflake Copilot from within your SQL code.

Snowflake Copilot inline supports the following use cases:

* **Explore your data** by asking open-ended questions to learn about the structure and nuances of a new dataset.
* **Generate SQL queries** with questions in natural language.
* **Improve your queries** by asking Snowflake Copilot to help you assess query efficiency, find optimizations, or explain what the query does.
* **Fix syntax errors** by asking Snowflake Copilot to fix your query.

To get started with Snowflake Copilot inline, see [Using Snowflake Copilot inline](../../../user-guide/snowflake-copilot-inline.md).

---
title: Jun 03, 2025: Workspaces in Snowsight (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-03-workspaces.md
section: Release Notes
---

# Jun 03, 2025: Workspaces in Snowsight (*Preview*)

As part of this release, Snowflake introduces Workspaces, a new feature in Snowsight. Workspaces provides a unified editor for creating,
organizing, and managing code across multiple file types, that you can use to analyze data, develop models, and build pipelines.

Workspaces are private to you and offer a development environment where you can build, experiment, and test your work. All content in
Workspaces is file-based, allowing you to work on more complex projects and easily integrate with Git for version control, collaboration,
and alignment with your existing workflows.

For more information about Workspaces, see [Workspaces](../../../user-guide/ui-snowsight/workspaces.md).

---
title: Jun 11, 2024 — Sharing data in non-secure views –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-11-sharing-non-secure-views.md
section: Release Notes
---

# Jun 11, 2024 — Sharing data in non-secure views –— *Preview*

With this release, we are pleased to announce the preview of sharing data in non-secure views. If you need to take full advantage of the performance gains of query
optimizations on the views that you share, you can
create a share that lets you share non-secure views with other accounts.

For more information, see [Share data in non-secured views](../../../user-guide/data-sharing-views.md).

---
title: Jun 12, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-12-dcr.md
section: Release Notes
---

# Jun 12, 2025: Snowflake Data Clean Rooms updates

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

* Analysis name and schedule run edit location update: Users can now find the element to run or edit an analysis at the top of the
  Analyses & Queries page. In order to edit the analysis name on this page, simply hover over the name of the analysis and click the
  edit icon.

---
title: Jun 16, 2025: Cost Management release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-16-budget.md
section: Release Notes
---

# Jun 16, 2025: Cost Management release notes

## Budgets: Using tags to add objects

You can now add a tag to a custom budget to specify which objects you want monitored by the budget. When you add a tag/value pair to the
budget, all objects that are assigned that pair are monitored by the budget.

Previously, an object could not be monitored by more than one custom budget. If you use tags to specify which objects you want monitored,
costs incurred by an object can count toward the spending limit of multiple custom budgets.

For more information, see [Using tags to monitor objects](../../../user-guide/budgets/custom-budget.md).

---
title: Jun 18, 2025: Customized runtime environments in Warehouse notebooks (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-18-preconfigured-nb-wh-runtime.md
section: Release Notes
---

# Jun 18, 2025: Customized runtime environments in Warehouse notebooks (*Preview*)

In this preview, you can now update the Python version used in Warehouse notebooks by selecting a runtime in the Create Notebook or
Notebook Settings dialog. By default, notebooks continue to use Python 3.9 (via Warehouse Runtime 1.0), but you can switch to Python
3.10 at any time using Warehouse Runtime 2.0. Both runtimes are preconfigured and ready to use without additional setup.

For more information, see [Create a notebook](../../../user-guide/ui-snowsight/notebooks-create.md).

---
title: Jun 19, 2025: Integrations in Snowsight (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/ui/2025-06-19-ssint.md
section: Release Notes
---

# Jun 19, 2025: Integrations in Snowsight (*Preview*)

We are pleased to announce the preview of managing integrations in Snowsight .

With this release, using the Snowsight, administrators can create, view, and otherwise manage integrations.

For more information, see [Managing integrations in Snowsight](../../../user-guide/ui-snowsight-integrations.md).

---
title: Jun 19, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-19-dcr.md
section: Release Notes
---

# Jun 19, 2025: Snowflake Data Clean Rooms updates

**Clean rooms API version 8.5**

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

**Change in provider-run analysis flow:** Previously, a provider could call `provider.submit_analysis_request` before calling
`provider.mount_request_logs_for_all_consumers`, and the request would be queued until the mount was complete. As of this release, a
provider must mount the request logs before you can submit an analysis request. Provider-run analysis will run faster with this update.
[Learn more.](../../../user-guide/cleanrooms/demo-flows/provider-run-analysis.md)

---
title: Jun 23, 2025: Snowflake Native App Framework updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-23-auto-privs-app-spec.md
section: Release Notes
---

# Jun 23, 2025: Snowflake Native App Framework updates

The Snowflake Native App Framework now includes the following features that make it easier for providers
to develop an app to create objects in a consumer account. These features also
make it easier for consumers to configure an app during installation and upgrade.

For general information on these features, see [Create and access objects in a consumer account](../../../developer-guide/native-apps/requesting-about.md).

## Automated granting of privileges (*Preview*)

This feature allows providers to specify in the manifest file the privileges required by an app.
When a consumer installs or upgrades an app, Snowflake grants these privileges to the app. For
more information, see [Configure the privileges required by an app](../../../developer-guide/native-apps/requesting-auto-privs.md).

Consumers can use feature policies to override the automatic grants for an app. For more
information, see [Use feature policies to limit the objects an app can create](../../../developer-guide/native-apps/ui-consumer-feature-policies.md).

## App specifications (*Preview*)

App specifications allow providers to request permission from the consumer to allow connections
outside Snowflake that use external access integrations or security integrations. Consumers must
approve the app specification for these objects when configuring the app after installation or
upgrade.

For more information, see [Overview of app specifications](../../../developer-guide/native-apps/requesting-app-specs.md).

---
title: Jun 24, 2025: Premium views in the organization account (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-24-premium-views.md
section: Release Notes
---

# Jun 24, 2025: Premium views in the organization account (*General availability*)

The ORGANIZATION_USAGE schema in the organization account contains premium views that aggregate account usage across accounts. These premium
views are not found in the ORGANIZATION_USAGE schema of a regular ORGADMIN-enabled account, and incur additional storage and compute costs.
For example, the organization account lets you query premium views to track login history, access history, and query history across the
organization.

For more information, see [Premium views in the organization account](../../../user-guide/organization-accounts-premium-views.md).

---
title: Jun 26, 2025: Clone dynamic tables as tables (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-26-clone-dt-as-table.md
section: Release Notes
---

# Jun 26, 2025: Clone dynamic tables as tables (*General availability*)

You can now clone dynamic tables as regular tables. Cloned tables inherit the same column definitions and data of the source dynamic table
but lack dynamic table-specific properties. They retain row access and masking policies, tags, clustering keys, and comments.

For more information, see [Clone a dynamic table to a new table](../../../user-guide/dynamic-tables-clone.md).

---
title: Jun 27, 2025: dbt Projects on Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-06-27-dbt-projects-on-snowflake.md
section: Release Notes
---

# Jun 27, 2025: dbt Projects on Snowflake (*Preview*)

dbt Projects on Snowflake are available in preview. dbt Projects on Snowflake let you use familiar Snowflake features to create, edit, test, run, and manage dbt Core projects. You can use Workspaces in Snowsight to work with dbt project files and directories and deploy a dbt project as a schema-level DBT PROJECT object. You can also use SQL to work with dbt project objects and use Snowflake CLI commands to integrate deployment and execution into your CI/CD workflows.

For more information, see [dbt Projects on Snowflake](../../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

---
title: June 03, 2024 — New EMBED_TEXT_1024 function for 1024 dimensional output vectors
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-03-embed-text-1024.md
section: Release Notes
---

# June 03, 2024 — New EMBED_TEXT_1024 function for 1024 dimensional output vectors

With this release, we are pleased to announce the availability of a vector embedding function that outputs 1024 dimension vectors.
This function enables important applications that require semantic vector search and retrieval.

The EMBED_TEXT_1024 function is available in the following region:

* AWS US West 2 (Oregon)

For more information, see [Vector Embeddings](../../../user-guide/snowflake-cortex/vector-embeddings.md).

## New SQL function

The following functions are now generally available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| [LLM Function](../../../user-guide/snowflake-cortex/aisql.md) | [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text_1024-snowflake-cortex.md) | Creates a vector embedding of 1024 dimensions for a given string of text in English. |

---
title: June 03-06, 2024 — Summit announcements
source: https://docs.snowflake.com/en/release-notes/2024/june-summit.md
section: Release Notes
---

# June 03-06, 2024 — Summit announcements

The following major features and enhancements were announced during Summit 2024.

> **Important:**
>
> This topic does not include every feature or enhancement announced during Summit 2024. In particular, it does not include features
> and enhancements that were announced, but are not yet in public preview or generally available.

## New features

### Specify appearance in Snowsight — *Preview*

With this release, we are pleased to announce the preview of specifying appearance, often referred to as dark mode, in [Snowsight](../../user-guide/ui-snowsight-gs.md).
You can now specify the color scheme for Snowsight, including a scheme using darker colors with light text on a dark background
designed to reduce eye strain in low-light conditions and provide a comfortable browsing experience for users who prefer darker color palettes.

This new feature includes three settings:

* System - Use the same mode as specified by the operating system running Snowsight.
* Light - Traditional dark characters on a lighter background, typically used in normal daylight.
* Dark - Light text on a dark background to reduce eye strain in low-light conditions.

For more information and to learn how to specify appearance in Snowsight, see [Specify appearance](../../user-guide/ui-snowsight-profile.md).

### Snowflake Native SDK for Connectors — *Preview*

We are pleased to announce the preview of the Snowflake Native SDK for Connectors.

The Snowflake Native SDK for Connectors is a library with templates and quickstarts in Java that you can use to quickly build your own
Snowflake Native App based Connectors to easily ingest data from an external data source into Snowflake. The sample connector in the
SDK outlines best practices to ingest data and customize application flows, along with ready-to-use code blocks for your own ingestion
logic.

For more information see [Snowflake Native SDK for Connectors](../../developer-guide/native-apps/connector-sdk/about-connector-sdk.md).

### Snowflake Notebooks — *Preview*

We are pleased to announce the preview of Snowflake Notebooks. Snowflake Notebooks is a development interface in [Snowsight](../../user-guide/ui-snowsight-gs.md)
that offers an interactive, cell-based programming environment for Python and SQL. In Snowflake Notebooks, you can perform exploratory
data analysis, develop machine learning models, and perform other data science and data engineering tasks all in one place.

For more information, see [About Legacy Snowflake Notebooks](../../user-guide/ui-snowsight/notebooks.md).

### Snowpark pandas API — *Preview*

We are pleased to announce the preview of the Snowpark pandas API. The Snowpark pandas API lets you run your pandas code directly on your
data in Snowflake.

Just by changing the import statement and a few lines of code, you can get the same pandas-native experience with the scalability and
security benefits of Snowflake. With this API, you can work with much larger datasets so you can avoid rewriting your pandas pipelines to
other big data frameworks. Snowpark pandas runs workloads natively in Snowflake through transpilation to SQL, enabling it to take advantage
of parallelization and the data governance and security benefits of Snowflake.

For more information, see [pandas on Snowflake](../../developer-guide/snowpark/python/pandas-on-snowflake.md).

### Snowflake Cortex Fine-Tuning — *Preview*

Fine-tuning allows users to adapt pre-trained models to more specialized tasks. If you don’t want the high cost of training a large model
from scratch but need better latency and results than you’re getting from prompt engineering or even retrieval augmented generation (RAG)
methods, fine-tuning an existing large model is an option. Fine-tuning allows you to use examples to adjust the behavior of the model and
improve the model’s knowledge of domain-specific tasks.

Cortex Fine-Tuning is a fully managed service that lets you fine-tune popular large language models using your data all within Snowflake.

For more information, see [Fine-tuning (Snowflake Cortex)](../../user-guide/snowflake-cortex/cortex-finetuning.md).

### Snowflake Native Apps with Snowpark Container Services — *Preview*

We are pleased to announce the preview of Snowpark Native Apps with Snowpark Container Services.

Snowflake Native Apps with Snowpark Container Services allows you to run any containerized service supported by
Snowpark Container Services within a Snowflake Native App.
Snowflake Native Apps with Snowpark Container Services leverage all of the features of the Snowflake Native App Framework,
including provider IP protection, security and governance, data sharing, monetization, and integration with compute resources.

For more information, see [About Snowflake Native Apps with Snowpark Container Services](../../developer-guide/native-apps/native-apps-about.md).

### Apache Iceberg™ tables — *General availability*

We are pleased to announce the general availability of Apache Iceberg™ tables for Snowflake, released with Snowflake version 8.20.

Iceberg tables for Snowflake combine the performance and query semantics of regular Snowflake tables
with external cloud storage that you manage. They are ideal for maintaining a single copy of data
with interoperability across a variety of compute engines.

For more information, see [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

## Extensibility updates

### Snowpark Python local testing framework — *General availability*

We are pleased to announce the general availability of the Snowpark Python local testing framework, which was previously available as a
preview feature. This local testing framework is an emulator that lets you test your Python code locally when working with the Snowpark Python
library.

The Snowpark Python local testing framework allows you to create and operate on Snowpark Python DataFrames locally without connecting to a
Snowflake account. You can use the local testing framework to test your DataFrame operations on your development machine or in a
CI (continuous integration) pipeline before deploying code changes to your account. The API is the same, so you can either run your tests
locally or against a Snowflake account without making code changes.

For more information, see [Local testing framework](../../developer-guide/snowpark/python/testing-locally.md).

## Snowsight updates

### Universal Search — *General availability*

We are pleased to announce the general availability of Universal Search in [Snowsight](../../user-guide/ui-snowsight-gs.md).

With Universal Search, you find even more objects than before, quickly and securely. Searching from the Search tab finds tables, functions,
databases, data products available to you in the Snowflake Marketplace, relevant Snowflake Documentation topics, and related articles in the
Snowflake Community Knowledge Base. With general availability, Universal Search now includes worksheets and dashboards in your search results.
Whether you enter a single word or a complete question in natural language, Universal Search can interpret your query by using your customizable
Snowflake asset metadata.

For more information, see [Search Snowflake objects and resources](../../user-guide/ui-snowsight-universal-search.md).

---
title: June 05, 2024 — New geospatial functions in preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-05.md
section: Release Notes
---

# June 05, 2024 — New geospatial functions in preview

## New geospatial functions available –— *Preview*

Four new functions for [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) objects are now available in preview:

* [H3_TRY_COVERAGE](../../../sql-reference/functions/h3_try_coverage.md) - A special version of [H3_COVERAGE](../../../sql-reference/functions/h3_coverage.md)
  that returns NULL if an error occurs when it attempts to return an [array](../../../sql-reference/data-types-semistructured.md) of IDs
  (as INTEGER values) identifying the minimal set of [H3](../../../sql-reference/data-types-geospatial.md) cells that completely
  cover a shape (specified by a [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) object).
* [H3_TRY_COVERAGE_STRINGS](../../../sql-reference/functions/h3_try_coverage_strings.md) - A special version of [H3_COVERAGE_STRINGS](../../../sql-reference/functions/h3_coverage_strings.md)
  that returns NULL if an error occurs when it attempts to return an [array](../../../sql-reference/data-types-semistructured.md) of hexadecimal IDs (as VARCHAR values)
  identifying the minimal set of [H3](../../../sql-reference/data-types-geospatial.md) cells that completely cover a shape
  (specified by a [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) object).
* [H3_TRY_POLYGON_TO_CELLS](../../../sql-reference/functions/h3_try_polygon_to_cells.md) - A special version of [H3_POLYGON_TO_CELLS](../../../sql-reference/functions/h3_polygon_to_cells.md)
  that returns NULL if an error occurs when it attempts to return an [array](../../../sql-reference/data-types-semistructured.md) of INTEGER values of the IDs of
  [H3](../../../sql-reference/data-types-geospatial.md) cells that have centroids contained by a Polygon
  (specified by a [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) object).
* [H3_TRY_POLYGON_TO_CELLS_STRINGS](../../../sql-reference/functions/h3_try_polygon_to_cells_strings.md) - A special version of [H3_POLYGON_TO_CELLS_STRINGS](../../../sql-reference/functions/h3_polygon_to_cells_strings.md)
  that returns NULL if an error occurs when it attempts to return an [array](../../../sql-reference/data-types-semistructured.md) of VARCHAR values of the
  hexadecimal IDs of [H3](../../../sql-reference/data-types-geospatial.md) cells that have centroids contained by
  a Polygon (specified by a [GEOGRAPHY](../../../sql-reference/data-types-geospatial.md) object).

---
title: June 07-08, 2023 — 7.19 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2023/7_19.md
section: Release Notes
---

# June 07-08, 2023 — 7.19 Release Notes (with behavior changes)

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## Behavior Changes Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2023_04](../bcr-bundles/2023_04_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_03](../bcr-bundles/2023_03_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_02](../bcr-bundles/2023_02_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for July; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New Features

### Anonymous Procedures — *General Availability*

With this release, we are pleased to announce the general availability of support for creating anonymous procedures. An anonymous procedure
is similar to a stored procedure, but not stored for later use.

You can create an anonymous procedure using the [WITH…CALL](../../sql-reference/sql/call-with.md) syntax. With this command, you both create
an anonymous procedure defined by parameters in the WITH clause and call that procedure. You do not need to have a role with CREATE
PROCEDURE schema privileges for this command.

### Reading Files With a Java Function or Procedure Handler — *General Availability*

With this release, we are pleased to announce the general availability of support for reading staged files with a UDF or procedure handler
code written in Java.

For more information, refer to [Reading a file with a Java UDF](../../developer-guide/udf/java/udf-java-cookbook.md) and [Reading files with a Java stored procedure](../../developer-guide/stored-procedure/java/procedure-java-read-files.md).

### Reading Files With a Scala Function or Procedure Handler — *Preview*

With this release, we are pleased to announce a preview of support for reading staged files with a UDF or procedure handler code written in
Scala.

For more information, refer to [Reading a file with a Scala UDF](../../developer-guide/udf/scala/udf-scala-examples.md) and [Reading files with a Scala stored procedure](../../developer-guide/stored-procedure/scala/procedure-scala-read-files.md).

### Reading Files With a Python Function or Procedure — *Preview*

With this release, we are pleased to announce a preview of Python support for reading files with the `SnowflakeFile` class.

`SnowflakeFile` is a new class in the `snowflake.snowpark.files` module that provides dynamic read access for files on an
internal or external stage. With `SnowflakeFile`, you can stream files to accomplish tasks such as reading unstructured data or using
your own machine learning model in a user-defined function (UDF), user-defined table function (UDTF), or stored procedure.

For more information, refer to:

* [Reading a File from a Python UDF Handler](../../developer-guide/udf/python/udf-python-examples.md)
* [Reading Files from a UDF with the Snowpark API](../../developer-guide/snowpark/python/creating-udfs.md)
* [Reading a File from a Python Stored Procedure Handler](../../developer-guide/stored-procedure/python/procedure-python-read-files.md)
* [Reading Files from a Stored Procedure with the Snowpark API](../../developer-guide/snowpark/python/creating-sprocs.md)

### Schema Detection for JSON and CSV — *Preview*

With this release, we are pleased to announce a preview of the schema detection feature for JSON and CSV. The schema detection feature uses
the INFER_SCHEMA function to automatically detect the schema in a set of staged data files and retrieve the column definitions. The
generally available INFER_SCHEMA function applies to Apache Parquet, Apache Avro, and ORC files. This preview function expands support to
include JSON and CSV files.

For more information, refer to [Schema detection of column definitions from staged semi-structured data files](../../user-guide/data-load-overview.md).

### Table Schema Evolution — *Preview*

With this release, we are pleased to announce a preview of the table schema evolution feature. The structure of tables in Snowflake can now
evolve automatically to support the structure of new data received from the data sources. Snowflake allows adding new columns or dropping
the NOT NULL constraint from columns missing in new data files, and supports dropping columns or changing the data type, length, or
precision of existing columns.

To enable table schema evolution, you can set the ENABLE_SCHEMA_EVOLUTION parameter to TRUE when you create or alter a table.

For more information, refer to [Enable automatic table schema evolution](../../user-guide/data-load-schema-evolution.md).

## SQL Updates

### Support for Python 3.9 in Snowpark, UDFs, and Stored Procedures — *Preview*

With this release, we are pleased to announce support for Python 3.9 in Snowpark Python, Python UDFs and Python stored procedures as a
preview feature to all accounts.

For more information, refer to:

* [Setting up your development environment for Snowpark Python](../../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)

### UDFs, UDTFs, and Stored Procedures Support Passing Arguments by Name

When calling a UDF, UDTF, or stored procedure, you can now pass arguments by name, in addition to by position.

For example, suppose that you created a UDF with the following statement:

```sqlexample
CREATE OR REPLACE FUNCTION add_numbers (n1 NUMBER, n2 NUMBER)
  RETURNS NUMBER
  AS 'n1 + n2';
```

To pass the arguments by name, specify the argument name followed by => and the argument value. For example:

```sqlexample
SELECT add_numbers(n1 => 10, n2 => 5);
```

You can pass the arguments in any order:

```sqlexample
SELECT add_numbers(n2 => 5, n1 => 10);
```

For more information, refer to:

* [Executing a UDF](../../developer-guide/udf/udf-calling-sql.md)
* [Calling a stored procedure](../../developer-guide/stored-procedure/stored-procedures-calling.md)

If there are multiple functions or procedures with the same name, the same number of arguments, and different data types for the arguments,
you can specify the argument names in the call to indicate which function or procedure to execute. The argument names that you specify in
the call take precedence over the argument positions. For more information, refer to [Overloading procedures and functions](../../developer-guide/udf-stored-procedure-naming-conventions.md).

Finally, the following built-in functions support passing arguments by name:

* [CHECK_XML](../../sql-reference/functions/check_xml.md)
* [PARSE_XML](../../sql-reference/functions/parse_xml.md)
* [ROUND](../../sql-reference/functions/round.md)

## Data Science Updates

### Work With Snowflake’s Upcoming ML features

This release introduces a new schema, “ML”, to the Snowflake database, along with an ML_USER SNOWFLAKE database role, which is granted to
the PUBLIC role in all Snowflake accounts containing a shared SNOWFLAKE database.

For more information, refer to:

* [The ML schema in the SNOWFLAKE Database](../../sql-reference/snowflake-db.md)
* [SNOWFLAKE database roles](../../sql-reference/snowflake-db-roles.md)

The schema, roles, and privileges support features that will be made available in Public Preview at Snowflake Summit 2023.

## Organization Updates

### ACCOUNTS View (Organization Usage) — *Preview*

With this release, we are pleased to announce the preview of the ACCOUNTS view in the ORGANIZATION_USAGE schema. The ACCOUNTS view allows an
organization administrator to obtain details about the accounts in an organization, including accounts deleted within the last year.

For more information, refer to [ACCOUNTS view](../../sql-reference/organization-usage/accounts.md).

## Web Interface Updates

### New Organizations Only Have Snowsight Access

Starting May 30, 2023, new Snowflake organizations only have access to Snowsight and no longer have access to Classic Console.

For more information, refer to [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

---
title: June 10, 2024 — Apache Iceberg™ tables — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-10-iceberg-tables.md
section: Release Notes
---

# June 10, 2024 — Apache Iceberg™ tables — *General Availability*

We are pleased to announce the general availability of Apache Iceberg™ tables for Snowflake, released with Snowflake version 8.20.

Iceberg tables for Snowflake combine the performance and query semantics of regular Snowflake tables
with external cloud storage that you manage. They are ideal for maintaining a single copy of data
with interoperability across a variety of compute engines.

For more information, see [Apache Iceberg™ tables](../../../user-guide/tables-iceberg.md).

---
title: June 10-15, 2024 — 8.22 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_22.md
section: Release Notes
---

# June 10-15, 2024 — 8.22 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_05](../bcr-bundles/2024_05_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_04](../bcr-bundles/2024_04_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_03](../bcr-bundles/2024_03_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for July 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### Some bitwise expression functions support BINARY data

The following bitwise expression functions now support binary data:

* [BITAND](../../sql-reference/functions/bitand.md)
* [BITNOT](../../sql-reference/functions/bitnot.md)
* [BITOR](../../sql-reference/functions/bitor.md)
* [BITSHIFTLEFT](../../sql-reference/functions/bitshiftleft.md)
* [BITSHIFTRIGHT](../../sql-reference/functions/bitshiftright.md)
* [BITXOR](../../sql-reference/functions/bitxor.md)

These functions now accept BINARY values in arguments and return BINARY values when passed binary input.

## Virtual warehouse updates

### Account Usage: WAREHOUSE_EVENTS_HISTORY view — *General availability*

With this release, we are pleased to announce the general availability of the WAREHOUSE_EVENTS_HISTORY view, which was previously
available as a preview feature. The view provides a record of warehouse state changes and logs the warehouse event (for example, a
warehouse suspend event) and the timestamp and reason for the event (for example, a user-initiated warehouse suspend event).

See [WAREHOUSE_EVENTS_HISTORY view](../../sql-reference/account-usage/warehouse_events_history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 10-Jun-24 |

---
title: June 11, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-11-dcr.md
section: Release Notes
---

# June 11, 2024 — Snowflake Data Clean Rooms Release Notes

This topic provides an overview of the new features, enhancements, and other important changes introduced in this update to Snowflake Data
Clean Rooms.

## Additional supported regions — *General Availability*

With this release, we are pleased to announce that Snowflake Data Clean Rooms are now available in all commercial regions currently
supported by Snowflake. The new regions consist of the following:

**South America**

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Amazon Web Service | South America (Sao Paulo) | sa-east-1 |

**Middle East**

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Microsoft Azure | UAE North (Dubai) | uae-north |

**Asia Pacific**

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Amazon Web Services | Asia Pacific (Tokyo) | ap-northeast-1 |
|  | Asia Pacific (Osaka) | ap-northeast-3 |
|  | Asia Pacific (Seoul) | ap-northeast-2 |
|  | Asia Pacific (Jakarta) | ap-southeast-3 |
| Microsoft Azure | Japan East (Tokyo) | japaneast |

## Granular access management for Snowflake data — *General Availability*

With this release, we are pleased announce that account administrators and object owners can grant access to Snowflake data at the schema
and object level. To grant access so clean room users can link data into a clean room, the administrator or owner registers the database,
schema, or object using the web app or developer APIs. When a database or schema is registered, all of the tables and views in that database
or schema are registered.

For more information about granting access, see [Registering data](../../../user-guide/cleanrooms/register-data.md).

## Choosing a warehouse when running an analysis — *General Availability*

With this release, we are pleased to announce that clean room users can select which warehouse they want to use when running an analysis
with select templates. This helps analysts use an optimal warehouse size and type, which can speed up the analysis.

Snowflake provides a set of default warehouses to choose from, but the Snowflake administrator can create and configure additional
warehouses that can be used for the analysis.

For more information, see [Select a warehouse for an analysis](../../../user-guide/cleanrooms/v1/web-app-working.md).

## Support for multiple custom templates in web app — *General Availability*

With this release, we are pleased to announce that providers can use the developer APIs to add multiple custom templates to the web app.
Now, consumers can use a user interface to run different types of custom analyses in a single clean room.

For an extended example of using the developer API to add custom templates to the web app, see
[Define a user input form for a custom template](../../../user-guide/cleanrooms/demo-flows/custom-templates.md).

---
title: June 14-15, 2023 — 7.20 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_20.md
section: Release Notes
---

# June 14-15, 2023 — 7.20 Release Notes

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions about these additions, please contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Snowpipe Streaming Replication Support — *Preview*

With this release, we are pleased to announce the support of Snowpipe Streaming with [Snowflake replication](../../user-guide/account-replication-intro.md). Snowflake supports the replication and failover of Snowflake tables populated by Snowpipe
Streaming and its associated channel offsets from a source account to a target account in different
[regions](../../user-guide/intro-regions.md) and across [cloud platforms](../../user-guide/intro-cloud-platforms.md) with replication. Snowpipe
streaming supports both database replication and group-based replication.

For more information, refer to [Replication and Snowpipe Streaming](../../user-guide/account-replication-considerations.md).

## Security Updates

### Access Control: New Privilege for Delegating Warehouse Management — *Preview*

With this release, we are pleased to announce a preview of a new privilege for managing warehouses.

If you need to delegate the ability to alter, suspend, or resume any warehouse in your account to a custom role, you can grant the MANAGE
WAREHOUSES privilege to that role. Granting the MANAGE WAREHOUSES privilege is equivalent to granting the MODIFY, MONITOR, and OPERATE
privileges on all warehouses in the account.

For more information, refer to [Delegating warehouse management](../../user-guide/warehouses-tasks.md).

## SQL Updates

### Improved Performance for SELECT Statements With LIMIT and ORDER BY Clauses — *General Availability*

With this release, we are pleased to announce that the performance of certain long-running SELECT statements containing both LIMIT and ORDER
BY clauses has been significantly improved. This improvement is immediately available to all customers at no additional cost.

The improvement works by pruning micro-partitions that cannot affect the results of such “top K” queries. The additional pruning applies to
queries where an integer-representable value (timestamp or integer, or variant explicitly cast to integer, but not an expression) is the
first or only column specified in the ORDER BY clause. If the query contains a JOIN clause, the ORDER BY column must be from the fact table
(or probe side), typically the larger of the two tables.

Queries on small tables generally do not benefit from this improvement. Queries that return fewer than the number of rows specified in the
LIMIT clause, or that use aggregations, also do not benefit.

Note that not all queries, not even all queries that meet these requirements, will benefit.

For more information on micro-partitions and query pruning, refer to [Micro-partitions & Data Clustering](../../user-guide/tables-clustering-micropartitions.md).

### Support for Python 3.10 in Snowpark, UDFs, UDTFs and Stored Procedures — *Preview*

With this release, we are pleased to announce support for Python 3.10 in Snowpark Python, Python UDFs, Python UDTFs and Python stored
procedures as a preview feature to all accounts.

For more information, refer to:

* [Setting up your development environment for Snowpark Python](../../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md)

## Data Governance Updates

### Tag-based Masking Policy: Support for Database & Schema — *Preview*

With this release, we are pleased to announce the preview of setting a tag-based masking policy on a database and schema. This update
enables data engineers to protect all columns in a schema or database when the data type of the column matches the data type of the policy
set on the tag. Additionally, a new column is protected when its data type matches the data type of the policy set on the tag. Setting the
tag-based masking policy on the database or schema simplifies data protection management because you can set the tag-based policy once and
not have to set a masking policy on every column in the database or schema.

For more information, refer to [Tag-based masking policies](../../user-guide/tag-based-masking-policies.md).

### Access History: Track Objects Modified by a DDL Operation — *Preview*

With this release, we are pleased to announce the preview of tracking objects modified by a DDL operation in the Account Usage
ACCESS_HISTORY view. These operations include:

* Track how tag and policy assignments change.
* Track the table and column lifecycle.

The `object_modified_by_ddl` column records these changes. You can use this column to enhance your data auditing practices and detect
new objects to classify to meet PII detection requirements.

For more information, refer to [Access History](../../user-guide/access-history.md).

## Web Interface Updates

### Load Files From a Stage Into a Table — *General Availability*

With this release, we are pleased to announce the general availability of loading files from a stage into a table by using Snowsight.

For more information, refer to [Load data into an existing table using Snowsight](../../user-guide/data-load-web-ui.md).

---
title: June 15, 2024 — Anomaly Detection
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-15-anomaly-detection.md
section: Release Notes
---

# June 15, 2024 — Anomaly Detection

With this release, we are pleased to announce an update to the Anomaly Detection ML Function that improves its algorithm, and results in improved performance.
If you noticed degraded performance from a model you trained between May 27, 2024, and June 15, 2024, please train and use a new model.

For more information, see [Anomaly Detection (Snowflake ML Functions)](../../../user-guide/ml-functions/anomaly-detection.md).

---
title: June 17, 2024 — New LLM helper functions - TRY_COMPLETE and COUNT_TOKENS
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-17-new-llm-functions.md
section: Release Notes
---

# June 17, 2024 — New LLM helper functions - TRY_COMPLETE and COUNT_TOKENS

With this release, we are pleased to announce the availability of two Cortex LLM helper functions, TRY_COMPLETE and COUNT_TOKENS.
These functions are purpose-built and managed functions that help to reduce cases of query failures when the number of input tokens
exceed a model limit.

For more information, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

## New SQL function

The following functions are now generally available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| [LLM Function](../../../user-guide/snowflake-cortex/aisql.md) | [TRY_COMPLETE (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/try_complete-snowflake-cortex.md) | Tries to run the COMPLETE function but returns NULL instead of an error code if unable to run. |
| [LLM Function](../../../user-guide/snowflake-cortex/aisql.md) | [COUNT_TOKENS (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/count_tokens-snowflake-cortex.md) | Counts the tokens in a given input text based on the model or function specified. |

---
title: June 17-30, 2024 — 8.23 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_23.md
section: Release Notes
---

# June 17-30, 2024 — 8.23 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Security updates

### Trust Center: Security Essentials scanner package — *General availability*

With this release, we are pleased to announce the general availability of the free Security Essentials scanner package in the Trust Center.

For more information, see [Security Essentials scanner package](../../user-guide/trust-center/overview.md).

## SQL updates

### Window functions: extended support for RANGE BETWEEN — *Preview*

With this release, we are pleased to announce that RANGE BETWEEN window frames with explicit offsets (n PRECEDING and n FOLLOWING) are supported for the following window functions:

* COUNT
* SUM
* MIN
* MAX
* AVG

DISTINCT versions of these window functions do not support this syntax. RANGE BETWEEN window frames with explicit offsets support only one numeric or datetime ORDER BY expression.

This extended range-based functionality makes it easier to run moving aggregations when expected or unexpected missing records cause gaps to occur in time-series data sets.

For more information, see [Syntax](../../sql-reference/functions-window-syntax.md) and these additional sections:

* [RANGE BETWEEN n PRECEDING | n FOLLOWING](../../sql-reference/functions-window-syntax.md)
* [Range-based versus row-based window frames](../../user-guide/functions-window-using.md)
* [RANGE BETWEEN limitations](../../sql-reference/functions-window-syntax.md)
* [RANGE BETWEEN example with explicit numeric offsets](../../sql-reference/functions-window-syntax.md)

### Account Usage: TABLE_DML_HISTORY and TABLE_PRUNING_HISTORY views — *General availability*

With this release, we are pleased to announce the following Account Usage views:

* [TABLE_DML_HISTORY view](../../sql-reference/account-usage/table_dml_history.md) can be used to determine the magnitude and effects of the DML operations performed on a table.
* [TABLE_PRUNING_HISTORY view](../../sql-reference/account-usage/table_pruning_history.md) can be used to determine pruning efficiency for all tables.

## Data governance updates

### Data quality: add new system DMFs — *Preview*

With this release, we are pleased to announce new system data metric functions to support data quality measurements related to statistics and accuracy:

* SNOWFLAKE.CORE.AVG
* SNOWFLAKE.CORE.BLANK_COUNT
* SNOWFLAKE.CORE.BLANK_PERCENT
* SNOWFLAKE.CORE.MAX
* SNOWFLAKE.CORE.MIN
* SNOWFLAKE.CORE.NULL_PERCENT
* SNOWFLAKE.CORE.STDDEV

Data quality and data metric functions are in preview as of [March 29, 2024](other/2024-03-29-dmf.md).

For more information, see [System data metric functions](../../user-guide/data-quality-system-dmfs.md).

## Data pipelines updates

### ALTER DYNAMIC TABLE command: Support for adding search optimization and setting additional properties

With this release, you can now use the ALTER DYNAMIC TABLE command to do the following:

* Enable search optimization for your dynamic tables.
* Set the following properties for your dynamic tables:

  + Retention time
  + Comments
  + Default collation
  + Tags
  + Row access and masking policies

For more information, see [ALTER DYNAMIC TABLE](../../sql-reference/sql/alter-dynamic-table.md).

## Snowflake Native App Framework

### Updates to logging and tracing for a Snowflake Native App — *Preview*

With this release, we are pleased to announce enhancements to the logging and tracing functionality for a Snowflake Native App. These enhancements allow more granular filtering of the log and trace events that are shared with a provider. Providers can use event definitions to specify which log and trace events are required or optional for an app. Consumers can configure which log and trace events they want to share with a provider.

For more information, see [Enable logging and event sharing for an application](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

### New events generated during app installation and upgrade — *Preview*

With this release, we are pleased to announce the addition of trace events that are generated when a Snowflake Native App is installed or upgraded. Consumers can view these events to determine the status of the app installation or upgrade.

For more information, see [Enable logging and event sharing for an application](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 17-Jun-24 |
| *Trust Center: Security Essentials scanner package — General availability* | **Added** to *Security updates* section | 02-Jul-24 |

---
title: June 19-22, 2023 — 7.21 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_21.md
section: Release Notes
---

# June 19-22, 2023 — 7.21 Release Notes

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced in this release. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Geospatial Functions (Transformation) | [ST_TRANSFORM](../../sql-reference/functions/st_transform.md) | Converts a [GEOMETRY](../../sql-reference/data-types-geospatial.md) object from one [spatial reference system (SRS)](https://en.wikipedia.org/wiki/Spatial_reference_system) to another.  This function is a preview feature. |

## Data Loading Updates

### Support REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML

With this release, we are pleased to announce that the COPY INTO and CREATE EXTERNAL TABLE commands support the file format option
REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML. Previously, this file format option only worked with CSV and JSON.

For more information, refer to [CREATE FILE FORMAT](../../sql-reference/sql/create-file-format.md).

---
title: June 2023
source: https://docs.snowflake.com/en/release-notes/2023-06.md
section: Release Notes
---

# June 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Dynamic Tables - *Preview*

We are pleased to announce the preview of Dynamic Tables.

Dynamic tables are the building blocks of declarative data transformation pipelines.
They significantly simplify data engineering in Snowflake and provide a reliable, cost-effective, and automated way to transform your data
for consumption. Instead of defining data transformation steps as a series of tasks and having to monitor dependencies and scheduling, you
can simply define the end state of the transformation using dynamic tables and leave the complex pipeline management to Snowflake.

For more information, see [Dynamic Tables](../user-guide/dynamic-tables-about.md).

### Amazon S3-compatible Storage — *General Availability*

We are pleased to announce the general availability of support for accessing data in Amazon S3-compatible storage. You can create
external stages for on-premises or other cloud storage services and devices that are highly compliant with the Amazon S3 REST API. With this
feature, you can efficiently manage, govern, and analyze your data regardless of where the data is stored.

For more information, see [Work with Amazon S3-compatible storage](../user-guide/data-load-s3-compatible-storage.md).

### Passing References for Tables, Views, Functions, and Queries to a Stored Procedure — *Preview*

We are pleased to announce the preview of the ability to pass references for tables, views, functions, and queries to a stored procedure.

A reference is a unique identifier for a table, view, function, or query. When you pass a reference to a stored procedure, the stored
procedure performs actions using the active role or secondary roles of the user who created the reference. For example, if you are calling
an owner’s rights stored procedure, you can create and pass in a reference to a table to allow the stored procedure to perform actions on
the table using your active role.

In addition, if the table, view, or function is not fully qualified, the name of the object is resolved by using the current database and
schema when the reference was created (i.e. the database and schema of the user who created the reference).

For more information, see [Passing references for objects and queries to stored procedures](../developer-guide/stored-procedure/stored-procedures-calling-references.md).

### Snowpark ML: Machine Learning at Scale — *Preview*

We are pleased to announce the preview of Snowpark ML. Snowpark ML is a set of Python tools, including SDKs and underlying infrastructure,
for building and deploying machine learning models within Snowflake. This preview includes preprocessing and modeling classes based on
popular machine learning libraries such as [scikit-learn](https://scikit-learn.org/stable/),
[xgboost](https://xgboost.readthedocs.io/en/stable/), and [lightgbm](https://lightgbm.readthedocs.io/en/stable/).

Snowpark ML works with [Snowpark Python](../developer-guide/snowpark/python/index.md). You use Snowpark DataFrames to hold your training or
test data and to receive your prediction results.

For more information, see [Snowflake ML: End-to-End Machine Learning](../developer-guide/snowflake-ml/overview.md).

### ML Functions — *Preview*

We are pleased to announce the preview of three new analysis tools powered by machine learning algorithms.

These three features train a machine learning model on your time-series data to determine how a specified metric varies over time and
relative to other features. The model then provides insights and predictions based on the trends detected in the data.

* **Forecasting**: Predicts future metric values from trends in historical data.
* **Anomaly Detection**: Flags metric values that differ from typical expectations.
* **Contribution Explorer**: Helps you find dimensions and values that affect the metric in surprising ways.

For more information, see [ML Functions](../guides-overview-ml-functions.md).

### Native Applications Framework — *Preview*

We are pleased to announce the preview of the Native Apps Framework that enables you to create data applications that expand the
capabilities of other Snowflake features by sharing data and related business logic with other Snowflake accounts.

For more information, see [About the Native Apps
Framework](../developer-guide/native-apps/native-apps-about.md) and [Tutorial: Developing an Application
with the Native Apps Framework](../developer-guide/native-apps/tutorials/getting-started-tutorial.md).

### Custom Event Billing for Applications — *Preview*

We are pleased to announce the preview of Custom Event Billing, a usage-based pricing plan that providers can use to charge consumers for
usage of apps built with the Snowflake Native Apps Framework.

For more information, see [Paid listings pricing models](../collaboration/provider-listings-pricing-model.md) and [Adding Billable Events to Applications](../developer-guide/native-apps/adding-custom-event-billing.md).

### Marketplace Capacity Drawdown Program — *General Availability*

We are pleased to announce the general availability of the Marketplace Capacity Drawdown Program, which allows eligible customers with a
Capacity contract at Snowflake to pay for listings with their committed Capacity.

See [Pay for listings](../collaboration/consumer-listings-paying.md) for more information.

### Snowpipe Streaming Replication Support — *Preview*

With this release, we are pleased to announce the support of Snowpipe Streaming with [Snowflake replication](../user-guide/account-replication-intro.md). Snowflake supports the replication and failover of Snowflake tables populated by Snowpipe
Streaming and its associated channel offsets from a source account to a target account in different
[regions](../user-guide/intro-regions.md) and across [cloud platforms](../user-guide/intro-cloud-platforms.md) with replication. Snowpipe
streaming supports both database replication and group-based replication.

For more information, see [Replication and Snowpipe Streaming](../user-guide/account-replication-considerations.md).

### Anonymous Procedures — *General Availability*

With this release, we are pleased to announce the general availability of support for creating anonymous procedures. An anonymous procedure
is similar to a stored procedure, but not stored for later use.

You can create an anonymous procedure using the [WITH…CALL](../sql-reference/sql/call-with.md) syntax. With this command, you both create
an anonymous procedure defined by parameters in the WITH clause and call that procedure. You do not need to have a role with CREATE
PROCEDURE schema privileges for this command.

### Reading Files With a Java Function or Procedure Handler — *General Availability*

With this release, we are pleased to announce the general availability of support for reading staged files with a UDF or procedure handler
code written in Java.

For more information, see [Reading a file with a Java UDF](../developer-guide/udf/java/udf-java-cookbook.md) and [Reading files with a Java stored procedure](../developer-guide/stored-procedure/java/procedure-java-read-files.md).

### Reading Files With a Scala Function or Procedure Handler — *Preview*

With this release, we are pleased to announce a preview of support for reading staged files with a UDF or procedure handler code written in
Scala.

For more information, see [Reading a file with a Scala UDF](../developer-guide/udf/scala/udf-scala-examples.md) and [Reading files with a Scala stored procedure](../developer-guide/stored-procedure/scala/procedure-scala-read-files.md).

### Reading Files With a Python Function or Procedure — *Preview*

With this release, we are pleased to announce a preview of Python support for reading files with the `SnowflakeFile` class.

`SnowflakeFile` is a new class in the `snowflake.snowpark.files` module that provides dynamic read access for files on an
internal or external stage. With `SnowflakeFile`, you can stream files to accomplish tasks such as reading unstructured data or using
your own machine learning model in a user-defined function (UDF), user-defined table function (UDTF), or stored procedure.

For more information, see:

* [Reading a File from a Python UDF Handler](../developer-guide/udf/python/udf-python-examples.md)
* [Reading Files from a UDF with the Snowpark API](../developer-guide/snowpark/python/creating-udfs.md)
* [Reading a File from a Python Stored Procedure Handler](../developer-guide/stored-procedure/python/procedure-python-read-files.md)
* [Reading Files from a Stored Procedure with the Snowpark API](../developer-guide/snowpark/python/creating-sprocs.md)

### Schema Detection for JSON and CSV — *Preview*

With this release, we are pleased to announce a preview of the schema detection feature for JSON and CSV. The schema detection feature uses
the INFER_SCHEMA function to automatically detect the schema in a set of staged data files and retrieve the column definitions. The
generally available INFER_SCHEMA function applies to Apache Parquet, Apache Avro, and ORC files. This preview function expands support to
include JSON and CSV files.

For more information, see [Schema detection of column definitions from staged semi-structured data files](../user-guide/data-load-overview.md).

### Table Schema Evolution — *Preview*

With this release, we are pleased to announce a preview of the table schema evolution feature. The structure of tables in Snowflake can now
evolve automatically to support the structure of new data received from the data sources. Snowflake allows adding new columns or dropping
the NOT NULL constraint from columns missing in new data files, and supports dropping columns or changing the data type, length, or
precision of existing columns.

To enable table schema evolution, you can set the ENABLE_SCHEMA_EVOLUTION parameter to TRUE when you create or alter a table.

For more information, see [Enable automatic table schema evolution](../user-guide/data-load-schema-evolution.md).

## Security Updates

### Access Control: New Privilege for Delegating Warehouse Management — *Preview*

With this release, we are pleased to announce a preview of a new privilege for managing warehouses.

If you need to delegate the ability to alter, suspend, or resume any warehouse in your account to a custom role, you can grant the MANAGE
WAREHOUSES privilege to that role. Granting the MANAGE WAREHOUSES privilege is equivalent to granting the MODIFY, MONITOR, and OPERATE
privileges on all warehouses in the account.

For more information, see [Delegating warehouse management](../user-guide/warehouses-tasks.md).

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Geospatial Functions (Transformation) | [ST_TRANSFORM](../sql-reference/functions/st_transform.md) | Converts a [GEOMETRY](../sql-reference/data-types-geospatial.md) object from one [spatial reference system (SRS)](https://en.wikipedia.org/wiki/Spatial_reference_system) to another.  This function is a preview feature. |

### Improved Performance for SELECT Statements With LIMIT and ORDER BY Clauses — *General Availability*

With this release, we are pleased to announce that the performance of certain long-running SELECT statements containing both LIMIT and ORDER
BY clauses has been significantly improved. This improvement is immediately available to all customers at no additional cost.

The improvement works by pruning micro-partitions that cannot affect the results of such “top K” queries. The additional pruning applies to
queries where an integer-representable value (timestamp or integer, or variant explicitly cast to integer, but not an expression) is the
first or only column specified in the ORDER BY clause. If the query contains a JOIN clause, the ORDER BY column must be from the fact table
(or probe side), typically the larger of the two tables.

Queries on small tables generally do not benefit from this improvement. Queries that return fewer than the number of rows specified in the
LIMIT clause, or that use aggregations, also do not benefit.

Note that not all queries, not even all queries that meet these requirements, will benefit.

For more information on micro-partitions and query pruning, see [Micro-partitions & Data Clustering](../user-guide/tables-clustering-micropartitions.md).

### Support for Python 3.10 in Snowpark, UDFs, UDTFs and Stored Procedures — *Preview*

With this release, we are pleased to announce support for Python 3.10 in Snowpark Python, Python UDFs, Python UDTFs and Python stored
procedures as a preview feature to all accounts.

For more information, see:

* [Setting up your development environment for Snowpark Python](../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../developer-guide/stored-procedure/python/procedure-python-overview.md)

### Support for Python 3.9 in Snowpark, UDFs, and Stored Procedures — *Preview*

With this release, we are pleased to announce support for Python 3.9 in Snowpark Python, Python UDFs and Python stored procedures as a
preview feature to all accounts.

For more information, see:

* [Setting up your development environment for Snowpark Python](../developer-guide/snowpark/python/setup.md)
* [Introduction to Python UDFs](../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../developer-guide/stored-procedure/python/procedure-python-overview.md)

### UDFs, UDTFs, and Stored Procedures Support Passing Arguments by Name

When calling a UDF, UDTF, or stored procedure, you can now pass arguments by name, in addition to by position.

For example, suppose that you created a UDF with the following statement:

```sqlexample
CREATE OR REPLACE FUNCTION add_numbers (n1 NUMBER, n2 NUMBER)
  RETURNS NUMBER
  AS 'n1 + n2';
```

To pass the arguments by name, specify the argument name followed by => and the argument value. For example:

```sqlexample
SELECT add_numbers(n1 => 10, n2 => 5);
```

You can pass the arguments in any order:

```sqlexample
SELECT add_numbers(n2 => 5, n1 => 10);
```

For more information, see:

* [Executing a UDF](../developer-guide/udf/udf-calling-sql.md)
* [Calling a stored procedure](../developer-guide/stored-procedure/stored-procedures-calling.md)

If there are multiple functions or procedures with the same name, the same number of arguments, and different data types for the arguments,
you can specify the argument names in the call to indicate which function or procedure to execute. The argument names that you specify in
the call take precedence over the argument positions. For more information, see [Overloading procedures and functions](../developer-guide/udf-stored-procedure-naming-conventions.md).

Finally, the following built-in functions support passing arguments by name:

* [CHECK_XML](../sql-reference/functions/check_xml.md)
* [PARSE_XML](../sql-reference/functions/parse_xml.md)
* [ROUND](../sql-reference/functions/round.md)

## Data Science Updates

### Work With Snowflake’s Upcoming ML features

This release introduces a new schema, “ML”, to the Snowflake database, along with an ML_USER SNOWFLAKE database role, which is granted to
the PUBLIC role in all Snowflake accounts containing a shared SNOWFLAKE database.

For more information, see:

* [The ML schema in the SNOWFLAKE Database](../sql-reference/snowflake-db.md)
* [SNOWFLAKE database roles](../sql-reference/snowflake-db-roles.md)

The schema, roles, and privileges support features that will be made available in Public Preview at Snowflake Summit 2023.

## Organization Updates

### ACCOUNTS View (Organization Usage) — *Preview*

With this release, we are pleased to announce the preview of the ACCOUNTS view in the ORGANIZATION_USAGE schema. The ACCOUNTS view allows an
organization administrator to obtain details about the accounts in an organization, including accounts deleted within the last year.

For more information, see [ACCOUNTS view](../sql-reference/organization-usage/accounts.md).

## Data Loading Updates

### Support REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML

With this release, we are pleased to announce that the COPY INTO and CREATE EXTERNAL TABLE commands support the file format option
REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML. Previously, this file format option only worked with CSV and JSON.

For more information, see [CREATE FILE FORMAT](../sql-reference/sql/create-file-format.md).

## Data Governance Updates

### Tag-based Masking Policy: Support for Database & Schema — *Preview*

With this release, we are pleased to announce the preview of setting a tag-based masking policy on a database and schema. This update
enables data engineers to protect all columns in a schema or database when the data type of the column matches the data type of the policy
set on the tag. Additionally, a new column is protected when its data type matches the data type of the policy set on the tag. Setting the
tag-based masking policy on the database or schema simplifies data protection management because you can set the tag-based policy once and
not have to set a masking policy on every column in the database or schema.

For more information, see [Tag-based masking policies](../user-guide/tag-based-masking-policies.md).

### Access History: Track Objects Modified by a DDL Operation — *Preview*

With this release, we are pleased to announce the preview of tracking objects modified by a DDL operation in the Account Usage
ACCESS_HISTORY view. These operations include:

* Track how tag and policy assignments change.
* Track the table and column lifecycle.

The `object_modified_by_ddl` column records these changes. You can use this column to enhance your data auditing practices and detect
new objects to classify to meet PII detection requirements.

For more information, see [Access History](../user-guide/access-history.md).

## Web Interface Updates

### Load Files From a Stage Into a Table — *General Availability*

With this release, we are pleased to announce the general availability of loading files from a stage into a table by using Snowsight.

For more information, see [Load data into an existing table using Snowsight](../user-guide/data-load-web-ui.md).

### New Organizations Only Have Snowsight Access

Starting May 30, 2023, new Snowflake organizations only have access to Snowsight and no longer have access to Classic Console.

For more information, see [Snowsight: The Snowflake web interface](../user-guide/ui-snowsight.md).

---
title: June 21, 2024: Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-21-document-ai.md
section: Release Notes
---

# June 21, 2024: Document AI release notes

With this release, we are pleased to announce the availability of a new version of the Arctic-TILT model.
The new model version includes the following improvements:

* Extraction of lists of values
* Checkbox identification
* Query paraphrasing recognition to improve recognizing queries built as sentences, such as *Give me the date of the agreement*

You can now use the new model by creating a new Document AI model build.

---
title: June 24, 2024: Time Travel for hybrid tables –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-24-time-travel-hybrid-tables.md
section: Release Notes
---

# June 24, 2024: Time Travel for hybrid tables –— *Preview*

With this release, we are pleased to announce the preview of Time Travel support for hybrid tables.
You can use the AT TIMESTAMP parameter in SELECT statements to query historical data from both hybrid
tables and standard tables.

For more information, see [AT | BEFORE](../../../sql-reference/constructs/at-before.md) and
[Understanding & using Time Travel](../../../user-guide/data-time-travel.md).

---
title: June 25, 2024 — New TO_QUERY table function
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-25-to-query-function.md
section: Release Notes
---

# June 25, 2024 — New TO_QUERY table function

The [TO_QUERY](../../../sql-reference/functions/to_query.md) table function returns a result set based on SQL text and an optional
set of arguments that are passed to the SQL text if it is parameterized. The function compiles the SQL text as the
definition of a subquery in the FROM clause. When writing an application or a stored procedure, you can call this
function to construct a SQL statement.

---
title: June 25, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-25-dcr.md
section: Release Notes
---

# June 25, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Provider-run analyses

The provider of a clean room can now run analyses in their own clean room. This feature allows providers to directly extract insights
from data shared by the consumer within the clean room. Previously, providers could only be a data provider.

* For information about using the web app to execute provider-run analyses, see [Run an analysis as a provider](../../../user-guide/cleanrooms/v1/web-app-working.md).
* For information about using the developer API to execute provider-run analyses, see
  [Provider-run analyses](../../../user-guide/cleanrooms/demo-flows/provider-run-analysis.md).

## Consumer-defined templates

Consumers can now request that a provider allow them to add their own template to a clean room. After a request is raised, a provider can
review the template definition and decide whether to accept or reject the consumer’s request to add the template.

Because a consumer is often in the best position to know how to extract insights from data coming from multiple providers, the ability to
add a consumer-defined template is especially important when the consumer wants to execute a multi-provider analysis.

For more information, see [Consumer-written custom templates](../../../user-guide/cleanrooms/demo-flows/custom-templates.md).

## Granular access controls for tables and templates

Providers can now control which consumers can access a specific table or template in the clean room. This allows a provider to share the
same clean room with multiple consumers without giving full access to every consumer. Previously, all consumers could access a table or
template if it was included in the clean room.

Developers can now specify which consumers can access a table or template when executing the API to add it to the clean room. In addition,
the following new APIs control access:

* `provider.restrict_table_options_to_consumers`
* `provider.restrict_template_options_to_consumers`

For more information about these APIs, see [Snowflake Data Clean Rooms: Provider API reference guide](../../../user-guide/cleanrooms/provider.md).

## Activating results across regions

Consumers can now activate results back to the provider even if the provider is in a different cloud/region. To take advantage of this
enhancement, consumers must enable cross-cloud auto-fulfillment on their account.

## SQL Template enhancement

When a provider allows another party to query data via the SQL template, the provider can now specify custom aggregation threshold values
for each entity column they want to protect. This provides more flexible controls from the previous version of privacy controls, which
applied the same aggregation threshold across all join columns. Additionally, users no longer need to supply provider [p] & consumer [c]
aliases for tables and columns.

---
title: June 26, 2024 — Cost Management Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-26-cost.md
section: Release Notes
---

# June 26, 2024 — Cost Management Release Notes

## Organization Overview Page —– *General Availability*

With this release, we are pleased to announce the general availability of an Organization Overview page in Snowsight that
allows you to gain organization-level insights into the cost of using Snowflake, including:

* Details about the current contract.
* The remaining balance of the contract.
* The accumulated cost of Snowflake usage since the start of the contract.
* The monthly spend for the organization.
* An overview of the consumption of each account in the organization.

For more details about using the Organization Overview page, see [Overview of organization-level costs](../../../user-guide/cost-exploring-overall.md).

---
title: June 27, 2024 — Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-27-document-ai.md
section: Release Notes
---

# June 27, 2024 — Document AI release notes

With this release, we are pleased to announce the increase in number of documents that you can process
with `<model_build_name>!PREDICT` in one query.

You can now process a maximum of 1000 documents in one query, instead of 20.

---
title: June 28, 2024 — Custom UI in Streamlit in Snowflake –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-28-sis.md
section: Release Notes
---

# June 28, 2024 — Custom UI in Streamlit in Snowflake –— *Preview*

With this release, we are pleased to announce the preview of Custom UI in Streamlit in Snowflake.
Custom UI enables customization of the look, feel, and front-end behavior of Streamlit in Snowflake apps.

This feature supports the following:

* Custom HTML and CSS using `unsafe_allow_html=True` in [st.markdown](https://docs.streamlit.io/library/api-reference/text/st.markdown).
* Iframed HTML, CSS, and JavaScript using [st.components.v1.html](https://docs.streamlit.io/develop/api-reference/custom-components/st.components.v1.html).

---
title: June 28, 2024 — New geospatial H3 functions — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-28-geospatial-h3-functions-ga.md
section: Release Notes
---

# June 28, 2024 — New geospatial H3 functions — *General Availability*

H3 is a [hierarchical geospatial index](https://h3geo.org/docs/highlights/indexing) that partitions
the world into hexagonal cells in a [discrete global grid system](https://en.wikipedia.org/wiki/Discrete_global_grid).

The following H3 functions are now generally available:

* [H3_COMPACT_CELLS](../../../sql-reference/functions/h3_compact_cells.md)
* [H3_COMPACT_CELLS_STRINGS](../../../sql-reference/functions/h3_compact_cells_strings.md)
* [H3_IS_PENTAGON](../../../sql-reference/functions/h3_is_pentagon.md)
* [H3_IS_VALID_CELL](../../../sql-reference/functions/h3_is_valid_cell.md)
* [H3_TRY_COVERAGE](../../../sql-reference/functions/h3_try_coverage.md)
* [H3_TRY_COVERAGE_STRINGS](../../../sql-reference/functions/h3_try_coverage_strings.md)
* [H3_TRY_GRID_DISTANCE](../../../sql-reference/functions/h3_try_grid_distance.md)
* [H3_TRY_GRID_PATH](../../../sql-reference/functions/h3_try_grid_path.md)
* [H3_TRY_POLYGON_TO_CELLS](../../../sql-reference/functions/h3_try_polygon_to_cells.md)
* [H3_TRY_POLYGON_TO_CELLS_STRINGS](../../../sql-reference/functions/h3_try_polygon_to_cells_strings.md)
* [H3_UNCOMPACT_CELLS](../../../sql-reference/functions/h3_uncompact_cells.md)
* [H3_UNCOMPACT_CELLS_STRINGS](../../../sql-reference/functions/h3_uncompact_cells_strings.md)

For more information, see [Geospatial data types](../../../sql-reference/data-types-geospatial.md) and
[Geospatial functions](../../../sql-reference/functions-geospatial.md).

---
title: June 28, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-06-28.md
section: Release Notes
---

# June 28, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Snowsight and Multi-Factor Authentication reminders –— *General Availability*

With this release, we are pleased to announce the general availability of multi-factor authentication (MFA) configuration reminders in
Snowsight.

Snowflake takes security seriously, and with that in mind we strongly recommend users enable multi-factor authentication.
With the recent release of Snowsight we have enabled a reminder to assist users in enabling MFA.

If you have a password set and you sign in to Snowsight and you have not yet enabled MFA, a reminder dialog will be presented
with a link to enable MFA. You can enable MFA or dismiss the reminder.

Users who dismiss the reminder will be reminded again in several days’ time.

For more information, see [Enroll in multi-factor authentication (MFA)](../../../user-guide/ui-snowsight-profile.md).

---
title: June 3, 2024 — Entity-Level Privacy Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-06-03-entity-level.md
section: Release Notes
---

# June 3, 2024 — Entity-Level Privacy Release Notes

## Aggregation policies with entity-level privacy — *General Availability*

With this release, we are pleased to announce the general availability of entity-level privacy with aggregation policies. Aggregation
policies require queries to aggregate data into groups rather than return row-level results. Aggregation policies with entity-level privacy
ensures that each of those groups contains a minimum number of entities, where an entity is a logical object that needs to be protected
such as people, organizations, and locations.

For more information, see [Implementing entity-level privacy with aggregation policies](../../../user-guide/aggregation-policies-entity-privacy.md).

---
title: JWT Subject Claim Validation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2077.md
section: Release Notes
---

# JWT Subject Claim Validation

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

## Overview

This document describes an upcoming change to the validation process for JSON Web Tokens (JWTs) used with key-pair authentication for
Snowflake REST APIs. To enhance security, we are deprecating a legacy flow that accepted JWTs with an empty `sub` (subject) claim.

**Action Required:** If your application uses key-pair authentication, you must update your JWT generation logic to ensure the `sub` claim
is properly formatted to avoid authentication failures.

## What’s Changing

> **Note:**
>
> When this change bundle is enabled by default, JWTs with an empty `sub` claim will be rejected, regardless of the `iss` claim format.

Before the change:
:   JWTs were accepted even if the `sub` claim was empty, provided the `iss` (issuer) claim was formatted correctly for this case.

After the change:
:   The system will now strictly enforce that the `sub` claim must contain a valid value. If the `sub` claim is empty, the JWT will be
    rejected and authentication will fail.

## Impact

Any application or script that relies on the old behavior of sending a JWT with an empty `sub` claim will begin to fail authentication
requests. This will result in an immediate interruption of service for these applications.

## Required Actions

To avoid any service interruption, take the following steps:

1. Review your applications and scripts that use key-pair authentication to connect to Snowflake REST APIs.
2. Ensure that the logic for generating your JWT includes both a properly formatted `iss` (issuer) claim and a valid `sub` (subject) claim.
3. The correct format for the claims are:
   :   * `iss`: Must be formatted as `<account_identifier>.<user>.SHA256:<public_key_fingerprint>`
       * `sub`: Must be formatted as `<account_identifier>.<user>`

For detailed instructions and examples on how to correctly format and generate your JWT, see
[Using key pair authentication](../../../developer-guide/snowflake-rest-api/authentication.md).

Ref: 2077

---
title: Key-pair authentication for Google Cloud accounts in the us-central1 region
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2055.md
section: Release Notes
---

# Key-pair authentication for Google Cloud accounts in the us-central1 region

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, key-pair authentication for Google Cloud accounts in the `us-central1` region
changes in the following way:

Before the change:
:   When you use key-pair authentication from a Snowflake account in the Google Cloud `us-central1` region, specifying the account
    by using an account locator with [additional segments](../../../user-guide/gen-conn-config.md) is supported.

After the change:
:   When you use key-pair authentication across all cloud platforms and regions, you must specify the account by using *only* the
    account locator *without* additional segments.

    For information about constructing key-pair tokens, see the procedure for [generating a JWT](../../../developer-guide/snowflake-rest-api/authentication.md).

Ref: 2055

---
title: LISTING_ACCESS_HISTORY view (DATA_SHARING_USAGE): Deprecate LISTING_OBJECTS_ACCESSED column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1758.md
section: Release Notes
---

# LISTING_ACCESS_HISTORY view (DATA_SHARING_USAGE): Deprecate LISTING_OBJECTS_ACCESSED column

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

[BCR-1641](../2024_05/bcr-1641.md) introduced a new column named SHARE_OBJECTS_ACCESSED
that replaces the LISTING_OBJECTS_ACCESSED column.

This behavior change removes the original LISTING_OBJECTS_ACCESSED column.

When this behavior change bundle is enabled, the output of the LISTING_ACCESS_HISTORY view changes as follows:

Before the change:
:   The LISTING_OBJECTS_ACCESSED column appears in the output of the LISTING_ACCESS_HISTORY view.

After the change:
:   The LISTING_OBJECTS_ACCESSED column no longer appears in the output of the LISTING_ACCESS_HISTORY view.

Ref: 1758

---
title: LISTING_ACCESS_HISTORY view: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1641.md
section: Release Notes
---

# LISTING_ACCESS_HISTORY view: New columns

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

When this behavior change bundle is enabled, the [LISTING_ACCESS_HISTORY view](../../../sql-reference/data-sharing-usage/listing-access-history.md)
includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| IS_SHARE | BOOLEAN | TRUE if the access was on a share. When TRUE, the LISTING_OBJECTS_ACCESSED column provides details about the share objects accessed by the consumer query. |
| IS_APPLICATION | BOOLEAN | TRUE if the access was on an application. When TRUE, APPLICATION_OBJECTS_ACCESSED column provides details about the application objects accessed by the consumer query. |
| SHARE_OBJECTS_ACCESSED | ARRAY | See [LISTING_OBJECTS_ACCESSED array](../../../sql-reference/data-sharing-usage/listing-access-history.md) for formatting. Is NULL when IS_SHARE is FALSE.  Note: this column has the same data as the LISTING_OBJECTS_ACCESSED column.  It is recommended to use SHARE_OBJECTS_ACCESSED instead of LISTING_OBJECTS_ACCESSED as it may be deprecated in the future. |
| APPLICATION_OBJECTS_ACCESSED | ARRAY | See APPLICATION_OBJECTS_ACCESSED Array for formatting.  Is NULL when IS_APPLICATION is FALSE. |
| APPLICATION_PACKAGE_NAME | VARCHAR | The current name of the application package from which the application was installed.  Is NULL when IS_APPLICATION is FALSE. |
| APPLICATION_VERSION | VARCHAR | The version of the application when this query occurred.  Is NULL when IS_APPLICATION is FALSE. |
| APPLICATION_PATCH_ID | INTEGER | The patch number of the application when this query occurred.  Is NULL when IS_APPLICATION is FALSE. |

## APPLICATION_OBJECTS_ACCESSED Array

The APPLICATION_OBJECTS_ACCESSED array provides details about the objects in an application accessed by a consumer query.
The format of an item in the array depends on the type of object that was accessed.

> **Note:**
>
> Object IDs are not available and database names are masked.

### Functions

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "23662386A408C571B77FDC53691793E4992D1C12.SCHEMA_NAME.FUNCTION_NAME",
  "objectDomain": "Function"
}
```

### Stored procedures

```sqljson
{
  "argumentSignature": (function_signature varchar),
  "objectName": "23662386A408C571B77FDC53691793E4992D1C12.SCHEMA_NAME.PROCEDURE_NAME",
  "objectDomain":"Procedure"
}
```

### Tables, views, and columns

```sqljson
[
  {
    "Columns": [
      {
        "columnName": "column1_name"
      },
      {
        "columnName": "column2_name"
      }
    ],
    "objectDomain":"VIEW",
    "objectName": "5F3297829072D2E23B852D7787825FF762E74EF3.PUBLIC.VIEW_1"
  },
  {
    "Columns": [
      {
        "columnName": "column3_name"
      },
      {
        "columnName": "column4_name"
      }
    ],
    "objectDomain":"TABLE",
    "objectName": "D85A2CE1531C6C1E077FA701713047305BDF5A83.PUBLIC.TABLE1"
  }
]
```

Ref: 1641

---
title: LISTING_CONSUMPTION_DAILY view: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1601.md
section: Release Notes
---

# LISTING_CONSUMPTION_DAILY view: New columns

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the [LISTING_CONSUMPTION_DAILY view](../../../sql-reference/data-sharing-usage/listing-consumption-daily.md) includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| UNIQUE_USERS_1D | INT | Count of unique users in 1 day. |
| UNIQUE_USERS_7D | INT | Count of unique users in 7 days. |
| UNIQUE_USERS_28D | INT | Count of unique users in 28 days. |

Ref: 1601

---
title: LISTING_EVENTS_DAILY: new columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1566.md
section: Release Notes
---

# LISTING_EVENTS_DAILY: new columns in output

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, two new columns appear in
[LISTING_EVENTS_DAILY view](../../../sql-reference/data-sharing-usage/listing-events-daily.md) in the
[Data Sharing Usage](../../../sql-reference/data-sharing-usage.md) schema.

| Column name | Data type | Description |
| --- | --- | --- |
| ACCESS_TYPE | VARCHAR | The access type of the listing. The access type is also called the monetization type. |
| EVENT_TIMESTAMP | DATETIME | The date and time that a listing-related event occurred. |

Ref: 1566

---
title: LISTING_TELEMETRY_DAILY View (Data Sharing Usage): New Value for EVENT_TYPE Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1084.md
section: Release Notes
---

# LISTING_TELEMETRY_DAILY View (Data Sharing Usage): New Value for EVENT_TYPE Column

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The LISTING_TELEMETRY_DAILY View in the DATA_SHARING_USAGE schema includes a value for the EVENT_TYPE column.

Previously:
:   EVENT_TYPE column contains a value for the event that occurred for the listing. One of: GET, REQUEST, or LISTING CLICK.

Currently:
:   EVENT_TYPE column adds a new value of LISTING VIEW, recorded when a consumer visits a specific listing detail page.

Ref: 1084

---
title: LOAD_HISTORY and COPY_HISTORY Information Schema views: Showing only post-truncate load history
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1493.md
section: Release Notes
---

# LOAD_HISTORY and COPY_HISTORY Information Schema views: Showing only post-truncate load history

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The LOAD_HISTORY and COPY_HISTORY Information Schema views show results as follows:

Before the change:
:   The LOAD_HISTORY and COPY_HISTORY views show load history from both before and after the latest truncate operation on the target table.

After the change:
:   The LOAD_HISTORY and COPY_HISTORY views show load history only after the latest truncate operation on the target table.

After the behavior change, you can still see pre-truncate load history by querying the views and saving the load history before truncating the target table.
You can also query the views in the Account Usage schema, which contains a longer history and includes the pre-truncate operations.

Ref: 1493

---
title: Logging and tracing: Default event table included
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1598.md
section: Release Notes
---

# Logging and tracing: Default event table included

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

> **Attention:**
>
> This behavior change was originally in the 2024_04 bundle and was subsequently removed. The change is reintroduced in the 2024_06 bundle.

Before the change:
:   Snowflake does not include an event table by default. To begin using logging and tracing, you must, set it as the event table for the
    account to use, then enable logging and tracing. Before you install the event table, log or trace events are not captured, even with
    logging or tracing enabled.

After the change:
:   By default, Snowflake includes the following:

    * A default event table called SNOWFLAKE.TELEMETRY.EVENTS.

      If no event table is yet installed and active, the new default event table will be activated for the account. If an event table exists
      and is receiving data, it will remain active after the default event table is added.
    * A predefined view called EVENTS_VIEW in the TELEMETRY schema.

      The EVENTS_VIEW view is associated with the SNOWFLAKE.TELEMETRY.EVENTS event table.

    If logging and tracing were previously enabled and no events were captured because there was no active event table, the new default event
    table will begin capturing logging and tracing events. This will incur costs as described in
    [Costs of telemetry data collection](../../../developer-guide/logging-tracing/logging-tracing-billing.md).

    If you don’t yet have an event table and want to collect logging and tracing events, do nothing. New events will be captured in the
    SNOWFLAKE.TELEMETRY.EVENTS table in the SNOWFLAKE database, in the TELEMETRY schema.

    If you do not want to collect events for the associated objects, you can do any one of the following:

    * Disable or change the logging and tracing levels appropriately at the respective object levels. For more information, see
      [Setting levels for logging, metrics, and tracing](../../../developer-guide/logging-tracing/telemetry-levels.md) and [Setting levels for logging, metrics, and tracing](../../../developer-guide/logging-tracing/telemetry-levels.md).

      This option is not applicable for [Native Apps](../../../developer-guide/native-apps/native-apps-about.md).
    * Uninstall the applications/connector emitting log and trace events or drop the unnecessary objects.
    * If you do not want any logging and tracing events to be collected at all in the account, execute the following command:

      ```sqlexample
      ALTER ACCOUNT SET EVENT_TABLE = NONE
      ```

    If you create your own event table and [set it as active](../../../developer-guide/logging-tracing/event-table-setting-up.md), events will then be collected in
    that event table, and not in the default event table in the Snowflake database.

Ref: 1598

---
title: Logging and tracing: Logging of unhandled exceptions in handler code on by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1428.md
section: Release Notes
---

# Logging and tracing: Logging of unhandled exceptions in handler code on by default

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

[Logging of unhandled exceptions](../../../developer-guide/logging-tracing/unhandled-exception-messages.md) in procedure and UDF handler code
behaves as follows:

Before the change:
:   When an [event table](../../../developer-guide/logging-tracing/event-table-setting-up.md) has been associated with the Snowflake account,
    an unhandled exception occurring in procedure or UDF handler code won’t, by default, be logged in the event table.

    In other words, you can set up [logging and tracing](../../../developer-guide/logging-tracing/logging-tracing-overview.md), including
    creating an event table and associating it with your account, but leave unhandled exception logging off, such as to avoid having those
    exceptions logged. You can turn on exception logging by setting the ENABLE_UNHANDLED_EXCEPTIONS_REPORTING parameter to `true`.

After the change:
:   Unhandled exceptions in procedure or UDF handler code do, by default, result in log entries when you have an event table associated
    with the account.

    You can turn off logging for unhandled exceptions by setting the ENABLE_UNHANDLED_EXCEPTIONS_REPORTING parameter to `false`.

    When log entries might contain sensitive data, consider doing the following to protect the data:

    * Turn off unhandled exception logging.
    * If you leave unhandled exception logging on, take steps to protect sensitive data, such as by doing the following:

      + Improve your exception handling code to minimize the risk of unhandled exceptions.
      + Apply [row access policies](../../../user-guide/security-row-intro.md) to your event table to restrict access to rows that contain
        personally identifiable information (PII).
      + [Create a view](../../../sql-reference/sql/create-view.md) on top of the event table and
        [apply masking policies](../../../sql-reference/sql/create-masking-policy.md) to it to mask or delete personally identifiable
        information (PII).

Ref: 1428

---
title: LOGIN_HISTORY view (Account Usage, Organization Usage): New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2209.md
section: Release Notes
---

# LOGIN_HISTORY view (Account Usage, Organization Usage): New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the LOGIN_HISTORY views in the [Account Usage](../../../sql-reference/account-usage.md) and
[Organization Usage](../../../sql-reference/organization-usage.md) schemas include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| AUTHORIZING_INTEGRATION_NAME | VARCHAR | Name of the integration that allowed the user to authenticate.  For user-defined security integrations like OAuth or SAML2, displays the user-specified name of the integration. For internal Snowflake integrations, displays the type of integration. |

The new column appears after the CLIENT_IP column.

Ref: 2209

---
title: LOGIN_HISTORY view and functions: New column CLIENT_PRIVATE_LINK_ID
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1847.md
section: Release Notes
---

# LOGIN_HISTORY view and functions: New column CLIENT_PRIVATE_LINK_ID

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, a new column is included in the following:

* ACCOUNT_USAGE.LOGIN_HISTORY view
* Output of the INFORMATION_SCHEMA.LOGIN_HISTORY function
* Output of the INFORMATION_SCHEMA.LOGIN_HISTORY_BY_USER function

The new column in the view/output is the following:

| Column name | Data type | Description |
| --- | --- | --- |
| `client_private_link_id` | VARCHAR | If the user logged in using [private connectivity](../../../user-guide/private-connectivity-inbound.md), specifies the identifier of the endpoint from which the request originated. |

The column is added as the last column of the view/output.

Ref: 1847

---
title: LOGIN_HISTORY view and functions: New LOGIN_DETAILS column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2052.md
section: Release Notes
---

# LOGIN_HISTORY view and functions: New LOGIN_DETAILS column

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the following view and functions include a new column named `login_details` in their output.

* ACCOUNT_USAGE.LOGIN_HISTORY view
* INFORMATION_SCHEMA.LOGIN_HISTORY function output
* INFORMATION_SCHEMA.LOGIN_HISTORY_BY_USER function output

This column contains information about login details, such as malicious IP
protection status and risk classification. The column is added as the last column of the view or command output.

| Column name | Data type | Description |
| --- | --- | --- |
| login_details | VARCHAR | Provides information related to login details. |

Examples:

* Example 1:

  ```json
  {"malicious_ip_protection_info":"{\"categories\":[\"MALICIOUS_BEHAVIOR\"],\"result\":\"OPTED_OUT\",\"riskClassification\":\"LOW\"}"}
  ```
* Example 2:

  ```json
  {"malicious_ip_protection_info":"{\"categories\":[\"ANONYMOUS_VPN\",\"TOR_EXITS\"],\"result\":\"BLOCKED\",\"riskClassification\":\"HIGH\"}"}
  ```

Ref: 2052

---
title: LOGIN_HISTORY view and table function (Account Usage / Information Schema): New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1966.md
section: Release Notes
---

# LOGIN_HISTORY view and table function (Account Usage / Information Schema): New columns in output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the
[ACCOUNT_USAGE LOGIN_HISTORY view](../../../sql-reference/account-usage/login_history.md) and the output of the
[INFORMATION_SCHEMA LOGIN_HISTORY table function](../../../sql-reference/functions/login_history.md)
include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `first_authentication_factor_id` | VARCHAR | ID of the credential used to authenticate the user (the first factor, if using [multi-factor authentication](../../../user-guide/security-mfa.md)). |
| `second_authentication_factor_id` | VARCHAR | ID of the credential used for the [second factor](../../../user-guide/security-mfa-second-factor.md), if using multi-factor authentication. If the user did not use multi-factor authentication, this value is NULL. |

Ref: 1966

---
title: Managed Account Commands: Changes to Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1193.md
section: Release Notes
---

# Managed Account Commands: Changes to Output

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, the output of the SHOW MANAGED ACCOUNTS and CREATE MANAGED ACCOUNT commands display [account URLs](../../../user-guide/organizations-connect.md) and account names consistently across all Snowflake accounts.

Previously:
:   The output of the SHOW MANAGED ACCOUNTS and CREATE MANAGED ACCOUNT commands varied slightly depending on the Snowflake account. In some
    accounts:

    * The output still displayed the legacy account locator URL in the `url` column.
    * SHOW MANAGED ACCOUNTS output did not always include the `account_locator_url` column.
    * CREATE MANAGED ACCOUNT output did not always include the `accountLocatorUrl` column.
    * CREATE MANAGED ACCOUNT output sometimes displayed the account locator as the name of the account.

Currently:
:   The output of the SHOW MANAGED ACCOUNTS and CREATE MANAGED ACCOUNT commands is consistent among all accounts:

    * The output always includes the account name format of the account URL in the `url`
      column. The [account identifier](../../../user-guide/admin-account-identifier.md) in this format follows the pattern `orgname-account_name`.
    * SHOW MANAGED ACCOUNTS output displays the account locator format of the account URL in the `account_locator_url` column.
    * CREATE MANAGED ACCOUNT output displays the account locator format of the account URL in the `accountLocatorUrl` column.
    * CREATE MANAGED ACCOUNT output shows the account name in the `accountName` column and includes an `accountLocator`
      column that displays the account locator.

Ref: 1193

---
title: Mandatory multi-factor authentication on Snowsight login (Replaced)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1972.md
section: Release Notes
---

# Mandatory multi-factor authentication on Snowsight login (Replaced)

> **Important:**
>
> This behavior change has been replaced by a behavior change in the 2025_06 bundle: [Multi-factor authentication: MFA_ENROLLMENT parameter values change](../2025_06/bcr-2097.md).
>
> The new behavior change accomplishes the same thing: human users must authenticate with a second factor when using a password to access Snowsight.

This change is part of the planned deprecation of single-factor password sign-ins.

For information on the rollout of MFA, see [Planning for the deprecation of single-factor password sign-ins](../../../user-guide/security-mfa-rollout.md).

For Snowflake support of MFA, see [Configuring a second factor of authentication](../../../user-guide/security-mfa-second-factor.md).

Ref: 1972

---
title: Mar 02, 2026: No limit on the number of backup sets per object
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-02-backups-no-limit-backup-sets.md
section: Release Notes
---

# Mar 02, 2026: No limit on the number of backup sets per object

You can now create an unlimited number of backup sets for a specific database, schema, or table.

Previously, you could create a maximum of two database backup sets for a specific database, two schema
backup sets for a specific schema, and two table backup sets for a specific table. This limitation has
been removed.

For more information, see [Backups for disaster recovery and immutable storage](../../../user-guide/backups.md).

---
title: Mar 02, 2026: Query Delta-based Apache Iceberg™ tables with deletion vectors
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-02-iceberg-delta-deletion-vectors.md
section: Release Notes
---

# Mar 02, 2026: Query Delta-based Apache Iceberg™ tables with deletion vectors

You can now query Delta-based Apache Iceberg™ tables that contain deletion vectors and use liquid clustering. With this update, Snowflake
now supports minReaderVersion 3 and can read tables written by engines that use Delta Lake version 4.0.0, which is the latest version.

Previously, you could only query Delta-based Iceberg tables that used copy-on-write because Snowflake supported minReaderVersion 2 and
tables written by engines that use Delta Lake version 2.2.0.

For more information, see [CREATE ICEBERG TABLE (Delta files in object storage)](../../../sql-reference/sql/create-iceberg-table-delta.md).

---
title: Mar 02, 2026: Simplified pricing for hybrid tables
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-02-hybrid-tables-pricing.md
section: Release Notes
---

# Mar 02, 2026: Simplified pricing for hybrid tables

Snowflake has simplified the pricing model for hybrid tables.
Previously, hybrid tables were billed based on three categories:
hybrid table storage, virtual warehouse compute, and hybrid table
requests (serverless credits for read and write operations on the
underlying row storage).

As of March 1, 2026, hybrid table requests
are no longer charged as a separate billing category.

Hybrid tables are now billed based on two categories:

* **Hybrid table storage**: A flat monthly rate per GB for data
  stored in hybrid tables.
* **Virtual warehouse compute**: Standard warehouse consumption
  for queries executed against hybrid tables.

For more information, see
[Evaluate cost for hybrid tables](../../../user-guide/tables-hybrid-cost.md).

---
title: Mar 02, 2026: Support for new dbt Core versions for dbt Projects on Snowflake
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-02-dbt-core-versions.md
section: Release Notes
---

# Mar 02, 2026: Support for new dbt Core versions for dbt Projects on Snowflake

Snowflake now supports explicit version pinning for dbt projects with the new DBT_VERSION parameter. You can pin a
dbt Core version when creating, altering, or executing a dbt project object. You can also query supported versions and engine types
using the [SYSTEM$SUPPORTED_DBT_VERSIONS](../../../sql-reference/functions/system_supported_dbt_versions.md) system function to plan
upgrades and maintain environment stability.

The following example creates a dbt project pinned to a specific dbt Core version:

```sqlexample
CREATE DBT PROJECT my_dbt_project
  FROM '@my_stage/dbt_files'
  DBT_VERSION = '1.10.15';
```

The following example overrides the project’s pinned version at execution time:

```sqlexample
EXECUTE DBT PROJECT my_dbt_project
  DBT_VERSION = '1.10.15';
```

This release also introduces the following changes:

* The [DEFAULT_DBT_VERSION](../../../sql-reference/parameters.md) account parameter enables organization administrators to set a default dbt version for all
  future dbt project objects created in the account without requiring users to manually update CREATE DBT PROJECT DDL statements for
  every individual project.
* The [DESCRIBE DBT PROJECT](../../../sql-reference/sql/desc-dbt-project.md) and [SHOW DBT PROJECTS](../../../sql-reference/sql/show-dbt-projects.md) commands now return
  `dbt_version` and `dbt_snowflake_version` columns.
* The [DBT_PROJECT_EXECUTION_HISTORY](../../../sql-reference/functions/dbt_project_execution_history.md) table function now returns `DBT_VERSION` and
  `DBT_SNOWFLAKE_VERSION` columns for auditing which engine version was used for each run.

For more information about the dbt Core versions that Snowflake supports, see
[Supported dbt Core versions for dbt Projects on Snowflake](../../../user-guide/data-engineering/dbt-projects-on-snowflake-dbt-core-versions.md).

---
title: Mar 02, 2026: Using standard SQL clauses to query semantic views (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-02-semantic-views-standard-sql.md
section: Release Notes
---

# Mar 02, 2026: Using standard SQL clauses to query semantic views (*General availability*)

The ability to use SQL clauses in a SELECT statement to query a semantic view is now generally available and is no longer in
[Preview](../../preview-features.md).

You can specify the name of the semantic view in the FROM clause, rather than specifying the SEMANTIC_VIEW clause. For
example, the following query specifies the SEMANTIC_VIEW clause:

```sqlexample
SELECT * FROM SEMANTIC_VIEW(
    tpch_analysis
    DIMENSIONS customer.customer_market_segment
    METRICS orders.order_average_value
  )
  ORDER BY customer_market_segment;
```

The following statement demonstrates how to execute the same query without specifying the SEMANTIC_VIEW clause:

```sqlexample
SELECT customer_market_segment, AGG(order_average_value)
  FROM tpch_analysis
  GROUP BY customer_market_segment
  ORDER BY customer_market_segment;
```

For information, see [Specifying the name of the semantic view in the FROM clause](../../../user-guide/views-semantic/querying.md).

---
title: Mar 03, 2025: Collapsible navigation bar in Snowsight (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-03-snowsight-collapsible-nav-bar.md
section: Release Notes
---

# Mar 03, 2025: Collapsible navigation bar in Snowsight (*General availability*)

With this release, we are pleased to introduce enhancements to the navigation in Snowsight. The left navigation now supports global
collapsing and expanding, with your preference saved across pages, refreshes, and sign-ins. Enhanced animations provide smoother transitions
for menus and submenus, while improved page state memory streamlines navigation across projects, data, and monitoring.

For general information about Snowsight, see [Snowsight: The Snowflake web interface](../../../user-guide/ui-snowsight.md).

---
title: Mar 03, 2025: Native Apps with Snowpark Container Services - Support for AWS PrivateLink (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-03-na-spcs-aws-pl-ga.md
section: Release Notes
---

# Mar 03, 2025: Native Apps with Snowpark Container Services - Support for AWS PrivateLink (*General availability*)

We are pleased to announce the general availability of AWS PrivateLink support in Snowflake Native Apps with Snowpark Container Services. This feature allows
connections from the consumer’s AWS virtual network to apps with containers deployed to a Snowflake virtual network in AWS.

For general information about using AWS PrivateLink in Snowflake, see [AWS PrivateLink and Snowflake](../../../user-guide/admin-security-privatelink.md).
For additional information about apps with containers, see [About Snowflake Native Apps with Snowpark Container Services](../../../developer-guide/native-apps/native-apps-about.md).

---
title: Mar 03, 2025: Native Apps with Snowpark Container Services - Support for Azure Private Link (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-03-na-spcs-azure-pl-pupr.md
section: Release Notes
---

# Mar 03, 2025: Native Apps with Snowpark Container Services - Support for Azure Private Link (*Preview*)

This feature allows connections from the consumer’s Microsoft Azure virtual network to apps with containers deployed in a Snowflake virtual network on Microsoft Azure.

For general information about using Azure Private Link in Snowflake, see [Azure Private Link and Snowflake](../../../user-guide/privatelink-azure.md).
For additional information about apps with containers, see [About Snowflake Native Apps with Snowpark Container Services](../../../developer-guide/native-apps/native-apps-about.md).

---
title: Mar 03, 2025: Snowflake Cortex Document Processing Usage History
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-03-cortex-document-processing-usage-history.md
section: Release Notes
---

# Mar 03, 2025: Snowflake Cortex Document Processing Usage History

Snowflake announces support for viewing the query usage history for document processing functions. You can use the ACCOUNT_USAGE.CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view to see the document processing features, such as Document AI or PARSE_DOCUMENT, that were run and the number of credits that they consume. For example:

```sqlexample
SELECT *
  FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY
  WHERE CREDITS_USED > 0.072
```

For more information, see [CORTEX_DOCUMENT_PROCESSING_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_document_processing_usage_history.md).

---
title: Mar 04, 2025: Universal Search ML model support (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-04-universal-search-ml-models.md
section: Release Notes
---

# Mar 04, 2025: Universal Search ML model support (*General availability*)

With this release, we are pleased to announce that Snowsight now supports ML models in Universal Search results, making it easier
to discover relevant assets.

For details on Universal Search, see [Search Snowflake objects and resources](../../../user-guide/ui-snowsight-universal-search.md).

---
title: Mar 04, 2026: Support for Apache Iceberg™ version 3 (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-04-iceberg-v3-support-preview.md
section: Release Notes
---

# Mar 04, 2026: Support for Apache Iceberg™ version 3 (*Preview*)

Support for version 3 (v3) of the Apache Iceberg™ table specification is now in public preview.

The following data types are supported:

* `geography`
* `geometry`
* `nanosecond`
* `variant`

The following additional features are supported:

* **Default values**: Define default values for columns in Iceberg tables. For more information, see [Use default values with Iceberg tables](../../../user-guide/tables-iceberg-manage.md).
* **Deletion vectors**: Use deletion vectors to improve write performance. For more information, see [Write to tables by using deletion vectors](../../../user-guide/tables-iceberg-manage.md).
* **Row lineage**: Track row-level lineage information for data governance and auditing. For more information, see [Use row lineage with Iceberg tables](../../../user-guide/tables-iceberg-manage.md).

You can read and write to v3 Iceberg tables by using these features with either Snowflake-managed or externally managed Iceberg tables.
Iceberg v3 support is integrated across the Snowflake platform, including streaming and batch ingestion, transformation, analytics, machine
learning, AI, business continuity and disaster recovery, external engine and catalog integrations, and more.

For more information, see [Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](../../../user-guide/tables-iceberg-v3-specification-support.md).

---
title: Mar 05, 2025: Search optimization improves the performance of queries containing scalar subqueries
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-05-search-optimization-scalar-subqueries.md
section: Release Notes
---

# Mar 05, 2025: Search optimization improves the performance of queries containing scalar subqueries

The search optimization service can now improve the performance of queries containing scalar subqueries.
A scalar subquery returns a single value (one column of one row). To improve query performance,
make sure search optimization is enabled for the column that is equal to the result of the subquery.

For more information, see [Speeding up queries with scalar subqueries using search optimization](../../../user-guide/search-optimization/scalar-subqueries.md).

---
title: Mar 05, 2025: Snowpark Container Services support for application metrics
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-05-spcs-application-metrics.md
section: Release Notes
---

# Mar 05, 2025: Snowpark Container Services support for application metrics

In addition to the [platform metrics](../../../developer-guide/snowpark-container-services/monitoring-services.md) Snowflake provides for compute pools in your account, Snowpark Container Services now supports application metrics and traces generated by your service. Your service containers can produce OLTP or Prometheus metrics, which Snowflake publishes to the event table configured for your account. For more information, see [Publishing and accessing application metrics](../../../developer-guide/snowpark-container-services/monitoring-services.md).

---
title: Mar 05, 2026: AI_COMPLETE document intelligence (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-05-ai-complete-document-intelligence.md
section: Release Notes
---

# Mar 05, 2026: AI_COMPLETE document intelligence (*Preview*)

The multimodal AI_COMPLETE AI Function now supports document inputs, providing reasoning over document files, such as
PDFs or Microsoft Word files, stored in Snowflake internal or external stages. This enhancement allows AI_COMPLETE to
analyze text, charts, tables, and structured data within documents. This feature joins existing support in AI_COMPLETE
for text and image inputs.

With this release, you can apply industry-leading large language models (LLMs) directly to staged documents for
contextual Q&A, summarization, and information extraction. For example, you can:

* Answer questions about charts and diagrams embedded in PDFs.
* Compare information across multiple documents in a single prompt.
* Generate summaries tailored to a specific audience or perspective.
* Extract entities and structured insights from reports, contracts, spreadsheets, and technical documentation.

To get started, see [AI_COMPLETE with documents](../../../user-guide/snowflake-cortex/ai-complete-document-intelligence.md).

---
title: Mar 05, 2026: Exporting a semantic view to a Tableau Data Source (TDS) file (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-05-semantic-views-tableau-tds.md
section: Release Notes
---

# Mar 05, 2026: Exporting a semantic view to a Tableau Data Source (TDS) file (*Preview*)

You can now export a semantic view to a
[Tableau Data Source (TDS) file](https://help.tableau.com/current/pro/desktop/en-us/export_connection.htm#options-for-saving-a-local-data-source).
To do this, call the [SYSTEM$EXPORT_TDS_FROM_SEMANTIC_VIEW](../../../sql-reference/functions/system_export_tds_from_semantic_view.md) function.

Support for exporting semantic views to TDS files is in
[Preview](../../preview-features.md).

For information, see [Exporting a semantic view to a Tableau Data Source (TDS) file](../../../user-guide/views-semantic/sql.md).

---
title: Mar 05, 2026: Preventing a semantic view metric from being aggregated across specific dimensions
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-05-semantic-views-semi-additive-metrics.md
section: Release Notes
---

# Mar 05, 2026: Preventing a semantic view metric from being aggregated across specific dimensions

If a metric should not be aggregated across specific dimensions, you can now specify those dimensions in the NON ADDITIVE BY
clause of the [CREATE SEMANTIC VIEW](../../../sql-reference/sql/create-semantic-view.md) command.

For example, to prevent the metric from being aggregated by some date dimensions:

```sqlexample
CREATE OR REPLACE SEMANTIC VIEW bank_accounts_sv
  TABLES (
    bank_accounts
  )
  DIMENSIONS (
    bank_accounts.customer_id_dim AS bank_accounts.customer_id,
    bank_accounts.account_type_dim AS bank_accounts.account_type,
    bank_accounts.year_dim AS bank_accounts.year,
    bank_accounts.month_dim AS bank_accounts.month,
    bank_accounts.day_dim AS bank_accounts.day
  )
  METRICS (
    bank_accounts.m_account_balance
      NON ADDITIVE BY (year_dim, month_dim, day_dim)
      AS SUM(balance)
  );
```

For more information, see [Identifying the dimensions that should be non-additive for a metric](../../../user-guide/views-semantic/sql.md).

---
title: Mar 05, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-05-dcr.md
section: Release Notes
---

# Mar 05, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.5

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* You can now upload and use custom Python code in a collaboration. To learn more, see [Code bundles](../../../user-guide/cleanrooms/resources-code-bundles.md).
* The [PROCESS_ACTIVATION](../../../user-guide/cleanrooms/collaboration-api-reference.md) signature has changed. The original signature was:

  > ```sqlsyntax
  > PROCESS_ACTIVATION( <collaboration_name>, <segment_name> )
  > ```

  The new signature is:

  > ```sqlsyntax
  > PROCESS_ACTIVATION(<collaboration_name> [, <batch_ids> ] )``
  > ```
* A new [VIEW_REGISTRIES](../../../user-guide/cleanrooms/collaboration-api-reference.md) procedure is now available to list custom registries that you can access.
* Updates to private preview features.

---
title: Mar 06, 2025: Cortex AI PARSE_DOCUMENT function for OCR (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-06-ocr-mode-parse-document.md
section: Release Notes
---

# Mar 06, 2025: Cortex AI PARSE_DOCUMENT function for OCR (*General availability*)

Snowflake is pleased to announce the General Availability of Snowflake Cortex AI PARSE_DOCUMENT’s OCR mode, enabling
customers to accurately extract text and data from millions of document pages. This SQL function is fully-managed,
offering OCR quality on par with other cloud providers in combination with the scalability, performance, and ease of use
of Snowflake. PARSE_DOCUMENT OCR extracts text content from PDF, DOCX, and PPTX files stored in a Snowflake or external
stage using SQL, without requiring a complex cloud architecture.

The Cortex AI PARSE_DOCUMENT OCR mode enables:

* Text extraction from both digital-born and scanned documents.
* High-quality extraction for documents in English, German, French, Italian, Norwegian, Polish, Portuguese, Spanish, and
  Swedish.
* Seamless integration with RAG pipelines powering Cortex Search, and with Cortex AI Functions for document
  summarization, translation, and entity extraction.
* Automatic page orientation detection.

For details, see [Parsing documents with AI_PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/parse-document.md).

---
title: Mar 06, 2026: SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG function (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-06-system-get-catalog-linked-database-config.md
section: Release Notes
---

# Mar 06, 2026: SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG function (*General availability*)

The SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG function is now available. You can use this function to get the configuration for
catalog-linked databases.

For more information, see [SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG](../../../sql-reference/functions/system_get_catalog_linked_database_config.md).

---
title: Mar 07, 2025: RESOURCE_CONSTRAINT clause for Snowpark-optimized warehouses (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-07-snowpark-optimized-warehouses-resource_constraint.md
section: Release Notes
---

# Mar 07, 2025: RESOURCE_CONSTRAINT clause for Snowpark-optimized warehouses (*General availability*)

You can now specify the memory and CPU architecture for
[Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md).
You can use the RESOURCE_CONSTRAINT clause with the CREATE WAREHOUSE and ALTER WAREHOUSE commands.
This feature was previously in public preview.

> **Important:**
>
> The RESOURCE_CONSTRAINT clause is generally available for 16 GB and 256 GB memory sizes.
> The 1 TB sizes, which you specify with the RESOURCE_CONSTRAINT parameters `MEMORY_64X`
> and `MEMORY_64X_x86`, are still in preview. Also, the 1 TB sizes are currently available
> for the Amazon Web Services (AWS) cloud provider, not for Microsoft Azure and Google Cloud Platform (GCP).

For more information, see [Configuration options for Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md), [CREATE WAREHOUSE](../../../sql-reference/sql/create-warehouse.md), and [ALTER WAREHOUSE](../../../sql-reference/sql/alter-warehouse.md).

---
title: Mar 09, 2026: Streamlit in Snowflake container runtime and secrets support (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-09-sis-container-runtime-ga.md
section: Release Notes
---

# Mar 09, 2026: Streamlit in Snowflake container runtime and secrets support (*General availability*)

The Streamlit in Snowflake container runtime is now generally available. Container runtimes run your Streamlit in Snowflake apps on
Snowpark Container Services compute pools, providing access to GPUs, broader Python package support, and long-running
services without sleep timers.

This release also includes general availability for the following container-runtime features:

* **Secrets**: Use `st.secrets` to securely access Snowflake secrets in your container-runtime apps.
  Secrets are also automatically mapped to environment variables.
* **Sharing**: Share container-runtime apps using app-viewer URLs without the Snowsight interface.
* **Logging and tracing**: Container runtimes automatically capture standard output and standard error
  from your apps.

Container runtimes are available in all commercial regions. Government and China regions are not supported.

For more information, see:

* [Runtime environments for Streamlit apps](../../../developer-guide/streamlit/app-development/runtime-environments.md)
* [Manage secrets and configure your Streamlit app](../../../developer-guide/streamlit/app-development/secrets-and-configuration.md)
* [Sharing Streamlit in Snowflake apps](../../../developer-guide/streamlit/features/sharing-streamlit-apps.md)
* [Logging and tracing for Streamlit in Snowflake](../../../developer-guide/streamlit/features/logging-tracing.md)

---
title: Mar 11, 2026: Resource budgets for Cortex Agents
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-11-cortex-agents-resource-budgets.md
section: Release Notes
---

# Mar 11, 2026: Resource budgets for Cortex Agents

Snowflake now supports resource budgets to control credit spending for Cortex Agents. Resource budgets let you define
a monthly spending limit and take automated actions, such as revoking access, when spending exceeds your limits.

For more information, see [Resource budgets for Cortex Agents](../../../user-guide/snowflake-cortex/cortex-agents-resource-budgets.md).

---
title: Mar 11, 2026: Resource budgets for Snowflake Intelligence
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-11-snowflake-intelligence-resource-budgets.md
section: Release Notes
---

# Mar 11, 2026: Resource budgets for Snowflake Intelligence

Snowflake now supports resource budgets to control credit spending for Snowflake Intelligence. Resource budgets let you define
a monthly spending limit and take automated actions, such as revoking access, when spending exceeds your limits.

For more information, see [Resource budgets for Snowflake Intelligence](../../../user-guide/snowflake-cortex/snowflake-intelligence/si-resource-budgets.md).

---
title: Mar 12, 2025: Support for st.file_uploader (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-12-sis.md
section: Release Notes
---

# Mar 12, 2025: Support for `st.file_uploader` (General availability)

[st.file_uploader](https://docs.streamlit.io/develop/api-reference/widgets/st.file_uploader) is now generally available in Streamlit in Snowflake.

---
title: Mar 12, 2026: AI code suggestions in Workspaces (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-cortex-code-ai-suggestions-preview.md
section: Release Notes
---

# Mar 12, 2026: AI code suggestions in Workspaces (*Preview*)

AI code suggestions in Cortex Code are now available in public preview.

As you type in a SQL file in Workspaces, Cortex Code provides context-aware inline suggestions
to help you complete your statements faster. Suggestions appear as gray text at your cursor position
and are generated based on your query history, the content of the current workspace, table schemas, and
recently executed queries.

For details, see [AI code suggestions](../../../user-guide/cortex-code/cortex-code-snowsight.md).

---
title: Mar 12, 2026: AI_EXTRACT scale factor parameter (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-ai-extract.md
section: Release Notes
---

# Mar 12, 2026: AI_EXTRACT scale factor parameter (*General availability*)

The AI_EXTRACT function now supports the optional `scale_factor` parameter to improve extraction quality when
you receive unexpected or unclear responses in the following scenarios:

* Documents with page sizes larger than A4
* Documents containing small text, detailed visual elements, or dense layouts
* Extracted text contains typos or character-level OCR errors

For more information, see [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md).

---
title: Mar 12, 2026: Investigate cost anomalies using hourly consumption by service type
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-cost-anomaly-hourly-consumption-by-service-type.md
section: Release Notes
---

# Mar 12, 2026: Investigate cost anomalies using hourly consumption by service type

When investigating a cost anomaly, you can now view hourly consumption broken down by service type. This enhancement lets you
see which service types (for example, `AI_SERVICES`) are contributing to your consumption during each hour of the day, making
it easier to identify the root cause of a cost anomaly.

You can investigate hourly consumption by service type using Snowsight or SQL.

* **Snowsight web interface:** When you investigate an account-level anomaly, the Top consumption drivers section now shows
  consumption broken down by the top service types for each hour.

  For more information, see [Identify and investigate cost anomalies with Snowsight](../../../user-guide/cost-anomalies-ui.md).
* **ANOMALY_INSIGHTS class:** A new [GET_HOURLY_CONSUMPTION_BY_SERVICE_TYPE](../../../sql-reference/classes/anomaly-insights/methods/get_hourly_consumption_by_service_type.md)
  method returns the hourly consumption for a given day, broken down by the top service types.

  For more information, see [Hourly consumption by service type](../../../user-guide/cost-anomalies-class.md).

---
title: Mar 12, 2026: Multi-Location Resilience for Data Pipelines (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-multi-location-resilience-data-pipelines-ga.md
section: Release Notes
---

# Mar 12, 2026: Multi-Location Resilience for Data Pipelines (General availability)

Multi-location resilience for data pipelines helps you safeguard your data
pipelines against potential region-wide cloud provider outages. It ensures that,
upon failing over to a secondary location, file-based data ingestion
(specifically Snowpipe and COPY INTO) resumes processing new data without
interruption.

Key use cases include the following:

* **Business continuity and disaster recovery:** Maintain uninterrupted data
  flows and ensure critical dashboards and machine learning models are fed with
  fresh data during region-wide cloud provider outages.
* **Regulatory compliance:** Satisfy regulatory mandates that require
  multi-region resilience without building complex, custom infrastructure.
* **Cross-cloud flexibility:** Fail over data ingestion pipelines across cloud
  providers, eliminating single-vendor infrastructure lock-in for your disaster
  recovery architecture.
* **Zero-engineering overhead:** Get exactly-once ingestion semantics upon
  failover or failback without requiring manual reconciliation or custom
  deduplication scripts.

You can enable this feature by configuring a Multi-Location Storage Integration
(MLSI) and a Multi-Queue Notification Integration (MQNI) to transition your
active storage and message queues during failover, combined with routing your
files through a dual-write architecture.

For more information, see [Multi-Location Resilience for Data Pipelines](../../../user-guide/multi-location-resilience-data-pipelines.md).

---
title: Mar 12, 2026: Recent Cortex Search updates (Generally Available)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-recent-cortex-search.md
section: Release Notes
---

# Mar 12, 2026: Recent Cortex Search updates (*Generally Available*)

The following Cortex Search features, previously available in preview, are now generally available.

## Multi-index search

Cortex Search services now support multiple searchable columns within a single service, with targeted queries to specific indexes.
For more information, see [Multi-index Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Custom vector embeddings

You can now create Cortex Search services that use pre-computed vector embeddings instead of, or in addition to,
Snowflake-provided embeddings for hybrid retrieval with your own or third-party models. For more information, see
[Custom vector embeddings](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

## Enhanced Cortex Search tool for Cortex Agents and Snowflake Intelligence

The Cortex Search tool now supports dynamic control over search behavior via the latest Cortex search API. Its capabilities include:

* **Search service selection**: Query a single search service based on tool descriptions rather than all services, reducing latency and cost.
* **Dynamic filters**: Apply filter conditions on each search call using attribute columns.
* **Dynamic columns**: Specify which metadata columns to retrieve per search call.
* **Dynamic result count**: Set the number of results per call, up to 500.
* **Multi-index query support**: Issue per-index queries to multi-index services.

To enable this functionality, configure column descriptions in the `columns_and_descriptions` field of the Cortex Search tool resource.

For more information, see [Add tools to agents](../../../user-guide/snowflake-cortex/cortex-agents-manage.md).

---
title: Mar 12, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-12-dcr.md
section: Release Notes
---

# Mar 12, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.6

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* [VIEW_UPDATE_REQUESTS](../../../user-guide/cleanrooms/collaboration-api-reference.md) now returns new status messages: COMPLETED and FAILED.
* The [VIEW_ACTIVATIONS](../../../user-guide/cleanrooms/collaboration-api-reference.md) response has replaced the `activation_id` column with a `batch_id` column.
* Updates to private preview features.

---
title: Mar 13, 2026: Cortex Agent evaluations (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-13-cortex-agent-evaluations.md
section: Release Notes
---

# Mar 13, 2026: Cortex Agent evaluations (*General availability*)

Snowflake now offers Cortex Agent evaluations that allow you to monitor your agent’s behavior and performance. Evaluate your agent against both ground truth-based and reference-free evaluation metrics. During evaluation, your agent’s activity is traced and monitored so you can ensure that each step in the process advances towards your end goal.

Snowflake offers the following metrics to evaluate your agent against:

* **Answer correctness** – How closely the answer from an agent to your prepared query matches an expected answer. This metric is most useful when the dataset powering your Cortex Agent is static.
* **Logical consistency** – Measures consistency across agent instructions, planning, and tool calls. This metric is *reference-free*, meaning you don’t need to prepare any information in your dataset for evaluation.
* **Custom metrics** – Snowflake also allows you to create custom metrics. By defining a prompt and scoring system, you can take advantage of the LLM judging process to perform additional consistency checks or compliance with domain-specific requirements.

For information on how to create and run a Cortex Agent evaluation, see [Cortex Agent evaluations](../../../user-guide/snowflake-cortex/cortex-agents-evaluations.md).

---
title: Mar 13, 2026: Network Policy Advisor — General availability
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-13-network-policy-advisor-ga.md
section: Release Notes
---

# Mar 13, 2026: Network Policy Advisor — *General availability*

The Network Policy Advisor guides security administrators through the process of designing an ingress network policy based on historical
access data. The feature then helps the administrators evaluate the policy with a what-if simulation before they activate the policy.

For more information, see [Network Policy Advisor](../../../user-guide/network-policy-advisor.md).

---
title: Mar 13, 2026: New OVERLAP_POLICY parameter for task graphs
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-13-tasks-overlap-policy.md
section: Release Notes
---

# Mar 13, 2026: New OVERLAP_POLICY parameter for task graphs

The new `OVERLAP_POLICY` parameter for task graphs replaces the deprecated `ALLOW_OVERLAPPING_EXECUTION` parameter
and provides more granular control over concurrent task graph execution. You can set this parameter on a root task using
[CREATE TASK](../../../sql-reference/sql/create-task.md) or [ALTER TASK](../../../sql-reference/sql/alter-task.md).

`OVERLAP_POLICY` supports three values:

* `NO_OVERLAP` (default): Executes tasks serially. The next run of a root task is scheduled only after all child tasks finish running.
* `ALLOW_CHILD_OVERLAP`: Allows child task parallelism. A new instance of the task graph can start while child tasks from a
  previous run are still executing. Root tasks never overlap with this policy.
* `ALLOW_ALL_OVERLAP`: Allows true parallelism. Multiple instances of the entire task graph, including the root task,
  can run concurrently.

For backward compatibility, `ALLOW_OVERLAPPING_EXECUTION = TRUE` maps to `OVERLAP_POLICY = ALLOW_CHILD_OVERLAP`, and
`ALLOW_OVERLAPPING_EXECUTION = FALSE` maps to `OVERLAP_POLICY = NO_OVERLAP`.

For more information, see [Create a sequence of tasks with a task graph](../../../user-guide/tasks-graphs.md).

---
title: Mar 13, 2026: Support for specifying relationship paths in semantic views (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-13-semantic-views-multi-path.md
section: Release Notes
---

# Mar 13, 2026: Support for specifying relationship paths in semantic views (*Preview*)

In some cases, multiple relationship paths might exist between two specific logical tables in a semantic view. In these cases,
you can now specify which relationship path to use when defining a metric.

Support for specifying the relationship path is in [Preview](../../preview-features.md).

In the METRICS clause of the [CREATE SEMANTIC VIEW](../../../sql-reference/sql/create-semantic-view.md) command, specify the name of the relationship to use
in the USING clause:

```sqlsyntax
METRICS (
  <table_alias>.<metric>
    [ USING ( <relationship_name> [ , ... ] )
    AS <sql_expr>
  [ , ... ]
)
```

For more information, see [Specifying the relationship for a metric when multiple relationship paths exist](../../../user-guide/views-semantic/sql.md).

---
title: Mar 13, 2026: Time distribution information added to STATISTICS column in dynamic table refresh history
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-13-dt-time-distribution.md
section: Release Notes
---

# Mar 13, 2026: Time distribution information added to STATISTICS column in dynamic table refresh history

The STATISTICS column in [DYNAMIC_TABLE_REFRESH_HISTORY](../../../sql-reference/functions/dynamic_table_refresh_history.md) and
[DYNAMIC_TABLE_REFRESH_HISTORY view](../../../sql-reference/account-usage/dynamic_table_refresh_history.md) now includes time distribution information for
dynamic table refreshes. The following new properties are added:

* `queuedTimeMs`: The time (in milliseconds) spent in the queued state.
* `compilationTimeMs`: The time (in milliseconds) spent compiling the refresh query.
* `executionTimeMs`: The time (in milliseconds) spent executing the refresh query.

For successful refreshes, the STATISTICS column includes both the existing row/partition statistics and the new time
distribution information. For example:

```json
{
  "numAddedPartitions": 1,
  "numCopiedRows": 0,
  "numDeletedRows": 25,
  "numInsertedRows": 36,
  "numRemovedPartitions": 1,
  "queuedTimeMs": 123,
  "compilationTimeMs": 456,
  "executionTimeMs": 789
}
```

For failed refreshes, the STATISTICS column is now populated with the time distribution information (previously it
defaulted to an empty object). For example:

```json
{
  "queuedTimeMs": 123,
  "compilationTimeMs": 456,
  "executionTimeMs": 789
}
```

For more information, see [DYNAMIC_TABLE_REFRESH_HISTORY](../../../sql-reference/functions/dynamic_table_refresh_history.md) (Information Schema) and
[DYNAMIC_TABLE_REFRESH_HISTORY view](../../../sql-reference/account-usage/dynamic_table_refresh_history.md) (Account Usage).

---
title: Mar 16, 2026: Apache Iceberg™ tables: Write support by using an external query engine (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-16-tables-iceberg-query-using-external-query-engine-snowflake-horizon-writes-feature.md
section: Release Notes
---

# Mar 16, 2026: Apache Iceberg™ tables: Write support by using an external query engine (*Preview*)

You can now write to Snowflake-managed Apache Iceberg™ tables by using any
external query engine that supports the open Iceberg REST protocol, such as Apache Spark™. To ensure this interoperability with
external engines, [Apache Polaris™ (incubating)](https://github.com/apache/polaris) is integrated into Horizon Catalog. You can write to
these tables in a Snowflake account by using a single Horizon Catalog endpoint and you can use your existing users, roles, policies,
and authentication in Snowflake.

For more information, see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](../../../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

---
title: Mar 16, 2026: Metering disabled for hybrid table requests
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-16-hybrid-tables-metering-disabled.md
section: Release Notes
---

# Mar 16, 2026: Metering disabled for hybrid table requests

As a follow-up to the announcement of
[simplified pricing for hybrid tables](2026-03-02-hybrid-tables-pricing.md),
Snowflake has disabled metering for hybrid table requests. You will no longer see new
events in the following views:

* [HYBRID_TABLE_USAGE_HISTORY](../../../sql-reference/account-usage/hybrid_table_usage_history.md)
* Account Usage [METERING_DAILY_HISTORY](../../../sql-reference/account-usage/metering_daily_history.md)
* Organization Usage [METERING_DAILY_HISTORY](../../../sql-reference/organization-usage/metering_daily_history.md)

Historical consumption data that was recorded before this change is still available in these views and can be queried.

---
title: Mar 16, 2026: Snowflake Notebooks renamed to Legacy Snowflake Notebooks
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-16-legacy-notebooks.md
section: Release Notes
---

# Mar 16, 2026: Snowflake Notebooks renamed to Legacy Snowflake Notebooks

The original Snowflake Notebooks have been renamed to **Legacy Notebooks**. This rename reflects Snowflake’s commitment to transitioning all users to
[Notebooks in Workspaces](../../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md), which is the next-generation
notebook experience on Snowflake. For a comparison of the two experiences, see [Key differences between legacy and new notebooks](../../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-migrate.md).

## Migration to Notebooks in Workspaces

Snowflake will migrate all users from Legacy Notebooks to Notebooks in Workspaces over the next few quarters. Before any mandatory migration is enforced, a Behavior
Change Request (BCR) will be issued so that you can prepare in advance. Snowflake will communicate the deprecation timeline and migration process ahead of time, before any action is
required of your account.

In the meantime, Legacy Notebooks will remain fully available.

For more information about the new experience, see [Notebooks in Workspaces](../../../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md).

---
title: Mar 17, 2025: Document AI release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-17-document-ai.md
section: Release Notes
---

# Mar 17, 2025: Document AI release notes

When you review the answers that the Document AI model provides, you can now highlight the answer
within a document by selecting the locate answer icon.

Note that highlighting does not work for the answers provided by the model before this release.
If you want to highlight these answers, refresh the question to reload the answer.

---
title: Mar 17, 2025: Snowflake Notebooks on Container Runtime for AWS (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-17-notebooks-on-spcs-aws.md
section: Release Notes
---

# Mar 17, 2025: Snowflake Notebooks on Container Runtime for AWS (*General availability*)

With this release, Snowflake Notebooks are now supported on Container Runtime on AWS commercial regions, including PrivateLink.

## New features

* Preconfigured ML environment: Base machine-learning runtime image with the most popular ML development packages pre-installed.
* Scalable compute resources: Access to configurable CPU or GPU pools for resource-efficient model development.
* Enhanced package management: Support for Pip and Conda Python package version upgrades.
* Real-time resource monitoring: Hover over the Active button to view detailed CPU, GPU, and memory usage metrics.
* Optimized GPU notebook storage: Notebooks running on GPU compute pools now use high-performance NVMe storage as the default boot device.
* Maximize session uptime: Sessions run up to seven days (for example, long-running jobs) without maintenance disruptions.

For more information, see [Notebooks on Container Runtime](../../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

---
title: Mar 17, 2026: Openflow Connector for Google BigQuery (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-17-openflow-bigquery-pupr.md
section: Release Notes
---

# Mar 17, 2026: Openflow Connector for Google BigQuery (*Preview*)

The Openflow Connector for Google BigQuery is now available in preview. The connector replicates
datasets, tables, and views from Google BigQuery into Snowflake. Tables
are synchronized using incremental change capture with BigQuery’s
native CHANGES function. Views are replicated using a truncate and load
strategy. The connector leverages the BigQuery Storage Read API for
high-throughput data transfer.

For more information, see
[About the Openflow Connector for Google BigQuery](../../../user-guide/data-integration/openflow/connectors/google-big-query/about.md).

---
title: Mar 19, 2025: Additional file format support for Cortex AI Parse Document
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-19-parse-document-more-file-formats.md
section: Release Notes
---

# Mar 19, 2025: Additional file format support for Cortex AI Parse Document

Snowflake announces that the Cortex AI PARSE_DOCUMENT function now supports an expanded range of file formats to deliver
more comprehensive document analysis. The new file formats are image formats and include TIFF, TIF, JPEG, JPG, and PNG.
PARSE_DOCUMENT already supported PDF, DOCX, and PPTX file formats.

Customers can now use Cortex AI PARSE_DOCUMENT to streamline document parsing and extract insights from all enterprise documents.

For details, see [Cortex PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/parse-document.md).

---
title: Mar 19, 2025: Alerts on new data (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-19-alerts-on-new-data.md
section: Release Notes
---

# Mar 19, 2025: Alerts on new data (*Preview*)

You can now use alerts on new data to monitor dynamic table refreshes and task completions.

An [alert on new data](../../../user-guide/alerts.md) is executed when new rows are added to a specified table or view.
Snowflake evaluates the condition against the new rows.

You can set up an alert on new data to notify you when new rows for error messages are inserted into the
[event table](../../../developer-guide/logging-tracing/event-table-setting-up.md) for your account. Because dynamic table refreshes
and task executions log events to the event table, you can set up an alert on new data to:

* [Monitor dynamic table refreshes](../../../user-guide/dynamic-tables-monitor-event-table-alerts.md).
* [Monitor task executions](../../../user-guide/tasks-events.md).

For more information, see [Alerts on new data](../../../user-guide/alerts.md).

---
title: Mar 19, 2026: Artifacts in Snowflake Intelligence (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-19-snowflake-intelligence-artifacts.md
section: Release Notes
---

# Mar 19, 2026: Artifacts in Snowflake Intelligence (*Preview*)

Using artifacts in Snowflake Intelligence, you can save, share, and revisit tables and charts without regenerating them. An artifact is a persistent chart or table object that Snowflake Intelligence generates in response to a question.

With artifacts, you can:

* Save charts and tables to your artifacts hub for later access.
* Share artifacts with teammates using a link, with data filtered through their own data permissions.
* Ask follow-up questions on saved artifacts while retaining context.
* Refresh artifacts to see fresh data at any time.

For more information, see [Artifacts in Snowflake Intelligence](../../../user-guide/snowflake-cortex/snowflake-intelligence/artifacts.md).

---
title: Mar 2, 2026: Monitor and control Cortex AI Functions spending (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-02-25-ai-functions-cost-management.md
section: Release Notes
---

# Mar 2, 2026: Monitor and control Cortex AI Functions spending (*General availability*)

The CORTEX_AI_FUNCTIONS_USAGE_HISTORY account usage view provides detailed telemetry for Cortex AI Functions, helping you to monitor usage patterns and implement automated cost controls across your organization. You can track credit consumption by function, model, user, role, warehouse, and query, forming the foundation for proactive AI cost governance.

Using the new view, you can:

* **Detect total monthly spending:** Aggregate credits at the account level to monitor overall AI Functions consumption and trigger alerts when predefined thresholds are exceeded.
* **Enforce monthly per-user spending limits:** Track usage by individual users and automatically revoke or restore AI function access based on configurable monthly credit limits.
* **Detect and cancel runaway queries:** Identify long-running or high-credit AI function queries and automatically cancel them before additional credits are consumed. This capability can be used as a drop-in replacement for previous query credit limit patterns.

The release is accompanied by a comprehensive user guide topic that includes production-ready SQL examples for:

* Daily and monthly credit consumption analysis
* Account-level monthly spending alerts
* Role-based access control with automated budget enforcement
* Automated monitoring and cancellation of excessive queries

For more information, see:

* [CORTEX_AI_FUNCTIONS_USAGE_HISTORY view reference](../../../sql-reference/account-usage/cortex_ai_functions_usage_history.md)
* [Managing AI Functions Cost with Account Usage](../../../user-guide/snowflake-cortex/ai-func-cost-management.md)

---
title: Mar 20, 2025: Data Governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-20-cortex-descriptions.md
section: Release Notes
---

# Mar 20, 2025: Data Governance release notes

## Cortex Powered Object Descriptions: Support for additional table types

You can now use the Snowflake Cortex COMPLETE function to generate descriptions for the following table types:

* Dynamic tables
* Hybrid tables
* Apache Iceberg™ tables
* External tables

For more information about Cortex Powered Object Descriptions, see [Generate descriptions with Snowflake Cortex](../../../user-guide/ui-snowsight-cortex-descriptions.md).

---
title: Mar 20, 2025: Snowflake Datasets (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-20-snowflake-ml-datasets.md
section: Release Notes
---

# Mar 20, 2025: Snowflake Datasets (*General availability*)

Snowflake announces the general availability of Snowflake Datasets, a new schema-level object designed for machine learning workflows. Datasets enable efficient data management and versioning, providing immutable snapshots of data for reproducible model training and testing.

Key features include:

* Versioned, materialized snapshots of data with guaranteed immutability.
* Seamless integration with Snowflake ML, SQL, and external ML frameworks.
* Support for lineage tracking, enabling end-to-end governance in ML workflows.

For more information, see [Snowflake Datasets](../../../developer-guide/snowflake-ml/dataset.md).

---
title: Mar 20, 2026: Apache Iceberg™ tables: Support for the Azure Data Lake Storage Gen2 with external volumes (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-20-iceberg-azure-dls-external-volumes.md
section: Release Notes
---

# Mar 20, 2026: Apache Iceberg™ tables: Support for the Azure Data Lake Storage Gen2 with external volumes (*Preview*)

Snowflake now supports specifying Azure Data Lake Storage Gen2 (Azure Data Lake Storage) when you configure
an external volume for Apache Iceberg™ tables on Azure.

This update enables interoperability between Snowflake and remote catalogs that are only configured to use Data Lake Storage, as follows:

* You can create Snowflake-managed Iceberg tables that the query engine for these remote catalogs can read and write to.
* You can use Snowflake to read and write to remote tables in these remote catalogs.

For example, this update enables interoperability with Unity Catalog hosted on Azure.

To specify Data Lake Storage when you configure a new external volume, specify
`azure://account.dfs.core.windows.net/container[/path/]` for the STORAGE_BASE_URL.
For more information, see [Configure an external volume for Azure](../../../user-guide/tables-iceberg-configure-external-volume-azure.md).

> **Note:**
>
> To make your existing Iceberg tables in Blob Storage interoperable with catalogs that are only configured to use Data Lake Storage,
> you can migrate the tables to Data Lake Storage. For instructions, see [Migrate an Iceberg table to Azure Data Lake Storage](../../../user-guide/tables-iceberg-manage.md).

---
title: Mar 20, 2026: Block public access to internal stages with IP allowlist exceptions (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-20-block-public-stage-access-with-exceptions.md
section: Release Notes
---

# Mar 20, 2026: Block public access to internal stages with IP allowlist exceptions (*General availability*)

You can now block public access to Microsoft Azure internal stages while maintaining an allowlist of IP addresses or
CIDR blocks that are permitted to reach the internal stage. The new
SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION function extends the existing set of functions for blocking
public access to internal stages by letting you specify exceptions, rather than blocking all public IP addresses.

For more information, see [Blocking public access with IP allowlist exceptions](../../../user-guide/private-internal-stages-azure.md) and
[SYSTEM$BLOCK_INTERNAL_STAGES_PUBLIC_ACCESS_WITH_EXCEPTION](../../../sql-reference/functions/system_block_internal_stages_public_access_with_exception.md).

---
title: Mar 20, 2026: DCM Projects (Preview)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-20-dcm-projects.md
section: Release Notes
---

# Mar 20, 2026: DCM Projects (*Preview*)

Snowflake DCM Projects are now available in preview. DCM Projects enable a declarative approach to managing Snowflake objects as code. You define the
desired target state of your databases, tables, tasks, and other Snowflake objects in definition files, and Snowflake determines and applies
the necessary changes to reach that state.

DCM Projects support version-controlled, idempotent deployments across environments (such as dev, staging, and production) using a plan-then-deploy
workflow. Key capabilities include:

* **Declarative definitions**: Use DEFINE statements in SQL files to describe the desired state of your Snowflake objects. Snowflake determines the changes needed and applies them automatically.
* **Jinja templating**: Parameterize your definitions with variables, loops, conditions, and macros to reduce repetition and support multi-environment deployments.
* **Plan-then-deploy workflow**: Reliably preview planned changes before deploying them to catch unintended modifications.
* **Broad object support**: Manage a wide variety of Snowflake object types across infrastructure, data pipeline, and governance use cases.
* **Pipeline management**: Build, test, and deploy data pipelines using dynamic tables, tasks, and data quality expectations.

DCM Projects can be managed using Snowsight, Snowflake CLI, SQL, or Cortex Code CLI. Project definition files can be stored in a Snowflake Workspace, a
remote Git repository, or a local directory.

For more information, see [Snowflake DCM Projects](../../../user-guide/dcm-projects/dcm-projects-overview.md).

---
title: Mar 20, 2026: Trust Center Extensions (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-20-tc-extensions-ga.md
section: Release Notes
---

# Mar 20, 2026: Trust Center Extensions (*General availability*)

Trust Center extensions are native applications that you can build or get from the
[Snowflake Marketplace](https://app.snowflake.com/marketplace/providers?categorySecondary=%5B%2231%22%5D) for use cases such as security,
compliance, and data governance. An extension developer can provide a Trust Center extension as either a public or a private [listing](../../../collaboration/collaboration-listings-about.md).

For more information, see [Using Trust Center extensions](../../../user-guide/trust-center/trust-center-extensions.md).

---
title: Mar 24, 2025: Support for st.experimental_audio_input and st.camera_input (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-24-sis.md
section: Release Notes
---

# Mar 24, 2025: Support for `st.experimental_audio_input` and `st.camera_input` (General availability)

[st.experimental_audio_input](https://docs.streamlit.io/1.39.0/develop/api-reference/widgets/st.audio_input) and
[st.camera_input](https://docs.streamlit.io/1.39.0/develop/api-reference/widgets/st.camera_input) are now generally available in Streamlit in Snowflake.

---
title: Mar 24, 2026: ARRAY_REPEAT function
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-24-array-repeat-function.md
section: Release Notes
---

# Mar 24, 2026: ARRAY_REPEAT function

The following function is available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Semi-structured and structured data | [ARRAY_REPEAT](../../../sql-reference/functions/array_repeat.md) | Returns an ARRAY value containing a specified number of copies of an element. |

---
title: Mar 24, 2026: MAP_ENTRIES function
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-24-map-entries-function.md
section: Release Notes
---

# Mar 24, 2026: MAP_ENTRIES function

The following function is available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Semi-structured and structured data | [MAP_ENTRIES](../../../sql-reference/functions/map_entries.md) | Returns an ARRAY value of key-value pair objects for each entry in a MAP value. |

---
title: Mar 26, 2025: Support for multiple semantic models in Cortex Analyst queries (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-26-multiple-models-cortex-analyst.md
section: Release Notes
---

# Mar 26, 2025: Support for multiple semantic models in Cortex Analyst queries (*General availability*)

Snowflake is pleased to announce the general availability of support for multiple semantic models in Cortex Analyst.

When making a query, you can now include a list of semantic models, instead of just one, and let Cortex Analyst choose
the best model for the query. This improvement allows for straightforward implementation of a single search UI to query
multiple data sources and simplifies client programming.

For more information, see [Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md).

---
title: Mar 26, 2026: New SCHEDULER attribute for dynamic tables — General availability
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-26-dynamic-table-scheduler-attribute.md
section: Release Notes
---

# Mar 26, 2026: New SCHEDULER attribute for dynamic tables — *General availability*

You can now control whether a dynamic table is automatically refreshed by setting the optional `SCHEDULER` attribute to
`ENABLE` or `DISABLE` on [CREATE DYNAMIC TABLE](../../../sql-reference/sql/create-dynamic-table.md) or [ALTER DYNAMIC TABLE](../../../sql-reference/sql/alter-dynamic-table.md).

When `SCHEDULER = DISABLE`, the dynamic table is excluded from automatic refresh and can only be refreshed manually with
`ALTER DYNAMIC TABLE ... REFRESH`. Manual refreshes do not cascade to upstream or downstream dynamic tables, which lets external
orchestrators (such as dbt) manage individual table refreshes without triggering the entire pipeline.

When `SCHEDULER = ENABLE`, the dynamic table is managed by Snowflake’s dynamic table scheduler using `TARGET_LAG`.

If the `SCHEDULER` attribute is not specified, the dynamic table behaves the same as existing dynamic tables: it is managed by the
scheduler using `TARGET_LAG`.

This feature is useful for external orchestrators, such as dbt, that manage refresh timing independently and need to refresh individual
dynamic tables without triggering the entire pipeline.

For more information, see [CREATE DYNAMIC TABLE](../../../sql-reference/sql/create-dynamic-table.md).

---
title: Mar 26, 2026: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-26-dcr.md
section: Release Notes
---

# Mar 26, 2026: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 13.9

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Cortex Code support:** Users can now leverage Cortex Code interfaces to browse objects, register data offerings and templates, create, review, and join clean rooms, and run analyses for Collaboration Data Clean Rooms. This enables users to seamlessly manage their clean rooms and run workloads through an agentic experience.
* **Request approval handling:** Requestors can now update their approval status before the request reaches a terminal state or is fully approved by all collaborators. This gives request creators the ability to effectively retract their own request before it is acted upon by all parties.
* **View update request types and statuses:** The [VIEW_UPDATE_REQUESTS](/user-guide/cleanrooms/collaboration-api-reference) API now has updated status values for better tracking of approval workflows: `REQUESTED`, `PENDING_MY_APPROVAL`, `PENDING_PARTNER_APPROVAL`. Additionally, it provides tracking updates for: `LINK_DATA_OFFERING`, `UNLINK_DATA_OFFERING`, and `REMOVE_TEMPLATE`.
* Updates to private preview features.

---
title: Mar 27, 2025: Git integration and multi-file editing in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-27-sis.md
section: Release Notes
---

# Mar 27, 2025: Git integration and multi-file editing in Streamlit in Snowflake (Preview)

You can now create multipage Streamlit apps in [Snowsight](../../../user-guide/ui-snowsight-gs.md) and sync your Streamlit apps with a Git repository.

For more information about multipage Streamlit apps, see [Multipage apps](../../../developer-guide/streamlit/app-development/file-organization.md).

For more information about Git integration, see [Sync Streamlit in Snowflake apps with a Git repository](../../../developer-guide/streamlit/features/git-integration.md).

---
title: Mar 27, 2025: Snowflake Data Clean Rooms release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-27-dcr.md
section: Release Notes
---

# Mar 27, 2025: Snowflake Data Clean Rooms release notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

> **Note:**
>
> You must sign out and back in to the clean rooms UI for these updates to take effect.

## Simplified onboarding

Snowflake Data Clean Rooms has simplified the installation process. Installation now happens through the Snowflake Marketplace, and
separates the installation flow for the APIs and the UI. [Learn more](../../../user-guide/cleanrooms/installing-dcr.md)

## Analysis Error Messaging in the clean rooms UI

Users running a query in the clean rooms UI can now see any query errors encountered. This can help users debug errors themselves
or provides information they can share with the Snowflake support team to troubleshoot query errors.

## Obfuscated provider templates

Providers can now choose to hide their template logic from collaborators, in order to protect their template intellectual property.
To hide your template body from consumers, set the `is_obfuscated` argument to FALSE in `provider.add_custom_sql_template`.

Note that a consumer must have Snowflake Enterprise edition installed in order to run an obfuscated template.

## Cross-cloud collaboration support for multiple accounts

Users can now enable cross-cloud collaboration for Data Clean Rooms across multiple accounts under the same organization.

## Update to default caching behavior in consumer run analysis

In order to improve template testing and ensure the most recent results are being generated for users, the default cache behavior for the
`consumer.run_analysis` API is now FALSE.

## New limited API access role for developers

Administrators can now grant a limited access role to consumers to enable limited API access to specified clean rooms. The role grants
permission to run a subset of consumer clean room procedures against specified clean rooms. See the
`consumer.grant_run_on_cleanrooms_to_role` documentation for more information.

## LiveRamp Identity & Translation integration update

LiveRamp offers an Identity as well as a Translation offering. Liveramp’s Embedded Identity resolves personally-identifiable information
(PII) or device identifiers into a durable, pseudonymous RampID. LiveRamp’s RampID Translation capability allows for the transcoding of a
RampID from one partner domain encoding to another, enabling you to match persistent pseudonymous identifiers to one another without
sharing the sensitive underlying identifiers. This functionality is available through the [LiveRamp native app in the Snowflake Marketplace](https://app.snowflake.com/marketplace/listing/GZT0Z11US7AP/liveramp-identity-resolution-and-transcoding?search=liveramp).

After you have installed the LiveRamp native app, follow the
[LiveRamp connector instructions](../../../user-guide/cleanrooms/connector-identity.md) to configure the LiveRamp connector in your Data
Clean Room.

---
title: Mar 31, 2025: Data Governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-03-31-ah-joins.md
section: Release Notes
---

# Mar 31, 2025: Data Governance release notes

## Access history: Support for joins

Access history now tracks the joins that are explicitly mentioned in a query. If a query contains a join, the ACCESS_HISTORY view includes
an array containing the joined objects and type of join.

For an example of how a join shows up in the access history, see [Join](../../../user-guide/access-history.md).

---
title: Mar 31, 2026: CTAS support for Databricks Unity Catalog with external volumes (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-31-ctas-unity-catalog-external-volumes.md
section: Release Notes
---

# Mar 31, 2026: CTAS support for Databricks Unity Catalog with external volumes (*General availability*)

You can now use CREATE ICEBERG TABLE … AS SELECT (CTAS) to create Apache Iceberg™ tables in
catalog-linked databases that use Databricks Unity Catalog as the remote catalog. Previously,
CTAS was not supported for Unity Catalog, and you had to create an empty table and then use
INSERT INTO … SELECT as a workaround.

For more information, see [CREATE ICEBERG TABLE (Iceberg REST catalog)](../../../sql-reference/sql/create-iceberg-table-rest.md).

---
title: Mar 31, 2026: Use Snowsight to manage external volumes (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-31-manage-external-volumes-snowsight-ga.md
section: Release Notes
---

# Mar 31, 2026: Use Snowsight to manage external volumes (*General availability*)

With this release, you can use Snowsight to manage external volumes for Apache Iceberg™ tables. This feature
is now generally available and includes the following capabilities:

* Create an external volume, including optionally setting the external volume as the default at the account,
  database, or schema level.
* Grant USAGE privileges to an external volume.
* Add a storage location to an external volume.
* Verify an external volume to check that Snowflake can successfully authenticate to your storage provider.
* Drop an external volume.

For more information, see the following topics:

* [Configure an external volume](../../../user-guide/tables-iceberg-configure-external-volume.md)
* [Drop an external volume by using Snowsight](../../../user-guide/tables-iceberg-drop-external-volume.md)

---
title: Mar 9, 2026: Cortex Code in Snowsight - General availability
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-09-cortex-code-snowsight-ga.md
section: Release Notes
---

# Mar 9, 2026: Cortex Code in Snowsight - *General availability*

Cortex Code is now generally available in Snowsight, bringing an agentic assistant directly into Snowflake
for SQL and Python development, end-to-end machine learning, data exploration, and account administration.

Because it is integrated into Snowsight and Workspaces, it provides context-aware assistance within
the active workspace and helps users complete development, exploration, and admin tasks without leaving Snowsight.

That changes the day-to-day experience for both technical and business users. Instead of treating AI as
a separate tool, teams can use natural language inside the same platform where they already build and operate.
And because Cortex Code works using Snowflake’s existing policies and role-based access controls, organizations
can accelerate work without stepping outside their secure and governed environment.

## Why this matters

* **Faster coding, without giving up control.**
  In Workspaces, Cortex Code can generate, modify, optimize, and explain SQL and Python code, preview proposed
  edits in a diff view before changes are applied, and let users add tables, schemas, or views as inline context
  with `@` mentions. It can also suggest fixes when a SQL statement fails.
* **A shorter path from idea to production.**
  Cortex Code provides verified solutions in the form of fully functional ML pipelines that can be directly
  executed from a Snowflake Notebook in Workspaces. For dbt Projects on Snowflake, it can help explore source
  data, scaffold models, add tests, run dbt commands, and generate documentation.
* **Better discovery across data and documentation.**
  Cortex Code uses Horizon Catalog context and Snowflake documentation to help users find tables and columns
  with plain-language questions, answer product and SQL questions from official documentation, and surface
  metadata such as tags, masking policies, and lineage when available. It also supports semantic-model-oriented
  workflows for Cortex Analyst.
* **Smarter governance and cost conversations.**
  Teams can ask about user and role access, data ownership, and tables containing PII, while also querying
  account usage, credit consumption, and the warehouses or queries driving spend.

Because Cortex Code is embedded in Snowsight, users can get assistance without switching tools or
leaving the environment where they write and run queries.

For details, see [Cortex Code in Snowsight](../../../user-guide/cortex-code/cortex-code-snowsight.md).

## Legal notices

Where your configuration of Cortex Code uses a model provided on the
[Model and Service Pass-Through Terms](https://www.snowflake.com/en/legal/optional-offerings/offering-specific-terms/ai-features/model-pass-through-terms/),
your use of that model is further subject to the terms for that model on that page.

The data classification of inputs and outputs are as set forth in the following table.

| Input data classification | Output data classification | Designation |
| --- | --- | --- |
| Usage Data | Customer Data | Covered AI Features [1] |

[1]

Represents the defined term used in the AI Terms and Acceptable Use Policy.

For additional information, refer to [Snowflake AI and ML](../../../guides-overview-ai-features.md).

---
title: March 04-05, 2024 — 8.9 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_09.md
section: Release Notes
---

# March 04-05, 2024 — 8.9 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Non-bundled Behavior Changes

The following changes are not part of a bundle, and therefore cannot be enabled for testing:

* [Dynamic tables: Changes to ACCOUNT_USAGE.TABLES and INFORMATION_SCHEMA.TABLES](../bcr-bundles/un-bundled/bcr-account-usage-and-info-schema-changes.md)

## Data Governance Updates

### Custom Classification — *Preview*

With this release, Snowflake is pleased to announce Custom Classification in preview. Snowflake provides the `CUSTOM_CLASSIFIER` class in the `SNOWFLAKE.DATA_PRIVACY` schema to enable data engineers to extend Snowflake’s data classification capabilities based on their own knowledge of the data. You can define your own semantic category, specify the privacy category, and specify regular expressions to match column value patterns while optionally matching the column name.

> For more information, see [Create custom categories for sensitive data](../../user-guide/classify-custom.md).

## Data lake updates

### Primary key information added to Apache Iceberg™ table metadata

With this release, we are pleased to announce that Snowflake now writes information for primary key columns to
Apache Iceberg™ table metadata using the Apache Iceberg [identifier-field-ids](https://iceberg.apache.org/spec/#identifier-field-ids) property.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 04-Mar-24 |
| New SQL functions | **Removed** from *SQL Updates* | 05-Mar-24 |
| *Primary key information added to Apache Iceberg™ table metadata* | **Added** to *Data lake updates* | 07-Mar-24 |

---
title: March 05, 2024 — Hybrid Tables Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-05-hybrid-tables.md
section: Release Notes
---

# March 05, 2024 — Hybrid Tables Release Notes

With this release, we are pleased to announce the availability of hybrid tables
in the following additional Amazon Web Services (AWS) regions:

| Cloud Region | Cloud Region ID |
| --- | --- |
| US East (Ohio) | us-east-2 |
| US East (N. Virginia) | us-east-1 |
| EU (Frankfurt) | eu-central-1 |
| Asia Pacific (Tokyo) | ap-northeast-1 |

For a full list of AWS regions where hybrid tables is available, see
[Limitations](../../../user-guide/tables-hybrid-limitations.md).

Hybrid tables will be made available in additional regions in the near future.

---
title: March 05, 2024 — Snowflake Cortex LLM Functions Release Notes –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-05-snowflake-cortex-llm-functions.md
section: Release Notes
---

# March 05, 2024 — Snowflake Cortex LLM Functions Release Notes –— *Preview*

With this release, we are pleased to announce the preview of Snowflake Cortex LLM functions, which provide instant
access to a suite of features powered by state-of-the-art large language models (LLMs). These models are fully hosted
and managed by Snowflake, so they require no setup and operate within the Snowflake governance and security framework.

The available functions include:

* COMPLETE: Given a prompt, returns a response that completes the prompt. This function accepts either a single prompt
  or a conversation with multiple prompts and responses.
* EXTRACT_ANSWER: Given a question and unstructured data, returns the answer to the question if it can be found in the
  data.
* SENTIMENT: Returns a sentiment score, from -1 to 1, representing the detected positive or negative sentiment of the
  given text.
* SUMMARIZE: Summarizes the given text.
* TRANSLATE: Translates the given text from any supported language to any other.

For more information, see
[Snowflake Cortex Large Language Model (LLM) Functions](../../../user-guide/snowflake-cortex/aisql.md).

---
title: March 08, 2024 — Geospatial Functions Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-08.md
section: Release Notes
---

# March 08, 2024 — Geospatial Functions Release Notes

## New Geospatial Functions Available

Two new functions for [GEOMETRY](../../../sql-reference/data-types-geospatial.md) objects are now available:

* [ST_GEOMFROMGEOHASH](../../../sql-reference/functions/st_geomfromgeohash.md) - Returns a GEOMETRY object for the polygon that represents
  the boundaries of a [geohash](https://en.wikipedia.org/wiki/Geohash).
* [ST_GEOMPOINTFROMGEOHASH](../../../sql-reference/functions/st_geompointfromgeohash.md) - Returns a GEOMETRY object for the point that
  represents center of a [geohash](https://en.wikipedia.org/wiki/Geohash).

---
title: March 11-12, 2024 — 8.10 Release Notes (no announcements)
source: https://docs.snowflake.com/en/release-notes/2024/8_10.md
section: Release Notes
---

# March 11-12, 2024 — 8.10 Release Notes (no announcements)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Announcements

This release contains no significant features, updates, or enhancements to announce.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 11-Mar-24 |

---
title: March 12, 2024 — Snowflake Cortex Classification Release Notes –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-12-snowflake-cortex-classification.md
section: Release Notes
---

# March 12, 2024 — Snowflake Cortex Classification Release Notes –— *Preview*

With this release, we are pleased to announce the preview of Snowflake Cortex Classification, an ML function that
sorts data into different classes using patterns detected in training data. Binary classification (two classes) and
multi-class classification (more than two classes) are supported. Common use cases of classification include customer
churn prediction, credit card fraud detection, and spam detection.

For more information, see
[Classification](../../../user-guide/ml-functions/classification.md).

---
title: March 13, 2024 — Hybrid Tables Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-13-hybrid-tables.md
section: Release Notes
---

# March 13, 2024 — Hybrid Tables Release Notes

With this release, we are pleased to announce the availability of hybrid tables
in the following additional Amazon Web Services (AWS) regions:

| Cloud Region | Cloud Region ID |
| --- | --- |
| Canada (Central) | ca-central-1 |
| South America (Sao Paulo) | sa-east-1 |
| Europe (London) | eu-west-2 |
| EU (Paris) | eu-west-3 |
| EU (Stockholm) | eu-north-1 |
| Asia Pacific (Seoul) | ap-northeast-2 |
| Asia Pacific (Osaka) | ap-northeast-3 |
| Asia Pacific (Mumbai) | ap-south-1 |
| Asia Pacific (Singapore) | ap-southeast-1 |

For a full list of AWS regions where hybrid tables is available, see
[Limitations](../../../user-guide/tables-hybrid-limitations.md).

---
title: March 15, 2024 — Streamlit in Snowflake Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-15.md
section: Release Notes
---

# March 15, 2024 — Streamlit in Snowflake Release Notes

## Streamlit in Snowflake: Support for Streamlit 1.26.0 —– *Preview*

With this release, we are pleased to announce the preview of Streamlit in Snowflake: Support for Streamlit 1.26.0.
For each Streamlit in Snowflake app, you can now select the Streamlit library version in Snowsight or pin the version in the app’s `environment.yml` file.

The release includes support for the following Streamlit features:

* [st.chat_message](https://docs.streamlit.io/library/api-reference/chat/st.chat_message)
* [st.chat_input](https://docs.streamlit.io/library/api-reference/chat/st.chat_input)
* [st.status](https://docs.streamlit.io/library/api-reference/status/st.status)
* [st.toggle](https://docs.streamlit.io/library/api-reference/widgets/st.toggle)
* [st.data_editor](https://docs.streamlit.io/library/api-reference/data/st.data_editor)
* [st.column_config](https://docs.streamlit.io/library/api-reference/data/st.column_config)

Support for Streamlit 1.26.0 includes changes and components from previous versions. For details, see the following Streamlit library changelogs:

* [v1.23.0](https://docs.streamlit.io/library/changelog#version-1230)
* [v1.24.0](https://docs.streamlit.io/library/changelog#version-1240)
* [v1.25.0](https://docs.streamlit.io/library/changelog#version-1250)
* [v1.26.0](https://docs.streamlit.io/library/changelog#version-1260)

For more information, see [Supported versions of the Streamlit library in warehouse runtimes](../../../developer-guide/streamlit/app-development/dependency-management.md).

---
title: March 18-20, 2024 — 8.11 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_11.md
section: Release Notes
---

# March 18-20, 2024 — 8.11 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### SELECT supports trailing commas

In a [SELECT](../../sql-reference/sql/select.md) statement or clause, a trailing comma is now supported in a column list. For example, the following SELECT statement is supported:

> ```sqlexample
> SELECT emp_id,
>        name,
>        dept,
> FROM employees;
> ```

## Data loading / unloading Updates

### Performance improvements for loading JSON files

With this release, Snowflake is pleased to announce a performance improvement for loading JSON files. This improvement results in lower ingestion latency of up to 25% for most JSON loading scenarios without any customer modification to queries.

For more information, see [2024 Performance Improvements](../performance-improvements-2024.md).

### Improvements to the SNOWPIPE_STREAMING_CLIENT_HISTORY view

With this release, the SNOWPIPE_STREAMING_CLIENT_HISTORY view resolves an issue where not all data loading events generated by the Snowpipe Streaming client calls were visible. The view has been resolved with all events showing correctly as of Feb 1, 2024.

For more information, see [SNOWPIPE_STREAMING_CLIENT_HISTORY view](../../sql-reference/account-usage/snowpipe_streaming_client_history.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 18-Mar-24 |
| New SQL functions | **Removed** from *SQL Updates* | 20-Mar-24 |

---
title: March 18-20, 2024 — Limit functionality of your Snowflake Native App —– Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-18-limit-app-functionality.md
section: Release Notes
---

# March 18-20, 2024 — Limit functionality of your Snowflake Native App —– *Preview*

With this release, we are pleased to announce the preview of the ability to limit the functionality of your Snowflake Native App for consumers who
trial your data product. You can limit the functionality of a secure view, secure UDF, or a Streamlit app included in your application package
using a system function, [SYSTEM$IS_LISTING_TRIAL](../../../sql-reference/functions/system_is_listing_trial.md).

See more information in [Limit functionality of your Snowflake Native App for trial consumers](https://other-docs.snowflake.com/collaboration/provider-listings-preparing#label-listings-trial-limit-functionality-app).

---
title: March 19, 2026: MIN_BY and MAX_BY functions are supported with dynamic table incremental refresh (General availability)
source: https://docs.snowflake.com/en/release-notes/2026/other/2026-03-19-min-max-incremental-dynamic-table.md
section: Release Notes
---

# March 19, 2026: MIN_BY and MAX_BY functions are supported with dynamic table incremental refresh (*General availability*)

The [MIN_BY](../../../sql-reference/functions/min_by.md) and [MAX_BY](../../../sql-reference/functions/max_by.md) aggregate and window functions are now supported
with dynamic table incremental refresh.

For a complete list of supported queries and functions, see [Supported queries for dynamic tables](../../../user-guide/dynamic-tables-supported-queries.md).

---
title: March 2023
source: https://docs.snowflake.com/en/release-notes/2023-03.md
section: Release Notes
---

# March 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Snowpipe Streaming — *Preview*

With this release, Snowflake is pleased to announce a preview of Snowpipe Streaming, the latest addition to Snowflake ingestion offerings.
The Snowpipe Streaming API writes rows of data directly to Snowflake tables without the requirement of staging files. This architecture
results in lower load latencies with corresponding lower costs for loading any volume of data, which makes it a powerful tool for handling
near real-time data streams.

For more information, refer to [Snowpipe Streaming](../user-guide/snowpipe-streaming/data-load-snowpipe-streaming-overview.md). Snowpipe Streaming is also now
available for the Snowflake Connector for Kafka, which offers an easy upgrade path to take advantage of the lower latency and lower cost
loads.

### Tabular Return Values from Java or Scala Stored Procedures — *Preview*

With this release, Snowflake is pleased to announce a preview of tabular stored procedures with a handler written in Java or Scala. You can
write a procedure that returns data in tabular form. To do this, you specify the procedure’s return type as TABLE (specifying columns for
the return value), then have your handler code return the tabular value in a Snowpark dataframe.

For more information, refer to
[Returning tabular data from a Java stored procedure](../developer-guide/stored-procedure/java/procedure-java-tabular-data.md) or
[Returning tabular with Scala in stored procedures created with SQL](../developer-guide/stored-procedure/scala/procedure-scala-tabular-data.md).

## New Regions

We are pleased to announce the availability of the following new region(s):

| Cloud Platform | Region |
| --- | --- |
| Amazon Web Services (AWS) | Asia Pacific (Jakarta) |

With the addition of this region, Snowflake now supports over thirty-five global [regions](../user-guide/intro-regions.md) across three
cloud platforms (AWS, GCP, and Azure), including three regions that support compliance with US government regulations.

You can provision initial accounts in the region through [self-service](https://signup.snowflake.com/) or a Snowflake representative.

## SQL Updates

### SHOW SHARES Command: Support for STARTS WITH and LIMIT … FROM

The SHOW SHARES command now supports the `STARTS WITH` and `LIMIT ... FROM` parameters so that you can filter the shares
returned by the command.

Refer to [SHOW SHARES](../sql-reference/sql/show-shares.md) for details and examples.

### Geospatial Functions for Shape Transformation and Orientation — *Preview*

With this release, we are pleased to announce the preview of the following geospatial functions for shape transformation and orientation:

| **Function** | **Description** |
| --- | --- |
| [ST_BUFFER](../sql-reference/functions/st_buffer.md) (for GEOMETRY objects) | Returns a GEOMETRY object that represents a MultiPolygon containing the points within a specified distance of the input GEOMETRY object. The returned object effectively represents a “buffer” around the input object. |
| [ST_SIMPLIFY](../sql-reference/functions/st_simplify.md) (for GEOMETRY objects) | Given an input GEOMETRY object that represents a line or polygon, returns a simpler approximation of the object. The function identifies and removes selected vertices, resulting in a similar object that has fewer vertices. |
| [ST_AZIMUTH](../sql-reference/functions/st_azimuth.md) (for GEOMETRY objects) | Given two Points that are GEOMETRY objects, returns the azimuth (in radians) of the line segment formed by the two points. |
| [ST_MAKEPOLYGONORIENTED](../sql-reference/functions/st_makepolygonoriented.md) (for GEOGRAPHY objects) | Constructs a GEOGRAPHY object that represents a polygon without holes. The function uses the specified LineString as the outer loop. This function does not attempt to correct the orientation of the loop, allowing for the creation of polygons that span more than half of the globe. This function differs from [ST_MAKEPOLYGON](../sql-reference/functions/st_makepolygon.md), which inverts the orientation of those large shapes. |

Preview features are intended for evaluation and testing purposes, and are not recommended for use in production.

### Support for Specifying How to Handle Invalid Geospatial Shapes — *Preview*

With this release, we are pleased to announce the preview of support for handling invalid geospatial shapes.

By default, when you use a [geospatial conversion function](../sql-reference/functions-geospatial.md) to convert
[data in a supported input format](../sql-reference/data-types-geospatial.md) to a GEOGRAPHY or GEOMETRY object, the function attempts to
validate the shape and repair the shape if the shape is invalid. If the shape cannot be repaired, the function does not create a GEOGRAPHY
or GEOMETRY object.

With this preview feature, you have more control over the validation and repair process. You can:

* Allow these conversion functions to create GEOGRAPHY and GEOMETRY objects for invalid shapes.
* Determine if the shape for a GEOGRAPHY or GEOMETRY object is invalid.

For details, refer to [Specifying how invalid geospatial shapes are handled](../sql-reference/data-types-geospatial.md).

## Data Pipeline Updates

### Streams on Views — *General Availability*

With this release, we are pleased to announce the general availability of streams on views. Streams on views extends table streams to track
change data capture (CDC) records for views, including secure views.

Currently, streams are limited to views that satisfy the following requirements:

* All of the underlying tables must be native tables.
* The view may only apply the following operations:

  > + Projections
  > + Filters
  > + Inner or cross joins
  > + UNION ALL
* Materialized views are not supported.

For more information about the streams on views requirements, refer to [Introduction to streams](../user-guide/streams-intro.md).

## Data Lake Updates

### External Table and Directory Table Auto-Refresh Observability and Billing

With this release, Snowflake will start billing for auto-refresh notifications in external tables and directory tables on external stages at
a rate equivalent to the Snowpipe file charge. You can estimate charges incurred by your external table and directory table auto-refresh
notifications by examining the Account Usage [PIPE_USAGE_HISTORY view](../sql-reference/account-usage/pipe_usage_history.md). Note that the auto-refresh pipes will be listed under a NULL pipe
name. You can also view your external table auto-refresh notification history at the table-level/stage-level granularity by using the
Information Schema table function [AUTO_REFRESH_REGISTRATION_HISTORY](../sql-reference/functions/auto_refresh_registration_history.md).

To avoid charges for auto-refresh notifications, perform a manual refresh for external tables and directory tables. For external tables, the
ALTER EXTERNAL TABLE <name> REFRESH … statement can be used to manually synchronize your external table to external storage. For directory
tables, the ALTER STAGE <name> REFRESH … statement can be used to manually synchronize the directory to external storage.

## Data Governance Updates

### Allow Masked Columns as Inputs to Row Access Policies and Conditional Masking Policies

With this release, Snowflake is pleased to announce that the signature of a row access policy and a conditional masking policy can specify a
column protected by a masking policy. Specifying a masked column in the policy signature provides more freedom to policy administrators to
create new policies or replace existing policies.

To enable this functionality, set the `EXEMPT_OTHER_POLICIES` property to `TRUE` when creating a new masking policy or replacing an
existing masking policy. Note that this property cannot be set on an existing policy; the existing policy must be replaced to include this
property. After creating or replacing the masking policy, the policy can be set on a column and the protected column can be referenced in
the signature of a row access policy or a conditional masking policy.

For details, refer to [CREATE MASKING POLICY](../sql-reference/sql/create-masking-policy.md).

## Replication Updates

### Account Replication: Notification Integration Support — *Preview*

With this release, account replication now includes preview support for the replication of notification integration objects of the following
types:

* TYPE = EMAIL
* TYPE = QUEUE with DIRECTION = OUTBOUND

For more information, refer to [Integration replication](../user-guide/account-replication-intro.md).

## Web Interface

### Python Worksheets — *Preview*

With this release, we are pleased to announce the preview of Python worksheets in Snowsight. Python worksheets let you write and
run Snowpark Python in a worksheet in Snowsight.

In a Python worksheet, you can do the following:

* Write a Python script to read data from a stage, transform it, and save it to a table, all without leaving Snowsight.
* Use included packages from Anaconda or import packages from a stage to write code more easily.
* Automate your Python code by deploying it as a stored procedure and scheduling it as a task.

For more information, refer to [Writing Snowpark Code in Python Worksheets](../developer-guide/snowpark/python/python-worksheets.md).

### Individual Task Observability — *General Availability*

With this release, we are pleased to announce the general availability of individual task observability. Tasks are now visible in a graph
view to highlight dependencies and order of execution. With observability for individual task runs, you can perform monitoring tasks such as
identifying long-running tasks, consistently skipped tasks, and databases with a high volume of tasks.

For more information, refer to [View tasks and task graphs in Snowsight](../user-guide/ui-snowsight-tasks.md).

---
title: March 26-27, 2024 — 8.12 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_12.md
section: Release Notes
---

# March 26-27, 2024 — 8.12 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Behavior Change Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_03](../bcr-bundles/2024_03_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_02](../bcr-bundles/2024_02_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_01](../bcr-bundles/2024_01_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for April 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL Updates

### Organization Usage: Improved views for billing reconciliation — *General Availability*

With this release, we are pleased to announce that the following views in the Organization Usage schema have been improved to make it easier
to reconcile Snowflake usage with monthly billing statements:

* CONTRACT_ITEMS
* RATE_SHEET_DAILY
* REMAINING_BALANCE_DAILY
* USAGE_IN_CURRENCY_DAILY

For details on what changed, see [Organization Usage: Updated billing views](../bcr-bundles/un-bundled/bcr-1584.md).

## Data Pipeline Updates

### Replication: Stages, pipes, storage integrations, load history, and Snowpipe Streaming — *General Availability*

With this release, we are pleased to announce the general availability of replication for stages, pipes, storage integrations, load history,
and Snowpipe Streaming channels. You can replicate these objects to configure failover for data pipelines across
[regions](../../user-guide/intro-regions.md) and [cloud platforms](../../user-guide/intro-cloud-platforms.md).

As part of pipe object replication, pipes in a secondary database are in a `READ_ONLY` execution state and receive notifications, but
do not load data until you promote the secondary database to serve as the primary. After you promote a secondary database, the pipes will
transition to a `FAILING_OVER` execution state.

With Snowpipe Streaming replication, the table object, table data, and the channel offsets associated with the table from the primary
database are replicated to the secondary database.

For more information, see [Stage, pipe, and load history replication](../../user-guide/account-replication-stages-pipes-load-history.md) and [Replication and Snowpipe Streaming](../../user-guide/account-replication-considerations.md).

### Schema detection and evolution for Kafka connector with Snowpipe Streaming — *General Availability*

With this release, we are pleased to announce the general availability of schema detection and evolution for Kafka connector with
Snowpipe Streaming. The structure of tables in Snowflake can be defined and evolved automatically to support the structure of Kafka topic
message data loaded by the Kafka connector using Snowpipe Streaming.

For more information, see [Schema detection and evolution for Kafka connector with Snowpipe Streaming classic](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-kafka-schema-detection.md).

## Data Governance Updates

### Memoizable functions with constant arguments

With this release, Snowflake is pleased to announce support for creating a memoizable function with constant arguments. This support
provides more flexibility in terms of how to define the function. Additionally, if you reference the function in a data access policy,
such as a row access policy, you have more freedom in terms of how to define your policy. The function arguments can be any of the
following data types:

* string
* numeric
* date
* Boolean

For details, see:

* [Memoizable UDFs](../../developer-guide/udf/sql/udf-sql-scalar-functions.md).
* [Masking policy with a memoizable function](../../user-guide/security-column-ddm-use.md).

### Share data protected by a role-based policy — *General Availability*

With this release, Snowflake is pleased to announce the general availability to enable a data sharing provider to use the IS_DATABASE_ROLE_IN_SESSION
function in the conditions of a masking policy or a row access policy to allow a data sharing consumer to access shared data that is protected
by either of these policies. The function argument takes either the name of a database role or a column that contains database roles. This
provides more options to the provider to share data, allows the consumer to access sensitive data that the provider makes available, and
removes restrictions on policy-protected data when the consumer queries a shared table protected by a policy.

For details, see [Share data protected by a policy](../../user-guide/data-sharing-policy-protected-data.md) and
:   [IS_DATABASE_ROLE_IN_SESSION](../../sql-reference/functions/is_database_role_in_session.md).

### Access History: Stored procedure ancestor queries — *General Availability*

With this release, Snowflake is pleased to announce the general availability to track the chain of queries that call a stored procedure
by using the `parent_query_id` and `root_query_id` columns. These columns allow you to see the query ID that performs a read
or write operation on another object, and the query ID for the query that calls a stored procedure, respectively. The columns support
calling a stored procedure directly and nested stored procedure calls, such as when one stored procedure calls another stored procedure.

This update was announced in preview during the 8.2 release. For details, see [Ancestor queries with stored procedures](../../user-guide/access-history.md).

### Shared tag references — *General Availability*

This update was announced in preview during the 7.33 release. For details, see [Shared tag references](../../user-guide/data-sharing-provider.md).

### Access History: Track objects modified by a DDL operation — *General Availability*

With this release, we are pleased to announce the general availability of tracking objects modified by DDL operations in the Account Usage
[ACCESS_HISTORY](../../sql-reference/account-usage/access_history.md) view. These operations include:

* Track how tag and policy assignments change.
* Track the table and column lifecycle.

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 26-Mar-24 |

---
title: March 28, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-28-snowflake-data-clean-rooms.md
section: Release Notes
---

# March 28, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce that Snowflake Data Clean Rooms are generally available in the following regions:

| Cloud Region | Cloud Region ID |
| --- | --- |
| Amazon Web Services US East (N. Virginia) | us-east-1 |
| Amazon Web Services US West (Oregon) | us-west-2 |
| Microsoft Azure West US 2 (Washington) | westus2 |

Snowflake Data Clean Rooms allow multiple parties to collaborate together in a secure environment. These collaborators can combine and
analyze data without worrying about the privacy concerns that go with sharing raw data.

In a Snowflake Data Clean Room, collaborators are able to return aggregated results and insights from a dataset, but cannot access the data
directly. The collaborator who is sharing their data defines what analyses are available to the other collaborators, allowing them to
tightly control how their data is used.

Snowflake Data Clean Rooms are designed for both business and technical users. The web app provides a user interface that allows
non-technical users to work with clean rooms, including running analyses without writing SQL queries. The developer edition provides APIs
so technical users can programmatically create and use clean rooms.

Snowflake customers can use a Snowflake Data Clean Room to collaborate with other Snowflake customers or with parties who do not have a
Snowflake account. Parties without a Snowflake account simply accept an invitation to use a clean room managed account to begin
collaborating in the clean room.

For more information, see [Introduction to Snowflake Data Clean Rooms](https://other-docs.snowflake.com/cleanrooms/introduction).

---
title: March 29, 2024 — Data Quality Monitoring Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-03-29-dmf.md
section: Release Notes
---

# March 29, 2024 — Data Quality Monitoring Release Notes

## Data Quality Monitoring and data metric functions — *Preview*

With this release, Snowflake is pleased to announce Data Quality Monitoring with data metric functions (DMFs) in preview. Data Quality
Monitoring uses DMFs to continuously monitor the data quality metrics such as completeness, accuracy, uniqueness, and validity. You can use
Snowflake provided system DMFs for common metrics such as row count, duplicates, and freshness. Alternatively, you can create your own
custom DMFs to define metrics that are specific to your own data.

You can either use the DMF in a query to test the quality of data in your pipeline or associate the DMF to desired tables to continuously
monitor its quality. The continuous monitoring can either be schedule-based for periodic measurement or trigger-based to measure only when
the underlying table is modified. DMF results are recorded in a centralized event table in your Snowflake account to protect the privacy of
your data. You can create dashboards, configure alerts, or query metric results directly from the event table. Furthermore, data in the
event table is in the standard OpenTelemetry format for easy integration with observability tools.

For details, see [Introduction to data quality checks](../../../user-guide/data-quality-intro.md).

---
title: Masking policy: Comply with the scale and precision of a column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1355.md
section: Release Notes
---

# Masking policy: Comply with the scale and precision of a column

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The runtime behavior of a [masking policy](../../../user-guide/security-column-intro.md) and the masked output in terms of the scale and
precision of a column is as follows:

Before the change:
:   The masked value as specified in the body of the masking policy does not respect the precision and scale of a column.

After the change:
:   If you create a new masking policy after enabling the `2024_04` behavior change bundle, set the masking policy on a column, and
    query the protected column, the following occurs:

    * The query fails when the masked value is greater than the precision of the column.
    * The masked value is truncated to the scale of the column when the scale of the masked value is greater than the scale of the column.

> **Note:**
>
> To identify which masking policies are affected by this change, see [Mitigating masking policy return value updates](../managing-behavior-change-releases.md).

Ref: 1355

---
title: Materialized Views: Failed Refresh Invalidates a Materialized View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1178.md
section: Release Notes
---

# Materialized Views: Failed Refresh Invalidates a Materialized View

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

Materialized views are updated automatically on a regular basis by a background process.

Currently, if the refresh of a materialized view fails, the data for the materialized view is not updated, and the background
process continues to attempt to refresh the materialized view periodically.

If the failure is due to a problem that will continue to occur (e.g. a division by zero error that is caused by the materialized
view definition or the data), the background process will continually fail to refresh the materialized view, and the data for the
materialized view will not be updated.

In the current release, specific types of errors during the refresh process invalidate the materialized view. In addition:

* When you query the materialized view, the output will include the reason for the invalidation.
* The output of the [SHOW MATERIALIZED VIEWS](../../../sql-reference/sql/show-materialized-views.md) command will include the reason why the materialized view
  was invalidated.

Using the information from the output of these commands, address the problem with the materialized view, and execute the
[ALTER MATERIALIZED VIEW … RESUME](../../../sql-reference/sql/alter-materialized-view.md) command to resume the materialized view.

In summary, the process of refreshing and querying the materialized view changed as described below:

Previously:
:   The background process fails to refresh the materialized view.

    Although the data in the materialized view is out of date, the output from querying the materialized view does not indicate
    that the data is stale.

    When you execute the SHOW MATERIALIZED VIEWS command, the refreshed_on column indicates that the data is out of date, but the
    output does not include a reason for this.

Currently:
:   The background process invalidates the materialized view.

    Querying the materialized view results in an error that indicates why the refresh process failed to update the materialized
    view. For example:

    ```sqlexample
    SELECT * FROM my_mv;
    ```

    ```output
    002037 (42601): SQL compilation error:
      Failure during expansion of view 'MY_MV':
        SQL compilation error: Materialized View MY_MV is invalid.
        Invalidation reason: Division by zero
    ```

    When you execute the SHOW MATERIALIZED VIEWS command, the `invalid` column indicates that the materialized view is invalid,
    and the `invalid_reason` column contains the reason for the invalidation. For example:

    ```sqlexample
    SHOW MATERIALIZED VIEWS;
    ```

    ```output
    ...  +---------+------------------+ ...
    ...  | invalid | invalid_reason   | ...
    ...  +---------+------------------+ ...
    ...  | true    | Division by zero | ...
    ...  +---------+------------------+ ...
    ```

For example, suppose that you execute the following statements to create a materialized view:

```sqlexample
CREATE OR REPLACE TABLE my_base_table (a INT, b INT, c VARCHAR(16));
```

```sqlexample
INSERT INTO my_base_table VALUES (1, 1, 'valid data');
```

```sqlexample
CREATE OR REPLACE MATERIALIZED VIEW my_mv AS SELECT a / b AS div FROM my_base_table;
```

Suppose that you insert data into the table that will cause the refresh of the materialized view to fail. For example, suppose
that you execute the following statement:

```sqlexample
INSERT INTO my_base_table VALUES (1, 0, 'invalid data');
```

When the materialized view is refreshed next, the refresh will fail with a “division by zero” error. Because the refresh fails,
the materialized view will be invalidated.

To view the reason for the invalidation, query the materialized view or execute the SHOW MATERIALIZED VIEWS command:

```sqlexample
SELECT * FROM my_mv;
```

```output
002037 (42601): SQL compilation error:
  Failure during expansion of view 'MY_MV':
    SQL compilation error: Materialized View MY_MV is invalid.
    Invalidation reason: Division by zero
```

```sqlexample
SHOW MATERIALIZED VIEWS;
```

```output
...  +---------+------------------+ ...
...  | invalid | invalid_reason   | ...
...  +---------+------------------+ ...
...  | true    | Division by zero | ...
...  +---------+------------------+ ...
```

Address the problem that caused the invalidation, and execute the ALTER MATERIALIZED VIEW … RESUME command to resume the
materialized view:

```sqlexample
ALTER MATERIALIZED VIEW my_mv RESUME;
```

Ref: 1178

---
title: Materialized Views: MINUS, EXCEPT, and INTERSECT No Longer Allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-757.md
section: Release Notes
---

# Materialized Views: MINUS, EXCEPT, and INTERSECT No Longer Allowed

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

In the current release, you are not allowed to use the MINUS (or EXCEPT) and INTERSECT set operators in a materialized view.

Previously:
:   You could create, alter, or query a materialized view that uses the MINUS, EXCEPT, or INTERSECT operator.

Currently:
:   Creating, altering, or querying a materialized view that uses the MINUS, EXCEPT, or INTERSECT operator results in one of the following
    error messages:

    `Invalid materialized view definition. Join types [MINUS] not allowed in view definition.`

    `Invalid materialized view definition. Join types [INTERSECT] not allowed in view definition.`

If you previously had a materialized view that used the MINUS, EXCEPT, or INTERSECT operator, recreate that materialized view so that it no longer
uses the MINUS, EXCEPT, or INTERSECT operator.

Ref: 215, 757

---
title: Materialized Views: Using Time Travel to Query Historical Data Produces Expected Error Message
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-923.md
section: Release Notes
---

# Materialized Views: Using Time Travel to Query Historical Data Produces Expected Error Message

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

When [Time Travel](../../../user-guide/data-time-travel.md) is incorrectly used to query historical data for materialized views, an error message
now states that Time Travel is not supported:

Previously:
:   Although Time Travel is not supported for materialized views, queries with the AT or BEFORE keyword in the FROM clause might have
    returned unexpected results. In some cases, queries might also have returned an unexpected error message. For example, for the Time Travel
    clause: `CAST('2022-11-29 00:00:00' AS TIMESTAMP_LTZ(9))`.

Currently:
:   Querying historical data using Time Travel on materialized views produces a SQL compilation error:

    `Time travel is not supported for materialized views.`

Ref: 923

---
title: May 01, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-05-01.md
section: Release Notes
---

# May 01, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Suspend and resume tasks in Snowsight

With this improvement you can now suspend and resume your tasks directly in Snowsight. To suspend or resume a task, navigate to the task and choose the Resume or Suspend option from the actions menu.

When you suspend or resume a child task in a task graph, Snowsight automatically suspends and resumes the task graph’s root task.

For more information, see [View and manage individual tasks](../../../user-guide/ui-snowsight-tasks.md).

## Suspended time and reason descriptions

With this improvement you can hover over any suspended label or icon to see the most recent time the task was suspended and if it was suspended manually or automatically due to failure. The time and reason for suspension aren’t retained in historical snapshots of task graph runs.

For more information, see [View and manage task graphs](../../../user-guide/ui-snowsight-tasks.md).

## Parameters on root tasks

With this improvement Snowsight now displays the auto-suspend, auto-retry, and task timeout parameters in the task details page of the task graph’s root task.

For more information, see [View and manage task graphs](../../../user-guide/ui-snowsight-tasks.md).

## Warehouses and Serverless Tasks

With this improvement the Snowsight tasks table displays a Serverless Tasks icon and Serverless for the warehouse column for Serverless Tasks. User-managed tasks also show the name of the warehouse that the task ran on in the task graph run history view.

For more information, see [View task history](../../../user-guide/ui-snowsight-tasks.md).

## Task return values

With this improvement you can see the assigned return values in the task history. Return values are visible both in a single task’s run history and in the task graph run details.

You can use return values to give you quick insights of what your task run has processed. You can define return values in your tasks to be returned on task completion.

For more information, see [View and manage individual tasks](../../../user-guide/ui-snowsight-tasks.md).

## Task run duration visualization

With this improvement Snowsight displays a bar-chart visualization of the duration of task runs to easily see fluctuations and trends. You can also see how long the task was scheduled before it started.

For more information, see [View task history](../../../user-guide/ui-snowsight-tasks.md).

## Task run conditions

With this improvement Snowsight displays a condition column in your task list. The condition column describes the conditions defined for the task to run. If the condition is a stream, you can see the name of that stream and the condition-logic. If the condition is a return value of a predecessor task, you can see the name of that predecessor task and the condition-logic.

For more information, see [View and manage individual tasks](../../../user-guide/ui-snowsight-tasks.md).

## Task graph configurations

With this improvement Snowsight displays the task graph configuration and the task definition on the task details page. The graph configuration is defined at the root task level but is visible in the details of and applied to all tasks in a task graph.

For more information, see [View and manage task graphs](../../../user-guide/ui-snowsight-tasks.md).

## Task definition view in task graphs

With this improvement you can inspect the definitions of each task in your task graphs from the task graph view. The new side-panel is resizable to show the most important task configurations at a glance and the full definition and graph configuration.

For more information, see [View and manage task graphs](../../../user-guide/ui-snowsight-tasks.md).

---
title: May 01, 2025: Dynamic tables: Support for filtering by current time and date for incremental refresh (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-01-dynamic-tables-current-timestamp.md
section: Release Notes
---

# May 01, 2025: Dynamic tables: Support for filtering by current time and date for incremental refresh (*General availability*)

We are pleased to announce support for using the [CURRENT_TIMESTAMP](../../../sql-reference/functions/current_timestamp.md), [CURRENT_DATE](../../../sql-reference/functions/current_date.md),
and [CURRENT_TIME](../../../sql-reference/functions/current_time.md) functions and their aliases as a filter for dynamic tables in incremental refresh
mode.

You can now use these functions inside of predicates such as a WHERE/HAVING/QUALIFY clause.

For example:

```sqlexample
CREATE TABLE my_table
 AS
  SELECT column1 AS id, parse_json(column2) AS entity, current_timestamp() as event_timestamp
  FROM values
  (12712555,
  '{ name:  { first: "John", last: "Smith"},
   contact: [
   { business:[
   { type: "phone", content:"555-1234" },
   { type: "email", content:"j.smith@example.com" } ] } ] }'),
  (98127771,
  '{ name:  { first: "Jane", last: "Doe"},
   contact: [
   { business:[
   { type: "phone", content:"555-1236" },
   { type: "email", content:"j.doe@example.com" } ] } ] }') v;

CREATE DYNAMIC TABLE my_dynamic_table
 TARGET_LAG = DOWNSTREAM
 WAREHOUSE = mywh
 REFRESH_MODE = INCREMENTAL
 AS
  SELECT id, entity, event_timestamp
  FROM my_table
  WHERE event_timestamp > timestampadd(month, -1, current_timestamp);
```

To use these functions, you must explicitly [set your dynamic table’s refresh mode to INCREMENTAL](../../../sql-reference/sql/create-dynamic-table.md).

---
title: May 01, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-01-dcr.md
section: Release Notes
---

# May 01, 2025: Snowflake Data Clean Rooms updates

> **Note:**
>
> **Clean rooms UI users** must sign out and back in to the clean rooms UI for these updates to take effect.
>
> **Clean rooms API users** must run the following SQL commands for these updates to take effect:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.apply_patch();
> ```
>
> **To enable auto-upgrades for API users,** run the following SQL commands:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
> ```

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

**Sign in to clean rooms with your Snowflake credentials.** Snowflake users in new clean room accounts can now sign in to the clean rooms
UI using their Snowflake credentials.

**Activation methods added to run roles.** Users granted run roles can now activate their data.
[See the full list of procedures available to run role users.](../../../user-guide/cleanrooms/consumer.md)

**Differential privacy is now a managed task**, which lowers the cost to clean room owners. This feature will be enabled automatically for
any new clean room accounts. If your have an existing clean room account and want to migrate your differential privacy to a managed task,
run the following code:

```sqlexample
USE ROLE ACCOUNTADMIN;
GRANT EXECUTE MANAGED TASK ON ACCOUNT TO ROLE SAMOOHA_APP_ROLE;

USE ROLE SAMOOHA_APP_ROLE;
CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.LIBRARY.SETUP_TASK_PROVIDER();
```

**Provider activation warehouse management.** Providers can now choose the size of the warehouse used to decrypt and store provider
activation results by calling `provider.update_activation_warehouse`.

---
title: May 02, 2024 — Cost Management Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-02-cost.md
section: Release Notes
---

# May 02, 2024 — Cost Management Release Notes

## Organization Overview Page —– *Preview*

With this release, we are pleased to announce the preview of an Organization Overview page in Snowsight that allows you to
gain organization-level insights into the cost of using Snowflake, including:

* Details about the current contract.
* The remaining balance of the contract.
* The accumulated cost of Snowflake usage since the start of the contract.
* The monthly spend for the organization.
* An overview of the consumption of each account in the organization.

For more details about using the Organization Overview page, see [Overview of organization-level costs](../../../user-guide/cost-exploring-overall.md).

---
title: May 02, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-02-snowsight-dd-preview.md
section: Release Notes
---

# May 02, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Data Dictionary with masked PII –— *Preview*

With this release, we are pleased to announce that data sharing and collaboration with listings is now available for accounts in
U.S. government regions.
Snowflake is pleased to announce the preview of Data Dictionary preview with masked PII in Snowsight.

With this preview, Providers can now specify data to preview in a listing as well as mask sensitive PII.

Consumers can view data previews for listings where preview data is specified.

For more details, see [About data dictionaries](../../../collaboration/provider-listings-reference.md).

---
title: May 03, 2024 — Aggregation and Projection Policies Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-03-policies.md
section: Release Notes
---

# May 03, 2024 — Aggregation and Projection Policies Release Notes

## Aggregation Policies — *General Availability*

With this release, we are pleased to announce the general availability of aggregation policies, which protect the privacy of individual rows
by requiring analysts to run queries that aggregate data rather than retrieving individual rows. When defining an aggregation policy,
administrators specify a minimum group size, which determines how many rows must be included in each aggregation group. Once the aggregation
policy is assigned to a table or view, queries must aggregate data into groups that contain enough rows to meet the minimum group size
requirement.

For more information, see [Aggregation policies](../../../user-guide/aggregation-policies.md).

## Projection Policies — *General Availability*

With this release, we are pleased to announce the general availability of projection policies, which prevent queries from using a SELECT
statement to project a column. The projection policy defines which users should be blocked from projecting columns and which users should
be allowed. Administrators then assign the projection policy to a column in a table or view to control who can project the column.

For more information, see [Projection policies](../../../user-guide/projection-policies.md).

---
title: May 03, 2024 — Snowflake Model Registry – General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-03-snowflake-model-registry.md
section: Release Notes
---

# May 03, 2024 — Snowflake Model Registry – General Availability

We are pleased to announce that the Snowflake Model Registry is now generally available as of Snowpark ML (`snowflake-ml-python`) package version 1.5.0. The Snowflake Model Registry allows you to securely store, manage, and use machine learning models in Snowflake, where your data already lives. The registry supports most popular types of Python ML models and can be used from both Python and SQL.

For more information, see [Snowflake Model Registry](../../../developer-guide/snowflake-ml/model-registry/overview.md).

---
title: May 05, 2025: Generation 2 standard warehouses (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-05-gen2-standard-warehouses.md
section: Release Notes
---

# May 05, 2025: Generation 2 standard warehouses (*General availability*)

With this release, you can take advantage of the Generation 2 Standard Warehouse (Gen2) feature.

This feature is an updated version (the “next generation”) of the
current standard virtual warehouse in Snowflake, focused on improving performance for
analytics and data engineering workloads. Gen2 is built on top of faster underlying hardware
and intelligent software optimizations, such as enhancements to delete, update, and merge operations,
and table scan operations. With Gen2, you can expect the majority of queries finish faster, and you can
do more work at the same time. The exact details depend on your configuration and workload.
Conduct tests to verify how much this feature improves your costs, performance, or both.

To create and manage generation 2 standard warehouses, you can use the SQL commands
[CREATE WAREHOUSE](../../../sql-reference/sql/create-warehouse.md) and
[ALTER WAREHOUSE](../../../sql-reference/sql/alter-warehouse.md).

For more information, see [Snowflake generation 2 standard warehouses](../../../user-guide/warehouses-gen2.md).
For the regions and cloud service providers where this feature is currently available,
see [region availability for generation 2 standard warehouses](../../../user-guide/warehouses-gen2.md).

---
title: May 05, 2025: Snowflake Cortex Provisioned Throughput (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-05-provisioned-throughput.md
section: Release Notes
---

# May 05, 2025: Snowflake Cortex Provisioned Throughput (*General availability*)

With Provisioned Throughput, a new capability in Snowflake Cortex, you can reserve throughput for managed inference.

Use Provisioned Throughput for the following tasks:

* Reserve throughput for specific time periods using provisioned throughput units (PTUs).
* Allocate capacity for supported models.
* Scale throughput based on workload requirements with minimum and incremental configurations.

---
title: May 06, 2024 — Vector data type and vector similarity functions — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-06-vector-data-type.md
section: Release Notes
---

# May 06, 2024 — Vector data type and vector similarity functions — *Preview*

With this release, we are pleased to announce the preview of VECTOR data type, Vector Similarity Functions,
and the Vector Embedding Function. These features enable important applications that require semantic
vector search and retrieval.

For more information, see [Vector Embeddings](../../../user-guide/snowflake-cortex/vector-embeddings.md).

## New SQL data type

The following data type was introduced in recent releases:

| Category | New data type | Description |
| --- | --- | --- |
| Vector | [VECTOR](../../../sql-reference/data-types-vector.md) | With the VECTOR data type, Snowflake encodes and processes vectors efficiently. This data type supports semantic vector search and retrieval applications, such as RAG-based applications, and common operations on vectors in vector-processing applications. |

## New SQL functions

The following functions were introduced in recent releases:

| Function Category | New Function | Description |
| --- | --- | --- |
| [Vector similarity function](../../../sql-reference/functions-vector.md) | [VECTOR_INNER_PRODUCT](../../../sql-reference/functions/vector_inner_product.md) | Returns the inner product of two vectors. The inner product (also known as the dot or scalar product) multiplies two vectors. |
| [Vector similarity function](../../../sql-reference/functions-vector.md) | [VECTOR_L2_DISTANCE](../../../sql-reference/functions/vector_l2_distance.md) | Measures the L2 distance between two vectors. |
| [Vector similarity function](../../../sql-reference/functions-vector.md) | [VECTOR_COSINE_SIMILARITY](../../../sql-reference/functions/vector_cosine_similarity.md) | Measures the cosine similarity between two vectors, which is the angular distance between the vectors in a multi-dimensional space. |
| [LLM Function](../../../user-guide/snowflake-cortex/aisql.md) | [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text-snowflake-cortex.md) | Creates a vector embedding for a given string of text in English. |

---
title: May 07, 2024 — Cortex LLM Functions — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-07-llm-functions-ga.md
section: Release Notes
---

# May 07, 2024 — Cortex LLM Functions — *General Availability*

With this release, we are pleased to announce the general availability of LLM Functions. LLM Functions gives you instant access
to industry-leading large language models (LLMs), including
[Snowflake Arctic](https://www.snowflake.com/blog/arctic-open-efficient-foundation-language-models-snowflake/),
an open enterprise-grade model developed by Snowflake.

The available functions include:

* COMPLETE: Given a prompt, returns a response that completes the prompt. This function accepts either a single prompt
  or a conversation with multiple prompts and responses.
* EXTRACT_ANSWER: Given a question and unstructured data, returns the answer to the question if it can be found in the
  data.
* SENTIMENT: Returns a sentiment score, from -1 to 1, representing the detected positive or negative sentiment of the
  given text.
* SUMMARIZE: Summarizes the given text.
* TRANSLATE: Translates the given text from any supported language to any other.

With this GA release, Cortex LLM Functions will be made available to all accounts. For immediate access, see the
[LLM Functions required privileges section](../../../user-guide/snowflake-cortex/aisql.md) for instructions on adding the CORTEX_USER role
to your user roles. Over the next two weeks (5/7/2024-5/21/2024), the CORTEX_USER role will be added to the PUBLIC role.

For more information, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: May 08, 2024 — New model for vector embedding — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-08-embed-text-model.md
section: Release Notes
---

# May 08, 2024 — New model for vector embedding — *Preview*

We’re pleased to announce a change that will make it easier for you to build generative AI workflows using
the [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text-snowflake-cortex.md) function, part of the
[Cortex LLM Functions](../../../user-guide/snowflake-cortex/aisql.md).

* The `snowflake-arctic-embed-m` model is now available for text embedding tasks. This model was trained by
  Snowflake. It outperforms the existing `e5-base-v2` model on standard retrieval benchmarks, while keeping the same
  number of parameters. Read more in the [model announcement blog](https://www.snowflake.com/blog/introducing-snowflake-arctic-embed-snowflakes-state-of-the-art-text-embedding-family-of-models/).

---
title: May 08, 2024 — Snowflake Notifications Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-08.md
section: Release Notes
---

# May 08, 2024 — Snowflake Notifications Release Notes

## New SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure for sending notifications

If you need to send notifications to an email address or a queue provided by a Cloud service (Amazon SNS, Google Cloud PubSub,
or Azure Event Grid), use the
[SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure](../../../user-guide/notifications/snowflake-notifications.md).

With a single call to this stored procedure, you can:

* Send a message to multiple types of destinations (email addresses and queues).
* Send a message to multiple email addresses and queues.
* Send a message in a specified format, according to the type of notification integration (plain text or HTML for email, or
  JSON for queues).

For example, with a single call, you can send messages in plain text, HTML, and JSON formats to multiple email addresses and
multiple SNS, PubSub, and Event Grid topics.

You can use multiple notification integrations to send the notification to different queues. You can also create multiple email
notification integrations that have different sets of email addresses and subject lines, making it easier to configure email
messages for different recipients.

For details, see [Using SYSTEM$SEND_SNOWFLAKE_NOTIFICATION to send notifications](../../../user-guide/notifications/snowflake-notifications.md).

---
title: May 08, 2024 — Streamlit in Snowflake Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-08-sis.md
section: Release Notes
---

# May 08, 2024 — Streamlit in Snowflake Release Notes

## Streamlit in Snowflake: Custom sleep timer —– *Preview*

With this release, we are pleased to announce the preview of Streamlit in Snowflake: Custom sleep timer.
You can now set a custom sleep timer for a Streamlit app to auto-suspend by creating a `config.toml`
configuration file and specifying the timer. You can set the timer to any value between 5 to 240 minutes.

For more information, see [Custom sleep timer for a Streamlit app](../../../developer-guide/streamlit/features/sleep-timer.md).

---
title: May 08, 2025: Document AI updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-08-document-ai.md
section: Release Notes
---

# May 08, 2025: Document AI updates

The release of a new version of the foundational Arctic-TILT model in Document AI includes
improvements in checkbox identification.

These improvements are available to new Document AI model builds.

---
title: May 08, 2025: Dynamic tables: Support for IS_ROLE_IN_SESSION in access policies (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-08-dynamic-tables-is-role-in-session.md
section: Release Notes
---

# May 08, 2025: Dynamic tables: Support for IS_ROLE_IN_SESSION in access policies (*General availability*)

Dynamic tables now support base tables with row access or masking policies that use the [IS_ROLE_IN_SESSION](../../../sql-reference/functions/is_role_in_session.md)
function for both incremental and full refresh modes.

---
title: May 08-09, 2024 — 8.18 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_18.md
section: Release Notes
---

# May 08-09, 2024 — 8.18 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### Dynamic pivot available

A dynamic pivot query uses the ANY keyword or a subquery in the [PIVOT](../../sql-reference/constructs/pivot.md) subclause instead of specifying the pivot values explicitly.
With dynamic pivot, the pivot values are determined at runtime based on the use case.

### Added support for structured data types in UDFs

[Structured data types](../../sql-reference/data-types-structured.md) are now supported in user-defined functions (UDFs) created in [Java](../../developer-guide/udf/java/udf-java-introduction.md), [Python](../../developer-guide/udf/python/udf-python-introduction.md), and [Scala](../../developer-guide/udf/scala/udf-scala-introduction.md). For information about
data type mappings for structured data types, see [Data Type Mappings Between SQL and Handler Languages](../../developer-guide/udf-stored-procedure-data-type-mapping.md).

### New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Semi-structured (higher-order) | [FILTER](../../sql-reference/functions/filter.md) | Filters an [array](../../sql-reference/data-types-semistructured.md) based on the logic in a Lambda expression. |
| Semi-structured (higher-order) | [TRANSFORM](../../sql-reference/functions/transform.md) | Transforms an [array](../../sql-reference/data-types-semistructured.md) based on the logic in a Lambda expression. |
| System function | [SYSTEM$VALIDATE_STORAGE_INTEGRATION](../../sql-reference/functions/system_validate_storage_integration.md) | Validates the configuration for a specified storage integration. |

## Extensibility updates

### Python user-defined aggregate functions — *Preview*

With this release, Snowflake is pleased to announce the public preview of support for writing user-defined aggregate functions (UDAFs)
with a Python handler. You can use Snowpark Python APIs to create and call user-defined aggregate functions (UDAFs), which take one
or more rows as input and produce a single row of output. A UDAF operates on values across multiple rows to perform mathematical
calculations such as sum, average, counting, finding minimum or maximum values, standard deviation, and estimation, as well as some
non-mathematical operations.

For more information, see:

* [Python user-defined aggregate functions](../../developer-guide/udf/python/udf-python-aggregate-functions.md)
  :   (for SQL- and Python-based instructions)
* [Creating User-Defined Aggregate Functions (UDAFs) for DataFrames in Python](../../developer-guide/snowpark/python/creating-udafs.md)
  :   (for a Snowpark Python-based guide)

### Access to external network locations on AWS in the Gov region — *Preview*

With this release, Snowflake is pleased to announce the public preview of access to external network locations from function and
procedure handlers for code deployed in the AWS Gov region.

When setting up external network access, you create a network rule that represents the external network location. If your handler
code will need to authenticate with the external location, you create a secret containing the credentials needed. In handler code,
you can use APIs to retrieve credential values from the secret.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 06-May-24 |
| *Validate storage integration* | **Added** to *New SQL functions* section | 09-May-24 |

---
title: May 10, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-05-10.md
section: Release Notes
---

# May 10, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## New Create menu for Snowsight —– *General Availability*

With this release, we are pleased to announce the general availability of a new Create menu in the Snowsight navigation menu. The new menu provides a shortcut for creating the following items:

* SQL worksheet
* Python worksheet
* Streamlit App
* Dashboard
* Table
* Stage
* View

By selecting the new Create menu, you can also access the Add Data page, which includes all the different ways of loading data into Snowflake.

For more information, see [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md).

## New Add Data page —– *General Availability*

With this release, we are pleased to announce the general availability of a new Add Data page in Snowsight. To access the Add Data page, at the top of the navigation menu, select  (Create) » Add Data, or in the navigation menu, select Ingestion » Add Data.

The Add Data page provides a combined view and quick access to all the data loading methods that Snowflake supports. You can easily access tasks such as the following:

* Loading data into a table
* Loading files into a stage
* Creating an external stage
* Loading data from other cloud providers
* Loading data using the Snowflake connectors

For more information, see [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md).

---
title: May 13, 2024 — ASOF JOIN Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-13-asof-join.md
section: Release Notes
---

# May 13, 2024 — ASOF JOIN Release Notes

## ASOF JOIN — *General Availability*

With this release, we are pleased to announce the general availability of the ASOF JOIN construct, which joins rows from tables based on proximity (commonly temporal proximity). For each row in the first (or left) table, the join finds a single row in the second (or right) table that has the closest value. For instance, when you are working with time-series data, the closest match could be equal in time, earlier in time, or later in time, depending on the specified comparison operator.

This type of join is very useful for aligning time-series data sets, such as financial trading data, data collected from sensors, or any data set that is historical in nature. Although ASOF JOIN queries can be emulated through the use of complex SQL, other types of joins, and window functions, these queries are easier to write (and are usually performant in comparison to workarounds) if you use the ASOF JOIN syntax.

For more information, see [ASOF JOIN](../../../sql-reference/constructs/asof-join.md) and [Analyzing time-series data](../../../user-guide/querying-time-series-data.md).

---
title: May 13, 2025: Support for Streamlit 1.44.0 (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-13-sis.md
section: Release Notes
---

# May 13, 2025: Support for Streamlit 1.44.0 (General availability)

Version 1.44.0 of the Streamlit open-source library is now supported in Streamlit in Snowflake.

---
title: May 13-15, 2024 — 8.19 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_19.md
section: Release Notes
---

# May 13-15, 2024 — 8.19 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Serverless alerts — *Preview*

With this release, we are pleased to announce the preview of the serverless compute model for Snowflake alerts.

When you configure an alert to use the serverless compute model, Snowflake automatically resizes and scales up or down the compute resources
required for the alert. Snowflake determines the ideal size of the compute resources for a given run based on a dynamic analysis of
statistics for the most recent previous runs of the same alert.

To use the serverless compute model for an alert, omit the WAREHOUSE parameter when executing the CREATE ALERT command.

For more information, see [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md).

## Security updates

### Tri-Secret Secure self-registration

With this release, we are pleased to announce the Tri-Secret Secure self-registration process for customer-managed keys (CMKs). You can use
the process to register and activate a CMK for use with Tri-Secret Secure. Additionally, if you decide to replace a CMK, the
self-registration process informs you whether your new CMK is registered and activated. After you complete the self-registration process,
you can contact Snowflake Support to enable your Snowflake account to use Tri-Secret Secure.

For more information, see [Tri-Secret Secure in Snowflake](../../user-guide/security-encryption-tss.md).

## SQL updates

### Jinja2 template support for EXECUTE IMMEDIATE FROM — *Preview*

With this release, we are pleased to announce the preview of templating support for the EXECUTE IMMEDIATE FROM command. You can generate and
execute SQL scripts using a Jinja2 template file. Templating enables more flexible control structures and enables parameterization using
template variables. For example, the deployment target of the objects defined in a script can be dynamically chosen using a template file.

For more information, see [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).

## Data loading/unloading updates

### Resolved a known issue for INCLUDE_METADATA copy option

Previously, for CSV only, there was a known issue when the `INCLUDE_METADATA` copy option was used with `MATCH_BY_COLUMN_NAME`. With this release, we are pleased to announce that this issue is resolved.

For more information, see [Copy options (copyOptions)](../../sql-reference/sql/copy-into-table.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 13-May-24 |
| *Structured type evolution for Apache Iceberg™ tables* | **Removed** from *Data lake updates* section | 14-MAY-24 |
| *Resolved a known issue for INCLUDE_METADATA copy option* | **Added** to *Data loading/unloading updates* section | 15-MAY-24 |

---
title: May 14, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-14-dcr.md
section: Release Notes
---

# May 14, 2024 — Snowflake Data Clean Rooms Release Notes

## Tracing user activity in the web app — *General Availability*

With this release, we are pleased to announce that Snowflake Data Clean Room administrators can attribute activity in the web app to specific
users. All activities performed in the web app are logged in the query history of the Snowflake account associated with the clean room
environment. Now, this query history includes a `user_email` query tag that identifies which user performed an action.

For more information about tracing user activity in the web app, see [Monitor clean rooms UI activity](../../../user-guide/cleanrooms/admin-tasks.md).

---
title: May 14, 2024 — Snowsight Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/ui/2024-05-14.md
section: Release Notes
---

# May 14, 2024 — Snowsight Release Notes

This document provides an overview of the new features, enhancements, and other important changes introduced in this update to
Snowsight.

If you have additional questions, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Edit tasks in Snowsight —– *Preview*

With this release, we are pleased to announce the preview of task editing in Snowsight.

You can now edit the following fields in your existing tasks:

* Comment
* Schedule
* Compute type and warehouse
* Task parameters
* Task graph parameters

For more information, see [View and manage individual tasks](../../../user-guide/ui-snowsight-tasks.md).

## Finalizer tasks in Snowsight —– *General availability*

With this release, we are pleased to announce that finalizer tasks are now linked to the root task of task graphs in the task graph view.

For more information, see [View and manage task graphs](../../../user-guide/ui-snowsight-tasks.md).

---
title: May 14, 2024 — Streamlit in Snowflake Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-14-sis.md
section: Release Notes
---

# May 14, 2024 — Streamlit in Snowflake Release Notes

Streamlit in Snowflake: Support for GCP — General Availability

With this release, we are pleased to announce the general availability of Streamlit in Snowflake on Google Cloud Platform (GCP), which was previously available as a preview feature.

For more information, see [About Streamlit in Snowflake](../../../developer-guide/streamlit/about-streamlit.md).

---
title: May 14, 2025: Data Governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-14-tag-propagation.md
section: Release Notes
---

# May 14, 2025: Data Governance release notes

## Automatic propagation of user-defined tags (*General availability*)

You can now configure an object tag so it is automatically propagated from a source object to target objects, which streamlines tag
management across objects and ensures that data protection policies associated with tags get consistently applied to target objects.

Tags can be configured to propagate in the following scenarios:

* When a target object depends on a source object (for example, a view based on a tagged table). Tags propagated to target objects that
  depend on a source object are continuously updated as the tags change on the source object.
* When data moves from a source object to another object (for example, using an INSERT statement that uses a query to update a table with
  data from another table).

For more information, see [Automatic tag propagation with user-defined tags](../../../user-guide/object-tagging/propagation.md).

---
title: May 15, 2025: Organizational listings: discovery and access
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-15-dna.md
section: Release Notes
---

# May 15, 2025: Organizational listings: discovery and access

This release includes enhancements to organizational listings: discovery and access.

With this release, Listing creators can now configure both the access and discovery of [organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md).

Listing creators can now define how various accounts and roles in their organization can discover and access the organizational listings. For example, to specify who can discover and request access to a listing, the listing owner can select the entire organization, a list of accounts, or a specific role(s) in an account. Similarly, the listing creator can also specify who can access the Listing directly in the Internal Marketplace. The possible values are the entire organization, a list of accounts, or a specific role(s) in an account.

For more information see [Create an organizational listing](../../../user-guide/collaboration/listings/organizational/org-listing-create.md).

---
title: May 16, 2024 — Vector data type and vector similarity functions — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-16-vector-data-type-ga.md
section: Release Notes
---

# May 16, 2024 — Vector data type and vector similarity functions — *General Availability*

With this release, we are pleased to announce the general availability of VECTOR data type, vector similarity functions,
and the vector embedding function. These features enable important applications that require semantic
vector search and retrieval.

For more information, see [Vector Embeddings](../../../user-guide/snowflake-cortex/vector-embeddings.md).

## New SQL data type

The following data type is now generally available with this release:

| Category | New data type | Description |
| --- | --- | --- |
| Vector | [VECTOR](../../../sql-reference/data-types-vector.md) | With the VECTOR data type, Snowflake encodes and processes vectors efficiently. This data type supports semantic vector search and retrieval applications, such as RAG-based applications, and common operations on vectors in vector-processing applications. |

## New SQL functions

The following functions are now generally available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| [Vector Similarity Function](../../../sql-reference/functions-vector.md) | [VECTOR_INNER_PRODUCT](../../../sql-reference/functions/vector_inner_product.md) | Returns the inner product of two vectors. The inner product (also known as the dot or scalar product) multiplies two vectors |
| [Vector Similarity Function](../../../sql-reference/functions-vector.md) | [VECTOR_L2_DISTANCE](../../../sql-reference/functions/vector_l2_distance.md) | Measures the L2 distance between two vectors. |
| [Vector Similarity Function](../../../sql-reference/functions-vector.md) | [VECTOR_COSINE_SIMILARITY](../../../sql-reference/functions/vector_cosine_similarity.md) | Measures the cosine similarity between two vectors, which is the angular distance between the vectors in a multi-dimensional space. |
| [LLM Function](../../../user-guide/snowflake-cortex/aisql.md) | [EMBED_TEXT_768 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text-snowflake-cortex.md) | Creates a vector embedding for a given string of text in English. |

---
title: May 16, 2025: Cost Management release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-16-cost.md
section: Release Notes
---

# May 16, 2025: Cost Management release notes

## Cost anomalies (*Preview*)

Snowflake can now automatically detect cost anomalies based on prior levels of consumption, which simplifies the process of
identifying spikes or dips in costs so you can find ways to optimize your spend. You can use this feature to identify both account-level and
organization-level cost anomalies.

For more information, see [Introduction to cost anomalies](../../../user-guide/cost-anomalies.md).

---
title: May 16, 2025: Universal Search support for pipes, tasks, and streams (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-16-universal-search-pipes-tasks-streams.md
section: Release Notes
---

# May 16, 2025: Universal Search support for pipes, tasks, and streams (*General availability*)

With this release, Snowsight now displays pipes, tasks, and streams in Universal Search results, making it easier
to discover relevant assets.

For details on Universal Search, see [Search Snowflake objects and resources](../../../user-guide/ui-snowsight-universal-search.md).

---
title: May 17, 2024 — Document AI Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-17-document-ai.md
section: Release Notes
---

# May 17, 2024 — Document AI Release Notes

## Document AI —– *Preview*

With this release, we are pleased to announce the preview of Document AI.

Document AI enables setting up intelligent document processing (IDP) workflows within Snowflake by extracting information from documents,
such as invoices or contracts, and directly applying it to operational workflows. Document AI is powered by Snowflake Arctic-TILT
(Text Image Layout Transformer), a proprietary large language model (LLM).

With Document AI, you can prepare pipelines for continuous processing of new documents of a specific type, and turn
unstructured data from documents into structured data in tables.

Document AI is available to accounts in AWS and Microsoft Azure commercial regions, with the exception of:

* AWS Asia Pacific (Singapore)
* AWS Asia Pacific (Osaka)
* AWS EU (Paris)

---
title: May 19, 2025: Cortex COMPLETE Structured Output schema references
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-19-complete-structured-output-json-refs.md
section: Release Notes
---

# May 19, 2025: Cortex COMPLETE Structured Output schema references

Snowflake announces support for schema references in Cortex COMPLETE Structured Outputs, making it easier
for developers to create and maintain complex schemas. The new `$ref` mechanism allows developers to define
common components once and reference them throughout their schema. This enhancement also unlocks compatibility with
third-party libraries like Pydantic that rely on schema references, enabling developers to use existing Pydantic
schemas with COMPLETE Structured Outputs.

Key benefits include:

* **Use existing schemas:** Streamlined development workflow for Python developers already using Pydantic in their application code.
* **Maintenance simplicity:** Change definitions in one place and all references automatically inherit updates.
* **Error reduction:** Standardized referenced components eliminate discrepancies across implementations.
* **Scalability:** Referenced components allow you to extend functionality without duplicating validation logic
* **Schema clarity:** References create a clear, organized hierarchy that better represents real-world relationships.

To get started, see [COMPLETE Structured Outputs](../../../user-guide/snowflake-cortex/complete-structured-outputs.md).

---
title: May 19, 2025: Snowflake ML Data Connector release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-19-data-connector-container-runtime.md
section: Release Notes
---

# May 19, 2025: Snowflake ML Data Connector release notes

## Snowflake ML Data Connector for Container Runtime (*General availability*)

The Snowflake ML Data Connector is now generally available for use with container runtime instances, such as notebook sessions and ML jobs. This connector enables you to efficiently ingest data from Snowflake data sources
into your containerized Python environments. It leverages distributed processing to accelerate data loading.

Key capabilities include:

* Data loading from any Snowflake data source (tables or stages) directly into a pandas dataframe for use in open source ML workflows.
* Create PyTorch and TensorFlow datasets from Snowflake data for seamless integration with popular ML frameworks.
* Use the same code both inside and outside of Snowflake’s container runtime.
* Support for both Snowpark DataFrames (ideal for development) and Snowflake Datasets (versioned, schema-level objects for production).
* Integration with Snowflake’s distributed APIs for large-scale model training and tuning.

---
title: May 19, 2025: Snowflake Notebooks Container Runtime - Support for Azure and Azure Private Link (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-19-nb-spcs-azure-pl-ga.md
section: Release Notes
---

# May 19, 2025: Snowflake Notebooks Container Runtime - Support for Azure and Azure Private Link (*General availability*)

We are pleased to announce the general availability of Container Runtime for Azure and Azure Private Link in Snowflake Notebooks Container Runtime.

Snowflake Notebooks is a development interface in Snowsight that offers an interactive, cell-based programming environment for Python
and SQL. In Snowflake Notebooks, you can perform exploratory data analysis, develop machine learning models, and perform other data science and
data engineering tasks all in one place.

For more information on setting up PrivateLink for Notebooks, see [Private connectivity for Notebooks](../../../user-guide/ui-snowsight/notebooks-privatelink.md).

---
title: May 20, 2024 — Cost Management Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-20-cost.md
section: Release Notes
---

# May 20, 2024 — Cost Management Release Notes

## Cost Insights — *General Availability*

With this release we are pleased to announce the general availability of cost insights, a cost management tool that lets you identify
opportunities for savings within an account. There are several different cost insights. For example, one cost insight identifies large
tables that are rarely queried, giving you an opportunity to optimize your spend on storage by potentially dropping the table. Another
insight identifies data that could be stored in temporary or transient tables, which saves on Fail-safe and Time Travel costs.

The cost insights appear as a tile on the Account Overview page, which is available from Admin » Cost Management.

For more information about the available insights along with recommended actions to take to optimize your spend, see
[Using cost insights to save](../../../user-guide/cost-insights.md).

---
title: May 20, 2025: Data Governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-20-contacts.md
section: Release Notes
---

# May 20, 2025: Data Governance release notes

## Contacts for objects (*Preview*)

You can now associate contacts with objects such as databases and tables so users know who to reach for assistance with the object. Each contact
is a schema-level object that contains details about how to communicate with the user or group of users, for example, whether to use an email
address or access a URL. An object can have multiple contacts as long as the purpose of each contact is different. For example, a table might
have one contact for access approval and another contact for general support.

For more information, see [Using Contacts](../../../user-guide/contacts-using.md).

> **Note:**
>
> This feature is being rolled out gradually, and will be available to all accounts within five weeks.

---
title: May 20, 2025: Snowflake Copilot model level RBAC
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-20-model-level-rbac.md
section: Release Notes
---

# May 20, 2025: Snowflake Copilot model level RBAC

Snowflake Copilot now supports role-based access control (RBAC) at the model level. RBAC at the model level lets account administrators control which large language models (LLMs) can be used by Snowflake Copilot and other Cortex features based on user role.

For more information, see [Limit models used by Snowflake Copilot](../../../user-guide/snowflake-copilot.md).

---
title: May 20, 2025: Snowflake Openflow (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-20-openflow.md
section: Release Notes
---

# May 20, 2025: Snowflake Openflow (*Preview*)

Snowflake announces the preview of Snowflake Openflow, an integration service that connects any data source and any
destination with hundreds of processors supporting structured and unstructured text, images, audio, video and sensor data.
Built on Apache NiFi, Openflow lets you run a fully managed service in your own cloud for complete control.

Use Openflow if you are looking to fetch data from any source and put it
in any destination with minimal management, coupled with Snowflake’s built-in data security and governance.

Some of the use cases of Openflow are as follows:

* Ingest data from unstructured data sources, such as Google Drive and Box, and make
  it ready for chat in your AI assistants with Snowflake Cortex or use the data for your own custom processing.
* Replicate the change data capture (CDC) of database tables into Snowflake for comprehensive, centralized
  reporting
* Ingest real-time events from streaming services, such as Apache Kafka, into Snowflake for near real-time analytics
* Ingest data from SaaS platforms, such as LinkedIn Ads, to Snowflake for reporting, analytics and insights
* Create your own data flow using Snowflake and NiFi [processors](../../../user-guide/data-integration/openflow/processors/index.md) and [controllers](../../../user-guide/data-integration/openflow/controllers/index.md).

To get started with Openflow, see [About Openflow](../../../user-guide/data-integration/openflow/about.md).

---
title: May 20, 2025: Snowpark Container Services preview available in Google Cloud (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-20-spcs-preview-available-in-gcp.md
section: Release Notes
---

# May 20, 2025: Snowpark Container Services preview available in Google Cloud (*Preview*)

Snowpark Container Services preview is now available to Snowflake accounts in Google Cloud. For more information,
see [Snowpark Container Services](../../../developer-guide/snowpark-container-services/overview.md).

---
title: May 20-22, 2024 — 8.20 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_20.md
section: Release Notes
---

# May 20-22, 2024 — 8.20 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Trust Center — *Preview*

With this release, we are pleased to announce the preview of the Trust Center. You can use the Trust Center to evaluate and monitor
your account for security risks in Snowsight. The Trust Center also provides recommendations on how to remediate security risks
found in your account.

For more information, see [Trust Center](../../user-guide/trust-center/overview.md).

## SQL updates

### CREATE OR ALTER TABLE and CREATE OR ALTER TASK — *Preview*

With this release, we are pleased to announce the preview of the CREATE OR ALTER TABLE and CREATE OR ALTER TASK commands. CREATE OR ALTER
commands combine the functionality of the CREATE command and the ALTER command. A CREATE OR ALTER statement executes as a CREATE statement
if the object doesn’t exist. If it does exist, it transforms the object according to the object definition in the statement.

CREATE OR ALTER TABLE provides a declarative and idempotent approach to defining your tables and tasks. If a table is transformed, data is
preserved when possible.

For more information, see [CREATE TABLE](../../sql-reference/sql/create-table.md) and [CREATE TASK](../../sql-reference/sql/create-task.md).

## Apache Iceberg™ table updates

### Apache Iceberg™ tables — *General availability*

With this release, we are pleased to announce the general availability of Apache Iceberg™ tables for Snowflake.

Iceberg tables for Snowflake combine the performance and query semantics of regular Snowflake tables
with external cloud storage that you manage. They are ideal for maintaining a single copy of data
with interoperability across a variety of compute engines.

For more information, see [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

### Replace invalid UTF-8 characters in Iceberg tables

With this release, we are pleased to announce support for specifying whether Snowflake should replace invalid UTF-8 characters with the
Unicode replacement character (`�`) in query results for Iceberg tables that use an external catalog.

For more information, see [REPLACE_INVALID_CHARACTERS](../../sql-reference/parameters.md).

### Structured type evolution for Iceberg tables

With this release, we are pleased to announce structured type evolution for Snowflake-managed Iceberg tables.

For more information, see [ALTER ICEBERG TABLE … ALTER COLUMN … SET DATA TYPE (structured types)](../../sql-reference/sql/alter-iceberg-table-alter-column-set-data-type.md).

### Set a storage serialization policy

With this release, we are pleased to announce support for setting a storage serialization policy for Iceberg tables that use Snowflake
as the catalog. The storage serialization policy specifies the type of encoding and compression that Snowflake uses for table data files.

For more information, see [STORAGE_SERIALIZATION_POLICY](../../sql-reference/parameters.md) and [CREATE ICEBERG TABLE](../../sql-reference/sql/create-iceberg-table.md).

### Change ALLOW_WRITES to FALSE for external volumes

With this release, we are pleased to announce that you can block write operations on an external volume by setting the ALLOW_WRITES
parameter to FALSE.

For more information, see [ALTER EXTERNAL VOLUME](../../sql-reference/sql/alter-external-volume.md).

### New ICEBERG_ACCESS_ERRORS view

With this release, we are pleased to announce the release of the ICEBERG_ACCESS_ERRORS view in the Snowflake MONITORING schema.
The view displays information about external volume access errors for your Snowflake account.

For more information, see [ICEBERG_ACCESS_ERRORS view](../../sql-reference/monitoring/iceberg_access_errors.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 20-May-24 |
| *Apache Iceberg™ tables - GA* | **Added** to *Apache Iceberg™ table updates* section | 10-JUN-24 |

---
title: May 2023
source: https://docs.snowflake.com/en/release-notes/2023-05.md
section: Release Notes
---

# May 2023

The following new features, behavior changes, and updates (enhancements, fixes, etc.) have been introduced this month. If you have any
questions about these additions, please contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Important:**
>
> Each release may include updates that require the web interface to be refreshed.
>
> As a general practice, to ensure these updates do not impact your usage, we recommend refreshing the web interface after each Snowflake
> release has been deployed.

## New Features

### Logging and Tracing in Procedures and Functions — *Preview*

With this release, we are pleased to announce the preview of
[event tables, logging, and tracing](../developer-guide/logging-tracing/logging-tracing-overview.md). With this feature, you can emit log
message data and trace data from procedure and function handler code and have the data collected in an event table for analysis later.
Snowflake supports APIs for each of the supported handler languages.

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_GENERATE_RANGE](../sql-reference/functions/array_generate_range.md) | Returns an ARRAY of integer values within a specified range (e.g. [2, 3, 4]). |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_REMOVE](../sql-reference/functions/array_remove.md) | Given a source ARRAY, returns an ARRAY with elements of the specified value removed. |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_REMOVE_AT](../sql-reference/functions/array_remove_at.md) | Given a source ARRAY, returns an ARRAY with the element at the specified position removed. |

### Support for GEOMETRY Data Type — *General Availability*

With this release, we are pleased to announce the general availability of support for the new GEOMETRY data type. The GEOMETRY data type
represents features in a planar (Euclidean, Cartesian) coordinate system. This feature is available to all Snowflake accounts.

This release provides functions for constructing, formatting, measuring, and computing relationships between GEOMETRY objects. Using these
functions, you can construct GEOMETRY objects from data in standard formats, including WKT, WKB, and GeoJSON.

For more information, refer to [Geospatial data types](../sql-reference/data-types-geospatial.md).

### Geospatial Functions for Shape Transformation and Orientation — *General Availability*

With this release, we are pleased to announce the general availability of the following geospatial functions for shape transformation and
orientation:

| Function | Description |
| --- | --- |
| [ST_BUFFER](../sql-reference/functions/st_buffer.md) (for GEOMETRY objects) | Returns a GEOMETRY object that represents a MultiPolygon containing the points within a specified distance of the input GEOMETRY object. The returned object effectively represents a “buffer” around the input object. |
| [ST_SIMPLIFY](../sql-reference/functions/st_simplify.md) (forGEOMETRY objects) | Given an input GEOMETRY object that represents a line or polygon, returns a simpler approximation of the object. The function identifies and removes selected vertices, resulting in a similar object that has fewer vertices. |
| [ST_AZIMUTH](../sql-reference/functions/st_azimuth.md) (for GEOMETRY objects) | Given two Points that are GEOMETRY objects, returns the azimuth (in radians) of the line segment formed by the two Points. |
| [ST_MAKEPOLYGONORIENTED](../sql-reference/functions/st_makepolygonoriented.md) (for GEOGRAPHY objects) | Constructs a GEOGRAPHY object that represents a polygon without holes. The function uses the specified LineString as the outer loop. This function does not attempt to correct the orientation of the loop, thus allowing for the creation of polygons that span more than half of the globe, as opposed to [ST_MAKEPOLYGON , ST_POLYGON](../sql-reference/functions/st_makepolygon.md), which inverts the orientation of those large shapes. |

### Support for Specifying How to Handle Invalid Geospatial Shapes — *General Availability*

With this release, we are pleased to announce the general availability of support for handling invalid geospatial shapes.

By default, when you use a [geospatial conversion function](../sql-reference/functions-geospatial.md) to convert
[data in a supported input format](../sql-reference/data-types-geospatial.md) to a GEOGRAPHY or GEOMETRY object, the function attempts to
validate the shape and repair the shape if the shape is invalid. If the shape cannot be repaired, the function does not create a GEOGRAPHY
or GEOMETRY object.

With this feature, you have more control over the validation and repair process. You can:

* Allow these conversion functions to create GEOGRAPHY and GEOMETRY objects for invalid shapes.
* Determine if the shape for a GEOGRAPHY or GEOMETRY object is invalid.

For more information, refer to [Specifying how invalid geospatial shapes are handled](../sql-reference/data-types-geospatial.md).

### Data Sharing Usage: New LISTING_AUTO_FULFILLMENT Views — *Preview*

With this release, we are pleased to announce the preview of two new views added to the data sharing usage schema (in the SNOWFLAKE
shared database) to provide information to help manage the cost of Cross-Cloud Auto-fulfillment.

The [LISTING_AUTO_FULFILLMENT_DATABASE_STORAGE_DAILY
View](../sql-reference/data-sharing-usage/listing-auto-fulfillment-database-storage-daily.md) provides details about storage costs associated
with storing replicated data in remote Snowflake regions for the purposes of fulfilling consumer demand for a listing’s data product in a
region.

The [LISTING_AUTO_FULFILLMENT_REFRESH_DAILY View](../sql-reference/data-sharing-usage/listing-auto-fulfillment-refresh-daily.md) provides
details about compute costs associated with refreshing the data associated with specific listings to supported Snowflake regions.

## Data Collaboration Updates

### Cross-Cloud Auto-Fulfillment for Listings — *General Availability*

With this release, we are pleased to announce the general availability of cross-cloud auto-fulfillment for listings, whether you share
listings publicly or with specific accounts.

Ensure consumers have fresh, up-to-date data across cloud regions by using cross-cloud auto-fulfillment to offer your data product
directly to specific accounts across the globe or on-demand in regions you choose.

For more information, refer to [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md).

## Data Governance Updates

### Memoizable Functions — *General Availability*

With this release, Snowflake is pleased to announce the general availability of memoizable functions. A memoizable function caches the
result of calling a user-defined function (UDF) and then returns the cached result when the output is needed at a later time.

Using memoizable functions improves performance for complex queries, such as multiple column lookups in mapping tables referenced within a
row access policy or masking policy. Currently, memoizable functions are available for scalar SQL UDFs only. This feature was announced in
preview in January 2023.

For more information, refer to [Memoizable UDFs](../developer-guide/udf/sql/udf-sql-scalar-functions.md).

## Web Interface Updates

### Access Snowsight with an Account Name URL — *General Availability*

With this release, Snowflake is pleased to announce a new URL format for Snowsight that identifies an account using its name and
organization. From now on, users should access Snowsight using the following URL format:

```none
https://app.snowflake.com/<orgname>/<account_name>
```

This change automatically applies to all accounts and organizations.

Previously, Snowsight URLs identified an account by region and account locator.
Bookmarks and links that use this legacy format will continue to work and will automatically redirect to the new URL.

If you have network policies or firewall rules that specify a URL, you might need to update the policies and rules to match the new URL
format.

If you are unable to access Classic Console from Snowsight, temporarily change the default web interface for your user
profile to Classic Console and contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

> **Note:**
>
> This change is happening over time and might not be available in your Snowflake region immediately.

### Create Named Stages using Snowsight — *Preview*

With this release, we are pleased to announce the preview of creating and editing named stages using Snowsight without writing SQL.

To create or edit named stages, you can enter details into Snowsight including information about authentication or encryption for the stage.

For more information, refer to:

* [Choosing an internal stage for local files](../user-guide/data-load-local-file-system-create-stage.md)
* [Create an S3 stage](../user-guide/data-load-s3-create-stage.md)
* [Configure an integration for Google Cloud Storage](../user-guide/data-load-gcs-config.md)
* [Create an Azure stage](../user-guide/data-load-azure-create-stage.md)

### Managing Data Governance in Snowsight — *Preview*

With this release, we are pleased to announce the preview of the Data » Governance interface in Snowsight. The
Governance interface includes a Dashboard tab to monitor the most frequently used masking policies, row access policies, and
tags with their usage on tables and columns. The Governance interface also includes a Tagged Objects tab to report on the
Dashboard data, with the option to manually report on the usage of tags and policies on tables and columns.

When you select an element in the Dashboard, Snowsight automatically updates the Tagged Objects tab filters. Additionally,
when you select a row in the Tagged Objects tab, Snowsight automatically redirects you to the object or column in the
Data » Databases interface. You can then manage the policy and tag assignments as needed.

For more information, refer to:

* [Use Snowsight to set tags](../user-guide/object-tagging/work.md)
* [Monitor tags with Snowsight](../user-guide/object-tagging/monitor.md)
* [Monitor masking policies with Snowsight](../user-guide/security-column-intro.md)
* [Monitor row access policies with Snowsight](../user-guide/security-row-intro.md)

---
title: May 21, 2025: Snowflake Openflow updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-21-openflow.md
section: Release Notes
---

# May 21, 2025: Snowflake Openflow updates

Connector installation now utilizes a versioned process group created from the Connector Flow Registry Client, which allows
the connector to be seamlessly upgraded when new versions become available.

Runtimes that existed prior to this upgrade must be manually added to the Connector Flow Registry Client.

For more information about Snowflake Openflow, see [About Openflow](../../../user-guide/data-integration/openflow/about.md).

---
title: May 22, 2024 — SQL Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-22-table-references.md
section: Release Notes
---

# May 22, 2024 — SQL Release Notes

## Using the TABLE keyword as an alternative to SYSTEM$REFERENCE and SYSTEM$QUERY_REFERENCE

You can now use the TABLE keyword to get a
[reference](../../../developer-guide/stored-procedure/stored-procedures-calling-references.md) to a table, view, secure view, or query.

The reference created by this keyword is valid only for the scope of the call. In addition, the reference only confers the SELECT
privilege on the table, view, or secure view.

For example, rather than calling the [SYSTEM$REFERENCE](../../../sql-reference/functions/system_reference.md) function to get a reference to a table:

```sqlexample
CALL my_procedure(SYSTEM$REFERENCE('table', my_table));
```

you can use the TABLE keyword:

```sqlexample
CALL my_procedure(TABLE(my_table));
```

For details, see [Using the TABLE keyword to create a reference to a table, view, or query](../../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

---
title: May 22, 2025: New Snowsight navigation menu (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-22-snowsight-navigation-menu.md
section: Release Notes
---

# May 22, 2025: New Snowsight navigation menu (*Preview*)

Snowflake announces the preview of an updated navigation menu, organized by feature groups under key categories to help you find the tools you need more quickly:

* **Work with data:** Ingest, transform, analyze, and monitor data using integrated tools for development and automation.
* **Discover & collaborate:** Search, share, and manage data across teams and partners with cataloging and Marketplace features.
* **Manage:** Control access, optimize compute, and maintain governance across your Snowflake environment.

The new navigation menu is rolling out gradually and may not be available to all accounts.

For more information, see [Snowsight navigation menu](../../../user-guide/ui-snowsight-navigation.md).

---
title: May 22, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-22-dcr.md
section: Release Notes
---

# May 22, 2025: Snowflake Data Clean Rooms updates

> **Note:**
>
> **Clean rooms UI users** must sign out and back in to the clean rooms UI for these updates to take effect.
>
> **Clean rooms API users** must run the following SQL commands for these updates to take effect:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.apply_patch();
> ```
>
> **To enable auto-upgrades for API users,** run the following SQL commands:
>
> ```sqlexample
> USE ROLE SAMOOHA_APP_ROLE;
> CALL SAMOOHA_BY_SNOWFLAKE_LOCAL_DB.library.enable_local_db_auto_upgrades();
> ```

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

**Sign in with Snowflake Authentication:** You can now sign in to the clean rooms UI using your Snowflake credentials. The new flow works
seamlessly with your existing Snowflake SSO/SAML integration. If you have an existing Snowflake Data Clean Rooms account, use the wizard to
migrate your account with just a few clicks. [Learn more.](../../../user-guide/cleanrooms/update-to-oauth.md)

---
title: May 23, 2025: Notebooks st.secrets support for Warehouse and Container Runtimes (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-23-st-secrets-support.md
section: Release Notes
---

# May 23, 2025: Notebooks `st.secrets` support for Warehouse and Container Runtimes (*General availability*)

With this release, Notebooks now support the use of `st.secrets` for simplified and secure access to secrets. Users can securely store and
reference sensitive values such as API keys or credentials directly from Snowflake Notebooks. Secrets can be used in both Warehouse and
Container Runtimes, enabling developers to integrate external services while keeping credentials secure.

For details, see [Set up external access for Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-external-access.md).

---
title: May 27, 2025: Data sharing & collaboration for accounts in Kingdom of Saudi Arabia region
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-27-KSA-regions.md
section: Release Notes
---

# May 27, 2025: Data sharing & collaboration for accounts in Kingdom of Saudi Arabia region

With this release, we are pleased to announce that data sharing and collaboration
with listings is now available for accounts in the Kingdom of Saudi Arabia (KSA) region.

Customers using Snowflake’s Data Cloud in the [Middle East](../../../user-guide/intro-regions.md) can discover and securely share data,
as well as execute diverse analytic workloads, with Snowflake’s implementation on Azure UAE North in Dubai.

With the addition of a Snowflake deployment on GCP in the KSA (Dammam, me-central2 region),
regional customers have additional flexibility in their choice for deployment.

After accepting the cross-region disclaimer, anyone in your account with the required privileges can do the following:

* Publish free and limited trial listings on the Snowflake Marketplace.
* Share free listings directly with consumers.
* Access and install free and limited trial listings from the Snowflake Marketplace.
* Install free listings shared directly with your account.

For more details, see:

* [Prepare to provide listings from accounts in the Kingdom of Saudi Arabia (KSA) region](../../../collaboration/provider-listings-government-providers.md)
* [Prepare to access listings from accounts in the Kingdom of Saudi Arabia (KSA) region](../../../collaboration/consumer-becoming.md)

---
title: May 27, 2025: Security release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-23-mfa.md
section: Release Notes
---

# May 27, 2025: Security release notes

## New authentication methods for multi-factor authentication (MFA) (*General availability*)

Users enrolled in multi-factor authentication (MFA) enter their password, then use a second factor of authentication. With this release,
these users can now use the following MFA methods as their second factor of authentication:

* A passkey that can be stored in a variety of ways. For example, passkeys allow a user to authenticate with a hardware security key or by
  using their laptop fingerprint sensor.
* A preferred authenticator app that generates a time-based one-time passcode (TOTP).

Administrators can restrict which MFA methods are available to users who are setting up their second factor of authentication.

To learn how to set up one of these MFA methods, see [Configuring a second factor of authentication](../../../user-guide/security-mfa-second-factor.md).

---
title: May 27, 2025: Snowflake Native App with Snowpark Container Services support for Azure Private Link (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-27-na-spcs-azure-pl-ga.md
section: Release Notes
---

# May 27, 2025: Snowflake Native App with Snowpark Container Services support for Azure Private Link (*General availability*)

With this release, support for Azure Private Link in Snowflake Native App with Snowpark Container Services is generally available. Snowflake Native Apps can be deployed and operated with Private Link connectivity in Azure, enabling secure network isolation.

See [Azure Private Link and Snowflake](../../../user-guide/privatelink-azure.md) for more information.

---
title: May 28, 2024 — ML Functions Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-28-call-method-in-from-clause.md
section: Release Notes
---

# May 28, 2024 — ML Functions Release Notes

## Simpler SQL for storing results from ML functions

You can now call the [Forecast](../../../user-guide/ml-functions/forecasting.md) and
[Detect Anomalies](../../../user-guide/ml-functions/anomaly-detection.md) ML Functions directly in the FROM clause
of a SELECT statement. You can call methods like [<model_name>!DETECT_ANOMALIES](../../../sql-reference/classes/anomaly-detection/methods/detect_anomalies.md),
[<model_name>!FORECAST](../../../sql-reference/classes/forecast/methods/forecast.md), and
[<model_name>!SHOW_EVALUATION_METRICS](../../../sql-reference/classes/forecast/methods/show_evaluation_metrics.md) in the FROM clause.

You can use this technique to simplify the SQL statements for saving results to a table. For example, rather than using the
[SQLID](../../../developer-guide/snowflake-scripting/query-id.md) Snowflake Scripting variable with the
[RESULT_SCAN](../../../sql-reference/functions/result_scan.md) function to create a table containing these results:

```sqlexample
BEGIN
  CALL model!FORECAST(FORECASTING_PERIODS => 7);
  LET x := SQLID;
  CREATE TABLE my_forecasts AS SELECT * FROM TABLE(RESULT_SCAN(:x));
END;
SELECT * FROM my_forecasts;
```

you can use a query that directly selects from the results of calling the methods:

```sqlexample
CREATE TABLE my_forecasts AS
  SELECT * FROM TABLE(model!forecast(forecasting_periods => 7));
```

As shown in the example above, when calling the method, omit the [CALL](../../../sql-reference/sql/call.md) command. Instead, put the call
in parentheses, preceded by the TABLE keyword.

For details, see [Selecting columns from SQL class instance methods that return tabular data](../../../sql-reference/snowflake-db-classes.md).

In addition, as [announced earlier](2024-05-22-table-references.md) and shown in the example above,
you can use the TABLE keyword (rather than calling [SYSTEM$REFERENCE](../../../sql-reference/functions/system_reference.md)) to create a reference to
pass in to the method.

---
title: May 28, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-28-dcr.md
section: Release Notes
---

# May 28, 2024 — Snowflake Data Clean Rooms Release Notes

This topic provides an overview of the new features, enhancements, and other important changes introduced in this update to Snowflake Data
Clean Rooms.

## Multi-provider clean rooms via Developer APIs — *General Availability*

With this release, we are pleased to announce that consumers can use the developer API to run an approved workload across multiple clean
rooms, which lets them gain insights from datasets from more than one provider in the same analysis. Each provider controls which clean
rooms can be combined with their own clean room, and Snowflake Data Clean Rooms enforces the security guarantees of each provider’s clean
room.

For an extended example of how a multi-provider analysis works, see [Overview of Snowflake Data Clean Rooms](../../../user-guide/cleanrooms/overview.md).

## Additional supported regions — *General Availability*

With this release, we are pleased to announce that Snowflake Data Clean Rooms are now available in regions on Google Cloud Platform and in
Europe. The following new regions are supported:

**North America**

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Google Cloud Platform | US Central1 (Iowa) | us-central1 |
|  | US East4 (N. Virginia) | us-east4 |

**Europe**

| Cloud platform | Supported region | Cloud region ID |
| --- | --- | --- |
| Amazon Web Services | EU (Ireland) | eu-west-1 |
|  | Europe (London) | eu-west-2 |
|  | EU (Paris) | eu-west-3 |
|  | EU (Frankfurt) | eu-central-1 |
|  | EU (Stockholm) | eu-north-1 |
| Microsoft Azure | UK South (London) | uksouth |
|  | North Europe (Ireland) | northeurope |
|  | West Europe (Netherlands) | westeurope |
|  | Switzerland North (Zurich) | switzerlandnorth |
| Google Cloud Platform | Europe West2 (London) | europe-west2 |
|  | Europe West4 (Netherlands) | europe-west4 |

## Support for views in the web app — *General Availability*

With this release, we are pleased to announce that providers and consumers can use the web app to link views, materialized views, and
secure views into a clean room. All referenced databases in these views must be registered with SAMOOHA_APP_ROLE, in order for all necessary
reference usage grants to be applied. For secure views, the owner of the secure view must be the SAMOOHA_APP_ROLE role.

## Clean room customizations for identity & activation — *General Availability*

With this release, we are pleased to announce that providers can customize which activation, identity, and data provider partners display
as options within a clean room. For example, if a provider has a preferred activation partner, they can configure the clean room so the
consumer can only select that partner when activating results.

The administrator customizes the clean room environment using the new Admin » Clean Room Features menu.

## Custom template enhancements — *General Availability*

With this release, we are pleased to announce the following enhancements to the process of creating a user interface for a custom template:

* The template’s user interface can now include a date selector.
* Developers can hide the default table selectors from the template’s user interface.

For more information about these enhancements, see the `provider.add_ui_form_customizations` API in the
[Snowflake Data Clean Rooms: Provider API reference guide](../../../user-guide/cleanrooms/provider.md).

---
title: May 28, 2025: Organization users (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-29-org-users.md
section: Release Notes
---

# May 28, 2025: Organization users (*Preview*)

Multi-account organizations that need to have the same person be a user in more than one account can now create an organization user for the
person. Each organization user acts as a global user entity that can be imported into regular accounts by account administrators,
simplifying the process of having the same person have a user object in multiple accounts.

Organization users are grouped into logical units called organization user groups. When an account administrator imports an organization user
group to a regular account, all of its organization users are added to the account. The organization user group becomes an access control
role in the account, allowing you to have consistent roles across the organization.

If you have an existing user who you want to be an organization user, you can import the organization group into each account, then link the
existing local user object to the new organization user.

Organization users and organization user groups require an organization account.

For more information, see [Organization users](../../../user-guide/organization-users.md).

---
title: May 28-30, 2024 — 8.21 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_21.md
section: Release Notes
---

# May 28-30, 2024 — 8.21 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Triggered tasks — *Preview*

With this release, we are pleased to announce the preview of triggered tasks.

With triggered tasks, your tasks can run only when the related stream has new data. This replaces the need to continuously run your tasks to check for data in stream.

For more information, see [Triggered tasks](../../user-guide/tasks-triggered.md).

## SQL updates

### Email notification integrations no longer limited to 10

Previously, you could define no more than 10 email notification integrations for an account.

With this release, this limit has been lifted. You can define more than 10 email notification integrations for an account.

### UNPIVOT supports rows with NULLs in results

You can now use the `{ INCLUDE | EXCLUDE } NULLS` option in an UNPIVOT subclause to specify whether to include rows with NULLs in the results. By default, rows with NULLs are excluded. To include rows with NULLs in the results, specify `INCLUDE NULLS` in the UNPIVOT subclause.

For more information, see [UNPIVOT](../../sql-reference/constructs/unpivot.md).

## Data loading / unloading updates

### New Parquet file format option USE_VECTORIZED_SCANNER — *General Availability*

With this release, we are pleased to announce the general availability of a new file format option `USE_VECTORIZED_SCANNER`.

The default value for this file format option is `FALSE`. In a future BCR, the default value will be `TRUE`.

When setting `USE_VECTORIZED_SCANNER = TRUE`, you can use the vectorized scanner for loading Parquet files. The vectorized scanner is well suited for the columnar format of a [Parquet](https://parquet.apache.org/docs/file-format/) file and it significantly reduces the ingestion latency. The scanner only downloads relevant sections of the Parquet file into memory, such as the subset of selected columns.

For more information, see [USE_VECTORIZED_SCANNER](../../sql-reference/sql/copy-into-table.md).

## Streamlit in Snowflake updates

### Support for v1.29.0 and v1.31.1 of the Streamlit library

With this release, we are pleased to announce support for the 1.29.0 and 1.31.1 versions of the Streamlit open source library in Streamlit in Snowflake.

For more information, see [Supported versions of the Streamlit library in warehouse runtimes](../../developer-guide/streamlit/app-development/dependency-management.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-May-24 |

---
title: May 29, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-30-dcr.md
section: Release Notes
---

# May 29, 2025: Snowflake Data Clean Rooms updates

With this release, we are pleased to announce the availability of the following new features and enhancements to Snowflake
Data Clean Rooms:

* **Free-form SQL queries now available in the API.** You can now expose your linked tables and views in a clean room to be
  [available for free-form queries](../../../user-guide/cleanrooms/v1/web-app-sql-template.md) by clean room collaborators in any Snowflake coding
  environment.

---
title: May 29, 2025: Table extraction in Document AI (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-29-document-ai.md
section: Release Notes
---

# May 29, 2025: Table extraction in Document AI (*Preview*)

With Document AI, you can now extract tables from documents using the new Snowflake Arctic-TILT
foundation model geared towards tabular and relational extraction.

To extract tables, select the document processing type before you begin defining values for
your Document AI model build.

---
title: May 30, 2025: Additional model support for Cortex AISQL Images
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-30-complete-multimodal-new-models.md
section: Release Notes
---

# May 30, 2025: Additional model support for Cortex AISQL Images

> **Note:**
>
> Cortex AI Functions Images, formerly called COMPLETE Multimodal, is currently available in preview.

Snowflake Cortex Cortex AI Functions Images, which allows you to extract information from images, generate image descriptions, and more,
now supports multimodal models from Anthropic and Meta, specifically the following:

* Claude 3.7 Sonnet
* Claude 4 Sonnet
* Claude 4 Opus
* LLaMA 4 Maverick
* LLaMA 4 Scout

For more information on this feature, see [Cortex AI Functions: Images](../../../user-guide/snowflake-cortex/ai-images.md).

---
title: May 30, 2025: Data Governance release notes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-30-tags.md
section: Release Notes
---

# May 30, 2025: Data Governance release notes

## Object tags available in Standard Edition

All accounts can now create and set object tags regardless of the account’s [edition](../../../user-guide/intro-editions.md). A tag is a schema-level object that can be assigned to another Snowflake object, and can be used to govern your data and monitor resource usage. Note that a few advanced capabilities like tag propagation still require Enterprise Edition or higher.

For more information, see [Introduction to object tagging](../../../user-guide/object-tagging/introduction.md).

---
title: May 30, 2025: Request Approval Workflow (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-30-raw.md
section: Release Notes
---

# May 30, 2025: Request Approval Workflow (*General availability*)

With this release, we are pleased to announce the general availability of the Request Approval Workflow.

The request approval workflow:

* Allows consumers to request access to Internal Marketplace organizational listings from Snowsight .
* Allows providers to view and manage access requests to organization listings in Snowsight .

For more information see [Manage the request approval workflow](../../../user-guide/collaboration/listings/organizational/request-approval-workflow.md).

---
title: May 30, 2025: Snowflake Openflow (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-05-30-openflow.md
section: Release Notes
---

# May 30, 2025: Snowflake Openflow (*General availability*)

With this release, we are pleased to announce the general availability of Snowflake Openflow.

Some connectors will remain in preview. To see the list of Openflow connectors, see [Openflow connectors](../../../user-guide/data-integration/openflow/connectors/about-openflow-connectors.md).

---
title: May 31, 2024 — Snowflake ML Classification Update –— Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-31-classification-update.md
section: Release Notes
---

# May 31, 2024 — Snowflake ML Classification Update –— *Preview*

With this release, we are pleased to announce that the Classification function, which builds models that sort data into
classes using patterns detected in your training data, has been updated with significant new features.

* Support for timestamp features. The model automatically derives features such as day-of-week and month from timestamps,
  so classification can detect time-based cycles and use them to help classify new data.
* Support for high-cardinality features and labels. These are short strings with more than about 100 values, such as
  job titles or fruits.

This change affects new classification models you create. Existing models continue to use the previous implementation.
Due to the above improvements, a new model trained on the same data as an existing model will likely produce slightly
different results than the existing model.

For more information, see
[Classification](../../../user-guide/ml-functions/classification.md).

---
title: May 31, 2024 — Structured data types — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-05-31-structured-types-ga.md
section: Release Notes
---

# May 31, 2024 — Structured data types — *General Availability*

A structured type is an ARRAY, OBJECT, or MAP that contains elements or key-value pairs with specific
[Snowflake data types](../../../sql-reference-data-types.md). Structured types are now generally available.

> **Note:**
>
> Currently, tables other than [Apache Iceberg™ tables](../../../user-guide/tables-iceberg.md) do not support structured types.
> You cannot add a column of a structured type to a regular table.

For more information, see [Structured data types](../../../sql-reference/data-types-structured.md).

---
title: May 31-June 01, 2023 — 7.18 Release (no announcements)
source: https://docs.snowflake.com/en/release-notes/2023/7_18.md
section: Release Notes
---

# May 31-June 01, 2023 — 7.18 Release (no announcements)

This release contains no significant features, updates, or enhancements to announce.

---
title: Merge-on-read with positional delete files for Snowflake-managed Apache Iceberg™ v2 tables (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2279.md
section: Release Notes
---

# Merge-on-read with positional delete files for Snowflake-managed Apache Iceberg™ v2 tables (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

Before the change:
:   For Snowflake-managed Apache Iceberg™ tables at Iceberg v2, Snowflake doesn’t write
    [positional delete files](https://iceberg.apache.org/spec/#position-delete-files) for merge-on-read when you run DELETE, UPDATE, or MERGE
    statements. Snowflake uses copy-on-write for those operations instead.

After the change:
:   When the 2026_03 behavior change bundle is enabled in your account, Snowflake-managed Apache Iceberg™ tables at Iceberg v2 use merge-on-read with
    positional delete files by default for DELETE, UPDATE, and MERGE when `ENABLE_ICEBERG_MERGE_ON_READ` is `TRUE`, which is the system
    default. Snowflake writes positional delete files alongside your data files in the table’s Iceberg storage location. This behavior matches how
    Snowflake already uses positional delete files for externally managed Iceberg v2 tables when merge-on-read is enabled.

    To turn off merge-on-read and use copy-on-write for these DML operations instead, set the `ENABLE_ICEBERG_MERGE_ON_READ` parameter to
    `FALSE` at the table, schema, or database level. For more information, see [ENABLE_ICEBERG_MERGE_ON_READ](../../../sql-reference/parameters.md).

    **Compatibility with external query engines**

    Positional delete files require support from the Iceberg format version in your external engine. If you use an external query engine that relies
    on a version of Iceberg before v2, that engine might not support positional delete files and might not be able to read the table metadata for
    your Snowflake-managed Iceberg v2 tables after Snowflake writes positional delete files.

    Before the change takes effect in your production accounts, do the following:

    1. Confirm that every external query engine you use to read Snowflake-managed Apache Iceberg™ v2 tables supports
       [positional delete files](https://iceberg.apache.org/spec/#position-delete-files) (Iceberg v2 or later).
    2. If any engine doesn’t support positional delete files, either upgrade that engine to a release based on Iceberg v2 or later, or set
       `ENABLE_ICEBERG_MERGE_ON_READ` to `FALSE` for the relevant tables (or containing schema or database) so Snowflake continues to
       use copy-on-write and doesn’t write positional delete files.

    For more information about positional delete files, merge-on-read, and related parameters in Snowflake, see
    [Use row-level deletes](../../../user-guide/tables-iceberg-manage.md).

Ref: 2279

---
title: Metering Views (Account Usage): Additional Replication and Snowpipe Streaming Credit Usage Information Included in Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1121.md
section: Release Notes
---

# Metering Views (Account Usage): Additional Replication and Snowpipe Streaming Credit Usage Information Included in Views

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following ACCOUNT_USAGE metering views provide additional information for service level credit usage for REPLICATION
and SNOWPIPE STREAMING service types.

* METERING_HISTORY View
* METERING_DAILY_HISTORY View

Previously:
:   For the REPLICATION service type, the METERING_HISTORY and METERING_DAILY_HISTORY views only showed credit usage for database replication.

    For the SNOWPIPE STREAMING service type, the METERING_HISTORY and METERING_DAILY HISTORY views did not provide detailed usage information for
    Snowpipe Streaming clients.

Currently:
:   For the REPLICATION service type, the METERING_HISTORY and METERING_DAILY_HISTORY views show all credit usage for replication including
    database replication and replication and failover groups.

    For the SNOWPIPE STREAMING service type, the METERING_HISTORY and METERING_DAILY HISTORY views show detailed usage information for both
    Snowflake table objects and for Snowpipe Streaming clients.

Ref: 1121

---
title: METERING_HISTORY view (Account Usage): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1282.md
section: Release Notes
---

# METERING_HISTORY view (Account Usage): New column

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the ACCOUNT_USAGE.METERING_HISTORY view includes the following new column:

| Column | Data type | Description |
| --- | --- | --- |
| BUDGET_ID | TEXT | Reserved for future use. |

Ref: 1282

---
title: METERING_HISTORY view (ACCOUNT_USAGE): Change in columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2073.md
section: Release Notes
---

# METERING_HISTORY view (ACCOUNT_USAGE): Change in columns

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) [METERING_HISTORY](../../../sql-reference/account-usage/metering_history.md) view no longer includes the `budget_id` column.

When this behavior change bundle is enabled, the ACCOUNT_USAGE.METERING_HISTORY view includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `entity_type` | VARCHAR | Type of Snowflake resource that consumed credits, such as WAREHOUSE, TASK, or TABLE. Note that TABLE is used for all table-like objects. |
| `database_id` | NUMBER | Internal/system-generated identifier of the database associated with the resource of type `entity_type`. Contains a NULL value when the resource isn’t associated with a specific database (for example, a warehouse or compute pool). |
| `database_name` | VARCHAR | Name of the database associated with the resource of type `entity_type`. Contains a NULL value when the resource isn’t associated with a specific database. |
| `schema_id` | NUMBER | Internal/system-generated identifier of the schema associated with the resource of type `entity_type`. Contains a NULL value when the resource isn’t associated with a specific schema. |
| `schema_name` | VARCHAR | Name of the schema associated with the resource of type `entity_type`. Contains a NULL value when the resource isn’t associated with a specific schema. |

Ref: 2073

---
title: METERING_HISTORY view (ACCOUNT_USAGE): Database-level output for COPY_FILES
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1728.md
section: Release Notes
---

# METERING_HISTORY view (ACCOUNT_USAGE): Database-level output for COPY_FILES

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

The [METERING_HISTORY view](../../../sql-reference/account-usage/metering_history.md) in the [Account Usage](../../../sql-reference/account-usage.md) schema behaves as follows:

Before the change:
:   For the `COPY_FILES` service type, these columns contain the following information:

    * `ENTITY_ID`: The ID of the stage from which files are copied.
    * `NAME`: The name of the stage from which files are copied.

    Additionally, the `BYTES` and `FILES` columns are aggregated at the stage level.

After the change:
:   For the `COPY_FILES` service type, these columns contain the following information:

    * `ENTITY_ID`: The ID of the database from which files are copied.
    * `NAME`: The name of the database from which files are copied.

    Additionally, the `BYTES` and `FILES` columns are aggregated at the database level instead of the stage level.

Ref: 1728

---
title: MFA_AUTHENTICATION_METHODS in authentication policy now only includes PASSWORD by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1971.md
section: Release Notes
---

# MFA_AUTHENTICATION_METHODS in authentication policy now only includes PASSWORD by default

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

> **Note:**
>
> Previously, this behavior change included the deprecation of the MFA_AUTHENTICATION_METHODS parameter. This deprecation has been postponed
> to a future bundle.

This change modifies the default behavior of multi-factor authentication (MFA) enforcement when no authentication policy is set, and for
newly created authentication policies. Existing authentication policies are not affected.

Before the change:
:   * When no authentication policy is set, Snowflake enforces MFA on password and single-sign on (SSO) logins.
    * CREATE AUTHENTICATION POLICY commands that do not set a value for the MFA_AUTHENTICATION_METHODS parameter create a policy with
      `MFA_AUTHENTICATION_METHODS = ('PASSWORD',  'SAML')`, requiring MFA for both password and SSO logins.

After the change:
:   * When no authentication policy is set, Snowflake enforces MFA only on password logins and not on SSO logins.
    * CREATE AUTHENTICATION POLICY commands that do not set a value for the MFA_AUTHENTICATION_METHODS parameter create a policy with
      `MFA_AUTHENTICATION_METHODS = ('PASSWORD')`, enforcing MFA only on password logins and not on SSO logins.
    * Existing authentication policies keep their current MFA_AUTHENTICATION_METHODS settings. Only new policies use the updated defaults.

To check your current authentication policy settings:

1. List all authentication policies in your account:

   ```sqlexample
   SHOW AUTHENTICATION POLICIES IN ACCOUNT;
   ```
2. View the detailed settings for a specific policy:

   ```sqlexample
   DESCRIBE AUTHENTICATION POLICY <policy_name>;
   ```
3. View the policy assigned to a user:

   ```sqlexample
   SHOW AUTHENTICATION POLICIES ON USER <user_name>;
   ```

Ref: 1971

---
title: Microsoft Azure subnet expansion (Pending for selected accounts)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-MSAzure-2021-11-29.md
section: Release Notes
---

# Microsoft Azure subnet expansion (Pending for selected accounts)

As of **November 2021**, this change has been implemented for the following Azure regions:

* US East 2
* Canada Central

The change for the remaining regions has been scheduled for January 2022; however the date is subject to change.
For the most up-to-date details about the date on which it will be enabled, as well as other release-related details, refer to
[Behavior change announcements](../../behavior-changes.md).

In the 5.23 Behavior Change Release Notes, we announced the implementation of additional subnets in
the Microsoft Azure virtual network for all Snowflake accounts. The additional subnets provide scalability improvements, but can impact the
following access:

* Access to external stages and tables that reference Azure cloud storage.
* Azure storage queue access for Snowpipe auto-ingest.

* Customer-managed key access to tables for [Tri-Secret Secure](../../../user-guide/security-encryption-tss.md) with firewall-enabled Azure Key Vault.

If one or more of your accounts was identified as being impacted by this change, we deactivated the additional subnets for those accounts
while Snowflake Support contacted your internal Azure administrator about making the required network and firewall updates to allow
uninterrupted access.

The additional subnets have been activated for accounts in the East US 2 and Canada Central regions, with the remaining regions to
follow in **January 2022**. If your internal Azure administrator has not yet made the required updates when the subnets are activated,
your accounts may be impacted as described above.

This article provides additional information and resources to facilitate completing the required tasks before the scheduled date:

* [Update the Azure Key Vault Allow List (for Tri-Secret Secure with Key Vault Firewall enabled)](https://community.snowflake.com/s/article/Microsoft-Azure-Subnet-Expansion?r=398&ui-knowledge-components-aura-actions.KnowledgeArticleVersionCreateDraftFromOnlineAction.createDraftFromOnlineArticle=1#update-key-vault)

* [Identify the Azure Subnets](https://community.snowflake.com/s/article/Microsoft-Azure-Subnet-Expansion?r=398&ui-knowledge-components-aura-actions.KnowledgeArticleVersionCreateDraftFromOnlineAction.createDraftFromOnlineArticle=1#identify-subnets)
* [Update the Azure Storage Allow List (for External Stages/Tables and Snowpipe Auto-ingest)](https://community.snowflake.com/s/article/Microsoft-Azure-Subnet-Expansion?r=398&ui-knowledge-components-aura-actions.KnowledgeArticleVersionCreateDraftFromOnlineAction.createDraftFromOnlineArticle=1#update-storage)

  + Note: All of these tasks can and should be completed before the scheduled date to prevent any disruptions.

Identify the Azure Subnets:

To obtain a list of all the Azure subnet IDs for your Snowflake account(s), log into each account that will be impacted by the change and
execute the [SYSTEM$GET_SNOWFLAKE_PLATFORM_INFO](../../../sql-reference/functions/system_get_snowflake_platform_info.md) function.

Example JSON output for an account (line breaks added for readability):

```json
{"snowflake-vnet-subnet-id":[
"/subscriptions/ae0c1e4e-d49e-4115-b3ba-888d77ea97a3/resourceGroups/azure-prod/providers/Microsoft.Network/virtualNetworks/azure-prod/subnets/xp",
"/subscriptions/ae0c1e4e-d49e-4115-b3ba-888d77ea97a3/resourceGroups/azure-prod/providers/Microsoft.Network/virtualNetworks/azure-prod/subnets/gs",
"/subscriptions/37265438-aa4f-49f6-adc4-46271ae19193/resourceGroups/deployment-infra-rg2/providers/Microsoft.Network/virtualNetworks/deployment-vnet2/subnets/xp",
"/subscriptions/37265438-aa4f-49f6-adc4-46271ae19193/resourceGroups/deployment-infra-rg2/providers/Microsoft.Network/virtualNetworks/deployment-vnet2/subnets/gs",
"/subscriptions/63c9e19b-5cf1-4dcf-ace5-bf0f416f2ff7/resourceGroups/deployment-infra-rg3/providers/Microsoft.Network/virtualNetworks/deployment-vnet3/subnets/xp",
"/subscriptions/63c9e19b-5cf1-4dcf-ace5-bf0f416f2ff7/resourceGroups/deployment-infra-rg3/providers/Microsoft.Network/virtualNetworks/deployment-vnet3/subnets/gs"
]}
```

Update the Azure Key Vault Allow List (for Tri-Secret Secure with Key Vault Firewall enabled):

If you are using Tri-Secret Secure with Key Vault firewall enabled, you must update the Azure VNet allow list for Key Vault to include the additional subnets.

To update the allow list for Key Vault, complete the instructions in the this [Community article](https://community.snowflake.com/s/article/Azure-Tri-Secret-Secure-Firewall-enabled-Azure-KeyVault.html).

Note: The required subnets for Key Vault are appended with /gs. You must add each /gs subnet individually.

Update the Azure Storage Allow List (for External Stages/Tables and Snowpipe Auto-ingest):

If you are using Azure Storage for external stages/tables or Snowpipe auto-ingest, you must update the Azure VNet allow list for Azure Storage to include the additional subnets.

To update the allow list for Azure Storage, complete the instructions in [Allow the VNet subnet IDs](../../../user-guide/data-load-azure-allow.md).
:   * Note: Azure Storage requires adding all subnets appended with /gs and /xp to the allow list. You must add each /gs and /xp subnet individually.

Ref: n/a

---
title: MODEL MONITOR METRIC functions: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1982.md
section: Release Notes
---

# MODEL MONITOR METRIC functions: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the output of the [MODEL_MONITOR_DRIFT_METRIC](../../../sql-reference/functions/model-monitor-drift-metric.md), [MODEL_MONITOR_PERFORMANCE_METRIC](../../../sql-reference/functions/model-monitor-performance-metric.md), and [MODEL_MONITOR_STAT_METRIC](../../../sql-reference/functions/model-monitor-stat-metric.md) table functions includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| SEGMENT_COLUMN | STRING | Name of the segment column used in the metric calculation. |
| SEGMENT_VALUE | STRING | Value of the segment for which the metric is calculated. |

To minimize potential impact, update your queries to explicitly select only the necessary columns instead of using a wildcard (`*`).

Ref: 1982

---
title: Multi-factor authentication enrollment enforced by default for new Snowflake accounts
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1784.md
section: Release Notes
---

# Multi-factor authentication enrollment enforced by default for new Snowflake accounts

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, newly created Snowflake accounts behave as follows:

Before the change:
:   No built-in authentication policy that enforces users to enroll in multi-factor authentication (MFA) on newly created Snowflake accounts.

After the change:
:   A new built-in authentication policy that enforces users to enroll in MFA in newly created Snowflake accounts if the user uses password
    authentication, and have their [TYPE](../../../sql-reference/sql/create-user.md) property set to `PERSON` or `NULL`.

    Trial accounts are exempt from the new built-in authentication policy. If a trial account converts to a paid account, the paid account has
    a built-in authentication policy that requires MFA enrollment.

    Reader accounts are exempt from the new built-in authentication policy.

## Recommendations for new accounts

When you create a new account, you assign an ACCOUNTADMIN for your account. This behavior change enforces multi-factor authentication (MFA)
enrollment on new Snowflake accounts. Depending on whether or not a human or a service uses the ACCOUNTADMIN role, you need to specify
whether you want to enforce MFA enrollment on the ACCOUNTADMIN to prevent lockouts or to secure your account.

Follow one of the sections below, depending on your setup:

* Enforce MFA enrollment on a human ACCOUNTADMIN
* Prevent MFA from being enforced on a non-human ACCOUNTADMIN
* Allow password authentication on a non-human ACCOUNTADMIN

### Enforce MFA enrollment on a human ACCOUNTADMIN

If a human directly uses the ACCOUNTADMIN role on your account, you can secure your account by enforcing the ACCOUNTADMIN to enroll in MFA
during account creation.

Execute the following SQL statement during account creation to specify that a human uses the ACCOUNTADMIN role, and is required to enroll in
MFA:

```sqlsyntax
CREATE ACCOUNT my_admin ADMIN_USER_TYPE = PERSON;
```

### Prevent MFA from being enforced on a non-human ACCOUNTADMIN

If a human does not use the ACCOUNTADMIN role on your account, you must prevent MFA enrollment from being enforced to allow the service that
is using the ACCOUNTADMIN role to run successfully. A service-type ACCOUNTADMIN cannot use passwords to authenticate, and must specify an
[ADMIN_RSA_PUBLIC_KEY](../../../sql-reference/sql/create-account.md) during account creation.

Execute the following SQL statement during account creation to specify that a service uses the ACCOUNTADMIN role, an RSA key to
authenticate, and is not required to enroll in MFA:

```sqlsyntax
CREATE ACCOUNT my_admin
  ADMIN_USER_TYPE = SERVICE
  ADMIN_RSA_PUBLIC_KEY = 'MIIBIj...';
```

### Allow password authentication on a non-human ACCOUNTADMIN

If a human does not use the ACCOUNTADMIN role on your account, you must prevent MFA enrollment from being enforced to allow the service that
is using the ACCOUNTADMIN role to run successfully. The recommended authentication method for a service-type ACCOUNTADMIN is
[key-pair authentication](../../../user-guide/key-pair-auth.md), but if the service using the ACCOUNTADMIN ROLE does not support key-pair
authentication, then you can specify that a legacy service uses the ACCOUNTADMIN role.

A legacy service ACCOUNTADMIN cannot log in to Snowsight, and you cannot set the `FIRST_NAME` or `LAST_NAME`
parameters.

Execute the following SQL statement during account creation to specify that a legacy service uses the ACCOUNTADMIN role, a password to
authenticate, and is not required to enroll in MFA:

```sqlexample
CREATE ACCOUNT my_admin
  ADMIN_USER_TYPE = LEGACY_SERVICE
  ADMIN_PASSWORD = 'password';
```

> **Note:**
>
> The `LEGACY_SERVICE` type is a temporary solution. Snowflake highly recommends you set up key-pair authentication.

See [user types](../../../sql-reference/sql/create-user.md) for more information about user types and their limitations.

Ref: 1784

---
title: Multi-factor authentication: MFA_AUTHENTICATION_METHODS parameter deprecation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2086.md
section: Release Notes
---

# Multi-factor authentication: MFA_AUTHENTICATION_METHODS parameter deprecation

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

This behavior change deprecates a parameter of [authentication policies](../../../user-guide/authentication-policies.md) and replaces it with a
new parameter. The change behaves as follows:

Before the change:
:   The MFA_AUTHENTICATION_METHODS parameter of an authentication policy specifies a list of authentication methods that enforce multi-factor
    authentication (MFA) during login.

    There are two possible values to the MFA_AUTHENTICATION_METHODS parameter: `PASSWORD` and `SAML`.

After the change:
:   The MFA_AUTHENTICATION_METHODS parameter is deprecated. There is no longer a parameter to specify whether MFA is required for password
    users who are enrolled in MFA; if a password user is enrolled in MFA, they must use a second factor of authentication.

    A new parameter ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION is available in an authentication policy to specify whether MFA is required for
    single-sign on (SSO) logins. The new parameter has two possible values: `ALL` and `NONE`. If `ALL` is specified, then
    MFA is enforced for SSO logins when users are enrolled in MFA.

    If your existing authentication policy had `MFA_AUTHENTICATION_METHODS = 'SAML'`, then the new ENFORCE_MFA_ON_EXTERNAL_AUTHENTICATION
    parameter is set to `ALL`.

This change helps implement a milestone in the [deprecation of single-factor password logins](../../../user-guide/security-mfa-rollout.md). It
works in conjunction with another behavior change in this bundle: [Multi-factor authentication: MFA_ENROLLMENT parameter values change](bcr-2097.md).

For detailed information about how the changes in this bundle affect password and SSO authentication for your users based on your current
authentication policy, see [Upcoming Multi-Factor Authentication (MFA) enforcement for Snowsight logins with single-factor passwords](https://community.snowflake.com/s/article/Upcoming-MFA-enforcement-for-Snowsight-logins) (Knowledge Base article).

Ref: 2086

---
title: Multi-factor authentication: MFA_ENROLLMENT parameter values change
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2097.md
section: Release Notes
---

# Multi-factor authentication: MFA_ENROLLMENT parameter values change

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

This behavior change modifies the possible values of the MFA_ENROLLMENT parameter of
[authentication policies](../../../user-guide/authentication-policies.md). This parameter controls who must enroll in multi-factor authentication
(MFA).

Before the change:
:   The MFA_ENROLLMENT parameter has two possible values: `OPTIONAL` and `REQUIRED`.

    * `OPTIONAL` — Users can, but are not required to, enroll in MFA.
    * `REQUIRED` — All users must enroll in MFA.

After the change:
:   The MFA_ENROLLMENT parameter has the following possible values:

    * `REQUIRED` — Human users who are using password or single-sign on (SSO) authentication must enroll in MFA.
    * `REQUIRED_PASSWORD_ONLY` — All human users who are using password authentication must enroll in MFA. Users using SSO
      authentication are not required to enroll.
    * `REQUIRED_SNOWFLAKE_UI_PASSWORD_ONLY` — Human users who are using password authentication to sign in to Snowsight must
      enroll in MFA. Users who are using password authentication, but are not using Snowsight, aren’t required to enroll in MFA.
      Users who are using SSO authentication aren’t required to enroll.

    If your existing authentication policy had `MFA_ENROLLMENT = OPTIONAL`, then the parameter is now set to
    `MFA_ENROLLMENT = REQUIRED_SNOWFLAKE_UI_PASSWORD_ONLY`.

This change helps implement a milestone in the [deprecation of single-factor password logins](../../../user-guide/security-mfa-rollout.md). It
works in conjunction with another behavior change in this bundle: [Multi-factor authentication: MFA_AUTHENTICATION_METHODS parameter deprecation](bcr-2086.md).

For detailed information about how the changes in this bundle affect password and SSO authentication for your users based on your current
authentication policy, see [Upcoming Multi-Factor Authentication (MFA) enforcement for Snowsight logins with single-factor passwords](https://community.snowflake.com/s/article/Upcoming-MFA-enforcement-for-Snowsight-logins) (Knowledge Base article).

Ref: 2097

---
title: Multi-factor authentication: New Duo interface
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1875.md
section: Release Notes
---

# Multi-factor authentication: New Duo interface

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When users are required to use multi-factor authentication (MFA) to sign in to Snowflake, they use Duo to enroll in and authenticate with
MFA. When this behavior change bundle is enabled, the Duo enrollment and authentication experience changes.

Before the change:
:   The traditional Duo Prompt interface appears during enrollment and authentication.

After the change:
:   The new Duo Universal Prompt interface appears during enrollment and authentication. For a description of the new interface, see
    [What are the differences between the traditional Duo Prompt and the Universal Prompt?](https://help.duo.com/s/article/7118).

In most cases, this change does not require any modifications to your environment. However, you might need to configure your environment if
any of the following is true:

* If your account name contains an underscore *and* you use private connectivity, you need to add a DNS entry that includes a dash instead
  of an underscore in the account URL. For example, if your account name is `account_dev` and your organization is `myorg`, then add an
  entry like `myorg-account-dev.privatelink.snowflakecomputing.com`.
* If your corporate firewall or proxy blocks `api-*.duosecurity.com`, `*.devicemanagement.duosecurity.com`, or `*.duosecurity.com:443`,
  modify it to allow these values. Snowflake previously recommended allowing `*.duosecurity.com:443`.

Ref: 1875

---
title: Native Apps: Different privileges required to rename APPLICATION and APPLICATION PACKAGE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1249.md
section: Release Notes
---

# Native Apps: Different privileges required to rename APPLICATION and APPLICATION PACKAGE

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the privileges required to run the RENAME clause of the
[ALTER DATABASE](../../../sql-reference/sql/alter-database.md) command have changed for the APPLICATION and
APPLICATION PACKAGE databases.

Previously:
:   The RENAME clause of the [ALTER DATABASE](../../../sql-reference/sql/alter-database.md) command requires the
    CREATE DATABASE privilege to be granted on the APPLICATION or APPLICATION PACKAGE to the role
    of the user running the command.

Currently:
:   Running the RENAME clause of the [ALTER DATABASE](../../../sql-reference/sql/alter-database.md) command requires the
    CREATE APPLICATION or CREATE APPLICATION PACKAGE privileges to be granted on the APPLICATION or
    APPLICATION PACKAGE databases.

Ref: 1249

---
title: Native Apps: GET_DDL error message updated on APPLICATION PACKAGE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1228.md
section: Release Notes
---

# Native Apps: GET_DDL error message updated on APPLICATION PACKAGE

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the error message returned when running the GET_DDL system function on
an APPLICATION PACKAGE database has changed.

Previously:
:   Running the GET_DDL system function on an APPLICATION PACKAGE returns the following error message:

    ```output
    SQL compilation error: Invalid object type: '{0}'
    ```

Currently:
:   Running the GET_DDL system function on an APPLICATION PACKAGE will return the following updated error message:

    ```output
    SQL compilation error: This operation is not supported on APPLICATION PACKAGE 'app pkg'
    ```

Ref: 1228

---
title: Native Apps: Queries that use a reference removed from an app’s manifest file fail
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1218.md
section: Release Notes
---

# Native Apps: Queries that use a reference removed from an app’s manifest file fail

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

A Snowflake Native App may request a [reference](../../../sql-reference/references.md) to authorize access on an object in a consumer account.
These references are defined by the app provider in the [manifest file](../../../developer-guide/native-apps/manifest-overview.md) of an
application version. If a new version of the app removes a previously included reference definition from the manifest file, the original
reference no longer exists in the current version.

For example, in version `V1` of app `my_app`, the app provider defines a reference REF_TO_TABLE. The app contains a stored
procedure CREATE_VIEW_FROM_TABLE that uses the table reference REF_TO_TABLE to create a view VIEW_SELECT_FROM_DEFINED_REF.

A consumer can install `my_app`,
[associate a reference](https://other-docs.snowflake.com/en/native-apps/consumer-granting-privs#associating-the-reference-to-the-application)
to a table in their account to `my_app`, call the app’s CREATE_VIEW_FROM_TABLE procedure, then select from the view
VIEW_SELECT_FROM_DEFINED_REF.

In version `V2` of app `my_app`, the app provider removes the reference definition for REF_TO_TABLE. When the consumer
upgrades their installed app `my_app` to `V2`, calling the CREATE_VIEW_FROM_TABLE procedure should fail because the procedure
uses a reference that is no longer defined in the manifest file for version `V2`.

Queries in a Snowflake Native App that use a reference that has been removed from the manifest file behave as follows:

Before the change:
:   Queries that use a reference that is no longer defined in the manifest file for the current version of the app *succeed*.

After the change:
:   Queries that use a reference that is no longer defined in the manifest file for the current version of the app *fail* with the
    following error:

    ```output
    Reference definition '<REF_DEF_NAME>' cannot be found in the current version of the application '<APP_NAME>'
    ```

Ref: 1218

---
title: Native SDK for Connectors Java - release notes
source: https://docs.snowflake.com/en/release-notes/native-sdk-for-connectors/native_sdk_for_connectors_java.md
section: Release Notes
---

# Native SDK for Connectors Java - release notes

Release notes for Native SDK for Connectors Java library.

## Version 2.2.0 (July 10th, 2024)

### General changes

* Replaced the SnowSQL tool with new Snowflake CLI tool
* Updated Java dependencies

### Behavior changes

* `com.snowflake.connectors.common.object`:

  + Changed value returned by `toString` to be the same as in `getValue` in classes:

    - `Identifier`
    - `ObjectName`
    - `Reference`
    - `SchemaName`
* `com.snowflake.connectors.application.scheduler.SchedulerCreator`:

  + Renamed class to `SchedulerManager`.
* `com.snowflake.connectors.taskreactor.commands.queue.CommandsQueueRepository`:

  + Renamed class to `CommandsQueue`.
* `com.snowflake.connectors.application.integration.SchedulerTaskReactorOnIngestionScheduled`:

  + Renamed class to `TaskReactorOnIngestionScheduledCallback`.
  + The class now uses `ResourceIngestionDefinition` and its generic parameters.
* `com.snowflake.connectors.taskreactor.config.ConfigRepository`:

  + Config values are now always treated as Strings, not Variants.

### New features

* New `PUBLIC.RESET_CONFIGURATION()` procedure that allows to reset the configuration wizard state.
  Additionally there are added callbacks that allow to perform custom operations during the procedure flow.
  See also [Reset configuration](../../developer-guide/native-apps/connector-sdk/flow/reset_configuration).
* New `PUBLIC.RECOVER_CONNECTOR_STATE(STRING)` procedure that allows to reset the connector state.
  See also [Recover connector state](../../developer-guide/native-apps/connector-sdk/reference/core_reference.md).
* New `TASK_REACTOR.REMOVE_INSTANCE(STRING)` procedure that allows to remove a Task Reactor instance.
  See also [Remove instance](../../developer-guide/native-apps/connector-sdk/reference/task_reactor_reference.md).
* `com.snowflake.connectors.application.configuration.connector.ConnectorConfigurationKey`:

  + Added new `CORTEX_WAREHOUSE` key.
  + Added new `CORTEX_USER_ROLE` key.
* `com.snowflake.connectors.util.time`:

  + Added new classes for JSON serialization of `LocalDate` and `ZoneId`.
* `com.snowflake.connectors.common.task.TaskRepository`:

  + Added support for the `AFTER` parameter during task creation, if task predecessors have been specified.
  + Added support for the `USER_TASK_TIMEOUT_MS` parameter.
* `com.snowflake.connectors.common.task.TaskProperties`:

  + Added support for task predecessors.
  + Added support for the `USER_TASK_TIMEOUT_MS` property.
* `com.snowflake.connectors.util.sql.SqlTools`:

> * Added `callProcedureRaw(Session, String, String...)` method.
> * Added `callProcedureRaw(Session, String, String, String...)` method.

* Added new `com.snowflake.connectors.taskreactor.worker.ingestion.SimpleIngestionWorker` class - a
  simple worker implementation for use with ingestion workloads.
* Added new `com.snowflake.connectors.taskreactor.worker.ingestion.SimpleIngestion` class - a simple
  ingestion representation, for use by an `IngestionWorker`.
* Added new `com.snowflake.connectors.taskreactor.worker.ingestion.SimpleIngestionWorkItem` class - a
  simple work item implementation for ingestion work.

### Bug fixes

* `com.snowflake.connectors.common.task.TaskRepository`:

  + Fixed the successful task creation condition check in `create(TaskDefinition, boolean, boolean)`.
* `com.snowflake.connectors.util.variant.VarianMapper`:

  + Fixed handling of timestamps in Variants.
* Corrected default input validators in handlers for the connector configuration processes.
* Removed `DataFrame#first` from most `SELECT` queries, which fixed issues with using some procedures
  in tasks.
* Removed granting `USAGE` on `STATE` schema to app role `ADMIN`.
* Added missing `UPDATED_AT` column to the Task Reactor config table.

## Version 2.1.0 (July 8th, 2024)

### Behavior changes

* New identifier approach.

  > **Important:**
  >
  > This new approach may change how identifiers are used in your connector, please test the new changes thoroughly!

  + The SDK now expects all identifiers to be sent as provided by the user; the SDK will asses by itself whether it’s a quoted identifier or not in order to process it correctly further.
  + Auto quoting of identifiers will be done only when using values returned by Snowflake queries.
  + To use the new approach with the UI - the connector must return a new property in the `PUBLIC.APP_PROPERTIES` view, with the key of `UI_ADD_QUOTES_TO_EXISTING_QUOTED_IDENTIFIERS` and a value of `TRUE`.
  + Changed `com.snowflake.connectors.common.object.Identifier` class:

    - Removed `fromWithAutoQuoting()` and `getName()` methods.
    - Removed the concept of an empty identifier; removed `empty()`, `isNullOrEmpty()`, `validateNullOrEmpty()`, and `isEmpty()` methods.
    - Added new `from()` method, which allows for enabling of auto quoting during identifier instance creation; the provided String will not be auto quoted if it is an unquoted, fully uppercase identifier.
    - Changed `validate()` method to `isValid()`.
    - Changed `toSqlString()` method to `getValue()`.
    - Added `getUnquotedValue()`, `getQuotedValue()`, `getVariantValue()`, and `isUnquoted()` methods.

> * Changed `com.snowflake.connectors.common.object.ObjectName` class:
>
>   + Made database and schema properties `Optional`.
>   + Changed return type of `getDatabase()` and `getSchema()` to `Optional`.
>   + Changed `validate()` method to `isValid()`.
>   + Changed `validateDoubleDot()` method to `isDoubleDot()`.
>   + Changed `getEscapedName()` method to `getValue()`.
>   + Added `getVariantValue()` and `getSchemaName()` methods.
> * Changed `com.snowflake.connectors.common.object.Reference` class:
>
>   + Removed the concept of an empty reference; removed `empty()` and `isEmpty()` methods.
>   + Changed `validate()` method to `isValid()`.
>   + Changed `referenceName()` method to `getName()`.
>   + Changed `value()` method to `getValue()`.
>   + Added new `com.snowflake.connectors.common.object.SchemaName` class for representing the schema; similar behavior to `com.snowflake.connectors.common.object.ObjectName` class.
>   + Added new `com.snowflake.connectors.common.object.InvalidSchemaNameException` class.

#### Other additions and changes

> * Changed `applyToAllInitializedTaskReactorInstances()` method in the `com.snowflake.connectors.taskreactor.TaskReactorInstanceActionExecutor` to execute an action only on initialized task reactor instances. Previous behavior: actions were executed on all registered task reactor instances.

### New features

* Resource management procedures:

  + Introduced new callbacks to `PUBLIC.CREATE_RESOURCE()` procedure that allows to perform custom operations during the procedure flow.
    See also [Create resource](../../developer-guide/native-apps/connector-sdk/flow/ingestion-management/create_resource).
  + New `PUBLIC.ENABLE_RESOURCE()` procedure that allows to enable disabled resource.
    Additionally there are added callbacks that allow to perform custom operations during the procedure flow.
    See also [Enable resource](../../developer-guide/native-apps/connector-sdk/flow/ingestion-management/enable_resource).
  + New `PUBLIC.DISABLE_RESOURCE()` procedure that allows to disable enabled resource.
    Additionally there are added callbacks that allow to perform custom operations during the procedure flow.
    See also [Disable resource](../../developer-guide/native-apps/connector-sdk/flow/ingestion-management/disable_resource).
  + New `PUBLIC.UPDATE_RESOURCE()` procedure that allows to update ingestion configurations of a particular resource.
    Additionally there are added callbacks that allow to perform custom operations during the procedure flow.
    See also [Update resource](../../developer-guide/native-apps/connector-sdk/flow/ingestion-management/update_resource).
* `com.snowflake.connectors.util.sql.SqlTools`:

  + Added `asVarchar()` method that is expected to replace `varcharArgument()` method.
  + Added `asVariant()` method that is expected to replace `variantArgument()` method.
  + Marked `varcharArgument()` and `variantArgument()` methods as deprecated and set them to be removed in the future.
* Other additions:

  + Defined Ingestion Process status as constants in the `com.snowflake.connectors.application.ingestion.process.IngestionProcessStatuses` class.
  + Added `isNotOk()` method to `com.snowflake.connectors.common.response.ConnectorResponse` class.
  + Added `com.snowflake.connectors.util.snowflake.DefaultTransactionManager` class that allows to execute sql statements within a transaction by using the `withTransaction()` method.
  + Improved logging in the task reactor.

### Bug fixes

* Fixed bug that resulted in removing task reactor instance schema, once unexpected error was raised during `CREATE_INSTANCE_OBJECTS()` procedure.

## Version 2.0.0 (May 24th, 2024)

Initial release.

---
title: Native SDK for Connectors Java Test - release notes
source: https://docs.snowflake.com/en/release-notes/native-sdk-for-connectors/native_sdk_for_connectors_java_test.md
section: Release Notes
---

# Native SDK for Connectors Java Test - release notes

Release notes for Native SDK for Connectors Java test library.

## Version 2.2.0 (December 10th, 2024)

### General changes

* Replaced the SnowSQL tool with new Snowflake CLI tool
* Updated Java dependencies

### Behavior changes

* `com.snowflake.connectors.application.scheduler.CreateSchedulerHandlerTestBuilder`:

  + Renamed `withSchedulerCreator(SchedulerCreator)` method to `withSchedulerManager(SchedulerManager)`.
* `com.snowflake.connectors.application.scheduler.InMemoryDefaultSchedulerCreator`:

  + Renamed class to `InMemoryDefaultSchedulerManager`.
* `com.snowflake.connectors.taskreactor.commands.queue.InMemoryCommandsQueueRepository`:

  + Renamed class to `InMemoryCommandsQueue`.

### New features

* New test builders for various handlers that allow to fully customize objects used by handler classes:

  + Added `com.snowflake.connectors.application.configuration.reset.ResetConfigurationHandlerTestBuilder`.
* `com.snowflake.connectors.application.lifecycle.pause.PauseConnectorHandlerTestBuilder`:

  + Added `withSchedulerManager(SchedulerManager)` method.
* `com.snowflake.connectors.application.lifecycle.resume.ResumeConnectorHandlerTestBuilder`:

  + Added `withSchedulerManager(SchedulerManager)` method.
* Added new assertion classes:

  + `com.snowflake.connectors.common.assertions.ingestion.IngestionConfigurationAssert` that allows to assert objects of `com.snowflake.connectors.application.ingestion.definition.IngestionConfiguration` class.
  + `com.snowflake.connectors.common.assertions.UUIDAssertions` that allows to assert String representations of UUIDs.
* `com.snowflake.connectors.common.assertions.task.TaskPropertiesAssert`:

  + Added `hasPredecessors(List<TaskRef>)` assertion.
* `com.snowflake.connectors.common.assertions.ingestion.IngestionRunAssert`:

  + Added `hasIdAsUUID()` assertion.
  + Added `hasIngestionConfigurationIdAsUUID()` assertion.
  + Added `hasIngestionProcessIdAsUUID()` assertion.
  + Added `hasStartedAt()` assertion.
  + Added `hasCompletedAt()` assertion.
  + Added `hasCompletedAtAfterStartedAt()` assertion.
  + Added `hasIngestedRowsGreaterThan(int)` assertion.
  + Added `hasUpdatedAt()` assertion.
  + Added `hasMetadata()` assertion.
  + Added `hasCompletedState()` assertion.
* Added new classes for use in integration testing:

  + `com.snowflake.connectors.common.SharedObjects`.
  + `com.snowflake.connectors.common.PathResolver`.
  + `com.snowflake.connectors.common.procedure.ProcedureDescriptor`.
  + `com.snowflake.connectors.common.procedure.ProcedureProperties`.

### Bug fixes

* `com.snowflake.connectors.application.ingestion.process.InMemoryIngestionProcessRepository`:

  + Provided an implementation of `endProcess(String, String, String)` method, instead of throwing `UnsupportedOperationException`.

## Version 2.1.0 (July 8th, 2024)

### Behavior changes

* Removed `com.snowflake.connectors.taskreactor.InMemoryConfiguredTaskReactorExistenceVerifier` class.
* Removed `com.snowflake.connectors.taskreactor.InMemoryNotConfiguredTaskReactorExistenceVerifier` class.
* Removed `com.snowflake.connectors.application.common.task.InMemoryTaskRepository` class.

### New features

* New test builders for various handlers that allow to fully customize objects used by handler classes:

  + Added `com.snowflake.connectors.application.ingestion.create.CreateResourceHandlerTestBuilder`.
  + Added `com.snowflake.connectors.application.ingestion.enable.EnableResourceHandlerTestBuilder`.
  + Added `com.snowflake.connectors.application.ingestion.disable.DisableResourceHandlerTestBuilder`.
  + Added `com.snowflake.connectors.application.ingestion.update.UpdateResourceHandlerTestBuilder`.
  + Added `com.snowflake.connectors.application.scheduler.CreateSchedulerHandlerTestBuilder`.
* New in-memory implementations:

  + Added `com.snowflake.connectors.application.scheduler.InMemoryDefaultSchedulerCreator`.
  + Added `com.snowflake.connectors.application.configuration.connector.InMemoryConnectorConfigurationService`.
  + Added `com.snowflake.connectors.application.status.InMemoryConnectorStatusRepository`.
  + Added `com.snowflake.connectors.application.status.InMemoryConnectorStatusRepository`.
  + Added `com.snowflake.connectors.taskreactor.InMemoryTaskManagement`.
  + Added `com.snowflake.connectors.util.snowflake.InMemoryAccessTools`.
  + Added `com.snowflake.connectors.util.snowflake.InMemoryTransactionManager`.
* Added new assertions in `com.snowflake.connectors.common.assertions.NativeSdkAssertions`:

  + Added `com.snowflake.connectors.common.assertions.task.CommandAssert` that allows to assert objects of `com.snowflake.connectors.taskreactor.commands.queue.Command` class.
  + Added `com.snowflake.connectors.common.assertions.common.object.ObjectNameAssert` that allows to assert objects of `com.snowflake.connectors.common.object.ObjectName` class.
  + Added `com.snowflake.connectors.common.assertions.common.object.SchemaNameAssert` that allows to assert objects of `com.snowflake.connectors.common.object.SchemaName` class.
  + Added `com.snowflake.connectors.common.assertions.common.object.ReferenceAssert` that allows to assert objects of `com.snowflake.connectors.common.object.Reference` class.
* `com.snowflake.connectors.common.assertions.ingestion.definition.ResourceIngestionDefinitionAssert`:

  + Added `isEnabled()` method.
  + Added `isDisabled()` method.
* `com.snowflake.connectors.common.assertions.common.response`:
  :   + Added `hasAdditionalPayload()` method.

## Version 2.0.0 (May 24th, 2024)

Initial release.

---
title: Native SDK for Connectors release notes
source: https://docs.snowflake.com/en/release-notes/native-sdk-for-connectors/native_sdk_example_java_github_connector.md
section: Release Notes
---

# Native SDK for Connectors release notes

Release notes of the Native SDK Example Java GitHub Connector.

## December 10th, 2024

### General changes

* Replaced the SnowSQL tool with new Snowflake CLI tool.
* Updated the example connector to the [newest SDK release](native_sdk_for_connectors_java.md).
* Updated Java dependencies.

### Bug fixes

* Explicitly specified Java 11 as the target build version.
* Added missing grant for the `VIEWER` and `DATA_READER` app roles on the Streamlit UI.

## July 15th, 2024

### Behavior changes

* Adopted changes related to a new Identifiers approach introduced in the [Connectors Native SDK library version 2.1.0](native_sdk_for_connectors_java.md).
* Implemented OAuth as an authentication mechanism in the Connection Configuration step of the Wizard.
  The connector no longer requires the user to create an `EXTERNAL ACCESS INTEGRATION` and `SECRET` objects with GitHub credentials.

### New features

* Added backend internal implementations of resource management procedures handlers and their callbacks:

  + Implementations for `PUBLIC.CREATE_RESOURCE()` callbacks available in [com.snowflake.connectors.example.ingestion.create](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/create).
  + Implementations for `PUBLIC.ENABLE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.enable](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/enable).
  + Implementations for `PUBLIC.DISABLE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.disable](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/disable).
  + Implementations for `PUBLIC.UPDATE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.update](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/update).
* Changes in UI `Data sync` tab related to resource management:

  + Displaying values of `resource_id`, `name` and `resource_ingestion_definition_id` for each created resource in resources list.
  + Added functionality of enabling and disabling created resources.

### Bug fixes

* Correction to `setup.sql` script which was failing during the application version upgrade/downgrade.

## May 24th, 2024

Initial release.

---
title: Native SDK for Connectors Template - release notes
source: https://docs.snowflake.com/en/release-notes/native-sdk-for-connectors/native_sdk_for_connectors_template.md
section: Release Notes
---

# Native SDK for Connectors Template - release notes

Release notes of the Native SDK for Connectors Template.

## December 10th, 2024

### General changes

* Replaced the SnowSQL tool with new Snowflake CLI tool.
* Updated the template to the [newest SDK release](native_sdk_for_connectors_java.md).
* Updated Java dependencies.

### Bug fixes

* Explicitly specified Java 11 as the target build version.
* Added missing grant for the `VIEWER` and `DATA_READER` app roles on the Streamlit UI.

## July 15th, 2024

### Behavior changes

* Adopted changes related to a new Identifiers approach introduced in the [Connectors Native SDK library version 2.1.0](native_sdk_for_connectors_java.md).

### New features

* Added backend internal implementations of resource management procedures handlers and their callbacks:

  + Implementations for `PUBLIC.CREATE_RESOURCE()` callbacks available in [com.snowflake.connectors.example.ingestion.create](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/create).
  + Implementations for `PUBLIC.ENABLE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.enable](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/enable).
  + Implementations for `PUBLIC.DISABLE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.disable](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/disable).
  + Implementations for `PUBLIC.UPDATE_RESOURCE()` procedure and its’ callbacks available in [com.snowflake.connectors.example.ingestion.update](https://github.com/snowflakedb/connectors-native-sdk/tree/main/templates/connectors-native-sdk-template/src/main/java/com/snowflake/connectors/example/ingestion/update).
* Changes in UI `Data sync` tab related to resource management:

  + Displaying values of `resource_id`, `name` and `resource_ingestion_definition_id` for each created resource in resources list.
  + Added functionality of enabling and disabling created resources.

### Bug fixes

* Correction to `setup.sql` script which was failing during the application version upgrade/downgrade.

## May 24th, 2024

Initial release.

---
title: NETWORK POLICIES and NETWORK RULES views (Account Usage): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1661.md
section: Release Notes
---

# NETWORK POLICIES and NETWORK RULES views (Account Usage): New columns

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the SNOWFLAKE.ACCOUNT_USAGE.NETWORK_POLICIES and SNOWFLAKE.ACCOUNT_USAGE.NETWORK_RULES views include the following new columns:

| View name | Column name | Data type | Description |
| --- | --- | --- | --- |
| NETWORK_POLICIES | ALLOWED_IP_LIST | VARCHAR | The list of allowed IPv4 addresses and CIDR block ranges in the corresponding network policy. |
|  | BLOCKED_IP_LIST | VARCHAR | The list of blocked IPv4 addresses and CIDR block ranges in the corresponding network policy. |
| NETWORK_RULES | MODE | VARCHAR | The mode of the network rule. For supported values, see [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md). |
|  | TYPE | VARCHAR | The type of network rule. For supported values, see [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md). |
|  | VALUE_LIST | VARCHAR | The list of values for the network rule. For supported values, see [CREATE NETWORK RULE](../../../sql-reference/sql/create-network-rule.md). |

These columns are added as the last columns, in order, for each view.

Ref: 1661

---
title: Network policies: Apply network policy to presigned URL
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1558.md
section: Release Notes
---

# Network policies: Apply network policy to presigned URL

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

[Network policies](../../../user-guide/network-policies.md) behave as follows:

Before the change:
:   Presigned URLs generated by the [GET_PRESIGNED_URL](../../../sql-reference/functions/get_presigned_url.md) function do not contain a security token.

After the change:
:   Presigned URLs generated by the [GET_PRESIGNED_URL](../../../sql-reference/functions/get_presigned_url.md) function contain a security token.

If an account administrator enabled the [ENFORCE_NETWORK_RULES_FOR_INTERNAL_STAGES](../../../sql-reference/parameters.md) parameter, causing
[active network policies](../../../user-guide/network-policies.md) that use network rules to restrict access to
[presigned URLs](../../../sql-reference/functions/get_presigned_url.md) to [internal stages](../../../user-guide/data-load-local-file-system.md),
then only the following clients can access the restricted internal stages:

* IP addresses in the `ALLOWED_IP_LIST` parameter, and not in the `BLOCKED_IP_LIST` parameter of an active network policy.
* IP addresses and VPCE IDs in the `VALUE_LIST` parameter of a network rule. The network rule must be in the
  `ALLOWED_NETWORK_RULE_LIST` parameter but not in the `BLOCKED_NETWORK_RULE_LIST` parameter of an active network policy. The
  network rule can have one of the following combinations of parameters set:

  + The `TYPE` parameter set to `IPV4`, and the `MODE` parameter set to `INGRESS`.
  + The `TYPE` parameter set to `AWSVPCEID`, and the `MODE` parameter set to `INTERNAL_STAGE`.

Ref: 1558

---
title: Network policy commands: Cannot drop active network policies
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1337.md
section: Release Notes
---

# Network policy commands: Cannot drop active network policies

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

Trying to drop a network policy that is assigned to an account, security integration, or user behaves as follows:

Before the change:
:   You can execute the DROP NETWORK POLICY and CREATE OR REPLACE NETWORK POLICY commands without considering whether the network policy is
    still assigned to an account, security integration, or user.

After the change:
:   You must detach a network policy from all accounts, security integrations, and users before you can delete it using the
    DROP NETWORK POLICY or CREATE OR REPLACE NETWORK POLICY command.

    The CREATE OR REPLACE NETWORK POLICY command drops an existing network policy before creating a new one with the same name.

Ref: 1337

---
title: Network security: Cannot activate an empty network policy
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1761.md
section: Release Notes
---

# Network security: Cannot activate an empty network policy

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

[Network policies](../../../user-guide/network-policies.md) are activated on an account, user, or integration to control incoming network traffic.

This change behaves as follows:

Before the change:
:   An administrator can activate a network policy on an account, user, or integration even if that network policy and its attached network
    rules do not contain network identifiers.

After the change:
:   A network policy or its attached network rules must contain at least one allowed or blocked network identifier when it is activated.
    Attempting to activate an empty network policy results in an error.

    This change is being made to eliminate confusion about whether an empty network policy allows or blocks traffic. Administrators must
    explicitly allow or block identifiers, including whether to allow or block the wildcard range 0.0.0.0/0.

Ref: 1761

---
title: Network security: Cannot attach egress network rule to a network policy
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1760.md
section: Release Notes
---

# Network security: Cannot attach egress network rule to a network policy

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

Network rules can be attached to a [network policy](../../../user-guide/network-policies.md) to control incoming network traffic (ingress) or
attached to an external access integration to control outgoing network traffic (egress). The purpose of the network rule is controlled by
its MODE parameter. The same network rule is never used for both ingress (`MODE=INGRESS`) and egress (`MODE=EGRESS`).

Before the change:
:   Administrators can attach a network rule with `MODE=EGRESS` to a network policy even though it has no effect.

After the change:
:   An attempt to attach a network rule with `MODE=EGRESS` to a network policy results in an error.

Ref: 1760

---
title: New columns in views and SHOW command output are no longer treated as behavior changes (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-no-bcrs-for-new-columns.md
section: Release Notes
---

# New columns in views and SHOW command output are no longer treated as behavior changes (Pending)

Beginning with the 2026_04 behavior change bundle (tentatively scheduled for May 2026), new columns in Snowflake
views (such as views in the [SNOWFLAKE.ACCOUNT_USAGE](../../../sql-reference/account-usage.md),
[SNOWFLAKE.ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md), and
[INFORMATION_SCHEMA](../../../sql-reference/info-schema.md) schemas) and in the output of SHOW commands are longer treated as
behavior changes.

When a new column is added to a Snowflake view or to the output of a SHOW command, the change will no longer be included in a
[behavior change bundle](../../intro-bcr-releases.md), and the change will no longer be
[announced with the other behavior changes](../../behavior-changes.md). These types of changes will be introduced
in [server releases and feature updates](../../new-features.md).

If the introduction of a new column causes an unexpected problem:

* In the short term, you can [temporarily exclude that column](../../behavior-changes-new-columns.md) from queries
  of the view and from the output of SHOW commands.
* As a long term solution, update your scripts to select specific columns from the query or SHOW command output.

  To select specific columns from the output of SHOW commands, you can use the
  [pipe operator](../../../sql-reference/operators-flow.md). See the example in [Select a list of columns for the output of a SHOW command](../../../sql-reference/operators-flow.md).

---
title: New CREATE HYBRID TABLE privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2157.md
section: Release Notes
---

# New CREATE HYBRID TABLE privilege

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

The CREATE HYBRID TABLE privilege is a new, separate privilege that controls which roles and
users can [create hybrid tables](../../../sql-reference/sql/create-hybrid-table.md). Previously,
creation of new hybrid tables was controlled by the existing CREATE TABLE privilege.

Customers who have created custom roles with the CREATE TABLE privilege to control both
hybrid table and standard table creation must update those roles to include the new
CREATE HYBRID TABLE privilege. Otherwise, users with those roles will no longer be able
to create new hybrid tables.

This change only applies to role-based creation of new hybrid tables. Existing hybrid tables
are not affected. Users who own the database or schema, or who have been granted ownership,
do not need this table-level privilege.

Before the change:
:   The CREATE TABLE privilege is required to create a new hybrid table.

After the change:
:   The CREATE HYBRID TABLE privilege is required to create a new hybrid table.

Ref: 2157

---
title: New error code for function-related errors
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2124.md
section: Release Notes
---

# New error code for function-related errors

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The error code of messages that are returned when there is a problem with a function call is changing as follows:

Before the change:
:   The error code of function-related errors is `002140`.

    `002140=SQL compilation error: Unknown function`

After the change:
:   The error code of function-related errors is `002139`.

    `002139=SQL compilation error: Unknown function`

    An error message with the new error code includes additional information when the error occurred because the caller didn’t have the
    correct access control privileges.

Ref: 2124

---
title: New ERROR_LOGGING column in TABLES views and commands (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2185.md
section: Release Notes
---

# New ERROR_LOGGING column in TABLES views and commands (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

Before the change:
:   The following views and commands don’t include information about whether error logging is enabled for a table:

    * [INFORMATION_SCHEMA.TABLES](../../../sql-reference/info-schema/tables.md) view
    * [ACCOUNT_USAGE.TABLES](../../../sql-reference/account-usage/tables.md) view
    * [ORGANIZATION_USAGE.TABLES](../../../sql-reference/organization-usage/tables.md) view
    * [SHOW TABLES](../../../sql-reference/sql/show-tables.md) command
    * [GET_DDL](../../../sql-reference/functions/get_ddl.md) command

After the change:
:   When the 2026_03 behavior change bundle is enabled in your account, the following changes take effect:

    * A new `ERROR_LOGGING` column (BOOLEAN) is added to the INFORMATION_SCHEMA.TABLES, ACCOUNT_USAGE.TABLES, and
      ORGANIZATION_USAGE.TABLES views. The column indicates whether error logging is enabled for a table.
    * A new `error_logging` column (STRING) is added to the output of the SHOW TABLES command.
    * The [GET_DDL](../../../sql-reference/functions/get_ddl.md) command output for tables with error logging enabled includes
      the `ERROR_LOGGING = TRUE` property, so you can use the output to recreate the table with the same setting.

    For more information about error logging, see [DML error logging](../../../user-guide/data-load-overview.md).

Ref: 2185

---
title: New function: ARRAY_FLATTEN may conflict with similarly named UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1239.md
section: Release Notes
---

# New function: ARRAY_FLATTEN may conflict with similarly named UDFs

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, Snowflake has introduced a new built-in function named ARRAY_FLATTEN.

ARRAY_FLATTEN flattens an ARRAY of ARRAYs into a single ARRAY.

* If the ARRAY is nested more than two levels deep, then only a single level of nesting is removed.
* If the input ARRAY is NULL or contains any NULL elements, then the result is NULL.

If you have a UDF named ARRAY_FLATTEN, this behavior change has the following effect:

Previously:
:   Calls to ARRAY_FLATTEN resolve to your UDF.

Currently:
:   Calls to ARRAY_FLATTEN will resolve to the new built-in ARRAY_FLATTEN function.

    The built-in ARRAY_FLATTEN function might work differently than your UDF.

If the documented semantics of the new built-in ARRAY_FLATTEN function does not match the semantics of your UDF, you can either:

* Rename your UDF (using [ALTER FUNCTION … RENAME TO …](../../../sql-reference/sql/alter-function.md)) and replace all references
  to the original UDF name with the new name.
* [Fully qualify](../../../sql-reference/name-resolution.md) all references to your UDF by specifying the names of the database and
  schema containing the UDF. For example:

  ```sqlexample
  SELECT my_database.my_schema.array_flatten(...);
  ```

Ref: 1239

---
title: New function: MAP_KEYS may conflict with similarly named UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1430.md
section: Release Notes
---

# New function: MAP_KEYS may conflict with similarly named UDFs

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

Snowflake is introducing a new function named MAP_KEYS. If you have a UDF named MAP_KEYS, calls to your function have the
following effect:

Before the change:
:   A call to your UDF named MAP_KEYS resolves to your UDF.

After the change:
:   A call to your UDF named MAP_KEYS resolves to the new built-in MAP_KEYS function, which fails with the following error:

    ```output
    Invalid argument types for function 'MAP_KEYS' ...
    ```

    The call to your UDF fails because the arguments passed to your UDF do not match the arguments expected by the built-in
    function.

To avoid having calls to your UDF resolve to the built-in function, you can either:

* Rename your UDF (using [ALTER FUNCTION … RENAME TO …](../../../sql-reference/sql/alter-function.md)), and replace all references
  to the original UDF name with the new name.
* Fully qualify all references to your UDF by specifying the names of the database and schema containing the UDF. For example:

  ```sqlexample
  SELECT my_database.my_schema.map_keys(...);
  ```

Ref: 1430

---
title: New Functions: ARRAY_SORT, ARRAY_MIN, and ARRAY_MAX May Conflict With Similarly Named UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1135.md
section: Release Notes
---

# New Functions: ARRAY_SORT, ARRAY_MIN, and ARRAY_MAX May Conflict With Similarly Named UDFs

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, Snowflake introduces new built-in functions with the following names and signatures:

* ARRAY_SORT returns the elements of the input ARRAY in sorted order. This function has the following signatures:

  + `ARRAY_SORT(input_array)`

    Sorts the elements in ascending order with NULLs placed at the end of the array.
  + `ARRAY_SORT(input_array, sort_ascending)`

    Sorts the elements in ascending order if `sort_ascending` is TRUE or in descending order if
    `sort_ascending` is FALSE.

    NULLs are sorted last if `sort_ascending` is TRUE or first if `sort_ascending` is FALSE.
  + `ARRAY_SORT(input_array, sort_ascending, nulls_first)`

    Sorts the elements in ascending order if `sort_ascending` is TRUE or in descending order if
    `sort_ascending` is FALSE.

    NULLs are sorted first if `nulls_first` is TRUE or last if `nulls_first` is FALSE.

  This function is not guaranteed to provide a stable sort when comparing values of two different numeric or timestamp types (or
  objects containing these types).
* ARRAY_MIN returns the minimum defined element in the input array
* ARRAY_MAX returns the maximum defined element in the input array

If you have UDFs named ARRAY_SORT, ARRAY_MIN, or ARRAY_MAX with the same signatures, this behavior change has the following
effect:

Previously:
:   Calls to ARRAY_SORT, ARRAY_MIN, or ARRAY_MAX resolve to your UDFs.

Currently:
:   Calls to ARRAY_SORT, ARRAY_MIN, or ARRAY_MAX will resolve to the new built-in functions.

    The built-in functions might work differently than your UDFs.

If the documented semantics of the new built-in functions do not match the semantics of your UDFs, you can either:

* Rename your UDFs (using [ALTER FUNCTION … RENAME TO …](../../../sql-reference/sql/alter-function.md)), and replace all
  references to the original UDF name with the new name.
* [Fully qualify](../../../sql-reference/name-resolution.md) all references to your UDFs by specifying the names of the database and
  schema containing the UDFs. For example:

  ```sqlexample
  SELECT my_database.my_schema.array_sort(...);
  ```

Ref: 1135

---
title: New LOG_EVENT_LEVEL parameter to control events
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2229.md
section: Release Notes
---

# New LOG_EVENT_LEVEL parameter to control events

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

We are introducing a new parameter, LOG_EVENT_LEVEL, to control the level of detail for telemetry events written to event tables.

Before the change:
:   The LOG_LEVEL parameter controls the level of detail for:

    * Logs generated using logging APIs.
    * Telemetry events (record type = EVENT).

After the change:
:   LOG_LEVEL controls logging API–generated logs.

    LOG_EVENT_LEVEL controls Telemetry events (such as Snowpipe, tasks, dynamic tables, SPCS compute pools, Iceberg, and data governance tag events).

    LOG_EVENT_LEVEL supports the same log levels as [LOG_LEVEL](../../../sql-reference/parameters.md).

    **Backward compatibility**

    To preserve existing behavior, LOG_EVENT_LEVEL is initialized to the same value as LOG_LEVEL.

    After the bundle is enabled, the two parameters operate independently.

    **New privileges**

    Two new privileges are introduced to control setting LOG_EVENT_LEVEL:

    * MODIFY LOG EVENT LEVEL
    * MODIFY SESSION LOG EVENT LEVEL

    Only users granted these privileges can modify LOG_EVENT_LEVEL.

Ref: 2229

---
title: New maximum size limits for database objects
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1942.md
section: Release Notes
---

# New maximum size limits for database objects

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

> **Note:**
>
> This behavior change was originally introduced in the 2025_02 bundle, but was moved to the 2025_03 bundle.

When this behavior change bundle is enabled, the storage of database objects changes as follows:

Before the change:
:   For columns of the following types, the maximum allowed length is **16 MB**:

    * VARCHAR
    * VARIANT
    * ARRAY
    * OBJECT

    For columns of the following types, the maximum allowed length is **8 MB**:

    * BINARY
    * GEOGRAPHY
    * GEOMETRY

    Statements that attempt to store values larger than these maximum allowed lengths fail.

After the change:
:   For columns of the following types, the maximum allowed length is **128 MB**:

    * VARCHAR
    * VARIANT
    * ARRAY
    * OBJECT

    For columns of the following types, the maximum allowed length is **64 MB**:

    * BINARY
    * GEOGRAPHY
    * GEOMETRY

    Statements that attempt to store values up to these new maximum allowed lengths succeed. Statements
    that attempt to store values larger than these maximum allowed lengths fail.

For both existing tables and for tables created after the change, the default length for columns of type VARIANT, ARRAY,
and OBJECT is 128 MB, and the default length for columns of type GEOGRAPHY and GEOMETRY is 64 MB.

However, the default length for columns of type VARCHAR and BINARY is 16 MB and 8 MB, respectively. For columns of type
VARCHAR, you can increase the length by explicitly specifying it when creating a new table or altering an existing
table. For columns of type BINARY, you can increase the length by explicitly specifying it when creating a new table.
You can’t alter the length of a BINARY column in an existing table.

If a table has a new limit and stores objects exceeding 16 MB, any downstream tables created using
[CREATE TABLE AS SELECT (CTAS)](../../../sql-reference/sql/create-table.md) from that table will fail. To prevent this failure, adjust
the CTAS statement, and explicitly set the size of the corresponding VARCHAR column to 134217728 (67108864 for BINARY).

If the 2025_03 bundle is enabled and then disabled, database objects with larger size limits in tables remain accessible.
Support for reading large objects was introduced with [BCR-1779](../2024_08/bcr-1779.md), which
is already enabled for all accounts and can’t be disabled.

For more information about the new size limits, see [Size limits for database objects](../../../user-guide/data-load-considerations-prepare.md).

## Sizes greater than 16 MB are visible in query results

Sizes greater than 16 MB for VARCHAR and 8 MB for BINARY are visible in query results. For example, the sizes are visible
in queries that call the [SYSTEM$TYPEOF](../../../sql-reference/functions/system_typeof.md) function or queries of views that provide
information about functions and procedures (for example, the INFORMATION_SCHEMA [FUNCTIONS view](../../../sql-reference/info-schema/functions.md)).

The following example concatenates two columns that are 16 MB in size:

```sqlexample
CREATE OR REPLACE TABLE test_larger_sizes(col1 VARCHAR, col2 VARCHAR) AS
  SELECT 'foo', 'bar';

SELECT SYSTEM$TYPEOF(CONCAT(col1, col2)) FROM test_larger_sizes;
```

```output
+-----------------------------------+
| SYSTEM$TYPEOF(CONCAT(COL1, COL2)) |
|-----------------------------------|
| VARCHAR(33554432)[LOB]            |
+-----------------------------------+
```

For functions and procedures, the new sizes are shown in the INFORMATION_SCHEMA FUNCTIONS view:

```sqlexample
CREATE OR REPLACE FUNCTION test_larger_sized_func(in_arg VARCHAR)
  RETURNS VARCHAR
  LANGUAGE JAVASCRIPT
  CALLED ON NULL INPUT AS
$$
  RETURN NULL;
$$
;

SELECT data_type FROM INFORMATION_SCHEMA.FUNCTIONS
  WHERE function_name = 'TEST_LARGER_SIZED_FUNC';
```

```output
+--------------------+
| DATA_TYPE          |
|--------------------|
| VARCHAR(134217728) |
+--------------------+
```

## Error message changes for sizes greater than 16 MB

The error message might change for some queries.

The following is an example of an insert for a VARCHAR column that returns an error:

```sqlexample
CREATE OR REPLACE TABLE test_larger_size_error(col VARCHAR);
INSERT INTO test_larger_size_error SELECT RANDSTR(20000000, 1);
```

The following error message is returned before the change:

```output
100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is 20000000
```

The following error message is returned after the change:

```output
100078 (22000): String
'CaFHJdoX3upWliCCdAPXXgytQuXzQpFO4laQEFdmiE1NDOywjwHoBqSNTCzTW66ynuR7EsI4ZxStCh3VMIBMYeHWgv1gUZRmHEK4kGmZcC02jGQhnnFJ0jtcIEWBIN6vKGkvSwG482IvfgVVwF3FTj7sb86t1SK9qigI6ujlSNByytIYBk0lkI1MM0zpRFeH2BNvGxtI.'
is too long and would be truncated
```

## Supported drivers

You might need to update your driver version to the one that supports larger database objects. Otherwise, an error
similar to the following might be returned:

```output
100067 (54000): The data length in result column <column_name> is not supported by this version of the client.
Actual length <actual_size> exceeds supported length of 16777216.
```

For information about the supported drivers, see [Driver versions that support large objects in the result set](../../../user-guide/data-load-considerations-prepare.md).

## Iceberg support

For externally managed Iceberg tables, the default length for VARCHAR and BINARY columns is 128 MB. This limit applies to
newly created or refreshed tables. Tables created before the new size limit was enabled and not refreshed still
have the old size limit. Refresh existing tables to increase the size limits.

For managed Iceberg tables, the default length for VARCHAR and BINARY columns is 128 MB. Tables created before
the new size limit was enabled still have the old size limit.

To apply the new size to columns of type VARCHAR in these tables, recreate the tables or alter the columns.
The following example alters a column to use the new size limit:

```sqlexample
ALTER ICEBERG TABLE my_iceberg_table
  ALTER COLUMN col1 SET DATA TYPE VARCHAR(134217728);
```

To apply the new size to columns of type BINARY in these tables, recreate the tables. You can’t alter the length
of a BINARY column in an existing table.

Ref: 1942

---
title: New privilege MANAGE SHARE TARGET replaces CREATE SHARE to add accounts to shares
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1734.md
section: Release Notes
---

# New privilege MANAGE SHARE TARGET replaces CREATE SHARE to add accounts to shares

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

Snowflake introduces a new privilege called MANAGE SHARE TARGET. The MANAGE SHARE TARGET privilege is granted on an account to a role.
A role granted this privilege can be used to add or remove the targets of any share in the account where the role has the privilege.
A *share target* refers to an account or user granted access to the shared data. Think of these targets as the “target audience” for the
share. This new privilege enhances security and control by allowing organizations to assign specific privileges based on job roles.

Before the change:

* The existing CREATE SHARE privilege is used to both create shares and manage share targets (add accounts to a share).
* If a role is granted CREATE SHARE privilege, the role can both create shares and manage share targets.

After the change:

* The existing CREATE SHARE privilege is used only to create shares, not manage share targets.
* The MANAGE SHARE TARGET privilege is used to manage share targets (add and remove accounts that can access a share).
* After this behavior change bundle is enabled, roles with CREATE SHARE will automatically receive
  MANAGE SHARE TARGET to ensure compatibility.

**Prepare for the change**

You will be impacted by this change if you previously granted
CREATE SHARE to a non-ACCOUNTADMIN role to manage share targets.
Customers should review and update any automations that rely on CREATE SHARE for managing accounts.

## USAGE

```sqlexample
GRANT MANAGE SHARE TARGET ON ACCOUNT TO ROLE <role-name>;
GRANT ROLE <role-name> TO USER <user_name>;

USE ROLE <role-name>;
ALTER SHARE <data_share_name> ADD ACCOUNTS = '<account_name_1>', '<account_name_2>';
```

Ref: 1734

---
title: New SQL functions: GREATEST_IGNORE_NULLS and LEAST_IGNORE_NULLS may conflict with similarly named UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1354.md
section: Release Notes
---

# New SQL functions: GREATEST_IGNORE_NULLS and LEAST_IGNORE_NULLS may conflict with similarly named UDFs

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

Snowflake is introducing two new built-in functions:

* GREATEST_IGNORE_NULLS: Returns the largest non-NULL value from a list of expressions. If all of the argument values are NULLs, the result is NULL.

  GREATEST_IGNORE_NULLS supports arguments of all data types, including VARIANT.
* LEAST_IGNORE_NULLS: Returns the smallest non-NULL value from a list of expressions. If all of the argument values are NULLs, the result is NULL.

  LEAST_IGNORE_NULLS supports arguments of all data types, including VARIANT.

If you have a user-defined function (UDF) named GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS, calls to your function have the following effect:

Before the change:
:   A call to your UDF named GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS resolves to your UDF.

After the change:
:   A call to your UDF named GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS resolves to the new built-in GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS function.
    The built-in GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS function might work differently than your UDF.

If the documented semantics of the new built-in GREATEST_IGNORE_NULLS or LEAST_IGNORE_NULLS function does not match the semantics of your UDF, you can either:

* Rename your UDF (using [ALTER FUNCTION … RENAME TO …](../../../sql-reference/sql/alter-function.md)) and replace all references
  to the original UDF name with the new name.
* [Fully qualify](../../../sql-reference/name-resolution.md) all references to your UDF by specifying the names of the database and
  schema containing the UDF. For example:

  ```sqlexample
  SELECT my_database.my_schema.greatest_ignore_nulls(...);

  SELECT my_database.my_schema.least_ignore_nulls(...);
  ```

Ref: 1354

---
title: Node.js Driver release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs.md
section: Release Notes
---

# Node.js Driver release notes

The Node.js Driver release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](nodejs-2026.md)
* [2025 releases](nodejs-2025.md)
* [2024 releases](nodejs-2024.md)
* [2023 releases](nodejs-2023.md)
* [2022 releases](nodejs-2022.md)

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

---
title: Node.js Driver release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs-2022.md
section: Release Notes
---

# Node.js Driver release notes for 2022

This article contains the release notes for the Node.js Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Node.js Driver updates.

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

## Version 1.6.17 (December 14, 2022)

### New Features

* Fixed an issue where supplying an incorrect password could cause an infinite loop when attempting to log into a
  connection pool.
* Added the `arrayBindingThreshold` connection parameter for array binding, which directs the Node.js Driver
  to write an array to a file and upload it to the server when the number of binds exceeds the threshold.

## Version 1.6.16 (November 18, 2022)

### New Features

* Added a `noProxy` configuration parameter to support bypassing the proxy server when needed.
* Updated the moment library to version 2.29.4.

## Version 1.6.15 (October 28, 2022)

### New Features

* Removed the requirement to provide the original SQL query in addition to the requestId when resubmitting requests.
* Updated mocha to version 10.1.0.

## Version 1.6.14 (September 21, 2022)

### New Features

* Added support for array binding.

## Version 1.6.13 (August 23, 2022)

### Updates

* Added the ability to resubmit SQL statements with a request ID.

## Version 1.6.12 (June 25, 2022)

### Updates

* Added the `readme.md` file to the npm project description.
* Set the default timeout for HTTP requests to 360 seconds.

### Bug Fixes

* Fixed an issue regarding inaccurate encryption material IDs for numbers exceeding the maximum safe integer.

## Version 1.6.11 (June 23, 2022)

### Bug Fixes

* Fixed an issue for proxy connection not working.

## Version 1.6.10 (May 25, 2022)

### Bug Fixes

* Fixed an issue where the application configuration parameter was not being recognized.
* Fixed an issue where the PUT command did not overwrite data when the OVERWRITE argument was set to TRUE.
* Fixed an issue where the OKTA authenticator threw an error when the closing slash (“/”) was missing; now
  it authenticates whether or not the slash in provided.
* Fixed an issue where the OKTA authenticator failed to authenticate accounts that included a region in the
  connection string.

## Version 1.6.8 (Mar 17, 2022)

### Bug Fixes

* Updated “npm test” to run all unit tests.
* Added a confirmation message when a connection is authenticated.
* Updated `agent-base` and `https-proxy-agent` to latest version.

## Version 1.6.7 (Feb 16, 2022)

### Bug Fixes

* Updated the required version of the `follow-redirect` package to 1.14.18.
* Updated the version of the mocha test framework to 9.2.0.

---
title: Node.js Driver release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs-2023.md
section: Release Notes
---

# Node.js Driver release notes for 2023

This article contains the release notes for the Node.js Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Node.js Driver updates.

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

## Version 1.9.2 (December 07, 2023)

### New features and updates

* Enhanced observability for generic and proxy use cases.
* Updated the following libraries:

  + glob to version 9.0.0.
  + https-proxy-agent to version 7.0.2.

### Bug fixes

* None.

## Version 1.9.1 (November 14, 2023)

### New features and updates

* Added support for Node.js version 20.
* Connections are now considered valid if they are in either renewing or connecting state.
* Added support for executing asynchronous queries.
* Added the `retryTimeout`, `sfRetryStartingSleepTime`, and `sfRetryMaxLoginRetries` connection parameters to manage the frequency of retries for unsuccessful connection requests. The default for `retryTimeout` is 300.
* Added `account` parameter validation.
* Updated the following libraries:
  :   + Updated axios version to 1.6.0
      + Updated mocha version to 10.2.0
      + Updated bignumber.js version to 9.1.2
      + Added asn1.js to `peerDependency` and updated @techteamer/ocsp version to 1.0.1

### Bug fixes

* Fixed an issue where `sqlText` was overwritten when specified by a user.
* Fixed an issue with caching all types of HTTPS agents.
* Fixed an issue related to using an axios httpclient for Okta authentication.
* Fixed an issue with external browser SSO authentication with proxy.
* Fixed response handling for Okta authentication.

## Version 1.9.0 (September 28, 2023)

### BCR (Behavior Change Release) Change

* Removed support for the Node.js library version 12 in the Node.js driver. Node.js no longer officially supports version 12 of its library. Snowflake encourages everyone using the Node.js version 12 environment to upgrade to Node.js version 18.

### New features and updates

* Added support for hybrid transactional and analytical processing:

  + Added retry context in retries for query requests.
  + Added query context caching.
* Updated the following libraries:

  + Replaced the `urlib2` library with `axios`.
  + Upgraded `aws-sdk` to version v3.
  + Upgraded `uuid` to version 8.

### Bug fixes

* The default JSON parser now returns the result from a new `Function` object.

## Version 1.8.0 (August 29, 2023)

### New features and updates

* Added support for Node.js version 18.
* Added a new `rowMode` configuration option to specify how to return results sets that contain duplicate column names,
  including as an:

  + `array`
  + `object`
  + `object_with_renamed_duplicate_columns`

  For more information, see [Returning Result Sets that Contain Duplicate Column Names](../../developer-guide/node-js/nodejs-driver-consume.md).
* Upgraded a minor `urllib` version and deleted the vm2 exclusion.

### Bug fixes

* Fixed an issue where the `moment.js` library incorrectly populated the millisecond position for times in the log messages.
* Fixed an issue with getting files from stages in Windows and Azure environments.
* Fixed an issue where external browser authentication incorrectly required a username and password.

## Version 1.7.0 (July 28, 2023)

### New features and updates

* Added the `connection.isValidAsync()` function to determine whether a connection is up and usable.

### Bug fixes

* Fixed an issue where some stage files were not downloaded correctly during a multi-file download.
* Modified the `fetchAsString` error message to include “Buffer” as an accepted type.
* Fixed a performance issue with stage bindings.
* Fixed issue that where `connection.execute()` did not return a Statement in bind mode.
* Fixed the `connection.heartbeatAsync()` to use the same endpoint as `connection.heartbeat()`
  function is using instead of querying with SELECT 1.

## Version 1.6.23 (June 14, 2023)

### New features and updates

* Added support for initializing the JSON parser and XmlParser with a custom configuration.

### Bug fixes

* Excluded a vulnerable vm2 transitive dependency.
* Added the `browserActionTimeout` connection parameter to fix an issue with authentication in an external browser.
* Fixed an issue with private keys that contained new lines at the end of the key.
* Fixed an issue related to importing a `uuid` library.
* Removed an unused qs dependency.
* Fixed a retry issue in a `LargeResultSet`.
* Replaced the better-eval package with vm.
* Removed requirement for a username for OAuth connections.

## Version 1.6.22 (May 24, 2023)

### New features and updates

* None.

### Bug fixes

* Added the missing bn and `https-proxy-agent` dependencies.
* Fixed an issue where `econnreset` and `etimedout` error codes would not retry the connection.
* Fixed the error message that was returned when calling `connection.execute()` with a requestId failed.
* Fixed the error message that was returned for calling `connect()` failed when using OKTA or an external browser authenticator.
* Fixed the `maskedtxt` variable undefined error.
* Fixed an issue that occurred for multiple connections when using a OAuth authenticator.
* Fixed an issue where calling `connection.execute()` with extra whitespace in `sqltext` caused errors.
* Fixed an issue where retrying a connection failed due to using the wrong value in the sleep timer.

## Version 1.6.21 (April 18, 2023)

### New Features and Updates

* Added support for GCS access token for PUT/GET.
* Added support for Okta Identity Engine (OIE) logins.
* Improved security when parsing JSON strings with the `eval` function.

### Bug Fixes

* Fixed a parsing issue with XML data loaded from VARIANT columns.
* Fixed an issue where the OCSP cache was not refreshed when it expired.
* Fixed an issue where using a full table path on array binding could crash the application.
* To resolve a deprecation warning issue related to the `Buffer()` deprecation, please reinstall snowflake-sdk.
  Reinstalling updates the `formstream` library to the latest version, such as `formstream 1.2.0`, and resolves the issue.

## Version 1.6.20 (March 23, 2023)

### New Features and Updates

* None.

### Bug Fixes

* The Node.js driver now supports retrying on an HTTP 429 error code.
* Fixed an issue where the Node.js driver would not sent OCSP requests through proxies.
* Fixed an issue where errors occurred when the amount of data submitted using array binding exceeded the array
  binding threshold. The driver now produces output for ingest instead of failing the SQL statement.
* Fixed an issue that incorrectly generated “Bind variable ? not set” error messages after upgrading from
  version 1.6.13 to a higher version.

## Version 1.6.19 (February 27, 2023)

### New Features and Updates

* None.

### Bug Fixes

* Fixed an issue where an insert query failed intermittently when trying to insert large amounts of data with
  array binding.

## Version 1.6.18 (January 31, 2023)

### New Features and Updates

* Added the ability to execute a batch of SQL statements (multi-statement support).
* Updated the `jsonwebtoken` library to version 9.0.0.

### Bug Fixes

* Improved performance by sending heartbeat messages instead of select calls to verify endpoint connections.
* Added error details to the log messages for OCSP open failures and changed the log level from info to warning.
* Added a check to verify that the OCSP cache is initialized before setting the cache entry.

---
title: Node.js Driver release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs-2024.md
section: Release Notes
---

# Node.js Driver release notes for 2024

This article contains the release notes for the Node.js Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Node.js Driver updates.

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

## Version 2.0.1 (December 13, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed an issue related to missing proxy ports during configuration processing.

## Version 2.0.0 (December 11, 2024)

### BCR (Behavior Change Release) changes¶

Beginning with version 2.0.0, the Node.js driver introduced the following breaking changes:

* Removed support for the Node.js library 14, 16, and 17 versions in the Node.js driver. Node.js no longer officially supports versions lower than 18 of its library. Snowflake encourages everyone using the Node.js versions lower than 18 environments to upgrade to Node.js version 22 (LTS).
* Changed the name of the `insecureConnect` configuration flag that allows skipping OCSP verification to `disableOCSPChecks`.
* The Node.js driver considers all types and methods described in the typings file to be part of the driver’s public API; other components are treated as internal.

### New features and updates

* Extended logging at the transport layer.
* Improved URL data sanitation.
* Added support for GCS region-specific endpoints.
* Implemented GCM encryption algorithms.
* Bumped axios to version 1.7.7.
* Replaced aws-sdk by smithy in version 3.2.5.

### Bug fixes

* Fixed nonempty logs when the log level is set to `OFF`.

## Version 1.15.0 (November 07, 2024)

### New features and updates

* Added support for Node.js version 22.
* Added checks for the `PROXY*` (such as `proxyHost`) and the `noProxy` environment variables when creating an httpAgent.
* Added support for the `describeOnly` configuration parameter.
* Improved logging at the connection layer.

### Bug fixes

* Fixed an issue where the driver did not handle the `rejected` state of the `Promise` object in the `heartbeat` method.

## Version 1.14.0 (October 02, 2024)

### New features and updates

* Added support for structured types.
* Extended logs for the configuration layer.

### Bug fixes

* Fixed a callback parameter heartbeat issue.
* Fixed SSO token authentication.
* Extended log levels and added new methods in the driver types definition.

## Version 1.13.1 (September 04, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed a compilation error with the types file.

## Version 1.13.0 (September 03, 2024)

### New features and updates

* Added support for the `passcode` and `passcodeInPassword` parameters in the MFA authentication process.

### Bug fixes

* Deleted query IDs exposed to users on failed requests.
* Added `axios` error and response sanitization.
* Fixed error handling issues in the `getResultsFromQueryId` method.
* Fixed an issue related to re-authentication for JWT and SAML authentication.
* Fixed an issue with returned types for `async` methods in the driver types definition.

## Version 1.12.0 (August 05, 2024)

### New features and updates

* Added SSO and MFA token caching to the node.js driver .
* Picked a top-level domain for Snowflake hosts.
* Added support for reading the connection information from a file.
* Added the `cwd` (current working directory) parameter to use for GET/PUT execution when it differs from the connector directory.
* Added support for AES 256 encryption/decryption.

### Bug fixes

* Fixed a bug related o reusing the jwt token for login retries.
* Fixed azure-storage-blob version compatibility with node version 14.
* Fixed an issue that caused enum type errors when the `isolatedModule` option is set.
* Fixed an issue the type definitions, by adding the missing `cancel` method and set the `complete` field in `StatementOption` as optional in driver types.
* Fixed an issue with regex expressions in account name validation.

## Version 1.11.0 (May 28, 2024)

### New features and updates

* Added the `disableSamlURLCheck` parameter to disable SAML URL checks.
* Added the `representNullAsStringNull` configuration parameter to specify how the `fetchAsString` method returns null values. When disabled, `fetchAsString` returns null values as `NULL` instead of as the string, “NULL”.
* Released Snowflake’s official `d.ts` type declaration file to support TypeScript users.
* Removed the following unused dependencies:

  + agent-base
  + debug
  + extend

### Bug fixes

* Fixed an issue with millisecond precision.
* Fixed an issue with creating paths on Windows when using the PUT command.

## Version 1.10.1 (April 08, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed unhandled promise rejections on keypair authorization.
* Fixed an issue with reading a `timestamp` type with high precision.
* Fixed external browser authentication.
* Fixed an issue with native Okta URL validation.
* Fixed the data format in bulk upload `.csv` files.
* Fix validation for short account names.
* Bumped axios to version 1.6.8.

## Version 1.10.0 (February 27, 2024)

### New features and updates

* Added support for setting the log level in a logging configuration file.
* Added the `forceGCPUseDownscopedCredential` flag to force sending a custom HTTP request instead of the one from gcp library. Default: `false`.
* Added proxy support for files operations on AWS S3.
* Updated google-cloud version to 7.7.0.

### Bug fixes

* Fixed an issue where an error was thrown when getting a query status.
* Fixed an issue where OKTA authentication failed when receiving an HTTP 429 error.

## Version 1.9.3 (January 17, 2024)

### New features and updates

* Added the `host` configuration parameter.
* Added support for multiple SAML integrations.
* Added logging for mapping resultset columns.
* Updated the following libraries:

  + axios to version 1.6.5.
  + Removed the `tmp` module.

### Bug fixes

* Fixed an issue with the SESSION_TOKEN_EXPIRED error when destroying connections.

---
title: Node.js Driver release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs-2025.md
section: Release Notes
---

# Node.js Driver release notes for 2025

This article contains the release notes for the Node.js Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Node.js Driver updates.

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

## Version 2.3.3 (December 11, 2025)

### New features and updates

* None.

### Bug fixes

* Replaced the `glob` dependency used in PUT queries with a custom wildcard matching implementation to address security issues.
* Fixed misleading debug messages during login requests.
* Fixed a bug in the build script that failed to include minicore binaries in the `dist` folder.

## Version 2.3.2 (December 08, 2025)

### New features and updates

* Added support for Red Hat Enterprise Linux (RHEL) 9.
* Added support for Node.js version 24.
* Included a shared library to collect telemetry to identify and prepare testing platforms for native node addons.

### Bug fixes

* Fixed the TypeScript definition for `getResultsFromQueryId` where `queryId` should be required and `sqlText` should be optional.
* Bumped the dependency `glob` to address [CVE-2025-64756](https://nvd.nist.gov/vuln/detail/CVE-2025-64756).
* Fixed a regression introduced in version 2.3.1 where `SnowflakeHttpsProxyAgent` was instantiated without the `new` keyword, breaking the driver when both OCSP was enabled and the `HTTP_PROXY` environment variable was used to set the proxy. This bug did not affect `HTTPS_PROXY`.

## Version 2.3.1 (October 09, 2025)

### New features and updates

* Added the `workloadIdentityAzureClientId` configuration option, allowing you to customize the Azure Client for `WORKLOAD_IDENTITY` authentication.
* Added the `workloadIdentityImpersonationPath` configuration option for `authenticator=WORKLOAD_IDENTITY`, allowing workloads to use service account impersonation.

### Bug fixes

* Fixed a regression causing PUT operations to encrypt files with the wrong `smkId`.

## Version 2.3.0 (September 30, 2025)

> **Warning:**
>
> This release contained a serious regression, and was unpublished. Update to version 2.3.1 or later.

### New features and updates

* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [Certificate revocation list (CRL) options](../../developer-guide/node-js/nodejs-driver-options.md). We recommend you test this feature in advisory mode before enabling it in production.

### Bug fixes

* Improved debug logs when dowloading query result chunks.
* Fixed missing error handling in `getResultsFromQueryId()`.
* Fixed invalid transformation of `null` values to `""` when using stage binds.
* Extended typing of `Bind`.

## Version 2.2.0 (August 13, 2025)

### New features and updates

* Added support for Workload Identity Federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `workloadIdentityProvider` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the `authenticator` connection parameter.
* Added the `queryTag` connection parameter to set the `QUERY_TAG` session parameter.

### Bug fixes

* Fixed a network error when connecting with an expired OAuth access token.
* Fixed the OAuth Authorization Code’s default value for redirect URI by removing a trailing / (slash) to be compliant with RFC 6749 Section 3.1.2.
* Improved errors for GET commands.

## Version 2.1.3 (July 21, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with using the Google Cloud Platform (GCP) XML API when `useVirtualUrl=true`.
* Fixed a permission check for `.toml` configuration files.
* Fixed unhandled resources after creating a connection to prevent the process from terminating when using external browser authentication.
* Fixed an issue with `oauthEnableSingleUseRefreshTokens` in the authorization code flow.

## Version 2.1.2 (July 10, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a TypeScript error that was introduced in version 2.1.1.

## Version 2.1.1 (July 03, 2025)

### Private Preview (PrPr) features

Added support for Workload Identity Federation in the AWS, Azure, GCP, and Kubernetes platforms.

Disclaimer:

* This feature can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use this feature only with non-production data.
* This PrPr feature is not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Removed token caching for Client Credentials authentication.

### Bug fixes

* Corrected an issue where `Util.getProxyFromEnv` incorrectly assumed HTTPS, causing `HTTP_PROXY` values to be ignored for HTTP traffic (port 80).
* Improved `extractQueryStatus` to handle cases where `getQueryResponse` returns a null response, preventing occasional breaks.
* Added `ErrorCode` to the core instance.

### Additional notes

* This release introduces TypeScript for development. The npm package contains compiled JavaScript code that contains no anticipated breaking changes for driver users.

## Version 2.1.0 (May 11, 2025)

### New features and updates

* Added support for OAuth 2.0 Authorization Code Flow and OAuth 2.0 Client Credentials Flow.

  + For OAuth 2.0 Authorization Code Flow:

    - Added the `oauthClientId`, `oauthClientSecret`, `oauthAuthorizationUrl`, `oauthTokenRequestUrl`, and `oauthScope` parameters.
    - Added the `OAUTH_AUTHORIZATION_CODE` parameter for the parameter authenticator.
  + For OAuth 2.0 Client Credentials Flow:

    - Added the `oauthClientId`, `oauthClientSecret`, `oauthTokenRequestUrl`, and `oauthScope` parameters.
    - Added the `OAUTH_CLIENT_CREDENTIALS` parameter for the parameter authenticator.
* Added support for virtual-style domains.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* None

## Version 2.0.4 (April 28, 2025)

### Private Preview (PrPr) features

* Implemented support for Programmatic Access Tokens authentication.

Disclaimer:

* These features can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use these features only with non-production data.
* These PrPr features are not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Upgraded axios to version 1.8.2+.

### Bug fixes

* Fixed a Time-of-check Time-of-use (TOCTOU) race condition when checking access to the Easy Logging configuration file. For more information, see [CVE-2025-46328](https://github.com/snowflakedb/snowflake-connector-nodejs/security/advisories/GHSA-wmjq-jrm2-9wfr).
* Fixed OCSP response cache entries not being refreshed properly.

## Version 2.0.3 (March 13, 2025)

### New features and updates

* None

### Bug fixes

* Fixed an issue with promise rejection for file upload errors.

## Version 2.0.2 (January 29, 2025)

### New features and updates

* Added support for regional Google Cloud Storage endpoints.
* Added support for endpoints without protocols for GCS.
* Updated the following dependencies:

  + azure/storage-blob to version 12.26.x,
  + aws-sdk/client-s3 to version 3.726.0,
  + smithy/node-http-handler to version 4.0.1

### Bug fixes

* Fixed the verification of the token caching file permissions and its owner when authentication is set to `EXTERNALBROWSER` or `USERNAME_PASSWORD_MFA`. For more information, see [CVE-2025-24791](https://github.com/snowflakedb/snowflake-connector-nodejs/security/advisories/GHSA-xfhv-wqj6-rx99).
* Fixed the `FileAndStageBindStatement` type in the typings file.
* Fixed an issue with aborting requests and inconsistent request methods in `HttpClient`.
* Fixed an issue with the proxy configuration settings used for sending requests to a GCS bucket.

---
title: Node.js Driver release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/nodejs-2026.md
section: Release Notes
---

# Node.js Driver release notes for 2026

This article contains the release notes for the Node.js Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Node.js Driver updates.

See [Node.js Driver](../../developer-guide/node-js/nodejs-driver.md) for documentation.

## Version 2.4.0 (Apr 07, 2026)

### New features and updates

* Added the `browserRedirectPort` connection option to customize the port of the local server that receives the EXTERNALBROWSER authentication callback.
* Bumped `@aws-sdk/*` dependencies to address a `fast-xml-parser` vulnerability.
* Improved keep-alive HTTP agents with a 30-second idle socket timeout that proactively discards stale connections before the server closes them, preventing socket hang up and ECONNRESET errors.

### Bug fixes

* Fixed connection pools re-prompting browser authentication for every pooled connection when using EXTERNALBROWSER or OAUTH_AUTHORIZATION_CODE authenticators. The first connection now completes auth and caches tokens before subsequent pool connections start.
* Fixed session token renewal failing due to a malformed request, which caused long-running connections to disconnect instead of refreshing their expired session token.
* Fixed query context cache not being updated on failed queries, which could cause a stale cache when subsequent queries land on a different GS node.

## Version 2.3.6 (Mar 25, 2026)

### New features and updates

* Added support for every authenticator type (including external browser and Okta) in `connect()`, matching `connectAsync()`.
* Removed the `@google-cloud/storage` dependency. GCS transfers now use the JSON API directly. The `forceGCPUseDownscopedCredential` connection option has been removed as it is no longer needed.
* Updated the default `jsonColumnVariantParser` to fall back to eval-based parsing for non-JSON-compliant variant values (such as `undefined`, `NaN`, and `Infinity`), restoring pre-2.3.5 behavior while keeping `JSON.parse` as the primary parser.

### Bug fixes

* Fixed the `OAUTH_AUTHORIZATION_CODE` authenticator not honoring the `openExternalBrowserCallback` connection option.
* Fixed `createConnection()` and `createPool()` types to accept no arguments, matching the runtime behavior of loading configuration from `connections.toml`.
* Fixed the `account` field in the `ConnectionOptions` type to be optional, since it can be derived from `accessUrl` or `host`.
* Fixed external browser SSO authentication crashing when the SSO URL request returns a server-side error.

## Version 2.3.5 (Mar 17, 2026)

### New features and updates

* Added the ability to skip token file permission checks by using the `SF_SKIP_TOKEN_FILE_PERMISSIONS_VERIFICATION` environment variable.
* Added Node 18+ to engines, which is the minimum officially supported version since the 2.x release.
* Added the `PLATFORM` field to `login-request` telemetry.
* Added request retries to previously uncovered query execution paths.
* Added the `rowStreamHighWaterMark` connection option to control how many rows are buffered when streaming query results through `statement.streamRows()`.
* Added a warning when converting query results to JavaScript numbers with precision loss.
* Added snake_case key support when loading `connections.toml` through `createConnection()` with no arguments.
* Exported the `normalizeConnectionOptions()` utility to convert snake_case connection keys to camelCase, with key aliases and acronym overrides.
* Added the `LIBC_FAMILY` and `LIBC_VERSION` fields to `login-request` telemetry.
* Added the `crlDownloadMaxSize` configuration option to enforce a maximum response size limit when downloading CRL files.
* Added RSASSA-PSS signature verification support for CRL validation.
* Improved error details when OAuth fails.
* Changed the default `jsonColumnVariantParser` to `JSON.parse`.
* Updated Linux GNU minicore binaries to target glibc 2.18 for broader compatibility with older Linux distributions.

### Bug fixes

* Fixed OAuth crashing when using bundlers.
* Fixed `Binds` typing to allow readonly arrays.
* Fixed the `connectAsync()` method resolving before the connection is completed.
* Fixed incorrect handling of a callback argument that should be optional in `connect()` and `connectAsync()`.
* Fixed a bug where an invalid JWT was generated if a user accidentally set both the account and the host in the configuration.
* Fixed a bug where parsing the JSON media type failed when it included an optional parameter from Microsoft Identity Platform v2.0 tokens, causing the OAuth Client Credentials flow to fail.
* Fixed `disableSamlUrlCheck` typing to use the correct casing: `disableSamlURLCheck`.
* Fixed `getDefaultCacheDir()` crashing in environments where no user home directory is configured by falling back to `os.tmpdir()`.
* Fixed `SF_OCSP_RESPONSE_CACHE_DIR` not being used directly as the OCSP cache directory.
* Fixed bugs in `noProxy` and `NO_PROXY` handling:

  + The `.domain.com` wildcard format was not correctly matching the destination host.
  + `.` was incorrectly matching as any character instead of a literal dot.
  + Partial strings were incorrectly matching instead of requiring a full destination match.
* Fixed CRL ADVISORY mode to log failures at the warn level instead of debug.
* Fixed OAuth Authorization Code reauthentication not using the refreshed access token when the cached access token is expired.
* Fixed OAuth Authorization Code refresh token being removed from cache when the IDP does not return a new one.
* Fixed an unhandled promise rejection when the server returns malformed query responses.

## Version 2.3.4 (Feb 09, 2026)

### New features and updates

* Reduced memory usage during PUT operations.
* Added `APPLICATION_PATH` to `login-request` telemetry.
* Added Linux distribution details parsed from `/etc/os-release` to `login-request` telemetry.
* Bumped axios to version 1.13.4 to address a bug in axios interceptors.
* Bumped other dependencies to their latest minor versions.

### Bug fixes

* Fixed inconsistent retry behavior across HTTP requests and ensured all recoverable failures are properly retried.
* Fixed invalid oauth scope when `role` and `oauthScope` are missing from the connection configuration.
* Fixed `APPLICATION` field not being passed from the connection configuration to `login-request` telemetry.
* Fixed build errors in bundlers caused by the `minicore` module.

---
title: NOTIFICATION_HISTORY function: New source type BUDGET in MESSAGE_SOURCE column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1687.md
section: Release Notes
---

# NOTIFICATION_HISTORY function: New source type BUDGET in MESSAGE_SOURCE column

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the output of the
[NOTIFICATION_HISTORY](../../../sql-reference/functions/notification_history.md) function behaves as follows:

Before the change:
:   If you are using the Budgets feature, the output of the NOTIFICATION_HISTORY function is as follows for the message source and
    information columns:

    * MESSAGE_SOURCE: STORED_PROCEDURE
    * MESSAGE_SOURCE_INFO: NULL

After the change:
:   If you are using the Budgets feature, the output of the NOTIFICATION_HISTORY function is as follows for the message source and
    information columns:

    * MESSAGE_SOURCE: BUDGET
    * MESSAGE_SOURCE_INFO: Budget ID and name

Ref: 1687

---
title: NOTIFICATION_HISTORY table function: Changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1593.md
section: Release Notes
---

# NOTIFICATION_HISTORY table function: Changes to output

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The [NOTIFICATION_HISTORY](../../../sql-reference/functions/notification_history.md) function is changing to provide information about:

* Notifications that have not yet been processed.
* Attempts at sending notifications that have initially failed and that are being retried.

The next sections explain how these changes affect the output of the function:

* Changes to the number of rows returned
* New columns in output
* Deprecation of the MESSAGE column

## Changes to the number of rows returned

The number of rows returned by the function is changing:

Before the change:
:   This function returns a row for each notification that has been processed (notifications that were either sent out or have
    failed).

    If multiple attempts were made to send a notification, the function returns a row for the last attempt made.

After the change:
:   This function returns a row for each attempt at sending a notification. The value in the STATUS column indicates the status of
    the attempt:

    * If the attempt failed but can be retried, the value is `RETRIABLE_FAILURE`.
    * If the attempt failed and cannot be retried, the value is `FAILURE`.
    * If the attempt succeeded, the value is `SUCCESS`.

    In addition, the function returns a row for each notification that has not yet been processed (notifications that are queued).

## New columns in output

When this behavior change bundle is enabled, the output of the [NOTIFICATION_HISTORY](../../../sql-reference/functions/notification_history.md) function
includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ID | VARCHAR | Unique ID of a request to send a notification.  If Snowflake fails to send a notification and attempts to send the notification again, the function returns a row for each attempt. Each row for an attempt has the same value in the ID column but a different value in the ATTEMPT column. |
| ATTEMPT | INTEGER | Number of the attempt made to send the notification. |
| MESSAGE_SOURCE_INFO | OBJECT | Object containing information about the source of the notification. The fields in this object depend on the type of the source:   * For [error notifications for tasks](../../../user-guide/tasks-errors.md), the object contains the following fields:    + `name`: The name of the task.   + `graph_run_group_id`: Identifier for the graph run.   + `attempt_number`: Integer representing the number of the attempt to run this task. * For [error notifications for Snowpipe](../../../user-guide/data-load-snowpipe-errors.md), the object contains the   `pipe_name` field, which specifies the name of the pipe. * For notifications sent by calling the [SYSTEM$SEND_EMAIL](../../../sql-reference/stored-procedures/system_send_email.md) stored procedure, the   object contains the `query_id` field, which specifies the ID of the statement that called the stored procedure. |

## Deprecation of the MESSAGE column

The MESSAGE column is deprecated and will be removed in the future.

Ref: 1593

---
title: NOTIFICATION_HISTORY table function: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1470.md
section: Release Notes
---

# NOTIFICATION_HISTORY table function: New column in output

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The output of the [NOTIFICATION_HISTORY](../../../sql-reference/functions/notification_history.md) table function changes as follows:

Before the change:
:   The output of the table function does not include an ERROR_MESSAGE column.

After the change:
:   The output of the table function includes an ERROR_MESSAGE column that contains details about why a notification failed.

    | Column name | Data type | Description |
    | --- | --- | --- |
    | ERROR_MESSAGE | VARCHAR | Details about why the notification failed. |

Ref: 1470

---
title: NOTIFICATION_HISTORY table function: Removal of the message column from the output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1742.md
section: Release Notes
---

# NOTIFICATION_HISTORY table function: Removal of the `message` column from the output

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

In the output of the [NOTIFICATION_HISTORY](../../../sql-reference/functions/notification_history.md) table function, the `message` column is being removed.
It was previously announced that [this column was deprecated](../2024_04/bcr-1593.md).

If you need context around a notification (for example, which task sent the notification), you can use the information in the
`message_source_info` column of the output.

If you have queries that rely on data from the message column, you should rewrite these queries.

Ref: 1742

---
title: Nov 03, 2025: Semantic views support for account replication
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-03-semantic-views-replication.md
section: Release Notes
---

# Nov 03, 2025: Semantic views support for account replication

Semantic views are now supported for account replication.

You can now include semantic views in replication groups and failover groups to replicate them across accounts.
If a semantic view references any other objects (for example, tables, views, and Cortex Search Services),
you must also replicate those objects to ensure the semantic view functions correctly in the target account.

For more information, see [Introduction to replication and failover across multiple accounts](../../../user-guide/account-replication-intro.md).

---
title: Nov 04, 2025: Cortex Agents (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-cortex-agents.md
section: Release Notes
---

# Nov 04, 2025: Cortex Agents (*General availability*)

With this release, we are pleased to announce the general availability of Cortex Agents, which was previously available as a preview feature. Cortex Agents orchestrate across both structured and unstructured data sources to deliver insights.

Cortex Agents plan tasks, use tools to execute these tasks, and generate responses. Agents use Cortex Analyst (structured) and Cortex Search (unstructured) as tools, along with LLMs, to analyze data.

The workflow involves four key components:

* **Planning:** Agents parse requests to orchestrate a plan and arrive at solutions. They can explore options, split tasks into subtasks, and route across tools to ensure governed access and compliance with enterprise policies.
* **Tool use:** Agents retrieve data efficiently using Cortex Search for unstructured sources and Cortex Analyst for structured data.
* **Reflection:** After each tool use, agents evaluate results to determine next steps - asking for clarification, iterating, or generating a final response.
* **Monitor and iterate:** Track metrics, analyze performance, and refine behavior for continuous improvements.

For more information, see [Cortex Agents](../../../user-guide/snowflake-cortex/cortex-agents.md).

---
title: Nov 04, 2025: Cortex AI Functions (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-cortex-aisql-operators-ga.md
section: Release Notes
---

# Nov 04, 2025: Cortex AI Functions (*General availability*)

Snowflake announces the general availability of Cortex AI Functions, delivering production-ready AI capabilities within the
Snowflake SQL engine.

Cortex Functions give you the ability to unify structured and unstructured analytics within a single platform and
accelerate intelligent application development. With AI Functions, you can build scalable, multimodal AI pipelines
that run entirely inside Snowflake, enabling text, image, audio, and video intelligence without external services or
data movement.

Four Cortex AI Functions, previously available in preview, become generally available today:

* [AI_CLASSIFY](../../../sql-reference/functions/ai_classify.md): Classifies a text or image input into a single or multiple
  user-defined categories based on plain-language category definitions.
* [AI_TRANSCRIBE](../../../sql-reference/functions/ai_transcribe.md): Transcribes audio and video files stored in a stage,
  extracting text, timestamps, and speaker information. See the separate [AI_TRANSCRIBE announcement](2025-11-04-cortex-ai-transcribe-ga.md) to see what’s new.
* [AI_EMBED](../../../sql-reference/functions/ai_embed.md): Generates an embedding vector for a text or image input, which
  can be used for similarity search, clustering, and classification tasks.
* [AI_SIMILARITY](../../../sql-reference/functions/ai_similarity.md): Calculates the embedding similarity between two inputs
  without needing to explicitly create the embedding vectors.

These functions join three Cortex AI Functions that were already generally available:

* [AI_TRANSLATE](../../../sql-reference/functions/ai_translate.md): Translates text from one language to another using
  state-of-the-art language models.
* [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md): Extracts information from text, documents, and images
  based on user-defined extraction instructions.
* [AI_SENTIMENT](../../../sql-reference/functions/ai_sentiment.md): Analyzes the overall and category sentiment in text.

---
title: Nov 04, 2025: Cortex AI_TRANSCRIBE function (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-cortex-ai-transcribe-ga.md
section: Release Notes
---

# Nov 04, 2025: Cortex AI_TRANSCRIBE function (*General availability*)

The [AI_TRANSCRIBE](../../../sql-reference/functions/ai_transcribe.md) function is now generally available in all
Cortex-supported regions. This launch brings production-ready, SQL-native transcription for both audio and video content
within Snowflake, making it easier to extract and analyze spoken information at scale.

The general availability release includes several improvements over the preview release:

* Automatic language detection improvements for higher accuracy across multilingual and mixed-language recordings.
* Support for MP4 and other video files, enabling transcription and analysis of media content for advertising and
  sponsorship analytics.
* Support for Norwegian and Hebrew, expanding language coverage to 31 languages.
* Overall transcription quality improvements across diverse environments and acoustic conditions.

For more information, see [Cortex AI Functions: Audio](../../../user-guide/snowflake-cortex/ai-audio.md).

---
title: Nov 04, 2025: Interactive tables and interactive warehouses (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-interactive-tables-and-interactive-warehouses.md
section: Release Notes
---

# Nov 04, 2025: Interactive tables and interactive warehouses (*Preview*)

Interactive tables and interactive warehouses are now available in preview. Together,
interactive tables and interactive warehouses provide enhanced query performance and
real-time data processing capabilities for your Snowflake workloads.

Currently, this feature is available in select Amazon Web Services (AWS) regions.

For more information, see the following topics:

* [Snowflake interactive tables and interactive warehouses](../../../user-guide/interactive.md)
* [CREATE INTERACTIVE TABLE](../../../sql-reference/sql/create-interactive-table.md)
* [CREATE INTERACTIVE WAREHOUSE](../../../sql-reference/sql/create-interactive-warehouse.md)
* [Region availability](../../../user-guide/interactive.md)

---
title: Nov 04, 2025: Performance Explorer (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-performance-explorer-ga.md
section: Release Notes
---

# Nov 04, 2025: Performance Explorer (*General availability*)

Performance Explorer is now generally available and is no longer in
[Preview](../../preview-features.md).

You can use Performance Explorer in Snowsight to monitor interactive metrics for SQL workloads.
The metrics show the overall health of your Snowflake environment, query activity, changes to warehouses,
and changes to tables.

For more information, see [Analyzing query workloads with Performance Explorer](../../../user-guide/performance-explorer.md).

---
title: Nov 04, 2025: Sharing semantic views
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-sharing-semantic-views.md
section: Release Notes
---

# Nov 04, 2025: Sharing semantic views

Providers can share semantic views in [private listings](../../../collaboration/provider-listings-creating-publishing.md), in public listings on the [Snowflake Marketplace](https://app.snowflake.com/_deeplink/marketplace), and in [organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md).

For more information, see [Sharing semantic views](../../../user-guide/views-semantic/sharing-semantic-views.md).

---
title: Nov 04, 2025: Snowflake Intelligence (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-snowflake-intelligence.md
section: Release Notes
---

# Nov 04, 2025: Snowflake Intelligence (*General availability*)

With this release, we are pleased to announce the general availability of Snowflake Intelligence, which was previously available as a preview feature. Snowflake Intelligence is a powerful tool that enables you to gain insights and take action based on data in your organization.

With Snowflake Intelligence, you can:

* Create charts and get instant answers using natural language. Discover trends and analyze data without technical expertise or waiting for custom dashboards.
* Access and analyze thousands of data sources, including structured and unstructured data together. Connect insights from spreadsheets, documents, images, and databases simultaneously.

Snowflake Intelligence uses agents, which are AI models that are connected to one or more semantic views, semantic models, Cortex Search services, and tools. Agents can answer questions, provide insights, and show visualizations. Snowflake Intelligence is powered by Cortex AI Functions, Cortex Analyst, and Cortex Search.

For more information, see [Overview of Snowflake Intelligence](../../../user-guide/snowflake-cortex/snowflake-intelligence.md).

---
title: Nov 04, 2025: Snowflake Machine Learning Experiments (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-ml-experiment-tracking.md
section: Release Notes
---

# Nov 04, 2025: Snowflake Machine Learning Experiments (*Preview*)

Snowflake now offers machine learning experiments. Experiments allow you to collect training run information for your models and evaluate them through Snowsight.

Create experiment runs from your training parameters, metrics evaluations, and any generated artifacts. At the end of your training runs, experiments let you compare the collected data so that you can select the model right for you. Snowflake Experiments aren’t opinionated about any of your training information – bring any data useful for your model evaluation process.

For more information, see [Run an experiment to compare and select models](../../../developer-guide/snowflake-ml/experiments.md).

---
title: Nov 04, 2025: Snowflake Openflow - Snowflake Deployments (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-openflow.md
section: Release Notes
---

# Nov 04, 2025: Snowflake Openflow - Snowflake Deployments (*General availability*)

Snowflake announces the general availability of Openflow Snowflake Deployments, which run on Snowpark Container Services (SPCS).

---
title: Nov 04, 2025: Snowflake-managed MCP server (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-04-cortex-agents-mcp.md
section: Release Notes
---

# Nov 04, 2025: Snowflake-managed MCP server (*General availability*)

With this release, we are pleased to announce the general availability of the Snowflake-managed MCP server, which was previously available as a preview feature. The Snowflake-managed MCP server is a standards-based interface that lets AI agents securely retrieve data from Snowflake accounts without needing to deploy separate infrastructure.

Model Context Protocol (MCP) is an open-source standard that lets AI agents securely interact with business applications and external data systems, such as databases and content repositories. The Snowflake-managed MCP server provides:

* **Standardized integration:** Unified interface for tool discovery and invocation, in compliance with the rapidly evolving standards.
* **Comprehensive authentication:** Snowflake’s built-in OAuth service to enable OAuth-based authentication for MCP integrations.
* **Robust governance:** Role-based access control (RBAC) for the MCP server and tools to manage tool discovery and invocation.

You can configure the MCP server to serve Cortex Analyst and Cortex Search as tools on the standards-based interface. MCP clients discover and invoke these tools, and retrieve data required for the application.

For more information, see [Snowflake-managed MCP server](../../../user-guide/snowflake-cortex/cortex-agents-mcp.md).

---
title: Nov 05, 2025: Cortex Agents integration for Microsoft Teams and Copilot (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-05-cortex-agents-teams-ga.md
section: Release Notes
---

# Nov 05, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*General availability*)

The Cortex Agents integration for Microsoft Teams and Microsoft 365 Copilot is now generally available across all
Snowflake public cloud deployments. This integration enables users to interact conversationally with Cortex Agents
directly within the Microsoft Teams interface or in Microsoft 365 Copilot, bringing Snowflake data insights to where
users collaborate and make decisions.

For complete setup instructions and regional considerations, see
[Cortex Agents for Microsoft Teams and Microsoft 365 Copilot](../../../user-guide/snowflake-cortex/cortex-agents-teams-integration.md).

---
title: Nov 05, 2025: Shared Workspaces (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-05-shared-workspaces.md
section: Release Notes
---

# Nov 05, 2025: Shared Workspaces (*Preview*)

Shared workspaces are now available in [preview](../../preview-features.md). Shared workspaces introduce collaborative development
directly within Snowsight. Multiple users can now work together on the same set of files and folders in a governed, role-based environment.
This enables teams to collaborate more effectively while maintaining Snowflake’s existing governance and security model.

## Key features

* Create shared workspaces within a selected database and schema for team collaboration.
* Share files or folders from private workspaces into shared workspaces for team access.
* Manage access and permissions through Snowflake roles.
* Work together on shared files and folders with visibility into updates made by other users.

For details, see [Shared workspaces](../../../user-guide/ui-snowsight/workspaces-shared.md).

---
title: Nov 05, 2025: Snowpipe Streaming with high-performance architecture on Azure (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-05-snowpipe-streaming-azure-ga.md
section: Release Notes
---

# Nov 05, 2025: Snowpipe Streaming with high-performance architecture on Azure (*General availability*)

The high-performance architecture for Snowpipe Streaming, which was previously made generally available on Amazon Web Services (AWS), is now generally available for all accounts on Microsoft Azure.

This architecture is designed for large-scale, real-time data ingestion, providing high-throughput and low-latency streaming into Snowflake across both major cloud platforms.

For more information, see [Snowpipe Streaming key concepts](../../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

---
title: Nov 05, 2025: Support for paid listings in the Kingdom of Saudi Arabia (KSA) (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-05-support-for-paid-listings-ksa.md
section: Release Notes
---

# Nov 05, 2025: Support for paid listings in the Kingdom of Saudi Arabia (KSA) (*General availability*)

Providers can create and offer paid listings in KSA. For a complete list of countries where providers can offer paid listings, see [Who can provide paid listings](../../../collaboration/provider-becoming.md).

Similarly, consumers can access paid listings in KSA. For a complete list of countries where consumers can access paid listings, see [Supported consumer locations](../../../collaboration/consumer-listings-paying.md).

---
title: Nov 06, 2025: dbt Projects on Snowflake (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-06-dbt-projects-on-snowflake-ga.md
section: Release Notes
---

# Nov 06, 2025: dbt Projects on Snowflake (*General availability*)

dbt Projects on Snowflake are now generally available.

dbt Projects on Snowflake let you use familiar Snowflake features to create, edit, test, run, and manage dbt Core projects. You can use Workspaces in
Snowsight to work with dbt project files and directories and deploy a dbt project as a schema-level DBT PROJECT object. You can also
use SQL to work with dbt project objects and use Snowflake CLI commands to integrate deployment and execution into your CI/CD workflows.

## What’s new since preview

* **Performance improvements and optimizations when running dbt commands on Snowflake:** During preview, result upload usually took approximately
  6 to 6.5 minutes. Now, upload completes approximately 8 to 10x faster in ~40 to 45 seconds.
* **Secondary roles requirement:** Using dbt in Workspaces no longer requires secondary roles.
* **Increased coverage for dbt commands:** dbt Projects on Snowflake now support the `dbt docs generate` and `dbt retry` commands and additional
  flags in `dbt compile`.
* **Support in Snowsight:** View the project DAG alongside model and test source code from the Project details page.
* **Expanded execution and scheduling UI support:** Enable customers to run and schedule dbt projects from the Project details page in
  Snowsight.
* **Easy access to dbt execution artifacts:** Easily access the dbt project execution artifacts containing execution results and log files. This
  is helpful for debugging dbt project executions and integration with central observability tools.
* **Replication support of dbt project objects:** Replicate dbt project objects to failover accounts.

For more information, see [dbt Projects on Snowflake](../../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

---
title: Nov 06, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-06-dcr.md
section: Release Notes
---

# Nov 06, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 11.2

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* New custom templates: The following template flows have been removed from the clean rooms UI and made into custom templates that you can
  download and run in code:

  + Last-touch attribution: [Learn more](../../../user-guide/cleanrooms/last-touch-template.md).
  + Audience lookalike modeling: [Learn more](../../../user-guide/cleanrooms/lookalike-audience-modeling-template.md).
  + Inventory forecasting: [Learn more](../../../user-guide/cleanrooms/inventory-forecasting-template.md).
* Updates to private preview features.

---
title: Nov 07, 2025: AI_REDACT function (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-07-aisql-redact-pii.md
section: Release Notes
---

# Nov 07, 2025: AI_REDACT function (*Preview*)

The AI_REDACT function is now available in preview. This managed function detects and redacts personally identifiable
information (PII) from unstructured text data using a large language model (LLM). AI_REDACT automatically recognizes
various categories of PII (name, address, and so on, including subcategories such as first or last name) and replaces
them with placeholder values.

For example, passing the following string to AI_REDACT:

> “John Smith’s email is [jsmith@example.com](mailto:jsmith%40example.com) and he lives in San Francisco.”

Results in the following output:

> “[NAME]’s email is [EMAIL] and he lives in [ADDRESS].”

For more information, see [Detect and redact personally identifiable information (PII)](../../../user-guide/snowflake-cortex/redact-pii.md) and [AI_REDACT](../../../sql-reference/functions/ai_redact.md).

---
title: Nov 07, 2025: Pricing plans and offers (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-07-pricing-plans-offers.md
section: Release Notes
---

# Nov 07, 2025: Pricing plans and offers (*General availability*)

Pricing plans and offers are now generally available.

With pricing plans, providers can create multiple, individualized pricing, terms, and discounts for a single paid listing. Providers don’t
need to create a listing for every new pricing plan they offer consumers.

After creating a pricing plan, providers create offers that define the purchase terms for a listing, and then extend those offers to consumers. Offers provide individualized billing, payment terms, payment schedules, and contract start and end dates. Before accepting or rejecting an offer, consumers can review the terms and request changes.

For more information, see [Pricing plans and offers](../../../user-guide/collaboration/listings/pricing-plans-offers/pricing-plans-and-offers.md).

---
title: Nov 07, 2025: Storage lifecycle policies (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-07-storage-lifecycle-policies-ga.md
section: Release Notes
---

# Nov 07, 2025: Storage lifecycle policies (*General availability*)

Storage lifecycle policies are now generally available.
These schema-level objects let you manage data retention for standard Snowflake
tables by archiving or expiring rows based on your defined conditions.

Storage lifecycle policies provide the following key capabilities:

* **Reduced storage costs**: Move older data to more cost-effective archive storage tiers.
* **Regulatory compliance**: Automate data archival and deletion to help meet compliance requirements.
* **Automated data management**: Archive and delete data automatically using Snowflake-managed compute resources.
* **Flexible data retrieval**: Retrieve archived data by creating a new table that contains only the
  rows you need.

For more information, see [Storage lifecycle policies](../../../user-guide/storage-management/storage-lifecycle-policies.md).

---
title: Nov 07, 2025: Trust Center extensions (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-07-trust-center-extensions.md
section: Release Notes
---

# Nov 07, 2025: Trust Center extensions (*Preview*)

Security partners and customers can use the Snowflake Native App Framework to create *Trust Center extensions*
that provide one or more additional scanner packages. Users can discover, install, and manage Trust Center
extensions. In this preview, Trust Center extensions are available now from several security partners, including
ALTR, Hunters, OneTrust, and TrustLogix.

For more information, see [Using Trust Center extensions](../../../user-guide/trust-center/trust-center-extensions.md).

---
title: Nov 10, 2025: Snowpipe Streaming with high-performance architecture on Google Cloud Platform (GCP) (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-10-snowpipe-streaming-gcp-ga.md
section: Release Notes
---

# Nov 10, 2025: Snowpipe Streaming with high-performance architecture on Google Cloud Platform (GCP) (*General availability*)

The high-performance architecture for Snowpipe Streaming, which was previously made generally available on Amazon Web Services (AWS) and Microsoft Azure, is now generally available for all accounts on Google Cloud Platform (GCP).

This architecture is designed for large-scale, real-time data ingestion, providing high-throughput and low-latency streaming into Snowflake across these major cloud platforms.

For more information, see [Snowpipe Streaming key concepts](../../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

---
title: Nov 11, 2024 — Snowflake Data Clean Rooms release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-11-dcr.md
section: Release Notes
---

# Nov 11, 2024 — Snowflake Data Clean Rooms release notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## All developer API clean rooms now available in the web app

Clean rooms created via developer APIs are now available by default in the web app. This allows users to manage these clean rooms directly
in the web app while still being able to run any template with a custom web app form. Providers no longer need to call the register clean
room to web app APIs, in order to make clean rooms available in the web app. The following APIs have been deprecated:

* `provider.register_cleanroom_in_ui`
* `provider.view_ui_registration_request_log`

In order to make clean rooms and any changes made to the clean room reflect in the web app, providers MUST call the following API going
forward:

```sqlexample
call samooha_by_snowflake_local_db.provider.create_or_update_cleanroom_listing($cleanroom_name);
```

Additionally, both providers and consumers can specify whether their developer API clean rooms should be accessible to their users in the
web app. Please note that it may take up to 10 minutes for the clean rooms or updates from developer APIs to reflect in the web app.

For more information, see [Define a user input form for a custom template](../../../user-guide/cleanrooms/demo-flows/custom-templates.md).

## Provider run for custom web app templates

Providers can now enable provider run on custom web app templates. This enables consumers to install and set their respective policies
directly through the web app, while allowing the provider to configure and execute the template query via the web app as well. Providers
must request provider run to be enabled via developer APIs and then call the create or update listings API, prior to the consumer
installing this in the web app. Additionally, providers can customize web app form drop downs to reference options for consumer join &
column policies.

For more information, see [Snowflake Data Clean Rooms: Provider API reference guide](../../../user-guide/cleanrooms/provider.md)

## Provider and consumer activation in custom web app templates

Providers can now add a custom activation template to their custom analysis template in the web app. This enables collaborators to support
activation use cases, while deploying custom analysis templates within the web app. Providers will need to add a reference to their
activation template in the web app form.

For more information, see [provider.add_ui_form_customizations](../../../user-guide/cleanrooms/provider.md)

## SQL policy configuration updates

Previously, all join columns required an aggregation policy and would have a projection policy applied by default in the SQL Query
template. With this release, join columns will be selected by default for both aggregation and projection policies. Users can remove
and customize their policy requirements as they see fit, while no longer being required to add a join column for every table in the clean
room.

## Sync and naming support for data connectors

Users can now manually sync their data connectors to reflect any changes in the metadata related to the table in the web app. Additionally,
users can provide their preferred name for these external tables, which is prefixed with the cloud identifier for ease of reference.

---
title: Nov 13, 2025: Excluding objects from sensitive data classification (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-13-data-classification.md
section: Release Notes
---

# Nov 13, 2025: Excluding objects from sensitive data classification (*General availability*)

By default, Snowflake automatically classifies all sensitive data in a database that has a classification profile set on it. You can now
configure Snowflake to exclude schemas, tables, or columns from automatic classification so that they are skipped during the classification
process.

For more information, see [Excluding data from sensitive data classification](../../../user-guide/classify-auto-exclude.md).

---
title: Nov 13, 2025: Improved stage volume implementation in Snowpark Container Services (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-13-spcs-improved-storage-mount-ga.md
section: Release Notes
---

# Nov 13, 2025: Improved stage volume implementation in Snowpark Container Services (*General availability*)

An improved stage mount implementation for your application containers is now generally available.
For more information, see [Using Snowflake stage volumes with services](../../../developer-guide/snowpark-container-services/snowflake-stage-volume.md).

---
title: Nov 13, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-13-dcr.md
section: Release Notes
---

# Nov 13, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 11.8

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Updates to private preview features.

---
title: Nov 14, 2025: Cortex Analyst Routing Mode (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-14-cortex-analyst-routing-mode.md
section: Release Notes
---

# Nov 14, 2025: Cortex Analyst Routing Mode (*Preview*)

Routing Mode is a query strategy that prioritizes semantic SQL and falls back to standard SQL only when needed. It acts as a simpler version of SQL, with guardrails provided by semantic views. Routing mode uses semantic views to drive higher accuracy and consistency. As a result, metrics, joins, and filters follow governed definitions from the semantic view.

Routing Mode offers the following benefits:

* **Consistent metrics:** Queries use definitions from semantic views, not SQL.
* **Safer defaults:** Dimensions, metrics, and joins come from governed metadata.
* **LLM-friendly:** Shorter SQL is easier for an LLM to produce correctly.

For more information, see [Routing Mode for Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-routing-mode.md).

---
title: Nov 17, 2025: Access control enhancements for cost anomalies
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-17-cost-anomaly.md
section: Release Notes
---

# Nov 17, 2025: Access control enhancements for cost anomalies

A cost anomaly occurs when daily consumption is above or below the expected range of consumption for the day. Previously, only users with
system administration roles could view cost anomalies. Now, you can grant application roles to provide access to additional users. This new
access control model lets you grant some users the ability to view cost anomalies and other users the ability to both view and manage
anomalies.

For more information, see [Access control for cost anomalies](../../../user-guide/cost-anomalies-access-control.md).

---
title: Nov 17, 2025: Document Processing Playground (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-17-document-processing-playground.md
section: Release Notes
---

# Nov 17, 2025: Document Processing Playground (*Preview*)

The Document Processing Playground helps both technical and business users explore Snowflake’s AI-powered document processing capabilities.

The playground provides a user interface where you can complete the following tasks:

* Upload documents from a stage and experiment with the AI_EXTRACT and AI_PARSE_DOCUMENT functions
* Ask questions to extract information using AI_EXTRACT
* Preview the layout and OCR results generated by AI_PARSE_DOCUMENT
* Copy generated SQL queries for use in your worksheets
* Copy generated Python code for use in your Notebooks

For more information, see [Document Processing Playground](../../../user-guide/snowflake-cortex/document-processing-playground.md).

---
title: Nov 17, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-17-native-apps-spcs-aws-gov-fedram-ga.md
section: Release Notes
---

# Nov 17, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*General availability*)

Snowflake Native App with Snowpark Container Services now support FedRAMP on Amazon Web Services. Apps with containers can be
distributed to any Snowflake customer who can use them in FedRAMP region.

Apps running in FedRAMP can use the functionality of Snowpark Container Services, including compute pools, services, jobs, and external access integrations.

For information on limitations in Amazon government regions, see [Limitations on Snowflake Native App with Snowpark Container Services](../../../developer-guide/native-apps/limitations.md).

---
title: Nov 18, 2025: Apache Iceberg™ tables: Support for bi-directional data access with Microsoft Fabric (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-18-iceberg-microsoft-fabric-bidirectional-data-access.md
section: Release Notes
---

# Nov 18, 2025: Apache Iceberg™ tables: Support for bi-directional data access with Microsoft Fabric (*Preview*)

You can now query Apache Iceberg™ tables between Snowflake and Microsoft Fabric in both directions by using a REST API endpoint:

* Query Snowflake-managed Iceberg tables from Fabric. To query Snowflake-managed Iceberg tables in Fabric, connect a
  Snowflake database to Fabric. You can select an existing database or create a new one. After connecting, Fabric creates an item that lets
  you access your Snowflake-managed tables. For more information, see [Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric](../../../user-guide/tables-iceberg-query-using-microsoft-fabric.md).
* Query OneLake tables with Iceberg metadata from Snowflake. To query Fabric Iceberg tables registered in Snowflake, configure a REST
  catalog integration for OneLake table APIs, which provides table information from Fabric.

  For more information, see the following topics:

  > + [Configure a catalog integration for OneLake REST](../../../user-guide/tables-iceberg-configure-catalog-integration-rest-onelake.md)
  > + [Overview of OneLake table APIs](https://learn.microsoft.com/fabric/onelake/table-apis/table-apis-overview) in the Microsoft Fabric documentation
  > + [Getting started with OneLake table APIs for Iceberg](https://learn.microsoft.com/en-us/fabric/onelake/table-apis/iceberg-table-apis-get-started#snowflake)
  >   in the Microsoft Fabric documentation

---
title: Nov 20, 2025: New versions of Streamlit supported in Streamlit in Snowflake (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-20-sis.md
section: Release Notes
---

# Nov 20, 2025: New versions of Streamlit supported in Streamlit in Snowflake (General availability)

The following versions of the Streamlit open-source library are now supported in Streamlit in Snowflake:

* 1.51.0
* 1.50.0
* 1.49.1
* 1.48.0
* 1.47.0

---
title: Nov 20, 2025: SnowConvert AI interface improvements
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-20-snowconvert-ai-interface-improvements.md
section: Release Notes
---

# Nov 20, 2025: SnowConvert AI interface improvements

The SnowConvert AI interface is revised to improve efficiency, control, and usability.
In the improved interface, you can run specific flows independently, including extraction, deployment, and
data validation. There is now a dedicated project page to show you which flows you can run. The improved
interface gives you more granular control over your project and makes managing complex workflows easier.

For more information, see [SnowConvert AI: Project Creation](../../../migrations/snowconvert-docs/general/user-guide/project-creation.md).

---
title: Nov 20, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-20-dcr.md
section: Release Notes
---

# Nov 20, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 11.9

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* Fixed audience overlap threshold logic: Corrected threshold comparison in SQL templates for more accurate results. Overlap used to be
  measured as *less than*; now it’s measured as *less than or equal to*.
* Updates to private preview features.

---
title: Nov 21, 2025: AI_COMPLETE function (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-21-ai-complete-ga.md
section: Release Notes
---

# Nov 21, 2025: AI_COMPLETE function (*General availability*)

The AI_COMPLETE function is now generally available. AI_COMPLETE generates responses (completions) from prompts, using
your choice of large language model (LLM). AI_COMPLETE is the most general Cortex AI Function; it is not specialized for
a particular use case, such as summarization or classification. Instead, it can generate a wide variety of responses
based on the provided content and instructions given in the prompt. Responses can be plain text or semi-structured data.

Prompts can contain text and one or more images, which are processed according to the plain English instructions you
provide. For example:

* Explain the concept of a large language model to a five year old.
* Assess the reading level of a given piece of text and simplify it to a target level.
* Critique the writing style of the provided text as bullet points.
* Estimate the star rating for a product based on the provided customer review.
* Compare two advertising creatives and describe the differences between them, in terms of content, style, and mood.
* Determine which of the countries in a graph of inflation rates has the highest rate.
* Identify the kithen appliances in an image and provide a brief description of each.
* Extract all the stock symbols and corresponding prices mentioned in an article as a JSON object.

The AI_COMPLETE function is the updated version of the SNOWFLAKE.CORTEX.COMPLETE function. Use AI_COMPLETE to take
advantage of the latest capabilities and models.

For more information about using the AI_COMPLETE function, see [AI_COMPLETE](../../../sql-reference/functions/ai_complete.md).

---
title: Nov 21, 2025: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-21-tables-iceberg-query-using-external-query-engine-snowflake-horizon-preview.md
section: Release Notes
---

# Nov 21, 2025: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*Preview*)

You can now query Snowflake-managed Apache Iceberg™ tables by using any
external query engine that supports the open Iceberg REST protocol, such as Apache Spark™. To ensure this interoperability with
external engines, [Apache Polaris™ (incubating)](https://github.com/apache/polaris) is integrated into Horizon Catalog. You can query
these tables in a Snowflake account by using a single Horizon Catalog endpoint and you can use your existing users, roles, policies,
and authentication in Snowflake.

For more information, see [Access Apache Iceberg™ tables with an external engine through Snowflake Horizon Catalog](../../../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md).

---
title: Nov 21, 2025: Import models from Hugging Face to Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-21-hugging-face-model-import-preview.md
section: Release Notes
---

# Nov 21, 2025: Import models from Hugging Face to Snowflake (*Preview*)

Snowflake now offers support for importing models from external services in preview. In addition to curated models available on Snowflake or training your own models, you can bring any transformer from [Hugging Face](https://huggingface.co/) to your [model registry](../../../developer-guide/snowflake-ml/model-registry/overview.md) on Snowflake. Imported models can be used like any other model in a registry.

For instructions on importing models from Hugging Face, see [Import and deploy models from an external service](../../../developer-guide/snowflake-ml/model-registry/snowsight-ui.md).

---
title: Nov 21, 2025: Tri-Secret Secure data protection for Snowpark Container Services block volumes (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-21-spcs-tri-secret-secure.md
section: Release Notes
---

# Nov 21, 2025: Tri-Secret Secure data protection for Snowpark Container Services block volumes (General availability)

Snowpark Container Services [block volumes](../../../developer-guide/snowpark-container-services/block-storage-volume.md) now support [Tri-Secret Secure](../../../user-guide/security-encryption-tss.md).

---
title: Nov 21, 2025: Trust Center notifications in Snowsight (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-11-21-trust-center-in-app-notifications.md
section: Release Notes
---

# Nov 21, 2025: Trust Center notifications in Snowsight (*General availability*)

In Snowsight, you can enable Trust Center notifications about accounts that haven’t enrolled in
multi-factor authentication (MFA).

For information about enabling, viewing, and disabling Trust Center notifications for your account in
Snowsight, see [Enable notifications from Trust Center](../../../user-guide/ui-snowsight-profile.md).

---
title: November 03-06, 2023 — 7.39 Release Notes (with Snowday 2023)
source: https://docs.snowflake.com/en/release-notes/2023/7_39.md
section: Release Notes
---

# November 03-06, 2023 — 7.39 Release Notes (with Snowday 2023)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### Account Usage: New AGGREGATE_QUERY_HISTORY View — *Preview*

With this release, we are pleased to announce the AGGREGATE_QUERY_HISTORY view in
the Account Usage schema of the shared SNOWFLAKE database. This view returns
data on executed statements in aggregated intervals.

For more information, see [AGGREGATE_QUERY_HISTORY view](../../sql-reference/account-usage/aggregate_query_history.md).

### Budgets on Azure, GCP, and VPS — *Preview*

With this release, we are pleased to announce the preview of Budgets for accounts in Microsoft Azure (Azure) and
Google Cloud Platform (GCP) regions, and VPS Edition accounts.

Budgets enable account-level monitoring and notification of Snowflake credit usage for a group of specific Snowflake objects. You can
define a monthly spending limit on the compute costs for supported objects in your account. In addition to your account budget, you can
create custom budgets to monitor credit usage for a custom group of objects. Budgets sends you a notification when your credit usage is
on track to exceed your monthly limit.

For more information, see [Monitor credit usage with budgets](../../user-guide/budgets.md).

### Snowflake Native SDK for Connectors — *Preview*

With this release, Snowflake is pleased to announce preview support for the Snowflake Native SDK for Connectors.
The Snowflake Native SDK for Connectors is a set of application templates and quickstarts that show how to build a Snowflake Native App
that ingests data from an external data source into Snowflake.

For more information, see [Snowflake Native SDK for Connectors](../../developer-guide/native-apps/connector-sdk/about-connector-sdk.md).

## Security Updates

### Access control: Database roles — *General availability*

With this release, we are pleased to announce the general availability of database roles. Database roles are entities within a database
to which privileges on securable objects in the same database can be granted and revoked. Database roles are similar to account-level roles
except for their scope. Privileges on any object in an account can be granted to account roles, but only privileges on objects within
the same database can be granted to a database role.

Snowflake provides built-in database roles in the shared SNOWFLAKE [database](../../sql-reference/snowflake-db-roles.md), such as OBJECT_VIEWER and GOVERNANCE_VIEWER.
You can use these database roles to enable a least-privileged access approach to the shared SNOWFLAKE database.
For example, when you grant these database roles to an account-level role, you do not need to grant IMPORTED PRIVILEGES on the SNOWFLAKE
database to the same account-level role, which provides access to everything in the SNOWFLAKE database. Additionally, you do not need to
use the ACCOUNTADMIN role in a production environment to query views in the shared SNOWFLAKE database.

Since entering preview in December 2022, we added support for database roles in the following areas:

* Replication.
* Cloning, using the CREATE DATABASE ROLE … CLONE syntax.
* Future grants in the local database. Future grants are not supported for shared database roles.

The IS_DATABASE_ROLE_IN_SESSION function and its usage with [sharing policy-protected data](../../user-guide/data-sharing-policy-protected-data.md) continues to be in preview.

## Data Pipeline Updates

### New function SYSTEM$TASK_RUNTIME_INFO

With this release, we are pleased to announce a new system function SYSTEM$TASK_RUNTIME_INFO. This system function returns information about the current task run, which makes it easy for you to customize your task executions.

For more information, see [SYSTEM$TASK_RUNTIME_INFO](../../sql-reference/functions/system_task_runtime_info.md).

## Extensibility Updates

### External network access — *Preview on Azure*

With this release, we are pleased to announce that preview support of access to external network locations from procedure and UDF handler code is available to accounts on Azure (except the Gov region).

With an external access integration, you can:

* Write UDF and procedure handlers that access external locations.
* Allow or block access to locations on a network external to Snowflake.
* Use secrets that represent stored credentials, rather than using literal values, within handler code to authenticate with external network locations.
* Specify which secrets are allowed for use with external network locations.

For more information about using network rules with network policies, see [Overview of external network access](../../developer-guide/external-network-access/external-network-access-overview.md).

### Vectorized Python UDTFs — *General Availability*

With this release, we are pleased to announce the general availability of Vectorized Python UDTFs (user-defined table functions).

Vectorized Python UDTFs enable seamless partition-by-partition processing by operating on partitions as pandas DataFrames and returning results as pandas DataFrames or lists of pandas Series or arrays. Vectorized Python UDTFs allow for easy integration with libraries that operate on pandas DataFrames or pandas arrays.

For more information, see [Vectorized Python UDTFs](../../developer-guide/udf/python/udf-python-tabular-vectorized.md).

## Data Governance Updates

### Set a masking policy on a virtual column — *General Availability*

With this release, Snowflake is pleased to announce the general availability of setting a masking policy on a virtual column in an external table. This update allows the masking policy on the virtual column to override the masking policy that the virtual column inherits from the VALUE column. This update simplifies external table management because data administrators no longer need to create a view from the semi-structured data in the VALUE column and protect the view, and provides consistent data management and protection of the external table data because the protected virtual column does not expose data unnecessarily. This update was announced in preview in August 2023 (7.30).

For more information, see [Masking policies and external tables](../../user-guide/security-column-intro.md).

## Release Notes Change Log

| Announcement | Update | Date Updated |
| --- | --- | --- |
| *Account Usage: New AGGREGATE_QUERY_HISTORY View — Preview* | **Added** to *New Features* | 14-Dec-2023 |
| *Budgets on Azure, GCP, and VPS — Preview* | **Added** to *New Features* | 03-Nov-2023 |
| *Access control: Database roles — General availability* | **Added** to *Security Updates* | 01-Nov-2023 |
| *Snowflake Native SDK for Connectors — Preview* | **Added** to *New Features* | 31-Oct-2023 |
| *New function SYSTEM$TASK_RUNTIME_INFO* | **Added** to *Data Pipeline Updates* | 31-Oct-2023 |

---
title: November 04, 2024 — Classify Text (Snowflake Cortex LLM Function) — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-04-classify-text-ga.md
section: Release Notes
---

# November 04, 2024 — Classify Text (Snowflake Cortex LLM Function) — *General Availability*

We are pleased to announce the general availability of the Snowflake Cortex AI CLASSIFY_TEXT function. CLASSIFY_TEXT is
purpose-built to help you easily classify text records such as emails, call transcripts, and product reviews into
categories relevant to your business.

CLASSIFY_TEXT delivers state-of-the art zero-shot text classification accuracy comparable to leading models in the
market. To further improve CLASSIFY_TEXT, you can also provide label descriptions, examples corresponding to each label,
and a short explanation of the classification task. Descriptions and examples in particular are associated with higher
classification accuracy where the relationship between the input text and the categories is ambiguous.

With the CLASSIFY_TEXT function, you get ease of use and efficiency without compromising on accuracy. The outputs of
CLASSIFY_TEXT are structured in well-formatted JSON, without any additional post processing needed, for easy integration
of results into your data pipeline.

For more information, see [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/classify_text-snowflake-cortex.md).

---
title: November 04, 2024 — Data Lineage — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-04-data-lineage.md
section: Release Notes
---

# November 04, 2024 — Data Lineage — *Preview*

Snowflake is pleased to announce the preview of Data Lineage, a feature that automatically tracks the flow of data
between Snowflake objects in real-time, for example from a table to a view. Relationships among table-like objects,
columns, and stages are supported, as well as between data objects and machine learning objects including datasets,
feature views, and models. You can use lineage information to assist in impact analysis, monitoring, troubleshooting,
and compliance efforts. Lineage can also help you propagate knowledge of sensitive data elements using tags.

Lineage information is available through Snowsight, SQL, and Python. For more information, see:

* [Data Lineage in Snowsight](../../../user-guide/ui-snowsight-lineage.md)
* [GET_LINEAGE SQL function](../../../sql-reference/functions/get_lineage-snowflake-core.md)
* [Snowflake ML Lineage API](../../../developer-guide/snowflake-ml/ml-lineage.md)

---
title: November 04, 2024 — Replication error notifications — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-04-error-notifications.md
section: Release Notes
---

# November 04, 2024 — Replication error notifications — *General Availability*

With this release, we are pleased to announce the General Availability of Error notifications for replication and failover groups.
You can receive error notifications for refresh operation failures by setting a notification integration for a primary replication or failover group.

For additional details, see:

* [Error notifications for replication and failover groups](../../../user-guide/account-replication-error-notifications.md)
* [Monitoring replication and failover](../../../user-guide/account-replication-monitor.md)

---
title: November 04, 2024 — Snowflake Native Apps support for AWS Private Link  — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-04-na-aws-pl-ga.md
section: Release Notes
---

# November 04, 2024 — Snowflake Native Apps support for AWS Private Link — *General Availability*

With this release, Snowflake is pleased to announce the general availability of Native App support for
AWS PrivateLink. See [AWS PrivateLink and Snowflake](../../../user-guide/admin-security-privatelink.md) for more information.

---
title: November 04, 2024 — Top Insights (Snowflake ML Function) — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-04-top-insights-ga.md
section: Release Notes
---

# November 04, 2024 — Top Insights (Snowflake ML Function) — *General Availability*

We are pleased to announce the general availability of the Top Insights ML Function for key driver analysis. Top
Insights lets you easily identify drivers of a metric’s change over time or explain differences in a metric among
various verticals. You can integrate Top Insights into your business intelligence workflows powering
[business reviews](https://medium.com/snowflake/simplify-key-driver-analysis-at-massive-scale-using-top-insights-a-snowflake-ml-function-f9180a75716f),
business dashboards, and anomaly detection tools to understand key drivers impacting various metrics.

For more information, see [Top Insights](../../../user-guide/ml-functions/top-insights.md).

---
title: November 04-06, 2024 — 8.42 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_42.md
section: Release Notes
---

# November 04-06, 2024 — 8.42 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Trust Center: Two new scanners in the Security Essentials scanner package

With this release, we are pleased to announce two new Trust Center scanners in the Security Essentials scanner package. This scanner package
scans your account to check whether you have set up the following recommendations:

* You have an authentication policy that enforces all human users to enroll in multi-factor authentication (MFA) if they use passwords to
  authenticate.
* You [set up an event table](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#label-nativeapps-consumer-logging-setting-up) if your account [enabled event sharing for a native app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#label-nativeapps-consumer-logging-enabling), so your account receives a copy of the log messages and
  event information that is shared with the application provider.

For more information, see [Security Essentials scanner package](../../user-guide/trust-center/overview.md).

### Serverless alerts — *General availability*

With this release, we are pleased to announce the general availability of the serverless compute model for Snowflake alerts, which was previously available as a preview feature.

When you configure an alert to use the serverless compute model, Snowflake automatically resizes and scales up or down the compute resources required for the alert. Snowflake determines the ideal size of the compute resources for a given run based on a dynamic analysis of statistics for the most recent previous runs of the same alert.

To use the serverless compute model for an alert, omit the WAREHOUSE parameter when executing the [CREATE ALERT](../../sql-reference/sql/create-alert.md) command.

For more information, see [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md).

## SQL updates

### PARSE_JSON and TRY_PARSE_JSON functions: Duplicate keys are now allowed

With this release, the [PARSE_JSON](../../sql-reference/functions/parse_json.md) and [TRY_PARSE_JSON](../../sql-reference/functions/try_parse_json.md) functions have a new `parameter` argument. When this argument is set to `d`, duplicate keys are allowed in the string being parsed. If there are duplicate keys, these functions only return the value associated with the last occurrence of each key.

## Extensibility updates

### New Tensorflow version might require specifying Keras

With this release, version 2.17.0 of the Tensorflow library has been added. The new Tensorflow version includes a changed module structure for Keras, a deep learning API. If you have user-defined functions (UDFs) or procedures that use Tensorflow but don’t specify a version earlier than 2.17.0, Snowflake will assume that your handler should automatically begin using version 2.17.0 when you execute CREATE OR REPLACE for the UDF or procedure.

When you create or update the UDF or procedure, you might see an error such as the following:

```none
from tensorflow.keras.models import Sequential ModuleNotFoundError: No module named 'tensorflow.keras' in function
```

To resolve this error, follow the guidance in [Migrating Keras 2 code to multi-backend Keras 3](https://keras.io/guides/migrating_to_keras_3/). You might need to add Keras as a separate package using the PACKAGES parameter in CREATE OR REPLACE.

## Data pipeline updates

### Tasks: Serverless tasks user control — *General availability*

With this release, we are pleased to announce that you can take some control over the cost and performance of serverless tasks by setting the following parameters: SERVERLESS_TASK_MAX_STATEMENT_SIZE, SERVERLESS_TASK_MIN_STATEMENT_SIZE, and TARGET_COMPLETION_INTERVAL.

For more information, see [Serverless tasks](../../user-guide/tasks-intro.md).

### Tasks: Task success notifications — *General availability*

With this release, we are pleased to announce the general availability of task success notifications. Snowflake can push success notifications to a cloud messaging service when a task graph completes successfully. Success notification integration is only specified on a root task of a task graph. Snowflake only sends success notifications when the entire task graph is successfully executed and will not send notifications for any successfully executed standalone task.

For more information, see [Configure a task to send success notifications](../../user-guide/tasks-success-integrate.md).

## AI & ML updates

### API-level Role-based Access Control (RBAC) for Cortex Analyst

To further enhance security and access management, we are introducing API-level Role-based Access Control (RBAC) for Cortex Analyst. All requests made to Cortex Analyst must use a role which has been granted the [CORTEX USER](../../user-guide/snowflake-cortex/aisql.md) role. This provides admins a way to control who can call Cortex Analyst with Snowflake RBAC. CORTEX_USER is granted to PUBLIC by default. For more information, see [Cortex LLM privileges](../../user-guide/snowflake-cortex/aisql.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 01-Nov-24 |
| API-level Role-based Access Control (RBAC) for Cortex Analyst | **Added** to *AI & ML updates* | 05-Nov-24 |
| Tasks: Serverless tasks user control | **Added** to *Data pipeline updates* section | 06-Nov-24 |
| Outbound private connectivity for Snowflake features | **Removed** from *New features* section | 07-Nov-24 |
| *Trust Center: Two new scanners in the Security Essentials scanner package* | **Added** to *New features* section | 11-Nov-24 |

---
title: November 06, 2024 — SPLIT_TEXT_RECURSIVE_CHARACTER Cortex function — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-06-split-text-recursive-character.md
section: Release Notes
---

# November 06, 2024 — SPLIT_TEXT_RECURSIVE_CHARACTER Cortex function — *Preview*

Snowflake is pleased to announce the preview of the SPLIT_TEXT_RECURSIVE_CHARACTER function. This function splits a
string into smaller chunks of text, recursively, so that the text can be passed to embedding or search indexing
functions. Since many language models have a limit on the number of tokens they can process, this function is essential
to processing text larger than the token limit.

For more information, see [SPLIT_TEXT_RECURSIVE_CHARACTER (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/split_text_recursive_character-snowflake-cortex.md).

---
title: November 08, 2024 — Grouped Query History — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-08-grouped-query-history.md
section: Release Notes
---

# November 08, 2024 — Grouped Query History — *Preview*

Snowflake is pleased to announce the preview of Grouped Query History, a view in Snowsight to monitor usage
and performance of critical and frequently run queries. This graphical view is based on information
that is recorded in the [AGGREGATE_QUERY_HISTORY view](../../../sql-reference/account-usage/aggregate_query_history.md) and is particularly useful for
monitoring and analyzing [Unistore workloads](https://www.snowflake.com/en/data-cloud/workloads/unistore/).

For more information, see:

* [Use the Grouped Query History view in Snowsight](../../../user-guide/ui-snowsight-activity.md)
* [Monitor hybrid table workloads](../../../user-guide/tables-hybrid-monitor-workload.md)

---
title: November 08, 2024 — Snowflake Microsoft Sharepoint connector
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-08.md
section: Release Notes
---

# November 08, 2024 — Snowflake Microsoft Sharepoint connector

## Snowflake Connector for SharePoint

With this release, we are pleased to announce the preview of Snowflake Connector for SharePoint.

The Snowflake Connector for SharePoint connector connects a Microsoft 365 SharePoint site and Snowflake to ingest
files and user permissions and keeps them up to date.
Snowflake Connector for SharePoint also supports the Cortex Search service and can make ingested files ready for conversational
analysis for use in AI Assistants using SQL, Python or REST APIs.

For more details, see [About the Snowflake Connector for SharePoint](../../../connectors/unstructured-data-connectors/sharepoint/about.md).

Release notes:

* [Snowflake Connector for SharePoint release notes](../../connectors/sharepoint.md)

---
title: November 09-10, 2023 — 7.40 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_40.md
section: Release Notes
---

# November 09-10, 2023 — 7.40 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## SQL Updates

### Search Optimization: Support for Substring Search in Semi-Structured Data — *General Availability*

With this release, we are pleased to announce the general availability of [Search Optimization](../../user-guide/search-optimization-service.md)
support for substring and regular expression search in [semi-structured data](../../sql-reference/data-types-semistructured.md), including
ARRAY, OBJECT, and VARIANT columns. Previously, only equality searches on such columns could be optimized.

Substring queries include predicates that use the following keywords:

* LIKE, ILIKE, LIKE ANY, LIKE ALL, ILIKE ANY
* STARTSWITH, ENDSWITH, CONTAINS
* RLIKE, REGEXP, REXEP_LIKE
* SPLIT_PART

You can enable substring search optimization for semi-structured data by running ALTER TABLE commands on specific columns. For example:

```sqlexample
ALTER TABLE test_table
  ADD SEARCH OPTIMIZATION ON SUBSTRING(variant_col:data.field);
```

For more information on this search optimization improvement, including its capabilities and limitations, see
[Substring search in VARIANT types](../../user-guide/search-optimization/semi-structured-queries.md).

### Email Notification Integrations: ALLOWED_RECIPIENTS No Longer Required

With this release, you no longer need to set the ALLOWED_RECIPIENTS property of email notification integrations before you call the
[SYSTEM$SEND_EMAIL](../../sql-reference/stored-procedures/system_send_email.md) stored procedure.

If the ALLOWED_RECIPIENTS property is not set, you can use SYSTEM$SEND_EMAIL to send email messages to any Snowflake user in your
account, provided that the [user has verified their email address](../../user-guide/notifications/email-notifications.md).

To create an email notification integration without specifying a set of allowed recipients, execute the
[CREATE NOTIFICATION INTEGRATION](../../sql-reference/sql/create-notification-integration.md) command with TYPE=EMAIL without specifying ALLOWED_RECIPIENTS.

To remove the set of allowed recipients from an existing email notification integration, execute
[ALTER NOTIFICATION INTEGRATION … UNSET ALLOWED_RECIPIENTS](../../sql-reference/sql/alter-notification-integration.md).

For more information on sending email notifications, see [Using SYSTEM$SEND_EMAIL to send email notifications](../../user-guide/notifications/email-stored-procedures.md).

## Web Interface Updates

### Replication and Client Redirect in Snowsight — *Preview*

With this release, we are pleased to announce the preview of the Replication page in Snowsight. You can now create, edit, and monitor
replication and failover groups, and connections for Client Redirect using Snowsight.

In the Groups tab, you can create primary and secondary replication and failover groups, and edit existing groups. You can also view
the status and history of refresh operations, including detailed information about each refresh operation, for each replication and
failover group in your organization.

In the Client Redirect tab, you can create a connection, edit the target account for an existing connection, view details, and
monitor usage of connections in your organization.

For more information, see:

* [Replicating databases and account objects across multiple accounts](../../user-guide/account-replication-config.md)
* [Redirecting client connections](../../user-guide/client-redirect.md)

### Snowsight is the default interface for Snowflake accounts in US government regions

Starting November 6, 2023, all users in Snowflake accounts in US government regions see Snowsight after logging in.

For more information, see [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

## Release Notes Change Log

| Announcement | Update | Date Updated |
| --- | --- | --- |
| *Replication and Client Redirect in Snowsight — Preview* | **Added** to *Web Interface Updates* | 10-Nov-2023 |

---
title: November 11, 2024 — Snowflake Native App Framework release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-11-native-apps.md
section: Release Notes
---

# November 11, 2024 — Snowflake Native App Framework release notes

Snowflake is pleased to announce the availability of the following new features and enhancements.

## Snowflake Native Apps with Snowpark Container Services in AWS — *General availability*

Snowflake Native App with Snowpark Container Services is now generally available on Amazon Web Services (AWS). Snowflake Native Apps built using containers can be
distributed to any Snowflake customer who can use them in production AWS commercial regions. Apps with containers
provide all of the functionality of Snowpark Container Services, including compute pools, services, jobs, external
access integrations, etc. within a Snowflake Native App.

## Snowflake Native Apps with Snowpark Container Services in Azure — *Preview*

Snowflake Native App with Snowpark Container Services is now in preview on Microsoft Azure. Snowflake Native Apps built using containers can be
distributed to any Snowflake customer who can use them in Azure commercial regions.

Providers can develop apps that can be distributed to Snowflake customers in both AWS or Azure. Apps running in
Azure can use the functionality of Snowpark Container Services, including compute pools, services, jobs, external access
integrations, etc. within a Snowflake Native App.

## Native App Framework support for Budgets

With this release, a Snowflake Native App can use [Budgets](../../../user-guide/budgets.md) to monitor credit usage. Customers can
set up spending limits for app, which include all the app’s compute resources, including compute pools and warehouses.
After installing an app, consumers can view and create budgets.

---
title: November 11, 2024 — Snowflake Notebooks Warehouse Runtime — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-11-notebooks-wh-ga.md
section: Release Notes
---

# November 11, 2024 — Snowflake Notebooks Warehouse Runtime — *General Availability*

With this release, we are pleased to announce the general availability of Snowflake Notebooks on Amazon Web Services (AWS), Microsoft
Azure, and Google Cloud Platform (GCP) commercial regions.

Snowflake Notebooks is a development interface in Snowsight that offers an interactive, cell-based programming environment for Python
and SQL. In Snowflake Notebooks, you can perform exploratory data analysis, develop machine learning models, and perform other data science and
data engineering tasks all in one place. For more information, see [About Legacy Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks.md).

## Updates

* **Cell output limits** - When viewing cell results for each DataFrame output, only 10,000 rows or 8 MB is displayed, whichever is lower.
  For each cell, only 20 MB of output is allowed.
* **Reconnection/idle time setting** - Users can configure an idle timeout setting to automatically shut down the notebook session once the
  setting is met. The default setting is 30 minutes. Notebooks will automatically reconnect if you return to the session before the timeout
  setting has elapsed. You can change the idle time for each notebook in Snowsight or using the IDLE_AUTO_SHUTDOWN_TIME_SECONDS
  property using SQL. For more information, see [Develop and run code in Snowflake Notebooks](../../../user-guide/ui-snowsight/notebooks-develop-run.md).

---
title: November 11-14, 2023 — 7.41 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2023/7_41.md
section: Release Notes
---

# November 11-14, 2023 — 7.41 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Behavior Change Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2023_08](../bcr-bundles/2023_08_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_07](../bcr-bundles/2023_07_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_06](../bcr-bundles/2023_06_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for January 2024; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL Updates

### New SQL functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Encryption | [TRY_DECRYPT](../../sql-reference/functions/try_decrypt.md) | A special version of [DECRYPT](../../sql-reference/functions/decrypt.md) that returns a NULL value if an error occurs during decryption. |
| Encryption | [TRY_DECRYPT_RAW](../../sql-reference/functions/try_decrypt_raw.md) | A special version of [DECRYPT_RAW](../../sql-reference/functions/decrypt_raw.md) that returns a NULL value if an error occurs during decryption. |

### UDFs and Stored Procedures: Support for Optional Arguments

You can now specify that an argument in a UDF or stored procedure is optional. In the CREATE FUNCTION or CREATE PROCEDURE statement, use the new DEFAULT keyword to specify the default value for an optional argument. For example:

```sqlexample
CREATE FUNCTION my_udf(
  arg_1 VARCHAR,
  arg_2_optional VARCHAR DEFAULT 'default'
  arg_3_optional INTEGER DEFAULT 0) ...
```

When you call the UDF or stored procedure, you can omit any of the optional arguments:

```sqlexample
SELECT my_udf(arg_1 => 'value', arg_3_optional => 1);
```

For more information, see the [documentation on optional arguments for UDFs and stored procedures](../../developer-guide/udf-stored-procedure-arguments.md).

### Snowflake alerts: Manual execution of alerts

With this release, we are pleased to announce the new EXECUTE ALERT SQL command, which you can use to execute an alert manually.

You can use the EXECUTE ALERT command to:

* Verify that a new alert works as you would expect.
* Execute an alert at a specific point in your data pipeline (for example, at the end of a stored procedure call).

You can run the EXECUTE ALERT command interactively. You can also run this command within a stored procedure or a Snowflake Scripting block.

For more information, see [EXECUTE ALERT](../../sql-reference/sql/execute-alert.md).

## Security Updates

### Replication of network rules — *Preview*

With this release, we are pleased to announce that network rules, which are currently in preview, are now replicated when you replicate the database in which they are contained. References between network policies and network policies are also replicated.

For more information, see [Replicating network policies](../../user-guide/account-replication-security-integrations.md).

## Data Pipeline Updates

### Dynamic tables: Support for GRANT <privilege> ON ALL/FUTURE DYNAMIC TABLE - Preview

As part of Behavior Change [bundle 2023_08](../bcr-bundles/2023_08_bundle.md), Snowflake has enabled support for granting privileges on all and future dynamic tables.

For more information, see [GRANT <privileges> … TO ROLE](../../sql-reference/sql/grant-privilege.md).

### Dynamic tables: Support for GRANT ALL/ALL PRIVILEGES ON DYNAMIC TABLE - Preview

As part of Behavior Change [bundle 2023_08](../bcr-bundles/2023_08_bundle.md), Snowflake has enabled support for bulk grants on dynamic tables.

For more information, see [Dynamic table privileges](../../user-guide/security-access-control-privileges.md).

## Web Interface Updates

### More control over notification contacts in Snowsight

Managing your notification contacts in Snowsight is improved, with more granular control over who can receive notifications for an organization or account, and the ability to send notifications to multiple email addresses.

For more information, see [About notification contacts for Snowflake](../../user-guide/ui-snowsight-contacts.md).

### Snowsight is the default interface for Snowflake accounts in US government regions

Concluding the week of November 13, 2023, all users in Snowflake accounts in US government regions see Snowsight after logging in.

For more information, see [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

### Changes to formatting of query results in worksheets and dashboards

As part of a behavior change in bundle 2023_07, query results no longer have automatic formatting applied in Snowsight.

For more information, see [Snowsight worksheets and dashboards: Changes to formatting of query results](../bcr-bundles/2023_07/bcr-1314.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Dynamic tables: Support for GRANT <privilege> ON ALL/FUTURE DYNAMIC TABLE - Preview | **Added** to *Data Pipeline Updates* | 15-Nov-2023 |
| Dynamic tables: Support for GRANT ALL/ALL PRIVILEGES ON DYNAMIC TABLE - Preview | **Added** to *Data Pipeline Updates* | 15-Nov-2023 |

---
title: November 12, 2024 — Additional CREATE OR ALTER commands — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-create-or-alter-pupr.md
section: Release Notes
---

# November 12, 2024 — Additional CREATE OR ALTER commands — *Preview*

With this release, we are pleased to announce the preview of additional CREATE OR ALTER commands. These
commands combine the functionality of the CREATE command and the ALTER command. A CREATE OR ALTER statement
executes as a CREATE statement if the object doesn’t exist. If it does exist, it transforms the object
according to the object definition in the statement.

CREATE OR ALTER TABLE provides a declarative and idempotent approach to defining your Snowflake objects. When
used together with the Git integration, this enables an Infrastructure-as-Code (IaC) approach to database
change management.

With this preview, the following additional objects are supported:

* [CREATE OR ALTER APPLICATION ROLE](../../../sql-reference/sql/create-application-role.md): Creates an application role if it doesn’t exist or alters an existing
  application role.
* [CREATE OR ALTER DATABASE](../../../sql-reference/sql/create-database.md): Creates a database if it doesn’t exist or alters an existing
  database.
* [CREATE OR ALTER DATABASE ROLE](../../../sql-reference/sql/create-database-role.md): Creates a database role if it doesn’t exist or alters an existing
  database role.
* [CREATE OR ALTER ROLE](../../../sql-reference/sql/create-role.md): Creates a role if it doesn’t exist or alters an existing role.
* [CREATE OR ALTER SCHEMA](../../../sql-reference/sql/create-schema.md): Creates a schema if it doesn’t exist or alters an existing
  schema.
* [CREATE OR ALTER STAGE](../../../sql-reference/sql/create-stage.md): Creates a stage if it doesn’t exist or alters an existing stage.
* [CREATE OR ALTER VIEW](../../../sql-reference/sql/create-view.md): Creates a view if it doesn’t exist or alters an existing view.
* [CREATE OR ALTER WAREHOUSE](../../../sql-reference/sql/create-warehouse.md): Creates a warehouse if it doesn’t exist or alters an existing warehouse.

For more information, see [CREATE OR ALTER <object>](../../../sql-reference/sql/create-or-alter.md).

---
title: November 12, 2024 — Budgets: Support for cloud provider queue and webhook notifications
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-budget-notification-queue-webhook.md
section: Release Notes
---

# November 12, 2024 — Budgets: Support for cloud provider queue and webhook notifications

You can now configure your account budget and custom budgets so that notifications are sent to the following:

* A queue provided by a cloud service (Amazon SNS, Azure Event Grid, or Google Cloud PubSub).
* A webhook for Slack, Microsoft Teams, or PagerDuty.

To do this, you create a [notification integration for a queue](../../../user-guide/notifications/queue-notifications.md) or a
[webhook](../../../user-guide/notifications/webhook-notifications.md), and you call a method to associate the integration with the
budget. The BUDGET class now supports the following new methods:

| Method | Description |
| --- | --- |
| [<budget_name>!ADD_NOTIFICATION_INTEGRATION](../../../sql-reference/classes/budget/methods/add_notification_integration.md) | Adds a queue or webhook notification integration to a custom budget or the account budget. |
| [<budget_name>!GET_NOTIFICATION_INTEGRATIONS](../../../sql-reference/classes/budget/methods/get_notification_integrations.md) | Returns information about the queue and webhook notification integrations associated with a custom budget or the account budget. |
| [<budget_name>!REMOVE_NOTIFICATION_INTEGRATION](../../../sql-reference/classes/budget/methods/remove_notification_integration.md) | Removes a queue or webhook notification integration from a custom budget or the account budget. |

For information, see [Notifications for budgets](../../../user-guide/budgets/notifications.md).

---
title: November 12, 2024 — Classification (Snowflake ML Function) — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-ml-functions-classification-ga.md
section: Release Notes
---

# November 12, 2024 — Classification (Snowflake ML Function) — *General Availability*

We are pleased to announce the general availability of Snowflake ML Classification, a machine learning function that
sorts data into different classes using patterns detected in training data, making it easy for data scientists and
analysts to quickly get binary or multi-class predictions. Top use cases for Snowflake ML Classification include
powering buy and churn predictions, credit card detection, and spam detection.

Snowflake ML Classification is fully managed and abstracts away the complexity of working with different machine
learning frameworks and algorithms for categorical prediction. For more information,
see [Classification](../../../user-guide/ml-functions/classification.md).

---
title: November 12, 2024 — Dynamic tables: Support for reading from Snowflake-managed Iceberg tables and creating dynamic Apache Iceberg™ tables –— General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-dynamic-iceberg-tables.md
section: Release Notes
---

# November 12, 2024 — Dynamic tables: Support for reading from Snowflake-managed Iceberg tables and creating dynamic Apache Iceberg™ tables –— *General Availability*

We are pleased to announce the general availability of the following new capabilities related to dynamic tables and Snowflake-managed Apache Iceberg™ tables:

* Creating a dynamic table that reads from a Snowflake-managed Iceberg table as the source, just like regular tables.
* Creating a dynamic Iceberg table, a new dynamic table type that stores query results in the Iceberg table format.

You can use dynamic Iceberg tables for the following scenarios:

* **Data lake integration:** You can store large datasets cost-effectively while performing transformations and analytics within
  Snowflake, leveraging the Iceberg format for efficient querying and management.
* **Defining continuous data transformation pipelines:** By using dynamic tables, you can ensure data is always up to date without
  manual intervention and handle high-velocity data streams efficiently with incremental processing.

For more information, see [Create dynamic Apache Iceberg™ tables](../../../user-guide/dynamic-tables-create-iceberg.md) and [Introduction to streams](../../../user-guide/streams-intro.md).

---
title: November 12, 2024 — Organizational listings and the Internal Marketplace — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-organizational-listings.md
section: Release Notes
---

# November 12, 2024 — Organizational listings and the Internal Marketplace — *General Availability*

Snowflake is pleased to announce the general availability of the Internal Marketplace – a directory of data products shared across the
customer’s organization. It lets customers discover and access available data and app products from all teams and business units within
their Snowflake Organization.

To share data products in the Internal Marketplace, you use an organizational listing in Snowsight or in the API.
You control access by adding designated roles and/or accounts within the same Snowflake Organization.

For more information, see [About organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md).

> **Note:**
>
> Rolling out a feature is a phased process designed to ensure smooth implementation and proactively address potential issues.
> New releases are introduced gradually across Snowflake regions, starting with the first region on the initial release date and
> continuing over time, typically spanning about a month.
>
> Documentation is published in alignment with this rollout process. For questions, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support) or your
> Snowflake representative.

---
title: November 12, 2024 — Snowflake ML: Distributed Hyperparameter Optimization on Snowpark Container Services — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-12-snowflake-ml-hpo-spcs.md
section: Release Notes
---

# November 12, 2024 — Snowflake ML: Distributed Hyperparameter Optimization on Snowpark Container Services — *Preview*

Snowflake is pleased to announce the preview of the Snowflake ML Hyperparameter Optimization (HPO) API, a model-agnostic
framework that enables efficient, parallelized hyperparameter tuning of models. This API is available within a Snowflake
Notebook configured to use the Container Runtime on Snowpark Container Services (SPCS).

For more information, see [Parallel Hyperparameter Optimization (HPO) on Container Runtime](../../../developer-guide/snowflake-ml/container-hpo.md).

---
title: November 12-14, 2024 — 8.43 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_43.md
section: Release Notes
---

# November 12-14, 2024 — 8.43 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Full-text search — *General availability*

Full-text search is now generally available. To use full-text search, call the new [SEARCH](../../sql-reference/functions/search.md) and
[SEARCH_IP](../../sql-reference/functions/search_ip.md) functions to find character data (text) and IP addresses in specified columns from a table,
including elements in VARIANT, OBJECT, and ARRAY columns. In most cases, you call the function by specifying it in the SELECT list or the WHERE
clause of a SELECT statement.

The SEARCH function supports token-based text search across multiple columns (or all columns) of a table, which is a good solution
for the following use cases:

* Searching for text in data with an inherent structure, where tokens naturally correspond to words, fields, or message components.

  Token searches can exactly match the specified text in a large amount of data, which results in fewer false positives and simpler queries than
  substring searches. For example, a token search for “unauthorized access” in the system logs finds case-insensitive instances of “unauthorized”
  and “access” but doesn’t find instances of “authorized” or “accessible.”

  In addition, for these cases, SEARCH is typically faster than comparable queries that use ILIKE.
* Searching for text without knowing the exact location of relevant data. Because full-text search supports a column wildcard, you can search
  for relevant text in a set of columns or entire tables without writing complex SQL queries. For example, you can use full-text search to
  search for a list of usernames in a table.

The SEARCH_IP function searches for valid IPv4 addresses in specified character-string columns, including elements in VARIANT, OBJECT, and ARRAY
columns. The search can find matches for a single IP address or a CIDR range of IP addresses in a large amount of data.

To improve the performance of full-text search queries, you can optionally enable FULL_TEXT search optimization on a specific column or set
of columns in a table. To do so, execute an ALTER TABLE … ADD SEARCH OPTIMIZATION ON FULL_TEXT statement. The resulting access path is
generally faster and cheaper to build, and requires less storage on disk than `ON SUBSTRING`.

For more information about full-text search, see [Using full-text search](../../user-guide/querying-with-search-functions.md). For more information about search
optimization for full-text search queries, see [Enabling and disabling search optimization](../../user-guide/search-optimization/enabling.md).

### Leaked password protection

With this release, we are pleased to announce leaked password protection, a background service in Snowflake that monitors and disables
passwords that have been leaked to help prevent unauthorized access to Snowflake accounts. The leaked password protection service provides a
notification system for administrators so they are aware of leaked passwords when they are detected in external databases.

For more information, see [Leaked password protection](../../user-guide/leaked-password-protection.md).

### Tasks: Python and JVM support for serverless tasks — *General availability*

## SQL updates

### EXECUTE IMMEDIATE FROM: Support for using content from staged files in templates

With this release, in a [Jinja2 template](../../sql-reference/sql/execute-immediate-from.md), you can include, import, inherit from, and read
content from other files on a stage.

You can use Jinja2’s [include](https://jinja.palletsprojects.com/en/stable/templates/#include), [import](https://jinja.palletsprojects.com/en/stable/templates/#import),
and [inheritance](https://jinja.palletsprojects.com/en/stable/templates/#template-inheritance) features or call the
[SnowflakeFile API](/developer-guide/snowpark/reference/python/latest/snowpark/files) to use content from files on a stage. This enables
you to make your templates more modular. For example, you can define macros in a common file and use those macros in different templates.

For more information, see [Using content from staged files in a template](../../sql-reference/sql/execute-immediate-from.md).

### Automatic logging and tracing for Snowflake Scripting stored procedures

With this release, you can automatically log and emit trace information about the execution of a Snowflake Scripting stored procedure. The
additional log information includes the BEGIN/END of a Snowflake Scripting block and a child job request. The additional types of trace events
include exception catching, information about child job execution, child job statistics, and stored procedure statistics, including execution
time and input values. By using this feature, you can generate this additional information without modifying the body of the stored procedure.

To use the feature, set the new [AUTO_EVENT_LOGGING](../../sql-reference/parameters.md) parameter to LOGGING, TRACING, or ALL using the [ALTER PROCEDURE](../../sql-reference/sql/alter-procedure.md)
command.

For more information, see [Automatically add log messages about blocks and child jobs](../../developer-guide/logging-tracing/logging-snowflake-scripting.md) and [Automatically emit trace events for child jobs and exceptions](../../developer-guide/logging-tracing/tracing-snowflake-scripting.md).

### ACCOUNT_USAGE: New SERVERLESS_ALERT_HISTORY view

With this release, we are pleased to announce the SERVERLESS_ALERT_HISTORY view in the ACCOUNT_USAGE schema of the shared
SNOWFLAKE database. You can query this view to get information about the credits used for serverless alerts.

For more information, see [SERVERLESS_ALERT_HISTORY view](../../sql-reference/account-usage/serverless_alert_history.md).

## Extensibility updates

### Authentication with AWS IAM from procedures and functions — *General availability*

With this release, we are pleased to announce general availability of support for authenticating with AWS services from a procedure or
functions using [Snowpark External Access](../../developer-guide/external-network-access/external-network-access-overview.md) via Identity and
Access Management (IAM).

For more information, see [Accessing Amazon S3 with AWS IAM](../../developer-guide/external-network-access/external-network-access-examples.md).

## Listings updates

### LISTING_REFRESH_HISTORY — *General availability*

With this release, we are pleased to announce general availability of the new function LISTING_REFRESH_HISTORY. You can use this function
to view the past 14 days of refresh history for a cross-cloud auto-fulfillment listing. The information returned contains replication
details for refresh events where the listing is synchronized to a specified target region.

For more information, see [LISTING_REFRESH_HISTORY](../../sql-reference/functions/listing_refresh_history.md).

## Data pipeline updates

### Dynamic tables: Support for replication across different failover groups

With this release, we are pleased to announce support for replication of dynamic tables and base tables that are in different failover groups.

For more information, see [Replication and dynamic tables](../../user-guide/account-replication-considerations.md).

## Data Lake updates

### Apache Iceberg™ tables: Support for Microsoft Fabric OneLake storage — *Preview*

With this release, we are pleased to announce support for Microsoft Fabric OneLake as a storage destination for Iceberg tables.
You can now create an external volume that connects Snowflake to Fabric OneLake storage, then create a Snowflake-managed table that writes to that location.
You can query the table using both Snowflake and Fabric.

For more information, see [CREATE EXTERNAL VOLUME](../../sql-reference/sql/create-external-volume.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 08-Nov-24 |
| *ACCOUNT_USAGE: New SERVERLESS_ALERT_HISTORY view* | **Added** to *SQL updates* section | 11-Nov-24 |
| *LISTING_REFRESH_HISTORY — General availability* | **Added** to *Listings updates* section | 14-Nov-24 |
| *Tasks: Python and JVM support for serverless tasks — General availability* | **Added** to *New features* section | 14-Nov-24 |
| *Apache Iceberg tables: Support for Microsoft Fabric OneLake storage — Preview* | **Added** to *Data Lake updates* section | 14-Nov-24 |
| Dynamic tables: Support for replication across different failover groups | **Added** to *Data pipeline updates* section | 25-Nov-24 |
| *Leaked password protection* | **Added** to *New features* section | 19-Nov-24 |

---
title: November 13, 2024 — Hybrid tables support extended to additional AWS regions
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-13-hybrid-tables-ga-regions.md
section: Release Notes
---

# November 13, 2024 — Hybrid tables support extended to additional AWS regions

With this release, Snowflake is pleased to announce the general availability of hybrid tables in all commercial
[AWS regions](../../../user-guide/intro-regions.md). Support for the following regions has been added:

| Cloud Region | Cloud Region ID |
| --- | --- |
| Europe (Zurich) | eu-central-2 |
| Asia Pacific (Jakarta) | ap-southeast-3 |

For more information, see [Hybrid tables](../../../user-guide/tables-hybrid.md).

---
title: November 14, 2024 — Cortex Analyst
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-14-cortex-analyst.md
section: Release Notes
---

# November 14, 2024 — Cortex Analyst

Snowflake is pleased to announce the following Cortex Analyst features.

## Multi-turn conversation in Cortex Analyst — *Preview*

Cortex Analyst now supports multi-turn conversations for data-related questions. This feature enables asking
follow-up questions that build on previous queries, creating a more dynamic and interactive data exploration
experience. For example, the user asks: “What is the month-over-month revenue growth for 2021 in Asia?”,
and then follows up with: “What about North America?”

For more information, see [Multi-turn conversation in Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md).

## Joins support in Cortex Analyst — *Preview*

Cortex Analyst now supports SQL joins, enabling more advanced data analysis across multiple tables, especially
in star schema structures. This feature allows you to query data from fact tables and associated dimension
tables with ease.

For more information, see [Identifying the relationships between logical tables](../../../user-guide/views-semantic/sql.md).

---
title: November 14, 2024 — Manage account preview features — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-14-manage-preview.md
section: Release Notes
---

# November 14, 2024 — Manage account preview features — *General Availability*

Snowflake is pleased to announce the General Availability of account level management of preview features.
With this capability, account administrators can enable, disable and view the status of [preview features](../../preview-features.md) within their account.

For additional details, see:

* [SYSTEM$DISABLE_PREVIEW_ACCESS](../../../sql-reference/functions/system_disable_preview_access.md)
* [SYSTEM$ENABLE_PREVIEW_ACCESS](../../../sql-reference/functions/system_enable_preview_access.md)
* [SYSTEM$GET_PREVIEW_ACCESS_STATUS](../../../sql-reference/functions/system_get_preview_access_status.md)

---
title: November 15, 2024 — Apache Iceberg™ tables: Efficient bulk loading, continuous ingestion, and data streaming — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-15-iceberg-tables-loading.md
section: Release Notes
---

# November 15, 2024 — Apache Iceberg™ tables: Efficient bulk loading, continuous ingestion, and data streaming — *General Availability*

With this release, Snowflake is pleased to announce the general availability of the following features,
which support efficient bulk loading, continuous ingestion, and data streaming into Snowflake-managed Iceberg tables.

You can now use the same core Snowflake ingestion features like COPY INTO <table>, Snowpipe, and Snowpipe Streaming, to
load data into both standard Snowflake tables and Iceberg tables.

For more information, see [Load data into Apache Iceberg™ tables](../../../user-guide/tables-iceberg-load.md).

## COPY INTO <table> and Snowpipe continuous file ingestion

You can use the following `LOAD_MODE` options with the [COPY INTO <table>](../../../sql-reference/sql/copy-into-table.md) command
and [Snowpipe automated loading](../../../user-guide/data-load-snowpipe-auto.md) to load data from files into a Snowflake-managed Iceberg table:

* `FULL_INGEST`: Loads data from any supported file format, converts to validated Iceberg-compatible Parquet,
  and optionally lets you transform or filter the data before loading.
* `ADD_FILES_COPY`: Loads data from Iceberg-compatible Parquet data files by performing a server-side copy of the files
  into the table’s base location and fast registering the files to the table.

## Snowpipe Streaming

With Snowflake Ingest SDK versions 3.0.0 and later, Snowpipe Streaming can stream rows into Snowflake-managed Iceberg tables.
To enable this feature, set the property `ENABLE_ICEBERG_STREAMING=true` in the `profile.json` file.

For more information, see [Load data into Apache Iceberg™ tables](../../../user-guide/tables-iceberg-load.md).

---
title: November 18, 2024 — S3-compatible storage for externally managed Apache Iceberg™ tables — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-18-s3-compatible-externally-managed-iceberg-ga.md
section: Release Notes
---

# November 18, 2024 — S3-compatible storage for externally managed Apache Iceberg™ tables — *General Availability*

We are pleased to announce the general availability of support for Amazon S3-compatible storage for
externally managed Iceberg tables.

You can now use an external volume to connect to and query Iceberg tables in an S3-compatible storage location.

For more information, see the following pages:

* [Configure an external volume for S3-compatible storage](../../../user-guide/tables-iceberg-s3-compatible.md)
* [Work with Amazon S3-compatible storage](../../../user-guide/data-load-s3-compatible-storage.md)

---
title: November 18, 2024 — Sensitive data classification
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-18-sensitive-data-classification.md
section: Release Notes
---

# November 18, 2024 — Sensitive data classification

## Automatic Sensitive Data Classification — *Preview*

Snowflake is pleased to announce Automatic Sensitive Data Classification, a serverless feature that can help automatically detect sensitive
data using native and custom classifiers. It can also apply user-defined tags and masking policies on columns automatically
when sensitive data is detected.

For information, see [Use SQL to set up sensitive data classification](../../../user-guide/classify-auto.md).

## Classifier improvements

The following classifiers are now available:

| Country | Classifier |
| --- | --- |
| New Zealand | * Organization identifier now includes business number. * National identifier now includes student number. |
| Japan | * Phone number * Postal code |

The accuracy of the following geo-specific classifiers has been improved:

| Supported country | Classifier |
| --- | --- |
| Australia | Phone Number |
| Canada | * City * Phone number * Street address * Bank account |
| New Zealand | * City * Street address * Bank account |
| United Kingdom | * Postal code * Phone number |
| United States | * City * Phone number * Street address * Bank account |

The accuracy of the following global classifiers has been improved:

* Name
* Latitude
* Longitude
* Payment card

---
title: November 18-21, 2024 — 8.44 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_44.md
section: Release Notes
---

# November 18-21, 2024 — 8.44 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Outbound private connectivity for Snowflake features

Outbound private connectivity lets you create private endpoints in Snowflake to access a cloud platform using the platform’s private
connectivity solution rather than the Internet. This lets you access cloud platform services privately and securely from Snowflake.

With this release, outbound private connectivity is now available for the following Snowflake features:

* External stages using Azure Private Link —- Preview
* External volumes using Azure Private Link —- Preview
* Snowpipe automation using Azure Private Link — Preview

#### External stages using Azure Private Link —- *Preview*

You can configure an external stage and create a private endpoint so bulk loading from Azure storage occurs over Azure Private Link.

For more information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

#### External volumes using Azure Private Link —- *Preview*

You can configure an external volume and create a private endpoint so you can connect Snowflake to your external cloud storage for Iceberg
tables using Azure Private Link instead of the public Internet.

For more information, see [Configure an external volume for Azure](../../user-guide/tables-iceberg-configure-external-volume-azure.md).

#### Snowpipe automation using Azure Private Link — *Preview*

You can configure an external stage and notification integration, and create a private endpoint, so that automatic Snowpipe data loads that
are triggered by Microsoft Azure Event Grid use Azure Private Link instead of the public Internet.

For more information, see [Private connectivity to external stages and Snowpipe automation for Microsoft Azure](../../user-guide/data-load-azure-private.md).

For general information about using outbound private connectivity with these Snowflake features, see [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md).

### Visual Studio Code extension for Snowpark Python — *General availability*

With this release, Snowflake is pleased to announce that the Snowflake Visual Studio Code extension now integrates with Snowpark Python to
provide authoring and debugging features for Snowpark Python code.

These new features include:

* Inline debugging of Snowpark Python functions.
* Syntax highlighting and autocomplete suggestions for Snowflake SQL in Python strings within Python files or notebook cells.
* Syntax highlighting and bracket autocomplete of Jinja templates in Snowflake SQL.

For more information, see [Snowflake Extension for Visual Studio Code](../../user-guide/vscode-ext.md).

## Extensibility updates

### External network access for Azure Gov regions — *General availability*

With this release, we are pleased to announce general availability of external network access in Azure Gov regions. Access network locations
external to Snowflake from within procedure and UDF handler code is now generally available in all regions.

For more information, see [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md).

## Data lake updates

### Specify an external ID for SIGV4 REST catalog integrations

With this release, we are pleased to announce support for specifying an external ID when you create a catalog integration for Apache Iceberg™
REST that uses SIGV4 authentication.

Specifying an external ID lets you use the same IAM role across multiple catalog integrations. This can be useful in testing scenarios when
you need to recreate or replace a catalog integration many times.

For more information, see [CREATE CATALOG INTEGRATION (Apache Iceberg™ REST)](../../sql-reference/sql/create-catalog-integration-rest.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 15-Nov-24 |

---
title: November 20, 2024 — Snowsight rate limits — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-20-snowsight-rate-limits.md
section: Release Notes
---

# November 20, 2024 — Snowsight rate limits — *General Availability*

With this release, we are pleased to announce the general availability of rate limiting features in Snowsight. This functionality
enhances platform security and performance under high traffic conditions, such as DDoS attacks or unexpected surges in request volume.
Controlled limits are posed on the number of requests that can be made to Snowsight within specified time frames, ensuring
consistent availability and stability across all user sessions. This new capability aligns with Snowflake’s commitment to providing a
secure, resilient environment for data operations.

---
title: November 21, 2024 — Logical replication of clones  — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-21-logical-repl-clones.md
section: Release Notes
---

# November 21, 2024 — Logical replication of clones — *General Availability*

With this release, we are pleased to announce the general availability of logical replication of clones.

With logical replication of clones, when the original table and cloned table are
included in the same replication or failover group, the cloned table can be replicated
logically to the target account. As a result, logical replication, versus
physical replication, reduces egress and replica storage costs.

For additional details, see [Logical replication of clones](../../../user-guide/account-replication-considerations.md).

---
title: November 21, 2024 — Snowflake Data Clean Rooms release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-21-dcr.md
section: Release Notes
---

# November 21, 2024 — Snowflake Data Clean Rooms release notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

> **Important:**
>
> You must log out and back into the web app for these updates to take effect in your account.

## Non-overlap metrics

Clean room statistics now include non-overlap metrics when using the Audience Overlap & Segmentation template in the web app. This shows how
many records in your data did not match join IDs in the collaborator’s data. This capability must be enabled by the data provider.

## Unlink datasets API

Users can now unlink datasets that were previously linked to the clean room using the API.

For more information, see [the provider reference API guide](/user-guide/cleanrooms/provider).

## Dynamic table support

Users can now register and use dynamic tables in their clean rooms. To register these objects via the APIs, see `library.register_objects` in the [consumer](../../../user-guide/cleanrooms/consumer.md) or [provider](../../../user-guide/cleanrooms/provider.md) documentation.

## Custom Python code in consumer templates

Consumers who create a custom template can now upload and reference custom Python code in their template.

For more information, see [generate_python_request_template](../../../user-guide/cleanrooms/consumer.md).

## Merkury Identity connector

The Merkury Identity connector is an identity service provided by Merkle to Snowflake Data Clean Room customers so that they can
encode their customer identifiers into encoded Merkury IDs prior to collaborating within the clean room. The Merkury Identity connector
supports multiple type of IDs: HMID (hashed Merkury ID), email (in clear-form, or encoded in MD5, SHA1 or SHA256), device ID, IP address,
and phone (only supported in cleartext).

For more information, see [the identity connectors guide](/user-guide/cleanrooms/connector-identity).

## Google Display & Video 360 - Customer Match activation connector

Push your first-party, custom-audience data into your Google DV360 account. For more information, see [the activation connectors guide](/user-guide/cleanrooms/connector-activation)

---
title: November 25, 2024 — Data governance release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-26-data-governance.md
section: Release Notes
---

# November 25, 2024 — Data governance release notes

## Governance for organization listings through access history

Organization-level access history has been enhanced with columns that provide information about how data provided by
[organizational listings](../../../user-guide/collaboration/listings/organizational/org-listing-about.md) is being queried by consumers. For each query,
data governors can determine which account provided the organizational listing and exactly which data object was accessed. They can also
determine if the data object provided by the organizational listing is protected by a policy (such as a masking policy or row access policy)
in the provider’s account.

For more information, see [Organizational listing governance](../../../user-guide/collaboration/listings/organizational/org-listing-governance.md).

---
title: November 25, 2024 — Snowflake Cortex AI TRANSLATE — Updates
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-25-cortex-translate-update.md
section: Release Notes
---

# November 25, 2024 — Snowflake Cortex AI TRANSLATE — Updates

We are pleased to announce the availability of the following new features and enhancements in this update to the
Snowflake Cortex TRANSLATE function. The TRANSLATE function provides high-quality, reliable translations for call
transcripts, product reviews, social media comments, and other text.

* **Improved translation quality.** Translation quality is now on par wit the most powerful models in the market,
  with no need to optimize a prompt or train a model.
* **Improved translation reliability.** The new version of TRANSLATE never refuses to complete translations.
* **Longer context length.** The supported length of text to be translated has been increased from 1,024 to 4,096
  tokens. (A token is approximately four characters.)
* **Additional languages.** The TRANSLATE function now supports Dutch, Chinese, and Hindi. See the complete
  [list of supported languages](../../../sql-reference/functions/ai_translate.md).
* **Mixed language support.** Text written in a mixture of two languages can now be translated to a single language.
  For example, the TRANSLATE function can now translate “Spanglish” (an informal mix of English and Spanish used in
  parts of the United States) to just English.

For more information, see [TRANSLATE (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/translate-snowflake-cortex.md).

---
title: November 27, 2024 — Snowflake Native Apps: Multiple app installs — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-11-27-na-mult-install.md
section: Release Notes
---

# November 27, 2024 — Snowflake Native Apps: Multiple app installs — *General Availability*

Snowflake is pleased to announce the general availability of multiple installs for a Snowflake Native App. This
feature allows consumers to install multiple instances of an app in their account. For more information, see
[Allow consumers to install multiple instances of an app](../../../developer-guide/native-apps/creating-app-package.md).

---
title: November 29-30, 2023 — 7.42 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_42.md
section: Release Notes
---

# November 29-30, 2023 — 7.42 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### Native Apps: Support for reference and privilege validation in the manifest file — *Preview*

References and privileges are now supported in an APPLICATION object installed in development mode using files on a named stage. This enables
providers using the Snowflake Native App Framework to test references and privilege requests locally before defining a version in the
application package.

For more information, see [Request references and object-level privileges from consumers](../../developer-guide/native-apps/requesting-refs.md).

### Schema detection for JSON and CSV — *General Availability*

With this release, we are pleased to announce the general availability of the schema detection feature for JSON and CSV. The schema detection
feature uses the INFER_SCHEMA function to automatically detect the schema in a set of staged data files and retrieve the column definitions.
The INFER_SCHEMA function general availability now applies to all the following file formats: Apache Parquet, Apache Avro, ORC, JSON and CSV.

For more information, see [Schema detection of column definitions from staged semi-structured data files](../../user-guide/data-load-overview.md).

### Table schema evolution — *General Availability*

With this release, we are pleased to announce the general availability of the table schema evolution feature. The structure of tables in
Snowflake can now evolve automatically to support the structure of new data received from the data sources. Snowflake allows adding new columns
or dropping the NOT NULL constraint from columns missing in new data files.

To enable table schema evolution, you can set the ENABLE_SCHEMA_EVOLUTION parameter to TRUE when you create or alter a table.

For more information, see [Enable automatic table schema evolution](../../user-guide/data-load-schema-evolution.md).

### Apache Iceberg™ tables — *Preview*

With this release, we are pleased to announce the preview of Apache Iceberg™ tables in Snowflake. Iceberg tables for Snowflake combine the performance and query
semantics of regular Snowflake tables with external cloud storage that you manage. They are ideal for maintaining a single copy of data with
interoperability across a variety of compute engines.

For more information, see [Apache Iceberg™ tables](../../user-guide/tables-iceberg.md).

### Self-service: Enabling the ORGADMIN role — *General Availability*

With this release, we are pleased to announce the general availability of a new ALTER ACCOUNT … SET IS_ORG_ADMIN syntax that allows an
organization administrator to enable the ORGADMIN role within a specific account, without contacting Snowflake Support.

Once the ORGADMIN role is enabled for an account, organization administrators can log in to the account and use the role to perform
organization-focused tasks like listing and creating accounts. Enabling the ORGADMIN role in an account also allows queries to access data in
the ORGANIZATION_USAGE schema.

For more information, see [Enabling the ORGADMIN role in an account](../../user-guide/organization-administrators.md).

### Self-service: Deleting an account — *General Availability*

With this release, we are pleased to announce the general availability of self-service account deletion. An organization administrator can now
delete an account without contacting Snowflake Support.

An organization administrator starts the process of deleting an account by dropping it. Once dropped, the account enters a grace period during
which the account can be restored (“undropped”). Snowflake automatically deletes the account when the grace period expires.

To support the process for deleting an account, this release also includes a new syntax for the SHOW ORGANIZATION ACCOUNTS command. When the
HISTORY keyword is appended to the command, the output contains dropped accounts along with additional columns such as scheduled deletion time.

For more information, see [Dropping an account](../../user-guide/organizations-manage-accounts-delete.md).

## Security Updates

### Key pair authentication: Improved troubleshooting

For more information, see [Key Pair Authentication: Troubleshooting](../../user-guide/key-pair-auth-troubleshooting.md).

## SQL Updates

### Structured types — *Preview*

With this release, we are pleased to announce the preview of structured types. A structured type is an ARRAY, OBJECT, or MAP that contains elements
or key-value pairs with specific Snowflake [data types](../../sql-reference-data-types.md).

The following are examples of structured types:

* An ARRAY of INTEGER elements.
* An OBJECT with VARCHAR and NUMBER key-value pairs.
* A MAP that associates a VARCHAR key with a DOUBLE value.

You can use structured types in the following ways:

* You can define a structured type column in an Apache Iceberg™ table.

  The [Iceberg data types](../../user-guide/tables-iceberg-data-types.md) `list`, `struct`, and `map` correspond
  to the structured ARRAY, structured OBJECT, and MAP types in Snowflake.
* You use structured types when accessing data from a structured type column in an Iceberg table.
* You can cast a semi-structured [ARRAY](../../sql-reference/data-types-semistructured.md), [OBJECT](../../sql-reference/data-types-semistructured.md), or [VARIANT](../../sql-reference/data-types-semistructured.md) to a corresponding
  structured type (e.g. an ARRAY to an ARRAY of INTEGER elements). You can also cast a structured type to a semi-structured type.

> **Note:**
>
> Currently, tables other than Iceberg tables do not support structured types. In a regular table, you cannot define a column of a structured type.

For more information, see [Structured data types](../../sql-reference/data-types-structured.md).

## Data Governance Updates

### Row access policies: Reference a protected mapping table in a row access policy — *General availability*

With this release, Snowflake is pleased to announce the general availability for policy administrators to reference a mapping table that is
protected by a row access policy in the policy conditions of a different row access policy. The result is more assurance to compliance officers
when a user queries a table protected by a row access policy. This update entered preview in the 7.32 release.

For more information, see [Protect the mapping table with a row access policy](../../user-guide/security-row-using.md).

## Data Collaboration Updates

### Recurring subscription-based pricing plans for paid listings —– *General Availability*

With this release, we are pleased to announce the general availability of recurring subscription-based pricing plans for paid listings.
With this plan, you can bill consumers upfront on a recurring basis for access to your listing.

For more information, see [Paid Listings Pricing Models](https://other-docs.snowflake.com/collaboration/provider-listings-pricing-model).

### Cross-Cloud Auto-Fulfillment support for sharing a Snowflake Native App — *Preview*

With this release, we are pleased to announce the preview of Cross-Cloud Auto-Fulfillment support for sharing a Snowflake Native App.
Cross-Cloud Auto-Fulfillment lets you, as a provider, share your Snowflake Native App with consumers in other supported regions.

For more information, see [Auto-fulfillment for listings](../../collaboration/provider-listings-auto-fulfillment.md).

As part of this release, the process for upgrading and versioning your application package has been improved, including cross-region support
for DROP APPLICATION PACKAGE and application status in the APPLICATION_STATE view.

For more information, see [Update an app (Legacy)](../../developer-guide/native-apps/update-app.md).

## Release Notes Change Log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-Nov-23 |
| *Recurring subscription-based pricing plans for paid listings* | **Added** to *Data Collaboration Updates* | 27-Nov-23 |
| *Cross-Cloud Auto-Fulfillment support for sharing a Snowflake Native App* | **Added** to *Data Collaboration Updates* | 29-Nov-23 |

---
title: OAuth authentication: Change in network policy behavior (Canceled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2094.md
section: Release Notes
---

# OAuth authentication: Change in network policy behavior (Canceled)

> **Note:**
>
> This behavior change was part of the 2025_06 bundle, but the change has been canceled.
>
> Even though this behavior change was canceled, Snowflake implemented some of the changes outside the behavior change process. To learn how Snowflake enforces network policies that are attached to a security integration for Snowflake OAuth or External OAuth, see:
>
> * [Restricting network traffic for Snowflake OAuth](../../../user-guide/oauth-snowflake-overview.md)
> * [Restricting network traffic for External OAuth](../../../user-guide/oauth-ext-overview.md)

## Snowflake OAuth authentication: Change in the network policy used for a request from client to Snowflake

Snowflake OAuth lets you use a network policy to restrict network traffic from the OAuth client and
from the user who is authenticating. There can be three different network policies restricting this traffic:

* A network policy controlling requests from the OAuth client. This network policy is associated with the security integration that allows
  the client to interact with Snowflake.
* A network policy controlling requests from the user who is authenticating. This network policy is associated with the user.
* An account-level network policy that governs when there isn’t an integration-level or user-level network policy.

This behavior change affects which network policy governs a request from the OAuth client to Snowflake. The following diagram highlights the
request that is affected by the change:

Before the change:
:   The user-level network policy, not the integration-level network policy, governs a request that sends an access token from the OAuth
    client to Snowflake as the Resource Server.

After the change:
:   The integration-level network policy, if specified, governs a request that sends an access token from the OAuth client to Snowflake as the
    Resource Server. If there is no integration-level network policy, the account-level network policy governs.

The network policies that govern other requests to Snowflake have not changed:

* User authorization and user consent requests sent from the user to Snowflake are still governed by the user-level network policy, if
  specified.
* The access token request sent from the OAuth client to Snowflake is still governed by the integration-level network policy, if specified.

## External OAuth: Integration-level network policy takes precedence

When this bundle is enabled, you’ll be able to associate a network policy with an External OAuth security integration. Previously, only a
Snowflake OAuth security integration could be associated with a network policy.

As part of this change, Snowflake will no longer consider user-level network policies when restricting network traffic from the OAuth
client. Snowflake will enforce this change incrementally according to the following schedule:

* **When the change is Enabled by Default**, Snowflake considers the user-level network policy and the integration-level network
  policy when restricting requests from the OAuth client.

  To avoid failures during this period, attach the current user-level network policy to the security integration. The following code shows
  you how to determine the network policy that is assigned to the user, and then assign that same policy to the integration.

  Find the network policy attached to the user:

  ```sqlexample
  SHOW PARAMETERS LIKE 'network_policy' IN USER <user_name>;
  ```

  Attach the network policy returned by the preceding command to the External OAuth security integration:

  ```sqlexample
  ALTER SECURITY INTEGRATION <external_oauth_integration_name>
    SET NETWORK_POLICY = <network_policy_attached_to_user>;
  ```
* **When the change is Generally Enabled**, a user-level network policy has no effect on requests from the OAuth client to Snowflake.
  Snowflake checks these requests against the integration-level network policy, then checks the account-level policy. Because the user-level
  network policy has no effect, you should remove it from the user by running the following command:

  ```sqlexample
  ALTER USER <user_name> UNSET NETWORK_POLICY;
  ```

Ref: 2094

---
title: OAuth: Proper normalization of explicit mixed case role names
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2192.md
section: Release Notes
---

# OAuth: Proper normalization of explicit mixed case role names

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

OAuth role name handling is changing to properly normalize role names that are explicitly specified
using double quotation characters:

Before the change:
:   Role names that use a combination of uppercase and lowercase characters enclosed by double quotes to preserve these cases, such as `RoLe_NaMe`, are capitalized during OAuth client checks. This behavior is unexpected.

After the change:
:   Role names that use a combination of uppercase and lowercase characters enclosed by double quotes to preserve these cases, such as `RoLe_NaMe`, pass OAuth checks without capitalization. The new behavior preserves role names that intentionally use characters with mixed case.

This behavior change corrects the unexpected behavior.

This behavior change is only relevant when a role name is explicitly passed during an OAuth workflow.

The following table lists examples of current and post-change behavior. Row 2 shows the changed behavior.

| Specified role name | Current Behavior | Post-change behavior |
| --- | --- | --- |
| Role1 | ROLE1 | ROLE1 |
| “RoLe1” | ROLE1 | RoLe1 |
| roLe1 | ROLE1 | ROLE1 |
| role1 | ROLE1 | ROLE1 |

Ref: 2192

---
title: Object tagging commands, functions, and views: New column and property
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1777.md
section: Release Notes
---

# Object tagging commands, functions, and views: New column and property

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, commands, functions, and views related to object tagging will change. These changes include
the following:

* The output of the [TAG_REFERENCES](../../../sql-reference/functions/tag_references.md) and
  [TAG_REFERENCES_ALL_COLUMNS](../../../sql-reference/functions/tag_references_all_columns.md) functions in the Snowflake Information Schema
  includes the following new column at the end:

  | Column name | Data type | Description |
  | --- | --- | --- |
  | APPLY_METHOD | VARCHAR | Reserved for future use. |
* The [TAG_REFERENCES view](../../../sql-reference/account-usage/tag_references.md) and the [TAG_REFERENCES_WITH_LINEAGE](../../../sql-reference/functions/tag_references_with_lineage.md) view in
  the ACCOUNT_USAGE schema include the following new column at the end:

  | Column name | Data type | Description |
  | --- | --- | --- |
  | APPLY_METHOD | VARCHAR | Reserved for future use. |
* The output of the [SHOW TAGS](../../../sql-reference/sql/show-tags.md) command contains the following new column at the end:

  | Column name | Description |
  | --- | --- |
  | PROPAGATE | Reserved for future use. |
* The output of the [GET_DDL](../../../sql-reference/functions/get_ddl.md) command can contain a PROPAGATE property when executed to recreate a tag. This
  property is reserved for future use.

Ref: 1777

---
title: Object Tagging: Tag Assignment Not Allowed When Creating Secondary Databases
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-961.md
section: Release Notes
---

# Object Tagging: Tag Assignment Not Allowed When Creating Secondary Databases

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

In the current release, you are not allowed to set tags on a secondary database using the WITH TAG clause when you create or replace a
database using the AS REPLICA OF clause.

Previously:
:   You could specify the WITH TAG clause when you create a secondary database using the AS REPLICA OF clause. For example:

    ```sqlexample
    CREATE DATABASE <name> AS REPLICA OF <primary_database_name>
        WITH TAG (tag_name = 'tag_value');
    ```

Currently:
:   Specifying the WITH TAG clause when creating a secondary database using the AS REPLICA OF clause generates the following error message:

    `SQL compilation error: Tags cannot be assigned to a secondary database replica since it is a read-only database.`

Please update existing workflows to remove the WITH TAG clause when creating secondary databases.

Ref: 961

---
title: Observability Views (Account Usage and Information Schema): New Columns in Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1070.md
section: Release Notes
---

# Observability Views (Account Usage and Information Schema): New Columns in Views

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The ATTEMPT_NUMBER and SCHEDULED FROM columns were added to the following views:

| Column Name | Data Type | Description | Affected Views |
| --- | --- | --- | --- |
| ATTEMPT_NUMBER | NUMBER | The number of manual retries for a given graph run. For tasks which have never been restarted a value of 0 is returned. | In both ACCOUNT_USAGE and INFORMATION_SCHEMA:   * TASK_HISTORY * COMPLETE_TASK_GRAPHS * CURRENT_TASK_GRAPHS |
| SCHEDULED_FROM | TEXT | * SCHEDULE for DAG run as a result of its cron/interval schedule. * EXECUTE TASK for DAG run as a result of ‘executing task’ command. * MANUAL RETRY for DAG runs as a result of manual retry initiated by the user. | In both ACCOUNT_USAGE and INFORMATION_SCHEMA:   * COMPLETE_TASK_GRAPHS * CURRENT_TASK_GRAPHS |

Ref: 1070

---
title: Oct 01, 2025: New OBJECT_VISIBILITY property (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-01-object-visibility.md
section: Release Notes
---

# Oct 01, 2025: New OBJECT_VISIBILITY property (*Preview*)

The OBJECT_VISIBILITY property controls the discoverability of objects in the account, enabling users without explicit access
privileges to find objects and request access. Currently, this property only affects Universal Search and its results.

You can use the OBJECT_VISIBILITY property to do the following:

* Expand object visibility to specific organization accounts, databases, or schemas.
* View and manage an object’s visibility in Universal Search in Snowsight.

For more information, see [Make database objects discoverable in Universal Search](../../../user-guide/ui-snowsight/object-visibility-universal-search.md).

---
title: Oct 02, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-02-dcr.md
section: Release Notes
---

# Oct 02, 2025: Snowflake Data Clean Rooms updates

**Clean Rooms API Version: 10.3**

The following new features and changes are now available in Snowflake Data Clean Rooms:

* **Managed Account Invites Upon Request:** Clean room users now need to reach out to their account representative to enable managed
  account invitations for their account. Each account will have a specific number of invitations available after requests are approved.
  This process is being implemented to ensure that users understand that initiating and accepting these invitations will result in separate
  billing invoices for these accounts.

---
title: Oct 02, 2025: Snowflake-managed MCP server (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-02-mcp-server.md
section: Release Notes
---

# Oct 02, 2025: Snowflake-managed MCP server (*Preview*)

The Snowflake-managed MCP server lets AI agents securely retrieve data from Snowflake accounts without needing to deploy separate infrastructure. You can configure the MCP server to serve Cortex Analyst and Cortex Search as tools on the standards-based interface. MCP clients discover and invoke these tools, and retrieve data required for the application. With managed MCP servers on Snowflake, you can build scalable enterprise-grade applications while maintaining access and privacy controls. The MCP server on Snowflake provides:

* **Standardized integration:** Unified interface for tool discovery and invocation, in compliance with the rapidly evolving standards.
* **Comprehensive authentication:** Snowflake’s built-in OAuth service to enable OAuth-based authentication for MCP integrations.
* **Robust governance:** Role based access control (RBAC) for the MCP server and tools to manage tool discovery and invocation.

For more information, see [Snowflake-managed MCP server](../../../user-guide/snowflake-cortex/cortex-agents-mcp.md).

---
title: Oct 02, 2025: Using the database object explorer in Snowsight to create and manage semantic views (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-02-semantic-views-in-snowsight.md
section: Release Notes
---

# Oct 02, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*General availability*)

Creating and managing semantic views in Snowsight is now generally available and is no longer in
[Preview](../../preview-features.md).

In Snowsight, you can use the database object explorer to create and manage semantic views.

For information, see [Using Snowsight to create and manage semantic views](../../../user-guide/views-semantic/ui.md).

---
title: Oct 03, 2025: Lineage for stored procedures and tasks (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-03-process-lineage.md
section: Release Notes
---

# Oct 03, 2025: Lineage for stored procedures and tasks (*General availability*)

Snowflake is extending its lineage capabilities beyond data and ML lineage to capture processes connecting source and target objects. As you
view the lineage graph in Snowsight, you can now obtain details about a stored procedure or task that resulted in a downstream
object.

You can select the arrow that connects the source and target objects to obtain more information about the stored procedure or task. For
example, if a stored procedure is nested within other stored procedures, you can view details about the stored procedure that is at the top
of the hierarchy of nested procedures.

For more information, see [Data Lineage](../../../user-guide/ui-snowsight-lineage.md).

---
title: Oct 03, 2025: Named scoring profiles for Cortex Search Services (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-03-cortex-search-named-scoring-profiles.md
section: Release Notes
---

# Oct 03, 2025: Named scoring profiles for Cortex Search Services (*General availability*)

Cortex Search Services now support *named scoring profiles*, which allow you to save and reuse scoring configurations
when querying a Cortex Search Service. A scoring configuration consists of optional boost and decay functions, as
well as an optional reranker setting.

Using a named scoring profile lets you easily use a scoring configuration across applications and queries without having
to specify the full scoring configuration each time. If you change the scoring configuration, you only need to update it
in one place, not in every query.

For more information about named scoring profiles, see [Named scoring profiles](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-customize-scoring.md).

---
title: Oct 06, 2025: Hybrid table support for Microsoft Azure (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-06-hybrid-tables-azure-ga.md
section: Release Notes
---

# Oct 06, 2025: Hybrid table support for Microsoft Azure (*General availability*)

Hybrid tables are now generally available in Microsoft Azure commercial regions and no longer in
[preview](../../preview-features.md). For more information, see [Hybrid tables](../../../user-guide/tables-hybrid.md).

---
title: Oct 07, 2024: AWS PrivateLink in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-07-sis.md
section: Release Notes
---

# Oct 07, 2024: AWS PrivateLink in Streamlit in Snowflake (Preview)

With this release, we are pleased to announce the preview of AWS PrivateLink in Streamlit in Snowflake.

AWS PrivateLink is an AWS service for creating private VPC endpoints that allow direct, secure connectivity
between your AWS VPCs and the Snowflake VPC without traversing the public Internet.

For more information, see [Private connectivity for Streamlit in Snowflake](../../../developer-guide/streamlit/object-management/privatelink.md).

---
title: Oct 07, 2025: Query insights in Snowsight (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-07-query-insights-in-snowsight-ga.md
section: Release Notes
---

# Oct 07, 2025: Query insights in Snowsight (*General availability*)

Query insights in Snowsight is now generally available and is no longer in
[Preview](../../preview-features.md).

You can now view [query insights](../../../user-guide/query-insights.md) in Snowsight. The
[Query Profile](../../../user-guide/ui-snowsight-activity.md) tab under Query History now displays insights about conditions
that affect query performance. Each insight includes a message that explains how query performance might be affected and provides
a general recommendation for next steps.

For more information, see [Viewing the query insights in Snowsight](../../../user-guide/query-insights.md).

---
title: Oct 09, 2025: dbt Projects on Snowflake: Recent improvements (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md
section: Release Notes
---

# Oct 09, 2025: dbt Projects on Snowflake: Recent improvements (*Preview*)

dbt Projects on Snowflake now support the following functionalities:

* dbt Project failures show up as failed queries
* Compile on create
* Install deps on compile
* MONITOR privilege
* Accessing execution results is easier

## dbt Project failures show up as failed queries

Any dbt Project errors — like compile or test failures — now appear as query failures. This makes it easier to handle them with tasks or
other orchestration tools. You can view detailed logs using `SELECT SYSTEM$get_dbt_log('<query_id>')`.

> **Important:**
>
> This might cause a breaking change for anyone relying on the previous method of checking the return values to determine dbt Project
> execution outcomes.

## Compile on create

Whenever you deploy or update a dbt Project object, it’s automatically compiled so build artifacts are up to date and Snowsight works
smoothly.

This could cause a breaking change if you’re deploying projects that fail during compilation.

Compilation currently uses the profile in your `profiles.yml` by default. As a workaround, you can update your `profiles.yml`
prior to deployment to point to the production target before deploying. In a future release, you’ll be able to override this with
`DEFAULT_TARGET` on the Project object.

## Install deps on compile

You can optionally run `dbt deps` during deployment to install project dependencies by setting `EXTERNAL_ACCESS_INTEGRATIONS=[...ext]`
on your deploy or update commands. This means you no longer need to include `/dbt_packages` when deploying projects with external
dependencies.

In a future release, compile on create will support the `local:` syntax.

## MONITOR privilege

dbt Projects now support the MONITOR privilege. This allows you to see the execution history, download the build artifacts of a dbt Project
object and download build artifacts of each dbt Project execution. This privilege can be granted at the DATABASE or SCHEMA level.

## Accessing execution results is easier

You can download build artifacts directly from the Query History page or use the following new system functions:

* `SELECT SYSTEM$LOCATE_DBT_ARTIFACTS($latest_query_id)`: Returns the file path for dbt Project artifacts from a run (for example, `snow://dbt/DB_TEST.PUBLIC.DBT_PROJECT_TEST/results/query_id_01bf3f5a-010b-4d87-0000-53493abb7cce/`).
* `SELECT SYSTEM$LOCATE_DBT_ARCHIVE($latest_query_id)`: Returns the location of the dbt Project output archive zip.
* `SELECT SYSTEM$GET_DBT_LOG($latest_query_id)`: Returns the last 1000 lines of the `dbt.log` file. For full logs, download the archive zip.

Use the Snowflake CLI to download these artifacts from the results stage, for example:

```snowcli
snowsql -q “GET 'snow://dbt_project/DB_TEST.PUBLIC.DBT_PROJECT_TEST/results/query_id_01bf3f89-0300-0001-0000-0000000c1229/dbt_artifacts.zip' file:///Users/user_name/Code/temp"
```

This new approach replaces `OUTPUT_ARCHIVE_URL` and improves interoperability with Snowflake CLI and other services.

> **Important:**
>
> dbt Project output logs from executions before this release won’t appear on the Query History page.

For more information, see [dbt Projects on Snowflake](../../../user-guide/data-engineering/dbt-projects-on-snowflake.md).

---
title: Oct 09, 2025: Organization user groups with organizational listings (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-09-org-user-groups-with-org-listings.md
section: Release Notes
---

# Oct 09, 2025: Organization user groups with organizational listings (*Preview*)

Providers can use [organization user groups](../../../user-guide/organization-users.md) to assign consumers to organizational listings.

For more information, see [Use organization user groups with organizational listings](../../../user-guide/collaboration/listings/organizational/org-listings-org-user-groups.md).

---
title: Oct 09, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-09-dcr.md
section: Release Notes
---

# Oct 09, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 10.4

The following new features, enhancements, and fixes are now available in Snowflake Data Clean Rooms:

**Features and enhancements:**

* Clean rooms now supports linking external and Apache Iceberg™ views. Previously, linking external views or Iceberg views would result in
  failure of the clean room; now linking these view types in clean rooms is supported.
* Reference usage grants update: You can now include a dataset with a Snowflake policy defined in a different database than the source. To
  do so, you must [grant your clean room access to that policy database](../../../user-guide/cleanrooms/register-data.md) to be able to
  link the data into a clean room.

**Fixes:**

* If a clean room has a template that depends on a dataset that has become unavailable, previously the analysis would fail, and the
  clean room would become unusable in the UI. Now the template remains available, but the user is prompted to update the clean room to
  replace the missing dataset.

---
title: Oct 09, 2025: Verified query suggestions (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-09-verified-query-suggestions.md
section: Release Notes
---

# Oct 09, 2025: Verified query suggestions (*Preview*)

Verified query suggestions are now available in Snowsight in [preview](../../preview-features.md). Cortex Analyst monitors incoming requests to surface queries for inclusion in a [Verified Query Repository](../../../user-guide/snowflake-cortex/cortex-analyst/verified-query-repository.md), allowing you to craft verified SQL responses for similar queries.

For more information, see [Suggestions for semantic models and views](../../../user-guide/snowflake-cortex/cortex-analyst/verified-query-suggestions.md).

---
title: Oct 10, 2025: Cortex Search Component Scores (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-10-cortex-search-component-scores.md
section: Release Notes
---

# Oct 10, 2025: Cortex Search Component Scores (*Preview*)

Access detailed scoring information for search results using Cortex Search Component Scores. Component scores allow developers to understand how search rankings are determined and debug search performance.

For more information, see
[Query a Cortex Search Service](../../../user-guide/snowflake-cortex/cortex-search/query-cortex-search-service.md).

---
title: Oct 13, 2025: CORTEX_EMBED_USER database role (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-13-cortex-embed-user-db-role.md
section: Release Notes
---

# Oct 13, 2025: CORTEX_EMBED_USER database role (*General availability*)

Snowflake has added a CORTEX_EMBED_USER database role in the SNOWFLAKE database to better manage access to Cortex
embedding functions. Embedding functions, which convert text to a vector of numbers that represent the meaning of the
text, include AI_EMBED, EMBED_TEXT_768, and EMBED_TEXT_1024. This new role allows you to grant users access to embedding
functions without granting them access to other Cortex features. The pre-existing CORTEX_USER role continues to provide
access to Cortex features including embedding functions.

For more information about this role, see [SNOWFLAKE.CORTEX_EMBED_USER database role](../../../sql-reference/snowflake-db-roles.md).

---
title: Oct 15, 2025: Enforced join order with directed joins (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-15-directed-join.md
section: Release Notes
---

# Oct 15, 2025: Enforced join order with directed joins (*General availability*)

Support for directed joins is now generally available and is no longer in
[Preview](../../preview-features.md).

When you run join queries, you can now enforce the join order of the tables using the `DIRECTED` keyword.
When you run a query with a directed join, the first, or left, table is scanned before the second, or right, table.
For example, `o1 INNER DIRECTED JOIN o2` scans the `o1` table before the `o2` table.

Directed joins are useful in the following situations:

> * You are migrating workloads into Snowflake that have join order directives.
> * You want to improve performance by scanning join tables in a specific order.

For more information, see [JOIN](../../../sql-reference/constructs/join.md).

---
title: Oct 16, 2025: AI_EXTRACT function (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-16-ai-extract.md
section: Release Notes
---

# Oct 16, 2025: AI_EXTRACT function (*General availability*)

The Snowflake AI_EXTRACT function lets you extract information from text or document files using large language models.

This release adds the following features to the existing AI_EXTRACT capabilities:

* **Table extraction support:** Extract tabular data from documents, which helps you analyze financial reports, data sheets, invoices, and other documents that contain tabular data.
* **Flexible response formats:** Define the response format using simple object schemas, arrays of questions, or JSON schemas that support both entity and table extraction.
* **Contextual guidance:** Provide context to the model using the optional `description` field; for example, to help the model localize the correct table in a document.
* **Output length:** The maximum output length for entity extraction is 512 tokens per question. For table extraction, the model returns answers that are a maximum of 4096 tokens long.

For more information, see [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md).

---
title: Oct 16, 2025: Cross-region inference for US Commercial Gov
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-16-aisql-cross-region-gov-preview.md
section: Release Notes
---

# Oct 16, 2025: Cross-region inference for US Commercial Gov

Cross-region inference for Snowflake Cortex is now available for US Commercial Government regions on AWS. Cross-region inference on US Commercial Gov securely routes your traffic only through regions operating under the same compliance tier. All processing occurs on FIPS-validated infrastructure, keeping your workloads compliant with security requirements.

For more information, see [Cross-region inference](../../../user-guide/snowflake-cortex/cross-region-inference.md).

---
title: Oct 16, 2025: Organization account in a hybrid organization
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-16-hybrid-orgs.md
section: Release Notes
---

# Oct 16, 2025: Organization account in a hybrid organization

A hybrid organization contains accounts in both regulated regions and non-regulated regions. For example, a hybrid organization can have one
account in a [U.S. SnowGov Region](../../../user-guide/intro-regions.md) and another in a
[commercial region](../../../user-guide/intro-regions.md).

You can now create an [organization account](../../../user-guide/organization-accounts.md) in a hybrid organization, which lets you leverage
organization-level features like [organization users](../../../user-guide/organization-users.md) and
[organization profiles](../../../user-guide/collaboration/organization-profiles/org-profiles-create-manage.md). The organization account also
contains [premium views](../../../user-guide/organization-accounts-premium-views.md) that contain organization-level usage data.

For more information about creating the organization account for a hybrid organization, see
[Compliance considerations for hybrid organizations](../../../user-guide/organization-accounts.md).

---
title: Oct 16, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-16-dcr.md
section: Release Notes
---

# Oct 16, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 10.5

The following new features, enhancements, and bug fixes are now available in Snowflake Data Clean Rooms:

* **Non-overlap Results & Messaging Improvements:** Updated handling to ensure that non-overlap result percentage does not display above
  100%; added updated messaging for non-overlap results being unavailable when filtering by a collaborator’s column.
* **Jinja2 Library Upgrade:** Updated Jinja2 templating library to version 3.1.6 with compatibility improvements.

---
title: Oct 17, 2025: Partitioned writes for Apache Iceberg™ tables (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-17-iceberg-partitioned-writes-ga.md
section: Release Notes
---

# Oct 17, 2025: Partitioned writes for Apache Iceberg™ tables (*General availability*)

Partitioned writes for Iceberg tables are now generally available. With partitioned write support for Iceberg tables, Snowflake improves
compatibility with the wider Iceberg ecosystem and enables accelerated read queries from external Iceberg tools.
You can now use Snowflake to create and write to both Snowflake-managed and
externally managed Iceberg tables with partitioning schemes.

This release includes support for all partition transforms in version 2 of the Apache Iceberg specification.

For more information, see [Iceberg partitioning](../../../user-guide/tables-iceberg-metadata.md).

---
title: Oct 17, 2025: Set a target file size for Apache Iceberg™ tables (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-17-set-target-file-size-ga.md
section: Release Notes
---

# Oct 17, 2025: Set a target file size for Apache Iceberg™ tables (*General availability*)

Setting a target Parquet file size for Iceberg tables is now generally available. Doing so improves cross-engine query performance when you
use an external Iceberg engine such as Apache Spark, Delta, or Trino that’s optimized for larger file sizes. You can set the target file
size when you create a table, or update it later by using the [ALTER ICEBERG TABLE](../../../sql-reference/sql/alter-iceberg-table.md) command.

For more information, see [Set a target file size](../../../user-guide/tables-iceberg-manage.md).

---
title: Oct 17, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-17-iceberg-external-writes-cld-ga.md
section: Release Notes
---

# Oct 17, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*General availability*)

The following Apache Iceberg™ table features are now generally available are no longer in
[Preview](../../preview-features.md):

* Write operations for externally managed Iceberg tables.
* Catalog-linked databases that connect to external Iceberg REST catalogs.

Key capabilities:

* Create new Iceberg tables directly in your remote catalog using Snowflake.
* Perform full DML operations — for example, INSERT, UPDATE, DELETE, MERGE — on externally managed tables.
* Create a Snowflake database that’s linked to your remote Iceberg REST catalog; for example, AWS Glue, Snowflake Open Catalog, and others.
* Modify the properties of a catalog-linked database, including suspending or resuming automatic table discovery.
* Discover and access multiple remote Iceberg tables without individually defining them in Snowflake.
* Use vended credentials with external writes.

  > **Note:**
  >
  > CREATE ICEBERG TABLE (catalog-linked database) … AS SELECT isn’t supported with vended credentials.

For more information, see the following topics:

* [Write support for externally managed Apache Iceberg™ tables](../../../user-guide/tables-iceberg-externally-managed-writes.md)
* [Use a catalog-linked database for Apache Iceberg™ tables](../../../user-guide/tables-iceberg-catalog-linked-database.md)
* [ALTER DATABASE (catalog-linked)](../../../sql-reference/sql/alter-database-catalog-linked.md)

> **Note:**
>
> Update: Billing for catalog-linked databases started on December 15, 2025.

---
title: Oct 20, 2025: Performance Explorer (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-20-performance-explorer.md
section: Release Notes
---

# Oct 20, 2025: Performance Explorer (*Preview*)

You can use Performance Explorer in Snowsight to monitor interactive metrics for SQL workloads.
The metrics show the overall health of your Snowflake environment, query activity, changes to warehouses,
and changes to tables.

For more information, see [Analyzing query workloads with Performance Explorer](../../../user-guide/performance-explorer.md).

---
title: Oct 23, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-23-dcr.md
section: Release Notes
---

# Oct 23, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 10.6

* Private preview features have been updated.

---
title: Oct 29, 2025: CLIENT_POLICY parameter for authentication policies
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-29-client-version-policies.md
section: Release Notes
---

# Oct 29, 2025: CLIENT_POLICY parameter for authentication policies

You can now create an authentication policy that sets the minimum version that is allowed for each specified client type. For more information, see the description of the CLIENT_POLICY parameter in the [CREATE AUTHENTICATION POLICY](../../../sql-reference/sql/create-authentication-policy.md) command.

---
title: Oct 29, 2025: Guided account failover in Snowsight (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-29-guided-account-failover-snowsight.md
section: Release Notes
---

# Oct 29, 2025: Guided account failover in Snowsight (*General availability*)

With this release, guided account failover in Snowsight is generally available. You can now
select all the applicable failover groups and connections and promote them all at the same time,
using Snowsight. That way, you can promote a target account to serve as the source account
in a single step. We refer to this operation as a bulk failover.

For more information about account failover, see [Failing over account objects](../../../user-guide/account-replication-failover-failback.md).

---
title: Oct 29, 2025: Snowflake Native Apps: Shareback (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-29-nativeapps-shareback.md
section: Release Notes
---

# Oct 29, 2025: Snowflake Native Apps: Shareback (*Preview*)

Your Snowflake Native Apps can now securely request permission from consumers to share data back with you (the provider) or designated third parties.

This powerful capability supports essential business needs such as compliance reporting, telemetry and analytics sharing, and data preprocessing by providing a secure, governed channel for data exchange.

For more information, see [Request data sharing with app specifications](../../../developer-guide/native-apps/requesting-app-specs-listing.md).

---
title: Oct 30, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-30-dcr.md
section: Release Notes
---

# Oct 30, 2025: Snowflake Data Clean Rooms updates

## Clean Rooms API Version: 11.0

The following new features and enhancements are now available in Snowflake Data Clean Rooms:

* **Enhanced error messaging:** When IP addresses are blocked by network policies, enhanced error messages now provide better feedback to users.
* **Autodetection of modified or removed data sources:** If a data source becomes unavailable after a clean room is created or configured, the edit flow in the UI now prompts the user to pick from a current list of available data objects and prompts for removal of unavailable data sources.
* Updates to private preview features.

---
title: Oct 31, 2024: AWS PrivateLink in Streamlit in Snowflake (General Availability)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-31-sis-privatelink.md
section: Release Notes
---

# Oct 31, 2024: AWS PrivateLink in Streamlit in Snowflake (General Availability)

With this release, we are pleased to announce the general availability of AWS PrivateLink in Streamlit in Snowflake.

AWS PrivateLink is an AWS service for creating private VPC endpoints that allow direct, secure connectivity
between your AWS VPCs and the Snowflake VPC without traversing the public Internet.

For more information, see [Private connectivity for Streamlit in Snowflake](../../../developer-guide/streamlit/object-management/privatelink.md).

---
title: Oct 31, 2024: Custom themes in Streamlit in Snowflake (Preview)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-31-sis-custom-themes.md
section: Release Notes
---

# Oct 31, 2024: Custom themes in Streamlit in Snowflake (Preview)

With this release, we are pleased to announce support for custom themes in Streamlit in Snowflake.

For more information, see [Streamlit documentation](https://docs.streamlit.io/develop/concepts/configuration/theming).

---
title: Oct 31, 2025: Organization-level findings in the Trust Center
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-31-trust-center-org-findings.md
section: Release Notes
---

# Oct 31, 2025: Organization-level findings in the Trust Center

Use the Trust Center to gain insights into the security violations found in the accounts of an organization. These insights include
the following information:

* The number of violations in the organization.
* The accounts with the most critical violations.
* The number of violations for each account in the organization. You can select an account to drill down into the individual violations in
  the account.

For more information, see [Trust Center findings](../../../user-guide/trust-center/overview.md).

---
title: Oct 31, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-10-31-na-spcs-gcp-ga.md
section: Release Notes
---

# Oct 31, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*General availability*)

With this release, support for Snowflake Native App with Snowpark Container Services on Google Cloud is generally available. Apps with containers can be
deployed and operated on Google Cloud.

See [About Snowflake Native Apps with Snowpark Container Services](../../../developer-guide/native-apps/native-apps-about.md) for information on Snowflake Native App with Snowpark Container Services.
See [Support for private connectivity, VPS, and government regions](../../../developer-guide/native-apps/limitations.md) for more information on platforms supported by the Snowflake Native App Framework.

---
title: October 01, 2024 — Cortex Fine-tuning Sharing — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-01-cortex-finetuning-sharing.md
section: Release Notes
---

# October 01, 2024 — Cortex Fine-tuning Sharing — *Preview*

We are pleased to announce that you can now share [Cortex Fine-tuning](../../../user-guide/snowflake-cortex/cortex-finetuning.md) models
using [Data Sharing](../../../user-guide/data-sharing-intro.md). For more information. see [Sharing models](../../../user-guide/snowflake-cortex/cortex-finetuning.md).

---
title: October 02, 2024 — Notebooks on Container Runtime — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-02-notebooks-on-spcs.md
section: Release Notes
---

# October 02, 2024 — Notebooks on Container Runtime — *Preview*

With this release, we are pleased to announce the preview of Snowflake Notebooks on Container Runtime.
You can now run Snowflake Notebooks on Snowpark Container Services through Container Runtime. Snowpark Container Services gives you a flexible container
infrastructure that supports building and operationalizing a wide variety of workflows entirely within Snowflake. Container Runtime
provides software and hardware options to support advanced data science and machine learning workloads on Snowpark Container Services.

For more information, see [Notebooks on Container Runtime](../../../developer-guide/snowflake-ml/notebooks-on-spcs.md).

---
title: October 02, 2024 — Organization accounts — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-01-organization-account.md
section: Release Notes
---

# October 02, 2024 — Organization accounts — *Preview*

With this release, we are pleased to announce the preview of the organization account, which is a new type of account that organization
administrators use to perform their tasks. During this preview, organization administrators can continue to use an ORGADMIN-enabled account
to perform their tasks, but eventually all organization-level tasks for multi-account organizations will be performed using the organization
account.

The ORGANIZATION_USAGE schema in the organization account contains premium views that aggregate account usage across accounts. These premium
views are not found in the ORGANIZATION_USAGE schema of a regular account, and incur additional storage and compute costs. For example, the
organization account allows you to use a single view to track access history across the organization, something you can’t do in a regular
account.

For more information, see [Organization accounts](../../../user-guide/organization-accounts.md).

---
title: October 03, 2024 — New Cortex LLM Function - PARSE_DOCUMENT — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-03-parse-document.md
section: Release Notes
---

# October 03, 2024 — New Cortex LLM Function - PARSE_DOCUMENT — *Preview*

With this release, we are pleased to announce the preview of a new Snowflake Cortex PARSE_DOCUMENT function for text and layout
extraction from documents. The PARSE_DOCUMENT function gives you the ability to extract text or layout natively using SQL from
documents stored in a Snowflake or an external stage.

PARSE_DOCUMENT combines powerful Optical Character Recognition (OCR) capabilities with machine learning models to identify text content,
information stored in tables, and the structural elements of PDF documents. For details, see [Parsing documents with AI_PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/parse-document.md).

---
title: October 03-05, 2023 — 7.35 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_35.md
section: Release Notes
---

# October 03-05, 2023 — 7.35 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### Budgets — *Preview*

With this release, we are pleased to announce the preview of Budgets which enables account-level monitoring and notification of
Snowflake credit usage for a group of specific Snowflake objects. You can define a monthly spending limit on the compute costs for
supported objects in your account. In addition to your account budget, you can create custom budgets to monitor credit usage for a
custom group of objects. Budgets sends you a notification when your credit usage is on track to exceed your monthly limit.

For more information, see [Monitor credit usage with budgets](../../user-guide/budgets.md).

### Dynamic tables refreshed on creation by default — *Preview*

With this release, we are pleased to announce that Snowflake dynamic tables, including those with a downstream lag, now refresh
upon creation.

For more information, see [Understanding dynamic table initialization](../../user-guide/dynamic-tables-refresh.md).

### Dynamic tables new sharing capabilities — *Preview*

With this release, we are pleased to announce that Snowflake has added sharing capabilities for dynamic tables, allowing for
easier collaboration.

For more information, see [Data sharing with dynamic tables](../../user-guide/dynamic-tables-data-sharing.md).

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Context Functions (Session) | [CURRENT_ACCOUNT_NAME](../../sql-reference/functions/current_account_name.md) | Returns the name of the current account.  The account name is used in the preferred [account identifier](../../user-guide/admin-account-identifier.md). |

## Data Collaboration Updates

### Allow non-admins to set up Cross-Cloud Auto-Fulfillment

With this release, users with the ACCOUNTADMIN role can delegate privileges to non-admin roles to allow other users to set up
auto-fulfillment and share listings with consumers in other regions.

For details, see [Use listings as a provider](../../collaboration/provider-becoming.md).

### Offer a limited trial of a data product on the Snowflake Marketplace — *Preview*

With this release, we are pleased to announce the preview of offering a limited trial of a data product. You can now offer a
trial or sample of a data product to any consumer on the Snowflake Marketplace. Consumers can then trial your data product and
request unlimited access to a full data product. As a provider, you can then choose which requests to fulfill and offer unlimited
access to your data product privately to specific consumers.

For more details about different ways to share data products, see [About listings](https://other-docs.snowflake.com/en/collaboration/collaboration-listings-about).

For details about preparing to offer a limited trial of a data product, see
[Preparing to offer a limited trial listing](https://other-docs.snowflake.com/collaboration/provider-listings-preparing#label-prepare-limited-trial-listing).

> **Note:**
>
> With the release of limited trials of data products, personalized listings will no longer be available for new listings.

## Web Interface Updates

### Task graph run debugging — *Preview*

With this release, we are pleased to announce the preview of task graph run debugging.

For a given task graph in your account, you can review the run history to identify critical failing tasks that prevent a graph from
completing, long-running tasks, inefficient task graphs, and other monitoring and debugging cases.

For more information, see [View tasks and task graphs in Snowsight](../../user-guide/ui-snowsight-tasks.md).

## Release Notes Change Log

| Announcement | Update |
| --- | --- |
| *Task graph run debugging — Preview* | **Added** to *Web Interface Updates* |
| *Budgets — Preview* | **Added** to *New Features* |

---
title: October 04, 2024 — Cortex Analyst integration with Cortex Search — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-04-cortex-analyst-search-integration.md
section: Release Notes
---

# October 04, 2024 — Cortex Analyst integration with Cortex Search — *Preview*

We are pleased to announce the preview of Cortex Analyst integration with Cortex Search.

You can now integrate Cortex Analyst with Cortex Search to mprove literal string searches to help Cortex Analyst generate more accurate
SQL queries. [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md)
is a feature that enables low-latency, high-quality “fuzzy” search over text data. You can create a Cortex Search service to do a semantic
search over the underlying database column to find any literal values needed for Cortex Analyst to use in the SQL query that answers the
user’s question.

For more details, see [Improve literal search to enhance Cortex Analyst responses](../../../user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-search-integration.md).

---
title: October 04, 2024 — Cortex Search — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-04-cortex-search-ga.md
section: Release Notes
---

# October 04, 2024 — Cortex Search — *General Availability*

We are pleased to announce the general availability of Cortex Search.

Cortex Search is a text search service that simplifies the development of high-quality search and large language model (LLM) chatbot
applications. Cortex Search is a hybrid search service, leveraging vector and keyword search for optimal quality. You can use Cortex Search
as the retrieval service in a RAG chatbot or as a standalone search engine.

For more details, see [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

---
title: October 04, 2024 — Differential Privacy — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-04-differential-privacy.md
section: Release Notes
---

# October 04, 2024 — Differential Privacy — *General Availability*

With this release, we are pleased to announce the general availability of differential privacy in Snowflake.

Differential privacy is a widely recognized standard for data privacy that limits the risk that someone could leak sensitive information
from a sensitive dataset, even if they are carrying out a targeted privacy attack. Data providers implement differential privacy by
assigning privacy policies to their sensitive tables and views. As analysts query the protected data, Snowflake uses rigorous mathematics to
ensure that they cannot identify individuals and entities in the dataset to an unacceptable degree of certainty.

Data owners can now change the default settings of a privacy budget, and analysts and data owners can call a system function to estimate how
many more queries can be run before reaching the limit of a privacy budget.

For more information, see [Differential privacy in Snowflake](../../../user-guide/diff-privacy/differential-privacy-overview.md).

---
title: October 04, 2024 — Suggested Questions for Cortex Analyst — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-04-cortex-analyst-suggested-questions.md
section: Release Notes
---

# October 04, 2024 — Suggested Questions for Cortex Analyst — *Preview*

We are pleased to announce the preview of Suggested Questions for Cortex Analyst.

The Suggested Questions feature in Cortex Analyst provides relevant suggestions for questions your users can ask while
interacting with your Cortex Analyst-powered conversational app. Use this feature to help your users get started.

For more details, see [Onboarding questions in Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst/suggested-questions-feature.md).

---
title: October 07, 2024 — Updated event sharing for Snowflake Native Apps — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-07-na-event-sharing.md
section: Release Notes
---

# October 07, 2024 — Updated event sharing for Snowflake Native Apps — *General Availability*

We are pleased to announce updated event sharing functionality for the Snowflake Native App Framework. This update
allows providers and consumers more granular control over the log messages and trace events that
are shared with providers.

For information on how providers can configure log messages, event traces, and event sharing for an app, see
[Use logging and event tracing for an app](../../../developer-guide/native-apps/event-about.md). For information on how consumers can specify the
log messages and trace events they want to share with providers, see
[Set up event tracing for an app](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging).

---
title: October 07-09, 2024 — 8.38 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2024/8_38.md
section: Release Notes
---

# October 07-09, 2024 — 8.38 Release Notes (with behavior changes)

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Behavior change bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2024_08](../bcr-bundles/2024_08_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2024_07](../bcr-bundles/2024_07_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2024_06](../bcr-bundles/2024_06_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for January 2025; however, this schedule
is subject to change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Semi-structured | [ARRAY_REVERSE](../../sql-reference/functions/array_reverse.md) | Returns an array with the elements of the input array in reverse order. |

### Query objects larger than 16 MB in files on a stage

With this release, you can now query objects up to 128 MB in files on a stage. Although you still can’t store objects larger than 16 MB in a
column, you can reduce their size before storing them in columns. For example, you can split large objects across multiple columns or rows,
transform nested JSON into a tabular format, or simplify complex geometries.

> **Note:**
>
> With the 9.17 release, you can now store objects larger than 16 MB in a column. For more information,
> see [Size limits for database objects](../../user-guide/data-load-considerations-prepare.md).

## Data pipeline updates

### Dynamic tables: Updates to input types

Before this release, if a dynamic table read from a regular table that was dropped and replaced by a new dynamic table with the same name,
the refresh would fail, requiring the dynamic table to be recreated. Now, the refresh will succeed without user intervention.

The same applies if a dynamic table reads from another dynamic table, and that dynamic table is replaced by a table. In both cases, the
refresh now succeeds.

This change makes table/view/dynamic table input types interchangeable, but it only applies to dynamic table created after October 10, 2024.

## Data governance updates

### Data quality: New SYSTEM$DATA_METRIC_SCAN function

With this release, we are pleased to announce a new system function, SYSTEM$DATA_METRIC_SCAN, that returns records that failed a data
quality check. Previously, you could use data metric functions to identify data quality issues but did not have the ability to pinpoint the
records responsible for the problems.

With this new capability, you can use a single command to extract records that failed data quality checks. This simplifies the
identification of problematic records in your data and makes it easier to remediate data quality issues.

For more information, see [SYSTEM$DATA_METRIC_SCAN](../../sql-reference/functions/system_data_metric_scan.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 04-Oct-24 |
| *Apache Iceberg™ tables: New SYSTEM$VERIFY_EXTERNAL_VOLUME function* | **Postponed** to a later release. | 08-Oct-2024 |
| *Data quality: New SYSTEM$DATA_METRIC_SCAN function* | **Added** to *Data governance updates* section. | 09-Oct-2024 |
| *Dynamic tables: Updates to input types* | **Added** to *Data pipeline updates* section | 13-Jun-2025 |

---
title: October 08, 2024 — Native App support for AWS PrivateLink — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-08-na-aws-pl.md
section: Release Notes
---

# October 08, 2024 — Native App support for AWS PrivateLink — *Preview*

We are pleased to announce the preview of support for AWS PrivateLink in the Snowflake Native App Framework.
For general information, see [AWS PrivateLink and Snowflake](../../../user-guide/admin-security-privatelink.md).

> **Note:**
>
> AWS PrivateLink has a known limitation where links in email notifications from apps do not
> correctly link to a private link account.

---
title: October 09-10, 2023 — 7.36 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_36.md
section: Release Notes
---

# October 09-10, 2023 — 7.36 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Extensibility Updates

### Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *Preview*

With this release, we are pleased to announce support for Python 3.11 in Snowpark Python, Python UDFs, Python UDTFs and Python stored procedures as a preview feature to all accounts.

For more information, see:

* [Setting up your development environment for Snowpark Python](../../developer-guide/snowpark/python/setup.md).
* [Introduction to Python UDFs](../../developer-guide/udf/python/udf-python-introduction.md)
* [Writing stored procedures with SQL and Python](../../developer-guide/stored-procedure/python/procedure-python-overview.md).

## Data Collaboration Updates

### Company name for listing analytics

With this release, you can see the name of the company or organization that is a consumer of your listings on the Analytics tab in
Provider Studio. Previously, you could see the account name and organization name for a consumer, but not the name of the company.

For more details, see [Monitor listing use](../../collaboration/provider-listings-monitor-studio.md).

## Web Interface Updates

### Accessing billing usage statements — *General Availability*

With this release, we are pleased to announce the general availability of using Snowsight to view and download billing usage statements, starting with the July 2023 statement. The retention period will be 1 year.

For more information, see [Access a billing usage statement](../../user-guide/billing-usage-statement.md).

### Viewing Query History in worksheets — *Preview*

With this release, we are pleased to announce the preview of Query History in worksheets in Snowsight. When you view Query History
for a worksheet, you can review the queries run in a Snowsight worksheet, as well as the query results.

For more information, see [View query history](../../user-guide/ui-snowsight-query.md).

## Release Notes Change Log

| Announcement | Update |
| --- | --- |
| *Company name for listing analytics* | **Added** to *Data Collaboration Updates* |
| *Logging and tracing from handler code — General Availability* | **Removed** from *New Features* |

---
title: October 10, 2024 — CORTEX_FINE_TUNING_USAGE_HISTORY view — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-10-cortex-finetuning-usage-history.md
section: Release Notes
---

# October 10, 2024 — CORTEX_FINE_TUNING_USAGE_HISTORY view — *General Availability*

With this release, we are pleased to announce the general availability of the CORTEX_FINE_TUNING_USAGE_HISTORY view in the
Account Usage schema, giving
you the ability to query the usage history for using [Cortex Fine-tuning](../../../user-guide/snowflake-cortex/cortex-finetuning.md).

This new view allows you to track consumption associated with each fine-tuning job, aggregated on the hourly level.

For more information, see [CORTEX_FINE_TUNING_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_fine_tuning_usage_history.md).

---
title: October 10, 2024 — CORTEX_SEARCH_SERVING_USAGE_HISTORY view — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-10-cortex-search-serving-usage-history.md
section: Release Notes
---

# October 10, 2024 — CORTEX_SEARCH_SERVING_USAGE_HISTORY view — *General Availability*

With this release, we are pleased to announce the general availability of the CORTEX_SEARCH_SERVING_USAGE_HISTORY view in the
Account Usage schema, giving
you the ability to query the usage history for serving for [Cortex Search](../../../user-guide/snowflake-cortex/cortex-search/cortex-search-overview.md).

This new view allows you to track consumption associated with each Cortex Search service, aggregated on the hourly level.

For more information, see [CORTEX_SEARCH_SERVING_USAGE_HISTORY view](../../../sql-reference/account-usage/cortex_search_serving_usage_history.md).

---
title: October 14, 2024 — Cortex Analyst: New regions
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-14-new-regions-cortex-analyst.md
section: Release Notes
---

# October 14, 2024 — Cortex Analyst: New regions

We’re pleased to announce that [Cortex Analyst](../../../user-guide/snowflake-cortex/cortex-analyst.md) is now available in
the following additional regions:

* AWS ap-southeast-2 (Sydney)
* AWS eu-west-1 (Ireland)

---
title: October 14, 2024 — Snowflake Data Clean Rooms release notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-14-snowflake-data-clean-rooms.md
section: Release Notes
---

# October 14, 2024 — Snowflake Data Clean Rooms release notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Clean room overlap stats

Clean room statistics now include overlap statistics. These statistics describe how many distinct identifiers within join columns belong to
a certain group based on the attribute columns enabled by the template.

For more information, see [Run an analysis in the UI](../../../user-guide/cleanrooms/v1/web-app-working.md).

## Provider-initiated activation for third-party connectors

Providers can now activate audiences to third-party activation endpoints (including Google Ads, Meta, Ads, LiveRamp, The Trade Desk, and
Yahoo). Note that consumers still need to enable this option for providers while installing the clean room.

For more information, see [Working with Clean Rooms](../../../user-guide/cleanrooms/v1/activation.md).

## Security scans for custom templates

Providers can view the results of security scans that run automatically to help identify vulnerabilities in custom templates created using
the developer APIs. This helps identify parts of the custom template that might be susceptible to SQL injection attacks.

For more information, see [Security scans for custom templates](../../../user-guide/cleanrooms/scan-custom-template.md).

---
title: October 14-17, 2024 — 8.39 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_39.md
section: Release Notes
---

# October 14-17, 2024 — 8.39 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Cortex Analyst fully supported in Streamlit in Snowflake

With this release, [Cortex Analyst](../../user-guide/snowflake-cortex/cortex-analyst.md) is now fully supported in Streamlit in Snowflake.
You no longer need to implement an authentication workaround to use Cortex Analyst in a Streamlit in Snowflake app.

For a step by step example, see [Creating a Streamlit in Snowflake App](../../user-guide/snowflake-cortex/cortex-analyst.md).

## Data pipeline updates

### Dynamic tables: Changes to the output of the GET_DDL function

With this release, the GET_DDL function will return `target_lag` instead of `lag` for dynamic tables to match the CREATE DYNAMIC TABLE
statement.

## Data lake updates

### Apache Iceberg™ tables: New SYSTEM$VERIFY_EXTERNAL_VOLUME function

With the new SYSTEM$VERIFY_EXTERNAL_VOLUME function, you can validate the configuration of an external volume for Apache Iceberg™ tables.

For more information, see [SYSTEM$VERIFY_EXTERNAL_VOLUME](../../sql-reference/functions/system_verify_external_volume.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 11-Oct-24 |
| Cortex Analyst fully supported in Streamlit in Snowflake | **Added** to *New features* section | 16-Oct-24 |

---
title: October 16-17, 2023 — 7.37 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_37.md
section: Release Notes
---

# October 16-17, 2023 — 7.37 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## New Features

### Logging and tracing from handler code — *General Availability*

With this release, we are pleased to announce the general availability of logging and tracing from handler code, which was previously available as a preview feature.

With this feature, you can emit log and trace event data from UDF and procedure handler code so that the data is stored in an event table associated with your account. You can then query the stored data to analyze it.

For more information, see [Logging, tracing, and metrics](../../developer-guide/logging-tracing/logging-tracing-overview.md).

## Extensibility Updates

### Reading files with a Python function or procedure — *General Availability*

With this release, we are pleased to announce the general availability of Python support for reading files with the `SnowflakeFile` class.

`SnowflakeFile` is a new class in the `snowflake.snowpark.files` module that provides dynamic read access for files on an internal or external stage. With `SnowflakeFile`, you can stream files to accomplish tasks such as reading unstructured data or using your own machine learning model in a user-defined function (UDF), user-defined table function (UDTF), or stored procedure.

For more information, see:

> * [Reading Files from a UDF with the Snowpark API](../../developer-guide/udf/python/udf-python-examples.md)
> * [Reading a File from a Python UDF Handler](../../developer-guide/snowpark/python/creating-udfs.md)
> * [Reading a File from a Python Stored Procedure Handler](../../developer-guide/stored-procedure/python/procedure-python-read-files.md)
> * [Reading Files from a Stored Procedure with the Snowpark API](../../developer-guide/snowpark/python/creating-sprocs.md)

### Reading files with a Scala function or procedure handler — *General Availability*

With this release, we are pleased to announce the general availability of support for reading staged files with a UDF or procedure handler code written in Scala.

For more information, see [Reading a File with a Scala UDF](../../developer-guide/udf/scala/udf-scala-examples.md) and [Reading files with a Scala stored procedure](../../developer-guide/stored-procedure/scala/procedure-scala-read-files.md).

## SQL Updates

### Fixed an issue with column aliases for aggregates and the GROUP BY ALL clause

Previously, if a SELECT statement with a [GROUP BY ALL](../../sql-reference/constructs/group-by.md) clause defined and referred to a column alias
for an [aggregate](../../sql-reference/functions-aggregation.md), the statement would fail with the error
`not a valid group by expression`.

For example, the following statement has a GROUP BY ALL clause, defines the column alias `total` for an aggregate, and refers
to that alias (`ROUND(total)`):

```sqlexample
SELECT ... , SUM(my_column) AS total, ROUND(total) FROM mytable GROUP BY ALL ... ;
```

This statement would fail with the following error message:

```output
Error Code: 000979
  Error Message: SQL compilation error:
    [SUM(MYTABLE.MY_COLUMN)] is not a valid group by expression
```

This issue has been fixed, and these types of statements no longer fail with `not a valid group by expression` errors.

## Web Interface Updates

### Can no longer add or manage payment details using Classic Console

With this release, customers can no longer add or manage payment details for Snowflake On Demand using the Classic Console web
interface. Instead, you must use Snowsight to manage payment details.

For details, see [Converting to a paid account](../../user-guide/admin-trial-account.md).

## Release Notes Change Log

| Announcement | Update |
| --- | --- |
| *Can no longer add or manage payment details using Classic Console* | **Added** to *Web Interface Updates* |
| *Fixed an issue with column aliases for aggregates and the GROUP BY ALL clause* | **Added** to *SQL Updates* |

---
title: October 18, 2024 —Apache Iceberg™ tables: Support for Snowflake Open Catalog — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-18-snowflake-open-catalog-ga.md
section: Release Notes
---

# October 18, 2024 —Apache Iceberg™ tables: Support for Snowflake Open Catalog — *General Availability*

With this release, Snowflake is pleased to announce the general availability of support for integrating Apache Iceberg™ tables in Snowflake
with Snowflake Open Catalog, which was previously named Polaris Catalog. With general availability, we’ve made the following updates:

* When syncing a Snowflake-managed table with Snowflake Open Catalog, Snowflake performs validation when you attempt to create the following objects:

  + A catalog integration
  + An Iceberg table that you’re syncing to Open Catalog

  This validation checks whether the configuration for the catalog integration or table will successfully sync the Iceberg table to
  Open Catalog.
* When you modify the properties for an existing Apache Iceberg™ table by specifying the name of a catalog integration for Open Catalog,
  validation now checks whether the configuration for the table will successfully sync the Iceberg table to Open Catalog.

For more information about the updates available with Snowflake Open Catalog general availability in Open Catalog, see the
[Snowflake Open Catalog release notes](https://other-docs.snowflake.com/en/opencatalog/release-notes)

---
title: October 21, 2024 — Document AI — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-21-document-ai.md
section: Release Notes
---

# October 21, 2024 — Document AI — *General Availability*

With this release, we are pleased to announce the general availability of Document AI.

Document AI enables setting up intelligent document processing (IDP) workflows within Snowflake by extracting information from documents,
such as invoices or contracts, and directly applying it to operational workflows. Document AI is powered by Snowflake Arctic-TILT
(Text Image Layout Transformer), a proprietary large language model (LLM).

With Document AI, you can prepare pipelines for continuous processing of new documents of a specific type, and turn
unstructured data from documents into structured data in tables.

Document AI is available to accounts in AWS and Microsoft Azure commercial regions,
with some exceptions.

---
title: October 21-23, 2024 — 8.40 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_40.md
section: Release Notes
---

# October 21-23, 2024 — 8.40 Release Notes

> **Attention:**
>
> The release has completed.
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Trust Center: New Threat Intelligence scanner package

The Threat Intelligence scanner package is a new scanner package available in the Trust Center. This scanner package lets you discover risky
users based on user type, authentication methods, authentication policies, and network policies used. This scanner package provides a risk
severity for each risky user, to help you prioritize which users to address first.

For more information, see the following references:

* [Threat Intelligence scanner package](../../user-guide/trust-center/overview.md)
* [User TYPE properties](../../sql-reference/sql/create-user.md)
* [ACCOUNTADMIN ADMIN_USER_TYPE property](../../sql-reference/sql/create-account.md)

### Estimate the cost of Automatic Clustering — *General availability*

With this release, we are pleased to announce the general availability of the system function, SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS,
which estimates the cost of enabling Automatic Clustering for a table and maintaining the table in a well-clustered state. It can also
estimate the cost of changing the clustering key for a clustered table.

For more information, see [SYSTEM$ESTIMATE_AUTOMATIC_CLUSTERING_COSTS](../../sql-reference/functions/system_estimate_automatic_clustering_costs.md).

### Snowflake REST APIs — *General availability*

With this release, we are pleased to announce the general availability of Snowflake REST APIs.

Snowflake REST APIs for resource management provide a set of endpoints that lets users programmatically interact with and control
various resources within the Snowflake Data Cloud.

In this release, Snowflake REST APIs includes the following updates:

* Added the following new Snowflake resources:

  + Account
  + Alert
  + Catalog integration
  + Event table
  + External volume
  + Network policy
  + Notebook
  + Notification integration
  + Pipe
  + Procedure
  + Stream
  + User-defined function
  + View
* Added the following access control endpoints to replace the endpoints in the deprecated Grant API:

  + `/api/v2/roles/{name}/grants`
  + `/api/v2/roles/{name}/grants:revoke`
  + `/api/v2/databases/{database}/database-roles/{name}/grants`
  + `/api/v2/databases/{database}/database-roles/{name}/grants:revoke`
  + `/api/v2/users/{name}/grants`
  + `/api/v2/users/{name}/grants:revoke`
* Added support for PUT endpoints in the Service and Compute Pool APIs
* Deprecated the Grant API and various endpoints (see Deprecated features)

For more information, see [Snowflake REST APIs](../../developer-guide/snowflake-rest-api/snowflake-rest-api.md).

> **Note:**
>
> Snowflake REST APIs are not supported in the [Snowflake SnowGov regions](../../user-guide/intro-regions.md).

## Deprecated features

### Snowflake REST APIs

In this release, Snowflake REST APIs deprecated the following APIs and endpoints:

* Grant API
* Database API endpoints:

  + `/api/v2/databases/{name}:from_share`
* Table API endpoints:

  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:as_select`
  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:using_template`
  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:create_like`
  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:suspend_recluster`
  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:resume_recluster`
  + `/api/v2/databases/{database}/schemas/{schema}/tables/{name}:swapwith`
* Task API endpoints:

  + `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}/current_graphs`
  + `/api/v2/databases/{database}/schemas/{schema}/tasks/{name}/complete_graphs`
* Warehouse API:

  + `/api/v2/warehouses/{name}:use`

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Information Schema, Table Functions | [SERVERLESS_ALERT_HISTORY](../../sql-reference/functions/serverless_alert_history.md) | This table function is used for querying the serverless alert usage history. The information returned by the function includes the alert name and credits consumed by runs of each alert. |

## Data lake updates

### Apache Iceberg™ tables: Catalog integration for Iceberg REST — *General availability*

With this release, we are pleased to announce the general availability of REST catalog integrations for externally managed Iceberg tables.
This feature lets you connect Snowflake to Iceberg tables in a remote
catalog that complies with the open source Apache Iceberg REST OpenAPI specification.

For more information, see [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 18-Oct-24 |
| Apache Iceberg™ tables: Catalog integration for Iceberg REST | **Added** to *Data lake updates* section | 23-Oct-24 |
| *Trust Center: New Threat Intelligence scanner package* | **Added** to *New Features* section | 25-Oct-24 |

---
title: October 23-24, 2023 — 7.38 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_38.md
section: Release Notes
---

# October 23-24, 2023 — 7.38 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release Notes Change Log.

## Security Updates

### Network rules support Azure private endpoints — *Preview*

With this release, Snowflake is pleased to announce that the preview of network rules has been expanded to include support for the identifiers
of Azure private endpoints. Now, you can use network rules with network policies to restrict access to the Snowflake service when the request
originates from an Azure private endpoint.

For details about using network rules with network policies, see [Working with Network Rules](../../user-guide/network-policies.md).

## SQL Updates

### ALTER TABLE: Support for IF [NOT] EXISTS with ADD COLUMN and DROP COLUMN

The ALTER TABLE command now supports the keywords IF NOT EXISTS with ADD COLUMN and IF EXISTS with DROP COLUMN:

* When adding a column, you can specify IF NOT EXISTS to add the column only if no column with that name exists.

  For example, the following statement adds a column named `total` only if no column with that name exists:

  ```sqlexample
  ALTER TABLE my_table ADD COLUMN IF NOT EXISTS total NUMBER;
  ```

  If a `total` column already exists, the statement does not return an error, and the existing column is left unchanged.
* When dropping a column, you can specify IF EXISTS to drop the column only if a column with that name exists.

  For example, the following statement drops a column named `total` only if a column with that name exists:

  ```sqlexample
  ALTER TABLE my_table DROP COLUMN IF EXISTS total;
  ```

  If the `total` column does not exist, the statement does not return an error.

For details, see [ALTER TABLE](../../sql-reference/sql/alter-table.md).

### H3 functions for GEOGRAPHY objects — *Preview*

With this release, we are pleased to announce the preview of H3 functions for GEOGRAPHY objects.
[H3](https://h3geo.org/docs/) is a [hierarchical geospatial index](https://h3geo.org/docs/highlights/indexing/)
that partitions the world into hexagonal cells in a
[discrete global grid system](https://en.wikipedia.org/wiki/Discrete_global_grid). Snowflake provides the SQL functions that
enable you to use H3 with [GEOGRAPHY](../../sql-reference/data-types-geospatial.md) objects in Snowflake.

For more information, see [Using GEOGRAPHY objects with H3](../../sql-reference/data-types-geospatial.md).

## Extensibility Updates

### Python packages policies — *Preview*

With this release, we are pleased to announce support for Python packages policies as a preview feature to all accounts.

Using a packages policy enables you to set allowlists and blocklists for third-party Python packages from Anaconda at the account level.
This lets you meet stricter auditing and security requirements and gives you more fine-grained control over which packages are available
or blocked in your environment.

For more information, see [Packages Policies](../../developer-guide/udf/python/packages-policy.md).

## Web Interface Updates

### Snowsight is the default interface for Snowflake accounts in US government regions

Starting November 6, 2023, all users in Snowflake accounts in US government regions see Snowsight after logging in.

| Cloud | Snowflake Region | Week of change |
| --- | --- | --- |
| Amazon AWS | US Gov West 1 | Week of November 6, 2023 |
| Amazon AWS | US Gov 1 West (FedRAMP High Plus) | Week of November 13, 2023 |
| Amazon AWS | US Gov East 1 (FedRAMP High Plus) | Week of November 13, 2023 |
| Microsoft Azure | US Gov Virginia | Week of November 13, 2023 |
| Amazon AWS | US East (Commercial Gov - N. Virginia) | Week of November 13, 2023 |

Users in those accounts can no longer choose Classic Console as the default experience for their user profile. If needed, users can
access Classic Console from Snowsight.

If you use private connectivity to access Snowflake and have not yet set up your private connectivity configuration to access Snowsight,
you are not affected by this change. See [Configuring private connectivity for Snowsight](../../user-guide/ui-snowsight-gs.md) to prepare for upgrading to Snowsight.

For more information, see [Snowsight: The Snowflake web interface](../../user-guide/ui-snowsight.md).

## Release Notes Change Log

| Announcement | Update | Date Updated |
| --- | --- | --- |
| *Snowsight is the Default Interface for Snowflake Accounts in US Government Regions* | **Added** to *Web Interface Updates* | 25-Oct-2023 |
| *H3 functions for GEOGRAPHY objects — Preview* | **Added** to *SQL Updates* | 25-Oct-2023 |

---
title: October 28-30, 2024 — 8.41 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_41.md
section: Release Notes
---

# October 28-30, 2024 — 8.41 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Outbound private connectivity for Snowflake features

By default, Snowflake features that generate outbound network traffic from Snowflake to a cloud platform traverse the public Internet. With
this release, we are pleased to announce that you can create private endpoints in Snowflake to access the cloud platform using the platform’s
private connectivity solution rather than the Internet. This lets you access cloud platform services privately and securely from Snowflake.

With this release, outbound private connectivity is now available for the following Snowflake features:

#### External network access from Snowpark using AWS PrivateLink and Azure Private Link — *General availability*

You can configure external network access and create a private endpoint to use private connectivity to connect to an external network location
from a UDF/UDTF or stored procedure within Snowpark. Snowflake accounts on AWS can use AWS PrivateLink to access the external network location
and Snowflake accounts on Azure can use Azure Private Link.

For more information, see:

* [External network access and private connectivity on AWS](../../developer-guide/external-network-access/creating-using-private-aws.md)
* [External network access and private connectivity on Microsoft Azure](../../developer-guide/external-network-access/creating-using-private-azure.md)

#### External network access from Snowpark Container Services via AWS PrivateLink and Azure Private Link - *Preview*

You can configure external network access and create a private endpoint so outgoing network traffic from Snowpark Container Services uses AWS
PrivateLink or Azure Private Link instead of the public Internet.

For more information, see [Network egress using private connectivity](../../developer-guide/snowpark-container-services/service-network-communications.md).

#### External functions using Azure Private Link — *General availability*

You can configure an external function and create a private endpoint to use Azure Private Link when calling executable code that is developed,
maintained, stored, and executed in Azure. You can securely connect to the Azure resource via Azure API Management, using both the Azure
Portal and the Azure ARM template.

For more information, see:

* [Private connectivity with external functions: Azure ARM template](../../sql-reference/external-functions-creating-azure-template-private-connect.md)
* [Private connectivity with external functions: Azure Portal](../../sql-reference/external-functions-creating-azure-ui-private-connect.md)

For general information about using outbound private connectivity with these Snowflake features, see [Private connectivity for outbound network traffic](../../user-guide/private-connectivity-outbound.md).

### EXECUTE IMMEDIATE FROM: Preview SQL rendered from Jinja2 templates

With this release, we are pleased to announce support for previewing the SQL statements rendered by Jinja2 templates.

If you are using the EXECUTE IMMEDIATE FROM command to render and execute SQL statements from a Jinja2 template, you can preview the rendered
statements without executing them by specifying `DRY_RUN=TRUE`. This parameter is useful for debugging templating code and for previewing
SQL statements from staged files not intended for execution.

For more information, see [EXECUTE IMMEDIATE FROM](../../sql-reference/sql/execute-immediate-from.md).

### GENERATE_SYNTHETIC_DATA: New system stored procedure for generating synthetic data — *Preview*

With this release, we are pleased to announce the preview of the GENERATE_SYNTHETIC_DATA system stored procedure. With the synthetic data
generation feature, you can now programmatically create realistic datasets that closely mirror your original data. This allows you to
represent sensitive, confidential, or restricted information across various workloads, such as testing and validation.

For more information, see [GENERATE_SYNTHETIC_DATA](../../sql-reference/stored-procedures/generate_synthetic_data.md).

## Security updates

### Increased limits for network policies on internal stages

With this release, we are pleased to announce the general availability of using network policies to restrict incoming network traffic to the
internal stages of AWS accounts.

For Business Critical or higher customers, this release increases the limits on how many network identifiers can be included in a network
policy.

For more information, see [Protecting internal stages on AWS](../../user-guide/network-policies.md).

## SQL updates

### Extended support for bind variables

You can use bind variables to replace literals in SQL statements, which allows applications to dynamically construct SQL statements based on
user input. Bind variables are commonly used with Snowflake drivers, Snowflake Scripting, and the SQL REST API.

With this release, Snowflake extends support for bind variables so that you can use them for more use cases. The extended support includes
the use of bind variables for stage names and other parameters in COPY INTO <table> statements.

For more information, see a [Example that uses bind variables to set parameters in a command](../../developer-guide/stored-procedure/stored-procedures-snowflake-scripting.md).

## Extensibility updates

### Writing files from Snowpark Python UDFs and UDTFs — *Preview*

With this release, we are pleased to announce the preview of Writing files from Snowpark Python UDFs and UDTFs. With this feature in Snowpark
Python, you can now write files to stages using user-defined functions (UDFs), vectorized UDFs, user-defined table functions (UDTFs), and
vectorized UDTFs.

For more information, see [Writing files from Snowpark Python UDFs and UDTFs](../../developer-guide/snowpark/python/creating-udfs.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 25-Oct-24 |

---
title: October 29, 2024 — Universal Search in Virtual Private Snowflake (VPS)
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-29-snowsight-vps.md
section: Release Notes
---

# October 29, 2024 — Universal Search in Virtual Private Snowflake (VPS)

We are pleased to announce that Universal Search in Snowsight is now available in VPS. With Universal Search, you can quickly and easily
find database objects in your account, data products available to you in the Snowflake Marketplace, relevant Snowflake Documentation
topics, and relevant Snowflake Community Knowledge Base articles.

You can use natural language to search because Universal Search understands your query and information about your database objects and can
find objects with names that differ from your search terms.

For more details, see [Search Snowflake objects and resources](../../../user-guide/ui-snowsight-universal-search.md).

---
title: October 30, 2024 — Hybrid tables — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-10-30-hybrid-tables-ga.md
section: Release Notes
---

# October 30, 2024 — Hybrid tables — *General Availability*

With this release, Snowflake is pleased to announce the general availability of hybrid tables in commercial
[AWS regions](../../../user-guide/tables-hybrid-limitations.md). Hybrid tables simplify application development by
supporting execution of both transactional and analytics use cases within a single database.

For more information, see [Hybrid tables](../../../user-guide/tables-hybrid.md).

---
title: ODBC Driver Change Log (Prior to January 2022)
source: https://docs.snowflake.com/en/release-notes/client-change-log-odbc.md
section: Release Notes
---

# ODBC Driver Change Log (Prior to January 2022)

This topic lists the fixes, enhancements, and other changes introduced across all released, production versions of the Snowflake [ODBC Driver](../developer-guide/odbc/odbc.md) prior to January 2022.

See the [ODBC Driver release notes](clients-drivers/odbc.md) for current release note and change log information from January 2022 and later.

Note that this list does not include all changes made to the driver; it only lists significant changes or changes that may impact your usage.

In addition, this list is updated independently from the ODBC driver releases and, therefore, may not include the most recently-released version. To
see all available versions, go to the [ODBC Download](https://developers.snowflake.com/odbc/) page.

| Version | Change | Description |
| --- | --- | --- |
| **ODBC Driver 2.24.4** |  |  |
|  |  | Fixed an issue with arrow that occurred when using ODBC_TREAT_DECIMAL_AS_INT. |
|  |  |  |
| **ODBC Driver 2.24.3** |  |  |
|  |  | Added the MapToLongVarchar property. |
|  |  | Updated the OpenSSL version from 1.1.1k to 1.1.1l. |
|  |  | Updated the curl version from 7.74.0 to 7.78.0 |
|  | SNOW-30433 | Fixed a issue to add retry on invalid arrow chunks. |
|  |  | Fixed an issue to remove retry on HTTP 403 errors. |
|  |  |  |
| **ODBC Driver 2.24.2** |  |  |
|  |  | Added the UseCurrentCatalog connection parameter. |
|  | SNOW-464077 | Fixed a bug in stage binding related to timestamps. |
|  | SNOW-452624 | Fixed installer registry issues. |
|  |  | Performance improvement. |
|  | SNOW-452032 | Replaced invalid UTF-8 characters returned from the server. |
|  | SNOW-366898 | Added additional checks to prevent potential crash issues. |
|  | SNOW-279670 | Add test button on DSN dialog. |
|  |  |  |
| **ODBC Driver 2.24.1** |  |  |
|  |  | Added fast fail and maximum retry support to the GET command for ODBC. |
|  | SNOW-395216 | Added telemetry for unsupported APIs. |
|  |  | Fixed a bug with empty binary data in JSON format. |
|  |  | Added a warning message when an invalid key is passed to a connection string. |
|  |  | Fixed an issue with the default location of the CA Bundle file on Windows. |
|  |  | Fixed a bug where multiple statements beginning with a CALL stored procedure caused a crash. |
|  |  |  |
| **ODBC Driver 2.24.0** |  |  |
|  |  | Updated the minimum supported version of MacOS from 10.13 to 10.14. |
|  |  | Fixed an issue where Arrow crashed when calling the ValueOrDie() function. |
|  |  | Fixed an issue related to parameter array binding. |
|  | SNOW-373871 | Added PUT/GET support when connecting to a FIPS enabled endpoint. |
|  | SNOW-227282 | Added the ability for Telemetry to record the number of result rows consumed by an application. |
|  |  |  |
| **ODBC Driver 2.23.3** |  |  |
|  | SNOW-293206 | Added ability to return argument names from SQLProcedureColumns(). |
|  |  | Added option to set PUT_COMPRESSLV as a connection / configuration parameter. |
|  |  | Fixed an issue with the UPDATE/DELETE/INSERT statements where parameter array binding failed in some cases. |
|  |  | Fixed an issue where DEFAULT_VARCHAR_SIZE and DEFAULT_BINARY_SIZE did not work with SQLColumn(). |
|  |  | Added support for streaming values for the bind variables SQLParamData() and SQLPutData(). |
|  | SNOW-355132 | Added feature to make CURLOPT_MAXAGE_CONN configurable. |
|  |  | Fixed an issue where arrow chunk downloading caused a crash. |
|  |  | Upgraded OpenSSL from 1.1.1i to 1.1.1k. |
|  | SNOW-350996 | Changed behavior so that the PUT command does not retry when the file being processed already exists in the stage. |
|  |  | Fixed an issue where AWS logging caused a crash when using multiple threads. |
|  |  |  |
| **ODBC Driver 2.23.2** |  |  |
|  | SNOW-293206 | Added support for SQLProcedureColumns. |
|  |  | Fixed a bug with SQLColumns() and the GEOGRAPHY data type. |
|  | SNOW-291407 | Added connection / configuration parameters for specifying the default sizes of BINARY and VARCHAR columns when the column size is undetermined. |
|  |  | Improved performance when using TRACING=6. |
|  |  | Improved the performance of the secret detector. |
|  |  | Improvements for log settings. |
|  |  | When the ODBC version is 3 or higher, the ODBC Driver now uses SQL_TYPE_DATE, SQL_TYPE_TIME and SQL_TYPE_TIMESTAMP as the data types of date, time and timestamp. |
|  | SNOW-334403 | ODBC now generates regional url for aws us-east-1. |
|  |  |  |
| **ODBC Driver 2.23.1** |  |  |
|  | SNOW-249530 | Updated the driver to send only supported statements in SQLPrepare (including SELECT, DML, and SHOW statements). Prior to this change, if a statement was not supported in SQLPrepare (e.g. BEGIN, SET, or COMMIT), the driver would send the statement in SQLPrepare, and the server would return an error. |
|  | SNOW-269456 | Fixed a null pointer issue with timestamps. |
|  |  | Escaped unsafe characters in parameters in the connection string. |
|  |  | Added a configuration / connection parameter for specifying the temporary directory for PUT commands. |
|  |  | Captured the use of the session context in the telemetry. |
|  | SNOW-282587 | Without a sqlfetch the query is canceled. |
|  |  |  |
| **ODBC Driver 2.23.0** |  |  |
|  | SNOW 194654 | Added support for caching MFA tokens. |
|  | SNOW-239674 | Updated the driver to capture escape characters in telemetry. |
|  |  | Set a default value for the CA certificate bundle file name. |
|  |  | Update the driver to free up memory when downloading chunks of results in Arrow format . |
|  | SNOW-274791 | Updated the driver to prevent overscoping when listing foreign keys. |
|  | SNOW-295726 | Added a secret detector and masking module. |
|  | SNOW-278585 | Added support for using the Arrow data format for transferring data to Snowflake. |
|  |  |  |
| **ODBC Driver 2.22.5** |  |  |
|  | SNOW-219403 | Added support for specifying the PUT_FASTFAIL and PUT_MAXRETRIES parameters in the simba.ini file. |
|  | SNOW-215983 | Added support for unicode in folder names in PUT / GET statements. |
|  | SNOW-275777 | Updated the driver to use JSON format for Win32 applications when exchanging data with Snowflake. |
|  | SNOW-269456 | Upgraded the version of Arrow to 0.17.0. |
|  | SNOW-78018 | Updated the driver to return the query Id for a successful ODBC call that executes the PUT/GET command. |
|  |  |  |
| **ODBC Driver 2.22.4** |  |  |
|  | SNOW-218025 | Caught exception during the heartbeat sync, which prevents crashes during large (10G) uploads. |
|  | SNOW-240901 | Added Security Verification for Query Texts. |
|  | SNOW-218019 | Updated the telemetry payloads. |
|  | SNOW-195691 | Added support for the ODBC SQLProcedures() function. |
|  | SNOW-231762 | Fixed error with recognizing multi-statements. |
|  |  |  |
| **ODBC Driver 2.22.3** |  |  |
|  | SNOW-219403 | Added support for configurable parameters to enable fast fail and specify the maximum number of retries for PUT command failures. |
|  | SNOW-197194 | Improved the error message for ODBC SSL Certificate failures. |
|  | SNOW-201816 | Reverted a change that overwrote proxy configurations that were set in environment variables. |
| **ODBC Driver 2.22.2** |  |  |
|  | SNOW-199839 | Added inband telemetry when the PUT command fails. |
|  | SNOW-200183 | Added the EnablePidLogFileNames configuration parameter, which causes different processes to generate separate log files. |
|  | SNOW-201047 | Added exceptions for unsupported features to inband telemetry. |
|  | SNOW-201816 | Fixed a problem where the proxy details could not be cleared after being set in the ODBC driver. |
|  | SNOW-204142 | When enabled, SQL_DESC_TYPE_NAME returns the GEOGRAPHY type when GEOGRAPHY_OUTPUT_TYPE is GeoJSON (not (E)WKT or (E)WKB). |
|  | SNOW-209045 | Fixed a problem where a crash occurred with concurrent connections. |
|  | SNOW-213639 | Fixed ODBC bulk array binding errors that occurred when parsing data in DATE format. |
| **ODBC Driver 2.22.1** |  |  |
|  | SNOW-170804 | Addressed a security vulnerability finding for util-linux-v2.33.1. |
|  | SNOW-170805 | Addressed a security vulnerability finding for openssl-OpenSSL_1_1_1b. |
|  | SNOW-177073 | Send inband telemetry objects for metadata API calls. |
|  | SNOW-178485 | Addressed a security vulnerability finding for openssl-1.1.1b-v1.1.1b. |
|  | SNOW-197540 | Added metadata to the telemetry for derived ODBC Show commands. |
| **ODBC Driver 2.22.0** |  |  |
|  | SNOW-170120 | Added the configuration parameter EnableAutoIpdByDefault to override the default value of SQL_ATTR_ENABLE_AUTO_IPD. |
|  | SNOW-181235 | Addressed a connection glitch introduced in version 2.21.8. |
|  | SNOW-183721 | Updated the CACert Bundle in ODBC Drivers. |
|  | SNOW-184163 | Improved PUT performance by using /dev/urandom as the default device. |
|  | SNOW-187198 | Fixed support for the CLIENT_MEMORY_LIMIT parameter, which is used as a max memory limit for Chunk downloading. |
|  | SNOW-187534 | Masked signatures in GCP URLs in logs. |
| **ODBC Driver 2.21.8** |  |  |
|  | SNOW-160149 | set Min version of ODBC to receive Arrow result set. |
|  | SNOW-170279 | Add usage stats of SqlPrepare Defer execution stats to CLIENT_ENVIRONMENT. |
|  | SNOW-175663 | Enable MULTI STATEMENT Support for ODBC on server side. |
|  | SNOW-175667 | Increase the PUT threshold value on the server side to 200MB |
|  | SNOW-177137 | Added new parameter named UseURandomDevice that changes the driver to use /dev/urandom instead of /dev/random. |
| **ODBC Driver 2.21.7** |  |  |
|  | SNOW-101559 | Fixed issue where PUT command with slashes did not work as documented. |
|  | SNOW-156582 | Fixed the following error that occurred when uploading a file into AWS S3 internal stage using PUT command:`AwsSdk::AWSClient::: No response body. Response code: 404`. |
|  | SNOW-159839 | Fixed issue with reading and writing data containing an em-dash when using the latest Snowflake ODBC driver with Informatica Cloud Services. |
|  | SNOW-162610 | Performance improvements for using PUT commands with internal stages. |
|  | SNOW-163154 | Fixed issue where PUT commands failed when no file extension was specified. |
|  | SNOW-163664 | Fixed issue for Private Preview feature. |
|  | SNOW-165820 | Fixed issue where PUT commands did not upload files without returning errors. |
|  | SNOW-168900 | Fixed issue where the driver continued to open connections to localhost when successive PUT commands were issued; this caused excessive TCP connections (in 3rd-party connectors for Attunity and Razorsql). |
|  | SNOW-169965 | Added Logging level to client environment telemetry. |
|  | SNOW-170115 | For Windows, fixed issue where PUT commands failed even when an escape character was provided and delimited using single quotes. |
|  | SNOW-170233 | Fixed issue where PUT / GET commands fail when paths use forward slashes. |
| **ODBC Driver 2.21.6** |  |  |
|  | SNOW-135244 | For Windows, fixed issue where `externalbrowser` authentication was not working properly. |
|  | SNOW-143536 | Added the `NoExecuteInSQLPrepare` parameter to enable controlling how DDL statements are handled in `SQLPrepare` and `SQLExecute`. |
|  | SNOW-158500 | Fixed issue where queries executed with the driver showed failing DESCRIBE_QUERY results; related to the fix for SNOW-143536. |
|  | SNOW-160829 | Fixed performance issue caused by driver not picking up schema/database. |
| **ODBC Driver 2.21.5** |  |  |
|  | SNOW-45633, . SNOW-144591 | Support added for bulk array binding. |
|  | SNOW-75496 | For Snowflake accounts hosted on GCP, support added for PUT and GET commands. |
|  | SNOW-165067 | Security fix. |
| **ODBC Driver 2.21.4** |  |  |
|  | N/A | Version is not available for download; all fixes are available in 2.21.5 (and higher). |
| **ODBC Driver 2.21.3** |  |  |
|  | SNOW-136211 | Implemented Arrow Bulk fetch. |
|  | SNOW-157756 | Notarized mac package. |
| **ODBC Driver 2.21.2** |  |  |
|  | SNOW-52894, . SNOW-152727, . SNOW-152768, . SNOW-153310 | Fixed issues related to GA of secure SSO ID tokens to support browser-based SSO (for Windows and macOS only). |
|  | SNOW-140235 | Fixed issue where using `yum` to upgrade the driver to a new version deleted the driver RPM, which caused the upgrade to fail. |
|  | SNOW-147376 | Fixed issue where OOB (Out Of Band) Telemetry did not capture connections if curl code was not set to `CURL_OK`. |
|  | SNOW-150687 | Fixed the following session expiration error for long running queries: `"GS error code=390112, GS error message=Your session has expired. Please login again"` |
|  | SNOW-151169 | Upgraded curl to 7.68.0. |
| **ODBC Driver 2.21.1** |  |  |
|  | SNOW-139254 | Internal enhancement. |
|  | SNOW-147190 | Removed unnecessary `{"message":"Limit Exceeded"}` error message from displaying in output buffer. |
|  | SNOW-147420 | Fixed issue that caused a driver failure when a property in the connection string was too long. |
|  | SNOW-148261 | Fixed issue with incorrect Heartbeat endpoint that caused the CLIENT_SESSION_KEEP_ALIVE parameter to fail if set to true; this was a regression introduced in version 2.20.5. |
| **ODBC Driver 2.21.0** |  |  |
|  | SNOW-75961 | Set ODBC SQL_ATTR_ENABLE_AUTO_IPD default value to true, which reverts the default value change introduced in version 2.20.0 of the driver. |
|  | SNOW-120324 | For macOS and Windows, implemented additional updates to support secure SSO ID tokens (preview feature). |
|  | SNOW-137581 | For Linux, implemented guarding of `getaddrinfo()` with `mutex` in `libcurl`; also introduced `ForceLockGetaddrinfo` parameter in ODBC configuration settings to fix segmentation fault when application is not pthread compatible. |
|  | SNOW-139281 | For Linux, disabled SSO ID token cache. |
|  | SNOW-141543 | Fixed issue with rendering of results for LIST and REMOVE commands. |
|  | SNOW-141622 | Updated SSO ID token secure storage to make it ODBC-specific, rending it inaccessible to other drivers. |
| **ODBC Driver 2.20.5** |  |  |
|  | SNOW-120324 | For macOS and Windows, added support for secure SSO ID tokens (preview feature); this enables applications to use browser-based SSO while minimizing the number of authentication popups when connecting to Snowflake. |
|  | SNOW-123641 | Added support for multi-threading in the driver to implement thread-safety in Snowflake native objects. |
|  | SNOW-134689 | Increased multi-part upload threshold to 64MB for PUT commands. |
|  | SNOW-139112 | Fixed potential security issue due to raw message logging. |
| **ODBC Driver 2.20.4** |  |  |
|  | SNOW-121054 | Reduced unnecessary calls to `ALTER SESSION SET AUTOCOMMIT=TRUE`. |
| **ODBC Driver 2.20.3** |  |  |
|  | SNOW-124921 | Merged partner code changes to implement partner requests and fix reported issues. |
|  | SNOW-126811 | Changed the behavior of the PUT command that skips file upload if the file exists in the stage and no overwrite option is set. |
| **ODBC Driver 2.20.2** |  |  |
|  | SNOW-91853 | Fixed issue where system locale takes precedence over any locale settings in the driver. |
|  | SNOW-110240 | For Linux and Snowflake accounts hosted on Azure, fixed segmentation violation error that occurred when using PUT with SAS. |
|  | SNOW-115888 | For Windows and Snowflake accounts on Azure, fixed issue that occurred with large file uploads when using PUT. |
|  | SNOW-121236 | (Corrected: it appears that this was a false alarm, and is no longer an issue for the customer.) Fixed issue where CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX and CLIENT_SESSION_KEEP_ALIVE parameters could not be set in the ODBC connect string. |
| **ODBC Driver 2.20.1** |  |  |
|  | SNOW-115888 | For Windows, fixed issue with uploading/downloading large files to/from stages in Azure (using PUT/GET). |
|  | SNOW-110240 | Fixed issue that resulted in a segmentation fault on Redhat when uploading files to stages in Azure (using PUT). |
| **ODBC Driver 2.20.0** |  |  |
|  | SNOW-97263 | Implemented the following fixes from Simba, some of which introduced behavior changes: . 1. Fix issue with setting DSI_CONN_CURRENT_CATALOG to non-null value; also, implement `SFSemantics` and change the default behavior for it. . 2. Set SQL_DESC_CASE_SENSITIVE to false for non-character data types. . 3. When using non-existent name or invalid character (e.g. `"`) in filters for catalog functions, return empty result instead of error. . 4. Set SQL_ATTR_ENABLE_AUTO_IPD to false by default to match ODBC specification. . 5. Add support for binding SQL_BIT parameter. . 6. Fix incorrect value when binding SQL_REAL parameter. . 7. Support Inf/Nan values when binding SQL_REAL/SQL_DOUBLE parameters. . 8. Return truncation warning when data retrieval buffer size is smaller than actual data. . 9. Support binding parameter with custom data types (SQL_SF_TIMESTAMP_LTZ, SQL_SF_TIMESTAMP_NTZ, SQL_SF_TIMESTAMP_TZ). . 10. Provide correct information from `SQLGetInfo(SQL_DATABASE_NAME)` and `SQLGetInfo(SQL_USERNAME)`. . |
|  | SNOW-97669 | Fixed issue with SOURCE_COMPRESSION = GZIP by matching the value case-insensitively. |
|  | SNOW-98456 | Internal enhancement. |
|  | SNOW-100023 | Fixed issue where Azure SDK fails to upload large files from Mac/Windows. |
|  | SNOW-101569 | Replaced `int128` and `uint128` libraries. |
| **ODBC Driver 2.19.16** |  |  |
|  | SNOW-14287 | Fixed Wrong Column Size error for `string` data type in the result set metadata. |
|  | SNOW-86742 | Added client information to the USER-AGENT HTTP header. |
|  | SNOW-90398 | Improved handling of Cache Directory creation errors. |
|  | SNOW-90427 | Fixed issue where `ensureCacheDir` failure was not properly handled in `readOCSPCacheFile()`. |
|  | SNOW-98251 | Fixed performance degradation by removing `CURLOPT_FORBID_REUSE` from `curl` option. |
| **ODBC Driver 2.19.15** |  |  |
|  | SNOW-98251 | Fixed a performance regression introduced in v2.19.10 of the driver. Due to this fix, versions 2.19.10 to 2.19.14 have been removed from distribution and are no longer available for download. |
| **ODBC Driver 2.19.14** . (removed from distribution due to fix in 2.19.15) |  |  |
|  | SNOW-81418 | Added support for `OVERWRITE` option in PUT and GET commands. |
|  | SNOW-91145 | Implemented behavior change for values returned by `SQLTable()` function, based on the table type (`TABLE`, `VIEW`, or `TABLE,VIEW`). |
| **ODBC Driver 2.19.13** . (removed from distribution due to fix in 2.19.15) |  |  |
|  | SNOW-92671 | Fixed issue with duplicate row being inserted by ensuring `requestID` is consistent with expired session. |
| **ODBC Driver 2.19.12** . (removed from distribution due to fix in 2.19.15) |  |  |
|  | SNOW-76184 | Fixed extra space in end-of-timestamp output by introducing `ODBC_USE_STANDARD_TIMESTAMP_COLUMNSIZE=true` where the output size is estimated to be 29 instead of 35. |
|  | SNOW-76710 | Implemented Out-of-Band Telemetry. |
|  | SNOW-90409 | Fixed support for OCSP Fail-open. |
| **ODBC Driver 2.19.11** . (removed from distribution due to fix in 2.19.15) |  |  |
|  | SNOW-80091 | Driver now sends `clientStartTime` and `retryCount` with each `/queries/v1/query-request`. |
|  | SNOW-88346 | Internal change for pending feature. |
|  | SNOW-82846 | Fixed issue where inserting a TIMESTAMP into a STRING data type field via Parameterized insert left-trims the month, day, and time using MS ODBC TEST Tool (`odbcte32.exe`). |
|  | SNOW-90640 | Fixed issue with `CABundleFile` parameter for PUT and GET support. |
|  | SNOW-90246 | Fixed issue with `OCSP_FAIL_OPEN` parameter normalization. |
| **ODBC Driver 2.19.10** . (removed from distribution due to fix in 2.19.15) |  |  |
|  | SNOW-88730 | In Windows, fixed AWS PrivateLink connection issue by adding support for the `CABundleFile` parameter to the connect string. |
|  | SNOW-88853 | Added support for optionally setting application name through the `.ini` file or connect string. |
| **ODBC Driver 2.19.9** |  |  |
|  | SNOW-82352 | Enhanced prepared statements to support queries that start with an open parenthesis. |
|  | SNOW-84995 | Driver now checks the OCSP Response Cert Status before checking the time validity for the cert; this prevents expired REVOKED OCSP responses from failing open. |
|  | SNOW-86966 | Driver now sets empty SERVICE_NAME if passed from the services layer. |
|  | SNOW-86970 | Replaced insecure CRT functions with secure functions. |
| **ODBC Driver 2.19.8** |  |  |
|  | SNOW-85722 | Driver now checks on the return value for `TlsAlloc()` and calls `TlsFree()` as needed. |
| **ODBC Driver 2.19.7** |  |  |
|  | SNOW-85249 | Fixed an issue where SERVICE_NAME was not propagated to the services layer. |
|  | SNOW-85264 | Fixed a critical stability issue in OCSP fail-open handling that was introduced in version 2.19.0. Due to this fix, versions 2.19.0 to 2.19.6 have been removed from distribution and are no longer available for download. |
| **ODBC Driver 2.19.6** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-81831 | Driver now uses standard connection fields for global URLs. |
| **ODBC Driver 2.19.5** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-80433 | Fixed issue with PUT commands encountering a data error (e.g. `'LOAD00000001.csv.gz',compression type used: 'GZIP', cause: 'data error'`) due to files with the same name being uploaded in separate, but concurrent sessions. |
| **ODBC Driver 2.19.4** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-67606 | Internal change. |
|  | SNOW-70889 | Updated OCSP hostname/URL for AWS PrivateLink. |
| **ODBC Driver 2.19.3** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-79225 | Internal change for pending feature. |
| **ODBC Driver 2.19.2** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-78624 | Fixed issue with Linux dependency on gcc and g++. |
| **ODBC Driver 2.19.1** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-74552 | For Windows, driver now returns the query ID for a successful ODBC call. |
|  | SNOW-77593 | Improved logging for OCSP fail-open, as well as updated configuration naming from Soft Fail to Fail Open. |
|  | SNOW-77750 | To facilitate downloading the driver automatically/programmatically, the Client Driver Repository now includes a `Latest` directory for each supported OS. The directory is a symlink to the latest version directory. |
|  | SNOW-77781 | Implemented various fixes for issues caused by OCSP fail-open. |
| **ODBC Driver 2.19.0** . (removed from distribution due to fix in 2.19.7) |  |  |
|  | SNOW-73827 | Driver upgraded from SimbaSDK 10.1.11 to 10.1.15. |
|  | SNOW-76151 | Implemented support for OCSP fail-open. |
|  | SNOW-76979 | Updated priority of ways to configure OCSP fail-open. |
|  | SNOW-77160 | Added OCSP_MODE metric. |
| **ODBC Driver 2.18.4** |  |  |
|  | SNOW-66128 | Driver now supports SERVICE_NAME. |
|  | SNOW-73120 | Fixed issue with PUT command failing to load file to internal stage. |
|  | SNOW-73304 | Fixed the TIMESTAMP_LTZ behavior for the driver. |
| **ODBC Driver 2.18.3** |  |  |
|  | SNOW-63521 | Driver upgraded to OpenSSL 1.1.1b. |
| **ODBC Driver 2.18.2** |  |  |
|  | SNOW-39055 | Documented support for defining custom C data types. |
|  | SNOW-60376 | For Windows, fixed issue that prevented changing the installation location from the default. |
| **ODBC Driver 2.18.1** |  |  |
|  | SNOW-56250 | Fixed issue where cancel does not record `requestId`. |
|  | SNOW-64779 | Added BIGINT support to ODBC Data Type table. |
| **ODBC Driver 2.18.0** |  |  |
|  | SNOW-65165 | Driver upgraded to SimbaSDK 10.1. |
| **ODBC Driver 2.17.6** |  |  |
|  | SNOW-60066 | For Mac OS, fixed key pair segfault when extracting public key. |
|  | SNOW-60617 | Added support for setting `APPLICATION` property. |
|  | SNOW-63031 | Driver now invalidates outdated OCSP responses when checking cache hit. |
|  | SNOW-63305 | Improvements for future use. |
| **ODBC Driver 2.17.5** |  |  |
|  | SNOW-62431 | For Snowflake accounts hosted on AWS, support added for PUT and GET commands. |
|  | SNOW-62880 | Support added for loading private key file for key-pair authentication. |
|  | SNOW-62922 | Fixed issue with driver crashing when DB2 ODBC library is also used. |
| **ODBC Driver 2.17.4** |  |  |
|  | SNOW-61962 | Improved the precision for floating point numbers to mitigate precision loss. |
|  | SNOW-62077 | Driver now checks HTTP Response Codes for OCSP Cache Download. |
| **ODBC Driver 2.17.3** |  |  |
|  | SNOW-55056 | Fixed issue caused by including region and cloud platform in `account` parameter in `odbc.ini`. |
| **ODBC Driver 2.17.2** |  |  |
|  | SNOW-52535 | Internal change for pending feature. |
|  | SNOW-58250 | Driver now filters client application names to pass only alphanumeric characters and underscore characters (`_`); all other characters in client application names are ignored. |
|  | SNOW-60207 | Fixed an issue where the warehouse specified in the connection parameters is not set when a session is created by an ID token. |
| **ODBC Driver 2.17.1** |  |  |
|  | SNOW-55036 | Added `request_guid` to all HTTP requests to support better tracing. |
| **ODBC Driver 2.17.0** |  |  |
|  | SNOW-55095 | Internal change for pending feature. |
|  | SNOW-56912 | Changed mapping for BOOLEAN data type from SQL_INTEGER to SQL_BIT. |
| **ODBC Driver 2.16.11** |  |  |
|  | SNOW-55003 | For Windows ODBC configuration, changed UID parameter from required to optional, enabling creation of system DSNs without a username. |
| **ODBC Driver 2.16.10** |  |  |
|  | SNOW-45298 | Driver no longer generates incidents for errors caused by user environment. |
| **ODBC Driver 2.16.9** |  |  |
|  | SNOW-40171 | Fixed a memory leak when setting the `autocommit` attribute. |
|  | SNOW-53452 | Internal change for pending feature. |
|  | SNOW-53650 | Internal change for pending feature. |
|  | SNOW-53955 | Fixed the following error: `failed to create a id token cache` |
| **ODBC Driver 2.16.8** |  |  |
|  | SNOW-50766 | Updated driver to enforce virtual host style for S3 URLs. |
|  | SNOW-51436 | Fixed issue with underflow of INTEGER values. |
| **ODBC Driver 2.16.7** |  |  |
|  | SNOW-50618 | Internal change for pending feature. |
|  | SNOW-51002 | Fixed issue introduced in v2.16.4 of the driver, in which numeric values fetched as the FLOAT/DOUBLE data type using the bulk fetch API could return wrong results. |
| **ODBC Driver 2.16.6** |  |  |
|  | SNOW-42835 | For Mac OS, added version number to package file metadata. |
|  | SNOW-49898 | Driver now returns Okta-specific error code when Okta authentication fails. |
| **ODBC Driver 2.16.5** |  |  |
|  | SNOW-49793 | Driver now deletes the OCSP response cache from the memory cache if validity check fails. |
|  | SNOW-49860 | For Mac OS, fixed the default driver manager encoding. |
| **ODBC Driver 2.16.4** |  |  |
|  | SNOW-48678 | Internal change for pending feature. |
| **ODBC Driver 2.16.3** |  |  |
|  | SNOW-44911 | For Windows, fixed issue with non-ASCII character binding. |
| **ODBC Driver 2.16.2** |  |  |
|  | SNOW-44075 | Removed login name requirement when authenticating with an OAuth access token. |
| **ODBC Driver 2.16.1** |  |  |
|  | SNOW-42987 | Added support for WCHAR and WVARCHAR data types to the converter to address Power BI failures in Direct Query mode due to non-ASCII characters. |
|  | SNOW-43215 | Added support for OCSP dynamic cache server for AWS PrivateLink. |
| **ODBC Driver 2.16.0** |  |  |
|  | SNOW-42632 | Enabled the OCSP Cache Server by default. |
|  | SNOW-43021 | Added support for using the DSN proxy parameters and `simba.ini` parameters to override the HTTP_PROXY, HTTPS_PROXY, and NO_PROXY environment variables. |
| **ODBC Driver 2.15.0** |  |  |
|  | SNOW-38487 | For Windows, driver now uses OCSP via OpenSSL instead of WinSSL. |
| **ODBC Driver 2.14.0** |  |  |
|  | SNOW-38487 | For Mac OS, driver now uses cURL 7.58.0 and OpenSSL 1.1.0g to support OCSP revocation check instead of using the preinstalled cURL and SecureTransport. |
|  | SNOW-38487 | For Linux, upgraded cURL 7.54.0 and OpenSSL 1.1.0e to 7.58.0 and 1.1.0g, respectively. |
| **ODBC Driver 2.13.21** |  |  |
|  | SNOW-34055 | Added OS and OS_VERSION to the session info. |
|  | SNOW-39429 | Added filtering of primary key and foreign keys by connection database and schema if CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX session parameter is enabled. |
|  | SNOW-40307 | Fixed incorrect formatting of trailing and leading zeros for numeric data type. |
| **ODBC Driver 2.13.20** |  |  |
|  | SNOW-38487 | For Linux, added OCSP cache server support. |
| **ODBC Driver 2.13.19** |  |  |
|  | SNOW-39883 | Fixed SIGSEGV caused by null pointer reference in base64 encoding. |
| **ODBC Driver 2.13.18** |  |  |
|  | SNOW-39049 | Driver now uses cURL library to retrieve OCSP responses to honor `proxy` config set by environment variable. |
|  | SNOW-39305 | Fixed a segmentation fault that occurred when converting TIMESTAMP to STRING for custom SQL data types (pending feature; not currently enabled). |
| **ODBC Driver 2.13.17** |  |  |
|  | SNOW-38353 | Fixed bulk converter and decimal digits for custom timestamps (pending feature; not currently enabled). |
|  | SNOW-38772 | Driver now honors output format for individual timestamp type. Also returns the value length after conversion. |
| **ODBC Driver 2.13.16** |  |  |
|  | SNOW-36102 | Added a parameter to allow the driver to treat big numbers (i.e. precision over 19) as a string. |
|  | SNOW-37994 | Fixed issue caused by wrong column byte size for VARCHAR type in result set metadata. |
| **ODBC Driver 2.13.15** |  |  |
|  | SNOW-23881 | Added support for custom timestamp formatter (pending feature; not currently enabled). |
| **ODBC Driver 2.13.14** |  |  |
|  | SNOW-34096 | Added support for custom SQL data types (pending feature; not enabled by default) in result set metadata. |
| **ODBC Driver 2.13.13** |  |  |
|  | SNOW-32391 | Fixed an issue that caused large inserts to overflow `rowCount`. |
| **ODBC Driver 2.13.12** |  |  |
|  | SNOW-31347 | Fixed an issue where `SQLDescribeCol` always returned 6 decimal digits (i.e. microseconds) as the precision for TIME and TIMESTAMP data types, regardless of whether the precision was set to a different value. Now, the driver returns the precision, from 0 (seconds) to 9 (nanoseconds), defined for the data type. |
| **ODBC Driver 2.13.11** |  |  |
|  | SNOW-31998 | Added support for SAML 2.0-compliant services/applications for federated authentication by adding the `externalbrowser` option to the `authenticator` connection parameter. |
| **ODBC Driver 2.13.10** |  |  |
|  | SNOW-29705 | Fixed issue where ODBC sessions were not closing properly; now the driver tries to close sessions in the destructor for the ODBC Connection object. |
|  | SNOW-33074 | Added support for `timezone` as a session parameter that can be set in `odbc.ini` for connecting to Snowflake. |
| **ODBC Driver 2.13.9** |  |  |
|  | SNOW-25562 | If `metadata_request_use_connection_ctx` is set to true, the driver now applies the database name to the ODBC API call if the schema name is not null. |
|  | SNOW-31998 | Added support for federated authentication/SSO/ADFS. |
| **ODBC Driver 2.13.8** |  |  |
|  | SNOW-31847 | For Windows, fixed an issue with a `curl failed initialization` error. |
| **ODBC Driver 2.13.7** |  |  |
|  | SNOW-30968 | Added an ODBC driver property to support `noproxy`. |
| **ODBC Driver 2.13.6** |  |  |
|  | SNOW-31211 | For Windows, applied fix for timestamps older than 1970 to dates. |
| **ODBC Driver 2.13.5** |  |  |
|  | SNOW-31211 | For Windows, added internal flag that enables an exception when TIMESTAMP_LTZ is out of range. By default, 1970-01-01 is quietly used in the event of an error. Previously, 1969-12-31 was returned. |
| **ODBC Driver 2.13.4** |  |  |
|  | SNOW-31211 | For Windows, fixed issue with timestamps older than 1970 not being supported. |
| **ODBC Driver 2.13.3** |  |  |
|  | SNOW-26793 | Added the version number to ODBC packages. |
|  | SNOW-28379 | For Mac OS, changed the namespace used to identify the installation package for the operating system from `com.snowflake.odbc` to `net.snowflake.odbc`. |
|  | SNOW-29592 | For Linux, changed the underlying SSL library from NSS to OpenSSL. No change to ODBC for Mac OS and Windows. |
| **ODBC Driver 2.12.99** |  |  |
|  | SNOW-22240 | Fixed an issue with merge count not adding up. |
|  | SNOW-30586 | Fixed an issue with number conversion in the driver. |
| **ODBC Driver 2.12.98** |  |  |
|  | SNOW-25562 | Added CLIENT_METADATA_REQUEST_USE_CONNECTION_CTX session parameter (to filter object names by the current database and schema if not specified). |
| **ODBC Driver 2.12.97** |  |  |
|  | SNOW-28617 | Client package signed with new GPG key (and secret). |
| **ODBC Driver 2.12.96** |  |  |
|  | SNOW-24601 | Implemented security patch for federated authentication. |
| **ODBC Driver 2.12.95** |  |  |
|  | SNOW-28234 | Added CLIENT_TIMESTAMP_TYPE_MAPPING to the list of parameters that can be set in connection properties. |
| **ODBC Driver 2.12.94** |  |  |
|  | SNOW-25540 | Added support for binding timestamp variables as timestamp_ntz for applications that use the bind API to load data into datetime columns (which are equivalent to the timestamp_ntz data type). |
|  | SNOW-26451 | Implemented CLIENT_SESSION_KEEP_ALIVE session parameter as a supported connection option. |
| **ODBC Driver 2.12.93** |  |  |
|  | SNOW-26953 | Fixed an issue which caused network disruptions to return an exception. Now, disruptions return a user error instead of an exception. |
| **ODBC Driver 2.12.92** |  |  |
|  | SNOW-26215, . SNOW-26227 | If the client tries to send a delete request to the server for a session that has already expired, the request is ignored. |
| **ODBC Driver 2.12.91** |  |  |
|  | SNOW-25999 | Driver processes SQL_DECIMAL as SQL_BIGINT if the scale is set to 0. |
| **ODBC Driver 2.12.90** |  |  |
|  | SNOW-11970 | Improved the resiliency for intermittent network errors when receiving query results. |
| **ODBC Driver 2.12.89** |  |  |
|  | SNOW-22102 | Fixed a potential deadlock when the main application thread waiting for a result chunk, being downloaded by an asynchronous thread, times out. |
|  | SNOW-22351 | Improved memory management for downloading large result sets. |
|  | SNOW-21795, . SNOW-24366, . SNOW-24519, . SNOW-24589 | Improved handling of connection failures and re-establishing a connection. |
| **ODBC Driver 2.12.88** |  |  |
|  | SNOW-22865 | The BUlkFetch API is now supported. |
|  | SNOW-23884 | Improved performance on ODBC initial connection. |
| **ODBC Driver 2.12.87** |  |  |
|  | SNOW-18996 | Added support for BINARY data type. |
|  | SNOW-22697 | Improved performance when fetching large result sets. |

---
title: ODBC Driver release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc.md
section: Release Notes
---

# ODBC Driver release notes

The ODBC Driver release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](odbc-2026.md)
* [2025 releases](odbc-2025.md)
* [2024 releases](odbc-2024.md)
* [2023 releases](odbc-2023.md)
* [2022 releases](odbc-2022.md)

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

---
title: ODBC Driver release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc-2022.md
section: Release Notes
---

# ODBC Driver release notes for 2022

This article contains the release notes for the ODBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for ODBC Driver updates.

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

## Version 2.25.7 (December 13, 2022)

### Documentation-only update

This month’s update includes only changes to the ODBC documentation and does not contain any software updates.

* Updated the [Managing/Using Federated Authentication](../../user-guide/admin-security-fed-auth-use.md)
  documentation to reflect a limitation relating to using OKTA and MFA (multi-factor authentication) with ODBC.

## Version 2.25.7 (October 28, 2022)

### BCR (Behavior Change Release) change

> **Caution:**
>
> Version 2.25.7 of the Snowflake ODBC driver included updates to match functionality described in
> the [ODBC Driver](../../developer-guide/odbc/odbc.md) documentation. Snowflake recommends that you do not install this version
> into a production environment before testing.

### Bug fixes

* Fixed an issue in which proxy settings in the connection parameters could incorrectly impact other connections.
* Fixed an issue where GET command calls might corrupt files.

## Version 2.25.6 (October 12, 2022)

### New features

* Updated supported versions for the following libraries:

  + Updated openssl to version 1.1.1q.
  + Updated curl to version 7.84.0.

### Bug fixes

* Fixed an issue where a stage binding issue caused the application to crash.
* Fixed an issue with the pathname for the Macintosh arm64 processor installation package.

## Version 2.25.5 (August 23, 2022)

### New features

* Added support for the new OKTA Identity Engine (OIE).

### Bug fixes

* Fixed issues for catalog functions with names that contain special characters.
* Fixed an issue with connection failures that occurred when the `validateSessionParam` configuration parameter
  is enabled and a database name contains a hyphen.
* Fixed an issue where corrupted files occasionally occurred with GET downloads.
* Fixed upload failures for files larger than 2 GB.

## Version 2.25.4 (August 1, 2022)

### Bug fixes

* Improved performance for parameter array binding.

## Version 2.25.3 (June 21, 2022)

### New features

* Added support for Arm64 on Linux.

## Version 2.25.2 (June 01, 2022)

### New features

* Added support for a new `SecondaryRoles` parameter.
* Improved parameter binding performance.

### Bug fixes

* Fixed an issue with using user-defined formats for TIMESTAMP_INPUT_FORMAT during stage binding by always
  using the default format for TIMESTAMP_INPUT_FORMAT for stage binding.

## Version 2.25.0 (May 09, 2022)

### New features

* Added Arm64 support for M1 Macintosh systems.
* Upgraded the openssl library from version 1.1.1l to 1.1.1n.

### Bug fixes

* Fixed an issue where stage binding failed with timestamp and date data types.
* Fixed an issue that caused a crash when changing the result format during a query.
* Fixed an issue that caused `SQLSetConnectAttr(SQL_ATTR_CURRENT_CATALOG)` to fail when the database name used
  lowercase letters.

## Version 2.24.7 (March 17, 2022)

### New features

* Added support for Snowflake server-side encryption.
* Added support for non-ascii characters in the catalog function filters.

### Bug fixes

* Fixed an issue with the `SQLGetData()` function.
* Removed extraneous XML files from the installation package.
* Fixed an issue with the encryption key size for PUT and GET calls.
* Fixed an invalid database error.

## Version 2.24.6 (February 21, 2022)

### Bug fixes

* Improved performance for `TIMESTAMP_TZ` and `TIMESTAMP_LTZ` on Windows.
* Enabled support for the undocumented `CLIENT_TELEMETRY_ENABLED` parameter.

## Version 2.24.5 (January 20, 2022)

### Bug fixes

* Fixed an issue where the test button crashed
* Fixed an issue where loading an OCSP cache file always failed.
* Fixed an issue with Invalid arrow chunks when retrying a query.

---
title: ODBC Driver release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc-2023.md
section: Release Notes
---

# ODBC Driver release notes for 2023

This article contains the release notes for the ODBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for ODBC Driver updates.

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

## Version 3.1.4 (December 07, 2023)

### New features and updates

* None.

### Bug fixes

* Added the `BROWSER_RESPONSE_TIMEOUT` connection parameter to fix an issue with external browser authentication.
* Added the `allowEmptyProxy` connection parameter to fix an issue where an empty proxy setting could overwrite the configuration setting.
* Fixed an issue that caused intermittent crashes when sending telemetry data.
* Removed CRT functions banned by Microsoft due to security concerns.

## Version 3.1.3 (November 13, 2023)

### New features and updates

* Updated the following libraries:

  + openssl from 3.0.9 to 3.0.11
  + curl from 8.1.2 to 8.4.0
* Updated `SQLGetStmtAttr(SQL_SF_STMT_ATTR_LAST_QUERY_ID)` to return the query id for failed query.
* Added support for managing the frequency of retries for unsuccessful connection requests:

  + Added the `retryTimeout` parameter with a default value of 300 seconds.
  + Updated how the driver uses the `LOGIN_TIMEOUT` and `maxHttpRetries` connection parameters and changed the default value of `LOGIN_TIMEOUT` to 300 seconds.

### Bug fixes

* Fixed an issue where the driver failed when fetching query results due to a timeout in OSCP validation.
* Fixed an issue where PUT and GET commands failed when a file path contained non-ASCII characters.
* Fixed an issue where a PUT command on GCP overwrote existing files when `overwrite=true` is not specified.
* Removed CRT functions banned by Microsoft due to security concerns.

## Version 3.1.1 (September 29, 2023)

### New features and updates

* Updated the `cacert` bundle used for SSL connections.

### Bug fixes

* Improved error messages related to PUT/GET command failures to provide specific errors instead of “unknown exception”.
* Fixed an issue where the ODBC driver would continue retrying chunk downloads even after the application canceled the related query.
* Fixed an issue where using `SQLGetData()` with the ARROW result format could decrease performance.
* Fixed an issue where credentials were shown in error messages.
* Fixed an issue with OCSP fallback requests in AWS PrivateLink environments.
* Fixed an issue where the driver did not use the entire OCSP URL in the certificate when performing OCSP validation.

## Version 3.1.0 (August 23, 2023)

### Behavior Change Release (BCR) Changes

* Fixed an issue where, under certain conditions, the driver could retry HTTP requests indefinitely.

  Previously, during an outage the driver would retry the failed HTTP call continuously until the request succeeds or
  until someone force kills the operation.

  With this change, the driver disables infinite HTTP retries originating from `execute` and `executeQuery` calls. Now, the driver
  limits HTTP retries to seven, by default. Customers can set the `maxHttpRetries` connection parameter to customize
  the maximum number of retries. Customers can set `maxHttpRetries=0` to remove the retry limit, but doing so runs the
  risk of the driver infinitely retrying failed HTTP calls.
* To improve performance, the `SQLExecDirect()` function no longer unnecessarily validates parameter bindings for a query.

  Previously, the driver sent two requests for each `SQLExecDirect()` call: a describe request and an execute request.
  To improve the performance describe request is omitted. With this change the driver won’t validate the parameter
  bindings needed by the query. If the parameter bindings from the previous query are not cleared
  using `SQLFreeStmt(SQL_RESET_PARAMS)`, they could be applied to the following query incorrectly and cause
  issues.

### New features and updates

* Added the `CLIENT_OUT_OF_BAND_TELEMETRY_ENABLED` session parameter to enable and disable out-of-band (OOB)
  telemetry support.

### Bug fixes

* Fixed an issue where the driver might fail when getting a query result for multi-statement queries that begin with new
  statement types, such as CALL.
* Fixed an issue that could cause the `SQLColAttribute()` function to return an incorrect value
  of `SQL_DESC_OCTET_LENGTH` on `VARCHAR` columns, which could truncate data.
* Fixed an issue where the driver failed to download query results by incorrectly sending out-of-band (OOB) telemetry
  timeouts when using private links.

## Version 3.0.2 (July 27, 2023)

### New features and updates

* Updated the following software libraries:

  + util-linux to version 2.39.0.
  + curl to version 8.1.2.
* Enhanced hybrid transactional/analytical processing (HTAP).
* Set the `LogLevel` default to `OFF` for ODBC clients running on Windows platforms.

### Bug fixes

* Fixed an issue that cause intermittent crashes when sending telemetry.

## Version 3.0.1 (July 06, 2023)

### BCR (Behavior Change Release) changes

Starting with version 3.0.1 of the ODBC driver:

* Upgraded from openssl 1.1.1 to openssl 3.0.9. Consequently, private keys generated using the deprecated encryption
  algorithms in previous openssl library version no longer work. When you update to ODBC 3.0.1, you must regenerate
  your private key file used for key pair authentication.
* Dropped support for CentOS 6 and MacOS 10.14 and 10.15.

### New features and updates

* Updated the following software libraries:

  + openssl to version 3.0.9.
  + ICU to version 71.1.0.
* Created a single, unified release package architecture that supports both x86_64 and arm64 Mac systems.

### Bug fixes

* Fixed an issue where the driver crashed intermittently when CLIENT_SESSION_KEEP_ALIVE is set to true on Windows systems.

## Version 2.25.12 (June 06, 2023)

### New features and updates

None.

### Bug fixes

* Fixed an issue where very large requests with large amounts of parameter bindings could crash an application by
  exceeding the logging size.
* Fixed an issue with OCSP validation.
* Fixed an issue that could inadvertently reveal a proxy password in the Snowflake log file.

## Version 2.25.11 (April 20, 2023)

### New features and updates

* Updated the libcurl library from version 7.87.0 to 7.88.1.
* Updated the zlib library from version 1.2.11 to 1.2.13.

### Bug fixes

* Fixed an invalid URL issue that could occur during OCSP validation when making connections.
* Fixed an issue where connections would fail when credentials were supplied when using proxies that do not need them.
* Removed deprecated openssl function calls.
* Fixed an issue where double type parameter bindings could lose precision.
* Removed unsafe function calls banned by Microsoft.

## Version 2.25.10 (March 22, 2023)

### New features and updates

* Updated the libcurl library from version 7.84.0 to 7.87.0.

### Bug fixes

* Fixed an issue the prevented clients from creating a file DSN (data source name).
* Fixed an issue where the PUT command failed to replicated data.
* Fixed an issue a Macintosh application running an the ARM64 architecture would fail to connect to Snowflake
  using the native Apple Silicon ODBC driver.

## Version 2.25.9 (February 28, 2023)

### New features and updates

None.

### Bug fixes

* Added support for the GEOMETRY data type in the `SnowflakeType` enum to fix an issue that occurred when calling
  the `SQLColumns()` function to return metadata that included GEOMETRY data.
* Fixed an issue where timestamp data was wrongly returned as NULL in some cases.

## Version 2.25.8 (February 8, 2023)

### New features and updates

None.

### Bug fixes

* Fixed an issue where an array binding INSERT statement failed when the schema is not defined in the session.
* Fixed an issue that occasionally caused the ODBC driver to crash when executing GET and PUT queries.
* Fixed an issue where the ODBC driver sent SIGPIPE signals after the session was idle for around 120 seconds.
* Fixed an issue where using Okta authentication failed when receiving an HTTP 429 error.

---
title: ODBC Driver release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc-2024.md
section: Release Notes
---

# ODBC Driver release notes for 2024

This article contains the release notes for the ODBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for ODBC Driver updates.

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

## Version 3.5.0 (November 04, 2024)

### New features and updates

* Added support for Red Hat Enterprise Linux (RHEL) 8 and CentOS 8 for ARM64 processors.
* Added a connectivity diagnostics mode, including the following new connection parameters:

  + `enable_connection_diag`, which controls whether the connector generates a connectivity diagnostic report.
  + `connection_diag_log_path`, which is the absolute path where the connectivity report is stored.
  + `connection_diag_allowlist_path`, which is the absolute path to a JSON file containing the output of `SYSTEM$ALLOWLIST()`
    or `SYSTEM$ALLOWLIST_PRIVATELINK()`.
* Updated the following libraries:

  + curl from version 8.7.1 to 8.10.1.
  + openssl from version 3.0.13 to 3.0.15.

### Bug fixes

* None.

## Version 3.4.1 (September 03, 2024)

### New features and updates

* Improved error messages for network errors.

### Bug fixes

* Fixed an issue of introducing delays in some cases when running put/get command.
* Fixed an issue where unsupported usage of `SQL_DEFAULT_PARAM` is not handled correctly.

## Version 3.4.0 (July 29, 2024)

### New features and updates

* Added support for passing private key file content through `SQLSetConnectAttr` when using key-pair authentication.
* Added support for different top-level domains, such as `.cn` in China.
* Added a change log in the Linux `rpm` package.
* Increased the maximum allowable LOB (large object) size.

### Bug fixes

* Fixed an issue that caused the driver to become unresponsive in some cases when logging.
* Fixed an issue where an error is returned when calling a SQL procedure that returns NULL.
* Fixed an issue where multi-statement queries returned incorrect query results when the multi-statement query had more than one USE command.
* Fixed an issue with memory leaks when reading environment variables.
* Fixed an issue where proxy settings in environment variables is not honored on Windows.

## Version 3.3.2 (June 24, 2024)

### New features and updates

* Added the `disableSamlUrlCheck` connection parameter to disable verification for SAML URLs.

### Bug fixes

* Fixed an issue with choosing the S3 regional URL domain based on the region name.
* Fixed an issue where the driver didn’t return query results correctly in some cases using scripting.
* Fixed an issue where `SQLColAttribute(SQL_DESC_TYPE_NAME)` didn’t return type names for custom SQL data types.
* Fixed an issue with incorrect information in logging.

## Version 3.3.1 (May 02, 2024)

### New features and updates

* Updated the following library versions:

  + arrow from 0.17.1 to 15.0.0.
  + aws sdk from 1.3.50 to 1.11.283.
  + curl from 8.6.0 to 8.7.1.

### Bug fixes

* None.

## Version 3.3.0 (April 08, 2024)

### New features and updates

* Added support for log settings in a logging configuration file.
* Updated the following library versions:

  + curl from 8.4.0 to 8.6.0.
  + openssl from 3.0.11 to 3.0.13.
  + zlib from 1.2.13 to 1.3.1.

### Bug fixes

* Fixed performance regression on Windows when using `SQLGetData` to retrieve the query results.
* Fixed an issue where `SQLProcedures()` didn’t list stored procedures that returned a table without a column definition.
* Upgraded the compiler on Windows and added more build flags to address security concerns.
* Fixed an issue where memory allocation failures could cause applications to terminate with a potential resource leak.
* Fixed an issue relating to out of bounds memory access errors.
* Removed CRT functions banned by Microsoft due to security concerns.
* Fixed an issue where using key pair authentication failed when a private key file path contained non-ASCII characters.

## Version 3.2.0 (January 19, 2024)

### BCR (Behavior Change Release) changes

With the 3.2.0 release, the ODBC driver removed the `ODBCInstLib` setting in the configuration file when initially installing the driver. During installation, the driver now searches the driver manager library for different possible locations based on the platform. This approach provides greater flexibility for various platforms. If the driver cannot find the library, it displays an `Unable to locate SQLGetPrivateProfileString function` error. In this case, you might need to set `ODBCInstLib` manually with the name of the driver manager on your system. For more information, see [Configure the ODBC Driver](../../developer-guide/odbc/odbc-linux.md).

### New features and updates

* Added support for multiple SAML integrations.

### Bug fixes

* Fixed performance regression on Windows when using SQLGetData to retrieve the query result. Some cases, such as `varchar`, might still experience performance regression. These issues will be fixed in a future release.
* Fixed an issue where using Okta authentication failed when receiving an HTTP 429 error.
* Fixed an improper error message when creating a DSN with an invalid name.

---
title: ODBC Driver release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc-2025.md
section: Release Notes
---

# ODBC Driver release notes for 2025

This article contains the release notes for the ODBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for ODBC Driver updates.

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

## Version 3.13.0 (November 20, 2025)

### New features and updates

* Added support for Decfloat types.
* Support cross-signed chains during OCSP check.
* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [Certificate revocation list (CRL) options](../../developer-guide/odbc/odbc-parameters.md). Snowflake recommends that you test this feature before enabling it in production.

### Bug fixes

* Removed the trailing null termination character from the JWT header and payload.

## Version 3.11.1 (October 21, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.10.1 (October 21, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.9.1 (October 20, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.8.1 (October 20, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.7.1 (October 20, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.12.0 (October 14, 2025)

### New features and updates

* Improved performance of the multi-threaded bulk fetching workflow.

### Bug fixes

* Fixed a bug that, during OIDC usage, the token was not required, causing errors.
* Fixed the MacOS release to include the x86_64 architecture.
* Fix a bug with `DEFAULT_VARCHAR_SIZE` in configuration of the default `varchar length` parameter.
* Fixed a bug with numeric data conversion when using bulk fetching.

## Version 3.11.0 (August 13, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `workload_identity_provider` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the `authenticator` connection parameter.
* Added the following configuration parameters:

  + `DisableTelemetry` to disable telemetry.
  + `SSLVersionMax` to specify the maximum SSL version.
* Added the `PRIV_KEY_BASE64` and `PRIV_KEY_PWD` connection parameters that allow passing a base64-encoded private key.

### Bug fixes

* Fixed an issue with the in-band telemetry event handler to properly reset the events.
* Fixed the HTTP headers used to authenticate via OKTA.
* Removed the trailing slash from the default `RedirectUri` within the OAuth Authorization process.

## Version 3.10.0 (July 07, 2025)

### Private Preview (PrPr) features

* Added support for sovereign clouds for workload identity federation (WIF).

  + These features can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
  + You should use these features only with non-production data.
  + Support is unavailable for these PrPr features. However, the Product and Engineering teams are available for consultation during PrPr.
  + Contact your account team for participation and documentation.

### New features and updates

* Added support for configuring connection parameters in TOML files.

### Bug fixes

* Fixed an issue with supporting virtual-style domains.
* Fixed an issue that could potentially cause a buffer overflow.

## Version 3.9.0 (June 12, 2025)

### Private Preview (PrPr) features

* Added support for Workload Identity Federation in the AWS, Azure, GCP, and Kubernetes platforms.

  + These features can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
  + You should use these features only with non-production data.
  + Support is unavailable for these PrPr features. However, the Product and Engineering teams are available for consultation during PrPr.
  + Contact your account team for participation and documentation.

### New features and updates

* Added the `LOCAL_APPLICATION` default for the `oauth_client_id` and `oauth_client_secret` OAUTH parameters.
* Extended the Windows UI of the ODBC driver with key-pair authentication parameters `PRIV_KEY_FILE` and `PRIV_KEY_FILE_PWD`.
* Added support virtual-style domains.
* Added the `DriverManagerOverride` configuration parameter, which allows specifying the driver manager on Linux and MacOS.
* Upgraded the driver to SimbaSDK 10.3.

### Bug fixes

* Fixed the incorrect error thrown by fetching the cancellation request.
* Fixed a possible crash triggered by using bulk fetching first (retrieve multiple rows per fetch call), then switching to single row mode.
* Fixed the issue when handling easy-logging configuration could break the connection.
* Fixed an OCSP validation issue on session resuming that could lead to out of memory problem.

## Version 3.8.0 (April 30, 2025)

### New features and updates

* Added support for PAT (Programmatic Access Token), OAuth 2.0 Authorization Code Flow, OAuth 2.0 Client Credentials Flow, and OAuth Token caching:

  + For PAT:

    - Added the `PROGRAMMATIC_ACCESS_TOKEN` parameter for the parameter authenticator.
  + For OAuth 2.0 Authorization Code Flow:

    - Added the `oauth_client_id`, `oauth_client_secret`, `oauth_authorization_url`, `oauth_token_request_url`, and `oauth_scope` DSN parameters.
    - Added the `OAUTH_AUTHORIZATION_CODE` parameter for the parameter authenticator.
  + For OAuth 2.0 Client Credentials Flow:

    - Added the `oauth_client_id`, `oauth_client_secret`, `oauth_token_request_url`, and `oauth_scope` DSN parameters.
    - Added the `OAUTH_CLIENT_CREDENTIALS` parameter for the parameter authenticator.
  + For OAuth Token caching:

    - Passing the UID (username) to the driver configuration is required, and `client_store_temporary_credential` property cannot be set to `false`.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* None.

## Version 3.7.0 (April 14, 2025)

### Private Preview (PrPr) features

Added support for PAT (Programmatic Access Token), OAuth 2.0 Authorization Code Flow, OAuth 2.0 Client Credentials Flow, and OAuth Token caching in Private Preview:

* For PAT:

  + Added the `PROGRAMMATIC_ACCESS_TOKEN` parameter for the parameter authenticator.
* For OAuth 2.0 Authorization Code Flow:

  + Added the `oauth_client_id`, `oauth_client_secret`, `oauth_authorization_url`, `oauth_token_request_url`, and `oauth_scope` DSN parameters.
  + Added the `OAUTH_AUTHORIZATION_CODE` parameter for the parameter authenticator.
* For OAuth 2.0 Client Credentials Flow:

  + Added the `oauth_client_id`, `oauth_client_secret`, `oauth_token_request_url`, and `oauth_scope` DSN parameters.
  + Added the `OAUTH_CLIENT_CREDENTIALS` parameter for the parameter authenticator.
* For OAuth Token caching:

  + Passing the UID (username) to the driver configuration is required, and `clientStoreTemporaryCredential` property cannot be set to `false`.

Disclaimer:

> * These features can only be accessed by setting the `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
> * You should use these features only with non-production data.
> * Support is unavailable for these PrPr features. However, the Product and Engineering teams are available for consultation during PrPr.
> * Contact your account team for participation and documentation.

### New features and updates

* Updated the curl library (libcurl) from version 8.10.1 to 8.12.1.

### Bug fixes

* Enabled the Address Space Layout Randomization (ASLR) security compiler option for Windows.
* Fixed an issue with certain code paths logging the entire SQL query text using the INFO level. For more information see [CVE-2025-46614](https://community.snowflake.com/s/article/Snowflake-Connector-for-ODBC-Security-Advisory-CVE-2025-46614).

## Version 3.6.0 (March 08, 2025)

### New features and updates

* Added support for regional Google Cloud Storage endpoints.

### Bug fixes

* Fixed an issue with the driver crashing when `basic_string::_M_construct` is `null` or not valid or when a segmentation fault because the `HOME` environment variable is unset.
* Fixed an issue with the MacOS Secure Storage helper.
* Fixed issues with lowercasing URL when using OKTA authentication.
* Fixed a logging issue with the `test` button.
* Fixed an issue when a query response omits its length.
* Fixed an issue with the HTTP Date header format depending on the locale.

---
title: ODBC Driver release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/odbc-2026.md
section: Release Notes
---

# ODBC Driver release notes for 2026

This article contains the release notes for the ODBC Driver, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for ODBC Driver updates.

See [ODBC Driver](../../developer-guide/odbc/odbc.md) for documentation.

## Version 3.16.0 (Mar 11, 2026)

### New features and updates

* Upgraded SimbaSDK to version 10.3.7.
* Upgraded libsnowflakeclient to version 2.7.1.
* Upgraded OpenSSL to version 3.0.19.
* Updated `client_environment` telemetry signals to provide more information about the environment.

### Bug fixes

* Fixed the incorrect return size for `SQL_NUMERIC` when `SQL_DECIMAL` to `SQL_C_BINARY` conversion takes place.
* Fixed `SQLProcedures` not returning all procedures.
* Fixed the OAuth Client Credentials flow not routing IdP token requests through the configured HTTP proxy.
* Fixed the incorrect return of `SQL_SUCCESS` instead of `SQL_SUCCESS_WITH_INFO` when the buffer for the converted string is too small.

## Version 3.15.0 (Feb 9, 2026)

### New features and updates

* Deprecated support for CentOS 7, Red Hat Enterprise Linux (RHEL) 7, and Ubuntu 18.04. The minimum supported operating systems are now Red Hat Enterprise Linux (RHEL) 8, Rocky Linux 8, CentOS 8, and Ubuntu 20.04.
* Added the `WORKLOAD_IDENTITY_IMPERSONATION_PATH` connection parameter to support GCP and AWS Workload Identity Federation (WIF) impersonation.
* Added the `singleAuthenticationPrompt` connection parameter to control the authentication flow.
* Added the following operating system details from the `/etc/os-release` file as telemetry during the login request:

  + `NAME`
  + `PRETTY_NAME`
  + `ID`
  + `BUILD_ID`
  + `IMAGE_ID`
  + `IMAGE_VERSION`
  + `VERSION`
  + `VERSION_ID`
* Updated curl to version 8.16.0.
* Updated OpenSSL to version 3.0.18.
* Set `LOCAL_APPLICATION` as the default value for `client_id` and `client_secret` in the OAuth authorization code flow.

### Bug fixes

* Fixed the expired file lock on Linux for secure storage.
* Removed the username requirement for WIF authentication.

## Version 3.14.0 (Jan 12, 2026)

### New features and updates

* Added support for Red Hat Enterprise Linux (RHEL) 9 for x86 and ARM64 architectures.
* Introduced a shared library for extended telemetry to identify and prepare a testing platform for native Rust extensions.
* Introduced warning log messages when HTTP is used for OAuth authorization and token endpoints.
* Added support for injecting the SPCS service identifier token (`SPCS_TOKEN`) into login requests when present in SPCS containers.

  + Introduced the `token_file_path` parameter in the TOML configuration to specify the path to the file containing the token.
  + Introduced the `SKIP_TOKEN_FILE_PERMISSIONS_VERIFICATION` parameter. If set to `true`, the file permission check is omitted.
* Introduced a specific error when exceeding the parameter limit in a query.
* Improved logging.
* Added support for specifying the Azure client ID.
* Enabled handling of the 307 and 308 HTTP redirect codes.

### Bug fixes

* Fixed duplicate error message codes.
* Fixed the default session scope for OAuth authentication.
* Fixed the default CRL cache path creation on Windows.
* Fixed session token leakage in the logs.

---
title: Organization Usage: Updated billing views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1584.md
section: Release Notes
---

# Organization Usage: Updated billing views

The following views have been updated to help reconcile monthly usage statements:

* [CONTRACT_ITEMS](../../../sql-reference/organization-usage/contract_items.md)
* [RATE_SHEET_DAILY](../../../sql-reference/organization-usage/rate_sheet_daily.md)
* [REMAINING_BALANCE_DAILY](../../../sql-reference/organization-usage/remaining_balance_daily.md)
* [USAGE_IN_CURRENCY_DAILY](../../../sql-reference/organization-usage/usage_in_currency_daily.md)

As part of these changes, a small, one-time adjustment might be made for customers who have an active contract. Contracts begin by
completing an order form.

Billing views of the Organization Usage schema and monthly usage statements behave as follows:

Before the change:
:   The following makes billing reconciliation difficult:

    * Billing views include usage that is not included on the monthly usage statement. For example, usage with a value of less than $0.01 is
      not billed on a statement.
    * Billing views do not use rounded values, while official values on the usage statements do.
    * Data types in the billing views are inconsistent.
    * The SERVICE_TYPE column of the RATE_SHEET_DAILY view does not include all types.

    In addition, usage statements contain total usage for the contract by appending the current month to the end of the list.

After the change:
:   Monthly usage statements now contain usage for the current month only.

    The billing views contain the following new columns and updated columns:

    | View | New columns | Updated columns |
    | --- | --- | --- |
    | CONTRACT_ITEMS |  | Data type of AMOUNT changes from NUMBER(38,10) to NUMBER(38,2).  Data type of CONTRACT_MODIFIED changes from TIMESTAMP_LTZ to DATE. |
    | RATE_SHEET_DAILY | * BILLING_TYPE * RATING_TYPE * IS_ADJUSTMENT | SERVICE_TYPE can be any of the possible service types, similar to other views. `compute` is no longer a possible value. |
    | REMAINING_BALANCE_DAILY | MARKETPLACE_CAPACITY_ DRAWDOWN_BALANCE | Data type of ROLLOVER_BALANCE changes from FLOAT to NUMBER(38,2). |
    | USAGE_IN_CURRENCY_DAILY | * BILLING_TYPE * RATING_TYPE * SERVICE_TYPE * IS_ADJUSTMENT | Data type of USAGE changes from NUMBER(38,6) to NUMBER(38,3).  Data type of USAGE_IN_CURRENCY changes from NUMBER(38,6) to NUMBER(38,2). |

Ref: 1584

---
title: Overview of Snowflake release notes
source: https://docs.snowflake.com/en/release-notes/overview.md
section: Release Notes
---

# Overview of Snowflake release notes

[New server releases of Snowflake are deployed each week](../user-guide/intro-releases.md). In addition, between these server
releases, features are updated, and new versions of clients, drivers, libraries, and Snowflake Connectors are released.

## Release notes

[All release notes](all-release-notes)
:   Use this page to filter release notes by type (for example, server releases, feature updates, client releases, and connector
    releases) and by date.

[Snowflake server release notes and feature updates](new-features.md)
:   Release notes for [Snowflake server releases](../user-guide/intro-releases.md) and feature updates.

    * For the most recent release note announcements, see [Snowflake server release notes and feature updates](new-features.md).
    * For earlier release notes, see:

      + [Server releases and feature updates in 2025](new-features-2025.md)
      + [Server releases and feature updates in 2024](new-features-2024.md)
      + [Server releases and feature updates in 2023](new-features-2023.md)

Client, driver, and library release notes
:   [Release notes](clients-drivers/monthly-releases.md) and [support information](requirements.md) on clients (such as
    SnowSQL and Snowflake CLI), drivers (such as the JDBC driver, ODBC driver, and the Snowflake Connector for Python), and libraries
    (such as the Snowpark API and the Snowflake Python APIs).

Snowflake Connectors release notes
:   Release notes for [Snowflake Connectors](https://other-docs.snowflake.com/connectors.html), which provide native integration of third-party applications and database systems in
    Snowflake (for example, [Snowflake Connector for ServiceNow](https://other-docs.snowflake.com/connectors/servicenow/about.html)).

## Behavior changes

[Behavior change announcements](behavior-changes.md)
:   Announcements about [behavior changes](intro-bcr-releases.md).

## Recent improvements

[Performance Improvements](performance-improvements.md)
:   Summaries of recent features that improve performance.

[SQL improvements](sql-improvements.md)
:   Summaries of recent features that make it easier to write simpler SQL.

## Feature information

[Preview features](preview-features.md)
:   Information about preview features in Snowflake.

---
title: PACKAGES View (Information Schema): New RUNTIME_VERSION Column in View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1094.md
section: Release Notes
---

# PACKAGES View (Information Schema): New RUNTIME_VERSION Column in View

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The following column was added to the PACKAGE view in the INFORMATION_SCHEMA schema:

| Column Name | Data Type | Description |
| --- | --- | --- |
| RUNTIME_VERSION | VARCHAR | The supported language runtime version that the package can run with. |

Previously the column only applies for Python packages:

* Python packages: The value of the column is of the form `<major>.<minor>` reflecting the Python version (e.g. `3.9`). If a package
  can run with multiple Python versions, there will be a row for each version.
* Non-Python packages: The value is NULL.

For example, the following query will return all of the available packages that can run with Python 3.9 on the Snowpark platform:

```sqlexample
SELECT * FROM <db_name>.INFORMATION_SCHEMA.PACKAGE
    WHERE LANGUAGE = 'python' AND RUNTIME_VERSION='3.9';
```

Where `<db_name>` is the name of any existing database.

Ref: 1094

---
title: Parquet Files: Statistics Included for Decimal Columns Unloaded as FixedLengthByteArrays
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-976.md
section: Release Notes
---

# Parquet Files: Statistics Included for Decimal Columns Unloaded as FixedLengthByteArrays

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Parquet files behave as follows:

Previously:
:   Parquet files do not include column statistics for decimal columns unloaded as `FixedLengthByteArrays`.

Currently:
:   Parquet files will include column statistics for decimal columns unloaded as `FixedLengthByteArrays`.

Ref: 976

---
title: PASSWORD_POLICIES view: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1309.md
section: Release Notes
---

# PASSWORD_POLICIES view: New columns

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

The Account Usage PASSWORD_POLICIES view behaves as follows:

Before the change:
:   The PASSWORD_POLICIES view did not include the PASSWORD_MIN_AGE_DAYS and PASSWORD_HISTORY columns.

After the change:
:   The PASSWORD_POLICIES view includes the PASSWORD_MIN_AGE_DAYS and PASSWORD_HISTORY history columns as follows:

    | Column Name | Data Type | Description | Notes |
    | --- | --- | --- | --- |
    | `PASSWORD_MIN_AGE_DAYS` | INT | The number of days a user must wait before a recently changed password can be changed again. | This is the fifteenth ordinal column in the output. |
    | `PASSWORD_HISTORY` | INT | The number of the most recent passwords that Snowflake stores. These stored passwords cannot be repeated when a user updates their password value. | This is the last column in the output. |

    Snowflake adds these columns to the view to keep track of properties that are set for the password policy. For details, see
    [Best practices for password policies and passwords](../../../user-guide/password-authentication.md).

Ref: 1309

---
title: Performance Improvements
source: https://docs.snowflake.com/en/release-notes/performance-improvements.md
section: Release Notes
---

# Performance Improvements

Snowflake is constantly introducing features that improve performance. Many of these features require no user interaction and no additional
cost. Improvements that make queries run faster can reduce costs because a warehouse only consumes credits while it is running.

> **Important:**
>
> Performance improvements often target specific query patterns or workloads. These improvements might or might not have a material impact
> on a specific workload.

If you have questions about any of these features, please feel free to contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

* [2026 Performance improvements](performance-improvements-2026.md)
* [2025 Performance improvements](performance-improvements-2025.md)
* [2024 Performance Improvements](performance-improvements-2024.md)
* [2023 Performance Improvements](performance-improvements-2023.md)
* [2022 Performance Improvements](performance-improvements-2022.md)

---
title: Personal databases and private notebooks (Private notebooks deprecated)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1872.md
section: Release Notes
---

# Personal databases and private notebooks (*Private notebooks deprecated*)

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

This behavior change was originally introduced in the 2025_01 bundle. To give users additional evaluation time, this behavior change remains
disabled in 2025_02. If you previously enabled personal databases in 2025_01 or set the `ENABLE_PERSONAL_DATABASE` parameter and want to opt
out, you must now [disable](../../../user-guide/ui-snowsight/workspaces.md) the `ENABLE_PERSONAL_DATABASE` parameter.

When a user creates a private notebook, a personal database and a schema named PUBLIC are created for the user. The personal database acts
as a dedicated workspace for creating, modifying, and managing private notebooks, which are accessible only to the individual user.

You can use the following SHOW and DESCRIBE commands to view information about private notebooks, personal databases, and schemas
inside those personal databases:

* [DESCRIBE NOTEBOOK](../../../sql-reference/sql/desc-notebook.md)
* [SHOW DATABASES](../../../sql-reference/sql/show-databases.md)
* [SHOW SCHEMAS](../../../sql-reference/sql/show-schemas.md)
* [SHOW NOTEBOOKS](../../../sql-reference/sql/show-notebooks.md)

Before the change:
:   When a user creates a notebook, it is owned by the user’s current role and viewable by any user with that role.

After the change:
:   By default, notebooks are private and owned by the current user. When a private notebook is ready for production, a user can deploy it to
    a standard, non-personal database.

Ref: 1872

---
title: PHP PDO Driver for Snowflake release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes

The PHP PDO Driver for Snowflake release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](php-pdo-2026.md)
* [2025 releases](php-pdo-2025.md)
* [2024 releases](php-pdo-2024.md)
* [2023 releases](php-pdo-2023.md)
* [2022 releases](php-pdo-2022.md)

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

---
title: PHP PDO Driver for Snowflake release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo-2022.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes for 2022

This article contains the release notes for the PHP PDO Driver for Snowflake, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> For release note information for versions released prior to January 2022, see the [Client Release History](https://community.snowflake.com/s/article/client-release-history).

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

## Version 1.2.5 (October 26, 2022)

### New features

* Added new `proxy` and `no_proxy` connection settings.

### Bug fixes

* Fixed an issue with key pair authentication.

## Version 1.2.4 (August 23, 2022)

### New features

* Added support for key-pair authentication.

## Version 1.2.3 (July 08, 2022)

### Bug fixes

* Fixed an issue that caused `autocommit` to turn over when creating a connection with options.

Version 1.2.2 (May 24, 2022)

### Bug fixes

* Fixed an issue related to loading the PHP driver with php-fpm.
* Upgraded libsnowflakeclient to version 0.6.12.

## Version 1.2.1 (Mar 16, 2022)

### New features

* Added support for PHP 8.1.
* Updated documentation to update the supported PHP versions.

---
title: PHP PDO Driver for Snowflake release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo-2023.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes for 2023

This article contains the release notes for the PHP PDO Driver for Snowflake, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for PHP PDO Driver for Snowflake updates.

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

## Version 2.0.1 (November 09, 2023)

### Behavior Change Release (BCR) changes

Starting with version 2.0.1 of the PHP PDO driver, PHP versions 7.3 and 7.4 are no longer supported.

### New features and updates

* Updated the following libraries:

  + openssl from 3.0.9 to 3.0.11
  + curl from 8.1.2 to 8.4.0
* Added the `login_timeout`, `retryTimeout`, and `max_login_retries` connection parameters to manage the frequency of retries for
  unsuccessful connection requests.

### Bug fixes

* None.

## Version 2.0.0 (September 29, 2023)

### Behavior Change Release (BCR) changes

Starting with version 2.0.0 of the PHP PDO driver:

* Upgraded from openssl 1.1.1 to openssl 3.0.9. Consequently, private keys generated using the deprecated encryption
  algorithms in previous openssl library versions no longer work. When you update to PHP PDO 2.0.0 you must
  regenerate your private key file used for key pair authentication.

### New features and updates

* Added support for PHP 8.2.
* Added support for Mac ARM64 systems.
* Added specific error messages that are generated when building an application if `cmake` is not installed.
* Added support for getting the driver version programmatically with `PDO::getAttribute()` with `PDO::ATTR_CLIENT_VERSION`.
* Added the `PDO::SNOWFLAKE_ATTR_QUERY_ID` attribute to get query ids through `PDO::getAttribute()` or `PDOStatement::getAttribute()`.
* Added support for hybrid transactional and analytical processing:

  + Added retry context in retries for query requests.
  + Added query context caching.
* Updated the following software libraries:

  + Updated `curl` from version 7.88.1 to 8.1.2.
  + Updated `util-linux` from version 2.36.1 to 2.39.0.
  + Updated the `cacert` bundle used for SSL connections.

### Bug fixes

* Fixed an issue where the driver did not use the entire OCSP URL in the certificate when performing OCSP validation.

## Version 1.2.7 (May 23, 2023)

### New features

None.

### Bug fixes

* Fixed an issue where a connection could fail when using a proxy that doesn’t need a username and password.

## Version 1.2.6 (January 24, 2023)

### New features

None.

### Bug fixes

* Fixed an issue where the driver returned empty strings (“”) instead of NULL values when using PHP 8.1.

---
title: PHP PDO Driver for Snowflake release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo-2024.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes for 2024

This article contains the release notes for the PHP PDO Driver for Snowflake, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for PHP PDO Driver for Snowflake updates.

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

## Version 3.0.3 (October 30, 2024)

### New features and updates

* Upgraded libsnowflakeclient to version 1.1.0.
* Upgraded openssl to version 3.0.15.
* Upgraded curl to version 8.10.1.

### Bug fixes

* None.

## Version 3.0.2 (August 29, 2024)

### New features and updates

* Increased the maximum allowable large object (LOB) size.

### Bug fixes

* None.

## Version 3.0.1 (July 24, 2024)

### New features and updates

* Removed the hardcoded top-level domain.

### Bug fixes

* Fixed an issue with Microsoft Windows not honoring proxy settings in environment variables.

## Version 3.0.0 (June 18, 2024)

### BCR (Behavior Change Release) changes

* PHP 8.0 is no longer supported.
* The minimum gcc compiler version for Linux changed from version 5.2 to 8.3.
* The default `loglevel` changed from TRACE to FATAL.

### New features and updates

* Added support for PHP version 8.3.
* Improved performance for connection reuse.

### Bug fixes

* Fixed an issue where the driver always created a log folder whether or not logging is actually needed.

## Version 2.0.3 (April 29, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed an issue where timeout values in connection parameters weren’t honored.

## Version 2.0.2 (February 22, 2024)

### New features and updates

* Updated the following libraries:

  + curl from version 8.4.0 to 8.6.0
  + openssl from version 3.0.11 to 3.0.13
  + zlib from version 1.2.13 to 1.3.1

### Bug fixes

* Fixed an issue related to segmentation faults when the log level is set to DEBUG.
* Fixed an issue where the dependency libraries could’nt be downloaded when building the driver with Visual Studio 2019.

---
title: PHP PDO Driver for Snowflake release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo-2025.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes for 2025

This article contains the release notes for the PHP PDO Driver for Snowflake, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for PHP PDO Driver for Snowflake updates.

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

## Version 3.4.0 (Dec 03, 2025)

### New features and updates

* Added native OKTA authentication support.
* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. Snowflake recommend you test this feature in advisory mode before enabling it in production.

### Bug fixes

* Fixed the aarch64 build on MacOS.

## Version 3.3.0 (Aug 27, 2025)

### New features and updates

* Added ARM64 support for Linux.
* Added support for the Easy Logging feature in a configuration file.

### Bug fixes

* None.

## Version 3.2.0 (May 20, 2025)

### New features and updates

* Added support mult-factor authentication (MFA).

### Bug fixes

* Fixed a memory leak that occurred when fetching results.
* Fixed an OCSP configuration issue.

## Version 3.1.0 (January 29, 2025)

### New features and updates

* Added support for Visual Studio 2022 (VS17).
* Added support for PHP 8.4.

### Bug fixes

* Fixed an issue with executing unsupported queries like PUT or GET on stages causes a signed-to-unsigned conversion error that crashes the application using the driver. For more information, see [CVE-2025-24792](https://github.com/snowflakedb/pdo_snowflake/security/advisories/GHSA-f8q2-7fv5-cg93).

---
title: PHP PDO Driver for Snowflake release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/php-pdo-2026.md
section: Release Notes
---

# PHP PDO Driver for Snowflake release notes for 2026

This article contains the release notes for the PHP PDO Driver for Snowflake, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for PHP PDO Driver for Snowflake updates.

See [PHP PDO Driver for Snowflake](../../developer-guide/php-pdo/php-pdo-driver.md) for documentation.

## Version 3.6.0 (Mar 05, 2026)

### New features and updates

* Implemented richer `client_environment` telemetry information to include on which environment the driver runs (such as Lambda, EC2, GCP, Azure VM, and so on) and whether the managed identity is enabled.
* Added support for workload identity federation authentication, including the following new connection parameters:

  + `workload_identity_provider` - Platform of the workload identity provider. Possible values include: AWS, AZURE, GCP, and OIDC.
  + `workload_identity_azure_resource` - If the AZURE `workload_identity_provider` is used, this parameter sets the resource that the driver should use to idenitify itself.
  + `workload_identity_impersonation_path` - An array of strings that provides an identity chain to use when connecting to Snowflake. Array elements are either a full service account address or a service account’s unique ID.

    Impersonation works by following each array entry to obtain a token that allows authorization of the next service account. Each account in the identity chain needs permissions to impersonate the next account only. The final account in the list obtains your Snowflake connection token and uses it to connect to Snowflake.

    This parameter is supported for AWS and Google Cloud workloads and only applies when `authenticator=WORKLOAD_IDENTITY`.
* Updated OpenSSL to 3.0.19.
* Added support for multistatement queries.

### Bug fixes

* None.

## Version 3.5.0 (Feb 03, 2026)

### New features and updates

* Added support for Red Hat Enterprise Linux (RHEL) 9.
* Deprecated CentOS 7 builds. Rocky 8/RHEL8 is now the minimum system version.
* Added a warning for HTTP usage in OAuth authentication flows.
* Set `LOCAL_APPLICATION` as a default for the `client_id` and `client_secret` for the OAuth Authorization code flow.
* Updated Curl to 8.16.0.
* Removed the workload identity federation (WIF) auto-detection mechanism.
* Added auto-detection of the application path and included it in the `CLIENT_ENVIRONMENT` variable.
* Updated OpenSSL to 3.0.18

### Bug fixes

* Fixed the expired file lock on Linux for the Secure Storage.
* Removed the username requirement for the WIF authentication.

---
title: Pipe usage history and COPY history views: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2045.md
section: Release Notes
---

# Pipe usage history and COPY history views: New column

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the following views include a new column, `BYTES_BILLED`:

* [ACCOUNT_USAGE.PIPE_USAGE_HISTORY](../../../sql-reference/account-usage/pipe_usage_history.md)
* [ORGANIZATION_USAGE.PIPE_USAGE_HISTORY](../../../sql-reference/organization-usage/pipe_usage_history.md)
* [INFORMATION_SCHEMA.PIPE_USAGE_HISTORY](../../../sql-reference/functions/pipe_usage_history.md)
* [ACCOUNT_USAGE.COPY_HISTORY](../../../sql-reference/account-usage/copy_history.md)
* [ORGANIZATION_USAGE.COPY_HISTORY](../../../sql-reference/organization-usage/copy_history.md)
* [INFORMATION_SCHEMA.COPY_HISTORY](../../../sql-reference/functions/copy_history.md)

| Column name | Data type | Description |
| --- | --- | --- |
| `BYTES_BILLED` | NUMBER | Represents the number of bytes Snowpipe uses for billing purposes, providing clearer visibility into Snowpipe’s cost implications directly within these history views. |

The new `BYTES_BILLED` column aims to enhance transparency and simplify cost analysis for Snowpipe usage. This new column provides a direct metric for the data volume that is considered for billing, making it easier to monitor and manage Snowpipe-related expenses. Users can now directly query these views to gain insights into the billed data volume for their pipe and copy operations.

Ref: 2045

---
title: PIPES views and commands: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2137.md
section: Release Notes
---

# PIPES views and commands: New column

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the following commands and views include the new column IS_SNOWFLAKE_MANAGED:

* [SHOW PIPES](../../../sql-reference/sql/show-pipes.md) command: New column in output
* [DESCRIBE PIPES](../../../sql-reference/sql/desc-pipe.md) command: New column in output
* [PIPES](../../../sql-reference/account-usage/pipes.md) view (Account Usage): New column
* [PIPES](../../../sql-reference/organization-usage/pipes.md) view (Organization Usage): New column

| Column name | Data type | Description |
| --- | --- | --- |
| IS_SNOWFLAKE_MANAGED | BOOLEAN | Indicates whether the pipe is managed by Snowflake.  If TRUE, the pipe is a Snowflake-managed default pipe for Snowpipe Streaming with high-performance architecture. Snowflake automatically creates and manages default pipes for every target table, so that you can start streaming data immediately without creating a pipe manually. For more information, see [Default pipe](../../../user-guide/snowpipe-streaming/snowpipe-streaming-pipe-object.md).  If FALSE, the pipe is user-managed. |

Ref: 2137

---
title: Preview features
source: https://docs.snowflake.com/en/release-notes/preview-features.md
section: Release Notes
---

# Preview features

Preview features have been implemented and tested in Snowflake; however, full usability and corner-case handling may not be complete yet.
We do not guarantee the use of these features against defects that may produce unexpected or undesired results. Additionally, we may change
the behavior of features while they are in preview. If we change the behavior of a preview feature, we do our best to notify users before
making the change, but we do not guarantee to always pre-announce changes.

In addition, preview features can be disabled, enabled, or viewed for an entire account.
See Managing access to all preview features, on this page, for details.

> **Attention:**
>
> Preview features are provided primarily for evaluation and testing purposes. They should not be used in production systems or with
> production data.
>
> For more details about the usage of preview features, see the Snowflake
> [Preview Terms of Service](https://www.snowflake.com/legal/preview-terms-of-service/).

## Preview availability

Availability is determined on a per-feature basis:

Open:
:   Most preview features are *Open*, meaning they are enabled by default for all accounts and, therefore, openly available for use.

On Request:
:   Some preview features are provided *On Request*, particularly in the early stages of the preview period. To request access to these
    features for your account, you must contact Snowflake.

Some preview features are available only in certain [Snowflake editions](../user-guide/intro-editions.md) or in specific
[cloud platforms](../user-guide/intro-cloud-platforms.md) or [regions](../user-guide/intro-regions.md).

## Features currently in preview

The following features are currently available for preview, listed roughly in the order in which they were introduced:

| Feature | Availability | Introduced | Additional reading | Notes |
| --- | --- | --- | --- | --- |
| Adaptive Compute | Open | April 2026 | [Adaptive Compute](../user-guide/warehouses-adaptive.md) |  |
| Batch Cortex Search | Open | March 2026 | [Batch Cortex Search](../user-guide/snowflake-cortex/cortex-search/batch-cortex-search.md) |  |
| Interval data types | Open | March 2026 | [Interval data types](../sql-reference/data-types-datetime.md) |  |
| Apache Iceberg™ tables: Support for the Azure Data Lake Storage Gen2 with external volumes | Open | March 2026 | [Configure an external volume for Azure](../user-guide/tables-iceberg-configure-external-volume-azure.md) |  |
| Artifacts in Snowflake Intelligence | Open | March 2026 | [Artifacts in Snowflake Intelligence](../user-guide/snowflake-cortex/snowflake-intelligence/artifacts.md) |  |
| Apache Iceberg™ tables: Write support by using an external query engine | Open | March 2026 | [Write to Iceberg tables with an external query engine](../user-guide/tables-iceberg-access-using-external-query-engine-snowflake-horizon.md) |  |
| Snowflake storage for Apache Iceberg™ tables | Open | April 2026 | [Snowflake storage for Apache Iceberg™ tables](../user-guide/tables-iceberg-internal-storage.md) |  |
| AI code suggestions in Workspaces | Open | March 2026 | [AI code suggestions](../user-guide/cortex-code/cortex-code-snowsight.md) |  |
| Semantic view materializations | Closed | March 2026 | [Materializing dimensions and metrics in semantic views](../LIMITEDACCESS/semantic-views-materialization.md) |  |
| Specifying the relationship to use in a semantic view | Open | March 2026 | [Specifying the relationship for a metric when multiple relationship paths exist](../user-guide/views-semantic/sql.md) |  |
| Supported Java and Scala APIs for Snowpark Connect | Open | March 2026 | [PySpark APIs supported for Snowpark Connect for Spark](../developer-guide/snowpark-connect/snowpark-connect-supported-apis.md) |  |
| Exporting a semantic view to a Tableau Data Source (TDS) file | Open | March 2026 | [Exporting a semantic view to a Tableau Data Source (TDS) file](../user-guide/views-semantic/sql.md) |  |
| Support for version 3 of the Apache Iceberg™ table specification | Open | March 2026 | [Apache Iceberg™ tables: Support for Apache Iceberg™ v3 (Preview)](../user-guide/tables-iceberg-v3-specification-support.md) | Includes support for the `geography`, `geometry`, `nanosecond`, and `variant` data types and default values, deletion vectors, and row lineage. |
| pg_lake extension for Snowflake Postgres | Open | March 2026 | [Configuring S3 Storage for pg_lake](../user-guide/snowflake-postgres/postgres-pg_lake.md) |  |
| Apache Iceberg™ tables: Partitioned writes with a hierarchical path layout | Open | February 2026 | [Partitioning with hierarchical paths](../user-guide/tables-iceberg-metadata.md) |  |
| AGENT_RUN function | Open | February 2026 | [AGENT_RUN (SNOWFLAKE.CORTEX)](../sql-reference/functions/agent_run-snowflake-cortex.md) |  |
| Container Runtime version selection | Open | February 2026 | [Snowflake Container Runtime release notes](../developer-guide/snowflake-ml/container-runtime/releases.md) |  |
| Restricted caller’s rights in Streamlit in Snowflake | Open | February 2026 | [Restricted caller’s rights and Streamlit in Snowflake](../developer-guide/streamlit/features/restricted-callers-rights.md) |  |
| Semantic views: Joining tables containing ranges of values | Open | February 2026 | [Joining logical tables that contain ranges of values](../user-guide/views-semantic/sql.md) |  |
| DATA_AGENT_RUN function | Open | February 2026 | [DATA_AGENT_RUN (SNOWFLAKE.CORTEX)](../sql-reference/functions/data_agent_run-snowflake-cortex.md) |  |
| Strong Authentication Hub | Open | February 2026 | [Strong Authentication Hub](../user-guide/strong-authentication-hub.md) |  |
| Cortex Code Data Science and Machine Learning skill | Open | February 2026 | [Cortex Code CLI](../user-guide/cortex-code/cortex-code-cli.md) |  |
| Overview tab in the Trust Center | Open | February 2026 | [Trust Center](../user-guide/trust-center/overview.md) |  |
| Collaboration Data Clean Rooms | Open | February 2026 | [About Snowflake Data Clean Rooms](../user-guide/cleanrooms/about.md) |  |
| Cortex Code in Snowsight | Open | February 2026 | [Cortex Code in Snowsight](../user-guide/cortex-code/cortex-code-snowsight.md) |  |
| Fine-tuning `arctic-extract` models | Open | January 2026 | [Fine-tuning arctic-extract models](../user-guide/snowflake-cortex/arctic-extract-finetuning.md) |  |
| Image extraction with AI_PARSE_DOCUMENT | Open | January 2026 | [Cortex AI Functions: Image extraction with AI_PARSE_DOCUMENT](../user-guide/snowflake-cortex/image-extraction.md) |  |
| Consumer-controlled maintenance policies | Open | January 2026 | [Consumer-controlled maintenance policies](../developer-guide/native-apps/consumer-maintenance-policies.md), [Consumer-controlled maintenance policies: Provider guide](../developer-guide/native-apps/consumer-maintenance-policies-provider.md) | Provider-side support added April 2026. |
| Sensitive data classification in the Trust Center | Open | January 2026 | [Use the Trust Center to set up sensitive data classification](../user-guide/classify-ui-trust-center.md) | Sensitive data classification using SQL is already generally available. |
| External lineage | Open | January 2026 | [External lineage](../user-guide/external-lineage.md) |  |
| Using the Snowpark Python JDBC | Open | January 2026 | [Using the Snowpark Python JDBC](../developer-guide/snowpark/python/snowpark-jdbc.md) |  |
| Data quality notifications | Open | January 2026 | [Sending notifications for data quality issues](../user-guide/data-quality-notifications.md) |  |
| Snowflake High Performance connector for Kafka | Open | December 2025 | [Snowflake High Performance connector for Kafka](../connectors/kafkahp/about.md) |  |
| Notebooks in Workspaces | Open | December 2025 | [Snowflake Notebooks in Workspaces](../user-guide/ui-snowsight/notebooks-in-workspaces/notebooks-in-workspaces-overview.md) |  |
| Support for Scala version 2.13 | Open | December 2025 | [Prerequisites](../developer-guide/udf/scala/udf-scala-introduction.md) |  |
| Optimize an existing semantic view or model with verified queries | Open | December 2025 | [Optimize an existing semantic view or model with verified queries](../user-guide/snowflake-cortex/cortex-analyst/analyst-optimization.md) |  |
| Import machine learning models from external services | Open | November 2025 | [Import and deploy models from an external service](../developer-guide/snowflake-ml/model-registry/snowsight-ui.md) |  |
| Document Processing Playground | Open | November 2025 | [Document Processing Playground](../user-guide/snowflake-cortex/document-processing-playground.md) |  |
| Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric | Open | November 2025 | [Query Snowflake-managed Apache Iceberg™ tables by using Microsoft Fabric](../user-guide/tables-iceberg-query-using-microsoft-fabric.md) |  |
| Configure a catalog integration for OneLake REST | Open | November 2025 | [Configure a catalog integration for OneLake REST](../user-guide/tables-iceberg-configure-catalog-integration-rest-onelake.md) |  |
| Cortex Analyst Routing Mode | Open | November 2025 | [Routing Mode for Cortex Analyst](../user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-routing-mode.md) |  |
| Data quality anomaly detection | Open | November 2025 | [Detecting anomalies in data quality](../user-guide/data-quality-anomaly.md) |  |
| Configure replication for Snowflake-managed Apache Iceberg™ tables | Open | November 2025 | [Configure replication for Snowflake-managed Apache Iceberg™ tables](../user-guide/tables-iceberg-replication.md) |  |
| Trust Center extensions | Open | November 2025 | [Using Trust Center extensions](../user-guide/trust-center/trust-center-extensions.md) |  |
| Executing Scala code using Snowpark Connect for Spark | Open | November 2025 | [Run Scala code from your client](../developer-guide/snowpark-connect/snowpark-connect-workloads-jupyter.md) |  |
| Publishing and consuming public marketplace listings in VPS regions | Open | October 2025 | [Snowflake Marketplace version 2 listings in VPS deployments](../collaboration/collaboration-marketplace-about.md) |  |
| Listings in government regions can be shared on the internal marketplace | Open | October 2025 | [About organizational listings](../user-guide/collaboration/listings/organizational/org-listing-about.md) |  |
| Verified Query suggestions | Open | October 2025 | [Suggestions for semantic models and views](../user-guide/snowflake-cortex/cortex-analyst/verified-query-suggestions.md) |  |
| Use organization user groups with organizational listings | Open | October 2025 | [Use organization user groups with organizational listings](../user-guide/collaboration/listings/organizational/org-listings-org-user-groups.md) |  |
| Make database objects discoverable in Universal Search | Open | October 2025 | [Make database objects discoverable in Universal Search](../user-guide/ui-snowsight/object-visibility-universal-search.md) |  |
| Declarative Sharing for Native Apps | Open | September 2025 | [About Declarative Sharing in the Native Application Framework](../developer-guide/declarative-sharing/about.md) |  |
| Cortex Agent Monitoring | Open | September 2025 | [Monitor Cortex Agent requests](../user-guide/snowflake-cortex/cortex-agents-monitor.md) |  |
| Position row-level deletes for writing to externally managed Apache Iceberg™ tables | Open | September 2025 | * [Write support for externally managed Apache Iceberg™ tables](../user-guide/tables-iceberg-externally-managed-writes.md) * [Use row-level deletes](../user-guide/tables-iceberg-manage.md) |  |
| SnowConvert AI Verification | Open | September 2025 | [AI code conversion](../migrations/snowconvert-docs/snowconvert-ai-verification.md) |  |
| SnowConvert AI - ETL Migration | Open | October 2025 | [ETL Migration](../migrations/snowconvert-docs/general/user-guide/etl-migration-replatform.md) | Public preview feature for migrating SSIS packages to dbt projects on Snowflake. |
| Data quality in Snowsight | Open | September 2025 | * [Monitoring data quality checks in Snowsight](../user-guide/data-quality-ui-monitor.md) * [Use data profiling to understand your data](../user-guide/data-quality-profile.md) |  |
| Gap-filling time-series data | Open | September 2025 | [RESAMPLE](../sql-reference/constructs/resample.md) |  |
| Cortex Agents object REST API | Open | September 2025 | [Configure and interact with Agents](../user-guide/snowflake-cortex/cortex-agents-manage.md) |  |
| Surcharging of compute pool usage of a Snowflake Native App with containers | Open | August 2025 | * [Compute pool surcharges in Snowflake Native Apps with containers](../developer-guide/snowpark-container-services/provider-pricing-surcharges.md) * [MARKETPLACE_PROVIDER_SPCS_USAGE View](../collaboration/views/marketplace-provider-spcs-usage-ds.md) |  |
| Updated access control for data quality monitoring | Open | August 2025 | [Required privilege on the table or view](../user-guide/data-quality-access-control.md) |  |
| Cortex Search Service replication | Open | August 2025 | [Replicate a Cortex Search Service](../user-guide/snowflake-cortex/cortex-search/cortex-search-replication.md) |  |
| Snowpark Container Services new stage volume implementation | Open | August 2025 | [Using Snowflake stage volumes with services](../developer-guide/snowpark-container-services/snowflake-stage-volume.md) |  |
| Snapshots | Open | August 2025 | [Backups for disaster recovery and immutable storage](../user-guide/backups.md) |  |
| Reading data from external data sources using Snowpark Python DB-API | Open | August 2025 | [Using the Snowpark Python DB-API](../developer-guide/snowpark/python/reading-data-from-external-sources.md) | Use Snowpark Python to programmatically pull data from external databases into Snowflake. |
| Cortex Agents admin configuration UI | Open | August 2025 | [Configure and interact with Agents](../user-guide/snowflake-cortex/cortex-agents-manage.md) |  |
| Snowpark Container Services batch jobs | Open | August 2025 | [Run multiple replicas of a job service (batch jobs)](../developer-guide/snowpark-container-services/working-with-services.md) |  |
| Snowflake Intelligence | Open | August 2025 | [Overview of Snowflake Intelligence](../user-guide/snowflake-cortex/snowflake-intelligence.md) |  |
| Disable public access to privatelink-only accounts | Open | July 2025 | [Enforcement of privatelink-only access](../user-guide/security-disable-public-access-privatelink.md) |  |
| Manage integrations using Snowsight | Open | June 2025 | [Managing integrations in Snowsight](../user-guide/ui-snowsight-integrations.md) |  |
| Preconfigured Notebook runtimes | Open | June 2025 | [Create a notebook](../user-guide/ui-snowsight/notebooks-create.md) |  |
| Snowflake Copilot inline | Open | June 2025 | [Using Snowflake Copilot inline](../user-guide/snowflake-copilot-inline.md) |  |
| Snowflake Cortex Playground | Open | June 2025 | [Cortex Playground](../user-guide/snowflake-cortex/cortex-playground.md) |  |
| New in-app notifications from Trust Center in Snowsight | Open | May 2025 |  |  |
| Snowpark Container Services available to Snowflake accounts on Google Cloud | Open | May 2025 | [Snowpark Container Services](../developer-guide/snowpark-container-services/overview.md) |  |
| Automated refresh and auto-ingest pipes for internal named stages | Open | April 2025 | * [CREATE STAGE](../sql-reference/sql/create-stage.md) * [CREATE PIPE](../sql-reference/sql/create-pipe.md) * [Automated directory table refreshes for internal stages](../user-guide/data-load-dirtables-auto.md) | Currently available for Snowflake accounts hosted on AWS. |
| Configuring automatic suspension of a Snowpark Container Services service | Open | April 2025 | [CREATE SERVICE](../sql-reference/sql/create-service.md) |  |
| Google Cloud Private Service Connect in Streamlit in Snowflake | Open | April 2025 | [Private connectivity for Streamlit in Snowflake](../developer-guide/streamlit/object-management/privatelink.md) |  |
| Multi-file editing in Streamlit in Snowflake | Open | March 2025 | [Edit your Streamlit app](../developer-guide/streamlit/app-development/editing-your-app.md) |  |
| Git integration for Streamlit in Snowflake | Open | March 2025 | [Sync Streamlit in Snowflake apps with a Git repository](../developer-guide/streamlit/features/git-integration.md) |  |
| Cloning databases that contain hybrid tables | Open | March 2025 | [Clone databases that contain hybrid tables](../user-guide/tables-hybrid-clone.md) |  |
| Snowflake Native Apps with Snowpark Container Services - Support for Azure Private Link | Open | March 2025 | [Mar 03, 2025: Native Apps with Snowpark Container Services - Support for Azure Private Link (Preview)](2025/other/2025-03-03-na-spcs-azure-pl-pupr.md) |  |
| Release channels in Snowflake Native Apps | Open | February 2025 | [Publish an app using release channels](../developer-guide/native-apps/release-channels.md) |  |
| CREATE OR ALTER <OBJECT> | Open | February 2025 | * [CREATE OR ALTER AUTHENTICATION POLICY](../sql-reference/sql/create-authentication-policy.md) * [CREATE OR ALTER FILE FORMAT](../sql-reference/sql/create-file-format.md) * [CREATE OR ALTER TAG](../sql-reference/sql/create-tag.md) | Additional commands that create an object if it doesn’t exist, or alters it according to the object definition. |
| Viewing Snowpipe in Snowsight | Open | February 2025 | [Manage Snowpipe in Snowsight](../user-guide/data-load-snowpipe-snowsight.md) |  |
| Snowpark Checkpoints Library | Open | January 2025 | [Snowpark Checkpoints](../developer-guide/snowpark/python/snowpark-checkpoints-library.md) |  |
| Snowflake Python Demos API | Open | January 2025 | [Snowflake Python Demos API](../developer-guide/snowflake-python-api/snowflake-python-demos.md) |  |
| Join policies | Open | January 2025 | [Join policies](../user-guide/join-policies.md) |  |
| CREATE OR ALTER <OBJECT> | Open | December 2024 | * [CREATE OR ALTER DATA METRIC FUNCTION](../sql-reference/sql/create-data-metric-function.md) * [CREATE OR ALTER EXTERNAL FUNCTION](../sql-reference/sql/create-external-function.md) * [CREATE OR ALTER FUNCTION](../sql-reference/sql/create-function.md) * [CREATE OR ALTER FUNCTION (Snowpark Container Services)](../sql-reference/sql/create-function-spcs.md) * [CREATE OR ALTER PROCEDURE](../sql-reference/sql/create-procedure.md) | Additional commands that create an object if it doesn’t exist, or alters it according to the object definition. |
| Executing a Snowpark Container Services job service asynchronously | Open | December 2024 | [EXECUTE JOB SERVICE](../sql-reference/sql/execute-job-service.md) |  |
| Restricted caller’s rights | Open | December 2024 | [Restricted caller’s rights](../developer-guide/restricted-callers-rights.md) |  |
| Using block storage volumes with job services. | Open | November 2024 | [Using block storage volumes with services](../developer-guide/snowpark-container-services/block-storage-volume.md) |  |
| Apache Iceberg™ table support for Microsoft Fabric OneLake | Open | November 2024 | [CREATE EXTERNAL VOLUME (Azure)](../sql-reference/sql/create-external-volume.md) |  |
| CREATE OR ALTER <OBJECT> | Open | November 2024 | * [CREATE OR ALTER <object>](../sql-reference/sql/create-or-alter.md) * [CREATE OR ALTER APPLICATION ROLE](../sql-reference/sql/create-application-role.md) * [CREATE OR ALTER DATABASE](../sql-reference/sql/create-database.md) * [CREATE OR ALTER DATABASE ROLE](../sql-reference/sql/create-database-role.md) * [CREATE OR ALTER ROLE](../sql-reference/sql/create-role.md) * [CREATE OR ALTER SCHEMA](../sql-reference/sql/create-schema.md) * [CREATE OR ALTER STAGE](../sql-reference/sql/create-stage.md) * [CREATE OR ALTER VIEW](../sql-reference/sql/create-view.md) * [CREATE OR ALTER WAREHOUSE](../sql-reference/sql/create-warehouse.md) | Additional commands that create an object if it doesn’t exist, or alters it according to the object definition. |
| Snowflake Connector for SharePoint | Open | November 2024 | [About the Snowflake Connector for SharePoint](../connectors/unstructured-data-connectors/sharepoint/about.md) |  |
| Snowflake ML - Model Serving in Snowpark Container Services | Open | October 2024 | [Deploy models for Real time Inference (REST API)](../developer-guide/snowflake-ml/inference/real-time-inference-rest-api.md) |  |
| Writing files from Snowpark Python UDFs and UDTFs | Open | October 2024 | [Writing files from Snowpark Python UDFs and UDTFs](../developer-guide/snowpark/python/creating-udfs.md) |  |
| Cortex Analyst Suggested Questions | Open | October 2024 | [Onboarding questions in Cortex Analyst](../user-guide/snowflake-cortex/cortex-analyst/suggested-questions-feature.md) |  |
| Cortex Analyst and Search integration | Open | October 2024 | [Improve literal search to enhance Cortex Analyst responses](../user-guide/snowflake-cortex/cortex-analyst/cortex-analyst-search-integration.md) |  |
| Support for IAM authentication in external network access | Open | October 2024 | [External network access overview](../developer-guide/external-network-access/external-network-access-overview.md) |  |
| Sharing of Cortex fine-tuned models in model registry | Open | October 2024 | [Sharing models](../developer-guide/snowflake-ml/model-registry/overview.md) |  |
| Resource constraints for Snowpark-optimized warehouses | Open | September 2024 | [Snowpark-optimized warehouses](../user-guide/warehouses-snowpark-optimized.md) |  |
| Support for Cross-Cloud Auto-Fulfillment in a Snowflake Native App with Snowpark Container Services | Open | August 2024 | [Auto-fulfillment for listings](../collaboration/provider-listings-auto-fulfillment.md) | Currently only supported on Amazon Web Services and Microsoft Azure. |
| Snowflake Connector for PostgreSQL | Open | July 2024 | [About the Snowflake Connector for PostgreSQL](../connectors/postgres6/about.md) |  |
| Snowflake Connector for MySQL | Open | July 2024 | [About the Snowflake Connector for MySQL](../connectors/mysql6/about.md) |  |
| Create and manage a Snowflake Native App in the Snowflake VS Code extension | Open | July 2024 | [Work with the Snowflake Native App Framework](../user-guide/vscode-ext.md) |  |
| Snowsight Notebooks default warehouses | Open | July 2024 | * [Overview of warehouses](../user-guide/warehouses-overview.md) * [Warehouse considerations](../user-guide/warehouses-considerations.md) * [Set up Snowflake Notebooks](../user-guide/ui-snowsight/notebooks-setup.md) |  |
| Support for external and Apache Iceberg™ tables in the Snowflake Native App Framework | Open | July 2024 | [Support for external and Apache Iceberg™ tables](../developer-guide/native-apps/preparing-data-content.md) |  |
| Event definitions in the Snowflake Native App Framework | Open | July 2024 | [About event sharing](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging#about-event-sharing) |  |
| VS Code extension | Open | July 2024 | [Edit the Snowflake connections.toml file](../user-guide/vscode-ext.md) |  |
| Snowflake Notebooks external access support | Open | July 2024 | [Set up external access for Snowflake Notebooks](../user-guide/ui-snowsight/notebooks-external-access.md) |  |
| Snowflake Native SDK for Connectors | Open | June 2024 | [Snowflake Native SDK for Connectors](../developer-guide/native-apps/connector-sdk/about-connector-sdk.md) |  |
| CREATE OR ALTER TABLE | Open | May 2024 | [CREATE TABLE](../sql-reference/sql/create-table.md) | Creates a table if it doesn’t exist, or alters it according to the table definition. |
| CREATE OR ALTER TASK | Open | May 2024 | [CREATE TASK](../sql-reference/sql/create-task.md) | Creates a task if it doesn’t exist, or alters it according to the task definition. |
| EXECUTE IMMEDIATE FROM template file | Open | May 2024 | [EXECUTE IMMEDIATE FROM](../sql-reference/sql/execute-immediate-from.md) | Execute a template file using the Jinja2 templating language. |
| View more rows of query results in Snowsight worksheet | Open | March 2024 | [Exploring the worksheet results](../user-guide/ui-snowsight-query.md) | Not available to accounts in U.S. government regions, accounts using Virtual Private Snowflake (VPS), and accounts that use Private Connectivity to access Snowflake. |
| Limit functionality of Snowflake Native App | Open | March 2024 | * [Limit functionality of your Snowflake Native App for trial consumers](https://other-docs.snowflake.com/collaboration/provider-listings-preparing#label-listings-trial-limit-functionality-app) * [SYSTEM$IS_LISTING_TRIAL](../sql-reference/functions/system_is_listing_trial.md) |  |
| COPY FILES | Open | February 2024 | [COPY FILES](../sql-reference/sql/copy-files.md) |  |
| Snowflake Connector for Google Analytics Raw Data | Open | January 2024 | [Snowflake Connector for Google Analytics Raw Data](../connectors/google/gard/gard-connector-about.md) |  |
| Snowflake Connector for Google Analytics Aggregate Data | Open | January 2024 | [Snowflake Connector for Google Analytics Aggregate Data](https://other-docs.snowflake.com/connectors/google/gaad/gaad-connector-about.html) |  |
| Support for the Arrow format in the Snowflake .NET driver | Open | November 2023 | [snowflake-connector-net git repo](https://github.com/snowflakedb/snowflake-connector-net) |  |
| Set up and Monitor Client Redirect Using Snowsight | Open | November 2023 | [Redirecting client connections](../user-guide/client-redirect.md) |  |
| Python Package Version Range Support | Open | August 2023 | * [CREATE FUNCTION](../sql-reference/sql/create-function.md) * [CREATE PROCEDURE](../sql-reference/sql/create-procedure.md) |  |
| Snowflake ML - FileSystem and FileSet | Open | N/A | [Load and write data](../developer-guide/snowflake-ml/load-data.md) | This feature is currently supported, but will not be made generally available. |
| Custom Event Billing | Open | [June 2023](2023-06.md) | [Add billable events to an application package](../developer-guide/native-apps/adding-custom-event-billing.md) |  |
| Tracking DDL commands, tags, and policies in the ACCESS_HISTORY view | Open | [June 2023](2023-06.md) | [Access History](../user-guide/access-history.md) |  |
| ACCOUNTS view (Organization Usage) | Open | [June 2023](2023-06.md) | [ACCOUNTS view](../sql-reference/organization-usage/accounts.md) |  |
| External table support for Delta Lake | Open | February 2022 | [Introduction to external tables](../user-guide/tables-external-intro.md) |  |

## Managing access to all preview features

Snowflake provides the ability for account administrators to manage access to
preview features at the account level.

* Account administrators can enable or disable access to preview features for their entire Snowflake account.
  Additionally, account administrators can check whether all preview features are enabled or disabled.
* This setting affects all users and all preview features (including private preview features) within the account.
* By default, access to all preview features is enabled for most accounts.

> **Caution:**
>
> Before disabling or enabling preview features for your account, please review
> the associated documentation for a complete list of limitations and other information.

The following limitations apply to enabling and disabling preview feature access:

* Applies to both private and open preview features.
* This is an all-or-nothing setting that affects all users and all previews within an account.
* Any user in the account who is using a preview feature will lose access to that feature immediately after SYSTEM$DISABLE_PREVIEW_ACCESS is executed.
* Snowflake Marketplace products, which are managed separately through [IMPORTED PRIVILEGES](../user-guide/data-exchange-marketplace-privileges.md), are not covered as part of this capability.
* Client-side libraries (such as Snowpark API) are not covered as part of this capability.

### Checking the status of preview features in your account

To check whether preview features are enabled in your account, call the [SYSTEM$GET_PREVIEW_ACCESS_STATUS](../sql-reference/functions/system_get_preview_access_status.md) function.

For example:

```sqlexample
SELECT SYSTEM$GET_PREVIEW_ACCESS_STATUS();
```

Which returns:

```output
+-------------------------------------------------------+
| SYSTEM$GET_PREVIEW_ACCESS_STATUS()                    |
+-------------------------------------------------------+
| Preview access is [ENABLED|DISABLED] for this account |
+-------------------------------------------------------+
```

Indicating the current state of preview features for the account.

### Enabling preview features in your account

To enable preview features for your account, call the [SYSTEM$ENABLE_PREVIEW_ACCESS](../sql-reference/functions/system_enable_preview_access.md) function.

For example:

```sqlexample
SELECT SYSTEM$ENABLE_PREVIEW_ACCESS();
```

Which returns:

```output
+---------------------------------------------------------------+
| SELECT SYSTEM$ENABLE_PREVIEW_ACCESS();                        |
+---------------------------------------------------------------+
| Preview access has been successfully enabled for this account |
+---------------------------------------------------------------+
```

### Disabling preview features in your account

To disable preview features for your account, call the [SYSTEM$DISABLE_PREVIEW_ACCESS](../sql-reference/functions/system_disable_preview_access.md) function.

> **Caution:**
>
> Caution should be exercised when disabling preview features.
> All preview features, including both public and private, are disabled when you call SYSTEM$DISABLE_PREVIEW_ACCESS.
> Private preview features cannot be enabled by calling SYSTEM$ENABLE_PREVIEW_ACCESS.

```sqlexample
SELECT SYSTEM$DISABLE_PREVIEW_ACCESS();
```

Which returns:

```output
+----------------------------------------------------------------+
| SYSTEM$DISABLE_PREVIEW_ACCESS()                                |
+----------------------------------------------------------------+
| Preview access has been successfully disabled for this account |
+----------------------------------------------------------------+
```

---
title: Private Connectivity Functions: OCSP Connection URLs Added to Output (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1111.md
section: Release Notes
---

# Private Connectivity Functions: OCSP Connection URLs Added to Output (Pending)

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The output of the SYSTEM$ALLOWLIST_PRIVATELINK and SYSTEM$GET_PRIVATELINK_CONFIG functions is as follows:

Previously:
:   The output did not include the private connectivity OCSP endpoints for the Client Redirect connection URLs.

Currently:
:   The output of these functions does include the private connectivity OCSP endpoints for the Client Redirect connection URLs.

    * The SYSTEM$GET_PRIVATELINK_CONFIG function will contain a new field called `privatelink-connection-ocsp-connection-urls`, which specifies
      a comma-separated list of URL values.
    * The SYSTEM$ALLOWLIST_PRIVATELINK function will contain a new TYPE called OCSP_CLIENT_FAILOVER that specifies the host for the OCSP
      connection URL. Note that there might be multiple hosts of this type in the function output.

Update your DNS configuration to include the new OCSP endpoints. For details about private connectivity and Client Redirect, refer to
[Configuring the DNS settings for private connectivity to the Snowflake service](../../../user-guide/client-redirect.md).

Ref: 1111

---
title: Privileges: WITH GRANT OPTION No Longer Allowed When Granting Privileges to Shares
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1096.md
section: Release Notes
---

# Privileges: WITH GRANT OPTION No Longer Allowed When Granting Privileges to Shares

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The behavior of the GRANT <privileges> … TO SHARE command is as follows:

Previously:
:   You could specify the WITH GRANT OPTION when granting privileges to a share.

Currently:
:   You cannot specify the WITH GRANT OPTION when granting privileges to a share.

    Snowflake returns the following error message:

    > `Unsupported feature 'GRANT PRIVILEGES TO SHARE WITH GRANT OPTION'.`

Ref: 1096

---
title: Procedures (caller’s rights): SQL statements that include PUT and GET commands produce a compiler error
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1244.md
section: Release Notes
---

# Procedures (caller’s rights): SQL statements that include PUT and GET commands produce a compiler error

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, [caller’s rights procedures](../../../developer-guide/stored-procedure/stored-procedures-rights.md) written in JavaScript or Snowflake
Scripting will throw a compiler error if their handler code attempts to execute a [PUT](../../../sql-reference/sql/put.md) or
[GET](../../../sql-reference/sql/get.md) statement.

Previously:
:   In the JavaScript or Snowflake Scripting handler code of a caller’s rights procedure, attempting to use PUT or GET in a SQL statement
    neither succeeds nor throws an error. In other words, the PUT or GET has no effect even though the procedure continues and appears to
    succeed.

Currently:
:   In the JavaScript or Snowflake Scripting handler code of a caller’s rights procedure, attempting to use PUT or GET in a SQL statement
    will throw a compiler error such as the following:

    ```none
    Unsupported statement type PUT_FILES
    ```

Ref: 1244

---
title: PROCEDURES view (Account Usage and Information Schema): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1786.md
section: Release Notes
---

# PROCEDURES view (Account Usage and Information Schema): New columns

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the following views includes new columns:

* The [Account Usage PROCEDURES view](../../../sql-reference/account-usage/procedures.md) output includes the following new column:

  | Column name | Data type | Description |
  | --- | --- | --- |
  | INSTALLED_PACKAGES | STRING | Lists all packages installed by the procedure. Output for Python procedures only. |
* The [Information Schema PROCEDURES view](../../../sql-reference/info-schema/procedures.md) output includes the following new columns:

  | Column name | Data type | Description |
  | --- | --- | --- |
  | PACKAGES | STRING | Specifies the packages requested by the procedure. |
  | RUNTIME_VERSION | STRING | Specifies the runtime version of the procedure. NULL if the function is SQL or JavaScript. |
  | INSTALLED_PACKAGES | STRING | Lists all packages installed by the procedure. Output for Python procedures only. |

Ref: 1786

---
title: Providers must explicitly authorize event sharing when testing apps that include mandatory event definitions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1851.md
section: Release Notes
---

# Providers must explicitly authorize event sharing when testing apps that include mandatory event definitions

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

To test an app locally, providers can create an app in development mode in their account before publishing
to consumers. This includes apps that have logging and event tracing enabled.

When this behavior change bundle is enabled, providers must specify the `AUTHORIZE_TELEMETRY_EVENT_SHARING = TRUE`
clause when using the [CREATE APPLICATION](../../../sql-reference/sql/create-application.md) command to create an app in the same account that
has mandatory event definitions defined in the manifest file.

Before the change:
:   The `AUTHORIZE_TELEMETRY_EVENT_SHARING = TRUE` clause is not required as part of the
    [CREATE APPLICATION](../../../sql-reference/sql/create-application.md) command.

After the change:
:   Providers must specify the `AUTHORIZE_TELEMETRY_EVENT_SHARING = TRUE` clause when using the
    [CREATE APPLICATION](../../../sql-reference/sql/create-application.md) command to create an app for testing.

    > **Note:**
    >
    > This clause is required only if the provider specifies mandatory event definitions in the `manifest.yml` file.

Ref: 1851

---
title: PUT command on GCP: OVERWRITE parameter must be set to TRUE to overwrite files
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1253.md
section: Release Notes
---

# PUT command on GCP: OVERWRITE parameter must be set to TRUE to overwrite files

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, you must set the OVERWRITE parameter equal to TRUE for all PUT commands in order to overwrite files when your Snowflake account is hosted on Google Cloud Platform.

Previously:
:   For Snowflake accounts that are hosted on Google Cloud Platform, PUT statements do not recognize when the OVERWRITE parameter is set to TRUE. A PUT operation always overwrites any existing files in the target stage with the local files you are uploading. This behavior for GCP is different from Azure and AWS.

Currently:
:   For Snowflake accounts that are hosted on Google Cloud Platform, PUT statements will overwrite files only if the OVERWRITE parameter is explicitly set to TRUE. This behavior will be the same across all three platforms: GCP, Azure, and AWS.

We recommend that you review any code or scripts that use the PUT command. If the intention is to overwrite the file in the target stage, you must change the code or script to set the OVERWRITE parameter to TRUE. If this parameter is not set for the PUT command, and there is an existing file with the same name, the default value of OVERWRITE=FALSE will be used. In this case, the PUT command will complete without error, but will not overwrite the existing file.

If you are a Google Cloud Platform customer, you must update all clients to a new set of minimum versions by November 1, 2023 to avoid disruptions to your client connectivity. For more information, read [this help article](https://community.snowflake.com/s/article/faq-2023-client-driver-deprecation-for-GCP-customers).

Ref: 1253

---
title: PUT command: Drivers affected by upcoming Google authentication method changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1345.md
section: Release Notes
---

# PUT command: Drivers affected by upcoming Google authentication method changes

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

Due to enforced PUT command authentication changes by Google, driver applications behave as follows:

Before the change:
:   Customers with their Snowflake accounts hosted on Google Cloud using driver versions lower than the updated minimum versions listed below can still use older Google
    authentication methods for PUT requests.

After the change:
:   Once the bundle is enabled, Snowflake will no longer allow applications to use this older Google authentication method, in preparation for the changes that will be enforced by Google in January 2024. For any applications based on older driver versions, Snowflake will automatically throw the following exception for any PUT command:

    ```output
    091032 (22000): Your client app version, {0}, is using a deprecated pre-signed URL for
    PUT. Please upgrade to a version that supports GCP downscoped token. See
    https://community.snowflake.com/s/article/faq-2023-client-driver-deprecation-for-GCP-customers.
    ```

    To continue using your driver applications without disruption, you must upgrade your drivers to at least the new minimum versions below. Snowflake recommends that you upgrade to the latest versions listed in the [Client Versions & Support Policy](../../requirements.md) topic by October 30, 2023 when this behavior bundle will be enabled by default.

    > **Caution:**
    >
    > Beginning on January 15, 2024, Google will enforce the new PUT authentication method without exception. Consequently, Snowflake will NOT be able to allow customers to opt out of this behavior change after this date.

    Please note that newer versions also introduce new PUT overwrite behavior that might require you to update code or scripts that use the PUT command. For more information, see the BCR 2023_06 [PUT overwrite](../2023_06/bcr-1253.md) documentation topic.

    > **Note:**
    >
    > For more background on this issue, please see this [help article](https://community.snowflake.com/s/article/faq-2023-client-driver-deprecation-for-GCP-customers). Affected customers have also received previous emails regarding this issue with the subject line “Important! Action Required: Upgrade Client Drivers for your Snowflake accounts on Google Cloud”.

    | Client Driver | Min Version for GCP | Upgrade Link |
    | --- | --- | --- |
    | JDBC | 3.13.25 | [JDBC](../../../developer-guide/jdbc/jdbc-download.md) |
    | ODBC | 2.25.9 | [ODBC](../../../developer-guide/odbc/odbc-download.md) |
    | Python | 2.7.8 | [Python](../../../developer-guide/python-connector/python-connector-install.md) |
    | Go | 1.16.17 | [JDBC](../../../developer-guide/golang/go-driver.md) |
    | .NET | 2.0.21 | [.NET](../../../developer-guide/dotnet/dotnet-driver.md) |
    | Node.js | 1.6.21 | [Node.js](../../../developer-guide/node-js/nodejs-driver-install.md) |
    | Kafka Connector | 1.9.4 | [Kafka](../../../user-guide/kafka-connector-install.md) |
    | Spark Connector | 2.11.3 | [Spark](../../../user-guide/spark-connector-install.md) |
    | Snowpark Java/Scala API | 1.8.0 | [Snowpark Java/Scala](../../../developer-guide/snowpark/java/index.md) |
    | Snowpark Python API | 0.9.0 | [Snowpark Python](../../../developer-guide/snowpark/python/setup.md) |
    | Java Ingest SDK | 2.0.0 | [Java SDK](../../../user-guide/data-load-snowpipe-rest-gs.md) |

Ref: 1345

---
title: PYPI_REPOSITORY_USER database role granted to the PUBLIC role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2280.md
section: Release Notes
---

# PYPI_REPOSITORY_USER database role granted to the PUBLIC role

Users must have the `PYPI_REPOSITORY_USER` database role to use packages from the shared PyPI repository when creating
Python UDFs, UDTFs, UDAFs, and stored procedures. This change affects which roles have the `PYPI_REPOSITORY_USER`
database role by default.

Before the change:
:   Access to the shared PyPI repository (`snowflake.snowpark.pypi_shared_repository`) is opt-in. Account administrators
    must explicitly grant the `PYPI_REPOSITORY_USER` database role before users can use packages from the shared PyPI
    repository:

    ```sql
    GRANT DATABASE ROLE snowflake.snowpark.pypi_repository_user TO ROLE my_role;
    ```

After the change:
:   For new accounts, Snowflake grants the `PYPI_REPOSITORY_USER` database role to the `PUBLIC` role during account
    creation, so all users in the account can use the shared PyPI repository by default.

    For existing accounts, a one-time backfill grants the `PYPI_REPOSITORY_USER` database role to the `PUBLIC` role.

    This means a user can use any role to create Python functions and procedures that use packages from the shared PyPI
    repository without requiring an explicit grant from an account administrator.

If you want to restrict access to the shared PyPI repository after this change, you can either proactively opt out or
reactively revoke access:

```sql
-- Proactive opt-out via account parameter (only available before the change is rolled out)
ALTER ACCOUNT SET ENABLE_PYPI_REPOSITORY_USER_PUBLIC_GRANT = FALSE;

-- Reactive revocation
REVOKE DATABASE ROLE snowflake.snowpark.pypi_repository_user FROM ROLE PUBLIC;

-- Optionally grant to specific roles
GRANT DATABASE ROLE snowflake.snowpark.pypi_repository_user TO ROLE data_science;
```

Ref: 2280

---
title: Python Snowpark Stored Procedures and UDFs: Tracing improvements in Event table
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1520.md
section: Release Notes
---

# Python Snowpark Stored Procedures and UDFs: Tracing improvements in Event table

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

Users can now see how queries from chained calls are linked when a Python stored procedure calls another Python stored procedure or a Python
stored procedure calls a Python UDF. To use this capability, an Event table must be configured and tracing must be enabled.
The values displayed in the `TRACE:"trace_id"` and `RECORD:"parent_span_id"` columns of the Event table are as follows:

Before the change:
:   The `trace_id` of each of the spans created by chained Python stored procedures or UDFs is unique.
    The `parent_span_id` field does not exist in the RECORD column of the Event table.
    Native apps providers and consumers see different `trace_ids` for shared events. The provider sees the hashed version.

After the change:
:   Spans generated by chained Python stored procedures or UDFs have the same `trace_id`.

    Spans generated by chained Python stored procedures or UDFs have a parent-child relationship between `span_id` and `parent_span_id`.
    Python stored procedures can call other stored procedures in a chain of any length, but UDFs can’t execute SQL statements so calling
    a UDF ends the chain. However, the trace info is still propagated to the UDF’s spans.

    If the Python stored procedure or UDF was called by the user directly (the root), then the `trace_id` will be a random ID and there
    will be no `parent_span_id`. If tracing is disabled for a stored procedure and it calls another stored procedure or UDF, then the
    `trace_id` of the child’s spans will be random and they will have no `parent_span_id`. In other words, the trace is restarted at the child.

    Native apps providers and consumers see the same `trace_id` for shared Python stored procedure or UDF events, so they can be debugged more easily.

Native Apps providers who share applications containing Python stored procedures that call other Python stored procedures or UDFs expose
the call stack and parent-child relationships of the stored procedures to the consumer. To avoid this, disable tracing.

Ref: 1520

---
title: Python telemetry library automatically installed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2120.md
section: Release Notes
---

# Python telemetry library automatically installed

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change is enabled, Snowflake automatically installs the `snowflake-telemetry-python` package when a function or procedure with a Python handler is created.

Before the change:
:   You must explicitly specify the `snowflake-telemetry-python` package (such as with the PACKAGES parameter) when you create a function or procedure with a Python handler.

After the change:
:   Snowflake automatically includes the `snowflake-telemetry-python` package by default.
    However, if you specify a package policy to allow or disallow specific packages explicitly,
    Snowflake will not automatically include the `snowflake-telemetry-python` package.
    In this case, you must specify the package.

Ref: 2120

---
title: Python UDFs and stored procedures: Stop implicit auto-injection of the psutil package
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1948.md
section: Release Notes
---

# Python UDFs and stored procedures: Stop implicit auto-injection of the psutil package

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

psutil is a Python library that provides convenient functions for retrieving information about system utilization. Currently Snowflake installs the psutil package implicitly inside a sandbox when a UDF or stored procedure is created. But this can lead to a violation of the packages policy blocklist if you set the blocklist, and it contains psutil.

Hence this behavior change stops auto-injection of the psutil package. When this behavior change bundle is enabled, you have to mention the psutil package explicitly inside the package list while creating the UDF stored procedure if you require it.

Before the change:
:   Snowflake installs the psutil package implicitly inside a sandbox when a UDF or stored procedure is created.

After the change:
:   You now have to add the psutil package explicitly inside the package list while creating the UDF or stored procedure if you require it.

Ref: 1948

---
title: Python UDFs: Changes to return value types for semi-structured data
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1546.md
section: Release Notes
---

# Python UDFs: Changes to return value types for semi-structured data

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The behavior of Python UDFs that return an Array, Object, or Variant has changed slightly. When Python UDFs return a floating point number in the handler,
the behavior is as follows:

Before the change:
:   For Python float numeric type values, scientific notation values are deserialized as scientific notation values; otherwise, they are deserialized to `FIXED` values.

After the change:
:   All Python float numbers are deserialized as scientific notation values.

Ref: 1546

---
title: Query Acceleration Service: Expanded support for COPY statements
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1749.md
section: Release Notes
---

# Query Acceleration Service: Expanded support for COPY statements

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

The [Query Acceleration Service](../../../user-guide/query-acceleration-service.md) (QAS) can accelerate
[eligible queries](../../../user-guide/query-acceleration-service.md) by offloading portions of the query processing to
serverless compute resources that are provided by the service.

QAS support for COPY statements changes as follows:

Before the change:
:   COPY statements are not accelerated by the Query Acceleration Service.

After the change:
:   Eligible COPY statements are accelerated by the Query Acceleration Service and appear in the Account Usage
    [QUERY_ACCELERATION_ELIGIBLE view](../../../sql-reference/account-usage/query_acceleration_eligible.md).

This change might result in additional QAS usage for your account. Additionally, you can check the QUERY_ACCELERATION_ELIGIBLE
view to see if you have new warehouses that may benefit from QAS.

Ref: 1749

---
title: Query Acceleration Service: Expanded support for INSERT statements
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1487.md
section: Release Notes
---

# Query Acceleration Service: Expanded support for INSERT statements

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The [query acceleration service](../../../user-guide/query-acceleration-service.md) can accelerate
[eligible queries](../../../user-guide/query-acceleration-service.md) by offloading portions of the query processing to serverless compute
resources that are provided by the service.

Query acceleration service support for INSERT statements behaves as follows:

Before the change:
:   Eligible INSERT statements that include a SELECT statement can be accelerated. Only the scan portion with a selective filter can be
    eligible for acceleration.

After the change:
:   All portions of eligible INSERT statements can be accelerated.

This change might result in additional query acceleration service usage for your account.

Ref: 1487

---
title: Query and task history views and functions: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1147.md
section: Release Notes
---

# Query and task history views and functions: New columns

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the output of query and task history views and functions include new columns. The views and functions
that are affected include:

* The following Account Usage views:

  + [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md)
  + [QUERY_ACCELERATION_ELIGIBLE](../../../sql-reference/account-usage/query_acceleration_eligible.md)
  + [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md)
* The following Information Schema table functions:

  + [QUERY_HISTORY](../../../sql-reference/functions/query_history.md)
  + [TASK_HISTORY](../../../sql-reference/functions/task_history.md)

The output of these views and functions includes the following new columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| `query_hash` | TEXT | Hash value that is computed based on the canonicalized text of the SQL statement. |
| `query_hash_version` | NUMBER | The version of the hash in the `query_hash` column. |
| `query_parameterized_hash` | TEXT | Hash value of the query text after literals are parameterized |
| `query_parameterized_hash_version` | NUMBER | The version of the hash in the `query_parameterized_hash` column. |

If you previously defined a view that
[selects all columns (SELECT \*) from any of these views](../../../sql-reference/account-usage.md), querying the
view returns an error. You must recreate your view by using the [CREATE OR REPLACE VIEW](../../../sql-reference/sql/create-view.md)
command.

For example, suppose that you defined a view that selected all columns from the TASK_HISTORY view:

```sqlexample
CREATE OR REPLACE VIEW my_task_history
  AS SELECT * FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY;
```

Previously:
:   Querying your view (`my_task_history` in this example) returns the results from the view.

Currently:
:   Querying your view (`my_task_history` in this example) returns an error about the number of columns in the view:

    ```output
    View definition for MY_DB.MY_SCHEMA.MY_TASK_HISTORY' declared 22 column(s),
    but view query produces 27 column(s).
    ```

As noted in the [Usage Notes for CREATE VIEW](../../../sql-reference/sql/create-view.md), if a view selects all columns from an
underlying table or view, the view is not updated automatically when a new column is added to the underlying table or view.
Querying the view returns a column-related error.

To prevent this error, you must recreate the view. For example, to recreate the view in the example above with specific columns
selected (to avoid problems in the future due to columns being added):

```sqlexample
CREATE OR REPLACE VIEW my_task_history
  AS SELECT query_text, completed_time FROM SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY;
```

In addition, if you created a table that has the same columns as one of these views (for example, by using
CREATE TABLE … LIKE SNOWFLAKE.ACCOUNT_USAGE.TASK_HISTORY), and you are copying rows from that view to the table, you must
add the new columns to your table. Use the [ALTER TABLE … ADD COLUMN](../../../sql-reference/sql/alter-table.md) command to add the
same columns to your table that were added to the view.

Ref: 1147

---
title: Query History: Queries for alert conditions and actions included in history
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1233.md
section: Release Notes
---

# Query History: Queries for alert conditions and actions included in history

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the queries for [Snowflake alert](../../../user-guide/alerts.md) conditions and actions are included in the
query history. This affects the following interfaces:

* The [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) and
  [QUERY_ACCELERATION_ELIGIBLE](../../../sql-reference/account-usage/query_acceleration_eligible.md) views in the ACCOUNT_USAGE schema.
* The [QUERY_HISTORY and QUERY_HISTORY_BY_\* functions](../../../sql-reference/functions/query_history.md) in the INFORMATION_SCHEMA
  schema.
* The Query History page in Snowsight.
* The History page in the Classic Console.

Previously:
:   The queries for the conditions and actions in Snowflake alerts do not appear in the query history.

Currently:
:   The queries for the conditions and actions in Snowflake alerts will appear in the query history.

    In the query history, the name of the user who executed the query will be SYSTEM. (The alerts are run by the system service.)

Ref: 1233

---
title: Query History: Redacted SQL Upon Syntax Error
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-936.md
section: Release Notes
---

# Query History: Redacted SQL Upon Syntax Error

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

For the most up-to-date details about the version and date in which it will be enabled, as well as other release-related details,
see the Behavior Change Log.
The views, pages, and functions that provide a query history now redact the content of a query that fails due to a syntax or parsing
error:

Previously:
:   When a query failed due to a syntax or parsing error, its content could be viewed in the views, pages, and functions that
    provide a query history.

Currently:
:   The query history redacts the content of a query that fails due to a syntax or parsing error. The query text is replaced with
    `<redacted>`.

This implementation is done mainly for security reasons, where sensitive information like passwords cannot be redacted for queries with
invalid syntax (which is done for syntactically valid queries). However, the user who executed the query would still be able to view the
un-redacted query.

Note that “redacted” means that only the query text will be redacted, not the whole row in the query history for that syntactically
invalid query.

In order to clarify who can see this text un-redacted, please be aware that the USER who executed the query (no matter what role they
use) can see the query text. However, another user (even if they use the same role used for executing the failed query) will not be able
to see the query text. The entry in the [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) view is available for everyone
who has the necessary privileges to check this view.

Ref: 936

---
title: Query Profile: Changes to the Update, Delete, and Insert Operators
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-978.md
section: Release Notes
---

# Query Profile: Changes to the Update, Delete, and Insert Operators

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

This behavior is as follows:

Previously:
:   Using Query Profile:

    * The Insert operator displays the **Table names** attribute for single target tables.
    * The Update and Delete operators display the **Table name** attribute.

Currently:
:   * The Insert operator displays the **Full table name** attribute for a single table target, and the **Full table names** attribute for multiple target tables.
    * The Update and Delete operators display the **Full table name** attribute.

Ref: 978

---
title: QUERY_ACCELERATION_ELIGIBLE View (ACCOUNT_USAGE): Changes to columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1585.md
section: Release Notes
---

# QUERY_ACCELERATION_ELIGIBLE View (ACCOUNT_USAGE): Changes to columns

> **Attention:**
>
> This behavior change is temporarily unavailable in the 2024_03 behavior change bundle.

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

Prior to the 2024_02 bundle, the Query Acceleration Service (QAS) only supported eligible INSERT statements that contained SELECT statements.
This change aligns the rows in the QUERY_ACCELERATION_ELIGIBLE view with the expanded INSERT support improvements for QAS. For more
information, see [Query Acceleration Service: Expanded support for INSERT statements](../2024_02/bcr-1487.md)

The Account Usage [QUERY_ACCELERATION_ELIGIBLE view](../../../sql-reference/account-usage/query_acceleration_eligible.md) will change as follows:

Before the change:
:   QUERY_ACCELERATION_ELIGIBLE shows all eligible queries.

After the change:
:   QUERY_ACCELERATION_ELIGIBLE shows all eligible queries, including newly eligible INSERT statements.

Ref: 1585

---
title: QUERY_HISTORY view (Account Usage): Changes to columns and new columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1497-1524-1540.md
section: Release Notes
---

# QUERY_HISTORY view (Account Usage): Changes to columns and new columns

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The [QUERY_HISTORY view](../../../sql-reference/account-usage/query_history.md) includes the following new columns
and changes to columns:

## New columns in QUERY_HISTORY view

When this behavior change bundle is enabled, the Account Usage QUERY_HISTORY view includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| SECONDARY_ROLE_STATS | VARCHAR | A JSON-formatted string that contains three fields regarding secondary roles that were evaluated in the query: a list of secondary roles or `ALL` depending on the session, a count of the number of secondary roles, and the internal/system-generated ID for each secondary role. The count and number of IDs have a maximum of 50. |
| ROWS_WRITTEN_TO_RESULT | NUMBER | Number of rows written to a result object. For CREATE TABLE AS SELECT (CTAS) and all DML operations, this result is `1`. The values in the ROWS_INSERTED, ROWS_UPDATED, and ROWS_DELETED columns reflect the number of rows actually inserted, updated, or deleted.  For more information, see ROWS_PRODUCED column deprecated |
| ROWS_INSERTED | NUMBER | Number of rows inserted by the query. |
| QUERY_RETRY_TIME | NUMBER | Total execution time (in milliseconds) for query retries caused by actionable errors. For more information, see Query retry columns. |
| QUERY_RETRY_CAUSE | VARCHAR | Error that caused the query to retry. If there is no query retry, the field is NULL. For more information, see Query retry columns. |
| FAULT_HANDLING_TIME | NUMBER | Total execution time (in milliseconds) for query retries caused by errors that are *not* actionable. For more information, see Query retry columns. |

These columns are added as the last (right-most) columns in the view.

For more information, see also [Information Schema: New columns in output for QUERY_HISTORY, QUERY_HISTORY_BY_\* functions](bcr-1431-1524-1540.md).

### Query retry columns

A query might need to be retried one or more times in order to successfully complete. There can be multiple causes that result in a query
retry. Some of these causes are *actionable*, that is, a user can make changes to reduce or eliminate query retries for a specific query.
For example, if a query is retried due to an out of memory error, modifying warehouse settings might resolve the issue.

Some query retries are caused by a fault tolerance that is not actionable. That is, there is no change a user can make to prevent the
query retry. For example, a network outage might result in a query retry. In this case, there is no change to the query or to the
warehouse that executes it that can prevent the query retry.

The QUERY_RETRY_TIME, QUERY_RETRY_CAUSE, and FAULT_HANDLING_TIME columns can help you optimize queries that are retried and better
understand fluctuations in query performance.

### ROWS_PRODUCED column deprecated

The ROWS_PRODUCED column will be deprecated in a future release. The value in the ROWS_PRODUCED column doesn’t always reflect the logical
number of rows affected by a query. For example, the value in the ROWS_PRODUCED column might include rows that were deleted due to
rewriting of micro-partitions and could be larger than the actual number of rows affected. Snowflake recommends using the ROWS_INSERTED,
ROWS_UPDATED, ROWS_WRITTEN_TO RESULTS, or ROWS_DELETED columns instead.

## Changes to columns in QUERY_HISTORY view

The following columns are included in the Account Usage [QUERY_HISTORY view](../../../sql-reference/account-usage/query_history.md):

* BYTES_WRITTEN_TO_RESULT
* ROWS_INSERTED

The values in the these columns for specific types of queries are as follows:

Before the change:
:   |  |  |
    | --- | --- |
    | BYTES_WRITTEN_TO_RESULT | `0` for small queries. |
    | ROWS_INSERTED: | `0` for CREATE TABLE AS SELECT (CTAS) queries. |

After the change:
:   |  |  |
    | --- | --- |
    | BYTES_WRITTEN_TO_RESULT | Number of bytes written to a result object for small queries. |
    | ROWS_INSERTED: | Number of rows inserted for CREATE TABLE AS SELECT (CTAS) queries. |

For more information, see also [Information Schema: New columns in output for QUERY_HISTORY, QUERY_HISTORY_BY_\* functions](bcr-1431-1524-1540.md).

Ref: 1431, 1524, 1540

---
title: QUERY_HISTORY View (Account Usage): New Columns Added
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-921.md
section: Release Notes
---

# QUERY_HISTORY View (Account Usage): New Columns Added

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The following columns have been added to the Account Usage [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) view
in the shared SNOWFLAKE database.

| Column Name | Data Type | Description |
| --- | --- | --- |
| TRANSACTION_ID | NUMBER | Specifies either the [ID of the transaction](../../../sql-reference/transactions.md) that wraps the statement or 0 if the statement is not executed within a transaction. The new column can be used to join data in the QUERY_HISTORY and [LOCK_WAIT_HISTORY](../../../sql-reference/account-usage/lock_wait_history.md) views to examine multi-statement transactions. |
| CHILD_QUERIES_WAIT_TIME | NUMBER | Specifies the number of milliseconds to complete a child job for a query. Note that a child job only applies to certain Snowflake queries. Snowflake pauses the main query, the child job completes, and then the main query resumes. |

Ref: 921

---
title: QUERY_HISTORY view and table functions: New MULTI_STATEMENT value in the query_type column for multi-statement queries
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1214.md
section: Release Notes
---

# QUERY_HISTORY view and table functions: New MULTI_STATEMENT value in the `query_type` column for multi-statement queries

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

When a query is executed through any interface, you can view details about the executed query in the output of the [QUERY_HISTORY table function](../../../sql-reference/functions/query_history.md) and in the [QUERY_HISTORY view](../../../sql-reference/account-usage/query_history.md) (in the ACCOUNT_USAGE schema). This change addresses an error regarding the type displayed in the `query_type` column when drivers submit multi-statement queries.

The behavior of this change is as follows:

Before the change:
:   When a driver submitted queries that contained multiple SQL statements, the `query_type` column in the QUERY_HISTORY view and QUERY_HISTORY table function output incorrectly displayed SELECT as the query type.

After the change:
:   The `query_type` column correctly displays the query type as MULTI_STATEMENT when a driver submits a multi-statement query.

Ref: 1214

---
title: QUERY_HISTORY views and function: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-1980-2050.md
section: Release Notes
---

# QUERY_HISTORY views and function: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the [ACCOUNT_USAGE QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md)
view, the [ORGANIZATION_USAGE QUERY_HISTORY](../../../sql-reference/organization-usage/query_history.md) view, and the output of the
[QUERY_HISTORY](../../../sql-reference/functions/query_history.md) table function include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `authn_event_id` | NUMBER | ID for the event for the authentication of the user for this query. This ID corresponds to the value in the `event_id` column in the [LOGIN_HISTORY view](../../../sql-reference/account-usage/login_history.md).  **Note:** This column appears between the `session_id` and `user_name` columns.  If you have queries that rely on the order of the columns in this view, you need to update those queries to account for the new column. |
| `bind_values` | VARIANT | Values of bind variables used in this query.  The column contains an OBJECT value. The value contains a key-value pair, where the key is the name of the bind variable and the value is another OBJECT value with the following key-value pairs:   * `type`: Snowflake data type of the value. * `value`: Value of the bind variable.   For example:  ```sqlexample {   "model_name": {     "type": "TEXT",     "value": "mistral-large2"   } } ```  This column appears after the last column in the view (`user_schema_id`).  If you don’t want bind values to be accessible to users, set the ALLOW_BIND_VALUES_ACCESS account-level parameter to FALSE:  ```sqlexample ALTER ACCOUNT SET ALLOW_BIND_VALUES_ACCESS = FALSE; ``` |

Ref: 1980, 2050

---
title: Reader accounts: DROP ACCOUNT command not supported
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1271.md
section: Release Notes
---

# Reader accounts: DROP ACCOUNT command not supported

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The behavior of a reader account is as follows:

Previously:
:   When you use the DROP ACCOUNT command to remove a reader account from your account, Snowflake returns a successful status message but
    the reader account is not dropped.

Currently:
:   Snowflake returns an error message if you try to use the DROP ACCOUNT command to remove a reader account from your account.

    ```sqlexample
    DROP ACCOUNT reader_acct1;
    ```

    ```output
    Drop account command not allowed for managed accounts. Use command drop managed account. For more details visit https://docs.snowflake.com/en/sql-reference/sql/drop-managed-account.
    ```

    Instead, use the [DROP MANAGED ACCOUNT](../../../sql-reference/sql/drop-managed-account.md) command to remove a reader account from your account.

    For details, see [Dropping a reader account](../../../user-guide/data-sharing-reader-create.md).

Ref: 1271

---
title: Reader Accounts: ORGADMIN Role Removed from Accounts
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1045.md
section: Release Notes
---

# Reader Accounts: ORGADMIN Role Removed from Accounts

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, no reader accounts have the ORGADMIN role enabled. Reader accounts are used by consumers, not providers.
There is no reason to allow the consumer to have an organization administrator who can work with other accounts.

Previously:
:   Some reader accounts have the ORGADMIN role enabled, but administrators are blocked from enabling the role in new reader accounts.

Currently:
:   The ORGADMIN role will be removed from all reader accounts, ensuring that no reader accounts have the ORGADMIN role enabled.

Ref: 1045

---
title: Rename the CREATE DATA EXCHANGE LISTING privilege to CREATE LISTING
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1926.md
section: Release Notes
---

# Rename the CREATE DATA EXCHANGE LISTING privilege to CREATE LISTING

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the privilege granted to create listings is CREATE LISTINGS instead of CREATE DATA EXCHANGE LISTINGS.

Before the change:
:   Users specify the following to grant the privilege of creating data exchange listings on an account to a custom role:

    ```sqlexample
    GRANT CREATE DATA EXCHANGE LISTING ON ACCOUNT TO ROLE <role_name>;
    ```

After the change:
:   When this behavior change bundle is enabled, users specify the following to grant the privilege of creating data exchange listings on an account to a custom role:

    ```sqlexample
    GRANT CREATE LISTING ON ACCOUNT TO ROLE <role_name>;
    ```

Ref: 1926

---
title: Replication Groups: New column in output of SHOW command and Information Schema View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1490.md
section: Release Notes
---

# Replication Groups: New column in output of SHOW command and Information Schema View

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The output of the [SHOW REPLICATION GROUPS](../../../sql-reference/sql/show-replication-groups.md) command and the Information Schema
[REPLICATION_GROUPS view](../../../sql-reference/info-schema/replication_groups.md) view is as follows:

Before the change:
:   The output does not include the IS_LISTING_AUTO_FULFILLMENT_GROUP column.

After the change:
:   The output includes IS_LISTING_AUTO_FULFILLMENT_GROUP, as follows:

| Column name | Data type | Description |
| --- | --- | --- |
| IS_LISTING_AUTO_FULFILLMENT_GROUP | BOOLEAN | TRUE if the replication group is used for [Cross-Cloud Auto-fulfillment](../../../collaboration/provider-listings-auto-fulfillment.md). FALSE otherwise. |

Ref: 1490

---
title: Replication Groups: New column in output of SHOW command and Information Schema view
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1273.md
section: Release Notes
---

# Replication Groups: New column in output of SHOW command and Information Schema view

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW REPLICATION GROUPS](../../../sql-reference/sql/show-replication-groups.md) command and
Information Schema [REPLICATION_GROUPS view](../../../sql-reference/info-schema/replication_groups.md) include the following new column:

| Column name | Description |
| --- | --- |
| ERROR_INTEGRATION | The name of the notification integration for the replication group or failover group to which the error notification is sent in cases of refresh failures. |

Ref: 1273

---
title: Replication support for CREATE <class_name> privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1607.md
section: Release Notes
---

# Replication support for CREATE <class_name> privilege

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The CREATE <class_name> privilege is granted on a schema to a role. A role granted this privilege can be used to create an
instance of class <class_name>.

For example, the following statement grants the role `budget_creator` the privilege to create instances of the
[SNOWFLAKE.CORE.BUDGET](../../../sql-reference/classes/budget.md) class in schema `budgets_db.budgets_schema`:

```sqlexample
GRANT CREATE SNOWFLAKE.CORE.BUDGET
  ON SCHEMA budgets_db.budgets_schema
  TO ROLE budget_creator;
```

> **Note:**
>
> Privileges granted to roles are replicated to target accounts only if the ROLES object type is included in the OBJECT_TYPES list for a
> replication or failover group. The object the privilege is granted must also be replicated.

The replication of the CREATE <class_name> privilege behaves as follows:

Before the change:
:   If a role is granted the CREATE <class_name> privilege on a schema in a source account, this privilege grant is *not* replicated to
    target accounts.

    For example, the CREATE [SNOWFLAKE.ML.FORECAST](../../../sql-reference/classes/forecast.md) privilege is granted using the
    following statement in a source account:

    ```sqlexample
    GRANT CREATE SNOWFLAKE.ML.FORECAST
      ON SCHEMA admin_db.admin_schema
      TO ROLE analyst;
    ```

    If the database `admin_db` and roles are replicated to a target account, the CREATE SNOWFLAKE.ML.FORECAST privilege grant is
    not replicated. The role `analyst` can’t create instances of the SNOWFLAKE.ML.FORECAST class in the target account.

After the change:
:   If a role is granted the CREATE <class_name> privilege on a schema in a source account, the privilege grant is replicated to the
    target account if the following objects are included in the replication or failover group:

    * The database that contains the schema on which the privilege is granted.
    * The ROLES object type is included in the OBJECT_TYPES list.

    A user granted the role with the CREATE <class_name> privilege in a target account can create an instance of <class_name> in the
    replicated schema in the target account.

    For example, the CREATE SNOWFLAKE.ML.FORECAST privilege is granted using the following statement in a source account:

    ```sqlexample
    GRANT CREATE SNOWFLAKE.ML.FORECAST
      ON SCHEMA admin_db.admin_schema
      TO ROLE analyst;
    ```

    If the database `admin_db` and roles are replicated to a target account, the CREATE SNOWFLAKE.ML.FORECAST privilege grant is
    replicated. The role `analyst` can create instances of the SNOWFLAKE.ML.FORECAST class in the target account.

For a list of available Snowflake classes, see [SQL class reference](../../../sql-reference-classes.md).

Ref: 1607

---
title: Replication views and functions: New refresh phase SECONDARY_COMMITTING
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2043.md
section: Release Notes
---

# Replication views and functions: New refresh phase SECONDARY_COMMITTING

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

Before the change:
:   Views and functions related to replication show report `SECONDARY_DOWNLOADING_DATA` as the last phase before
    a refresh job is marked as `COMPLETED`, `FAILED`, or `CANCELED`.

After the change:
:   When this behavior change bundle is enabled, the views and functions that report progress and history for account replication
    operations include a new value in the `phase_name` column: `SECONDARY_COMMITTING`. This phase happens after
    `SECONDARY_DOWNLOADING_DATA` and before the final phase where the refresh job is marked as `COMPLETED`, `FAILED`, or
    `CANCELED`. During the `SECONDARY_COMMITTING` phase, Snowflake applies the changes to tables from the data files that were
    transmitted from the primary account.

    The output of history-related replication table functions includes a new column `committed_object_count`. This value represents
    the number of tables that have been processed during the associated refresh operation:

    * The subfields have the same names as the subfields in the `object_count` column.
    * This column appears in the result set of the table functions, but not in the views for refresh history.

    The output of progress-related replication table functions includes new subfields in the `progress` and `details` columns:

    * The `progress` column shows the percentage of tables that finished replicating.
      This value reflects only the number of tables that were replicated, not other kinds of objects.
    * The `details` column reflects the same subfields, such as `totalObjects` and `completedObjects`, as the
      `SECONDARY_DOWNLOADING_METADATA` phase.

## Changed table functions

The following table functions are affected by this behavior change bundle.

### History functions

These table functions report the new `SECONDARY_COMMITTING` phase, and include the new `committed_object_count` column in their
result set.

* [REPLICATION_GROUP_REFRESH_HISTORY](../../../sql-reference/functions/replication_group_refresh_history.md)
* [REPLICATION_GROUP_REFRESH_HISTORY_ALL](../../../sql-reference/functions/replication_group_refresh_history.md)
* [LISTING_REFRESH_HISTORY](../../../sql-reference/functions/listing_refresh_history.md)

### Progress functions

These table functions report the new `SECONDARY_COMMITTING` phase in the `progress` and `details` columns of their result
sets. They don’t include the `committed_object_count` column in their result set.

* [REPLICATION_GROUP_REFRESH_PROGRESS](../../../sql-reference/functions/replication_group_refresh_progress.md)
* [REPLICATION_GROUP_REFRESH_PROGRESS_BY_JOB](../../../sql-reference/functions/replication_group_refresh_progress.md)
* [REPLICATION_GROUP_REFRESH_PROGRESS_ALL](../../../sql-reference/functions/replication_group_refresh_progress.md)
* [AVAILABLE_LISTING_REFRESH_HISTORY](../../../sql-reference/functions/available_listing_refresh_history.md) (despite the name, this
  function reports the progress of a refresh)

## Changed account usage and organization usage views

The following account usage and organization usage views are affected by this behavior change bundle. These views show the new
`SECONDARY_COMMITTING` phase. These views don’t include any new column. For real-time monitoring, use the table functions.

* [ACCOUNT_USAGE.REPLICATION_GROUP_REFRESH_HISTORY](../../../sql-reference/account-usage/replication_group_refresh_history.md)
* [ORGANIZATION_USAGE.REPLICATION_GROUP_REFRESH_HISTORY](../../../sql-reference/organization-usage/replication_group_refresh_history.md)

## Deprecated functions

This BCR has no effect on the deprecated functions for database replication, whose names start with `SYSTEM$DATABASE_REFRESH_`.
The `SECONDARY_COMMITTING` phase *isn’t* shown by those functions.

Ref: 2043

---
title: Replication: Add support for secret object
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1278.md
section: Release Notes
---

# Replication: Add support for secret object

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of replicating a secret is as follows:

Before the change:
:   The [secret](../../../sql-reference/sql/create-secret.md) object is not included in the database that contains the secret when you replicate
    the database.

After the change:
:   You can replicate the secret using a [replication or failover group](../../../user-guide/account-replication-intro.md). Specify the database
    that contains the secret, the database that contains UDFs or procedures that reference the secret, and the integrations that reference
    the secret in a single replication or failover group.

    If you have the database that contains the secret in one replication or failover group and the integration that references the secret in
    a different replication or failover group then:

    * If you replicate the integration first and then the secret, the operation is successful: all objects are replicated and there are no
      dangling references.
    * If you replicate the secret before the integration and the secret does not already exist in the target account, a “placeholder secret”
      is added in the target account to prevent a dangling reference. Snowflake maps the placeholder secret to the integration.

      After you replicate or failover the group that contains the integration and failover the group that contains the secret again,
      Snowflake updates the target account to replace the placeholder secret with the secret that is referenced in the integration.
    * If you replicate the secret and do not replicate or failover the group that contains the integration, when you decide to failover the
      target account back to the source account the secret and integration references match and the placeholder secret is not used. This
      allows you to use the security integration and the secret that contains the credentials.

Ref: 1274

---
title: Replication: Changes to refresh operations that fail with dangling reference errors
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1555.md
section: Release Notes
---

# Replication: Changes to refresh operations that fail with dangling reference errors

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

A [dangling reference](../../../user-guide/account-replication-considerations.md) occurs when an object in a replication or failover group has
an [object dependency](../../../user-guide/object-dependencies.md) on another object that is not included in the group. For example, a
materialized view `v1` in database `db1` references table `t1` in database `db2`. If `db1` is included
in replication group `rg1`, but `db2` is *not* included in the group, a dangling reference occurs because the referenced
object `t1` is not included in the group that contains the referencing object `v1`.

In some cases, a dangling reference causes the refresh operation to fail:

* Referenced [security policies](../../../user-guide/account-replication-security-integrations.md) are not included in the replication or
  failover group.
* Security policies are included in the group, but other required objects are not included in the group (for example, see
  [Replicating network policies](../../../user-guide/account-replication-security-integrations.md)).
* The referenced object for a [stream](../../../user-guide/account-replication-considerations.md) is not included in the replication or failover group.

The behavior for refresh operations that fail with dangling reference errors is as follows:

Before the change:
:   * Dangling reference error messages are not aggregated. Multiple dangling reference errors cause refresh operation failures to
      occur one after another, making it hard to address all issues at once.
    * Dangling reference error messages do not include the fully-qualified domain name of the missing referenced object.
    * A refresh operation with a dangling reference error might partially complete before failure, resulting in some objects being updated.

After the change:
:   * Dangling reference error messages are aggregated such that all cases that can cause refresh operation failures are surfaced at the
      same time.
    * Dangling reference error messages include the fully-qualified domain name of the missing referenced object.
    * Refresh operations with dangling reference errors fail before any secondary objects are updated.

Ref: 1555

---
title: Replication: Skip event tables and hybrid tables during refresh operation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1560-1582.md
section: Release Notes
---

# Replication: Skip event tables and hybrid tables during refresh operation

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

[Replication](../../../user-guide/account-replication-intro.md) doesn’t currently support
[event tables](../../../developer-guide/logging-tracing/event-table-setting-up.md) or [hybrid tables](../../../user-guide/tables-hybrid.md).
The behavior of a refresh operation for a primary database that contains an event table or hybrid table is as follows:

Before the change:
:   If a primary database contains an event table or a hybrid table, the refresh operation *fails*.

After the change:
:   If a primary database contains events tables or hybrid tables, the events tables and hybrid tables are skipped during the refresh
    operation and the refresh operation *succeeds*.

Ref: 1560, 1582

---
title: Replication: Skip external and Apache Iceberg™ tables during refresh operation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1528.md
section: Release Notes
---

# Replication: Skip external and Apache Iceberg™ tables during refresh operation

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

[Replication](../../../user-guide/account-replication-intro.md) doesn’t currently support
[external tables](../../../user-guide/tables-external-intro.md) or [Apache Iceberg™ tables](../../../user-guide/tables-iceberg.md). The behavior of a
refresh operation for a primary database that contains an external table or Iceberg table is as follows:

Before the change:
:   If a primary database contains an external table or an Iceberg table, the refresh operation fails.

After the change:
:   If a primary database contains external tables or Iceberg tables, the external tables or Iceberg tables are skipped during the
    refresh operation and the refresh operation succeeds.

Ref: 1528

---
title: Replication: Stages, pipes, storage integrations, and load history
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1461.md
section: Release Notes
---

# Replication: Stages, pipes, storage integrations, and load history

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

> **Note:**
>
> * This behavior change was part of the 2024_01 bundle, however it has been moved to the 2024_02
>   bundle.
> * Enabling the 2024_02 bundle might result in dropped objects in the target account.
>
>   If you have secondary databases in target accounts and you have manually created stages or pipes
>   in those target accounts, enabling this bundle might result in those objects being dropped. Once dropped, those objects
>   are *not* recoverable by disabling the BCR bundle. Please review the full text of this behavior change before enabling the bundle.

Snowflake accounts can be [replicated](../../../user-guide/account-replication-intro.md) across regions and cloud platforms. Supported
[database objects](../../../user-guide/account-replication-intro.md) are replicated to target accounts when a database is replicated.

The replication of internal and external stage objects, pipe objects, storage integrations, and table load history is available in
[preview](../../../user-guide/account-replication-stages-pipes-load-history.md). This change makes replication of stages, pipes, storage
integrations, and load history generally available when this BCR bundle becomes enabled by default.

Before the change:
:   Primary stage objects, pipe objects, storage integrations, and table load history are not replicated to target accounts (unless you
    have enabled the preview feature). Any existing stages and pipes in a target account are *not* modified during
    a refresh operation.

    If you are participating in the preview for storage integrations replication, and you include storage integrations in a replication
    or failover group by including `integrations` in the group’s `object_types` list *and* include `storage integrations`
    in the `allowed_integration_types` list, then any existing manually created storage integrations in the target account
    are *dropped*.

    If you are *not* participating in the preview for storage integrations replication (that is, you are not replicating storage
    integrations in a replication or failover group), existing storage integrations in a target account are not modified during a
    refresh operation.

After the change:
:   Primary stage objects, pipe objects, and table load history are replicated to target accounts when the database
    that contains them is replicated in a [replication or failover group](../../../user-guide/account-replication-intro.md). Primary storage
    integrations are replicated to target accounts if they are included in the replication or failover group. To replicate
    storage integrations, the `object_types` [parameter](../../../sql-reference/sql/create-replication-group.md) must include INTEGRATIONS
    *and* the `allowed_integrations` parameter must include STORAGE INTEGRATIONS for the group.

    If a target account has secondary databases with manually created internal or external stages, or pipes, these manually created
    objects are *dropped* when the replication or failover group is refreshed after this feature is enabled. Similarly, if the primary
    replication or failover group includes storage integrations, these manually created storage integrations are *dropped* in the target
    account during the refresh operation.

    If the primary database has an internal stage with directory table enabled, the files on the stage are also replicated. If there are
    files on the stage that are larger than 5GB, the refresh operation for the replication or failover group fails. To work around this
    limitation, move any files larger than 5GB to another stage. For more information, see
    [Considerations](../../../user-guide/account-replication-stages-pipes-load-history.md).

Stage, pipe, and load history replication is supported for databases that are replicated in replication or failover groups. This feature is not supported for database replication.

For more information, see [Stage, pipe, and load history replication](../../../user-guide/account-replication-stages-pipes-load-history.md).

As part of pipe object replication, two new execution states, `FAILING_OVER` and `READ_ONLY` are added to [SYSTEM$PIPE_STATUS](../../../sql-reference/functions/system_pipe_status.md) and is generally enabled, not configurable by this BCR bundle.

Ref: 1461

---
title: Replication: Support for replicating machine learning models
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1746.md
section: Release Notes
---

# Replication: Support for replicating machine learning models

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, [replication](../../../user-guide/account-replication-intro.md)
includes all machine learning models in the
[Snowflake Model Registry](../../../developer-guide/snowflake-ml/model-registry/overview.md).

> **Note:**
>
> For models to be replicated, this BCR bundle must be enabled on the source account **and** on all target accounts.

ML models are schema-level objects and are replicated when you replicate the database that contains them. This might
result in increased data transfer, compute, and storage costs.

|  |  |
| --- | --- |
| Before the change | Machine learning models are skipped during replication. |
| After the change | Machine learning models are replicated. |

Ref: 1746

---
title: REPLICATION_DATABASES View (Information Schema): Changes to Column Values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-892.md
section: Release Notes
---

# REPLICATION_DATABASES View (Information Schema): Changes to Column Values

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The following columns in the Information Schema [REPLICATION_DATABASES](../../../sql-reference/info-schema/replication_databases.md) view
use the legacy account identifier format:

* PRIMARY
* REPLICATION_ALLOWED_TO_ACCOUNTS
* FAILOVER_ALLOWED_TO_ACCOUNTS
* ACCOUNT_NAME

The column values have changed as follows:

Previously:
:   For the following columns:

    * PRIMARY
    * REPLICATION_ALLOWED_TO_ACCOUNTS
    * FAILOVER_ALLOWED_TO_ACCOUNTS

    The account identifier used the format:

    `<region_group>.<snowflake_ region>.<account_locator>`

    The ACCOUNT_NAME column displayed `<account_locator>`.

Currently:
:   For the following columns:

    * PRIMARY
    * REPLICATION_ALLOWED_TO_ACCOUNTS
    * FAILOVER_ALLOWED_TO_ACCOUNTS

    The account identifier uses the format:

    `<organization_name>.<account_name>`

    The ACCOUNT_NAME column displays `<account_name>`.

Ref: 892

---
title: REPLICATION_GROUPS View (Information Schema): New Column in View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-950.md
section: Release Notes
---

# REPLICATION_GROUPS View (Information Schema): New Column in View

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The Information Schema [REPLICATION_GROUPS](../../../sql-reference/info-schema/replication_groups.md) view now includes a REGION_GROUP column:

| Column Name | Data Type | Description |
| --- | --- | --- |
| REGION_GROUP | TEXT | Region group where the account that stores the replication or failover group is located. |

To maintain consistency with the Information Schema [REPLICATION_DATABASES](../../../sql-reference/info-schema/replication_databases.md) view,
this column has been added as the first (leftmost) column of the view.

Ref: 950

---
title: Restrict Apache Iceberg™ binary columns to maximum size
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2244.md
section: Release Notes
---

# Restrict Apache Iceberg™ binary columns to maximum size

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When you specify the binary data type for a column in an Apache Iceberg™ table, the column will always be mapped to the Iceberg binary
data type.

Before the change:
:   You can specify the `binary(L)` data type for a new column in an Iceberg table, which has the associated maximum length L. The `binary(L)`
    data type is mapped to the Iceberg `fixed(L)` data type instead of the Iceberg binary data type.
    In addition, you can specify the binary(L) data type for the keys and elements of new structured type columns, which also map to the
    Iceberg `fixed(L)` data type. For CTAS statements, a `binary(L)` column in the source table is created with a
    `binary(L)` column in the new table.

After the change:
:   In Iceberg tables, you must specify the `binary data` type as either binary or `binary(67108864)`. This requirement applies
    when you create new columns or define the key or element of structured type columns. Both types are mapped to the Iceberg binary data type.
    For CTAS statements, a `binary(L)` column in the source table is created with a `binary(67108864)` column in the new table.
    This change only affects new tables and new columns in existing tables.

This behavior change is being introduced to align Snowflake’s binary columns on Iceberg tables with the Iceberg binary type in the
Apache Iceberg™ table specification, which has no maximum length. This change eliminates ambiguous `binary(L)` definitions that conflict with
the Iceberg specification and can cause interoperability issues with external engines. For example, before the change, you can add a new
column with a binary data type that has an associated maximum length, such as `binary(10)`. Then an external engine could insert a value into
this column that exceeds the defined maximum length.

Ref: 2244

---
title: RESULT_SCAN Table Function: Changes to Duplicate Column Names
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1039.md
section: Release Notes
---

# RESULT_SCAN Table Function: Changes to Duplicate Column Names

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Currently, if there are duplicate column names in the results processed by the [RESULT_SCAN](../../../sql-reference/functions/result_scan.md) function,
the function appends a suffix to each duplicate column name to make the column names unique.
For example, if two columns are named `id`, the function returns a table with the column names `id` and `id_1`.

The way that the RESULT_SCAN function handles duplicate column names behaves as follows:

Previously:
:   If the results contain duplicate column names, the RESULT_SCAN function appends *_<n>* to each duplicate column name to make the column names unique.

    The function appends this suffix even when there are other columns with the same suffix. This can result in duplicate column names.

    For example:

    ```sqlexample
    SELECT 1 AS a, 2 AS a_1, 3 AS a;
    +---+-----+---+
    | A | A_1 | A |
    |---+-----+---|
    | 1 |   2 | 3 |
    +---+-----+---+
    SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    +---+-----+-----+
    | A | A_1 | A_1 |
    |---+-----+-----|
    | 1 |   2 |   3 |
    +---+-----+-----+
    ```

Currently:
:   The RESULT_SCAN function appends a suffix with the next available number to make the column names unique:

    ```sqlexample
    SELECT * FROM TABLE(RESULT_SCAN(LAST_QUERY_ID()));
    +---+-----+-----+
    | A | A_1 | A_2 |
    |---+-----+-----|
    | 1 |   2 |   3 |
    +---+-----+-----+
    ```

Ref: 1039

---
title: Retirement window for free listings published on the Snowflake Marketplace
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1574.md
section: Release Notes
---

# Retirement window for free listings published on the Snowflake Marketplace

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

When a listing is removed from the Snowflake Marketplace, and the provider of the listing is eligible to offer paid listings,
the listing goes through a retirement process before consumers lose access to the data product. The retirement process for free listings
offered by these providers on the Snowflake Marketplace behaves as follows:

Before the change:
:   Consumers retain access to the data product for one full calendar month after they retire the listing, in addition to partial months, and
    lose access on the first calendar day of the next month.

After the change:
:   For free listings published on the Snowflake Marketplace that are retired after May 1, 2024 and offered by providers that are eligible to
    offer paid listings, consumers retain access to the data product for 30 days after they retire the listing.
    Paid listings are unaffected by this change and continue to follow the retirement process from before the change.

Ref: 1574

---
title: Roles and Privileges: Changes to Secondary Roles and the REPLICATE Privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1042.md
section: Release Notes
---

# Roles and Privileges: Changes to Secondary Roles and the REPLICATE Privilege

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The [REPLICATE privilege](../../../user-guide/account-replication-considerations.md) grants the ability to refresh a secondary [replication or failover group](../../../user-guide/account-replication-intro.md). A user with the REPLICATE privilege granted to a *primary* role (the current role activated by executing the [USE ROLE](../../../sql-reference/sql/use-role.md) command) or any *secondary* role (activated by executing the [USE SECONDARY ROLES](../../../sql-reference/sql/use-secondary-roles.md) command) can successfully refresh a secondary replication or failover group.

A secondary replication or failover group can be refreshed by executing the respective command:

* [ALTER REPLICATION GROUP group_name REFRESH](../../../sql-reference/sql/alter-replication-group.md)
* [ALTER FAILOVER GROUP group_name REFRESH](../../../sql-reference/sql/alter-failover-group.md)

Secondary replication or failure behaves as follows:

Previously:
:   A user with a role that is granted the REPLICATE privilege activated as either the primary or secondary role can successfully refresh a secondary replication or failover group.

Currently:
:   A user with a role that is granted the REPLICATE privilege must have the role activated as the *primary* role to successfully refresh a secondary replication or failover group.

    Otherwise, executing the refresh command will fail.

Ref: 1042

---
title: ROLES view (Account Usage): New columns and new value for ROLE_TYPE column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1229-1240.md
section: Release Notes
---

# ROLES view (Account Usage): New columns and new value for ROLE_TYPE column

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The Account Usage [ROLES](../../../sql-reference/account-usage/roles.md) view in the shared SNOWFLAKE database behaves as follows:

Previously:
:   The ROLES view does not include the ROLE_ID and ROLE_INSTANCE_ID columns.

    Additionally, the ROLE_TYPE column does not support INSTANCE_ROLE as a possible value.

Currently:
:   The output from a query on the ROLES view includes the ROLE_ID and ROLE_INSTANCE_ID columns.

    | Column | Data type | Description | Notes |
    | --- | --- | --- | --- |
    | ROLE_ID | NUMBER | Internal/system-generated identifier for the role. | This is the first ordinal column in the output. |
    | ROLE_INSTANCE_ID | NUMBER | Internal/system-generated identifier for the class instance that the role belongs to. | This is the last ordinal column in the output. |

    Additionally, the ROLE_TYPE column includes a new value, INSTANCE_ROLE, to indicate that the role is associated with a particular class
    instance.

    For details about classes, instances, and the associated roles, see [Snowflake classes](../../../sql-reference/snowflake-db-classes.md).

Ref: 1229 , 1240

---
title: ROLES view: New column is_from_organization_user_group
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2104.md
section: Release Notes
---

# ROLES view: New column `is_from_organization_user_group`

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the ROLES view in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) schemas includes the following new
column:

| Column name | Data type | Description |
| --- | --- | --- |
| `is_from_organization_user_group` | BOOLEAN | If TRUE, the role was imported from an [organization user group](../../../user-guide/organization-users.md). |

Ref: 2104

---
title: Roles: Changes to How Regrants Are Recorded in the GRANTS_TO_USERS View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1132.md
section: Release Notes
---

# Roles: Changes to How Regrants Are Recorded in the GRANTS_TO_USERS View

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The output of the GRANTS_TO_USERS view changed in terms of grants of the same role to the same user were recorded:

Previously:
:   The view included a row for each grant of the same role to the same user. The DELETED_ON column value is NULL for the row containing the
    active grant. When a regrant occurs, the row containing the previous grant has the DELETED_ON column value updated to the timestamp for
    when the regrant occurred.

Currently:
:   The view includes one row for the grant of the same role to the same user. Regrants of the same role to the same user are not recorded as new
    rows. The DELETED_ON column remains NULL while the grant is active, and the column value is updated when the role is REVOKED from the user.

    After revoking the role from the user, a grant of the same role to the same user will be recorded in a new row. In this new row, the DELETED_ON
    column value is NULL because the grant is now active.

Use the following query to help identify whether your account has records in the view that will be affected:

* TRUE: There are records in the view that will be affected.
* FALSE: There are no records in the view that will be affected.

```sqlexample
SELECT
    COUNT(*) > 0 AS IS_IMPACTED
FROM
    SNOWFLAKE.ACCOUNT_USAGE.GRANTS_TO_USERS AS GL
        INNER JOIN SNOWFLAKE.ACCOUNT_USAGE.GRANTS_TO_USERS AS GR
            ON GL.ROLE = GR.ROLE
            AND GL.GRANTED_TO = GR.GRANTED_TO
            AND GL.GRANTEE_NAME = GR.GRANTEE_NAME
            AND GL.GRANTED_BY = GR.GRANTED_BY
            AND GL.DELETED_ON = GR.CREATED_ON
            AND GR.DELETED_ON IS NOT NULL;
```

Ref: 1132

---
title: SCIM Security Integrations: Using the ENABLED Parameter to Enable or Disable an Integration
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-937.md
section: Release Notes
---

# SCIM Security Integrations: Using the ENABLED Parameter to Enable or Disable an Integration

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The ENABLED parameter of a SCIM security integration now controls whether the integration is enabled or not:

Previously:
:   Though every SCIM security integration had an ENABLED parameter, its value had no effect. All SCIM security integrations were enabled.

Currently:
:   The ENABLED parameter controls whether a SCIM security integration is enabled. Setting ENABLED=FALSE disables the integration.

    As part of this change:

    * All existing SCIM security integrations are now set to ENABLED=TRUE, which reflects the fact that they have always been enabled.
    * All new security integrations default to ENABLED=TRUE.

Ref: 937

---
title: SCIM: List API returns paginated results; unsupported filters rejected (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2276.md
section: Release Notes
---

# SCIM: List API returns paginated results; unsupported filters rejected (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

The SCIM List API endpoints (`GET /scim/v2/Users` and `GET /scim/v2/Groups`)
are changing to return a paginated list of users or groups in the account,
optionally matching a provided `eq` or `sw` filter. Previously, these
endpoints returned a single synthetic sample object.

**Unfiltered list calls**

Before the change:
:   `GET /scim/v2/Users` and `GET /scim/v2/Groups` without a `filter` parameter
    returned a single synthetic sample object. For example, `GET /scim/v2/Users`
    returned:

    ```json
    {
      "totalResults": 1,
      "startIndex": 1,
      "itemsPerPage": 1,
      "Resources": [{
        "schemas": ["urn:ietf:params:scim:schemas:core:2.0:User", "..."],
        "id": "1",
        "externalId": "synthesis_user_external_id",
        "userName": "synthesis_user",
        "displayName": "synthesis_user_display_name",
        "active": false
      }]
    }
    ```

    Similarly, `GET /scim/v2/Groups` returned a single `synthetic_group` object.

After the change:
:   The same calls return a paginated list of users or groups in the account,
    with standard SCIM pagination (`startIndex`, `count`).

**Filter expression validation**

Before the change:
:   Filter values were parsed loosely. Unsupported filter expressions, including
    those with logical operators (`and`, `or`) and malformed filter values,
    were silently accepted and returned `200 OK` with an empty `Resources`
    array.

After the change:
:   Filter values are now parsed as quoted JSON strings, consistent with SCIM’s
    JSON encoding requirements. Unsupported or malformed filter expressions are
    rejected with HTTP `400 Bad Request` and SCIM error type `invalidFilter`.
    Supported single-attribute filters with `eq` or `sw` are not affected.

**ServiceProviderConfig filter support**

Before the change:
:   `GET /scim/v2/ServiceProviderConfig` reported `filter.supported` as an
    integer (`0`).

After the change:
:   `filter.supported` is a boolean (`true`), as required by RFC 7643.

Customers whose SCIM clients send filter expressions with `and` or `or`
should update the client to use simple filters or handle `400 invalidFilter`
responses.

> **Note:**
>
> Customers with large numbers of users or groups may see increased response
> times for unfiltered list calls. To reduce response sizes, use
> `excludedAttributes=groups` when listing users or `excludedAttributes=members`
> when listing groups. This prevents membership expansion, which is the primary
> driver of response size and latency for large accounts.

Ref: 2276

---
title: Search Optimization: Removing Search Optimization from a Table Requires the ADD SEARCH OPTIMIZATION Privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1046.md
section: Release Notes
---

# Search Optimization: Removing Search Optimization from a Table Requires the ADD SEARCH OPTIMIZATION Privilege

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

According to [What Access Control Privileges Are Needed For the Search Optimization Service](../../../user-guide/search-optimization-service.md),
making changes to a table’s search optimization configuration requires not only ownership of the table, but also
the [ADD SEARCH OPTIMIZATION](../../../user-guide/security-access-control-privileges.md) access control privilege on the schema that
contains the table.

However, currently, you can [remove search optimization](../../../user-guide/search-optimization/enabling.md) from a table
you own without having the ADD SEARCH OPTIMIZATION privilege.

Snowflake requires the ADD SEARCH OPTIMIZATION privilege to behave as documented:

Previously:
:   The command ALTER TABLE DROP SEARCH OPTIMIZATION succeeds when used on a table you own, even if your role does not have the ADD SEARCH OPTIMIZATION privilege on the schema that contains the table.

Currently:
:   The ALTER TABLE DROP SEARCH OPTIMIZATION command fails when used on a table you own if your role does not have the ADD SEARCH OPTIMIZATION privilege. The error message is as follows:

    ```sqlexample
    FAILURE: SQL access control error:
    Insufficient privileges to operate on schema '<schema_name>'
    ```

    If you have this privilege, the command succeeds.

To grant the required privilege to a role, issue the following command:

`GRANT ADD SEARCH OPTIMIZATION ON SCHEMA <schema_name> TO ROLE <role>;`

Ref: 1046

---
title: Secure objects: Redaction of information in error messages
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1858.md
section: Release Notes
---

# Secure objects: Redaction of information in error messages

Error messages related to secure objects behave as follows:

Before the change:
:   Error messages related to secure objects show the full message.

After the change:
:   Error messages related to secure objects might be redacted.

The change applies to error messages related to the following types of objects:

* [Secure views](../../../user-guide/views-secure.md)
* [Secure functions](../../../developer-guide/secure-udf-procedure.md) (including secure table functions)
* [Masking policies](../../../user-guide/security-column-intro.md)
* [Row access policy bodies](../../../user-guide/security-row-intro.md)

For more information about secure objects, see [Use secure objects to control data access](../../../user-guide/data-sharing-secure-views.md).

When an error is detected during the expansion or evaluation of a secure object, the error message is considered
for redaction. When an error message is redacted, the error code remains unchanged.

Two types of changes to error messages are possible: redaction during execution and redaction in
metadata after execution. These types of changes are described in the following sections.

## Redaction during execution

A whole error message or a part of an error message can be redacted when the error is returned during an operation.
Generally, this type of error message redaction occurs when a user tries to use a secure object without
having the [OWNERSHIP](../../../user-guide/security-access-control-privileges.md) privilege on the secure object.

## Redaction in metadata after execution

Users can view metadata about errors after they occur, including the error messages. For example, users can
view this metadata in the Query History page in Snowsight, or by querying views and calling functions in
the [Snowflake Information Schema](../../../sql-reference/info-schema.md). When an error message is redacted during execution, the error message is always
redacted in the metadata after execution for all users.

When an error message isn’t redacted during execution, the message appears unchanged in the metadata for some
users and is redacted for other users. The error message is unchanged in the metadata in either of the following
cases:

* The user viewing the metadata has the [AUDIT](../../../user-guide/security-access-control-privileges.md) privilege.
* The user viewing the metadata has the [ENABLE_UNREDACTED_SECURE_OBJECT_ERROR](../../../sql-reference/parameters.md)
  user parameter set to `TRUE`. A user with the AUDIT privilege can set this parameter for a user.
* The user viewing the metadata executed the statement that caused the error.

In all other cases, the error message is redacted in the metadata. Redacted error messages include the text:
`Error in secure object`.

## Examples of error message redaction

The following examples show error messages that are redacted. The redaction can occur during execution or in metadata
after execution.

### Example 1: Querying a secure view

In the following example, a user with the SELECT privilege on a secure view executes a query on the view that returns an error.

Create the secure view:

```sqlexample
CREATE SECURE VIEW myview
  AS SELECT a FROM mytable;
```

Drop the table used in the view query:

```sqlexample
DROP TABLE mytable;
```

Execute a query on the view:

```sqlexample
SELECT * FROM myview;
```

#### Error message displayed to all users before the change

```output
002037 (42601): SQL compilation error:
Failure during expansion of view 'MYVIEW': SQL compilation error:
Object 'DB.SC.MYTABLE' does not exist or not authorized.
```

#### Redacted error message displayed to some users after the change

```output
002037 (42601): SQL compilation error:
Failure during expansion of view 'MYVIEW': Error in secure object
```

### Example 2: Running a query that calls a secure function

In the following examples, a user with the USAGE privilege on a secure function executes a query that calls the secure
function, but the secure function returns an error.

#### Example 2a: The function arguments result in an error

Create the secure function:

```sqlexample
CREATE SECURE FUNCTION myfunction1(x FLOAT, y FLOAT)
  RETURNS FLOAT
  LANGUAGE SQL
AS
  'SELECT x / y';
```

Execute a query that calls the secure function:

```sqlexample
SELECT myfunction1(1, 0);
```

##### Error message displayed to all users before the change

```output
100051 (22012): Division by zero
```

##### Redacted error message displayed to some users after the change

```output
100051 (22012): Error in secure object
```

#### Example 2b: An object the function depends on is deleted

Create the secure function:

```sqlexample
CREATE SECURE FUNCTION myfunction2()
  RETURNS TABLE (a NUMBER)
  LANGUAGE SQL
AS
  'SELECT * FROM mytable';
```

Drop the table used in the function:

```sqlexample
DROP TABLE mytable;
```

Execute a query that calls the secure function:

```sqlexample
SELECT * FROM TABLE(myfunction2());
```

##### Error message displayed to all users before the change

```output
002003 (42S02): SQL compilation error:
Object 'DB.SC.MYTABLE' does not exist or not authorized
```

##### Redacted error message displayed to some users after the change

```output
002003 (42S02): Error in secure object
```

### Example 3: A masking policy returns an error

In the following example, a user runs a query on a view with a masking policy that encounters an error.

Create a masking policy:

```sqlexample
CREATE MASKING POLICY allowed_role_names_mp as (val NUMBER) RETURNS NUMBER ->
  CASE
    WHEN EXISTS
      (SELECT role FROM allowed_roles WHERE role = CURRENT_ROLE()) THEN val
    ELSE '********'
  END;
```

Create a view and set the masking policy on a column in the view:

```sqlexample
CREATE TABLE test_masking_policy(x NUMBER) AS
  SELECT * FROM VALUES (1), (2), (3);

CREATE VIEW myview_mp
  AS SELECT * FROM test_masking_policy;

ALTER VIEW myview_mp
  MODIFY COLUMN x SET MASKING POLICY allowed_role_names_mp;
```

Drop the table used in the masking policy:

```sqlexample
DROP TABLE allowed_roles;
```

Execute a query on the view as a user that doesn’t have ownership privileges on the masking policy:

```sqlexample
SELECT * FROM myview_mp;
```

#### Error message displayed to all users before the change

```output
002003 (42S02): SQL compilation error:
Object 'DB.SC.ALLOWED_ROLES' does not exist or not authorized.
```

#### Redacted error message displayed to some users after the change

```output
002003 (42S02): Error in secure object
```

### Example 4: A row access policy returns an error

In the following example, a user runs a query on a view with a row access policy and encounters an error.

Create a row access policy:

```sqlexample
CREATE OR REPLACE ROW ACCESS POLICY myrap AS (role NUMBER) RETURNS BOOLEAN ->
  EXISTS (
    SELECT 1 FROM allowed_roles
      WHERE role::STRING = CURRENT_ROLE());
```

Create a view and add the row access policy on the view:

```sqlexample
CREATE TABLE test_row_access_policy(x NUMBER) AS
  SELECT * FROM VALUES (1), (2), (3);

CREATE VIEW myview_rap
  AS SELECT * FROM test_row_access_policy;

ALTER VIEW myview_rap
  ADD ROW ACCESS POLICY myrap ON (x);
```

Drop the table used in the row access policy:

```sqlexample
DROP TABLE allowed_roles;
```

Query the view as a user that doesn’t have OWNERSHIP privileges on the row access policy:

```sqlexample
SELECT * FROM myview_rap;
```

#### Error message displayed to all users before the change

```output
002003 (42S02): SQL compilation error:
Object 'DB.SC.ALLOWED_ROLES' does not exist or not authorized.
```

#### Redacted error message displayed to some users after the change

```output
002003 (42S02): Error in secure object
```

Ref: 1858

---
title: Security: Update dangling network policy references
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1622.md
section: Release Notes
---

# Security: Update dangling network policy references

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

Snowflake network policies behave as follows:

Before the change:
:   You can specify a network policy in the [NETWORK_POLICY](../../../sql-reference/parameters.md) parameter and drop the network policy. The result is a
    dangling reference of the network policy because it no longer exists. Subsequently, network traffic is allowed to access Snowflake
    regardless of the definition of the dropped network policy and any network rules added to the dropped network policy.

After the change:
:   Snowflake sends you an automated email with information about how to fix dangling network policy references in the NETWORK_POLICY
    parameter. The email is sent daily until you fix the dangling network policy references.

    Additionally, if you specify a network policy in this parameter, you cannot drop the network policy using a DROP NETWORK POLICY command
    or replace the network policy with a CREATE OR REPLACE NETWORK POLICY command. To do either of these actions, update the parameter value
    to remove the network policy and then execute the desired command.

---
title: SEMANTIC_CATEGORY system tag: Allowed values constraint removed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1295.md
section: Release Notes
---

# SEMANTIC_CATEGORY system tag: Allowed values constraint removed

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The behavior of the SNOWFLAKE.CORE.SEMANTIC_CATEGORY system tag is as follows:

Previously:
:   You can call the [SYSTEM$GET_TAG_ALLOWED_VALUES](../../../sql-reference/functions/system_get_tag_allowed_values.md) function to return a list of tag string values to set with
    the SEMANTIC_CATEGORY tag. The string value that you choose must be one of the values that the system function returns.

    Additionally, you can query the Account Usage [TAGS](../../../sql-reference/account-usage/tags.md) view to see the allowed values for the
    SEMANTIC_CATEGORY tag.

Currently:
:   When you call the SYSTEM$GET_TAG_ALLOWED_VALUES function and specify the SEMANTIC_CATEGORY tag, Snowflake returns NULL.

    You can specify any string value when you set this tag on a column. The tag value is always a string, and the maximum number of
    characters for the tag value is 256.

    When you query the Account Usage TAGS view, the ALLOWED_VALUES column returns NULL for the SEMANTIC_CATEGORY tag.

Ref: 1295

---
title: Sensitive Data Classification: Preserving user-specified tag values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1929.md
section: Release Notes
---

# Sensitive Data Classification: Preserving user-specified tag values

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

Snowflake provides the ability to automatically classify sensitive data and apply system classification tags (SEMANTIC_CATEGORY and
PRIVACY_CATEGORY) to data deemed to be sensitive. The behavior of this process changes as follows:

Before the change:
:   When Sensitive Data Classification automatically applied the SEMANTIC_CATEGORY and PRIVACY_CATEGORY tags, it always used values suggested
    by Snowflake, even if the user had previously set the value of these tags to a different value.

After the change:
:   If a user manually sets the value of the SEMANTIC_CATEGORY or PRIVACY_CATEGORY tag, automatic Sensitive Data Classification does not
    overwrite the value of the tag.

Ref: 1929

---
title: Sep 02, 2025: Cortex Agents: Admin object REST API (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-02-cortex-agents-rest-api-object.md
section: Release Notes
---

# Sep 02, 2025: Cortex Agents: Admin object REST API (*Preview*)

Create an agent object using the REST API, and then integrate the agent into your application to perform tasks or respond to queries. You can now configure a thread to maintain the context in memory, so that the client does not have to send the context at every turn of the conversation.

Includes expanded agent functionality, updates to the Cortex Agents workflow, and the addition of additional REST API endpoints.

For more information, see [Configure and interact with Agents](../../../user-guide/snowflake-cortex/cortex-agents-manage.md).

---
title: Sep 02, 2025: Document AI models in the model registry
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-02-document-ai.md
section: Release Notes
---

# Sep 02, 2025: Document AI models in the model registry

Document AI now stores any published or trained models within the [Snowflake Model Registry](../../../developer-guide/snowflake-ml/model-registry/overview.md).

You can now copy the Document AI models between databases or schemas in the same account or between different accounts in the same organization,
to easily manage and control model releases with versioning and role-based access control (RBAC). The model registry serves as the control plane
for deploying Document AI model versions safely and efficiently across environments.

This feature is available to accounts in AWS and Microsoft Azure. Google Cloud is not supported.

> **Note:**
>
> New Document AI models are automatically integrated into the model registry; existing models must be manually integrated.
>
> * To manually integrate an existing model into the model registry, when prompted, select Start on the integration banner in the UI.
>
> For more information, see [Document AI: CREATE MODEL privilege required to create, publish, and train model builds](../../bcr-bundles/un-bundled/bcr-1904.md).

---
title: Sep 02, 2025: Partitioned writes for Apache Iceberg™ tables (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-02-iceberg-partitioned-writes.md
section: Release Notes
---

# Sep 02, 2025: Partitioned writes for Apache Iceberg™ tables (*Preview*)

With partitioned write support for Iceberg tables, Snowflake improves compatibility with
the wider Iceberg ecosystem and enables accelerated read queries from external Iceberg tools.
You can now use Snowflake to create and write to both Snowflake-managed and
externally managed Iceberg tables with partitioning schemes.

For more information, see [Iceberg partitioning](../../../user-guide/tables-iceberg-metadata.md).

---
title: Sep 09, 2025: Hybrid table support for Microsoft Azure (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-09-hybrid-tables-azure-pupr.md
section: Release Notes
---

# Sep 09, 2025: Hybrid table support for Microsoft Azure (*Preview*)

Hybrid tables are now supported in Microsoft Azure commercial regions. For more information,
see [Clouds and regions](../../../user-guide/tables-hybrid-limitations.md).

---
title: Sep 09, 2025: Sensitive data classification
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-09-data-classification.md
section: Release Notes
---

# Sep 09, 2025: Sensitive data classification

## Classifying views automatically (*General availability*)

You can now configure sensitive data classification so the views in a database are automatically classified at regular intervals.
Previously, only tables could be classified automatically.

## Excluding objects from automatic classification (*Preview*)

By default, Snowflake automatically classifies all sensitive data in a database that has a classification profile set on it. You can now
configure Snowflake to exclude schemas, tables, or columns from automatic classification so that they are skipped during the classification
process.

For more information, see [Excluding data from sensitive data classification](../../../user-guide/classify-auto-exclude.md).

---
title: Sep 09, 2025: Using Snowsight to monitor data quality (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-11-dq-ui.md
section: Release Notes
---

# Sep 09, 2025: Using Snowsight to monitor data quality (*Preview*)

A new Data Quality tab is available when you are viewing a table or view in Snowsight. By selecting this tab, you can do the
following:

* Use data profiling to view statistics about your object, such as row counts, null values, and value distributions. Data profiling doesn’t
  require any setup, and helps users understand the structure, content, and potential quality issues in their data sets. For more
  information, see [Use data profiling to understand your data](../../../user-guide/data-quality-profile.md).
* Monitor the results of data metric functions (DMFs) associated with the object. The interactive interface displays DMF results, run
  schedules, and trends. You can drill down into failed quality checks, view impacted assets, and investigate specific records that violate
  a data quality check. For more information, see [Monitoring data quality checks in Snowsight](../../../user-guide/data-quality-ui-monitor.md).

---
title: Sep 11, 2025: Support for Snowflake Cortex AI Functions in incremental dynamic table refresh
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-11-dynamic-tables-cortex-aisql-support.md
section: Release Notes
---

# Sep 11, 2025: Support for Snowflake Cortex AI Functions in incremental dynamic table refresh

You can now use [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md) in the SELECT clause for dynamic tables in incremental refresh mode. The same
availability restrictions as described in [Cortex AI functions](../../../user-guide/snowflake-cortex/aisql.md) apply.

Cortex AI Functions let you add AI-powered insights directly to your dynamic tables, automatically analyzing data as it updates. For example, it can
classify customer reviews, support tickets, or survey responses as positive/negative or assign categories.

In the following example, `review_sentiment` uses AI_FILTER to evaluate each review with an LLM. In AI_FILTER, the
prompt `The reviewer enjoyed the restaurant` with the actual review text. The output column `enjoyed` is the
classification generated by AI_FILTER based on the prompt, indicating whether the reviewer enjoyed the restaurant.

```sqlexample
CREATE OR REPLACE TABLE reviews AS
  SELECT 'Wow... Loved this place.' AS review
  UNION ALL
  SELECT 'The pizza is not good.' AS review;

CREATE OR REPLACE DYNAMIC TABLE review_sentiment
  TARGET_LAG = DOWNSTREAM
  WAREHOUSE = mywh
  REFRESH_MODE = INCREMENTAL
  AS
    SELECT review, AI_FILTER(CONCAT('The reviewer enjoyed the restaurant', review), {'model': 'llama3.1-70b'}) AS enjoyed FROM reviews;
```

---
title: Sep 11, 2025: Workspaces (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-11-workspaces-ga.md
section: Release Notes
---

# Sep 11, 2025: Workspaces (*General availability*)

Workspaces in Snowsight are now generally available and are no longer in [Preview](../../preview-features.md). A workspace is a unified editor for creating, organizing, and managing code across multiple file types. Workspaces improves your SQL editing experience with nested folders, rich editor capabilities, Copilot assistance, improved charting, and column stats. All content in a workspace is file-based, which makes it easier to work on complex projects and integrate with Git for version control, collaboration, and alignment with existing workflows.

For more information, see [Workspaces](../../../user-guide/ui-snowsight/workspaces.md).

---
title: Sep 12, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Amazon S3 or Google Cloud (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-12-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-s3-google-cloud.md
section: Release Notes
---

# Sep 12, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Amazon S3 or Google Cloud (*Preview*)

Snowflake now supports position row-level deletes when writing to externally managed Apache Iceberg™ or Iceberg tables in catalog-linked
databases when the tables are stored on Amazon S3 or Google Cloud. These deletes are supported when Snowflake performs update, delete, and
merge operations on the tables. This feature is a performance improvement for these operations.

For more information, see [Write support for externally managed Apache Iceberg™ tables](../../../user-guide/tables-iceberg-externally-managed-writes.md) and [Use a catalog-linked database for Apache Iceberg™ tables](../../../user-guide/tables-iceberg-catalog-linked-database.md).

---
title: Sep 15, 2025: Billing views for Snowflake resellers and distributors
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-15-billing-schema.md
section: Release Notes
---

# Sep 15, 2025: Billing views for Snowflake resellers and distributors

Views in the new BILLING schema provide billing information for the customers of Snowflake resellers and distributors. For example, the
new PARTNER_CONTRACT_ITEMS view provides insights into the contracts between the reseller and their customers.

Only resellers and distributors can access these views.

For more information, see [BILLING schema](../../../sql-reference/billing.md).

---
title: Sep 15, 2025: Multi-factor authentication — Support for one-time passcodes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-15-otp.md
section: Release Notes
---

# Sep 15, 2025: Multi-factor authentication — Support for one-time passcodes

You can now generate one-time passcodes (OTPs) that users can use as their second factor of authentication when signing in to
Snowflake with multi-factor authentication (MFA). Organizations often use OTPs to provide *break glass access*, that is, access when regular
authentication methods are unavailable, such as when the organization’s identity provider has an outage.

To provide break glass access, an organization creates a dedicated Snowflake user, and then stores the user’s password and OTPs in a
key vault. To access Snowflake, an administrator retrieves the password and an OTP from the vault, and then signs in.

For more information, see [Setting up administrators for break glass access](../../../user-guide/security-mfa.md).

---
title: Sep 15, 2025: Snowflake Native Apps updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-15-native-app-ga.md
section: Release Notes
---

# Sep 15, 2025: Snowflake Native Apps updates

The following features are generally available:

## Automated granting of privileges (General availability)

Snowflake Native App providers can use automated granting of privileges to add the privileges the app requires to
the manifest file. Automated granting of privileges allows an app to create objects in the consumer account
without requiring the consumer to explicitly grant privileges to the app or create objects manually. For more
information on using automated granting of privileges, see [Configure the privileges required by an app](../../../developer-guide/native-apps/requesting-auto-privs.md).

To maintain control over what the app can do in the consumer account, a consumer account administrator may use
feature policies.

## App specifications (General availability)

App specifications allow a Snowflake Native App provider to specify the connection information the app requests.
When the consumer installs the app, they review the app specification and approve or decline it as necessary.

For more information on using app specifications to request privileges from the consumer, see
[Overview of app specifications](../../../developer-guide/native-apps/requesting-app-specs.md).

For information on approving app specifications when configuring an app, see
[Approve app specifications](../../../developer-guide/native-apps/ui-consumer-app-spec.md).

## Feature policies (General availability)

Snowflake Native App consumer account administrators can create feature policies to limit the objects an app can create in the
consumer account. Administrators can review the privileges the app requires in the listing before installing the app.

For more information on using feature policies, see [Use feature policies to limit the objects an app can create](../../../developer-guide/native-apps/ui-consumer-feature-policies.md).

---
title: Sep 16, 2025: Support for Streamlit in Snowflake in the People’s Republic of China (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-16-sis.md
section: Release Notes
---

# Sep 16, 2025: Support for Streamlit in Snowflake in the People’s Republic of China (Preview)

Streamlit in Snowflake is now available in the People’s Republic of China.

---
title: Sep 17, 2025: Data lineage for tasks
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-17-task-lineage.md
section: Release Notes
---

# Sep 17, 2025: Data lineage for tasks

You can now use data lineage to determine that data moved from a source object to a downstream
object as the result of a [task](../../../user-guide/tasks-intro.md). Selecting the arrow that connects the source object and the downstream
object displays information about the task.

For more information about using data lineage, see [Data Lineage](../../../user-guide/ui-snowsight-lineage.md).

---
title: Sep 17, 2025: New SYS_CONTEXT function for getting context about applications, sessions, and organizations
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-17-sys_context-function.md
section: Release Notes
---

# Sep 17, 2025: New SYS_CONTEXT function for getting context about applications, sessions, and organizations

You can call the new [SYS_CONTEXT](../../../sql-reference/functions/sys_context.md) function to get context information about:

* [The current application](../../../sql-reference/functions/sys_context_snowflake_application.md)
* [The current environment](../../../sql-reference/functions/sys_context_snowflake_environment.md) (for example, the current account or
  region)
* [The current session](../../../sql-reference/functions/sys_context_snowflake_session.md)

For example, you can:

* Determine if an application role is activated.
* Identify the client, driver, or library that is calling the function.
* Determine if the function is being called by a person, task, or SPCS service.

You can also get context information about
[organizations](../../../sql-reference/functions/sys_context_snowflake_organization.md), including information about the
[organization information related to the session](../../../sql-reference/functions/sys_context_snowflake_organization_session.md). For
example, you can:

* Determine if an organization user or group has been imported.
* Determine if the role representing an organization user group is activated.
* Identify the name of the organization user who started the session.

> **Note:**
>
> Getting context information about organizations is in [Preview](../../preview-features.md).

For more information, see [SYS_CONTEXT](../../../sql-reference/functions/sys_context.md).

---
title: Sep 17, 2025: Snowflake Openflow - Snowflake Deployments (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-17-openflow.md
section: Release Notes
---

# Sep 17, 2025: Snowflake Openflow - Snowflake Deployments (*Preview*)

Snowflake announces the preview of Openflow Snowflake Deployments, which run on Snowpark Container Services (SPCS).

> **Note:**
>
> Snowflake is gradually rolling out support for Openflow Snowflake Deployments.
> During this transition, not all accounts will include this new feature.
> Over the course of the next several weeks, starting September 17th, 2025,
> Snowflake Openflow deployments will start becoming available.

Openflow Snowflake deployments run on [Snowpark Container Services (SPCS)](../../../developer-guide/snowpark-container-services/overview.md) and
provide a streamlined and integrated solution for data integration and connectivity across interoperable
storage like Apache Iceberg™ tables and Snowflake native storage.
As a fully self-contained service within Snowflake, Snowflake Openflow is easy to deploy and manage,
offering a convenient and cost-effective environment for running your data flows.
A key advantage is its native integration with Snowflake’s security model, which allows seamless authentication,
authorization, and network security and simplified operations.

For more information, see [About Openflow - Snowflake Deployments](../../../user-guide/data-integration/openflow/about-spcs.md).

---
title: Sep 19, 2025: Read consistency mode for sessions with near-concurrent changes
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-19-read-consistency-mode.md
section: Release Notes
---

# Sep 19, 2025: Read consistency mode for sessions with near-concurrent changes

You can set the [READ_CONSISTENCY_MODE](../../../sql-reference/parameters.md) account-level parameter to define the level of consistency guarantees
that are required for sessions with near-concurrent changes.

For more information, see [Read consistency across sessions](../../../sql-reference/transactions.md) and [ALTER ACCOUNT](../../../sql-reference/sql/alter-account.md).

---
title: Sep 19, 2025: SnowConvert AI Verification (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-19-snowconvert-ai-verification.md
section: Release Notes
---

# Sep 19, 2025: SnowConvert AI Verification (*Preview*)

AI Verification strengthens SnowConvert AI by automating functional validation of converted database code.
AI Verification uses synthetic data generation, AI-driven unit testing, and AI-driven resolution of errors
identified in the conversion.

For more information, see
[AI code conversion](../../../migrations/snowconvert-docs/snowconvert-ai-verification.md).

---
title: Sep 19, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-19-native-apps-spcs-aws-fedramp-ga.md
section: Release Notes
---

# Sep 19, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*Preview*)

Snowflake Native App with Snowpark Container Services now support FedRAMP on Amazon Web Services. Apps with containers can be
distributed to any Snowflake customer who can use them in FedRAMP region.

Apps running in FedRAMP can use the functionality of Snowpark Container Services, including compute pools, services, jobs, and external access integrations.

For information on limitations in Amazon government regions, see [Limitations on Snowflake Native App with Snowpark Container Services](../../../developer-guide/native-apps/limitations.md).

---
title: Sep 19, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Azure (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-19-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-azure.md
section: Release Notes
---

# Sep 19, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Azure (*Preview*)

Snowflake now supports position row-level deletes for Azure when writing to externally managed tables.
These deletes are supported when Snowflake performs update, delete, and merge operations on
the table files. This feature is a performance improvement for these operations.

For more information, see [Write support for externally managed Apache Iceberg™ tables](../../../user-guide/tables-iceberg-externally-managed-writes.md) and [Use a catalog-linked database for Apache Iceberg™ tables](../../../user-guide/tables-iceberg-catalog-linked-database.md).

---
title: Sep 22, 2025: Prevent data compaction on Snowflake-managed Apache Iceberg™ tables
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-22-enable-data-compaction-parameter.md
section: Release Notes
---

# Sep 22, 2025: Prevent data compaction on Snowflake-managed Apache Iceberg™ tables

Use the new ENABLE_DATA_COMPACTION parameter to specify whether Snowflake should perform data compaction on Snowflake-managed Apache Iceberg™
tables. Snowflake still performs compaction on these tables by default.

In most cases, compaction doesn’t have a significant impact on table optimization costs, but if it is a concern, you
can disable compaction. For example, you might want to disable it if you ingest a large number of small files for which compaction needs to
rewrite the files.

You can set this parameter at the account, database, schema, and table level.

For more information, see:

* [Set data compaction](../../../user-guide/tables-iceberg-manage.md)
* [ENABLE_DATA_COMPACTION](../../../sql-reference/parameters.md)

---
title: Sep 23, 2025: AI_FILTER Performance Optimization (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-23-ai-filter-optimization.md
section: Release Notes
---

# Sep 23, 2025: AI_FILTER Performance Optimization (*Preview*)

AI_FILTER includes a performance optimization that delivers a 2-10x speedup and reduces token usage by up to 60% for suitable queries using AI functions in SELECT, WHERE, and JOIN … ON clauses.

This optimization is triggered automatically when the query engine detects a suitable pattern. Similar to other query optimizations, Snowflake doesn’t guarantee that this optimization will be applied for every query. The engine leverages adaptive routing and context-aware rewriting to execute more efficient AI operations where possible. This enhancement lets customers run filtering queries faster and at a lower cost, with minimal impact on quality. This results in significant value through both performance gains and savings.

This optimization offers the following key enhancements:

* **Accelerated Performance:** A 2 to 10x speedup on qualifying queries that use AI functions within SELECT, WHERE, and JOIN … ON clauses when optimization is available.
* **Significant Cost Savings:** A token consumption reduction of up to 60%, lowering the cost of running filtering queries and other AI operations with optimization.

For more information, see [AI_FILTER](../../../sql-reference/functions/ai_filter.md).

---
title: Sep 23, 2025: Snowpipe Streaming with high-performance architecture (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md
section: Release Notes
---

# Sep 23, 2025: Snowpipe Streaming with high-performance architecture (*General availability*)

The high-performance architecture for Snowpipe Streaming is now generally available for all accounts on AWS. This new architecture is designed from the ground up for large-scale, real-time data ingestion with high throughput and low latency.

## Key features and benefits

**High performance and low latency:** Ingest data at up to 10 GB per second per table, with ingest-to-query latencies typically under 10 seconds.

**Multi-language client support:** Use the new, high-performance Java and Python SDKs, built on a shared Rust core for efficiency. A REST API is also available for lightweight and serverless ingestion workloads.

**Simplified data pipelines:** Centralize your ingestion logic using the PIPE object. The new architecture moves schema validation to the server side and supports in-flight data transformations using familiar COPY command syntax, reducing client-side complexity.

**Improved cost-efficiency:** Benefit from a new throughput-based pricing model that provides predictable costs.

## Key difference from classic architecture

This new architecture requires the use of the new SDKs or REST API. All ingestion configuration, including transformations and schema validation, is now managed through the server-side PIPE object, which acts as the entry point for all streaming data into a table.

## Recommended use cases

This architecture is recommended for consistent, high-volume streaming workloads, such as powering real-time analytics dashboards, log and event analysis, and integrating data from IoT devices.

For more information, see [Snowpipe Streaming: High-Performance Architecture](../../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

---
title: Sep 25, 2025: Cortex AI Functions – AI_TRANSLATE (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-25-ai-translate-updates.md
section: Release Notes
---

# Sep 25, 2025: Cortex AI Functions – AI_TRANSLATE (*General availability*)

The AI_TRANSLATE function, part of the Snowflake Cortex AI Functions, provides industry-leading quality for translations
of call transcripts, product reviews, social media comments, and other text content. This function is now available to
all Snowflake customers. Improvements over the original TRANSLATE (SNOWFLAKE.CORTEX) function include:

* Enhanced multilingual translation quality with industry-leading accuracy in all language directions, not just those that include English.
* Hebrew, Greek, Turkish, Finnish, Arabic, Croatian, Czech, Romanian, and Norwegian are now supported, bringing the total to 23 languages.
* Translation now better recognizes when the text is already in the target language, even when you don’t specify a source language.
* More than 20% fewer input tokens are used for typical sentences, especially for shorter phrases, reducing costs significantly.

For more information, see [AI_TRANSLATE](../../../sql-reference/functions/ai_translate.md).

---
title: Sep 25, 2025: Cost management — Updating budgets more frequently
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-25-budget-refresh-interval.md
section: Release Notes
---

# Sep 25, 2025: Cost management — Updating budgets more frequently

The time period between consumption and a budget receiving information about the consumption is called the budget refresh interval. You can
now set the budget refresh interval to one hour so budgets are updated with consumption data more frequently so you can watch spending
more closely.

Setting the budget refresh interval to one hour increases the compute costs associated with the budget. For more information, see
[Adjusting the budget refresh interval](../../../user-guide/budgets.md).

---
title: Sep 25, 2025: FILE data type (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-25-file-data-type-ga.md
section: Release Notes
---

# Sep 25, 2025: FILE data type (*General availability*)

The FILE data type enables multimodal Cortex AI Functions workflows with unstructured data stored on internal or external stages. FILE
values provide a way to reference files without encapsulating the actual file content. FILE objects let you:

* Store references to files in tables and pass them to Cortex AI Functions like AI_COMPLETE, AI_CLASSIFY, AI_EXTRACT,
  AI_PARSE_DOCUMENT, and AI_TRANSCRIBE for automated multimodal processing workflows.
* Avoid duplicating file data and process it more efficiently by creating ad passing FILE values as references.
* Integrate with existing data architectures by combining DIRECTORY functions with TO_FILE to create FILE references
  for entire collections of files for batch processing.

For more information, see the [FILE data type](../../../sql-reference/data-types-unstructured.md) and the [TO_FILE function](../../../sql-reference/functions/to_file.md).

---
title: Sep 25, 2025: Page filtering for AI_PARSE_DOCUMENT
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-25-ai-parse-document-page-filter.md
section: Release Notes
---

# Sep 25, 2025: Page filtering for AI_PARSE_DOCUMENT

The AI_PARSE_DOCUMENT function now includes page filtering capabilities, allowing you to parse specific pages or ranges within large documents.
You can process only the content you need, improving efficiency and reducing processing costs when working with multi-page documents.

These page filtering capabilities let you:

* Target specific content by specifying exact start and end points within a document.
* Build efficient document classification pipelines by extracting just the first page from multiple documents and using
  AI_CLASSIFY for instant categorization across document collections.
* Optimize batch processing workflows by combining directory scanning with selective page extraction to automatically categorize and process large document repositories based on content from key pages.

For more information, see [Parsing documents with AI_PARSE_DOCUMENT](../../../user-guide/snowflake-cortex/parse-document.md) and [AI_PARSE_DOCUMENT](../../../sql-reference/functions/ai_parse_document.md).

---
title: Sep 25, 2025: Snowflake Data Clean Rooms updates
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-25-dcr.md
section: Release Notes
---

# Sep 25, 2025: Snowflake Data Clean Rooms updates

**Clean Rooms API Version: 10.2**

With this release, Snowflake Data Clean Rooms has fixed a few issues in the clean rooms UI:

* **Welcome modal:** Fixed welcome modal links and UI display issues.
* **Add data step:** When no data is available, users can still refresh the tables list in the Add Data step, in case data has
  recently become available

---
title: Sep 26, 2025: AI_COUNT_TOKENS function (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-26-ai-count-tokens-function.md
section: Release Notes
---

# Sep 26, 2025: AI_COUNT_TOKENS function (*Preview*)

The Snowflake AI_COUNT_TOKENS function helps you size your AI workloads by calculating the total number of input
tokens processed by large language models and task-specific functions, so you can size queries appropriately before
hitting model limits and accurately estimate costs based on input token usage.

Key features of AI_COUNT_TOKENS include:

* **Count tokens:** Calculate the total number of input tokens for any AI Function, including AI_COMPLETE, AI_EMBED, AI_CLASSIFY, and AI_SENTIMENT. AI_COUNT_TOKENS
  also takes the specific model into account for functions that can use different models.
* **Get cost optimization insights:** Get precise input token counts before running operations, helping you optimize prompts and right-size AI spending across your organization.
* **Support complex configurations:** Analyze the impact of advanced features like classification with custom labels,
  descriptions, task definitions, and examples to understand the full token impact of your AI workflows.

For more information, see [AI_COUNT_TOKENS](../../../sql-reference/functions/ai_count_tokens.md).

---
title: Sep 29, 2025: External OAuth support for Snowflake Open Catalog catalog integration (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-29-open-catalog-support-external-oauth.md
section: Release Notes
---

# Sep 29, 2025: External OAuth support for Snowflake Open Catalog catalog integration (*General availability*)

Catalog integrations for Snowflake Open Catalog now support External OAuth. To configure a catalog integration with External OAuth, first
configure External OAuth in Open Catalog, and then use the new `OAUTH_TOKEN_URI` parameter for the integration.

For more information, see:

* [CREATE CATALOG INTEGRATION (Snowflake Open Catalog)](../../../sql-reference/sql/create-catalog-integration-open-catalog.md)
* [Overview of External OAuth in Snowflake Open Catalog](/user-guide/opencatalog/external-oauth-overview) in the
  Open Catalog documentation

---
title: Sep 29, 2025: Using SQL for Cortex Powered Object Descriptions (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-29-sql-object-descriptions.md
section: Release Notes
---

# Sep 29, 2025: Using SQL for Cortex Powered Object Descriptions (*General availability*)

Using SQL for Cortex Powered Object Descriptions is now generally available and is no longer in
[Preview](../../preview-features.md).

You can call a stored procedure, AI_GENERATE_TABLE_DESC, to programmatically generate Cortex Powered Object Descriptions. The Cortex
Powered Object Descriptions feature uses the [Snowflake Cortex COMPLETE function](../../../sql-reference/functions/complete-snowflake-cortex.md)
to generate descriptions for tables, views, and columns.

For more information, see [Using SQL to automatically generate object descriptions](../../../user-guide/sql-cortex-descriptions.md).

---
title: Sep 30, 2025: Cortex Agents integration for Microsoft Teams and Copilot (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-30-cortex-agents-teams-ga.md
section: Release Notes
---

# Sep 30, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)

This update to the Cortex Agents integration, already in preview, enhances Microsoft Teams and Copilot 365 with
multi-agent support, higher-quality responses, multi-turn conversations, and a streamlined setup experience. The
integration lets users interact conversationally with a Cortex Agent within the Teams interface or in Microsoft 365
Copilot, bringing your Snowflake data closer to where users work.

For more information, see [Cortex Agents for Microsoft Teams and Microsoft 365 Copilot](../../../user-guide/snowflake-cortex/cortex-agents-teams-integration.md).

---
title: Sep 30, 2025: Declarative Sharing (Preview)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-30-declarative-sharing.md
section: Release Notes
---

# Sep 30, 2025: Declarative Sharing (*Preview*)

Declarative Sharing allows providers to share and sell data products, enhanced by Snowflake Notebooks to help Snowflake consumers visualize and explore the data.

Declarative Shared Native Apps is in Public Preview.

Declarative Sharing’s simplified development experience makes it easy to get started quickly.
Key features include:

* **Streamlined development**: Providers can define shared objects, including notebooks, using a straightforward YAML file format, with automatic version control.
* **Live notebook development**: You can interactively develop notebooks, edit notebook content and share it seamlessly.
* **Live environment development**: Declarative shared applications can be developed in a live environment, which streamlines the development and testing phases.
* **Controlled data visibility**: Application roles enable providers to categorize data, giving consumers easy control over data visibility.
* **Consumer-managed resources**: The application runs in the consumer’s account, allowing them to manage resource usage and costs.
* **Secure execution**: Declaratively shared applications operate within a tightly controlled environment, ensuring strict limitations on their actions and data access.

For more information, see [About Declarative Sharing in the Native Application Framework](../../../developer-guide/declarative-sharing/about.md).

---
title: Sep 30, 2025: GRANT OWNERSHIP ON NOTEBOOK (General availability)
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-30-grant-ownership-on-notebook.md
section: Release Notes
---

# Sep 30, 2025: GRANT OWNERSHIP ON NOTEBOOK (*General availability*)

GRANT OWNERSHIP ON NOTEBOOK is now generally available. This change is being rolled out gradually to all accounts.

You can transfer ownership of a Snowflake notebook in a schema from one role to another role.

For more information, see [GRANT OWNERSHIP](../../../sql-reference/sql/grant-ownership.md).

---
title: Sep 30, 2025: Support for derived metrics in semantic views
source: https://docs.snowflake.com/en/release-notes/2025/other/2025-09-30-semantic-view-derived-metrics.md
section: Release Notes
---

# Sep 30, 2025: Support for derived metrics in semantic views

In a [semantic view](../../../user-guide/views-semantic/overview.md), if you want to define a metric based on metrics from different
logical tables, you can define a *derived metric*. A derived metric is a metric that is scoped to the semantic view (rather than
to a specific logical table). A derived metric can combine metrics from multiple logical tables.

For more information, see [Defining derived metrics](../../../user-guide/views-semantic/sql.md).

---
title: September 01, 2024 — New Snowflake region
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-01-new-region.md
section: Release Notes
---

# September 01, 2024 — New Snowflake region

## China (Ningxia) region - *General Availability*

We are pleased to announce the general availability of the following region, which was previously introduced as a preview
in June 2024:

| Cloud platform | Region | Cloud region ID | Notes |
| --- | --- | --- | --- |
| Amazon Web Services (AWS) | China (Ningxia) | cn-northwest-1 | The China region is a separate region operated by Digital China Cloud Technology Limited (DCC), an authorized operating partner of Snowflake, Inc. |

The China region supports all [Snowflake editions](../../../user-guide/intro-editions.md). To create an initial account in the region, please contact
[DCC](mailto:snowflake.hosting%40dcclouds.com).

For more information about the region, see [Asia Pacific and China](../../../user-guide/intro-regions.md) (in [Supported cloud regions](../../../user-guide/intro-regions.md)) or visit the
[Snowflake China website](https://snowflake.cn).

> **Note:**
>
> Customers with existing Snowflake accounts are not able to access resources in the China region, and vice versa. Some features may
> not be available in the China region.

---
title: September 03-05, 2024 — 8.33 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_33.md
section: Release Notes
---

# September 03-05, 2024 — 8.33 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## New features

### Snowflake REST APIs — *Preview*

With this release, we are pleased to announce the preview of Snowflake REST APIs.

Snowflake REST APIs for resource management provide a set of endpoints that lets users programmatically interact with and control various resources within the Snowflake Data Cloud. In this preview, Snowflake REST APIs supports APIs for the following resources:

* Compute Pool
* Cortex Analyst
* Cortex Inference
* Cortex Search Service
* Database
* Dynamic Table
* Function
* Grant
* Image Repository
* Role
* Schema
* Service
* Stage
* Table
* Task
* User
* Warehouse

For more details, see [Snowflake REST APIs](../../developer-guide/snowflake-rest-api/snowflake-rest-api.md).

## SQL updates

### SHOW commands: Support for new WITH PRIVILEGES parameter

With this release, we are pleased to announce support for the WITH PRIVILEGES parameter for the following SHOW commands:

* SHOW DATABASES
* SHOW SCHEMAS
* SHOW WAREHOUSES

The WITH PRIVILEGES parameter lets you limit results to databases, schemas, and warehouses for which the role executing the
statement has been granted the privileges specified in the list. For example, you can list all the databases you have been
granted the APPLYBUDGET privilege on before adding a database to a custom budget.

For more information, see [SHOW DATABASES](../../sql-reference/sql/show-databases.md), [SHOW SCHEMAS](../../sql-reference/sql/show-schemas.md), or [SHOW WAREHOUSES](../../sql-reference/sql/show-warehouses.md).

## Data lake updates

### Apache Iceberg™ tables: Catalog integration for Iceberg REST — *Preview*

With this release, we are pleased to announce preview support for connecting Snowflake to Apache Iceberg™ tables managed in a remote
catalog that complies with the open source Apache Iceberg REST OpenAPI specification.

For more information, see [Configure a catalog integration for Apache Iceberg™ REST catalogs](../../user-guide/tables-iceberg-configure-catalog-integration-rest.md).

### Iceberg tables: Delta table support — *Preview*

With this release, we are pleased to announce preview support for creating read-only Iceberg tables from Delta table files in
object storage.

For more information, see [CREATE ICEBERG TABLE (Delta files in object storage)](../../sql-reference/sql/create-iceberg-table-delta.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 30-Aug-24 |
| *Iceberg tables: Automated refresh* | **Removed** | 03-Sep-24 |
| *Snowflake REST APIs — Preview* | **Added** | 06-Sep-24 |

---
title: September 04, 2024 — Calling stored procedures in the FROM clause of SELECT statements
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-04-call-stored-procedure-in-from-clause.md
section: Release Notes
---

# September 04, 2024 — Calling stored procedures in the FROM clause of SELECT statements

You can now call a stored procedure that returns tabular data directly in the FROM clause
of a SELECT statement. You can use this technique to simplify the SQL statements for saving
results to a table. For example, rather than using the
[SQLID](../../../developer-guide/snowflake-scripting/query-id.md) Snowflake Scripting variable with the
[RESULT_SCAN](../../../sql-reference/functions/result_scan.md) function to create a table containing these results,
you can use a query that directly selects from the results.

When calling the stored procedure, omit the [CALL](../../../sql-reference/sql/call.md) command. Instead, put the call
in parentheses, preceded by the TABLE keyword.

For more information, see [Selecting from a stored procedure](../../../developer-guide/stored-procedure/stored-procedures-selecting-from.md).

---
title: September 04, 2024 — Easier Training of Anomaly Detection Models from Real-World Data
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-04-anomaly-detection-preprocessing.md
section: Release Notes
---

# September 04, 2024 — Easier Training of Anomaly Detection Models from Real-World Data

We are pleased to announce that the Anomaly Detection ML Function now includes preprocessing features that allow
you to successfully train an anomaly detection model even when your training data has missing, duplicate, or misaligned time
steps. In the past, such issues, which are common in real-world data, typically prevented the model from being trained.
The new preprocessing features let you:

* Manually specify an event cadence in case the model fails to infer it or infers it incorrectly.
* Automatically interpolate missing target values from nearby time steps.
* Aggregate dimensional values from events occurring outside the canonical event cadence. You can specify aggregation
  behaviors for the type of value or per column, or use defaults.

A relatively small number of such corrections does not noticeably affect detection accuracy.

For more information, see [Dealing with real-world data in Time-Series Forecasting](../../../user-guide/ml-functions/preprocessing.md).

---
title: September 05-06, 2023 – 7.31 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_31.md
section: Release Notes
---

# September 05-06, 2023 – 7.31 Release Notes

## New Features

### New Information Schema Views for Class Instances

With this release, we are pleased to announce 3 new Information Schema views related to [class](../../sql-reference/snowflake-db-classes.md)
instances:

* [CLASS_INSTANCES view](../../sql-reference/info-schema/class_instances.md)
* [CLASS_INSTANCE_FUNCTIONS view](../../sql-reference/info-schema/class_instance_functions.md)
* [CLASS_INSTANCE_PROCEDURES view](../../sql-reference/info-schema/class_instance_procedures.md)

### External Network Access — *Preview*

With this release, we are pleased to announce the preview of access to external network locations from procedure and UDF handler code. Preview support for this feature is available to accounts on AWS except the Gov region.

With an external access integration, you can:

* Write UDF and procedure handlers that access external locations.
* Allow or block access to locations on a network external to Snowflake.
* Use secrets that represent stored credentials, rather than using literal values, within handler code to authenticate with external network locations.
* Specify which secrets are allowed for use with external network locations.

For more information about external network access, see [Overview of external network access](../../developer-guide/external-network-access/external-network-access-overview.md).

---
title: September 09, 2024 — New AI21 model available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-09-jamba-mini-model.md
section: Release Notes
---

# September 09, 2024 — New AI21 model available in Snowflake Cortex AI

We’re pleased to announce that AI21’s model, `jamba-1.5-mini`, is now available for serverless inference in
[Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/).

The AI21 Jamba 1.5 family of models is a state-of-the-art, hybrid SSM-Transformer instruction following foundation models. The `jamba-1.5-mini` with a context length of 256K supports use cases such as structured output (JSON), and grounded generation.

For details, see [Snowflake Cortex LLM Functions](../../../user-guide/snowflake-cortex/aisql.md).

---
title: September 09-11, 2024 — 8.34 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_34.md
section: Release Notes
---

# September 09-11, 2024 — 8.34 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Data loading/unloading updates

### The vectorized scanner option supports client-side encryption

With this release, we are pleased to announce that the Parquet file format option, `USE_VECTORIZED_SCANNER`, now supports client-side encryption. To use the `USE_VECTORIZED_SCANNER` option, you don’t need to configure the stage to use only server-side encryption anymore.

For more information, see [USE_VECTORIZED_SCANNER](../../sql-reference/sql/copy-into-table.md).

## Data pipeline updates

### Dynamic tables: New DYNAMIC_TABLE_REFRESH_HISTORY account usage view

The new DYNAMIC_TABLE_REFRESH_HISTORY view in the ACCOUNT_USAGE schema provides information about your dynamic tables’
refresh history up to a year.

For more information, see [DYNAMIC_TABLE_REFRESH_HISTORY view](../../sql-reference/account-usage/dynamic_table_refresh_history.md).

### Tasks: Python and JVM support for serverless tasks - *Preview*

With this release, we are pleased to announce the preview of Python and JVMsupport for serverless tasks. Serverless tasks can
now invoke the following object types and functions: UDFs (user-defined functions) and stored procedures written in Python,
Java, and Scala.

For more information, see [Python and Java support for serverless tasks](../../user-guide/tasks-python-jvm.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 06-Sep-24 |
| *Dynamic tables: New DYNAMIC_TABLE_REFRESH_HISTORY account usage view* | **Added** to *Data pipeline updates* section | 13-Sep-24 |
| *The vectorized scanner option supports client-side encryption* | **Added** to *Data loading/unloading updates* section | 18-Sep-24 |

---
title: September 11-12, 2023 — 7.32 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_32.md
section: Release Notes
---

# September 11-12, 2023 — 7.32 Release Notes

## SQL Updates

### New Function: IS_DATABASE_ROLE_IN_SESSION

The following function is available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Context | IS_DATABASE_ROLE_IN_SESSION | Verifies whether the database role is in the user’s active primary or secondary role hierarchy for the current session or if the specified column contains a database role that is in the user’s active primary or secondary role hierarchy for the current session.  For more information, see Share data protected by a role-based policy — Preview. |

## Data Loading / Unloading Updates

### Replicating streams on Snowflake tables populated by Snowpipe Streaming

With this release, we are pleased to announce that Snowflake supports replicating streams on Snowflake tables populated by Snowpipe
Streaming. For more information, see [Replication and Snowpipe Streaming](../../user-guide/account-replication-considerations.md).

### Snowpipe Streaming authentication updates

With the 2.0.3 release of the Snowpipe Streaming SDK, we are pleased to announce that Snowpipe Streaming supports OAuth authentication and
the `role` property for Snowpipe Streaming is now optional. When configuring Snowpipe Streaming, you can set the `authorization_type`
to `OAuth` and create an OAuth integration with the custom client.

For more information, see [Configure Snowflake OAuth for custom clients](../../user-guide/oauth-custom.md) and [Configurations and examples for Snowpipe Streaming classic architecture](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-configuration.md).

### New options for INFER_SCHEMA

With this release, we are pleased to announce that the INFER_SCHEMA table function supports two new options, MAX_FILE_COUNT and
MAX_RECORDS_PER_FILE. You can use MAX_FILE_COUNT to specify the maximum number of files scanned from stage. You can use MAX_RECORDS_PER_FILE
to specify the maximum number of records scanned per file.

For more information, see [INFER_SCHEMA](../../sql-reference/functions/infer_schema.md).

## Data Governance Updates

### Row access policies: Reference a protected mapping table in a row access policy — *Preview*

With this release, Snowflake is pleased to announce that policy administrators can reference a mapping table that is protected by a row
access policy in the policy conditions of a different row access policy. The result is more assurance to compliance officers when a user
queries a table protected by a row access policy.

For more information, see [Example: Protect the mapping table with a row access policy](../../user-guide/security-row-using.md).

### Share data protected by a role-based policy — *Preview*

With this release, Snowflake is pleased to announce the preview to enable a data sharing provider to use the IS_DATABASE_ROLE_IN_SESSION
function in the conditions of a masking policy or a row access policy to allow a data sharing consumer to access shared data that is
protected by either of these policies. The function argument takes either the name of a database role or a column that contains database
roles. This provides more options to the provider to share data, allows the consumer to access sensitive data that the provider makes
available, and removes restrictions on policy-protected data when the consumer queries a shared table protected by a policy.

For details, see [Share data protected by a policy](../../user-guide/data-sharing-policy-protected-data.md) and [IS_DATABASE_ROLE_IN_SESSION](../../sql-reference/functions/is_database_role_in_session.md).

## Web Interface Updates

### Managing data governance in Snowsight — *Generally Available*

With this release, we are pleased to announce the general availability of the Data Governance interface in Snowsight. The
Governance interface includes a Dashboard tab to monitor the most frequently used masking policies, row access policies,
and tags with their usage on tables and columns. The Governance interface also includes a Tagged Objects tab to report
on the Dashboard data, with the option to manually report on the usage of tags and policies on tables and columns.

When you select an element in the Dashboard, Snowsight automatically updates the Tagged Objects tab filters. Additionally,
when you select a row in the Tagged Objects tab, Snowsight automatically redirects you to the object or column in the
Data » Databases interface. You can then manage the policy and tag assignments as needed.

For more information, see:

* [Use Snowsight to set tags](../../user-guide/object-tagging/work.md)
* [Monitor tags with Snowsight](../../user-guide/object-tagging/monitor.md)
* [Monitor masking policies with Snowsight](../../user-guide/security-column-intro.md)
* [Monitor row access policies with Snowsight](../../user-guide/security-row-intro.md)

---
title: September 12, 2024 — New Cortex LLM Function - CLASSIFY_TEXT — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-12-classify-text-function.md
section: Release Notes
---

# September 12, 2024 — New Cortex LLM Function - CLASSIFY_TEXT — *Preview*

With this release, we are pleased to announce the preview of a new Snowflake Cortex LLM function, CLASSIFY_TEXT. This new Cortex LLM
task-specific function gives you an easy way to label text records into categories that are relevant for your business.

The new CLASSIFY_TEXT function is managed by the Snowflake AI team to deliver state-of-the art text
classification accuracy comparable to the most powerful models currently in the market. You can use CLASSIFY_TEXT to easily label
text records such as emails, call transcripts, and product reviews for different categories that are relevant for your business.
For example, you can label support tickets to automatically assign the support team based on the issue category. To easily
integrate the results of CLASSIFY_TEXT into your data pipeline, the outputs generated are structured in JSON, without any additional
post processing needed.

With the CLASSIFY_TEXT function, you can quickly and accurately label text records without the need for any prompt
engineering, post processing or providing any examples.

For more information, see [CLASSIFY_TEXT (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/classify_text-snowflake-cortex.md).

---
title: September 12, 2024 — New multilingual embedding model available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-12-voyage-embed-model.md
section: Release Notes
---

# September 12, 2024 — New multilingual embedding model available in Snowflake Cortex AI

We’re pleased to announce that the text embedding functions in [Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/)
now support the following additional multilingual model:

* `voyage-multilingual-2`

For additional details, see [EMBED_TEXT_1024 (SNOWFLAKE.CORTEX)](../../../sql-reference/functions/embed_text_1024-snowflake-cortex.md).

---
title: September 12, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-12-dcr.md
section: Release Notes
---

# September 12, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Integration with Yahoo DSP

When an analysis provides the Activation Hub, consumers can now activate the results of the analysis to their Yahoo DSP account. This lets
consumers buy against audiences they are generating within the clean room through Yahoo DSP.

For more information, see the following:

* If you are an administrator who is configuring the connector so clean room users can activate to Yahoo DSP, see
  [Yahoo DSP connector](../../../user-guide/cleanrooms/connector-activation.md).
* If you are a clean room user who is using the Activation Hub after running an analysis, see
  [Yahoo DSP connector](../../../user-guide/cleanrooms/connector-activation.md).

## Integration with Google PAIR and Google DV 360

Publishers and advertisers can leverage the Google PAIR protocol so the advertiser can run an audience overlap analysis on encrypted
identifiers, then push the results to their Google DV 360 account for activation without ever exposing unencrypted sensitive data.

For more information, see [Google Display & Video 360-PAIR connector](../../../user-guide/cleanrooms/connector-activation.md).

---
title: September 18-19, 2023 — 7.33 Release Notes
source: https://docs.snowflake.com/en/release-notes/2023/7_33.md
section: Release Notes
---

# September 18-19, 2023 — 7.33 Release Notes

## New Features

### Network Rules — *Preview*

With this release, we are pleased to announce the preview of network rules, which group related network identifiers into logical units.
When a Snowflake feature needs to restrict network traffic based on the origin or destination of a request, it can allow or block a
network rule that contains the identifiers that should be permitted or denied.

Network rules make possible the following preview features:

* [Enhanced network security](../../user-guide/network-policies.md)
* [External network access overview](../../developer-guide/external-network-access/external-network-access-overview.md)

For general information about network rules, see [Network rules](../../user-guide/network-rules.md).

### Enhanced Network Security — *Preview*

With this release, we are pleased to announce the preview of enhanced security when using network policies to restrict access to
Snowflake. When combined with network rules, network policies can now:

* Restrict access to the internal stage of a Snowflake account on AWS.
* Restrict access based on the identifier of an AWS S3 endpoint.

For more information about using network rules with a network policy, see [About network rules](../../user-guide/network-policies.md).

### Network Isolation to Internal Stages Using AWS PrivateLink — *Preview*

With this release, we are pleased to announce the preview of the ability to isolate network traffic to Snowflake internal stages when
connecting to them over AWS PrivateLink for Amazon S3. Snowflake recommends this approach for organizations that use AWS PrivateLink
to access the internal stages of multiple Snowflake accounts.

In this approach, an AWS administrator creates multiple S3 interface endpoints, one for each internal stage. Then the Snowflake
administrator uses the new [S3_STAGE_VPCE_DNS_NAME](../../sql-reference/parameters.md) parameter to associate an internal stage with its dedicated S3
interface endpoint.

Benefits of isolating private connectivity traffic include simplified DNS management, the ability to chargeback costs to a specific
Snowflake account, and the ability to implement different security requirements for each Snowflake account.

For more details, see [Accessing Internal stages with dedicated interface endpoints](../../user-guide/private-internal-stages-aws.md).

## Data Loading Updates

### Cross-platform Support for Snowpipe Auto-Ingest — *General Availability*

With this release, we are pleased to announce the general availability of the cross-platform support for Snowpipe auto-ingest. Triggering
automated Snowpipe data loads using S3 event messages, GCS Pub/Sub event messages, and Azure Event Grid messages are now supported by
Snowflake accounts hosted on [any supported cloud platforms](../../user-guide/intro-cloud-platforms.md).

For more information, see [Automate continuous data loading with cloud messaging](../../user-guide/data-load-snowpipe-auto.md).

### Amazon EventBridge Support for Snowpipe Auto-Ingest — *General Availability*

With this release, we are pleased to announce the general availability of Amazon EventBridge support for Snowpipe auto-ingest. You can
set up Amazon EventBridge for Snowpipe auto-ingest by following the steps in
[Automating Snowpipe for Amazon S3 with SNS](../../user-guide/data-load-snowpipe-auto-s3.md).

## Data Governance Updates

### Tag-based Masking Policy: Support for Database & Schema — *General Availability*

With this release, we are pleased to announce the general availability of setting a tag-based masking policy on a database and schema.
This update enables data engineers to protect all columns in a schema or database when the data type of the column matches the data type
of the policy set on the tag. Additionally, a new column is protected when its data type matches the data type of the policy set on the
tag. Setting the tag-based masking policy on the database or schema simplifies data protection management because you can set the
tag-based policy once and not have to set a masking policy on every column in the database or schema.

For more information, see [Tag-based masking policies](../../user-guide/tag-based-masking-policies.md).

### Shared Tag References — *Preview*

With this release, Snowflake is pleased to announce the preview to allow users in the data sharing consumer account to view shared tags
and the tag references on shared objects when the tags and shared objects are in the same database. This update helps the consumer to
understand the data sensitivity of a shared object, such as a table containing PII data when the PII tag is set on a table and its columns.

To enable the consumer to view the tag references of shared objects, the provider grants the READ privilege on the tag to a shared
database role or directly to the share. The consumer can use SQL to identify the shared tags and the tag references on shared objects.

For details, see [Shared tag references](../../user-guide/data-sharing-provider.md).

---
title: September 18-20, 2024 — 8.35 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_35.md
section: Release Notes
---

# September 18-20, 2024 — 8.35 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### RANGE BETWEEN support for FIRST_VALUE and LAST_VALUE functions

With this release, we are pleased to announce the availability of RANGE BETWEEN window frames with explicit offsets for two
more functions: [FIRST_VALUE](../../sql-reference/functions/first_value.md) and [LAST_VALUE](../../sql-reference/functions/last_value.md).

For more information, see [Window function syntax and usage](../../sql-reference/functions-window-syntax.md) and [Range-based versus row-based window frames](../../user-guide/functions-window-using.md).

## Extensibility updates

### Telemetry data to the event table from Snowflake Notebook cells temporarily disabled

As of this release, Snowflake Notebook cells can no longer emit logs, spans, or span events to event tables. This
integration has been disabled only temporarily and will be re-enabled in a future release. Any logs or traces emitted from
other objects called from Notebooks, such as stored procedures and UDFs, will continue to emit telemetry data into your
account’s event table.

If your Snowflake account was previously emitting logs or traces to the Event Table, the integration was not disabled for
your Snowflake account to prevent disruption. However, you should be aware of the following known issues:

* Notebooks will not start if the log level is DEBUG for your Notebooks (or the account, database, or schema that the
  Notebook is in). Set the log level to INFO or higher.
* When executing SQL cells, you may see additional, unexpected spans from the `snowflake-snowpark-python` library,
  specifically `DataFrame.collect` and `DataFrame.count`. These are emitted from the internals of Notebooks executing
  your SQL statements. You can remove these by setting your TRACE_LEVEL parameter to `ON_EVENT` or `OFF`.

### pandas on Snowflake - *General Availability*

With this release, we are pleased to announce the availability of pandas on Snowflake. pandas on Snowflake lets you run your
pandas code in a distributed manner directly on your data in Snowflake. Just by changing the import statement and a few
lines of code, you can get the familiar pandas experience you know and love with the scalability and security benefits of
Snowflake. With pandas on Snowflake, you can work with much larger datasets and avoid the time and expense of porting your
pandas pipelines to other big data frameworks or provisioning large and expensive machines. It runs workloads natively in
Snowflake through transpilation to SQL, enabling it to take advantage of parallelization and the data governance and
security benefits of Snowflake. pandas on Snowflake is delivered through the Snowpark pandas API as part of the Snowpark
Python library, which enables scalable data processing of Python code within the Snowflake platform.

For more information, see [pandas on Snowflake](../../developer-guide/snowpark/python/pandas-on-snowflake.md)

## Data lake updates

### Apache Iceberg™ tables: Automated refresh — *Preview*

With this release, we are pleased to announce preview support for automated metadata refreshes for Apache Iceberg™ tables that use
an external catalog. With automated refreshes, Snowflake polls your external Iceberg catalog in a continuous and serverless
fashion to synchronize the metadata with the most recent remote changes.

For more information, see [Automatically refresh Apache Iceberg™ tables](../../user-guide/tables-iceberg-auto-refresh.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 13-Sep-24 |
| *Telemetry data to the event table from Snowflake Notebook cells temporarily disabled* | **Removed** statement to contact Snowflake Support to re-enable telemetry on notebook cells. | 07-May-25 |

---
title: September 23-26, 2024 — 8.36 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_36.md
section: Release Notes
---

# September 23-26, 2024 — 8.36 Release Notes

> **Attention:**
>
> The release is complete.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## Data lake updates

### Cloning support for Snowflake-managed Apache Iceberg™ tables — *Preview*

With this release, a preview of support for cloning Snowflake-managed Iceberg tables is available.

For more information, see [CREATE <object> … CLONE](../../sql-reference/sql/create-clone.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 20-Sep-24 |

---
title: September 24, 2024 — DOCUMENT_AI_USAGE_HISTORY view — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-24-document-ai.md
section: Release Notes
---

# September 24, 2024 — DOCUMENT_AI_USAGE_HISTORY view — *General Availability*

With this release, we are pleased to announce the general availability of the DOCUMENT_AI_USAGE_HISTORY view in the
Account Usage schema, giving
you the ability to query the usage history for Document AI.

For more information, see [DOCUMENT_AI_USAGE_HISTORY view](../../../sql-reference/account-usage/document_ai_usage_history.md).

---
title: September 25, 2024 — New models available in Snowflake Cortex AI
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-25-new-cortex-models.md
section: Release Notes
---

# September 25, 2024 — New models available in Snowflake Cortex AI

We’re pleased to announce that the Cortex LLM COMPLETE function in [Snowflake Cortex AI](https://www.snowflake.com/en/data-cloud/cortex/)
now support the following additional models:

* `jamba-1.5-large`
* `llama3.2-1b`
* `llama3.2-3b`

For additional details, see [Snowflake Cortex AI Functions (including LLM functions)](../../../user-guide/snowflake-cortex/aisql.md).

---
title: September 25, 2024 — Snowflake Feature Store — General Availability
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-25-feature-store-ga.md
section: Release Notes
---

# September 25, 2024 — Snowflake Feature Store — *General Availability*

We are pleased to announce that the Snowflake Feature Store is now Generally Available in all regions. The Snowflake
Feature Store lets data scientists and ML engineers create, maintain, and use ML features in data science and ML
workloads, all within Snowflake, by standardizing commonly used feature transformations in a central repository.
For more information, see [Snowflake Feature Store](../../../developer-guide/snowflake-ml/feature-store/overview.md).

---
title: September 26, 2024 — Snowflake Data Clean Rooms Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-26-dcr.md
section: Release Notes
---

# September 26, 2024 — Snowflake Data Clean Rooms Release Notes

With this release, we are pleased to announce the availability of the following new features and enhancements in this update to Snowflake
Data Clean Rooms.

## Branded clean room tiles

Providers can now brand their clean rooms with a logo and company name by configuring the profile of their clean room environment. After the
profile is updated, collaborators see the provider’s logo and name on clean room tiles on the Joined and Invited tabs.

For more information, see [Brand your clean rooms](../../../user-guide/cleanrooms/admin-tasks.md).

## Consumer direct activation

Consumers can now push results from an analysis directly to their Snowflake account, allowing them to access row-level data after running
templates for overlap and data enrichment use cases. A provider can configure their clean room environment to prevent this consumer direct
activation.

For more information, see [Activating query results](../../../user-guide/cleanrooms/activation.md).

## Activation Hub column policies

Users can now specify which columns should be used as ID columns for their activation, which might be different from the join columns used
while running an analysis.

## Schedule analyses as a consumer

Consumers can configure an analysis to run on an hourly, daily, weekly, or monthly schedule, which lets them keep their data up-to-date
without manually rerunning an analysis. Scheduled analyses run as a background process. The ability to schedule an analysis is available for
the Audience Overlap & Segmentation template, SQL Query template, and custom templates created with the developer APIs.

For more information, see [Scheduling a repeating analysis in the clean rooms UI](../../../user-guide/cleanrooms/v1/schedule-analysis.md).

## Clean room data stats

Users can view data stats for their own tables within the clean room. These stats provide distinct counts for their join policy columns
across the top five values for all other columns. In cases where non-join columns have more than 20 distinct values, no distinct counts are
shown.

For more information, see [View details about a clean room](../../../user-guide/cleanrooms/v1/web-app-working.md).

## LiveRamp activation

Consumers can now activate their respective RampID back to their LiveRamp account using Snowflake shares or SFTP upload. Users can then
leverage their LiveRamp Connect account to push these segments downstream to additional destinations supported by LiveRamp.

For more information, see one of the following:

* If you are an administrator who needs to configure the LiveRamp connector, see [LiveRamp connector](../../../user-guide/cleanrooms/connector-activation.md).
* If you are a clean room user who wants to activate data to LiveRamp, see [the LiveRamp documentation](/user-guide/cleanrooms/connector-activation).

## The Trade Desk CRM activation

Users can now activate first-party PII information back to their The Trade Desk account. This allows users to integrate their CRM data to
create data segments for audience targeting and conversion measurement on The Trade Desk.

For more information, see one of the following:

* If you are an administrator who needs to configure the The Trade Desk connector, see [The Trade Desk - CRM connector](../../../user-guide/cleanrooms/connector-activation.md).
* If you are a clean room user who wants to activate data to The Trade Desk, see [The Trade Desk - UID 2.0 connector](../../../user-guide/cleanrooms/connector-activation.md).

## Managed account credit limit and monitoring

Collaborators who are using a clean room managed account can now set a monthly limit on how many Snowflake credits can be consumed by clean
room activity. Users cannot use the web app to access the clean room environment when credit consumption is within 10 credits of the limit.
Administrators can also monitor how many credits have been consumed for the month.

Snowflake does not start tracking credit consumption until a user logs into this release of the web app.

For more information, see [Monitor and manage the cost of your managed account](../../../user-guide/cleanrooms/managed-accounts.md).

## Audience Overlap, SQL Query and Custom template update

When a user runs an analysis using an Audience Overlap, SQL Query, or custom template, they are prompted to save the analysis and given the
option to schedule future runs of the analysis. Users can continue to use the application because the analysis runs in the background.

---
title: September 26, 2024 — Snowpark-optimized Warehouse RESOURCE_CONSTRAINT — Preview
source: https://docs.snowflake.com/en/release-notes/2024/other/2024-09-26-sow-resource-constraints.md
section: Release Notes
---

# September 26, 2024 — Snowpark-optimized Warehouse RESOURCE_CONSTRAINT — *Preview*

Snowflake is pleased to announce the preview of Snowpark-optimized Warehouse resource constraint support.

Snowpark Optimized Warehouse RESOURCE_CONSTRAINTs allow you to specify the memory and CPU architecture for Snowpark-optimized warehouses.

You can now specify which memory and CPU architecture combination for your Snowpark-optimized warehouse using the [CREATE](../../../sql-reference/sql/create-warehouse.md)
or [ALTER](../../../sql-reference/sql/alter-warehouse.md) warehouse commands.

For additional details, including examples, see [Snowpark-optimized warehouses](../../../user-guide/warehouses-snowpark-optimized.md).

---
title: September 27-29, 2023 — 7.34 Release Notes (with behavior changes)
source: https://docs.snowflake.com/en/release-notes/2023/7_34.md
section: Release Notes
---

# September 27-29, 2023 — 7.34 Release Notes (with behavior changes)

## Behavior Change Bundles

This release contains the following behavior change bundles:

| Bundle Name | Status in this Release | Previous Status |
| --- | --- | --- |
| [2023_07](../bcr-bundles/2023_07_bundle.md) | Disabled by default; admins can enable for testing | N/A (introduced in this release) |
| [2023_06](../bcr-bundles/2023_06_bundle.md) | Enabled by default; admins can disable for opt-out | Disabled by default |
| [2023_05](../bcr-bundles/2023_05_bundle.md) | Generally enabled; admins can no longer enable/disable | Enabled by default |

The status for each bundle will change in the next behavior change release, planned for October; however, this schedule is subject to
change.

For more information about bundle statuses and how they may impact your accounts, see [About Behavior Changes](../intro-bcr-releases.md).

## New Features

### Snowflake Alerts — *General Availability*

With this release, we are pleased to announce the general availability of Snowflake Alerts. A Snowflake Alert is a schema-level
object that you can use to send a notification or perform an action when data in Snowflake meets certain conditions.

For example, you can set up a Snowflake Alert to send a notification or perform an action when:

* The warehouse credit usage increases by a specified percentage of your current quota.
* The resource consumption for your pipelines, tasks, materialized views, etc. increases beyond a specified amount.
* Your data fails to comply with a particular business rule that you have set up.

For more information, see [Setting up alerts based on data in Snowflake](../../user-guide/alerts.md).

## SQL Updates

### New SQL Functions

The following function(s) are now available with this release:

| Function Category | New Function | Description |
| --- | --- | --- |
| Semi-Structured Data Functions (Array/Object) | [ARRAY_FLATTEN](../../sql-reference/functions/array_flatten.md) | Flattens an [ARRAY](../../sql-reference/data-types-semistructured.md) of ARRAYs into a single ARRAY. |

## Web Interface Updates

### Allow Snowflake On Demand customers to purchase listings

With this release, we are pleased to announce that Snowflake On Demand customers can now purchase listing.
For pricing information please see [Snowflake Pricing](https://www.snowflake.com/pricing/).

Customers with trial accounts will still need to convert to paid accounts to purchase listings.

For a list of supported countries, see [Where paid listings are available to consumers](../../collaboration/consumer-listings-paying.md).

---
title: September 30 - October 03, 2024 — 8.37 Release Notes
source: https://docs.snowflake.com/en/release-notes/2024/8_37.md
section: Release Notes
---

# September 30 - October 03, 2024 — 8.37 Release Notes

> **Attention:**
>
> The release has completed.
>
> For differences between the in-advance and final versions of these release notes, see Release notes change log.

## SQL updates

### New SQL functions

The following function(s) are now available with this release:

| Function category | New function | Description |
| --- | --- | --- |
| Semi-structured and structured data | [REDUCE](../../sql-reference/functions/reduce.md) | Reduces an array to a single value based on the logic in a lambda expression. |
| Geospatial | [ST_INTERPOLATE](../../sql-reference/functions/st_interpolate.md) | Given an input GEOGRAPHY object, returns an interpolated object that is within a specified tolerance.  You can call this function when you need to see how GEOGRAPHY objects look like in the planar coordinate system (for example, when using visualization tools for geospatial data). |

## Extensibility updates

### Authentication with AWS IAM from procedures and functions — *Preview*

With this release, we are pleased to announce preview support for authenticating with AWS services from a procedure or functions using
[Snowpark External Access](../../developer-guide/external-network-access/external-network-access-overview.md) via Identity and Access Management
(IAM).

For more information, see [Accessing Amazon S3 with AWS IAM](../../developer-guide/external-network-access/external-network-access-examples.md).

## Release notes change log

| Announcement | Update | Date |
| --- | --- | --- |
| Release notes | Initial publication (preview) | 27-Sep-24 |

---
title: Sequences and columns: Changes to SHOW command, view, and GET_DDL function output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1225.md
section: Release Notes
---

# Sequences and columns: Changes to SHOW command, view, and GET_DDL function output

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, Snowflake has introduced a new ORDER and NOORDER parameter for sequences and table columns:

* ORDER specifies that the values generated for a sequence or auto-incremented column are in
  [increasing order](../../../user-guide/querying-sequences.md).
* NOORDER specifies that the values are not guaranteed to be in increasing order.

These new parameters appear in the output of the commands, functions, and views.

The output of the following commands and views includes this new ordered column:

* The [SHOW SEQUENCES](../../../sql-reference/sql/show-sequences.md) command
* The [DESCRIBE SEQUENCE](../../../sql-reference/sql/desc-sequence.md) command
* The Information Schema [SEQUENCES](../../../sql-reference/info-schema/sequences.md) view
* The Account Usage [SEQUENCES](../../../sql-reference/account-usage/sequences.md) view

| Column Name | Data Type | Description |
| --- | --- | --- |
| `ordered` | TEXT | Specifies whether or not the values are generated in increasing order.   * For the SHOW SEQUENCES and DESCRIBE SEQUENCE commands, the column contains:    + `Y` (if the sequence has the ORDER parameter)   + `N` (if the sequence has the NOORDER parameter). * For the Information Schema and Account Usage SEQUENCES views, the column contains:    + `YES` (if the sequence has the ORDER parameter)   + `NO` (if the sequence has the NOORDER parameter). |

In the output of the SHOW COLUMNS command, the `autoincrement` column includes the ORDER or NOORDER parameter:

Previously:
:   If the column auto-increments by 1 with the starting value of 1, the `autoincrement` column contains:

    ```output
    start 1 increment 1
    ```

Currently:
:   If the column has the ORDER parameter set, the `autoincrement` column contains:

    ```output
    start 1 increment 1 order
    ```

    If the column has the NOORDER parameter set, the `autoincrement` column contains:

    ```output
    start 1 increment 1 noorder
    ```

The output of the Information Schema and Account Usage COLUMNS views includes a new `identity_ordered` column:

| Column Name | Data Type | Description |
| --- | --- | --- |
| `identity_ordered` | TEXT | Specifies whether or not this column is an identity column with generated values in increasing order.   * If the column is an identity column and has the ORDER parameter, the column contains `YES`. * If the column is an identity column and has the NOORDER parameter, the column contains `NO`. |

Finally, the output of the GET_DDL function will include the ORDER and NOORDER parameter for sequences and columns.

Currently:
:   If the column auto-increments by 1 with the starting value of 1, the output of the GET_DDL function does not include the
    ORDER or NOORDER parameters:

    ```output
    create or replace table MYTABLE(
      MYCOL ... start 1 increment 1
      ...
    ```

Pending:
:   If the column has the ORDER parameter set, the GET_DDL output includes the ORDER parameter:

    ```output
    create or replace table MYTABLE(
      MYCOL ... start 1 increment 1 order
      ...
    ```

    If the column has the NOORDER parameter set, the GET_DDL output includes the NOORDER parameter:

    ```output
    create or replace table MYTABLE(
      MYCOL ... start 1 increment 1 noorder
      ...
    ```

Ref: 1225

---
title: Sequences and columns: New sequences and columns use NOORDER by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1483.md
section: Release Notes
---

# Sequences and columns: New sequences and columns use NOORDER by default

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

When you [create a new sequence](../../../sql-reference/sql/create-sequence.md) or a new auto-incremented column, you can specify the
ORDER or NOORDER parameter to indicate whether or not the sequence can generate new values in increasing order.

* ORDER specifies that the values generated for a sequence or auto-incremented column are in increasing order (or, if the interval
  is a negative value, in decreasing order).

  For example, if a sequence or auto-incremented column has `START 1 INCREMENT 2`, the generated values might be
  `1`, `3`, `5`, `7`, `9`, etc.
* NOORDER specifies that the values are not guaranteed to be in increasing order.

  For example, if a sequence has `START 1 INCREMENT 2`, the generated values might be `1`, `3`, `101`, `5`, `103`, etc.

  NOORDER can improve performance when multiple INSERT operations are performed concurrently (for example, when multiple
  clients are executing multiple INSERT statements).

If you do not specify ORDER or NOORDER, a default value is used. This default value is changing in order to improve performance:

Before the change:
:   * If you create a new sequence without specifying ORDER or NOORDER, ORDER is used by default.
    * If you create a new table column and specify AUTOINCREMENT without specifying ORDER or NOORDER, ORDER is used by default.

After the change:
:   * If you create a new sequence without specifying ORDER or NOORDER, NOORDER is used by default.
    * If you create a new table column and specify AUTOINCREMENT without specifying ORDER or NOORDER, NOORDER is used by default.

Note the following:

* The changes to these default values do not affect existing sequences and existing auto-incremented columns.

  The changes only affect new sequences and columns that are created when the behavior change is enabled.
* The ORDER and NOORDER properties have no effect on the uniqueness of the generated values for sequences and auto-incremented
  columns.

## Changing the default from NOORDER to ORDER

To set the default back to ORDER, set the [NOORDER_SEQUENCE_AS_DEFAULT](../../../sql-reference/parameters.md) parameter to FALSE for the account, user, or
session.

If you set this parameter, the value that you set overrides the value in the 2024_01 behavior change bundle. Setting this
parameter to FALSE keeps the ORDER as the default, even after the 2024_01 behavior change bundle is
[generally enabled](../../behavior-change-policy.md).

## Changes to the output of the GET_DDL function

In addition, the output of the [GET_DDL](../../../sql-reference/functions/get_ddl.md) function is changing for auto-incremented columns that
have START 1 INCREMENT 1 set:

Before the change:
:   The column definition returned by GET_DDL only includes the AUTOINCREMENT keyword (for example,
    `column name data type AUTOINCREMENT`).

After the change:
:   The column definition returned by GET_DDL includes all properties, including the START property, the INCREMENT property and the
    ORDER / NOORDER property (for example, `column name data type AUTOINCREMENT START 1 INCREMENT 1 NOORDER`).

## Determining if a sequence or column has the ORDER or NOORDER property

Finally, to determine if a sequence has the ORDER or NOORDER property, you can use the following commands and views:

* For sequences, you can use any of the following:

  + The [SHOW SEQUENCES](../../../sql-reference/sql/show-sequences.md) command.
  + The [DESCRIBE SEQUENCE](../../../sql-reference/sql/desc-sequence.md) command.
  + The [SEQUENCES view in INFORMATION_SCHEMA](../../../sql-reference/info-schema/sequences.md).
* For auto-incremented columns, you can check either of the following:

  + The `autoincrement` column in the output of the [SHOW COLUMNS](../../../sql-reference/sql/show-columns.md) command.
  + The `identity_ordered` column in the [COLUMNS view in INFORMATION_SCHEMA](../../../sql-reference/info-schema/columns.md).
* For sequences and auto-incremented columns, you can check for the ORDER or NOORDER property in the sequence or column
  definition returned by the [GET_DDL](../../../sql-reference/functions/get_ddl.md) function.

Ref: 1483

---
title: Server releases and feature updates earlier in 2026
source: https://docs.snowflake.com/en/release-notes/new-features-2026.md
section: Release Notes
---

# Server releases and feature updates earlier in 2026

The following sections list the release notes for the server releases and feature updates that occurred earlier in 2026:

* Server releases earlier in 2026
* Feature updates earlier in 2026

For more recent releases and feature updates, see [Snowflake server release notes and feature updates](new-features.md).

## Server releases earlier in 2026

* [10.10 Release Notes: Mar 22, 2026-Mar 25, 2026](2026/10_10.md)
  + [SQL updates](2026/10_10.md)
    - [Interval data types (*Preview*)](2026/10_10.md)
  + [Snowflake Cortex updates](2026/10_10.md)
    - [Batch Cortex Search (*Preview*)](2026/10_10.md)
  + [Release notes change log](2026/10_10.md)
* [10.9 Release Notes: Mar 17, 2026-Mar 20, 2026](2026/10_9.md)
  + [New features](2026/10_9.md)
    - [Snowflake supports directory, root stage, and SnowGit imports](2026/10_9.md)
  + [Security updates](2026/10_9.md)
    - [TSS history account usage view (*General availability*)](2026/10_9.md)
  + [SQL updates](2026/10_9.md)
    - [DML error logging for tables](2026/10_9.md)
    - [Additional date and time formats](2026/10_9.md)
    - [Additional fixed-position numeric format models](2026/10_9.md)
  + [Snowflake Cortex updates](2026/10_9.md)
    - [CKE document access history](2026/10_9.md)
  + [Release notes change log](2026/10_9.md)
* [10.8 Release Notes: Mar 08, 2026-Mar 12, 2026](2026/10_8.md)
  + [SQL updates](2026/10_8.md)
    - [User-defined types](2026/10_8.md)
  + [Data collaboration updates](2026/10_8.md)
    - [Business Continuity and Disaster Recovery (BCDR) for listings](2026/10_8.md)
  + [Release notes change log](2026/10_8.md)
* [10.7 Release Notes (with behavior changes): Mar 02, 2026-Mar 05, 2026](2026/10_7.md)
  + [Behavior change bundles](2026/10_7.md)
  + [Data lake updates](2026/10_7.md)
    - [Apache Iceberg™ tables: Support for fixed(L) data type](2026/10_7.md)
  + [Release notes change log](2026/10_7.md)
* [10.6 Release Notes: Feb 23, 2026-Feb 27, 2026](2026/10_6.md)
  + [Data lake updates](2026/10_6.md)
    - [Apache Iceberg™ tables: Partitioned writes with hierarchical paths (*Preview*)](2026/10_6.md)
  + [Data governance updates](2026/10_6.md)
    - [Data quality: Non-owners can associate a data metric function with an object (*General availability*)](2026/10_6.md)
  + [Release notes change log](2026/10_6.md)
* [10.5 Release Notes: Feb 16, 2026-Feb 19, 2026](2026/10_5.md)
  + [Security updates](2026/10_5.md)
    - [SAML2 federated authentication: Support for metadata URL](2026/10_5.md)
    - [Tri-Secret Secure supports secure share area accounts](2026/10_5.md)
  + [Data governance updates](2026/10_5.md)
    - [DUPLICATE_COUNT DMF: Ability to specify multiple columns](2026/10_5.md)
  + [Release notes change log](2026/10_5.md)
* [10.4 Release Notes: Feb 09, 2026-Feb 13, 2026](2026/10_4.md)
  + [SQL updates](2026/10_4.md)
    - [New SQL functions](2026/10_4.md)
  + [Release notes change log](2026/10_4.md)
* [10.3 Release Notes: Feb 02, 2026-Feb 05, 2026](2026/10_3.md)
  + [Extensibility updates](2026/10_3.md)
    - [Owner’s rights contexts: Allow INFORMATION_SCHEMA, SHOW, and DESCRIBE](2026/10_3.md)
  + [Release notes change log](2026/10_3.md)
* [10.2 Release Notes: Jan 26, 2026-Jan 30, 2026](2026/10_2.md)
  + [SQL updates](2026/10_2.md)
    - [New UUID data type](2026/10_2.md)
  + [Data loading / unloading updates](2026/10_2.md)
    - [Support for Microsoft Fabric OneLake (*General availability*)](2026/10_2.md)
  + [Release notes change log](2026/10_2.md)
* [10.1 Release Notes (with behavior changes): Jan 19, 2026-Jan 23, 2026](2026/10_1.md)
  + [Behavior change bundles](2026/10_1.md)
  + [SQL updates](2026/10_1.md)
    - [Retrieve bind variable values (*General availability*)](2026/10_1.md)
  + [Release notes change log](2026/10_1.md)
* [10.0 Release Notes: Jan 12, 2026-Jan 15, 2026](2026/10_0.md)
  + [SQL updates](2026/10_0.md)
    - [Search optimization: Support for structured data types](2026/10_0.md)
  + [Data governance updates](2026/10_0.md)
    - [Copy tags when running a CREATE OR REPLACE TABLE command (*Preview*)](2026/10_0.md)
  + [Release notes change log](2026/10_0.md)

## Feature updates earlier in 2026

* [Mar 16, 2026: Metering disabled for hybrid table requests](2026/other/2026-03-16-hybrid-tables-metering-disabled.md)
* [Mar 16, 2026: Snowflake Notebooks renamed to Legacy Snowflake Notebooks](2026/other/2026-03-16-legacy-notebooks.md)
  + [Migration to Notebooks in Workspaces](2026/other/2026-03-16-legacy-notebooks.md)
* [Mar 16, 2026: Apache Iceberg™ tables: Write support by using an external query engine (*Preview*)](2026/other/2026-03-16-tables-iceberg-query-using-external-query-engine-snowflake-horizon-writes-feature.md)
* [Mar 13, 2026: Cortex Agent evaluations (*General availability*)](2026/other/2026-03-13-cortex-agent-evaluations.md)
* [Mar 13, 2026: Time distribution information added to STATISTICS column in dynamic table refresh history](2026/other/2026-03-13-dt-time-distribution.md)
* [Mar 13, 2026: Network Policy Advisor — *General availability*](2026/other/2026-03-13-network-policy-advisor-ga.md)
* [Mar 13, 2026: Support for specifying relationship paths in semantic views (*Preview*)](2026/other/2026-03-13-semantic-views-multi-path.md)
* [Mar 13, 2026: New OVERLAP_POLICY parameter for task graphs](2026/other/2026-03-13-tasks-overlap-policy.md)
* [Mar 12, 2026: AI_EXTRACT scale factor parameter (*General availability*)](2026/other/2026-03-12-ai-extract.md)
* [Mar 12, 2026: AI code suggestions in Workspaces (*Preview*)](2026/other/2026-03-12-cortex-code-ai-suggestions-preview.md)
* [Mar 12, 2026: Investigate cost anomalies using hourly consumption by service type](2026/other/2026-03-12-cost-anomaly-hourly-consumption-by-service-type.md)
* [Mar 12, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-03-12-dcr.md)
  + [Clean Rooms API Version: 13.6](2026/other/2026-03-12-dcr.md)
* [Mar 12, 2026: Multi-Location Resilience for Data Pipelines (General availability)](2026/other/2026-03-12-multi-location-resilience-data-pipelines-ga.md)
* [Mar 12, 2026: Recent Cortex Search updates (*Generally Available*)](2026/other/2026-03-12-recent-cortex-search.md)
  + [Multi-index search](2026/other/2026-03-12-recent-cortex-search.md)
  + [Custom vector embeddings](2026/other/2026-03-12-recent-cortex-search.md)
  + [Enhanced Cortex Search tool for Cortex Agents and Snowflake Intelligence](2026/other/2026-03-12-recent-cortex-search.md)
* [Mar 11, 2026: Resource budgets for Cortex Agents](2026/other/2026-03-11-cortex-agents-resource-budgets.md)
* [Mar 11, 2026: Resource budgets for Snowflake Intelligence](2026/other/2026-03-11-snowflake-intelligence-resource-budgets.md)
* [Mar 9, 2026: Cortex Code in Snowsight - *General availability*](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
  + [Why this matters](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
  + [Legal notices](2026/other/2026-03-09-cortex-code-snowsight-ga.md)
* [Mar 09, 2026: Streamlit in Snowflake container runtime and secrets support (*General availability*)](2026/other/2026-03-09-sis-container-runtime-ga.md)
* [Mar 06, 2026: SYSTEM$GET_CATALOG_LINKED_DATABASE_CONFIG function (*General availability*)](2026/other/2026-03-06-system-get-catalog-linked-database-config.md)
* [Mar 05, 2026: AI_COMPLETE document intelligence (*Preview*)](2026/other/2026-03-05-ai-complete-document-intelligence.md)
* [Mar 05, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-03-05-dcr.md)
  + [Clean Rooms API Version: 13.5](2026/other/2026-03-05-dcr.md)
* [Mar 05, 2026: Preventing a semantic view metric from being aggregated across specific dimensions](2026/other/2026-03-05-semantic-views-semi-additive-metrics.md)
* [Mar 05, 2026: Exporting a semantic view to a Tableau Data Source (TDS) file (*Preview*)](2026/other/2026-03-05-semantic-views-tableau-tds.md)
* [Mar 04, 2026: Support for Apache Iceberg™ version 3 (*Preview*)](2026/other/2026-03-04-iceberg-v3-support-preview.md)
* [Mar 2, 2026: Monitor and control Cortex AI Functions spending (*General availability*)](2026/other/2026-02-25-ai-functions-cost-management.md)
* [Mar 02, 2026: No limit on the number of backup sets per object](2026/other/2026-03-02-backups-no-limit-backup-sets.md)
* [Mar 02, 2026: Support for new dbt Core versions for dbt Projects on Snowflake](2026/other/2026-03-02-dbt-core-versions.md)
* [Mar 02, 2026: Simplified pricing for hybrid tables](2026/other/2026-03-02-hybrid-tables-pricing.md)
* [Mar 02, 2026: Query Delta-based Apache Iceberg™ tables with deletion vectors](2026/other/2026-03-02-iceberg-delta-deletion-vectors.md)
* [Mar 02, 2026: Using standard SQL clauses to query semantic views (*General availability*)](2026/other/2026-03-02-semantic-views-standard-sql.md)
* [Feb 27, 2026: Openflow Connector for Oracle (*General availability*)](2026/other/2026-02-27-openflow-oracle-ga.md)
* [Feb 27, 2026: Restricted caller’s rights in Streamlit in Snowflake (*Preview*)](2026/other/2026-02-27-sis-restricted-callers-rights.md)
* [Feb 26, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-26-dcr.md)
  + [Clean Rooms API Version: 13.4](2026/other/2026-02-26-dcr.md)
* [Feb 25, 2026: Account Usage CORTEX_AGENT_USAGE_HISTORY view (*General availability*)](2026/other/2026-02-25-cortex-agent-usage-history-view.md)
* [Feb 25, 2026: Joining logical tables that contain ranges of values in a semantic view (*Preview*)](2026/other/2026-02-25-semantic-views-range-joins.md)
* [Feb 25, 2026: Account Usage SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*General availability*)](2026/other/2026-02-25-snowflake-intelligence-usage-history-view.md)
* [Feb 24, 2026: View invoices in Snowsight](2026/other/2026-02-24-billing-invoices.md)
* [Feb 24, 2026: User-defined actions for budgets](2026/other/2026-02-24-budget-user-defined-actions.md)
* [Feb 24, 2026: Enforcement of privatelink-only access (*General availability*)](2026/other/2026-02-24-enforce-privatelink-access-only.md)
* [Feb 24, 2026: Snowflake Postgres (*General availability*)](2026/other/2026-02-24-snowflake-postgres-ga.md)
* [Feb 23, 2026: Simplified setup for Data Quality Monitoring](2026/other/2026-02-23-data-quality-monitoring-setup.md)
  + [Cortex Data Quality (*Preview*)](2026/other/2026-02-23-data-quality-monitoring-setup.md)
  + [User interface for creating data quality checks (*Preview*)](2026/other/2026-02-23-data-quality-monitoring-setup.md)
* [Feb 23, 2026: Grouped Query History in Snowsight (*General availability*)](2026/other/2026-02-23-grouped-query-history-ui.md)
* [Feb 20, 2026: Snowflake Native Apps: Configuration (*Preview*)](2026/other/2026-02-20-nativeapps-configuration.md)
* [Feb 20, 2026: USE AI FUNCTIONS account privilege for Cortex AI Functions](2026/other/2026-02-20-use-ai-functions-privilege.md)
* [Feb 19, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-19-dcr.md)
  + [Clean Rooms API Version: 13.3](2026/other/2026-02-19-dcr.md)
* [Feb 19, 2026: Machine learning experiments (*General availability*)](2026/other/2026-02-19-ml-experiments-ga.md)
* [Feb 18, 2026: Snowflake Container Runtime versioning for ML Jobs (*Preview*)](2026/other/2026-02-18-container-runtime-versions-preview.md)
* [Feb 18, 2026: Account Usage New CORTEX_AGENT_USAGE_HISTORY view (*Preview*)](2026/other/2026-02-18-cortex-agent-usage-history-view.md)
* [Feb 18, 2026: Support for changing refresh user and secondary roles](2026/other/2026-02-18-dynamic-tables-execute-as-user.md)
* [Feb 18, 2026: Row timestamps for pipeline latency and event tracking (*General availability*)](2026/other/2026-02-18-row-timestamps.md)
* [Feb 18, 2026: Account Usage New SNOWFLAKE_INTELLIGENCE_USAGE_HISTORY view (*Preview*)](2026/other/2026-02-18-snowflake-intelligence-usage-history-view.md)
* [Feb 17, 2026: Access history improvements](2026/other/2026-02-17-access-history.md)
* [Feb 16, 2026: Sharing Streamlit in Snowflake apps (*Preview*)](2026/other/2026-02-16-sis.md)
* [Feb 13, 2026: Run Security Essentials scanners on demand](2026/other/2026-02-13-adhoc-security-essentials.md)
* [Feb 13, 2026: Snowflake Native Apps: Inter-App Communication (*Preview*)](2026/other/2026-02-13-nativeapps-iac.md)
* [Feb 12, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-12-dcr.md)
  + [Clean Rooms API Version: 13.2](2026/other/2026-02-12-dcr.md)
* [Feb 12, 2026: New checkout experience for private offers with flat-fee pricing (*General availability*)](2026/other/2026-02-12-marketplace-checkout-experience-ga.md)
* [Feb 12, 2026: Strong Authentication Hub (*Preview*)](2026/other/2026-02-12-strong-authentication-hub.md)
* [Feb 10, 2026: Snowflake Native Apps: Shareback (*General Availability*)](2026/other/2026-02-10-nativeapps-shareback.md)
* [Feb 09, 2026: Performance Explorer enhancements (*Preview*)](2026/other/2026-02-09-performance-explorer-enhancements-preview.md)
* [Feb 06, 2026: Cortex Code data science and machine learning skill (*Preview*)](2026/other/2026-02-06-cortex-code-data-science-preview.md)
* [Feb 06, 2026: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*General availability*)](2026/other/2026-02-06-tables-iceberg-query-using-external-query-engine-snowflake-horizon-ga.md)
* [Feb 06, 2026: Trust Center Overview tab (*Preview*)](2026/other/2026-02-06-trust-center-overview-tab-preview.md)
* [Feb 05, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-02-05-dcr.md)
  + [Clean Rooms API Version: 12.9](2026/other/2026-02-05-dcr.md)
* [Feb 05, 2026: Notebooks in Workspaces (*General Availability*)](2026/other/2026-02-05-notebooks-in-workspaces.md)
  + [Key features](2026/other/2026-02-05-notebooks-in-workspaces.md)
* [Feb 05, 2026: Sensitive data classification: Support for semi-structured data (*General availability*)](2026/other/2026-02-05-sensitive-data-classification-json.md)
* [Feb 04, 2026: Cortex Search Component Scores (*General availability*)](2026/other/2026-02-04-cortex-search-component-scores-ga.md)
* [Feb 4, 2026: Object tagging support for interactive tables](2026/other/2026-02-04-interactive-tagging.md)
* [Feb 04, 2026: Sensitive data classification: Classify a subset of native semantic categories (*Preview*)](2026/other/2026-02-04-sensitive-data-classification-subset-categories.md)
* [Feb 02, 2026: Cortex Code CLI (*General availability*)](2026/other/2026-02-02-cortex-code-cli.md)
* [Feb 02, 2026: Cortex Code in Snowsight (*Preview*)](2026/other/2026-02-02-cortex-code-snowsight.md)
  + [Key capabilities](2026/other/2026-02-02-cortex-code-snowsight.md)
* [Feb 02, 2026: Support for listing and share observability (*General availability*)](2026/other/2026-02-02-listing-observability-ga.md)
  + [New views and functions in the INFORMATION_SCHEMA schema](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.LISTINGS view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.SHARES view (for providers and consumers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [INFORMATION_SCHEMA.AVAILABLE_LISTINGS table function (for consumers)](2026/other/2026-02-02-listing-observability-ga.md)
  + [New and updated views in the ACCOUNT_USAGE schema](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.LISTINGS view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.SHARES view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [ACCOUNT_USAGE.GRANTS_TO_SHARES view (for providers)](2026/other/2026-02-02-listing-observability-ga.md)
    - [Updates to ACCOUNT_USAGE.ACCESS_HISTORY view](2026/other/2026-02-02-listing-observability-ga.md)
* [Feb 02, 2026: Use Snowsight to manage external volumes (*Preview*)](2026/other/2026-02-02-manage-external-volumes-by-using-snowsight.md)
* [Feb 02, 2026: Share Connected Apps in Snowflake Marketplace listings (*General availability*)](2026/other/2026-02-02-share-connected-apps-in-sfmarketplace-listings-ga.md)
* [Feb 01, 2026: New ORGANIZATION_USAGE premium views](2026/other/2026-02-01-organization-usage-new-views.md)
* [Jan 30, 2026: Support for bi-directional data access with Microsoft Fabric (*General availability*)](2026/other/2026-01-30-iceberg-microsoft-fabric-bidirectional-data-access-ga.md)
* [Jan 30, 2026: New regions](2026/other/2026-01-30-new-regions.md)
* [Jan 29, 2026: Apache DataSketches functions (*General availability*)](2026/other/2026-01-29-datasketches-functions-ga.md)
* [Jan 29, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-01-29-dcr.md)
  + [Clean Rooms API Version: 12.8](2026/other/2026-01-29-dcr.md)
* [Jan 28, 2026: Fine-tuning `arctic-extract` models (*Preview*)](2026/other/2026-01-28-fine-tuning-arctic-extract-models.md)
* [Jan 28, 2026: Private connectivity for TSS on Google Cloud (*General availability*)](2026/other/2026-01-28-tss-private-connectivity-gcp.md)
* [Jan 27, 2026: Estimate token usage with AI_COUNT_TOKENS (*General availability*)](2026/other/2026-01-27-ai-count-tokens-function-ga.md)
* [Jan 27, 2026: Enforce data protection policies when querying Apache Iceberg™ tables from Apache Spark™](2026/other/2026-01-27-iceberg-enforce-access-policies-on-tables-queried-from-apache-spark.md)
* [Jan 26, 2026: Extract images from documents using AI_PARSE_DOCUMENT (Preview)](2026/other/2026-01-26-ai-parse-document-images-preview.md)
* [Jan 26, 2026: Specify a dynamic task configuration with EXECUTE TASK](2026/other/2026-01-26-dynamic-task-config.md)
* [Jan 23, 2026: Malicious IP Protection updates](2026/other/2026-01-23-malicious-ip-protection.md)
* [Jan 23, 2026: Consumer-controlled maintenance policies for Snowflake Native Apps (*Preview*)](2026/other/2026-01-23-native-apps-consumer-maintenance-policies.md)
* [Jan 23, 2026: Network policies for External OAuth](2026/other/2026-01-23-network-policies-external-oauth.md)
* [Jan 23, 2026: Organization users (*General availability*)](2026/other/2026-01-23-org-users-ga.md)
* [Jan 23, 2026: Storage lifecycle policies: Expanded support](2026/other/2026-01-23-storage-lifecycle-policies-azure.md)
* [Jan 22, 2026: AI_AGG and AI_SUMMARIZE_AGG (*General availability*)](2026/other/2026-01-22-ai-agg-ai-summarize-agg-ga.md)
* [Jan 22, 2026: AI_FILTER for filtering with natural language predicates (*General availability*)](2026/other/2026-01-22-ai-filter-ga.md)
* [Jan 22, 2026: Document Processing Playground (*General availability*)](2026/other/2026-01-22-document-processing-playground.md)
* [Jan 22, 2026: European Union categories for sensitive data classification](2026/other/2026-01-22-sensitive-data-classification-eu-india.md)
* [Jan 21, 2026: Snowflake OAuth for local applications](2026/other/2026-01-21-snowflake-oauth-local-applications.md)
* [Jan 20, 2026: Shared Workspaces (*General availability*)](2026/other/2026-01-20-shared-workspaces.md)
  + [Key features](2026/other/2026-01-20-shared-workspaces.md)
* [Jan 16, 2026: External lineage (*Preview*)](2026/other/2026-01-16-external-lineage.md)
* [Jan 16, 2026: Sensitive data classification in the Trust Center (*Preview*)](2026/other/2026-01-16-trust-center-sensitive-data-classification.md)
* [Jan 15, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-01-15-dcr.md)
  + [Clean Rooms API Version: 12.5](2026/other/2026-01-15-dcr.md)
* [Jan 14, 2026: Workspaces replication (*Preview*)](2026/other/2026-01-14-workspace-replication.md)
* [Jan 12, 2026: Specifying custom instructions in semantic views](2026/other/2026-01-12-semantic-views-custom-instructions.md)
* [Jan 08, 2026: Tri-Secret Secure in China (*General availability*)](2026/other/2026-01-08-tss-available-china.md)
* [Jan 07, 2026: Reorganized UI for listings (*General availability*)](2026/other/2026-01-07-listings-ui-reorganization.md)
  + [Changes to Provider Studio](2026/other/2026-01-07-listings-ui-reorganization.md)
  + [Changes to Internal Marketplace listings](2026/other/2026-01-07-listings-ui-reorganization.md)

---
title: Server releases and feature updates in 2023
source: https://docs.snowflake.com/en/release-notes/new-features-2023.md
section: Release Notes
---

# Server releases and feature updates in 2023

The following sections list the release notes for the server releases and feature updates in 2023:

* Server releases in 2023
* Feature updates in 2023

For more recent releases and feature updates, see [Snowflake server release notes and feature updates](new-features.md).

## Server releases in 2023

* [December 14-15, 2023 — 7.44 Release Notes](2023/7_44.md)
  + [New Features](2023/7_44.md)
    - [Organization Usage: Improved views for billing reconciliation — *General Availability*](2023/7_44.md)
  + [SQL Updates](2023/7_44.md)
    - [Snowflake Cortex ML-Based Time-Series Functions — *General Availability*](2023/7_44.md)
  + [Ecosystem Updates](2023/7_44.md)
    - [Snowpark ML Modeling API — *General Availability*](2023/7_44.md)
    - [Snowpark ML Distributed Hyperparameter Optimization — *Preview*](2023/7_44.md)
  + [Data Lake Updates](2023/7_44.md)
    - [Cross-Cloud/Cross-Region Support for Apache Iceberg™ Tables — *Preview*](2023/7_44.md)
  + [Release Notes Change Log](2023/7_44.md)
* [December 04-05, 2023 — 7.43 Release Notes](2023/7_43.md)
  + [New Features](2023/7_43.md)
    - [Finalizer Task — *General Availability*](2023/7_43.md)
  + [SQL Updates](2023/7_43.md)
    - [New SQL functions](2023/7_43.md)
  + [Extensibility Updates](2023/7_43.md)
    - [Python Snowpark Local Testing Framework — *Preview*](2023/7_43.md)
  + [Web Interface Updates](2023/7_43.md)
    - [Load Files onto Stages and Managed Staged Files using Snowsight — *General Availability*](2023/7_43.md)
  + [Release Notes Change Log](2023/7_43.md)
* [November 29-30, 2023 — 7.42 Release Notes](2023/7_42.md)
  + [New Features](2023/7_42.md)
    - [Native Apps: Support for reference and privilege validation in the manifest file — *Preview*](2023/7_42.md)
    - [Schema detection for JSON and CSV — *General Availability*](2023/7_42.md)
    - [Table schema evolution — *General Availability*](2023/7_42.md)
    - [Apache Iceberg™ tables — *Preview*](2023/7_42.md)
    - [Self-service: Enabling the ORGADMIN role — *General Availability*](2023/7_42.md)
    - [Self-service: Deleting an account — *General Availability*](2023/7_42.md)
  + [Security Updates](2023/7_42.md)
    - [Key pair authentication: Improved troubleshooting](2023/7_42.md)
  + [SQL Updates](2023/7_42.md)
    - [Structured types — *Preview*](2023/7_42.md)
  + [Data Governance Updates](2023/7_42.md)
    - [Row access policies: Reference a protected mapping table in a row access policy — *General availability*](2023/7_42.md)
  + [Data Collaboration Updates](2023/7_42.md)
    - [Recurring subscription-based pricing plans for paid listings —– *General Availability*](2023/7_42.md)
    - [Cross-Cloud Auto-Fulfillment support for sharing a Snowflake Native App — *Preview*](2023/7_42.md)
  + [Release Notes Change Log](2023/7_42.md)
* [November 11-14, 2023 — 7.41 Release Notes (with behavior changes)](2023/7_41.md)
  + [Behavior Change Bundles](2023/7_41.md)
  + [SQL Updates](2023/7_41.md)
    - [New SQL functions](2023/7_41.md)
    - [UDFs and Stored Procedures: Support for Optional Arguments](2023/7_41.md)
    - [Snowflake alerts: Manual execution of alerts](2023/7_41.md)
  + [Security Updates](2023/7_41.md)
    - [Replication of network rules — *Preview*](2023/7_41.md)
  + [Data Pipeline Updates](2023/7_41.md)
    - [Dynamic tables: Support for GRANT <privilege> ON ALL/FUTURE DYNAMIC TABLE - Preview](2023/7_41.md)
    - [Dynamic tables: Support for GRANT ALL/ALL PRIVILEGES ON DYNAMIC TABLE - Preview](2023/7_41.md)
  + [Web Interface Updates](2023/7_41.md)
    - [More control over notification contacts in Snowsight](2023/7_41.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_41.md)
    - [Changes to formatting of query results in worksheets and dashboards](2023/7_41.md)
  + [Release Notes Change Log](2023/7_41.md)
* [November 09-10, 2023 — 7.40 Release Notes](2023/7_40.md)
  + [SQL Updates](2023/7_40.md)
    - [Search Optimization: Support for Substring Search in Semi-Structured Data — *General Availability*](2023/7_40.md)
    - [Email Notification Integrations: ALLOWED_RECIPIENTS No Longer Required](2023/7_40.md)
  + [Web Interface Updates](2023/7_40.md)
    - [Replication and Client Redirect in Snowsight — *Preview*](2023/7_40.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_40.md)
  + [Release Notes Change Log](2023/7_40.md)
* [November 03-06, 2023 — 7.39 Release Notes (with Snowday 2023)](2023/7_39.md)
  + [New Features](2023/7_39.md)
    - [Account Usage: New AGGREGATE_QUERY_HISTORY View — *Preview*](2023/7_39.md)
    - [Budgets on Azure, GCP, and VPS — *Preview*](2023/7_39.md)
    - [Snowflake Native SDK for Connectors — *Preview*](2023/7_39.md)
  + [Security Updates](2023/7_39.md)
    - [Access control: Database roles — *General availability*](2023/7_39.md)
  + [Data Pipeline Updates](2023/7_39.md)
    - [New function SYSTEM$TASK_RUNTIME_INFO](2023/7_39.md)
  + [Extensibility Updates](2023/7_39.md)
    - [External network access — *Preview on Azure*](2023/7_39.md)
    - [Vectorized Python UDTFs — *General Availability*](2023/7_39.md)
  + [Data Governance Updates](2023/7_39.md)
    - [Set a masking policy on a virtual column — *General Availability*](2023/7_39.md)
  + [Release Notes Change Log](2023/7_39.md)
* [October 23-24, 2023 — 7.38 Release Notes](2023/7_38.md)
  + [Security Updates](2023/7_38.md)
    - [Network rules support Azure private endpoints — *Preview*](2023/7_38.md)
  + [SQL Updates](2023/7_38.md)
    - [ALTER TABLE: Support for IF [NOT] EXISTS with ADD COLUMN and DROP COLUMN](2023/7_38.md)
    - [H3 functions for GEOGRAPHY objects — *Preview*](2023/7_38.md)
  + [Extensibility Updates](2023/7_38.md)
    - [Python packages policies — *Preview*](2023/7_38.md)
  + [Web Interface Updates](2023/7_38.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_38.md)
  + [Release Notes Change Log](2023/7_38.md)
* [October 16-17, 2023 — 7.37 Release Notes](2023/7_37.md)
  + [New Features](2023/7_37.md)
    - [Logging and tracing from handler code — *General Availability*](2023/7_37.md)
  + [Extensibility Updates](2023/7_37.md)
    - [Reading files with a Python function or procedure — *General Availability*](2023/7_37.md)
    - [Reading files with a Scala function or procedure handler — *General Availability*](2023/7_37.md)
  + [SQL Updates](2023/7_37.md)
    - [Fixed an issue with column aliases for aggregates and the GROUP BY ALL clause](2023/7_37.md)
  + [Web Interface Updates](2023/7_37.md)
    - [Can no longer add or manage payment details using Classic Console](2023/7_37.md)
  + [Release Notes Change Log](2023/7_37.md)
* [October 09-10, 2023 — 7.36 Release Notes](2023/7_36.md)
  + [Extensibility Updates](2023/7_36.md)
    - [Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *Preview*](2023/7_36.md)
  + [Data Collaboration Updates](2023/7_36.md)
    - [Company name for listing analytics](2023/7_36.md)
  + [Web Interface Updates](2023/7_36.md)
    - [Accessing billing usage statements — *General Availability*](2023/7_36.md)
    - [Viewing Query History in worksheets — *Preview*](2023/7_36.md)
  + [Release Notes Change Log](2023/7_36.md)
* [October 03-05, 2023 — 7.35 Release Notes](2023/7_35.md)
  + [New Features](2023/7_35.md)
    - [Budgets — *Preview*](2023/7_35.md)
    - [Dynamic tables refreshed on creation by default — *Preview*](2023/7_35.md)
    - [Dynamic tables new sharing capabilities — *Preview*](2023/7_35.md)
  + [SQL Updates](2023/7_35.md)
    - [New SQL Functions](2023/7_35.md)
  + [Data Collaboration Updates](2023/7_35.md)
    - [Allow non-admins to set up Cross-Cloud Auto-Fulfillment](2023/7_35.md)
    - [Offer a limited trial of a data product on the Snowflake Marketplace — *Preview*](2023/7_35.md)
  + [Web Interface Updates](2023/7_35.md)
    - [Task graph run debugging — *Preview*](2023/7_35.md)
  + [Release Notes Change Log](2023/7_35.md)
* [September 27-29, 2023 — 7.34 Release Notes (with behavior changes)](2023/7_34.md)
  + [Behavior Change Bundles](2023/7_34.md)
  + [New Features](2023/7_34.md)
    - [Snowflake Alerts — *General Availability*](2023/7_34.md)
  + [SQL Updates](2023/7_34.md)
    - [New SQL Functions](2023/7_34.md)
  + [Web Interface Updates](2023/7_34.md)
    - [Allow Snowflake On Demand customers to purchase listings](2023/7_34.md)
* [September 18-19, 2023 — 7.33 Release Notes](2023/7_33.md)
  + [New Features](2023/7_33.md)
    - [Network Rules — *Preview*](2023/7_33.md)
    - [Enhanced Network Security — *Preview*](2023/7_33.md)
    - [Network Isolation to Internal Stages Using AWS PrivateLink — *Preview*](2023/7_33.md)
  + [Data Loading Updates](2023/7_33.md)
    - [Cross-platform Support for Snowpipe Auto-Ingest — *General Availability*](2023/7_33.md)
    - [Amazon EventBridge Support for Snowpipe Auto-Ingest — *General Availability*](2023/7_33.md)
  + [Data Governance Updates](2023/7_33.md)
    - [Tag-based Masking Policy: Support for Database & Schema — *General Availability*](2023/7_33.md)
    - [Shared Tag References — *Preview*](2023/7_33.md)
* [September 11-12, 2023 — 7.32 Release Notes](2023/7_32.md)
  + [SQL Updates](2023/7_32.md)
    - [New Function: IS_DATABASE_ROLE_IN_SESSION](2023/7_32.md)
  + [Data Loading / Unloading Updates](2023/7_32.md)
    - [Replicating streams on Snowflake tables populated by Snowpipe Streaming](2023/7_32.md)
    - [Snowpipe Streaming authentication updates](2023/7_32.md)
    - [New options for INFER_SCHEMA](2023/7_32.md)
  + [Data Governance Updates](2023/7_32.md)
    - [Row access policies: Reference a protected mapping table in a row access policy — *Preview*](2023/7_32.md)
    - [Share data protected by a role-based policy — *Preview*](2023/7_32.md)
  + [Web Interface Updates](2023/7_32.md)
    - [Managing data governance in Snowsight — *Generally Available*](2023/7_32.md)
* [September 05-06, 2023 – 7.31 Release Notes](2023/7_31.md)
  + [New Features](2023/7_31.md)
    - [New Information Schema Views for Class Instances](2023/7_31.md)
    - [External Network Access — *Preview*](2023/7_31.md)
* [August 28-29, 2023 — 7.30 Release Notes](2023/7_30.md)
  + [New Features](2023/7_30.md)
    - [Data Pipelines Replication Support — *Preview*](2023/7_30.md)
  + [Security Updates](2023/7_30.md)
    - [Password policies: Add support for password history and time to wait to change a password](2023/7_30.md)
  + [SQL Updates](2023/7_30.md)
    - [EXECUTE IMMEDIATE FROM File — *Preview*](2023/7_30.md)
    - [Organizations & Accounts: Dropping an account URL — *Preview*](2023/7_30.md)
  + [Developer and Extensibility Updates](2023/7_30.md)
    - [Support for Python 3.9 and 3.10 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*](2023/7_30.md)
    - [Tabular Return Values from Python Stored Procedures — *General Availability*](2023/7_30.md)
  + [Data Governance Updates](2023/7_30.md)
    - [Set a masking policy on a virtual column — *Preview*](2023/7_30.md)
  + [Web Interface Updates](2023/7_30.md)
    - [Governance area supports GOVERNANCE_VIEWER and OBJECT_VIEWER database roles](2023/7_30.md)
    - [Provider Studio Onboarding — *General Availability*](2023/7_30.md)
* [August 22-23, 2023 – 7.29 Release Notes (with behavior changes)](2023/7_29.md)
  + [Behavior Changes Bundles](2023/7_29.md)
  + [Non-bundled Pending Behavior Changes](2023/7_29.md)
  + [SQL Updates](2023/7_29.md)
    - [GET_QUERY_OPERATOR_STATS Function — *General Availability*](2023/7_29.md)
    - [Using the Query Hash to Identify Patterns and Trends in Queries](2023/7_29.md)
    - [New SQL Functions](2023/7_29.md)
* [August 16-17, 2023 — 7.28 Release Notes](2023/7_28.md)
  + [New Features](2023/7_28.md)
    - [Blocking Public Access to Azure Internal Stages — *General Availability*](2023/7_28.md)
  + [SQL Updates](2023/7_28.md)
    - [Python Package Version Range Support — *Preview*](2023/7_28.md)
  + [Data Loading Updates](2023/7_28.md)
    - [New File Format Option: USE_LOGICAL_TYPE](2023/7_28.md)
  + [Web Interface Updates](2023/7_28.md)
    - [Snowsight Worksheet Tabs — *General Availability*](2023/7_28.md)
* [August 07-08, 2023 — 7.27 Release Notes](2023/7_27.md)
  + [New Features](2023/7_27.md)
    - [Account Usage: New CLASS_INSTANCES View](2023/7_27.md)
  + [SQL Updates](2023/7_27.md)
    - [New System Stored Procedure for Sending Email Notifications — *General Availability*](2023/7_27.md)
  + [Web Interface Updates](2023/7_27.md)
    - [Sharing: Improved UI Messaging](2023/7_27.md)
* [August 01-02, 2023 — 7.26 Release Notes](2023/7_26.md)
  + [SQL Updates](2023/7_26.md)
    - [SELECT \*: Selecting Columns Matching a SQL Pattern and Replacing Column Values](2023/7_26.md)
    - [Transforming a GEOMETRY Object to a Different Spatial Reference System (ST_TRANSFORM) — *General Availability*](2023/7_26.md)
    - [Vectorized Python UDTFs — *Preview*](2023/7_26.md)
  + [Data Collaboration Updates](2023/7_26.md)
    - [Recurring Subscription-based Pricing Plans for Paid Listings — *Preview*](2023/7_26.md)
    - [Non-Recurring Subscription-based Pricing Plans for Paid Listings — *General Availability*](2023/7_26.md)
  + [Documentation and Learning Resources](2023/7_26.md)
    - [Weekly Release Notes in the Snowflake Documentation](2023/7_26.md)
* [July 25-26, 2023 — 7.25 Release Notes](2023/7_25.md)
  + [New Features](2023/7_25.md)
    - [Organization Usage: New QUERY_ACCELERATION_HISTORY View](2023/7_25.md)
  + [SQL Updates](2023/7_25.md)
    - [Snowflake Alerts: Support for Future Grants and Object Tagging](2023/7_25.md)
    - [Search Optimization: Support for Substring Search in Semi-Structured Data — *Preview*](2023/7_25.md)
  + [Data Governance Updates](2023/7_25.md)
    - [Access History: Track Masking & Row Access Policy References — *General Availability*](2023/7_25.md)
  + [Web Interface Updates](2023/7_25.md)
    - [Create Named Stages using Snowsight — *General Availability*](2023/7_25.md)
* [July 19-20, 2023 — 7.24 Release Notes](2023/7_24.md)
  + [New Features](2023/7_24.md)
    - [SQL Syntax for Enabling the ORGADMIN Role — *Preview*](2023/7_24.md)
* [July 10-12, 2023 — 7.23 Release Notes (with behavior changes)](2023/7_23.md)
  + [Behavior Changes Bundles](2023/7_23.md)
  + [New Features](2023/7_23.md)
    - [Schema Detection and Evolution for Kafka Connector With Snowpipe Streaming — *Preview*](2023/7_23.md)
  + [SQL Updates](2023/7_23.md)
    - [SYSTEM$CLUSTERING_INFORMATION Returns Error Messages](2023/7_23.md)
  + [Web Interface Updates](2023/7_23.md)
    - [Snowsight Set as Default Web Interface](2023/7_23.md)
* [July 05-06, 2023 — 7.22 Release Notes](2023/7_22.md)
  + [New Features](2023/7_22.md)
    - [Deleting an Account (Self-service) — *Preview*](2023/7_22.md)
    - [Organization Usage: New REPLICATION_GROUP_USAGE_HISTORY View](2023/7_22.md)
  + [SQL Updates](2023/7_22.md)
    - [New SQL Functions](2023/7_22.md)
    - [GROUP BY: New ALL Keyword](2023/7_22.md)
* [June 19-22, 2023 — 7.21 Release Notes](2023/7_21.md)
  + [SQL Updates](2023/7_21.md)
    - [New SQL Functions](2023/7_21.md)
  + [Data Loading Updates](2023/7_21.md)
    - [Support REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML](2023/7_21.md)
* [June 14-15, 2023 — 7.20 Release Notes](2023/7_20.md)
  + [New Features](2023/7_20.md)
    - [Snowpipe Streaming Replication Support — *Preview*](2023/7_20.md)
  + [Security Updates](2023/7_20.md)
    - [Access Control: New Privilege for Delegating Warehouse Management — *Preview*](2023/7_20.md)
  + [SQL Updates](2023/7_20.md)
    - [Improved Performance for SELECT Statements With LIMIT and ORDER BY Clauses — *General Availability*](2023/7_20.md)
    - [Support for Python 3.10 in Snowpark, UDFs, UDTFs and Stored Procedures — *Preview*](2023/7_20.md)
  + [Data Governance Updates](2023/7_20.md)
    - [Tag-based Masking Policy: Support for Database & Schema — *Preview*](2023/7_20.md)
    - [Access History: Track Objects Modified by a DDL Operation — *Preview*](2023/7_20.md)
  + [Web Interface Updates](2023/7_20.md)
    - [Load Files From a Stage Into a Table — *General Availability*](2023/7_20.md)
* [June 07-08, 2023 — 7.19 Release Notes (with behavior changes)](2023/7_19.md)
  + [Behavior Changes Bundles](2023/7_19.md)
  + [New Features](2023/7_19.md)
    - [Anonymous Procedures — *General Availability*](2023/7_19.md)
    - [Reading Files With a Java Function or Procedure Handler — *General Availability*](2023/7_19.md)
    - [Reading Files With a Scala Function or Procedure Handler — *Preview*](2023/7_19.md)
    - [Reading Files With a Python Function or Procedure — *Preview*](2023/7_19.md)
    - [Schema Detection for JSON and CSV — *Preview*](2023/7_19.md)
    - [Table Schema Evolution — *Preview*](2023/7_19.md)
  + [SQL Updates](2023/7_19.md)
    - [Support for Python 3.9 in Snowpark, UDFs, and Stored Procedures — *Preview*](2023/7_19.md)
    - [UDFs, UDTFs, and Stored Procedures Support Passing Arguments by Name](2023/7_19.md)
  + [Data Science Updates](2023/7_19.md)
    - [Work With Snowflake’s Upcoming ML features](2023/7_19.md)
  + [Organization Updates](2023/7_19.md)
    - [ACCOUNTS View (Organization Usage) — *Preview*](2023/7_19.md)
  + [Web Interface Updates](2023/7_19.md)
    - [New Organizations Only Have Snowsight Access](2023/7_19.md)
* [May 31-June 01, 2023 — 7.18 Release (no announcements)](2023/7_18.md)

## Feature updates in 2023

* [December 20, 2023 — Snowpark Container Services Release Notes](2023/other/2023-12-20.md)
* [December 15, 2023 — Cost Management Release Notes](2023/other/2023-12-15.md)
  + [Cost Management: Account Overview Page — *Preview*](2023/other/2023-12-15.md)
* [December 01, 2023 — Streamlit in Snowflake Release Notes](2023/other/2023-12-01.md)

---
title: Server releases and feature updates in 2024
source: https://docs.snowflake.com/en/release-notes/new-features-2024.md
section: Release Notes
---

# Server releases and feature updates in 2024

The following sections list the release notes for the server releases and feature updates in 2024:

* Server releases in 2024
* Feature updates in 2024

For more recent releases and feature updates, see [Snowflake server release notes and feature updates](new-features.md).

## Server releases in 2024

* [December 16-18, 2024 — 8.47 Release Notes](2024/8_47.md)
  + [SQL updates](2024/8_47.md)
    - [New SQL functions](2024/8_47.md)
  + [Extensibility updates](2024/8_47.md)
    - [Support for a wildcard character in network rule network identifiers —– *Preview*](2024/8_47.md)
  + [Data pipeline updates](2024/8_47.md)
    - [Dynamic tables: Maximum number of dynamic tables in an account increased to 10,000](2024/8_47.md)
  + [Data governance updates](2024/8_47.md)
    - [OBJECT_DEPENDENCIES view: Support for dynamic tables](2024/8_47.md)
  + [Release notes change log](2024/8_47.md)
* [December 09-13, 2024 — 8.46 Release Notes](2024/8_46.md)
  + [New features](2024/8_46.md)
    - [Restricted caller’s rights — *Preview*](2024/8_46.md)
  + [Snowsight updates](2024/8_46.md)
    - [New login screen version](2024/8_46.md)
  + [SQL Updates](2024/8_46.md)
    - [New SQL functions](2024/8_46.md)
  + [Release notes change log](2024/8_46.md)
* [December 03-05, 2024 — 8.45 Release Notes](2024/8_45.md)
  + [SQL updates](2024/8_45.md)
    - [Snowflake Scripting: Asynchronous child jobs — *Preview*](2024/8_45.md)
  + [Extensibility updates](2024/8_45.md)
    - [Profiling Python stored procedure handlers — *Preview*](2024/8_45.md)
    - [Java 17 support — *General Availability*](2024/8_45.md)
  + [Data pipeline updates](2024/8_45.md)
    - [Dynamic tables: Unlimited inputs](2024/8_45.md)
  + [Release notes change log](2024/8_45.md)
* [November 18-21, 2024 — 8.44 Release Notes](2024/8_44.md)
  + [New features](2024/8_44.md)
    - [Outbound private connectivity for Snowflake features](2024/8_44.md)
    - [Visual Studio Code extension for Snowpark Python — *General availability*](2024/8_44.md)
  + [Extensibility updates](2024/8_44.md)
    - [External network access for Azure Gov regions — *General availability*](2024/8_44.md)
  + [Data lake updates](2024/8_44.md)
    - [Specify an external ID for SIGV4 REST catalog integrations](2024/8_44.md)
  + [Release notes change log](2024/8_44.md)
* [November 12-14, 2024 — 8.43 Release Notes](2024/8_43.md)
  + [New features](2024/8_43.md)
    - [Full-text search — *General availability*](2024/8_43.md)
    - [Leaked password protection](2024/8_43.md)
    - [Tasks: Python and JVM support for serverless tasks — *General availability*](2024/8_43.md)
  + [SQL updates](2024/8_43.md)
    - [EXECUTE IMMEDIATE FROM: Support for using content from staged files in templates](2024/8_43.md)
    - [Automatic logging and tracing for Snowflake Scripting stored procedures](2024/8_43.md)
    - [ACCOUNT_USAGE: New SERVERLESS_ALERT_HISTORY view](2024/8_43.md)
  + [Extensibility updates](2024/8_43.md)
    - [Authentication with AWS IAM from procedures and functions — *General availability*](2024/8_43.md)
  + [Listings updates](2024/8_43.md)
    - [LISTING_REFRESH_HISTORY — *General availability*](2024/8_43.md)
  + [Data pipeline updates](2024/8_43.md)
    - [Dynamic tables: Support for replication across different failover groups](2024/8_43.md)
  + [Data Lake updates](2024/8_43.md)
    - [Apache Iceberg™ tables: Support for Microsoft Fabric OneLake storage — *Preview*](2024/8_43.md)
  + [Release notes change log](2024/8_43.md)
* [November 04-06, 2024 — 8.42 Release Notes](2024/8_42.md)
  + [New features](2024/8_42.md)
    - [Trust Center: Two new scanners in the Security Essentials scanner package](2024/8_42.md)
    - [Serverless alerts — *General availability*](2024/8_42.md)
  + [SQL updates](2024/8_42.md)
    - [PARSE_JSON and TRY_PARSE_JSON functions: Duplicate keys are now allowed](2024/8_42.md)
  + [Extensibility updates](2024/8_42.md)
    - [New Tensorflow version might require specifying Keras](2024/8_42.md)
  + [Data pipeline updates](2024/8_42.md)
    - [Tasks: Serverless tasks user control — *General availability*](2024/8_42.md)
    - [Tasks: Task success notifications — *General availability*](2024/8_42.md)
  + [AI & ML updates](2024/8_42.md)
    - [API-level Role-based Access Control (RBAC) for Cortex Analyst](2024/8_42.md)
  + [Release notes change log](2024/8_42.md)
* [October 28-30, 2024 — 8.41 Release Notes](2024/8_41.md)
  + [New features](2024/8_41.md)
    - [Outbound private connectivity for Snowflake features](2024/8_41.md)
    - [EXECUTE IMMEDIATE FROM: Preview SQL rendered from Jinja2 templates](2024/8_41.md)
    - [GENERATE_SYNTHETIC_DATA: New system stored procedure for generating synthetic data — *Preview*](2024/8_41.md)
  + [Security updates](2024/8_41.md)
    - [Increased limits for network policies on internal stages](2024/8_41.md)
  + [SQL updates](2024/8_41.md)
    - [Extended support for bind variables](2024/8_41.md)
  + [Extensibility updates](2024/8_41.md)
    - [Writing files from Snowpark Python UDFs and UDTFs — *Preview*](2024/8_41.md)
  + [Release notes change log](2024/8_41.md)
* [October 21-23, 2024 — 8.40 Release Notes](2024/8_40.md)
  + [New features](2024/8_40.md)
    - [Trust Center: New Threat Intelligence scanner package](2024/8_40.md)
    - [Estimate the cost of Automatic Clustering — *General availability*](2024/8_40.md)
    - [Snowflake REST APIs — *General availability*](2024/8_40.md)
  + [Deprecated features](2024/8_40.md)
    - [Snowflake REST APIs](2024/8_40.md)
  + [SQL updates](2024/8_40.md)
    - [New SQL functions](2024/8_40.md)
  + [Data lake updates](2024/8_40.md)
    - [Apache Iceberg™ tables: Catalog integration for Iceberg REST — *General availability*](2024/8_40.md)
  + [Release notes change log](2024/8_40.md)
* [October 14-17, 2024 — 8.39 Release Notes](2024/8_39.md)
  + [New features](2024/8_39.md)
    - [Cortex Analyst fully supported in Streamlit in Snowflake](2024/8_39.md)
  + [Data pipeline updates](2024/8_39.md)
    - [Dynamic tables: Changes to the output of the GET_DDL function](2024/8_39.md)
  + [Data lake updates](2024/8_39.md)
    - [Apache Iceberg™ tables: New SYSTEM$VERIFY_EXTERNAL_VOLUME function](2024/8_39.md)
  + [Release notes change log](2024/8_39.md)
* [October 07-09, 2024 — 8.38 Release Notes (with behavior changes)](2024/8_38.md)
  + [Behavior change bundles](2024/8_38.md)
  + [SQL updates](2024/8_38.md)
    - [New SQL functions](2024/8_38.md)
    - [Query objects larger than 16 MB in files on a stage](2024/8_38.md)
  + [Data pipeline updates](2024/8_38.md)
    - [Dynamic tables: Updates to input types](2024/8_38.md)
  + [Data governance updates](2024/8_38.md)
    - [Data quality: New SYSTEM$DATA_METRIC_SCAN function](2024/8_38.md)
  + [Release notes change log](2024/8_38.md)
* [September 30 - October 03, 2024 — 8.37 Release Notes](2024/8_37.md)
  + [SQL updates](2024/8_37.md)
    - [New SQL functions](2024/8_37.md)
  + [Extensibility updates](2024/8_37.md)
    - [Authentication with AWS IAM from procedures and functions — *Preview*](2024/8_37.md)
  + [Release notes change log](2024/8_37.md)
* [September 23-26, 2024 — 8.36 Release Notes](2024/8_36.md)
  + [Data lake updates](2024/8_36.md)
    - [Cloning support for Snowflake-managed Apache Iceberg™ tables — *Preview*](2024/8_36.md)
  + [Release notes change log](2024/8_36.md)
* [September 18-20, 2024 — 8.35 Release Notes](2024/8_35.md)
  + [SQL updates](2024/8_35.md)
    - [RANGE BETWEEN support for FIRST_VALUE and LAST_VALUE functions](2024/8_35.md)
  + [Extensibility updates](2024/8_35.md)
    - [Telemetry data to the event table from Snowflake Notebook cells temporarily disabled](2024/8_35.md)
    - [pandas on Snowflake - *General Availability*](2024/8_35.md)
  + [Data lake updates](2024/8_35.md)
    - [Apache Iceberg™ tables: Automated refresh — *Preview*](2024/8_35.md)
  + [Release notes change log](2024/8_35.md)
* [September 09-11, 2024 — 8.34 Release Notes](2024/8_34.md)
  + [Data loading/unloading updates](2024/8_34.md)
    - [The vectorized scanner option supports client-side encryption](2024/8_34.md)
  + [Data pipeline updates](2024/8_34.md)
    - [Dynamic tables: New DYNAMIC_TABLE_REFRESH_HISTORY account usage view](2024/8_34.md)
    - [Tasks: Python and JVM support for serverless tasks - *Preview*](2024/8_34.md)
  + [Release notes change log](2024/8_34.md)
* [September 03-05, 2024 — 8.33 Release Notes](2024/8_33.md)
  + [New features](2024/8_33.md)
    - [Snowflake REST APIs — *Preview*](2024/8_33.md)
  + [SQL updates](2024/8_33.md)
    - [SHOW commands: Support for new WITH PRIVILEGES parameter](2024/8_33.md)
  + [Data lake updates](2024/8_33.md)
    - [Apache Iceberg™ tables: Catalog integration for Iceberg REST — *Preview*](2024/8_33.md)
    - [Iceberg tables: Delta table support — *Preview*](2024/8_33.md)
  + [Release notes change log](2024/8_33.md)
* [August 26-30, 2024 — 8.32 Release Notes (with behavior changes)](2024/8_32.md)
  + [Behavior change bundles](2024/8_32.md)
  + [SQL updates](2024/8_32.md)
    - [New SQL functions](2024/8_32.md)
  + [Data pipeline updates](2024/8_32.md)
    - [Tasks: A new option for ALTER TASK](2024/8_32.md)
  + [Release notes change log](2024/8_32.md)
* [August 19-21, 2024 — 8.31 Release Notes](2024/8_31.md)
  + [Data lake updates](2024/8_31.md)
    - [Snowflake Open Catalog: New system function for troubleshooting issues with syncing Snowflake-managed Apache Iceberg™ tables - *Preview*](2024/8_31.md)
    - [Apache Iceberg™ tables: Support for time travel queries using third-party engines — *General availability*](2024/8_31.md)
  + [Release notes change log](2024/8_31.md)
* [August 11-14, 2024 — 8.30 Release Notes](2024/8_30.md)
  + [New features](2024/8_30.md)
    - [Outbound private connectivity with Azure External Network Access and External Functions — *Preview*](2024/8_30.md)
    - [Full-text search - *Preview*](2024/8_30.md)
  + [SQL updates](2024/8_30.md)
    - [Setting users as SNOWFLAKE_SUPPORT users no longer supported](2024/8_30.md)
    - [RANGE BETWEEN with explicit offsets: Additional window functions supported](2024/8_30.md)
    - [UNDROP command: Support for restoring objects using ID](2024/8_30.md)
    - [Wildcard filtering for functions](2024/8_30.md)
  + [Data loading / unloading updates](2024/8_30.md)
    - [Loading unstructured data with Document AI — *Preview*](2024/8_30.md)
  + [Release notes change log](2024/8_30.md)
* [August 07-08, 2024 — 8.29 Release Notes](2024/8_29.md)
  + [Security updates](2024/8_29.md)
    - [Session policies: Support added for secondary roles](2024/8_29.md)
  + [Extensibility updates](2024/8_29.md)
    - [Python user-defined aggregate functions — *General availability*](2024/8_29.md)
    - [Access to Git repositories from Snowflake — *General availability*](2024/8_29.md)
  + [Data lake updates](2024/8_29.md)
    - [Apache Iceberg™ tables: Support for government regions — *General availability*](2024/8_29.md)
  + [Release notes change log](2024/8_29.md)
* [July 29-August 01, 2024 — 8.28 Release Notes](2024/8_28.md)
  + [SQL updates](2024/8_28.md)
    - [New SQL functions](2024/8_28.md)
    - [CREATE and ALTER commands for replication and failover groups: Support added for tags](2024/8_28.md)
    - [Account Usage: New SEARCH_OPTIMIZATION_BENEFITS view](2024/8_28.md)
  + [Data governance updates](2024/8_28.md)
    - [Object Tagging: Support added for replication and failover groups](2024/8_28.md)
    - [Data Quality and data metric functions (DMFs) — *General Availability*](2024/8_28.md)
  + [Data loading/unloading updates](2024/8_28.md)
    - [Snowpipe: New output in SYSTEM$PIPE_STATUS](2024/8_28.md)
  + [Data pipelines updates](2024/8_28.md)
    - [Dynamic tables: Support for incremental lateral flatten](2024/8_28.md)
  + [Data lake updates](2024/8_28.md)
    - [Apache Iceberg™ tables: Support for Snowflake Open Catalog — *Preview*](2024/8_28.md)
  + [Release notes change log](2024/8_28.md)
* [July 22-25, 2024 — 8.27 Release Notes (with behavior changes)](2024/8_27.md)
  + [Behavior change bundles](2024/8_27.md)
  + [New features](2024/8_27.md)
    - [Support for sending webhook notifications to Slack, Microsoft Teams, and PagerDuty](2024/8_27.md)
    - [Triggered tasks — *General availability*](2024/8_27.md)
  + [SQL updates](2024/8_27.md)
    - [GET_DDL function: Support for warehouses](2024/8_27.md)
  + [Data governance updates](2024/8_27.md)
    - [Custom Data Classification — *General availability*](2024/8_27.md)
    - [Data Classification of tables in a schema with Snowsight — *General availability*](2024/8_27.md)
  + [Release notes change log](2024/8_27.md)
* [July 15-17, 2024 — 8.26 Release Notes](2024/8_26.md)
  + [New features](2024/8_26.md)
    - [Estimate the cost of Automatic Clustering — *Preview*](2024/8_26.md)
  + [SQL updates](2024/8_26.md)
    - [New TYPE property for USER —— *General availability*](2024/8_26.md)
  + [Extensibility updates](2024/8_26.md)
    - [Access to external network locations on AWS in the Gov region — *General availability*](2024/8_26.md)
  + [Cost management updates](2024/8_26.md)
    - [Support for Snowpark Container Services in custom budgets — *General availability*](2024/8_26.md)
  + [Release notes change log](2024/8_26.md)
* [July 08-12, 2024 — 8.25 Release Notes](2024/8_25.md)
  + [SQL updates](2024/8_25.md)
    - [Wildcards are now supported in OBJECT constants](2024/8_25.md)
  + [Release notes change log](2024/8_25.md)
* [July 01-03, 2024 — 8.24 Release Notes](2024/8_24.md)
  + [Security updates](2024/8_24.md)
    - [Trust Center: CIS Benchmarks scanner package — *General availability*](2024/8_24.md)
    - [Authentication policies: New multi-factor authentication parameters](2024/8_24.md)
  + [Virtual warehouse updates](2024/8_24.md)
    - [Hybrid tables: Changes to capacity quotas — *Preview*](2024/8_24.md)
  + [Release notes change log](2024/8_24.md)
* [June 17-30, 2024 — 8.23 Release Notes](2024/8_23.md)
  + [Security updates](2024/8_23.md)
    - [Trust Center: Security Essentials scanner package — *General availability*](2024/8_23.md)
  + [SQL updates](2024/8_23.md)
    - [Window functions: extended support for RANGE BETWEEN — *Preview*](2024/8_23.md)
    - [Account Usage: TABLE_DML_HISTORY and TABLE_PRUNING_HISTORY views — *General availability*](2024/8_23.md)
  + [Data governance updates](2024/8_23.md)
    - [Data quality: add new system DMFs — *Preview*](2024/8_23.md)
  + [Data pipelines updates](2024/8_23.md)
    - [ALTER DYNAMIC TABLE command: Support for adding search optimization and setting additional properties](2024/8_23.md)
  + [Snowflake Native App Framework](2024/8_23.md)
    - [Updates to logging and tracing for a Snowflake Native App — *Preview*](2024/8_23.md)
    - [New events generated during app installation and upgrade — *Preview*](2024/8_23.md)
  + [Release notes change log](2024/8_23.md)
* [June 10-15, 2024 — 8.22 Release Notes (with behavior changes)](2024/8_22.md)
  + [Behavior change bundles](2024/8_22.md)
  + [SQL updates](2024/8_22.md)
    - [Some bitwise expression functions support BINARY data](2024/8_22.md)
  + [Virtual warehouse updates](2024/8_22.md)
    - [Account Usage: WAREHOUSE_EVENTS_HISTORY view — *General availability*](2024/8_22.md)
  + [Release notes change log](2024/8_22.md)
* [May 28-30, 2024 — 8.21 Release Notes](2024/8_21.md)
  + [New features](2024/8_21.md)
    - [Triggered tasks — *Preview*](2024/8_21.md)
  + [SQL updates](2024/8_21.md)
    - [Email notification integrations no longer limited to 10](2024/8_21.md)
    - [UNPIVOT supports rows with NULLs in results](2024/8_21.md)
  + [Data loading / unloading updates](2024/8_21.md)
    - [New Parquet file format option USE_VECTORIZED_SCANNER — *General Availability*](2024/8_21.md)
  + [Streamlit in Snowflake updates](2024/8_21.md)
    - [Support for v1.29.0 and v1.31.1 of the Streamlit library](2024/8_21.md)
  + [Release notes change log](2024/8_21.md)
* [May 20-22, 2024 — 8.20 Release Notes](2024/8_20.md)
  + [New features](2024/8_20.md)
    - [Trust Center — *Preview*](2024/8_20.md)
  + [SQL updates](2024/8_20.md)
    - [CREATE OR ALTER TABLE and CREATE OR ALTER TASK — *Preview*](2024/8_20.md)
  + [Apache Iceberg™ table updates](2024/8_20.md)
    - [Apache Iceberg™ tables — *General availability*](2024/8_20.md)
    - [Replace invalid UTF-8 characters in Iceberg tables](2024/8_20.md)
    - [Structured type evolution for Iceberg tables](2024/8_20.md)
    - [Set a storage serialization policy](2024/8_20.md)
    - [Change ALLOW_WRITES to FALSE for external volumes](2024/8_20.md)
    - [New ICEBERG_ACCESS_ERRORS view](2024/8_20.md)
  + [Release notes change log](2024/8_20.md)
* [May 13-15, 2024 — 8.19 Release Notes](2024/8_19.md)
  + [New features](2024/8_19.md)
    - [Serverless alerts — *Preview*](2024/8_19.md)
  + [Security updates](2024/8_19.md)
    - [Tri-Secret Secure self-registration](2024/8_19.md)
  + [SQL updates](2024/8_19.md)
    - [Jinja2 template support for EXECUTE IMMEDIATE FROM — *Preview*](2024/8_19.md)
  + [Data loading/unloading updates](2024/8_19.md)
    - [Resolved a known issue for INCLUDE_METADATA copy option](2024/8_19.md)
  + [Release notes change log](2024/8_19.md)
* [May 08-09, 2024 — 8.18 Release Notes](2024/8_18.md)
  + [SQL updates](2024/8_18.md)
    - [Dynamic pivot available](2024/8_18.md)
    - [Added support for structured data types in UDFs](2024/8_18.md)
    - [New SQL functions](2024/8_18.md)
  + [Extensibility updates](2024/8_18.md)
    - [Python user-defined aggregate functions — *Preview*](2024/8_18.md)
    - [Access to external network locations on AWS in the Gov region — *Preview*](2024/8_18.md)
  + [Release notes change log](2024/8_18.md)
* [April 30-May 07, 2024 — 8.17 Release Notes (with behavior changes)](2024/8_17.md)
  + [Behavior change bundles](2024/8_17.md)
  + [Security updates](2024/8_17.md)
    - [Authentication enhancements — *General Availability*](2024/8_17.md)
  + [SQL updates](2024/8_17.md)
    - [READ ONLY property available for tables](2024/8_17.md)
    - [ST_INTERSECTION_AGG and ST_UNION_AGG functions — *General Availability*](2024/8_17.md)
  + [Data loading /unloading updates](2024/8_17.md)
    - [New copy option: INCLUDE_METADATA](2024/8_17.md)
  + [Release notes change log](2024/8_17.md)
* [April 22-24, 2024 — 8.16 Release Notes](2024/8_16.md)
  + [SQL updates](2024/8_16.md)
    - [New SQL command(s)](2024/8_16.md)
    - [SQL API support for hybrid tables](2024/8_16.md)
  + [Extensibility updates](2024/8_16.md)
    - [Asynchronous job support in Snowpark stored procedures](2024/8_16.md)
  + [Data Lake Updates](2024/8_16.md)
    - [Apache Iceberg™ tables: Support for un-materialized identity partition columns](2024/8_16.md)
  + [Release notes change log](2024/8_16.md)
* [April 17-19, 2024 — 8.15 Release Notes](2024/8_15.md)
  + [SQL updates](2024/8_15.md)
    - [New SQL functions](2024/8_15.md)
  + [Data loading / unloading updates](2024/8_15.md)
    - [Support for granting the READ and WRITE privileges on external stages](2024/8_15.md)
  + [Release notes change log](2024/8_15.md)
* [April 08-15, 2024 — 8.14 Release Notes](2024/8_14.md)
  + [New regions](2024/8_14.md)
  + [Extensibility updates](2024/8_14.md)
    - [Python UDTFs with vectorized process methods — *General Availability*](2024/8_14.md)
  + [Snowflake Cortex updates](2024/8_14.md)
    - [Forecasting improvements in Snowflake Cortex ML Functions](2024/8_14.md)
  + [Release notes change log](2024/8_14.md)
* [April 01-03, 2024 — 8.13 Release Notes](2024/8_13.md)
  + [Snowflake Cortex updates](2024/8_13.md)
    - [Evaluation Metrics for Forecasting and Anomaly Detection](2024/8_13.md)
  + [SQL updates](2024/8_13.md)
    - [Fixed an issue with the PARSE_IP function](2024/8_13.md)
    - [Fixed an issue with the SPLIT_PART function](2024/8_13.md)
  + [Extensibility updates](2024/8_13.md)
    - [Access to Git repositories from Snowflake — *Preview*](2024/8_13.md)
  + [Release notes change log](2024/8_13.md)
* [March 26-27, 2024 — 8.12 Release Notes (with behavior changes)](2024/8_12.md)
  + [Behavior Change Bundles](2024/8_12.md)
  + [SQL Updates](2024/8_12.md)
    - [Organization Usage: Improved views for billing reconciliation — *General Availability*](2024/8_12.md)
  + [Data Pipeline Updates](2024/8_12.md)
    - [Replication: Stages, pipes, storage integrations, load history, and Snowpipe Streaming — *General Availability*](2024/8_12.md)
    - [Schema detection and evolution for Kafka connector with Snowpipe Streaming — *General Availability*](2024/8_12.md)
  + [Data Governance Updates](2024/8_12.md)
    - [Memoizable functions with constant arguments](2024/8_12.md)
    - [Share data protected by a role-based policy — *General Availability*](2024/8_12.md)
    - [Access History: Stored procedure ancestor queries — *General Availability*](2024/8_12.md)
    - [Shared tag references — *General Availability*](2024/8_12.md)
    - [Access History: Track objects modified by a DDL operation — *General Availability*](2024/8_12.md)
  + [Release Notes Change Log](2024/8_12.md)
* [March 18-20, 2024 — 8.11 Release Notes](2024/8_11.md)
  + [SQL updates](2024/8_11.md)
    - [SELECT supports trailing commas](2024/8_11.md)
  + [Data loading / unloading Updates](2024/8_11.md)
    - [Performance improvements for loading JSON files](2024/8_11.md)
    - [Improvements to the SNOWPIPE_STREAMING_CLIENT_HISTORY view](2024/8_11.md)
  + [Release notes change log](2024/8_11.md)
* [March 11-12, 2024 — 8.10 Release Notes (no announcements)](2024/8_10.md)
  + [Announcements](2024/8_10.md)
  + [Release Notes Change Log](2024/8_10.md)
* [March 04-05, 2024 — 8.9 Release Notes](2024/8_09.md)
  + [Non-bundled Behavior Changes](2024/8_09.md)
  + [Data Governance Updates](2024/8_09.md)
    - [Custom Classification — *Preview*](2024/8_09.md)
  + [Data lake updates](2024/8_09.md)
    - [Primary key information added to Apache Iceberg™ table metadata](2024/8_09.md)
  + [Release Notes Change Log](2024/8_09.md)
* [February 26-28, 2024 — 8.8 Release Notes](2024/8_08.md)
  + [Data Lake Updates](2024/8_08.md)
    - [Secure Data Sharing for Apache Iceberg™ tables](2024/8_08.md)
    - [Query an Apache Iceberg™ table without granting the USAGE privilege on related objects](2024/8_08.md)
  + [Extensibility Updates](2024/8_08.md)
    - [External network access — *General Availability*](2024/8_08.md)
  + [Release Notes Change Log](2024/8_08.md)
* [February 19-21, 2024 — 8.7 Release Notes (with behavior changes)](2024/8_07.md)
  + [Behavior Change Bundles](2024/8_07.md)
  + [SQL Updates](2024/8_07.md)
    - [SQL Functions Add Support for the `upper`, `lower`, and `trim` Collation Specifiers](2024/8_07.md)
  + [Data Lake Updates](2024/8_07.md)
    - [GET_DDL for external tables supports fully-qualified location names](2024/8_07.md)
  + [Release Notes Change Log](2024/8_07.md)
* [February 12-14, 2024 — 8.6 Release Notes](2024/8_06.md)
  + [SQL Updates](2024/8_06.md)
    - [New SQL functions](2024/8_06.md)
  + [Data Loading / Unloading Updates](2024/8_06.md)
    - [Specify an external ID for AWS storage access](2024/8_06.md)
  + [Release Notes Change Log](2024/8_06.md)
* [February 05-06, 2024 — 8.5 Release Notes](2024/8_05.md)
  + [Security Updates](2024/8_05.md)
    - [External API authentication and secrets — *General Availability*](2024/8_05.md)
  + [Extensibility Updates](2024/8_05.md)
    - [External network access — *General Availability*](2024/8_05.md)
    - [Python packages policies — *General Availability*](2024/8_05.md)
  + [Data Loading / Unloading Updates](2024/8_05.md)
    - [COPY FILES — *Preview*](2024/8_05.md)
  + [Data Governance Updates](2024/8_05.md)
    - [Data Classification: Asynchronous tag assignments for columns of tables in a schema and automate tagging for a single classification event — *Preview*](2024/8_05.md)
  + [Release Notes Change Log](2024/8_05.md)
* [January 29-30, 2024 — 8.4 Release Notes](2024/8_04.md)
  + [Security Updates](2024/8_04.md)
    - [Authentication enhancements — *Preview*](2024/8_04.md)
  + [Virtual Warehouse Updates](2024/8_04.md)
    - [Larger warehouses — *General Availability in Microsoft Azure Regions*](2024/8_04.md)
  + [Extensibility Updates](2024/8_04.md)
    - [External network access — *Preview*](2024/8_04.md)
    - [Java 17 support — *Preview*](2024/8_04.md)
  + [Data Loading / Unloading Updates](2024/8_04.md)
    - [Snowpipe update: a new pipe status](2024/8_04.md)
  + [Data Pipeline Updates](2024/8_04.md)
    - [Automatic task graph retry — *General Availability*](2024/8_04.md)
  + [Release Notes Change Log](2024/8_04.md)
* [January 22-23, 2024 — 8.3 Release Notes](2024/8_03.md)
  + [Security Updates](2024/8_03.md)
    - [Network rules — *General Availability*](2024/8_03.md)
    - [Enhanced network security — *General Availability*](2024/8_03.md)
    - [Network isolation to internal stages using AWS PrivateLink — *General Availability*](2024/8_03.md)
  + [Release Notes Change Log](2024/8_03.md)
* [January 15-17, 2024 — 8.2 Release Notes (with behavior changes)](2024/8_02.md)
  + [Behavior Change Bundles](2024/8_02.md)
  + [New Features](2024/8_02.md)
    - [Access History: Support added for stored procedure ancestor queries — Preview](2024/8_02.md)
  + [Release Notes Change Log](2024/8_02.md)
* [January 08-10, 2024 — 8.1 Release Notes](2024/8_01.md)
  + [New Features](2024/8_01.md)
    - [EXECUTE IMMEDIATE FROM File — *General Availability*](2024/8_01.md)
  + [SQL Updates](2024/8_01.md)
    - [CREATE <object> … CLONE command: New parameter](2024/8_01.md)
  + [New SQL functions](2024/8_01.md)
  + [Extensibility Updates](2024/8_01.md)
    - [Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*](2024/8_01.md)
  + [Release Notes Change Log](2024/8_01.md)
* [January 03-04, 2024 — 8.0 Release Notes](2024/8_00.md)
  + [Extensibility Updates](2024/8_00.md)
    - [Account Usage: New EXTERNAL_ACCESS_HISTORY View](2024/8_00.md)
  + [Data Collaboration Updates](2024/8_00.md)
    - [Organization Usage: New LISTING_AUTO_FULFILLMENT_USAGE_HISTORY View](2024/8_00.md)
  + [Release Notes Change Log](2024/8_00.md)

## Feature updates in 2024

* [Dec 20, 2024: Support for Streamlit 1.39.0 (Preview)](2024/other/2024-12-20-sis.md)
* [December 19, 2024 — Snowflake Native Apps with Azure Private Link support — *General Availability*](2024/other/2024-12-19-na-az-gov-ga.md)
* [December 19, 2024 — Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link — *Preview*](2024/other/2024-12-19-notebooks-wh-aws-azure-pl.md)
* [December 19, 2024 — New homepage for Snowsight —– *General Availability*](2024/other/2024-12-19-snowsight-homepage-ga.md)
* [December 18, 2024 — Inbound private connectivity to Snowpark Container Services for accounts on AWS — *Preview*](2024/other/2024-12-18-spcs-aws-inbound-private-connectivity.md)
* [Dec 16, 2024: Azure Private Link in Streamlit in Snowflake (General Availability)](2024/other/2024-12-16-sis.md)
* [December 12, 2024 — Document AI release notes](2024/other/2024-12-12-document-ai.md)
* [December 09, 2024 — Organizational listings: Discovery and access — *Preview*](2024/other/2024-12-09-dbna.md)
* [December 09, 2024 — Snowflake Native Apps with Azure Private Link support —– *Preview*](2024/other/2024-12-09-na-az-privatelink.md)
* [December 09, 2024 — Using block storage with Snowpark Container Services job services — *Preview*](2024/other/2024-12-09-spcs-block-storage-for-jobs-in-preview.md)
* [December 05, 2024 — Snowflake Cortex Powered Descriptions — *General Availability*](2024/other/2024-12-05-cortex-descriptions.md)
* [December 05, 2024 — Private Notebooks in a Personal Database — *Deprecated*](2024/other/2024-12-05-personal-db-private-nb.md)
* [Dec 4, 2024: Azure Private Link in Streamlit in Snowflake (Preview)](2024/other/2024-12-04-sis.md)
* [December 03, 2024 — Snowflake Native Apps in Azure Government regions — *Preview*](2024/other/2024-12-03-na-ga-gov-azure.md)
* [November 27, 2024 — Snowflake Native Apps: Multiple app installs — *General Availability*](2024/other/2024-11-27-na-mult-install.md)
* [November 25, 2024 — Snowflake Cortex AI TRANSLATE — Updates](2024/other/2024-11-25-cortex-translate-update.md)
* [November 25, 2024 — Data governance release notes](2024/other/2024-11-26-data-governance.md)
  + [Governance for organization listings through access history](2024/other/2024-11-26-data-governance.md)
* [November 21, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-11-21-dcr.md)
  + [Non-overlap metrics](2024/other/2024-11-21-dcr.md)
  + [Unlink datasets API](2024/other/2024-11-21-dcr.md)
  + [Dynamic table support](2024/other/2024-11-21-dcr.md)
  + [Custom Python code in consumer templates](2024/other/2024-11-21-dcr.md)
  + [Merkury Identity connector](2024/other/2024-11-21-dcr.md)
  + [Google Display & Video 360 - Customer Match activation connector](2024/other/2024-11-21-dcr.md)
* [November 21, 2024 — Logical replication of clones — *General Availability*](2024/other/2024-11-21-logical-repl-clones.md)
* [November 20, 2024 — Snowsight rate limits — *General Availability*](2024/other/2024-11-20-snowsight-rate-limits.md)
* [November 18, 2024 — S3-compatible storage for externally managed Apache Iceberg™ tables — *General Availability*](2024/other/2024-11-18-s3-compatible-externally-managed-iceberg-ga.md)
* [November 18, 2024 — Sensitive data classification](2024/other/2024-11-18-sensitive-data-classification.md)
  + [Automatic Sensitive Data Classification — *Preview*](2024/other/2024-11-18-sensitive-data-classification.md)
  + [Classifier improvements](2024/other/2024-11-18-sensitive-data-classification.md)
* [November 15, 2024 — Apache Iceberg™ tables: Efficient bulk loading, continuous ingestion, and data streaming — *General Availability*](2024/other/2024-11-15-iceberg-tables-loading.md)
  + [COPY INTO <table> and Snowpipe continuous file ingestion](2024/other/2024-11-15-iceberg-tables-loading.md)
  + [Snowpipe Streaming](2024/other/2024-11-15-iceberg-tables-loading.md)
* [November 14, 2024 — Cortex Analyst](2024/other/2024-11-14-cortex-analyst.md)
  + [Multi-turn conversation in Cortex Analyst — *Preview*](2024/other/2024-11-14-cortex-analyst.md)
  + [Joins support in Cortex Analyst — *Preview*](2024/other/2024-11-14-cortex-analyst.md)
* [November 14, 2024 — Manage account preview features — *General Availability*](2024/other/2024-11-14-manage-preview.md)
* [November 13, 2024 — Hybrid tables support extended to additional AWS regions](2024/other/2024-11-13-hybrid-tables-ga-regions.md)
* [November 12, 2024 — Budgets: Support for cloud provider queue and webhook notifications](2024/other/2024-11-12-budget-notification-queue-webhook.md)
* [November 12, 2024 — Additional CREATE OR ALTER commands — *Preview*](2024/other/2024-11-12-create-or-alter-pupr.md)
* [November 12, 2024 — Dynamic tables: Support for reading from Snowflake-managed Iceberg tables and creating dynamic Apache Iceberg™ tables –— *General Availability*](2024/other/2024-11-12-dynamic-iceberg-tables.md)
* [November 12, 2024 — Classification (Snowflake ML Function) — *General Availability*](2024/other/2024-11-12-ml-functions-classification-ga.md)
* [November 12, 2024 — Organizational listings and the Internal Marketplace — *General Availability*](2024/other/2024-11-12-organizational-listings.md)
* [November 12, 2024 — Snowflake ML: Distributed Hyperparameter Optimization on Snowpark Container Services — *Preview*](2024/other/2024-11-12-snowflake-ml-hpo-spcs.md)
* [Nov 11, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-11-11-dcr.md)
  + [All developer API clean rooms now available in the web app](2024/other/2024-11-11-dcr.md)
  + [Provider run for custom web app templates](2024/other/2024-11-11-dcr.md)
  + [Provider and consumer activation in custom web app templates](2024/other/2024-11-11-dcr.md)
  + [SQL policy configuration updates](2024/other/2024-11-11-dcr.md)
  + [Sync and naming support for data connectors](2024/other/2024-11-11-dcr.md)
* [November 11, 2024 — Snowflake Native App Framework release notes](2024/other/2024-11-11-native-apps.md)
  + [Snowflake Native Apps with Snowpark Container Services in AWS — *General availability*](2024/other/2024-11-11-native-apps.md)
  + [Snowflake Native Apps with Snowpark Container Services in Azure — *Preview*](2024/other/2024-11-11-native-apps.md)
  + [Native App Framework support for Budgets](2024/other/2024-11-11-native-apps.md)
* [November 11, 2024 — Snowflake Notebooks Warehouse Runtime — *General Availability*](2024/other/2024-11-11-notebooks-wh-ga.md)
  + [Updates](2024/other/2024-11-11-notebooks-wh-ga.md)
* [November 08, 2024 — Grouped Query History — *Preview*](2024/other/2024-11-08-grouped-query-history.md)
* [November 08, 2024 — Snowflake Microsoft Sharepoint connector](2024/other/2024-11-08.md)
  + [Snowflake Connector for SharePoint](2024/other/2024-11-08.md)
* [November 06, 2024 — SPLIT_TEXT_RECURSIVE_CHARACTER Cortex function — *Preview*](2024/other/2024-11-06-split-text-recursive-character.md)
* [November 04, 2024 — Classify Text (Snowflake Cortex LLM Function) — *General Availability*](2024/other/2024-11-04-classify-text-ga.md)
* [November 04, 2024 — Data Lineage — *Preview*](2024/other/2024-11-04-data-lineage.md)
* [November 04, 2024 — Replication error notifications — *General Availability*](2024/other/2024-11-04-error-notifications.md)
* [November 04, 2024 — Snowflake Native Apps support for AWS Private Link — *General Availability*](2024/other/2024-11-04-na-aws-pl-ga.md)
* [November 04, 2024 — Top Insights (Snowflake ML Function) — *General Availability*](2024/other/2024-11-04-top-insights-ga.md)
* [Oct 31, 2024: Custom themes in Streamlit in Snowflake (Preview)](2024/other/2024-10-31-sis-custom-themes.md)
* [Oct 31, 2024: AWS PrivateLink in Streamlit in Snowflake (General Availability)](2024/other/2024-10-31-sis-privatelink.md)
* [October 30, 2024 — Hybrid tables — *General Availability*](2024/other/2024-10-30-hybrid-tables-ga.md)
* [October 29, 2024 — Universal Search in Virtual Private Snowflake (VPS)](2024/other/2024-10-29-snowsight-vps.md)
* [October 21, 2024 — Document AI — *General Availability*](2024/other/2024-10-21-document-ai.md)
* [October 18, 2024 —Apache Iceberg™ tables: Support for Snowflake Open Catalog — *General Availability*](2024/other/2024-10-18-snowflake-open-catalog-ga.md)
* [October 14, 2024 — Cortex Analyst: New regions](2024/other/2024-10-14-new-regions-cortex-analyst.md)
* [October 14, 2024 — Snowflake Data Clean Rooms release notes](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Clean room overlap stats](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Provider-initiated activation for third-party connectors](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
  + [Security scans for custom templates](2024/other/2024-10-14-snowflake-data-clean-rooms.md)
* [October 10, 2024 — CORTEX_FINE_TUNING_USAGE_HISTORY view — *General Availability*](2024/other/2024-10-10-cortex-finetuning-usage-history.md)
* [October 10, 2024 — CORTEX_SEARCH_SERVING_USAGE_HISTORY view — *General Availability*](2024/other/2024-10-10-cortex-search-serving-usage-history.md)
* [October 08, 2024 — Native App support for AWS PrivateLink — *Preview*](2024/other/2024-10-08-na-aws-pl.md)
* [October 07, 2024 — Updated event sharing for Snowflake Native Apps — *General Availability*](2024/other/2024-10-07-na-event-sharing.md)
* [Oct 07, 2024: AWS PrivateLink in Streamlit in Snowflake (Preview)](2024/other/2024-10-07-sis.md)
* [October 04, 2024 — Cortex Analyst integration with Cortex Search — *Preview*](2024/other/2024-10-04-cortex-analyst-search-integration.md)
* [October 04, 2024 — Suggested Questions for Cortex Analyst — *Preview*](2024/other/2024-10-04-cortex-analyst-suggested-questions.md)
* [October 04, 2024 — Cortex Search — *General Availability*](2024/other/2024-10-04-cortex-search-ga.md)
* [October 04, 2024 — Differential Privacy — *General Availability*](2024/other/2024-10-04-differential-privacy.md)
* [October 03, 2024 — New Cortex LLM Function - PARSE_DOCUMENT — *Preview*](2024/other/2024-10-03-parse-document.md)
* [October 02, 2024 — Organization accounts — *Preview*](2024/other/2024-10-01-organization-account.md)
* [October 02, 2024 — Notebooks on Container Runtime — *Preview*](2024/other/2024-10-02-notebooks-on-spcs.md)
* [October 01, 2024 — Cortex Fine-tuning Sharing — *Preview*](2024/other/2024-10-01-cortex-finetuning-sharing.md)
* [September 26, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-09-26-dcr.md)
  + [Branded clean room tiles](2024/other/2024-09-26-dcr.md)
  + [Consumer direct activation](2024/other/2024-09-26-dcr.md)
  + [Activation Hub column policies](2024/other/2024-09-26-dcr.md)
  + [Schedule analyses as a consumer](2024/other/2024-09-26-dcr.md)
  + [Clean room data stats](2024/other/2024-09-26-dcr.md)
  + [LiveRamp activation](2024/other/2024-09-26-dcr.md)
  + [The Trade Desk CRM activation](2024/other/2024-09-26-dcr.md)
  + [Managed account credit limit and monitoring](2024/other/2024-09-26-dcr.md)
  + [Audience Overlap, SQL Query and Custom template update](2024/other/2024-09-26-dcr.md)
* [September 26, 2024 — Snowpark-optimized Warehouse RESOURCE_CONSTRAINT — *Preview*](2024/other/2024-09-26-sow-resource-constraints.md)
* [September 25, 2024 — Snowflake Feature Store — *General Availability*](2024/other/2024-09-25-feature-store-ga.md)
* [September 25, 2024 — New models available in Snowflake Cortex AI](2024/other/2024-09-25-new-cortex-models.md)
* [September 24, 2024 — DOCUMENT_AI_USAGE_HISTORY view — *General Availability*](2024/other/2024-09-24-document-ai.md)
* [September 12, 2024 — New Cortex LLM Function - CLASSIFY_TEXT — *Preview*](2024/other/2024-09-12-classify-text-function.md)
* [September 12, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-09-12-dcr.md)
  + [Integration with Yahoo DSP](2024/other/2024-09-12-dcr.md)
  + [Integration with Google PAIR and Google DV 360](2024/other/2024-09-12-dcr.md)
* [September 12, 2024 — New multilingual embedding model available in Snowflake Cortex AI](2024/other/2024-09-12-voyage-embed-model.md)
* [September 09, 2024 — New AI21 model available in Snowflake Cortex AI](2024/other/2024-09-09-jamba-mini-model.md)
* [September 04, 2024 — Easier Training of Anomaly Detection Models from Real-World Data](2024/other/2024-09-04-anomaly-detection-preprocessing.md)
* [September 04, 2024 — Calling stored procedures in the FROM clause of SELECT statements](2024/other/2024-09-04-call-stored-procedure-in-from-clause.md)
* [September 01, 2024 — New Snowflake region](2024/other/2024-09-01-new-region.md)
  + [China (Ningxia) region - *General Availability*](2024/other/2024-09-01-new-region.md)
* [August 30, 2024 — Query attribution costs](2024/other/2024-08-30-per-query-cost-attribution.md)
  + [Account Usage: New QUERY_ATTRIBUTION_HISTORY view](2024/other/2024-08-30-per-query-cost-attribution.md)
* [August 29, 2024 — Cortex Analyst: New regions](2024/other/2024-08-29-cortex-analyst-new-regions.md)
* [August 29, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-08-29-dcr.md)
  + [RSA authentication for the service account user](2024/other/2024-08-29-dcr.md)
  + [Activation for provider-run analyses](2024/other/2024-08-29-dcr.md)
* [August 29, 2024 — New Mistral Large 2 model available in Snowflake Cortex AI](2024/other/2024-08-29-mistral-large2.md)
* [August 29, 2024 — New multilingual embedding models available in Snowflake Cortex AI](2024/other/2024-08-29-multilingual-embed-models.md)
* [August 28, 2024 — Snowflake ML Functions: Top Insights Preview Update](2024/other/2024-08-28-top-insights-preview-refresh.md)
* [August 26, 2024 — Easier Training of Forecasting Models from Real-World Data](2024/other/2024-08-26-forecasting-preprocessing.md)
* [August 26, 2024 — Time Series ML Functions — Error Message Improvements](2024/other/2024-08-26-time-series-error-message.md)
* [August 20, 2024 — Differential Privacy — *Preview*](2024/other/2024-08-16-diff-privacy.md)
* [August 20, 2024 — Cortex LLM Functions — Release Notes](2024/other/2024-08-20-new-region-llama-405b.md)
* [August 16, 2024 — Snowflake Native App Framework: Support for government regions on AWS](2024/other/2024-08-16-na-gov-cloud.md)
* [August 14, 2024 — Cortex Analyst –— *Preview*](2024/other/2024-08-14-cortex-analyst.md)
* [August 09, 2024 — Streamlit in Snowflake on AWS GovCloud –— *General Availability*](2024/other/2024-08-09-sis.md)
* [August 08, 2024 — Cross-region inference for Snowflake AI & ML features — *General Availability*](2024/other/2024-08-08-cross-region-llm.md)
* [August 08, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-08-08-dcr.md)
  + [Support for external tables and Apache Iceberg™ tables](2024/other/2024-08-08-dcr.md)
  + [Integration with TransUnion TruAudience Identity](2024/other/2024-08-08-dcr.md)
* [August 08, 2024 — RANGE BETWEEN window frames with explicit offsets — *General Availability*](2024/other/2024-08-08-range-between-ga.md)
* [August 06, 2024: Document AI release notes](2024/other/2024-08-06-document-ai.md)
* [August 06, 2024 — Snowflake Native App Framework: Support for VPS on AWS](2024/other/2024-08-06-na-vps-aws.md)
* [August 02, 2024 — ML Functions: Improved Error Messages in Classification](2024/other/2024-08-02-classification-errors.md)
* [August 02, 2024 — Snowflake Native App Framework release notes](2024/other/2024-08-02-na-spcs-laf.md)
* [August 02, 2024 — Custom UI in Streamlit in Snowflake –— *General Availability*](2024/other/2024-08-02-sis.md)
* [August 01, 2024 — Support for Streamlit 1.35.0 in Streamlit in Snowflake](2024/other/2024-08-01-sis.md)
* [August 01, 2024 — Snowpark Container Services release notes](2024/other/2024-08-01-spcs.md)
* [July 31, 2024 — Context functions and row access policies in Streamlit in Snowflake –— *General Availability*](2024/other/2024-07-31-sis.md)
* [July 31, 2024 — Snowflake VS Code Extension Release Notes](2024/other/2024-07-31.md)
  + [Edit Snowflake `connections.toml` files](2024/other/2024-07-31.md)
  + [Work with the Snowflake Native App Framework](2024/other/2024-07-31.md)
* [July 25, 2024 — Cortex Search — *Preview*](2024/other/2024-07-25-cortex-search-preview.md)
* [July 25, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-07-25-dcr.md)
  + [Acxiom Real ID integration](2024/other/2024-07-25-dcr.md)
  + [Using developer APIs for provider activation](2024/other/2024-07-25-dcr.md)
  + [User interface for custom templates enhancement](2024/other/2024-07-25-dcr.md)
  + [SQL Query template enhancement](2024/other/2024-07-25-dcr.md)
* [July 25, 2024 — New AI21 model available in Snowflake Cortex AI](2024/other/2024-07-25-new-llm-model-jamba.md)
* [July 24, 2024 — Cortex Guard for Snowflake Cortex AI — *General Availability*](2024/other/2024-07-24-cortex-llm-updates.md)
* [July 24, 2024 — Document AI release notes](2024/other/2024-07-24-document-ai.md)
* [July 23, 2024 — New Meta AI models available in Snowflake Cortex AI](2024/other/2024-07-23-new-llm-models.md)
* [July 23, 2024 — Managing Listings using SQL — **Generally Available**](2024/other/2024-07-23-pl.md)
* [July 19, 2024 — CORTEX_FUNCTIONS_USAGE_HISTORY view — *General Availability*](2024/other/2024-07-19-cortex-functions-usage-history.md)
* [July 18, 2024 — Snowflake Native App Framework - Support for shared external table and Apache Iceberg™ tables — *Preview*](2024/other/2024-07-18-native-app-external-table.md)
* [July 15, 2024 — Snowflake Copilot — *Generally available*](2024/other/2024-07-15-snowflake-copilot-ga.md)
* [July 11, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-07-11-dcr.md)
  + [Sequenced template execution](2024/other/2024-07-11-dcr.md)
  + [Multi-factor authentication](2024/other/2024-07-11-dcr.md)
  + [Register objects in a managed access schema](2024/other/2024-07-11-dcr.md)
  + [Support for additional region](2024/other/2024-07-11-dcr.md)
  + [Single-party SQL query](2024/other/2024-07-11-dcr.md)
* [July 11, 2024 — Snowflake connectors](2024/other/2024-07-11.md)
  + [Snowflake Connector for PostgreSQL](2024/other/2024-07-11.md)
  + [Snowflake Connector for MySQL](2024/other/2024-07-11.md)
* [July 03, 2024 — Data pipelines: Support for Apache Iceberg™ tables with dynamic tables and streams –— *Preview*](2024/other/2024-07-03-dynamic-iceberg-tables.md)
* [July 03, 2024 — External network access in Streamlit in Snowflake –— *General Availability*](2024/other/2024-07-03-sis.md)
* [June 28, 2024 — New geospatial H3 functions — *General Availability*](2024/other/2024-06-28-geospatial-h3-functions-ga.md)
* [June 28, 2024 — Custom UI in Streamlit in Snowflake –— *Preview*](2024/other/2024-06-28-sis.md)
* [June 27, 2024 — Document AI release notes](2024/other/2024-06-27-document-ai.md)
* [June 26, 2024 — Cost Management Release Notes](2024/other/2024-06-26-cost.md)
  + [Organization Overview Page —– *General Availability*](2024/other/2024-06-26-cost.md)
* [June 25, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-06-25-dcr.md)
  + [Provider-run analyses](2024/other/2024-06-25-dcr.md)
  + [Consumer-defined templates](2024/other/2024-06-25-dcr.md)
  + [Granular access controls for tables and templates](2024/other/2024-06-25-dcr.md)
  + [Activating results across regions](2024/other/2024-06-25-dcr.md)
  + [SQL Template enhancement](2024/other/2024-06-25-dcr.md)
* [June 25, 2024 — New TO_QUERY table function](2024/other/2024-06-25-to-query-function.md)
* [June 24, 2024: Time Travel for hybrid tables –— *Preview*](2024/other/2024-06-24-time-travel-hybrid-tables.md)
* [June 21, 2024: Document AI release notes](2024/other/2024-06-21-document-ai.md)
* [June 17, 2024 — New LLM helper functions - TRY_COMPLETE and COUNT_TOKENS](2024/other/2024-06-17-new-llm-functions.md)
  + [New SQL function](2024/other/2024-06-17-new-llm-functions.md)
* [June 15, 2024 — Anomaly Detection](2024/other/2024-06-15-anomaly-detection.md)
* [June 11, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-06-11-dcr.md)
  + [Additional supported regions — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Granular access management for Snowflake data — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Choosing a warehouse when running an analysis — *General Availability*](2024/other/2024-06-11-dcr.md)
  + [Support for multiple custom templates in web app — *General Availability*](2024/other/2024-06-11-dcr.md)
* [Jun 11, 2024 — Sharing data in non-secure views –— *Preview*](2024/other/2024-06-11-sharing-non-secure-views.md)
* [June 10, 2024 — Apache Iceberg™ tables — *General Availability*](2024/other/2024-06-10-iceberg-tables.md)
* [June 05, 2024 — New geospatial functions in preview](2024/other/2024-06-05.md)
  + [New geospatial functions available –— *Preview*](2024/other/2024-06-05.md)
* [June 03, 2024 — New EMBED_TEXT_1024 function for 1024 dimensional output vectors](2024/other/2024-06-03-embed-text-1024.md)
  + [New SQL function](2024/other/2024-06-03-embed-text-1024.md)
* [June 3, 2024 — Entity-Level Privacy Release Notes](2024/other/2024-06-03-entity-level.md)
  + [Aggregation policies with entity-level privacy — *General Availability*](2024/other/2024-06-03-entity-level.md)
* [May 31, 2024 — Snowflake ML Classification Update –— *Preview*](2024/other/2024-05-31-classification-update.md)
* [May 31, 2024 — Structured data types — *General Availability*](2024/other/2024-05-31-structured-types-ga.md)
* [May 28, 2024 — ML Functions Release Notes](2024/other/2024-05-28-call-method-in-from-clause.md)
  + [Simpler SQL for storing results from ML functions](2024/other/2024-05-28-call-method-in-from-clause.md)
* [May 28, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-05-28-dcr.md)
  + [Multi-provider clean rooms via Developer APIs — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Additional supported regions — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Support for views in the web app — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Clean room customizations for identity & activation — *General Availability*](2024/other/2024-05-28-dcr.md)
  + [Custom template enhancements — *General Availability*](2024/other/2024-05-28-dcr.md)
* [May 22, 2024 — SQL Release Notes](2024/other/2024-05-22-table-references.md)
  + [Using the TABLE keyword as an alternative to SYSTEM$REFERENCE and SYSTEM$QUERY_REFERENCE](2024/other/2024-05-22-table-references.md)
* [May 20, 2024 — Cost Management Release Notes](2024/other/2024-05-20-cost.md)
  + [Cost Insights — *General Availability*](2024/other/2024-05-20-cost.md)
* [May 17, 2024 — Document AI Release Notes](2024/other/2024-05-17-document-ai.md)
  + [Document AI —– *Preview*](2024/other/2024-05-17-document-ai.md)
* [May 16, 2024 — Vector data type and vector similarity functions — *General Availability*](2024/other/2024-05-16-vector-data-type-ga.md)
  + [New SQL data type](2024/other/2024-05-16-vector-data-type-ga.md)
  + [New SQL functions](2024/other/2024-05-16-vector-data-type-ga.md)
* [May 14, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-05-14-dcr.md)
  + [Tracing user activity in the web app — *General Availability*](2024/other/2024-05-14-dcr.md)
* [May 14, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-05-14-sis.md)
* [May 13, 2024 — ASOF JOIN Release Notes](2024/other/2024-05-13-asof-join.md)
  + [ASOF JOIN — *General Availability*](2024/other/2024-05-13-asof-join.md)
* [May 08, 2024 — New model for vector embedding — *Preview*](2024/other/2024-05-08-embed-text-model.md)
* [May 08, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-05-08-sis.md)
  + [Streamlit in Snowflake: Custom sleep timer —– *Preview*](2024/other/2024-05-08-sis.md)
* [May 08, 2024 — Snowflake Notifications Release Notes](2024/other/2024-05-08.md)
  + [New SYSTEM$SEND_SNOWFLAKE_NOTIFICATION stored procedure for sending notifications](2024/other/2024-05-08.md)
* [May 07, 2024 — Cortex LLM Functions — *General Availability*](2024/other/2024-05-07-llm-functions-ga.md)
* [May 06, 2024 — Vector data type and vector similarity functions — *Preview*](2024/other/2024-05-06-vector-data-type.md)
  + [New SQL data type](2024/other/2024-05-06-vector-data-type.md)
  + [New SQL functions](2024/other/2024-05-06-vector-data-type.md)
* [May 03, 2024 — Aggregation and Projection Policies Release Notes](2024/other/2024-05-03-policies.md)
  + [Aggregation Policies — *General Availability*](2024/other/2024-05-03-policies.md)
  + [Projection Policies — *General Availability*](2024/other/2024-05-03-policies.md)
* [May 03, 2024 — Snowflake Model Registry – General Availability](2024/other/2024-05-03-snowflake-model-registry.md)
* [May 02, 2024 — Cost Management Release Notes](2024/other/2024-05-02-cost.md)
  + [Organization Overview Page —– *Preview*](2024/other/2024-05-02-cost.md)
* [May 02, 2024 — Snowsight Release Notes](2024/other/2024-05-02-snowsight-dd-preview.md)
  + [Data Dictionary with masked PII –— *Preview*](2024/other/2024-05-02-snowsight-dd-preview.md)
* [April 30, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-04-30-dcr.md)
* [April 30, 2024 — Snowflake Google connectors](2024/other/2024-04-30-gaad-gard-ga.md)
  + [Snowflake Connector for Google Analytics Raw Data](2024/other/2024-04-30-gaad-gard-ga.md)
  + [Snowflake Connector for Google Analytics Aggregate Data](2024/other/2024-04-30-gaad-gard-ga.md)
* [April 29, 2024 — Dynamic Tables — *General Availability*](2024/other/2024-04-29-dynamic-tables.md)
* [April 24, 2024 — New FAILOVER privilege for Client Redirect](2024/other/2024-04-24-failover-privilege.md)
* [April 24, 2024 — Managing Listings using SQL](2024/other/2024-04-24-pl.md)
* [April 23, 2024 — Snowflake Connector for ServiceNow® V2 — *General Availability*](2024/other/2024-04-23-svnc.md)
* [April 22, 2024 — Snowpark Container Services release notes](2024/other/2024-04-22.md)
* [April 17, 2024 — Snowpark Container Services Release Notes](2024/other/2024-04-17.md)
* [April 12, 2024 — Cost Management Release Notes](2024/other/2024-04-12-cost.md)
  + [Account Overview Page — *General Availability*](2024/other/2024-04-12-cost.md)
* [April 12, 2024 — Snowflake Cortex LLM Release Notes](2024/other/2024-04-12-snowflake-cortex-llm-update.md)
* [April 11, 2024 — Budgets Release Notes — *General Availability*](2024/other/2024-04-11-budgets.md)
* [April 11-25, 2024 — Snowflake Copilot — *Preview*](2024/other/2024-04-11-snowflake-copilot-in-snowsight.md)
* [April 09, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-04-09-dcr.md)
* [March 29, 2024 — Data Quality Monitoring Release Notes](2024/other/2024-03-29-dmf.md)
  + [Data Quality Monitoring and data metric functions — *Preview*](2024/other/2024-03-29-dmf.md)
* [March 28, 2024 — Snowflake Data Clean Rooms Release Notes](2024/other/2024-03-28-snowflake-data-clean-rooms.md)
* [March 18-20, 2024 — Limit functionality of your Snowflake Native App —– *Preview*](2024/other/2024-03-18-limit-app-functionality.md)
* [March 15, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-03-15.md)
  + [Streamlit in Snowflake: Support for Streamlit 1.26.0 —– *Preview*](2024/other/2024-03-15.md)
* [March 13, 2024 — Hybrid Tables Release Notes](2024/other/2024-03-13-hybrid-tables.md)
* [March 12, 2024 — Snowflake Cortex Classification Release Notes –— *Preview*](2024/other/2024-03-12-snowflake-cortex-classification.md)
* [March 08, 2024 — Geospatial Functions Release Notes](2024/other/2024-03-08.md)
  + [New Geospatial Functions Available](2024/other/2024-03-08.md)
* [March 05, 2024 — Hybrid Tables Release Notes](2024/other/2024-03-05-hybrid-tables.md)
* [March 05, 2024 — Snowflake Cortex LLM Functions Release Notes –— *Preview*](2024/other/2024-03-05-snowflake-cortex-llm-functions.md)
* [February 28, 2024 — ASOF JOIN Release Notes](2024/other/2024-02-28.md)
  + [ASOF JOIN –— *Preview*](2024/other/2024-02-28.md)
* [February 26, 2024 — Snowpark Container Services Release Notes](2024/other/2024-02-26.md)
* [February 22, 2024 — Snowflake Extension for Visual Studio Code Release Notes](2024/other/2024-02-22.md)
  + [Visual Studio Code extension for Snowpark Python — *Preview*](2024/other/2024-02-22.md)
* [February 21, 2024 — Data sharing & collaboration for accounts in U.S. government regions](2024/other/2024-02-21.md)
* [February 20, 2024 — Hybrid Tables Release Notes](2024/other/2024-02-20-hybrid-tables.md)
* [February 20 - March 5, 2024 — Universal Search in Snowsight –— *Preview*](2024/other/2024-02-20.md)
* [February 15, 2024 — Aggregation and Projection Policies Release Notes](2024/other/2024-02-15-policies.md)
  + [Aggregation Policies — *Preview*](2024/other/2024-02-15-policies.md)
  + [Projection Policies — *Preview*](2024/other/2024-02-15-policies.md)
* [February 15, 2024 — Geospatial Functions Release Notes](2024/other/2024-02-15.md)
  + [H3 Functions for GEOGRAPHY Objects — *General Availability*](2024/other/2024-02-15.md)
* [February 12-14, 2024 — New navigation for Snowsight —– *Preview*](2024/other/2024-02-12.md)
* [January 31, 2024 — Snowflake Native Apps Framework Release Notes](2024/other/2024-01-31.md)
* [January 29, 2024 — Snowflake Google connectors](2024/other/2024-01-29.md)
  + [Snowflake Connector for Google Analytics Raw Data](2024/other/2024-01-29.md)
  + [Snowflake Connector for Google Analytics Aggregate Data](2024/other/2024-01-29.md)
* [January 25, 2024 — Streamlit in Snowflake Release Notes](2024/other/2024-01-25.md)
* [January 18, 2024 — Snowflake Native Apps Framework Release Notes](2024/other/2024-01-18.md)

---
title: Server releases and feature updates in 2025
source: https://docs.snowflake.com/en/release-notes/new-features-2025.md
section: Release Notes
---

# Server releases and feature updates in 2025

The following sections list the release notes for the server releases and feature updates that occurred in 2025:

* Server releases in 2025
* Feature updates in 2025

For more recent releases and feature updates, see [Snowflake server release notes and feature updates](new-features.md).

## Server releases in 2025

* [9.40 Release Notes: Dec 15, 2025-Jan 09, 2026](2025/9_40.md)
  + [New features](2025/9_40.md)
    - [Notifications for data quality incidents (*Preview*)](2025/9_40.md)
  + [Deprecated features](2025/9_40.md)
    - [Deprecation of external OpenAI model routing for Cortex Analyst](2025/9_40.md)
  + [SQL updates](2025/9_40.md)
    - [Semantic views: Using standard SQL clauses to query semantic views (*Preview*)](2025/9_40.md)
  + [Release notes change log](2025/9_40.md)
* [9.39 Release Notes: Dec 08, 2025-Dec 12, 2025](2025/9_39.md)
  + [Security updates](2025/9_39.md)
    - [Trust Center: Detection findings and event-driven scanners (*Preview*)](2025/9_39.md)
    - [Programmatic access tokens: Removing the single-role restriction for service users](2025/9_39.md)
  + [Release notes change log](2025/9_39.md)
* [9.38 Release Notes: Dec 03, 2025-Dec 05, 2025](2025/9_38.md)
  + [SQL updates](2025/9_38.md)
    - [Query insights: Support for queries that benefit from the query acceleration service](2025/9_38.md)
  + [Release notes change log](2025/9_38.md)
* [9.37 Release Notes: Nov 17, 2025-Nov 20, 2025](2025/9_37.md)
  + [SQL updates](2025/9_37.md)
    - [Preparation for renaming Snapshots feature to Backups](2025/9_37.md)
    - [New DECFLOAT data type](2025/9_37.md)
  + [Documentation and learning resources](2025/9_37.md)
    - [New topic that provides an overview of Snowflake authentication methods](2025/9_37.md)
  + [Release notes change log](2025/9_37.md)
* [9.36 Release Notes: Nov 10, 2025-Nov 16, 2025](2025/9_36.md)
  + [SQL updates](2025/9_36.md)
    - [Enhanced SQL functionality](2025/9_36.md)
  + [Extensibility updates](2025/9_36.md)
    - [Support for OAuth when authenticating with GitHub (*General availability*)](2025/9_36.md)
    - [Run Apache Spark™ workloads on Snowflake (*General availability*)](2025/9_36.md)
    - [Support for connecting Scala applications to Snowpark Connect for Spark (*Preview*)](2025/9_36.md)
  + [Data governance updates](2025/9_36.md)
    - [Anomaly detection for Data Quality Monitoring (*Preview*)](2025/9_36.md)
  + [Release notes change log](2025/9_36.md)
* [9.35 Release Notes: Nov 03, 2025-Nov 07, 2025](2025/9_35.md)
  + [SQL updates](2025/9_35.md)
    - [Interval data types (*Preview*)](2025/9_35.md)
  + [Data lake updates](2025/9_35.md)
    - [Replicate Snowflake-managed Apache Iceberg™ tables (*Preview*)](2025/9_35.md)
  + [Release notes change log](2025/9_35.md)
* [9.34 Release Notes (no announcements): Oct 27, 2025-Oct 29, 2025](2025/9_34.md)
  + [Release notes change log](2025/9_34.md)
* [9.33 Release Notes: Oct 21, 2025-Oct 23, 2025](2025/9_33.md)
  + [Security updates](2025/9_33.md)
    - [AWS cross-region support for PrivateLink (*General availability*)](2025/9_33.md)
    - [Outbound network traffic to stages and volumes on Google Cloud Storage supports private connectivity (*General availability*)](2025/9_33.md)
    - [Snowflake-managed network rules (*General availability*)](2025/9_33.md)
  + [SQL updates](2025/9_33.md)
    - [Semantic views: Support for ASOF JOIN](2025/9_33.md)
  + [Release notes change log](2025/9_33.md)
* [9.32 Release Notes (with behavior changes): Oct 13, 2025-Oct 15, 2025](2025/9_32.md)
  + [Behavior change bundles](2025/9_32.md)
  + [Data lake updates](2025/9_32.md)
    - [Catalog-linked databases: Auto-refresh for Apache Iceberg™ table creation](2025/9_32.md)
    - [Table optimization for Snowflake-managed Apache Iceberg™ tables (*General availability*)](2025/9_32.md)
  + [Replication updates](2025/9_32.md)
    - [Snowflake Notebooks replication (*General availability*)](2025/9_32.md)
  + [Release notes change log](2025/9_32.md)
* [9.31 Release Notes: Oct 06, 2025-Oct 08, 2025](2025/9_31.md)
  + [Security updates](2025/9_31.md)
    - [Tri-Secret Secure supports private connectivity](2025/9_31.md)
  + [Data lake updates](2025/9_31.md)
    - [Query data compaction jobs for Apache Iceberg™ tables](2025/9_31.md)
  + [Release notes change log](2025/9_31.md)
* [9.30 Release Notes: Sep 29, 2025-Oct 01, 2025](2025/9_30.md)
  + [Security updates](2025/9_30.md)
    - [Hybrid table support for Tri-Secret Secure](2025/9_30.md)
  + [SQL updates](2025/9_30.md)
    - [Update to the 2025b release of the TZDB](2025/9_30.md)
    - [MERGE ALL BY NAME](2025/9_30.md)
    - [Aliases for PIVOT and UNPIVOT columns](2025/9_30.md)
    - [New SQL parameter: ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS](2025/9_30.md)
    - [Reference table columns in lambda expressions when calling higher-order functions](2025/9_30.md)
    - [SEARCH function supports PHRASE and EXACT search modes](2025/9_30.md)
    - [Snowflake Scripting CONTINUE handlers](2025/9_30.md)
    - [Snowflake Scripting user-defined functions (UDFs) (*General availability*)](2025/9_30.md)
    - [Semantic views: Support for dimensions that use a Cortex Search Service](2025/9_30.md)
  + [Release notes change log](2025/9_30.md)
* [9.29 Release Notes: Sep 24, 2025-Sep 26, 2025](2025/9_29.md)
  + [New features](2025/9_29.md)
    - [Declarative Shared Native Apps (*Preview*)](2025/9_29.md)
    - [Cortex Agent Monitoring (*Preview*)](2025/9_29.md)
  + [Data collaboration updates](2025/9_29.md)
    - [Cross-Cloud Auto-Fulfillment support for open table formats](2025/9_29.md)
  + [Data pipeline updates](2025/9_29.md)
    - [CREATE OR ALTER DYNAMIC TABLE (*Preview*)](2025/9_29.md)
  + [Data governance updates](2025/9_29.md)
    - [Data quality: FRESHNESS data metric function improvement](2025/9_29.md)
  + [Release notes change log](2025/9_29.md)
* [9.28 Release Notes: Sep 15, 2025-Sep 17, 2025](2025/9_28.md)
  + [SQL updates](2025/9_28.md)
    - [Query insights in Snowsight (*Preview*)](2025/9_28.md)
  + [Data pipeline updates](2025/9_28.md)
    - [dbt Projects on Snowflake: Support for dbt retry (*Preview*)](2025/9_28.md)
  + [Release notes change log](2025/9_28.md)
* [9.27 Release Notes (with behavior changes): Sep 08, 2025-Sep 10, 2025](2025/9_27.md)
  + [Behavior change bundles](2025/9_27.md)
  + [SQL updates](2025/9_27.md)
    - [Retrieve bind variable values (*Preview*)](2025/9_27.md)
  + [Data pipeline updates](2025/9_27.md)
    - [Dynamic tables: Support for base tables with zero data retention](2025/9_27.md)
  + [Data lake updates](2025/9_27.md)
    - [New system function to replace the catalog integration for an externally managed Apache Iceberg™ table](2025/9_27.md)
  + [Release notes change log](2025/9_27.md)
* [9.26 Release notes: Sep 01, 2025-Sep 04, 2025](2025/9_26.md)
  + [SQL updates](2025/9_26.md)
    - [Filling gaps in time-series data (*Preview*)](2025/9_26.md)
    - [Account Usage: New INGRESS_NETWORK_ACCESS_HISTORY view](2025/9_26.md)
    - [Account Usage: New INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](2025/9_26.md)
  + [Release notes change log](2025/9_26.md)
* [9.25 Release notes: Aug 25, 2025-Aug 28, 2025](2025/9_25.md)
  + [New features](2025/9_25.md)
    - [Sensitive data classification: Automatic classification of a database (*General availability*)](2025/9_25.md)
  + [Security updates](2025/9_25.md)
    - [Support for keys generated with Elliptic Curve Digital Signature Algorithms (ECDSA)](2025/9_25.md)
  + [SQL updates](2025/9_25.md)
    - [Querying semantic views (*General availability*)](2025/9_25.md)
    - [Semantic views: Listing facts in a view, schema, database, or account](2025/9_25.md)
    - [Semantic views: Support for renaming views](2025/9_25.md)
  + [Data lake updates](2025/9_25.md)
    - [Apache Iceberg™ tables: Row-level deletes for externally managed tables (*General availability*)](2025/9_25.md)
  + [Data governance updates](2025/9_25.md)
    - [Data quality: Updated privilege model allows non-owners to associate a data metric function with an object (*Preview*)](2025/9_25.md)
    - [Object tags: New limit for allowed values](2025/9_25.md)
  + [Release notes change log](2025/9_25.md)
* [9.24 Release notes: Aug 18, 2025-Aug 20, 2025](2025/9_24.md)
  + [Security updates](2025/9_24.md)
    - [Self-service activation of Tri-Secret Secure (*General availability*)](2025/9_24.md)
  + [SQL updates](2025/9_24.md)
    - [ALTER LISTING command to simplify adding and removing targets (*General availability*)](2025/9_24.md)
  + [New features](2025/9_24.md)
    - [Snowflake Native App Framework - MONITOR privilege support for apps (*General availability*)](2025/9_24.md)
  + [Data lake updates](2025/9_24.md)
    - [Set a target file size for Apache Iceberg™ tables (*Preview*)](2025/9_24.md)
  + [Release notes change log](2025/9_24.md)
* [9.23 Release notes: Aug 11, 2025-Aug 15, 2025](2025/9_23.md)
  + [SQL updates](2025/9_23.md)
    - [Snowflake Scripting user-defined functions (UDFs)](2025/9_23.md)
    - [Private facts and metrics in semantic views](2025/9_23.md)
  + [Data loading / unloading updates](2025/9_23.md)
    - [Apache Arrow library upgrade to version 21.0.0](2025/9_23.md)
  + [Data pipeline updates](2025/9_23.md)
    - [Dynamic tables: Support for UNION in incremental refresh mode](2025/9_23.md)
  + [Release notes change log](2025/9_23.md)
* [9.22 Release notes (with behavior changes): Aug 04, 2025-Aug 08, 2025](2025/9_22.md)
  + [Behavior change bundles](2025/9_22.md)
  + [New features](2025/9_22.md)
    - [Data quality: Using expectations to define quality checks (*General availability*)](2025/9_22.md)
  + [Extensibility updates](2025/9_22.md)
    - [Tracing SQL statements run from handler code (*General availability*)](2025/9_22.md)
  + [Data pipeline updates](2025/9_22.md)
    - [Dynamic tables: Support for immutability constraints](2025/9_22.md)
    - [Dynamic tables: Support for backfill](2025/9_22.md)
  + [Release notes change log](2025/9_22.md)
* [9.21 Release notes: Jul 29, 2025-Aug 01, 2025](2025/9_21.md)
  + [Security updates](2025/9_21.md)
    - [GENERATE_SYNTHETIC_DATA: Consistency secret now optional in most cases](2025/9_21.md)
  + [SQL updates](2025/9_21.md)
    - [Account Usage: TABLE_QUERY_PRUNING_HISTORY and COLUMN_QUERY_PRUNING_HISTORY views (*General availability*)](2025/9_21.md)
    - [The SEARCH_IP function supports searching for IPv6 addresses](2025/9_21.md)
    - [Generating YAML for a semantic view and creating a semantic view from YAML](2025/9_21.md)
  + [Data loading / unloading updates](2025/9_21.md)
    - [Simplified Snowpipe pricing](2025/9_21.md)
  + [Data pipeline updates](2025/9_21.md)
    - [Snowpark Connect for Spark and Snowpark Submit (*Preview*)](2025/9_21.md)
  + [Release notes change log](2025/9_21.md)
* [9.20 Release notes: Jul 21, 2025-Jul 25, 2025](2025/9_20.md)
  + [SQL updates](2025/9_20.md)
    - [CREATE INDEX command supports INCLUDE columns](2025/9_20.md)
    - [Semantic views: Listing dimensions and metrics in a view, schema, database, or account](2025/9_20.md)
    - [New query insights about join performance and optimization](2025/9_20.md)
  + [Data pipeline updates](2025/9_20.md)
    - [Tasks: New EXECUTE AS USER option and IMPERSONATE privilege for user objects](2025/9_20.md)
    - [Dynamic tables: Disallowed use of the COPY_SESSION attribute while manually refreshing dynamic tables on a serverless warehouse](2025/9_20.md)
  + [Release notes change log](2025/9_20.md)
* [9.19 Release notes: Jul 14, 2025-Jul 17, 2025](2025/9_19.md)
  + [SQL updates](2025/9_19.md)
    - [Data types: Structured types support for standard Snowflake tables (*General availability*)](2025/9_19.md)
  + [Data loading / unloading updates](2025/9_19.md)
    - [Optimize data ingestion with pre-clustering for Snowpipe Streaming - high-performance architecture (*Preview*)](2025/9_19.md)
    - [COPY FILES (*General availability*)](2025/9_19.md)
  + [Release notes change log](2025/9_19.md)
* [9.18 Release notes: Jul 02, 2025-Jul 08, 2025](2025/9_18.md)
  + [SQL updates](2025/9_18.md)
    - [Snowflake Scripting output (OUT) arguments (*General availability*)](2025/9_18.md)
  + [Data pipeline updates](2025/9_18.md)
    - [Dynamic tables: Support for externally managed Apache Iceberg™ tables (*General availability*)](2025/9_18.md)
  + [Data governance updates](2025/9_18.md)
    - [Data Quality: New system data metric function](2025/9_18.md)
  + [Release notes change log](2025/9_18.md)
* [9.17 Release notes (with behavior changes): Jun 24, 2025-Jun 30, 2025](2025/9_17.md)
  + [Behavior change bundles](2025/9_17.md)
  + [SQL updates](2025/9_17.md)
    - [New maximum size limits for database objects (*General availability*)](2025/9_17.md)
    - [Snowflake Scripting supports nested stored procedures (*General availability*)](2025/9_17.md)
  + [Data pipeline updates](2025/9_17.md)
    - [Snowsight: Task Overview and Graph Run History updates (*General availability*)](2025/9_17.md)
  + [Release notes change log](2025/9_17.md)
* [9.16 Release notes (no announcements): Jun 16, 2025-Jun 23, 2025](2025/9_16.md)
  + [Release notes change log](2025/9_16.md)
* [9.15 Release notes: Jun 09, 2025-Jun 11, 2025](2025/9_15.md)
  + [New features](2025/9_15.md)
    - [Artifact Repository (*General availability*)](2025/9_15.md)
  + [Security updates](2025/9_15.md)
    - [Malicious IP Protection](2025/9_15.md)
    - [Findings Lifecycle Management](2025/9_15.md)
  + [SQL updates](2025/9_15.md)
    - [UNION BY NAME operator](2025/9_15.md)
  + [Data pipeline updates](2025/9_15.md)
    - [Support for streams on externally managed Apache Iceberg™ tables with row-level deletes](2025/9_15.md)
  + [Release notes change log](2025/9_15.md)
* [9.14 Release notes: May 23, 2025-May 28, 2025](2025/9_14.md)
  + [New features](2025/9_14.md)
    - [Trust Center: In-app notifications (*Preview*)](2025/9_14.md)
    - [Trust Center: New Abnormal Failure Rate Detection scanners](2025/9_14.md)
  + [Snowpark Python version updates](2025/9_14.md)
  + [SQL updates](2025/9_14.md)
    - [Search optimization: Support for Apache Iceberg™ tables](2025/9_14.md)
    - [Query Acceleration Service: Support for Apache Iceberg™ tables](2025/9_14.md)
    - [Data types: Structured types support for standard Snowflake tables (*Preview*)](2025/9_14.md)
  + [Data pipeline updates](2025/9_14.md)
    - [Triggered tasks: Support for streams hosted on directory tables and data shares](2025/9_14.md)
  + [Release notes change log](2025/9_14.md)
* [9.13 Release notes: May 19, 2025-May 20, 2025](2025/9_13.md)
  + [Security updates](2025/9_13.md)
    - [Outbound private connectivity for AWS Government regions](2025/9_13.md)
  + [SQL updates](2025/9_13.md)
    - [Pipe operator](2025/9_13.md)
  + [Data loading/unloading updates](2025/9_13.md)
    - [INFER_SCHEMA function: Support for Apache Iceberg™ data types](2025/9_13.md)
  + [Data lake updates](2025/9_13.md)
    - [Cross-cloud/cross-region support for Snowflake-managed Apache Iceberg™ tables](2025/9_13.md)
  + [Release notes change log](2025/9_13.md)
* [9.12 Release notes (with behavior changes): May 05, 2025-May 12, 2025](2025/9_12.md)
  + [Behavior change bundles](2025/9_12.md)
  + [New features](2025/9_12.md)
    - [Release channels for Snowflake Native Apps (*General availability*)](2025/9_12.md)
  + [SQL Updates](2025/9_12.md)
    - [Improved error messages for Data Manipulation Language (DML) commands](2025/9_12.md)
    - [New SQL functions](2025/9_12.md)
  + [Extensibility updates](2025/9_12.md)
    - [Built-in code profiler for Python stored procedures (*General availability*)](2025/9_12.md)
  + [Data loading / unloading updates](2025/9_12.md)
    - [Support for internal stage cloning (*General availability*)](2025/9_12.md)
    - [Vectorized scanner now available without ON_ERROR restrictions](2025/9_12.md)
  + [Data governance updates](2025/9_12.md)
    - [Sensitive data classification: New classifiers for India](2025/9_12.md)
  + [Snowpark Container Services updates](2025/9_12.md)
    - [Using caller’s rights to connect to Snowflake (*General availability*)](2025/9_12.md)
  + [Release notes change log](2025/9_12.md)
* [9.11 Release notes: Apr 28, 2025-May 02, 2025](2025/9_11.md)
  + [New features](2025/9_11.md)
    - [Snowflake Native Apps session debugging (*General availability*)](2025/9_11.md)
    - [Writing files from Snowpark Python UDFs and UDTFs (*General availability*)](2025/9_11.md)
  + [Extensibility updates](2025/9_11.md)
    - [Support for allowing requests to all outbound endpoints from functions and procedures (*General availability*)](2025/9_11.md)
  + [Data lake updates](2025/9_11.md)
    - [Support for Iceberg tables in the People’s Republic of China (*General availability*)](2025/9_11.md)
  + [Decommissioned runtimes](2025/9_11.md)
    - [Python 3.8 decommissioned](2025/9_11.md)
  + [Release notes change log](2025/9_11.md)
* [9.10 Release notes: Apr 14, 2025-Apr 22, 2025](2025/9_10.md)
  + [Extensibility updates](2025/9_10.md)
    - [Support for public custom Git repository URLs (*General availability*)](2025/9_10.md)
  + [Data loading / unloading updates](2025/9_10.md)
    - [Automated refresh for internal named stages (*Preview*)](2025/9_10.md)
    - [Auto-ingest pipes for internal named stages (*Preview*)](2025/9_10.md)
  + [Data lake updates](2025/9_10.md)
    - [Apache Iceberg™ tables: Automated refresh table names now appear in the ACCOUNT_USAGE.PIPE_USAGE_HISTORY view](2025/9_10.md)
  + [Privacy updates](2025/9_10.md)
    - [Synthetic data generation (*General availability*)](2025/9_10.md)
  + [Release notes change log](2025/9_10.md)
* [9.9 Release notes: Apr 07, 2025-Apr 09, 2025](2025/9_09.md)
  + [SQL updates](2025/9_09.md)
    - [New Snowflake parameter: DEFAULT_NULL_ORDERING](2025/9_09.md)
  + [Extensibility updates](2025/9_09.md)
    - [Artifact Repository (*Preview*)](2025/9_09.md)
  + [Release notes change log](2025/9_09.md)
* [9.8 Release notes: Mar 31, 2025-Apr 04, 2025](2025/9_08.md)
  + [Security updates](2025/9_08.md)
    - [Trust Center: Risky Human and Service User scanners](2025/9_08.md)
  + [SQL updates](2025/9_08.md)
    - [Asynchronous refresh for failover groups and replication groups](2025/9_08.md)
    - [Bind variables in SHOW commands](2025/9_08.md)
  + [Data lake updates](2025/9_08.md)
    - [Apache Iceberg™ tables: Row-level deletes for externally managed tables (*Preview*)](2025/9_08.md)
    - [Apache Iceberg™ tables: Delta table support (*General availability*)](2025/9_08.md)
    - [New database properties: CATALOG_SYNC_NAMESPACE_MODE and CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER](2025/9_08.md)
  + [Snowpark Container Services updates](2025/9_08.md)
    - [Automatic suspension of a Snowpark Container Services service (*Preview*)](2025/9_08.md)
  + [Release notes change log](2025/9_08.md)
* [9.7 Release notes (with behavior changes): Mar 17, 2025-Mar 27, 2025](2025/9_07.md)
  + [Behavior change bundles](2025/9_07.md)
  + [New features](2025/9_07.md)
    - [Grant database roles to a Snowflake Native App — *Preview*](2025/9_07.md)
    - [DISABLE_UI_DOWNLOAD_BUTTON object parameter for Snowsight and the Classic Console (*General availability*)](2025/9_07.md)
  + [Replication updates](2025/9_07.md)
    - [Schema-level replication for failover groups (*General availability*)](2025/9_07.md)
  + [SQL updates](2025/9_07.md)
    - [Semi-structured data: XML format (*General availability*)](2025/9_07.md)
    - [Spread operator](2025/9_07.md)
    - [New maximum size limits for database objects (*Preview*)](2025/9_07.md)
  + [Release notes change log](2025/9_07.md)
* [9.6 Release notes: Mar 10, 2025-Mar 12, 2025](2025/9_06.md)
  + [SQL updates](2025/9_06.md)
    - [Search optimization: Support for column collations](2025/9_06.md)
  + [Data pipeline updates](2025/9_06.md)
    - [Dynamic tables: Maximum number of dynamic tables in an account increased to 50,000](2025/9_06.md)
  + [Release notes change log](2025/9_06.md)
* [9.5 Release notes: Mar 03, 2025-Mar 06, 2025](2025/9_05.md)
  + [New features](2025/9_05.md)
    - [Automatic sensitive data classification (*General availability*)](2025/9_05.md)
  + [SQL updates](2025/9_05.md)
    - [Snowflake Scripting: Asynchronous child jobs (*General availability*)](2025/9_05.md)
    - [Snowflake Scripting: Improved error messages](2025/9_05.md)
  + [Release notes change log](2025/9_05.md)
* [9.4 Release notes: Feb 24, 2025-Mar 01, 2025](2025/9_04.md)
  + [New features](2025/9_04.md)
    - [Additional information returned for objects bound to references (*General availability*)](2025/9_04.md)
    - [More granular control for log, trace, and metric levels in an app (*General availability*)](2025/9_04.md)
  + [SQL updates](2025/9_04.md)
    - [Cloning databases that contain hybrid tables (*Preview*)](2025/9_04.md)
    - [New SQL functions](2025/9_04.md)
  + [Extensibility updates](2025/9_04.md)
    - [Support for associating an event table with a database (*General availability*)](2025/9_04.md)
  + [Data loading updates](2025/9_04.md)
    - [Dynamic tables and tasks: Events logged for refreshes and task executions](2025/9_04.md)
  + [Data lake updates](2025/9_04.md)
    - [CATALOG_NAMESPACE parameter for catalog integrations is now optional](2025/9_04.md)
  + [Release notes change log](2025/9_04.md)
* [9.3 Release notes: Feb 18, 2025-Feb 21, 2025](2025/9_03.md)
  + [New features](2025/9_03.md)
    - [Tasks now support lower scheduling intervals (*General availability*)](2025/9_03.md)
    - [Data lineage (*General availability*)](2025/9_03.md)
  + [SQL updates](2025/9_03.md)
    - [SEARCH function: Support for conjunctive semantics](2025/9_03.md)
  + [Extensibility updates](2025/9_03.md)
    - [Support for a wildcard character in network rule network identifiers (*General availability*)](2025/9_03.md)
    - [Support for telemetry metrics and custom spans, with visualizations in Snowsight (*General availability*)](2025/9_03.md)
  + [Data pipeline updates](2025/9_03.md)
    - [Dynamic tables: Support for UNION ALL](2025/9_03.md)
  + [Data lake updates](2025/9_03.md)
    - [Cloning support for Snowflake-managed Apache Iceberg™ tables (*General availability*)](2025/9_03.md)
  + [Release notes change log](2025/9_03.md)
* [9.2 Release notes (with behavior changes): Jan 22, 2025-Feb 13, 2025](2025/9_02.md)
  + [Behavior change bundles](2025/9_02.md)
  + [Non-bundled behavior changes](2025/9_02.md)
  + [New features](2025/9_02.md)
    - [Triggered tasks now can operate as Serverless Tasks (*General availability*)](2025/9_02.md)
    - [Trust Center: Manage individual scanners](2025/9_02.md)
  + [Security updates](2025/9_02.md)
    - [Outbound private connectivity for Microsoft Azure Government regions](2025/9_02.md)
  + [SQL updates](2025/9_02.md)
    - [New SQL functions](2025/9_02.md)
    - [Additional CREATE OR ALTER commands (*Preview*)](2025/9_02.md)
  + [Data lake updates](2025/9_02.md)
    - [Apache Iceberg™ tables: Support for writing Apache Iceberg metadata for Delta-based tables](2025/9_02.md)
  + [Release notes change log](2025/9_02.md)
* [9.1 Release notes: Jan 13, 2025-Jan 16, 2025](2025/9_01.md)
  + [New features](2025/9_01.md)
    - [Outbound private connectivity for Snowflake features](2025/9_01.md)
  + [SQL updates](2025/9_01.md)
    - [ARRAY_AGG function support for window frames (*General availability*)](2025/9_01.md)
  + [Data pipelines updates](2025/9_01.md)
    - [CREATE DYNAMIC TABLE command: New REQUIRE USER parameter added](2025/9_01.md)
    - [ALTER DYNAMIC TABLE command: New COPY SESSION parameter added](2025/9_01.md)
  + [Data lake updates](2025/9_01.md)
    - [External stage and external volume support for Amazon S3 access points (*General availability*)](2025/9_01.md)
    - [Apache Iceberg™ tables: Automated refresh (*General availability*)](2025/9_01.md)
  + [Data governance updates](2025/9_01.md)
    - [Data metric functions: Support for referential integrity checks](2025/9_01.md)
  + [Privacy updates](2025/9_01.md)
    - [Join policies (*Preview*)](2025/9_01.md)
  + [Release notes change log](2025/9_01.md)
* [9.0 Release notes: Jan 07, 2025-Jan 09, 2025](2025/9_00.md)
  + [Security updates](2025/9_00.md)
    - [External key store integration for Tri-Secret Secure (*General availability*)](2025/9_00.md)
    - [Pinning private endpoints (*General availability*)](2025/9_00.md)
  + [Release notes change log](2025/9_00.md)

## Feature updates in 2025

* [Dec 18, 2025: Network rules and policies support Google Cloud Private Service Connect IDs (*General availability*)](2025/other/2025-12-18-gcp-pscid-network-rules-and-policies.md)
* [Dec 17, 2025 — Snowflake High Performance connector for Kafka (*Preview*)](2025/other/2025-12-17-kafkahp-pupr.md)
* [Dec 17, 2025: Schema evolution support for Snowpipe Streaming with high-performance architecture](2025/other/2025-12-17-schema-evolution-snowpipe-streaming.md)
* [Dec 17, 2025: Snowflake Postgres (*Preview*)](2025/other/2025-12-17-snowflake-postgres.md)
* [Dec 16, 2025: Cortex Search multi-indexing and custom vector embedding (*Preview*)](2025/other/2025-12-16-cortex-search-multi-index-preview.md)
* [Dec 16, 2025: Notebooks in Workspaces (*Preview*)](2025/other/2025-12-16-notebooks-in-workspaces.md)
  + [Key features](2025/other/2025-12-16-notebooks-in-workspaces.md)
* [Dec 15, 2025: Account Usage: New CATALOG_LINKED_DATABASE_USAGE_HISTORY view](2025/other/2025-12-15-catalog-linked-db-usage-history.md)
* [Dec 15, 2025: Vector aggregate functions](2025/other/2025-12-15-vector-aggregate-functions.md)
* [Dec 12, 2025: Private connectivity for internal stages on Google Cloud (*General availability*)](2025/other/2025-12-12-gcp-pl-internal-stages.md)
* [Dec 11, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-12-11-dcr.md)
  + [Clean Rooms API Version: 12.3](2025/other/2025-12-11-dcr.md)
* [Dec 11, 2025: Default pipe for Snowpipe Streaming with high-performance architecture](2025/other/2025-12-11-default-pipe.md)
* [Dec 11, 2025: Interactive tables and interactive warehouses (*General availability*)](2025/other/2025-12-11-interactive-tables-ga.md)
* [Dec 11, 2025: Support for Streamlit in Snowflake container runtime (Preview)](2025/other/2025-12-11-sis.md)
* [Dec 10, 2025: Cost anomalies (*General availability*)](2025/other/2025-12-02-cost-anomalies-ga.md)
* [Dec 10, 2025: General availability of WORM backups](2025/other/2025-12-10-worm-backups.md)
  + [Terminology change](2025/other/2025-12-10-worm-backups.md)
* [Dec 08, 2025: AI_REDACT for automated redaction of PII (*General availability*)](2025/other/2025-12-08-ai-redact-ga.md)
* [Dec 08, 2025: Dynamic tables: Support for dual warehouses](2025/other/2025-12-08-dynamic-tables-dual-warehouses.md)
* [Dec 08, 2025: Snowpipe simplified pricing](2025/other/2025-12-08-snowpipe-simplified-pricing.md)
* [Dec 04, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-12-04-dcr.md)
  + [Clean Rooms API Version: 12.2](2025/other/2025-12-04-dcr.md)
* [Dec 03, 2025: Access history improvements](2025/other/2025-12-03-access-history.md)
* [Dec 02, 2025: Optimize existing semantic views or models with verified queries (*Preview*)](2025/other/2025-12-02-cortex-analyst-optimization.md)
* [Dec 02, 2025: Private connectivity for Apache Iceberg™ REST catalog integrations (*General availability*)](2025/other/2025-12-02-iceberg-rest-catalog-private-connectivity.md)
* [Dec 02, 2025: Auto-fulfillment for listings that span databases (*General availability*)](2025/other/2025-12-02-laf-listings-span-databases.md)
* [Nov 21, 2025: AI_COMPLETE function (*General availability*)](2025/other/2025-11-21-ai-complete-ga.md)
* [Nov 21, 2025: Import models from Hugging Face to Snowflake (*Preview*)](2025/other/2025-11-21-hugging-face-model-import-preview.md)
* [Nov 21, 2025: Tri-Secret Secure data protection for Snowpark Container Services block volumes (General availability)](2025/other/2025-11-21-spcs-tri-secret-secure.md)
* [Nov 21, 2025: External query engine support for Apache Iceberg™ tables with Snowflake Horizon Catalog (*Preview*)](2025/other/2025-11-21-tables-iceberg-query-using-external-query-engine-snowflake-horizon-preview.md)
* [Nov 21, 2025: Trust Center notifications in Snowsight (*General availability*)](2025/other/2025-11-21-trust-center-in-app-notifications.md)
* [Nov 20, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-20-dcr.md)
  + [Clean Rooms API Version: 11.9](2025/other/2025-11-20-dcr.md)
* [Nov 20, 2025: New versions of Streamlit supported in Streamlit in Snowflake (General availability)](2025/other/2025-11-20-sis.md)
* [Nov 20, 2025: SnowConvert AI interface improvements](2025/other/2025-11-20-snowconvert-ai-interface-improvements.md)
* [Nov 18, 2025: Apache Iceberg™ tables: Support for bi-directional data access with Microsoft Fabric (*Preview*)](2025/other/2025-11-18-iceberg-microsoft-fabric-bidirectional-data-access.md)
* [Nov 17, 2025: Access control enhancements for cost anomalies](2025/other/2025-11-17-cost-anomaly.md)
* [Nov 17, 2025: Document Processing Playground (*Preview*)](2025/other/2025-11-17-document-processing-playground.md)
* [Nov 17, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*General availability*)](2025/other/2025-11-17-native-apps-spcs-aws-gov-fedram-ga.md)
* [Nov 14, 2025: Cortex Analyst Routing Mode (*Preview*)](2025/other/2025-11-14-cortex-analyst-routing-mode.md)
* [Nov 13, 2025: Excluding objects from sensitive data classification (*General availability*)](2025/other/2025-11-13-data-classification.md)
* [Nov 13, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-13-dcr.md)
  + [Clean Rooms API Version: 11.8](2025/other/2025-11-13-dcr.md)
* [Nov 13, 2025: Improved stage volume implementation in Snowpark Container Services (*General availability*)](2025/other/2025-11-13-spcs-improved-storage-mount-ga.md)
* [Nov 10, 2025: Snowpipe Streaming with high-performance architecture on Google Cloud Platform (GCP) (*General availability*)](2025/other/2025-11-10-snowpipe-streaming-gcp-ga.md)
* [Nov 07, 2025: AI_REDACT function (*Preview*)](2025/other/2025-11-07-aisql-redact-pii.md)
* [Nov 07, 2025: Pricing plans and offers (*General availability*)](2025/other/2025-11-07-pricing-plans-offers.md)
* [Nov 07, 2025: Storage lifecycle policies (*General availability*)](2025/other/2025-11-07-storage-lifecycle-policies-ga.md)
* [Nov 07, 2025: Trust Center extensions (*Preview*)](2025/other/2025-11-07-trust-center-extensions.md)
* [Dec 01, 2025: CORTEX_AISQL_USAGE_HISTORY Account Usage view (*General availability*)](2025/other/2025-12-01-cortex-aisql-usage-history.md)
* [Nov 06, 2025: dbt Projects on Snowflake (*General availability*)](2025/other/2025-11-06-dbt-projects-on-snowflake-ga.md)
  + [What’s new since preview](2025/other/2025-11-06-dbt-projects-on-snowflake-ga.md)
* [Nov 06, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-11-06-dcr.md)
  + [Clean Rooms API Version: 11.2](2025/other/2025-11-06-dcr.md)
* [Nov 05, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*General availability*)](2025/other/2025-11-05-cortex-agents-teams-ga.md)
* [Nov 05, 2025: Shared Workspaces (*Preview*)](2025/other/2025-11-05-shared-workspaces.md)
  + [Key features](2025/other/2025-11-05-shared-workspaces.md)
* [Nov 05, 2025: Snowpipe Streaming with high-performance architecture on Azure (*General availability*)](2025/other/2025-11-05-snowpipe-streaming-azure-ga.md)
* [Nov 05, 2025: Support for paid listings in the Kingdom of Saudi Arabia (KSA) (*General availability*)](2025/other/2025-11-05-support-for-paid-listings-ksa.md)
* [Nov 04, 2025: Snowflake-managed MCP server (*General availability*)](2025/other/2025-11-04-cortex-agents-mcp.md)
* [Nov 04, 2025: Cortex Agents (*General availability*)](2025/other/2025-11-04-cortex-agents.md)
* [Nov 04, 2025: Interactive tables and interactive warehouses (*Preview*)](2025/other/2025-11-04-interactive-tables-and-interactive-warehouses.md)
* [Nov 04, 2025: Snowflake Machine Learning Experiments (*Preview*)](2025/other/2025-11-04-ml-experiment-tracking.md)
* [Nov 04, 2025: Snowflake Openflow - Snowflake Deployments (*General availability*)](2025/other/2025-11-04-openflow.md)
* [Nov 04, 2025: Performance Explorer (*General availability*)](2025/other/2025-11-04-performance-explorer-ga.md)
* [Nov 04, 2025: Sharing semantic views](2025/other/2025-11-04-sharing-semantic-views.md)
* [Nov 04, 2025: Snowflake Intelligence (*General availability*)](2025/other/2025-11-04-snowflake-intelligence.md)
* [Nov 03, 2025: Semantic views support for account replication](2025/other/2025-11-03-semantic-views-replication.md)
* [Oct 31, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*General availability*)](2025/other/2025-10-31-na-spcs-gcp-ga.md)
* [Oct 31, 2025: Organization-level findings in the Trust Center](2025/other/2025-10-31-trust-center-org-findings.md)
* [Oct 30, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-30-dcr.md)
  + [Clean Rooms API Version: 11.0](2025/other/2025-10-30-dcr.md)
* [Oct 29, 2025: CLIENT_POLICY parameter for authentication policies](2025/other/2025-10-29-client-version-policies.md)
* [Oct 29, 2025: Guided account failover in Snowsight (*General availability*)](2025/other/2025-10-29-guided-account-failover-snowsight.md)
* [Oct 29, 2025: Snowflake Native Apps: Shareback (*Preview*)](2025/other/2025-10-29-nativeapps-shareback.md)
* [Oct 23, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-23-dcr.md)
  + [Clean Rooms API Version: 10.6](2025/other/2025-10-23-dcr.md)
* [Oct 20, 2025: Performance Explorer (*Preview*)](2025/other/2025-10-20-performance-explorer.md)
* [Oct 17, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*General availability*)](2025/other/2025-10-17-iceberg-external-writes-cld-ga.md)
* [Oct 17, 2025: Partitioned writes for Apache Iceberg™ tables (*General availability*)](2025/other/2025-10-17-iceberg-partitioned-writes-ga.md)
* [Oct 17, 2025: Set a target file size for Apache Iceberg™ tables (*General availability*)](2025/other/2025-10-17-set-target-file-size-ga.md)
* [Oct 16, 2025: AI_EXTRACT function (*General availability*)](2025/other/2025-10-16-ai-extract.md)
* [Oct 16, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-16-dcr.md)
  + [Clean Rooms API Version: 10.5](2025/other/2025-10-16-dcr.md)
* [Oct 16, 2025: Organization account in a hybrid organization](2025/other/2025-10-16-hybrid-orgs.md)
* [Oct 15, 2025: Enforced join order with directed joins (*General availability*)](2025/other/2025-10-15-directed-join.md)
* [Oct 16, 2025: Cross-region inference for US Commercial Gov](2025/other/2025-10-16-aisql-cross-region-gov-preview.md)
* [Oct 13, 2025: CORTEX_EMBED_USER database role (*General availability*)](2025/other/2025-10-13-cortex-embed-user-db-role.md)
* [Oct 10, 2025: Cortex Search Component Scores (*Preview*)](2025/other/2025-10-10-cortex-search-component-scores.md)
* [Oct 09, 2025: dbt Projects on Snowflake: Recent improvements (*Preview*)](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [dbt Project failures show up as failed queries](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Compile on create](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Install deps on compile](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [MONITOR privilege](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
  + [Accessing execution results is easier](2025/other/2025-10-09-dbt-projects-on-snowflake-updates.md)
* [Oct 09, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-09-dcr.md)
  + [Clean Rooms API Version: 10.4](2025/other/2025-10-09-dcr.md)
* [Oct 09, 2025: Organization user groups with organizational listings (*Preview*)](2025/other/2025-10-09-org-user-groups-with-org-listings.md)
* [Oct 09, 2025: Verified query suggestions (*Preview*)](2025/other/2025-10-09-verified-query-suggestions.md)
* [Oct 07, 2025: Query insights in Snowsight (*General availability*)](2025/other/2025-10-07-query-insights-in-snowsight-ga.md)
* [Oct 06, 2025: Hybrid table support for Microsoft Azure (*General availability*)](2025/other/2025-10-06-hybrid-tables-azure-ga.md)
* [Oct 03, 2025: Named scoring profiles for Cortex Search Services (*General availability*)](2025/other/2025-10-03-cortex-search-named-scoring-profiles.md)
* [Oct 03, 2025: Lineage for stored procedures and tasks (*General availability*)](2025/other/2025-10-03-process-lineage.md)
* [Oct 02, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-10-02-dcr.md)
* [Oct 02, 2025: Snowflake-managed MCP server (*Preview*)](2025/other/2025-10-02-mcp-server.md)
* [Oct 02, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*General availability*)](2025/other/2025-10-02-semantic-views-in-snowsight.md)
* [Oct 01, 2025: New OBJECT_VISIBILITY property (*Preview*)](2025/other/2025-10-01-object-visibility.md)
* [Sep 30, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)](2025/other/2025-09-30-cortex-agents-teams-ga.md)
* [Sep 30, 2025: Declarative Sharing (*Preview*)](2025/other/2025-09-30-declarative-sharing.md)
* [Sep 30, 2025: GRANT OWNERSHIP ON NOTEBOOK (*General availability*)](2025/other/2025-09-30-grant-ownership-on-notebook.md)
* [Sep 30, 2025: Support for derived metrics in semantic views](2025/other/2025-09-30-semantic-view-derived-metrics.md)
* [Nov 04, 2025: Cortex AI_TRANSCRIBE function (*General availability*)](2025/other/2025-11-04-cortex-ai-transcribe-ga.md)
* [Nov 04, 2025: Cortex AI Functions (*General availability*)](2025/other/2025-11-04-cortex-aisql-operators-ga.md)
* [Sep 29, 2025: External OAuth support for Snowflake Open Catalog catalog integration (*General availability*)](2025/other/2025-09-29-open-catalog-support-external-oauth.md)
* [Sep 29, 2025: Using SQL for Cortex Powered Object Descriptions (*General availability*)](2025/other/2025-09-29-sql-object-descriptions.md)
* [Sep 26, 2025: AI_COUNT_TOKENS function (*Preview*)](2025/other/2025-09-26-ai-count-tokens-function.md)
* [Sep 25, 2025: Page filtering for AI_PARSE_DOCUMENT](2025/other/2025-09-25-ai-parse-document-page-filter.md)
* [Sep 25, 2025: Cortex AI Functions – AI_TRANSLATE (*General availability*)](2025/other/2025-09-25-ai-translate-updates.md)
* [Sep 25, 2025: Cost management — Updating budgets more frequently](2025/other/2025-09-25-budget-refresh-interval.md)
* [Sep 25, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-09-25-dcr.md)
* [Sep 25, 2025: FILE data type (*General availability*)](2025/other/2025-09-25-file-data-type-ga.md)
* [Sep 23, 2025: AI_FILTER Performance Optimization (*Preview*)](2025/other/2025-09-23-ai-filter-optimization.md)
* [Sep 23, 2025: Snowpipe Streaming with high-performance architecture (*General availability*)](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Key features and benefits](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Key difference from classic architecture](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
  + [Recommended use cases](2025/other/2025-09-23-snowpipe-streaming-high-performance-architecture.md)
* [Sep 22, 2025: Prevent data compaction on Snowflake-managed Apache Iceberg™ tables](2025/other/2025-09-22-enable-data-compaction-parameter.md)
* [Sep 19, 2025: Snowflake Native Apps support for FedRAMP on AWS for apps with containers (*Preview*)](2025/other/2025-09-19-native-apps-spcs-aws-fedramp-ga.md)
* [Sep 19, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Azure (*Preview*)](2025/other/2025-09-19-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-azure.md)
* [Sep 19, 2025: Read consistency mode for sessions with near-concurrent changes](2025/other/2025-09-19-read-consistency-mode.md)
* [Sep 19, 2025: SnowConvert AI Verification (*Preview*)](2025/other/2025-09-19-snowconvert-ai-verification.md)
* [Sep 17, 2025: Snowflake Openflow - Snowflake Deployments (*Preview*)](2025/other/2025-09-17-openflow.md)
* [Sep 17, 2025: New SYS_CONTEXT function for getting context about applications, sessions, and organizations](2025/other/2025-09-17-sys_context-function.md)
* [Sep 17, 2025: Data lineage for tasks](2025/other/2025-09-17-task-lineage.md)
* [Sep 16, 2025: Support for Streamlit in Snowflake in the People’s Republic of China (Preview)](2025/other/2025-09-16-sis.md)
* [Sep 15, 2025: Billing views for Snowflake resellers and distributors](2025/other/2025-09-15-billing-schema.md)
* [Sep 15, 2025: Snowflake Native Apps updates](2025/other/2025-09-15-native-app-ga.md)
  + [Automated granting of privileges (General availability)](2025/other/2025-09-15-native-app-ga.md)
  + [App specifications (General availability)](2025/other/2025-09-15-native-app-ga.md)
  + [Feature policies (General availability)](2025/other/2025-09-15-native-app-ga.md)
* [Sep 15, 2025: Multi-factor authentication — Support for one-time passcodes](2025/other/2025-09-15-otp.md)
* [Sep 12, 2025: Support for position row-level deletes when writing to externally managed Apache Iceberg™ tables or catalog-linked databases on Amazon S3 or Google Cloud (*Preview*)](2025/other/2025-09-12-position-row-level-deletes-support-writing-to-externally-managed-iceberg-table-s3-google-cloud.md)
* [Sep 11, 2025: Support for Snowflake Cortex AI Functions in incremental dynamic table refresh](2025/other/2025-09-11-dynamic-tables-cortex-aisql-support.md)
* [Sep 11, 2025: Workspaces (*General availability*)](2025/other/2025-09-11-workspaces-ga.md)
* [Sep 09, 2025: Sensitive data classification](2025/other/2025-09-09-data-classification.md)
  + [Classifying views automatically (*General availability*)](2025/other/2025-09-09-data-classification.md)
  + [Excluding objects from automatic classification (*Preview*)](2025/other/2025-09-09-data-classification.md)
* [Sep 09, 2025: Hybrid table support for Microsoft Azure (*Preview*)](2025/other/2025-09-09-hybrid-tables-azure-pupr.md)
* [Sep 09, 2025: Using Snowsight to monitor data quality (*Preview*)](2025/other/2025-09-11-dq-ui.md)
* [Sep 02, 2025: Cortex Agents: Admin object REST API (*Preview*)](2025/other/2025-09-02-cortex-agents-rest-api-object.md)
* [Sep 02, 2025: Document AI models in the model registry](2025/other/2025-09-02-document-ai.md)
* [Sep 02, 2025: Partitioned writes for Apache Iceberg™ tables (*Preview*)](2025/other/2025-09-02-iceberg-partitioned-writes.md)
* [Aug 29, 2025: Snowflake Native Apps: Restricted caller’s rights (*General availability*)](2025/other/2025-08-29-native-apps-rcr-ga.md)
* [Aug 28, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-08-28-dcr.md)
* [Aug 28, 2025: Hybrid table support for periodic rekeying (*General availability*)](2025/other/2025-08-28-hybrid-tables-periodic-rekeying.md)
* [Aug 28, 2025: Monitoring events for Snowpipe](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
  + [Snowpipe: data ingestion events](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
  + [Externally managed Apache Iceberg™ tables: automated refresh events](2025/other/2025-08-28-monitoring-events-for-snowpipe.md)
* [Aug 26, 2025: Using the database object explorer in Snowsight to create and manage semantic views (*Preview*)](2025/other/2025-08-26-semantic-views-in-snowsight.md)
* [Aug 25, 2025: Snowflake Connectors for Microsoft Power Apps (*General availability*)](2025/other/2025-08-25-mspowerapps.md)
* [Aug 22, 2025: AI_EXTRACT function (*Preview*)](2025/other/2025-08-22-ai-extract.md)
* [Aug 22, 2025: Organization profile updates](2025/other/2025-08-22-org-profiles.md)
* [Aug 21, 2025: AI Parse Document layout mode (*General availability*)](2025/other/2025-08-21-aisql-ai-parse-document-layout-ga.md)
* [Aug 21, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-08-21-dcr.md)
* [Aug 20, 2025: Cortex Search Service replication (*Preview*)](2025/other/2025-08-20-cortex-search-service-replication.md)
* [Aug 20, 2025: Distributed processing in Snowflake ML: Many Model Training and Distributed Partition Function](2025/other/2025-08-20-snowflake-ml-distributed-processing.md)
* [Aug 20, 2025: New stage volume implementation in Snowpark Container Services (*Preview*)](2025/other/2025-08-20-spcs-stage-volume-new.md)
* [Aug 28, 2025: Model Registry model deployment UI (*Preview*)](2025/other/2025-08-28-model-deployment-ui.md)
* [Aug 19, 2025: Trust Center email notifications (*General availability*)](2025/other/2025-08-19-trust-center-email-notifications-ga.md)
* [Aug 18, 2025: Snowsight navigation menu updates (Gradual rollout)](2025/other/2025-08-18-snowsight-navigation.md)
* [Aug 18, 2025: Write Once, Read Many (WORM) snapshots (*Preview*)](2025/other/2025-08-18-worm-snapshots.md)
* [Aug 14, 2025: Support for stored procedures in data lineage (*Preview*)](2025/other/2025-08-14-lineage.md)
* [Aug 14, 2025: Using SQL for Cortex Powered Object Descriptions (*Preview*)](2025/other/2025-08-14-sql-object-descriptions.md)
* [Aug 14, 2025: Workload identity federation (*General availability*)](2025/other/2025-08-14-wif.md)
* [Aug 12, 2025: Snowflake ML Jobs (*General availability*)](2025/other/2025-08-12-distributed-ml-jobs.md)
* [Aug 12, 2025: Support for Streamlit 1.46 (General availability)](2025/other/2025-08-12-sis.md)
* [Aug 11, 2025: CORS configuration to enable cross-origin requests to a Snowpark Container Services service (*General availability*)](2025/other/2025-08-11-spcs-cors-ga.md)
* [Aug 08, 2025: Contacts (*General availability*)](2025/other/2025-08-08-contacts.md)
* [Aug 07, 2025: Cortex AI_TRANSCRIBE (*Preview*)](2025/other/2025-08-07-cortex-aisql-ai-transcribe.md)
* [Aug 07, 2025: Enforced join order with directed joins (*Preview*)](2025/other/2025-08-07-directed-join.md)
* [Aug 07, 2025: Snowpark Container Services batch jobs (*Preview*)](2025/other/2025-08-07-spcs-batch-jobs-pupr.md)
* [Aug 06, 2025: Cortex Agents: admin configuration UI (*Preview*)](2025/other/2025-08-06-cortex-agents-admin-ui.md)
* [Aug 06, 2025: Support for custom components in Streamlit in Snowflake (Preview)](2025/other/2025-08-06-sis.md)
* [Aug 05, 2025: Document AI table extraction (*General availability*)](2025/other/2025-08-05-document-ai.md)
* [Aug 04, 2025: Hybrid table storage for Time Travel data](2025/other/2025-08-04-hybrid-tables-time-travel-billing.md)
* [Aug 01, 2025: Snowflake Intelligence (*Preview*)](2025/other/2025-08-01-snowflake-intelligence.md)
* [Aug 01, 2025: Snowpark Container Services in Google Cloud (*General availability*)](2025/other/2025-08-01-spcs-google-cloud-ga.md)
* [Jul 31, 2025: AI Observability in Snowflake Cortex (*General availability*)](2025/other/2025-07-31-ai-observability-ga.md)
* [Jul 30, 2025: External network access with private connectivity: Google Cloud](2025/other/2025-07-30-outbound-network-access-private-gcp.md)
* [Jul 29, 2025: Cortex Agents integration for Microsoft Teams and Copilot (*Preview*)](2025/other/2025-07-29-cortex-agents-for-ms-teams.md)
* [Jul 28, 2025: Single-use refresh tokens for Snowflake OAuth](2025/other/2025-07-28-oauth.md)
* [Jul 28, 2025: Cortex Powered Object Descriptions](2025/other/2025-07-28-object-descriptions.md)
  + [Ability to generate descriptions without being the owner](2025/other/2025-07-28-object-descriptions.md)
* [Jul 25, 2025: Cortex AI Functions AI_SENTIMENT (*General availability*)](2025/other/2025-07-25-cortex-aisql-ai-sentiment.md)
* [Jul 25, 2025: Snowflake Native App Framework support for Snowflake machine learning models (*General availability*)](2025/other/2025-07-25-na-ml-ga.md)
* [Jul 24, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-07-24-dcr.md)
* [Jul 21, 2025: Billing contact information updates (*General availability*)](2025/other/2025-07-21-billing-contact-info-updates.md)
* [Jul 21, 2025: CREATE_BILLING_EVENT and CREATE_BILLING_EVENTS system functions (*General availability*)](2025/other/2025-07-21-create-billing-events-ga.md)
* [Jul 18, 2025: Alerts on new data (*General availability*)](2025/other/2025-07-18-alerts-on-new-data.md)
* [Jul 18, 2025: Sensitive data classification](2025/other/2025-07-18-database-classification.md)
  + [Automatic classification of a database (*Preview*)](2025/other/2025-07-18-database-classification.md)
  + [Determine which databases and schemas are monitored by automatic sensitive data classification (*Preview*)](2025/other/2025-07-18-database-classification.md)
* [Jul 18, 2025: Write support for externally managed Apache Iceberg™ tables and catalog-linked databases (*Preview*)](2025/other/2025-07-18-iceberg-external-writes-cld.md)
* [Jul 17, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-07-17-dcr.md)
* [Jul 16, 2025: Data governance release notes](2025/other/2025-07-16-tag-propagation-log.md)
  + [Automatic tag propagation: Event table to monitor conflicts (*General availability*)](2025/other/2025-07-16-tag-propagation-log.md)
* [Jul 15, 2025: Support for Streamlit 1.45.1 (General availability)](2025/other/2025-07-15-sis.md)
* [Jul 08, 2025: Snowflake AI_EMBED multimodal embeddings (*Preview*)](2025/other/2025-07-08-aisql-image-ai-embed.md)
* [Jul 08, 2025: ML Explainability visualizations (*General availability*)](2025/other/2025-07-08-ml-explainability-visualizations.md)
* [Jul 07, 2025: Account Usage: New CREDENTIALS view](2025/other/2025-07-07-credentials-view.md)
* [Jul 04, 2025: Snowflake Native App with Snowpark Container Services support for Google Cloud (*Preview*)](2025/other/2025-07-04-na-spcs-gcp-ga.md)
* [Jul 03, 2025: Query insights](2025/other/2025-07-03-query-insights.md)
* [Jul 01, 2025: Snowflake Multi-Node ML Jobs (*Preview*)](2025/other/2025-07-01-distributed-ml-jobs.md)
* [Jun 27, 2025: dbt Projects on Snowflake (*Preview*)](2025/other/2025-06-27-dbt-projects-on-snowflake.md)
* [Jun 26, 2025: Clone dynamic tables as tables (*General availability*)](2025/other/2025-06-26-clone-dt-as-table.md)
* [Jun 24, 2025: Premium views in the organization account (*General availability*)](2025/other/2025-06-24-premium-views.md)
* [Jun 23, 2025: Snowflake Native App Framework updates](2025/other/2025-06-23-auto-privs-app-spec.md)
  + [Automated granting of privileges (*Preview*)](2025/other/2025-06-23-auto-privs-app-spec.md)
  + [App specifications (*Preview*)](2025/other/2025-06-23-auto-privs-app-spec.md)
* [Jun 19, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-06-19-dcr.md)
* [Jun 18, 2025: Customized runtime environments in Warehouse notebooks (*Preview*)](2025/other/2025-06-18-preconfigured-nb-wh-runtime.md)
* [Jun 16, 2025: Cost Management release notes](2025/other/2025-06-16-budget.md)
  + [Budgets: Using tags to add objects](2025/other/2025-06-16-budget.md)
* [Jun 12, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-06-12-dcr.md)
* [Jun 03, 2025: Snowflake Copilot inline (*Preview*)](2025/other/2025-06-03-copilot-inline.md)
* [Jun 03, 2025: Workspaces in Snowsight (*Preview*)](2025/other/2025-06-03-workspaces.md)
* [Jun 02, 2025: AI_CLASSIFY supports up to 500 labels and multi-label classification](2025/other/2025-06-02-ai-classify-label-increase.md)
* [Jun 02, 2025: Snowflake Cortex AI Functions (*Preview*)](2025/other/2025-06-02-cortex-aisql-public-preview.md)
  + [AI capability meets SQL operators across multimodal data](2025/other/2025-06-02-cortex-aisql-public-preview.md)
* [Jun 01, 2025: Snowsight templates in trial accounts (*General availability*)](2025/other/2025-06-01-snowsight-templates.md)
* [May 30, 2025: Additional model support for Cortex AISQL Images](2025/other/2025-05-30-complete-multimodal-new-models.md)
* [May 30, 2025: Snowflake Openflow (*General availability*)](2025/other/2025-05-30-openflow.md)
* [May 30, 2025: Request Approval Workflow (*General availability*)](2025/other/2025-05-30-raw.md)
* [May 30, 2025: Data Governance release notes](2025/other/2025-05-30-tags.md)
  + [Object tags available in Standard Edition](2025/other/2025-05-30-tags.md)
* [May 29, 2025: Table extraction in Document AI (*Preview*)](2025/other/2025-05-29-document-ai.md)
* [May 29, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-30-dcr.md)
* [May 28, 2025: Organization users (*Preview*)](2025/other/2025-05-29-org-users.md)
* [May 27, 2025: Data sharing & collaboration for accounts in Kingdom of Saudi Arabia region](2025/other/2025-05-27-KSA-regions.md)
* [May 27, 2025: Snowflake Native App with Snowpark Container Services support for Azure Private Link (*General availability*)](2025/other/2025-05-27-na-spcs-azure-pl-ga.md)
* [May 27, 2025: Security release notes](2025/other/2025-05-23-mfa.md)
  + [New authentication methods for multi-factor authentication (MFA) (*General availability*)](2025/other/2025-05-23-mfa.md)
* [May 23, 2025: Notebooks `st.secrets` support for Warehouse and Container Runtimes (*General availability*)](2025/other/2025-05-23-st-secrets-support.md)
* [May 22, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-22-dcr.md)
* [May 22, 2025: New Snowsight navigation menu (*Preview*)](2025/other/2025-05-22-snowsight-navigation-menu.md)
* [May 21, 2025: Snowflake Openflow updates](2025/other/2025-05-21-openflow.md)
* [May 20, 2025: Data Governance release notes](2025/other/2025-05-20-contacts.md)
  + [Contacts for objects (*Preview*)](2025/other/2025-05-20-contacts.md)
* [May 20, 2025: Snowflake Copilot model level RBAC](2025/other/2025-05-20-model-level-rbac.md)
* [May 20, 2025: Snowflake Openflow (*Preview*)](2025/other/2025-05-20-openflow.md)
* [May 20, 2025: Snowpark Container Services preview available in Google Cloud (*Preview*)](2025/other/2025-05-20-spcs-preview-available-in-gcp.md)
* [May 19, 2025: Cortex COMPLETE Structured Output schema references](2025/other/2025-05-19-complete-structured-output-json-refs.md)
* [May 19, 2025: Snowflake ML Data Connector release notes](2025/other/2025-05-19-data-connector-container-runtime.md)
  + [Snowflake ML Data Connector for Container Runtime (*General availability*)](2025/other/2025-05-19-data-connector-container-runtime.md)
* [May 19, 2025: Snowflake Notebooks Container Runtime - Support for Azure and Azure Private Link (*General availability*)](2025/other/2025-05-19-nb-spcs-azure-pl-ga.md)
* [May 16, 2025: Cost Management release notes](2025/other/2025-05-16-cost.md)
  + [Cost anomalies (*Preview*)](2025/other/2025-05-16-cost.md)
* [May 16, 2025: Universal Search support for pipes, tasks, and streams (*General availability*)](2025/other/2025-05-16-universal-search-pipes-tasks-streams.md)
* [May 15, 2025: Organizational listings: discovery and access](2025/other/2025-05-15-dna.md)
* [May 14, 2025: Data Governance release notes](2025/other/2025-05-14-tag-propagation.md)
  + [Automatic propagation of user-defined tags (*General availability*)](2025/other/2025-05-14-tag-propagation.md)
* [May 13, 2025: Support for Streamlit 1.44.0 (General availability)](2025/other/2025-05-13-sis.md)
* [May 08, 2025: Document AI updates](2025/other/2025-05-08-document-ai.md)
* [May 08, 2025: Dynamic tables: Support for IS_ROLE_IN_SESSION in access policies (*General availability*)](2025/other/2025-05-08-dynamic-tables-is-role-in-session.md)
* [May 05, 2025: Generation 2 standard warehouses (*General availability*)](2025/other/2025-05-05-gen2-standard-warehouses.md)
* [May 05, 2025: Snowflake Cortex Provisioned Throughput (*General availability*)](2025/other/2025-05-05-provisioned-throughput.md)
* [May 01, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-05-01-dcr.md)
* [May 01, 2025: Dynamic tables: Support for filtering by current time and date for incremental refresh (*General availability*)](2025/other/2025-05-01-dynamic-tables-current-timestamp.md)
* [Apr 30, 2025: Programmatic access tokens](2025/other/2025-04-30-programmatic-access-tokens.md)
* [Apr 28, 2025: Boost Cortex Search results based on metadata signals (*General availability*)](2025/other/2025-04-28-boost-decay.md)
* [Apr 28, 2025: Role-Based Access Control for Cortex LLM Models](2025/other/2025-04-28-cortex-llm-model-rbac.md)
* [Apr 28, 2025: Disable reranker in Cortex Search queries (*General availability*)](2025/other/2025-04-28-reranking.md)
* [Apr 24, 2025: Container Runtime for ML on multi-node clusters (*Preview*)](2025/other/2025-04-24-container-runtime-multi-node.md)
* [Apr 24, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-24-dcr.md)
* [Apr 22, 2025: Trust Center email notifications (*Preview*)](2025/other/2025-04-22-trust-center-email-notifications.md)
* [Apr 18, 2025: Support for `st.query_params` (General availability)](2025/other/2025-04-18-sis.md)
* [Apr 17, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-17-dcr.md)
* [Apr 17, 2025: Semantic views (*Preview*)](2025/other/2025-04-17-semantic-views.md)
* [Apr 16, 2025: Document AI multi-language support](2025/other/2025-04-16-document-ai.md)
* [Apr 16, 2025: Snowflake ML Jobs (*Preview*)](2025/other/2025-04-16-snowflake-ml-jobs.md)
* [Apr 15, 2025: Snowflake Cortex AI state-of-the-art Entity Sentiment (*Preview*)](2025/other/2025-04-15-cortex-entity-sentiment-function.md)
* [Apr 15, 2025: Snowflake Egress Cost Optimizer (*General availability*)](2025/other/2025-04-15-eco-ga.md)
* [Apr 15, 2025: Search optimization improves the performance of queries containing scalar functions](2025/other/2025-04-15-search-optimization-scalar-functions.md)
* [Apr 14, 2025: Snowflake Cortex AI COMPLETE multimodal support (*Preview*)](2025/other/2025-04-14-cortex-complete-multimodal.md)
* [Apr 14, 2025: EMBED Function Added to Cortex REST API (*General availability*)](2025/other/2025-04-14-cortex-offers-embed-rest-api.md)
* [Apr 14, 2025: Mistral AI’s multimodal Pixtral Large now available for Snowflake Cortex AI (*General availability*)](2025/other/2025-04-14-cortex-offers-pixtral-large.md)
* [Apr 14, 2025: FILE data type to create tables for multimodal analysis (*Preview*)](2025/other/2025-04-14-file-data-type.md)
* [Apr 14, 2025: PROMPT helper function (*Preview*)](2025/other/2025-04-14-prompt-helper-function.md)
* [Apr 14, 2025: Support for Streamlit 1.42.0 (General availability)](2025/other/2025-04-14-sis.md)
* [Apr 11, 2025: Snowsight replication configuration and monitoring (*General availability*)](2025/other/2025-04-11-snowsight-replication-setup-and-monitoring.md)
* [Apr 10, 2025: Snowflake Data Clean Rooms updates](2025/other/2025-04-10-dcr.md)
* [Apr 7, 2025: Google Cloud Private Service Connect in Streamlit in Snowflake (Preview)](2025/other/2025-04-07-sis.md)
* [Apr 04, 2025: Cortex AI Observability (*Preview*)](2025/other/2025-04-04-cortex-ai-observability.md)
* [Apr 04, 2025: Cortex COMPLETE Structured Outputs (*General availability*)](2025/other/2025-04-04-cortex-complete-structured-outputs.md)
* [Mar 31, 2025: Data Governance release notes](2025/other/2025-03-31-ah-joins.md)
  + [Access history: Support for joins](2025/other/2025-03-31-ah-joins.md)
* [Mar 27, 2025: Snowflake Data Clean Rooms release notes](2025/other/2025-03-27-dcr.md)
  + [Simplified onboarding](2025/other/2025-03-27-dcr.md)
  + [Analysis Error Messaging in the clean rooms UI](2025/other/2025-03-27-dcr.md)
  + [Obfuscated provider templates](2025/other/2025-03-27-dcr.md)
  + [Cross-cloud collaboration support for multiple accounts](2025/other/2025-03-27-dcr.md)
  + [Update to default caching behavior in consumer run analysis](2025/other/2025-03-27-dcr.md)
  + [New limited API access role for developers](2025/other/2025-03-27-dcr.md)
  + [LiveRamp Identity & Translation integration update](2025/other/2025-03-27-dcr.md)
* [Mar 27, 2025: Git integration and multi-file editing in Streamlit in Snowflake (Preview)](2025/other/2025-03-27-sis.md)
* [Mar 26, 2025: Support for multiple semantic models in Cortex Analyst queries (*General availability*)](2025/other/2025-03-26-multiple-models-cortex-analyst.md)
* [Mar 24, 2025: Support for `st.experimental_audio_input` and `st.camera_input` (General availability)](2025/other/2025-03-24-sis.md)
* [Mar 20, 2025: Data Governance release notes](2025/other/2025-03-20-cortex-descriptions.md)
  + [Cortex Powered Object Descriptions: Support for additional table types](2025/other/2025-03-20-cortex-descriptions.md)
* [Mar 20, 2025: Snowflake Datasets (*General availability*)](2025/other/2025-03-20-snowflake-ml-datasets.md)
* [Mar 19, 2025: Alerts on new data (*Preview*)](2025/other/2025-03-19-alerts-on-new-data.md)
* [Mar 19, 2025: Additional file format support for Cortex AI Parse Document](2025/other/2025-03-19-parse-document-more-file-formats.md)
* [Mar 17, 2025: Document AI release notes](2025/other/2025-03-17-document-ai.md)
* [Mar 17, 2025: Snowflake Notebooks on Container Runtime for AWS (*General availability*)](2025/other/2025-03-17-notebooks-on-spcs-aws.md)
  + [New features](2025/other/2025-03-17-notebooks-on-spcs-aws.md)
* [Mar 12, 2025: Support for `st.file_uploader` (General availability)](2025/other/2025-03-12-sis.md)
* [Mar 07, 2025: RESOURCE_CONSTRAINT clause for Snowpark-optimized warehouses (*General availability*)](2025/other/2025-03-07-snowpark-optimized-warehouses-resource_constraint.md)
* [Mar 06, 2025: Cortex AI PARSE_DOCUMENT function for OCR (*General availability*)](2025/other/2025-03-06-ocr-mode-parse-document.md)
* [Mar 05, 2025: Search optimization improves the performance of queries containing scalar subqueries](2025/other/2025-03-05-search-optimization-scalar-subqueries.md)
* [Mar 05, 2025: Snowpark Container Services support for application metrics](2025/other/2025-03-05-spcs-application-metrics.md)
* [Mar 04, 2025: Universal Search ML model support (*General availability*)](2025/other/2025-03-04-universal-search-ml-models.md)
* [Mar 03, 2025: Snowflake Cortex Document Processing Usage History](2025/other/2025-03-03-cortex-document-processing-usage-history.md)
* [Mar 03, 2025: Native Apps with Snowpark Container Services - Support for AWS PrivateLink (*General availability*)](2025/other/2025-03-03-na-spcs-aws-pl-ga.md)
* [Mar 03, 2025: Native Apps with Snowpark Container Services - Support for Azure Private Link (*Preview*)](2025/other/2025-03-03-na-spcs-azure-pl-pupr.md)
* [Mar 03, 2025: Collapsible navigation bar in Snowsight (*General availability*)](2025/other/2025-03-03-snowsight-collapsible-nav-bar.md)
* [Feb 28, 2025: Increased max_cluster_count limits for multi-cluster warehouses](2025/other/2025-02-28-increased-max_cluster_count-limits.md)
* [Feb 27, 2025: Snowflake Data Clean Rooms release notes](2025/other/2025-02-27-dcr.md)
  + [UI loading improvements](2025/other/2025-02-27-dcr.md)
  + [External and Apache Iceberg™ table support in SQL templates](2025/other/2025-02-27-dcr.md)
  + [Data Clean Rooms available with data sharing terms](2025/other/2025-02-27-dcr.md)
  + [Improvements to provider-linked views in the API](2025/other/2025-02-27-dcr.md)
  + [Multi-template approval](2025/other/2025-02-27-dcr.md)
  + [Change in UI form handling with custom templates](2025/other/2025-02-27-dcr.md)
* [Feb 27, 2025: Snowflake Native Apps release channels (*Preview*)](2025/other/2025-02-27-na-release-channels.md)
* [Feb 26, 2025: Generating connection settings for a client, driver, library, or third-party application](2025/other/2025-02-26-connect-to-snowflake.md)
* [Feb 24, 2025: Changes to the app toolbar in Snowsight](2025/other/2025-02-24-na-toolbar-change.md)
* [Feb 19, 2025: Snowflake ML Model Serving Automatic Suspension (*Preview*)](2025/other/2025-02-19-spcs-model-serving-auto-suspend.md)
* [Feb 14, 2025: Document AI release notes](2025/other/2025-02-14-document-ai.md)
* [Feb 14, 2025: Support for `st.file_uploader` (Preview)](2025/other/2025-02-14-sis.md)
* [Feb 13, 2025: Snowpark Container Services (SPCS) Model Serving on Azure (*Preview*)](2025/other/2025-02-13-spcs-model-serving-azure.md)
* [Feb 11, 2025: Snowflake Cortex COMPLETE Structured Outputs (*Preview*)](2025/other/2025-02-11-cortex-complete-structured-outputs.md)
* [Feb 07, 2025: Snowflake Cortex Fine-tuning (*General availability*)](2025/other/2025-02-07-cortex-finetuning.md)
* [Feb 7, 2025: Support for material icons (General Availability)](2025/other/2025-02-07-sis.md)
* [Feb 03, 2025: Snowflake Native Apps with Snowpark Container Services support for Azure (*General availability*)](2025/other/2025-02-03-na-spcs-azure-ga.md)
* [Jan 31, 2025: Support for future grants in Streamlit in Snowflake (General Availability)](2025/other/2025-01-31-sis.md)
* [Jan 27, 2025: Organization account (*General availability*)](2025/other/2025-01-27-org-account.md)
* [Jan 23, 2025: Document AI on GCP (*General availability*)](2025/other/2025-01-23-document-ai.md)
* [Jan 20, 2025: Snowflake Native Apps with Snowpark Container Services support for AWS PrivateLink (*Preview*)](2025/other/2025-01-20-na-spcs-aws-pl-pupr.md)
* [Jan 16, 2025: Snowsight enhancements to contact email management (*General availability*)](2025/other/2025-01-16-snowsight-contact-email-update.md)
* [Jan 15, 2025: Custom instructions in Cortex Analyst (*Preview*)](2025/other/2025-01-15-cortex-analyst-custom-instructions.md)
* [Jan 15, 2025: Optimized COPY and INSERT bulk loads on empty hybrid tables (*General availability*)](2025/other/2025-01-15-ht-optimized-bulk-load.md)
* [Jan 07, 2025: Snowflake Cortex Playground (*Preview*)](2025/other/2025-01-07-cortex-llm-playground.md)
* [Jan 06, 2025: Snowflake Notebooks warehouse runtime on AWS PrivateLink and Azure Private Link (*General availability*)](2025/other/2025-01-06-notebooks-wh-aws-azure-pl.md)

---
title: Server releases earlier in 2026
source: https://docs.snowflake.com/en/release-notes/weekly-releases-2026.md
section: Release Notes
---

# Server releases earlier in 2026

This topic lists the release notes for server releases that occurred earlier in 2026.

For more recent releases, see [Snowflake server release notes and feature updates](new-features.md).

* [10.10 Release Notes: Mar 22, 2026-Mar 25, 2026](2026/10_10.md)
  + [SQL updates](2026/10_10.md)
    - [Interval data types (*Preview*)](2026/10_10.md)
  + [Snowflake Cortex updates](2026/10_10.md)
    - [Batch Cortex Search (*Preview*)](2026/10_10.md)
  + [Release notes change log](2026/10_10.md)
* [10.9 Release Notes: Mar 17, 2026-Mar 20, 2026](2026/10_9.md)
  + [New features](2026/10_9.md)
    - [Snowflake supports directory, root stage, and SnowGit imports](2026/10_9.md)
  + [Security updates](2026/10_9.md)
    - [TSS history account usage view (*General availability*)](2026/10_9.md)
  + [SQL updates](2026/10_9.md)
    - [DML error logging for tables](2026/10_9.md)
    - [Additional date and time formats](2026/10_9.md)
    - [Additional fixed-position numeric format models](2026/10_9.md)
  + [Snowflake Cortex updates](2026/10_9.md)
    - [CKE document access history](2026/10_9.md)
  + [Release notes change log](2026/10_9.md)
* [10.8 Release Notes: Mar 08, 2026-Mar 12, 2026](2026/10_8.md)
  + [SQL updates](2026/10_8.md)
    - [User-defined types](2026/10_8.md)
  + [Data collaboration updates](2026/10_8.md)
    - [Business Continuity and Disaster Recovery (BCDR) for listings](2026/10_8.md)
  + [Release notes change log](2026/10_8.md)
* [10.7 Release Notes (with behavior changes): Mar 02, 2026-Mar 05, 2026](2026/10_7.md)
  + [Behavior change bundles](2026/10_7.md)
  + [Data lake updates](2026/10_7.md)
    - [Apache Iceberg™ tables: Support for fixed(L) data type](2026/10_7.md)
  + [Release notes change log](2026/10_7.md)
* [10.6 Release Notes: Feb 23, 2026-Feb 27, 2026](2026/10_6.md)
  + [Data lake updates](2026/10_6.md)
    - [Apache Iceberg™ tables: Partitioned writes with hierarchical paths (*Preview*)](2026/10_6.md)
  + [Data governance updates](2026/10_6.md)
    - [Data quality: Non-owners can associate a data metric function with an object (*General availability*)](2026/10_6.md)
  + [Release notes change log](2026/10_6.md)
* [10.5 Release Notes: Feb 16, 2026-Feb 19, 2026](2026/10_5.md)
  + [Security updates](2026/10_5.md)
    - [SAML2 federated authentication: Support for metadata URL](2026/10_5.md)
    - [Tri-Secret Secure supports secure share area accounts](2026/10_5.md)
  + [Data governance updates](2026/10_5.md)
    - [DUPLICATE_COUNT DMF: Ability to specify multiple columns](2026/10_5.md)
  + [Release notes change log](2026/10_5.md)
* [10.4 Release Notes: Feb 09, 2026-Feb 13, 2026](2026/10_4.md)
  + [SQL updates](2026/10_4.md)
    - [New SQL functions](2026/10_4.md)
  + [Release notes change log](2026/10_4.md)
* [10.3 Release Notes: Feb 02, 2026-Feb 05, 2026](2026/10_3.md)
  + [Extensibility updates](2026/10_3.md)
    - [Owner’s rights contexts: Allow INFORMATION_SCHEMA, SHOW, and DESCRIBE](2026/10_3.md)
  + [Release notes change log](2026/10_3.md)
* [10.2 Release Notes: Jan 26, 2026-Jan 30, 2026](2026/10_2.md)
  + [SQL updates](2026/10_2.md)
    - [New UUID data type](2026/10_2.md)
  + [Data loading / unloading updates](2026/10_2.md)
    - [Support for Microsoft Fabric OneLake (*General availability*)](2026/10_2.md)
  + [Release notes change log](2026/10_2.md)
* [10.1 Release Notes (with behavior changes): Jan 19, 2026-Jan 23, 2026](2026/10_1.md)
  + [Behavior change bundles](2026/10_1.md)
  + [SQL updates](2026/10_1.md)
    - [Retrieve bind variable values (*General availability*)](2026/10_1.md)
  + [Release notes change log](2026/10_1.md)
* [10.0 Release Notes: Jan 12, 2026-Jan 15, 2026](2026/10_0.md)
  + [SQL updates](2026/10_0.md)
    - [Search optimization: Support for structured data types](2026/10_0.md)
  + [Data governance updates](2026/10_0.md)
    - [Copy tags when running a CREATE OR REPLACE TABLE command (*Preview*)](2026/10_0.md)
  + [Release notes change log](2026/10_0.md)

---
title: Server releases in 2023
source: https://docs.snowflake.com/en/release-notes/weekly-releases-2023.md
section: Release Notes
---

# Server releases in 2023

This topic lists the release notes for the server releases in 2023.

For more recent releases, see [Snowflake server release notes and feature updates](new-features.md).

* [December 14-15, 2023 — 7.44 Release Notes](2023/7_44.md)
  + [New Features](2023/7_44.md)
    - [Organization Usage: Improved views for billing reconciliation — *General Availability*](2023/7_44.md)
  + [SQL Updates](2023/7_44.md)
    - [Snowflake Cortex ML-Based Time-Series Functions — *General Availability*](2023/7_44.md)
  + [Ecosystem Updates](2023/7_44.md)
    - [Snowpark ML Modeling API — *General Availability*](2023/7_44.md)
    - [Snowpark ML Distributed Hyperparameter Optimization — *Preview*](2023/7_44.md)
  + [Data Lake Updates](2023/7_44.md)
    - [Cross-Cloud/Cross-Region Support for Apache Iceberg™ Tables — *Preview*](2023/7_44.md)
  + [Release Notes Change Log](2023/7_44.md)
* [December 04-05, 2023 — 7.43 Release Notes](2023/7_43.md)
  + [New Features](2023/7_43.md)
    - [Finalizer Task — *General Availability*](2023/7_43.md)
  + [SQL Updates](2023/7_43.md)
    - [New SQL functions](2023/7_43.md)
  + [Extensibility Updates](2023/7_43.md)
    - [Python Snowpark Local Testing Framework — *Preview*](2023/7_43.md)
  + [Web Interface Updates](2023/7_43.md)
    - [Load Files onto Stages and Managed Staged Files using Snowsight — *General Availability*](2023/7_43.md)
  + [Release Notes Change Log](2023/7_43.md)
* [November 29-30, 2023 — 7.42 Release Notes](2023/7_42.md)
  + [New Features](2023/7_42.md)
    - [Native Apps: Support for reference and privilege validation in the manifest file — *Preview*](2023/7_42.md)
    - [Schema detection for JSON and CSV — *General Availability*](2023/7_42.md)
    - [Table schema evolution — *General Availability*](2023/7_42.md)
    - [Apache Iceberg™ tables — *Preview*](2023/7_42.md)
    - [Self-service: Enabling the ORGADMIN role — *General Availability*](2023/7_42.md)
    - [Self-service: Deleting an account — *General Availability*](2023/7_42.md)
  + [Security Updates](2023/7_42.md)
    - [Key pair authentication: Improved troubleshooting](2023/7_42.md)
  + [SQL Updates](2023/7_42.md)
    - [Structured types — *Preview*](2023/7_42.md)
  + [Data Governance Updates](2023/7_42.md)
    - [Row access policies: Reference a protected mapping table in a row access policy — *General availability*](2023/7_42.md)
  + [Data Collaboration Updates](2023/7_42.md)
    - [Recurring subscription-based pricing plans for paid listings —– *General Availability*](2023/7_42.md)
    - [Cross-Cloud Auto-Fulfillment support for sharing a Snowflake Native App — *Preview*](2023/7_42.md)
  + [Release Notes Change Log](2023/7_42.md)
* [November 11-14, 2023 — 7.41 Release Notes (with behavior changes)](2023/7_41.md)
  + [Behavior Change Bundles](2023/7_41.md)
  + [SQL Updates](2023/7_41.md)
    - [New SQL functions](2023/7_41.md)
    - [UDFs and Stored Procedures: Support for Optional Arguments](2023/7_41.md)
    - [Snowflake alerts: Manual execution of alerts](2023/7_41.md)
  + [Security Updates](2023/7_41.md)
    - [Replication of network rules — *Preview*](2023/7_41.md)
  + [Data Pipeline Updates](2023/7_41.md)
    - [Dynamic tables: Support for GRANT <privilege> ON ALL/FUTURE DYNAMIC TABLE - Preview](2023/7_41.md)
    - [Dynamic tables: Support for GRANT ALL/ALL PRIVILEGES ON DYNAMIC TABLE - Preview](2023/7_41.md)
  + [Web Interface Updates](2023/7_41.md)
    - [More control over notification contacts in Snowsight](2023/7_41.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_41.md)
    - [Changes to formatting of query results in worksheets and dashboards](2023/7_41.md)
  + [Release Notes Change Log](2023/7_41.md)
* [November 09-10, 2023 — 7.40 Release Notes](2023/7_40.md)
  + [SQL Updates](2023/7_40.md)
    - [Search Optimization: Support for Substring Search in Semi-Structured Data — *General Availability*](2023/7_40.md)
    - [Email Notification Integrations: ALLOWED_RECIPIENTS No Longer Required](2023/7_40.md)
  + [Web Interface Updates](2023/7_40.md)
    - [Replication and Client Redirect in Snowsight — *Preview*](2023/7_40.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_40.md)
  + [Release Notes Change Log](2023/7_40.md)
* [November 03-06, 2023 — 7.39 Release Notes (with Snowday 2023)](2023/7_39.md)
  + [New Features](2023/7_39.md)
    - [Account Usage: New AGGREGATE_QUERY_HISTORY View — *Preview*](2023/7_39.md)
    - [Budgets on Azure, GCP, and VPS — *Preview*](2023/7_39.md)
    - [Snowflake Native SDK for Connectors — *Preview*](2023/7_39.md)
  + [Security Updates](2023/7_39.md)
    - [Access control: Database roles — *General availability*](2023/7_39.md)
  + [Data Pipeline Updates](2023/7_39.md)
    - [New function SYSTEM$TASK_RUNTIME_INFO](2023/7_39.md)
  + [Extensibility Updates](2023/7_39.md)
    - [External network access — *Preview on Azure*](2023/7_39.md)
    - [Vectorized Python UDTFs — *General Availability*](2023/7_39.md)
  + [Data Governance Updates](2023/7_39.md)
    - [Set a masking policy on a virtual column — *General Availability*](2023/7_39.md)
  + [Release Notes Change Log](2023/7_39.md)
* [October 23-24, 2023 — 7.38 Release Notes](2023/7_38.md)
  + [Security Updates](2023/7_38.md)
    - [Network rules support Azure private endpoints — *Preview*](2023/7_38.md)
  + [SQL Updates](2023/7_38.md)
    - [ALTER TABLE: Support for IF [NOT] EXISTS with ADD COLUMN and DROP COLUMN](2023/7_38.md)
    - [H3 functions for GEOGRAPHY objects — *Preview*](2023/7_38.md)
  + [Extensibility Updates](2023/7_38.md)
    - [Python packages policies — *Preview*](2023/7_38.md)
  + [Web Interface Updates](2023/7_38.md)
    - [Snowsight is the default interface for Snowflake accounts in US government regions](2023/7_38.md)
  + [Release Notes Change Log](2023/7_38.md)
* [October 16-17, 2023 — 7.37 Release Notes](2023/7_37.md)
  + [New Features](2023/7_37.md)
    - [Logging and tracing from handler code — *General Availability*](2023/7_37.md)
  + [Extensibility Updates](2023/7_37.md)
    - [Reading files with a Python function or procedure — *General Availability*](2023/7_37.md)
    - [Reading files with a Scala function or procedure handler — *General Availability*](2023/7_37.md)
  + [SQL Updates](2023/7_37.md)
    - [Fixed an issue with column aliases for aggregates and the GROUP BY ALL clause](2023/7_37.md)
  + [Web Interface Updates](2023/7_37.md)
    - [Can no longer add or manage payment details using Classic Console](2023/7_37.md)
  + [Release Notes Change Log](2023/7_37.md)
* [October 09-10, 2023 — 7.36 Release Notes](2023/7_36.md)
  + [Extensibility Updates](2023/7_36.md)
    - [Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *Preview*](2023/7_36.md)
  + [Data Collaboration Updates](2023/7_36.md)
    - [Company name for listing analytics](2023/7_36.md)
  + [Web Interface Updates](2023/7_36.md)
    - [Accessing billing usage statements — *General Availability*](2023/7_36.md)
    - [Viewing Query History in worksheets — *Preview*](2023/7_36.md)
  + [Release Notes Change Log](2023/7_36.md)
* [October 03-05, 2023 — 7.35 Release Notes](2023/7_35.md)
  + [New Features](2023/7_35.md)
    - [Budgets — *Preview*](2023/7_35.md)
    - [Dynamic tables refreshed on creation by default — *Preview*](2023/7_35.md)
    - [Dynamic tables new sharing capabilities — *Preview*](2023/7_35.md)
  + [SQL Updates](2023/7_35.md)
    - [New SQL Functions](2023/7_35.md)
  + [Data Collaboration Updates](2023/7_35.md)
    - [Allow non-admins to set up Cross-Cloud Auto-Fulfillment](2023/7_35.md)
    - [Offer a limited trial of a data product on the Snowflake Marketplace — *Preview*](2023/7_35.md)
  + [Web Interface Updates](2023/7_35.md)
    - [Task graph run debugging — *Preview*](2023/7_35.md)
  + [Release Notes Change Log](2023/7_35.md)
* [September 27-29, 2023 — 7.34 Release Notes (with behavior changes)](2023/7_34.md)
  + [Behavior Change Bundles](2023/7_34.md)
  + [New Features](2023/7_34.md)
    - [Snowflake Alerts — *General Availability*](2023/7_34.md)
  + [SQL Updates](2023/7_34.md)
    - [New SQL Functions](2023/7_34.md)
  + [Web Interface Updates](2023/7_34.md)
    - [Allow Snowflake On Demand customers to purchase listings](2023/7_34.md)
* [September 18-19, 2023 — 7.33 Release Notes](2023/7_33.md)
  + [New Features](2023/7_33.md)
    - [Network Rules — *Preview*](2023/7_33.md)
    - [Enhanced Network Security — *Preview*](2023/7_33.md)
    - [Network Isolation to Internal Stages Using AWS PrivateLink — *Preview*](2023/7_33.md)
  + [Data Loading Updates](2023/7_33.md)
    - [Cross-platform Support for Snowpipe Auto-Ingest — *General Availability*](2023/7_33.md)
    - [Amazon EventBridge Support for Snowpipe Auto-Ingest — *General Availability*](2023/7_33.md)
  + [Data Governance Updates](2023/7_33.md)
    - [Tag-based Masking Policy: Support for Database & Schema — *General Availability*](2023/7_33.md)
    - [Shared Tag References — *Preview*](2023/7_33.md)
* [September 11-12, 2023 — 7.32 Release Notes](2023/7_32.md)
  + [SQL Updates](2023/7_32.md)
    - [New Function: IS_DATABASE_ROLE_IN_SESSION](2023/7_32.md)
  + [Data Loading / Unloading Updates](2023/7_32.md)
    - [Replicating streams on Snowflake tables populated by Snowpipe Streaming](2023/7_32.md)
    - [Snowpipe Streaming authentication updates](2023/7_32.md)
    - [New options for INFER_SCHEMA](2023/7_32.md)
  + [Data Governance Updates](2023/7_32.md)
    - [Row access policies: Reference a protected mapping table in a row access policy — *Preview*](2023/7_32.md)
    - [Share data protected by a role-based policy — *Preview*](2023/7_32.md)
  + [Web Interface Updates](2023/7_32.md)
    - [Managing data governance in Snowsight — *Generally Available*](2023/7_32.md)
* [September 05-06, 2023 – 7.31 Release Notes](2023/7_31.md)
  + [New Features](2023/7_31.md)
    - [New Information Schema Views for Class Instances](2023/7_31.md)
    - [External Network Access — *Preview*](2023/7_31.md)
* [August 28-29, 2023 — 7.30 Release Notes](2023/7_30.md)
  + [New Features](2023/7_30.md)
    - [Data Pipelines Replication Support — *Preview*](2023/7_30.md)
  + [Security Updates](2023/7_30.md)
    - [Password policies: Add support for password history and time to wait to change a password](2023/7_30.md)
  + [SQL Updates](2023/7_30.md)
    - [EXECUTE IMMEDIATE FROM File — *Preview*](2023/7_30.md)
    - [Organizations & Accounts: Dropping an account URL — *Preview*](2023/7_30.md)
  + [Developer and Extensibility Updates](2023/7_30.md)
    - [Support for Python 3.9 and 3.10 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*](2023/7_30.md)
    - [Tabular Return Values from Python Stored Procedures — *General Availability*](2023/7_30.md)
  + [Data Governance Updates](2023/7_30.md)
    - [Set a masking policy on a virtual column — *Preview*](2023/7_30.md)
  + [Web Interface Updates](2023/7_30.md)
    - [Governance area supports GOVERNANCE_VIEWER and OBJECT_VIEWER database roles](2023/7_30.md)
    - [Provider Studio Onboarding — *General Availability*](2023/7_30.md)
* [August 22-23, 2023 – 7.29 Release Notes (with behavior changes)](2023/7_29.md)
  + [Behavior Changes Bundles](2023/7_29.md)
  + [Non-bundled Pending Behavior Changes](2023/7_29.md)
  + [SQL Updates](2023/7_29.md)
    - [GET_QUERY_OPERATOR_STATS Function — *General Availability*](2023/7_29.md)
    - [Using the Query Hash to Identify Patterns and Trends in Queries](2023/7_29.md)
    - [New SQL Functions](2023/7_29.md)
* [August 16-17, 2023 — 7.28 Release Notes](2023/7_28.md)
  + [New Features](2023/7_28.md)
    - [Blocking Public Access to Azure Internal Stages — *General Availability*](2023/7_28.md)
  + [SQL Updates](2023/7_28.md)
    - [Python Package Version Range Support — *Preview*](2023/7_28.md)
  + [Data Loading Updates](2023/7_28.md)
    - [New File Format Option: USE_LOGICAL_TYPE](2023/7_28.md)
  + [Web Interface Updates](2023/7_28.md)
    - [Snowsight Worksheet Tabs — *General Availability*](2023/7_28.md)
* [August 07-08, 2023 — 7.27 Release Notes](2023/7_27.md)
  + [New Features](2023/7_27.md)
    - [Account Usage: New CLASS_INSTANCES View](2023/7_27.md)
  + [SQL Updates](2023/7_27.md)
    - [New System Stored Procedure for Sending Email Notifications — *General Availability*](2023/7_27.md)
  + [Web Interface Updates](2023/7_27.md)
    - [Sharing: Improved UI Messaging](2023/7_27.md)
* [August 01-02, 2023 — 7.26 Release Notes](2023/7_26.md)
  + [SQL Updates](2023/7_26.md)
    - [SELECT \*: Selecting Columns Matching a SQL Pattern and Replacing Column Values](2023/7_26.md)
    - [Transforming a GEOMETRY Object to a Different Spatial Reference System (ST_TRANSFORM) — *General Availability*](2023/7_26.md)
    - [Vectorized Python UDTFs — *Preview*](2023/7_26.md)
  + [Data Collaboration Updates](2023/7_26.md)
    - [Recurring Subscription-based Pricing Plans for Paid Listings — *Preview*](2023/7_26.md)
    - [Non-Recurring Subscription-based Pricing Plans for Paid Listings — *General Availability*](2023/7_26.md)
  + [Documentation and Learning Resources](2023/7_26.md)
    - [Weekly Release Notes in the Snowflake Documentation](2023/7_26.md)
* [July 25-26, 2023 — 7.25 Release Notes](2023/7_25.md)
  + [New Features](2023/7_25.md)
    - [Organization Usage: New QUERY_ACCELERATION_HISTORY View](2023/7_25.md)
  + [SQL Updates](2023/7_25.md)
    - [Snowflake Alerts: Support for Future Grants and Object Tagging](2023/7_25.md)
    - [Search Optimization: Support for Substring Search in Semi-Structured Data — *Preview*](2023/7_25.md)
  + [Data Governance Updates](2023/7_25.md)
    - [Access History: Track Masking & Row Access Policy References — *General Availability*](2023/7_25.md)
  + [Web Interface Updates](2023/7_25.md)
    - [Create Named Stages using Snowsight — *General Availability*](2023/7_25.md)
* [July 19-20, 2023 — 7.24 Release Notes](2023/7_24.md)
  + [New Features](2023/7_24.md)
    - [SQL Syntax for Enabling the ORGADMIN Role — *Preview*](2023/7_24.md)
* [July 10-12, 2023 — 7.23 Release Notes (with behavior changes)](2023/7_23.md)
  + [Behavior Changes Bundles](2023/7_23.md)
  + [New Features](2023/7_23.md)
    - [Schema Detection and Evolution for Kafka Connector With Snowpipe Streaming — *Preview*](2023/7_23.md)
  + [SQL Updates](2023/7_23.md)
    - [SYSTEM$CLUSTERING_INFORMATION Returns Error Messages](2023/7_23.md)
  + [Web Interface Updates](2023/7_23.md)
    - [Snowsight Set as Default Web Interface](2023/7_23.md)
* [July 05-06, 2023 — 7.22 Release Notes](2023/7_22.md)
  + [New Features](2023/7_22.md)
    - [Deleting an Account (Self-service) — *Preview*](2023/7_22.md)
    - [Organization Usage: New REPLICATION_GROUP_USAGE_HISTORY View](2023/7_22.md)
  + [SQL Updates](2023/7_22.md)
    - [New SQL Functions](2023/7_22.md)
    - [GROUP BY: New ALL Keyword](2023/7_22.md)
* [June 19-22, 2023 — 7.21 Release Notes](2023/7_21.md)
  + [SQL Updates](2023/7_21.md)
    - [New SQL Functions](2023/7_21.md)
  + [Data Loading Updates](2023/7_21.md)
    - [Support REPLACE_INVALID_CHARACTERS for Avro, Parquet, Orc, and XML](2023/7_21.md)
* [June 14-15, 2023 — 7.20 Release Notes](2023/7_20.md)
  + [New Features](2023/7_20.md)
    - [Snowpipe Streaming Replication Support — *Preview*](2023/7_20.md)
  + [Security Updates](2023/7_20.md)
    - [Access Control: New Privilege for Delegating Warehouse Management — *Preview*](2023/7_20.md)
  + [SQL Updates](2023/7_20.md)
    - [Improved Performance for SELECT Statements With LIMIT and ORDER BY Clauses — *General Availability*](2023/7_20.md)
    - [Support for Python 3.10 in Snowpark, UDFs, UDTFs and Stored Procedures — *Preview*](2023/7_20.md)
  + [Data Governance Updates](2023/7_20.md)
    - [Tag-based Masking Policy: Support for Database & Schema — *Preview*](2023/7_20.md)
    - [Access History: Track Objects Modified by a DDL Operation — *Preview*](2023/7_20.md)
  + [Web Interface Updates](2023/7_20.md)
    - [Load Files From a Stage Into a Table — *General Availability*](2023/7_20.md)
* [June 07-08, 2023 — 7.19 Release Notes (with behavior changes)](2023/7_19.md)
  + [Behavior Changes Bundles](2023/7_19.md)
  + [New Features](2023/7_19.md)
    - [Anonymous Procedures — *General Availability*](2023/7_19.md)
    - [Reading Files With a Java Function or Procedure Handler — *General Availability*](2023/7_19.md)
    - [Reading Files With a Scala Function or Procedure Handler — *Preview*](2023/7_19.md)
    - [Reading Files With a Python Function or Procedure — *Preview*](2023/7_19.md)
    - [Schema Detection for JSON and CSV — *Preview*](2023/7_19.md)
    - [Table Schema Evolution — *Preview*](2023/7_19.md)
  + [SQL Updates](2023/7_19.md)
    - [Support for Python 3.9 in Snowpark, UDFs, and Stored Procedures — *Preview*](2023/7_19.md)
    - [UDFs, UDTFs, and Stored Procedures Support Passing Arguments by Name](2023/7_19.md)
  + [Data Science Updates](2023/7_19.md)
    - [Work With Snowflake’s Upcoming ML features](2023/7_19.md)
  + [Organization Updates](2023/7_19.md)
    - [ACCOUNTS View (Organization Usage) — *Preview*](2023/7_19.md)
  + [Web Interface Updates](2023/7_19.md)
    - [New Organizations Only Have Snowsight Access](2023/7_19.md)
* [May 31-June 01, 2023 — 7.18 Release (no announcements)](2023/7_18.md)

---
title: Server releases in 2024
source: https://docs.snowflake.com/en/release-notes/weekly-releases-2024.md
section: Release Notes
---

# Server releases in 2024

This topic lists the release notes for the server releases that occurred in 2024.

For more recent releases, see [Snowflake server release notes and feature updates](new-features.md).

* [December 16-18, 2024 — 8.47 Release Notes](2024/8_47.md)
  + [SQL updates](2024/8_47.md)
    - [New SQL functions](2024/8_47.md)
  + [Extensibility updates](2024/8_47.md)
    - [Support for a wildcard character in network rule network identifiers —– *Preview*](2024/8_47.md)
  + [Data pipeline updates](2024/8_47.md)
    - [Dynamic tables: Maximum number of dynamic tables in an account increased to 10,000](2024/8_47.md)
  + [Data governance updates](2024/8_47.md)
    - [OBJECT_DEPENDENCIES view: Support for dynamic tables](2024/8_47.md)
  + [Release notes change log](2024/8_47.md)
* [December 09-13, 2024 — 8.46 Release Notes](2024/8_46.md)
  + [New features](2024/8_46.md)
    - [Restricted caller’s rights — *Preview*](2024/8_46.md)
  + [Snowsight updates](2024/8_46.md)
    - [New login screen version](2024/8_46.md)
  + [SQL Updates](2024/8_46.md)
    - [New SQL functions](2024/8_46.md)
  + [Release notes change log](2024/8_46.md)
* [December 03-05, 2024 — 8.45 Release Notes](2024/8_45.md)
  + [SQL updates](2024/8_45.md)
    - [Snowflake Scripting: Asynchronous child jobs — *Preview*](2024/8_45.md)
  + [Extensibility updates](2024/8_45.md)
    - [Profiling Python stored procedure handlers — *Preview*](2024/8_45.md)
    - [Java 17 support — *General Availability*](2024/8_45.md)
  + [Data pipeline updates](2024/8_45.md)
    - [Dynamic tables: Unlimited inputs](2024/8_45.md)
  + [Release notes change log](2024/8_45.md)
* [November 18-21, 2024 — 8.44 Release Notes](2024/8_44.md)
  + [New features](2024/8_44.md)
    - [Outbound private connectivity for Snowflake features](2024/8_44.md)
    - [Visual Studio Code extension for Snowpark Python — *General availability*](2024/8_44.md)
  + [Extensibility updates](2024/8_44.md)
    - [External network access for Azure Gov regions — *General availability*](2024/8_44.md)
  + [Data lake updates](2024/8_44.md)
    - [Specify an external ID for SIGV4 REST catalog integrations](2024/8_44.md)
  + [Release notes change log](2024/8_44.md)
* [November 12-14, 2024 — 8.43 Release Notes](2024/8_43.md)
  + [New features](2024/8_43.md)
    - [Full-text search — *General availability*](2024/8_43.md)
    - [Leaked password protection](2024/8_43.md)
    - [Tasks: Python and JVM support for serverless tasks — *General availability*](2024/8_43.md)
  + [SQL updates](2024/8_43.md)
    - [EXECUTE IMMEDIATE FROM: Support for using content from staged files in templates](2024/8_43.md)
    - [Automatic logging and tracing for Snowflake Scripting stored procedures](2024/8_43.md)
    - [ACCOUNT_USAGE: New SERVERLESS_ALERT_HISTORY view](2024/8_43.md)
  + [Extensibility updates](2024/8_43.md)
    - [Authentication with AWS IAM from procedures and functions — *General availability*](2024/8_43.md)
  + [Listings updates](2024/8_43.md)
    - [LISTING_REFRESH_HISTORY — *General availability*](2024/8_43.md)
  + [Data pipeline updates](2024/8_43.md)
    - [Dynamic tables: Support for replication across different failover groups](2024/8_43.md)
  + [Data Lake updates](2024/8_43.md)
    - [Apache Iceberg™ tables: Support for Microsoft Fabric OneLake storage — *Preview*](2024/8_43.md)
  + [Release notes change log](2024/8_43.md)
* [November 04-06, 2024 — 8.42 Release Notes](2024/8_42.md)
  + [New features](2024/8_42.md)
    - [Trust Center: Two new scanners in the Security Essentials scanner package](2024/8_42.md)
    - [Serverless alerts — *General availability*](2024/8_42.md)
  + [SQL updates](2024/8_42.md)
    - [PARSE_JSON and TRY_PARSE_JSON functions: Duplicate keys are now allowed](2024/8_42.md)
  + [Extensibility updates](2024/8_42.md)
    - [New Tensorflow version might require specifying Keras](2024/8_42.md)
  + [Data pipeline updates](2024/8_42.md)
    - [Tasks: Serverless tasks user control — *General availability*](2024/8_42.md)
    - [Tasks: Task success notifications — *General availability*](2024/8_42.md)
  + [AI & ML updates](2024/8_42.md)
    - [API-level Role-based Access Control (RBAC) for Cortex Analyst](2024/8_42.md)
  + [Release notes change log](2024/8_42.md)
* [October 28-30, 2024 — 8.41 Release Notes](2024/8_41.md)
  + [New features](2024/8_41.md)
    - [Outbound private connectivity for Snowflake features](2024/8_41.md)
    - [EXECUTE IMMEDIATE FROM: Preview SQL rendered from Jinja2 templates](2024/8_41.md)
    - [GENERATE_SYNTHETIC_DATA: New system stored procedure for generating synthetic data — *Preview*](2024/8_41.md)
  + [Security updates](2024/8_41.md)
    - [Increased limits for network policies on internal stages](2024/8_41.md)
  + [SQL updates](2024/8_41.md)
    - [Extended support for bind variables](2024/8_41.md)
  + [Extensibility updates](2024/8_41.md)
    - [Writing files from Snowpark Python UDFs and UDTFs — *Preview*](2024/8_41.md)
  + [Release notes change log](2024/8_41.md)
* [October 21-23, 2024 — 8.40 Release Notes](2024/8_40.md)
  + [New features](2024/8_40.md)
    - [Trust Center: New Threat Intelligence scanner package](2024/8_40.md)
    - [Estimate the cost of Automatic Clustering — *General availability*](2024/8_40.md)
    - [Snowflake REST APIs — *General availability*](2024/8_40.md)
  + [Deprecated features](2024/8_40.md)
    - [Snowflake REST APIs](2024/8_40.md)
  + [SQL updates](2024/8_40.md)
    - [New SQL functions](2024/8_40.md)
  + [Data lake updates](2024/8_40.md)
    - [Apache Iceberg™ tables: Catalog integration for Iceberg REST — *General availability*](2024/8_40.md)
  + [Release notes change log](2024/8_40.md)
* [October 14-17, 2024 — 8.39 Release Notes](2024/8_39.md)
  + [New features](2024/8_39.md)
    - [Cortex Analyst fully supported in Streamlit in Snowflake](2024/8_39.md)
  + [Data pipeline updates](2024/8_39.md)
    - [Dynamic tables: Changes to the output of the GET_DDL function](2024/8_39.md)
  + [Data lake updates](2024/8_39.md)
    - [Apache Iceberg™ tables: New SYSTEM$VERIFY_EXTERNAL_VOLUME function](2024/8_39.md)
  + [Release notes change log](2024/8_39.md)
* [October 07-09, 2024 — 8.38 Release Notes (with behavior changes)](2024/8_38.md)
  + [Behavior change bundles](2024/8_38.md)
  + [SQL updates](2024/8_38.md)
    - [New SQL functions](2024/8_38.md)
    - [Query objects larger than 16 MB in files on a stage](2024/8_38.md)
  + [Data pipeline updates](2024/8_38.md)
    - [Dynamic tables: Updates to input types](2024/8_38.md)
  + [Data governance updates](2024/8_38.md)
    - [Data quality: New SYSTEM$DATA_METRIC_SCAN function](2024/8_38.md)
  + [Release notes change log](2024/8_38.md)
* [September 30 - October 03, 2024 — 8.37 Release Notes](2024/8_37.md)
  + [SQL updates](2024/8_37.md)
    - [New SQL functions](2024/8_37.md)
  + [Extensibility updates](2024/8_37.md)
    - [Authentication with AWS IAM from procedures and functions — *Preview*](2024/8_37.md)
  + [Release notes change log](2024/8_37.md)
* [September 23-26, 2024 — 8.36 Release Notes](2024/8_36.md)
  + [Data lake updates](2024/8_36.md)
    - [Cloning support for Snowflake-managed Apache Iceberg™ tables — *Preview*](2024/8_36.md)
  + [Release notes change log](2024/8_36.md)
* [September 18-20, 2024 — 8.35 Release Notes](2024/8_35.md)
  + [SQL updates](2024/8_35.md)
    - [RANGE BETWEEN support for FIRST_VALUE and LAST_VALUE functions](2024/8_35.md)
  + [Extensibility updates](2024/8_35.md)
    - [Telemetry data to the event table from Snowflake Notebook cells temporarily disabled](2024/8_35.md)
    - [pandas on Snowflake - *General Availability*](2024/8_35.md)
  + [Data lake updates](2024/8_35.md)
    - [Apache Iceberg™ tables: Automated refresh — *Preview*](2024/8_35.md)
  + [Release notes change log](2024/8_35.md)
* [September 09-11, 2024 — 8.34 Release Notes](2024/8_34.md)
  + [Data loading/unloading updates](2024/8_34.md)
    - [The vectorized scanner option supports client-side encryption](2024/8_34.md)
  + [Data pipeline updates](2024/8_34.md)
    - [Dynamic tables: New DYNAMIC_TABLE_REFRESH_HISTORY account usage view](2024/8_34.md)
    - [Tasks: Python and JVM support for serverless tasks - *Preview*](2024/8_34.md)
  + [Release notes change log](2024/8_34.md)
* [September 03-05, 2024 — 8.33 Release Notes](2024/8_33.md)
  + [New features](2024/8_33.md)
    - [Snowflake REST APIs — *Preview*](2024/8_33.md)
  + [SQL updates](2024/8_33.md)
    - [SHOW commands: Support for new WITH PRIVILEGES parameter](2024/8_33.md)
  + [Data lake updates](2024/8_33.md)
    - [Apache Iceberg™ tables: Catalog integration for Iceberg REST — *Preview*](2024/8_33.md)
    - [Iceberg tables: Delta table support — *Preview*](2024/8_33.md)
  + [Release notes change log](2024/8_33.md)
* [August 26-30, 2024 — 8.32 Release Notes (with behavior changes)](2024/8_32.md)
  + [Behavior change bundles](2024/8_32.md)
  + [SQL updates](2024/8_32.md)
    - [New SQL functions](2024/8_32.md)
  + [Data pipeline updates](2024/8_32.md)
    - [Tasks: A new option for ALTER TASK](2024/8_32.md)
  + [Release notes change log](2024/8_32.md)
* [August 19-21, 2024 — 8.31 Release Notes](2024/8_31.md)
  + [Data lake updates](2024/8_31.md)
    - [Snowflake Open Catalog: New system function for troubleshooting issues with syncing Snowflake-managed Apache Iceberg™ tables - *Preview*](2024/8_31.md)
    - [Apache Iceberg™ tables: Support for time travel queries using third-party engines — *General availability*](2024/8_31.md)
  + [Release notes change log](2024/8_31.md)
* [August 11-14, 2024 — 8.30 Release Notes](2024/8_30.md)
  + [New features](2024/8_30.md)
    - [Outbound private connectivity with Azure External Network Access and External Functions — *Preview*](2024/8_30.md)
    - [Full-text search - *Preview*](2024/8_30.md)
  + [SQL updates](2024/8_30.md)
    - [Setting users as SNOWFLAKE_SUPPORT users no longer supported](2024/8_30.md)
    - [RANGE BETWEEN with explicit offsets: Additional window functions supported](2024/8_30.md)
    - [UNDROP command: Support for restoring objects using ID](2024/8_30.md)
    - [Wildcard filtering for functions](2024/8_30.md)
  + [Data loading / unloading updates](2024/8_30.md)
    - [Loading unstructured data with Document AI — *Preview*](2024/8_30.md)
  + [Release notes change log](2024/8_30.md)
* [August 07-08, 2024 — 8.29 Release Notes](2024/8_29.md)
  + [Security updates](2024/8_29.md)
    - [Session policies: Support added for secondary roles](2024/8_29.md)
  + [Extensibility updates](2024/8_29.md)
    - [Python user-defined aggregate functions — *General availability*](2024/8_29.md)
    - [Access to Git repositories from Snowflake — *General availability*](2024/8_29.md)
  + [Data lake updates](2024/8_29.md)
    - [Apache Iceberg™ tables: Support for government regions — *General availability*](2024/8_29.md)
  + [Release notes change log](2024/8_29.md)
* [July 29-August 01, 2024 — 8.28 Release Notes](2024/8_28.md)
  + [SQL updates](2024/8_28.md)
    - [New SQL functions](2024/8_28.md)
    - [CREATE and ALTER commands for replication and failover groups: Support added for tags](2024/8_28.md)
    - [Account Usage: New SEARCH_OPTIMIZATION_BENEFITS view](2024/8_28.md)
  + [Data governance updates](2024/8_28.md)
    - [Object Tagging: Support added for replication and failover groups](2024/8_28.md)
    - [Data Quality and data metric functions (DMFs) — *General Availability*](2024/8_28.md)
  + [Data loading/unloading updates](2024/8_28.md)
    - [Snowpipe: New output in SYSTEM$PIPE_STATUS](2024/8_28.md)
  + [Data pipelines updates](2024/8_28.md)
    - [Dynamic tables: Support for incremental lateral flatten](2024/8_28.md)
  + [Data lake updates](2024/8_28.md)
    - [Apache Iceberg™ tables: Support for Snowflake Open Catalog — *Preview*](2024/8_28.md)
  + [Release notes change log](2024/8_28.md)
* [July 22-25, 2024 — 8.27 Release Notes (with behavior changes)](2024/8_27.md)
  + [Behavior change bundles](2024/8_27.md)
  + [New features](2024/8_27.md)
    - [Support for sending webhook notifications to Slack, Microsoft Teams, and PagerDuty](2024/8_27.md)
    - [Triggered tasks — *General availability*](2024/8_27.md)
  + [SQL updates](2024/8_27.md)
    - [GET_DDL function: Support for warehouses](2024/8_27.md)
  + [Data governance updates](2024/8_27.md)
    - [Custom Data Classification — *General availability*](2024/8_27.md)
    - [Data Classification of tables in a schema with Snowsight — *General availability*](2024/8_27.md)
  + [Release notes change log](2024/8_27.md)
* [July 15-17, 2024 — 8.26 Release Notes](2024/8_26.md)
  + [New features](2024/8_26.md)
    - [Estimate the cost of Automatic Clustering — *Preview*](2024/8_26.md)
  + [SQL updates](2024/8_26.md)
    - [New TYPE property for USER —— *General availability*](2024/8_26.md)
  + [Extensibility updates](2024/8_26.md)
    - [Access to external network locations on AWS in the Gov region — *General availability*](2024/8_26.md)
  + [Cost management updates](2024/8_26.md)
    - [Support for Snowpark Container Services in custom budgets — *General availability*](2024/8_26.md)
  + [Release notes change log](2024/8_26.md)
* [July 08-12, 2024 — 8.25 Release Notes](2024/8_25.md)
  + [SQL updates](2024/8_25.md)
    - [Wildcards are now supported in OBJECT constants](2024/8_25.md)
  + [Release notes change log](2024/8_25.md)
* [July 01-03, 2024 — 8.24 Release Notes](2024/8_24.md)
  + [Security updates](2024/8_24.md)
    - [Trust Center: CIS Benchmarks scanner package — *General availability*](2024/8_24.md)
    - [Authentication policies: New multi-factor authentication parameters](2024/8_24.md)
  + [Virtual warehouse updates](2024/8_24.md)
    - [Hybrid tables: Changes to capacity quotas — *Preview*](2024/8_24.md)
  + [Release notes change log](2024/8_24.md)
* [June 17-30, 2024 — 8.23 Release Notes](2024/8_23.md)
  + [Security updates](2024/8_23.md)
    - [Trust Center: Security Essentials scanner package — *General availability*](2024/8_23.md)
  + [SQL updates](2024/8_23.md)
    - [Window functions: extended support for RANGE BETWEEN — *Preview*](2024/8_23.md)
    - [Account Usage: TABLE_DML_HISTORY and TABLE_PRUNING_HISTORY views — *General availability*](2024/8_23.md)
  + [Data governance updates](2024/8_23.md)
    - [Data quality: add new system DMFs — *Preview*](2024/8_23.md)
  + [Data pipelines updates](2024/8_23.md)
    - [ALTER DYNAMIC TABLE command: Support for adding search optimization and setting additional properties](2024/8_23.md)
  + [Snowflake Native App Framework](2024/8_23.md)
    - [Updates to logging and tracing for a Snowflake Native App — *Preview*](2024/8_23.md)
    - [New events generated during app installation and upgrade — *Preview*](2024/8_23.md)
  + [Release notes change log](2024/8_23.md)
* [June 10-15, 2024 — 8.22 Release Notes (with behavior changes)](2024/8_22.md)
  + [Behavior change bundles](2024/8_22.md)
  + [SQL updates](2024/8_22.md)
    - [Some bitwise expression functions support BINARY data](2024/8_22.md)
  + [Virtual warehouse updates](2024/8_22.md)
    - [Account Usage: WAREHOUSE_EVENTS_HISTORY view — *General availability*](2024/8_22.md)
  + [Release notes change log](2024/8_22.md)
* [May 28-30, 2024 — 8.21 Release Notes](2024/8_21.md)
  + [New features](2024/8_21.md)
    - [Triggered tasks — *Preview*](2024/8_21.md)
  + [SQL updates](2024/8_21.md)
    - [Email notification integrations no longer limited to 10](2024/8_21.md)
    - [UNPIVOT supports rows with NULLs in results](2024/8_21.md)
  + [Data loading / unloading updates](2024/8_21.md)
    - [New Parquet file format option USE_VECTORIZED_SCANNER — *General Availability*](2024/8_21.md)
  + [Streamlit in Snowflake updates](2024/8_21.md)
    - [Support for v1.29.0 and v1.31.1 of the Streamlit library](2024/8_21.md)
  + [Release notes change log](2024/8_21.md)
* [May 20-22, 2024 — 8.20 Release Notes](2024/8_20.md)
  + [New features](2024/8_20.md)
    - [Trust Center — *Preview*](2024/8_20.md)
  + [SQL updates](2024/8_20.md)
    - [CREATE OR ALTER TABLE and CREATE OR ALTER TASK — *Preview*](2024/8_20.md)
  + [Apache Iceberg™ table updates](2024/8_20.md)
    - [Apache Iceberg™ tables — *General availability*](2024/8_20.md)
    - [Replace invalid UTF-8 characters in Iceberg tables](2024/8_20.md)
    - [Structured type evolution for Iceberg tables](2024/8_20.md)
    - [Set a storage serialization policy](2024/8_20.md)
    - [Change ALLOW_WRITES to FALSE for external volumes](2024/8_20.md)
    - [New ICEBERG_ACCESS_ERRORS view](2024/8_20.md)
  + [Release notes change log](2024/8_20.md)
* [May 13-15, 2024 — 8.19 Release Notes](2024/8_19.md)
  + [New features](2024/8_19.md)
    - [Serverless alerts — *Preview*](2024/8_19.md)
  + [Security updates](2024/8_19.md)
    - [Tri-Secret Secure self-registration](2024/8_19.md)
  + [SQL updates](2024/8_19.md)
    - [Jinja2 template support for EXECUTE IMMEDIATE FROM — *Preview*](2024/8_19.md)
  + [Data loading/unloading updates](2024/8_19.md)
    - [Resolved a known issue for INCLUDE_METADATA copy option](2024/8_19.md)
  + [Release notes change log](2024/8_19.md)
* [May 08-09, 2024 — 8.18 Release Notes](2024/8_18.md)
  + [SQL updates](2024/8_18.md)
    - [Dynamic pivot available](2024/8_18.md)
    - [Added support for structured data types in UDFs](2024/8_18.md)
    - [New SQL functions](2024/8_18.md)
  + [Extensibility updates](2024/8_18.md)
    - [Python user-defined aggregate functions — *Preview*](2024/8_18.md)
    - [Access to external network locations on AWS in the Gov region — *Preview*](2024/8_18.md)
  + [Release notes change log](2024/8_18.md)
* [April 30-May 07, 2024 — 8.17 Release Notes (with behavior changes)](2024/8_17.md)
  + [Behavior change bundles](2024/8_17.md)
  + [Security updates](2024/8_17.md)
    - [Authentication enhancements — *General Availability*](2024/8_17.md)
  + [SQL updates](2024/8_17.md)
    - [READ ONLY property available for tables](2024/8_17.md)
    - [ST_INTERSECTION_AGG and ST_UNION_AGG functions — *General Availability*](2024/8_17.md)
  + [Data loading /unloading updates](2024/8_17.md)
    - [New copy option: INCLUDE_METADATA](2024/8_17.md)
  + [Release notes change log](2024/8_17.md)
* [April 22-24, 2024 — 8.16 Release Notes](2024/8_16.md)
  + [SQL updates](2024/8_16.md)
    - [New SQL command(s)](2024/8_16.md)
    - [SQL API support for hybrid tables](2024/8_16.md)
  + [Extensibility updates](2024/8_16.md)
    - [Asynchronous job support in Snowpark stored procedures](2024/8_16.md)
  + [Data Lake Updates](2024/8_16.md)
    - [Apache Iceberg™ tables: Support for un-materialized identity partition columns](2024/8_16.md)
  + [Release notes change log](2024/8_16.md)
* [April 17-19, 2024 — 8.15 Release Notes](2024/8_15.md)
  + [SQL updates](2024/8_15.md)
    - [New SQL functions](2024/8_15.md)
  + [Data loading / unloading updates](2024/8_15.md)
    - [Support for granting the READ and WRITE privileges on external stages](2024/8_15.md)
  + [Release notes change log](2024/8_15.md)
* [April 08-15, 2024 — 8.14 Release Notes](2024/8_14.md)
  + [New regions](2024/8_14.md)
  + [Extensibility updates](2024/8_14.md)
    - [Python UDTFs with vectorized process methods — *General Availability*](2024/8_14.md)
  + [Snowflake Cortex updates](2024/8_14.md)
    - [Forecasting improvements in Snowflake Cortex ML Functions](2024/8_14.md)
  + [Release notes change log](2024/8_14.md)
* [April 01-03, 2024 — 8.13 Release Notes](2024/8_13.md)
  + [Snowflake Cortex updates](2024/8_13.md)
    - [Evaluation Metrics for Forecasting and Anomaly Detection](2024/8_13.md)
  + [SQL updates](2024/8_13.md)
    - [Fixed an issue with the PARSE_IP function](2024/8_13.md)
    - [Fixed an issue with the SPLIT_PART function](2024/8_13.md)
  + [Extensibility updates](2024/8_13.md)
    - [Access to Git repositories from Snowflake — *Preview*](2024/8_13.md)
  + [Release notes change log](2024/8_13.md)
* [March 26-27, 2024 — 8.12 Release Notes (with behavior changes)](2024/8_12.md)
  + [Behavior Change Bundles](2024/8_12.md)
  + [SQL Updates](2024/8_12.md)
    - [Organization Usage: Improved views for billing reconciliation — *General Availability*](2024/8_12.md)
  + [Data Pipeline Updates](2024/8_12.md)
    - [Replication: Stages, pipes, storage integrations, load history, and Snowpipe Streaming — *General Availability*](2024/8_12.md)
    - [Schema detection and evolution for Kafka connector with Snowpipe Streaming — *General Availability*](2024/8_12.md)
  + [Data Governance Updates](2024/8_12.md)
    - [Memoizable functions with constant arguments](2024/8_12.md)
    - [Share data protected by a role-based policy — *General Availability*](2024/8_12.md)
    - [Access History: Stored procedure ancestor queries — *General Availability*](2024/8_12.md)
    - [Shared tag references — *General Availability*](2024/8_12.md)
    - [Access History: Track objects modified by a DDL operation — *General Availability*](2024/8_12.md)
  + [Release Notes Change Log](2024/8_12.md)
* [March 18-20, 2024 — 8.11 Release Notes](2024/8_11.md)
  + [SQL updates](2024/8_11.md)
    - [SELECT supports trailing commas](2024/8_11.md)
  + [Data loading / unloading Updates](2024/8_11.md)
    - [Performance improvements for loading JSON files](2024/8_11.md)
    - [Improvements to the SNOWPIPE_STREAMING_CLIENT_HISTORY view](2024/8_11.md)
  + [Release notes change log](2024/8_11.md)
* [March 11-12, 2024 — 8.10 Release Notes (no announcements)](2024/8_10.md)
  + [Announcements](2024/8_10.md)
  + [Release Notes Change Log](2024/8_10.md)
* [March 04-05, 2024 — 8.9 Release Notes](2024/8_09.md)
  + [Non-bundled Behavior Changes](2024/8_09.md)
  + [Data Governance Updates](2024/8_09.md)
    - [Custom Classification — *Preview*](2024/8_09.md)
  + [Data lake updates](2024/8_09.md)
    - [Primary key information added to Apache Iceberg™ table metadata](2024/8_09.md)
  + [Release Notes Change Log](2024/8_09.md)
* [February 26-28, 2024 — 8.8 Release Notes](2024/8_08.md)
  + [Data Lake Updates](2024/8_08.md)
    - [Secure Data Sharing for Apache Iceberg™ tables](2024/8_08.md)
    - [Query an Apache Iceberg™ table without granting the USAGE privilege on related objects](2024/8_08.md)
  + [Extensibility Updates](2024/8_08.md)
    - [External network access — *General Availability*](2024/8_08.md)
  + [Release Notes Change Log](2024/8_08.md)
* [February 19-21, 2024 — 8.7 Release Notes (with behavior changes)](2024/8_07.md)
  + [Behavior Change Bundles](2024/8_07.md)
  + [SQL Updates](2024/8_07.md)
    - [SQL Functions Add Support for the `upper`, `lower`, and `trim` Collation Specifiers](2024/8_07.md)
  + [Data Lake Updates](2024/8_07.md)
    - [GET_DDL for external tables supports fully-qualified location names](2024/8_07.md)
  + [Release Notes Change Log](2024/8_07.md)
* [February 12-14, 2024 — 8.6 Release Notes](2024/8_06.md)
  + [SQL Updates](2024/8_06.md)
    - [New SQL functions](2024/8_06.md)
  + [Data Loading / Unloading Updates](2024/8_06.md)
    - [Specify an external ID for AWS storage access](2024/8_06.md)
  + [Release Notes Change Log](2024/8_06.md)
* [February 05-06, 2024 — 8.5 Release Notes](2024/8_05.md)
  + [Security Updates](2024/8_05.md)
    - [External API authentication and secrets — *General Availability*](2024/8_05.md)
  + [Extensibility Updates](2024/8_05.md)
    - [External network access — *General Availability*](2024/8_05.md)
    - [Python packages policies — *General Availability*](2024/8_05.md)
  + [Data Loading / Unloading Updates](2024/8_05.md)
    - [COPY FILES — *Preview*](2024/8_05.md)
  + [Data Governance Updates](2024/8_05.md)
    - [Data Classification: Asynchronous tag assignments for columns of tables in a schema and automate tagging for a single classification event — *Preview*](2024/8_05.md)
  + [Release Notes Change Log](2024/8_05.md)
* [January 29-30, 2024 — 8.4 Release Notes](2024/8_04.md)
  + [Security Updates](2024/8_04.md)
    - [Authentication enhancements — *Preview*](2024/8_04.md)
  + [Virtual Warehouse Updates](2024/8_04.md)
    - [Larger warehouses — *General Availability in Microsoft Azure Regions*](2024/8_04.md)
  + [Extensibility Updates](2024/8_04.md)
    - [External network access — *Preview*](2024/8_04.md)
    - [Java 17 support — *Preview*](2024/8_04.md)
  + [Data Loading / Unloading Updates](2024/8_04.md)
    - [Snowpipe update: a new pipe status](2024/8_04.md)
  + [Data Pipeline Updates](2024/8_04.md)
    - [Automatic task graph retry — *General Availability*](2024/8_04.md)
  + [Release Notes Change Log](2024/8_04.md)
* [January 22-23, 2024 — 8.3 Release Notes](2024/8_03.md)
  + [Security Updates](2024/8_03.md)
    - [Network rules — *General Availability*](2024/8_03.md)
    - [Enhanced network security — *General Availability*](2024/8_03.md)
    - [Network isolation to internal stages using AWS PrivateLink — *General Availability*](2024/8_03.md)
  + [Release Notes Change Log](2024/8_03.md)
* [January 15-17, 2024 — 8.2 Release Notes (with behavior changes)](2024/8_02.md)
  + [Behavior Change Bundles](2024/8_02.md)
  + [New Features](2024/8_02.md)
    - [Access History: Support added for stored procedure ancestor queries — Preview](2024/8_02.md)
  + [Release Notes Change Log](2024/8_02.md)
* [January 08-10, 2024 — 8.1 Release Notes](2024/8_01.md)
  + [New Features](2024/8_01.md)
    - [EXECUTE IMMEDIATE FROM File — *General Availability*](2024/8_01.md)
  + [SQL Updates](2024/8_01.md)
    - [CREATE <object> … CLONE command: New parameter](2024/8_01.md)
  + [New SQL functions](2024/8_01.md)
  + [Extensibility Updates](2024/8_01.md)
    - [Support for Python 3.11 in Snowpark, UDFs, UDTFs and stored procedures — *General Availability*](2024/8_01.md)
  + [Release Notes Change Log](2024/8_01.md)
* [January 03-04, 2024 — 8.0 Release Notes](2024/8_00.md)
  + [Extensibility Updates](2024/8_00.md)
    - [Account Usage: New EXTERNAL_ACCESS_HISTORY View](2024/8_00.md)
  + [Data Collaboration Updates](2024/8_00.md)
    - [Organization Usage: New LISTING_AUTO_FULFILLMENT_USAGE_HISTORY View](2024/8_00.md)
  + [Release Notes Change Log](2024/8_00.md)

---
title: Server releases in 2025
source: https://docs.snowflake.com/en/release-notes/weekly-releases-2025.md
section: Release Notes
---

# Server releases in 2025

This topic lists the release notes for server releases that occurred in 2025.

For more recent releases, see [Snowflake server release notes and feature updates](new-features.md).

* [9.40 Release Notes: Dec 15, 2025-Jan 09, 2026](2025/9_40.md)
  + [New features](2025/9_40.md)
    - [Notifications for data quality incidents (*Preview*)](2025/9_40.md)
  + [Deprecated features](2025/9_40.md)
    - [Deprecation of external OpenAI model routing for Cortex Analyst](2025/9_40.md)
  + [SQL updates](2025/9_40.md)
    - [Semantic views: Using standard SQL clauses to query semantic views (*Preview*)](2025/9_40.md)
  + [Release notes change log](2025/9_40.md)
* [9.39 Release Notes: Dec 08, 2025-Dec 12, 2025](2025/9_39.md)
  + [Security updates](2025/9_39.md)
    - [Trust Center: Detection findings and event-driven scanners (*Preview*)](2025/9_39.md)
    - [Programmatic access tokens: Removing the single-role restriction for service users](2025/9_39.md)
  + [Release notes change log](2025/9_39.md)
* [9.38 Release Notes: Dec 03, 2025-Dec 05, 2025](2025/9_38.md)
  + [SQL updates](2025/9_38.md)
    - [Query insights: Support for queries that benefit from the query acceleration service](2025/9_38.md)
  + [Release notes change log](2025/9_38.md)
* [9.37 Release Notes: Nov 17, 2025-Nov 20, 2025](2025/9_37.md)
  + [SQL updates](2025/9_37.md)
    - [Preparation for renaming Snapshots feature to Backups](2025/9_37.md)
    - [New DECFLOAT data type](2025/9_37.md)
  + [Documentation and learning resources](2025/9_37.md)
    - [New topic that provides an overview of Snowflake authentication methods](2025/9_37.md)
  + [Release notes change log](2025/9_37.md)
* [9.36 Release Notes: Nov 10, 2025-Nov 16, 2025](2025/9_36.md)
  + [SQL updates](2025/9_36.md)
    - [Enhanced SQL functionality](2025/9_36.md)
  + [Extensibility updates](2025/9_36.md)
    - [Support for OAuth when authenticating with GitHub (*General availability*)](2025/9_36.md)
    - [Run Apache Spark™ workloads on Snowflake (*General availability*)](2025/9_36.md)
    - [Support for connecting Scala applications to Snowpark Connect for Spark (*Preview*)](2025/9_36.md)
  + [Data governance updates](2025/9_36.md)
    - [Anomaly detection for Data Quality Monitoring (*Preview*)](2025/9_36.md)
  + [Release notes change log](2025/9_36.md)
* [9.35 Release Notes: Nov 03, 2025-Nov 07, 2025](2025/9_35.md)
  + [SQL updates](2025/9_35.md)
    - [Interval data types (*Preview*)](2025/9_35.md)
  + [Data lake updates](2025/9_35.md)
    - [Replicate Snowflake-managed Apache Iceberg™ tables (*Preview*)](2025/9_35.md)
  + [Release notes change log](2025/9_35.md)
* [9.34 Release Notes (no announcements): Oct 27, 2025-Oct 29, 2025](2025/9_34.md)
  + [Release notes change log](2025/9_34.md)
* [9.33 Release Notes: Oct 21, 2025-Oct 23, 2025](2025/9_33.md)
  + [Security updates](2025/9_33.md)
    - [AWS cross-region support for PrivateLink (*General availability*)](2025/9_33.md)
    - [Outbound network traffic to stages and volumes on Google Cloud Storage supports private connectivity (*General availability*)](2025/9_33.md)
    - [Snowflake-managed network rules (*General availability*)](2025/9_33.md)
  + [SQL updates](2025/9_33.md)
    - [Semantic views: Support for ASOF JOIN](2025/9_33.md)
  + [Release notes change log](2025/9_33.md)
* [9.32 Release Notes (with behavior changes): Oct 13, 2025-Oct 15, 2025](2025/9_32.md)
  + [Behavior change bundles](2025/9_32.md)
  + [Data lake updates](2025/9_32.md)
    - [Catalog-linked databases: Auto-refresh for Apache Iceberg™ table creation](2025/9_32.md)
    - [Table optimization for Snowflake-managed Apache Iceberg™ tables (*General availability*)](2025/9_32.md)
  + [Replication updates](2025/9_32.md)
    - [Snowflake Notebooks replication (*General availability*)](2025/9_32.md)
  + [Release notes change log](2025/9_32.md)
* [9.31 Release Notes: Oct 06, 2025-Oct 08, 2025](2025/9_31.md)
  + [Security updates](2025/9_31.md)
    - [Tri-Secret Secure supports private connectivity](2025/9_31.md)
  + [Data lake updates](2025/9_31.md)
    - [Query data compaction jobs for Apache Iceberg™ tables](2025/9_31.md)
  + [Release notes change log](2025/9_31.md)
* [9.30 Release Notes: Sep 29, 2025-Oct 01, 2025](2025/9_30.md)
  + [Security updates](2025/9_30.md)
    - [Hybrid table support for Tri-Secret Secure](2025/9_30.md)
  + [SQL updates](2025/9_30.md)
    - [Update to the 2025b release of the TZDB](2025/9_30.md)
    - [MERGE ALL BY NAME](2025/9_30.md)
    - [Aliases for PIVOT and UNPIVOT columns](2025/9_30.md)
    - [New SQL parameter: ENABLE_GET_DDL_USE_DATA_TYPE_ALIAS](2025/9_30.md)
    - [Reference table columns in lambda expressions when calling higher-order functions](2025/9_30.md)
    - [SEARCH function supports PHRASE and EXACT search modes](2025/9_30.md)
    - [Snowflake Scripting CONTINUE handlers](2025/9_30.md)
    - [Snowflake Scripting user-defined functions (UDFs) (*General availability*)](2025/9_30.md)
    - [Semantic views: Support for dimensions that use a Cortex Search Service](2025/9_30.md)
  + [Release notes change log](2025/9_30.md)
* [9.29 Release Notes: Sep 24, 2025-Sep 26, 2025](2025/9_29.md)
  + [New features](2025/9_29.md)
    - [Declarative Shared Native Apps (*Preview*)](2025/9_29.md)
    - [Cortex Agent Monitoring (*Preview*)](2025/9_29.md)
  + [Data collaboration updates](2025/9_29.md)
    - [Cross-Cloud Auto-Fulfillment support for open table formats](2025/9_29.md)
  + [Data pipeline updates](2025/9_29.md)
    - [CREATE OR ALTER DYNAMIC TABLE (*Preview*)](2025/9_29.md)
  + [Data governance updates](2025/9_29.md)
    - [Data quality: FRESHNESS data metric function improvement](2025/9_29.md)
  + [Release notes change log](2025/9_29.md)
* [9.28 Release Notes: Sep 15, 2025-Sep 17, 2025](2025/9_28.md)
  + [SQL updates](2025/9_28.md)
    - [Query insights in Snowsight (*Preview*)](2025/9_28.md)
  + [Data pipeline updates](2025/9_28.md)
    - [dbt Projects on Snowflake: Support for dbt retry (*Preview*)](2025/9_28.md)
  + [Release notes change log](2025/9_28.md)
* [9.27 Release Notes (with behavior changes): Sep 08, 2025-Sep 10, 2025](2025/9_27.md)
  + [Behavior change bundles](2025/9_27.md)
  + [SQL updates](2025/9_27.md)
    - [Retrieve bind variable values (*Preview*)](2025/9_27.md)
  + [Data pipeline updates](2025/9_27.md)
    - [Dynamic tables: Support for base tables with zero data retention](2025/9_27.md)
  + [Data lake updates](2025/9_27.md)
    - [New system function to replace the catalog integration for an externally managed Apache Iceberg™ table](2025/9_27.md)
  + [Release notes change log](2025/9_27.md)
* [9.26 Release notes: Sep 01, 2025-Sep 04, 2025](2025/9_26.md)
  + [SQL updates](2025/9_26.md)
    - [Filling gaps in time-series data (*Preview*)](2025/9_26.md)
    - [Account Usage: New INGRESS_NETWORK_ACCESS_HISTORY view](2025/9_26.md)
    - [Account Usage: New INTERNAL_STAGE_NETWORK_ACCESS_HISTORY view](2025/9_26.md)
  + [Release notes change log](2025/9_26.md)
* [9.25 Release notes: Aug 25, 2025-Aug 28, 2025](2025/9_25.md)
  + [New features](2025/9_25.md)
    - [Sensitive data classification: Automatic classification of a database (*General availability*)](2025/9_25.md)
  + [Security updates](2025/9_25.md)
    - [Support for keys generated with Elliptic Curve Digital Signature Algorithms (ECDSA)](2025/9_25.md)
  + [SQL updates](2025/9_25.md)
    - [Querying semantic views (*General availability*)](2025/9_25.md)
    - [Semantic views: Listing facts in a view, schema, database, or account](2025/9_25.md)
    - [Semantic views: Support for renaming views](2025/9_25.md)
  + [Data lake updates](2025/9_25.md)
    - [Apache Iceberg™ tables: Row-level deletes for externally managed tables (*General availability*)](2025/9_25.md)
  + [Data governance updates](2025/9_25.md)
    - [Data quality: Updated privilege model allows non-owners to associate a data metric function with an object (*Preview*)](2025/9_25.md)
    - [Object tags: New limit for allowed values](2025/9_25.md)
  + [Release notes change log](2025/9_25.md)
* [9.24 Release notes: Aug 18, 2025-Aug 20, 2025](2025/9_24.md)
  + [Security updates](2025/9_24.md)
    - [Self-service activation of Tri-Secret Secure (*General availability*)](2025/9_24.md)
  + [SQL updates](2025/9_24.md)
    - [ALTER LISTING command to simplify adding and removing targets (*General availability*)](2025/9_24.md)
  + [New features](2025/9_24.md)
    - [Snowflake Native App Framework - MONITOR privilege support for apps (*General availability*)](2025/9_24.md)
  + [Data lake updates](2025/9_24.md)
    - [Set a target file size for Apache Iceberg™ tables (*Preview*)](2025/9_24.md)
  + [Release notes change log](2025/9_24.md)
* [9.23 Release notes: Aug 11, 2025-Aug 15, 2025](2025/9_23.md)
  + [SQL updates](2025/9_23.md)
    - [Snowflake Scripting user-defined functions (UDFs)](2025/9_23.md)
    - [Private facts and metrics in semantic views](2025/9_23.md)
  + [Data loading / unloading updates](2025/9_23.md)
    - [Apache Arrow library upgrade to version 21.0.0](2025/9_23.md)
  + [Data pipeline updates](2025/9_23.md)
    - [Dynamic tables: Support for UNION in incremental refresh mode](2025/9_23.md)
  + [Release notes change log](2025/9_23.md)
* [9.22 Release notes (with behavior changes): Aug 04, 2025-Aug 08, 2025](2025/9_22.md)
  + [Behavior change bundles](2025/9_22.md)
  + [New features](2025/9_22.md)
    - [Data quality: Using expectations to define quality checks (*General availability*)](2025/9_22.md)
  + [Extensibility updates](2025/9_22.md)
    - [Tracing SQL statements run from handler code (*General availability*)](2025/9_22.md)
  + [Data pipeline updates](2025/9_22.md)
    - [Dynamic tables: Support for immutability constraints](2025/9_22.md)
    - [Dynamic tables: Support for backfill](2025/9_22.md)
  + [Release notes change log](2025/9_22.md)
* [9.21 Release notes: Jul 29, 2025-Aug 01, 2025](2025/9_21.md)
  + [Security updates](2025/9_21.md)
    - [GENERATE_SYNTHETIC_DATA: Consistency secret now optional in most cases](2025/9_21.md)
  + [SQL updates](2025/9_21.md)
    - [Account Usage: TABLE_QUERY_PRUNING_HISTORY and COLUMN_QUERY_PRUNING_HISTORY views (*General availability*)](2025/9_21.md)
    - [The SEARCH_IP function supports searching for IPv6 addresses](2025/9_21.md)
    - [Generating YAML for a semantic view and creating a semantic view from YAML](2025/9_21.md)
  + [Data loading / unloading updates](2025/9_21.md)
    - [Simplified Snowpipe pricing](2025/9_21.md)
  + [Data pipeline updates](2025/9_21.md)
    - [Snowpark Connect for Spark and Snowpark Submit (*Preview*)](2025/9_21.md)
  + [Release notes change log](2025/9_21.md)
* [9.20 Release notes: Jul 21, 2025-Jul 25, 2025](2025/9_20.md)
  + [SQL updates](2025/9_20.md)
    - [CREATE INDEX command supports INCLUDE columns](2025/9_20.md)
    - [Semantic views: Listing dimensions and metrics in a view, schema, database, or account](2025/9_20.md)
    - [New query insights about join performance and optimization](2025/9_20.md)
  + [Data pipeline updates](2025/9_20.md)
    - [Tasks: New EXECUTE AS USER option and IMPERSONATE privilege for user objects](2025/9_20.md)
    - [Dynamic tables: Disallowed use of the COPY_SESSION attribute while manually refreshing dynamic tables on a serverless warehouse](2025/9_20.md)
  + [Release notes change log](2025/9_20.md)
* [9.19 Release notes: Jul 14, 2025-Jul 17, 2025](2025/9_19.md)
  + [SQL updates](2025/9_19.md)
    - [Data types: Structured types support for standard Snowflake tables (*General availability*)](2025/9_19.md)
  + [Data loading / unloading updates](2025/9_19.md)
    - [Optimize data ingestion with pre-clustering for Snowpipe Streaming - high-performance architecture (*Preview*)](2025/9_19.md)
    - [COPY FILES (*General availability*)](2025/9_19.md)
  + [Release notes change log](2025/9_19.md)
* [9.18 Release notes: Jul 02, 2025-Jul 08, 2025](2025/9_18.md)
  + [SQL updates](2025/9_18.md)
    - [Snowflake Scripting output (OUT) arguments (*General availability*)](2025/9_18.md)
  + [Data pipeline updates](2025/9_18.md)
    - [Dynamic tables: Support for externally managed Apache Iceberg™ tables (*General availability*)](2025/9_18.md)
  + [Data governance updates](2025/9_18.md)
    - [Data Quality: New system data metric function](2025/9_18.md)
  + [Release notes change log](2025/9_18.md)
* [9.17 Release notes (with behavior changes): Jun 24, 2025-Jun 30, 2025](2025/9_17.md)
  + [Behavior change bundles](2025/9_17.md)
  + [SQL updates](2025/9_17.md)
    - [New maximum size limits for database objects (*General availability*)](2025/9_17.md)
    - [Snowflake Scripting supports nested stored procedures (*General availability*)](2025/9_17.md)
  + [Data pipeline updates](2025/9_17.md)
    - [Snowsight: Task Overview and Graph Run History updates (*General availability*)](2025/9_17.md)
  + [Release notes change log](2025/9_17.md)
* [9.16 Release notes (no announcements): Jun 16, 2025-Jun 23, 2025](2025/9_16.md)
  + [Release notes change log](2025/9_16.md)
* [9.15 Release notes: Jun 09, 2025-Jun 11, 2025](2025/9_15.md)
  + [New features](2025/9_15.md)
    - [Artifact Repository (*General availability*)](2025/9_15.md)
  + [Security updates](2025/9_15.md)
    - [Malicious IP Protection](2025/9_15.md)
    - [Findings Lifecycle Management](2025/9_15.md)
  + [SQL updates](2025/9_15.md)
    - [UNION BY NAME operator](2025/9_15.md)
  + [Data pipeline updates](2025/9_15.md)
    - [Support for streams on externally managed Apache Iceberg™ tables with row-level deletes](2025/9_15.md)
  + [Release notes change log](2025/9_15.md)
* [9.14 Release notes: May 23, 2025-May 28, 2025](2025/9_14.md)
  + [New features](2025/9_14.md)
    - [Trust Center: In-app notifications (*Preview*)](2025/9_14.md)
    - [Trust Center: New Abnormal Failure Rate Detection scanners](2025/9_14.md)
  + [Snowpark Python version updates](2025/9_14.md)
  + [SQL updates](2025/9_14.md)
    - [Search optimization: Support for Apache Iceberg™ tables](2025/9_14.md)
    - [Query Acceleration Service: Support for Apache Iceberg™ tables](2025/9_14.md)
    - [Data types: Structured types support for standard Snowflake tables (*Preview*)](2025/9_14.md)
  + [Data pipeline updates](2025/9_14.md)
    - [Triggered tasks: Support for streams hosted on directory tables and data shares](2025/9_14.md)
  + [Release notes change log](2025/9_14.md)
* [9.13 Release notes: May 19, 2025-May 20, 2025](2025/9_13.md)
  + [Security updates](2025/9_13.md)
    - [Outbound private connectivity for AWS Government regions](2025/9_13.md)
  + [SQL updates](2025/9_13.md)
    - [Pipe operator](2025/9_13.md)
  + [Data loading/unloading updates](2025/9_13.md)
    - [INFER_SCHEMA function: Support for Apache Iceberg™ data types](2025/9_13.md)
  + [Data lake updates](2025/9_13.md)
    - [Cross-cloud/cross-region support for Snowflake-managed Apache Iceberg™ tables](2025/9_13.md)
  + [Release notes change log](2025/9_13.md)
* [9.12 Release notes (with behavior changes): May 05, 2025-May 12, 2025](2025/9_12.md)
  + [Behavior change bundles](2025/9_12.md)
  + [New features](2025/9_12.md)
    - [Release channels for Snowflake Native Apps (*General availability*)](2025/9_12.md)
  + [SQL Updates](2025/9_12.md)
    - [Improved error messages for Data Manipulation Language (DML) commands](2025/9_12.md)
    - [New SQL functions](2025/9_12.md)
  + [Extensibility updates](2025/9_12.md)
    - [Built-in code profiler for Python stored procedures (*General availability*)](2025/9_12.md)
  + [Data loading / unloading updates](2025/9_12.md)
    - [Support for internal stage cloning (*General availability*)](2025/9_12.md)
    - [Vectorized scanner now available without ON_ERROR restrictions](2025/9_12.md)
  + [Data governance updates](2025/9_12.md)
    - [Sensitive data classification: New classifiers for India](2025/9_12.md)
  + [Snowpark Container Services updates](2025/9_12.md)
    - [Using caller’s rights to connect to Snowflake (*General availability*)](2025/9_12.md)
  + [Release notes change log](2025/9_12.md)
* [9.11 Release notes: Apr 28, 2025-May 02, 2025](2025/9_11.md)
  + [New features](2025/9_11.md)
    - [Snowflake Native Apps session debugging (*General availability*)](2025/9_11.md)
    - [Writing files from Snowpark Python UDFs and UDTFs (*General availability*)](2025/9_11.md)
  + [Extensibility updates](2025/9_11.md)
    - [Support for allowing requests to all outbound endpoints from functions and procedures (*General availability*)](2025/9_11.md)
  + [Data lake updates](2025/9_11.md)
    - [Support for Iceberg tables in the People’s Republic of China (*General availability*)](2025/9_11.md)
  + [Decommissioned runtimes](2025/9_11.md)
    - [Python 3.8 decommissioned](2025/9_11.md)
  + [Release notes change log](2025/9_11.md)
* [9.10 Release notes: Apr 14, 2025-Apr 22, 2025](2025/9_10.md)
  + [Extensibility updates](2025/9_10.md)
    - [Support for public custom Git repository URLs (*General availability*)](2025/9_10.md)
  + [Data loading / unloading updates](2025/9_10.md)
    - [Automated refresh for internal named stages (*Preview*)](2025/9_10.md)
    - [Auto-ingest pipes for internal named stages (*Preview*)](2025/9_10.md)
  + [Data lake updates](2025/9_10.md)
    - [Apache Iceberg™ tables: Automated refresh table names now appear in the ACCOUNT_USAGE.PIPE_USAGE_HISTORY view](2025/9_10.md)
  + [Privacy updates](2025/9_10.md)
    - [Synthetic data generation (*General availability*)](2025/9_10.md)
  + [Release notes change log](2025/9_10.md)
* [9.9 Release notes: Apr 07, 2025-Apr 09, 2025](2025/9_09.md)
  + [SQL updates](2025/9_09.md)
    - [New Snowflake parameter: DEFAULT_NULL_ORDERING](2025/9_09.md)
  + [Extensibility updates](2025/9_09.md)
    - [Artifact Repository (*Preview*)](2025/9_09.md)
  + [Release notes change log](2025/9_09.md)
* [9.8 Release notes: Mar 31, 2025-Apr 04, 2025](2025/9_08.md)
  + [Security updates](2025/9_08.md)
    - [Trust Center: Risky Human and Service User scanners](2025/9_08.md)
  + [SQL updates](2025/9_08.md)
    - [Asynchronous refresh for failover groups and replication groups](2025/9_08.md)
    - [Bind variables in SHOW commands](2025/9_08.md)
  + [Data lake updates](2025/9_08.md)
    - [Apache Iceberg™ tables: Row-level deletes for externally managed tables (*Preview*)](2025/9_08.md)
    - [Apache Iceberg™ tables: Delta table support (*General availability*)](2025/9_08.md)
    - [New database properties: CATALOG_SYNC_NAMESPACE_MODE and CATALOG_SYNC_NAMESPACE_FLATTEN_DELIMITER](2025/9_08.md)
  + [Snowpark Container Services updates](2025/9_08.md)
    - [Automatic suspension of a Snowpark Container Services service (*Preview*)](2025/9_08.md)
  + [Release notes change log](2025/9_08.md)
* [9.7 Release notes (with behavior changes): Mar 17, 2025-Mar 27, 2025](2025/9_07.md)
  + [Behavior change bundles](2025/9_07.md)
  + [New features](2025/9_07.md)
    - [Grant database roles to a Snowflake Native App — *Preview*](2025/9_07.md)
    - [DISABLE_UI_DOWNLOAD_BUTTON object parameter for Snowsight and the Classic Console (*General availability*)](2025/9_07.md)
  + [Replication updates](2025/9_07.md)
    - [Schema-level replication for failover groups (*General availability*)](2025/9_07.md)
  + [SQL updates](2025/9_07.md)
    - [Semi-structured data: XML format (*General availability*)](2025/9_07.md)
    - [Spread operator](2025/9_07.md)
    - [New maximum size limits for database objects (*Preview*)](2025/9_07.md)
  + [Release notes change log](2025/9_07.md)
* [9.6 Release notes: Mar 10, 2025-Mar 12, 2025](2025/9_06.md)
  + [SQL updates](2025/9_06.md)
    - [Search optimization: Support for column collations](2025/9_06.md)
  + [Data pipeline updates](2025/9_06.md)
    - [Dynamic tables: Maximum number of dynamic tables in an account increased to 50,000](2025/9_06.md)
  + [Release notes change log](2025/9_06.md)
* [9.5 Release notes: Mar 03, 2025-Mar 06, 2025](2025/9_05.md)
  + [New features](2025/9_05.md)
    - [Automatic sensitive data classification (*General availability*)](2025/9_05.md)
  + [SQL updates](2025/9_05.md)
    - [Snowflake Scripting: Asynchronous child jobs (*General availability*)](2025/9_05.md)
    - [Snowflake Scripting: Improved error messages](2025/9_05.md)
  + [Release notes change log](2025/9_05.md)
* [9.4 Release notes: Feb 24, 2025-Mar 01, 2025](2025/9_04.md)
  + [New features](2025/9_04.md)
    - [Additional information returned for objects bound to references (*General availability*)](2025/9_04.md)
    - [More granular control for log, trace, and metric levels in an app (*General availability*)](2025/9_04.md)
  + [SQL updates](2025/9_04.md)
    - [Cloning databases that contain hybrid tables (*Preview*)](2025/9_04.md)
    - [New SQL functions](2025/9_04.md)
  + [Extensibility updates](2025/9_04.md)
    - [Support for associating an event table with a database (*General availability*)](2025/9_04.md)
  + [Data loading updates](2025/9_04.md)
    - [Dynamic tables and tasks: Events logged for refreshes and task executions](2025/9_04.md)
  + [Data lake updates](2025/9_04.md)
    - [CATALOG_NAMESPACE parameter for catalog integrations is now optional](2025/9_04.md)
  + [Release notes change log](2025/9_04.md)
* [9.3 Release notes: Feb 18, 2025-Feb 21, 2025](2025/9_03.md)
  + [New features](2025/9_03.md)
    - [Tasks now support lower scheduling intervals (*General availability*)](2025/9_03.md)
    - [Data lineage (*General availability*)](2025/9_03.md)
  + [SQL updates](2025/9_03.md)
    - [SEARCH function: Support for conjunctive semantics](2025/9_03.md)
  + [Extensibility updates](2025/9_03.md)
    - [Support for a wildcard character in network rule network identifiers (*General availability*)](2025/9_03.md)
    - [Support for telemetry metrics and custom spans, with visualizations in Snowsight (*General availability*)](2025/9_03.md)
  + [Data pipeline updates](2025/9_03.md)
    - [Dynamic tables: Support for UNION ALL](2025/9_03.md)
  + [Data lake updates](2025/9_03.md)
    - [Cloning support for Snowflake-managed Apache Iceberg™ tables (*General availability*)](2025/9_03.md)
  + [Release notes change log](2025/9_03.md)
* [9.2 Release notes (with behavior changes): Jan 22, 2025-Feb 13, 2025](2025/9_02.md)
  + [Behavior change bundles](2025/9_02.md)
  + [Non-bundled behavior changes](2025/9_02.md)
  + [New features](2025/9_02.md)
    - [Triggered tasks now can operate as Serverless Tasks (*General availability*)](2025/9_02.md)
    - [Trust Center: Manage individual scanners](2025/9_02.md)
  + [Security updates](2025/9_02.md)
    - [Outbound private connectivity for Microsoft Azure Government regions](2025/9_02.md)
  + [SQL updates](2025/9_02.md)
    - [New SQL functions](2025/9_02.md)
    - [Additional CREATE OR ALTER commands (*Preview*)](2025/9_02.md)
  + [Data lake updates](2025/9_02.md)
    - [Apache Iceberg™ tables: Support for writing Apache Iceberg metadata for Delta-based tables](2025/9_02.md)
  + [Release notes change log](2025/9_02.md)
* [9.1 Release notes: Jan 13, 2025-Jan 16, 2025](2025/9_01.md)
  + [New features](2025/9_01.md)
    - [Outbound private connectivity for Snowflake features](2025/9_01.md)
  + [SQL updates](2025/9_01.md)
    - [ARRAY_AGG function support for window frames (*General availability*)](2025/9_01.md)
  + [Data pipelines updates](2025/9_01.md)
    - [CREATE DYNAMIC TABLE command: New REQUIRE USER parameter added](2025/9_01.md)
    - [ALTER DYNAMIC TABLE command: New COPY SESSION parameter added](2025/9_01.md)
  + [Data lake updates](2025/9_01.md)
    - [External stage and external volume support for Amazon S3 access points (*General availability*)](2025/9_01.md)
    - [Apache Iceberg™ tables: Automated refresh (*General availability*)](2025/9_01.md)
  + [Data governance updates](2025/9_01.md)
    - [Data metric functions: Support for referential integrity checks](2025/9_01.md)
  + [Privacy updates](2025/9_01.md)
    - [Join policies (*Preview*)](2025/9_01.md)
  + [Release notes change log](2025/9_01.md)
* [9.0 Release notes: Jan 07, 2025-Jan 09, 2025](2025/9_00.md)
  + [Security updates](2025/9_00.md)
    - [External key store integration for Tri-Secret Secure (*General availability*)](2025/9_00.md)
    - [Pinning private endpoints (*General availability*)](2025/9_00.md)
  + [Release notes change log](2025/9_00.md)

---
title: SESSIONS and LOGIN_HISTORY Views (Account Usage): Events from Internal Users Removed from Views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1053.md
section: Release Notes
---

# SESSIONS and LOGIN_HISTORY Views (Account Usage): Events from Internal Users Removed from Views

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The behavior of the SESSIONS and LOGIN_HISTORY Account Usage views in the shared SNOWFLAKE database behaves as follows:

Previously:
:   Events from internal system users (e.g. WORKSHEET_APP_USER) are shown in these views.

Currently:
:   Events from internal system users (e.g. WORKSHEET_APP_USER) are not shown in these views.

Ref: 1053

---
title: SESSIONS View (Account Usage): New Column in View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-991.md
section: Release Notes
---

# SESSIONS View (Account Usage): New Column in View

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The behavior of the SESSIONS Account Usage view in the shared SNOWFLAKE database behaves as follows:

Previously:
:   The view does not include the CLOSED_REASON column.

Currently:
:   The view includes the CLOSED_REASON column as the last column in the view. The column contains one of the following values to indicate why the Snowflake session closes:

    * UNKNOWN
    * DROP_USER
    * LOGOUT
    * FORCED_LOGOUT
    * ABANDONED
    * OAUTH_CRITICAL_CHANGE_INTEGRATION
    * DROP_ACCOUNT
    * OAUTH_CONSENT_REVOKED
    * TASK_COMPLETED
    * SFC_FORCED_LOGOUT

Regarding logouts, note the following:

> * FORCED_LOGOUT refers to an account administrator in your Snowflake account forcing a user to logout.
> * SFC_FORCED_LOGOUT refers to a Snowflake administrator (e.g. Support engineer) forcing a user to logout.

Ref: 991

---
title: SESSIONS views: New columns and a behavior change for the closed_reason column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2149.md
section: Release Notes
---

# SESSIONS views: New columns and a behavior change for the `closed_reason` column

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the [ACCOUNT_USAGE.SESSIONS](../../../sql-reference/account-usage/sessions.md)
and [ORGANIZATION_USAGE.SESSIONS](../../../sql-reference/organization-usage/sessions.md) views include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `access_time` | TIMESTAMP_LTZ | Date and time when the session was last used. |
| `is_open` | BOOLEAN | Whether the session is currently open (TRUE) or closed (FALSE). |

Additionally, the behavior of the `closed_reason` column changes as follows:

Before the change:
:   When a session is still open, the `closed_reason` column returns UNKNOWN.

After the change:
:   When a session is still open, the `closed_reason` column returns NULL.

This change affects customers with code that explicitly checks for `closed_reason = UNKNOWN`
or that depends on the old column structure.

Ref: 2149

---
title: SHOW <class_name> INSTANCES commands: Changes in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1735.md
section: Release Notes
---

# SHOW <class_name> INSTANCES commands: Changes in output

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

The SHOW <class_name> INSTANCES command for individual [Snowflake classes](../../../sql-reference-classes.md) list the instances
for which you have access privileges. For example, the
[SHOW SNOWFLAKE.CORE.BUDGET INSTANCES](../../../sql-reference/classes/budget/commands/show-budget.md) command lists the budgets in
your account that you have access privileges for. For a full list, see [Available classes](../../../sql-reference-classes.md).

When this behavior change bundle is enabled, the output of the SHOW <class_name> INSTANCES command changes as follows:

Before the change:
:   Results are returned in lexicographical order by name.

After the change:
:   Result order is not guaranteed.

Ref: 1735

---
title: SHOW <objects> commands: Remove BUDGET column from output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1913.md
section: Release Notes
---

# SHOW <objects> commands: Remove BUDGET column from output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

SHOW <objects> commands (for example, SHOW WAREHOUSES) behave as follows:

Before the change:
:   The output of a SHOW <objects> command includes a `budget` column that provides the name of a budget if the object is monitored by
    one.

After the change:
:   The output of SHOW <objects> commands does not include a `budget` column.

Ref: 1913

---
title: SHOW ACCOUNT command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2174.md
section: Release Notes
---

# SHOW ACCOUNT command: New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW ACCOUNTS](../../../sql-reference/sql/show-accounts.md) command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| tenant_type | VARCHAR | Shows the tenant type, one of:   * 1 (internal) * 2 (external)   or NULL if not set |

Ref: 2174

---
title: SHOW and DESCRIBE commands for listings: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1756.md
section: Release Notes
---

# SHOW and DESCRIBE commands for listings: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the following listing commands are changed:

* [DESCRIBE AVAILABLE LISTING](../../../sql-reference/sql/desc-available-listing.md)
* [DESCRIBE LISTING](../../../sql-reference/sql/desc-listing.md)
* [SHOW AVAILABLE LISTINGS](../../../sql-reference/sql/show-available-listings.md)
* [SHOW LISTINGS](../../../sql-reference/sql/show-listings.md)

When enabled, the output of these commands includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| distribution | VARCHAR | Defines where a listing’s data is distributed in relation to the organization. Possible values are `EXTERNAL` and `INTERNAL`. |
| is_mountless_queryable | BOOLEAN | Indicates whether the consumer can query the listing directly, without needing to retrieve (GET) the listing and create a local database in their account. If set to `TRUE`, the consumer can run queries directly on the data through the listing itself. |
| organization_profile_name | VARCHAR | The provider profile for an organization. For an organizational listing, this value is set to `INTERNAL`. |
| unified_listing_locator | VARCHAR | The unique identifier that represents a listing and its data product as one entity. This value is either set by the provider or generated by Snowflake. |

Ref: 1756

---
title: SHOW APPLICATIONS and SHOW APPLICATION PACKAGES commands: New column in output: TYPE
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2065.md
section: Release Notes
---

# SHOW APPLICATIONS and SHOW APPLICATION PACKAGES commands: New column in output: TYPE

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) and [SHOW APPLICATION PACKAGES](../../../sql-reference/sql/show-application-packages.md) commands includes the following new column:

| Column name | Description |
| --- | --- |
| `type` | Shows whether the app is a standard Snowflake Native App or a Snowflake Declarative Native App. |

For selected accounts, providers can use Declarative Sharing in the Native App framework to build apps.

For more information, see [Run tasks with user privileges](../../../user-guide/tasks-intro.md).

Ref: 2065

---
title: SHOW APPLICATIONS command: Changes to the LABEL column output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-show-applications-output-change.md
section: Release Notes
---

# SHOW APPLICATIONS command: Changes to the LABEL column output

The output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command behaves as follows:

Before the change:
:   The value of the LABEL column for the APPLICATION object is `NONE` if no label is specified.

After the change:
:   The value of the LABEL column for the APPLICATION object will be empty if no label is specified.

Ref: n/a

---
title: SHOW APPLICATIONS command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1879.md
section: Release Notes
---

# SHOW APPLICATIONS command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command will include a new column in the output.

Before the change:
:   The `release_channel_name` column does not appear in the output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md)
    command.

After the change:
:   The output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command contains the `release_channel_name` column at
    the end of the output.

    This column is reserved for future use.

Ref: 1879

---
title: SHOW APPLICATIONS command: New UPGRADE_STATUS column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1521.md
section: Release Notes
---

# SHOW APPLICATIONS command: New `UPGRADE_STATUS` column

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

During installation and upgrade of a Snowflake Native App, a consumer may need to monitor the current status of
the installation or upgrade.

Before the change:
:   The [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command did not include the `upgrade_status` column.

After the change:
:   The [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command includes the `upgrade_status` column, which is defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | UPGRADE_STATUS | String | Lists the current status of the installation or upgrade. |

Possible values for this column are:

* `INSTALLING`: The application object is in the process of being created.
* `INSTALL_FAILED`: The creation of the application object failed. The application object
  remains in the `INSTALL_FAILED` state until it is dropped. See the `UPGRADE_FAILURE_REASON`
  column of the [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) command for information about why the
  installation or upgrade failed.
* `COMPLETE`: The setup script successfully completed and the application object was created
  or upgraded.
* `QUEUED`: The application object is queued for upgrade.
* `UPGRADING`: The application object is in the process of being upgraded.
* `FAILED`: All upgrade attempts failed. The reason for the failure is listed in the
  `UPGRADE_FAILURE_REASON` column, if present. The instance remains in the `FAILED` state until
  a release directive is updated to point to a different version than the one that the upgrade was
  targeting, as defined in the `TARGET_UPGRADE_VERSION` column.
* `QUEUED_DELAYED`: The application object is queued for an upgrade that is scheduled for a future time.
* `QUEUED_RETRY`: The instance failed one or more upgrade attempts. The reason for the failure
  is indicated in `UPGRADE_FAILURE_REASON`: The instance is queued to perform another upgrade attempt.
* `DISABLED`: The application object and its upgrades were disabled. In this state the instance will be
  inaccessible for consumers, it will not be considered for upgrades and will not block application package
  version drop. The reason for the failure is listed in the `UPGRADE_FAILURE_REASON` column, if present.

Ref: 1521

---
title: SHOW AVAILABLE LISTINGS and DESCRIBE LISTING commands: Reformat the regions column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1986.md
section: Release Notes
---

# SHOW AVAILABLE LISTINGS and DESCRIBE LISTING commands: Reformat the regions column in output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the `regions` column that’s returned in the [SHOW AVAILABLE LISTINGS](../../../sql-reference/sql/show-available-listings.md) and [DESCRIBE AVAILABLE LISTING](../../../sql-reference/sql/desc-available-listing.md) commands includes the region group prefix.

Before the change:
:   SHOW AVAILABLE LISTINGS and DESCRIBE AVAILABLE LISTING return a `regions` column that contains a comma-separated string list of regions in the format `snowflake_region`. For example: `AWS_US_WEST_2`.

After the change:
:   SHOW AVAILABLE LISTINGS and DESCRIBE AVAILABLE LISTING return a reformatted `regions` column. These commands still return a comma-separated string list of regions, but the format is now `region_group.snowflake_region`. For example: `PUBLIC.AWS_US_WEST_2`.

> **Note:**
>
> If you have any scripts that call SHOW AVAILABLE LISTINGS or DESCRIBE AVAILABLE LISTING, be sure to parse the `regions` column to handle this new format.

Ref: 1926

---
title: SHOW AVAILABLE LISTINGS command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2201.md
section: Release Notes
---

# SHOW AVAILABLE LISTINGS command: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW AVAILABLE LISTINGS](../../../sql-reference/sql/show-available-listings.md) command includes two new
columns. These columns are only populated with data for Internal Marketplace (organization) listings. For other listings, the columns show
null values.

| Column name | Description |
| --- | --- |
| `discover_only` | Indicates whether a listing is discoverable. |
| `is_data_available_to_query` | Indicates whether a listing exposes any data objects in the consumer account that can be queried. |

Ref: 2201

---
title: SHOW CHANNELS command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1950.md
section: Release Notes
---

# SHOW CHANNELS command: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW CHANNELS](../../../sql-reference/sql/show-channels.md) command includes the following new columns.

| Column name | Description |
| --- | --- |
| `parent_domain` | Indicates the type of the target domain of the channel. Possible values are TABLE or PIPE. |
| `parent_name` | Displays the name of the specific table or pipe that is the target of the channel. |

Ref: 1950

---
title: SHOW commands for objects owned by an application: New column owner_role_type
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1370.md
section: Release Notes
---

# SHOW commands for objects owned by an application: New column `owner_role_type`

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the following SHOW commands is as follows:

* SHOW ALERTS
* SHOW CLASSES
* SHOW DATABASES
* SHOW DYNAMIC TABLES
* SHOW MATERIALIZED VIEWS
* SHOW NETWORK RULES
* SHOW SECRETS
* SHOW STREAMLITS
* SHOW WAREHOUSES

Before the change:
:   The output of the command does not include the `owner_role_type` column.

After the change:
:   The output of the command includes the `owner_role_type` column.

    | Column name | Description |
    | --- | --- |
    | `owner_role_type` | The type of role that owns the object: either `ROLE` and `DATABASE_ROLE`.  When the object is owned by a native app, the value is `APPLICATION`.  Snowflake returns NULL if you delete the object because there is no owner role for a deleted object, nor can a native app own a deleted object. |

Ref: 1274

---
title: SHOW commands: New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1281.md
section: Release Notes
---

# SHOW commands: New column

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the following SHOW commands include a new BUDGET column in the output:

* SHOW TABLES
* SHOW SCHEMAS
* SHOW TASKS
* SHOW PIPES
* SHOW DATABASES
* SHOW WAREHOUSES
* SHOW MATERIALIZED VIEWS

| Column | Data type | Description |
| --- | --- | --- |
| BUDGET | TEXT | Reserved for future use. |

The BUDGET column will be the last column in the output.

Ref: 1281

---
title: SHOW Commands: New Column Added to Output for Certain Commands
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-865.md
section: Release Notes
---

# SHOW Commands: New Column Added to Output for Certain Commands

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The output of these SHOW commands now includes a new OPTIONS column:

* [SHOW MASKING POLICIES](../../../sql-reference/sql/show-masking-policies.md)
* SHOW ACCESS POLICIES
* [SHOW SESSION POLICIES](../../../sql-reference/sql/show-session-policies.md)
* [SHOW PASSWORD POLICIES](../../../sql-reference/sql/show-password-policies.md)

The column was added to support future functionality. Currently, this column returns an empty string.

Ref: 865

---
title: SHOW Commands: New OWNER_ROLE_TYPE Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-747.md
section: Release Notes
---

# SHOW Commands: New OWNER_ROLE_TYPE Column in Output

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The OWNER_ROLE_TYPE column is now included in the output of the following [SHOW <objects>](../../../sql-reference/sql/show.md) commands:

* SHOW EXTERNAL TABLES
* SHOW FILE FORMATS
* SHOW MASKING POLICIES
* SHOW MATERIALIZED VIEWS
* SHOW OBJECTS
* SHOW PIPES
* SHOW ROW ACCESS POLICIES
* SHOW SCHEMAS
* SHOW SEQUENCES
* SHOW STAGES
* SHOW STREAMS
* SHOW TABLES
* SHOW TAGS
* SHOW TASKS
* SHOW VIEWS

The column shows the type of role that owns the object: ROLE or DATABASE_ROLE. The new column has be added immediately after the existing
OWNER column.

Ref: 747

---
title: SHOW Commands: New OWNER_ROLE_TYPE Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-747.md
section: Release Notes
---

# SHOW Commands: New OWNER_ROLE_TYPE Column in Output

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The OWNER_ROLE_TYPE column is included in the output of the following SHOW <object_type> commands:

* SHOW EXTERNAL TABLES
* SHOW FILE FORMATS
* SHOW MASKING POLICIES
* SHOW MATERIALIZED VIEWS
* SHOW OBJECTS
* SHOW PIPES
* SHOW ROW ACCESS POLICIES
* SHOW SCHEMAS
* SHOW SEQUENCES
* SHOW STAGES
* SHOW STREAMS
* SHOW TABLES
* SHOW TAGS
* SHOW TASKS
* SHOW VIEWS

The column displays the type of role that owns the object: ROLE or DATABASE_ROLE.

Ref: 747

---
title: SHOW Commands: Pagination Support
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1080.md
section: Release Notes
---

# SHOW Commands: Pagination Support

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, the behavior of these commands changes as follows:

* [SHOW APPLICATION ROLES](../../../sql-reference/sql/show-application-roles.md)
* [SHOW DATABASE ROLES](../../../sql-reference/sql/show-database-roles.md)
* [SHOW GRANTS](../../../sql-reference/sql/show-grants.md)
* [SHOW ROLES](../../../sql-reference/sql/show-roles.md)

Previously:
:   These commands do not support limiting the row output or pagination of the results.

Currently:
:   These commands support pagination of the output using a LIMIT … FROM clause:

    > * [SHOW APPLICATION ROLES](../../../sql-reference/sql/show-application-roles.md)
    > * [SHOW DATABASE ROLES](../../../sql-reference/sql/show-database-roles.md)
    > * [SHOW ROLES](../../../sql-reference/sql/show-roles.md)

    ```sqlsyntax
    SHOW <domain_plural> [ LIMIT <rows> [ FROM '<name_string>' ] ]
    ```

    The SHOW GRANTS command only supports the LIMIT clause. For example:

    ```sqlsyntax
    SHOW GRANTS [ LIMIT <rows> ]
    ```

    Where:

    > `domain_plural`
    > :   Use one of the following plural forms of the object domain:
    >
    >     > * `APPLICATION ROLES`
    >     > * `DATABASE ROLES`
    >     > * `ROLES`
    >
    > `LIMIT rows [ FROM 'name_string' ]`
    > :   Optionally limits the maximum number of rows returned, while also enabling “pagination” of the results. The actual number of rows
    >     returned night be less than the specified limit (e.g. the number of existing objects is less than the specified limit).
    >
    >     The optional `FROM 'name_string'` subclause effectively serves as a “cursor” for the results. This enables fetching the
    >     specified number of rows following the first row whose object name matches the specified string:
    >
    >     The string must be enclosed in single quotes and is case-sensitive. In addition, the string does not have to include the full
    >     object name; partial names are supported.
    >
    > Default: No value (no limit is applied to the output).
    >
    > For example:
    >
    > > ```sqlexample
    > > SHOW APPLICATION ROLES IN APPLICATION myapp LIMIT 10 FROM 'app_role2';
    > > ```
    >
    > The statement returns up to ten application roles in the application named `myapp` after the first application role named
    > `app_role2`.

Ref: 1080

---
title: SHOW commands: Privilege updates
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1665.md
section: Release Notes
---

# SHOW commands: Privilege updates

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The behavior of the [SHOW <objects>](../../../sql-reference/sql/show.md) command is as follows:

Before the change:
:   The role used to execute the command requires the USAGE privilege on the object to see the objects.

    For example, the SHOW SCHEMAS command requires the USAGE privilege on the schema.

    Other privileges on the schema, such as MODIFY, are not sufficient to see the table objects in the schema.

After the change:
:   The role used to execute the command can be granted any privilege on the object to see the objects in the output.

    For example, if the role used to execute the SHOW SCHEMAS command is granted the MODIFY privilege on the schema, the role can see those
    schemas in the output.

    Consequently, the output of the SHOW command might return more objects than before the change depending on the grants to the role that
    executes the command.

    These changes apply to all SHOW commands.

Ref: 1665

---
title: SHOW commands: Update OWNER column for objects owned by Snowflake
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1475.md
section: Release Notes
---

# SHOW commands: Update OWNER column for objects owned by Snowflake

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of the `owner` column is as follows:

Before the change:
:   The `owner` column does not return a value for the following commands:

    * SHOW APPLICATION ROLES IN APPLICATION SNOWFLAKE
    * SHOW SCHEMAS IN APPLICATION SNOWFLAKE
    * SHOW SCHEMAS IN DATABASE SNOWFLAKE.

After the change:
:   The `owner` column returns a value for the following commands as follows:

    | Command | `owner` column value |
    | --- | --- |
    | SHOW APPLICATION ROLE IN APPLICATION SNOWFLAKE | `SNOWFLAKE` |
    | SHOW SCHEMAS IN APPLICATION SNOWFLAKE  SHOW SCHEMAS IN DATABASE SNOWFLAKE | `SNOWFLAKE` for the schema named `LOCAL`.  This output applies to both commands. |

> **Note:**
>
> [Disabling](../../../sql-reference/functions/system_disable_behavior_change_bundle.md) the `2024_01` behavior change bundle does not
> disable the changes described in this topic.

Ref: 1475

---
title: SHOW COMPUTE POOL and DESCRIBE COMPUTE POOL commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2119.md
section: Release Notes
---

# SHOW COMPUTE POOL and DESCRIBE COMPUTE POOL commands: New column in output

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW COMPUTE POOLS](../../../sql-reference/sql/show-compute-pools.md) and [DESCRIBE COMPUTE POOL](../../../sql-reference/sql/desc-compute-pool.md) commands includes the following new column:

| Column name | Description |
| --- | --- |
| `placement_group` | Specifies the fault domain into which the compute pool nodes are placed. A fault domain is similar to the cloud provider’s availability zone. |

Ref: 2119

---
title: SHOW COMPUTE POOLS and DESC COMPUTE POOL commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1595-1652.md
section: Release Notes
---

# SHOW COMPUTE POOLS and DESC COMPUTE POOL commands: New column in output

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW COMPUTE POOLS](../../../sql-reference/sql/show-compute-pools.md) and [DESCRIBE COMPUTE POOL](../../../sql-reference/sql/desc-compute-pool.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| BUDGET | The name of the [budget](../../../user-guide/budgets.md) monitoring the credit usage of the compute pool. |
| TARGET_NODES | Indicates the number of nodes that Snowflake is targeting for your compute pool. If `active_nodes` is not equal to the `target_nodes`, then Snowflake will autoscale the cluster to add or remove the nodes  **Note:** The `target_nodes` column appears immediately after the existing `idle_nodes` column. |

The following examples demonstrate how to interpret the values in the `target_nodes` column.

**Example 1:** Suppose in a [CREATE COMPUTE POOL](../../../sql-reference/sql/create-compute-pool.md) command, you specify MIN_NODES=1 and MAX_NODES=3.

While Snowflake is provisioning a node, initially the value in the `active_nodes` and `idle_nodes` columns is 0, and the value in the `target_nodes` column is 1. (The value in the `target_nodes` column is the same as the value that you specified for the MIN_NODES parameter.) This indicates that there should be one node in the compute pool that Snowflake is provisioning.

After Snowflake provisions one node, the value in the `idle_nodes` column is 1 (assuming that there are no services running). The value in the `target_nodes` column is still 1, indicating there should be one node in the compute pool.

**Example 2:** Snowflake might try to add a node to an existing compute pool due to autoscaling or changes to the minimum number of nodes (through
[ALTER COMPUTE POOL … SET MIN_NODES](../../../sql-reference/sql/alter-compute-pool.md)).

While Snowflake is provisioning a node, the value in the `state` column is `resizing`. To determine how many nodes Snowflake is adding, check the value in the `target_nodes` column.

For example, suppose that the value in the, `active_nodes` column is 1, the value in the `idle_nodes` column is 0, and you resize the compute pool by updating the MIN_NODES property from 1 to 2. In this case, the value in the `target_nodes` column as 2 (the number of nodes that should be in the compute pool). From this you can infer that Snowflake is provisioning one additional node.

Ref: 1595, 1652

---
title: SHOW CONTACTS command and CONTACTS view (Account Usage): New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2222.md
section: Release Notes
---

# SHOW CONTACTS command and CONTACTS view (Account Usage): New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW CONTACTS](../../../sql-reference/sql/show-contacts.md) command and the
[CONTACTS](../../../sql-reference/account-usage/contacts.md) [Account Usage](../../../sql-reference/account-usage.md) view include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| EMAIL_LIST | ARRAY | List of email addresses. |

Ref: 2222

---
title: SHOW DATABASES command: Changes to the KIND column output (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-show-databases-kind-column-output.md
section: Release Notes
---

# SHOW DATABASES command: Changes to the KIND column output (Pending)

The output of the [SHOW DATABASES](../../../sql-reference/sql/show-databases.md) command will change as follows:

Previously:
:   The value of the KIND column for the SNOWFLAKE database is `IMPORTED DATABASE`.

Pending:
:   The value of the KIND column for the SNOWFLAKE database will be `APPLICATION`.

Ref: n/a

---
title: SHOW DATABASES Command: New Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1021.md
section: Release Notes
---

# SHOW DATABASES Command: New Column in Output

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The SHOW DATABASES command output includes a KIND column. The column is added to support future functionality.

Previously:
:   This column does not exist.

Currently:
:   This column is displayed and specifies one of the following values:

    * STANDARD: Specifies a normal database.
    * IMPORTED DATABASE: Specifies a database that is created from a share.

Ref: 1021

---
title: SHOW DATABASES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2199.md
section: Release Notes
---

# SHOW DATABASES command: New column in output

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW DATABASES](../../../sql-reference/sql/show-databases.md) command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| DATA_QUALITY_MONITORING_SETTINGS | VARIANT | A JSON string that describes the data quality configuration of this database. Information includes `enabled` (boolean), `integrations` (array of any integrations in this database), and `metadata_included` (boolean). |

Ref: 2199

---
title: SHOW DYNAMIC TABLES command and DYNAMIC_TABLES function: New changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1796.md
section: Release Notes
---

# SHOW DYNAMIC TABLES command and DYNAMIC_TABLES function: New changes to output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

The [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) and [DYNAMIC_TABLES](../../../sql-reference/functions/dynamic_tables.md) output behaves as follows:

Before the change:
:   The output of the SHOW DYNAMIC TABLES command and the DYNAMIC_TABLES function does not include the `is_iceberg` column.

After the change:
:   The output of the SHOW DYNAMIC TABLES command and the DYNAMIC_TABLES function includes the `is_iceberg` column, which is defined as
    follows:

    | Column Name | `is_iceberg` |
    | --- | --- |
    | Data Type | Text |
    | Description | Y if the table is an Apache Iceberg™ dynamic table; otherwise, N. |

Ref: 1796

---
title: SHOW DYNAMIC TABLES command: New columns added to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-2001.md
section: Release Notes
---

# SHOW DYNAMIC TABLES command: New columns added to output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW DYNAMIC TABLES](../../../sql-reference/sql/show-dynamic-tables.md) command includes
the following new columns:

| Column name | Description |
| --- | --- |
| INSERT ONLY INPUTS | Reserved for future use |
| IMMUTABLE WHERE | Reserved for future use |

These columns were added to support future functionality. Currently, these columns return an empty string.

Ref: 2001

---
title: SHOW ENDPOINTS command: New column and changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1586.md
section: Release Notes
---

# SHOW ENDPOINTS command: New column and changes to output

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The [SHOW ENDPOINTS](../../../sql-reference/sql/show-endpoints.md) command behaves as follows:

Before the change:
:   In commands output, the `port` column always has a value.

After the change:
:   The output includes a new `port_range` column that appears immediately after the `port` column.

    | Column name | Description |
    | --- | --- |
    | PORT_RANGE | If `port` is null, contains a port range in the form ####-####. |

    If the `port_range` column is NOT NULL, the `port` column is NULL.

Ref: 1586

---
title: SHOW ENDPOINTS command: Output column name change
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1563.md
section: Release Notes
---

# SHOW ENDPOINTS command: Output column name change

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The [SHOW ENDPOINTS](../../../sql-reference/sql/show-endpoints.md) command output behaves as follows:

Before the change:
:   Output includes the column `ingress_enabled`.

After the change:
:   Output includes the column `is_public`, which was previously named `ingress_enabled`.

Ref: 1563

---
title: SHOW EVENT TABLES command: Add owner_role_type column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1294.md
section: Release Notes
---

# SHOW EVENT TABLES command: Add owner_role_type column

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

The [SHOW EVENT TABLES](../../../sql-reference/sql/show-event-tables.md) command behaves as follows:

Before the change:
:   The output of the SHOW EVENT TABLES command does not include an OWNER_ROLE_TYPE column.

After the change:
:   The output of the SHOW EVENT TABLES command includes the OWNER_ROLE_TYPE column, which is defined as follows:

    | Column Name | Data type | Description |
    | --- | --- | --- |
    | owner_role_type | String | The type of role that owns the object, for example `ROLE`. . If a Snowflake Native App owns the object, the value is `APPLICATION`. . Snowflake returns NULL if you delete the object because a deleted object does not have an owner role. |

    This column will be added as the last column in the output.

Ref: 1294

---
title: SHOW FUNCTIONS and SHOW PROCEDURES commands: Changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1508.md
section: Release Notes
---

# SHOW FUNCTIONS and SHOW PROCEDURES commands: Changes to output

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The output of the [SHOW FUNCTIONS](../../../sql-reference/sql/show-functions.md) and [SHOW PROCEDURES](../../../sql-reference/sql/show-procedures.md) commands include optional
arguments in the `arguments` column. For an example procedure:

```sqlexample
CREATE OR REPLACE PROCEDURE my_proc (
  arg1 string,
  arg2 boolean default true
)
RETURNS string
LANGUAGE JAVASCRIPT
AS
$$
  return 'hello world';
$$;
```

The value in the `arguments` column for optional arguments is displayed as follows:

Before the change:
:   Optional arguments for functions and procedures are surrounded by brackets (`[]`).

    For example, the value of the `arguments` column in the output of a SHOW PROCEDURES statement for
    the example procedure is:

    `MY_PROC(VARCHAR [, BOOLEAN]) RETURN VARCHAR`

After the change:
:   Optional arguments for functions and procedures are displayed with the DEFAULT keyword.

    For example, the value of the `arguments` column in the output of a SHOW PROCEDURES statement for
    the example procedure is:

    `MY_PROC(VARCHAR , BOOLEAN DEFAULT) RETURN VARCHAR`

    Snowsight correctly displays the definition of functions and procedures with optional arguments.

Ref: 1508

---
title: SHOW FUNCTIONS and SHOW PROCEDURES commands: The complete data type for arguments is displayed in output (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1944.md
section: Release Notes
---

# SHOW FUNCTIONS and SHOW PROCEDURES commands: The complete data type for arguments is displayed in output (Postponed)

> **Attention:**
>
> This behavior change was originally in the [2025_03 Bundle](../2025_03_bundle.md)
> and intended to become enabled by default in the 2025_04 bundle.
> However, it has been postponed and a new release date has not been determined.

When this behavior change bundle is enabled, the output of the SHOW command for functions and procedures will display complete data types
(when the type is not the default) for function and procedure arguments.

Before the change:
:   When you execute the [SHOW PROCEDURES](../../../sql-reference/sql/show-procedures.md) or [SHOW FUNCTIONS](../../../sql-reference/sql/show-functions.md) command, values in the ARGUMENT column do not always include the complete data
    type—including the type’s precision—when the type isn’t the default.

    For example, when an argument in the column’s value is NUMBER(20, 0), the displayed value is simply NUMBER, as in the following example:

    ```output
    MY_UDF(TIMESTAMP_NTZ, TIMESTAMP_LTZ, TIMESTAMP_TZ, VARCHAR, NUMBER) RETURN NUMBER
    ```

    This makes the signature less useful when you want to use it with commands such as DESC, DROP, or GET_DDL, where the incomplete
    signature would result in a name resolution failure.

After the change:
:   When you execute the SHOW PROCEDURES or SHOW FUNCTIONS command, values in the ARGUMENT column include the complete data type—including
    the type’s precision—when the type isn’t the default.

    For example, when an argument in the column’s value is NUMBER(20, 0), the displayed value is NUMBER(20, 0), as in the following example:

    ```output
    MY_UDF(TIMESTAMP_NTZ(3), TIMESTAMP_LTZ(3), TIMESTAMP_TZ(3), VARCHAR(100), NUMBER(20,0)) RETURN NUMBER(20,0)
    ```

    You can use this column value with commands such as DESC, DROP, or GET_DDL.

    This change affects the following types when the precision of the type used for the argument isn’t the default:

    * NUMBER
    * VARCHAR
    * BINARY
    * TIMESTAMP_LTZ
    * TIMESTAMP_NTZ
    * TIMESTAMP_TZ
    * TIME

Ref: 1944

---
title: SHOW FUNCTIONS commands: New is_data_metric column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1248.md
section: Release Notes
---

# SHOW FUNCTIONS commands: New is_data_metric column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the SHOW FUNCTIONS command is as follows:

Before the change:
:   The output of the command does not include the is_data_metric column.

After the change:
:   The output of the command includes the is_data_metric column. This column is a placeholder for future functionality.

Ref: 1248

---
title: SHOW FUNCTIONS IN MODEL Command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1678.md
section: Release Notes
---

# SHOW FUNCTIONS IN MODEL Command: New column in output

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW FUNCTIONS IN MODEL](../../../sql-reference/sql/show-functions-in-model.md) command includes the following new column:

| Column Name | Description |
| --- | --- |
| `is_table_function` | Whether the function is a [table function](../../../sql-reference/functions-table.md), a function that returns tabular data rather than a single value. Possible values are:   * TRUE: The function is a table function (UDTF) * FALSE: The function is a regular function (UDF) |

Ref: 1678

---
title: SHOW GIT REPOSITORIES and DESC GIT REPOSITORY commands: LAST_OPERATION_STATUS column removed from output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1949.md
section: Release Notes
---

# SHOW GIT REPOSITORIES and DESC GIT REPOSITORY commands: LAST_OPERATION_STATUS column removed from output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW GIT REPOSITORIES](../../../sql-reference/sql/show-git-repositories.md) and
[DESCRIBE GIT REPOSITORY](../../../sql-reference/sql/desc-git-repository.md) commands no longer includes the following column:

| Column name | Data type | Description |
| --- | --- | --- |
| LAST_OPERATION_STATUS | None. | Column will be removed. Its value was always NULL. |

Ref: 1949

---
title: SHOW GRANTS command: Updates for managed access schema
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1397.md
section: Release Notes
---

# SHOW GRANTS command: Updates for managed access schema

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the [SHOW GRANTS](../../../sql-reference/sql/show-grants.md) command with a managed access schema is as follows:

Before the change:
:   * The `grant_options` column returns `TRUE` when you run a `SHOW GRANTS ON object_type object_name` command for an
      object in the managed access schema.
    * The `privilege` column includes the OWNERSHIP privilege for the role that owns the managed access schema when you run a
      `SHOW GRANTS ON SCHEMA managed_access_schema` command.

After the change:
:   * The `grant_options` column returns `FALSE` when you run a `SHOW GRANTS ON object_type object_name` command for an
      object in the managed access schema.
    * In addition to the OWNERSHIP privilege, the `privilege` column includes the MANAGE GRANTS privilege for the role that owns the
      managed access schema when you run a `SHOW GRANTS ON SCHEMA managed_access_schema` command.

Ref: 1397

---
title: SHOW GRANTS ON Command: New GRANTED_BY_ROLE_TYPE Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-754.md
section: Release Notes
---

# SHOW GRANTS ON Command: New GRANTED_BY_ROLE_TYPE Column in Output

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

In the current release, the output of the SHOW GRANTS ON <object_type> <object_name> command syntax includes a new GRANTED_BY_ROLE_TYPE column.
The column indicates whether the grantor role is an account role or database role (ROLE or DATABASE_ROLE, respectively).

To help limit the impact of this change, the column is the last column in the output.

> **Note:**
>
> The column is limited to the SHOW GRANTS ON <object_type> <object_name> command syntax. Other syntax variations for the command will
> not include this column.

Ref: 754

---
title: SHOW GRANTS TO ROLE command: SNOWFLAKE database role grants to PUBLIC included in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1925.md
section: Release Notes
---

# SHOW GRANTS TO ROLE command: SNOWFLAKE database role grants to PUBLIC included in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

The output of the SHOW GRANTS TO ROLE command is changing as follows:

Before the change:
:   SNOWFLAKE database role grants **are not** shown in the output of SHOW GRANTS TO ROLE PUBLIC.

After the change:
:   SNOWFLAKE database role grants **are** shown in the output of SHOW GRANTS TO ROLE PUBLIC.

This change provides visibility of SHOW GRANTS output.

Ref: 1925

---
title: SHOW GRANTS TO USER and SHOW GRANTS commands: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1803.md
section: Release Notes
---

# SHOW GRANTS TO USER and SHOW GRANTS commands: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

SHOW GRANTS TO USER (and SHOW GRANTS) behaves as follows:

Before the change:
:   SHOW GRANTS TO USER (and SHOW GRANTS) has the following columns:

    * created_on
    * role
    * granted_to
    * grantee_name
    * granted_by

After the change:
:   SHOW GRANTS TO USER (and SHOW GRANTS) has the following columns:

    * created_on
    * privilege
    * granted_on
    * name
    * role
    * granted_to
    * grantee_name
    * grant_option
    * granted_by

When this behavior change bundle is enabled, the output of the SHOW GRANTS TO USER and SHOW GRANTS commands includes the following new columns:

| Column name | Description |
| --- | --- |
| privilege | Name of the granted privilege. |
| granted_on | Object kind, such as TABLE or DATABASE, on which the privilege is granted. |
| name | Name of the object on which the privilege is granted. |
| grant_option | TRUE / FALSE. If set to TRUE, the user can grant the privilege to other roles or users. |

This change aligns the output of SHOW GRANTS TO ROLE and SHOW GRANTS TO USER, in support of user-based access control.

Ref: 1803

---
title: SHOW GRANTS: Changes to output for grants on functions and procedures
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2190.md
section: Release Notes
---

# SHOW GRANTS: Changes to output for grants on functions and procedures

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

In the output of the [SHOW GRANTS](../../../sql-reference/sql/show-grants.md) command, the value in the name column is changing for functions and
procedures:

Before the change:
:   The `name` column includes the names and types of the arguments and the return type.

    For example, for the following function:

    ```sqlexample
    CREATE FUNCTION area_of_circle(radius FLOAT)
      RETURNS FLOAT
      ...
    ```

    the value of the name column is:

    ```none
    MY_DB.MY_SCHEMA."AREA_OF_CIRCLE(RADIUS FLOAT):FLOAT"
    ```

    For the following procedure:

    ```sqlexample
    CREATE PROCEDURE output_message(message VARCHAR)
      RETURNS VARCHAR
      ...
    ```

    the value in the name column is:

    ```none
    MY_DB.MY_SCHEMA."OUTPUT_MESSAGE(MESSAGE VARCHAR):VARCHAR"
    ```

After the change:
:   The `name` column just includes the types of the arguments.

    For example, for the following function:

    ```sqlexample
    CREATE FUNCTION area_of_circle(radius FLOAT)
      RETURNS FLOAT
      ...
    ```

    the value of the name column is:

    ```none
    MY_DB.MY_SCHEMA.AREA_OF_CIRCLE(FLOAT)
    ```

    For the following procedure:

    ```sqlexample
    CREATE PROCEDURE output_message(message VARCHAR)
      RETURNS VARCHAR
      ...
    ```

    the value in the name column is:

    ```none
    MY_DB.MY_SCHEMA.OUTPUT_MESSAGE(VARCHAR)
    ```

This change makes it easier to use the value in the name column in [GRANT](../../../sql-reference/sql/grant-privilege.md) and
[REVOKE](../../../sql-reference/sql/revoke-privilege.md) statements that you want to execute.

For example, suppose that you want to revoke the privileges granted on functions and procedures to the `my_custom_role` role.
You can run the SHOW GRANTS command:

```sqlexample
SHOW GRANTS TO ROLE my_custom_role
  ->> SELECT "privilege", "granted_on", "name"
        FROM $1
        WHERE "granted_on" IN ('FUNCTION', 'PROCEDURE');
```

```output
+-----------+------------+-----------------------------------------+
| privilege | granted_on | name                                    |
|-----------+------------+-----------------------------------------|
| USAGE     | FUNCTION   | MY_DB.MY_SCHEMA.AREA_OF_CIRCLE(FLOAT)   |
| USAGE     | PROCEDURE  | MY_DB.MY_SCHEMA.OUTPUT_MESSAGE(VARCHAR) |
+-----------+------------+-----------------------------------------+
```

Then, you can copy and paste the returned values into REVOKE statements to revoke those privileges:

```sqlexample
REVOKE USAGE ON FUNCTION MY_DB.MY_SCHEMA.AREA_OF_CIRCLE(FLOAT) FROM ROLE my_custom_role;
```

```sqlexample
REVOKE USAGE ON PROCEDURE MY_DB.MY_SCHEMA.OUTPUT_MESSAGE(VARCHAR) FROM ROLE my_custom_role;
```

Ref: 2190

---
title: SHOW ICEBERG TABLES command: New column ICEBERG_TABLE_AUTO_REFRESH_STATUS in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1941.md
section: Release Notes
---

# SHOW ICEBERG TABLES command: New column ICEBERG_TABLE_AUTO_REFRESH_STATUS in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the SHOW ICEBERG TABLES command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| AUTO_REFRESH_STATUS | String | The automated refresh status for an externally managed Apache Iceberg™ table.  The column displays the same results for the table as the [SYSTEM$AUTO_REFRESH_STATUS](../../../sql-reference/functions/system_auto_refresh_status.md) function. |

Ref: 1941

---
title: SHOW ICEBERG TABLES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1745.md
section: Release Notes
---

# SHOW ICEBERG TABLES command: New column in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW ICEBERG TABLES](../../../sql-reference/sql/show-iceberg-tables.md) command includes the
following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `catalog_sync_name` | String | Denotes the name of the catalog integration for Snowflake Open Catalog that the Snowflake-managed Apache Iceberg™ table is configured to be synced to.  This configuration is specified through one of the following:   * The `CATALOG_SYNC` parameter for the [CREATE ICEBERG TABLE (Snowflake as the Iceberg catalog)](../../../sql-reference/sql/create-iceberg-table-snowflake.md) command. * The [ALTER <object>](../../../sql-reference/sql/alter.md) command by using ALTER `<domain>` SET CATALOG_SYNC… syntax, where `<domain>` can   either be `account`, `database`, `schema`, or `iceberg table`. When setting `<domain>` on a non-table   domain, all tables under the domain will be propagated with the CATALOG_SYNC target, so their `catalog_sync_name` column from   SHOW ICEBERG TABLES will contain this value that was set from the higher domain.  If the `CATALOG_SYNC` parameter is set both on the table and a higher domain, the finer-grained domain (that is, the parameter on table)   will be respected and returned in the command output. For example, if the parameter is set on both schema and table, the parameter   value for the table is returned in the `catalog_sync_name column` for SHOW ICEBERG TABLES.   If one of the following is true, the value for `catalog_sync_name` is NULL:   * There are no sync targets configured for the Iceberg table. * The Iceberg table isn’t Snowflake managed.   The `catalog_sync_name` column is added as the last column of the output, immediately following `name_mapping`. |

Ref: 1745

---
title: SHOW ICEBERG TABLES command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2076.md
section: Release Notes
---

# SHOW ICEBERG TABLES command: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW ICEBERG TABLES](../../../sql-reference/sql/show-iceberg-tables.md) command includes the following
new columns:

| Column name | Description |
| --- | --- |
| `partition_specs` | List of objects containing the Apache Iceberg™ partition specs for the Iceberg table as they appear in the Iceberg metadata file. It returns the partition specs for both Snowflake-managed and externally managed Iceberg tables.  Each partition spec includes a `spec-id`, followed by the fields for the partition spec. Each field is an OBJECT value with the following key-value pairs:   * `name`: The name of the partition. * `transform`: The transformation applied to the source column to generate a partition value. This value determines how data is grouped   into partitions. * `source-id`: The identifier of the original table column or field used for partitioning. * `field-id`: The partition field id. This field is used to identify a partition field and is unique in a partition spec. However, for   v2 table metadata, it’s unique across all partition specs.   For example:  ```json [ { "spec-id" : 0, "fields" : [ { "name" : "COL1", "transform" : "identity", "source-id" : 1, "field-id" : 1000 }, { "name" : "COL1_trunc_100", "transform" : "truncate[100]", "source-id" : 1, "field-id" : 1001 }, // Additional fields omitted for brevity.     ] } ] ```  The example shows one partition spec; however, a table can have multiple partition specs.  This column appears after the `auto_refresh_status` column. |
| `current_partition_spec_id` | ID for the partition spec that is currently active for the Iceberg table. This ID corresponds to a value for spec-id in partition_specs. For example: `0`.  This column appears after the `partition_specs` column. It is the last column in the output. |

For more information about partition specs, see [Partition Specs](https://iceberg.apache.org/spec/#partition-specs) in the Apache Iceberg™
specification.

Ref: 2076

---
title: SHOW IMAGE REPOSITORIES command: New column in output and removal of image repositories from the SHOW STAGES command output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1825.md
section: Release Notes
---

# SHOW IMAGE REPOSITORIES command: New column in output and removal of image repositories from the SHOW STAGES command output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

## SHOW IMAGE REPOSITORIES command: New column in output

When this behavior change bundle is enabled, the output of the [SHOW IMAGE REPOSITORIES](../../../sql-reference/sql/show-image-repositories.md) command includes the following new column:

| Column name | Description |
| --- | --- |
| ENCRYPTION | Encryption type configured for the image repository. |

## SHOW STAGES command: Output changes

When this behavior change bundle is enabled, the output of the [SHOW STAGES](../../../sql-reference/sql/show-stages.md) command will not return image repositories in the Snowflake account. To list image repositories, use the [SHOW IMAGE REPOSITORIES](../../../sql-reference/sql/show-image-repositories.md) command instead.

Ref: 1825

---
title: SHOW INTEGRATIONS Command: USAGE Privilege Required to View Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-930.md
section: Release Notes
---

# SHOW INTEGRATIONS Command: USAGE Privilege Required to View Output

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

For the most up-to-date details about the version and date in which it will be enabled, as well as other release-related details,
see the Behavior Change Log.

The behavior of the [SHOW INTEGRATIONS](../../../sql-reference/sql/show-integrations.md) command has changed as follows:

Previously:
:   Snowflake returned all integrations regardless of the privileges granted to the role in use to run the command.

Currently:
:   Snowflake returns integrations for which the role has the USAGE privilege on the integration or the OWNERSHIP privilege on
    the integration.

Ref: 930

---
title: SHOW MANAGED ACCOUNTS command: New and modified columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1738.md
section: Release Notes
---

# SHOW MANAGED ACCOUNTS command: New and modified columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW MANAGED ACCOUNTS](../../../sql-reference/sql/show-managed-accounts.md) command includes the following new
columns at the end:

| Column name | Data type | Description |
| --- | --- | --- |
| OLD_ACCOUNT_URL | VARCHAR | If the original [account URL](../../../user-guide/organizations-connect.md) was saved when the account was renamed, provides the original URL. If the original account URL was dropped, the value is NULL even if the account was renamed. |
| ACCOUNT_OLD_URL_SAVED_ON | VARCHAR | If the original account URL was saved when the account was renamed, provides the date and time when the original account URL was saved. |
| ACCOUNT_OLD_URL_LAST_USED | VARCHAR | If the original account URL was saved when the account was renamed, indicates the last time the account was accessed using the original URL. |
| ORGANIZATION_OLD_URL | VARCHAR | If the account’s organization was changed in a way that created a new [account URL](../../../user-guide/organizations-connect.md) and the original account URL was saved, provides the original account URL. If the original account URL was dropped, the value is NULL even if the organization changed. |
| ORGANIZATION_OLD_URL_SAVED_ON | VARCHAR | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, provides the date and time when the original account URL was saved. |
| ORGANIZATION_OLD_URL_LAST_USED | VARCHAR | If the account’s organization was changed in a way that created a new account URL and the original account URL was saved, indicates the last time the account was accessed using the original account URL. |

In addition, the following columns of SHOW MANAGED ACCOUNTS are renamed as follows:

| Column name before change | Column name after change |
| --- | --- |
| NAME | ACCOUNT_NAME |
| LOCATOR | ACCOUNT_LOCATOR |
| URL | ACCOUNT_URL |

Ref: 1738

---
title: SHOW MANAGED ACCOUNTS command: New column tenant_type in output (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2265.md
section: Release Notes
---

# SHOW MANAGED ACCOUNTS command: New column tenant_type in output (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW MANAGED ACCOUNTS](../../../sql-reference/sql/show-managed-accounts.md) command includes the following new column as the last column:

| Column name | Data type | Description |
| --- | --- | --- |
| `tenant_type` | VARCHAR | Allowed values are `INTERNAL` or `EXTERNAL`. The GLOBALORGADMIN or ORGADMIN role in the [organization account](../../../user-guide/organization-accounts.md) can change `tenant_type`. |

Ref: 2265

---
title: SHOW MODELS Command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1653.md
section: Release Notes
---

# SHOW MODELS Command: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW MODELS](../../../sql-reference/sql/show-models.md)
command includes the following new columns:

| Column Name | Description |
| --- | --- |
| `model_type` | The type of the model. Possible values are:   * USER_MODEL: A Python model in the [Snowflake Model Registry](../../../developer-guide/snowflake-ml/model-registry/overview.md) * CORTEX_FINETUNED: A [fine-tuned Cortex LLM](../../../user-guide/snowflake-cortex/cortex-finetuning.md) model |
| `aliases` | A SQL object whose keys are model version aliases and whose values are the corresponding model version names. The keys include aliases you have created using [ALTER MODEL](../../../sql-reference/sql/alter-model.md) as well as any system aliases (DEFAULT, FIRST, or LAST) that apply to the model version.  If a model version has no aliases, this column contains an empty object, `{}`, rather than NULL. |

The `model_type` column is added as the third column of the output, immediately following `name`. The `aliases`
column is added to the end.

Ref: 1653

---
title: SHOW NOTIFICATION INTEGRATIONS and DESC NOTIFICATION INTEGRATION commands: Changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1846.md
section: Release Notes
---

# SHOW NOTIFICATION INTEGRATIONS and DESC NOTIFICATION INTEGRATION commands: Changes to output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW NOTIFICATION INTEGRATIONS](../../../sql-reference/sql/show-integrations.md) and
[DESC NOTIFICATION INTEGRATION](../../../sql-reference/sql/desc-integration.md) commands changes as described in the following
sections:

* New column in the output of SHOW NOTIFICATION INTEGRATIONS
* Changes to the output of DESC NOTIFICATION INTEGRATION

## New column in the output of SHOW NOTIFICATION INTEGRATIONS

With this behavior change, the output of the SHOW NOTIFICATION INTEGRATIONS command includes the following new column:

| Column name | Description |
| --- | --- |
| `direction` | Specifies one of the following values, which indicates whether the integration supports sending or receiving notifications:   * `OUTBOUND`: Snowflake uses the integration to send notifications to a third-party messaging service.  This value appears for notification integrations with:    + TYPE=QUEUE and DIRECTION=OUTBOUND   + TYPE=EMAIL   + TYPE=WEBHOOK * `INBOUND`: Snowflake uses the integration to receive notifications from a third-party messaging service.  This value appears for notification integrations that do not specify DIRECTION=OUTBOUND. |

## Changes to the output of DESC NOTIFICATION INTEGRATION

With this behavior change, the output of the DESC NOTIFICATION INTEGRATION command changes in the following way:

Before the change:
:   For notification integrations of the following types, the DIRECTION property was not included in the output:

    * TYPE = QUEUE and NOTIFICATION_PROVIDER = AZURE_STORAGE_QUEUE
    * TYPE = EMAIL
    * TYPE = WEBHOOK

After the change:
:   The DIRECTION property appears in the output.

In addition, the value in the `property_default` column for the DIRECTION property changes in the following way:

Before the change:
:   The `property_default` column contains `INBOUND` for the DIRECTION property.

After the change:
:   The `property_default` column contains an empty string for the DIRECTION property.

Ref: 1846

---
title: SHOW OBJECTS command: New column and changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1529.md
section: Release Notes
---

# SHOW OBJECTS command: New column and changes to output

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The [SHOW OBJECTS](../../../sql-reference/sql/show-objects.md) output behaves as follows:

Before the change:
:   The output of SHOW OBJECTS does not include the `is_dynamic` column, and the `kind`
    column returns DYNAMIC_TABLE for dynamic tables.

After the change:
:   The output of SHOW OBJECTS includes the `is_dynamic` column, which is defined as follows:

    | Column Name | `is_dynamic` |
    | --- | --- |
    | Data Type | Text |
    | Description | Y if the table is a dynamic table; otherwise, N. |

    The `kind` column in the output of SHOW OBJECTS displays TABLE for dynamic tables.

Ref: 1529

---
title: SHOW OBJECTS command: New IS_ICEBERG column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1842.md
section: Release Notes
---

# SHOW OBJECTS command: New IS_ICEBERG column in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW OBJECTS](../../../sql-reference/sql/show-objects.md) command includes the following new column:

| Column name | Description |
| --- | --- |
| IS_ICEBERG | Indicates whether the object is an Iceberg table. Y if the object is an Iceberg table, otherwise N. |

Ref: 1842

---
title: SHOW ORGANIZATION ACCOUNTS command / ACCOUNTS view (Organization Usage): New Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1358.md
section: Release Notes
---

# SHOW ORGANIZATION ACCOUNTS command / ACCOUNTS view (Organization Usage): New Column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the [SHOW ORGANIZATION ACCOUNTS](../../../sql-reference/sql/show-organization-accounts.md) command and the [ACCOUNTS view](../../../sql-reference/organization-usage/accounts.md) in the Organization Usage schema is as follows:

Before the change:
:   The command output and view do not contain the IS_EVENTS_ACCOUNT column.

After the change:
:   The command output and view contain the IS_EVENTS_ACCOUNT column.

Which is defined as follows:

| Column Name | Data Type | Description |
| --- | --- | --- |
| IS_EVENTS_ACCOUNT | BOOLEAN | Indicates whether an account is an events account.  The [Snowflake Native Apps Framework](../../../developer-guide/native-apps/native-apps-about.md) uses an events account to set up logging and event sharing to help troubleshoot installed applications. |

Ref: 1358

---
title: SHOW ORGANIZATION ACCOUNTS Command: New Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-942.md
section: Release Notes
---

# SHOW ORGANIZATION ACCOUNTS Command: New Column in Output

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The following column has been added to the output of the [SHOW ORGANIZATION ACCOUNTS](../../../sql-reference/sql/show-organization-accounts.md) command:

| Column Name | Data Type | Description |
| --- | --- | --- |
| IS_ORG_ADMIN | BOOLEAN | Indicates whether the ORGADMIN role is enabled in an account. An account with the ORGADMIN role enabled is called the ORGADMIN account. |

To help minimize the impact of this addition, the column has been added as the last column in the output.

Ref: 942

---
title: SHOW ORGANIZATION ACCOUNTS Command: New Columns in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-803.md
section: Release Notes
---

# SHOW ORGANIZATION ACCOUNTS Command: New Columns in Output

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The output of the SHOW ORGANIZATION ACCOUNTS command includes five new columns. These columns are reserved for future use and return an empty string or NULL.

The following columns exist at the end of the output:

| New Column | Returned Value |
| --- | --- |
| account_old_url_saved_on | NULL |
| account_old_url_last_used | NULL |
| organization_old_url | Empty String |
| organization_old_url_saved_on | NULL |
| organization_old_url_last_used | NULL |

Ref: 803

---
title: SHOW ORGANIZATION ACCOUNTS command: Repurposed for new functionality
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1712.md
section: Release Notes
---

# SHOW ORGANIZATION ACCOUNTS command: Repurposed for new functionality

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The SHOW ORGANIZATION ACCOUNT command behaves as follows:

Before the change:
:   The SHOW ORGANIZATION ACCOUNTS command returns all accounts in an organization.

After the change:
:   The SHOW ORGANIZATION ACCOUNTS command returns an error directing the user to execute the [SHOW ACCOUNTS](../../../sql-reference/sql/show-accounts.md) command
    to return all accounts in an organization.

    For customers enrolled in the private preview of organization accounts, which is a new type of account, the SHOW ORGANIZATION ACCOUNTS
    command returns the organization account, not all accounts in the organization.

    When organization accounts become generally available, all customers will use the SHOW ORGANIZATION ACCOUNTS command to return the
    organization account.

Ref: 1712

---
title: SHOW PARAMETERS: Changes to Retention Time Values for Databases Created From a Share
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1146.md
section: Release Notes
---

# SHOW PARAMETERS: Changes to Retention Time Values for Databases Created From a Share

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The results of the [SHOW PARAMETERS IN DATABASE <database_name>](../../../sql-reference/sql/show-parameters.md) command
include default values for parameters that do not apply to a database created from a share.

The [DATA_RETENTION_TIME_IN_DAYS](../../../sql-reference/parameters.md) parameter sets the number of days data
is retained for [Time Travel](../../../user-guide/data-time-travel.md). This
[data retention time value is 0](../2023_02/bcr-945.md) for a database created from a share.

The [MAX_DATA_EXTENSION_TIME_IN_DAYS](../../../sql-reference/parameters.md) parameter sets the maximum number of days
Snowflake can extend the data retention period for tables to ingest streaming data to prevent streams from
becoming stale. This parameter does not apply to a database created from a share since it is read-only.

In the current release, this behavior changed as follows:

Previously:
:   For a database created from a share, the results of the `SHOW PARAMETERS IN DATABASE database_name` command includes the
    default values for:

    * DATA_RETENTION_TIME_IN_DAYS (1)
    * MAX_DATA_EXTENSION_TIME_IN_DAYS (14)

Currently:
:   For a database created from a share, the results of the `SHOW PARAMETERS IN DATABASE database_name` command will no
    longer include the parameters:

    * DATA_RETENTION_TIME_IN_DAYS
    * MAX_DATA_EXTENSION_TIME_IN_DAYS

Ref: 1146

---
title: SHOW REGIONS command: Changes to region display names in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1635.md
section: Release Notes
---

# SHOW REGIONS command: Changes to region display names in output

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

In a future release, display names for specific regions in the output of the [SHOW REGIONS](../../../sql-reference/sql/show-regions.md) command will be updated for VPS customers.
The SHOW REGIONS command behaves as follows:

Before the change:

The DISPLAY_NAME column for VPS customers is defined only based on the region the account is located in.
For example, for a VPS account VPSAccount, in region us-east-1, DISPLAY_NAME
would show as: US East (N. Virginia).

After the change:

The DISPLAY_NAME column for VPS customers includes a customer specific identifier.
For example, for a VPS account VPSAccount, in region us-east-1, DISPLAY_NAME would show as: US East (VPSAccount - N. Virginia).

Ref: 1635

---
title: SHOW REGIONS command: Changes to region names in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1335.md
section: Release Notes
---

# SHOW REGIONS command: Changes to region names in output

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

Some region names in the output of the SHOW REGIONS command are updated as follows:

Before the change:
:   Region display names appear in the output as follows:

    | snowflake_region | cloud | region | display_name |
    | --- | --- | --- | --- |
    | AWS_US_GOV_WEST_1_FHPLUS | aws | us-gov-west-1-fhplus | US Gov West 1 (Fedramp High Plus) |
    | AWS_US_GOV_EAST_1_FHPLUS | aws | us-gov-east-1-fhplus | US Gov East 1 (Fedramp High Plus) |
    | AWS_US_GOV_EAST_1 | aws | us-gov-east-1 | US East (N. Virginia) |

After the change:
:   Region display names appear in the output as follows:

    | snowflake_region | cloud | region | display_name |
    | --- | --- | --- | --- |
    | AWS_US_GOV_WEST_1_FHPLUS | aws | us-gov-west-1-fhplus | US Gov West 1 (FedRAMP High Plus) |
    | AWS_US_GOV_EAST_1_FHPLUS | aws | us-gov-east-1-fhplus | US Gov East 1 (FedRAMP High Plus) |
    | AWS_US_GOV_EAST_1 | aws | us-gov-east-1 | US East (Commercial Gov - N. Virginia) |

Ref: 1335

---
title: SHOW RELEASE DIRECTIVES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1906.md
section: Release Notes
---

# SHOW RELEASE DIRECTIVES command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW RELEASE DIRECTIVES](../../../sql-reference/sql/show-release-directives.md)
command will contain the following new column:

| Column name | Description |
| --- | --- |
| UPGRADE_AFTER | Indicates the earliest date that the release directive is allowed to start an automatic upgrade. |

Ref: 1906

---
title: SHOW RELEASE DIRECTIVES command: new columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1376.md
section: Release Notes
---

# SHOW RELEASE DIRECTIVES command: new columns

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The output of the [SHOW RELEASE DIRECTIVES](../../../sql-reference/sql/show-release-directives.md) command behaves as follows:

Currently:
:   The output of this command does not include the ACTIVE_REGIONS, PENDING_REGIONS, RELEASE_STATUS,
    and DEPLOYED_ON columns.

Pending:
:   The output of this command includes the ACTIVE_REGIONS, PENDING_REGIONS, RELEASE_STATUS, and
    DEPLOYED_ON columns, which are defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | ACTIVE_REGIONS | TEXT | Displays a list of Snowflake regions where the Snowflake Native App has been successfully deployed. |
    | PENDING_REGIONS | TEXT | Displays a list of Snowflake regions where the Snowflake Native App is scheduled to be deployed. |
    | RELEASE_STATUS | TEXT | Displays the current status of the release. Possible values are:   * IN PROGRESS * HOLDING * DEPLOYED |
    | DEPLOYED_ON | TIMESTAMP | Displays the date and time when the application was installed or upgraded. |

Ref: 1376

---
title: SHOW ROLES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2095.md
section: Release Notes
---

# SHOW ROLES command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW ROLES](../../../sql-reference/sql/show-roles.md) command and the SHOW TERSE ROLES command includes the following
new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `is_from_organization_user_group` | BOOLEAN | If TRUE, the role was imported from an [organization user group](../../../user-guide/organization-users.md). |

Ref: 2095

---
title: SHOW SCHEMAS command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1757.md
section: Release Notes
---

# SHOW SCHEMAS command: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SCHEMAS](../../../sql-reference/sql/show-schemas.md) command includes the following new
columns at the end:

| Column name | Description |
| --- | --- |
| CLASSIFICATION_PROFILE_DATABASE | Reserved for future use. |
| CLASSIFICATION_PROFILE_SCHEMA | Reserved for future use. |
| CLASSIFICATION_PROFILE | Reserved for future use. |

Ref: 1757

---
title: SHOW SERVICE CONTAINERS IN SERVICE command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1787.md
section: Release Notes
---

# SHOW SERVICE CONTAINERS IN SERVICE command: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SERVICE CONTAINERS IN SERVICE](../../../sql-reference/sql/show-service-containers-in-service.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| LAST_EXIT_CODE | Indicates the exit code when the container last exited. For service containers, Snowflake restarts the container if it exits prematurely.  The exit code is represented as an integer value:   * NULL: The container is currently running and has never exited. * 0: The container’s last exit was successful. * Non-zero value: The container encountered a failure. |
| LAST_RESTART_TIME | Provides the timestamp of the most recent restart of the container by Snowflake. NULL value indicates the container never restarted. |

Ref: 1787

---
title: SHOW SERVICE INSTANCES IN SERVICE and SHOW SERVICE CONTAINERS IN SERVICE commands: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1915.md
section: Release Notes
---

# SHOW SERVICE INSTANCES IN SERVICE and SHOW SERVICE CONTAINERS IN SERVICE commands: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SERVICE INSTANCES IN SERVICE](../../../sql-reference/sql/show-service-instances-in-service.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| SERVICE_STATUS | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR`   Note that the value in this column is the same as the `status` column in the output of the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md). |

The output of the [SHOW SERVICE CONTAINERS IN SERVICE](../../../sql-reference/sql/show-service-containers-in-service.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| SERVICE_STATUS | One of the following values, which indicates the current status of the service:   * `PENDING` * `RUNNING` * `FAILED` * `DONE` * `SUSPENDING` * `SUSPENDED` * `DELETING` * `DELETED` * `INTERNAL_ERROR`   Note that the value in this column is the same as the `status` column in the output of the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) command. |
| INSTANCE_STATUS | One of the following values, which indicates the current status of the service instance:   * `PENDING`: The service instance is currently being deployed and is not yet ready to serve requests. * `READY`: All containers in the service instance are ready; the service instance is ready to serve requests. * `FAILED`: At least one container in the service instance has exited with a failure. * `TERMINATING`: The service instance is in the process of termination and will be removed after the process is complete. * `SUCCEEDED`: The service is a job service and all containers in the service instance have terminated successfully.   Note that for a given service instance, as identified by the `instance_id` column, the value in the `instance_status` column matches the value in the `status` column in the output of the SHOW SERVICE INSTANCES IN SERVICE command. |

Also, note the following changes in the rows returned by these commands:

* **During a service suspension:**

  Before the change:
  :   When you suspend a service, the output of the SHOW SERVICE CONTAINERS IN SERVICE command output includes list of containers in SUSPENDED status.

  After the change:
  :   The output includes the service being suspended. In the row for this service, the `service_status` column has the value `SUSPENDING` and the `instance_status` column has the value `TERMINATING`.
* **After a service suspension:**

  Before the change:
  :   When you suspend a service, the output of the SHOW SERVICE CONTAINERS IN SERVICE commands output includes list of containers in SUSPENDED status.

  After the change:
  :   The output includes a single row with the value `SUSPENDED` in the `service_status` and NULL in all other columns.
* **During a service upgrade:**

  Before the change:
  :   + The output of the SHOW SERVICE CONTAINERS IN SERVICE command included containers being shut down (containers that would to be restarted with a new image). For these containers, the value in the `status` column is `READY`.
      + The output of the SHOW SERVICE INSTANCES IN SERVICE command includes service instances with `TERMINATING` in the `status` column. This information is not included in the output of the
        SHOW SERVICE CONTAINERS IN SERVICE command.

  After the change:
  :   The output of the SHOW SERVICE CONTAINERS IN SERVICE command includes instances being shut down. These rows have the value `TERMINATING` in the `instance_status` and `status` columns.

Ref: 1915

---
title: SHOW SERVICE INSTANCES IN SERVICE command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1883.md
section: Release Notes
---

# SHOW SERVICE INSTANCES IN SERVICE command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SERVICE INSTANCES IN SERVICE](../../../sql-reference/sql/show-service-instances-in-service.md) command includes the following new column:

| Column name | Description |
| --- | --- |
| IP_ADDRESS | IP address of the service instance. Other instances of the same service (or other services) can use this IP address to connect to a specific service instance.  When you are running multiple service instances, you can implement leader election among the instances of a service by electing the instance with `instance_id` 0 as the leader. |

Ref: 1883

---
title: SHOW SERVICES and DESCRIBE SERVICE commands: New format for the DNS name of a service
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1656.md
section: Release Notes
---

# SHOW SERVICES and DESCRIBE SERVICE commands: New format for the DNS name of a service

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands behave as follows:

Before the change:
:   The `dns_name` column in the output of these commands contains the Snowflake-assigned DNS name of a service in the following format:

    `service-name.schema-name.db-name.snowflakecomputing.internal`

After the change:
:   The format of the DNS name in the column has changed to:

    `service-name.unique-id.svc.spcs.internal`

    The major changes in the format are:

    * `unique-id` replaces the `schema-name.db-name` and is a 4-8 character long alphanumeric identifier that is unique
      to a particular instance of a database schema.

      To find the unique ID for a schema, call the SYSTEM$GET_SERVICE_DNS_DOMAIN function. For example:

      ```sqlexample
      SELECT SYSTEM$GET_SERVICE_DNS_DOMAIN('mydb.myschema');
      ```

      Note the following:

      + If you rename a schema, the identifier remains unchanged.
      + If you drop and recreate a schema with the same name, the identifier will change.
    * `snowflakecomputing` is replaced by `svc.spcs` to reduce the verbosity of the fully qualified DNS name of the service.

Note the following:

* This is a change in the behavior of the [CREATE SERVICE](../../../sql-reference/sql/create-service.md) command. When a service is created, Snowflake assigns the DNS name to a service.

  However, the effects of this behavior change are visible when you use the [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands, which include the DNS name in the output.
* For services deployed after the 2024_06 bundle is enabled, the old style DNS names will continue to work for some time. Snowflake recommends that you update your code to use the new DNS format.

Ref: 1656

---
title: SHOW SHARES Command: Changes to Output and New OWNER_ACCOUNT Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1180.md
section: Release Notes
---

# SHOW SHARES Command: Changes to Output and New OWNER_ACCOUNT Column in Output

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, the output of the [SHOW SHARES](../../../sql-reference/sql/show-shares.md) or [SHOW SHARES IN REPLICATION GROUP](../../../sql-reference/sql/show-shares-in-replication-group.md)
commands includes the column OWNER_ACCOUNT and changes the output of an existing column which behaves as follows:

Previously:
:   The output from the SHOW SHARES or SHOW SHARES IN REPLICATION GROUP commands shows a fully-qualified name in the NAME column.

    For example: `COMPANY.SFC_SAMPLES.SAMPLE_DATA`.

Currently:
:   The output from the commands includes the column, OWNER_ACCOUNT, that contains the [account identifier](../../../user-guide/admin-account-identifier.md)
    for the account that owns the share, and the NAME column only shows the share name.

    For example, OWNER_ACCOUNT contains `COMPANY.SFC_SAMPLES` and NAME contains `SAMPLE_DATA`.

    The OWNER_ACCOUNT column is added before the NAME column in the output.

Ref: 1180

---
title: SHOW SHARES command: New column (SECURE_OBJECTS_ONLY)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1600.md
section: Release Notes
---

# SHOW SHARES command: New column (SECURE_OBJECTS_ONLY)

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SHARES](../../../sql-reference/sql/show-shares.md) command includes the following new column(s):

| Column name | Data type | Description |
| --- | --- | --- |
| SECURE_OBJECTS_ONLY | BOOLEAN | Indicates if non-secure objects are enabled:  If TRUE, the share allows secure views only.  If FALSE, the share also allows non-secure views. |

Ref: 1600

---
title: SHOW SNAPSHOTS and DESCRIBE SNAPSHOT commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1836.md
section: Release Notes
---

# SHOW SNAPSHOTS and DESCRIBE SNAPSHOT commands: New column in output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the [DESCRIBE SNAPSHOT](../../../sql-reference/sql/desc-snapshot.md) and [SHOW SNAPSHOTS](../../../sql-reference/sql/show-snapshots.md) commands include the following new column:

| Column name | Description |
| --- | --- |
| ENCRYPTION | Encryption type configured for the volume, from which the snapshot was created. The value can be SNOWFLAKE_SSE or SNOWFLAKE_FULL. |

Ref: 1836

---
title: SHOW STAGES command: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1131.md
section: Release Notes
---

# SHOW STAGES command: New columns

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the following columns will be included in the output of the [SHOW STAGES](../../../sql-reference/sql/show-stages.md) command:

| Column Name | Data Type | Description |
| --- | --- | --- |
| `ENDPOINT` | String | The S3-compatible API endpoint associated with the stage. The value is always NULL for stages that are not S3-compatible. |
| `DIRECTORY_ENABLED` | String | Indicates whether the stage has a directory table enabled. Values include `Y` (directory table is enabled) or `N` (not enabled). |

These columns have been added to identify which external stages are configured for S3-compatible storage,
and which stages have directory tables.

Ref: 1131

---
title: SHOW STREAMS command: Change in output for streams on directory tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2170.md
section: Release Notes
---

# SHOW STREAMS command: Change in output for streams on directory tables

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

The [SHOW STREAMS](../../../sql-reference/sql/show-streams.md) command for streams created on directory tables (for example, stages with the
`DIRECTORY = (ENABLE=TRUE)` property) behaves as follows:

Before the change:
:   The output for the `table_name` and `base_tables` columns in the SHOW STREAMS command returns the unqualified name of the stage
    for streams on a stage with directory tables enabled. For example, `mystage`.

After the change:
:   The output for the `table_name` and `base_tables` columns in the SHOW STREAMS command returns the fully qualified name of the
    stage for streams on a stage with directory stables enabled. For example, `my_db.my_schema.test_stage`.

Ref: 2170

---
title: SHOW TABLES and SHOW WAREHOUSES commands: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2165.md
section: Release Notes
---

# SHOW TABLES and SHOW WAREHOUSES commands: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

These columns help you to examine and monitor the objects associated with
[interactive tables and interactive warehouses](../../../user-guide/interactive.md).

When this behavior change bundle is enabled, the output of the [SHOW TABLES](../../../sql-reference/sql/show-tables.md) command
includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| is_interactive | BOOLEAN | Whether the table is an [interactive table](../../../user-guide/interactive.md). |

When this behavior change bundle is enabled, the output of the [SHOW WAREHOUSES](../../../sql-reference/sql/show-warehouses.md) command
includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| tables | NUMBER | Number of interactive tables associated with the [interactive warehouse](../../../user-guide/interactive.md). |

Ref: 2165

---
title: SHOW TABLES command / TABLES view: New is_iceberg column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1448.md
section: Release Notes
---

# SHOW TABLES command / TABLES view: New is_iceberg column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the following command and views is as follows:

* [SHOW TABLES command](../../../sql-reference/sql/show-tables.md)
* [TABLES view (Account Usage)](../../../sql-reference/account-usage/tables.md)
* [TABLES view (Information Schema)](../../../sql-reference/info-schema/tables.md)

Before the change:
:   The command output views do not include the is_iceberg column.

After the change:
:   The command output and views include the is_iceberg column.
    This column is a placeholder for future functionality.

Ref: 1448

---
title: SHOW TABLES command, TABLES view, and GET_DDL command: Changes related to the READ ONLY property for tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1572.md
section: Release Notes
---

# SHOW TABLES command, TABLES view, and GET_DDL command: Changes related to the READ ONLY property for tables

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, you can create tables with a new READ ONLY property. New output related
to this property is added when you run the SHOW TABLES command, query the TABLES view, or run the GET_DDL command.

## SHOW TABLES command: New is_immutable column

A new column is added to the output of the [SHOW TABLES](../../../sql-reference/sql/show-tables.md) command.

Before the change:
:   The output of the SHOW TABLES command does not include an `is_immutable` column.

After the change:
:   The output of the SHOW TABLES command includes an `is_immutable` column.

    | Column name | Description |
    | --- | --- |
    | `is_immutable` | `Y` if the table was created with the READ ONLY property; `N` otherwise. |

## TABLES view (Information Schema): New IS_IMMUTABLE column

A new column is added to the [TABLES view](../../../sql-reference/info-schema/tables.md).

Before the change:
:   The TABLES view does not include an IS_IMMUTABLE column.

After the change:
:   The TABLES view includes an IS_IMMUTABLE column.

    | Column name | Data type | Description |
    | --- | --- | --- |
    | IS_IMMUTABLE | TEXT | Indicates whether the table was created with the READ ONLY property. Valid values are `YES` or `NO`. |

> **Note:**
>
> The ACCOUNT_USAGE.TABLES view will not include the IS_IMMUTABLE column because temporary tables are not reported
> in this view.

## The GET_DDL command returns output for tables created with the READ ONLY property

The [GET_DDL](../../../sql-reference/functions/get_ddl.md) command returns the following output for tables that were created
with the READ ONLY property:

```output
CREATE OR REPLACE TEMPORARY READ ONLY TABLE <table_name> CLONE <src_table_name>
```

Ref: 1572

---
title: SHOW TABLES Command: Event Tables Listed and New Columns Added to Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1006-1157.md
section: Release Notes
---

# SHOW TABLES Command: Event Tables Listed and New Columns Added to Output

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

In the current release, the output of the SHOW TABLES command includes in its list [event tables](../../../developer-guide/logging-tracing/event-table-operations.md), along with an IS_EVENT column whose value indicates whether the listed table is an event table. It also includes an ENABLE_SCHEMA_EVOLUTION column whose value indicates whether schema evolution is enabled for the table.

Previously:
:   Output from the SHOW TABLES command does not include [event tables](../../../developer-guide/logging-tracing/event-table-operations.md) in its list of tables. The output currently doesn’t include an IS_EVENT column, nor an ENABLE_SCHEMA_EVOLUTION column.

Currently:
:   SHOW TABLES output includes event tables in its output.

    It also includes an IS_EVENT column whose boolean value is `true` if the row describes an event table, `false` otherwise.

    It also includes an ENABLE_SCHEMA_EVOLUTION column with a boolean value. If the value is `Y`, automatic table schema evolution is enabled. If the value is `N`, automatic table schema evolution is disabled. You can enable automatic table schema evolution by using the [CREATE TABLE](../../../sql-reference/sql/create-table.md) or [ALTER TABLE](../../../sql-reference/sql/alter-table.md) commands.

Ref: 1006 1557

---
title: SHOW TABLES command: New column is_dynamic
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1580.md
section: Release Notes
---

# SHOW TABLES command: New column `is_dynamic`

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The output of the [SHOW TABLES](../../../sql-reference/sql/show-tables.md) command, when displaying Dynamic tables, behaves as follows:

Before the change:
:   The output of SHOW TABLES does not include dynamic table rows and the `is_dynamic` column.

After the change:
:   The output of SHOW TABLES includes dynamic table rows and the `is_dynamic` column, defined
    as follows:

    | Column Name | `is_dynamic` |
    | --- | --- |
    | Data Type | Text |
    | Description | Y if the table is a dynamic table; otherwise, N. |

    The `kind` column in the output of SHOW TABLES displays TABLE for dynamic tables.

Ref: 1529

---
title: SHOW TABLES command: New is_hybrid column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1415.md
section: Release Notes
---

# SHOW TABLES command: New is_hybrid column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

A new column is added to the output of the SHOW TABLES command.

Before the change:
:   The output of the SHOW TABLES command does not include an IS_HYBRID column.

After the change:
:   The output of the SHOW TABLES command includes an IS_HYBRID column:

    | Column name | Description |
    | --- | --- |
    | is_hybrid | `Y` if it is a hybrid table; `N` otherwise. |

Ref: 1415

---
title: SHOW TAGS command and TAGS view (Account Usage): New columns in output (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2291.md
section: Release Notes
---

# SHOW TAGS command and TAGS view (Account Usage): New columns in output (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW TAGS](../../../sql-reference/sql/show-tags.md) command and the
[TAGS](../../../sql-reference/account-usage/tags.md) view in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema include new columns as described below.

## SHOW TAGS command

The output includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `ON_CONFLICT` | VARCHAR | When tag propagation is enabled for the tag, shows the configured `ON_CONFLICT` behavior for the tag. |
| `MULTI_VALUE` | BOOLEAN | Indicates whether the tag accepts multiple values at the time of association to an object. This property is reserved for a future capability; the column is present as a placeholder until that functionality is available. |

## TAGS view (Account Usage)

The view includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `MULTI_VALUE` | BOOLEAN | Indicates whether the tag accepts multiple values at the time of association to an object. This property is reserved for a future capability; the column is present as a placeholder until that functionality is available. |

Ref: 2291

---
title: SHOW TAGS: Shared Tags Require the READ Privilege on the Tag
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_05/bcr-1196.md
section: Release Notes
---

# SHOW TAGS: Shared Tags Require the READ Privilege on the Tag

> **Attention:**
>
> This behavior change is in the 2023_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_05_bundle.md).

The behavior of the SHOW TAGS command with respect to data sharing is as follows:

Previously:
:   If a data sharing provider shares a schema that stores tags, the consumer can view all of the shared tags using a SHOW TAGS command
    provided that the role that executes the SHOW TAGS command has the required privileges to access the shared schema.

Currently:
:   As a consumer, to use the SHOW TAGS command to view shared tags, you must use a role that is granted the READ privilege on each tag. The
    READ privilege for a tag is new and is only applicable in a data sharing context.

    The provider chooses how to grant the READ privilege on the tag to the share:

    * Grant the READ privilege on each tag to the share directly.

      ```sqlexample
      GRANT READ ON TAG mytag TO SHARE myshare;
      ```
    * Grant the READ privilege on the tag to a database role and grant the database role to the share.

      ```sqlexample
      GRANT READ ON TAG mytag TO DATABASE ROLE mydb.dbrole;
      GRANT DATABASE ROLE mydb.dbrole TO SHARE myshare;
      ```

    The consumer:

    * Creates a database from the share and grants privileges on the share to account roles (no changes).
    * Grants the database role to an account role (if applicable, no changes).
    * Uses SQL to execute the SHOW TAGS command.

    To determine whether you have tags that are affected by the pending behavior, use these commands:

    ```sqlexample
    SHOW TAGS IN shared_database;
    SHOW TAGS IN shared_schema;
    ```

Ref: 1196

---
title: SHOW TASKS and DESCRIBE TASK commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2242.md
section: Release Notes
---

# SHOW TASKS and DESCRIBE TASK commands: New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) and
[DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) commands includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| OVERLAP_POLICY | VARCHAR | The overlap policy for the task. |

Ref: 2242

---
title: SHOW TASKS and DESCRIBE TASK commands: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1385-1414.md
section: Release Notes
---

# SHOW TASKS and DESCRIBE TASK commands: New columns

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The output of SHOW TASKS and DESCRIBE TASK commands is as follows:

Before the change:
:   The output of SHOW TASKS and DESCRIBE TASK commands does not include last_suspended_reason or task_relations columns.

    The existing last_suspended_on column shows timestamps only for the root tasks and shows NULL for child tasks.

After the change:
:   The output of SHOW TASKS and DESCRIBE TASK commands includes last_suspended_reason and task_relations columns.

    The existing last_suspended_on column shows timestamps for both the root tasks and the child tasks.

    | Column Name | Description |
    | --- | --- |
    | last_suspended_reason | Displays the reason why the task was suspended. The possible reasons include the following:   * USER_SUSPENDED: The user suspended the task by running the `alter task <name> suspend` command. * SCHEMA_OR_DATABASE_DELETED: The schema or database of the task was dropped. * GRANT_OWNERSHIP: The user transferred the ownership of the task to another role by running the `grant ownership` command. * SUSPENDED_DUE_TO_ERRORS: The task failed a certain number of consecutive times and was suspended. You can set the [SUSPEND_TASK_AFTER_NUM_FAILURES](../../../sql-reference/parameters.md) parameter for the number of failures required to suspend this task. * CHILD_BECAME_ROOT: The task was previously a child task in a DAG of tasks, but all predecessors of the child task were removed and the child task became a root task. * FINALIZER_BECAME_ROOT: The task was previously a finalizer task in a DAG of tasks, but the finalization was removed and the task became a root task. * MATCHING_OWNER_NOT_FOUND: During [task replication](../../../user-guide/account-replication-considerations.md), the role that owns the task was not found on the secondary database. |
    | task_relations | Displays the relationship between the root task and its corresponding finalizer tasks. |
    | last_suspended_on | Displays the timestamps for both the root tasks and the child tasks. |

Ref: 1385 1414

---
title: SHOW TASKS and DESCRIBE TASKS commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2051.md
section: Release Notes
---

# SHOW TASKS and DESCRIBE TASKS commands: New column in output

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) and [DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) commands includes the following new column:

| Column name | Description |
| --- | --- |
| `execute_as_user` | Shows the user name of a user who runs a task by using EXECUTE AS USER. Shows as NULL if a task is run as the system user. |

Organizations that assign Snowflake security privileges by user can allow users to run team tasks by impersonating a user role or designated team role. A user or team grants this permission by using the GRANT IMPERSONATE AS USER command. Commands are run by users impersonating another user or team by using the CREATE TASKS … EXECUTE AS USER command.

For more information, see [Run tasks with user privileges](../../../user-guide/tasks-intro.md).

Ref: 2051

---
title: SHOW TASKS/DESC TASK commands: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1719-1807.md
section: Release Notes
---

# SHOW TASKS/DESC TASK commands: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) / [DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) commands include the following new column:

| Column name | Description |
| --- | --- |
| SUCCESS_INTEGRATION | Name of the notification integration used to access Amazon Simple Notification Service (SNS), Google Pub/Sub, or Microsoft Azure Event Grid to relay success notifications for the task. |
| SCHEDULING_MODE | Displays whether the serverless task is FIXED or FLEXIBLE.   * When the scheduling mode is FIXED, the task execution is based on the user-specified schedule for the task. * When the scheduling mode is FLEXIBLE, the task execution is based on the user-specified schedule and target completion interval for the task. |
| TARGET_COMPLETION_INTERVAL | Displays the target completion interval that the user wants, which the serverless task uses to determine the size of the compute resources to use to complete execution. |

Ref: 1719 1807

---
title: SHOW TERSE DATABASES Command: Values Populated in the KIND Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1022.md
section: Release Notes
---

# SHOW TERSE DATABASES Command: Values Populated in the KIND Column

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The SHOW TERSE DATABASES command output includes non-NULL values in the KIND column.

Previously:
:   This column only allows NULL values.

Currently:
:   This column specifies one of the following values:

    * STANDARD: Specifies a normal database.
    * IMPORTED DATABASE: Specifies a database that is created from a share.

Ref: 1022

---
title: SHOW USERS and DESCRIBE USER commands: Changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1951.md
section: Release Notes
---

# SHOW USERS and DESCRIBE USER commands: Changes to output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW USERS](../../../sql-reference/sql/show-users.md) and
[DESCRIBE USER](../../../sql-reference/sql/desc-user.md) commands includes additional information about different authentication methods:

* SHOW USERS: New columns in output
* DESCRIBE USER: New properties in output

## SHOW USERS: New columns in output

When this behavior change bundle is enabled, the output of the SHOW USERS command includes the following new columns:

| Column name | Description |
| --- | --- |
| `has_pat` | If `true`, the user has one or more programmatic access tokens. |
| `has_federated_workload_authentication` | Reserved for future use. |

These columns also appear in the output of SHOW TERSE USERS.

## DESCRIBE USER: New properties in output

When this behavior change bundle is enabled, the output of the DESCRIBE USER command includes the following new properties:

| Property name | Description |
| --- | --- |
| HAS_PAT | If `true`, the user has one or more programmatic access tokens. |
| HAS_FEDERATED_WORKLOAD_AUTHENTICATION | Reserved for future use. |

These properties are listed after the HAS_MFA property and before the MINS_TO_BYPASS_MFA property.

Ref: 1951

---
title: SHOW USERS and DESCRIBE USER commands: Changes to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1999.md
section: Release Notes
---

# SHOW USERS and DESCRIBE USER commands: Changes to output

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, the output of the SHOW USERS and DESCRIBE USER commands includes column name changes related to
workload identity federation.

> **Note:**
>
> This column was introduced into the [SHOW USERS](../../../sql-reference/sql/show-users.md) and [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) commands in [BCR-1951](../2025_03/bcr-1951.md) in the
> [2025_03 bundle](../2025_03_bundle.md).

## SHOW USERS: Changes to column name

When this behavior change bundle is enabled, the output of the [SHOW USERS](../../../sql-reference/sql/show-users.md) command includes the
following column name change(s):

Before the change:
:   The column is named HAS_FEDERATED_WORKLOAD_AUTHENTCATION

After the change:
:   The column is named HAS_WORKLOAD_IDENTITY

This column also appears in the output of SHOW TERSE USERS.

## DESCRIBE USER: Changes to column name

When this behavior change bundle is enabled, the output of the [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) command includes the
following column name change(s):

Before the change:
:   The column is named HAS_FEDERATED_WORKLOAD_AUTHENTCATION

After the change:
:   The column is named HAS_WORKLOAD_IDENTITY

This column is listed after the HAS_PAT property and before the MINS_TO_BYPASS_MFA property.

---

Ref: 1999

---
title: SHOW USERS and DESCRIBE USER commands: New column/property in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2066.md
section: Release Notes
---

# SHOW USERS and DESCRIBE USER commands: New column/property in output

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

Administrators can create users in an account by importing an [organization user](../../../user-guide/organization-users.md), which is a global
user entity. When this behavior change bundle is enabled, the output of the SHOW USERS and DESCRIBE USER commands includes information to
determine whether a user in a regular account was imported from an organization user.

## Output of SHOW USERS

When this behavior change bundle is enabled, the output of the [SHOW USERS](../../../sql-reference/sql/show-users.md) command includes the following new
column:

| Column name | Data type | Description |
| --- | --- | --- |
| `is_from_organization_user` | BOOLEAN | * If TRUE, the user was imported from an organization user that is defined in the organization account. * If FALSE, the user was created in the regular account and is not linked to an organization user that is defined in the organization   account. |

## Output of DESCRIBE USER

When this behavior change bundle is enabled, the output of the [DESCRIBE USER](../../../sql-reference/sql/desc-user.md) command includes the following new
property:

| Property | Description |
| --- | --- |
| `IS_FROM_ORGANIZATION_USER` | * If TRUE, the user was imported from an organization user that is defined in the organization account. * If FALSE, the user was created in the regular account and is not linked to an organization user that is defined in the organization   account. |

Ref: 2066

---
title: SHOW USERS command: NULL values replace default values in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1798.md
section: Release Notes
---

# SHOW USERS command: NULL values replace default values in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW USERS](../../../sql-reference/sql/show-users.md) command changes as follows:

Before the change:
:   Certain output columns show default values (such as an empty string or false) rather than actual NULL values. This behavior occurs both when the current user
    does not have access to the column value and when the property for the user in question is not defined.

    For example, create a user named `nulltest`:

    ```sqlexample
    CREATE OR REPLACE USER nulltest DISPLAY_NAME = 'iamnull';
    ```

    The SHOW USERS command for this new user returns an empty string for undefined properties such as `first_name`, `last_name`,
    and `email`.

After the change:
:   NULL is used to represent expected NULL values, instead of an empty string, `false`, and so on. For example, create a user named `nulltest`:

    ```sqlexample
    CREATE OR REPLACE USER nulltest DISPLAY_NAME = 'iamnull';
    ```

    The SHOW USERS command for this new user returns NULL values for undefined properties such as `first_name`, `last_name`,
    and `email`.

    This change affects the following SHOW USERS output columns:

    * `name`
    * `comment`
    * `display_name`
    * `email`
    * `first_name`
    * `last_name`
    * `has_password`
    * `has_rsa_public_key`
    * `has_mfa`
    * `namespace`
    * `warehouse`
    * `default_role`
    * `login_name`
    * `disabled`
    * `snowflake_lock`
    * `must_change_password`
    * `mins_to_unlock`
    * `days_to_expiry`
    * `mins_to_bypass_mfa`
    * `default_secondary_roles`

Ref: 1798

---
title: SHOW USERS command: Output filtered based on privileges granted to active role
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-975.md
section: Release Notes
---

# SHOW USERS command: Output filtered based on privileges granted to active role

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The [SHOW USERS](../../../sql-reference/sql/show-users.md) command behaves as follows:

Previously:
:   To see the output of the SHOW USERS command, the [active role](../../../user-guide/security-access-control-overview.md) must have the global
    MANAGE GRANTS privilege.

    When you try to use a role that does not have the global MANAGE GRANTS privilege and run the SHOW USERS command, Snowflake returns the
    following error message:

    ```output
    Insufficient privileges to operate on account '<account_name>'
    ```

Currently:
:   Any user can execute the SHOW USERS command. Snowflake returns all users and filters the output based upon the privileges granted to the
    active role that runs the command. The user that runs the command will always be able to see the username in the results. To see the
    output, the active role must have either:

    * The OWNERSHIP privilege on the user object.
    * The CREATE USER privilege on the account.

Ref: 975

---
title: SHOW VERSIONS command: New column in output (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2283.md
section: Release Notes
---

# SHOW VERSIONS command: New column in output (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

The output of the [SHOW VERSIONS IN APPLICATION PACKAGE](../../../sql-reference/sql/show-versions.md) command
changes as follows:

Before the change:
:   The output of the SHOW VERSIONS IN APPLICATION PACKAGE command did not include information about the
    state of a version within each release channel. Providers could not see whether a version was in a
    dropping state within a release channel.

After the change:
:   The output of the SHOW VERSIONS IN APPLICATION PACKAGE command includes a new `release_channels`
    column. This column contains a JSON object that describes the state of the version in each release
    channel, including the state and timestamps for when the version was added to or dropped from the
    channel.

Ref: 2283

---
title: SHOW VERSIONS IN APPLICATION command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1900.md
section: Release Notes
---

# SHOW VERSIONS IN APPLICATION command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW VERSIONS IN APPLICATION PACKAGE](../../../sql-reference/sql/show-versions.md) command will include the following new column:

| Column name | Description |
| --- | --- |
| METRIC_LEVEL | Displays the metric level specified in the application package |

Ref: 1900

---
title: SHOW VERSIONS IN APPLICATION PACKAGE command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2232.md
section: Release Notes
---

# SHOW VERSIONS IN APPLICATION PACKAGE command: New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW VERSIONS IN APPLICATION PACKAGE](../../../sql-reference/sql/show-versions.md) command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| LOG_EVENT_LEVEL | VARCHAR | Specifies the event logging level to use for the app. |

Ref: 2232

---
title: SHOW VERSIONS IN MODEL command: module_name column in output renamed model_name
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1700.md
section: Release Notes
---

# SHOW VERSIONS IN MODEL command: `module_name` column in output renamed `model_name`

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the `module_name` column in the output of the [SHOW VERSIONS IN MODEL](../../../sql-reference/sql/show-versions-in-model.md)
command is renamed `model_name`. The content of the column (the name of the model) does not change.

Ref: 1700

---
title: SHOW VERSIONS IN MODEL command: New column aliases
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1620.md
section: Release Notes
---

# SHOW VERSIONS IN MODEL command: New column aliases

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW VERSIONS IN MODEL](../../../sql-reference/sql/show-versions-in-model.md) command
changes to include a new column named `aliases`.

| Column Name | Data Type | Description |
| --- | --- | --- |
| `aliases` | ARRAY of VARCHAR | Aliases of the model version, if any. If a model version has no aliases, this column contains an empty ARRAY (`[]`) rather than NULL. |

`aliases` is added as the third column of the output, immediately following `name`.

Ref: 1620

---
title: SHOW VERSIONS IN MODEL: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1778.md
section: Release Notes
---

# SHOW VERSIONS IN MODEL: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, five new columns appear in the [SHOW VERSIONS IN MODEL](../../../sql-reference/sql/show-versions-in-model.md)
command output, as described below. Existing model versions do not have values for these columns.

| Column name | Description |
| --- | --- |
| MODEL_ATTRIBUTES | Attributes of a model version; for example, the framework it was developed with or the task it is intended to perform. |
| SIZE | The size of the model version in bytes, including all artifacts (code, weights, and so on). |
| ENVIRONMENT | Details about the environment where the model version runs. |
| RUNNABLE_IN | The environments where the model version can be executed. |
| INFERENCE_SERVICES | List of services where the model is deployed. |

Ref: 1778

---
title: SHOW WAREHOUSES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1725.md
section: Release Notes
---

# SHOW WAREHOUSES command: New column in output

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW WAREHOUSES](../../../sql-reference/sql/show-warehouses.md)
command will include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `resource_constraint` | VARCHAR | Reserved for future use. |

Ref: 1725

---
title: SHOW WAREHOUSES command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2110.md
section: Release Notes
---

# SHOW WAREHOUSES command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the [SHOW WAREHOUSES](../../../sql-reference/sql/show-warehouses.md) command
output includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `generation` | VARCHAR | A positive integer, currently either `1` or `2`. The VARCHAR type enables keyword values to be used in the future. |

This new column lets you check which of your standard warehouses are Gen1 or
[Gen2](../../../user-guide/warehouses-gen2.md).

Ref: 2110

---
title: SHOW WAREHOUSES command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1889.md
section: Release Notes
---

# SHOW WAREHOUSES command: New columns in output

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW WAREHOUSES](../../../sql-reference/sql/show-warehouses.md) command
includes the following new columns:

| Column name | Description |
| --- | --- |
| warehouse_credit_limit | Reserved for future use |
| target_statement_size | Reserved for future use |
| disabled_reasons | Reserved for future use |

Ref: 1889

---
title: SHOW/DESC AVAILABLE LISTING: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1865.md
section: Release Notes
---

# SHOW/DESC AVAILABLE LISTING: New column in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the
[SHOW AVAILABLE LISTINGS](../../../sql-reference/sql/show-available-listings.md) and [DESCRIBE AVAILABLE LISTING](../../../sql-reference/sql/desc-available-listing.md) commands
include the following new column:

| Column name | Description |
| --- | --- |
| COMPLIANCE_BADGES | Returns a TEXT string representing an array of compliance badges associated with the listing.  For example: `compliance_badges: [{"type": "SOC2", "expiry": "12-25-2055"]`. |

Ref: 1865

---
title: SHOW/DESC COMPUTE POOL command: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1602.md
section: Release Notes
---

# SHOW/DESC COMPUTE POOL command: New columns

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The output of the [SHOW COMPUTE POOLS](../../../sql-reference/sql/show-compute-pools.md) and [DESCRIBE COMPUTE POOL](../../../sql-reference/sql/desc-compute-pool.md) commands change as follows:

Before the change:
:   The output of the commands do not include the `IS_EXCLUSIVE` and `APPLICATION` columns.

After the change:
:   > The output of the commands include the `IS_EXCLUSIVE` and `APPLICATION` columns.

    | Column name | Data type | Description |
    | --- | --- | --- |
    | IS_EXCLUSIVE | BOOLEAN | `true` if the compute pool is created exclusively for a Snowflake Native App; `false` otherwise. |
    | APPLICATION | TEXT | Name of the Snowflake Native App if the compute pool is created exclusively for the app. Otherwise, NULL. |

Ref: 1602

---
title: SHOW/DESC PIPE[S] commands: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1659.md
section: Release Notes
---

# SHOW/DESC PIPE[S] commands: New column in output

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW PIPES](../../../sql-reference/sql/show-pipes.md)
and [DESCRIBE PIPE](../../../sql-reference/sql/desc-pipe.md) commands will include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| KIND | VARCHAR | The kind of the pipe, either KAFKA or STAGE. |

Ref: 1659

---
title: SHOW/DESC SERVICE commands and Information Schema and Account Usage SERVICES views: New IS_JOB column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1516.md
section: Release Notes
---

# SHOW/DESC SERVICE commands and Information Schema and Account Usage SERVICES views: New `IS_JOB` column

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The output of the [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands, and the
[Information Schema SERVICES](../../../sql-reference/info-schema/services.md) and [Account Usage SERVICES](../../../sql-reference/account-usage/services.md) views, change as follows:

Before the change:
:   The views and the output of the commands do not include the `is_job` column.

After the change:
:   The views and the output of the commands include the `is_job` column, which is defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | IS_JOB | BOOLEAN | `Y` if the service is started by running the [EXECUTE JOB SERVICE](../../../sql-reference/sql/execute-job-service.md) command. |

Ref: 1516

---
title: SHOW/DESC SERVICE commands and Information Schema SERVICES view: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1717-1723.md
section: Release Notes
---

# SHOW/DESC SERVICE commands and Information Schema SERVICES view: New columns

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and
[DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands and the Information Schema
[SERVICES](../../../sql-reference/info-schema/services.md) view include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| CURRENT_INSTANCES | NUMBER | The current number of instances for the service. |
| TARGET_INSTANCES | NUMBER | The target number of service instances that should be running as determined by Snowflake.  When the CURRENT_INSTANCES value is not equal to the TARGET_INSTANCES value, Snowflake is either in the process of shutting down or launching service instances.  For example,   * Suppose you create a service with MIN_INSTANCES = 1 and MAX_INSTANCES = 3. While the service is running, Snowflake might   determine that one instance is not enough. In this case, the value of TARGET_INSTANCES will increase, indicating   Snowflake is in the process of launching additional instances.  It is also possible that the TARGET_INSTANCES value is less than the CURRENT_INSTANCES value, which indicates that   Snowflake is in the process of reducing the number of running instances. * If you create services but the compute pool does not have capacity for the minimum number of instances that you   requested, the value of TARGET_INSTANCES will be equal to the value of MIN_INSTANCES. The value of CURRENT_INSTANCES   will be less than the value of TARGET_INSTANCES. |
| SPEC_DIGEST | VARCHAR | The unique and immutable identifier representing the service spec content.  To observe the changes to the value of the SPEC_DIGEST column over time, a service user might execute the SHOW SERVICES command periodically. If the service user notices a change in value, they can infer that the service was upgraded. |
| IS_UPGRADING | BOOLEAN | TRUE, if Snowflake is in the process of upgrading the service. |
| MANAGING_OBJECT_DOMAIN | VARCHAR | The domain of the managing object (for example, the domain of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |
| MANAGING_OBJECT_NAME | VARCHAR | The domain of the managing object (for example, the name of the notebook that manages the service). NULL if the service is not managed by a Snowflake entity. |

> **Note:**
>
> * The new CURRENT_INSTANCES and TARGET_INSTANCES columns appear after the existing DNS_NAME column.
> * The new SPEC_DIGEST, IS_UPGRADING, MANAGING_OBJECT_DOMAIN, and MANAGING_OBJECT_NAME columns appear at the end.

Ref: 1717, 1723

---
title: SHOW/DESC SERVICE commands and Information Schema view: New STATUS column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1596.md
section: Release Notes
---

# SHOW/DESC SERVICE commands and Information Schema view: New `STATUS` column

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

The output of the [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands, and the
[Information Schema SERVICES](../../../sql-reference/info-schema/services.md) view, change as follows:

Before the change:
:   The view and the output of the commands do not include the `status` column.

After the change:
:   The view and the output of the commands include the `status` column, which is defined as follows:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | STATUS | TEXT | Current status of the service: `PENDING`, `RUNNING`, `FAILED`, `DONE`, `SUSPENDING`, `SUSPENDED`, `DELETING`, `DELETED`, or `INTERNAL_ERROR`. |

Ref: 1596

---
title: SHOW/DESC SERVICE commands and SERVICES view (ACCOUNT_USAGE and INFORMATION SCHEMA): New column MIN_READY_INSTANCES
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1793.md
section: Release Notes
---

# SHOW/DESC SERVICE commands and SERVICES view (ACCOUNT_USAGE and INFORMATION SCHEMA): New column MIN_READY_INSTANCES

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW SERVICES](../../../sql-reference/sql/show-services.md) and [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) commands, [SERVICES view (ACCOUNT_USAGE)](../../../sql-reference/account-usage/services.md), and [SERVICES view (INFORMATION_SCHEMA)](../../../sql-reference/info-schema/services.md) include the following new column:

| Column name | Description |
| --- | --- |
| MIN_READY_INSTANCES | Indicates the minimum service instances that must be ready for Snowflake to consider the service is ready to process requests. The default is the same value as the MIN_INSTANCES. |

Ref: 1793

---
title: SHOW/DESCRIBE APPLICATION PACKAGE command: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1838.md
section: Release Notes
---

# SHOW/DESCRIBE APPLICATION PACKAGE command: New column in output

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW APPLICATION PACKAGES](../../../sql-reference/sql/show-application-packages.md)
and DESCRIBE APPLICATION PACKAGE commands will include a new column in their output.

Before the change:
:   The `multiple_instances` column does not appear in the output of the SHOW APPLICATION PACKAGES
    and DESCRIBE APPLICATION PACKAGE commands.

After the change:
:   The output of the SHOW APPLICATION PACKAGES and DESCRIBE APPLICATION PACKAGE
    commands contains the `multiple_instances` column.

    This column indicates if the provider has configured the application package to allow multiple instances of apps created from it.
    See [Allow consumers to install multiple instances of an app](../../../developer-guide/native-apps/creating-app-package.md) for more information.

Ref: 1838

---
title: Snowflake .NET driver update - August 2022
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/dot-net-driver-relnotes.md
section: Release Notes
---

# Snowflake .NET driver update - August 2022

An update for the Snowflake .NET driver is now available. See [.NET Driver release notes for 2023](../../clients-drivers/dotnet-2023.md).

---
title: Snowflake CLI release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowflake-cli.md
section: Release Notes
---

# Snowflake CLI release notes

The Snowflake CLI release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowflake-cli-2026.md)
* [2025 releases](snowflake-cli-2025.md)
* [2024 releases](snowflake-cli-2024.md)

See [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) for documentation.

---
title: Snowflake CLI release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowflake-cli-2024.md
section: Release Notes
---

# Snowflake CLI release notes for 2024

This article contains the release notes for the Snowflake CLI, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) for documentation.

## Version 3.2.2 (December 13, 2024)

### New features and updates

* None

### Bug fixes

* Fixed the `No module name 'pandas'` warning.

## Version 3.2.1 (December 03, 2024)

### New features and updates

* None

### Bug fixes

* Fixed an issue that caused failures when using older x86_64 Intel CPUs.

## Version 3.2.0 (November 25, 2024)

### Deprecations

* Deprecated the `manifest` field of the `application package` entity in the Native App project definition file. The field no longer has any functionality.

### New features and updates

* Added support for event sharing in Native App project definitions.

  + Added a new `telemetry` section to the `application` entity.
  + Added the following fields to the `telemetry` section: `share_mandatory_events` and `optional_shared_events`.
* Added new options to several `snow` commands:

  + `snow sql`: Added the `--retain-comments` option to support passing comments to Snowflake.
  + `snow object create`: Added the `--replace` and `--if-not-exists` options to support overwriting exist objects.
  + `snow stage copy`: Added the `--recursive` option to support copying local files and subdirectories to a stage, including glob support.
  + `snow app version create`: Added the `--label` option to support adding labels to versions and patches.
  + `snow connection add`: Added the `--no-interactive` option to skip interactive prompts for unspecified parameters.
  + `snow spcs service logs`: Added the following options to improve log retrieval and monitoring:

    - `--since`: Start log retrieval from a specified UTC timestamp.
    - `--include-timestamps`: Include timestamps in log entries for log streaming.
    - `--follow`: Stream logs in real-time.
    - `--follow-interval`: Set custom polling intervals during log streaming.
    - `--previous-logs`: Retrieve logs from the last terminated container.
* The `snow helpers v1-to-v2` command now converts v1 template references to v2 references in Native App artifacts that use the `templates` processor.
* Updated the `snow --info` command to return information about the `SNOWFLAKE_HOME` variable.

### Bug fixes

* Removed the requirement for an existing requirements.txt file for Python code executed with the `snow git execute` command. Previously, the file must have existed, even if empty, for the command to succeed.
* Removed the requirement for needing a privilege to create a table or schema to execute the `snow app version create` command if the schema and table already exist.
* Fixed an issue relating to configuration file updates when the `connection.toml` file exists, no longer incorrectly copying connections from `connections.toml` to `config.toml` files.
* Fixed an issue where the `snow connection generate-jwt` command failed with keys without a passphrase.
* Fixed a Windows permissions error for file created by Snowflake CLI when the owneris part of a custom group with granted default permissions.

## Version 3.1.0 (October 25, 2024)

### Deprecations

* Added a deprecation warning to the `snow spcs service status` and `snow spcs image-repository list-tags` commands. These commands will be removed in a future release.

### New features and updates

* Added the following commands:

  + `snow connection generate-jwt` command to generate JWT token for Snowflake connections.
  + `snow spcs service list-containers` to fetch information about containers in a service.
  + `snow spcs service list-instances` to fetch information about instances in a service.
  + `snow spcs service list-roles` to fetch information about roles in a service.
* Added the `--eai-name` option to the `snow spcs set` command to support updating external access integrations for a service.
* Updated the `snow spcs image-repository list-images` command to displays image tags and digests.

### Bug fixes

* Fixed a bug that caused the `deploy_root`, `bundle_root`, and `generated_root` directories to be created in the current working directory instead of the project root when invoking commands with the `--project` flag from a different directory.
* Aligned variables for the `snow stage` and `snow git execute` commands. For Python files, variables are stripped of leading and trailing quotes.
* Fixed an issue with `snow stage list-files` for paths with directories.

## Version 3.0.2 (October 15, 2024)

### New features and updates

### Bug fixes

* Fixed the handling of empty default values for strings by `snow snowpark deploy`.
* Added log error details if the `pip` command fails

## Version 3.0.1 (October 08, 2024)

### New features and updates

* Migrated the `snowflake-cli-labs` PyPi repository to `snowflake-cli`.

  To install or upgrade the Snowflake CLI, you can execute a command similar to the following:

  ```snowcli
  pip install --upgrade snowflake-cli
  ```

  > **Note:**
  >
  > Snowflake CLI will continue to support using the `snowflake-cli-labs` repository name to give you time to transition existing scripts and applications you might use.

### Bug fixes

* None.

## Version 2.8.2 (October 08, 2024)

### New features and updates

* Migrated the `snowflake-cli-labs` PyPi repository to `snowflake-cli`.

  To install or upgrade the Snowflake CLI, you can execute a command similar to the following:

  ```snowcli
  pip install --upgrade snowflake-cli
  ```

  > **Note:**
  >
  > Snowflake CLI will continue to support using the `snowflake-cli-labs` repository name to give you time to transition existing scripts and applications you might use.

### Bug fixes

* None.

## Version 3.0.0 (October 1, 2024)

### BCR (Behavior Change Release) changes

Beginning with version 3.0.0, Snowflake CLI introduced the following breaking changes:

* Implemented the following Python changes:

  + Dropped support for Python versions below 3.10.
  + Set the default Python version for Snowpark functions and procedures to 3.10.
* Replaced the `snow object stage` commands with `snow stage` commands.
* Replaced the `snow snowpark init` and `snow streamlit init` commands with the `snow init` command.
* Removed previously deprecated options from the `snow snowpark` commands.
* Modified the behavior of the following Snowpark commands:

  + The `snow snowpark build` creates a `.zip` file for each specified artifact that is a directory. Non-Anaconda dependencies are packaged once as `dependencies.zip`.
  + The `snow snowpark deploy` uploads all artifacts created during build step. The `dependencies.zip` file is upload once to every Snowpark stage specified in project definition.
  + The `snow snowpark package` commands no longer fallback to Anaconda Channel metadata when fetching available packages information fails.

    > **Note:**
    >
    > These changes are compatible with V1 project definition files, though the resulting file layout differs.

### New features and updates

* Added the following commands:

  + `snow spcs service execute-job` to support creating and executing a job service in the current schema.
  + `snow app events` to fetch logs and traces from local and customer Snowflake Native App installations.
  + `snow helpers v1-to-v2` to migrate snowflake.yml files from version 1.x to version 2.
* Added support for the following:

  + External access (API integrations and secrets) in Streamlit
  + <% … %> syntax in SQL templates
  + Multiple Streamlit applications in a single `snowflake.yml` project definition file
* Updated the project definition file to version 2.

### Bug fixes

* Fixed an issue with whitespace in the `snow connection add` command.
* Fixed a SQL error that occurred when running the `snow app version create` or `snow app version drop` commands with a version name that isn’t a valid Snowflake unquoted identifier.
* Added a check to verify the correctness of a token file and private key paths when adding a connection.
* Fixed a typo in the `spcs service name` argument description. It is the identifier of the `service` instead of the `service pool`.
* Fixed an issue with error handling and improved messaging when no artifacts are provided.
* Improved error messages for incompatible parameters.

## Version 2.8.1 (September 10, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed an issue where the `git execute` command did not correctly handle upper case in directory names.
* Fixed an issue where the `snow git setup` did note correctly handle fully qualified repository names.
* Fixed the `snow git setup` command behavior in cases where API integration, or a secret with a default name, already exists.
* Fixed an issue where the `snow snowpark package create` command created empty zip files when a package name contained capital letters.

## Version 2.8.0 (August 28, 2024)

### Deprecations

* Added a deprecation warning for the `native_app.package.scripts` property in project definition files.

### New features and updates

* Added support for project definition file defaults in templates.
* Added support for `native_app.package.post_deploy` scripts in project definition files.

  + These scripts execute when a Snowflake Native App package is created or updated.
  + Currently, Snowflake REST APIs supports only SQL scripts: `post_deploy: [{sql_script: script.sql}]`.

### Bug fixes

* Fixed an issue with invalid return values for `snow snowpark list`, `snow snowpark describe`, and `snow snowpark drop` commands.
* The `snow app run` command now shows warning returned by Snowflake.

## Version 2.7.0 (August 2, 2024)

### Deprecations

* The `snow snowpark init` and `snow streamlit init` commands are marked as deprecated. The commands are still functional, but you should use the new `snow init` command instead.

### New features and updates

* Added the `--token-file-path` option for the `snow connection add` command to support passing an OAuth token using a file. The function is also supported by setting the `token_file_path` parameter for connection definitions in the `config.toml` file.
* Added support for Python remote execution with the `snow stage execute` and `snow git execute` similar to existing EXECUTE IMMEDIATE support.
* Added support for autocomplete functionality in `snow connection add --connection` option.
* Added the `snow init` command to support initializing projects with external templates.
* Added support for user stages in the `stage execute` and `stage execute copy` commands.
* Improved support for quoted identifiers in Snowpark commands.
* The `snow app run` command now allows upgrading to an unversioned mode from a versioned or release mode application installation.
* The `snow app teardown` command now allows dropping a package with versions when the `--force` flag is provided.
* The `snow app version create` command now allows operating on application packages created outside Snowflake CLI.
* Updated the `application.post_deploy` SQL script to use the application database as the default.
* Snowflake CLI now supports regionless hosts when generating Snowsight URLs.
* The `snow app run` and `snow app deploy` commands now correctly determine the modified status for large files uploaded to AWS S3.

### Bug fixes

* Handle NULL md5 values correctly when returned by stage storage backends.

## Version 2.6.1 (July 15, 2024)

### New features and updates

* None.

### Bug fixes

* Clarified the error message returned when executing `snow object create` if a database is not defined for the connection.
* Fixed an issue that caused Snowflake CLI to crash when `save_logs` is `false` and the log directory does not exist.

## Version 2.6.0 (July 11, 2024)

### New features and updates

* Added the `snow object create` command.
* Added support for a `title` field in Streamlit definition in the `snowflake.yml` project file.
* Added the `--auto-compress` flag to the `snow stage copy` command to enable gzip compression files during upload.
* Added a new `native_app.application.post_deploy` section to `snowflake.yml` schema to execute actions after the application has been deployed via `snow app run`.

  + Added the `sql_script` hook type to run SQL scripts with template support.
* Added support for `--env` command-line arguments for templating.

  + Available for commands that use the project definition file.
  + Format of the argument: `--env key1=value1 --env key2=value2`.
  + Overrides environment variables values when used in templating.
  + Can be referenced in templating through `ctx.env.<key_name>`.
  + Templating reads environment variables in the following order of priority (highest priority to lowest priority):

    - Variables from the `--env` command-line argument.
    - Variables from shell environment variables.
    - Variables from the `env` section of project definition file.
* The `snow sql` command now show query text before executing it.

### Bug fixes

* Passing a directory to `snow app deploy` now deploys any contained file or subfolder specified in the application’s artifact rules.
* Fixed markup escaping errors in `snow sql` that could occur when users unintentionally use markup-like escape tags.
* Fixed cases where `snow app teardown` could not tear down orphan applications (those that have had their package dropped).
* Fixed cases where `snow app teardown` could leave behind orphan applications if they were not created by Snowflake CLI.
* Fixed cases where `snow app run` could fail to run an existing application whose package was dropped by prompting to drop and recreate the application.
* Improved terminal output sanitization to avoid ASCII escape codes.
* Improved the stage diff output in `snow app` commands
* Hid redundant diffs from the `snow app validate` output.
* Added log information into the file with loaded external plugins.
* Added warnings if users attempt to use templating with project definition version 1.
* Improved the output and format of Pydantic validation errors.
* Improved support for quoted identifiers in Streamlit commands.
* The `snow app run` command no longer overrides debug mode during an application upgrade unless explicitly set in `snowflake.yml`.

## Version 2.5.0 (June 20, 2024)

### New features and updates

* Added the following Snowflake Native App features:

  + Added the `snow app bundle` command that prepares a local folder in the project directory with artifacts to upload to a stage as part of creating a Snowflake Native App.

    Snowflake Native App projects can optionally generate CREATE FUNCTION and CREATE PROCEDURE declarations ins setup scripts from Snowpark Python code that includes decorators (such as `@sproc` and `@udf`).
  + Added the `snow app validate` command that validates the SQL in the setup script of a Snowflake Native App for valid syntax, invalid object references, and best practices.

    - Added the new `native_app.scratch_stage` field to the `snowflake.yml` schema to allow customizing the stage that Snowflake CLI uses to run the validation.
  + Changed the `snow app deploy` and `snow app run` commands to trigger automatic validation of the setup script SQL and to stop uploads if validation fails. Users can override this check by enabling the `--no-validate` parameter for the respective commands.
  + Changed the `snow app version create --patch` command to require an integer patch number, aligning with what Snowflake expects.
* Added the following commands to support notebooks:

  + `snow notebook execute` enables a head-less execution of a Snowflake Notebook.
  + `snow notebook create` creates a Snowflake Notebook from a file on a stage.
* Added templating support for project definition files. Template variables can now be used anywhere in a project defintion file.
* Added the `--default` parameter to the `snow connection add` command to let uses specify a connection as the default.

### Bug fixes

* Fixed error handling for improperly formatted `config.toml` files.
* Fixed ZIP packaging of Snowpark project dependencies containing implicit namespace packages like `snowflake`.
* Deploying functions or procedures with the `--replace` parameter now copies all grants.
* Fixed MFA caching.
* Fixed issues with `DeprecationWarning` and `SyntaxWarning` caused to invalid escape sequences.
* Improved error messages in the `snow spcs image-registry login` when Docker is not installed.
* Improved detection of conflicts between artifact rules for Snowflake Native App projects
* Fixed URL generation for applications, streamlits, and notebooks that use a quoted identifier with spaces.

## Version 2.4.1 (June 12, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed issues related to MFA caching and GCP deployments.

## Version 2.4.0 (May 31, 2024)

### New features and updates

* Added the `--cascade` option to `snow app teardown` command that automatically drops all application objects owned by an application.
* Added external access integration to `snow object` commands.
* Added aliases for `snow object` `list`, `describe`, and `drop` commands for the following:

  + `snow stage` for stages
  + `snow git` for git repository stages
  + `snow streamlit` for Streamlit apps
  + `snow snowpark` for Snowpark Python procedures and functions
  + `snow spcs compute-pool` for compute pools
  + `snow spcs image-repository` for image repositories
  + `snow spcs service` for services
* Added the following support to the `snow sql` command:

  + Works with the `snowflake.yml` file. The variables defined in the new `env` section of `snowflake.yml` can be used to expand templates.
  + Allows executing queries from multiple files by specifying multiple `-f/--file` options.
* Added support for passing input variables to the `snow git execute` and `snow stage execute` commands.
* Added the following `snow cortex` commands to support [Snowflake AI and ML](../../guides-overview-ai-features.md):

  + `complete`: Generates a response to a question using your choice of language model.
  + `extract-answer`: Extracts an answer to a given question from a text document.
  + `sentiment`: Returns a sentiment score for the given English-language input text.
  + `summarize`: Summarizes the given English-language input text.
  + `translate`: Translates text from the indicated or detected source language to a target language.
* Added tab-completion for `snow` commands.
* Added the following improvements:

  + Executing the `snow` command with no arguments or options now automatically displays the command-line help (as in `snow --help`).
  + Improved support for quoted identifiers.

### Bug fixes

* Fixed an issue with creating patches with `snow app version create` when a version had two or more existing.
* Added a trailing newline when using `--format=json` to avoid `%` being added by some terminals to signal no newline at the end of output.
* Enabled the `--interactive` option by default in interactive environments and added the `--no-interactive` option to disable prompting.

## Version 2.3.1 (May 20, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed bugs in the source artifact mapping logic for Snowflake Native Apps.

## Version 2.3.0 (May 15, 2024)

### New features and updates

* Added the `--info` option for the `snow command` to display the configured feature flags.
* Added the `-D/--variable` option to the `snow sql` command to support variable substitutions in SQL input (client-side query templating).
* Added support for full-qualified stage names in `snow stage` and `snow git execute` commands.
* Added the ability to specify files and directories as arguments for the `snow app deploy <some-file> <some-dir>` command.
* Added new options to the `snow app deploy` command:

  + `--recursive` to sync all files and subdirectories recursively.
  + `--prune` to delete specified files from the stage if they don’t exist locally.
* Optimized the Snowpark dependency search to reduce the size of `.zip` artifacts and the number of Anaconda dependencies for Snowpark projects.
* Improved error messages for a corrupted `config.toml` file.

### Bug fixes

* Fixed an issue with the `snow app` commands that cause files to be re-uploaded unnecessarily.
* Fixed an issue where the `snow app run` command did not upgrade an application when the local state and remote stage are identical.
* Fixed an issue with handling the stage pat separators on Windows.

## Version 2.2.0 (April 25, 2024)

### Deprecated features

> **Note:**
>
> The following features are deprecated in this version and will be removed when Snowflake releases Snowflake CLI 3.0.0. Please consider updating any existing scripts that use these deprecated features.

* The `snow snowpark package lookup` command no longer performs a check against PyPi. Using `--pypi-download` or `--yes` has no effect and causes a warning. The command now only checks whether a package is available in the Snowflake Anaconda channel.
* `snow snowpark package create` changes:

  + The `--pypi-download` or `--yes` options are deprecated, have no effect, and cause a warning. The command now always checks against PyPi.
  + The `--allow-native-libraries` option is deprecated in favor of the Boolean `--allow-shared-libraries` option. Using the deprecated option causes a warning.
* `snow snowpark build` changes:

  + The `--pypi-download` option is deprecated, has no effect, and causes a warning. The command now always checks against PyPi.
  + The `--check-anaconda-for-pypi-depts` option is deprecated and causes a warning. Use the `--ignore-anaconda` option instead.
  + The `--package-native-libraries` option is deprecated and causes a warning. Use the `--allow-shared-libraries` option instead.
* The `snow object stage` commands are deprecated and causes a warning. These commands are replaced with `snow stage` commands. Please consider migrating any existing scripts that use the `snow object stage` commands.

### New features and updates

* Added support for fully qualified names (`database.schema.name`) in the Streamlit project definition `name` parameter.
* Added support for fully qualified image repository names in `spcs image-repository` commands.
* Added the `--if-not-exists option` option to the `snow spcs service create` and `snow spcs compute-pool create` commands.
* Added the `--replace` and `--if-not-exists` options for the `snow spcs image-repository create` command.
* Added support for Snowflake Connector for Python diagnostic reports.
* Added the `snow app deploy` command that creates an application package and syncs the local changes to the stage without creating or updating the application.
* Added the `is_default` column to the `snow connection list` output to highlight the default connection.
* Updated the `snow snowpark package create` command:

  + Added the `--ignore-anaconda` option to disable package lookup in the Snowflake Anaconda channel, so dependencies are downloaded from PyPi.
  + Added the `--skip-version-check` option to skip comparing versions of dependencies between requirements and Anaconda.
  + Added the `--index-url` option to set the base URL of the Python Package Index to use for package lookup.
* Updated the `snow snowpark build` command:

  + Added the `--skip-version-check` option to skip comparing versions of dependencies between requirements and Anaconda.
  + Added the `--index-url` option set up the base URL of the Python Package Index to use for package lookup.
* Added the `--recursive` option to the `snow stage copy` command to reproduce the directory structure locally when copying from a stage.
* Added the following `snow git` commands to support for Git repositories in Snowflake:

  + `snow git setup`: Sets up a Git repository stage and creates all necessary objects.
  + `snow git fetch`: Fetches latest changes from the origin repository into a Snowflake repository.
  + `snow git list-branches`: Lists all branches in a repository.
  + `snow git list-tags`: Lists all tags in a repository.
  + `snow git list-files`: Lists all files on a specified branch, tag, or commit.
  + `snow git copy`: Copies files from a specified branch, tag, or commit into a stage or local directory.
  + `snow git execute`: Runs the SQL EXECUTE IMMEDIATE command for files in a repository.
* Added the `snow stage execute` command to run the SQL EXECUTE IMMEDIATE command from a stage path.
* Added the `--pattern` option to the `snow stage list-files` command to support filtering results with regex.
* Added support for any source supported by `pip` in `snow snowpark` commands.
* Added the ability to fetch available packages list from Snowflake instead of directly from Anaconda with fallback to the old method (for backward compatibility). As the new approach requires a connection to Snowflake, it adds connection options to the following commands:

  + `snow snowpark build`
  + `snow snowpark package lookup`
  + `snow snowpark package create`

### Bug fixes

* Added the `--image-name` option for the image name argument in the `spcs image-repository list-tags` command for consistency with other commands.
* Fixed an issue where `spcs image-registry login` errors were not formatted correctly.
* Project definitions no longer accept extra fields. Any extra fields cause an error.
* Fixed an issue with empty zip files for Snowpark build paths for builds that used the `--project` option.
* Improved error messages for the `snow snowpark build` command.
* Fixed version parsing for packages lookup on the Snowflake Anaconda channel.
* Fixed an issue with handling database, schema, and role identifiers containing dashes.
* Fixed a schema override bug in th `snow connection test` command.
* Due to a problem with Windows OSes, Snowflake CLI doesn’t show warnings when config file permissions are too wide for Windows systems.
* Improved `snow connection test` error messages when a role, warehouse, or database does not exist.

## Version 2.1.2 (March 27, 2024)

### New features and updates

* Added `pip` as a Snowflake CLI dependency.
* Optimized the `snow connection test` command.

### Bug fixes

* Fixed an issue with creating virtual environments in the `snow snowpark package create` and `snow snowpark build` commands.

## Version 2.1.1 (March 20, 2024)

### New features and updates

* Initial public release

### Bug fixes

* None.

---
title: Snowflake CLI release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowflake-cli-2025.md
section: Release Notes
---

# Snowflake CLI release notes for 2025

This article contains the release notes for the Snowflake CLI, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) for documentation.

## Version 3.14.0 (Dec 09, 2025)

### New features and updates

* Updated the `snow streamlit deploy` command to use the updated CREATE STREAMLIT syntax (FROM *source_location*) instead of the deprecated syntax (ROOT_LOCATION = ‘<stage_path_and_root_directory>’).

  > **Note:**
  >
  > The deprecated syntax is still supported, but Snowflake recommends using the new syntax for better clarity and consistency. You can use the `snow streamlit deploy --legacy` option to continue using the deprecated syntax.

### Bug fixes

* None.

## Version 3.13.1 (Dec 02, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with parsing the `--vars` values provided to `snow dbt execute` subcommands. This fix allows you to pass variables the same way as you would to the dbt CLI, such as `--vars '{"key": "value"}`’.

## Version 3.13.0 (Nov 03, 2025)

### New features and updates

* Added the `--decimal-precision` global option to allow setting arbitrary precision for Python’s `Decimal` type.
* Added support for the `auto_suspend_secs` parameter in SPCS service commands (`deploy`, `set`, `unset`) to configure automatic service suspension after a period of inactivity.
* Added the `snow dbt describe` and `snow dbt drop` commands.
* Added the `snow dbt execute ... retry` subcommand.
* Added the following `snow dbt deploy` command options:

  > + `--default-target` to set a default target.
  > + `--unset-default-target` to clear the default target.
  > + `--external-access-integration` to set external access integrations (needed to pull external dependencies for altering a dbt project object).
  > + `--install-local-deps` to install dependencies located in the project.
* Added support for running Streamlit apps on SPCS runtime.
* Added grant privileges definitions to the Streamlit `snowflake.yml` file.
* Updated snowflake-connector-python to version 3.18.0.
* Relaxed dbt `profiles.yml` validation rules; added extra validation for role specified in `profiles.yml`.

### Bug fixes

* None.

## Version 3.12.0 (Sep 24, 2025)

### New features and updates

* Added the `!edit` command to the `snow sql` command to support external editors.
* Added the `--partial` option to the `snow logs` command to support partial, case-insensitive matching of log messages.
* Improved parsing `!source` with trailing comments.
* Upgraded to `typer=0.17.3` to improve the display of help messages.
* Improved output handling with streaming queries in the `snow sql` command.

### Bug fixes

* Fixed crashes with older x86_64 Intel CPUs.
* Fixed the `!` commands in `snow sql` commands so they no longer require a trailing `;` for evaluation.
* Fixed using `ctx.var` in `snow sql` with Jinja templating.
* Fixed issues when pasting content with trailing new lines.
* Fixed an issue with `snow snowpark deploy` failing on duplicated packages.
* Fixed an issue causing a `snow spcs logs` `IndexOutOfRange` error.

## Version 3.11.0 (Aug 25, 2025)

### New features and updates

* Added the `snow connection remove` command.
* Added support for the `runtime_environment_version` field in notebook entity configurations to let you specify runtime environment version for containerized notebooks.
* Added the `snow auth oidc` commands for managing workload identity federation authentication:

  + `snow auth oidc read-token` to read and display OIDC tokens from CI/CD environments.

  Also included GitHub Actions provider support in these commands for password-less authentication in CI/CD pipelines.

### Bug fixes

* None.

## Version 3.10.1 (Aug 15, 2025)

### New features and updates

* None

### Bug fixes

* Fixed `snow dbt deploy` command to properly handle fully qualified names.
* Fixed `snow dbt deploy` command to properly handle local directories with dots in names.

## Version 3.10.0 (July 17, 2025)

### Deprecations

* This version deprecates the Snowpark processor in the Snowflake Native App Framework.

### New features and updates

* Added support for passing an OAuth token with the `--token` option.
* Added the ability to suppress new Snowflake CLI version messages.
* Added the following new `--format` options for outputting data:

  + `CSV`, which formats query output as CSV.
  + `JSON_EXT`, which outputs JSON as JSON objects instead of strings.
* Added the `--enabled_templating` option for the `snow sql` command that lets you specify which of the following templates to use when resolving variables:

  + Standard (`<% ... %>`), enabled by default.
  + Legacy (`&{ ... }`), enabled by default.
  + Jinja (`{{ ... }}`), disabled by default.
* Added a `packages` alias for `artifact_repository_packages` in the `snowflake.yml` schema.
* Added the `snow stage copy @src_stage @dst_stage` command for copying files directly between two named stages.
* Added support for the DBT `deploy`, `execute`, and `list` commands.

### Bug fixes

* Fixed an issue where the `snow sql` command would fail when `snowflake.yml` is invalid and the query has no templating.
* Fixed an issue with JSON serialzation for the `Decimal`, `time`, and `binary` data types.

## Version 3.9.1 (June 09, 2025)

### New features and updates

* Added the `--private-link` option to `snow spcs image-registry login` command to log in using private link URLs.

### Bug fixes

* None.

## Version 3.9.0 (May 29, 2025)

### New features and updates

* Added the `--encryption` option to the `snow stage create` command to define the type of encryption to use for all files on the stage.

### Bug fixes

* Fixed errors that occurred for `use` commands if the current database is not set.

## Version 3.8.3 (May 22, 2025)

### New features and updates

* None

### Bug fixes

* Added the `--private-link` option to the `snow spcs image-registry url` command for retrieving private link URLs.

## Version 3.8.2 (May 21, 2025)

### New features and updates

* None

### Bug fixes

* Changed the `enable_release_channels` property default from `False` to None.

## Version 3.8.1 (May 20, 2025)

### New features and updates

* None

### Bug fixes

* The upgrade message is now sent to `stderr`.
* Fixed a `snowflake.core` import issue on newer Python versions.

## Version 3.8.0 (May 16, 2025)

### New features and updates

* Added support for OAuth tokens.
* Added the following enhancements to the `snow sql` command:

  + Added an interactive mode.
  + Added support for asynchronous SQL queries.
  + Added support for the `!queries`, `!result`, and `!abort` SQL query commands.
  + Added the `--single-transaction` command-line option to execute multiple SQL queries as an all-or-nothing batch, ensuring that all commands complete successfully before any of the changes are committed.
  + Added the `artifact_repository` field to the Snowpark Entity Model to support using non-anaconda packages.

### Bug fixes

* Fixed an issue with deploying Snowpark project using the `!=` operator in `requirements.txt`.
* Fixed an issue with escaping identifiers for `use` commands.
* Moved the `enable_release_channels` parameter from the global level to the project level.
* Fixed the `snow spcs service metrics` command to accept fully qualified service names.

## Version 3.7.2 (May 12, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with errors appearing in help messages.

## Version 3.7.1 (April 28, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed certificate connection issues.
* Fixed a `snow spcs image-registry` login slow query problem.

## Version 3.7.0 (April 16, 2025)

### New features and updates

* Added the `--prune` option to the `snow notebook deploy`, `snow snowpark deploy`, and `snow streamlit deploy` commands that removes files that exist in the stage, but not in the local filesystem.
* Added the `snow logs` command for retrieving and streaming logs from the server.
* Added the `snow helper check-snowsql-env-vars` that reports environment variables from SnowSQL with their Snowflake CLI replacements.

### Bug fixes

* Updated the MacOS post-install script to update the `PATH` environment variable, if needed, to ensure the `snow` command is available.

## Version 3.6.0 (April 2, 2025)

### New features and updates

* Added support for the `!source` command in SQL queries to allow executing SQL from local files.

### Bug fixes

* Fixed an issue with incompatible options in `snow spcs compute-pool` commands that didn’t raise error.
* Changed binary builds to embed the whole Python environment.
* Fixed recursive copying to a stage for unbalanced directory trees.
* Fixed checking for a new Snowflake CLI version.
* Added file execution logs in `snow stage` and `snow git` commands.

## Version 3.5.0 (March 10, 2025)

### New features and updates

* Extended project definition (`snowflake.yml`) support for the following SPCS (Snowpark Container Services) entities:

  + Compute pool
  + Image repository
  + Service
* Added the `snow spcs compute pool deploy` command that reads a `snowflake.yml` project definition file.
* Added the `snow spcs image repository deploy` command that reads a `snowflake.yml` project definition file.
* Added the `snow spcs service deploy` command that reads a `snowflake.yml` project definition file.

### Bug fixes

* Fixed an issue with data type handling in the `snow sql` command when using JSON for the output format.

## Version 3.4.0 (February 13, 2025)

### New features and updates

* Added the optional `stage_subdirectory` field to the application package entity.
  When this value is specified, application artifacts are uploaded to this subdirectory instead of to the root of the application package’s stage.
* Added the following `snow spcs service` commands:

  + `snow spcs service events` retrieves service-specific events.
  + `snow spcs service metrics` fetches service metrics.
* Added the following `snow app release-directive` commands:

  + `snow app release-directive add-accounts` adds accounts to a release directive.
  + `snow app release-directive remove-accounts` removes accounts from a release directive.
* Added the `snow app release-channel set-accounts` command to set accounts for release channels.
* Added the `--force-replace` option to the `snow snowpark deploy` command to replace entities even if no changes are detected.
* Added the following notebook functionality:

  + Added the `snow notebook deploy` command that allows the creation of a notebook using a local file.
  + Added support for containerized notebooks.
  + Added `notebook` to the supported object types for the `snow object` commands.
* Added support for glob patterns (except `**)` in artifacts paths in Streamlit and Snowpark `snowflake.yyml` files.

  > **Note:**
  >
  > Using glob patterns in Snowpark `snowflake.yml` files requires enabling the ENABLE_SNOWPARK_GLOB_SUPPORT feature flag.
* Added support for the Mac OS x86_64 architecture.

### Bug fixes

* Fixed an MFA caching issue in the Snowflake CLI binary installation files.
* Fixed an auto-completion issue in the Snowflake CLI binary installation files.

## Version 3.3.0 (January 21, 2025)

> **Note:**
>
> On January 28, 2025, Snowflake updated the documentation for the `snow add release channel` commands to indicate that the feature is in Public Preview instead of General Availability.

### New features and updates

* Added the following Snowflake Native Apps features and updates:

  + Added the following commands to support release directives:

    - `snow app release-directive list`
    - `snow app release-directive set`
    - `snow app release-directive unset`
  + Added support for release channels, including the following:

    - Added support release channels in the `snow app version create` and `snow app version drop` commands.
    - Added the ability to specify a release channel when creating an application instance from a release directive (`snow app run --from-release-directive --channel=<channel>`).
    - Added the `snow app release-channel list` to list available release channels.
    - Added the `now app release-channel add-accounts` and `snow app release-channel remove-accounts` commands to support adding and removing accounts from release channels.
    - Added the `snow app release-channel add-version` and `snow app release-channel remove-version` commands to add versions to and remove versions from release channels.
  + Added the `snow app publish` command to simplify publishing versions to release channels and to update release directives.
  + Made the following changes to the `snow app version create` command:

    - The command now returns the version, patch, and label in JSON format.
    - Added the `--from-stage` option to allow version creation from the content of a stage without needing to re-synchronize to the stage.
* Added the `snow helpers import-snowsql-connections` command to import connections from existing SnowSQL configurations.
* Added support for restricting user access to Snowflake CLI only. For more information, see [Add an authentication policy that limits access to Snowflake CLI only](../../developer-guide/snowflake-cli/connecting/configure-cli.md).

### Bug fixes

* Fixed the inability to add patches to lowercase quoted versions.
* Fixed an issue with setting label to blank instead of `None` when not provided.
* Fixed the `snow connection generate-jwt` command to preserve command-line connection options.
* Fixed stage path handling for notebook commands.

---
title: Snowflake CLI release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowflake-cli-2026.md
section: Release Notes
---

# Snowflake CLI release notes for 2026

This article contains the release notes for the Snowflake CLI, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake CLI](../../developer-guide/snowflake-cli/index.md) for documentation.

## Version 3.16.0 (Mar 19, 2026)

### New features and updates

* Added support for DCM commands in preview.
* Added the `--in-account` option to list commands (for example, `snow object list`, `snow stage list`). This option lists all objects of a given type in the account. Cannot be used together with the `--in` option.
* Added the **experimental** command `snow spcs service build-image` to build container images using an SPCS service. The command uploads the local build context to a stage, executes a build job, and streams logs in real time until completion. This command is experimental and subject to change.
* Added the `--async` option to the `snow spcs service execute-job` command to execute job services asynchronously without waiting for completion.
* Added the `--replicas` option to the `snow spcs service execute-job` command to specify the number of job replicas to run.
* Added the `--dbt-version` option to the `snow dbt deploy` and `snow dbt execute` commands. This option sets the dbt Core version on a dbt project object (`snow dbt deploy` command) or executes a dbt command on a specific dbt Core version without altering the dbt object (`snow dbt execute` commands).
* All authenticators (including `snowflake-jwt`, `username_password_mfa`, and `workload_identity`) are now case-insensitive.
* Changed how the fully qualified names for temporary stages are established for `snow dbt deploy`. The database and schema from the dbt project object’s fully qualified name now take precedence over those from the session.

### Bug fixes

* Fixed `snow stage copy --recursive` dropping database and schema qualifiers from fully qualified stage names, which caused the command to resolve stages against the connection’s default database instead of the one specified in the FQN.
* Fixed `snow streamlit deploy --prune` failing with an incorrect stage path format for Streamlit entities using versioned deployment. The `snow://` prefix is now correctly preserved through all stage path operations.
* Fixed a bug with `snow dbt deploy` where the dbt project uploaded files first and updated project properties afterward. This could cause deploys to fail if, for example, the project lacked external access integrations and dependencies were specified.
* Fixed the `snow stage copy` and `snow stage put` commands failing when a local directory path contains glob special characters (such as, square brackets in [id] or [slug]). The path is now escaped before glob expansion, so literal directory names are matched correctly.

## Version 3.15.0 (Feb 03, 2026)

### New features and updates

* Added the `--if-exists` option to the `snow object drop` command and object-specific drop commands (for example, `snow stage drop`) to drop objects only if they exist, preventing errors when dropping non-existent objects.
* Updated the project definition with supported Python versions aligned with `snowflake-connector-python`.

### Bug fixes

* Fixed git repository path parsing to allow quotes around both repository and branch names (such as `@"example-repo"/branches/"feature/branch"/*`).
* Fixed external browser authentication (`EXTERNALBROWSER`) for headless systems.

---
title: Snowflake Connector for Google Analytics Aggregate Data release notes
source: https://docs.snowflake.com/en/release-notes/connectors/gaad.md
section: Release Notes
---

# Snowflake Connector for Google Analytics Aggregate Data release notes

This topic provides release notes for the Snowflake Connector for Google Analytics Aggregate Data. For additional
information, see [Snowflake Connector for Google Analytics Aggregate Data](../../connectors/google/gaad/gaad-connector-about.md).

## Version 2.2.2 (December 3rd, 2025)

### Bug fixes

* Fixed an issue where the report start date was calculated incorrectly when report ingestion exceeded 2 hours.

## Version 2.2.1 (November 7th, 2024)

### Behavior changes

* Event sharing is now mandatory for all event types.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 2.2.0 (October 29th, 2024)

### Behavior changes

* Revoked the USAGE privilege on the `STATE` schema from the ADMIN application role.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 2.1.2 (October 16th, 2024)

### Bug fixes

* The `IMPORT_STATE` procedure now grants `SELECT` privilege to the application roles `ADMIN` and `DATA_READER`.

## Version 2.1.1 (October 16th, 2024)

### Bug fixes

* Export process failed. Scoped temporary tables could not be created in the destination schema.

## Version 2.1.0 (October 16th, 2024)

### Behavior changes

* The connector creates additional tables in destination schema. The tables are used to store the configuration of the connector. The tables have _SFSDKEXPORT_V1 suffix.

### New features

* IMPORT_STATE procedure was added. The procedure can be used to recover a configuration of the reports, schedules, and history of the ingestions after the connector was uninstalled.

### Bug fixes

Not applicable.

## Version 2.0.0 (September 16th, 2024)

### Behavior changes

* The connector requires all configured identifiers to be quoted based on the
  [identifier requirements](../../sql-reference/identifiers-syntax.md).

### New features

* Report tables have `change_tracking` enabled.
* You can now reset the connector’s configuration before the configuration is finalized using the `RESET_CONFIGURATION` procedure.
* You can now recover a connector in the `ERROR`, `PAUSING`, or `STARTING` state using the `RECOVER_CONNECTOR_STATE` procedure.

### Bug fixes

Not applicable.

## Version 1.5.0 (July 22nd, 2024)

### Behavior changes

Not applicable.

### New features

* Reports can now be configured to avoid sampling by shortening ingestion interval length.

### Bug fixes

* Corrected handling of quoted identifiers in the GET_TROUBLESHOOTING_DATA procedure.
  See also [Troubleshooting](../../connectors/google/gaad/gaad-connector-troubleshooting.md).

## Version 1.4.0 (June 28th, 2024)

### Behavior changes

* Sampling metadata is available in the CONNECTOR_STATS view.
* The connector saves INGESTION_RUN_ID in tables with report data.

### New features

Not applicable.

### Bug fixes

* To avoid ingestion timeouts, existing task timeouts have been increased to 4 hours.
* Worker tasks can be rescheduled to avoid timeouts during ingestion.

## Version 1.3.1 (June 26th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Timeout for new tasks has been increased to 4 hours to prevent timeouts during ingestion.

## Version 1.3.0 (June 19th, 2024)

### Behavior changes

The minimal interval used during ingestion runs has been reduced to one day.

### New features

* Added the `UPDATE_WAREHOUSE` procedure.

### Bug fixes

The `CONFIGURE_REPORT` procedure can now be called in parallel.

## Version 1.2.0 (May 24th, 2024)

### Behavior changes

Not applicable.

### New features

* Added the healthcheck task to all connector instances.

### Bug fixes

Not applicable.

## Version 1.1.1 (May 21st, 2024)

### Behavior changes

Not applicable.

### New features

* Added the `UPDATE_CONNECTION_CONFIGURATION` procedure.

### Bug fixes

Not applicable.

## Version 1.0.1 (May 13th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed issue where the connector could enter an inconsistent state during pausing or resuming.

## Version 1.0.0 (April 29th, 2024)

### Behavior changes

* The CONFIGURE_REPORT procedure returns validation errors instead of throwing them.
* The connector will wait for 30 seconds before retrying ingestion after a 502 error.
* Worker tasks will not be suspended due to errors.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.25.0 (April 24th, 2024)

### Behavior changes

* The connector tries to avoid 502 errors by reducing the date range for large reports.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.24.0 (April 9th, 2024)

### Behavior changes

* The `CONNECTOR_EXECUTION_LOG` table is no longer visible to users.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.23.1 (April 4th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed start date calculation (in some cases, the start date could be set
  after the end date due to timezone differences).

## Version 0.23.0 (April 3rd, 2024)

### Behavior changes

* Date ranges for initial and incremental loads are split into 31-day intervals to
  reduce the number of 502 API errors.

### New features

Not applicable.

### Bug fixes

* Added metadata column to the `PUBLIC.CONNECTOR_STATS` view and to the
  result of `GET_TROUBLESHOOTING_DATA` procedure.

## Version 0.22.0 (March 26th, 2024)

### Behavior changes

Not applicable.

### New features

* Added the `GET_TROUBLESHOOTING_DATA` procedure

### Bug fixes

Not applicable.

## Version 0.21.1 (March 20th, 2024)

### Behavior changes

* Added request for the `EXECUTE MANAGED TASK` privilege to the connector’s manifest.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.21.0 (March 19th, 2024)

### Behavior changes

* The `CONNECTOR_EXECUTION_LOG` table is deprecated and will be removed in the future.

### New features

* Added more details to the `CONNECTOR_ERRORS` view, including the error reason.

### Bug fixes

* The `CONNECTOR_CONFIGURATION`, `CONNECTOR_STATS`, and `CONNECTOR_ERRORS` views are now visible to the `VIEWER` application role.

## Version 0.20.0 (March 14th, 2024)

### Behavior changes

* Internal changes only.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.19.0 (March 12th, 2024)

### Behavior changes

Not applicable.

### New features

* Added alerting handling. Procedures `DISABLE_ALERTS` and `CONFIGURE_ALERTS` were added.

### Bug fixes

Not applicable.

## Version 0.18.0 (March 7th, 2024)

### Behavior changes

Not applicable.

### New features

* Generated IDs for all CONNECTOR_EXECUTION_LOG entries.

### Bug fixes

* Fixed initial load start date calculation.

## Version 0.17.0 (February 22nd, 2024)

### Behavior changes

* The retry count for GA API requests was reduced to 1, and removed retries for 502 errors.

### New features

* Added ID column to CONNECTOR_EXECUTION_LOG. The column will be populated for new entries.

### Bug fixes

Not applicable.

## Version 0.16.0 (February 16th, 2024)

### Behavior changes

Not applicable.

### New features

* Added views containing basic ingestion statistics - `CONNECTOR_STATS` and `AGGREGATED_CONNECTOR_STATS`.
* Added a view containing ingestion errors - `CONNECTOR_ERRORS`.
* Added a procedure that immediately triggers ingestion for a given report - `INGEST_NOW(<report name>)`.
* Added a function to retrieve dimensions and metrics for a given property - `GET_DIMENSIONS_AND_METRICS(<property id>)`.

### Bug fixes

Not applicable.

## Version 0.15.0 (February 1st, 2024)

### Behavior changes

* Date ranges for initial ingestion are split into 6-month intervals. This reduces the risk of initial load failing for reports with a distant start date.
* Updated links to the documentation in prerequisites and README file.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 0.14.0 (January 29th, 2024)

### Behavior changes

Initial version.

### New features

Not applicable.

### Bug fixes

Not applicable.

---
title: Snowflake Connector for Google Analytics Raw Data release notes
source: https://docs.snowflake.com/en/release-notes/connectors/gard.md
section: Release Notes
---

# Snowflake Connector for Google Analytics Raw Data release notes

This topic provides release notes for the Snowflake Connector for Google Analytics Raw Data.

For additional information, see [Snowflake Connector for Google Analytics Raw Data](../../connectors/google/gard/gard-connector-about.md).

## Version 2.11.2 (August 19, 2025)

### Behavior changes

Not applicable.

### New features

Improved the logging for the connector operations. This improved logging includes the following more detailed information about BigQuery
streaming downloads:

* Download progress percentage
* Throttling information
* Amount of data downloaded in each batch

These improvements should help with troubleshooting streaming downloads that failed or ingestions that are stuck.

### Bug fixes

Not applicable.

## Version 2.11.1 (March 17, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Fixed an issue where the connector was unable to ingest data from Google Analytics with the `QUOTED_IDENTIFIERS_IGNORE_CASE` account
parameter set to `true`.

## Version 2.11.0 (February 13, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Restored support for the `USERS` and `PSEUDONYMOUS_USERS` export types.

## Version 2.10.0 (January 23, 2025)

### Behavior changes

Not applicable.

### New features

Added support for ingesting data by using the `FRESH_DAILY` export type:

* By default, the `FRESH_DAILY` export type is disabled. To enable it, call the `ENABLE_PROPERTIES` stored procedure. For details,
  see [Enabling or disabling the ingestion of a property](../../connectors/google/gard/gard-connector-setting-up-data.md).
* You can’t disable auto reloading data for the `FRESH_DAILY` export type. For more information,
  see [Updating data ingestion options](../../connectors/google/gard/gard-connector-managing.md).

### Bug fixes

Not applicable.

## Version 2.9.1 (January 15, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* The `USERS` and `PSEUDONYMOUS_USERS` export types, which in some cases caused the connector to stop responding when they were defined,
  are no longer defined in these cases.

## Version 2.9.0 (January 7, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue in the sink view where some columns were displayed multiple times.
* To reduce ingestion errors, Google OAuth2 security credentials are now refreshed more frequently.
* To optimize the performance of ingestion, Snowflake now limits the number of worker tasks that can use BigQuery data exports.

## Version 2.8.0 (December 3, 2024)

### Behavior changes

Not applicable.

### New features

* Migrated to telemetry v2.

### Bug fixes

* Data stream ingestions, which failed due to the `Out of Memory` errors, are now retried sooner.

## Version 2.7.1 (November 19, 2024)

### Behavior changes

Not applicable.

### New features

* Improved ingestion scalability.

### Bug fixes

Not applicable.

## Version 2.7.0 (October 31, 2024)

### Behavior changes

Not applicable.

### New features

* Added support for combinations of `DAILY`, `INTRADAY`, `USERS`, and `PSEUDONYMOUS_USERS` export types.
* Added support for multi-cluster warehouses.
* Performance and stability improvements.

### Bug fixes

Not applicable.

## Version 2.5.0 (October 24, 2024)

### Behavior changes

Not applicable.

### New features

* Historical data ingestion for new properties ingests in reverse chronological order, starting from the date on which the Google Analytics property was enabled, while the current data is ingested in parallel.

### Bug fixes

Not applicable.

## Version 2.4.0 (October 21, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 2.3.0 (October 7, 2024)

### Behavior changes

Not applicable.

### New features

Reloads of user data tables are now scheduled automatically after 72 hours.

### Bug fixes

* To prevent failures for a large number of properties, we’ve increased the timeout period
  on the view refresher task from 1 hour to 23 hours.
* Fixed issues related to identifier migration.
* Fixed the state of properties that got suspended due to an issue related to a race condition between
  refreshing the sink table views and disabling inaccessible Google Analytics properties.

## Version 2.2.1 (September 30, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Updated to Snowpark version 1.13.2 to address `ERROR_ON_NONDETERMINISTIC_UPDATE` error.

## Version 2.2.0 (September 27th, 2024)

### Behavior changes

Not applicable.

### New features

* Connectors now support the ingestion of the `USERS` and `PSEUDONYMOUS_USERS` export types.

### Bug fixes

* Fixed a race condition between the property cleaner and the view refresher.

## Version 2.1.0 (September 17th, 2024)

### Behavior changes

Not applicable.

### New features

* Added the `RESET_CONFIGURATION` procedure.

### Bug fixes

* Corrected missing parameters on dispatcher subprocess tasks.

## Version 2.0.0 (September 3rd, 2024)

### Behavior changes

Not applicable.

### New features

* Added support for identifiers in a worksheet format.

### Bug fixes

Not applicable.

## Version 1.8.2 (August 30th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fix for issue causing PAUSE and RESUME procedures to fail.

## Version 1.8.0 (August 27th, 2024)

Internal updates only.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 1.7.2 (August 19th, 2024)

### Behavior changes

Not applicable.

### New features

* Added flattened `event_params` and `user_properties` columns in the sink table views.
* Enabled change tracking on sink tables.
* Sink table views are now refreshed with the copy grants statement.

### Bug fixes

* Application upgrade fix for certain customers.

## Version 1.6.6 (August 12th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Application upgrade fix for certain customers

## Version 1.6.3 (August 5th, 2024)

### Behavior changes

Not applicable.

### New features

* Sink table views are now refreshed automatically.
* Data is now synced sooner for timezones ahead of UTC.
* Improved scalability of scheduling ingestions for large number of properties.

### Bug fixes

Not applicable.

## Version 1.5.2 (July 18th, 2024)

### Behavior changes

Not applicable.

### New features

* New REFRESH_VIEWS procedure, which allows recreating flattened data view.

### Bug fixes

Not applicable.

## Version 1.4.1 (July 8th, 2024)

### Behavior changes

Not applicable.

### New features

* Reloads are now scheduled automatically for all properties.
* Added [UPDATE_INGESTION_OPTIONS](../../connectors/google/gard/gard-connector-managing.md) procedure
  which allows customizing the ingestion settings for certain properties supporting two parameters:

  > + EXCLUDE_NULLS - do not ingest fields containing nulls values, which can increase performance.
  > + DISABLE_AUTO_RELOADS - disable the auto reload mechanism for certain properties.

### Bug fixes

Not applicable.

## Version 1.3.0 (June 20th, 2024)

### Behavior changes

Not applicable.

### New features

* This release introduces the “reload property” feature. There are three new procedures for triggering reload:

  > + `RELOAD_PROPERTY('<property id>')`
  > + `RELOAD_PROPERTY('<property id>', <first date>, <last date>)`
  > + `RELOAD_PROPERTY('<property id>', '<export type>', <first date>, <last date>)`
* There is one new procedure for canceling ongoing reload:

  > + `CANCEL_RELOAD_PROPERTY('<load id>')`
* And a new view for observing ongoing reload:

  > + `PUBLIC.ONGOING_RELOADS`

### Bug fixes

Not applicable.

## Version 1.2.0 (May 28th, 2024)

### Behavior changes

* Ingestion tasks are scaled based on warehouse size. Ingestion time for
  larger warehouses should be decreased.
* Additional optimizations in ingestion and ingestion scheduling.
  These updates could result in slightly lower credit consumption, as well as increased ingestion throughput.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 1.1.0 (May 22nd, 2024)

### Behavior changes

* Disabling property which is ingesting incremental intraday data will remove
  currently ingested day if ingestion was not fully completed.
* Introduced managed task to check and report its health back to Snowflake for
  Connectors installed before version 1.0.0. See the [Snowflake Connector for
  Google Analytics Raw Data health check cost](../../connectors/google/gard/gard-connector-pricing.md) for details.

### New features

Not applicable.

### Bug fixes

* Fixed issue with Pausing/Resuming Connector which left Connector state in intermediate state `PAUSING/ STARTING`.

## Version 1.0.1 (May 14th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Dispatcher task was adjusted to never automatically suspend. See [SUSPEND_TASK_AFTER_NUM_FAILURES](../../sql-reference/parameters.md) for details.
* Ingestion worker tasks have prolonged timeout to 6h hours. This will override account level parameter settings.

## Version 1.0.0 (May 7th, 2024)

### Behavior changes

* Logging level is now set to INFO which should significantly decrease the
  amount of entries into the event table.
* The connector now runs a small, managed task to check and report its health
  back to Snowflake. See the [Snowflake Connector for Google Analytics Raw Data health check cost](../../connectors/google/gard/gard-connector-pricing.md)
  for details.

### New features

* New procedure `PUBLIC.UPDATE_CONNECTION` allows re-authenticating a running
  connector by providing a new set of external access and secret objects. See
  [Re-authentication of the Connector](../../connectors/google/gard/gard-connector-managing.md)
  for details.
* Re-installing the connector, and configuring it over an existing set of destination tables, will
  now automatically re-enable their related Google Analytics properties for
  ingestion. This should make reinstalling the connector much faster.

### Bug fixes

* Tasks created by the connector now have a fixed set of properties, mostly
  related to `AUTOCOMMIT` and date-time formats, required for these tasks to
  work correctly. These will override account-level properties.

  Operating on the
  connector by explicitly calling its functions or procedures still requires
  certain default values, as described in
  [Snowflake Connector for Google Analytics Raw Data known limitations](../../connectors/google/gard/gard-connector-about.md).

## Version 0.19.2 (May 6th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue with refreshing OAuth access tokens that was causing long-running ingestions to fail.

## Version 0.19.0 (April 26th, 2024)

### Behavior changes

Not applicable.

### New features

* A new procedure `PUBLIC.UPDATE_WAREHOUSE` is now available to
  replace the warehouse used by the connector.
* Scheduling ingestions should now be much faster, especially for instances
  configured with large numbers of Google Analytics properties.

### Bug fixes

Not applicable.

## Version 0.18.0 (April 15th, 2024)

### Behavior changes

Not applicable.

### New features

* The procedure `UPDATE_CONNECTION_CONFIGURATION` was introduced. It can be
  used for re-authentication purposes. Connector has to be paused to use this
  procedure. Currently it is only available from the worksheet and takes one argument
  type of VARIANT composed of fields: `external_access_integration`,
  `security_integration` and `secret`.

### Bug fixes

Not applicable.

## Version 0.16.4 (April 11th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue with disrupted scheduling of ingestions during execution of dispatcher task.

## Version 0.16.3 (April 8th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue where changing the ingestion schedule while the
  dispatcher task was running could lead to temporary duplication of the dispatcher task.

## Version 0.16.2 (March 25th, 2024)

### Behavior changes

* The connector now requests permission to `EXECUTE MANAGED TASK`. This
  is in preparation for internal monitoring features that will help us
  discover and address issues earlier.

### New features

Not applicable.

### Bug fixes

* Fixed an issue where changing the ingestion schedule while the
  dispatcher task was running could corrupt the internal state of the
  connector so that it would not be able to schedule further ingestions.

## Version 0.16.1 (March 19th, 2024)

Versions released in between this and prior release notes had only internal
changes, and thus no release notes were published.

### Behavior changes

Not applicable.

### New features

* A number of optimizations made to scheduling ingestions and ingesting itself.
  This should result in slightly lower credit consumption, as well as increased
  ingestion throughput, especially on tables with fewer than one million records.
* The connector now supports mixed-case and lowercase secret names.

### Bug fixes

* Ingestions should no longer fail due to issues with refreshing the access
  token. This may very rarely still occur for particularly large tables, but
  will always be re-tried by the connector automatically.
* Added validation to prevent enabling the same Google Analytics property
  ingestion from multiple Google Cloud Platform projects.
* Enabling multiple properties at once is now more resilient against issues with
  BigQuery connectivity. It will only fail the properties for which it
  couldn’t connect to BigQuery, and successfully enable all the others.

## Version 0.11.1 (February 12th, 2024)

### Behavior changes

* Several stored procedures, meant specifically for early user interface, access were removed.
  These stored procedures were generally not documented,
  and not part of our public API, but they may have been visible when listing
  procedures exposed by the connector. There’s no change in how the connector
  works, or how it should be operated.
* The procedure `CONFIGURE_CONNECTION` now requires an additional parameter `security_integration`
  with the name of the security integration created for
  the connector. This applies only for worksheet-based setups. If you’re setting
  the connector up via the user interface, this change is transparent to you.

### New features

* Updated links to the connector’s documentation.

### Bug fixes

* Improved reading tables available in BigQuery to ensure we only look at
  Google Analytics export tables, and filter out similarly-named tables. This
  issue could sometimes cause enabling new properties, or ingesting enabled
  properties to fail.

## Version 0.10.1 (January 26th, 2024)

Initial public preview release.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Not Applicable

---
title: Snowflake Connector for Kafka release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes

The Snowflake Connector for Kafka release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](kafka-connector-2026.md)
* [2025 releases](kafka-connector-2025.md)
* [2024 releases](kafka-connector-2024.md)
* [2023 releases](kafka-connector-2023.md)
* [2022 releases](kafka-connector-2022.md)

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

---
title: Snowflake Connector for Kafka release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector-2022.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes for 2022

This article contains the release notes for the Snowflake Connector for Kafka, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Kafka updates.

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

## Version 1.8.2 (November 18, 2022)

### New features

* Added docker setup resources to the Kafka connect repo.
* Added multiple fixes for Kubernetes cluster schematization schema mapping.
* Added the `correlationId` to logging.
* Moved rows with `JsonProcessingException` into the DLQ instead of ignoring them.
* Added log granularity instance id for tasks.
* Added support for schema evolution with schematization.
* Increased the version of protobuf-java from 3.19.4 to 3.19.6 in `/test/test_data/protobuf`.
* Checked the schema evolution table option.
* Added security upgrade for `com.fasterxml.jackson.core:jackson-databind` from 2.13.2.1 to 2.13.4.2.

### Bug fixes

* Fixed Blackduck vulnerabilities.

## Version 1.6.8/1.8.1 (August 24, 2022)

### Bug fixes

* Upgraded the jackson-core and jackson-databind libraries to versions 2.13.1 and 2/13/2/1, respectively, to fix some issues with version 1.6.7.

## Version 1.7.2 (January 18, 2022)

### Bug fixes

* Upgraded the snowflake-jdbc library to version 3.13.14.
* Upgraded the jackson-core and jackson-databind libraries to 2.12.6 to resolve Possible DoS if using JDK serialization to serialize JsonNode.

## Version 1.7.0 (January 18, 2022)

No customer facing changes.

## Version 1.6.5 (January 18, 2022)

### Bug fixes

* Upgraded the jackson-core and jackson-databind libraries to 2.12.6 to resolve Possible DoS if using JDK serialization to serialize JsonNode.

---
title: Snowflake Connector for Kafka release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector-2023.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes for 2023

This article contains the release notes for the Snowflake Connector for Kafka, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Kafka updates.

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

## Version 2.1.2 (December 04, 2023)

### New features and updates

* Enabled Java Management Extensions (JMX) metrics for Snowpipe Streaming.
* Enabled tombstone ingestion for Snowpipe Streaming.
* Enabled Snowflake OAuth for Kafka connector with Snowpipe Streaming.
* Enabled schematization columns with special or reserved keywords.

### Bug fixes

* Fixed an issue that the one-client configuration option is not enabled by default. The one-client configuration option `enable.streaming.client.optimization` is now `TRUE` by default.
* Fixed an issue with channel naming.

## Version 2.0.1 (August 25, 2023)

### New features and updates

* Improved performance for schematization permission checks when rebalancing.

### Bug fixes

* Fixed a bug that caused missing data in tables due to issues with internal cache clearing during rebalancing.

## Version 2.0.0 (July 31, 2023)

### New features and updates

* Snowpipe Streaming with Kafka Connector is now Generally Available.

### Bug fixes

* None.

## Version 1.9.4 (July 13, 2023)

### New features and updates

* One client configuration:

  + Introduced the `enable.streaming.client.optimization` option, which is enabled by default.
  + With this client optimization, only one client is created for multiple topic partitions per Kafka connector. This feature can reduce client runtime and lower migration cost by creating larger files.
  + Note that in a high throughput scenario (for example, 50 MB/s per connector), we recommend that you disable this property if you see an increase in latency or costs.
* Permissions and security:

  + Unified Snowflake role and user for Snowpipe Streaming for table creation and insertion.
  + Upgraded guava dependency to 32.0.1.
  + Upgraded Snowpipe Streaming SDK dependency to 2.0.1.

### Bug fixes

* Fixed a wrong result issue that offsets are skipped when schematization is enabled.
* Snowpipe Streaming Channels are not closed on rebalance.

## Version 1.9.3 (May 22, 2023)

### New features and updates

* Added the ability to use one Streaming Ingest client (Default to false).
* Started using the MDC context logger.
* Upgraded to the following versions:

  + Ingest SDK version 1.1.4
  + JDBC version 3.13.30

### Bug fixes

* Fixed an issue related to using the GET command when using the downscoped token on GCP.
* Fixed Snowpipe-based KC’s commit offset behavior.

---
title: Snowflake Connector for Kafka release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector-2024.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes for 2024

This article contains the release notes for the Snowflake Connector for Kafka, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Kafka updates.

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

## Version 3.0.0 (December 10, 2024)

### New features and updates

* With this release, the Snowflake Connector for Kafka can ingest data into a Snowflake-managed [Apache Iceberg™ table](../../user-guide/tables-iceberg.md). For more information, see [Using the Snowflake Connector for Kafka with Apache Iceberg™ tables](../../user-guide/kafka-connector-iceberg.md).

### Bug fixes

* Fixed dependency vulnerabilities.

## Version 2.5.0 (October 31, 2024)

### New features and updates

* Upgraded the Snowflake Ingest Java SDK to version 2.3.
* Closing channels in parallel is now enabled by default. This improves the speed of restarting the connector.

### Bug fixes

* Fixed logging issues.

## Version 2.4.1 (September 19, 2024)

### New features and updates

* Upgraded the Snowflake Ingest Java SDK to version 2.2.2.

### Bug fixes

* Fixed issues with schematization.

## Version 2.4.0 (August 15, 2024)

### New features and updates

* Upgraded the Snowflake Ingest Java SDK to version 2.2.0, which contains a critical fix for potential issues when `change_tracking` is enabled for streams and dynamic tables.
* Upgraded the Snowflake JDBC driver from version 3.14.5 to version 3.18.0.
* Improved the logging experience in various components for improved troubleshooting experience.
* Improved the channel reopening logic.

> **Note:**
>
> For all Snowpipe Streaming usage, Snowflake recommends using the Kafka connector version 2.4.0 or later.

### Bug fixes

* Updated dependencies with known vulnerabilities.

## Version 2.3.0 (July 10, 2024)

### New features and updates

* Added support to close Snowpipe Streaming channels in parallel, which significantly reduces time for rebalancing.
* Added a new `SnowflakeConnectorPushTime` property in the metadata that represents the time when the message was pushed by the connector.

### Bug fixes

* Updated dependencies with known vulnerabilities.

## Version 2.2.2 (May 07, 2024)

### Bug fixes

* Fixed an issue where the staged files are not cleaned up properly.

## Version 2.2.1 (March 15, 2024)

### New features and updates

* Added offset verification logic to make sure there is no missing or duplicate data.
* Added client provider overridden map for Snowpipe Streaming. The map uses comma-separated key value pairs as input.
* Upgraded to the following versions:

  + JDBC version to 3.14.5.
  + kafka connect-api version to 3.7.0.
  + jackson-core and jackson-databind to 2.16.1
  + commons-compress to 1.26.0

### Bug fixes

* Cleaned up streaming ingest threads when `SinkTask stop ()` is called.

## Version 2.2.0 (February 06, 2024)

### BCR (Behavior Change Release) changes

* Preserved the old data type that goes into an ARRAY column for schematization.

### New features and updates

* Added support for AVRO logical types.
* Implemented changes to prevent potential data duplication because of a new channel name format.
* Implemented changes to preserve the old data type that goes into an ARRAY column for schematization.
* Implemented changes to make schema evolution add columns idempotent.
* Enabled the Ingest SDK `MAX_CLIENT_LAG` configuration in Kafka connector.

### Bug fixes

* Fixed schema evolution cases that could cause non-exactly once delivery.
* Fixed issues with generating and building Java library.
* Fixed an issue that the Kafka offset is not reset correctly.

---
title: Snowflake Connector for Kafka release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector-2025.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes for 2025

This article contains the release notes for the Snowflake Connector for Kafka, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Kafka updates.

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

## Version 3.5.0 (Dec 15, 2025)

### New features

* The connector now automatically handles client redirect during failover, thus eliminating the need to manually restart the connector after a primary deployment change.
* Added automatic failover support for Snowpipe Streaming.
* Added granular logging for schema evolution to improve troubleshooting.
* Upgraded Snowflake Ingest SDK to version 4.4.0.

## Version 3.4.0 (November 5, 2025)

### Behavior changes

* Changed default value of `enable.streaming.channel.offset.migration` property to `false`.
  This property was used only when the Snowflake Streaming channel was first used after migrating from
  Kafka Connector version 2.1.0 or 2.1.1: it enabled migration of offsets from channels created
  by those versions (which included the connector name in the channel name:
  `[connectorName]_[topic]_[partition]`) to the channel name format used by all other
  versions (`[topic]_[partition]`).

### New features

* Added `snowflake.streaming.channel.name.include.connector.name` property. When set to
  `true`, this includes the connector name in Snowpipe Streaming channel names
  (`[connectorName]_[topic]_[partition]`). Setting this property to `true` requires
  `enable.streaming.channel.offset.migration` to be set to `false`. This allows users
  of Kafka Connector versions 2.1.0 and 2.1.1 to upgrade without data loss.

  > **Warning:**
  >
  > Users upgrading from versions other than 2.1.0 and 2.1.1 who set
  > `snowflake.streaming.channel.name.include.connector.name` to `true` will
  > experience data duplication; there is no offset migration logic for other versions.
* Upgraded Snowflake Ingest SDK to version 4.3.2.

### Bug fixes

Not applicable.

## Version 3.3.1 (Oct 23, 2025)

### New features

* Upgraded Snowflake JDBC driver to version 3.26.1.

### Bug fixes

* OAuth URLs now support dots and hyphens.

## Version 3.3.0 (Aug 26, 2025)

### Behavior changes

Not applicable

### New features

* The Snowflake Connector for Kafka now supports `long` type with `timestamp` logical type in Apache Iceberg™ tables.

  > For a complete list of support types see [Data types for Apache Iceberg™ tables](../../user-guide/tables-iceberg-data-types.md).

### Bug fixes

Not applicable.

## Version 3.2.4 (Jul 31, 2025)

Internal updates only.

## Version 3.2.3 (Jul 14, 2025)

Internal updates only.

## Version 3.2.2 (Jun 26, 2025)

### Behavior changes

Not applicable

### New features

* Uses Confluent version 7.9.2 packages.

### Bug fixes

Not applicable

## Version 3.2.1 (Jun 2, 2025)

### Behavior changes

Not applicable

### New features

* Uses [JDBC](jdbc-2025.md) version 3.24.2.

### Bug fixes

Not applicable

## Version 3.2.0 (Apr 28, 2025)

### Behavior changes

Not applicable

### New features

* Removed support for double buffered version for SNOWPIPE_STREAMING ingestion type.

  Setting `snowflake.streaming.enable.single.buffer` has no effect.

### Bug fixes

* The connector no longer drops table rows with missing offsets.
* During schema evolution, on schema change, certain single records are no longer dropped.

## Version 3.1.3 (Apr 7, 2025)

### Behavior changes

* Snowpipe Streaming with double buffer is now deprecated. Only single buffer will be supported in future releases.

### New features and improvements

* Updated connector to use Kafka version 3.9.0.
* Updated connector to use slf4j-api version 2.0.17.
* Supports [JDBC](jdbc-2025.md) version 3.23.2.

### Bug fixes

* snowflake-jdbc no longer throws NullPointerException in certain situations.

## Version 3.1.2 (Mar 18, 2025)

### Behavior changes

Not applicable

### New features

* Supports using `-Infinity` values in a [floating-point number](../../sql-reference/data-types-numeric.md).
* Updated connector to use Confluent version 7.9.0 packages.
* Supports [JDBC](jdbc-2025.md) version 3.21.1.

### Bug fixes

Not applicable.

## Version 3.1.1 (Feb 26, 2025)

### Behavior changes

* [max_client_lag in Snowpipe Streaming](../../user-guide/snowpipe-streaming/snowpipe-streaming-classic-configuration.md) default value changed from `120s` to `30s`.

### New features

Not applicable

### Bug fixes

Not applicable.

## Version 3.1.0 (Jan 21, 2025)

> **Important:**
>
> If the `snowflake.topic2table.map` parameter is configured, Snowflake recommends using this version.
> We strongly recommend upgrading the connector if you are on earlier versions 2.x, 1.9.x, and 1.8.x.

### Behavior changes

Not applicable

### New features

* The Snowflake Connector for Kafka now supports external OAuth authentication.
* The Snowflake Connector for Kafka now uses Confluent version 7.8.0.

### Bug fixes

* The connector no longer throws the `IndexOutOfBoundException` when offsets are not continuous during schema evolution.
* For the Snowpipe ingestion method, when the `snowflake.topic2table.map` parameter is configured
  to map multiple topics to a single table, the connector adds the topic’s salt `hashCode` to the stage file prefixes to
  avoid file collision and load data from all specified topics.

---
title: Snowflake Connector for Kafka release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/kafka-connector-2026.md
section: Release Notes
---

# Snowflake Connector for Kafka release notes for 2026

This article contains the release notes for the Snowflake Connector for Kafka, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Kafka updates.

See [Snowflake Connector for Kafka](../../user-guide/kafka-connector.md) for documentation.

## Version 3.5.1 (Jan 8, 2026)

### New features

* Upgraded Snowflake Ingest SDK to version 4.4.1.

### Bug fixes

* Fixed a `NullPointerException` in `SnowflakeSinkTask#precommit()`.

---
title: Snowflake Connector for MySQL release notes
source: https://docs.snowflake.com/en/release-notes/connectors/mysql6.md
section: Release Notes
---

# Snowflake Connector for MySQL release notes

This topic provides release notes for the Snowflake Connector for MySQL.
For additional information, see [Snowflake connector for MySQL](https://other-docs.snowflake.com/en/connectors/mysql6/about).

## Version 6.11.2 (March 20, 2025)

### Behavior changes

* Reduced the number of INFO level logs in the agent for readability.
* The `_CONNECTORS_METADATA` and `JOURNALS` schemas were removed from the destination database in Snowflake. The connector can now only
  create journal tables in the database of the connector’s Snowflake Native App.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 6.10.1 (January 8, 2025)

### Behavior changes

* The snapshot load now resumes from the point of interruption. It’s supported only for specific column types described in [Resuming snapshot load after failures](https://other-docs.snowflake.com/en/connectors/mysql6/view-data#resuming-snapshot-load-after-failures).
  For other types, it starts from the beginning.
* The connector was optimized to result in a significant cloud services cost reduction.

### New features

Not applicable.

### Bug fixes

* Fixed a certain JDBC connection not being returned to the pool after use.
* The connector no longer creates short-lived tables, which increased cloud services usage.
* Fixed a bug when the database agent might enter an infinite loop when it couldn’t start incremental load for any table from a single data source.
* Snapshot and incremental replications do not fail on MySQL `DATE` or `DATETIME` values `0000-00-00` or values containing all zeroes in month or day. They are all replicated as `NULL`.

## Version 6.9.1 (December 12th, 2024)

### Behavior changes

* The native application has been migrated to the new [event-sharing model](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging). Telemetry sharing is now mandatory for all event types, such as logs and traces, and it can’t be disabled for the new installations. For already existing installations, you will receive an email with our account locator “EOA66985” asking you to review and configure the required event sharing.

### New features

Not applicable.

### Bug fixes

* Stopped printing the warning in the agent: “JAXB is unavailable. Will fallback to SDK implementation which may be less performant”.
* Stopped printing the warning in the agent: “Unable to load native-hadoop library for your platform… using builtin-java classes where applicable”.

## Version 6.8.0 (November 22nd, 2024)

### Behavior changes

* Increased the minimum schedule interval to 15 minutes.

  Pre-existing 10-minute intervals remain unchanged. Snowflake recommends setting them to at least 15 minutes.
* The operational warehouse will now suspend when there is no data traffic from the source database for more than 5 minutes.
* The `Killing agent on shutdown container` error message will not show when triggering the agent’s shutdown. Instead, the following message will be logged on the INFO level: `Stopping the agent on the container shutdown signal`.

### New features

* The `PUBLIC.REMOVE_TABLES(DATA_SOURCE_NAME STRING, SCHEMA_NAME STRING, TABLE_NAMES ARRAY)` procedure enables the removal of multiple tables with one call.

### Bug fixes

* The operational warehouse will now correctly suspend when all data sources are switched to a scheduled mode.
* The agent will no longer restart when the logging rate is too high.
* Fixed replication errors when the `DEFAULT_DDL_COLLATION` parameter for account is set.

## Version 6.6.1 (October 3rd, 2024)

### Behavior changes

* The maximum connection pool size to source databases has been increased to 7.

### New features

Not applicable.

### Bug fixes

* The agent no longer stops after encountering data source connectivity issues.

## Version 6.6.0 (September 17th, 2024)

### Behavior changes

Not applicable.

### New features

* You can now configure the agent through environment properties in addition to using `snowflake.json` and `datasources.json` for configuration.
* You can now pass the private key contents through the `snowflake.private-key` property or `SNOWFLAKE_PRIVATEKEY` environment variable.

For more information, see [Configure and run the agent](https://other-docs.snowflake.com/en/connectors/mysql6/install-agent#configure-and-run-the-agent).

### Bug fixes

* When schema introspection fails the operational warehouse suspends in the scheduled mode.
* Fixed a bug where some commands to stop table replication were ignored in a few cases.
* Fixed a bug where some source tables couldn’t be added to the replication and returned the error message `Tables are not ready to be re-added`.

## Version 6.5.0 (August 27th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* In continuous mode, the compute warehouse will now be able to suspend if there is no data to merge into destination tables.
* Fixed agent failure when MySQL server enforces secure connection

## Version 6.4.0 (August 15th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Corrected an issue where the connector could become stuck in a state where commands were not delivered to the agent.

## Version 6.3.2 (July 15th, 2024)

Initial release of version 6.3.2.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed handling of deleted columns in subsequent schema changes.
* Fixed compute warehouse not suspending in scheduled mode.
* Fixed validation of reserved column names to be case insensitive.
* Fixed MySQL tables failing replication on huge transactions.

## Version 6.3.0 (July 11th, 2024)

Initial release of version 6.3.0.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Not applicable.

---
title: Snowflake Connector for PostgreSQL release notes
source: https://docs.snowflake.com/en/release-notes/connectors/postgres6.md
section: Release Notes
---

# Snowflake Connector for PostgreSQL release notes

This topic provides release notes for the Snowflake Connector for PostgreSQL.
For additional information, see [Snowflake connector for PostgreSQL](https://other-docs.snowflake.com/en/connectors/postgres6/about).

## Version 6.11.2 (March 20, 2025)

### Behavior changes

* Reduced the number of INFO level logs in the agent for readability.
* The `_CONNECTORS_METADATA` and `JOURNALS` schemas were removed from the destination database in Snowflake. The connector can now only
  create journal tables in the database of the connector’s Snowflake Native App.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 6.10.1 (January 8, 2025)

### Behavior changes

* The snapshot load now resumes from the point of interruption. It’s supported only for specific column types described in [Resuming snapshot load after failures](https://other-docs.snowflake.com/en/connectors/postgres6/view-data#resuming-snapshot-load-after-failures).
  For other types, it starts from the beginning.
* The connector was optimized to result in a significant cloud services cost reduction.

### New features

Not applicable.

### Bug fixes

* Fixed a certain JDBC connection not being returned to the pool after use.
* The connector no longer creates short-lived tables, which increased cloud services usage.
* Fixed a bug when the database agent might enter an infinite loop when it couldn’t start incremental load for any table from a single data source.

## Version 6.9.1 (December 12th, 2024)

### Behavior changes

* The native application has been migrated to the new [event-sharing model](https://other-docs.snowflake.com/en/native-apps/consumer-enable-logging). Telemetry sharing is now mandatory for all event types, such as logs and traces, and it can’t be disabled for the new installations. For already existing installations, you will receive an email with our account locator “EOA66985” asking you to review and configure the required event sharing.

### New features

* The connector supports column types that are not listed in the pg_type table.

### Bug fixes

* Stopped printing the warning in the agent: “JAXB is unavailable. Will fallback to SDK implementation which may be less performant”.
* Stopped printing the warning in the agent: “Unable to load native-hadoop library for your platform… using builtin-java classes where applicable”.

## Version 6.8.0 (November 22nd, 2024)

### Behavior changes

* Increased the minimum schedule interval to 15 minutes.

  Pre-existing 10-minute intervals remain unchanged. Snowflake recommends setting them to at least 15 minutes.
* The operational warehouse will now suspend when there is no data traffic from the source database for more than 5 minutes.
* The `Killing agent on shutdown container` error message will not show when triggering the agent’s shutdown. Instead, the following message will be logged on the INFO level: `Stopping the agent on the container shutdown signal`.

### New features

* The `PUBLIC.REMOVE_TABLES(DATA_SOURCE_NAME STRING, SCHEMA_NAME STRING, TABLE_NAMES ARRAY)` procedure enables the removal of multiple tables with one call.

### Bug fixes

* The operational warehouse will now correctly suspend when all data sources are switched to a scheduled mode.
* The agent will no longer restart when the logging rate is too high.
* Fixed failing table replication on PostgreSQL numerics `NaN`, `Infinity`, `-Infinity` by ingesting them as `null`.
* Fixed replication errors when the `DEFAULT_DDL_COLLATION` parameter for account is set.

## Version 6.6.1 (October 3rd, 2024)

### Behavior changes

* The maximum connection pool size to source databases has been increased to 7.

### New features

Not applicable.

### Bug fixes

* The agent no longer stops after encountering data source connectivity issues.
* The agent now confirms the PostgreSQL WAL position even when the `QUOTED_IDENTIFIERS_IGNORE_CASE` parameter is enabled for an account.

## Version 6.6.0 (September 17th, 2024)

### Behavior changes

Not applicable.

### New features

* You can now configure the agent through environment properties in addition to using `snowflake.json` and `datasources.json` for configuration.
* You can now pass the private key contents through the `snowflake.private-key` property or `SNOWFLAKE_PRIVATEKEY` environment variable.

For more information, see [Configure and run the agent](https://other-docs.snowflake.com/en/connectors/postgres6/install-agent#configure-and-run-the-agent).

### Bug fixes

* When schema introspection fails the operational warehouse suspends in the scheduled mode.
* Fixed a bug where some commands to stop table replication were ignored in a few cases.
* Fixed a bug where some source tables couldn’t be added to the replication and returned the error message `Tables are not ready to be re-added`.

## Version 6.5.0 (August 27th, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* In continuous mode, the compute warehouse will now be able to suspend if there is no data to merge into destination tables.

## Version 6.4.0 (August 16th, 2024)

### Behavior changes

* The connector now supports all known types of Postgres publications.

### New features

* Added support for all PostgreSQL DOMAIN types based on native data types.

### Bug fixes

* Corrected an issue where the connector could become stuck in a state where commands were not delivered to the agent.

## Version 6.3.2 (July 15th, 2024)

Initial release of version 6.3.2.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed handling of deleted columns in subsequent schema changes.
* Fixed compute warehouse not suspending in scheduled mode.
* Fixed validation of reserved column names to be case insensitive.
* Fixed MySQL tables failing replication on huge transactions.

## Version 6.3.0 (July 11th, 2024)

Initial release of version 6.3.0.

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Not applicable.

---
title: Snowflake Connector for Python release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector.md
section: Release Notes
---

# Snowflake Connector for Python release notes

The Snowflake Connector for Python release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](python-connector-2026.md)
* [2025 releases](python-connector-2025.md)
* [2024 releases](python-connector-2024.md)
* [2023 releases](python-connector-2023.md)
* [2022 releases](python-connector-2022.md)

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

---
title: Snowflake Connector for Python release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector-2022.md
section: Release Notes
---

# Snowflake Connector for Python release notes for 2022

This article contains the release notes for the Snowflake Connector for Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Python updates.

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

## Version 2.9.0 (December 14, 2022)

### New features and updates

* Reworked authentication internals to allow users to plug custom key-pair authenticators.
* Multi-statement query execution is now supported through `cursor.execute` and `cursor.executemany`.

  + The Snowflake parameter `MULTI_STATEMENT_COUNT` can be altered at the account, session, or statement level.
    An additional argument, `num_statements`, can be provided to execute to use this parameter at the statement level.
    It must be provided to `executemany` to submit a multi-statement query through the method. Note that bulk
    insert optimizations available through `executemany` are not available when submitting multi-statement queries.

    - By default the parameter is 1, meaning only a single query can be submitted at a time.
    - Set to 0 to submit any number of statements in a multi-statement query.
    - Set to >1 to submit the specified exact number of statements in a multi-statement query.
    - Bindings are accepted in the same way for multi-statements as they are for single statement queries.
* Asynchronous multi-statement query execution is supported. Users should still use `get_results_from_sfqid` to retrieve results.
* To access the results of each query, users can `call SnowflakeCursor.nextset()` as specified in the
  DB 2.0 API (PEP-249), to iterate through each statements results.

  + The first statement’s results are accessible immediately after calling execute (or `get_results_from_sfqid` if asynchronous)
    through the existing `fetch*()` methods.

### Bug fixes

* Fixed a bug where the permission of the file downloaded via GET command is changed.

## Version 2.8.3 (November 28, 2022)

### New features and updates

* Bumped cryptography dependency from <39.0.0 to <41.0.0.

### Bug fixes

* Fixed a bug where an expired OCSP response cache caused infinite recursion during cache loading.

## Version 2.8.2 (November 18, 2022)

### New features and updates

* Improved performance of OCSP response caching.
* No longer resolve target location on the local machine during the execution of GET commands.
* Improved performance of regexes used for PUT/GET SQL statement detection.

## Version 2.8.1 (October 28, 2022)

### New features and updates

* Bumped cryptography dependency from <37.0.0 to <39.0.0.
* When closing a connection, the async query status checking is now parallelized.

### Bug fixes

* Fixed an issue where `write_pandas` wouldn’t write an empty `DataFrame` to Snowflake.

## Version 2.8.0 (September 27, 2022)

### Bug fixes

* Fixed missing `dtypes` when calling `fetch_pandas()` and `fetch_arrow()` on empty results.
* Fixed a bug where `rowcount` was deleted when the cursor was closed.
* Fixed a bug where `extTypeName` was used even when it was empty.
* Updated how telemetry entries are constructed.
* Added telemetry for imported root packages during run-time.
* Added telemetry for using `write_pandas`.
* The `write_pandas` function now supports providing additional arguments to be used by `DataFrame.to_parquet`.
* All optional parameters of `write_pandas` can now be provided to `pd_writer` and `make_pd_writer` to be used with `DataFrame.to_sql`.

## Version 2.7.12 (August 24, 2022)

### New features and updates

* Added in-file caching for OCSP response caching.
* Added support for OKTA Identity Engine.
* The `write_pandas` function now supports transient tables through the new `table_type` argument that supersedes `create_temp_table` argument.

### Bug fixes

* Fixed a bug where timestamps fetched as `pandas.DataFrame` or `pyarrow.Table` would overflow for the sake of
  unnecessary precision. In the case where an overflow cannot be prevented, a clear error is now raised.
* Fixed a bug where calling `fetch_pandas_batches` incorrectly raised `NotSupportedError` after an async query was executed.

## Version 2.7.11 (July 28, 2022)

### Bug fixes

* Added a minimum version pin to `typing_extensions`.

## Version 2.7.10 (July 25, 2022)

### New features and updates

* Added an in-memory cache to OCSP requests.
* Added an overwrite option to `write_pandas`.
* Added the `lastrowid` attribute to `SnowflakeCursor` in compliance with PEP-249.
* Added new connection diagnostics capabilities.
* Updated the following libraries and resources:

  + Supported pyarrow versions to 8.0.X.
  + Vendored library versions requests to 2.28.1 and urllib3 to 1.26.10.
  + Supported numpy dependency versions from 1.23.0 to 1.24.0.

### Bug fixes

* Fixed an issue where gzip-compressed HTTP requests might be garbled by an unflushed buffer.

## Version 2.7.5 (March 18, 2022)

### Behavior change

* Deprecated support for Python 3.6.

### New feature

* Added an option for partners to inject their name through an environmental variable (`SF_PARTNER`).

### Bug fixes

* Fixed a bug where we would not wait for input if a browser window couldn’t be opened for SSO login.
* Exported a type definition for `SnowflakeConnection`.
* Fixed a bug where final Arrow table would contain duplicate index numbers when using `fetch_pandas_all`.

## Version 2.7.3 (January 18, 2022)

### Bug fixes

* Moved package metadata from `setup.py` to `setup.cfg`.
* Added `Timezone` to `Timestamp_TZ`.
* Fixed an error related to storage credentials.
* Fixed an issue where py.typed was not being included in wheels.
* Fix an issue where negative numbers were not correctly converted using `arrow_number_to_decimal`.
* Added file handling for empty files when using GET.
* Fixed the long description rendering for PyPi.
* Added error handling for DUO when SMS is not present.
* Added the ability to auto-create a table when writing a pandas `DataFrame` to a Snowflake table.
* Updated numpy requirement from <1.22.0 to <1.23.0.
* Updated the CODEOWNERS file.

---
title: Snowflake Connector for Python release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector-2023.md
section: Release Notes
---

# Snowflake Connector for Python release notes for 2023

This article contains the release notes for the Snowflake Connector for Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Python updates.

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

## Version 3.6.0 (December 07, 2023)

### New features and updates

* Added support for vector types.
* Added support for the `private_key_file` and `private_key_file_pwd` connection parameters.
* Added the new `expired` flag to the `SnowflakeConnection` class that tracks whether the connection’s master token has expired.
* Changed the urlib3 version pin to affect only Python versions lower than 3.10.

### Bug fixes

* Fixed a bug where date insertion failed when the date format is set and qmark-style binding is used.

## Version 3.5.0 (November 13, 2023)

### New features and updates

* Snowflake Connector for Python is now built solely on the apache arrow-nanoarrow project:

  + Reduced the wheel size to ~1MB and the installation size to ~5MB.
  + Removed a hard dependency on a specific version of pyarrow.
* Deprecated the following in support of the nanoarrow converter:

  + `snowflake.connector.cursor.NanoarrowUsage` class.
  + `NANOARROW_USAGE` environment variable.
  + `snowflake.connector.cursor.NANOARROW_USAGE` module variable.

### Bug fixes

* None.

## Version 3.4.1 (November 09, 2023)

### New features and updates

* Updated the following libraries:

  + Updated the vendored `urlib3` to version 1.26.18.
  + Updated the vendored `requests` to version 2.31.0.

### Bug fixes

* None.

## Version 3.4.0 (November 03, 2023)

### New features and updates

* Added support for `use_logical_type` in `write_pandas`.
* Added the `backoff_policy` argument to `snowflake.connector.connect` allowing for configurable backoff policy between retries of failed requests. See available implementations in the `backoff_policies` module.
* Added the `socket_timeout` argument to `snowflake.connector.connect` specifying socket read and connect timeout.
* Removed dependencies on pycryptodomex and oscrypto. All connections now go through OpenSSL via the cryptography library, which was already a dependency.

### Bug fixes

* Fixed `login_timeout` and `network_timeout` behavior. Retries of login and network requests are now properly halted after these timeouts expire.
* Fixed bug for issue [urllib3/urllib3#1878](https://github.com/urllib3/urllib3/issues/1878) in vendored `urllib`.
* Fixed issue with ingesting files over 80 GB to S3.

## Version 3.3.1 (October 18, 2023)

### New features and updates

* For non-Windows platforms, added command suggestions (`chown` or `chmod`) for insufficient file permissions of config files.

### Bug fixes

* Fixed an issue where connection diagnostics failed to complete certificate checks.
* Fixed an issue where the arrow iterator caused `ImportError` when the C extensions were not compiled.

## Version 3.3.0 (October 12, 2023)

### New features and updates

* Updated to Apache arrow-nanoarrow project for result arrow data conversion.
* Introduced the `NANOARROW_USAGE` environment variable to allow switching between the nanoarrow converter and the
  arrow converter. Valid values include:

  + `FOLLOW_SESSION_PARAMETER`, which uses the converter configured in the server.
  + `DISABLE_NANOARROW`, which uses the arrow converter, overriding the server setting.
  + `ENABLE_NANOARROW`, which uses the nanoarrow converter, overriding the server setting.
* Introduced the `snowflake.connector.cursor.NanoarrowUsage` enum, whose members include:

  + `NanoarrowUsage.FOLLOW_SESSION_PARAMETER`, which uses the converter configured in the server.
  + `NanoarrowUsage.DISABLE_NANOARROW`, which uses the arrow converter, overriding the server setting.
  + `NanoarrowUsage.ENABLE_NANOARROW`, which uses the nanoarrow converter, overriding the server setting.
* Introduced the `snowflake.connector.cursor.NANOARROW_USAGE` module variable to allow switching between the nanoarrow converter and the
  arrow converter. It works in conjunction with the `snowflake.connector.cursor.NanoarrowUsage` enum.

> **Note:**
>
> The newly-introduced environment variable, enum, and module variable are temporary. They will be removed in a
> future release when switch from arrow to nanoarrow for data conversion is complete.

### Bug fixes

* None.

## Version 3.2.1 (October 3, 2023)

### New features and updates

* Added thread safety in telemetry when instantiating multiple connections concurrently.
* Improved robustness in handling authentication changes.
* Removed the `urllib3.contrib.pyopenssl` deprecation warning from `urllib3` library.
* Updated the `platformdirs` dependency to versions 2.6.0 through 4.0.0 from versions 2.6.0 through 3.9.0.

### Bug fixes

* Fixed a bug where URL, port, and path were ignored in AWS PrivateLink OCSP retry attempts.

## Version 3.2.0 (September 7, 2023)

### New features and updates

* Made the `parser -> manager` renaming more consistent in `snowflake.connector.config_manager` module.
* Added support for default values for `ConfigOptions`.
* Added `default_connection_name` to `config.toml` file.

### Bug fixes

* None.

## Version 3.1.1 (August 28, 2023)

### New features and updates

* Added support for RSAPublicKey when constructing `AuthByKeyPair` in addition to raw bytes.

### Bug fixes

* Fixed a bug in retry logic for OKTA authentication to refresh token.
* Fixed a bug where the attribute `proxy_header` is missing in `SOCKSProxyManager` when connecting through SOCKS5 proxy.

## Version 3.1.0 (July 31, 2023)

### New features and updates

* Added a feature that lets you add connection definitions to the `connections.toml` configuration file.
  A connection definition refers to a collection of connection parameters, for example, if you wanted to define a
  connection named “prod”:

  ```bash
  [prod]
  account = "my_account"
  user = "my_user"
  password = "my_password"
  ```

  By default, we look for the `connections.toml` file in the location specified in the `SNOWFLAKE_HOME` environment
  variable (default: `~/.snowflake`). If this folder does not exist, the Python connector looks for the file in
  the `platformdirs` location, as follows:

  + On Linux: `~/.config/snowflake/`, but follows XDG settings
  + On Mac: `~/Library/Application Support/snowflake/`
  + On Windows: `%USERPROFILE%\AppData\Local\snowflake\`

  You can determine which file is used by running the following command:

  ```bash
  python -c "from snowflake.connector.constants import CONNECTIONS_FILE; print(str(CONNECTIONS_FILE))"
  ```
* Bumped cryptography dependency from <41.0.0,>=3.1.0 to >=3.1.0,<42.0.0.
* Improved OCSP response caching to remove tmp cache files on Windows
* Improved OCSP response caching to reduce the times of disk writing.
* Added a parameter `server_session_keep_alive` in `SnowflakeConnection` that skips session deletion when client connection closes.
* Tightened our pinning of `platformdirs`, to prevent their new releases breaking new versions of the connector.
* Allowed you to pass `type_mapper` to `fetch_pandas_batches()` and `fetch_pandas_all()`.
* Improved retry logic for okta authentication to refresh token if authentication gets throttled.
* Added retry reasons for queries that are retried by the client.
* Remove Python 3.7 support.
* Improved error handling of connection reset error.

### Bug fixes

* Fixed a bug where `SFPlatformDirs` would incorrectly append application_name/version to its path.
* Fixed a bug where `write_pandas` fails when user does not have the privilege to create stage or file format in the target schema, but has the right privilege for the current schema.
* Worked around a segfault which sometimes occurred during cache serialization in multi-threaded scenarios.
* Fixed a bug about deleting the temporary files happened when running PUT command.
* Fixed a bug where `pickle.dump` segfaults during cache serialization in multi-threaded scenarios.

## Version 3.0.4 (May 25, 2023)

### New features and updates

* Added the `json_result_force_utf8_decoding` connection parameter to force decoding JSON content in utf-8 when the
  result format is JSON.
* Bumped vendored library urllib3 to 1.26.15
* Bumped vendored library requests to 2.29.0
* Bumped pandas dependency from <1.6.0,>=1.0.0 to >=1.0.0,<2.1.0
* Add support for Geometry types.

### Bug fixes

* Fixed a bug in which `cursor.execute()` could modify the argument `statement_params` dictionary object when executing a multi-statement query.
* Fixed a bug prevented calling `SnowflakeCursor.nextset` before fetching the result of the first query if the cursor runs an async multi-statement query.
* Fixed a bug when `_prefetch_hook()` was not called before yielding results of `execute_async()`.
* Fixed a bug where some `ResultMetadata` fields were marked as required when they were optional.
* Fixed a bug where bulk insert converts date incorrectly.

## Version 3.0.3 (April 20, 2023)

### New features and updates

* Added a parameter that allows users to skip file uploads to stage if file exists on stage and contents of the file match.
* Improved type hint of `SnowflakeCursor.execute` method.
* Improved GET logging to warn when downloading multiple files with the same name.

### Bug fixes

* Fixed a bug that prints error in logs for GET command on GCS.
* Added a parameter that allows users to skip file uploads to stage if file exists on stage and contents of the file match.
* Fixed a bug that occurred when writing a Pandas DataFrame with column names containing double quotes in `snowflake.connector.pandas_tool.write_pandas`.
* Fixed a bug that occurred when writing a Pandas DataFrame with binary data in `snowflake.connector.pandas_tool.write_pandas`.

## Version 3.0.2 (March 23, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed a bug of incorrect type hints of `SnowflakeCursor.fetch_arrow_all` and `SnowflakeCursor.fetchall`.
* Improved logging to mask tokens in case of errors.
* Fixed a bug where `snowflake.connector.util_text.split_statements` swallowed the final line break in the case when there are no space between lines.
* Fixed a memory leak in the logging module of the Cython extension.
* Fixed a bug where the `put` command on AWS raised an `AttributeError` when uploading a file composed of multiple parts.
* Fixed a bug where the `put` command on AWS raised an `AttributeError` for file sizes larger than 200MB.

## Version 3.0.1 (March 01, 2023)

### New features and updates

* Improved the robustness of OCSP response caching to handle errors in cases of serialization and deserialization.
* Replaced the dependency on `setuptools` in favor of packaging.
* Updated `async_executes` method’s doc-string.
* Errors raised now have a query field that contains the SQL query that caused them when available.

### Bug fixes

* Fixed a bug where `AuthByKeyPair.handle_timeout` should pass keyword arguments instead of positional arguments
  when calling `AuthByKeyPair.prepare`.
* Fixed a bug where MFA token caching would refuse to work until restarted instead of re-authenticating.

## Version 3.0.0 (January 27, 2023)

### BCR (Behavior Change Release) change

* Fixed a bug where write_pandas did not use user-specified schemas and databases to create intermediate objects.

  Previously, the write_pandas function created temporary objects in the currently-used database and schema and only put
  the final table (that was created or appended) in the user-specified database and schema. With this version,
  if the database or schema parameters for `write_pandas` are different than the currently-selected one, you need to
  make sure that the user who is executing `write_pandas` has access to create/drop temporary stages, file formats,
  and tables with the schema referenced by the `write_pandas` function.

  Snowflake recommends that you test any new driver version in pre-production environments before deploying to
  production environments. With this behavior change, you should give special attention to the scenario(s)
  listed above (i.e. `write_pandas` with database or schemas parameters that differ from the current context).

### New features and updates

* Bumped pyarrow dependency from >=8.0.0,<8.1.0 to >=10.0.1,<10.1.0
* Bumped pyOpenSSL dependency from <23.0.0 to <24.0.0
* During browser-based authentication, the SSO url is now printed before opening it in the browser
* Increased the level of a log for when ArrowResult cannot be imported
* Added a minimum MacOS version check when compiling C-extensions

### Bug fixes

* Fixed a bug where `write_pandas` did not use user-specified schema and database to create intermediate objects
* Fixed a bug where HTTP response code of 429 were not retried
* Fixed a bug where MFA token caching was not working

---
title: Snowflake Connector for Python release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector-2024.md
section: Release Notes
---

# Snowflake Connector for Python release notes for 2024

This article contains the release notes for the Snowflake Connector for Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Python updates.

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

## Version 3.12.4 (December 03, 2024)

### New features and updates

* Bumped the pyOpenSSL dependency from >=16.2.0,<25.0.0 to >=22.0.0,<25.0.0.

### Bug fixes

* Fixed a bug where multi-part uploads to Azure were missing their MD5 hashes.
* Fixed a bug where `OpenTelemetry` header injection would sometimes cause Exceptions to be thrown.
* Fixed a bug where OCSP checks would throw `TypeError` and make mainly GCP blob storage unreachable.

## Version 3.12.3 (October 24, 2024)

### Security fixes

* Addressed issues raised by CVE-2024-49750. For more information, see advisory [GHSA-5vvg-pvhp-hv2m](https://github.com/snowflakedb/snowflake-connector-python/security/advisories/GHSA-5vvg-pvhp-hv2m).

### New features and updates

* Improved the error message for SSL-related issues to provide clearer resolution guidance.
* Improved the error message for SQL execution cancellations caused by a timeout.

### Bug fixes

* None.

## Version 3.12.2 (September 11, 2024)

### New features and updates

* None.

### Bug fixes

* Improved error handling for asynchronous queries, providing more detailed and informative error messages when an async query fails.
* Improved inference of top-level domains for accounts specifying a region in China, now defaulting to `snowflakecomputing.cn`.
* Improved implementation of `snowflake.connector.util_text.random_string` to reduce the likelihood of collisions.
* Updated the log level for OCSP fail-open warning messages from ERROR to WARNING.

## Version 3.12.1 (August 20, 2024)

### New features and updates

* None.

### Bug fixes

* Fixed a bug that logged the session token when renewing a session.
* Fixed a bug where disabling client telemetry did not work.
* Fixed a bug where passing `login_timeout` as a string raised a `TypeError` during the login retry step.
* Updated the connector to use `pathlib` instead of `os` for resolving the default configuration file location.
* Removed the upper `cryptography` version pin.
* Removed references to the `snowflake-export-certs` script, as its backing module was removed in a previous version.
* Enhanced the retry mechanism for handling transient network failures during query result polling when no server response is received.

## Version 3.12.0 (July 26, 2024)

### New features and updates

* Set the default connection timeout to 10 seconds and the socket read time to 10 minutes for HTTP calls in file transfers.
* Added the ability to connect to multiple domains.
* Optimized `to_pandas()` performance by using fully-parallel downloading logic.
* Bumped the keyring dependency from g>=23.1.0,<25.0.0 to g>=23.1.0,<26.0.0.

### Bug fixes

* Fixed a bug where specifying `client_session_keep_alive_heartbeat_frequency` in `snowflake-sqlalchemy` could make the connector unresponsive.
* Fixed an incorrect `private_key` connection parameter type hint.

## Version 3.11.0 (June 18, 2024)

### New features and updates

* Added support for the `token_file_path` connection parameter to read an OAuth token from a file when connecting to Snowflake.
* Added support for the `debug_arrow_chunk` connection parameter to allow debugging raw arrow data in cases of arrow data parsing failures.
* Added support for the `disable_saml_url_check` connection parameter to disable SAML URL checks in OKTA authentication.

### Bug fixes

* Fixed a bug where OCSP certificates signed using SHA384 algorithm cannot be verified.
* Fixed a bug where the status code showed as uploaded when a PUT command failed with a 400 error.
* Fixed a bug where a `PermissionError` was raised when the current user does not have the right permission on parent directory of configuration file path.
* Fixed a bug where an OCSP GET URL is not encoded correctly when it contains a slash.
* Fixed a bug where an SSO URL didn’t accept `:` in a query parameter, such as in `https://sso.abc.com/idp/startSSO.ping?PartnerSpId=https://xyz.snowflakecomputing.com/`.

## Version 3.10.1 (May 21, 2024)

### New features and updates

* None.

### Bug fixes

* Removed an incorrect error log message that could occur during arrow data conversion.

## Version 3.10.0 (April 29, 2024)

### New features and updates

* Added support for structured types to `fetch_pandas_all`.

### Bug fixes

* Fixed an issue relating to incorrectly formed China S3 endpoints.

## Version 3.9.1 (April 22, 2024)

### New features and updates

* Fixed an issue that caused a HTTP 400 error when connecting to a China endpoint.

### Bug fixes

* None.

## Version 3.9.0 (April 18, 2024)

### New features and updates

* Added support for log settings in a [logging configuration file](../../developer-guide/python-connector/python-connector-example.md).
* Improved S3 acceleration logic when connecting to a China endpoint.

### Bug fixes

* None.

## Version 3.8.1 (April 09, 2024)

### New features and updates

* Improved `externalbrowser` authentication in containerized environments:

  + Instructs the browser to not fetch `/favicon` on a success page.
  + Uses a simple retry strategy for an empty `socket.recv` call.
  + Adds a `SNOWFLAKE_AUTH_SOCKET_REUSE_PORT` flag (`SNOWFLAKE_AUTH_SOCKET_REUSE_PORT=true`) to set the underlying socket’s `SO_REUSEPORT` flag (as described in the [socket man page](https://man7.org/linux/man-pages/man7/socket.7.html)).

    - Setting this flag can be useful when the randomized port used in the localhost callback url is being followed before the container engine completes port forwarding to host.
    - You can then statically map a port between your host and container and allow that port to be reused in rapid succession with a command similar to the following:

      ```bash
      SF_AUTH_SOCKET_PORT=3037 SNOWFLAKE_AUTH_SOCKET_REUSE_PORT=true poetry run python somescript.py
      ```
  + Adds a `SNOWFLAKE_AUTH_SOCKET_MSG_DONTWAIT` flag (`SNOWFLAKE_AUTH_SOCKET_MSG_DONTWAIT=true`) to make a non-blocking `socket.recv` call and retry on an error.
* Added support for parsing structured type information in schema queries.
* Bumped `platformdirs` from >=2.6.0,<4.0.0 to >=2.6.0,<5.0.0.
* Updated diagnostics to use `system$allowlist` instead of `system$whitelist`.
* Improved the cleanup logic so connections now rely on an interpreter shutdown instead of the `__del__` method.
* Updated the logging level from INFO to DEBUG when logging the executed query using `SnowflakeCursor.execute`.

### Bug fixes

* Fixed a bug that the truncated password in log is not masked.

## Version 3.7.1 (February 22, 2024)

### New features and updates

* Bumped the following dependencies:

  + pandas from version >=1.0.0,<2.2.0 to >=1.0.0,<3.0.0
  + cryptography from version <42.0.0,>=3.1.0 to >=3.1.0,<43.0.0
  + pyOpenSSL from version >=16.2.0,<24.0.0 to >=16.2.0,<25.0.0
* Bumped the keyring dependency lower bound to version 23.1.0 to address a security vulnerability.

### Bug fixes

* Fixed a memory leak in decimal data conversion.
* Fixed a bug where `write_pandas` wasn’t truncating the target table.

## Version 3.7.0 (January 26, 2024)

### New features and updates

* Added support for Python 3.12.
* Added a new Boolean `force_return_table` parameter to `SnowflakeCursor.fetch_arrow_all` to force returning `pyarrow.Table` in case of zero rows.
* Cleanup some C++ code warnings and performance issues.
* Made local testing more robust against implicit assumptions.
* Added support for connecting using an existing connection via the session and master token.
* Added support for connecting to Snowflake by authenticating with multiple SAML IDP using an external browser.
* Improved configuration permissions warning message.

### Bug fixes

* Fixed an issue with PyArrow Table type hinting.
* Fixed a compilation issue due to a missing `cstdint` header on gcc13.

---
title: Snowflake Connector for Python release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector-2025.md
section: Release Notes
---

# Snowflake Connector for Python release notes for 2025

This article contains the release notes for the Snowflake Connector for Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Python updates.

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

## Version 4.1.1 (Dec 02, 2025)

### New features and updates

* None.

### Bug fixes

* Relaxed the pandas dependency requirements for Python below 3.12.
* Changed the CRL cache cleanup background task to a daemon thread to avoid blocking the main thread.
* Fixed NO_PROXY issues with PUT operations.

## Version 4.1.0 (Nov 13, 2025)

### New features and updates

* Added official support for RHEL9 (Red Hat Enterprise Linux 9).
* Added the `oauth_socket_uri` connection parameter to allow users to specify separate server and redirect URIs for local OAuth server.
* Added the `no_proxy` parameter for proxy configuration without using environmental variables.
* Added the `SNOWFLAKE_AUTH_FORCE_SERVER` environment variable to force the driver to receive SAML tokens even without opening a browser when using the `externalbrowser` authentication method. The variable allows headless environments, such as Docker or Airflow) that run locally to authenticate the connection using a browser URL.

### Bug fixes

* Fixed a compilation error when building from sources with libc++.
* Added `OAUTH_AUTHORIZATION_CODE` and `OAUTH_CLIENT_CREDENTIALS` to the list of authenticators that don’t require users to set the `user` parameter.

## Version 4.0.0 (Oct 9, 2025)

### BCR (Behavior Change Release) changes

* Configuration files writable by a group or others now raise a `ConfigSourceError` with detailed permission information, preventing potential credential tampering.
* Reverted changing the exception type in case of token expired scenario for `Oauth` authenticator back to `DatabaseError`.

### New features and updates

* Implemented a new CRL (Certificate Revocation List) checking mechanism.

  Enabling CRLs improves security by checking for revoked certificates during the TLS handshake process. For more information, see the [Replacing OCSP with CRL as the method of certificate revocation checking](https://community.snowflake.com/s/article/Replacing-OCSP-with-CRL-as-the-method-of-certificate-revocation-checking) Knowledge Base article.

  This feature is disabled by default. For information on enabling this feature, see [CertRevocationCheckMode](../../developer-guide/python-connector/python-connector-api.md). We recommend you test this feature in advisory mode before enabling it in production.
* Added the `workload_identity_impersonation_path` parameter to support service account impersonation for Workload Identity Federation. Impersonation is available only for Google Cloud and AWS workloads.
* Added the `oauth_credentials_in_body` parameter to support sending OAuth client credentials in a connection request body.
* Added an option to exclude `botocore` and `boto3` dependencies during installation by setting the `SNOWFLAKE_NO_BOTO` environment variable to `true`. For the full details, see [Installing the Python Connector](../../developer-guide/python-connector/python-connector-install.md).
* Added the `ocsp_root_certs_dict_lock_timeout` connection parameter to set the timeout (in seconds) for acquiring the lock on the OCSP root certs dictionary. The default value is -1, which represents no timeout.

### Bug fixes

* Fixed `get_results_from_sfqid` when using `DictCursor` and executing multiple statements at once.
* Fixed retry behavior for `ECONNRESET` errors.
* Fixed the return type of `SnowflakeConnection.cursor(cursor_class)` to match the type of `cursor_class`.
* Constrained the types of `fetchone`, `fetchmany`, and `fetchall`.
* Fixed the “No AWS region was found” error when AWS region was set in the `AWS_DEFAULT_REGION` variable instead of in `AWS_REGION` for the `WORKLOAD_IDENTITY` authenticator.

## Version 3.18.0 (Oct 6, 2025)

### New features and updates

* Added support for pandas conversion for Day-time and Year-Month Interval types.

### Bug fixes

* None.

## Version 3.17.4 (Sep 22, 2025)

### New features and updates

* Added support for allowing intermediate certificates from the trust store to act as root certificates.
* Updated bundled `urllib3` to version v2.5.0.
* Updated bundled `requests` to version v2.32.5.
* Dropped support for OpenSSL versions older than 1.1.1.

### Bug fixes

* None.

## Version 3.17.3 (Sep 3, 2025)

### New features and updates

* None.

### Bug fixes

* Enhanced configuration file permission warning messages.

  + Improved warning messages for readable permission issues to include clear instructions on how to skip warnings using the `SF_SKIP_WARNING_FOR_READ_PERMISSIONS_ON_CONFIG_FILE` environment variable.
* Fixed the bug with staging pandas dataframes on AWS — the regional endpoint is used when required.

  + This fix addresses the issue with the `create_dataframe` call on Snowpark.

## Version 3.17.2 (August 20, 2025)

### New features and updates

* None.

### Bug fixes

* Added the ability to disable endpoint-based platform detection by setting `platform_detection_timeout_seconds` to zero.
* Fixed a bug where `platform_detection` was retrying failed requests with warnings to non-existent endpoints.

## Version 3.17.1 (August 14, 2025)

### New features and updates

* Added the `infer_schema` parameter to `write_pandas` to perform schema inference on the passed data.

### Bug fixes

* Reverted the `snowflake` namespace back to non-module.

## Version 3.17.0 (August 13, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `workload_identity_provider` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the `authenticator` connection parameter.
* Added an `unsafe_skip_file_permissions_check` flag to skip file permission checks on the cache and configuration.
* Added basic JSON support for `Interval` types.
* Added populating of `type_code` in `ResultMetadata` for interval types.
* Relaxed the pyarrow version constraint; versions >= 19 can now be used.
* Introduced the `snowflake_version property` to the connection.
* Added support for the `use_vectorized_scanner` parameter in the `write_pandas` function.
* Added support of proxy setup using connection parameters without emitting environment variables.

### Bug fixes

* Fixed OAuth authenticator values.
* Fixed a bug where a PAT with an external session authenticator was used while `external_session_id` was not provided in `SnowflakeRestful.fetch`.
* Fixed the case-sensitivity of Oauth and `programmatic_access_token` authenticator values.
* Fixed unclear error messages for incorrect authenticator values.
* Fixed GCS staging by ensuring the endpoint has a scheme.
* Fixed a bug where time-zoned timestamps fetched as a `pandas.DataFrame` or `pyarrow.Table` would overflow due to unnecessary precision. A clear error is now raised if an overflow cannot be prevented.

## Version 3.16.0 (July 01, 2025)

### New features and updates

* Added the `client_fetch_use_mp` connection parameter that enables multi-processed fetching of result batches, which usually reduces fetching time.
* Added support for the new Personal Access Token (PAT) authentication mechanism with external session ID.
* Added the `bulk_upload_chunks` parameter to the `write_pandas` function. Setting this parameter to `True` changes the behavior of the `write_pandas` function to first write all the data chunks to the local disk and then perform the wildcard upload of the chunks folder to the stage. When set to `False` (default), the chunks are saved, uploaded, and deleted one by one.
* Added Windows support for Python 3.13.
* Added basic arrow support for `Interval` types.
* Added support for Snowflake OAuth for local applications.

### Bug fixes

* Fixed `write_pandas` special characters usage in the location name.
* Fixed the usage of `use_virtual_url` when building the location for a Google Cloud Storage (GCS) client.

## Version 3.15.0 (April 28, 2025)

### Private Preview (PrPr) features

Added support for workload identity federation in the AWS, Azure, GCP and Kubernetes platforms.

Disclaimer:

* This feature can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use this feature only with non-production data.
* This PrPr feature is not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Added new authentication methods support for OAuth 2.0 Authorization Code Flow, OAuth 2.0 Client Credentials Flow, and OAuth Token caching.

  + For OAuth 2.0 Authorization Code Flow:

    - Added the `oauth_client_id`, `oauth_client_secret`, `oauth_authorization_url`, `oauth_token_request_url`, `oauth_redirect_uri`, `oauth_scope`, `oauth_disable_pkce`, `oauth_enable_refresh_tokens` and `oauth_enable_single_use_refresh_tokens` parameters.
    - Added the `OAUTH_AUTHORIZATION_CODE` value for the parameter authenticator.
  + For OAuth 2.0 Client Credentials Flow:

    - Added the `oauth_client_id`, `oauth_client_secret`, `oauth_token_request_url`, and `oauth_scope` parameters.
    - Added the `OAUTH_CLIENT_CREDENTIALS` value for the parameter authenticator.
  + For OAuth Token caching: Passing a username to driver configuration is required, and the `client_store_temporary_credential property` is to be set to `true`.

### Bug fixes

* Increased the minimum required `boto` and `botocore` versions to 1.24.
* Fixed an issue with OSCP by terminating a certificate’s chain traversal if a trusted certificate was already reached.

## Version 3.14.1 (April 21, 2025)

### Private Preview (PrPr) features

* Added the `client_fetch_threads` experimental parameter to better utilize threads for fetching query results.
* Added new experimental authentication methods:

  + OAuth authorization code and client credentials flows.
  + Workload Identity Federation for AWS, Azure, GCP and Kubernetes platforms.

Disclaimer:

* These features can only be accessed by setting `SF_ENABLE_EXPERIMENTAL_AUTHENTICATION` environment variable to `true`.
* You should use these features only with non-production data.
* These PrPr features are not covered by Support. However, the Product and Engineering teams are available during the PrPr phase.
* Please contact your account team for participation and documentation.

### New features and updates

* Added support for Python 3.13.

  > **Note:**
  >
  > Windows 64 support is still experimental and should not yet be used for production environments.
* Dropped support for Python 3.8.
* Added support for the basic decimal `floating-point` type.
* Added support for providing a PAT in the `password` field.
* Added support for GCS regional endpoints.
* Added support for GCS virtual URLs. For more information, see [Request endpoints](https://cloud.google.com/storage/docs/request-endpoints#xml-api).
* Added support to allow the connector to inherit a UUID4 generated upstream, provided in statement parameters (field: `requestId`), rather than automatically generate a UUID4 to use for the HTTP Request ID.
* Improved logging in the urllib3, boto3, and botocore libraries to assure data masking even after a future migration to the external owned library.
* Lowered log levels from `info` to `debug` for some of the messages to make the output easier to follow.
* Improved security and robustness for the temporary credentials cache storage.
* Deprecated the `insecure_mode` connection property and replaced it with `disable_ocsp_checks` with the same behavior as the former property.
* Implemented and improved the file-based credentials cache for Linux, including enhanced token caching.

### Bug fixes

* Improved the error message for client-side query cancellations due to timeouts.
* Fixed a bug that caused the driver to fail silently on `TO_DATE` arrow to python conversion when an invalid date was followed by the correct one.
* Added the `check_arrow_conversion_error_on_every_column` connection property that can be set to `False` to restore previous behavior in which driver ignores errors until they occurs in the last column. This option lest you unblock workflows that might be impacted by the bug fix and will be removed in later releases.
* Fixed an issue with expired S3 credentials update and increment retry when expired credentials are found.

## Version 3.14.0 (March 03, 2025)

### New features and updates

* Bumped the pyOpenSSL dependency upper boundary from <25.0.0 to <26.0.0.
* Optimized distribution package lookup to improve import speed.
* Added support for iceberg tables to `write_pandas`.
* Added support for `File` types.

### Bug fixes

* Added a <19.0.0 pin to `pyarrow` as a workaround to a bug affecting Azure Batch.
* Fixed a bug where the privatelink OCSP Cache url could not be determined if the privatelink account name was specified in uppercase.
* Fixed base64 encoded private key tests.
* Fixed a bug with file permission checks on Windows.
* Added the `unsafe_file_write` connection parameter that restores the previous behavior of saving files downloaded with GET with 644 permissions.

## Version 3.13.2 (January 30, 2025)

### New features and updates

* The connector no longer uses scoped temporary objects.

### Bug fixes

* None.

## Version 3.13.1 (January 29, 2025)

### New features and updates

* None.

### Bug fixes

* Hardened the `snowflake.connector.pandas_tools` module against SQL injection. For more information, see [CVE-2025-24793](https://github.com/snowflakedb/snowflake-connector-python/security/advisories/GHSA-2vpq-fh52-j3wv).
* The local OCSP cache has been updated to use the json module instead of pickle to serialize its contents. For more information, see [CVE-2025-24794](https://github.com/snowflakedb/snowflake-connector-python/security/advisories/GHSA-m4f6-vcj4-w5mx).
* The Linux credential cache file permissions have been updated explicitly to be only be owner readable. For more information, see [CVE-2025-24795](https://github.com/snowflakedb/snowflake-connector-python/security/advisories/GHSA-r2x6-cjg7-8r43).
* Updated the file permissions for files downloaded with GET to be readable only by the file owner.

## Version 3.13.0 (January 23, 2025)

### New features and updates

* Added the `iobound_tpe_limit` connection parameter to limit the sizes of IO-bound `ThreadPoolExecutors` during PUT and GET commands. By default, the size is calculated to the lesser of the number of files and the number of CPU cores.
* Added the `Connection.is_valid()` method that verifies whether a connection is stable enough to receive queries.
* Updated the log level for cursor’s chunk `rowcount` from INFO to DEBUG.
* Added support for base64-encoded DER private key strings in the `private_key` authentication type.
* Updated `README.md` to include instructions on how to verify package signatures using `cosign`.

### Bug fixes

* None.

---
title: Snowflake Connector for Python release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/python-connector-2026.md
section: Release Notes
---

# Snowflake Connector for Python release notes for 2026

This article contains the release notes for the Snowflake Connector for Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Python updates.

See [Snowflake Connector for Python](../../developer-guide/python-connector/python-connector.md) for documentation.

## Version 4.4.0 (Mar 25, 2026)

### New features and updates

* Bumped the lower boundary of the `cryptography` package to 46.0.5 to address CVE-2026-26007.
* Added support for Python 3.14.
* Removed the upper bound dependency constraint on `pyOpenSSL` to allow installation of `pyOpenSSL` 26.0.0+, which includes a fix for GHSA-vp96-hxj8-p424.

### Deprecated features

* Renamed the environment variable for skipping config file permission warnings from `SF_SKIP_WARNING_FOR_READ_PERMISSIONS_ON_CONFIG_FILE` to `SF_SKIP_TOKEN_FILE_PERMISSIONS_VERIFICATION`. The old variable is still supported but emits a deprecation warning.

### Bug fixes

* Fixed the Azure IMDS `Metadata` header to use lowercase `"true"` instead of `"True"`, which caused 400 errors during Azure Workload Identity Federation authentication.
* Fixed the default `crl_download_max_size` to be 20MB instead of 200MB to prevent potential out-of-memory issues.
* Fixed a bug where Azure GET commands would incorrectly set the file status to `UPLOADED` instead of preserving the `DOWNLOADED` status during metadata retrieval.
* Fixed the `unsafe_skip_file_permissions_check` flag not being respected when reading `connections.toml`.
* Fixed a `JSONDecodeError` in `result_batch._load()` when fetching large result sets.

## Version 4.3.0 (Feb 12, 2026)

### Deprecated features

* Deprecated support for custom revocation error classes in OCSP response cache deserialization.

  > By default, only `RevocationCheckError` exceptions are deserialized from OCSP cache. Custom exception classes can be temporarily enabled by setting the `SNOWFLAKE_ENABLE_CUSTOM_REVOCATION_ERRORS` environment variable to `true` or `1`, but this support will be removed in a future release.

### New features and updates

* Bumped vendored `urllib3` to version 2.6.3.
* Added `force_microseconds_precision` to `cursor.fetch_arrow_all` and `cursor.fetch_pandas_all` to avoid PyArrow schema inconsistencies between batches.
* Added a warning when using HTTP protocol for OAuth URLs.
* Updated the `server_session_keep_alive` parameter in `SnowflakeConnection` to skip checking for pending asyncronous queries, providing faster connection close times, especially when many asyncronous queries are executed.

### Bug fixes

* Fixed the string representation of `INTERVAL YEAR` and `INTERVAL MONTH` types.
* Ensured proper list conversions; the converter now runs `to_snowflake` on all list items.

## Version 4.2.0 (Jan 07, 2026)

### New features and updates

* Added the `SnowflakeCursor.stats` property to expose granular DML statistics (rows inserted, deleted, updated, and duplicates) for operations like CTAS where `rowcount` is insufficient.
* Added support for injecting Snowpark Container Services (SPCS) service identifier tokens (`SPCS_TOKEN`) into login requests when present in SPCS containers.
* Introduced a shared library for extended telemetry to identify and prepare testing platforms for native Rust extensions.

### Bug fixes

* None.

---
title: Snowflake Connector for Python: Empty results of fetch_arrow and fetch_pandas are typed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-812.md
section: Release Notes
---

# Snowflake Connector for Python: Empty results of fetch_arrow and fetch_pandas are typed

This behavior change is not part of the 2022_07 bundle in the 6.32 release, and can only be tested once the specified version of the Snowflake Connector for Python has been released.

For the most up-to-date details about the change, as well as other release-related details, refer to the [Behavior Change Log](../../behavior-changes.md).

Currently, if the `fetch_arrow` and `fetch_pandas` Python functions return empty results, the empty columns are returned
as generic Objects. They do not have a specific data type.

In a future version of the connector, the behavior of these functions will change as follows:

Previously:
:   If `fetch_arrow` and `fetch_pandas` return empty results, the schema columns of the results have a generic Object type.

Currently:
:   If `fetch_arrow` and `fetch_pandas` return empty results, the schema columns returned will be assigned the same data type as the column.

This change will be made in **version 2.8.0** of the Snowflake Connector for Python. When this version of the connector is available,
you should verify these changes in a test environment before upgrading your production environment.
Ref: 215

---
title: Snowflake Connector for ServiceNow® V2 release notes
source: https://docs.snowflake.com/en/release-notes/connectors/servicenow-v2.md
section: Release Notes
---

# Snowflake Connector for ServiceNow® V2 release notes

This topic provides release notes for the Snowflake Connector for ServiceNow® V2. For additional
information, see
[Snowflake Connector for ServiceNow](https://other-docs.snowflake.com/en/connectors/servicenow/v2/about).

## Version 5.27.4 (March 5, 2026)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue that caused alert emails to be sent too often and with incorrect error data.

## Version 5.27.3 (February 17, 2026)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed a possible race condition during the reload finalization process.

## Version 5.27.2 (January 21, 2026)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue where display values were not processed correctly when ingesting rotated tables.
* Improved logging in the alerting system.

## Version 5.27.1 (Nov 25, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Changed the column used for API request length validation.

## Version 5.27.0 (Oct 29, 2025)

### Behavior changes

* The connector no longer has a `128 MB` memory limit for uncompressed responses from the ServiceNow API.

### New features

Not applicable.

### Bug fixes

Not applicable.

## Version 5.26.0 (Oct 1, 2025)

### Behavior changes

* Custom journal tables are currently disabled. They’ll be restored with new functionality in a future release.
* When you pause the connector, only worker tasks are forcefully canceled. Other tasks keep running until they finish, so pausing might take a bit longer.

### New features

* The connector now lets you use `NOT LIKE` and `NOT IN` operators for row filtering, so you can filter your data more flexibly during ingestion.

### Bug fixes

* The connector now retries `curl` errors more times, making it more resilient to network issues in Azure deployments.

## Version 5.25.2 (Sep 12, 2025)

### Behavior changes

Not applicable.

### New features

* Improved logging of HTML responses.
* Improved error handling for External Access.

### Bug fixes

Not applicable.

## Version 5.25.1 (Sep 10, 2025)

### Behavior changes

Not applicable.

### New features

* Improved handling of large response sizes during data decompression by reducing the page size when necessary.

### Bug fixes

Not applicable.

## Version 5.25.0 (Sep 5, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue that caused the schema table to be created incorrectly during state import.
* Fixed an issue in filtered reload mode where some state events could be saved in the wrong order, which could lead to missed updates.
* Improved handling of large responses from ServiceNow.
* Added more detailed logging for ServiceNow response properties.

## Version 5.24.0 (Jun 23, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue that could cause the metadata tables to be ingested incorrectly during reload when the connector is globally configured to fetch display values.
  As a result of this issue, flattened views were not created for some tables. If this issue
  occurred, the following metadata tables had to be reloaded:

  + `sys_dictionary`
  + `sys_db_object`
  + `sys_glide_object`

## Version 5.23.0 (Jun 12, 2025)

### Behavior changes

* For records in the `sys_created_on` or `sys_updated_on` columns with null values, the connector inserts an update event only when the
  record has changed since the last ingestion. Previously, the connector inserted an update event to the event log table during each
  ingestion cycle, regardless of whether the record changed. This behavior could cause the event log table to grow indefinitely, even
  if no changes were found in the table.

### New features

Not applicable.

### Bug fixes

* Increased the range of page sizes that the connector tries during filtered ingestion. When fetching data, the connector should now be
  more resilient to timeout errors that come from the ServiceNow® API.
* Fixed the internal cleanup job to retain internal connector information that is needed to perform ingestion. Previously, when this
  information was removed, it could cause ingestion failures.
* Fixed an error during the creation of flattened views. This error was caused by a missing column in the internal connector table.

## Version 5.22.3 (May 26, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue with page size persistence when reloading a table.

## Version 5.22.1 (Apr 28, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue that could cause the export of the connector state to fail when row filtering expressions are used on string values.

## Version 5.22.0 (Apr 24, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed an issue that could cause an HTTP response parsing failure. In some cases, this issue could cause the connector to fail to ingest
  data from ServiceNow®.

## Version 5.21.0 (Apr 15, 2025)

### Behavior changes

Not applicable.

### New features

* Added support for continuous schedules. You can use this feature to set an ingestion schedule for up to 20 tables that will be executed
  every one minute. Snowflake recommends using continuous schedules carefully and only for tables that require near-real-time data in
  Snowflake. To enable this feature, you can use the `ENABLE_TABLE` or `CONFIGURE_TABLES_SCHEDULE` procedures. To learn more, see
  [Specifying the Synchronization Schedule](https://other-docs.snowflake.com/en/connectors/servicenow/ingestion#specifying-the-synchronization-schedule).
* The maximum number of tables that can be ingested concurrently has been increased from 30 to 50. This update allows for better warehouse
  utilization and improves overall performance. To learn more, see [Scaling the connector](https://other-docs.snowflake.com/en/connectors/servicenow/managing#scaling-the-connector).

### Bug fixes

* The connector is more stable and more performant when ingesting multiple tables in parallel.

## Version 5.20.0 (Apr 8, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed a bug that caused the export of the connector state to fail.

## Version 5.19.1 (Mar 25, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed a bug that caused the parsing process of the API response from ServiceNow® to fail when a header name in the response didn’t match the expected format.
* Fixed a bug that caused the export of the connector state and configuration to fail, when a filtered reload was run on a table.

## Version 5.19.0 (Mar 20, 2025)

### Behavior changes

Not applicable.

### New features

* The `DELETE_TABLE` procedure now accepts an optional `drop_related_objects` boolean parameter.
  When this parameter is set to `true`, the procedure drops all the objects related to the table,
  such as the flattened views, the event log table, and the sink table.
* The filtered reload feature now supports detection of deletes and can filter out these records
  when using the `sys_ids` parameter in the `RELOAD_TABLE` procedure.
  Prior to this release, the filtered reload feature only detected data updates and insertion.

### Bug fixes

* Corrected error in `CONNECTOR_STATS` view ingested row statistics when running filtered reload.

## Version 5.18.1 (Mar 10, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Reverted a performance optimization that could cause increased warehouse consumption.

## Version 5.18.0 (Feb 28, 2025)

### Behavior changes

* To see the configuration details for tables you reload, use the new `RELOADED_TABLES` view instead of the `CONFIGURED_TABLES` view.
  This new view includes the configuration values for the table from the `CONFIGURED_TABLES` view plus new columns that
  provide information about the reload configuration that was used for the table and the reload status for the table. For more information,
  see [About Monitoring the Connector](https://other-docs.snowflake.com/connectors/servicenow/monitoring#about-monitoring-the-connector).

### New features

* Added support for OAuth client credentials grant flow. When setting up OAuth, we recommend that you use this flow instead of OAuth
  authentication code grant flow. For more information, see
  [Setting up OAuth](https://other-docs.snowflake.com/en/connectors/servicenow/installing-sql#setting-up-oauth). If the connector is
  already configured with another OAuth flow and then you configure it to use the client credentials grant flow, we recommend that you perform the following tasks, if possible:

  > + Recreate the secret and security integration objects to use client credentials. For instructions, see
  >   [Creating a security integration and Creating a secret object](https://other-docs.snowflake.com/en/connectors/servicenow/installing-sql#creating-a-security-integration-optional).
  > + Update the connection to ServiceNow instance to use new credentials. For more information, see
  >   [Updating the connection to ServiceNow® instance](https://other-docs.snowflake.com/en/connectors/servicenow/managing#updating-the-connection-to-servicenow-instance).
* Added a new config parameter to the `RELOAD_TABLE` procedure. This parameter allows you to reload specific records in a table instead of the
  whole table. For details, see [Filtered reload](https://other-docs.snowflake.com/en/connectors/servicenow/ingestion#filtered-reload).
* In views containing reference fields, columns with the `__DISPLAY_VALUE` suffix that contain data for reference fields now display the
  most recent data. Previously, these columns always returned the display value for the ingested raw
  value from the same table. To enable this feature, including in existing views, call the `CREATE_VIEW_WITH_DISPLAY_VALUES` stored procedure.
  For more information, see [Creating a View Containing Reference Fields](https://other-docs.snowflake.com/connectors/servicenow/accessing-data#creating-a-view-containing-reference-fields).

### Bug fixes

* Improved the performance of the initial test request when a new table is enabled for ingestion.
* Improved error handling when the returned error code is in a different format than expected.

## Version 5.17.1 (Feb 7, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Fixed an issue where the reference columns in flattened views displayed incomplete data when the view contained data from a table with
`fetch_display_values` enabled.

## Version 5.17.0 (Jan 31, 2025)

### Behavior changes

The flattened views now always display columns in alphabetical order. Previously, these views sometimes displayed columns in random
order.

### New features

Not applicable.

### Bug fixes

* Fixed an issue where data included in a view would shift between columns when the view contained reference fields.
* Fixed an issue where flattened views weren’t recreated correctly.
* For tables with `fetch_display_values` enabled, fixed an issue where the connector only retrieved a single page of up to 10,000 records
  for a table before the ingestion process stopped. However, you must reload these tables to apply the fix to them, including tables
  with `fetch_display_values` enabled through the global connector settings. For instructions on how to reload a table, see
  [Reloading Data in a Table](https://other-docs.snowflake.com/connectors/servicenow/ingestion#reloading-data-in-a-table).

## Version 5.16.1 (Jan 24, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Fixed an issue where calling the `CONFIGURE_DISPLAY_VALUE_FETCHING` stored procedure would fail to configure the default display values for
tables and cause the Snowflake Connector for ServiceNow® V2 to stop responding.

## Version 5.16.0 (Jan 15, 2025)

### Behavior changes

Not applicable.

### New features

* A new `CONFIGURE_DISPLAY_VALUE_FETCHING` procedure was added. It is used to set global, default configuration for handling display values.
  Display value synchronization can also be configured on the table level, using the `ENABLE_TABLE` procedure.
* Data with resolved display values can now be fetched, instead of only raw data.

### Bug fixes

* Fixes for the connector state export process.
* Improved handling of DNS errors.
* `CREATE_VIEW_WITH_DISPLAY_VALUES` and `ENABLE_REFERENCED_TABLES` procedures now handle included columns configuration.

## Version 5.15.2 (Jan 7, 2025)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* The connector now handles an exception when a table that is getting exported has incomplete configuration.
* The upgrade process no longer fails if the `GET_TROUBLESHOOTING_DATA` procedure doesn’t get created.
* The connector no longer fails when an internal state snapshot isn’t created because of its size.

## Version 5.15.1 (Dec 6, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Added migration to support old sync states in data export.

## Version 5.15.0 (Dec 3, 2024)

### Behavior changes

* The export process to store the connector internal state changed.

  In addition to storing metadata in the `__CONNECTOR_STATE_EXPORT` table, the data is also split into multiple tables with a `_SFSDKEXPORT_V1` suffix.

### New features

* Snowflake Connector for ServiceNow® V2 now supports disaster recovery in another region.
* Added support for configuring deletion synchronization at the table level using the `ENABLE_TABLE` procedure.

  For more information on using the `ENABLE_TABLE` procedure, see
  [Enabling a single table using custom configuration](https://other-docs.snowflake.com/en/connectors/servicenow/v2/ingestion#label-servicenow-connector-configure-custom-configuration-v2).

### Bug fixes

* Unexpected responses from the ServiceNow API are now correctly handled in the procedures such as `CHECK_ROW_COUNT`.

## Version 5.14 (Nov 18, 2024)

### Behavior changes

* Event sharing is now mandatory for new installations.

### New features

* You can now set a specified table page size with the `RESET_PAGE_SIZE` procedure instead of using the default connector’s value.
* If the connector’s default page size was set to an invalid value, the connector will use the recommended value of 10,000.

### Bug fixes

* Ingestion fails when a worker task reaches API timeout when discovering the initial table page size.

## Version 5.13 (Oct 29, 2024)

### Behavior changes

Not applicable.

### New features

* Add timeout on establishing the http connection.

### Bug fixes

Not applicable.

## Version 5.12 (Oct 16, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Incremental updates no longer fail if Snowflake doesn’t receive the timestamp of the newest record on the ingested table.

## Version 5.11.1 (Oct 8, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Incremental updates no longer fail when the event log table is empty.
* Incremental ingestion no longer fails when a fetched batch is empty due to having out-of-date
  rows during record updates from the source.

## Version 5.11.0 (Oct 7, 2024)

### Behavior changes

Modified the ServiceNow API request sorting rules applied during incremental updates to eliminate data loss while reading data from multiple read replicas.

### New features

Not applicable.

### Bug fixes

Page size is no longer reduced when the ServiceNow instance is not reachable.

## Version 5.10.1 (Sep 6, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fixed configuration validation in the `UPDATE_CONNECTION_CONFIGURATION` procedure.

## Version 5.10.0 (Aug 30, 2024)

### Behavior changes

* A request for the most recent timestamp is added at the beginning of updates and deletes.

### New features

* The `UPDATE_CONNECTION_CONFIGURATION` procedure is added. This procedure allows to change External Access Integration and Secret objects used by the connector.
* User Agent header in connector HTTP requests is now set to `snowflake-connector-for-service-now`.

### Bug fixes

* Handle HTTP Client timeout errors gracefully.

  Reduce page size on such an error.
* ServiceNow® and Snowflake time differences no longer cause data to be lost.

## Version 5.9.1 (Aug 14, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Migration script fix for certain users.

## Version 5.9.0 (Aug 8, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

* Fix `RELOAD_TABLE` procedure when both `row_filter` and `data_range_start_time` are set.
  Previously row filtering sync states were not cleaned up correctly.
* Improve error handling in the data ingestion process when the connector is not able to overcome
  errors related to authentication. In such cases, the connector should now be able to
  detect the error earlier and stop the ingestion process.

## Version 5.8.0 (Jul 23, 2024)

### Behavior changes

Not applicable.

### New features

* The `row_filter` field in `ENABLE_TABLE` procedure now accepts arbitrary number of whitespace characters
  in filtering expression rather than allowing only single space between expression elements.

  For more information see [Enabling a single table using custom configuration](https://other-docs.snowflake.com/en/connectors/servicenow/v2/ingestion#label-servicenow-connector-configure-custom-configuration-v2).

### Bug fixes

* During table reload row filter and column filtering now taken into account.
* Row filter now works as expected for tables without a `sys_updated_on` column

## Version 5.7.0 (Jul 11, 2024)

### Behavior changes

Not applicable.

### New features

Procedures CHECK_ROW_COUNT, ENABLE_TABLE (without custom configuration parameters) and
SHOW_REFERENCES_OF_TABLE can now be called in a user-owned task.

### Bug fixes

Not applicable.

## Version 5.6.0 (Jul 5, 2024)

### Behavior changes

Not applicable.

### New features

Row filtering is now available. Row filtering supports the filtering of ingested table rows based on
conditions evaluated against table columns.
The row filtering condition is set using the `ENABLE_TABLE` procedure.

For more information see Enabling a single table using custom configuration in
[Setting Up data ingestion for your ServiceNow® data](https://other-docs.snowflake.com/en/connectors/servicenow/v2/ingestion#label-servicenow-connector-configure-custom-configuration-v2).

### Bug fixes

Improve performance of the migration script from prior version.

## Version 5.5.1 (Jun 28, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Improve performance of the migration script from prior version.

## Version 5.5.0 (Jun 24, 2024)

### Behavior changes

Not applicable.

### New features

Add a default way to obtain the schema of a table when starting its
ingestion. This should help in a scenario where the connector couldn’t start to
ingest a table because of ACLs met on the first ingested row.

### Bug fixes

* Fix `RUN_HEALTHCHECK` as it sometimes could fail to send the connector’s status in a specific scenario.

## Version 5.4.0 (Jun 10, 2024)

### Behavior changes

Not applicable.

### New features

Change endpoint for fetching schema of the table. From version 5.4.0 and later, the `ADMIN` role
in ServiceNow® is no longer required to use `CREATE_VIEW_WITH_DISPLAY_VALUES`,
`SHOW_REFERENCES_OF_TABLE` and `ENABLE_TABLE` (when using column filtering)
procedures.

### Bug fixes

From version 5.4.0 and later, new event log table `DELETE` events include the `RAW` column, which is set
to a value from the newest update event instead of the first insert event.
Previously existing event log table events remain unchanged.

## Version 5.3.0 (May 17, 2024)

### Behavior changes

Not applicable.

### New features

Not applicable.

### Bug fixes

Fix handling the null value of the `journal_table` property in the object passed to
the `FINALIZE_CONNECTOR_CONFIGURATION` procedure. The `journal_table` parameter can now also be skipped.

## Version 5.2.0 (May 10, 2024)

### Behavior changes

Not applicable.

### New features

Add optional table_name and `sys_id` arguments to `FINALIZE_CONNECTOR_CONFIGURATION` to
help in journal table validation.

### Bug fixes

* Improve URL validation in `SET_CONNECTION_CONFIGURATION` to support custom ServiceNow® domains.

## Version 5.1.0 (Apr 29, 2024)

### Behavior changes

Not applicable.

### New features

`max_sys_created_on` argument in `CHECK_ROW_COUNT` procedure now defaults to `NULL`.

### Bug fixes

* Don’t start healthcheck reporting if the configuration hasn’t successfully completed.
* Fix `SHOW_REFERENCES_OF_TABLE` to include self-references of a given table in returned value.
* Fix `CREATE_VIEW_WITH_DISPLAY_VALUES` to handle situation when table references itself.

## Version 5.0.0 (Apr 23, 2024)

Initial release with version 5.0.0.

### Behavior changes

* External function making API calls to ServiceNow® are replaced with external access.
* Signatures and behavior of many procedures changed. Division of responsibility can be checked in the below table:

> |  |  |
> | --- | --- |
> | Prior procedure | New procedure |
> | `CONFIGURE_CONNECTOR` | Several specialized procedures `CONFIGURE_*`. |
> | `CONFIGURE_WAREHOUSE` | `UPDATE_WAREHOUSE` |
> | `STOP_CONNECTOR` | `PAUSE_CONNECTOR` |
> | `START_CONNECTOR` | Several procedures to install the app when using worksheets. |
> | `PREFILL_CONFIG_TABLE` | `GET_AVAILABLE_TABLES` |
> | `ENABLE_TABLE_WITH_COLUMNS` | `ENABLE_TABLE` |
> | `ENABLE_TABLES(VARCHAR, BOOLEAN)` | `ENABLE_TABLES(ARRAY), DISABLE_TABLES(ARRAY)` |
> | `TEST_SN_CONNECTION` | `TEST_CONNECTION` |
> | `CHECK_SN_ROW_COUNT` | `CHECK_ROW_COUNT` |
> | `GET_STATUS` |  |
> | `GET_CONNECTION_STATUS` |  |
> | `GET_VERSION` |  |
> | `RUN_UPGRADE` |  |

* Procedures return an object with `response_code` property. The procedure
  result with an optional error reason is displayed directly in the response.
* Signatures and behavior of several views changed. Division of responsibility
  can be checked in the below table:

  |  |  |
  | --- | --- |
  | Prior view | New view |
  | `ENABLED_TABLES` | `CONFIGURED_TABLES`, `TABLES_STATE` |
  | `CONNECTOR_RUNS_STATE` | Included in `GET_TROUBLESHOOTING_DATA` procedure. |
  | `CONNECTOR_STATS` | `AGGREGATED_CONNECTOR_STATS` |
  |  | `SYNC_STATUS` |

### New features

Not applicable.

### Bug fixes

Not applicable.

---
title: Snowflake Connector for SharePoint release notes
source: https://docs.snowflake.com/en/release-notes/connectors/sharepoint.md
section: Release Notes
---

# Snowflake Connector for SharePoint release notes

This topic provides release notes for the Snowflake Connector for SharePoint.

For additional information, see [About the Snowflake Connector for SharePoint](../../connectors/unstructured-data-connectors/sharepoint/about.md).

## Version 1.0.5 (December 9, 2024)

### Behavior changes

Not applicable

### New features

Not applicable

### Bug fixes

* Fixed an issue that was causing empty values to be returned in the `web_url` column in the Cortex Search service responses.

## Version 1.0.4 (December 6, 2024)

### Behavior changes

Not applicable

### New features

Not applicable

### Bug fixes

* During the data synchronization of Microsoft 365 groups, group members are now retrieved only once for each group.

## Version 1.0.3 (December 3, 2024)

### Behavior changes

* Added progress logs in the event table for the entire ingestion process.
* Unprocessed file updates and inserts are now visible through the PUBLIC.CONNECTOR_ERRORS view.

### New features

Not applicable

### Bug fixes

* Fixed internal table definitions that were causing connector application upgrade issues.
* Files without extensions no longer break the ingestion process.
* When upgrading the connector application, change tracking on connector tables is no longer disabled.
  We’ve also migrated broken Cortex Search indexes to make them refresh the data.

## Version 1.0.2 (November 15, 2024)

### Behavior changes

* You can now use a site URL with a custom domain.

### New features

Not applicable

### Bug fixes

* The data ingestion no longer continues if an error occurs at any step of ingestion.

## Version 1.0.1 (November 8, 2024)

### Behavior changes

Not applicable

### New features

Not applicable

### Bug fixes

* Fixed how the connector handles Sharepoint file deletions.

## Version 1.0.0 (November 8, 2024)

Initial release

---
title: Snowflake Connector for Spark release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector.md
section: Release Notes
---

# Snowflake Connector for Spark release notes

The Snowflake Connector for Spark release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2025 releases](spark-connector-2026.md)
* [2025 releases](spark-connector-2025.md)
* [2024 releases](spark-connector-2024.md)
* [2023 releases](spark-connector-2023.md)
* [2022 releases](spark-connector-2022.md)

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

---
title: Snowflake Connector for Spark release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector-2022.md
section: Release Notes
---

# Snowflake Connector for Spark release notes for 2022

This article contains the release notes for the Snowflake Connector for Spark, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Spark updates.

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

## Version 2.11.1 (December 13, 2022)

### New features

* Added support for AWS VPCE deployments by adding the S3_STAGE_VPCE_DNS_NAME configuration parameter to specifying the
  VPCE DNS name at the session level.
* Added a new configuration option treat_decimal_as_long to enable the Spark Connector to return Long values
  instead of `BigDecimal` values, if the query returns `Decimal(<any_precision>, 0)`. WARNING: If the value is
  greater than the maximum value of `Long`, an error will be raised.
* Added a new option proxy_protocol for specifying the proxy protocol (http or https) with AWS deployments.
  (The option has no effect on Azure and GCP deployments.).
* Added support for counting rows in a table where the row count is greater than the maximum value of Integer.
* Updated the connector to use the Snowflake JDBC driver 3.13.24.

### Bug fixes

* Updated the connector to close JDBC connections to avoid connection leakage.
* Fixed a `NullPointerException` issue when sending telemetry messages.

## Version 2.11.0 (September 2, 2022)

Compatible JDBC Driver version: 3.13.22

* Added support for Spark 3.3 and fixed some bugs:

  + Upgraded the version of the PostgreSQL JDBC Driver that tests depend on to avoid the security
    vulnerability [CVE-2022-31197](https://github.com/advisories/GHSA-r38f-c4h4-hqq2).
  + Updated the connector to use the Snowflake JDBC driver 3.13.22 and the Snowflake Ingest SDK 0.10.8.

> **Note:**
>
> * Starting from version 2.11.0, the Snowflake Connector for Spark supports Spark 3.1, 3.2 and 3.3.
>   Version 2.11.0 of the Snowflake Connector for Spark does not support Spark 3.0. Note that previous versions of the connector continue to support Spark 3.0.
> * For Snowflake GCP accounts, the Snowflake JDBC driver versions 3.13.16 through 3.13.21 do not work with the Spark connector.

## Version 2.10.1 (August 15, 2022)

Compatible JDBC Driver version: 3.13.14

### Bug fixes

* Removed unnecessary dependencies on libraries to avoid the security vulnerabilities
  [CVE-2020-8908](https://github.com/advisories/GHSA-5mg8-w23w-74h3) and [CVE-2018-10237](https://github.com/advisories/GHSA-mvr2-9pj6-7w5j).
* Added support for using the JDBC data type `TIMESTAMP_WITH_TIMEZONE` when reading data from Snowflake.
* Changed the logic for checking for the existence of a table before saving a DataFrame to Snowflake:

  + The connector now reuses the existing connection (rather than creating a new connection) to avoid potential problems with token expiration.
  + If the table name is not fully qualified (i.e. does not include the schema name), the connector now checks for the table under the schema specified by sfSchema, rather than the schema that is currently in use in the session.

    > **Note:**
    >
    > If you need to save a DataFrame to a table in a schema other than `sfSchema`, specify the schema as part of
    > the fully qualified name of the table, rather than executing USE SCHEMA to change the current schema.
* Improved performance by avoiding unnecessary `parse_json()` calls in the COPY INTO TABLE command when writing a
  DataFrame with `ArrayType`, `MapType`, or `StructType` columns to Snowflake.
* Added the `getLastSelectQueryId` and `getLastCopyLoadQueryId` methods to the `Utils` class. These methods
  return the query ID of the last query that read data from Snowflake and the last COPY INTO TABLE statement that
  was executed (respectively).

## Version 2.10.0 (February 17, 2022)

Compatible JDBC Driver version: 3.13.14

### Behavior change

* Added support for Spark, version 3.2. Beginning with this release, the Snowflake Connector for Spark only supports Spark 3.0, 3.1 and 3.2. Spark version 2.4 is no longer supported.

### Bug fix

* Fixed an issue where string “null” is regarded as type `NULL`.

---
title: Snowflake Connector for Spark release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector-2023.md
section: Release Notes
---

# Snowflake Connector for Spark release notes for 2023

This article contains the release notes for the Snowflake Connector for Spark, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Spark updates.

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

## Version 2.12.0 (May 23, 2023)

> **Note:**
>
> Starting with this version (2.12.0), the Snowflake Connector for Spark no longer supports Spark 3.1, but continues to support versions 3.2, 3.3, and 3.4. Previous versions of the connector continue to support Spark 3.1.

### New features

* Added support for Spark 3.4.
* Built and tested with the Snowflake JDBC driver, version 3.13.30.

### Bug fixes

* None.

## Version 2.11.3 (April 21, 2023)

### New features

* Updated the mechanism for writing DataFrames to accounts on GCP. After December 2023, previous versions of the Spark Connector will no longer be able to write DataFrames, due to changes in GCP.
* Added the option to disable `preactions` and `postactions` validation for session sharing.

  To disable validation, set the option `FORCE_SKIP_PRE_POST_ACTION_CHECK_FOR_SHARED_SESSION` to `true`. The default is `false`.

  > **Important:**
  >
  > Before setting this option, make sure that the queries in `preactions` and `postactions` don’t affect the session settings. Otherwise, you might encounter issues with results.

### Bug fixes

* Fixed an issue when performing a join or union across different schemas when the two DataFrames are accessing
* tables with different `sfSchema` and the same name table in `sfSchema` is in the left `DataFrame`.

## Version 2.11.2 (March 21, 2023)

### New features

* Added support for sharing a JDBC connection.

  The Snowflake Connector for Spark can now use the same JDBC connection for different jobs and actions when the
  client uses the same connection options to access Snowflake. Previously, the Spark Connector created a new
  JDBC connection for each job or action.

  The Spark Connector supports the following options and API methods for enabling and disabling this feature:

  + To specify that the connector should not use the same JDBC connection, set the `support_share_connection` connector
    option to `false`. (The default value is `true`, which means that the feature is enabled.)
  + To enable or disable the feature programmatically, call one of the following global static functions:
    `SparkConnectorContext.disableSharedConnection()` and `SparkConnectorContext.enableSharingJDBCConnection()`.
  > **Note:**
  >
  > In the following special cases, the Spark Connector will not use the shared connection:
  >
  > + If `preactions` or `postactions` are set, and those `preactions` or `postactions` are not CREATE TABLE,
  >   DROP TABLE, or MERGE INTO, the Spark Connector will not use the shared connection.
  > + Utility functions in `Utils`, such as `Utils.runQuery()` and `Utils.getJDBCConnection()`, will not use the
  >   shared connection.
* Updated the connector to use the Snowflake JDBC driver 3.13.29.

### Bug fixes

* None.

---
title: Snowflake Connector for Spark release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector-2024.md
section: Release Notes
---

# Snowflake Connector for Spark release notes for 2024

This article contains the release notes for the Snowflake Connector for Spark, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Spark updates.

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

## Version 3.0.0 (July 31, 2024)

### BCR (Behavior Change Release) changes

Beginning with version 3.0.0, the Snowflake Connector for Spark introduced the following breaking changes:

* Removed the Advanced Query Pushdown feature.

  Alternatives to the feature are available. For example, instead of loading data from Snowflake tables, users can directly load data from Snowflake SQL queries.

  Snowflake plans to introduce a tool to convert DataFrames between Spark and Snowpark in a future Snowflake Connector for Spark release.
* Each release now includes one artifact instead of multiple artifacts for different Spark versions.

  The single artifact works with multiple Spark versions. Currently, Snowflake verified support for Spark 3.4 and 3.5 in the Snowflake Connector for Spark 3.0.0 version.

Per Snowflake’s support policy, Snowflake will continue to support Spark 2.x.x versions for up to two years.

### New features

* Upgraded JDBC to 3.17.0 to Support LOB.
* Added support for Spark 3.5.0.

### Bug fixes

* Removed the requirement of the `SFUSER` parameter when using OAUTH.

## Version 2.16.0 (June 10, 2024)

### New features

* Upgraded JDBC to version 3.16.1.
* Improved legacy Spark streaming code.
* Disabled the `abort_detached_query` parameter at the session level by default.

### Bug fixes

* Fixed an issue with the proxy protocol that incorrectly impacted S3 protocols.

## Version 2.15.0 (February 26, 2024)

### New features

* Introduced a new `trim_space` parameter that you can use to trim values of `StringType` columns automatically when saving to a Snowflake table. Default: `false`.

### Bug fixes

* Fixed an issue that caused a “cancelled queries can be restarted in the Spark retries after application closed” message.

---
title: Snowflake Connector for Spark release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector-2025.md
section: Release Notes
---

# Snowflake Connector for Spark release notes for 2025

This article contains the release notes for the Snowflake Connector for Spark, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Spark updates.

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

## Version 3.1.2 (June 03, 2025)

### New features

* Upgraded JDBC to version 3.24.2 to incorporate a bug fix for the Java TrustManager.
* Upgraded the parquet-avro library to mitigate security vulnerabilities.

### Bug fixes

* None.

---
title: Snowflake Connector for Spark release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/spark-connector-2026.md
section: Release Notes
---

# Snowflake Connector for Spark release notes for 2026

This article contains the release notes for the Snowflake Connector for Spark, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowflake Connector for Spark updates.

See [Snowflake Connector for Spark](../../user-guide/spark-connector.md) for documentation.

## Version 3.1.8 (March 13, 2026)

### New features

* None

### Bug fixes

* Fixed the `UnsupportedOperationException: Unexpected type: NullType` error that was raised when writing DataFrames with structured columns (StructType) containing all null values via the Parquet write path.

---
title: Snowflake connector, driver, and library monthly releases
source: https://docs.snowflake.com/en/release-notes/clients-drivers/monthly-releases.md
section: Release Notes
---

# Snowflake connector, driver, and library monthly releases

This topic provides a monthly list of the connector, driver, and library releases and includes links to the release
notes for each. For each client, the monthly table lists the version number and date of the latest release.
A TBD indicates that a new version has not yet been released for a client during the month, but does not preclude a
release later in the month. A TBD in a previous month indicates that a client did not release an update in that month.

Snowflake uses semantic versioning for client and driver updates, excluding Snowpark APIs.

> **Note:**
>
> Starting with January 2023, the **BCR?** column indicates whether a version contains a change that might break applications built on earlier versions.

To determine the latest minimum versions for the clients and drivers, refer to the [Client versions & support policy](../requirements.md) topic.

> **Tip:**
>
> To view a list of release note announcements, filtered by date and release type, see
> [All release notes](/release-notes/all-release-notes).

## April 2026 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.5.0 | 13-Apr-2026 |  |
| [Go Snowflake Driver](golang.md) | 2.0.1 | 08-Apr-2026 |  |
|  | 1.9.1 | 08-Apr-2026 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 4.1.0 | 08-Apr-2026 |  |
| [Node.js Driver](nodejs.md) | 2.4.0 | 07-Apr-2026 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.5.0 | 16-Apr-2026 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## March 2026 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 2.0.0 | 03-Mar-2026 | **Y** |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 4.0.2 | 12-Mar-2026 |  |
| [Node.js Driver](nodejs.md) | 2.3.6 | 25-Mar-2026 |  |
|  | 2.3.5 | 17-Mar-2026 |  |
| [ODBC Driver](odbc.md) | 3.16.0 | 11-Mar-2026 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.6.0 | 05-Mar-2026 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.16.0 | 19-Mar-2026 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.4.0 | 25-Mar-2026 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 3.1.8 | 13-Mar-2026 |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.47.0 | 05-Mar-2026 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.17.0 | 13-Mar-2026 |  |
|  | 1.16.0 | 12-Mar-2026 |  |
|  | 1.15.0 | 06-Mar-2026 |  |
| [Snowpark ML](snowpark-ml.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.9.0 | 04-Mar-2026 |  |

## February 2026 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.4.1 | 17-Feb-2026 |  |
|  | 5.4.0 | 05-Feb-2026 |  |
| [Go Snowflake Driver](golang.md) | 1.19.0 | 03-Feb-2026 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 4.0.1 | 09-Feb-2026 |  |
| [Node.js Driver](nodejs.md) | 2.3.4 | 09-Feb-2026 |  |
| [ODBC Driver](odbc.md) | 3.15.0 | 09-Feb-2026 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.5.0 | 03-Feb-2026 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.15.0 | 03-Feb-2026 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.3.0 | 12-Feb-2026 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | 1.2.0 | 16-Feb-2026 |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.12.0 | 12-Feb-2026 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.46.0 | 25-Feb-2026 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.14.0 | 19-Feb-2026 |  |
|  | 1.13.0 | 13-Feb-2026 |  |
| [Snowpark ML](snowpark-ml.md) | 1.27.0 | 12-Feb-2026 |  |
|  | 1.26.0 | 05-Feb-2026 |  |
|  | 1.25.1 | 03-Feb-2026 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## January 2026 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.3.0 | 07-Jan-2026 |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.4.2 | 12-Jan-2026 |  |
| [JDBC Driver](jdbc.md) | 4.0.0 | 27-Jan-2026 | **Y** |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | 3.14.0 | 12-Jan-2026 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.2.0 | 07-Jan-2026 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | 1.1.2 | 20-Jan-2026 |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.11.0 | 21-Jan-2026 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.45.0 | 26-Jan-2026 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.11.0 | 28-Jan-2026 |  |
|  | 1.10.0 | 22-Jan-2026 |  |
|  | 1.9.0 | 14-Jan-2026 |  |
|  | 1.8.0 | 07-Jan-2026 |  |
| [Snowpark ML](snowpark-ml.md) | 1.25.0 | 28-Jan-2026 |  |
|  | 1.24.0 | 22-Jan-2026 |  |
|  | 1.23.0 | 15-Jan-2026 |  |
|  | 1.22.0 | 09-Jan-2026 |  |
|  | 1.21.0 | 05-Jan-2026 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## December 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.2.1 | 10-Dec-2025 |  |
|  | 5.2.0 | 03-Dec-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.18.1 | 15-Dec-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.28.0 | 15-Dec-2025 |  |
| [Node.js Driver](nodejs.md) | 2.3.3 | 11-Dec-2025 |  |
|  | 2.3.2 | 08-Dec-2025 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.4.0 | 03-Dec-2025 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.14.0 | 09-Dec-2025 |  |
|  | 3.13.1 | 02-Dec-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.1.1 | 02-Dec-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.10.0 | 08-Dec-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.44.0 | 15-Dec-2025 |  |
|  | 1.43.0 | 03-Dec-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.18.0 | 05-Dec-2025 |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.7.0 | 18-Dec-2025 |  |
|  | 1.6.0 | 12-Dec-2025 |  |
|  | 1.5.0 | 04-Dec-2025 |  |
| [Snowpark ML](snowpark-ml.md) | 1.20.0 | 02-Dec-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.8.2 | 10-Dec-2025 |  |
|  | 1.8.0 | 04-Dec-2025 |  |

## November 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.1.0 | 04-Nov-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.18.0 | 20-Nov-2025 |  |
|  | 1.17.1 | 04-Nov-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.4.0 | 19-Nov-2025 |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | 3.13.0 | 20-Nov-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.13.0 | 03-Nov-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.1.0 | 13-Nov-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.9.0 | 13-Nov-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.17.0 | 10-Nov-2025 |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.4.0 | 25-Nov-2025 |  |
|  | 1.3.0 | 19-Nov-2025 |  |
|  | 1.2.0 | 17-Nov-2025 |  |
| [Snowpark ML](snowpark-ml.md) | 1.19.0 | 13-Nov-2025 |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | 1.1.0 | 05-Nov-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## October 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 5.0.0 | 16-Oct-2025 | Y |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.3.1 | 08-Oct-2025 |  |
| [JDBC Driver](jdbc.md) | 3.27.1 | 30-Oct-2025 |  |
|  | 3.27.0 | 06-Oct-2025 |  |
| [Node.js Driver](nodejs.md) | 3.2.1 | 09-Oct-2025 |  |
| [ODBC Driver](odbc.md) | 3.11.1 | 21-Oct-2025 |  |
|  | 3.10.1 | 21-Oct-2025 |  |
|  | 3.09.1 | 20-Oct-2025 |  |
|  | 3.08.1 | 20-Oct-2025 |  |
|  | 3.07.1 | 20-Oct-2025 |  |
|  | 3.12.0 | 14-Oct-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 4.0.0 | 09-Oct-2025 | Y |
|  | 3.18.0 | 06-Oct-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.42.0 | 29-Oct-2025 |  |
|  | 1.41.0 | 23-Oct-2025 |  |
|  | 1.40.0 | 05-Oct-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark Connect for Spark](snowpark-connect.md) | 1.0.0 | 28-Oct-2025 |  |
|  | 0.33.0 | 23-Oct-2025 |  |
|  | 0.32.0 | 17-Oct-2025 |  |
|  | 0.31.0 | 09-Oct-2025 |  |
| [Snowpark ML](snowpark-ml.md) | 1.18.0 | 23-Oct-2025 |  |
|  | 1.17.0 | 20-Oct-2025 |  |
|  | 1.16.0 | 60-Oct-2025 |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | 1.0.2 | 10-Oct-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## September 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 1.17.0 | 29-Sep-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | 2.3.0 | 30-Sep-2025 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.12.0 | 24-Sep-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.17.4 | 22-Sep-2025 |  |
|  | 3.17.3 | 03-Sep-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.8.0 | 22-Sep-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.39.1 | 25-Sep-2025 |  |
|  | 1.39.0 | 17-Sep-2025 |  |
|  | 1.38.0 | 04-Sep-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.15.0 | 29-Sep-2025 |  |
|  | 1.14.0 | 18-Sep-2025 |  |
|  | 1.13.0 | 11-Sep-2025 |  |
|  | 1.12.0 | 04-Sep-2025 |  |
| [Snowpipe Streaming SDK](snowpipe-streaming-sdk.md) | 1.0.1 | 22-Sep-2025 |  |
|  | 1.0.0 | 19-Sep-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.7 | 09-Sep-2025 |  |

## August 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.8.0 | 13-Aug-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.16.0 | 14-Aug-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.3.0 | 21-Aug-2025 |  |
| [JDBC Driver](jdbc.md) | 3.26.1 | 29-Aug-2025 |  |
|  | 3.26.0 | 13-Aug-2025 |  |
| [Node.js Driver](nodejs.md) | 2.2.0 | 13-Aug-2025 |  |
| [ODBC Driver](odbc.md) | 3.11.0 | 13-Aug-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.3.0 | 27-Aug-2025 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.11.0 | 25-Aug-2025 |  |
|  | 3.10.1 | 15-Aug-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.3.0 | 26-Aug-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.17.2 | 20-Aug-2025 |  |
|  | 3.17.1 | 14-Aug-2025 |  |
|  | 3.17.0 | 13-Aug-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.37.0 | 18-Aug-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.11.0 | 12-Aug-2025 |  |
|  | 1.10.0 | 01-Aug-2025 |  |
| [SnowSQL](snowsql.md) | 1.4.5 | 14-Aug-2025 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## July 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.7.0 | 01-Jul-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.15.0 | 01-Jul-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.25.1 | 21-Jul-2025 |  |
|  | 3.25.0 | 09-Jul-2025 |  |
| [Node.js Driver](nodejs.md) | 2.1.3 | 21-Jul-2025 |  |
|  | 2.1.2 | 10-Jul-2025 |  |
|  | 2.1.1 | 03-Jul-2025 |  |
| [ODBC Driver](odbc.md) | 3.10.0 | 07-Jul-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.10.0 | 17-Jul-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.2.4 | 31-Jul-2025 |  |
|  | 3.2.3 | 14-Jul-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.16.0 | 01-Jul-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.7.0 | 31-Jul-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.36.0 | 28-Jul-2025 |  |
|  | 1.35.0 | 24-Jul-2025 |  |
|  | 1.34.0 | 14-Jul-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.9.2 | 28-Jul-2025 |  |
|  | 1.9.1 | 18-Jul-2025 |  |
| [SnowSQL](snowsql.md) | 1.4.4 | 30-Jul-2025 |  |
|  | 1.4.3 | 10-Jul-2025 |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.6 | 10-Jul-2025 |  |

## June 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.6.0 | 18-Jun-2025 |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.1.0 | 11-Jun-2025 | **Y** |
| [JDBC Driver](jdbc.md) | 3.18.1 | 05-Jun-2025 |  |
|  | 3.17.1 | 05-Jun-2025 |  |
|  | 3.21.1 | 04-Jun-2025 |  |
|  | 3.20.1 | 04-Jun-2025 |  |
|  | 3.22.1 | 03-Jun-2025 |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | 3.9.0 | 12-Jun-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.9.1 | 09-Jun-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.2.2 | 26-Jun-2025 |  |
|  | 3.2.1 | 02-Jun-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | 3.1.2 | 03-Jun-2025 |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.6.0 | 26-Jun-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.33.0 | 19-Jun-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.16.0 | 30-Jun-2025 |  |
| [Snowpark ML](snowpark-ml.md) | 1.9.0 | 25-Jun-2025 |  |
|  | 1.8.6 | 18-Jun-2025 |  |
| [SnowSQL](snowsql.md) | 1.4.2 | 24-Jun-2025 |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.4 | 10-Jun-2025 |  |

## May 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.5.0 | 09-May-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.14.1 | 28-May-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.24.2 | 31-May-2025 |  |
|  | 3.24.1 | 28-May-2025 |  |
| [Node.js Driver](nodejs.md) | 2.1.0 | 11-May-2025 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.2.0 | 20-May-2025 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.9.0 | 29-May-2025 |  |
|  | 3.8.3 | 22-May-2025 |  |
|  | 3.8.2 | 21-May-2025 |  |
|  | 3.8.1 | 20-May-2025 |  |
|  | 3.8.0 | 16-May-2025 |  |
|  | 3.7.2 | 12-May-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.5.1 | 28-May-2025 |  |
|  | 1.5.0 | 14-May-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.32.0 | 15-May-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.8.5 | 27-May-2025 |  |
|  | 1.8.4 | 12-May-2025 |  |
| [SnowSQL](snowsql.md) | 1.4.1 | 29-May-2025 |  |
|  | 1.4.0 | 22-May-2025 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## April 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.4.1 | 28-Apr-2025 |  |
|  | 4.4.0 | 10-Apr-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.14.0 | 30-Apr-2025 |  |
|  | 1.13.3 | 28-Apr-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 4.0.0 | 14-Apr-2025 | **Y** |
| [JDBC Driver](jdbc.md) | 3.24.0 | 30-Apr-2025 |  |
|  | 3.23.2 | 03-Apr-2025 |  |
| [Node.js Driver](nodejs.md) | 2.0.4 | 28-Apr-2025 |  |
| [ODBC Driver](odbc.md) | 3.8.0 | 30-Apr-2025 |  |
|  | 3.7.0 | 14-Apr-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.7.1 | 28-Apr-2025 |  |
|  | 3.7.0 | 16-Apr-2025 |  |
|  | 3.6.0 | 02-Apr-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.2.0 | 28-Apr-2025 |  |
|  | 3.1.3 | 07-Apr-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.15.0 | 28-Apr-2025 |  |
|  | 3.14.1 | 22-Apr-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.4.0 | 23-Apr-2025 |  |
|  | 1.3.0 | 09-Apr-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.31.0 | 24-Apr-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.8.3 | 28-Apr-2025 |  |
|  | 1.8.2 | 15-Apr-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## March 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 1.13.2 | 31-Mar-2025 |  |
|  | 1.13.1 | 04-Mar-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 3.1.2 | 17-Mar-2025 |  |
| [JDBC Driver](jdbc.md) | 3.23.1 | 13-Mar-2025 |  |
| [Node.js Driver](nodejs.md) | 2.0.3 | 13-Mar-2025 |  |
| [ODBC Driver](odbc.md) | 3.6.0 | 08-Mar-2025 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.5.0 | 10-March-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.1.2 | 18-Mar-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.14.0 | 03-Mar-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.2.0 | 26-Mar-2025 |  |
|  | 1.1.0 | 12-Mar-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.30.0 | 28-Mar-2025 |  |
|  | 1.29.1 | 12-Mar-2025 |  |
|  | 1.29.0 | 06-Mar-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) 1.8.1 | 26-Mar-2025 |  |  |
|  | 1.8.0 | 19-Mar-2025 |  |
|  | 1.7.5 | 06-Mar-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## February 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 3.1.1 | 27-Feb-2025 |  |
| [JDBC Driver](jdbc.md) | 3.23.0 | 27-Feb-2025 |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.4.0 | 13-Feb-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.1.1 | 26-Feb-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.0.5 | 19-Feb-2025 |  |
|  | 1.0.4 | 13-Feb-2025 |  |
|  | 1.0.3 | 04-Feb-2025 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.28.0 | 20-Feb-2025 |  |
|  | 1.27.0 | 05-Feb-2025 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.3.3 | 05-Feb-2025 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## January 2025 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.3.0 | 29-Jan-2025 |  |
| [Go Snowflake Driver](golang.md) | 1.13.0 | 29-Jan-2025 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.22.0 | 29-Jan-2025 |  |
| [Node.js Driver](nodejs.md) | 2.0.2 | 29-Jan-2025 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.1.0 | 29-Jan-2025 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.3.0 | 21-Jan-2025 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.1.0 | 21-Jan-2025 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.13.2 | 30-Jan-2025 |  |
|  | 3.13.1 | 29-Jan-2025 |  |
|  | 3.13.0 | 23-Jan-2025 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.7.4 | 28-Jan-2025 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.3 | 14-Jan-2025 |  |

## December 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 1.12.1 | 05-Dec-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 3.0.1 | 04-Dec-2024 |  |
| [JDBC Driver](jdbc.md) | 3.21.0 | 11-Dec-2024 |  |
| [Node.js Driver](nodejs.md) | 2.0.1 | 13-Dec-2024 |  |
|  | 2.0.0 | 11-Dec-2024 | **Y** |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.2.2 | 13-Dec-2024 |  |
|  | 3.2.1 | 03-Dec-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 3.0.0 | 10-Dec-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.12.4 | 03-Dec-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.26.0 | 05-Dec-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.15.0 | 18-Dec-2024 |  |
| [Snowpark ML](snowpark-ml.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.2 | 17-Dec-2024 |  |
|  | 1.7.1 | 02-Dec-2024 |  |

## November 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.2.0 | 05-Nov-2024 |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 3.0.0 | 12-Nov-2024 |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | 1.15.0 | 07-Nov-2024 |  |
| [ODBC Driver](odbc.md) | 3.5.0 | 04-Nov-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 3.2.0 | 25-Nov-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.0.2 | 13-Nov-2024 |  |
|  | 1.0.1 | 11-Nov-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.25.0 | 13-Nov-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.7.2 | 21-Nov-2024 |  |
|  | 1.7.1 | 05-Nov-2024 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.7.0 | 21-Nov-2024 |  |

## October 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 1.12.0 | 30-Oct-2024 |  |
|  | 1.11.2 | 03-Oct-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.3.0 | 11-Oct-2024 | **Y** |
| [JDBC Driver](jdbc.md) | 3.20.0 | 30-Oct-2024 |  |
|  | 3.19.1 | 25-Oct-2024 |  |
| [Node.js Driver](nodejs.md) | 1.14.0 | 02-Oct-2024 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.0.3 | 30-Oct-2024 |  |
| [Snowflake CLI](snowflake-cli.md) | 3.1.0 | 25-Oct-2024 |  |
|  | 3.0.2 | 15-Oct-2024 |  |
|  | 3.0.1 | 08-Oct-2024 |  |
|  | 2.8.2 | 08-Oct-2024 |  |
|  | 3.0.0 | 01-Oct-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.5.0 | 31-Oct-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.12.3 | 24-Oct-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 1.0.0 | 22-Oct-2024 |  |
|  | 0.13.1 | 11-Oct-2024 |  |
|  | 0.13.0 | 04-Oct-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.24.0 | 28-Oct-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.7.0 | 22-Oct-2024 |  |
|  | 1.6.4 | 17-Oct-2024 |  |
|  | 1.6.3 | 07-Oct-2024 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## September 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.2.2 | 12-Sep-2024 |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | 1.13.1 | 04-Sep-2024 |  |
|  | 1.13.0 | 03-Sep-2024 |  |
| [ODBC Driver](odbc.md) | 3.4.1 | 03-Sep-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 2.8.1 | 10-Sep-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.4.1 | 19-Sep-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.12.2 | 11-Sep-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.22.1 | 11-Sep-2024 |  |
|  | 1.22.0 | 10-Sep-2024 |  |
|  | 1.21.1 | 5-Sep-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.14.0 | 04-Sep-2024 |  |
| [Snowpark ML](snowpark-ml.md) | 1.6.2 | 04-Sep-2024 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## August 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.1.0 | 05-Aug-2024 |  |
| [Go Snowflake Driver](golang.md) | 1.11.1 | 29-Aug-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.2.0 | 09-Aug-2024 |  |
| [JDBC Driver](jdbc.md) | 3.19.0 | 29-Aug-2024 |  |
| [Node.js Driver](nodejs.md) | 1.12.0 | 05-Aug-2024 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.0.2 | 29-Aug-2024 |  |
| [Snowflake CLI](snowflake-cli.md) | 2.8.0 | 28-Aug-2024 |  |
|  | 2.7.0 | 02-Aug-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.4.0 | 15-Aug-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.12.1 | 20-Aug-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.12.1 | 29-Aug-2024 |  |
|  | 0.12.0 | 20-Aug-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.21.0 | 19-Aug-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.13.2 | 26-Aug-2024 |  |
|  | 1.13.1 | 21-Aug-2024 |  |
|  | 1.13.0 | 01-Aug-2024 |  |
| [Snowpark ML](snowpark-ml.md) | 1.6.1 | 13-Aug-2024 |  |
| [SnowSQL](snowsql.md) | 1.3.2 | 12-Aug-2024 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## July 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 4.0.0 | 08-Jul-2024 | **Y** |
| [Go Snowflake Driver](golang.md) | 1.11.0 | 31-Jul-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.1.2 | 29-Jul-2024 |  |
| [JDBC Driver](jdbc.md) | 3.18.0 | 24-Jul-2024 |  |
|  | 3.17.0 | 08-Jul-2024 |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | 3.4.0 | 29-Jul-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.0.1 | 24-Jul-2024 |  |
| [Snowflake CLI](snowflake-cli.md) | 2.6.1 | 15-Jul-2024 |  |
|  | 2.6.0 | 11-Jul-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.3.0 | 10-Jul-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.12.0 | 26-Jul-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 3.0.0 | 31-Jul-2024 |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.11.0 | 25-Jul-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.20.0 | 17-Jul-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.5.4 | 29-Jul-2024 | ✔ |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.6.1 | 09-Jul-2024 |  |
|  | 1.6.0 | 08-Jul-2024 |  |

## June 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | 3.3.2 | 24-Jun-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 3.0.0 | 18-Jun-2024 | **Y** |
| [Snowflake CLI](snowflake-cli.md) | 2.5.0 | 20-Jun-2024 |  |
|  | 2.4.1 | 12-Jun-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.11.0 | 18-Jun-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 2.16.0 | 10-Jun-2024 |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.10.0 | 24-Jun-2024 |  |
|  | 0.9.0 | 10-Jun-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.19.0 | 25-Jun-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.5.2 | 10-Jun-2024 |  |
|  | 1.5.3 | 17-Jun-2024 |  |
| [SnowSQL](snowsql.md) | 1.3.1 | 28-Jun-2024 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## May 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | 1.10.1 | 29-May-2024 |  |
|  | 1.10.0 | 08-May-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.1.1 | 09-May-2024 |  |
| [JDBC Driver](jdbc.md) | 3.16.1 | 27-May-2024 |  |
| [Node.js Driver](nodejs.md) | 1.11.0 | 28-May-2024 |  |
| [ODBC Driver](odbc.md) | 3.3.1 | 03-May-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 2.4.0 | 31-May-2024 |  |
|  | 2.3.1 | 20-May-2024 |  |
|  | 2.3.0 | 15-May-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.2.2 | 07-May-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.10.1 | 21-May-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.8.1 | 31-May-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.18.0 | 28-May-2024 |  |
|  | 1.17.0 | 21-May-2024 |  |
|  | 1.16.0 | 08-May-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.12.1 | 13-May-2024 |  |
| [Snowpark ML](snowpark-ml.md) | 1.5.1 | 22-May-2024 |  |
|  | 1.5.0 | 01-May-2024 |  |
| [SnowSQL](snowsql.md) | 1.3.0 | 02-May-2024 | **Y** |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## April 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.16.0 | 29-Apr-1024 |  |
|  | 3.15.1 | 05-Apr-1024 |  |
| [Node.js Driver](nodejs.md) | 1.10.1 | 08-Apr-2024 |  |
| [ODBC Driver](odbc.md) | 3.3.0 | 08-Apr-2024 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 2.0.3 | 29-Apr-2024 |  |
| [Snowflake CLI](snowflake-cli.md) | 2.2.0 | 25-Apr-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.10.0 | 29-Apr-2024 |  |
|  | 3.9.1 | 22-Apr-2024 |  |
|  | 3.9.0 | 18-Apr-2024 |  |
|  | 3.8.1 | 09-Apr-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.8.0 | 30-Apr-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.15.0 | 24-Apr-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.12.0 | 16-Apr-2024 |  |
|  | 1.11.0 | 01-Apr-2024 |  |
| [Snowpark ML](snowpark-ml.md) | 1.4.0 | 08-Apr-2024 | **Y** |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.5.3 | 16-Apr-2024 |  |
|  | 1.5.2 | 11-Apr-2024 |  |

## March 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 3.1.0 | 27-Mar-2024 |  |
| [Go Snowflake Driver](golang.md) | 1.9.0 | 28-Mar-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake CLI](snowflake-cli.md) | 2.1.2 | 27-Mar-2024 |  |
|  | 2.1.1 | 20-Mar-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.2.1 | 15-Mar-2024 |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.7.0 | 20-Mar-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.14.0 | 20-Mar-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.3.1 | 21-Mar-2024 | **Y** |
| [SnowSQL](snowsql.md) | 1.2.32 | 05-Mar-2024 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## February 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 3.0.0 | 29-Feb-2024 | **Y** |
| [Go Snowflake Driver](golang.md) | 1.8.0 | 21-Feb-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.1.0 | 28-Feb-2024 | **Y** |
| [JDBC Driver](jdbc.md) | 3.15.0 | 20-Feb-2024 |  |
| [Node.js Driver](nodejs.md) | 1.10.0 | 27-Feb-2024 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 2.0.2 | 22-Feb-2024 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.2.0 | 06-Feb-2024 | **Y** |
| [Snowflake Connector for Python](python-connector.md) | 3.7.1 | 22-Feb-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 2.15.0 | 26-Feb-2024 |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.6.0 | 06-Feb-2024 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.13.0 | 26-Feb-2024 |  |
|  | 1.12.1 | 8-Feb-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.10.0 | 9-Feb-2024 |  |
| [Snowpark ML](snowpark-ml.md) | 1.2.3 | 26-Feb-2024 |  |
|  | 1.2.2 | 13-Feb-2024 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## January 2024 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.2.0 | 17-Jan-2024 | **Y** |
| [Go Snowflake Driver](golang.md) | 1.7.2 | 17-Jan-2024 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.5 | 22-Jan-2024 |  |
| [JDBC Driver](jdbc.md) | 3.14.5 | 24-Jan-2024 |  |
| [Node.js Driver](nodejs.md) | 1.9.3 | 17-Jan-2024 |  |
| [ODBC Driver](odbc.md) | 3.2.0 | 19-Jan-2024 | **Y** |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.7.0 | 26-Jan-2024 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.12.0 | 29-Jan-2024 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.2.1 | 25-Jan-2024 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

## Older release notes

### December 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.1.5 | 18-Dec-2023 |  |
|  | 2.1.4 | 05-Dec-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.7.1 | 07-Dec-2023 |  |
| [JDBC Driver](jdbc.md) | 3.14.4 | 07-Dec-2023 | **Y** |
| [Node.js Driver](nodejs.md) | 1.9.2 | 07-Dec-2023 |  |
| [ODBC Driver](odbc.md) | 3.1.4 | 07-Dec-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.1.2 | 04-Dec-2023 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.6.0 | 07-Dec-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowflake Python APIs](snowapi-python.md) | 0.4.0 | 04-Dec-2023 |  |
|  | 0.5.0 | 06-Dec-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.11.1 | 7-Dec-2023 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.1.2 | 18-Dec-2023 |  |
| [SnowSQL](snowsql.md) | 1.2.31 | 13-Dec-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### November 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.1.3 | 15-Nov-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.7.0 | 15-Nov-2023 | **Y** |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | 3.14.3 | 07-Nov-2023 |  |
| [Node.js Driver](nodejs.md) | 1.9.1 | 14-Nov-2023 |  |
| [ODBC Driver](odbc.md) | 3.1.3 | 13-Nov-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 2.0.1 | 09-Nov-2023 | **Y** |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.5.0 | 13-Nov-2023 |  |
|  | 3.4.1 | 09-Nov-2023 |  |
|  | 3.4.0 | 03-Nov-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.0.12 | 14-Nov-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.10.0 | 03-Nov-2023 | **Y** |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.2.29 | 13-Nov-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | 1.5.1 | 02-Nov-2023 |  |

### October 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |  |
| [Go Snowflake Driver](golang.md) | TBD | TBD |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.4 | 31-Oct-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.34 | 26-Oct-2023 |  |
|  | 3.14.2 | 02-Oct-2023 |  |
| [Node.js Driver](nodejs.md) | TBD | TBD |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.3.1 | 18-Oct-2023 |  |
|  | 3.3.0 | 12-Oct-2023 |  |
|  | 3.2.1 | 03-Oct-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.9.0 | 16-Oct-2023 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.2.29 | 10-Oct-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### September 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.1.2 | 27-Sep-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.25 | 26-Sep-2023 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | TBD | TBD |  |
| [JDBC Driver](jdbc.md) | TBD | TBD |  |
| [Node.js Driver](nodejs.md) | 1.9.0 | 28-Sep-2023 | **Y** |
| [ODBC Driver](odbc.md) | 3.1.1 | 29-Sep-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 2.0.0 | 29-Sep-2023 | **Y** |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.2.0 | 07-Sep-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.0.6 | 01-Sep-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.8.0 | 14-Sep-2023 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### August 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.1.1 | 23-Aug-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.24 | 22-Aug-2023 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.3 | 31-Aug-2023 |  |
| [JDBC Driver](jdbc.md) | 3.14.1 | 24-Aug-2023 |  |
| [Node.js Driver](nodejs.md) | 1.8.0 | 29-Aug-2023 |  |
| [ODBC Driver](odbc.md) | 3.1.0 | 23-Aug-2023 | **Y** |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 2.0.1 | 25-Aug-2023 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.1.1 | 28-Aug-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark ML](snowpark-ml.md) | 1.0.5 | 17-Aug-2023 | **Y** |
| [Snowpark Library for Python](snowpark-python.md) | 1.7.0 | 28-Aug-2023 | **Y** |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.2.28 | 07-Aug-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | 1.5.0 | 28-Aug-2023 |  |

### July 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.1.0 | 27-Jul-2023 | **Y** |
| [Go Snowflake Driver](golang.md) | 1.6.23 | 22-Jul-2023 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.2 | 25-Jul-2023 |  |
| [JDBC Driver](jdbc.md) | 3.14.0 | 27-Jul-2023 | **Y** |
| [Node.js Driver](nodejs.md) | 1.7.0 | 28-Jul-2023 |  |
| [ODBC Driver](odbc.md) | 3.0.1 | 06-Jul-2023 | **Y** |
|  | 3.0.2 | 27-Jul-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.9.4 | 13-Jul-2023 |  |
|  | 2.0.0 | 31-Jul-2023 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.1.0 | 31-Jul-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.2.28 | 07-Aug-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### June 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.25 | 16-Jun-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.22 | 16-Jun-2023 |  |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.1 | 14-Jun-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.33 | 14-Jun-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.23 | 14-Jun-2023 |  |
| [ODBC Driver](odbc.md) | 2.25.10 | 06-Jun-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.5.0 | 14-Jun-2023 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | 1.2.27 | 14-Jun-2023 |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### May 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.24 | 23-May-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.21 | 23-May-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.31 | 25-May-2023 |  |
|  | 3.13.32 | 26-May-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.22 | 24-May-2023 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.7 | 23-May-2023 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.9.3 | 21-May-2023 |  |
| [Snowflake Connector for Python](python-connector.md) | 3.0.5 | 25-May-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | 23-May-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### April 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.23 | 19-Apr-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.20 | 18-Apr-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.30 | 18-Apr-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.21 | 18-Apr-2023 |  |
| [ODBC Driver](odbc.md) | 2.25.11 | 20-Apr-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.0.3 | 20-Apr-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 2.11.3 | 21-Apr-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### March 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.22 | 22-Mar-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.19 | 21-Mar-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.29 | 17-Mar-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.20 | 23-Mar-2023 |  |
| [ODBC Driver](odbc.md) | 2.25.10 | 22-Mar-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.0.1 | 01-Mar-2023 |  |
|  | 3.0.2 | 23-Mar-2023 |  |
| [Snowflake Connector for Spark](spark-connector.md) | 2.11.2 | 21-Mar-2023 |  |
| [Snowpark Library for Python](snowpark-python.md) | 1.2.0 | 2-Mar-2023 |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | 1.4.7 | 21-Mar-2023 |  |

### February 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.21 | 22-Feb-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.18 | 22-Feb-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.28 | 22-Feb-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.19 | 27-Feb-2023 |  |
| [ODBC Driver](odbc.md) | 2.25.8 | 08-Feb-2023 |  |
|  | 3.0.2 | 23-Mar-2023 |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.7.1 | 08-Feb-2023 |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### January 2023 release notes

| Client | Version | Date | BCR? |
| --- | --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.20 | 24-Jan-2023 |  |
| [Go Snowflake Driver](golang.md) | 1.6.17 | 26-Jan-2023 |  |
| [JDBC Driver](jdbc.md) | 3.13.27 | 30-Jan-2023 |  |
| [Node.js Driver](nodejs.md) | 1.6.18 | 31-Jan-2023 |  |
| [ODBC Driver](odbc.md) | TBD | TBD |  |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.6 | 24-Jan-2023 |  |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |  |
| [Snowflake Connector for Python](python-connector.md) | 3.0.0 | 27-Jan-2023 | **Y** |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |  |
| [Snowpark Library for Python](snowpark-python.md) | 26-Jan-2023 |  |  |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |  |
| [SnowSQL](snowsql.md) | TBD | TBD |  |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |  |

### December 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |
| [Go Snowflake Driver](golang.md) | 1.6.16 | 14-Dec-2022 |
| [JDBC Driver](jdbc.md) | 3.13.26 | 14-Dec-2022 |
| [Node.js Driver](nodejs.md) | 1.6.17 | 14-Dec-2022 |
| [ODBC Driver](odbc.md) | 2.25.7 (Doc only) | 13-Dec-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.9.0 | 14-Dec-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | 2.11.1 | 13-Dec-2022 |
| [Snowpark Library for Python](snowpark-python.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.4.5 | 09-Dec-2022 |

### November 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.18 | 02-Nov-2022 |
|  | 2.0.19 | 16-Nov-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.15 | 16-Nov-2022 |
| [JDBC Driver](jdbc.md) | 3.13.25 | 16-Nov-2022 |
| [Node.js Driver](nodejs.md) | 1.6.16 | 18-Nov-2022 |
| [ODBC Driver](odbc.md) | TBD | TBD |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.8.2 | 18-Nov-2022 |
| [Snowflake Connector for Python](python-connector.md) | 2.8.2 | 18-Nov-2022 |
|  | 2.8.3 | 28-Nov-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | 2.11.1 | 13-Dec-2022 |
| [Snowpark Library for Python](snowpark-python.md) | 1.0.0 | 01-Nov-2022 |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.4.4 | 16-Nov-2022 |

### October 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.17 | 03-Oct-2022 |
| [Go Snowflake Driver](golang.md) | TBD | TBD |
| [Ingest Java SDK](ingest-java-sdk.md) | 2.0.4 | 31-Oct-20 |
| [JDBC Driver](jdbc.md) | 3.13.24 | 28-Oct-2022 |
| [Node.js Driver](nodejs.md) | 1.6.15 | 28-Oct-2022 |
| [ODBC Driver](odbc.md) | 2.25.6 | 12-Oct-2022 |
|  | 2.25.7 | 31-Oct-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.5 | 26-Oct-2022 |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.6.1 | 03-Oct-2022 |
| [SnowSQL](snowsql.md) | 1.2.24 | 21-Oct-2022 |
| [SQLAlchemy](sqlalchemy.md) | 1.4.3 | 21-Oct-2022 |

### September 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |
| [Go Snowflake Driver](golang.md) | TBD | TBD |
| [JDBC Driver](jdbc.md) | 3.13.23 | 30-Sep-2022 |
| [Node.js Driver](nodejs.md) | TBD | TBD |
| [ODBC Driver](odbc.md) | TBD | TBD |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.8.0 | 27-Sep-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | 2.11.0 | 02-Sep-2022 |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.4.2 | 28-Sep-2022 |

### August 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.16 | 23-Aug-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.13 | 22-Aug-2022 |
| [JDBC Driver](jdbc.md) | 3.13.22 | 23-Aug-2022 |
| [Node.js Driver](nodejs.md) | 1.6.13 | 24-Aug-2022 |
| [ODBC Driver](odbc.md) | 2.25.4 | 01-Aug-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.4 | 24-Aug-2022 |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.6.8 | 23-Aug-2022 |
|  | 1.8.1 | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.7.12 | 24-Aug-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | 2.10.1 | 15-Aug-2022 |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.6.0 | 12-Aug-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.4.1 | 23-Aug-2022 |

### July 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.15 | 19-Jul-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.12 | 29-Jul-2022 |
| [JDBC Driver](jdbc.md) | 3.13.21 | 13-Jul-2022 |
| [Node.js Driver](nodejs.md) | 1.6.12 | 25-Jul-2022 |
| [ODBC Driver](odbc.md) | TBD | TBD |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.3 | 08-Jul-2022 |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.7.10 | 25-Jul-2022 |
|  | 2.7.11 | 28-Jul-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.6.0 | 12-Aug-2022 |
| [SnowSQL](snowsql.md) | 1.2.23 | 28-Jul-2022 |
| [SQLAlchemy](sqlalchemy.md) | 1.4.0 | 21-Jul-2022 |

### June 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.14 | 23-Jun-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.11 | 23-Jun-2022 |
| [JDBC Driver](jdbc.md) | 3.13.20 | 23-Jun-2022 |
| [Node.js Driver](nodejs.md) | 1.6.11 | 23-Jun-2022 |
| [ODBC Driver](odbc.md) | 2.25.3 | 29-Jun-2022 |
|  | 2.25.2 | 01-Jun-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.7.9 | 24-Jun-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | TBD | TBD |
| [SnowSQL](snowsql.md) | 1.2.22 | 29-Jun-2022 |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |

### May 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.12 | 06-May-2022 |
|  | 2.0.13 | 18-May-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.10 | 25-May-2022 |
| [JDBC Driver](jdbc.md) | 3.13.19 | 25-May-2022 |
| [Node.js Driver](nodejs.md) | 1.6.10 | 25-May-2022 |
| [ODBC Driver](odbc.md) | 22.25.0 | 09-May-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.2 | 24-May-2022 |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.7.8 | 26-May-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.4.1 | 26-May-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | TBD | TBD |

### April 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | TBD | TBD |
| [Go Snowflake Driver](golang.md) | 1.6.9 | 19-Apr-2022 |
| [JDBC Driver](jdbc.md) | 3.13.17 | 14-Apr-2022 |
| [Node.js Driver](nodejs.md) | 1.6.9 | 20-Apr-2022 |
| [ODBC Driver](odbc.md) | TBD | TBD |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.7.2 | 26-Apr-2022 |
| [Snowflake Connector for Python](python-connector.md) | 2.7.7 | 27-Apr-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.4.0 | 28-Apr-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.3.4 | 27-Apr-2022 |

### March 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.11 | 15-Mar-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.8 | 15-Mar-2022 |
| [JDBC Driver](jdbc.md) | 3.13.16 | 17-Mar-2022 |
| [Node.js Driver](nodejs.md) | 1.6.8 | 17-Mar-2022 |
| [ODBC Driver](odbc.md) | 2.24.7 | 17-Mar-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | 1.2.1 | 16-Mar-2022 |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | 2.7.6 | 18-Mar-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.3.0 | 18-Mar-2022 |
|  | 1.2.0 | 02-Mar-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |
| [SQLAlchemy](sqlalchemy.md) | 1.3.4 | 27-Apr-2022 |

### February 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.10 | 16-Feb-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.7 | 16-Feb-2022 |
| [JDBC Driver](jdbc.md) | 3.13.15 | 25-Feb-2022 |
| [Node.js Driver](nodejs.md) | 1.6.7 | 16-Feb-2022 |
| [ODBC Driver](odbc.md) | 2.24.7 | 17-Mar-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | TBD | TBD |
| [Snowflake Connector for Python](python-connector.md) | TBD | TBD |
| [Snowflake Connector for Spark](spark-connector.md) | 2.10.0 | 17-Feb-2022 |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.1.0 | 04-Feb-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |

### January 2022 release notes

| Client | Version | Date |
| --- | --- | --- |
| [.NET Driver](dotnet.md) | 2.0.9 18-Jan-2022 |  |
|  | 1.2.9 | 18-Jan-2022 |
| [Go Snowflake Driver](golang.md) | 1.6.6 | 18-Jan-2022 |
| [JDBC Driver](jdbc.md) | 3.13.14 | 21-Jan-2022 |
|  | 3.13.13 | 18-Jan-2022 |
| [Node.js Driver](nodejs.md) | TBD | TBD |
| [ODBC Driver](odbc.md) | 2.24.5 | 21-Jan-2022 |
| [PHP PDO Driver for Snowflake](php-pdo.md) | TBD | TBD |
| [Snowflake Connector for Kafka](kafka-connector.md) | 1.7.0 | 18-Jan-2022 |
| [Snowflake Connector for Python](python-connector.md) | 2.7.3 | 18-Jan-2022 |
| [Snowflake Connector for Spark](spark-connector.md) | TBD | TBD |
| [Snowpark Library for Scala and Java](snowpark-scala-java.md) | 1.0.0 | 26-Jan-2022 |
|  | 0.12.0 | 04-Jan-2022 |
| [SnowSQL](snowsql.md) | TBD | TBD |

---
title: Snowflake Cortex AI Function: Multirow error handling improvements
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2184.md
section: Release Notes
---

# Snowflake Cortex AI Function: Multirow error handling improvements

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, most [Snowflake Cortex AI Functions](../../../user-guide/snowflake-cortex/aisql.md) return NULL on error, rather than raising the error. This allows multi-row queries to complete even if some rows have errors.
Additionally, the [AI_PARSE_DOCUMENT](../../../sql-reference/functions/ai_parse_document.md) function changes its return value to be more consistent with other AI Functions’ error handling.

|  |  |
| --- | --- |
| Before the change | Most AI Functions raise an error when the function does not succeed, preventing multi-row queries from completing when even one row cannot be processed. |
| After the change | Most AI Functions return NULL when the function does not succeed, allowing multi-row queries to complete when some rows cannot be processed. Rows with errors can easily be excluded from multi-row results.  A new, optional, final parameter in the affected AI Functions, `return_error_details`, when present and set to TRUE, causes the functions to return an [OBJECT](../../../sql-reference/data-types-semistructured.md) with `value` and `error` fields rather than rather than the former result type. If the function succeeded, the `error` field is NULL and the `value` field contains the actual return value. If the function failed, the `value` field is NULL and the `error` field contains an error message. In addition to allowing rows with errors to be easily excluded from multi-row results, this behavior allows errors to be recorded for later review.  Additionally, minor changes to the [AI_PARSE_DOCUMENT](../../../sql-reference/functions/ai_parse_document.md) function’s return value make it more consistent with other AI Functions, as follows:   * When the `return_error_details` argument is FALSE or not present, and an error occurs, the function returns NULL. * When `return_error_details` is present and TRUE, the return value has the following changes compared to the previous behavior:    + The `metadata` field, formerly a subfield of the top-level value field, is now itself a top-level field.   + The `errorInformation` subfield of the top-level `value` field is renamed `error` for consistency with the top-level error field. However, the `error` subfield is not present when no error occurs,     while the top-level `error` field is NULL. |

## Affected AI functions

The following AI Functions are affected by this behavior change:

* [AI_COMPLETE](../../../sql-reference/functions/ai_complete.md): Generates text responses from text or image prompts using a specified large language model (LLM).
* [AI_CLASSIFY](../../../sql-reference/functions/ai_classify.md): Classifies text or images into user-defined categories.
* [AI_FILTER](../../../sql-reference/functions/ai_filter.md): Applies semantic filters on text and images expressed in natural language.
* [AI_PARSE_DOCUMENT](../../../sql-reference/functions/ai_parse_document.md): Extracts document structure, text, images, and tables as Markdown.
* [AI_TRANSCRIBE](../../../sql-reference/functions/ai_transcribe.md): Transcribes audio or video files with speaker identification and timestamps.
* [AI_TRANSLATE](../../../sql-reference/functions/ai_translate.md): Translates text between supported languages.
* [AI_SENTIMENT](../../../sql-reference/functions/ai_sentiment.md): Performs sentiment classification on text content.
* [AI_COUNT_TOKENS](../../../sql-reference/functions/ai_count_tokens.md): Estimates token usage for prompts.

### Unaffected AI functions

The following AI Functionsare *not* affected by this behavior change:

* [AI_EXTRACT](../../../sql-reference/functions/ai_extract.md): This function already handles errors by returning error information as a separate field in the result and does not cause multi-row queries
  to fail due to a single error. AI_EXTRACT’s behavior is similar to the new behavior of other AI Functions when `return_error_details` is TRUE, although this function does not accept `return_error_details`.
* [AI_AGG](../../../sql-reference/functions/ai_agg.md) and [AI_SUMMARIZE_AGG](../../../sql-reference/functions/ai_summarize_agg.md): Aggregate functions are not in scope for this BCR.
  Snowflake is still considering how rows that cause errors should behave in aggregation. These functions’ behavior might change in a future BCR.
* [AI_EMBED](../../../sql-reference/functions/ai_embed.md): This function returns a VECTOR, which is not currently supported for VARIANT objects. This function’s behavior might change in a future BCR.
* Older AI Functions in the SNOWFLAKE.CORTEX namespace. Snowflake does not intend to change these functions’ behavior.

## Helper functions

This BCR includes two helper functions that assist with extracting information from the error details object returned when `return_error_details` is set to TRUE.
These functions provide convenient access to alternative error handling behavior when `return_error_details` is set to TRUE.

* AI_NULL_IF_ERROR: Returns NULL if the `error` field of the given value is not NULL, otherwise returns the
  `value` field. This is the same behavior as when `return_error_details` is set to FALSE.
* AI_THROW_IF_ERROR: Raises an error from `error` field of the provided object if it is not NULL, otherwise
  returns the `value` field. This is the same behavior that AI Functions had before this behavior change.

## Examples

The following examples illustrate the new error handling behavior. These examples use AI_TRANSLATE, but the behavior is the same for all affected functions.

### New behavior with and without error

The first code sample shows output when the function succeeds, and the second example shows output when the function fails due to an invalid language code.

```sqlexample
-- succeeds
SELECT AI_TRANSLATE(spanish_comment, 'es', 'en') as english_comment, "Este es un commentario" as spanish_comment;
```

Result:

```output
+-------------------+------------------+
| ENGLISH_COMMENT   | SPANISH_COMMENT  |
|-------------------+------------------|
| This is a comment | Este es un       |
|                   | comentario       |
+-------------------+------------------+
```

```sqlexample
-- fails
SELECT AI_TRANSLATE(spanish_comment, 'es', 'xx') as english_comment, "Este es un commentario" as spanish_comment;
```

Result

```output
+-------------------+------------------+
| ENGLISH_COMMENT   | SPANISH_COMMENT  |
|-------------------+------------------|
| NULL              | Este es un       |
|                   | comentario       |
+-------------------+------------------+
```

### New behavior with error details

As before, the first code sample is the success case and the second is the error case.

```sqlexample
  -- succeeds
  SELECT AI_TRANSLATE(spanish_comment, 'es', 'en', TRUE) as result, "Este es un commentario" as spanish_comment;

Result:
```

```output
+--------------------------------+------------------+
| RESULT                         | SPANISH_COMMENT  |
|--------------------------------|------------------|
| {                              | Este es un       |
|   "value": "This is a comment",| comentario       |
|   "error": NULL                |                  |
| }                              |                  |
+--------------------------------+------------------+
```

```sqlexample
-- fails
SELECT AI_TRANSLATE(spanish_comment, 'es', 'xx', TRUE) as result, "Este es un commentario" as spanish_comment;
```

Result:

```output
+--------------------------------+------------------+
| RESULT                         | SPANISH_COMMENT  |
|--------------------------------|------------------|
| {                              | Este es un       |
|   "value": NULL,               | comentario       |
|   "error": "Invalid language   |                  |
|           \"xx\"               |                  |
+--------------------------------+------------------+
```

### Multirow query

The following examples show how to use the new error handling behavior in a multirow query. If an error occurs when processing a row, that row is not included in the result.
The example data is assumed to be a table containing user comments in various languages, and the query attempts to translate them all into English using AI_TRANSLATE.

```sqlexample
SELECT
  AI_TRANSLATE(comment, comment_language, 'en') as translation_result,
  comment_language,
  comment
FROM comments
WHERE translation_result IS NOT NULL;
```

The example below shows how to use the `return_error_details` parameter to achieve the same result as the previous example.

```sqlexample
SELECT
  AI_TRANSLATE(comment, comment_language, 'en', TRUE) as translation_result,
  comment_language,
  comment
FROM comments
WHERE translation_result:value IS NOT NULL;
```

Ref: 2184

---
title: Snowflake Cortex AI Functions Model RBAC Rollout
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2220.md
section: Release Notes
---

# Snowflake Cortex AI Functions Model RBAC Rollout

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

The following Snowflake Cortex AI Functions now fully enforce model access controls through both the `CORTEX_MODELS_ALLOWLIST`
parameter and model role-based access control (RBAC):

* `AI_TRANSCRIBE` / `SNOWFLAKE.CORTEX.TRANSCRIBE`
* `AI_EXTRACT` / `SNOWFLAKE.CORTEX.EXTRACT`
* `AI_SENTIMENT` / `SNOWFLAKE.CORTEX.SENTIMENT` / `SNOWFLAKE.CORTEX.ENTITY_SENTIMENT`
* `AI_TRANSLATE` / `SNOWFLAKE.CORTEX.TRANSLATE`
* `CLASSIFY_TEXT`
* `SUMMARIZE`
* `EXTRACT_ANSWER`
* `AI_PARSE_DOCUMENT` / `SNOWFLAKE.CORTEX.PARSE_DOCUMENT`
* `AI_REDACT`

Before the change:
:   Model access controls, `CORTEX_MODELS_ALLOWLIST` and model RBAC, were fully enforced for
    `AI_COMPLETE` / `SNOWFLAKE.CORTEX.COMPLETE`, `AI_CLASSIFY`, `AI_FILTER`,
    `AI_AGG` , and `AI_SUMMARIZE_AGG`.

    For the preceding Snowflake Cortex AI Functions, model access controls were not enforced. Queries using these
    functions could succeed even when the underlying model was restricted by `CORTEX_MODELS_ALLOWLIST`
    or model RBAC.

After the change:
:   When you call any of the listed Snowflake Cortex AI Functions, Snowflake will:

    1. Check model RBAC first: If the calling role has usage on the corresponding model object (for example,
       via `SNOWFLAKE."CORTEX-MODEL-ROLE-ARCTIC-TRANSLATE"`), the call is allowed.
    2. If no model object access is found, check `CORTEX_MODELS_ALLOWLIST`: If the underlying model or alias
       is listed in `CORTEX_MODELS_ALLOWLIST`, or if `CORTEX_MODELS_ALLOWLIST = 'All'`, the call is allowed.
    3. Otherwise, the call fails with a model-authorization error.

    This aligns behavior across all listed Snowflake Cortex AI Functions and ensures that your existing model restrictions are
    respected consistently.

    > **Note:**
    >
    > **If you’re affected by this change:**
    >
    > You’re affected if you’ve set the `CORTEX_MODELS_ALLOWLIST` parameter to a different value from the default `All` value **and** you use any of the preceding Snowflake Cortex AI Functions.
    >
    > If you haven’t changed the value for the allowlist parameter and you’re not using model RBAC, you won’t see any behavioral changes.
    >
    > If you’ve customized `CORTEX_MODELS_ALLOWLIST`, affected queries might start failing with a
    > model-authorization error unless:
    >
    > * The underlying model or its function-specific alias is permitted by `CORTEX_MODELS_ALLOWLIST`, or
    > * The role executing the query has the corresponding model application role.
    >
    > **To prepare for this change:**
    >
    > 1. Check which affected functions you use:
    >
    >    ```sqlexample
    >    SELECT DISTINCT FUNCTION_NAME
    >    FROM SNOWFLAKE.ACCOUNT_USAGE.CORTEX_AISQL_USAGE_HISTORY
    >    WHERE USAGE_TIME >= DATEADD(day, -90, CURRENT_TIMESTAMP())
    >      AND FUNCTION_NAME IN (
    >          'AI_TRANSCRIBE', 'TRANSCRIBE', 'AI_EXTRACT', 'AI_SENTIMENT',
    >          'SENTIMENT', 'ENTITY_SENTIMENT', 'AI_TRANSLATE', 'TRANSLATE',
    >          'CLASSIFY_TEXT', 'SUMMARIZE', 'EXTRACT_ANSWER',
    >          'AI_PARSE_DOCUMENT', 'PARSE_DOCUMENT', 'AI_REDACT'
    >      )
    >    ORDER BY FUNCTION_NAME;
    >    ```
    > 2. Review your current model governance settings:
    >
    >    ```sqlexample
    >    SHOW PARAMETERS LIKE 'CORTEX_MODELS_ALLOWLIST' IN ACCOUNT;
    >    ```
    > 3. If using an allowlist (not `All`), add the required model aliases for the functions you use.
    >    Common model aliases include:
    >
    >    * `arctic-translate` (for `AI_TRANSLATE`)
    >    * `arctic-transcribe` (for `AI_TRANSCRIBE`)
    >    * `arctic-extract` (for `AI_EXTRACT`)
    >    * `arctic-parse-document` (for `AI_PARSE_DOCUMENT`)
    >    * `arctic-extract-answer` (for `EXTRACT_ANSWER`)
    >    * `arctic-sentiment` (for `AI_SENTIMENT`)
    >    * `llama3.1-70b` (for `CLASSIFY_TEXT`, `AI_REDACT`)
    >    * `mistral-7b` (for `SUMMARIZE`)
    >
    >    Example:
    >
    >    ```sqlexample
    >    ALTER ACCOUNT SET CORTEX_MODELS_ALLOWLIST =
    >      'llama3.1-70b,arctic-translate,arctic-extract,arctic-sentiment,arctic-parse-document,arctic-extract-answer';
    >    ```
    >
    > Alternatively, use model RBAC by setting `CORTEX_MODELS_ALLOWLIST = 'None'` and granting
    > model application roles (for example, `SNOWFLAKE."CORTEX-MODEL-ROLE-ARCTIC-TRANSLATE"`) to
    > the appropriate roles.

This change is being made to:

* Maintain consistent governance
* Enforce least-privilege access uniformly
* Align managed AI Functions with existing Cortex model access controls
* Strengthen enterprise-grade compliance guarantees
* Provide transparent model authorization behavior

Ref: 2220

---
title: Snowflake Cortex: Model deprecation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-1984.md
section: Release Notes
---

# Snowflake Cortex: Model deprecation

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, Snowflake Cortex deprecates the models listed on this page:

Before the change:
:   Snowflake Cortex supports the following models:

    * gemma-7b
    * jamba-1.5-large
    * jamba-1.5-mini
    * jamba-instruct
    * llama2-70b-chat
    * llama3.2-1b
    * llama3.2-3b
    * reka-core
    * reka-flash

After the change:
:   Snowflake Cortex deprecates the following models:

    * gemma-7b
    * jamba-1.5-large
    * jamba-1.5-mini
    * jamba-instruct
    * llama2-70b-chat
    * llama3.2-1b
    * llama3.2-3b
    * reka-core
    * reka-flash

Ref: 1984

---
title: Snowflake Information Schema views: New column GRANTED_TO
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1753.md
section: Release Notes
---

# Snowflake Information Schema views: New column GRANTED_TO

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the following views in the Snowflake Information Schema include a new column:

* OBJECT_PRIVILEGES
* TABLE_PRIVILEGES
* USAGE_PRIVILEGES

The new column in these views is:

| Column name | Data type | Description |
| --- | --- | --- |
| GRANTED_TO | VARCHAR | Object that has been granted the privilege. Possible values currently include ROLE, APPLICATION, and APPLICATION ROLE. |

The GRANTED_TO column appears after the GRANTEE column.

Ref: 1753

---
title: Snowflake ML Python release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-ml.md
section: Release Notes
---

# Snowflake ML Python release notes

The Snowflake ML Python release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowpark-ml-2026.md)
* [2025 releases](snowpark-ml-2025.md)
* [2024 releases](snowpark-ml-2024.md)
* [2023 releases](snowpark-ml-2023.md)

See [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md) for documentation.

---
title: Snowflake ML Python release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-ml-2026.md
section: Release Notes
---

# Snowflake ML Python release notes

This article contains the release notes for the Snowflake ML Python, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> These notes do not include changes in features that have not been publicly announced.
> Such features might appear in the Snowflake ML Python source code but not in the public documentation.

See [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md) for documentation.

## Verifying the `snowflake-ml-python` package

All Snowflake packages are signed, allowing you to verify their origin. To verify the `snowflake.ml.python` package, follow the steps below:

1. Install `cosign`. This example uses the Go installation:
   [Installing cosign with Go](https://edu.chainguard.dev/open-source/sigstore/cosign/how-to-install-cosign/#installing-cosign-with-go).
2. Download the file from a repository such as [PyPi](https://pypi.org/project/snowflake-ml-python/#files).
3. Download a `.sig` file for that release from the GitHub [releases page](https://github.com/snowflakedb/snowflake-ml-python/releases/).
4. Verify the signature using `cosign`. For example:

```bash
cosign verify-blob snowflake_ml_python-1.27.0.tar.gz --key snowflake-ml-python-1.27.0.pub --signature resources.linux.snowflake_ml_python-1.27.0.tar.gz.sig

cosign verify-blob snowflake_ml_python-1.27.0.tar.gz --key snowflake-ml-python-1.27.0.pub --signature resources.linux.snowflake_ml_python-1.27.0.tar.gz.sig
```

> **Note:**
>
> This example uses the library and signature for version 1.27.0 of the package. Use the filenames of the version you are verifying.

## Deprecation notices

* `snowflake.ml.fileset.FileSet` has been deprecated and will be removed in a future release. Use
  [snowflake.ml.dataset.Dataset](../../developer-guide/snowflake-ml/dataset.md) and
  [snowflake.ml.data.DataConnector](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector) instead.
* The “CamelCase” function names in `snowflake.ml.cortex` have been deprecated and will be removed in a future
  release. Use the “snake_case” names for these functions instead. For example, use `classify_text` instead of
  `ClassifyText`.
* The `partitioned_inference_api` decorator has been deprecated and will be removed in a future release. Use `custom_model.partitioned_api` instead.
* The `additional_payloads` argument of the `MLJob.submit_*` methods has been deprecated and will be removed in a future release.
  Use the `imports` argument instead.
* The `snowflake.ml.model.models.huggingface_pipeline.HuggingfacePipelineModel` class has been deprecated and will be removed in a future release.

## Version 1.27.0 (2026-02-12)

### Bug fixes

Model Registry bug fixes:

* Fixed failure of `model_version.run` caused by requiring READ privilege on the model instead of USAGE when the
  user’s role had only the USAGE privilege.

Feature store bug fixes:

* Fixed failure of `register_feature_view` with `overwrite=True` when the existing feature view is external and the
  new feature view is managed, or vice versa.

## Version 1.26.0 (2026-02-05)

### New features

New Model Registry features:

* Model signatures can now include inference parameters via `ParamSpec`, allowing you to define
  constant parameters to be passed at inference time without including them in the input data.
  Example:

  ```python
  import pandas as pd
  from snowflake.ml.model import custom_model, model_signature
  from snowflake.ml.registry import Registry

  # Define a custom model with inference parameters
  class MyModelWithParams(custom_model.CustomModel):
      @custom_model.inference_api
      def predict(
          self,
          input_df: pd.DataFrame,
          *,
          temperature: float = 1.0,  # keyword-only param with default
      ) -> pd.DataFrame:
          return pd.DataFrame({"output": input_df["feature"] * temperature})

  # Create sample data
  model = MyModelWithParams(custom_model.ModelContext())
  sample_input = pd.DataFrame({"feature": [1.0, 2.0, 3.0]})
  sample_output = model.predict(sample_input, temperature=1.0)

  # Define ParamSpec for the inference parameter
  params = [
      model_signature.ParamSpec(
          name="temperature",
          dtype=model_signature.DataType.FLOAT,
          default_value=1.0,
      ),
  ]

  # Infer signature with params
  sig = model_signature.infer_signature(
      input_data=sample_input,
      output_data=sample_output,
      params=params,
  )

  # Log model with the signature
  registry = Registry(session)
  mv = registry.log_model(
      model=model,
      model_name="my_model_with_params",
      version_name="v1",
      signatures={"predict": sig},
  )

  # Run inference with custom parameter value
  result = mv.run(sample_input, function_name="predict", params={"temperature": 2.0})
  ```

New Feature Store features:

* New `auto_prefix` parameter and `with_name` method to prevent column name collisions when joining multiple feature
  views in dataset generation.
* Dynamic Iceberg tables can now be used as backing storage for Feature Views. Use `StorageConfig` with
  `StorageFormat.ICEBERG` to store data in Apache Iceberg format on external cloud storage. A new
  `default_iceberg_external_volume` parameter is available in `FeatureStore` to set a default external volume for
  Iceberg feature views.

## Version 1.25.1 (2026-02-03)

No public-facing changes. This release includes changes to a preview feature that has not been publicly announced.

## Version 1.25.0 (2026-01-28)

### New features

New Model Serving features:

* The `create_service` method accepts a new `autocapture` argument to indicate whether inference data should be captured
  (see [Autocapture inference logs for realtime inference](../../developer-guide/snowflake-ml/inference/auto-capture-inference-logs.md)).
* The `create_service` and `log_model_and_create_service` methods now accept an optional `min_instances` argument
  to specify the minimum number of instances for the service. The service automatically scales between the specified
  minimum and maximum instances based on traffic and hardware utilization. If `min_instances` is 0, the service
  automatically suspends when no traffic is detected for a period of time. The default value for `min_instances` is 0.

## Version 1.24.0 (2026-01-22)

### New features

New Feature Store features:

* Tile-based aggregation support using a new `Feature` API for efficient and point-in-time correct time-series
  feature computation using pre-computed tiles.

New Model Registry features:

* SentenceTransformer models now support automatic signature inference. When logging a SentenceTransformer model,
  `sample_input_data` is optional. The signature is automatically inferred from the model’s embedding dimension when
  sample input data is not provided.. The `encode`, `encode_query`, `encode_document`, `encode_queries`,
  `encode_documents` methods are supported.

  ```python
  import sentence_transformers
  from snowflake.ml.registry import Registry

  # Create model
  model = sentence_transformers.SentenceTransformer("all-MiniLM-L6-v2")

  # Log model without sample_input_data - signature is auto-inferred
  registry = Registry(session)
  mv = registry.log_model(
      model=model,
      model_name="my_sentence_transformer",
      version_name="v1",
  )

  # Run inference with auto-inferred signature (input: "text", output: "output")
  import pandas as pd
  result = mv.run(pd.DataFrame({"text": ["Hello world"]}))
  ```

## Version 1.23.0 (2026-01-165)

### New features

New ML Jobs features:

* ML Jobs now support Python 3.11 and Python 3.12. Jobs automatically select a runtime environment
  matching the client Python version.

### Bug fixes

Model Registry bug fixes:

* Empty output in HuggingFace’s Token Classification (Named Entity Recognition) models no longer causes failures.

Model Serving bug fixes:

* Container statuses are now correctly reported and should not be blank.

## Version 1.22.0 (2026-01-09)

### New features

New Model Registry features:

* You can now remotely log a transformer pipeline model using a Snowpark Container Services (SPCS) job.

  ```python
  # create reference to the model
  model = huggingface.TransformersPipeline(
      model="TinyLlama/TinyLlama-1.1B-Chat-v1.0",
      task="text-generation",
  )

  # Remotely log the model, a SPCS job will run async and log the model
  mv = registry.log_model(
      model=model,
      model_name="tinyllama_remote_log",
      target_platforms=["SNOWPARK_CONTAINER_SERVICES"],
      signatures=openai_signatures.OPENAI_CHAT_SIGNATURE,
  )
  ```

## Version 1.21.0 (2026-01-05)

### Behavior changes

ML Jobs behavior changes:

* The behavior of the `additional_payloads` parameter is changing. Use the `imports` argument to declare additional
  dependencies, such as ZIP files and Python modules. Local directories and Python files are automatically compressed,
  and their internal layout is determined by the specified import path. The import path applies only to local
  directories, Python files, and staged python files; it has no effect on other import types. When referencing files in a
  stage, only individual files are supported, not directories.

Experiment Tracking behavior changes:

* `ExperimentTracking` is now a singleton class.

### Bug fixes

Experiment Tracking bug fixes:

* Reaching the run metadata size limit in `log_metrics` or `log_params` now issues a warning instead of raising an exception.

Model Registry bug fixes:

* `ModelVersion.run` now raises a `ValueError` if the model is a SPCS-only model and `service_name` is not provided.

### New preview features

* The `create_service` method now accepts the Boolean argument `autocapture` to indicate whether inference data is automatically captured.

### New release features

New Model Registry features:

* The new `snowflake.ml.model.models.huggingface.TransformersPipeline` class is intended to replace `snowflake.ml.model.models.huggingface_pipeline.HuggingfacePipelineModel`,
  although the older class is not yet deprecated. The new class knows model signatures for common tasks so that you do not need to specify them manually.
  The supported tasks are currently:

  + `fill-mask`
  + `question-answering`
  + `summarization`
  + `table-question-answering`
  + `text2text-generation`
  + `text-classification` (alias `sentiment-analysis`)
  + `text-generation`
  + `token-classification` (alias `ner`)
  + `translation`
  + `translation_xx_to_yy`
  + `zero-shot-classification` (lets you log models without loading them into memory)
* The `list_services` API now shows an internal endpoint that can be called from another SPCS node or notebook without Enterprise Application Integration.
  It also indicates whether autocapture is enabled for each service.

New DataConnector features:

* New `to_huggingface_dataset` method converts Snowflake data to HuggingFace datasets. Supports both in-memory
  `Dataset` (`streaming=False`) and streaming `IterableDataset` (`streaming=True`) modes.

### Deprecation notices

* The `snowflake.ml.model.models.huggingface_pipeline.HuggingfacePipelineModel` class has been deprecated and will be removed in a future release.

---
title: Snowflake ML release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-ml-2023.md
section: Release Notes
---

# Snowflake ML release notes

This article contains the release notes for the Snowflake ML, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> These notes do not include changes in features that have not been publicly released.

See [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md) for documentation.

## Version 1.1.2 (2023-12-18)

### New features and updates

Model development updates:

* Implemented `precision_score` metric in SQL.

### Bug fixes

* Fixed an issue where stack trace was being hidden by telemetry.

Model development fixes:

* Inferring model signatures no longer materializes the full dataframe in memory.

## Version 1.1.1 (2023-12-6)

### New features and updates

* Designated Snowpark ML Modeling API as Generally Available.
* New `passthrough_col` parameter in the Modeling API allows you to exclude specific columns, like index columns, during
  training or inference when not explicitly specifying `input_cols`.

### Bug fixes

* Model development fixes:

  + `confusion_matrix` provided incorrect results when the row number could not be divided by the batch size.

## Version 1.1.0 (2023-12-1)

### New features and updates

* `GridSearchCV` and `RandomizedSearchCV` execution is now distributed on multi-node warehouses.

### Bug fixes

* Model development fixes:

  + Columns were being excluded if their normalized names did not match the names specified in `output_columns`
    in `OrdinalEncoder` and `LabelEncoder`. Output columns no longer need to be valid Snowflake identifiers.

## Version 1.0.12 (2023-11-15)

### New features and updates

* None

### Bug fixes

* Model development fixes:

  + Increased the column capacity of `OrdinalEncoder`.

## Version 1.0.11 (2023-10-28)

### New features and updates

* Add support for `kneighbors`.
* Support `DecimalType` as a data type.

### Bug fixes

* Model development fixes:

  + Fix support for XGBoost and LightGBM models using SKLearn Grid Search and Randomized Search model selectors.
  + Fix metrics compatibility with Snowpark DataFrames that use Snowflake identifiers

## Version 1.0.10 (2023-10-15)

### New features and updates

* `precision_score`, `recall_score`, `f1_score`, `fbeta_score`, `precision_recall_fscore_support`,
  `mean_absolute_error`, `mean_squared_error`, and `mean_absolute_percentage_error` metric calculations are now
  distributed.

### Bug fixes

* Model development fixes:

  + Fix UTF-8 decoding errors when using modeling modules on Windows.
  + Fix alias definitions causing `SnowparkSQLUnexpectedAliasException` in inference.

## Version 1.0.9 (2023-09-28)

### New features and updates

* Calculation of `log_loss` metric is now distributed.

### Bug fixes

* Model development fixes:

  + Building images no longer fails with some Docker setups.
  + Embedding local ML library no longer fails when the library is imported by zipimport.
  + Update incorrect documentation about platform argument in the `deploy` function.

## Version 1.0.8 (2023-09-15)

### New features and updates

* None

### Bug fixes

* Model development fixes:

  + Ordinal encoder can be used with mixed input column types.
  + Fix an issue when the sklearn default value is `np.nan`.

## Version 1.0.7 (2023-09-05)

### New features and updates

> * Allow disabling telemetry.

### Bug fixes

* Model development fixes:

  + Fix an error related to `pandas.io.json.json_normalizer`.

## Version 1.0.6 (2023-09-01)

### New features and updates

* Model development: Size of metrics result can exceed previous 8MB limit.

### Bug fixes

* Model development fixes:

  + Fixed a bug when using simple imputer with NumPy >= 1.25.
  + Fixed a bug when inferring the type of label columns.

## Version 1.0.5 (2023-08-17)

This release contains internal changes only.

## Version 1.0.4 (2023-07-28)

### New features and updates

* Model development: Input dataframes can now be joined against data loaded from staged files.
* Model development: Added support for non-English languages.

### Bug fixes

* None

## Version 1.0.3 (2023-07-14)

This release contains internal changes only.

## Version 1.0.2 (2023-06-22)

### New features and updates

* Model development: Added metrics:

  + d2_absolute_error_score
  + d2_pinball_score
  + explained_variance_score
  + mean_absolute_error
  + mean_absolute_percentage_error
  + mean_squared_error

### Bug fixes

Model development: `accuracy_score` now works when given label column names that are lists of single values.

## Version 1.0.1 (2023-06-16)

### Behavior changes

* Model development: Changed Metrics APIs to follow scikit-learn metrics modules:

  + `accuracy_score`, `confusion_matrix`, `precision_recall_fscore_support`, and `precision_score` methods move to `metrics.classification`.

### New features and updates

* Model development: Added metrics:

  + `f1_score`
  + `fbeta_score`
  + `recall_score`
  + `roc_auc_score`
  + `roc_curve`
  + `log_loss`
  + `precision_recall_curve`

## Version 1.0.0 (2023-06-09)

Initial public preview release.

---
title: Snowflake ML release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-ml-2024.md
section: Release Notes
---

# Snowflake ML release notes

This article contains the release notes for the Snowflake ML, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> These notes do not include changes in features that have not been publicly announced.
> Such features may appear in the Snowflake ML source code but not in the public documentation.

See [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md) for documentation.

## Verifying the snowflake-ml-python package

All Snowflake packages are signed, allowing you to verify their origin. To verify the `snowflake.ml.python` package, follow the steps below:

1. Install `cosign`. This example uses the Go installation:
   [Installing cosign with Go](https://edu.chainguard.dev/open-source/sigstore/cosign/how-to-install-cosign/#installing-cosign-with-go).
2. Download the file from a repository such as [PyPi](https://pypi.org/project/snowflake-ml-python/#files).
3. Download a `.sig` file for that release from the GitHub [releases page](https://github.com/snowflakedb/snowflake-ml-python/releases/).
4. Verify the signature using `cosign`. For example:

```bash
cosign verify-blob snowflake_ml_python-1.7.0.tar.gz --key snowflake-ml-python-1.7.0.pub --signature resources.linux.snowflake_ml_python-1.7.0.tar.gz.sig

cosign verify-blob snowflake_ml_python-1.7.0.tar.gz --key snowflake-ml-python-1.7.0.pub --signature resources.linux.snowflake_ml_python-1.7.0
```

> **Note:**
>
> This example uses the library and signature for version 1.7.0 of the package. Use the filenames of the version you are verifying.

## Version 1.7.2 (2024-11-21)

### New features

New model registry features:

* Model registry now supports asynchronous model inference service creation with the `block` option in the `Modelversion.create_service` method.
  Set this option to `False` to create the service asynchronously. The default is `True`.

### Bug fixes

Model explainability bug fixes:

* Fixed issue where `explain` is enabled for scikit-learn pipelines whose task is UNKNOWN, only to later fail when invoked.

## Version 1.7.1 (2024-11-05)

### New features

New model registry features:

* Null values are now ignored in the dataframe used for model signature inference. Only non-null values are used to infer signatures.
* Null values are now allowed in dataframes used for prediction.
* pandas extension data types are now supported in model signature inference.
* pandas `Series` can be used in input and output data.

New model monitoring features:

* The option `enable_monitoring` is now available when logging a model in the registry. This option gates access to private preview features of model monitoring.

### Bug fixes

Data bug fixes:

* Missing `snowflake.ml.data` exports in wheel have been added.

Dataset bug fixes:

* Missing `snowflake.ml.dataset` exports in wheel have been added.

Model registry bug fixes:

* Fixed issue where `tf_keras.Model` was not recognized as a keras model when logging.

## Version 1.7.0 (2024-10-22)

### Behavior changes

General behavior changes:

* Python 3.9 is now the minimum required version.

Data connector behavior changes:

* `to_torch_dataset` and `to_torch_datapipe` now create a dimension of 1 for scalar data.
  This allows more seamless integration with the PyTorch DataLoader, which creates batches by stacking inputs.
  The following example illustrates the difference.

  ```python
  ds = connector.to_torch_dataset(shuffle=False, batch_size=3)
  ```

  + Input data: `"col1": [10, 11, 12]`

    - Previous result: `array([10., 11., 12.])` with shape `(3,)`
    - New result: `array([[10.], [11.], [12.]])` with shape `(3, 1)`
  + Input data: `[[0, 100], [1, 110], [2, 200]]`

    - Previous result: `array([[ 0, 100], [ 1, 110], [ 2, 200]])` with shape `(3,2)`
    - New result: No change
* You can now specify a batch size of `None` in `to_torch_dataset` to squeeze dimensions of 1 for better
  interoperability with the PyTorch DataLoader. `None` is the new default batch size.

Model Development behavior changes:

* The `eps` (epsilon) argument is no longer used with the `log_loss` metric. The argument is still accepted for
  backward compatibility, but its value is ignored, and the epsilon is now computed by the underlying scikit-lean
  implementation.

Model Registry behavior changes:

* External access integrations are no longer required when creating an inference service in Snowflake 8.40 or later.

### New features

New Model Registry features:

* You can now pass keyword arguments when instantiating `ModelContext` to provide a variable number of
  context values. For example:

  ```python
  mc = custom_model.ModelContext(
      config = 'local_model_dir/config.json',
      m1 = model1
  )

  class ExamplePipelineModel(custom_model.CustomModel):
      def __init__(self, context: custom_model.ModelContext) -> None:
          super().__init__(context)
          v = open(self.context['config']).read()
          self.bias = json.loads(v)['bias']
      @custom_model.inference_api
      def predict(self, input: pd.DataFrame) -> pd.DataFrame:
          model_output = self.context['m1'].predict(input)
          return pd.DataFrame({'output': model_output + self.bias})
  ```
* Support for pandas’s `CategoricalDtype` for categorical columns.
* `log_model` method now accepts both `signature` and `sample_input_data` parameters
  to capture background data from explainability and data lineage.

### Bug fixes

Data Connector bug fixes:

* For multi-dimensional data, `to_torch_dataset` and `to_torch_datapipe` now return a numpy array with an appropriate
  data type instead of a list.

Feature Store bug fixes:

* Fixed an issue where `ExampleHelper` used an incomplete table name.
* Changed weather features aggregation time to one hour instead of one day.

Model Explainability bug fixes:

* Fixed an issue with explainability for XGBoost models by using a new SHAP library version.

## Version 1.6.4 (2024-10-17)

### Bug fixes

Model Registry bug fixes:

* Fix issue with using `ModelVersion.run` with Model Serving (inference on SPCS).

## Version 1.6.3 (2024-10-07)

### Behavior changes

Model Registry behavior changes:

* This release no longer contains the preview Model Registry API. Use the public API in `snowflake.ml.model_registry` instead.

### Bug fixes

Model Registry bug fixes:

* Fix unexpected package name normaliations for packages that do not follow [PEP-508](https://peps.python.org/pep-0508/)
  conventions when logging a model.
* Fix “Not a valid remote URI” error when logging MLflow models.
* Fix nested calls to `ModelVersion.run`.
* Fix `log_model` failure when a local package version number contains parts other than the base version.

### New features

New Model Registry features:

* You can now set a task type for the model in `log_model` via the `task` parameter.

New Feature Store features:

* `FeatureView` now supports `ON_CREATE` and `ON_SCHEDULE` initializion modes.

## Version 1.6.2 (2024-09-04)

### Bug fixes

* Fix a bug involving invalid names passed where fully-qualified names were required. These now correctly raise
  an exception.

Modeling bug fixes:

* Correctly log models built using XGBoost version 2 and higher.

Model explainability bug fixes:

* Workarounds and better error handling for XGBoost version 2.1.0 and higher.
* Correctly handle multiclass XGBoost classification models

### New features

New Feature Store features:

* The `update_feature_view` method now accepts a `FeatureView` object as an alternative to name and version.

## Version 1.6.1 (2024-08-13)

### Bug fixes

Feature Store bug fixes:

* Metadata size is no longer limited when generating a dataset.

Model Registry bug fixes:

* Fix an error message in the `run` method of model versions when a function name is not given and the model has
  multiple target methods.

### New features

New Modeling features:

* The `set_params` method is now available to set the parameters of the underlying scikit-learn estimator, if the
  Snowpark ML model has been fitted.

New Model Registry features:

* Support for model explainability in XGBoost, LightGBM, CatBoost, and scikit-learn models supported by the `shap` ibrary.

## Version 1.6.0 (2024-07-29)

### Behavior changes

Feature Store behavior changes:

* Many positional arguments are now keyword arguments. The following table lists the affected arguments for each method.

  | Method | Arguments |
  | --- | --- |
  | `Entity` initializer | `desc` |
  | `FeatureView` initializer | `timestamp_col`, `refresh_freq`, `desc` |
  | `FeatureStore` initializer | `creation_mode` |
  | `FeatureStore.update_entity` | `desc` |
  | `FeatureStore.register_feature_view` | `block`, `overwrite` |
  | `FeatureStore.list_feature_views` | `entity_name`, `feature_view_name` |
  | `FeatureStore.get_refresh_history` | `verbose` |
  | `Feature:Store.retrieve_feature_values` | `spine_timestamp_col`, `exclude_columns`, `include_feature_view_timestamp_col` |
  | `FeatureStore.generate_training_set` | `save_as`, `spine_timestamp_col`, `spine_label_cols`, `exclude_columns`, `include_feature_view_timestamp_col` |
  | `FeatureStore.generate_dataset` | `version`, `spine_timestamp_col`, `spine_label_cols`, `exclude_columns`, `include_feature_view_timestamp_col`, `desc`, `output_type` |
* Add new column `warehouse` to the output of `list_feature_views`.

### Bug fixes

Modeling bug fixes:

* Fixed an issue in which `SimpleImputer` could not impute integer columns with integer values.

Model Registry bug fixes:

* Fixed an issue when providing a non-zero-index-based pandas Dataframe `ModelVersion.run`.

### New features

New Feature Store features:

* Added overloads to certain methods to accept both a `FeatureView` and name/version strings. Affected APIs include `read_feature_view`,
  `refresh_feature_view`, `get_refresh_history`, `resume_feature_view`, `suspend_feature_view`, and `delete_feature_view`.
* Added docstring inline examples for all public APIs.
* Added `ExampleHelper` utility class to help with loading source data to simplify public notebooks.
* Added `update_entity` method.
* Added `warehouse` argument to `FeatureView` constructor to override the default warehouse.

New Model Registry features:

* Added option to enable explainability when registering XGBoost, LightGBM, and Catboost models.
* Added support for logging a model from a `ModelVersion` object.

New modeling features:

* You can disable the 10GB training data size limit in distributed hyperparameter optimization by executing:

  ```python
  from snowflake.ml.modeling._internal.snowpark_implementations import ( distributed_hpo_trainer, )
  distributed_hpo_trainer.ENABLE_EFFICIENT_MEMORY_USAGE = False
  ```

## Version 1.5.4 (2024-07-11)

### Bug fixes

Model Registry bug fixes:

* Fixed “401 Unauthorized” issue when deploying a model to Snowpark Container Services.

Feature Store bug fixes:

* Some exceptions in property setters have been downgraded to warnings, allowing you to change `desc`,
  `refresh_freq`, and `warehouse` in “draft” feature views.

Modeling bug fixes:

* Fixed issues with calling `OneHotEncoder` and `OrdinalEncoder` with a dictionary as the `categories` parameter and
  the data in a pandas DataFrame.

### New features

New Model Registry features:

* Allow overriding `device_map` and `device` when loading Hugging Face pipeline models.
* Add `set_alias` and `unset_alias` methods to `ModelVersion` instances to manage the model version’s aliases.
* Add `partitioned_inference_api` decorator to create partitioned inference methods in models.

New Feature Store features:

* New `refresh_freq`, `refresh_mode`, and `scheduling_state` columns have been added to the output of the
  `list_feature_views` method.
* The `update_feature_view` method now supports updating a feature view’s description.
* New methods `refresh_feature_view` and `get_refresh_history` manage updates of feature views.
* New method `generate_training_set` generates table-backed feature snapshots. `generate_dataset(...,
  output_type="table")` has been deprecated and generates a `DeprecationWarning`.

New Modeling features:

* `OneHotEncoder` and `OrdinalEncoder` now accept a list of array-like values for the `categories` argument.

## Version 1.5.3 (2024-06-17)

### Bug fixes

Model Registry bug fixes:

* Fix an issue causing incorrect results when using a pandas Dataframe with over 100,000 rows as the input of `ModelVersion.run` method in Stored Procedures.

Modeling bug fixes:

* Fix an issue with passing categories to `OneHotEncoder` and `OrdinalEncoder` as a dictionary or as a pandas DataFrame.

### New features

New Model Registry features:

* Model Registry now supports timestamp (TIMESTAMP_NTZ) columns in input and output data.

New modeling features:

* `OneHotEncoder` and `OrdinalEncoder` now support a list of array-like values for the `categories` argument.

New Dataset features:

* `DatasetVersion` instances now have `label_cols` and `exclude_cols` properties.

## Version 1.5.2 (2024-06-10)

### Bug fixes

Model Registry bug fixes:

* Fixed an issue that prevented calls to `log_model` in a stored procedure.

Modeling bug fixes:

* Quick fix for `import snowflake.ml.modeling.parameters.enable_anonymous_sproc` not working due to package dependency error.

## Version 1.5.1 (2024-05-22)

### New features

New Model Registry features:

* `log_model`, `get_model`, and `delete_model` methods now support fully-qualified names.

New modeling features:

* You can now use an anonymous stored procedure during fitting, so that modeling does not require privileges to operate
  on the registry schema. Call `import snowflake.ml.modeling.parameters.enable_anonymous_sproc` to enable this feature.

### Bug fixes

Model registry bug fixes:

* Fix issue with loading older models.

## Version 1.5.0 (2024-05-01)

### Behavior changes

Model Registry behavior changes:

* The `fit_transform` method can now return either a Snowpark DataFrame or a pandas DataFrame, matching the kind of
  DataFrame passed to the method.

### New features

New Model Registry features:

* Added support for exporting models from the registry (`ModelVersion.export`).
* Added support for loading the underlying model object (`ModelVersion.load`).
* Added support for renaming models (`Model.rename`).

### Bug fixes

Model Registry bug fixes:

* Fixed the “invalid parameter `SHOW_MODEL_DETAILS_IN_SHOW_VERSIONS_IN_MODEL`” error.

## Version 1.4.1 (2024-04-18)

### New features

New Model Registry features:

* Added support for catboost models (`catboost.CatBoostClassifier`, `catboost.CatBoostRegressor`).
* Added support for lightgbm models (`lightgbm.Booster`, `lightgbm.LightGBMClassifier`, `lightgbm.LightGBMRegressor`).

### Bug fixes

Model Registry bug fixes:

* Fixed bug that caused `relax_version` option to not work.

## Version 1.4.0 (2024-04-08)

### Behavior changes

Model Registry behavior changes:

* The `apply` method is no longer included as a target method by default when logging an XGBoost model. If you need
  this method available in logged models, included it manually in the `target-methods` option:

  ```python
  log_model(..., options={"target_methods": ["apply", ...]})
  ```

### New features

New model registry features:

* The registry now supports logging sentence transformer models (`sentence_transformers.SentenceTransformer`).
* The `version_name` argument is no longer required when logging a model. A random human-readable ID is generated if
  none is provided.

### Bug fixes

Model registry bug fixes:

* Fix issue where, when multiple models are called in the same query, models after the first returned incorrect results.
  This fix is applied when models are logged and does not benefit existing models; you must log your models again to
  correct this behavior.

Modeling bug fixes:

* Fix bug in registering a model where only methods mentioned in `save_model` were added to the model signature for
  Snowpark ML models.
* Fix bug in batch inference methods such as such as `predict` and `predict_log_probe` where, when `n_jobs` was
  not 1, the methods would not be executed.
* Fix bug in batch inference methods where they could not infer datatypes when the first row of data contained NULL.
* The output column names from distributed hyperparameter optimization are now correctly matched with the Snowflake identifier.
* Relaxed the versions of dependencies of distributed hyperparameter optimization methods; these were too strict and
  caused these methods to fail.
* scikit-learn is now listed as a dependency of the LightGBM package.

## Version 1.3.1 (2024-03-21)

### New features

FileSet/FileSystem updates:

* `snowflake.ml.fileset.sfcfs.SFFileSystem` can now be used in UDFs and stored procedures.

## Version 1.3.0 (2024-03-12)

### Behavior changes

Model registry behavior changes:

* As previously announced, the default for the `relax_version` option (in the `options` argument of `log_model`)
  is now `True`, allowing more reliable deployment in most cases by permitting dependency versions available in
  Snowflake.
* When running model methods, value range based input validation (which prevents input from overflowing) is now optional.
  This should improve performance and should not lead to issues for most types of models. To enable validation, pass the
  named argument `strict_input_validation=True` when calling the model’s `run` method.

Model development behavior changes:

* The `fit_predict` method now returns either a pandas or a Snowpark DataFrame, depending on the type of the input
  data, and is available on all classes where it is available in the underlying scikit-learn, xgboost, or lightgbm
  class.

### New features and updates

FileSet/FileSystem updates:

* Instances of `snowflake.ml.fileset.sfcfs.SFFileSystem` can now be serialized with `pickle`.

### Bug fixes

Model registry bug fixes:

* Fix a problem with importing `log_model` in some circumstances.
* Fix an incorrect error message when validating input Snowpark DataFrame with an array feature.

Model development bug fixes:

* Relax package versions for all inference methods when the installed version of a dependency is not available in the
  Snowflake conda channel.

## Version 1.2.3 (2024-02-26)

### New features and updates

Model development updates:

* All modeling classes now include a `score_samples` method to calculate the log-likelihood of the given samples.

Model registry updates:

* Decimal type features are automatically cast (with a warning) to a DOUBLE or FLOAT instead of producing an error.
* Improve error message for currently-unsupported `pip-requirements` option.
* You can now delete a version of a model.

### Bug fixes

Model development fixes:

* `precision_recall_fscore_support` returned incorrect results with `average="samples"`.

Model registry fixes:

* Descriptions, models, and tags were not retrieved correctly in newly-created registries under the private preview
  model registry API due to a recent Snowflake behavior change.

## Version 1.2.2 (2024-02-13)

### New features and updates

Model registry updates:

* You can now specify external access integrations when deploying a model to Snowpark Container Services using the
  private preview registry API, allowing models to access the internet to retrieve dependencies during deployment. The
  following endpoints are required for all deployments:

  + docker.com:80
  + docker.com:443
  + anaconda.com:80
  + anaconda.com:443
  + anaconda.org:80
  + anaconda.org:443
  + pypi.org:80
  + pypi.org:443

  For models derived from `HuggingFacePipeLineModel`, the following endpoints are required.

  + huggingface.com:80
  + huggingface.com:443
  + huggingface.co:80
  + huggingface.co:443

## Version 1.2.1 (2024-01-25)

### New features and updates

Model development updates:

* Infer column data type for transformers when possible.

Model registry updates:

* `relax_version` option (in `options` argument of `log_model`) relaxes dependencies of stated versions
  to allow newer minor versions when set to `True`.

## Version 1.2.0 (2024-01-12)

### New features and updates

Public preview release of model registry. See [Snowflake Model Registry](../../developer-guide/snowflake-ml/model-registry/overview.md).
The previous private preview release of the model registry has been deprecated, but will continue to be supported
while it includes features not yet available in the public preview version.

Model development updates:

* Added support for `fit_predict` method in AgglomerativeClustering, DBSCAN, and OPTICS classes.
* Added support for `fit_transform` method in MDS, SpectralEmbedding and TSNE class.

---
title: Snowflake ML release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-ml-2025.md
section: Release Notes
---

# Snowflake ML release notes

This article contains the release notes for the Snowflake ML, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> These notes do not include changes in features that have not been publicly announced.
> Such features might appear in the Snowflake ML source code but not in the public documentation.

See [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md) for documentation.

## Verifying the `snowflake-ml-python` package

All Snowflake packages are signed, allowing you to verify their origin. To verify the `snowflake.ml.python` package, follow the steps below:

1. Install `cosign`. This example uses the Go installation:
   [Installing cosign with Go](https://edu.chainguard.dev/open-source/sigstore/cosign/how-to-install-cosign/#installing-cosign-with-go).
2. Download the file from a repository such as [PyPi](https://pypi.org/project/snowflake-ml-python/#files).
3. Download a `.sig` file for that release from the GitHub [releases page](https://github.com/snowflakedb/snowflake-ml-python/releases/).
4. Verify the signature using `cosign`. For example:

```bash
cosign verify-blob snowflake_ml_python-1.7.0.tar.gz --key snowflake-ml-python-1.7.0.pub --signature resources.linux.snowflake_ml_python-1.7.0.tar.gz.sig

cosign verify-blob snowflake_ml_python-1.7.0.tar.gz --key snowflake-ml-python-1.7.0.pub --signature resources.linux.snowflake_ml_python-1.7.0
```

> **Note:**
>
> This example uses the library and signature for version 1.7.0 of the package. Use the filenames of the version you are verifying.

## Deprecation notices

* `snowflake.ml.fileset.FileSet` has been deprecated and will be removed in a future release. Use
  [snowflake.ml.dataset.Dataset](../../developer-guide/snowflake-ml/dataset.md) and
  [snowflake.ml.data.DataConnector](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector) instead.
* The “CamelCase” function names in `snowflake.ml.cortex` have been deprecated and will be removed in a future
  release. Use the “snake_case” names for these functions instead. For example, use `classify_text` instead of
  `ClassifyText`.
* The `partitioned_inference_api` decorator has been deprecated and will be removed in a future release. Use `custom_model.partitioned_api` instead.
* The `additional_payloads` argument of the `MLJob.submit_*` methods has been deprecated and will be removed in a future release.
  Use the `imports` argument instead.

## Version 1.20.0 (2025-12-02)

### Bug fixes

Experiment Tracking bug fixes:

* Exceeding the run metadata size limit in `log_metrics` or `og_params` issues a warning rather than raising an exception.

### New features

New Model Registry features:

* vLLM is now supported as an inference back-end. The `create_service` API accepts a new argument, `inference_engine_options`,
  which allows you to specify the inference engine to use and other engine-specific options. To specify vLLM, set the `inference_engine` option to `InferenceEngine.VLLM`.
  The following code is an example of creating a service using the vLLM inference engine:

  ```python
  from snowflake.ml.model.inference_engine import InferenceEngine

  mv = snowflake_registry.log_model(
      model=generator,
      model_name=...,
      ...,
      # Specifying OPENAI_CHAT_SIGNATURE is necessary to use vLLM inference engine
      signatures=openai_signatures.OPENAI_CHAT_SIGNATURE,
  )

  mv.create_service(
      service_name=my_serv,
      service_compute_pool=...,
      ...,
      inference_engine_options={
          "engine": InferenceEngine.VLLM,
          "engine_args_override": [
              "--max-model-len=2048",
              "--gpu-memory-utilization=0.9"
          ]
      }
  )
  ```

  + Prophet is now supported as a modeling framework.

## Version 1.19.0 (2025-11-13)

### Bug fixes

Model Registry bug fixes:

* `get_version_by_alias` now requires an exact match of the version’s Snowflake identifier.

### New preview features

* Experiment Tracking API (`snowflake.ml.ExperimentTracking` module)
* Online feature serving in Feature Store.

## Version 1.18.0 (2025-10-23)

### New features

New Model Registry features:

* The `create_service` API validates that a model has a GPU runtime configuration and throws a descriptive error if
  the configuration is missing

### Deprecations

Support for Python 3.9 has been deprecated. Python 3.10 or later is recommended.

## Version 1.17.0 (2025-10-20)

### New features

New modeling features:

* Support for `xgboost` 3.x

New ML Jobs features:

* `MLJobs.result` API is more broadly cross-version compatible and support pandas DataFrames,
  pyarrow Tables, and NumPy arrays.
* Job submission now uses v2 of the job submission API by default. v2 APIs use the latest container
  runtime imade by default. Set the MLRS_USE_SUBMIT_JOB_V2 to false to use v1 of the job submission API.
* Now supports retriieving details of deleted jobs, including status, compute pool, and target instances.

## Version 1.16.0 (2025-10-06)

### Bug fixes

Model Registry bug fixes:

* Remove redundant pip dependency warnings when `artifact_repository_map` is provided for warehouse model deployments.

### New features

New modeling features:

* Support for `scikit-learn` versions earlier than 1.8.

New ML Jobs features:

* Support for configuring the runtime image via the `runtime_environment` parameter at submission time. You may specify an image tag or a full image URL.

  Examples for `@remote` decorator and `submit_file` function:

  ```python
  @remote(compute_pool, stage_name = 'payload_stage', runtime_environment = '1.8.0')

  submit_file('/path/to/repo/test.py', compute_pool, stage_name = 'payload_stage',
    runtime_environment = '/mydb/myschema/myrepo/myimage:latest')
  ```

New Model Registry features:

* Ability to mark model methods as volatile or immutable. Volatile methods may return different results when called multiple times with the same input,
  while immutable methods always return the same result for the same input. Methods in supported model types are immutable by default, while methods
  in custom models are volatile by default. Use the `Volatility` enum to specify the volatility of model methods when logging a model as follows:

  ```python
  from snowflake.ml.model.volatility import Volatility

  options = {
      "embed_local_ml_library": True,
      "relax_version": True,
      "save_location": "/path/to/my/directory",
      "function_type": "TABLE_FUNCTION",
      "volatility": Volatility.IMMUTABLE,
      "method_options": {
          "predict": {
              "case_sensitive": False,
              "max_batch_size": 100,
              "function_type": "TABLE_FUNCTION",
              "volatility": Volatility.VOLATILE,
          },
  }
  ```

## Version 1.15.0 (2025-09-29)

### Behavior changes

Model Registry behavior changes:

* Drop support for deprecated `conversational` task type for Hugging Face models. This task type has been
  deprecated by Hugging Face for some time and is due for removal from their API imminently.

## Version 1.14.0 (2025-09-18)

### New features

New ML Jobs features:

* The `additional_payloads` argument of the `MLJob.submit_*` methods has been renamed `imports` to better reflect its purpose.
  `additional_payloads` has been deprecated and will be removed in a future release.

## Version 1.13.0 (2025-09-11)

### New features

New Model Registry features:

* You can now log a HuggingFace model without having to load the model in memory using `huggingface_pipeline.HuggingFacePipelineModel`.
  Requires the `huggingface_hub` package. To disable downloading from the HuggingFace repository, pall `download_snapshot=False` when
  instantiating `huggingface_pipeline.HuggingFacePipelineModel`.
* You can now use XGBoost’s `enable_categorical=True` models to with pandas DataFrames
* When listing services, the PrivateLink inference endpoint in shown in the `ModelVersion` list.

## Version 1.12.0 (2025-09-04)

### Bug fixes

Model Registry bug fixes:

* Fixed an issue where the string representation of dictionary-type output columns was being incorrectly created during structured output deserialization,
  losing the original data type.
* Fixed an inference server performance issue for wide (500+ features) and JSON inputs.

### New features

New Model Registry features:

* You can now log text-generation models with signatures compatible with OpenAI chat completion compatible signature, as shown in the following example:

  > ```python
  > from snowflake.ml.model import openai_signatures
  > import pandas as pd
  >
  > mv = snowflake_registry.log_model(
  >     model=generator,
  >     model_name=...,
  >     ...,
  >     signatures=openai_signatures.OPENAI_CHAT_SIGNATURE,
  > )
  >
  > # create a pd.DataFrame with openai.client.chat.completions arguments like below:
  > x_df = pd.DataFrame.from_records(
  >     [
  >         {
  >             "messages": [
  >                 {"role": "system", "content": "Complete the sentence."},
  >                 {
  >                     "role": "user",
  >                     "content": "A descendant of the Lost City of Atlantis, who swam to Earth while saying, ",
  >                 },
  >             ],
  >             "max_completion_tokens": 250,
  >             "temperature": 0.9,
  >             "stop": None,
  >             "n": 3,
  >             "stream": False,
  >             "top_p": 1.0,
  >             "frequency_penalty": 0.1,
  >             "presence_penalty": 0.2,
  >         }
  >     ],
  > )
  > ```

New Model Monitoring features:

* Model monitors now support segment columns to enable filtered analysis, specified in the `segment_columns` field in the model monitor source options.
  Segment columns must exist in the source table and be of string type. `add_segment_column` and `drop_segment_column` methods are provided
  to add or remove segment columns in existing model monitors.

## Version 1.11.0 (2025-08-12)

### New features

New Model Registry features:

* Made `image_repo` argument optional in `ModelVersion.create_service`. If not specified, a default image repository is used.

### Bug fixes

ML Jobs bug fixes:

* Fixed `TypeError: SnowflakeCursor.execute() got an unexpected keyword argument '_force_qmark_paramstyle'` inside stored procedure.
* Fixed `Error: Unable to retrieve head IP address` when not all instances start within the timeout period.

## Version 1.10.0 (2025-08-01)

### New features

New Model Registry features:

* Added progress bars for `ModelVersion.create_service` and `ModelVersion.log_model`.
* Logs from `ModelVersion.create_service` are now written to a file. The location of the log file is shown in the console.

## Version 1.9.2 (2025-07-28)

### Bug fixes

DataConnector bug fixes:

* Fixed a problem that caused errors mentioning `self._session`.

Model Registry bug fixes:

* Fixed a bug when passing None to array (`pd.dtype('O')`) in model signature and pandas data handler.

## Version 1.9.1 (2025-07-18)

### Bug fixes

Model Registry bug fixes:

* Fix a bug with setting the PAD token when the HuggingFace text-generation model had multiple EOS tokens. The handler now picks the first EOS token as PAD token.

### New features

New DataConnector features:

* DataConnector objects can now be pickled.

New Dataset features:

* Dataset objects can now be pickled.

New Model Registry features:

* Models hosted on Snowpark Container Services now support wide input (500+ features).

## Version 1.9.0 (2025-06-25)

### Behavior changes

ML Jobs behavior changes:

* Removed `scope` parameter from `list_jobs` method.
* Added optional `database` and `schema` parameters to `list_jobs` method.
* The `list_jobs` method now returns a pandas DataFrame rather than a Snowpark DataFrame.
* The `list_jobs` method now returns the following columns: `name`, `status`, `message`, `database_name`, `schema_name`,
  `owner`, `compute_pool`, `target_instances`, `created_time`, and `completed_time`.

Model registry behavior changes:

* Set `relax_version` to false when `pip_requirements` is specified in `log_model` call.
* `UserWarning` is raised only on specified `target_platforms` to address spurious warnings

### Bug fixes

Model registry bug fixes:

* Fixed failure in converting Snowpark DataFrame to pandas DataFrame when QUOTED_IDENTIFIERS_IGNORE_CASE parameter is enabled
* Fixed duplicate `UserWarning` log entries during model packaging

### New features

New model registry features:

* New APIs for representing target platforms (`snowflake.ml.model.target_platform.TargetPlatform`), target platform
  constant,s and tasks (`snowflake.ml.model.task.Task`).
* The `target_platform` argument in the `log_model` method now accepts a `TargetPlatformMode` constant, which can be
  WAREHOUSE_ONLY, SNOWPARK_CONTAINER_SERVICES_ONLY, or BOTH_WAREHOUSE_AND_SNOWPARK_CONTAINER_SERVICES.

New ML Jobs features:

* Less-frequently-used job submission arguments have been moved to `**kwargs`.
* Platform metrics are enabled by default.

With this release, single-node ML Job APIs are now stable and have been designated Generally Available.

## Version 1.8.6 (2025-06-18)

### New features

New model registry features:

* Added service container information to logs

## Version 1.8.5 (2025-05-27)

### Behavior changes

ML Jobs behavior changes:

* Argument `num_instances` has been renamed to `target_instances` in job submission APIs and is now required.

### Bug fixes

Model Registry bug fixes:

* Fixed a bug in listing and deleting container services.
* Fixed a bug with logging scikit-learn pipelines where the `explain` function was not created.
* Logging a container-only model no longer checks to make sure the required version of `snowflake-ml-python` is available in the Snowflake conda channel.

Explainability bug fixes:

* Minimum `streamlit` version has been decreased to 1.30 to improve compatibility.

Modeling bug fixes:

* `xgboost` is now a required dependency again (it was optional in v1.8.4).

### New features

ML Jobs new features:

* Job decorator now has `min_instances` argument that makes a job wait for the specified number of workers to be ready before starting.

## Version 1.8.4 (2025-05-12)

### Behavior changes

ML Jobs behavior changes:

* The `id` property is now the job’s fully-qualified name. A new property, `name`, has been introduced to represent the ML Job name.
* The `list_jobs` method now returns the ML Job name instead of the job ID.

Model Registry behavior changes:

* In `log_model`, enabling explainability when the model is deployed only to Snowpark Container Services is now an error instead of a warning and will prevent the log operation from completing.

### Bug fixes

Model Registry bug fixes:

* Fixed a bug in which logging PyTorch and TensorFlow models that caused `UnboundLocalError: local variable 'multiple_inputs' referenced before assignment.`

### New features

New Model Registry features:

* Automatically enable explainability for models that can be deployed to a warehouse.

New Explainability features:

* New visualization functions in `snowflake.ml.monitoring` plot explanations in notebooks.
* Support for categorical transforms in scikit-learn pipelines.

New Modeling features:

* Support categorical types for `xgboost.DMatrix` inputs in XGBoost models.

## Version 1.8.3 (2025-04-28)

### New features

New Model Registry features:

* Default to a CUDA container image, if available, when logging a GPU-capable model for deployment to Container Runtime for ML.
* Model versions have a `run_job` method that runs inference methods as a single-node Snowpark Container Services job. This method is available for all models, including those that are not deployed to Container Runtime for ML.
* The target platform defaults to a Snowflake warehouse when logging a partitioned model.

## Version 1.8.2 (2025-04-15)

### New features

The [ML Jobs](../../developer-guide/snowflake-ml/ml-jobs/overview.md) API, which allows you to run code on
Container Runtime for ML from your local workstation, is available in preview. Accordingly, documentation for this API is available
in the Snowflake ML API Reference, and changes to the API appear in these release notes. New features in the ML Jobs API
might not appear here until they are publicly announced, but they do appear in the API reference.

New Model Registry features:

* You can specify the path to write the model versions files that are stored in the model’s Snowflake stage using the `save_location` option in the `log_model` method.
* When logging models in Container Runtime for ML, model dependencies are now included in `pip_requirements` by default.

## Version 1.8.1 (2025-03-20)

### Bug fixes

Model Registry bug fixes:

* Fix `unsupported model type` error when logging a scikit-learn model with a `score_samples` inference method.
* Fix failure of inference service creation on an existing suspended service.

### New features

New Model Registry features:

* Creating a copy of a model version with `log_model` with unsupported arguments now raises an exception.

## Version 1.8.0 (2025-03-20)

### Behavior changes

Model Registry behavior changes:

* Automatically-inferred signatures in `transformers.Pipeline` have been changed to use the `FeatureGroupSpec` task class, including:

  + Signature for fill-mask tasks:

    Before v1.8.0v1.8.0 and later

    ```python
      ModelSignature(
          inputs=[
              FeatureSpec(name="inputs", dtype=DataType.STRING),
          ],
          outputs=[
              FeatureSpec(name="outputs", dtype=DataType.STRING),
          ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="inputs", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureGroupSpec(
                name="outputs",
                specs=[
                    FeatureSpec(name="sequence", dtype=DataType.STRING),
                    FeatureSpec(name="score", dtype=DataType.DOUBLE),
                    FeatureSpec(name="token", dtype=DataType.INT64),
                    FeatureSpec(name="token_str", dtype=DataType.STRING),
                ],
                shape=(-1,),
            ),
        ],
    )
    ```
  + Signature for token classification tasks:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="inputs", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="outputs", dtype=DataType.STRING),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[FeatureSpec(name="inputs", dtype=DataType.STRING)],
        outputs=[
            FeatureGroupSpec(
                name="outputs",
                specs=[
                    FeatureSpec(name="word", dtype=DataType.STRING),
                    FeatureSpec(name="score", dtype=DataType.DOUBLE),
                    FeatureSpec(name="entity", dtype=DataType.STRING),
                    FeatureSpec(name="index", dtype=DataType.INT64),
                    FeatureSpec(name="start", dtype=DataType.INT64),
                    FeatureSpec(name="end", dtype=DataType.INT64),
                ],
                shape=(-1,),
            ),
        ],
    )
    ```
  + Signature for question-answering tasks:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="question", dtype=DataType.STRING),
            FeatureSpec(name="context", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="outputs", dtype=DataType.STRING),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="question", dtype=DataType.STRING),
            FeatureSpec(name="context", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureGroupSpec(
                name="answers",
                specs=[
                    FeatureSpec(name="score", dtype=DataType.DOUBLE),
                    FeatureSpec(name="start", dtype=DataType.INT64),
                    FeatureSpec(name="end", dtype=DataType.INT64),
                    FeatureSpec(name="answer", dtype=DataType.STRING),
                ],
                shape=(-1,),
            ),
        ],
    )
    ```
  + Signature for question-answering tasks when `top_k` is greater than 1:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="question", dtype=DataType.STRING),
            FeatureSpec(name="context", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="outputs", dtype=DataType.STRING),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="question", dtype=DataType.STRING),
            FeatureSpec(name="context", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureGroupSpec(
                name="answers",
                specs=[
                    FeatureSpec(name="score", dtype=DataType.DOUBLE),
                    FeatureSpec(name="start", dtype=DataType.INT64),
                    FeatureSpec(name="end", dtype=DataType.INT64),
                    FeatureSpec(name="answer", dtype=DataType.STRING),
                ],
                shape=(-1,),
            ),
        ],
    )
    ```
  + Signature for text-classification tasks when `top_k` is `None`:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="text", dtype=DataType.STRING),
            FeatureSpec(name="text_pair", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="label", dtype=DataType.STRING),
            FeatureSpec(name="score", dtype=DataType.DOUBLE),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="text", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="label", dtype=DataType.STRING),
            FeatureSpec(name="score", dtype=DataType.DOUBLE),
        ],
    )
    ```
  + Signature for text-classification tasks when `top_k` is not `None`:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="text", dtype=DataType.STRING),
            FeatureSpec(name="text_pair", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureSpec(name="outputs", dtype=DataType.STRING),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureSpec(name="text", dtype=DataType.STRING),
        ],
        outputs=[
            FeatureGroupSpec(
                name="labels",
                specs=[
                    FeatureSpec(name="label", dtype=DataType.STRING),
                    FeatureSpec(name="score", dtype=DataType.DOUBLE),
                ],
                shape=(-1,),
            ),
        ],
    )
    ```
  + Signature for text-generation tasks:

    Before v1.8.0v1.8.0 and later

    ```python
    ModelSignature(
        inputs=[FeatureSpec(name="inputs", dtype=DataType.STRING)],
        outputs=[
            FeatureSpec(name="outputs", dtype=DataType.STRING),
        ],
    )
    ```

    ```python
    ModelSignature(
        inputs=[
            FeatureGroupSpec(
                name="inputs",
                specs=[
                    FeatureSpec(name="role", dtype=DataType.STRING),
                    FeatureSpec(name="content", dtype=DataType.STRING),
                ],
                shape=(-1,),
            ),
        ],
        outputs=[
            FeatureGroupSpec(
                name="outputs",
                specs=[
                    FeatureSpec(name="generated_text", dtype=DataType.STRING),
                ],
                shape=(-1,),
            )
        ],
    )
    ```
* PyTorch and TensorFlow models now expect a single tensor input and output by default when they are logged to the Model Registry. To
  use multiple tensors (previous behavior), set `options={"multiple_inputs": True}`.

  Example with single tensor input:

  ```python
  import torch

  class TorchModel(torch.nn.Module):
      def __init__(self, n_input: int, n_hidden: int, n_out: int, dtype: torch.dtype = torch.float32) -> None:
          super().__init__()
          self.model = torch.nn.Sequential(
              torch.nn.Linear(n_input, n_hidden, dtype=dtype),
              torch.nn.ReLU(),
              torch.nn.Linear(n_hidden, n_out, dtype=dtype),
              torch.nn.Sigmoid(),
          )

      def forward(self, tensor: torch.Tensor) -> torch.Tensor:
          return cast(torch.Tensor, self.model(tensor))

  # Sample usage:
  data_x = torch.rand(size=(batch_size, n_input))

  # Log model with single tensor
  reg.log_model(
      model=model,
      ...,
      sample_input_data=data_x
  )

  # Run inference with single tensor
  mv.run(data_x)
  ```

  For multiple tensor inputs or outputs, use:

  ```python
  reg.log_model(
      model=model,
      ...,
      sample_input_data=[data_x_1, data_x_2],
      options={"multiple_inputs": True}
  )
  ```
* `enable_explainability` now defaults to `False` when the model can be deployed to Snowpark Container Services.

### Bug fixes

Modeling bug fixes:

* Fix a bug in some metrics that allowed an unsupported version of numpy to be installed automatically in the stored
  procedure, resulting in a numpy error on execution.

Model Registry bug fixes:

* Fix a bug that leads to incorrect `Model does not have _is_inference_api` error message when assigning a supported model as a property of a `CustomModel`.
* Fix a bug where inference does not work when models with more than 500 input features are deployed to SPCS.

### New features

New Model Registry features:

* Support for using a single `torch.Tensor`, `tensorflow.Tensor` and `tensorflow.Variable` as input or output data.
* Support for `xgboost.DMatrix datatype` for XGBoost models.

## Version 1.7.5 (2025-03-06)

`snowflake-ml-python` 1.7.5 adds support for Python 3.12.

### Bug fixes

Model Registry bug fixes:

* Fixed a compatibility issue where, when using `snowflake-ml-python` 1.7.0 or later to save a `tensorflow.keras` model with keras 2.x,
  the model could not be run in Snowflake. This issue occurred when `relax_version` is set to `True` (or default) and a new version of `snowflake-ml-python` is available. If you have logged an affected model, you can recover it by loading it using `ModelVerison.load`
  and logging it again with the latest version of `snowflake-ml-python`.
* Removed the validation that prevents data that does not have non-null values from being passed to `ModelVersion.run`.

### New features

New Model Registry features:

* Support for Hugging Face model configurations with auto-mapping functionality.
* Support for keras 3.x models with tensorflow and pytorch backends.

New Model Explainability features:

* Support for native and `snowflake-ml-python` sklearn pipelines.

## Version 1.7.4 (2025-01-28)

> **Important:**
>
> `snowflake.ml.fileset.FileSet` has been deprecated and will be removed in a future release. Use
> [snowflake.ml.dataset.Dataset](../../developer-guide/snowflake-ml/dataset.md) and
> [snowflake.ml.data.DataConnector](/developer-guide/snowpark-ml/reference/latest/api/data/snowflake.ml.data.data_connector.DataConnector) instead.

### Bug fixes

Registry bug fixes:

* Fixed an issue in which Hugging Face pipelines were loaded using an incorrect data type.
* Fixed an issue in which only one row was actually used when inferring a model signature.

### New features

New Cortex features:

* New `guardrails` option on the `Complete` function.

## Version 1.7.3 (2025-01-09)

### Dependency upgrades

* `fsspec` and `s3fs` must be 2024.6.1 or later and less than 2026.
* `mlflow` must be 2.16.0 or later and less than 3.

### New features

New Cortex features:

* Cortex functions now have “snake_case” names. For example, `ClassifyText` is now `classify_text`. The old “CamelCase” names still work, but will be removed in a future release.

New Model Registry features:

* Registry now supports more than 500,000 features.
* Added `user_files` argument to `Registry.log_model` for including images or other files with the model.
* Added support for handling Hugging Face model configurations with auto-mapping functionality.

New Data features:

* Added the `DataConnector.from_sql` constructor.

### Bug fixes

Registry bug fixes:

* Fixed a bug that occurred when providing a non-range index pandas DataFrame as the input to `ModelVersion.run`.
* Improved random model registry name generation to avoid collisions.
* Fixed an issue when inferring a signature or running inference with Snowpark DataFrame that has a column whose type is ARRAY and contains a NULL value.
* `ModelVersion.run` now accepts a fully-qualified service name.
* Fixed an error in `log_model` for any scikit-learn models with only data preprocessing, including preprocessing-only pipeline models.

Monitoring bug fixes:

* Fixed an issue with creating monitors using fully-qualified names.

---
title: Snowflake Native App Framework Cannot use “UNVERSIONED” as the prefix of a version label
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-prevent-unversioned-in-version-name.md
section: Release Notes
---

# Snowflake Native App Framework Cannot use “UNVERSIONED” as the prefix of a version label

Beginning with the **7.40** release, applications installed from staged files will use the string “UNVERSIONED” as the version name.
This means that providers will not be able to create a version name using “UNVERSIONED” as a prefix.

Before the change:
:   Providers can begin a version name with “UNVERSIONED” as a prefix.

After the change:
:   Providers can no longer begin a version name with “UNVERSIONED” as a prefix.

    Attempts to use “UNVERSIONED” in the version name will result in an error.

Ref: n/a

---
title: Snowflake Native App Framework Changes to the version output for the SHOW APPLICATIONS and DESC APPLICATION commands
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-add-unversioned-status.md
section: Release Notes
---

# Snowflake Native App Framework Changes to the version output for the SHOW APPLICATIONS and DESC APPLICATION commands

For applications [created using staged files](../../../developer-guide/native-apps/installing-testing-application.md),
the output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) and
[DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) will change as follows:

Before the change:
:   The value of the `version` column of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command
    is `dev_stage`.

    The value of the `version` row of the [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) command
    is `dev_stage`.

After the change:
:   The value of the `version` column of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md) command
    will be `UNVERSIONED`.

    The value of the `version` row of the [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) command
    is `UNVERSIONED`.

Ref: n/a

---
title: Snowflake Native App Framework: Apps with containers share events with provider when consumer event table not set
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1800.md
section: Release Notes
---

# Snowflake Native App Framework: Apps with containers share events with provider when consumer event table not set

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

Previously, consumers could disable an event table by setting the value of the event table to NONE as
shown in the following example:

```sqlexample
ALTER ACCOUNT SET EVENT_TABLE = NONE
```

If the consumer set this value to NONE and they have enabled event sharing, log messages and trace events
were no longer shared with the provider.

[BCR-1724](../2024_06/bcr-1724.md) introduced a change where the log messages and
trace events the consumer has agreed to share are still shared with the provider, even if the event table is
set to NONE.

This behavior change introduces the same change for apps with containers. When this behavior change bundle
is enabled, the behavior of event sharing for apps with containers changes as follows:

Before the change:
:   If the consumer sets the event table to NONE and event sharing is enabled for an app with containers, log messages and trace
    events are no longer shared with the provider.

After the change:
:   If event sharing is enabled for an app with containers, the log messages and trace events the consumer has agreed to share are still
    shared with the provider, even if the event table is set to NONE.

Ref: 1800

---
title: Snowflake Native App Framework: Block creating event tables and temporary stages within an application package
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1366.md
section: Release Notes
---

# Snowflake Native App Framework: Block creating event tables and temporary stages within an application package

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

This behavior change prohibits providers from creating event tables or temporary stages within an
application package.

Currently:
:   Providers could create event tables and temporary stages within an application package

Pending:
:   Providers are prohibited from creating event tables and temporary stages within an application
    package. If a provider attempts to create an event table or temporary stage within the application
    package, they receive an error message that the object could not be created.

    To display a list of application packages, a provider can run the
    [SHOW APPLICATION PACKAGES](../../../sql-reference/sql/show-application-packages.md) command. To display the event tables in an
    application package, a provider can run the [SHOW EVENT TABLES](../../../sql-reference/sql/show-event-tables.md) command

Ref: 1366

---
title: Snowflake Native App Framework: Changes to the SHOW APPLICATION and DESC APPLICATION commands
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1729.md
section: Release Notes
---

# Snowflake Native App Framework: Changes to the SHOW APPLICATION and DESC APPLICATION commands

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, the output of the [SHOW APPLICATIONS](../../../sql-reference/sql/show-applications.md)
[DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) commands include the following changes.

## Changes to the SHOW APPLICATIONS command

The output of the SHOW APPLICATIONS command includes the following new columns:

| Column name | Description |
| --- | --- |
| disablement_reasons | An array containing the reasons why the Snowflake Native App was disabled. See Possible statuses for a disabled app for the list of possible reasons. |
| last_upgraded_on | The timestamp of the last successful upgrade of the app. The timestamp is empty if there is no successful upgrade. |

## Changes to the DESCRIBE APPLICATION command

Before the change:
:   Previously, if an app is disabled, the DESCRIBE APPLICATION command returns an error code to indicate that the app is disabled.

After the change:
:   The DESCRIBE APPLICATION command succeeds and the reason the app is disabled is included in the output in a new column:

    | Column name | Description |
    | --- | --- |
    | disablement_reasons | An array containing the reasons why the Snowflake Native App was disabled. See Possible statuses for a disabled app for the list of possible reasons. |

## Possible statuses for a disabled app

The following table lists the possible values for the DISABLEMENT_REASONS column:

| Value | Status description | Is recoverable? |
| --- | --- | --- |
| MANUALLY_DISABLED | The app is disabled by Snowflake | Yes. To re-enable the app, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support). |
| ACCOUNT_INACTIVE | The account becomes inactive by being locked or suspended causing the app to be unavailable. In this state a consumer cannot execute any SQL queries in their account and the app cannot be upgraded. | Yes. The app is automatically re-enabled if the account lock or suspension is removed |
| PACKAGE_VERSION_IS_MISSING | The application package version for the app was dropped by the provider. | Possibly. This can be caused by a temporary platform outage, in which case the app may recover automatically. Otherwise, the provider can work with Snowflake Support to attempt version recovery. Contact the application provider for more details. |
| CMK_ACCESS_DENIED | The consumer manages the encryption key themselves (ENCRYPT_USE_CMK_KMS is enabled) and Snowflake doesn’t have access to this key. | Yes. To re-enable the app, ensure that the cloud provider configuration to retrieve the CMK is correct and that Snowflake has access to the key. |
| LISTING_ACCESS_REVOKED | The listing used to create the app is no longer available. Possible reasons for this status include:   * The provider deleted the listing * The provider manually removed access to the private listing from the consumer account | Possibly. Recoverability depends on the reason why access was revoked.  For example, if the listing was deleted it is not recoverable. If a consumer account was manually removed from the private listing, access to the listing and app can be restored. |
| LISTING_TRIAL_USAGE_EXCEEDED | The application has exceeded the usage limit for a usage-based trial listing. | No |
| LISTING_PAYMENT_REQUIRED | The listing used to install the app is a paid listing and requires payment for further usage. | Yes. The consumer must correctly set up payment for the app. |
| LISTING_TRIAL_TIME_EXCEEDED | The application exceeded the trial duration. | No |
| APPLICATION_PACKAGE_NOT_AVAILABLE | The application package used to create the app no longer exists. The provider may have dropped the corresponding application package. | No |
| APPLICATION_PACKAGE_DISABLED | The application package used to create the app is disabled by the Snowflake. | Yes. The app is re-enabled, if Snowflake re-enables the application package. |
| APPLICATION_SUSPENDED | The app resources for example, tasks, services, and compute pools, are suspended due to the app being disabled.  The suspended objects remain suspended until the app is re-enabled and there are no other reasons the app was disabled. | Yes |
| APPLICATION_SUSPEND_RESUME_IN_PROGRESS | The app resources, for example tasks, services, and compute pools, are currently resuming. | Yes |

---
title: Snowflake Native App Framework: Enable event sharing for all apps in an account
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1697.md
section: Release Notes
---

# Snowflake Native App Framework: Enable event sharing for all apps in an account

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

Currently, consumers with a role that has the MANAGE EVENT SHARING privilege can describe, enable, and disable event
sharing for any app they have privileges to view.
However, this requires other privileges to be granted to the role in order to be able to view the app.

Before the change:
:   Even if a consumer uses a role that has the MANAGE EVENT SHARING privilege,
    they require additional privileges on the application object to be able to configure event sharing.

After the change:
:   Consumers with the MANAGE EVENT SHARING privilege can enable or disable event sharing for any app in their account.

Ref: 1697

---
title: Snowflake Native App Framework: Enforce REFERENCE usage on databases containing tags and policies
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1367.md
section: Release Notes
---

# Snowflake Native App Framework: Enforce REFERENCE usage on databases containing tags and policies

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

In a previous BCR, providers were required to grant reference usage on shared objects within an application package.
However, there was no effect on any installed Snowflake Native App as part of the previous BCR, including shared databases
referencing tags and policies.

In this BCR, an installed Snowflake Native App will fail if it is based on an application package that contains a database with dependencies on
tags or policies and REFERENCE usage has not been granted on that database to the application package.

Currently:
:   A Snowflake Native App installed from an application package containing dependencies on tags or policies continues to work even if
    REFERENCE usage on the parent or reference database was not granted to the application package.

Pending:
:   A Snowflake Native App installed from an application package containing dependencies on tags or policies will fail if REFERENCE
    usage is not granted on the parent or reference database to the application package.

    Providers must ensure that all apps installed in consumer accounts have the correct privileges granted
    to event tables and temporary stages. To grant the correct privileges, run the following command:

    ```sqlexample
    GRANT REFERENCE USAGE ON DATABASE <database_name> TO SHARE IN APPLICATION PACKAGE <app_package>;
    ```

Ref: 1367

---
title: Snowflake Native App Framework: Event sharing continues after consumer disables event table
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1724.md
section: Release Notes
---

# Snowflake Native App Framework: Event sharing continues after consumer disables event table

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

Consumers can disable an event table by setting the value of the event table to NONE as shown in the following example:
`ALTER ACCOUNT SET EVENT_TABLE = NONE`.
If the consumer sets this value to NONE and they have enabled event sharing,
log messages and trace events are no longer shared with the provider.

Before the change:
:   If the consumer sets the event table to NONE and event sharing is enabled,
    log messages and trace events are no longer shared with the provider.

After the change:
:   If event sharing is enabled, the log messages and trace events the consumer
    has agreed to share back are still shared with the provider, even if the event table is set to `NONE`.

Ref: 1724

---
title: Snowflake Native App Framework: Need to recreate or update some APPLICATION objects
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-update-app-dev-mode.md
section: Release Notes
---

# Snowflake Native App Framework: Need to recreate or update some APPLICATION objects

As part of the rollout of the
[Support for reference and privilege validation in the manifest file](../../2023/7_42.md)
feature, an APPLICATION object created in development mode that was based on staged files will no longer
function correctly if the APPLICATION object contains a versioned schema.

To resolve this issue, Snowflake recommends doing one of the following:

* If you do not need to preserve the existing APPLICATION object, you can delete the app and recreate it
  using the [CREATE APPLICATION](../../../sql-reference/sql/create-application.md) command.

  Following this process recreates the app and enables the new functionality added in the
  [Support for reference and privilege validation in the manifest file](../../2023/7_42.md)
  feature.
* If you need to preserve an existing APPLICATION object, use the [ALTER APPLICATION](../../../sql-reference/sql/alter-application.md)
  command and specify the staged files using UPGRADE USING <path_to_stage>.

  Following this process will return the app to a functioning state, but it will not include the new
  functionality added in the
  [Support for reference and privilege validation in the manifest file](../../2023/7_42.md)
  feature.

---
title: Snowflake Native App Framework: New application packages enable release channels by default
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_04/bcr-1977.md
section: Release Notes
---

# Snowflake Native App Framework: New application packages enable release channels by default

> **Attention:**
>
> This behavior change is in the 2025_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_04_bundle.md).

When this behavior change bundle is enabled, new application packages will have
[release channels](../../../developer-guide/native-apps/release-channels.md) enabled by default.

|  |  |
| --- | --- |
| Before the change | Release channels are not enabled by default for an application package. The ENABLE_RELEASE_CHANNELS property of an application package is set to `FALSE`. |
| After the change | Release channels are enabled by default for an application package. The ENABLE_RELEASE_CHANNELS property of an application package is set to `TRUE`. |

After release channels are enabled for an application package, they cannot be disabled. To create an application package without release channels you must use the following syntax:

```sqlexample
CREATE APPLICATION PACKAGE <pkg_name> ENABLE_RELEASE_CHANNELS=FALSE;
```

Ref: 1977

---
title: Snowflake Native App Framework: Providers must accept terms of service to set the DISTRIBUTION property to EXTERNAL
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-enforce-provider-tos.md
section: Release Notes
---

# Snowflake Native App Framework: Providers must accept terms of service to set the DISTRIBUTION property to EXTERNAL

To publish or upgrade a Snowflake Native App in a consumer account, a provider must set the `DISTRIBUTION`
property of the application package to `EXTERNAL`. A provider can set this property using the
[CREATE APPLICATION PACKAGE](../../../sql-reference/sql/create-application-package.md) or
[ALTER APPLICATION PACKAGE](../../../sql-reference/sql/alter-application-package.md) commands.

Before the change:
:   Before, providers could set `DISTRIBUTION=EXTERNAL` or create a version or patch for an application
    package without accepting the Provider Terms of Service.

After the change:
:   If a provider tries to set the `DISTRIBUTION` property to `EXTERNAL`, or if they create a version
    or patch for an application package where the `DISTRIBUTION` property has been set to `EXTERNAL`,
    they get an error message prompting them to accept the terms. The action they took with the command does
    not complete.

Ref: n/a

---
title: Snowflake Native App Framework: Roles with the ATTACH LISTING privilege can run the DESCRIBE APPLICATION PACKAGE command
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1603.md
section: Release Notes
---

# Snowflake Native App Framework: Roles with the ATTACH LISTING privilege can run the DESCRIBE APPLICATION PACKAGE command

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

This behavior change alters the privileges required to run the DESCRIBE APPLICATION PACKAGE command.
The ATTACH LISTING privilege is a database-level privilege that can be granted on an application package.
This privilege allows a provider to add an application package as the data product of a listing.

Before the change:
:   Having only the ATTACH LISTING privilege granted on an application package did not allow a user to run the DESCRIBE APPLICATION PACKAGE command.
    Only users with roles granted the OWNERSHIP privilege on the application package could run this command.

After the change:
:   Users with a role that has been granted only the ATTACH LISTING privilege can run the DESCRIBE APPLICATION PACKAGE command.

Ref: 1603

---
title: Snowflake Native App Framework: Update error message when an app is disabled
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1551.md
section: Release Notes
---

# Snowflake Native App Framework: Update error message when an app is disabled

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

After a consumer installs a Snowflake Native App, the app can become disabled for multiple reasons.

Before the change:
:   When a Snowflake Native App becomes disabled, the app returns a single error message that does not mention
    the reason the app is no longer available.

After the change:
:   When a Snowflake Native App becomes disabled, the app returns an error message that provides information
    on why the app is no longer available. The possible error messages are:

    * `Application is no longer available for use: Application manually disabled.`
    * `Application is no longer available for use: Account or organization is inactive.`
    * `Application is no longer available for use: The version of this application is no longer available, please contact the application provider for more details.`
    * `Application is no longer available for use: The Customer Managed Key (CMK) is not available for this account.`
    * `Application is no longer available for use: Access to the listing this application was installed from has
      been revoked, please contact the application provider for more details.`

Ref: 1551

---
title: Snowflake Native Apps Framework: Changes to the MANAGE EVENT SHARING privilege
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1576.md
section: Release Notes
---

# Snowflake Native Apps Framework: Changes to the MANAGE EVENT SHARING privilege

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

The MANAGE EVENT SHARING privilege allows users to see information about an
installed Snowflake Native App and to enable event sharing for an application.
The ACCOUNTADMIN role has this privilege by default and can grant it to other roles.

Before the change:
:   Users with a role that has the MANAGE EVENT SHARING privilege can run the
    [DESCRIBE APPLICATION](../../../sql-reference/sql/desc-application.md) command and enable event sharing for installed apps if
    they are granted a role that has ownership of the app.

After the change:
:   Users with a role with the MANAGE EVENT SHARING privilege can run the
    DESCRIBE APPLICATION command and enable event sharing for installed apps
    that are granted a role that has ownership of the app or has the RESOLVE ALL
    global privilege.

For more information on privileges, see [Access control privileges](../../../user-guide/security-access-control-privileges.md).

Ref: 1576

---
title: Snowflake Native Apps: changes to hash values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1901.md
section: Release Notes
---

# Snowflake Native Apps: changes to hash values

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the following changes will be implemented:

* The values of the following shared events on the provider side will change:

  + snow.application.hash
  + snow.database.hash
  + snow.query.hash
* The hash value of the APPLICATION_NAME_HASH column of the
  [DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY](../../../sql-reference/data-sharing-usage/application-state-view.md)
  will change.
* The application hash value of the APPLICATION_OBJECTS_ACCESSED column in the
  [DATA_SHARING_USAGE.LISTING_ACCESS_HISTORY](../../../sql-reference/data-sharing-usage/listing-access-history.md)
  will change.

Before the change:
:   Snowflake uses SHA1 to calculate the hash value of query id, app name, or database name. A consumer
    could call the native SHA1() function to calculate the hash value.

After the change:
:   Snowflake uses HMAC to calculate the hash value of query id, app name, or database name.
    The consumer must call the SYSTEM$GET_HASH_FOR_APPLICATION function to calculate the hash
    value.

Ref: 1901

---
title: Snowflake Native Apps: Changes to privileges commonly used by apps
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1952.md
section: Release Notes
---

# Snowflake Native Apps: Changes to privileges commonly used by apps

> **Note:**
>
> This behavior change is not applicable if you don’t have a Snowflake Native App running in your
> account, or you’re not planning to install a Snowflake Native App before this behavior change
> is enabled.

In a future release, privileges commonly used by apps will change from opt-in to opt-out.
Privileges that currently require explicit grants during installation or upgrade will be available
to a new app installation or an upgrade by default. This includes new versions and patches
of a previously installed app.

This change affects the following privileges:

* EXECUTE TASK
* EXECUTE MANAGED TASK
* CREATE WAREHOUSE
* CREATE COMPUTE POOL
* BIND SERVICE ENDPOINT
* CREATE DATABASE

Before the change:
:   If an app requires one of the privileges listed above, the consumer must explicitly
    grant these privileges to the app during installation or upgrade.

After the change:
:   If an app requires these privileges, they will be granted to the app automatically during
    installation or upgrade. Consumers must explicitly deny access to these privileges.

Ref: 1952

---
title: Snowflake Native Apps: Changes to restrictions on version name, setup file name
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2169.md
section: Release Notes
---

# Snowflake Native Apps: Changes to restrictions on version name, setup file name

In a future release, restrictions on version names and setup file names as defined in the manifest file for new Snowflake Native Apps will be enforced. Existing Snowflake Native Apps are not affected by this change.

Before the change:
:   Version names and setup file names could use any characters.

After the change:
:   The version name can only contain alphanumeric characters, underscores (_), hyphens (-), dollar signs ($), periods (.), and spaces.

    The setup script name and path can only contain alphanumeric characters, underscores (_), hyphens (-), periods (.), backslashes (), and forward slashes (/).

Ref: 2169

---
title: Snowflake Native Apps: Deprecation of Python versions 3.8 and 3.9 (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2072.md
section: Release Notes
---

# Snowflake Native Apps: Deprecation of Python versions 3.8 and 3.9 (Pending)

> **Note:**
>
> This behavior change is not applicable if you are not a Snowflake Native App Provider and
> you have not installed or intend to install a Snowflake Native App in your account.

Snowflake deprecates and eventually decommissions versions of the Python runtime that are
no longer actively maintained. When Python version 3.9 is deprecated in October, 2025,
Snowflake Native Apps will no longer support any deprecated versions of Python. For more information
on Snowflake’s Python runtime support policy, see
[Snowflake Python Runtime Support](../../../developer-guide/python-runtime-support-policy.md).

Before the change:
:   Snowflake Native Apps can create functions using decommissioned versions of the Python runtime.

After the change:
:   Snowflake Native Apps no longer supports decommissioned versions of the Python runtime. For more
    information, see Considerations for Snowflake Native App providers and
    Considerations for Snowflake Native App consumers.

    As a Snowflake Native App provider, you will begin receiving email notifications about this change
    during the deprecation period starting in October, 2025. You will have time to update your app before the Python versions are decommissioned.

## Considerations for Snowflake Native App providers

When this change is enabled, apps cannot create functions that use decommissioned
versions of Python. Providers cannot create or publish new versions of an app that
attempt to create functions that use decommissioned versions of Python. Existing published
versions of an app that use decommissioned versions of Python cannot be installed.

If an existing app uses a decommissioned version of the Python runtime, but does not create
new functions that use the decommissioned version, the app will continue to function.

## Considerations for Snowflake Native App consumers

When this change is enabled, consumers cannot install versions of an app that use
decommissioned versions of Python. Existing installations of an app that create new
functions that use decommissioned versions of Python will fail. If an existing app
uses a decommissioned version of the Python runtime, but does not create new functions
that use the decommissioned version, the app will continue to function.

Ref: 2072

---
title: Snowflake Native Apps: Introduce maximum number of scanned patches
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1466.md
section: Release Notes
---

# Snowflake Native Apps: Introduce maximum number of scanned patches

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

Snowflake requires an [automated security scan](../../../developer-guide/native-apps/security-overview.md)
when setting the `DISTRIBUTION` parameter for an application package to `EXTERNAL`.
The security scan applies to active versions and patches defined for the application package. Versions that are not active are not scanned.

Before the change:
:   Setting the `DISTRIBUTION` parameter to `EXTERNAL` initiates the security scan on all patches for an active version defined in
    the application package.

After the change:
:   When setting the `DISTRIBUTION` parameter to `EXTERNAL` only the ten most recent patches are scanned.
    Only the ten most recent patches can be published when the `DISTRIBUTION` parameter is set.

    Note that this is a limit on the number of patches available at the time the version is published. A single version can have up to 130 patches.

    Use the [SHOW VERSIONS IN APPLICATION PACKAGE](../../../sql-reference/sql/show-versions.md) command to view the versions, patches, and review status for an application package.

    This change is only applicable to new application packages created after the behavior change. Existing application packages are unaffected.

Ref: 1466

---
title: Snowflake Notebooks: replication
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2058.md
section: Release Notes
---

# Snowflake Notebooks: replication

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, notebook replication can be explicitly enabled at the account level for customers that have configured database replication
for databases that include notebooks.

Notebooks are replicated as part of any database included in a replication or failover group. In the secondary account, replicated notebooks can be executed but not edited.

* To run a replicated notebook as intended in the secondary account, any associated objects (such as warehouses, external access
  integrations, and secrets) must exist in the secondary account with the same names. These objects can either be replicated or manually created.
* Scheduled notebook tasks won’t be replicated unless the database containing those tasks is also replicated.

For details, see [Notebook replication](../../../user-guide/ui-snowsight/notebooks-replication.md).

Ref: 2058

---
title: Snowflake Python APIs release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowapi-python.md
section: Release Notes
---

# Snowflake Python APIs release notes

The Snowflake Python APIs release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowapi-python-2026.md)
* [2025 releases](snowapi-python-2025.md)
* [2024 releases](snowapi-python-2024.md)
* [2023 releases](snowapi-python-2023.md)

See [Snowflake Python APIs: Managing Snowflake objects with Python](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for documentation.

---
title: Snowflake Python APIs release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowapi-python-2023.md
section: Release Notes
---

# Snowflake Python APIs release notes for 2023

This article contains the release notes for the Snowflake Python APIs, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake Python APIs: Managing Snowflake objects with Python](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for documentation.

## Version 0.5.0 (2023-12-06)

### New features and updates

* Removed the experimental tags on all entities.

### Bug fixes

* Fixed a bug that raised an exception when listing databases and schemas.

## Version 0.4.0 (2023-12-04)

Initial public preview release.

### New features and updates

* Added support for Python 3.11.
* Updated the dependency on snowflake-snowpark-python to 1.5.0.
* Removed the Pydantic types from the model class.
* Renamed exception class names in `snowflake.core.exceptions`.

### Bug fixes

* Fixed a bug that raised an exception when listing some entities that have non-alphanumeric characters in the names.

---
title: Snowflake Python APIs release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowapi-python-2024.md
section: Release Notes
---

# Snowflake Python APIs release notes for 2024

This article contains the release notes for the Snowflake Python APIs, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake Python APIs: Managing Snowflake objects with Python](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for documentation.

## Version 1.0.2 (2024-11-13)

### New features and updates

* Removed the `async_req` parameter (asynchronous mode) from the `execute_job` API in the `Service` resource.

### Bug fixes

* None.

## Version 1.0.1 (2024-11-11)

### New features and updates

* Added support for the following new resources:

  + Cortex Chat
  + Cortex inference
* Added support for customized user agents.

### Bug fixes

* Fixed the `ValueError` message for `Enum` types.
* Fixed the API documentation for `Enum` types to show possible values.
* Added the missing `DeleteMode` type to the API documentation.

## Version 1.0.0 (2024-10-22)

Initial general availability release.

### New features and updates

* Improved error messages by shortening stack traces. To control this behavior, use the `_SNOWFLAKE_PRINT_VERBOSE_STACK_TRACE`
  environment variable option.
* Now includes read-only properties by default in dictionaries returned by `to_dict()` from models. To toggle this option, use
  `to_dict (hide_readonly_properties=True)`.
* Added the `if_exists` property, which toggles whether you can perform an action without erroring if the given resource does not
  exist, to the following methods and resources:

  + `drop()` for `Database`, `NetworkPolicy`, `View`, `User`, `ComputePool`, `ImageRepository`,
    `Pipe`, `Role`, `Service`, `Stage`, `Table`, `Task`, `DynamicTable`, `Role`,
    `Alert`, `Procedure`, `Warehouse`, `Schema`, and `Function`.
  + `refresh()` for `Database` and `DynamicTable`.
  + `suspend()` and `resume()` for `Service`, `DynamicTable`, and `Warehouse`.
  + `suspendRecluster()` and `resumeRecluster()` for `DynamicTable` and `Table`.
* `Database` now supports the `undrop()` method.
* `Service` now supports the `from_name` parameter in `iter()`.
* `Table` now supports the `target_database` and `target_schema` parameters in `swap_with()`.
* `Procedure` now supports the `copy_grants` parameter in `create()`.

### Bug fixes

* Creating dynamic tables now properly allows cloning source objects from different databases and schemas.
* Fixed an SSL connection issue for accounts and organizations with underscores when used in hostnames.

## Version 0.13.1 (2024-10-11)

### New features and updates

* Added support for the database role resource.
* Added new methods to the role, database role, and user resources to manage access privileges.
* Improved logs with secrets scrubbed.

### Bug fixes

* None.

## Version 0.13.0 (2024-10-04)

### New features and updates

* Improved the API documentation significantly.
* Removed `snowflake-snowpark-python` as a dependency of `snowflake.core`. However, this package is still required for some
  features, such as task graph (DAG) concepts; the check and requirement for these features is performed at runtime.
* Added support for all Python versions 3.8 or newer.
* Added support for `targetDatabase` and `targetSchema` for cloning tables.
* Added support for `targetDatabase` for cloning Schemas.
* Exposed type definitions.
* Added support for `execute_job` in `ServiceCollection`.
* Added support for `get_containers`, `get_instances`, and `get_roles` in `ServiceResource`.
* Added support for `create_or_update` in `Service` and `ComputePool`.
* Added support for the following new resources:

  + Account
  + Alert
  + Catalog integration
  + Event table
  + External volume
  + Managed account
  + Network policy
  + Notebook
  + Notification integration
  + Pipe
  + Procedure
  + Stream
  + User defined functions
  + View

### Bug fixes

* Fixed a bug relating to the logging of URLs, where not all the URL pieces were injected into logging.

## Version 0.12.1 (2024-08-29)

### New features and updates

* None.

### Bug fixes

* Fixed multiple issues related to handling large results.

## Version 0.12.0 (2024-08-20)

### New features and updates

* The client now retries requests on retryable error codes.
* The following `StageResource` methods are now deprecated and have been renamed. The old method names are now aliases.

  + From `upload_file` to `put`.
  + From `download_file` to `get`.

## Version 0.11.0 (2024-07-25)

### New features and updates

> **Note:**
>
> The following new features require the Snowflake version 8.27 release.

* Added client logging to the library to enhance debug ability.
* Added `undrop` support to the `DynamicTable`, `Schema`, and `Table` object types.
* Enhanced support for the `Grant` object type with the following limitations:

  + The SQL command SHOW GRANTS ON is not supported.
  + Only `Grantees.role` is supported as the grantee value for the `Grants.to` method (SHOW GRANTS TO).
* To be more consistent with their equivalent SQL commands, the following methods are now deprecated and have been renamed as follows. The
  old method names are now aliases that call the new method names, so the old method names will still work as expected.

  + From `create_or_update` to `create_or_alter`.
  + From `delete` to `drop`.
  + From `undelete` to `undrop`.

### Bug fixes

* Fixed a bug in stored procedure generated code.

## Version 0.10.0 (2024-06-24)

### New features and updates

> **Note:**
>
> The following new features are dependent on the release of Snowflake version 8.23.

* Added API support for the following resources:

  + `DynamicTable`
  + `Function` (Currently supports service functions only)
  + `Grant`
* Added support for finalizers in tasks and task graphs (DAGs).

## Version 0.9.0 (2024-06-10)

### New features and updates

* Added API support in *experimental* mode for the following resources:

  + `User`
  + `Role`
  + Management `Stage`
* Re-added `create_or_update` support for the `Warehouse`, `Schema`, and `Database` resources.

  > **Note:**
  >
  > The `create_or_update` feature for these resources requires the upcoming release of Snowflake version 8.23, which is currently
  > unreleased as of June 10, 2024.
* Added the `get_endpoints` method for `Service` resources that returns a list of endpoints for a given `Service` object.

### Bug fixes

* `with_managed_access` is now properly returned as a property of `SchemaResource`.

## Version 0.8.1 (2024-05-31)

### New features and updates

* Added the `with_managed_access` Boolean option in `create_or_update` for `SchemaResource`. This option is equivalent to
  the WITH MANAGED ACCESS optional parameter in [CREATE SCHEMA](../../sql-reference/sql/create-schema.md).

  + Usage example:

    ```python
    schema.create_or_update(schema_def, with_managed_access = True)
    ```
* Added the `get_endpoints` method for `Service` resources that returns a list of endpoints for a given `Service` object.

## Version 0.8.0 (2024-04-30)

### Behavior changes

* Removed the `deep` parameter from `fetch()` on `TableResource` objects. `fetch()` always returns detailed
  columns and constraints information of a `TableResource`.
* `create_or_update()` currently no longer works for `Schema`, `Warehouse`, `Database`, and `ComputePool`
  resources. `create()` does work for these resources.
* Creating tables using `as_select` no longer carries over information from any source tables used in the `as_select` query.
* The `data_retention_time_in_days` and `max_data_extension_time_in_days` properties of a table are inherited from schema or
  database settings when not explicitly set in a `create_or_update` statement that alters an existing table.

### New features and updates

* Added support for the Cortex Search API endpoint.
* Added support for large results.
* Added support for long-running queries.
* Added the `ServiceSpec` helper function to infer the specification type from a provided string in `Service` resources.
* Updated to use the Snowflake API REST platform for all resources.
* `pip install snowflake[ml]` installs `snowflake-ml-python` v1.4.0.

### Bug fixes

* Various bug fixes.

## Version 0.7.0 (2024-03-20)

Version 0.7.0 introduces updates across the `snowflake` and `snowflake.core` packages.

### New features and updates

`snowflake` package updates:

* You can now run `pip install snowflake[ml]` to install the [Snowpark ML](https://pypi.org/project/snowflake-ml-python/) library
  as an extra package dependency.

`snowflake.core` package updates:

* Task predecessors now return their fully qualified name.
* Added the `__str__()` and `__repr_html__()` methods to `DAGRun` to make it notebook compatible.
* Replaced “DAGs” with “task graphs” in the API reference documentation to better align with Snowflake documentation.

### Bug fixes

`snowflake.core` package fixes:

* Fixed code generator and updated OpenAPI-spec driven models.
* Fixed Pydantic compatibility issues.
* Fixed a bug in the `Task.error_integration` property.
* Fixed a bug in the `Task.config` property when the REST property was missing.

## Version 0.6.0 (2024-02-06)

### New features and updates

* The `>>` and `<<` operators of `DAGTask` now accept a function directly.
* `DAGTask` now uses the DAG’s warehouse by default.
* `DAGTask` accepts a new parameter `session_parameters`.
* Updated `TaskContext`:

  + The method `get_predecessor_return_value` now works for both long and short names of a `DAGTask`.
  + Added the methods `get_current_task_short_name` and `get_task_graph_config_property`.
* Added support for pydantic 2.x.
* Added support for Python 3.11.

### Bug fixes

* Fixed a bug where `DAGOperation.run()` raised an exception if the DAG doesn’t have a schedule.
* Fixed a bug where deleting a DAG didn’t delete all of its sub-tasks.
* Fixed a bug that raised an error when a DAG’s `config` is set.

---
title: Snowflake Python APIs release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowapi-python-2025.md
section: Release Notes
---

# Snowflake Python APIs release notes for 2025

This article contains the release notes for the Snowflake Python APIs, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake Python APIs: Managing Snowflake objects with Python](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for documentation.

## Version 1.10.0 (Dec 8, 2025)

### New features and updates

* Added support for the Streamlit resource.
* Added support for the DECFLOAT data type.

### Bug fixes

None.

## Version 1.9.0 (Nov 13, 2025)

### New features and updates

* Added support for the following resources:

  + Artifact repository
  + Network rule
  + Password policy
  + Secret
  + Sequence
  + Tag

### Bug fixes

None.

## Version 1.8.0 (Sep 22, 2025)

### New features and updates

* Added support for proxy configuration. You can provide proxy settings by using the `HTTPS_PROXY` environment variable.

### Bug fixes

None.

## Version 1.7.0 (Jul 31, 2025)

### New features and updates

* Added support to the following methods for specifying the point-of-time reference when you use Time Travel to create streams:

  + `PointOfTimeStatement`
  + `PointOfTimeStream`
  + `PointOfTimeTimestamp`

### Bug fixes

* Fixed a warning: `'allow_population_by_field_name' has been renamed to 'validate_by_name'`.
* Restored the behavior of the `drop` method of `DAGOperation` such that `drop_finalizer` must be set to `True` before
  the finalizer task is dropped.

  As a result of changes in the 9.20 Snowflake release, `fetch_task_dependents` started returning the finalizer task alongside other
  tasks that belong to the Directed Acyclic Graph (DAG). This behavior caused the `drop` method to always drop the finalizer.

## Version 1.6.0 (Jun 26, 2025)

### New features and updates

* Optionalized the `query` and `column` parameters in `QueryRequest` for the Cortex Search service API.

### Bug fixes

None.

## Version 1.5.1 (May 28, 2025)

### New features and updates

None.

### Bug fixes

* Fixed a bug in `ProcedureResource` that caused the `call` method to return wrong results when using the `extract`
  option with the `ReturnTable` type.
* `CortexInferenceService.complete` can now be called from Python worksheets and notebooks.

## Version 1.5.0 (May 14, 2025)

### New features and updates

* Deprecated the `ServiceResource.get_service_status` method in favor of the `ServiceResource.get_containers` method.
* Added the `extract` option to the `procedure.call` method. Enabling this option causes the method to extract results from the
  returned payload.

  For example, setting `extract=False` (current default behavior) returns a result such as `[{'procedure_name': 42}]`. In this
  example, you can set `extract=True` to return the value `42`.

  > **Note:**
  >
  > `extract=False` remains the current default setting but now returns a deprecation warning. The recommendation is to switch to using
  > `extract=True`, which will become the new default in the next major release.
* Added support for mapping the VARIANT type in a stored procedure call.

### Bug fixes

* Fixed the type mapping for the GEOMETRY, GEOGRAPHY, OBJECT return types in stored procedures.
* The `__repr__` implementation for stored procedures and functions now shows a list of arguments in addition to the name.

## Version 1.4.0 (Apr 23, 2025)

### New features and updates

* Implemented the `__repr__` method for all collection, resource, and model classes.

### Bug fixes

* Changed the `_SNOWFLAKE_PRINT_VERBOSE_STACK_TRACE` environment variable to be enabled by default, which causes printed error messages
  to display the full stack trace.

  This change was made to avoid disabling stack traces for all exceptions, which happens when `SNOWFLAKE_PRINT_VERBOSE_STACK_TRACE` is
  not set.

## Version 1.3.0 (Apr 9, 2025)

### New features and updates

* Added the `snowflake.core.FQN` class, which represents an object identifier.
* The `DAGOperation.drop` method drops the finalizer task associated with the DAG if the `drop_finalizer` argument is set to `True`.

  > **Important:**
  >
  > The `drop_finalizer` argument will be removed in the next major API release, and the `DAGOperation.drop` method will always
  > drop the associated finalizer task along with the DAG.

### Bug fixes

None.

## Version 1.2.0 (Mar 26, 2025)

### New features and updates

* Added support for asynchronous requests across all of the existing endpoints.

  Asynchronous methods are denoted by the `_async` suffix in their names and use polling to determine whether an operation was completed.

  The number of calls that can execute in parallel depends on the number of CPUs. To change the size of the thread pool, use the `_SNOWFLAKE_MAX_THREADS` environment variable.

  For example usage, see the [snowflake.core.PollingOperation](/developer-guide/snowflake-python-api/reference/latest/_autosummary/snowflake.core.PollingOperation) class documentation.
* Added support for creating serverless tasks using the `StoredProcedureCall` definition.
* Added support for the SERVERLESS_TASK_MIN_STATEMENT_SIZE and SERVERLESS_TASK_MAX_STATEMENT_SIZE serverless attributes to the
  `Database` and `Schema` resources (dependent on Snowflake version 9.8).
* Added support for setting the SUSPEND_TASK_AFTER_NUM_FAILURES, USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE, and USER_TASK_TIMEOUT_MS
  attributes on cloned databases and schemas (dependent on Snowflake version 9.8).
* Deprecated `CortexAgentService.Run` in favor of `CortexAgentService.run`.
* Added new optional attributes to various models within the Cortex Search service API:

  + `text_boosts` and `vector_boosts` to the `Function` model
  + `weights` to the `ScoringConfig` model

### Bug fixes

* You can now call `create_or_alter` with a task object returned from the `iter` method.

## Version 1.1.0 (Mar 12, 2025)

### New features and updates

* Added support for the TARGET_COMPLETION_INTERVAL, SERVERLESS_TASK_MIN_STATEMENT_SIZE, and SERVERLESS_TASK_MAX_STATEMENT_SIZE serverless
  attributes to the Task resource.
* Added support for the following new resources:

  + API integrations
  + Iceberg tables (dependent on Snowflake version 9.6)

### Bug fixes

None.

## Version 1.0.5 (Feb 19, 2025)

### New features and updates

* Removed the `protobuf` dependency from `snowflake.core`.

### Bug fixes

None.

## Version 1.0.4 (Feb 13, 2025)

### New features and updates

* Added support for the Cortex Lite Agent resource.

### Bug fixes

None.

## Version 1.0.3 (Feb 4, 2025)

### New features and updates

* Added support for the Cortex Embed resource.

### Bug fixes

None.

---
title: Snowflake Python APIs release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowapi-python-2026.md
section: Release Notes
---

# Snowflake Python APIs release notes for 2026

This article contains the release notes for the Snowflake Python APIs, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

See [Snowflake Python APIs: Managing Snowflake objects with Python](../../developer-guide/snowflake-python-api/snowflake-python-overview.md) for documentation.

## Version 1.12.0 (Feb 12, 2026)

### New features and updates

* Added support for setting (`set_tags`), unsetting (`unset_tags`), and fetching tag assignments (`get_tags`).
  Tagging support for specific resources is introduced in the following Snowflake server releases:

  + **10.3**: Alert, database, database role, dynamic table, event table, image repository, network policy, notebook, password policy, pipe,
    procedure, role, schema, stream, table, task, user, user-defined function, view, warehouse.
  + **10.4**: API integration, catalog integration, compute pool, function, iceberg table, notification integration, Streamlit.

### Bug fixes

* None.

## Version 1.11.0 (Jan 21, 2026)

### New features and updates

* The `DAGTask` object type now accepts custom objects with a `to_sql()` method as task definitions.
* The `UserDefinedFunction` object type now supports executing scalar UDFs using the `execute` method.

### Bug fixes

* Creating, fetching, and listing stored procedures that use a staged handler (where the `body` property is empty) no longer raises a `ValidationError`.
* Pydantic deprecation warnings related to `class-based config` and the `update_forward_refs`, `parse_obj`, and `_iter` methods no longer occur.

---
title: Snowflake Scripting: Changes to global variables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1850.md
section: Release Notes
---

# Snowflake Scripting: Changes to global variables

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the Snowflake Scripting global variables for DML commands
behave differently. These variables are described in [Determining the number of rows affected by SQL statements](../../../developer-guide/snowflake-scripting/dml-status.md).

When a non-DML statement is executed after the last DML statement in a Snowflake Scripting block
or stored procedure, these variables behave as follows:

Before the change:
:   The variables return the value set by the last DML statement:

    * `SQLROWCOUNT` - Number of rows affected by the last DML statement.
    * `SQLFOUND` - `TRUE` or `FALSE` based on the last DML statement.
    * `SQLNOTFOUND` - `TRUE` or `FALSE` based on the last DML statement.

After the change:
:   The variables return NULL.

For example, the following Snowflake Scripting anonymous block returns different values before and
after the change:

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  CREATE OR REPLACE TABLE test_snowflake_scripting_gv (i INT);
  INSERT INTO test_snowflake_scripting_gv VALUES (1);
  SELECT 1;
  RETURN SQLROWCOUNT;
END;
$$;
```

Returned value before the change::
:   ```output
    +-----------------+
    | anonymous block |
    |-----------------|
    |               1 |
    +-----------------+
    ```

Returned value after the change::
:   ```output
    +-----------------+
    | anonymous block |
    |-----------------|
    |            NULL |
    +-----------------+
    ```

To achieve the previous behavior after the change, save the Snowflake Scripting global variable value
in a new variable before subsequent non-DML statements, and then return the value of the new variable.
For example:

```sqlexample
EXECUTE IMMEDIATE
$$
BEGIN
  LET sql_row_count_var := 0;
  CREATE OR REPLACE TABLE test_snowflake_scripting_gv (i INT);
  INSERT INTO test_snowflake_scripting_gv VALUES (1);
  sql_row_count_var := SQLROWCOUNT;
  SELECT 1;
  RETURN sql_row_count_var;
END;
$$;
```

Ref: 1850

---
title: Snowflake server release notes and feature updates
source: https://docs.snowflake.com/en/release-notes/new-features.md
section: Release Notes
---

# Snowflake server release notes and feature updates

This topic lists the release notes for the most recent server releases and feature updates.
For earlier releases and feature updates, see [Server releases and feature updates earlier in 2026](new-features-2026.md).

> **Tip:**
>
> To view a list of release note announcements, filtered by date and release type, see
> [All release notes](/release-notes/all-release-notes).

If you have questions about any of these features, contact
[Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Recent server releases

* [10.13 Release Notes (no announcements): Apr 11, 2026-Apr 16, 2026](2026/10_13.md)
  + [Release notes change log](2026/10_13.md)
* [10.12 Release Notes (with behavior changes): Apr 03, 2026-Apr 08, 2026](2026/10_12.md)
  + [Behavior change bundles](2026/10_12.md)
  + [SQL updates](2026/10_12.md)
    - [CHECK constraints for standard tables (*General availability*)](2026/10_12.md)
  + [New features](2026/10_12.md)
    - [Dynamic table refresh boundaries](2026/10_12.md)
    - [Access history improvements](2026/10_12.md)
  + [Release notes change log](2026/10_12.md)
* [10.11 Release Notes (no announcements): Mar 30, 2026-Apr 1, 2026](2026/10_11.md)
  + [Release notes change log](2026/10_11.md)

For earlier server releases, see [Server releases earlier in 2026](weekly-releases-2026.md).

## Recent feature updates

* [March 19, 2026: MIN_BY and MAX_BY functions are supported with dynamic table incremental refresh (*General availability*)](2026/other/2026-03-19-min-max-incremental-dynamic-table.md)
* [April 13, 2026: Dynamic Apache Iceberg™ tables now support PARTITION BY, TARGET_FILE_SIZE, and PATH_LAYOUT (*General availability*)](2026/other/2026-04-13-dynamic-iceberg-tables-partition-file-path.md)
* [Apr 17, 2026: Performance Explorer tabs, filter presets, CSV export, and side-panel search](2026/other/2026-04-17-performance-explorer-usability.md)
* [Apr 16, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-04-16-dcr.md)
  + [Clean Rooms API Version: 14.4](2026/other/2026-04-16-dcr.md)
* [Apr 16, 2026: Primary key support in dynamic tables (*General availability*)](2026/other/2026-04-16-dynamic-table-primary-keys.md)
* [Apr 16, 2026: Consumer-controlled maintenance policies: Provider support (*Preview*)](2026/other/2026-04-16-native-apps-consumer-maintenance-policies-provider.md)
* [Apr 15, 2026: Snowflake documentation for AI agents and LLMs](2026/other/2026-04-15-agent-friendly-docs.md)
* [Apr 15, 2026: Openflow Connector for HubSpot (*Preview*)](2026/other/2026-04-15-openflow-hubspot-pupr.md)
* [Apr 14, 2026: Monitor Cortex Search requests (*Preview*)](2026/other/2026-04-14-cortex-search-monitoring.md)
* [Apr 14, 2026: Cortex Search Service replication (*General availability*)](2026/other/2026-04-14-cortex-search-replication-ga.md)
* [Apr 14, 2026: Snowflake storage for Apache Iceberg™ tables (*Preview*)](2026/other/2026-04-14-iceberg-snowflake-storage.md)
* [Apr 10, 2026: Budgets for AI features (*General availability*)](2026/other/2026-04-10-budgets-ai-features-ga.md)
* [Apr 9, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-04-09-dcr.md)
  + [Clean Rooms API Version: 14.3](2026/other/2026-04-09-dcr.md)
* [Apr 8, 2026: Error logging for Snowpipe Streaming (*General availability*)](2026/other/2026-04-08-snowpipe-streaming-error-tables.md)
* [Apr 7, 2026: Workspaces replication (*General availability*)](2026/other/2026-04-07-workspace-replication-ga.md)
* [Apr 6, 2026: AI_SERVICES billing breakout for implemented AI Credits services](2026/other/2026-04-06-ai-services-billing-breakout.md)
* [Apr 6, 2026: Apache Iceberg™ tables: Write support for Databricks Unity Catalog on Azure (*General availability*)](2026/other/2026-04-06-iceberg-write-support-azure-unity-catalog.md)
* [Apr 3, 2026: Medical and health data classifiers in sensitive data classification (*General availability*)](2026/other/2026-04-03-sensitive-data-classification-medical-health-ga.md)
* [Apr 02, 2026: AI_COMPLETE document intelligence (*General availability*)](2026/other/2026-04-02-ai-complete-document-intelligence-ga.md)
* [Apr 02, 2026: AI_FUNCTIONS_USER database role (*General availability*)](2026/other/2026-04-02-ai-functions-user-db-role.md)
* [Apr 02, 2026: AI_PARSE_DOCUMENT now available in AWS Europe West 2 (London)](2026/other/2026-04-02-ai-parse-document-london-region.md)
* [Apr 2, 2026: Copy tags when running a CREATE OR REPLACE TABLE command (*General availability*)](2026/other/2026-04-02-create-table-copy-tags-ga.md)
* [Apr 2, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-04-02-dcr.md)
  + [Clean Rooms API Version: 14.0](2026/other/2026-04-02-dcr.md)
* [Apr 2, 2026: Performance Explorer granular access aligned with your privileges](2026/other/2026-04-02-performance-explorer-granular-access.md)
* [Mar 31, 2026: CTAS support for Databricks Unity Catalog with external volumes (*General availability*)](2026/other/2026-03-31-ctas-unity-catalog-external-volumes.md)
* [Mar 31, 2026: Use Snowsight to manage external volumes (*General availability*)](2026/other/2026-03-31-manage-external-volumes-snowsight-ga.md)
* [Mar 26, 2026: Snowflake Data Clean Rooms updates](2026/other/2026-03-26-dcr.md)
  + [Clean Rooms API Version: 13.9](2026/other/2026-03-26-dcr.md)
* [Mar 26, 2026: New SCHEDULER attribute for dynamic tables — *General availability*](2026/other/2026-03-26-dynamic-table-scheduler-attribute.md)
* [Mar 24, 2026: ARRAY_REPEAT function](2026/other/2026-03-24-array-repeat-function.md)
* [Mar 24, 2026: MAP_ENTRIES function](2026/other/2026-03-24-map-entries-function.md)
* [Mar 20, 2026: Block public access to internal stages with IP allowlist exceptions (*General availability*)](2026/other/2026-03-20-block-public-stage-access-with-exceptions.md)
* [Mar 20, 2026: DCM Projects (*Preview*)](2026/other/2026-03-20-dcm-projects.md)
* [Mar 20, 2026: Apache Iceberg™ tables: Support for the Azure Data Lake Storage Gen2 with external volumes (*Preview*)](2026/other/2026-03-20-iceberg-azure-dls-external-volumes.md)
* [Mar 20, 2026: Trust Center Extensions (*General availability*)](2026/other/2026-03-20-tc-extensions-ga.md)
* [Mar 19, 2026: Artifacts in Snowflake Intelligence (*Preview*)](2026/other/2026-03-19-snowflake-intelligence-artifacts.md)
* [Mar 17, 2026: Openflow Connector for Google BigQuery (*Preview*)](2026/other/2026-03-17-openflow-bigquery-pupr.md)

For earlier feature updates, see [Feature updates earlier in 2026](feature-releases-2026.md).

---
title: Snowflake Support page: Access requirements changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2188.md
section: Release Notes
---

# Snowflake Support page: Access requirements changes

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, the following changes to access to the Snowflake Support page occur:

Before the change:
:   * Any user with a valid Snowflake account and a role with Support privileges could view the Snowflake Support page.
    * Users with a role that includes Support privileges could access the page regardless of which role was active in their session. This access was
      allowed even when a session policy prevented the Support-privileged role from being activated as a secondary role.

After the change:
:   * To access the page, a Support-privileged role must be active in the current session, either as the primary role or as an allowed secondary role.
    * Access is denied if a session policy blocks the Support-privileged role from being activated as a secondary role and that role isn’t currently active as the primary role.
    * If access is denied, users see a warning that instructs them to switch to a Support-privileged role or contact their administrator.

Benefits:

> * Improved performance, security, and reliability when loading and navigating the Support page.
> * Consistent enforcement of administrator-defined session policies for secondary role activation.

To verify if your Support-privileged roles are currently restricted from being activated as secondary roles by a session policy, run the following queries.

To identify the active session policies in your account, run this query:

```sqlexample
SHOW SESSION POLICIES;
```

To inspect the BLOCKED_SECONDARY_ROLES and ALLOWED_SECONDARY_ROLES fields for each active policy, run this query:

```sqlexample
DESC SESSION POLICY <policy_name>;
```

Verify if any of your Support-privileged roles are explicitly blocked or are excluded from the allowed list. If your Support roles are restricted
by a session policy, we recommend that you take one of the following actions:

* Communicate to affected users that they must switch to the Support-privileged role to access the Support page.
* Adjust your session policy to allow your Support-privileged roles to be activated as secondary roles.
* Create a new role with Support privileges that isn’t restricted by the session policy and grant this role to your users.

Ref: 2188

---
title: Snowpark Connect for Spark release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-connect.md
section: Release Notes
---

# Snowpark Connect for Spark release notes

The release notes for [Snowpark Connect for Spark](../../developer-guide/snowpark-connect/snowpark-connect-overview.md) provide details for each release,
including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowpark-connect-2026.md)
* [2025 releases](snowpark-connect-2025.md)

See [Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark](../../developer-guide/snowpark-connect/snowpark-connect-overview.md) for documentation.

---
title: Snowpark Connect for Spark release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-connect-2025.md
section: Release Notes
---

# Snowpark Connect for Spark release notes for 2025

Snowflake uses semantic versioning for Snowpark Connect for Spark updates.

For documentation, see [Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark](../../developer-guide/snowpark-connect/snowpark-connect-overview.md) and
[Submitting Spark applications](../../developer-guide/snowpark-connect/snowpark-submit.md).

## Version 1.7.0 (December 18, 2025)

### Snowpark Connect for Spark

#### New features

* Add support for Spark integral types.
* Add support for Scala 2.13.
* Introduce support for integral types overflow behind `snowpark.connect.handleIntegralOverflow` configuration.
* Add a configuration for using custom JAR files in UDFs.
* Support Scala UDFs if `UDFPacket` lacks input types metadata.
* Allow as input and output types case classes in `reduce` function.

#### Bug fixes

* Fix Parquet logical types (TIMESTAMP, DATE, DECIMAL) handling. Previously, Parquet files were read using physical types only (such as `LongType` for timestamps). Logical types can now be interpreted by returning proper types like `TimestampType`, `DateType`, and `DecimalType`. You can enable this by setting Spark configuration `snowpark.connect.parquet.useLogicalType` to `true`.
* Use the output schema when converting Spark’s `Row` to `Variant`.
* Handle empty `JAVA_HOME`.
* Fix `from_json` function for `MapType`.
* Support of configuration `spark.sql.parquet.outputTimestampType` for `NTZ` timezone.

#### Improvements

None.

### Snowpark Submit

#### New Features

* Add support for Scala 2.13.
* Add support for `--files` argument.

#### Bug Fixes

* Add support for `--jars` for pyspark workload.
* Fix bug for Snowpark Submit JWT authentication.

## Version 1.6.0 (December 12, 2025)

### Snowpark Connect for Spark

#### New features

* Support any type as output or input type in the Scala `map` and `flatmap` functions.
* Support `joinWith`.
* Support any return type in Scala UDFs.
* Support `registerJavaFunction`.

#### Bug fixes

* Fix JSON schema inference issue for JSON reads from Scala.
* Change return types of functions returning incorrect integral types.
* Fix update fields bug with `struct` type.
* Fix unbounded input decoder.
* Fix `struct` function when the argument is `unresolved_star`.
* Fix column name for Scala UDFs when the proto contains no function name.
* Add support for PATTERN in Parquet format.
* Handle `error` and `errorIfExists` write modes.

#### Improvements

None.

## Version 1.5.0 (December 04, 2025)

### Snowpark Connect for Spark

#### New features

* Bump snowflake-connector-python to <4.2.0.
* Add basic support for single-column map and `flatMap` operations on Scala datasets.
* Iceberg writing support `TargetFileSize` and `PartitionBy`.

#### Bug fixes

* Make SAS server initialization synchronous.
* Use `snowpark-connect-deps-1==3.56.3`.
* Fix `saveAsTable` with `input_filename` columns.
* Remove duplicated reading of the cache in Scala UDFs.
* Increase recursion limit.
* Fix `format_number`.
* Fix infer schema when query is provided in JDBC read.
* Only lock dict operation in `cache.py` to improve performance.
* Fix grouped data tests.
* Throw more detailed errors on table and read/write operations.

#### Improvements

None.

## Version 1.4.0 (November 25, 2025)

### Snowpark Connect for Spark

#### New features

* Introduce reduce function for Scala.

#### Improvements

None.

#### Bug fixes

* Fix failing array insert for nullable elements.
* Throw correct error on non-numeric args in covariance.

## Version 1.3.0 (November 19, 2025)

### Snowpark Connect for Spark

#### New features

* Support `filter` on a simple (single column) `Dataset`.
* Support Azure scheme URL parsing and special character file name.

#### Bug fixes

* Fix “Dataframe has no attribute dataframe” error in Scala catalog API.
* Fix aliases in subquery, document not working subqueries.
* Fix `plan_id` resolution after joins.
* Fix `meta.yaml` for multi-py versions.
* Enable `use_vectorized_scanner` as map type from parquet file was error.
* CSV reading `inferSchema` option specify datatype.
* Fix `substr` function handling of negative length.
* Use cached file formats in `read_parquet`.
* Improve local relation performance.
* Generate summary _common_metadata for parquet files.
* Remove repetitive `setSchema`, `setRole`, etc, for Snowflake pushdown.

#### Improvements

None.

## Version 1.2.0 (November 17, 2025)

### Snowpark Connect for Spark

#### New features

* Relax version requirements for grpcio and aiobotocore.

#### Improvements

* Specify dependencies version in `meta.yaml`.
* Build compiled and architecture-specific conda package.
* Ensure all `CloudPickleSerializer.loads` are not done in TCM.
* Include OSS SQL tests that start with the WITH clause.
* Do not upload Spark jars when running the server for pyt.
* Update internal queries count.

#### Bug fixes

* Fix tests for tcm.
* Fix CSV column name discrepancy from Spark.
* Use type cache for empty frames.
* Resolve Windows OSS runner general issues.

### Snowpark Submit

#### Improvements

* Generate unique workload names.

#### Bug Fixes

* Fix staged file reading.

## Version 1.0.1 (November 3, 2025)

> **Note:**
>
> With the release of this version, version 0.24 and previous versions are deprecated.

### Snowpark Connect for Spark

#### New features

* Add parameter for view creation strategies.
* Support string <-> year month interval.
* Support multiple pivot columns and aliases for pivot values in Spark SQL.
* Integrate OpenTelemetry span and traces.

#### Improvements

None.

#### Bug fixes

* Add a trailing slash for remove command.
* Invalid GROUP BY issue with aggregation function and nilary functions.
* Notebook exceeds gRPC maximum message size.
* Fix temporary view creation with colliding names.
* `array_size` with null argument.
* Fix for `$.0` JSON array access in `get_json_object` function.
* Fix self ANTI and SEMI LEFT joins.
* Handle different types in SQL function range.
* Fixed temporary view describe.

## Version 1.0.0 (October 28, 2025)

### Snowpark Connect for Spark

#### New features

* Add `rowToInferSchema` for CSV reading.
* Support INSERT INTO with CTE SQL command.
* I/O changes to add _SUCCESS file generation and metadata file filtering.
* `update(submit)`: Support installing Snowpark Connect for Spark in the Snowpark Submit client container.

#### Improvements

None.

#### Bug fixes

* Fix _SUCCESS path update.
* Throw error on remove failure update.
* Sequence function supporting integral types inputs.
* Fix types in empty `CreateTempViewUsing`.
* Fix Parquet file repartitioning on write.
* Resolve aliases in ORDER BY clause correctly.
* Remove scope temp session parameter.
* Fixed multiple self joins with join condition.
* Fix column name resolution in pivot.
* SQL parser aware of session timezone.
* Interval type coercion with other types.
* Fix having with nested CTEs.
* Improve qualified name resolution in Spark.

## Version 0.33.0 (October 10, 2025)

### Snowpark Connect for Spark

#### New features

* Add script to run on the output from Git action for merging SQLs.
* Add `--rebuild-whl` parameter to notebook test runner.
* Add support for both qualifiers after join.

#### Improvements

None.

#### Bug fixes

* Support escape parameter in SQL LIKE commands.
* Overwrite bug in partitions.
* Validate column count on INSERT.
* Incompatibility for pow with NAN.
* Cross JOIN with condition.
* Column attribution logic in nested queries.
* Update error message for interval test.
* String type coercion in set operation UNION and EXCEPT, coerce NUMERIC, DATE, DATETIME to STRING.
* Correctly resolve Snowpark columns after a full outer self JOIN.
* Expression in aggregate function might be zero improvement.
* Update: Revert “[SCOS GA BUG] string type coercion in set opera”
* DataFrame union of decimal type columns now widen as necessary.
* String type coercion in set operation UNION and EXCEPT, coerce NUMERIC, DATE, DATETIME to STRING (part1).
* Object not existed issue in TCM.
* Fix `to_binary(x, 'hex')` where `x` has odd number of letters and digits.
* Fix joins with empty tables.
* Fix HAVING clause to prioritize grouping columns over aggregate aliases with same name.

## Version 0.32.0 (October 17, 2025)

### Snowpark Connect for Spark

#### New features

* Support for `RepairTable`.
* Make `jdk4py` an optional dependency of Snowpark Connect for Spark to simplify configuring Java home for end users.
* Support more interval type cases.

#### Improvements

None.

#### Bug fixes

* Fix `Join` issues by refactoring qualifiers
* Fix `percentile_cont` to allow filter and sort order expressions.
* Fix `histogram_numeric` UDAF.
* Fix the `COUNT` function when called with multiple args.

## Version 0.31.0 (October 9, 2025)

### Snowpark Connect for Spark

#### New features

* Add support for expressions in the GROUP BY clause when the clause is explicitly selected.
* Add error codes to the error messages for better troubleshooting.

#### Improvements

None.

#### Bug fixes

* Fix the window function unsupported cast issue.

---
title: Snowpark Connect for Spark release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-connect-2026.md
section: Release Notes
---

# Snowpark Connect for Spark release notes for 2026

Snowflake uses semantic versioning for Snowpark Connect for Spark updates.

For documentation, see [Run Apache Spark™ workloads on Snowflake with Snowpark Connect for Spark](../../developer-guide/snowpark-connect/snowpark-connect-overview.md) and
[Submitting Spark applications](../../developer-guide/snowpark-connect/snowpark-submit.md).

## 1.19.0 (March 26, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Fix accessing struct field from array via getItem
* Fix names for accessing array elements
* Added missing compression for TEXT format
* Reduce query size in `DataFrame.replace`, UDTF creation, and `read_parquet`
* Emulate types on create [temp] view
* Fixed casting structured types to
* Fix text write type validation
* Support XML read dir in parallel
* Optimize `conv` function usage
* Support both Snowflake and `net.snowflake.spark.snowflake` format read and write
* Emulate types on create table
* Fix accessing nested structs with arrays
* Fix Parquet error message
* Optimize to_number reducing query size
* Fix UDF cache to consider query database change
* Optimize `mask` function
* Pass PATTERN to NVS fallback reader during Parquet schema inference
* Null and structured type coercion

#### New features

* Introduce DIRECTED join hint
* Integrate XML inferSchema

## 1.18.0 (March 19, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Added missing JDBC Type mapping
* Support user provided schema in parquet
* Handle invalid UTF-8 characters in JSON gracefully
* Resolve LCA columns only if actually used
* Optimize get_json_object query generation
* Strip semicolon from SQL query
* Make `processInBulk=True` the default for JSON reads and fix `NullType` schema inference
* Fix bug regarding incorrect stage read
* Add non check in udf registration
* Tighten limit for error message
* Allow missing fields in user provided schema
* JSON and CSV compression inference
* Fix for `coalesce(1)` creating a single file

#### New features

* Add `execute_jar` method to launch Java/Scala workloads

### Snowpark Submit

#### Bug fixes

* Fix error swallowing with `--wait-for-completion` flag

## 1.17.0 (March 13, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* JSON and CSV compression inference.
* Fix for `coalesce` creating a single file.
* Refactor JSON read to use `COPY INTO` for single-file reads and add `VariantType` schema inference.
* Allow JSON loading without explicit schema.
* Fix `multi_line` in JSON.
* Fix JSON infer schema to avoid scanning whole files.
* Correctly handle casting to timestamp `ltz`.
* Clamp hash returned value.
* Fix for `repartition` with `partitionBy`.
* Fix to use `[connections.spark-connect]` section header in `config.toml`.
* Convert Java `date`/`timestamp` format tokens to Snowflake equivalents for CSV reads.
* Calculate schema for `pivot` functions.
* Fix UDTFs in aliased lateral join.
* Align result for SQL `SET` command.
* Fix return type for `CEIL` and `FLOOR` functions.
* Improve query generation in `unbase64` v2.
* Fix some of option to Snowflake mapping for CSV.
* Fix serialization for `POJO`.
* Improve CSV header error messages.
* Improve `mapType` detection logic with `try_cast` for Parquet reads.

#### New features

* Support for `reduceGroups` API.
* Support specifying connection name inside `init_spark_session`.
* Add config param to use UDF for `unbase64`.

## 1.16.0 (March 12, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Optimize SQL generation in function `unbase64`.
* Fix `from_json` regression
* Fix for records that span multiple BZ2 compression block boundaries
* Fix nullability mapping in unresolved attribute
* Initialize `spark-connect` session with any connection, not just one named `spark-connect`
* Add XML options validation
* Drop CSV ESCAPE option when it matches the quote character to prevent compilation error
* Fix incorrect conversion of named tuples in `productEncoder`
* Verify `mergeSchema` for CSV and JSON is not supported
* Fix Parquet complex type round-trip (write + read)
* Fix schema for `pivot`/`unpivot`
* Fix return type for `MOD` and `PMOD` functions
* Fix CSV header extraction for files with leading blank lines
* Test timezones correctly and replace string-based date/time serialization with epoch-based
* Update Java version check for Windows
* Flatten nested `withColumn` calls
* Change logic for `Literal _IntegralType` in add/sub operations
* Return `LongType` for `COUNT` functions
* Read JSON: test compression = bz2/bzip2/none
* Improve performance of `to_varchar`/`to_char`
* Make better comparison in I/O testing
* Set `multi_line` to `False` by default for copy JSON

### Snowpark Submit

#### Bug fixes

* Throw error on unspecified compute pool.

## 1.15.0 (March 06, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Remove result scan when calling `df.count()`
* Make sure infer schema runs on limited rows for reading JSON
* Fix `createDataFrame` for interval types
* Change logic for `Literal _IntegralType` in multiplication and division operations
* Widen and coerce type for `Set` operations
* Fix `neo4j` multi label support
* Modify JAR metadata so that Grype does not detect Netty vulnerability
* Return correct type for `ANY_VALUE` function
* Return widened type for sequence
* Add support for config `spark.sql.parquet.inferTimestampNTZ.enabled`
* Batch column rename/cast in `_validate_schema_and_get_writer`
* JDBC hang when partitioned queries given with fetch size
* Return trimmed exception message when it exceeds the HTTP header limits
* Fix `map_type_to_snowflake_type` for `BigDecimal`
* Fix literal decimal precision and scale
* Improve random string generation
* Make BZ2 compressed JSON loading ignore corrupt records

#### New features

* Use staged files from config in Scala UDFs
* Use permissive `TRY_CAST` in JSON reading
* Make the number of server threads configurable

### Snowpark Submit

#### Bug fixes

* Adding back `init_spark_session()` to testing
* Update `snowpark-submit` command line output to clarify `snowflake-connection-name` is required.

## 1.14.0 (February 19, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Cache table type when running `saveAsTable`
* Optimize literal input for substring and type casting for `coalesce`
* Handle decimal overflow in `avg`/`mean` and fix decimal type coercion
* Iceberg - Preserve grants on overwrite
* Standardize SQL passthrough mode
* Optimize `from_utc_timestamp`/`to_utc_timestamp` for literal timezone
* Handle JSON null values in structured types to match Spark semantics
* Emulate integral types on creating tables from SQL
* Fix edge case with mapping nested rows in Scala UDFs
* Fix how Parquet handles read and write of complex structured datatypes
* Support save ignore argument for parquet files
* Add support for artifact repository
* Fix array nullability in Scala UDxF
* Fix `log1p` for args from (-1, 0) range
* Fix `first_value` and `last_value` in aggregate context
* Fix reading `DayTimeIntervalType` for Scala client

#### New features

* Handle timezones correctly in Scala UDFs
* Support Java 11 and 17 without any configuration

### Snowpark Submit updates

#### New features

* Support `snowpark-submit` for python 3.9
* Enhance `init_spark_session` to be usable in `snowpark-submit` workflow

## 1.13.0 (February 13, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Fixed `split` function issue
* Downgraded snowflake-snowpark-python dependency to version 1.44
* Fixed `Neo4j` dialect matching to improve SQL translation
* Fixed operation ID returned in execute responses to be consistent
* Fixed `gRPC` metadata handling for TCP channel connections

#### New features

* Added support for `partition_hint` in `mapPartitions` operations
* Added XML reader support for scenarios with user-defined schemas

## 1.11.0 (January 28, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Preserve hidden columns after various DataFrame operators
* Fix issues for scala udf input types (`byte`, `binary`, `scala.math.BigDecimal`)

#### Other updates

* Add `snowpark-submit` User Defined Args to comment

## 1.10.0 (January 22, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Fix config unset error for session configuration.
* Use copy into to load CSV files in parallel.
* Fix writes for DataFrames using outer joins.
* Handle nulls in Scala UDFs.
* Optimize CTE query generation with parameter protection.
* Avoid casting arguments of `DATEDIFF`.
* Fix appending partitioned files and reading of null partitions.
* Make a 10X performance improvement for conversion between base 10 and 16 using SQL.

#### New features

* Overwrite only modified partitions for parquet files.

#### Other updates

* Updated logic to detect if Snowpark Connect for Spark is running on XP.
* Support writing to a table with variant data type in Snowflake.
* Remove unnecessary info logs.
* Move Java tests out of Scala tests job to a separate job.
* Update the dependency version for gcsfs.

### Snowpark Submit

None.

## 1.9.0 (January 14, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Fix serializing Scala tuples.
* Fix loading huge JSON files.
* Implement small fixes for customer issues.
* Implement fixes for struct comparisons.
* Add handling for 0-column DataFrames.
* Correct upload file path.
* Fix `Upload_files_if_needed` not running in parallel.
* Improve input type inference when UDF input types are not defined in the proto.
* Fix NA edge cases.

#### New features

* Support reading single JSON BZ2 file.
* Support Scala UDFs in server-side Snowpark Connect for Spark.
* Implement cast between string and `daytime`.
* Add support for Scala UDFs in `group_map`.

### Snowpark Submit

#### Bug fixes

* Reduce generated workload names.

## 1.8.0 (January 07, 2026)

### Snowpark Connect for Spark

#### Bug fixes

* Fixed JAVA_HOME handling for Windows.

#### New features

* Support `neo4j` data source via JDBC.

### Snowpark Submit

None.

---
title: Snowpark Container Services container logs: Changes to resource attributes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1682.md
section: Release Notes
---

# Snowpark Container Services container logs: Changes to resource attributes

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

Snowflake can [capture and record local container logs](../../../developer-guide/snowpark-container-services/monitoring-services.md) from your application containers into the event table configured for your account.

The JSON object in the RESOURCE_ATTRIBUTES column in the event table provides information such as the service container that generated the log message, compute pool where the service is running, and others.

When this behavior change bundle is enabled, the JSON object in the RESOURCE_ATTRIBUTES column in the event table will change as described below:

| Field names before the change | Field names after the change |
| --- | --- |
| “snow.containers.compute_pool.id” . “snow.containers.compute_pool.name” | “snow.compute_pool.id” . “snow.compute_pool.name” |
| Not previously present | “snow.compute_pool.node.id” . “snow.compute_pool.node.instance_family” |
| “snow.containers.container.name” . “snow.containers.instance.name” . “snow.containers.restart.id” | “snow.service.container.name” . “snow.service.container.instance” . “snow.service.container.run.id” |
| “snow.executable.id” . “snow.executable.name” . “snow.executable.type” | “snow.service.id” . “snow.service.name” . “snow.service.type” . |
| “snow.executable.type” | Removed |
| “snow.account.name” . “snow.database.id” . “snow.database.name” . “snow.schema.id” | No change. These field remain as is. |

The following is an example of resource attributes for a container log entry in the event table when this behavior change is enabled.

```output
+-----------------------------------------------------------+
| RESOURCE_ATTRIBUTES                                       |
|-----------------------------------------------------------|
| {                                                         |
|   "snow.account.name": "SPCSDOCS1",                       |
|   "snow.compute_pool.id": 20,                             |
|   "snow.compute_pool.name": "TUTORIAL_COMPUTE_POOL",      |
|   "snow.compute_pool.node.id": "a17e8157",                |
|   "snow.compute_pool.node.instance_family": "CPU_X64_XS", |
|   "snow.database.id": 26,                                 |
|   "snow.database.name": "TUTORIAL_DB",                    |
|   "snow.schema.id": 212,                                  |
|   "snow.schema.name": "DATA_SCHEMA",                      |
|   "snow.service.container.instance": "0",                 |
|   "snow.service.container.name": "echo",                  |
|   "snow.service.container.run.id": "b30566",              |
|   "snow.service.id": 114,                                 |
|   "snow.service.name": "ECHO_SERVICE2",                   |
|   "snow.service.type": "Service"                          |
| }                                                         |
+-----------------------------------------------------------+
```

Ref: 1682

---
title: Snowpark Container Services job service:  Retention-time increase
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2093.md
section: Release Notes
---

# Snowpark Container Services job service: Retention-time increase

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When you run the [EXECUTE JOB SERVICE](../../../sql-reference/sql/execute-job-service.md) command, you start a Snowpark Container Services service as a job. After the job is complete, Snowflake performs the necessary cleanup automatically.

When this behavior change bundle is enabled, the behavior of this automatic cleanup process changes:

Before the change:
:   The automatic cleanup behavior depends on whether you configured the job to run synchronously or asynchronously. You configure the job by specifying or omitting the ASYNC property when running EXECUTE JOB SERVICE.

    * **Synchronous execution (omitting the ASYNC property or setting ASYNC=FALSE):** Snowflake automatically deletes the job service 10 to 20 minutes after completion.
    * **Asynchronous execution (with ASYNC=true):** Snowflake automatically deletes the job service 7 days after completion.

After the change:
:   Snowflake retains both synchronous and asynchronous job services for 14 days after completion before cleanup.

> **Note:**
>
> After a job service completes, Snowflake automatically cleans up the resources allocated to the job service to help reduce costs. You can still access job metadata for up to 14 days by using the [DESCRIBE SERVICE](../../../sql-reference/sql/desc-service.md) and [SHOW SERVICES](../../../sql-reference/sql/show-services.md) commands.
>
> When you recreate a synchronous job with the same name you might get “Object already exists” error because of the retention time increase. In this case, you might consider using a different name when recreating the job, or leave the name out and Snowflake will generate a job name automatically for you.

Ref: 2093

---
title: Snowpark Container Services job service: Retention time increase
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2206.md
section: Release Notes
---

# Snowpark Container Services job service: Retention time increase

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

When this behavior change bundle is enabled, Snowpark Container Services changes how [block volumes](../../../developer-guide/snowpark-container-services/block-storage-volume.md) are deleted:

Before the change:
:   When you delete a block volume, no snapshots are automatically created before the block volume is deleted, and no billable events related to snapshots are automatically created during volume deletion.

After the change:
:   When you delete a block volume by using any of the following commands, Snowflake first creates snapshots for the block volume, and then later deletes the snapshots:

    * [DROP SERVICE](../../../sql-reference/sql/drop-service.md) <service-name> FORCE
    * [ALTER COMPUTE POOL](../../../sql-reference/sql/alter-compute-pool.md) <compute-pool-name> STOP ALL
    * [ALTER SERVICE](../../../sql-reference/sql/alter-service.md) <service-name> RESTORE VOLUME <volume-name> FROM SNAPSHOT

    Snowflake assigns a snapshot name in this format: `SYS_BACKUP_ON_DELETEstring_timestamp`. These snapshots have a default retention period of seven days, after which Snowflake drops the snapshots. You are billed for these snapshots. This feature protects you from accidentally deleting block volumes.

    > **Note:**
    >
    > This change doesn’t apply to block volumes that are used by job services.

    You can opt-out of this change by setting the `snapshotOnDelete` option to `false` in the service specification:

    ```yaml
    volumes:                               # optional volume list
      - name: <name>
        source: local | stage | memory | block
        size: <bytes-of-storage>           # specify if memory or block is the volume source
        uid: <UID-value>                   # optional, only for stage volumes
        gid: <GID-value>                   # optional, only for stage volumes
        blockConfig:                       # optional
          initialContents:
            fromSnapshot: <snapshot-name>
          iops: <number-of-operations>
          throughput: <MiB-per-second>
          encryption: SNOWFLAKE_SSE | SNOWFLAKE_FULL
          snapshotOnDelete: true # defaults true for services and false for jobs, false to opt-out
          snapshotDeleteAfter: (<hours>h)|(<days>d)     # defaults to 7 days
    ```

Ref: 2206

---
title: Snowpark Container Services: Changes to access control for services and endpoints
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1611.md
section: Release Notes
---

# Snowpark Container Services: Changes to access control for services and endpoints

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

With this behavior change bundle, Snowpark Container Services now supports fine-grained access control allowing you to manage access privileges for endpoints that a service exposes. You can now grant a service role (defined in the service specification) access privileges to a specific endpoint and use service roles to control who can access the service endpoints.

> **Note:**
>
> The owner role for a service always has access to that service’s endpoints. This change applies only if the current role is not the owner role.

## Changes to the privileges required to create and alter a service function

The privileges required to create or alter a service function are changing:

Before the change:
:   Privileges required to create and manage service functions:

    * **To create a service function:** The current role must have the USAGE privilege on the service being referenced.
    * **To alter a service function:** You can alter a service function and associate it with another service. The current role must have the USAGE privilege on that other service.

After the change:
:   Privileges required to create and manage service functions:

    * **To create a service function:** The current role must have the USAGE privilege on the endpoint. You grant this privilege by granting the service role to the current role.
    * **To alter a service function:** You can alter a service function and associate it with another endpoint. The current role must have been granted the service role with the USAGE privilege on the new endpoint being referenced.

## Changes to the privileges required for ingress to a public endpoint

Users in the Snowflake account where the service is created can use ingress to a public endpoint. The privileges required to use the public endpoint are changing:

Before the change:
:   The current role must have the USAGE privilege on a service that exposes the endpoint.

After the change:
:   The current role must have the USAGE privilege on the endpoint. You grant this privilege by granting the service role to the current role.

## Change to the SHOW ENDPOINTS command output

The list of endpoints returned by this command is changing:

Before the change:
:   Returns a list of all the endpoints associated with the service.

After the change:
:   Returns a list of endpoints associated with the service that the current role has USAGE privileges for.

Ref: 1611

---
title: Snowpark Container Services: Changes to failed batch retry logic and new columns in the DESC FUNCTION command output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1938.md
section: Release Notes
---

# Snowpark Container Services: Changes to failed batch retry logic and new columns in the DESC FUNCTION command output

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

## Changes to retry logic for a failed batch request

> When you execute a query that includes a service function, Snowflake sends a series of HTTP requests, each containing a batch of rows to the service. If the service returns errors (the HTTP error 404, 429, 500, 502, 503, or 504), Snowflake retries the request. When Snowflake retries, it resends the same batch after a delay determined by an exponential backoff algorithm.

Before the change:
:   If the total sleep time between retries reaches one hour, Snowflake marks the batch execution as failed, and the service function execution fails as well.

After the change:
:   By default, Snowflake retries the failed batch three times. You can optionally specify the number of times you want Snowflake to retry the batch by executing the CREATE FUNCTION or ALTER FUNCTION command with the MAX_BATCH_RETRIES parameter. For example:

    * For the [CREATE FUNCTION](../../../sql-reference/sql/create-function-spcs.md) command, specify the MAX_BATCH_RETRIES parameter, as shown below:

      ```sqlexample
      CREATE [ OR REPLACE ] FUNCTION <name> ( [ <arg_name> <arg_data_type> ] [ , ... ] )
        RETURNS <result_data_type>
        …
        [ MAX_BATCH_RETRIES = <integer> ]
        AS '<http_path_to_request_handler>'
        …
      ```
    * For the [ALTER FUNCTION](../../../sql-reference/sql/alter-function-spcs.md) command, use `SET MAX_BATCH_RETRIES = integer` to set the number of retries:

      ```sqlexample
      ALTER FUNCTION [ IF EXISTS ] <name> ( [ <arg_data_type> , ... ] ) SET MAX_BATCH_RETRIES = <integer>
      ```

## DESCRIBE FUNCTION (Snowpark Container Services) command: New columns in output

When this behavior change bundle is enabled, the output of the [DESCRIBE FUNCTION (Snowpark Container Services)](../../../sql-reference/sql/desc-function-spcs.md) command includes the following new columns:

| Column name | Description |
| --- | --- |
| MAX_BATCH_RETRIES | The maximum number of retries for each batch of rows processed by the service function. |
| ON_BATCH_FAILURE | The service function’s behavior when a batch of rows reaches the maximum retry limit. |
| BATCH_TIMEOUT_SECS | The maximum time Snowflake waits for a single batch of rows to be processed (including retries and async request polling) before terminating the batch request. |

Ref: 1938

---
title: Snowpark Container Services: Changes to the compute pool maintenance window
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1856.md
section: Release Notes
---

# Snowpark Container Services: Changes to the compute pool maintenance window

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

The [compute pool maintenance window](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md) is changing:

Before the change:
:   The maintenance window is: Monday-Thursday, 11 PM to 5 AM.

After the change:
:   The maintenance window is: Saturday 8 PM to Sunday 8 AM, and Sunday 8 PM to Monday 8 AM.

Ref: 1856

---
title: Snowpark Container Services: Error if image is not found when creating service or job
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1550.md
section: Release Notes
---

# Snowpark Container Services: Error if image is not found when creating service or job

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

When attempting to create a Snowpark Container Services [service](../../../sql-reference/sql/create-service.md) or [job](../../../sql-reference/sql/execute-job-service.md) and where the image identified in the specification is not found in the repository, the behavior is as follows:

Before the change:
:   The error message `Failed to read image manifest : [status code]` is displayed.

After the change:
:   If Snowflake cannot connect to the repository, the error message `Failed to retrieve image [image path] from image repository : [status code]` is displayed.

    In addition, if the image does not exist in the repository, the error message `Image [image path] not found. Please verify the image exists in the image repository.` is displayed.

Ref: 1550

---
title: Snowpark Container Services: Ingress and web app security updates for Azure
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_03/bcr-1953.md
section: Release Notes
---

# Snowpark Container Services: Ingress and web app security updates for Azure

> **Attention:**
>
> This behavior change is in the 2025_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_03_bundle.md).

As explained in [Ingress and web app security](../../../developer-guide/snowpark-container-services/service-network-communications.md), when you create a Snowpark Container Services service
for web hosting (network ingress), for added security, the Snowflake proxy service monitors incoming requests to your service and outgoing
responses from your service to the clients.

For Snowflake accounts on Azure, the proxy is changing the way that it modifies the `Content-Security-Policy` (CSP) response header:

Before the change:
:   The CSP does not provide the connectivity restrictions described in [Responses outgoing to the clients](../../../developer-guide/snowpark-container-services/service-network-communications.md).

    As a result, application clients can connect to sites that are not defined in EAI.

After the change:
:   The CSP restricts application clients from connecting to sites that are not defined in the EAI.

Ref: 1953

---
title: Snowpark Container Services: New default values and validation of resource requirements for a service
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1648.md
section: Release Notes
---

# Snowpark Container Services: New default values and validation of resource requirements for a service

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

> **Attention:**
>
> This behavior change was going to be introduced with the 2024_05 bundle. However, it had been postponed and is now in the 2024_06 bundle.

You provide [resource requirements](../../../developer-guide/snowpark-container-services/specification-reference.md) for a service in the service specification.

The way in which Snowflake handles services with unspecified resource requirements is changing. In addition, the way in which Snowflake validates specified resource requirements is changing:

Before the change:
:   * If you do not provide any resource requirements, Snowflake assumes your service will consume negligible resources.
    * When you provide resource requirements, Snowflake validates the values against the entire node capacity. Resources consumed by Snowpark Container Services system components are not considered.

After the change:
:   If resource requirements are not provided the following defaults are applied. Note that `resource.requests` and `resource.limits` are relative to the node capacity (vCPU and memory) of the instance family of the associated [compute pool](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md).

    * If a resource request (cpu, memory, or both) is not provided, Snowflake derives one for you:

      + For `cpu`, the derived value is either 0.5 or the `cpu` limit you provided, whichever is smaller.
      + For `memory`, the derived value is either 0.5 GiB or the `memory` limit you provided, whichever is smaller.
    * If a resource limit (cpu, memory, or both) is not provided, Snowflake defaults the limits to the node capacity for the instance family of the associated [compute pool](../../../developer-guide/snowpark-container-services/working-with-compute-pool.md).
    * If you do provide `resource.limits` and they exceed the node capacity, Snowflake will cap the limit to the node capacity.
    * Snowflake evaluates these resource requirements independently for `cpu` and `memory`.

    Note that if it’s theoretically impossible for Snowflake to schedule the service on the given compute pool, CREATE SERVICE will fail. Theoretically impossible assumes the compute pool has the maximum number of allowed nodes and there are no other services running on the compute pool. That is, there is no way Snowflake could allocate the requested resources within the compute pool limits.
    If it’s theoretically possible, but required resources are in use, then CREATE SERVICE will succeed. Some service instances will report status indicating that the service cannot be scheduled due to insufficient resources until resources become available.

Also, with this BCR the node capacity (vCPU and memory) for each instance type has changed as shown:

| Instance family | vCPU . before change | vCPU . after change | Memory (GiB) . before change | Memory (GiB) . after change |
| --- | --- | --- | --- | --- |
| CPU_X64_XS | 2 | 1 | 8 | 6 |
| CPU_X64_S | 4 | 3 | 16 | 13 |
| CPU_X64_M | 8 | 6 | 32 | 28 |
| CPU_X64_L | 32 | 28 | 128 | 116 |
| HIGHMEM_X64_S | 8 | 6 | 64 | 58 |
| HIGHMEM_X64_M | 32 | 28 | 256 (AWA) . 256 (Azure) | 240 (AWS) . 244 (Azure) |
| HIGHMEM_X64_L | 128 (AWS) . 96 (Azure) | 124 (AWS) . 92 (Azure) | 1024 (AWS) . 672 (Azure) | 984 (AWS) . 654 (Azure) |
| GPU_NV_S . (AWS only) | 8 | 6 | 32 | 27 |
| GPU_NV_M . (AWS only) | 48 | 44 | 192 | 178 |
| GPU_NV_L . (AWS only) | 96 | 92 | 1152 | 1112 |
| GPU_NV_XS . (Azure only) | 4 | 3 | 28 | 26 |
| GPU_NV_SM . (Azure only) | 36 | 32 | 440 | 424 |
| GPU_NV_2M . (Azure only) | 72 | 68 | 880 | 858 |
| GPU_NV_3M . (Azure only) | 48 | 44 | 440 | 424 |
| GPU_NV_SL . (Azure only) | 96 | 92 | 880 | 858 |

Ref: 1648

---
title: Snowpark Container Services: New stage mount allotment limit per compute pool node
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1698.md
section: Release Notes
---

# Snowpark Container Services: New stage mount allotment limit per compute pool node

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

When choosing a compute pool node on which to run a service instance, Snowflake behavior is:

Before the change:
:   Snowflake ensures that the resources requested in the `containers.resources` field in the [specification](../../../developer-guide/snowpark-container-services/specification-reference.md) are available on the specific node.

After the change:
:   Snowflake also ensures that the compute pool nodes do not exceed eight stage mount allotments per node.

    Snowflake supports storage volumes for use by application containers. Snowflake internal stage is one of the [supported storage volume types](../../../developer-guide/snowpark-container-services/snowflake-stage-volume.md).

    For optimal performance, Snowflake now limits the number of service instances that use a stage volume mount to eight per compute pool node. It does not matter whether these instances belong to the same service or different services.

    When the limit is reached, Snowflake does not use that node to start new service instances that use a stage volume. Instead, Snowflake starts the service instance on a different node in the compute pool.

    To accommodate this stage mount allotment limit on a node, in some cases, you can increase the maximum number of nodes that you request for a compute pool. This ensures that additional nodes are available for Snowflake to start your service instances.

Ref: 1698

---
title: Snowpark Library for Python release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python.md
section: Release Notes
---

# Snowpark Library for Python release notes

The [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowpark-python-2026.md)
* [2025 releases](snowpark-python-2025.md)
* [2024 releases](snowpark-python-2024.md)
* [2023 releases](snowpark-python-2023.md)
* [2022 releases](snowpark-python-2022.md)

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

---
title: Snowpark Library for Python release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python-2022.md
section: Release Notes
---

# Snowpark Library for Python release notes for 2022

This article contains the release notes for the [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) updates.

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

## Version 1.0.0 (2022-11-01)

### New features

* Added `Session.generator()` to create a new DataFrame using the [GENERATOR](../../sql-reference/functions/generator.md) table function.
* Added the SECURE parameter to the functions that create a secure UDF or UDTF.

## Version 0.12.0 (2022-10-14)

### New features

* Added new APIs for async job:

  + `Session.create_async_job()` to create an `AsyncJob` instance from a query id.
  + `AsyncJob.result()` now accepts the argument `result_type` to return the results in different formats.
  + `AsyncJob.to_df()` returns a `DataFrame` built from the result of this asynchronous job.
  + `AsyncJob.query()` returns the SQL text of the executed query.
* `DataFrame.agg()` and `RelationalGroupedDataFrame.agg()` now accept variable-length arguments.
* Added parameters `lsuffix` and `rsuffix` to `DataFrame.join()` and `DataFrame.cross_join()` to conveniently
  rename overlapping columns.
* Added `Table.drop_table()` so you can drop the temp table after calling `DataFrame.cache_result()`.
  `Table` is also a context manager, so you can use the with statement to drop the cache temp table after use.
* Added `Session.use_secondary_roles()`.
* Added functions `first_value()` and `last_value()`. (contributed by @chasleslr)
* Added on as an alias for `using_columns` and how as an alias for `join_type` in `DataFrame.join()`.

### Bug fixes

* Fixed a bug in `Session.create_dataframe()` that raised an error when schema names had special characters.
* Fixed a bug in which options set in `Session.read.option()` were not passed to `DataFrame.copy_into_table()` as default values.
* Fixed a bug in which `DataFrame.copy_into_table()` raised an error when a copy option had single quotes in the value.

## Version 0.11.0 (2022-09-28)

### Behavior changes

* `Session.add_packages()` now raises a `ValueError` when the version of a package cannot be found in Snowflake
  Anaconda channel. Previously, `Session.add_packages()` succeeded and a `SnowparkSQLException` exception was raised
  later in the UDF or stored procedure registration step.

### New features

* Added method `FileOperation.get_stream()` to support downloading stage files as a stream.
* Added support in `functions.ntiles()` to accept an int argument.
* Added the following aliases:

  + `functions.call_function()` for `functions.call_builtin()`.
  + `functions.function()` for `functions.builtin()`.
  + `DataFrame.order_by()` for `DataFrame.sort()`.
  + `DataFrame.orderBy()` for `DataFrame.sort()`.
* Improved `DataFrame.cache_result()` to return a more accurate `Table` class instead of a DataFrame class.
* Added support to allow `session` as the first argument when calling `StoredProcedure`.

### Improvements

* Improved nested query generation by flattening queries when applicable. This improvement can be enabled by setting
  `Session.sql_simplifier_enabled = True`. `DataFrame.select()`, `DataFrame.with_column()`, `DataFrame.drop()`, and
  other select-related APIs have more flattened SQL now. `DataFrame.union()`, `DataFrame.union_all()`,
  `DataFrame.except_()`, `DataFrame.intersect()`, and `DataFrame.union_by_name()` have flattened SQL generated when
  multiple set operators are chained.
* Improved type annotations for async job APIs.

### Bug fixes

* Fixed a bug in which `Table.update()`, `Table.delete()`, and `Table.merge()` tried to reference a temp table that did not exist.

## Version 0.10.0 (2022-09-16)

### New features

* Added experimental APIs for evaluating Snowpark dataframes with asynchronous queries:

  + Added keyword argument block to the following action APIs on Snowpark dataframes (which execute queries) to allow asynchronous evaluations:

    - `DataFrame.collect()`, `DataFrame.to_local_iterator()`, `DataFrame.to_pandas()`, `DataFrame.to_pandas_batches()`,
      `DataFrame.count()`, `DataFrame.first()`, `DataFrameWriter.save_as_table()`,
      `DataFrameWriter.copy_into_location()`, `Table.delete()`, `Table.update()`, and `Table.merge()`.
  + Added method `DataFrame.collect_nowait()` to allow asynchronous evaluations.
  + Added class `AsyncJob` to retrieve results from asynchronously executed queries and check their status.
* Added support for `table_type` in `Session.write_pandas()`. You can now choose from these `table_type` options:
  `temporary`, `temp`, and `transient`.
* Added support for using Python structured data (`list`, `tuple`, and `dict`) as literal values in Snowpark.
* Added keyword argument `execute_as` to `functions.sproc()` and `session.sproc.register()` to allow registering a
  stored procedure as a caller or owner.
* Added support for specifying a pre-configured file format when reading files from a stage in Snowflake.

### Improvements

* Added support for displaying details of a Snowpark session.

### Bug fixes

* Fixed a bug in which `DataFrame.copy_into_table()` and `DataFrameWriter.save_as_table()` mistakenly created a new
  table if the table name was fully qualified, and the table already existed.

### Deprecations

* Deprecated keyword argument `create_temp_table` in `Session.write_pandas()`.
* Deprecated invoking UDFs using arguments wrapped in a Python list or tuple. You can use variable-length arguments
  without a list or tuple.

### Dependency updates

* Updated `snowflake-connector-python` to 2.7.12.

## Version 0.9.0 (2022-08-30)

### New features

* Added support for displaying source code as comments in the generated scripts when registering UDFs. This feature is
  turned on by default. To turn it off, pass the new keyword argument `source_code_display` as False when
  calling `register()` or `@udf()`.
* Added support for calling table functions from `DataFrame.select()`, `DataFrame.with_column()`,
  and `DataFrame.with_columns()`, which now take parameters of type `table_function.TableFunctionCall` for columns.
* Added keyword argument `overwrite` to `session.write_pandas()` to allow you to overwrite contents of a
  Snowflake table with that of a Pandas DataFrame.
* Added keyword argument `column_order` to `df.write.save_as_table()` to specify the matching rules when inserting
  data into a table in append mode.
* Added method `FileOperation.put_stream()` to upload local files to a stage via a file stream.
* Added methods `TableFunctionCall.alias()` and `TableFunctionCall.as_()` to allow aliasing the names of columns
  that come from the output of table function joins.
* Added function `get_active_session()` in module `snowflake.snowpark.context` to get the current active Snowpark session.

### Improvements

Improved the function `function.uniform()` to infer the types of inputs `max_` and `min_` and cast the limits to `IntegerType` or `FloatType`, respectively.

### Bug fixes

* Fixed a bug in which batch insert should not raise an error when `statement_params` is not passed to the function.
* Fixed a bug in which column names should be quoted when `session.create_dataframe()` is called with `dicts` and a
  given schema.
* Fixed a bug in which creation of a table should be skipped if the table already exists and is in append mode when
  calling `df.write.save_as_table()`.
* Fixed a bug in which third-party packages with underscores cannot be added when registering UDFs.

## Version 0.8.0 (2022-07-22)

### New features

* Added keyword only argument `statement_params` to the following methods to allow for specifying statement level parameters:

  + `collect`, `to_local_iterator`, `to_pandas`, `to_pandas_batches`, `count`, `copy_into_table`, `show`, `create_or_replace_view`, `create_or_replace_temp_view`, `first`, `cache_result`, and `random_split` on class `snowflake.snowpark.Dataframe`.
  + `update`, `delete` and `merge` on class `snowflake.snowpark.Table`.
  + `save_as_table` and `copy_into_location` on class `snowflake.snowpark.DataFrameWriter`.
  + `approx_quantile`, `statement_params`, `cov`, and `crosstab` on class `snowflake.snowpark.DataFrameStatFunctions`.
  + `register` and `register_from_file` on class `snowflake.snowpark.udf.UDFRegistration`.
  + `register` and `register_from_file` on class `snowflake.snowpark.udtf.UDTFRegistration`.
  + `register` and `register_from_file` on class `snowflake.snowpark.stored_procedure.StoredProcedureRegistration`.
  + `udf`, `udtf`, and `sproc` in `snowflake.snowpark.functions`.
* Added support for `Column` as an input argument to `session.call()`.
* Added support for `table_type` in `df.write.save_as_table()`. You can now choose from these `table_type` options: `temporary`, `temp`, and `transient`.

### Improvements

* Added validation of object name in `session.use_*` methods.
* Updated the `query` tag in SQL to escape it when it contains special characters.
* Added a check to see if Anaconda terms are acknowledged when adding missing packages.

### Bug fixes

* Fixed the limited length of the string column in `session.create_dataframe()`.
* Fixed a bug in which `session.create_dataframe()` mistakenly converted 0 and `False` to `None` when the input data was only a list.
* Fixed a bug in which calling `session.create_dataframe()` using a large local dataset sometimes created a temp table twice.
* Aligned the definition of `function.trim()` with the SQL function definition.
* Fixed an issue where snowpark-python would hang when using the Python system-defined (built-in function) sum vs. the Snowpark `function.sum()`.

## Version 0.7.0 (2022-05-25)

### New features

* Added support for user-defined table functions (UDTFs).

  + Use function `snowflake.snowpark.functions.udtf()` to register a UDTF, or use it as a decorator to register the UDTF.
  + You can also use `Session.udtf.register()` to register a UDTF.
  + Use `Session.udtf.register_from_file()` to register a UDTF from a Python file.
* Updated APIs to query a table function, including both Snowflake built-in table functions and UDTFs.

  + Use function `snowflake.snowpark.functions.table_function()` to create a callable representing a table function and use it to call the table function in a query.
  + Alternatively, use function `snowflake.snowpark.functions.call_table_function()` to call a table function.
  + Added support for the `over` clause, which specifies partition by and order by when lateral joining a table function.
  + Updated `Session.table_function()` and `DataFrame.join_table_function()` to accept `TableFunctionCall` instances.

### Breaking changes

* When creating a function with `functions.udf()` and `functions.sproc()`, you can now specify an empty list for the
  imports or packages argument to indicate that no import or package is used for this UDF or stored procedure.
  Previously, specifying an empty list meant that the function would use session-level imports or packages.
* Improved the `__repr__` implementation of data types in `types.py`. The unused `type_name` property has been removed.
* Added a Snowpark-specific exception class for SQL errors. This replaces the previous `ProgrammingError` from the Python connector.

### Improvements

* Added a lock to a UDF or UDTF when it is called for the first time per thread.
* Improved the error message for pickling errors that occurred during UDF creation.
* Included the query ID when logging the failed query.

### Bug fixes

* Fixed a bug in which non-integral data (such as timestamps) was occasionally converted to integer when calling `DataFrame.to_pandas()`.
* Fixed a bug in which `DataFrameReader.parquet()` failed to read a parquet file when its column contained spaces.
* Fixed a bug in which `DataFrame.copy_into_table()` failed when the dataframe is created by reading a file with inferred schemas.

### Deprecations

* `Session.flatten()` and `DataFrame.flatten()`.

### Dependency Updates

* Restricted the version of cloudpickle <= 2.0.0.

## Version 0.6.0 (2022-04-27)

### New features

* Added support for the vectorized UDFs via Python UDF Batch API. The Python UDF batch API enables defining Python
  functions that receive batches of input rows as [Pandas DataFrames](https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.html)
  and return batches of results as [Pandas arrays](https://pandas.pydata.org/docs/reference/api/pandas.array.html)
  or [Series](https://pandas.pydata.org/docs/reference/series.html). This can improve the performance of UDFs in Snowpark.
* Added support for inferring the schema of a DataFrame by default when it is created by reading a Parquet, Avro, or ORC file in the stage.
* Added functions current_session(), current_statement(), current_user(), current_version(), current_warehouse(), date_from_parts(), date_trunc(), dayname(), dayofmonth(), dayofweek(), dayofyear(), grouping(), grouping_id(), hour(), last_day(), minute(), next_day(), previous_day(), second(), month(), monthname(), quarter(), year(), current_database(), current_role(), current_schema(), current_schemas(), current_region(), current_available_roles(), add_months(), any_value(), bitnot(), bitshiftleft(), bitshiftright(), convert_timezone(), uniform(), strtok_to_array(), sysdate(), time_from_parts(), timestamp_from_parts(), timestamp_ltz_from_parts(), timestamp_ntz_from_parts(), timestamp_tz_from_parts(), weekofyear(), percentile_cont() to snowflake.snowflake.functions.

### Improvements

* Added support for creating an empty DataFrame with a specific schema using the Session.create_dataframe() method.
* Changed the logging level from INFO to DEBUG for several logs (e.g., the executed query) when evaluating a dataframe.
* Improved the error message when failing to create a UDF due to pickle errors.
* Removed the following APIs that were deprecated in 0.4.0: DataFrame.groupByGroupingSets(), DataFrame.naturalJoin(), DataFrame.joinTableFunction, DataFrame.withColumns(), Session.getImports(), Session.addImport(), Session.removeImport(), Session.clearImports(), Session.getSessionStage(), Session.getDefaultDatabase(), Session.getDefaultSchema(), Session.getCurrentDatabase(), Session.getCurrentSchema(), Session.getFullyQualifiedCurrentSchema().
* Added typing-extension as a new dependency with the version >= 4.1.0.

### Bug fixes

* Removed pandas hard dependencies in the Session.create_dataframe() method.

## Version 0.5.0 (2022-03-22)

### New features

* Added stored procedures API.
* Added Session.sproc property and sproc() to snowflake.snowpark.functions, so you can register stored procedures.
* Added Session.call to call stored procedures by name.
* Added UDFRegistration.register_from_file() to allow registering UDFs from Python source files or zip files directly.
* Added UDFRegistration.describe() to describe a UDF.
* Added DataFrame.random_split() to provide a way to randomly split a dataframe.
* Added functions md5(), sha1(), sha2(), ascii(), initcap(), length(), lower(), lpad(), ltrim(), rpad(), rtrim(), repeat(), soundex(), regexp_count(), replace(), charindex(), collate(), collation(), insert(), left(), right(), endswith() to snowflake.snowpark.functions.
* The call_udf() function now also accepts literal values.
* Provided a distinct keyword in array_agg().

### Bug fixes

* Fixed an issue that caused DataFrame.to_pandas() to have a string column if Column.cast(IntegerType()) was used.
* Fixed a bug in DataFrame.describe() when there is more than one string column.

## Version 0.4.0 (2022-02-15)

### New features

* You can now specify which Anaconda packages to use when defining UDFs.
* Added add_packages(), get_packages(), clear_packages(), and remove_package() to class Session.
* Added add_requirements() to Session so you can use a requirements file to specify which packages this session will use.
* Added parameter packages to function snowflake.snowpark.functions.udf() and method UserDefinedFunction.register() to indicate UDF-level Anaconda package dependencies when creating a UDF.
* Added parameter imports to snowflake.snowpark.functions.udf() and UserDefinedFunction.register() to specify UDF-level code imports.
* Added a parameter session to function udf() and UserDefinedFunction.register() so you can specify which session to use to create a UDF if you have multiple sessions.
* Added types Geography and Variant to snowflake.snowpark.types to be used as type hints for Geography and Variant data when defining a UDF.
* Added support for Geography geoJSON data.
* Added Table, a subclass of DataFrame for table operations.
* Methods update and delete update and delete rows of a table in Snowflake.
* Method merge merges data from a DataFrame to a Table.
* Overrided method DataFrame.sample() with an additional parameter seed, which works on tables but not on views and sub-queries.
* Added DataFrame.to_local_iterator() and DataFrame.to_pandas_batches() to allow getting results from an iterator when the result set returned from the Snowflake database is too large.
* Added DataFrame.cache_result() for caching the operations performed on a DataFrame in a temporary table. Subsequent operations on the original DataFrame have no effect on the cached result DataFrame.
* Added property DataFrame.queries to get SQL queries that will be executed to evaluate the DataFrame.
* Added Session.query_history() as a context manager to track SQL queries executed on a session, including all SQL queries to evaluate DataFrames created from a session. Both query ID and query text are recorded.
* You can now create a Session instance from an existing established snowflake.connector.SnowflakeConnection. Use parameter connection in Session.builder.configs().
* Added use_database(), use_schema(), use_warehouse(), and use_role() to class Session to switch database/schema/warehouse/role after a session is created.
* Added DataFrameWriter.copy_into_table() to unload a DataFrame to stage files.
* Added DataFrame.unpivot().
* Added Column.within_group() for sorting the rows by columns with some aggregation functions.
* Added functions listagg(), mode(), div0(), acos(), asin(), atan(), atan2(), cos(), cosh(), sin(), sinh(), tan(), tanh(), degrees(), radians(), round(), trunc(), and factorial() to snowflake.snowpark.functions.
* Added an optional argument ignore_nulls in function lead() and lag().
* The condition parameter of function when() and iff() now accepts SQL expressions.

### Improvements

* All function and method names have been renamed to use the snake case naming style, which is more Pythonic. For convenience, some camel case names are kept as aliases to the snake case APIs. It is recommended to use the snake case APIs.
* Deprecated these methods on class Session and replaced them with their snake case equivalents: getImports(), addImports(), removeImport(), clearImports(), getSessionStage(), getDefaultSchema(), getDefaultSchema(), getCurrentDatabase(), and getFullyQualifiedCurrentSchema().
* Deprecated these methods on class DataFrame and replaced them with their snake case equivalents: groupingByGroupingSets(), naturalJoin(), withColumns(), and joinTableFunction().
* Property DataFrame.columns is now consistent with DataFrame.schema.names and the Snowflake database identifier requirements.
* Column.__bool__() now raises a TypeError. This will ban the use of logical operators and, or, not on Column object. For example, col(“a”) > 1 and col(“b”) > 2 will raise a TypeError. Use (col(“a”) > 1) & (col(“b”) > 2) instead.
* Changed PutResult and GetResult to subclass NamedTuple.
* Fixed a bug which raised an error when the local path or stage location has a space or other special characters.
* Changed DataFrame.describe() so that non-numeric and non-string columns are ignored instead of raising an exception.

### Dependency Updates

* Updated snowflake-connector-python to 2.7.4.

## Version 0.3.0 (2022-01-09)

### New features

* Added Column.isin() with an alias Column.in_().
* Added Column.try_cast(), which is a special version of cast(). It tries to cast a string expression to other types and returns null if the cast is not possible.
* Added Column.startswith() and Column.substr() to process string columns.
* Column.cast() now also accepts a str value to indicate the cast type in addition to a DataType instance.
* Added DataFrame.describe() to summarize the stats of a DataFrame.
* Added DataFrame.explain() to print the query plan of a DataFrame.
* DataFrame.filter() and DataFrame.select_expr() now accept a SQL expression.
* Added a new bool parameter called create_temp_table to methods DataFrame.saveAsTable() and Session.write_pandas() to optionally create a temp table.
* Added DataFrame.minus() and DataFrame.subtract() as aliases to DataFrame.except_().
* Added regexp_replace(), concat(), concat_ws(), to_char(), current_timestamp(), current_date(), current_time(), months_between(), cast(), try_cast(), greatest(), least(), and hash() to the snowflake.snowpark.functions module.

### Bug fixes

* Fixed an issue where Session.createDataFrame(pandas_df) and Session.write_pandas(pandas_df) raised an exception when the Pandas DataFrame had spaces in the column name.
* Fixed an issue where DataFrame.copy_into_table() sometimes erroneously printed an error level log entry.
* Fixed an API documentation issue where some DataFrame APIs were missing from the documentation.

### Dependency Updates

* Updated snowflake-connector-python to 2.7.2, which upgrades the pyarrow dependency to 6.0.x. Refer to the Python connector 2.7.2 release notes for more information.

## Version 0.2.0 (2021-12-02)

### New features

* Added the createDataFrame() method for creating a DataFrame from a Pandas DataFrame.
* Added the write_pandas() method for writing a Pandas DataFrame to a table in Snowflake and getting a Snowpark DataFrame object back.
* Added new classes and methods for calling window functions.
* Added the new functions cume_dist(), to find the cumulative distribution of a value with regard to other values within a window partition, and row_number(), which returns a unique row number for each row within a window partition.
* Added functions for computing statistics for DataFrames in the DataFrameStatFunctions class.
* Added functions for handling missing values in a DataFrame in the DataFrameNaFunctions class.
* Added new methods: rollup(), cube(), and pivot() to the DataFrame class.
* Added the GroupingSets class, which you can use with the DataFrame groupByGroupingSets method to perform a SQL GROUP BY GROUPING SETS.
* Added the new FileOperation(session) class that you can use to upload and download files to and from a stage.
* Added the copy_into_table() method for loading data from files in a stage into a table.
* In CASE expressions, the functions when and otherwise now accept Python types in addition to Column objects.
* When you register a UDF you can now optionally set the replace parameter to True to overwrite an existing UDF with the same name.

### Improvements

* UDFs are now compressed before they are uploaded to the server. This makes them about 10 times smaller, which can help when you are using large ML model files.
* When the size of a UDF is less than 8196 bytes, it will be uploaded as in-line code instead of uploaded to a stage.

### Bug fixes

* Fixed an issue where the statement df.select(when(col(“a”) == 1, 4).otherwise(col(“a”))), [Row(4), Row(2), Row(3)] raised an exception.
* Fixed an issue where df.toPandas() raised an exception when a DataFrame was created from large local data.

---
title: Snowpark Library for Python release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python-2023.md
section: Release Notes
---

# Snowpark Library for Python release notes for 2023

This article contains the release notes for the [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) updates.

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

## Version 1.11.1 (2023-12-07)

Version 1.11.1 of the Snowpark library introduces some new features.

### New features

* Added the `conn_error` attribute to `SnowflakeSQLException`, which stores the whole underlying exception from `snowflake-connector-python`.
* Added support for `RelationalGroupedDataframe.pivot()` to access `pivot` in the following pattern `Dataframe.group_by(...).pivot(...)`.
* Added the experimental feature, Local Testing Mode, which allows you to create and operate on Snowpark Python DataFrames locally without connecting to a Snowflake account.
  You can use the local testing framework to test your DataFrame operations locally, on your development machine or in a CI (continuous integration) pipeline, before deploying code changes to your account.
* Added support for `arrays_to_object` new functions in `snowflake.snowpark.functions`.
* Added support for the vector data type.

### Dependency updates

* Bumped the cloudpickle dependency to work with `cloudpickle==2.2.1`.
* Updated `snowflake-connector-python` to version `3.4.0`.

### Bug fixes

* DataFrame column names quoting check now supports newline characters.
* Fixed a bug where a DataFrame generated by `session.read.with_metadata` created an inconsistent table when doing `df.write.save_as_table`.

## Version 1.10.0 (2023-11-03)

Version 1.10.0 of the Snowpark library introduces some new features.

### New features

* Added support for managing case sensitivity in `DataFrame.to_local_iterator()`.
* Added support for specifying vectorized UDTF’s input column names by using the optional parameter `input_names` in
  `UDTFRegistration.register`, `UDTFRegistration.register_file`, and `functions.pandas_udtf`.
  By default, `RelationalGroupedDataFrame.applyInPandas` will infer the column names from current DataFrame schema.
* Added `sql_error_code` and `raw_message` attributes to `SnowflakeSQLException` when it is caused by a SQL exception.

### Bug fixes

* Fixed a bug in `DataFrame.to_pandas()` where converting Snowpark DataFrames to Pandas DataFrames was losing precision on integers with more than 19 digits.
* Fixed a bug in `session.add_packages` where it could not handle a requirement specifier that contained a project name with an underscore and a version.
* Fixed a bug in `DataFrame.limit()` when `offset` is used and the parent `DataFrame` uses `limit`. Now the `offset` won’t impact the parent DataFrame’s `limit`.
* Fixed a bug in `DataFrame.write.save_as_table` where DataFrames created from the read API could not save data into Snowflake because of an invalid column name `$1`.

### Behavior changes

* Changed the behavior of `date_format`:

  + The `format` argument changed from optional to required.
  + The returned result changed from a date object to a date-formatted string.
* When a window function or a sequence-dependent data generator (`normal`, `zipf`, `uniform`, `seq1`, `seq2`, `seq4`, `seq8`) function is used, the
  sort and filter operation will no longer be flattened when generating the query.

## Version 1.9.0 (2023-10-16)

Version 1.9.0 of the Snowpark library introduces some new features.

### New features

* Added support for the Python 3.11 runtime environment.
* Support `PythonObjJSONEncoder` JSON-serializable objects for `ARRAY` and `OBJECT` literals.

### Dependency updates

* Re-added the dependency of `typing-extensions`.

### Bug fixes

* Fixed a bug where imports from permanent stage locations were ignored for temporary stored procedures, UDTFs, UDFs, and UDAFs.
* Revert back to using CTAS (CREATE TABLE AS SELECT) statement for `DataFrameWriter.save_as_table` which does not need insert permission for writing tables.

## Version 1.8.0 (2023-09-14)

Version 1.8.0 of the Snowpark library introduces some new features.

### New features

* Added support for `VOLATILE` and `IMMUTABLE` keywords when registering UDFs.
* Added support for specifying clustering keys when saving dataframes using `DataFrame.save_as_table`.
* Accept `Iterable` objects input for `schema` when creating dataframes using `Session.create_dataframe`.
* Added the `DataFrame.session` property to return a `Session` object.
* Added the `Session.session_id` property to return an integer that represents the session ID.
* Added the `Session.connection` property to return a `SnowflakeConnection` object.
* Added support for creating a Snowpark session from a configuration file or environment variables.

### Dependency updates

* Updated `snowflake-connector-python` to 3.2.0.

### Bug fixes

* Fixed a bug where an automatic package upload would raise `ValueError` even when compatible package versions were added in `session.add_packages`.
* Fixed a bug where table stored procedures were not registered correctly when using `register_from_file`.
* Fixed a bug where dataframe joins failed with `invalid_identifier` error.
* Fixed a bug where `DataFrame.copy` disabled SQL simplifier for the returned copy.
* Fixed a bug where `session.sql().select()` would fail if any parameters were specified to `session.sql()`.

## Version 1.7.0 (2023-08-28)

Version 1.7.0 of the Snowpark library introduces some new features.

### Behavior changes

* When creating stored procedures, UDFs, UDTFs, and UDAFs with the parameter `is_permanent=False`, temporary objects are created
  even when `stage_name` is provided. The default value of `is_permanent` is `False`, which is why if this value is not
  explicitly set to `True` for permanent objects, users will notice a change in behavior.
* `types.StructField` now enquotes column identifier by default.

### New features

* Added parameters `external_access_integrations` and `secrets` that can be used when creating a UDF, UDTF or stored procedure from Snowpark Python to allow integration with external access.
* Added support for these new functions in `snowflake.snowpark.functions`: `array_flatten` and `flatten`.
* Added support for `apply_in_pandas` in `snowflake.snowpark.relational_grouped_dataframe`.
* Added support for replicating your local Python environment on Snowflake via `Session.replicate_local_environment`.

### Bug fixes

* Fixed a bug where `session.create_dataframe` fails to properly set nullable columns where nullability was affected by order or when data was given.
* Fixed a bug where `DataFrame.select` could not identify and alias columns when using table functions when output columns of the table function overlapped with columns in the DataFrame.

## Version 1.6.1 (2023-08-02)

### Behavior changes

* `DataFrameWriter.save_as_table` now respects nullable field of for schema provided by the user, or inferred schema based on data from user input.

### New features

* Added support for new functions in `snowflake.snowpark.functions`:

  + `array_sort`
  + `sort_array`
  + `array_min`
  + `array_max`
  + `explode_outer`
* Added support for pure Python packages specified via `Session.add_requirements` or `Session.add_packages`.
  They are now usable in stored procedures and UDFs even if packages are not present on the Snowflake Anaconda channel.
* Added the Session parameter `custom_packages_upload_enabled` and `custom_packages_force_upload_enabled` to enable
  the support for pure Python packages feature mentioned above. Both parameters default to `False`.
* Added support for specifying package requirements by passing a conda environment YAML file to `Session.add_requirements`.
* Added support for asynchronous execution of multi-query dataframes that contain binding variables.
* Added support for renaming multiple columns in `DataFrame.rename`.
* Added support for Geometry datatypes.
* Added support for params in `session.sql()` in stored procedures.
* Added support for user-defined aggregate functions (UDAFs). This feature is currently in private preview.
* Added support for vectorized user-defined table functions (vectorized UDTFs). This feature is currently in public preview.
* Added support for Snowflake Timestamp variants (i.e., `TIMESTAMP_NTZ`, `TIMESTAMP_LTZ`, `TIMESTAMP_TZ`):

  + Added TimestampTimezone as an argument in `TimestampType` constructor.
  + Added type hints: `NTZ`, `LTZ`, `TZ` and Timestamp to annotate functions when registering UDFs.

### Improvements

* Removed redundant dependency typing-extensions.
* `DataFrame.cache_result` now creates a temp table of fully-qualified names under the current database and schema.

### Bug fixes

* Fixed a bug where type check happens on pandas before it is imported.
* Fixed a bug when creating a UDF from `numpy.ufunc`.
* Fixed a bug where `DataFrame.union` was not generating the correct `Selectable.schema_query` when SQL simplifier is enabled.

### Dependency updates

* Updated `snowflake-connector-python` to version 3.0.4.

## Version 1.5.1 (2023-06-20)

### New features and updates

* Added support for the Python 3.10 runtime environment.

## Version 1.5.0 (2023-06-13)

### Behavior changes

* Aggregation results, from functions such as `DataFrame.agg` and `DataFrame.describe`, no longer strip away
  non-printing characters from column names.

### New features and updates

* Added support for the Python 3.9 runtime environment.
* Added support for new functions in `snowflake.snowpark.functions`:
* `array_generate_range`
* `array_unique_agg`
* `collect_set`
* `sequence`
* Added support for registering and calling stored procedures with the `TABLE` return type.
* Added support for parameter length in `StringType()` to specify the maximum number of characters that can be
  stored by the column.
* Added the alias `functions.element_at()` for `functions.get()`.
* Added the alias `Column.contains` for `functions.contains`.
* Added the experimental feature `DataFrame.alias`.
* Added support for querying metadata columns from stage when creating `DataFrame` using `DataFrameReader`.
* Added support for `StructType.add` to append more fields to existing `StructType` objects.
* Added support for parameter `execute_as` in `StoredProcedureRegistration.register_from_file()` to specify stored
  procedure caller rights.

### Bug fixes

* Fixed a bug where the `Dataframe.join_table_function` did not run all of the necessary queries to set up the
  join table function when SQL simplifier was enabled.
* Fixed type hint declaration for custom types: `ColumnOrName`, `ColumnOrLiteralStr`, `ColumnOrSqlExpr`,
  `LiteralType` and `ColumnOrLiteral` that were breaking `mypy` checks.
* Fixed a bug where `DataFrameWriter.save_as_table` and `DataFrame.copy_into_table` failed to parse fully qualified table names.

## Version 1.4.0 (2023-04-24)

### New features

* Added support for `session.getOrCreate`.
* Added support for alias `Column.getField`.
* Added support for new functions in `snowflake.snowpark.functions`:

  + `date_add` and `date_sub` to make add and subtract operations easier.
  + `ddaydiff`
  + `dexplode`
  + `darray_distinct`
  + `dregexp_extract`
  + `dstruct`
  + `dformat_number`
  + `dbround`
  + `dsubstring_index`
* Added parameter `skip_upload_on_content_match` when creating UDFs, UDTFs, and stored procedures
  using `register_from_file` to skip uploading files to a stage if the same version of the files are already on the stage.
* Added support for the `DataFrame.save_as_table` method to take table names that contain dots.
* Flattened generated SQL when `DataFrame.filter()` or `DataFrame.order_by()` is followed by a projection
  statement (e.g. `DataFrame.select()`, `DataFrame.with_column()`).
* Added support for creating dynamic tables (in private preview) using `Dataframe.create_or_replace_dynamic_table`.
* Added an optional argument, `params`, in `session.sql()` to support binding variables. Note that this
  argument is not supported in stored procedures yet.

### Bug fixes

* Fixed a bug in `strtok_to_array` where an exception was thrown when a delimiter was passed in.
* Fixed a bug in `session.add_import` where the module had the same namespace as other dependencies.

## Version 1.3.0 (2023-03-28)

### New features

* Added support for the delimiters parameter in `functions.initcap()`.
* Added support for `functions.hash()` to accept a variable number of input expressions.
* Added API `Session.conf` for getting, setting or checking the mutability of any runtime configuration.
* Added support for managing case sensitivity in `Row` results from `DataFrame.collect` using `case_sensitive` parameter.
* Added indexer support for `snowflake.snowpark.types.StructType`.
* Added a keyword argument `log_on_exception` to `Dataframe.collect` and `Dataframe.collect_no_wait` to optionally
  disable error logging for SQL exceptions.

### Bug fixes

* Fixed a bug where a DataFrame set operation (`DataFrame.subtract`, `DataFrame.union`, etc.) being called after
  another DataFrame set operation and `DataFrame.select` or `DataFrame.with_column` throws an exception.
* Fixed a bug where chained sort statements are overwritten by the SQL simplifier.

### Improvements

* Simplified JOIN queries to use constant subquery aliases (`SNOWPARK_LEFT`, `SNOWPARK_RIGHT`) by default. Users can
  disable this at runtime with `session.conf.set('use_constant_subquery_alias', False)` to use randomly generated
  alias names instead.
* Allowed specifying statement parameters in `session.call()`.
* Enabled the uploading of large pandas DataFrames in stored procedures by defaulting to a chunk size of 100,000 rows.

## Version 1.2.0 (2023-03-02)

### New features and updates

* Added support for displaying source code as comments in the generated scripts when registering stored procedures.
  This is enabled by default, turn off by specifying `source_code_display=False` at registration.
* Added a parameter `if_not_exists` when creating a UDF, UDTF or Stored Procedure from Snowpark Python to ignore
  creating the specified function or procedure if it already exists.
* Accept integers when calling `snowflake.snowpark.functions.get` to extract value from array.
* Added `functions.reverse` in functions to open access to Snowflake built-in function [REVERSE](../../sql-reference/functions/reverse.md).
* Added parameter `require_scoped_url` in `snowflake.snowflake.files.SnowflakeFile.open()` (in Private Preview) to
  replace `is_owner_file`, which is marked for deprecation.

### Bug fixes

* Fixed a bug that overwrote `paramstyle` to `qmark` when creating a Snowpark session.
* Fixed a bug where `df.join(..., how="cross")` fails with `SnowparkJoinException: (1112): Unsupported using join type 'Cross'`.
* Fixed a bug where querying a `DataFrame` column created from chained function calls used a wrong column name.

## Version 1.1.0 (2023-01-26)

### New features and updates

* Added `asc`, `asc_nulls_first`, `asc_nulls_last`, `desc`, `desc_nulls_first`, `desc_nulls_last`,
  `date_part`, and `unix_timestamp` in functions.
* Added the property `DataFrame.dtypes` to return a list of column name and data type pairs.
* Added the following aliases:

  + `functions.expr() for functions.sql_expr()`.
  + `functions.date_format() for functions.to_date()`.
  + `functions.monotonically_increasing_id() for functions.seq8()`.
  + `functions.from_unixtime() for functions.to_timestamp()`.

### Bug fixes

* Fixed a bug in SQL simplifier that didn’t handle Column alias and join well in some cases.
  See <https://github.com/snowflakedb/snowpark-python/issues/658> for details.
* Fixed a bug in SQL simplifier that generated wrong column names for function calls, NaN and INF.

### Improvements

* The session parameter `PYTHON_SNOWPARK_USE_SQL_SIMPLIFIER` will be `True` after Snowflake 7.3 is released.
  In snowpark-python, `session.sql_simplifier_enabled` reads the value of `PYTHON_SNOWPARK_USE_SQL_SIMPLIFIER` by
  default, meaning that the SQL simplifier is enabled by default after the Snowflake 7.3 release. To turn this off,
  set `PYTHON_SNOWPARK_USE_SQL_SIMPLIFIER` in Snowflake to False or run `session.sql_simplifier_enabled = False` from
  Snowpark. It is recommended to use the SQL simplifier because it helps to generate more concise SQL.

---
title: Snowpark Library for Python release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python-2024.md
section: Release Notes
---

# Snowpark Library for Python release notes for 2024

This article contains the release notes for the [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) updates.

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

> **Warning:**
>
> As Python 3.8 has reached its [End of Life](https://devguide.python.org/versions/), deprecation warnings will be triggered when using snowpark-python with Python 3.8. For more details, see [Snowflake Python Runtime Support](../../developer-guide/python-runtime-support-policy.md). Snowpark Python 1.24.0 will be the last client and server version to support Python 3.8, in accordance with [Anaconda’s policy](https://forum.anaconda.com/t/python-3-8-reaches-end-of-life/87265). Upgrade your existing Python 3.8 objects to Python 3.9 or greater.

## Version 1.26.0 (2024-12-05)

### New features

* Added support for property `version` and class method `get_active_session` for `Session` class.
* Added new methods and variables to enhance data type handling and JSON serialization/deserialization:

  + To `DataType`, its derived classes, and `StructField`:

    - `type_name`: Returns the type name of the data.
    - `simple_string`: Provides a simple string representation of the data.
    - `json_value`: Returns the data as a JSON-compatible value.
    - `json`: Converts the data to a JSON string.
  + To `ArrayType`, `MapType`, `StructField`, `PandasSeriesType`, `PandasDataFrameType` and `StructType`:

    - `from_json`: Enables these types to be created from JSON data.
  + To `MapType`:

    - `keyType`: keys of the map
    - `valueType`: values of the map
* Added support for method `appName` in `SessionBuilder`.
* Added support for `include_nulls` argument in `DataFrame.unpivot`.
* Added support for following functions in `functions.py`:

  + `size` to get size of array, object, or map columns.
  + `collect_list` an alias of `array_agg`.
  + `concat_ws_ignore_nulls` to concatenate strings with a separator, ignoring null values.
  + `substring` makes `len` argument optional.
* Added parameter `ast_enabled` to session for internal usage (default: `False`).

### Improvements

* Added support for specifying the following to `DataFrame.create_or_replace_dynamic_table`:

  + `iceberg_config` A dictionary that can hold the following iceberg configuration options:

    - `external_volume`
    - `catalog`
    - `base_location`
    - `catalog_sync`
    - `storage_serialization_policy`
* Added support for nested data types to `DataFrame.print_schema`
* Added support for `level` parameter to `DataFrame.print_schema`
* Improved flexibility of `DataFrameReader` and `DataFrameWriter` API by adding support for the following:

  + Added `format` method to `DataFrameReader` and `DataFrameWriter` to specify file format when loading or unloading results.
  + Added `load` method to `DataFrameReader` to work in conjunction with `format`.
  + Added `save` method to `DataFrameWriter` to work in conjunction with `format`.
  + Added support to read keyword arguments to `options` method for `DataFrameReader` and `DataFrameWriter`.
* Relaxed the cloudpickle dependency for Python 3.11 to simplify build requirements. However, for Python 3.11, `cloudpickle==2.2.1` remains the only supported version.

### Bug fixes

* Removed warnings that dynamic pivot features were in private preview, because
  dynamic pivot is now generally available.
* Fixed a bug in `session.read.options` where `False` Boolean values were incorrectly parsed as `True` in the generated file format.

### Dependency updates

* Added a runtime dependency on `python-dateutil`.

### Snowpark pandas API updates

#### New features

* Added partial support for `Series.map` when `arg` is a pandas `Series` or a
  `collections.abc.Mapping`. No support for instances of `dict` that implement
  `__missing__` but are not instances of `collections.defaultdict`.
* Added support for `DataFrame.align` and `Series.align` for `axis=1` and `axis=None`.
* Added support for `pd.json_normalize`.
* Added support for `GroupBy.pct_change` with `axis=0`, `freq=None`, and `limit=None`.
* Added support for `DataFrameGroupBy.__iter__` and `SeriesGroupBy.__iter__`.
* Added support for `np.sqrt`, `np.trunc`, `np.floor`, numpy trig functions, `np.exp`, `np.abs`, `np.positive` and `np.negative`.
* Added partial support for the dataframe interchange protocol method
  `DataFrame.__dataframe__()`.

#### Bug fixes

* Fixed a bug in `df.loc` where setting a single column from a series results in unexpected `None` values.

#### Improvements

* Use UNPIVOT INCLUDE NULLS for unpivot operations in pandas instead of sentinel values.
* Improved documentation for `pd.read_excel`.

## Version 1.25.0 (2024-11-13)

### New features

* Added the following new functions in `snowflake.snowpark.dataframe`:

  > + `map`

### Improvements

* When target stage is not set in profiler, a default stage from `Session.get_session_stage` is used instead of raising `SnowparkSQLException`.
* Allowed lower case or mixed case input when calling `Session.stored_procedure_profiler.set_active_profiler`.
* Added distributed tracing using open telemetry APIs for action function in `DataFrame`:

  > + `cache_result`
* Removed opentelemetry warning from logging.

### Bug fixes

* Fixed the pre-action and post-action query propagation when `In` expressions were used in selects.
* Fixed a bug that raised error `AttributeError` while calling `Session.stored_procedure_profiler.get_output` when `Session.stored_procedure_profiler` is disabled.

### Dependency updates

* Added a dependency on `protobuf>=5.28` and `tzlocal` at runtime.
* Added a dependency on `protoc-wheel-0` for the development profile.
* Require `snowflake-connector-python>=3.12.0, <4.0.0` (was `>=3.10.0`).

### Snowpark pandas API updates

#### New features

* Added support for `Index.to_numpy`.
* Added support for `DataFrame.align` and `Series.align` for `axis=0`.
* Added support for `snowflake.snowpark.functions.window`
* Added support for `pd.read_pickle` (Uses native pandas for processing).
* Added support for `pd.read_html` (Uses native pandas for processing).
* Added support for `pd.read_xml` (Uses native pandas for processing).
* Added support for aggregation functions `"size"` and `len` in `GroupBy.aggregate`, `DataFrame.aggregate`, and `Series.aggregate`.
* Added support for list values in `Series.str.len`.

#### Bug fixes

* Fixed a bug where aggregating a single-column dataframe with a single callable function (e.g. `pd.DataFrame([0]).agg(np.mean)`) would fail to transpose the result.
* Fixed bugs where `DataFrame.dropna()` would:

  + Treat an empty `subset` (e.g. `[]`) as if it specified all columns instead of no columns.
  + Raise a `TypeError` for a scalar `subset` instead of filtering on just that column.
  + Raise a `ValueError` for a `subset` of type `pandas.Index` instead of filtering on the columns in the index.
* Creation of scoped read-only tables to mitigate `TableNotFoundError` when using dynamic pivot in a notebook environment.
* Fixed a bug when concat dataframe or series objects are coming from the same dataframe when axis = 1.

#### Improvements

* Improve `np.where` with scalar x value by eliminating unnecessary join and temp table creation.
* Improve `get_dummies` performance by flattening the pivot with join.

### Snowpark local testing updates

#### New features

* Added support for patching functions that are unavailable in the `snowflake.snowpark.functions` module.
* Added support for `snowflake.snowpark.functions.any_value`

#### Bug fixes

* Fixed a bug where `Table.update` could not handle `VariantType`, `MapType`, and `ArrayType` data types.
* Fixed a bug where column aliases were incorrectly resolved in `DataFrame.join`, causing errors when selecting columns from a joined DataFrame.
* Fixed a bug where `Table.update` and `Table.merge` could fail if the target table’s index was not the default `RangeIndex`.

## Version 1.24.0 (2024-10-28)

### New features

* Updated `Session` class to be thread-safe. This allows concurrent DataFrame transformations, DataFrame actions, UDF and stored procedure registration, and concurrent file uploads when using the same `Session` object.

  + The feature is disabled by default and can be enabled by setting `FEATURE_THREAD_SAFE_PYTHON_SESSION` to `True` for account.
  + Updating session configurations, like changing database or schema, when multiple threads are using the session may lead to unexpected behavior.
  + When enabled, some internally created temporary table names returned from `DataFrame.queries` API are not deterministic, and may be different when DataFrame actions are executed. This does not affect explicit user-created temporary tables.
* Added support for ‘Service’ domain to `session.lineage.trace` API.
* Added support for the following methods in `DataFrameWriter` to support daisy-chaining:

  + `option`
  + `options`
  + `partition_by`
* Added support for `snowflake_cortex_summarize`.

### Improvements

* Improved the following new capability for function `snowflake.snowpark.functions.array_remove` so that it is now possible to use in python.
* Disables SQL simplification when sort is performed after limit.

  + Previously, `df.sort().limit()` and `df.limit().sort()` generated the same query with sort in front of limit. Now, `df.limit().sort()` generates a query that reads `df.limit().sort()`.
  + Improve performance of generated queries for `df.limit().sort()` because limit stops table scanning as soon as the number of records is satisfied.

### Bug fixes

* Fixed a bug where the automatic cleanup of temporary tables could interfere with the results of async query execution.
* Fixed a bug in `DataFrame.analytics.time_series_agg` function to handle multiple data points in same sliding interval.
* Fixed a bug that created inconsistent casing in field names of structured objects in iceberg schemas.

### Deprecations

* As Python 3.8 has reached its [End of Life](https://devguide.python.org/versions/), deprecation warnings will be triggered when using snowpark-python with Python 3.8. For more details, see [Snowflake Python Runtime Support](../../developer-guide/python-runtime-support-policy.md).
* Snowpark 1.24.0 is the last client and server version to support Python 3.8, in accordance with [Anaconda’s policy](https://www.anaconda.com/blog/python-3-8-reaches-end-of-life). Upgrade your existing Python 3.8 objects to Python 3.9 or greater.

### Snowpark pandas API updates

#### New features

* Added support for `np.subtract`, `np.multiply`, `np.divide`, and `np.true_divide`.
* Added support for tracking usages of `__array_ufunc__`.
* Added numpy compatibility support for `np.float_power`, `np.mod`, `np.remainder`, `np.greater`, `np.greater_equal`, `np.less`, `np.less_equal`, `np.not_equal`, and `np.equal`.
* Added numpy compatibility support for `np.log`, `np.log2`, and `np.log10`
* Added support for `DataFrameGroupBy.bfill`, `SeriesGroupBy.bfill`, `DataFrameGroupBy.ffill`, and `SeriesGroupBy.ffill`.
* Added support for `on` parameter with `Resampler`.
* Added support for timedelta inputs in `value_counts()`.
* Added support for applying Snowpark Python function `snowflake_cortex_summarize`.
* Added support for `DataFrame.attrs` and `Series.attrs`.
* Added support for `DataFrame.style`.
* Added numpy compatibility support for `np.full_like`

#### Improvements

* Improved generated SQL query for `head` and `iloc` when the row key is a slice.
* Improved error message when passing an unknown timezone to `tz_convert` and `tz_localize` in `Series`, `DataFrame`, `Series.dt`, and `DatetimeIndex`.
* Improved documentation for `tz_convert` and `tz_localize` in `Series`, `DataFrame`, `Series.dt`, and `DatetimeIndex` to specify the supported timezone formats.
* Added additional kwargs support for `df.apply` and `series.apply` ( as well as `map` and `applymap` ) when using snowpark functions. This allows for some position independent compatibility between apply and functions where the first argument is not a pandas object.
* Improved generated SQL query for `iloc` and `iat` when the row key is a scalar.
* Removed all joins in `iterrows`.
* Improved documentation for `Series.map` to reflect the unsupported features.
* Added support for `np.may_share_memory` which is used internally by many scikit-learn functions. This method will always return false when called with a Snowpark pandas object.

#### Bug Fixes

* Fixed a bug where `DataFrame` and `Series` `pct_change()` would raise `TypeError` when input contained timedelta columns.
* Fixed a bug where `replace()` would sometimes propagate `Timedelta` types incorrectly through `replace()`. Instead raise `NotImplementedError` for `replace()` on `Timedelta`.
* Fixed a bug where `DataFrame` and `Series` `round()` would raise `AssertionError` for `Timedelta` columns. Instead raise `NotImplementedError` for `round()` on `Timedelta`.
* Fixed a bug where `reindex` fails when the new index is a Series with non-overlapping types from the original index.
* Fixed a bug where calling `__getitem__` on a DataFrameGroupBy object always returned a DataFrameGroupBy object if `as_index=False`.
* Fixed a bug where inserting timedelta values into an existing column would silently convert the values to integers instead of raising `NotImplementedError`.
* Fixed a bug where `DataFrame.shift()` on axis=0 and axis=1 would fail to propagate timedelta types.
* `DataFrame.abs()`, `DataFrame.__neg__()`, `DataFrame.stack()`, and `DataFrame.unstack()` now raise `NotImplementedError` for timedelta inputs instead of failing to propagate timedelta types.

### Snowpark local testing updates

#### Bug fixes

* Fixed a bug where `DataFrame.alias` raises `KeyError` for input column name.
* Fixed a bug where `to_csv` on Snowflake stage fails when data contains empty strings.

## Version 1.23.0 (2024-10-09)

### New features

* Added the following new functions in `snowflake.snowpark.functions`:

  + `make_interval`
* Added support for using Snowflake Interval constants with `Window.range_between()` when the order by column is TIMESTAMP or DATE type.
* Added support for file writes. This feature is currently in private preview.
* Added `thread_id` to `QueryRecord` to track the thread id submitting the query history.
* Added support for `Session.stored_procedure_profiler`.

### Bug fixes

* Fixed a bug where registering a stored procedure or UDxF with type hints would give a warning `NoneType` has no `len()` when trying to read default values from function.

### Snowpark pandas API updates

#### New features

* Added support for `TimedeltaIndex.mean` method.
* Added support for some cases of aggregating `Timedelta` columns on `axis=0` with `agg` or `aggregate`.
* Added support for `by`, `left_by`, `right_by`, `left_index`, and `right_index` for `pd.merge_asof`.
* Added support for passing parameter `include_describe` to `Session.query_history`.
* Added support for `DatetimeIndex.mean` and `DatetimeIndex.std` methods.
* Added support for `Resampler.asfreq`, `Resampler.indices`, `Resampler.nunique`, and `Resampler.quantile`.
* Added support for `resample` frequency `W`, `ME`, `YE` with `closed = "left"`.
* Added support for `DataFrame.rolling.corr` and `Series.rolling.corr` for `pairwise = False` and int `window`.
* Added support for string time-based `window` and `min_periods = None` for `Rolling`.
* Added support for `DataFrameGroupBy.fillna` and `SeriesGroupBy.fillna`.
* Added support for constructing `Series` and `DataFrame` objects with the lazy `Index` object as `data`, `index`, and `columns` arguments.
* Added support for constructing `Series` and `DataFrame` objects with `index` and `column` values not present in `DataFrame`/`Series` `data`.
* Added support for `pd.read_sas` (Uses native pandas for processing).
* Added support for applying `rolling().count()` and `expanding().count()` to `Timedelta` series and columns.
* Added support for `tz` in both `pd.date_range` and `pd.bdate_range`.
* Added support for `Series.items`.
* Added support for `errors="ignore"` in `pd.to_datetime`.
* Added support for `DataFrame.tz_localize` and `Series.tz_localize`.
* Added support for `DataFrame.tz_convert` and `Series.tz_convert`.
* Added support for applying Snowpark Python functions (e.g., `sin`) in `Series.map`, `Series.apply`, `DataFrame.apply` and `DataFrame.applymap`.

#### Improvements

* Improved `to_pandas` to persist the original timezone offset for TIMESTAMP_TZ type.
* Improved `dtype` results for TIMESTAMP_TZ type to show correct timezone offset.
* Improved `dtype` results for TIMESTAMP_LTZ type to show correct timezone.
* Improved error message when passing non-bool value to `numeric_only` for groupby aggregations.
* Removed unnecessary warning about sort algorithm in `sort_values`.
* Use SCOPED object for internal create temp tables. The SCOPED objects will be stored sproc scoped if created within stored sproc, otherwise will be session scoped, and the object will be automatically cleaned at the end of the scope.
* Improved warning messages for operations that lead to materialization with inadvertent slowness.
* Removed unnecessary warning message about `convert_dtype` in `Series.apply`.

#### Bug fixes

* Fixed a bug where an `Index` object created from a `Series`/`DataFrame` incorrectly updates the `Series`/`DataFrame`’s index name after an inplace update has been applied to the original `Series`/`DataFrame`.
* Suppressed an unhelpful `SettingWithCopyWarning` that sometimes appeared when printing `Timedelta` columns.
* Fixed `inplace` argument for `Series` objects derived from other `Series` objects.
* Fixed a bug where `Series.sort_values` failed if series name overlapped with index column name.
* Fixed a bug where transposing a dataframe would map `Timedelta` index levels to integer column levels.
* Fixed a bug where `Resampler` methods on timedelta columns would produce integer results.
* Fixed a bug where `pd.to_numeric()` would leave `Timedelta` inputs as `Timedelta` instead of converting them to integers.
* Fixed `loc` set when setting a single row, or multiple rows, of a DataFrame with a Series value.

## Version 1.22.1 (2024-09-11)

* This is a re-release of 1.22.0. Please refer to the 1.22.0 release notes for detailed release content.

## Version 1.22.0 (2024-09-10)

### New features

* Added the following new functions in `snowflake.snowpark.functions`:

  + `array_remove`
  + `ln`

### Improvements

* Improved documentation for `Session.write_pandas` by making the `use_logical_type` option more explicit.
* Added support for specifying the following to `DataFrameWriter.save_as_table`:

  + `enable_schema_evolution`
  + `data_retention_time`
  + `max_data_extension_time`
  + `change_tracking`
  + `copy_grants`
  + `iceberg_config` - A dictionary that can hold the following iceberg configuration options:

    > - `external_volume`
    > - `catalog`
    > - `base_location`
    > - `catalog_sync`
    > - `storage_serialization_policy`
* Added support for specifying the following to `DataFrameWriter.copy_into_table`:

  + `iceberg_config` - A dictionary that can hold the following iceberg configuration options:

    > - `external_volume`
    > - `catalog`
    > - `base_location`
    > - `catalog_sync`
    > - `storage_serialization_policy`
* Added support for specifying the following parameters to `DataFrame.create_or_replace_dynamic_table`:

  + `mode`
  + `refresh_mode`
  + `initialize`
  + `clustering_keys`
  + `is_transient`
  + `data_retention_time`
  + `max_data_extension_time`

### Bug fixes

* Fixed a bug in `session.read.csv` that caused an error when setting `PARSE_HEADER = True` in an externally defined file format.
* Fixed a bug in query generation from set operations that allowed generation of duplicate queries when children have common subqueries.
* Fixed a bug in `session.get_session_stage` that referenced a non-existing stage after switching database or schema.
* Fixed a bug where calling `DataFrame.to_snowpark_pandas` without explicitly initializing the Snowpark pandas plugin caused an error.
* Fixed a bug where using the `explode` function in dynamic table creation caused a SQL compilation error due to improper boolean type
  casting on the `outer` parameter.

### Snowpark local testing updates

#### New features

* Added support for type coercion when passing columns as input to UDF calls.
* Added support for `Index.identical`.

#### Bug fixes

* Fixed a bug where the truncate mode in `DataFrameWriter.save_as_table` incorrectly handled DataFrames containing only a subset of
  columns from the existing table.
* Fixed a bug where function `to_timestamp` does not set the default timezone of the column datatype.

### Snowpark pandas API updates

#### New features

* Added limited support for the `Timedelta` type, including the following features. Snowpark pandas will raise `NotImplementedError`
  for unsupported `Timedelta` use cases.

  + support for tracking the `Timedelta` type through `copy`, `cache_result`, `shift`, `sort_index`, `assign`,
    `bfill`, `ffill`, `fillna`, `compare`, `diff`, `drop`, `dropna`, `duplicated`,
    `empty`, `equals`, `insert`, `isin`, `isna`, `items`, `iterrows`, `join`, `len`,
    `mask`, `melt`, `merge`, `nlargest`, `nsmallest`, `to_pandas`.
  + support for converting non-timedelta to timedelta via `astype`.
  + `NotImplementedError` will be raised for the rest of methods that do not support `Timedelta`.
  + support for subtracting two timestamps to get a `Timedelta`.
  + support indexing with `Timedelta` data columns.
  + support for adding or subtracting timestamps and `Timedelta`.
  + support for binary arithmetic between two `Timedelta` values.
  + support for binary arithmetic and comparisons between `Timedelta` values and numeric values.
  + support for lazy `TimedeltaIndex`.
  + support for `pd.to_timedelta`.
  + support for `GroupBy` aggregations `min`, `max`, `mean`, `idxmax`, `idxmin`, `std`, `sum`,
    `median`, `count`, `any`, `all`, `size`, `nunique`, `head`, `tail`, `aggregate`.
  + support for `GroupBy` filtrations `first` and `last`.
  + support for `TimedeltaIndex` attributes: `days`, `seconds`, `microseconds` and `nanoseconds`.
  + support for `diff` with timestamp columns on `axis=0` and `axis=1`.
  + support for `TimedeltaIndex` methods: `ceil`, `floor` and `round`.
  + support for `TimedeltaIndex.total_seconds` method.
* Added support for index’s arithmetic and comparison operators.
* Added support for `Series.dt.round`.
* Added documentation pages for `DatetimeIndex`.
* Added support for `Index.name`, `Index.names`, `Index.rename`, and `Index.set_names`.
* Added support for `Index.__repr__`.
* Added support for `DatetimeIndex.month_name` and `DatetimeIndex.day_name`.
* Added support for `Series.dt.weekday`, `Series.dt.time`, and `DatetimeIndex.time`.
* Added support for `Index.min` and `Index.max`.
* Added support for `pd.merge_asof`.
* Added support for `Series.dt.normalize` and `DatetimeIndex.normalize`.
* Added support for `Index.is_boolean`, `Index.is_integer`, `Index.is_floating`, `Index.is_numeric`, and `Index.is_object`.
* Added support for `DatetimeIndex.round`, `DatetimeIndex.floor` and `DatetimeIndex.ceil`.
* Added support for `Series.dt.days_in_month` and `Series.dt.daysinmonth`.
* Added support for `DataFrameGroupBy.value_counts` and `SeriesGroupBy.value_counts`.
* Added support for `Series.is_monotonic_increasing` and `Series.is_monotonic_decreasing`.
* Added support for `Index.is_monotonic_increasing` and `Index.is_monotonic_decreasing`.
* Added support for `pd.crosstab`.
* Added support for `pd.bdate_range` and included business frequency support (B, BME, BMS, BQE, BQS, BYE, BYS) for both `pd.date_range` and `pd.bdate_range`.
* Added support for lazy `Index` objects as `labels` in `DataFrame.reindex` and `Series.reindex`.
* Added support for `Series.dt.days`, `Series.dt.seconds`, `Series.dt.microseconds`, and `Series.dt.nanoseconds`.
* Added support for creating a `DatetimeIndex` from an `Index` of numeric or string type.
* Added support for string indexing with `Timedelta` objects.
* Added support for `Series.dt.total_seconds` method.

#### Improvements

* Improved concat and join performance when operations are performed on a series coming from the same DataFrame by avoiding unnecessary joins.
* Refactored `quoted_identifier_to_snowflake_type` to avoid making metadata queries if the types have been cached locally.
* Improved `pd.to_datetime` to handle all local input cases.
* Create a lazy index from another lazy index without pulling data to client.
* Raised `NotImplementedError` for Index bitwise operators.
* Display a more clear error message when `Index.names` is set to a non-list-like object.
* Raise a warning whenever `MultiIndex` values are pulled in locally.
* Improved warning message for `pd.read_snowflake` to include the creation reason when temp table creation is triggered.
* Improved performance for `DataFrame.set_index`, or setting `DataFrame.index` or `Series.index` by avoiding checks that
  require eager evaluation. As a consequence, when the new index that does not match the current `Series` or `DataFrame` object
  length, a `ValueError` is no longer raised. Instead, when the `Series` or `DataFrame` object is longer than the provided
  index, the new index of the `Series` or `DataFrame` is filled with `NaN` values for the “extra” elements. Otherwise,
  the extra values in the provided index are ignored.

#### Bug fixes

* Stopped ignoring nanoseconds in `pd.Timedelta` scalars.
* Fixed `AssertionError` in tree of binary operations.
* Fixed bug in `Series.dt.isocalendar` using a named Series
* Fixed `inplace` argument for Series objects derived from DataFrame columns.
* Fixed a bug where `Series.reindex` and `DataFrame.reindex` did not update the result index’s name correctly.
* Fixed a bug where `Series.take` did not give an error when `axis=1` was specified.

## Version 1.21.1 (2024-09-05)

### Bug fixes

* Fixed a bug where using `to_pandas_batches` with async jobs caused an error due to improper
  handling of waiting for asynchronous query completion.

## Version 1.21.0 (2024-08-19)

### New features

* Added support for `snowflake.snowpark.testing.assert_dataframe_equal`, which is a utility function to check the equality of two Snowpark DataFrames.

### Improvements

* Added support for server-side string size limitations.
* Added support for creating and invoking stored procedures, UDFs and UDTFs with optional arguments.
* Added support for column lineage in the `DataFrame.lineage.trace` API.
* Added support for passing `INFER_SCHEMA` options to `DataFrameReader` via `INFER_SCHEMA_OPTIONS`.
* Added support for passing `parameters` parameter to `Column.rlike` and `Column.regexp`.
* Added support for automatically cleaning up temporary tables created by `df.cache_result()` in the current session
  when the DataFrame is no longer referenced (i.e., gets garbage collected). It is still an experimental feature and not enabled by default.
  It can be enabled by setting `session.auto_clean_up_temp_table_enabled` to `True`.
* Added support for string literals to the `fmt` parameter of `snowflake.snowpark.functions.to_date`.

### Bug fixes

* Fixed a bug where the SQL generated for selecting `*` column has an incorrect subquery.
* Fixed a bug in `DataFrame.to_pandas_batches` where the iterator could throw an error if a certain transformation
  is made to the pandas DataFrame due to the wrong isolation level.
* Fixed a bug in `DataFrame.lineage.trace` to split the quoted feature view’s name and version correctly.
* Fixed a bug in `Column.isin` that caused invalid SQL generation when passed an empty list.
* Fixed a bug that fails to raise `NotImplementedError` while setting a cell with a list-like item.

### Snowpark local testing updates

#### New features

* Added support for the following APIs:

  + `snowflake.snowpark.functions`

    - `rank`
    - `dense_rank`
    - `percent_rank`
    - `cume_dist`
    - `ntile`
    - `datediff`
    - `array_agg`
  + `snowflake.snowpark.column.Column.within_group`
* Added support for parsing flags in Regex statements for mocked plans. This maintains parity with the `rlike` and `regexp` changes above.

#### Bug fixes

* Fixed a bug where the window functions `LEAD` and `LAG` do not handle the option `ignore_nulls` properly.
* Fixed a bug where values were not populated into the result DataFrame during the insertion of a table merge operation.

#### Improvements

* Fix pandas `FutureWarning` about integer indexing.

### Snowpark pandas API updates

#### New features

* Added support for `DataFrame.backfill`, `DataFrame.bfill`, `Series.backfill`, and `Series.bfill`.
* Added support for `DataFrame.compare` and `Series.compare` with default parameters.
* Added support for `Series.dt.microsecond` and `Series.dt.nanosecond`.
* Added support for `Index.is_unique` and `Index.has_duplicates`.
* Added support for `Index.equals`.
* Added support for `Index.value_counts`.
* Added support for `Series.dt.day_name` and `Series.dt.month_name`.
* Added support for indexing on Index, e.g., `df.index[:10]`.
* Added support for `DataFrame.unstack` and `Series.unstack`.
* Added support for `DataFrame.asfreq` and `Series.asfreq`.
* Added support for `Series.dt.is_month_start` and `Series.dt.is_month_end`.
* Added support for `Index.all` and `Index.any`.
* Added support for `Series.dt.is_year_start` and `Series.dt.is_year_end`.
* Added support for `Series.dt.is_quarter_start` and `Series.dt.is_quarter_end`.
* Added support for lazy `DatetimeIndex`.
* Added support for `Series.argmax` and `Series.argmin`.
* Added support for `Series.dt.is_leap_year`.
* Added support for `DataFrame.items`.
* Added support for `Series.dt.floor` and `Series.dt.ceil`.
* Added support for `Index.reindex`.
* Added support for `DatetimeIndex` properties: `year`, `month`, `day`, `hour`, `minute`, `second`, `microsecond`,
  `nanosecond`, `date`, `dayofyear`, `day_of_year`, `dayofweek`, `day_of_week`, `weekday`, `quarter`,
  `is_month_start`, `is_month_end`, `is_quarter_start`, `is_quarter_end`, `is_year_start`, `is_year_end`
  and `is_leap_year`.
* Added support for `Resampler.fillna` and `Resampler.bfill`.
* Added limited support for the `Timedelta` type, including creating `Timedelta` columns and `to_pandas`.
* Added support for `Index.argmax` and `Index.argmin`.

#### Improvements

* Removed the public preview warning message when importing Snowpark pandas.
* Removed unnecessary count query from the `SnowflakeQueryCompiler.is_series_like` method.
* `Dataframe.columns` now returns a native pandas Index object instead of a Snowpark Index object.
* Refactored and introduced `query_compiler` argument in `Index` constructor to create `Index` from query compiler.
* `pd.to_datetime` now returns a `DatetimeIndex` object instead of a `Series` object.
* `pd.date_range` now returns a `DatetimeIndex` object instead of a `Series` object.

#### Bug fixes

* Made passing an unsupported aggregation function to `pivot_table` raise `NotImplementedError` instead of `KeyError`.
* Removed axis labels and callable names from error messages and telemetry about unsupported aggregations.
* Fixed `AssertionError` in `Series.drop_duplicates` and `DataFrame.drop_duplicates` when called after `sort_values`.
* Fixed a bug in `Index.to_frame` where the result frame’s column name may be wrong where name is unspecified.
* Fixed a bug where some Index docstrings are ignored.
* Fixed a bug in `Series.reset_index(drop=True)` where the result name may be wrong.
* Fixed a bug in `Groupby.first/last` ordering by the correct columns in the underlying window expression.

## Version 1.20.0 (2024-07-17)

Version 1.20.0 of the Snowpark Library for Python introduces some new features.

### New features

* Added distributed tracing using open telemetry APIs for table stored procedure functions in `DataFrame`:

  + `_execute_and_get_query_id`
* Added support for the `arrays_zip` function.
* Improved performance for binary column expressions and `df._in` by avoiding unnecessary casts for numeric values. You can enable this optimization by setting `session.eliminate_numeric_sql_value_cast_enabled = True`.
* Improved error messages for `write_pandas` when the target table does not exist and `auto_create_table=False`.
* Added open telemetry tracing on UDxF functions in Snowpark.
* Added open telemetry tracing on stored procedure registration in Snowpark.
* Added a new optional parameter called `format_json` to the `Session.SessionBuilder.app_name` function that sets the app name in the `Session.query_tag` in JSON format. By default, this parameter is set to `False`.

### Bug fixes

* Fixed a bug where the SQL generated for `lag(x, 0)` was incorrect and failed with the error message `argument 1 to function LAG needs to be constant, found 'SYSTEM$NULL_TO_FIXED(null)'`.

### Snowpark local testing updates

#### New features

* Added support for the following APIs:

  + `snowflake.snowpark.functions`

    - `random`
* Added new parameters to the `patch` function when registering a mocked function:

  + `distinct` allows an alternate function to be specified for when a SQL function should be distinct.
  + `pass_column_index` passes a named parameter, `column_index`, to the mocked function that contains the `pandas.Index` for the input data.
  + `pass_row_index` passes a named parameter, `row_index`, to the mocked function that is the 0-indexed row number on which the function is currently operating.
  + `pass_input_data` passes a named parameter, `input_data`, to the mocked function that contains the entire input dataframe for the current expression.
  + Added support for the `column_order` parameter in the `DataFrameWriter.save_as_table` method.

#### Bug fixes

* Fixed a bug that caused `DecimalType` columns to be incorrectly truncated to integer precision when used in `BinaryExpressions`.

### Snowpark pandas API Updates

#### New features

* Added new API support for the following:

  + DataFrames

    - `DataFrame.nlargest` and `DataFrame.nsmallest`
    - `DataFrame.assign`
    - `DataFrame.stack`
    - `DataFrame.pivot`
    - `DataFrame.to_csv`
    - `DataFrame.corr`
    - `DataFrame.corr`
    - `DataFrame.equals`
    - `DataFrame.reindex`
    - `DataFrame.at` and `DataFrame.iat`
  + Series

    - `Series.nlargest` and `Series.nsmallest`
    - `Series.at` and `Series.iat`
    - `Series.dt.isocalendar`
    - `Series.equals`
    - `Series.reindex`
    - `Series.to_csv`
    - `Series.case_when` except when condition or replacement is callable
    - `series.plot()` with data materialized the data to the local client
  + GroupBy

    - `DataFrameGroupBy.all` and `DataFrameGroupBy.any`
    - `DataFrameGroupBy` and `SeriesGroupBy` aggregations `first` and `last`
    - `DataFrameGroupBy.get_group`
    - `SeriesGroupBy.all` and `SeriesGroupBy.any`
  + General

    - `pd.pivot`
    - `read_excel` (Uses local pandas for processing)
    - `df.plot()` with data materialized the data to the local client
* Extended existing APIs as follows:

  + Added support for `replace` and `frac > 1` in `DataFrame.sample` and `Series.sample`.
  + Added partial support for `Series.str.translate` where the values in the `table` are single-codepoint strings.
  + Added support for `limit` parameter when `method` parameter is used in `fillna`.
* Added documentation pages for `Index` and its APIs.

#### Bug fixes

* Fixed an issue when using np.where and df.where when the scalar `other` is the literal 0.
* Fixed a bug regarding precision loss when converting to Snowpark pandas `DataFrame` or `Series` with `dtype=np.uint64`.
* Fixed a bug where `values` is set to `index` when `index` and `columns` contain all columns in DataFrame during `pivot_table`.

#### Improvements

* Added support for `Index.copy()`.
* Added support for Index APIs: `dtype`, `values`, `item()`, `tolist()`, `to_series()` and `to_frame()`.
* Expand support for DataFrames with no rows in `pd.pivot_table` and `DataFrame.pivot_table`.
* Added support for `inplace` parameter in `DataFrame.sort_index` and `Series.sort_index`.

## Version 1.19.0 (2024-06-25)

Version 1.19.0 of the Snowpark Library for Python introduces some new features.

### New features

* Added support for the `to_boolean` function.
* Added documentation pages for `Index` and its APIs.

### Bug fixes

* Fixed a bug where Python stored procedures with tables return type fails when run in a task.
* Fixed a bug where `df.dropna` fails due to `RecursionError: maximum recursion depth exceeded` when the DataFrame has more than 500 columns.
* Fixed a bug where `AsyncJob.result("no_result")` doesn’t wait for the query to finish execution.

### Local testing updates

#### New features

* Added support for the `strict` parameter when registering UDFs and Stored Procedures.

#### Bug fixes

* Fixed a bug in `convert_timezone` that made setting the `source_timezone` parameter return an error.
* Fixed a bug where creating a DataFrame with empty data of type `DateType` raises `AttributeError`.
* Fixed a bug where table merge fails when an update clause exists but no update takes place.
* Fixed a bug in the mock implementation of `to_char` that raises `IndexError` when an incoming column has a nonconsecutive row index.
* Fixed a bug in handling `CaseExpr` expressions that raises `IndexError` when an incoming column has a nonconsecutive row index.
* Fixed a bug in the implementation of `Column.like` that raises `IndexError` when an incoming column has a nonconsecutive row index.

#### Improvements

* Added support for type coercion in the implementation of `DataFrame.replace`, `DataFrame.dropna`, and the mock function `iff`.

### Snowpark pandas API updates

#### New features

* Added partial support for `DataFrame.pct_change` and `Series.pct_change` without the `freq` and `limit` parameters.
* Added support for `Series.str.get`.
* Added support for `Series.dt.dayofweek`, `Series.dt.day_of_week`, `Series.dt.dayofyear`, and `Series.dt.day_of_year`.
* Added support for `Series.str.__getitem__ (Series.str[...])`.
* Added support for `Series.str.lstrip` and `Series.str.rstrip`.
* Added support for `DataFrameGroupby.size` and `SeriesGroupby.size`.
* Added support for `DataFrame.expanding` and `Series.expanding` for aggregations `count`, `sum`, `min`, `max`, `mean`, `std`, and `var` with `axis=0`.
* Added support for `DataFrame.rolling` and `Series.rolling` for aggregation count with `axis=0`.
* Added support for `Series.str.match`.
* Added support for `DataFrame.resample` and `Series.resample` for aggregation size.

#### Bug fixes

* Fixed a bug that causes output of `GroupBy.aggregate` columns to be ordered incorrectly.
* Fixed a bug where calling `DataFrame.describe` on a frame with duplicate columns of differing `dtypes` could cause an error or incorrect results.
* Fixed a bug in `DataFrame.rolling` and `Series.rolling` so `window=0` now throws `NotImplementedError` instead of `ValueError`

#### Improvements

* Added support for named aggregations in `DataFrame.aggregate` and `Series.aggregate` with `axis=0`.
* `pd.read_csv` reads using the native pandas CSV parser, then uploads data to Snowflake using parquet. This enables most of the parameters supported by `read_csv`, including date parsing and numeric conversions. Uploading via parquet is roughly twice as fast as uploading via CSV.
* Initial work to support a `pd.Index` directly in Snowpark pandas. Support for `pd.Index` as a first-class component of Snowpark pandas is under active development.
* Added a lazy index constructor and support for `len`, `shape`, `size`, `empty`, `to_pandas()`, and `names`. For `df.index`, Snowpark pandas creates a lazy index object.
* For `df.columns`, Snowpark pandas supports a non-lazy version of an `Index` as the data is already stored locally.

## Version 1.18.0 (2024-05-28)

Version 1.18.0 of the Snowpark library introduces some new features.

### New features

* Added the `DataFrame.cache_result` and `Series.cache_result` methods for users to persist `DataFrame` and `Series`
  objects to a temporary table for the duration of a session to improve latency of subsequent operations.

### Improvements

* Added support for `DataFrame.pivot_table` with no `index` parameter and with the `margins` parameter.
* Updated the signature of `DataFrame.shift`, `Series.shift`, `DataFrameGroupBy.shift`, and `SeriesGroupBy.shift` to
  match pandas 2.2.1. Snowpark pandas does not yet support the newly-added suffix argument or sequence values of periods.
* Re-added support for `Series.str.split`.

### Bug fixes

* Fixed an issue with mixed columns for string methods (`Series.str.*`).

### Local testing updates

#### New features

* Added support for the following `DataFrameReader` read options to file formats CSV and JSON:

  + PURGE
  + PATTERN
  + INFER_SCHEMA with value `False`
  + ENCODING with value `UTF8`
* Added support for `DataFrame.analytics.moving_agg` and `DataFrame.analytics.cumulative_agg_agg`.
* Added support for the `if_not_exists` parameter during UDF and stored procedure registration.

#### Bug fixes

* Fixed a bug with processing time formats where the fractional second part was not handled properly.
* Fixed a bug that caused function calls on `*` to fail.
* Fixed a bug that prevented the creation of `map` and `struct` type objects.
* Fixed a bug where the function `date_add` was unable to handle some numeric types.
* Fixed a bug where `TimestampType` casting resulted in incorrect data.
* Fixed a bug that caused `DecimalType` data to have incorrect precision in some cases.
* Fixed a bug where referencing a missing table or view raised an `IndexError`.
* Fixed a bug where the mocked function `to_timestamp_ntz` could not handle `None` data.
* Fixed a bug where mocked UDFs handled output data of `None` improperly.
* Fixed a bug where `DataFrame.with_column_renamed` ignored attributes from parent `DataFrames` after join operations.
* Fixed a bug where the integer precision of large values was lost when converted to a pandas `DataFrame`.
* Fixed a bug where the schema of a `datetime` object was wrong when creating a `DataFrame` from a pandas `DataFrame`.
* Fixed a bug in the implementation of `Column.equal_nan` where null data was handled incorrectly.
* Fixed a bug where `DataFrame.drop` ignored attributes from parent `DataFrames` after join operations.
* Fixed a bug in mocked function `date_part` where column type was set incorrectly.
* Fixed a bug where `DataFrameWriter.save_as_table` did not raise exceptions when inserting null data into non-nullable columns.
* Fixed a bug in the implementation of `DataFrameWriter.save_as_table` where:

  + Append or truncate failed when incoming data had a different schema than the existing table.
  + Truncate failed when incoming data did not specify columns that are nullable.

#### Improvements

* Removed the dependency check for `pyarrow` because it is not used.
* Improved the target type coverage of `Column.cast`, adding support for casting to boolean and all integral types.
* Aligned the error experience when calling UDFs and stored procedures.
* Added appropriate error messages for the `is_permanent` and `anonymous` options in UDFs and stored procedures registration to
  make it clearer that those features are not yet supported.
* File read operations with unsupported options and values now raise `NotImplementedError` instead of warnings and unclear error
  information.

## Version 1.17.0 (2024-05-21)

Version 1.17.0 of the Snowpark library introduces some new features.

### New features

* Added support to add a comment on tables and views using the functions listed below:

  + `DataFrameWriter.save_as_table`
  + `DataFrame.create_or_replace_view`
  + `DataFrame.create_or_replace_temp_view`
  + `DataFrame.create_or_replace_dynamic_table`

### Improvements

* Improved error message to remind users to set `{"infer_schema": True}` when reading CSV file without specifying its schema.

### Local testing updates

#### New features

* Added support for `NumericType` and `VariantType` data conversion in the mocked function `to_timestamp_ltz`, `to_timestamp_ntz`, `to_timestamp_tz` and `to_timestamp`.
* Added support for `DecimalType`, `BinaryType`, `ArrayType`, `MapType`, `TimestampType`, `DateType` and `TimeType` data conversion in the mocked function `to_char`.
* Added support for the following APIs:

  + `snowflake.snowpark.functions.to_varchar`
  + `snowflake.snowpark.DataFrame.pivot`
  + `snowflake.snowpark.Session.cancel_all`
* Introduced a new exception class `snowflake.snowpark.mock.exceptions.SnowparkLocalTestingException`.
* Added support for casting to `FloatType`.

#### Bug fixes

* Fixed a bug that stored procedures and UDFs should not remove imports already in the `sys.path` during the clean-up step.
* Fixed a bug that when processing `datetime` format, the fractional second part is not handled properly.
* Fixed a bug where file operations on the Windows platform were unable to properly handle file separators in directory names.
* Fixed a bug that on the Windows platform that, when reading a pandas dataframe, an `IntervalType` column with integer data can not be processed.
* Fixed a bug that prevented users from being able to select multiple columns with the same alias.
* Fixed a bug where `Session.get_current_[schema|database|role|user|account|warehouse]` returns uppercased identifiers when identifiers are quoted.
* Fixed a bug that function `substr` and `substring` can not handle a zero-based `start_expr`.

#### Improvements

* Standardized the error experience by raising `SnowparkLocalTestingException` in error cases, which is on par with the `SnowparkSQLException` raised in non-local execution.
* Improved the error experience of the `Session.write_pandas` method so that `NotImplementError` will be raised when called.
* Aligned the error experience with reusing a closed session in non-local execution.

## Version 1.16.0 (2024-05-08)

Version 1.16.0 of the Snowpark library introduces some new features.

### New features

* Added `snowflake.snowpark.Session.lineage.trace` to explore data lineage of Snowflake objects.
* Added support for registering stored procedures with packages given as Python modules.
* Added support for structured type schema parsing.

### Bug fixes

* Fixed a bug where, when inferring a schema, single quotes were added to stage files that already had single quotes.

### Local testing updates

#### New features

* Added support for `StringType`, `TimestampType` and `VariantType` data conversion in the mocked function `to_date`.
* Added support for the following APIs:

  + `snowflake.snowpark.functions`:

    - `get`
    - `concat`
    - `concat_ws`

#### Bug fixes

* Fixed a bug that caused `NaT` and `NaN` values to not be recognized.
* Fixed a bug where, when inferring a schema, single quotes were added to stage files that already had single quotes.
* Fixed a bug where `DataFrameReader.csv` was unable to handle quoted values containing a delimiter.
* Fixed a bug that when there is a `None` value in an arithmetic calculation, the output should remain `None` instead of `math.nan`.
* Fixed a bug in function `sum` and `covar_pop` that when there is a `math.nan` value in the data, the output should also be `math.nan`.
* Fixed a bug where stage operations can not handle directories.
* Fixed a bug that `DataFrame.to_pandas` should take Snowflake numeric types with precision 38 as `int64`.

## Version 1.15.0 (2024-04-24)

Version 1.15.0 of the Snowpark library introduces some new features.

### New features

* Added `truncate` save mode in `DataFrameWrite` to overwrite existing tables by truncating the underlying table instead of dropping it.
* Added telemetry to calculate query plan height and number of duplicate nodes during collect operations.
* Added the functions below to unload data from a `DataFrame` into one or more files in a stage:

  + `DataFrame.write.json`
  + `DataFrame.write.csv`
  + `DataFrame.write.parquet`
* Added distributed tracing using open telemetry APIs for action functions in `DataFrame` and `DataFrameWriter`:

  + `snowflake.snowpark.DataFrame`:

    - `collect`
    - `collect_nowait`
    - `to_pandas`
    - `count`
    - `show`
  + `snowflake.snowpark.DataFrameWriter`:

    - `save_as_table`
* Added support for `snow://` URLs to `snowflake.snowpark.Session.file.get` and `snowflake.snowpark.Session.file.get_stream`
* Added support to register stored procedures and UDFs with a `comment`.
* UDAF client support is ready for public preview. Please stay tuned for the Snowflake announcement of UDAF public preview.
* Added support for dynamic pivot. This feature is currently in private preview.

### Improvements

* Improved the generated query performance for both compilation and execution by converting duplicate subqueries to Common Table Expressions (CTEs).
  It is still an experimental feature and it is not enabled by default. You can enable it by setting `session.cte_optimization_enabled` to `True`.

### Bug fixes

* Fixed a bug where `statement_params` is not passed to query executions that register stored procedures and user defined functions.
* Fixed a bug causing `snowflake.snowpark.Session.file.get_stream` to fail for quoted stage locations.
* Fixed a bug that an internal type hint in `utils.py` might raise `AttributeError` when the underlying module can not be found.

### Local testing updates

#### New features

* Added support for registering UDFs and stored procedures.
* Added support for the following APIs:

  + `snowflake.snowpark.Session`:

    - `file.put`
    - `file.put_stream`
    - `file.get`
    - `file.get_stream`
    - `read.json`
    - `add_import`
    - `remove_import`
    - `get_imports`
    - `clear_imports`
    - `add_packages`
    - `add_requirements`
    - `clear_packages`
    - `remove_package`
    - `udf.register`
    - `udf.register_from_file`
    - `sproc.register`
    - `sproc.register_from_file`
  + `snowflake.snowpark.functions`

    - `current_database`
    - `current_session`
    - `date_trunc`
    - `object_construct`
    - `object_construct_keep_null`
    - `pow`
    - `sqrt`
    - `udf`
    - `sproc`
* Added support for `StringType`, `TimestampType` and `VariantType` data conversion in the mocked function `to_time`.

#### Bug fixes

* Fixed a bug that null filled columns for constant functions.
* Fixed `to_object`, `to_array` and `to_binary` to better handle null inputs.
* Fixed a bug that timestamp data comparison can not handle years beyond 2262.
* Fixed a bug that `Session.builder.getOrCreate` should return the created mock session.

## Version 1.14.0 (2024-03-20)

Version 1.14.0 of the Snowpark library introduces some new features.

### New features

* Added support for creating vectorized UDTFs with the `process` method.
* Added support for dataframe functions:

  + `to_timestamp_ltz`
  + `to_timestamp_ntz`
  + `to_timestamp_tz`
  + `locate`
* Added support for ASOF JOIN type.
* Added support for the following local testing APIs:

  + snowflake.snowpark.functions:

    - `to_double`
    - `to_timestamp`
    - `to_timestamp_ltz`
    - `to_timestamp_ntz`
    - `to_timestamp_tz`
    - `greatest`
    - `least`
    - `convert_timezone`
    - `dateadd`
    - `date_part`
  + snowflake.snowpark.Session:

    - `get_current_account`
    - `get_current_warehouse`
    - `get_current_role`
    - `use_schema`
    - `use_warehouse`
    - `use_database`
    - `use_role`

### Improvements

* Added telemetry to local testing.
* Improved the error message of `DataFrameReader` to raise `FileNotFound` error when reading a path that does not exist or when there are no files under the path.

### Bug fixes

* Fixed a bug in `SnowflakePlanBuilder` where `save_as_table` does not correctly filter columns whose names start with `$` and is followed by a number.
* Fixed a bug where statement parameters might have no effect when resolving imports and packages.
* Fixed bugs in local testing:

  + LEFT ANTI and LEFT SEMI joins drop rows with null values.
  + `DataFrameReader.csv` incorrectly parses data when the optional parameter `field_optionally_enclosed_by` is specified.
  + `Column.regexp` only considers the first entry when `pattern` is a `Column`.
  + `Table.update` raises `KeyError` when updating null values in the rows.
  + VARIANT columns raise errors at `DataFrame.collect`.
  + `count_distinct` does not work correctly when counting.
  + Null values in integer columns raise `TypeError`.

## Version 1.13.0 (2024-02-26)

Version 1.13.0 of the Snowpark library introduces some new features.

### New Features

* Added support for an optional `date_part` argument in function `last_day`.
* `SessionBuilder.app_name` will set the `query_tag` after the session is created.
* Added support for the following local testing functions:

  + `current_timestamp`
  + `current_date`
  + `current_time`
  + `strip_null_value`
  + `upper`
  + `lower`
  + `length`
  + `initcap`

### Improvements

* Added cleanup logic at interpreter shutdown to close all active sessions.

### Bug fixes

* Fixed a bug in `DataFrame.to_local_iterator` where the iterator could yield wrong results if another query is executed before the iterator finishes due to wrong isolation level.
* Fixed a bug that truncated table names in error messages while running a plan with local testing enabled.
* Fixed a bug that `Session.range` returns empty result when the range is large.

## Version 1.12.1 (2024-02-08)

Version 1.12.1 of the Snowpark library introduces some new features.

### Improvements

* Use `split_blocks=True` by default, during `to_pandas` conversion, for optimal memory allocation. This parameter is passed to `pyarrow.Table.to_pandas`, which enables `PyArrow`
  to split the memory allocation into smaller, more manageable blocks instead of allocating a single contiguous block. This results in better memory management when dealing with larger datasets.

### Bug fixes

* Fixed a bug in `DataFrame.to_pandas` that caused an error when evaluating on a Dataframe with an `IntegerType` column with null values.

## Version 1.12.0 (2024-01-29)

Version 1.12.0 of the Snowpark library introduces some new features.

### Behavior Changes (API Compatible)

* When parsing data types during a `to_pandas` operation, we rely on GS precision value to fix precision issues for large integer values. This may affect users where a column that was earlier returned as `int8` gets returned as `int64`. Users can fix this by explicitly specifying precision values for their return column.
* Aligned behavior for `Session.call` in case of table stored procedures where running `Session.call` would not trigger a stored procedure unless a `collect()` operation was performed.
* `StoredProcedureRegistration` now automatically adds `snowflake-snowpark-python` as a package dependency on the client’s local version of the library. An error is thrown if the server cannot support that version.

### New features

* Exposed `statement_params` in `StoredProcedure.__call__`.
* Added two optional arguments to `Session.add_import`:

  + `chunk_size`: The number of bytes to hash per chunk of the uploaded files.
  + `whole_file_hash`: By default only the first chunk of the uploaded import is hashed to save time. When this is set to True each uploaded file is fully hashed instead.
* Added parameters `external_access_integrations` and `secrets` when creating a UDAF from Snowpark Python to allow integration with external access.
* Added a new method `Session.append_query_tag`, which allows an additional tag to be added to the current query tag by appending it as a comma separated value.
* Added a new method `Session.update_query_tag`, which allows updates to a JSON encoded dictionary query tag.
* `SessionBuilder.getOrCreate` will now attempt to replace the singleton it returns when token expiration has been detected.
* Added the following functions in `snowflake.snowpark.functions`:

  + `array_except`
  + `create_map`
  + `sign` / `signum`
* Added the following functions to `DataFrame.analytics`:

  + Added the `moving_agg` function in `DataFrame.analytics` to enable moving aggregations like sums and averages with multiple window sizes.
  + Added the `cumulative_agg` function in `DataFrame.analytics` to enable moving aggregations like sums and averages with multiple window sizes.

### Bug fixes

* Fixed a bug in `DataFrame.na.fill` that caused Boolean values to erroneously override integer values.
* Fixed a bug in `Session.create_dataframe` where the Snowpark DataFrames created using pandas DataFrames were not inferring the type for timestamp columns correctly. The behavior is as follows:

  + Earlier timestamp columns without a timezone would be converted to nanosecond epochs and inferred as `LongType()`, but will now be correctly maintained as timestamp values and be inferred as `TimestampType(TimestampTimeZone.NTZ)`.
  + Earlier timestamp columns with a timezone would be inferred as `TimestampType(TimestampTimeZone.NTZ)` and loose timezone information but will now be correctly inferred as `TimestampType(TimestampTimeZone.LTZ)` and timezone information is retained correctly.
  + Set session parameter `PYTHON_SNOWPARK_USE_LOGICAL_TYPE_FOR_CREATE_DATAFRAME` to revert back to old behavior. Snowflake recommends that you update your code to align with correct behavior because the parameter will be removed in the future.
* Fixed a bug that `DataFrame.to_pandas` gets decimal type when scale is not 0, and creates an object dtype in `pandas`. Instead, we cast the value to a float64 type.
* Fixed bugs that wrongly flattened the generated SQL when one of the following happens:

  + `DataFrame.filter()` is called after `DataFrame.sort().limit()`.
  + `DataFrame.sort()` or `filter()` is called on a DataFrame that already has a window function or sequence-dependent data generator column.
    For instance, `df.select("a", seq1().alias("b")).select("a", "b").sort("a")` won’t flatten the sort clause anymore.
  + A window or sequence-dependent data generator column is used after `DataFrame.limit()`. For instance, `df.limit(10).select(row_number().over())` won’t flatten the limit and select in the generated SQL.
* Fixed a bug where aliasing a DataFrame column raised an error when the DataFrame was copied from another DataFrame with an aliased column. For instance,

  ```python
  df = df.select(col("a").alias("b"))
  df = copy(df)
  df.select(col("b").alias("c"))  # Threw an error. Now it's fixed.
  ```
* Fixed a bug in `Session.create_dataframe` that the non-nullable field in a schema is not respected for Boolean type. Note that this fix is only effective when the user has the privilege to create a temp table.
* Fixed a bug in SQL simplifier where non-select statements in `session.sql` dropped a SQL query when used with `limit()`.
* Fixed a bug that raised an exception when session parameter `ERROR_ON_NONDETERMINISTIC_UPDATE` is true.

---
title: Snowpark Library for Python release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python-2025.md
section: Release Notes
---

# Snowpark Library for Python release notes for 2025

This article contains the release notes for the [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) updates.

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

> **Warning:**
>
> Because Python 3.8 has reached its [End of Life](https://devguide.python.org/versions/), deprecation warnings will be triggered when you use snowpark-python with Python 3.8. For more information, see [Snowflake Python Runtime Support](../../developer-guide/python-runtime-support-policy.md). Snowpark Python 1.24.0 will be the last client and server version to support Python 3.8, in accordance with [Anaconda’s policy](https://forum.anaconda.com/t/python-3-8-reaches-end-of-life/87265). Upgrade your existing Python 3.8 objects to Python 3.9 or later.

## Version 1.44.0: Dec 15, 2025

### New features

* Added support for targeted delete-insert via the `overwrite_condition` parameter in `DataFrameWriter.save_as_table`.

### Improvements

* Improved `DataFrameReader` to return columns in deterministic order when using `INFER_SCHEMA`.
* Added a dependency on `protobuf<6.34` (was `<6.32`).

## Version 1.43.0: Dec 03, 2025

### New features

* Added support for `DataFrame.lateral_join`
* Added support for Private Preview feature `Session.client_telemetry`.
* Added support for `Session.udf_profiler`.
* Added support for `functions.ai_translate`.
* Added support for the following `iceberg_config` options in `DataFrameWriter.save_as_table` and `DataFrame.copy_into_table`:

  + `target_file_size`
  + `partition_by`
* Added support for the following functions in `functions.py`:

  > + String and Binary functions:
  >
  >   - `base64_decode_binary`
  >   - `bucket`
  >   - `compress`
  >   - `day`
  >   - `decompress_binary`
  >   - `decompress_string`
  >   - `md5_binary`
  >   - `md5_number_lower64`
  >   - `md5_number_upper64`
  >   - `sha1_binary`
  >   - `sha2_binary`
  >   - `soundex_p123`
  >   - `strtok`
  >   - `truncate`
  >   - `try_base64_decode_binary`
  >   - `try_base64_decode_string`
  >   - `try_hex_decode_binary`
  >   - `try_hex_decode_string`
  >   - `unicode`
  >   - `uuid_string`
  > + Conditional expressions:
  >
  >   - `booland_agg`
  >   - `boolxor_agg`
  >   - `regr_valy`
  >   - `zeroifnull`
  > + Numeric expressions:
  >
  >   - `cot`
  >   - `mod`
  >   - `pi`
  >   - `square`
  >   - `width_bucket`

### Bug fixes

* Fixed a bug where automatically-generated temporary objects were not properly cleaned up.
* Fixed a bug in SQL generation when joining two `DataFrames` created using `DataFrame.alias` and CTE optimization is enabled.
* Fixed a bug in `XMLReader` where finding the start position of a row tag could return an incorrect file position.

### Improvements

* Enhanced `DataFrame.sort()` to support ORDER BY ALL when no columns are specified.
* Removed experimental warning from `Session.cte_optimization_enabled`.

### Snowpark pandas API updates

#### New features

* Added support for `DataFrame.groupby.rolling()`.
* Added support for mapping `np.percentile` with DataFrame and Series inputs to `Series.quantile`.
* Added support for setting the `random_state` parameter to an integer when calling `DataFrame.sample` or `Series.sample`.
* Added support for the following `iceberg_config` options in `to_iceberg`:

  + `target_file_size`
  + `partition_by`

#### Improvements

* Enhanced autoswitching functionality from Snowflake to native pandas for methods with unsupported argument combinations:

  + `shift()` with `suffix` or non-integer `periods` parameters
  + `sort_index()` with `axis=1` or `key` parameters
  + `sort_values()` with `axis=1`
  + `melt()` with `col_level` parameter
  + `apply()` with `result_type` parameter for DataFrame
  + `pivot_table()` with `sort=True`, non-string `index` list, non-string `columns` list, non-string `values` list, or `aggfunc` dict with non-string values
  + `fillna()` with `downcast` parameter or using `limit` together with `value`
  + `dropna()` with `axis=1`
  + `asfreq()` with `how` parameter, `fill_value` parameter, `normalize=True`, or `freq` parameter being week, month, quarter, or year
  + `groupby()` with `axis=1`, `by!=None and level!=None`, or by containing any non-pandas hashable labels.
  + `groupby_fillna()` with `downcast` parameter
  + `groupby_first()` with `min_count>1`
  + `groupby_last()` with `min_count>1`
  + `groupby_shift()` with `freq` parameter
* Slightly improved the performance of `agg`, `nunique`, `describe`, and related methods on 1-column DataFrame and Series objects.
* Add support for the following in faster pandas:

  + `groupby.apply`
  + `groupby.nunique`
  + `groupby.size`
  + `concat`
  + `copy`
  + `str.isdigit`
  + `str.islower`
  + `str.isupper`
  + `str.istitle`
  + `str.lower`
  + `str.upper`
  + `str.title`
  + `str.match`
  + `str.capitalize`
  + `str.__getitem__`
  + `str.center`
  + `str.count`
  + `str.get`
  + `str.pad`
  + `str.len`
  + `str.ljust`
  + `str.rjust`
  + `str.split`
  + `str.replace`
  + `str.strip`
  + `str.lstrip`
  + `str.rstrip`
  + `str.translate`
  + `dt.tz_localize`
  + `dt.tz_convert`
  + `dt.ceil`
  + `dt.round`
  + `dt.floor`
  + `dt.normalize`
  + `dt.month_name`
  + `dt.day_name`
  + `dt.strftime`
  + `dt.dayofweek`
  + `dt.weekday`
  + `dt.dayofyear`
  + `dt.isocalendar`
  + `rolling.min`
  + `rolling.max`
  + `rolling.count`
  + `rolling.sum`
  + `rolling.mean`
  + `rolling.std`
  + `rolling.var`
  + `rolling.sem`
  + `rolling.corr`
  + `expanding.min`
  + `expanding.max`
  + `expanding.count`
  + `expanding.sum`
  + `expanding.mean`
  + `expanding.std`
  + `expanding.var`
  + `expanding.sem`
  + `cumsum`
  + `cummin`
  + `cummax`
  + `groupby.groups`
  + `groupby.indices`
  + `groupby.first`
  + `groupby.last`
  + `groupby.rank`
  + `groupby.shift`
  + `groupby.cumcount`
  + `groupby.cumsum`
  + `groupby.cummin`
  + `groupby.cummax`
  + `groupby.any`
  + `groupby.all`
  + `groupby.unique`
  + `groupby.get_group`
  + `groupby.rolling`
  + `groupby.resample`
  + `to_snowflake`
  + `to_snowpark`
  + `resample.min`
  + `resample.max`
  + `resample.count`
  + `resample.sum`
  + `resample.mean`
  + `resample.median`
  + `resample.std`
  + `resample.var`
  + `resample.size`
  + `resample.first`
  + `resample.last`
  + `resample.quantile`
  + `resample.nunique`
* Make faster pandas disabled by default (opt-in instead of opt-out).
* Improve performance of `drop_duplicates` by avoiding joins when `keep!=False` in faster pandas.

#### Bug fixes

* Fixed a bug in `DataFrameGroupBy.agg` where func is a list of tuples used to set the names of the output columns.
* Fixed a bug where converting a modin datetime index with a timezone to a numpy array with `np.asarray` would cause a `TypeError`.
* Fixed a bug where `Series.isin` with a Series argument matched index labels instead of the row position.

## Version 1.42.0: Oct 29, 2025

### New features

* Snowpark Python DB-API is now generally available.

  To access this feature, use `DataFrameReader.dbapi()` to read data from a database table or query into a DataFrame using a DB-API connection.

## Version 1.41.0: Oct 23, 2025

### New features

* Added a new function `service` in `snowflake.snowpark.functions` that allows users to create a callable representing a Snowpark Container Services (SPCS) service.
* Added a new function `group_by_all()` to the `DataFrame` class.
* Added `connection_parameters` parameter to `DataFrameReader.dbapi()` (Public Preview) method to allow passing keyword arguments to the `create_connection` callable.
* Added support for `Session.begin_transaction`, `Session.commit`, and `Session.rollback`.
* Added support for the following functions in `functions.py`:

  + Geospatial functions:

    - `st_interpolate`
    - `st_intersection`
    - `st_intersection_agg`
    - `st_intersects`
    - `st_isvalid`
    - `st_length`
    - `st_makegeompoint`
    - `st_makeline`
    - `st_makepolygon`
    - `st_makepolygonoriented`
    - `st_disjoint`
    - `st_distance`
    - `st_dwithin`
    - `st_endpoint`
    - `st_envelope`
    - `st_geohash`
    - `st_geomfromgeohash`
    - `st_geompointfromgeohash`
    - `st_hausdorffdistance`
    - `st_makepoint`
    - `st_npoints`
    - `st_perimeter`
    - `st_pointn`
    - `st_setsrid`
    - `st_simplify`
    - `st_srid`
    - `st_startpoint`
    - `st_symdifference`
    - `st_transform`
    - `st_union`
    - `st_union_agg`
    - `st_within`
    - `st_x`
    - `st_xmax`
    - `st_xmin`
    - `st_y`
    - `st_ymax`
    - `st_ymin`
    - `st_geogfromgeohash`
    - `st_geogpointfromgeohash`
    - `st_geographyfromwkb`
    - `st_geographyfromwkt`
    - `st_geometryfromwkb`
    - `st_geometryfromwkt`
    - `try_to_geography`
    - `try_to_geometry`
* Added a parameter to enable and disable automatic column name aliasing for `interval_day_time_from_parts` and `interval_year_month_from_parts` functions.

### Bug fixes

* Fixed a bug that `DataFrameReader.xml` fails to parse XML files with undeclared namespaces when `ignoreNamespace` is `True`.
* Added a fix for floating point precision discrepancies in `interval_day_time_from_parts`.
* Fixed a bug where writing Snowpark pandas DataFrames on the pandas backend with a column multiindex to Snowflake with `to_snowflake` would raise `KeyError`.
* Fixed a bug that `DataFrameReader.dbapi` (Public Preview) is not compatible with oracledb 3.4.0.
* Fixed a bug where `modin` would unintentionally be imported during session initialization in some scenarios.
* Fixed a bug where `session.udf|udtf|udaf|sproc.register` failed when an extra session argument was passed. These methods do not expect a session argument; please remove it if provided.

### Improvements

* The default maximum length for inferred StringType columns during schema inference in `DataFrameReader.dbapi` is now increased from 16 MB to 128 MB in parquet file–based ingestion.

### Dependency updates

* Updated dependency of `snowflake-connector-python>=3.17,<5.0.0`.

### Snowpark pandas API updates

#### New features

* Added support for the `dtypes` parameter of `pd.get_dummies`.
* Added support for `nunique` in `df.pivot_table`, `df.agg`, and other places where aggregate functions can be used.
* Added support for `DataFrame.interpolate` and `Series.interpolate` with the “linear”, “ffill”/”pad”, and “backfill”/bfill” methods. These use the SQL `INTERPOLATE_LINEAR`, `INTERPOLATE_FFILL`, and `INTERPOLATE_BFILL` functions (Public Preview).

#### Improvements

* Improved performance of `Series.to_snowflake` and `pd.to_snowflake(series)` for large data by uploading data via a parquet file. You can control the dataset size at which Snowpark pandas switches to parquet with the variable `modin.config.PandasToSnowflakeParquetThresholdBytes`.
* Enhanced autoswitching functionality from Snowflake to native pandas for methods with unsupported argument combinations:

  + `get_dummies()` with `dummy_na=True`, `drop_first=True`, or custom `dtype` parameters
  + `cumsum()`, `cummin()`, `cummax()` with `axis=1` (column-wise operations)
  + `skew()` with `axis=1` or `numeric_only=False` parameters
  + `round()` with `decimals` parameter as a Series
  + `corr()` with `method!=pearson` parameter
* Set `cte_optimization_enabled` to True for all Snowpark pandas sessions.
* Add support for the following in faster pandas:

  + `isin`
  + `isna`
  + `isnull`
  + `notna`
  + `notnull`
  + `str.contains`
  + `str.startswith`
  + `str.endswith`
  + `str.slice`
  + `dt.date`
  + `dt.time`
  + `dt.hour`
  + `dt.minute`
  + `dt.second`
  + `dt.microsecond`
  + `dt.nanosecond`
  + `dt.year`
  + `dt.month`
  + `dt.day`
  + `dt.quarter`
  + `dt.is_month_start`
  + `dt.is_month_end`
  + `dt.is_quarter_start`
  + `dt.is_quarter_end`
  + `dt.is_year_start`
  + `dt.is_year_end`
  + `dt.is_leap_year`
  + `dt.days_in_month`
  + `dt.daysinmonth`
  + `sort_values`
  + `loc` (setting columns)
  + `to_datetime`
  + `drop`
  + `invert`
  + `duplicated`
  + `iloc`
  + `head`
  + `columns` (e.g., df.columns = [“A”, “B”])
  + `agg`
  + `min`
  + `max`
  + `count`
  + `sum`
  + `mean`
  + `median`
  + `std`
  + `var`
  + `groupby.agg`
  + `groupby.min`
  + `groupby.max`
  + `groupby.count`
  + `groupby.sum`
  + `groupby.mean`
  + `groupby.median`
  + `groupby.std`
  + `groupby.var`
  + `drop_duplicates`
* Reuse row count from the relaxed query compiler in `get_axis_len`.

#### Bug fixes

* Fixed a bug where the row count was not cached in the ordered DataFrame each time `count_rows()` was called.

## Version 1.40.0: October 6, 2025

### New features

* Added a new module `snowflake.snowpark.secrets` that provides Python wrappers for accessing Snowflake Secrets within Python UDFs and stored procedures that execute inside Snowflake.

  + `get_generic_secret_string`
  + `get_oauth_access_token`
  + `get_secret_type`
  + `get_username_password`
  + `get_cloud_provider_token`
* Added support for the following scalar functions in `functions.py`:

  > + Conditional expression functions:
  >
  >   - `booland`
  >   - `boolnot`
  >   - `boolor`
  >   - `boolxor`
  >   - `boolor_agg`
  >   - `decode`
  >   - `greatest_ignore_nulls`
  >   - `least_ignore_nulls`
  >   - `nullif`
  >   - `nvl2`
  >   - `regr_valx`
  > + Semi-structured and structured date functions:
  >
  >   - `array_remove_at`
  >   - `as_boolean`
  >   - `map_delete`
  >   - `map_insert`
  >   - `map_pick`
  >   - `map_size`
  > + String & binary functions:
  >
  >   - `chr`
  >   - `hex_decode_binary`
  > + Numeric functions:
  >
  >   - `div0null`
  > + Differential privacy functions:
  >
  >   - `dp_interval_high`
  >   - `dp_interval_low`
  > + Context functions:
  >
  >   - `last_query_id`
  >   - `last_transaction`
  > + Geospatial functions:
  >
  >   - `h3_cell_to_boundary`
  >   - `h3_cell_to_children`
  >   - `h3_cell_to_children_string`
  >   - `h3_cell_to_parent`
  >   - `h3_cell_to_point`
  >   - `h3_compact_cells`
  >   - `h3_compact_cells_strings`
  >   - `h3_coverage`
  >   - `h3_coverage_strings`
  >   - `h3_get_resolution`
  >   - `h3_grid_disk`
  >   - `h3_grid_distance`
  >   - `h3_int_to_string`
  >   - `h3_polygon_to_cells`
  >   - `h3_polygon_to_cells_strings`
  >   - `h3_string_to_int`
  >   - `h3_try_grid_path`
  >   - `h3_try_polygon_to_cells`
  >   - `h3_try_polygon_to_cells_strings`
  >   - `h3_uncompact_cells`
  >   - `h3_uncompact_cells_strings`
  >   - `haversine`
  >   - `h3_grid_path`
  >   - `h3_is_pentagon`
  >   - `h3_is_valid_cell`
  >   - `h3_latlng_to_cell`
  >   - `h3_latlng_to_cell_string`
  >   - `h3_point_to_cell`
  >   - `h3_point_to_cell_string`
  >   - `h3_try_coverage`
  >   - `h3_try_coverage_strings`
  >   - `h3_try_grid_distance`
  >   - `st_area`
  >   - `st_asewkb`
  >   - `st_asewkt`
  >   - `st_asgeojson`
  >   - `st_aswkb`
  >   - `st_aswkt`
  >   - `st_azimuth`
  >   - `st_buffer`
  >   - `st_centroid`
  >   - `st_collect`
  >   - `st_contains`
  >   - `st_coveredby`
  >   - `st_covers`
  >   - `st_difference`
  >   - `st_dimension`

### Bug fixes

* Fixed a bug that caused `DataFrame.limit()` to fail if the executed SQL contained parameter binding when used in non-stored-procedure/udxf environments.
* Added an experimental fix for a bug in schema query generation that could cause invalid sql to be generated when using nested structured types.
* Fixed multiple bugs in `DataFrameReader.dbapi` (Public Preview):

  + Fixed UDTF ingestion failure with `pyodbc` driver caused by unprocessed row data.
  + Fixed SQL Server query input failure due to incorrect select query generation.
  + Fixed UDTF ingestion not preserving column nullability in the output schema.
  + Fixed an issue that caused the program to hang during multithreaded Parquet based ingestion when a data fetching error occurred.
  + Fixed a bug in schema parsing when custom schema strings used upper-cased data type names (`NUMERIC`, `NUMBER`, `DECIMAL`, `VARCHAR`, `STRING`, `TEXT`).
* Fixed a bug in `Session.create_dataframe` where schema string parsing failed when using upper-cased data type names (e.g., `NUMERIC`, `NUMBER`, `DECIMAL`, `VARCHAR`, `STRING`, `TEXT`).

### Improvements

* Improved `DataFrameReader.dbapi` (Public Preview) so it doesn’t retry on non-retryable errors, such as SQL syntax error on external data source query.
* Removed unnecessary warnings about local package version mismatch when using `session.read.option('rowTag', <tag_name>).xml(<stage_file_path>)` or `xpath` functions.
* Improved `DataFrameReader.dbapi` (Public Preview) reading performance by setting the default `fetch_size` parameter value to 100000.
* Improved error message for XSD validation failure when reading XML files using `session.read.option('rowValidationXSDPath', <xsd_path>).xml(<stage_file_path>)`.

### Snowpark pandas API updates

#### Dependency updates

* Updated the supported `modin` versions to >=0.36.0 and <0.38.0 (was >= 0.35.0 and <0.37.0).

#### New features

* Added support for `DataFrame.query` for DataFrames with single-level indexes.
* Added support for `DataFrameGroupby.__len__` and `SeriesGroupBy.__len__`.

#### Improvements

* Hybrid execution mode is now enabled by default. Certain operations on smaller data now automatically execute in native pandas in-memory. Use `from modin.config import AutoSwitchBackend; AutoSwitchBackend.disable()` to turn this off and force all execution to occur in Snowflake.
* Added a session parameter `pandas_hybrid_execution_enabled` to enable/disable hybrid execution as an alternative to using `AutoSwitchBackend`.
* Removed an unnecessary `SHOW OBJECTS` query issued from `read_snowflake` under certain conditions.
* When hybrid execution is enabled, `pd.merge`, `pd.concat`, `DataFrame.merge`, and `DataFrame.join` can now move arguments to backends other than those among the function arguments.
* Improved performance of `DataFrame.to_snowflake` and `pd.to_snowflake(dataframe)` for large data by uploading data via a parquet file. You can control the dataset size at which Snowpark pandas switches to parquet with the variable `modin.config.PandasToSnowflakeParquetThresholdBytes`.

## Version 1.39.1: September 25, 2025

### Bug fixes

* Added an experimental fix for a bug in schema query generation that could cause invalid SQL to be genrated when using nested structured types.

## Version 1.39.0: September 17, 2025

### New features

* Downgraded to level `logging.DEBUG - 1` the log message saying that the
  Snowpark `DataFrame` reference of an internal `DataFrameReference` object
  has changed.
* Eliminate duplicate parameter check queries for casing status when retrieving the session.
* Retrieve DataFrame row counts through object metadata to avoid a COUNT(\*) query (performance)
* Added support for applying the Snowflake Cortex function `Complete`.
* Introduce faster pandas: Improved performance by deferring row position computation.

  + The following operations are currently supported and can benefit from the optimization: `read_snowflake`, `repr`, `loc`, `reset_index`, `merge`, and binary operations.
  + If a lazy object (e.g., DataFrame or Series) depends on a mix of supported and unsupported operations, the optimization will not be used.
* Updated the error message for when Snowpark pandas is referenced within `apply`.
* Added a session parameter `dummy_row_pos_optimization_enabled` to enable/disable dummy row position optimization in faster pandas.

### Dependency updates

* Updated the supported `modin` versions to >=0.35.0 and <0.37.0 (was previously >= 0.34.0 and <0.36.0).

### Bug fixes

* Fixed an issue with `drop_duplicates` where the same data source could be read multiple times in the same query but in a different order each time, resulting in missing rows in the final result. The fix ensures that the data source is read only once.
* Fixed a bug with hybrid execution mode where an `AssertionError` was unexpectedly raised by certain indexing operations.

### Snowpark local testing updates

#### New features

* Added support to allow patching `functions.ai_complete`.

## Version 1.38.0: September 4, 2025

### New features

* Added support for the following AI-powered functions in `functions.py`:

  > + `ai_extract`
  > + `ai_parse_document`
  > + `ai_transcribe`
* Added time travel support for querying historical data:

  > + `Session.table()` now supports time travel parameters:
  >
  >   - `time_travel_mode`
  >   - `statement`
  >   - `offset`
  >   - `timestamp`
  >   - `timestamp_type`
  >   - `stream`
  > + `DataFrameReader.table()` supports the same time travel parameters as direct arguments.
  > + `DataFrameReader` supports time travel via option chaining (e.g., `session.read.option("time_travel_mode", "at").option("offset", -60).table("my_table")`).
* Added support for specifying the following parameters to `DataFrameWriter.copy_into_location` for validation and writing data to external locations:

  > + `validation_mode`
  > + `storage_integration`
  > + `credentials`
  > + `encryption`
* Added support for `Session.directory` and `Session.read.directory` to retrieve the list of all files on a stage with metadata.
* Added support for `DataFrameReader.jdbc(Private Preview)` that allows the JDBC driver to ingest external data sources.
* Added support for `FileOperation.copy_files` to copy files from a source location to an output stage.
* Added support for the following scalar functions in `functions.py`:

  > + `all_user_names`
  > + `bitand`
  > + `bitand_agg`
  > + `bitor`
  > + `bitor_agg`
  > + `bitxor`
  > + `bitxor_agg`
  > + `current_account_name`
  > + `current_client`
  > + `current_ip_address`
  > + `current_role_type`
  > + `current_organization_name`
  > + `current_organization_user`
  > + `current_secondary_roles`
  > + `current_transaction`
  > + `getbit`

### Bug fixes

* Fixed the `_repr_` of `TimestampType` to match the actual subtype it represents.
* Fixed a bug in `DataFrameReader.dbapi` that `UDTF` ingestion does not work in stored procedures.
* Fixed a bug in schema inference that caused incorrect stage prefixes to be used.

### Improvements

* Enhanced error handling in `DataFrameReader.dbapi` thread-based ingestion to prevent unnecessary operations, which improves resource efficiency.
* Bumped cloudpickle dependency to also support `cloudpickle==3.1.1` in addition to previous versions.
* Improved `DataFrameReader.dbapi` (Public Preview) ingestion performance for PostgreSQL and MySQL by using a server-side cursor to fetch data.

### Snowpark pandas API Updates

### New features

* Completed support for the following functions on the “Pandas” and “Ray” backends:

  > + `pd.read_snowflake()`
  > + `pd.to_iceberg()`
  > + `pd.to_pandas()`
  > + `pd.to_snowpark()`
  > + `pd.to_snowflake()`
  > + `DataFrame.to_iceberg()`
  > + `DataFrame.to_pandas()`
  > + `DataFrame.to_snowpark()`
  > + `DataFrame.to_snowflake()`
  > + `Series.to_iceberg()`
  > + `Series.to_pandas()`
  > + `Series.to_snowpark()`
  > + `Series.to_snowflake()`
  >
  >   on the “Pandas” and “Ray” backends. Previously, only some of these functions and methods were supported on the Pandas backend.
* Added support for `Index.get_level_values()`.

### Improvements

* Set the default transfer limit in hybrid execution for data leaving Snowflake to 100k, which can be overridden with the `SnowflakePandasTransferThreshold` environment variable. This configuration is appropriate for scenarios with two available engines, “pandas” and “Snowflake,” on relational workloads.
* Improved the import error message by adding `--upgrade` to `pip install "snowflake-snowpark-python[modin]"` in the message.
* Reduced the telemetry messages from the modin client by pre-aggregating into five-second windows and only keeping a narrow band of metrics that are useful for tracking hybrid execution and native pandas performance.
* Set the initial row count only when hybrid execution is enabled, which reduces the number of queries issued for many workloads.
* Added a new test parameter for integration tests to enable hybrid execution.

### Bug fixes

* Raised `NotImplementedError` instead of `AttributeError` on attempting to call
  Snowflake extension functions/methods `to_dynamic_table()`, `cache_result()`,
  `to_view()`, `create_or_replace_dynamic_table()`, and
  `create_or_replace_view()` on DataFrames or series using the pandas or ray
  backends.

## Version 1.37.0: August 18, 2025

### New features

* Added support for the following `xpath` functions in `functions.py`:

  > + `xpath`
  > + `xpath_string`
  > + `xpath_boolean`
  > + `xpath_int`
  > + `xpath_float`
  > + `xpath_double`
  > + `xpath_long`
  > + `xpath_short`
* Added support for the `use_vectorized_scanner` parameter in the `Session.write_arrow()` function.
* DataFrame profiler adds the following information about each query: `describe query time`, `execution time`, and `sql query text`. To view this information, call `session.dataframe_profiler.enable()` and call `get_execution_profile` on a DataFrame.
* Added support for `DataFrame.col_ilike`.
* Added support for non-blocking stored procedure calls that return `AsyncJob` objects.

  > + Added the `block: bool = True` parameter to `Session.call()`. When `block=False`, returns an `AsyncJob` instead of blocking until completion.
  > + Added the `block: bool = True` parameter to `StoredProcedure.__call__()` for async support across both named and anonymous stored procedures.
  > + Added `Session.call_nowait()` that is equivalent to `Session.call(block=False)`.

### Bug fixes

* Fixed a bug in CTE optimization stage where `deepcopy` of internal plans would cause a memory spike when a DataFrame is created locally using `session.create_dataframe()` using large input data.
* Fixed a bug in `DataFrameReader.parquet` where the `ignore_case` option in the `infer_schema_options` was not respected.
* Fixed a bug where `to_pandas()` had a different format of column name when the query result format is set to `JSON` and `ARROW`.

### Deprecations

* Deprecated `pkg_resources`.

### Dependency updates

* Added a dependency on `protobuf<6.32`

### Snowpark pandas API Updates

### New features

* Added support for efficient transfer of data between Snowflake and [<Ray](https://www.ray.io/) with the `DataFrame.set_backend` method. The installed version of `modin` must be at least 0.35.0, and `ray` must be installed.

### Dependency updates

* Updated the supported modin versions to >=0.34.0 and <0.36.0 (was previously >= 0.33.0 and <0.35.0).
* Added support for pandas 2.3 when the installed modin version is 0.35.0 or greater.

### Bug fixes

* Fixed an issue in hybrid execution mode (Private Preview) where `pd.to_datetime` and `pd.to_timedelta` would unexpectedly raise `IndexError`.
* Fixed a bug where `pd.explain_switch` would raise `IndexError` or return `None` if called before any potential switch operations were performed.

## Version 1.36.0: August 5, 2025

### New features

* `Session.create_dataframe` now accepts keyword arguments that are forwarded in the internal call to `Session.write_pandas` or `Session.write_arrow` when creating a DataFrame from a pandas DataFrame or a `pyarrow` table.
* Added new APIs for AsyncJob:

  > + `AsyncJob.is_failed()` returns a bool indicating whether a job has failed. Can be used in combination with `AsyncJob.is_done()` to determine if a job is finished and erred.
  > + `AsyncJob.status()` returns a string representing the current query status (such as, “RUNNING”, “SUCCESS”, “FAILED_WITH_ERROR”) for detailed monitoring without calling `result()`.
* Added a DataFrame profiler. To use, you can call `get_execution_profile()` on your desired DataFrame. This profiler reports the queries executed to evaluate a DataFrame and statistics about each of the query operators. Currently an experimental feature.
* Added support for the following functions in `functions.py`:

  > + `ai_sentiment`
* Updated the interface for the `context.configure_development_features` experimental feature. All development features are disabled by default unless explicitly enabled by the user.

### Improvements

* Hybrid execution row estimate improvements and a reduction of eager calls.
* Added a new configuration variable to control transfer costs out of Snowflake when using hybrid execution.
* Added support for creating permanent and immutable UDFs/UDTFs with DataFrame/Series/GroupBy.apply, map, and transform by passing the `snowflake_udf_params` keyword argument.
* Added support for `mapping np.unique` to DataFrame and Series inputs using `pd.unique`.

### Bug fixes

* Fixed an issue where the Snowpark pandas plugin would unconditionally disable `AutoSwitchBackend` even when users have explicitly configured it programmatically or with environment variables.

## Version 1.35.0: July 24, 2025

### New features

* Added support for the following functions in `functions.py`:

  > + `ai_embed`
  > + `try_parse_json`

### Improvements

* Improved `query` parameter in `DataFrameReader.dbapi` (Private Preview) so that parentheses aren’t needed around the query.
* Improved error experience in `DataFrameReader.dbapi` (Private Preview) for exceptions raised when inferring the schema of the target data source.

### Bug fixes

* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) that fails `dbapi` with process exit code 1 in a Python stored procedure.
* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) where `custom_schema` accepts an illegal schema.
* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) where `custom_schema` doesn’t work when connecting to Postgres and MySQL.
* Fixed a bug in schema inference that causes it to fail for external stages.

### Snowpark local testing updates

#### New features

* Added local testing support for reading files with `SnowflakeFile`. The testing support uses local file paths, the Snow URL semantic (`snow://...`), local testing framework stages, and Snowflake stages (`@stage/file_path`).

## Version 1.34.0: Jul 14, 2025

### New features

* Added a new option `TRY_CAST` to `DataFrameReader`. When `TRY_CAST` is `True`, columns are wrapped in a `TRY_CAST` statement instead of a hard cast when loading data.
* Added a new option `USE_RELAXED_TYPES` to the `INFER_SCHEMA_OPTIONS` of `DataFrameReader`. When set to `True`, this option casts all strings to max length strings and all numeric types to `DoubleType`.
* Added debuggability improvements to eagerly validate dataframe schema metadata. Enable it using `snowflake.snowpark.context.configure_development_features()`.
* Added a new function `snowflake.snowpark.dataframe.map_in_pandas` that allows users to map a function across a dataframe. The mapping function takes an iterator of pandas DataFrames as input and provides one as output.
* Added a `ttl cache` to describe queries. Repeated queries in a 15-second interval use the cached value rather than requery Snowflake.
* Added a parameter `fetch_with_process` to `DataFrameReader.dbapi` (PrPr) to enable multiprocessing for parallel data fetching in local ingestion. By default, local ingestion uses multithreading. Multiprocessing can improve performance for CPU-bound tasks like Parquet file generation.
* Added a new function `snowflake.snowpark.functions.model` that allows users to call methods of a model.

### Improvements

* Added support for row validation using XSD schema using `rowValidationXSDPath` option when reading XML files with a row tag using `rowTag` option.
* Improved SQL generation for `session.table().sample()` to generate a flat SQL statement.
* Added support for complex column expression as input for `functions.explode`.
* Added debuggability improvements to show which Python lines a SQL compilation error corresponds to. Enable it using `snowflake.snowpark.context.configure_development_features()`. This feature also depends on AST collections to be enabled in the session, which can be done using `session.ast_enabled = True`.
* Set `enforce_ordering=True` when calling `to_snowpark_pandas()` from a Snowpark DataFrame containing DML/DDL queries instead of throwing a `NotImplementedError`.

### Bug fixes

* Fixed a bug caused by redundant validation when creating an iceberg table.
* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) where closing the cursor or connection could unexpectedly raise an error and terminate the program.
* Fixed ambiguous column errors when using table functions in `DataFrame.select()` that have output columns matching the input DataFrame’s columns. This improvement works when DataFrame columns are provided as `Column` objects.
* Fixed a bug where having a NULL in a column with DecimalTypes would cast the column to FloatTypes instead and lead to precision loss.

### Snowpark Local testing Updates

* Fixed a bug when processing windowed functions that lead to incorrect indexing in results.
* When a scalar numeric is passed to `fillna`, Snowflake will ignore non-numeric columns instead of producing an error.

### Snowpark pandas API Updates

#### New features

* Added support for `DataFrame.to_excel` and `Series.to_excel`.
* Added support for `pd.read_feather`, `pd.read_orc`, and `pd.read_stata`.
* Added support for `pd.explain_switch()` to return debugging information on hybrid execution decisions.
* Support `pd.read_snowflake` when the global modin backend is `Pandas`.
* Added support for `pd.to_dynamic_table`, `pd.to_iceberg`, and `pd.to_view`.

#### Improvements

* Added modin telemetry on API calls and hybrid engine switches.
* Show more helpful error messages to Snowflake Notebook users when the `modin` or `pandas` version does not match our requirements.
* Added a data type guard to the cost functions for hybrid execution mode (Private Preview) that checks for data type compatibility.
* Added automatic switching to the pandas backend in hybrid execution mode (Private Preview) for many methods that are not directly implemented in pandas on Snowflake.
* Set the `type` and other standard fields for pandas on Snowflake telemetry.

#### Dependency updates

* Added `tqdm` and `ipywidgets` as dependencies so that progress bars appear when the user switches between modin backends.
* Updated the supported `modin` versions to >=0.33.0 and <0.35.0 (was previously >= 0.32.0 and <0.34.0).

#### Bug fixes

* Fixed a bug in Hybrid Execution mode (Private Preview) where certain series operations would raise `TypeError: numpy.ndarray object is not callable`.
* Fixed a bug in hybrid execution mode (Private Preview) where calling `numpy` operations like `np.where` on modin objects with the Pandas backend would raise an `AttributeError`. This fix requires `modin` version 0.34.0 or later.
* Fixed an issue in `df.melt` where the resulting values have an additional suffix applied.

## Version 1.33.0 (2025-06-19)

### New features

* Added support for MySQL in `DataFrameWriter.dbapi` (Private Preview) for both Parquet and UDTF-based ingestion.
* Added support for PostgreSQL in `DataFrameReader.dbapi` (Private Preview) for both Parquet and UDTF-based ingestion.
* Added support for Databricks in `DataFrameWriter.dbapi` (Private Preview) for UDTF-based ingestion, consolidating with other mentions of Databricks support.
* Added support to `DataFrameReader` to enable use of `PATTERN` when reading files with `INFER_SCHEMA` enabled.
* Added support for the following AI-powered functions in `functions.py`:

  > + `ai_complete`
  > + `ai_similarity`
  > + `ai_summarize_agg` (originally `summarize_agg`)
  > + different config options for `ai_classify`
* Added support for more options when reading XML files with a row tag using `rowTag` option:

  + Added support for removing namespace prefixes from column names using `ignoreNamespace` option.
  + Added support for specifying the prefix for the attribute column in the result table using `attributePrefix` option.
  + Added support for excluding attributes from the XML element using `excludeAttributes` option.
  + Added support for specifying the column name for the value when there are attributes in an element that has no child elements using `valueTag` option.
  + Added support for specifying the value to treat as a null value using `nullValue` option.
  + Added support for specifying the character encoding of the XML file using `charset` option.
  + Added support for ignoring surrounding whitespace in the XML element using `ignoreSurroundingWhitespace` option.
* Added support for parameter `return_dataframe` in `Session.call`, which can be used to set the return type of the functions to a `DataFrame` object.
* Added a new argument to `Dataframe.describe` called `strings_include_math_stats` that triggers `stddev` and `mean` to be calculated for String columns.
* Added support for retrieving `Edge.properties` when retrieving lineage from `DGQL` in `DataFrame.lineage.trace`.
* Added a parameter `table_exists` to `DataFrameWriter.save_as_table` that allows specifying if a table already exists. This allows skipping a table lookup that can be expensive.

### Bug fixes

* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) where the `create_connection` defined as local function was incompatible with multiprocessing.
* Fixed a bug in `DataFrameReader.dbapi` (Private Preview) where Databricks `TIMESTAMP` type was converted to Snowflake `TIMESTAMP_NTZ` type which should be `TIMESTAMP_LTZ` type.
* Fixed a bug in `DataFrameReader.json` where repeated reads with the same reader object would create incorrectly quoted columns.
* Fixed a bug in `DataFrame.to_pandas()` that would drop column names when converting a DataFrame that did not originate from a select statement.
* Fixed a bug where `DataFrame.create_or_replace_dynamic_table` raises an error when the DataFrame contains a UDTF and `SELECT *` in the UDTF is not parsed correctly.
* Fixed a bug where casted columns could not be used in the values clause of functions.

### Improvements

* Improved the error message for `Session.write_pandas()` and `Session.create_dataframe()` when the input pandas DataFrame does not have a column.
* Improved `DataFrame.select` when the arguments contain a table function with output columns that collide with columns of current DataFrame. With the improvement, if user provides non-colliding columns in `df.select("col1", "col2", table_func(...))` as string arguments, then the query generated by Snowpark client will not raise ambiguous column error.
* Improved `DataFrameReader.dbapi` (Private Preview) to use in-memory Parquet-based ingestion for better performance and security.
* Improved `DataFrameReader.dbapi` (Private Preview) to use `MATCH_BY_COLUMN_NAME=CASE_SENSITIVE` in copy into table operation.

### Snowpark Local testing Updates

#### New features

* Added support for snow URLs (`snow://`) in local file testing.

#### Bug fixes

* Fixed a bug in `Column.isin` that would cause incorrect filtering on joined or previously filtered data.
* Fixed a bug in `snowflake.snowpark.functions.concat_ws` that would cause results to have an incorrect index.

### Snowpark pandas API Updates

#### Dependency updates

* Updated `modin` dependency constraint from 0.32.0 to >=0.32.0, <0.34.0. The latest version tested with Snowpark pandas is `modin` 0.33.1.

#### New features

* Added support for **Hybrid Execution (Private Preview)**. By running `from modin.config import AutoSwitchBackend; AutoSwitchBackend.enable()`, pandas on Snowflake automatically chooses whether to run certain pandas operations locally or on Snowflake. This feature is disabled by default.

#### Improvements

* Set the default value of the `index` parameter to `False` for `DataFrame.to_view`, `Series.to_view`, `DataFrame.to_dynamic_table`, and `Series.to_dynamic_table`.
* Added `iceberg_version` option to table creation functions.
* Reduced query count for many operations, including `insert`, `repr`, and `groupby`, that previously issued a query to retrieve the input data’s size.

#### Bug fixes

* Fixed a bug in `Series.where` when the `other` parameter is an unnamed `Series`.

## Version 1.32.0 (2025-05-15)

### Improvements

* Invoking Snowflake system procedures does not invoke an additional `describe procedure` call to check the return type of the procedure.
* Added support for `Session.create_dataframe()` with the stage URL and `FILE` data type.
* Added support for different modes for dealing with corrupt XML records when reading an XML file using `session.read.option('mode', <mode>), option('rowTag', <tag_name>).xml(<stage_file_path>)`. Currently `PERMISSIVE`, `DROPMALFORMED` and `FAILFAST` are supported.
* Improved the error message of the XML reader when the specified `ROWTAG` is not found in the file.
* Improved query generation for `Dataframe.drop` to use `SELECT * EXCLUDE ()` to exclude the dropped columns. To enable this feature, set `session.conf.set("use_simplified_query_generation", True)`.
* Added support for `VariantType` to `StructType.from_json`.

### Bug fixes

* Fixed a bug in `DataFrameWriter.dbapi` (Private preview) where unicode or double-quoted column names in external databases cause errors because they are not quoted correctly.
* Fixed a bug where named fields in nested `OBJECT` data could cause errors when containing spaces.

### Snowpark local testing updates

#### Bug fixes

* Fixed a bug in `snowflake.snowpark.functions.rank` that would not respect sort direction.
* Fixed a bug in `snowflake.snowpark.functions.to_timestamp_*` that would cause incorrect results on filtered data.

### Snowpark pandas API Updates

#### New features

* Added support for dict values in `Series.str.get`, `Series.str.slice`, and `Series.str.__getitem__` (`Series.str[...]`).
* Added support for `DataFrame.to_html`.
* Added support for `DataFrame.to_string` and `Series.to_string`.
* Added support for reading files from S3 buckets using `pd.read_csv`.

#### Improvements

* Make `iceberg_config` a required parameter for `DataFrame.to_iceberg` and `Series.to_iceberg`.

## Version 1.31.0 (2025-04-24)

### New features

* Added support for the `restricted caller` permission of `execute_as` argument in `StoredProcedure.register()`.
* Added support for non-select statements in `DataFrame.to_pandas()`.
* Added support for the `artifact_repository` parameter to `Session.add_packages`, `Session.add_requirements`, `Session.get_packages`, `Session.remove_package`, and `Session.clear_packages`.
* Added support for reading an XML file using a row tag by `session.read.option('rowTag', <tag_name>).xml(<stage_file_path>)` (experimental).

  + Each XML record is extracted as a separate row.
  + Each field within that record becomes a separate column of type `VARIANT`, which can be further queried using the dot notation, such as `col(a.b.c)`.
* Added updates to `DataFrameReader.dbapi` (PrPr):

  + Added the `fetch_merge_count` parameter for optimizing performance by merging multiple fetched data into a single Parquet file.
  + Added support for Databricks.
  + Added support for ingestion with Snowflake UDTF.
* Added support for the following AI-powered functions in `functions.py` (Private Preview):

  + `prompt`
  + `ai_filter` (added support for `prompt()` function and image files, and changed the second argument name from `expr` to `file`)
  + `ai_classify`

#### Improvements

* Renamed the `relaxed_ordering` param into `enforce_ordering` for `DataFrame.to_snowpark_pandas`. Also the new default values is `enforce_ordering=False` which has the opposite effect of the previous default value, `relaxed_ordering=False`.
* Improved `DataFrameReader.dbapi` (PrPr) reading performance by setting the default `fetch_size` parameter value to 1000.
* Improve the error message for invalid identifier SQL error by suggesting the potentially matching identifiers.
* Reduced the number of describe queries issued when creating a DataFrame from a Snowflake table using `session.table`.
* Improved performance and accuracy of `DataFrameAnalyticsFunctions.time_series_agg()`.

#### Bug fixes

* Fixed a bug in `DataFrame.group_by().pivot().agg` when the pivot column and aggregate column are the same.
* Fixed a bug in `DataFrameReader.dbapi` (PrPr) where a `TypeError` was raised when `create_connection` returned a connection object of an unsupported driver type.
* Fixed a bug where `df.limit(0)` call would not properly apply.
* Fixed a bug in `DataFrameWriter.save_as_table` that caused reserved names to throw errors when using append mode.

#### Deprecations

* Deprecated support for Python3.8.
* Deprecated argument `sliding_interval` in `DataFrameAnalyticsFunctions.time_series_agg()`.

### Snowpark local testing updates

#### New features

* Added support for Interval expression to `Window.range_between`.
* Added support for `array_construct` function.

#### Bug fixes

* Fixed a bug in local testing where transient `__pycache__` directory was unintentionally copied during stored procedure execution via import.
* Fixed a bug in local testing that created incorrect result for `Column.like` calls.
* Fixed a bug in local testing that caused `Column.getItem` and `snowpark.snowflake.functions.get` to raise `IndexError` rather than return `null`.
* Fixed a bug in local testing where `df.limit(0)` call would not properly apply.
* Fixed a bug in local testing where a `Table.merge` into an empty table would cause an exception.

### Snowpark pandas API updates

#### Dependency updates

* Updated `modin` from 0.30.1 to 0.32.0.
* Added support for `numpy` 2.0 and above.

#### New features

* Added support for `DataFrame.create_or_replace_view` and `Series.create_or_replace_view`.
* Added support for `DataFrame.create_or_replace_dynamic_table` and `Series.create_or_replace_dynamic_table`.
* Added support for `DataFrame.to_view` and `Series.to_view`.
* Added support for `DataFrame.to_dynamic_table` and `Series.to_dynamic_table`.
* Added support for `DataFrame.groupby.resample` for aggregations `max`, `mean`, `median`, `min`, and `sum`.
* Added support for reading stage files using:

  + `pd.read_excel`
  + `pd.read_html`
  + `pd.read_pickle`
  + `pd.read_sas`
  + `pd.read_xml`
* Added support for `DataFrame.to_iceberg` and `Series.to_iceberg`.
* Added support for dict values in `Series.str.len`.

#### Improvements

* Improve the performance of `DataFrame.groupby.apply` and `Series.groupby.apply` by avoiding expensive pivot step.
* Added an estimate for the row count upper bound to `OrderedDataFrame` to enable better engine switching. This could potentially result in increased query counts.
* Renamed the `relaxed_ordering` parameter in `enforce_ordering` with `pd.read_snowflake`. Also the new default value is `enforce_ordering=False` which has the opposite effect of the previous default value, `relaxed_ordering=False`.

#### Bug fixes

* Fixed a bug for `pd.read_snowflake` when reading iceberg tables and `enforce_ordering=True`.

## Version 1.30.0 (2025-03-27)

### New features

* Added Support for relaxed consistency and ordering guarantees in `Dataframe.to_snowpark_pandas` by introducing the `relaxed_ordering` parameter.
* `DataFrameReader.dbapi` (preview) now accepts a list of strings for the `session_init_statement` parameter, allowing multiple SQL statements to be executed during session initialization.

### Improvements

* Improved query generation for `Dataframe.stat.sample_by` to generate a single flat query that scales well with large `fractions` dictionary compared to older method of creating a UNION ALL subquery for each key in `fractions`. To enable this feature, set `session.conf.set("use_simplified_query_generation", True)`.
* Improved the performance of `DataFrameReader.dbapi` by enabling the vectorized option when copying a parquet file into a table.
* Improved query generation for `DataFrame.random_split` in the following ways. They can be enabled by setting `session.conf.set("use_simplified_query_generation", True)`:

  + Removed the need to `cache_result` in the internal implementation of the input dataframe resulting in a pure lazy dataframe operation.
  + The `seed` argument now behaves as expected with repeatable results across multiple calls and sessions.
* `DataFrame.fillna` and `DataFrame.replace` now both support fitting `int` and `float` into `Decimal` columns if `include_decimal` is set to `True`.
* Added documentation for the following UDF and stored procedure functions in `files.py` as a result of their General Availability.

  + `SnowflakeFile.write`
  + `SnowflakeFile.writelines`
  + `SnowflakeFile.writeable`
* Minor documentation changes for `SnowflakeFile` and `SnowflakeFile.open()`.

### Bug fixes

* Fixed a bug for the following functions that raised errors. `.cast()` is applied to their output:

  + `from_json`
  + `size`

### Snowpark local testing updates

#### Bug fixes

* Fixed a bug in aggregation that caused empty groups to still produce rows.
* Fixed a bug in `Dataframe.except_` that would cause rows to be incorrectly dropped.
* Fixed a bug that caused `to_timestamp` to fail when casting filtered columns.

### Snowpark pandas API updates

#### New features

* Added support for list values in `Series.str.__getitem__` (`Series.str[...]`).
* Added support for `pd.Grouper` objects in GROUP BY operations. When `freq` is specified, the default values of the `sort`, `closed`, `label`, and `convention` arguments are supported; `origin` is supported when it is `start` or `start_day`.
* Added support for relaxed consistency and ordering guarantees in `pd.read_snowflake` for both named data sources (for example, tables and views) and query data sources by introducing the new parameter `relaxed_ordering`.

#### Improvements

* Raise a warning whenever `QUOTED_IDENTIFIERS_IGNORE_CASE` is found to be set, ask user to unset it.
* Improved how a missing `index_label` in `DataFrame.to_snowflake` and `Series.to_snowflake` is handled when `index=True`. Instead of raising a `ValueError`, system-defined labels are used for the index columns.
* Improved the error message for `groupby`, `DataFrame`, or `Series.agg` when the function name is not supported.

### Snowpark local testing updates

#### Improvements

* Raise a warning whenever `QUOTED_IDENTIFIERS_IGNORE_CASE` is found to be set, ask user to unset it.
* Improved how a missing `index_label` in `DataFrame.to_snowflake` and `Series.to_snowflake` is handled when `index=True`. Instead of raising a `ValueError`, system-defined labels are used for the index columns.
* Improved error message for `groupby or DataFrame or Series.agg` when the function name is not supported.

## Version 1.29.1 (2025-03-12)

### Bug fixes

* Fixed a bug in `DataFrameReader.dbapi` (private preview) that prevents usage in stored procedures and Snowbooks.

## Version 1.29.0 (2025-03-05)

### New features

* Added support for the following AI-powered functions in `functions.py` (Private Preview):

  > + `ai_filter`
  > + `ai_agg`
  > + `summarize_agg`

> * Added support for the new FILE SQL type, with the following related functions in `functions.py` (Private Preview):
>
>   + `fl_get_content_type`
>   + `fl_get_etag`
>   + `fl_get_file_type`
>   + `fl_get_last_modified`
>   + `fl_get_relative_path`
>   + `fl_get_scoped_file_url`
>   + `fl_get_size`
>   + `fl_get_stage`
>   + `fl_get_stage_file_url`
>   + `fl_is_audio`
>   + `fl_is_compressed`
>   + `fl_is_document`
>   + `fl_is_image`
>   + `fl_is_video`
> * Added support for importing third-party packages from PyPi using Artifact Repository (Private Preview):
>
>   + Use keyword arguments `artifact_repository` and `packages` to specify your artifact repository and packages respectively when registering stored procedures or user defined functions.
>   + Supported APIs are:
>
>     - `Session.sproc.register`
>     - `Session.udf.register`
>     - `Session.udaf.register`
>     - `Session.udtf.register`
>     - `functions.sproc`
>     - `functions.udf`
>     - `functions.udaf`
>     - `functions.udtf`
>     - `functions.pandas_udf`
>     - `functions.pandas_udtf`

### Improvements

> * Improved version validation warnings for `snowflake-snowpark-python` package compatibility when registering stored procedures. Now, warnings are only triggered if the major or minor version does not match, while bugfix version differences no longer generate warnings.
> * Bumped cloudpickle dependency to also support `cloudpickle==3.0.0` in addition to previous versions.

### Bug fixes

> * Fixed a bug where creating a Dataframe with large number of values raised `Unsupported feature 'SCOPED_TEMPORARY'.` error if thread-safe session was disabled.
> * Fixed a bug where `df.describe` raised internal SQL execution error when the DataFrame is created from reading a stage file and CTE optimization is enabled.
> * Fixed a bug where `df.order_by(A).select(B).distinct()` would generate invalid SQL when simplified query generation was enabled using `session.conf.set("use_simplified_query_generation", True)`.
>
>   > + Disabled simplified query generation by default.

### Snowpark pandas API updates

#### Improvements

> * Improve error message for `pd.to_snowflake`, `DataFrame.to_snowflake`, and `Series.to_snowflake` when the table does not exist.
> * Improve readability of docstring for the `if_exists` parameter in `pd.to_snowflake`, `DataFrame.to_snowflake`, and `Series.to_snowflake`.
> * Improve error message for all pandas functions that use UDFs with Snowpark objects.

#### Bug fixes

> * Fixed a bug in `Series.rename_axis` where an `AttributeError` was being raised.
> * Fixed a bug where `pd.get_dummies` didn’t ignore NULL/NaN values by default.
> * Fixed a bug where repeated calls to `pd.get_dummies` results in ‘Duplicated column name error’.
> * Fixed a bug in `pd.get_dummies` where passing list of columns generated incorrect column labels in output DataFrame.
> * Update `pd.get_dummies` to return bool values instead of int.

### Snowpark local testing updates

#### New features

* Added support for literal values to `range_between` window function.

## Version 1.28.0 (2025-02-20)

### New features

* Added support for the following functions in `functions.py`

  + `normal`
  + `randn`
* Added support for `allow_missing_columns` parameter to `Dataframe.union_by_name` and `Dataframe.union_all_by_name`.

### Improvements

* Improved random object name generation to avoid collisions.
* Improved query generation for `Dataframe.distinct` to generate SELECT DISTINCT instead of SELECT with GROUP BY all columns. To disable this feature, set `session.conf.set("use_simplified_query_generation", False)`.

### Deprecations

* Deprecated Snowpark Python function `snowflake_cortex_summarize`. Users can install `snowflake-ml-python` and use the `snowflake.cortex.summarize` function instead.
* Deprecated Snowpark Python function `snowflake_cortex_sentiment`. Users can install `snowflake-ml-python` and use the `snowflake.cortex.sentiment` function instead.

### Bug fixes

* Fixed a bug where session-level query tag was overwritten by a stack trace for DataFrames that generate multiple queries. Now, the query tag will only be set to the stacktrace if `session.conf.set("collect_stacktrace_in_query_tag", True)`.
* Fixed a bug in `Session._write_pandas` where it was erroneously passing `use_logical_type` parameter to `Session._write_modin_pandas_helper` when writing a Snowpark pandas object.
* Fixed a bug in options SQL generation that could cause multiple values to be formatted incorrectly.
* Fixed a bug in `Session.catalog` where empty strings for database or schema were not handled correctly and were generating erroneous SQL statements.

### Experimental Features

* Added support for writing pyarrow Tables to Snowflake tables.

### Snowpark pandas API updates

#### New features

* Added support for applying Snowflake Cortex functions `Summarize` and `Sentiment`.
* Added support for list values in `Series.str.get`.

### Bug fixes

* Fixed a bug in `apply` where kwargs were not being correctly passed into the applied function.

### Snowpark local testing updates

#### New features

* Added support for the following functions
  :   + `hour`
      + `minute`
* Added support for NULL_IF parameter to CSV reader.
* Added support for `date_format`, `datetime_format`, and `timestamp_format` options when loading CSVs.

### Bug fixes

* Fixed a bug in `DataFrame.join` that caused columns to have incorrect typing.
* Fixed a bug in `when` statements that caused incorrect results in the `otherwise` clause.

## Version 1.27.0 (2025-02-05)

### New features

Added support for the following functions in `functions.py`:

* `array_reverse`
* `divnull`
* `map_cat`
* `map_contains_key`
* `map_keys`
* `nullifzero`
* `snowflake_cortex_sentiment`
* `acosh`
* `asinh`
* `atanh`
* `bit_length`
* `bitmap_bit_position`
* `bitmap_bucket_number`
* `bitmap_construct_agg`
* `cbrt`
* `equal_null`
* `from_json`
* `ifnull`
* `localtimestamp`
* `max_by`
* `min_by`
* `nth_value`
* `nvl`
* `octet_length`
* `position`
* `regr_avgx`
* `regr_avgy`
* `regr_count`
* `regr_intercept`
* `regr_r2`
* `regr_slope`
* `regr_sxx`
* `regr_sxy`
* `regr_syy`
* `try_to_binary`
* `base64`
* `base64_decode_string`
* `base64_encode`
* `editdistance`
* `hex`
* `hex_encode`
* `instr`
* `log1p`
* `log2`
* `log10`
* `percentile_approx`
* `unbase64`
* Added support for specifying a schema string (including implicit struct syntax) when calling `DataFrame.create_dataframe`.
* Added support for `DataFrameWriter.insert_into/insertInto`. This method also supports local testing mode.
* Added support for `DataFrame.create_temp_view` to create a temporary view. It will fail if the view already exists.
* Added support for multiple columns in the functions `map_cat` and `map_concat`.
* Added an option `keep_column_order` for keeping original column order in `DataFrame.with_column` and `DataFrame.with_columns`.
* Added options to column casts that allow renaming or adding fields in `StructType` columns.
* Added support for `contains_null parameter` to `ArrayType`.
* Added support for creating a temporary view via `DataFrame.create_or_replace_temp_view` from a DataFrame created by reading a file from a stage.
* Added support for `value_contains_null` parameter to `MapType`.
* Added interactive to telemetry that indicates whether the current environment is an interactive one.
* Allow `session.file.get` in a Native App to read file paths starting with / from the current version
* Added support for multiple aggregation functions after `DataFrame.pivot`.

### Experimental features

* Added `Session.catalog` class to manage Snowflake objects. It can be accessed via `Session.catalog`.

  > + `snowflake.core` is a dependency required for this feature.
* Allow user input schema or user input schemas when reading JSON file on stage.
* Added support for specifying a schema string (including implicit struct syntax) when calling `DataFrame.create_dataframe`.

### Improvements

* Updated `README.md` to include instructions on how to verify package signatures using `cosign`.

### Bug fixes

* Fixed a bug in local testing mode that caused a column to contain None when it should contain 0.
* Fixed a bug in `StructField.from_json` that prevented `TimestampTypes` with `tzinfo` from being parsed correctly.
* Fixed a bug in `function date_format` that caused an error when the input column was date type or timestamp type.
* Fixed a bug in DataFrame that allowed null values to be inserted in a non-nullable column.
* Fixed a bug in functions `replace` and `lit` which raised type hint assertion error when passing Column expression objects.
* Fixed a bug in `pandas_udf` and `pandas_udtf` where session parameters were erroneously ignored.
* Fixed a bug that raised an incorrect type conversion error for system function called through `session.call`.

### Snowpark pandas API updates

#### New features

* Added support for `Series.str.ljust` and `Series.str.rjust`.
* Added support for `Series.str.center`.
* Added support for `Series.str.pad`.
* Added support for applying the Snowpark Python function `snowflake_cortex_sentiment`.
* Added support for `DataFrame.map`.
* Added support for `DataFrame.from_dict` and `DataFrame.from_records`.
* Added support for mixed case field names in struct type columns.
* Added support for `SeriesGroupBy.unique`
* Added support for `Series.dt.strftime` with the following directives:

  + %d: Day of the month as a zero-padded decimal number.
  + %m: Month as a zero-padded decimal number.
  + %Y: Year with century as a decimal number.
  + %H: Hour (24-hour clock) as a zero-padded decimal number.
  + %M: Minute as a zero-padded decimal number.
  + %S: Second as a zero-padded decimal number.
  + %f: Microsecond as a decimal number, zero-padded to 6 digits.
  + %j: Day of the year as a zero-padded decimal number.
  + %X: Locale’s appropriate time representation.
  + %%: A literal ‘%’ character.
* Added support for `Series.between`.
* Added support for `include_groups=False` in `DataFrameGroupBy.apply`.
* Added support for `expand=True` in `Series.str.split`.
* Added support for `DataFrame.pop` and `Series.pop`.
* Added support for `first` and `last` in `DataFrameGroupBy.agg` and `SeriesGroupBy.agg`.
* Added support for `Index.drop_duplicates`.
* Added support for aggregations `"count"`, `"median"`, `np.median`,
  `"skew"`, `"std"`, `np.std` `"var"`, and `np.var` in
  `pd.pivot_table()`, `DataFrame.pivot_table()`, and `pd.crosstab()`.

#### Improvements

* Improved performance of `DataFrame.map`, `Series.apply` and `Series.map` methods by mapping numpy functions to Snowpark functions if possible.
* Added documentation for `DataFrame.map`.
* Improved performance of `DataFrame.apply` by mapping numpy functions to Snowpark functions if possible.
* Added documentation on the extent of Snowpark pandas interoperability with scikit-learn.
* Infer return type of functions in `Series.map`, `Series.apply` and `DataFrame.map` if type-hint is not provided.
* Added `call_count` to telemetry that counts method calls including interchange protocol calls.

---
title: Snowpark Library for Python release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-python-2026.md
section: Release Notes
---

# Snowpark Library for Python release notes for 2026

This article contains the release notes for the [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for [Snowpark Library for Python](../../developer-guide/snowpark/python/index.md) updates.

See [Snowpark Developer Guide for Python](../../developer-guide/snowpark/python/index.md) for documentation.

> **Warning:**
>
> Because Python 3.8 has reached its [End of Life](https://devguide.python.org/versions/), deprecation warnings will be triggered when you use `snowpark-python` with Python 3.8. For more information, see [Snowflake Python Runtime Support](../../developer-guide/python-runtime-support-policy.md). Snowpark Python 1.24.0 will be the last client and server version to support Python 3.8, in accordance with [Anaconda’s policy](https://forum.anaconda.com/t/python-3-8-reaches-end-of-life/87265). Upgrade your existing Python 3.8 objects to Python 3.9 or later.

## Version 1.47.0: Mar 05, 2026

### New features

* Added support for the `array_union_agg` function in the `snowflake.snowpark.functions` module.

### Bug fixes

* Fixed a bug where `Session.udf.register_from_file` did not properly process the `strict` and `secure` parameters.
* Fixed a bug where an error is raised on a string value in `DecimalType` column when creating a DataFrame with small data (< array binding threshold).
* Reverted the following improvements introduced in 1.46.0 as they caused unintended breaking changes in some query patterns:

  > + Reduced the size of queries generated by certain `DataFrame.join` operations.
  > + Removed redundant aliases in generated queries (for example, `SELECT "A" AS "A"` is now always simplified to `SELECT "A"`).

## Version 1.46.0: Feb 25, 2026

### New features

* Added support for the `DECFLOAT` data type that allows users to represent decimal numbers exactly with 38 digits of precision and a dynamic base-10 exponent.
* Added support for the `DEFAULT_PYTHON_ARTIFACT_REPOSITORY` parameter that allows users to configure the default artifact repository at the account, database, and schema level.

### Bug fixes

* Fixed a bug where `cloudpickle` was not automatically added to the package list when using `artifact_repository` with custom packages, causing `ModuleNotFoundError` at runtime.
* Fixed a bug when reading xml with custom schema that results in element attributes that are included when a column is not of the `StructType` type.
* Fixed a bug where `Session.udf.register_from_file` did not properly process the `strict` and `secure` parameters.

### Improvements

* Reduced the size of queries generated by certain `DataFrame.join` operations.
* Removed redundant aliases in generated queries (for example, `SELECT "A" AS "A"` is now always simplified to `SELECT "A"`).

## Version 1.45.0: Jan 26, 2026

### New features

* Allow user input schema when reading an XML file on a stage.
* Added support for the following functions in `functions.py`:

  > + `hex_decode_string`
  > + `jarowinkler_similarity`
  > + `parse_url`
  > + `regexp_instr`
  > + `regexp_like`
  > + `regexp_substr`
  > + `regexp_substr_all`
  > + `rtrimmed_length`
  > + `space`
  > + `split_part`
* Added the `preserve_parameter_names` flag to stored procedure, UDF, UDTF, and UDAF creation.

### Bug fixes

* Fixed a bug where `opentelemetry` is not correctly imported when using `Session.client_telemetry.enable_event_table_telemetry_collection`.

### Improvements

* `snowflake.snowpark.context.configure_development_features` is effective for multiple sessions including newly created sessions after the configuration. There is no longer a duplicate experimental warning.
* Removed the experimental warning from `DataFrame.to_arrow` and `DataFrame.to_arrow_batches`.
* When both `Session.reduce_describe_query_enabled` and `Session.cte_optimization_enabled` are enabled, fewer `DESCRIBE` queries are issued when resolving a table schema.

---
title: Snowpark Library for Scala and Java release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-scala-java.md
section: Release Notes
---

# Snowpark Library for Scala and Java release notes

The release notes for
the [Snowpark Library for Scala](../../developer-guide/snowpark/scala/index.md)
and [Snowpark Library for Java](../../developer-guide/snowpark/java/index.md) provide details for each release,
including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2025 releases](snowpark-scala-java-2025.md)
* [2024 releases](snowpark-scala-java-2024.md)
* [2023 releases](snowpark-scala-java-2023.md)
* [2022 releases](snowpark-scala-java-2022.md)

See [Snowpark Developer Guide for Java](../../developer-guide/snowpark/java/index.md) and [Snowpark Developer Guide for Scala](../../developer-guide/snowpark/scala/index.md) for documentation.

---
title: Snowpark Library for Scala and Java release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-scala-java-2022.md
section: Release Notes
---

# Snowpark Library for Scala and Java release notes for 2022

This article contains the release notes for
the [Snowpark Library for Scala](../../developer-guide/snowpark/scala/index.md)
and [Snowpark Library for Java](../../developer-guide/snowpark/java/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpark Library for Scala and Java updates.

See [Snowpark Developer Guide for Java](../../developer-guide/snowpark/java/index.md) and [Snowpark Developer Guide for Scala](../../developer-guide/snowpark/scala/index.md) for documentation.

## Version 1.6.2 (October 26, 2022)

Compatible Snowflake release: 6.35.x

### Improvements

* Made internal improvements for stored procedures written in Java or Scala.

## Version 1.6.1 (September 30, 2022)

Compatible Snowflake release: 6.31.x

This version has a known issue which might break temp object creation. Please use 1.6.2 instead.

### Improvements

* Made internal improvements for stored procedures written in Java or Scala.

## Version 1.6.0 (August 12, 2022)

Compatible Snowflake release: 6.27.x

### Improvements

* Made internal improvements to UDTFs.

## Version 1.5.0 (July 1, 2022)

Compatible Snowflake release: 6.22.x

### New features

* Added support for writing DataFrames to files on a stage to the
  [Scala API](../../developer-guide/snowpark/scala/working-with-dataframes.md)
  and [Java API](../../developer-guide/snowpark/java/working-with-dataframes.md).

### Improvements

* Optimized the SQL queries generated by the Snowpark client library.
* Improved the error message that is logged when the Snowpark library fails to resolve a column name in
* a DataFrame (e.g. when you attempt to access a column that does not exist).

## Version 1.4.1 (May 26, 2022)

Compatible Snowflake release: 6.17.x

### Changes

* Updated the version of `jackson-core` and `jackson-annotations` that the Snowpark library depends on to 2.13.2.
* Updated the version of `jackson-databind` that the Snowpark library depends on to 2.13.2.2.
* Removed the `jackson-core`, `jackson-databind`, and `jackson-annotations` classes from Snowpark JAR file.

  If you downloaded the `.tar.gz` / `.zip` file, the JAR files for the Jackson classes are now provided
  separately in the `lib/` subdirectory (`jackson-core-2.13.2.jar`, `jackson-databind-2.13.2.2.jar`,
  and `jackson-annotations-2.13.2.jar`).

  If you are specifying the Snowpark library as a dependency in your `pom.xml` file and you want to depend on a
  **different version** of the Jackson libraries in your pom.xml, you can
  [exclude the dependency on the Jackson libraries from the Snowpark library dependency](https://maven.apache.org/guides/introduction/introduction-to-optional-and-excludes-dependencies.html#dependency-exclusions).

## Version 1.4.0 (April 28, 2022)

Compatible Snowflake release: 6.14.x

### New features

* Made the Snowpark Java API generally available on AWS and Azure.
* The API is still available as a preview feature in GCS.
* Made the Snowpark Scala API generally available on Azure.

  Prior to this release, the API was only generally available on AWS. The API is still available as a preview feature on GCS.
* Added a [Java API for creating UDTFs](../../developer-guide/snowpark/java/creating-udfs.md). Note that this is a preview feature.
* Added new APIs in [Scala](../../developer-guide/snowpark/scala/working-with-dataframes.md) and
  [Java](../../developer-guide/snowpark/java/working-with-dataframes.md) for uploading and downloading data
  from a stage (`FileOperation.uploadStream and FileOperation.downloadStream`).
* Added the `DataFrameWriter.option` method in [Scala](../../developer-guide/snowpark/scala/working-with-dataframes.md) and
  [Java](../../developer-guide/snowpark/java/working-with-dataframes.md) for specifying how values in columns in the
  DataFrame should be mapped to columns in the table. The option method allows you to specify that
  the `DataFrameWriter` should use the column name, rather than the column order.

### Improvements

* Disabled the Closure Cleaner in Java sessions. The Closure Cleaner only works in Scala programs.
* Improved `Array` and `Map` support in the Java Row API.

## Version 1.3.0 (March 18, 2022)

Compatible Snowflake release: 6.8.x

### New features

* Added support for [writing stored procedures](../../developer-guide/stored-procedure/java/procedure-java-overview.md) in Java .
* Added support for [asynchronously merging rows into a table](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/MergeBuilder.html#async:com.snowflake.snowpark.MergeBuilderAsyncActor)
  in Scala.

## Version 1.2.0 (March 2, 2022)

Compatible Snowflake release: 6.5.x

### New features

* Added the [Java API for Snowpark](../../developer-guide/snowpark/java/index.md).
* Added preview support in the Scala API for [creating UDTFs](../../developer-guide/snowpark/scala/creating-udfs.md).
* Added a separate version of the library that complies with the security requirements of FIPS (Federal Information Processing Standard). You can download this library from:

  + [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark-fips/java/1.2.0/index.html).
  + [Maven Central Repository](https://search.maven.org/artifact/com.snowflake/snowpark-fips/1.2.0/jar).

  To point to the FIPS-compliant library from an sbt build file or Maven project, use `snowpark-fips` as the `artifactId`.

## Version 1.1.0 (February 4, 2022)

Compatible Snowflake release: 6.2.x

Added support for [Writing Scala handlers for stored procedures created with SQL](../../developer-guide/stored-procedure/scala/procedure-scala-overview.md).

The API reference for this release is available in
the [Snowflake documentation](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/index.html)
and in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/1.1.0/index.html).

## Version 1.0.0 (January 26, 2022)

Compatible Snowflake release: 6.1.x

General availability (GA) release on AWS. (Snowpark is still a preview feature on Azure and GCP.)

The API reference for this release is available in a `.zip` or `.tar.gz` file in
the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/1.0.0/index.html).

## Version 0.12.0 (January 4, 2022)

Compatible Snowflake release: 5.45.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in
the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.12.0/index.html).

### New features

* Added the [listagg](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#listagg(col:com.snowflake.snowpark.Column,delimiter:String):com.snowflake.snowpark.Column) function to the `functions` object.
* Added support for UDFs with 11 to 22 arguments.
* Added the [any_value](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/RelationalGroupedDataFrame.html#any_value(cols:com.snowflake.snowpark.Column*):com.snowflake.snowpark.DataFrame) function to the `RelationalGroupedDataFrame` class.

### Improvements

* In the generated code for UDFs, replaced a static code block with an object instance function.
* Reorganized error messages.
* Changed the `saveAsTable` function so that a new table is not created in Append mode.
* Improved the `callUDF` function to support any type of argument.
* Changed the library to set the query tag at the statement level, rather than at the session level.

## Version 0.11.0 (November 16, 2021)

Compatible Snowflake release: 5.45.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.11.0/index.html).

### New features

* Added the [generator](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/Session.html#generator(rowCount:Long,col:com.snowflake.snowpark.Column,cols:com.snowflake.snowpark.Column*):com.snowflake.snowpark.DataFrame)
  method to the `Session` class and the
  [seq1](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#seq1(startsFromZero:Boolean):com.snowflake.snowpark.Column),
  [seq2](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#seq2(startsFromZero:Boolean):com.snowflake.snowpark.Column),
  [seq4](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#seq4(startsFromZero:Boolean):com.snowflake.snowpark.Column),
  [seq8](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#seq8(startsFromZero:Boolean):com.snowflake.snowpark.Column),
  and [uniform](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#uniform(min:com.snowflake.snowpark.Column,max:com.snowflake.snowpark.Column,gen:com.snowflake.snowpark.Column):com.snowflake.snowpark.Column)
  functions to the functions object.
* Added the [getSessionInfo](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#uniform(min:com.snowflake.snowpark.Column,max:com.snowflake.snowpark.Column,gen:com.snowflake.snowpark.Column):com.snowflake.snowpark.Column) method to the Session class.
* Added APIs for [performing actions on DataFrames asynchronously](../../developer-guide/snowpark/scala/working-with-dataframes.md).

### Improvements

Upgraded the Snowflake JDBC driver to 3.13.9.
Improved the error message reported when no current database is selected for use.

## Version 0.10.1 (October 27, 2021)

Compatible Snowflake release: 5.38.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.10.1/index.html).

### Bug fixes

* Fixed a problem with uploading files to a GCP stage where the wrong prefix was used.
* Fixed a problem in which a 403 HTTP response was returned when accessing a pre-signed URL for GCP.

## Version 0.10.0 (October 18, 2021)

Compatible Snowflake release: 5.37.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.10.0/index.html).

### New features

* Added the new method [dropDuplicates](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/DataFrame.html#dropDuplicates(colNames:String*):com.snowflake.snowpark.DataFrame) to the DataFrame class.
* Added support for in expressions to the Column class (with the [in](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/DataFrame.html#dropDuplicates(colNames:String*):com.snowflake.snowpark.DataFrame) method) and the functions object (with the in function).
* Extended the Iterator returned by [DataFrame.toLocalIterator](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#in(columns:Seq[com.snowflake.snowpark.Column],df:com.snowflake.snowpark.DataFrame):com.snowflake.snowpark.Column)
  to support the `Closeable` interface, which allows you to call the close method on the iterator.
* Added support for the new configuration property `snowpark_request_timeout_in_seconds`. You
  can [set this in the configuration map / file](../../developer-guide/snowpark/scala/creating-session.md)
  to adjust the timeout that the library uses when uploading dependencies to a stage. By default, the timeout is 86400 (1 day).

### Improvements

* Added logic to the [DataFrame.withColumns](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/DataFrame.html#withColumns(colNames:Seq[String],values:Seq[com.snowflake.snowpark.Column]):com.snowflake.snowpark.DataFrame) method to verify that duplicate input column names are not specified.
* Updated the `clone` methods in the `Copyable` and `Updatable` classes return correct DataFrame types.
* Added support for specifying the application ID
* by [setting the application JDBC property in the configuration map / file](../../developer-guide/snowpark/scala/creating-session.md).

### Behavior changes

* Removed APIs intended only for Java from the Scala API.
* Replaced the default logger log4j with SLF4J SimpleLogger.

### Bug fixes

* Updated the library to close unused statements automatically in order to reduce memory usage.
* Fixed the column order in the result of the `DataFrame.withColumns` method.

## Version 0.9.0 (September 20, 2021)

Compatible Snowflake release: 5.34.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.9.0/index.html).

### New features

* Added a new DataFrame subclass, [CopyableDataFrame](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/CopyableDataFrame.html), that you can use
  to [copy data from a staged file into a table](../../developer-guide/snowpark/scala/working-with-dataframes.md).
  This is equivalent to the [COPY INTO <table>](../../sql-reference/sql/copy-into-table.md) command.
* Added the new method [DataFrame.rename()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/DataFrame.html#rename(newName:String,col:com.snowflake.snowpark.Column):com.snowflake.snowpark.DataFrame) for renaming columns in a DataFrame.
* Added the new function [functions.iff()](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/functions$.html#iff(condition:com.snowflake.snowpark.Column,expr1:com.snowflake.snowpark.Column,expr2:com.snowflake.snowpark.Column):com.snowflake.snowpark.Column)
  for specifying an if-then-else expression. This is equivalent to the [IFF](../../sql-reference/functions/iff.md) function.
* Added new constructors for the [DecimalType](https://docs.snowflake.com/en/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/types/DecimalType.html) class.

### Behavior changes

* Changed the DataFrame.union() and DataFrame.unionByName() methods to use UNION, rather than UNION ALL.

### Bug fixes

* Fixed the error `SQL compilation error: Missing column specification` that could occur when the Snowpark library created a temporary view.

## Version 0.8.0 (August 9, 2021)

Compatible Snowflake release: 5.30.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.8.0/index.html).

### Improvements

* Refactored some internal code to remove some dependencies.

### Bug fixes

* Fixed an issue with BigDecimal literals in cases where scale might be larger than precision.
* Fixed an issue that could occur when performing multiple set operations (e.g. union, intersect, etc.).

## Version 0.7.0 (July 23, 2021)

Compatible Snowflake release: 5.29.x

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.7.0/index.html).

### New APIs

* Introduced the new Session.close() method. Call this method to close the Snowpark session, which cancels all running queries and prevents the subsequent use of this session to execute queries.
* Introduced the new Updatable class. Updatable extends the DataFrame class and provides additional table-related capabilities (e.g. the ability to update and delete values).
* The Session.table() method now returns an Updatable object, rather than a DataFrame object.
* Introduced new signatures for the registerTemporary methods in the UDFRegistration class. These signatures do not have a parameter for the name of the UDF, which means that you can use these to register an anonymous temporary UDF.

### API Changes

* As mentioned above, the `Session.table()` method now returns an `Updatable` object, which extends `DataFrame`.
* In the `Geography` class, removed support for formats other than GeoJSON. Now, `Geography` only supports the GeoJSON data format.

### Improvements

* Improved the `DataFrame.cacheResult()` method to reduce the possibility of “object already exists” errors.
* Improved some error messages.
* Added a new log message that prints out session information after you log in.

### Bug fixes

* Fixed an issue in which the `DataFrame.show()` method did not display binary data correctly.
* Fixed an error that occurred when getting the version number.

## Version 0.6.0 (June 14, 2021)

Compatible Snowflake release: 5.21.x

Preview release on AWS

The API reference for this release is available in a `.zip` or `.tar.gz` file in the [Snowflake Client Repository](https://sfc-repo.snowflakecomputing.com/snowpark/java/0.6.0/index.html).

### API Changes

In this release, the following methods in RelationalGroupedDataFrame now require an argument:

* avg
* max
* median
* min
* sum

In previous releases, if you called these methods without an argument, these methods were applied to all
numeric columns in the DataFrame. For example, for a DataFrame `df` with the columns `(a int, b string, c float)`,
calling `df.groupBy("a").max()` was equivalent to calling `df.groupBy("a").max(col("a"), col("c"))`.

With this release, calling these methods without an argument results in a `SnowparkClientException`.

## Version 0.5.0

### New features

* Added a maxWidth parameter to the DataFrame.show() method. You can use this parameter to adjust the number of characters printed in the output for each column.
* Added the Session.cancelAll() method, which you can use to cancel all running actions on this session.
* Added the DataFrame.toLocalIterator() method, which returns an iterator that you can use to retrieve data, row by row. You can use this rather than DataFrame.collect(), if you don’t want to load all of the data into memory at once.
* Added the median method to the RelationalGroupedDataFrame class.

### Improvements

* Improved the error message returned when an identifier is invalid.
* Enhanced the error checking to report an error when no database or schema name is specified.
* Added a performance improvement when inserting a large number of values in a table.
* Updated the library to consistently handle Snowflake object identifiers (table and view names). Now, all parameters that specify table or view names support the use of:

  + Short names (e.g. table_name and view_name)
  + Fully-qualified names (e.g. database.schema.table_name)
  + Multi-part identifiers (e.g. Seq(“database”, “schema”, “view_name”))
* Added a check to verify that the supported version of Scala is being used. The library will report error if the Scala version is not compatible.

### Bug fixes

* Fixed a problem with registering UDFs on Microsoft Windows.
* Fixed a problem with the order of results when using DataFrame.sort() with DataFrame.limit().
* Fixed Session.range() to generate a sequence of numbers without gaps.

## Version 0.4.1

In this version, you no longer need to specify a temporary schema or temporary database for Snowpark objects (the TEMP_SCHEMA and TEMP_DB settings). The Snowpark library automatically creates temporary versions of the objects needed.

### API Changes

Replaced the DataFrame.cache() method with the DataFrame.cacheResult() method.

The new method creates and returns a new DataFrame with the cached results and has no effect on the current DataFrame. As a result of this change, the DataFrame object is now immutable.

### New APIs

* Added the following new methods to the RelationalGroupedDataFrame class:

  + avg
  + max
* Added the following new methods to the DataFrame class:

  + groupByGroupingSets
  + clone
  + createOrReplaceTempView
* Added the following new functions to the functions object:

  + toScalar
* Added a Session.file object, which provides the following new methods for performing file operations:

  + get
  + put
* Made the following changes to the Session.createDataFrame method:

  + Added support for user-provided schemas.
  + Added support for specifying an array/map of variant/geography data.
  + Added support for Geography/Variant data types in UDFs.
* Added registerPermanent methods to the UDFRegistration class.

### Bug fixes

* Fixed a problem when the DataFrame column name contains quotation marks.
* Fixed a problem with the inability to escape data that contains backslashes, single quotes, and newline characters.
* Fixed a problem where UDF creation fails with the error message “code too larger”.
* Fixed a problem where the UDF closure failed to capture the value of a local string variable.
* Added the result schema for the following SQL clauses:

  + GRANT/REVOKE
  + DESCRIBE
  + CREATE
  + USE
* Fixed a problem when using Snowpark in Visual Studio Code with the Metals extension to create a UDF.

---
title: Snowpark Library for Scala and Java release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-scala-java-2023.md
section: Release Notes
---

# Snowpark Library for Scala and Java release notes for 2023

This article contains the release notes for
the [Snowpark Library for Scala](../../developer-guide/snowpark/scala/index.md)
and [Snowpark Library for Java](../../developer-guide/snowpark/java/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpark Library for Scala and Java updates.

See [Snowpark Developer Guide for Java](../../developer-guide/snowpark/java/index.md) and [Snowpark Developer Guide for Scala](../../developer-guide/snowpark/scala/index.md) for documentation.

## Version 1.9.0 (October 17, 2023)

Compatible Snowflake release: 7.36

### New features

* Supports `regexp_replace` function.
* Supports `PKCS#8` RSA private key.

### Improvements

* Upgraded Snowflake JDBC to 3.14.1.

### Bug fixes

* None.

## Version 1.8.0 (April 28, 2023)

Compatible Snowflake release: 7.14

### New features

* New APIs for creating and calling stored procedures.

  This release includes APIs for registering named permanent, named session-temporary, and anonymous session-temporary stored procedures. It also includes APIs for calling stored procedures, both registered in Snowflake and to run locally.

  For related APIs, refer to the following.

  + For Java: [com.snowflake.snowpark_java.SProcRegistration](https://docs.snowflake.com/developer-guide/snowpark/reference/java/com/snowflake/snowpark_java/SProcRegistration.html)
  + For Scala: [com.snowflake.snowpark.SProcRegistration](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/SProcRegistration.html)
* `Session.tableFunction` function now also works with `DataFrame` columns.

  Previously, the `Session.tableFunction` method only supported literal function arguments. With this release, you can
  specify `DataFrame` columns from a single frame as an argument. For more information, refer
  to [tableFunction](https://docs.snowflake.com/developer-guide/snowpark/reference/scala/com/snowflake/snowpark/Session.html#tableFunction(func:com.snowflake.snowpark.TableFunction,args:Seq[com.snowflake.snowpark.Column]):com.snowflake.snowpark.DataFrame)
  in the reference documentation.

  Note that all `DataFrame` columns used as arguments should come from the same `DataFrame`.

### Improvements

* Upgraded the Snowflake JDBC driver to 3.13.28.

### Bug fixes

* None.

## Version 1.7.2 (February 16, 2023)

Compatible Snowflake release: 7.13

### New features

* None.

### Improvements

* Updated `SnowflakeFile` class to the latest version.

### Bug fixes

* None.

## Version 1.7.1 (February 8, 2023)

Compatible Snowflake release: 7.6.x

### New features and Updates

* Improved an internal feature for stored procedure support.
* Updated the `SnowflakeFile` class to the latest version.

### Bug fixes

* None.

## Version 1.7.0 (January 7, 2023)

Compatible Snowflake release: 7.0.x

### New features

* Added methods that support PARTITION BY and ORDER By when joining a DataFrame with the output of a UDTF.

### Improvements

* Made more predictable the result when column heads are duplicated across joined DataFrames.
  As of this release, duplicated column names will be presented as found in the DataFrames that were joined.
  Previously, aliases were used for duplicated column heads. Aliases will still be used for duplicated column
  heads when the result of a join is saved to a table or cached – you should deduplicate before saving or caching.

### Behavior changes

* Changed from `int` to `long` the return value data type of methods that return a count of rows merged, updated, or
  deleted. For these methods, see the `MergeResult`, `UpdateResult`, and `DeleteResult` types.

---
title: Snowpark Library for Scala and Java release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-scala-java-2024.md
section: Release Notes
---

# Snowpark Library for Scala and Java release notes for 2024

This article contains the release notes for
the [Snowpark Library for Scala](../../developer-guide/snowpark/scala/index.md)
and [Snowpark Library for Java](../../developer-guide/snowpark/java/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpark Library for Scala and Java updates.

See [Snowpark Developer Guide for Java](../../developer-guide/snowpark/java/index.md) and [Snowpark Developer Guide for Scala](../../developer-guide/snowpark/scala/index.md) for documentation.

## Version 1.15.0 (December 18, 2024)

Compatible Snowflake release: 8.46

### New features

* New functions:

  + `months_between`
  + `instr`
  + `format_number`
  + `from_unix_timestamp`
  + `to_unix_timestamp`
  + `Row.getAs`
* Support for SQL bind in `Session.sql` function.

## Version 1.14.0 (September 4, 2024)

Compatible Snowflake release: 8.35

### New features

* Added support for reading structured types from Snowflake.
* Added the following new functions:

  + `Variant.asJsonNode`
  + `Functions.round`
  + `Functions.hex`
  + `Functions.unhex`
  + `Functions.shiftleft`
  + `Functions.shiftright`
  + `Functions.reverse`
  + `Functions.isnull`
  + `Functions.unix_timestamp`
  + `Functions.locate`
  + `Functions.ntile`
  + `Functions.radn`
  + `Functions.randn`
  + `Functions.regexp_extract`
  + `Functions.signum`
  + `Functions.sign`
  + `Functions.substring_index`
  + `Functions.collect_list`
  + `Functions.log10`
  + `Functions.log1p`
  + `Functions.base64`
  + `Functions.unbase64`
  + `Functions.expr`
  + `Functions.array`
  + `Functions.date_format`
  + `Functions.last`
  + `Functions.desc`
  + `Functions.asc`
  + `Functions.size`

### Improvements

None.

### Bug fixes

* Fixed incorrect time info in the Open Telemetry span
* Fix duplicated Open Telemetry span in the count action

## Version 1.13.2 (August 26, 2024)

Compatible Snowflake release: 8.31

### New features

None.

### Improvements

None.

### Bug fixes

* Fixed Jackson Scala module compatibility issue.

## Version 1.13.1 (August 21, 2024)

Compatible Snowflake release: 8.31

### New features

None.

### Improvements

None.

### Bug fixes

* When the session parameter `ERROR_ON_NONDETERMINISTIC_UPDATE` is set to `true`, calls to `session.table(...).update(...)`
  no longer report errors.

## Version 1.13.0 (August 1, 2024)

Compatible Snowflake release: 8.28

### New features

* Emit span in Java/Scala stored procedure. Support functions:

  + All action functions
  + Register UDF/UDTF/SProc
* Enable retrieving cloud provider tokens in the `SnowflakeSecrets` class.
* New functions:

  + `Session.updateQueryTag`
  + `functions.countDistinct`
  + `functions.max(String)`
  + `functions.min(String)`
  + `functions.mean(String)`

### Improvements

* App name in the session query tag is JSON format now.
* Upgraded SLF4J to 2.0.4
* Update documentation for `SnowflakeFile`

### Bug fixes

* Variant object can’t handle null value
* `DataFrame` alias doesn’t work in the JOIN condition

## Version 1.12.1 (May 13, 2024)

Compatible Snowflake release: 8.18

### New features

None.

### Improvements

None.

### Bug fixes

Fixed “Dataframe alias doesn’t work in the JOIN condition”.

## Version 1.12.0 (April 16, 2024)

Compatible Snowflake release: 8.14

### New features

* Support the `Geometry` data type.
* New function: `sum(String)`.
* Support setting an app name when creating a new session.

### Improvements

Added code examples for the `split` function in the API document.

### Bug fixes

None.

## Version 1.11.0 (April 1, 2024)

Compatible Snowflake release: 8.12

### New features

* Support Java 17 stored procedure

  + When registering a stored procedure, Snowpark automatically sets `runtime_version` to 17 if the client is running with JVM 17.

### Improvements

None.

### Bug fixes

None.

## Version 1.10.0 (February 9, 2024)

Compatible Snowflake release: 8.5

### New features

* Support Java 17.

  + Compatible with JVM 17.
  + When registering a UDF or UDTF, Snowpark automatically sets the `runtime_version` to `17` if the client is running with
    JVM 17.
* Support Dataframe alias.

  + You can use the `DataFrame.alias` function to assign DataFrames an alias for future reference.

    For example, you could use code such as the following:

    ```scala
    val df1 = df.alias("A")
    df1.join(df2).select(col("A.col"))
    ```

    This is equivalent to `df1.join(df2).select(df1("col"))`.
* Support for the `explode` function.
* You can invoke table functions in the `DataFrame.select` method.
* You can use table functions to read function arguments through the `TableFunction.apply` method.
* New session constructor `Session.getOrCreate`.

### Improvements

* Upgraded JDBC to version 3.14.4.
* New wrapper for `is_null` function.
* Upgrade Scala to version 2.12.18.

### Bug fixes

* Updated wrong license information.

---
title: Snowpark Library for Scala and Java release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpark-scala-java-2025.md
section: Release Notes
---

# Snowpark Library for Scala and Java release notes for 2025

This article contains the release notes for
the [Snowpark Library for Scala](../../developer-guide/snowpark/scala/index.md)
and [Snowpark Library for Java](../../developer-guide/snowpark/java/index.md), including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpark Library for Scala and Java updates.

See [Snowpark Developer Guide for Java](../../developer-guide/snowpark/java/index.md) and [Snowpark Developer Guide for Scala](../../developer-guide/snowpark/scala/index.md) for documentation.

## Version 1.18.0 (December 5, 2025)

### Improvements

* Add `functions.try_to_date` overload for format parameter.
* Add `functions.try_to_timestamp` overload for format parameter.
* Add `Column.cast` support for `Any` parameter type.
* Add `Column.equal_to` support for `Any` parameter type.
* Add `Column.not_equal` support for `Any` parameter type.
* Add `Column.gt` support for `Any` parameter type.
* Add `Column.lt` support for `Any` parameter type.
* Add `Column.leq` support for `Any` parameter type.
* Add `Column.geq` support for `Any` parameter type.
* Add `Column.equal_null` support for `Any` parameter type.
* Add `Column.plus` support for `Any` parameter type.
* Add `Column.minus` support for `Any` parameter type.
* Add `Column.multiply` support for `Any` parameter type.
* Add `Column.divide` support for `Any` parameter type.
* Add `Column.mod` support for `Any` parameter type.

## Version 1.17.0 (November 10, 2025)

Compatible Snowflake release: 9.32

### New features

Added the following new APIs:

* `DataFrame.isEmpty`
* `functions.try_to_timestamp`
* `functions.try_to_date`
* `functions.concat_ws_ignore_nulls`
* `functions.array_flatten`
* `Row.mkString` (with overloads for customizable separators and formatting options)
* `StructType.fieldNames` (alias for `StructType.names`)

### Improvements

* Support both Scala 2.12 and 2.13 (currently in public preview) from release 1.17.0 onwards.
* `functions.when` and `Column.when`, along with `Column.otherwise`, now accept any literal arguments (for example, `String`, `int`, `boolean`, or `null`) in addition to `Column` instances.
* Add `functions.substring` overload with support for start position and length arguments.
* Add `functions.lpad` overloads to pad with `String`, or `Array[Byte]`.
* Add `functions.rpad` overloads to pad with `String`, or `Array[Byte]`.
* Add `DataFrame.sort` overload with support for variadic arguments.
* Add `DataFrame.show` overloads with parameters to control truncation and number of displayed rows.

### Bug Fixes

None.

## Version 1.16.0 (June 30, 2025)

Compatible Snowflake release: 9.17

### New features

None.

### Improvements

* Upgraded Snowflake JDBC to 3.24.2.
* Added support for empty input `Seq` in `Column.in`.
* Added support for creating views from `Union` results.

### Bug Fixes

* Fixed a wrong order issue when merging a Dataframe.

---
title: Snowpark Python: Eliminate repeated subqueries in Snowpark-generated queries (Canceled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1995.md
section: Release Notes
---

# Snowpark Python: Eliminate repeated subqueries in Snowpark-generated queries (Canceled)

> **Attention:**
>
> This BCR is canceled and removed from the [2025_04 Bundle](../2025_04_bundle.md).

Repeated subquery elimination identifies identical sub-DataFrames within a query plan and employs Common Table Expressions (CTEs) to
construct the final query. Almost half of the queries with compilation times exceeding one second contain at least one redundant subquery.
The benefit of this optimization scales with the quantity and complexity of the identified duplicate subqueries.

* Diagnostic steps:

  + SQL compilation error on a Snowpark data pipeline that was running with no errors previously.
  + Wrong results generated due to bugs in SQL generation.
* Mitigation:

  + Downgrading to an older version of Snowpark client mitigates the issue.
  + Unsetting parameter `PYTHON_SNOWPARK_USE_CTE_OPTIMIZATION_VERSION` mitigates the issue.

    To be unaffected by this change, keep Snowpark-based workflows pinned to a Snowpark Python version that is lower than 1.31.1. For example, if you are using a Python stored procedure, set `PACKAGES=('snowflake-snowpark-python==1.30.0')` when creating the stored procedure. In the case of a Snowflake Notebook or Python worksheet, switch to a Snowpark Python version lower than 1.31.1.

To demonstrate the difference between old and new behavior, consider the following DataFrame transformations in Snowpark Python:

```python
df = session.table("test_table")
df1 = df.with_column("a", F.col("A") + 1).filter(df.a > 1)
df1 = df1.union_all(df1)

print(df1.queries["queries"][0])
```

Before the change:
:   Because the `union_all` above is using the same DataFrame `df1` twice, the generated SQL queries will repeat the underlying subquery twice:

    ```sqlexample
    ( SELECT * FROM ( SELECT "B", "C", ( "A" + 1 ) AS "A" FROM test_table )
      WHERE ( "A" > 1 ) )
    UNION ALL
    ( SELECT * FROM ( SELECT "B", "C", ( "A" + 1 ) AS "A" FROM test_table )
      WHERE ( "A" > 1 ) )
    ```

After the change:
:   The optimization will detect that `df1` is being used twice, will replace the subquery with a CTE expression, and then use that to build the query:

    ```sqlexample
    WITH SNOWPARK_TEMP_CTE_7G3ZFVJYBK AS
      ( SELECT * FROM ( SELECT "B", "C", ( "A" + 1 ) AS "A" FROM test_table )
          WHERE ( "A" > 1 ) ) ( SELECT * FROM ( SNOWPARK_TEMP_CTE_7G3ZFVJYBK ) )
      UNION ALL
      ( SELECT * FROM ( SNOWPARK_TEMP_CTE_7G3ZFVJYBK ) )
    ```

Ref: 1995

---
title: Snowpark stored procedures: Enable billing for PUT calls to external stages
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2002.md
section: Release Notes
---

# Snowpark stored procedures: Enable billing for PUT calls to external stages

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, billing for PUT calls to external stages from Snowpark stored procedures changes in the following way:

Before the change:
:   No billing charges for PUT calls to external stages from Snowpark stored procedures.

After the change:
:   Billing at normal data transfer rates for PUT calls to external stages from Snowpark stored procedures.

Ref: 2002

---
title: Snowpark: Creation Time Validation Disabled for Python and Java Temporary UDFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-922.md
section: Release Notes
---

# Snowpark: Creation Time Validation Disabled for Python and Java Temporary UDFs

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The validation of temporary UDFs created in Snowpark has changed slightly:

Previously:
:   When you created a temporary Python or Java UDF in Snowpark, it was validated both at creation time and at execution time.

Currently:
:   The temporary UDF is validated only at execution time.

    This change has been implemented to improve performance.

Ref: 922

---
title: SNOWPARK_CONTAINER_SERVICES_HISTORY View (ACCOUNT_USAGE): New Columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_05/bcr-1649.md
section: Release Notes
---

# SNOWPARK_CONTAINER_SERVICES_HISTORY View (ACCOUNT_USAGE): New Columns

> **Attention:**
>
> This behavior change is in the 2024_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_05_bundle.md).

When this behavior change bundle is enabled, the [SNOWPARK_CONTAINER_SERVICES_HISTORY view](../../../sql-reference/account-usage/snowpark_container_services_history.md) includes the following new columns that appear immediately after the `compute_pool_name` column.

| Column name | Data type | Description |
| --- | --- | --- |
| IS_EXCLUSIVE | BOOLEAN | TRUE, if the compute pool was created for an [application](../../../developer-guide/native-apps/native-apps-about.md). |
| APPLICATION_NAME | STRING | The name of the application for which the compute pool was created. NULL if the compute pool was not created for an application or if the application no longer exists. |
| APPLICATION_ID | STRING | The ID of the application for which the compute pool was created; otherwise NULL. |

Ref: 1649

---
title: Snowpipe and Tasks: Updates to Azure Event Grid notifications
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1421.md
section: Release Notes
---

# Snowpipe and Tasks: Updates to Azure Event Grid notifications

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The fields in the Azure Event Grid notifications for Snowpipe and tasks are as follows.

Before the change:
:   The subject and eventTypes fields are:

    > * Subject : `snowpipe`
    > * eventType: `error-notification`

After the change:
:   The subject and eventTypes fields are:

    * Subject : `Snowflake Queue Notification`
    * eventType: `eventgrid-notification`

Ref: 1421

---
title: SNOWPIPE Commands: New INVALID_REASON Column in Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1085.md
section: Release Notes
---

# SNOWPIPE Commands: New INVALID_REASON Column in Output

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The output of the following commands includes an additional column:

* SHOW PIPES
* DESCRIBE PIPES

| Column Name | Data Type | Description |
| --- | --- | --- |
| INVALID_REASON | TEXT | Displays some detailed information for your pipes that may have issues. You can use the provided information to troubleshoot your pipes more effectively. If there is no issue with the pipe, the value is NULL. |

To help minimize the impact of this addition, the column was added as the last column in the output.

Ref: 1085

---
title: Snowpipe Streaming Invalidates Older Versions of Snowflake Ingest SDK and the Kafka Connector
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1102.md
section: Release Notes
---

# Snowpipe Streaming Invalidates Older Versions of Snowflake Ingest SDK and the Kafka Connector

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

You can use Snowpipe Streaming with only the following versions of Snowflake Ingest SDK and the Kafka connector:

* Snowflake Ingest SDK version 1.1.0 and later.
* Kafka connector version 1.9.0 and later.

Any previous versions of Snowflake Ingest SDK and the Kafka connector will no longer work with Snowpipe Streaming and will be disabled preventing
ingestion into Snowflake. Previous versions were only available as beta versions used for private preview, and they do not contain all the bug
fixes and dependency upgrades required to continue working.

> **Note:**
>
> This change only applies to the Kafka connector with Snowpipe Streaming and does not apply to the older versions of the Kafka connector that
> utilizes file-based Snowpipe.
>
> We recommend that you always use the latest versions of Snowflake Ingest SDK and the Kafka connector with Snowpipe Streaming because the
> latest versions always include the most recent security updates and issue fixes.

Ref: 1102

---
title: Snowpipe Streaming SDK (for high-performance architecture) release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpipe-streaming-sdk.md
section: Release Notes
---

# Snowpipe Streaming SDK (for high-performance architecture) release notes

The Snowpipe Streaming SDK release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowpipe-streaming-sdk-2026.md)
* [2025 releases](snowpipe-streaming-sdk-2025.md)

---
title: Snowpipe Streaming SDK release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpipe-streaming-sdk-2025.md
section: Release Notes
---

# Snowpipe Streaming SDK release notes for 2025

This article contains the release notes for the Snowpipe Streaming SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpipe Streaming SDK updates.

## Version 1.1.0 (November 05, 2025)

### New features and updates

* With the release of the SDK version 1.1.0, Snowpipe Streaming’s high-performance architecture is now generally available for all accounts on Microsoft Azure, expanding its availability from Amazon Web Services (AWS).
* Update on November 10, 2025: Support for Google Cloud Platform (GCP) is also added and is now generally available for all accounts.

## Version 1.0.2 (October 10, 2025)

### New features and updates

* System Proxy Support (AWS): The Snowpipe Streaming SDK now supports connecting through a system-wide proxy when running on AWS. Users can configure proxy settings by setting the following system properties or environment variables:

  + HTTP_PROXY
  + HTTPS_PROXY
  + ALL_PROXY

## Version 1.0.1 (September 22, 2025)

### Improvements

* Telemetry and reporting capabilities are enhanced to provide better insight into ingestion metrics. This improvement makes it easier to monitor client performance.

## Version 1.0.0 (September 19, 2025)

### New features and updates

* Released the SDK to support the general availability of Snowpipe Streaming with high performance architecture for AWS deployments.
* Performance and stability improvements.

---
title: Snowpipe Streaming SDK release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowpipe-streaming-sdk-2026.md
section: Release Notes
---

# Snowpipe Streaming SDK release notes for 2026

This article contains the release notes for the Snowpipe Streaming SDK, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for Snowpipe Streaming SDK updates.

## Version 1.2.0 (February 16, 2026)

### New features and updates

* Added support for encrypted key-pair authentication. You can now connect to your instances by using both encrypted and unencrypted private keys, providing greater flexibility for your security workflows.

## Version 1.1.2 (January 20, 2026)

### Behavior changes

* Fixed a race condition in the channel status cache to improve multi-threaded stability.
* Reduced log flooding by removing redundant messages for cleaner monitoring.

### New features and updates

* Added support for account locators with region suffixes.

### Bug fixes

* Removed unused configuration parameters and addressed minor internal logic issues to improve reliability.

---
title: Snowpipe: Disable pipe role drop prevention
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2216.md
section: Release Notes
---

# Snowpipe: Disable pipe role drop prevention

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

The behavior when dropping a role that holds OWNERSHIP on a PIPE object is changing as follows:

Before the change:
:   Dropping a role that holds the OWNERSHIP privilege on a PIPE object fails. This restriction was
    originally intended to prevent privilege escalation when a pipe’s owner role is dropped. Pipes
    are the only object type with this restriction.

After the change:
:   Dropping a role that holds the OWNERSHIP privilege on a PIPE object succeeds. This aligns the
    behavior of pipes with all other Snowflake object types, including owner’s rights objects such
    as stored procedures and services.

This change is being made to ensure consistency across object types and to enable automation
workflows that drop roles.

Ref: 2216

---
title: Snowpipe: Modification of auto-ingest notification integration queue for Azure and GCP not allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1186.md
section: Release Notes
---

# Snowpipe: Modification of auto-ingest notification integration queue for Azure and GCP not allowed

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, Snowflake will not allow modifications of auto-ingest notification integration queue for Azure and GCP.

Previously:
:   You can alter the AZURE_STORAGE_QUEUE_PRIMARY_URI for Azure notification integration.

    You can alter the GCP_PUBSUB_SUBSCRIPTION_NAME for GCP notification integration.

    However, these alterations are not guaranteed to take effect in a predictable timeframe.

Currently:
:   You cannot alter the AZURE_STORAGE_QUEUE_PRIMARY_URI for Azure notification integration.

    You cannot alter the GCP_PUBSUB_SUBSCRIPTION_NAME for GCP notification integration.

Ref: 1186

---
title: Snowpipe: Multiple auto-ingest notification integrations with the same URL not allowed for Azure and GCP
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1394.md
section: Release Notes
---

# Snowpipe: Multiple auto-ingest notification integrations with the same URL not allowed for Azure and GCP

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

Snowflake does not allow multiple auto-ingest notification integrations with the same URL for Azure and GCP.

Before the change:
:   When you create a new pipe using a notification integration with the same queue URL as another notification integration, the pipe creation succeeds without error.

After the change:
:   When you create a new pipe using a notification integration with the same queue URL as another notification integration, the pipe creation fails with an error: “Notification queue already in use with another integration.”

Ref: 1394

---
title: Snowsight is the only interface available for all users
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2080.md
section: Release Notes
---

# Snowsight is the only interface available for all users

When this behavior change bundle is enabled, Snowsight will become the only available interface for all users, including Private Link
and VPS account customers. Users will no longer be able to access Classic Console.

Before the change:
:   All users, including Private Link and VPS account customers, see Snowsight after signing in. If needed, users can access Classic
    Console from within Snowsight.

After the change:
:   Snowsight becomes the only available interface for all users, including Private Link and VPS account customers. Users can no longer
    access Classic Console from Snowsight.

For more information, see [Snowsight: The Snowflake web interface](../../../user-guide/ui-snowsight.md).

Ref: 2080

---
title: Snowsight session timeout parameter: Change default value
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2139.md
section: Release Notes
---

# Snowsight session timeout parameter: Change default value

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

For Snowsight, the SESSION_UI_IDLE_TIMEOUT_MINS parameter defines the number of
minutes in which a session can be idle before a user must authenticate to Snowflake again.
The default value of the SESSION_UI_IDLE_TIMEOUT_MINS parameter behaves in the following manner:

Before the change:
:   SESSION_UI_IDLE_TIMEOUT_MINS is set to 240 minutes (4 hours). The default behavior
    without a session policy is the Snowsight session timing out after
    240 minutes.

After the change:
:   SESSION_UI_IDLE_TIMEOUT_MINS is set to 1080 minutes (18 hours). The default behavior
    without a session policy is the Snowsight session timing out after
    1080 minutes.

This pending change affects all Standard Edition customer accounts. Standard Edition
account administrators can’t create session policies, so Snowsight sessions for
Standard Edition accounts time out after 1080 minutes.

Administrators for Enterprise Edition or higher accounts can create a session policy that
specifies a timeout value for Snowsight sessions other than the default value.
For more information, see [Snowflake sessions and session policies](../../../user-guide/session-policies.md).

Ref: 2139

---
title: Snowsight Templates learning environment
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1992.md
section: Release Notes
---

# Snowsight Templates learning environment

This behavior change will be enabled during the windows between December 2 - December 17, 2025 and January 7 - January 30, 2026. During this
time, the dedicated Snowflake learning environment (`SNOWFLAKE_LEARNING`) is enabled automatically. This environment is required for
Snowsight Templates, which let users interactively explore Snowflake features and use cases through workspaces, notebooks, or
Streamlit apps that they can run. Templates come preconfigured with sample data and the necessary permissions.

The `SNOWFLAKE_LEARNING` environment includes a pre-provisioned role (`SNOWFLAKE_LEARNING_ROLE`), compute warehouse (`SNOWFLAKE_LEARNING_WH`),
and database (`SNOWFLAKE_LEARNING_DB`).

To opt out of the learning environment before the change is enabled:

```sqlsyntax
SELECT SYSTEM$DISABLE_SNOWFLAKE_LEARNING_ENVIRONMENT();
```

To disable and drop the learning environment after it is enabled:

```sqlsyntax
USE ROLE ACCOUNTADMIN;
SELECT SYSTEM$DISABLE_SNOWFLAKE_LEARNING_ENVIRONMENT();

-- DATABASE
SHOW DATABASES LIKE 'SNOWFLAKE_LEARNING_DB';
DROP DATABASE SNOWFLAKE_LEARNING_DB;

-- WAREHOUSE
SHOW WAREHOUSES LIKE 'SNOWFLAKE_LEARNING_WH';
DROP WAREHOUSE SNOWFLAKE_LEARNING_WH;

-- ROLE
SHOW ROLES LIKE 'SNOWFLAKE_LEARNING_ROLE';
DROP ROLE SNOWFLAKE_LEARNING_ROLE;
```

For more information, see [Snowsight templates](../../../user-guide/ui-snowsight/snowsight-templates.md).

Ref: 1992, 2159

---
title: Snowsight worksheets and dashboards: Changes to formatting of query results
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1314.md
section: Release Notes
---

# Snowsight worksheets and dashboards: Changes to formatting of query results

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

Query results no longer have automatic formatting applied in Snowsight. Instead, the raw data
format of the results is displayed unless parameters or SQL format models are applied. With this change, the formatting for numeric
and timestamp data types changes as follows:

Before the change:
:   NUMBER and INTEGER data types:

    * Numbers greater than or equal to 16 digits are displayed using scientific notation (1.012345678901234567e+19).
    * Numbers greater than or equal to 16 digits include comma separators.
    * Numbers with trailing zeros have trailing zeros removed and display with comma separators.

    FLOAT data types:

    * Numbers with trailing zeros have trailing zeros removed and display with comma separators.

    TIMESTAMP data types:

    * TIMESTAMP_NTZ results with 10,000 or fewer rows display as YYYY-MM-DD HH24:MI:SS.FF3.
    * TIMESTAMP_NTZ results with 10,001 or greater rows display as YYYY-MM-DD HH24:MI:SS.
    * TIMESTAMP_TZ and TIMESTAMP_LTZ results with 10,000 or fewer rows display as YYYY-MM-DD HH24:MI:SS.FF3 TZH.
    * TIMESTAMP_TZ and TIMESTAMP_LTZ results with 10,001 or greater rows display as YYYY-MM-DD HH24:MI:SS.
    * TIMESTAMP_NTZ results mapped to TIMESTAMP_LTZ display as YYYY-MM-DD HH24:MI:SS.FF3.

After the change:
:   NUMBER and INTEGER data types:

    * Numbers greater than or equal to 16 characters display without formatting (1012345678901234567).
    * Numbers with trailing zeros have trailing zeros removed, unless the provided number scale exceeds the number of digits in the value.

    FLOAT data types:

    * Numbers with trailing zeros have trailing zeros removed, unless the provided number scale exceeds the number of digits in the value.

    TIMESTAMP data types:

    * TIMESTAMP_NTZ results display as YYYY-MM-DDTHH24:MI:SS.FF3Z.
    * TIMESTAMP_TZ and TIMESTAMP_LTZ results display as YYYY-MM-DDTHH24:MI:SS.FF3TZH. If TZH is +0000, TZH is removed.

If you want your results to appear with formatting in worksheets or dashboard tiles, update your query to use the relevant formatting.
See [SQL format models](../../../sql-reference/sql-format-models.md).

Ref: 1314

---
title: Snowsight worksheets: Changes to working with versions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-1313.md
section: Release Notes
---

# Snowsight worksheets: Changes to working with versions

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

The way that you work with versions of a worksheet in Snowsight is different:

Before the change:
:   When you select versions of a worksheet, the code for the selected version updates for the worksheet and replaces the code in
    the worksheet. Some versions are designated as drafts, and you can see the results for selected past versions of a worksheet.

After the change:
:   When you select versions of a worksheet, the code for the selected version appears in a dialog box where you can copy the code
    to modify in the current worksheet version context.

    You cannot view the results for selected versions of a worksheet, and draft versions are no longer distinguished in the list.

    Versions are de-duplicated based on the query text, and worksheet filters in place, and the context in which the worksheet
    was executed.

Ref: 1313

---
title: Snowsight: Default all users in all accounts to Snowsight
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_03/bcr-1511.md
section: Release Notes
---

# Snowsight: Default all users in all accounts to Snowsight

> **Attention:**
>
> This behavior change is in the 2024_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_03_bundle.md).

All Snowflake customers are upgraded to Snowsight and all users see Snowsight after signing in to the Snowflake
web interface.

If you use private connectivity to access Snowflake, you are not yet affected by this change.
See [Configuring private connectivity for Snowsight](../../../user-guide/ui-snowsight-gs.md) to prepare to upgrade to Snowsight.

Before the change:
:   Users can choose which web interface is the default experience for their user profile within an account. After signing in, users see
    the web interface specified as the default experience, Classic Console or Snowsight.

After the change:
:   All users in an account see Snowsight after signing in.

    Users can no longer choose a default experience for their user profile. If needed, users can access
    Classic Console from Snowsight.

> **Important:**
>
> This change happens slowly during the behavior change periods.
>
> If your account enables the bundle during the opt-in testing period, this change does not happen in your account immediately.
> Instead, you see this change within one week of the bundle being enabled on your account.
>
> During the opt-out period, this change rolls out slowly and all accounts see the change within three weeks of the start of the opt-out
> period.

Ref: 1511

---
title: Snowsight: Default all users, including VPS and Private Link, to Snowsight
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1930.md
section: Release Notes
---

# Snowsight: Default all users, including VPS and Private Link, to Snowsight

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

All Snowflake Private Link and VPS account customers are upgraded to Snowsight
and see Snowsight after signing in to the Snowflake web interface.

Before the change:
:   Users see their default experience in the web interface, as specified in their user
    profile for each account: Classic Console or Snowsight.

After the change:
:   All users, including Private Link and VPS account customers see Snowsight after logging in.
    Private Link and VPS account users can no longer choose Classic Console as the default experience for their user profile.
    If needed, users can access Classic Console from Snowsight.

In order to ensure that users can access Snowsight, it is vital that you validate connectivity
to Snowsight URLs within your private network.

* We strongly encourage you to confirm users can connect and log into Snowsight before April 4th.
* If there are problems, we encourage you to review the information in [Configuring private connectivity for Snowsight](../../../user-guide/ui-snowsight-gs.md)
  or contact Snowflake Support to discuss next steps.

Ref: 1930

---
title: Snowsight: Default interface for all users in Standard Edition accounts
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1338.md
section: Release Notes
---

# Snowsight: Default interface for all users in Standard Edition accounts

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

Snowflake customers with a Capacity commitment where all accounts in the organization are
[Standard Edition](../../../user-guide/intro-editions.md) accounts are upgraded to Snowsight.
For those organizations, all users see Snowsight after signing in to the Snowflake web interface.

To see if your organization is affected by this change, an organization administrator (user with the ORGADMIN role) can run
[SHOW ORGANIZATION ACCOUNTS](../../../sql-reference/sql/show-organization-accounts.md) and review the output.
If the following conditions are both true, this change affects users in your organization:

* In the EDITION column, all accounts listed display STANDARD.
* In the CREATED_ON column, all dates are earlier than October 4, 2023, or 2023-10-04 00:00:00.000 +0000 to match the format of the output.

If you use private connectivity to access Snowflake and have not yet set up your private connectivity configuration to access Snowsight,
you are not affected by this change. See [Configuring private connectivity for Snowsight](../../../user-guide/ui-snowsight-gs.md) to prepare for upgrading to Snowsight.

Before the change:
:   Users can choose which web interface is the default experience for their user profile within an account. After signing in, users see
    the web interface specified as the default experience, Classic Console or Snowsight.

After the change:
:   All users in an account see Snowsight after signing in.

    Users can no longer choose a default experience for their user profile. If needed, users can access
    Classic Console from Snowsight.

Ref: 1338

---
title: Snowsight: Default Interface for All Users of Snowflake On Demand™
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-969.md
section: Release Notes
---

# Snowsight: Default Interface for All Users of Snowflake On Demand™

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

Customers of Snowflake On Demand accounts default to Snowsight.

Previously:
:   Users could choose which web interface is the default experience for their user profile within an account. After logging in, users see
    the web interface specified as the default experience, Classic Console or Snowsight.

Currently:
:   For Snowflake On Demand customers, all users in an account see Snowsight after logging in.

    Users can no longer choose Classic Console as the default experience for their user profile. If needed, users can access
    Classic Console from Snowsight.

> **Note:**
>
> Where possible, customers using private connectivity to connect to Snowsight have been proactively excluded from this behavior
> change. For more information, see
> [Preparing Private Connectivity for Snowsight](https://community.snowflake.com/s/article/Private-Connectivity-URLs-with-Snowsight-And-Client-Redirect)
> (in the Snowflake Community)

Ref: 969

---
title: Snowsight: Default Interface for New Users
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1113.md
section: Release Notes
---

# Snowsight: Default Interface for New Users

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

All new users of Snowflake have their Default Experience user preference set to Snowsight.

Users still have access to Classic Console and can change their Default Experience preference back to Classic Console in their user
profile if needed.

If you use private connectivity to access Snowflake, make sure that access to Snowsight is set up.

Previously:
:   New Snowflake users sign in to the web interface and see the Classic Console.

Currently:
:   New Snowflake users sign in to the web interface and see Snowsight.

> **Note:**
>
> Where possible, customers using private connectivity to connect to Snowsight have been proactively excluded from this behavior
> change. For more information, see
> [Preparing Private Connectivity for Snowsight](https://community.snowflake.com/s/article/Private-Connectivity-URLs-with-Snowsight-And-Client-Redirect)
> (in the Snowflake Community).

Ref: 1113

---
title: Snowsight: Roles Removed from Worksheet Folders
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1025.md
section: Release Notes
---

# Snowsight: Roles Removed from Worksheet Folders

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Folders used to organize worksheets in Snowsight will not have roles assigned.
This change affects all customers that use folders to organize worksheets in Snowsight.

Previously:
:   Folders used to organize worksheets in Snowsight are assigned roles, and the role for the folder applies to all worksheets in the folder.

Currently:
:   Folders used to organize worksheets in Snowsight do not have assigned roles. Worksheets keep their previously assigned roles.
    A worksheet in a folder can be assigned a different role than other worksheets in the same folder.

Ref: 1025

---
title: SnowSQL Change Log (Prior to January 2022)
source: https://docs.snowflake.com/en/release-notes/client-change-log-snowsql.md
section: Release Notes
---

# SnowSQL Change Log (Prior to January 2022)

This topic lists the fixes, enhancements, and other changes introduced across all released, production versions of [SnowSQL](../user-guide/snowsql.md),
the Snowflake CLI (command-line interface) client prior to January 2022.

See the [Snowflake connector, driver, and library monthly releases](clients-drivers/monthly-releases.md) for current release note and change log information from January 2022 and later.

Note that this list does not include all changes made to SnowSQL; it only lists significant changes or changes that may impact your usage.

In addition, this list is updated independently from the SnowSQL releases and, therefore, may not include the most recently-released version; to see
all available versions, go to the [SnowSQL Download](https://developers.snowflake.com/snowsql/) page.

| Version | Change | Description |
| --- | --- | --- |
| **SnowSQL 1.2.21** |  |  |
|  | SNOW-480963 | Fixes Python connector bug that Updated URL escaping when uploading to AWS S3 to match how S3 escapes URLs. |
|  |  |  |
| **SnowSQL 1.2.20** |  |  |
|  | SNOW-475359 | Upgrade sqlparse library version from 0.2.3 to 0.4.2. |
|  |  |  |
| **SnowSQL 1.2.19** |  |  |
|  | SNOW-467701 | Added ability to set arbitrary connection parameters in Snowflake connections. |
|  | SNOW-276705 | Make the use of encrypted private key for Key Pair authentication optional. |
|  |  |  |
| **SnowSQL 1.2.18** |  |  |
|  | SNOW-377123 | Added the ability to accept and substitute empty variables, this new behavior is protected by the environmental variable SNOWSQL_ALLOW_EMPTY_ENV_VARS. |
|  | SNOW-407614 | Changed the behavior of printing unicode characters instead of their escape sequences when using csv format, this new behavior is protected by the environmental variable SNOWSQL_OUTPUT_AS_UNICODE. |
|  |  |  |
| **SnowSQL 1.2.17** |  |  |
|  | SNOW-378268 | Fixes Python Connector bug that prevents the connector from using AWS S3 Regional URL. The driver currently overrides the regional URL information with the default S3 URL causing failure in PUT. |
|  |  |  |
| **SnowSQL 1.2.16** |  |  |
|  | SNOW-365900 | Fix for incorrect JWT token invalidity when an account alias with a dash in it is used for regionless account URL. |
|  |  |  |
| **SnowSQL 1.2.15** |  |  |
|  | SNOW-303944 | Added command-line flags to generate JSON Web Tokens (JWT) in SnowSQL. |
|  |  |  |
| **SnowSQL 1.2.14** |  |  |
|  | SNOW-298813 | Fix for Progress percentage computation to handle the case when file size is zero. |
|  |  |  |
| **SnowSQL 1.2.13** |  |  |
|  | SNOW-270946 | Fix a zero division error while computing progress percentage which gets triggered when the size of file to upload/download is 0. |
|  |  |  |
| **SnowSQL 1.2.12** |  |  |
|  | SNOW-293541 | Released Mac package with renewed Developer Installer Certificate. |
|  |  |  |
| **SnowSQL 1.2.11** |  |  |
|  | SNOW-232777 | The fix to add proper proxy CONNECT headers for connections made over proxies. |
|  |  |  |
| **SnowSQL 1.2.10** |  |  |
|  | SNOW-170647 | Removed an unnecessary dependency. |
| **SnowSQL 1.2.9** |  |  |
|  | SNOW-181011 | Fixed missing dependency for keyring package, which caused issues when connecting using `authenticator = externalbrowser`. |
| **SnowSQL 1.2.8** |  |  |
|  | SNOW-123267 | Support added for forcing quit by pressing **[CTRL]-c** twice. Note that, if you use this option, SnowSQL does not verify previously-started queries were successfully canceled. |
|  | SNOW-159538 | Internal improvement for masking logs for sensitive information. |
| **SnowSQL 1.2.7** |  |  |
|  | SNOW-150710 | Added new custom `sql_delimiter` variable to enable specifying a character other than a semicolon as the delimiter for SQL statements. |
|  | SNOW-170458 | Fixed an issue where SnowSQL failed on multi-line queries, which was caused by a regression introduced in the SnowSQL 1.2.6 internal release. |
| **SnowSQL 1.2.6** |  |  |
|  | N/A | Version is not available for download. |
| **SnowSQL 1.2.5** |  |  |
|  | SNOW-135171 | Updated the SnowSQL distribution to allow specifying the install location. |
|  | SNOW-136164 | Fixed issue where SnowSQL could not be installed if `~/.snowsql` doesn’t exist. |
| **SnowSQL 1.2.4** |  |  |
|  | SNOW-126786 | For Snowflake accounts hosted on GCP, fixed exception when using PUT to upload a file to a stage with `auto_compress=false`. |
|  | SNOW-134305 | Increased threshold to 64MB for multi-part upload to S3. |
| **SnowSQL 1.2.3** |  |  |
|  | SNOW-93304 | Full/main SnowSQL module included in the bootstrap distribution to facilitate offline/hosted installation. |
|  | SNOW-120329 | Added support for OAuth token authentication method. |
| **SnowSQL 1.2.2** |  |  |
|  | SNOW-75495 | Internal change for pending feature. |
|  | SNOW-121787 | Pinned the keyring version to 19.2.0. |
|  | SNOW-122376 | Fixed issue with no content cache for downloading newer version of SnowSQL. |
|  | SNOW-122797 | For MacOS Catalina 10.15.1, fixed `oscrypto` and OpenSSL conflict. |
| **SnowSQL 1.2.1** |  |  |
|  | SNOW-106130 | Added Cask Installer for SnowSQL 1.2.0. |
|  | SNOW-110191 | Enabled `fix_parameter_precedence` connection parameter to `true` for SnowSQL. |
|  | SNOW-118881 | Added .zprofile support to SnowSQL installer. |
| **SnowSQL 1.2.0** |  |  |
|  | SNOW-110647 | Moved upgrade repository from S3 to sfc-repo; original S3 repository is still available for earlier versions. |
| **SnowSQL 1.1.86** |  |  |
|  | SNOW-64718 | Internal change for pending feature. |
|  | SNOW-92738 | Improved SnowSQL installation through `brew cask` for `zshell` users. |
| **SnowSQL 1.1.85** |  |  |
|  | SNOW-94184 | Fixed issue related to Arrow format (internal enhancement). |
| **SnowSQL 1.1.84** |  |  |
|  | SNOW-66323 | Driver now suppresses echo of sensitive data output. |
|  | SNOW-82276 | Removed support for old OCSP URL for AWS PrivateLink. |
| **SnowSQL 1.1.83** |  |  |
|  | SNOW-88844 | Fixed grammatical issues in SnowSQL error message. |
|  | SNOW-89190 | For Snowflake accounts in the US Gov Virginia region (Azure), fixed issue with PUT and GET commands. |
| **SnowSQL 1.1.82** |  |  |
|  | SNOW-82268 | This version of SnowSQL does not use the new OCSP hostname/URL for AWS PrivateLink; the new hostname/URL will be implemented in a future version. |
| **SnowSQL 1.1.81** |  |  |
|  | SNOW-80440 | Fixed an issue where extra linefeed characters were generated when `output_format` was set to `tsv` and a DESCRIBE SCHEMA command was executed on an empty schema. |
| **SnowSQL 1.1.80** |  |  |
|  | SNOW-57024 | Fixed an issue where casting a timestamp earlier than UNIX epoch time added 0.100 seconds to the output. |
| **SnowSQL 1.1.79** |  |  |
|  | SNOW-75465 | Fixed issue with `!SET` indent with comment. |
|  | SNOW-76043 | Added option to skip request pooling. |
|  | SNOW-76797 | Implemented support for OCSP fail-open. |
|  | SNOW-77160 | Added OCSP_MODE metric. |
| **SnowSQL 1.1.78** |  |  |
|  | SNOW-74395 | Fixed issue with Azure token renewal for long running jobs. |
|  | SNOW-75372 | Enhanced SQL syntax highlighting in the SnowSQL editor. |
| **SnowSQL 1.1.77** |  |  |
|  | SNOW-74042 | Implemented the custom OCSP Cache Server URL in the Python Connector used by SnowSQL. |
| **SnowSQL 1.1.76** |  |  |
|  | SNOW-66025 | Added support for FORCE_PUT_OVERWRITE option. |
| **SnowSQL 1.1.75** |  |  |
|  | SNOW-66722 | For Windows, fixed regression for DATE format. |
| **SnowSQL 1.1.74** |  |  |
|  | SNOW-64148 | Upgraded Python version to 3.6. |
| **SnowSQL 1.1.73** |  |  |
|  | SNOW-57001 | Driver now ignores exceptions from Heartbeat. |
|  | SNOW-63422 | Added support for negative values for year. |
|  | SNOW-63839 | Fixed out-of-range error for year values. |
|  | SNOW-64053 | Added an option to automatically print query ids. |
| **SnowSQL 1.1.72** |  |  |
|  | SNOW-37156 | Added new SQL functions to the keyword list for auto-completion and syntax highlighting. |
|  | SNOW-54514 | Fixed issue with explicitly-specified default region causing SnowSQL to hanging indefinitely. |
| **SnowSQL 1.1.71** |  |  |
|  | SNOW-36812 | Added the `!pause` command to pause and continue running queries. |
|  | SNOW-56234 | For Snowflake accounts hosted on Azure, fixed the PUT/GET progress bar. |
|  | SNOW-59077 | Added the `timing_in_output_file` option to store the query timing in the output file. |
|  | SNOW-60603 | Added the `progress_bar` option to suppress displaying the progress of PUT and GET commands. |
|  | SNOW-61860 | Adjusted the log level to mitigate confusion. |
| **SnowSQL 1.1.70** |  |  |
|  | SNOW-60580 | For Snowflake accounts in the EU region, fixed 403 error. |
| **SnowSQL 1.1.69** |  |  |
|  | SNOW-58838 | Added service name support for multi-GS clustering (internal feature). |
|  | SNOW-58845 | Added SNOWSQL_DOWNLOAD_DIR environment variable to set the download directory. |
|  | SNOW-60056 | For Windows, fixed issue with Python connector failing to convert pre-epoch TIMESTAMP_NTZ data. |
| **SnowSQL 1.1.68** |  |  |
|  | SNOW-58177 | SnowSQL now raises a more user-friendly error if `localhost` is not found. |
| **SnowSQL 1.1.67** |  |  |
|  | SNOW-56812 | Fixed an issue where `exit_on_error=true` didn’t work if an error occurred for a PUT or GET command. |
|  | SNOW-56882 | Fixed an issue caused by using a backslash followed by a single quote in a literal (e.g. `'text\'s string'`). |
| **SnowSQL 1.1.66** |  |  |
|  | SNOW-55034 | Added `request_guid` to each HTTP request for tracing. |
|  | SNOW-56079 | Fixed an issue with setting a single right angle bracket (`>`) as the command-line prompt. |
| **SnowSQL 1.1.65** |  |  |
|  | SNOW-55027 | Added support for binding the `datetime` object with the Snowflake TEXT data type. |
|  | SNOW-55093 | Internal change for pending feature. |
|  | SNOW-55253 | Added the `--client-session-keep-alive` option. |
| **SnowSQL 1.1.64** |  |  |
|  | SNOW-31060 | Adjusted log levels by changing most `INFO` logs to `DEBUG`. |
|  | SNOW-54322 | Fixed a misspelling in an error message for SSO. |
|  | SNOW-54714 | SnowSQL now retries if HTTP 405 error is encountered. |
| **SnowSQL 1.1.63** |  |  |
|  | SNOW-52668 | Added `-U` and `--upgrade` options to enable forcing upgrading to the latest version of SnowSQL. |
|  | SNOW-53452 | Internal change for pending feature. |
|  | SNOW-53650 | Internal change for pending feature. |
|  | SNOW-53890 | Fixed the incorrect description in the SnowSQL help for the `friendly` option. |
|  | SNOW-53891 | Fixed issue which incorrectly displayed the following message when **[Ctrl]+[D]** was used to exit SnowSQL: `If the error message is not clear, enable the logging using -o log_level=DEBUG...`; the message is no longer displayed. |
| **SnowSQL 1.1.62** |  |  |
|  | SNOW-53405 | Deprecated the `region` parameter; instead, region information is specified (if needed) in the `account` parameter. |
|  | SNOW-53629 | Removed hardcoded `testaccount` names. |
| **SnowSQL 1.1.61** |  |  |
|  | SNOW-50629 | SnowSQL now uses UTC timestamp for logging. |
|  | SNOW-50766 | Updated SnowSQL to enforce virtual host style for S3 URLs. |
| **SnowSQL 1.1.60** |  |  |
|  | SNOW-50514 | Internal change for pending feature. |
|  | SNOW-51669 | Internal change for pending feature. |
| **SnowSQL 1.1.59** |  |  |
|  | SNOW-48675 | Added support for client-side job telemetry (for internal use). |
|  | SNOW-48678 | Internal change for pending feature. |
| **SnowSQL 1.1.58** |  |  |
|  | SNOW-45021 | Removed login name requirement when authenticating with an OAuth access token. |
| **SnowSQL 1.1.57** |  |  |
|  | SNOW-43215 | SnowSQL now uses OCSP dynamic server if OCSP response doesn’t exist in the cache. This is currently only for AWS PrivateLink. |
| **SnowSQL 1.1.56** |  |  |
|  | SNOW-28419 | SnowSQL now dumps the TLS/SSL Certificate to `stdout` if the handshake fails; provided primarily for troubleshooting connection issues. |
|  | SNOW-39938 | Fixed issue where the `key_bindings` configuration parameter did not work correctly when set to `vi` mode. |
| **SnowSQL 1.1.55** |  |  |
|  | SNOW-41707 | Internal change for pending feature. |
| **SnowSQL 1.1.54** |  |  |
|  | SNOW-42833 | Internal change for pending feature. |
| **SnowSQL 1.1.53** |  |  |
|  | SNOW-41694 | Added support for key pair authentication. |
| **SnowSQL 1.1.52** |  |  |
|  | SNOW-40919 | Added `login_timeout` option. |
|  | SNOW-41377 | Fixed TypeError that occurred when using PUT or GET command to upload/download extremely large numbers of small files. |
| **SnowSQL 1.1.51** |  |  |
|  | SNOW-34467 | Internal change for pending feature. |
| **SnowSQL 1.1.50** |  |  |
|  | SNOW-28376 | SnowSQL now uses the shared OCSP response cache file in `~/.cache/snowflake/ocsp_response_cache.json`. |
|  | SNOW-38618 | SnowSQL now handles cases where `stdin/stdout/stderr` is closed. |
| **SnowSQL 1.1.49** |  |  |
|  | SNOW-21492 | Added flag for OCSP response cache server. |
| **SnowSQL 1.1.48** |  |  |
|  | SNOW-37395 | Added support for the `authenticator` option in the configuration file. |
| **SnowSQL 1.1.47** |  |  |
|  | SNOW-37262 | Fixed `string index out of range` error, which occurred when a string, ending with an escape sequence, was truncated. |
| **SnowSQL 1.1.46** |  |  |
|  | SNOW-24653 | Fixed issue that generated an error stack if the specified log file was not accessible; now, the error stack is suppressed. |
|  | SNOW-24710 | Added connection parameter to support single transactions. |
|  | SNOW-28482 | Added option to support paging output. |
|  | SNOW-32282 | Internal change for pending feature. |
| **SnowSQL 1.1.45** |  |  |
|  | SNOW-32806 | Internal change for pending feature. |
|  | SNOW-34176 | Upgraded underlying PyInstaller to 3.3 along with the base Python version to 3.5. |
|  | SNOW-34418 | Fixed performance issue with SHOW COLUMN IN ACCOUNT command. |
|  | SNOW-36332 | Windows: Fixed an issue where the output was truncated. |
| **SnowSQL 1.1.44** |  |  |
|  | SNOW-35404 | Fixed issue where fractions of seconds in timestamps were reported incorrectly. |
| **SnowSQL 1.1.43** |  |  |
|  | SNOW-30483 | Added support for SAML 2.0-compliant services/applications for federated authentication by adding the `externalbrowser` value to the `authenticator` connection option. |
|  | SNOW-32139 | Added correct verification of the proof key, login name, and request ID in support of SAML 2.0-compliant services/applications for federated authentication. |
| **SnowSQL 1.1.42** |  |  |
|  | SNOW-33973 | SnowSQL now retries all HTTP 5xx errors returned by the Python Connector. |
|  | SNOW-34027 | To prevent AWS token expiration issues, SnowSQL now renews the AWS token if a `S3UploadFailedError` error occurs. |
|  | SNOW-34123 | Fixed a minor issue where an error was generated with no error message. |
| **SnowSQL 1.1.41** |  |  |
|  | SNOW-29826 | Error message details improved for connection errors caused by an invalid SSL certificate. |
|  | SNOW-31859 | SnowSQL now assigns a `__rowcount` variable to the number of rows updated/selected by the previous statement, which can then be called using the SnowSQL variables syntax (e.g. `&__rowcount`). |
|  | SNOW-33405 | SnowSQL now monitors the status of queries when running in asynchronous mode and waits for the queries to finish before disconnecting from Snowflake. |
| **SnowSQL 1.1.40** |  |  |
|  | SNOW-33112 | SnowSQL now retries queries indefinitely to mitigate HTTP 500 errors. |
| **SnowSQL 1.1.39** |  |  |
|  | SNOW-30483 | Fixed a security issue for SAML integration. |
|  | SNOW-31153 | Implemented support for using **Ctrl+C** to exit when prompted for a password (during re-authentication). |
|  | SNOW-32445 | Fixed an issue with fetching large result sets for Azure BLOB. |
| **SnowSQL 1.1.38** |  |  |
|  | SNOW-29144 | SnowSQL now flushes output to a file on each write. |
| **SnowSQL 1.1.37** |  |  |
|  | SNOW-30483 | Added support for web-based SAML authentication. |
| **SnowSQL 1.1.36** |  |  |
|  | SNOW-32074 | Fixed issue introduced in previous rolled-back version of SnowSQL. |
| **SnowSQL 1.1.35** |  |  |
|  | SNOW-30483 | Internal fix (rolled back). |
| **SnowSQL 1.1.34** |  |  |
|  | SNOW-31790 | Minor improvements to SnowSQL help text. |
| **SnowSQL 1.1.33** |  |  |
|  | SNOW-31712 | Fixed regression introduced in 1.1.32: missing parameter, `src_file_size`, resulted in GET commands returning errors. |
| **SnowSQL 1.1.32** |  |  |
|  | SNOW-31396 | Removed scanning of all existing files in a stage before executing a PUT command. Now, each individual upload operation checks the target file, and if the file digests are identical, the file is not uploaded. This reduces overhead on the PUT command. |
| **SnowSQL 1.1.31** |  |  |
|  | SNOW-18939 | Added support for ORC file format in PUT command. |
|  | SNOW-30785 | Added support for the current role in the SnowSQL prompt. |
| **SnowSQL 1.1.30** |  |  |
|  | SNOW-30376 | Set AUTOCOMMIT and ABORT_DETACHED_QUERY session parameter in authentication time instead of separate command executions. |
|  | SNOW-30422 | Changed the log levels for OCSP and some network related messages from INFO to DEBUG. |
|  | SNOW-30428 | Added region parameter to S3 connection so that PUT and GET can support cross-region stages. |
| **SnowSQL 1.1.29** |  |  |
|  | SNOW-29714 | Added check to make sure file isn’t empty when checking to see if compression type is zstd. |
|  | SNOW-29933 | Driver suppresses ‘No data returned’ message when no data is returned and `friendly=false`. |
| **SnowSQL 1.1.28** |  |  |
|  | SNOW-27327 | Added support for brotli and zstd in PUT statements for the python connector. |
|  | SNOW-29584 | Implemented timeout OCSP server requests to mitigate hang. |
| **SnowSQL 1.1.27** |  |  |
|  | SNOW-29146 | Fixed issue with the bootstrap process that may cause invalid literal for `int()` with base 10. |
|  | SNOW-29283 | Fixed issue with Python3.5 DLL that fails to get loaded on Windows. |
| **SnowSQL 1.1.26** |  |  |
|  | SNOW-29023 | Added `remove_trailing_semicolons` option. |
|  | SNOW-29098 | Fixed issue for undeleted sessions by explicitly closing a session at the end of an event loop. |
| **SnowSQL 1.1.25** |  |  |
|  | SNOW-28883 | Fixed issue where auto-completion caused a non-fatal exception when typing the AS keyword in a SQL statement, e.g. when defining a view. |
| **SnowSQL 1.1.24** |  |  |
|  | SNOW-17790 | Fixed how the fractional seconds (FF) timestamp format is handled. |
|  | SNOW-28596 | Fixed issue with SnowSQL not closing sessions correctly. |
|  | SNOW-28810 | Fixed issue with `!edit` command not returning the edited text to the prompt. |
|  | SNOW-28812 | Improved user experience for `!exit` and `!quit` commands by allowing SnowSQL to quit without deleting the session connection. |
| **SnowSQL 1.1.23** |  |  |
|  | SNOW-28202 | Improved retry of the PUT command on `OpenSSL.SSL.SysCallError 10053` with lower concurrency to mitigate connection saturation. |
|  | SNOW-28345 | Improved OKTA authentication by securing the hostname matching. |
|  | SNOW-28380 | Added `query_id_in_error` option to show or hide the query ID in the error message. |
|  | SNOW-28570 | Fixed issue where command (string beginning with an exclamation point) could not be executed if it contained a tailing semi-colon; driver now ignores the semi-colon. |
| **SnowSQL 1.1.22** |  |  |
|  | SNOW-18260 | Added support for executing multiple SQL files. |
|  | SNOW-24118 | Added SnowSQL installation files to the Amazon S3 artifact repository, in addition to the Snowflake web interface |
|  | SNOW-28224 | Fixed issue in which SnowSQL exited before asynchronous queries could complete execution. |
|  | SNOW-28266 | Fixed issue in which the `!quit` command caused the following exception: `AttributeError: 'Statement' object has no attribute 'to_unicode'`. |
|  | SNOW-28247 | Fixed issue in which a non-SQL command producing empty results failed. |
| **SnowSQL 1.1.21** |  |  |
|  | SNOW-22313 | Changed the transaction completion behavior to roll back in-progress transactions when SnowSQL exits or quits a session. |
|  | SNOW-28072 | Fixed a conversion failure issue that caused a `collections.defaultdict` exception. |
|  | SNOW-28220 | Fixed an issue in which autocomplete raised an exception if the previous token had a comparison type. |
| **SnowSQL 1.1.20** |  |  |
|  | SNOW-21252 | Fixed an issue with inconsistent behavior for account, username, and password inputs with MFA and new password. |
|  | SNOW-23904 | Improved auto-completion support for warehouses and stages; also includes various fixes for auto-completion. |
|  | SNOW-27292 | Changed auto-upgrade check to run once per hour after startup, rather than after every restart. This change requires a manual reinstall of SnowSQL. |
| **SnowSQL 1.1.19** |  |  |
|  | SNOW-25342 | Added support for version as a configuration parameter, in addition to the `-v , --version` connection parameters that are already supported. |
|  | SNOW-27620 | Implemented general performance improvements. |
|  | SNOW-27647 | Fixed internal issue with elapsed time. |
|  | SNOW-27657 | Added support for proxy parameter for OCSP validation. |
|  | SNOW-27671 | Extended the token retry period for PUT and GET to 2 hours; if all retries fail, an error is returned. |
|  | SNOW-27710 | Fixed issue where, in interactive mode, SnowSQL executed commands that were not properly started with an exclamation point or ended with a semi-colon; this issue was caused by an issue introduced in v1.1.17. |
|  | SNOW-27715 | Added support for proxy parameter for PUT and GET commands. |
|  | SNOW-27732 | SnowSQL now ignores/removes the protocol prefix, i.e. `http://` or `https://`, if included in the `--proxy-host` connection parameter. |
| **SnowSQL 1.1.18** |  |  |
|  | SNOW-25251 | Fixed issue with semi-colons in comments stopping parsing of the rest of the statement. |
|  | SNOW-27443 | Fixed issue where specifying an invalid account name returned irrelevant exception. |
| **SnowSQL 1.1.17** |  |  |
|  | SNOW-21299 | Fixed the `reauth` failure that occurs when a session expires and the password is wrong. |
|  | SNOW-27328 | Fixed issue with the `--region` connection option that caused one character to be truncated from the end of the account name. |
|  | SNOW-27345 | Implemented performance improvements when parsing SQL scripts. |
|  | SNOW-27356 | Fixed issue with the `New Password` prompt not displaying in non-interactive mode; this was caused by an issue introduced in v1.1.15. |
|  | SNOW-27374 | Added `execution_only` option for executing queries without fetching data. |
| **SnowSQL 1.1.16** |  |  |
|  | SNOW-27308 | Fixed an issue with converting DATE columns to Python data. |
| **SnowSQL 1.1.15** |  |  |
|  | SNOW-22443 | Added support for MFA passcode input. |
|  | SNOW-26262 | Implemented performance improvements for fetching numeric and timestamp data types. |
|  | SNOW-27094 | Added the `--region` connection option to support specifying the Snowflake deployment region for the account. |
| **SnowSQL 1.1.14** |  |  |
|  | SNOW-26990 | Fixed issue with OCSP access retries when a non-200 HTTP response code is returned. |
| **SnowSQL 1.1.13** |  |  |
|  | SNOW-26802 | Fixed issue in Windows environment where the bootstrap process and main executable caused conflicts writing to the same log file. Issue fixed by writing the bootstrap log to a separate file. |
| **SnowSQL 1.1.12** |  |  |
|  | SNOW-26586 | Fixed issue where the client failed to decode JSON output due to invalid UTF-8 byte sequence. |
| **SnowSQL 1.1.11** |  |  |
|  | SNOW-26352 | Fixed issue with VARIABLE not found; this was caused by an issue introduced in v1.1.10. |
| **SnowSQL 1.1.10** |  |  |
|  | SNOW-26081 | Increased the validity date acceptance window to prevent OCSP returning invalid responses due to out-of-scope validity dates for certificates; also enabled OCSP response cache file by default. |
|  | SNOW-26246 | Fixed issue where the value of variables in SnowSQL cannot include the equal character (`=`). |
|  | SNOW-26264 | Fixed issue with results command that dumped error stack. |
|  | SNOW-26265 | Fixed issue where the `!result` and `!abort` commands hang if no variable substitution is enabled. |
| **SnowSQL 1.1.9** |  |  |
|  | SNOW-25189 | Fixed issue with SnowSQL unexpectedly converting strings to numbers. |
| **SnowSQL 1.1.8** |  |  |
|  | SNOW-25368 | Fixed issue with return timing for results with 0 rows. |
| **SnowSQL 1.1.7** |  |  |
|  | SNOW-25260 | Added `noup=true` configuration option so that users can skip auto-upgrade by adding the option directly to the config file. SnowSQL already has the `--noup` connection option to prevent auto-upgrade when connecting to Snowflake. |
| **SnowSQL 1.1.6** |  |  |
|  | SNOW-24965 | Fixed issue where empty results return the following error: `max() arg is an empty sequence when specify -o output_format=expanded`. |
| **SnowSQL 1.1.5** |  |  |
|  | SNOW-17258 | Fixed issues with how AUTO handled Parquet file compression: COMPRESSION parameter for CREATE / ALTER FILE FORMAT commands and AUTO_COMPRESS parameter for PUT command. |
|  | SNOW-21492 | SnowSQL now uses the OCSP response cache file located in `~/.snowsql/ocsp_response_cache`. This file is used to store OCSP responses up to 24 hours. |
| **SnowSQL 1.1.4** |  |  |
|  | SNOW-24548 | Set the signature version for AWS client to v3 (no change in functionality). |
| **SnowSQL 1.1.3** |  |  |
|  | SNOW-23198 | Fixed a problem that caused output spanning multiple lines to sometimes result in a misaligned table format. |
| **SnowSQL 1.1.2** |  |  |
|  | SNOW-20418 | Added support for PUT command. |
|  | SNOW-23840 | Added command-line options `--proxy-user` and `--proxy-password` to support proxy authentication. |
| **SnowSQL 1.1.0** |  |  |
|  | Feature | Various minor enhancements. |
|  | Bug fix | Command-line options are now passed through to the main SnowSQL executable. |
|  | Bug fix | SnowSQL online upgrade is now transactional. |
|  | Bug fix | Option names are now case-insensitive. |
|  | Bug fix | Various minor bug fixes. |
| **SnowSQL 1.0.0** |  |  |
|  | Initial release |  |

---
title: SnowSQL release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql.md
section: Release Notes
---

# SnowSQL release notes

The SnowSQL release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](snowsql-2026.md)
* [2025 releases](snowsql-2025.md)
* [2024 releases](snowsql-2024.md)
* [2023 releases](snowsql-2023.md)
* [2022 releases](snowsql-2022.md)

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

---
title: SnowSQL release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql-2022.md
section: Release Notes
---

# SnowSQL release notes for 2022

This article contains the release notes for the SnowSQL, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SnowSQL updates.

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

## Version 1.2.24 (October 21, 2022)

### New Features

* Fixed an issue where a StopIteration exception was raised while using a TAB to complete a command.

## Version 1.2.23 (July 28, 2022)

### Bug Fixes

* Reverted to the legacy SQL splitter implementation as the default, as the new splitter implementation led to
  unforeseen behavior changes.

## Version 1.2.22 (June 29, 2022)

### New Features

* SnowSQL now captures proxies set by environment variables in logs.

### Bug Fixes

* Fixed an issue where SnowSQL returned Unicode characters as escape sequences instead of the actual Unicode characters.
* Removed the deprecation warning when using proxy variables.

---
title: SnowSQL release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql-2023.md
section: Release Notes
---

# SnowSQL release notes for 2023

This article contains the release notes for the SnowSQL, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> For release note information for versions released prior to January 2022, see the [Client Release History](https://community.snowflake.com/s/article/client-release-history).

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

## Version 1.2.31 (December 13, 2023)

### New features and updates

* Added a new `--include_connector_version` command-line option to display the version of the Snowflake Connector for Python software that is packaged in the SnowSQL binary.
* Stopped removing whitespace from SnowSQL variables.

### Bug fixes

* None.

## Version 1.2.30 (November 13, 2023)

### New features and updates

* Updated the `snowflake-connector-python` dependency to 3.4.1.
* Removed the `oscrypto` dependency, while maintaining the `pycryptodomex` dependency.

### Bug fixes

* Updated `isAuthorizedToRun` in `CallCtx` to fix an issue that prevented the [VALIDATE_PIPE_LOAD](../../sql-reference/functions/validate_pipe_load.md) function from accessing the pipe inside an app when the app is created from listing.

## Version 1.2.29 (October 10, 2023)

### New features and updates

* Updated the cryptography dependency to version 41.0.3.

### Bug fixes

* None.

## Version 1.2.28 (August 07, 2023)

### New features and updates

* Added support for Mac ARM64 binaries.

### Bug fixes

* None.

## Version 1.2.27 (June 15, 2023)

### New features and updates

* Added the `json_result_force_utf8_decoding` option to force result data to be decoded in UTF-8.
  By default, `json_result_force_utf8_decoding` is set to `false` for compatibility with legacy data. Snowflake
  recommends setting the value to true.

### Bug fixes

* Fixed a bug where default logging settings referenced parent directory and the level was set to debug by default.

## Version 1.2.26 (April, 2023)

> **Note:**
>
> Version 1.2.25 was removed shortly after release. Version 1.2.26 includes the features and fixes initially
> included in version 1.2.25.

### New features

* Added the recursion_limit option to limit the Python recursion depth.
* Added the QUERY_TAG CLI argument to specify query tags for running queries in SnowSQL. By default, QUERY_TAG
  reads the value of the `SNOWSQL_QUERY_TAG` environment variable.

### Bug fixes

* Fixed a bug where SnowSQL used an incorrect version of LD_LIBRARY_PATH to open web browsers.
* Fixed an issue where SnowSQL warned of failure to import ArrowResult.

---
title: SnowSQL release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql-2024.md
section: Release Notes
---

# SnowSQL release notes for 2024

This article contains the release notes for the SnowSQL, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> For release note information for versions released prior to January 2022, see the [Client Release History](https://community.snowflake.com/s/article/client-release-history).

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

## Version 1.3.2 (August 12, 2024)

### New features and updates

None.

### Bug fixes

* Fixed an issue with the `snowsql --version` command failing when automatic upgrades are disabled (`noup=False`).

## Version 1.3.1 (June 28, 2024)

### New features and updates

* Added Linux aarch64 binaries.

### Bug fixes

* None

## Version 1.3.0 (May 03, 2024)

### BCR (Behavior Change Release) changes

* The SnowSQL 1.3.0 release disabled automatic upgrades. To use this version, please [download and reinstall](../../user-guide/snowsql-install-config.md) SnowSQL 1.3.0. Going forward, you must manually install new versions of SnowSQL.

### New features and updates

* None.

### Bug fixes

* Disabled automatic updates to fix an issue where expired S3 licenses caused SnowSQL to fail.
* Fixed an issue where the lack of permission to create log directory aborted SnowSQL.
* Fixed an issue that endpoint is not created correctly when connecting to China deployment.

## Version 1.2.32 (March 05, 2024)

### New features and updates

* Bumped the `keyring` dependency to 23.1.0 to address a security vulnerability.

### Bug fixes

* None.

---
title: SnowSQL release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql-2025.md
section: Release Notes
---

# SnowSQL release notes for 2025

This article contains the release notes for the SnowSQL, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> For release note information for versions released prior to January 2022, see the [Client Release History](https://community.snowflake.com/s/article/client-release-history).

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

## Version 1.4.5 (Aug 13, 2025)

### New features and updates

* Added support for workload identity federation in the AWS, Azure, Google Cloud, and Kubernetes platforms.

  + Added the `workload_identity_provider` connection parameter.
  + Added `WORKLOAD_IDENTITY` to the values for the authenticator connection parameter.

### Bug fixes

* None

## Version 1.4.4 (Jul 30, 2025)

### New features and updates

* Updated openssl to version 3 for Windows.

### Bug fixes

* None

## Version 1.4.3 (Jul 10, 2025)

### New features and updates

* None

### Bug fixes

* Updated `!system` command library cleanup. Removed deprecation warning for setuptools.

## Version 1.4.2 (Jun 24, 2025)

### New features and updates

* None

### Bug fixes

* Removed SnowSQL _internals from LD_LIBRARY_PATH for subprocess on linux.

## Version 1.4.1 (May 29, 2025)

### New features and updates

* Upgraded `snowflake-connector-python` to 3.15.0.

### Bug fixes

* None.

## Version 1.4.0 (May 22, 2025)

### New features and updates

* Added support for OAuth 2.0 Authorization Code Flow and OAuth 2.0 Client Credentials Flow.
* Upgraded openssl to version 3.5.0, cryptography <= 44.0.3.
* Updated how Windows binaries sign internally upgradable components.

### Bug fixes

* Fixed an issue with the `snowsql --version` command failing when automatic upgrades are disabled (`noup=False`).

## Version 1.3.3 (February 05, 2025)

### New features and updates

* Improved inference of top-level domains for accounts specifying a region in China, now defaulting to `snowflakecomputing.cn` with new `snowflake-conenctor-python` 3.13.2.
* Bumped `pycryptodomex` to version 3.21.0.
* Updated the build system with latest openssl 1.1.1w version.

### Bug fixes

* Fixed an issue with the `snowsql --version` command failing when automatic upgrades are disabled (`noup=False`).

---
title: SnowSQL release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/snowsql-2026.md
section: Release Notes
---

# SnowSQL release notes for 2026

This article contains the release notes for the SnowSQL, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

> **Note:**
>
> For release note information for versions released prior to January 2022, see the [Client Release History](https://community.snowflake.com/s/article/client-release-history).

See [SnowSQL (CLI client)](../../user-guide/snowsql.md) for documentation.

## Version 1.5.0 (Apr 16, 2026)

### New features and updates

* Upgraded `snowflake-connector-python` to 4.4.0.
* Upgraded OpenSSL to 3.5.5 to address CVE-2025-9230.

### Bug fixes

* None.

---
title: SnowSQL: Change to the value of the sql_split property (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-792.md
section: Release Notes
---

# SnowSQL: Change to the value of the sql_split property (Pending)

In September 2022, the SQL splitter used by the `sql_split` property will change. In January 2023, the `sql_split` property will be removed and the new value will be the only option.
You should test your implementation by setting the value of sql_split to the new value described below to determine if you encounter any compatibility issues.

Configuring the current behavior:

You can set `sql_split` in the SnowSQL configuration file:

```ini
[options]
sql_split=snowflake.connector.util_text
```

Or from the command line:

```console
snowsql -o sql_split=snowflake.connector.util_text
```

Configuring the new behavior:

You can set `sql_split` in the SnowSQL configuration file:

```ini
[options]
sql_split=snowflake.cli.sqlsplit
```

Or from the command line:

```console
snowsql -o sql_split=snowflake.cli.sqlsplit ...
```

Ref: 792

---
title: Some Unused Data No Longer Sent to Drivers, Connectors, and Clients
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-916.md
section: Release Notes
---

# Some Unused Data No Longer Sent to Drivers, Connectors, and Clients

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The server no longer includes some unused data in the response sent to drivers, connectors, and clients:

Previously:
:   When the server sent a response to drivers, connectors, and clients, the response included some data that is not used.

Currently:
:   The server no longer returns this unused data in the response.

Ref: 916

---
title: SQL Changes — General: Correctly set byteLength for VARCHAR string columns (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2286.md
section: Release Notes
---

# SQL Changes — General: Correctly set byteLength for VARCHAR string columns (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

This behavior change fixes the byte length calculation for VARCHAR columns to consistently account for UTF-8 encoding (4 bytes per
character). Prior to this fix, VARCHAR columns with character lengths greater than 4,194,304 and up to 16,777,216 could have incorrectly
calculated byte lengths.

Before the change:
:   For VARCHAR columns with character length > 4,194,304 and <= 16,777,216, the `byteLength` was incorrectly capped at 16,777,216 bytes.
    This did not properly account for UTF-8 encoding, which requires up to 4 bytes per character.

    For example:

    ```sql
    CREATE TABLE example_table (
      col1 VARCHAR(10000000) -- 10M characters
    );
    SHOW COLUMNS IN TABLE example_table;
    ```

    Result:

    ```json
    {
      "length": 10000000,
      "byteLength": 16777216
    }
    ```

    The `byteLength` should be 40,000,000 (4 x 10,000,000), but was incorrectly capped at 16,777,216.

After the change:
:   For VARCHAR columns with character length > 4,194,304 and <= 16,777,216, the `byteLength` is correctly calculated as
    4 x character_length, properly accounting for UTF-8 encoding where each character can be up to 4 bytes.

    Using the same example:

    ```json
    {
      "length": 10000000,
      "byteLength": 40000000
    }
    ```

This change only affects new string columns. String columns with character length > 16,777,216 are not affected because `byteLength` is
already correctly set for those cases. The `byteLength` is still capped at 134,217,728.

Ref: 2286

---
title: SQL Changes: Add new date and time format elements (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2281.md
section: Release Notes
---

# SQL Changes: Add new date and time format elements (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

When this behavior change bundle is enabled, new short-form date and time format elements are enabled, which affects datetime formatting
and parsing logic.

Before the change:
:   The following format elements were parsed and serialized as literal characters in datetime-to-string or string-to-datetime conversion:
    `Y`, `MO`, `D`, `H24`, `H12`, `H`, `ME`, `S`, `P`.

After the change:
:   The following new format elements are now interpreted by the parsing and formatting logic:

    * `Y` for “year” (non-padded)
    * `MO` for “month” (non-padded)
    * `D` for “day” (non-padded)
    * `H24` for “24-hour based hour of the day” (non-padded)
    * `H12` for “12-hour based hour of the day” (non-padded)
    * `H` as a synonym for `H24`
    * `ME` for “minute” (non-padded)
    * `S` for “second” (non-padded)
    * `P` for “single letter AM/PM indicator” (A for AM or P for PM)

    Any unquoted usage of these characters or sequences will be interpreted as a format element instead of a literal.

This behavior change may constitute a breaking change for users who have used these new format elements as unquoted characters in their
format models for DATE, TIME, TIMESTAMP_LTZ, TIMESTAMP_NTZ, or TIMESTAMP_TZ.

For example:

* Previously, `SELECT TO_CHAR(current_timestamp(), 'YYYY-MM-DD JST')` would output the sequence “JST” as literal characters,
  for example `2026-03-18 JST`.
* Now, the `S` in `JST` will be interpreted as a “second” format element, and the formatting logic will insert a numerical value:
  `2026-03-18 J47T`.

## What you need to do

If you use any of these new format elements as unquoted characters or strings in your format models, quote any parts of the format model
that should be kept as literal characters. For example:

```sql
-- Before (incorrect after this change):
SELECT TO_CHAR(current_timestamp(), 'YYYY-MM-DD JST');

-- After (corrected):
SELECT TO_CHAR(current_timestamp(), 'YYYY-MM-DD "JST"');
```

Ref: 2281

---
title: SQL data types: Changes to maximum length, output, and error messages
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1779.md
section: Release Notes
---

# SQL data types: Changes to maximum length, output, and error messages

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

With this behavior change, compiled SQL expressions and some error messages behave as follows:

Before the change:
:   * In compiled SQL expressions and error messages, Snowflake explicitly specified the length of the data type (for example,
      `VARCHAR(16777216)`).
    * When loading objects larger than 16 MB, an error related to parsing or processing a large string or file is returned (for example,
      `100069 (22P02): Error parsing JSON: document is too large, max size 16777216 bytes`).

After the change:
:   * In compiled SQL expressions and error messages, Snowflake omits the length of the data type (for example, `VARCHAR`).
    * When loading objects larger than 16 MB, an error related to storing a large object is returned (for example,
      `100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>`).

In the past, an error occurred when you attempted to query an object larger than 16 MB (8 MB for BINARY,
GEOMETRY, and GEOGRAPHY) on a stage. You can now read and process objects up to 128 MB in size.

> **Note:**
>
> With the 9.17 release, you can now also store objects larger than 16 MB in a column. For more information,
> see [Size limits for database objects](../../../user-guide/data-load-considerations-prepare.md).

The new size limit isn’t explicitly exposed in SQL query output or metadata. However, you can implicitly
observe the new increased length by creating or reading objects of a larger size, but not storing them.
Enabling this feature introduces the following behavior changes:

* VARCHAR and BINARY types appear without length in the output of GET_DDL, SHOW, and DESCRIBE commands
  for column expressions, UDFs, and stored procedures.

  For example, `VARCHAR` is shown instead of `VARCHAR(16777216)`. This change applies only to newly
  created objects where you haven’t explicitly specified the length in the DDL statement. The change doesn’t apply
  to existing objects.
* Some statements that failed before with a `maximum size exceeded` (or similar) error will now succeed.

  Statements that only load or create, but never store or return, a large value will succeed now.
* Some statements that before failed with a `maximum size exceeded` (or similar) error will keep failing,
  however, with a different error code or message.

  The new error code and message are still related to exceeding the 16 MB limit, but the error can originate from a
  different part of the execution plan. For example, `cannot load value` might change to `cannot store value`
  or `cannot output value`.

The first change affects all customers. The second and third changes affect customers who try to load or generate
objects larger than 16 MB.

> **Important:**
>
> We strongly advise against creating logic that depends on error messages associated with objects larger than 16 MB.
> Instead, you can build logic that uses the [BIT_LENGTH](../../../sql-reference/functions/bit_length.md) function to check the size of
> the value.

## Changes in metadata

There are behavior changes that affect the following types of operations:

* Returning metadata for UDFs
* Returning metadata for tables with column expressions
* Returning metadata for SYSTEM$TYPEOF
* Returning metadata for SHOW COLUMNS

For these types of operations, there are changes in metadata in the results set.

> **Note:**
>
> This list is not exhaustive.

### Returning metadata for UDFs

For new user-defined functions (UDFs) that use VARCHAR or BINARY values as input or output, changes in the metadata for
DDL statements related to UDFs affect the output returned when you call the [GET_DDL](../../../sql-reference/functions/get_ddl.md)
function, run the [DESCRIBE FUNCTION](../../../sql-reference/sql/desc-function.md) statement, or query the
[event table](../../../developer-guide/logging-tracing/event-table-setting-up.md). The following example creates a UDF:

```sqlexample
CREATE OR REPLACE FUNCTION udf_varchar(g1 VARCHAR)
  RETURNS VARCHAR
  AS $$
    'Hello' || g1
  $$;
```

#### GET_DDL

The metadata returned from a GET_DDL function call changes in the following way:

```sqlexample
SELECT GET_DDL('function', 'udf_varchar(VARCHAR)');
```

Metadata before the change:
:   ```output
    CREATE OR REPLACE FUNCTION "UDF_VARCHAR"("G1" VARCHAR(16777216))
    RETURNS VARCHAR(16777216)
    LANGUAGE SQL
    AS '
      ''Hello'' || g1
    ';
    ```

Metadata after the change:
:   ```output
    CREATE OR REPLACE FUNCTION "UDF_VARCHAR"("G1" VARCHAR)
    RETURNS VARCHAR
    LANGUAGE SQL
    AS '
      ''Hello'' || g1
    ';
    ```

#### DESCRIBE FUNCTION

The metadata returned for a DESCRIBE FUNCTION statement changes in the following way:

```sqlexample
DESCRIBE FUNCTION udf_varchar(VARCHAR);
```

Metadata before the change:
:   ```output
    +-----------+-------------------+
    | property  | value             |
    |-----------+-------------------|
    | signature | (G1 VARCHAR)      |
    | returns   | VARCHAR(16777216) |
    | language  | SQL               |
    | body      |                   |
    |           |   'Hello' || g1   |
    |           |                   |
    +-----------+-------------------+
    ```

Metadata after the change:
:   ```output
    +-----------+-------------------+
    | property  | value             |
    |-----------+-------------------|
    | signature | (G1 VARCHAR)      |
    | returns   | VARCHAR           |
    | language  | SQL               |
    | body      |                   |
    |           |   'Hello' || g1   |
    |           |                   |
    +-----------+-------------------+
    ```

#### Event table

For new user-defined functions that return VARCHAR or BINARY values as output, the `snow.executable.name` attribute in
the [RESOURCE_ATTRIBUTES](../../../developer-guide/logging-tracing/event-table-columns.md) column of the
[event table](../../../developer-guide/logging-tracing/event-table-setting-up.md) changes as follows:

Metadata before the change:
:   ```json
    {
      "db.user": "MYUSERNAME",
      "snow.database.id": 13,
      "snow.database.name": "MY_DB",
      "snow.executable.id": 197,
      "snow.executable.name": "UDF_VARCHAR(X VARCHAR):VARCHAR(16777216)",
      "snow.executable.type": "FUNCTION",
      "snow.owner.id": 2,
      "snow.owner.name": "MY_ROLE",
      "snow.query.id": "01ab0f07-0000-15c8-0000-0129000592c2",
      "snow.schema.id": 16,
      "snow.schema.name": "PUBLIC",
      "snow.session.id": 1275605667850,
      "snow.session.role.primary.id": 2,
      "snow.session.role.primary.name": "MY_ROLE",
      "snow.user.id": 25,
      "snow.warehouse.id": 5,
      "snow.warehouse.name": "MYWH",
      "telemetry.sdk.language": "python"
    }
    ```

Metadata after the change:
:   ```json
    {
      "db.user": "MYUSERNAME",
      "snow.database.id": 13,
      "snow.database.name": "MY_DB",
      "snow.executable.id": 197,
      "snow.executable.name": "UDF_VARCHAR(X VARCHAR):VARCHAR",
      "snow.executable.type": "FUNCTION",
      "snow.owner.id": 2,
      "snow.owner.name": "MY_ROLE",
      "snow.query.id": "01ab0f07-0000-15c8-0000-0129000592c2",
      "snow.schema.id": 16,
      "snow.schema.name": "PUBLIC",
      "snow.session.id": 1275605667850,
      "snow.session.role.primary.id": 2,
      "snow.session.role.primary.name": "MY_ROLE",
      "snow.user.id": 25,
      "snow.warehouse.id": 5,
      "snow.warehouse.name": "MYWH",
      "telemetry.sdk.language": "python"
    }
    ```

### Returning metadata for tables with column expressions

For new tables that use VARCHAR or BINARY in column expressions, changes in the metadata for DDL statements related
to these columns affect the output returned when you call the GET_DDL function.

The following example creates a table with column expression:

```sqlexample
CREATE OR REPLACE TABLE table_with_default(x INT, v TEXT DEFAULT x::VARCHAR);
```

The metadata returned from a GET_DDL function call changes in the following way:

```sqlexample
SELECT GET_DDL('table', 'table_with_default');
```

Metadata before the change:
:   ```output
    create or replace TABLE TABLE_WITH_DEFAULT ( |
          X NUMBER(38,0),
          V VARCHAR(16777216) DEFAULT CAST(TABLE_WITH_DEFAULT.X AS VARCHAR(16777216))
    );
    ```

Metadata after the change:
:   ```output
    create or replace TABLE TABLE_WITH_DEFAULT ( |
          X NUMBER(38,0),
          V VARCHAR(16777216) DEFAULT CAST(TABLE_WITH_DEFAULT.X AS VARCHAR)
    );
    ```

#### External tables

The following example creates an external table:

```sqlexample
CREATE OR REPLACE EXTERNAL TABLE ext_table(
    data_str VARCHAR AS (value:c1::TEXT))
  LOCATION = @csv_stage
  AUTO_REFRESH = false
  FILE_FORMAT =(type = csv);
```

The metadata returned from a GET_DDL function call changes in the following way:

```sqlexample
SELECT GET_DDL('table', 'ext_table');
```

Metadata before the change:
:   ```output
    create or replace external table EXT_TABLE(
          DATA_STR VARCHAR(16777216) AS (CAST(GET(VALUE, 'c1') AS VARCHAR(16777216))))
    location=@CSV_STAGE/
    auto_refresh=false
    file_format=(TYPE=csv)
    ;
    ```

Metadata after the change:
:   ```output
    create or replace external table EXT_TABLE(
          DATA_STR VARCHAR(16777216) AS (CAST(GET(VALUE, 'c1') AS VARCHAR)))
    location=@CSV_STAGE/
    auto_refresh=false
    file_format=(TYPE=csv)
    ;
    ```

### Returning metadata for SYSTEM$TYPEOF

The metadata returned for a call to the [SYSTEM$TYPEOF](../../../sql-reference/functions/system_typeof.md) function changes
in the following way:

```sqlexample
SELECT SYSTEM$TYPEOF(REPEAT('a',10));
```

Metadata before the change:
:   ```output
    VARCHAR(16777216)[LOB]
    ```

Metadata after the change:
:   ```output
    VARCHAR[LOB]
    ```

### Returning metadata for SHOW COLUMNS

This change affects both existing and new tables. The metadata returned for a
[SHOW COLUMNS](../../../sql-reference/sql/show-columns.md) statement changes in the following way:

```sqlexample
CREATE OR REPLACE TABLE t AS
  SELECT TO_VARIANT('abc') AS col;

SHOW COLUMNS IN t;
```

Metadata before the change:
:   ```output
    {
      "type":"VARIANT",
      "length":16777216,
      "byteLength":16777216,
      "nullable":true,
      "fixed":false
    }
    ```

Metadata after the change:
:   ```output
    {
      "type":"VARIANT",
      "nullable":true,
      "fixed":false
    }
    ```

## Changes in loading and processing objects larger than 16 MB

There are behavior changes that affect cases when you try to load or process objects larger than 16 MB
using the following types of operations:

* Loading data by scanning files on a stage
* Querying a whole large object from a source file
* Including large objects in query results
* Creating a large object using aggregation

> **Note:**
>
> This list is not exhaustive.

### Loading data by scanning files on a stage

When you attempt to load data larger than 16 MB by scanning files on a stage, an error message is returned.

* Loading a whole large object using CREATE TABLE AS SELECT
* Loading a whole large object using COPY INTO <table_name> … FROM SELECT
* Loading a whole large object using COPY INTO <table_name> … FROM <stage_or_location>

#### Loading a whole large object using CREATE TABLE AS SELECT

A different error message appears when you try to use a CREATE TABLE AS SELECT statement to load objects that
are larger than 16 MB for VARCHAR, VARIANT, OBJECT, and ARRAY (or larger than 8 MB for BINARY, GEOMETRY, or GEOGRAPHY).
The error depends on the type of the source. The same message change applies when an INSERT INTO SELECT statement is
used for this scenario.

* Loading a whole large object from a JSON source
* Loading a whole large object from an XML source

##### Loading a whole large object from a JSON source

The following example tries to load a whole object larger than 16 MB from a JSON source using CREATE TABLE AS SELECT:

```sqlexample
CREATE OR REPLACE FILE FORMAT json_format TYPE = JSON;

CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR) AS
  SELECT $1::VARCHAR
    FROM @lob_int_stage/driver_status.json.gz (FILE_FORMAT => 'json_format');
```

Error message before the change:
:   ```output
    100069 (22P02): Error parsing JSON: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

##### Loading a whole large object from an XML source

The following example tries to load a whole object larger than 16 MB from an XML source using CREATE TABLE AS SELECT:

```sqlexample
CREATE or REPLACE FILE FORMAT xml_format TYPE = XML;

CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR) AS
  SELECT $1 AS XML
    FROM @lob_int_stage/large_xml.xte (FILE_FORMAT => 'xml_format');
```

Error message before the change:
:   ```output
    100100 (22P02): Error parsing XML: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100078 (22000): String '<string_preview>' is too long and would be truncated
    ```

#### Loading a whole large object using COPY INTO *<table_name>* … FROM SELECT

A different error message appears when you try to use a COPY INTO *<table_name>* … FROM SELECT statement to load objects that
are larger than 16 MB for VARCHAR, VARIANT, OBJECT, and ARRAY (or larger than 8 MB for BINARY, GEOMETRY, or GEOGRAPHY).
The error depends on the type of the source.

* Loading a whole large object from a JSON source
* Loading large nested objects from a JSON source
* Loading a whole large object from an XML source

> **Important:**
>
> If you attempt to load data that contains objects larger than 16 MB using the COPY INTO command with `ON_ERROR=CONTINUE`
> and rely on the error messages written in the error log, the change in the error message could cause problems in logic that
> depends on the error message.

##### Loading a whole large object from a JSON source

The following example tries to load a whole object larger than 16 MB from a JSON source using COPY INTO *<table_name>* … FROM SELECT:

```sqlexample
CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR);

COPY INTO table_varchar FROM (
  SELECT $1::VARCHAR
    FROM @lob_int_stage/driver_status.json.gz (FILE_FORMAT => 'json_format'));
```

Error message before the change:
:   ```output
    100069 (22P02): Error parsing JSON: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

##### Loading large nested objects from a JSON source

The following example tries to load JSON data when accessing large nested objects:

```sqlexample
CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR);

COPY INTO table_varchar FROM (
  SELECT $1:"Driver_Status"
    FROM @lob_int_stage/driver_status.json.gz (FILE_FORMAT => 'json_format'));
```

Error message before the change:
:   ```output
    100069 (22P02): Max LOB size (16777216) exceeded, actual size of parsed column is <object_size>
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

##### Loading a whole large object from an XML source

The following example tries to load a whole object larger than 16 MB from an XML source using COPY INTO *<table_name>* … FROM SELECT:

```sqlexample
CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR);

COPY INTO table_varchar FROM (
  SELECT $1::VARCHAR AS lob_column
    FROM @lob_int_stage/large_xml.xte (FILE_FORMAT => 'xml_format'));
```

Error message before the change:
:   ```output
    100100 (22P02): Error parsing XML: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <object_size>
    ```

#### Loading a whole large object using COPY INTO *<table_name>* … FROM *<stage_or_location>*

A different error message appears when you try to use a COPY INTO *<table_name>* … FROM *<stage_or_location>* statement to load objects that
are larger than 16 MB for VARCHAR, VARIANT, OBJECT, and ARRAY (or larger than 8 MB for BINARY, GEOMETRY,
or GEOGRAPHY). The error depends on the type of the source.

If you use the COPY command with large objects, queries might fail even when the `ON_ERROR` parameter is
set to `CONTINUE`. For more information, see the [usage notes for the COPY command](../../../sql-reference/sql/copy-into-table.md).

* Loading a whole large object from a JSON source
* Loading a whole large object from an XML source

> **Important:**
>
> If you attempt to load data that contains objects larger than 16 MB using the COPY INTO command with
> `ON_ERROR=CONTINUE` and rely on the error messages written in the error log, the change in the error
> message could cause problems in logic that depends on the message.

##### Loading a whole large object from a JSON source

The following example tries to load a whole object larger than 16 MB from a JSON source using COPY INTO *<table_name>* … FROM *<stage_or_location>*:

```sqlexample
CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR);

COPY INTO table_varchar (lob_column)
  FROM @lob_int_stage/driver_status.json.gz
  FILE_FORMAT = (FORMAT_NAME = json_format);
```

Error message before the change:
:   ```output
    100069 (22P02): Error parsing JSON: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

##### Loading a whole large object from an XML source

The following example tries to load a whole object larger than 16 MB from an XML source using COPY INTO *<table_name>* … FROM *<stage_or_location>*:

```sqlexample
CREATE OR REPLACE TABLE table_varchar (lob_column VARCHAR);

COPY INTO table_varchar (lob_column)
  FROM @lob_int_stage/large_xml.xte
  FILE_FORMAT = (FORMAT_NAME = xml_format);
```

Error message before the change:
:   ```output
    100100 (22P02): Error parsing XML: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

### Querying a whole large object from a source file

Because objects larger than 16 MB currently are not allowed in a result set, a different error message
appears when you try to query objects from a source file that are larger than 16 MB for VARCHAR, VARIANT,
OBJECT, and ARRAY (or larger than 8 MB for BINARY, GEOMETRY, or GEOGRAPHY). The error depends on the type
of the source.

* Querying a whole large object from a JSON source
* Querying a whole large object from an XML source
* Querying a whole large object from a CSV source
* Querying a whole large object from a Parquet source

#### Querying a whole large object from a JSON source

The following example tries to query a whole object larger than 16 MB from a JSON source:

```sqlexample
SELECT $1
  FROM @lob_int_stage/driver_status.json.gz (FILE_FORMAT => 'json_format');
```

Error message before the change:
:   ```output
    100069 (22P02): Error parsing JSON: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): The data length in result column $1 is not supported by this version of the client. Actual length <actual_length> exceeds supported length of 16777216.
    ```

#### Querying a whole large object from an XML source

The following example tries to query a whole object larger than 16 MB from an XML source:

```sqlexample
SELECT $1 as lob_column
  FROM @lob_int_stage/large_xml.xte (FILE_FORMAT => 'xml_format');
```

Error message before the change:
:   ```output
    100100 (22P02): Error parsing XML: document is too large, max size 16777216 bytes
    ```

Error message after the change:
:   ```output
    100082 (22000): The data length in result column $1 is not supported by this version of the client. Actual length <actual_length> exceeds supported length of 16777216.
    ```

#### Querying a whole large object from a CSV source

The following example tries to query a whole object larger than 16 MB from a CSV source:

```sqlexample
SELECT $1
  FROM @lob_int_stage/driver_status.csv.gz (FILE_FORMAT => 'csv_format');
```

Error message before the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <object_size>
    ```

Error message after the change:
:   ```output
    100082 (22000): The data length in result column $1 is not supported by this version of the client. Actual length <actual_length> exceeds supported length of 16777216.
    ```

#### Querying a whole large object from a Parquet source

The following example tries to query a whole object larger than 16 MB from a Parquet source:

```sqlexample
SELECT $1
  FROM @lob_int_stage/driver_status.parquet (FILE_FORMAT => 'parquet_format');
```

Error message before the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <object_size>
    ```

Error message after the change:
:   ```output
    100082 (22000): The data length in result column $1 is not supported by this version of the client. Actual length <actual_length> exceeds supported length of 16777216.
    ```

### Including large objects in query results

You can now create objects larger than 16 MB in memory. However, you cannot include these objects in query
results or store them in a table. When you attempt to do so, an error message is returned.

* Attempting to include an object larger than 16 MB in query results
* Attempting to store an object larger than 16 MB in a table

#### Attempting to include an object larger than 16 MB in query results

The following query attempts to concatenate two large strings:

```sqlexample
SELECT large_str || large_str FROM lob_strings;
```

Error message before the change:
:   ```output
    100078 (22000): String '<preview_of_string>' is too long and would be truncated in 'CONCAT'
    ```

Error message after the change:
:   ```output
    100067 (54000): The data length in result column <column_name> is not supported by this version of the client. Actual length <actual_size> exceeds supported length of 16777216.
    ```

#### Attempting to store an object larger than 16 MB in a table

The following CREATE TABLE AS SELECT statement attempts to concatenate two large strings:

```sqlexample
CREATE OR REPLACE TABLE table_varchar
  AS SELECT large_str || large_str as LOB_column
  FROM lob_strings;
```

Error message before the change:
:   ```output
    100078 (22000): String '<preview_of_string>' is too long and would be truncated in 'CONCAT'
    ```

Error message after the change:
:   ```output
    100067 (54000): The data length in result column <column_name> is not supported by this version of the client. Actual length <actual_size> exceeds supported length of 16777216.
    ```

### Creating a large object using aggregation

When you try to create an object larger than 16 MB and return output for it, an error message is returned.

The following example uses the [ARRAY_AGG](../../../sql-reference/functions/array_agg.md) function in a query of a large object column:

```sqlexample
SELECT ARRAY_AGG(status) FROM lob_object;
```

Error message before the change:
:   ```output
    100082 (22000): Max LOB size (16777216) exceeded, actual size of parsed column is <actual_size>
    ```

Error message after the change:
:   ```output
    100067 (54000): The data length in result column <column_name> is not supported by this version of the client. Actual length <actual_size> exceeds supported length of 16777216.
    ```

Ref: 1779

---
title: SQL functions: Passing in columns that have the upper, lower, or trim collation specifier
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1535.md
section: Release Notes
---

# SQL functions: Passing in columns that have the `upper`, `lower`, or `trim` collation specifier

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

The `upper` and `lower` [collation specifiers](../../../sql-reference/collation.md) convert strings to upper or lower case
(respectively) before strings are compared. The `trim` collation specifier removes leading and trailing spaces before strings
are compared.

In cases when you pass in columns with these specifiers to some of the SQL functions, the behavior changes in the following ways:

Before the change:
:   The [LIKE](../../../sql-reference/functions/like.md) function ignores the `upper`, `lower`, and `trim` specifiers, which
    results in a case-sensitive comparison.

    In addition, the following functions do not support columns with the `upper`, `lower`, and `trim` specifiers:

    * Variants of the LIKE function:

      + [LIKE ALL](../../../sql-reference/functions/like_all.md)
      + [LIKE ANY](../../../sql-reference/functions/like_any.md)
      + [ILIKE ANY](../../../sql-reference/functions/ilike_any.md)
    * [CONTAINS](../../../sql-reference/functions/contains.md)
    * [ENDSWITH](../../../sql-reference/functions/endswith.md)
    * [POSITION](../../../sql-reference/functions/position.md)
    * [REPLACE](../../../sql-reference/functions/replace.md)
    * [SPLIT](../../../sql-reference/functions/split.md)
    * [SPLIT_PART](../../../sql-reference/functions/split_part.md)
    * [STARTSWITH](../../../sql-reference/functions/startswith.md)

    If you pass a column with the `upper`, `lower`, or `trim` specifier to these functions, a compilation error occurs.

After the change:
:   The LIKE function respects the `upper`, `lower`, and `trim` specifiers, which results in a case-insensitive comparison.

    As a result, queries that use the LIKE function might return additional rows (see the example below).

    Note that the LIKE function does not support combinations with locale specifiers (for example, `en-upper`).

    In addition, the following functions now support columns with the `upper`, `lower`, and `trim` specifiers.

    * Variants of the LIKE function:

      + LIKE ALL
      + LIKE ANY
      + ILIKE ANY
    * CONTAINS
    * ENDSWITH
    * POSITION
    * REPLACE
    * SPLIT
    * SPLIT_PART
    * STARTSWITH

## Example of the effects of the change on the LIKE function

As noted above, if a column has the `upper`, `lower`, or `trim` specifier, queries with the LIKE function might return
additional rows. For example, suppose that a table has a column with the `lower` specifier. Suppose that the text in the table
differs in case.

```sqlexample
CREATE OR REPLACE TABLE collated_like (
  col_a VARCHAR,
  col_b VARCHAR COLLATE 'lower'
);

INSERT INTO collated_like VALUES ('abc', 'abc'), ('ABC','ABC');
```

Before the behavior change, each of the following queries that use the LIKE function return one row with the value `'abc'`:

```sqlexample
SELECT * FROM collated_like WHERE col_a LIKE '%b%';

SELECT * FROM collated_like WHERE col_a COLLATE 'lower' LIKE '%b%';

SELECT * FROM collated_like WHERE col_b LIKE '%b%';
```

```output
+-------+-------+
| COL_A | COL_B |
|-------+-------|
| abc   | abc   |
+-------+-------+
```

After the behavior change, the query that does not use the `lower` specification column with the LIKE function still returns one
row:

```sqlexample
SELECT * FROM collated_like WHERE col_a LIKE '%b%';
```

```output
+-------+-------+
| COL_A | COL_B |
|-------+-------|
| abc   | abc   |
+-------+-------+
```

However, the queries that use the [COLLATE](../../../sql-reference/functions/collate.md) function to specify `lower` and queries that pass a
column with the `lower` specification to the LIKE function return two rows:

```sqlexample
SELECT * FROM collated_like WHERE col_a COLLATE 'lower' LIKE '%b%';

SELECT * FROM collated_like WHERE col_b LIKE '%b%';
```

```output
+-------+-------+
| COL_A | COL_B |
|-------+-------|
| abc   | abc   |
| ABC   | ABC   |
+-------+-------+
```

This is roughly equivalent to [ILIKE](../../../sql-reference/functions/ilike.md). To determine if you should expect changes to these
queries, you can replace LIKE with ILIKE in these queries.

## Preserving the behavior before the change

If your columns use the `upper`, `lower`, or `trim` specification and you want to preserve the behavior before the change,
you can use the [COLLATE](../../../sql-reference/functions/collate.md) function with an empty specification to indicate that the `upper`,
`lower`, or `trim` specification associated with the column should not be used:

```sqlexample
SELECT * FROM collated_like WHERE col_b COLLATE '' LIKE '%b%';
```

> **Note:**
>
> If you are using this approach with the LIKE function, make sure that both the subject and pattern do not have a collation
> specification applied.

Ref: 1535

---
title: SQL general: Changes to error messages for subqueries
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_01/bcr-2140.md
section: Release Notes
---

# SQL general: Changes to error messages for subqueries

> **Attention:**
>
> This behavior change is in the 2026_01 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_01_bundle.md).

Error messages for subqueries behave in the following manner:

Before the change:
:   When a subquery returns an error, the error message contains no specific information about the
    SQL code or object.

After the change:
:   When some subqueries return an error, the error message contains specific information, such as
    the following details:

    * The line and position of the unsupported code in the subquery.
    * The name of the object, such as a view or function, that contains the unsupported subquery. When
      objects are nested, the name of the outermost object is returned. For example, if a view with an
      unsupported subquery is nested in a secure view, the error message shows the name of the secure view.
    * The name of the type of object, such as a masking policy, that contains the unsupported subquery.

There are no changes to the error codes related to subqueries.

> **Note:**
>
> This behavior change doesn’t apply to subqueries in lateral joins or user-defined table functions (UDTFs).

## Examples

The following examples show changes to the error messages for subqueries.

Create three tables and insert data into each of them:

```sqlexample
CREATE TABLE testsub1(a INT, b INT)
  AS SELECT * FROM VALUES
    (1, 1),
    (2, 2),
    (NULL, NULL);

CREATE TABLE testsub2(x INT, y INT)
  AS SELECT * FROM VALUES
    (1, 1),
    (2, 2),
    (NULL, NULL);

CREATE TABLE testsub3(u INT, v INT)
  AS SELECT * FROM VALUES
    (1, 1),
    (2, 2),
    (NULL, NULL);
```

These tables are used in the following examples:

### Unsupported subquery

Run an unsupported subquery that returns an error:

```sqlexample
SELECT *
  FROM testsub1
  WHERE a IN(
    SELECT x FROM testsub2 LEFT JOIN testsub3 ON x+a = u
  );
```

This query returns an error because the correlated column `a` is in the ON clause of a left join.

Before the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated
```

After the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated at line 4, position 4
```

### Secure view with an unsupported subquery

Create a secure view with an unsupported subquery, and query the view:

```sqlexample
CREATE SECURE VIEW svw
  AS SELECT *
    FROM testsub1
    WHERE a IN (
      SELECT x FROM testsub2 LEFT JOIN testsub3 ON x+a = u);

SELECT * FROM svw;
```

Before the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated
```

After the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated inside VIEW object: SVW
```

### Masking policy with an unsupported subquery

Create a masking policy with an unsupported subquery, alter a table to use the masking policy,
and query the table:

```sqlexample
CREATE MASKING POLICY mp AS
  (i INT) RETURNS INT -> IFF(i < (SELECT MAX(a) FROM svw), i, -1);

CREATE TABLE masked_testsub1(a INT, b INT)
  AS SELECT * FROM VALUES
    (1, 1),
    (2, 2),
    (NULL, NULL);

ALTER TABLE masked_testsub1
  ALTER COLUMN a SET MASKING POLICY mp;

SELECT * FROM masked_testsub1;
```

Before the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated
```

After the behavior change, the following error is returned:

```output
002031 (42601): SQL compilation error:
Unsupported subquery type cannot be evaluated inside MASKING POLICY
```

Ref: 2140

---
title: SQL general: New default column sizes for string and binary data types (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2118.md
section: Release Notes
---

# SQL general: New default column sizes for string and binary data types (Postponed)

> **Note:**
>
> This behavior change was part of the 2025_07 bundle, but the change has been postponed. The change will be introduced in a future bundle.
> The change is not available for testing.

When this behavior change bundle is enabled, the default sizes for
[string and binary data type](../../../sql-reference/data-types-text.md) change:

Before the change:
:   The default size for [text string data types](../../../sql-reference/data-types-text.md) was 16 MB.

    The default size for [binary data types](../../../sql-reference/data-types-text.md) was 8 MB.

After the change:
:   The default size for text string data types is 128 MB.

    The default size for binary data types is 64 MB.

Before this change, DDL statements could explicitly specify sizes larger than 16 MB for text string columns and 8 MB
for binary columns, but the defaults were 16 MB and 8 MB, respectively, when no size was specified.

After this behavior change, the default size is 128 MB for text string columns and 64 MB for binary columns when no
size is specified in DDL statements. So, INSERT statements can insert values larger than 16 MB into text string columns
and larger than 8 MB into binary columns without explicitly specifying larger sizes.

> **Note:**
>
> This change doesn’t affect DDL statements for Apache Iceberg™ tables and user-defined functions (UDFs) because
> they already use the larger default sizes.

The change applies to columns of the VARCHAR data type and to columns of data types that are synonymous with VARCHAR,
such as STRING, except for data types with a default size of 1 (including CHAR, CHARACTER, and NCHAR). The change also
applies to columns of the BINARY data type and to columns of data types that are synonymous with BINARY, such as VARBINARY.

For example, the following statement creates a table without specifying maximum sizes for the columns:

```sqlexample
CREATE TABLE test_new_default_sizes (
  name VARCHAR,
  profile_image BINARY);
```

Run the following query to show the maximum sizes of the columns:

```sqlexample
DESCRIBE TABLE test_new_default_sizes
  ->> SELECT "name", "type" FROM $1;
```

Before the behavior change, the query returns the following output:

```output
+---------------+-------------------+
| name          | type              |
|---------------+-------------------|
| NAME          | VARCHAR(16777216) |
| PROFILE_IMAGE | BINARY(8388608)   |
+---------------+-------------------+
```

After the behavior change, the query returns the following output:

```output
+---------------+--------------------+
| name          | type               |
|---------------+--------------------|
| NAME          | VARCHAR(134217728) |
| PROFILE_IMAGE | BINARY(67108864)   |
+---------------+--------------------+
```

## Views and materialized views can inherit large default sizes

In some cases, when you create a view or a materialized view that uses expressions in column definitions,
the columns inherit the new default sizes, even if the columns in the source table explicitly specify smaller
sizes.

For example, create a source table that explicitly sets the maximum size for a column of VARCHAR data type to
16777216:

```sqlexample
CREATE TABLE test_default_size_source_table (
  id INTEGER,
  description VARCHAR(16777216));
```

Create a view and a materialized view based on this table without using expressions in the column definitions:

```sqlexample
CREATE VIEW test_default_size_view AS
SELECT id, description FROM test_default_size_source_table;

CREATE MATERIALIZED VIEW test_default_size_mv AS
SELECT id, description FROM test_default_size_source_table;
```

Run the following queries to show the maximum sizes of the columns:

```sqlexample
DESCRIBE VIEW test_default_size_view
  ->> SELECT "name", "type" FROM $1;

DESCRIBE MATERIALIZED VIEW test_default_size_mv
  ->> SELECT "name", "type" FROM $1;
```

Both before and after the change, these queries return the following output:

```output
+-------------+-------------------+
| name        | type              |
|-------------+-------------------|
| ID          | NUMBER(38,0)      |
| DESCRIPTION | VARCHAR(16777216) |
+-------------+-------------------+
```

Create a view and a materialized view based on the source table and use expressions in the column definitions:

```sqlexample
CREATE VIEW test_default_size_view_with_exp AS
SELECT description || RANDSTR(10, 1) AS col
  FROM test_default_size_source_table;

CREATE MATERIALIZED VIEW test_default_size_mv_with_exp AS
SELECT description || RANDSTR(10, 1) AS col
  FROM test_default_size_source_table;
```

Run the following queries to show the maximum sizes of the columns:

```sqlexample
DESCRIBE VIEW test_default_size_view_with_exp
  ->> SELECT "name", "type" FROM $1;

DESCRIBE MATERIALIZED VIEW test_default_size_mv_with_exp
  ->> SELECT "name", "type" FROM $1;
```

Before the behavior change, these queries return the following output:

```output
+------+-------------------+
| name | type              |
|------+-------------------|
| COL  | VARCHAR(16777216) |
+------+-------------------+
```

After the behavior change, these queries return the following output:

```output
+------+--------------------+
| name | type               |
|------+--------------------|
| COL  | VARCHAR(134217728) |
+------+--------------------+
```

## Tables created using CREATE TABLE AS SELECT can inherit large default sizes

In some cases, when you create a table using a CREATE TABLE AS SELECT (CTAS) statement that uses expressions
in column definitions, the columns inherit the new default sizes, even if the columns in the source table explicitly
specify smaller sizes.

For example, create a source table that explicitly sets the maximum size for VARCHAR and BINARY columns:

```sqlexample
CREATE TABLE test_default_size_ctas_source_table (
  small_text VARCHAR(1000),
  medium_text VARCHAR(50000),
  large_text VARCHAR(16777216),
  binary_data BINARY(1000000));
```

Use a CTAS statement to create a table from this source table:

```sqlexample
CREATE TABLE test_default_size_ctas AS
SELECT small_text,
       medium_text,
       large_text || RANDSTR(10, 1) AS processed_text,
       binary_data
  FROM test_default_size_ctas_source_table;
```

In this example, the column definition for the `processed_text` column uses an expression.

Run the following queries to show the maximum sizes of the columns:

```sqlexample
DESCRIBE TABLE test_default_size_ctas
  ->> SELECT "name", "type" FROM $1;
```

Before the behavior change, the query returns the following output, and the `processed_text` column shows the
smaller default size:

```output
+----------------+-------------------+
| name           | type              |
|----------------+-------------------|
| SMALL_TEXT     | VARCHAR(1000)     |
| MEDIUM_TEXT    | VARCHAR(50000)    |
| PROCESSED_TEXT | VARCHAR(16777216) |
| BINARY_DATA    | BINARY(1000000)   |
+----------------+-------------------+
```

After the behavior change, the query returns the following output, and the `processed_text` column shows the
larger default size:

```output
+----------------+--------------------+
| name           | type               |
|----------------+--------------------|
| SMALL_TEXT     | VARCHAR(1000)      |
| MEDIUM_TEXT    | VARCHAR(50000)     |
| PROCESSED_TEXT | VARCHAR(134217728) |
| BINARY_DATA    | BINARY(1000000)    |
+----------------+--------------------+
```

Ref: 2118

---
title: SQL improvements
source: https://docs.snowflake.com/en/release-notes/sql-improvements.md
section: Release Notes
---

# SQL improvements

Snowflake is continually introducing enhancements that make it easier to write queries. With these new keywords and functions,
you can write simpler, shorter SELECT statements.

## SQL improvements in 2025

The following SQL improvements were introduced in 2025:

| Date released | Improvement | Impact |
| --- | --- | --- |
| October 2025 | Directed joins are now generally available and are no longer in preview. You can enforce join ordering when you run a query with the [JOIN](../sql-reference/constructs/join.md) clause by adding the `DIRECTED` keyword. | You can more easily migrate workloads into Snowflake that have join order directives and possibly improve performance by scanning joined tables in a specific order. |
| October 2025 | In [PIVOT](../sql-reference/constructs/pivot.md) queries, you can use the AS clause to specify aliases for the pivot column names. In [UNPIVOT](../sql-reference/constructs/unpivot.md) queries, you can use the AS clause to specify aliases for column names that appear in the result of the UNPIVOT operation. | The AS clause makes it easier to customize column names that appear in the output for PIVOT and UNPIVOT operations. |
| October 2025 | You can use the `WHEN MATCHED ... THEN ALL BY NAME` and `WHEN NOT MATCHED ... THEN ALL BY NAME` subclauses in the [MERGE](../sql-reference/sql/merge.md) command to update or insert all columns in the target table with changes from the source. | When the target table and the source have the same number of columns and the same names for these columns, you can use these subclauses to avoid maintaining column lists in the INSERT and UPDATE clauses of MERGE statements. |
| September 2025 | You can use the [RESAMPLE](../sql-reference/constructs/resample.md) clause and a set of [interpolation functions](../sql-reference/functions/interpolate_bfill.md) to fill gaps in time-series data. | This SQL functionality simplifies the process of generating continuous, uniformly-sampled time-series data. |
| August 2025 | Preview support for directed joins. You can enforce join ordering when you run a query with the [JOIN](../sql-reference/constructs/join.md) clause by adding the `DIRECTED` keyword. | You can more easily migrate workloads into Snowflake that have join order directives and possibly improve performance by scanning joined tables in a specific order. |
| July 2025 | You can specify the [ORDER BY ALL](../sql-reference/constructs/order-by.md) clause to sort by all columns specified in the SELECT list. | You can sort results by all columns in the SELECT list without having to specify each column by name. |
| June 2025 | You can use the [UNION BY NAME operator](../sql-reference/operators-query.md) to combine rows by name instead of by position. | The UNION BY NAME operator simplifies combining subsets of columns that have different positions in the tables. |
| May 2025 | You can use the [pipe operator](../sql-reference/operators-flow.md) (`->>`) to chain SQL statements together. In the chain of SQL statements, the results of one statement can serve as the input to another statement. | The pipe operator can simplify the execution of dependent SQL statements and improve the readability and flexibility of complex SQL operations. |
| March 2025 | You can use the [spread operator](../sql-reference/operators-expansion.md) (`**`) to expand an array into a list of individual values. | The spread operator can simplify function calls and queries that accept a variable number of values. For more information, see the [Snowflake Introduces SQL Spread Operator (\*\*)](https://www.snowflake.com/en/engineering-blog/sql-spread-operator/) blog post. |
| February 2025 | The [SEARCH](../sql-reference/functions/search.md) function supports conjunctive (AND) semantics. | When you specify `'AND'` for the SEARCH_MODE argument, there is a match if the tokens extracted from at least one of the columns or fields being searched match all of the tokens extracted from the search string. |
| January 2025 | Support for row-based and range-based window frames in the [ARRAY_AGG](../sql-reference/functions/array_agg.md) function. | Users can aggregate subsets of data by collecting the values from moving window frames into an array. |

## SQL improvements in 2024

The following SQL improvements were introduced in 2024:

| Date released | Improvement | Impact |
| --- | --- | --- |
| November 2024 | [Full-text search](../user-guide/querying-with-search-functions.md) with the [SEARCH](../sql-reference/functions/search.md) and [SEARCH_IP](../sql-reference/functions/search_ip.md) functions is now generally available and is no longer in preview. | You can find character data (text) and IPv4 addresses in specified columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. |
| October 2024 | Support for querying objects up to 128 MB in files on a stage. | You can more easily reduce the size of an object before storing it in a column. Also, with the 9.17 release, you can now store objects larger than 16 MB in a column. For more information, see [Size limits for database objects](../user-guide/data-load-considerations-prepare.md). |
| October 2024 | Support for [higher-order functions](../user-guide/querying-semistructured.md) extended with the [REDUCE](../sql-reference/functions/reduce.md) function. | You can use lambda expressions to reduce semi-structured and structured data, providing a concise, readable, and efficient way to perform data manipulation and advanced analysis. |
| September 2024 | Support for [selecting from a stored procedure](../developer-guide/stored-procedure/stored-procedures-selecting-from.md) that returns tabular data. | You can simplify the SQL statements for saving results to a table. For example, rather than using the [SQLID](../developer-guide/snowflake-scripting/query-id.md) Snowflake Scripting variable with the [RESULT_SCAN](../sql-reference/functions/result_scan.md) function to create a table containing the query results, you can use a query that directly selects from the results. |
| September 2024 | Support extended for [RANGE BETWEEN window frames with explicit offsets](../sql-reference/functions-window-syntax.md) (n PRECEDING and n FOLLOWING) to include the [FIRST_VALUE](../sql-reference/functions/first_value.md) and [LAST_VALUE](../sql-reference/functions/last_value.md) window functions. | You can use additional functions to run moving aggregations when expected or unexpected missing records cause gaps to occur in time-series data sets. |
| August 2024 | [RANGE BETWEEN window frames with explicit offsets](../sql-reference/functions-window-syntax.md) (n PRECEDING and n FOLLOWING) are now generally available and are no longer in preview. | You can more easily run moving aggregations when expected or unexpected missing records cause gaps to occur in time-series data sets. |
| August 2024 | Preview support for [full-text search](../user-guide/querying-with-search-functions.md) with the [SEARCH](../sql-reference/functions/search.md) function and the [SEARCH_IP](../sql-reference/functions/search_ip.md) function. | You can find character data (text) and IPv4 addresses in specified columns from one or more tables, including fields in VARIANT, OBJECT, and ARRAY columns. |
| August 2024 | Support for using the ILIKE and EXCLUDE keywords for filtering in a SELECT list or GROUP BY clause in [function calls](2024/8_30.md) and [object constants](../sql-reference/data-types-semistructured.md). | In function calls and object constants, you can filter for columns that match a pattern, and you can exclude specific columns. |
| July 2024 | Support for specifying wildcards in [OBJECT constants](../sql-reference/data-types-semistructured.md) for filtering in a SELECT list or GROUP BY clause. | You can construct an OBJECT value from the specified data using the attribute names as keys and the associated values as values. |
| June 2024 | Preview support for [RANGE BETWEEN window frames with explicit offsets](../sql-reference/functions-window-syntax.md) (n PRECEDING and n FOLLOWING) for the following window functions: [AVG](../sql-reference/functions/avg.md), [COUNT](../sql-reference/functions/count.md), [MIN](../sql-reference/functions/min.md), [MAX](../sql-reference/functions/max.md) and [SUM](../sql-reference/functions/sum.md). | You can more easily run moving aggregations when expected or unexpected missing records cause gaps to occur in time-series data sets. |
| May 2024 | Support for using the `{ INCLUDE | EXCLUDE } NULLS` option in an [UNPIVOT](../sql-reference/constructs/unpivot.md) subclause to specify whether to include rows with NULL values in the results. | You have more flexibility when you use the UNPIVOT subclause in a SQL statement. |
| May 2024 | Support for using the TABLE keyword to [get a reference to a table, view, secure view, or query](../developer-guide/stored-procedure/stored-procedures-calling-references.md) and to [call a method in a class in the FROM clause](../sql-reference/snowflake-db-classes.md). | You can use the TABLE keyword to write simpler SQL statements. |
| May 2024 | New [ASOF JOIN](../sql-reference/constructs/asof-join.md) construct. | You can write simpler SQL statements to join tables that contain [time-series data](../user-guide/querying-time-series-data.md). |
| May 2024 | Support for specifying the ANY keyword or a subquery with the [PIVOT](../sql-reference/constructs/pivot.md) construct. | You can easily pivot on all distinct values or on all values returned by a subquery. |
| May 2024 | Support for the [FILTER](../sql-reference/functions/filter.md) and [TRANSFORM](../sql-reference/functions/transform.md) [higher-order functions](../user-guide/querying-semistructured.md). | You can use lambda expressions to filter and transform semi-structured and structured data, providing a concise, readable, and efficient way to perform data manipulation and advanced analysis. |
| March 2024 | New [GREATEST_IGNORE_NULLS](../sql-reference/functions/greatest_ignore_nulls.md) and [LEAST_IGNORE_NULLS](../sql-reference/functions/least_ignore_nulls.md) functions. | You can return the lowest or highest non-NULL value from a list of expressions. |
| March 2024 | Support for trailing commas in [SELECT](../sql-reference/sql/select.md) lists. | You can delete or move the last columns in a SELECT list without having to delete the preceding comma. |
| February 2024 | Support for the `upper`, `lower`, and `trim` [collations](../sql-reference/collation.md) in [additional SQL functions](../sql-reference/collation.md). | You can pass strings that use the `upper`, `lower`, and `trim` collations to these functions without having to change the collation. |

## SQL improvements in 2023

The following SQL improvements were introduced in 2023:

| Date released | Improvement | Impact |
| --- | --- | --- |
| August 2023 | New [ARRAY_MIN](../sql-reference/functions/array_min.md), [ARRAY_MAX](../sql-reference/functions/array_max.md), and [ARRAY_SORT](../sql-reference/functions/array_sort.md) functions. | You can now easily select the array elements with the lowest value and the highest value.  You can easily get a sorted array of elements. |
| August 2023 | New ILIKE and REPLACE parameters in the [SELECT](../sql-reference/sql/select.md) command. | You can now select all columns that match a pattern containing SQL wildcards.  When selecting all columns, you can replace the value of specific columns with expressions. |
| July 2023 | New ALL keyword in the [GROUP BY](../sql-reference/constructs/group-by.md) construct. | You can group results by all non-aggregate columns in the SELECT list without having to specify each column by name. |
| February 2023 | Support for bankers’ rounding ([rounding half to even](https://en.wikipedia.org/wiki/Rounding#Rounding_half_to_even)) in the [ROUND](../sql-reference/functions/round.md) function. | You can now use bankers’ rounding when rounding values. |
| January 2023 | New [MIN_BY](../sql-reference/functions/min_by.md) and [MAX_BY](../sql-reference/functions/max_by.md) functions. | You can find the row containing the minimum or maximum value in a column and retrieve the value from a different column. |

## SQL improvements in 2022

The following SQL improvements were introduced in 2022:

| Date released | Improvement | Impact |
| --- | --- | --- |
| November 2022 | New EXCLUDE and RENAME parameters in the [SELECT](../sql-reference/sql/select.md) command. | You can now select all columns and specify that you want to exclude or rename specific columns. |
| November 2022 | New [ARRAY_EXCEPT](../sql-reference/functions/array_except.md) and [ARRAY_DISTINCT](../sql-reference/functions/array_distinct.md) functions. | You can now easily select the array elements that are in one array but not in another array.  You can easily get the distinct elements in an array. |
| May 2022 | New [REGEXP_SUBSTR_ALL](../sql-reference/functions/regexp_substr_all.md) function. | You can now easily extract the substrings that match a regular expression from a string.. |

---
title: SQL parameters: Disallow setting date and time output formats to AUTO
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2115.md
section: Release Notes
---

# SQL parameters: Disallow setting date and time output formats to AUTO

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, you can’t set
[date and time output formats](../../../sql-reference/date-time-input-output.md) to `AUTO`:

Before the change:
:   Date and time output formats could be set to `AUTO`.

After the change:
:   Date and time output formats can’t be set to `AUTO`.

The following parameters define the formats for date and time output from Snowflake:

* [DATE_OUTPUT_FORMAT](../../../sql-reference/parameters.md)
* [TIME_OUTPUT_FORMAT](../../../sql-reference/parameters.md)
* [CSV_TIMESTAMP_FORMAT](../../../sql-reference/parameters.md)
* [TIMESTAMP_OUTPUT_FORMAT](../../../sql-reference/parameters.md)
* [TIMESTAMP_LTZ_OUTPUT_FORMAT](../../../sql-reference/parameters.md)
* [TIMESTAMP_NTZ_OUTPUT_FORMAT](../../../sql-reference/parameters.md)
* [TIMESTAMP_TZ_OUTPUT_FORMAT](../../../sql-reference/parameters.md)

Before this change, setting these parameters to `AUTO` was undocumented but allowed.

After this change, the `AUTO` setting is no longer allowed for these parameters.

If you use this undocumented and unsupported parameter setting, complete the following actions:

1. Remove all parameter overrides that set the date and time output parameters to `AUTO`.

   The setting is case insensitive.
2. Check your scripts and other code for the setting, and if any of your scripts or code sets the date and time output parameters
   to `AUTO`, remove the setting or replace it with a valid setting.

   After the change, attempting to set a date and time output parameter to this value returns the following error:

   ```output
   SQL compilation error: invalid value [auto] for parameter '<parameter_name>'
   ```

Ref: 2115

---
title: SQLAlchemy release notes
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy.md
section: Release Notes
---

# SQLAlchemy release notes

The SQLAlchemy release notes provide details for each release, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

You can browse the release notes for the following years:

* [2026 releases](sqlalchemy-2026.md)
* [2025 releases](sqlalchemy-2025.md)
* [2024 releases](sqlalchemy-2024.md)
* [2023 releases](sqlalchemy-2023.md)
* [2022 releases](sqlalchemy-2022.md)

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

---
title: SQLAlchemy release notes for 2022
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy-2022.md
section: Release Notes
---

# SQLAlchemy release notes for 2022

This article contains the release notes for the SQLAlchemy, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SQLAlchemy updates.

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

## Version 1.4.5 (December 9, 2022)

### New features

* Updated the application name for the driver connection from SnowflakeConnection to SnowflakeSQLAlchemy.

## Version 1.4.4 (November 16, 2022)

### Bug Fixes

* Fixed a bug where percent signs (%) in a non-compiled statement should not be interpolated with empty sequence
  when executed.

## Version 1.4.3 (October 21, 2022)

### Bug fixes

* Fixed an issue `whereSnowflakeDialect.normalize_name` and `SnowflakeDialect.denormalize_name` could not handle
  empty strings.
* Fixed a compatibility issue to vendor function `sqlalchemy.engine.url._rfc_1738_quote` as it is removed from
  SQLAlchemy v1.4.42.

## Version 1.4.2 (September 28, 2022)

### Updates

* Improved reliability by always using context managers.

## Version 1.4.1 (August 23, 2022)

### Updates

* None.

### Bug Fixes

* Fixed an issue where DATE was incorrectly removed from `SnowflakeDialect.ischema_names`.
* Fixed breaking changes introduced in release 1.4.0 that:

  + Changed the behavior of processing numeric, datetime, and timestamp values returned from service.
  + Changed the sequence order of primary/foreign keys in list returned by `inspect.get_foreign_keys`
    and `inspect.get_pk_constraint`.

## Version 1.4.0 (July 21, 2022)

### New Features

* Added support for `regexp_match` and `regexp_replace` in `sqlalchemy.sql.expression.ColumnOperators`.
* Added support for Identity Column.
* Added support for handling literal values for the sql types: `Date`, `DateTime`, `Time`, `Float`, and `Numeric`;
  also added support for converting the values into corresponding Python objects.
* Added support for `get_sequence_names` in `SnowflakeDialect`.

### Bug Fixes

* Fixed a bug where insert with `autoincrement` failed due to incompatible column type affinity.
* Fixed a bug when creating a column with sequence, default value was set incorrectly.
* Fixed a bug that identifier having percents in a compiled statement was not interpolated.
* Fixed a bug when visiting sequence value from another schema, the sequence name is not formatted with the schema name.
* Fixed a bug where the sequence order of columns were not maintained when retrieving primary keys and foreign
  keys for a table.

---
title: SQLAlchemy release notes for 2023
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy-2023.md
section: Release Notes
---

# SQLAlchemy release notes for 2023

This article contains the release notes for the SQLAlchemy, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SQLAlchemy updates.

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

## Version 1.5.1 (November 2, 2023)

### New features and updates

* None.

### Bug fixes

* Fixed a compatibility issue with restrictions on outer lateral joins. For more details check [Table Functions (Except SQL UDTFs): Restrictions With Lateral Table Functions and Outer Lateral Joins](../bcr-bundles/2023_04/bcr-1057.md).
* Fixed credentials with externalbrowser authentication not caching due to incorrect parsing of Boolean query parameters, as well as other Boolean parameters passed to the driver.

## Version 1.5.0 (August 24, 2023)

### New features and updates

* Added support for the `GEOMETRY` data type.

### Bug fixes

* Fixed a compatibility issue with standard SQLAlchemy 1.4.49 library.

## Version 1.4.7 (March 21, 2023)

### New features and updates

* `SnowflakeDialect.get_columns` now throws a `NoSuchTableError` exception when the specified table doesn’t exist,
  instead of the more vague `KeyError`.
* Fixed a bug where dialect can not be created with empty host name.
* Fixed a bug where `sqlalchemy.func.now` did not render correctly.

### Bug fixes

* None.

## Version 1.4.6 (February 8, 2023)

### New features and updates

* Bumped the `snowflake-connector-python` dependency to newest version, which supports Python 3.11.
* Reverted the change of application name introduced in version 1.4.5 until the feature is supported.

### Bug fixes

None.

---
title: SQLAlchemy release notes for 2024
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy-2024.md
section: Release Notes
---

# SQLAlchemy release notes for 2024

This article contains the release notes for the SQLAlchemy, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SQLAlchemy updates.

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

## Version 1.7.2 (December 17, 2024)

### New features and updates

* Added support for structured `OBJECT` and `ARRAY` datatypes.

### Bug fixes

* Fixed an issue with quoting an underscore (`_`) as a column name.
* Fixed an issue with index columns not being reflected.
* Fixed an issue with the index reflection cache.

## Version 1.7.1 (December 02, 2024)

### New features and updates

* Added support for PARTITION BY to the COPY INTO command.

### Bug fixes

* Fixed the `BOOLEAN type not found` error in `snowdialect`.

## Version 1.7.0 (November 21, 2024)

### New features and updates

* Added support for the following features:

  + Dynamic Tables
  + Hybrid Tables
  + Iceberg Tables with the Snowflake Catalog
* Added support for the `MAP` data type.
* Added the ability to define options in key arguments instead of arguments.
* Updated the `cluster_by` option to support explicit expressions

### Bug fixes

* Fixed the `SAWarning` when registering functions with existing name in the default namespace.

## Version 1.6.1 (July 9, 2024)

### New features and updates

* None.

### Bug fixes

* Updated the internal project workflow with pypi publishing.

## Version 1.6.0 (July 8, 2024)

### New features and updates

* Added support for SQLAlchemy 2.0 syntax.

### Bug fixes

* None.

## Version 1.5.3 (April 16, 2024)

### New features and updates

* Limited the maximum SQLAlchemy dependency version to lower than 2.0.0.

### Bug fixes

* None.

## Version 1.5.2 (April 11, 2024)

### New features and updates

* Bumped the minimum SQLAlchemy version to 1.4.19 for Outer Lateral Joins.
* Added support for sequence ordering in tests.

### Bug fixes

* None.

---
title: SQLAlchemy release notes for 2025
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy-2025.md
section: Release Notes
---

# SQLAlchemy release notes for 2025

This article contains the release notes for the SQLAlchemy, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SQLAlchemy updates.

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

## Version 1.8.2 (December 10, 2025)

### New features and updates

* None.

### Bug fixes

* Aligned the supported maximum python version with snowflake-connector-python to 3.13.

## Version 1.8.0 (December 04, 2025)

### New features and updates

* Added logging of the SQLAlchemy version and pandas (if used).
* Added support for Python 3.14 and earlier.

### Bug fixes

* None.

## Version 1.7.7 (Sep 09, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue that threw an exception for structured type columns dropped while collecting metadata.

## Version 1.7.6 (July 10, 2025)

### New features and updates

* None.

### Bug fixes

* Fixed an issue with `get_multi_indexes` that assigned the wrong returned indexes when processing multiple indexes in a table.

## Version 1.7.4 (June 10, 2025)

### New features and updates

* Updated `README.md` to include instructions on how to verify package signatures using cosign.

### Bug fixes

* Fixed a dependency on DESCRIBE TABLE columns quantity (differences in columns caused by Snowflake parameters).
* Fixed an unnecessary condition that caused issues when parsing `StructuredTypes` columns.

## Version 1.7.3 (January 14, 2025)

### New features and updates

* Added the `force_div_is_floordiv` flag to override the new default value (`False`) for `div_is_floor_div` in `SnowflakeDialect`.

  + When `force_div_is_floordiv` is `False`, the division (`/`) operator is treated as a float division, while the `//` operator is treated as floor division.
  + This flag is added to maintain backward compatibility with the previous `SnowflakeDialect` behavior.
  + This flag will be removed in a future release, and `SnowflakeDialect` will use `div_is_floor_div` as `False`.

### Bug fixes

* Fixed an issue with support for the SqlAlchemy ARRAY,
* Fixed the return value of `snowflake get_table_names`.
* Fixed incorrect quoting of identifiers beginning with an underscore (`_`).
* Fixed the “ARRAY type not supported in HYBRID tables” error.

---
title: SQLAlchemy release notes for 2026
source: https://docs.snowflake.com/en/release-notes/clients-drivers/sqlalchemy-2026.md
section: Release Notes
---

# SQLAlchemy release notes for 2026

This article contains the release notes for the SQLAlchemy, including the following when applicable:

* Behavior changes
* New features
* Customer-facing bug fixes

Snowflake uses semantic versioning for SQLAlchemy updates.

See [Using the Snowflake SQLAlchemy toolkit with the Python Connector](../../developer-guide/python-connector/sqlalchemy.md) for documentation.

## Version 1.9.0 (March 04, 2026)

### New features and updates

* Added support for `DECFLOAT` and `VECTOR` data types.
* Added support for `server_version_info` support.
* Added support for `ILIKE` in queries.
* Introduced a shared helper for fully-qualified schema name resolution, replacing inconsistent ad-hoc patterns across reflection methods.
* Refactored column reflection internals into dedicated helpers to reduce complexity without changing behavior.
* Added `pytest-xdist` parallel test support via per-worker schema provisioning hooks.
* Bumped pandas lower bound in the sa14 test environment from <2.1 to >=2.1.1,<2.2 to ensure pre-built wheels are available for Python 3.12.
* Added support for timezone in timestamp and datetime types.

### Bug fixes

* Fixed `SYSDATE()` rendering.
* Fixed and improved schema reflection.
* Fixed a crash issue when reflecting without specifying a schema, caused by `None` arguments in internal schema resolution.
* Fixed a crash issue when SHOW TABLES returns empty string table names, causing `IndexError` during reflection.
* Fixed incomplete identity column reflection metadata. This column now includes all fields required by SQLAlchemy 2.0+ (`always`, `cycle`, `order`, and so on).
* Fixed SQLAlchemy version parsing.

---
title: STAGES View (ACCOUNT_USAGE): New Column STORAGE_INTEGRATION
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1547.md
section: Release Notes
---

# STAGES View (ACCOUNT_USAGE): New Column STORAGE_INTEGRATION

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the [ACCOUNT_USAGE.STAGES view](../../../sql-reference/account-usage/stages.md) includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| STORAGE_INTEGRATION | TEXT | The name of the [storage integration](../../../sql-reference/sql/create-storage-integration.md) associated with the stage; NULL for internal stages or stages that do not use a storage integration. |

Ref: 1547

---
title: Standard warehouses: Gen2 is the default generation (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2250.md
section: Release Notes
---

# Standard warehouses: Gen2 is the default generation (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

Gen2 is the latest generation of standard warehouses, with improved performance for analytics
and data engineering workloads. We recommend using it for new warehouses.

When you create a standard warehouse without specifying a `GENERATION`
value, the default generation is changing.

Before the change:
:   The default generation for a new standard warehouse depends on the cloud service provider region
    and whether your organization was created after Gen2 support became available in that region.
    For organizations or regions where Gen2 isn’t the default, Snowflake creates a Gen1 standard warehouse.

After the change:
:   In regions where Gen2 is available, the default generation for all new standard warehouses
    is Gen2 (`GENERATION = '2'`). If you don’t
    specify the `GENERATION` clause when you create a standard
    warehouse, Snowflake creates a Gen2 standard warehouse. In regions where Gen2 isn’t
    available, the default remains Gen1. For a list of supported regions, see
    [Region availability](../../../user-guide/warehouses-gen2.md).

    Existing Gen1 warehouses, whether running or suspended, remain Gen1 unless you explicitly
    change them.

    > **Note:**
    >
    > Currently, a CREATE OR ALTER command without a GENERATION clause on a Gen1 warehouse will change
    > the warehouse to Gen2 if it performs an ALTER operation. To keep an existing warehouse as Gen1,
    > include the clause `GENERATION = '1'` in the CREATE OR ALTER command.

    X5LARGE and X6LARGE warehouse sizes don’t support Gen2. Warehouses created with those
    sizes default to Gen1.

    Snowpark-optimized warehouses aren’t affected by this change. The GENERATION clause applies
    only to standard warehouses.

For more information, see [Snowflake generation 2 standard warehouses](../../../user-guide/warehouses-gen2.md).

Ref: 2250

---
title: STORAGE_USAGE view (ACCOUNT_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2098.md
section: Release Notes
---

# STORAGE_USAGE view (ACCOUNT_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the [STORAGE_USAGE](../../../sql-reference/account-usage/storage_usage.md) view in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `archive_storage_cool_bytes` | NUMBER | Reserved for future use. |
| `archive_storage_cold_bytes` | NUMBER | Reserved for future use. |
| `archive_storage_retrieval_temp_bytes` | NUMBER | Reserved for future use. |

Ref: 2098

---
title: Stored Procedures and UDTFs: Argument Names Respected in Calls
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1017.md
section: Release Notes
---

# Stored Procedures and UDTFs: Argument Names Respected in Calls

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Calls to stored procedures and user-defined table functions (UDTFs), where the call includes named arguments,
Snowflake will give precedence to argument names over argument position as follows.

Previously:
:   When calling procedures or user-defined table functions (UDTFs) that have named arguments, Snowflake ignores argument names and uses argument position to determine which value is passed to the procedure or function.

Currently:
:   Argument names will take precedence over argument position in calls to procedures and UDTFs. This may result in the following behavior that differs from before the behavior change:

    * A previously-working call to a stored procedure or user-defined table function could result in an error.
    * Argument names could cause arguments to be passed in a different order than previously. This could result in an error, wrong results, or wrong data insertion.
    * Argument names could cause a different stored procedure with the same name to be called. This could result in an error, wrong results, or wrong data insertion.
    * The following example illustrates how calls to two stored procedures might differ before and after the change.

    The following example illustrates how calls to two stored procedures might differ before and after the change.

    ```sqlexample
    -- One stored procedure
    CREATE OR REPLACE PROCEDURE proc(STRARG1 varchar, STRARG2 varchar)
    RETURNS VARCHAR
    LANGUAGE SQL
    AS
    $$
    BEGIN
      return 'hello';
    END
    $$;
    -- Another procedure
    CREATE OR REPLACE PROCEDURE proc(ARG1 number, ARG2 number)
    RETURNS BOOLEAN
    LANGUAGE SQL
    AS
    $$
    BEGIN
      return ARG1 < ARG2;
    END
    $$;
    -- Example A
    -- Before: returns 'hello'
    -- After: returns TRUE
    CALL PROC(ARG1 => '5', ARG2 => '100');
    -- Example B
    -- Before: returns TRUE
    -- After: returns 'hello'
    CALL PROC(STRARG1 => 5, STRARG2 => 100); -- 'hello'.
    ```

Ref: 1017

---
title: Stored Procedures: Calls to BUILD_SCOPED_FILE_URL Function Allowed Within Owner’s Rights Procedures
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1007.md
section: Release Notes
---

# Stored Procedures: Calls to BUILD_SCOPED_FILE_URL Function Allowed Within Owner’s Rights Procedures

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Calling the BUILD_SCOPED_FILE_URL function in an owner’s right stored procedure behaves as follows

Previously:
:   Attempting to call the [BUILD_SCOPED_FILE_URL](../../../sql-reference/functions/build_scoped_file_url.md) function in an [owner’s rights stored procedure](../../../developer-guide/stored-procedure/stored-procedures-rights.md) results in an internal error.

Current:
:   Calling the BUILD_SCOPED_FILE_URL in an owner’s rights stored procedure will succeed.

Ref: 1007

---
title: Stored Procedures: put_stream Uses Different Way to Get the File Name
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-943.md
section: Release Notes
---

# Stored Procedures: put_stream Uses Different Way to Get the File Name

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The way you use `put_stream` inside stored procedures to upload files to a stage has changed and now aligns with the `put_stream`
protocol to Snowpark Python client.

Previously:
:   Uploading files using `put_stream` from stored procedures was called using a `stage_prefix` and an `input_stream`. The stored
    procedure inferred the file name from `input_stream` using `input_stream.name` and uploading it to the `stage_prefix.put_stream`
    would break in the case where `input_stream` did not have the `name` attribute. This behavior differed from `put_stream` on
    the Snowpark Python client.

Currently:
:   Uploading files using put_stream from stored procedures is called using a `stage_location` = `stage_prefix` + `/` + `file_name`,
    and an `input_stream`. The stored procedure infers the file name using `stage_location`. This would work in the case where
    `input_stream` does not have a `name` attribute and the behavior also aligns with the Snowpark Python client.

> **Note:**
>
> Customers using `put_stream` without a full stage location should update their code to upload files using a full stage location with
> stage prefix and target file name.

Ref: 943

---
title: Stream and Task Replication: Changes for GA
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1048.md
section: Release Notes
---

# Stream and Task Replication: Changes for GA

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Stream and task replication is currently in preview and can be [explicitly enabled](../../../user-guide/account-replication-config.md) for a database, replication or failover group, or an account.

In a future release, stream and task replication will be generally enabled and will change as follows:

Currently:
:   * Streams and tasks in a database are not replicated unless you have enabled the stream and task replication preview.
    * If you have not explicitly enabled stream and task replication, replication of a database with streams succeeds,
      but the streams are ignored (that is to say, they are not replicated to the target account).

Pending:
:   Replication of databases that contain streams will fail in the following scenarios:

    > * If the database that contains the stream and the database that contains the base object referenced by the stream:
    >
    >   > + Are separate databases *or*
    >   > + Are in different replication or failover groups
    > * If a database contains a stream that references an unsupported object. These include:
    >
    >   > + Directory tables
    >   > + External tables
    >   > + Tables or views in shared databases (i.e. databases shared from provider accounts to your account)
    >   > + Dropped objects

    Tasks that have an initial version of the task set in the source account will be replicated to target accounts. For task replication scenarios, refer to [Replication Scenarios](../../../user-guide/account-replication-considerations.md).

To avoid replication failures for databases containing streams:

* The replicated database must include both the stream and its base object *or*
* The database that contains the stream and the database that contains the base object referenced by the stream must be included in the same replication or failover group.

For detailed information on stream and task replication, refer to [Replication and Streams](../../../user-guide/account-replication-considerations.md) and [Replication and Tasks](../../../user-guide/account-replication-considerations.md).

Ref: 1048

---
title: Streamlit in Snowflake: Default Python version for Streamlit in Snowflake apps changes from 3.8 to 3.11
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1804.md
section: Release Notes
---

# Streamlit in Snowflake: Default Python version for Streamlit in Snowflake apps changes from 3.8 to 3.11

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

The Streamlit in Snowflake apps behave as follows:

Before the change:
:   All Streamlit in Snowflake apps run in Python 3.8.

After the change:
:   * Streamlit in Snowflake apps support Python 3.8, 3.9, 3.10, and 3.11. You can either pin the Python version in the `environment.yml` file or choose the version in the Streamlit editor in Snowsight by selecting Packages.
    * Newly created Streamlit in Snowflake apps run in Python 3.11 by default.
    * Streamlit in Snowflake apps that do not have a Python version pinned in the `environment.yml` file, run in Python 3.11 by default.
    * Streamlit in Snowflake apps that have a Python version pinned to Python 3.8 in the `environment.yml` file, continue to run as before.

Ref: 1804

---
title: Streamlit in Snowflake: Enable Git integration and multi-file editing for Streamlit in Snowflake apps
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1888.md
section: Release Notes
---

# Streamlit in Snowflake: Enable Git integration and multi-file editing for Streamlit in Snowflake apps

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

Streamlit in Snowflake apps behave as follows:

Before this change:
:   Git integration and multi-file editing for Streamlit in Snowflake apps are not supported in Snowsight.

After this change:
:   Git integration and multi-file editing are supported for new Streamlit in Snowflake apps and include changes in the following areas:

    * Snowsight

      > + Supports connecting Streamlit in Snowflake apps to a Git repository.
      > + Supports creating Streamlit in Snowflake apps from a Git repository.
      > + Supports editing multiple files within Snowsight itself.
    * Changes to CREATE STREAMLIT and ALTER STREAMLIT commands
    * New columns in DESCRIBE STREAMLIT output

    > **Note:**
    >
    > The existing Streamlit in Snowflake apps using [ROOT_LOCATION](../../../sql-reference/sql/create-streamlit.md) work as before,
    > but Git integration and multi-file editing are not supported.

## Snowsight

### Create a Streamlit in Snowflake app from a file in a Git repository

To create a Streamlit app from a file in a Git repository, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit.
3. Next to + Streamlit, open the drop-down menu and select Create from repository.
4. For File location in repository, select the repository and branch in the repository that contain the Streamlit app file, then select
   the specific `.py` file. For details on connecting Snowflake to your Git repository, see [Setting up Snowflake to use Git](../../../developer-guide/git/git-setting-up.md).
5. For App location, select a database and schema to contain the Streamlit app. These cannot be changed after you create the app.
6. For Query warehouse and App warehouse, select a warehouse.
7. Select Create to create a Streamlit app from the `.py` file in your Git repository.

### Connect a Streamlit in Snowflake app with a Git repository

To connect an existing Streamlit app to a Git repository, do the following:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then open or create a Streamlit app.
3. In the Files tab, next to the database object explorer, select Connect Git Repository.
4. For File location in repository, select the repository and branch in the repository with which you want to sync the Streamlit app.
5. Select Select Folder.
6. When you are prompted to commit and push your app to the Git repository, select Push to Git.

### Edit multiple files within Snowsight

To edit multiple files of your Streamlit in Snowflake app within Snowsight:

1. Sign in to [Snowsight](../../../user-guide/ui-snowsight-gs.md).
2. In the navigation menu, select Projects » Streamlit, and then open or create a Streamlit app.
3. In the Files tab, select a file to edit.

## Changes to CREATE STREAMLIT and ALTER STREAMLIT commands

When this behavior change bundle is enabled, the following changes to [CREATE STREAMLIT](../../../sql-reference/sql/create-streamlit.md)
and [ALTER STREAMLIT](../../../sql-reference/sql/alter-streamlit.md) commands are introduced.

### CREATE STREAMLIT

The code for Streamlit apps no longer requires a separate stage object; the Streamlit object stores the app code.
Using the FROM clause, you can indicate the existing location to copy the code from to the Streamlit app.

#### Syntax

```sqlsyntax
CREATE [ OR REPLACE ] STREAMLIT [ IF NOT EXISTS ] <name>
  [ { VERSION <version_alias_name> |
      VERSION (COMMENT = <version_comment>) |
      VERSION <version_alias_name> (COMMENT = <version_comment>) } ]
  [ FROM <source_location>]
  MAIN_FILE = '<path_to_main_file_in_root_directory>'
  QUERY_WAREHOUSE = <warehouse_name>
  [ COMMENT = <create_comment> ]
  [ DEFAULT_VERSION = <default_version_name_or_alias> ]
  [ TITLE = '<app_title>' ]
  [ IMPORTS = ( '<stage_path_and_file_name_to_read>' [ , ... ] ) ]
  [ EXTERNAL_ACCESS_INTEGRATIONS = ( <integration_name> [ , ... ] ) ]
```

##### Required parameters

`name`
:   Name of the Streamlit app.

`path_to_main_file_in_root_directory`
:   Specifies the filename of the Streamlit app. This filename is relative to the value of `ROOT_LOCATION`.

`warehouse_name`
:   Specifies the warehouse to run SQL queries issued by the Streamlit app.

#### Optional parameters

`version_alias_name`
:   A user-specified version alias name.

`version_comment`
:   A user-provided comment for this version.

`source_location`
:   A location where the source files are copied from.

`create_comment`
:   Specifies a comment for the Streamlit object. By default, there is no value.

`default_version_name_or_alias`
:   The name of the default version used.

`app_title`
:   Specifies a title for the Streamlit app to display in Snowsight.

`stage_path_and_file_name_to_read`
:   The location (stage), path, and name of the file(s) to import.

`integration_name`
:   The names of external access integrations needed in order for the Streamlit app code to access external networks.

#### Examples

To create a Streamlit app from a stage, run the CREATE STREAMLIT command, as shown in the following example:

```sqlexample
CREATE STREAMLIT app
  FROM @streamlit_db.streamlit_schema.streamlit_stage;
  MAIN_FILE = 'streamlit_app.py'
  QUERY_WAREHOUSE = my_warehouse;
```

To create a Streamlit app from a Git repository, run the CREATE STREAMLIT command, as shown in the following example:

```sqlexample
CREATE STREAMLIT app
  FROM @streamlit_db.streamlit_schema.streamlit_repo/branches/streamlit_branch/;
  MAIN_FILE = 'streamlit_app.py'
  QUERY_WAREHOUSE = my_warehouse;
```

### ALTER STREAMLIT

When this behavior change bundle is enabled, the ALTER STREAMLIT command is updated to include the following:

#### Syntax

```sqlsyntax
ALTER STREAMLIT <name> ADD VERSION [ [ IF NOT EXISTS] <version_alias_name> ]
FROM <source_location>
[ COMMENT = <add_version_comment> ]

ALTER STREAMLIT <name> ADD VERSION <version_name>
FROM { <snowgit_tag_uri> | <snowgit_commit_uri> }
[ COMMENT = <git_pull_comment> ]

ALTER STREAMLIT <name> ADD LIVE VERSION [ [IF NOT EXISTS] <version_alias_name> ]
[ FROM LAST ]
[ COMMENT = <add_version_comment> ]

ALTER STREAMLIT <name> VERSION <existing_version_name_or_alias>
SET ALIAS = <new_version_name_alias>

ALTER STREAMLIT <name> COMMIT [ VERSION <live_version_alias> ] [COMMENT = <version_comment>]

ALTER STREAMLIT <name> SET DEFAULT_VERSION = <version_name> | <version_name_alias>

ALTER STREAMLIT <name> PUSH [TO <git_branch_uri>] [ { GIT_CREDENTIALS = <snowflake_secret> | USERNAME = <git_username> password = <git_password> } NAME = <git_author_name> EMAIL = <git_author_email> ] [ COMMENT = <git_push_comment>]

ALTER STREAMLIT <name> ABORT [ VERSION  <live_version_alias> ]

ALTER STREAMLIT <name> PULL
```

#### Parameters

`name`
:   Name of the Streamlit app.

`version_alias_name`
:   A user-specified version alias name.

`source_location`
:   A location where the source files are copied from. Requires the OWNERSHIP privilege.

`ALTER STREAMLIT name PUSH`
:   Pushes the latest committed changes to the Git repo, using the branch stored in the base version if `git_branch_uri` is not specified.

    If the base version is not based on a Git branch, throws an error. Requires OWNERSHIP privilege.

    `git-branch-uri`
    :   Target branch to push committed changes to.

    `git_author_name`
    :   Name of the git author to use.

    `git_author_email`
    :   A valid e-mail address to use as the git author’s name.

    `git_push_comment`
    :   A user-specified comment to include in the git push.

`ALTER STREAMLIT name ABORT`
:   Removes an existing version and deletes its files. If no version specified, it deletes live_version by default. Requires OWNERSHIP privilege.

`ALTER STREAMLIT name PULL`
:   Pulls latest changes from source to the live version of this Streamlit. Requires OWNERSHIP privilege.

## New columns in DESCRIBE STREAMLIT output

When this behavior change bundle is enabled, the output of the [DESCRIBE STREAMLIT](../../../sql-reference/sql/desc-streamlit.md) command
includes the following new columns:

| Column name | Description |
| --- | --- |
| default_version | For future use. |
| default_version_name | For future use. |
| default_version_alias | For future use. |
| default_version_location_uri | For future use. |
| default_version_source_location_uri | For future use. |
| default_version_git_commit_hash | For future use. |
| live_version | String. Specifies which version is the live version for this app. |
| live_version_name | String. The name of the live version for this app. |
| live_version_alias | String. The alias of the live version for this app. |
| live_version_location_uri | String. The URI to where files of the live version are stored. |
| live_version_source_location_uri | String. Specifies the URI to where the live_version was copied from. Null if this Streamlit was not cloned. |
| live_version_git_commit_hash | String. Hexademical hash of the git commit that the live_version points to. Null if a Git repository is not connected. |

Ref: 1888

---
title: Streams on views: Changes to column behavior when selecting from a stream
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1834.md
section: Release Notes
---

# Streams on views: Changes to column behavior when selecting from a stream

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, queries on streams that read from views behave as follows:

Before the change:
:   When you create a stream on a view with an explicit column list, streams on that view contain the columns that appear in the
    view’s SELECT statement, rather than those in the column list.

    In the following example, stream `stream1` would contain columns `columnA` and `columnB`.

    ```sqlexample
    CREATE TABLE table1(columnA INT, columnB INT);

    CREATE VIEW view1(columnC, columnD)
      AS
        SELECT * FROM table1;

    CREATE STREAM stream1 ON VIEW view1;
    ```

After the change:
:   When you create a stream on a view with an explicit column list, the stream contains exactly the same columns as the view.

    In the following example, stream `stream1` would contain columns `columnC` and `columnD`.

    ```sqlexample
    CREATE TABLE table1(columnA INT, columnB INT);

    CREATE VIEW view1(columnC, columnD)
    AS
      SELECT * FROM table1;

    CREATE STREAM stream1 ON VIEW view1;
    ```

Ref: 1834

---
title: Streams: Changes to replication support in replication groups
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2079.md
section: Release Notes
---

# Streams: Changes to replication support in replication groups

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, all secondary streams in a replication group behave as follows.

This change doesn’t affect failover groups.

Before the change:
:   Streams in a primary replication group are replicated to a secondary replication group on refresh.

After the change:
:   New streams in a primary replication group are not replicated to a secondary replication group on refresh.
    Existing streams in secondary replication groups are kept as is and will eventually become stale.

Ref: 2079

---
title: Streams: CREATE STREAM with INSERT_ONLY = TRUE Not Allowed on Non-external Tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-795.md
section: Release Notes
---

# Streams: CREATE STREAM with INSERT_ONLY = TRUE Not Allowed on Non-external Tables

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

[CREATE STREAM](../../../sql-reference/sql/create-stream.md) with INSERT_ONLY = TRUE specified for a non-external table now produces an error and the
statement fails:

Previously:
:   The Snowflake Documentation states that the use of INSERT_ONLY is not allowed on non-external tables. However, when
    INSERT_ONLY was specified, the command executed successfully, but INSERT_ONLY was not enforced.

Currently:
:   Any attempt to create an INSERT_ONLY stream on non-external tables throws the following error:

    `Streams of type INSERT_ONLY can only be created on external tables.`

Ref: 795

---
title: Streams: Joins on Views for Append-only Streams No Longer Produce Unexpected Results
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-920.md
section: Release Notes
---

# Streams: Joins on Views for Append-only Streams No Longer Produce Unexpected Results

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

When querying APPEND_ONLY streams on views that use joins, results are now consistent between the left and right side join contents:

Previously:
:   Append-only stream queries over a view that used joins could return results that were inconsistent with the expectations of
    an append-only stream. In addition, they possibly included updates that have happened since the stream offset.

Currently:
:   Append-only stream queries return inserts that have occurred on either join input, joined with the current values of the tables
    at the stream offset.

Ref: 920

---
title: Streams: Updates to stream replication
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2112.md
section: Release Notes
---

# Streams: Updates to stream replication

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

Stream replication is being updated to prevent incorrect query results. Some streams that previously worked
might now be blocked because the system can’t determine which changes are safe, so both safe
and unsafe changes are blocked to ensure data accuracy.

Before the change:
:   During database or failover group replication, a stream could return incorrect results if its
    underlying views change after failover but before consumption.

After the change:
:   During database or failover group replication, streams mark themselves as unreadable instead of
    returning incorrect data. Users see an error with recovery guidance, preventing accidental use of
    incorrect results.

    To recover an unreadable stream, restore the view to its original definition, consume any pending
    changes from the stream, then apply your intended changes to the view.

Ref: 2112

---
title: Stronger Default Password Policies
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1776.md
section: Release Notes
---

# Stronger Default Password Policies

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the default values for the PASSWORD_MIN_LENGTH and PASSWORD_HISTORY parameters in password
policies behaves as follows:

Before the change:
:   Default value for PASSWORD_MIN_LENGTH is 8

    Default value for PASSWORD_HISTORY is 0

After the change:
:   Default value for PASSWORD_MIN_LENGTH is 14

    Default value for PASSWORD_HISTORY is 5

Ref: 1776

---
title: Stronger UTF-8 validation for external files
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1013-1014.md
section: Release Notes
---

# Stronger UTF-8 validation for external files

This behavior change has been implemented with the 7.34 release. For the most up-to-date details about behavior changes,
see the [Behavior Change Log](../../behavior-changes.md).

Snowflake has stronger UTF-8 validation for external files.

Before the change:
:   When you query external Avro, Parquet, Orc, CSV, JSON, or XML files that contain invalid UTF-8 data, the queries usually succeed.

After the change:
:   When you query external Avro, Parquet, Orc, CSV, JSON, or XML files on a stage that contain invalid UTF-8 data, the queries fail.

    If you load external files with [COPY INTO <table>](../../../sql-reference/sql/copy-into-table.md) or [Snowpipe](../../../user-guide/data-load-snowpipe-intro.md)
    that contain invalid UTF-8 data, Snowflake proceeds with the copy option `ON_ERROR` specified in the object definition.

    When you query an external table, Snowflake omits results for records that contain invalid UTF-8 data.
    After encountering invalid data, Snowflake continues to scan the file (similar to `ON_ERROR = CONTINUE`) but doesn’t return an error message.

To avoid UTF-8 validation errors, Snowflake recommends that you specify `REPLACE_INVALID_CHARACTERS = TRUE` for your file format
so that any invalid UTF-8 characters will be replaced with the Unicode replacement character (`�`).

For Parquet files, you can also set `BINARY_AS_TEXT = FALSE` for your file format so that the columns
with no defined logical data type will be interpreted as binary data instead of as UTF-8 text.

Note that this behavior change does not apply to existing accounts that are currently loading invalid UTF8.
It only affects new accounts. For any issues, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Ref: 1013 1014

---
title: Summit announcements - Jun 26-29, 2023
source: https://docs.snowflake.com/en/release-notes/2023/june-summit.md
section: Release Notes
---

# Summit announcements - Jun 26-29, 2023

The following major features and enhancements were announced during Summit 2023.

> **Important:**
>
> This topic does not include every feature or enhancement announced during the Summit. In particular, it does not include features
> and enhancements that were announced, but are not yet in public preview or generally available.

## New features

### Dynamic tables — *Preview*

We are pleased to announce the preview of Dynamic Tables.

Dynamic tables are the building blocks of declarative data transformation pipelines.
They significantly simplify data engineering in Snowflake and provide a reliable, cost-effective, and automated way to transform your data
for consumption. Instead of defining data transformation steps as a series of tasks and having to monitor dependencies and scheduling, you
can simply define the end state of the transformation using dynamic tables and leave the complex pipeline management to Snowflake.

For more information, see [Dynamic Tables](../../user-guide/dynamic-tables-about.md).

### Amazon S3-compatible storage — *General Availability*

We are pleased to announce the general availability of support for accessing data in Amazon S3-compatible storage. You can create
external stages for on-premises or other cloud storage services and devices that are highly compliant with the Amazon S3 REST API. With this
feature, you can efficiently manage, govern, and analyze your data regardless of where the data is stored.

For more information, see [Work with Amazon S3-compatible storage](../../user-guide/data-load-s3-compatible-storage.md).

### Passing references for tables, views, functions, and queries to a stored procedure — *Preview*

We are pleased to announce the preview of the ability to pass references for tables, views, functions, and queries to a stored procedure.

A reference is a unique identifier for a table, view, function, or query. When you pass a reference to a stored procedure, the stored
procedure performs actions using the active role or secondary roles of the user who created the reference. For example, if you are calling
an owner’s rights stored procedure, you can create and pass in a reference to a table to allow the stored procedure to perform actions on
the table using your active role.

In addition, if the table, view, or function is not fully qualified, the name of the object is resolved by using the current database and
schema when the reference was created (i.e. the database and schema of the user who created the reference).

For more information, see [Passing references for objects and queries to stored procedures](../../developer-guide/stored-procedure/stored-procedures-calling-references.md).

### Snowpark ML: Machine learning at scale — *Preview*

We are pleased to announce the preview of Snowpark ML. Snowpark ML is a set of Python tools, including SDKs and underlying infrastructure,
for building and deploying machine learning models within Snowflake. This preview includes preprocessing and modeling classes based on
popular machine learning libraries such as [scikit-learn](https://scikit-learn.org/stable/),
[xgboost](https://xgboost.readthedocs.io/en/stable/), and [lightgbm](https://lightgbm.readthedocs.io/en/stable/).

Snowpark ML works with [Snowpark Python](../../developer-guide/snowpark/python/index.md). You use Snowpark DataFrames to hold your training or
test data and to receive your prediction results.

For more information, see [Snowflake ML: End-to-End Machine Learning](../../developer-guide/snowflake-ml/overview.md).

### ML functions — *Preview*

We are pleased to announce the preview of three new analysis tools powered by machine learning algorithms.

These three features train a machine learning model on your time-series data to determine how a specified metric varies over time and
relative to other features. The model then provides insights and predictions based on the trends detected in the data.

* **Forecasting**: Predicts future metric values from trends in historical data.
* **Anomaly Detection**: Flags metric values that differ from typical expectations.
* **Contribution Explorer**: Helps you find dimensions and values that affect the metric in surprising ways.

For more information, see [ML Functions](../../guides-overview-ml-functions.md).

### Native Applications Framework — *Preview*

We are pleased to announce the preview of the Native Apps Framework that enables you to create data applications that expand the
capabilities of other Snowflake features by sharing data and related business logic with other Snowflake accounts.

For more information, see [About the Native Apps
Framework](../../developer-guide/native-apps/native-apps-about.md) and [Tutorial: Developing an Application
with the Native Apps Framework](../../developer-guide/native-apps/tutorials/getting-started-tutorial.md).

### Custom event billing for applications — *Preview*

We are pleased to announce the preview of custom event billing, a usage-based pricing plan that providers can use to charge consumers for
usage of apps built with the Snowflake Native Apps Framework.

For more information, see [Paid listings pricing models](../../collaboration/provider-listings-pricing-model.md) and [Adding Billable Events to
Applications](../../developer-guide/native-apps/adding-custom-event-billing.md).

### Marketplace Capacity Drawdown Program — *General Availability*

We are pleased to announce the general availability of the Marketplace Capacity Drawdown Program, which allows eligible customers with a
capacity contract at Snowflake to pay for listings with their committed capacity.

See [Pay for listings](../../collaboration/consumer-listings-paying.md) for more information.

---
title: Summit announcements: June 02-05, 2025
source: https://docs.snowflake.com/en/release-notes/2025/june-summit.md
section: Release Notes
---

# Summit announcements: June 02-05, 2025

The following major features and enhancements were announced during Summit 2025.

> **Important:**
>
> This topic does not include every feature or enhancement announced during Summit 2025. In particular, it does not include features
> and enhancements that were announced, but are not yet in public preview or generally available.

## SQL updates

### Defining semantic views (*General availability*)

The ability to define [semantic views](../../user-guide/views-semantic/overview.md) (which are schema-level objects that correspond
to semantic models) is now generally available.

To create and manage semantic views, you can use SQL commands (such as [CREATE SEMANTIC VIEW](../../sql-reference/sql/create-semantic-view.md)) and the
[Cortex Analyst Semantic View Generator](../../user-guide/views-semantic/ui.md), which is a wizard in Snowsight that guides you
through the process of creating a semantic view.

Once you create a semantic view, Cortex Analyst can leverage the information in the semantic view definition and generate the
SQL against the physical tables directly. Semantic views can improve the accuracy of responses by combining LLM reasoning with
rule-based definitions.

For more information, see [Overview of semantic views](../../user-guide/views-semantic/overview.md).

### Querying semantic views (*Preview*)

The ability to query semantic views is now available as a preview feature. You can now use a SELECT statement to query a
semantic view by specifying the [SEMANTIC_VIEW](../../sql-reference/constructs/semantic_view.md) clause. In this clause, you specify the
dimensions and metrics that you want to retrieve. You can also filter the results based on dimensions.

For information, see [Querying semantic views](../../user-guide/views-semantic/querying.md).

## Data loading / unloading updates

### Snowpipe Streaming with high-performance architecture (*Preview*)

With this release, we’re pleased to announce the preview of Snowpipe Streaming’s new high-performance architecture. This
next-generation implementation delivers significantly enhanced throughput and optimized streaming performance with a predictable,
throughput-based pricing model (credits per uncompressed GB). It utilizes the new Snowpipe Streaming SDK and introduces a PIPE
object for managing data flow, enabling lightweight transformations during ingestion and server-side schema validation. We
recommend evaluating this advanced architecture for new streaming projects due to its performance, scalability, and cost
predictability.

For more information, see [Snowpipe Streaming key concepts](../../user-guide/snowpipe-streaming/snowpipe-streaming-high-performance-overview.md).

## Snowflake Native App Framework

With this release, the Snowflake Native App Framework supports the following features:

### Restricted caller’s rights (*Preview*)

Snowflake Native App Framework supports using restricted caller’s rights in stored procedures and Snowpark Container Services service in an app.
Restricted caller’s rights allow a stored procedure or service to run with caller’s rights, but restricts which of the caller’s privileges they run with.

See [Restricted Caller’s Rights](../../developer-guide/restricted-callers-rights.md) and
[Use owner’s rights and restricted caller’s rights in an app](https://other-docs.snowflake.com/en/native-apps/consumer-restricted-callers-rights) for more information.

### Feature policies (*Preview*)

In this release, the Snowflake Native App Framework introduces feature policies.
Feature policies allow consumers to restrict the types of objects an app can create.
For example, consumers can create a feature policy to prohibit an app from creating a warehouse.
If an app attempts to create a warehouse during installation, the installation fails.
See [Use feature policies to limit the objects an app can create](https://other-docs.snowflake.com/en/native-apps/consumer-feature-policies) for more information.

### Support for Snowflake ML in Snowflake Native Apps (*Preview*)

The Snowflake Native App Framework supports models created using [Snowflake ML](../../developer-guide/snowflake-ml/overview.md).
Providers can include access to pre-trained models in an app or train a model after the app is installed. The app can train models on data in the provider or consumer accounts.

See [Use Snowflake machine learning models in a Snowflake Native App](../../developer-guide/native-apps/snowflake-ml-na-about.md) for more information.

---
title: System Functions: MONITOR SECURITY Privilege Required to Execute Certain System Functions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-805.md
section: Release Notes
---

# System Functions: MONITOR SECURITY Privilege Required to Execute Certain System Functions

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The behavior of these system functions has changed as follows:

* SYSTEM$GET_CMK_KMS_KEY_POLICY
* SYSTEM$GET_CMK_AKV_CONSENT_URL
* SYSTEM$GET_GCP_KMS_CMK_GRANT_ACCESS_CMD

Previously:
:   These functions were not guarded by any specific privilege.

Currently:
:   To call these functions, the active role or a role in the active role hierarchy must have the MONITOR SECURITY privilege.

Ref: 805

---
title: SYSTEM$ALLOWLIST function: Fail query when socket connection hangs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1357.md
section: Release Notes
---

# SYSTEM$ALLOWLIST function: Fail query when socket connection hangs

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the [SYSTEM$ALLOWLIST](../../../sql-reference/functions/system_allowlist.md) function is as follows:

Before the change:
:   Occasionally, the socket connection between the client that calls the function and Snowflake results in a state that Snowflake cannot
    resolve. In these cases, Snowflake returns the function results and includes an empty list for the OCSP values.

After the change:
:   When Snowflake cannot resolve the socket connection, the query that calls either allowlist function fails with one of the following error
    messages:

    * `SYSTEM$ALLOWLIST: Fail to get SSL context`
    * `SYSTEM$ALLOWLIST: SSLContext init failed`
    * `SYSTEM$ALLOWLIST: Could not find host in OCSP dumping`
    * `SYSTEM$ALLOWLIST: Peer unverified`
    * `SYSTEM$ALLOWLIST: Connection failure`

    If your network connection is transient, wait a few minutes and rerun the query. If the issue persists, contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

Ref: 1357

---
title: SYSTEM$ESTIMATE_QUERY_ACCELERATION function: New property in result value
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2044.md
section: Release Notes
---

# SYSTEM$ESTIMATE_QUERY_ACCELERATION function: New property in result value

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

When this behavior change bundle is enabled, the JSON value returned by the
[SYSTEM$ESTIMATE_QUERY_ACCELERATION](../../../sql-reference/functions/system_estimate_query_acceleration.md) function
includes the following new property:

| Property name | Description |
| --- | --- |
| `ineligibleReason` | If the `status` property has the value `ineligible`, this property contains an explanation of why this query couldn’t use Query Acceleration Service (QAS). |

Ref: 2044

---
title: SYSTEM$GET_COMPUTE_POOL_STATUS function:  Deprecated
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1830.md
section: Release Notes
---

# SYSTEM$GET_COMPUTE_POOL_STATUS function: Deprecated

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

The system_get_compute_pool_status function is being deprecated.

Before the change:
:   The SYSTEM$GET_COMPUTE_POOL_STATUS function works and provides compute pool status information.

After the change:
:   The SYSTEM$GET_COMPUTE_POOL_STATUS function is deprecated and will no longer work. Use the [DESCRIBE COMPUTE POOL](../../../sql-reference/sql/desc-compute-pool.md) command instead to get the same information.

Ref: 1830

---
title: SYSTEM$GET_PRIVATELINK_CONFIG function: OCSP account identifier URL added to output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1212.md
section: Release Notes
---

# SYSTEM$GET_PRIVATELINK_CONFIG function: OCSP account identifier URL added to output

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The behavior of the [SYSTEM$GET_PRIVATELINK_CONFIG](../../../sql-reference/functions/system_get_privatelink_config.md) function is as follows:

Previously:
:   The output of this function does not include the OCSP URL for your [account identifier](../../../user-guide/admin-account-identifier.md).

Currently:
:   The output of this function does include the OCSP URL for your account identifier. The value is recorded as follows:

    `"regionless-privatelink-ocsp-url": "ocsp.org_name-account_name.privatelink.snowflakecomputing.com"`

    Where:

    `org_name` is the name of your Snowflake organization.

    `account_name` is the unique name of your account within your organization.

    Call the function to obtain the value:

    ```sqlexample
    SELECT KEY, VALUE
      FROM TABLE(FLATTEN(
        input=>PARSE_JSON(
          SYSTEM$GET_PRIVATELINK_CONFIG())));
    ```

Ref: 1212

---
title: SYSTEM$REFERENCE function: Creating a Reference with Mismatched Object Types Fails
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1315.md
section: Release Notes
---

# SYSTEM$REFERENCE function: Creating a Reference with Mismatched Object Types Fails

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

You can create a [reference](../../../sql-reference/references.md) to an object using the [SYSTEM$REFERENCE](../../../sql-reference/functions/system_reference.md)
function. A reference authorizes access on an object to a stored procedure, class instance, or application that cannot otherwise access
the object. The reference is passed as an identifier to an owner’s rights stored procedure, class instance, or application with specific
and limited privileges on an object.

The `object_type` argument of the SYSTEM$REFERENCE function should match the type of the object the reference identifies. In
the following example, `t1` is a table and matches the TABLE object type:

```sqlexample
SELECT SYSTEM$REFERENCE('TABLE', 't1', 'SESSION', 'SELECT');
```

The SYSTEM$REFERENCE function behaves as follows:

Before the change:
:   If you create a reference using the SYSTEM$REFERENCE function and the `object_type` argument is TABLE, and the object
    name resolves to any table-like object type (that is to say, TABLE, VIEW, EXTERNAL TABLE, or MATERIALIZED VIEW), the function
    *succeeds*.

After the change:
:   If you try to create a reference using the SYSTEM$REFERENCE function and the `object_type` argument is TABLE, but the object
    name resolves to a table-like object type other than TABLE (that is to say, VIEW, EXTERNAL TABLE, or MATERIALIZED VIEW), the function
    *fails*.

    For example, if you use the TABLE object type for view `v1` with the following statement:

    ```sqlexample
    SELECT SYSTEM$REFERENCE('TABLE', 'v1', 'SESSION', 'SELECT');
    ```

    The statement results in the following error:

    ```output
    505028 (42601): Object type VIEW does not match the specified type TABLE for reference creation
    ```

Ref: 1315

---
title: Table Aliases: Changes to Name Resolution for Quoted Column Identifiers
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_07/bcr-881.md
section: Release Notes
---

# Table Aliases: Changes to Name Resolution for Quoted Column Identifiers

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

In the [FROM](../../../sql-reference/constructs/from.md) clause in a [SELECT](../../../sql-reference/sql/select.md) statement, you can use the AS clause to
define an alias for the table and its columns. For example:

```sqlexample
SELECT * FROM my_table AS my_table_alias(my_column_1_alias, my_column_2_alias);
```

This announcement describes the changes to way in which quoted column identifiers in the table alias are resolved:

Before the change:
:   If you use double quotes around the column identifier in the table alias, the double quotes become part of the column alias.

    For example, the following query defines a table alias (table_alias) and a column alias that includes quotes (“column_alias”):

    ```sqlexample
    SELECT * FROM table_1 AS my_table_alias("my_column_alias");
    ```

    In the output, the name of the first column includes the quotes (`"my_column_alias"`).

    Because the quotes are currently part of the column alias name, if you need to refer to this column alias, you must include
    the quotes. For example, if you want to refer to the column alias in the list of selected columns, you must include the
    quotes in the column alias name:

    ```sqlexample
    SELECT """my_column_alias""" FROM table_1 AS my_table_alias("my_column_alias");
    ```

    Note that in the example above, the column alias is enclosed in quotes, and the quotes within the column alias name are
    specified with two double-quote characters.

After the change:
:   If you use double quotes around the column identifier in the table alias, the double quotes are not used as part of the column
    alias.

    ```sqlexample
    SELECT * FROM table_1 AS my_table_alias("my_column_alias");
    ```

    In the output, the name of the first column does not include the quotes (`my_column_alias`).

    Queries that use quotes within the column alias fail with an “invalid identifier” error:

    ```sqlexample
    SELECT """my_column_alias""" FROM table_1 AS my_table_alias("my_column_alias");
    ```

If you use double quoted identifiers for columns in a table alias, use one of the following approaches to fix your SQL
statements:

* If you need to keep the column names as is (preserving quotes and case sensitivity), rewrite the existing queries to use
  common table expressions to define column aliases.

  For example, change:

  ```sqlexample
  SELECT """My_Column_Alias"""
    FROM table_1 AS my_table_alias("My_Column_Alias")
  ```

  to:

  ```sqlexample
  WITH my_table_alias("""My_Column_Alias""")
      AS (SELECT * FROM table_1)
    SELECT """My_Column_Alias""" FROM my_table_alias
  ```
* If you can change output column names of the existing queries, consider removing quotes from column alias definitions.

  For example, change:

  ```sqlexample
  SELECT """my_column_alias"""
    FROM table_1 AS my_table_alias("my_column_alias");
  ```

  to:

  ```sqlexample
  SELECT my_column_alias
    FROM table_1 AS my_table_alias(my_column_alias);
  ```

  Note that this statement creates and resolves the `my_column_alias` identifier as uppercase characters without any quotes.
  If you use this approach, you might need to adjust code or statements that refer to this column alias.

  For example, instead of accessing the above column using `"my_column_alias"`, client applications might need to refer to it
  as `MY_COLUMN_ALIAS`.

Ref: 881

---
title: Table Functions (Except SQL UDTFs): Restrictions With Lateral Table Functions and Outer Lateral Joins
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1057.md
section: Release Notes
---

# Table Functions (Except SQL UDTFs): Restrictions With Lateral Table Functions and Outer Lateral Joins

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

In the current release, the following are no longer be allowed:

* Lateral table functions (other than SQL UDTFs) that are specified with the ON clause.
* Outer lateral joins to table functions (other than SQL UDTFs) that specify the ON clause.

Note that this restriction also applies to statements that use clauses equivalent to the ON clause (the USING clause and NATURAL JOIN).

Table functions other than SQL UDTFs include built-in table functions and the UDTFs defined in languages other than SQL.

Previously:
:   You could specify:

    * The ON clause (or the USING clause or NATURAL JOIN) for lateral table functions (other than SQL UDTFs).

      For example:

      ```sqlexample
      SELECT ... FROM my_table
      JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
      SELECT ... FROM my_table
      INNER JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
      SELECT ... FROM my_table
      JOIN TABLE(my_js_udtf(col_a))
      ON ...;
      SELECT ... FROM my_table
      INNER JOIN TABLE(my_js_udtf(col_a))
      ON ... ;
      ```
    * An outer lateral join to a table function (other than SQL UDTFs), using the ON clause (or the USING clause or NATURAL JOIN).

      For example:

      ```sqlexample
      SELECT ... FROM my_table
      LEFT JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
      SELECT ... FROM my_table
      FULL JOIN TABLE(FLATTEN(input=>[col_a]))
      ON ... ;
      SELECT ... FROM my_table
      LEFT JOIN TABLE(my_js_udtf(col_a))
      ON ... ;
      SELECT ... FROM my_table
      FULL JOIN TABLE(my_js_udtf(col_a))
      ON ... ;
      ```

Currently:
:   When you execute the statements above, an error occurs with the following message:

    ```none
    000002 (0A000): Unsupported feature 'lateral table function called with
        OUTER JOIN syntax or a join predicate (ON clause)'
    ```

This restriction does not apply if you are using a comma to specify the join:

```sqlsyntax
SELECT ... FROM <table>,
    TABLE(<ptable_function_other_than_sql_udtf>) ... ;
```

For example:

```sqlexample
SELECT ... FROM my_table,
TABLE(FLATTEN(input=>[col_a]));
```

This restriction also does not apply if the ON clause (or the USING clause or NATURAL JOIN) is not specified:

```sqlsyntax
SELECT ... FROM <table>
[{ [ INNER  | { LEFT | RIGHT | FULL } [ OUTER ] | CROSS } JOIN
TABLE(<table_function_other_than_sql_udtf>) ...;
```

For example:

```sqlexample
SELECT ... FROM my_table
FULL JOIN TABLE(FLATTEN(input=>[col_a]));
SELECT ... FROM my_table
LEFT JOIN OUTER TABLE(FLATTEN(input=>[col_a]));
```

This restriction was added because these types of queries can result in inconsistent behavior. These types of queries are not supported.

Ref: 1057

---
title: TABLE_PRIVILEGES View: Update GRANTOR column value in the consumer account (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1321.md
section: Release Notes
---

# TABLE_PRIVILEGES View: Update GRANTOR column value in the consumer account (Postponed)

> **Attention:**
>
> This behavior change is in the 2023_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_07_bundle.md).

This behavior change was originally planned for **September, 2023**; however, it has been postponed and a new release date has not been
determined.

This change is not available for testing.

For the most up-to-date details about the release date, as well as other release-related details, see the [Behavior Change Log](https://docs.snowflake.com/release-notes/behavior-changes).

The Information Schema [TABLE_PRIVILEGES](../../../sql-reference/info-schema/table_privileges.md) view behaves as follows:

Before the change:
:   When a consumer creates a database from a share and queries the Information Schema TABLE_PRIVILEGES view, the GRANTOR column in the view
    specifies the name of the role in the provider account that made the grant to the share.

After the change:
:   If the consumer queries the TABLE_PRIVILEGES view and the GRANTOR column contains a role from the provider account that made the grant on
    the table to the share, Snowflake removes this row from the view output.

Ref: 1321

---
title: TABLE_STORAGE_METRICS and TABLES views (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2127.md
section: Release Notes
---

# TABLE_STORAGE_METRICS and TABLES views (ACCOUNT_USAGE, ORGANIZATION_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this behavior change bundle is enabled, the [TABLE_STORAGE_METRICS view](../../../sql-reference/account-usage/table_storage_metrics.md) in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema and the [TABLE_STORAGE_METRICS view](../../../sql-reference/organization-usage/table_storage_metrics.md) in the [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) schema include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ARCHIVE_STORAGE_COOL_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COOL_TIME_TRAVEL_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_TIME_TRAVEL_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COOL_FAILSAFE_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_FAILSAFE_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COOL_EARLY_DELETION_PENALTY_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_EARLY_DELETION_PENALTY_BYTES | NUMBER | Reserved for future use. |

When this behavior change bundle is enabled, the [TABLES view](../../../sql-reference/account-usage/tables.md) in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) schema and the [TABLES view](../../../sql-reference/organization-usage/tables.md) in the [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) schema include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| ARCHIVE_STORAGE_COOL_ROW_COUNT | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COOL_BYTES | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_ROW_COUNT | NUMBER | Reserved for future use. |
| ARCHIVE_STORAGE_COLD_BYTES | NUMBER | Reserved for future use. |

Ref: 2127

---
title: TABLE_STORAGE_METRICS view (Account Usage): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1361.md
section: Release Notes
---

# TABLE_STORAGE_METRICS view (Account Usage): New column

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

A new column is added to the TABLE_STORAGE_METRICS (Account Usage) view.

Before the change:
:   The TABLE_STORAGE_METRICS (Account Usage) view does not include an INSTANCE_ID column.

After the change:
:   The TABLE_STORAGE_METRICS (Account Usage) view includes an INSTANCE_ID column:

    | Column name | Data type | Description |
    | --- | --- | --- |
    | INSTANCE_ID | NUMBER | Internal/system-generated identifier for the instance which the object belongs to. |

    The columnn will display a null value if the table is under a behavior change bundle instance.

Ref: 1361

---
title: TABLES and SCHEMATA Views (Account Usage): Changes to RETENTION_TIME Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-928.md
section: Release Notes
---

# TABLES and SCHEMATA Views (Account Usage): Changes to RETENTION_TIME Column

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The data retention period for an object is determined by the retention time parameter settings set on the object and account. These parameters
are DATA_RETENTION_TIME _IN_DAYS and MIN_DATA_RETENTION_TIME_IN_DAYS.

* If the retention time is not explicitly set for an object, it inherits the setting from its parent object.
* If there is no retention time set at the account level, the default retention time for the object is 1 day.
* The maximum retention time for a transient object is 1 day, regardless of the account-level setting.
* If there is a minimum retention time set for the account, and a retention time explicitly set on an object, the effective retention time is the
  greater of the two: MAX(DATA_RETENTION_TIME_IN_DAYS, MIN_DATA_RETENTION_TIME_IN_DAYS).

The RETENTION_TIME column in the Account Usage views listed below might display the incorrect value in the following scenarios:

* If there is no explicit retention time set for a transient table or schema, and the retention time for the account is set to 7 days, the
  RETENTION_TIME column value is 7 days. This is incorrect. The maximum data retention time for a transient object is 1 day.
* If the minimum retention time for an account is 7 days, and the retention time setting for a table or schema is 4 days, the RETENTION_TIME column
  value is 4 days. This is incorrect. The minimum account retention time is longer and therefore overrides the retention time explicitly set for the
  table or schema.
* If the retention time is set to 10 days for a table or schema, then unset, the RETENTION_TIME column value is the unset value (in this case 10).
  This might be incorrect.

In the current release, the RETENTION_TIME column value changed as follows for the ACCOUNT_USAGE views listed below:

* TABLES View
* SCHEMATA View

Previously:
:   In some cases, the RETENTION_TIME column displays an incorrect data retention time for the object.

Currently:
:   The RETENTION_TIME column will display the correct data retention time for tables and schemas.

For more information about setting the data retention period, refer to
[Specifying the Data Retention Period for an Object](../../../user-guide/data-time-travel.md).

Ref: 928

---
title: TABLES view (Account Usage and Information Schema): New column and column values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1260.md
section: Release Notes
---

# TABLES view (Account Usage and Information Schema): New column and column values

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The version of the TABLES view for Account Usage and Information Schema is as follows:

Previously:
:   For [Account Usage TABLES view](../../../sql-reference/account-usage/tables.md) and [Information Schema TABLES view](../../../sql-reference/info-schema/tables.md), a value of `EVENT_TABLE` for the `TABLE_TYPE` column is not available.

Currently:
:   For Account Usage TABLES view:

    > * A new value of `EVENT TABLE` for the `TABLE_TYPE` column is enabled. The value indicates whether the table is an event table. See [Setting up an Event Table](../../../developer-guide/logging-tracing/event-table-setting-up.md).

    For Information Schema TABLES view:

    > * A new value of `EVENT TABLE` for the `TABLE_TYPE` column is enabled. The value indicates whether the table is an event table. See [Setting up an Event Table](../../../developer-guide/logging-tracing/event-table-setting-up.md).
    > * A new `IS_TEMPORARY` column is enabled:
    >
    >   > | Column Name | Data Type | Description |
    >   > | --- | --- | --- |
    >   > | IS_TEMPORARY | TEXT | Indicates whether this is a temporary table. Valid values are `YES` and `NO`. |

Ref: 1260

---
title: TABLES View (Account Usage): Changes to the RETENTION_TIME Column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-853.md
section: Release Notes
---

# TABLES View (Account Usage): Changes to the RETENTION_TIME Column

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The RETENTION_TIME column value in the Account Usage [TABLES](../../../sql-reference/account-usage/tables.md) view has changed
as follows:

Previously:
:   In the following cases, the RETENTION_TIME column might have displayed an incorrect data retention time for tables:

    * If the DATA_RETENTION_TIME_IN_DAYS was not explicitly set for a transient table, the RETENTION_TIME column displayed the value
      inherited from a parent object. This value might have been incorrect. The maximum data retention time for transient tables
      is 1 day.
    * If the MIN_DATA_RETENTION_TIME_IN_DAYS parameter was set for an account, the RETENTION_TIME column ignored this minimum
      retention time and might have displayed an incorrect value for a table.

Currently:
:   The RETENTION_TIME column now display the correct data retention time for tables.

Ref: 853

---
title: TABLES view and SHOW TABLES command: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2168.md
section: Release Notes
---

# TABLES view and SHOW TABLES command: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the output of the [TABLES](../../../sql-reference/info-schema/tables.md) view,
in the [INFORMATION SCHEMA](../../../sql-reference/info-schema.md), includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| ROW_TIMESTAMP_ON | TEXT | Describes whether the table includes a row timestamp.   * `YES` if the table includes row timestamp * `NO` otherwise |

When this behavior change bundle is enabled, the output of the [SHOW TABLES](../../../sql-reference/sql/show-tables.md) command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| ROW_TIMESTAMP | TEXT | Describes whether the table includes a row timestamp.   * `ON` if the table includes row timestamp * `OFF` otherwise |

Ref: 2168

---
title: TABLES views (Account Usage and Information Schema): TABLE_TYPE column value shows correct value for event tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1169.md
section: Release Notes
---

# TABLES views (Account Usage and Information Schema): TABLE_TYPE column value shows correct value for event tables

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, in the [Account Usage schema TABLES view](../../../sql-reference/account-usage/tables.md) and
[Information schema TABLES view](../../../sql-reference/info-schema/tables.md), the `TABLE_TYPE` column value have changed for certain rows.
When the table described in the view row is an [event table](../../../developer-guide/logging-tracing/event-table-columns.md), the
`TABLE_TYPE` column value will be `EVENT TABLE`.

Previously:
:   When the TABLES view row describes an event table, the row’s `TABLE_TYPE` column value is null.

Currently:
:   When the TABLES view row describes an event table, the row’s `TABLE_TYPE` column value is EVENT TABLE.

Ref: 1169

---
title: TABLES views and SHOW OBJECTS command: New column IS_HYBRID
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1732.md
section: Release Notes
---

# TABLES views and SHOW OBJECTS command: New column IS_HYBRID

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the [INFORMATION_SCHEMA.TABLES view](../../../sql-reference/info-schema/tables.md) and
the [ACCOUNT_USAGE.TABLES view](../../../sql-reference/account-usage/tables.md) both include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| IS_HYBRID | Boolean | Specifies whether the table is a hybrid table (`YES` or `NO`). |

For both TABLES views, the TABLE_TYPE column for a hybrid table shows `BASE TABLE` instead of `HYBRID TABLE`.

When this behavior change bundle is enabled, the output of the [SHOW OBJECTS](../../../sql-reference/sql/show-objects.md) command includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| IS_HYBRID | Boolean | Specifies whether the table is a hybrid table (`Y` or `N`). |

Ref: 1732

---
title: TABLES, VIEWS, and EXTERNAL_TABLES Views (Account Usage, Information Schema): New Columns Added
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-891.md
section: Release Notes
---

# TABLES, VIEWS, and EXTERNAL_TABLES Views (Account Usage, Information Schema): New Columns Added

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

New columns for tracking DDL operations have been added to the following Account Usage and Information Schema views:

* ACCOUNT_USAGE:

  + [TABLES](../../../sql-reference/account-usage/tables.md)
  + [VIEWS](../../../sql-reference/account-usage/views.md)
* INFORMATION_SCHEMA:

  + [TABLES](../../../sql-reference/info-schema/tables.md) and [EXTERNAL_TABLES](../../../sql-reference/info-schema/external_tables.md)
  + [VIEWS](../../../sql-reference/info-schema/views.md)

New columns:

| Column Name | Data Type | Description |
| --- | --- | --- |
| LAST_DDL | TIMESTAMP | Specifies the timestamp of the last DDL operation performed on the table or view. |
| LAST_DDL_BY | TEXT | Specifies the username of the user who performed the last DDL operation on the table or view. |

Ref: 891

---
title: Tag Must Exist When Calling System Functions
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-938.md
section: Release Notes
---

# Tag Must Exist When Calling System Functions

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The behavior of the [SYSTEM$GET_TAG_ON_CURRENT_COLUMN](../../../sql-reference/functions/system_get_tag_on_current_column.md) and
[SYSTEM$GET_TAG_ON_CURRENT_TABLE](../../../sql-reference/functions/system_get_tag_on_current_table.md) functions has changed as follows:

Previously:
:   If the tag did not exist, Snowflake allowed using these functions when creating or altering a masking policy or row access
    policy. The query on the protected column failed because the tag did not exist.

Currently:
:   The tag must exist when using these functions while creating a masking policy or row access policy. If the tag does not exist,
    Snowflake returns the following error message:

    `Tag '<tag_name>' does not exist or not authorized.`

Ref: 938

---
title: TAGS view (ACCOUNT_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_02/bcr-1937.md
section: Release Notes
---

# TAGS view (ACCOUNT_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2025_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_02_bundle.md).

When this behavior change bundle is enabled, the ACCOUNT_USAGE TAGS view includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `propagate` | VARCHAR | Indicates whether the tag is configured for automatic propagation. Possible values are the following:   * NULL — Tag is not propagated. * `ON_DEPENDENCY` — Tag is propagated when there is an object dependency (for example, creating a view from a tagged table). * `ON_DATA_MOVEMENT` — Tag is propagated when there is data movement (for example, using a CTAS statement to create a table   from a tagged table). * `ON_DEPENDENCY_AND_DATA_MOVEMENT` — Tag is propagated for both object dependencies and data movement. |
| `on_conflict` | VARCHAR | If the tag is configured for automatic propagation, indicates what happens when the value of the tag being propagated conflicts with the value that was specified when the tag was manually applied to the same object. |

These columns are being added as the last columns of the view.

Ref: 1937

---
title: Task Parameters Preserved When Cloning Databases, Schemas, and Tables
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-913.md
section: Release Notes
---

# Task Parameters Preserved When Cloning Databases, Schemas, and Tables

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

When a schema, database, or table is cloned, the target (cloned) object now includes the same parameter set, with each parameter value
set to match the source:

Previously:
:   The following four parameters were not propagated to the clone target instance if set on the source schema or database:

    * ENABLE_STREAM_TASK_REPLICATION
    * SUSPEND_TASK_AFTER_NUM_FAILURES
    * USER_TASK_MANAGED_INITIAL_WAREHOUSE_SIZE
    * USER_TASK_TIMEOUT_MS

    Note that these parameters do not apply to tables.

Currently:
:   All parameters set on the schema, database, or table are now copied intact to the cloned target instance.

Ref: 913

---
title: Task Parameters Preserved When Cloning Tasks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-912.md
section: Release Notes
---

# Task Parameters Preserved When Cloning Tasks

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

When a task is cloned, the target (cloned) task now includes the same parameter set with each parameter value set to match the original
source task:

Previously:
:   The cloned task had the default value for each parameter. That is, parameter values were not copied from the source task.

Currently:
:   All parameters set on the source task are copied intact to the clone task instance.

Ref: 912

---
title: Task views and functions (Account Usage and Information Schema): ATTEMPT_NUMBER value changed to 1-based
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1280.md
section: Release Notes
---

# Task views and functions (Account Usage and Information Schema): ATTEMPT_NUMBER value changed to 1-based

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

The value of ATTEMPT_NUMBER field in the following view behaves as follows:

* Account Usage views:

  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/account-usage/complete_task_graphs.md)
  + [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md)
* Information Schema table functions:

  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md)
  + [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md)
  + [TASK_HISTORY](../../../sql-reference/functions/task_history.md)

Previously:
:   The ATTEMPT_NUMBER column is 0 based.
    For example, when the task/graph is run for the first time, the ATTEMPT_NUMBER will be 0.
    If the task/graph is later retried, the ATTEMPT_NUMBER will be 1, then 2, etc.

Currently:
:   The ATTEMPT_NUMBER column will be 1 based.
    For example, when the task/graph is run for the first time, the ATTEMPT_NUMBER will be 1.
    If the task/graph is later retried, the ATTEMPT_NUMBER will be 2, then 3, etc.

Ref: 1280

---
title: Task views and functions (Account Usage and Information Schema): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1279.md
section: Release Notes
---

# Task views and functions (Account Usage and Information Schema): New column

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, the output of the task history, current task graphs, and complete task graphs views and functions include a new column. The affected views include:

* Account Usage views:

  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/account-usage/complete_task_graphs.md)
  + [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md)
* Information Schema table functions:

  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md)
  + [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md)
  + [TASK_HISTORY](../../../sql-reference/functions/task_history.md)

| Column Name | Data Type | Description |
| --- | --- | --- |
| graph_run_group_id | NUMBER | Identifier for the graph run. When a graph run has multiple task runs, each task run will show the same `graph_run_group_id`. The combination of `graph_run_group_id`, and `attempt_number` can be used to uniquely identify a graph run. |

Additionally, `run_id` behaves as follows for:

* ACCOUNT_USAGE.TASK_HISTORY
* ACCOUNT_USAGE.COMPLETE_TASK_GRAPHS
* INFORMATION_SCHEMA.TASK_HISTORY
* INFORMATION_SCHEMA:CURRENT_TASK_GRAPHS
* INFORMATION_SCHEMA:COMPLETE_TASK_GRAPHS

Previously:
:   `run_id` is unique for the current task/graph run prior to retry.

Currently:
:   `run_id` may not be a unique identifier for the current task/graph run prior to retry.
    You may use `graph_run_group_id` column as a replacement for this column.

Also refer to [Query and task history views and functions: New columns](bcr-1147.md) for additional related new columns.

Ref: 1279

---
title: Task views, functions, and commands: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2212.md
section: Release Notes
---

# Task views, functions, and commands: New columns in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the following tasks related commands, views, and functions include new columns.

## SHOW TASKS and DESCRIBE TASK commands: New column

When this behavior change bundle is enabled, the output of the [SHOW TASKS](../../../sql-reference/sql/show-tasks.md) and
[DESCRIBE TASK](../../../sql-reference/sql/desc-task.md) commands includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| CREATED_BY_USER | VARCHAR | The name of the user who created the task. |

> **Note:**
>
> * This column is populated for new tasks only and is not backfilled for existing tasks.
> * For normal tasks, this is the user who directly created the task. For tasks created within other tasks
>   or Snowpark Container Services (SPCS) that execute with system accounts, the behavior differs: for nested tasks, this is the
>   user who created the outermost task; for SPCS-based jobs, this is the user who created the SPCS
>   service or job.

## Task history and task graph views and functions: New column

When this behavior change bundle is enabled, the following views and functions include a new column:

* Account Usage views:

  + [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md)
  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/account-usage/complete_task_graphs.md)
* Information Schema table functions:

  + [TASK_HISTORY](../../../sql-reference/functions/task_history.md)
  + [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md)
  + [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md)

| Column name | Data type | Description |
| --- | --- | --- |
| SCHEDULED_BY_USER | VARCHAR | The name of the user triggering the task to run. |

Ref: 2212

---
title: TASK_HISTORY function: Consistent FAILED state for timed-out tasks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1681.md
section: Release Notes
---

# TASK_HISTORY function: Consistent FAILED state for timed-out tasks

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

The TASK_HISTORY function behaves as follows:

Before the change:
:   The tasks that are timed out have inconsistent status in task history. The states for these tasks are sometimes `FAILED`, and sometimes `CANCELED`.

After the change:
:   The timed-out tasks always have a `FAILED` state in task history.

Ref: 1681

---
title: TASK_HISTORY Table Function (Information Schema): Consistent Values for Failed and Auto-suspended Tasks in ERROR_ONLY Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-990.md
section: Release Notes
---

# TASK_HISTORY Table Function (Information Schema): Consistent Values for Failed and Auto-suspended Tasks in ERROR_ONLY Output

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

Auto suspended tasks which fail will be in state FAILED_AND_AUTO_SUSPENDED across all calls of the INFORMATION_SCHEMA.TASK_HISTORY table function:

Previously:
:   Calling TASK_HISTORY with ERROR_ONLY = true with auto-suspended tasks returns state = FAILED.

Currently:
:   Calling TASK_HISTORY with ERROR_ONLY = true with auto-suspended tasks returns state = FAILED_AND_AUTO_SUSPENDED.

Ref: 990

---
title: TASK_HISTORY View (Account Usage): Change to Status for Failed and Auto-suspended Tasks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-899.md
section: Release Notes
---

# TASK_HISTORY View (Account Usage): Change to Status for Failed and Auto-suspended Tasks

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

The Account Usage [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md) view has changed as follows when examining a
task that has been auto-suspended by hitting its SUSPEND_TASK_AFTER_NUM_FAILURES limit:

Previously:
:   The task displayed the state `FAILED`.

Currently:
:   The task displays the state `FAILED_AND_AUTO_SUSPENDED`.

Ref: 899

---
title: TASK_HISTORY view (ACCOUNT_USAGE): task usage history restricted to 1 year
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1806.md
section: Release Notes
---

# TASK_HISTORY view (ACCOUNT_USAGE): task usage history restricted to 1 year

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

With this behavior change, the TASK_HISTORY view in the ACCOUNT_USAGE schema behaves as follows:

Before the change:
:   With the TASK_HISTORY view, you can retrieve the history of task usage longer than 1 year.

After the change:
:   With the TASK_HISTORY view, you can retrieve the history of task usage only within 1 year.

Ref: 1806

---
title: TASK_HISTORY views: New column in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2182.md
section: Release Notes
---

# TASK_HISTORY views: New column in output

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the following TASK_HISTORY views and table function
include a new column:

* [INFORMATION_SCHEMA](../../../sql-reference/info-schema.md) table function:

  + [TASK_HISTORY](../../../sql-reference/functions/task_history.md)
* [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) view:

  + [TASK_HISTORY](../../../sql-reference/account-usage/task_history.md)
* [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) view:

  + [TASK_HISTORY](../../../sql-reference/organization-usage/task_history.md)

| Column name | Data type | Description |
| --- | --- | --- |
| SPCS_JOB_ID | NUMBER | The identifier of the Snowpark Container Services job service that the task spawned in its root query, or NULL if the task’s root query did not spawn any job service. The identifier corresponds to the SERVICE_ID column in [ACCOUNT_USAGE.SERVICES](../../../sql-reference/account-usage/services.md). |

Ref: 2182

---
title: TASK_VERSIONS view (ACCOUNT_USAGE): New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1882.md
section: Release Notes
---

# TASK_VERSIONS view (ACCOUNT_USAGE): New columns

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the ACCOUNT_USAGE.TASK_VERSIONS view includes the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| CONFIG | TEXT | The graph level configuration if set for the root task, otherwise displays NULL. |
| TASK_RELATIONS | ARRAY | The JSON array of any predecessor tasks—that is, any tasks specified in the `AFTER` parameter in the task definition. When run successfully to completion, these tasks trigger the current task. Individual task names in the array are fully-qualified. They include the container database and schema names. |
| TARGET_COMPLETION_INTERVAL | TEXT | The desired task completion time as specified using the `TARGET_COMPLETION_INTERVAL` parameter in the task definition. |
| SUCCESS_INTEGRATION | TEXT | The name of the notification integration used to communicate with Amazon SNS, MS Azure Event Grid, or Google Pub/Sub. |
| SCHEDULING_MODE | TEXT | Reserved for future use. |

Ref: 1882

---
title: Tasks: Automatically suspend failed task runs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1412.md
section: Release Notes
---

# Tasks: Automatically suspend failed task runs

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

The behavior of the parameter [SUSPEND_TASK_AFTER_NUM_FAILURES](../../../sql-reference/parameters.md) is as follows:

Before the change:
:   The parameter SUSPEND_TASK_AFTER_NUM_FAILURES is set to 0 by default. That is, the failed task runs are not suspended by default. To automatically suspend failed task runs, you need to set up the parameter to a value greater than 0.

After the change:
:   The parameter SUSPEND_TASK_AFTER_NUM_FAILURES is set to 10 by default. That is, the task runs are automatically suspended after 10 consecutive task runs either fail or time out. You can resume the failed tasks at any time after the suspension. This change applies to all new and existing tasks. You can override the setting to 0 or another number of failures by setting the parameter at the account, database, schema, or task level.

    To view the tasks that are automatically suspended, check the STATE column for FAILED_AND_AUTO_SUSPENDED tasks in the TASK_HISTORY [TASK_HISTORY](../../../sql-reference/functions/task_history.md) table function (Information Schema) or [TASK_HISTORY view](../../../sql-reference/account-usage/task_history.md) TASK_HISTORY view (Account_Usage).

    It’s recommended that you fix the failed task runs before resuming them.

Ref: 1412

---
title: Tasks: Graph completion time differs from final task
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1251.md
section: Release Notes
---

# Tasks: Graph completion time differs from final task

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

Table function [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md) and [COMPLETE_TASK_GRAPHS](../../../sql-reference/account-usage/complete_task_graphs.md)
view now both include details on completion times.

Task and task graph completion times behave as follows for a given task graph:

Previously:
:   COMPLETED_TIME is the completion time of the last task run in the graph.

Currently:
:   COMPLETED_TIME is the time when the graph run is completed. This value can be greater than the completion time of the last task run due to tracking overhead.

Ref: 1251

---
title: Tasks: New BACKFILL_INFO column in views
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_08/bcr-1375.md
section: Release Notes
---

# Tasks: New BACKFILL_INFO column in views

> **Attention:**
>
> This behavior change is in the 2023_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_08_bundle.md).

This behavior change affects the output of the following Information Schema table functions and Account Usage views:

* [TASK_HISTORY](../../../sql-reference/functions/task_history.md) (Information Schema)
* [CURRENT_TASK_GRAPHS](../../../sql-reference/functions/current_task_graphs.md) (Information Schema)
* [COMPLETE_TASK_GRAPHS](../../../sql-reference/functions/complete_task_graphs.md) (Information Schema)
* [TASK_HISTORY view](../../../sql-reference/account-usage/task_history.md) (Account Usage)
* [COMPLETE_TASK_GRAPHS view](../../../sql-reference/account-usage/complete_task_graphs.md) (Account Usage)

Before the change:
:   The output of the aforementioned Information Schema table functions and Account Usage views does not include the BACKFILL_INFO column.

After the change:
:   The output of the aforementioned Information Schema table functions and Account Usage views includes the BACKFILL_INFO column.

    | Column name | Data type | Description |
    | --- | --- | --- |
    | BACKFILL_INFO | OBJECT | Reserved for future use. The returned value for all rows is NULL. |

Ref: 1375

---
title: Tasks: New Column in Views and SQL Command Output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1080.md
section: Release Notes
---

# Tasks: New Column in Views and SQL Command Output

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

The CONFIG column were added to the following views in the ACCOUNT_USAGE and INFORMATION_SCHEMA schemas and the
output of the following SQL commands:

* ACCOUNT_USAGE

  + COMPLETE_TASK_GRAPHS
  + TASK_HISTORY
* INFORMATION_SCHEMA

  + CURRENT_TASK_GRAPHS
  + COMPLETE_TASK_GRAPHS
  + TASK_HISTORY
* DESCRIBE TASK
* SHOW TASKS

Added column:

| Column Name | Data Type | Description |
| --- | --- | --- |
| CONFIG | TEXT | JSON string representing the configuration for the task. |

Ref: 1080

---
title: Tasks: Reducing the number of SKIPPED tasks
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1710.md
section: Release Notes
---

# Tasks: Reducing the number of SKIPPED tasks

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

For tasks with a user-specified schedule, the tasks run as follows:

Before the change:
:   A task runs based on the user-specified schedule, for example, every 1 minute. The task completes with a SKIPPED state if there is no new data in the stream within the specified time interval (for example, 1 minute). In this case, the SCHEDULED_FROM column of the TASK_HISTORY view is SCHEDULE.

After the change:
:   A task still follows the user-specified schedule but only executes when there is new data in the stream. For example, a user has a task with a schedule to run every minute (SCHEDULE = ‘1 m’) and a stream_has_data WHEN condition. The task checks the stream a minute after the last task start time, and only executes when there is new data. In this case, the SCHEDULED_FROM column of the TASK_HISTORY view is TRIGGER. The task is executed approximately every 12 hours to prevent the stream from going stale.

    If you set up monitoring or alerts to check your task run states, we recommend adjusting your monitoring to account for this behavior change because the tasks in the SKIPPED state are significantly reduced.

    > **Note:**
    >
    > Even though skipped tasks are significantly reduced, there might still occasionally be tasks that are skipped. Tasks might also be skipped for other reasons, such as when a task is designed to periodically check for new data in a stream, and no new data is present.

Ref: 1710

---
title: Telemetry: Event table attribute name and value changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_07/bcr-1668.md
section: Release Notes
---

# Telemetry: Event table attribute name and value changes

> **Attention:**
>
> This behavior change is in the 2024_07 bundle.

For the current status of the bundle, refer to [Bundle History](../2024_07_bundle.md).

When this behavior change bundle is enabled, some telemetry data in the event table and in Snowsight changes when a
function or procedure handler is written in Python.

Before the change:
:   In the event table when the handler is written in Python, the following is true:

    * In the RECORD column for a `SPAN` RECORD_TYPE, the `name` attribute’s value is a fixed value such as
      `snow.auto_instrumented`.
    * The RECORD column for a `SPAN` RECORD_TYPE can contain a `snowflake.max_memory_usage_bytes` attribute.

    In Snowsight, in the Query Profile view, the `snowflake.max_memory_usage_bytes` value is listed as
    Python sandbox max memory usage.

After the change:
:   In the event table when the handler is written in Python, the following is true:

    > * In the RECORD column for a `SPAN` RECORD_TYPE, the `name` attribute’s value identifies the handler for the function or
    >   procedure that emitted the data. This varies by executable type:
    >
    >   + Procedure — handler function name
    >   + User-defined function (UDF) — handler function name
    >   + User-defined table function (UDTF) — handler class name
    >   + Client code — name of the client-side API that began the span
    > * The RECORD column for a `SPAN` RECORD_TYPE can contain a `snow.process.memory.usage.max` attribute renamed from `snowflake.max_memory_usage_bytes`.

    In Snowsight, in the Query Profile view, the `snow.process.memory.usage.max` value is listed as Max Python process memory usage.

    Similarly, in the GET_QUERY_OPERATOR_STATS function’s output, the OPERATOR_STATISTICS column’s `Python sandbox max memory usage nested key`
    has been renamed to `Max Python process memory usage`.

Ref: 1668

---
title: Telemetry: Event table attribute name and value changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1767.md
section: Release Notes
---

# Telemetry: Event table attribute name and value changes

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, some telemetry data in an [event table](../../../developer-guide/logging-tracing/event-table-setting-up.md)
will change when a function or procedure handler is written in Java or Scala.

Before the change:
:   In the event table when the handler is written in Java or Scala, the following is true:

    * In the [RECORD column](../../../developer-guide/logging-tracing/event-table-columns.md) for a `SPAN` RECORD_TYPE, the `name` attribute’s value
      is a fixed value such as `snow.auto_instrumented`.

After the change:
:   In the event table when the handler is written in Java or Scala, the following is true:

    * In the [RECORD column](../../../developer-guide/logging-tracing/event-table-columns.md) for a `SPAN` RECORD_TYPE, the `name` attribute’s
      value identifies the handler for the function or procedure that emitted the data. This varies by executable type:

      + Procedure — handler function name
      + User-defined function (UDF) — handler function name
      + User-defined table function (UDTF) — handler class name
      + Client code — name of the client-side API that began the span

Ref: 1767

---
title: Temporary Tables: Changes to Table Creation in Schemas
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-427.md
section: Release Notes
---

# Temporary Tables: Changes to Table Creation in Schemas

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

The behavior of the creating temporary table in a schema behaves as follows:

Previously:
:   You can create a temporary table that has the same name as the table in the same schema without having any privileges on the table itself.

Currently:
:   To create a temporary table with the same name as the table in a schema, the role in use must be granted or inherit a role that has the OWNERSHIP privilege on the table.

This change is to prevent naming conflicts with permanent tables in the same schema. Note that you should update your scripts that create temporary tables with the same name as the table in the same schema. You can either rename the temporary table or grant the OWNERSHIP privilege for the table to the role used to create the temporary table.

Ref: 427

---
title: Temporary Tables: Changes to Table Creation in Schemas (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_01/bcr-934.md
section: Release Notes
---

# Temporary Tables: Changes to Table Creation in Schemas (Pending)

> **Attention:**
>
> This behavior change is in the 2023_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_01_bundle.md).

## Streams: CREATE STREAM and CREATE | ALTER VIEW Propagate CHANGE_TRACKING Parameter to Underlying Objects

[CREATE STREAM](../../../sql-reference/sql/create-stream.md), [CREATE VIEW](../../../sql-reference/sql/create-view.md), and [ALTER VIEW](../../../sql-reference/sql/alter-view.md) now
propagate CHANGE_TRACKING = TRUE to associated underlying tables and views, and fails accordingly when insufficient permissions are
encountered:

Previously:
:   * CREATE STREAM … CHANGE_TRACKING = TRUE statements attempted to enable CHANGE_TRACKING recursively on base tables, and they failed if
      insufficient permissions were encountered.
    * ALTER/CREATE VIEW … CHANGE_TRACKING = TRUE statements attempted to enable CHANGE_TRACKING recursively on base views and tables of
      the target view. If insufficient permissions were encountered, the statement should have failed, but didn’t.

Currently:
:   CREATE STREAM and ALTER/CREATE VIEW now correctly propagate CHANGE_TRACKING = TRUE to the underlying database objects. Both
    types of statements fail if permissions are insufficient for any part of the base object tree.

> **Note:**
>
> With CREATE STREAM, if sufficient permissions aren’t available, base tables and views are left unchanged. With ALTER/CREATE VIEW, base
> objects are left unchanged, but the view itself has CHANGE_TRACKING enabled.

Ref: 934

---
title: Time Travel: Data Retention Disabled for a Database Created from a Share
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-945.md
section: Release Notes
---

# Time Travel: Data Retention Disabled for a Database Created from a Share

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

When a [data consumer](../../../user-guide/data-share-consumers.md) creates a local database from a provider share, that database inherits the
default [Time Travel](../../../user-guide/data-time-travel.md) retention period (i.e. the [DATA_RETENTION_TIME_IN_DAYS](../../../sql-reference/parameters.md) parameter
setting). If the database is dropped, its historical data is saved for the specified data retention period. However, once a database
created from a share is dropped, it cannot be recovered from Time Travel (e.g. undropped) regardless of the data retention period setting.

This behavior has changed as follows:

Previously:
:   When a database created from a share was dropped, its historical data was saved for the retention period specified by the inherited
    value of the DATA_RETENTION_TIME_IN_DAYS parameter. However Time Travel could not be used to recover the database (e.g. the database
    could not be undropped) regardless of the data retention period.

Currently:
:   When a database is created from a share, the DATA_RETENTION_TIME_IN_DAYS parameter value is 0.

Ref: 945

---
title: Tracing: Span and trace IDs propagated from parent to child through procedure calls
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_06/bcr-1683.md
section: Release Notes
---

# Tracing: Span and trace IDs propagated from parent to child through procedure calls

> **Attention:**
>
> This behavior change is in the 2024_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_06_bundle.md).

In [event tracing](../../../developer-guide/logging-tracing/tracing.md), span and trace IDs behave as follows:

Before the change:
:   The `trace_id` of each of the spans created by chained stored procedures or UDFs is unique when the handler is written in
    JavaScript or Snowflake Scripting.

    The `parent_span_id` field does not exist in the RECORD column of the event table.

    Native apps providers and consumers see different `trace_id` values for shared events. The provider sees the hashed version.

After the change:
:   This change adds trace ID propagation support for handlers written in JavaScript and Snowflake Scripting. This already exists for
    handlers written in Python, Java, and Scala. The new behavior applies to stored procedures and UDFs whose handlers are written in
    JavaScript, as well as to stored procedure handlers written in Snowflake Scripting.

    Spans generated by chained stored procedures or UDFs with JavaScript or Snowflake Scripting handlers have the same `trace_id`.
    The RECORD column may have a `parent_span_id` attribute.

    Spans generated by chained stored procedures or UDFs with JavaScript or Snowflake Scripting handlers have a parent-child relationship
    between `parent_span_id` and `span_id`. Stored procedures with JavaScript or Snowflake Scripting handlers can call other
    stored procedures in a chain of any length. UDFs can’t execute SQL statements, so calling a UDF ends the chain. However, the trace info
    is still propagated to the UDF’s spans.

    If the stored procedure or UDF with a JavaScript or Snowflake Scripting handler was called by the user directly (the root), then the
    `trace_id` will be a random ID and there will be no `parent_span_id`. If tracing is disabled for a stored procedure and it
    calls another stored procedure or UDF, then the `trace_id` of the child’s spans will be random and they will have no
    `parent_span_id`. In other words, the trace is restarted at the child.

    Native apps providers and consumers see the same `trace_id` for shared JavaScript or Snowflake Scripting stored procedure or UDF
    events, so they can be debugged more easily.

This change is being made to improve debugging.

Ref: 1683

---
title: Transforming data during a load: Disallow using MATCH_BY_COLUMN_NAME with a SELECT statement
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_02/bcr-1514.md
section: Release Notes
---

# Transforming data during a load: Disallow using MATCH_BY_COLUMN_NAME with a SELECT statement

> **Attention:**
>
> This behavior change is in the 2024_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_02_bundle.md).

Using COPY with MATCH_BY_COLUMN_NAME is still a recommended approach. This behavior change only fixes a corner case where the behavior of transforming data during a load is undefined.

Use of the MATCH_BY_COLUMN_NAME copy option and a SELECT statement for transforming data during a load behaves as follows:

Before the change:
:   You are allowed to use the MATCH_BY_COLUMN_NAME copy option with a SELECT statement for transforming data during a load in certain cases. However, these cases may result in undefined behavior.

After the change:
:   You are not allowed to use the MATCH_BY_COLUMN_NAME copy option with a SELECT statement for transforming data during a load in all cases. These two options can still be used separately, but cannot be used together. Any attempt to do so will result in the following error: `SQL compilation error: match_by_column_name is not supported with copy transform`.

Ref: 1514

---
title: UDFs: Functions with Handler Code That Reads Files from a Stage Execute in the Owner’s Context
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_03/bcr-1008.md
section: Release Notes
---

# UDFs: Functions with Handler Code That Reads Files from a Stage Execute in the Owner’s Context

> **Attention:**
>
> This behavior change is in the 2023_03 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_03_bundle.md).

UDFs whose handler code reads files from a stage will execute in the owner’s context.
A caller must pass the file location as a scoped URL, whether the handler code is in a UDF or procedure and regardless of whether
they are running with caller’s or owner’s rights:

Previously:
:   When Java code in a UDF handler [reads a file from a stage](../../../user-guide/unstructured-data-java.md), it does so using the caller’s context.

    A caller may pass the file’s location to the function using a form such as `select my_function('@stage-name/filename.txt').`

    Handler code receives the stage URL and reads the file with `SnowflakeFile.newInstance` and `SnowflakeFile.getInputStream` or
    receives the file as a `java.io.InputStream`:

    ```java
    SnowflakeFile sfFile = SnowflakeFile.newInstance(file_url);
    InputStream is = sfFile.getInputStream();
    ```

    Files read by handler code may be on an external stage or on a user or named internal stage.

Currently:
:   When Java code in a UDF handler reads a file from a stage, it will do so using the owner’s context.

    For both UDFs and procedures, a caller must pass the file’s location in a scoped URL by using the [BUILD_SCOPED_FILE_URL](../../../sql-reference/functions/build_scoped_file_url.md) function, as in the following function example:

    `select my_func(build_scoped_file_url(@my_stage, 'filename.txt'));`

    Handler code receives the scoped URL and reads the file as before.

    ```java
    SnowflakeFile sfFile = SnowflakeFile.newInstance(scopedFileUrl);
    InputStream is = sfFile.getInputStream();
    ```

    For files whose locations are specified within handler code (not as scoped URLs passed in by a caller), you can read the staged file in one of two ways: handler code can call the newInstance method with a boolean value for a new requireScopedUrl parameter; or handler code must use a scoped URL when creating an InputStream from the file path.

    The following example uses `SnowflakeFile.newInstance`:

    ```java
    String filename = "@my_stage/filename.txt";
    SnowflakeFile sfFile = SnowflakeFile.newInstance(filename, require_scoped_url = false);
    ```

    In addition, user stages are no longer supported as locations for files read by a handler. Files read by handler code must be on an
    external stage or on a named internal stage.

Ref: 1008

---
title: UDFs: Input to JavaScript UDTFs Grouped Into a Single Partition
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-595.md
section: Release Notes
---

# UDFs: Input to JavaScript UDTFs Grouped Into a Single Partition

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

For a JavaScript UDTF with an empty OVER clause, the behavior of the grouping of input rows is as follows:

Previously:
:   In a JavaScript UDTF with an empty OVER clause, input is partitioned as if the OVER clause is absent. In contrast, documentation states that
    input rows belong to a single partition.

Currently:
:   JavaScript Table function input rows will be grouped into a single partition. This is as previously documented.

    To discover impact from this change, confirm whether you’ve used a JavaScript UDTF with an explicit empty OVER clause and whether the results
    have more per-partition rows than you expect (there should be one or zero such rows).

Ref: 595

---
title: UDTFs: Default column names updated for Python vectorized UDTFs
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1275.md
section: Release Notes
---

# UDTFs: Default column names updated for Python vectorized UDTFs

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

This change only affects users of Vectorized Python UDTFs (user-defined table functions).

Previously:

The default column names for the input DataFrame to a vectorized UDTF are the indices, such as 0,1,2, … etc.

Currently:

The default column names for the input DataFrame to a vectorized UDTF match the signature of the SQL function.
Using numeric indices by default will no longer work and you will get a “Key Error”.

The column names will follow the [SQL identifier requirements](../../../sql-reference/identifiers-syntax.md).
Namely, if an identifier is unquoted it will be capitalized, and if it’s double quoted it will be preserved as it is.

For details, see [Vectorized Python UDTFs](../../../developer-guide/udf/python/udf-python-tabular-vectorized.md).

Ref: 1275

---
title: Unbundled behavior changes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/unbundled-behavior-changes.md
section: Release Notes
---

# Unbundled behavior changes

Unbundled behavior changes are not associated or released with a behavior change bundle.

To help you manage your operations and minimize disruption to your Snowflake service, we document behavior changes that may impact your usage,
including:

* Recently implemented changes that were previously pending/disabled, were not part of a behavior change bundle, and cannot be disabled.
* Upcoming pending changes that will not be part of a behavior change bundle and cannot be enabled in advance.
* [Canceled behavior changes](unbundled-cancelled-behavior-changes.md) that have been removed from BCR bundles and will not be implemented.

If you have questions about any of these behavior changes, please feel free to contact [Snowflake Support](https://docs.snowflake.com/user-guide/contacting-support).

## Recently implemented changes

This table lists behavior changes that were implemented independently of a behavior change bundle.
These changes include certain types of changes to Snowflake clients (connectors, drivers, etc.), platforms, and libraries. Such changes cannot be disabled.

See also Upcoming pending changes.

| Release Date | Functional Area | Implemented Behavior Change | Additional Notes |
| --- | --- | --- | --- |
| **April 1, 2026** | Cortex Model Updates for April | [Cortex model deprecations for April 2026](bcr-april-model-deprecations.md) |  |
| **Week of March 16, 2026** | Document AI decommission | [Document AI decommission](bcr-2156.md) |  |
| **Weeks of December 2, 2025 (early adopters) and January 7, 2026 (late adopters)** | Snowsight Templates learning environment | [Snowsight Templates learning environment](bcr-1992.md) | This change is being rolled out gradually. |
| **Week of November 17, 2025** | Data quality monitoring | [Data quality: DATA_METRIC_USER database role granted to the PUBLIC role](bcr-2155.md) |  |
| **Week of November 10, 2025** | Native Apps | [Snowflake Native Apps: Changes to restrictions on version name, setup file name](bcr-2169.md) |  |
| **Week of November 10, 2025** | Security | [Authentication for local applications: Built-in security integration for Snowflake OAuth](bcr-2056.md) | This change is being rolled out gradually to all accounts. |
| **Week of August 4, 2025** | Data quality monitoring | [Data quality: DATA_QUALITY_MONITORING_LOOKUP application role granted to the PUBLIC role](bcr-2068.md) |  |
| **Week of June 23, 2025** | Snowflake Native Apps | [Snowflake Native Apps: Changes to privileges commonly used by apps](bcr-1952.md) |  |
| **Week of March 17, 2025** | Data Lineage | [Data Lineage: VIEW LINEAGE privilege granted to the PUBLIC role](bcr-1933.md) |  |
| **Week of March 3, 2025** | Document AI | [Document AI: CREATE MODEL privilege required to create, publish, and train model builds](bcr-1904.md) |  |
| **Week of February 24, 2025** | Virtual Private Cloud IDs | [Amazon Virtual Private Cloud ID for external stage, external function, and external volume](bcr-vpc-change-2025-02-03.md) |  |
| **Week of January 20th, 2025** | Secure objects | [Secure objects: Redaction of information in error messages](bcr-1858.md) |  |
| **Week of June 17th, 2024** | HTTP Error Codes | [Change in HTTP error code for URL not found error](bcr-1669.md) |  |
| **May 10, 2024** | Snowflake Cortex ML Functions Changes | [Cortex ML Functions - New column in single-series Forecasting and Anomaly Detection results](bcr-cortex-forecast-anomaly-detection-series-column.md) |  |
| **March 26, 2024** | SQL Changes — Organization Usage Views | [Organization Usage: Updated billing views](bcr-1584.md) |  |
| **March 04-05, 2024** | Data Pipelines: Dynamic Tables | [Dynamic tables: Changes to ACCOUNT_USAGE.TABLES and INFORMATION_SCHEMA.TABLES](bcr-account-usage-and-info-schema-changes.md) |  |

For additional released, but archived, unbundled BCRs see: [Archived implemented unbundled behavior changes](unbundled-behavior-changes-implemented-archive.md).

## Upcoming pending changes

The following table lists unbundled behavior changes that are pending.
Pending unbundled behavior changes aren’t released yet.

> **Important:**
>
> All information in this table, including planned versions and dates, is subject to change; the information is provided only as a guideline
> for any updates you must make to accommodate the changes.
>
> If a link isn’t provided to the individual pending behavior change, the release in which the bundle was introduced hasn’t started or is
> still in progress.

| Planned Release | Functional Area | Pending behavior change | Additional notes |
| --- | --- | --- | --- |
| Pending | Snowpark Python | [PYPI_REPOSITORY_USER database role granted to the PUBLIC role](bcr-2280.md) | This change will be rolled out gradually. |
| To be determined | Data pipelines | [Completed rollout of BYTES_BILLED column in history views (Pending)](bcr-2241.md) | None |
| This change is planned to start with the 2026_04 behavior change bundle, which is tentatively scheduled for May 2026. | Behavior Changes | [New columns in views and SHOW command output are no longer treated as behavior changes (Pending)](bcr-no-bcrs-for-new-columns.md) |  |
| Pending | Security | [Wider variety of Certificate Authorities and shorter certificate lifetimes](bcr-2255.md) |  |
| This change is planned for February 2026. | Network connectivity | [GCP PSC propagated connection limit set to 0](bcr-2193.md) |  |
|  | Virtual Warehouses | [Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Postponed)](bcr-2113.md) | This behavior change was originally in the 2025_07 bundle. It has been postponed and a new release date has not been determined. |
| This change is planned for March 2026. | SQL Changes - General | [SQL general: New default column sizes for string and binary data types (Postponed)](bcr-2118.md) |  |
| This change is planned for October 2025. | Snowflake Native Apps: Decommissioning of Python versions 3.8 and 3.9 | [Snowflake Native Apps: Deprecation of Python versions 3.8 and 3.9 (Pending)](bcr-2072.md) | Snowflake Native Apps will no longer support any decommissioned versions of Python when Python version 3.9 is deprecated in October, 2025. For more information on Snowflake’s Python runtime support policy, see [Snowflake Python Runtime Support](../../../developer-guide/python-runtime-support-policy.md). |
| This change will occur gradually. | Worksheets to Workspaces upgrade | [Defaulting accounts from Worksheets to Workspaces](bcr-2117.md) | This change will roll out gradually beginning the week of September 22, 2025. |
| This change is planned for September 2025. | New VNET subnet IDs required for rules that filter based on subnet ID | [Azure access: New VNET subnet IDs required for rules that filter based on subnet ID (Pending)](bcr-1955-2078.md) |  |
| This change will occur gradually across all AWS regions from January 5 - January 31 2025. | Change of Certificate Authority & OCSP Allowlist for AWS Customers | [Change of Certificate Authority and OCSP Allowlist for AWS Customers](bcr-1657.md) |  |
| This change will occur gradually across all regions from October - November 2024 (Azure & GCP), February 2025 (AWS). | TLS Cipher Suite Requirements | [Changes in TLS Cipher Suite Requirements](bcr-1727.md) |  |
|  | Snowflake CLI, Connectors, Drivers, and SQL API Changes | [SnowSQL- Change to the value of the sql_split property (Pending)](bcr-792.md) |  |
|  | Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns. | [Account Usage and Information Schema views: Changes to DATA_TYPE output for string columns (Postponed)](bcr-1960.md) | This behavior change was originally in the 2025_03 bundle and intended to become enabled by default in the 2025_04 bundle. However, it has been postponed and a new release date has not been determined.  This change is not available for testing. |
|  | SHOW FUNCTIONS and SHOW PROCEDURES commands. | [SHOW FUNCTIONS and SHOW PROCEDURES commands: The complete data type for arguments is displayed in output (Postponed)](bcr-1944.md) | This behavior change was originally in the 2025_03 bundle and intended to become enabled by default in the 2025_04 bundle. However, it has been postponed and a new release date has not been determined.  This change is not available for testing. |
|  | Data Loading / Unloading Changes. | [Data loading, data unloading, and file staging DML commands: Single-character pattern matches (Postponed)](bcr-209969.md) | This change was originally planned for February, 2021; however, it has been postponed and a new release date has not been determined.  This change is not available for testing. |
|  | Information Schema: TABLE_PRIVILEGES view. | [TABLE_PRIVILEGES View: Update GRANTOR column value in the consumer account (Postponed)](bcr-1321.md) | This behavior change was originally planned for **September, 2023**; however, it has been postponed and a new release date has not been determined.  This change is not available for testing. |
|  | Cloning: Table history. | [Cloning: Table history not preserved on clone (Postponed)](../2023_07/bcr-908.md) | This behavior change was originally in the 2023_07 bundle and intended to become enabled by default in the 2023_08 bundle. However, it has been postponed and a new release date has not been determined.  This change is not available for testing. |

---
title: Update the default Streamlit version for Snowflake Native Apps
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_01/bcr-1857.md
section: Release Notes
---

# Update the default Streamlit version for Snowflake Native Apps

> **Attention:**
>
> This behavior change is in the 2025_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_01_bundle.md).

When this behavior change bundle is enabled, the default version of a Streamlit used by the Snowflake Native App Framework will be updated
to Streamlit version 1.35.

Before this change:
:   The default version of Streamlit used by a Snowflake Native App is version 1.22

After this change:
:   The default version of Streamlit used by a Snowflake Native App is version 1.35

This change only affects apps that do not define a specific Streamlit version. If a provider uses the
`environment.yml` file to control dependency versions, the Streamlit version used by an app can be
pinned to a specific version by setting `streamlit=1.22.0`. If a version is not specified in the
`environment.yml` file, then the default version of Streamlit is used.

This change only affects new versions or patches of an app. Existing apps will continue to work on either the previous
default Streamlit version (1.22) or the version specified in the `environment.yml` file of the app.

This change only affects Streamlit apps used within a Snowflake Native App. Streamlit apps outside of an app are
not affected. The default version for Streamlit in Snowflake was already updated to Streamlit version 1.35.

## Test the new version of Streamlit in your app

If you plan to use Streamlit 1.35, ensure that you test your app to verify that it is compatible with this version:

> 1. Update the Streamlit version in the `environment.yml` file of the app to 1.35. Do not make any other changes.
> 2. Install the app.
> 3. Test the Streamlit app and make any necessary changes to the Streamlit python source files.

If you discover any issues with the updated version, you can specify the previous version of Streamlit in the
`environment.yml` file.

Ref: 1857

---
title: USAGE_IN_CURRENCY_DAILY View (Org Usage): New Usage Types
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_02/bcr-965.md
section: Release Notes
---

# USAGE_IN_CURRENCY_DAILY View (Org Usage): New Usage Types

> **Attention:**
>
> This behavior change is in the 2023_02 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_02_bundle.md).

The following new usage types are now possible values in the USAGE_TYPE column in the
[USAGE_IN_CURRENCY_DAILY](../../../sql-reference/organization-usage/usage_in_currency_daily.md) view in the ORGANIZATION_USAGE schema:

* `priority support`: Indicates how much was charged for priority support services in a given month. This charge is associated with a
  stipulation in a contract, not with an account.
* `support credit`: Indicates that Snowflake Support credited the account to reverse charges attributed to an issue in Snowflake.
  Represents credits applied to the account for a given month.

Ref: 965

---
title: Use primary role for authorizing view and materialized view creation (Canceled)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-1782.md
section: Release Notes
---

# Use primary role for authorizing view and materialized view creation (Canceled)

> **Attention:**
>
> This BCR is canceled and removed from the [2024_08 Bundle](../2024_08_bundle.md).

When this behavior change bundle is enabled, the creation of a view or materialized view behaves as follows:

Before the change:
:   When creating a view or materialized view, the primary and all secondary roles are considered when checking the privileges of the SQL
    definition.

After the change:
:   When creating a view or materialized view, only the primary role is considered when checking the privileges of the SQL definition.

Ref: 1782

---
title: User types: TYPE property is set to PERSON instead of NULL
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_05/bcr-2067.md
section: Release Notes
---

# User types: TYPE property is set to PERSON instead of NULL

> **Attention:**
>
> This behavior change is in the 2025_05 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_05_bundle.md).

There are different [types of Snowflake users](../../../user-guide/admin-user-management.md), where each type corresponds to a value of the user
object’s TYPE property. Users of type NULL were always treated as if they were of type PERSON, and this change does not alter
that behavior. This behavior change is about whether the user type can be NULL.

Before the change:
:   A user of type NULL is treated as a user of type PERSON. The user type can be NULL in the following situations:

    * The TYPE property isn’t set or is set to NULL when running a CREATE USER command.
    * Someone runs an ALTER USER … SET TYPE=NULL command.
    * Someone runs an ALTER USER … UNSET TYPE command.
    * A SCIM request doesn’t specify the `type` attribute or sets it to NULL.

After the change:
:   The TYPE property of a new user object can’t be NULL, and you can’t set the TYPE property of an existing user to NULL. This is enforced
    by the following behavior:

    * If you run a CREATE USER command without setting the TYPE property, it is set to PERSON.
    * If you run an ALTER USER … UNSET TYPE command, the TYPE property is set to PERSON.
    * If you run an ALTER USER … SET TYPE=NULL command, the TYPE property is set to PERSON.
    * If you send a SCIM POST request to create a user, and the `type` attribute is unspecified or NULL, the TYPE property is set to
      PERSON.
    * If you send a SCIM PATCH request with a `replace` operation that specifies the `type` attribute as NULL, the TYPE property
      doesn’t change.
    * If you send a SCIM PUT request with a `replace` operation, and the `type` attribute is unspecified or NULL, the TYPE
      property is set to PERSON.
    * If you send a SCIM PATCH request with a `remove` operation that unsets the `type` attribute, the TYPE property doesn’t
      change.

Ref: 2067

---
title: Users and Groups: Changes to Initial Replication
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1044.md
section: Release Notes
---

# Users and Groups: Changes to Initial Replication

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

If you are using a replication or failover group to replicate USERS and/or ROLES from a source account to a target account, and there are
existing users/roles in that account that were created by means other than replication, the initial refresh operation of these object types
results in deleting all users and/or roles in the target account.

If the initial refresh deletes users and/or roles in the target account, it can result in data and metadata loss:

* If USERS are included in the OBJECT_TYPES list for the replication or failover group:

  + Worksheets are lost
  + Query history is lost
* If USERS are included in the OBJECT_TYPES list, but ROLES is not:

  + Privilege grants to users are lost
* If ROLES are included in the OBJECT_TYPES list:

  + Privilege grants to share objects are lost

In a future release, this behavior will change as follows:

Previously:
:   If a replication or failover group includes USERS and/or ROLES in the OBJECT_TYPES list, the initial refresh operation of these object types
    results in dropped users and/or roles in the target account.

Currently:
:   If a replication or failover group includes USERS and/or ROLES in the OBJECT_TYPES list, the initial refresh operation of these object
    types will fail with an error message if:

    * There are existing users and/or roles in the target account that match objects with the same name in the source account and
    * USERS and/or ROLES have not previously been replicated to the target account

    The error message will prompt the user with two options:

    * Force the refresh operation and allow any existing users and/or roles in the target account to be deleted. Users/roles in the source account
      will be recreated in the target account.

      To force a refresh, you would execute the following statement:

      ```sqlexample
      ALTER ( { FAILOVER | REPLICATION } ) GROUP <rg_name> REFRESH FORCE;
      ```
    * Link the account objects by name: users and/or roles with the same name in the target account and the source account will be linked. The
      users/roles in the target account that are linked will not be deleted.

      To link account objects by name, execute the following statement:

      ```sqlexample
      SELECT SYSTEM$LINK_ACCOUNT_OBJECTS_BY_NAME('<rg_name>');
      ```

    > **Note:**
    >
    > Any user/role in the target account that does not have a matching object in the source account with the same name is dropped.

For more information, refer to [Apply global IDs to objects created by scripts in target accounts](../../../user-guide/account-replication-config.md).

Ref: 1044

---
title: USERS and QUERY_HISTORY views (ACCOUNT_USAGE) and QUERY_HISTORY function: New columns
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1771.md
section: Release Notes
---

# USERS and QUERY_HISTORY views (ACCOUNT_USAGE) and QUERY_HISTORY function: New columns

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the [USERS](../../../sql-reference/account-usage/users.md) and [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) views and the output of the [QUERY_HISTORY](../../../sql-reference/functions/query_history.md) function include the following new columns.

## USERS view (ACCOUNT_USAGE)

When the [user](../../../sql-reference/account-usage/users.md) TYPE is SNOWFLAKE_SERVICE, indicating it is a [service user](../../../developer-guide/snowpark-container-services/spcs-execute-sql.md), the following new columns provide the service’s database and schema information. Otherwise, these columns are null.

| Column name | Type | Description |
| --- | --- | --- |
| DATABASE_NAME | VARCHAR | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it is NULL. |
| DATABASE_ID | NUMBER | When the user TYPE is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL. |
| SCHEMA_NAME | VARCHAR | When the user type is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it is NULL. |
| SCHEMA_ID | NUMBER | When the user type is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema; otherwise, it’s NULL. |

## QUERY_HISTORY view (ACCOUNT_USAGE)

When the [QUERY_HISTORY](../../../sql-reference/account-usage/query_history.md) USER_TYPE is SNOWFLAKE_SERVICE, indicating the query is executed by a Snowpark Container Services service, the other columns provide the service’s database and schema information.

| Column name | Type | Description |
| --- | --- | --- |
| USER_TYPE | VARCHAR | Specifies the type of the user executing the query. It is the same as the `type` field on the user entity (see [USERS view](../../../sql-reference/account-usage/users.md)). If a Snowpark Container Services service executes the query, the user type is SNOWFLAKE_SERVICE. |
| USER_DATABASE_NAME | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |
| USER_DATABASE_ID | NUMBER | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s database; otherwise, it’s NULL. |
| USER_SCHEMA_NAME | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |
| USER_SCHEMA_ID | NUMBER | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the internal, Snowflake-generated identifier for the service’s schema, otherwise it’s NULL. |

## QUERY_HISTORY function

The following new columns are added to the output (see [QUERY_HISTORY function](../../../sql-reference/functions/query_history.md)). They identify the user type (USER_TYPE). When the USER_TYPE is SNOWFLAKE_SERVICE, the other columns identify the service’s database and schema.

| Column name | Type | Description |
| --- | --- | --- |
| USER_TYPE | VARCHAR | Specifies the type of the user executing the query. It’s the same as the `type` field on the user entity (see [USERS view](../../../sql-reference/account-usage/users.md)). If a Snowpark Container Services service executes the query, the user type is SNOWFLAKE_SERVICE. |
| USER_DATABASE_NAME | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s database name; otherwise, it’s NULL. |
| USER_SCHEMA_NAME | VARCHAR | When the value in the `user_type` column is SNOWFLAKE_SERVICE, it specifies the service’s schema name; otherwise, it’s NULL. |

## Examples

The service’s database and schema information, along with the user name (which, for the SNOWFLAKE_SERVICE [user](../../../sql-reference/account-usage/users.md) `type`, is also the service name) can help during debugging. For example, you can write a query to retrieve information about the service that executed the queries. The following are some example queries:

**Example 1:** Fetch queries run by a service.

```sqlexample
SELECT query_history.*
FROM snowflake.account_usage.query_history
WHERE user_type = 'SNOWFLAKE_SERVICE'
AND user_name = '<service-name>'
AND user_database_name = '<service-db-name>'
AND user_schema_name = '<service-schema-name>'
order by start_time;
```

In the WHERE clause:

* `user_name = '<service-name>'` because a service executes queries as the [service user](../../../developer-guide/snowpark-container-services/spcs-execute-sql.md), and the service user’s name is the same as the service name.
* `user_type = 'SNOWFLAKE_SERVICE'` and `user_name = '<service-name>'` retrieve only queries executed by a service.
* `user_database_name` and `user_schema_name` names, for a service user, are the service’s database and schema.

You can get the same results by querying the information_schema.query_history function.

```sqlexample
SELECT *
FROM TABLE(<any-user-db-name>.information_schema.query_history())
WHERE user_database_name = '<service-db-name>'
AND user_schema_name = '<service-schema-name>'
AND user_type = 'SNOWFLAKE_SERVICE'
AND user_name = '<service-name>'
order by start_time;
```

In the WHERE clause:

* `user_type = 'SNOWFLAKE_SERVICE'` and `user_name = '<service-name>'` retrieves only queries executed by a service.
* `user_database_name` and `user_schema_name` names, for a service user, are the service’s database and schema.

**Example 2:** Fetch queries run by services and corresponding service information.

```sqlexample
SELECT query_history.*, services.*
FROM snowflake.account_usage.query_history
JOIN snowflake.account_usage.services
ON query_history.user_name = services.service_name
AND query_history.user_schema_id = services.service_schema_id
AND query_history.user_type = 'SNOWFLAKE_SERVICE'
```

The query joins the QUERY_HISTORY and SERVICES views to retrieve information about the queries and services that executed the queries. Note the following:

* For queries run by services, the `query_history.user_name` is the service user’s name, which is the same as the service name.
* The query joins the views using the schema IDs (not schema name) to ensure you refer to the same schema, because if you drop and recreate a schema, the schema ID changes but the name remains the same.

You can add optional filters to the query. For example:

* Filter `query_history` to retrieve only services that executed specific queries.
* Filter `services` to retrieve only queries executed by specific services.

**Example 3:** For every service, fetch service user information.

```sqlexample
SELECT services.*, users.*
FROM snowflake.account_usage.users
JOIN snowflake.account_usage.services
ON users.name = services.service_name
AND users.schema_id = services.service_schema_id
AND users.type = 'SNOWFLAKE_SERVICE'
```

The query join SERVICES and USERS views in the ACCOUNT_USAGE schema to retrieve services and service user information. Note the following:

* When a service runs queries, it runs the queries as service user and the service user’s name is the same as the service name. Therefore, you specify the join condition: `users.name = services.service_name`.
* Service names are unique only within a schema. Therefore, the query specifies the join condition (`users.schema_id = services.service_schema_id`) to ensure each service user is matched against the specific service they belong to (and not any other same-named service running in different schemas).

Ref: 1771

---
title: USERS view (Account Usage / Organization Usage): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_02/bcr-2217.md
section: Release Notes
---

# USERS view (Account Usage / Organization Usage): New column

> **Attention:**
>
> This behavior change is in the 2026_02 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_02_bundle.md).

When this behavior change bundle is enabled, the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md).[USERS](../../../sql-reference/account-usage/users.md) and [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md).[USERS](../../../sql-reference/organization-usage/users.md) views include the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| `PROVISIONED_BY` | VARCHAR | Whether or not this user was provisioned by SCIM. Possible values:   * `SCIM`: The user was provisioned by SCIM. * `NULL`: The user was provisioned using another method, such as SQL or Snowsight. |

Ref: 2217

---
title: USERS view (ACCOUNT_USAGE and ORGANIZATION_USAGE): New columns and changes to has_mfa column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2102.md
section: Release Notes
---

# USERS view (ACCOUNT_USAGE and ORGANIZATION_USAGE): New columns and changes to `has_mfa` column

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

> **Note:**
>
> This behavior change was part of the 2025_04 behavior change bundle, but the change has been postponed to the 2025_06 bundle.

When this behavior change bundle is enabled, the [ACCOUNT_USAGE USERS view](../../../sql-reference/account-usage/users.md) and the
[ORGANIZATION_USAGE USERS view](../../../sql-reference/organization-usage/users.md) include the following new columns:

| Column name | Data type | Description |
| --- | --- | --- |
| `has_pat` | BOOLEAN | If TRUE, a [programmatic access token (PAT)](../../../user-guide/programmatic-access-tokens.md) has been generated for the user. |
| `has_workload_identity` | BOOLEAN | If TRUE, the user is configured to use [workload identity federation](../../../user-guide/workload-identity-federation.md) to authenticate with Snowflake. |

> **Note:**
>
> These new columns appear between the `default_secondary_role` column and the `type` column.
>
> If you have SELECT statements that use column positions to select specific columns, you need to update those SELECT statements
> to use the new column positions.

In addition, the `has_mfa` column now supports the newly introduced [MFA methods](../../../user-guide/security-mfa.md) (TOTP and
passkeys). If the user is [enrolled in any MFA method](../../../user-guide/security-mfa-second-factor.md), `has_mfa` is TRUE.

Ref: 2102

---
title: USERS view: New column is_from_organization_user
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_06/bcr-2105.md
section: Release Notes
---

# USERS view: New column `is_from_organization_user`

> **Attention:**
>
> This behavior change is in the 2025_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_06_bundle.md).

When this behavior change bundle is enabled, the USERS view in the [ACCOUNT_USAGE](../../../sql-reference/account-usage.md) and [ORGANIZATION_USAGE](../../../sql-reference/organization-usage.md) schemas includes the following new
column:

| Column name | Data type | Description |
| --- | --- | --- |
| `is_from_organization_user` | BOOLEAN | If TRUE, the user was imported from an [organization user](../../../user-guide/organization-users.md). |

Ref: 2105

---
title: VARIANT data: Casting some of the fixed numeric values to floating-point results in different approximate values
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2025_07/bcr-2106.md
section: Release Notes
---

# VARIANT data: Casting some of the fixed numeric values to floating-point results in different approximate values

> **Attention:**
>
> This behavior change is in the 2025_07 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2025_07_bundle.md).

When this bundle is enabled and you extract a [fixed numeric value](../../../sql-reference/data-types-numeric.md) from a
[VARIANT value](../../../sql-reference/data-types-semistructured.md) and cast the numeric value to a
[floating-point type](../../../sql-reference/data-types-numeric.md), the resulting value might be different.

This behavior change occurs in cases when the cast is narrow — in other words, when the cast type cannot hold the value. The
cast might return a different approximate value that is closer to the original value.

For example, when the fixed numeric value `264208207.9326154526043824` in a VARIANT column is cast to FLOAT, the resulting value
before the behavior change is different than the value after the behavior change:

* Original numeric value (before casting to FLOAT):

  ```none
  264208207.9326154526043824
  ```
* Numeric value before the behavior change (after casting to FLOAT):

  ```none
  264208207.93261543
  ```
* Numeric value after the behavior change (after casting to FLOAT):

  ```none
  264208207.93261546
  ```

This behavior change is being introduced to make the casting result consistent with casting a numeric value that is not in a
VARIANT value.

Ref: 2106

---
title: Versioned schemas: Disallow policy assignments across schemas
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1453.md
section: Release Notes
---

# Versioned schemas: Disallow policy assignments across schemas

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of versioned schemas, masking policies, and row access policies in a Snowflake Native App is as follows:

Before the change:
:   A policy assignment can occur when you create a:

    * Policy in the versioned schema and assign the policy to a table or view in a different schema.
    * Policy in a non-versioned schema and assign the policy to a table or view in a versioned schema.
    * Table or view in the versioned schema and assign a policy to the table or view when the policy exists in a non-versioned schema.
    * Table or view in a non-versioned schema and assign a policy to the tables or views when the policy exists in a versioned schema.

After the change:
:   The policy assignment fails with the following error message:

    ```output
    A POLICY in a versioned schema can only be assigned to the objects in the same schema. An object in a versioned schema can only have a POLICY assigned that is defined in the same schema.
    ```

    To prevent the error message, the Snowflake Native App provider should update the setup script to ensure that the policy is set on
    objects within the same schema as the policy when a versioned schema contains either the policy or the object on which the policy is set.
    If a non-versioned schema contains either the policy or the object on which the policy is set, it is not necessary to update the setup
    script.

    If you are a consumer of the Snowflake Native App and you see this error message after installing the app, ask the provider to update
    their set up script.

Ref: 1453

---
title: Versioned schemas: Disallow tag propagation
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_01/bcr-1401.md
section: Release Notes
---

# Versioned schemas: Disallow tag propagation

> **Attention:**
>
> This behavior change is in the 2024_01 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_01_bundle.md).

The behavior of versioned schemas and tags in a Snowflake Native App is as follows:

Before the change:
:   Tag propagation in a Snowflake Native App occurs with a versioned schema under the following conditions:

    * Create a tag in the versioned schema and assign the tag to an object in a different schema.
    * Create a tag in a non-versioned schema and assign the tag to an object in a versioned schema.
    * Create tables or views in the versioned schema and assign a tag to the tables or views when the tag exists in a non-versioned schema.
    * Create tables or views in a non-versioned schema and assign a tag to the tables or views when the tag exists in a versioned schema.

    In these cases, tag propagation refers to tags being set on objects when the versioned schema increments to the next version.

After the change:
:   Tag propagation does not occur with versioned schemas in a Snowflake Native App and results in the following error message:

    ```output
    A TAG in a versioned schema can only be assigned to the objects in the same schema. An object in a versioned schema can only have a TAG assigned that is defined in the same schema.
    ```

    To prevent the error message, the Snowflake Native App provider should update the setup script to ensure that tags are set on objects within the
    same schema as the tag when a versioned schema contains either the tag or the object on which the tag is set. If a non-versioned schema
    contains either the tag or the object on which the tag is set, it is not necessary to update the setup script.

    If you are a consumer of the Snowflake Native App and you see this error message after installing the app, ask the provider to update their set
    up script.

Ref: 1401

---
title: VIEWS View (Information Schema): New Columns in View
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_04/bcr-1127.md
section: Release Notes
---

# VIEWS View (Information Schema): New Columns in View

> **Attention:**
>
> This behavior change is in the 2023_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_04_bundle.md).

Snowflake has added two columns, LAST_DDL and LAST_DDL_BY, to the VIEWS view in the INFORMATION_SCHEMA schema. These
columns record the timestamp of the last DDL operation performed on each view and the username of the user who performed it.

| Column Name | Data Type | Description |
| --- | --- | --- |
| LAST_DDL | TIMESTAMP_LTZ | Timestamp of the last DDL operation performed on the table or view, including database refreshes for replication (if the refresh changes visible properties for the table/view). |
| LAST_DDL_BY | TEXT | Username of the user who performed the last DDL operation on the table or view. |

Ref: 1127

---
title: WAREHOUSE_EVENTS_HISTORY view (ACCOUNT_USAGE): New columns and changes to events
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_04/bcr-1616.md
section: Release Notes
---

# WAREHOUSE_EVENTS_HISTORY view (ACCOUNT_USAGE): New columns and changes to events

> **Attention:**
>
> This behavior change is in the 2024_04 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_04_bundle.md).

When this behavior change bundle is enabled, the Account Usage [WAREHOUSE_EVENTS_HISTORY view](../../../sql-reference/account-usage/warehouse_events_history.md)
changes as follows.

## New columns

The following new columns are added to the view:

| Column Name | Data Type | Description |
| --- | --- | --- |
| SIZE | VARCHAR | Current size of the warehouse at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |
| CLUSTER_COUNT | NUMBER | Number of warehouse clusters at the time of the event. This value is only available for WAREHOUSE_CONSISTENT events. Otherwise, this value is NULL. |

These columns are added as the last (right-most) columns in the view.

## New event: WAREHOUSE_CONSISTENT

Events that create a warehouse, change the size of the warehouse or the number of clusters, or suspend a warehouse, are not atomic
operations. This means that some small amount of time is required for these operations to fully complete.

For example, if a warehouse is suspended using an ALTER WAREHOUSE … SUSPEND statement, any queries that are currently executing on the
warehouse must complete (or time out) before it can be suspended. In some cases, multiple warehouse events might be in-flight (for example,
resize and suspend). When all warehouse events have completed, the warehouse is in a consistent state.

| EVENT_NAME | Description |
| --- | --- |
| WAREHOUSE_CONSISTENT | This state is reached when any pending changes to a warehouse are completed. |

## Changes to events

When this behavior change bundle is enabled, the event state PARTIALLY_COMPLETED is obsoleted.

If an event is logged with the STARTED state, it is never logged with the COMPLETED state. Instead, an event logged with the STARTED
state is always followed by a subsequent WAREHOUSE_CONSISTENT event. If multiple warehouse events are logged with the STARTED event
state, those events coalesce to the same WAREHOUSE_CONSISTENT event.

If an event is logged with the COMPLETED state, no subsequent WAREHOUSE_CONSISTENT event follows unless another pending
event is logged with a STARTED state.

For example, an ALTER statement is logged with the COMPLETED state when there are no additional changes pending:

```sqlexample
ALTER WAREHOUSE my_wh SET
  COMMENT = 'Updated comment for warehouse';
```

This statement results in the following row in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-26 16:42:13.513 +0000 | MY_WH | ALTER_WAREHOUSE | COMPLETED | NULL | NULL |

When an ALTER statement changes the warehouse size, a resize warehouse event follows:

```sqlexample
ALTER WAREHOUSE my_wh SET
  WAREHOUSE_SIZE = 'SMALL';
```

This statement results in the following rows in the WAREHOUSE_EVENTS_HISTORY view:

| TIMESTAMP | WAREHOUSE_NAME | EVENT_NAME | EVENT_STATE | SIZE | CLUSTER_COUNT |
| --- | --- | --- | --- | --- | --- |
| 2024-04-26 16:42:35.694 +0000 | MY_WH | ALTER_WAREHOUSE | STARTED | NULL | NULL |
| 2024-04-26 16:42:35.694 +0000 | MY_WH | RESIZE_WAREHOUSE | STARTED | NULL | NULL |
| 2024-04-26 16:42:35.773 +0000 | MY_WH | WAREHOUSE_CONSISTENT | COMPLETED | SMALL | 1 |

Note that the WAREHOUSE_CONSISTENT event might share the same timestamp with another warehouse event and be listed out of order.

The possible values for the EVENT_STATE column for warehouse events changes as follows:

| EVENT_NAME | EVENT_STATE before the change | EVENT_STATE after the change |
| --- | --- | --- |
| CREATE_WAREHOUSE | COMPLETED | COMPLETED if no further changes are pending, otherwise STARTED.  In most cases, followed by a RESUME_WAREHOUSE event. |
| DROP_WAREHOUSE | COMPLETED | COMPLETED if no further changes are pending, otherwise STARTED.  In most cases, followed by a SUSPEND_WAREHOUSE event. |
| ALTER_WAREHOUSE | COMPLETED | COMPLETED if no further changes are pending, otherwise STARTED if the ALTER statement results in an eventually consistent event to follow. |
| RESIZE_WAREHOUSE | STARTED, . COMPLETED, . PARTIALLY_COMPLETED | STARTED |
| RESUME_WAREHOUSE | STARTED, . COMPLETED, . PARTIALLY_COMPLETED | STARTED |
| SUSPEND_WAREHOUSE | STARTED, . COMPLETED, . PARTIALLY_COMPLETED | STARTED |
| RESUME_CLUSTER | COMPLETED | STARTED |
| SUSPEND_CLUSTER | COMPLETED | STARTED |
| RESIZE_CLUSTER | COMPLETED | STARTED |
| SPINUP_CLUSTER | STARTED, . COMPLETED, . PARTIALLY_COMPLETED | STARTED |
| SPINDOWN_CLUSTER | STARTED, . COMPLETED, . PARTIALLY_COMPLETED | STARTED |

Ref: 1616

---
title: WAREHOUSE_EVENTS_HISTORY view: Change to the CLUSTER_NUMBER column output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-warehouse-events-history-cluster-number.md
section: Release Notes
---

# WAREHOUSE_EVENTS_HISTORY view: Change to the CLUSTER_NUMBER column output

The output of the [WAREHOUSE_EVENTS_HISTORY](../../../sql-reference/account-usage/warehouse_events_history.md) view behaves as follows:

Before the change:
:   The default value of the CLUSTER_NUMBER column is `0`.

After the change:
:   The default value of the CLUSTER_NUMBER column is `1`.

Ref: n/a

---
title: WAREHOUSE_EVENTS_HISTORY view: New columns in output
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1770.md
section: Release Notes
---

# WAREHOUSE_EVENTS_HISTORY view: New columns in output

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the [Account Usage](../../../sql-reference/account-usage.md)
[WAREHOUSE_EVENTS_HISTORY view](../../../sql-reference/account-usage/warehouse_events_history.md) includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| WAREHOUSE_TYPE | TEXT | One of `STANDARD` or `SNOWPARK-OPTIMIZED`. |
| RESOURCE_CONSTRAINT | TEXT | One of:   * `MEMORY_1X` * `MEMORY_1X_x86` * `MEMORY_16X` * `MEMORY_16X_x86` * `MEMORY_64X` * `MEMORY_64X_x86` |

> **Note:**
>
> `WAREHOUSE_TYPE` and `RESOURCE_CONSTRAINT` are only populated for `EVENTNAME=WAREHOUSE_CONSISTENT` events, otherwise `NULL` is displayed.

Ref: 1770

---
title: WAREHOUSE_METERING_HISTORY view (ACCOUNT_USAGE): New column
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2024_08/bcr-1714.md
section: Release Notes
---

# WAREHOUSE_METERING_HISTORY view (ACCOUNT_USAGE): New column

> **Attention:**
>
> This behavior change is in the 2024_08 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2024_08_bundle.md).

When this behavior change bundle is enabled, the [Account Usage](../../../sql-reference/account-usage.md) [WAREHOUSE_METERING_HISTORY view](../../../sql-reference/account-usage/warehouse_metering_history.md) includes the following new column:

| Column name | Data type | Description |
| --- | --- | --- |
| CREDITS_ATTRIBUTED_COMPUTE_QUERIES | NUMBER | Number of credits attributed to queries in the hour.  Includes only the credit usage for query execution and doesn’t include usage for any warehouse idle time. |

## Usage notes

* Latency for this column might be up to six hours.
* Warehouse idle time is not included in the CREDITS_ATTRIBUTED_COMPUTE_QUERIES column. See Examples for a query that calculates the cost of idle time.

## Examples

For example, to determine the cost of idle time for each warehouse for the last 10 days, execute the following statement:

```sqlexample
SELECT (
   SUM(credits_used_compute) -
   SUM(credits_attributed_compute_queries)
) AS idle_cost,
    warehouse_name
 FROM SNOWFLAKE.ACCOUNT_USAGE.WAREHOUSE_METERING_HISTORY
    WHERE start_time >= DATEADD('days', -10, CURRENT_DATE())
          AND end_time < CURRENT_DATE()
    GROUP BY warehouse_name;
```

Ref: 1714

---
title: Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Pending)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2026_03/bcr-2269.md
section: Release Notes
---

# Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Pending)

> **Attention:**
>
> This behavior change is in the 2026_03 bundle.
>
> For the current status of the bundle, refer to [Bundle history](../2026_03_bundle.md).

This behavior change applies to the
[Query Acceleration Service (QAS)](../../../user-guide/query-acceleration-service.md)
feature in combination with Gen1, [Gen2](../../../user-guide/warehouses-gen2.md), and
[multi-cluster](../../../user-guide/warehouses-multicluster.md) warehouses.

Warehouse creation for standard warehouses behaves as follows:

Before the change:
:   QAS is disabled by default when creating new Gen1, Gen2, and multi-cluster warehouses.
    When QAS is enabled, the default maximum scaling factor is 8.

After the change:
:   QAS is enabled by default when creating new Gen2 and multi-cluster warehouses.
    It’s still disabled by default for single-cluster Gen1 warehouses.
    The default maximum scaling factor is 2 when Snowflake automatically enables QAS for Gen2
    and multi-cluster warehouses. The default is still 8 for Gen1 warehouses, or when you explicitly
    enable QAS in the CREATE WAREHOUSE statement.

Altering an existing warehouse does **not** enable QAS by default or turn it off if it’s already
enabled. For example, the following operations don’t enable QAS:

* Altering a single-cluster warehouse to multi-cluster does **not** enable QAS.
* Altering the minimum or maximum cluster count does **not** enable QAS.
* Altering the multi-cluster warehouse scaling policy from economy to standard (or vice versa) does **not** enable QAS.
* Altering a Gen1 warehouse to Gen2 does **not** enable QAS.

And the following operations don’t turn QAS off if it’s already enabled:

* If you alter a multi-cluster warehouse with QAS enabled and change it back to single-cluster,
  QAS remains enabled.
* If you alter a Gen2 warehouse with QAS enabled and change it back to Gen1, QAS remains enabled.

This change is being made to enable more optimization opportunities and improve query performance
for standard warehouses that are typically used for high-volume and performance-critical workloads.

Ref: 2269

---
title: Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Postponed)
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2113.md
section: Release Notes
---

# Warehouses: Enable QAS by default for newly created Gen2 and multi-cluster warehouses (Postponed)

> **Note:**
>
> This behavior change was part of the 2025_07 bundle, but the change has been postponed. The change will be introduced in a future bundle.
> The change is not available for testing.

This behavior change applies to the
[Query Acceleration Service (QAS)](../../../user-guide/query-acceleration-service.md)
feature in combination with Gen1, [Gen2](../../../user-guide/warehouses-gen2.md), and
[multi-cluster](../../../user-guide/warehouses-multicluster.md) warehouses.

Warehouse creation for standard warehouses behaves as follows:

Before the change:
:   QAS is disabled by default when creating new Gen1, Gen2, and multi-cluster warehouses.
    When QAS is enabled, the default maximum scaling factor is 8.

After the change:
:   QAS is enabled by default when creating new Gen2 warehouses and multi-cluster warehouses (including Gen1).
    It remains disabled by default for single-cluster Gen1 warehouses.
    The default maximum scaling factor is 2 when Snowflake automatically enables QAS for Gen2
    and multi-cluster warehouses. The default is still 8 for Gen1 warehouses, or when you explicitly
    enable QAS in the CREATE WAREHOUSE statement.

Altering an existing warehouse does **not** enable QAS by default or turn it off if it’s already
enabled. For example, the following operations don’t enable QAS:

* Altering a single-cluster warehouse to multi-cluster does **not** enable QAS.
* Altering the minimum or maximum cluster count does **not** enable QAS.
* Altering the multi-cluster warehouse scaling policy from economy to standard (or vice versa) does **not** enable QAS.
* Altering a Gen1 warehouse to Gen2 does **not** enable QAS.

And the following operations don’t turn QAS off if it’s already enabled:

* If you alter a multi-cluster warehouse with QAS enabled and change it back to single-cluster,
  QAS remains enabled.
* If you alter a Gen2 warehouse with QAS enabled and change it back to Gen1, QAS remains enabled.

This change is being made to enable more optimization opportunities and improve query performance
for standard warehouses that are typically used for high-volume and performance-critical workloads.

Ref: 2113

---
title: Wider variety of Certificate Authorities and shorter certificate lifetimes
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/un-bundled/bcr-2255.md
section: Release Notes
---

# Wider variety of Certificate Authorities and shorter certificate lifetimes

> **Note:**
>
> This change is pending.

As part of Snowflake’s continued commitment to providing best-in-class transport-layer-security (TLS)
we are introducing Let’s Encrypt and Google Trust Services as additional Certificate Authority (CA)
providers alongside our existing providers. You may see certificates issued by these CAs when connecting
to endpoints owned by Snowflake. We are also investing in shortening our certificate lifetime to meet
and exceed upcoming CA/Browser Forum standards.

This unbundled change will not affect the endpoints most drivers use to connect to Snowflake, specifically
the `SNOWFLAKE_DEPLOYMENT` and `SNOWFLAKE_DEPLOYMENT_REGIONLESS` types returned by
[SYSTEM$ALLOWLIST](../../../sql-reference/functions/system_allowlist.md) and
[SYSTEM$ALLOWLIST_PRIVATELINK](../../../sql-reference/functions/system_allowlist_privatelink.md).
When the list of possible CAs for these existing endpoints is expanded, separate communications will be sent.

This unbundled change may affect any of the other domains owned by Snowflake and returned by
[SYSTEM$ALLOWLIST](../../../sql-reference/functions/system_allowlist.md) and
[SYSTEM$ALLOWLIST_PRIVATELINK](../../../sql-reference/functions/system_allowlist_privatelink.md),
such as `SNOWSIGHT_DEPLOYMENT`. You will likely see these new certificates if you access Snowflake
services through a web browser, if you access SPCS endpoints programmatically, or if you use
Snowpipe Streaming with High Performance Architecture.

A small number of Snowflake sites, including `app.snowflake.com`, were already using certificates
issued by some of these CAs prior to 2026.

## Validation

Your Operating System, Browser or Application level TLS Certificate Authority trust store must contain
the root CA certificates described at <https://letsencrypt.org/certificates/> and
<https://pki.goog/repository/>. These CA certificates are present in the default trust stores of
all major operating systems, browsers and client environments, so this migration will be transparent
and require no changes for the majority of Snowflake customers.

You can open connections to the test websites listed below to ensure that you trust certificates
issued by those CAs.

| Root CA | Test URL |
| --- | --- |
| ISRG Root X1 | <https://valid-isrgrootx1.letsencrypt.org/> |
| ISRG Root X2 | <https://valid-isrgrootx2.letsencrypt.org/> |
| GTS Root R1 | <https://good.gtsr1.demosite.pki.goog/> |
| GTS Root R2 | <https://good.gtsr2.demosite.pki.goog/> |
| GTS Root R3 | <https://good.gtsr3.demosite.pki.goog/> |
| GTS Root R4 | <https://good.gtsr4.demosite.pki.goog/> |
| GlobalSign R4 | <https://good.gsr4.demosite.pki.goog/> |

Operating system trust stores are implemented by the OS provider, and all recently patched operating
systems contain sufficient certificates in their default trust stores. Please reach out to your OS
vendor for additional assistance. We recommend that clients accept certs from any CA in the
[Mozilla CA/Included Certificates list](https://wiki.mozilla.org/CA/Included_Certificates).

Ref: 2255

---
title: WRITE/USAGE stage privileges: Directory table refreshes allowed
source: https://docs.snowflake.com/en/release-notes/bcr-bundles/2023_06/bcr-1222.md
section: Release Notes
---

# WRITE/USAGE stage privileges: Directory table refreshes allowed

> **Attention:**
>
> This behavior change is in the 2023_06 bundle.
>
> For the current status of the bundle, refer to [Bundle History](../2023_06_bundle.md).

In the current Snowflake release, privileges on internal and external stages have changed as follows:

Previously:
:   To refresh the metadata for a directory table on a stage using an [ALTER STAGE … REFRESH](../../../sql-reference/sql/alter-stage.md) statement,
    a user must be granted or inherit the OWNERSHIP privilege.

Currently:
:   Users who have been granted or who inherit the following privileges will have
    the ability to refresh the metadata for a directory table:

    * WRITE on internal stages.
    * USAGE on external stages.

Ref: 1222